diff --git a/.gitignore b/.gitignore
new file mode 100644
index 0000000..64dcd00
--- /dev/null
+++ b/.gitignore
@@ -0,0 +1,74 @@
+# Temporarily ignore CSVs until we choose which ones should be committed
+*.csv
+slurm
+archives
+output
+__pycache__
+*.pyc
+.ipynb_checkpoints
+*.html
+
+# From GitHub .gitignore Python template
+
+# Ignore large files, download manually
+# data/reference_files/
+# data/
+alignments
+baseline_models/model_checkpoints/
+outputs/intermediate_outputs/
+outputs/output_scores/
+clustal/
+processed_data/
+make_website_data_folder.py
+website_data/
+
+# Ignore vespa output
+vespa_run_directory
+
+# Ignoring slurm scripts that we use for scoring
+o2_scripts/
+slurm_scripts/
+
+# Ignoring HMM model files
+# And temporary scripts directories
+scripts/scoring_DMS_zero_shot/HMM_temp/
+scripts/scoring_DMS_zero_shot/gemme_tmp/
+**/gemme_tmp/*
+**/vespa_tmp/*
+
+
+# Optional:
+baseline_models/bash_scripts/old/
+# Slurm script outputs
+*.out
+*.err
+logs/
+
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# Jupyter Notebook
+.ipynb_checkpoints
\ No newline at end of file
diff --git a/LICENSE b/LICENSE
new file mode 100644
index 0000000..3c47051
--- /dev/null
+++ b/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2023 OATML-Markslab, Pascal Notin, Aaron Kollasch, Daniel Ritter, Lood van Niekerk
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..6b020ea
--- /dev/null
+++ b/README.md
@@ -0,0 +1,226 @@
+# ProteinGym
+
+## Table of Contents
+
+- [Overview](#overview)
+- [Fitness prediction performance](#fitness-prediction-performance)
+- [Resources](#resources)
+- [How to contribute?](#how-to-contribute)
+- [Usage and reproducibility](#usage-and-reproducibility)
+- [Acknowledgements](#acknowledgements)
+- [Releases](#releases)
+- [License](#license)
+- [Reference](#reference)
+- [Links](#links)
+
+## Overview
+
+ProteinGym is an extensive set of Deep Mutational Scanning (DMS) assays and annotated human clinical variants curated to enable thorough comparisons of various mutation effect predictors in different regimes. Both the DMS assays and clinical variants are divided into 1) a substitution benchmark which currently consists of the experimental characterisation of ~2.7M missense variants across 217 DMS assays and 2,525 clinical proteins, and 2) an indel benchmark that includes ∼300k mutants across 74 DMS assays and 1,555 clinical proteins.
+
+Each processed file in each benchmark corresponds to a single DMS assay or clinical protein, and contains the following variables:
+- mutant (str): describes the set of substitutions to apply on the reference sequence to obtain the mutated sequence (eg., A1P:D2N implies the amino acid 'A' at position 1 should be replaced by 'P', and 'D' at position 2 should be replaced by 'N'). Present in the the ProteinGym substitution benchmark only (not indels).
+- mutated_sequence (str): represents the full amino acid sequence for the mutated protein.
+- DMS_score (float): corresponds to the experimental measurement in the DMS assay. Across all assays, the higher the DMS_score value, the higher the fitness of the mutated protein. This column is not present in the clinical files, since they are classified as benign/pathogenic, and do not have continuous scores
+- DMS_score_bin (int): indicates whether the DMS_score is above the fitness cutoff (1 is fit (pathogenic for clinical variants), 0 is not fit (benign for clinical variants))
+
+Additionally, we provide two reference files for each benchmark that give further details on each assay and contain in particular:
+- The UniProt_ID of the corresponding protein, along with taxon and MSA depth category
+- The target sequence (target_seq) used in the assay
+- For the assays, details on how the DMS_score was created from the raw files and how it was binarized
+
+To download the benchmarks, please see `DMS benchmark - Substitutions` and `DMS benchmark - Indels` in the "Resources" section below.
+
+## Fitness prediction performance
+
+The [benchmarks](https://github.com/OATML-Markslab/ProteinGym/tree/main/benchmarks) folder provides detailed performance files for all baselines on the DMS and clinical benchmarks.
+
+We report the following metrics:
+- For DMS benchmarks in the zero-shot setting: Spearman, NDCG, AUC, MCC and Top-K recall
+- For DMS benchmarks in the supervised setting: Spearman and MSE
+- For clinical benchmarks: AUC
+
+Metrics are aggregated as follows:
+1. Aggregating by UniProt ID (to avoid biasing results towards proteins for which several DMS assays are available in ProteinGym)
+2. Aggregating by different functional categories, and taking the mean across those categories.
+
+These files are named e.g. `DMS_substitutions_Spearman_DMS_level.csv`, `DMS_substitutions_Spearman_Uniprot_level` and `DMS_substitutions_Spearman_Uniprot_Selection_Type_level` respectively for these different steps.
+
+For other deep dives (performance split by taxa, MSA depth, mutational depth and more), these are all contained in the `benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv` folder (resp. DMS_indels/clinical_substitutions/clinical_indels & their supervised counterparts). These files are also what are hosted on the website.
+
+We also include, as on the website, a bootstrapped standard error of these aggregated metrics to reflect the variance in the final numbers with respect to the individual assays.
+
+To calculate the DMS substitution benchmark metrics:
+1. Download the model scores from the website
+2. Run `./scripts/scoring_DMS_zero_shot/performance_substitutions.sh`
+
+And for indels, follow step #1 and run `./scripts/scoring_DMS_zero_shot/performance_substitutions_indels.sh`.
+
+### ProteinGym benchmarks - Leaderboard
+
+The full ProteinGym benchmarks performance files are also accessible via our dedicated website: https://www.proteingym.org/.
+It includes leaderboards for the substitution and indel benchmarks, as well as detailed DMS-level performance files for all baselines.
+The current version of the substitution benchmark includes the following baselines:
+
+Model name | Model type | Reference
+--- | --- | --- |
+Site Independent | Alignment-based model | [Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.](https://www.nature.com/articles/nbt.3769)
+EVmutation | Alignment-based model | [Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.](https://www.nature.com/articles/nbt.3769)
+WaveNet | Alignment-based model | [Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.](https://www.nature.com/articles/s41467-021-22732-w)
+DeepSequence | Alignment-based model | [Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.](https://www.nature.com/articles/s41592-018-0138-4)
+GEMME | Alignment-based model | [Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.](https://pubmed.ncbi.nlm.nih.gov/31406981/)
+EVE | Alignment-based model | [Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.](https://www.nature.com/articles/s41586-021-04043-8)
+Unirep | Protein language model | [Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8](https://www.nature.com/articles/s41592-019-0598-1)
+ESM-1b | Protein language model | Original model: [Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118](https://www.biorxiv.org/content/10.1101/622803v4); Extensions: [Brandes, N., Goldman, G., Wang, C.H. et al. Genome-wide prediction of disease variant effects with a deep protein language model. Nat Genet 55, 1512–1522 (2023).](https://doi.org/10.1038/s41588-023-01465-0)
+ESM-1v | Protein language model | [Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.](https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html)
+VESPA | Protein language model | [Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.](https://link.springer.com/article/10.1007/s00439-021-02411-y)
+RITA | Protein language model | [Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.](https://arxiv.org/abs/2205.05789)
+ProtGPT2 | Protein language model | [Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.](https://www.nature.com/articles/s41467-022-32007-7)
+ProGen2 | Protein language model | [Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517.](https://arxiv.org/abs/2206.13517)
+MSA Transformer | Hybrid - Alignment & PLM |[Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.](http://proceedings.mlr.press/v139/rao21a.html)
+Tranception | Hybrid - Alignment & PLM | [Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval. ICML.](https://proceedings.mlr.press/v162/notin22a.html)
+TranceptEVE | Hybrid - Alignment & PLM | [Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.](https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1)
+CARP | Protein language model | [Yang, K.K., Fusi, N., Lu, A.X. (2022). Convolutions are competitive with transformers for protein sequence pretraining.](https://doi.org/10.1101/2022.05.19.492714)
+MIF | Inverse folding | [Yang, K.K., Yeh, H., Zanichelli, N. (2022). Masked Inverse Folding with Sequence Transfer for Protein Representation Learning.](https://doi.org/10.1101/2022.05.25.493516)
+ProteinMPNN | Inverse folding | [J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.](https://www.science.org/doi/10.1126/science.add2187)
+ESM-IF1 | Inverse folding | [Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. ICML](https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html)
+ProtSSN | Hybrid - Structure & PLM | [Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability.](https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1)
+SaProt | Hybrid - Structure & PLM | [Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. ICLR](href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5)
+
+Except for the WaveNet model (which only uses alignments to recover a set of homologous protein sequences to train on, but then trains on non-aligned sequences), all alignment-based methods are unable to score indels given the fixed coordinate system they are trained on. Similarly, the masked-marginals procedure to generate the masked-marginals for ESM-1v and MSA Transformer requires the position to exist in the wild-type sequence. All the other model architectures listed above (eg., Tranception, RITA, ProGen2) are included in the indel benchmark.
+
+For clinical baselines, we used dbNSFP 4.4a as detailed in the manuscript appendix (and in `proteingym/clinical_benchmark_notebooks/clinical_subs_processing.ipynb`).
+
+## Resources
+
+To download and unzip the data, use the following template, replacing {VERSION} with the desired version number (e.g., "v1.1") and {FILENAME} with the specific file you want to download, as listed in the table below. The latest version is v1.1.
+For example, you can download & unzip the zero-shot predictions for all baselines for all DMS substitution assays as follows:
+```
+VERSION="v1.1"
+FILENAME="DMS_ProteinGym_substitutions.zip"
+curl -o ${FILENAME} https://marks.hms.harvard.edu/proteingym/ProteinGym_${VERSION}/${FILENAME}
+unzip ${FILENAME} && rm ${FILENAME}
+```
+
+Data | Size (unzipped) | Filename
+--- | --- | --- |
+DMS benchmark - Substitutions | 1.0GB | DMS_ProteinGym_substitutions.zip
+DMS benchmark - Indels | 200MB | DMS_ProteinGym_indels.zip
+Zero-shot DMS Model scores - Substitutions | 31GB | zero_shot_substitutions_scores.zip
+Zero-shot DMS Model scores - Indels | 5.2GB | zero_shot_indels_scores.zip
+Supervised DMS Model performance - Substitutions | 2.7MB | DMS_supervised_substitutions_scores.zip
+Supervised DMS Model performance - Indels | 0.9MB | DMS_supervised_indels_scores.zip
+Multiple Sequence Alignments (MSAs) for DMS assays | 5.2GB | DMS_msa_files.zip
+Redundancy-based sequence weights for DMS assays | 200MB | DMS_msa_weights.zip
+Predicted 3D structures from inverse-folding models | 84MB | ProteinGym_AF2_structures.zip
+Clinical benchmark - Substitutions | 123MB | clinical_ProteinGym_substitutions.zip
+Clinical benchmark - Indels | 2.8MB | clinical_ProteinGym_indels.zip
+Clinical MSAs | 17.8GB | clinical_msa_files.zip
+Clinical MSA weights | 250MB | clinical_msa_weights.zip
+Clinical Model scores - Substitutions | 0.9GB | zero_shot_clinical_substitutions_scores.zip
+Clinical Model scores - Indels | 0.7GB | zero_shot_clinical_indels_scores.zip
+CV folds - Substitutions - Singles | 50M | cv_folds_singles_substitutions.zip
+CV folds - Substitutions - Multiples | 81M | cv_folds_multiples_substitutions.zip
+CV folds - Indels | 19MB | cv_folds_indels.zip
+
+Then we also host the raw DMS assays (before preprocessing)
+
+Data | Size (unzipped) | Link
+--- | --- | --- |
+DMS benchmark: Substitutions (raw) | 500MB | substitutions_raw_DMS.zip
+DMS benchmark: Indels (raw) | 450MB | indels_raw_DMS.zip
+Clinical benchmark: Substitutions (raw) | 58MB | substitutions_raw_clinical.zip
+Clinical benchmark: Indels (raw) | 12.4MB | indels_raw_clinical.zip
+
+## How to contribute?
+
+### New assays
+If you would like to suggest new assays to be part of ProteinGym, please raise an issue on this repository with a `new_assay' label. The criteria we typically consider for inclusion are as follows:
+1. The corresponding raw dataset needs to be publicly available
+2. The assay needs to be protein-related (ie., exclude UTR, tRNA, promoter, etc.)
+3. The dataset needs to have insufficient number of measurements
+4. The assay needs to have a sufficiently high dynamic range
+5. The assay has to be relevant to fitness prediction
+
+### New baselines
+If you would like new baselines to be included in ProteinGym (ie., website, performance files, detailed scoring files), please follow the following steps:
+1. Submit a PR to our repo with two things:
+ - A new subfolder under proteingym/baselines named with your new model name. This subfolder should include a python scoring script similar to [this script](https://github.com/OATML-Markslab/ProteinGym/blob/main/proteingym/baselines/rita/compute_fitness.py), as well as all code dependencies required for the scoring script to run properly
+ - An example bash script (e.g., under scripts/scoring_DMS_zero_shot) with all relevant hyperparameters for scoring, similar to [this script](https://github.com/OATML-Markslab/ProteinGym/blob/main/scripts/scoring_DMS_zero_shot/scoring_RITA_substitutions.sh)
+2. Raise an issue with a 'new model' label, providing instructions on how to download relevant model checkpoints for scoring, and reporting the performance of your model on the relevant benchmark using our performance scripts (e.g., [for zero-shot DMS benchmarks](https://github.com/OATML-Markslab/ProteinGym/blob/main/proteingym/performance_DMS_benchmarks.py)). Please note that our DMS performance scripts correct for various biases (e.g., number of assays per protein family and function groupings) and thus the resulting aggregated performance is not the same as the arithmetic average across assays.
+
+At this point we are only considering new baselines satisfying the following conditions:
+1. The model is able to score all mutants in the relevant benchmark (to ensure all models are compared exactly on the same set of mutants everywhere);
+2. The corresponding model is open source (we should be able to reproduce scores if needed).
+
+At this stage, we are only considering requests for which all model scores for all mutants in a given benchmark (substitution or indel) are provided by the requester; but we are planning on regularly scoring new baselines ourselves for methods with wide adoption by the community and/or suggestions with many upvotes.
+
+### Notes
+12 December 2023: The code for training and evaluating supervised models is currently shared in https://github.com/OATML-Markslab/ProteinNPT. We are in the process of integrating the code into this repo.
+
+## Usage and reproducibility
+
+If you would like to compute all performance metrics for the various benchmarks, please follow the following steps:
+1. Download locally all relevant files as per instructions above (see Resources)
+2. Update the paths for all files downloaded in the prior step in the [config script](https://github.com/OATML-Markslab/ProteinGym/blob/main/scripts/zero_shot_config.sh)
+3. If adding a new model, adjust the [config.json](https://github.com/OATML-Markslab/ProteinGym/blob/main/config.json) file accordingly and add the model scores to the relevant path (e.g., [DMS_output_score_folder_subs](https://github.com/OATML-Markslab/ProteinGym/blob/main/scripts/zero_shot_config.sh#L19))
+4. If focusing on DMS benchmarks, run the [merge script](https://github.com/OATML-Markslab/ProteinGym/blob/main/scripts/scoring_DMS_zero_shot/merge_all_scores.sh). This will create a single file for each DMS assay, with scores for all model baselines
+5. Run the relevant performance script (eg., [scripts/scoring_DMS_zero_shot/performance_substitutions.sh](https://github.com/OATML-Markslab/ProteinGym/blob/main/scripts/scoring_DMS_zero_shot/performance_substitutions.sh))
+
+## Acknowledgements
+
+Our codebase leveraged code from the following repositories to compute baselines:
+
+Model | Repo
+--- | ---
+UniRep | https://github.com/churchlab/UniRep
+UniRep | https://github.com/chloechsu/combining-evolutionary-and-assay-labelled-data
+EVE | https://github.com/OATML-Markslab/EVE
+GEMME | https://hub.docker.com/r/elodielaine/gemme
+ESM | https://github.com/facebookresearch/esm
+EVmutation | https://github.com/debbiemarkslab/EVcouplings
+ProGen2 | https://github.com/salesforce/progen
+HMMER | https://github.com/EddyRivasLab/hmmer
+MSA Transformer | https://github.com/rmrao/msa-transformer
+ProtGPT2 | https://huggingface.co/nferruz/ProtGPT2
+ProteinMPNN | https://github.com/dauparas/ProteinMPNN
+RITA | https://github.com/lightonai/RITA
+Tranception | https://github.com/OATML-Markslab/Tranception
+VESPA | https://github.com/Rostlab/VESPA
+CARP | https://github.com/microsoft/protein-sequence-models
+MIF | https://github.com/microsoft/protein-sequence-models
+Foldseek | https://github.com/steineggerlab/foldseek
+ProtSSN | https://github.com/tyang816/ProtSSN
+SaProt | https://github.com/westlake-repl/SaProt
+
+We would like to thank the GEMME team for providing model scores on an earlier version of the benchmark (ProteinGym v0.1), and the ProtSSN and SaProt teams for integrating their model in the ProteinGym repo.
+
+Special thanks the teams of experimentalists who developed and performed the assays that ProteinGym is built on. If you are using ProteinGym in your work, please consider citing the corresponding papers. To facilitate this, we have prepared a file (assays.bib) containing the bibtex entries for all these papers.
+
+## Releases
+
+1. [ProteinGym_v1.0](https://zenodo.org/records/13932633): Initial release.
+2. [ProteinGym_v1.1](https://zenodo.org/records/13936340): Updates to reference file, and addition of ProtSSN and SaProt baselines.
+
+## License
+This project is available under the MIT license found in the LICENSE file in this GitHub repository.
+
+## Reference
+If you use ProteinGym in your work, please cite the following paper:
+```bibtex
+@inproceedings{NEURIPS2023_cac723e5,
+ author = {Notin, Pascal and Kollasch, Aaron and Ritter, Daniel and van Niekerk, Lood and Paul, Steffanie and Spinner, Han and Rollins, Nathan and Shaw, Ada and Orenbuch, Rose and Weitzman, Ruben and Frazer, Jonathan and Dias, Mafalda and Franceschi, Dinko and Gal, Yarin and Marks, Debora},
+ booktitle = {Advances in Neural Information Processing Systems},
+ editor = {A. Oh and T. Neumann and A. Globerson and K. Saenko and M. Hardt and S. Levine},
+ pages = {64331--64379},
+ publisher = {Curran Associates, Inc.},
+ title = {ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design},
+ url = {https://proceedings.neurips.cc/paper_files/paper/2023/file/cac723e5ff29f65e3fcbb0739ae91bee-Paper-Datasets_and_Benchmarks.pdf},
+ volume = {36},
+ year = {2023}
+}
+```
+
+## Links
+- Website: https://www.proteingym.org/
+- NeurIPS proceedings: [link to abstract](https://papers.nips.cc/paper_files/paper/2023/hash/cac723e5ff29f65e3fcbb0739ae91bee-Abstract-Datasets_and_Benchmarks.html)
+- Preprint: [link to abstract](https://www.biorxiv.org/content/10.1101/2023.12.07.570727v1)
diff --git a/assays.bib b/assays.bib
new file mode 100644
index 0000000..a1fae35
--- /dev/null
+++ b/assays.bib
@@ -0,0 +1,3526 @@
+@article{sourisseau_deep_2019,
+ title = {Deep {Mutational} {Scanning} {Comprehensively} {Maps} {How} {Zika} {Envelope} {Protein} {Mutations} {Affect} {Viral} {Growth} and {Antibody} {Escape}},
+ volume = {93},
+ issn = {0022-538X, 1098-5514},
+ url = {https://journals.asm.org/doi/10.1128/JVI.01291-19},
+ doi = {10.1128/JVI.01291-19},
+ abstract = {Zika virus has recently been shown to be associated with severe birth defects. The virus’s E protein mediates its ability to infect cells and is also the primary target of the antibodies that are elicited by natural infection and vaccines that are being developed against the virus. Therefore, determining the effects of mutations to this protein is important for understanding its function, its susceptibility to vaccine-mediated immunity, and its potential for future evolution. We completely mapped how amino acid mutations to the E protein affected the virus’s ability to grow in cells in the laboratory and escape from several antibodies. The resulting maps relate changes in the E protein’s sequence to changes in viral function and therefore provide a valuable complement to existing maps of the physical structure of the protein.
+ ,
+ ABSTRACT
+ Functional constraints on viral proteins are often assessed by examining sequence conservation among natural strains, but this approach is relatively ineffective for Zika virus because all known sequences are highly similar. Here, we take an alternative approach to map functional constraints on Zika virus’s envelope (E) protein by using deep mutational scanning to measure how all amino acid mutations to the E protein affect viral growth in cell culture. The resulting sequence-function map is consistent with existing knowledge about E protein structure and function but also provides insight into mutation-level constraints in many regions of the protein that have not been well characterized in prior functional work. In addition, we extend our approach to completely map how mutations affect viral neutralization by two monoclonal antibodies, thereby precisely defining their functional epitopes. Overall, our study provides a valuable resource for understanding the effects of mutations to this important viral protein and also offers a roadmap for future work to map functional and antigenic selection to Zika virus at high resolution.
+
+ IMPORTANCE
+ Zika virus has recently been shown to be associated with severe birth defects. The virus’s E protein mediates its ability to infect cells and is also the primary target of the antibodies that are elicited by natural infection and vaccines that are being developed against the virus. Therefore, determining the effects of mutations to this protein is important for understanding its function, its susceptibility to vaccine-mediated immunity, and its potential for future evolution. We completely mapped how amino acid mutations to the E protein affected the virus’s ability to grow in cells in the laboratory and escape from several antibodies. The resulting maps relate changes in the E protein’s sequence to changes in viral function and therefore provide a valuable complement to existing maps of the physical structure of the protein.},
+ language = {en},
+ number = {23},
+ urldate = {2022-07-11},
+ journal = {Journal of Virology},
+ author = {Sourisseau, Marion and Lawrence, Daniel J. P. and Schwarz, Megan C. and Storrs, Carina H. and Veit, Ethan C. and Bloom, Jesse D. and Evans, Matthew J.},
+ editor = {Pfeiffer, Julie K.},
+ month = dec,
+ year = {2019},
+ pages = {e01291--19},
+ file = {Full Text:/Users/admin/Zotero/storage/D4BDUGQ6/Sourisseau et al. - 2019 - Deep Mutational Scanning Comprehensively Maps How .pdf:application/pdf},
+}
+
+@article{deng_deep_2012,
+ title = {Deep {Sequencing} of {Systematic} {Combinatorial} {Libraries} {Reveals} Beta-{Lactamase} {Sequence} {Constraints} at {High} {Resolution}},
+ volume = {424},
+ issn = {00222836},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0022283612007711},
+ doi = {10.1016/j.jmb.2012.09.014},
+ language = {en},
+ number = {3-4},
+ urldate = {2022-07-11},
+ journal = {Journal of Molecular Biology},
+ author = {Deng, Zhifeng and Huang, Wanzhi and Bakkalbasi, Erol and Brown, Nicholas G. and Adamski, Carolyn J. and Rice, Kacie and Muzny, Donna and Gibbs, Richard A. and Palzkill, Timothy},
+ month = dec,
+ year = {2012},
+ pages = {150--167},
+ file = {Accepted Version:/Users/admin/Zotero/storage/X5RENMCM/Deng et al. - 2012 - Deep Sequencing of Systematic Combinatorial Librar.pdf:application/pdf},
+}
+
+@article{amorosi_massively_2021,
+ title = {Massively parallel characterization of {CYP2C9} variant enzyme activity and abundance},
+ volume = {108},
+ issn = {00029297},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S000292972100269X},
+ doi = {10.1016/j.ajhg.2021.07.001},
+ language = {en},
+ number = {9},
+ urldate = {2022-07-11},
+ journal = {The American Journal of Human Genetics},
+ author = {Amorosi, Clara J. and Chiasson, Melissa A. and McDonald, Matthew G. and Wong, Lai Hong and Sitko, Katherine A. and Boyle, Gabriel and Kowalski, John P. and Rettie, Allan E. and Fowler, Douglas M. and Dunham, Maitreya J.},
+ month = sep,
+ year = {2021},
+ pages = {1735--1751},
+ file = {Full Text:/Users/admin/Zotero/storage/8DWGHK9I/Amorosi et al. - 2021 - Massively parallel characterization of CYP2C9 vari.pdf:application/pdf},
+}
+
+@article{chiasson_multiplexed_2020,
+ title = {Multiplexed measurement of variant abundance and activity reveals {VKOR} topology, active site and human variant impact},
+ volume = {9},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/58026},
+ doi = {10.7554/eLife.58026},
+ abstract = {Vitamin K epoxide reductase (VKOR) drives the vitamin K cycle, activating vitamin K-dependent blood clotting factors. VKOR is also the target of the widely used anticoagulant drug, warfarin. Despite VKOR’s pivotal role in coagulation, its structure and active site remain poorly understood. In addition, VKOR variants can cause vitamin K-dependent clotting factor deficiency or alter warfarin response. Here, we used multiplexed, sequencing-based assays to measure the effects of 2,695 VKOR missense variants on abundance and 697 variants on activity in cultured human cells. The large-scale functional data, along with an evolutionary coupling analysis, supports a four transmembrane domain topology, with variants in transmembrane domains exhibiting strongly deleterious effects on abundance and activity. Functionally constrained regions of the protein define the active site, and we find that, of four conserved cysteines putatively critical for function, only three are absolutely required. Finally, 25\% of human VKOR missense variants show reduced abundance or activity, possibly conferring warfarin sensitivity or causing disease.},
+ language = {en},
+ urldate = {2022-07-11},
+ journal = {eLife},
+ author = {Chiasson, Melissa A and Rollins, Nathan J and Stephany, Jason J and Sitko, Katherine A and Matreyek, Kenneth A and Verby, Marta and Sun, Song and Roth, Frederick P and DeSloover, Daniel and Marks, Debora S and Rettie, Allan E and Fowler, Douglas M},
+ month = sep,
+ year = {2020},
+ pages = {e58026},
+ file = {Full Text:/Users/admin/Zotero/storage/GWWNSVDK/Chiasson et al. - 2020 - Multiplexed measurement of variant abundance and a.pdf:application/pdf},
+}
+
+@article{wrenbeck_single-mutation_2017,
+ title = {Single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encoded},
+ volume = {8},
+ issn = {2041-1723},
+ url = {http://www.nature.com/articles/ncomms15695},
+ doi = {10.1038/ncomms15695},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Nature Communications},
+ author = {Wrenbeck, Emily E. and Azouz, Laura R. and Whitehead, Timothy A.},
+ month = aug,
+ year = {2017},
+ pages = {15695},
+ file = {Full Text:/Users/admin/Zotero/storage/7WF3NED2/Wrenbeck et al. - 2017 - Single-mutation fitness landscapes for an enzyme o.pdf:application/pdf},
+}
+
+@article{brenan_phenotypic_2016,
+ title = {Phenotypic {Characterization} of a {Comprehensive} {Set} of {MAPK1} /{ERK2} {Missense} {Mutants}},
+ volume = {17},
+ issn = {22111247},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S2211124716313171},
+ doi = {10.1016/j.celrep.2016.09.061},
+ language = {en},
+ number = {4},
+ urldate = {2022-07-11},
+ journal = {Cell Reports},
+ author = {Brenan, Lisa and Andreev, Aleksandr and Cohen, Ofir and Pantel, Sasha and Kamburov, Atanas and Cacchiarelli, Davide and Persky, Nicole S. and Zhu, Cong and Bagul, Mukta and Goetz, Eva M. and Burgin, Alex B. and Garraway, Levi A. and Getz, Gad and Mikkelsen, Tarjei S. and Piccioni, Federica and Root, David E. and Johannessen, Cory M.},
+ month = oct,
+ year = {2016},
+ pages = {1171--1183},
+ file = {Full Text:/Users/admin/Zotero/storage/W3B46C64/Brenan et al. - 2016 - Phenotypic Characterization of a Comprehensive Set.pdf:application/pdf},
+}
+
+@article{kozek_high-throughput_2020,
+ title = {High-throughput discovery of trafficking-deficient variants in the cardiac potassium channel {KV11}.1},
+ volume = {17},
+ issn = {15475271},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S1547527120305427},
+ doi = {10.1016/j.hrthm.2020.05.041},
+ language = {en},
+ number = {12},
+ urldate = {2022-07-11},
+ journal = {Heart Rhythm},
+ author = {Kozek, Krystian A. and Glazer, Andrew M. and Ng, Chai-Ann and Blackwell, Daniel and Egly, Christian L. and Vanags, Loren R. and Blair, Marcia and Mitchell, Devyn and Matreyek, Kenneth A. and Fowler, Douglas M. and Knollmann, Bjorn C. and Vandenberg, Jamie I. and Roden, Dan M. and Kroncke, Brett M.},
+ month = dec,
+ year = {2020},
+ pages = {2180--2189},
+ file = {Accepted Version:/Users/admin/Zotero/storage/JETSTCHG/Kozek et al. - 2020 - High-throughput discovery of trafficking-deficient.pdf:application/pdf},
+}
+
+@article{davidi_highly_2020,
+ title = {Highly active rubiscos discovered by systematic interrogation of natural sequence diversity},
+ volume = {39},
+ issn = {0261-4189, 1460-2075},
+ url = {https://onlinelibrary.wiley.com/doi/10.15252/embj.2019104081},
+ doi = {10.15252/embj.2019104081},
+ language = {en},
+ number = {18},
+ urldate = {2022-07-11},
+ journal = {The EMBO Journal},
+ author = {Davidi, Dan and Shamshoum, Melina and Guo, Zhijun and Bar‐On, Yinon M and Prywes, Noam and Oz, Aia and Jablonska, Jagoda and Flamholz, Avi and Wernick, David G and Antonovsky, Niv and Pins, Benoit and Shachar, Lior and Hochhauser, Dina and Peleg, Yoav and Albeck, Shira and Sharon, Itai and Mueller‐Cajar, Oliver and Milo, Ron},
+ month = sep,
+ year = {2020},
+ file = {Full Text:/Users/admin/Zotero/storage/PYUPVUZK/Davidi et al. - 2020 - Highly active rubiscos discovered by systematic in.pdf:application/pdf},
+}
+
+@article{sarkisyan_local_2016,
+ title = {Local fitness landscape of the green fluorescent protein},
+ volume = {533},
+ issn = {0028-0836, 1476-4687},
+ url = {http://www.nature.com/articles/nature17995},
+ doi = {10.1038/nature17995},
+ language = {en},
+ number = {7603},
+ urldate = {2022-07-11},
+ journal = {Nature},
+ author = {Sarkisyan, Karen S. and Bolotin, Dmitry A. and Meer, Margarita V. and Usmanova, Dinara R. and Mishin, Alexander S. and Sharonov, George V. and Ivankov, Dmitry N. and Bozhanova, Nina G. and Baranov, Mikhail S. and Soylemez, Onuralp and Bogatyreva, Natalya S. and Vlasov, Peter K. and Egorov, Evgeny S. and Logacheva, Maria D. and Kondrashov, Alexey S. and Chudakov, Dmitry M. and Putintseva, Ekaterina V. and Mamedov, Ilgar Z. and Tawfik, Dan S. and Lukyanov, Konstantin A. and Kondrashov, Fyodor A.},
+ month = may,
+ year = {2016},
+ pages = {397--401},
+ file = {Accepted Version:/Users/admin/Zotero/storage/KPQ2DLXG/Sarkisyan et al. - 2016 - Local fitness landscape of the green fluorescent p.pdf:application/pdf},
+}
+
+@article{olson_comprehensive_2014,
+ title = {A {Comprehensive} {Biophysical} {Description} of {Pairwise} {Epistasis} throughout an {Entire} {Protein} {Domain}},
+ volume = {24},
+ issn = {09609822},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0960982214012688},
+ doi = {10.1016/j.cub.2014.09.072},
+ language = {en},
+ number = {22},
+ urldate = {2022-07-11},
+ journal = {Current Biology},
+ author = {Olson, C. Anders and Wu, Nicholas C. and Sun, Ren},
+ month = nov,
+ year = {2014},
+ pages = {2643--2651},
+ file = {Full Text:/Users/admin/Zotero/storage/XHQR6TLB/Olson et al. - 2014 - A Comprehensive Biophysical Description of Pairwis.pdf:application/pdf},
+}
+
+@article{duenas-decamp_saturation_2016,
+ title = {Saturation {Mutagenesis} of the {HIV}-1 {Envelope} {CD4} {Binding} {Loop} {Reveals} {Residues} {Controlling} {Distinct} {Trimer} {Conformations}},
+ volume = {12},
+ issn = {1553-7374},
+ url = {https://dx.plos.org/10.1371/journal.ppat.1005988},
+ doi = {10.1371/journal.ppat.1005988},
+ language = {en},
+ number = {11},
+ urldate = {2022-07-11},
+ journal = {PLOS Pathogens},
+ author = {Duenas-Decamp, Maria and Jiang, Li and Bolon, Daniel and Clapham, Paul R.},
+ editor = {Desrosiers, Ronald C.},
+ month = nov,
+ year = {2016},
+ pages = {e1005988},
+ file = {Full Text:/Users/admin/Zotero/storage/B948VI9X/Duenas-Decamp et al. - 2016 - Saturation Mutagenesis of the HIV-1 Envelope CD4 B.pdf:application/pdf},
+}
+
+@article{thompson_altered_2020,
+ title = {Altered expression of a quality control protease in {E}. coli reshapes the in vivo mutational landscape of a model enzyme},
+ volume = {9},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/53476},
+ doi = {10.7554/eLife.53476},
+ abstract = {Protein mutational landscapes are shaped by the cellular environment, but key factors and their quantitative effects are often unknown. Here we show that Lon, a quality control protease naturally absent in common
+ E. coli
+ expression strains, drastically reshapes the mutational landscape of the metabolic enzyme dihydrofolate reductase (DHFR). Selection under conditions that resolve highly active mutants reveals that 23.3\% of all single point mutations in DHFR are advantageous in the absence of Lon, but advantageous mutations are largely suppressed when Lon is reintroduced. Protein stability measurements demonstrate extensive activity-stability tradeoffs for the advantageous mutants and provide a mechanistic explanation for Lon’s widespread impact. Our findings suggest possibilities for tuning mutational landscapes by modulating the cellular environment, with implications for protein design and combatting antibiotic resistance.},
+ language = {en},
+ urldate = {2022-07-11},
+ journal = {eLife},
+ author = {Thompson, Samuel and Zhang, Yang and Ingle, Christine and Reynolds, Kimberly A and Kortemme, Tanja},
+ month = jul,
+ year = {2020},
+ pages = {e53476},
+ file = {Full Text:/Users/admin/Zotero/storage/4U2CHYQR/Thompson et al. - 2020 - Altered expression of a quality control protease i.pdf:application/pdf},
+}
+
+@article{starita_activity-enhancing_2013,
+ title = {Activity-enhancing mutations in an {E3} ubiquitin ligase identified by high-throughput mutagenesis},
+ volume = {110},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1303309110},
+ doi = {10.1073/pnas.1303309110},
+ abstract = {Significance
+ Ubiquitin is a 76 residue protein that is attached to target proteins as a posttranslational modification. This modification is dependent on the successive activity of three enzymes, designated E1, E2, and E3. We developed a high-throughput mutagenesis strategy to probe the mechanism of E3-catalyzed transfer of ubiquitin from the E2 to the target protein. By scoring the effect of nearly 100,000 mutations in an E3, we identified mutations that affect direct and allosteric interactions between the E3 and the E2. These results highlight the general utility of high-throughput mutagenesis in delineating the molecular basis of enzyme activity.
+ ,
+ Although ubiquitination plays a critical role in virtually all cellular processes, mechanistic details of ubiquitin (Ub) transfer are still being defined. To identify the molecular determinants within E3 ligases that modulate activity, we scored each member of a library of nearly 100,000 protein variants of the murine ubiquitination factor E4B (Ube4b) U-box domain for auto-ubiquitination activity in the presence of the E2 UbcH5c. This assay identified mutations that enhance activity both in vitro and in cellular p53 degradation assays. The activity-enhancing mutations fall into two distinct mechanistic classes: One increases the U-box:E2-binding affinity, and the other allosterically stimulates the formation of catalytically active conformations of the E2∼Ub conjugate. The same mutations enhance E3 activity in the presence of another E2, Ube2w, implying a common allosteric mechanism, and therefore the general applicability of our observations to other E3s. A comparison of the E3 activity with the two different E2s identified an additional variant that exhibits E3:E2 specificity. Our results highlight the general utility of high-throughput mutagenesis in delineating the molecular basis of enzyme activity.},
+ language = {en},
+ number = {14},
+ urldate = {2022-07-11},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Starita, Lea M. and Pruneda, Jonathan N. and Lo, Russell S. and Fowler, Douglas M. and Kim, Helen J. and Hiatt, Joseph B. and Shendure, Jay and Brzovic, Peter S. and Fields, Stanley and Klevit, Rachel E.},
+ month = apr,
+ year = {2013},
+ file = {Full Text:/Users/admin/Zotero/storage/FBVQ2HYR/Starita et al. - 2013 - Activity-enhancing mutations in an E3 ubiquitin li.pdf:application/pdf;Full Text:/Users/admin/Zotero/storage/SF4WML4G/Starita et al. - 2013 - Activity-enhancing mutations in an E3 ubiquitin li.pdf:application/pdf},
+}
+
+@article{araya_fundamental_2012,
+ title = {A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function},
+ volume = {109},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1209751109},
+ doi = {10.1073/pnas.1209751109},
+ abstract = {The ability of a protein to carry out a given function results from fundamental physicochemical properties that include the protein’s structure, mechanism of action, and thermodynamic stability. Traditional approaches to study these properties have typically required the direct measurement of the property of interest, oftentimes a laborious undertaking. Although protein properties can be probed by mutagenesis, this approach has been limited by its low throughput. Recent technological developments have enabled the rapid quantification of a protein’s function, such as binding to a ligand, for numerous variants of that protein. Here, we measure the ability of 47,000 variants of a WW domain to bind to a peptide ligand and use these functional measurements to identify stabilizing mutations without directly assaying stability. Our approach is rooted in the well-established concept that protein function is closely related to stability. Protein function is generally reduced by destabilizing mutations, but this decrease can be rescued by stabilizing mutations. Based on this observation, we introduce partner potentiation, a metric that uses this rescue ability to identify stabilizing mutations, and identify 15 candidate stabilizing mutations in the WW domain. We tested six candidates by thermal denaturation and found two highly stabilizing mutations, one more stabilizing than any previously known mutation. Thus, physicochemical properties such as stability are latent within these large-scale protein functional data and can be revealed by systematic analysis. This approach should allow other protein properties to be discovered.},
+ language = {en},
+ number = {42},
+ urldate = {2022-07-11},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Araya, Carlos L. and Fowler, Douglas M. and Chen, Wentao and Muniez, Ike and Kelly, Jeffery W. and Fields, Stanley},
+ month = oct,
+ year = {2012},
+ pages = {16858--16863},
+ file = {Full Text:/Users/admin/Zotero/storage/KR46AVWQ/Araya et al. - 2012 - A fundamental protein property, thermodynamic stab.pdf:application/pdf},
+}
+
+@article{wu_high-throughput_2014,
+ title = {High-throughput profiling of influenza {A} virus hemagglutinin gene at single-nucleotide resolution},
+ volume = {4},
+ issn = {2045-2322},
+ url = {https://www.nature.com/articles/srep04942},
+ doi = {10.1038/srep04942},
+ abstract = {Abstract
+ Genetic research on influenza virus biology has been informed in large part by nucleotide variants present in seasonal or pandemic samples, or individual mutants generated in the laboratory, leaving a substantial part of the genome uncharacterized. Here, we have developed a single-nucleotide resolution genetic approach to interrogate the fitness effect of point mutations in 98\% of the amino acid positions in the influenza A virus hemagglutinin (HA) gene. Our HA fitness map provides a reference to identify indispensable regions to aid in drug and vaccine design as targeting these regions will increase the genetic barrier for the emergence of escape mutations. This study offers a new platform for studying genome dynamics, structure-function relationships, virus-host interactions and can further rational drug and vaccine design. Our approach can also be applied to any virus that can be genetically manipulated.},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Scientific Reports},
+ author = {Wu, Nicholas C. and Young, Arthur P. and Al-Mawsawi, Laith Q. and Olson, C. Anders and Feng, Jun and Qi, Hangfei and Chen, Shu-Hwa and Lu, I.-Hsuan and Lin, Chung-Yen and Chin, Robert G. and Luan, Harding H. and Nguyen, Nguyen and Nelson, Stanley F. and Li, Xinmin and Wu, Ting-Ting and Sun, Ren},
+ month = dec,
+ year = {2014},
+ pages = {4942},
+ file = {Full Text:/Users/admin/Zotero/storage/IPZQQNJQ/Wu et al. - 2014 - High-throughput profiling of influenza A virus hem.pdf:application/pdf},
+}
+
+@techreport{young_deep_2021,
+ type = {preprint},
+ title = {Deep {Mutagenesis} of a {Transporter} for {Uptake} of a {Non}-{Native} {Substrate} {Identifies} {Conformationally} {Dynamic} {Regions}},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2021.04.19.440442},
+ abstract = {Abstract
+
+ The serotonin transporter, SERT, catalyzes serotonin reuptake at the synapse to terminate neurotransmission via an alternating access mechanism, and SERT inhibitors are the most widely prescribed antidepressants. Here, deep mutagenesis is used to determine the effects of nearly all amino acid substitutions on human SERT surface expression and transport of the fluorescent substrate analogue APP+, identifying many mutations that enhance APP+ import. Comprehensive simulations of the entire ion-coupled import process reveal that while binding of the native substrate, serotonin, reduces free energy barriers between conformational states to promote SERT dynamics, the conformational free energy landscape in the presence of APP+ instead resembles Na
+ +
+ bound-SERT, with a higher free energy barrier for transitioning to an inward-facing state. The deep mutational scan for SERT-catalyzed import of APP+ finds mutations that promote the necessary conformational changes that would otherwise be facilitated by the native substrate. Indeed, hundreds of gain-of-function mutations for APP+ import are found along the permeation pathway, most notably mutations that favor opening of a solvent-exposed intracellular vestibule. The mutagenesis data support the simulated mechanism in which the neurotransmitter and a symported sodium share a common cytosolic exit pathway to achieve coupling. Furthermore, the mutational landscape for SERT surface trafficking, which likely filters out misfolded sequences, reveals that residues along the permeation pathway are mutationally tolerant, providing plausible evolutionary pathways for changes in transporter properties while maintaining folded structure.},
+ language = {en},
+ urldate = {2022-07-11},
+ institution = {Biochemistry},
+ author = {Young, Heather J. and Chan, Matthew and Selvam, Balaji and Szymanski, Steven K. and Shukla, Diwakar and Procko, Erik},
+ month = apr,
+ year = {2021},
+ doi = {10.1101/2021.04.19.440442},
+ file = {Submitted Version:/Users/admin/Zotero/storage/T69VH4DC/Young et al. - 2021 - Deep Mutagenesis of a Transporter for Uptake of a .pdf:application/pdf},
+}
+
+@article{jiang_balance_2016,
+ title = {A {Balance} between {Inhibitor} {Binding} and {Substrate} {Processing} {Confers} {Influenza} {Drug} {Resistance}},
+ volume = {428},
+ issn = {00222836},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0022283615006907},
+ doi = {10.1016/j.jmb.2015.11.027},
+ language = {en},
+ number = {3},
+ urldate = {2022-07-11},
+ journal = {Journal of Molecular Biology},
+ author = {Jiang, Li and Liu, Ping and Bank, Claudia and Renzette, Nicholas and Prachanronarong, Kristina and Yilmaz, Lutfu S. and Caffrey, Daniel R. and Zeldovich, Konstantin B. and Schiffer, Celia A. and Kowalik, Timothy F. and Jensen, Jeffrey D. and Finberg, Robert W. and Wang, Jennifer P. and Bolon, Daniel N.A.},
+ month = feb,
+ year = {2016},
+ pages = {538--553},
+ file = {Accepted Version:/Users/admin/Zotero/storage/TJGBH3YJ/Jiang et al. - 2016 - A Balance between Inhibitor Binding and Substrate .pdf:application/pdf},
+}
+
+@article{melamed_deep_2013,
+ title = {Deep mutational scanning of an {RRM} domain of the \textit{{Saccharomyces} cerevisiae} poly({A})-binding protein},
+ volume = {19},
+ issn = {1355-8382, 1469-9001},
+ url = {http://rnajournal.cshlp.org/lookup/doi/10.1261/rna.040709.113},
+ doi = {10.1261/rna.040709.113},
+ abstract = {The RNA recognition motif (RRM) is the most common RNA-binding domain in eukaryotes. Differences in RRM sequences dictate, in part, both RNA and protein-binding specificities and affinities. We used a deep mutational scanning approach to study the sequence-function relationship of the RRM2 domain of the
+ Saccharomyces cerevisiae
+ poly(A)-binding protein (Pab1). By scoring the activity of more than 100,000 unique Pab1 variants, including 1246 with single amino acid substitutions, we delineated the mutational constraints on each residue. Clustering of residues with similar mutational patterns reveals three major classes, composed principally of RNA-binding residues, of hydrophobic core residues, and of the remaining residues. The first class also includes a highly conserved residue not involved in RNA binding, G150, which can be mutated to destabilize Pab1. A comparison of the mutational sensitivity of yeast Pab1 residues to their evolutionary conservation reveals that most residues tolerate more substitutions than are present in the natural sequences, although other residues that tolerate fewer substitutions may point to specialized functions in yeast. An analysis of ∼40,000 double mutants indicates a preference for a short distance between two mutations that display an epistatic interaction. As examples of interactions, the mutations N139T, N139S, and I157L suppress other mutations that interfere with RNA binding and protein stability. Overall, this study demonstrates that living cells can be subjected to a single assay to analyze hundreds of thousands of protein variants in parallel.},
+ language = {en},
+ number = {11},
+ urldate = {2022-07-11},
+ journal = {RNA},
+ author = {Melamed, Daniel and Young, David L. and Gamble, Caitlin E. and Miller, Christina R. and Fields, Stanley},
+ month = nov,
+ year = {2013},
+ pages = {1537--1551},
+ file = {Full Text:/Users/admin/Zotero/storage/AFZEI4G6/Melamed et al. - 2013 - Deep mutational scanning of an RRM domain of the .pdf:application/pdf},
+}
+
+@article{jia_massively_2021,
+ title = {Massively parallel functional testing of {MSH2} missense variants conferring {Lynch} syndrome risk},
+ volume = {108},
+ issn = {00029297},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0002929720304390},
+ doi = {10.1016/j.ajhg.2020.12.003},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {The American Journal of Human Genetics},
+ author = {Jia, Xiaoyan and Burugula, Bala Bharathi and Chen, Victor and Lemons, Rosemary M. and Jayakody, Sajini and Maksutova, Mariam and Kitzman, Jacob O.},
+ month = jan,
+ year = {2021},
+ pages = {163--175},
+ file = {Full Text:/Users/admin/Zotero/storage/4J7YRE9U/Jia et al. - 2021 - Massively parallel functional testing of MSH2 miss.pdf:application/pdf},
+}
+
+@article{mavor_determination_2016,
+ title = {Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting},
+ volume = {5},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/15802},
+ doi = {10.7554/eLife.15802},
+ abstract = {Ubiquitin is essential for eukaryotic life and varies in only 3 amino acid positions between yeast and humans. However, recent deep sequencing studies indicate that ubiquitin is highly tolerant to single mutations. We hypothesized that this tolerance would be reduced by chemically induced physiologic perturbations. To test this hypothesis, a class of first year UCSF graduate students employed deep mutational scanning to determine the fitness landscape of all possible single residue mutations in the presence of five different small molecule perturbations. These perturbations uncover 'shared sensitized positions' localized to areas around the hydrophobic patch and the C-terminus. In addition, we identified perturbation specific effects such as a sensitization of His68 in HU and a tolerance to mutation at Lys63 in DTT. Our data show how chemical stresses can reduce buffering effects in the ubiquitin proteasome system. Finally, this study demonstrates the potential of lab-based interdisciplinary graduate curriculum.
+ ,
+ The ability of an organism to grow and reproduce, that is, it’s “fitness”, is determined by how its genes interact with the environment. Yeast is a model organism in which researchers can control the exact mutations present in the yeast’s genes (its genotype) and the conditions in which the yeast cells live (their environment). This allows researchers to measure how a yeast cell’s genotype and environment affect its fitness.
+ Ubiquitin is a protein that many organisms depend on to manage cell stress by acting as a tag that targets other proteins for degradation. Essential proteins such as ubiquitin often remain unchanged by mutation over long periods of time. As a result, these proteins evolve very slowly. Like all proteins, ubiquitin is built from a chain of amino acid molecules linked together, and the ubiquitin proteins of yeast and humans are made of almost identical sequences of amino acids.
+ Although ubiquitin has barely changed its sequence over evolution, previous studies have shown that – under normal growth conditions in the laboratory – most amino acids in ubiquitin can be mutated without any loss of cell fitness. This led Mavor et al. to hypothesize that treating the yeast cells with chemicals that cause cell stress might lead to amino acids in ubiquitin becoming more sensitive to mutation.
+ To test this idea, a class of graduate students at the University of California, San Francisco grew yeast cells with different ubiquitin mutations together, and with different chemicals that induce cell stress, and measured their growth rates. Sequencing the ubiquitin gene in the thousands of tested yeast cells revealed that three of the chemicals cause a shared set of amino acids in ubiquitin to become more sensitive to mutation.
+ This result suggests that these amino acids are important for the stress response, possibly by altering the ability of yeast cells to target certain proteins for degradation. Conversely, another chemical causes yeast to become more tolerant to changes in the ubiquitin sequence. The experiments also link changes in particular amino acids in ubiquitin to specific stress responses.
+ Mavor et al. show that many of ubquitin’s amino acids are sensitive to mutation under different stress conditions, while others can be mutated to form different amino acids without effecting fitness. By testing the effects of other chemicals, future experiments could further characterize how the yeast’s genotype and environment interact.},
+ language = {en},
+ urldate = {2022-07-11},
+ journal = {eLife},
+ author = {Mavor, David and Barlow, Kyle and Thompson, Samuel and Barad, Benjamin A and Bonny, Alain R and Cario, Clinton L and Gaskins, Garrett and Liu, Zairan and Deming, Laura and Axen, Seth D and Caceres, Elena and Chen, Weilin and Cuesta, Adolfo and Gate, Rachel E and Green, Evan M and Hulce, Kaitlin R and Ji, Weiyue and Kenner, Lillian R and Mensa, Bruk and Morinishi, Leanna S and Moss, Steven M and Mravic, Marco and Muir, Ryan K and Niekamp, Stefan and Nnadi, Chimno I and Palovcak, Eugene and Poss, Erin M and Ross, Tyler D and Salcedo, Eugenia C and See, Stephanie K and Subramaniam, Meena and Wong, Allison W and Li, Jennifer and Thorn, Kurt S and Conchúir, Shane Ó and Roscoe, Benjamin P and Chow, Eric D and DeRisi, Joseph L and Kortemme, Tanja and Bolon, Daniel N and Fraser, James S},
+ month = apr,
+ year = {2016},
+ pages = {e15802},
+ file = {Full Text:/Users/admin/Zotero/storage/H2575JWN/Mavor et al. - 2016 - Determination of ubiquitin fitness landscapes unde.pdf:application/pdf},
+}
+
+@article{glazer_deep_2020,
+ title = {Deep {Mutational} {Scan} of an \textit{{SCN5A}} {Voltage} {Sensor}},
+ volume = {13},
+ issn = {2574-8300},
+ url = {https://www.ahajournals.org/doi/10.1161/CIRCGEN.119.002786},
+ doi = {10.1161/CIRCGEN.119.002786},
+ abstract = {Background:
+ Variants in ion channel genes have classically been studied in low throughput by patch clamping. Deep mutational scanning is a complementary approach that can simultaneously assess function of thousands of variants.
+
+
+ Methods:
+
+ We have developed and validated a method to perform a deep mutational scan of variants in
+ SCN5A
+ , which encodes the major voltage-gated sodium channel in the heart. We created a library of nearly all possible variants in a 36 base region of
+ SCN5A
+ in the S4 voltage sensor of domain IV and stably integrated the library into HEK293T cells.
+
+
+
+ Results:
+ In preliminary experiments, challenge with 3 drugs (veratridine, brevetoxin, and ouabain) could discriminate wild-type channels from gain- and loss-of-function pathogenic variants. High-throughput sequencing of the pre- and postdrug challenge pools was used to count the prevalence of each variant and identify variants with abnormal function. The deep mutational scan scores identified 40 putative gain-of-function and 33 putative loss-of-function variants. For 8 of 9 variants, patch clamping data were consistent with the scores.
+
+
+ Conclusions:
+
+ These experiments demonstrate the accuracy of a high-throughput in vitro scan of
+ SCN5A
+ variant function, which can be used to identify deleterious variants in
+ SCN5A
+ and other ion channel genes.},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Circulation: Genomic and Precision Medicine},
+ author = {Glazer, Andrew M. and Kroncke, Brett M. and Matreyek, Kenneth A. and Yang, Tao and Wada, Yuko and Shields, Tiffany and Salem, Joe-Elie and Fowler, Douglas M. and Roden, Dan M.},
+ month = feb,
+ year = {2020},
+ pages = {e002786},
+ file = {Full Text:/Users/admin/Zotero/storage/AHYB9LRW/Glazer et al. - 2020 - Deep Mutational Scan of an SCN5A Voltage Se.pdf:application/pdf},
+}
+
+@article{kelsic_rna_2016,
+ title = {{RNA} {Structural} {Determinants} of {Optimal} {Codons} {Revealed} by {MAGE}-{Seq}},
+ volume = {3},
+ issn = {24054712},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S2405471216303684},
+ doi = {10.1016/j.cels.2016.11.004},
+ language = {en},
+ number = {6},
+ urldate = {2022-07-11},
+ journal = {Cell Systems},
+ author = {Kelsic, Eric D. and Chung, Hattie and Cohen, Niv and Park, Jimin and Wang, Harris H. and Kishony, Roy},
+ month = dec,
+ year = {2016},
+ pages = {563--571.e6},
+ file = {Full Text:/Users/admin/Zotero/storage/IF39P5MD/Kelsic et al. - 2016 - RNA Structural Determinants of Optimal Codons Reve.pdf:application/pdf},
+}
+
+@article{rockah-shmuel_systematic_2015,
+ title = {Systematic {Mapping} of {Protein} {Mutational} {Space} by {Prolonged} {Drift} {Reveals} the {Deleterious} {Effects} of {Seemingly} {Neutral} {Mutations}},
+ volume = {11},
+ issn = {1553-7358},
+ url = {https://dx.plos.org/10.1371/journal.pcbi.1004421},
+ doi = {10.1371/journal.pcbi.1004421},
+ language = {en},
+ number = {8},
+ urldate = {2022-07-11},
+ journal = {PLOS Computational Biology},
+ author = {Rockah-Shmuel, Liat and Tóth-Petróczy, Ágnes and Tawfik, Dan S.},
+ editor = {Orengo, Christine A.},
+ month = aug,
+ year = {2015},
+ pages = {e1004421},
+ file = {Full Text:/Users/admin/Zotero/storage/QVEEKDBN/Rockah-Shmuel et al. - 2015 - Systematic Mapping of Protein Mutational Space by .pdf:application/pdf},
+}
+
+@article{kitzman_massively_2015,
+ title = {Massively parallel single-amino-acid mutagenesis},
+ volume = {12},
+ issn = {1548-7091, 1548-7105},
+ url = {http://www.nature.com/articles/nmeth.3223},
+ doi = {10.1038/nmeth.3223},
+ language = {en},
+ number = {3},
+ urldate = {2022-07-11},
+ journal = {Nature Methods},
+ author = {Kitzman, Jacob O and Starita, Lea M and Lo, Russell S and Fields, Stanley and Shendure, Jay},
+ month = mar,
+ year = {2015},
+ pages = {203--206},
+ file = {Accepted Version:/Users/admin/Zotero/storage/SSUEVZ32/Kitzman et al. - 2015 - Massively parallel single-amino-acid mutagenesis.pdf:application/pdf},
+}
+
+@article{aakre_evolving_2015,
+ title = {Evolving {New} {Protein}-{Protein} {Interaction} {Specificity} through {Promiscuous} {Intermediates}},
+ volume = {163},
+ issn = {00928674},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0092867415012726},
+ doi = {10.1016/j.cell.2015.09.055},
+ language = {en},
+ number = {3},
+ urldate = {2022-07-11},
+ journal = {Cell},
+ author = {Aakre, Christopher D. and Herrou, Julien and Phung, Tuyen N. and Perchuk, Barrett S. and Crosson, Sean and Laub, Michael T.},
+ month = oct,
+ year = {2015},
+ pages = {594--606},
+ file = {Full Text:/Users/admin/Zotero/storage/4IIQTP93/Aakre et al. - 2015 - Evolving New Protein-Protein Interaction Specifici.pdf:application/pdf},
+}
+
+@article{bolognesi_mutational_2019,
+ title = {The mutational landscape of a prion-like domain},
+ volume = {10},
+ issn = {2041-1723},
+ url = {http://www.nature.com/articles/s41467-019-12101-z},
+ doi = {10.1038/s41467-019-12101-z},
+ abstract = {Abstract
+ Insoluble protein aggregates are the hallmarks of many neurodegenerative diseases. For example, aggregates of TDP-43 occur in nearly all cases of amyotrophic lateral sclerosis (ALS). However, whether aggregates cause cellular toxicity is still not clear, even in simpler cellular systems. We reasoned that deep mutagenesis might be a powerful approach to disentangle the relationship between aggregation and toxicity. We generated {\textgreater}50,000 mutations in the prion-like domain (PRD) of TDP-43 and quantified their toxicity in yeast cells. Surprisingly, mutations that increase hydrophobicity and aggregation strongly decrease toxicity. In contrast, toxic variants promote the formation of dynamic liquid-like condensates. Mutations have their strongest effects in a hotspot that genetic interactions reveal to be structured in vivo, illustrating how mutagenesis can probe the in vivo structures of unstructured proteins. Our results show that aggregation of TDP-43 is not harmful but protects cells, most likely by titrating the protein away from a toxic liquid-like phase.},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Nature Communications},
+ author = {Bolognesi, Benedetta and Faure, Andre J. and Seuma, Mireia and Schmiedel, Jörn M. and Tartaglia, Gian Gaetano and Lehner, Ben},
+ month = dec,
+ year = {2019},
+ pages = {4162},
+ file = {Full Text:/Users/admin/Zotero/storage/3J8H73B4/Bolognesi et al. - 2019 - The mutational landscape of a prion-like domain.pdf:application/pdf},
+}
+
+@techreport{flynn_comprehensive_2022,
+ type = {preprint},
+ title = {Comprehensive fitness landscape of {SARS}-{CoV}-2 {M} $^{\textrm{pro}}$ reveals insights into viral resistance mechanisms},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2022.01.26.477860},
+ abstract = {Abstract
+
+ With the continual evolution of new strains of SARS-CoV-2 that are more virulent, transmissible, and able to evade current vaccines, there is an urgent need for effective anti-viral drugs. SARS-CoV-2 main protease (M
+ pro
+ ) is a leading target for drug design due to its conserved and indispensable role in the viral life cycle. Drugs targeting M
+ pro
+ appear promising but will elicit selection pressure for resistance. To understand resistance potential in M
+ pro
+ , we performed a comprehensive mutational scan of the protease that analyzed the function of all possible single amino acid changes. We developed three separate high-throughput assays of M
+ pro
+ function in yeast, based on either the ability of M
+ pro
+ variants to cleave at a defined cut-site or on the toxicity of their expression to yeast. We used deep sequencing to quantify the functional effects of each variant in each screen. The protein fitness landscapes from all three screens were strongly correlated, indicating that they captured the biophysical properties critical to M
+ pro
+ function. The fitness landscapes revealed a non-active site location on the surface that is extremely sensitive to mutation making it a favorable location to target with inhibitors. In addition, we found a network of critical amino acids that physically bridge the two active sites of the M
+ pro
+ dimer. The clinical variants of M
+ pro
+ were predominantly functional in our screens, indicating that M
+ pro
+ is under strong selection pressure in the human population. Our results provide predictions of mutations that will be readily accessible to M
+ pro
+ evolution and that are likely to contribute to drug resistance. This complete mutational guide of M
+ pro
+ can be used in the design of inhibitors with reduced potential of evolving viral resistance.},
+ language = {en},
+ urldate = {2022-07-11},
+ institution = {Molecular Biology},
+ author = {Flynn, Julia M. and Samant, Neha and Schneider-Nachum, Gily and Barkan, David T. and Yilmaz, Nese Kurt and Schiffer, Celia A. and Moquin, Stephanie A. and Dovala, Dustin and Bolon, Daniel N.A.},
+ month = jan,
+ year = {2022},
+ doi = {10.1101/2022.01.26.477860},
+ file = {Submitted Version:/Users/admin/Zotero/storage/MR6KIKAK/Flynn et al. - 2022 - Comprehensive fitness landscape of SARS-CoV-2 M s.pdf:application/pdf},
+}
+
+@article{haddox_mapping_2018,
+ title = {Mapping mutational effects along the evolutionary landscape of {HIV} envelope},
+ volume = {7},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/34420},
+ doi = {10.7554/eLife.34420},
+ abstract = {The immediate evolutionary space accessible to HIV is largely determined by how single amino acid mutations affect fitness. These mutational effects can shift as the virus evolves. However, the prevalence of such shifts in mutational effects remains unclear. Here, we quantify the effects on viral growth of all amino acid mutations to two HIV envelope (Env) proteins that differ at
+
+ {\textgreater}
+
+ 100 residues. Most mutations similarly affect both Envs, but the amino acid preferences of a minority of sites have clearly shifted. These shifted sites usually prefer a specific amino acid in one Env, but tolerate many amino acids in the other. Surprisingly, shifts are only slightly enriched at sites that have substituted between the Envs—and many occur at residues that do not even contact substitutions. Therefore, long-range epistasis can unpredictably shift Env’s mutational tolerance during HIV evolution, although the amino acid preferences of most sites are conserved between moderately diverged viral strains.
+
+ ,
+ The virus that causes AIDS, or HIV, has a protein called Env on its surface, which is essential for the virus to infect cells. Env can also be recognized by the immune system, which then targets the virus for destruction or blocks it from infecting cells. Unfortunately, Env evolves very quickly, which means that HIV can evade our defenses. However, there are limits to how much this protein can change, since it still needs to perform its essential role in helping viruses enter cells.
+ In the century since HIV first appeared in human populations, the virus has evolved considerably. There are now many HIV strains that infect people, and they bear Env proteins with substantially different sequences. However, it is not clear if these changes in sequence have resulted in Envs from distinct strains being able to tolerate different mutations.
+ To examine this question, Haddox et al. compared how the Envs from two strains of HIV react to modifications in their sequences. They created all possible individual mutations in the proteins, and the resulting collections of mutated viruses were then tested for their ability to infect cells in the laboratory.
+ Most mutations had similar effects in both Env proteins. This allowed Haddox et al. to identify portions of the protein that easily accommodate changes, and portions that must remain unchanged for viruses to remain infectious—at least in the laboratory. Some of these mutations are under different types of pressures when the virus faces the immune system, and those were identified using computational approaches.
+ However, some mutations were tolerated differently by the two Env proteins. Therefore, viral strains differ in how their Env proteins can evolve. The parts of Env that showed differences in mutational tolerance between the strains were not necessarily the parts that differ in sequence. This shows that changes in sequence in one part of the protein can modify how other portions evolve.
+ It remains to be determined whether changes in tolerance to mutations translate into differences in how the virus can escape immunity. This is an important question given that the rapid evolution of Env is a major obstacle to creating a vaccine for HIV.},
+ language = {en},
+ urldate = {2022-07-11},
+ journal = {eLife},
+ author = {Haddox, Hugh K and Dingens, Adam S and Hilton, Sarah K and Overbaugh, Julie and Bloom, Jesse D},
+ month = mar,
+ year = {2018},
+ pages = {e34420},
+ file = {Full Text:/Users/admin/Zotero/storage/2JX4KQTR/Haddox et al. - 2018 - Mapping mutational effects along the evolutionary .pdf:application/pdf},
+}
+
+@article{stiffler_evolvability_2015,
+ title = {Evolvability as a {Function} of {Purifying} {Selection} in {TEM}-1 Beta-{Lactamase}},
+ volume = {160},
+ issn = {00928674},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0092867415000781},
+ doi = {10.1016/j.cell.2015.01.035},
+ language = {en},
+ number = {5},
+ urldate = {2022-07-11},
+ journal = {Cell},
+ author = {Stiffler, Michael A. and Hekstra, Doeke R. and Ranganathan, Rama},
+ month = feb,
+ year = {2015},
+ pages = {882--892},
+ file = {Full Text:/Users/admin/Zotero/storage/D95TSRL8/Stiffler et al. - 2015 - Evolvability as a Function of Purifying Selection .pdf:application/pdf},
+}
+
+@article{tripathi_molecular_2016,
+ title = {Molecular {Determinants} of {Mutant} {Phenotypes}, {Inferred} from {Saturation} {Mutagenesis} {Data}},
+ volume = {33},
+ issn = {0737-4038, 1537-1719},
+ url = {https://academic.oup.com/mbe/article-lookup/doi/10.1093/molbev/msw182},
+ doi = {10.1093/molbev/msw182},
+ language = {en},
+ number = {11},
+ urldate = {2022-07-11},
+ journal = {Molecular Biology and Evolution},
+ author = {Tripathi, Arti and Gupta, Kritika and Khare, Shruti and Jain, Pankaj C. and Patel, Siddharth and Kumar, Prasanth and Pulianmackal, Ajai J. and Aghera, Nilesh and Varadarajan, Raghavan},
+ month = nov,
+ year = {2016},
+ pages = {2960--2975},
+ file = {Full Text:/Users/admin/Zotero/storage/UFKDKVYH/Tripathi et al. - 2016 - Molecular Determinants of Mutant Phenotypes, Infer.pdf:application/pdf},
+}
+
+@article{findlay_accurate_2018,
+ title = {Accurate classification of {BRCA1} variants with saturation genome editing},
+ volume = {562},
+ issn = {0028-0836, 1476-4687},
+ url = {http://www.nature.com/articles/s41586-018-0461-z},
+ doi = {10.1038/s41586-018-0461-z},
+ language = {en},
+ number = {7726},
+ urldate = {2022-07-11},
+ journal = {Nature},
+ author = {Findlay, Gregory M. and Daza, Riza M. and Martin, Beth and Zhang, Melissa D. and Leith, Anh P. and Gasperini, Molly and Janizek, Joseph D. and Huang, Xingfan and Starita, Lea M. and Shendure, Jay},
+ month = oct,
+ year = {2018},
+ pages = {217--222},
+ file = {Accepted Version:/Users/admin/Zotero/storage/QFJF3L2Y/Findlay et al. - 2018 - Accurate classification of BRCA1 variants with sat.pdf:application/pdf},
+}
+
+@article{lee_deep_2018,
+ title = {Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human {H3N2} influenza variants},
+ volume = {115},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1806133115},
+ doi = {10.1073/pnas.1806133115},
+ abstract = {Significance
+ A key goal in the study of influenza virus evolution is to forecast which viral strains will persist and which ones will die out. Here we experimentally measure the effects of all amino acid mutations to the hemagglutinin protein from a human H3N2 influenza strain on viral growth in cell culture. We show that these measurements have utility for distinguishing among viral strains that do and do not succeed in nature. Overall, our work suggests that new high-throughput experimental approaches may be useful for understanding virus evolution in nature.
+ ,
+ Human influenza virus rapidly accumulates mutations in its major surface protein hemagglutinin (HA). The evolutionary success of influenza virus lineages depends on how these mutations affect HA’s functionality and antigenicity. Here we experimentally measure the effects on viral growth in cell culture of all single amino acid mutations to the HA from a recent human H3N2 influenza virus strain. We show that mutations that are measured to be more favorable for viral growth are enriched in evolutionarily successful H3N2 viral lineages relative to mutations that are measured to be less favorable for viral growth. Therefore, despite the well-known caveats about cell-culture measurements of viral fitness, such measurements can still be informative for understanding evolution in nature. We also compare our measurements for H3 HA to similar data previously generated for a distantly related H1 HA and find substantial differences in which amino acids are preferred at many sites. For instance, the H3 HA has less disparity in mutational tolerance between the head and stalk domains than the H1 HA. Overall, our work suggests that experimental measurements of mutational effects can be leveraged to help understand the evolutionary fates of viral lineages in nature—but only when the measurements are made on a viral strain similar to the ones being studied in nature.},
+ language = {en},
+ number = {35},
+ urldate = {2022-07-11},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Lee, Juhye M. and Huddleston, John and Doud, Michael B. and Hooper, Kathryn A. and Wu, Nicholas C. and Bedford, Trevor and Bloom, Jesse D.},
+ month = aug,
+ year = {2018},
+ file = {Full Text:/Users/admin/Zotero/storage/HG3QHAPN/Lee et al. - 2018 - Deep mutational scanning of hemagglutinin helps pr.pdf:application/pdf},
+}
+
+@article{weile_framework_2017,
+ title = {A framework for exhaustively mapping functional missense variants},
+ volume = {13},
+ issn = {1744-4292, 1744-4292},
+ url = {https://onlinelibrary.wiley.com/doi/10.15252/msb.20177908},
+ doi = {10.15252/msb.20177908},
+ language = {en},
+ number = {12},
+ urldate = {2022-07-11},
+ journal = {Molecular Systems Biology},
+ author = {Weile, Jochen and Sun, Song and Cote, Atina G and Knapp, Jennifer and Verby, Marta and Mellor, Joseph C and Wu, Yingzhou and Pons, Carles and Wong, Cassandra and Lieshout, Natascha and Yang, Fan and Tasan, Murat and Tan, Guihong and Yang, Shan and Fowler, Douglas M and Nussbaum, Robert and Bloom, Jesse D and Vidal, Marc and Hill, David E and Aloy, Patrick and Roth, Frederick P},
+ month = dec,
+ year = {2017},
+ pages = {957},
+ file = {Full Text:/Users/admin/Zotero/storage/YYUXTHFT/Weile et al. - 2017 - A framework for exhaustively mapping functional mi.pdf:application/pdf;Full Text:/Users/admin/Zotero/storage/HCRUH6RQ/Weile et al. - 2017 - A framework for exhaustively mapping functional mi.pdf:application/pdf},
+}
+
+@article{qi_quantitative_2014,
+ title = {A {Quantitative} {High}-{Resolution} {Genetic} {Profile} {Rapidly} {Identifies} {Sequence} {Determinants} of {Hepatitis} {C} {Viral} {Fitness} and {Drug} {Sensitivity}},
+ volume = {10},
+ issn = {1553-7374},
+ url = {https://dx.plos.org/10.1371/journal.ppat.1004064},
+ doi = {10.1371/journal.ppat.1004064},
+ language = {en},
+ number = {4},
+ urldate = {2022-07-11},
+ journal = {PLoS Pathogens},
+ author = {Qi, Hangfei and Olson, C. Anders and Wu, Nicholas C. and Ke, Ruian and Loverdo, Claude and Chu, Virginia and Truong, Shawna and Remenyi, Roland and Chen, Zugen and Du, Yushen and Su, Sheng-Yao and Al-Mawsawi, Laith Q. and Wu, Ting-Ting and Chen, Shu-Hua and Lin, Chung-Yen and Zhong, Weidong and Lloyd-Smith, James O. and Sun, Ren},
+ editor = {Wilke, Claus O.},
+ month = apr,
+ year = {2014},
+ pages = {e1004064},
+ file = {Full Text:/Users/admin/Zotero/storage/ZXNH72PK/Qi et al. - 2014 - A Quantitative High-Resolution Genetic Profile Rap.pdf:application/pdf},
+}
+
+@article{chan_correlation_2017,
+ title = {Correlation of fitness landscapes from three orthologous {TIM} barrels originates from sequence and structure constraints},
+ volume = {8},
+ issn = {2041-1723},
+ url = {http://www.nature.com/articles/ncomms14614},
+ doi = {10.1038/ncomms14614},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Nature Communications},
+ author = {Chan, Yvonne H. and Venev, Sergey V. and Zeldovich, Konstantin B. and Matthews, C. Robert},
+ month = apr,
+ year = {2017},
+ pages = {14614},
+ file = {Full Text:/Users/admin/Zotero/storage/A6E42JVX/Chan et al. - 2017 - Correlation of fitness landscapes from three ortho.pdf:application/pdf},
+}
+
+@article{melnikov_comprehensive_2014,
+ title = {Comprehensive mutational scanning of a kinase \textit{in vivo} reveals substrate-dependent fitness landscapes},
+ volume = {42},
+ issn = {0305-1048, 1362-4962},
+ url = {https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gku511},
+ doi = {10.1093/nar/gku511},
+ language = {en},
+ number = {14},
+ urldate = {2022-07-11},
+ journal = {Nucleic Acids Research},
+ author = {Melnikov, Alexandre and Rogov, Peter and Wang, Li and Gnirke, Andreas and Mikkelsen, Tarjei S.},
+ month = aug,
+ year = {2014},
+ pages = {e112--e112},
+ file = {Full Text:/Users/admin/Zotero/storage/U454BG33/Melnikov et al. - 2014 - Comprehensive mutational scanning of a kinase i.pdf:application/pdf},
+}
+
+@article{nutschel_systematically_2020,
+ title = {Systematically {Scrutinizing} the {Impact} of {Substitution} {Sites} on {Thermostability} and {Detergent} {Tolerance} for \textit{{Bacillus} subtilis} {Lipase} {A}},
+ volume = {60},
+ issn = {1549-9596, 1549-960X},
+ url = {https://pubs.acs.org/doi/10.1021/acs.jcim.9b00954},
+ doi = {10.1021/acs.jcim.9b00954},
+ language = {en},
+ number = {3},
+ urldate = {2022-07-11},
+ journal = {Journal of Chemical Information and Modeling},
+ author = {Nutschel, Christina and Fulton, Alexander and Zimmermann, Olav and Schwaneberg, Ulrich and Jaeger, Karl-Erich and Gohlke, Holger},
+ month = mar,
+ year = {2020},
+ pages = {1568--1584},
+}
+
+@article{jacquier_capturing_2013,
+ title = {Capturing the mutational landscape of the beta-lactamase {TEM}-1},
+ volume = {110},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1215206110},
+ doi = {10.1073/pnas.1215206110},
+ abstract = {Adaptation proceeds through the selection of mutations. The distribution of mutant fitness effect and the forces shaping this distribution are therefore keys to predict the evolutionary fate of organisms and their constituents such as enzymes. Here, by producing and sequencing a comprehensive collection of 10,000 mutants, we explore the mutational landscape of one enzyme involved in the spread of antibiotic resistance, the beta-lactamase TEM-1. We measured mutation impact on the enzyme activity through the estimation of amoxicillin minimum inhibitory concentration on a subset of 990 mutants carrying a unique missense mutation, representing 64\% of possible amino acid changes in that protein reachable by point mutation. We established that mutation type, solvent accessibility of residues, and the predicted effect of mutations on protein stability primarily determined alone or in combination changes in minimum inhibitory concentration of mutants. Moreover, we were able to capture the drastic modification of the mutational landscape induced by a single stabilizing point mutation (M182T) by a simple model of protein stability. This work thereby provides an integrated framework to study mutation effects and a tool to understand/define better the epistatic interactions.},
+ language = {en},
+ number = {32},
+ urldate = {2022-07-11},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Jacquier, Hervé and Birgy, André and Le Nagard, Hervé and Mechulam, Yves and Schmitt, Emmanuelle and Glodt, Jérémy and Bercot, Beatrice and Petit, Emmanuelle and Poulain, Julie and Barnaud, Guilène and Gros, Pierre-Alexis and Tenaillon, Olivier},
+ month = aug,
+ year = {2013},
+ pages = {13067--13072},
+ file = {Full Text:/Users/admin/Zotero/storage/KCXP8PUL/Jacquier et al. - 2013 - Capturing the mutational landscape of the beta-lac.pdf:application/pdf},
+}
+
+@article{mishra_systematic_2016,
+ title = {Systematic {Mutant} {Analyses} {Elucidate} {General} and {Client}-{Specific} {Aspects} of {Hsp90} {Function}},
+ volume = {15},
+ issn = {22111247},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S2211124716303175},
+ doi = {10.1016/j.celrep.2016.03.046},
+ language = {en},
+ number = {3},
+ urldate = {2022-07-11},
+ journal = {Cell Reports},
+ author = {Mishra, Parul and Flynn, Julia M. and Starr, Tyler N. and Bolon, Daniel N.A.},
+ month = apr,
+ year = {2016},
+ pages = {588--598},
+ file = {Full Text:/Users/admin/Zotero/storage/N3VLLXTY/Mishra et al. - 2016 - Systematic Mutant Analyses Elucidate General and C.pdf:application/pdf},
+}
+
+@article{pokusaeva_experimental_2019,
+ title = {An experimental assay of the interactions of amino acids from orthologous sequences shaping a complex fitness landscape},
+ volume = {15},
+ issn = {1553-7404},
+ url = {https://dx.plos.org/10.1371/journal.pgen.1008079},
+ doi = {10.1371/journal.pgen.1008079},
+ language = {en},
+ number = {4},
+ urldate = {2022-07-11},
+ journal = {PLOS Genetics},
+ author = {Pokusaeva, Victoria O. and Usmanova, Dinara R. and Putintseva, Ekaterina V. and Espinar, Lorena and Sarkisyan, Karen S. and Mishin, Alexander S. and Bogatyreva, Natalya S. and Ivankov, Dmitry N. and Akopyan, Arseniy V. and Avvakumov, Sergey Ya. and Povolotskaya, Inna S. and Filion, Guillaume J. and Carey, Lucas B. and Kondrashov, Fyodor A.},
+ editor = {Petrov, Dmitri A.},
+ month = apr,
+ year = {2019},
+ pages = {e1008079},
+ file = {Full Text:/Users/admin/Zotero/storage/39VZ7AQN/Pokusaeva et al. - 2019 - An experimental assay of the interactions of amino.pdf:application/pdf},
+}
+
+@article{fernandes_functional_2016,
+ title = {Functional {Segregation} of {Overlapping} {Genes} in {HIV}},
+ volume = {167},
+ issn = {00928674},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0092867416316038},
+ doi = {10.1016/j.cell.2016.11.031},
+ language = {en},
+ number = {7},
+ urldate = {2022-07-11},
+ journal = {Cell},
+ author = {Fernandes, Jason D. and Faust, Tyler B. and Strauli, Nicolas B. and Smith, Cynthia and Crosby, David C. and Nakamura, Robert L. and Hernandez, Ryan D. and Frankel, Alan D.},
+ month = dec,
+ year = {2016},
+ pages = {1762--1773.e12},
+ file = {Full Text:/Users/admin/Zotero/storage/M2JCUQLM/Fernandes et al. - 2016 - Functional Segregation of Overlapping Genes in HIV.pdf:application/pdf},
+}
+
+@techreport{sinai_generative_2021,
+ type = {preprint},
+ title = {Generative {AAV} capsid diversification by latent interpolation},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2021.04.16.440236},
+ abstract = {Summary
+ Adeno-associated virus (AAV) capsids have shown clinical promise as delivery vectors for gene therapy. However, the high prevalence of pre-existing immunity against natural capsids poses a challenge for widespread treatment. The generation of diverse capsids that are potentially more capable of immune evasion is challenging because introducing multiple mutations often breaks capsid assembly. Here we target a representative, immunologically relevant 28-amino-acid segment of the AAV2 capsid and show that a low-complexity Variational Auto-encoder (VAE) can interpolate in sequence space to produce diverse and novel capsids capable of packaging their own genomes. We first train the VAE on a 564-sample Multiple-Sequence Alignment (MSA) of dependo-parvoviruses, and then further augment this dataset by adding 22,704 samples from a deep mutational exploration (DME) on the target region. In both cases the VAE generated viable variants with many mutations, which we validated experimentally. We propose that this simple approach can be used to optimize and diversify other proteins, as well as other capsid traits of interest for gene delivery.},
+ language = {en},
+ urldate = {2022-07-11},
+ institution = {Synthetic Biology},
+ author = {Sinai, Sam and Jain, Nina and Church, George M and Kelsic, Eric D},
+ month = apr,
+ year = {2021},
+ doi = {10.1101/2021.04.16.440236},
+ file = {Submitted Version:/Users/admin/Zotero/storage/TEMTVFT7/Sinai et al. - 2021 - Generative AAV capsid diversification by latent in.pdf:application/pdf},
+}
+
+@article{wu_functional_2015,
+ title = {Functional {Constraint} {Profiling} of a {Viral} {Protein} {Reveals} {Discordance} of {Evolutionary} {Conservation} and {Functionality}},
+ volume = {11},
+ issn = {1553-7404},
+ url = {https://dx.plos.org/10.1371/journal.pgen.1005310},
+ doi = {10.1371/journal.pgen.1005310},
+ language = {en},
+ number = {7},
+ urldate = {2022-07-11},
+ journal = {PLOS Genetics},
+ author = {Wu, Nicholas C. and Olson, C. Anders and Du, Yushen and Le, Shuai and Tran, Kevin and Remenyi, Roland and Gong, Danyang and Al-Mawsawi, Laith Q. and Qi, Hangfei and Wu, Ting-Ting and Sun, Ren},
+ editor = {Worobey, Michael},
+ month = jul,
+ year = {2015},
+ pages = {e1005310},
+ file = {Full Text:/Users/admin/Zotero/storage/NNQX65KR/Wu et al. - 2015 - Functional Constraint Profiling of a Viral Protein.pdf:application/pdf},
+}
+
+@article{matreyek_integrating_2021,
+ title = {Integrating thousands of {PTEN} variant activity and abundance measurements reveals variant subgroups and new dominant negatives in cancers},
+ volume = {13},
+ issn = {1756-994X},
+ url = {https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-021-00984-x},
+ doi = {10.1186/s13073-021-00984-x},
+ abstract = {Abstract
+
+ Background
+ PTEN is a multi-functional tumor suppressor protein regulating cell growth, immune signaling, neuronal function, and genome stability. Experimental characterization can help guide the clinical interpretation of the thousands of germline or somatic PTEN variants observed in patients. Two large-scale mutational datasets, one for PTEN variant intracellular abundance encompassing 4112 missense variants and one for lipid phosphatase activity encompassing 7244 variants, were recently published. The combined information from these datasets can reveal variant-specific phenotypes that may underlie various clinical presentations, but this has not been comprehensively examined, particularly for somatic PTEN variants observed in cancers.
+
+
+ Methods
+ Here, we add to these efforts by measuring the intracellular abundance of 764 new PTEN variants and refining abundance measurements for 3351 previously studied variants. We use this expanded and refined PTEN abundance dataset to explore the mutational patterns governing PTEN intracellular abundance, and then incorporate the phosphatase activity data to subdivide PTEN variants into four functionally distinct groups.
+
+
+ Results
+ This analysis revealed a set of highly abundant but lipid phosphatase defective variants that could act in a dominant-negative fashion to suppress PTEN activity. Two of these variants were, indeed, capable of dysregulating Akt signaling in cells harboring a WT PTEN allele. Both variants were observed in multiple breast or uterine tumors, demonstrating the disease relevance of these high abundance, inactive variants.
+
+
+ Conclusions
+ We show that multidimensional, large-scale variant functional data, when paired with public cancer genomics datasets and follow-up assays, can improve understanding of uncharacterized cancer-associated variants, and provide better insights into how they contribute to oncogenesis.},
+ language = {en},
+ number = {1},
+ urldate = {2022-07-11},
+ journal = {Genome Medicine},
+ author = {Matreyek, Kenneth A. and Stephany, Jason J. and Ahler, Ethan and Fowler, Douglas M.},
+ month = dec,
+ year = {2021},
+ pages = {165},
+ file = {Full Text:/Users/admin/Zotero/storage/8VXIZT8Q/Matreyek et al. - 2021 - Integrating thousands of PTEN variant activity and.pdf:application/pdf},
+}
+
+@article{mighell_saturation_2018,
+ title = {A {Saturation} {Mutagenesis} {Approach} to {Understanding} {PTEN} {Lipid} {Phosphatase} {Activity} and {Genotype}-{Phenotype} {Relationships}},
+ volume = {102},
+ issn = {00029297},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0002929718301071},
+ doi = {10.1016/j.ajhg.2018.03.018},
+ language = {en},
+ number = {5},
+ urldate = {2023-06-14},
+ journal = {The American Journal of Human Genetics},
+ author = {Mighell, Taylor L. and Evans-Dutson, Sara and O’Roak, Brian J.},
+ month = may,
+ year = {2018},
+ pages = {943--955},
+ file = {Full Text:/Users/admin/Zotero/storage/9UEQ49C4/Mighell et al. - 2018 - A Saturation Mutagenesis Approach to Understanding.pdf:application/pdf},
+}
+
+@article{russ_evolution-based_2020,
+ title = {An evolution-based model for designing chorismate mutase enzymes},
+ volume = {369},
+ issn = {0036-8075, 1095-9203},
+ url = {https://www.sciencemag.org/lookup/doi/10.1126/science.aba3304},
+ doi = {10.1126/science.aba3304},
+ abstract = {The rational design of enzymes is an important goal for both fundamental and practical reasons. Here, we describe a process to learn the constraints for specifying proteins purely from evolutionary sequence data, design and build libraries of synthetic genes, and test them for activity in vivo using a quantitative complementation assay. For chorismate mutase, a key enzyme in the biosynthesis of aromatic amino acids, we demonstrate the design of natural-like catalytic function with substantial sequence diversity. Further optimization focuses the generative model toward function in a specific genomic context. The data show that sequence-based statistical models suffice to specify proteins and provide access to an enormous space of functional sequences. This result provides a foundation for a general process for evolution-based design of artificial proteins.},
+ language = {en},
+ number = {6502},
+ urldate = {2023-06-14},
+ journal = {Science},
+ author = {Russ, William P. and Figliuzzi, Matteo and Stocker, Christian and Barrat-Charlaix, Pierre and Socolich, Michael and Kast, Peter and Hilvert, Donald and Monasson, Remi and Cocco, Simona and Weigt, Martin and Ranganathan, Rama},
+ month = jul,
+ year = {2020},
+ pages = {440--445},
+}
+
+@article{gonzalez_fitness_2019,
+ title = {Fitness {Effects} of {Single} {Amino} {Acid} {Insertions} and {Deletions} in {TEM}-1 Beta-{Lactamase}},
+ volume = {431},
+ issn = {00222836},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0022283619302372},
+ doi = {10.1016/j.jmb.2019.04.030},
+ language = {en},
+ number = {12},
+ urldate = {2023-06-14},
+ journal = {Journal of Molecular Biology},
+ author = {Gonzalez, Courtney E. and Roberts, Paul and Ostermeier, Marc},
+ month = may,
+ year = {2019},
+ pages = {2320--2330},
+ file = {Accepted Version:/Users/admin/Zotero/storage/ZWGQXGBD/Gonzalez et al. - 2019 - Fitness Effects of Single Amino Acid Insertions an.pdf:application/pdf},
+}
+
+@article{macdonald_dimple_2023,
+ title = {{DIMPLE}: deep insertion, deletion, and missense mutation libraries for exploring protein variation in evolution, disease, and biology},
+ volume = {24},
+ issn = {1474-760X},
+ shorttitle = {{DIMPLE}},
+ url = {https://genomebiology.biomedcentral.com/articles/10.1186/s13059-023-02880-6},
+ doi = {10.1186/s13059-023-02880-6},
+ abstract = {Abstract
+ Insertions and deletions (indels) enable evolution and cause disease. Due to technical challenges, indels are left out of most mutational scans, limiting our understanding of them in disease, biology, and evolution. We develop a low cost and bias method, DIMPLE, for systematically generating deletions, insertions, and missense mutations in genes, which we test on a range of targets, including Kir2.1. We use DIMPLE to study how indels impact potassium channel structure, disease, and evolution. We find deletions are most disruptive overall, beta sheets are most sensitive to indels, and flexible loops are sensitive to deletions yet tolerate insertions.},
+ language = {en},
+ number = {1},
+ urldate = {2023-06-14},
+ journal = {Genome Biology},
+ author = {Macdonald, Christian B. and Nedrud, David and Grimes, Patrick Rockefeller and Trinidad, Donovan and Fraser, James S. and Coyote-Maestas, Willow},
+ month = feb,
+ year = {2023},
+ pages = {36},
+ file = {Full Text:/Users/admin/Zotero/storage/3MMXFUJ4/Macdonald et al. - 2023 - DIMPLE deep insertion, deletion, and missense mut.pdf:application/pdf},
+}
+
+@article{jones_structural_2020,
+ title = {Structural and functional characterization of {G} protein–coupled receptors with deep mutational scanning},
+ volume = {9},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/54895},
+ doi = {10.7554/eLife.54895},
+ abstract = {The {\textgreater}800 human G protein–coupled receptors (GPCRs) are responsible for transducing diverse chemical stimuli to alter cell state- and are the largest class of drug targets. Their myriad structural conformations and various modes of signaling make it challenging to understand their structure and function. Here, we developed a platform to characterize large libraries of GPCR variants in human cell lines with a barcoded transcriptional reporter of G protein signal transduction. We tested 7800 of 7828 possible single amino acid substitutions to the beta-2 adrenergic receptor (Beta
+ 2
+ AR) at four concentrations of the agonist isoproterenol. We identified residues specifically important for Beta
+ 2
+ AR signaling, mutations in the human population that are potentially loss of function, and residues that modulate basal activity. Using unsupervised learning, we identify residues critical for signaling, including all major structural motifs and molecular interfaces. We also find a previously uncharacterized structural latch spanning the first two extracellular loops that is highly conserved across Class A GPCRs and is conformationally rigid in both the inactive and active states of the receptor. More broadly, by linking deep mutational scanning with engineered transcriptional reporters, we establish a generalizable method for exploring pharmacogenomics, structure and function across broad classes of drug receptors.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Jones, Eric M and Lubock, Nathan B and Venkatakrishnan, Aj and Wang, Jeffrey and Tseng, Alex M and Paggi, Joseph M and Latorraca, Naomi R and Cancilla, Daniel and Satyadi, Megan and Davis, Jessica E and Babu, M Madan and Dror, Ron O and Kosuri, Sriram},
+ month = oct,
+ year = {2020},
+ pages = {e54895},
+ file = {Full Text:/Users/admin/Zotero/storage/T2SD9K3D/Jones et al. - 2020 - Structural and functional characterization of G pr.pdf:application/pdf;Submitted Version:/Users/admin/Zotero/storage/WYWCXHH9/Jones et al. - 2020 - Structural and functional characterization of G pr.pdf:application/pdf},
+}
+
+@article{chen_comprehensive_2020,
+ title = {Comprehensive exploration of the translocation, stability and substrate recognition requirements in {VIM}-2 lactamase},
+ volume = {9},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/56707},
+ doi = {10.7554/eLife.56707},
+ abstract = {Metallo-Beta-lactamases (MBLs) degrade a broad spectrum of Beta-lactam antibiotics, and are a major disseminating source for multidrug resistant bacteria. Despite many biochemical studies in diverse MBLs, molecular understanding of the roles of residues in the enzyme’s stability and function, and especially substrate specificity, is lacking. Here, we employ deep mutational scanning (DMS) to generate comprehensive single amino acid variant data on a major clinical MBL, VIM-2, by measuring the effect of thousands of VIM-2 mutants on the degradation of three representative classes of Beta-lactams (ampicillin, cefotaxime, and meropenem) and at two different temperatures (25°C and 37°C). We revealed residues responsible for expression and translocation, and mutations that increase resistance and/or alter substrate specificity. The distribution of specificity-altering mutations unveiled distinct molecular recognition of the three substrates. Moreover, these function-altering mutations are frequently observed among naturally occurring variants, suggesting that the enzymes have continuously evolved to become more potent resistance genes.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Chen, John Z and Fowler, Douglas M and Tokuriki, Nobuhiko},
+ month = jun,
+ year = {2020},
+ pages = {e56707},
+ file = {Full Text:/Users/admin/Zotero/storage/LBP8NZRP/Chen et al. - 2020 - Comprehensive exploration of the translocation, st.pdf:application/pdf},
+}
+
+@article{kennouche_deep_2019,
+ title = {Deep mutational scanning of the \textit{{Neisseria} meningitidis} major pilin reveals the importance of pilus tip‐mediated adhesion},
+ volume = {38},
+ issn = {0261-4189, 1460-2075},
+ url = {https://www.embopress.org/doi/10.15252/embj.2019102145},
+ doi = {10.15252/embj.2019102145},
+ language = {en},
+ number = {22},
+ urldate = {2023-06-14},
+ journal = {The EMBO Journal},
+ author = {Kennouche, Paul and Charles‐Orszag, Arthur and Nishiguchi, Daiki and Goussard, Sylvie and Imhaus, Anne‐Flore and Dupré, Mathieu and Chamot‐Rooke, Julia and Duménil, Guillaume},
+ month = nov,
+ year = {2019},
+ pages = {e102145},
+ file = {Full Text:/Users/admin/Zotero/storage/K9LIAMBH/Kennouche et al. - 2019 - Deep mutational scanning of the Neisseria menin.pdf:application/pdf;Submitted Version:/Users/admin/Zotero/storage/9587BW8A/Kennouche et al. - 2019 - Deep mutational scanning of the Neisseria menin.pdf:application/pdf},
+}
+
+@article{haddox_experimental_2016,
+ title = {Experimental {Estimation} of the {Effects} of {All} {Amino}-{Acid} {Mutations} to {HIV}’s {Envelope} {Protein} on {Viral} {Replication} in {Cell} {Culture}},
+ volume = {12},
+ issn = {1553-7374},
+ url = {https://dx.plos.org/10.1371/journal.ppat.1006114},
+ doi = {10.1371/journal.ppat.1006114},
+ language = {en},
+ number = {12},
+ urldate = {2023-06-14},
+ journal = {PLOS Pathogens},
+ author = {Haddox, Hugh K. and Dingens, Adam S. and Bloom, Jesse D.},
+ editor = {Swanstrom, Ronald},
+ month = dec,
+ year = {2016},
+ pages = {e1006114},
+ file = {Full Text:/Users/admin/Zotero/storage/MJ2VVEBN/Haddox et al. - 2016 - Experimental Estimation of the Effects of All Amin.pdf:application/pdf},
+}
+
+@article{roscoe_systematic_2014,
+ title = {Systematic {Exploration} of {Ubiquitin} {Sequence}, {E1} {Activation} {Efficiency}, and {Experimental} {Fitness} in {Yeast}},
+ volume = {426},
+ issn = {00222836},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0022283614002587},
+ doi = {10.1016/j.jmb.2014.05.019},
+ language = {en},
+ number = {15},
+ urldate = {2023-06-14},
+ journal = {Journal of Molecular Biology},
+ author = {Roscoe, Benjamin P. and Bolon, Daniel N.A.},
+ month = jul,
+ year = {2014},
+ pages = {2854--2870},
+ file = {Accepted Version:/Users/admin/Zotero/storage/ZZGGND7E/Roscoe and Bolon - 2014 - Systematic Exploration of Ubiquitin Sequence, E1 A.pdf:application/pdf},
+}
+
+@article{klesmith_comprehensive_2015,
+ title = {Comprehensive {Sequence}-{Flux} {Mapping} of a {Levoglucosan} {Utilization} {Pathway} in \textit{{E}. coli}},
+ volume = {4},
+ issn = {2161-5063, 2161-5063},
+ url = {https://pubs.acs.org/doi/10.1021/acssynbio.5b00131},
+ doi = {10.1021/acssynbio.5b00131},
+ language = {en},
+ number = {11},
+ urldate = {2023-06-14},
+ journal = {ACS Synthetic Biology},
+ author = {Klesmith, Justin R. and Bacik, John-Paul and Michalczyk, Ryszard and Whitehead, Timothy A.},
+ month = nov,
+ year = {2015},
+ pages = {1235--1243},
+ file = {Submitted Version:/Users/admin/Zotero/storage/EPYWRHC2/Klesmith et al. - 2015 - Comprehensive Sequence-Flux Mapping of a Levogluco.pdf:application/pdf},
+}
+
+@article{gonzalez_somermeyer_heterogeneity_2022,
+ title = {Heterogeneity of the {GFP} fitness landscape and data-driven protein design},
+ volume = {11},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/75842},
+ doi = {10.7554/eLife.75842},
+ abstract = {Studies of protein fitness landscapes reveal biophysical constraints guiding protein evolution and empower prediction of functional proteins. However, generalisation of these findings is limited due to scarceness of systematic data on fitness landscapes of proteins with a defined evolutionary relationship. We characterized the fitness peaks of four orthologous fluorescent proteins with a broad range of sequence divergence. While two of the four studied fitness peaks were sharp, the other two were considerably flatter, being almost entirely free of epistatic interactions. Mutationally robust proteins, characterized by a flat fitness peak, were not optimal templates for machine-learning-driven protein design – instead, predictions were more accurate for fragile proteins with epistatic landscapes. Our work paves insights for practical application of fitness landscape heterogeneity in protein engineering.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Gonzalez Somermeyer, Louisa and Fleiss, Aubin and Mishin, Alexander S and Bozhanova, Nina G and Igolkina, Anna A and Meiler, Jens and Alaball Pujol, Maria-Elisenda and Putintseva, Ekaterina V and Sarkisyan, Karen S and Kondrashov, Fyodor A},
+ month = may,
+ year = {2022},
+ pages = {e75842},
+ file = {Full Text:/Users/admin/Zotero/storage/N7EZUGF7/Gonzalez Somermeyer et al. - 2022 - Heterogeneity of the GFP fitness landscape and dat.pdf:application/pdf},
+}
+
+@article{doud_accurate_2016,
+ title = {Accurate {Measurement} of the {Effects} of {All} {Amino}-{Acid} {Mutations} on {Influenza} {Hemagglutinin}},
+ volume = {8},
+ issn = {1999-4915},
+ url = {http://www.mdpi.com/1999-4915/8/6/155},
+ doi = {10.3390/v8060155},
+ language = {en},
+ number = {6},
+ urldate = {2023-06-14},
+ journal = {Viruses},
+ author = {Doud, Michael and Bloom, Jesse},
+ month = jun,
+ year = {2016},
+ pages = {155},
+ file = {Full Text:/Users/admin/Zotero/storage/Z3RRFCKV/Doud and Bloom - 2016 - Accurate Measurement of the Effects of All Amino-A.pdf:application/pdf},
+}
+
+@article{soh_comprehensive_2019,
+ title = {Comprehensive mapping of adaptation of the avian influenza polymerase protein {PB2} to humans},
+ volume = {8},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/45079},
+ doi = {10.7554/eLife.45079},
+ abstract = {Viruses like influenza are infamous for their ability to adapt to new hosts. Retrospective studies of natural zoonoses and passaging in the lab have identified a modest number of host-adaptive mutations. However, it is unclear if these mutations represent all ways that influenza can adapt to a new host. Here we take a prospective approach to this question by completely mapping amino-acid mutations to the avian influenza virus polymerase protein PB2 that enhance growth in human cells. We identify numerous previously uncharacterized human-adaptive mutations. These mutations cluster on PB2’s surface, highlighting potential interfaces with host factors. Some previously uncharacterized adaptive mutations occur in avian-to-human transmission of H7N9 influenza, showing their importance for natural virus evolution. But other adaptive mutations do not occur in nature because they are inaccessible via single-nucleotide mutations. Overall, our work shows how selection at key molecular surfaces combines with evolutionary accessibility to shape viral host adaptation.
+ ,
+ Viruses copy themselves by hijacking the cells of an infected host, but this comes with some limitations. Cells from different species have different molecular machinery and so viruses often have to specialize to a narrow group of species. This specialization consists largely of fine-tuning the way that viral proteins interact with host proteins.
+ For instance, in bird flu viruses, a protein known as PB2 does not interact well with the machinery in human cells. Because PB2 proteins form part of the viral polymerase (the structure that copies the viral genome), this prevents bird flu viruses from replicating efficiently in humans. Sometimes however, changes in the PB2 protein allow bird flu viruses to better replicate in humans, potentially leading to deadly flu pandemics.
+ To understand exactly how this happens, researchers have previously used two approaches: examining the changes that have happened in past flu viruses, and monitoring the evolution of bird flu viruses grown in human cells in the lab. However, these approaches can only look at a small number of the many possible genetic changes to the virus. This makes it hard to anticipate the new ways that flu might adapt to human cells in the future.
+ To overcome this problem, Soh et al. systematically created all of the single changes to the bird flu PB2, altering every element of the protein sequence one-by-one. They then tested which of the changes to PB2 helped the virus grow better in human cells. The modifications that made the viruses thrive were on the surface of the protein, suggesting that they might improve interaction with the cell machinery of the host. Some changes have been found in bird flu viruses that have recently jumped into humans in nature, although fortunately none of these viruses have yet spread widely to cause a pandemic.
+ Many factors affect the evolution of viruses, and their ability to infect new species. Understanding which changes in proteins help these microbes adapt to new hosts is an important element that scientists could consider to assess future risks of pandemics.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Soh, Yq Shirleen and Moncla, Louise H and Eguia, Rachel and Bedford, Trevor and Bloom, Jesse D},
+ month = apr,
+ year = {2019},
+ pages = {e45079},
+ file = {Full Text:/Users/admin/Zotero/storage/U4C4J7MA/Soh et al. - 2019 - Comprehensive mapping of adaptation of the avian i.pdf:application/pdf},
+}
+
+@article{seuma_genetic_2021,
+ title = {The genetic landscape for amyloid beta fibril nucleation accurately discriminates familial {Alzheimer}’s disease mutations},
+ volume = {10},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/63364},
+ doi = {10.7554/eLife.63364},
+ abstract = {Plaques of the amyloid beta (Aß) peptide are a pathological hallmark of Alzheimer’s disease (AD), the most common form of dementia. Mutations in Aß also cause familial forms of AD (fAD). Here, we use deep mutational scanning to quantify the effects of {\textgreater}14,000 mutations on the aggregation of Aß. The resulting genetic landscape reveals mechanistic insights into fibril nucleation, including the importance of charge and gatekeeper residues in the disordered region outside of the amyloid core in preventing nucleation. Strikingly, unlike computational predictors and previous measurements, the empirical nucleation scores accurately identify all known dominant fAD mutations in Aß, genetically validating that the mechanism of nucleation in a cell-based assay is likely to be very similar to the mechanism that causes the human disease. These results provide the first comprehensive atlas of how mutations alter the formation of any amyloid fibril and a resource for the interpretation of genetic variation in Aß.
+ ,
+ Alzheimer’s disease is the most common form of dementia, affecting more than 50 million people worldwide. Despite more than 400 clinical trials, there are still no effective drugs that can prevent or treat the disease. A common target in Alzheimer’s disease trials is a small protein called amyloid beta. Amyloid beta proteins are ‘sticky’ molecules. In the brains of people with Alzheimer’s disease, they join to form first small aggregates and then long chains called fibrils, a process which is toxic to neurons.
+ Specific mutations in the gene for amyloid beta are known to cause rare, aggressive forms of Alzheimer’s disease that typically affect people in their fifties or sixties. But these are not the only mutations that can occur in amyloid beta. In principle, any part of the protein could undergo mutation. And given the size of the human population, it is likely that each of these mutations exists in someone alive today.
+ Seuma et al. reasoned that studying these mutations could help us understand the process by which amyloid beta forms new aggregates. Using an approach called deep mutational scanning, Seuma et al. mutated each point in the protein, one at a time. This produced more than 14,000 different versions of amyloid beta. Seuma et al. then measured how quickly these mutants were able to form aggregates by introducing them into yeast cells.
+ All the mutations known to cause early-onset Alzheimer’s disease accelerated amyloid beta aggregation in the yeast. But the results also revealed previously unknown properties that control how fast aggregation occurs. In addition, they highlighted a number of positions in the amyloid beta sequence that act as ‘gatekeepers’. In healthy brains, these gatekeepers prevent amyloid beta proteins from sticking together. When mutated, they drive the protein to form aggregates.
+ This comprehensive dataset will help researchers understand how proteins form toxic aggregates, which could in turn help them find ways to prevent this from happening. By providing an ‘atlas’ of all possible amyloid beta mutations, the dataset will also help clinicians interpret any new mutations they encounter in patients. By showing whether or not a mutation speeds up aggregation, the atlas will help clinicians predict whether that mutation increases the risk of Alzheimer’s disease.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Seuma, Mireia and Faure, Andre J and Badia, Marta and Lehner, Ben and Bolognesi, Benedetta},
+ month = feb,
+ year = {2021},
+ pages = {e63364},
+ file = {Full Text:/Users/admin/Zotero/storage/8WFU6TK4/Seuma et al. - 2021 - The genetic landscape for amyloid beta fibril nucl.pdf:application/pdf},
+}
+
+@article{dandage_differential_2018,
+ title = {Differential strengths of molecular determinants guide environment specific mutational fates},
+ volume = {14},
+ issn = {1553-7404},
+ url = {https://dx.plos.org/10.1371/journal.pgen.1007419},
+ doi = {10.1371/journal.pgen.1007419},
+ language = {en},
+ number = {5},
+ urldate = {2023-06-14},
+ journal = {PLOS Genetics},
+ author = {Dandage, Rohan and Pandey, Rajesh and Jayaraj, Gopal and Rai, Manish and Berger, David and Chakraborty, Kausik},
+ editor = {Matic, Ivan},
+ month = may,
+ year = {2018},
+ pages = {e1007419},
+ file = {Full Text:/Users/admin/Zotero/storage/GELATYV7/Dandage et al. - 2018 - Differential strengths of molecular determinants g.pdf:application/pdf},
+}
+
+@article{firnberg_comprehensive_2014,
+ title = {A {Comprehensive}, {High}-{Resolution} {Map} of a {Gene}’s {Fitness} {Landscape}},
+ volume = {31},
+ issn = {1537-1719, 0737-4038},
+ url = {https://academic.oup.com/mbe/article-lookup/doi/10.1093/molbev/msu081},
+ doi = {10.1093/molbev/msu081},
+ language = {en},
+ number = {6},
+ urldate = {2023-06-14},
+ journal = {Molecular Biology and Evolution},
+ author = {Firnberg, Elad and Labonte, Jason W. and Gray, Jeffrey J. and Ostermeier, Marc},
+ month = jun,
+ year = {2014},
+ pages = {1581--1592},
+ file = {Full Text:/Users/admin/Zotero/storage/9TTFDAL5/Firnberg et al. - 2014 - A Comprehensive, High-Resolution Map of a Gene’s F.pdf:application/pdf},
+}
+
+@article{adkar_protein_2012,
+ title = {Protein {Model} {Discrimination} {Using} {Mutational} {Sensitivity} {Derived} from {Deep} {Sequencing}},
+ volume = {20},
+ issn = {09692126},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0969212612000068},
+ doi = {10.1016/j.str.2011.11.021},
+ language = {en},
+ number = {2},
+ urldate = {2023-06-14},
+ journal = {Structure},
+ author = {Adkar, Bharat V. and Tripathi, Arti and Sahoo, Anusmita and Bajaj, Kanika and Goswami, Devrishi and Chakrabarti, Purbani and Swarnkar, Mohit K. and Gokhale, Rajesh S. and Varadarajan, Raghavan},
+ month = feb,
+ year = {2012},
+ pages = {371--381},
+ file = {Full Text:/Users/admin/Zotero/storage/TIV665XI/Adkar et al. - 2012 - Protein Model Discrimination Using Mutational Sens.pdf:application/pdf},
+}
+
+@article{giacomelli_mutational_2018,
+ title = {Mutational processes shape the landscape of {TP53} mutations in human cancer},
+ volume = {50},
+ issn = {1061-4036, 1546-1718},
+ url = {https://www.nature.com/articles/s41588-018-0204-y},
+ doi = {10.1038/s41588-018-0204-y},
+ language = {en},
+ number = {10},
+ urldate = {2023-06-14},
+ journal = {Nature Genetics},
+ author = {Giacomelli, Andrew O. and Yang, Xiaoping and Lintner, Robert E. and McFarland, James M. and Duby, Marc and Kim, Jaegil and Howard, Thomas P. and Takeda, David Y. and Ly, Seav Huong and Kim, Eejung and Gannon, Hugh S. and Hurhula, Brian and Sharpe, Ted and Goodale, Amy and Fritchman, Briana and Steelman, Scott and Vazquez, Francisca and Tsherniak, Aviad and Aguirre, Andrew J. and Doench, John G. and Piccioni, Federica and Roberts, Charles W. M. and Meyerson, Matthew and Getz, Gad and Johannessen, Cory M. and Root, David E. and Hahn, William C.},
+ month = oct,
+ year = {2018},
+ pages = {1381--1387},
+ file = {Accepted Version:/Users/admin/Zotero/storage/EFU93FB6/Giacomelli et al. - 2018 - Mutational processes shape the landscape of TP53 m.pdf:application/pdf},
+}
+
+@article{suiter_massively_2020,
+ title = {Massively parallel variant characterization identifies \textit{{NUDT15}} alleles associated with thiopurine toxicity},
+ volume = {117},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1915680117},
+ doi = {10.1073/pnas.1915680117},
+ abstract = {Significance
+
+ Pharmacogenetics is a prototype of genomics-guided precision medicine. While there is a rapid expansion of novel pharmacogenetic variants discovered by genome sequencing, the lack of variant interpretation in a scalable fashion is a formidable barrier in this field.
+ NUDT15
+ polymorphism is a major genetic cause for hematopoietic toxicity during thiopurine therapy. Motivated by the need to understand
+ NUDT15
+ variant effects for clinical actions, we developed a massively parallel assay to preemptively characterize 91.8\% of all possible missense variants in
+ NUDT15
+ . Our function-based variant classification accurately predicted thiopurine toxicity risk alleles in patients. These results vastly improved the ability to implement genotype-guided thiopurine therapy and illustrated the value and potential of a high-throughput variant effect screen in general.
+
+ ,
+
+ As a prototype of genomics-guided precision medicine, individualized thiopurine dosing based on pharmacogenetics is a highly effective way to mitigate hematopoietic toxicity of this class of drugs. Recently,
+ NUDT15
+ deficiency was identified as a genetic cause of thiopurine toxicity, and
+ NUDT15
+ -informed preemptive dose reduction was quickly adopted in clinical settings. To exhaustively identify pharmacogenetic variants in this gene, we developed massively parallel NUDT15 function assays to determine the variants’ effect on protein abundance and thiopurine cytotoxicity. Of the 3,097 possible missense variants, we characterized the abundance of 2,922 variants and found 54 hotspot residues at which variants resulted in complete loss of protein stability. Analyzing 2,935 variants in the thiopurine cytotoxicity-based assay, we identified 17 additional residues where variants altered NUDT15 activity without affecting protein stability. We identified structural elements key to NUDT15 stability and/or catalytical activity with single amino acid resolution. Functional effects for
+ NUDT15
+ variants accurately predicted toxicity risk alleles in patients treated with thiopurines with far superior sensitivity and specificity compared to bioinformatic prediction algorithms. In conclusion, our massively parallel variant function assays identified 1,152 deleterious
+ NUDT15
+ variants, providing a comprehensive reference of variant function and vastly improving the ability to implement pharmacogenetics-guided thiopurine treatment individualization.},
+ language = {en},
+ number = {10},
+ urldate = {2023-06-14},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Suiter, Chase C. and Moriyama, Takaya and Matreyek, Kenneth A. and Yang, Wentao and Scaletti, Emma Rose and Nishii, Rina and Yang, Wenjian and Hoshitsuki, Keito and Singh, Minu and Trehan, Amita and Parish, Chris and Smith, Colton and Li, Lie and Bhojwani, Deepa and Yuen, Liz Y. P. and Li, Chi-kong and Li, Chak-ho and Yang, Yung-li and Walker, Gareth J. and Goodhand, James R. and Kennedy, Nicholas A. and Klussmann, Federico Antillon and Bhatia, Smita and Relling, Mary V. and Kato, Motohiro and Hori, Hiroki and Bhatia, Prateek and Ahmad, Tariq and Yeoh, Allen E. J. and Stenmark, Pål and Fowler, Douglas M. and Yang, Jun J.},
+ month = mar,
+ year = {2020},
+ pages = {5394--5401},
+ file = {Full Text:/Users/admin/Zotero/storage/CALTVXTU/Suiter et al. - 2020 - Massively parallel variant characterization identi.pdf:application/pdf},
+}
+
+@article{hobbs_saturation_2022,
+ title = {Saturation mutagenesis of a predicted ancestral {Syk}‐family kinase},
+ volume = {31},
+ issn = {0961-8368, 1469-896X},
+ url = {https://onlinelibrary.wiley.com/doi/10.1002/pro.4411},
+ doi = {10.1002/pro.4411},
+ language = {en},
+ number = {10},
+ urldate = {2023-06-14},
+ journal = {Protein Science},
+ author = {Hobbs, Helen T. and Shah, Neel H. and Shoemaker, Sophie R. and Amacher, Jeanine F. and Marqusee, Susan and Kuriyan, John},
+ month = oct,
+ year = {2022},
+ file = {Full Text:/Users/admin/Zotero/storage/5TCADGH8/Hobbs et al. - 2022 - Saturation mutagenesis of a predicted ancestral Sy.pdf:application/pdf},
+}
+
+@techreport{gersing_comprehensive_2022,
+ type = {preprint},
+ title = {A comprehensive map of human glucokinase variant activity},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2022.05.04.490571},
+ abstract = {Abstract
+
+ Glucokinase (GCK) regulates insulin secretion to maintain appropriate blood glucose levels. Sequence variants can alter GCK activity to cause hyperinsulinemic hypoglycemia (HH) or hyperglycemia associated with GCK-maturity-onset diabetes of the young (GCK-MODY), collectively affecting up to 10 million people worldwide. Patients with GCK-MODY are frequently misdiagnosed and treated unnecessarily. Genetic testing can prevent this but is hampered by the challenge of interpreting novel missense variants. Here we exploited a multiplexed yeast complementation assay to measure both hyper- and hypoactive GCK variation, capturing 97\% of all possible missense and nonsense variants. Activity scores correlated with
+ in vitro
+ catalytic efficiency, fasting glucose levels in carriers of
+ GCK
+ variants and with evolutionary conservation. Hypoactive variants were concentrated at buried positions, near the active site, and at a region of known importance for GCK conformational dynamics. Some hyperactive variants shifted the conformational equilibrium towards the active state through a relative destabilization of the inactive conformation. Our comprehensive assessment of GCK variant activity promises to facilitate variant interpretation and diagnosis, expand our mechanistic understanding of hyperactive variants, and inform development of therapeutics targeting GCK.},
+ language = {en},
+ urldate = {2023-06-14},
+ institution = {Genetics},
+ author = {Gersing, Sarah and Cagiada, Matteo and Gebbia, Marinella and Gjesing, Anette P. and Coté, Atina G. and Seesankar, Gireesh and Li, Roujia and Tabet, Daniel and Stein, Amelie and Gloyn, Anna L. and Hansen, Torben and Roth, Frederick P. and Lindorff-Larsen, Kresten and Hartmann-Petersen, Rasmus},
+ month = may,
+ year = {2022},
+ doi = {10.1101/2022.05.04.490571},
+ file = {Submitted Version:/Users/admin/Zotero/storage/U2HI9MX9/Gersing et al. - 2022 - A comprehensive map of human glucokinase variant a.pdf:application/pdf},
+}
+
+@article{staller_high-throughput_2018,
+ title = {A {High}-{Throughput} {Mutational} {Scan} of an {Intrinsically} {Disordered} {Acidic} {Transcriptional} {Activation} {Domain}},
+ volume = {6},
+ issn = {24054712},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S2405471218300528},
+ doi = {10.1016/j.cels.2018.01.015},
+ language = {en},
+ number = {4},
+ urldate = {2023-06-14},
+ journal = {Cell Systems},
+ author = {Staller, Max V. and Holehouse, Alex S. and Swain-Lenz, Devjanee and Das, Rahul K. and Pappu, Rohit V. and Cohen, Barak A.},
+ month = apr,
+ year = {2018},
+ pages = {444--455.e6},
+ file = {Full Text:/Users/admin/Zotero/storage/WTXI6TPC/Staller et al. - 2018 - A High-Throughput Mutational Scan of an Intrinsica.pdf:application/pdf},
+}
+
+@article{bandaru_deconstruction_2017,
+ title = {Deconstruction of the {Ras} switching cycle through saturation mutagenesis},
+ volume = {6},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/27810},
+ doi = {10.7554/eLife.27810},
+ abstract = {Ras proteins are highly conserved signaling molecules that exhibit regulated, nucleotide-dependent switching between active and inactive states. The high conservation of Ras requires mechanistic explanation, especially given the general mutational tolerance of proteins. Here, we use deep mutational scanning, biochemical analysis and molecular simulations to understand constraints on Ras sequence. Ras exhibits global sensitivity to mutation when regulated by a GTPase activating protein and a nucleotide exchange factor. Removing the regulators shifts the distribution of mutational effects to be largely neutral, and reveals hotspots of activating mutations in residues that restrain Ras dynamics and promote the inactive state. Evolutionary analysis, combined with structural and mutational data, argue that Ras has co-evolved with its regulators in the vertebrate lineage. Overall, our results show that sequence conservation in Ras depends strongly on the biochemical network in which it operates, providing a framework for understanding the origin of global selection pressures on proteins.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Bandaru, Pradeep and Shah, Neel H and Bhattacharyya, Moitrayee and Barton, John P and Kondo, Yasushi and Cofsky, Joshua C and Gee, Christine L and Chakraborty, Arup K and Kortemme, Tanja and Ranganathan, Rama and Kuriyan, John},
+ month = jul,
+ year = {2017},
+ pages = {e27810},
+ file = {Full Text:/Users/admin/Zotero/storage/GX5ZXSNL/Bandaru et al. - 2017 - Deconstruction of the Ras switching cycle through .pdf:application/pdf},
+}
+
+@article{bridgford_novel_2020,
+ title = {Novel drivers and modifiers of {MPL}-dependent oncogenic transformation identified by deep mutational scanning},
+ volume = {135},
+ issn = {0006-4971, 1528-0020},
+ url = {https://ashpublications.org/blood/article/135/4/287/381157/Novel-drivers-and-modifiers-of-MPLdependent},
+ doi = {10.1182/blood.2019002561},
+ abstract = {Abstract
+ The single transmembrane domain (TMD) of the human thrombopoietin receptor (TpoR/myeloproliferative leukemia [MPL] protein), encoded by exon 10 of the MPL gene, is a hotspot for somatic mutations associated with myeloproliferative neoplasms (MPNs). Approximately 6\% and 14\% of JAK2 V617F− essential thrombocythemia and primary myelofibrosis patients, respectively, have “canonical” MPL exon 10 driver mutations W515L/K/R/A or S505N, which generate constitutively active receptors and consequent loss of Tpo dependence. Other “noncanonical” MPL exon 10 mutations have also been identified in patients, both alone and in combination with canonical mutations, but, in almost all cases, their functional consequences and relevance to disease are unknown. Here, we used a deep mutational scanning approach to evaluate all possible single amino acid substitutions in the human TpoR TMD for their ability to confer cytokine-independent growth in Ba/F3 cells. We identified all currently recognized driver mutations and 7 novel mutations that cause constitutive TpoR activation, and a much larger number of second-site mutations that enhance S505N-driven activation. We found examples of both of these categories in published and previously unpublished MPL exon 10 sequencing data from MPN patients, demonstrating that some, if not all, of the new mutations reported here represent likely drivers or modifiers of myeloproliferative disease.},
+ language = {en},
+ number = {4},
+ urldate = {2023-06-14},
+ journal = {Blood},
+ author = {Bridgford, Jessica L. and Lee, Su Min and Lee, Christine M. M. and Guglielmelli, Paola and Rumi, Elisa and Pietra, Daniela and Wilcox, Stephen and Chhabra, Yash and Rubin, Alan F. and Cazzola, Mario and Vannucchi, Alessandro M. and Brooks, Andrew J. and Call, Matthew E. and Call, Melissa J.},
+ month = jan,
+ year = {2020},
+ pages = {287--292},
+ file = {Full Text:/Users/admin/Zotero/storage/EAT9FPV2/Bridgford et al. - 2020 - Novel drivers and modifiers of MPL-dependent oncog.pdf:application/pdf},
+}
+
+@article{spencer_deep_2017,
+ title = {Deep mutational scanning of {S}. pyogenes {Cas9} reveals important functional domains},
+ volume = {7},
+ issn = {2045-2322},
+ url = {https://www.nature.com/articles/s41598-017-17081-y},
+ doi = {10.1038/s41598-017-17081-y},
+ abstract = {Abstract
+
+ RNA-guided endonucleases (RGENs) have invigorated the field of site-specific nucleases. The success of
+ Streptococcus pyogenes
+ Cas9 (SpCas9) has led to the discovery of several other CRISPR-associated RGENs. As more RGENs become available, it will be necessary to refine their activity before they can be translated into the clinic. With this in mind, we sought to demonstrate how deep mutational scanning (DMS) could provide details about important functional regions in SpCas9 and speed engineering efforts. Consequently, we developed a nuclease screening platform which could distinguish active Cas9 mutants. We screened a library of 1.9 × 10
+ 7
+ with over 8500 possible non-synonymous mutations and inferred the effects of each mutation using DMS. We demonstrate that the RuvC and HNH domains are the least tolerant regions to mutation. In contrast, the Rec2 and PI domains tolerate mutation better than other regions. The mutation information defined in this work provides a foundation for further SpCas9 engineering. Together, our results demonstrate how DMS can be a powerful tool to uncover features important to RGEN function. Application of this approach to emerging RGENs should enhance their engineering and optimization for therapeutic and other applications.},
+ language = {en},
+ number = {1},
+ urldate = {2023-06-14},
+ journal = {Scientific Reports},
+ author = {Spencer, Jeffrey M. and Zhang, Xiaoliu},
+ month = dec,
+ year = {2017},
+ pages = {16836},
+ file = {Full Text:/Users/admin/Zotero/storage/BWJHTMKA/Spencer and Zhang - 2017 - Deep mutational scanning of S. pyogenes Cas9 revea.pdf:application/pdf},
+}
+
+@article{mclaughlin_jr_spatial_2012,
+ title = {The spatial architecture of protein function and adaptation},
+ volume = {491},
+ issn = {0028-0836, 1476-4687},
+ url = {https://www.nature.com/articles/nature11500},
+ doi = {10.1038/nature11500},
+ language = {en},
+ number = {7422},
+ urldate = {2023-06-14},
+ journal = {Nature},
+ author = {McLaughlin, Jr., Richard N. and Poelwijk, Frank J. and Raman, Arjun and Gosal, Walraj S. and Ranganathan, Rama},
+ month = nov,
+ year = {2012},
+ pages = {138--142},
+ file = {Accepted Version:/Users/admin/Zotero/storage/8SK84J8S/McLaughlin Jr et al. - 2012 - The spatial architecture of protein function and a.pdf:application/pdf},
+}
+
+@article{doud_site-specific_2015,
+ title = {Site-{Specific} {Amino} {Acid} {Preferences} {Are} {Mostly} {Conserved} in {Two} {Closely} {Related} {Protein} {Homologs}},
+ volume = {32},
+ issn = {0737-4038, 1537-1719},
+ url = {https://academic.oup.com/mbe/article-lookup/doi/10.1093/molbev/msv167},
+ doi = {10.1093/molbev/msv167},
+ language = {en},
+ number = {11},
+ urldate = {2023-06-14},
+ journal = {Molecular Biology and Evolution},
+ author = {Doud, Michael B. and Ashenberg, Orr and Bloom, Jesse D.},
+ month = nov,
+ year = {2015},
+ pages = {2944--2960},
+ file = {Full Text:/Users/admin/Zotero/storage/KVBHH6S5/Doud et al. - 2015 - Site-Specific Amino Acid Preferences Are Mostly Co.pdf:application/pdf},
+}
+
+@article{mattenberger_globally_2021,
+ title = {Globally defining the effects of mutations in a picornavirus capsid},
+ volume = {10},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/64256},
+ doi = {10.7554/eLife.64256},
+ abstract = {The capsids of non-enveloped viruses are highly multimeric and multifunctional protein assemblies that play key roles in viral biology and pathogenesis. Despite their importance, a comprehensive understanding of how mutations affect viral fitness across different structural and functional attributes of the capsid is lacking. To address this limitation, we globally define the effects of mutations across the capsid of a human picornavirus. Using this resource, we identify structural and sequence determinants that accurately predict mutational fitness effects, refine evolutionary analyses, and define the sequence specificity of key capsid-encoded motifs. Furthermore, capitalizing on the derived sequence requirements for capsid-encoded protease cleavage sites, we implement a bioinformatic approach for identifying novel host proteins targeted by viral proteases. Our findings represent the most comprehensive investigation of mutational fitness effects in a picornavirus capsid to date and illuminate important aspects of viral biology, evolution, and host interactions.
+ ,
+ A virus is made up of genetic material that is encased with a protective protein coat called the capsid. The capsid also helps the virus to infect host cells by binding to the host receptor proteins and releasing its genetic material. Inside the cell, the virus hitchhikes the infected cell’s machinery to grow or replicate its own genetic material.
+ Viral capsids are the main target of the host’s defence system, and therefore, continuously change in an attempt to escape the immune system by introducing alterations (known as mutations) into the genes encoding viral capsid proteins. Mutations occur randomly, and so while some changes to the viral capsid might confer an advantage, others may have no effect at all, or even weaken the virus.
+ To better understand the effect of capsid mutations on the virus’ ability to infect host cells, Mattenberger et al. studied the Coxsackievirus B3, which is linked to heart problems and acute heart failure in humans. The researchers analysed around 90\% of possible amino acid mutations (over 14,800 mutations) and correlated each mutation to how it influenced the virus’ ability to replicate in human cells grown in the laboratory.
+ Based on these results, Mattenberger et al. developed a computer model to predict how a particular mutation might affect the virus. The analysis also identified specific amino acid sequences of capsid proteins that are essential for certain tasks, such as building the capsid. It also included an analysis of sequences in the capsid that allow it to be recognized by another viral protein, which cuts the capsid proteins into the right size from a larger precursor. By looking for similar sequences in human genes, the researchers identified several ones that the virus may attack and inactivate to support its own replication.
+ These findings may help identify potential drug targets to develop new antiviral therapies. For example, proteins of the capsid that are less likely to mutate will provide a better target as they lower the possibility of the virus to become resistant to the treatment. They also highlight new proteins in human cells that could potentially block the virus in cells.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Mattenberger, Florian and Latorre, Victor and Tirosh, Omer and Stern, Adi and Geller, Ron},
+ month = jan,
+ year = {2021},
+ pages = {e64256},
+ file = {Full Text:/Users/admin/Zotero/storage/Z7NP9QNK/Mattenberger et al. - 2021 - Globally defining the effects of mutations in a pi.pdf:application/pdf},
+}
+
+@article{matreyek_multiplex_2018,
+ title = {Multiplex assessment of protein variant abundance by massively parallel sequencing},
+ volume = {50},
+ issn = {1061-4036, 1546-1718},
+ url = {https://www.nature.com/articles/s41588-018-0122-z},
+ doi = {10.1038/s41588-018-0122-z},
+ language = {en},
+ number = {6},
+ urldate = {2023-06-14},
+ journal = {Nature Genetics},
+ author = {Matreyek, Kenneth A. and Starita, Lea M. and Stephany, Jason J. and Martin, Beth and Chiasson, Melissa A. and Gray, Vanessa E. and Kircher, Martin and Khechaduri, Arineh and Dines, Jennifer N. and Hause, Ronald J. and Bhatia, Smita and Evans, William E. and Relling, Mary V. and Yang, Wenjian and Shendure, Jay and Fowler, Douglas M.},
+ month = jun,
+ year = {2018},
+ pages = {874--882},
+ file = {Accepted Version:/Users/admin/Zotero/storage/IRF7FEAI/Matreyek et al. - 2018 - Multiplex assessment of protein variant abundance .pdf:application/pdf},
+}
+
+@article{flynn_comprehensive_2020,
+ title = {Comprehensive fitness maps of {Hsp90} show widespread environmental dependence},
+ volume = {9},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/53810},
+ doi = {10.7554/eLife.53810},
+ abstract = {Gene-environment interactions have long been theorized to influence molecular evolution. However, the environmental dependence of most mutations remains unknown. Using deep mutational scanning, we engineered yeast with all 44,604 single codon changes encoding 14,160 amino acid variants in Hsp90 and quantified growth effects under standard conditions and under five stress conditions. To our knowledge, these are the largest determined comprehensive fitness maps of point mutants. The growth of many variants differed between conditions, indicating that environment can have a large impact on Hsp90 evolution. Multiple variants provided growth advantages under individual conditions; however, these variants tended to exhibit growth defects in other environments. The diversity of Hsp90 sequences observed in extant eukaryotes preferentially contains variants that supported robust growth under all tested conditions. Rather than favoring substitutions in individual conditions, the long-term selective pressure on Hsp90 may have been that of fluctuating environments, leading to robustness under a variety of conditions.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Flynn, Julia M and Rossouw, Ammeret and Cote-Hammarlof, Pamela and Fragata, Inês and Mavor, David and Hollins, Carl and Bank, Claudia and Bolon, Daniel Na},
+ month = mar,
+ year = {2020},
+ pages = {e53810},
+ file = {Full Text:/Users/admin/Zotero/storage/CQWF9FY9/Flynn et al. - 2020 - Comprehensive fitness maps of Hsp90 show widesprea.pdf:application/pdf},
+}
+
+@article{ahler_combined_2019,
+ title = {A {Combined} {Approach} {Reveals} a {Regulatory} {Mechanism} {Coupling} {Src}’s {Kinase} {Activity}, {Localization}, and {Phosphotransferase}-{Independent} {Functions}},
+ volume = {74},
+ issn = {10972765},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S1097276519300930},
+ doi = {10.1016/j.molcel.2019.02.003},
+ language = {en},
+ number = {2},
+ urldate = {2023-06-14},
+ journal = {Molecular Cell},
+ author = {Ahler, Ethan and Register, Ames C. and Chakraborty, Sujata and Fang, Linglan and Dieter, Emily M. and Sitko, Katherine A. and Vidadala, Rama Subba Rao and Trevillian, Bridget M. and Golkowski, Martin and Gelman, Hannah and Stephany, Jason J. and Rubin, Alan F. and Merritt, Ethan A. and Fowler, Douglas M. and Maly, Dustin J.},
+ month = apr,
+ year = {2019},
+ pages = {393--408.e20},
+ file = {Full Text:/Users/admin/Zotero/storage/BAUMXZ8I/Ahler et al. - 2019 - A Combined Approach Reveals a Regulatory Mechanism.pdf:application/pdf},
+}
+
+@article{romero_dissecting_2015,
+ title = {Dissecting enzyme function with microfluidic-based deep mutational scanning},
+ volume = {112},
+ issn = {0027-8424, 1091-6490},
+ url = {https://pnas.org/doi/full/10.1073/pnas.1422285112},
+ doi = {10.1073/pnas.1422285112},
+ abstract = {Significance
+ As powerful biological catalysts, enzymes can solve challenging problems that range from the industrial production of chemicals to the treatment of human disease. The ability to design new enzymes with tailor-made chemical functions would have a far-reaching impact. However, this important capability has been limited by our cursory understanding of enzyme catalysis. Here, we report a method that uses unbiased empirical analysis to dissect the molecular basis of enzyme function. By comprehensively mapping how changes in an enzyme’s amino acid sequence affect its activity, we obtain a detailed view of the interactions that shape the enzyme function landscape. Large, unbiased analyses of enzyme function allow the discovery of new biochemical mechanisms that will improve our ability to engineer custom biocatalysts.
+ ,
+ Natural enzymes are incredibly proficient catalysts, but engineering them to have new or improved functions is challenging due to the complexity of how an enzyme’s sequence relates to its biochemical properties. Here, we present an ultrahigh-throughput method for mapping enzyme sequence–function relationships that combines droplet microfluidic screening with next-generation DNA sequencing. We apply our method to map the activity of millions of glycosidase sequence variants. Microfluidic-based deep mutational scanning provides a comprehensive and unbiased view of the enzyme function landscape. The mapping displays expected patterns of mutational tolerance and a strong correspondence to sequence variation within the enzyme family, but also reveals previously unreported sites that are crucial for glycosidase function. We modified the screening protocol to include a high-temperature incubation step, and the resulting thermotolerance landscape allowed the discovery of mutations that enhance enzyme thermostability. Droplet microfluidics provides a general platform for enzyme screening that, when combined with DNA-sequencing technologies, enables high-throughput mapping of enzyme sequence space.},
+ language = {en},
+ number = {23},
+ urldate = {2023-06-14},
+ journal = {Proceedings of the National Academy of Sciences},
+ author = {Romero, Philip A. and Tran, Tuan M. and Abate, Adam R.},
+ month = jun,
+ year = {2015},
+ pages = {7159--7164},
+ file = {Full Text:/Users/admin/Zotero/storage/7D996XKU/Romero et al. - 2015 - Dissecting enzyme function with microfluidic-based.pdf:application/pdf},
+}
+
+@article{faure_mapping_2022,
+ title = {Mapping the energetic and allosteric landscapes of protein binding domains},
+ volume = {604},
+ issn = {0028-0836, 1476-4687},
+ url = {https://www.nature.com/articles/s41586-022-04586-4},
+ doi = {10.1038/s41586-022-04586-4},
+ language = {en},
+ number = {7904},
+ urldate = {2023-06-14},
+ journal = {Nature},
+ author = {Faure, Andre J. and Domingo, Júlia and Schmiedel, Jörn M. and Hidalgo-Carcedo, Cristina and Diss, Guillaume and Lehner, Ben},
+ month = apr,
+ year = {2022},
+ pages = {175--183},
+ file = {Accepted Version:/Users/admin/Zotero/storage/H46C8P4R/Faure et al. - 2022 - Mapping the energetic and allosteric landscapes of.pdf:application/pdf},
+}
+
+@article{roscoe_analyses_2013,
+ title = {Analyses of the {Effects} of {All} {Ubiquitin} {Point} {Mutants} on {Yeast} {Growth} {Rate}},
+ volume = {425},
+ issn = {00222836},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0022283613000636},
+ doi = {10.1016/j.jmb.2013.01.032},
+ language = {en},
+ number = {8},
+ urldate = {2023-06-14},
+ journal = {Journal of Molecular Biology},
+ author = {Roscoe, Benjamin P. and Thayer, Kelly M. and Zeldovich, Konstantin B. and Fushman, David and Bolon, Daniel N.A.},
+ month = apr,
+ year = {2013},
+ pages = {1363--1377},
+ file = {Accepted Version:/Users/admin/Zotero/storage/Z4CIGXLZ/Roscoe et al. - 2013 - Analyses of the Effects of All Ubiquitin Point Mut.pdf:application/pdf},
+}
+
+@article{newberry_robust_2020,
+ title = {Robust {Sequence} {Determinants} of alpha-{Synuclein} {Toxicity} in {Yeast} {Implicate} {Membrane} {Binding}},
+ volume = {15},
+ issn = {1554-8929, 1554-8937},
+ url = {https://pubs.acs.org/doi/10.1021/acschembio.0c00339},
+ doi = {10.1021/acschembio.0c00339},
+ language = {en},
+ number = {8},
+ urldate = {2023-06-14},
+ journal = {ACS Chemical Biology},
+ author = {Newberry, Robert W. and Arhar, Taylor and Costello, Jean and Hartoularos, George C. and Maxwell, Alison M. and Naing, Zun Zar Chi and Pittman, Maureen and Reddy, Nishith R. and Schwarz, Daniel M. C. and Wassarman, Douglas R. and Wu, Taia S. and Barrero, Daniel and Caggiano, Christa and Catching, Adam and Cavazos, Taylor B. and Estes, Laurel S. and Faust, Bryan and Fink, Elissa A. and Goldman, Miriam A. and Gomez, Yessica K. and Gordon, M. Grace and Gunsalus, Laura M. and Hoppe, Nick and Jaime-Garza, Maru and Johnson, Matthew C. and Jones, Matthew G. and Kung, Andrew F. and Lopez, Kyle E. and Lumpe, Jared and Martyn, Calla and McCarthy, Elizabeth E. and Miller-Vedam, Lakshmi E. and Navarro, Erik J. and Palar, Aji and Pellegrino, Jenna and Saylor, Wren and Stephens, Christina A. and Strickland, Jack and Torosyan, Hayarpi and Wankowicz, Stephanie A. and Wong, Daniel R. and Wong, Garrett and Redding, Sy and Chow, Eric D. and DeGrado, William F. and Kampmann, Martin},
+ month = aug,
+ year = {2020},
+ pages = {2137--2153},
+ file = {Accepted Version:/Users/admin/Zotero/storage/NQ4BLNN6/Newberry et al. - 2020 - Robust Sequence Determinants of alpha-Synuclein Toxici.pdf:application/pdf;Submitted Version:/Users/admin/Zotero/storage/7IFR9453/Newberry et al. - 2020 - Robust Sequence Determinants of alpha-Synuclein Toxici.pdf:application/pdf},
+}
+
+@article{wu_adaptation_2016,
+ title = {Adaptation in protein fitness landscapes is facilitated by indirect paths},
+ volume = {5},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/16965},
+ doi = {10.7554/eLife.16965},
+ abstract = {The structure of fitness landscapes is critical for understanding adaptive protein evolution. Previous empirical studies on fitness landscapes were confined to either the neighborhood around the wild type sequence, involving mostly single and double mutants, or a combinatorially complete subgraph involving only two amino acids at each site. In reality, the dimensionality of protein sequence space is higher (20L) and there may be higher-order interactions among more than two sites. Here we experimentally characterized the fitness landscape of four sites in protein GB1, containing 204 = 160,000 variants. We found that while reciprocal sign epistasis blocked many direct paths of adaptation, such evolutionary traps could be circumvented by indirect paths through genotype space involving gain and subsequent loss of mutations. These indirect paths alleviate the constraint on adaptive protein evolution, suggesting that the heretofore neglected dimensions of sequence space may change our views on how proteins evolve.
+ ,
+ Proteins can evolve over time by changing their component parts, which are called amino acids. These changes usually happen one at a time and natural selection tends to preserve those changes that make the protein more efficient at its specific tasks, while discarding those that impair the protein’s activity. However the effect of each change depends on the protein as a whole, and so two changes that separately make the protein worse can make it much better if they occur together. This phenomenon is called epistasis and in some cases it can trap proteins in a sub-optimal form and prevent them from improving further.
+ Proteins are made from twenty different kinds of amino acid, and there are millions of different combinations of amino acids that could, in theory, make a protein of a given length. Studying protein evolution involves making variants of the same protein, each with just a few changes, and comparing how efficient, or “fit”, they are. Previous studies only measured the fitness of a few variants and showed that epistasis could block protein evolution by requiring the protein to lose some fitness before it could improve further. However, new techniques have now made it easier to study protein evolution by testing many more protein variants.
+ Wu, Dai et al. focused on four amino acids in part of a protein called GB1 and tested the efficiency of every possible combination of these four amino acids, a total of 160,000 (204) variants. Contrary to expectations, the results suggested that the protein could evolve quickly to maximise fitness despite there being epistasis between the four amino acids. Overcoming epistasis typically involved making a change to one amino acid that paved the way for further changes while avoiding the need to lose fitness. The original change could then be reversed once the epistasis was overcome. The complexity of this solution means it can only be seen by studying a large number of protein variants that represent many alternative sequences of protein changes.
+ Wu, Dai et al. conclude that proteins are able to achieve a higher level of fitness through evolution by exploring a large number of changes. There are many possible changes for each protein and it is this variety that, despite epistasis, allows proteins to become naturally optimised for the tasks that they perform. While the full complexity of protein evolution cannot be explored at the moment, as technology advances it will become possible to study more protein variants. Such advances would therefore hopefully allow researchers to discover even more about the natural mechanisms of protein evolution.},
+ language = {en},
+ urldate = {2023-06-14},
+ journal = {eLife},
+ author = {Wu, Nicholas C and Dai, Lei and Olson, C Anders and Lloyd-Smith, James O and Sun, Ren},
+ month = jul,
+ year = {2016},
+ pages = {e16965},
+ file = {Full Text:/Users/admin/Zotero/storage/GYE5JDBN/Wu et al. - 2016 - Adaptation in protein fitness landscapes is facili.pdf:application/pdf},
+}
+
+@article{starr_deep_2020,
+ title = {Deep {Mutational} {Scanning} of {SARS}-{CoV}-2 {Receptor} {Binding} {Domain} {Reveals} {Constraints} on {Folding} and {ACE2} {Binding}},
+ volume = {182},
+ issn = {00928674},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0092867420310035},
+ doi = {10.1016/j.cell.2020.08.012},
+ language = {en},
+ number = {5},
+ urldate = {2023-06-14},
+ journal = {Cell},
+ author = {Starr, Tyler N. and Greaney, Allison J. and Hilton, Sarah K. and Ellis, Daniel and Crawford, Katharine H.D. and Dingens, Adam S. and Navarro, Mary Jane and Bowen, John E. and Tortorici, M. Alejandra and Walls, Alexandra C. and King, Neil P. and Veesler, David and Bloom, Jesse D.},
+ month = sep,
+ year = {2020},
+ pages = {1295--1310.e20},
+ file = {Submitted Version:/Users/admin/Zotero/storage/TCMBL3SJ/Starr et al. - 2020 - Deep Mutational Scanning of SARS-CoV-2 Receptor Bi.pdf:application/pdf},
+}
+
+@techreport{brauer_comprehensive_2021,
+ type = {preprint},
+ title = {Comprehensive {Fitness} {Landscape} of a {Multi}-{Geometry} {Protein} {Capsid} {Informs} {Machine} {Learning} {Models} of {Assembly}},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2021.12.21.473721},
+ abstract = {Abstract
+
+ Virus-like particles (VLPs) are non-infections viral-derived nanomaterials poised for biotechnological applications due to their well-defined, modular self-assembling architecture. Although progress has been made in understanding the complex effects that mutations may have on VLPs, nuanced understanding of the influence particle mutability has on quaternary structure has yet to be achieved. Here, we generate and compare the apparent fitness landscapes of two capsid geometries (T=3 and T=1 icosahedral) of the bacteriophage MS2 VLP. We find significant shifts in mutability at the symmetry interfaces of the T=1 capsid when compared to the wildtype T=3 assembly. Furthermore, we use the generated landscapes to benchmark the performance of
+ in silico
+ mutational scanning tools in capturing the effect of missense mutation on complex particle assembly. Finding that predicted stability effects correlated relatively poorly with assembly phenotype, we used a combination of
+ de novo
+ features in tandem with
+ in silico
+ results to train machine learning algorithms for the classification of variant effects on assembly. Our findings not only reveal ways that assembly geometry affects the mutable landscape of a self-assembled particle, but also establish a template for the generation of predictive mutational models of self-assembled capsids using minimal empirical training data.},
+ language = {en},
+ urldate = {2023-06-14},
+ institution = {Bioengineering},
+ author = {Brauer, Daniel D. and Santiago, Celine B. and Merz, Zoe N. and McCarthy, Esther and Tullman-Ercek, Danielle and Francis, Matthew B.},
+ month = dec,
+ year = {2021},
+ doi = {10.1101/2021.12.21.473721},
+ file = {Submitted Version:/Users/admin/Zotero/storage/TRVZDGPY/Brauer et al. - 2021 - Comprehensive Fitness Landscape of a Multi-Geometr.pdf:application/pdf},
+}
+
+@article{linsky_novo_2020,
+ title = {De novo design of potent and resilient {hACE2} decoys to neutralize {SARS}-{CoV}-2},
+ volume = {370},
+ issn = {0036-8075, 1095-9203},
+ url = {https://www.science.org/doi/10.1126/science.abe0075},
+ doi = {10.1126/science.abe0075},
+ abstract = {A decoy to neutralize SARS-CoV-2
+
+ Many efforts to develop therapies against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are focused on the interaction between the spike protein, which decorates the surface of the virus, and its host receptor, human angiotensin-converting enzyme 2 (hACE2). Linsky
+ et al.
+ describe a de novo design strategy that allowed them to engineer decoy proteins that bind to the spike protein by replicating the hACE2 interface. The best decoy, CTC-445, bound with low nanomolar affinity, and selection of viral mutants that decrease binding is unlikely because this would also affect binding to hACE2. A bivalent version of CTC-445 bound even more tightly, neutralized SARS-CoV-2 infection of cells, and protected hamsters from a SARS-CoV-2 challenge. The stable decoy has the potential for respiratory therapeutic delivery.
+
+
+ Science
+ , this issue p.
+ 1208
+
+ ,
+ Designed de novo protein decoys neutralize SARS-CoV-2 in vitro and in vivo and are resilient to viral mutational escape.
+ ,
+ We developed a de novo protein design strategy to swiftly engineer decoys for neutralizing pathogens that exploit extracellular host proteins to infect the cell. Our pipeline allowed the design, validation, and optimization of de novo human angiotensin-converting enzyme 2 (hACE2) decoys to neutralize severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The best monovalent decoy, CTC-445.2, bound with low nanomolar affinity and high specificity to the receptor-binding domain (RBD) of the spike protein. Cryo–electron microscopy (cryo-EM) showed that the design is accurate and can simultaneously bind to all three RBDs of a single spike protein. Because the decoy replicates the spike protein target interface in hACE2, it is intrinsically resilient to viral mutational escape. A bivalent decoy, CTC-445.2d, showed {\textasciitilde}10-fold improvement in binding. CTC-445.2d potently neutralized SARS-CoV-2 infection of cells in vitro, and a single intranasal prophylactic dose of decoy protected Syrian hamsters from a subsequent lethal SARS-CoV-2 challenge.},
+ language = {en},
+ number = {6521},
+ urldate = {2023-06-14},
+ journal = {Science},
+ author = {Linsky, Thomas W. and Vergara, Renan and Codina, Nuria and Nelson, Jorgen W. and Walker, Matthew J. and Su, Wen and Barnes, Christopher O. and Hsiang, Tien-Ying and Esser-Nobis, Katharina and Yu, Kevin and Reneer, Z. Beau and Hou, Yixuan J. and Priya, Tanu and Mitsumoto, Masaya and Pong, Avery and Lau, Uland Y. and Mason, Marsha L. and Chen, Jerry and Chen, Alex and Berrocal, Tania and Peng, Hong and Clairmont, Nicole S. and Castellanos, Javier and Lin, Yu-Ru and Josephson-Day, Anna and Baric, Ralph S. and Fuller, Deborah H. and Walkey, Carl D. and Ross, Ted M. and Swanson, Ryan and Bjorkman, Pamela J. and Gale, Michael and Blancas-Mejia, Luis M. and Yen, Hui-Ling and Silva, Daniel-Adriano},
+ month = dec,
+ year = {2020},
+ pages = {1208--1214},
+ file = {Full Text:/Users/admin/Zotero/storage/RDJMYBGK/Linsky et al. - 2020 - De novo design of potent and resilient hACE2 decoy.pdf:application/pdf},
+}
+
+@techreport{stadelmann_deep_2021,
+ type = {preprint},
+ title = {A deep mutational scanning platform to characterize the fitness landscape of anti-{CRISPR} proteins},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2021.08.21.457204},
+ abstract = {ABSTRACT
+
+ Deep mutational scanning is a powerful method to explore the mutational fitness landscape of proteins. Its adaptation to anti-CRISPR proteins, which are natural CRISPR-Cas inhibitors and key players in the co-evolution of microbes and phages, would facilitate their in-depth characterization and optimization. Here, we developed a robust anti-CRISPR deep mutational scanning pipeline in
+ Escherichia coli
+ combining synthetic gene circuits based on CRISPR interference with flow cytometry-coupled sequencing and mathematical modeling. Using this pipeline, we created and characterized comprehensive single point mutation libraries for AcrIIA4 and AcrIIA5, two potent inhibitors of
+ Streptococcus pyogenes
+ Cas9. The resulting mutational fitness landscapes revealed that both Acrs possess a considerable mutational tolerance as well as an intrinsic redundancy with respect to Cas9 inhibitory features, suggesting evolutionary pressure towards high plasticity and robustness. Finally, to demonstrate that our pipeline can inform the optimization and fine-tuning of Acrs for genome editing applications, we cross-validated a subset of AcrIIA4 mutants via gene editing assays in mammalian cells and
+ in vitro
+ affinity measurements. Together, our work establishes deep mutational scanning as powerful method for anti-CRISPR protein characterization and optimization.},
+ language = {en},
+ urldate = {2023-06-14},
+ institution = {Synthetic Biology},
+ author = {Stadelmann, Tobias and Heid, Daniel and Jendrusch, Michael and Mathony, Jan and Rosset, Stéphane and Correia, Bruno E. and Niopek, Dominik},
+ month = aug,
+ year = {2021},
+ doi = {10.1101/2021.08.21.457204},
+ file = {Submitted Version:/Users/admin/Zotero/storage/WPHVK96E/Stadelmann et al. - 2021 - A deep mutational scanning platform to characteriz.pdf:application/pdf},
+}
+
+@article{koch_optimization_2022,
+ title = {Optimization of the antimicrobial peptide {Bac7} by deep mutational scanning},
+ volume = {20},
+ issn = {1741-7007},
+ url = {https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-022-01304-4},
+ doi = {10.1186/s12915-022-01304-4},
+ abstract = {Abstract
+
+ Background
+
+ Intracellularly active antimicrobial peptides are promising candidates for the development of antibiotics for human applications. However, drug development using peptides is challenging as, owing to their large size, an enormous sequence space is spanned. We built a high-throughput platform that incorporates rapid investigation of the sequence-activity relationship of peptides and enables rational optimization of their antimicrobial activity. The platform is based on deep mutational scanning of DNA-encoded peptides and employs highly parallelized bacterial self-screening coupled to next-generation sequencing as a readout for their antimicrobial activity. As a target, we used Bac7
+ 1-23
+ , a 23 amino acid residues long variant of bactenecin-7, a potent translational inhibitor and one of the best researched proline-rich antimicrobial peptides.
+
+
+
+ Results
+
+ Using the platform, we simultaneously determined the antimicrobial activity of {\textgreater}600,000 Bac7
+ 1-23
+ variants and explored their sequence-activity relationship. This dataset guided the design of a focused library of {\textasciitilde}160,000 variants and the identification of a lead candidate Bac7PS. Bac7PS showed high activity against multidrug-resistant clinical isolates of
+ E. coli,
+ and its activity was less dependent on SbmA, a transporter commonly used by proline-rich antimicrobial peptides to reach the cytosol and then inhibit translation. Furthermore, Bac7PS displayed strong ribosomal inhibition and low toxicity against eukaryotic cells and demonstrated good efficacy in a murine septicemia model induced by
+ E. coli
+ .
+
+
+
+ Conclusion
+ We demonstrated that the presented platform can be used to establish the sequence-activity relationship of antimicrobial peptides, and showed its usefulness for hit-to-lead identification and optimization of antimicrobial drug candidates.},
+ language = {en},
+ number = {1},
+ urldate = {2023-06-14},
+ journal = {BMC Biology},
+ author = {Koch, Philipp and Schmitt, Steven and Heynisch, Alexander and Gumpinger, Anja and Wüthrich, Irene and Gysin, Marina and Shcherbakov, Dimitri and Hobbie, Sven N. and Panke, Sven and Held, Martin},
+ month = dec,
+ year = {2022},
+ pages = {114},
+ file = {Full Text:/Users/admin/Zotero/storage/8QZH39IE/Koch et al. - 2022 - Optimization of the antimicrobial peptide Bac7 by .pdf:application/pdf},
+}
+
+@article{kotler_systematic_2018,
+ title = {A {Systematic} p53 {Mutation} {Library} {Links} {Differential} {Functional} {Impact} to {Cancer} {Mutation} {Pattern} and {Evolutionary} {Conservation}},
+ volume = {71},
+ issn = {10972765},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S1097276518304544},
+ doi = {10.1016/j.molcel.2018.06.012},
+ language = {en},
+ number = {1},
+ urldate = {2023-06-14},
+ journal = {Molecular Cell},
+ author = {Kotler, Eran and Shani, Odem and Goldfeld, Guy and Lotan-Pompan, Maya and Tarcic, Ohad and Gershoni, Anat and Hopf, Thomas A. and Marks, Debora S. and Oren, Moshe and Segal, Eran},
+ month = jul,
+ year = {2018},
+ pages = {178--190.e8},
+ file = {Full Text:/Users/admin/Zotero/storage/7M4DXV8M/Kotler et al. - 2018 - A Systematic p53 Mutation Library Links Differenti.pdf:application/pdf},
+}
+
+@article{campbell_determinants_2022,
+ title = {Determinants of {Multiheme} {Cytochrome} {Extracellular} {Electron} {Transfer} {Uncovered} by {Systematic} {Peptide} {Insertion}},
+ volume = {61},
+ issn = {0006-2960, 1520-4995},
+ url = {https://pubs.acs.org/doi/10.1021/acs.biochem.2c00148},
+ doi = {10.1021/acs.biochem.2c00148},
+ language = {en},
+ number = {13},
+ urldate = {2023-06-14},
+ journal = {Biochemistry},
+ author = {Campbell, Ian J. and Atkinson, Joshua T. and Carpenter, Matthew D. and Myerscough, Dru and Su, Lin and Ajo-Franklin, Caroline M. and Silberg, Jonathan J.},
+ month = jul,
+ year = {2022},
+ pages = {1337--1350},
+ file = {Submitted Version:/Users/admin/Zotero/storage/CQJVV8DL/Campbell et al. - 2022 - Determinants of Multiheme Cytochrome Extracellular.pdf:application/pdf},
+}
+
+@article{ding_co-evolution_2022,
+ title = {Co-evolution of interacting proteins through non-contacting and non-specific mutations},
+ volume = {6},
+ issn = {2397-334X},
+ url = {https://www.nature.com/articles/s41559-022-01688-0},
+ doi = {10.1038/s41559-022-01688-0},
+ language = {en},
+ number = {5},
+ urldate = {2023-06-14},
+ journal = {Nature Ecology \& Evolution},
+ author = {Ding, David and Green, Anna G. and Wang, Boyuan and Lite, Thuy-Lan Vo and Weinstein, Eli N. and Marks, Debora S. and Laub, Michael T.},
+ month = mar,
+ year = {2022},
+ pages = {590--603},
+ file = {Accepted Version:/Users/admin/Zotero/storage/RIQVW8ID/Ding et al. - 2022 - Co-evolution of interacting proteins through non-c.pdf:application/pdf},
+}
+
+% pgv1
+
+@article{tsuboyama_mega-scale_2023,
+ title = {Mega-scale experimental analysis of protein folding stability in biology and design},
+ volume = {620},
+ issn = {0028-0836, 1476-4687},
+ url = {https://www.nature.com/articles/s41586-023-06328-6},
+ doi = {10.1038/s41586-023-06328-6},
+ abstract = {Abstract
+
+ Advances in DNA sequencing and machine learning are providing insights into protein sequences and structures on an enormous scale
+ 1
+ . However, the energetics driving folding are invisible in these structures and remain largely unknown
+ 2
+ . The hidden thermodynamics of folding can drive disease
+ 3,4
+ , shape protein evolution
+ 5–7
+ and guide protein engineering
+ 8–10
+ , and new approaches are needed to reveal these thermodynamics for every sequence and structure. Here we present cDNA display proteolysis, a method for measuring thermodynamic folding stability for up to 900,000 protein domains in a one-week experiment. From 1.8 million measurements in total, we curated a set of around 776,000 high-quality folding stabilities covering all single amino acid variants and selected double mutants of 331 natural and 148 de novo designed protein domains 40–72 amino acids in length. Using this extensive dataset, we quantified (1) environmental factors influencing amino acid fitness, (2) thermodynamic couplings (including unexpected interactions) between protein sites, and (3) the global divergence between evolutionary amino acid usage and protein folding stability. We also examined how our approach could identify stability determinants in designed proteins and evaluate design methods. The cDNA display proteolysis method is fast, accurate and uniquely scalable, and promises to reveal the quantitative rules for how amino acid sequences encode folding stability.},
+ language = {en},
+ number = {7973},
+ urldate = {2023-08-15},
+ journal = {Nature},
+ author = {Tsuboyama, Kotaro and Dauparas, Justas and Chen, Jonathan and Laine, Elodie and Mohseni Behbahani, Yasser and Weinstein, Jonathan J. and Mangan, Niall M. and Ovchinnikov, Sergey and Rocklin, Gabriel J.},
+ month = aug,
+ year = {2023},
+ pages = {434--444},
+ file = {Full Text:/Users/admin/Zotero/storage/54IHFJGX/Tsuboyama et al. - 2023 - Mega-scale experimental analysis of protein foldin.pdf:application/pdf},
+}
+
+
+@UNPUBLISHED{suphatrakul_functional_2023,
+ title = "Functional analysis of flavivirus replicase by deep mutational
+ scanning of dengue {NS5}",
+ author = "Suphatrakul, Amporn and Posiri, Pratsaneeyaporn and Srisuk,
+ Nittaya and Nantachokchawapan, Rapirat and Onnome, Suppachoke and
+ Mongkolsapaya, Juthathip and Siridechadilok, Bunpote",
+ abstract = "Flavivirus NS5 is multi-functional viral protein that play
+ critical roles in virus replication, evolution, and immune
+ antagonism against the hosts. Its error-prone replicase activity
+ copies viral RNA for progeny virus particles and shapes virus
+ evolution. Its methyltransferase activity and STAT2-targeting
+ activity compromise type-I interferon signalling, dampening
+ protective immune response during infection. It interacts with
+ several host factors to shape the host-cell environment for virus
+ replication. Thus, NS5 represents a critical target for both
+ vaccine and antiviral drug development. Here, we performed deep
+ mutational scanning (DMS) on the NS5 of dengue virus serotype 2
+ in mammalian cells. In combination with available structural and
+ biochemical data, the comprehensive single amino-acid mutational
+ data corroborated key residues and interactions involved in
+ enzymatic functions of the replicase and suggested potential
+ plasticity in NS5 guanylyl transferase. Strikingly, we identified
+ that a set of strictly conserved residues in the motifs lining
+ the replicase active site could tolerate mutations, suggesting
+ additional roles of the priming loop in viral RNA synthesis and
+ possible strategies to modulate the error rate of viral replicase
+ activity through active-site engineering. Our DMS dataset and NS5
+ libraries could provide a framework and a resource to investigate
+ molecular, evolutionary, and immunological aspects of NS5
+ functions, with relevance to vaccine and antiviral drug
+ development. \#\#\# Competing Interest Statement B.S., A.S.,
+ P.P., N.S., R.N., and S.O. have filed a patent application
+ related to this work.",
+ journal = "bioRxiv",
+ pages = "2023.03.07.531617",
+ month = mar,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{weile_shifting_2021,
+ title = "Shifting landscapes of human {MTHFR} missense-variant effects",
+ author = "Weile, Jochen and Kishore, Nishka and Sun, Song and Maaieh, Ranim
+ and Verby, Marta and Li, Roujia and Fotiadou, Iosifina and
+ Kitaygorodsky, Julia and Wu, Yingzhou and Holenstein, Alexander
+ and B{\"u}rer, C{\'e}line and Blomgren, Linnea and Yang, Shan and
+ Nussbaum, Robert and Rozen, Rima and Watkins, David and Gebbia,
+ Marinella and Kozich, Viktor and Garton, Michael and Froese, D
+ Sean and Roth, Frederick P",
+ abstract = "Most rare clinical missense variants cannot currently be
+ classified as pathogenic or benign. Deficiency in human
+ 5,10-methylenetetrahydrofolate reductase (MTHFR), the most common
+ inherited disorder of folate metabolism, is caused primarily by
+ rare missense variants. Further complicating variant
+ interpretation, variant impacts often depend on environment. An
+ important example of this phenomenon is the MTHFR variant
+ p.Ala222Val (c.665C>T), which is carried by half of all humans
+ and has a phenotypic impact that depends on dietary folate. Here
+ we describe the results of 98,336 variant functional-impact
+ assays, covering nearly all possible MTHFR amino acid
+ substitutions in four folinate environments, each in the presence
+ and absence of p.Ala222Val. The resulting atlas of MTHFR variant
+ effects reveals many complex dependencies on both folinate and
+ p.Ala222Val. MTHFR atlas scores can distinguish pathogenic from
+ benign variants and, among individuals with severe MTHFR
+ deficiency, correlate with age of disease onset. Providing a
+ powerful tool for understanding structure-function relationships,
+ the atlas suggests a role for a disordered loop in retaining
+ cofactor at the active site and identifies variants that enable
+ escape of inhibition by S-adenosylmethionine. Thus, a model based
+ on eight MTHFR variant effect maps illustrates how shifting
+ landscapes of environment- and genetic-background-dependent
+ missense variation can inform our clinical, structural, and
+ functional understanding of MTHFR deficiency.",
+ journal = "American Journal of Human Genetics",
+ volume = 108,
+ number = 7,
+ pages = "1283--1300",
+ month = jul,
+ year = 2021,
+ keywords = "clinical variant interpretation; cystathionine beta synthase;
+ deep mutational scanning; folate; gene- environment interaction;
+ homocystinuria; methylenetetrahydrofolate reductase; molecular
+ dynamics; mthfr; variant effect mapping",
+ language = "en"
+}
+
+@ARTICLE{xie_predicting_2023,
+ title = "Predicting the functional effect of compound heterozygous
+ genotypes from large scale variant effect maps",
+ author = "Xie, Michael J and Cromie, Gareth A and Owens, Katherine and
+ Timour, Martin S and Tang, Michelle and Kutz, J Nathan and
+ El-Hattab, Ayman W and McLaughlin, Richard N and Dudley,
+ Aim{\'e}e M",
+ abstract = "BACKGROUND: Pathogenic variants in PHGDH, PSAT1 , and PSPH cause
+ a set of rare, autosomal recessive diseases known as serine
+ biosynthesis defects. Serine biosynthesis defects present in a
+ broad phenotypic spectrum that includes, at the severe end,
+ Neu-Laxova syndrome, a lethal multiple congenital anomaly
+ disease, intermediately in the form of infantile serine
+ biosynthesis defects with severe neurological manifestations and
+ growth deficiency, and at the mild end, as childhood disease with
+ intellectual disability. However, because L-serine
+ supplementation, especially if started early, can ameliorate and
+ in some cases even prevent symptoms, knowledge of pathogenic
+ variants is highly actionable. METHODS: Recently, our laboratory
+ established a yeast-based assay for human PSAT1 function. We have
+ now applied it at scale to assay the functional impact of 1,914
+ SNV-accessible amino acid substitutions. In addition to assaying
+ the functional impact of individual variants in yeast haploid
+ cells, we can assay pairwise combinations of PSAT1 alleles that
+ recapitulate human genotypes, including compound heterozygotes,
+ in yeast diploids. RESULTS: Results of our assays of individual
+ variants (in haploid yeast cells) agree well with clinical
+ interpretations and protein structure-function relationships,
+ supporting the use of our data as functional evidence under the
+ ACMG interpretation guidelines. Results from our diploid assay
+ successfully distinguish patient genotypes from those of healthy
+ carriers and agree well with disease severity. Finally, we
+ present a linear model that uses individual allele measurements
+ (in haploid yeast cells) to accurately predict the biallelic
+ function (in diploid yeast cells) of ~ 1.8 million allele
+ combinations corresponding to potential human genotypes.
+ CONCLUSIONS: Taken together, our work provides an example of how
+ large-scale functional assays in model systems can be powerfully
+ applied to the study of a rare disease.",
+ journal = "bioRxiv",
+ month = jan,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{roychowdhury_microfluidic_2022,
+ title = "Microfluidic deep mutational scanning of the human executioner
+ caspases reveals differences in structure and regulation",
+ author = "Roychowdhury, Hridindu and Romero, Philip A",
+ abstract = "The human caspase family comprises 12 cysteine proteases that are
+ centrally involved in cell death and inflammation responses. The
+ members of this family have conserved sequences and structures,
+ highly similar enzymatic activities and substrate preferences,
+ and overlapping physiological roles. In this paper, we present a
+ deep mutational scan of the executioner caspases CASP3 and CASP7
+ to dissect differences in their structure, function, and
+ regulation. Our approach leverages high-throughput microfluidic
+ screening to analyze hundreds of thousands of caspase variants in
+ tightly controlled in vitro reactions. The resulting data
+ provides a large-scale and unbiased view of the impact of amino
+ acid substitutions on the proteolytic activity of CASP3 and
+ CASP7. We use this data to pinpoint key functional differences
+ between CASP3 and CASP7, including a secondary internal cleavage
+ site, CASP7 Q196 that is not present in CASP3. Our results will
+ open avenues for inquiry in caspase function and regulation that
+ could potentially inform the development of future
+ caspase-specific therapeutics.",
+ journal = "Cell Death Discovery",
+ volume = 8,
+ number = 1,
+ pages = "7",
+ month = jan,
+ year = 2022,
+ language = "en"
+}
+
+@ARTICLE{andrews_distinct_2020,
+ title = "Distinct patterns of mutational sensitivity for $\lambda$
+ resistance and maltodextrin transport in Escherichia coli {LamB}",
+ author = "Andrews, Bryan and Fields, Stanley",
+ abstract = "Bacteria can evade cohabiting phages through mutations in phage
+ receptors, but these mutations may come at a cost if they disrupt
+ the receptor's native cellular function. To investigate the
+ relationship between these two conflicting activities, we
+ generated sequence-function maps of Escherichia coli LamB with
+ respect to sensitivity to phage $\lambda$ and transport of
+ maltodextrin. By comparing 413 missense mutations whose effect on
+ both traits could be analysed, we find that these two phenotypes
+ were correlated, implying that most mutations affect these
+ phenotypes through a common mechanism such as loss of protein
+ stability. However, individual mutations could be found that
+ specifically disrupt $\lambda$-sensitivity without affecting
+ maltodextrin transport. We identify and individually assay nine
+ such mutations, whose spatial positions implicate loop L6 of LamB
+ in $\lambda$ binding. Although missense mutations that lead to
+ $\lambda$-resistance are rare, they were approximately as likely
+ to be maltodextrin-utilizing (Mal+) as not (Mal-), implying that
+ E. coli can adapt to $\lambda$ while conserving the receptor's
+ native function. We propose that in order for E. coli and
+ $\lambda$ to stably cohabitate, selection for
+ $\lambda$-resistance and maltose transport must be spatially or
+ temporally separated.",
+ journal = "Microbial Genomics",
+ volume = 6,
+ number = 4,
+ month = apr,
+ year = 2020,
+ keywords = "Escherichia coli; LamB; evolution; nutrient transport; phage
+ $\lambda$",
+ language = "en"
+}
+
+@ARTICLE{lo_functional_2023,
+ title = "The functional impact of 1,570 individual amino acid
+ substitutions in human {OTC}",
+ author = "Lo, Russell S and Cromie, Gareth A and Tang, Michelle and Teng,
+ Kevin and Owens, Katherine and Sirr, Amy and Kutz, J Nathan and
+ Morizono, Hiroki and Caldovic, Ljubica and Ah Mew, Nicholas and
+ Gropman, Andrea and Dudley, Aim{\'e}e M",
+ abstract = "Deleterious mutations in the X-linked gene encoding ornithine
+ transcarbamylase (OTC) cause the most common urea cycle disorder,
+ OTC deficiency. This rare but highly actionable disease can
+ present with severe neonatal onset in males or with later onset
+ in either sex. Individuals with neonatal onset appear normal at
+ birth but rapidly develop hyperammonemia, which can progress to
+ cerebral edema, coma, and death, outcomes ameliorated by rapid
+ diagnosis and treatment. Here, we develop a high-throughput
+ functional assay for human OTC and individually measure the
+ impact of 1,570 variants, 84\% of all SNV-accessible missense
+ mutations. Comparison to existing clinical significance calls,
+ demonstrated that our assay distinguishes known benign from
+ pathogenic variants and variants with neonatal onset from
+ late-onset disease presentation. This functional stratification
+ allowed us to identify score ranges corresponding to clinically
+ relevant levels of impairment of OTC activity. Examining the
+ results of our assay in the context of protein structure further
+ allowed us to identify a 13 amino acid domain, the SMG loop,
+ whose function appears to be required in human cells but not in
+ yeast. Finally, inclusion of our data as PS3 evidence under the
+ current ACMG guidelines, in a pilot reclassification of 34
+ variants with complete loss of activity, would change the
+ classification of 22 from variants of unknown significance to
+ clinically actionable likely pathogenic variants. These results
+ illustrate how large-scale functional assays are especially
+ powerful when applied to rare genetic diseases.",
+ journal = "American Journal of Human Genetics",
+ volume = 110,
+ number = 5,
+ pages = "863--879",
+ month = may,
+ year = 2023,
+ keywords = "OTC deficiency; Oxford Nanopore sequencing; SNV
+ analysis/discovery; X-linked disease; metabolic disorder; model
+ organisms; multiplexed assays of variant effect; rare disease;
+ rare variants; urea cycle disorder; variant interpretation",
+ language = "en"
+}
+
+@ARTICLE{gajula_high_2014,
+ title = "High-throughput mutagenesis reveals functional determinants for
+ {DNA} targeting by activation-induced deaminase",
+ author = "Gajula, Kiran S and Huwe, Peter J and Mo, Charlie Y and Crawford,
+ Daniel J and Stivers, James T and Radhakrishnan, Ravi and Kohli,
+ Rahul M",
+ abstract = "Antibody maturation is a critical immune process governed by the
+ enzyme activation-induced deaminase (AID), a member of the
+ AID/APOBEC DNA deaminase family. AID/APOBEC deaminases
+ preferentially target cytosine within distinct preferred sequence
+ motifs in DNA, with specificity largely conferred by a small 9-11
+ residue protein loop that differs among family members. Here, we
+ aimed to determine the key functional characteristics of this
+ protein loop in AID and to thereby inform our understanding of
+ the mode of DNA engagement. To this end, we developed a
+ methodology (Sat-Sel-Seq) that couples saturation mutagenesis at
+ each position across the targeting loop, with iterative
+ functional selection and next-generation sequencing. This
+ high-throughput mutational analysis revealed dominant
+ characteristics for residues within the loop and additionally
+ yielded enzymatic variants that enhance deaminase activity. To
+ rationalize these functional requirements, we performed molecular
+ dynamics simulations that suggest that AID and its hyperactive
+ variants can engage DNA in multiple specific modes. These
+ findings align with AID's competing requirements for specificity
+ and flexibility to efficiently drive antibody maturation. Beyond
+ insights into the AID-DNA interface, our Sat-Sel-Seq approach
+ also serves to further expand the repertoire of techniques for
+ deep positional scanning and may find general utility for
+ high-throughput analysis of protein function.",
+ journal = "Nucleic Acids Research",
+ volume = 42,
+ number = 15,
+ pages = "9964--9975",
+ month = sep,
+ year = 2014,
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@UNPUBLISHED{chakraborty_profiling_2021,
+ title = "Profiling of the drug resistance of thousands of Src tyrosine
+ kinase mutants uncovers a regulatory network that couples
+ autoinhibition to catalytic domain dynamics",
+ author = "Chakraborty, Sujata and Ahler, Ethan and Simon, Jessica J and
+ Fang, Linglan and Potter, Zachary E and Sitko, Katherine A and
+ Stephany, Jason J and Guttman, Miklos and Fowler, Douglas M and
+ Maly, Dustin J",
+ abstract = "SUMMARYKinase inhibitors are effective cancer therapies but
+ resistance often limits clinical efficacy. Despite the
+ cataloguing of numerous resistance mutations, our understanding
+ of kinase inhibitor resistance is still incomplete. Here, we
+ comprehensively profiled the resistance of ∼3500 Src tyrosine
+ kinase mutants to four different ATP-competitive inhibitors. We
+ found that ATP-competitive inhibitor resistance mutations are
+ distributed throughout Src's catalytic domain. In addition to
+ inhibitor contact residues, residues that participate in
+ regulating Src's phosphotransferase activity were prone to the
+ development of resistance. Unexpectedly, we found that a
+ resistance-prone cluster of residues located on the top face of
+ the N-terminal lobe of Src's catalytic domain contributes to
+ autoinhibition by reducing catalytic domain dynamics, and
+ mutations in this cluster led to resistance by lowering inhibitor
+ affinity and promoting kinase hyperactivation. Together, our
+ studies demonstrate how drug resistance profiling can be used to
+ define potential resistance pathways and uncover new mechanisms
+ of kinase regulation.",
+ journal = "bioRxiv",
+ month = dec,
+ year = 2021
+}
+
+@UNPUBLISHED{weng_energetic_2022,
+ title = "The energetic and allosteric landscape for {KRAS} inhibition",
+ author = "Weng, Chenchun and Faure, Andre J and Lehner, Ben",
+ abstract = "Thousands of proteins have now been genetically-validated as
+ therapeutic targets in hundreds of human diseases. However, very
+ few have actually been successfully targeted and many are
+ considered `undruggable'. This is particularly true for proteins
+ that function via protein-protein interactions: direct inhibition
+ of binding interfaces is difficult, requiring the identification
+ of allosteric sites. However, most proteins have no known
+ allosteric sites and a comprehensive allosteric map does not
+ exist for any protein. Here we address this shortcoming by
+ charting multiple global atlases of inhibitory allosteric
+ communication in KRAS, a protein mutated in 1 in 10 human
+ cancers. We quantified the impact of >26,000 mutations on the
+ folding of KRAS and its binding to six interaction partners.
+ Genetic interactions in double mutants allowed us to perform
+ biophysical measurements at scale, inferring >22,000 causal free
+ energy changes, a similar number of measurements as the total
+ made for proteins to date. These energy landscapes quantify how
+ mutations tune the binding specificity of a signalling protein
+ and map the inhibitory allosteric sites for an important
+ therapeutic target. Allosteric propagation is particularly
+ effective across the central beta sheet of KRAS and multiple
+ surface pockets are genetically-validated as allosterically
+ active, including a distal pocket in the C-terminal lobe of the
+ protein. Allosteric mutations typically inhibit binding to all
+ tested effectors but they can also change the binding
+ specificity, revealing the regulatory, evolutionary and
+ therapeutic potential to tune pathway activation. Using the
+ approach described here it should be possible to rapidly and
+ comprehensively identify allosteric target sites in many
+ important proteins. \#\#\# Competing Interest Statement The
+ authors have declared no competing interest.",
+ journal = "bioRxiv",
+ pages = "2022.12.06.519122",
+ month = dec,
+ year = 2022,
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@UNPUBLISHED{ding_protein_2023,
+ title = "Protein design using structure-based residue preferences",
+ author = "Ding, David and Shaw, Ada and Sinai, Sam and Rollins, Nathan and
+ Prywes, Noam and Savage, David F and Laub, Michael T and Marks,
+ Debora S",
+ abstract = "Recent developments in protein design have adapted large neural
+ networks with up to 100s of millions of parameters to learn
+ complex sequence-function mappings. However, it is unclear which
+ dependencies between residues are critical for determining
+ protein function, and a better empirical understanding could
+ enable high quality models that are also more data- and
+ resource-efficient. Here, we observe that the per residue amino
+ acid preferences - without considering interactions between
+ mutations are sufficient to explain much, and sometimes virtually
+ all of the combinatorial mutation effects across 7 datasets (R2 ∼
+ 78-98\%), including one generated here. These preference
+ parameters (20*N, where N is the number of mutated residues) can
+ be learned from as few as ∼5\textbackslash*20\textbackslash*N
+ observations to predict a much larger number (potentially up to
+ 20N) of combinatorial variant effects with high accuracy (Pearson
+ r > 0.8). We hypothesized that the local structural dependencies
+ surrounding a residue could be sufficient to learn these required
+ mutation preferences, and developed an unsupervised design
+ approach, which we term CoVES for `Combinatorial Variant Effects
+ from Structure'. We show that CoVES outperforms not just model
+ free sampling approaches but also complicated, high-capacity
+ autoregressive neural networks in generating functional and
+ diverse sequence variants for two example proteins. This simple,
+ biologically-rooted model can be an effective alternative to
+ high-capacity, out of domain models for the design of functional
+ proteins. \#\#\# Competing Interest Statement SS is employed by
+ Dyno Therapeutics. DSM is an advisor for Dyno Therapeutics,
+ Octant, Jura Bio, Tectonic Therapeutics, and Genentech, and a
+ co-founder of Seismic. NR is employed by Seismic. DFS is a
+ co-founder and scientific advisory board member of Scribe
+ Therapeutics. The remaining authors declare no competing
+ interests.",
+ journal = "bioRxiv",
+ pages = "2022.10.31.514613",
+ month = jun,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{nguyen_molecular_2023,
+ title = "Molecular determinants of Hsp90 dependence of Src kinase revealed
+ by deep mutational scanning",
+ author = "Nguyen, Vanessa and Ahler, Ethan and Sitko, Katherine A and
+ Stephany, Jason J and Maly, Dustin J and Fowler, Douglas M",
+ abstract = "Hsp90 is a molecular chaperone involved in the refolding and
+ activation of numerous protein substrates referred to as clients.
+ While the molecular determinants of Hsp90 client specificity are
+ poorly understood and limited to a handful of client proteins,
+ strong clients are thought to be destabilized and
+ conformationally extended. Here, we measured the
+ phosphotransferase activity of 3929 variants of the tyrosine
+ kinase Src in both the presence and absence of an Hsp90
+ inhibitor. We identified 84 previously unknown functionally
+ dependent client variants. Unexpectedly, many destabilized or
+ extended variants were not functionally dependent on Hsp90.
+ Instead, functionally dependent client variants were clustered in
+ the $\alpha$F pocket and $\beta$1-$\beta$2 strand regions of Src,
+ which have yet to be described in driving Hsp90 dependence. Hsp90
+ dependence was also strongly correlated with kinase activity. We
+ found that a combination of activation, global extension, and
+ general conformational flexibility, primarily induced by variants
+ at the $\alpha$F pocket and $\beta$1-$\beta$2 strands, was
+ necessary to render Src functionally dependent on Hsp90.
+ Moreover, the degree of activation and flexibility required to
+ transform Src into a functionally dependent client varied with
+ variant location, suggesting that a combination of regulatory
+ domain disengagement and catalytic domain flexibility are
+ required for chaperone dependence. Thus, by studying the
+ chaperone dependence of a massive number of variants, we
+ highlight factors driving Hsp90 client specificity and propose a
+ model of chaperone-kinase interactions.",
+ journal = "Protein Science",
+ volume = 32,
+ number = 7,
+ pages = "e4656",
+ month = jul,
+ year = 2023,
+ keywords = "Hsp90; Hsp90 inhibitor; chaperone; client; src kinase; tyrosine
+ kinase",
+ language = "en"
+}
+
+@ARTICLE{seuma_atlas_2022,
+ title = "An atlas of amyloid aggregation: the impact of substitutions,
+ insertions, deletions and truncations on amyloid beta fibril
+ nucleation",
+ author = "Seuma, Mireia and Lehner, Ben and Bolognesi, Benedetta",
+ abstract = "Multiplexed assays of variant effects (MAVEs) guide clinical
+ variant interpretation and reveal disease mechanisms. To date,
+ MAVEs have focussed on a single mutation type-amino acid (AA)
+ substitutions-despite the diversity of coding variants that cause
+ disease. Here we use Deep Indel Mutagenesis (DIM) to generate a
+ comprehensive atlas of diverse variant effects for a disease
+ protein, the amyloid beta (A$\beta$) peptide that aggregates in
+ Alzheimer's disease (AD) and is mutated in familial AD (fAD). The
+ atlas identifies known fAD mutations and reveals that many
+ variants beyond substitutions accelerate A$\beta$ aggregation and
+ are likely to be pathogenic. Truncations, substitutions,
+ insertions, single- and internal multi-AA deletions differ in
+ their propensity to enhance or impair aggregation, but likely
+ pathogenic variants from all classes are highly enriched in the
+ polar N-terminal region of A$\beta$. This comparative atlas
+ highlights the importance of including diverse mutation types in
+ MAVEs and provides important mechanistic insights into amyloid
+ nucleation.",
+ journal = "Nature Communications",
+ volume = 13,
+ number = 1,
+ pages = "7084",
+ month = nov,
+ year = 2022,
+ language = "en"
+}
+
+@ARTICLE{ursu_massively_2022,
+ title = "Massively parallel phenotyping of coding variants in cancer with
+ Perturb-seq",
+ author = "Ursu, Oana and Neal, James T and Shea, Emily and Thakore,
+ Pratiksha I and Jerby-Arnon, Livnat and Nguyen, Lan and Dionne,
+ Danielle and Diaz, Celeste and Bauman, Julia and Mosaad, Mariam
+ Mounir and Fagre, Christian and Lo, April and McSharry, Maria and
+ Giacomelli, Andrew O and Ly, Seav Huong and Rozenblatt-Rosen,
+ Orit and Hahn, William C and Aguirre, Andrew J and Berger, Alice
+ H and Regev, Aviv and Boehm, Jesse S",
+ abstract = "Genome sequencing studies have identified millions of somatic
+ variants in cancer, but it remains challenging to predict the
+ phenotypic impact of most. Experimental approaches to distinguish
+ impactful variants often use phenotypic assays that report on
+ predefined gene-specific functional effects in bulk cell
+ populations. Here, we develop an approach to functionally assess
+ variant impact in single cells by pooled Perturb-seq. We measured
+ the impact of 200 TP53 and KRAS variants on RNA profiles in over
+ 300,000 single lung cancer cells, and used the profiles to
+ categorize variants into phenotypic subsets to distinguish
+ gain-of-function, loss-of-function and dominant negative
+ variants, which we validated by comparison with orthogonal
+ assays. We discovered that KRAS variants did not merely fit into
+ discrete functional categories, but spanned a continuum of
+ gain-of-function phenotypes, and that their functional impact
+ could not have been predicted solely by their frequency in
+ patient cohorts. Our work provides a scalable, gene-agnostic
+ method for coding variant impact phenotyping, with potential
+ applications in multiple disease settings.",
+ journal = "Nature Biotechnology",
+ volume = 40,
+ number = 6,
+ pages = "896--905",
+ month = jun,
+ year = 2022,
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@ARTICLE{wrenbeck_automated_2019,
+ title = "An Automated {Data-Driven} Pipeline for Improving Heterologous
+ Enzyme Expression",
+ author = "Wrenbeck, Emily E and Bedewitz, Matthew A and Klesmith, Justin R
+ and Noshin, Syeda and Barry, Cornelius S and Whitehead, Timothy A",
+ abstract = "Enzymes are the ultimate entities responsible for chemical
+ transformations in natural and engineered biosynthetic pathways.
+ However, many natural enzymes suffer from suboptimal functional
+ expression due to poor intrinsic protein stability. Further,
+ stability enhancing mutations often come at the cost of impaired
+ function. Here we demonstrate an automated protein engineering
+ strategy for stabilizing enzymes while retaining catalytic
+ function using deep mutational scanning coupled to
+ multiple-filter based screening and combinatorial mutagenesis. We
+ validated this strategy by improving the functional expression of
+ a Type III polyketide synthase from the Atropa belladonna
+ biosynthetic pathway for tropane alkaloids. The best variant had
+ a total of 8 mutations with over 25-fold improved activity over
+ wild-type in E. coli cell lysates, an improved melting
+ temperature of 11.5 $\pm$ 0.6 °C, and only minimal reduction in
+ catalytic efficiency. We show that the multiple-filter approach
+ maintains acceptable sensitivity with homology modeling
+ structures up to 4 {\AA} RMS. Our results highlight an automated
+ protein engineering tool for improving the stability and
+ solubility of difficult to express enzymes, which has impact for
+ biotechnological applications.",
+ journal = "ACS Synthetic Biology",
+ volume = 8,
+ number = 3,
+ pages = "474--481",
+ month = mar,
+ year = 2019,
+ keywords = "deep mutational scanning; enzyme stability; heterologous pathway
+ expression; high-throughput screening; polyketide synthase;
+ tropane alkaloids",
+ language = "en"
+}
+
+@ARTICLE{hom_deep_2019,
+ title = "Deep Mutational Scan of the Highly Conserved Influenza A Virus
+ {M1} Matrix Protein Reveals Substantial Intrinsic Mutational
+ Tolerance",
+ author = "Hom, Nancy and Gentles, Lauren and Bloom, Jesse D and Lee, Kelly
+ K",
+ abstract = "Influenza A virus matrix protein M1 is involved in multiple
+ stages of the viral infectious cycle. Despite its functional
+ importance, our present understanding of this essential viral
+ protein is limited. The roles of a small subset of specific amino
+ acids have been reported, but a more comprehensive understanding
+ of the relationship between M1 sequence, structure, and virus
+ fitness remains elusive. In this study, we used deep mutational
+ scanning to measure the effect of every amino acid substitution
+ in M1 on viral replication in cell culture. The map of amino acid
+ mutational tolerance we have generated allows us to identify
+ sites that are functionally constrained in cell culture as well
+ as sites that are less constrained. Several sites that exhibit
+ low tolerance to mutation have been found to be critical for M1
+ function and production of viable virions. Surprisingly,
+ significant portions of the M1 sequence, especially in the
+ C-terminal domain, whose structure is undetermined, were found to
+ be highly tolerant of amino acid variation, despite having
+ extremely low levels of sequence diversity among natural
+ influenza virus strains. This unexpected discrepancy indicates
+ that not all sites in M1 that exhibit high sequence conservation
+ in nature are under strong constraint during selection for viral
+ replication in cell culture.IMPORTANCE The M1 matrix protein is
+ critical for many stages of the influenza virus infection cycle.
+ Currently, we have an incomplete understanding of this highly
+ conserved protein's function and structure. Key regions of M1,
+ particularly in the C terminus of the protein, remain poorly
+ characterized. In this study, we used deep mutational scanning to
+ determine the extent of M1's tolerance to mutation. Surprisingly,
+ nearly two-thirds of the M1 sequence exhibits a high tolerance
+ for substitutions, contrary to the extremely low sequence
+ diversity observed across naturally occurring M1 isolates. Sites
+ with low mutational tolerance were also identified, suggesting
+ that they likely play critical functional roles and are under
+ selective pressure. These results reveal the intrinsic mutational
+ tolerance throughout M1 and shape future inquiries probing the
+ functions of this essential influenza A virus protein.",
+ journal = "Journal of Virology",
+ volume = 93,
+ number = 13,
+ month = jul,
+ year = 2019,
+ keywords = "M1 matrix; codon library; deep mutational scan; influenza A
+ virus; mutational tolerance; selection; viral evolution",
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@ARTICLE{gray_elucidating_2019,
+ title = "Elucidating the Molecular Determinants of {A$\beta$} Aggregation
+ with Deep Mutational Scanning",
+ author = "Gray, Vanessa E and Sitko, Katherine and Kameni, Floriane Z Ngako
+ and Williamson, Miriam and Stephany, Jason J and Hasle, Nicholas
+ and Fowler, Douglas M",
+ abstract = "Despite the importance of A$\beta$ aggregation in Alzheimer's
+ disease etiology, our understanding of the sequence determinants
+ of aggregation is sparse and largely derived from in vitro
+ studies. For example, in vitro proline and alanine scanning
+ mutagenesis of A$\beta$40 proposed core regions important for
+ aggregation. However, we lack even this limited mutagenesis data
+ for the more disease-relevant A$\beta$42 Thus, to better
+ understand the molecular determinants of A$\beta$42 aggregation
+ in a cell-based system, we combined a yeast DHFR aggregation
+ assay with deep mutational scanning. We measured the effect of
+ 791 of the 798 possible single amino acid substitutions on the
+ aggregation propensity of A$\beta$42 We found that ∼75\% of
+ substitutions, largely to hydrophobic residues, maintained or
+ increased aggregation. We identified 11 positions at which
+ substitutions, particularly to hydrophilic and charged amino
+ acids, disrupted A$\beta$ aggregation. These critical positions
+ were similar but not identical to critical positions identified
+ in previous A$\beta$ mutagenesis studies. Finally, we analyzed
+ our large-scale mutagenesis data in the context of different
+ A$\beta$ aggregate structural models, finding that the
+ mutagenesis data agreed best with models derived from fibrils
+ seeded using brain-derived A$\beta$ aggregates.",
+ journal = "G3",
+ volume = 9,
+ number = 11,
+ pages = "3683--3689",
+ month = nov,
+ year = 2019,
+ keywords = "Amyloid; Amyloid beta; Deep mutational scanning; Protein
+ aggregation",
+ language = "en"
+}
+
+@ARTICLE{veerapandian_directed_2018,
+ title = "Directed Evolution of Reprogramming Factors by Cell Selection and
+ Sequencing",
+ author = "Veerapandian, Veeramohan and Ackermann, Jan Ole and Srivastava,
+ Yogesh and Malik, Vikas and Weng, Mingxi and Yang, Xiaoxiao and
+ Jauch, Ralf",
+ abstract = "Directed biomolecular evolution is widely used to tailor and
+ enhance enzymes, fluorescent proteins, and antibodies but has
+ hitherto not been applied in the reprogramming of mammalian
+ cells. Here, we describe a method termed directed evolution of
+ reprogramming factors by cell selection and sequencing
+ (DERBY-seq) to identify artificially enhanced and evolved
+ reprogramming transcription factors. DERBY-seq entails pooled
+ screens with libraries of positionally randomised genes, cell
+ selection based on phenotypic readouts, and genotyping by
+ amplicon sequencing for candidate identification. We benchmark
+ this approach using pluripotency reprogramming with libraries
+ based on the reprogramming factor SOX2 and the reprogramming
+ incompetent endodermal factor SOX17. We identified several SOX2
+ variants outperforming the wild-type protein in three- and
+ four-factor cocktails. The most effective variants were
+ discovered from the SOX17 library, demonstrating that this factor
+ can be converted into a highly potent inducer of pluripotency
+ with a range of diverse modifications. We propose DERBY-seq as a
+ broad-based approach to discover reprogramming factors for any
+ donor/target cell combination applicable to direct lineage
+ reprogramming in vitro and in vivo.",
+ journal = "Stem Cell Reports",
+ volume = 11,
+ number = 2,
+ pages = "593--606",
+ month = aug,
+ year = 2018,
+ keywords = "OCT4; SOX17; SOX2; cellular reprogramming; deep mutational
+ scanning; directed evolution; protein engineering; synthetic
+ biology; synthetic transcription factors",
+ language = "en"
+}
+
+@ARTICLE{thyagarajan_inherent_2014,
+ title = "The inherent mutational tolerance and antigenic evolvability of
+ influenza hemagglutinin",
+ author = "Thyagarajan, Bargavi and Bloom, Jesse D",
+ abstract = "Influenza is notable for its evolutionary capacity to escape
+ immunity targeting the viral hemagglutinin. We used deep
+ mutational scanning to examine the extent to which a high
+ inherent mutational tolerance contributes to this antigenic
+ evolvability. We created mutant viruses that incorporate most of
+ the $\approx$10(4) amino-acid mutations to hemagglutinin from
+ A/WSN/1933 (H1N1) influenza. After passaging these viruses in
+ tissue culture to select for functional variants, we used deep
+ sequencing to quantify mutation frequencies before and after
+ selection. These data enable us to infer the preference for each
+ amino acid at each site in hemagglutinin. These inferences are
+ consistent with existing knowledge about the protein's structure
+ and function, and can be used to create a model that describes
+ hemagglutinin's evolution far better than existing phylogenetic
+ models. We show that hemagglutinin has a high inherent tolerance
+ for mutations at antigenic sites, suggesting that this is one
+ factor contributing to influenza's antigenic evolution.",
+ journal = "Elife",
+ volume = 3,
+ month = jul,
+ year = 2014,
+ keywords = "antigenic evolution; deep mutational scanning; evolutionary
+ biology; evolvability; genomics; hemagglutinin; infectious
+ disease; influenza; microbiology; phylogenetics; viruses",
+ language = "en"
+}
+
+@ARTICLE{ostermaier_functional_2014,
+ title = "Functional map of arrestin-1 at single amino acid resolution",
+ author = "Ostermaier, Martin K and Peterhans, Christian and Jaussi, Rolf
+ and Deupi, Xavier and Standfuss, J{\"o}rg",
+ abstract = "Arrestins function as adapter proteins that mediate G
+ protein-coupled receptor (GPCR) desensitization, internalization,
+ and additional rounds of signaling. Here we have compared binding
+ of the GPCR rhodopsin to 403 mutants of arrestin-1 covering its
+ complete sequence. This comprehensive and unbiased mutagenesis
+ approach provides a functional dimension to the crystal
+ structures of inactive, preactivated p44 and phosphopeptide-bound
+ arrestins and will guide our understanding of arrestin-GPCR
+ complexes. The presented functional map quantitatively connects
+ critical interactions in the polar core and along the C tail of
+ arrestin. A series of amino acids (Phe375, Phe377, Phe380, and
+ Arg382) anchor the C tail in a position that blocks binding of
+ the receptor. Interaction of phosphates in the rhodopsin C
+ terminus with Arg29 controls a C-tail exchange mechanism in which
+ the C tail of arrestin is released and exposes several charged
+ amino acids (Lys14, Lys15, Arg18, Lys20, Lys110, and Lys300) for
+ binding of the phosphorylated receptor C terminus. In addition to
+ this arrestin phosphosensor, our data reveal several patches of
+ amino acids in the finger (Gln69 and Asp73-Met75) and the lariat
+ loops (L249-S252 and Y254) that can act as direct binding
+ interfaces. A stretch of amino acids at the edge of the C domain
+ (Trp194-Ser199, Gly337-Gly340, Thr343, and Thr345) could act as
+ membrane anchor, binding interface for a second rhodopsin, or
+ rearrange closer to the central loops upon complex formation. We
+ discuss these interfaces in the context of experimentally guided
+ docking between the crystal structures of arrestin and
+ light-activated rhodopsin.",
+ journal = "Proceedings of the National Academy of Sciences",
+ volume = 111,
+ number = 5,
+ pages = "1825--1830",
+ month = feb,
+ year = 2014,
+ keywords = "cell signaling; membrane receptor; protein engineering; scanning
+ mutagenesis; visual system",
+ language = "en"
+}
+
+@ARTICLE{bloom_experimentally_2014,
+ title = "An experimentally determined evolutionary model dramatically
+ improves phylogenetic fit",
+ author = "Bloom, Jesse D",
+ abstract = "All modern approaches to molecular phylogenetics require a
+ quantitative model for how genes evolve. Unfortunately, existing
+ evolutionary models do not realistically represent the
+ site-heterogeneous selection that governs actual sequence change.
+ Attempts to remedy this problem have involved augmenting these
+ models with a burgeoning number of free parameters. Here, I
+ demonstrate an alternative: Experimental determination of a
+ parameter-free evolutionary model via mutagenesis, functional
+ selection, and deep sequencing. Using this strategy, I create an
+ evolutionary model for influenza nucleoprotein that describes the
+ gene phylogeny far better than existing models with dozens or
+ even hundreds of free parameters. Emerging high-throughput
+ experimental strategies such as the one employed here provide
+ fundamentally new information that has the potential to transform
+ the sensitivity of phylogenetic and genetic analyses.",
+ journal = "Molecular Biology and Evolution",
+ volume = 31,
+ number = 8,
+ pages = "1956--1978",
+ month = aug,
+ year = 2014,
+ keywords = "codon model; deep mutational scanning; influenza; nucleoprotein;
+ phylogenetics; substitution model",
+ language = "en"
+}
+
+@ARTICLE{hietpas_experimental_2011,
+ title = "Experimental illumination of a fitness landscape",
+ author = "Hietpas, Ryan T and Jensen, Jeffrey D and Bolon, Daniel N A",
+ abstract = "The genes of all organisms have been shaped by selective
+ pressures. The relationship between gene sequence and fitness has
+ tremendous implications for understanding both evolutionary
+ processes and functional constraints on the encoded proteins.
+ Here, we have exploited deep sequencing technology to
+ experimentally determine the fitness of all possible individual
+ point mutants under controlled conditions for a nine-amino acid
+ region of Hsp90. Over the past five decades, limited glimpses
+ into the relationship between gene sequence and function have
+ sparked a long debate regarding the distribution, relative
+ proportion, and evolutionary significance of deleterious,
+ neutral, and advantageous mutations. Our systematic experimental
+ measurement of fitness effects of Hsp90 mutants in yeast,
+ evaluated in the light of existing population genetic theory, are
+ remarkably consistent with a nearly neutral model of molecular
+ evolution.",
+ journal = "Proceedings of the National Academy of Sciences of the United States of America",
+ volume = 108,
+ number = 19,
+ pages = "7896--7901",
+ month = may,
+ year = 2011,
+ language = "en"
+}
+
+@ARTICLE{gill_multiple_2023,
+ title = "Multiple mechanisms of self-association of chemokine receptors
+ {CXCR4} and {CCR5} demonstrated by deep mutagenesis",
+ author = "Gill, Kevin S and Mehta, Kritika and Heredia, Jeremiah D and
+ Krishnamurthy, Vishnu V and Zhang, Kai and Procko, Erik",
+ abstract = "Chemokine receptors are members of the rhodopsin-like class A
+ GPCRs whose signaling through G proteins drives the directional
+ movement of cells in response to a chemokine gradient. Chemokine
+ receptors CXCR4 and CCR5 have been extensively studied due to
+ their roles in white blood cell development and inflammation and
+ their status as coreceptors for HIV-1 infection, among other
+ functions. Both receptors form dimers or oligomers but the
+ function/s of self-associations are unclear. While CXCR4 has been
+ crystallized in a dimeric arrangement, available atomic
+ resolution structures of CCR5 are monomeric. To investigate the
+ dimerization interfaces of these chemokine receptors, we used a
+ bimolecular fluorescence complementation (BiFC)-based screen and
+ deep mutational scanning to find mutations that modify receptor
+ self-association. Many disruptive mutations promoted
+ self-associations nonspecifically, suggesting they aggregated in
+ the membrane. A mutationally intolerant region was found on CXCR4
+ that matched the crystallographic dimer interface, supporting
+ this dimeric arrangement in living cells. A mutationally
+ intolerant region was also observed on the surface of CCR5 by
+ transmembrane helices 3 and 4. Mutations from the deep mutational
+ scan that reduce BiFC were validated and were localized in the
+ transmembrane domains as well as the C-terminal cytoplasmic tails
+ where they reduced lipid microdomain localization. The reduced
+ self-association mutants of CXCR4 had increased binding to the
+ ligand CXCL12 but diminished calcium signaling. There was no
+ change in syncytia formation with cells expressing HIV-1 Env. The
+ data highlight that multiple mechanisms are involved in
+ self-association of chemokine receptor chains.",
+ journal = "bioRxiv",
+ month = mar,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{meier_deep_2023,
+ title = "Deep mutational scan of a drug efflux pump reveals its
+ structure-function landscape",
+ author = "Meier, Gianmarco and Thavarasah, Sujani and Ehrenbolger, Kai and
+ Hutter, Cedric A J and H{\"u}rlimann, Lea M and Barandun, Jonas
+ and Seeger, Markus A",
+ abstract = "Drug efflux is a common resistance mechanism found in bacteria
+ and cancer cells, but studies providing comprehensive functional
+ insights are scarce. In this study, we performed deep mutational
+ scanning (DMS) on the bacterial ABC transporter EfrCD to
+ determine the drug efflux activity profile of more than 1,430
+ single variants. These systematic measurements revealed that the
+ introduction of negative charges at different locations within
+ the large substrate binding pocket results in strongly increased
+ efflux activity toward positively charged ethidium, whereas
+ additional aromatic residues did not display the same effect.
+ Data analysis in the context of an inward-facing cryogenic
+ electron microscopy structure of EfrCD uncovered a high-affinity
+ binding site, which releases bound drugs through a peristaltic
+ transport mechanism as the transporter transits to its
+ outward-facing conformation. Finally, we identified substitutions
+ resulting in rapid Hoechst influx without affecting the efflux
+ activity for ethidium and daunorubicin. Hence, single mutations
+ can convert EfrCD into a drug-specific ABC importer.",
+ journal = "Nature Chemical Biology",
+ volume = 19,
+ number = 4,
+ pages = "440--450",
+ month = apr,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{tan_high_2023,
+ title = "High-throughput identification of prefusion-stabilizing mutations
+ in {SARS-CoV-2} spike",
+ author = "Tan, Timothy J C and Mou, Zongjun and Lei, Ruipeng and Ouyang,
+ Wenhao O and Yuan, Meng and Song, Ge and Andrabi, Raiees and
+ Wilson, Ian A and Kieffer, Collin and Dai, Xinghong and Matreyek,
+ Kenneth A and Wu, Nicholas C",
+ abstract = "Designing prefusion-stabilized SARS-CoV-2 spike is critical for
+ the effectiveness of COVID-19 vaccines. All COVID-19 vaccines in
+ the US encode spike with K986P/V987P mutations to stabilize its
+ prefusion conformation. However, contemporary methods on
+ engineering prefusion-stabilized spike immunogens involve tedious
+ experimental work and heavily rely on structural information.
+ Here, we establish a systematic and unbiased method of
+ identifying mutations that concomitantly improve expression and
+ stabilize the prefusion conformation of the SARS-CoV-2 spike. Our
+ method integrates a fluorescence-based fusion assay, mammalian
+ cell display technology, and deep mutational scanning. As a
+ proof-of-concept, we apply this method to a region in the S2
+ domain that includes the first heptad repeat and central helix.
+ Our results reveal that besides K986P and V987P, several
+ mutations simultaneously improve expression and significantly
+ lower the fusogenicity of the spike. As prefusion stabilization
+ is a common challenge for viral immunogen design, this work will
+ help accelerate vaccine development against different viruses.",
+ journal = "Nature Communications",
+ volume = 14,
+ number = 1,
+ pages = "2003",
+ month = apr,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{macrae_protein_2023,
+ title = "Protein-protein interactions in the Mla lipid transport system
+ probed by computational structure prediction and deep mutational
+ scanning",
+ author = "MacRae, Mark R and Puvanendran, Dhenesh and Haase, Max A B and
+ Coudray, Nicolas and Kolich, Ljuvica and Lam, Cherry and Baek,
+ Minkyung and Bhabha, Gira and Ekiert, Damian C",
+ abstract = "The outer membrane (OM) of Gram-negative bacteria is an
+ asymmetric bilayer that protects the cell from external
+ stressors, such as antibiotics. The Mla transport system is
+ implicated in the Maintenance of OM Lipid Asymmetry by mediating
+ retrograde phospholipid transport across the cell envelope. Mla
+ uses a shuttle-like mechanism to move lipids between the MlaFEDB
+ inner membrane complex and the MlaA-OmpF/C OM complex, via a
+ periplasmic lipid-binding protein, MlaC. MlaC binds to MlaD and
+ MlaA, but the underlying protein-protein interactions that
+ facilitate lipid transfer are not well understood. Here, we take
+ an unbiased deep mutational scanning approach to map the fitness
+ landscape of MlaC from Escherichia coli, which provides insights
+ into important functional sites. Combining this analysis with
+ AlphaFold2 structure predictions and binding experiments, we map
+ the MlaC-MlaA and MlaC-MlaD protein-protein interfaces. Our
+ results suggest that the MlaD and MlaA binding surfaces on MlaC
+ overlap to a large extent, leading to a model in which MlaC can
+ only bind one of these proteins at a time. Low-resolution
+ cryo-electron microscopy (cryo-EM) maps of MlaC bound to MlaFEDB
+ suggest that at least two MlaC molecules can bind to MlaD at
+ once, in a conformation consistent with AlphaFold2 predictions.
+ These data lead us to a model for MlaC interaction with its
+ binding partners and insights into lipid transfer steps that
+ underlie phospholipid transport between the bacterial inner and
+ OMs.",
+ journal = "Journal of Biological Chemistry",
+ volume = 299,
+ number = 6,
+ pages = "104744",
+ month = jun,
+ year = 2023,
+ keywords = "AlphaFold2; E. coli; Mla; bacterial cell envelope; deep
+ mutational scanning; lipid transfer; protein interaction",
+ language = "en"
+}
+
+@ARTICLE{ghose_marginal_2023,
+ title = "Marginal specificity in protein interactions constrains evolution
+ of a paralogous family",
+ author = "Ghose, Dia A and Przydzial, Kaitlyn E and Mahoney, Emily M and
+ Keating, Amy E and Laub, Michael T",
+ abstract = "The evolution of novel functions in biology relies heavily on
+ gene duplication and divergence, creating large paralogous
+ protein families. Selective pressure to avoid detrimental
+ cross-talk often results in paralogs that exhibit exquisite
+ specificity for their interaction partners. But how robust or
+ sensitive is this specificity to mutation? Here, using deep
+ mutational scanning, we demonstrate that a paralogous family of
+ bacterial signaling proteins exhibits marginal specificity, such
+ that many individual substitutions give rise to substantial
+ cross-talk between normally insulated pathways. Our results
+ indicate that sequence space is locally crowded despite overall
+ sparseness, and we provide evidence that this crowding has
+ constrained the evolution of bacterial signaling proteins. These
+ findings underscore how evolution selects for ``good enough''
+ rather than optimized phenotypes, leading to restrictions on the
+ subsequent evolution of paralogs.",
+ journal = "Proceedings of the National Academy of Sciences of the United States of America",
+ volume = 120,
+ number = 18,
+ pages = "e2221163120",
+ month = may,
+ year = 2023,
+ keywords = "gene duplication; paralogous proteins; protein evolution;
+ protein-protein interactions; signal transduction",
+ language = "en"
+}
+
+@UNPUBLISHED{nguyen_genetic_2023,
+ title = "The genetic landscape of a metabolic interaction",
+ author = "Nguyen, Thuy N and Ingle, Christine and Thompson, Samuel and
+ Reynolds, Kimberly A",
+ abstract = "AbstractEnzyme abundance, catalytic activity, and ultimately
+ sequence are all shaped by the need of growing cells to maintain
+ metabolic flux while minimizing accumulation of deleterious
+ intermediates. To quantify how variation in the activity of one
+ enzyme constrains the biochemical parameters and sequence of
+ another, we focused on dihydrofolate reductase (DHFR) and
+ thymidylate synthase (TYMS), a pair of enzymes catalyzing
+ consecutive reactions in folate metabolism. We used deep
+ mutational scanning to quantify the growth rate effect of 2,696
+ DHFR single mutations in 3 TYMS backgrounds and show that our
+ data are well-described by a relatively simple enzyme velocity to
+ growth rate model. From the data and model we estimate the
+ approximate effects of all single mutations on DHFR catalytic
+ power. Together our data provide a comprehensive view of
+ epistasis between mutations in a biochemically linked enzyme
+ pair, reveal the structural distribution of positions tuning DHFR
+ catalysis, and establish a foundation for the design of
+ multi-enzyme systems.",
+ journal = "bioRxiv",
+ month = may,
+ year = 2023
+}
+
+@UNPUBLISHED{clausen_mutational_2023,
+ title = "A mutational atlas for Parkin proteostasis",
+ author = "Clausen, Lene and Voutsinos, Vasileios and Cagiada, Matteo and
+ Johansson, Kristoffer E and Gr{\o}nb{\ae}k-Thygesen, Martin and
+ Nariya, Snehal and Powell, Rachel L and Have, Magnus K N and
+ Oestergaard, Vibe H and Stein, Amelie and Fowler, Douglas M and
+ Lindorff-Larsen, Kresten and Hartmann-Petersen, Rasmus",
+ abstract = "The delicate balance of protein homeostasis can be disturbed by
+ mutations that affect folding and stability of the encoded
+ protein. More than half of disease-causing missense variants are
+ thought to lead to protein degradation, but determining which and
+ the molecular mechanisms involved remain enigmatic. To examine
+ these issues, we selected the ubiquitin-protein ligase Parkin,
+ where known missense variants result in an autosomal recessive,
+ early onset Parkinsonism. We used the variant abundance by
+ massively parallel sequencing (VAMP-seq) approach to quantify the
+ abundance of Parkin missense variants in cultured human cells.
+ The resulting mutational map, covering 9219 out of the 9300
+ possible single-site amino acid substitutions and nonsense Parkin
+ variants, show that most low abundance variants are located
+ within the structured domains of the protein, while the flexible
+ linker regions are more tolerant. The vast majority of low
+ abundance Parkin variants are degraded through the
+ ubiquitin-proteasome system and are stabilized at a lowered
+ temperature. The cellular abundance data correlate with
+ thermodynamic stability, evolutionary conservation, and show that
+ half of the known disease-linked variants are found at low
+ abundance. Systematic mapping of degradation signals (degrons)
+ shows that inherent primary degrons in Parkin largely overlap
+ with regions that are buried, and highly sensitive to mutations.
+ An exposed degron region proximal to the so-called ``activation
+ element'' is enhanced by substitutions to hydrophobic residues
+ and destroyed by introduction of hydrophilic residues. The data
+ provide examples of how missense variants may cause degradation
+ either via destabilization of the native protein, or by
+ introducing local signals for degradation. Combined with the
+ computational methods based on Parkin structure and conservation,
+ our abundance map sheds light on the mechanisms that cause loss
+ of function, and points to variants where function potentially
+ can be restored. \#\#\# Competing Interest Statement The authors
+ have declared no competing interest.",
+ journal = "bioRxiv",
+ pages = "2023.06.08.544160",
+ month = jun,
+ year = 2023,
+ language = "en"
+}
+
+@UNPUBLISHED{vanella_understanding_2023,
+ title = "Understanding {Activity-Stability} Tradeoffs in Biocatalysts by
+ Enzyme Proximity Sequencing",
+ author = "Vanella, Rosario and K{\"u}ng, Christoph and Schoepfer, Alexandre
+ A and Doffini, Vanni and Ren, Jin and Nash, Michael A",
+ abstract = "Understanding the complex relationships between enzyme sequence,
+ folding stability and catalytic activity is crucial for
+ applications in industry and biomedicine. However, current enzyme
+ assay technologies are limited by an inability to simultaneously
+ resolve both stability and activity phenotypes and to couple
+ these to gene sequences at large scale. Here we developed Enzyme
+ Proximity Sequencing (EP-Seq), a deep mutational scanning method
+ that leverages peroxidase-mediated radical labeling with single
+ cell fidelity to dissect the effects of thousands of mutations on
+ stability and catalytic activity of oxidoreductase enzymes in a
+ single experiment. We used EP-Seq to analyze how 6,387 missense
+ mutations influence folding stability and catalytic activity in a
+ D-amino acid oxidase (DAOx) from R. gracilis . The resulting
+ datasets demonstrate activity-based constraints that limit
+ folding stability during natural evolution, and identify hotspots
+ distant from the active site as candidates for mutations that
+ improve catalytic activity without sacrificing stability. EP-Seq
+ can be extended to other enzyme classes and provides valuable
+ insights into biophysical principles governing enzyme structure
+ and function. \#\#\# Competing Interest Statement The authors
+ have declared no competing interest.",
+ journal = "bioRxiv",
+ pages = "2023.02.24.529916",
+ month = mar,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{lei_mutational_2023,
+ title = "Mutational fitness landscape of human influenza {H3N2}
+ neuraminidase",
+ author = "Lei, Ruipeng and Hernandez Garcia, Andrea and Tan, Timothy J C
+ and Teo, Qi Wen and Wang, Yiquan and Zhang, Xiwen and Luo,
+ Shitong and Nair, Satish K and Peng, Jian and Wu, Nicholas C",
+ abstract = "Influenza neuraminidase (NA) has received increasing attention as
+ an effective vaccine target. However, its mutational tolerance is
+ not well characterized. Here, the fitness effects of >6,000
+ mutations in human H3N2 NA are probed using deep mutational
+ scanning. Our result shows that while its antigenic regions have
+ high mutational tolerance, there are solvent-exposed regions with
+ low mutational tolerance. We also find that protein stability is
+ a major determinant of NA mutational fitness. The deep mutational
+ scanning result correlates well with mutational fitness inferred
+ from natural sequences using a protein language model,
+ substantiating the relevance of our findings to the natural
+ evolution of circulating strains. Additional analysis further
+ suggests that human H3N2 NA is far from running out of mutations
+ despite already evolving for >50 years. Overall, this study
+ advances our understanding of the evolutionary potential of NA
+ and the underlying biophysical constraints, which in turn provide
+ insights into NA-based vaccine design.",
+ journal = "Cell Reports",
+ volume = 42,
+ number = 1,
+ pages = "111951",
+ month = jan,
+ year = 2023,
+ keywords = "CP: Molecular biology; deep mutational scanning; evolution;
+ influenza; neuraminidase; protein language model; protein
+ stability; protein structure",
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@ARTICLE{loggerenberg_systematically_2023,
+ title = "Systematically testing human {HMBS} missense variants to reveal
+ mechanism and pathogenic variation",
+ author = "van Loggerenberg, Warren and Sowlati-Hashjin, Shahin and Weile,
+ Jochen and Hamilton, Rayna and Chawla, Aditya and Gebbia,
+ Marinella and Kishore, Nishka and Fr{\'e}sard, Laure and
+ Mustajoki, Sami and Pischik, Elena and Pierro, Elena Di and
+ Barbaro, Michela and Floderus, Ylva and Schmitt, Caroline and
+ Gouya, Laurent and Colavin, Alexandre and Nussbaum, Robert and
+ Friesema, Edith C H and Kauppinen, Raili and To-Figueras, Jordi
+ and Aarsand, Aasne K and Desnick, Robert J and Garton, Michael
+ and Roth, Frederick P",
+ abstract = "Defects in hydroxymethylbilane synthase (HMBS) can cause Acute
+ Intermittent Porphyria (AIP), an acute neurological disease.
+ Although sequencing-based diagnosis can be definitive, ~⅓ of
+ clinical HMBS variants are missense variants, and most
+ clinically-reported HMBS missense variants are designated as
+ ``variants of uncertain significance'' (VUS). Using saturation
+ mutagenesis, en masse selection, and sequencing, we applied a
+ multiplexed validated assay to both the erythroid-specific and
+ ubiquitous isoforms of HMBS, obtaining confident functional
+ impact scores for >84\% of all possible amino-acid substitutions.
+ The resulting variant effect maps generally agreed with
+ biochemical expectation. However, the maps showed variants at the
+ dimerization interface to be unexpectedly well tolerated, and
+ suggested residue roles in active site dynamics that were
+ supported by molecular dynamics simulations. Most importantly,
+ these HMBS variant effect maps can help discriminate pathogenic
+ from benign variants, proactively providing evidence even for
+ yet-to-be-observed clinical missense variants.",
+ journal = "bioRxiv",
+ month = feb,
+ year = 2023,
+ language = "en"
+}
+
+% The entry below contains non-ASCII chars that could not be converted
+% to a LaTeX equivalent.
+@ARTICLE{weeks_fitness_2023,
+ title = "Fitness and Functional Landscapes of the E. coli {RNase} {III}
+ Gene rnc",
+ author = "Weeks, Ryan and Ostermeier, Marc",
+ abstract = "How protein properties such as protein activity and protein
+ essentiality affect the distribution of fitness effects (DFE) of
+ mutations are important questions in protein evolution. Deep
+ mutational scanning studies typically measure the effects of a
+ comprehensive set of mutations on either protein activity or
+ fitness. Our understanding of the underpinnings of the DFE would
+ be enhanced by a comprehensive study of both for the same gene.
+ Here, we compared the fitness effects and in vivo protein
+ activity effects of ∼4,500 missense mutations in the E. coli rnc
+ gene. This gene encodes RNase III, a global regulator enzyme that
+ cleaves diverse RNA substrates including precursor ribosomal RNA
+ and various mRNAs including its own 5' untranslated region
+ (5'UTR). We find that RNase III's ability to cleave dsRNA is the
+ most important determinant of the fitness effects of rnc
+ mutations. The DFE of RNase III was bimodal, with mutations
+ centered around neutral and deleterious effects, consistent with
+ previously reported DFE's of enzymes with a singular
+ physiological role. Fitness was buffered to small effects on
+ RNase III activity. The enzyme's RNase III domain, which contains
+ the RNase III signature motif and all active site residues, was
+ more sensitive to mutation than its dsRNA binding domain, which
+ is responsible for recognition and binding to dsRNA. Differential
+ effects on fitness and functional scores for mutations at highly
+ conserved residues G97, G99, and F188 suggest that these
+ positions may be important for RNase III cleavage specificity.",
+ journal = "Molecular Biology and Evolution",
+ volume = 40,
+ number = 3,
+ month = mar,
+ year = 2023,
+ keywords = "RNase III; fitness landscape; protein evolution",
+ language = "en"
+}
+
+@ARTICLE{muhammad_high_2023,
+ title = "High-throughput functional mapping of variants in an arrhythmia
+ gene, {KCNE1} , reveals novel biology",
+ author = "Muhammad, Ayesha and Calandranis, Maria E and Li, Bian and Yang,
+ Tao and Blackwell, Daniel J and Harvey, M Lorena and Smith,
+ Jeremy E and Chew, Ashli E and Capra, John A and Matreyek,
+ Kenneth A and Fowler, Douglas M and Roden, Dan M and Glazer,
+ Andrew M",
+ abstract = "BACKGROUND: KCNE1 encodes a 129-residue cardiac potassium channel
+ (I Ks ) subunit. KCNE1 variants are associated with long QT
+ syndrome and atrial fibrillation. However, most variants have
+ insufficient evidence of clinical consequences and thus limited
+ clinical utility. RESULTS: Here, we demonstrate the power of
+ variant effect mapping, which couples saturation mutagenesis with
+ high-throughput sequencing, to ascertain the function of
+ thousands of protein coding KCNE1 variants. We comprehensively
+ assayed KCNE1 variant cell surface expression (2,554/2,709
+ possible single amino acid variants) and function (2,539
+ variants). We identified 470 loss-of-surface expression and 588
+ loss-of-function variants. Out of the 588 loss-of-function
+ variants, only 155 had low cell surface expression. The latter
+ half of the protein is dispensable for protein trafficking but
+ essential for channel function. 22 of the 30 KCNE1 residues
+ (73\%) highly intolerant of variation were in predicted close
+ contact with binding partners KCNQ1 or calmodulin. Our data were
+ highly concordant with gold standard electrophysiological data
+ ($\rho$ = -0.65), population and patient cohorts (32/38
+ concordant variants), and computational metrics ($\rho$ = -0.55).
+ Our data provide moderate-strength evidence for the ACMG/AMP
+ functional criteria for benign and pathogenic variants.
+ CONCLUSIONS: Comprehensive variant effect maps of KCNE1 can both
+ provide insight into I Ks channel biology and help reclassify
+ variants of uncertain significance.",
+ journal = "bioRxiv",
+ month = apr,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{yee_full_2023,
+ title = "The full spectrum of {OCT1} ({SLC22A1}) mutations bridges
+ transporter biophysics to drug pharmacogenomics",
+ author = "Yee, Sook Wah and Macdonald, Christian and Mitrovic, Darko and
+ Zhou, Xujia and Koleske, Megan L and Yang, Jia and Silva, Dina
+ Buitrago and Grimes, Patrick Rockefeller and Trinidad, Donovan
+ and More, Swati S and Kachuri, Linda and Witte, John S and
+ Delemotte, Lucie and Giacomini, Kathleen M and Coyote-Maestas,
+ Willow",
+ abstract = "Membrane transporters play a fundamental role in the tissue
+ distribution of endogenous compounds and xenobiotics and are
+ major determinants of efficacy and side effects profiles.
+ Polymorphisms within these drug transporters result in
+ inter-individual variation in drug response, with some patients
+ not responding to the recommended dosage of drug whereas others
+ experience catastrophic side effects. For example, variants
+ within the major hepatic Human organic cation transporter OCT1
+ (SLC22A1) can change endogenous organic cations and many
+ prescription drug levels. To understand how variants
+ mechanistically impact drug uptake, we systematically study how
+ all known and possible single missense and single amino acid
+ deletion variants impact expression and substrate uptake of OCT1.
+ We find that human variants primarily disrupt function via
+ folding rather than substrate uptake. Our study revealed that the
+ major determinants of folding reside in the first 300 amino
+ acids, including the first 6 transmembrane domains and the
+ extracellular domain (ECD) with a stabilizing and highly
+ conserved stabilizing helical motif making key interactions
+ between the ECD and transmembrane domains. Using the functional
+ data combined with computational approaches, we determine and
+ validate a structure-function model of OCT1s conformational
+ ensemble without experimental structures. Using this model and
+ molecular dynamic simulations of key mutants, we determine
+ biophysical mechanisms for how specific human variants alter
+ transport phenotypes. We identify differences in frequencies of
+ reduced function alleles across populations with East Asians vs
+ European populations having the lowest and highest frequency of
+ reduced function variants, respectively. Mining human population
+ databases reveals that reduced function alleles of OCT1
+ identified in this study associate significantly with high LDL
+ cholesterol levels. Our general approach broadly applied could
+ transform the landscape of precision medicine by producing a
+ mechanistic basis for understanding the effects of human
+ mutations on disease and drug response.",
+ journal = "bioRxiv",
+ month = jun,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{chen_deep_2023,
+ title = "Deep Mutational Scanning of an {Oxygen-Independent} Fluorescent
+ Protein {CreiLOV} for Comprehensive Profiling of Mutational and
+ Epistatic Effects",
+ author = "Chen, Yongcan and Hu, Ruyun and Li, Keyi and Zhang, Yating and
+ Fu, Lihao and Zhang, Jianzhi and Si, Tong",
+ abstract = "Oxygen-independent, flavin mononucleotide-based fluorescent
+ proteins (FbFPs) are promising alternatives to green fluorescent
+ protein in anaerobic contexts. Deep mutational scanning performs
+ systematic profiling of protein sequence-function relationships
+ but has not been applied to FbFPs. Focusing on CreiLOV from
+ Chlamydomonas reinhardtii, we created and analyzed two
+ comprehensive mutant collections: (1) single-residue,
+ site-saturation mutagenesis libraries covering all 118 residues;
+ and (2) a full combinatorial metagenesis library among 20
+ mutations at 15 residues, where mutation and residue selection
+ was based on single-site mutagenesis results. Notably, the second
+ type of library is indispensable to study higher-order epistasis
+ but underrepresented in the literature. Using optimized FACS-seq
+ assays, 2,185 (>92.5\%) out of 2,360 possible single-site mutants
+ and 165,428 (>89.7\%) out of 184,320 possible combinatorial
+ mutants were reliably assigned with fitness values. We
+ constructed statistical and machine-learning models to analyze
+ the CreiLOV data set, enabling accurate fitness prediction of
+ higher-order mutants using lower-order mutagenesis data. In
+ addition, we successfully isolated CreiLOV variants with improved
+ fluorescence quantum yield and thermostability. This work
+ provides new empirical data and design rules to engineer
+ combinatorial protein variants.",
+ journal = "ACS Synthetic Biology",
+ volume = 12,
+ number = 5,
+ pages = "1461--1473",
+ month = may,
+ year = 2023,
+ keywords = "CreiLOV; FACS-Seq; deep mutational scanning; epistasis; protein
+ engineering",
+ language = "en"
+}
+
+@ARTICLE{gersing_characterizing_2023,
+ title = "Characterizing glucokinase variant mechanisms using a multiplexed
+ abundance assay",
+ author = "Gersing, Sarah and Schulze, Thea K and Cagiada, Matteo and Stein,
+ Amelie and Roth, Frederick P and Lindorff-Larsen, Kresten and
+ Hartmann-Petersen, Rasmus",
+ abstract = "Amino acid substitutions can perturb protein activity in multiple
+ ways. Understanding their mechanistic basis may pinpoint how
+ residues contribute to protein function. Here, we characterize
+ the mechanisms of human glucokinase (GCK) variants, building on
+ our previous comprehensive study on GCK variant activity. We
+ assayed the abundance of 95\% of GCK missense and nonsense
+ variants, and found that 43\% of hypoactive variants have a
+ decreased cellular abundance. By combining our abundance scores
+ with predictions of protein thermodynamic stability, we identify
+ residues important for GCK metabolic stability and conformational
+ dynamics. These residues could be targeted to modulate GCK
+ activity, and thereby affect glucose homeostasis.",
+ journal = "bioRxiv",
+ month = may,
+ year = 2023,
+ language = "en"
+}
+
+@ARTICLE{huttinger_deep_2021,
+ title = "Deep mutational scanning of the plasminogen activator inhibitor-1
+ functional landscape",
+ author = "Huttinger, Zachary M and Haynes, Laura M and Yee, Andrew and
+ Kretz, Colin A and Holding, Matthew L and Siemieniak, David R and
+ Lawrence, Daniel A and Ginsburg, David",
+ abstract = "The serine protease inhibitor (SERPIN) plasminogen activator
+ inhibitor-1 (PAI-1) is a key regulator of the fibrinolytic
+ system, inhibiting the serine proteases tissue- and
+ urokinase-type plasminogen activator (tPA and uPA, respectively).
+ Missense variants render PAI-1 non-functional through misfolding,
+ leading to its turnover as a protease substrate, or to a more
+ rapid transition to the latent/inactive state. Deep mutational
+ scanning was performed to evaluate the impact of amino acid
+ sequence variation on PAI-1 inhibition of uPA using an M13
+ filamentous phage display system. Error prone PCR was used to
+ construct a mutagenized PAI-1 library encompassing ~ 70\% of
+ potential single amino acid substitutions. The relative effects
+ of 27\% of all possible missense variants on PAI-1 inhibition of
+ uPA were determined using high-throughput DNA sequencing. 826
+ missense variants demonstrated conserved inhibitory activity
+ while 1137 resulted in loss of PAI-1 inhibitory function. The
+ least evolutionarily conserved regions of PAI-1 were also
+ identified as being the most tolerant of missense mutations. The
+ results of this screen confirm previous low-throughput mutational
+ studies, including those of the reactive center loop. These data
+ provide a powerful resource for explaining structure-function
+ relationships for PAI-1 and for the interpretation of human
+ genomic sequence variants.",
+ journal = "Scientific Reports",
+ volume = 11,
+ number = 1,
+ pages = "18827",
+ month = sep,
+ year = 2021,
+ language = "en"
+}
+
+@article{kwon_structurefunction_2022,
+ title = {Structure–function analysis of the {SHOC2}–{MRAS}–{PP1C} holophosphatase complex},
+ volume = {609},
+ issn = {0028-0836, 1476-4687},
+ url = {https://www.nature.com/articles/s41586-022-04928-2},
+ doi = {10.1038/s41586-022-04928-2},
+ language = {en},
+ number = {7926},
+ urldate = {2023-10-16},
+ journal = {Nature},
+ author = {Kwon, Jason J. and Hajian, Behnoush and Bian, Yuemin and Young, Lucy C. and Amor, Alvaro J. and Fuller, James R. and Fraley, Cara V. and Sykes, Abbey M. and So, Jonathan and Pan, Joshua and Baker, Laura and Lee, Sun Joo and Wheeler, Douglas B. and Mayhew, David L. and Persky, Nicole S. and Yang, Xiaoping and Root, David E. and Barsotti, Anthony M. and Stamford, Andrew W. and Perry, Charles K. and Burgin, Alex and McCormick, Frank and Lemke, Christopher T. and Hahn, William C. and Aguirre, Andrew J.},
+ month = sep,
+ year = {2022},
+ pages = {408--415},
+ file = {Accepted Version:/Users/admin/Zotero/storage/R96FAMJM/Kwon et al. - 2022 - Structure–function analysis of the SHOC2–MRAS–PP1C.pdf:application/pdf},
+}
+
+@article{sun_proactive_2020,
+ title = {A proactive genotype-to-patient-phenotype map for cystathionine beta-synthase},
+ volume = {12},
+ issn = {1756-994X},
+ url = {https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-020-0711-1},
+ doi = {10.1186/s13073-020-0711-1},
+ abstract = {Abstract
+
+ Background
+
+ For the majority of rare clinical missense variants, pathogenicity status cannot currently be classified. Classical homocystinuria, characterized by elevated homocysteine in plasma and urine, is caused by variants in the cystathionine beta-synthase (
+ CBS
+ ) gene, most of which are rare. With early detection, existing therapies are highly effective.
+
+
+
+ Methods
+
+ Damaging
+ CBS
+ variants can be detected based on their failure to restore growth in yeast cells lacking the yeast ortholog
+ CYS4
+ . This assay has only been applied reactively, after first observing a variant in patients. Using saturation codon-mutagenesis, en masse growth selection, and sequencing, we generated a comprehensive, proactive map of CBS missense variant function.
+
+
+
+ Results
+
+ Our CBS variant effect map far exceeds the performance of computational predictors of disease variants. Map scores correlated strongly with both disease severity (Spearman’s
+ ϱ
+ = 0.9) and human clinical response to vitamin B
+ 6
+ (
+ ϱ
+ = 0.93).
+
+
+
+ Conclusions
+ We demonstrate that highly multiplexed cell-based assays can yield proactive maps of variant function and patient response to therapy, even for rare variants not previously seen in the clinic.},
+ language = {en},
+ number = {1},
+ urldate = {2023-10-16},
+ journal = {Genome Medicine},
+ author = {Sun, Song and Weile, Jochen and Verby, Marta and Wu, Yingzhou and Wang, Yang and Cote, Atina G. and Fotiadou, Iosifina and Kitaygorodsky, Julia and Vidal, Marc and Rine, Jasper and Ješina, Pavel and Kožich, Viktor and Roth, Frederick P.},
+ month = dec,
+ year = {2020},
+ pages = {13},
+ file = {Full Text:/Users/admin/Zotero/storage/4KCD9FVJ/Sun et al. - 2020 - A proactive genotype-to-patient-phenotype map for .pdf:application/pdf},
+}
+
+@article{wan_characterizing_2019,
+ title = {Characterizing variants of unknown significance in rhodopsin: {A} functional genomics approach},
+ volume = {40},
+ issn = {1059-7794, 1098-1004},
+ shorttitle = {Characterizing variants of unknown significance in rhodopsin},
+ url = {https://onlinelibrary.wiley.com/doi/10.1002/humu.23762},
+ doi = {10.1002/humu.23762},
+ language = {en},
+ number = {8},
+ urldate = {2023-10-16},
+ journal = {Human Mutation},
+ author = {Wan, Aliete and Place, Emily and Pierce, Eric A. and Comander, Jason},
+ month = aug,
+ year = {2019},
+ pages = {1127--1144},
+ file = {Full Text:/Users/admin/Zotero/storage/SSKD3J59/Wan et al. - 2019 - Characterizing variants of unknown significance in.pdf:application/pdf},
+}
+
+@article{chan_engineering_2020,
+ title = {Engineering human {ACE2} to optimize binding to the spike protein of {SARS} coronavirus 2},
+ volume = {369},
+ issn = {0036-8075, 1095-9203},
+ url = {https://www.science.org/doi/10.1126/science.abc0870},
+ doi = {10.1126/science.abc0870},
+ abstract = {A decoy receptor for SARS-CoV-2
+
+ For severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to enter human cells, the spike protein on the surface of the virus must bind to the host receptor protein, angiotensin-converting enzyme 2 (ACE2). A soluble version of the receptor is being explored as a therapeutic. Chan
+ et al.
+ used deep mutagenesis to identify ACE2 mutants that bind more tightly to the spike protein and combined mutations to further increase binding affinity (see the Perspective by DeKosky). A promising variant was engineered to be a stable dimer that has a binding affinity for the spike protein; it is comparable with neutralizing antibodies and neutralized both SARS-CoV-2 and SARS-CoV-1 in a cell-based assay. In addition, the similarity to the natural receptor may limit the possibility for viral escape.
+
+
+ Science
+ , this issue p.
+ 1261
+ ; see also p.
+ 1167
+
+ ,
+ A variant of ACE2 based on deep mutagenesis far outcompetes the natural receptor in binding the SARS-CoV-2 spike protein.
+ ,
+ The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) binds angiotensin-converting enzyme 2 (ACE2) on host cells to initiate entry, and soluble ACE2 is a therapeutic candidate that neutralizes infection by acting as a decoy. By using deep mutagenesis, mutations in ACE2 that increase S binding are found across the interaction surface, in the asparagine 90–glycosylation motif and at buried sites. The mutational landscape provides a blueprint for understanding the specificity of the interaction between ACE2 and S and for engineering high-affinity decoy receptors. Combining mutations gives ACE2 variants with affinities that rival those of monoclonal antibodies. A stable dimeric variant shows potent SARS-CoV-2 and -1 neutralization in vitro. The engineered receptor is catalytically active, and its close similarity with the native receptor may limit the potential for viral escape.},
+ language = {en},
+ number = {6508},
+ urldate = {2023-10-16},
+ journal = {Science},
+ author = {Chan, Kui K. and Dorosky, Danielle and Sharma, Preeti and Abbasi, Shawn A. and Dye, John M. and Kranz, David M. and Herbert, Andrew S. and Procko, Erik},
+ month = sep,
+ year = {2020},
+ pages = {1261--1265},
+ file = {Full Text:/Users/admin/Zotero/storage/RHH3R5SV/Chan et al. - 2020 - Engineering human ACE2 to optimize binding to the .pdf:application/pdf},
+}
+
+@techreport{silverstein_systematic_2021,
+ type = {preprint},
+ title = {A systematic genotype-phenotype map for missense variants in the human intellectual disability-associated gene \textit{{GDI1}}},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2021.10.06.463360},
+ abstract = {Abstract
+
+ Next generation sequencing has become a common tool in the diagnosis of genetic diseases. However, for the vast majority of genetic variants that are discovered, a clinical interpretation is not available. Variant effect mapping allows the functional effects of many single amino acid variants to be characterized in parallel. Here, we combine multiplexed functional assays with machine learning to assess the effects of amino acid substitutions in the human intellectual disability-associated gene,
+ GDI1
+ . We show that the resulting variant effect map can be used to discriminate pathogenic from benign variants. Our variant effect map recovers known biochemical and structural features of
+ GDI1
+ and reveals additional aspects of
+ GDI1
+ function. We explore how our functional assays can aid in the interpretation of novel
+ GDI1
+ variants as they are discovered, and to re-classify previously observed variants of unknown significance.},
+ language = {en},
+ urldate = {2023-10-16},
+ institution = {Genetics},
+ author = {Silverstein, Rachel A. and Sun, Song and Verby, Marta and Weile, Jochen and Wu, Yingzhou and Gebbia, Marinella and Fotiadou, Iosifina and Kitaygorodsky, Julia and Roth, Frederick P.},
+ month = oct,
+ year = {2021},
+ doi = {10.1101/2021.10.06.463360},
+ file = {Submitted Version:/Users/admin/Zotero/storage/4EYJCVVM/Silverstein et al. - 2021 - A systematic genotype-phenotype map for missense v.pdf:application/pdf},
+}
+
+@article{zinkus-boltz_phage-assisted_2019,
+ title = {A {Phage}-{Assisted} {Continuous} {Selection} {Approach} for {Deep} {Mutational} {Scanning} of {Protein}–{Protein} {Interactions}},
+ volume = {14},
+ issn = {1554-8929, 1554-8937},
+ url = {https://pubs.acs.org/doi/10.1021/acschembio.9b00669},
+ doi = {10.1021/acschembio.9b00669},
+ language = {en},
+ number = {12},
+ urldate = {2023-10-16},
+ journal = {ACS Chemical Biology},
+ author = {Zinkus-Boltz, Julia and DeValk, Craig and Dickinson, Bryan C.},
+ month = dec,
+ year = {2019},
+ pages = {2757--2767},
+ file = {Submitted Version:/Users/admin/Zotero/storage/XRNNITUF/Zinkus-Boltz et al. - 2019 - A Phage-Assisted Continuous Selection Approach for.pdf:application/pdf},
+}
+
+@article{klesmith_retargeting_2019,
+ title = {Retargeting {CD19} {Chimeric} {Antigen} {Receptor} {T} {Cells} via {Engineered} {CD19}-{Fusion} {Proteins}},
+ volume = {16},
+ issn = {1543-8384, 1543-8392},
+ url = {https://pubs.acs.org/doi/10.1021/acs.molpharmaceut.9b00418},
+ doi = {10.1021/acs.molpharmaceut.9b00418},
+ language = {en},
+ number = {8},
+ urldate = {2023-10-16},
+ journal = {Molecular Pharmaceutics},
+ author = {Klesmith, Justin R. and Su, Lihe and Wu, Lan and Schrack, Ian A. and Dufort, Fay J. and Birt, Alyssa and Ambrose, Christine and Hackel, Benjamin J. and Lobb, Roy R. and Rennert, Paul D.},
+ month = aug,
+ year = {2019},
+ pages = {3544--3558},
+}
+
+@article{elazar_mutational_2016,
+ title = {Mutational scanning reveals the determinants of protein insertion and association energetics in the plasma membrane},
+ volume = {5},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/12125},
+ doi = {10.7554/eLife.12125},
+ abstract = {Insertion of helix-forming segments into the membrane and their association determines the structure, function, and expression levels of all plasma membrane proteins. However, systematic and reliable quantification of membrane-protein energetics has been challenging. We developed a deep mutational scanning method to monitor the effects of hundreds of point mutations on helix insertion and self-association within the bacterial inner membrane. The assay quantifies insertion energetics for all natural amino acids at 27 positions across the membrane, revealing that the hydrophobicity of biological membranes is significantly higher than appreciated. We further quantitate the contributions to membrane-protein insertion from positively charged residues at the cytoplasm-membrane interface and reveal large and unanticipated differences among these residues. Finally, we derive comprehensive mutational landscapes in the membrane domains of Glycophorin A and the ErbB2 oncogene, and find that insertion and self-association are strongly coupled in receptor homodimers.
+ ,
+ Cells are defined by a thin membrane that separates the inside of the cell from the outside. The core of this membrane is hydrophobic, meaning that it repels water. Many signals and nutrients cannot pass through the membrane itself, but can pass through the proteins that span the membrane. Membrane proteins are therefore essential for living cells; yet even after decades of research, it remains unclear how proteins interact with the membrane and which features determine a protein’s stability in a biological membrane.
+ Since the early 1980s it was known that the bacterium E. coli could grow on a common antibiotic called ampicillin if it had enough of an antibiotic-degrading enzyme called β-lactamase anchored into its inner membrane. Now, Elazar et al. have used this enzyme to obtain detailed information on the interactions between a biological membrane and a membrane protein. First, hundreds of different mutations were introduced into the gene that encodes the enzyme to generate a population of bacteria that each had a slightly different membrane anchor. The mutant bacteria were then grown in the presence of the antibiotic, meaning that those mutants with a more stable membrane anchor were more likely to survive and grow than those with less stable anchors.
+ Elazar et al. then collected all the surviving bacteria, sequenced their DNA and measured how common the different mutations were in the final population. This approach was less labor-intensive and more accurate than traditional methods for monitoring membrane-anchored proteins, and the resulting large dataset was used to uncover which features affect a protein’s stability in a membrane. These results also showed that a biological membrane’s core is considerably more hydrophobic than was previously thought.
+ In addition to being hydrophobic, biological membranes have more negative charge in the side that faces into the cell. This means that membrane proteins with a positive charge in this region will be more stable, and Elazar et al. were able to use their new system to measure this effect for the first time.
+ Finally, membrane proteins do not only span the membrane; they also bind with other membrane proteins in order to carry out their roles. Elazar et al. used their system to look at the surfaces of human membrane proteins that interact with one another, and build a detailed map of the interaction surfaces, from which they derived accurate models of the membrane proteins.
+ Overall, these new findings could now be used to model the three-dimensional structures of membrane proteins and improve their stability. This in turn may help efforts to develop these proteins into more robust experimental tools and in the search for drugs that target membrane proteins.},
+ language = {en},
+ urldate = {2023-10-16},
+ journal = {eLife},
+ author = {Elazar, Assaf and Weinstein, Jonathan and Biran, Ido and Fridman, Yearit and Bibi, Eitan and Fleishman, Sarel Jacob},
+ month = jan,
+ year = {2016},
+ pages = {e12125},
+ file = {Full Text:/Users/admin/Zotero/storage/UVHQ2HLP/Elazar et al. - 2016 - Mutational scanning reveals the determinants of pr.pdf:application/pdf},
+}
+
+@article{coyote-maestas_determinants_2022,
+ title = {Determinants of trafficking, conduction, and disease within a {K}+ channel revealed through multiparametric deep mutational scanning},
+ volume = {11},
+ issn = {2050-084X},
+ url = {https://elifesciences.org/articles/76903},
+ doi = {10.7554/eLife.76903},
+ abstract = {A long-standing goal in protein science and clinical genetics is to develop quantitative models of sequence, structure, and function relationships to understand how mutations cause disease. Deep mutational scanning (DMS) is a promising strategy to map how amino acids contribute to protein structure and function and to advance clinical variant interpretation. Here, we introduce 7429 single-residue missense mutations into the inward rectifier K
+ +
+ channel Kir2.1 and determine how this affects folding, assembly, and trafficking, as well as regulation by allosteric ligands and ion conduction. Our data provide high-resolution information on a cotranslationally folded biogenic unit, trafficking and quality control signals, and segregated roles of different structural elements in fold stability and function. We show that Kir2.1 surface trafficking mutants are underrepresented in variant effect databases, which has implications for clinical practice. By comparing fitness scores with expert-reviewed variant effects, we can predict the pathogenicity of ‘variants of unknown significance’ and disease mechanisms of known pathogenic mutations. Our study in Kir2.1 provides a blueprint for how multiparametric DMS can help us understand the mechanistic basis of genetic disorders and the structure–function relationships of proteins.},
+ language = {en},
+ urldate = {2023-10-16},
+ journal = {eLife},
+ author = {Coyote-Maestas, Willow and Nedrud, David and He, Yungui and Schmidt, Daniel},
+ month = may,
+ year = {2022},
+ pages = {e76903},
+ file = {Submitted Version:/Users/admin/Zotero/storage/AB4N657M/Coyote-Maestas et al. - 2022 - Determinants of trafficking, conduction, and disea.pdf:application/pdf},
+}
+
+@techreport{li_deep_2023,
+ type = {preprint},
+ title = {Deep mutational scanning reveals the functional constraints and evolutionary potential of the influenza {A} virus {PB1} protein},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2023.08.27.554986},
+ abstract = {Abstract
+ The influenza virus polymerase is central to influenza virus evolution. Adaptive mutations within the polymerase are often a prerequisite for efficient spread of novel animal-derived viruses in human populations. The polymerase also determines fidelity, and therefore the rate at which the virus will acquire mutations that lead to host range expansion, drug resistance, or antigenic drift. Despite its importance to viral replication and evolution, our understanding of the mutational effects and associated constraints on the influenza RNA-dependent RNA polymerase (RdRp) is relatively limited. We performed deep mutational scanning of the A/WSN/1933(H1N1) PB1, generating a library of 95.4\% of amino acid substitutions at 757 sites. After accuracy filters, we were able to measure replicative fitness for 13,354 (84\%) of all possible amino acid substitutions, and 16 were validated by results from pairwise competition assays. Functional and structural constraints were better revealed by individual sites involved in RNA or protein interactions than by major subdomains defined by sequence conservation. Mutational tolerance, as defined by site entropy, was correlated with evolutionary potential, as captured by diversity in available H1N1 sequences. Of 29 beneficial sites, many have either been identified in the natural evolution of PB1 or shown experimentally to have important impacts on replication and adaptation. Accessibility of amino acid substitutions by single nucleotide mutation was a key factor in determining whether mutations appeared in natural PB1 evolution. Our work provides a comprehensive map of mutational effects on a viral RdRp and a valuable resource for subsequent studies of influenza replication and evolution.},
+ language = {en},
+ urldate = {2023-10-16},
+ institution = {Microbiology},
+ author = {Li, Yuan and Arcos, Sarah and Sabsay, Kimberly R. and Te Velthuis, Aartjan J.W. and Lauring, Adam S.},
+ month = aug,
+ year = {2023},
+ doi = {10.1101/2023.08.27.554986},
+ file = {Submitted Version:/Users/admin/Zotero/storage/8WM9KUK5/Li et al. - 2023 - Deep mutational scanning reveals the functional co.pdf:application/pdf},
+}
+
+@article{meitlis_multiplexed_2020,
+ title = {Multiplexed {Functional} {Assessment} of {Genetic} {Variants} in {CARD11}},
+ volume = {107},
+ issn = {00029297},
+ url = {https://linkinghub.elsevier.com/retrieve/pii/S0002929720303736},
+ doi = {10.1016/j.ajhg.2020.10.015},
+ language = {en},
+ number = {6},
+ urldate = {2023-10-16},
+ journal = {The American Journal of Human Genetics},
+ author = {Meitlis, Iana and Allenspach, Eric J. and Bauman, Bradly M. and Phan, Isabelle Q. and Dabbah, Gina and Schmitt, Erica G. and Camp, Nathan D. and Torgerson, Troy R. and Nickerson, Deborah A. and Bamshad, Michael J. and Hagin, David and Luthers, Christopher R. and Stinson, Jeffrey R. and Gray, Jessica and Lundgren, Ingrid and Church, Joseph A. and Butte, Manish J. and Jordan, Mike B. and Aceves, Seema S. and Schwartz, Daniella M. and Milner, Joshua D. and Schuval, Susan and Skoda-Smith, Suzanne and Cooper, Megan A. and Starita, Lea M. and Rawlings, David J. and Snow, Andrew L. and James, Richard G.},
+ month = dec,
+ year = {2020},
+ pages = {1029--1043},
+ file = {Full Text:/Users/admin/Zotero/storage/CFKJ6Y9I/Meitlis et al. - 2020 - Multiplexed Functional Assessment of Genetic Varia.pdf:application/pdf},
+}
+
+@article{uk_monogenic_diabetes_consortium_prospective_2016,
+ title = {Prospective functional classification of all possible missense variants in {PPARG}},
+ volume = {48},
+ issn = {1061-4036, 1546-1718},
+ url = {https://www.nature.com/articles/ng.3700},
+ doi = {10.1038/ng.3700},
+ language = {en},
+ number = {12},
+ urldate = {2023-10-16},
+ journal = {Nature Genetics},
+ author = {{UK Monogenic Diabetes Consortium} and {Myocardial Infarction Genetics Consortium} and {UK Congenital Lipodystrophy Consortium} and Majithia, Amit R and Tsuda, Ben and Agostini, Maura and Gnanapradeepan, Keerthana and Rice, Robert and Peloso, Gina and Patel, Kashyap A and Zhang, Xiaolan and Broekema, Marjoleine F and Patterson, Nick and Duby, Marc and Sharpe, Ted and Kalkhoven, Eric and Rosen, Evan D and Barroso, Inês and Ellard, Sian and Kathiresan, Sekar and O'Rahilly, Stephen and Chatterjee, Krishna and Florez, Jose C and Mikkelsen, Tarjei and Savage, David B and Altshuler, David},
+ month = dec,
+ year = {2016},
+ pages = {1570--1575},
+ file = {Accepted Version:/Users/admin/Zotero/storage/MEZTM6WW/UK Monogenic Diabetes Consortium et al. - 2016 - Prospective functional classification of all possi.pdf:application/pdf},
+}
+
+@techreport{estevam_conserved_2023,
+ type = {preprint},
+ title = {Conserved regulatory motifs in the juxtamembrane domain and kinase {N}-lobe revealed through deep mutational scanning of the {MET} receptor tyrosine kinase domain},
+ url = {http://biorxiv.org/lookup/doi/10.1101/2023.08.03.551866},
+ abstract = {Abstract
+ MET is a receptor tyrosine kinase (RTK) responsible for initiating signaling pathways involved in development and wound repair. MET activation relies on ligand binding to the extracellular receptor, which prompts dimerization, intracellular phosphorylation, and recruitment of associated signaling proteins. Mutations, which are predominantly observed clinically in the intracellular juxtamembrane and kinase domains, can disrupt typical MET regulatory mechanisms. Understanding how juxtamembrane variants, such as exon 14 skipping (METΔEx14), and rare kinase domain mutations can increase signaling, often leading to cancer, remains a challenge. Here, we perform a parallel deep mutational scan (DMS) of MET intracellular kinase domain in two fusion protein backgrounds: wild type and METΔEx14. Our comparative approach has revealed a critical hydrophobic interaction between a juxtamembrane segment and the kinase ⍺C helix, pointing to differences in regulatory mechanisms between MET and other RTKs. Additionally, we have uncovered a β5 motif that acts as a structural pivot for kinase domain activation in MET and other TAM family of kinases. We also describe a number of previously unknown activating mutations, aiding the effort to annotate driver, passenger, and drug resistance mutations in the MET kinase domain.},
+ language = {en},
+ urldate = {2023-10-16},
+ institution = {Molecular Biology},
+ author = {Estevam, Gabriella O. and Linossi, Edmond M. and Macdonald, Christian B. and Espinoza, Carla A. and Michaud, Jennifer M. and Coyote-Maestas, Willow and Collisson, Eric A. and Jura, Natalia and Fraser, James S.},
+ month = aug,
+ year = {2023},
+ doi = {10.1101/2023.08.03.551866},
+ file = {Full Text:/Users/admin/Zotero/storage/S7UXHT33/Estevam et al. - 2023 - Conserved regulatory motifs in the juxtamembrane d.pdf:application/pdf},
+}
+
+@article{miller_allosteric_2022,
+ title = {Allosteric inhibition of {PPM1D} serine/threonine phosphatase via an altered conformational state},
+ volume = {13},
+ issn = {2041-1723},
+ url = {https://www.nature.com/articles/s41467-022-30463-9},
+ doi = {10.1038/s41467-022-30463-9},
+ abstract = {Abstract
+
+ PPM1D
+ encodes a serine/threonine phosphatase that regulates numerous pathways including the DNA damage response and p53. Activating mutations and amplification of
+ PPM1D
+ are found across numerous cancer types. GSK2830371 is a potent and selective allosteric inhibitor of PPM1D, but its mechanism of binding and inhibition of catalytic activity are unknown. Here we use computational, biochemical and functional genetic studies to elucidate the molecular basis of GSK2830371 activity. These data confirm that GSK2830371 binds an allosteric site of PPM1D with high affinity. By further incorporating data from hydrogen deuterium exchange mass spectrometry and sedimentation velocity analytical ultracentrifugation, we demonstrate that PPM1D exists in an equilibrium between two conformations that are defined by the movement of the flap domain, which is required for substrate recognition. A hinge region was identified that is critical for switching between the two conformations and was directly implicated in the high-affinity binding of GSK2830371 to PPM1D. We propose that the two conformations represent active and inactive forms of the protein reflected by the position of the flap, and that binding of GSK2830371 shifts the equilibrium to the inactive form. Finally, we found that C-terminal truncating mutations proximal to residue 400 result in destabilization of the protein via loss of a stabilizing N- and C-terminal interaction, consistent with the observation from human genetic data that nearly all
+ PPM1D
+ mutations in cancer are truncating and occur distal to residue 400. Taken together, our findings elucidate the mechanism by which binding of a small molecule to an allosteric site of PPM1D inhibits its activity and provides insights into the biology of PPM1D.},
+ language = {en},
+ number = {1},
+ urldate = {2023-10-16},
+ journal = {Nature Communications},
+ author = {Miller, Peter G. and Sathappa, Murugappan and Moroco, Jamie A. and Jiang, Wei and Qian, Yue and Iqbal, Sumaiya and Guo, Qi and Giacomelli, Andrew O. and Shaw, Subrata and Vernier, Camille and Bajrami, Besnik and Yang, Xiaoping and Raffier, Cerise and Sperling, Adam S. and Gibson, Christopher J. and Kahn, Josephine and Jin, Cyrus and Ranaghan, Matthew and Caliman, Alisha and Brousseau, Merissa and Fischer, Eric S. and Lintner, Robert and Piccioni, Federica and Campbell, Arthur J. and Root, David E. and Garvie, Colin W. and Ebert, Benjamin L.},
+ month = jun,
+ year = {2022},
+ pages = {3778},
+ file = {Full Text:/Users/admin/Zotero/storage/93RHZ3F3/Miller et al. - 2022 - Allosteric inhibition of PPM1D serinethreonine ph.pdf:application/pdf},
+}
+
+@article{erwood_saturation_2022,
+ title = {Saturation variant interpretation using {CRISPR} prime editing},
+ volume = {40},
+ issn = {1087-0156, 1546-1696},
+ url = {https://www.nature.com/articles/s41587-021-01201-1},
+ doi = {10.1038/s41587-021-01201-1},
+ language = {en},
+ number = {6},
+ urldate = {2023-10-16},
+ journal = {Nature Biotechnology},
+ author = {Erwood, Steven and Bily, Teija M. I. and Lequyer, Jason and Yan, Joyce and Gulati, Nitya and Brewer, Reid A. and Zhou, Liangchi and Pelletier, Laurence and Ivakine, Evgueni A. and Cohn, Ronald D.},
+ month = jun,
+ year = {2022},
+ pages = {885--895},
+ file = {Submitted Version:/Users/admin/Zotero/storage/Q5IRIPHG/Erwood et al. - 2022 - Saturation variant interpretation using CRISPR pri.pdf:application/pdf},
+}
+
+@phdthesis{jiang_exhaustive_2019,
+ title = {Exhaustive {Mapping} of {Missense} {Variation} in {Coronary} {Heart} {Disease}-related {Genes}},
+ url = {https://hdl.handle.net/1807/98076},
+ abstract = {Coronary heart disease (CHD) is a leading cause of death worldwide. A major risk factor of CHD is high low-density lipoprotein cholesterol (LDL-C), which is the defining feature of familial hypercholesterolemia, a genetic disorder characterized by mutations in genes involved in cholesterol uptake, such as LDLRAP1. The risk of CHD can be reduced by LDL-C-lowering medications called statins that potently inhibit HMG-CoA reductase (HMGCR), the enzyme that catalyzes the rate-limiting reaction of the mevalonate pathway. The study of genetic variation in CHD-related genes may contribute to early detection or prevention of CHD. I exhaustively mapped missense variant effects of two CHD-related genes, HMGCR and LDLRAP1. Proactive maps of human HMGCR may be able to model patient response to statins and identify the statin with the greatest likelihood of efficacy. Proactive maps of LDLRAP1 may identify pathogenic variation of autosomal recessive hypercholesterolemia (ARH) for early diagnosis of ARH.},
+ school = {University of Toronto},
+ author = {Jiang, Rosanna Junchen},
+ month = nov,
+ year = {2019},
+}
+
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Moving assays over from references.bib
+@inproceedings{chen_hotprotein_2022,
+ title = {{HotProtein}: {A} {Novel} {Framework} for {Protein} {Thermostability} {Prediction} and {Editing}},
+ shorttitle = {{HotProtein}},
+ url = {https://openreview.net/forum?id=RtV_iEbWeGE},
+ abstract = {The molecular basis of protein thermal stability is only partially understood and has major significance for drug and vaccine discovery. The lack of datasets and standardized benchmarks considerably limits learning-based discovery methods. We present \${\textbackslash}texttt\{HotProtein\}\$, a large-scale protein dataset with {\textbackslash}textit\{growth temperature\} annotations of thermostability, containing \$182\$K amino acid sequences and \$3\$K folded structures from \$230\$ different species with a wide temperature range \$-20{\textasciicircum}\{{\textbackslash}circ\}{\textbackslash}texttt\{C\}{\textbackslash}sim 120{\textasciicircum}\{{\textbackslash}circ\}{\textbackslash}texttt\{C\}\$. Due to functional domain differences and data scarcity within each species, existing methods fail to generalize well on our dataset. We address this problem through a novel learning framework, consisting of (\$1\$) Protein structure-aware pre-training (SAP) which leverages 3D information to enhance sequence-based pre-training; (\$2\$) Factorized sparse tuning (FST) that utilizes low-rank and sparse priors as an implicit regularization, together with feature augmentations. Extensive empirical studies demonstrate that our framework improves thermostability prediction compared to other deep learning models. Finally, we propose a novel editing algorithm to efficiently generate positive amino acid mutations that improve thermostability.},
+ language = {en},
+ urldate = {2023-06-05},
+ author = {Chen, Tianlong and Gong, Chengyue and Diaz, Daniel Jesus and Chen, Xuxi and Wells, Jordan Tyler and Liu, Qiang and Wang, Zhangyang and Ellington, Andrew and Dimakis, Alex and Klivans, Adam},
+ month = oct,
+ year = {2022},
+ file = {Full Text PDF:/Users/pascalnotin/Zotero/storage/WIGJZS6V/Chen et al. - 2022 - HotProtein A Novel Framework for Protein Thermost.pdf:application/pdf},
+}
diff --git a/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level.csv b/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level.csv
new file mode 100644
index 0000000..ec86879
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,Tranception Embeddings
+B1LPA6_ECOSM_Russ_2020_indels,0.547,0.569,0.793
+BLAT_ECOLX_Gonzalez_2019_indels,0.366,0.313,0.273
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.216,0.205,0.25
+CAPSD_AAV2S_Sinai_2021_library_indels,0.434,0.414,0.299
+HIS7_YEAST_Pokusaeva_2019_indels,0.161,0.198,0.158
+PTEN_HUMAN_Mighell_2018_indels,0.274,0.467,0.365
+P53_HUMAN_Kotler_2018_indels,0.315,0.383,0.348
+KCNJ2_MOUSE_Macdonald_2022_indels,0.517,0.671,0.391
+Q8EG35_SHEON_Campbell_2022_indels,0.57,0.624,0.512
+A4_HUMAN_Seuma_2022_indels,0.262,0.36,0.249
+S22A1_HUMAN_Yee_2023_abundance_indels,0.42,0.613,0.44
+S22A1_HUMAN_Yee_2023_activity_indels,0.427,0.606,0.397
+AMFR_HUMAN_Rocklin_2023_4G3O_indels,0.267,0.467,0.848
+ARGR_ECOLI_Rocklin_2023_1AOY_indels,0.138,0.259,0.204
+BBC1_YEAST_Rocklin_2023_1TG0_indels,0.224,0.328,0.391
+BCHB_CHLTE_Rocklin_2023_2KRU_indels,0.194,0.503,0.449
+CATR_CHLRE_Rocklin_2023_2AMI_indels,0.192,0.362,0.247
+CBPA2_HUMAN_Rocklin_2023_1O6X_indels,0.131,0.292,0.346
+CBX4_HUMAN_Rocklin_2023_2K28_indels,0.246,0.423,0.409
+CSN4_MOUSE_Rocklin_2023_1UFM_indels,0.087,0.292,0.288
+CUE1_YEAST_Rocklin_2023_2MYX_indels,0.216,0.361,0.56
+DN7A_SACS2_Rocklin_2023_1JIC_indels,0.282,0.547,0.549
+DNJA1_HUMAN_Rocklin_2023_2LO1_indels,0.128,0.25,0.162
+DOCK1_MOUSE_Rocklin_2023_2M0Y_indels,0.281,0.491,0.353
+EPHB2_HUMAN_Rocklin_2023_1F0M_indels,0.125,0.299,0.307
+FECA_ECOLI_Rocklin_2023_2D1U_indels,0.488,0.508,0.592
+HCP_LAMBD_Rocklin_2023_2L6Q_indels,0.091,0.267,0.238
+HECD1_HUMAN_Rocklin_2023_3DKM_indels,0.63,0.568,0.645
+ILF3_HUMAN_Rocklin_2023_2L33_indels,0.135,0.304,0.245
+MAFG_MOUSE_Rocklin_2023_1K1V_indels,0.257,0.548,0.493
+MBD11_ARATH_Rocklin_2023_6ACV_indels,0.272,0.377,0.597
+MYO3_YEAST_Rocklin_2023_2BTT_indels,0.392,0.612,0.514
+NKX31_HUMAN_Rocklin_2023_2L9R_indels,0.153,0.367,0.156
+NUSA_ECOLI_Rocklin_2023_1WCL_indels,0.154,0.284,0.527
+NUSG_MYCTU_Rocklin_2023_2MI6_indels,0.22,0.388,0.327
+OBSCN_HUMAN_Rocklin_2023_1V1C_indels,0.178,0.282,0.342
+ODP2_GEOSE_Rocklin_2023_1W4G_indels,0.46,0.682,0.543
+OTU7A_HUMAN_Rocklin_2023_2L2D_indels,0.216,0.422,0.56
+PIN1_HUMAN_Rocklin_2023_1I6C_indels,0.113,0.389,0.269
+PITX2_HUMAN_Rocklin_2023_2L7M_indels,0.495,0.773,0.605
+PKN1_HUMAN_Rocklin_2023_1URF_indels,0.082,0.178,0.156
+POLG_PESV_Rocklin_2023_2MXD_indels,0.198,0.495,0.384
+PR40A_HUMAN_Rocklin_2023_1UZC_indels,0.168,0.312,0.279
+PSAE_SYNP2_Rocklin_2023_1PSE_indels,0.24,0.259,0.311
+RAD_ANTMA_Rocklin_2023_2CJJ_indels,0.193,0.33,0.294
+RCD1_ARATH_Rocklin_2023_5OAO_indels,0.251,0.359,0.424
+RD23A_HUMAN_Rocklin_2023_1IFY_indels,0.154,0.349,0.29
+RPC1_BP434_Rocklin_2023_1R69_indels,0.484,0.511,0.431
+RS15_GEOSE_Rocklin_2023_1A32_indels,0.203,0.355,0.331
+SAV1_MOUSE_Rocklin_2023_2YSB_indels,0.531,0.708,0.648
+SDA_BACSU_Rocklin_2023_1PV0_indels,0.113,0.197,0.184
+SOX30_HUMAN_Rocklin_2023_7JJK_indels,0.195,0.336,0.566
+SPG2_STRSG_Rocklin_2023_5UBS_indels,0.16,0.256,0.262
+SPTN1_CHICK_Rocklin_2023_1TUD_indels,0.237,0.332,0.471
+SQSTM_MOUSE_Rocklin_2023_2RRU_indels,0.225,0.559,0.442
+SR43C_ARATH_Rocklin_2023_2N88_indels,0.223,0.323,0.235
+SRBS1_HUMAN_Rocklin_2023_2O2W_indels,0.153,0.366,0.201
+TCRG1_MOUSE_Rocklin_2023_1E0L_indels,0.118,0.22,0.329
+THO1_YEAST_Rocklin_2023_2WQG_indels,0.233,0.421,0.623
+TNKS2_HUMAN_Rocklin_2023_5JRT_indels,0.22,0.405,0.477
+UBE4B_HUMAN_Rocklin_2023_3L1X_indels,0.348,0.431,0.533
+UBR5_HUMAN_Rocklin_2023_1I2T_indels,0.196,0.344,0.288
+VG08_BPP22_Rocklin_2023_2GP8_indels,0.187,0.291,0.394
+VILI_CHICK_Rocklin_2023_1YU5_indels,0.133,0.215,0.184
+VRPI_BPT7_Rocklin_2023_2WNM_indels,0.235,0.496,0.474
+YNZC_BACSU_Rocklin_2023_2JVD_indels,0.1,0.343,0.303
diff --git a/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level_fold_random_5.csv b/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level_fold_random_5.csv
new file mode 100644
index 0000000..ec86879
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/MSE/DMS_indels_MSE_DMS_level_fold_random_5.csv
@@ -0,0 +1,67 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,Tranception Embeddings
+B1LPA6_ECOSM_Russ_2020_indels,0.547,0.569,0.793
+BLAT_ECOLX_Gonzalez_2019_indels,0.366,0.313,0.273
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.216,0.205,0.25
+CAPSD_AAV2S_Sinai_2021_library_indels,0.434,0.414,0.299
+HIS7_YEAST_Pokusaeva_2019_indels,0.161,0.198,0.158
+PTEN_HUMAN_Mighell_2018_indels,0.274,0.467,0.365
+P53_HUMAN_Kotler_2018_indels,0.315,0.383,0.348
+KCNJ2_MOUSE_Macdonald_2022_indels,0.517,0.671,0.391
+Q8EG35_SHEON_Campbell_2022_indels,0.57,0.624,0.512
+A4_HUMAN_Seuma_2022_indels,0.262,0.36,0.249
+S22A1_HUMAN_Yee_2023_abundance_indels,0.42,0.613,0.44
+S22A1_HUMAN_Yee_2023_activity_indels,0.427,0.606,0.397
+AMFR_HUMAN_Rocklin_2023_4G3O_indels,0.267,0.467,0.848
+ARGR_ECOLI_Rocklin_2023_1AOY_indels,0.138,0.259,0.204
+BBC1_YEAST_Rocklin_2023_1TG0_indels,0.224,0.328,0.391
+BCHB_CHLTE_Rocklin_2023_2KRU_indels,0.194,0.503,0.449
+CATR_CHLRE_Rocklin_2023_2AMI_indels,0.192,0.362,0.247
+CBPA2_HUMAN_Rocklin_2023_1O6X_indels,0.131,0.292,0.346
+CBX4_HUMAN_Rocklin_2023_2K28_indels,0.246,0.423,0.409
+CSN4_MOUSE_Rocklin_2023_1UFM_indels,0.087,0.292,0.288
+CUE1_YEAST_Rocklin_2023_2MYX_indels,0.216,0.361,0.56
+DN7A_SACS2_Rocklin_2023_1JIC_indels,0.282,0.547,0.549
+DNJA1_HUMAN_Rocklin_2023_2LO1_indels,0.128,0.25,0.162
+DOCK1_MOUSE_Rocklin_2023_2M0Y_indels,0.281,0.491,0.353
+EPHB2_HUMAN_Rocklin_2023_1F0M_indels,0.125,0.299,0.307
+FECA_ECOLI_Rocklin_2023_2D1U_indels,0.488,0.508,0.592
+HCP_LAMBD_Rocklin_2023_2L6Q_indels,0.091,0.267,0.238
+HECD1_HUMAN_Rocklin_2023_3DKM_indels,0.63,0.568,0.645
+ILF3_HUMAN_Rocklin_2023_2L33_indels,0.135,0.304,0.245
+MAFG_MOUSE_Rocklin_2023_1K1V_indels,0.257,0.548,0.493
+MBD11_ARATH_Rocklin_2023_6ACV_indels,0.272,0.377,0.597
+MYO3_YEAST_Rocklin_2023_2BTT_indels,0.392,0.612,0.514
+NKX31_HUMAN_Rocklin_2023_2L9R_indels,0.153,0.367,0.156
+NUSA_ECOLI_Rocklin_2023_1WCL_indels,0.154,0.284,0.527
+NUSG_MYCTU_Rocklin_2023_2MI6_indels,0.22,0.388,0.327
+OBSCN_HUMAN_Rocklin_2023_1V1C_indels,0.178,0.282,0.342
+ODP2_GEOSE_Rocklin_2023_1W4G_indels,0.46,0.682,0.543
+OTU7A_HUMAN_Rocklin_2023_2L2D_indels,0.216,0.422,0.56
+PIN1_HUMAN_Rocklin_2023_1I6C_indels,0.113,0.389,0.269
+PITX2_HUMAN_Rocklin_2023_2L7M_indels,0.495,0.773,0.605
+PKN1_HUMAN_Rocklin_2023_1URF_indels,0.082,0.178,0.156
+POLG_PESV_Rocklin_2023_2MXD_indels,0.198,0.495,0.384
+PR40A_HUMAN_Rocklin_2023_1UZC_indels,0.168,0.312,0.279
+PSAE_SYNP2_Rocklin_2023_1PSE_indels,0.24,0.259,0.311
+RAD_ANTMA_Rocklin_2023_2CJJ_indels,0.193,0.33,0.294
+RCD1_ARATH_Rocklin_2023_5OAO_indels,0.251,0.359,0.424
+RD23A_HUMAN_Rocklin_2023_1IFY_indels,0.154,0.349,0.29
+RPC1_BP434_Rocklin_2023_1R69_indels,0.484,0.511,0.431
+RS15_GEOSE_Rocklin_2023_1A32_indels,0.203,0.355,0.331
+SAV1_MOUSE_Rocklin_2023_2YSB_indels,0.531,0.708,0.648
+SDA_BACSU_Rocklin_2023_1PV0_indels,0.113,0.197,0.184
+SOX30_HUMAN_Rocklin_2023_7JJK_indels,0.195,0.336,0.566
+SPG2_STRSG_Rocklin_2023_5UBS_indels,0.16,0.256,0.262
+SPTN1_CHICK_Rocklin_2023_1TUD_indels,0.237,0.332,0.471
+SQSTM_MOUSE_Rocklin_2023_2RRU_indels,0.225,0.559,0.442
+SR43C_ARATH_Rocklin_2023_2N88_indels,0.223,0.323,0.235
+SRBS1_HUMAN_Rocklin_2023_2O2W_indels,0.153,0.366,0.201
+TCRG1_MOUSE_Rocklin_2023_1E0L_indels,0.118,0.22,0.329
+THO1_YEAST_Rocklin_2023_2WQG_indels,0.233,0.421,0.623
+TNKS2_HUMAN_Rocklin_2023_5JRT_indels,0.22,0.405,0.477
+UBE4B_HUMAN_Rocklin_2023_3L1X_indels,0.348,0.431,0.533
+UBR5_HUMAN_Rocklin_2023_1I2T_indels,0.196,0.344,0.288
+VG08_BPP22_Rocklin_2023_2GP8_indels,0.187,0.291,0.394
+VILI_CHICK_Rocklin_2023_1YU5_indels,0.133,0.215,0.184
+VRPI_BPT7_Rocklin_2023_2WNM_indels,0.235,0.496,0.474
+YNZC_BACSU_Rocklin_2023_2JVD_indels,0.1,0.343,0.303
diff --git a/benchmarks/DMS_supervised/indels/MSE/Summary_performance_DMS_indels_MSE.csv b/benchmarks/DMS_supervised/indels/MSE/Summary_performance_DMS_indels_MSE.csv
new file mode 100644
index 0000000..328b4bc
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/MSE/Summary_performance_DMS_indels_MSE.csv
@@ -0,0 +1,4 @@
+Model_rank,Model_name,Model type,Average_MSE,Bootstrap_standard_error_MSE,Average_MSE_fold_random_5,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,References,Model details
+1,ESM-1v Embeddings,Embedding,0.365,0.0,0.365,0.416,N/A,0.468,0.347,0.229,0.248,0.301,0.25,0.262,0.234,0.268,0.253,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v Embeddings
+2,Tranception Embeddings,Embedding,0.41,0.022,0.41,0.518,N/A,0.415,0.313,0.391,0.385,0.291,0.409,0.412,0.378,0.398,0.366,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception Embeddings
+3,MSA Transformer Embeddings,Embedding,0.486,0.013,0.486,0.547,N/A,0.642,0.366,0.389,0.396,0.351,0.407,0.411,0.391,0.399,0.395,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer Embeddings
diff --git a/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level.csv b/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level.csv
new file mode 100644
index 0000000..ae37987
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,Tranception Embeddings
+B1LPA6_ECOSM_Russ_2020_indels,0.591,0.581,0.536
+BLAT_ECOLX_Gonzalez_2019_indels,0.78,0.814,0.78
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.834,0.832,0.828
+CAPSD_AAV2S_Sinai_2021_library_indels,0.656,0.655,0.708
+HIS7_YEAST_Pokusaeva_2019_indels,0.623,0.62,0.597
+PTEN_HUMAN_Mighell_2018_indels,0.821,0.767,0.789
+P53_HUMAN_Kotler_2018_indels,0.68,0.679,0.653
+KCNJ2_MOUSE_Macdonald_2022_indels,0.678,0.563,0.733
+Q8EG35_SHEON_Campbell_2022_indels,0.764,0.693,0.782
+A4_HUMAN_Seuma_2022_indels,0.816,0.746,0.813
+S22A1_HUMAN_Yee_2023_abundance_indels,0.779,0.664,0.773
+S22A1_HUMAN_Yee_2023_activity_indels,0.705,0.636,0.696
+AMFR_HUMAN_Rocklin_2023_4G3O_indels,0.757,0.636,0.621
+ARGR_ECOLI_Rocklin_2023_1AOY_indels,0.926,0.87,0.9
+BBC1_YEAST_Rocklin_2023_1TG0_indels,0.849,0.795,0.811
+BCHB_CHLTE_Rocklin_2023_2KRU_indels,0.881,0.718,0.776
+CATR_CHLRE_Rocklin_2023_2AMI_indels,0.914,0.847,0.89
+CBPA2_HUMAN_Rocklin_2023_1O6X_indels,0.937,0.847,0.846
+CBX4_HUMAN_Rocklin_2023_2K28_indels,0.874,0.786,0.782
+CSN4_MOUSE_Rocklin_2023_1UFM_indels,0.9,0.813,0.831
+CUE1_YEAST_Rocklin_2023_2MYX_indels,0.759,0.639,0.588
+DN7A_SACS2_Rocklin_2023_1JIC_indels,0.858,0.707,0.74
+DNJA1_HUMAN_Rocklin_2023_2LO1_indels,0.93,0.879,0.925
+DOCK1_MOUSE_Rocklin_2023_2M0Y_indels,0.836,0.719,0.823
+EPHB2_HUMAN_Rocklin_2023_1F0M_indels,0.907,0.828,0.81
+FECA_ECOLI_Rocklin_2023_2D1U_indels,0.729,0.679,0.677
+HCP_LAMBD_Rocklin_2023_2L6Q_indels,0.954,0.858,0.906
+HECD1_HUMAN_Rocklin_2023_3DKM_indels,0.708,0.721,0.673
+ILF3_HUMAN_Rocklin_2023_2L33_indels,0.938,0.858,0.893
+MAFG_MOUSE_Rocklin_2023_1K1V_indels,0.843,0.696,0.705
+MBD11_ARATH_Rocklin_2023_6ACV_indels,0.84,0.813,0.72
+MYO3_YEAST_Rocklin_2023_2BTT_indels,0.761,0.599,0.771
+NKX31_HUMAN_Rocklin_2023_2L9R_indels,0.853,0.746,0.86
+NUSA_ECOLI_Rocklin_2023_1WCL_indels,0.917,0.847,0.781
+NUSG_MYCTU_Rocklin_2023_2MI6_indels,0.831,0.712,0.768
+OBSCN_HUMAN_Rocklin_2023_1V1C_indels,0.913,0.867,0.85
+ODP2_GEOSE_Rocklin_2023_1W4G_indels,0.83,0.733,0.812
+OTU7A_HUMAN_Rocklin_2023_2L2D_indels,0.805,0.693,0.742
+PIN1_HUMAN_Rocklin_2023_1I6C_indels,0.839,0.754,0.793
+PITX2_HUMAN_Rocklin_2023_2L7M_indels,0.678,0.579,0.657
+PKN1_HUMAN_Rocklin_2023_1URF_indels,0.925,0.879,0.876
+POLG_PESV_Rocklin_2023_2MXD_indels,0.93,0.763,0.812
+PR40A_HUMAN_Rocklin_2023_1UZC_indels,0.893,0.827,0.843
+PSAE_SYNP2_Rocklin_2023_1PSE_indels,0.821,0.822,0.85
+RAD_ANTMA_Rocklin_2023_2CJJ_indels,0.87,0.799,0.847
+RCD1_ARATH_Rocklin_2023_5OAO_indels,0.861,0.798,0.771
+RD23A_HUMAN_Rocklin_2023_1IFY_indels,0.908,0.792,0.844
+RPC1_BP434_Rocklin_2023_1R69_indels,0.658,0.609,0.677
+RS15_GEOSE_Rocklin_2023_1A32_indels,0.93,0.838,0.846
+SAV1_MOUSE_Rocklin_2023_2YSB_indels,0.692,0.569,0.727
+SDA_BACSU_Rocklin_2023_1PV0_indels,0.946,0.907,0.92
+SOX30_HUMAN_Rocklin_2023_7JJK_indels,0.875,0.81,0.826
+SPG2_STRSG_Rocklin_2023_5UBS_indels,0.922,0.862,0.869
+SPTN1_CHICK_Rocklin_2023_1TUD_indels,0.865,0.806,0.81
+SQSTM_MOUSE_Rocklin_2023_2RRU_indels,0.859,0.675,0.773
+SR43C_ARATH_Rocklin_2023_2N88_indels,0.879,0.821,0.87
+SRBS1_HUMAN_Rocklin_2023_2O2W_indels,0.927,0.814,0.916
+TCRG1_MOUSE_Rocklin_2023_1E0L_indels,0.9,0.834,0.792
+THO1_YEAST_Rocklin_2023_2WQG_indels,0.833,0.687,0.698
+TNKS2_HUMAN_Rocklin_2023_5JRT_indels,0.852,0.746,0.726
+UBE4B_HUMAN_Rocklin_2023_3L1X_indels,0.733,0.647,0.584
+UBR5_HUMAN_Rocklin_2023_1I2T_indels,0.893,0.82,0.869
+VG08_BPP22_Rocklin_2023_2GP8_indels,0.851,0.824,0.78
+VILI_CHICK_Rocklin_2023_1YU5_indels,0.932,0.899,0.91
+VRPI_BPT7_Rocklin_2023_2WNM_indels,0.873,0.725,0.77
+YNZC_BACSU_Rocklin_2023_2JVD_indels,0.878,0.759,0.839
diff --git a/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level_fold_random_5.csv b/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level_fold_random_5.csv
new file mode 100644
index 0000000..ae37987
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/Spearman/DMS_indels_Spearman_DMS_level_fold_random_5.csv
@@ -0,0 +1,67 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,Tranception Embeddings
+B1LPA6_ECOSM_Russ_2020_indels,0.591,0.581,0.536
+BLAT_ECOLX_Gonzalez_2019_indels,0.78,0.814,0.78
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.834,0.832,0.828
+CAPSD_AAV2S_Sinai_2021_library_indels,0.656,0.655,0.708
+HIS7_YEAST_Pokusaeva_2019_indels,0.623,0.62,0.597
+PTEN_HUMAN_Mighell_2018_indels,0.821,0.767,0.789
+P53_HUMAN_Kotler_2018_indels,0.68,0.679,0.653
+KCNJ2_MOUSE_Macdonald_2022_indels,0.678,0.563,0.733
+Q8EG35_SHEON_Campbell_2022_indels,0.764,0.693,0.782
+A4_HUMAN_Seuma_2022_indels,0.816,0.746,0.813
+S22A1_HUMAN_Yee_2023_abundance_indels,0.779,0.664,0.773
+S22A1_HUMAN_Yee_2023_activity_indels,0.705,0.636,0.696
+AMFR_HUMAN_Rocklin_2023_4G3O_indels,0.757,0.636,0.621
+ARGR_ECOLI_Rocklin_2023_1AOY_indels,0.926,0.87,0.9
+BBC1_YEAST_Rocklin_2023_1TG0_indels,0.849,0.795,0.811
+BCHB_CHLTE_Rocklin_2023_2KRU_indels,0.881,0.718,0.776
+CATR_CHLRE_Rocklin_2023_2AMI_indels,0.914,0.847,0.89
+CBPA2_HUMAN_Rocklin_2023_1O6X_indels,0.937,0.847,0.846
+CBX4_HUMAN_Rocklin_2023_2K28_indels,0.874,0.786,0.782
+CSN4_MOUSE_Rocklin_2023_1UFM_indels,0.9,0.813,0.831
+CUE1_YEAST_Rocklin_2023_2MYX_indels,0.759,0.639,0.588
+DN7A_SACS2_Rocklin_2023_1JIC_indels,0.858,0.707,0.74
+DNJA1_HUMAN_Rocklin_2023_2LO1_indels,0.93,0.879,0.925
+DOCK1_MOUSE_Rocklin_2023_2M0Y_indels,0.836,0.719,0.823
+EPHB2_HUMAN_Rocklin_2023_1F0M_indels,0.907,0.828,0.81
+FECA_ECOLI_Rocklin_2023_2D1U_indels,0.729,0.679,0.677
+HCP_LAMBD_Rocklin_2023_2L6Q_indels,0.954,0.858,0.906
+HECD1_HUMAN_Rocklin_2023_3DKM_indels,0.708,0.721,0.673
+ILF3_HUMAN_Rocklin_2023_2L33_indels,0.938,0.858,0.893
+MAFG_MOUSE_Rocklin_2023_1K1V_indels,0.843,0.696,0.705
+MBD11_ARATH_Rocklin_2023_6ACV_indels,0.84,0.813,0.72
+MYO3_YEAST_Rocklin_2023_2BTT_indels,0.761,0.599,0.771
+NKX31_HUMAN_Rocklin_2023_2L9R_indels,0.853,0.746,0.86
+NUSA_ECOLI_Rocklin_2023_1WCL_indels,0.917,0.847,0.781
+NUSG_MYCTU_Rocklin_2023_2MI6_indels,0.831,0.712,0.768
+OBSCN_HUMAN_Rocklin_2023_1V1C_indels,0.913,0.867,0.85
+ODP2_GEOSE_Rocklin_2023_1W4G_indels,0.83,0.733,0.812
+OTU7A_HUMAN_Rocklin_2023_2L2D_indels,0.805,0.693,0.742
+PIN1_HUMAN_Rocklin_2023_1I6C_indels,0.839,0.754,0.793
+PITX2_HUMAN_Rocklin_2023_2L7M_indels,0.678,0.579,0.657
+PKN1_HUMAN_Rocklin_2023_1URF_indels,0.925,0.879,0.876
+POLG_PESV_Rocklin_2023_2MXD_indels,0.93,0.763,0.812
+PR40A_HUMAN_Rocklin_2023_1UZC_indels,0.893,0.827,0.843
+PSAE_SYNP2_Rocklin_2023_1PSE_indels,0.821,0.822,0.85
+RAD_ANTMA_Rocklin_2023_2CJJ_indels,0.87,0.799,0.847
+RCD1_ARATH_Rocklin_2023_5OAO_indels,0.861,0.798,0.771
+RD23A_HUMAN_Rocklin_2023_1IFY_indels,0.908,0.792,0.844
+RPC1_BP434_Rocklin_2023_1R69_indels,0.658,0.609,0.677
+RS15_GEOSE_Rocklin_2023_1A32_indels,0.93,0.838,0.846
+SAV1_MOUSE_Rocklin_2023_2YSB_indels,0.692,0.569,0.727
+SDA_BACSU_Rocklin_2023_1PV0_indels,0.946,0.907,0.92
+SOX30_HUMAN_Rocklin_2023_7JJK_indels,0.875,0.81,0.826
+SPG2_STRSG_Rocklin_2023_5UBS_indels,0.922,0.862,0.869
+SPTN1_CHICK_Rocklin_2023_1TUD_indels,0.865,0.806,0.81
+SQSTM_MOUSE_Rocklin_2023_2RRU_indels,0.859,0.675,0.773
+SR43C_ARATH_Rocklin_2023_2N88_indels,0.879,0.821,0.87
+SRBS1_HUMAN_Rocklin_2023_2O2W_indels,0.927,0.814,0.916
+TCRG1_MOUSE_Rocklin_2023_1E0L_indels,0.9,0.834,0.792
+THO1_YEAST_Rocklin_2023_2WQG_indels,0.833,0.687,0.698
+TNKS2_HUMAN_Rocklin_2023_5JRT_indels,0.852,0.746,0.726
+UBE4B_HUMAN_Rocklin_2023_3L1X_indels,0.733,0.647,0.584
+UBR5_HUMAN_Rocklin_2023_1I2T_indels,0.893,0.82,0.869
+VG08_BPP22_Rocklin_2023_2GP8_indels,0.851,0.824,0.78
+VILI_CHICK_Rocklin_2023_1YU5_indels,0.932,0.899,0.91
+VRPI_BPT7_Rocklin_2023_2WNM_indels,0.873,0.725,0.77
+YNZC_BACSU_Rocklin_2023_2JVD_indels,0.878,0.759,0.839
diff --git a/benchmarks/DMS_supervised/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv b/benchmarks/DMS_supervised/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv
new file mode 100644
index 0000000..e972c93
--- /dev/null
+++ b/benchmarks/DMS_supervised/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv
@@ -0,0 +1,4 @@
+Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Average_Spearman_fold_random_5,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,References,Model details
+1,ESM-1v Embeddings,Embedding,0.752,0.0,0.752,0.705,N/A,0.728,0.718,0.856,0.843,0.747,0.838,0.821,0.842,0.848,0.835,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v Embeddings
+2,Tranception Embeddings,Embedding,0.735,0.007,0.735,0.674,N/A,0.753,0.716,0.797,0.797,0.745,0.774,0.767,0.789,0.8,0.785,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception Embeddings
+3,MSA Transformer Embeddings,Embedding,0.689,0.004,0.689,0.661,N/A,0.614,0.71,0.769,0.764,0.723,0.752,0.733,0.765,0.776,0.754,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer Embeddings
diff --git a/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level.csv b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level.csv
new file mode 100644
index 0000000..6ddfd4a
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,1.174,0.972,1.096,1.123,0.98,1.032,1.019,1.057,0.962,1.025
+A0A192B1T2_9HIV1_Haddox_2018,0.774,0.73,0.814,0.766,0.739,0.739,0.728,1.0,0.607,0.69
+A0A1I9GEU1_NEIME_Kennouche_2019,1.165,1.098,1.01,1.017,1.01,1.019,1.016,1.005,1.254,1.559
+A0A2Z5U3Z0_9INFA_Doud_2016,0.687,0.778,0.737,0.739,0.721,0.718,0.687,0.991,0.787,0.632
+A0A2Z5U3Z0_9INFA_Wu_2014,0.992,1.25,0.988,0.995,0.984,0.979,0.976,1.048,1.074,0.976
+A4D664_9INFA_Soh_2019,1.047,0.985,0.959,1.055,0.978,0.956,0.927,1.04,0.925,0.883
+A4GRB6_PSEAI_Chen_2020,0.442,0.399,0.559,0.559,0.532,0.584,0.533,0.978,0.314,0.503
+AACC1_PSEAI_Dandage_2018,0.785,0.715,0.836,0.767,0.771,0.817,0.782,0.98,0.826,0.902
+ADRB2_HUMAN_Jones_2020,0.753,0.831,0.784,0.761,0.808,0.783,0.767,1.008,0.77,0.767
+AMIE_PSEAE_Wrenbeck_2017,0.589,0.611,0.795,0.663,0.721,0.866,0.814,1.005,0.542,0.705
+LGK_LIPST_Klesmith_2015,0.698,0.675,0.801,0.754,0.731,0.844,0.775,0.996,0.563,0.778
+BLAT_ECOLX_Deng_2012,0.714,0.674,0.718,0.741,0.727,0.798,0.719,0.973,0.639,0.788
+BLAT_ECOLX_Firnberg_2014,0.468,0.396,0.487,0.578,0.543,0.673,0.512,0.964,0.352,0.595
+BLAT_ECOLX_Jacquier_2013,0.443,0.401,0.524,0.57,0.553,0.581,0.498,0.992,0.412,0.565
+BLAT_ECOLX_Stiffler_2015,0.425,0.375,0.5,0.562,0.541,0.649,0.511,0.966,0.344,0.524
+BRCA1_HUMAN_Findlay_2018,0.711,0.775,0.861,0.73,0.766,0.712,0.715,0.991,0.692,0.742
+C6KNH7_9INFA_Lee_2018,0.742,0.812,0.865,0.817,0.841,0.813,0.799,0.99,0.634,0.688
+CALM1_HUMAN_Weile_2017,0.963,0.933,0.944,0.948,0.936,0.931,0.936,1.002,1.156,1.176
+CAPSD_AAV2S_Sinai_2021,0.604,0.536,0.646,0.949,0.645,0.732,0.701,0.941,0.495,0.528
+CCDB_ECOLI_Adkar_2012,0.663,0.486,0.658,0.877,0.761,0.72,0.651,0.925,0.513,0.724
+CCDB_ECOLI_Tripathi_2016,0.7,0.563,0.726,0.845,0.757,0.669,0.674,1.008,0.604,0.832
+CP2C9_HUMAN_Amorosi_2021_abundance,0.566,0.643,0.643,0.646,0.653,0.592,0.596,0.992,0.446,0.494
+CP2C9_HUMAN_Amorosi_2021_activity,0.564,0.581,0.604,0.592,0.645,0.62,0.603,0.983,0.411,0.532
+DLG4_HUMAN_Faure_2021,0.579,0.718,0.803,0.792,0.712,0.579,0.621,0.941,0.648,0.548
+DLG4_RAT_McLaughlin_2012,0.611,0.769,0.781,0.697,0.772,0.756,0.729,0.97,0.56,0.594
+DYR_ECOLI_Thompson_2019,0.722,0.617,0.748,0.789,0.745,0.839,0.757,0.984,0.619,0.866
+ENV_HV1B9_DuenasDecamp_2016,0.869,0.862,0.872,0.84,0.882,0.827,0.833,1.046,1.053,0.861
+ENV_HV1BR_Haddox_2016,0.965,0.952,0.961,0.971,0.961,0.952,0.95,1.006,0.957,0.947
+ESTA_BACSU_Nutschel_2020,0.85,0.699,0.826,0.912,0.832,0.929,0.859,0.975,0.729,0.98
+F7YBW8_MESOW_Aakre_2015,0.979,1.813,1.436,1.463,1.617,1.16,1.253,2.513,2.252,0.54
+GAL4_YEAST_Kitzman_2015,0.8,0.651,0.739,0.775,0.695,0.778,0.83,0.935,0.723,0.795
+GCN4_YEAST_Staller_2018,3.779,2.162,2.703,2.849,3.01,3.225,2.965,3.057,2.617,2.511
+GFP_AEQVI_Sarkisyan_2016,0.878,0.83,0.932,1.066,0.941,0.925,0.875,1.06,0.867,0.801
+GRB2_HUMAN_Faure_2021,0.447,0.542,0.635,0.612,0.595,0.733,0.624,0.903,0.536,0.7
+HIS7_YEAST_Pokusaeva_2019,1.45,1.767,1.232,1.187,1.193,1.146,1.223,1.278,1.395,1.483
+HSP82_YEAST_Flynn_2019,0.786,0.798,0.795,0.803,0.794,0.761,0.764,0.999,0.59,0.622
+HSP82_YEAST_Mishra_2016,0.613,0.821,0.725,0.718,0.813,0.666,0.665,1.058,0.614,0.558
+I6TAH8_I68A0_Doud_2015,1.054,0.889,0.952,1.017,0.891,0.887,0.852,0.998,0.799,0.794
+IF1_ECOLI_Kelsic_2016,0.599,0.493,0.629,0.663,0.805,0.602,0.636,0.959,0.608,0.586
+KCNH2_HUMAN_Kozek_2020,0.719,0.831,0.947,0.996,0.923,0.704,0.735,1.004,0.809,0.911
+KKA2_KLEPN_Melnikov_2014,0.619,0.566,0.68,0.695,0.696,0.72,0.677,0.984,0.573,0.667
+MK01_HUMAN_Brenan_2016,0.951,0.939,0.977,1.005,1.01,1.016,0.993,1.006,0.89,1.007
+MSH2_HUMAN_Jia_2020,0.803,0.776,0.811,0.801,0.771,0.828,0.783,1.007,0.639,0.763
+MTH3_HAEAE_RockahShmuel_2015,0.518,0.505,0.499,0.542,0.552,0.686,0.547,0.986,0.541,0.69
+NCAP_I34A1_Doud_2015,1.057,0.878,0.886,1.011,0.871,0.835,0.826,0.991,0.801,0.788
+NRAM_I33A0_Jiang_2016,1.184,0.541,0.829,1.116,0.661,0.76,0.689,1.082,0.517,0.653
+NUD15_HUMAN_Suiter_2020,0.541,0.438,0.632,0.6,0.521,0.63,0.587,0.954,0.408,0.571
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.615,0.738,0.908,0.71,0.888,0.775,0.793,1.02,0.551,0.681
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.751,0.775,1.035,0.867,1.028,0.938,0.96,1.08,0.698,0.785
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.519,0.661,0.984,0.761,1.015,0.9,0.902,1.079,0.417,0.642
+P53_HUMAN_Kotler_2018,0.657,0.635,0.705,0.693,0.685,0.701,0.681,0.973,0.628,0.676
+P84126_THETH_Chan_2017,0.534,0.404,0.557,0.644,0.534,0.676,0.615,0.939,0.424,0.648
+PA_I34A1_Wu_2015,1.078,1.498,0.874,1.053,0.996,0.86,0.84,1.037,1.013,0.873
+PABP_YEAST_Melamed_2013,0.443,0.466,0.69,0.538,0.535,0.53,0.534,0.942,0.436,0.457
+POLG_CXB3N_Mattenberger_2021,1.057,0.711,0.803,1.066,0.707,0.839,0.77,0.999,0.598,0.826
+POLG_HCVJF_Qi_2014,0.675,0.483,0.782,0.626,0.623,0.712,0.75,0.961,0.531,0.845
+PTEN_HUMAN_Matreyek_2021,0.751,0.736,0.852,0.84,0.821,0.898,0.825,0.986,0.633,0.796
+PTEN_HUMAN_Mighell_2018,0.669,0.683,0.681,0.744,0.789,0.807,0.665,1.008,0.538,0.71
+Q2N0S5_9HIV1_Haddox_2018,0.746,0.731,0.844,0.743,0.736,0.749,0.74,0.991,0.647,0.63
+Q59976_STRSQ_Romero_2015,0.686,0.629,0.638,0.728,0.639,0.644,0.608,0.993,0.549,0.634
+R1AB_SARS2_Flynn_2022,0.897,0.966,0.949,1.003,1.002,0.846,0.698,0.982,0.79,0.651
+RASH_HUMAN_Bandaru_2017,0.601,0.581,0.755,0.843,0.799,0.755,0.733,0.971,0.563,0.632
+REV_HV1H2_Fernandes_2016,0.828,0.794,0.945,0.94,0.93,0.939,0.942,0.993,0.944,0.905
+RL40A_YEAST_Mavor_2016,0.717,0.685,0.832,0.905,0.832,0.832,0.825,0.959,0.807,0.867
+RL40A_YEAST_Roscoe_2013,0.607,0.524,0.804,0.902,0.831,0.792,0.792,0.958,0.65,0.63
+RL40A_YEAST_Roscoe_2014,0.746,0.66,0.826,0.901,0.836,0.802,0.824,0.947,0.75,0.778
+SC6A4_HUMAN_Young_2021,0.747,0.721,0.839,0.76,0.715,0.792,0.755,1.008,0.618,0.722
+SCN5A_HUMAN_Glazer_2019,1.179,1.283,1.027,1.002,1.025,1.064,1.037,1.022,1.343,1.566
+SPG1_STRSG_Olson_2014,0.532,1.241,0.973,0.905,0.905,0.883,0.875,0.948,0.987,0.605
+SPIKE_SARS2_Starr_2020_binding,1.356,0.948,1.01,1.025,0.868,0.941,0.931,0.983,0.888,0.683
+SPIKE_SARS2_Starr_2020_expression,1.091,0.711,0.913,1.006,0.678,0.863,0.778,0.967,0.597,0.676
+SRC_HUMAN_Ahler_2019,0.711,0.905,0.861,0.771,0.911,0.813,0.818,0.985,0.79,0.731
+SUMO1_HUMAN_Weile_2017,0.742,0.691,0.811,0.777,0.75,0.836,0.783,0.977,0.811,0.893
+SYUA_HUMAN_Newberry_2020,0.399,0.442,1.075,0.941,1.084,0.999,1.01,1.145,0.466,0.405
+TADBP_HUMAN_Bolognesi_2019,0.368,0.669,1.114,1.119,1.121,1.106,1.114,1.093,0.395,0.479
+TAT_HV1BR_Fernandes_2016,0.801,0.74,0.967,0.904,0.923,0.983,0.964,1.02,0.761,0.913
+TPK1_HUMAN_Weile_2017,0.914,0.894,0.959,0.942,0.935,0.939,0.938,0.996,1.036,1.114
+TPMT_HUMAN_Matreyek_2018,0.629,0.662,0.763,0.735,0.788,0.807,0.769,0.999,0.622,0.871
+TPOR_HUMAN_Bridgford_2020,0.734,0.772,0.92,0.833,0.832,0.864,0.863,0.983,0.816,0.714
+TRPC_SACS2_Chan_2017,0.543,0.462,0.62,0.615,0.54,0.667,0.612,0.95,0.468,0.628
+TRPC_THEMA_Chan_2017,0.733,0.685,0.782,0.741,0.741,0.796,0.77,0.966,0.646,0.891
+UBC9_HUMAN_Weile_2017,0.782,0.698,0.781,0.838,0.79,0.827,0.773,0.977,0.738,0.844
+UBE4B_MOUSE_Starita_2013,0.765,0.933,0.812,0.766,0.918,0.882,0.798,1.005,0.929,0.871
+VKOR1_HUMAN_Chiasson_2020_abundance,0.567,0.516,0.813,0.795,0.726,0.658,0.681,0.972,0.516,0.602
+VKOR1_HUMAN_Chiasson_2020_activity,0.951,0.787,0.828,0.802,0.817,0.817,0.791,0.994,1.024,1.041
+YAP1_HUMAN_Araya_2012,0.759,0.805,0.705,0.921,1.009,0.85,0.751,0.965,0.887,0.874
+Q8WTC7_9CNID_Somermeyer_2022,0.953,0.74,0.982,1.014,0.908,0.943,0.925,0.995,0.752,0.947
+D7PM05_CLYGR_Somermeyer_2022,0.992,0.528,0.798,1.015,0.692,0.873,0.788,0.992,0.49,0.803
+Q6WV13_9MAXI_Somermeyer_2022,1.097,0.714,0.958,1.006,0.872,0.966,0.935,0.991,0.565,0.879
+SPG1_STRSG_Wu_2016,3.598,4.715,3.988,3.602,4.779,4.04,4.28,4.014,4.098,3.561
+CAS9_STRP1_Spencer_2017_positive,1.005,0.974,0.998,1.017,0.984,0.992,0.989,1.002,0.973,0.981
+A0A247D711_LISMN_Stadelmann_2021,0.984,0.678,1.02,1.028,0.931,0.972,1.024,1.006,0.738,0.918
+HXK4_HUMAN_Gersing_2022_activity,0.811,0.795,0.841,0.859,0.812,0.925,0.864,0.992,0.683,0.76
+ANCSZ_Hobbs_2022,0.59,0.688,1.009,0.681,0.74,0.654,0.659,0.977,0.549,0.52
+PAI1_HUMAN_Huttinger_2021,0.821,0.755,0.832,0.821,0.803,0.831,0.812,0.994,0.66,0.789
+HXK4_HUMAN_Gersing_2023_abundance,0.772,0.801,0.858,0.88,0.819,0.968,0.91,0.989,0.58,0.688
+PHOT_CHLRE_Chen_2023,0.43,0.434,0.62,0.587,0.62,0.686,0.695,0.968,0.393,0.572
+S22A1_HUMAN_Yee_2023_abundance,0.632,0.645,0.695,0.652,0.639,0.599,0.611,0.988,0.417,0.482
+S22A1_HUMAN_Yee_2023_activity,0.631,0.679,0.679,0.687,0.674,0.618,0.625,0.987,0.435,0.511
+KCNE1_HUMAN_Muhammad_2023_expression,0.531,0.742,0.983,0.965,0.987,0.959,0.961,0.969,0.559,0.537
+KCNE1_HUMAN_Muhammad_2023_function,0.443,0.419,0.727,0.643,0.706,0.545,0.523,0.937,0.416,0.407
+RNC_ECOLI_Weeks_2023,0.444,0.418,0.687,0.635,0.642,0.759,0.646,0.985,0.331,0.462
+HEM3_HUMAN_Loggerenberg_2023,0.811,0.789,0.833,0.849,0.803,0.845,0.816,0.99,0.756,0.781
+OXDA_RHOTO_Vanella_2023_expression,0.792,0.757,0.916,0.904,0.903,0.905,0.901,0.987,0.695,0.799
+OXDA_RHOTO_Vanella_2023_activity,0.868,0.842,0.883,0.903,0.9,0.942,0.911,0.994,0.804,0.909
+PRKN_HUMAN_Clausen_2023,0.518,0.747,0.685,0.624,0.756,0.6,0.559,0.992,0.442,0.487
+DYR_ECOLI_Nguyen_2023,0.546,0.498,0.702,0.614,0.719,0.814,0.74,0.984,0.504,0.692
+ENVZ_ECOLI_Ghose_2023,1.027,0.878,0.932,0.886,0.892,0.851,0.903,0.976,1.062,1.045
+MLAC_ECOLI_MacRae_2023,0.609,0.614,0.813,0.822,0.804,0.854,0.803,0.971,0.546,0.63
+Q837P5_ENTFA_Meier_2023,0.918,0.732,0.754,0.87,0.85,0.738,0.76,0.972,0.625,0.612
+Q837P4_ENTFA_Meier_2023,0.534,0.486,0.625,0.645,0.642,0.676,0.594,0.973,0.523,0.548
+CCR5_HUMAN_Gill_2023,0.835,0.847,0.914,0.874,0.862,0.84,0.879,0.993,0.773,0.775
+Q53Z42_HUMAN_McShan_2019_expression,0.706,0.618,0.72,0.773,0.677,0.724,0.689,0.975,0.649,0.687
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.645,0.685,0.881,0.899,0.888,0.906,0.879,0.996,0.57,0.621
+RPC1_LAMBD_Li_2019_low-expression,0.763,0.577,0.721,0.757,0.756,0.74,0.703,0.968,0.694,0.836
+RPC1_LAMBD_Li_2019_high-expression,0.644,0.559,0.696,0.673,0.685,0.7,0.657,1.007,0.661,0.735
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,2.013,1.105,1.038,1.485,1.343,1.184,1.171,1.338,1.167,1.235
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.874,0.859,0.912,0.905,0.879,0.908,0.902,1.012,0.775,0.766
+A4_HUMAN_Seuma_2022,0.622,0.824,0.781,0.869,0.841,0.725,0.745,0.935,0.644,0.626
+SRC_HUMAN_Nguyen_2022,0.785,0.825,0.886,0.837,0.848,0.874,0.877,0.976,0.8,0.814
+F7YBW7_MESOW_Ding_2023,1.528,1.298,1.632,1.564,1.244,1.51,1.549,1.933,1.604,1.517
+RASK_HUMAN_Weng_2022_abundance,0.625,0.605,0.948,0.978,0.946,0.872,0.921,0.971,0.549,0.537
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.489,0.485,0.714,0.77,0.627,0.742,0.715,0.968,0.408,0.436
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.825,0.857,0.924,0.895,0.9,0.915,0.915,0.987,0.851,0.825
+AICDA_HUMAN_Gajula_2014_3cycles,2.639,2.511,2.227,2.263,2.196,2.223,2.208,2.3,2.422,2.594
+OTC_HUMAN_Lo_2023,0.531,0.437,0.736,0.68,0.677,0.733,0.661,0.985,0.381,0.642
+CASP3_HUMAN_Roychowdhury_2020,0.608,0.538,0.625,0.644,0.581,0.878,0.632,0.985,0.634,1.258
+CASP7_HUMAN_Roychowdhury_2020,0.67,0.615,0.658,0.668,0.642,0.755,0.679,0.985,0.731,0.75
+SERC_HUMAN_Xie_2023,0.545,0.564,0.65,0.623,0.639,0.596,0.639,1.002,0.482,0.518
+MTHR_HUMAN_Weile_2021,0.653,0.9,0.904,0.873,0.892,0.905,0.89,0.993,0.5,0.593
+AMFR_HUMAN_Rocklin_2023_4G3O,0.592,0.388,0.736,0.934,0.739,0.81,0.739,0.937,0.366,0.564
+ARGR_ECOLI_Rocklin_2023_1AOY,0.465,0.34,0.727,0.846,0.742,0.684,0.653,0.965,0.308,0.389
+BBC1_YEAST_Rocklin_2023_1TG0,0.514,0.455,0.769,0.722,0.808,0.763,0.749,0.939,0.428,0.569
+BCHB_CHLTE_Rocklin_2023_2KRU,0.61,0.462,0.891,0.913,0.769,0.855,0.879,0.992,0.432,0.689
+CATR_CHLRE_Rocklin_2023_2AMI,0.315,0.355,0.781,0.849,0.929,0.751,0.757,0.958,0.27,0.287
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.328,0.302,0.756,0.702,0.678,0.657,0.692,0.969,0.253,0.415
+CBX4_HUMAN_Rocklin_2023_2K28,0.508,0.349,0.706,0.742,0.604,0.678,0.611,0.956,0.314,0.565
+CSN4_MOUSE_Rocklin_2023_1UFM,0.347,0.305,0.771,0.601,0.759,0.645,0.676,0.93,0.255,0.398
+CUE1_YEAST_Rocklin_2023_2MYX,0.483,0.371,0.706,0.784,0.729,0.792,0.71,0.92,0.359,0.573
+DN7A_SACS2_Rocklin_2023_1JIC,0.605,0.387,0.964,0.979,0.914,0.969,0.971,0.994,0.348,0.539
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.263,0.327,0.732,0.765,0.837,0.563,0.664,0.929,0.244,0.286
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.457,0.417,0.852,0.638,0.646,0.696,0.735,0.94,0.394,0.611
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.26,0.218,0.64,0.48,0.559,0.604,0.591,0.943,0.185,0.4
+FECA_ECOLI_Rocklin_2023_2D1U,0.699,0.451,0.827,0.937,0.813,0.924,0.887,0.957,0.443,0.676
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.678,0.436,0.768,0.95,0.868,0.865,0.807,0.959,0.388,0.521
+HCP_LAMBD_Rocklin_2023_2L6Q,0.299,0.301,0.938,0.782,0.823,0.655,0.675,0.949,0.27,0.344
+HECD1_HUMAN_Rocklin_2023_3DKM,0.523,0.279,0.616,0.854,0.553,0.686,0.621,0.92,0.281,0.449
+ILF3_HUMAN_Rocklin_2023_2L33,0.353,0.303,0.78,0.779,0.73,0.749,0.749,0.997,0.278,0.393
+ISDH_STAAW_Rocklin_2023_2LHR,0.532,0.641,1.001,0.909,0.962,0.99,0.998,0.96,0.519,0.799
+MAFG_MOUSE_Rocklin_2023_1K1V,0.496,0.486,0.865,0.83,0.859,0.699,0.73,0.942,0.442,0.455
+MBD11_ARATH_Rocklin_2023_6ACV,0.522,0.376,0.701,0.898,0.717,0.768,0.692,0.905,0.305,0.535
+MYO3_YEAST_Rocklin_2023_2BTT,0.539,0.494,0.846,0.76,0.8,0.849,0.821,0.953,0.513,0.782
+NKX31_HUMAN_Rocklin_2023_2L9R,0.371,0.294,0.627,0.696,0.593,0.576,0.567,0.935,0.262,0.365
+NUSA_ECOLI_Rocklin_2023_1WCL,0.28,0.255,0.843,0.821,0.8,0.718,0.698,0.962,0.209,0.319
+NUSG_MYCTU_Rocklin_2023_2MI6,0.347,0.23,0.698,0.874,0.692,0.709,0.71,0.967,0.193,0.317
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.283,0.321,0.596,0.541,0.609,0.619,0.58,1.017,0.264,0.457
+ODP2_GEOSE_Rocklin_2023_1W4G,0.975,0.516,0.851,0.977,0.751,0.814,0.809,0.971,0.49,0.78
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.378,0.827,0.985,0.598,0.976,0.892,0.954,0.957,0.607,0.749
+PIN1_HUMAN_Rocklin_2023_1I6C,0.458,0.305,0.548,0.613,0.495,0.473,0.464,0.92,0.303,0.495
+PITX2_HUMAN_Rocklin_2023_2L7M,0.536,0.383,0.687,0.704,0.623,0.62,0.642,0.906,0.365,0.491
+PKN1_HUMAN_Rocklin_2023_1URF,0.36,0.349,0.92,0.865,0.909,0.875,0.874,0.99,0.308,0.393
+POLG_PESV_Rocklin_2023_2MXD,0.52,0.36,0.951,0.973,0.751,0.94,0.924,0.941,0.332,0.515
+PR40A_HUMAN_Rocklin_2023_1UZC,0.531,0.273,0.743,0.904,0.592,0.72,0.668,0.93,0.269,0.521
+PSAE_SYNP2_Rocklin_2023_1PSE,0.421,0.569,0.873,0.789,0.927,0.908,0.88,0.948,0.507,0.56
+RAD_ANTMA_Rocklin_2023_2CJJ,0.534,0.496,0.98,0.871,0.737,0.852,0.881,0.95,0.491,0.783
+RBP1_HUMAN_Rocklin_2023_2KWH,0.268,0.38,0.86,0.78,0.944,0.828,0.848,0.954,0.326,0.417
+RCD1_ARATH_Rocklin_2023_5OAO,0.364,0.412,0.914,0.961,0.902,0.94,0.912,0.965,0.397,0.565
+RCRO_LAMBD_Rocklin_2023_1ORC,0.436,0.436,0.761,0.825,0.721,0.754,0.709,0.973,0.382,0.472
+RD23A_HUMAN_Rocklin_2023_1IFY,0.243,0.255,0.719,0.59,0.619,0.629,0.646,0.941,0.199,0.363
+RFAH_ECOLI_Rocklin_2023_2LCL,0.603,0.476,0.881,0.903,0.888,0.934,0.914,0.955,0.491,0.65
+RL20_AQUAE_Rocklin_2023_1GYZ,0.478,0.506,0.882,0.838,0.919,0.827,0.853,1.009,0.488,0.511
+RPC1_BP434_Rocklin_2023_1R69,0.265,0.261,0.644,0.456,0.577,0.504,0.666,0.921,0.259,0.375
+RS15_GEOSE_Rocklin_2023_1A32,0.289,0.343,0.882,0.868,0.825,0.817,0.875,0.99,0.385,0.431
+SAV1_MOUSE_Rocklin_2023_2YSB,0.568,0.529,0.768,0.638,0.652,0.669,0.698,0.919,0.47,0.705
+SBI_STAAM_Rocklin_2023_2JVG,0.614,0.389,0.825,0.908,0.684,0.847,0.803,0.916,0.377,0.622
+SCIN_STAAR_Rocklin_2023_2QFF,0.434,0.371,0.899,0.861,0.892,0.952,0.902,0.957,0.343,0.571
+SDA_BACSU_Rocklin_2023_1PV0,0.411,0.304,0.7,0.758,0.648,0.701,0.708,0.948,0.26,0.37
+SOX30_HUMAN_Rocklin_2023_7JJK,0.385,0.498,0.901,0.798,0.825,0.898,0.866,0.966,0.405,0.622
+SPA_STAAU_Rocklin_2023_1LP1,0.603,0.368,0.861,0.953,0.739,0.868,0.831,0.919,0.374,0.47
+SPG2_STRSG_Rocklin_2023_5UBS,0.627,0.409,0.918,0.954,0.946,0.961,0.891,0.939,0.416,0.581
+SPTN1_CHICK_Rocklin_2023_1TUD,0.428,0.444,0.752,0.667,0.746,0.728,0.72,0.941,0.416,0.511
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.419,0.278,0.615,0.823,0.646,0.66,0.609,0.956,0.227,0.385
+SR43C_ARATH_Rocklin_2023_2N88,0.369,0.417,0.641,0.699,0.693,0.666,0.629,0.976,0.33,0.441
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.512,0.347,0.664,0.701,0.559,0.64,0.601,0.944,0.307,0.526
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.242,0.371,0.726,0.474,0.533,0.521,0.586,0.933,0.313,0.337
+THO1_YEAST_Rocklin_2023_2WQG,0.588,0.519,0.891,0.827,0.823,0.88,0.869,0.909,0.455,0.666
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.309,0.329,0.637,0.564,0.701,0.602,0.598,0.922,0.297,0.408
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.468,0.399,0.72,0.663,0.687,0.811,0.725,0.944,0.388,0.72
+UBR5_HUMAN_Rocklin_2023_1I2T,0.36,0.281,0.794,0.835,0.794,0.662,0.672,0.949,0.286,0.336
+VG08_BPP22_Rocklin_2023_2GP8,0.29,0.398,0.858,0.671,0.758,0.707,0.753,0.918,0.388,0.455
+VILI_CHICK_Rocklin_2023_1YU5,0.421,0.459,0.681,0.655,0.642,0.792,0.668,0.941,0.478,0.616
+VRPI_BPT7_Rocklin_2023_2WNM,0.339,0.458,0.925,0.785,0.873,0.963,0.963,0.92,0.423,0.561
+YAIA_ECOLI_Rocklin_2023_2KVT,0.512,0.455,0.901,0.878,0.873,0.874,0.841,0.925,0.459,0.666
+YNZC_BACSU_Rocklin_2023_2JVD,0.419,0.517,0.86,0.714,0.827,0.846,0.854,0.945,0.401,0.699
+POLG_DEN26_Suphatrakul_2023,0.957,0.712,0.909,1.022,0.774,0.982,0.96,0.994,0.469,0.685
+MET_HUMAN_Estevam_2023,0.558,0.542,0.629,0.614,0.575,0.658,0.614,0.978,0.517,0.572
+RDRP_I33A0_Li_2023,0.985,0.735,0.817,1.008,0.732,0.786,0.707,0.993,0.618,0.732
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.904,0.911,0.91,0.905,0.927,0.942,0.917,0.994,0.849,0.904
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.736,0.743,0.897,0.865,0.867,0.85,0.843,0.985,0.562,0.629
+ACE2_HUMAN_Chan_2020,0.883,0.927,0.936,0.951,0.924,1.498,1.305,0.956,0.777,1.83
+PPM1D_HUMAN_Miller_2022,0.657,0.746,0.753,0.66,0.752,0.752,0.707,0.987,0.605,0.664
+HMDH_HUMAN_Jiang_2019,0.657,0.921,0.863,0.88,0.931,0.946,0.902,0.993,0.549,0.629
+CBS_HUMAN_Sun_2020,0.854,0.866,0.862,0.888,0.87,0.92,0.885,0.994,0.787,0.851
+NPC1_HUMAN_Erwood_2022_HEK293T,0.775,0.458,0.602,0.833,0.468,0.614,0.537,0.969,0.446,0.565
+NPC1_HUMAN_Erwood_2022_RPE1,1.289,0.447,0.621,0.955,0.524,0.551,0.463,1.095,0.506,0.573
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.913,1.188,1.019,1.204,1.238,1.034,1.044,1.175,1.271,0.934
+SHOC2_HUMAN_Kwon_2022,0.808,0.84,0.86,0.832,0.834,0.859,0.845,0.997,0.688,0.775
+RAF1_HUMAN_Zinkus-Boltz_2019,0.964,0.759,0.883,0.837,0.809,0.797,0.795,1.047,0.831,1.156
+OPSD_HUMAN_Wan_2019,0.637,0.483,0.683,0.698,0.624,0.746,0.715,0.994,0.485,0.69
+CAR11_HUMAN_Meitlis_2020_lof,0.624,0.89,0.922,0.753,0.915,0.927,0.917,0.958,0.67,0.687
+CAR11_HUMAN_Meitlis_2020_gof,0.866,1.019,1.027,0.96,1.005,1.02,1.022,1.001,0.909,0.913
+ERBB2_HUMAN_Elazar_2016,0.65,0.641,1.074,0.879,0.969,0.812,0.914,1.032,0.731,0.549
+GLPA_HUMAN_Elazar_2016,0.756,0.763,0.959,0.835,0.88,0.874,0.876,0.996,0.745,0.663
+LYAM1_HUMAN_Elazar_2016,0.731,0.673,1.018,0.854,0.824,0.878,0.893,0.977,0.782,0.779
+GDIA_HUMAN_Silverstein_2021,0.678,0.682,0.799,0.759,0.792,0.733,0.738,1.009,0.772,0.78
+PPARG_HUMAN_Majithia_2016,0.377,0.557,0.579,0.533,0.616,0.63,0.547,0.98,0.237,0.391
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.906,0.934,0.961,0.995,0.952,0.97,0.958,0.986,0.735,0.794
diff --git a/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_contiguous_5.csv b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_contiguous_5.csv
new file mode 100644
index 0000000..803111d
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_contiguous_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,1.631,1.063,1.2,1.247,1.076,1.128,1.114,1.156,1.055,1.264
+A0A192B1T2_9HIV1_Haddox_2018,0.79,0.753,0.842,0.79,0.765,0.765,0.754,1.041,0.766,0.763
+A0A1I9GEU1_NEIME_Kennouche_2019,1.189,1.137,1.023,1.03,1.019,1.032,1.028,1.014,1.1,1.96
+A0A2Z5U3Z0_9INFA_Doud_2016,0.749,0.879,0.759,0.764,0.749,0.738,0.706,1.027,0.962,0.731
+A0A2Z5U3Z0_9INFA_Wu_2014,0.989,1.261,0.987,0.995,0.986,0.976,0.974,1.048,1.093,0.979
+A4D664_9INFA_Soh_2019,1.141,1.034,1.01,1.108,1.034,1.007,0.977,1.098,1.039,1.005
+A4GRB6_PSEAI_Chen_2020,0.522,0.452,0.598,0.599,0.572,0.625,0.573,1.061,0.431,0.653
+AACC1_PSEAI_Dandage_2018,0.83,0.8,0.863,0.795,0.795,0.855,0.814,1.037,0.844,1.087
+ADRB2_HUMAN_Jones_2020,0.77,0.896,0.798,0.773,0.823,0.797,0.78,1.053,0.846,0.858
+AMIE_PSEAE_Wrenbeck_2017,0.655,0.731,0.856,0.7,0.773,0.973,0.889,1.083,0.692,1.01
+LGK_LIPST_Klesmith_2015,0.732,0.695,0.824,0.772,0.749,0.874,0.799,1.031,0.725,0.976
+BLAT_ECOLX_Deng_2012,0.806,0.715,0.753,0.784,0.77,0.835,0.754,1.017,0.762,0.998
+BLAT_ECOLX_Firnberg_2014,0.549,0.438,0.505,0.61,0.574,0.703,0.534,1.002,0.465,0.752
+BLAT_ECOLX_Jacquier_2013,0.491,0.42,0.536,0.598,0.573,0.603,0.509,1.019,0.469,0.704
+BLAT_ECOLX_Stiffler_2015,0.485,0.413,0.522,0.593,0.574,0.683,0.536,1.005,0.465,0.689
+BRCA1_HUMAN_Findlay_2018,0.762,0.777,0.873,0.75,0.78,0.726,0.728,1.011,0.72,0.921
+C6KNH7_9INFA_Lee_2018,0.802,0.847,0.893,0.841,0.889,0.842,0.828,1.024,0.832,0.858
+CALM1_HUMAN_Weile_2017,0.979,0.964,0.952,0.958,0.948,0.939,0.944,1.006,1.008,1.311
+CAPSD_AAV2S_Sinai_2021,0.652,0.688,0.71,1.095,0.732,0.83,0.789,1.097,0.62,0.667
+CCDB_ECOLI_Adkar_2012,1.015,0.844,0.773,1.046,0.915,0.838,0.761,1.133,0.898,1.224
+CCDB_ECOLI_Tripathi_2016,0.949,0.777,0.782,0.939,0.839,0.725,0.73,1.118,0.753,1.26
+CP2C9_HUMAN_Amorosi_2021_abundance,0.623,0.652,0.662,0.669,0.673,0.615,0.618,1.026,0.584,0.636
+CP2C9_HUMAN_Amorosi_2021_activity,0.606,0.599,0.624,0.614,0.665,0.643,0.622,1.013,0.539,0.692
+DLG4_HUMAN_Faure_2021,0.706,0.753,0.871,0.86,0.775,0.618,0.672,1.013,0.782,0.726
+DLG4_RAT_McLaughlin_2012,0.686,0.798,0.842,0.756,0.839,0.825,0.79,1.065,0.669,0.775
+DYR_ECOLI_Thompson_2019,0.85,0.719,0.801,0.856,0.793,0.922,0.812,1.061,0.709,1.11
+ENV_HV1B9_DuenasDecamp_2016,1.007,0.962,0.968,0.898,0.962,0.918,0.925,1.184,1.41,1.033
+ENV_HV1BR_Haddox_2016,0.966,0.954,0.961,0.974,0.964,0.953,0.951,1.006,0.958,0.947
+ESTA_BACSU_Nutschel_2020,0.99,0.804,0.874,0.967,0.884,0.99,0.907,1.042,0.802,1.306
+F7YBW8_MESOW_Aakre_2015,1.323,2.475,1.794,1.967,2.13,1.386,1.521,3.314,3.154,0.576
+GAL4_YEAST_Kitzman_2015,0.962,0.69,0.79,0.857,0.744,0.852,0.919,1.031,0.809,0.921
+GCN4_YEAST_Staller_2018,4.433,2.143,2.398,2.845,2.766,3.013,2.646,2.941,2.413,2.566
+GFP_AEQVI_Sarkisyan_2016,1.08,0.898,0.988,1.137,1.005,0.992,0.931,1.136,1.061,1.01
+GRB2_HUMAN_Faure_2021,0.59,0.726,0.714,0.703,0.671,0.864,0.733,1.023,0.717,1.012
+HIS7_YEAST_Pokusaeva_2019,1.507,1.686,1.153,1.115,1.118,1.069,1.15,1.208,1.293,1.184
+HSP82_YEAST_Flynn_2019,0.803,0.814,0.815,0.823,0.816,0.775,0.78,1.03,0.677,0.675
+HSP82_YEAST_Mishra_2016,0.811,0.966,0.851,0.844,0.963,0.773,0.776,1.269,0.954,0.832
+I6TAH8_I68A0_Doud_2015,1.189,0.917,0.998,1.061,0.932,0.927,0.891,1.045,1.053,0.941
+IF1_ECOLI_Kelsic_2016,0.768,0.601,0.676,0.723,0.885,0.646,0.683,1.055,0.794,0.862
+KCNH2_HUMAN_Kozek_2020,0.782,0.979,1.063,1.132,1.078,0.763,0.814,1.096,0.982,0.967
+KKA2_KLEPN_Melnikov_2014,0.668,0.602,0.704,0.719,0.721,0.747,0.702,1.042,0.635,0.772
+MK01_HUMAN_Brenan_2016,1.011,1.005,1.004,1.033,1.039,1.048,1.02,1.042,1.078,1.233
+MSH2_HUMAN_Jia_2020,0.826,0.799,0.839,0.826,0.798,0.859,0.81,1.042,0.783,0.969
+MTH3_HAEAE_RockahShmuel_2015,0.531,0.52,0.51,0.556,0.567,0.711,0.563,1.017,0.516,0.789
+NCAP_I34A1_Doud_2015,1.14,0.878,0.894,1.024,0.881,0.845,0.834,1.005,0.928,0.865
+NRAM_I33A0_Jiang_2016,1.727,0.815,1.194,1.594,0.939,1.097,1.016,1.517,0.788,1.069
+NUD15_HUMAN_Suiter_2020,0.66,0.52,0.673,0.644,0.554,0.672,0.627,1.023,0.511,0.754
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.715,0.865,1.0,0.759,0.984,0.845,0.877,1.131,0.738,0.984
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,1.017,1.121,1.25,1.03,1.249,1.126,1.16,1.305,1.088,1.24
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.74,1.005,1.212,0.897,1.265,1.123,1.124,1.334,0.712,1.185
+P53_HUMAN_Kotler_2018,0.744,0.66,0.738,0.729,0.722,0.737,0.715,1.029,0.673,0.789
+P84126_THETH_Chan_2017,0.672,0.464,0.622,0.73,0.602,0.758,0.693,1.024,0.575,0.916
+PA_I34A1_Wu_2015,1.142,1.52,0.902,1.09,1.026,0.887,0.868,1.073,1.056,0.91
+PABP_YEAST_Melamed_2013,0.562,0.55,0.758,0.59,0.594,0.583,0.585,1.048,0.557,0.622
+POLG_CXB3N_Mattenberger_2021,1.366,0.718,0.809,1.086,0.717,0.851,0.78,1.018,0.669,1.109
+POLG_HCVJF_Qi_2014,0.927,0.552,0.813,0.652,0.643,0.746,0.803,1.005,0.68,1.405
+PTEN_HUMAN_Matreyek_2021,0.836,0.786,0.892,0.87,0.856,0.933,0.859,1.023,0.8,1.116
+PTEN_HUMAN_Mighell_2018,0.762,0.731,0.717,0.79,0.844,0.852,0.695,1.078,0.71,0.963
+Q2N0S5_9HIV1_Haddox_2018,0.755,0.743,0.864,0.76,0.752,0.768,0.759,1.021,0.682,0.713
+Q59976_STRSQ_Romero_2015,0.722,0.633,0.649,0.745,0.65,0.653,0.618,1.007,0.554,0.658
+R1AB_SARS2_Flynn_2022,1.067,1.058,1.013,1.071,1.07,0.908,0.742,1.057,1.114,0.941
+RASH_HUMAN_Bandaru_2017,0.771,0.66,0.803,0.893,0.851,0.803,0.783,1.044,0.72,0.925
+REV_HV1H2_Fernandes_2016,0.994,0.956,1.02,1.018,1.006,1.011,1.016,1.088,1.149,1.104
+RL40A_YEAST_Mavor_2016,0.919,0.855,0.908,0.999,0.915,0.915,0.901,1.077,1.116,1.101
+RL40A_YEAST_Roscoe_2013,0.886,0.706,0.891,1.017,0.932,0.887,0.882,1.088,1.016,0.9
+RL40A_YEAST_Roscoe_2014,0.953,0.833,0.895,0.98,0.905,0.874,0.892,1.043,0.978,1.101
+SC6A4_HUMAN_Young_2021,0.782,0.742,0.869,0.795,0.742,0.83,0.784,1.049,0.685,0.895
+SCN5A_HUMAN_Glazer_2019,1.128,1.195,1.026,0.997,1.013,1.056,1.033,1.02,1.268,1.554
+SPG1_STRSG_Olson_2014,0.849,1.493,1.196,1.118,1.12,1.098,1.09,1.185,1.286,0.897
+SPIKE_SARS2_Starr_2020_binding,2.309,1.233,1.094,1.106,0.949,1.018,1.013,1.072,1.244,0.881
+SPIKE_SARS2_Starr_2020_expression,1.52,0.733,0.981,1.076,0.733,0.928,0.836,1.048,0.95,1.039
+SRC_HUMAN_Ahler_2019,0.795,0.931,0.91,0.816,0.963,0.861,0.865,1.058,1.045,0.953
+SUMO1_HUMAN_Weile_2017,0.827,0.76,0.862,0.813,0.784,0.904,0.835,1.06,0.86,1.029
+SYUA_HUMAN_Newberry_2020,0.688,0.774,1.38,1.229,1.411,1.289,1.297,1.501,0.899,0.665
+TADBP_HUMAN_Bolognesi_2019,0.519,0.863,1.358,1.358,1.364,1.36,1.373,1.33,0.557,0.776
+TAT_HV1BR_Fernandes_2016,0.963,0.864,1.121,1.033,1.061,1.151,1.122,1.202,0.831,1.14
+TPK1_HUMAN_Weile_2017,0.924,0.929,0.967,0.951,0.944,0.95,0.947,1.003,1.008,1.237
+TPMT_HUMAN_Matreyek_2018,0.682,0.676,0.798,0.765,0.83,0.86,0.811,1.067,0.683,1.346
+TPOR_HUMAN_Bridgford_2020,0.847,0.886,1.018,0.93,0.917,0.972,0.974,1.072,0.961,0.865
+TRPC_SACS2_Chan_2017,0.638,0.515,0.673,0.665,0.584,0.752,0.672,1.024,0.536,0.742
+TRPC_THEMA_Chan_2017,0.878,0.859,0.856,0.813,0.807,0.871,0.842,1.042,0.871,1.323
+UBC9_HUMAN_Weile_2017,0.921,0.813,0.834,0.901,0.846,0.878,0.823,1.039,0.805,1.029
+UBE4B_MOUSE_Starita_2013,0.826,0.985,0.85,0.804,0.996,0.955,0.844,1.083,0.984,0.969
+VKOR1_HUMAN_Chiasson_2020_abundance,0.68,0.624,0.854,0.835,0.763,0.692,0.721,1.012,0.601,0.868
+VKOR1_HUMAN_Chiasson_2020_activity,1.074,0.815,0.85,0.82,0.833,0.838,0.811,1.031,0.94,1.019
+YAP1_HUMAN_Araya_2012,1.03,0.944,0.787,1.06,1.172,0.969,0.843,1.133,1.271,1.203
+Q8WTC7_9CNID_Somermeyer_2022,1.041,0.784,1.016,1.052,0.946,0.975,0.96,1.04,0.935,1.061
+D7PM05_CLYGR_Somermeyer_2022,1.154,0.568,0.834,1.07,0.73,0.916,0.826,1.049,0.575,0.877
+Q6WV13_9MAXI_Somermeyer_2022,1.295,0.759,1.003,1.053,0.914,1.009,0.979,1.05,0.689,0.998
+SPG1_STRSG_Wu_2016,5.213,6.822,5.62,5.074,6.821,5.72,6.056,5.653,5.922,5.103
+CAS9_STRP1_Spencer_2017_positive,1.013,0.978,1.003,1.021,0.986,0.994,0.992,1.004,0.977,0.986
+A0A247D711_LISMN_Stadelmann_2021,1.289,0.813,1.146,1.155,1.05,1.085,1.164,1.147,0.865,1.119
+HXK4_HUMAN_Gersing_2022_activity,0.832,0.802,0.858,0.875,0.827,0.949,0.882,1.016,0.725,0.829
+ANCSZ_Hobbs_2022,0.634,0.702,1.011,0.704,0.767,0.675,0.68,1.013,0.684,0.604
+PAI1_HUMAN_Huttinger_2021,0.829,0.767,0.846,0.831,0.813,0.845,0.825,1.015,0.7,0.82
+HXK4_HUMAN_Gersing_2023_abundance,0.778,0.808,0.87,0.891,0.832,1.002,0.935,1.006,0.623,0.758
+PHOT_CHLRE_Chen_2023,0.51,0.535,0.656,0.628,0.666,0.739,0.74,1.033,0.478,0.727
+S22A1_HUMAN_Yee_2023_abundance,0.637,0.649,0.707,0.662,0.652,0.61,0.622,1.004,0.485,0.517
+S22A1_HUMAN_Yee_2023_activity,0.64,0.684,0.689,0.698,0.687,0.628,0.634,1.004,0.492,0.548
+KCNE1_HUMAN_Muhammad_2023_expression,0.632,0.848,1.05,1.039,1.059,1.034,1.035,1.045,0.694,0.665
+KCNE1_HUMAN_Muhammad_2023_function,0.539,0.473,0.774,0.688,0.747,0.579,0.557,1.009,0.505,0.47
+RNC_ECOLI_Weeks_2023,0.493,0.468,0.728,0.676,0.68,0.806,0.677,1.043,0.397,0.555
+HEM3_HUMAN_Loggerenberg_2023,0.825,0.791,0.843,0.859,0.811,0.854,0.824,1.004,0.769,0.807
+OXDA_RHOTO_Vanella_2023_expression,0.825,0.771,0.936,0.924,0.923,0.926,0.923,1.013,0.73,0.846
+OXDA_RHOTO_Vanella_2023_activity,0.896,0.853,0.898,0.92,0.917,0.962,0.932,1.02,0.834,0.948
+PRKN_HUMAN_Clausen_2023,0.573,0.761,0.722,0.656,0.791,0.638,0.583,1.05,0.596,0.607
+DYR_ECOLI_Nguyen_2023,0.609,0.564,0.74,0.643,0.757,0.855,0.774,1.049,0.617,0.864
+ENVZ_ECOLI_Ghose_2023,1.249,1.025,0.993,0.932,0.942,0.899,0.965,1.037,1.282,1.257
+MLAC_ECOLI_MacRae_2023,0.708,0.704,0.842,0.855,0.837,0.886,0.831,1.016,0.675,0.764
+Q837P5_ENTFA_Meier_2023,1.31,0.885,0.872,1.026,1.015,0.879,0.891,1.161,0.832,0.793
+Q837P4_ENTFA_Meier_2023,0.689,0.565,0.74,0.765,0.762,0.805,0.693,1.221,0.715,0.702
+CCR5_HUMAN_Gill_2023,0.85,0.854,0.926,0.883,0.872,0.848,0.89,1.007,0.804,0.805
+Q53Z42_HUMAN_McShan_2019_expression,0.724,0.641,0.749,0.806,0.702,0.747,0.715,1.019,0.768,0.793
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.708,0.764,0.966,0.982,0.967,0.986,0.961,1.101,0.657,0.733
+RPC1_LAMBD_Li_2019_low-expression,0.831,0.611,0.754,0.788,0.803,0.782,0.738,1.027,0.727,0.949
+RPC1_LAMBD_Li_2019_high-expression,0.708,0.598,0.732,0.713,0.735,0.744,0.693,1.069,0.708,0.867
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,2.424,1.27,1.12,1.664,1.53,1.321,1.315,1.461,1.343,1.402
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.901,0.88,0.946,0.945,0.918,0.942,0.937,1.064,0.881,0.9
+A4_HUMAN_Seuma_2022,0.705,0.857,0.828,0.947,0.91,0.773,0.793,1.039,0.763,0.71
+SRC_HUMAN_Nguyen_2022,0.817,0.847,0.912,0.86,0.871,0.899,0.902,1.014,0.908,0.933
+F7YBW7_MESOW_Ding_2023,2.018,1.665,1.839,1.763,1.417,1.695,1.756,2.206,2.049,2.027
+RASK_HUMAN_Weng_2022_abundance,0.724,0.679,0.987,1.02,0.985,0.909,0.959,1.019,0.656,0.644
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.594,0.555,0.754,0.815,0.665,0.785,0.755,1.028,0.552,0.581
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.894,0.894,0.961,0.93,0.934,0.949,0.949,1.034,0.983,0.985
+AICDA_HUMAN_Gajula_2014_3cycles,3.15,3.057,2.588,2.592,2.514,2.546,2.546,2.622,3.042,3.027
+OTC_HUMAN_Lo_2023,0.571,0.458,0.755,0.701,0.701,0.752,0.679,1.009,0.455,0.721
+CASP3_HUMAN_Roychowdhury_2020,0.624,0.543,0.635,0.656,0.591,0.875,0.643,1.005,0.619,1.271
+CASP7_HUMAN_Roychowdhury_2020,0.672,0.623,0.668,0.676,0.649,0.767,0.688,1.006,0.79,0.798
+SERC_HUMAN_Xie_2023,0.594,0.59,0.667,0.64,0.657,0.612,0.657,1.034,0.588,0.563
+MTHR_HUMAN_Weile_2021,0.671,0.912,0.927,0.894,0.914,0.925,0.912,1.024,0.532,0.632
+AMFR_HUMAN_Rocklin_2023_4G3O,0.81,0.532,0.848,1.054,0.855,0.914,0.838,1.074,0.515,0.779
+ARGR_ECOLI_Rocklin_2023_1AOY,0.671,0.499,0.862,0.988,0.9,0.798,0.762,1.138,0.407,0.545
+BBC1_YEAST_Rocklin_2023_1TG0,0.659,0.602,0.856,0.799,0.904,0.844,0.835,1.047,0.562,0.731
+BCHB_CHLTE_Rocklin_2023_2KRU,0.951,0.683,1.075,1.068,0.915,1.013,1.05,1.185,0.667,1.025
+CATR_CHLRE_Rocklin_2023_2AMI,0.443,0.484,0.858,0.954,1.053,0.833,0.837,1.065,0.357,0.418
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.45,0.436,0.881,0.818,0.792,0.759,0.802,1.137,0.363,0.589
+CBX4_HUMAN_Rocklin_2023_2K28,0.678,0.459,0.825,0.861,0.713,0.76,0.696,1.127,0.401,0.775
+CSN4_MOUSE_Rocklin_2023_1UFM,0.485,0.421,0.876,0.682,0.862,0.732,0.768,1.05,0.374,0.563
+CUE1_YEAST_Rocklin_2023_2MYX,0.679,0.522,0.797,0.894,0.83,0.891,0.803,1.053,0.497,0.778
+DN7A_SACS2_Rocklin_2023_1JIC,0.84,0.523,1.084,1.099,1.034,1.084,1.092,1.129,0.453,0.694
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.349,0.499,0.879,0.904,0.987,0.66,0.793,1.081,0.351,0.416
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.579,0.518,0.937,0.703,0.702,0.757,0.809,1.064,0.492,0.768
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.367,0.298,0.759,0.561,0.651,0.705,0.697,1.106,0.249,0.551
+FECA_ECOLI_Rocklin_2023_2D1U,0.916,0.573,0.939,1.048,0.913,1.037,1.002,1.096,0.593,0.856
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.913,0.567,0.855,1.058,0.962,0.96,0.894,1.081,0.513,0.715
+HCP_LAMBD_Rocklin_2023_2L6Q,0.409,0.405,1.111,0.914,0.959,0.778,0.803,1.14,0.37,0.479
+HECD1_HUMAN_Rocklin_2023_3DKM,0.678,0.345,0.691,0.935,0.607,0.75,0.684,1.042,0.363,0.551
+ILF3_HUMAN_Rocklin_2023_2L33,0.492,0.436,0.87,0.861,0.809,0.825,0.824,1.101,0.404,0.572
+ISDH_STAAW_Rocklin_2023_2LHR,0.714,0.885,1.146,1.034,1.119,1.138,1.15,1.111,0.657,1.129
+MAFG_MOUSE_Rocklin_2023_1K1V,0.674,0.627,0.985,0.943,0.96,0.785,0.82,1.082,0.596,0.611
+MBD11_ARATH_Rocklin_2023_6ACV,0.738,0.531,0.8,1.017,0.822,0.87,0.788,1.047,0.422,0.706
+MYO3_YEAST_Rocklin_2023_2BTT,0.677,0.645,0.946,0.84,0.889,0.941,0.915,1.08,0.697,1.096
+NKX31_HUMAN_Rocklin_2023_2L9R,0.474,0.356,0.725,0.788,0.674,0.65,0.645,1.076,0.32,0.451
+NUSA_ECOLI_Rocklin_2023_1WCL,0.379,0.377,1.039,1.009,0.994,0.865,0.836,1.17,0.304,0.448
+NUSG_MYCTU_Rocklin_2023_2MI6,0.504,0.322,0.824,1.042,0.815,0.826,0.837,1.164,0.28,0.433
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.401,0.45,0.739,0.658,0.748,0.758,0.712,1.276,0.357,0.603
+ODP2_GEOSE_Rocklin_2023_1W4G,1.312,0.7,1.049,1.162,0.901,0.964,0.98,1.152,0.669,1.031
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.462,1.16,1.105,0.643,1.103,0.983,1.069,1.058,0.8,1.012
+PIN1_HUMAN_Rocklin_2023_1I6C,0.652,0.386,0.65,0.721,0.582,0.527,0.531,1.125,0.388,0.698
+PITX2_HUMAN_Rocklin_2023_2L7M,0.73,0.489,0.81,0.782,0.703,0.696,0.736,1.036,0.474,0.62
+PKN1_HUMAN_Rocklin_2023_1URF,0.467,0.469,1.045,0.982,1.033,0.999,0.995,1.131,0.436,0.547
+POLG_PESV_Rocklin_2023_2MXD,0.759,0.505,1.141,1.121,0.887,1.088,1.087,1.101,0.475,0.726
+PR40A_HUMAN_Rocklin_2023_1UZC,0.669,0.33,0.833,0.996,0.656,0.802,0.745,1.045,0.319,0.696
+PSAE_SYNP2_Rocklin_2023_1PSE,0.536,0.803,0.972,0.872,1.029,1.005,0.975,1.061,0.707,0.702
+RAD_ANTMA_Rocklin_2023_2CJJ,0.661,0.607,1.103,0.967,0.807,0.936,0.977,1.06,0.601,1.002
+RBP1_HUMAN_Rocklin_2023_2KWH,0.326,0.483,0.949,0.859,1.057,0.915,0.936,1.088,0.407,0.556
+RCD1_ARATH_Rocklin_2023_5OAO,0.458,0.508,1.033,1.057,1.005,1.042,1.025,1.087,0.507,0.729
+RCRO_LAMBD_Rocklin_2023_1ORC,0.593,0.636,0.879,0.931,0.822,0.865,0.814,1.109,0.514,0.632
+RD23A_HUMAN_Rocklin_2023_1IFY,0.33,0.324,0.818,0.666,0.703,0.7,0.724,1.077,0.25,0.48
+RFAH_ECOLI_Rocklin_2023_2LCL,0.751,0.6,0.949,0.964,0.949,0.995,0.98,1.041,0.612,0.835
+RL20_AQUAE_Rocklin_2023_1GYZ,0.698,0.754,1.018,0.974,1.075,0.953,0.989,1.167,0.706,0.751
+RPC1_BP434_Rocklin_2023_1R69,0.329,0.328,0.759,0.506,0.65,0.563,0.766,1.041,0.323,0.496
+RS15_GEOSE_Rocklin_2023_1A32,0.415,0.494,1.078,1.056,1.011,0.989,1.065,1.197,0.623,0.641
+SAV1_MOUSE_Rocklin_2023_2YSB,0.747,0.675,0.89,0.733,0.739,0.761,0.798,1.09,0.662,0.956
+SBI_STAAM_Rocklin_2023_2JVG,0.815,0.463,0.919,1.015,0.754,0.943,0.891,1.047,0.456,0.83
+SCIN_STAAR_Rocklin_2023_2QFF,0.583,0.503,0.981,0.928,0.973,1.037,0.985,1.065,0.484,0.761
+SDA_BACSU_Rocklin_2023_1PV0,0.604,0.445,0.82,0.879,0.742,0.822,0.832,1.093,0.344,0.501
+SOX30_HUMAN_Rocklin_2023_7JJK,0.522,0.704,1.008,0.898,0.928,0.996,0.969,1.087,0.549,0.876
+SPA_STAAU_Rocklin_2023_1LP1,0.778,0.495,0.967,1.071,0.828,0.965,0.926,1.065,0.508,0.601
+SPG2_STRSG_Rocklin_2023_5UBS,0.845,0.546,1.017,1.058,1.046,1.064,0.992,1.051,0.579,0.814
+SPTN1_CHICK_Rocklin_2023_1TUD,0.542,0.565,0.854,0.745,0.836,0.817,0.818,1.072,0.565,0.659
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.563,0.339,0.729,0.981,0.761,0.779,0.721,1.171,0.278,0.505
+SR43C_ARATH_Rocklin_2023_2N88,0.491,0.601,0.778,0.878,0.861,0.817,0.769,1.263,0.443,0.598
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.693,0.443,0.765,0.817,0.633,0.727,0.685,1.092,0.387,0.713
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.302,0.513,0.898,0.56,0.657,0.62,0.719,1.14,0.449,0.464
+THO1_YEAST_Rocklin_2023_2WQG,0.784,0.782,1.056,0.929,0.933,1.002,1.003,1.04,0.654,0.923
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.409,0.46,0.735,0.649,0.824,0.7,0.692,1.066,0.418,0.577
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.619,0.545,0.831,0.739,0.795,0.913,0.827,1.086,0.504,1.022
+UBR5_HUMAN_Rocklin_2023_1I2T,0.502,0.386,0.925,0.974,0.92,0.769,0.781,1.112,0.386,0.429
+VG08_BPP22_Rocklin_2023_2GP8,0.363,0.539,0.993,0.766,0.851,0.797,0.857,1.083,0.478,0.588
+VILI_CHICK_Rocklin_2023_1YU5,0.557,0.594,0.749,0.725,0.713,0.873,0.74,1.054,0.606,0.81
+VRPI_BPT7_Rocklin_2023_2WNM,0.448,0.607,1.073,0.905,0.996,1.102,1.107,1.077,0.581,0.768
+YAIA_ECOLI_Rocklin_2023_2KVT,0.671,0.605,0.992,0.962,0.955,0.958,0.922,1.035,0.565,0.866
+YNZC_BACSU_Rocklin_2023_2JVD,0.588,0.682,1.04,0.84,0.993,1.027,1.033,1.143,0.54,0.94
+POLG_DEN26_Suphatrakul_2023,1.002,0.713,0.922,1.04,0.787,1.045,1.021,1.012,0.528,0.711
+MET_HUMAN_Estevam_2023,0.593,0.548,0.64,0.625,0.584,0.681,0.626,1.006,0.586,0.643
+RDRP_I33A0_Li_2023,0.991,0.734,0.821,1.014,0.74,0.791,0.711,1.001,0.673,0.769
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.906,0.915,0.915,0.91,0.933,0.948,0.922,1.004,0.857,0.927
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.76,0.795,0.915,0.881,0.884,0.864,0.858,1.005,0.608,0.694
+ACE2_HUMAN_Chan_2020,0.963,0.946,0.992,1.011,0.983,2.453,1.906,1.011,0.886,3.812
+PPM1D_HUMAN_Miller_2022,0.66,0.749,0.765,0.666,0.762,0.766,0.716,1.01,0.639,0.695
+HMDH_HUMAN_Jiang_2019,0.672,0.929,0.872,0.89,0.942,0.956,0.913,1.009,0.562,0.647
+CBS_HUMAN_Sun_2020,0.859,0.867,0.865,0.893,0.874,0.926,0.89,1.002,0.79,0.862
+NPC1_HUMAN_Erwood_2022_HEK293T,0.872,0.463,0.648,0.896,0.504,0.647,0.569,1.029,0.482,0.64
+NPC1_HUMAN_Erwood_2022_RPE1,1.609,0.661,0.759,1.109,0.687,0.606,0.515,1.248,0.767,0.815
+BRCA2_HUMAN_Erwood_2022_HEK293T,1.238,1.354,1.174,1.399,1.484,1.192,1.208,1.375,1.404,1.179
+SHOC2_HUMAN_Kwon_2022,0.819,0.844,0.872,0.843,0.844,0.87,0.855,1.024,0.746,0.829
+RAF1_HUMAN_Zinkus-Boltz_2019,1.175,0.797,0.913,0.87,0.838,0.829,0.818,1.103,0.779,1.179
+OPSD_HUMAN_Wan_2019,0.702,0.522,0.736,0.736,0.659,0.796,0.776,1.078,0.521,0.762
+CAR11_HUMAN_Meitlis_2020_lof,0.686,0.91,0.97,0.794,0.962,0.982,0.97,1.02,0.799,0.833
+CAR11_HUMAN_Meitlis_2020_gof,0.977,1.065,1.104,1.035,1.076,1.1,1.102,1.081,1.137,1.075
+ERBB2_HUMAN_Elazar_2016,0.876,0.833,1.261,0.985,1.138,0.917,1.063,1.118,0.993,0.739
+GLPA_HUMAN_Elazar_2016,0.991,0.966,1.016,0.9,0.946,0.929,0.942,1.05,0.972,0.882
+LYAM1_HUMAN_Elazar_2016,0.833,0.793,1.247,0.981,0.963,1.017,1.038,1.116,0.93,0.881
+GDIA_HUMAN_Silverstein_2021,0.694,0.689,0.808,0.768,0.802,0.737,0.745,1.022,0.863,0.784
+PPARG_HUMAN_Majithia_2016,0.426,0.573,0.603,0.556,0.641,0.662,0.573,1.029,0.303,0.484
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.988,0.932,0.989,1.026,0.982,0.997,0.986,1.02,0.819,0.894
diff --git a/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_modulo_5.csv b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_modulo_5.csv
new file mode 100644
index 0000000..96dadc7
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_modulo_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,0.995,0.94,1.071,1.083,0.948,1.003,0.992,1.03,0.935,0.979
+A0A192B1T2_9HIV1_Haddox_2018,0.77,0.717,0.815,0.771,0.742,0.741,0.73,1.007,0.624,0.669
+A0A1I9GEU1_NEIME_Kennouche_2019,1.119,1.105,1.012,1.018,1.013,1.024,1.02,1.01,1.245,1.389
+A0A2Z5U3Z0_9INFA_Doud_2016,0.666,0.796,0.742,0.743,0.724,0.724,0.693,1.001,0.891,0.629
+A0A2Z5U3Z0_9INFA_Wu_2014,0.984,1.278,0.984,0.99,0.978,0.974,0.971,1.045,1.06,0.966
+A4D664_9INFA_Soh_2019,1.045,0.979,0.964,1.062,0.981,0.96,0.93,1.048,0.937,0.886
+A4GRB6_PSEAI_Chen_2020,0.458,0.39,0.573,0.575,0.546,0.598,0.546,1.003,0.353,0.528
+AACC1_PSEAI_Dandage_2018,0.849,0.722,0.852,0.78,0.788,0.825,0.791,1.014,0.889,0.915
+ADRB2_HUMAN_Jones_2020,0.75,0.801,0.783,0.761,0.807,0.782,0.766,1.002,0.756,0.737
+AMIE_PSEAE_Wrenbeck_2017,0.578,0.567,0.789,0.665,0.719,0.839,0.801,1.003,0.58,0.63
+LGK_LIPST_Klesmith_2015,0.688,0.665,0.802,0.759,0.734,0.848,0.778,1.001,0.554,0.717
+BLAT_ECOLX_Deng_2012,0.764,0.688,0.744,0.761,0.747,0.82,0.742,1.009,0.839,0.864
+BLAT_ECOLX_Firnberg_2014,0.482,0.397,0.502,0.595,0.56,0.693,0.528,1.005,0.391,0.646
+BLAT_ECOLX_Jacquier_2013,0.444,0.406,0.534,0.574,0.56,0.59,0.509,1.016,0.412,0.536
+BLAT_ECOLX_Stiffler_2015,0.456,0.377,0.515,0.579,0.557,0.669,0.525,1.006,0.409,0.581
+BRCA1_HUMAN_Findlay_2018,0.699,0.778,0.877,0.739,0.782,0.723,0.725,1.014,0.747,0.747
+C6KNH7_9INFA_Lee_2018,0.726,0.811,0.87,0.825,0.838,0.815,0.802,1.002,0.628,0.653
+CALM1_HUMAN_Weile_2017,0.985,0.932,0.949,0.954,0.941,0.936,0.941,1.01,1.24,1.152
+CAPSD_AAV2S_Sinai_2021,0.733,0.553,0.701,1.059,0.706,0.799,0.758,1.059,0.598,0.604
+CCDB_ECOLI_Adkar_2012,0.817,0.581,0.738,0.961,0.829,0.791,0.722,1.042,0.638,0.803
+CCDB_ECOLI_Tripathi_2016,0.762,0.541,0.753,0.873,0.774,0.686,0.69,1.044,0.658,0.819
+CP2C9_HUMAN_Amorosi_2021_abundance,0.557,0.636,0.652,0.654,0.661,0.599,0.604,1.004,0.446,0.459
+CP2C9_HUMAN_Amorosi_2021_activity,0.548,0.574,0.615,0.602,0.657,0.629,0.614,1.003,0.448,0.5
+DLG4_HUMAN_Faure_2021,0.581,0.73,0.852,0.842,0.753,0.61,0.657,1.006,0.78,0.562
+DLG4_RAT_McLaughlin_2012,0.632,0.766,0.833,0.737,0.82,0.805,0.773,1.044,0.667,0.68
+DYR_ECOLI_Thompson_2019,0.769,0.659,0.768,0.804,0.765,0.845,0.774,1.009,0.744,0.981
+ENV_HV1B9_DuenasDecamp_2016,0.953,0.937,0.931,0.869,0.932,0.871,0.883,1.123,1.106,0.983
+ENV_HV1BR_Haddox_2016,0.963,0.95,0.958,0.968,0.959,0.949,0.948,1.001,0.954,0.944
+ESTA_BACSU_Nutschel_2020,0.9,0.736,0.844,0.941,0.849,0.95,0.878,1.018,0.815,0.998
+F7YBW8_MESOW_Aakre_2015,1.299,2.475,1.794,1.967,2.13,1.386,1.521,3.314,3.154,0.576
+GAL4_YEAST_Kitzman_2015,0.87,0.664,0.762,0.807,0.728,0.814,0.87,1.012,0.718,0.865
+GCN4_YEAST_Staller_2018,3.733,1.977,2.366,2.706,2.692,2.897,2.621,2.716,2.223,2.384
+GFP_AEQVI_Sarkisyan_2016,0.831,0.825,0.917,1.045,0.926,0.907,0.862,1.045,0.777,0.756
+GRB2_HUMAN_Faure_2021,0.52,0.663,0.717,0.683,0.662,0.792,0.683,1.034,0.682,0.855
+HIS7_YEAST_Pokusaeva_2019,1.486,2.01,1.29,1.249,1.242,1.193,1.28,1.335,1.245,1.602
+HSP82_YEAST_Flynn_2019,0.775,0.793,0.798,0.805,0.796,0.764,0.767,1.003,0.596,0.637
+HSP82_YEAST_Mishra_2016,0.563,0.767,0.705,0.698,0.786,0.648,0.647,1.023,0.637,0.56
+I6TAH8_I68A0_Doud_2015,1.037,0.876,0.951,1.013,0.891,0.886,0.852,1.002,0.876,0.864
+IF1_ECOLI_Kelsic_2016,0.731,0.6,0.667,0.711,0.856,0.632,0.672,1.041,0.797,0.645
+KCNH2_HUMAN_Kozek_2020,0.885,1.028,1.044,1.109,1.039,0.763,0.796,1.063,0.989,1.279
+KKA2_KLEPN_Melnikov_2014,0.632,0.561,0.687,0.704,0.705,0.73,0.682,1.003,0.634,0.688
+MK01_HUMAN_Brenan_2016,0.934,0.918,0.972,0.998,1.003,1.008,0.987,1.002,0.848,0.93
+MSH2_HUMAN_Jia_2020,0.792,0.767,0.805,0.797,0.766,0.826,0.78,1.005,0.603,0.715
+MTH3_HAEAE_RockahShmuel_2015,0.517,0.503,0.503,0.546,0.558,0.687,0.548,1.001,0.609,0.666
+NCAP_I34A1_Doud_2015,1.08,0.88,0.894,1.02,0.88,0.842,0.833,1.002,0.842,0.837
+NRAM_I33A0_Jiang_2016,1.09,0.432,0.759,1.022,0.592,0.7,0.618,1.014,0.423,0.487
+NUD15_HUMAN_Suiter_2020,0.605,0.472,0.659,0.625,0.543,0.655,0.611,1.005,0.454,0.624
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.598,0.67,0.89,0.705,0.863,0.762,0.774,1.003,0.541,0.624
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.653,0.616,0.954,0.805,0.94,0.867,0.883,1.001,0.552,0.612
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.437,0.494,0.908,0.721,0.92,0.819,0.821,1.001,0.321,0.43
+P53_HUMAN_Kotler_2018,0.673,0.641,0.722,0.711,0.698,0.726,0.703,1.003,0.74,0.732
+P84126_THETH_Chan_2017,0.608,0.444,0.586,0.68,0.567,0.719,0.655,1.007,0.521,0.753
+PA_I34A1_Wu_2015,1.042,1.502,0.855,1.032,0.977,0.841,0.822,1.017,1.002,0.826
+PABP_YEAST_Melamed_2013,0.474,0.477,0.75,0.583,0.582,0.567,0.574,1.02,0.493,0.475
+POLG_CXB3N_Mattenberger_2021,0.959,0.708,0.808,1.071,0.708,0.839,0.774,1.006,0.602,0.78
+POLG_HCVJF_Qi_2014,0.71,0.523,0.838,0.659,0.663,0.748,0.783,1.03,0.598,0.786
+PTEN_HUMAN_Matreyek_2021,0.729,0.716,0.854,0.846,0.823,0.91,0.832,1.002,0.61,0.699
+PTEN_HUMAN_Mighell_2018,0.641,0.662,0.68,0.743,0.782,0.813,0.669,1.007,0.533,0.699
+Q2N0S5_9HIV1_Haddox_2018,0.742,0.73,0.85,0.75,0.744,0.754,0.745,1.002,0.661,0.614
+Q59976_STRSQ_Romero_2015,0.675,0.63,0.64,0.732,0.644,0.647,0.611,1.006,0.603,0.633
+R1AB_SARS2_Flynn_2022,0.905,0.961,0.961,1.012,1.012,0.857,0.708,1.003,1.033,0.68
+RASH_HUMAN_Bandaru_2017,0.633,0.613,0.787,0.879,0.833,0.784,0.764,1.008,0.66,0.646
+REV_HV1H2_Fernandes_2016,0.854,0.811,0.947,0.945,0.933,0.942,0.946,1.01,0.978,0.955
+RL40A_YEAST_Mavor_2016,0.735,0.727,0.869,0.942,0.872,0.867,0.861,1.015,0.826,0.963
+RL40A_YEAST_Roscoe_2013,0.57,0.529,0.851,0.947,0.877,0.837,0.839,1.025,0.622,0.64
+RL40A_YEAST_Roscoe_2014,0.91,0.797,0.889,0.969,0.9,0.86,0.883,1.039,0.962,0.911
+SC6A4_HUMAN_Young_2021,0.729,0.711,0.835,0.752,0.711,0.783,0.751,1.002,0.598,0.664
+SCN5A_HUMAN_Glazer_2019,1.198,1.373,1.021,1.0,1.014,1.065,1.031,1.024,1.254,1.549
+SPG1_STRSG_Olson_2014,0.536,1.364,1.029,0.95,0.96,0.921,0.913,1.027,1.107,0.703
+SPIKE_SARS2_Starr_2020_binding,0.968,1.078,1.035,1.049,0.887,0.965,0.953,1.015,1.085,0.89
+SPIKE_SARS2_Starr_2020_expression,0.959,0.703,0.94,1.034,0.704,0.885,0.801,1.004,0.664,0.717
+SRC_HUMAN_Ahler_2019,0.724,0.902,0.877,0.784,0.922,0.823,0.83,1.007,0.922,0.812
+SUMO1_HUMAN_Weile_2017,0.803,0.717,0.838,0.804,0.771,0.846,0.8,1.017,0.834,0.954
+SYUA_HUMAN_Newberry_2020,0.316,0.34,0.97,0.839,0.963,0.887,0.905,1.018,0.324,0.339
+TADBP_HUMAN_Bolognesi_2019,0.32,0.528,1.023,1.023,1.033,1.012,1.018,1.012,0.324,0.395
+TAT_HV1BR_Fernandes_2016,0.838,0.763,0.955,0.899,0.91,0.965,0.95,1.02,0.816,0.959
+TPK1_HUMAN_Weile_2017,0.922,0.897,0.964,0.948,0.94,0.94,0.941,1.002,1.077,1.107
+TPMT_HUMAN_Matreyek_2018,0.642,0.673,0.765,0.739,0.787,0.802,0.767,1.003,0.633,0.709
+TPOR_HUMAN_Bridgford_2020,0.854,0.892,1.017,0.896,0.895,0.927,0.924,1.058,0.986,0.844
+TRPC_SACS2_Chan_2017,0.597,0.508,0.661,0.65,0.576,0.693,0.646,1.019,0.583,0.759
+TRPC_THEMA_Chan_2017,0.821,0.723,0.83,0.78,0.787,0.836,0.814,1.028,0.727,0.895
+UBC9_HUMAN_Weile_2017,0.838,0.72,0.792,0.848,0.8,0.839,0.784,1.007,0.82,0.897
+UBE4B_MOUSE_Starita_2013,0.779,0.907,0.822,0.77,0.92,0.877,0.801,1.012,0.903,0.854
+VKOR1_HUMAN_Chiasson_2020_abundance,0.637,0.536,0.843,0.822,0.751,0.682,0.707,1.004,0.642,0.633
+VKOR1_HUMAN_Chiasson_2020_activity,0.984,0.782,0.833,0.808,0.823,0.825,0.796,1.018,1.037,1.065
+YAP1_HUMAN_Araya_2012,0.801,0.834,0.741,0.975,1.075,0.895,0.794,1.041,0.934,0.837
+Q8WTC7_9CNID_Somermeyer_2022,1.03,0.778,1.012,1.039,0.934,0.972,0.955,1.027,0.878,1.102
+D7PM05_CLYGR_Somermeyer_2022,0.974,0.531,0.812,1.026,0.709,0.889,0.805,1.012,0.544,0.893
+Q6WV13_9MAXI_Somermeyer_2022,1.139,0.735,0.99,1.039,0.905,0.996,0.966,1.034,0.634,1.029
+SPG1_STRSG_Wu_2016,5.213,6.822,5.62,5.095,6.821,5.72,6.056,5.653,5.922,5.075
+CAS9_STRP1_Spencer_2017_positive,1.002,0.972,0.998,1.016,0.983,0.992,0.988,1.002,0.972,0.982
+A0A247D711_LISMN_Stadelmann_2021,1.125,0.732,1.038,1.053,0.954,0.994,1.065,1.041,0.813,1.023
+HXK4_HUMAN_Gersing_2022_activity,0.809,0.793,0.848,0.866,0.818,0.931,0.871,1.005,0.714,0.803
+ANCSZ_Hobbs_2022,0.613,0.705,1.075,0.703,0.763,0.674,0.68,1.009,0.655,0.609
+PAI1_HUMAN_Huttinger_2021,0.817,0.751,0.836,0.824,0.805,0.834,0.815,1.001,0.672,0.789
+HXK4_HUMAN_Gersing_2023_abundance,0.769,0.8,0.867,0.89,0.828,0.973,0.917,1.002,0.61,0.724
+PHOT_CHLRE_Chen_2023,0.493,0.502,0.652,0.616,0.648,0.717,0.73,1.014,0.484,0.698
+S22A1_HUMAN_Yee_2023_abundance,0.627,0.641,0.706,0.661,0.649,0.608,0.621,1.002,0.455,0.511
+S22A1_HUMAN_Yee_2023_activity,0.626,0.68,0.689,0.697,0.683,0.626,0.633,1.002,0.472,0.534
+KCNE1_HUMAN_Muhammad_2023_expression,0.633,0.848,1.014,0.998,1.017,0.993,0.994,1.01,0.683,0.642
+KCNE1_HUMAN_Muhammad_2023_function,0.498,0.464,0.771,0.682,0.744,0.574,0.551,1.005,0.475,0.49
+RNC_ECOLI_Weeks_2023,0.464,0.423,0.705,0.651,0.661,0.798,0.671,1.016,0.362,0.54
+HEM3_HUMAN_Loggerenberg_2023,0.815,0.79,0.838,0.855,0.807,0.849,0.82,1.001,0.764,0.797
+OXDA_RHOTO_Vanella_2023_expression,0.796,0.775,0.924,0.912,0.912,0.912,0.908,1.0,0.749,0.825
+OXDA_RHOTO_Vanella_2023_activity,0.867,0.837,0.887,0.907,0.905,0.947,0.915,1.003,0.834,0.919
+PRKN_HUMAN_Clausen_2023,0.52,0.761,0.693,0.632,0.762,0.603,0.568,1.005,0.536,0.532
+DYR_ECOLI_Nguyen_2023,0.597,0.543,0.725,0.632,0.743,0.843,0.766,1.023,0.579,0.833
+ENVZ_ECOLI_Ghose_2023,1.167,0.962,0.969,0.914,0.922,0.873,0.932,1.03,1.174,1.203
+MLAC_ECOLI_MacRae_2023,0.66,0.674,0.837,0.848,0.83,0.882,0.828,1.008,0.635,0.723
+Q837P5_ENTFA_Meier_2023,1.039,0.849,0.819,0.944,0.913,0.797,0.821,1.046,0.763,0.719
+Q837P4_ENTFA_Meier_2023,0.545,0.513,0.659,0.662,0.66,0.701,0.62,1.05,0.518,0.593
+CCR5_HUMAN_Gill_2023,0.84,0.845,0.922,0.882,0.87,0.848,0.888,1.005,0.807,0.803
+Q53Z42_HUMAN_McShan_2019_expression,0.723,0.638,0.739,0.794,0.697,0.743,0.707,1.007,0.726,0.759
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.654,0.7,0.885,0.906,0.894,0.911,0.882,1.013,0.666,0.656
+RPC1_LAMBD_Li_2019_low-expression,0.866,0.636,0.755,0.799,0.806,0.78,0.738,1.031,0.832,0.945
+RPC1_LAMBD_Li_2019_high-expression,0.729,0.63,0.715,0.7,0.718,0.723,0.676,1.046,0.765,0.788
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,3.089,1.572,1.383,2.045,1.847,1.619,1.597,1.812,1.714,1.859
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.954,0.915,0.978,0.975,0.942,0.975,0.969,1.083,0.911,0.912
+A4_HUMAN_Seuma_2022,0.717,0.815,0.821,0.932,0.898,0.764,0.788,1.023,0.732,0.72
+SRC_HUMAN_Nguyen_2022,0.8,0.829,0.906,0.852,0.864,0.891,0.894,1.005,0.913,0.911
+F7YBW7_MESOW_Ding_2023,1.933,1.663,2.031,1.937,1.518,1.851,1.946,2.402,2.191,2.022
+RASK_HUMAN_Weng_2022_abundance,0.691,0.613,0.98,1.009,0.979,0.9,0.952,1.007,0.619,0.62
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.578,0.527,0.761,0.82,0.67,0.79,0.759,1.033,0.491,0.535
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.849,0.867,0.942,0.91,0.915,0.931,0.93,1.014,1.031,0.933
+AICDA_HUMAN_Gajula_2014_3cycles,3.116,3.035,2.575,2.596,2.518,2.549,2.543,2.617,2.889,3.032
+OTC_HUMAN_Lo_2023,0.544,0.444,0.758,0.699,0.697,0.758,0.682,1.005,0.417,0.724
+CASP3_HUMAN_Roychowdhury_2020,0.612,0.545,0.634,0.653,0.588,0.886,0.641,1.005,0.684,1.331
+CASP7_HUMAN_Roychowdhury_2020,0.675,0.617,0.665,0.675,0.649,0.762,0.685,1.004,0.7,0.784
+SERC_HUMAN_Xie_2023,0.549,0.56,0.657,0.629,0.645,0.602,0.647,1.012,0.52,0.544
+MTHR_HUMAN_Weile_2021,0.658,0.895,0.908,0.879,0.898,0.912,0.896,1.001,0.512,0.608
+AMFR_HUMAN_Rocklin_2023_4G3O,0.727,0.453,0.785,0.999,0.796,0.872,0.793,1.034,0.419,0.653
+ARGR_ECOLI_Rocklin_2023_1AOY,0.557,0.385,0.778,0.9,0.781,0.735,0.702,1.033,0.394,0.48
+BBC1_YEAST_Rocklin_2023_1TG0,0.64,0.561,0.838,0.78,0.873,0.814,0.805,1.038,0.545,0.707
+BCHB_CHLTE_Rocklin_2023_2KRU,0.712,0.563,0.964,0.985,0.844,0.929,0.954,1.084,0.51,0.837
+CATR_CHLRE_Rocklin_2023_2AMI,0.362,0.426,0.819,0.889,0.969,0.779,0.791,1.02,0.333,0.329
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.398,0.365,0.803,0.724,0.713,0.694,0.736,1.029,0.311,0.515
+CBX4_HUMAN_Rocklin_2023_2K28,0.631,0.409,0.78,0.803,0.639,0.734,0.667,1.039,0.367,0.692
+CSN4_MOUSE_Rocklin_2023_1UFM,0.438,0.374,0.851,0.666,0.828,0.705,0.747,1.025,0.304,0.505
+CUE1_YEAST_Rocklin_2023_2MYX,0.59,0.437,0.79,0.884,0.8,0.887,0.794,1.053,0.436,0.708
+DN7A_SACS2_Rocklin_2023_1JIC,0.76,0.466,1.036,1.064,0.992,1.038,1.039,1.095,0.433,0.68
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.323,0.375,0.801,0.843,0.911,0.616,0.733,1.035,0.289,0.344
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.554,0.52,0.949,0.684,0.699,0.752,0.806,1.029,0.501,0.809
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.315,0.262,0.709,0.529,0.615,0.662,0.656,1.04,0.222,0.509
+FECA_ECOLI_Rocklin_2023_2D1U,0.877,0.581,0.864,0.985,0.859,0.964,0.924,1.027,0.553,0.891
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.885,0.539,0.835,1.022,0.939,0.939,0.878,1.057,0.49,0.642
+HCP_LAMBD_Rocklin_2023_2L6Q,0.363,0.37,1.015,0.838,0.881,0.701,0.724,1.043,0.349,0.431
+HECD1_HUMAN_Rocklin_2023_3DKM,0.664,0.333,0.675,0.925,0.607,0.744,0.675,1.024,0.344,0.565
+ILF3_HUMAN_Rocklin_2023_2L33,0.396,0.325,0.861,0.849,0.798,0.814,0.827,1.095,0.311,0.446
+ISDH_STAAW_Rocklin_2023_2LHR,0.654,0.786,1.062,0.96,1.012,1.047,1.058,1.035,0.664,0.976
+MAFG_MOUSE_Rocklin_2023_1K1V,0.667,0.668,0.978,0.927,0.969,0.791,0.838,1.058,0.594,0.589
+MBD11_ARATH_Rocklin_2023_6ACV,0.654,0.453,0.799,1.01,0.802,0.854,0.78,1.026,0.364,0.664
+MYO3_YEAST_Rocklin_2023_2BTT,0.675,0.619,0.916,0.816,0.859,0.918,0.889,1.045,0.629,0.973
+NKX31_HUMAN_Rocklin_2023_2L9R,0.438,0.337,0.67,0.756,0.637,0.613,0.603,1.029,0.292,0.424
+NUSA_ECOLI_Rocklin_2023_1WCL,0.346,0.279,0.89,0.855,0.839,0.759,0.748,1.028,0.245,0.385
+NUSG_MYCTU_Rocklin_2023_2MI6,0.415,0.273,0.787,0.958,0.767,0.783,0.792,1.076,0.219,0.398
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.322,0.385,0.637,0.549,0.638,0.647,0.615,1.08,0.335,0.592
+ODP2_GEOSE_Rocklin_2023_1W4G,1.278,0.637,0.921,1.048,0.803,0.885,0.885,1.06,0.596,1.012
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.488,1.064,1.128,0.655,1.109,1.026,1.094,1.125,0.796,0.965
+PIN1_HUMAN_Rocklin_2023_1I6C,0.548,0.374,0.615,0.691,0.535,0.508,0.516,1.03,0.382,0.633
+PITX2_HUMAN_Rocklin_2023_2L7M,0.685,0.503,0.765,0.789,0.684,0.684,0.718,1.027,0.482,0.686
+PKN1_HUMAN_Rocklin_2023_1URF,0.46,0.422,0.956,0.897,0.939,0.906,0.91,1.03,0.372,0.47
+POLG_PESV_Rocklin_2023_2MXD,0.644,0.458,1.013,1.045,0.822,1.016,0.997,1.039,0.426,0.633
+PR40A_HUMAN_Rocklin_2023_1UZC,0.693,0.33,0.827,0.985,0.654,0.786,0.734,1.038,0.349,0.652
+PSAE_SYNP2_Rocklin_2023_1PSE,0.525,0.69,0.957,0.865,1.01,0.99,0.963,1.063,0.631,0.748
+RAD_ANTMA_Rocklin_2023_2CJJ,0.644,0.611,1.098,0.944,0.803,0.926,0.974,1.05,0.608,1.012
+RBP1_HUMAN_Rocklin_2023_2KWH,0.297,0.473,0.923,0.823,1.01,0.877,0.906,1.039,0.395,0.477
+RCD1_ARATH_Rocklin_2023_5OAO,0.404,0.5,0.948,0.993,0.929,0.98,0.946,1.022,0.471,0.694
+RCRO_LAMBD_Rocklin_2023_1ORC,0.591,0.544,0.858,0.945,0.812,0.852,0.803,1.116,0.525,0.645
+RD23A_HUMAN_Rocklin_2023_1IFY,0.311,0.326,0.826,0.646,0.684,0.701,0.735,1.066,0.252,0.483
+RFAH_ECOLI_Rocklin_2023_2LCL,0.769,0.575,0.929,0.947,0.931,0.984,0.959,1.027,0.62,0.793
+RL20_AQUAE_Rocklin_2023_1GYZ,0.588,0.614,0.97,0.915,0.996,0.913,0.94,1.1,0.625,0.63
+RPC1_BP434_Rocklin_2023_1R69,0.324,0.325,0.703,0.498,0.637,0.556,0.741,1.039,0.329,0.476
+RS15_GEOSE_Rocklin_2023_1A32,0.325,0.399,0.908,0.882,0.845,0.842,0.901,1.042,0.423,0.518
+SAV1_MOUSE_Rocklin_2023_2YSB,0.761,0.719,0.874,0.714,0.761,0.739,0.799,1.054,0.572,0.943
+SBI_STAAM_Rocklin_2023_2JVG,0.735,0.473,0.897,0.979,0.727,0.913,0.868,1.012,0.453,0.721
+SCIN_STAAR_Rocklin_2023_2QFF,0.507,0.425,0.949,0.91,0.938,1.003,0.95,1.03,0.399,0.703
+SDA_BACSU_Rocklin_2023_1PV0,0.531,0.355,0.784,0.867,0.716,0.805,0.823,1.071,0.334,0.497
+SOX30_HUMAN_Rocklin_2023_7JJK,0.454,0.629,0.992,0.87,0.913,0.986,0.952,1.05,0.506,0.788
+SPA_STAAU_Rocklin_2023_1LP1,0.79,0.438,0.943,1.034,0.802,0.939,0.905,1.029,0.451,0.595
+SPG2_STRSG_Rocklin_2023_5UBS,0.892,0.531,0.992,1.035,1.01,1.024,0.958,1.032,0.542,0.738
+SPTN1_CHICK_Rocklin_2023_1TUD,0.526,0.574,0.819,0.716,0.806,0.777,0.779,1.02,0.51,0.652
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.516,0.349,0.682,0.92,0.718,0.726,0.672,1.092,0.272,0.469
+SR43C_ARATH_Rocklin_2023_2N88,0.461,0.51,0.703,0.746,0.733,0.714,0.677,1.031,0.409,0.562
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.68,0.451,0.727,0.758,0.61,0.699,0.663,1.04,0.408,0.702
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.299,0.476,0.847,0.519,0.607,0.586,0.668,1.083,0.392,0.398
+THO1_YEAST_Rocklin_2023_2WQG,0.746,0.574,0.974,0.923,0.903,0.979,0.964,1.024,0.532,0.789
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.397,0.408,0.715,0.628,0.772,0.666,0.673,1.028,0.367,0.504
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.563,0.455,0.778,0.715,0.73,0.858,0.782,1.029,0.476,0.874
+UBR5_HUMAN_Rocklin_2023_1I2T,0.423,0.321,0.881,0.895,0.865,0.727,0.745,1.045,0.343,0.417
+VG08_BPP22_Rocklin_2023_2GP8,0.332,0.497,0.945,0.716,0.823,0.756,0.816,1.028,0.556,0.564
+VILI_CHICK_Rocklin_2023_1YU5,0.526,0.583,0.752,0.722,0.702,0.871,0.741,1.041,0.648,0.809
+VRPI_BPT7_Rocklin_2023_2WNM,0.415,0.589,0.992,0.855,0.94,1.03,1.029,1.019,0.551,0.702
+YAIA_ECOLI_Rocklin_2023_2KVT,0.637,0.562,0.988,0.95,0.954,0.942,0.911,1.025,0.63,0.856
+YNZC_BACSU_Rocklin_2023_2JVD,0.508,0.713,0.961,0.787,0.931,0.95,0.964,1.076,0.526,0.958
+POLG_DEN26_Suphatrakul_2023,0.954,0.717,0.918,1.027,0.78,0.975,0.951,1.003,0.508,0.695
+MET_HUMAN_Estevam_2023,0.575,0.549,0.641,0.626,0.585,0.666,0.626,1.006,0.556,0.618
+RDRP_I33A0_Li_2023,0.983,0.735,0.822,1.015,0.736,0.791,0.712,1.002,0.634,0.755
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.906,0.912,0.915,0.909,0.931,0.946,0.921,1.001,0.849,0.911
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.743,0.743,0.911,0.878,0.88,0.864,0.856,1.003,0.608,0.674
+ACE2_HUMAN_Chan_2020,0.98,0.938,0.985,0.999,0.971,1.132,1.112,1.014,0.989,1.111
+PPM1D_HUMAN_Miller_2022,0.658,0.75,0.759,0.665,0.757,0.756,0.71,1.002,0.617,0.674
+HMDH_HUMAN_Jiang_2019,0.655,0.917,0.869,0.886,0.937,0.952,0.908,1.001,0.556,0.639
+CBS_HUMAN_Sun_2020,0.863,0.866,0.865,0.892,0.873,0.925,0.889,1.001,0.792,0.859
+NPC1_HUMAN_Erwood_2022_HEK293T,0.819,0.46,0.628,0.868,0.484,0.64,0.559,1.009,0.496,0.616
+NPC1_HUMAN_Erwood_2022_RPE1,1.536,0.459,0.687,1.015,0.577,0.543,0.479,1.225,0.566,0.635
+BRCA2_HUMAN_Erwood_2022_HEK293T,1.079,1.195,1.061,1.289,1.306,1.088,1.097,1.236,1.292,1.001
+SHOC2_HUMAN_Kwon_2022,0.806,0.838,0.864,0.837,0.839,0.864,0.85,1.0,0.715,0.773
+RAF1_HUMAN_Zinkus-Boltz_2019,0.882,0.734,0.914,0.872,0.82,0.819,0.82,1.09,0.787,1.251
+OPSD_HUMAN_Wan_2019,0.64,0.499,0.708,0.734,0.65,0.779,0.74,1.059,0.511,0.684
+CAR11_HUMAN_Meitlis_2020_lof,0.67,0.893,0.968,0.786,0.953,0.969,0.961,1.012,0.756,0.767
+CAR11_HUMAN_Meitlis_2020_gof,0.902,0.999,1.043,0.968,1.012,1.033,1.036,1.016,0.902,0.994
+ERBB2_HUMAN_Elazar_2016,0.716,0.668,1.167,0.958,1.048,0.893,0.994,1.119,0.849,0.6
+GLPA_HUMAN_Elazar_2016,0.91,0.904,1.02,0.903,0.94,0.931,0.946,1.053,0.93,0.718
+LYAM1_HUMAN_Elazar_2016,0.778,0.742,1.072,0.917,0.901,0.943,0.952,1.042,0.896,0.85
+GDIA_HUMAN_Silverstein_2021,0.674,0.68,0.802,0.762,0.795,0.736,0.74,1.015,0.741,0.809
+PPARG_HUMAN_Majithia_2016,0.382,0.566,0.594,0.546,0.63,0.64,0.557,1.002,0.29,0.428
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.946,0.923,0.984,1.015,0.972,0.988,0.978,1.01,0.834,0.816
diff --git a/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_random_5.csv b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_random_5.csv
new file mode 100644
index 0000000..dc4aed8
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/MSE/DMS_substitutions_MSE_DMS_level_fold_random_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,0.895,0.912,1.018,1.041,0.916,0.965,0.951,0.986,0.896,0.833
+A0A192B1T2_9HIV1_Haddox_2018,0.762,0.72,0.784,0.737,0.712,0.711,0.701,0.951,0.431,0.64
+A0A1I9GEU1_NEIME_Kennouche_2019,1.188,1.05,0.994,1.003,0.997,1.002,1.0,0.992,1.416,1.328
+A0A2Z5U3Z0_9INFA_Doud_2016,0.645,0.658,0.711,0.709,0.691,0.694,0.663,0.946,0.507,0.537
+A0A2Z5U3Z0_9INFA_Wu_2014,1.003,1.21,0.994,1.0,0.988,0.986,0.982,1.05,1.069,0.983
+A4D664_9INFA_Soh_2019,0.954,0.942,0.904,0.995,0.92,0.9,0.874,0.974,0.799,0.758
+A4GRB6_PSEAI_Chen_2020,0.346,0.356,0.504,0.502,0.478,0.528,0.479,0.87,0.156,0.327
+AACC1_PSEAI_Dandage_2018,0.676,0.624,0.792,0.726,0.73,0.771,0.742,0.889,0.745,0.704
+ADRB2_HUMAN_Jones_2020,0.738,0.795,0.771,0.75,0.795,0.77,0.754,0.968,0.709,0.706
+AMIE_PSEAE_Wrenbeck_2017,0.535,0.536,0.74,0.623,0.67,0.785,0.751,0.929,0.353,0.474
+LGK_LIPST_Klesmith_2015,0.673,0.666,0.776,0.731,0.708,0.809,0.748,0.958,0.409,0.64
+BLAT_ECOLX_Deng_2012,0.573,0.62,0.659,0.679,0.663,0.738,0.661,0.892,0.316,0.503
+BLAT_ECOLX_Firnberg_2014,0.374,0.355,0.452,0.53,0.496,0.623,0.476,0.885,0.199,0.387
+BLAT_ECOLX_Jacquier_2013,0.395,0.376,0.503,0.538,0.526,0.55,0.475,0.941,0.355,0.456
+BLAT_ECOLX_Stiffler_2015,0.335,0.334,0.462,0.514,0.491,0.597,0.47,0.887,0.16,0.303
+BRCA1_HUMAN_Findlay_2018,0.673,0.77,0.832,0.7,0.738,0.688,0.692,0.949,0.609,0.558
+C6KNH7_9INFA_Lee_2018,0.698,0.777,0.833,0.783,0.796,0.78,0.766,0.945,0.443,0.553
+CALM1_HUMAN_Weile_2017,0.924,0.904,0.93,0.93,0.92,0.919,0.924,0.991,1.22,1.066
+CAPSD_AAV2S_Sinai_2021,0.427,0.367,0.526,0.693,0.496,0.566,0.556,0.667,0.268,0.315
+CCDB_ECOLI_Adkar_2012,0.157,0.034,0.462,0.623,0.539,0.531,0.47,0.601,0.002,0.146
+CCDB_ECOLI_Tripathi_2016,0.39,0.37,0.644,0.722,0.658,0.594,0.603,0.862,0.4,0.417
+CP2C9_HUMAN_Amorosi_2021_abundance,0.516,0.642,0.614,0.615,0.626,0.562,0.567,0.947,0.306,0.386
+CP2C9_HUMAN_Amorosi_2021_activity,0.538,0.571,0.573,0.561,0.614,0.588,0.573,0.934,0.247,0.403
+DLG4_HUMAN_Faure_2021,0.449,0.67,0.684,0.673,0.609,0.508,0.534,0.805,0.381,0.355
+DLG4_RAT_McLaughlin_2012,0.513,0.743,0.667,0.599,0.658,0.639,0.624,0.801,0.344,0.329
+DYR_ECOLI_Thompson_2019,0.548,0.472,0.673,0.707,0.676,0.75,0.684,0.882,0.405,0.506
+ENV_HV1B9_DuenasDecamp_2016,0.649,0.688,0.716,0.753,0.751,0.691,0.69,0.83,0.644,0.568
+ENV_HV1BR_Haddox_2016,0.968,0.953,0.963,0.971,0.961,0.954,0.952,1.009,0.959,0.95
+ESTA_BACSU_Nutschel_2020,0.659,0.556,0.76,0.828,0.762,0.848,0.79,0.865,0.569,0.636
+F7YBW8_MESOW_Aakre_2015,0.315,0.488,0.721,0.454,0.591,0.707,0.716,0.912,0.45,0.468
+GAL4_YEAST_Kitzman_2015,0.567,0.6,0.667,0.66,0.613,0.669,0.701,0.763,0.642,0.597
+GCN4_YEAST_Staller_2018,3.17,2.366,3.346,2.995,3.571,3.765,3.627,3.513,3.214,2.582
+GFP_AEQVI_Sarkisyan_2016,0.721,0.768,0.889,1.017,0.893,0.875,0.831,1.0,0.761,0.637
+GRB2_HUMAN_Faure_2021,0.233,0.236,0.475,0.45,0.453,0.542,0.457,0.651,0.208,0.232
+HIS7_YEAST_Pokusaeva_2019,1.357,1.605,1.253,1.197,1.22,1.176,1.241,1.291,1.648,1.662
+HSP82_YEAST_Flynn_2019,0.779,0.788,0.772,0.78,0.771,0.743,0.744,0.963,0.498,0.554
+HSP82_YEAST_Mishra_2016,0.464,0.728,0.619,0.612,0.69,0.576,0.572,0.882,0.252,0.281
+I6TAH8_I68A0_Doud_2015,0.936,0.872,0.908,0.978,0.849,0.848,0.814,0.946,0.467,0.577
+IF1_ECOLI_Kelsic_2016,0.296,0.278,0.544,0.555,0.675,0.529,0.553,0.779,0.234,0.25
+KCNH2_HUMAN_Kozek_2020,0.491,0.486,0.735,0.748,0.651,0.585,0.595,0.852,0.458,0.487
+KKA2_KLEPN_Melnikov_2014,0.556,0.535,0.649,0.661,0.662,0.685,0.646,0.906,0.451,0.541
+MK01_HUMAN_Brenan_2016,0.909,0.895,0.956,0.984,0.988,0.99,0.972,0.974,0.745,0.857
+MSH2_HUMAN_Jia_2020,0.792,0.764,0.787,0.779,0.748,0.801,0.76,0.975,0.529,0.604
+MTH3_HAEAE_RockahShmuel_2015,0.507,0.493,0.485,0.523,0.533,0.658,0.529,0.939,0.498,0.617
+NCAP_I34A1_Doud_2015,0.95,0.877,0.869,0.99,0.852,0.818,0.809,0.967,0.631,0.663
+NRAM_I33A0_Jiang_2016,0.733,0.377,0.534,0.732,0.453,0.483,0.434,0.716,0.341,0.403
+NUD15_HUMAN_Suiter_2020,0.358,0.321,0.565,0.532,0.467,0.564,0.524,0.833,0.26,0.336
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.531,0.679,0.833,0.666,0.817,0.717,0.728,0.925,0.373,0.436
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.585,0.589,0.9,0.765,0.894,0.823,0.837,0.933,0.455,0.502
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.379,0.483,0.832,0.666,0.859,0.758,0.76,0.902,0.219,0.31
+P53_HUMAN_Kotler_2018,0.553,0.603,0.656,0.64,0.634,0.641,0.626,0.886,0.472,0.508
+P84126_THETH_Chan_2017,0.321,0.305,0.462,0.523,0.433,0.55,0.498,0.786,0.174,0.274
+PA_I34A1_Wu_2015,1.049,1.473,0.865,1.038,0.985,0.851,0.831,1.021,0.981,0.884
+PABP_YEAST_Melamed_2013,0.293,0.37,0.563,0.442,0.429,0.439,0.444,0.759,0.257,0.275
+POLG_CXB3N_Mattenberger_2021,0.846,0.706,0.793,1.041,0.695,0.825,0.758,0.974,0.524,0.589
+POLG_HCVJF_Qi_2014,0.39,0.373,0.695,0.569,0.563,0.644,0.665,0.847,0.315,0.344
+PTEN_HUMAN_Matreyek_2021,0.687,0.705,0.809,0.803,0.784,0.85,0.785,0.934,0.49,0.573
+PTEN_HUMAN_Mighell_2018,0.602,0.657,0.647,0.7,0.741,0.756,0.632,0.939,0.371,0.468
+Q2N0S5_9HIV1_Haddox_2018,0.742,0.72,0.818,0.719,0.711,0.724,0.716,0.951,0.598,0.564
+Q59976_STRSQ_Romero_2015,0.66,0.624,0.624,0.708,0.624,0.631,0.595,0.967,0.491,0.612
+R1AB_SARS2_Flynn_2022,0.721,0.878,0.873,0.927,0.925,0.774,0.643,0.887,0.223,0.332
+RASH_HUMAN_Bandaru_2017,0.398,0.471,0.674,0.757,0.712,0.679,0.651,0.859,0.309,0.324
+REV_HV1H2_Fernandes_2016,0.635,0.615,0.867,0.857,0.851,0.862,0.865,0.883,0.705,0.657
+RL40A_YEAST_Mavor_2016,0.497,0.474,0.72,0.775,0.711,0.715,0.712,0.784,0.48,0.537
+RL40A_YEAST_Roscoe_2013,0.365,0.338,0.67,0.743,0.684,0.653,0.655,0.762,0.31,0.349
+RL40A_YEAST_Roscoe_2014,0.376,0.348,0.695,0.755,0.703,0.672,0.695,0.758,0.311,0.322
+SC6A4_HUMAN_Young_2021,0.731,0.71,0.814,0.733,0.693,0.763,0.731,0.971,0.57,0.608
+SCN5A_HUMAN_Glazer_2019,1.211,1.282,1.035,1.011,1.049,1.07,1.046,1.021,1.508,1.596
+SPG1_STRSG_Olson_2014,0.212,0.867,0.694,0.648,0.636,0.631,0.621,0.632,0.567,0.215
+SPIKE_SARS2_Starr_2020_binding,0.791,0.532,0.902,0.92,0.767,0.841,0.826,0.862,0.335,0.277
+SPIKE_SARS2_Starr_2020_expression,0.793,0.699,0.818,0.907,0.596,0.776,0.697,0.848,0.178,0.272
+SRC_HUMAN_Ahler_2019,0.615,0.882,0.795,0.714,0.847,0.754,0.759,0.89,0.403,0.43
+SUMO1_HUMAN_Weile_2017,0.596,0.596,0.733,0.714,0.696,0.757,0.715,0.854,0.74,0.695
+SYUA_HUMAN_Newberry_2020,0.194,0.211,0.877,0.756,0.879,0.821,0.828,0.917,0.177,0.21
+TADBP_HUMAN_Bolognesi_2019,0.265,0.616,0.963,0.976,0.965,0.948,0.952,0.939,0.303,0.266
+TAT_HV1BR_Fernandes_2016,0.602,0.594,0.824,0.779,0.799,0.833,0.821,0.838,0.635,0.64
+TPK1_HUMAN_Weile_2017,0.897,0.856,0.945,0.925,0.922,0.926,0.925,0.982,1.024,0.998
+TPMT_HUMAN_Matreyek_2018,0.563,0.636,0.728,0.702,0.747,0.758,0.729,0.926,0.549,0.558
+TPOR_HUMAN_Bridgford_2020,0.502,0.539,0.724,0.673,0.684,0.693,0.692,0.818,0.5,0.433
+TRPC_SACS2_Chan_2017,0.393,0.361,0.525,0.529,0.461,0.556,0.519,0.807,0.287,0.383
+TRPC_THEMA_Chan_2017,0.501,0.473,0.661,0.629,0.629,0.682,0.655,0.828,0.339,0.454
+UBC9_HUMAN_Weile_2017,0.586,0.562,0.716,0.765,0.724,0.763,0.711,0.884,0.59,0.606
+UBE4B_MOUSE_Starita_2013,0.69,0.907,0.765,0.725,0.839,0.815,0.749,0.919,0.9,0.788
+VKOR1_HUMAN_Chiasson_2020_abundance,0.383,0.388,0.744,0.727,0.663,0.6,0.614,0.9,0.303,0.304
+VKOR1_HUMAN_Chiasson_2020_activity,0.794,0.765,0.8,0.778,0.794,0.787,0.766,0.931,1.096,1.04
+YAP1_HUMAN_Araya_2012,0.445,0.635,0.586,0.727,0.78,0.685,0.616,0.721,0.455,0.582
+Q8WTC7_9CNID_Somermeyer_2022,0.787,0.657,0.918,0.952,0.843,0.882,0.859,0.918,0.442,0.678
+D7PM05_CLYGR_Somermeyer_2022,0.846,0.483,0.747,0.95,0.636,0.815,0.733,0.914,0.349,0.638
+Q6WV13_9MAXI_Somermeyer_2022,0.858,0.647,0.881,0.927,0.797,0.891,0.86,0.89,0.372,0.611
+SPG1_STRSG_Wu_2016,0.366,0.503,0.724,0.636,0.696,0.68,0.728,0.736,0.451,0.506
+CAS9_STRP1_Spencer_2017_positive,1.001,0.972,0.995,1.015,0.981,0.99,0.987,1.0,0.972,0.976
+A0A247D711_LISMN_Stadelmann_2021,0.539,0.49,0.877,0.876,0.789,0.837,0.843,0.83,0.537,0.611
+HXK4_HUMAN_Gersing_2022_activity,0.791,0.789,0.816,0.836,0.791,0.894,0.838,0.955,0.61,0.649
+ANCSZ_Hobbs_2022,0.524,0.658,0.941,0.637,0.69,0.614,0.617,0.907,0.307,0.346
+PAI1_HUMAN_Huttinger_2021,0.818,0.745,0.815,0.806,0.792,0.814,0.797,0.967,0.607,0.758
+HXK4_HUMAN_Gersing_2023_abundance,0.77,0.795,0.835,0.858,0.796,0.93,0.878,0.959,0.507,0.584
+PHOT_CHLRE_Chen_2023,0.286,0.263,0.551,0.517,0.544,0.602,0.615,0.858,0.218,0.291
+S22A1_HUMAN_Yee_2023_abundance,0.632,0.645,0.673,0.633,0.617,0.58,0.592,0.958,0.311,0.417
+S22A1_HUMAN_Yee_2023_activity,0.627,0.674,0.66,0.666,0.652,0.6,0.607,0.955,0.342,0.45
+KCNE1_HUMAN_Muhammad_2023_expression,0.327,0.53,0.885,0.858,0.886,0.851,0.855,0.853,0.3,0.305
+KCNE1_HUMAN_Muhammad_2023_function,0.293,0.32,0.637,0.559,0.628,0.482,0.461,0.796,0.266,0.261
+RNC_ECOLI_Weeks_2023,0.376,0.363,0.629,0.579,0.586,0.672,0.59,0.896,0.233,0.292
+HEM3_HUMAN_Loggerenberg_2023,0.793,0.787,0.819,0.833,0.79,0.831,0.804,0.964,0.734,0.738
+OXDA_RHOTO_Vanella_2023_expression,0.754,0.726,0.887,0.875,0.875,0.875,0.872,0.949,0.607,0.727
+OXDA_RHOTO_Vanella_2023_activity,0.842,0.837,0.862,0.88,0.878,0.918,0.887,0.958,0.744,0.861
+PRKN_HUMAN_Clausen_2023,0.459,0.717,0.64,0.584,0.715,0.56,0.527,0.923,0.194,0.323
+DYR_ECOLI_Nguyen_2023,0.432,0.386,0.641,0.568,0.656,0.744,0.679,0.882,0.316,0.38
+ENVZ_ECOLI_Ghose_2023,0.666,0.648,0.834,0.812,0.812,0.781,0.813,0.862,0.732,0.675
+MLAC_ECOLI_MacRae_2023,0.46,0.463,0.76,0.763,0.746,0.795,0.75,0.888,0.327,0.404
+Q837P5_ENTFA_Meier_2023,0.406,0.461,0.572,0.639,0.623,0.537,0.566,0.709,0.28,0.324
+Q837P4_ENTFA_Meier_2023,0.368,0.379,0.475,0.507,0.506,0.521,0.47,0.648,0.335,0.349
+CCR5_HUMAN_Gill_2023,0.816,0.844,0.892,0.858,0.845,0.823,0.859,0.966,0.708,0.717
+Q53Z42_HUMAN_McShan_2019_expression,0.67,0.575,0.673,0.718,0.632,0.682,0.645,0.899,0.452,0.508
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.574,0.591,0.793,0.809,0.803,0.82,0.794,0.874,0.388,0.476
+RPC1_LAMBD_Li_2019_low-expression,0.591,0.484,0.653,0.684,0.658,0.658,0.632,0.847,0.523,0.614
+RPC1_LAMBD_Li_2019_high-expression,0.494,0.448,0.64,0.607,0.601,0.633,0.6,0.907,0.51,0.551
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.524,0.471,0.611,0.745,0.651,0.611,0.601,0.743,0.444,0.443
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.767,0.781,0.813,0.796,0.777,0.806,0.801,0.889,0.533,0.486
+A4_HUMAN_Seuma_2022,0.444,0.799,0.692,0.727,0.715,0.638,0.654,0.745,0.438,0.449
+SRC_HUMAN_Nguyen_2022,0.737,0.801,0.841,0.798,0.81,0.833,0.834,0.91,0.579,0.599
+F7YBW7_MESOW_Ding_2023,0.634,0.566,1.026,0.991,0.795,0.985,0.944,1.192,0.574,0.501
+RASK_HUMAN_Weng_2022_abundance,0.459,0.523,0.876,0.905,0.873,0.809,0.852,0.886,0.371,0.346
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.294,0.372,0.626,0.674,0.547,0.649,0.631,0.843,0.181,0.191
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.733,0.811,0.869,0.844,0.85,0.865,0.864,0.913,0.538,0.558
+AICDA_HUMAN_Gajula_2014_3cycles,1.651,1.442,1.518,1.601,1.556,1.575,1.534,1.662,1.335,1.723
+OTC_HUMAN_Lo_2023,0.479,0.407,0.696,0.638,0.634,0.689,0.621,0.94,0.272,0.481
+CASP3_HUMAN_Roychowdhury_2020,0.589,0.526,0.606,0.623,0.563,0.872,0.611,0.945,0.599,1.172
+CASP7_HUMAN_Roychowdhury_2020,0.665,0.606,0.643,0.652,0.627,0.737,0.665,0.945,0.702,0.667
+SERC_HUMAN_Xie_2023,0.493,0.542,0.625,0.598,0.615,0.575,0.614,0.96,0.339,0.447
+MTHR_HUMAN_Weile_2021,0.629,0.894,0.875,0.846,0.863,0.877,0.862,0.955,0.454,0.538
+AMFR_HUMAN_Rocklin_2023_4G3O,0.239,0.179,0.575,0.75,0.565,0.645,0.586,0.703,0.164,0.26
+ARGR_ECOLI_Rocklin_2023_1AOY,0.167,0.137,0.54,0.65,0.545,0.52,0.495,0.722,0.124,0.142
+BBC1_YEAST_Rocklin_2023_1TG0,0.242,0.202,0.613,0.587,0.645,0.631,0.606,0.733,0.176,0.268
+BCHB_CHLTE_Rocklin_2023_2KRU,0.165,0.141,0.634,0.685,0.548,0.622,0.632,0.706,0.118,0.204
+CATR_CHLRE_Rocklin_2023_2AMI,0.142,0.155,0.666,0.704,0.765,0.642,0.642,0.79,0.119,0.114
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.137,0.105,0.585,0.563,0.528,0.518,0.537,0.743,0.085,0.14
+CBX4_HUMAN_Rocklin_2023_2K28,0.216,0.178,0.514,0.562,0.46,0.54,0.471,0.703,0.174,0.227
+CSN4_MOUSE_Rocklin_2023_1UFM,0.119,0.122,0.588,0.457,0.585,0.498,0.513,0.714,0.089,0.128
+CUE1_YEAST_Rocklin_2023_2MYX,0.18,0.155,0.532,0.575,0.557,0.597,0.533,0.654,0.144,0.233
+DN7A_SACS2_Rocklin_2023_1JIC,0.215,0.171,0.772,0.774,0.716,0.785,0.783,0.758,0.157,0.245
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.118,0.108,0.516,0.55,0.613,0.413,0.468,0.673,0.093,0.099
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.237,0.212,0.67,0.527,0.536,0.578,0.592,0.726,0.188,0.256
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.097,0.093,0.453,0.35,0.409,0.444,0.421,0.683,0.083,0.139
+FECA_ECOLI_Rocklin_2023_2D1U,0.303,0.199,0.677,0.777,0.668,0.77,0.736,0.747,0.184,0.283
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.238,0.203,0.613,0.772,0.703,0.697,0.648,0.74,0.16,0.207
+HCP_LAMBD_Rocklin_2023_2L6Q,0.124,0.127,0.688,0.594,0.628,0.486,0.499,0.664,0.092,0.121
+HECD1_HUMAN_Rocklin_2023_3DKM,0.227,0.159,0.482,0.701,0.444,0.563,0.505,0.693,0.137,0.232
+ILF3_HUMAN_Rocklin_2023_2L33,0.17,0.148,0.609,0.626,0.583,0.609,0.598,0.794,0.118,0.161
+ISDH_STAAW_Rocklin_2023_2LHR,0.228,0.253,0.794,0.733,0.756,0.786,0.785,0.734,0.235,0.292
+MAFG_MOUSE_Rocklin_2023_1K1V,0.146,0.164,0.632,0.621,0.647,0.52,0.532,0.688,0.136,0.163
+MBD11_ARATH_Rocklin_2023_6ACV,0.175,0.143,0.505,0.666,0.527,0.581,0.507,0.642,0.13,0.236
+MYO3_YEAST_Rocklin_2023_2BTT,0.264,0.217,0.677,0.625,0.653,0.689,0.659,0.736,0.212,0.276
+NKX31_HUMAN_Rocklin_2023_2L9R,0.201,0.188,0.486,0.544,0.467,0.464,0.452,0.7,0.173,0.219
+NUSA_ECOLI_Rocklin_2023_1WCL,0.115,0.111,0.599,0.6,0.566,0.528,0.509,0.689,0.077,0.124
+NUSG_MYCTU_Rocklin_2023_2MI6,0.122,0.096,0.484,0.623,0.493,0.517,0.501,0.662,0.079,0.12
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.126,0.128,0.411,0.416,0.44,0.453,0.413,0.695,0.101,0.176
+ODP2_GEOSE_Rocklin_2023_1W4G,0.334,0.211,0.583,0.72,0.549,0.592,0.564,0.7,0.203,0.298
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.183,0.256,0.723,0.495,0.717,0.667,0.698,0.687,0.227,0.269
+PIN1_HUMAN_Rocklin_2023_1I6C,0.175,0.155,0.378,0.428,0.368,0.384,0.346,0.607,0.141,0.153
+PITX2_HUMAN_Rocklin_2023_2L7M,0.193,0.156,0.487,0.54,0.482,0.481,0.471,0.654,0.138,0.168
+PKN1_HUMAN_Rocklin_2023_1URF,0.151,0.157,0.759,0.715,0.754,0.718,0.717,0.81,0.115,0.162
+POLG_PESV_Rocklin_2023_2MXD,0.157,0.118,0.699,0.752,0.543,0.715,0.689,0.681,0.094,0.185
+PR40A_HUMAN_Rocklin_2023_1UZC,0.233,0.158,0.567,0.73,0.466,0.572,0.524,0.709,0.141,0.215
+PSAE_SYNP2_Rocklin_2023_1PSE,0.202,0.214,0.69,0.63,0.741,0.729,0.703,0.72,0.185,0.23
+RAD_ANTMA_Rocklin_2023_2CJJ,0.295,0.271,0.741,0.703,0.601,0.695,0.69,0.74,0.264,0.334
+RBP1_HUMAN_Rocklin_2023_2KWH,0.181,0.183,0.708,0.659,0.763,0.692,0.701,0.735,0.175,0.218
+RCD1_ARATH_Rocklin_2023_5OAO,0.23,0.229,0.762,0.832,0.77,0.799,0.766,0.786,0.212,0.273
+RCRO_LAMBD_Rocklin_2023_1ORC,0.124,0.13,0.546,0.598,0.529,0.545,0.511,0.694,0.108,0.138
+RD23A_HUMAN_Rocklin_2023_1IFY,0.088,0.115,0.513,0.459,0.469,0.485,0.479,0.679,0.095,0.127
+RFAH_ECOLI_Rocklin_2023_2LCL,0.289,0.251,0.766,0.799,0.784,0.824,0.802,0.797,0.24,0.321
+RL20_AQUAE_Rocklin_2023_1GYZ,0.147,0.151,0.659,0.625,0.685,0.615,0.631,0.76,0.132,0.152
+RPC1_BP434_Rocklin_2023_1R69,0.142,0.13,0.471,0.363,0.445,0.393,0.491,0.682,0.124,0.152
+RS15_GEOSE_Rocklin_2023_1A32,0.126,0.137,0.661,0.666,0.619,0.619,0.66,0.73,0.111,0.133
+SAV1_MOUSE_Rocklin_2023_2YSB,0.195,0.192,0.541,0.467,0.455,0.506,0.496,0.612,0.175,0.217
+SBI_STAAM_Rocklin_2023_2JVG,0.291,0.229,0.661,0.729,0.571,0.685,0.649,0.69,0.221,0.316
+SCIN_STAAR_Rocklin_2023_2QFF,0.211,0.186,0.768,0.745,0.766,0.815,0.771,0.775,0.147,0.249
+SDA_BACSU_Rocklin_2023_1PV0,0.099,0.111,0.497,0.527,0.485,0.477,0.469,0.679,0.102,0.11
+SOX30_HUMAN_Rocklin_2023_7JJK,0.18,0.161,0.703,0.625,0.636,0.712,0.676,0.761,0.161,0.201
+SPA_STAAU_Rocklin_2023_1LP1,0.24,0.17,0.672,0.755,0.586,0.701,0.661,0.662,0.163,0.214
+SPG2_STRSG_Rocklin_2023_5UBS,0.144,0.149,0.746,0.768,0.782,0.795,0.722,0.734,0.127,0.191
+SPTN1_CHICK_Rocklin_2023_1TUD,0.215,0.194,0.583,0.541,0.596,0.59,0.563,0.73,0.172,0.221
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.179,0.144,0.433,0.567,0.459,0.476,0.434,0.604,0.132,0.18
+SR43C_ARATH_Rocklin_2023_2N88,0.156,0.141,0.442,0.472,0.485,0.465,0.44,0.634,0.139,0.162
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.162,0.147,0.501,0.529,0.434,0.495,0.454,0.702,0.127,0.163
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.126,0.124,0.434,0.344,0.336,0.357,0.37,0.576,0.099,0.15
+THO1_YEAST_Rocklin_2023_2WQG,0.234,0.201,0.644,0.629,0.631,0.66,0.641,0.664,0.179,0.285
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.12,0.12,0.461,0.417,0.507,0.44,0.428,0.672,0.106,0.144
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.223,0.197,0.55,0.534,0.538,0.663,0.566,0.717,0.184,0.263
+UBR5_HUMAN_Rocklin_2023_1I2T,0.154,0.135,0.576,0.636,0.599,0.491,0.488,0.689,0.13,0.163
+VG08_BPP22_Rocklin_2023_2GP8,0.173,0.159,0.637,0.531,0.599,0.568,0.586,0.644,0.13,0.213
+VILI_CHICK_Rocklin_2023_1YU5,0.181,0.2,0.543,0.517,0.512,0.632,0.524,0.727,0.179,0.23
+VRPI_BPT7_Rocklin_2023_2WNM,0.155,0.178,0.71,0.594,0.682,0.757,0.752,0.663,0.137,0.211
+YAIA_ECOLI_Rocklin_2023_2KVT,0.227,0.198,0.722,0.723,0.71,0.723,0.69,0.716,0.181,0.278
+YNZC_BACSU_Rocklin_2023_2JVD,0.161,0.155,0.58,0.514,0.558,0.561,0.564,0.617,0.136,0.2
+POLG_DEN26_Suphatrakul_2023,0.916,0.705,0.887,0.999,0.755,0.928,0.908,0.966,0.37,0.647
+MET_HUMAN_Estevam_2023,0.506,0.53,0.607,0.591,0.555,0.627,0.591,0.923,0.407,0.456
+RDRP_I33A0_Li_2023,0.982,0.735,0.809,0.995,0.719,0.776,0.698,0.977,0.547,0.672
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.901,0.905,0.901,0.897,0.917,0.932,0.909,0.977,0.842,0.876
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.704,0.69,0.866,0.834,0.836,0.821,0.813,0.947,0.471,0.52
+ACE2_HUMAN_Chan_2020,0.706,0.896,0.829,0.843,0.819,0.909,0.897,0.841,0.455,0.567
+PPM1D_HUMAN_Miller_2022,0.654,0.739,0.735,0.651,0.738,0.734,0.694,0.95,0.56,0.623
+HMDH_HUMAN_Jiang_2019,0.643,0.916,0.848,0.864,0.913,0.929,0.886,0.968,0.528,0.6
+CBS_HUMAN_Sun_2020,0.842,0.866,0.856,0.879,0.863,0.91,0.877,0.977,0.779,0.832
+NPC1_HUMAN_Erwood_2022_HEK293T,0.633,0.45,0.53,0.734,0.415,0.556,0.483,0.868,0.359,0.44
+NPC1_HUMAN_Erwood_2022_RPE1,0.721,0.221,0.417,0.741,0.308,0.503,0.394,0.812,0.186,0.27
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.421,1.014,0.822,0.924,0.922,0.822,0.827,0.916,1.117,0.623
+SHOC2_HUMAN_Kwon_2022,0.8,0.838,0.843,0.817,0.819,0.843,0.829,0.968,0.602,0.723
+RAF1_HUMAN_Zinkus-Boltz_2019,0.836,0.747,0.823,0.768,0.768,0.743,0.747,0.949,0.928,1.037
+OPSD_HUMAN_Wan_2019,0.569,0.428,0.603,0.623,0.562,0.661,0.63,0.846,0.421,0.624
+CAR11_HUMAN_Meitlis_2020_lof,0.516,0.868,0.828,0.679,0.83,0.829,0.82,0.842,0.455,0.462
+CAR11_HUMAN_Meitlis_2020_gof,0.72,0.992,0.933,0.878,0.927,0.927,0.929,0.907,0.689,0.669
+ERBB2_HUMAN_Elazar_2016,0.357,0.422,0.793,0.694,0.72,0.627,0.686,0.86,0.352,0.309
+GLPA_HUMAN_Elazar_2016,0.368,0.419,0.841,0.701,0.753,0.761,0.741,0.885,0.333,0.389
+LYAM1_HUMAN_Elazar_2016,0.582,0.483,0.734,0.664,0.608,0.676,0.69,0.773,0.521,0.606
+GDIA_HUMAN_Silverstein_2021,0.667,0.676,0.787,0.747,0.777,0.726,0.728,0.99,0.711,0.746
+PPARG_HUMAN_Majithia_2016,0.323,0.533,0.539,0.497,0.576,0.589,0.511,0.909,0.118,0.262
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.785,0.948,0.911,0.945,0.903,0.923,0.909,0.927,0.552,0.671
diff --git a/benchmarks/DMS_supervised/substitutions/MSE/Summary_performance_DMS_substitutions_MSE.csv b/benchmarks/DMS_supervised/substitutions/MSE/Summary_performance_DMS_substitutions_MSE.csv
new file mode 100644
index 0000000..a010bf3
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/MSE/Summary_performance_DMS_substitutions_MSE.csv
@@ -0,0 +1,11 @@
+Model_rank,Model_name,Model type,Average_MSE,Bootstrap_standard_error_MSE,Average_MSE_fold_random_5,Average_MSE_fold_modulo_5,Average_MSE_fold_contiguous_5,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,References,Model details
+1,ProteinNPT,Embedding,0.683,0.0,0.459,0.771,0.82,0.703,1.016,0.578,0.752,0.368,0.516,0.738,0.659,0.602,0.605,0.618,0.672,"Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ProteinNPT Model
+2,MSA Transformer Embeddings,Embedding,0.735,0.016,0.573,0.795,0.836,0.728,1.092,0.66,0.789,0.405,0.522,0.816,0.712,0.636,0.659,0.618,0.736,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer Embeddings
+3,Tranception Embeddings,Embedding,0.769,0.027,0.503,0.833,0.972,0.814,1.08,0.639,0.788,0.525,0.635,0.781,0.757,0.724,0.712,0.703,0.717,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception Embeddings
+4,ESM-1v Embeddings,Embedding,0.787,0.029,0.563,0.861,0.937,0.799,1.231,0.655,0.792,0.456,0.555,0.902,0.738,0.724,0.668,0.659,0.818,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v Embeddings
+5,TranceptEVE + One-Hot Encodings,One-hot Encoding,0.87,0.016,0.743,0.914,0.953,0.793,1.199,0.78,0.825,0.756,0.765,0.889,0.845,0.849,0.791,0.837,0.822,"[1] Original model: Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",TranceptEVE + One-Hot Encodings
+6,MSA_Transformer + One-Hot Encodings,One-hot Encoding,0.882,0.013,0.749,0.934,0.963,0.81,1.221,0.788,0.836,0.756,0.765,0.921,0.852,0.848,0.808,0.842,0.818,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer + One-Hot Encodings
+7,DeepSequence + One-Hot Encodings,One-hot Encoding,0.891,0.015,0.767,0.94,0.967,0.83,1.14,0.832,0.86,0.793,0.794,0.927,0.877,0.88,0.824,0.853,0.881,"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5",DeepSequence + One-Hot Encodings
+8,Tranception + One-Hot Encodings,One-hot Encoding,0.895,0.02,0.766,0.934,0.985,0.831,1.246,0.787,0.845,0.765,0.776,0.92,0.869,0.869,0.814,0.86,0.836,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception + One-Hot Encodings
+9,ESM-1v + One-Hot Encodings,One-hot Encoding,0.897,0.017,0.764,0.949,0.977,0.843,1.192,0.795,0.87,0.783,0.768,0.968,0.879,0.868,0.819,0.864,0.907,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v + One-Hot Encodings
+10,One-Hot Encodings,One-hot Encoding,1.061,0.017,0.898,1.125,1.158,1.022,1.306,0.986,1.04,0.949,1.004,1.076,1.027,1.03,1.004,1.062,0.991,"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5",One-Hot Encodings
diff --git a/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv
new file mode 100644
index 0000000..1c85f14
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,0.111,0.455,0.125,-0.039,0.454,0.355,0.383,0.15,0.468,0.385
+A0A192B1T2_9HIV1_Haddox_2018,0.499,0.54,0.447,0.508,0.528,0.528,0.542,0.242,0.648,0.581
+A0A1I9GEU1_NEIME_Kennouche_2019,0.064,0.089,0.1,0.072,0.104,0.064,0.08,0.015,0.079,0.074
+A0A2Z5U3Z0_9INFA_Doud_2016,0.584,0.505,0.528,0.527,0.539,0.549,0.583,0.279,0.507,0.633
+A0A2Z5U3Z0_9INFA_Wu_2014,0.459,0.294,0.475,0.461,0.509,0.539,0.554,0.174,0.373,0.52
+A4D664_9INFA_Soh_2019,0.2,0.332,0.416,0.056,0.347,0.405,0.471,0.22,0.438,0.498
+A4GRB6_PSEAI_Chen_2020,0.739,0.767,0.683,0.669,0.687,0.676,0.697,0.336,0.81,0.728
+AACC1_PSEAI_Dandage_2018,0.524,0.588,0.448,0.51,0.511,0.493,0.517,0.181,0.52,0.487
+ADRB2_HUMAN_Jones_2020,0.548,0.512,0.525,0.539,0.512,0.545,0.548,0.102,0.548,0.573
+AMIE_PSEAE_Wrenbeck_2017,0.686,0.667,0.524,0.63,0.586,0.448,0.492,0.29,0.711,0.623
+LGK_LIPST_Klesmith_2015,0.589,0.616,0.501,0.546,0.568,0.535,0.547,0.129,0.692,0.594
+BLAT_ECOLX_Deng_2012,0.552,0.57,0.542,0.519,0.54,0.503,0.544,0.184,0.622,0.531
+BLAT_ECOLX_Firnberg_2014,0.77,0.803,0.752,0.671,0.711,0.654,0.743,0.262,0.832,0.707
+BLAT_ECOLX_Jacquier_2013,0.709,0.709,0.71,0.642,0.656,0.644,0.724,0.175,0.7,0.641
+BLAT_ECOLX_Stiffler_2015,0.78,0.798,0.75,0.678,0.7,0.646,0.74,0.232,0.823,0.721
+BRCA1_HUMAN_Findlay_2018,0.444,0.503,0.446,0.418,0.511,0.524,0.542,0.165,0.508,0.521
+C6KNH7_9INFA_Lee_2018,0.527,0.439,0.372,0.442,0.405,0.442,0.462,0.269,0.617,0.579
+CALM1_HUMAN_Weile_2017,0.257,0.28,0.238,0.243,0.256,0.272,0.251,0.064,0.249,0.231
+CAPSD_AAV2S_Sinai_2021,0.607,0.71,0.6,0.313,0.619,0.57,0.585,0.318,0.743,0.732
+CCDB_ECOLI_Adkar_2012,0.591,0.731,0.609,0.46,0.52,0.542,0.588,0.361,0.717,0.618
+CCDB_ECOLI_Tripathi_2016,0.373,0.442,0.437,0.347,0.365,0.446,0.454,0.471,0.716,0.396
+CP2C9_HUMAN_Amorosi_2021_abundance,0.643,0.609,0.622,0.597,0.598,0.633,0.641,0.205,0.721,0.682
+CP2C9_HUMAN_Amorosi_2021_activity,0.675,0.669,0.658,0.654,0.611,0.649,0.662,0.325,0.781,0.719
+DLG4_HUMAN_Faure_2021,0.63,0.528,0.468,0.458,0.533,0.636,0.622,0.206,0.601,0.658
+DLG4_RAT_McLaughlin_2012,0.603,0.525,0.526,0.577,0.526,0.477,0.567,0.262,0.636,0.558
+DYR_ECOLI_Thompson_2019,0.512,0.606,0.496,0.442,0.515,0.455,0.506,0.292,0.617,0.513
+ENV_HV1B9_DuenasDecamp_2016,0.48,0.435,0.417,0.458,0.431,0.452,0.437,0.325,0.413,0.469
+ENV_HV1BR_Haddox_2016,0.324,0.342,0.327,0.325,0.349,0.365,0.368,0.046,0.345,0.367
+ESTA_BACSU_Nutschel_2020,0.446,0.581,0.45,0.353,0.461,0.376,0.431,0.149,0.572,0.407
+F7YBW8_MESOW_Aakre_2015,0.625,0.503,0.299,0.619,0.473,0.57,0.474,-0.13,0.412,0.476
+GAL4_YEAST_Kitzman_2015,0.513,0.638,0.59,0.503,0.607,0.585,0.543,0.334,0.62,0.524
+GCN4_YEAST_Staller_2018,0.051,0.489,0.365,0.367,0.405,0.352,0.419,0.034,0.302,0.369
+GFP_AEQVI_Sarkisyan_2016,0.248,0.497,0.49,0.094,0.459,0.423,0.531,0.178,0.558,0.417
+GRB2_HUMAN_Faure_2021,0.733,0.679,0.613,0.62,0.633,0.557,0.629,0.334,0.699,0.601
+HIS7_YEAST_Pokusaeva_2019,-0.003,-0.048,0.085,0.076,0.05,0.072,0.074,0.038,-0.081,-0.034
+HSP82_YEAST_Flynn_2019,0.407,0.425,0.415,0.399,0.429,0.417,0.43,0.135,0.516,0.496
+HSP82_YEAST_Mishra_2016,0.598,0.506,0.572,0.566,0.503,0.55,0.573,0.057,0.641,0.583
+I6TAH8_I68A0_Doud_2015,0.183,0.343,0.294,0.063,0.34,0.367,0.417,0.248,0.464,0.486
+IF1_ECOLI_Kelsic_2016,0.559,0.575,0.56,0.563,0.3,0.519,0.559,0.301,0.601,0.631
+KCNH2_HUMAN_Kozek_2020,0.599,0.526,0.361,0.3,0.404,0.571,0.548,0.401,0.538,0.548
+KKA2_KLEPN_Melnikov_2014,0.681,0.706,0.636,0.614,0.61,0.608,0.645,0.25,0.719,0.662
+MK01_HUMAN_Brenan_2016,0.308,0.329,0.26,0.191,0.165,0.127,0.224,0.22,0.448,0.327
+MSH2_HUMAN_Jia_2020,0.381,0.393,0.381,0.385,0.398,0.349,0.385,0.149,0.494,0.453
+MTH3_HAEAE_RockahShmuel_2015,0.705,0.707,0.714,0.691,0.679,0.636,0.71,0.147,0.694,0.626
+NCAP_I34A1_Doud_2015,0.169,0.359,0.35,0.055,0.37,0.435,0.452,0.131,0.46,0.489
+NRAM_I33A0_Jiang_2016,0.375,0.726,0.53,0.25,0.65,0.64,0.652,0.214,0.746,0.702
+NUD15_HUMAN_Suiter_2020,0.671,0.741,0.622,0.626,0.689,0.629,0.656,0.272,0.748,0.656
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.58,0.514,0.369,0.482,0.33,0.422,0.432,0.235,0.675,0.557
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.579,0.589,0.334,0.471,0.328,0.408,0.408,0.177,0.634,0.574
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.751,0.68,0.4,0.538,0.318,0.463,0.462,0.243,0.827,0.703
+P53_HUMAN_Kotler_2018,0.599,0.634,0.581,0.559,0.601,0.582,0.601,0.291,0.647,0.641
+P84126_THETH_Chan_2017,0.686,0.754,0.644,0.587,0.665,0.59,0.617,0.334,0.746,0.639
+PA_I34A1_Wu_2015,0.087,0.213,0.509,0.07,0.315,0.574,0.586,0.131,0.282,0.584
+PABP_YEAST_Melamed_2013,0.74,0.743,0.595,0.692,0.71,0.684,0.693,0.292,0.741,0.733
+POLG_CXB3N_Mattenberger_2021,0.23,0.5,0.415,-0.046,0.504,0.415,0.462,0.113,0.581,0.501
+POLG_HCVJF_Qi_2014,0.575,0.668,0.427,0.598,0.576,0.556,0.552,0.218,0.664,0.546
+PTEN_HUMAN_Matreyek_2021,0.504,0.532,0.399,0.421,0.441,0.428,0.456,0.185,0.584,0.524
+PTEN_HUMAN_Mighell_2018,0.501,0.513,0.527,0.485,0.471,0.433,0.541,0.277,0.562,0.477
+Q2N0S5_9HIV1_Haddox_2018,0.517,0.534,0.407,0.521,0.525,0.511,0.523,0.229,0.604,0.621
+Q59976_STRSQ_Romero_2015,0.557,0.663,0.646,0.523,0.659,0.661,0.677,0.196,0.714,0.671
+R1AB_SARS2_Flynn_2022,0.369,0.257,0.294,0.132,0.137,0.433,0.59,0.334,0.516,0.585
+RASH_HUMAN_Bandaru_2017,0.631,0.646,0.513,0.404,0.453,0.489,0.525,0.289,0.678,0.638
+REV_HV1H2_Fernandes_2016,0.41,0.425,0.279,0.293,0.294,0.285,0.289,0.205,0.336,0.384
+RL40A_YEAST_Mavor_2016,0.555,0.588,0.482,0.373,0.458,0.463,0.476,0.314,0.578,0.5
+RL40A_YEAST_Roscoe_2013,0.657,0.737,0.544,0.413,0.498,0.524,0.54,0.383,0.678,0.656
+RL40A_YEAST_Roscoe_2014,0.494,0.567,0.451,0.337,0.445,0.467,0.447,0.301,0.581,0.484
+SC6A4_HUMAN_Young_2021,0.548,0.573,0.447,0.539,0.578,0.539,0.555,0.247,0.649,0.608
+SCN5A_HUMAN_Glazer_2019,0.129,0.158,0.164,0.212,0.137,0.093,0.156,0.039,0.058,0.096
+SPG1_STRSG_Olson_2014,0.679,0.351,0.265,0.361,0.414,0.396,0.395,0.359,0.409,0.637
+SPIKE_SARS2_Starr_2020_binding,0.276,0.415,0.222,0.066,0.44,0.362,0.38,0.256,0.467,0.583
+SPIKE_SARS2_Starr_2020_expression,0.264,0.556,0.337,0.135,0.585,0.413,0.513,0.266,0.65,0.571
+SRC_HUMAN_Ahler_2019,0.614,0.422,0.501,0.585,0.412,0.522,0.543,0.315,0.549,0.594
+SUMO1_HUMAN_Weile_2017,0.549,0.585,0.481,0.508,0.53,0.446,0.498,0.3,0.523,0.499
+SYUA_HUMAN_Newberry_2020,0.593,0.562,0.155,0.264,0.177,0.179,0.177,0.117,0.607,0.577
+TADBP_HUMAN_Bolognesi_2019,0.76,0.671,0.116,0.066,0.077,0.141,0.127,0.055,0.762,0.713
+TAT_HV1BR_Fernandes_2016,0.514,0.536,0.34,0.399,0.382,0.309,0.335,0.269,0.534,0.465
+TPK1_HUMAN_Weile_2017,0.326,0.378,0.236,0.281,0.284,0.311,0.283,0.052,0.331,0.255
+TPMT_HUMAN_Matreyek_2018,0.574,0.577,0.523,0.532,0.487,0.489,0.526,0.225,0.561,0.528
+TPOR_HUMAN_Bridgford_2020,0.575,0.556,0.32,0.416,0.506,0.485,0.472,0.057,0.569,0.552
+TRPC_SACS2_Chan_2017,0.69,0.721,0.608,0.636,0.682,0.622,0.631,0.191,0.713,0.668
+TRPC_THEMA_Chan_2017,0.565,0.588,0.468,0.522,0.523,0.491,0.495,0.269,0.623,0.555
+UBC9_HUMAN_Weile_2017,0.55,0.618,0.554,0.503,0.541,0.506,0.564,0.236,0.602,0.519
+UBE4B_MOUSE_Starita_2013,0.477,0.36,0.492,0.471,0.394,0.414,0.492,0.222,0.327,0.474
+VKOR1_HUMAN_Chiasson_2020_abundance,0.643,0.681,0.452,0.473,0.528,0.572,0.56,0.187,0.678,0.653
+VKOR1_HUMAN_Chiasson_2020_activity,0.384,0.444,0.415,0.434,0.424,0.408,0.441,0.148,0.358,0.324
+YAP1_HUMAN_Araya_2012,0.605,0.557,0.678,0.468,0.223,0.534,0.647,0.146,0.54,0.535
+Q8WTC7_9CNID_Somermeyer_2022,0.302,0.534,0.194,0.113,0.382,0.327,0.353,0.211,0.521,0.371
+D7PM05_CLYGR_Somermeyer_2022,0.288,0.715,0.529,0.156,0.619,0.44,0.547,0.351,0.739,0.513
+Q6WV13_9MAXI_Somermeyer_2022,0.202,0.543,0.296,0.133,0.389,0.306,0.338,0.244,0.62,0.353
+SPG1_STRSG_Wu_2016,0.633,-0.077,-0.152,0.439,-0.219,0.179,0.054,0.088,0.13,0.374
+CAS9_STRP1_Spencer_2017_positive,0.07,0.195,0.167,0.069,0.196,0.188,0.194,0.031,0.191,0.176
+A0A247D711_LISMN_Stadelmann_2021,0.387,0.569,0.191,0.228,0.376,0.389,0.34,0.182,0.503,0.417
+HXK4_HUMAN_Gersing_2022_activity,0.521,0.557,0.5,0.475,0.533,0.466,0.505,0.245,0.628,0.57
+ANCSZ_Hobbs_2022,0.595,0.54,0.49,0.536,0.5,0.549,0.555,0.231,0.643,0.641
+PAI1_HUMAN_Huttinger_2021,0.413,0.489,0.4,0.415,0.444,0.42,0.433,0.16,0.556,0.454
+HXK4_HUMAN_Gersing_2023_abundance,0.44,0.43,0.364,0.347,0.414,0.379,0.383,0.204,0.569,0.505
+PHOT_CHLRE_Chen_2023,0.682,0.696,0.634,0.634,0.619,0.587,0.61,0.192,0.694,0.653
+S22A1_HUMAN_Yee_2023_abundance,0.624,0.604,0.584,0.609,0.608,0.639,0.643,0.174,0.74,0.707
+S22A1_HUMAN_Yee_2023_activity,0.592,0.552,0.577,0.551,0.556,0.597,0.608,0.192,0.701,0.66
+KCNE1_HUMAN_Muhammad_2023_expression,0.671,0.537,0.207,0.251,0.172,0.204,0.205,0.227,0.656,0.653
+KCNE1_HUMAN_Muhammad_2023_function,0.681,0.695,0.528,0.519,0.516,0.64,0.653,0.254,0.7,0.717
+RNC_ECOLI_Weeks_2023,0.661,0.675,0.595,0.586,0.602,0.567,0.618,0.245,0.719,0.655
+HEM3_HUMAN_Loggerenberg_2023,0.421,0.449,0.412,0.381,0.437,0.443,0.444,0.135,0.488,0.477
+OXDA_RHOTO_Vanella_2023_expression,0.443,0.495,0.307,0.318,0.326,0.293,0.299,0.243,0.566,0.445
+OXDA_RHOTO_Vanella_2023_activity,0.41,0.437,0.384,0.355,0.359,0.328,0.37,0.134,0.483,0.374
+PRKN_HUMAN_Clausen_2023,0.666,0.53,0.602,0.611,0.516,0.653,0.666,0.27,0.725,0.712
+DYR_ECOLI_Nguyen_2023,0.534,0.589,0.476,0.541,0.493,0.394,0.445,0.303,0.651,0.498
+ENVZ_ECOLI_Ghose_2023,0.257,0.29,0.213,0.236,0.219,0.248,0.227,0.116,0.308,0.257
+MLAC_ECOLI_MacRae_2023,0.586,0.603,0.448,0.426,0.445,0.354,0.423,0.255,0.667,0.585
+Q837P5_ENTFA_Meier_2023,0.477,0.6,0.525,0.425,0.417,0.538,0.527,0.179,0.661,0.649
+Q837P4_ENTFA_Meier_2023,0.667,0.673,0.57,0.583,0.574,0.574,0.602,0.431,0.699,0.666
+CCR5_HUMAN_Gill_2023,0.384,0.384,0.301,0.355,0.366,0.385,0.334,0.145,0.463,0.465
+Q53Z42_HUMAN_McShan_2019_expression,0.553,0.637,0.56,0.493,0.579,0.556,0.584,0.175,0.642,0.612
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.606,0.596,0.391,0.345,0.353,0.335,0.385,0.139,0.672,0.629
+RPC1_LAMBD_Li_2019_low-expression,0.544,0.656,0.539,0.511,0.483,0.512,0.54,0.325,0.602,0.505
+RPC1_LAMBD_Li_2019_high-expression,0.467,0.503,0.504,0.473,0.493,0.479,0.504,0.232,0.496,0.465
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.219,0.506,0.712,0.223,0.418,0.5,0.514,0.32,0.439,0.452
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.446,0.496,0.402,0.439,0.469,0.392,0.41,0.212,0.528,0.465
+A4_HUMAN_Seuma_2022,0.628,0.426,0.483,0.362,0.402,0.485,0.509,0.219,0.611,0.644
+SRC_HUMAN_Nguyen_2022,0.54,0.522,0.455,0.498,0.49,0.425,0.454,0.198,0.563,0.51
+F7YBW7_MESOW_Ding_2023,0.391,0.545,0.361,0.389,0.518,0.55,0.497,-0.055,0.473,0.518
+RASK_HUMAN_Weng_2022_abundance,0.593,0.621,0.215,0.166,0.226,0.323,0.261,0.237,0.674,0.677
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.661,0.673,0.532,0.509,0.601,0.522,0.543,0.256,0.761,0.719
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.524,0.514,0.456,0.499,0.491,0.428,0.46,0.167,0.555,0.528
+AICDA_HUMAN_Gajula_2014_3cycles,0.244,0.203,0.346,0.233,0.254,0.286,0.353,-0.053,0.273,0.172
+OTC_HUMAN_Lo_2023,0.714,0.778,0.541,0.594,0.593,0.636,0.618,0.218,0.804,0.695
+CASP3_HUMAN_Roychowdhury_2020,0.623,0.694,0.627,0.598,0.655,0.597,0.635,0.154,0.662,0.594
+CASP7_HUMAN_Roychowdhury_2020,0.613,0.656,0.623,0.613,0.642,0.596,0.634,0.182,0.599,0.593
+SERC_HUMAN_Xie_2023,0.542,0.558,0.552,0.541,0.553,0.551,0.559,0.148,0.585,0.56
+MTHR_HUMAN_Weile_2021,0.507,0.292,0.27,0.303,0.304,0.271,0.283,0.178,0.63,0.572
+AMFR_HUMAN_Rocklin_2023_4G3O,0.674,0.804,0.531,0.325,0.519,0.468,0.529,0.253,0.829,0.688
+ARGR_ECOLI_Rocklin_2023_1AOY,0.653,0.732,0.494,0.368,0.484,0.5,0.526,0.241,0.782,0.7
+BBC1_YEAST_Rocklin_2023_1TG0,0.716,0.742,0.486,0.512,0.451,0.47,0.482,0.266,0.758,0.689
+BCHB_CHLTE_Rocklin_2023_2KRU,0.654,0.725,0.386,0.432,0.498,0.448,0.407,0.214,0.747,0.609
+CATR_CHLRE_Rocklin_2023_2AMI,0.79,0.784,0.4,0.374,0.322,0.442,0.441,0.22,0.843,0.803
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.738,0.762,0.466,0.455,0.482,0.505,0.504,0.275,0.787,0.699
+CBX4_HUMAN_Rocklin_2023_2K28,0.681,0.808,0.591,0.552,0.655,0.584,0.636,0.248,0.81,0.677
+CSN4_MOUSE_Rocklin_2023_1UFM,0.78,0.789,0.456,0.61,0.478,0.559,0.528,0.252,0.826,0.726
+CUE1_YEAST_Rocklin_2023_2MYX,0.718,0.813,0.578,0.488,0.548,0.497,0.571,0.389,0.814,0.665
+DN7A_SACS2_Rocklin_2023_1JIC,0.622,0.771,0.317,0.258,0.384,0.297,0.313,0.297,0.794,0.65
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.805,0.773,0.506,0.485,0.412,0.607,0.539,0.446,0.812,0.78
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.743,0.782,0.47,0.602,0.598,0.571,0.531,0.25,0.79,0.667
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.798,0.827,0.612,0.661,0.618,0.626,0.642,0.357,0.859,0.719
+FECA_ECOLI_Rocklin_2023_2D1U,0.61,0.777,0.484,0.348,0.501,0.365,0.409,0.296,0.781,0.621
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.529,0.704,0.549,0.282,0.365,0.422,0.482,0.285,0.745,0.627
+HCP_LAMBD_Rocklin_2023_2L6Q,0.805,0.78,0.327,0.527,0.451,0.583,0.566,0.316,0.788,0.74
+HECD1_HUMAN_Rocklin_2023_3DKM,0.673,0.857,0.65,0.379,0.703,0.596,0.642,0.381,0.854,0.758
+ILF3_HUMAN_Rocklin_2023_2L33,0.661,0.697,0.451,0.391,0.44,0.427,0.459,0.172,0.766,0.674
+ISDH_STAAW_Rocklin_2023_2LHR,0.699,0.616,0.225,0.388,0.297,0.234,0.239,0.233,0.675,0.514
+MAFG_MOUSE_Rocklin_2023_1K1V,0.697,0.655,0.37,0.458,0.36,0.525,0.495,0.219,0.673,0.685
+MBD11_ARATH_Rocklin_2023_6ACV,0.651,0.763,0.549,0.348,0.527,0.482,0.552,0.404,0.803,0.644
+MYO3_YEAST_Rocklin_2023_2BTT,0.706,0.73,0.431,0.511,0.482,0.406,0.448,0.247,0.724,0.611
+NKX31_HUMAN_Rocklin_2023_2L9R,0.793,0.816,0.628,0.58,0.634,0.626,0.652,0.357,0.836,0.774
+NUSA_ECOLI_Rocklin_2023_1WCL,0.83,0.833,0.504,0.42,0.447,0.527,0.565,0.334,0.87,0.794
+NUSG_MYCTU_Rocklin_2023_2MI6,0.809,0.871,0.571,0.416,0.572,0.551,0.561,0.361,0.891,0.81
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.777,0.811,0.676,0.62,0.658,0.597,0.665,0.275,0.82,0.702
+ODP2_GEOSE_Rocklin_2023_1W4G,0.389,0.719,0.471,0.274,0.511,0.476,0.491,0.229,0.735,0.563
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.842,0.581,0.355,0.638,0.371,0.403,0.384,0.282,0.64,0.571
+PIN1_HUMAN_Rocklin_2023_1I6C,0.748,0.816,0.674,0.625,0.693,0.706,0.735,0.227,0.82,0.804
+PITX2_HUMAN_Rocklin_2023_2L7M,0.697,0.776,0.583,0.541,0.609,0.579,0.586,0.261,0.761,0.706
+PKN1_HUMAN_Rocklin_2023_1URF,0.665,0.697,0.3,0.352,0.291,0.365,0.352,0.181,0.76,0.669
+POLG_PESV_Rocklin_2023_2MXD,0.641,0.759,0.337,0.246,0.524,0.287,0.331,0.281,0.773,0.617
+PR40A_HUMAN_Rocklin_2023_1UZC,0.632,0.818,0.515,0.36,0.63,0.533,0.566,0.35,0.82,0.66
+PSAE_SYNP2_Rocklin_2023_1PSE,0.725,0.612,0.414,0.453,0.297,0.327,0.374,0.269,0.669,0.639
+RAD_ANTMA_Rocklin_2023_2CJJ,0.683,0.737,0.339,0.386,0.526,0.379,0.375,0.214,0.751,0.616
+RBP1_HUMAN_Rocklin_2023_2KWH,0.833,0.742,0.394,0.502,0.292,0.423,0.397,0.262,0.785,0.709
+RCD1_ARATH_Rocklin_2023_5OAO,0.822,0.778,0.369,0.28,0.339,0.335,0.377,0.164,0.788,0.692
+RCRO_LAMBD_Rocklin_2023_1ORC,0.706,0.757,0.515,0.452,0.545,0.483,0.523,0.361,0.786,0.692
+RD23A_HUMAN_Rocklin_2023_1IFY,0.846,0.852,0.545,0.616,0.62,0.595,0.594,0.206,0.872,0.768
+RFAH_ECOLI_Rocklin_2023_2LCL,0.678,0.715,0.377,0.315,0.342,0.309,0.332,0.128,0.719,0.651
+RL20_AQUAE_Rocklin_2023_1GYZ,0.596,0.561,0.35,0.43,0.309,0.379,0.368,0.244,0.558,0.553
+RPC1_BP434_Rocklin_2023_1R69,0.861,0.866,0.582,0.707,0.622,0.683,0.619,0.456,0.866,0.781
+RS15_GEOSE_Rocklin_2023_1A32,0.814,0.776,0.412,0.422,0.46,0.437,0.4,0.174,0.772,0.717
+SAV1_MOUSE_Rocklin_2023_2YSB,0.699,0.737,0.539,0.637,0.669,0.591,0.587,0.387,0.765,0.66
+SBI_STAAM_Rocklin_2023_2JVG,0.659,0.797,0.445,0.387,0.588,0.414,0.468,0.326,0.794,0.636
+SCIN_STAAR_Rocklin_2023_2QFF,0.7,0.783,0.348,0.386,0.344,0.272,0.341,0.21,0.802,0.622
+SDA_BACSU_Rocklin_2023_1PV0,0.755,0.809,0.598,0.496,0.623,0.564,0.608,0.206,0.814,0.749
+SOX30_HUMAN_Rocklin_2023_7JJK,0.721,0.68,0.326,0.41,0.358,0.316,0.339,0.192,0.746,0.575
+SPA_STAAU_Rocklin_2023_1LP1,0.644,0.778,0.405,0.28,0.51,0.39,0.432,0.258,0.765,0.707
+SPG2_STRSG_Rocklin_2023_5UBS,0.738,0.715,0.298,0.3,0.22,0.214,0.332,0.222,0.724,0.62
+SPTN1_CHICK_Rocklin_2023_1TUD,0.747,0.727,0.53,0.583,0.513,0.547,0.551,0.239,0.742,0.693
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.767,0.863,0.672,0.496,0.66,0.648,0.679,0.31,0.886,0.807
+SR43C_ARATH_Rocklin_2023_2N88,0.742,0.774,0.62,0.573,0.616,0.61,0.633,0.303,0.795,0.723
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.697,0.806,0.611,0.558,0.672,0.625,0.644,0.222,0.817,0.724
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.848,0.725,0.453,0.731,0.607,0.595,0.56,0.26,0.764,0.748
+THO1_YEAST_Rocklin_2023_2WQG,0.651,0.713,0.404,0.459,0.477,0.379,0.397,0.272,0.751,0.596
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.821,0.793,0.625,0.654,0.566,0.624,0.641,0.271,0.8,0.752
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.747,0.787,0.556,0.61,0.587,0.5,0.554,0.304,0.791,0.613
+UBR5_HUMAN_Rocklin_2023_1I2T,0.725,0.779,0.467,0.412,0.429,0.506,0.521,0.346,0.787,0.73
+VG08_BPP22_Rocklin_2023_2GP8,0.832,0.753,0.371,0.607,0.502,0.519,0.479,0.311,0.768,0.736
+VILI_CHICK_Rocklin_2023_1YU5,0.748,0.736,0.639,0.57,0.615,0.487,0.614,0.235,0.712,0.592
+VRPI_BPT7_Rocklin_2023_2WNM,0.774,0.693,0.32,0.436,0.339,0.189,0.221,0.306,0.72,0.625
+YAIA_ECOLI_Rocklin_2023_2KVT,0.66,0.719,0.352,0.361,0.391,0.384,0.416,0.295,0.71,0.534
+YNZC_BACSU_Rocklin_2023_2JVD,0.754,0.665,0.33,0.537,0.387,0.376,0.361,0.275,0.745,0.546
+POLG_DEN26_Suphatrakul_2023,0.245,0.509,0.299,0.017,0.45,0.553,0.573,0.2,0.706,0.603
+MET_HUMAN_Estevam_2023,0.602,0.623,0.574,0.561,0.609,0.574,0.583,0.254,0.622,0.62
+RDRP_I33A0_Li_2023,0.119,0.488,0.392,0.062,0.49,0.479,0.53,0.165,0.592,0.53
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.351,0.35,0.35,0.352,0.316,0.298,0.341,0.142,0.437,0.387
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.423,0.408,0.283,0.301,0.314,0.344,0.354,0.184,0.507,0.489
+ACE2_HUMAN_Chan_2020,0.342,0.293,0.277,0.256,0.287,0.332,0.329,0.255,0.489,0.389
+PPM1D_HUMAN_Miller_2022,0.604,0.554,0.544,0.603,0.544,0.598,0.621,0.26,0.641,0.616
+HMDH_HUMAN_Jiang_2019,0.58,0.3,0.401,0.35,0.282,0.34,0.369,0.167,0.649,0.605
+CBS_HUMAN_Sun_2020,0.387,0.377,0.385,0.348,0.374,0.332,0.367,0.142,0.463,0.402
+NPC1_HUMAN_Erwood_2022_HEK293T,0.493,0.754,0.657,0.406,0.752,0.72,0.742,0.209,0.762,0.738
+NPC1_HUMAN_Erwood_2022_RPE1,0.4,0.808,0.631,0.409,0.785,0.738,0.756,0.507,0.776,0.734
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.376,-0.138,0.458,0.229,-0.043,0.439,0.443,-0.029,-0.006,0.421
+SHOC2_HUMAN_Kwon_2022,0.436,0.397,0.396,0.396,0.403,0.381,0.408,0.199,0.539,0.47
+RAF1_HUMAN_Zinkus-Boltz_2019,0.318,0.461,0.393,0.427,0.474,0.395,0.44,0.166,0.368,0.29
+OPSD_HUMAN_Wan_2019,0.634,0.711,0.551,0.528,0.601,0.542,0.52,0.299,0.703,0.638
+CAR11_HUMAN_Meitlis_2020_lof,0.628,0.371,0.334,0.523,0.31,0.336,0.348,0.162,0.617,0.587
+CAR11_HUMAN_Meitlis_2020_gof,0.424,0.195,0.242,0.362,0.244,0.254,0.254,0.187,0.504,0.437
+ERBB2_HUMAN_Elazar_2016,0.66,0.64,0.341,0.517,0.479,0.539,0.493,0.119,0.606,0.684
+GLPA_HUMAN_Elazar_2016,0.644,0.555,0.29,0.45,0.4,0.449,0.443,0.232,0.584,0.674
+LYAM1_HUMAN_Elazar_2016,0.598,0.611,0.279,0.449,0.497,0.424,0.399,0.407,0.571,0.576
+GDIA_HUMAN_Silverstein_2021,0.469,0.526,0.446,0.425,0.461,0.438,0.462,0.054,0.508,0.439
+PPARG_HUMAN_Majithia_2016,0.809,0.686,0.668,0.707,0.623,0.653,0.691,0.317,0.89,0.815
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.324,0.332,0.24,0.105,0.314,0.236,0.26,0.174,0.521,0.427
diff --git a/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_contiguous_5.csv b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_contiguous_5.csv
new file mode 100644
index 0000000..77c132a
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_contiguous_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,-0.092,0.445,0.127,-0.064,0.445,0.342,0.37,0.082,0.429,0.297
+A0A192B1T2_9HIV1_Haddox_2018,0.507,0.539,0.443,0.507,0.525,0.523,0.538,0.179,0.562,0.536
+A0A1I9GEU1_NEIME_Kennouche_2019,0.054,0.06,0.089,0.055,0.093,0.043,0.062,-0.027,0.077,0.074
+A0A2Z5U3Z0_9INFA_Doud_2016,0.543,0.438,0.522,0.517,0.525,0.546,0.58,0.139,0.376,0.569
+A0A2Z5U3Z0_9INFA_Wu_2014,0.465,0.296,0.485,0.463,0.502,0.55,0.567,0.113,0.349,0.533
+A4D664_9INFA_Soh_2019,0.126,0.323,0.405,0.043,0.33,0.389,0.46,0.121,0.412,0.423
+A4GRB6_PSEAI_Chen_2020,0.698,0.75,0.676,0.658,0.677,0.668,0.687,0.216,0.752,0.675
+AACC1_PSEAI_Dandage_2018,0.512,0.544,0.429,0.492,0.5,0.471,0.5,-0.051,0.49,0.425
+ADRB2_HUMAN_Jones_2020,0.559,0.527,0.538,0.556,0.526,0.56,0.561,-0.201,0.506,0.559
+AMIE_PSEAE_Wrenbeck_2017,0.663,0.608,0.508,0.627,0.577,0.402,0.46,0.191,0.601,0.433
+LGK_LIPST_Klesmith_2015,0.571,0.61,0.493,0.541,0.561,0.519,0.537,-0.184,0.605,0.542
+BLAT_ECOLX_Deng_2012,0.477,0.543,0.516,0.484,0.508,0.467,0.519,-0.147,0.526,0.425
+BLAT_ECOLX_Firnberg_2014,0.721,0.779,0.74,0.641,0.69,0.632,0.731,-0.001,0.784,0.636
+BLAT_ECOLX_Jacquier_2013,0.688,0.701,0.712,0.618,0.646,0.631,0.727,-0.025,0.701,0.613
+BLAT_ECOLX_Stiffler_2015,0.744,0.778,0.735,0.653,0.68,0.623,0.724,-0.014,0.778,0.651
+BRCA1_HUMAN_Findlay_2018,0.4,0.508,0.438,0.402,0.507,0.517,0.536,0.131,0.503,0.456
+C6KNH7_9INFA_Lee_2018,0.482,0.412,0.357,0.429,0.367,0.424,0.444,0.13,0.486,0.451
+CALM1_HUMAN_Weile_2017,0.244,0.261,0.224,0.225,0.24,0.263,0.239,0.009,0.27,0.22
+CAPSD_AAV2S_Sinai_2021,0.579,0.642,0.565,0.17,0.574,0.54,0.563,0.181,0.686,0.681
+CCDB_ECOLI_Adkar_2012,0.399,0.537,0.559,0.347,0.431,0.485,0.541,0.125,0.496,0.443
+CCDB_ECOLI_Tripathi_2016,0.283,0.36,0.486,0.324,0.331,0.516,0.528,0.535,0.812,0.423
+CP2C9_HUMAN_Amorosi_2021_abundance,0.614,0.611,0.616,0.589,0.592,0.627,0.636,-0.011,0.66,0.605
+CP2C9_HUMAN_Amorosi_2021_activity,0.65,0.659,0.649,0.643,0.602,0.636,0.654,0.206,0.716,0.626
+DLG4_HUMAN_Faure_2021,0.541,0.502,0.401,0.388,0.48,0.607,0.582,0.022,0.52,0.529
+DLG4_RAT_McLaughlin_2012,0.556,0.517,0.49,0.547,0.493,0.431,0.54,0.042,0.531,0.399
+DYR_ECOLI_Thompson_2019,0.432,0.541,0.47,0.402,0.497,0.417,0.482,0.16,0.556,0.417
+ENV_HV1B9_DuenasDecamp_2016,0.462,0.384,0.354,0.428,0.383,0.384,0.376,0.33,0.371,0.417
+ENV_HV1BR_Haddox_2016,0.332,0.35,0.334,0.331,0.357,0.369,0.373,0.049,0.355,0.371
+ESTA_BACSU_Nutschel_2020,0.341,0.507,0.414,0.294,0.421,0.312,0.394,-0.103,0.499,0.248
+F7YBW8_MESOW_Aakre_2015,0.665,0.491,0.238,0.652,0.444,0.636,0.492,-0.418,0.342,0.495
+GAL4_YEAST_Kitzman_2015,0.382,0.617,0.567,0.409,0.575,0.517,0.457,0.17,0.553,0.403
+GCN4_YEAST_Staller_2018,0.008,0.513,0.644,0.464,0.646,0.647,0.705,0.328,0.397,0.368
+GFP_AEQVI_Sarkisyan_2016,0.152,0.483,0.495,0.059,0.449,0.402,0.528,0.12,0.506,0.357
+GRB2_HUMAN_Faure_2021,0.639,0.556,0.545,0.544,0.568,0.438,0.541,0.102,0.587,0.408
+HIS7_YEAST_Pokusaeva_2019,-0.004,-0.079,0.129,0.119,0.097,0.137,0.128,0.264,-0.06,0.04
+HSP82_YEAST_Flynn_2019,0.399,0.421,0.406,0.389,0.419,0.411,0.424,-0.094,0.473,0.474
+HSP82_YEAST_Mishra_2016,0.606,0.512,0.593,0.579,0.504,0.568,0.594,-0.412,0.634,0.55
+I6TAH8_I68A0_Doud_2015,0.083,0.34,0.249,0.016,0.321,0.349,0.4,0.067,0.26,0.377
+IF1_ECOLI_Kelsic_2016,0.424,0.457,0.528,0.533,0.213,0.483,0.531,0.054,0.43,0.511
+KCNH2_HUMAN_Kozek_2020,0.58,0.457,0.276,0.194,0.281,0.529,0.498,0.498,0.45,0.502
+KKA2_KLEPN_Melnikov_2014,0.663,0.695,0.633,0.612,0.603,0.607,0.642,0.081,0.692,0.616
+MK01_HUMAN_Brenan_2016,0.251,0.28,0.268,0.203,0.174,0.137,0.233,0.226,0.333,0.189
+MSH2_HUMAN_Jia_2020,0.378,0.39,0.374,0.379,0.39,0.334,0.373,-0.039,0.362,0.369
+MTH3_HAEAE_RockahShmuel_2015,0.698,0.7,0.705,0.681,0.67,0.608,0.696,-0.125,0.702,0.603
+NCAP_I34A1_Doud_2015,0.094,0.361,0.332,0.013,0.358,0.422,0.441,-0.042,0.339,0.411
+NRAM_I33A0_Jiang_2016,0.192,0.675,0.443,0.034,0.612,0.572,0.585,-0.116,0.714,0.625
+NUD15_HUMAN_Suiter_2020,0.601,0.697,0.597,0.603,0.673,0.598,0.631,0.108,0.696,0.558
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.542,0.474,0.349,0.5,0.319,0.431,0.432,-0.041,0.618,0.38
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.543,0.508,0.301,0.478,0.3,0.414,0.409,-0.143,0.55,0.429
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.699,0.552,0.359,0.573,0.276,0.468,0.453,-0.117,0.751,0.488
+P53_HUMAN_Kotler_2018,0.538,0.618,0.561,0.537,0.577,0.562,0.581,0.235,0.62,0.602
+P84126_THETH_Chan_2017,0.594,0.715,0.593,0.516,0.612,0.526,0.557,0.231,0.668,0.503
+PA_I34A1_Wu_2015,0.037,0.228,0.508,0.053,0.315,0.575,0.586,0.068,0.256,0.557
+PABP_YEAST_Melamed_2013,0.678,0.712,0.551,0.666,0.68,0.653,0.665,0.199,0.672,0.641
+POLG_CXB3N_Mattenberger_2021,0.122,0.5,0.417,-0.063,0.502,0.412,0.462,0.032,0.535,0.392
+POLG_HCVJF_Qi_2014,0.412,0.601,0.362,0.536,0.535,0.467,0.467,0.105,0.516,0.321
+PTEN_HUMAN_Matreyek_2021,0.437,0.497,0.372,0.398,0.415,0.401,0.434,-0.073,0.461,0.348
+PTEN_HUMAN_Mighell_2018,0.478,0.518,0.532,0.493,0.472,0.427,0.545,0.289,0.502,0.372
+Q2N0S5_9HIV1_Haddox_2018,0.519,0.533,0.397,0.514,0.518,0.501,0.512,-0.015,0.58,0.56
+Q59976_STRSQ_Romero_2015,0.523,0.663,0.638,0.509,0.652,0.656,0.67,0.036,0.703,0.661
+R1AB_SARS2_Flynn_2022,0.282,0.192,0.245,-0.011,-0.016,0.402,0.58,0.251,0.314,0.393
+RASH_HUMAN_Bandaru_2017,0.51,0.596,0.484,0.365,0.417,0.452,0.493,0.059,0.583,0.471
+REV_HV1H2_Fernandes_2016,0.319,0.341,0.238,0.251,0.259,0.252,0.25,0.075,0.182,0.274
+RL40A_YEAST_Mavor_2016,0.428,0.507,0.451,0.315,0.416,0.417,0.444,0.304,0.493,0.386
+RL40A_YEAST_Roscoe_2013,0.488,0.643,0.506,0.336,0.441,0.465,0.497,0.415,0.48,0.516
+RL40A_YEAST_Roscoe_2014,0.366,0.503,0.398,0.246,0.389,0.401,0.39,0.101,0.514,0.29
+SC6A4_HUMAN_Young_2021,0.547,0.582,0.45,0.539,0.582,0.538,0.56,0.232,0.618,0.544
+SCN5A_HUMAN_Glazer_2019,0.143,0.173,0.167,0.221,0.146,0.097,0.164,0.139,0.076,0.067
+SPG1_STRSG_Olson_2014,0.528,0.261,0.067,0.221,0.288,0.256,0.259,0.309,0.28,0.45
+SPIKE_SARS2_Starr_2020_binding,0.186,0.274,0.12,-0.083,0.38,0.306,0.308,-0.114,0.269,0.478
+SPIKE_SARS2_Starr_2020_expression,0.127,0.555,0.279,-0.021,0.553,0.361,0.476,-0.019,0.446,0.312
+SRC_HUMAN_Ahler_2019,0.573,0.434,0.479,0.572,0.387,0.502,0.526,0.22,0.357,0.475
+SUMO1_HUMAN_Weile_2017,0.51,0.554,0.459,0.501,0.527,0.398,0.475,0.287,0.497,0.421
+SYUA_HUMAN_Newberry_2020,0.531,0.494,0.165,0.28,0.177,0.19,0.19,-0.016,0.58,0.517
+TADBP_HUMAN_Bolognesi_2019,0.71,0.65,0.069,0.033,0.013,0.089,0.067,-0.066,0.753,0.615
+TAT_HV1BR_Fernandes_2016,0.474,0.517,0.289,0.389,0.364,0.244,0.283,0.279,0.54,0.374
+TPK1_HUMAN_Weile_2017,0.313,0.356,0.223,0.266,0.274,0.297,0.27,-0.047,0.298,0.198
+TPMT_HUMAN_Matreyek_2018,0.581,0.588,0.527,0.541,0.485,0.478,0.524,0.107,0.572,0.468
+TPOR_HUMAN_Bridgford_2020,0.552,0.531,0.241,0.349,0.455,0.415,0.4,-0.149,0.532,0.499
+TRPC_SACS2_Chan_2017,0.632,0.686,0.567,0.6,0.651,0.582,0.595,-0.082,0.683,0.609
+TRPC_THEMA_Chan_2017,0.466,0.469,0.397,0.453,0.465,0.428,0.432,0.139,0.471,0.437
+UBC9_HUMAN_Weile_2017,0.444,0.545,0.521,0.457,0.505,0.464,0.532,0.051,0.547,0.389
+UBE4B_MOUSE_Starita_2013,0.463,0.351,0.496,0.468,0.357,0.389,0.485,0.209,0.311,0.427
+VKOR1_HUMAN_Chiasson_2020_abundance,0.57,0.617,0.416,0.441,0.503,0.544,0.533,0.011,0.61,0.549
+VKOR1_HUMAN_Chiasson_2020_activity,0.332,0.446,0.413,0.44,0.43,0.413,0.446,0.164,0.37,0.314
+YAP1_HUMAN_Araya_2012,0.494,0.505,0.65,0.374,0.018,0.471,0.613,-0.232,0.373,0.358
+Q8WTC7_9CNID_Somermeyer_2022,0.229,0.527,0.159,-0.019,0.351,0.297,0.32,0.026,0.462,0.308
+D7PM05_CLYGR_Somermeyer_2022,0.166,0.702,0.515,0.026,0.601,0.415,0.531,0.34,0.709,0.462
+Q6WV13_9MAXI_Somermeyer_2022,0.048,0.511,0.254,0.039,0.362,0.264,0.302,0.0,0.536,0.279
+SPG1_STRSG_Wu_2016,0.58,-0.518,-0.509,0.267,-0.654,-0.108,-0.258,-0.222,-0.235,0.2
+CAS9_STRP1_Spencer_2017_positive,0.062,0.194,0.169,0.069,0.199,0.193,0.198,0.056,0.19,0.175
+A0A247D711_LISMN_Stadelmann_2021,0.264,0.539,0.056,0.114,0.314,0.327,0.277,-0.004,0.474,0.344
+HXK4_HUMAN_Gersing_2022_activity,0.512,0.563,0.494,0.469,0.529,0.448,0.496,0.109,0.61,0.535
+ANCSZ_Hobbs_2022,0.564,0.532,0.466,0.52,0.48,0.535,0.541,0.026,0.569,0.597
+PAI1_HUMAN_Huttinger_2021,0.414,0.486,0.395,0.412,0.441,0.415,0.43,0.048,0.538,0.439
+HXK4_HUMAN_Gersing_2023_abundance,0.439,0.426,0.354,0.337,0.406,0.358,0.369,0.073,0.539,0.462
+PHOT_CHLRE_Chen_2023,0.649,0.654,0.618,0.617,0.596,0.561,0.591,-0.055,0.628,0.606
+S22A1_HUMAN_Yee_2023_abundance,0.621,0.602,0.575,0.602,0.6,0.631,0.635,-0.012,0.704,0.69
+S22A1_HUMAN_Yee_2023_activity,0.586,0.55,0.57,0.543,0.548,0.59,0.602,0.021,0.672,0.64
+KCNE1_HUMAN_Muhammad_2023_expression,0.624,0.465,0.129,0.16,0.057,0.117,0.126,-0.03,0.599,0.595
+KCNE1_HUMAN_Muhammad_2023_function,0.631,0.666,0.49,0.469,0.481,0.619,0.633,0.046,0.659,0.692
+RNC_ECOLI_Weeks_2023,0.656,0.661,0.582,0.573,0.588,0.555,0.612,0.146,0.712,0.626
+HEM3_HUMAN_Loggerenberg_2023,0.409,0.448,0.402,0.371,0.431,0.438,0.437,0.011,0.477,0.458
+OXDA_RHOTO_Vanella_2023_expression,0.412,0.492,0.292,0.303,0.313,0.275,0.283,0.127,0.54,0.407
+OXDA_RHOTO_Vanella_2023_activity,0.383,0.434,0.377,0.343,0.349,0.308,0.357,-0.079,0.454,0.344
+PRKN_HUMAN_Clausen_2023,0.648,0.542,0.59,0.603,0.507,0.645,0.659,0.017,0.668,0.681
+DYR_ECOLI_Nguyen_2023,0.512,0.559,0.472,0.536,0.483,0.375,0.44,0.203,0.602,0.434
+ENVZ_ECOLI_Ghose_2023,0.151,0.184,0.146,0.189,0.17,0.193,0.164,0.019,0.222,0.147
+MLAC_ECOLI_MacRae_2023,0.526,0.552,0.427,0.399,0.418,0.317,0.399,0.052,0.598,0.503
+Q837P5_ENTFA_Meier_2023,0.332,0.546,0.482,0.343,0.319,0.472,0.467,0.0,0.614,0.59
+Q837P4_ENTFA_Meier_2023,0.617,0.656,0.531,0.548,0.534,0.534,0.566,0.45,0.594,0.594
+CCR5_HUMAN_Gill_2023,0.376,0.385,0.288,0.346,0.358,0.376,0.323,-0.006,0.443,0.443
+Q53Z42_HUMAN_McShan_2019_expression,0.541,0.624,0.54,0.465,0.562,0.538,0.566,-0.067,0.567,0.552
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.584,0.567,0.348,0.306,0.32,0.295,0.346,-0.198,0.636,0.58
+RPC1_LAMBD_Li_2019_low-expression,0.487,0.629,0.503,0.48,0.426,0.465,0.501,0.228,0.57,0.434
+RPC1_LAMBD_Li_2019_high-expression,0.46,0.486,0.491,0.457,0.472,0.455,0.489,0.205,0.483,0.414
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.01,0.311,0.664,-0.087,0.197,0.336,0.347,0.211,0.182,0.328
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.427,0.479,0.38,0.414,0.444,0.368,0.388,0.059,0.445,0.394
+A4_HUMAN_Seuma_2022,0.589,0.401,0.457,0.287,0.338,0.446,0.488,0.072,0.547,0.598
+SRC_HUMAN_Nguyen_2022,0.513,0.507,0.429,0.481,0.473,0.4,0.429,0.024,0.484,0.427
+F7YBW7_MESOW_Ding_2023,0.218,0.471,0.215,0.259,0.43,0.531,0.43,-0.389,0.337,0.366
+RASK_HUMAN_Weng_2022_abundance,0.528,0.568,0.152,0.07,0.164,0.278,0.206,-0.002,0.606,0.61
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.605,0.639,0.508,0.477,0.581,0.491,0.519,0.078,0.7,0.649
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.479,0.496,0.424,0.476,0.469,0.401,0.433,-0.074,0.491,0.446
+AICDA_HUMAN_Gajula_2014_3cycles,0.203,0.144,0.381,0.222,0.267,0.293,0.379,-0.246,0.179,0.13
+OTC_HUMAN_Lo_2023,0.685,0.776,0.525,0.577,0.574,0.616,0.603,0.041,0.767,0.657
+CASP3_HUMAN_Roychowdhury_2020,0.607,0.69,0.62,0.588,0.647,0.584,0.628,-0.011,0.655,0.566
+CASP7_HUMAN_Roychowdhury_2020,0.613,0.651,0.616,0.606,0.636,0.584,0.627,0.046,0.546,0.568
+SERC_HUMAN_Xie_2023,0.53,0.546,0.551,0.537,0.551,0.55,0.558,0.04,0.561,0.555
+MTHR_HUMAN_Weile_2021,0.496,0.293,0.255,0.292,0.291,0.257,0.27,-0.062,0.619,0.551
+AMFR_HUMAN_Rocklin_2023_4G3O,0.528,0.735,0.445,0.163,0.428,0.38,0.454,0.099,0.763,0.563
+ARGR_ECOLI_Rocklin_2023_1AOY,0.525,0.62,0.426,0.273,0.41,0.442,0.475,-0.008,0.706,0.606
+BBC1_YEAST_Rocklin_2023_1TG0,0.621,0.649,0.402,0.434,0.346,0.384,0.397,0.071,0.673,0.585
+BCHB_CHLTE_Rocklin_2023_2KRU,0.472,0.612,0.257,0.323,0.417,0.345,0.286,-0.138,0.632,0.437
+CATR_CHLRE_Rocklin_2023_2AMI,0.713,0.71,0.343,0.297,0.221,0.391,0.389,0.025,0.788,0.729
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.666,0.679,0.426,0.403,0.439,0.477,0.475,0.196,0.715,0.605
+CBX4_HUMAN_Rocklin_2023_2K28,0.582,0.75,0.533,0.493,0.609,0.537,0.597,0.105,0.755,0.575
+CSN4_MOUSE_Rocklin_2023_1UFM,0.698,0.711,0.367,0.559,0.387,0.493,0.454,0.042,0.751,0.611
+CUE1_YEAST_Rocklin_2023_2MYX,0.582,0.733,0.503,0.376,0.464,0.408,0.491,0.23,0.734,0.53
+DN7A_SACS2_Rocklin_2023_1JIC,0.477,0.703,0.209,0.136,0.297,0.189,0.204,0.228,0.726,0.569
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.753,0.668,0.423,0.389,0.289,0.557,0.46,0.372,0.727,0.689
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.673,0.732,0.397,0.56,0.562,0.524,0.477,0.099,0.739,0.581
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.737,0.78,0.554,0.626,0.578,0.578,0.596,0.238,0.824,0.633
+FECA_ECOLI_Rocklin_2023_2D1U,0.497,0.718,0.419,0.25,0.442,0.27,0.328,0.152,0.71,0.534
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.361,0.626,0.489,0.152,0.279,0.349,0.419,0.049,0.683,0.511
+HCP_LAMBD_Rocklin_2023_2L6Q,0.744,0.716,0.171,0.476,0.385,0.532,0.508,0.183,0.715,0.648
+HECD1_HUMAN_Rocklin_2023_3DKM,0.571,0.823,0.602,0.28,0.67,0.547,0.602,0.185,0.808,0.701
+ILF3_HUMAN_Rocklin_2023_2L33,0.599,0.632,0.416,0.349,0.404,0.394,0.431,-0.025,0.708,0.599
+ISDH_STAAW_Rocklin_2023_2LHR,0.589,0.478,0.004,0.282,0.135,0.041,0.043,-0.033,0.601,0.332
+MAFG_MOUSE_Rocklin_2023_1K1V,0.59,0.545,0.247,0.374,0.262,0.471,0.425,0.012,0.558,0.585
+MBD11_ARATH_Rocklin_2023_6ACV,0.495,0.669,0.473,0.186,0.449,0.395,0.484,0.249,0.735,0.508
+MYO3_YEAST_Rocklin_2023_2BTT,0.623,0.643,0.343,0.449,0.414,0.308,0.364,0.028,0.624,0.448
+NKX31_HUMAN_Rocklin_2023_2L9R,0.75,0.775,0.575,0.537,0.593,0.589,0.616,0.213,0.797,0.722
+NUSA_ECOLI_Rocklin_2023_1WCL,0.795,0.771,0.449,0.294,0.334,0.46,0.515,0.23,0.826,0.734
+NUSG_MYCTU_Rocklin_2023_2MI6,0.74,0.827,0.518,0.321,0.523,0.512,0.511,0.243,0.854,0.756
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.7,0.76,0.647,0.594,0.623,0.556,0.64,0.004,0.77,0.611
+ODP2_GEOSE_Rocklin_2023_1W4G,0.146,0.629,0.352,0.094,0.417,0.376,0.379,-0.117,0.65,0.416
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.802,0.41,0.177,0.593,0.217,0.284,0.232,0.05,0.483,0.38
+PIN1_HUMAN_Rocklin_2023_1I6C,0.643,0.769,0.631,0.576,0.651,0.69,0.718,0.027,0.776,0.741
+PITX2_HUMAN_Rocklin_2023_2L7M,0.581,0.706,0.505,0.469,0.547,0.511,0.512,0.017,0.682,0.62
+PKN1_HUMAN_Rocklin_2023_1URF,0.568,0.622,0.239,0.298,0.221,0.303,0.29,-0.101,0.705,0.567
+POLG_PESV_Rocklin_2023_2MXD,0.494,0.672,0.165,0.016,0.433,0.101,0.158,0.068,0.675,0.475
+PR40A_HUMAN_Rocklin_2023_1UZC,0.507,0.772,0.439,0.23,0.59,0.459,0.507,0.156,0.764,0.53
+PSAE_SYNP2_Rocklin_2023_1PSE,0.64,0.459,0.323,0.38,0.173,0.218,0.278,0.061,0.533,0.53
+RAD_ANTMA_Rocklin_2023_2CJJ,0.597,0.679,0.197,0.281,0.478,0.287,0.28,0.063,0.698,0.513
+RBP1_HUMAN_Rocklin_2023_2KWH,0.8,0.672,0.349,0.475,0.206,0.384,0.353,0.09,0.734,0.618
+RCD1_ARATH_Rocklin_2023_5OAO,0.786,0.729,0.284,0.183,0.259,0.253,0.3,-0.021,0.738,0.604
+RCRO_LAMBD_Rocklin_2023_1ORC,0.593,0.643,0.435,0.36,0.474,0.392,0.453,0.142,0.708,0.585
+RD23A_HUMAN_Rocklin_2023_1IFY,0.793,0.823,0.483,0.569,0.57,0.557,0.549,0.058,0.84,0.704
+RFAH_ECOLI_Rocklin_2023_2LCL,0.587,0.636,0.312,0.239,0.272,0.233,0.255,-0.064,0.648,0.549
+RL20_AQUAE_Rocklin_2023_1GYZ,0.491,0.445,0.297,0.378,0.231,0.334,0.321,0.086,0.445,0.449
+RPC1_BP434_Rocklin_2023_1R69,0.826,0.83,0.492,0.675,0.562,0.642,0.545,0.323,0.827,0.705
+RS15_GEOSE_Rocklin_2023_1A32,0.748,0.695,0.3,0.334,0.371,0.356,0.294,-0.161,0.657,0.606
+SAV1_MOUSE_Rocklin_2023_2YSB,0.609,0.673,0.458,0.584,0.637,0.531,0.528,0.306,0.688,0.539
+SBI_STAAM_Rocklin_2023_2JVG,0.536,0.756,0.369,0.263,0.545,0.319,0.4,0.18,0.747,0.505
+SCIN_STAAR_Rocklin_2023_2QFF,0.588,0.713,0.284,0.339,0.282,0.176,0.28,-0.021,0.73,0.48
+SDA_BACSU_Rocklin_2023_1PV0,0.643,0.734,0.537,0.405,0.581,0.498,0.55,-0.017,0.755,0.677
+SOX30_HUMAN_Rocklin_2023_7JJK,0.613,0.559,0.239,0.344,0.282,0.237,0.263,-0.085,0.644,0.407
+SPA_STAAU_Rocklin_2023_1LP1,0.535,0.702,0.308,0.096,0.448,0.304,0.357,0.014,0.68,0.622
+SPG2_STRSG_Rocklin_2023_5UBS,0.671,0.623,0.181,0.191,0.08,0.071,0.23,0.039,0.618,0.49
+SPTN1_CHICK_Rocklin_2023_1TUD,0.691,0.656,0.464,0.537,0.452,0.484,0.488,-0.04,0.664,0.6
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.694,0.833,0.622,0.395,0.613,0.605,0.637,0.051,0.861,0.746
+SR43C_ARATH_Rocklin_2023_2N88,0.667,0.716,0.591,0.522,0.581,0.577,0.606,0.062,0.751,0.644
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.594,0.747,0.552,0.485,0.635,0.574,0.597,-0.092,0.762,0.635
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.82,0.621,0.341,0.708,0.539,0.529,0.486,-0.068,0.665,0.658
+THO1_YEAST_Rocklin_2023_2WQG,0.523,0.588,0.26,0.368,0.392,0.231,0.251,0.025,0.646,0.428
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.774,0.718,0.566,0.606,0.472,0.562,0.586,0.173,0.722,0.659
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.667,0.709,0.483,0.57,0.518,0.427,0.487,0.102,0.726,0.448
+UBR5_HUMAN_Rocklin_2023_1I2T,0.648,0.723,0.395,0.315,0.358,0.447,0.466,0.265,0.742,0.665
+VG08_BPP22_Rocklin_2023_2GP8,0.788,0.68,0.265,0.562,0.459,0.468,0.418,0.17,0.725,0.665
+VILI_CHICK_Rocklin_2023_1YU5,0.671,0.658,0.595,0.517,0.565,0.411,0.567,-0.026,0.634,0.458
+VRPI_BPT7_Rocklin_2023_2WNM,0.705,0.614,0.165,0.335,0.229,-0.068,-0.006,0.155,0.633,0.506
+YAIA_ECOLI_Rocklin_2023_2KVT,0.576,0.635,0.224,0.25,0.287,0.279,0.324,0.11,0.644,0.385
+YNZC_BACSU_Rocklin_2023_2JVD,0.69,0.58,0.211,0.492,0.29,0.259,0.248,0.21,0.681,0.404
+POLG_DEN26_Suphatrakul_2023,0.178,0.51,0.289,-0.013,0.443,0.54,0.563,0.016,0.675,0.59
+MET_HUMAN_Estevam_2023,0.585,0.622,0.57,0.556,0.607,0.567,0.578,0.085,0.597,0.595
+RDRP_I33A0_Li_2023,0.112,0.489,0.388,0.047,0.484,0.475,0.527,0.032,0.561,0.503
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.351,0.344,0.344,0.347,0.308,0.288,0.335,-0.015,0.429,0.364
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.401,0.37,0.263,0.287,0.298,0.33,0.34,0.007,0.466,0.437
+ACE2_HUMAN_Chan_2020,0.255,0.268,0.199,0.177,0.211,0.191,0.197,0.039,0.385,0.262
+PPM1D_HUMAN_Miller_2022,0.606,0.558,0.539,0.603,0.539,0.595,0.62,0.149,0.622,0.596
+HMDH_HUMAN_Jiang_2019,0.569,0.289,0.394,0.341,0.269,0.329,0.359,-0.049,0.641,0.594
+CBS_HUMAN_Sun_2020,0.383,0.378,0.384,0.344,0.37,0.327,0.363,0.015,0.465,0.399
+NPC1_HUMAN_Erwood_2022_HEK293T,0.419,0.758,0.624,0.321,0.731,0.705,0.726,0.161,0.745,0.727
+NPC1_HUMAN_Erwood_2022_RPE1,0.401,0.783,0.597,0.479,0.771,0.808,0.792,0.595,0.734,0.717
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.278,-0.181,0.439,0.195,-0.138,0.43,0.427,-0.364,0.03,0.401
+SHOC2_HUMAN_Kwon_2022,0.434,0.407,0.398,0.396,0.404,0.378,0.41,0.062,0.502,0.437
+RAF1_HUMAN_Zinkus-Boltz_2019,0.239,0.476,0.388,0.427,0.479,0.4,0.448,0.066,0.381,0.268
+OPSD_HUMAN_Wan_2019,0.588,0.711,0.511,0.508,0.591,0.519,0.475,0.215,0.678,0.587
+CAR11_HUMAN_Meitlis_2020_lof,0.583,0.347,0.249,0.486,0.248,0.258,0.272,-0.141,0.517,0.481
+CAR11_HUMAN_Meitlis_2020_gof,0.389,0.197,0.162,0.323,0.201,0.186,0.187,0.003,0.441,0.37
+ERBB2_HUMAN_Elazar_2016,0.586,0.574,0.2,0.46,0.381,0.462,0.407,-0.008,0.536,0.608
+GLPA_HUMAN_Elazar_2016,0.529,0.415,0.238,0.4,0.333,0.415,0.398,0.202,0.436,0.541
+LYAM1_HUMAN_Elazar_2016,0.565,0.549,0.076,0.384,0.426,0.348,0.311,0.483,0.505,0.542
+GDIA_HUMAN_Silverstein_2021,0.461,0.524,0.443,0.424,0.457,0.439,0.461,-0.023,0.479,0.438
+PPARG_HUMAN_Majithia_2016,0.788,0.682,0.659,0.699,0.613,0.642,0.683,0.004,0.868,0.769
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.269,0.349,0.216,0.053,0.286,0.202,0.232,0.004,0.479,0.375
diff --git a/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_modulo_5.csv b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_modulo_5.csv
new file mode 100644
index 0000000..878058b
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_modulo_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,0.154,0.459,0.111,-0.04,0.454,0.355,0.385,0.057,0.467,0.353
+A0A192B1T2_9HIV1_Haddox_2018,0.493,0.544,0.433,0.492,0.515,0.515,0.529,-0.027,0.626,0.594
+A0A1I9GEU1_NEIME_Kennouche_2019,0.082,0.073,0.092,0.063,0.095,0.047,0.067,-0.039,0.044,0.085
+A0A2Z5U3Z0_9INFA_Doud_2016,0.597,0.481,0.516,0.515,0.53,0.537,0.572,-0.007,0.43,0.635
+A0A2Z5U3Z0_9INFA_Wu_2014,0.453,0.265,0.464,0.452,0.507,0.533,0.544,0.108,0.364,0.503
+A4D664_9INFA_Soh_2019,0.201,0.337,0.397,0.023,0.336,0.392,0.461,-0.021,0.45,0.507
+A4GRB6_PSEAI_Chen_2020,0.724,0.765,0.663,0.646,0.669,0.65,0.678,-0.0,0.775,0.695
+AACC1_PSEAI_Dandage_2018,0.458,0.568,0.421,0.488,0.487,0.471,0.5,0.012,0.454,0.436
+ADRB2_HUMAN_Jones_2020,0.536,0.502,0.512,0.526,0.499,0.533,0.536,0.016,0.547,0.571
+AMIE_PSEAE_Wrenbeck_2017,0.682,0.683,0.504,0.611,0.567,0.437,0.479,-0.02,0.694,0.668
+LGK_LIPST_Klesmith_2015,0.592,0.618,0.494,0.536,0.559,0.525,0.54,0.005,0.689,0.594
+BLAT_ECOLX_Deng_2012,0.517,0.56,0.522,0.501,0.521,0.473,0.523,-0.034,0.508,0.442
+BLAT_ECOLX_Firnberg_2014,0.762,0.802,0.744,0.661,0.701,0.63,0.73,0.046,0.808,0.663
+BLAT_ECOLX_Jacquier_2013,0.702,0.7,0.703,0.644,0.653,0.635,0.714,0.025,0.679,0.632
+BLAT_ECOLX_Stiffler_2015,0.763,0.794,0.742,0.665,0.688,0.622,0.73,-0.038,0.788,0.672
+BRCA1_HUMAN_Findlay_2018,0.456,0.501,0.428,0.405,0.498,0.514,0.534,-0.062,0.487,0.517
+C6KNH7_9INFA_Lee_2018,0.537,0.433,0.356,0.427,0.401,0.432,0.451,0.007,0.622,0.608
+CALM1_HUMAN_Weile_2017,0.242,0.278,0.233,0.236,0.251,0.267,0.247,0.02,0.203,0.21
+CAPSD_AAV2S_Sinai_2021,0.496,0.685,0.551,0.156,0.573,0.516,0.539,0.125,0.69,0.679
+CCDB_ECOLI_Adkar_2012,0.489,0.684,0.55,0.35,0.441,0.454,0.523,-0.013,0.659,0.506
+CCDB_ECOLI_Tripathi_2016,0.354,0.459,0.401,0.343,0.351,0.398,0.402,0.434,0.636,0.329
+CP2C9_HUMAN_Amorosi_2021_abundance,0.645,0.613,0.611,0.588,0.589,0.623,0.632,0.01,0.714,0.698
+CP2C9_HUMAN_Amorosi_2021_activity,0.685,0.674,0.646,0.643,0.599,0.639,0.651,0.087,0.759,0.738
+DLG4_HUMAN_Faure_2021,0.629,0.516,0.415,0.4,0.494,0.608,0.59,-0.025,0.516,0.663
+DLG4_RAT_McLaughlin_2012,0.601,0.53,0.488,0.56,0.495,0.442,0.541,0.05,0.577,0.529
+DYR_ECOLI_Thompson_2019,0.459,0.573,0.47,0.416,0.487,0.421,0.479,0.095,0.529,0.43
+ENV_HV1B9_DuenasDecamp_2016,0.434,0.359,0.362,0.432,0.383,0.417,0.388,0.168,0.3,0.405
+ENV_HV1BR_Haddox_2016,0.321,0.339,0.322,0.319,0.344,0.364,0.366,0.024,0.337,0.368
+ESTA_BACSU_Nutschel_2020,0.388,0.548,0.423,0.3,0.435,0.329,0.403,-0.103,0.511,0.34
+F7YBW8_MESOW_Aakre_2015,0.671,0.491,0.238,0.652,0.444,0.636,0.492,-0.418,0.342,0.495
+GAL4_YEAST_Kitzman_2015,0.46,0.629,0.569,0.462,0.58,0.564,0.511,0.164,0.612,0.48
+GCN4_YEAST_Staller_2018,0.181,0.582,0.608,0.459,0.614,0.575,0.627,0.238,0.41,0.435
+GFP_AEQVI_Sarkisyan_2016,0.244,0.493,0.473,0.044,0.442,0.412,0.521,-0.037,0.562,0.398
+GRB2_HUMAN_Faure_2021,0.698,0.617,0.556,0.573,0.589,0.502,0.59,0.179,0.638,0.534
+HIS7_YEAST_Pokusaeva_2019,-0.02,-0.051,0.083,0.087,0.064,0.07,0.07,-0.015,-0.106,-0.101
+HSP82_YEAST_Flynn_2019,0.411,0.426,0.409,0.393,0.424,0.411,0.424,0.009,0.513,0.481
+HSP82_YEAST_Mishra_2016,0.571,0.487,0.544,0.54,0.474,0.525,0.547,-0.082,0.592,0.55
+I6TAH8_I68A0_Doud_2015,0.188,0.346,0.263,0.02,0.322,0.347,0.4,0.026,0.42,0.445
+IF1_ECOLI_Kelsic_2016,0.52,0.504,0.541,0.536,0.24,0.492,0.536,0.135,0.52,0.585
+KCNH2_HUMAN_Kozek_2020,0.488,0.391,0.262,0.184,0.293,0.54,0.502,0.307,0.415,0.385
+KKA2_KLEPN_Melnikov_2014,0.664,0.701,0.62,0.595,0.592,0.583,0.628,-0.03,0.674,0.632
+MK01_HUMAN_Brenan_2016,0.312,0.335,0.238,0.164,0.134,0.087,0.199,-0.008,0.448,0.343
+MSH2_HUMAN_Jia_2020,0.384,0.395,0.377,0.38,0.396,0.347,0.384,-0.026,0.548,0.469
+MTH3_HAEAE_RockahShmuel_2015,0.706,0.706,0.714,0.687,0.674,0.633,0.71,0.006,0.661,0.621
+NCAP_I34A1_Doud_2015,0.161,0.358,0.33,0.018,0.357,0.424,0.441,-0.024,0.431,0.457
+NRAM_I33A0_Jiang_2016,0.356,0.729,0.482,0.147,0.619,0.613,0.625,0.106,0.729,0.727
+NUD15_HUMAN_Suiter_2020,0.629,0.719,0.595,0.6,0.671,0.604,0.633,-0.001,0.72,0.615
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.578,0.541,0.341,0.458,0.308,0.398,0.411,0.06,0.671,0.602
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.573,0.62,0.299,0.447,0.306,0.378,0.381,0.003,0.641,0.613
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.759,0.742,0.362,0.496,0.291,0.432,0.437,0.005,0.843,0.784
+P53_HUMAN_Kotler_2018,0.589,0.629,0.558,0.535,0.58,0.559,0.578,0.049,0.593,0.593
+P84126_THETH_Chan_2017,0.645,0.724,0.619,0.556,0.639,0.549,0.583,0.122,0.679,0.576
+PA_I34A1_Wu_2015,0.105,0.192,0.504,0.051,0.304,0.569,0.58,0.006,0.261,0.588
+PABP_YEAST_Melamed_2013,0.728,0.727,0.535,0.658,0.676,0.656,0.662,-0.018,0.72,0.724
+POLG_CXB3N_Mattenberger_2021,0.232,0.5,0.407,-0.06,0.499,0.409,0.455,-0.019,0.583,0.496
+POLG_HCVJF_Qi_2014,0.563,0.678,0.403,0.607,0.564,0.587,0.58,-0.058,0.664,0.56
+PTEN_HUMAN_Matreyek_2021,0.519,0.546,0.388,0.407,0.432,0.411,0.445,0.011,0.604,0.571
+PTEN_HUMAN_Mighell_2018,0.502,0.509,0.516,0.467,0.455,0.415,0.53,0.0,0.562,0.468
+Q2N0S5_9HIV1_Haddox_2018,0.517,0.531,0.393,0.509,0.512,0.499,0.512,0.022,0.599,0.633
+Q59976_STRSQ_Romero_2015,0.568,0.663,0.644,0.52,0.655,0.657,0.675,0.018,0.7,0.67
+R1AB_SARS2_Flynn_2022,0.332,0.242,0.234,-0.03,-0.037,0.399,0.565,0.0,0.419,0.591
+RASH_HUMAN_Bandaru_2017,0.611,0.614,0.475,0.351,0.41,0.451,0.488,0.113,0.61,0.612
+REV_HV1H2_Fernandes_2016,0.377,0.389,0.235,0.253,0.257,0.242,0.242,0.063,0.296,0.334
+RL40A_YEAST_Mavor_2016,0.533,0.534,0.41,0.273,0.379,0.386,0.403,-0.001,0.511,0.422
+RL40A_YEAST_Roscoe_2013,0.673,0.727,0.466,0.299,0.411,0.442,0.462,0.019,0.668,0.641
+RL40A_YEAST_Roscoe_2014,0.395,0.451,0.375,0.234,0.368,0.402,0.377,0.093,0.436,0.425
+SC6A4_HUMAN_Young_2021,0.55,0.568,0.433,0.53,0.568,0.53,0.544,-0.009,0.656,0.619
+SCN5A_HUMAN_Glazer_2019,0.131,0.156,0.178,0.218,0.137,0.081,0.169,-0.118,0.099,0.127
+SPG1_STRSG_Olson_2014,0.66,0.253,0.051,0.247,0.289,0.296,0.3,-0.055,0.247,0.607
+SPIKE_SARS2_Starr_2020_binding,0.233,0.338,0.163,-0.057,0.418,0.315,0.343,0.132,0.344,0.507
+SPIKE_SARS2_Starr_2020_expression,0.236,0.557,0.281,-0.021,0.555,0.367,0.475,0.026,0.61,0.565
+SRC_HUMAN_Ahler_2019,0.596,0.407,0.463,0.562,0.371,0.493,0.516,0.003,0.491,0.531
+SUMO1_HUMAN_Weile_2017,0.496,0.56,0.437,0.468,0.498,0.415,0.465,0.049,0.487,0.463
+SYUA_HUMAN_Newberry_2020,0.596,0.574,0.127,0.238,0.147,0.156,0.151,0.054,0.581,0.583
+TADBP_HUMAN_Bolognesi_2019,0.774,0.708,0.09,0.053,0.036,0.121,0.105,-0.013,0.766,0.729
+TAT_HV1BR_Fernandes_2016,0.464,0.486,0.266,0.338,0.326,0.237,0.266,-0.013,0.458,0.424
+TPK1_HUMAN_Weile_2017,0.315,0.368,0.225,0.268,0.276,0.303,0.275,-0.021,0.314,0.248
+TPMT_HUMAN_Matreyek_2018,0.554,0.564,0.509,0.516,0.475,0.478,0.514,0.055,0.529,0.527
+TPOR_HUMAN_Bridgford_2020,0.538,0.488,0.256,0.371,0.473,0.445,0.434,-0.056,0.526,0.48
+TRPC_SACS2_Chan_2017,0.662,0.696,0.576,0.611,0.66,0.593,0.603,0.053,0.64,0.614
+TRPC_THEMA_Chan_2017,0.501,0.55,0.418,0.482,0.476,0.44,0.445,-0.005,0.585,0.484
+UBC9_HUMAN_Weile_2017,0.516,0.6,0.531,0.478,0.521,0.477,0.542,0.025,0.545,0.472
+UBE4B_MOUSE_Starita_2013,0.442,0.364,0.465,0.447,0.363,0.39,0.472,-0.003,0.329,0.474
+VKOR1_HUMAN_Chiasson_2020_abundance,0.604,0.667,0.421,0.445,0.505,0.547,0.536,0.003,0.615,0.603
+VKOR1_HUMAN_Chiasson_2020_activity,0.398,0.458,0.409,0.428,0.418,0.403,0.438,0.033,0.375,0.33
+YAP1_HUMAN_Araya_2012,0.555,0.514,0.648,0.383,0.055,0.476,0.611,0.026,0.473,0.551
+Q8WTC7_9CNID_Somermeyer_2022,0.238,0.482,0.142,-0.003,0.348,0.275,0.307,0.013,0.439,0.273
+D7PM05_CLYGR_Somermeyer_2022,0.291,0.702,0.505,0.003,0.597,0.403,0.522,0.071,0.689,0.455
+Q6WV13_9MAXI_Somermeyer_2022,0.177,0.528,0.256,0.048,0.355,0.262,0.303,0.085,0.589,0.278
+SPG1_STRSG_Wu_2016,0.58,-0.518,-0.509,0.29,-0.654,-0.108,-0.258,-0.222,-0.235,0.243
+CAS9_STRP1_Spencer_2017_positive,0.071,0.195,0.165,0.066,0.194,0.184,0.192,0.001,0.193,0.174
+A0A247D711_LISMN_Stadelmann_2021,0.278,0.511,0.043,0.081,0.303,0.274,0.236,-0.051,0.402,0.298
+HXK4_HUMAN_Gersing_2022_activity,0.518,0.552,0.488,0.463,0.523,0.443,0.491,0.012,0.618,0.536
+ANCSZ_Hobbs_2022,0.588,0.532,0.47,0.523,0.484,0.537,0.543,0.017,0.591,0.601
+PAI1_HUMAN_Huttinger_2021,0.414,0.489,0.392,0.407,0.438,0.411,0.425,0.006,0.538,0.447
+HXK4_HUMAN_Gersing_2023_abundance,0.438,0.427,0.351,0.332,0.404,0.358,0.369,-0.002,0.557,0.48
+PHOT_CHLRE_Chen_2023,0.647,0.659,0.613,0.616,0.598,0.564,0.59,0.019,0.645,0.586
+S22A1_HUMAN_Yee_2023_abundance,0.628,0.606,0.574,0.602,0.6,0.631,0.635,-0.035,0.723,0.687
+S22A1_HUMAN_Yee_2023_activity,0.596,0.551,0.568,0.542,0.547,0.59,0.601,-0.017,0.685,0.645
+KCNE1_HUMAN_Muhammad_2023_expression,0.601,0.459,0.118,0.162,0.068,0.11,0.115,0.095,0.561,0.563
+KCNE1_HUMAN_Muhammad_2023_function,0.657,0.674,0.489,0.472,0.479,0.618,0.633,0.036,0.672,0.674
+RNC_ECOLI_Weeks_2023,0.652,0.672,0.581,0.575,0.59,0.547,0.605,-0.029,0.706,0.636
+HEM3_HUMAN_Loggerenberg_2023,0.419,0.449,0.408,0.376,0.433,0.436,0.44,-0.029,0.48,0.467
+OXDA_RHOTO_Vanella_2023_expression,0.432,0.468,0.287,0.299,0.306,0.267,0.278,0.032,0.52,0.413
+OXDA_RHOTO_Vanella_2023_activity,0.405,0.438,0.371,0.339,0.345,0.305,0.355,-0.017,0.455,0.355
+PRKN_HUMAN_Clausen_2023,0.66,0.502,0.586,0.598,0.5,0.64,0.654,0.035,0.676,0.684
+DYR_ECOLI_Nguyen_2023,0.502,0.563,0.453,0.528,0.471,0.357,0.416,0.069,0.587,0.408
+ENVZ_ECOLI_Ghose_2023,0.164,0.231,0.188,0.211,0.19,0.229,0.211,-0.002,0.211,0.131
+MLAC_ECOLI_MacRae_2023,0.558,0.579,0.422,0.397,0.418,0.315,0.395,0.025,0.606,0.525
+Q837P5_ENTFA_Meier_2023,0.392,0.503,0.454,0.327,0.326,0.465,0.449,-0.078,0.571,0.58
+Q837P4_ENTFA_Meier_2023,0.63,0.611,0.506,0.545,0.529,0.52,0.551,0.19,0.675,0.623
+CCR5_HUMAN_Gill_2023,0.375,0.384,0.29,0.347,0.358,0.376,0.324,0.007,0.441,0.45
+Q53Z42_HUMAN_McShan_2019_expression,0.535,0.618,0.54,0.468,0.559,0.533,0.565,-0.025,0.591,0.554
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.591,0.572,0.356,0.302,0.316,0.296,0.353,-0.06,0.604,0.6
+RPC1_LAMBD_Li_2019_low-expression,0.492,0.624,0.507,0.466,0.431,0.47,0.507,0.24,0.529,0.447
+RPC1_LAMBD_Li_2019_high-expression,0.429,0.47,0.502,0.461,0.474,0.472,0.498,0.135,0.439,0.42
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,-0.112,0.378,0.71,0.041,0.289,0.393,0.416,0.016,0.251,0.209
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.414,0.467,0.361,0.405,0.436,0.354,0.37,-0.006,0.489,0.416
+A4_HUMAN_Seuma_2022,0.58,0.433,0.45,0.289,0.338,0.448,0.479,-0.002,0.555,0.58
+SRC_HUMAN_Nguyen_2022,0.524,0.517,0.425,0.48,0.47,0.398,0.428,-0.061,0.497,0.439
+F7YBW7_MESOW_Ding_2023,0.292,0.48,0.228,0.317,0.434,0.51,0.417,-0.361,0.377,0.479
+RASK_HUMAN_Weng_2022_abundance,0.54,0.618,0.153,0.078,0.165,0.283,0.21,0.104,0.632,0.628
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.618,0.658,0.502,0.474,0.579,0.488,0.515,0.01,0.711,0.68
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.5,0.502,0.424,0.476,0.467,0.396,0.431,-0.099,0.455,0.447
+AICDA_HUMAN_Gajula_2014_3cycles,0.208,0.087,0.346,0.222,0.203,0.225,0.313,-0.19,0.185,0.068
+OTC_HUMAN_Lo_2023,0.705,0.768,0.518,0.575,0.573,0.612,0.597,0.062,0.783,0.664
+CASP3_HUMAN_Roychowdhury_2020,0.623,0.688,0.621,0.59,0.649,0.584,0.627,-0.017,0.641,0.561
+CASP7_HUMAN_Roychowdhury_2020,0.607,0.654,0.617,0.605,0.634,0.585,0.627,-0.023,0.607,0.568
+SERC_HUMAN_Xie_2023,0.538,0.557,0.543,0.534,0.546,0.543,0.551,-0.055,0.552,0.549
+MTHR_HUMAN_Weile_2021,0.501,0.291,0.255,0.29,0.29,0.254,0.268,0.01,0.622,0.561
+AMFR_HUMAN_Rocklin_2023_4G3O,0.594,0.761,0.474,0.183,0.448,0.383,0.469,-0.07,0.794,0.623
+ARGR_ECOLI_Rocklin_2023_1AOY,0.58,0.696,0.429,0.275,0.431,0.438,0.47,0.024,0.735,0.611
+BBC1_YEAST_Rocklin_2023_1TG0,0.64,0.675,0.426,0.453,0.377,0.414,0.425,0.02,0.69,0.605
+BCHB_CHLTE_Rocklin_2023_2KRU,0.573,0.648,0.284,0.314,0.419,0.35,0.302,0.018,0.675,0.512
+CATR_CHLRE_Rocklin_2023_2AMI,0.757,0.742,0.347,0.322,0.248,0.408,0.4,-0.011,0.819,0.76
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.667,0.71,0.411,0.406,0.441,0.46,0.457,-0.068,0.724,0.613
+CBX4_HUMAN_Rocklin_2023_2K28,0.578,0.766,0.519,0.474,0.614,0.515,0.58,-0.085,0.765,0.572
+CSN4_MOUSE_Rocklin_2023_1UFM,0.727,0.739,0.364,0.555,0.4,0.499,0.457,-0.037,0.786,0.654
+CUE1_YEAST_Rocklin_2023_2MYX,0.66,0.782,0.505,0.377,0.491,0.405,0.499,0.155,0.78,0.582
+DN7A_SACS2_Rocklin_2023_1JIC,0.519,0.73,0.217,0.113,0.313,0.203,0.224,-0.023,0.757,0.553
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.749,0.723,0.433,0.397,0.315,0.557,0.473,0.158,0.772,0.724
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.681,0.72,0.379,0.558,0.54,0.506,0.452,-0.08,0.725,0.55
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.743,0.785,0.552,0.624,0.568,0.576,0.591,0.078,0.817,0.634
+FECA_ECOLI_Rocklin_2023_2D1U,0.481,0.702,0.429,0.236,0.438,0.275,0.338,0.034,0.719,0.465
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.392,0.63,0.494,0.169,0.284,0.342,0.416,0.078,0.663,0.53
+HCP_LAMBD_Rocklin_2023_2L6Q,0.762,0.73,0.159,0.46,0.366,0.534,0.512,0.036,0.725,0.668
+HECD1_HUMAN_Rocklin_2023_3DKM,0.563,0.828,0.608,0.267,0.666,0.543,0.602,0.157,0.819,0.681
+ILF3_HUMAN_Rocklin_2023_2L33,0.621,0.666,0.431,0.362,0.421,0.401,0.435,-0.051,0.741,0.623
+ISDH_STAAW_Rocklin_2023_2LHR,0.633,0.517,0.004,0.288,0.169,0.044,0.049,0.049,0.555,0.369
+MAFG_MOUSE_Rocklin_2023_1K1V,0.593,0.527,0.222,0.349,0.224,0.45,0.399,-0.035,0.557,0.578
+MBD11_ARATH_Rocklin_2023_6ACV,0.562,0.707,0.447,0.183,0.44,0.384,0.467,0.125,0.75,0.555
+MYO3_YEAST_Rocklin_2023_2BTT,0.621,0.648,0.343,0.445,0.412,0.296,0.362,0.01,0.651,0.518
+NKX31_HUMAN_Rocklin_2023_2L9R,0.759,0.792,0.594,0.532,0.598,0.593,0.625,0.135,0.82,0.721
+NUSA_ECOLI_Rocklin_2023_1WCL,0.774,0.806,0.429,0.338,0.375,0.463,0.502,0.019,0.837,0.732
+NUSG_MYCTU_Rocklin_2023_2MI6,0.758,0.847,0.495,0.314,0.503,0.481,0.488,0.095,0.866,0.746
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.72,0.768,0.636,0.584,0.617,0.552,0.628,0.055,0.768,0.609
+ODP2_GEOSE_Rocklin_2023_1W4G,0.2,0.633,0.389,0.102,0.45,0.399,0.415,0.066,0.645,0.416
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.805,0.461,0.216,0.624,0.253,0.301,0.267,0.095,0.55,0.463
+PIN1_HUMAN_Rocklin_2023_1I6C,0.691,0.769,0.615,0.558,0.655,0.669,0.696,-0.092,0.762,0.761
+PITX2_HUMAN_Rocklin_2023_2L7M,0.608,0.711,0.512,0.454,0.559,0.519,0.522,-0.008,0.677,0.59
+PKN1_HUMAN_Rocklin_2023_1URF,0.56,0.603,0.224,0.288,0.219,0.305,0.285,0.009,0.68,0.574
+POLG_PESV_Rocklin_2023_2MXD,0.545,0.687,0.202,0.049,0.445,0.124,0.2,0.033,0.709,0.505
+PR40A_HUMAN_Rocklin_2023_1UZC,0.531,0.79,0.438,0.238,0.588,0.473,0.51,0.11,0.782,0.576
+PSAE_SYNP2_Rocklin_2023_1PSE,0.659,0.521,0.336,0.385,0.189,0.228,0.286,0.017,0.588,0.533
+RAD_ANTMA_Rocklin_2023_2CJJ,0.606,0.666,0.187,0.294,0.466,0.274,0.257,-0.095,0.688,0.494
+RBP1_HUMAN_Rocklin_2023_2KWH,0.807,0.682,0.326,0.462,0.177,0.369,0.332,0.062,0.739,0.646
+RCD1_ARATH_Rocklin_2023_5OAO,0.789,0.729,0.289,0.18,0.274,0.245,0.305,-0.113,0.736,0.608
+RCRO_LAMBD_Rocklin_2023_1ORC,0.608,0.712,0.454,0.359,0.486,0.409,0.464,0.169,0.723,0.59
+RD23A_HUMAN_Rocklin_2023_1IFY,0.799,0.806,0.448,0.567,0.568,0.537,0.524,-0.172,0.839,0.676
+RFAH_ECOLI_Rocklin_2023_2LCL,0.607,0.65,0.314,0.242,0.278,0.222,0.262,-0.134,0.641,0.577
+RL20_AQUAE_Rocklin_2023_1GYZ,0.506,0.447,0.256,0.356,0.213,0.291,0.272,-0.049,0.413,0.416
+RPC1_BP434_Rocklin_2023_1R69,0.827,0.837,0.536,0.675,0.566,0.644,0.567,0.283,0.832,0.716
+RS15_GEOSE_Rocklin_2023_1A32,0.776,0.725,0.34,0.363,0.406,0.372,0.326,-0.023,0.734,0.631
+SAV1_MOUSE_Rocklin_2023_2YSB,0.59,0.64,0.439,0.57,0.609,0.502,0.494,0.058,0.694,0.548
+SBI_STAAM_Rocklin_2023_2JVG,0.572,0.739,0.341,0.257,0.54,0.306,0.381,0.072,0.74,0.558
+SCIN_STAAR_Rocklin_2023_2QFF,0.636,0.741,0.269,0.31,0.267,0.15,0.256,-0.037,0.759,0.541
+SDA_BACSU_Rocklin_2023_1PV0,0.706,0.788,0.553,0.43,0.59,0.514,0.572,-0.06,0.763,0.669
+SOX30_HUMAN_Rocklin_2023_7JJK,0.66,0.574,0.239,0.346,0.27,0.223,0.256,0.003,0.674,0.443
+SPA_STAAU_Rocklin_2023_1LP1,0.521,0.721,0.287,0.09,0.443,0.294,0.342,0.013,0.702,0.617
+SPG2_STRSG_Rocklin_2023_5UBS,0.646,0.631,0.192,0.179,0.098,0.087,0.249,-0.017,0.65,0.499
+SPTN1_CHICK_Rocklin_2023_1TUD,0.676,0.638,0.454,0.526,0.44,0.478,0.483,0.041,0.661,0.605
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.698,0.826,0.623,0.394,0.605,0.603,0.634,0.116,0.862,0.766
+SR43C_ARATH_Rocklin_2023_2N88,0.654,0.698,0.541,0.489,0.549,0.536,0.563,0.081,0.726,0.624
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.596,0.75,0.56,0.507,0.633,0.574,0.601,-0.046,0.754,0.628
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.812,0.648,0.345,0.711,0.547,0.538,0.493,0.102,0.704,0.691
+THO1_YEAST_Rocklin_2023_2WQG,0.544,0.669,0.299,0.355,0.391,0.262,0.29,0.094,0.709,0.505
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.761,0.732,0.556,0.594,0.482,0.558,0.575,-0.11,0.742,0.677
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.68,0.741,0.488,0.552,0.533,0.429,0.488,0.069,0.727,0.517
+UBR5_HUMAN_Rocklin_2023_1I2T,0.653,0.724,0.379,0.333,0.345,0.442,0.456,0.06,0.709,0.647
+VG08_BPP22_Rocklin_2023_2GP8,0.796,0.676,0.238,0.56,0.428,0.461,0.403,0.061,0.663,0.66
+VILI_CHICK_Rocklin_2023_1YU5,0.67,0.652,0.588,0.51,0.566,0.409,0.565,-0.036,0.582,0.446
+VRPI_BPT7_Rocklin_2023_2WNM,0.723,0.596,0.183,0.345,0.225,-0.045,0.009,0.032,0.625,0.505
+YAIA_ECOLI_Rocklin_2023_2KVT,0.549,0.654,0.232,0.248,0.293,0.283,0.328,0.057,0.607,0.399
+YNZC_BACSU_Rocklin_2023_2JVD,0.691,0.525,0.207,0.476,0.28,0.27,0.255,-0.059,0.652,0.382
+POLG_DEN26_Suphatrakul_2023,0.254,0.505,0.285,-0.005,0.442,0.543,0.565,0.031,0.684,0.597
+MET_HUMAN_Estevam_2023,0.59,0.618,0.565,0.55,0.602,0.565,0.575,0.1,0.601,0.6
+RDRP_I33A0_Li_2023,0.126,0.488,0.388,0.046,0.486,0.474,0.526,0.037,0.582,0.514
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.348,0.349,0.342,0.347,0.31,0.289,0.334,0.039,0.435,0.374
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.418,0.408,0.264,0.286,0.298,0.327,0.339,0.004,0.484,0.457
+ACE2_HUMAN_Chan_2020,0.254,0.267,0.213,0.181,0.225,0.216,0.222,0.086,0.365,0.264
+PPM1D_HUMAN_Miller_2022,0.602,0.545,0.535,0.598,0.537,0.592,0.617,0.026,0.63,0.612
+HMDH_HUMAN_Jiang_2019,0.582,0.305,0.393,0.34,0.269,0.326,0.358,-0.001,0.641,0.6
+CBS_HUMAN_Sun_2020,0.375,0.377,0.381,0.342,0.371,0.325,0.361,0.014,0.455,0.396
+NPC1_HUMAN_Erwood_2022_HEK293T,0.448,0.748,0.63,0.327,0.736,0.692,0.722,-0.103,0.733,0.709
+NPC1_HUMAN_Erwood_2022_RPE1,0.225,0.761,0.549,0.268,0.755,0.77,0.74,0.441,0.734,0.663
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.298,-0.167,0.434,0.139,-0.124,0.408,0.41,-0.248,-0.172,0.359
+SHOC2_HUMAN_Kwon_2022,0.431,0.391,0.383,0.384,0.391,0.366,0.396,0.027,0.513,0.467
+RAF1_HUMAN_Zinkus-Boltz_2019,0.338,0.463,0.375,0.408,0.47,0.386,0.426,0.167,0.376,0.266
+OPSD_HUMAN_Wan_2019,0.655,0.706,0.547,0.51,0.582,0.517,0.507,0.153,0.695,0.66
+CAR11_HUMAN_Meitlis_2020_lof,0.597,0.362,0.25,0.488,0.251,0.264,0.272,-0.087,0.578,0.538
+CAR11_HUMAN_Meitlis_2020_gof,0.382,0.189,0.149,0.325,0.195,0.18,0.178,-0.016,0.478,0.384
+ERBB2_HUMAN_Elazar_2016,0.643,0.625,0.247,0.466,0.411,0.483,0.436,-0.079,0.548,0.658
+GLPA_HUMAN_Elazar_2016,0.573,0.473,0.219,0.394,0.352,0.434,0.395,0.162,0.488,0.65
+LYAM1_HUMAN_Elazar_2016,0.554,0.551,0.205,0.369,0.412,0.339,0.311,0.247,0.489,0.509
+GDIA_HUMAN_Silverstein_2021,0.469,0.527,0.443,0.421,0.456,0.434,0.458,0.012,0.506,0.427
+PPARG_HUMAN_Majithia_2016,0.807,0.677,0.654,0.696,0.608,0.637,0.68,0.016,0.866,0.807
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.287,0.348,0.207,0.043,0.285,0.198,0.227,0.017,0.46,0.412
diff --git a/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_random_5.csv b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_random_5.csv
new file mode 100644
index 0000000..d639437
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level_fold_random_5.csv
@@ -0,0 +1,218 @@
+DMS_id,ESM-1v Embeddings,MSA Transformer Embeddings,DeepSequence + One-Hot Encodings,ESM-1v + One-Hot Encodings,MSA_Transformer + One-Hot Encodings,Tranception + One-Hot Encodings,TranceptEVE + One-Hot Encodings,One-Hot Encodings,ProteinNPT,Tranception Embeddings
+A0A140D2T1_ZIKV_Sourisseau_2019,0.273,0.459,0.137,-0.013,0.462,0.367,0.393,0.312,0.507,0.505
+A0A192B1T2_9HIV1_Haddox_2018,0.497,0.537,0.465,0.524,0.543,0.546,0.559,0.573,0.756,0.614
+A0A1I9GEU1_NEIME_Kennouche_2019,0.057,0.133,0.12,0.097,0.124,0.101,0.111,0.111,0.116,0.064
+A0A2Z5U3Z0_9INFA_Doud_2016,0.612,0.595,0.546,0.549,0.561,0.564,0.598,0.705,0.715,0.695
+A0A2Z5U3Z0_9INFA_Wu_2014,0.458,0.321,0.476,0.468,0.518,0.534,0.552,0.302,0.407,0.524
+A4D664_9INFA_Soh_2019,0.271,0.336,0.446,0.102,0.375,0.433,0.492,0.562,0.452,0.565
+A4GRB6_PSEAI_Chen_2020,0.794,0.785,0.71,0.704,0.717,0.711,0.727,0.792,0.901,0.813
+AACC1_PSEAI_Dandage_2018,0.603,0.654,0.493,0.551,0.546,0.538,0.551,0.582,0.617,0.599
+ADRB2_HUMAN_Jones_2020,0.548,0.507,0.525,0.536,0.512,0.543,0.547,0.492,0.591,0.588
+AMIE_PSEAE_Wrenbeck_2017,0.713,0.71,0.56,0.651,0.615,0.506,0.536,0.701,0.836,0.77
+LGK_LIPST_Klesmith_2015,0.604,0.62,0.517,0.562,0.582,0.561,0.565,0.567,0.783,0.646
+BLAT_ECOLX_Deng_2012,0.663,0.606,0.588,0.573,0.59,0.569,0.591,0.734,0.833,0.727
+BLAT_ECOLX_Firnberg_2014,0.826,0.828,0.773,0.711,0.743,0.699,0.766,0.741,0.904,0.822
+BLAT_ECOLX_Jacquier_2013,0.738,0.726,0.714,0.664,0.669,0.666,0.731,0.525,0.72,0.677
+BLAT_ECOLX_Stiffler_2015,0.831,0.823,0.772,0.716,0.733,0.691,0.766,0.748,0.903,0.841
+BRCA1_HUMAN_Findlay_2018,0.474,0.501,0.47,0.449,0.526,0.542,0.557,0.425,0.535,0.591
+C6KNH7_9INFA_Lee_2018,0.56,0.473,0.404,0.469,0.447,0.469,0.489,0.67,0.743,0.679
+CALM1_HUMAN_Weile_2017,0.286,0.303,0.258,0.27,0.277,0.285,0.268,0.164,0.275,0.262
+CAPSD_AAV2S_Sinai_2021,0.746,0.802,0.685,0.611,0.711,0.653,0.654,0.647,0.852,0.835
+CCDB_ECOLI_Adkar_2012,0.886,0.972,0.717,0.683,0.689,0.687,0.7,0.972,0.995,0.904
+CCDB_ECOLI_Tripathi_2016,0.481,0.507,0.424,0.374,0.413,0.424,0.431,0.444,0.699,0.435
+CP2C9_HUMAN_Amorosi_2021_abundance,0.67,0.604,0.638,0.613,0.613,0.648,0.656,0.616,0.788,0.743
+CP2C9_HUMAN_Amorosi_2021_activity,0.691,0.674,0.679,0.674,0.633,0.672,0.683,0.681,0.868,0.793
+DLG4_HUMAN_Faure_2021,0.721,0.566,0.59,0.586,0.625,0.692,0.694,0.622,0.767,0.783
+DLG4_RAT_McLaughlin_2012,0.651,0.527,0.598,0.625,0.591,0.558,0.62,0.694,0.801,0.745
+DYR_ECOLI_Thompson_2019,0.644,0.703,0.548,0.51,0.562,0.527,0.557,0.62,0.766,0.692
+ENV_HV1B9_DuenasDecamp_2016,0.545,0.56,0.534,0.513,0.528,0.555,0.547,0.477,0.569,0.585
+ENV_HV1BR_Haddox_2016,0.32,0.338,0.324,0.323,0.347,0.363,0.365,0.066,0.344,0.363
+ESTA_BACSU_Nutschel_2020,0.61,0.689,0.512,0.466,0.526,0.488,0.498,0.653,0.706,0.634
+F7YBW8_MESOW_Aakre_2015,0.539,0.525,0.42,0.555,0.532,0.439,0.437,0.447,0.552,0.439
+GAL4_YEAST_Kitzman_2015,0.697,0.67,0.636,0.637,0.666,0.674,0.662,0.667,0.695,0.689
+GCN4_YEAST_Staller_2018,-0.036,0.373,-0.157,0.177,-0.045,-0.166,-0.074,-0.465,0.1,0.304
+GFP_AEQVI_Sarkisyan_2016,0.348,0.515,0.504,0.181,0.485,0.454,0.545,0.451,0.608,0.495
+GRB2_HUMAN_Faure_2021,0.861,0.863,0.738,0.743,0.743,0.729,0.756,0.722,0.872,0.862
+HIS7_YEAST_Pokusaeva_2019,0.015,-0.013,0.042,0.023,-0.011,0.009,0.023,-0.134,-0.078,-0.04
+HSP82_YEAST_Flynn_2019,0.409,0.427,0.429,0.415,0.444,0.429,0.443,0.492,0.56,0.533
+HSP82_YEAST_Mishra_2016,0.618,0.519,0.579,0.578,0.531,0.558,0.579,0.665,0.696,0.65
+I6TAH8_I68A0_Doud_2015,0.278,0.342,0.37,0.153,0.378,0.405,0.452,0.652,0.71,0.636
+IF1_ECOLI_Kelsic_2016,0.733,0.762,0.612,0.619,0.447,0.582,0.609,0.713,0.853,0.798
+KCNH2_HUMAN_Kozek_2020,0.729,0.729,0.544,0.522,0.637,0.644,0.645,0.398,0.75,0.758
+KKA2_KLEPN_Melnikov_2014,0.716,0.721,0.655,0.635,0.634,0.635,0.664,0.7,0.79,0.737
+MK01_HUMAN_Brenan_2016,0.362,0.371,0.273,0.207,0.187,0.158,0.239,0.441,0.564,0.45
+MSH2_HUMAN_Jia_2020,0.382,0.395,0.391,0.394,0.408,0.367,0.399,0.512,0.572,0.521
+MTH3_HAEAE_RockahShmuel_2015,0.71,0.714,0.723,0.703,0.694,0.668,0.723,0.561,0.72,0.655
+NCAP_I34A1_Doud_2015,0.251,0.359,0.389,0.135,0.394,0.46,0.475,0.459,0.61,0.6
+NRAM_I33A0_Jiang_2016,0.577,0.775,0.665,0.569,0.72,0.735,0.747,0.652,0.797,0.754
+NUD15_HUMAN_Suiter_2020,0.782,0.807,0.674,0.677,0.724,0.686,0.704,0.709,0.827,0.796
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.62,0.529,0.416,0.488,0.363,0.438,0.452,0.686,0.734,0.689
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.621,0.639,0.401,0.488,0.378,0.431,0.433,0.67,0.71,0.681
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.796,0.746,0.479,0.545,0.388,0.489,0.497,0.84,0.888,0.837
+P53_HUMAN_Kotler_2018,0.672,0.654,0.624,0.604,0.645,0.626,0.643,0.589,0.727,0.729
+P84126_THETH_Chan_2017,0.82,0.823,0.72,0.69,0.743,0.694,0.711,0.649,0.891,0.839
+PA_I34A1_Wu_2015,0.12,0.22,0.516,0.105,0.327,0.578,0.591,0.318,0.328,0.607
+PABP_YEAST_Melamed_2013,0.814,0.79,0.7,0.752,0.773,0.743,0.752,0.695,0.831,0.833
+POLG_CXB3N_Mattenberger_2021,0.335,0.5,0.42,-0.015,0.51,0.424,0.468,0.326,0.626,0.614
+POLG_HCVJF_Qi_2014,0.751,0.723,0.517,0.652,0.628,0.615,0.609,0.607,0.812,0.756
+PTEN_HUMAN_Matreyek_2021,0.556,0.554,0.437,0.457,0.475,0.471,0.49,0.618,0.686,0.653
+PTEN_HUMAN_Mighell_2018,0.522,0.513,0.533,0.494,0.486,0.455,0.547,0.542,0.621,0.591
+Q2N0S5_9HIV1_Haddox_2018,0.515,0.538,0.431,0.54,0.546,0.532,0.544,0.68,0.634,0.671
+Q59976_STRSQ_Romero_2015,0.58,0.665,0.656,0.541,0.669,0.67,0.685,0.534,0.739,0.683
+R1AB_SARS2_Flynn_2022,0.494,0.337,0.404,0.438,0.463,0.498,0.626,0.749,0.814,0.769
+RASH_HUMAN_Bandaru_2017,0.772,0.727,0.579,0.497,0.532,0.565,0.594,0.695,0.842,0.83
+REV_HV1H2_Fernandes_2016,0.533,0.544,0.363,0.376,0.367,0.361,0.374,0.476,0.53,0.544
+RL40A_YEAST_Mavor_2016,0.704,0.723,0.586,0.532,0.578,0.586,0.582,0.639,0.73,0.692
+RL40A_YEAST_Roscoe_2013,0.809,0.84,0.659,0.603,0.643,0.663,0.661,0.714,0.887,0.811
+RL40A_YEAST_Roscoe_2014,0.722,0.747,0.578,0.53,0.579,0.598,0.574,0.708,0.792,0.735
+SC6A4_HUMAN_Young_2021,0.547,0.568,0.457,0.547,0.583,0.55,0.562,0.517,0.675,0.662
+SCN5A_HUMAN_Glazer_2019,0.113,0.146,0.147,0.198,0.127,0.101,0.135,0.096,0.001,0.094
+SPG1_STRSG_Olson_2014,0.851,0.537,0.676,0.615,0.665,0.638,0.626,0.821,0.699,0.855
+SPIKE_SARS2_Starr_2020_binding,0.409,0.632,0.383,0.339,0.522,0.466,0.489,0.751,0.788,0.766
+SPIKE_SARS2_Starr_2020_expression,0.429,0.557,0.453,0.446,0.647,0.51,0.587,0.791,0.893,0.836
+SRC_HUMAN_Ahler_2019,0.674,0.427,0.56,0.62,0.48,0.57,0.588,0.722,0.8,0.776
+SUMO1_HUMAN_Weile_2017,0.641,0.642,0.547,0.554,0.565,0.525,0.553,0.564,0.585,0.612
+SYUA_HUMAN_Newberry_2020,0.651,0.618,0.173,0.275,0.208,0.192,0.19,0.313,0.66,0.631
+TADBP_HUMAN_Bolognesi_2019,0.797,0.655,0.187,0.111,0.181,0.213,0.21,0.245,0.768,0.795
+TAT_HV1BR_Fernandes_2016,0.603,0.606,0.465,0.47,0.457,0.446,0.457,0.54,0.603,0.596
+TPK1_HUMAN_Weile_2017,0.35,0.408,0.258,0.309,0.303,0.333,0.304,0.223,0.382,0.319
+TPMT_HUMAN_Matreyek_2018,0.587,0.58,0.534,0.539,0.502,0.513,0.54,0.513,0.581,0.59
+TPOR_HUMAN_Bridgford_2020,0.636,0.648,0.464,0.527,0.589,0.594,0.583,0.375,0.648,0.678
+TRPC_SACS2_Chan_2017,0.776,0.782,0.68,0.696,0.734,0.691,0.694,0.601,0.817,0.783
+TRPC_THEMA_Chan_2017,0.729,0.745,0.588,0.63,0.628,0.606,0.607,0.672,0.814,0.744
+UBC9_HUMAN_Weile_2017,0.689,0.709,0.609,0.575,0.597,0.577,0.617,0.631,0.713,0.695
+UBE4B_MOUSE_Starita_2013,0.526,0.366,0.514,0.498,0.461,0.462,0.52,0.458,0.342,0.521
+VKOR1_HUMAN_Chiasson_2020_abundance,0.754,0.761,0.519,0.533,0.577,0.624,0.612,0.547,0.81,0.807
+VKOR1_HUMAN_Chiasson_2020_activity,0.422,0.427,0.421,0.435,0.424,0.408,0.44,0.247,0.33,0.329
+YAP1_HUMAN_Araya_2012,0.765,0.652,0.737,0.646,0.597,0.655,0.719,0.643,0.773,0.696
+Q8WTC7_9CNID_Somermeyer_2022,0.437,0.593,0.283,0.36,0.447,0.409,0.433,0.595,0.662,0.534
+D7PM05_CLYGR_Somermeyer_2022,0.406,0.74,0.566,0.441,0.658,0.501,0.589,0.641,0.82,0.622
+Q6WV13_9MAXI_Somermeyer_2022,0.381,0.591,0.379,0.312,0.451,0.393,0.409,0.645,0.735,0.503
+SPG1_STRSG_Wu_2016,0.738,0.805,0.562,0.761,0.651,0.754,0.677,0.707,0.86,0.68
+CAS9_STRP1_Spencer_2017_positive,0.075,0.195,0.166,0.071,0.195,0.188,0.192,0.036,0.191,0.179
+A0A247D711_LISMN_Stadelmann_2021,0.62,0.657,0.475,0.49,0.512,0.564,0.506,0.6,0.632,0.608
+HXK4_HUMAN_Gersing_2022_activity,0.533,0.556,0.519,0.494,0.547,0.505,0.527,0.616,0.657,0.638
+ANCSZ_Hobbs_2022,0.634,0.557,0.535,0.564,0.537,0.575,0.581,0.649,0.768,0.724
+PAI1_HUMAN_Huttinger_2021,0.412,0.492,0.414,0.427,0.452,0.434,0.444,0.425,0.591,0.475
+HXK4_HUMAN_Gersing_2023_abundance,0.442,0.436,0.385,0.372,0.434,0.42,0.412,0.542,0.611,0.572
+PHOT_CHLRE_Chen_2023,0.749,0.775,0.669,0.67,0.662,0.636,0.648,0.611,0.808,0.768
+S22A1_HUMAN_Yee_2023_abundance,0.624,0.603,0.604,0.624,0.624,0.654,0.657,0.57,0.793,0.746
+S22A1_HUMAN_Yee_2023_activity,0.596,0.556,0.593,0.568,0.572,0.611,0.621,0.57,0.745,0.696
+KCNE1_HUMAN_Muhammad_2023_expression,0.787,0.686,0.375,0.431,0.39,0.384,0.374,0.615,0.807,0.802
+KCNE1_HUMAN_Muhammad_2023_function,0.754,0.744,0.606,0.617,0.587,0.683,0.692,0.68,0.768,0.784
+RNC_ECOLI_Weeks_2023,0.674,0.691,0.621,0.611,0.626,0.597,0.638,0.617,0.74,0.703
+HEM3_HUMAN_Loggerenberg_2023,0.434,0.449,0.427,0.397,0.448,0.454,0.454,0.423,0.505,0.505
+OXDA_RHOTO_Vanella_2023_expression,0.484,0.526,0.341,0.352,0.358,0.337,0.335,0.569,0.638,0.514
+OXDA_RHOTO_Vanella_2023_activity,0.441,0.439,0.405,0.383,0.384,0.372,0.398,0.499,0.538,0.422
+PRKN_HUMAN_Clausen_2023,0.689,0.548,0.629,0.631,0.542,0.674,0.685,0.757,0.832,0.773
+DYR_ECOLI_Nguyen_2023,0.588,0.646,0.504,0.559,0.526,0.45,0.481,0.636,0.764,0.651
+ENVZ_ECOLI_Ghose_2023,0.455,0.453,0.305,0.308,0.297,0.322,0.307,0.332,0.492,0.492
+MLAC_ECOLI_MacRae_2023,0.675,0.677,0.495,0.482,0.5,0.43,0.476,0.687,0.798,0.727
+Q837P5_ENTFA_Meier_2023,0.707,0.75,0.641,0.604,0.606,0.677,0.663,0.616,0.798,0.779
+Q837P4_ENTFA_Meier_2023,0.755,0.752,0.673,0.657,0.658,0.667,0.69,0.654,0.827,0.781
+CCR5_HUMAN_Gill_2023,0.401,0.385,0.326,0.372,0.381,0.403,0.354,0.434,0.505,0.502
+Q53Z42_HUMAN_McShan_2019_expression,0.582,0.668,0.601,0.547,0.616,0.596,0.621,0.617,0.769,0.73
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.644,0.649,0.47,0.429,0.424,0.415,0.456,0.673,0.777,0.707
+RPC1_LAMBD_Li_2019_low-expression,0.654,0.716,0.609,0.587,0.592,0.6,0.612,0.509,0.708,0.633
+RPC1_LAMBD_Li_2019_high-expression,0.513,0.553,0.52,0.501,0.535,0.509,0.525,0.356,0.565,0.562
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.758,0.828,0.763,0.717,0.769,0.773,0.778,0.734,0.883,0.82
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.498,0.542,0.466,0.499,0.526,0.455,0.472,0.583,0.65,0.585
+A4_HUMAN_Seuma_2022,0.714,0.444,0.542,0.512,0.53,0.562,0.559,0.587,0.731,0.755
+SRC_HUMAN_Nguyen_2022,0.583,0.541,0.511,0.534,0.528,0.478,0.503,0.63,0.709,0.663
+F7YBW7_MESOW_Ding_2023,0.663,0.684,0.641,0.592,0.689,0.609,0.643,0.586,0.704,0.707
+RASK_HUMAN_Weng_2022_abundance,0.711,0.678,0.339,0.351,0.351,0.407,0.366,0.609,0.783,0.792
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.761,0.721,0.585,0.575,0.642,0.586,0.595,0.681,0.873,0.827
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.592,0.545,0.521,0.545,0.537,0.486,0.514,0.674,0.72,0.691
+AICDA_HUMAN_Gajula_2014_3cycles,0.321,0.379,0.311,0.256,0.293,0.341,0.367,0.278,0.455,0.318
+OTC_HUMAN_Lo_2023,0.752,0.792,0.58,0.631,0.631,0.679,0.655,0.551,0.862,0.763
+CASP3_HUMAN_Roychowdhury_2020,0.638,0.703,0.641,0.616,0.669,0.624,0.652,0.491,0.689,0.655
+CASP7_HUMAN_Roychowdhury_2020,0.618,0.662,0.635,0.629,0.655,0.619,0.649,0.524,0.645,0.644
+SERC_HUMAN_Xie_2023,0.557,0.569,0.561,0.551,0.562,0.56,0.568,0.459,0.642,0.574
+MTHR_HUMAN_Weile_2021,0.523,0.291,0.3,0.326,0.332,0.304,0.31,0.586,0.649,0.603
+AMFR_HUMAN_Rocklin_2023_4G3O,0.9,0.918,0.673,0.629,0.682,0.641,0.664,0.73,0.931,0.878
+ARGR_ECOLI_Rocklin_2023_1AOY,0.853,0.879,0.626,0.555,0.611,0.622,0.634,0.707,0.906,0.883
+BBC1_YEAST_Rocklin_2023_1TG0,0.886,0.902,0.631,0.648,0.63,0.612,0.623,0.706,0.91,0.876
+BCHB_CHLTE_Rocklin_2023_2KRU,0.916,0.915,0.617,0.658,0.658,0.647,0.632,0.762,0.933,0.877
+CATR_CHLRE_Rocklin_2023_2AMI,0.902,0.901,0.509,0.504,0.496,0.528,0.534,0.647,0.923,0.92
+CBPA2_HUMAN_Rocklin_2023_1O6X,0.882,0.896,0.56,0.556,0.566,0.578,0.581,0.697,0.922,0.878
+CBX4_HUMAN_Rocklin_2023_2K28,0.883,0.907,0.722,0.688,0.742,0.699,0.732,0.724,0.909,0.883
+CSN4_MOUSE_Rocklin_2023_1UFM,0.915,0.918,0.637,0.715,0.648,0.684,0.672,0.75,0.94,0.913
+CUE1_YEAST_Rocklin_2023_2MYX,0.913,0.924,0.725,0.712,0.688,0.678,0.721,0.781,0.929,0.882
+DN7A_SACS2_Rocklin_2023_1JIC,0.871,0.88,0.526,0.526,0.543,0.499,0.511,0.687,0.898,0.827
+DNJA1_HUMAN_Rocklin_2023_2LO1,0.914,0.927,0.662,0.669,0.632,0.707,0.684,0.806,0.937,0.926
+DOCK1_MOUSE_Rocklin_2023_2M0Y,0.876,0.895,0.634,0.689,0.691,0.682,0.664,0.73,0.904,0.869
+EPHB2_HUMAN_Rocklin_2023_1F0M,0.914,0.915,0.73,0.734,0.707,0.722,0.739,0.754,0.935,0.89
+FECA_ECOLI_Rocklin_2023_2D1U,0.851,0.91,0.603,0.556,0.623,0.549,0.562,0.701,0.914,0.866
+FKBP3_HUMAN_Rocklin_2023_2KFV,0.833,0.856,0.662,0.524,0.533,0.574,0.612,0.728,0.89,0.84
+HCP_LAMBD_Rocklin_2023_2L6Q,0.91,0.893,0.65,0.644,0.603,0.683,0.678,0.729,0.923,0.903
+HECD1_HUMAN_Rocklin_2023_3DKM,0.887,0.92,0.741,0.59,0.774,0.698,0.722,0.8,0.933,0.893
+ILF3_HUMAN_Rocklin_2023_2L33,0.761,0.792,0.506,0.463,0.496,0.486,0.512,0.593,0.849,0.801
+ISDH_STAAW_Rocklin_2023_2LHR,0.874,0.855,0.667,0.595,0.587,0.617,0.626,0.685,0.868,0.84
+MAFG_MOUSE_Rocklin_2023_1K1V,0.908,0.895,0.641,0.65,0.595,0.652,0.661,0.681,0.904,0.89
+MBD11_ARATH_Rocklin_2023_6ACV,0.898,0.915,0.728,0.675,0.693,0.666,0.705,0.837,0.924,0.869
+MYO3_YEAST_Rocklin_2023_2BTT,0.876,0.898,0.607,0.638,0.62,0.612,0.618,0.703,0.898,0.867
+NKX31_HUMAN_Rocklin_2023_2L9R,0.87,0.88,0.714,0.672,0.71,0.695,0.714,0.724,0.89,0.878
+NUSA_ECOLI_Rocklin_2023_1WCL,0.921,0.923,0.635,0.627,0.632,0.657,0.679,0.754,0.946,0.916
+NUSG_MYCTU_Rocklin_2023_2MI6,0.928,0.94,0.7,0.613,0.689,0.66,0.684,0.744,0.953,0.928
+OBSCN_HUMAN_Rocklin_2023_1V1C,0.912,0.903,0.744,0.68,0.734,0.683,0.728,0.767,0.923,0.886
+ODP2_GEOSE_Rocklin_2023_1W4G,0.82,0.896,0.671,0.626,0.667,0.652,0.679,0.736,0.91,0.857
+OTU7A_HUMAN_Rocklin_2023_2L2D,0.92,0.872,0.671,0.696,0.642,0.624,0.653,0.702,0.887,0.87
+PIN1_HUMAN_Rocklin_2023_1I6C,0.91,0.909,0.776,0.741,0.773,0.759,0.79,0.745,0.922,0.909
+PITX2_HUMAN_Rocklin_2023_2L7M,0.901,0.91,0.732,0.7,0.72,0.708,0.725,0.773,0.924,0.907
+PKN1_HUMAN_Rocklin_2023_1URF,0.867,0.867,0.438,0.472,0.434,0.487,0.48,0.635,0.897,0.865
+POLG_PESV_Rocklin_2023_2MXD,0.884,0.917,0.643,0.674,0.693,0.635,0.634,0.74,0.933,0.87
+PR40A_HUMAN_Rocklin_2023_1UZC,0.857,0.891,0.668,0.611,0.712,0.666,0.682,0.783,0.913,0.872
+PSAE_SYNP2_Rocklin_2023_1PSE,0.875,0.857,0.585,0.595,0.529,0.535,0.557,0.728,0.885,0.853
+RAD_ANTMA_Rocklin_2023_2CJJ,0.844,0.867,0.633,0.584,0.634,0.576,0.589,0.672,0.866,0.842
+RBP1_HUMAN_Rocklin_2023_2KWH,0.892,0.873,0.508,0.569,0.495,0.517,0.506,0.636,0.882,0.862
+RCD1_ARATH_Rocklin_2023_5OAO,0.89,0.876,0.533,0.476,0.484,0.509,0.525,0.626,0.89,0.864
+RCRO_LAMBD_Rocklin_2023_1ORC,0.916,0.914,0.657,0.636,0.675,0.649,0.653,0.772,0.927,0.9
+RD23A_HUMAN_Rocklin_2023_1IFY,0.944,0.928,0.704,0.711,0.721,0.691,0.709,0.731,0.937,0.923
+RFAH_ECOLI_Rocklin_2023_2LCL,0.841,0.859,0.504,0.463,0.475,0.472,0.479,0.583,0.868,0.826
+RL20_AQUAE_Rocklin_2023_1GYZ,0.792,0.791,0.497,0.556,0.483,0.511,0.511,0.696,0.817,0.795
+RPC1_BP434_Rocklin_2023_1R69,0.929,0.932,0.717,0.771,0.739,0.762,0.747,0.763,0.939,0.923
+RS15_GEOSE_Rocklin_2023_1A32,0.917,0.908,0.595,0.569,0.602,0.584,0.581,0.706,0.926,0.915
+SAV1_MOUSE_Rocklin_2023_2YSB,0.899,0.899,0.721,0.756,0.761,0.74,0.739,0.797,0.914,0.892
+SBI_STAAM_Rocklin_2023_2JVG,0.869,0.895,0.625,0.641,0.678,0.617,0.623,0.727,0.895,0.844
+SCIN_STAAR_Rocklin_2023_2QFF,0.877,0.894,0.491,0.51,0.482,0.489,0.488,0.687,0.917,0.847
+SDA_BACSU_Rocklin_2023_1PV0,0.916,0.904,0.704,0.652,0.696,0.681,0.701,0.695,0.924,0.901
+SOX30_HUMAN_Rocklin_2023_7JJK,0.891,0.908,0.501,0.541,0.521,0.49,0.497,0.658,0.919,0.876
+SPA_STAAU_Rocklin_2023_1LP1,0.874,0.909,0.619,0.655,0.637,0.57,0.596,0.748,0.913,0.883
+SPG2_STRSG_Rocklin_2023_5UBS,0.896,0.891,0.521,0.53,0.48,0.482,0.518,0.644,0.905,0.871
+SPTN1_CHICK_Rocklin_2023_1TUD,0.876,0.887,0.672,0.687,0.646,0.678,0.681,0.716,0.899,0.873
+SQSTM_MOUSE_Rocklin_2023_2RRU,0.909,0.93,0.771,0.699,0.761,0.737,0.767,0.765,0.936,0.911
+SR43C_ARATH_Rocklin_2023_2N88,0.905,0.908,0.73,0.707,0.719,0.719,0.73,0.765,0.91,0.902
+SRBS1_HUMAN_Rocklin_2023_2O2W,0.901,0.92,0.721,0.683,0.749,0.727,0.735,0.806,0.935,0.908
+TCRG1_MOUSE_Rocklin_2023_1E0L,0.911,0.907,0.672,0.773,0.736,0.718,0.7,0.746,0.922,0.894
+THO1_YEAST_Rocklin_2023_2WQG,0.885,0.881,0.652,0.654,0.649,0.643,0.651,0.695,0.898,0.855
+TNKS2_HUMAN_Rocklin_2023_5JRT,0.928,0.928,0.753,0.764,0.746,0.752,0.762,0.749,0.937,0.919
+UBE4B_HUMAN_Rocklin_2023_3L1X,0.896,0.911,0.696,0.707,0.709,0.645,0.687,0.743,0.918,0.875
+UBR5_HUMAN_Rocklin_2023_1I2T,0.875,0.889,0.626,0.59,0.586,0.629,0.643,0.713,0.909,0.877
+VG08_BPP22_Rocklin_2023_2GP8,0.913,0.904,0.611,0.7,0.618,0.628,0.617,0.7,0.915,0.882
+VILI_CHICK_Rocklin_2023_1YU5,0.902,0.898,0.734,0.685,0.713,0.642,0.711,0.768,0.92,0.872
+VRPI_BPT7_Rocklin_2023_2WNM,0.895,0.868,0.614,0.628,0.563,0.679,0.659,0.73,0.901,0.865
+YAIA_ECOLI_Rocklin_2023_2KVT,0.857,0.869,0.601,0.585,0.591,0.59,0.595,0.716,0.878,0.819
+YNZC_BACSU_Rocklin_2023_2JVD,0.88,0.89,0.573,0.641,0.592,0.598,0.579,0.673,0.902,0.853
+POLG_DEN26_Suphatrakul_2023,0.304,0.512,0.323,0.07,0.465,0.576,0.593,0.555,0.76,0.623
+MET_HUMAN_Estevam_2023,0.631,0.628,0.588,0.578,0.619,0.59,0.597,0.579,0.668,0.667
+RDRP_I33A0_Li_2023,0.121,0.486,0.399,0.094,0.5,0.488,0.537,0.425,0.633,0.572
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.354,0.358,0.363,0.363,0.331,0.317,0.354,0.401,0.448,0.421
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.449,0.445,0.321,0.331,0.347,0.376,0.383,0.541,0.571,0.574
+ACE2_HUMAN_Chan_2020,0.517,0.343,0.42,0.41,0.423,0.588,0.567,0.64,0.717,0.641
+PPM1D_HUMAN_Miller_2022,0.605,0.558,0.56,0.609,0.555,0.608,0.628,0.607,0.67,0.64
+HMDH_HUMAN_Jiang_2019,0.589,0.304,0.418,0.369,0.309,0.364,0.389,0.552,0.666,0.621
+CBS_HUMAN_Sun_2020,0.401,0.376,0.39,0.358,0.381,0.345,0.376,0.398,0.469,0.412
+NPC1_HUMAN_Erwood_2022_HEK293T,0.612,0.757,0.717,0.572,0.788,0.764,0.778,0.568,0.809,0.777
+NPC1_HUMAN_Erwood_2022_RPE1,0.575,0.88,0.748,0.478,0.83,0.637,0.735,0.486,0.861,0.823
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.551,-0.067,0.501,0.354,0.132,0.478,0.491,0.526,0.124,0.502
+SHOC2_HUMAN_Kwon_2022,0.442,0.391,0.407,0.407,0.414,0.397,0.419,0.507,0.603,0.506
+RAF1_HUMAN_Zinkus-Boltz_2019,0.378,0.443,0.416,0.447,0.473,0.399,0.445,0.265,0.347,0.337
+OPSD_HUMAN_Wan_2019,0.659,0.716,0.594,0.564,0.629,0.591,0.577,0.53,0.735,0.666
+CAR11_HUMAN_Meitlis_2020_lof,0.703,0.404,0.505,0.595,0.431,0.487,0.5,0.712,0.755,0.74
+CAR11_HUMAN_Meitlis_2020_gof,0.503,0.2,0.413,0.439,0.336,0.395,0.399,0.572,0.592,0.557
+ERBB2_HUMAN_Elazar_2016,0.752,0.722,0.577,0.624,0.644,0.671,0.636,0.444,0.735,0.785
+GLPA_HUMAN_Elazar_2016,0.829,0.778,0.414,0.556,0.516,0.499,0.536,0.33,0.827,0.831
+LYAM1_HUMAN_Elazar_2016,0.675,0.732,0.556,0.594,0.654,0.586,0.574,0.49,0.72,0.678
+GDIA_HUMAN_Silverstein_2021,0.475,0.528,0.452,0.43,0.469,0.442,0.467,0.174,0.541,0.452
+PPARG_HUMAN_Majithia_2016,0.832,0.7,0.69,0.726,0.647,0.679,0.712,0.931,0.934,0.869
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.417,0.298,0.299,0.219,0.371,0.306,0.322,0.501,0.624,0.494
diff --git a/benchmarks/DMS_supervised/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv b/benchmarks/DMS_supervised/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
new file mode 100644
index 0000000..c763c49
--- /dev/null
+++ b/benchmarks/DMS_supervised/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
@@ -0,0 +1,11 @@
+Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Average_Spearman_fold_random_5,Average_Spearman_fold_modulo_5,Average_Spearman_fold_contiguous_5,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,References,Model details
+1,ProteinNPT,Embedding,0.613,0.0,0.73,0.564,0.547,0.577,0.536,0.637,0.545,0.772,0.701,0.587,0.608,0.649,0.628,0.668,0.58,"Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ProteinNPT Model
+2,Tranception Embeddings,Embedding,0.571,0.008,0.696,0.526,0.49,0.52,0.529,0.613,0.519,0.674,0.621,0.556,0.561,0.569,0.582,0.594,0.568,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception Embeddings
+3,MSA Transformer Embeddings,Embedding,0.568,0.008,0.642,0.538,0.525,0.547,0.47,0.584,0.493,0.749,0.685,0.518,0.567,0.634,0.579,0.648,0.521,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer Embeddings
+4,ESM-1v Embeddings,Embedding,0.542,0.013,0.639,0.506,0.481,0.487,0.45,0.587,0.468,0.717,0.653,0.465,0.541,0.565,0.579,0.617,0.433,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v Embeddings
+5,TranceptEVE + One-Hot Encodings,One-hot Encoding,0.477,0.011,0.55,0.44,0.441,0.502,0.444,0.476,0.47,0.493,0.503,0.483,0.468,0.481,0.49,0.475,0.478,"[1] Original model: Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",TranceptEVE + One-Hot Encodings
+6,Tranception + One-Hot Encodings,One-hot Encoding,0.458,0.011,0.535,0.419,0.419,0.475,0.416,0.476,0.448,0.473,0.49,0.455,0.445,0.457,0.472,0.453,0.456,"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",Tranception + One-Hot Encodings
+7,MSA_Transformer + One-Hot Encodings,One-hot Encoding,0.453,0.011,0.536,0.412,0.41,0.48,0.393,0.463,0.437,0.491,0.5,0.441,0.448,0.482,0.459,0.468,0.448,"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",MSA Transformer + One-Hot Encodings
+8,DeepSequence + One-Hot Encodings,One-hot Encoding,0.44,0.014,0.521,0.4,0.4,0.467,0.418,0.424,0.422,0.471,0.482,0.422,0.426,0.451,0.46,0.455,0.383,"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5",DeepSequence + One-Hot Encodings
+9,ESM-1v + One-Hot Encodings,One-hot Encoding,0.417,0.014,0.514,0.368,0.367,0.421,0.363,0.452,0.383,0.463,0.496,0.338,0.4,0.426,0.444,0.452,0.292,"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",ESM-1v + One-Hot Encodings
+10,One-Hot Encodings,One-hot Encoding,0.224,0.015,0.579,0.027,0.064,0.213,0.212,0.226,0.194,0.273,0.246,0.204,0.227,0.236,0.217,0.233,0.238,"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5",One-Hot Encodings
diff --git a/benchmarks/DMS_zero_shot/indels/AUC/DMS_indels_AUC_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/AUC/DMS_indels_AUC_DMS_level.csv
new file mode 100644
index 0000000..ca673d4
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/AUC/DMS_indels_AUC_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS ID,Unirep,Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,Hidden Markov Model,Provean,Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A4_HUMAN_Seuma_2022_indels,0.822,0.879,0.854,0.854,0.848,0.843,0.9,0.87,0.875,0.869,0.862,0.88,0.896,0.849,0.85,0.884,0.861,0.815,0.892,0.858,0.862,0.26,0.835,2346,Stability,A4_HUMAN,Low,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,0.536,0.915,0.248,0.687,0.873,0.888,0.5,0.845,0.84,0.737,0.866,0.863,0.453,0.458,0.577,0.284,0.311,0.322,0.297,0.32,0.336,0.893,0.727,117,Stability,AMFR_HUMAN,Medium,Human
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,0.676,0.969,0.934,0.97,0.967,0.951,0.751,0.962,0.98,0.974,0.949,0.532,0.901,0.897,0.992,0.588,0.707,0.848,0.621,0.761,0.886,0.903,0.776,181,Stability,ARGR_ECOLI,Medium,Prokaryote
+B1LPA6_ECOSM_Russ_2020_indels,0.676,0.77,0.678,0.717,0.701,0.71,0.73,0.721,0.757,0.744,0.717,0.7,0.606,0.613,0.611,0.699,0.701,0.684,0.694,0.708,0.691,0.743,0.756,3074,Activity,B1LPA6_ECOSM,Medium,Prokaryote
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,0.762,0.679,0.774,0.711,0.777,0.733,0.689,0.81,0.851,0.81,0.793,0.636,0.797,0.846,0.843,0.756,0.778,0.779,0.76,0.78,0.788,0.617,0.701,134,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,0.445,0.947,0.323,0.604,0.688,0.785,0.596,0.755,0.67,0.875,0.957,0.585,0.489,0.699,0.764,0.312,0.382,0.53,0.332,0.412,0.557,0.895,0.753,82,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Gonzalez_2019_indels,0.474,0.699,0.714,0.697,0.644,0.66,0.74,0.808,0.816,0.801,0.7,0.573,0.722,0.673,0.642,0.709,0.668,0.667,0.714,0.685,0.664,0.667,0.671,4751,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.28,0.852,0.654,0.689,0.795,0.826,0.272,0.267,0.304,0.265,0.764,0.587,0.676,0.69,0.874,0.892,0.864,0.883,0.893,0.899,0.902,0.812,0.871,225998,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAPSD_AAV2S_Sinai_2021_library_indels,0.375,0.479,0.54,0.611,0.612,0.634,0.439,0.448,0.459,0.451,0.664,0.58,0.547,0.595,0.716,0.766,0.754,0.761,0.764,0.755,0.792,0.565,0.668,24908,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,0.724,0.723,0.74,0.725,0.709,0.659,0.689,0.626,0.652,0.604,0.626,0.542,0.753,0.618,0.655,0.796,0.692,0.738,0.792,0.674,0.718,0.548,0.619,197,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,0.673,0.731,0.666,0.77,0.674,0.686,0.611,0.668,0.666,0.606,0.749,0.531,0.618,0.738,0.615,0.425,0.45,0.433,0.424,0.453,0.433,0.776,0.648,205,Stability,CBPA2_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,0.58,0.645,0.578,0.627,0.617,0.599,0.599,0.605,0.593,0.618,0.586,0.485,0.673,0.621,0.625,0.61,0.594,0.619,0.623,0.601,0.626,0.672,0.546,129,Stability,CBX4_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,0.81,0.898,0.849,0.843,0.909,0.954,0.712,0.929,0.943,0.935,0.926,0.695,0.775,0.884,0.913,0.369,0.472,0.515,0.38,0.498,0.553,0.958,0.724,195,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,0.825,0.939,0.868,0.778,0.812,0.734,0.649,0.731,0.67,0.749,0.92,0.649,0.746,0.726,0.759,0.239,0.24,0.258,0.256,0.25,0.276,0.819,0.922,140,Stability,CUE1_YEAST,Medium,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,0.656,0.905,0.618,0.636,0.556,0.594,0.531,0.552,0.547,0.589,0.682,0.595,0.608,0.555,0.525,0.491,0.485,0.422,0.52,0.506,0.445,0.734,0.901,136,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,0.734,0.626,0.614,0.696,0.734,0.672,0.717,0.66,0.681,0.65,0.702,0.636,0.766,0.774,0.765,0.663,0.703,0.701,0.675,0.715,0.712,0.616,0.532,174,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,0.702,0.815,0.821,0.834,0.843,0.81,0.855,0.857,0.839,0.829,0.837,0.64,0.834,0.853,0.86,0.701,0.739,0.75,0.72,0.758,0.769,0.716,0.738,154,Stability,DOCK1_MOUSE,High,Eukaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,0.601,0.81,0.811,0.801,0.82,0.803,0.858,0.831,0.874,0.853,0.843,0.749,0.774,0.779,0.799,0.517,0.594,0.662,0.558,0.63,0.701,0.79,0.658,185,Stability,EPHB2_HUMAN,High,Human
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,0.685,0.528,0.659,0.696,0.713,0.606,0.646,0.739,0.774,0.707,0.691,0.575,0.522,0.659,0.52,0.424,0.523,0.465,0.426,0.536,0.471,0.557,0.68,193,Stability,FECA_ECOLI,High,Eukaryote
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,0.692,0.884,0.829,0.856,0.888,0.894,0.808,0.769,0.758,0.773,0.932,0.763,0.807,0.753,0.9,0.397,0.382,0.779,0.393,0.368,0.807,0.822,0.869,148,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,0.638,0.862,0.752,0.821,0.849,0.891,0.788,0.877,0.907,0.878,0.87,0.516,0.585,0.796,0.534,0.573,0.705,0.555,0.569,0.725,0.545,0.913,0.719,154,Stability,HECD1_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019_indels,0.503,0.941,0.911,0.92,0.936,0.94,0.931,0.953,0.953,0.953,0.958,0.445,0.906,0.941,0.954,0.904,0.895,0.922,0.911,0.937,0.947,0.853,0.95,6102,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,0.483,0.796,0.714,0.755,0.643,0.657,0.746,0.638,0.711,0.764,0.743,0.584,0.608,0.805,0.788,0.481,0.645,0.694,0.499,0.676,0.722,0.666,0.539,193,Stability,ILF3_HUMAN,High,Human
+KCNJ2_MOUSE_Macdonald_2022_indels,0.559,0.804,0.761,0.784,0.778,0.775,0.764,0.803,0.803,0.798,0.779,0.68,0.769,0.795,0.783,0.796,0.805,0.813,0.792,0.814,0.808,0.762,0.791,10501,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,0.889,0.816,0.59,0.89,0.886,0.892,0.83,0.863,0.871,0.86,0.861,0.587,0.564,0.823,0.899,0.506,0.579,0.639,0.507,0.597,0.671,0.754,0.535,115,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,0.627,0.65,0.581,0.553,0.764,0.772,0.692,0.866,0.848,0.743,0.739,0.588,0.515,0.58,0.556,0.492,0.522,0.494,0.509,0.534,0.512,0.638,0.758,131,Stability,MBD11_ARATH,Medium,Eukaryote
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,0.479,0.714,0.708,0.664,0.793,0.762,0.753,0.762,0.657,0.731,0.764,0.582,0.736,0.74,0.704,0.588,0.605,0.576,0.613,0.631,0.599,0.627,0.745,80,Stability,MYO3_YEAST,High,Eukaryote
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,0.878,0.934,0.96,0.935,0.922,0.922,0.906,0.911,0.954,0.896,0.897,0.634,0.985,0.874,0.888,0.936,0.876,0.896,0.94,0.876,0.888,0.88,0.755,178,Stability,NKX31_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,0.632,0.701,0.667,0.696,0.675,0.661,0.679,0.671,0.716,0.658,0.675,0.458,0.697,0.714,0.7,0.471,0.519,0.508,0.475,0.543,0.526,0.55,0.738,191,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,0.763,0.76,0.744,0.751,0.757,0.742,0.759,0.75,0.743,0.758,0.77,0.813,0.737,0.751,0.781,0.653,0.672,0.72,0.666,0.686,0.733,0.668,0.889,157,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,0.682,0.784,0.924,0.884,0.874,0.856,0.891,0.93,0.956,0.891,0.873,0.687,0.654,0.653,0.715,0.583,0.599,0.614,0.591,0.61,0.634,0.71,0.606,169,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,0.541,0.9,0.952,0.995,0.984,0.982,0.923,0.968,0.98,0.934,0.984,0.776,0.824,0.667,0.692,0.821,0.88,0.805,0.864,0.912,0.821,0.756,0.993,47,Stability,ODP2_GEOSE,High,Prokaryote
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,0.747,0.604,0.715,0.957,0.958,0.978,0.79,0.975,0.975,0.955,0.919,0.669,0.627,0.803,0.947,0.305,0.415,0.577,0.307,0.44,0.603,0.822,0.898,84,Stability,OTU7A_HUMAN,High,Human
+P53_HUMAN_Kotler_2018_indels,0.531,0.5,0.667,0.69,0.66,0.597,0.723,0.684,0.662,0.697,0.658,0.501,0.748,0.763,0.664,0.689,0.735,0.643,0.699,0.754,0.661,0.745,0.623,341,OrganismalFitness,P53_HUMAN,Low,Human
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,0.901,0.941,0.915,0.956,0.932,0.955,0.97,0.879,0.93,0.942,0.889,0.925,0.917,0.965,0.975,0.647,0.88,0.968,0.699,0.924,0.98,0.904,0.868,106,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,0.89,0.867,0.902,0.925,0.915,0.926,0.874,0.892,0.874,0.936,0.897,0.771,0.895,0.903,0.914,0.645,0.634,0.626,0.719,0.7,0.707,0.763,0.779,117,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,0.609,0.882,0.882,0.887,0.883,0.9,0.895,0.929,0.929,0.926,0.924,0.674,0.794,0.888,0.885,0.613,0.837,0.856,0.645,0.861,0.868,0.724,0.762,187,Stability,PKN1_HUMAN,High,Human
+POLG_PESV_Tsuboyama_2023_2MXD_indels,0.741,0.777,0.731,0.708,0.699,0.668,0.676,0.654,0.665,0.605,0.691,0.602,0.746,0.693,0.701,0.362,0.335,0.348,0.377,0.351,0.37,0.652,0.63,149,Stability,POLG_PESV,Medium,Virus
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,0.555,0.869,0.741,0.829,0.819,0.816,0.818,0.836,0.83,0.835,0.812,0.772,0.678,0.659,0.646,0.54,0.571,0.528,0.558,0.595,0.549,0.854,0.575,168,Stability,PR40A_HUMAN,Medium,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE_indels,0.524,0.533,0.618,0.547,0.586,0.575,0.643,0.702,0.481,0.646,0.703,0.583,0.63,0.692,0.43,0.621,0.743,0.561,0.628,0.753,0.55,0.621,0.641,175,Stability,PSAE_SYNP2,Medium,Prokaryote
+PTEN_HUMAN_Mighell_2018_indels,0.321,0.895,0.85,0.841,0.787,0.795,0.904,0.82,0.88,0.851,0.734,0.551,0.895,0.907,0.832,0.894,0.897,0.827,0.897,0.915,0.856,0.888,0.638,314,Activity,PTEN_HUMAN,Medium,Human
+Q8EG35_SHEON_Campbell_2022_indels,0.769,0.587,0.756,0.721,0.668,0.624,0.8,0.684,0.65,0.641,0.651,0.621,0.822,0.805,0.677,0.742,0.824,0.69,0.741,0.793,0.679,0.754,0.609,331,OrganismalFitness,Q8EG35_SHEON,Medium,Prokaryote
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,0.815,0.794,0.748,0.951,0.961,0.942,0.853,0.924,0.937,0.899,0.833,0.599,0.795,0.873,0.782,0.527,0.777,0.725,0.55,0.812,0.754,0.852,0.459,97,Stability,RAD_ANTMA,High,Eukaryote
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,0.598,0.888,0.536,0.538,0.488,0.682,0.501,0.598,0.593,0.547,0.684,0.594,0.487,0.558,0.62,0.327,0.335,0.358,0.323,0.335,0.361,0.836,0.679,124,Stability,RCD1_ARATH,Medium,Eukaryote
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,0.77,0.897,0.944,0.922,0.922,0.954,0.948,0.974,0.968,0.961,0.982,0.957,0.804,0.969,0.939,0.557,0.783,0.84,0.579,0.839,0.871,0.844,0.869,120,Stability,RD23A_HUMAN,High,Human
+RPC1_BP434_Tsuboyama_2023_1R69_indels,0.895,0.736,0.892,0.889,0.872,0.867,0.905,0.873,0.869,0.89,0.707,0.642,0.839,0.893,0.884,0.791,0.851,0.887,0.798,0.855,0.896,0.589,0.704,164,Stability,RPC1_BP434,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,0.639,0.675,0.651,0.684,0.681,0.755,0.627,0.765,0.763,0.755,0.752,0.569,0.823,0.764,0.727,0.659,0.676,0.548,0.693,0.709,0.579,0.642,0.671,176,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance_indels,0.638,0.509,0.718,0.743,0.811,0.791,0.729,0.813,0.818,0.767,0.712,0.633,0.751,0.746,0.782,0.707,0.715,0.779,0.739,0.75,0.781,0.674,0.698,430,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity_indels,0.708,0.573,0.771,0.781,0.835,0.828,0.769,0.849,0.814,0.806,0.738,0.631,0.765,0.807,0.824,0.723,0.751,0.822,0.76,0.806,0.817,0.706,0.704,490,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,0.901,0.972,0.937,0.91,0.9,0.9,0.897,0.926,0.906,0.903,0.901,0.863,0.927,0.92,0.931,0.657,0.812,0.838,0.711,0.862,0.885,0.914,0.873,86,Stability,SAV1_MOUSE,High,Eukaryote
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,0.604,0.666,0.601,0.689,0.719,0.784,0.655,0.739,0.539,0.694,0.817,0.501,0.65,0.722,0.719,0.356,0.368,0.668,0.371,0.383,0.689,0.763,0.731,127,Stability,SDA_BACSU,Medium,Prokaryote
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,0.936,0.757,0.613,0.756,0.787,0.863,0.504,0.735,0.784,0.779,0.801,0.816,0.416,0.808,0.727,0.35,0.468,0.55,0.364,0.505,0.598,0.691,0.856,109,Stability,SOX30_HUMAN,High,Human
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,0.417,0.682,0.558,0.575,0.672,0.684,0.626,0.667,0.655,0.682,0.738,0.378,0.49,0.51,0.8,0.608,0.613,0.687,0.605,0.612,0.705,0.556,0.663,148,Stability,SPG2_STRSG,Medium,Prokaryote
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,0.716,0.54,0.756,0.803,0.778,0.842,0.737,0.885,0.828,0.858,0.81,0.597,0.818,0.787,0.824,0.675,0.722,0.725,0.688,0.74,0.744,0.621,0.609,129,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,0.641,0.739,0.811,0.856,0.87,0.875,0.834,0.83,0.891,0.879,0.861,0.56,0.719,0.841,0.86,0.545,0.674,0.788,0.561,0.718,0.82,0.681,0.924,111,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,0.511,0.665,0.762,0.61,0.693,0.643,0.586,0.608,0.706,0.713,0.684,0.752,0.657,0.455,0.656,0.746,0.675,0.783,0.755,0.681,0.784,0.641,0.652,135,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,0.821,0.885,0.804,0.866,0.861,0.855,0.841,0.868,0.888,0.871,0.821,0.762,0.86,0.862,0.902,0.889,0.891,0.925,0.889,0.891,0.923,0.72,0.756,154,Stability,SRBS1_HUMAN,High,Human
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,0.911,0.942,0.942,0.969,0.964,0.981,0.94,0.94,0.964,0.95,0.956,0.821,0.784,0.941,0.964,0.152,0.212,0.284,0.183,0.258,0.354,0.967,0.809,99,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,0.625,0.949,0.546,0.867,0.878,0.856,0.816,0.893,0.856,0.895,0.93,0.806,0.624,0.905,0.858,0.384,0.609,0.573,0.392,0.655,0.612,0.886,0.955,82,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,0.435,0.917,0.668,0.818,0.903,0.861,0.746,0.82,0.851,0.828,0.837,0.682,0.522,0.657,0.751,0.55,0.626,0.676,0.564,0.633,0.68,0.926,0.695,171,Stability,TNKS2_HUMAN,High,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,0.408,0.66,0.489,0.6,0.569,0.648,0.665,0.593,0.621,0.643,0.713,0.687,0.489,0.477,0.666,0.489,0.486,0.637,0.499,0.498,0.654,0.393,0.681,147,Stability,UBE4B_HUMAN,High,Human
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,0.641,0.788,0.764,0.727,0.751,0.761,0.774,0.743,0.793,0.781,0.808,0.673,0.746,0.766,0.771,0.509,0.597,0.611,0.519,0.623,0.633,0.687,0.623,156,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,0.796,0.932,0.825,0.938,0.897,0.987,0.771,0.878,0.818,0.788,0.952,0.356,0.751,0.948,0.976,0.634,0.711,0.946,0.634,0.745,0.977,0.859,0.729,101,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,0.495,0.916,0.629,0.805,0.777,0.786,0.622,0.826,0.791,0.78,0.846,0.566,0.857,0.836,0.831,0.691,0.805,0.805,0.724,0.816,0.824,0.808,0.8,156,Stability,VILI_CHICK,High,Eukaryote
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,0.704,0.826,0.562,0.71,0.8,0.675,0.721,0.698,0.696,0.7,0.736,0.538,0.644,0.602,0.673,0.394,0.384,0.375,0.396,0.384,0.374,0.708,0.857,154,Stability,VRPI_BPT7,Medium,Virus
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,0.749,0.915,0.766,0.771,0.854,0.869,0.772,0.921,0.7,0.898,0.887,0.602,0.718,0.751,0.856,0.264,0.256,0.359,0.269,0.265,0.39,0.898,0.637,104,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_DMS_indels_AUC.csv b/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_DMS_indels_AUC.csv
new file mode 100644
index 0000000..0b2c092
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_DMS_indels_AUC.csv
@@ -0,0 +1,24 @@
+Model_rank,Model_name,Model type,Average_AUC,Bootstrap_standard_error_AUC,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Model details,References
+1,Progen2 Base,Protein language model,0.778,0.0,0.817,,0.81,0.692,0.794,0.64,0.763,0.823,0.827,0.817,0.714,0.698,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+2,Progen2 M,Protein language model,0.776,0.007,0.797,,0.808,0.697,0.8,0.637,0.784,0.814,0.809,0.818,0.762,0.705,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+3,RITA L,Protein language model,0.773,0.019,0.774,,0.794,0.722,0.799,0.737,0.775,0.812,0.809,0.811,0.725,0.81,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+4,RITA XL,Protein language model,0.769,0.02,0.778,,0.783,0.71,0.804,0.723,0.787,0.809,0.814,0.807,0.741,0.804,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+5,Progen2 L,Protein language model,0.767,0.005,0.8,,0.782,0.69,0.797,0.641,0.775,0.815,0.815,0.807,0.761,0.686,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+6,Tranception M no retrieval,Protein language model,0.767,0.02,0.776,,0.77,0.765,0.756,0.752,0.733,0.78,0.773,0.779,0.701,0.755,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+7,RITA M,Protein language model,0.766,0.015,0.78,,0.764,0.736,0.784,0.731,0.75,0.81,0.807,0.785,0.718,0.792,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+8,Tranception L no retrieval,Protein language model,0.764,0.024,0.756,,0.782,0.746,0.772,0.77,0.74,0.795,0.774,0.789,0.708,0.822,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+9,Progen2 XL,Protein language model,0.757,0.02,0.73,,0.746,0.736,0.818,0.745,0.809,0.81,0.813,0.82,0.784,0.789,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+10,TranceptEVE M,Hybrid model,0.754,0.024,0.81,,0.782,0.799,0.625,0.813,0.564,0.712,0.684,0.644,0.623,0.588,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+11,Progen2 S,Protein language model,0.751,0.011,0.801,,0.746,0.71,0.747,0.66,0.722,0.777,0.78,0.75,0.702,0.706,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+12,TranceptEVE L,Hybrid model,0.751,0.025,0.788,,0.794,0.76,0.66,0.79,0.592,0.742,0.702,0.662,0.637,0.712,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+13,Tranception S no retrieval,Protein language model,0.747,0.018,0.755,,0.76,0.762,0.71,0.752,0.687,0.74,0.715,0.729,0.694,0.733,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,Hidden Markov Model,Alignment-based model,0.744,0.022,0.779,,0.718,0.742,0.737,0.564,0.781,0.719,0.743,0.753,0.725,0.72,Profile Hidden Markov model,HMMER: biosequence analysis using profile hidden Markov models
+15,RITA S,Protein language model,0.741,0.015,0.766,,0.74,0.729,0.729,0.706,0.698,0.763,0.742,0.746,0.684,0.739,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+16,Tranception L,Hybrid model,0.741,0.024,0.778,,0.796,0.749,0.64,0.76,0.574,0.723,0.682,0.641,0.621,0.693,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+17,Tranception M,Hybrid model,0.733,0.025,0.783,,0.76,0.786,0.605,0.802,0.547,0.69,0.662,0.624,0.607,0.579,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+18,Provean,Alignment-based model,0.725,0.027,0.699,,0.744,0.724,0.733,0.742,0.732,0.73,0.704,0.746,0.745,0.76,Provean model,"Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one."
+19,TranceptEVE S,Hybrid model,0.722,0.024,0.784,,0.766,0.779,0.562,0.807,0.52,0.639,0.615,0.578,0.585,0.571,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+20,Wavenet,Alignment-based model,0.72,0.037,0.746,,0.657,0.678,0.8,0.682,0.805,0.782,0.791,0.796,0.765,0.803,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+21,Tranception S,Hybrid model,0.711,0.025,0.772,,0.752,0.775,0.546,0.801,0.509,0.621,0.598,0.564,0.571,0.568,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+22,ProtGPT2,Protein language model,0.62,0.032,0.627,,0.657,0.545,0.653,0.655,0.611,0.672,0.693,0.639,0.592,0.581,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+23,Unirep,Protein language model,0.593,0.04,0.568,,0.598,0.521,0.682,0.56,0.645,0.687,0.661,0.689,0.612,0.693,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
diff --git a/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_all_models_indels_AUC.csv b/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_all_models_indels_AUC.csv
new file mode 100644
index 0000000..629d82f
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/AUC/Summary_performance_all_models_indels_AUC.csv
@@ -0,0 +1,22 @@
+Model_rank,Model_name,Model type,Average_AUC,Bootstrap_standard_error_AUC,Model details,References
+1,TranceptEVE M,Hybrid model,0.799,0.0,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+2,Tranception M,Hybrid model,0.796,0.001,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+3,Tranception L,Hybrid model,0.775,0.009,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+4,TranceptEVE L,Hybrid model,0.775,0.01,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+5,Tranception S,Hybrid model,0.773,0.015,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,RITA (ensemble),Protein language model,0.761,0.014,Ensemble of the 4 RITA models,"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+7,Tranception L no retrieval,Protein language model,0.756,0.012,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+8,Progen2 (ensemble),Protein language model,0.748,0.053,Ensemble of the 5 Progen2 models," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+9,RITA M,Protein language model,0.747,0.018,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+10,RITA L,Protein language model,0.74,0.015,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+11,Wavenet,Alignment-based model,0.74,0.036,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+12,RITA XL,Protein language model,0.738,0.018,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+13,Progen2 Base,Protein language model,0.737,0.056,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+14,Progen2 XL,Protein language model,0.735,0.024,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+15,Progen2 L,Protein language model,0.728,0.058,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+16,Unirep evotuned,Hybrid model,0.728,0.033,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+17,Progen2 M,Protein language model,0.72,0.058,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+18,Progen2 S,Protein language model,0.714,0.053,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+19,RITA S,Protein language model,0.71,0.028,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+20,Unirep,Protein language model,0.576,0.045,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+21,ProtGPT2,Protein language model,0.575,0.059,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
diff --git a/benchmarks/DMS_zero_shot/indels/AUC/all_models_indels_AUC_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/AUC/all_models_indels_AUC_DMS_level.csv
new file mode 100644
index 0000000..25a285f
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/AUC/all_models_indels_AUC_DMS_level.csv
@@ -0,0 +1,9 @@
+,Tranception_L_no_retrieval,Tranception_S_retrieval,Tranception_M_retrieval,Tranception_L_retrieval,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,RITA_ensemble,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,Progen2_ensemble,Unirep,Unirep_evotune,ProtGPT2,TranceptEVE_L,TranceptEVE_M,number_mutants
+A0A1J4YT16_9PROT_Davidi_2020,0.625,0.618,0.658,0.645,0.577,0.454,0.626,0.636,0.643,0.621,0.592,0.625,0.63,0.63,0.601,0.618,0.468,0.679,0.634,0.641,0.657,105
+B1LPA6_ECOSM_Russ_2020,0.691,0.752,0.759,0.744,0.73,0.686,0.736,0.706,0.707,0.72,0.684,0.694,0.728,0.708,0.695,0.711,0.679,0.758,0.69,0.749,0.766,3074
+BLAT_ECOLX_Gonzalez_indels_2019,0.642,0.718,0.687,0.663,0.757,0.714,0.697,0.644,0.66,0.703,0.74,0.808,0.816,0.802,0.7,0.831,0.526,0.639,0.573,0.664,0.686,4751
+CAPSD_AAV2S_Sinai_indels_2021,0.769,0.787,0.788,0.789,0.708,0.62,0.657,0.726,0.712,0.697,0.376,0.372,0.402,0.371,0.714,0.43,0.688,0.746,0.559,0.794,0.796,250907
+HIS7_YEAST_Pokusaeva_indels_2019,0.954,0.909,0.933,0.946,0.937,0.911,0.92,0.936,0.94,0.936,0.931,0.953,0.953,0.953,0.958,0.96,0.497,0.671,0.445,0.947,0.937,6102
+PTEN_HUMAN_Mighell_deletions_2018,0.832,0.893,0.916,0.861,0.889,0.85,0.841,0.787,0.795,0.883,0.904,0.82,0.88,0.851,0.734,0.895,0.679,0.82,0.551,0.856,0.915,314
+P53_HUMAN_Kotler_deletions_2018,0.782,0.732,0.833,0.775,0.581,0.735,0.752,0.746,0.713,0.768,0.774,0.764,0.753,0.78,0.742,0.792,0.496,0.782,0.571,0.776,0.839,341
+Average,0.756,0.773,0.796,0.775,0.74,0.71,0.747,0.74,0.738,0.761,0.714,0.72,0.737,0.728,0.735,0.748,0.576,0.728,0.575,0.775,0.799,37942
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.csv
new file mode 100644
index 0000000..cfad26e
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS ID,Unirep,Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,Hidden Markov Model,Provean,Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A4_HUMAN_Seuma_2022_indels,0.585,0.619,0.55,0.614,0.592,0.628,0.664,0.647,0.633,0.633,0.618,0.626,0.656,0.635,0.604,0.637,0.642,0.58,0.64,0.632,0.607,-0.376,0.511,2346,Stability,A4_HUMAN,Low,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,0.137,0.462,-0.39,0.259,0.421,0.421,0.016,0.421,0.421,0.218,0.421,0.421,-0.106,-0.025,0.178,-0.228,-0.147,-0.147,-0.187,-0.147,-0.147,0.462,0.172,117,Stability,AMFR_HUMAN,Medium,Human
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,0.25,0.407,0.407,0.407,0.407,0.344,0.155,0.344,0.407,0.407,0.344,0.029,0.376,0.376,0.407,-0.002,0.187,0.281,0.061,0.218,0.344,0.407,0.218,181,Stability,ARGR_ECOLI,Medium,Prokaryote
+B1LPA6_ECOSM_Russ_2020_indels,0.221,0.362,0.223,0.283,0.241,0.251,0.336,0.286,0.378,0.33,0.286,0.261,0.146,0.118,0.108,0.276,0.252,0.236,0.269,0.253,0.235,0.299,0.369,3074,Activity,B1LPA6_ECOSM,Medium,Prokaryote
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,0.309,0.034,0.377,0.137,0.343,0.24,0.24,0.343,0.446,0.309,0.377,0.206,0.48,0.412,0.446,0.377,0.412,0.412,0.377,0.412,0.412,0.171,0.171,134,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,-0.087,0.549,-0.202,0.145,0.26,0.318,0.145,0.376,0.26,0.434,0.491,0.087,0.029,0.26,0.318,-0.26,-0.145,0.087,-0.145,-0.145,0.087,0.549,0.26,82,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Gonzalez_2019_indels,-0.052,0.31,0.266,0.245,0.179,0.244,0.303,0.445,0.478,0.475,0.256,0.078,0.286,0.277,0.224,0.269,0.233,0.242,0.266,0.274,0.254,0.293,0.238,4751,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+CAPSD_AAV2S_Sinai_2021_designed_indels,-0.353,0.558,0.252,0.298,0.443,0.502,-0.353,-0.358,-0.303,-0.361,0.383,0.13,0.274,0.302,0.578,0.603,0.554,0.599,0.606,0.618,0.633,0.474,0.571,225998,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAPSD_AAV2S_Sinai_2021_library_indels,-0.118,0.033,0.038,0.103,0.097,0.114,-0.058,-0.041,-0.033,-0.039,0.141,0.083,0.038,0.09,0.195,0.245,0.219,0.223,0.239,0.238,0.247,0.082,0.187,24908,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,0.184,0.324,0.23,0.23,0.277,0.23,0.207,0.09,0.207,0.044,0.137,0.067,0.324,0.114,0.184,0.324,0.254,0.277,0.324,0.184,0.3,-0.05,0.114,197,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,0.18,0.288,0.245,0.331,0.18,0.202,0.137,0.115,0.094,0.051,0.331,0.072,0.072,0.309,0.137,-0.057,-0.014,0.008,-0.057,-0.014,-0.014,0.352,0.18,205,Stability,CBPA2_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,0.024,0.055,0.024,0.086,0.055,0.024,-0.069,-0.007,0.024,0.055,0.024,-0.007,0.149,0.086,0.118,0.18,0.149,0.18,0.18,0.149,0.211,0.18,0.055,129,Stability,CBX4_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,0.48,0.429,0.404,0.429,0.429,0.505,0.302,0.48,0.505,0.429,0.429,0.201,0.404,0.404,0.505,-0.13,-0.053,-0.003,-0.13,-0.003,0.048,0.505,0.251,195,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,0.372,0.328,0.372,0.284,0.284,0.241,0.197,0.197,0.153,0.197,0.372,0.197,0.197,0.284,0.372,-0.197,-0.197,-0.109,-0.153,-0.197,-0.109,0.328,0.284,140,Stability,CUE1_YEAST,Medium,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,0.148,0.207,0.148,0.03,0.089,0.03,0.03,0.089,0.03,0.03,0.089,0.148,0.148,0.03,0.03,0.03,-0.03,-0.089,0.03,0.03,-0.089,0.207,0.207,136,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,0.369,0.092,0.138,0.231,0.254,0.138,0.277,0.161,0.231,0.231,0.323,0.208,0.392,0.369,0.346,0.185,0.3,0.277,0.208,0.277,0.277,0.231,0.069,174,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,0.268,0.522,0.409,0.409,0.465,0.296,0.494,0.494,0.437,0.465,0.465,0.183,0.578,0.606,0.578,0.268,0.381,0.381,0.324,0.409,0.381,0.409,0.155,154,Stability,DOCK1_MOUSE,High,Eukaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,0.122,0.322,0.347,0.322,0.372,0.372,0.422,0.372,0.372,0.372,0.422,0.347,0.272,0.297,0.297,-0.003,0.022,0.147,0.072,0.047,0.222,0.347,0.147,185,Stability,EPHB2_HUMAN,High,Human
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,0.241,-0.032,0.15,0.15,0.21,-0.032,0.18,0.241,0.301,0.15,0.089,0.089,-0.002,0.18,0.028,-0.063,0.028,-0.063,-0.063,0.059,-0.032,-0.032,0.028,193,Stability,FECA_ECOLI,High,Eukaryote
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,0.207,0.461,0.429,0.429,0.493,0.493,0.334,0.334,0.334,0.302,0.493,0.302,0.429,0.366,0.461,-0.016,-0.048,0.334,-0.08,-0.08,0.334,0.27,0.334,148,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,0.25,0.426,0.25,0.367,0.367,0.426,0.397,0.455,0.485,0.455,0.426,0.044,0.044,0.338,0.015,0.132,0.22,0.044,0.073,0.22,0.044,0.544,0.213,154,Stability,HECD1_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019_indels,0.027,0.568,0.555,0.555,0.577,0.571,0.594,0.61,0.586,0.602,0.597,-0.087,0.542,0.574,0.596,0.525,0.528,0.547,0.535,0.572,0.577,0.467,0.585,6102,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,0.063,0.374,0.307,0.263,0.063,0.174,0.352,0.085,0.241,0.374,0.329,0.063,0.174,0.374,0.418,0.019,0.241,0.241,0.041,0.285,0.307,0.041,0.056,193,Stability,ILF3_HUMAN,High,Human
+KCNJ2_MOUSE_Macdonald_2022_indels,0.063,0.385,0.313,0.315,0.317,0.303,0.327,0.373,0.377,0.374,0.316,0.225,0.316,0.367,0.336,0.362,0.396,0.4,0.359,0.406,0.392,0.349,0.342,10501,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,0.322,0.263,-0.032,0.263,0.263,0.263,0.263,0.263,0.263,0.263,0.263,-0.032,0.027,0.263,0.322,0.027,0.027,0.086,0.027,0.027,0.086,0.204,-0.062,115,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,0.179,0.148,0.148,0.022,0.463,0.432,0.306,0.59,0.495,0.306,0.243,0.116,0.022,0.179,0.116,-0.01,0.022,-0.041,-0.01,0.022,-0.01,0.243,0.463,131,Stability,MBD11_ARATH,Medium,Eukaryote
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,-0.052,0.314,0.262,0.262,0.367,0.367,0.314,0.367,0.105,0.314,0.314,0.262,0.262,0.367,0.314,0.105,0.157,0.157,0.052,0.262,0.157,0.367,0.419,80,Stability,MYO3_YEAST,High,Eukaryote
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,0.411,0.565,0.565,0.565,0.539,0.565,0.513,0.565,0.565,0.565,0.565,0.128,0.59,0.436,0.565,0.488,0.436,0.513,0.539,0.436,0.488,0.462,0.104,178,Stability,NKX31_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,0.193,0.193,0.054,0.193,0.124,0.078,0.216,0.078,0.286,0.078,0.078,-0.038,0.239,0.239,0.147,-0.038,0.008,0.054,-0.061,0.031,0.031,0.078,0.263,191,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,0.223,0.123,0.123,0.123,0.123,0.123,0.123,0.123,0.123,0.123,0.123,0.223,0.123,0.123,0.173,0.023,0.073,0.173,0.073,0.073,0.173,0.123,0.273,157,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,0.215,0.473,0.602,0.525,0.576,0.628,0.55,0.628,0.628,0.55,0.499,0.293,0.241,0.267,0.318,0.138,0.241,0.267,0.138,0.189,0.267,0.318,0.119,169,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,0.034,0.605,0.605,0.605,0.605,0.605,0.605,0.605,0.605,0.605,0.605,0.415,0.32,0.225,0.32,0.415,0.51,0.415,0.415,0.605,0.415,0.415,0.605,47,Stability,ODP2_GEOSE,High,Prokaryote
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,0.406,0.174,0.232,0.464,0.522,0.464,0.174,0.464,0.464,0.464,0.464,0.116,0.232,0.348,0.464,-0.174,0.0,0.116,-0.174,0.0,0.116,0.522,0.406,84,Stability,OTU7A_HUMAN,High,Human
+P53_HUMAN_Kotler_2018_indels,0.056,-0.003,0.232,0.326,0.267,0.138,0.349,0.29,0.208,0.29,0.173,-0.026,0.361,0.419,0.243,0.29,0.419,0.22,0.29,0.443,0.267,0.372,0.126,341,OrganismalFitness,P53_HUMAN,Low,Human
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,0.586,0.628,0.503,0.628,0.503,0.586,0.586,0.419,0.545,0.586,0.419,0.586,0.545,0.586,0.628,0.042,0.461,0.586,0.084,0.545,0.628,0.503,0.461,106,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,0.375,0.33,0.33,0.419,0.375,0.375,0.33,0.375,0.33,0.419,0.419,0.286,0.33,0.375,0.375,0.107,0.152,0.063,0.196,0.196,0.196,0.196,0.233,117,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,0.162,0.634,0.516,0.539,0.563,0.516,0.539,0.587,0.634,0.61,0.634,0.209,0.492,0.539,0.563,0.115,0.469,0.469,0.162,0.492,0.469,0.209,0.256,187,Stability,PKN1_HUMAN,High,Human
+POLG_PESV_Tsuboyama_2023_2MXD_indels,0.283,0.369,0.455,0.369,0.283,0.254,0.34,0.34,0.34,0.283,0.34,0.11,0.369,0.283,0.283,-0.148,-0.206,-0.148,-0.12,-0.148,-0.12,0.168,0.139,149,Stability,POLG_PESV,Medium,Virus
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,0.127,0.433,0.204,0.459,0.459,0.357,0.408,0.459,0.484,0.433,0.357,0.433,0.255,0.178,0.127,0.025,0.076,0.025,0.0,0.127,0.102,0.51,0.051,168,Stability,PR40A_HUMAN,Medium,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE_indels,0.064,0.178,0.247,0.109,0.109,0.018,0.247,0.339,-0.051,0.293,0.362,0.132,0.178,0.293,-0.143,0.224,0.431,0.041,0.247,0.431,0.041,0.109,0.293,175,Stability,PSAE_SYNP2,Medium,Prokaryote
+PTEN_HUMAN_Mighell_2018_indels,-0.274,0.58,0.503,0.49,0.376,0.478,0.592,0.465,0.605,0.529,0.338,0.057,0.592,0.618,0.516,0.618,0.592,0.465,0.631,0.631,0.529,0.643,0.159,314,Activity,PTEN_HUMAN,Medium,Human
+Q8EG35_SHEON_Campbell_2022_indels,0.39,0.148,0.366,0.305,0.269,0.184,0.426,0.269,0.269,0.233,0.233,0.136,0.45,0.499,0.269,0.39,0.462,0.233,0.378,0.39,0.269,0.414,0.196,331,OrganismalFitness,Q8EG35_SHEON,Medium,Prokaryote
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,0.511,0.381,0.295,0.684,0.684,0.727,0.511,0.684,0.684,0.597,0.425,0.122,0.468,0.511,0.381,0.036,0.381,0.295,0.079,0.511,0.381,0.641,-0.152,97,Stability,RAD_ANTMA,High,Eukaryote
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,0.198,0.475,0.119,0.0,0.04,0.238,0.0,0.0,0.079,0.04,0.158,0.119,0.0,0.079,0.158,-0.198,-0.158,-0.119,-0.198,-0.158,-0.119,0.357,0.198,124,Stability,RCD1_ARATH,Medium,Eukaryote
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,0.402,0.402,0.445,0.402,0.445,0.445,0.445,0.487,0.487,0.487,0.487,0.487,0.36,0.487,0.487,0.106,0.275,0.445,0.064,0.318,0.487,0.487,0.134,120,Stability,RD23A_HUMAN,High,Human
+RPC1_BP434_Tsuboyama_2023_1R69_indels,0.389,0.265,0.389,0.389,0.389,0.42,0.389,0.42,0.327,0.42,0.202,0.14,0.389,0.389,0.389,0.296,0.327,0.358,0.296,0.327,0.358,0.016,0.358,164,Stability,RPC1_BP434,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,0.158,0.119,0.04,0.119,0.079,0.119,0.079,0.119,0.158,0.198,0.158,0.079,0.237,0.198,0.158,0.119,0.079,0.0,0.119,0.119,-0.04,0.079,0.115,176,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance_indels,0.228,0.033,0.282,0.304,0.369,0.348,0.282,0.358,0.369,0.304,0.217,0.163,0.304,0.293,0.326,0.25,0.293,0.348,0.282,0.337,0.369,0.326,0.25,430,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity_indels,0.346,0.115,0.423,0.389,0.508,0.474,0.423,0.517,0.431,0.44,0.295,0.226,0.363,0.491,0.457,0.312,0.406,0.465,0.38,0.465,0.465,0.354,0.26,490,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,0.771,0.817,0.677,0.631,0.631,0.631,0.631,0.677,0.631,0.631,0.631,0.49,0.677,0.631,0.677,0.21,0.444,0.49,0.257,0.49,0.537,0.817,0.631,86,Stability,SAV1_MOUSE,High,Eukaryote
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,0.116,0.186,0.151,0.359,0.29,0.428,0.186,0.29,0.116,0.22,0.498,-0.022,0.255,0.428,0.151,-0.23,-0.196,0.255,-0.23,-0.126,0.186,0.255,0.255,127,Stability,SDA_BACSU,Medium,Prokaryote
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,0.455,0.31,0.068,0.165,0.262,0.407,-0.028,0.165,0.31,0.213,0.262,0.262,-0.174,0.213,0.213,-0.125,-0.125,0.068,-0.125,-0.077,0.068,0.262,0.145,109,Stability,SOX30_HUMAN,High,Human
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,-0.155,0.296,0.07,0.099,0.324,0.268,0.24,0.24,0.211,0.211,0.324,-0.183,-0.014,-0.014,0.409,0.127,0.155,0.268,0.127,0.127,0.268,0.155,0.24,148,Stability,SPG2_STRSG,Medium,Prokaryote
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,0.29,0.071,0.352,0.477,0.446,0.477,0.258,0.633,0.508,0.539,0.415,0.04,0.508,0.508,0.539,0.227,0.383,0.383,0.227,0.352,0.383,0.071,0.258,129,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,0.15,0.202,0.304,0.202,0.304,0.304,0.304,0.304,0.355,0.304,0.355,0.048,0.253,0.304,0.304,-0.004,0.099,0.253,-0.004,0.202,0.304,0.202,0.279,111,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,-0.052,0.228,0.383,0.041,0.259,0.29,0.103,0.135,0.383,0.414,0.228,0.321,0.197,-0.052,0.135,0.321,0.321,0.446,0.383,0.29,0.383,0.228,0.197,135,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,0.501,0.637,0.339,0.501,0.447,0.474,0.474,0.529,0.583,0.529,0.366,0.339,0.447,0.501,0.637,0.529,0.529,0.637,0.529,0.529,0.637,0.23,0.204,154,Stability,SRBS1_HUMAN,High,Human
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,0.641,0.555,0.598,0.598,0.641,0.641,0.598,0.598,0.598,0.598,0.598,0.425,0.425,0.598,0.641,-0.482,-0.439,-0.309,-0.482,-0.309,-0.266,0.684,0.05,99,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,0.236,0.53,0.059,0.471,0.471,0.53,0.295,0.471,0.412,0.471,0.53,0.471,0.236,0.471,0.471,-0.059,0.059,0.0,-0.059,0.118,0.059,0.471,0.53,82,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,-0.095,0.351,0.128,0.203,0.314,0.203,0.128,0.203,0.203,0.203,0.203,0.165,0.017,0.091,0.128,0.091,0.165,0.165,0.128,0.165,0.091,0.351,0.104,171,Stability,TNKS2_HUMAN,High,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,-0.126,0.155,0.014,0.014,-0.021,0.014,0.12,0.014,0.085,0.014,0.155,0.225,0.014,-0.056,0.12,-0.021,-0.056,0.085,0.049,0.014,0.12,-0.126,0.043,147,Stability,UBE4B_HUMAN,High,Human
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,0.209,0.273,0.209,0.209,0.209,0.177,0.305,0.241,0.241,0.273,0.273,0.209,0.241,0.241,0.241,-0.016,0.016,0.048,-0.016,0.016,0.048,0.112,0.106,156,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,0.334,0.674,0.461,0.632,0.632,0.674,0.206,0.504,0.291,0.291,0.674,-0.262,0.376,0.674,0.674,0.078,0.206,0.547,0.121,0.291,0.632,0.674,0.334,101,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,-0.09,0.38,0.127,0.416,0.307,0.307,0.235,0.38,0.344,0.307,0.271,0.054,0.416,0.271,0.344,0.163,0.271,0.271,0.163,0.271,0.271,0.38,0.235,156,Stability,VILI_CHICK,High,Eukaryote
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,0.314,0.366,0.157,0.288,0.418,0.262,0.34,0.34,0.235,0.288,0.366,0.0,0.209,0.105,0.183,-0.131,-0.131,-0.209,-0.105,-0.157,-0.183,0.209,0.564,154,Stability,VRPI_BPT7,Medium,Virus
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,0.382,0.582,0.382,0.341,0.502,0.462,0.341,0.582,0.221,0.582,0.582,0.181,0.422,0.422,0.582,-0.341,-0.382,-0.261,-0.341,-0.301,-0.181,0.542,0.301,104,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.html b/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.html
new file mode 100644
index 0000000..718ff3b
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/DMS_indels_MCC_DMS_level.html
@@ -0,0 +1,2083 @@
+
+
+
+ score |
+ Unirep |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ Hidden Markov Model |
+ Provean |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A4_HUMAN_Seuma_2022_indels |
+ 0.585 |
+ 0.619 |
+ 0.550 |
+ 0.614 |
+ 0.592 |
+ 0.628 |
+ 0.664 |
+ 0.647 |
+ 0.633 |
+ 0.633 |
+ 0.618 |
+ 0.626 |
+ 0.656 |
+ 0.635 |
+ 0.604 |
+ 0.637 |
+ 0.642 |
+ 0.580 |
+ 0.640 |
+ 0.632 |
+ 0.607 |
+ -0.376 |
+ 0.511 |
+ 2346 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O_indels |
+ 0.137 |
+ 0.462 |
+ -0.390 |
+ 0.259 |
+ 0.421 |
+ 0.421 |
+ 0.016 |
+ 0.421 |
+ 0.421 |
+ 0.218 |
+ 0.421 |
+ 0.421 |
+ -0.106 |
+ -0.025 |
+ 0.178 |
+ -0.228 |
+ -0.147 |
+ -0.147 |
+ -0.187 |
+ -0.147 |
+ -0.147 |
+ 0.462 |
+ 0.172 |
+ 117 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY_indels |
+ 0.250 |
+ 0.407 |
+ 0.407 |
+ 0.407 |
+ 0.407 |
+ 0.344 |
+ 0.155 |
+ 0.344 |
+ 0.407 |
+ 0.407 |
+ 0.344 |
+ 0.029 |
+ 0.376 |
+ 0.376 |
+ 0.407 |
+ -0.002 |
+ 0.187 |
+ 0.281 |
+ 0.061 |
+ 0.218 |
+ 0.344 |
+ 0.407 |
+ 0.218 |
+ 181 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B1LPA6_ECOSM_Russ_2020_indels |
+ 0.221 |
+ 0.362 |
+ 0.223 |
+ 0.283 |
+ 0.241 |
+ 0.251 |
+ 0.336 |
+ 0.286 |
+ 0.378 |
+ 0.330 |
+ 0.286 |
+ 0.261 |
+ 0.146 |
+ 0.118 |
+ 0.108 |
+ 0.276 |
+ 0.252 |
+ 0.236 |
+ 0.269 |
+ 0.253 |
+ 0.235 |
+ 0.299 |
+ 0.369 |
+ 3074 |
+ Activity |
+ B1LPA6_ECOSM |
+ Medium |
+ Prokaryote |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0_indels |
+ 0.309 |
+ 0.034 |
+ 0.377 |
+ 0.137 |
+ 0.343 |
+ 0.240 |
+ 0.240 |
+ 0.343 |
+ 0.446 |
+ 0.309 |
+ 0.377 |
+ 0.206 |
+ 0.480 |
+ 0.412 |
+ 0.446 |
+ 0.377 |
+ 0.412 |
+ 0.412 |
+ 0.377 |
+ 0.412 |
+ 0.412 |
+ 0.171 |
+ 0.171 |
+ 134 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU_indels |
+ -0.087 |
+ 0.549 |
+ -0.202 |
+ 0.145 |
+ 0.260 |
+ 0.318 |
+ 0.145 |
+ 0.376 |
+ 0.260 |
+ 0.434 |
+ 0.491 |
+ 0.087 |
+ 0.029 |
+ 0.260 |
+ 0.318 |
+ -0.260 |
+ -0.145 |
+ 0.087 |
+ -0.145 |
+ -0.145 |
+ 0.087 |
+ 0.549 |
+ 0.260 |
+ 82 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Gonzalez_2019_indels |
+ -0.052 |
+ 0.310 |
+ 0.266 |
+ 0.245 |
+ 0.179 |
+ 0.244 |
+ 0.303 |
+ 0.445 |
+ 0.478 |
+ 0.475 |
+ 0.256 |
+ 0.078 |
+ 0.286 |
+ 0.277 |
+ 0.224 |
+ 0.269 |
+ 0.233 |
+ 0.242 |
+ 0.266 |
+ 0.274 |
+ 0.254 |
+ 0.293 |
+ 0.238 |
+ 4751 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ CAPSD_AAV2S_Sinai_2021_designed_indels |
+ -0.353 |
+ 0.558 |
+ 0.252 |
+ 0.298 |
+ 0.443 |
+ 0.502 |
+ -0.353 |
+ -0.358 |
+ -0.303 |
+ -0.361 |
+ 0.383 |
+ 0.130 |
+ 0.274 |
+ 0.302 |
+ 0.578 |
+ 0.603 |
+ 0.554 |
+ 0.599 |
+ 0.606 |
+ 0.618 |
+ 0.633 |
+ 0.474 |
+ 0.571 |
+ 225998 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAPSD_AAV2S_Sinai_2021_library_indels |
+ -0.118 |
+ 0.033 |
+ 0.038 |
+ 0.103 |
+ 0.097 |
+ 0.114 |
+ -0.058 |
+ -0.041 |
+ -0.033 |
+ -0.039 |
+ 0.141 |
+ 0.083 |
+ 0.038 |
+ 0.090 |
+ 0.195 |
+ 0.245 |
+ 0.219 |
+ 0.223 |
+ 0.239 |
+ 0.238 |
+ 0.247 |
+ 0.082 |
+ 0.187 |
+ 24908 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI_indels |
+ 0.184 |
+ 0.324 |
+ 0.230 |
+ 0.230 |
+ 0.277 |
+ 0.230 |
+ 0.207 |
+ 0.090 |
+ 0.207 |
+ 0.044 |
+ 0.137 |
+ 0.067 |
+ 0.324 |
+ 0.114 |
+ 0.184 |
+ 0.324 |
+ 0.254 |
+ 0.277 |
+ 0.324 |
+ 0.184 |
+ 0.300 |
+ -0.050 |
+ 0.114 |
+ 197 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels |
+ 0.180 |
+ 0.288 |
+ 0.245 |
+ 0.331 |
+ 0.180 |
+ 0.202 |
+ 0.137 |
+ 0.115 |
+ 0.094 |
+ 0.051 |
+ 0.331 |
+ 0.072 |
+ 0.072 |
+ 0.309 |
+ 0.137 |
+ -0.057 |
+ -0.014 |
+ 0.008 |
+ -0.057 |
+ -0.014 |
+ -0.014 |
+ 0.352 |
+ 0.180 |
+ 205 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28_indels |
+ 0.024 |
+ 0.055 |
+ 0.024 |
+ 0.086 |
+ 0.055 |
+ 0.024 |
+ -0.069 |
+ -0.007 |
+ 0.024 |
+ 0.055 |
+ 0.024 |
+ -0.007 |
+ 0.149 |
+ 0.086 |
+ 0.118 |
+ 0.180 |
+ 0.149 |
+ 0.180 |
+ 0.180 |
+ 0.149 |
+ 0.211 |
+ 0.180 |
+ 0.055 |
+ 129 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM_indels |
+ 0.480 |
+ 0.429 |
+ 0.404 |
+ 0.429 |
+ 0.429 |
+ 0.505 |
+ 0.302 |
+ 0.480 |
+ 0.505 |
+ 0.429 |
+ 0.429 |
+ 0.201 |
+ 0.404 |
+ 0.404 |
+ 0.505 |
+ -0.130 |
+ -0.053 |
+ -0.003 |
+ -0.130 |
+ -0.003 |
+ 0.048 |
+ 0.505 |
+ 0.251 |
+ 195 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX_indels |
+ 0.372 |
+ 0.328 |
+ 0.372 |
+ 0.284 |
+ 0.284 |
+ 0.241 |
+ 0.197 |
+ 0.197 |
+ 0.153 |
+ 0.197 |
+ 0.372 |
+ 0.197 |
+ 0.197 |
+ 0.284 |
+ 0.372 |
+ -0.197 |
+ -0.197 |
+ -0.109 |
+ -0.153 |
+ -0.197 |
+ -0.109 |
+ 0.328 |
+ 0.284 |
+ 140 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC_indels |
+ 0.148 |
+ 0.207 |
+ 0.148 |
+ 0.030 |
+ 0.089 |
+ 0.030 |
+ 0.030 |
+ 0.089 |
+ 0.030 |
+ 0.030 |
+ 0.089 |
+ 0.148 |
+ 0.148 |
+ 0.030 |
+ 0.030 |
+ 0.030 |
+ -0.030 |
+ -0.089 |
+ 0.030 |
+ 0.030 |
+ -0.089 |
+ 0.207 |
+ 0.207 |
+ 136 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels |
+ 0.369 |
+ 0.092 |
+ 0.138 |
+ 0.231 |
+ 0.254 |
+ 0.138 |
+ 0.277 |
+ 0.161 |
+ 0.231 |
+ 0.231 |
+ 0.323 |
+ 0.208 |
+ 0.392 |
+ 0.369 |
+ 0.346 |
+ 0.185 |
+ 0.300 |
+ 0.277 |
+ 0.208 |
+ 0.277 |
+ 0.277 |
+ 0.231 |
+ 0.069 |
+ 174 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels |
+ 0.268 |
+ 0.522 |
+ 0.409 |
+ 0.409 |
+ 0.465 |
+ 0.296 |
+ 0.494 |
+ 0.494 |
+ 0.437 |
+ 0.465 |
+ 0.465 |
+ 0.183 |
+ 0.578 |
+ 0.606 |
+ 0.578 |
+ 0.268 |
+ 0.381 |
+ 0.381 |
+ 0.324 |
+ 0.409 |
+ 0.381 |
+ 0.409 |
+ 0.155 |
+ 154 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels |
+ 0.122 |
+ 0.322 |
+ 0.347 |
+ 0.322 |
+ 0.372 |
+ 0.372 |
+ 0.422 |
+ 0.372 |
+ 0.372 |
+ 0.372 |
+ 0.422 |
+ 0.347 |
+ 0.272 |
+ 0.297 |
+ 0.297 |
+ -0.003 |
+ 0.022 |
+ 0.147 |
+ 0.072 |
+ 0.047 |
+ 0.222 |
+ 0.347 |
+ 0.147 |
+ 185 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U_indels |
+ 0.241 |
+ -0.032 |
+ 0.150 |
+ 0.150 |
+ 0.210 |
+ -0.032 |
+ 0.180 |
+ 0.241 |
+ 0.301 |
+ 0.150 |
+ 0.089 |
+ 0.089 |
+ -0.002 |
+ 0.180 |
+ 0.028 |
+ -0.063 |
+ 0.028 |
+ -0.063 |
+ -0.063 |
+ 0.059 |
+ -0.032 |
+ -0.032 |
+ 0.028 |
+ 193 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Eukaryote |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q_indels |
+ 0.207 |
+ 0.461 |
+ 0.429 |
+ 0.429 |
+ 0.493 |
+ 0.493 |
+ 0.334 |
+ 0.334 |
+ 0.334 |
+ 0.302 |
+ 0.493 |
+ 0.302 |
+ 0.429 |
+ 0.366 |
+ 0.461 |
+ -0.016 |
+ -0.048 |
+ 0.334 |
+ -0.080 |
+ -0.080 |
+ 0.334 |
+ 0.270 |
+ 0.334 |
+ 148 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM_indels |
+ 0.250 |
+ 0.426 |
+ 0.250 |
+ 0.367 |
+ 0.367 |
+ 0.426 |
+ 0.397 |
+ 0.455 |
+ 0.485 |
+ 0.455 |
+ 0.426 |
+ 0.044 |
+ 0.044 |
+ 0.338 |
+ 0.015 |
+ 0.132 |
+ 0.220 |
+ 0.044 |
+ 0.073 |
+ 0.220 |
+ 0.044 |
+ 0.544 |
+ 0.213 |
+ 154 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019_indels |
+ 0.027 |
+ 0.568 |
+ 0.555 |
+ 0.555 |
+ 0.577 |
+ 0.571 |
+ 0.594 |
+ 0.610 |
+ 0.586 |
+ 0.602 |
+ 0.597 |
+ -0.087 |
+ 0.542 |
+ 0.574 |
+ 0.596 |
+ 0.525 |
+ 0.528 |
+ 0.547 |
+ 0.535 |
+ 0.572 |
+ 0.577 |
+ 0.467 |
+ 0.585 |
+ 6102 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33_indels |
+ 0.063 |
+ 0.374 |
+ 0.307 |
+ 0.263 |
+ 0.063 |
+ 0.174 |
+ 0.352 |
+ 0.085 |
+ 0.241 |
+ 0.374 |
+ 0.329 |
+ 0.063 |
+ 0.174 |
+ 0.374 |
+ 0.418 |
+ 0.019 |
+ 0.241 |
+ 0.241 |
+ 0.041 |
+ 0.285 |
+ 0.307 |
+ 0.041 |
+ 0.056 |
+ 193 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ KCNJ2_MOUSE_Macdonald_2022_indels |
+ 0.063 |
+ 0.385 |
+ 0.313 |
+ 0.315 |
+ 0.317 |
+ 0.303 |
+ 0.327 |
+ 0.373 |
+ 0.377 |
+ 0.374 |
+ 0.316 |
+ 0.225 |
+ 0.316 |
+ 0.367 |
+ 0.336 |
+ 0.362 |
+ 0.396 |
+ 0.400 |
+ 0.359 |
+ 0.406 |
+ 0.392 |
+ 0.349 |
+ 0.342 |
+ 10501 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V_indels |
+ 0.322 |
+ 0.263 |
+ -0.032 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ 0.263 |
+ -0.032 |
+ 0.027 |
+ 0.263 |
+ 0.322 |
+ 0.027 |
+ 0.027 |
+ 0.086 |
+ 0.027 |
+ 0.027 |
+ 0.086 |
+ 0.204 |
+ -0.062 |
+ 115 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV_indels |
+ 0.179 |
+ 0.148 |
+ 0.148 |
+ 0.022 |
+ 0.463 |
+ 0.432 |
+ 0.306 |
+ 0.590 |
+ 0.495 |
+ 0.306 |
+ 0.243 |
+ 0.116 |
+ 0.022 |
+ 0.179 |
+ 0.116 |
+ -0.010 |
+ 0.022 |
+ -0.041 |
+ -0.010 |
+ 0.022 |
+ -0.010 |
+ 0.243 |
+ 0.463 |
+ 131 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT_indels |
+ -0.052 |
+ 0.314 |
+ 0.262 |
+ 0.262 |
+ 0.367 |
+ 0.367 |
+ 0.314 |
+ 0.367 |
+ 0.105 |
+ 0.314 |
+ 0.314 |
+ 0.262 |
+ 0.262 |
+ 0.367 |
+ 0.314 |
+ 0.105 |
+ 0.157 |
+ 0.157 |
+ 0.052 |
+ 0.262 |
+ 0.157 |
+ 0.367 |
+ 0.419 |
+ 80 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R_indels |
+ 0.411 |
+ 0.565 |
+ 0.565 |
+ 0.565 |
+ 0.539 |
+ 0.565 |
+ 0.513 |
+ 0.565 |
+ 0.565 |
+ 0.565 |
+ 0.565 |
+ 0.128 |
+ 0.590 |
+ 0.436 |
+ 0.565 |
+ 0.488 |
+ 0.436 |
+ 0.513 |
+ 0.539 |
+ 0.436 |
+ 0.488 |
+ 0.462 |
+ 0.104 |
+ 178 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL_indels |
+ 0.193 |
+ 0.193 |
+ 0.054 |
+ 0.193 |
+ 0.124 |
+ 0.078 |
+ 0.216 |
+ 0.078 |
+ 0.286 |
+ 0.078 |
+ 0.078 |
+ -0.038 |
+ 0.239 |
+ 0.239 |
+ 0.147 |
+ -0.038 |
+ 0.008 |
+ 0.054 |
+ -0.061 |
+ 0.031 |
+ 0.031 |
+ 0.078 |
+ 0.263 |
+ 191 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6_indels |
+ 0.223 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.123 |
+ 0.223 |
+ 0.123 |
+ 0.123 |
+ 0.173 |
+ 0.023 |
+ 0.073 |
+ 0.173 |
+ 0.073 |
+ 0.073 |
+ 0.173 |
+ 0.123 |
+ 0.273 |
+ 157 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels |
+ 0.215 |
+ 0.473 |
+ 0.602 |
+ 0.525 |
+ 0.576 |
+ 0.628 |
+ 0.550 |
+ 0.628 |
+ 0.628 |
+ 0.550 |
+ 0.499 |
+ 0.293 |
+ 0.241 |
+ 0.267 |
+ 0.318 |
+ 0.138 |
+ 0.241 |
+ 0.267 |
+ 0.138 |
+ 0.189 |
+ 0.267 |
+ 0.318 |
+ 0.119 |
+ 169 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G_indels |
+ 0.034 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.605 |
+ 0.415 |
+ 0.320 |
+ 0.225 |
+ 0.320 |
+ 0.415 |
+ 0.510 |
+ 0.415 |
+ 0.415 |
+ 0.605 |
+ 0.415 |
+ 0.415 |
+ 0.605 |
+ 47 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels |
+ 0.406 |
+ 0.174 |
+ 0.232 |
+ 0.464 |
+ 0.522 |
+ 0.464 |
+ 0.174 |
+ 0.464 |
+ 0.464 |
+ 0.464 |
+ 0.464 |
+ 0.116 |
+ 0.232 |
+ 0.348 |
+ 0.464 |
+ -0.174 |
+ 0.000 |
+ 0.116 |
+ -0.174 |
+ 0.000 |
+ 0.116 |
+ 0.522 |
+ 0.406 |
+ 84 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018_indels |
+ 0.056 |
+ -0.003 |
+ 0.232 |
+ 0.326 |
+ 0.267 |
+ 0.138 |
+ 0.349 |
+ 0.290 |
+ 0.208 |
+ 0.290 |
+ 0.173 |
+ -0.026 |
+ 0.361 |
+ 0.419 |
+ 0.243 |
+ 0.290 |
+ 0.419 |
+ 0.220 |
+ 0.290 |
+ 0.443 |
+ 0.267 |
+ 0.372 |
+ 0.126 |
+ 341 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C_indels |
+ 0.586 |
+ 0.628 |
+ 0.503 |
+ 0.628 |
+ 0.503 |
+ 0.586 |
+ 0.586 |
+ 0.419 |
+ 0.545 |
+ 0.586 |
+ 0.419 |
+ 0.586 |
+ 0.545 |
+ 0.586 |
+ 0.628 |
+ 0.042 |
+ 0.461 |
+ 0.586 |
+ 0.084 |
+ 0.545 |
+ 0.628 |
+ 0.503 |
+ 0.461 |
+ 106 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M_indels |
+ 0.375 |
+ 0.330 |
+ 0.330 |
+ 0.419 |
+ 0.375 |
+ 0.375 |
+ 0.330 |
+ 0.375 |
+ 0.330 |
+ 0.419 |
+ 0.419 |
+ 0.286 |
+ 0.330 |
+ 0.375 |
+ 0.375 |
+ 0.107 |
+ 0.152 |
+ 0.063 |
+ 0.196 |
+ 0.196 |
+ 0.196 |
+ 0.196 |
+ 0.233 |
+ 117 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF_indels |
+ 0.162 |
+ 0.634 |
+ 0.516 |
+ 0.539 |
+ 0.563 |
+ 0.516 |
+ 0.539 |
+ 0.587 |
+ 0.634 |
+ 0.610 |
+ 0.634 |
+ 0.209 |
+ 0.492 |
+ 0.539 |
+ 0.563 |
+ 0.115 |
+ 0.469 |
+ 0.469 |
+ 0.162 |
+ 0.492 |
+ 0.469 |
+ 0.209 |
+ 0.256 |
+ 187 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD_indels |
+ 0.283 |
+ 0.369 |
+ 0.455 |
+ 0.369 |
+ 0.283 |
+ 0.254 |
+ 0.340 |
+ 0.340 |
+ 0.340 |
+ 0.283 |
+ 0.340 |
+ 0.110 |
+ 0.369 |
+ 0.283 |
+ 0.283 |
+ -0.148 |
+ -0.206 |
+ -0.148 |
+ -0.120 |
+ -0.148 |
+ -0.120 |
+ 0.168 |
+ 0.139 |
+ 149 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC_indels |
+ 0.127 |
+ 0.433 |
+ 0.204 |
+ 0.459 |
+ 0.459 |
+ 0.357 |
+ 0.408 |
+ 0.459 |
+ 0.484 |
+ 0.433 |
+ 0.357 |
+ 0.433 |
+ 0.255 |
+ 0.178 |
+ 0.127 |
+ 0.025 |
+ 0.076 |
+ 0.025 |
+ 0.000 |
+ 0.127 |
+ 0.102 |
+ 0.510 |
+ 0.051 |
+ 168 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE_indels |
+ 0.064 |
+ 0.178 |
+ 0.247 |
+ 0.109 |
+ 0.109 |
+ 0.018 |
+ 0.247 |
+ 0.339 |
+ -0.051 |
+ 0.293 |
+ 0.362 |
+ 0.132 |
+ 0.178 |
+ 0.293 |
+ -0.143 |
+ 0.224 |
+ 0.431 |
+ 0.041 |
+ 0.247 |
+ 0.431 |
+ 0.041 |
+ 0.109 |
+ 0.293 |
+ 175 |
+ Stability |
+ PSAE_SYNP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Mighell_2018_indels |
+ -0.274 |
+ 0.580 |
+ 0.503 |
+ 0.490 |
+ 0.376 |
+ 0.478 |
+ 0.592 |
+ 0.465 |
+ 0.605 |
+ 0.529 |
+ 0.338 |
+ 0.057 |
+ 0.592 |
+ 0.618 |
+ 0.516 |
+ 0.618 |
+ 0.592 |
+ 0.465 |
+ 0.631 |
+ 0.631 |
+ 0.529 |
+ 0.643 |
+ 0.159 |
+ 314 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q8EG35_SHEON_Campbell_2022_indels |
+ 0.390 |
+ 0.148 |
+ 0.366 |
+ 0.305 |
+ 0.269 |
+ 0.184 |
+ 0.426 |
+ 0.269 |
+ 0.269 |
+ 0.233 |
+ 0.233 |
+ 0.136 |
+ 0.450 |
+ 0.499 |
+ 0.269 |
+ 0.390 |
+ 0.462 |
+ 0.233 |
+ 0.378 |
+ 0.390 |
+ 0.269 |
+ 0.414 |
+ 0.196 |
+ 331 |
+ OrganismalFitness |
+ Q8EG35_SHEON |
+ Medium |
+ Prokaryote |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ_indels |
+ 0.511 |
+ 0.381 |
+ 0.295 |
+ 0.684 |
+ 0.684 |
+ 0.727 |
+ 0.511 |
+ 0.684 |
+ 0.684 |
+ 0.597 |
+ 0.425 |
+ 0.122 |
+ 0.468 |
+ 0.511 |
+ 0.381 |
+ 0.036 |
+ 0.381 |
+ 0.295 |
+ 0.079 |
+ 0.511 |
+ 0.381 |
+ 0.641 |
+ -0.152 |
+ 97 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO_indels |
+ 0.198 |
+ 0.475 |
+ 0.119 |
+ 0.000 |
+ 0.040 |
+ 0.238 |
+ 0.000 |
+ 0.000 |
+ 0.079 |
+ 0.040 |
+ 0.158 |
+ 0.119 |
+ 0.000 |
+ 0.079 |
+ 0.158 |
+ -0.198 |
+ -0.158 |
+ -0.119 |
+ -0.198 |
+ -0.158 |
+ -0.119 |
+ 0.357 |
+ 0.198 |
+ 124 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY_indels |
+ 0.402 |
+ 0.402 |
+ 0.445 |
+ 0.402 |
+ 0.445 |
+ 0.445 |
+ 0.445 |
+ 0.487 |
+ 0.487 |
+ 0.487 |
+ 0.487 |
+ 0.487 |
+ 0.360 |
+ 0.487 |
+ 0.487 |
+ 0.106 |
+ 0.275 |
+ 0.445 |
+ 0.064 |
+ 0.318 |
+ 0.487 |
+ 0.487 |
+ 0.134 |
+ 120 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69_indels |
+ 0.389 |
+ 0.265 |
+ 0.389 |
+ 0.389 |
+ 0.389 |
+ 0.420 |
+ 0.389 |
+ 0.420 |
+ 0.327 |
+ 0.420 |
+ 0.202 |
+ 0.140 |
+ 0.389 |
+ 0.389 |
+ 0.389 |
+ 0.296 |
+ 0.327 |
+ 0.358 |
+ 0.296 |
+ 0.327 |
+ 0.358 |
+ 0.016 |
+ 0.358 |
+ 164 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32_indels |
+ 0.158 |
+ 0.119 |
+ 0.040 |
+ 0.119 |
+ 0.079 |
+ 0.119 |
+ 0.079 |
+ 0.119 |
+ 0.158 |
+ 0.198 |
+ 0.158 |
+ 0.079 |
+ 0.237 |
+ 0.198 |
+ 0.158 |
+ 0.119 |
+ 0.079 |
+ 0.000 |
+ 0.119 |
+ 0.119 |
+ -0.040 |
+ 0.079 |
+ 0.115 |
+ 176 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance_indels |
+ 0.228 |
+ 0.033 |
+ 0.282 |
+ 0.304 |
+ 0.369 |
+ 0.348 |
+ 0.282 |
+ 0.358 |
+ 0.369 |
+ 0.304 |
+ 0.217 |
+ 0.163 |
+ 0.304 |
+ 0.293 |
+ 0.326 |
+ 0.250 |
+ 0.293 |
+ 0.348 |
+ 0.282 |
+ 0.337 |
+ 0.369 |
+ 0.326 |
+ 0.250 |
+ 430 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity_indels |
+ 0.346 |
+ 0.115 |
+ 0.423 |
+ 0.389 |
+ 0.508 |
+ 0.474 |
+ 0.423 |
+ 0.517 |
+ 0.431 |
+ 0.440 |
+ 0.295 |
+ 0.226 |
+ 0.363 |
+ 0.491 |
+ 0.457 |
+ 0.312 |
+ 0.406 |
+ 0.465 |
+ 0.380 |
+ 0.465 |
+ 0.465 |
+ 0.354 |
+ 0.260 |
+ 490 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB_indels |
+ 0.771 |
+ 0.817 |
+ 0.677 |
+ 0.631 |
+ 0.631 |
+ 0.631 |
+ 0.631 |
+ 0.677 |
+ 0.631 |
+ 0.631 |
+ 0.631 |
+ 0.490 |
+ 0.677 |
+ 0.631 |
+ 0.677 |
+ 0.210 |
+ 0.444 |
+ 0.490 |
+ 0.257 |
+ 0.490 |
+ 0.537 |
+ 0.817 |
+ 0.631 |
+ 86 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0_indels |
+ 0.116 |
+ 0.186 |
+ 0.151 |
+ 0.359 |
+ 0.290 |
+ 0.428 |
+ 0.186 |
+ 0.290 |
+ 0.116 |
+ 0.220 |
+ 0.498 |
+ -0.022 |
+ 0.255 |
+ 0.428 |
+ 0.151 |
+ -0.230 |
+ -0.196 |
+ 0.255 |
+ -0.230 |
+ -0.126 |
+ 0.186 |
+ 0.255 |
+ 0.255 |
+ 127 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK_indels |
+ 0.455 |
+ 0.310 |
+ 0.068 |
+ 0.165 |
+ 0.262 |
+ 0.407 |
+ -0.028 |
+ 0.165 |
+ 0.310 |
+ 0.213 |
+ 0.262 |
+ 0.262 |
+ -0.174 |
+ 0.213 |
+ 0.213 |
+ -0.125 |
+ -0.125 |
+ 0.068 |
+ -0.125 |
+ -0.077 |
+ 0.068 |
+ 0.262 |
+ 0.145 |
+ 109 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS_indels |
+ -0.155 |
+ 0.296 |
+ 0.070 |
+ 0.099 |
+ 0.324 |
+ 0.268 |
+ 0.240 |
+ 0.240 |
+ 0.211 |
+ 0.211 |
+ 0.324 |
+ -0.183 |
+ -0.014 |
+ -0.014 |
+ 0.409 |
+ 0.127 |
+ 0.155 |
+ 0.268 |
+ 0.127 |
+ 0.127 |
+ 0.268 |
+ 0.155 |
+ 0.240 |
+ 148 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD_indels |
+ 0.290 |
+ 0.071 |
+ 0.352 |
+ 0.477 |
+ 0.446 |
+ 0.477 |
+ 0.258 |
+ 0.633 |
+ 0.508 |
+ 0.539 |
+ 0.415 |
+ 0.040 |
+ 0.508 |
+ 0.508 |
+ 0.539 |
+ 0.227 |
+ 0.383 |
+ 0.383 |
+ 0.227 |
+ 0.352 |
+ 0.383 |
+ 0.071 |
+ 0.258 |
+ 129 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels |
+ 0.150 |
+ 0.202 |
+ 0.304 |
+ 0.202 |
+ 0.304 |
+ 0.304 |
+ 0.304 |
+ 0.304 |
+ 0.355 |
+ 0.304 |
+ 0.355 |
+ 0.048 |
+ 0.253 |
+ 0.304 |
+ 0.304 |
+ -0.004 |
+ 0.099 |
+ 0.253 |
+ -0.004 |
+ 0.202 |
+ 0.304 |
+ 0.202 |
+ 0.279 |
+ 111 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88_indels |
+ -0.052 |
+ 0.228 |
+ 0.383 |
+ 0.041 |
+ 0.259 |
+ 0.290 |
+ 0.103 |
+ 0.135 |
+ 0.383 |
+ 0.414 |
+ 0.228 |
+ 0.321 |
+ 0.197 |
+ -0.052 |
+ 0.135 |
+ 0.321 |
+ 0.321 |
+ 0.446 |
+ 0.383 |
+ 0.290 |
+ 0.383 |
+ 0.228 |
+ 0.197 |
+ 135 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels |
+ 0.501 |
+ 0.637 |
+ 0.339 |
+ 0.501 |
+ 0.447 |
+ 0.474 |
+ 0.474 |
+ 0.529 |
+ 0.583 |
+ 0.529 |
+ 0.366 |
+ 0.339 |
+ 0.447 |
+ 0.501 |
+ 0.637 |
+ 0.529 |
+ 0.529 |
+ 0.637 |
+ 0.529 |
+ 0.529 |
+ 0.637 |
+ 0.230 |
+ 0.204 |
+ 154 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels |
+ 0.641 |
+ 0.555 |
+ 0.598 |
+ 0.598 |
+ 0.641 |
+ 0.641 |
+ 0.598 |
+ 0.598 |
+ 0.598 |
+ 0.598 |
+ 0.598 |
+ 0.425 |
+ 0.425 |
+ 0.598 |
+ 0.641 |
+ -0.482 |
+ -0.439 |
+ -0.309 |
+ -0.482 |
+ -0.309 |
+ -0.266 |
+ 0.684 |
+ 0.050 |
+ 99 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG_indels |
+ 0.236 |
+ 0.530 |
+ 0.059 |
+ 0.471 |
+ 0.471 |
+ 0.530 |
+ 0.295 |
+ 0.471 |
+ 0.412 |
+ 0.471 |
+ 0.530 |
+ 0.471 |
+ 0.236 |
+ 0.471 |
+ 0.471 |
+ -0.059 |
+ 0.059 |
+ 0.000 |
+ -0.059 |
+ 0.118 |
+ 0.059 |
+ 0.471 |
+ 0.530 |
+ 82 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels |
+ -0.095 |
+ 0.351 |
+ 0.128 |
+ 0.203 |
+ 0.314 |
+ 0.203 |
+ 0.128 |
+ 0.203 |
+ 0.203 |
+ 0.203 |
+ 0.203 |
+ 0.165 |
+ 0.017 |
+ 0.091 |
+ 0.128 |
+ 0.091 |
+ 0.165 |
+ 0.165 |
+ 0.128 |
+ 0.165 |
+ 0.091 |
+ 0.351 |
+ 0.104 |
+ 171 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels |
+ -0.126 |
+ 0.155 |
+ 0.014 |
+ 0.014 |
+ -0.021 |
+ 0.014 |
+ 0.120 |
+ 0.014 |
+ 0.085 |
+ 0.014 |
+ 0.155 |
+ 0.225 |
+ 0.014 |
+ -0.056 |
+ 0.120 |
+ -0.021 |
+ -0.056 |
+ 0.085 |
+ 0.049 |
+ 0.014 |
+ 0.120 |
+ -0.126 |
+ 0.043 |
+ 147 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T_indels |
+ 0.209 |
+ 0.273 |
+ 0.209 |
+ 0.209 |
+ 0.209 |
+ 0.177 |
+ 0.305 |
+ 0.241 |
+ 0.241 |
+ 0.273 |
+ 0.273 |
+ 0.209 |
+ 0.241 |
+ 0.241 |
+ 0.241 |
+ -0.016 |
+ 0.016 |
+ 0.048 |
+ -0.016 |
+ 0.016 |
+ 0.048 |
+ 0.112 |
+ 0.106 |
+ 156 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8_indels |
+ 0.334 |
+ 0.674 |
+ 0.461 |
+ 0.632 |
+ 0.632 |
+ 0.674 |
+ 0.206 |
+ 0.504 |
+ 0.291 |
+ 0.291 |
+ 0.674 |
+ -0.262 |
+ 0.376 |
+ 0.674 |
+ 0.674 |
+ 0.078 |
+ 0.206 |
+ 0.547 |
+ 0.121 |
+ 0.291 |
+ 0.632 |
+ 0.674 |
+ 0.334 |
+ 101 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5_indels |
+ -0.090 |
+ 0.380 |
+ 0.127 |
+ 0.416 |
+ 0.307 |
+ 0.307 |
+ 0.235 |
+ 0.380 |
+ 0.344 |
+ 0.307 |
+ 0.271 |
+ 0.054 |
+ 0.416 |
+ 0.271 |
+ 0.344 |
+ 0.163 |
+ 0.271 |
+ 0.271 |
+ 0.163 |
+ 0.271 |
+ 0.271 |
+ 0.380 |
+ 0.235 |
+ 156 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM_indels |
+ 0.314 |
+ 0.366 |
+ 0.157 |
+ 0.288 |
+ 0.418 |
+ 0.262 |
+ 0.340 |
+ 0.340 |
+ 0.235 |
+ 0.288 |
+ 0.366 |
+ 0.000 |
+ 0.209 |
+ 0.105 |
+ 0.183 |
+ -0.131 |
+ -0.131 |
+ -0.209 |
+ -0.105 |
+ -0.157 |
+ -0.183 |
+ 0.209 |
+ 0.564 |
+ 154 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD_indels |
+ 0.382 |
+ 0.582 |
+ 0.382 |
+ 0.341 |
+ 0.502 |
+ 0.462 |
+ 0.341 |
+ 0.582 |
+ 0.221 |
+ 0.582 |
+ 0.582 |
+ 0.181 |
+ 0.422 |
+ 0.422 |
+ 0.582 |
+ -0.341 |
+ -0.382 |
+ -0.261 |
+ -0.341 |
+ -0.301 |
+ -0.181 |
+ 0.542 |
+ 0.301 |
+ 104 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.csv b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.csv
new file mode 100644
index 0000000..dd203b5
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.csv
@@ -0,0 +1,24 @@
+Model_rank,Model_name,Model type,Average_MCC,Bootstrap_standard_error_MCC,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Model details,References
+1,Progen2 Base,Protein language model,0.365,0.0,0.471,,0.373,0.275,0.343,0.224,0.312,0.383,0.386,0.393,0.249,0.226,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+2,Tranception M no retrieval,Protein language model,0.361,0.031,0.409,,0.33,0.393,0.311,0.417,0.292,0.339,0.334,0.354,0.248,0.335,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+3,Hidden Markov Model,Alignment-based model,0.357,0.028,0.432,,0.338,0.365,0.292,0.091,0.344,0.29,0.299,0.341,0.28,0.269,Profile Hidden Markov model,HMMER: biosequence analysis using profile hidden Markov models
+4,Progen2 M,Protein language model,0.356,0.015,0.423,,0.366,0.283,0.354,0.246,0.343,0.367,0.357,0.396,0.299,0.29,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+5,TranceptEVE M,Hybrid model,0.35,0.033,0.45,,0.372,0.421,0.157,0.501,0.083,0.266,0.245,0.196,0.141,0.11,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+6,Progen2 L,Protein language model,0.349,0.009,0.433,,0.339,0.28,0.343,0.241,0.319,0.372,0.372,0.368,0.301,0.231,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+7,RITA L,Protein language model,0.346,0.033,0.375,,0.343,0.312,0.355,0.376,0.329,0.368,0.357,0.389,0.257,0.414,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+8,RITA XL,Protein language model,0.341,0.03,0.401,,0.326,0.289,0.349,0.358,0.324,0.362,0.359,0.378,0.248,0.402,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+9,Tranception L no retrieval,Protein language model,0.341,0.035,0.36,,0.331,0.344,0.328,0.411,0.282,0.366,0.339,0.372,0.225,0.396,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+10,RITA M,Protein language model,0.337,0.023,0.387,,0.31,0.326,0.326,0.38,0.29,0.357,0.364,0.329,0.24,0.385,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+11,TranceptEVE L,Hybrid model,0.336,0.034,0.41,,0.38,0.361,0.194,0.438,0.105,0.302,0.268,0.207,0.142,0.244,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+12,Progen2 S,Protein language model,0.335,0.015,0.45,,0.304,0.293,0.294,0.269,0.286,0.316,0.335,0.318,0.245,0.234,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+13,Tranception S no retrieval,Protein language model,0.326,0.025,0.367,,0.31,0.359,0.269,0.391,0.236,0.311,0.272,0.316,0.228,0.321,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,Tranception M,Hybrid model,0.325,0.033,0.417,,0.344,0.406,0.132,0.482,0.06,0.241,0.226,0.166,0.117,0.089,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+15,Tranception L,Hybrid model,0.318,0.034,0.389,,0.374,0.331,0.177,0.404,0.089,0.284,0.246,0.188,0.138,0.216,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,RITA S,Protein language model,0.315,0.022,0.383,,0.298,0.313,0.269,0.309,0.242,0.306,0.28,0.305,0.206,0.339,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+17,Progen2 XL,Protein language model,0.31,0.028,0.306,,0.266,0.304,0.364,0.351,0.355,0.356,0.365,0.361,0.316,0.39,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+18,TranceptEVE S,Hybrid model,0.3,0.033,0.427,,0.32,0.378,0.074,0.451,0.032,0.165,0.158,0.1,0.086,0.089,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+19,Wavenet,Alignment-based model,0.295,0.047,0.352,,0.209,0.264,0.355,0.304,0.349,0.351,0.361,0.346,0.305,0.405,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+20,Tranception S,Hybrid model,0.287,0.034,0.402,,0.306,0.38,0.059,0.45,0.024,0.145,0.14,0.09,0.072,0.084,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+21,Provean,Alignment-based model,0.273,0.041,0.263,,0.296,0.305,0.229,0.339,0.242,0.225,0.18,0.249,0.274,0.351,Provean model,"Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one."
+22,ProtGPT2,Protein language model,0.151,0.045,0.181,,0.194,0.042,0.186,0.236,0.135,0.205,0.239,0.176,0.109,0.066,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+23,Unirep,Protein language model,0.13,0.062,0.098,,0.146,0.037,0.239,0.135,0.194,0.238,0.226,0.252,0.135,0.215,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.html b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.html
new file mode 100644
index 0000000..2b8da55
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_DMS_indels_MCC.html
@@ -0,0 +1,531 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_MCC |
+ Bootstrap_standard_error_MCC |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ TranceptEVE M |
+ Hybrid model |
+ 0.387 |
+ 0.000 |
+ 0.580 |
+ NaN |
+ 0.373 |
+ 0.432 |
+ 0.164 |
+ 0.519 |
+ 0.126 |
+ 0.251 |
+ 0.255 |
+ 0.230 |
+ 0.146 |
+ 0.082 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 2 |
+ TranceptEVE L |
+ Hybrid model |
+ 0.380 |
+ 0.012 |
+ 0.544 |
+ NaN |
+ 0.382 |
+ 0.381 |
+ 0.214 |
+ 0.471 |
+ 0.144 |
+ 0.314 |
+ 0.302 |
+ 0.257 |
+ 0.142 |
+ 0.214 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 3 |
+ Progen2 Base |
+ Protein language model |
+ 0.375 |
+ 0.034 |
+ 0.472 |
+ NaN |
+ 0.374 |
+ 0.291 |
+ 0.365 |
+ 0.251 |
+ 0.317 |
+ 0.416 |
+ 0.407 |
+ 0.438 |
+ 0.236 |
+ 0.240 |
+ Progen2 base model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 4 |
+ Progen2 M |
+ Protein language model |
+ 0.369 |
+ 0.039 |
+ 0.423 |
+ NaN |
+ 0.367 |
+ 0.296 |
+ 0.389 |
+ 0.267 |
+ 0.358 |
+ 0.414 |
+ 0.387 |
+ 0.450 |
+ 0.305 |
+ 0.314 |
+ Progen2 medium model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 5 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.368 |
+ 0.039 |
+ 0.409 |
+ NaN |
+ 0.332 |
+ 0.411 |
+ 0.319 |
+ 0.447 |
+ 0.277 |
+ 0.366 |
+ 0.365 |
+ 0.350 |
+ 0.217 |
+ 0.382 |
+ Tranception Medium model (300M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 6 |
+ Progen2 L |
+ Protein language model |
+ 0.359 |
+ 0.035 |
+ 0.433 |
+ NaN |
+ 0.340 |
+ 0.293 |
+ 0.371 |
+ 0.262 |
+ 0.325 |
+ 0.414 |
+ 0.403 |
+ 0.405 |
+ 0.287 |
+ 0.278 |
+ Progen2 large model (2.7B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 7 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.352 |
+ 0.034 |
+ 0.361 |
+ NaN |
+ 0.332 |
+ 0.371 |
+ 0.345 |
+ 0.456 |
+ 0.275 |
+ 0.402 |
+ 0.378 |
+ 0.378 |
+ 0.216 |
+ 0.426 |
+ Tranception Large model (700M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 8 |
+ Hidden Markov Model |
+ Alignment-based model |
+ 0.351 |
+ 0.025 |
+ 0.433 |
+ NaN |
+ 0.338 |
+ 0.387 |
+ 0.245 |
+ 0.129 |
+ 0.293 |
+ 0.255 |
+ 0.280 |
+ 0.263 |
+ 0.246 |
+ 0.271 |
+ Profile Hidden Markov model |
+ HMMER: biosequence analysis using profile hidden Markov models |
+
+
+ 9 |
+ RITA XL |
+ Protein language model |
+ 0.350 |
+ 0.029 |
+ 0.401 |
+ NaN |
+ 0.326 |
+ 0.319 |
+ 0.355 |
+ 0.408 |
+ 0.331 |
+ 0.366 |
+ 0.368 |
+ 0.378 |
+ 0.238 |
+ 0.468 |
+ RITA xlarge model (1.2B params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 10 |
+ RITA L |
+ Protein language model |
+ 0.348 |
+ 0.031 |
+ 0.375 |
+ NaN |
+ 0.344 |
+ 0.325 |
+ 0.349 |
+ 0.397 |
+ 0.325 |
+ 0.362 |
+ 0.359 |
+ 0.369 |
+ 0.244 |
+ 0.466 |
+ RITA large model (680M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 11 |
+ Progen2 S |
+ Protein language model |
+ 0.346 |
+ 0.034 |
+ 0.451 |
+ NaN |
+ 0.306 |
+ 0.294 |
+ 0.334 |
+ 0.271 |
+ 0.300 |
+ 0.374 |
+ 0.373 |
+ 0.363 |
+ 0.268 |
+ 0.249 |
+ Progen2 small model (150M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 12 |
+ RITA M |
+ Protein language model |
+ 0.339 |
+ 0.021 |
+ 0.388 |
+ NaN |
+ 0.311 |
+ 0.327 |
+ 0.332 |
+ 0.381 |
+ 0.273 |
+ 0.383 |
+ 0.378 |
+ 0.334 |
+ 0.218 |
+ 0.419 |
+ RITA medium model (300M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 13 |
+ Tranception M |
+ Hybrid model |
+ 0.334 |
+ 0.024 |
+ 0.417 |
+ NaN |
+ 0.346 |
+ 0.426 |
+ 0.145 |
+ 0.517 |
+ 0.096 |
+ 0.231 |
+ 0.240 |
+ 0.207 |
+ 0.097 |
+ 0.077 |
+ Tranception Medium model (300M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 14 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.333 |
+ 0.033 |
+ 0.368 |
+ NaN |
+ 0.312 |
+ 0.373 |
+ 0.280 |
+ 0.414 |
+ 0.256 |
+ 0.312 |
+ 0.280 |
+ 0.332 |
+ 0.238 |
+ 0.324 |
+ Tranception Small model (85M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 15 |
+ Tranception L |
+ Hybrid model |
+ 0.329 |
+ 0.025 |
+ 0.389 |
+ NaN |
+ 0.375 |
+ 0.362 |
+ 0.190 |
+ 0.456 |
+ 0.106 |
+ 0.291 |
+ 0.278 |
+ 0.231 |
+ 0.096 |
+ 0.187 |
+ Tranception Large model (700M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 16 |
+ Progen2 XL |
+ Protein language model |
+ 0.326 |
+ 0.017 |
+ 0.307 |
+ NaN |
+ 0.268 |
+ 0.333 |
+ 0.396 |
+ 0.398 |
+ 0.375 |
+ 0.393 |
+ 0.391 |
+ 0.411 |
+ 0.338 |
+ 0.384 |
+ Progen2 xlarge model (6.4B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 17 |
+ RITA S |
+ Protein language model |
+ 0.324 |
+ 0.028 |
+ 0.383 |
+ NaN |
+ 0.299 |
+ 0.327 |
+ 0.287 |
+ 0.333 |
+ 0.264 |
+ 0.318 |
+ 0.305 |
+ 0.309 |
+ 0.226 |
+ 0.360 |
+ RITA small model (85M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 18 |
+ TranceptEVE S |
+ Hybrid model |
+ 0.303 |
+ 0.024 |
+ 0.427 |
+ NaN |
+ 0.322 |
+ 0.391 |
+ 0.070 |
+ 0.472 |
+ 0.057 |
+ 0.136 |
+ 0.142 |
+ 0.137 |
+ 0.071 |
+ 0.043 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 19 |
+ Tranception S |
+ Hybrid model |
+ 0.291 |
+ 0.022 |
+ 0.403 |
+ NaN |
+ 0.308 |
+ 0.392 |
+ 0.061 |
+ 0.471 |
+ 0.051 |
+ 0.125 |
+ 0.135 |
+ 0.123 |
+ 0.062 |
+ 0.052 |
+ Tranception Small model (85M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 20 |
+ Provean |
+ Alignment-based model |
+ 0.279 |
+ 0.024 |
+ 0.263 |
+ NaN |
+ 0.298 |
+ 0.314 |
+ 0.240 |
+ 0.354 |
+ 0.238 |
+ 0.247 |
+ 0.205 |
+ 0.256 |
+ 0.288 |
+ 0.301 |
+ Provean model |
+ Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one. |
+
+
+ 21 |
+ Wavenet |
+ Alignment-based model |
+ 0.237 |
+ 0.036 |
+ 0.353 |
+ NaN |
+ 0.211 |
+ 0.294 |
+ 0.089 |
+ 0.158 |
+ 0.172 |
+ 0.072 |
+ 0.056 |
+ 0.165 |
+ 0.106 |
+ 0.274 |
+ Wavenet model |
+ Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12. |
+
+
+ 22 |
+ ProtGPT2 |
+ Protein language model |
+ 0.156 |
+ 0.028 |
+ 0.182 |
+ NaN |
+ 0.196 |
+ 0.071 |
+ 0.174 |
+ 0.285 |
+ 0.128 |
+ 0.190 |
+ 0.229 |
+ 0.182 |
+ 0.091 |
+ 0.046 |
+ ProtGPT2 model |
+ Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13. |
+
+
+ 23 |
+ Unirep |
+ Protein language model |
+ 0.135 |
+ 0.058 |
+ 0.098 |
+ NaN |
+ 0.147 |
+ 0.030 |
+ 0.266 |
+ 0.122 |
+ 0.210 |
+ 0.270 |
+ 0.241 |
+ 0.288 |
+ 0.152 |
+ 0.236 |
+ Unirep model |
+ Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8. |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_all_models_indels_MCC.csv b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_all_models_indels_MCC.csv
new file mode 100644
index 0000000..7fbf1b0
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/Summary_performance_all_models_indels_MCC.csv
@@ -0,0 +1,22 @@
+Model_rank,Model_name,Model type,Average_MCC,Bootstrap_standard_error_MCC,Model details,References
+1,TranceptEVE M,Hybrid model,0.431,0.0,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+2,Tranception M,Hybrid model,0.427,0.002,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+3,Tranception S,Hybrid model,0.396,0.018,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+4,TranceptEVE L,Hybrid model,0.383,0.018,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+5,Tranception L,Hybrid model,0.375,0.018,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,Tranception L no retrieval,Protein language model,0.365,0.018,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+7,RITA (ensemble),Protein language model,0.35,0.028,Ensemble of the 4 RITA models,"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+8,RITA XL,Protein language model,0.345,0.027,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+9,Progen2 (ensemble),Protein language model,0.341,0.091,Ensemble of the 5 Progen2 models," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+10,Wavenet,Alignment-based model,0.341,0.049,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+11,Unirep evotuned,Hybrid model,0.333,0.045,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+12,RITA M,Protein language model,0.33,0.03,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+13,Progen2 Base,Protein language model,0.323,0.093,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+14,RITA L,Protein language model,0.32,0.03,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+15,Progen2 L,Protein language model,0.306,0.097,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+16,Progen2 XL,Protein language model,0.303,0.037,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+17,RITA S,Protein language model,0.299,0.036,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+18,Progen2 M,Protein language model,0.297,0.096,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+19,Progen2 S,Protein language model,0.276,0.086,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+20,Unirep,Protein language model,0.107,0.062,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+21,ProtGPT2,Protein language model,0.089,0.078,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
diff --git a/benchmarks/DMS_zero_shot/indels/MCC/all_models_indels_MCC_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/MCC/all_models_indels_MCC_DMS_level.csv
new file mode 100644
index 0000000..b2f95c3
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/MCC/all_models_indels_MCC_DMS_level.csv
@@ -0,0 +1,9 @@
+,Tranception_L_no_retrieval,Tranception_S_retrieval,Tranception_M_retrieval,Tranception_L_retrieval,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,RITA_ensemble,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,Progen2_ensemble,Unirep,Unirep_evotune,ProtGPT2,TranceptEVE_L,TranceptEVE_M,number_mutants
+A0A1J4YT16_9PROT_Davidi_2020,0.143,0.181,0.219,0.143,0.143,0.028,0.143,0.181,0.219,0.105,0.066,0.181,0.181,0.143,0.105,0.105,-0.048,0.295,0.143,0.181,0.219,105
+B1LPA6_ECOSM_Russ_2020,0.264,0.357,0.351,0.301,0.274,0.261,0.325,0.268,0.263,0.293,0.241,0.253,0.301,0.264,0.218,0.275,0.241,0.35,0.234,0.303,0.361,3074
+BLAT_ECOLX_Gonzalez_indels_2019,0.225,0.285,0.274,0.246,0.357,0.267,0.245,0.179,0.245,0.28,0.304,0.445,0.479,0.475,0.257,0.521,0.053,0.171,0.078,0.254,0.274,4751
+CAPSD_AAV2S_Sinai_indels_2021,0.432,0.451,0.452,0.465,0.371,0.176,0.224,0.331,0.355,0.281,-0.221,-0.226,-0.176,-0.227,0.291,-0.132,0.28,0.384,0.077,0.471,0.464,250907
+HIS7_YEAST_Pokusaeva_indels_2019,0.596,0.53,0.568,0.578,0.567,0.555,0.555,0.577,0.571,0.572,0.594,0.61,0.586,0.602,0.597,0.61,-0.027,0.239,-0.087,0.577,0.572,6102
+PTEN_HUMAN_Mighell_deletions_2018,0.516,0.618,0.631,0.541,0.554,0.503,0.49,0.376,0.478,0.605,0.592,0.465,0.605,0.529,0.338,0.631,0.274,0.478,0.057,0.529,0.631,314
+P53_HUMAN_Kotler_deletions_2018,0.379,0.353,0.496,0.353,0.121,0.302,0.328,0.328,0.289,0.315,0.353,0.353,0.289,0.353,0.315,0.379,-0.021,0.418,0.121,0.366,0.496,341
+Average,0.365,0.396,0.427,0.375,0.341,0.299,0.33,0.32,0.345,0.35,0.276,0.297,0.323,0.306,0.303,0.341,0.107,0.333,0.089,0.383,0.431,37942
diff --git a/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.csv
new file mode 100644
index 0000000..1ed3b55
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS ID,Unirep,Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,Hidden Markov Model,Provean,Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A4_HUMAN_Seuma_2022_indels,0.729,0.758,0.781,0.708,0.689,0.696,0.778,0.765,0.767,0.779,0.751,0.759,0.793,0.654,0.702,0.793,0.701,0.705,0.791,0.696,0.735,0.416,0.769,2346,Stability,A4_HUMAN,Low,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,0.331,0.882,0.26,0.467,0.837,0.921,0.374,0.871,0.883,0.848,0.892,0.921,0.325,0.24,0.336,0.221,0.281,0.31,0.225,0.307,0.352,0.822,0.845,117,Stability,AMFR_HUMAN,Medium,Human
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,0.74,0.935,0.933,0.973,0.969,0.984,0.786,0.984,0.982,0.987,0.981,0.58,0.895,0.813,0.971,0.785,0.781,0.921,0.801,0.795,0.926,0.796,0.881,181,Stability,ARGR_ECOLI,Medium,Prokaryote
+B1LPA6_ECOSM_Russ_2020_indels,0.702,0.75,0.697,0.707,0.707,0.711,0.735,0.751,0.722,0.733,0.726,0.684,0.669,0.716,0.693,0.718,0.73,0.704,0.71,0.724,0.713,0.75,0.686,3074,Activity,B1LPA6_ECOSM,Medium,Prokaryote
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,0.588,0.737,0.398,0.503,0.601,0.743,0.603,0.9,0.863,0.855,0.819,0.426,0.661,0.688,0.456,0.485,0.629,0.525,0.489,0.664,0.488,0.454,0.938,134,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,0.579,0.862,0.507,0.58,0.37,0.665,0.713,0.762,0.453,0.93,0.976,0.434,0.496,0.698,0.817,0.341,0.453,0.792,0.484,0.471,0.813,0.837,0.958,82,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Gonzalez_2019_indels,0.256,0.577,0.505,0.552,0.534,0.44,0.463,0.582,0.593,0.58,0.406,0.21,0.515,0.5,0.387,0.451,0.466,0.396,0.481,0.52,0.392,0.225,0.513,4751,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.512,0.886,0.784,0.8,0.894,0.918,0.468,0.462,0.49,0.459,0.858,0.716,0.804,0.808,0.937,0.924,0.916,0.929,0.923,0.929,0.936,0.898,0.904,225998,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAPSD_AAV2S_Sinai_2021_library_indels,0.472,0.467,0.551,0.597,0.573,0.613,0.472,0.479,0.483,0.481,0.642,0.539,0.55,0.589,0.641,0.695,0.699,0.709,0.697,0.69,0.749,0.466,0.491,24908,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,0.888,0.872,0.882,0.885,0.88,0.874,0.87,0.885,0.881,0.856,0.875,0.671,0.852,0.877,0.872,0.909,0.875,0.871,0.908,0.872,0.861,0.669,0.876,197,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,0.719,0.933,0.606,0.738,0.659,0.645,0.608,0.729,0.771,0.732,0.917,0.474,0.621,0.647,0.578,0.57,0.61,0.566,0.569,0.631,0.569,0.852,0.838,205,Stability,CBPA2_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,0.519,0.698,0.686,0.791,0.791,0.829,0.847,0.826,0.826,0.848,0.859,0.416,0.725,0.806,0.819,0.623,0.551,0.653,0.65,0.588,0.695,0.77,0.822,129,Stability,CBX4_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,0.478,0.959,0.62,0.535,0.888,0.959,0.422,0.966,0.935,0.965,0.965,0.618,0.48,0.866,0.862,0.437,0.819,0.843,0.413,0.797,0.836,0.897,0.932,195,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,0.583,0.887,0.673,0.596,0.591,0.457,0.427,0.461,0.473,0.529,0.799,0.418,0.504,0.462,0.401,0.325,0.352,0.33,0.325,0.354,0.345,0.593,0.976,140,Stability,CUE1_YEAST,Medium,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,0.525,0.808,0.474,0.666,0.623,0.606,0.702,0.59,0.576,0.757,0.755,0.415,0.558,0.455,0.562,0.384,0.392,0.445,0.398,0.377,0.404,0.591,0.96,136,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,0.863,0.811,0.949,0.949,0.967,0.951,0.961,0.958,0.983,0.965,0.967,0.7,0.955,0.952,0.971,0.928,0.954,0.945,0.93,0.954,0.943,0.758,0.836,174,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,0.62,0.862,0.914,0.926,0.891,0.915,0.849,0.924,0.853,0.875,0.952,0.633,0.667,0.664,0.753,0.643,0.656,0.705,0.656,0.663,0.782,0.595,0.921,154,Stability,DOCK1_MOUSE,High,Eukaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,0.518,0.907,0.972,0.977,0.979,0.973,0.97,0.979,0.981,0.975,0.972,0.65,0.957,0.978,0.971,0.757,0.787,0.822,0.783,0.825,0.823,0.705,0.841,185,Stability,EPHB2_HUMAN,High,Human
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,0.558,0.561,0.46,0.788,0.75,0.757,0.421,0.817,0.846,0.812,0.791,0.503,0.26,0.246,0.326,0.262,0.261,0.279,0.265,0.276,0.276,0.714,0.787,193,Stability,FECA_ECOLI,High,Eukaryote
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,0.564,0.927,0.754,0.893,0.867,0.845,0.839,0.645,0.762,0.787,0.909,0.661,0.649,0.616,0.887,0.43,0.503,0.883,0.426,0.421,0.885,0.844,0.933,148,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,0.665,0.888,0.749,0.889,0.881,0.887,0.654,0.841,0.81,0.82,0.822,0.5,0.518,0.637,0.586,0.647,0.674,0.617,0.641,0.665,0.586,0.791,0.786,154,Stability,HECD1_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019_indels,0.094,0.863,0.772,0.748,0.795,0.808,0.73,0.837,0.888,0.857,0.888,0.298,0.723,0.862,0.874,0.8,0.728,0.851,0.818,0.849,0.881,0.748,0.879,6102,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,0.488,0.85,0.816,0.852,0.853,0.862,0.836,0.856,0.836,0.852,0.834,0.871,0.585,0.84,0.833,0.453,0.777,0.799,0.491,0.776,0.808,0.891,0.832,193,Stability,ILF3_HUMAN,High,Human
+KCNJ2_MOUSE_Macdonald_2022_indels,0.662,0.772,0.775,0.78,0.768,0.771,0.797,0.786,0.776,0.794,0.77,0.69,0.773,0.771,0.768,0.776,0.764,0.771,0.776,0.771,0.771,0.765,0.774,10501,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,0.626,0.694,0.518,0.751,0.821,0.841,0.688,0.802,0.834,0.755,0.725,0.512,0.432,0.7,0.714,0.498,0.706,0.713,0.5,0.706,0.678,0.532,0.599,115,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,0.494,0.872,0.507,0.612,0.749,0.655,0.734,0.91,0.907,0.852,0.922,0.802,0.553,0.551,0.447,0.618,0.622,0.618,0.631,0.624,0.636,0.737,0.918,131,Stability,MBD11_ARATH,Medium,Eukaryote
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,0.401,0.753,0.766,0.571,0.708,0.669,0.935,0.858,0.626,0.899,0.987,0.518,0.562,0.653,0.545,0.595,0.611,0.574,0.609,0.649,0.634,0.47,0.917,80,Stability,MYO3_YEAST,High,Eukaryote
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,0.908,0.778,0.837,0.879,0.85,0.836,0.884,0.805,0.917,0.711,0.789,0.723,0.907,0.826,0.715,0.907,0.91,0.828,0.908,0.904,0.81,0.825,0.826,178,Stability,NKX31_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,0.664,0.81,0.876,0.841,0.877,0.892,0.652,0.839,0.679,0.921,0.916,0.343,0.699,0.876,0.734,0.584,0.658,0.63,0.617,0.73,0.665,0.652,0.902,191,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,0.504,0.685,0.865,0.891,0.88,0.891,0.859,0.884,0.857,0.875,0.865,0.479,0.857,0.84,0.783,0.672,0.649,0.682,0.674,0.643,0.7,0.436,0.69,157,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,0.68,0.862,0.908,0.942,0.946,0.919,0.887,0.93,0.934,0.884,0.89,0.766,0.724,0.507,0.702,0.445,0.446,0.554,0.468,0.47,0.559,0.671,0.764,169,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,0.501,0.92,0.924,0.988,0.947,0.958,0.982,0.986,0.983,0.976,0.97,0.871,0.931,0.662,0.786,0.776,0.792,0.575,0.776,0.954,0.692,0.767,0.972,47,Stability,ODP2_GEOSE,High,Prokaryote
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,0.359,0.579,0.398,0.883,0.903,0.924,0.784,0.925,0.927,0.949,0.939,0.715,0.505,0.543,0.798,0.241,0.327,0.479,0.248,0.327,0.593,0.473,0.949,84,Stability,OTU7A_HUMAN,High,Human
+P53_HUMAN_Kotler_2018_indels,0.291,0.447,0.558,0.408,0.456,0.693,0.533,0.692,0.616,0.661,0.653,0.19,0.66,0.566,0.681,0.576,0.543,0.629,0.614,0.501,0.616,0.548,0.653,341,OrganismalFitness,P53_HUMAN,Low,Human
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,0.766,0.853,0.844,0.894,0.874,0.846,0.833,0.837,0.842,0.84,0.846,0.851,0.689,0.921,0.849,0.681,0.967,0.858,0.666,0.965,0.85,0.931,0.838,106,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,0.89,0.798,0.94,0.907,0.888,0.91,0.886,0.934,0.969,0.922,0.967,0.71,0.922,0.921,0.937,0.68,0.701,0.75,0.722,0.773,0.782,0.67,0.903,117,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,0.311,0.857,0.905,0.907,0.92,0.915,0.895,0.909,0.905,0.928,0.921,0.769,0.442,0.884,0.895,0.698,0.894,0.9,0.719,0.893,0.9,0.855,0.916,187,Stability,PKN1_HUMAN,High,Human
+POLG_PESV_Tsuboyama_2023_2MXD_indels,0.641,0.905,0.591,0.603,0.602,0.648,0.631,0.592,0.576,0.494,0.638,0.59,0.637,0.596,0.667,0.518,0.459,0.514,0.636,0.473,0.522,0.684,0.859,149,Stability,POLG_PESV,Medium,Virus
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,0.463,0.963,0.911,0.978,0.979,0.983,0.954,0.985,0.96,0.983,0.982,0.783,0.689,0.78,0.781,0.693,0.762,0.74,0.729,0.773,0.74,0.808,0.739,168,Stability,PR40A_HUMAN,Medium,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE_indels,0.371,0.508,0.41,0.271,0.375,0.647,0.458,0.631,0.374,0.599,0.636,0.63,0.447,0.753,0.591,0.514,0.628,0.548,0.476,0.679,0.626,0.749,0.602,175,Stability,PSAE_SYNP2,Medium,Prokaryote
+PTEN_HUMAN_Mighell_2018_indels,0.325,0.872,0.799,0.874,0.87,0.799,0.863,0.854,0.892,0.837,0.837,0.375,0.877,0.891,0.907,0.869,0.904,0.908,0.867,0.888,0.9,0.849,0.897,314,Activity,PTEN_HUMAN,Medium,Human
+Q8EG35_SHEON_Campbell_2022_indels,0.512,0.131,0.394,0.351,0.299,0.352,0.55,0.395,0.255,0.288,0.202,0.396,0.549,0.597,0.486,0.451,0.538,0.517,0.448,0.581,0.473,0.441,0.16,331,OrganismalFitness,Q8EG35_SHEON,Medium,Prokaryote
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,0.6,0.825,0.786,0.836,0.825,0.846,0.876,0.844,0.795,0.832,0.874,0.744,0.56,0.809,0.791,0.786,0.839,0.87,0.804,0.837,0.868,0.674,0.796,97,Stability,RAD_ANTMA,High,Eukaryote
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,0.409,0.842,0.484,0.595,0.504,0.642,0.331,0.696,0.651,0.653,0.9,0.462,0.386,0.413,0.487,0.448,0.372,0.346,0.438,0.382,0.348,0.774,0.772,124,Stability,RCD1_ARATH,Medium,Eukaryote
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,0.663,0.93,0.951,0.965,0.914,0.983,0.982,0.978,0.956,0.979,0.973,0.97,0.806,0.937,0.958,0.795,0.922,0.891,0.825,0.928,0.912,0.696,0.98,120,Stability,RD23A_HUMAN,High,Human
+RPC1_BP434_Tsuboyama_2023_1R69_indels,0.863,0.687,0.783,0.82,0.818,0.832,0.837,0.851,0.829,0.847,0.847,0.554,0.563,0.803,0.871,0.558,0.71,0.809,0.526,0.739,0.834,0.523,0.768,164,Stability,RPC1_BP434,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,0.7,0.851,0.884,0.89,0.883,0.888,0.851,0.882,0.909,0.888,0.888,0.68,0.889,0.878,0.872,0.828,0.794,0.789,0.86,0.801,0.795,0.802,0.782,176,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance_indels,0.649,0.646,0.72,0.81,0.825,0.792,0.77,0.855,0.853,0.81,0.789,0.614,0.802,0.835,0.805,0.738,0.692,0.765,0.755,0.802,0.769,0.555,0.766,430,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity_indels,0.552,0.486,0.602,0.721,0.762,0.727,0.7,0.784,0.821,0.772,0.733,0.498,0.742,0.787,0.771,0.636,0.622,0.705,0.664,0.751,0.712,0.416,0.694,490,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,0.764,0.844,0.85,0.861,0.849,0.864,0.863,0.849,0.858,0.857,0.833,0.894,0.838,0.84,0.831,0.769,0.853,0.843,0.852,0.853,0.846,0.841,0.851,86,Stability,SAV1_MOUSE,High,Eukaryote
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,0.587,0.949,0.674,0.707,0.692,0.743,0.807,0.857,0.587,0.804,0.951,0.571,0.708,0.675,0.935,0.661,0.696,0.782,0.661,0.701,0.88,0.81,0.965,127,Stability,SDA_BACSU,Medium,Prokaryote
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,0.714,0.857,0.893,0.939,0.955,0.958,0.92,0.961,0.949,0.967,0.971,0.979,0.54,0.984,0.963,0.466,0.805,0.859,0.581,0.845,0.888,0.782,0.979,109,Stability,SOX30_HUMAN,High,Human
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,0.394,0.755,0.526,0.579,0.55,0.577,0.672,0.769,0.761,0.834,0.805,0.426,0.447,0.507,0.819,0.563,0.592,0.675,0.566,0.591,0.698,0.579,0.835,148,Stability,SPG2_STRSG,Medium,Prokaryote
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,0.584,0.684,0.891,0.824,0.933,0.88,0.932,0.927,0.795,0.948,0.939,0.428,0.714,0.688,0.823,0.653,0.682,0.766,0.706,0.665,0.796,0.491,0.479,129,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,0.526,0.691,0.767,0.918,0.919,0.922,0.736,0.908,0.951,0.929,0.909,0.591,0.445,0.858,0.937,0.483,0.834,0.81,0.549,0.835,0.849,0.44,0.955,111,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,0.517,0.79,0.758,0.723,0.729,0.636,0.534,0.439,0.599,0.436,0.581,0.53,0.709,0.39,0.599,0.774,0.606,0.731,0.791,0.609,0.74,0.67,0.758,135,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,0.692,0.918,0.935,0.904,0.935,0.915,0.933,0.918,0.913,0.937,0.934,0.809,0.873,0.941,0.868,0.921,0.92,0.892,0.924,0.929,0.906,0.922,0.927,154,Stability,SRBS1_HUMAN,High,Human
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,0.829,0.956,0.967,0.972,0.974,0.982,0.945,0.981,0.973,0.981,0.985,0.812,0.834,0.892,0.952,0.376,0.572,0.633,0.375,0.592,0.759,0.917,0.976,99,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,0.338,0.844,0.259,0.536,0.546,0.497,0.648,0.768,0.693,0.683,0.874,0.461,0.325,0.841,0.519,0.227,0.734,0.744,0.234,0.837,0.82,0.625,0.988,82,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,0.431,0.599,0.702,0.881,0.844,0.881,0.899,0.906,0.901,0.898,0.906,0.467,0.454,0.433,0.728,0.481,0.495,0.654,0.504,0.447,0.712,0.645,0.831,171,Stability,TNKS2_HUMAN,High,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,0.178,0.763,0.21,0.783,0.803,0.825,0.799,0.829,0.801,0.83,0.823,0.6,0.417,0.238,0.755,0.376,0.309,0.65,0.336,0.314,0.643,0.398,0.72,147,Stability,UBE4B_HUMAN,High,Human
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,0.608,0.88,0.945,0.917,0.955,0.952,0.941,0.953,0.94,0.954,0.956,0.54,0.878,0.909,0.831,0.6,0.79,0.702,0.596,0.794,0.75,0.904,0.844,156,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,0.943,0.908,0.783,0.903,0.871,0.936,0.935,0.923,0.915,0.917,0.907,0.613,0.858,0.904,0.931,0.727,0.864,0.913,0.697,0.871,0.908,0.929,0.843,101,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,0.612,0.905,0.699,0.686,0.67,0.668,0.686,0.738,0.709,0.664,0.888,0.77,0.67,0.899,0.767,0.754,0.879,0.848,0.753,0.891,0.847,0.588,0.861,156,Stability,VILI_CHICK,High,Eukaryote
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,0.74,0.864,0.589,0.722,0.749,0.704,0.761,0.685,0.636,0.645,0.798,0.613,0.664,0.67,0.718,0.69,0.701,0.655,0.652,0.686,0.654,0.741,0.876,154,Stability,VRPI_BPT7,Medium,Virus
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,0.705,0.96,0.658,0.772,0.937,0.941,0.901,0.932,0.698,0.925,0.897,0.637,0.644,0.881,0.912,0.439,0.507,0.734,0.442,0.516,0.725,0.923,0.887,104,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.html b/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.html
new file mode 100644
index 0000000..d138847
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/NDCG/DMS_indels_NDCG_DMS_level.html
@@ -0,0 +1,2083 @@
+
+
+
+ score |
+ Unirep |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ Hidden Markov Model |
+ Provean |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A4_HUMAN_Seuma_2022_indels |
+ 0.729 |
+ 0.758 |
+ 0.781 |
+ 0.708 |
+ 0.689 |
+ 0.696 |
+ 0.778 |
+ 0.765 |
+ 0.767 |
+ 0.779 |
+ 0.751 |
+ 0.759 |
+ 0.793 |
+ 0.654 |
+ 0.702 |
+ 0.793 |
+ 0.701 |
+ 0.705 |
+ 0.791 |
+ 0.696 |
+ 0.735 |
+ 0.416 |
+ 0.769 |
+ 2346 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O_indels |
+ 0.331 |
+ 0.882 |
+ 0.260 |
+ 0.467 |
+ 0.837 |
+ 0.921 |
+ 0.374 |
+ 0.871 |
+ 0.883 |
+ 0.848 |
+ 0.892 |
+ 0.921 |
+ 0.325 |
+ 0.240 |
+ 0.336 |
+ 0.221 |
+ 0.281 |
+ 0.310 |
+ 0.225 |
+ 0.307 |
+ 0.352 |
+ 0.822 |
+ 0.845 |
+ 117 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY_indels |
+ 0.740 |
+ 0.935 |
+ 0.933 |
+ 0.973 |
+ 0.969 |
+ 0.984 |
+ 0.786 |
+ 0.984 |
+ 0.982 |
+ 0.987 |
+ 0.981 |
+ 0.580 |
+ 0.895 |
+ 0.813 |
+ 0.971 |
+ 0.785 |
+ 0.781 |
+ 0.921 |
+ 0.801 |
+ 0.795 |
+ 0.926 |
+ 0.796 |
+ 0.881 |
+ 181 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B1LPA6_ECOSM_Russ_2020_indels |
+ 0.702 |
+ 0.750 |
+ 0.697 |
+ 0.707 |
+ 0.707 |
+ 0.711 |
+ 0.735 |
+ 0.751 |
+ 0.722 |
+ 0.733 |
+ 0.726 |
+ 0.684 |
+ 0.669 |
+ 0.716 |
+ 0.693 |
+ 0.718 |
+ 0.730 |
+ 0.704 |
+ 0.710 |
+ 0.724 |
+ 0.713 |
+ 0.750 |
+ 0.686 |
+ 3074 |
+ Activity |
+ B1LPA6_ECOSM |
+ Medium |
+ Prokaryote |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0_indels |
+ 0.588 |
+ 0.737 |
+ 0.398 |
+ 0.503 |
+ 0.601 |
+ 0.743 |
+ 0.603 |
+ 0.900 |
+ 0.863 |
+ 0.855 |
+ 0.819 |
+ 0.426 |
+ 0.661 |
+ 0.688 |
+ 0.456 |
+ 0.485 |
+ 0.629 |
+ 0.525 |
+ 0.489 |
+ 0.664 |
+ 0.488 |
+ 0.454 |
+ 0.938 |
+ 134 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU_indels |
+ 0.579 |
+ 0.862 |
+ 0.507 |
+ 0.580 |
+ 0.370 |
+ 0.665 |
+ 0.713 |
+ 0.762 |
+ 0.453 |
+ 0.930 |
+ 0.976 |
+ 0.434 |
+ 0.496 |
+ 0.698 |
+ 0.817 |
+ 0.341 |
+ 0.453 |
+ 0.792 |
+ 0.484 |
+ 0.471 |
+ 0.813 |
+ 0.837 |
+ 0.958 |
+ 82 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Gonzalez_2019_indels |
+ 0.256 |
+ 0.577 |
+ 0.505 |
+ 0.552 |
+ 0.534 |
+ 0.440 |
+ 0.463 |
+ 0.582 |
+ 0.593 |
+ 0.580 |
+ 0.406 |
+ 0.210 |
+ 0.515 |
+ 0.500 |
+ 0.387 |
+ 0.451 |
+ 0.466 |
+ 0.396 |
+ 0.481 |
+ 0.520 |
+ 0.392 |
+ 0.225 |
+ 0.513 |
+ 4751 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ CAPSD_AAV2S_Sinai_2021_designed_indels |
+ 0.512 |
+ 0.886 |
+ 0.784 |
+ 0.800 |
+ 0.894 |
+ 0.918 |
+ 0.468 |
+ 0.462 |
+ 0.490 |
+ 0.459 |
+ 0.858 |
+ 0.716 |
+ 0.804 |
+ 0.808 |
+ 0.937 |
+ 0.924 |
+ 0.916 |
+ 0.929 |
+ 0.923 |
+ 0.929 |
+ 0.936 |
+ 0.898 |
+ 0.904 |
+ 225998 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAPSD_AAV2S_Sinai_2021_library_indels |
+ 0.472 |
+ 0.467 |
+ 0.551 |
+ 0.597 |
+ 0.573 |
+ 0.613 |
+ 0.472 |
+ 0.479 |
+ 0.483 |
+ 0.481 |
+ 0.642 |
+ 0.539 |
+ 0.550 |
+ 0.589 |
+ 0.641 |
+ 0.695 |
+ 0.699 |
+ 0.709 |
+ 0.697 |
+ 0.690 |
+ 0.749 |
+ 0.466 |
+ 0.491 |
+ 24908 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI_indels |
+ 0.888 |
+ 0.872 |
+ 0.882 |
+ 0.885 |
+ 0.880 |
+ 0.874 |
+ 0.870 |
+ 0.885 |
+ 0.881 |
+ 0.856 |
+ 0.875 |
+ 0.671 |
+ 0.852 |
+ 0.877 |
+ 0.872 |
+ 0.909 |
+ 0.875 |
+ 0.871 |
+ 0.908 |
+ 0.872 |
+ 0.861 |
+ 0.669 |
+ 0.876 |
+ 197 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels |
+ 0.719 |
+ 0.933 |
+ 0.606 |
+ 0.738 |
+ 0.659 |
+ 0.645 |
+ 0.608 |
+ 0.729 |
+ 0.771 |
+ 0.732 |
+ 0.917 |
+ 0.474 |
+ 0.621 |
+ 0.647 |
+ 0.578 |
+ 0.570 |
+ 0.610 |
+ 0.566 |
+ 0.569 |
+ 0.631 |
+ 0.569 |
+ 0.852 |
+ 0.838 |
+ 205 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28_indels |
+ 0.519 |
+ 0.698 |
+ 0.686 |
+ 0.791 |
+ 0.791 |
+ 0.829 |
+ 0.847 |
+ 0.826 |
+ 0.826 |
+ 0.848 |
+ 0.859 |
+ 0.416 |
+ 0.725 |
+ 0.806 |
+ 0.819 |
+ 0.623 |
+ 0.551 |
+ 0.653 |
+ 0.650 |
+ 0.588 |
+ 0.695 |
+ 0.770 |
+ 0.822 |
+ 129 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM_indels |
+ 0.478 |
+ 0.959 |
+ 0.620 |
+ 0.535 |
+ 0.888 |
+ 0.959 |
+ 0.422 |
+ 0.966 |
+ 0.935 |
+ 0.965 |
+ 0.965 |
+ 0.618 |
+ 0.480 |
+ 0.866 |
+ 0.862 |
+ 0.437 |
+ 0.819 |
+ 0.843 |
+ 0.413 |
+ 0.797 |
+ 0.836 |
+ 0.897 |
+ 0.932 |
+ 195 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX_indels |
+ 0.583 |
+ 0.887 |
+ 0.673 |
+ 0.596 |
+ 0.591 |
+ 0.457 |
+ 0.427 |
+ 0.461 |
+ 0.473 |
+ 0.529 |
+ 0.799 |
+ 0.418 |
+ 0.504 |
+ 0.462 |
+ 0.401 |
+ 0.325 |
+ 0.352 |
+ 0.330 |
+ 0.325 |
+ 0.354 |
+ 0.345 |
+ 0.593 |
+ 0.976 |
+ 140 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC_indels |
+ 0.525 |
+ 0.808 |
+ 0.474 |
+ 0.666 |
+ 0.623 |
+ 0.606 |
+ 0.702 |
+ 0.590 |
+ 0.576 |
+ 0.757 |
+ 0.755 |
+ 0.415 |
+ 0.558 |
+ 0.455 |
+ 0.562 |
+ 0.384 |
+ 0.392 |
+ 0.445 |
+ 0.398 |
+ 0.377 |
+ 0.404 |
+ 0.591 |
+ 0.960 |
+ 136 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels |
+ 0.863 |
+ 0.811 |
+ 0.949 |
+ 0.949 |
+ 0.967 |
+ 0.951 |
+ 0.961 |
+ 0.958 |
+ 0.983 |
+ 0.965 |
+ 0.967 |
+ 0.700 |
+ 0.955 |
+ 0.952 |
+ 0.971 |
+ 0.928 |
+ 0.954 |
+ 0.945 |
+ 0.930 |
+ 0.954 |
+ 0.943 |
+ 0.758 |
+ 0.836 |
+ 174 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels |
+ 0.620 |
+ 0.862 |
+ 0.914 |
+ 0.926 |
+ 0.891 |
+ 0.915 |
+ 0.849 |
+ 0.924 |
+ 0.853 |
+ 0.875 |
+ 0.952 |
+ 0.633 |
+ 0.667 |
+ 0.664 |
+ 0.753 |
+ 0.643 |
+ 0.656 |
+ 0.705 |
+ 0.656 |
+ 0.663 |
+ 0.782 |
+ 0.595 |
+ 0.921 |
+ 154 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels |
+ 0.518 |
+ 0.907 |
+ 0.972 |
+ 0.977 |
+ 0.979 |
+ 0.973 |
+ 0.970 |
+ 0.979 |
+ 0.981 |
+ 0.975 |
+ 0.972 |
+ 0.650 |
+ 0.957 |
+ 0.978 |
+ 0.971 |
+ 0.757 |
+ 0.787 |
+ 0.822 |
+ 0.783 |
+ 0.825 |
+ 0.823 |
+ 0.705 |
+ 0.841 |
+ 185 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U_indels |
+ 0.558 |
+ 0.561 |
+ 0.460 |
+ 0.788 |
+ 0.750 |
+ 0.757 |
+ 0.421 |
+ 0.817 |
+ 0.846 |
+ 0.812 |
+ 0.791 |
+ 0.503 |
+ 0.260 |
+ 0.246 |
+ 0.326 |
+ 0.262 |
+ 0.261 |
+ 0.279 |
+ 0.265 |
+ 0.276 |
+ 0.276 |
+ 0.714 |
+ 0.787 |
+ 193 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Eukaryote |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q_indels |
+ 0.564 |
+ 0.927 |
+ 0.754 |
+ 0.893 |
+ 0.867 |
+ 0.845 |
+ 0.839 |
+ 0.645 |
+ 0.762 |
+ 0.787 |
+ 0.909 |
+ 0.661 |
+ 0.649 |
+ 0.616 |
+ 0.887 |
+ 0.430 |
+ 0.503 |
+ 0.883 |
+ 0.426 |
+ 0.421 |
+ 0.885 |
+ 0.844 |
+ 0.933 |
+ 148 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM_indels |
+ 0.665 |
+ 0.888 |
+ 0.749 |
+ 0.889 |
+ 0.881 |
+ 0.887 |
+ 0.654 |
+ 0.841 |
+ 0.810 |
+ 0.820 |
+ 0.822 |
+ 0.500 |
+ 0.518 |
+ 0.637 |
+ 0.586 |
+ 0.647 |
+ 0.674 |
+ 0.617 |
+ 0.641 |
+ 0.665 |
+ 0.586 |
+ 0.791 |
+ 0.786 |
+ 154 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019_indels |
+ 0.094 |
+ 0.863 |
+ 0.772 |
+ 0.748 |
+ 0.795 |
+ 0.808 |
+ 0.730 |
+ 0.837 |
+ 0.888 |
+ 0.857 |
+ 0.888 |
+ 0.298 |
+ 0.723 |
+ 0.862 |
+ 0.874 |
+ 0.800 |
+ 0.728 |
+ 0.851 |
+ 0.818 |
+ 0.849 |
+ 0.881 |
+ 0.748 |
+ 0.879 |
+ 6102 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33_indels |
+ 0.488 |
+ 0.850 |
+ 0.816 |
+ 0.852 |
+ 0.853 |
+ 0.862 |
+ 0.836 |
+ 0.856 |
+ 0.836 |
+ 0.852 |
+ 0.834 |
+ 0.871 |
+ 0.585 |
+ 0.840 |
+ 0.833 |
+ 0.453 |
+ 0.777 |
+ 0.799 |
+ 0.491 |
+ 0.776 |
+ 0.808 |
+ 0.891 |
+ 0.832 |
+ 193 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ KCNJ2_MOUSE_Macdonald_2022_indels |
+ 0.662 |
+ 0.772 |
+ 0.775 |
+ 0.780 |
+ 0.768 |
+ 0.771 |
+ 0.797 |
+ 0.786 |
+ 0.776 |
+ 0.794 |
+ 0.770 |
+ 0.690 |
+ 0.773 |
+ 0.771 |
+ 0.768 |
+ 0.776 |
+ 0.764 |
+ 0.771 |
+ 0.776 |
+ 0.771 |
+ 0.771 |
+ 0.765 |
+ 0.774 |
+ 10501 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V_indels |
+ 0.626 |
+ 0.694 |
+ 0.518 |
+ 0.751 |
+ 0.821 |
+ 0.841 |
+ 0.688 |
+ 0.802 |
+ 0.834 |
+ 0.755 |
+ 0.725 |
+ 0.512 |
+ 0.432 |
+ 0.700 |
+ 0.714 |
+ 0.498 |
+ 0.706 |
+ 0.713 |
+ 0.500 |
+ 0.706 |
+ 0.678 |
+ 0.532 |
+ 0.599 |
+ 115 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV_indels |
+ 0.494 |
+ 0.872 |
+ 0.507 |
+ 0.612 |
+ 0.749 |
+ 0.655 |
+ 0.734 |
+ 0.910 |
+ 0.907 |
+ 0.852 |
+ 0.922 |
+ 0.802 |
+ 0.553 |
+ 0.551 |
+ 0.447 |
+ 0.618 |
+ 0.622 |
+ 0.618 |
+ 0.631 |
+ 0.624 |
+ 0.636 |
+ 0.737 |
+ 0.918 |
+ 131 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT_indels |
+ 0.401 |
+ 0.753 |
+ 0.766 |
+ 0.571 |
+ 0.708 |
+ 0.669 |
+ 0.935 |
+ 0.858 |
+ 0.626 |
+ 0.899 |
+ 0.987 |
+ 0.518 |
+ 0.562 |
+ 0.653 |
+ 0.545 |
+ 0.595 |
+ 0.611 |
+ 0.574 |
+ 0.609 |
+ 0.649 |
+ 0.634 |
+ 0.470 |
+ 0.917 |
+ 80 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R_indels |
+ 0.908 |
+ 0.778 |
+ 0.837 |
+ 0.879 |
+ 0.850 |
+ 0.836 |
+ 0.884 |
+ 0.805 |
+ 0.917 |
+ 0.711 |
+ 0.789 |
+ 0.723 |
+ 0.907 |
+ 0.826 |
+ 0.715 |
+ 0.907 |
+ 0.910 |
+ 0.828 |
+ 0.908 |
+ 0.904 |
+ 0.810 |
+ 0.825 |
+ 0.826 |
+ 178 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL_indels |
+ 0.664 |
+ 0.810 |
+ 0.876 |
+ 0.841 |
+ 0.877 |
+ 0.892 |
+ 0.652 |
+ 0.839 |
+ 0.679 |
+ 0.921 |
+ 0.916 |
+ 0.343 |
+ 0.699 |
+ 0.876 |
+ 0.734 |
+ 0.584 |
+ 0.658 |
+ 0.630 |
+ 0.617 |
+ 0.730 |
+ 0.665 |
+ 0.652 |
+ 0.902 |
+ 191 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6_indels |
+ 0.504 |
+ 0.685 |
+ 0.865 |
+ 0.891 |
+ 0.880 |
+ 0.891 |
+ 0.859 |
+ 0.884 |
+ 0.857 |
+ 0.875 |
+ 0.865 |
+ 0.479 |
+ 0.857 |
+ 0.840 |
+ 0.783 |
+ 0.672 |
+ 0.649 |
+ 0.682 |
+ 0.674 |
+ 0.643 |
+ 0.700 |
+ 0.436 |
+ 0.690 |
+ 157 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels |
+ 0.680 |
+ 0.862 |
+ 0.908 |
+ 0.942 |
+ 0.946 |
+ 0.919 |
+ 0.887 |
+ 0.930 |
+ 0.934 |
+ 0.884 |
+ 0.890 |
+ 0.766 |
+ 0.724 |
+ 0.507 |
+ 0.702 |
+ 0.445 |
+ 0.446 |
+ 0.554 |
+ 0.468 |
+ 0.470 |
+ 0.559 |
+ 0.671 |
+ 0.764 |
+ 169 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G_indels |
+ 0.501 |
+ 0.920 |
+ 0.924 |
+ 0.988 |
+ 0.947 |
+ 0.958 |
+ 0.982 |
+ 0.986 |
+ 0.983 |
+ 0.976 |
+ 0.970 |
+ 0.871 |
+ 0.931 |
+ 0.662 |
+ 0.786 |
+ 0.776 |
+ 0.792 |
+ 0.575 |
+ 0.776 |
+ 0.954 |
+ 0.692 |
+ 0.767 |
+ 0.972 |
+ 47 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels |
+ 0.359 |
+ 0.579 |
+ 0.398 |
+ 0.883 |
+ 0.903 |
+ 0.924 |
+ 0.784 |
+ 0.925 |
+ 0.927 |
+ 0.949 |
+ 0.939 |
+ 0.715 |
+ 0.505 |
+ 0.543 |
+ 0.798 |
+ 0.241 |
+ 0.327 |
+ 0.479 |
+ 0.248 |
+ 0.327 |
+ 0.593 |
+ 0.473 |
+ 0.949 |
+ 84 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018_indels |
+ 0.291 |
+ 0.447 |
+ 0.558 |
+ 0.408 |
+ 0.456 |
+ 0.693 |
+ 0.533 |
+ 0.692 |
+ 0.616 |
+ 0.661 |
+ 0.653 |
+ 0.190 |
+ 0.660 |
+ 0.566 |
+ 0.681 |
+ 0.576 |
+ 0.543 |
+ 0.629 |
+ 0.614 |
+ 0.501 |
+ 0.616 |
+ 0.548 |
+ 0.653 |
+ 341 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C_indels |
+ 0.766 |
+ 0.853 |
+ 0.844 |
+ 0.894 |
+ 0.874 |
+ 0.846 |
+ 0.833 |
+ 0.837 |
+ 0.842 |
+ 0.840 |
+ 0.846 |
+ 0.851 |
+ 0.689 |
+ 0.921 |
+ 0.849 |
+ 0.681 |
+ 0.967 |
+ 0.858 |
+ 0.666 |
+ 0.965 |
+ 0.850 |
+ 0.931 |
+ 0.838 |
+ 106 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M_indels |
+ 0.890 |
+ 0.798 |
+ 0.940 |
+ 0.907 |
+ 0.888 |
+ 0.910 |
+ 0.886 |
+ 0.934 |
+ 0.969 |
+ 0.922 |
+ 0.967 |
+ 0.710 |
+ 0.922 |
+ 0.921 |
+ 0.937 |
+ 0.680 |
+ 0.701 |
+ 0.750 |
+ 0.722 |
+ 0.773 |
+ 0.782 |
+ 0.670 |
+ 0.903 |
+ 117 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF_indels |
+ 0.311 |
+ 0.857 |
+ 0.905 |
+ 0.907 |
+ 0.920 |
+ 0.915 |
+ 0.895 |
+ 0.909 |
+ 0.905 |
+ 0.928 |
+ 0.921 |
+ 0.769 |
+ 0.442 |
+ 0.884 |
+ 0.895 |
+ 0.698 |
+ 0.894 |
+ 0.900 |
+ 0.719 |
+ 0.893 |
+ 0.900 |
+ 0.855 |
+ 0.916 |
+ 187 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD_indels |
+ 0.641 |
+ 0.905 |
+ 0.591 |
+ 0.603 |
+ 0.602 |
+ 0.648 |
+ 0.631 |
+ 0.592 |
+ 0.576 |
+ 0.494 |
+ 0.638 |
+ 0.590 |
+ 0.637 |
+ 0.596 |
+ 0.667 |
+ 0.518 |
+ 0.459 |
+ 0.514 |
+ 0.636 |
+ 0.473 |
+ 0.522 |
+ 0.684 |
+ 0.859 |
+ 149 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC_indels |
+ 0.463 |
+ 0.963 |
+ 0.911 |
+ 0.978 |
+ 0.979 |
+ 0.983 |
+ 0.954 |
+ 0.985 |
+ 0.960 |
+ 0.983 |
+ 0.982 |
+ 0.783 |
+ 0.689 |
+ 0.780 |
+ 0.781 |
+ 0.693 |
+ 0.762 |
+ 0.740 |
+ 0.729 |
+ 0.773 |
+ 0.740 |
+ 0.808 |
+ 0.739 |
+ 168 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE_indels |
+ 0.371 |
+ 0.508 |
+ 0.410 |
+ 0.271 |
+ 0.375 |
+ 0.647 |
+ 0.458 |
+ 0.631 |
+ 0.374 |
+ 0.599 |
+ 0.636 |
+ 0.630 |
+ 0.447 |
+ 0.753 |
+ 0.591 |
+ 0.514 |
+ 0.628 |
+ 0.548 |
+ 0.476 |
+ 0.679 |
+ 0.626 |
+ 0.749 |
+ 0.602 |
+ 175 |
+ Stability |
+ PSAE_SYNP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Mighell_2018_indels |
+ 0.325 |
+ 0.872 |
+ 0.799 |
+ 0.874 |
+ 0.870 |
+ 0.799 |
+ 0.863 |
+ 0.854 |
+ 0.892 |
+ 0.837 |
+ 0.837 |
+ 0.375 |
+ 0.877 |
+ 0.891 |
+ 0.907 |
+ 0.869 |
+ 0.904 |
+ 0.908 |
+ 0.867 |
+ 0.888 |
+ 0.900 |
+ 0.849 |
+ 0.897 |
+ 314 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q8EG35_SHEON_Campbell_2022_indels |
+ 0.512 |
+ 0.131 |
+ 0.394 |
+ 0.351 |
+ 0.299 |
+ 0.352 |
+ 0.550 |
+ 0.395 |
+ 0.255 |
+ 0.288 |
+ 0.202 |
+ 0.396 |
+ 0.549 |
+ 0.597 |
+ 0.486 |
+ 0.451 |
+ 0.538 |
+ 0.517 |
+ 0.448 |
+ 0.581 |
+ 0.473 |
+ 0.441 |
+ 0.160 |
+ 331 |
+ OrganismalFitness |
+ Q8EG35_SHEON |
+ Medium |
+ Prokaryote |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ_indels |
+ 0.600 |
+ 0.825 |
+ 0.786 |
+ 0.836 |
+ 0.825 |
+ 0.846 |
+ 0.876 |
+ 0.844 |
+ 0.795 |
+ 0.832 |
+ 0.874 |
+ 0.744 |
+ 0.560 |
+ 0.809 |
+ 0.791 |
+ 0.786 |
+ 0.839 |
+ 0.870 |
+ 0.804 |
+ 0.837 |
+ 0.868 |
+ 0.674 |
+ 0.796 |
+ 97 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO_indels |
+ 0.409 |
+ 0.842 |
+ 0.484 |
+ 0.595 |
+ 0.504 |
+ 0.642 |
+ 0.331 |
+ 0.696 |
+ 0.651 |
+ 0.653 |
+ 0.900 |
+ 0.462 |
+ 0.386 |
+ 0.413 |
+ 0.487 |
+ 0.448 |
+ 0.372 |
+ 0.346 |
+ 0.438 |
+ 0.382 |
+ 0.348 |
+ 0.774 |
+ 0.772 |
+ 124 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY_indels |
+ 0.663 |
+ 0.930 |
+ 0.951 |
+ 0.965 |
+ 0.914 |
+ 0.983 |
+ 0.982 |
+ 0.978 |
+ 0.956 |
+ 0.979 |
+ 0.973 |
+ 0.970 |
+ 0.806 |
+ 0.937 |
+ 0.958 |
+ 0.795 |
+ 0.922 |
+ 0.891 |
+ 0.825 |
+ 0.928 |
+ 0.912 |
+ 0.696 |
+ 0.980 |
+ 120 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69_indels |
+ 0.863 |
+ 0.687 |
+ 0.783 |
+ 0.820 |
+ 0.818 |
+ 0.832 |
+ 0.837 |
+ 0.851 |
+ 0.829 |
+ 0.847 |
+ 0.847 |
+ 0.554 |
+ 0.563 |
+ 0.803 |
+ 0.871 |
+ 0.558 |
+ 0.710 |
+ 0.809 |
+ 0.526 |
+ 0.739 |
+ 0.834 |
+ 0.523 |
+ 0.768 |
+ 164 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32_indels |
+ 0.700 |
+ 0.851 |
+ 0.884 |
+ 0.890 |
+ 0.883 |
+ 0.888 |
+ 0.851 |
+ 0.882 |
+ 0.909 |
+ 0.888 |
+ 0.888 |
+ 0.680 |
+ 0.889 |
+ 0.878 |
+ 0.872 |
+ 0.828 |
+ 0.794 |
+ 0.789 |
+ 0.860 |
+ 0.801 |
+ 0.795 |
+ 0.802 |
+ 0.782 |
+ 176 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance_indels |
+ 0.649 |
+ 0.646 |
+ 0.720 |
+ 0.810 |
+ 0.825 |
+ 0.792 |
+ 0.770 |
+ 0.855 |
+ 0.853 |
+ 0.810 |
+ 0.789 |
+ 0.614 |
+ 0.802 |
+ 0.835 |
+ 0.805 |
+ 0.738 |
+ 0.692 |
+ 0.765 |
+ 0.755 |
+ 0.802 |
+ 0.769 |
+ 0.555 |
+ 0.766 |
+ 430 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity_indels |
+ 0.552 |
+ 0.486 |
+ 0.602 |
+ 0.721 |
+ 0.762 |
+ 0.727 |
+ 0.700 |
+ 0.784 |
+ 0.821 |
+ 0.772 |
+ 0.733 |
+ 0.498 |
+ 0.742 |
+ 0.787 |
+ 0.771 |
+ 0.636 |
+ 0.622 |
+ 0.705 |
+ 0.664 |
+ 0.751 |
+ 0.712 |
+ 0.416 |
+ 0.694 |
+ 490 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB_indels |
+ 0.764 |
+ 0.844 |
+ 0.850 |
+ 0.861 |
+ 0.849 |
+ 0.864 |
+ 0.863 |
+ 0.849 |
+ 0.858 |
+ 0.857 |
+ 0.833 |
+ 0.894 |
+ 0.838 |
+ 0.840 |
+ 0.831 |
+ 0.769 |
+ 0.853 |
+ 0.843 |
+ 0.852 |
+ 0.853 |
+ 0.846 |
+ 0.841 |
+ 0.851 |
+ 86 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0_indels |
+ 0.587 |
+ 0.949 |
+ 0.674 |
+ 0.707 |
+ 0.692 |
+ 0.743 |
+ 0.807 |
+ 0.857 |
+ 0.587 |
+ 0.804 |
+ 0.951 |
+ 0.571 |
+ 0.708 |
+ 0.675 |
+ 0.935 |
+ 0.661 |
+ 0.696 |
+ 0.782 |
+ 0.661 |
+ 0.701 |
+ 0.880 |
+ 0.810 |
+ 0.965 |
+ 127 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK_indels |
+ 0.714 |
+ 0.857 |
+ 0.893 |
+ 0.939 |
+ 0.955 |
+ 0.958 |
+ 0.920 |
+ 0.961 |
+ 0.949 |
+ 0.967 |
+ 0.971 |
+ 0.979 |
+ 0.540 |
+ 0.984 |
+ 0.963 |
+ 0.466 |
+ 0.805 |
+ 0.859 |
+ 0.581 |
+ 0.845 |
+ 0.888 |
+ 0.782 |
+ 0.979 |
+ 109 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS_indels |
+ 0.394 |
+ 0.755 |
+ 0.526 |
+ 0.579 |
+ 0.550 |
+ 0.577 |
+ 0.672 |
+ 0.769 |
+ 0.761 |
+ 0.834 |
+ 0.805 |
+ 0.426 |
+ 0.447 |
+ 0.507 |
+ 0.819 |
+ 0.563 |
+ 0.592 |
+ 0.675 |
+ 0.566 |
+ 0.591 |
+ 0.698 |
+ 0.579 |
+ 0.835 |
+ 148 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD_indels |
+ 0.584 |
+ 0.684 |
+ 0.891 |
+ 0.824 |
+ 0.933 |
+ 0.880 |
+ 0.932 |
+ 0.927 |
+ 0.795 |
+ 0.948 |
+ 0.939 |
+ 0.428 |
+ 0.714 |
+ 0.688 |
+ 0.823 |
+ 0.653 |
+ 0.682 |
+ 0.766 |
+ 0.706 |
+ 0.665 |
+ 0.796 |
+ 0.491 |
+ 0.479 |
+ 129 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels |
+ 0.526 |
+ 0.691 |
+ 0.767 |
+ 0.918 |
+ 0.919 |
+ 0.922 |
+ 0.736 |
+ 0.908 |
+ 0.951 |
+ 0.929 |
+ 0.909 |
+ 0.591 |
+ 0.445 |
+ 0.858 |
+ 0.937 |
+ 0.483 |
+ 0.834 |
+ 0.810 |
+ 0.549 |
+ 0.835 |
+ 0.849 |
+ 0.440 |
+ 0.955 |
+ 111 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88_indels |
+ 0.517 |
+ 0.790 |
+ 0.758 |
+ 0.723 |
+ 0.729 |
+ 0.636 |
+ 0.534 |
+ 0.439 |
+ 0.599 |
+ 0.436 |
+ 0.581 |
+ 0.530 |
+ 0.709 |
+ 0.390 |
+ 0.599 |
+ 0.774 |
+ 0.606 |
+ 0.731 |
+ 0.791 |
+ 0.609 |
+ 0.740 |
+ 0.670 |
+ 0.758 |
+ 135 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels |
+ 0.692 |
+ 0.918 |
+ 0.935 |
+ 0.904 |
+ 0.935 |
+ 0.915 |
+ 0.933 |
+ 0.918 |
+ 0.913 |
+ 0.937 |
+ 0.934 |
+ 0.809 |
+ 0.873 |
+ 0.941 |
+ 0.868 |
+ 0.921 |
+ 0.920 |
+ 0.892 |
+ 0.924 |
+ 0.929 |
+ 0.906 |
+ 0.922 |
+ 0.927 |
+ 154 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels |
+ 0.829 |
+ 0.956 |
+ 0.967 |
+ 0.972 |
+ 0.974 |
+ 0.982 |
+ 0.945 |
+ 0.981 |
+ 0.973 |
+ 0.981 |
+ 0.985 |
+ 0.812 |
+ 0.834 |
+ 0.892 |
+ 0.952 |
+ 0.376 |
+ 0.572 |
+ 0.633 |
+ 0.375 |
+ 0.592 |
+ 0.759 |
+ 0.917 |
+ 0.976 |
+ 99 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG_indels |
+ 0.338 |
+ 0.844 |
+ 0.259 |
+ 0.536 |
+ 0.546 |
+ 0.497 |
+ 0.648 |
+ 0.768 |
+ 0.693 |
+ 0.683 |
+ 0.874 |
+ 0.461 |
+ 0.325 |
+ 0.841 |
+ 0.519 |
+ 0.227 |
+ 0.734 |
+ 0.744 |
+ 0.234 |
+ 0.837 |
+ 0.820 |
+ 0.625 |
+ 0.988 |
+ 82 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels |
+ 0.431 |
+ 0.599 |
+ 0.702 |
+ 0.881 |
+ 0.844 |
+ 0.881 |
+ 0.899 |
+ 0.906 |
+ 0.901 |
+ 0.898 |
+ 0.906 |
+ 0.467 |
+ 0.454 |
+ 0.433 |
+ 0.728 |
+ 0.481 |
+ 0.495 |
+ 0.654 |
+ 0.504 |
+ 0.447 |
+ 0.712 |
+ 0.645 |
+ 0.831 |
+ 171 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels |
+ 0.178 |
+ 0.763 |
+ 0.210 |
+ 0.783 |
+ 0.803 |
+ 0.825 |
+ 0.799 |
+ 0.829 |
+ 0.801 |
+ 0.830 |
+ 0.823 |
+ 0.600 |
+ 0.417 |
+ 0.238 |
+ 0.755 |
+ 0.376 |
+ 0.309 |
+ 0.650 |
+ 0.336 |
+ 0.314 |
+ 0.643 |
+ 0.398 |
+ 0.720 |
+ 147 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T_indels |
+ 0.608 |
+ 0.880 |
+ 0.945 |
+ 0.917 |
+ 0.955 |
+ 0.952 |
+ 0.941 |
+ 0.953 |
+ 0.940 |
+ 0.954 |
+ 0.956 |
+ 0.540 |
+ 0.878 |
+ 0.909 |
+ 0.831 |
+ 0.600 |
+ 0.790 |
+ 0.702 |
+ 0.596 |
+ 0.794 |
+ 0.750 |
+ 0.904 |
+ 0.844 |
+ 156 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8_indels |
+ 0.943 |
+ 0.908 |
+ 0.783 |
+ 0.903 |
+ 0.871 |
+ 0.936 |
+ 0.935 |
+ 0.923 |
+ 0.915 |
+ 0.917 |
+ 0.907 |
+ 0.613 |
+ 0.858 |
+ 0.904 |
+ 0.931 |
+ 0.727 |
+ 0.864 |
+ 0.913 |
+ 0.697 |
+ 0.871 |
+ 0.908 |
+ 0.929 |
+ 0.843 |
+ 101 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5_indels |
+ 0.612 |
+ 0.905 |
+ 0.699 |
+ 0.686 |
+ 0.670 |
+ 0.668 |
+ 0.686 |
+ 0.738 |
+ 0.709 |
+ 0.664 |
+ 0.888 |
+ 0.770 |
+ 0.670 |
+ 0.899 |
+ 0.767 |
+ 0.754 |
+ 0.879 |
+ 0.848 |
+ 0.753 |
+ 0.891 |
+ 0.847 |
+ 0.588 |
+ 0.861 |
+ 156 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM_indels |
+ 0.740 |
+ 0.864 |
+ 0.589 |
+ 0.722 |
+ 0.749 |
+ 0.704 |
+ 0.761 |
+ 0.685 |
+ 0.636 |
+ 0.645 |
+ 0.798 |
+ 0.613 |
+ 0.664 |
+ 0.670 |
+ 0.718 |
+ 0.690 |
+ 0.701 |
+ 0.655 |
+ 0.652 |
+ 0.686 |
+ 0.654 |
+ 0.741 |
+ 0.876 |
+ 154 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD_indels |
+ 0.705 |
+ 0.960 |
+ 0.658 |
+ 0.772 |
+ 0.937 |
+ 0.941 |
+ 0.901 |
+ 0.932 |
+ 0.698 |
+ 0.925 |
+ 0.897 |
+ 0.637 |
+ 0.644 |
+ 0.881 |
+ 0.912 |
+ 0.439 |
+ 0.507 |
+ 0.734 |
+ 0.442 |
+ 0.516 |
+ 0.725 |
+ 0.923 |
+ 0.887 |
+ 104 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.csv b/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.csv
new file mode 100644
index 0000000..27ffbb1
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.csv
@@ -0,0 +1,24 @@
+Model_rank,Model_name,Model type,Average_NDCG,Bootstrap_standard_error_NDCG,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Model details,References
+1,Progen2 M,Protein language model,0.763,0.0,0.796,,0.82,0.595,0.84,0.642,0.789,0.861,0.878,0.815,0.775,0.694,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+2,Progen2 Base,Protein language model,0.751,0.01,0.812,,0.814,0.568,0.811,0.623,0.751,0.844,0.88,0.795,0.674,0.701,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+3,Progen2 XL,Protein language model,0.751,0.02,0.765,,0.78,0.58,0.878,0.718,0.834,0.872,0.882,0.864,0.784,0.808,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+4,Progen2 L,Protein language model,0.748,0.007,0.781,,0.802,0.571,0.839,0.637,0.791,0.854,0.87,0.802,0.793,0.693,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+5,Tranception L no retrieval,Protein language model,0.744,0.021,0.79,,0.786,0.643,0.755,0.724,0.73,0.767,0.79,0.686,0.739,0.811,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,Tranception M no retrieval,Protein language model,0.741,0.018,0.798,,0.803,0.645,0.719,0.64,0.695,0.747,0.744,0.698,0.704,0.715,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+7,Provean,Alignment-based model,0.74,0.021,0.759,,0.77,0.58,0.851,0.706,0.822,0.838,0.836,0.848,0.771,0.829,Provean model,"Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one."
+8,RITA XL,Protein language model,0.74,0.017,0.746,,0.782,0.612,0.821,0.718,0.769,0.838,0.869,0.769,0.735,0.788,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+9,RITA L,Protein language model,0.736,0.019,0.78,,0.796,0.564,0.805,0.626,0.745,0.835,0.856,0.77,0.689,0.773,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+10,TranceptEVE L,Hybrid model,0.727,0.024,0.775,,0.77,0.641,0.722,0.731,0.685,0.749,0.742,0.703,0.679,0.774,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+11,TranceptEVE M,Hybrid model,0.726,0.022,0.788,,0.786,0.652,0.679,0.669,0.64,0.725,0.707,0.686,0.649,0.667,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+12,RITA M,Protein language model,0.726,0.02,0.767,,0.795,0.552,0.788,0.605,0.719,0.831,0.842,0.732,0.698,0.773,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+13,Tranception L,Hybrid model,0.721,0.024,0.772,,0.768,0.642,0.701,0.718,0.671,0.728,0.727,0.684,0.656,0.766,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,Progen2 S,Protein language model,0.718,0.017,0.766,,0.784,0.549,0.773,0.594,0.702,0.821,0.823,0.701,0.724,0.745,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+15,Tranception S no retrieval,Protein language model,0.706,0.017,0.763,,0.788,0.625,0.649,0.71,0.623,0.679,0.693,0.597,0.665,0.675,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,Wavenet,Alignment-based model,0.693,0.034,0.703,,0.709,0.539,0.823,0.627,0.818,0.791,0.802,0.811,0.75,0.828,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+17,Tranception M,Hybrid model,0.691,0.028,0.752,,0.728,0.616,0.667,0.684,0.628,0.704,0.695,0.67,0.62,0.674,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+18,TranceptEVE S,Hybrid model,0.688,0.022,0.747,,0.766,0.634,0.605,0.738,0.577,0.644,0.646,0.595,0.6,0.624,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+19,RITA S,Protein language model,0.687,0.02,0.699,,0.748,0.579,0.721,0.669,0.659,0.761,0.759,0.687,0.666,0.695,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+20,Tranception S,Hybrid model,0.677,0.024,0.741,,0.757,0.617,0.591,0.726,0.564,0.627,0.63,0.581,0.583,0.622,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+21,Hidden Markov Model,Alignment-based model,0.642,0.041,0.672,,0.66,0.529,0.708,0.549,0.74,0.666,0.728,0.66,0.668,0.734,Profile Hidden Markov model,HMMER: biosequence analysis using profile hidden Markov models
+22,ProtGPT2,Protein language model,0.537,0.042,0.519,,0.652,0.344,0.634,0.526,0.575,0.645,0.671,0.589,0.525,0.61,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+23,Unirep,Protein language model,0.527,0.046,0.526,,0.656,0.329,0.599,0.504,0.558,0.599,0.571,0.559,0.553,0.707,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
diff --git a/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.html b/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.html
new file mode 100644
index 0000000..5b2ad43
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/NDCG/Summary_performance_DMS_indels_NDCG.html
@@ -0,0 +1,531 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_NDCG |
+ Bootstrap_standard_error_NDCG |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ Progen2 M |
+ Protein language model |
+ 0.764 |
+ 0.026 |
+ 0.798 |
+ NaN |
+ 0.820 |
+ 0.595 |
+ 0.843 |
+ 0.642 |
+ 0.789 |
+ 0.866 |
+ 0.882 |
+ 0.815 |
+ 0.778 |
+ 0.695 |
+ Progen2 medium model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 2 |
+ Progen2 XL |
+ Protein language model |
+ 0.754 |
+ 0.024 |
+ 0.766 |
+ NaN |
+ 0.786 |
+ 0.580 |
+ 0.882 |
+ 0.718 |
+ 0.838 |
+ 0.877 |
+ 0.886 |
+ 0.865 |
+ 0.792 |
+ 0.811 |
+ Progen2 xlarge model (6.4B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 3 |
+ Progen2 Base |
+ Protein language model |
+ 0.752 |
+ 0.029 |
+ 0.810 |
+ NaN |
+ 0.816 |
+ 0.568 |
+ 0.814 |
+ 0.623 |
+ 0.752 |
+ 0.849 |
+ 0.883 |
+ 0.798 |
+ 0.677 |
+ 0.704 |
+ Progen2 base model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 4 |
+ Progen2 L |
+ Protein language model |
+ 0.750 |
+ 0.027 |
+ 0.780 |
+ NaN |
+ 0.806 |
+ 0.571 |
+ 0.843 |
+ 0.637 |
+ 0.794 |
+ 0.858 |
+ 0.874 |
+ 0.804 |
+ 0.796 |
+ 0.696 |
+ Progen2 large model (2.7B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 5 |
+ TranceptEVE M |
+ Hybrid model |
+ 0.748 |
+ 0.000 |
+ 0.860 |
+ NaN |
+ 0.786 |
+ 0.652 |
+ 0.694 |
+ 0.670 |
+ 0.661 |
+ 0.738 |
+ 0.715 |
+ 0.702 |
+ 0.682 |
+ 0.676 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 6 |
+ TranceptEVE L |
+ Hybrid model |
+ 0.748 |
+ 0.012 |
+ 0.842 |
+ NaN |
+ 0.770 |
+ 0.641 |
+ 0.737 |
+ 0.731 |
+ 0.705 |
+ 0.763 |
+ 0.756 |
+ 0.717 |
+ 0.707 |
+ 0.779 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 7 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.747 |
+ 0.023 |
+ 0.792 |
+ NaN |
+ 0.788 |
+ 0.643 |
+ 0.765 |
+ 0.724 |
+ 0.738 |
+ 0.776 |
+ 0.795 |
+ 0.697 |
+ 0.754 |
+ 0.811 |
+ Tranception Large model (700M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 8 |
+ Provean |
+ Alignment-based model |
+ 0.745 |
+ 0.028 |
+ 0.761 |
+ NaN |
+ 0.774 |
+ 0.580 |
+ 0.865 |
+ 0.707 |
+ 0.834 |
+ 0.852 |
+ 0.845 |
+ 0.861 |
+ 0.787 |
+ 0.842 |
+ Provean model |
+ Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one. |
+
+
+ 9 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.743 |
+ 0.019 |
+ 0.800 |
+ NaN |
+ 0.804 |
+ 0.645 |
+ 0.723 |
+ 0.642 |
+ 0.698 |
+ 0.750 |
+ 0.745 |
+ 0.702 |
+ 0.711 |
+ 0.712 |
+ Tranception Medium model (300M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 10 |
+ RITA XL |
+ Protein language model |
+ 0.741 |
+ 0.020 |
+ 0.746 |
+ NaN |
+ 0.782 |
+ 0.612 |
+ 0.824 |
+ 0.718 |
+ 0.768 |
+ 0.844 |
+ 0.873 |
+ 0.770 |
+ 0.738 |
+ 0.789 |
+ RITA xlarge model (1.2B params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 11 |
+ RITA L |
+ Protein language model |
+ 0.738 |
+ 0.020 |
+ 0.780 |
+ NaN |
+ 0.798 |
+ 0.564 |
+ 0.809 |
+ 0.626 |
+ 0.746 |
+ 0.842 |
+ 0.860 |
+ 0.773 |
+ 0.692 |
+ 0.776 |
+ RITA large model (680M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 12 |
+ RITA M |
+ Protein language model |
+ 0.727 |
+ 0.018 |
+ 0.769 |
+ NaN |
+ 0.795 |
+ 0.552 |
+ 0.793 |
+ 0.604 |
+ 0.721 |
+ 0.837 |
+ 0.847 |
+ 0.734 |
+ 0.704 |
+ 0.773 |
+ RITA medium model (300M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 13 |
+ Tranception L |
+ Hybrid model |
+ 0.726 |
+ 0.019 |
+ 0.772 |
+ NaN |
+ 0.769 |
+ 0.642 |
+ 0.721 |
+ 0.718 |
+ 0.689 |
+ 0.744 |
+ 0.742 |
+ 0.703 |
+ 0.677 |
+ 0.771 |
+ Tranception Large model (700M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 14 |
+ Progen2 S |
+ Protein language model |
+ 0.721 |
+ 0.020 |
+ 0.769 |
+ NaN |
+ 0.788 |
+ 0.549 |
+ 0.777 |
+ 0.594 |
+ 0.702 |
+ 0.828 |
+ 0.829 |
+ 0.706 |
+ 0.725 |
+ 0.743 |
+ Progen2 small model (150M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 15 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.708 |
+ 0.022 |
+ 0.763 |
+ NaN |
+ 0.792 |
+ 0.625 |
+ 0.651 |
+ 0.710 |
+ 0.624 |
+ 0.682 |
+ 0.694 |
+ 0.600 |
+ 0.666 |
+ 0.675 |
+ Tranception Small model (85M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 16 |
+ Tranception M |
+ Hybrid model |
+ 0.695 |
+ 0.017 |
+ 0.753 |
+ NaN |
+ 0.730 |
+ 0.616 |
+ 0.680 |
+ 0.684 |
+ 0.639 |
+ 0.718 |
+ 0.706 |
+ 0.683 |
+ 0.634 |
+ 0.680 |
+ Tranception Medium model (300M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 17 |
+ TranceptEVE S |
+ Hybrid model |
+ 0.690 |
+ 0.016 |
+ 0.746 |
+ NaN |
+ 0.766 |
+ 0.634 |
+ 0.615 |
+ 0.738 |
+ 0.583 |
+ 0.654 |
+ 0.653 |
+ 0.604 |
+ 0.608 |
+ 0.631 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 18 |
+ RITA S |
+ Protein language model |
+ 0.687 |
+ 0.016 |
+ 0.698 |
+ NaN |
+ 0.748 |
+ 0.579 |
+ 0.723 |
+ 0.669 |
+ 0.662 |
+ 0.762 |
+ 0.761 |
+ 0.693 |
+ 0.665 |
+ 0.695 |
+ RITA small model (85M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 19 |
+ Tranception S |
+ Hybrid model |
+ 0.678 |
+ 0.016 |
+ 0.740 |
+ NaN |
+ 0.757 |
+ 0.617 |
+ 0.599 |
+ 0.726 |
+ 0.571 |
+ 0.636 |
+ 0.635 |
+ 0.588 |
+ 0.595 |
+ 0.624 |
+ Tranception Small model (85M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 20 |
+ Wavenet |
+ Alignment-based model |
+ 0.654 |
+ 0.029 |
+ 0.713 |
+ NaN |
+ 0.721 |
+ 0.539 |
+ 0.644 |
+ 0.526 |
+ 0.709 |
+ 0.592 |
+ 0.493 |
+ 0.744 |
+ 0.671 |
+ 0.828 |
+ Wavenet model |
+ Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12. |
+
+
+ 21 |
+ Hidden Markov Model |
+ Alignment-based model |
+ 0.643 |
+ 0.031 |
+ 0.672 |
+ NaN |
+ 0.660 |
+ 0.529 |
+ 0.711 |
+ 0.549 |
+ 0.743 |
+ 0.670 |
+ 0.729 |
+ 0.664 |
+ 0.671 |
+ 0.741 |
+ Profile Hidden Markov model |
+ HMMER: biosequence analysis using profile hidden Markov models |
+
+
+ 22 |
+ ProtGPT2 |
+ Protein language model |
+ 0.539 |
+ 0.026 |
+ 0.519 |
+ NaN |
+ 0.652 |
+ 0.344 |
+ 0.639 |
+ 0.526 |
+ 0.575 |
+ 0.654 |
+ 0.677 |
+ 0.595 |
+ 0.528 |
+ 0.608 |
+ ProtGPT2 model |
+ Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13. |
+
+
+ 23 |
+ Unirep |
+ Protein language model |
+ 0.527 |
+ 0.035 |
+ 0.526 |
+ NaN |
+ 0.656 |
+ 0.329 |
+ 0.598 |
+ 0.504 |
+ 0.556 |
+ 0.600 |
+ 0.572 |
+ 0.558 |
+ 0.552 |
+ 0.705 |
+ Unirep model |
+ Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8. |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.csv
new file mode 100644
index 0000000..be2d26d
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS ID,Unirep,Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,Hidden Markov Model,Provean,Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A4_HUMAN_Seuma_2022_indels,0.523,0.512,0.472,0.451,0.438,0.429,0.539,0.478,0.494,0.481,0.462,0.552,0.555,0.442,0.444,0.536,0.466,0.416,0.544,0.459,0.466,-0.207,0.381,2346,Stability,A4_HUMAN,Low,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,0.026,0.701,-0.314,0.18,0.561,0.547,0.019,0.512,0.502,0.298,0.523,0.574,-0.044,-0.113,0.125,-0.309,-0.26,-0.27,-0.3,-0.257,-0.25,0.632,0.331,117,Stability,AMFR_HUMAN,Medium,Human
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,0.513,0.611,0.757,0.594,0.58,0.729,0.459,0.603,0.68,0.698,0.692,0.171,0.728,0.586,0.595,0.252,0.427,0.505,0.293,0.477,0.536,0.454,0.129,181,Stability,ARGR_ECOLI,Medium,Prokaryote
+B1LPA6_ECOSM_Russ_2020_indels,0.275,0.438,0.294,0.363,0.337,0.36,0.384,0.38,0.415,0.412,0.379,0.316,0.172,0.219,0.223,0.33,0.344,0.326,0.316,0.357,0.34,0.398,0.372,3074,Activity,B1LPA6_ECOSM,Medium,Prokaryote
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,0.537,0.41,0.499,0.499,0.513,0.486,0.33,0.548,0.588,0.524,0.566,0.259,0.547,0.588,0.604,0.46,0.518,0.524,0.463,0.515,0.53,0.29,0.26,134,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,-0.055,0.647,0.025,0.165,0.085,0.184,0.195,0.262,0.086,0.382,0.539,0.21,0.18,0.276,0.263,-0.31,-0.287,-0.191,-0.274,-0.246,-0.147,0.47,0.283,82,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Gonzalez_2019_indels,-0.013,0.437,0.436,0.455,0.334,0.345,0.496,0.619,0.664,0.604,0.409,0.155,0.451,0.379,0.296,0.425,0.365,0.344,0.439,0.401,0.342,0.347,0.385,4751,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+CAPSD_AAV2S_Sinai_2021_designed_indels,-0.409,0.666,0.298,0.358,0.543,0.621,-0.457,-0.466,-0.402,-0.469,0.492,0.159,0.336,0.362,0.691,0.715,0.677,0.709,0.718,0.726,0.736,0.607,0.683,225998,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAPSD_AAV2S_Sinai_2021_library_indels,-0.125,-0.007,0.043,0.119,0.103,0.141,-0.117,-0.1,-0.084,-0.104,0.167,0.083,0.044,0.126,0.21,0.425,0.371,0.338,0.419,0.419,0.416,0.057,0.177,24908,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,0.466,0.439,0.505,0.425,0.421,0.334,0.416,0.337,0.346,0.287,0.338,0.072,0.521,0.242,0.355,0.468,0.274,0.368,0.473,0.261,0.353,0.145,0.239,197,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,0.412,0.375,0.494,0.589,0.449,0.445,0.244,0.41,0.452,0.379,0.493,0.018,0.455,0.609,0.461,-0.068,-0.041,-0.012,-0.052,-0.016,0.013,0.472,0.366,205,Stability,CBPA2_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,0.231,0.31,0.208,0.294,0.259,0.233,0.199,0.234,0.201,0.265,0.208,-0.017,0.338,0.287,0.297,0.261,0.259,0.3,0.279,0.271,0.305,0.38,0.073,129,Stability,CBX4_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,0.637,0.581,0.684,0.668,0.551,0.753,0.684,0.782,0.806,0.722,0.745,0.272,0.512,0.681,0.77,-0.194,0.047,0.158,-0.171,0.094,0.223,0.536,0.312,195,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,0.528,0.695,0.51,0.518,0.535,0.466,0.255,0.351,0.346,0.404,0.636,0.27,0.471,0.386,0.507,-0.183,-0.181,-0.153,-0.153,-0.164,-0.125,0.474,0.46,140,Stability,CUE1_YEAST,Medium,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,0.19,0.242,0.179,0.292,0.364,0.285,0.38,0.362,0.303,0.317,0.458,0.026,0.243,0.218,0.196,-0.117,-0.117,-0.135,-0.103,-0.103,-0.115,0.12,0.696,136,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,0.476,0.291,0.262,0.378,0.466,0.311,0.431,0.332,0.4,0.285,0.415,0.23,0.536,0.498,0.495,0.343,0.368,0.396,0.371,0.387,0.413,0.22,0.109,174,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,0.414,0.619,0.554,0.559,0.627,0.515,0.62,0.596,0.553,0.559,0.628,0.309,0.64,0.7,0.659,0.391,0.48,0.463,0.42,0.513,0.495,0.433,0.353,154,Stability,DOCK1_MOUSE,High,Eukaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,0.243,0.429,0.408,0.378,0.459,0.428,0.547,0.443,0.601,0.554,0.6,0.361,0.416,0.367,0.405,0.112,0.198,0.269,0.158,0.229,0.308,0.381,0.316,185,Stability,EPHB2_HUMAN,High,Human
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,0.426,0.187,0.334,0.387,0.411,0.376,0.35,0.494,0.576,0.505,0.429,0.104,0.204,0.286,0.029,-0.035,0.076,-0.058,-0.037,0.084,-0.058,0.226,0.444,193,Stability,FECA_ECOLI,High,Eukaryote
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,0.315,0.396,0.625,0.593,0.775,0.815,0.505,0.451,0.448,0.48,0.589,0.244,0.521,0.47,0.517,-0.096,-0.107,0.383,-0.108,-0.13,0.405,0.408,0.378,148,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,0.239,0.603,0.307,0.434,0.522,0.572,0.436,0.567,0.539,0.545,0.524,0.025,0.129,0.331,0.104,0.189,0.338,0.205,0.179,0.348,0.183,0.551,0.282,154,Stability,HECD1_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019_indels,0.002,0.687,0.638,0.656,0.677,0.684,0.669,0.702,0.706,0.703,0.713,-0.091,0.63,0.687,0.707,0.628,0.611,0.655,0.639,0.678,0.695,0.548,0.701,6102,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,0.03,0.503,0.425,0.518,0.31,0.346,0.583,0.403,0.565,0.618,0.527,0.052,0.258,0.672,0.632,-0.033,0.309,0.423,-0.003,0.369,0.479,0.291,0.216,193,Stability,ILF3_HUMAN,High,Human
+KCNJ2_MOUSE_Macdonald_2022_indels,0.162,0.408,0.385,0.4,0.383,0.377,0.401,0.432,0.426,0.431,0.387,0.265,0.401,0.412,0.391,0.441,0.437,0.44,0.434,0.444,0.431,0.368,0.386,10501,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,0.412,0.049,-0.077,0.313,0.293,0.279,0.321,0.369,0.314,0.3,0.349,0.043,-0.014,0.167,0.274,0.083,0.124,0.196,0.075,0.141,0.23,-0.033,0.018,115,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,0.216,0.312,0.2,0.106,0.472,0.494,0.32,0.67,0.565,0.447,0.499,0.298,0.093,0.174,0.138,0.041,0.094,0.045,0.074,0.119,0.081,0.253,0.482,131,Stability,MBD11_ARATH,Medium,Eukaryote
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,0.096,0.354,0.39,0.283,0.535,0.47,0.616,0.533,0.397,0.484,0.548,0.067,0.544,0.525,0.412,0.286,0.295,0.242,0.343,0.358,0.285,0.147,0.519,80,Stability,MYO3_YEAST,High,Eukaryote
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,0.667,0.772,0.648,0.767,0.761,0.754,0.711,0.706,0.767,0.698,0.677,0.073,0.612,0.667,0.698,0.476,0.609,0.658,0.487,0.617,0.652,0.612,0.331,178,Stability,NKX31_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,0.128,0.34,0.219,0.315,0.279,0.241,0.253,0.227,0.287,0.213,0.226,-0.027,0.206,0.289,0.266,-0.076,0.033,0.002,-0.076,0.065,0.029,0.064,0.32,191,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,0.441,0.023,-0.125,-0.044,-0.101,-0.147,0.111,-0.039,-0.026,-0.083,-0.044,0.101,-0.161,-0.07,0.167,-0.156,-0.14,0.062,-0.129,-0.112,0.094,-0.031,0.17,157,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,0.505,0.505,0.807,0.678,0.691,0.714,0.78,0.815,0.845,0.733,0.696,0.39,0.367,0.437,0.584,0.149,0.25,0.33,0.162,0.274,0.373,0.451,0.152,169,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,0.142,0.677,0.762,0.833,0.857,0.873,0.821,0.881,0.863,0.838,0.85,0.331,0.52,0.444,0.492,0.204,0.337,0.258,0.269,0.383,0.284,0.629,0.686,47,Stability,ODP2_GEOSE,High,Prokaryote
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,0.468,0.221,0.367,0.336,0.388,0.429,0.303,0.482,0.587,0.468,0.546,0.286,0.251,0.462,0.501,-0.162,-0.078,0.155,-0.148,-0.042,0.174,0.36,0.519,84,Stability,OTU7A_HUMAN,High,Human
+P53_HUMAN_Kotler_2018_indels,0.073,0.031,0.36,0.407,0.383,0.273,0.474,0.428,0.391,0.449,0.354,0.026,0.536,0.579,0.395,0.409,0.536,0.362,0.429,0.56,0.399,0.482,0.273,341,OrganismalFitness,P53_HUMAN,Low,Human
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,0.787,0.758,0.732,0.752,0.771,0.735,0.805,0.634,0.619,0.719,0.608,0.577,0.649,0.74,0.789,0.183,0.544,0.666,0.256,0.605,0.693,0.624,0.58,106,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,0.416,0.481,0.389,0.633,0.625,0.671,0.409,0.69,0.543,0.708,0.655,0.463,0.249,0.535,0.551,-0.063,0.16,0.131,0.013,0.239,0.227,0.339,0.436,117,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,0.099,0.727,0.734,0.779,0.729,0.733,0.795,0.809,0.855,0.787,0.784,0.295,0.497,0.751,0.699,0.215,0.693,0.675,0.27,0.731,0.69,0.44,0.392,187,Stability,PKN1_HUMAN,High,Human
+POLG_PESV_Tsuboyama_2023_2MXD_indels,0.499,0.538,0.316,0.377,0.435,0.33,0.31,0.253,0.288,0.215,0.357,0.107,0.388,0.434,0.434,-0.192,-0.135,-0.159,-0.166,-0.103,-0.119,0.304,0.256,149,Stability,POLG_PESV,Medium,Virus
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,0.251,0.688,0.434,0.63,0.654,0.6,0.631,0.677,0.672,0.688,0.638,0.484,0.43,0.368,0.294,0.184,0.227,0.119,0.215,0.262,0.146,0.625,0.163,168,Stability,PR40A_HUMAN,Medium,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE_indels,-0.049,0.155,0.142,0.042,0.133,0.142,0.145,0.345,-0.089,0.286,0.397,0.256,0.175,0.315,-0.092,0.206,0.377,0.095,0.215,0.39,0.081,0.322,0.296,175,Stability,PSAE_SYNP2,Medium,Prokaryote
+PTEN_HUMAN_Mighell_2018_indels,-0.325,0.697,0.612,0.575,0.504,0.523,0.696,0.552,0.64,0.59,0.402,0.056,0.68,0.7,0.563,0.661,0.678,0.546,0.67,0.708,0.602,0.668,0.237,314,Activity,PTEN_HUMAN,Medium,Human
+Q8EG35_SHEON_Campbell_2022_indels,0.528,0.244,0.468,0.465,0.38,0.281,0.584,0.403,0.352,0.33,0.348,0.286,0.587,0.615,0.375,0.409,0.612,0.395,0.411,0.562,0.374,0.472,0.278,331,OrganismalFitness,Q8EG35_SHEON,Medium,Prokaryote
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,0.565,0.571,0.42,0.764,0.804,0.762,0.528,0.722,0.728,0.704,0.61,0.215,0.531,0.601,0.529,0.076,0.477,0.452,0.113,0.522,0.485,0.644,-0.069,97,Stability,RAD_ANTMA,High,Eukaryote
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,0.192,0.724,0.045,0.093,0.223,0.374,-0.013,0.306,0.297,0.23,0.504,0.096,0.073,0.054,0.192,-0.374,-0.423,-0.379,-0.368,-0.421,-0.368,0.643,0.168,124,Stability,RCD1_ARATH,Medium,Eukaryote
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,0.396,0.725,0.565,0.676,0.644,0.749,0.622,0.733,0.731,0.73,0.702,0.539,0.502,0.669,0.744,0.255,0.498,0.643,0.278,0.556,0.677,0.601,0.418,120,Stability,RD23A_HUMAN,High,Human
+RPC1_BP434_Tsuboyama_2023_1R69_indels,0.607,0.349,0.637,0.658,0.59,0.627,0.658,0.644,0.602,0.675,0.35,0.161,0.565,0.622,0.678,0.305,0.408,0.508,0.312,0.417,0.53,0.133,0.297,164,Stability,RPC1_BP434,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,0.359,0.252,0.079,0.066,0.114,0.16,0.143,0.328,0.291,0.274,0.296,0.056,0.586,0.277,0.211,0.348,0.137,-0.031,0.388,0.172,0.01,0.086,0.114,176,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance_indels,0.203,0.056,0.311,0.362,0.528,0.49,0.324,0.496,0.494,0.43,0.33,0.208,0.342,0.404,0.429,0.272,0.364,0.428,0.323,0.408,0.425,0.236,0.269,430,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity_indels,0.304,0.118,0.402,0.453,0.556,0.541,0.436,0.597,0.543,0.51,0.435,0.21,0.405,0.489,0.503,0.332,0.394,0.487,0.383,0.476,0.481,0.324,0.334,490,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,0.712,0.764,0.639,0.65,0.629,0.629,0.652,0.7,0.644,0.668,0.645,0.575,0.72,0.687,0.706,0.315,0.547,0.608,0.408,0.619,0.674,0.631,0.585,86,Stability,SAV1_MOUSE,High,Eukaryote
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,0.181,0.31,0.264,0.354,0.421,0.572,0.205,0.327,0.073,0.281,0.513,0.068,0.35,0.457,0.395,-0.151,-0.144,0.322,-0.13,-0.114,0.361,0.581,0.353,127,Stability,SDA_BACSU,Medium,Prokaryote
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,0.61,0.502,0.469,0.631,0.615,0.699,0.405,0.63,0.59,0.611,0.682,0.425,0.038,0.594,0.605,-0.268,-0.039,0.247,-0.232,0.025,0.351,0.399,0.424,109,Stability,SOX30_HUMAN,High,Human
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,-0.107,0.41,0.215,0.211,0.455,0.434,0.313,0.444,0.38,0.47,0.573,-0.171,0.025,0.154,0.597,0.144,0.167,0.287,0.146,0.171,0.327,0.133,0.298,148,Stability,SPG2_STRSG,Medium,Prokaryote
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,0.358,-0.002,0.515,0.565,0.523,0.625,0.515,0.693,0.664,0.663,0.597,0.195,0.567,0.47,0.573,0.431,0.477,0.502,0.451,0.497,0.522,0.186,0.152,129,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,0.31,0.31,0.35,0.493,0.494,0.515,0.548,0.471,0.476,0.459,0.486,0.142,0.375,0.523,0.454,0.028,0.203,0.365,0.046,0.278,0.415,0.011,0.479,111,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,0.048,0.374,0.425,0.282,0.392,0.36,0.271,0.344,0.489,0.484,0.474,0.367,0.319,0.099,0.357,0.4,0.35,0.495,0.415,0.364,0.499,0.35,0.303,135,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,0.574,0.707,0.647,0.683,0.675,0.668,0.675,0.677,0.693,0.676,0.581,0.526,0.683,0.71,0.731,0.718,0.753,0.752,0.72,0.755,0.757,0.494,0.449,154,Stability,SRBS1_HUMAN,High,Human
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,0.66,0.823,0.645,0.659,0.69,0.758,0.552,0.657,0.636,0.649,0.708,0.412,0.347,0.618,0.698,-0.446,-0.302,-0.164,-0.397,-0.235,-0.064,0.822,0.335,99,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,0.416,0.663,0.253,0.607,0.652,0.666,0.631,0.788,0.714,0.759,0.737,0.425,0.335,0.693,0.664,-0.283,0.144,0.061,-0.255,0.216,0.128,0.371,0.684,82,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,-0.025,0.218,0.315,0.29,0.217,0.266,0.382,0.333,0.282,0.367,0.27,0.119,0.136,0.193,0.291,-0.054,0.02,0.103,-0.053,0.025,0.103,0.163,0.312,171,Stability,TNKS2_HUMAN,High,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,-0.011,0.22,0.126,0.162,0.152,0.239,0.348,0.187,0.248,0.239,0.335,0.267,0.151,0.085,0.325,-0.17,-0.17,0.114,-0.188,-0.181,0.127,-0.074,0.312,147,Stability,UBE4B_HUMAN,High,Human
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,0.223,0.385,0.481,0.389,0.414,0.431,0.424,0.377,0.488,0.51,0.521,0.358,0.429,0.481,0.446,-0.084,0.114,0.127,-0.067,0.156,0.162,0.241,0.253,156,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,0.519,0.726,0.528,0.731,0.688,0.84,0.372,0.577,0.425,0.401,0.711,-0.16,0.417,0.747,0.823,-0.029,0.078,0.562,-0.046,0.115,0.645,0.645,0.282,101,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,0.118,0.681,0.244,0.439,0.366,0.297,0.601,0.793,0.744,0.76,0.618,0.368,0.575,0.612,0.775,0.262,0.489,0.655,0.288,0.512,0.687,0.002,0.346,156,Stability,VILI_CHICK,High,Eukaryote
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,0.399,0.597,0.156,0.409,0.56,0.343,0.409,0.388,0.375,0.416,0.454,0.128,0.273,0.243,0.341,-0.187,-0.177,-0.233,-0.186,-0.185,-0.236,0.333,0.674,154,Stability,VRPI_BPT7,Medium,Virus
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,0.492,0.728,0.541,0.548,0.66,0.683,0.495,0.746,0.355,0.665,0.703,0.327,0.438,0.517,0.653,-0.19,-0.171,0.004,-0.179,-0.148,0.055,0.674,0.262,104,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.html b/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.html
new file mode 100644
index 0000000..62db53e
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Spearman/DMS_indels_Spearman_DMS_level.html
@@ -0,0 +1,2083 @@
+
+
+
+ score |
+ Unirep |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ Hidden Markov Model |
+ Provean |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A4_HUMAN_Seuma_2022_indels |
+ 0.523 |
+ 0.512 |
+ 0.472 |
+ 0.451 |
+ 0.438 |
+ 0.429 |
+ 0.539 |
+ 0.478 |
+ 0.494 |
+ 0.481 |
+ 0.462 |
+ 0.552 |
+ 0.555 |
+ 0.442 |
+ 0.444 |
+ 0.536 |
+ 0.466 |
+ 0.416 |
+ 0.544 |
+ 0.459 |
+ 0.466 |
+ -0.207 |
+ 0.381 |
+ 2346 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O_indels |
+ 0.026 |
+ 0.701 |
+ -0.314 |
+ 0.180 |
+ 0.561 |
+ 0.547 |
+ 0.019 |
+ 0.512 |
+ 0.502 |
+ 0.298 |
+ 0.523 |
+ 0.574 |
+ -0.044 |
+ -0.113 |
+ 0.125 |
+ -0.309 |
+ -0.260 |
+ -0.270 |
+ -0.300 |
+ -0.257 |
+ -0.250 |
+ 0.632 |
+ 0.331 |
+ 117 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY_indels |
+ 0.513 |
+ 0.611 |
+ 0.757 |
+ 0.594 |
+ 0.580 |
+ 0.729 |
+ 0.459 |
+ 0.603 |
+ 0.680 |
+ 0.698 |
+ 0.692 |
+ 0.171 |
+ 0.728 |
+ 0.586 |
+ 0.595 |
+ 0.252 |
+ 0.427 |
+ 0.505 |
+ 0.293 |
+ 0.477 |
+ 0.536 |
+ 0.454 |
+ 0.129 |
+ 181 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B1LPA6_ECOSM_Russ_2020_indels |
+ 0.275 |
+ 0.438 |
+ 0.294 |
+ 0.363 |
+ 0.337 |
+ 0.360 |
+ 0.384 |
+ 0.380 |
+ 0.415 |
+ 0.412 |
+ 0.379 |
+ 0.316 |
+ 0.172 |
+ 0.219 |
+ 0.223 |
+ 0.330 |
+ 0.344 |
+ 0.326 |
+ 0.316 |
+ 0.357 |
+ 0.340 |
+ 0.398 |
+ 0.372 |
+ 3074 |
+ Activity |
+ B1LPA6_ECOSM |
+ Medium |
+ Prokaryote |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0_indels |
+ 0.537 |
+ 0.410 |
+ 0.499 |
+ 0.499 |
+ 0.513 |
+ 0.486 |
+ 0.330 |
+ 0.548 |
+ 0.588 |
+ 0.524 |
+ 0.566 |
+ 0.259 |
+ 0.547 |
+ 0.588 |
+ 0.604 |
+ 0.460 |
+ 0.518 |
+ 0.524 |
+ 0.463 |
+ 0.515 |
+ 0.530 |
+ 0.290 |
+ 0.260 |
+ 134 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU_indels |
+ -0.055 |
+ 0.647 |
+ 0.025 |
+ 0.165 |
+ 0.085 |
+ 0.184 |
+ 0.195 |
+ 0.262 |
+ 0.086 |
+ 0.382 |
+ 0.539 |
+ 0.210 |
+ 0.180 |
+ 0.276 |
+ 0.263 |
+ -0.310 |
+ -0.287 |
+ -0.191 |
+ -0.274 |
+ -0.246 |
+ -0.147 |
+ 0.470 |
+ 0.283 |
+ 82 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Gonzalez_2019_indels |
+ -0.013 |
+ 0.437 |
+ 0.436 |
+ 0.455 |
+ 0.334 |
+ 0.345 |
+ 0.496 |
+ 0.619 |
+ 0.664 |
+ 0.604 |
+ 0.409 |
+ 0.155 |
+ 0.451 |
+ 0.379 |
+ 0.296 |
+ 0.425 |
+ 0.365 |
+ 0.344 |
+ 0.439 |
+ 0.401 |
+ 0.342 |
+ 0.347 |
+ 0.385 |
+ 4751 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ CAPSD_AAV2S_Sinai_2021_designed_indels |
+ -0.409 |
+ 0.666 |
+ 0.298 |
+ 0.358 |
+ 0.543 |
+ 0.621 |
+ -0.457 |
+ -0.466 |
+ -0.402 |
+ -0.469 |
+ 0.492 |
+ 0.159 |
+ 0.336 |
+ 0.362 |
+ 0.691 |
+ 0.715 |
+ 0.677 |
+ 0.709 |
+ 0.718 |
+ 0.726 |
+ 0.736 |
+ 0.607 |
+ 0.683 |
+ 225998 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAPSD_AAV2S_Sinai_2021_library_indels |
+ -0.125 |
+ -0.007 |
+ 0.043 |
+ 0.119 |
+ 0.103 |
+ 0.141 |
+ -0.117 |
+ -0.100 |
+ -0.084 |
+ -0.104 |
+ 0.167 |
+ 0.083 |
+ 0.044 |
+ 0.126 |
+ 0.210 |
+ 0.425 |
+ 0.371 |
+ 0.338 |
+ 0.419 |
+ 0.419 |
+ 0.416 |
+ 0.057 |
+ 0.177 |
+ 24908 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI_indels |
+ 0.466 |
+ 0.439 |
+ 0.505 |
+ 0.425 |
+ 0.421 |
+ 0.334 |
+ 0.416 |
+ 0.337 |
+ 0.346 |
+ 0.287 |
+ 0.338 |
+ 0.072 |
+ 0.521 |
+ 0.242 |
+ 0.355 |
+ 0.468 |
+ 0.274 |
+ 0.368 |
+ 0.473 |
+ 0.261 |
+ 0.353 |
+ 0.145 |
+ 0.239 |
+ 197 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels |
+ 0.412 |
+ 0.375 |
+ 0.494 |
+ 0.589 |
+ 0.449 |
+ 0.445 |
+ 0.244 |
+ 0.410 |
+ 0.452 |
+ 0.379 |
+ 0.493 |
+ 0.018 |
+ 0.455 |
+ 0.609 |
+ 0.461 |
+ -0.068 |
+ -0.041 |
+ -0.012 |
+ -0.052 |
+ -0.016 |
+ 0.013 |
+ 0.472 |
+ 0.366 |
+ 205 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28_indels |
+ 0.231 |
+ 0.310 |
+ 0.208 |
+ 0.294 |
+ 0.259 |
+ 0.233 |
+ 0.199 |
+ 0.234 |
+ 0.201 |
+ 0.265 |
+ 0.208 |
+ -0.017 |
+ 0.338 |
+ 0.287 |
+ 0.297 |
+ 0.261 |
+ 0.259 |
+ 0.300 |
+ 0.279 |
+ 0.271 |
+ 0.305 |
+ 0.380 |
+ 0.073 |
+ 129 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM_indels |
+ 0.637 |
+ 0.581 |
+ 0.684 |
+ 0.668 |
+ 0.551 |
+ 0.753 |
+ 0.684 |
+ 0.782 |
+ 0.806 |
+ 0.722 |
+ 0.745 |
+ 0.272 |
+ 0.512 |
+ 0.681 |
+ 0.770 |
+ -0.194 |
+ 0.047 |
+ 0.158 |
+ -0.171 |
+ 0.094 |
+ 0.223 |
+ 0.536 |
+ 0.312 |
+ 195 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX_indels |
+ 0.528 |
+ 0.695 |
+ 0.510 |
+ 0.518 |
+ 0.535 |
+ 0.466 |
+ 0.255 |
+ 0.351 |
+ 0.346 |
+ 0.404 |
+ 0.636 |
+ 0.270 |
+ 0.471 |
+ 0.386 |
+ 0.507 |
+ -0.183 |
+ -0.181 |
+ -0.153 |
+ -0.153 |
+ -0.164 |
+ -0.125 |
+ 0.474 |
+ 0.460 |
+ 140 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC_indels |
+ 0.190 |
+ 0.242 |
+ 0.179 |
+ 0.292 |
+ 0.364 |
+ 0.285 |
+ 0.380 |
+ 0.362 |
+ 0.303 |
+ 0.317 |
+ 0.458 |
+ 0.026 |
+ 0.243 |
+ 0.218 |
+ 0.196 |
+ -0.117 |
+ -0.117 |
+ -0.135 |
+ -0.103 |
+ -0.103 |
+ -0.115 |
+ 0.120 |
+ 0.696 |
+ 136 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels |
+ 0.476 |
+ 0.291 |
+ 0.262 |
+ 0.378 |
+ 0.466 |
+ 0.311 |
+ 0.431 |
+ 0.332 |
+ 0.400 |
+ 0.285 |
+ 0.415 |
+ 0.230 |
+ 0.536 |
+ 0.498 |
+ 0.495 |
+ 0.343 |
+ 0.368 |
+ 0.396 |
+ 0.371 |
+ 0.387 |
+ 0.413 |
+ 0.220 |
+ 0.109 |
+ 174 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels |
+ 0.414 |
+ 0.619 |
+ 0.554 |
+ 0.559 |
+ 0.627 |
+ 0.515 |
+ 0.620 |
+ 0.596 |
+ 0.553 |
+ 0.559 |
+ 0.628 |
+ 0.309 |
+ 0.640 |
+ 0.700 |
+ 0.659 |
+ 0.391 |
+ 0.480 |
+ 0.463 |
+ 0.420 |
+ 0.513 |
+ 0.495 |
+ 0.433 |
+ 0.353 |
+ 154 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels |
+ 0.243 |
+ 0.429 |
+ 0.408 |
+ 0.378 |
+ 0.459 |
+ 0.428 |
+ 0.547 |
+ 0.443 |
+ 0.601 |
+ 0.554 |
+ 0.600 |
+ 0.361 |
+ 0.416 |
+ 0.367 |
+ 0.405 |
+ 0.112 |
+ 0.198 |
+ 0.269 |
+ 0.158 |
+ 0.229 |
+ 0.308 |
+ 0.381 |
+ 0.316 |
+ 185 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U_indels |
+ 0.426 |
+ 0.187 |
+ 0.334 |
+ 0.387 |
+ 0.411 |
+ 0.376 |
+ 0.350 |
+ 0.494 |
+ 0.576 |
+ 0.505 |
+ 0.429 |
+ 0.104 |
+ 0.204 |
+ 0.286 |
+ 0.029 |
+ -0.035 |
+ 0.076 |
+ -0.058 |
+ -0.037 |
+ 0.084 |
+ -0.058 |
+ 0.226 |
+ 0.444 |
+ 193 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Eukaryote |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q_indels |
+ 0.315 |
+ 0.396 |
+ 0.625 |
+ 0.593 |
+ 0.775 |
+ 0.815 |
+ 0.505 |
+ 0.451 |
+ 0.448 |
+ 0.480 |
+ 0.589 |
+ 0.244 |
+ 0.521 |
+ 0.470 |
+ 0.517 |
+ -0.096 |
+ -0.107 |
+ 0.383 |
+ -0.108 |
+ -0.130 |
+ 0.405 |
+ 0.408 |
+ 0.378 |
+ 148 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM_indels |
+ 0.239 |
+ 0.603 |
+ 0.307 |
+ 0.434 |
+ 0.522 |
+ 0.572 |
+ 0.436 |
+ 0.567 |
+ 0.539 |
+ 0.545 |
+ 0.524 |
+ 0.025 |
+ 0.129 |
+ 0.331 |
+ 0.104 |
+ 0.189 |
+ 0.338 |
+ 0.205 |
+ 0.179 |
+ 0.348 |
+ 0.183 |
+ 0.551 |
+ 0.282 |
+ 154 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019_indels |
+ 0.002 |
+ 0.687 |
+ 0.638 |
+ 0.656 |
+ 0.677 |
+ 0.684 |
+ 0.669 |
+ 0.702 |
+ 0.706 |
+ 0.703 |
+ 0.713 |
+ -0.091 |
+ 0.630 |
+ 0.687 |
+ 0.707 |
+ 0.628 |
+ 0.611 |
+ 0.655 |
+ 0.639 |
+ 0.678 |
+ 0.695 |
+ 0.548 |
+ 0.701 |
+ 6102 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33_indels |
+ 0.030 |
+ 0.503 |
+ 0.425 |
+ 0.518 |
+ 0.310 |
+ 0.346 |
+ 0.583 |
+ 0.403 |
+ 0.565 |
+ 0.618 |
+ 0.527 |
+ 0.052 |
+ 0.258 |
+ 0.672 |
+ 0.632 |
+ -0.033 |
+ 0.309 |
+ 0.423 |
+ -0.003 |
+ 0.369 |
+ 0.479 |
+ 0.291 |
+ 0.216 |
+ 193 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ KCNJ2_MOUSE_Macdonald_2022_indels |
+ 0.162 |
+ 0.408 |
+ 0.385 |
+ 0.400 |
+ 0.383 |
+ 0.377 |
+ 0.401 |
+ 0.432 |
+ 0.426 |
+ 0.431 |
+ 0.387 |
+ 0.265 |
+ 0.401 |
+ 0.412 |
+ 0.391 |
+ 0.441 |
+ 0.437 |
+ 0.440 |
+ 0.434 |
+ 0.444 |
+ 0.431 |
+ 0.368 |
+ 0.386 |
+ 10501 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V_indels |
+ 0.412 |
+ 0.049 |
+ -0.077 |
+ 0.313 |
+ 0.293 |
+ 0.279 |
+ 0.321 |
+ 0.369 |
+ 0.314 |
+ 0.300 |
+ 0.349 |
+ 0.043 |
+ -0.014 |
+ 0.167 |
+ 0.274 |
+ 0.083 |
+ 0.124 |
+ 0.196 |
+ 0.075 |
+ 0.141 |
+ 0.230 |
+ -0.033 |
+ 0.018 |
+ 115 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV_indels |
+ 0.216 |
+ 0.312 |
+ 0.200 |
+ 0.106 |
+ 0.472 |
+ 0.494 |
+ 0.320 |
+ 0.670 |
+ 0.565 |
+ 0.447 |
+ 0.499 |
+ 0.298 |
+ 0.093 |
+ 0.174 |
+ 0.138 |
+ 0.041 |
+ 0.094 |
+ 0.045 |
+ 0.074 |
+ 0.119 |
+ 0.081 |
+ 0.253 |
+ 0.482 |
+ 131 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT_indels |
+ 0.096 |
+ 0.354 |
+ 0.390 |
+ 0.283 |
+ 0.535 |
+ 0.470 |
+ 0.616 |
+ 0.533 |
+ 0.397 |
+ 0.484 |
+ 0.548 |
+ 0.067 |
+ 0.544 |
+ 0.525 |
+ 0.412 |
+ 0.286 |
+ 0.295 |
+ 0.242 |
+ 0.343 |
+ 0.358 |
+ 0.285 |
+ 0.147 |
+ 0.519 |
+ 80 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R_indels |
+ 0.667 |
+ 0.772 |
+ 0.648 |
+ 0.767 |
+ 0.761 |
+ 0.754 |
+ 0.711 |
+ 0.706 |
+ 0.767 |
+ 0.698 |
+ 0.677 |
+ 0.073 |
+ 0.612 |
+ 0.667 |
+ 0.698 |
+ 0.476 |
+ 0.609 |
+ 0.658 |
+ 0.487 |
+ 0.617 |
+ 0.652 |
+ 0.612 |
+ 0.331 |
+ 178 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL_indels |
+ 0.128 |
+ 0.340 |
+ 0.219 |
+ 0.315 |
+ 0.279 |
+ 0.241 |
+ 0.253 |
+ 0.227 |
+ 0.287 |
+ 0.213 |
+ 0.226 |
+ -0.027 |
+ 0.206 |
+ 0.289 |
+ 0.266 |
+ -0.076 |
+ 0.033 |
+ 0.002 |
+ -0.076 |
+ 0.065 |
+ 0.029 |
+ 0.064 |
+ 0.320 |
+ 191 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6_indels |
+ 0.441 |
+ 0.023 |
+ -0.125 |
+ -0.044 |
+ -0.101 |
+ -0.147 |
+ 0.111 |
+ -0.039 |
+ -0.026 |
+ -0.083 |
+ -0.044 |
+ 0.101 |
+ -0.161 |
+ -0.070 |
+ 0.167 |
+ -0.156 |
+ -0.140 |
+ 0.062 |
+ -0.129 |
+ -0.112 |
+ 0.094 |
+ -0.031 |
+ 0.170 |
+ 157 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels |
+ 0.505 |
+ 0.505 |
+ 0.807 |
+ 0.678 |
+ 0.691 |
+ 0.714 |
+ 0.780 |
+ 0.815 |
+ 0.845 |
+ 0.733 |
+ 0.696 |
+ 0.390 |
+ 0.367 |
+ 0.437 |
+ 0.584 |
+ 0.149 |
+ 0.250 |
+ 0.330 |
+ 0.162 |
+ 0.274 |
+ 0.373 |
+ 0.451 |
+ 0.152 |
+ 169 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G_indels |
+ 0.142 |
+ 0.677 |
+ 0.762 |
+ 0.833 |
+ 0.857 |
+ 0.873 |
+ 0.821 |
+ 0.881 |
+ 0.863 |
+ 0.838 |
+ 0.850 |
+ 0.331 |
+ 0.520 |
+ 0.444 |
+ 0.492 |
+ 0.204 |
+ 0.337 |
+ 0.258 |
+ 0.269 |
+ 0.383 |
+ 0.284 |
+ 0.629 |
+ 0.686 |
+ 47 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels |
+ 0.468 |
+ 0.221 |
+ 0.367 |
+ 0.336 |
+ 0.388 |
+ 0.429 |
+ 0.303 |
+ 0.482 |
+ 0.587 |
+ 0.468 |
+ 0.546 |
+ 0.286 |
+ 0.251 |
+ 0.462 |
+ 0.501 |
+ -0.162 |
+ -0.078 |
+ 0.155 |
+ -0.148 |
+ -0.042 |
+ 0.174 |
+ 0.360 |
+ 0.519 |
+ 84 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018_indels |
+ 0.073 |
+ 0.031 |
+ 0.360 |
+ 0.407 |
+ 0.383 |
+ 0.273 |
+ 0.474 |
+ 0.428 |
+ 0.391 |
+ 0.449 |
+ 0.354 |
+ 0.026 |
+ 0.536 |
+ 0.579 |
+ 0.395 |
+ 0.409 |
+ 0.536 |
+ 0.362 |
+ 0.429 |
+ 0.560 |
+ 0.399 |
+ 0.482 |
+ 0.273 |
+ 341 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C_indels |
+ 0.787 |
+ 0.758 |
+ 0.732 |
+ 0.752 |
+ 0.771 |
+ 0.735 |
+ 0.805 |
+ 0.634 |
+ 0.619 |
+ 0.719 |
+ 0.608 |
+ 0.577 |
+ 0.649 |
+ 0.740 |
+ 0.789 |
+ 0.183 |
+ 0.544 |
+ 0.666 |
+ 0.256 |
+ 0.605 |
+ 0.693 |
+ 0.624 |
+ 0.580 |
+ 106 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M_indels |
+ 0.416 |
+ 0.481 |
+ 0.389 |
+ 0.633 |
+ 0.625 |
+ 0.671 |
+ 0.409 |
+ 0.690 |
+ 0.543 |
+ 0.708 |
+ 0.655 |
+ 0.463 |
+ 0.249 |
+ 0.535 |
+ 0.551 |
+ -0.063 |
+ 0.160 |
+ 0.131 |
+ 0.013 |
+ 0.239 |
+ 0.227 |
+ 0.339 |
+ 0.436 |
+ 117 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF_indels |
+ 0.099 |
+ 0.727 |
+ 0.734 |
+ 0.779 |
+ 0.729 |
+ 0.733 |
+ 0.795 |
+ 0.809 |
+ 0.855 |
+ 0.787 |
+ 0.784 |
+ 0.295 |
+ 0.497 |
+ 0.751 |
+ 0.699 |
+ 0.215 |
+ 0.693 |
+ 0.675 |
+ 0.270 |
+ 0.731 |
+ 0.690 |
+ 0.440 |
+ 0.392 |
+ 187 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD_indels |
+ 0.499 |
+ 0.538 |
+ 0.316 |
+ 0.377 |
+ 0.435 |
+ 0.330 |
+ 0.310 |
+ 0.253 |
+ 0.288 |
+ 0.215 |
+ 0.357 |
+ 0.107 |
+ 0.388 |
+ 0.434 |
+ 0.434 |
+ -0.192 |
+ -0.135 |
+ -0.159 |
+ -0.166 |
+ -0.103 |
+ -0.119 |
+ 0.304 |
+ 0.256 |
+ 149 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC_indels |
+ 0.251 |
+ 0.688 |
+ 0.434 |
+ 0.630 |
+ 0.654 |
+ 0.600 |
+ 0.631 |
+ 0.677 |
+ 0.672 |
+ 0.688 |
+ 0.638 |
+ 0.484 |
+ 0.430 |
+ 0.368 |
+ 0.294 |
+ 0.184 |
+ 0.227 |
+ 0.119 |
+ 0.215 |
+ 0.262 |
+ 0.146 |
+ 0.625 |
+ 0.163 |
+ 168 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE_indels |
+ -0.049 |
+ 0.155 |
+ 0.142 |
+ 0.042 |
+ 0.133 |
+ 0.142 |
+ 0.145 |
+ 0.345 |
+ -0.089 |
+ 0.286 |
+ 0.397 |
+ 0.256 |
+ 0.175 |
+ 0.315 |
+ -0.092 |
+ 0.206 |
+ 0.377 |
+ 0.095 |
+ 0.215 |
+ 0.390 |
+ 0.081 |
+ 0.322 |
+ 0.296 |
+ 175 |
+ Stability |
+ PSAE_SYNP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Mighell_2018_indels |
+ -0.325 |
+ 0.697 |
+ 0.612 |
+ 0.575 |
+ 0.504 |
+ 0.523 |
+ 0.696 |
+ 0.552 |
+ 0.640 |
+ 0.590 |
+ 0.402 |
+ 0.056 |
+ 0.680 |
+ 0.700 |
+ 0.563 |
+ 0.661 |
+ 0.678 |
+ 0.546 |
+ 0.670 |
+ 0.708 |
+ 0.602 |
+ 0.668 |
+ 0.237 |
+ 314 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q8EG35_SHEON_Campbell_2022_indels |
+ 0.528 |
+ 0.244 |
+ 0.468 |
+ 0.465 |
+ 0.380 |
+ 0.281 |
+ 0.584 |
+ 0.403 |
+ 0.352 |
+ 0.330 |
+ 0.348 |
+ 0.286 |
+ 0.587 |
+ 0.615 |
+ 0.375 |
+ 0.409 |
+ 0.612 |
+ 0.395 |
+ 0.411 |
+ 0.562 |
+ 0.374 |
+ 0.472 |
+ 0.278 |
+ 331 |
+ OrganismalFitness |
+ Q8EG35_SHEON |
+ Medium |
+ Prokaryote |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ_indels |
+ 0.565 |
+ 0.571 |
+ 0.420 |
+ 0.764 |
+ 0.804 |
+ 0.762 |
+ 0.528 |
+ 0.722 |
+ 0.728 |
+ 0.704 |
+ 0.610 |
+ 0.215 |
+ 0.531 |
+ 0.601 |
+ 0.529 |
+ 0.076 |
+ 0.477 |
+ 0.452 |
+ 0.113 |
+ 0.522 |
+ 0.485 |
+ 0.644 |
+ -0.069 |
+ 97 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO_indels |
+ 0.192 |
+ 0.724 |
+ 0.045 |
+ 0.093 |
+ 0.223 |
+ 0.374 |
+ -0.013 |
+ 0.306 |
+ 0.297 |
+ 0.230 |
+ 0.504 |
+ 0.096 |
+ 0.073 |
+ 0.054 |
+ 0.192 |
+ -0.374 |
+ -0.423 |
+ -0.379 |
+ -0.368 |
+ -0.421 |
+ -0.368 |
+ 0.643 |
+ 0.168 |
+ 124 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY_indels |
+ 0.396 |
+ 0.725 |
+ 0.565 |
+ 0.676 |
+ 0.644 |
+ 0.749 |
+ 0.622 |
+ 0.733 |
+ 0.731 |
+ 0.730 |
+ 0.702 |
+ 0.539 |
+ 0.502 |
+ 0.669 |
+ 0.744 |
+ 0.255 |
+ 0.498 |
+ 0.643 |
+ 0.278 |
+ 0.556 |
+ 0.677 |
+ 0.601 |
+ 0.418 |
+ 120 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69_indels |
+ 0.607 |
+ 0.349 |
+ 0.637 |
+ 0.658 |
+ 0.590 |
+ 0.627 |
+ 0.658 |
+ 0.644 |
+ 0.602 |
+ 0.675 |
+ 0.350 |
+ 0.161 |
+ 0.565 |
+ 0.622 |
+ 0.678 |
+ 0.305 |
+ 0.408 |
+ 0.508 |
+ 0.312 |
+ 0.417 |
+ 0.530 |
+ 0.133 |
+ 0.297 |
+ 164 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32_indels |
+ 0.359 |
+ 0.252 |
+ 0.079 |
+ 0.066 |
+ 0.114 |
+ 0.160 |
+ 0.143 |
+ 0.328 |
+ 0.291 |
+ 0.274 |
+ 0.296 |
+ 0.056 |
+ 0.586 |
+ 0.277 |
+ 0.211 |
+ 0.348 |
+ 0.137 |
+ -0.031 |
+ 0.388 |
+ 0.172 |
+ 0.010 |
+ 0.086 |
+ 0.114 |
+ 176 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance_indels |
+ 0.203 |
+ 0.056 |
+ 0.311 |
+ 0.362 |
+ 0.528 |
+ 0.490 |
+ 0.324 |
+ 0.496 |
+ 0.494 |
+ 0.430 |
+ 0.330 |
+ 0.208 |
+ 0.342 |
+ 0.404 |
+ 0.429 |
+ 0.272 |
+ 0.364 |
+ 0.428 |
+ 0.323 |
+ 0.408 |
+ 0.425 |
+ 0.236 |
+ 0.269 |
+ 430 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity_indels |
+ 0.304 |
+ 0.118 |
+ 0.402 |
+ 0.453 |
+ 0.556 |
+ 0.541 |
+ 0.436 |
+ 0.597 |
+ 0.543 |
+ 0.510 |
+ 0.435 |
+ 0.210 |
+ 0.405 |
+ 0.489 |
+ 0.503 |
+ 0.332 |
+ 0.394 |
+ 0.487 |
+ 0.383 |
+ 0.476 |
+ 0.481 |
+ 0.324 |
+ 0.334 |
+ 490 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB_indels |
+ 0.712 |
+ 0.764 |
+ 0.639 |
+ 0.650 |
+ 0.629 |
+ 0.629 |
+ 0.652 |
+ 0.700 |
+ 0.644 |
+ 0.668 |
+ 0.645 |
+ 0.575 |
+ 0.720 |
+ 0.687 |
+ 0.706 |
+ 0.315 |
+ 0.547 |
+ 0.608 |
+ 0.408 |
+ 0.619 |
+ 0.674 |
+ 0.631 |
+ 0.585 |
+ 86 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0_indels |
+ 0.181 |
+ 0.310 |
+ 0.264 |
+ 0.354 |
+ 0.421 |
+ 0.572 |
+ 0.205 |
+ 0.327 |
+ 0.073 |
+ 0.281 |
+ 0.513 |
+ 0.068 |
+ 0.350 |
+ 0.457 |
+ 0.395 |
+ -0.151 |
+ -0.144 |
+ 0.322 |
+ -0.130 |
+ -0.114 |
+ 0.361 |
+ 0.581 |
+ 0.353 |
+ 127 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK_indels |
+ 0.610 |
+ 0.502 |
+ 0.469 |
+ 0.631 |
+ 0.615 |
+ 0.699 |
+ 0.405 |
+ 0.630 |
+ 0.590 |
+ 0.611 |
+ 0.682 |
+ 0.425 |
+ 0.038 |
+ 0.594 |
+ 0.605 |
+ -0.268 |
+ -0.039 |
+ 0.247 |
+ -0.232 |
+ 0.025 |
+ 0.351 |
+ 0.399 |
+ 0.424 |
+ 109 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS_indels |
+ -0.107 |
+ 0.410 |
+ 0.215 |
+ 0.211 |
+ 0.455 |
+ 0.434 |
+ 0.313 |
+ 0.444 |
+ 0.380 |
+ 0.470 |
+ 0.573 |
+ -0.171 |
+ 0.025 |
+ 0.154 |
+ 0.597 |
+ 0.144 |
+ 0.167 |
+ 0.287 |
+ 0.146 |
+ 0.171 |
+ 0.327 |
+ 0.133 |
+ 0.298 |
+ 148 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD_indels |
+ 0.358 |
+ -0.002 |
+ 0.515 |
+ 0.565 |
+ 0.523 |
+ 0.625 |
+ 0.515 |
+ 0.693 |
+ 0.664 |
+ 0.663 |
+ 0.597 |
+ 0.195 |
+ 0.567 |
+ 0.470 |
+ 0.573 |
+ 0.431 |
+ 0.477 |
+ 0.502 |
+ 0.451 |
+ 0.497 |
+ 0.522 |
+ 0.186 |
+ 0.152 |
+ 129 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels |
+ 0.310 |
+ 0.310 |
+ 0.350 |
+ 0.493 |
+ 0.494 |
+ 0.515 |
+ 0.548 |
+ 0.471 |
+ 0.476 |
+ 0.459 |
+ 0.486 |
+ 0.142 |
+ 0.375 |
+ 0.523 |
+ 0.454 |
+ 0.028 |
+ 0.203 |
+ 0.365 |
+ 0.046 |
+ 0.278 |
+ 0.415 |
+ 0.011 |
+ 0.479 |
+ 111 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88_indels |
+ 0.048 |
+ 0.374 |
+ 0.425 |
+ 0.282 |
+ 0.392 |
+ 0.360 |
+ 0.271 |
+ 0.344 |
+ 0.489 |
+ 0.484 |
+ 0.474 |
+ 0.367 |
+ 0.319 |
+ 0.099 |
+ 0.357 |
+ 0.400 |
+ 0.350 |
+ 0.495 |
+ 0.415 |
+ 0.364 |
+ 0.499 |
+ 0.350 |
+ 0.303 |
+ 135 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels |
+ 0.574 |
+ 0.707 |
+ 0.647 |
+ 0.683 |
+ 0.675 |
+ 0.668 |
+ 0.675 |
+ 0.677 |
+ 0.693 |
+ 0.676 |
+ 0.581 |
+ 0.526 |
+ 0.683 |
+ 0.710 |
+ 0.731 |
+ 0.718 |
+ 0.753 |
+ 0.752 |
+ 0.720 |
+ 0.755 |
+ 0.757 |
+ 0.494 |
+ 0.449 |
+ 154 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels |
+ 0.660 |
+ 0.823 |
+ 0.645 |
+ 0.659 |
+ 0.690 |
+ 0.758 |
+ 0.552 |
+ 0.657 |
+ 0.636 |
+ 0.649 |
+ 0.708 |
+ 0.412 |
+ 0.347 |
+ 0.618 |
+ 0.698 |
+ -0.446 |
+ -0.302 |
+ -0.164 |
+ -0.397 |
+ -0.235 |
+ -0.064 |
+ 0.822 |
+ 0.335 |
+ 99 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG_indels |
+ 0.416 |
+ 0.663 |
+ 0.253 |
+ 0.607 |
+ 0.652 |
+ 0.666 |
+ 0.631 |
+ 0.788 |
+ 0.714 |
+ 0.759 |
+ 0.737 |
+ 0.425 |
+ 0.335 |
+ 0.693 |
+ 0.664 |
+ -0.283 |
+ 0.144 |
+ 0.061 |
+ -0.255 |
+ 0.216 |
+ 0.128 |
+ 0.371 |
+ 0.684 |
+ 82 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels |
+ -0.025 |
+ 0.218 |
+ 0.315 |
+ 0.290 |
+ 0.217 |
+ 0.266 |
+ 0.382 |
+ 0.333 |
+ 0.282 |
+ 0.367 |
+ 0.270 |
+ 0.119 |
+ 0.136 |
+ 0.193 |
+ 0.291 |
+ -0.054 |
+ 0.020 |
+ 0.103 |
+ -0.053 |
+ 0.025 |
+ 0.103 |
+ 0.163 |
+ 0.312 |
+ 171 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels |
+ -0.011 |
+ 0.220 |
+ 0.126 |
+ 0.162 |
+ 0.152 |
+ 0.239 |
+ 0.348 |
+ 0.187 |
+ 0.248 |
+ 0.239 |
+ 0.335 |
+ 0.267 |
+ 0.151 |
+ 0.085 |
+ 0.325 |
+ -0.170 |
+ -0.170 |
+ 0.114 |
+ -0.188 |
+ -0.181 |
+ 0.127 |
+ -0.074 |
+ 0.312 |
+ 147 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T_indels |
+ 0.223 |
+ 0.385 |
+ 0.481 |
+ 0.389 |
+ 0.414 |
+ 0.431 |
+ 0.424 |
+ 0.377 |
+ 0.488 |
+ 0.510 |
+ 0.521 |
+ 0.358 |
+ 0.429 |
+ 0.481 |
+ 0.446 |
+ -0.084 |
+ 0.114 |
+ 0.127 |
+ -0.067 |
+ 0.156 |
+ 0.162 |
+ 0.241 |
+ 0.253 |
+ 156 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8_indels |
+ 0.519 |
+ 0.726 |
+ 0.528 |
+ 0.731 |
+ 0.688 |
+ 0.840 |
+ 0.372 |
+ 0.577 |
+ 0.425 |
+ 0.401 |
+ 0.711 |
+ -0.160 |
+ 0.417 |
+ 0.747 |
+ 0.823 |
+ -0.029 |
+ 0.078 |
+ 0.562 |
+ -0.046 |
+ 0.115 |
+ 0.645 |
+ 0.645 |
+ 0.282 |
+ 101 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5_indels |
+ 0.118 |
+ 0.681 |
+ 0.244 |
+ 0.439 |
+ 0.366 |
+ 0.297 |
+ 0.601 |
+ 0.793 |
+ 0.744 |
+ 0.760 |
+ 0.618 |
+ 0.368 |
+ 0.575 |
+ 0.612 |
+ 0.775 |
+ 0.262 |
+ 0.489 |
+ 0.655 |
+ 0.288 |
+ 0.512 |
+ 0.687 |
+ 0.002 |
+ 0.346 |
+ 156 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM_indels |
+ 0.399 |
+ 0.597 |
+ 0.156 |
+ 0.409 |
+ 0.560 |
+ 0.343 |
+ 0.409 |
+ 0.388 |
+ 0.375 |
+ 0.416 |
+ 0.454 |
+ 0.128 |
+ 0.273 |
+ 0.243 |
+ 0.341 |
+ -0.187 |
+ -0.177 |
+ -0.233 |
+ -0.186 |
+ -0.185 |
+ -0.236 |
+ 0.333 |
+ 0.674 |
+ 154 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD_indels |
+ 0.492 |
+ 0.728 |
+ 0.541 |
+ 0.548 |
+ 0.660 |
+ 0.683 |
+ 0.495 |
+ 0.746 |
+ 0.355 |
+ 0.665 |
+ 0.703 |
+ 0.327 |
+ 0.438 |
+ 0.517 |
+ 0.653 |
+ -0.190 |
+ -0.171 |
+ 0.004 |
+ -0.179 |
+ -0.148 |
+ 0.055 |
+ 0.674 |
+ 0.262 |
+ 104 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv b/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv
new file mode 100644
index 0000000..8132440
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.csv
@@ -0,0 +1,24 @@
+Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Model details,References
+1,Progen2 M,Protein language model,0.465,0.0,0.51,,0.464,0.374,0.511,0.208,0.472,0.549,0.527,0.564,0.421,0.338,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+2,Progen2 Base,Protein language model,0.464,0.01,0.533,,0.46,0.374,0.489,0.214,0.426,0.556,0.551,0.551,0.331,0.316,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+3,RITA L,Protein language model,0.457,0.035,0.466,,0.456,0.419,0.488,0.381,0.457,0.509,0.51,0.51,0.35,0.562,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+4,Tranception M no retrieval,Protein language model,0.453,0.036,0.469,,0.408,0.501,0.434,0.422,0.39,0.487,0.48,0.46,0.334,0.46,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+5,RITA XL,Protein language model,0.449,0.038,0.475,,0.434,0.393,0.496,0.361,0.471,0.509,0.513,0.511,0.367,0.556,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+6,Progen2 L,Protein language model,0.449,0.011,0.504,,0.43,0.36,0.5,0.214,0.45,0.547,0.536,0.537,0.406,0.317,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+7,Tranception L no retrieval,Protein language model,0.437,0.041,0.43,,0.41,0.445,0.463,0.43,0.39,0.523,0.485,0.49,0.331,0.541,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+8,RITA M,Protein language model,0.436,0.03,0.464,,0.381,0.444,0.456,0.366,0.4,0.511,0.501,0.468,0.333,0.501,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+9,Progen2 XL,Protein language model,0.431,0.036,0.405,,0.359,0.431,0.531,0.382,0.511,0.531,0.524,0.561,0.453,0.465,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+10,TranceptEVE M,Hybrid model,0.424,0.046,0.514,,0.426,0.555,0.202,0.531,0.127,0.331,0.313,0.27,0.161,0.114,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+11,Progen2 S,Protein language model,0.424,0.025,0.505,,0.363,0.387,0.441,0.242,0.383,0.507,0.493,0.463,0.356,0.328,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+12,TranceptEVE L,Hybrid model,0.412,0.046,0.474,,0.428,0.477,0.269,0.48,0.162,0.402,0.354,0.306,0.184,0.3,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+13,Tranception S no retrieval,Protein language model,0.41,0.036,0.419,,0.372,0.479,0.372,0.427,0.345,0.412,0.384,0.42,0.321,0.392,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,RITA S,Protein language model,0.397,0.032,0.436,,0.348,0.414,0.39,0.334,0.338,0.448,0.429,0.408,0.304,0.405,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+15,Tranception L,Hybrid model,0.395,0.044,0.453,,0.434,0.456,0.238,0.434,0.135,0.372,0.326,0.274,0.16,0.264,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,Tranception M,Hybrid model,0.394,0.048,0.472,,0.4,0.53,0.174,0.509,0.102,0.299,0.284,0.237,0.139,0.098,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+17,Hidden Markov Model,Alignment-based model,0.389,0.046,0.463,,0.302,0.436,0.354,0.202,0.408,0.34,0.393,0.352,0.337,0.359,Profile Hidden Markov model,HMMER: biosequence analysis using profile hidden Markov models
+18,Wavenet,Alignment-based model,0.368,0.067,0.418,,0.232,0.346,0.476,0.291,0.472,0.468,0.477,0.482,0.394,0.489,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+19,TranceptEVE S,Hybrid model,0.357,0.05,0.456,,0.378,0.497,0.096,0.514,0.062,0.194,0.181,0.163,0.113,0.062,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+20,Provean,Alignment-based model,0.347,0.047,0.314,,0.328,0.413,0.334,0.361,0.333,0.344,0.318,0.358,0.332,0.386,Provean model,"Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one."
+21,Tranception S,Hybrid model,0.34,0.053,0.441,,0.357,0.488,0.074,0.505,0.046,0.169,0.158,0.14,0.094,0.062,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+22,ProtGPT2,Protein language model,0.191,0.054,0.194,,0.236,0.099,0.235,0.233,0.188,0.253,0.287,0.233,0.15,0.1,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+23,Unirep,Protein language model,0.169,0.06,0.085,,0.182,0.065,0.342,0.11,0.267,0.358,0.298,0.364,0.216,0.345,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
diff --git a/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.html b/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.html
new file mode 100644
index 0000000..5ebba00
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Spearman/Summary_performance_DMS_indels_Spearman.html
@@ -0,0 +1,531 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_Spearman |
+ Bootstrap_standard_error_Spearman |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ TranceptEVE M |
+ Hybrid model |
+ 0.467 |
+ 0.000 |
+ 0.669 |
+ NaN |
+ 0.428 |
+ 0.555 |
+ 0.216 |
+ 0.531 |
+ 0.156 |
+ 0.342 |
+ 0.323 |
+ 0.282 |
+ 0.208 |
+ 0.126 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 2 |
+ Progen2 M |
+ Protein language model |
+ 0.467 |
+ 0.054 |
+ 0.510 |
+ NaN |
+ 0.466 |
+ 0.374 |
+ 0.517 |
+ 0.208 |
+ 0.475 |
+ 0.556 |
+ 0.534 |
+ 0.570 |
+ 0.427 |
+ 0.335 |
+ Progen2 medium model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 3 |
+ Progen2 Base |
+ Protein language model |
+ 0.466 |
+ 0.051 |
+ 0.533 |
+ NaN |
+ 0.462 |
+ 0.374 |
+ 0.494 |
+ 0.214 |
+ 0.427 |
+ 0.563 |
+ 0.558 |
+ 0.556 |
+ 0.333 |
+ 0.312 |
+ Progen2 base model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 4 |
+ RITA L |
+ Protein language model |
+ 0.459 |
+ 0.039 |
+ 0.466 |
+ NaN |
+ 0.456 |
+ 0.419 |
+ 0.495 |
+ 0.382 |
+ 0.462 |
+ 0.517 |
+ 0.517 |
+ 0.516 |
+ 0.356 |
+ 0.564 |
+ RITA large model (680M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 5 |
+ TranceptEVE L |
+ Hybrid model |
+ 0.457 |
+ 0.013 |
+ 0.640 |
+ NaN |
+ 0.429 |
+ 0.477 |
+ 0.282 |
+ 0.481 |
+ 0.192 |
+ 0.413 |
+ 0.364 |
+ 0.318 |
+ 0.232 |
+ 0.309 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 6 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.455 |
+ 0.045 |
+ 0.470 |
+ NaN |
+ 0.410 |
+ 0.501 |
+ 0.439 |
+ 0.422 |
+ 0.394 |
+ 0.492 |
+ 0.484 |
+ 0.466 |
+ 0.340 |
+ 0.460 |
+ Tranception Medium model (300M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 7 |
+ RITA XL |
+ Protein language model |
+ 0.452 |
+ 0.038 |
+ 0.475 |
+ NaN |
+ 0.435 |
+ 0.393 |
+ 0.503 |
+ 0.361 |
+ 0.476 |
+ 0.517 |
+ 0.520 |
+ 0.518 |
+ 0.375 |
+ 0.557 |
+ RITA xlarge model (1.2B params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 8 |
+ Progen2 L |
+ Protein language model |
+ 0.451 |
+ 0.049 |
+ 0.505 |
+ NaN |
+ 0.432 |
+ 0.360 |
+ 0.506 |
+ 0.214 |
+ 0.454 |
+ 0.554 |
+ 0.543 |
+ 0.542 |
+ 0.412 |
+ 0.315 |
+ Progen2 large model (2.7B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 9 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.439 |
+ 0.041 |
+ 0.431 |
+ NaN |
+ 0.412 |
+ 0.445 |
+ 0.469 |
+ 0.430 |
+ 0.395 |
+ 0.529 |
+ 0.491 |
+ 0.495 |
+ 0.339 |
+ 0.542 |
+ Tranception Large model (700M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 10 |
+ RITA M |
+ Protein language model |
+ 0.439 |
+ 0.032 |
+ 0.464 |
+ NaN |
+ 0.382 |
+ 0.444 |
+ 0.463 |
+ 0.366 |
+ 0.405 |
+ 0.519 |
+ 0.508 |
+ 0.474 |
+ 0.341 |
+ 0.503 |
+ RITA medium model (300M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 11 |
+ Progen2 XL |
+ Protein language model |
+ 0.434 |
+ 0.029 |
+ 0.406 |
+ NaN |
+ 0.360 |
+ 0.431 |
+ 0.539 |
+ 0.382 |
+ 0.516 |
+ 0.539 |
+ 0.532 |
+ 0.568 |
+ 0.460 |
+ 0.465 |
+ Progen2 xlarge model (6.4B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 12 |
+ Progen2 S |
+ Protein language model |
+ 0.425 |
+ 0.049 |
+ 0.506 |
+ NaN |
+ 0.365 |
+ 0.387 |
+ 0.444 |
+ 0.242 |
+ 0.381 |
+ 0.514 |
+ 0.499 |
+ 0.466 |
+ 0.355 |
+ 0.325 |
+ Progen2 small model (150M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 13 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.412 |
+ 0.046 |
+ 0.420 |
+ NaN |
+ 0.373 |
+ 0.479 |
+ 0.376 |
+ 0.427 |
+ 0.347 |
+ 0.417 |
+ 0.388 |
+ 0.422 |
+ 0.327 |
+ 0.393 |
+ Tranception Small model (85M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 14 |
+ Tranception L |
+ Hybrid model |
+ 0.399 |
+ 0.032 |
+ 0.454 |
+ NaN |
+ 0.436 |
+ 0.456 |
+ 0.251 |
+ 0.434 |
+ 0.148 |
+ 0.384 |
+ 0.335 |
+ 0.287 |
+ 0.174 |
+ 0.273 |
+ Tranception Large model (700M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 15 |
+ RITA S |
+ Protein language model |
+ 0.399 |
+ 0.036 |
+ 0.437 |
+ NaN |
+ 0.350 |
+ 0.414 |
+ 0.395 |
+ 0.334 |
+ 0.341 |
+ 0.455 |
+ 0.435 |
+ 0.414 |
+ 0.309 |
+ 0.406 |
+ RITA small model (85M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 16 |
+ Tranception M |
+ Hybrid model |
+ 0.398 |
+ 0.029 |
+ 0.473 |
+ NaN |
+ 0.402 |
+ 0.530 |
+ 0.188 |
+ 0.509 |
+ 0.116 |
+ 0.311 |
+ 0.294 |
+ 0.250 |
+ 0.153 |
+ 0.110 |
+ Tranception Medium model (300M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 17 |
+ Hidden Markov Model |
+ Alignment-based model |
+ 0.391 |
+ 0.029 |
+ 0.464 |
+ NaN |
+ 0.302 |
+ 0.436 |
+ 0.363 |
+ 0.202 |
+ 0.416 |
+ 0.350 |
+ 0.400 |
+ 0.362 |
+ 0.345 |
+ 0.367 |
+ Profile Hidden Markov model |
+ HMMER: biosequence analysis using profile hidden Markov models |
+
+
+ 18 |
+ TranceptEVE S |
+ Hybrid model |
+ 0.361 |
+ 0.032 |
+ 0.457 |
+ NaN |
+ 0.380 |
+ 0.497 |
+ 0.111 |
+ 0.514 |
+ 0.075 |
+ 0.208 |
+ 0.192 |
+ 0.177 |
+ 0.128 |
+ 0.074 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 19 |
+ Provean |
+ Alignment-based model |
+ 0.351 |
+ 0.029 |
+ 0.315 |
+ NaN |
+ 0.329 |
+ 0.413 |
+ 0.347 |
+ 0.361 |
+ 0.344 |
+ 0.356 |
+ 0.328 |
+ 0.370 |
+ 0.343 |
+ 0.396 |
+ Provean model |
+ Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one. |
+
+
+ 20 |
+ Tranception S |
+ Hybrid model |
+ 0.344 |
+ 0.031 |
+ 0.442 |
+ NaN |
+ 0.358 |
+ 0.488 |
+ 0.089 |
+ 0.505 |
+ 0.059 |
+ 0.183 |
+ 0.169 |
+ 0.154 |
+ 0.109 |
+ 0.073 |
+ Tranception Small model (85M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 21 |
+ Wavenet |
+ Alignment-based model |
+ 0.285 |
+ 0.046 |
+ 0.419 |
+ NaN |
+ 0.234 |
+ 0.346 |
+ 0.141 |
+ 0.170 |
+ 0.247 |
+ 0.108 |
+ 0.057 |
+ 0.250 |
+ 0.186 |
+ 0.358 |
+ Wavenet model |
+ Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12. |
+
+
+ 22 |
+ ProtGPT2 |
+ Protein language model |
+ 0.194 |
+ 0.033 |
+ 0.195 |
+ NaN |
+ 0.238 |
+ 0.099 |
+ 0.242 |
+ 0.233 |
+ 0.192 |
+ 0.263 |
+ 0.295 |
+ 0.240 |
+ 0.155 |
+ 0.104 |
+ ProtGPT2 model |
+ Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13. |
+
+
+ 23 |
+ Unirep |
+ Protein language model |
+ 0.169 |
+ 0.061 |
+ 0.085 |
+ NaN |
+ 0.184 |
+ 0.065 |
+ 0.344 |
+ 0.110 |
+ 0.268 |
+ 0.360 |
+ 0.300 |
+ 0.364 |
+ 0.218 |
+ 0.348 |
+ Unirep model |
+ Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8. |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.csv b/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.csv
new file mode 100644
index 0000000..ffb8dfd
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.csv
@@ -0,0 +1,67 @@
+DMS ID,Unirep,Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,Hidden Markov Model,Provean,Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A4_HUMAN_Seuma_2022_indels,0.243,0.179,0.179,0.145,0.153,0.132,0.157,0.17,0.149,0.17,0.153,0.191,0.183,0.145,0.145,0.191,0.166,0.204,0.179,0.157,0.191,0.0,0.153,2346,Stability,A4_HUMAN,Low,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,0.25,0.667,0.0,0.25,0.583,0.75,0.25,0.583,0.75,0.583,0.667,0.667,0.083,0.0,0.167,0.0,0.083,0.167,0.0,0.083,0.167,0.417,0.75,117,Stability,AMFR_HUMAN,Medium,Human
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,0.105,0.632,0.789,0.684,0.684,0.789,0.316,0.789,0.789,0.789,0.789,0.053,0.526,0.368,0.632,0.316,0.263,0.526,0.316,0.263,0.526,0.368,0.474,181,Stability,ARGR_ECOLI,Medium,Prokaryote
+B1LPA6_ECOSM_Russ_2020_indels,0.237,0.24,0.157,0.151,0.157,0.169,0.194,0.215,0.203,0.197,0.194,0.172,0.16,0.215,0.16,0.24,0.255,0.209,0.237,0.246,0.218,0.302,0.182,3074,Activity,B1LPA6_ECOSM,Medium,Prokaryote
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,0.214,0.357,0.214,0.286,0.429,0.571,0.143,0.643,0.643,0.643,0.714,0.143,0.143,0.5,0.0,0.143,0.357,0.214,0.286,0.357,0.214,0.143,0.786,134,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,0.0,0.667,0.111,0.333,0.222,0.444,0.222,0.333,0.444,0.667,0.667,0.111,0.222,0.444,0.444,0.222,0.333,0.444,0.222,0.333,0.556,0.222,0.778,82,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Gonzalez_2019_indels,0.166,0.422,0.382,0.42,0.405,0.317,0.359,0.426,0.424,0.424,0.242,0.12,0.418,0.382,0.258,0.366,0.351,0.263,0.374,0.403,0.263,0.174,0.363,4751,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+CAPSD_AAV2S_Sinai_2021_designed_indels,0.013,0.309,0.133,0.139,0.239,0.34,0.011,0.01,0.013,0.01,0.212,0.114,0.152,0.147,0.349,0.328,0.325,0.35,0.333,0.327,0.356,0.316,0.314,225998,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAPSD_AAV2S_Sinai_2021_library_indels,0.02,0.0,0.11,0.186,0.161,0.209,0.037,0.043,0.048,0.051,0.242,0.103,0.113,0.158,0.269,0.27,0.306,0.341,0.275,0.263,0.378,0.001,0.044,24908,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,0.4,0.45,0.45,0.45,0.45,0.45,0.4,0.45,0.45,0.45,0.45,0.15,0.45,0.4,0.45,0.45,0.4,0.4,0.45,0.4,0.4,0.25,0.4,197,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,0.095,0.619,0.095,0.095,0.048,0.19,0.048,0.238,0.238,0.286,0.571,0.048,0.048,0.095,0.095,0.048,0.095,0.095,0.048,0.143,0.095,0.333,0.333,205,Stability,CBPA2_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,0.154,0.154,0.385,0.385,0.462,0.538,0.538,0.385,0.462,0.462,0.538,0.077,0.308,0.308,0.385,0.308,0.231,0.385,0.385,0.231,0.385,0.308,0.538,129,Stability,CBX4_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,0.05,0.6,0.25,0.2,0.6,0.55,0.05,0.65,0.6,0.65,0.6,0.2,0.15,0.45,0.5,0.25,0.35,0.4,0.25,0.35,0.4,0.45,0.55,195,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,0.357,0.714,0.357,0.214,0.286,0.0,0.071,0.071,0.071,0.071,0.571,0.071,0.143,0.143,0.071,0.0,0.0,0.0,0.0,0.0,0.0,0.286,0.857,140,Stability,CUE1_YEAST,Medium,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,0.214,0.5,0.0,0.286,0.214,0.286,0.143,0.071,0.214,0.286,0.286,0.071,0.071,0.0,0.071,0.0,0.0,0.0,0.071,0.0,0.0,0.143,0.643,136,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,0.722,0.444,0.667,0.722,0.722,0.667,0.722,0.722,0.778,0.722,0.722,0.278,0.667,0.722,0.778,0.556,0.611,0.611,0.556,0.611,0.611,0.333,0.5,174,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,0.062,0.438,0.438,0.438,0.562,0.562,0.438,0.5,0.438,0.375,0.562,0.25,0.25,0.188,0.375,0.25,0.25,0.375,0.25,0.25,0.375,0.188,0.562,154,Stability,DOCK1_MOUSE,High,Eukaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,0.211,0.632,0.632,0.789,0.737,0.737,0.632,0.632,0.684,0.684,0.684,0.211,0.632,0.789,0.684,0.368,0.421,0.421,0.421,0.474,0.421,0.316,0.421,185,Stability,EPHB2_HUMAN,High,Human
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,0.25,0.25,0.2,0.4,0.4,0.4,0.1,0.4,0.4,0.35,0.4,0.2,0.0,0.0,0.05,0.0,0.0,0.05,0.0,0.0,0.05,0.4,0.35,193,Stability,FECA_ECOLI,High,Eukaryote
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,0.2,0.467,0.333,0.467,0.4,0.6,0.267,0.267,0.333,0.267,0.333,0.267,0.133,0.133,0.333,0.067,0.133,0.333,0.067,0.067,0.4,0.4,0.533,148,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,0.312,0.562,0.375,0.625,0.625,0.625,0.188,0.688,0.5,0.5,0.438,0.312,0.0,0.125,0.125,0.062,0.25,0.125,0.062,0.25,0.125,0.438,0.438,154,Stability,HECD1_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019_indels,0.031,0.383,0.373,0.345,0.336,0.378,0.295,0.354,0.406,0.38,0.403,0.108,0.303,0.417,0.399,0.357,0.344,0.406,0.37,0.401,0.414,0.345,0.396,6102,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,0.1,0.35,0.3,0.25,0.25,0.3,0.25,0.25,0.25,0.25,0.25,0.65,0.1,0.25,0.25,0.1,0.3,0.25,0.15,0.3,0.25,0.55,0.25,193,Stability,ILF3_HUMAN,High,Human
+KCNJ2_MOUSE_Macdonald_2022_indels,0.147,0.102,0.185,0.143,0.088,0.087,0.219,0.145,0.114,0.167,0.082,0.105,0.168,0.108,0.088,0.134,0.097,0.1,0.129,0.096,0.097,0.166,0.114,10501,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,0.5,0.5,0.25,0.5,0.583,0.583,0.417,0.5,0.583,0.417,0.417,0.25,0.333,0.5,0.5,0.167,0.333,0.417,0.167,0.333,0.417,0.333,0.25,115,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,0.143,0.286,0.143,0.071,0.071,0.143,0.143,0.429,0.286,0.286,0.357,0.214,0.214,0.071,0.0,0.143,0.071,0.071,0.214,0.071,0.0,0.143,0.286,131,Stability,MBD11_ARATH,Medium,Eukaryote
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,0.0,0.375,0.125,0.0,0.25,0.0,0.625,0.375,0.25,0.375,0.75,0.125,0.0,0.0,0.125,0.0,0.0,0.0,0.0,0.125,0.125,0.0,0.375,80,Stability,MYO3_YEAST,High,Eukaryote
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,0.333,0.5,0.333,0.333,0.167,0.222,0.222,0.167,0.333,0.167,0.167,0.278,0.444,0.167,0.111,0.444,0.389,0.333,0.5,0.278,0.278,0.5,0.222,178,Stability,NKX31_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,0.35,0.55,0.7,0.55,0.6,0.6,0.35,0.55,0.25,0.65,0.65,0.05,0.35,0.65,0.3,0.3,0.4,0.4,0.3,0.45,0.4,0.3,0.65,191,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,0.188,0.375,0.5,0.5,0.5,0.5,0.5,0.5,0.5,0.5,0.5,0.375,0.5,0.5,0.5,0.375,0.375,0.312,0.375,0.375,0.312,0.125,0.375,157,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,0.176,0.471,0.471,0.471,0.588,0.588,0.471,0.471,0.529,0.353,0.471,0.235,0.118,0.0,0.176,0.118,0.176,0.118,0.118,0.118,0.059,0.353,0.294,169,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,0.0,0.2,0.2,0.6,0.4,0.6,0.6,0.8,0.8,0.8,0.4,0.0,0.2,0.0,0.2,0.4,0.4,0.2,0.4,0.4,0.2,0.2,0.8,47,Stability,ODP2_GEOSE,High,Prokaryote
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,0.111,0.111,0.222,0.556,0.667,0.778,0.556,0.778,0.778,0.778,0.778,0.556,0.222,0.111,0.333,0.0,0.111,0.333,0.0,0.111,0.333,0.222,0.889,84,Stability,OTU7A_HUMAN,High,Human
+P53_HUMAN_Kotler_2018_indels,0.114,0.171,0.229,0.171,0.143,0.343,0.229,0.314,0.257,0.286,0.371,0.029,0.257,0.343,0.343,0.114,0.4,0.257,0.2,0.286,0.314,0.257,0.257,341,OrganismalFitness,P53_HUMAN,Low,Human
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,0.364,0.455,0.364,0.273,0.455,0.364,0.182,0.273,0.455,0.273,0.455,0.455,0.273,0.364,0.364,0.545,0.636,0.455,0.455,0.636,0.455,0.364,0.364,106,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,0.667,0.417,0.833,0.667,0.667,0.75,0.667,0.75,0.833,0.75,0.75,0.583,0.667,0.75,0.75,0.417,0.333,0.417,0.5,0.417,0.417,0.333,0.583,117,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,0.158,0.316,0.368,0.316,0.368,0.316,0.368,0.316,0.368,0.421,0.368,0.211,0.211,0.263,0.316,0.211,0.368,0.368,0.263,0.316,0.368,0.263,0.421,187,Stability,PKN1_HUMAN,High,Human
+POLG_PESV_Tsuboyama_2023_2MXD_indels,0.067,0.467,0.0,0.0,0.0,0.067,0.067,0.067,0.0,0.0,0.067,0.067,0.067,0.0,0.0,0.067,0.0,0.0,0.133,0.0,0.0,0.133,0.467,149,Stability,POLG_PESV,Medium,Virus
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,0.0,0.706,0.529,0.706,0.765,0.765,0.706,0.824,0.706,0.765,0.765,0.294,0.353,0.294,0.412,0.294,0.353,0.353,0.353,0.412,0.412,0.294,0.353,168,Stability,PR40A_HUMAN,Medium,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE_indels,0.111,0.0,0.111,0.0,0.0,0.222,0.056,0.056,0.0,0.056,0.0,0.222,0.167,0.278,0.0,0.222,0.278,0.056,0.167,0.278,0.056,0.278,0.0,175,Stability,PSAE_SYNP2,Medium,Prokaryote
+PTEN_HUMAN_Mighell_2018_indels,0.0,0.125,0.219,0.125,0.188,0.281,0.25,0.281,0.312,0.094,0.219,0.0,0.25,0.219,0.219,0.188,0.281,0.219,0.156,0.125,0.156,0.219,0.281,314,Activity,PTEN_HUMAN,Medium,Human
+Q8EG35_SHEON_Campbell_2022_indels,0.235,0.0,0.147,0.176,0.088,0.176,0.206,0.118,0.029,0.118,0.059,0.176,0.235,0.235,0.206,0.118,0.147,0.206,0.176,0.206,0.147,0.206,0.118,331,OrganismalFitness,Q8EG35_SHEON,Medium,Prokaryote
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,0.2,0.1,0.2,0.1,0.2,0.3,0.4,0.4,0.2,0.2,0.4,0.3,0.1,0.4,0.3,0.3,0.5,0.4,0.4,0.5,0.4,0.1,0.2,97,Stability,RAD_ANTMA,High,Eukaryote
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,0.077,0.615,0.0,0.077,0.0,0.154,0.077,0.308,0.154,0.231,0.538,0.077,0.0,0.0,0.231,0.0,0.0,0.0,0.0,0.0,0.0,0.308,0.385,124,Stability,RCD1_ARATH,Medium,Eukaryote
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,0.167,0.5,0.583,0.583,0.5,0.833,0.75,0.75,0.667,0.75,0.75,0.667,0.333,0.75,0.667,0.417,0.583,0.583,0.417,0.667,0.583,0.167,0.75,120,Stability,RD23A_HUMAN,High,Human
+RPC1_BP434_Tsuboyama_2023_1R69_indels,0.647,0.353,0.412,0.588,0.471,0.471,0.588,0.471,0.471,0.471,0.471,0.176,0.235,0.529,0.588,0.118,0.294,0.412,0.118,0.353,0.471,0.118,0.412,164,Stability,RPC1_BP434,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,0.111,0.444,0.5,0.5,0.5,0.5,0.389,0.5,0.5,0.5,0.5,0.056,0.5,0.5,0.444,0.333,0.278,0.278,0.389,0.278,0.278,0.278,0.389,176,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance_indels,0.186,0.14,0.302,0.302,0.349,0.349,0.256,0.326,0.349,0.233,0.256,0.163,0.326,0.302,0.279,0.326,0.256,0.256,0.302,0.326,0.256,0.093,0.209,430,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity_indels,0.163,0.102,0.224,0.347,0.347,0.347,0.286,0.327,0.388,0.286,0.306,0.245,0.347,0.306,0.265,0.286,0.265,0.265,0.265,0.327,0.265,0.163,0.224,490,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,0.222,0.222,0.222,0.222,0.222,0.333,0.333,0.222,0.222,0.222,0.111,0.556,0.111,0.111,0.111,0.222,0.111,0.111,0.333,0.111,0.111,0.222,0.111,86,Stability,SAV1_MOUSE,High,Eukaryote
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,0.0,0.615,0.077,0.0,0.077,0.154,0.231,0.385,0.077,0.231,0.538,0.077,0.0,0.0,0.615,0.077,0.077,0.308,0.077,0.077,0.538,0.308,0.692,127,Stability,SDA_BACSU,Medium,Prokaryote
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,0.545,0.273,0.364,0.545,0.636,0.636,0.455,0.636,0.636,0.727,0.727,0.818,0.273,0.727,0.636,0.0,0.364,0.455,0.091,0.455,0.455,0.091,0.727,109,Stability,SOX30_HUMAN,High,Human
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,0.067,0.267,0.067,0.067,0.067,0.133,0.267,0.133,0.067,0.2,0.333,0.067,0.133,0.067,0.4,0.067,0.0,0.133,0.067,0.0,0.267,0.0,0.333,148,Stability,SPG2_STRSG,Medium,Prokaryote
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,0.231,0.154,0.692,0.385,0.692,0.462,0.615,0.615,0.538,0.769,0.769,0.077,0.154,0.154,0.462,0.077,0.154,0.308,0.077,0.077,0.385,0.077,0.077,129,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,0.083,0.333,0.583,0.833,0.75,0.833,0.417,0.667,0.75,0.667,0.667,0.167,0.167,0.667,0.75,0.083,0.5,0.5,0.083,0.5,0.667,0.167,0.833,111,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,0.071,0.429,0.286,0.357,0.214,0.214,0.214,0.143,0.214,0.071,0.214,0.286,0.214,0.0,0.214,0.143,0.214,0.214,0.214,0.143,0.214,0.357,0.357,135,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,0.125,0.5,0.5,0.438,0.438,0.375,0.5,0.375,0.312,0.438,0.438,0.438,0.312,0.5,0.438,0.312,0.5,0.5,0.312,0.5,0.5,0.438,0.375,154,Stability,SRBS1_HUMAN,High,Human
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,0.3,0.6,0.7,0.7,0.8,0.6,0.6,0.7,0.6,0.6,0.7,0.2,0.3,0.4,0.7,0.2,0.3,0.3,0.2,0.3,0.4,0.4,0.6,99,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,0.111,0.556,0.0,0.111,0.222,0.111,0.444,0.333,0.222,0.444,0.556,0.222,0.111,0.444,0.333,0.111,0.556,0.667,0.111,0.667,0.556,0.333,0.778,82,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,0.167,0.389,0.333,0.722,0.722,0.667,0.611,0.667,0.611,0.667,0.667,0.111,0.0,0.222,0.611,0.167,0.222,0.444,0.167,0.222,0.611,0.444,0.556,171,Stability,TNKS2_HUMAN,High,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,0.0,0.467,0.0,0.533,0.533,0.533,0.467,0.533,0.533,0.533,0.533,0.4,0.133,0.067,0.467,0.067,0.067,0.333,0.067,0.067,0.333,0.2,0.467,147,Stability,UBE4B_HUMAN,High,Human
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,0.062,0.5,0.562,0.5,0.625,0.562,0.562,0.625,0.625,0.625,0.625,0.125,0.375,0.5,0.375,0.25,0.375,0.375,0.25,0.438,0.438,0.625,0.438,156,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,0.364,0.364,0.455,0.364,0.455,0.545,0.182,0.273,0.182,0.182,0.182,0.0,0.182,0.364,0.364,0.182,0.273,0.364,0.273,0.364,0.273,0.545,0.182,101,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,0.125,0.688,0.0,0.0,0.0,0.0,0.0,0.125,0.0,0.062,0.5,0.188,0.188,0.562,0.375,0.188,0.5,0.438,0.25,0.562,0.438,0.375,0.688,156,Stability,VILI_CHICK,High,Eukaryote
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,0.25,0.125,0.125,0.25,0.25,0.125,0.312,0.125,0.062,0.188,0.25,0.062,0.125,0.125,0.188,0.188,0.188,0.125,0.188,0.125,0.125,0.125,0.25,154,Stability,VRPI_BPT7,Medium,Virus
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,0.273,0.545,0.273,0.273,0.364,0.364,0.364,0.455,0.182,0.364,0.273,0.273,0.273,0.364,0.273,0.091,0.182,0.273,0.091,0.182,0.273,0.364,0.364,104,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.html b/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.html
new file mode 100644
index 0000000..f2c875b
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Top_recall/DMS_indels_Top_recall_DMS_level.html
@@ -0,0 +1,2083 @@
+
+
+
+ score |
+ Unirep |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ Hidden Markov Model |
+ Provean |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A4_HUMAN_Seuma_2022_indels |
+ 0.243 |
+ 0.179 |
+ 0.179 |
+ 0.145 |
+ 0.153 |
+ 0.132 |
+ 0.157 |
+ 0.170 |
+ 0.149 |
+ 0.170 |
+ 0.153 |
+ 0.191 |
+ 0.183 |
+ 0.145 |
+ 0.145 |
+ 0.191 |
+ 0.166 |
+ 0.204 |
+ 0.179 |
+ 0.157 |
+ 0.191 |
+ 0.000 |
+ 0.153 |
+ 2346 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O_indels |
+ 0.250 |
+ 0.667 |
+ 0.000 |
+ 0.250 |
+ 0.583 |
+ 0.750 |
+ 0.250 |
+ 0.583 |
+ 0.750 |
+ 0.583 |
+ 0.667 |
+ 0.667 |
+ 0.083 |
+ 0.000 |
+ 0.167 |
+ 0.000 |
+ 0.083 |
+ 0.167 |
+ 0.000 |
+ 0.083 |
+ 0.167 |
+ 0.417 |
+ 0.750 |
+ 117 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY_indels |
+ 0.105 |
+ 0.632 |
+ 0.789 |
+ 0.684 |
+ 0.684 |
+ 0.789 |
+ 0.316 |
+ 0.789 |
+ 0.789 |
+ 0.789 |
+ 0.789 |
+ 0.053 |
+ 0.526 |
+ 0.368 |
+ 0.632 |
+ 0.316 |
+ 0.263 |
+ 0.526 |
+ 0.316 |
+ 0.263 |
+ 0.526 |
+ 0.368 |
+ 0.474 |
+ 181 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B1LPA6_ECOSM_Russ_2020_indels |
+ 0.237 |
+ 0.240 |
+ 0.157 |
+ 0.151 |
+ 0.157 |
+ 0.169 |
+ 0.194 |
+ 0.215 |
+ 0.203 |
+ 0.197 |
+ 0.194 |
+ 0.172 |
+ 0.160 |
+ 0.215 |
+ 0.160 |
+ 0.240 |
+ 0.255 |
+ 0.209 |
+ 0.237 |
+ 0.246 |
+ 0.218 |
+ 0.302 |
+ 0.182 |
+ 3074 |
+ Activity |
+ B1LPA6_ECOSM |
+ Medium |
+ Prokaryote |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0_indels |
+ 0.214 |
+ 0.357 |
+ 0.214 |
+ 0.286 |
+ 0.429 |
+ 0.571 |
+ 0.143 |
+ 0.643 |
+ 0.643 |
+ 0.643 |
+ 0.714 |
+ 0.143 |
+ 0.143 |
+ 0.500 |
+ 0.000 |
+ 0.143 |
+ 0.357 |
+ 0.214 |
+ 0.286 |
+ 0.357 |
+ 0.214 |
+ 0.143 |
+ 0.786 |
+ 134 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU_indels |
+ 0.000 |
+ 0.667 |
+ 0.111 |
+ 0.333 |
+ 0.222 |
+ 0.444 |
+ 0.222 |
+ 0.333 |
+ 0.444 |
+ 0.667 |
+ 0.667 |
+ 0.111 |
+ 0.222 |
+ 0.444 |
+ 0.444 |
+ 0.222 |
+ 0.333 |
+ 0.444 |
+ 0.222 |
+ 0.333 |
+ 0.556 |
+ 0.222 |
+ 0.778 |
+ 82 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Gonzalez_2019_indels |
+ 0.166 |
+ 0.422 |
+ 0.382 |
+ 0.420 |
+ 0.405 |
+ 0.317 |
+ 0.359 |
+ 0.426 |
+ 0.424 |
+ 0.424 |
+ 0.242 |
+ 0.120 |
+ 0.418 |
+ 0.382 |
+ 0.258 |
+ 0.366 |
+ 0.351 |
+ 0.263 |
+ 0.374 |
+ 0.403 |
+ 0.263 |
+ 0.174 |
+ 0.363 |
+ 4751 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ CAPSD_AAV2S_Sinai_2021_designed_indels |
+ 0.013 |
+ 0.309 |
+ 0.133 |
+ 0.139 |
+ 0.239 |
+ 0.340 |
+ 0.011 |
+ 0.010 |
+ 0.013 |
+ 0.010 |
+ 0.212 |
+ 0.114 |
+ 0.152 |
+ 0.147 |
+ 0.349 |
+ 0.328 |
+ 0.325 |
+ 0.350 |
+ 0.333 |
+ 0.327 |
+ 0.356 |
+ 0.316 |
+ 0.314 |
+ 225998 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAPSD_AAV2S_Sinai_2021_library_indels |
+ 0.020 |
+ 0.000 |
+ 0.110 |
+ 0.186 |
+ 0.161 |
+ 0.209 |
+ 0.037 |
+ 0.043 |
+ 0.048 |
+ 0.051 |
+ 0.242 |
+ 0.103 |
+ 0.113 |
+ 0.158 |
+ 0.269 |
+ 0.270 |
+ 0.306 |
+ 0.341 |
+ 0.275 |
+ 0.263 |
+ 0.378 |
+ 0.001 |
+ 0.044 |
+ 24908 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI_indels |
+ 0.400 |
+ 0.450 |
+ 0.450 |
+ 0.450 |
+ 0.450 |
+ 0.450 |
+ 0.400 |
+ 0.450 |
+ 0.450 |
+ 0.450 |
+ 0.450 |
+ 0.150 |
+ 0.450 |
+ 0.400 |
+ 0.450 |
+ 0.450 |
+ 0.400 |
+ 0.400 |
+ 0.450 |
+ 0.400 |
+ 0.400 |
+ 0.250 |
+ 0.400 |
+ 197 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels |
+ 0.095 |
+ 0.619 |
+ 0.095 |
+ 0.095 |
+ 0.048 |
+ 0.190 |
+ 0.048 |
+ 0.238 |
+ 0.238 |
+ 0.286 |
+ 0.571 |
+ 0.048 |
+ 0.048 |
+ 0.095 |
+ 0.095 |
+ 0.048 |
+ 0.095 |
+ 0.095 |
+ 0.048 |
+ 0.143 |
+ 0.095 |
+ 0.333 |
+ 0.333 |
+ 205 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28_indels |
+ 0.154 |
+ 0.154 |
+ 0.385 |
+ 0.385 |
+ 0.462 |
+ 0.538 |
+ 0.538 |
+ 0.385 |
+ 0.462 |
+ 0.462 |
+ 0.538 |
+ 0.077 |
+ 0.308 |
+ 0.308 |
+ 0.385 |
+ 0.308 |
+ 0.231 |
+ 0.385 |
+ 0.385 |
+ 0.231 |
+ 0.385 |
+ 0.308 |
+ 0.538 |
+ 129 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM_indels |
+ 0.050 |
+ 0.600 |
+ 0.250 |
+ 0.200 |
+ 0.600 |
+ 0.550 |
+ 0.050 |
+ 0.650 |
+ 0.600 |
+ 0.650 |
+ 0.600 |
+ 0.200 |
+ 0.150 |
+ 0.450 |
+ 0.500 |
+ 0.250 |
+ 0.350 |
+ 0.400 |
+ 0.250 |
+ 0.350 |
+ 0.400 |
+ 0.450 |
+ 0.550 |
+ 195 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX_indels |
+ 0.357 |
+ 0.714 |
+ 0.357 |
+ 0.214 |
+ 0.286 |
+ 0.000 |
+ 0.071 |
+ 0.071 |
+ 0.071 |
+ 0.071 |
+ 0.571 |
+ 0.071 |
+ 0.143 |
+ 0.143 |
+ 0.071 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.286 |
+ 0.857 |
+ 140 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC_indels |
+ 0.214 |
+ 0.500 |
+ 0.000 |
+ 0.286 |
+ 0.214 |
+ 0.286 |
+ 0.143 |
+ 0.071 |
+ 0.214 |
+ 0.286 |
+ 0.286 |
+ 0.071 |
+ 0.071 |
+ 0.000 |
+ 0.071 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.071 |
+ 0.000 |
+ 0.000 |
+ 0.143 |
+ 0.643 |
+ 136 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels |
+ 0.722 |
+ 0.444 |
+ 0.667 |
+ 0.722 |
+ 0.722 |
+ 0.667 |
+ 0.722 |
+ 0.722 |
+ 0.778 |
+ 0.722 |
+ 0.722 |
+ 0.278 |
+ 0.667 |
+ 0.722 |
+ 0.778 |
+ 0.556 |
+ 0.611 |
+ 0.611 |
+ 0.556 |
+ 0.611 |
+ 0.611 |
+ 0.333 |
+ 0.500 |
+ 174 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels |
+ 0.062 |
+ 0.438 |
+ 0.438 |
+ 0.438 |
+ 0.562 |
+ 0.562 |
+ 0.438 |
+ 0.500 |
+ 0.438 |
+ 0.375 |
+ 0.562 |
+ 0.250 |
+ 0.250 |
+ 0.188 |
+ 0.375 |
+ 0.250 |
+ 0.250 |
+ 0.375 |
+ 0.250 |
+ 0.250 |
+ 0.375 |
+ 0.188 |
+ 0.562 |
+ 154 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels |
+ 0.211 |
+ 0.632 |
+ 0.632 |
+ 0.789 |
+ 0.737 |
+ 0.737 |
+ 0.632 |
+ 0.632 |
+ 0.684 |
+ 0.684 |
+ 0.684 |
+ 0.211 |
+ 0.632 |
+ 0.789 |
+ 0.684 |
+ 0.368 |
+ 0.421 |
+ 0.421 |
+ 0.421 |
+ 0.474 |
+ 0.421 |
+ 0.316 |
+ 0.421 |
+ 185 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U_indels |
+ 0.250 |
+ 0.250 |
+ 0.200 |
+ 0.400 |
+ 0.400 |
+ 0.400 |
+ 0.100 |
+ 0.400 |
+ 0.400 |
+ 0.350 |
+ 0.400 |
+ 0.200 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.400 |
+ 0.350 |
+ 193 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Eukaryote |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q_indels |
+ 0.200 |
+ 0.467 |
+ 0.333 |
+ 0.467 |
+ 0.400 |
+ 0.600 |
+ 0.267 |
+ 0.267 |
+ 0.333 |
+ 0.267 |
+ 0.333 |
+ 0.267 |
+ 0.133 |
+ 0.133 |
+ 0.333 |
+ 0.067 |
+ 0.133 |
+ 0.333 |
+ 0.067 |
+ 0.067 |
+ 0.400 |
+ 0.400 |
+ 0.533 |
+ 148 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM_indels |
+ 0.312 |
+ 0.562 |
+ 0.375 |
+ 0.625 |
+ 0.625 |
+ 0.625 |
+ 0.188 |
+ 0.688 |
+ 0.500 |
+ 0.500 |
+ 0.438 |
+ 0.312 |
+ 0.000 |
+ 0.125 |
+ 0.125 |
+ 0.062 |
+ 0.250 |
+ 0.125 |
+ 0.062 |
+ 0.250 |
+ 0.125 |
+ 0.438 |
+ 0.438 |
+ 154 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019_indels |
+ 0.031 |
+ 0.383 |
+ 0.373 |
+ 0.345 |
+ 0.336 |
+ 0.378 |
+ 0.295 |
+ 0.354 |
+ 0.406 |
+ 0.380 |
+ 0.403 |
+ 0.108 |
+ 0.303 |
+ 0.417 |
+ 0.399 |
+ 0.357 |
+ 0.344 |
+ 0.406 |
+ 0.370 |
+ 0.401 |
+ 0.414 |
+ 0.345 |
+ 0.396 |
+ 6102 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33_indels |
+ 0.100 |
+ 0.350 |
+ 0.300 |
+ 0.250 |
+ 0.250 |
+ 0.300 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.650 |
+ 0.100 |
+ 0.250 |
+ 0.250 |
+ 0.100 |
+ 0.300 |
+ 0.250 |
+ 0.150 |
+ 0.300 |
+ 0.250 |
+ 0.550 |
+ 0.250 |
+ 193 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ KCNJ2_MOUSE_Macdonald_2022_indels |
+ 0.147 |
+ 0.102 |
+ 0.185 |
+ 0.143 |
+ 0.088 |
+ 0.087 |
+ 0.219 |
+ 0.145 |
+ 0.114 |
+ 0.167 |
+ 0.082 |
+ 0.105 |
+ 0.168 |
+ 0.108 |
+ 0.088 |
+ 0.134 |
+ 0.097 |
+ 0.100 |
+ 0.129 |
+ 0.096 |
+ 0.097 |
+ 0.166 |
+ 0.114 |
+ 10501 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V_indels |
+ 0.500 |
+ 0.500 |
+ 0.250 |
+ 0.500 |
+ 0.583 |
+ 0.583 |
+ 0.417 |
+ 0.500 |
+ 0.583 |
+ 0.417 |
+ 0.417 |
+ 0.250 |
+ 0.333 |
+ 0.500 |
+ 0.500 |
+ 0.167 |
+ 0.333 |
+ 0.417 |
+ 0.167 |
+ 0.333 |
+ 0.417 |
+ 0.333 |
+ 0.250 |
+ 115 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV_indels |
+ 0.143 |
+ 0.286 |
+ 0.143 |
+ 0.071 |
+ 0.071 |
+ 0.143 |
+ 0.143 |
+ 0.429 |
+ 0.286 |
+ 0.286 |
+ 0.357 |
+ 0.214 |
+ 0.214 |
+ 0.071 |
+ 0.000 |
+ 0.143 |
+ 0.071 |
+ 0.071 |
+ 0.214 |
+ 0.071 |
+ 0.000 |
+ 0.143 |
+ 0.286 |
+ 131 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT_indels |
+ 0.000 |
+ 0.375 |
+ 0.125 |
+ 0.000 |
+ 0.250 |
+ 0.000 |
+ 0.625 |
+ 0.375 |
+ 0.250 |
+ 0.375 |
+ 0.750 |
+ 0.125 |
+ 0.000 |
+ 0.000 |
+ 0.125 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.125 |
+ 0.125 |
+ 0.000 |
+ 0.375 |
+ 80 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R_indels |
+ 0.333 |
+ 0.500 |
+ 0.333 |
+ 0.333 |
+ 0.167 |
+ 0.222 |
+ 0.222 |
+ 0.167 |
+ 0.333 |
+ 0.167 |
+ 0.167 |
+ 0.278 |
+ 0.444 |
+ 0.167 |
+ 0.111 |
+ 0.444 |
+ 0.389 |
+ 0.333 |
+ 0.500 |
+ 0.278 |
+ 0.278 |
+ 0.500 |
+ 0.222 |
+ 178 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL_indels |
+ 0.350 |
+ 0.550 |
+ 0.700 |
+ 0.550 |
+ 0.600 |
+ 0.600 |
+ 0.350 |
+ 0.550 |
+ 0.250 |
+ 0.650 |
+ 0.650 |
+ 0.050 |
+ 0.350 |
+ 0.650 |
+ 0.300 |
+ 0.300 |
+ 0.400 |
+ 0.400 |
+ 0.300 |
+ 0.450 |
+ 0.400 |
+ 0.300 |
+ 0.650 |
+ 191 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6_indels |
+ 0.188 |
+ 0.375 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.375 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.375 |
+ 0.375 |
+ 0.312 |
+ 0.375 |
+ 0.375 |
+ 0.312 |
+ 0.125 |
+ 0.375 |
+ 157 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels |
+ 0.176 |
+ 0.471 |
+ 0.471 |
+ 0.471 |
+ 0.588 |
+ 0.588 |
+ 0.471 |
+ 0.471 |
+ 0.529 |
+ 0.353 |
+ 0.471 |
+ 0.235 |
+ 0.118 |
+ 0.000 |
+ 0.176 |
+ 0.118 |
+ 0.176 |
+ 0.118 |
+ 0.118 |
+ 0.118 |
+ 0.059 |
+ 0.353 |
+ 0.294 |
+ 169 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G_indels |
+ 0.000 |
+ 0.200 |
+ 0.200 |
+ 0.600 |
+ 0.400 |
+ 0.600 |
+ 0.600 |
+ 0.800 |
+ 0.800 |
+ 0.800 |
+ 0.400 |
+ 0.000 |
+ 0.200 |
+ 0.000 |
+ 0.200 |
+ 0.400 |
+ 0.400 |
+ 0.200 |
+ 0.400 |
+ 0.400 |
+ 0.200 |
+ 0.200 |
+ 0.800 |
+ 47 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels |
+ 0.111 |
+ 0.111 |
+ 0.222 |
+ 0.556 |
+ 0.667 |
+ 0.778 |
+ 0.556 |
+ 0.778 |
+ 0.778 |
+ 0.778 |
+ 0.778 |
+ 0.556 |
+ 0.222 |
+ 0.111 |
+ 0.333 |
+ 0.000 |
+ 0.111 |
+ 0.333 |
+ 0.000 |
+ 0.111 |
+ 0.333 |
+ 0.222 |
+ 0.889 |
+ 84 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018_indels |
+ 0.114 |
+ 0.171 |
+ 0.229 |
+ 0.171 |
+ 0.143 |
+ 0.343 |
+ 0.229 |
+ 0.314 |
+ 0.257 |
+ 0.286 |
+ 0.371 |
+ 0.029 |
+ 0.257 |
+ 0.343 |
+ 0.343 |
+ 0.114 |
+ 0.400 |
+ 0.257 |
+ 0.200 |
+ 0.286 |
+ 0.314 |
+ 0.257 |
+ 0.257 |
+ 341 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C_indels |
+ 0.364 |
+ 0.455 |
+ 0.364 |
+ 0.273 |
+ 0.455 |
+ 0.364 |
+ 0.182 |
+ 0.273 |
+ 0.455 |
+ 0.273 |
+ 0.455 |
+ 0.455 |
+ 0.273 |
+ 0.364 |
+ 0.364 |
+ 0.545 |
+ 0.636 |
+ 0.455 |
+ 0.455 |
+ 0.636 |
+ 0.455 |
+ 0.364 |
+ 0.364 |
+ 106 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M_indels |
+ 0.667 |
+ 0.417 |
+ 0.833 |
+ 0.667 |
+ 0.667 |
+ 0.750 |
+ 0.667 |
+ 0.750 |
+ 0.833 |
+ 0.750 |
+ 0.750 |
+ 0.583 |
+ 0.667 |
+ 0.750 |
+ 0.750 |
+ 0.417 |
+ 0.333 |
+ 0.417 |
+ 0.500 |
+ 0.417 |
+ 0.417 |
+ 0.333 |
+ 0.583 |
+ 117 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF_indels |
+ 0.158 |
+ 0.316 |
+ 0.368 |
+ 0.316 |
+ 0.368 |
+ 0.316 |
+ 0.368 |
+ 0.316 |
+ 0.368 |
+ 0.421 |
+ 0.368 |
+ 0.211 |
+ 0.211 |
+ 0.263 |
+ 0.316 |
+ 0.211 |
+ 0.368 |
+ 0.368 |
+ 0.263 |
+ 0.316 |
+ 0.368 |
+ 0.263 |
+ 0.421 |
+ 187 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD_indels |
+ 0.067 |
+ 0.467 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.067 |
+ 0.067 |
+ 0.067 |
+ 0.000 |
+ 0.000 |
+ 0.067 |
+ 0.067 |
+ 0.067 |
+ 0.000 |
+ 0.000 |
+ 0.067 |
+ 0.000 |
+ 0.000 |
+ 0.133 |
+ 0.000 |
+ 0.000 |
+ 0.133 |
+ 0.467 |
+ 149 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC_indels |
+ 0.000 |
+ 0.706 |
+ 0.529 |
+ 0.706 |
+ 0.765 |
+ 0.765 |
+ 0.706 |
+ 0.824 |
+ 0.706 |
+ 0.765 |
+ 0.765 |
+ 0.294 |
+ 0.353 |
+ 0.294 |
+ 0.412 |
+ 0.294 |
+ 0.353 |
+ 0.353 |
+ 0.353 |
+ 0.412 |
+ 0.412 |
+ 0.294 |
+ 0.353 |
+ 168 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE_indels |
+ 0.111 |
+ 0.000 |
+ 0.111 |
+ 0.000 |
+ 0.000 |
+ 0.222 |
+ 0.056 |
+ 0.056 |
+ 0.000 |
+ 0.056 |
+ 0.000 |
+ 0.222 |
+ 0.167 |
+ 0.278 |
+ 0.000 |
+ 0.222 |
+ 0.278 |
+ 0.056 |
+ 0.167 |
+ 0.278 |
+ 0.056 |
+ 0.278 |
+ 0.000 |
+ 175 |
+ Stability |
+ PSAE_SYNP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Mighell_2018_indels |
+ 0.000 |
+ 0.125 |
+ 0.219 |
+ 0.125 |
+ 0.188 |
+ 0.281 |
+ 0.250 |
+ 0.281 |
+ 0.312 |
+ 0.094 |
+ 0.219 |
+ 0.000 |
+ 0.250 |
+ 0.219 |
+ 0.219 |
+ 0.188 |
+ 0.281 |
+ 0.219 |
+ 0.156 |
+ 0.125 |
+ 0.156 |
+ 0.219 |
+ 0.281 |
+ 314 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q8EG35_SHEON_Campbell_2022_indels |
+ 0.235 |
+ 0.000 |
+ 0.147 |
+ 0.176 |
+ 0.088 |
+ 0.176 |
+ 0.206 |
+ 0.118 |
+ 0.029 |
+ 0.118 |
+ 0.059 |
+ 0.176 |
+ 0.235 |
+ 0.235 |
+ 0.206 |
+ 0.118 |
+ 0.147 |
+ 0.206 |
+ 0.176 |
+ 0.206 |
+ 0.147 |
+ 0.206 |
+ 0.118 |
+ 331 |
+ OrganismalFitness |
+ Q8EG35_SHEON |
+ Medium |
+ Prokaryote |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ_indels |
+ 0.200 |
+ 0.100 |
+ 0.200 |
+ 0.100 |
+ 0.200 |
+ 0.300 |
+ 0.400 |
+ 0.400 |
+ 0.200 |
+ 0.200 |
+ 0.400 |
+ 0.300 |
+ 0.100 |
+ 0.400 |
+ 0.300 |
+ 0.300 |
+ 0.500 |
+ 0.400 |
+ 0.400 |
+ 0.500 |
+ 0.400 |
+ 0.100 |
+ 0.200 |
+ 97 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO_indels |
+ 0.077 |
+ 0.615 |
+ 0.000 |
+ 0.077 |
+ 0.000 |
+ 0.154 |
+ 0.077 |
+ 0.308 |
+ 0.154 |
+ 0.231 |
+ 0.538 |
+ 0.077 |
+ 0.000 |
+ 0.000 |
+ 0.231 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.308 |
+ 0.385 |
+ 124 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY_indels |
+ 0.167 |
+ 0.500 |
+ 0.583 |
+ 0.583 |
+ 0.500 |
+ 0.833 |
+ 0.750 |
+ 0.750 |
+ 0.667 |
+ 0.750 |
+ 0.750 |
+ 0.667 |
+ 0.333 |
+ 0.750 |
+ 0.667 |
+ 0.417 |
+ 0.583 |
+ 0.583 |
+ 0.417 |
+ 0.667 |
+ 0.583 |
+ 0.167 |
+ 0.750 |
+ 120 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69_indels |
+ 0.647 |
+ 0.353 |
+ 0.412 |
+ 0.588 |
+ 0.471 |
+ 0.471 |
+ 0.588 |
+ 0.471 |
+ 0.471 |
+ 0.471 |
+ 0.471 |
+ 0.176 |
+ 0.235 |
+ 0.529 |
+ 0.588 |
+ 0.118 |
+ 0.294 |
+ 0.412 |
+ 0.118 |
+ 0.353 |
+ 0.471 |
+ 0.118 |
+ 0.412 |
+ 164 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32_indels |
+ 0.111 |
+ 0.444 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.389 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.500 |
+ 0.056 |
+ 0.500 |
+ 0.500 |
+ 0.444 |
+ 0.333 |
+ 0.278 |
+ 0.278 |
+ 0.389 |
+ 0.278 |
+ 0.278 |
+ 0.278 |
+ 0.389 |
+ 176 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance_indels |
+ 0.186 |
+ 0.140 |
+ 0.302 |
+ 0.302 |
+ 0.349 |
+ 0.349 |
+ 0.256 |
+ 0.326 |
+ 0.349 |
+ 0.233 |
+ 0.256 |
+ 0.163 |
+ 0.326 |
+ 0.302 |
+ 0.279 |
+ 0.326 |
+ 0.256 |
+ 0.256 |
+ 0.302 |
+ 0.326 |
+ 0.256 |
+ 0.093 |
+ 0.209 |
+ 430 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity_indels |
+ 0.163 |
+ 0.102 |
+ 0.224 |
+ 0.347 |
+ 0.347 |
+ 0.347 |
+ 0.286 |
+ 0.327 |
+ 0.388 |
+ 0.286 |
+ 0.306 |
+ 0.245 |
+ 0.347 |
+ 0.306 |
+ 0.265 |
+ 0.286 |
+ 0.265 |
+ 0.265 |
+ 0.265 |
+ 0.327 |
+ 0.265 |
+ 0.163 |
+ 0.224 |
+ 490 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB_indels |
+ 0.222 |
+ 0.222 |
+ 0.222 |
+ 0.222 |
+ 0.222 |
+ 0.333 |
+ 0.333 |
+ 0.222 |
+ 0.222 |
+ 0.222 |
+ 0.111 |
+ 0.556 |
+ 0.111 |
+ 0.111 |
+ 0.111 |
+ 0.222 |
+ 0.111 |
+ 0.111 |
+ 0.333 |
+ 0.111 |
+ 0.111 |
+ 0.222 |
+ 0.111 |
+ 86 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0_indels |
+ 0.000 |
+ 0.615 |
+ 0.077 |
+ 0.000 |
+ 0.077 |
+ 0.154 |
+ 0.231 |
+ 0.385 |
+ 0.077 |
+ 0.231 |
+ 0.538 |
+ 0.077 |
+ 0.000 |
+ 0.000 |
+ 0.615 |
+ 0.077 |
+ 0.077 |
+ 0.308 |
+ 0.077 |
+ 0.077 |
+ 0.538 |
+ 0.308 |
+ 0.692 |
+ 127 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK_indels |
+ 0.545 |
+ 0.273 |
+ 0.364 |
+ 0.545 |
+ 0.636 |
+ 0.636 |
+ 0.455 |
+ 0.636 |
+ 0.636 |
+ 0.727 |
+ 0.727 |
+ 0.818 |
+ 0.273 |
+ 0.727 |
+ 0.636 |
+ 0.000 |
+ 0.364 |
+ 0.455 |
+ 0.091 |
+ 0.455 |
+ 0.455 |
+ 0.091 |
+ 0.727 |
+ 109 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS_indels |
+ 0.067 |
+ 0.267 |
+ 0.067 |
+ 0.067 |
+ 0.067 |
+ 0.133 |
+ 0.267 |
+ 0.133 |
+ 0.067 |
+ 0.200 |
+ 0.333 |
+ 0.067 |
+ 0.133 |
+ 0.067 |
+ 0.400 |
+ 0.067 |
+ 0.000 |
+ 0.133 |
+ 0.067 |
+ 0.000 |
+ 0.267 |
+ 0.000 |
+ 0.333 |
+ 148 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD_indels |
+ 0.231 |
+ 0.154 |
+ 0.692 |
+ 0.385 |
+ 0.692 |
+ 0.462 |
+ 0.615 |
+ 0.615 |
+ 0.538 |
+ 0.769 |
+ 0.769 |
+ 0.077 |
+ 0.154 |
+ 0.154 |
+ 0.462 |
+ 0.077 |
+ 0.154 |
+ 0.308 |
+ 0.077 |
+ 0.077 |
+ 0.385 |
+ 0.077 |
+ 0.077 |
+ 129 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels |
+ 0.083 |
+ 0.333 |
+ 0.583 |
+ 0.833 |
+ 0.750 |
+ 0.833 |
+ 0.417 |
+ 0.667 |
+ 0.750 |
+ 0.667 |
+ 0.667 |
+ 0.167 |
+ 0.167 |
+ 0.667 |
+ 0.750 |
+ 0.083 |
+ 0.500 |
+ 0.500 |
+ 0.083 |
+ 0.500 |
+ 0.667 |
+ 0.167 |
+ 0.833 |
+ 111 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88_indels |
+ 0.071 |
+ 0.429 |
+ 0.286 |
+ 0.357 |
+ 0.214 |
+ 0.214 |
+ 0.214 |
+ 0.143 |
+ 0.214 |
+ 0.071 |
+ 0.214 |
+ 0.286 |
+ 0.214 |
+ 0.000 |
+ 0.214 |
+ 0.143 |
+ 0.214 |
+ 0.214 |
+ 0.214 |
+ 0.143 |
+ 0.214 |
+ 0.357 |
+ 0.357 |
+ 135 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels |
+ 0.125 |
+ 0.500 |
+ 0.500 |
+ 0.438 |
+ 0.438 |
+ 0.375 |
+ 0.500 |
+ 0.375 |
+ 0.312 |
+ 0.438 |
+ 0.438 |
+ 0.438 |
+ 0.312 |
+ 0.500 |
+ 0.438 |
+ 0.312 |
+ 0.500 |
+ 0.500 |
+ 0.312 |
+ 0.500 |
+ 0.500 |
+ 0.438 |
+ 0.375 |
+ 154 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels |
+ 0.300 |
+ 0.600 |
+ 0.700 |
+ 0.700 |
+ 0.800 |
+ 0.600 |
+ 0.600 |
+ 0.700 |
+ 0.600 |
+ 0.600 |
+ 0.700 |
+ 0.200 |
+ 0.300 |
+ 0.400 |
+ 0.700 |
+ 0.200 |
+ 0.300 |
+ 0.300 |
+ 0.200 |
+ 0.300 |
+ 0.400 |
+ 0.400 |
+ 0.600 |
+ 99 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG_indels |
+ 0.111 |
+ 0.556 |
+ 0.000 |
+ 0.111 |
+ 0.222 |
+ 0.111 |
+ 0.444 |
+ 0.333 |
+ 0.222 |
+ 0.444 |
+ 0.556 |
+ 0.222 |
+ 0.111 |
+ 0.444 |
+ 0.333 |
+ 0.111 |
+ 0.556 |
+ 0.667 |
+ 0.111 |
+ 0.667 |
+ 0.556 |
+ 0.333 |
+ 0.778 |
+ 82 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels |
+ 0.167 |
+ 0.389 |
+ 0.333 |
+ 0.722 |
+ 0.722 |
+ 0.667 |
+ 0.611 |
+ 0.667 |
+ 0.611 |
+ 0.667 |
+ 0.667 |
+ 0.111 |
+ 0.000 |
+ 0.222 |
+ 0.611 |
+ 0.167 |
+ 0.222 |
+ 0.444 |
+ 0.167 |
+ 0.222 |
+ 0.611 |
+ 0.444 |
+ 0.556 |
+ 171 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels |
+ 0.000 |
+ 0.467 |
+ 0.000 |
+ 0.533 |
+ 0.533 |
+ 0.533 |
+ 0.467 |
+ 0.533 |
+ 0.533 |
+ 0.533 |
+ 0.533 |
+ 0.400 |
+ 0.133 |
+ 0.067 |
+ 0.467 |
+ 0.067 |
+ 0.067 |
+ 0.333 |
+ 0.067 |
+ 0.067 |
+ 0.333 |
+ 0.200 |
+ 0.467 |
+ 147 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T_indels |
+ 0.062 |
+ 0.500 |
+ 0.562 |
+ 0.500 |
+ 0.625 |
+ 0.562 |
+ 0.562 |
+ 0.625 |
+ 0.625 |
+ 0.625 |
+ 0.625 |
+ 0.125 |
+ 0.375 |
+ 0.500 |
+ 0.375 |
+ 0.250 |
+ 0.375 |
+ 0.375 |
+ 0.250 |
+ 0.438 |
+ 0.438 |
+ 0.625 |
+ 0.438 |
+ 156 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8_indels |
+ 0.364 |
+ 0.364 |
+ 0.455 |
+ 0.364 |
+ 0.455 |
+ 0.545 |
+ 0.182 |
+ 0.273 |
+ 0.182 |
+ 0.182 |
+ 0.182 |
+ 0.000 |
+ 0.182 |
+ 0.364 |
+ 0.364 |
+ 0.182 |
+ 0.273 |
+ 0.364 |
+ 0.273 |
+ 0.364 |
+ 0.273 |
+ 0.545 |
+ 0.182 |
+ 101 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5_indels |
+ 0.125 |
+ 0.688 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.125 |
+ 0.000 |
+ 0.062 |
+ 0.500 |
+ 0.188 |
+ 0.188 |
+ 0.562 |
+ 0.375 |
+ 0.188 |
+ 0.500 |
+ 0.438 |
+ 0.250 |
+ 0.562 |
+ 0.438 |
+ 0.375 |
+ 0.688 |
+ 156 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM_indels |
+ 0.250 |
+ 0.125 |
+ 0.125 |
+ 0.250 |
+ 0.250 |
+ 0.125 |
+ 0.312 |
+ 0.125 |
+ 0.062 |
+ 0.188 |
+ 0.250 |
+ 0.062 |
+ 0.125 |
+ 0.125 |
+ 0.188 |
+ 0.188 |
+ 0.188 |
+ 0.125 |
+ 0.188 |
+ 0.125 |
+ 0.125 |
+ 0.125 |
+ 0.250 |
+ 154 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD_indels |
+ 0.273 |
+ 0.545 |
+ 0.273 |
+ 0.273 |
+ 0.364 |
+ 0.364 |
+ 0.364 |
+ 0.455 |
+ 0.182 |
+ 0.364 |
+ 0.273 |
+ 0.273 |
+ 0.273 |
+ 0.364 |
+ 0.273 |
+ 0.091 |
+ 0.182 |
+ 0.273 |
+ 0.091 |
+ 0.182 |
+ 0.273 |
+ 0.364 |
+ 0.364 |
+ 104 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.csv b/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.csv
new file mode 100644
index 0000000..a30d174
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.csv
@@ -0,0 +1,24 @@
+Model_rank,Model_name,Model type,Average_Top_recall,Bootstrap_standard_error_Top_recall,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Model details,References
+1,RITA XL,Protein language model,0.304,0.0,0.266,,0.218,0.298,0.436,0.25,0.375,0.461,0.513,0.337,0.375,0.347,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+2,Progen2 M,Protein language model,0.299,0.016,0.274,,0.236,0.248,0.439,0.17,0.376,0.467,0.49,0.401,0.381,0.205,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+3,Progen2 Base,Protein language model,0.294,0.015,0.301,,0.231,0.229,0.413,0.146,0.344,0.451,0.506,0.357,0.32,0.18,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+4,Progen2 XL,Protein language model,0.29,0.013,0.24,,0.169,0.26,0.491,0.25,0.42,0.505,0.516,0.488,0.388,0.255,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+5,Provean,Alignment-based model,0.28,0.02,0.229,,0.162,0.263,0.466,0.196,0.424,0.463,0.441,0.448,0.44,0.337,Provean model,"Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one."
+6,RITA L,Protein language model,0.274,0.012,0.231,,0.218,0.234,0.412,0.165,0.335,0.45,0.475,0.358,0.306,0.296,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+7,Progen2 L,Protein language model,0.269,0.026,0.192,,0.2,0.248,0.435,0.162,0.36,0.468,0.481,0.372,0.413,0.19,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+8,RITA M,Protein language model,0.265,0.018,0.208,,0.222,0.255,0.377,0.16,0.307,0.417,0.439,0.292,0.324,0.305,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+9,Progen2 S,Protein language model,0.265,0.025,0.243,,0.238,0.223,0.357,0.137,0.262,0.428,0.419,0.3,0.3,0.24,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+10,Tranception M no retrieval,Protein language model,0.264,0.014,0.247,,0.205,0.306,0.299,0.214,0.242,0.349,0.334,0.276,0.286,0.217,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+11,Tranception L no retrieval,Protein language model,0.262,0.011,0.215,,0.184,0.303,0.349,0.266,0.299,0.374,0.38,0.302,0.322,0.297,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+12,TranceptEVE M,Hybrid model,0.259,0.017,0.233,,0.211,0.318,0.276,0.246,0.203,0.342,0.318,0.262,0.249,0.201,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+13,Tranception M,Hybrid model,0.256,0.016,0.267,,0.176,0.312,0.267,0.294,0.201,0.326,0.316,0.252,0.238,0.201,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,RITA S,Protein language model,0.255,0.018,0.2,,0.244,0.25,0.325,0.176,0.261,0.37,0.367,0.283,0.287,0.241,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+15,TranceptEVE L,Hybrid model,0.251,0.017,0.213,,0.176,0.301,0.315,0.291,0.256,0.351,0.342,0.283,0.288,0.273,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+16,Tranception L,Hybrid model,0.25,0.015,0.231,,0.178,0.296,0.297,0.269,0.23,0.348,0.336,0.269,0.258,0.263,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+17,Tranception S no retrieval,Protein language model,0.249,0.016,0.252,,0.247,0.269,0.226,0.191,0.201,0.259,0.274,0.175,0.268,0.146,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+18,TranceptEVE S,Hybrid model,0.234,0.018,0.219,,0.216,0.285,0.215,0.228,0.164,0.269,0.247,0.189,0.233,0.181,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+19,Wavenet,Alignment-based model,0.232,0.032,0.156,,0.121,0.226,0.426,0.168,0.428,0.383,0.401,0.408,0.39,0.322,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+20,Tranception S,Hybrid model,0.228,0.018,0.238,,0.23,0.251,0.193,0.201,0.155,0.239,0.228,0.161,0.223,0.154,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+21,Hidden Markov Model,Alignment-based model,0.217,0.037,0.228,,0.13,0.228,0.281,0.138,0.282,0.275,0.316,0.252,0.233,0.247,Profile Hidden Markov model,HMMER: biosequence analysis using profile hidden Markov models
+22,ProtGPT2,Protein language model,0.155,0.029,0.139,,0.134,0.108,0.24,0.11,0.163,0.287,0.327,0.194,0.13,0.113,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+23,Unirep,Protein language model,0.154,0.034,0.133,,0.166,0.112,0.206,0.124,0.154,0.238,0.219,0.179,0.147,0.257,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
diff --git a/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.html b/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.html
new file mode 100644
index 0000000..e0453d6
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/indels/Top_recall/Summary_performance_DMS_indels_Top_recall.html
@@ -0,0 +1,531 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_Top_recall |
+ Bootstrap_standard_error_Top_recall |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ RITA XL |
+ Protein language model |
+ 0.303 |
+ 0.038 |
+ 0.260 |
+ NaN |
+ 0.207 |
+ 0.301 |
+ 0.443 |
+ 0.253 |
+ 0.385 |
+ 0.463 |
+ 0.512 |
+ 0.346 |
+ 0.389 |
+ 0.347 |
+ RITA xlarge model (1.2B params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 2 |
+ Progen2 M |
+ Protein language model |
+ 0.297 |
+ 0.036 |
+ 0.277 |
+ NaN |
+ 0.223 |
+ 0.250 |
+ 0.438 |
+ 0.174 |
+ 0.372 |
+ 0.469 |
+ 0.490 |
+ 0.409 |
+ 0.367 |
+ 0.205 |
+ Progen2 medium model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 3 |
+ TranceptEVE L |
+ Hybrid model |
+ 0.292 |
+ 0.012 |
+ 0.348 |
+ NaN |
+ 0.176 |
+ 0.304 |
+ 0.338 |
+ 0.294 |
+ 0.296 |
+ 0.368 |
+ 0.356 |
+ 0.316 |
+ 0.345 |
+ 0.256 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 4 |
+ TranceptEVE M |
+ Hybrid model |
+ 0.291 |
+ 0.000 |
+ 0.354 |
+ NaN |
+ 0.199 |
+ 0.321 |
+ 0.292 |
+ 0.249 |
+ 0.238 |
+ 0.348 |
+ 0.315 |
+ 0.295 |
+ 0.298 |
+ 0.195 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. |
+
+
+ 5 |
+ Progen2 XL |
+ Protein language model |
+ 0.291 |
+ 0.033 |
+ 0.242 |
+ NaN |
+ 0.169 |
+ 0.251 |
+ 0.502 |
+ 0.235 |
+ 0.437 |
+ 0.508 |
+ 0.512 |
+ 0.499 |
+ 0.417 |
+ 0.255 |
+ Progen2 xlarge model (6.4B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 6 |
+ Progen2 Base |
+ Protein language model |
+ 0.290 |
+ 0.039 |
+ 0.297 |
+ NaN |
+ 0.231 |
+ 0.231 |
+ 0.402 |
+ 0.148 |
+ 0.336 |
+ 0.439 |
+ 0.492 |
+ 0.356 |
+ 0.302 |
+ 0.180 |
+ Progen2 base model (760M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 7 |
+ Provean |
+ Alignment-based model |
+ 0.285 |
+ 0.037 |
+ 0.233 |
+ NaN |
+ 0.162 |
+ 0.258 |
+ 0.486 |
+ 0.199 |
+ 0.441 |
+ 0.483 |
+ 0.446 |
+ 0.475 |
+ 0.463 |
+ 0.358 |
+ Provean model |
+ Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one. |
+
+
+ 8 |
+ RITA L |
+ Protein language model |
+ 0.275 |
+ 0.035 |
+ 0.232 |
+ NaN |
+ 0.218 |
+ 0.236 |
+ 0.413 |
+ 0.165 |
+ 0.333 |
+ 0.455 |
+ 0.476 |
+ 0.356 |
+ 0.309 |
+ 0.306 |
+ RITA large model (680M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 9 |
+ Progen2 L |
+ Protein language model |
+ 0.271 |
+ 0.031 |
+ 0.195 |
+ NaN |
+ 0.200 |
+ 0.250 |
+ 0.439 |
+ 0.165 |
+ 0.370 |
+ 0.466 |
+ 0.478 |
+ 0.389 |
+ 0.409 |
+ 0.190 |
+ Progen2 large model (2.7B params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 10 |
+ RITA M |
+ Protein language model |
+ 0.268 |
+ 0.033 |
+ 0.210 |
+ NaN |
+ 0.222 |
+ 0.251 |
+ 0.389 |
+ 0.151 |
+ 0.308 |
+ 0.437 |
+ 0.444 |
+ 0.308 |
+ 0.338 |
+ 0.305 |
+ RITA medium model (300M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 11 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.266 |
+ 0.034 |
+ 0.217 |
+ NaN |
+ 0.184 |
+ 0.306 |
+ 0.357 |
+ 0.269 |
+ 0.316 |
+ 0.373 |
+ 0.388 |
+ 0.300 |
+ 0.349 |
+ 0.287 |
+ Tranception Large model (700M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 12 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.265 |
+ 0.031 |
+ 0.250 |
+ NaN |
+ 0.205 |
+ 0.303 |
+ 0.302 |
+ 0.207 |
+ 0.247 |
+ 0.351 |
+ 0.329 |
+ 0.285 |
+ 0.294 |
+ 0.217 |
+ Tranception Medium model (300M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 13 |
+ Progen2 S |
+ Protein language model |
+ 0.263 |
+ 0.038 |
+ 0.245 |
+ NaN |
+ 0.238 |
+ 0.213 |
+ 0.358 |
+ 0.129 |
+ 0.260 |
+ 0.430 |
+ 0.412 |
+ 0.306 |
+ 0.302 |
+ 0.240 |
+ Progen2 small model (150M params) |
+ Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. |
+
+
+ 14 |
+ Tranception M |
+ Hybrid model |
+ 0.259 |
+ 0.030 |
+ 0.271 |
+ NaN |
+ 0.176 |
+ 0.303 |
+ 0.287 |
+ 0.278 |
+ 0.229 |
+ 0.335 |
+ 0.318 |
+ 0.284 |
+ 0.258 |
+ 0.210 |
+ Tranception Medium model (300M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 15 |
+ Tranception L |
+ Hybrid model |
+ 0.257 |
+ 0.030 |
+ 0.229 |
+ NaN |
+ 0.178 |
+ 0.298 |
+ 0.323 |
+ 0.272 |
+ 0.267 |
+ 0.360 |
+ 0.353 |
+ 0.293 |
+ 0.300 |
+ 0.258 |
+ Tranception Large model (700M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 16 |
+ RITA S |
+ Protein language model |
+ 0.252 |
+ 0.035 |
+ 0.195 |
+ NaN |
+ 0.244 |
+ 0.247 |
+ 0.323 |
+ 0.167 |
+ 0.257 |
+ 0.371 |
+ 0.363 |
+ 0.282 |
+ 0.287 |
+ 0.241 |
+ RITA small model (85M params) |
+ Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789. |
+
+
+ 17 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.245 |
+ 0.037 |
+ 0.255 |
+ NaN |
+ 0.247 |
+ 0.260 |
+ 0.220 |
+ 0.184 |
+ 0.195 |
+ 0.253 |
+ 0.260 |
+ 0.173 |
+ 0.266 |
+ 0.146 |
+ Tranception Small model (85M params) without retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 18 |
+ TranceptEVE S |
+ Hybrid model |
+ 0.235 |
+ 0.025 |
+ 0.216 |
+ NaN |
+ 0.216 |
+ 0.275 |
+ 0.234 |
+ 0.220 |
+ 0.183 |
+ 0.284 |
+ 0.255 |
+ 0.213 |
+ 0.259 |
+ 0.175 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 19 |
+ Tranception S |
+ Hybrid model |
+ 0.231 |
+ 0.027 |
+ 0.234 |
+ NaN |
+ 0.230 |
+ 0.252 |
+ 0.209 |
+ 0.203 |
+ 0.173 |
+ 0.250 |
+ 0.232 |
+ 0.182 |
+ 0.247 |
+ 0.154 |
+ Tranception Small model (85M params) with retrieval |
+ Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. |
+
+
+ 20 |
+ Hidden Markov Model |
+ Alignment-based model |
+ 0.217 |
+ 0.036 |
+ 0.229 |
+ NaN |
+ 0.130 |
+ 0.219 |
+ 0.289 |
+ 0.122 |
+ 0.296 |
+ 0.276 |
+ 0.311 |
+ 0.262 |
+ 0.253 |
+ 0.256 |
+ Profile Hidden Markov model |
+ HMMER: biosequence analysis using profile hidden Markov models |
+
+
+ 21 |
+ Wavenet |
+ Alignment-based model |
+ 0.188 |
+ 0.030 |
+ 0.155 |
+ NaN |
+ 0.121 |
+ 0.227 |
+ 0.250 |
+ 0.127 |
+ 0.300 |
+ 0.200 |
+ 0.133 |
+ 0.322 |
+ 0.279 |
+ 0.326 |
+ Wavenet model |
+ Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12. |
+
+
+ 22 |
+ ProtGPT2 |
+ Protein language model |
+ 0.154 |
+ 0.030 |
+ 0.138 |
+ NaN |
+ 0.134 |
+ 0.110 |
+ 0.237 |
+ 0.110 |
+ 0.164 |
+ 0.280 |
+ 0.327 |
+ 0.185 |
+ 0.130 |
+ 0.113 |
+ ProtGPT2 model |
+ Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13. |
+
+
+ 23 |
+ Unirep |
+ Protein language model |
+ 0.153 |
+ 0.027 |
+ 0.135 |
+ NaN |
+ 0.166 |
+ 0.115 |
+ 0.197 |
+ 0.126 |
+ 0.142 |
+ 0.234 |
+ 0.212 |
+ 0.170 |
+ 0.139 |
+ 0.257 |
+ Unirep model |
+ Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8. |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.csv b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.csv
new file mode 100644
index 0000000..30ab73d
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.csv
@@ -0,0 +1,217 @@
+DMS ID,Site-Independent,EVmutation,DeepSequence (single),DeepSequence (ensemble),EVE (single),EVE (ensemble),Unirep,Unirep evotuned,MSA Transformer (single),MSA Transformer (ensemble),ESM-1b,ESM-1v (single),ESM-1v (ensemble),ESM2 (8M),ESM2 (35M),ESM2 (150M),ESM2 (650M),ESM2 (3B),ESM2 (15B),Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,GEMME,VESPA,VESPAl,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,CARP (38M),CARP (600K),CARP (640M),CARP (76M),MIF,MIF-ST,ESM-IF1,ProteinMPNN,ProtSSN (k=10 h=512),ProtSSN (k=10 h=768),ProtSSN (k=10 h=1280),ProtSSN (k=20 h=512),ProtSSN (k=20 h=768),ProtSSN (k=20 h=1280),ProtSSN (k=30 h=512),ProtSSN (k=30 h=768),ProtSSN (k=30 h=1280),ProtSSN (ensemble),SaProt (650M),SaProt (35M),Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A0A140D2T1_ZIKV_Sourisseau_2019,0.69,0.674,0.56,0.544,0.692,0.698,0.426,0.532,0.718,0.721,0.485,0.467,0.494,0.454,0.464,0.46,0.597,0.678,0.685,0.612,0.674,0.651,0.656,0.651,0.658,0.669,0.66,0.653,0.644,0.715,0.653,0.64,0.499,0.648,0.659,0.633,0.677,0.676,0.672,0.675,0.668,0.682,0.459,0.453,0.555,0.459,0.637,0.642,0.642,0.565,0.618,0.617,0.633,0.642,0.631,0.628,0.623,0.628,0.626,0.631,0.598,0.556,9576,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+A0A192B1T2_9HIV1_Haddox_2018,0.761,0.724,0.727,0.739,0.776,0.779,0.502,0.771,0.776,0.778,0.744,0.767,0.778,0.502,0.51,0.519,0.539,0.562,0.578,0.752,0.767,0.775,0.775,0.774,0.769,0.77,0.754,0.765,0.767,0.769,0.796,0.779,0.681,0.763,0.764,0.778,0.774,0.775,0.779,0.783,0.782,0.786,0.721,0.492,0.767,0.73,0.685,0.748,0.629,0.571,0.608,0.626,0.638,0.644,0.616,0.625,0.627,0.61,0.599,0.631,0.592,0.551,12577,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+A0A1I9GEU1_NEIME_Kennouche_2019,0.491,0.518,0.556,0.547,0.528,0.528,0.488,0.537,0.541,0.537,0.521,0.521,0.525,0.477,0.468,0.49,0.516,0.512,0.516,0.538,0.491,0.518,0.531,0.54,0.517,0.537,0.537,0.539,0.542,0.524,0.51,0.504,0.513,0.521,0.527,0.55,0.516,0.521,0.531,0.528,0.53,0.538,0.473,0.47,0.524,0.475,0.516,0.532,0.513,0.517,0.528,0.524,0.522,0.517,0.52,0.529,0.519,0.53,0.523,0.525,0.517,0.491,922,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+A0A247D711_LISMN_Stadelmann_2021,0.727,0.733,0.568,0.534,0.716,0.717,0.515,0.526,0.655,0.665,0.556,0.558,0.558,0.542,0.553,0.564,0.549,0.557,0.555,0.534,0.518,0.516,0.535,0.547,0.501,0.543,0.514,0.535,0.561,0.73,0.668,0.672,0.537,0.532,0.522,0.538,0.661,0.657,0.666,0.636,0.631,0.641,0.536,0.514,0.55,0.556,0.713,0.724,0.738,0.683,0.64,0.668,0.656,0.673,0.663,0.681,0.657,0.669,0.632,0.67,0.718,0.645,1653,Activity,A0A247D711_LISMN,High,Prokaryote
+A0A2Z5U3Z0_9INFA_Doud_2016,0.745,0.744,0.751,0.769,0.782,0.783,0.499,0.758,0.755,0.755,0.559,0.769,0.786,0.495,0.504,0.529,0.766,0.759,0.758,0.708,0.742,0.777,0.768,0.782,0.721,0.769,0.765,0.757,0.762,0.774,0.76,0.715,0.54,0.741,0.771,0.77,0.767,0.783,0.777,0.787,0.797,0.796,0.502,0.5,0.673,0.507,0.723,0.751,0.739,0.591,0.758,0.752,0.781,0.777,0.778,0.778,0.784,0.776,0.788,0.785,0.605,0.579,10715,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A0A2Z5U3Z0_9INFA_Wu_2014,0.753,0.766,0.734,0.749,0.769,0.777,0.497,0.725,0.754,0.761,0.579,0.741,0.766,0.52,0.531,0.532,0.742,0.752,0.754,0.735,0.729,0.76,0.755,0.768,0.681,0.738,0.747,0.731,0.748,0.767,0.731,0.697,0.547,0.73,0.758,0.771,0.766,0.781,0.784,0.787,0.794,0.794,0.519,0.52,0.626,0.528,0.688,0.7,0.689,0.576,0.733,0.726,0.743,0.74,0.741,0.745,0.75,0.743,0.744,0.75,0.607,0.588,2350,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A4_HUMAN_Seuma_2022,0.714,0.707,0.735,0.722,0.673,0.676,0.693,0.577,0.673,0.679,0.666,0.666,0.711,0.709,0.7,0.703,0.736,0.758,0.688,0.632,0.672,0.645,0.666,0.659,0.718,0.654,0.667,0.664,0.662,0.757,0.622,0.588,0.766,0.713,0.639,0.692,0.74,0.695,0.743,0.739,0.716,0.728,0.703,0.699,0.73,0.696,0.65,0.697,0.405,0.527,0.739,0.734,0.726,0.72,0.722,0.703,0.73,0.687,0.698,0.725,0.722,0.709,14811,Stability,A4_HUMAN,Low,Human
+A4D664_9INFA_Soh_2019,0.714,0.681,0.713,0.711,0.715,0.714,0.513,0.692,0.668,0.663,0.521,0.512,0.514,0.51,0.509,0.513,0.57,0.603,0.636,0.635,0.679,0.705,0.712,0.708,0.533,0.643,0.655,0.634,0.676,0.733,0.666,0.658,0.525,0.686,0.697,0.711,0.694,0.701,0.704,0.73,0.732,0.741,0.51,0.499,0.539,0.501,0.64,0.633,0.566,0.559,0.592,0.589,0.596,0.609,0.605,0.603,0.6,0.6,0.597,0.602,0.584,0.551,14421,OrganismalFitness,A4D664_9INFA,Medium,Virus
+A4GRB6_PSEAI_Chen_2020,0.673,0.772,0.855,0.857,0.838,0.847,0.697,0.781,0.847,0.869,0.875,0.853,0.867,0.736,0.791,0.858,0.905,0.891,0.838,0.806,0.716,0.792,0.805,0.836,0.785,0.835,0.845,0.843,0.874,0.865,0.908,0.871,0.627,0.73,0.802,0.84,0.806,0.824,0.856,0.858,0.861,0.87,0.767,0.547,0.865,0.83,0.845,0.879,0.837,0.723,0.89,0.877,0.875,0.878,0.885,0.887,0.889,0.892,0.893,0.895,0.887,0.772,5004,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+AACC1_PSEAI_Dandage_2018,0.642,0.744,0.665,0.715,0.752,0.753,0.596,0.605,0.753,0.755,0.707,0.741,0.748,0.6,0.621,0.621,0.748,0.754,0.76,0.701,0.597,0.613,0.635,0.652,0.637,0.715,0.704,0.713,0.724,0.739,0.757,0.733,0.509,0.628,0.622,0.708,0.72,0.714,0.736,0.743,0.741,0.753,0.609,0.579,0.682,0.619,0.632,0.695,0.68,0.588,0.748,0.75,0.747,0.747,0.752,0.746,0.749,0.743,0.749,0.754,0.735,0.636,1801,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+ACE2_HUMAN_Chan_2020,0.611,0.588,0.553,0.562,0.593,0.586,0.472,0.502,0.592,0.599,0.612,0.555,0.57,0.483,0.522,0.603,0.606,0.562,0.554,0.595,0.499,0.527,0.558,0.589,0.481,0.545,0.574,0.572,0.616,0.576,0.547,0.534,0.534,0.501,0.527,0.526,0.586,0.565,0.563,0.58,0.564,0.564,0.497,0.516,0.623,0.528,0.694,0.66,0.675,0.559,0.603,0.625,0.608,0.607,0.612,0.621,0.603,0.602,0.602,0.611,0.656,0.591,2223,Binding,ACE2_HUMAN,Medium,Human
+ADRB2_HUMAN_Jones_2020,0.672,0.718,0.759,0.763,0.754,0.764,0.738,0.755,0.771,0.771,0.773,0.77,0.775,0.712,0.729,0.745,0.755,0.76,0.766,0.768,0.763,0.768,0.765,0.754,0.77,0.772,0.776,0.773,0.758,0.779,0.759,0.715,0.649,0.773,0.771,0.759,0.773,0.777,0.774,0.773,0.775,0.774,0.745,0.594,0.771,0.764,0.716,0.748,0.736,0.616,0.752,0.753,0.761,0.762,0.759,0.759,0.757,0.764,0.756,0.764,0.788,0.768,7800,Activity,ADRB2_HUMAN,Medium,Human
+AICDA_HUMAN_Gajula_2014_3cycles,0.47,0.697,0.728,0.74,0.727,0.74,0.361,0.622,0.672,0.709,0.68,0.665,0.714,0.339,0.373,0.572,0.65,0.649,0.621,0.776,0.392,0.557,0.708,0.696,0.448,0.672,0.713,0.704,0.631,0.635,0.71,0.711,0.563,0.471,0.496,0.66,0.589,0.591,0.682,0.716,0.711,0.737,0.368,0.368,0.657,0.568,0.735,0.717,0.719,0.689,0.672,0.716,0.653,0.733,0.658,0.659,0.659,0.66,0.644,0.667,0.651,0.486,209,Activity,AICDA_HUMAN,Medium,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O,0.667,0.692,0.693,0.687,0.676,0.682,0.33,0.535,0.588,0.598,0.67,0.542,0.699,0.394,0.639,0.636,0.609,0.628,0.61,0.504,0.485,0.385,0.44,0.447,0.317,0.419,0.444,0.416,0.494,0.682,0.61,0.598,0.524,0.356,0.428,0.334,0.684,0.688,0.661,0.697,0.695,0.669,0.409,0.407,0.556,0.533,0.603,0.534,0.689,0.65,0.705,0.642,0.576,0.657,0.649,0.654,0.653,0.648,0.66,0.651,0.652,0.674,2972,Stability,AMFR_HUMAN,Medium,Human
+AMIE_PSEAE_Wrenbeck_2017,0.624,0.75,0.779,0.788,0.769,0.763,0.534,0.741,0.713,0.825,0.79,0.833,0.862,0.653,0.693,0.702,0.78,0.858,0.837,0.759,0.794,0.794,0.796,0.8,0.8,0.808,0.797,0.802,0.788,0.814,0.824,0.794,0.635,0.821,0.764,0.735,0.814,0.765,0.745,0.823,0.787,0.775,0.668,0.528,0.735,0.695,0.683,0.735,0.709,0.608,0.783,0.768,0.771,0.774,0.769,0.779,0.764,0.772,0.785,0.783,0.808,0.696,6227,Activity,AMIE_PSEAE,High,Prokaryote
+ANCSZ_Hobbs_2022,0.768,0.753,0.715,0.73,0.754,0.756,0.754,0.727,0.744,0.756,0.761,0.756,0.768,0.752,0.783,0.786,0.791,0.789,0.781,0.508,0.723,0.744,0.751,0.747,0.747,0.756,0.752,0.732,0.74,0.777,0.766,0.758,0.649,0.732,0.751,0.736,0.76,0.772,0.767,0.764,0.773,0.766,0.763,0.697,0.739,0.75,0.731,0.728,0.706,0.587,0.771,0.767,0.78,0.779,0.775,0.781,0.773,0.778,0.775,0.783,0.783,0.787,4670,Activity,ANCSZ,Medium,Eukaryote
+ARGR_ECOLI_Tsuboyama_2023_1AOY,0.614,0.697,0.7,0.711,0.71,0.704,0.623,0.677,0.715,0.735,0.688,0.638,0.691,0.638,0.619,0.739,0.741,0.748,0.706,0.706,0.667,0.72,0.726,0.699,0.657,0.711,0.726,0.727,0.705,0.723,0.703,0.656,0.525,0.677,0.686,0.709,0.678,0.686,0.703,0.719,0.717,0.725,0.618,0.604,0.716,0.694,0.871,0.824,0.879,0.828,0.749,0.758,0.756,0.762,0.75,0.749,0.737,0.759,0.735,0.759,0.797,0.741,1287,Stability,ARGR_ECOLI,Medium,Prokaryote
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.853,0.659,0.823,0.827,0.764,0.813,0.645,0.769,0.619,0.612,0.642,0.519,0.685,0.607,0.586,0.606,0.67,0.686,0.674,0.664,0.647,0.568,0.67,0.788,0.563,0.646,0.711,0.713,0.714,0.89,0.702,0.69,0.647,0.648,0.635,0.681,0.791,0.76,0.709,0.831,0.816,0.712,0.616,0.591,0.632,0.605,0.649,0.655,0.709,0.509,0.695,0.686,0.729,0.665,0.627,0.757,0.662,0.667,0.738,0.706,0.635,0.621,170,Binding,B2L11_HUMAN,Low,Human
+BBC1_YEAST_Tsuboyama_2023_1TG0,0.605,0.688,0.722,0.725,0.715,0.723,0.607,0.65,0.724,0.735,0.736,0.724,0.739,0.693,0.724,0.716,0.754,0.744,0.768,0.685,0.687,0.688,0.706,0.691,0.56,0.711,0.718,0.709,0.678,0.688,0.698,0.684,0.645,0.684,0.636,0.653,0.7,0.675,0.687,0.729,0.71,0.721,0.592,0.577,0.671,0.617,0.801,0.748,0.857,0.824,0.743,0.752,0.756,0.764,0.757,0.758,0.758,0.754,0.764,0.758,0.789,0.807,2069,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU,0.698,0.731,0.7,0.705,0.704,0.711,0.563,0.637,0.704,0.772,0.714,0.677,0.717,0.591,0.603,0.662,0.766,0.778,0.732,0.725,0.569,0.693,0.623,0.707,0.608,0.733,0.655,0.716,0.737,0.827,0.796,0.77,0.524,0.634,0.66,0.734,0.734,0.734,0.759,0.717,0.718,0.739,0.563,0.563,0.609,0.598,0.784,0.759,0.845,0.79,0.788,0.792,0.797,0.805,0.798,0.807,0.78,0.794,0.789,0.797,0.769,0.648,1572,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Deng_2012,0.667,0.763,0.764,0.773,0.759,0.764,0.542,0.653,0.764,0.784,0.762,0.751,0.761,0.628,0.683,0.737,0.772,0.735,0.686,0.733,0.691,0.698,0.706,0.704,0.7,0.73,0.732,0.714,0.678,0.742,0.77,0.751,0.552,0.698,0.708,0.693,0.735,0.747,0.748,0.766,0.775,0.773,0.676,0.52,0.749,0.71,0.718,0.755,0.741,0.63,0.77,0.774,0.776,0.784,0.781,0.779,0.781,0.771,0.776,0.785,0.781,0.715,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Firnberg_2014,0.723,0.859,0.868,0.876,0.852,0.863,0.562,0.729,0.859,0.877,0.858,0.836,0.856,0.697,0.773,0.83,0.876,0.795,0.719,0.832,0.784,0.779,0.76,0.758,0.799,0.827,0.831,0.79,0.721,0.845,0.889,0.888,0.588,0.778,0.761,0.732,0.822,0.815,0.811,0.865,0.864,0.864,0.769,0.519,0.855,0.799,0.81,0.863,0.851,0.667,0.871,0.867,0.87,0.882,0.882,0.882,0.885,0.87,0.88,0.888,0.884,0.796,4783,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Jacquier_2013,0.727,0.853,0.857,0.869,0.855,0.868,0.551,0.716,0.843,0.863,0.835,0.833,0.843,0.682,0.755,0.819,0.858,0.801,0.749,0.822,0.781,0.768,0.759,0.758,0.784,0.816,0.811,0.784,0.729,0.791,0.866,0.851,0.577,0.757,0.766,0.747,0.807,0.816,0.821,0.86,0.867,0.871,0.739,0.524,0.837,0.783,0.771,0.83,0.813,0.655,0.837,0.842,0.845,0.847,0.845,0.847,0.855,0.842,0.84,0.854,0.862,0.79,989,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Stiffler_2015,0.728,0.862,0.875,0.882,0.857,0.868,0.563,0.73,0.863,0.881,0.865,0.846,0.867,0.704,0.782,0.839,0.883,0.802,0.724,0.837,0.793,0.786,0.757,0.761,0.809,0.835,0.838,0.794,0.724,0.851,0.897,0.895,0.587,0.787,0.758,0.729,0.83,0.815,0.812,0.871,0.868,0.869,0.776,0.518,0.863,0.807,0.816,0.867,0.854,0.675,0.875,0.866,0.872,0.882,0.882,0.884,0.886,0.873,0.882,0.89,0.891,0.805,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BRCA1_HUMAN_Findlay_2018,0.824,0.756,0.788,0.774,0.776,0.8,0.566,0.683,0.76,0.772,0.836,0.775,0.805,0.582,0.767,0.823,0.86,0.845,0.788,0.699,0.577,0.762,0.778,0.774,0.705,0.82,0.814,0.801,0.815,0.776,0.86,0.832,0.658,0.603,0.621,0.79,0.792,0.792,0.846,0.806,0.806,0.855,0.642,0.58,0.865,0.815,0.804,0.809,0.57,0.587,0.856,0.85,0.86,0.852,0.861,0.862,0.858,0.857,0.853,0.864,0.864,0.832,1837,OrganismalFitness,BRCA1_HUMAN,Low,Human
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.908,0.816,0.866,0.861,0.85,0.875,0.588,0.893,0.702,0.556,0.906,0.593,0.526,0.529,0.528,0.914,0.94,0.904,0.899,0.584,0.515,0.921,0.923,0.861,0.928,0.888,0.896,0.887,0.587,0.86,0.785,0.85,0.558,0.549,0.581,0.59,0.822,0.817,0.825,0.833,0.819,0.837,0.59,0.553,0.892,0.492,0.48,0.503,0.279,0.76,0.871,0.899,0.896,0.878,0.876,0.874,0.868,0.839,0.841,0.885,0.5,0.507,265,OrganismalFitness,BRCA2_HUMAN,,Human
+C6KNH7_9INFA_Lee_2018,0.697,0.687,0.676,0.68,0.719,0.72,0.485,0.731,0.686,0.689,0.526,0.717,0.747,0.49,0.488,0.495,0.75,0.705,0.716,0.674,0.701,0.685,0.693,0.678,0.621,0.734,0.715,0.736,0.69,0.742,0.731,0.681,0.559,0.683,0.691,0.701,0.71,0.713,0.719,0.724,0.724,0.729,0.487,0.481,0.644,0.498,0.761,0.776,0.777,0.619,0.756,0.756,0.767,0.769,0.774,0.763,0.772,0.768,0.768,0.773,0.657,0.593,10754,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+CALM1_HUMAN_Weile_2017,0.594,0.631,0.63,0.627,0.63,0.631,0.588,0.61,0.635,0.644,0.646,0.629,0.65,0.58,0.602,0.602,0.615,0.623,0.627,0.63,0.599,0.629,0.636,0.644,0.631,0.662,0.66,0.676,0.684,0.641,0.619,0.58,0.553,0.621,0.648,0.674,0.621,0.639,0.653,0.634,0.638,0.639,0.619,0.58,0.658,0.643,0.552,0.58,0.592,0.548,0.598,0.598,0.599,0.593,0.6,0.594,0.596,0.602,0.593,0.6,0.659,0.639,1813,OrganismalFitness,CALM1_HUMAN,High,Human
+CAPSD_AAV2S_Sinai_2021,0.698,0.67,0.665,0.688,0.67,0.662,0.703,0.724,0.66,0.679,0.588,0.609,0.61,0.643,0.659,0.61,0.649,0.594,0.553,0.63,0.606,0.641,0.648,0.65,0.614,0.61,0.644,0.614,0.717,0.719,0.586,0.582,0.56,0.612,0.644,0.757,0.686,0.689,0.74,0.668,0.67,0.716,0.558,0.594,0.644,0.564,0.713,0.703,0.673,0.666,0.614,0.61,0.603,0.615,0.604,0.605,0.607,0.602,0.613,0.609,0.668,0.629,42328,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAR11_HUMAN_Meitlis_2020_gof,0.551,0.464,0.464,0.467,0.459,0.466,0.591,0.403,0.519,0.527,0.522,0.589,0.548,0.545,0.556,0.653,0.57,0.57,0.583,0.545,0.564,0.475,0.511,0.525,0.365,0.478,0.498,0.476,0.518,0.451,0.512,0.514,0.593,0.524,0.471,0.425,0.518,0.492,0.45,0.488,0.475,0.442,0.557,0.516,0.517,0.623,0.589,0.524,0.592,0.548,0.597,0.593,0.595,0.574,0.576,0.581,0.575,0.584,0.589,0.587,0.654,0.643,2374,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAR11_HUMAN_Meitlis_2020_lof,0.752,0.711,0.689,0.679,0.713,0.721,0.541,0.509,0.654,0.652,0.761,0.77,0.765,0.524,0.552,0.74,0.783,0.794,0.802,0.604,0.572,0.698,0.676,0.658,0.605,0.686,0.69,0.727,0.598,0.707,0.75,0.722,0.7,0.49,0.675,0.644,0.678,0.707,0.684,0.711,0.717,0.697,0.539,0.512,0.746,0.661,0.725,0.757,0.742,0.582,0.789,0.785,0.787,0.786,0.781,0.792,0.779,0.787,0.795,0.793,0.803,0.716,2395,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAS9_STRP1_Spencer_2017_positive,0.58,0.594,0.587,0.589,0.59,0.593,0.516,0.549,0.593,0.595,0.588,0.532,0.533,0.528,0.544,0.578,0.594,0.597,0.6,0.504,0.519,0.537,0.585,0.56,0.517,0.599,0.592,0.589,0.534,0.596,0.603,0.593,0.513,0.514,0.521,0.582,0.584,0.584,0.593,0.595,0.595,0.598,0.528,0.512,0.592,0.551,0.551,0.592,0.513,0.517,0.59,0.591,0.591,0.592,0.591,0.592,0.59,0.59,0.591,0.593,0.587,0.556,8117,Activity,CAS9_STRP1,Medium,Prokaryote
+CASP3_HUMAN_Roychowdhury_2020,0.694,0.776,0.814,0.818,0.822,0.825,0.532,0.735,0.825,0.833,0.799,0.805,0.818,0.637,0.792,0.828,0.824,0.783,0.751,0.813,0.61,0.748,0.757,0.779,0.756,0.781,0.785,0.757,0.784,0.809,0.804,0.773,0.621,0.537,0.739,0.765,0.768,0.782,0.797,0.821,0.816,0.824,0.709,0.492,0.813,0.784,0.707,0.78,0.751,0.638,0.781,0.787,0.791,0.787,0.791,0.785,0.797,0.785,0.787,0.797,0.833,0.762,1567,Activity,CASP3_HUMAN,High,Human
+CASP7_HUMAN_Roychowdhury_2020,0.713,0.782,0.835,0.835,0.831,0.833,0.537,0.767,0.828,0.842,0.815,0.835,0.845,0.669,0.826,0.838,0.84,0.816,0.803,0.832,0.613,0.799,0.796,0.811,0.798,0.819,0.822,0.816,0.81,0.85,0.83,0.8,0.682,0.552,0.804,0.787,0.782,0.831,0.822,0.825,0.847,0.842,0.76,0.509,0.848,0.825,0.763,0.835,0.817,0.674,0.823,0.816,0.81,0.827,0.836,0.833,0.827,0.817,0.828,0.835,0.847,0.793,1680,Activity,CASP7_HUMAN,Medium,Human
+CATR_CHLRE_Tsuboyama_2023_2AMI,0.796,0.813,0.85,0.853,0.86,0.863,0.87,0.818,0.81,0.791,0.847,0.852,0.864,0.798,0.869,0.87,0.866,0.837,0.862,0.521,0.844,0.835,0.841,0.836,0.857,0.852,0.835,0.817,0.829,0.866,0.846,0.848,0.734,0.851,0.836,0.853,0.862,0.853,0.869,0.856,0.854,0.864,0.748,0.732,0.691,0.758,0.76,0.744,0.873,0.799,0.868,0.866,0.878,0.872,0.883,0.868,0.869,0.862,0.865,0.874,0.851,0.874,1903,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,0.854,0.892,0.904,0.906,0.891,0.899,0.748,0.832,0.918,0.921,0.911,0.893,0.888,0.785,0.86,0.898,0.897,0.905,0.898,0.912,0.769,0.778,0.866,0.867,0.79,0.852,0.851,0.852,0.884,0.902,0.899,0.868,0.516,0.767,0.834,0.869,0.877,0.881,0.895,0.903,0.9,0.908,0.772,0.663,0.828,0.843,0.905,0.893,0.959,0.929,0.929,0.928,0.93,0.927,0.933,0.931,0.93,0.928,0.926,0.932,0.902,0.951,2068,Stability,CBPA2_HUMAN,Medium,Human
+CBS_HUMAN_Sun_2020,0.687,0.701,0.707,0.717,0.713,0.717,0.613,0.644,0.71,0.712,0.696,0.696,0.71,0.545,0.627,0.689,0.692,0.684,0.69,0.696,0.699,0.646,0.659,0.662,0.696,0.652,0.662,0.652,0.674,0.714,0.711,0.698,0.611,0.702,0.656,0.648,0.715,0.685,0.683,0.724,0.705,0.706,0.673,0.534,0.714,0.711,0.633,0.655,0.676,0.547,0.683,0.684,0.688,0.691,0.691,0.69,0.687,0.688,0.689,0.693,0.72,0.664,7217,OrganismalFitness,CBS_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28,0.752,0.773,0.844,0.865,0.811,0.847,0.339,0.819,0.812,0.808,0.812,0.829,0.828,0.344,0.878,0.855,0.844,0.801,0.802,0.828,0.753,0.803,0.797,0.797,0.783,0.793,0.799,0.767,0.793,0.815,0.845,0.84,0.719,0.714,0.75,0.765,0.802,0.799,0.81,0.844,0.856,0.859,0.816,0.317,0.682,0.786,0.54,0.561,0.796,0.783,0.853,0.809,0.85,0.838,0.861,0.839,0.839,0.871,0.835,0.852,0.826,0.888,2282,Stability,CBX4_HUMAN,High,Human
+CCDB_ECOLI_Adkar_2012,0.705,0.749,0.757,0.777,0.767,0.775,0.47,0.619,0.706,0.715,0.745,0.701,0.749,0.484,0.477,0.723,0.761,0.746,0.629,0.756,0.514,0.471,0.431,0.571,0.396,0.538,0.484,0.5,0.713,0.743,0.817,0.807,0.558,0.489,0.507,0.657,0.743,0.729,0.75,0.763,0.753,0.771,0.497,0.468,0.747,0.495,0.65,0.741,0.67,0.648,0.743,0.728,0.733,0.743,0.743,0.743,0.743,0.755,0.768,0.751,0.746,0.61,1176,Activity,CCDB_ECOLI,High,Prokaryote
+CCDB_ECOLI_Tripathi_2016,0.809,0.861,0.872,0.886,0.867,0.876,0.496,0.709,0.786,0.807,0.851,0.808,0.843,0.498,0.502,0.826,0.865,0.871,0.757,0.87,0.528,0.512,0.447,0.624,0.425,0.567,0.482,0.553,0.841,0.861,0.891,0.87,0.577,0.505,0.597,0.803,0.844,0.845,0.88,0.869,0.87,0.89,0.507,0.462,0.85,0.517,0.727,0.806,0.74,0.666,0.847,0.822,0.822,0.844,0.844,0.843,0.837,0.848,0.869,0.851,0.849,0.734,1663,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+CCR5_HUMAN_Gill_2023,0.634,0.637,0.638,0.639,0.634,0.638,0.649,0.644,0.662,0.668,0.675,0.664,0.67,0.651,0.67,0.674,0.668,0.662,0.664,0.67,0.675,0.673,0.657,0.659,0.675,0.67,0.665,0.673,0.66,0.674,0.644,0.619,0.557,0.675,0.676,0.678,0.678,0.682,0.685,0.658,0.658,0.659,0.668,0.59,0.676,0.67,0.633,0.647,0.638,0.585,0.65,0.647,0.652,0.652,0.649,0.652,0.657,0.657,0.657,0.657,0.668,0.671,6137,Binding,CCR5_HUMAN,High,Human
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.623,0.617,0.612,0.621,0.633,0.632,0.574,0.58,0.597,0.599,0.591,0.598,0.613,0.603,0.62,0.6,0.599,0.607,0.589,0.492,0.597,0.6,0.628,0.617,0.593,0.648,0.616,0.607,0.63,0.647,0.599,0.574,0.581,0.606,0.631,0.609,0.63,0.636,0.628,0.642,0.644,0.639,0.61,0.568,0.594,0.613,0.728,0.683,0.72,0.608,0.644,0.648,0.643,0.662,0.654,0.653,0.658,0.647,0.653,0.659,0.755,0.71,3761,Binding,CD19_HUMAN,Low,Human
+CP2C9_HUMAN_Amorosi_2021_abundance,0.748,0.788,0.8,0.81,0.8,0.808,0.763,0.783,0.8,0.808,0.758,0.795,0.814,0.767,0.803,0.816,0.822,0.817,0.8,0.807,0.783,0.783,0.8,0.777,0.792,0.795,0.796,0.79,0.787,0.806,0.792,0.762,0.611,0.797,0.787,0.786,0.811,0.811,0.811,0.82,0.818,0.819,0.79,0.549,0.768,0.806,0.782,0.775,0.81,0.611,0.806,0.804,0.812,0.823,0.821,0.817,0.817,0.813,0.814,0.824,0.822,0.811,6370,Expression,CP2C9_HUMAN,High,Human
+CP2C9_HUMAN_Amorosi_2021_activity,0.758,0.787,0.804,0.814,0.797,0.808,0.794,0.805,0.8,0.816,0.752,0.819,0.831,0.784,0.825,0.839,0.835,0.833,0.804,0.822,0.791,0.796,0.808,0.789,0.806,0.794,0.796,0.799,0.791,0.812,0.8,0.774,0.609,0.814,0.798,0.787,0.829,0.825,0.82,0.826,0.822,0.818,0.813,0.543,0.775,0.832,0.802,0.791,0.833,0.628,0.817,0.821,0.824,0.829,0.833,0.835,0.831,0.824,0.829,0.837,0.834,0.833,6142,Binding,CP2C9_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM,0.695,0.831,0.803,0.81,0.797,0.805,0.653,0.791,0.817,0.821,0.842,0.859,0.879,0.728,0.886,0.828,0.781,0.821,0.82,0.79,0.756,0.775,0.836,0.845,0.741,0.857,0.849,0.834,0.835,0.833,0.818,0.811,0.67,0.704,0.784,0.83,0.779,0.798,0.831,0.815,0.828,0.84,0.674,0.621,0.754,0.767,0.785,0.776,0.839,0.86,0.846,0.846,0.842,0.858,0.86,0.861,0.853,0.868,0.842,0.857,0.848,0.873,3295,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX,0.666,0.715,0.699,0.7,0.725,0.709,0.551,0.702,0.755,0.764,0.736,0.694,0.672,0.563,0.695,0.717,0.763,0.74,0.73,0.652,0.564,0.592,0.597,0.522,0.538,0.588,0.601,0.6,0.69,0.702,0.733,0.707,0.62,0.562,0.556,0.587,0.678,0.682,0.684,0.704,0.705,0.709,0.562,0.51,0.713,0.603,0.748,0.743,0.813,0.728,0.769,0.749,0.746,0.772,0.753,0.771,0.765,0.759,0.755,0.765,0.79,0.771,1580,Stability,CUE1_YEAST,Medium,Eukaryote
+D7PM05_CLYGR_Somermeyer_2022,0.781,0.83,0.828,0.81,0.835,0.835,0.531,0.75,0.849,0.851,0.732,0.53,0.529,0.53,0.522,0.512,0.52,0.531,0.575,0.735,0.514,0.519,0.554,0.541,0.5,0.521,0.535,0.601,0.69,0.831,0.805,0.813,0.514,0.549,0.563,0.58,0.783,0.783,0.781,0.834,0.833,0.828,0.496,0.506,0.498,0.508,0.636,0.652,0.76,0.706,0.759,0.756,0.758,0.764,0.756,0.761,0.758,0.756,0.755,0.76,0.668,0.6,24515,Activity,D7PM05_CLYGR,Low,Eukaryote
+DLG4_HUMAN_Faure_2021,0.877,0.822,0.834,0.816,0.834,0.838,0.904,0.853,0.791,0.795,0.781,0.801,0.834,0.909,0.921,0.905,0.822,0.764,0.741,0.854,0.812,0.818,0.809,0.791,0.834,0.832,0.809,0.804,0.768,0.839,0.855,0.848,0.777,0.816,0.852,0.817,0.861,0.888,0.867,0.854,0.865,0.849,0.827,0.613,0.708,0.793,0.839,0.741,0.868,0.668,0.768,0.7,0.798,0.774,0.786,0.77,0.782,0.77,0.779,0.778,0.783,0.913,6976,OrganismalFitness,DLG4_HUMAN,Low,Human
+DLG4_RAT_McLaughlin_2012,0.83,0.771,0.789,0.801,0.84,0.848,0.818,0.794,0.824,0.847,0.806,0.869,0.88,0.796,0.883,0.884,0.857,0.823,0.796,0.8,0.764,0.769,0.767,0.752,0.789,0.795,0.781,0.755,0.756,0.834,0.848,0.833,0.685,0.759,0.763,0.724,0.821,0.816,0.804,0.855,0.853,0.854,0.847,0.552,0.777,0.834,0.791,0.709,0.814,0.593,0.791,0.76,0.801,0.771,0.804,0.79,0.775,0.797,0.803,0.81,0.82,0.865,1576,Binding,DLG4_RAT,Low,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC,0.574,0.58,0.601,0.61,0.601,0.625,0.424,0.668,0.66,0.672,0.496,0.539,0.531,0.502,0.567,0.581,0.63,0.647,0.681,0.604,0.44,0.5,0.493,0.508,0.498,0.533,0.5,0.511,0.59,0.647,0.722,0.699,0.522,0.469,0.467,0.486,0.593,0.589,0.593,0.614,0.606,0.608,0.522,0.451,0.6,0.53,0.787,0.782,0.837,0.783,0.683,0.675,0.738,0.73,0.72,0.718,0.709,0.689,0.682,0.716,0.759,0.723,1008,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,0.928,0.937,0.937,0.937,0.944,0.943,0.887,0.916,0.938,0.945,0.946,0.937,0.949,0.95,0.955,0.955,0.946,0.935,0.953,0.937,0.862,0.906,0.924,0.914,0.929,0.922,0.93,0.896,0.929,0.938,0.92,0.909,0.864,0.928,0.929,0.943,0.954,0.955,0.96,0.948,0.95,0.953,0.846,0.582,0.711,0.85,0.822,0.766,0.928,0.933,0.932,0.936,0.934,0.948,0.945,0.95,0.948,0.944,0.949,0.946,0.926,0.946,2264,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,0.714,0.738,0.718,0.729,0.757,0.761,0.521,0.72,0.686,0.687,0.749,0.727,0.741,0.607,0.719,0.736,0.753,0.744,0.751,0.764,0.654,0.696,0.678,0.685,0.645,0.677,0.659,0.646,0.673,0.721,0.719,0.72,0.591,0.597,0.652,0.612,0.735,0.743,0.739,0.751,0.752,0.758,0.666,0.4,0.741,0.711,0.668,0.672,0.703,0.707,0.743,0.734,0.742,0.746,0.738,0.748,0.729,0.74,0.747,0.742,0.767,0.752,2915,Stability,DOCK1_MOUSE,High,Eukaryote
+DYR_ECOLI_Nguyen_2023,0.757,0.851,0.863,0.869,0.868,0.872,0.497,0.834,0.861,0.855,0.874,0.879,0.887,0.564,0.854,0.874,0.886,0.886,0.878,0.873,0.739,0.805,0.732,0.778,0.828,0.831,0.843,0.855,0.817,0.876,0.886,0.873,0.624,0.842,0.758,0.747,0.865,0.811,0.821,0.88,0.856,0.86,0.839,0.499,0.877,0.855,0.665,0.793,0.748,0.576,0.841,0.817,0.826,0.833,0.836,0.837,0.843,0.843,0.859,0.849,0.861,0.808,2916,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+DYR_ECOLI_Thompson_2019,0.745,0.792,0.79,0.792,0.796,0.797,0.457,0.72,0.788,0.806,0.782,0.764,0.772,0.532,0.724,0.773,0.789,0.807,0.806,0.792,0.644,0.746,0.667,0.703,0.734,0.758,0.789,0.778,0.766,0.79,0.794,0.78,0.605,0.749,0.712,0.711,0.773,0.757,0.765,0.797,0.795,0.8,0.718,0.48,0.77,0.751,0.621,0.728,0.681,0.554,0.763,0.739,0.744,0.742,0.757,0.752,0.764,0.759,0.771,0.763,0.792,0.722,2363,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+ENV_HV1B9_DuenasDecamp_2016,0.747,0.744,0.688,0.742,0.765,0.761,0.503,0.704,0.732,0.751,0.722,0.791,0.777,0.533,0.515,0.496,0.47,0.507,0.539,0.751,0.753,0.74,0.768,0.771,0.707,0.783,0.769,0.746,0.747,0.767,0.768,0.757,0.674,0.767,0.767,0.771,0.774,0.776,0.772,0.772,0.773,0.774,0.719,0.451,0.748,0.759,0.736,0.713,0.731,0.651,0.643,0.695,0.704,0.676,0.655,0.653,0.667,0.659,0.644,0.682,0.607,0.587,375,OrganismalFitness,ENV_HV1B9,Medium,Virus
+ENV_HV1BR_Haddox_2016,0.669,0.652,0.66,0.661,0.671,0.673,0.499,0.659,0.674,0.673,0.649,0.659,0.669,0.495,0.495,0.497,0.519,0.529,0.577,0.667,0.676,0.68,0.687,0.682,0.673,0.68,0.68,0.676,0.682,0.677,0.665,0.649,0.599,0.673,0.682,0.681,0.681,0.685,0.684,0.685,0.687,0.684,0.615,0.49,0.66,0.649,0.597,0.616,0.553,0.534,0.585,0.581,0.594,0.607,0.587,0.594,0.595,0.59,0.589,0.598,0.577,0.541,12863,OrganismalFitness,ENV_HV1BR,Medium,Virus
+ENVZ_ECOLI_Ghose_2023,0.571,0.569,0.61,0.61,0.619,0.618,0.503,0.616,0.608,0.614,0.606,0.625,0.636,0.607,0.615,0.618,0.613,0.6,0.584,0.614,0.605,0.601,0.632,0.625,0.606,0.611,0.626,0.621,0.615,0.618,0.598,0.595,0.559,0.602,0.612,0.621,0.618,0.623,0.63,0.621,0.625,0.625,0.611,0.566,0.612,0.61,0.528,0.596,0.582,0.525,0.623,0.62,0.604,0.617,0.639,0.625,0.615,0.625,0.627,0.627,0.594,0.637,1121,Activity,ENVZ_ECOLI,High,Prokaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,0.839,0.898,0.899,0.902,0.914,0.915,0.309,0.906,0.948,0.952,0.944,0.946,0.953,0.286,0.941,0.954,0.963,0.945,0.94,0.923,0.831,0.86,0.899,0.889,0.88,0.898,0.905,0.913,0.92,0.954,0.94,0.936,0.886,0.86,0.859,0.879,0.908,0.908,0.916,0.919,0.918,0.917,0.85,0.341,0.857,0.861,0.83,0.828,0.943,0.909,0.957,0.954,0.956,0.955,0.961,0.958,0.958,0.958,0.955,0.96,0.949,0.944,1960,Stability,EPHB2_HUMAN,High,Human
+ERBB2_HUMAN_Elazar_2016,0.697,0.691,0.641,0.642,0.634,0.645,0.717,0.694,0.702,0.715,0.719,0.73,0.738,0.737,0.732,0.735,0.706,0.692,0.652,0.525,0.721,0.729,0.752,0.725,0.74,0.747,0.778,0.782,0.734,0.692,0.716,0.543,0.732,0.768,0.728,0.736,0.747,0.731,0.737,0.728,0.723,0.72,0.739,0.735,0.764,0.734,0.718,0.73,0.407,0.527,0.698,0.693,0.72,0.702,0.699,0.699,0.709,0.717,0.721,0.714,0.756,0.741,326,Expression,ERBB2_HUMAN,Low,Human
+ESTA_BACSU_Nutschel_2020,0.637,0.694,0.702,0.711,0.696,0.694,0.577,0.665,0.669,0.707,0.659,0.645,0.662,0.576,0.621,0.629,0.645,0.641,0.664,0.654,0.548,0.591,0.634,0.648,0.628,0.62,0.653,0.634,0.696,0.69,0.724,0.711,0.534,0.579,0.63,0.641,0.655,0.665,0.673,0.7,0.703,0.703,0.588,0.552,0.637,0.623,0.783,0.743,0.774,0.681,0.677,0.664,0.673,0.666,0.682,0.67,0.666,0.677,0.675,0.679,0.691,0.594,2172,Stability,ESTA_BACSU,High,Prokaryote
+F7YBW8_MESOW_Aakre_2015,0.537,0.668,0.668,0.689,0.683,0.684,0.517,0.633,0.668,0.672,0.689,0.669,0.674,0.475,0.513,0.531,0.671,0.665,0.687,0.67,0.479,0.457,0.458,0.503,0.555,0.559,0.501,0.633,0.677,0.659,0.698,0.687,0.518,0.498,0.46,0.685,0.537,0.515,0.684,0.664,0.658,0.691,0.488,0.479,0.599,0.492,0.498,0.616,0.528,0.519,0.687,0.692,0.68,0.7,0.696,0.697,0.686,0.692,0.695,0.694,0.626,0.458,9192,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+FECA_ECOLI_Tsuboyama_2023_2D1U,0.705,0.727,0.746,0.751,0.746,0.742,0.51,0.677,0.701,0.745,0.775,0.637,0.727,0.51,0.735,0.784,0.78,0.764,0.711,0.746,0.577,0.648,0.697,0.663,0.581,0.754,0.743,0.641,0.717,0.779,0.719,0.684,0.657,0.526,0.562,0.577,0.692,0.687,0.665,0.729,0.727,0.716,0.701,0.5,0.756,0.729,0.801,0.783,0.816,0.781,0.783,0.761,0.776,0.775,0.769,0.787,0.768,0.763,0.77,0.777,0.814,0.811,1886,Stability,FECA_ECOLI,High,Prokaryote
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,0.727,0.707,0.759,0.76,0.763,0.771,0.589,0.675,0.63,0.629,0.602,0.594,0.591,0.593,0.592,0.583,0.6,0.64,0.694,0.661,0.622,0.622,0.627,0.632,0.585,0.622,0.599,0.625,0.628,0.752,0.673,0.676,0.464,0.566,0.602,0.624,0.718,0.721,0.688,0.765,0.767,0.73,0.599,0.602,0.595,0.613,0.856,0.824,0.855,0.819,0.66,0.716,0.738,0.711,0.71,0.716,0.693,0.697,0.65,0.712,0.798,0.684,1237,Stability,FKBP3_HUMAN,Medium,Human
+GAL4_YEAST_Kitzman_2015,0.646,0.711,0.712,0.748,0.733,0.749,0.677,0.496,0.745,0.771,0.782,0.74,0.743,0.685,0.712,0.759,0.795,0.793,0.788,0.749,0.672,0.691,0.695,0.695,0.691,0.716,0.741,0.727,0.771,0.792,0.792,0.763,0.636,0.655,0.669,0.67,0.757,0.752,0.754,0.742,0.739,0.737,0.708,0.653,0.777,0.731,0.649,0.763,0.67,0.568,0.773,0.769,0.767,0.767,0.779,0.778,0.77,0.768,0.767,0.777,0.767,0.719,1195,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+GCN4_YEAST_Staller_2018,0.632,0.633,0.629,0.63,0.628,0.628,0.584,0.611,0.631,0.631,0.627,0.644,0.647,0.662,0.644,0.638,0.649,0.638,0.631,0.599,0.574,0.581,0.572,0.572,0.563,0.551,0.532,0.559,0.577,0.622,0.637,0.634,0.519,0.573,0.571,0.641,0.635,0.635,0.647,0.635,0.635,0.645,0.549,0.589,0.572,0.556,0.568,0.591,0.612,0.587,0.624,0.62,0.62,0.617,0.623,0.622,0.621,0.621,0.623,0.622,0.624,0.627,2638,Binding,GCN4_YEAST,Low,Eukaryote
+GDIA_HUMAN_Silverstein_2021,0.719,0.72,0.726,0.724,0.727,0.727,0.591,0.696,0.739,0.736,0.694,0.706,0.726,0.582,0.618,0.716,0.698,0.684,0.704,0.701,0.626,0.688,0.686,0.701,0.659,0.718,0.708,0.685,0.696,0.704,0.712,0.692,0.627,0.657,0.69,0.671,0.715,0.719,0.713,0.726,0.733,0.729,0.616,0.56,0.717,0.62,0.671,0.709,0.682,0.544,0.695,0.669,0.697,0.684,0.691,0.7,0.688,0.696,0.69,0.701,0.729,0.659,1154,OrganismalFitness,GDIA_HUMAN,Low,Human
+GFP_AEQVI_Sarkisyan_2016,0.889,0.884,0.9,0.901,0.903,0.903,0.54,0.883,0.894,0.891,0.816,0.565,0.565,0.552,0.583,0.564,0.572,0.596,0.679,0.858,0.555,0.572,0.614,0.565,0.532,0.615,0.684,0.884,0.886,0.904,0.865,0.87,0.541,0.545,0.616,0.878,0.889,0.89,0.902,0.906,0.907,0.919,0.515,0.495,0.525,0.525,0.796,0.792,0.925,0.86,0.856,0.86,0.866,0.871,0.867,0.866,0.864,0.865,0.854,0.865,0.872,0.769,51714,Activity,GFP_AEQVI,Low,Eukaryote
+GLPA_HUMAN_Elazar_2016,0.602,0.546,0.603,0.601,0.594,0.583,0.711,0.799,0.682,0.672,0.741,0.756,0.754,0.728,0.746,0.77,0.751,0.727,0.757,0.583,0.743,0.755,0.737,0.761,0.739,0.752,0.761,0.765,0.798,0.672,0.746,0.638,0.638,0.761,0.737,0.761,0.75,0.736,0.748,0.732,0.717,0.74,0.758,0.749,0.762,0.727,0.722,0.767,0.706,0.617,0.79,0.779,0.782,0.765,0.776,0.747,0.776,0.748,0.77,0.781,0.775,0.766,245,Expression,GLPA_HUMAN,Low,Human
+GRB2_HUMAN_Faure_2021,0.727,0.787,0.784,0.801,0.799,0.802,0.777,0.748,0.796,0.768,0.792,0.747,0.781,0.797,0.832,0.847,0.858,0.792,0.816,0.799,0.807,0.79,0.786,0.752,0.801,0.785,0.753,0.793,0.751,0.78,0.741,0.741,0.757,0.801,0.786,0.727,0.79,0.784,0.745,0.812,0.812,0.796,0.826,0.673,0.789,0.816,0.879,0.79,0.89,0.785,0.845,0.86,0.855,0.856,0.86,0.849,0.849,0.859,0.848,0.861,0.806,0.837,63366,OrganismalFitness,GRB2_HUMAN,Medium,Human
+HCP_LAMBD_Tsuboyama_2023_2L6Q,0.652,0.715,0.572,0.586,0.679,0.682,0.629,0.707,0.726,0.749,0.829,0.724,0.747,0.644,0.704,0.783,0.835,0.79,0.801,0.62,0.644,0.652,0.67,0.677,0.66,0.669,0.605,0.666,0.765,0.802,0.824,0.758,0.579,0.633,0.643,0.753,0.711,0.716,0.773,0.687,0.714,0.765,0.653,0.643,0.81,0.686,0.829,0.861,0.864,0.786,0.84,0.846,0.846,0.843,0.851,0.853,0.844,0.854,0.853,0.857,0.878,0.799,1040,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM,0.846,0.873,0.829,0.844,0.885,0.887,0.667,0.823,0.858,0.859,0.83,0.696,0.72,0.642,0.656,0.917,0.884,0.86,0.846,0.869,0.674,0.815,0.866,0.859,0.792,0.863,0.856,0.841,0.848,0.887,0.873,0.871,0.62,0.675,0.697,0.743,0.907,0.886,0.91,0.895,0.895,0.901,0.596,0.581,0.902,0.668,0.755,0.869,0.857,0.772,0.868,0.88,0.899,0.889,0.893,0.914,0.887,0.897,0.888,0.896,0.892,0.749,5586,Stability,HECD1_HUMAN,Medium,Human
+HEM3_HUMAN_Loggerenberg_2023,0.712,0.707,0.701,0.702,0.71,0.709,0.56,0.566,0.716,0.722,0.694,0.688,0.696,0.569,0.69,0.702,0.694,0.713,0.72,0.562,0.669,0.688,0.692,0.7,0.676,0.694,0.691,0.693,0.709,0.72,0.715,0.711,0.547,0.689,0.694,0.703,0.713,0.719,0.731,0.715,0.719,0.726,0.66,0.573,0.69,0.685,0.666,0.684,0.659,0.58,0.678,0.669,0.676,0.681,0.688,0.686,0.686,0.686,0.683,0.688,0.716,0.709,5689,Activity,HEM3_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019,0.768,0.809,0.825,0.822,0.801,0.8,0.589,0.645,0.762,0.785,0.761,0.734,0.773,0.534,0.568,0.749,0.728,0.763,0.769,0.755,0.686,0.734,0.75,0.777,0.724,0.779,0.771,0.802,0.81,0.804,0.724,0.714,0.583,0.734,0.758,0.849,0.783,0.783,0.865,0.829,0.812,0.835,0.587,0.524,0.666,0.507,0.729,0.75,0.82,0.722,0.746,0.685,0.718,0.737,0.744,0.723,0.726,0.748,0.738,0.732,0.795,0.632,496137,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+HMDH_HUMAN_Jiang_2019,0.741,0.7,0.683,0.688,0.72,0.718,0.575,0.627,0.625,0.621,0.626,0.664,0.649,0.514,0.498,0.654,0.755,0.682,0.683,0.64,0.727,0.566,0.572,0.595,0.719,0.588,0.572,0.594,0.592,0.735,0.699,0.694,0.61,0.655,0.617,0.6,0.719,0.664,0.657,0.714,0.672,0.67,0.581,0.526,0.735,0.75,0.66,0.721,0.397,0.555,0.745,0.745,0.741,0.747,0.743,0.744,0.736,0.749,0.74,0.748,0.765,0.716,16853,OrganismalFitness,HMDH_HUMAN,Low,Human
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.656,0.674,0.68,0.68,0.677,0.676,0.606,0.684,0.713,0.717,0.677,0.706,0.725,0.482,0.55,0.619,0.622,0.644,0.643,0.508,0.681,0.683,0.694,0.687,0.663,0.673,0.699,0.7,0.703,0.692,0.711,0.685,0.611,0.664,0.66,0.666,0.677,0.674,0.677,0.685,0.683,0.686,0.508,0.485,0.691,0.693,0.523,0.631,0.542,0.493,0.614,0.625,0.614,0.62,0.619,0.615,0.627,0.617,0.611,0.621,0.683,0.651,2252,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Flynn_2019,0.758,0.772,0.796,0.803,0.795,0.795,0.62,0.749,0.804,0.796,0.772,0.79,0.807,0.515,0.57,0.634,0.709,0.765,0.767,0.786,0.78,0.79,0.781,0.794,0.788,0.798,0.806,0.781,0.811,0.806,0.826,0.814,0.647,0.778,0.781,0.781,0.795,0.799,0.798,0.807,0.811,0.81,0.596,0.451,0.773,0.725,0.627,0.726,0.574,0.538,0.712,0.705,0.714,0.722,0.712,0.716,0.719,0.716,0.705,0.719,0.78,0.74,13294,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Mishra_2016,0.882,0.815,0.891,0.9,0.897,0.899,0.794,0.812,0.837,0.839,0.859,0.886,0.893,0.688,0.78,0.819,0.873,0.878,0.867,0.842,0.85,0.843,0.839,0.866,0.866,0.859,0.872,0.831,0.858,0.893,0.903,0.895,0.717,0.845,0.844,0.851,0.881,0.88,0.882,0.898,0.898,0.899,0.849,0.455,0.843,0.88,0.623,0.753,0.645,0.533,0.855,0.838,0.829,0.86,0.86,0.853,0.859,0.865,0.855,0.862,0.842,0.841,4323,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HXK4_HUMAN_Gersing_2022_activity,0.762,0.78,0.759,0.766,0.761,0.766,0.618,0.709,0.779,0.788,0.748,0.758,0.771,0.59,0.649,0.767,0.776,0.775,0.76,0.522,0.746,0.75,0.743,0.732,0.749,0.756,0.746,0.747,0.722,0.776,0.77,0.744,0.601,0.754,0.734,0.702,0.77,0.762,0.743,0.777,0.775,0.767,0.664,0.51,0.775,0.756,0.684,0.738,0.704,0.588,0.771,0.768,0.76,0.766,0.766,0.772,0.768,0.772,0.771,0.774,0.78,0.751,8570,OrganismalFitness,HXK4_HUMAN,Medium,Human
+HXK4_HUMAN_Gersing_2023_abundance,0.694,0.719,0.697,0.713,0.708,0.711,0.534,0.7,0.734,0.746,0.703,0.697,0.712,0.58,0.61,0.701,0.724,0.731,0.713,0.74,0.669,0.671,0.686,0.686,0.668,0.681,0.688,0.686,0.69,0.723,0.681,0.659,0.595,0.681,0.675,0.686,0.705,0.705,0.713,0.711,0.716,0.723,0.601,0.547,0.712,0.703,0.734,0.728,0.732,0.594,0.719,0.727,0.725,0.725,0.73,0.727,0.728,0.724,0.724,0.729,0.757,0.701,8396,Expression,HXK4_HUMAN,Medium,Human
+I6TAH8_I68A0_Doud_2015,0.677,0.664,0.639,0.638,0.685,0.683,0.495,0.659,0.645,0.663,0.503,0.507,0.506,0.509,0.51,0.501,0.507,0.503,0.543,0.608,0.663,0.669,0.691,0.691,0.504,0.508,0.554,0.497,0.655,0.695,0.627,0.63,0.572,0.652,0.666,0.673,0.665,0.676,0.677,0.693,0.701,0.704,0.502,0.511,0.506,0.509,0.608,0.604,0.623,0.549,0.533,0.552,0.564,0.563,0.554,0.551,0.542,0.545,0.527,0.551,0.521,0.504,9462,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+IF1_ECOLI_Kelsic_2016,0.695,0.775,0.823,0.823,0.816,0.82,0.578,0.712,0.649,0.641,0.803,0.815,0.826,0.6,0.756,0.817,0.832,0.818,0.8,0.793,0.673,0.729,0.695,0.714,0.746,0.748,0.751,0.735,0.745,0.722,0.784,0.754,0.643,0.756,0.774,0.787,0.773,0.78,0.785,0.812,0.816,0.82,0.693,0.563,0.821,0.782,0.692,0.797,0.781,0.645,0.798,0.79,0.808,0.814,0.804,0.809,0.789,0.813,0.813,0.819,0.852,0.783,1367,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33,0.718,0.758,0.827,0.821,0.811,0.814,0.604,0.741,0.797,0.813,0.747,0.757,0.772,0.718,0.775,0.838,0.722,0.71,0.649,0.837,0.621,0.646,0.685,0.726,0.735,0.672,0.751,0.696,0.697,0.82,0.773,0.758,0.529,0.614,0.67,0.738,0.747,0.757,0.787,0.792,0.783,0.821,0.755,0.62,0.745,0.813,0.834,0.79,0.851,0.752,0.693,0.67,0.727,0.705,0.726,0.758,0.733,0.708,0.779,0.741,0.737,0.783,1329,Stability,ILF3_HUMAN,High,Human
+ISDH_STAAW_Tsuboyama_2023_2LHR,0.54,0.548,0.599,0.594,0.612,0.612,0.662,0.571,0.68,0.679,0.738,0.71,0.719,0.687,0.695,0.724,0.718,0.74,0.724,0.589,0.653,0.601,0.614,0.629,0.649,0.66,0.655,0.642,0.685,0.669,0.718,0.691,0.622,0.636,0.62,0.636,0.625,0.605,0.616,0.629,0.606,0.616,0.639,0.611,0.665,0.652,0.788,0.743,0.773,0.752,0.725,0.717,0.728,0.724,0.726,0.728,0.715,0.72,0.724,0.726,0.791,0.749,1944,Stability,ISDH_STAAW,High,Prokaryote
+KCNE1_HUMAN_Muhammad_2023_expression,0.641,0.648,0.622,0.628,0.636,0.62,0.67,0.569,0.634,0.636,0.691,0.679,0.666,0.663,0.651,0.682,0.652,0.635,0.657,0.649,0.68,0.694,0.625,0.606,0.69,0.628,0.672,0.633,0.643,0.647,0.636,0.603,0.393,0.656,0.708,0.659,0.662,0.673,0.67,0.657,0.662,0.666,0.673,0.652,0.667,0.693,0.649,0.65,0.644,0.553,0.637,0.625,0.638,0.614,0.617,0.612,0.642,0.636,0.639,0.633,0.66,0.678,2339,Expression,KCNE1_HUMAN,Medium,Human
+KCNE1_HUMAN_Muhammad_2023_function,0.693,0.743,0.739,0.75,0.735,0.756,0.599,0.75,0.752,0.752,0.667,0.728,0.786,0.588,0.567,0.797,0.759,0.721,0.684,0.607,0.62,0.638,0.719,0.77,0.629,0.811,0.793,0.796,0.791,0.74,0.793,0.749,0.495,0.575,0.68,0.816,0.722,0.741,0.812,0.756,0.77,0.821,0.59,0.586,0.766,0.577,0.582,0.712,0.583,0.538,0.766,0.766,0.757,0.76,0.764,0.755,0.768,0.769,0.77,0.773,0.819,0.603,2315,Activity,KCNE1_HUMAN,Medium,Human
+KCNH2_HUMAN_Kozek_2020,0.738,0.73,0.671,0.67,0.636,0.635,0.755,0.588,0.667,0.678,0.644,0.607,0.609,0.619,0.593,0.608,0.623,0.625,0.622,0.739,0.741,0.766,0.751,0.729,0.762,0.748,0.736,0.739,0.735,0.758,0.693,0.63,0.678,0.738,0.767,0.749,0.75,0.782,0.771,0.739,0.767,0.759,0.728,0.599,0.59,0.652,0.605,0.477,0.426,0.547,0.798,0.719,0.766,0.748,0.738,0.739,0.76,0.746,0.749,0.762,0.648,0.738,200,Activity,KCNH2_HUMAN,Medium,Human
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.639,0.675,0.675,0.682,0.686,0.689,0.531,0.625,0.668,0.665,0.696,0.685,0.691,0.531,0.652,0.7,0.702,0.704,0.706,0.586,0.68,0.657,0.631,0.627,0.683,0.674,0.662,0.678,0.618,0.698,0.694,0.686,0.578,0.669,0.652,0.621,0.683,0.672,0.652,0.691,0.686,0.677,0.61,0.493,0.675,0.682,0.601,0.65,0.623,0.54,0.694,0.686,0.692,0.691,0.693,0.689,0.694,0.692,0.693,0.696,0.696,0.64,6963,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.677,0.655,0.614,0.62,0.654,0.655,0.557,0.602,0.624,0.624,0.639,0.623,0.63,0.59,0.669,0.663,0.652,0.651,0.658,0.609,0.604,0.606,0.607,0.622,0.614,0.621,0.619,0.622,0.621,0.666,0.632,0.617,0.575,0.602,0.613,0.614,0.642,0.65,0.656,0.646,0.654,0.658,0.625,0.534,0.615,0.637,0.623,0.612,0.622,0.567,0.637,0.63,0.641,0.637,0.644,0.635,0.646,0.637,0.634,0.641,0.649,0.655,6917,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+KKA2_KLEPN_Melnikov_2014,0.658,0.806,0.756,0.859,0.85,0.856,0.623,0.756,0.813,0.854,0.827,0.842,0.858,0.632,0.655,0.782,0.844,0.87,0.876,0.794,0.674,0.748,0.805,0.814,0.706,0.837,0.835,0.832,0.869,0.868,0.87,0.845,0.59,0.65,0.787,0.839,0.764,0.809,0.843,0.842,0.855,0.868,0.653,0.551,0.826,0.733,0.749,0.829,0.806,0.64,0.838,0.828,0.834,0.83,0.83,0.835,0.825,0.837,0.842,0.843,0.868,0.685,4960,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+LGK_LIPST_Klesmith_2015,0.64,0.706,0.73,0.734,0.73,0.737,0.56,0.717,0.728,0.772,0.715,0.756,0.771,0.588,0.654,0.685,0.75,0.782,0.779,0.742,0.639,0.693,0.722,0.736,0.677,0.734,0.751,0.729,0.765,0.755,0.756,0.718,0.565,0.649,0.69,0.767,0.68,0.696,0.755,0.73,0.736,0.759,0.584,0.53,0.732,0.676,0.689,0.743,0.735,0.57,0.747,0.742,0.747,0.746,0.747,0.752,0.751,0.749,0.751,0.755,0.699,0.662,7890,Activity,LGK_LIPST,Medium,Eukaryote
+LYAM1_HUMAN_Elazar_2016,0.669,0.667,0.61,0.594,0.649,0.653,0.675,0.58,0.698,0.701,0.698,0.673,0.67,0.643,0.672,0.66,0.637,0.691,0.732,0.653,0.681,0.687,0.682,0.675,0.674,0.679,0.669,0.642,0.66,0.688,0.641,0.583,0.598,0.672,0.705,0.664,0.684,0.705,0.679,0.677,0.691,0.667,0.642,0.658,0.644,0.665,0.586,0.631,0.59,0.556,0.608,0.653,0.696,0.664,0.635,0.646,0.675,0.672,0.63,0.662,0.709,0.69,359,Expression,LYAM1_HUMAN,Medium,Human
+MAFG_MOUSE_Tsuboyama_2023_1K1V,0.854,0.874,0.866,0.866,0.861,0.861,0.74,0.904,0.875,0.866,0.9,0.749,0.886,0.79,0.795,0.823,0.829,0.859,0.762,0.868,0.752,0.837,0.848,0.872,0.817,0.858,0.83,0.859,0.839,0.835,0.863,0.855,0.776,0.686,0.851,0.855,0.844,0.885,0.905,0.869,0.894,0.911,0.718,0.525,0.781,0.676,0.859,0.728,0.89,0.905,0.917,0.903,0.907,0.916,0.91,0.915,0.912,0.916,0.911,0.915,0.922,0.934,1429,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV,0.808,0.904,0.885,0.895,0.917,0.921,0.466,0.816,0.87,0.908,0.907,0.757,0.805,0.428,0.539,0.915,0.891,0.882,0.91,0.902,0.494,0.556,0.847,0.864,0.626,0.836,0.824,0.842,0.911,0.923,0.906,0.878,0.84,0.333,0.554,0.544,0.823,0.872,0.86,0.899,0.919,0.913,0.483,0.395,0.858,0.749,0.75,0.848,0.903,0.792,0.925,0.91,0.924,0.91,0.911,0.908,0.912,0.919,0.923,0.922,0.927,0.884,2116,Stability,MBD11_ARATH,Medium,Eukaryote
+MET_HUMAN_Estevam_2023,0.789,0.845,0.839,0.856,0.868,0.869,0.827,0.825,0.846,0.855,0.869,0.847,0.853,0.789,0.826,0.843,0.871,0.876,0.875,0.87,0.832,0.819,0.797,0.811,0.834,0.829,0.825,0.808,0.786,0.849,0.86,0.833,0.677,0.798,0.814,0.828,0.826,0.84,0.855,0.855,0.859,0.863,0.842,0.723,0.859,0.863,0.728,0.768,0.815,0.624,0.838,0.845,0.842,0.844,0.852,0.847,0.85,0.852,0.848,0.854,0.864,0.837,5393,Activity,MET_HUMAN,Medium,Human
+MK01_HUMAN_Brenan_2016,0.614,0.622,0.638,0.641,0.634,0.636,0.627,0.605,0.625,0.618,0.542,0.607,0.617,0.605,0.621,0.622,0.615,0.621,0.597,0.614,0.631,0.582,0.562,0.53,0.613,0.569,0.552,0.561,0.486,0.63,0.613,0.633,0.568,0.61,0.554,0.524,0.623,0.586,0.572,0.637,0.627,0.626,0.621,0.579,0.594,0.614,0.541,0.514,0.582,0.502,0.602,0.601,0.594,0.604,0.608,0.603,0.603,0.606,0.61,0.608,0.602,0.616,6809,OrganismalFitness,MK01_HUMAN,Medium,Human
+MLAC_ECOLI_MacRae_2023,0.607,0.686,0.723,0.728,0.72,0.721,0.489,0.684,0.716,0.72,0.708,0.72,0.728,0.489,0.614,0.631,0.694,0.691,0.706,0.708,0.538,0.692,0.697,0.71,0.659,0.706,0.708,0.705,0.725,0.716,0.737,0.726,0.513,0.566,0.662,0.651,0.634,0.667,0.66,0.698,0.708,0.707,0.584,0.479,0.662,0.618,0.514,0.605,0.571,0.491,0.666,0.646,0.654,0.655,0.665,0.664,0.654,0.66,0.674,0.664,0.61,0.599,4007,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+MSH2_HUMAN_Jia_2020,0.827,0.861,0.833,0.841,0.852,0.854,0.696,0.794,0.861,0.87,0.809,0.839,0.857,0.703,0.824,0.858,0.814,0.775,0.678,0.843,0.762,0.792,0.775,0.762,0.788,0.804,0.815,0.804,0.785,0.859,0.828,0.804,0.697,0.753,0.795,0.773,0.821,0.837,0.829,0.849,0.856,0.856,0.761,0.599,0.843,0.811,0.784,0.815,0.562,0.594,0.78,0.755,0.769,0.785,0.796,0.776,0.771,0.779,0.802,0.795,0.865,0.842,16749,OrganismalFitness,MSH2_HUMAN,Medium,Human
+MTH3_HAEAE_RockahShmuel_2015,0.706,0.8,0.836,0.842,0.83,0.835,0.686,0.823,0.834,0.845,0.811,0.854,0.856,0.631,0.697,0.717,0.775,0.822,0.835,0.84,0.68,0.747,0.818,0.838,0.764,0.836,0.855,0.843,0.862,0.83,0.847,0.828,0.671,0.692,0.764,0.832,0.722,0.763,0.824,0.815,0.829,0.847,0.692,0.52,0.756,0.692,0.713,0.786,0.777,0.6,0.786,0.77,0.781,0.788,0.783,0.79,0.777,0.785,0.784,0.79,0.801,0.707,1777,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+MTHR_HUMAN_Weile_2021,0.607,0.618,0.608,0.611,0.611,0.61,0.588,0.575,0.632,0.632,0.676,0.629,0.639,0.575,0.685,0.724,0.661,0.625,0.631,0.614,0.707,0.564,0.597,0.621,0.7,0.601,0.619,0.604,0.639,0.629,0.592,0.577,0.559,0.71,0.641,0.595,0.668,0.641,0.612,0.66,0.636,0.618,0.62,0.547,0.669,0.683,0.608,0.653,0.674,0.543,0.647,0.631,0.636,0.635,0.639,0.645,0.653,0.645,0.66,0.648,0.647,0.67,12464,OrganismalFitness,MTHR_HUMAN,Low,Human
+MYO3_YEAST_Tsuboyama_2023_2BTT,0.584,0.635,0.689,0.7,0.711,0.727,0.574,0.623,0.732,0.726,0.783,0.76,0.761,0.592,0.726,0.815,0.821,0.774,0.645,0.675,0.675,0.618,0.665,0.657,0.676,0.647,0.699,0.664,0.684,0.65,0.665,0.613,0.58,0.721,0.674,0.644,0.704,0.677,0.659,0.723,0.719,0.711,0.566,0.526,0.72,0.654,0.715,0.737,0.77,0.696,0.794,0.779,0.803,0.788,0.793,0.794,0.785,0.791,0.786,0.794,0.805,0.777,3297,Stability,MYO3_YEAST,High,Eukaryote
+NCAP_I34A1_Doud_2015,0.687,0.67,0.673,0.672,0.685,0.686,0.508,0.668,0.686,0.677,0.514,0.517,0.518,0.519,0.52,0.516,0.521,0.522,0.553,0.646,0.677,0.687,0.702,0.704,0.517,0.529,0.557,0.52,0.677,0.7,0.641,0.645,0.565,0.68,0.686,0.71,0.699,0.704,0.717,0.716,0.715,0.724,0.517,0.515,0.52,0.519,0.631,0.635,0.639,0.559,0.565,0.585,0.588,0.594,0.588,0.587,0.58,0.583,0.566,0.587,0.568,0.543,9462,OrganismalFitness,NCAP_I34A1,Medium,Virus
+NKX31_HUMAN_Tsuboyama_2023_2L9R,0.689,0.8,0.859,0.858,0.87,0.86,0.816,0.861,0.867,0.873,0.864,0.874,0.87,0.857,0.824,0.868,0.858,0.833,0.856,0.846,0.858,0.842,0.844,0.842,0.841,0.835,0.852,0.84,0.861,0.876,0.828,0.814,0.847,0.817,0.833,0.853,0.841,0.851,0.866,0.867,0.869,0.875,0.77,0.717,0.705,0.751,0.823,0.721,0.872,0.868,0.879,0.882,0.885,0.888,0.889,0.879,0.885,0.883,0.878,0.886,0.866,0.873,2482,Stability,NKX31_HUMAN,High,Human
+NPC1_HUMAN_Erwood_2022_HEK293T,0.863,0.876,0.844,0.848,0.88,0.878,0.605,0.761,0.898,0.891,0.871,0.671,0.742,0.592,0.716,0.818,0.873,0.876,0.874,0.554,0.617,0.73,0.707,0.707,0.727,0.758,0.835,0.739,0.759,0.889,0.861,0.803,0.544,0.603,0.787,0.799,0.836,0.859,0.867,0.871,0.879,0.885,0.652,0.551,0.874,0.799,0.752,0.871,0.515,0.601,0.875,0.862,0.867,0.867,0.867,0.858,0.87,0.882,0.872,0.877,0.864,0.712,637,Activity,NPC1_HUMAN,Low,Human
+NPC1_HUMAN_Erwood_2022_RPE1,0.87,0.773,0.682,0.685,0.8,0.81,0.663,0.721,0.763,0.806,0.833,0.538,0.582,0.581,0.61,0.763,0.789,0.775,0.811,0.675,0.795,0.687,0.715,0.648,0.69,0.689,0.689,0.635,0.71,0.795,0.883,0.795,0.78,0.688,0.584,0.742,0.867,0.77,0.847,0.838,0.779,0.811,0.673,0.681,0.912,0.609,0.731,0.817,0.44,0.602,0.77,0.755,0.771,0.707,0.767,0.756,0.758,0.807,0.799,0.769,0.855,0.72,63,Activity,NPC1_HUMAN,Low,Human
+NRAM_I33A0_Jiang_2016,0.802,0.775,0.727,0.717,0.799,0.795,0.53,0.718,0.809,0.814,0.474,0.598,0.745,0.45,0.444,0.505,0.615,0.752,0.795,0.686,0.769,0.802,0.781,0.777,0.549,0.774,0.809,0.747,0.81,0.833,0.729,0.732,0.456,0.747,0.767,0.772,0.803,0.822,0.823,0.82,0.828,0.823,0.428,0.432,0.425,0.423,0.725,0.706,0.731,0.592,0.636,0.648,0.653,0.677,0.663,0.679,0.666,0.66,0.634,0.662,0.644,0.567,298,OrganismalFitness,NRAM_I33A0,Low,Virus
+NUD15_HUMAN_Suiter_2020,0.644,0.75,0.797,0.813,0.809,0.811,0.504,0.711,0.832,0.866,0.838,0.839,0.863,0.677,0.735,0.753,0.801,0.833,0.84,0.777,0.671,0.75,0.808,0.798,0.736,0.828,0.826,0.81,0.788,0.813,0.841,0.811,0.594,0.698,0.739,0.816,0.728,0.744,0.83,0.809,0.809,0.838,0.724,0.511,0.831,0.744,0.753,0.805,0.779,0.667,0.806,0.785,0.798,0.813,0.804,0.814,0.817,0.813,0.813,0.817,0.882,0.815,2844,Expression,NUD15_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL,0.692,0.804,0.802,0.796,0.814,0.821,0.636,0.73,0.85,0.821,0.793,0.654,0.667,0.656,0.698,0.701,0.744,0.767,0.758,0.798,0.705,0.796,0.81,0.812,0.657,0.762,0.729,0.788,0.807,0.839,0.865,0.894,0.587,0.682,0.703,0.707,0.758,0.746,0.756,0.832,0.81,0.825,0.679,0.609,0.683,0.67,0.851,0.828,0.881,0.844,0.888,0.885,0.884,0.897,0.898,0.907,0.89,0.889,0.896,0.897,0.852,0.78,2028,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6,0.752,0.76,0.767,0.774,0.783,0.774,0.653,0.758,0.782,0.776,0.749,0.722,0.746,0.693,0.796,0.763,0.784,0.789,0.788,0.761,0.682,0.705,0.7,0.727,0.73,0.722,0.737,0.699,0.733,0.803,0.741,0.739,0.66,0.657,0.669,0.745,0.743,0.744,0.771,0.763,0.765,0.775,0.681,0.559,0.674,0.743,0.815,0.789,0.886,0.862,0.798,0.788,0.777,0.798,0.794,0.792,0.796,0.8,0.798,0.797,0.762,0.808,1380,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,0.758,0.86,0.916,0.928,0.917,0.923,0.644,0.809,0.917,0.915,0.921,0.869,0.887,0.649,0.9,0.923,0.928,0.924,0.919,0.916,0.909,0.899,0.898,0.887,0.854,0.902,0.893,0.882,0.877,0.909,0.927,0.922,0.758,0.618,0.67,0.762,0.874,0.86,0.869,0.927,0.917,0.92,0.735,0.484,0.73,0.787,0.837,0.787,0.916,0.918,0.943,0.937,0.944,0.94,0.947,0.944,0.941,0.94,0.947,0.945,0.932,0.89,3197,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G,0.372,0.393,0.643,0.562,0.637,0.636,0.404,0.518,0.558,0.564,0.5,0.432,0.426,0.448,0.415,0.532,0.569,0.514,0.571,0.589,0.559,0.555,0.546,0.56,0.554,0.561,0.552,0.564,0.581,0.642,0.504,0.493,0.614,0.453,0.49,0.479,0.566,0.56,0.56,0.61,0.6,0.618,0.599,0.452,0.632,0.625,0.694,0.678,0.681,0.626,0.525,0.505,0.508,0.529,0.501,0.528,0.52,0.523,0.512,0.518,0.608,0.551,1134,Stability,ODP2_GEOSE,High,Prokaryote
+OPSD_HUMAN_Wan_2019,0.611,0.75,0.755,0.796,0.758,0.76,0.701,0.808,0.787,0.806,0.692,0.788,0.808,0.659,0.777,0.767,0.784,0.751,0.769,0.784,0.782,0.778,0.772,0.796,0.804,0.812,0.822,0.815,0.801,0.749,0.74,0.694,0.604,0.761,0.785,0.768,0.78,0.783,0.773,0.763,0.765,0.767,0.702,0.557,0.762,0.752,0.874,0.789,0.882,0.611,0.703,0.67,0.699,0.696,0.745,0.754,0.682,0.701,0.729,0.729,0.814,0.785,165,Expression,OPSD_HUMAN,High,Human
+OTC_HUMAN_Lo_2023,0.757,0.786,0.748,0.769,0.778,0.78,0.547,0.71,0.791,0.793,0.773,0.779,0.785,0.552,0.69,0.739,0.753,0.749,0.763,0.782,0.713,0.731,0.768,0.782,0.735,0.783,0.769,0.777,0.798,0.806,0.745,0.716,0.533,0.703,0.761,0.798,0.753,0.787,0.815,0.786,0.798,0.807,0.619,0.524,0.752,0.72,0.83,0.797,0.837,0.687,0.755,0.766,0.764,0.781,0.766,0.775,0.77,0.766,0.763,0.773,0.799,0.734,1570,Activity,OTC_HUMAN,Medium,Human
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,0.52,0.593,0.582,0.59,0.602,0.604,0.599,0.574,0.56,0.564,0.747,0.862,0.858,0.593,0.835,0.847,0.726,0.741,0.728,0.579,0.578,0.665,0.691,0.713,0.731,0.731,0.759,0.732,0.615,0.671,0.815,0.793,0.562,0.566,0.639,0.734,0.596,0.602,0.688,0.612,0.603,0.636,0.688,0.577,0.836,0.812,0.829,0.842,0.865,0.787,0.769,0.717,0.731,0.762,0.773,0.782,0.735,0.764,0.782,0.766,0.854,0.855,635,Stability,OTU7A_HUMAN,High,Human
+OXDA_RHOTO_Vanella_2023_activity,0.57,0.623,0.649,0.65,0.638,0.646,0.591,0.62,0.613,0.618,0.667,0.653,0.657,0.562,0.609,0.639,0.659,0.665,0.67,0.653,0.569,0.604,0.643,0.647,0.622,0.657,0.665,0.65,0.692,0.643,0.687,0.678,0.519,0.59,0.614,0.639,0.624,0.629,0.639,0.642,0.645,0.648,0.595,0.529,0.667,0.632,0.63,0.667,0.646,0.537,0.655,0.659,0.665,0.654,0.658,0.669,0.658,0.661,0.66,0.665,0.665,0.601,6396,Activity,OXDA_RHOTO,High,Eukaryote
+OXDA_RHOTO_Vanella_2023_expression,0.635,0.669,0.681,0.681,0.676,0.677,0.652,0.637,0.689,0.694,0.705,0.7,0.699,0.661,0.679,0.694,0.699,0.719,0.709,0.646,0.638,0.668,0.67,0.662,0.67,0.674,0.694,0.683,0.697,0.693,0.701,0.678,0.552,0.67,0.654,0.67,0.684,0.678,0.682,0.688,0.684,0.684,0.676,0.624,0.698,0.676,0.715,0.708,0.675,0.55,0.701,0.69,0.688,0.694,0.696,0.688,0.699,0.696,0.695,0.7,0.736,0.694,6769,Expression,OXDA_RHOTO,High,Eukaryote
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.831,0.831,0.765,0.777,0.826,0.829,0.45,0.764,0.763,0.772,0.826,0.827,0.855,0.417,0.436,0.764,0.835,0.849,0.869,0.627,0.722,0.788,0.804,0.796,0.778,0.818,0.822,0.818,0.783,0.83,0.853,0.836,0.611,0.701,0.8,0.784,0.819,0.839,0.821,0.833,0.847,0.829,0.419,0.412,0.84,0.5,0.771,0.833,0.784,0.637,0.834,0.836,0.838,0.834,0.839,0.847,0.839,0.843,0.847,0.848,0.859,0.66,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.67,0.694,0.649,0.654,0.688,0.689,0.455,0.623,0.645,0.647,0.726,0.713,0.738,0.418,0.426,0.629,0.685,0.697,0.711,0.56,0.636,0.712,0.718,0.707,0.68,0.731,0.738,0.749,0.66,0.706,0.713,0.678,0.598,0.606,0.713,0.677,0.669,0.705,0.684,0.682,0.708,0.688,0.425,0.422,0.719,0.464,0.68,0.723,0.689,0.586,0.692,0.694,0.696,0.699,0.699,0.704,0.694,0.698,0.702,0.703,0.717,0.577,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.855,0.865,0.782,0.795,0.851,0.856,0.453,0.766,0.774,0.778,0.883,0.863,0.891,0.415,0.44,0.828,0.869,0.886,0.889,0.626,0.78,0.862,0.86,0.847,0.835,0.87,0.867,0.889,0.761,0.881,0.887,0.867,0.634,0.752,0.881,0.821,0.855,0.894,0.85,0.865,0.896,0.853,0.423,0.405,0.894,0.509,0.832,0.888,0.833,0.67,0.873,0.881,0.879,0.88,0.884,0.888,0.881,0.886,0.886,0.891,0.893,0.688,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Kotler_2018,0.851,0.828,0.768,0.805,0.806,0.819,0.544,0.772,0.828,0.836,0.838,0.81,0.854,0.527,0.544,0.85,0.889,0.9,0.907,0.747,0.731,0.766,0.786,0.782,0.761,0.789,0.782,0.789,0.791,0.831,0.86,0.837,0.532,0.724,0.776,0.746,0.838,0.832,0.82,0.837,0.833,0.829,0.527,0.502,0.868,0.608,0.765,0.835,0.775,0.643,0.869,0.879,0.875,0.872,0.874,0.878,0.888,0.875,0.884,0.887,0.906,0.706,1048,OrganismalFitness,P53_HUMAN,Low,Human
+P84126_THETH_Chan_2017,0.79,0.827,0.844,0.852,0.818,0.827,0.661,0.785,0.86,0.847,0.818,0.808,0.832,0.634,0.809,0.817,0.831,0.838,0.811,0.846,0.737,0.785,0.781,0.813,0.792,0.828,0.837,0.819,0.87,0.801,0.817,0.773,0.699,0.769,0.775,0.798,0.795,0.798,0.811,0.821,0.822,0.831,0.754,0.488,0.797,0.774,0.661,0.727,0.768,0.56,0.795,0.782,0.748,0.796,0.799,0.777,0.787,0.799,0.802,0.802,0.835,0.828,1519,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+PA_I34A1_Wu_2015,0.772,0.773,0.761,0.766,0.782,0.783,0.535,0.694,0.589,0.589,0.526,0.537,0.569,0.518,0.52,0.521,0.525,0.525,0.681,0.667,0.736,0.755,0.777,0.778,0.607,0.71,0.733,0.725,0.728,0.805,0.689,0.693,0.566,0.722,0.747,0.777,0.782,0.793,0.796,0.804,0.808,0.803,0.522,0.515,0.521,0.519,0.627,0.616,0.599,0.54,0.581,0.584,0.583,0.594,0.584,0.591,0.583,0.588,0.568,0.59,0.612,0.577,1820,OrganismalFitness,PA_I34A1,Medium,Virus
+PABP_YEAST_Melamed_2013,0.855,0.827,0.787,0.791,0.846,0.843,0.752,0.805,0.838,0.852,0.866,0.851,0.861,0.758,0.805,0.856,0.898,0.875,0.882,0.787,0.847,0.858,0.863,0.874,0.846,0.879,0.879,0.866,0.86,0.859,0.884,0.846,0.649,0.849,0.85,0.847,0.874,0.874,0.872,0.866,0.868,0.866,0.825,0.469,0.851,0.852,0.691,0.817,0.779,0.604,0.87,0.865,0.872,0.888,0.88,0.883,0.884,0.884,0.901,0.892,0.885,0.844,37708,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+PAI1_HUMAN_Huttinger_2021,0.693,0.693,0.686,0.693,0.695,0.702,0.526,0.656,0.705,0.711,0.708,0.693,0.704,0.545,0.688,0.705,0.715,0.662,0.636,0.704,0.57,0.686,0.677,0.668,0.67,0.69,0.693,0.681,0.665,0.705,0.692,0.676,0.561,0.583,0.678,0.681,0.683,0.693,0.697,0.702,0.706,0.706,0.669,0.533,0.705,0.695,0.69,0.723,0.703,0.576,0.712,0.712,0.712,0.718,0.714,0.716,0.715,0.717,0.713,0.72,0.728,0.692,5345,Activity,PAI1_HUMAN,,Human
+PHOT_CHLRE_Chen_2023,0.592,0.708,0.858,0.846,0.66,0.652,0.821,0.77,0.85,0.855,0.819,0.873,0.888,0.878,0.911,0.852,0.87,0.851,0.876,0.819,0.836,0.835,0.79,0.841,0.754,0.8,0.788,0.822,0.791,0.783,0.781,0.74,0.647,0.774,0.744,0.795,0.786,0.769,0.793,0.696,0.71,0.7,0.808,0.687,0.796,0.805,0.599,0.747,0.83,0.7,0.797,0.765,0.77,0.769,0.772,0.779,0.778,0.781,0.786,0.779,0.865,0.884,167529,Activity,PHOT_CHLRE,High,Eukaryote
+PIN1_HUMAN_Tsuboyama_2023_1I6C,0.608,0.662,0.858,0.837,0.829,0.85,0.8,0.753,0.851,0.872,0.85,0.799,0.858,0.837,0.832,0.846,0.848,0.761,0.79,0.837,0.722,0.859,0.834,0.825,0.789,0.857,0.869,0.852,0.859,0.873,0.879,0.861,0.799,0.766,0.834,0.859,0.806,0.85,0.884,0.856,0.872,0.888,0.8,0.381,0.836,0.777,0.786,0.808,0.871,0.862,0.846,0.824,0.752,0.782,0.789,0.808,0.812,0.835,0.821,0.817,0.889,0.895,802,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M,0.795,0.773,0.787,0.791,0.787,0.79,0.76,0.754,0.781,0.793,0.745,0.732,0.726,0.786,0.836,0.847,0.827,0.807,0.765,0.79,0.771,0.743,0.752,0.738,0.793,0.766,0.744,0.73,0.752,0.78,0.712,0.703,0.729,0.797,0.771,0.761,0.813,0.788,0.784,0.801,0.782,0.784,0.748,0.718,0.646,0.669,0.715,0.653,0.85,0.82,0.794,0.798,0.795,0.805,0.812,0.816,0.806,0.808,0.81,0.809,0.75,0.812,1824,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF,0.623,0.605,0.662,0.671,0.69,0.694,0.612,0.658,0.639,0.661,0.68,0.706,0.737,0.716,0.733,0.814,0.721,0.678,0.676,0.718,0.702,0.705,0.68,0.712,0.71,0.708,0.711,0.704,0.712,0.717,0.752,0.75,0.555,0.721,0.665,0.708,0.69,0.686,0.714,0.713,0.697,0.712,0.684,0.674,0.705,0.692,0.835,0.777,0.83,0.79,0.685,0.703,0.724,0.689,0.707,0.706,0.692,0.707,0.713,0.709,0.728,0.825,1301,Stability,PKN1_HUMAN,High,Human
+POLG_CXB3N_Mattenberger_2021,0.709,0.687,0.681,0.699,0.726,0.731,0.478,0.672,0.749,0.749,0.651,0.468,0.516,0.457,0.469,0.595,0.702,0.704,0.717,0.677,0.664,0.694,0.689,0.688,0.561,0.691,0.688,0.683,0.694,0.743,0.694,0.66,0.501,0.524,0.631,0.677,0.667,0.688,0.706,0.688,0.709,0.727,0.465,0.465,0.677,0.472,0.596,0.673,0.559,0.523,0.673,0.683,0.685,0.687,0.684,0.686,0.685,0.684,0.69,0.689,0.558,0.536,15711,OrganismalFitness,POLG_CXB3N,Medium,Virus
+POLG_DEN26_Suphatrakul_2023,0.76,0.794,0.633,0.633,0.772,0.772,0.485,0.722,0.848,0.85,0.669,0.507,0.512,0.481,0.513,0.547,0.587,0.645,0.69,0.715,0.732,0.741,0.733,0.732,0.725,0.754,0.756,0.747,0.745,0.816,0.809,0.771,0.522,0.484,0.577,0.743,0.729,0.709,0.781,0.745,0.714,0.79,0.478,0.473,0.709,0.504,0.69,0.771,0.572,0.561,0.632,0.624,0.631,0.643,0.626,0.643,0.633,0.637,0.627,0.64,0.592,0.524,16897,OrganismalFitness,POLG_DEN26,Low,Virus
+POLG_HCVJF_Qi_2014,0.824,0.784,0.712,0.716,0.823,0.828,0.47,0.598,0.792,0.798,0.588,0.842,0.839,0.552,0.565,0.556,0.556,0.543,0.535,0.631,0.714,0.744,0.741,0.761,0.721,0.756,0.669,0.72,0.78,0.84,0.821,0.772,0.599,0.747,0.769,0.78,0.768,0.79,0.809,0.748,0.78,0.798,0.555,0.52,0.763,0.554,0.483,0.672,0.869,0.703,0.673,0.714,0.7,0.715,0.698,0.657,0.683,0.685,0.679,0.712,0.596,0.584,1630,OrganismalFitness,POLG_HCVJF,Medium,Virus
+POLG_PESV_Tsuboyama_2023_2MXD,0.638,0.766,0.695,0.72,0.74,0.74,0.525,0.722,0.679,0.735,0.737,0.523,0.558,0.525,0.556,0.515,0.567,0.553,0.53,0.818,0.558,0.53,0.507,0.574,0.542,0.47,0.486,0.463,0.578,0.761,0.822,0.826,0.575,0.498,0.503,0.5,0.702,0.706,0.702,0.734,0.741,0.733,0.479,0.448,0.491,0.469,0.778,0.748,0.882,0.841,0.859,0.863,0.848,0.892,0.894,0.894,0.867,0.896,0.894,0.888,0.907,0.83,5130,Stability,POLG_PESV,Medium,Virus
+PPARG_HUMAN_Majithia_2016,0.758,0.814,0.837,0.842,0.838,0.845,0.625,0.731,0.81,0.821,0.812,0.841,0.843,0.533,0.612,0.808,0.865,0.893,0.869,0.858,0.831,0.827,0.697,0.719,0.81,0.837,0.822,0.838,0.734,0.871,0.852,0.843,0.732,0.855,0.816,0.788,0.862,0.845,0.833,0.871,0.861,0.859,0.679,0.524,0.791,0.779,0.807,0.788,0.834,0.668,0.86,0.862,0.867,0.865,0.865,0.866,0.864,0.865,0.865,0.871,0.861,0.811,9576,Activity,PPARG_HUMAN,Medium,Human
+PPM1D_HUMAN_Miller_2022,0.783,0.779,0.779,0.782,0.816,0.817,0.513,0.694,0.741,0.77,0.797,0.805,0.816,0.639,0.692,0.734,0.805,0.82,0.817,0.697,0.726,0.767,0.776,0.719,0.763,0.795,0.79,0.787,0.714,0.813,0.804,0.777,0.614,0.718,0.769,0.775,0.801,0.805,0.805,0.819,0.817,0.818,0.677,0.473,0.794,0.763,0.756,0.798,0.783,0.627,0.791,0.793,0.789,0.796,0.801,0.794,0.797,0.797,0.799,0.802,0.824,0.757,7889,OrganismalFitness,PPM1D_HUMAN,Low,Human
+PR40A_HUMAN_Tsuboyama_2023_1UZC,0.845,0.882,0.926,0.921,0.939,0.94,0.771,0.771,0.936,0.939,0.913,0.85,0.881,0.783,0.779,0.951,0.945,0.921,0.912,0.904,0.829,0.885,0.886,0.898,0.853,0.888,0.907,0.898,0.912,0.946,0.944,0.942,0.856,0.828,0.8,0.824,0.917,0.92,0.909,0.932,0.943,0.933,0.718,0.611,0.635,0.719,0.798,0.664,0.902,0.913,0.945,0.947,0.947,0.949,0.954,0.954,0.949,0.953,0.953,0.954,0.94,0.937,2033,Stability,PR40A_HUMAN,Medium,Human
+PRKN_HUMAN_Clausen_2023,0.839,0.84,0.835,0.827,0.848,0.844,0.593,0.757,0.797,0.807,0.816,0.839,0.854,0.615,0.652,0.689,0.769,0.861,0.865,0.753,0.652,0.803,0.828,0.819,0.74,0.839,0.838,0.835,0.799,0.846,0.832,0.809,0.652,0.628,0.801,0.817,0.816,0.842,0.848,0.842,0.855,0.86,0.659,0.54,0.821,0.69,0.845,0.831,0.873,0.656,0.788,0.791,0.793,0.81,0.8,0.805,0.807,0.806,0.8,0.808,0.866,0.768,8756,Expression,PRKN_HUMAN,Low,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE,0.859,0.864,0.856,0.855,0.86,0.857,0.712,0.785,0.824,0.824,0.904,0.871,0.88,0.811,0.842,0.895,0.909,0.888,0.853,0.806,0.706,0.656,0.794,0.739,0.793,0.809,0.388,0.802,0.834,0.846,0.853,0.825,0.554,0.717,0.755,0.78,0.845,0.841,0.82,0.866,0.866,0.843,0.748,0.65,0.864,0.784,0.828,0.854,0.907,0.873,0.902,0.896,0.898,0.903,0.901,0.907,0.895,0.901,0.901,0.905,0.909,0.906,1579,Stability,PSAE_PICP2,Medium,Prokaryote
+PTEN_HUMAN_Matreyek_2021,0.681,0.695,0.681,0.691,0.695,0.7,0.579,0.683,0.714,0.736,0.721,0.704,0.73,0.587,0.635,0.727,0.732,0.642,0.654,0.739,0.595,0.73,0.712,0.702,0.633,0.679,0.67,0.696,0.657,0.737,0.698,0.695,0.567,0.636,0.717,0.68,0.688,0.73,0.711,0.706,0.731,0.725,0.622,0.526,0.719,0.669,0.736,0.734,0.736,0.611,0.719,0.714,0.729,0.718,0.717,0.723,0.725,0.726,0.721,0.727,0.748,0.725,5083,Expression,PTEN_HUMAN,Medium,Human
+PTEN_HUMAN_Mighell_2018,0.815,0.826,0.827,0.829,0.844,0.848,0.619,0.734,0.828,0.833,0.805,0.816,0.833,0.639,0.772,0.85,0.832,0.695,0.683,0.831,0.73,0.774,0.72,0.693,0.77,0.696,0.689,0.714,0.66,0.842,0.819,0.813,0.541,0.75,0.767,0.702,0.829,0.813,0.781,0.844,0.844,0.846,0.747,0.481,0.832,0.815,0.771,0.762,0.805,0.625,0.816,0.791,0.811,0.818,0.826,0.825,0.816,0.816,0.826,0.828,0.845,0.828,7260,Activity,PTEN_HUMAN,Medium,Human
+Q2N0S5_9HIV1_Haddox_2018,0.758,0.704,0.69,0.709,0.76,0.763,0.51,0.723,0.77,0.773,0.745,0.767,0.781,0.509,0.509,0.511,0.532,0.559,0.586,0.711,0.77,0.707,0.698,0.671,0.769,0.702,0.702,0.722,0.683,0.764,0.751,0.753,0.644,0.758,0.716,0.71,0.772,0.762,0.762,0.775,0.769,0.768,0.72,0.507,0.765,0.738,0.695,0.743,0.634,0.575,0.615,0.636,0.658,0.656,0.631,0.634,0.635,0.628,0.615,0.645,0.61,0.56,12729,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.671,0.658,0.665,0.666,0.669,0.667,0.545,0.608,0.639,0.645,0.605,0.636,0.646,0.605,0.596,0.671,0.657,0.63,0.663,0.651,0.615,0.619,0.615,0.593,0.637,0.636,0.624,0.642,0.633,0.638,0.679,0.666,0.551,0.61,0.623,0.576,0.647,0.656,0.634,0.667,0.671,0.662,0.699,0.609,0.642,0.653,0.643,0.664,0.655,0.59,0.639,0.645,0.659,0.656,0.652,0.652,0.648,0.646,0.647,0.654,0.647,0.717,3344,Binding,Q53Z42_HUMAN,Medium,Human
+Q53Z42_HUMAN_McShan_2019_expression,0.771,0.755,0.772,0.779,0.784,0.784,0.471,0.721,0.784,0.791,0.719,0.747,0.766,0.533,0.545,0.745,0.785,0.785,0.792,0.787,0.718,0.752,0.752,0.746,0.747,0.765,0.754,0.767,0.784,0.785,0.784,0.755,0.582,0.721,0.737,0.728,0.773,0.784,0.787,0.788,0.794,0.795,0.657,0.539,0.777,0.746,0.734,0.761,0.739,0.604,0.777,0.781,0.786,0.783,0.792,0.786,0.784,0.787,0.792,0.793,0.771,0.812,3344,Expression,Q53Z42_HUMAN,Medium,Human
+Q59976_STRSQ_Romero_2015,0.781,0.831,0.853,0.857,0.857,0.864,0.689,0.776,0.865,0.869,0.827,0.786,0.796,0.574,0.741,0.775,0.813,0.812,0.817,0.85,0.826,0.85,0.849,0.857,0.838,0.859,0.866,0.865,0.869,0.872,0.854,0.827,0.674,0.827,0.85,0.842,0.838,0.856,0.857,0.864,0.869,0.869,0.763,0.492,0.824,0.784,0.711,0.783,0.781,0.569,0.813,0.792,0.792,0.803,0.808,0.798,0.81,0.797,0.802,0.809,0.843,0.79,2999,Activity,Q59976_STRSQ,Medium,Prokaryote
+Q6WV13_9MAXI_Somermeyer_2022,0.634,0.658,0.605,0.606,0.64,0.639,0.496,0.562,0.661,0.659,0.599,0.52,0.512,0.506,0.519,0.501,0.505,0.505,0.493,0.626,0.518,0.519,0.502,0.552,0.513,0.5,0.512,0.496,0.493,0.682,0.636,0.639,0.514,0.513,0.518,0.512,0.599,0.599,0.599,0.637,0.637,0.635,0.505,0.501,0.496,0.51,0.611,0.615,0.664,0.601,0.61,0.624,0.614,0.636,0.628,0.63,0.621,0.622,0.621,0.624,0.554,0.536,31401,Activity,Q6WV12_9MAXI,Low,Eukaryote
+Q837P4_ENTFA_Meier_2023,0.705,0.712,0.718,0.719,0.725,0.736,0.709,0.675,0.711,0.719,0.765,0.741,0.755,0.682,0.716,0.718,0.724,0.744,0.729,0.476,0.733,0.712,0.721,0.734,0.742,0.749,0.76,0.706,0.754,0.738,0.738,0.718,0.561,0.721,0.747,0.729,0.719,0.754,0.749,0.744,0.752,0.752,0.704,0.677,0.755,0.72,0.63,0.725,0.685,0.551,0.726,0.714,0.718,0.736,0.74,0.748,0.741,0.711,0.736,0.737,0.772,0.72,697,Activity,Q837P4_ENTFA,Medium,Prokaryote
+Q837P5_ENTFA_Meier_2023,0.565,0.677,0.682,0.698,0.664,0.668,0.573,0.601,0.613,0.618,0.638,0.646,0.638,0.526,0.578,0.629,0.653,0.691,0.671,0.636,0.657,0.676,0.701,0.698,0.639,0.684,0.69,0.735,0.714,0.644,0.665,0.658,0.559,0.68,0.703,0.741,0.643,0.674,0.714,0.679,0.686,0.699,0.611,0.557,0.653,0.6,0.665,0.642,0.676,0.563,0.636,0.641,0.648,0.629,0.657,0.65,0.63,0.647,0.637,0.648,0.64,0.629,747,Activity,Q837P5_ENTFA,Medium,Prokaryote
+Q8WTC7_9CNID_Somermeyer_2022,0.63,0.671,0.603,0.602,0.652,0.649,0.495,0.648,0.638,0.641,0.592,0.471,0.463,0.466,0.454,0.459,0.465,0.461,0.484,0.601,0.476,0.492,0.516,0.489,0.482,0.469,0.487,0.606,0.613,0.665,0.646,0.654,0.477,0.452,0.49,0.639,0.596,0.601,0.642,0.641,0.646,0.66,0.472,0.473,0.46,0.46,0.51,0.543,0.628,0.571,0.604,0.598,0.603,0.609,0.598,0.606,0.605,0.599,0.599,0.603,0.537,0.447,33510,Activity,Q8WTC7_9CNID,Low,Eukaryote
+R1AB_SARS2_Flynn_2022,0.835,0.829,0.618,0.633,0.846,0.848,0.487,0.675,0.496,0.496,0.575,0.497,0.49,0.513,0.499,0.555,0.567,0.792,0.824,0.663,0.636,0.667,0.674,0.676,0.643,0.643,0.623,0.634,0.642,0.825,0.801,0.745,0.486,0.607,0.637,0.63,0.711,0.735,0.734,0.823,0.834,0.832,0.501,0.486,0.549,0.495,0.752,0.733,0.777,0.626,0.65,0.636,0.659,0.675,0.665,0.661,0.662,0.66,0.641,0.664,0.636,0.589,5725,OrganismalFitness,R1AB_SARS2,Medium,Virus
+RAD_ANTMA_Tsuboyama_2023_2CJJ,0.605,0.585,0.66,0.663,0.675,0.692,0.735,0.703,0.853,0.835,0.794,0.748,0.769,0.726,0.846,0.868,0.761,0.754,0.797,0.714,0.788,0.824,0.803,0.766,0.816,0.744,0.808,0.754,0.747,0.799,0.725,0.681,0.649,0.75,0.834,0.745,0.745,0.811,0.744,0.734,0.762,0.723,0.811,0.62,0.625,0.726,0.761,0.666,0.865,0.778,0.786,0.801,0.804,0.8,0.812,0.802,0.799,0.801,0.797,0.805,0.719,0.818,912,Stability,RAD_ANTMA,High,Eukaryote
+RAF1_HUMAN_Zinkus-Boltz_2019,0.708,0.719,0.706,0.712,0.725,0.725,0.525,0.684,0.752,0.763,0.76,0.732,0.769,0.523,0.619,0.74,0.756,0.744,0.734,0.678,0.654,0.715,0.708,0.696,0.699,0.712,0.708,0.696,0.708,0.727,0.764,0.73,0.569,0.653,0.688,0.702,0.693,0.703,0.711,0.717,0.734,0.737,0.607,0.529,0.779,0.707,0.638,0.725,0.639,0.665,0.729,0.726,0.73,0.752,0.734,0.747,0.732,0.729,0.74,0.745,0.727,0.59,297,OrganismalFitness,RAF1_HUMAN,Low,Human
+RASH_HUMAN_Bandaru_2017,0.798,0.811,0.827,0.842,0.839,0.844,0.689,0.772,0.823,0.837,0.757,0.787,0.811,0.807,0.849,0.835,0.85,0.817,0.735,0.859,0.819,0.805,0.805,0.791,0.809,0.791,0.791,0.767,0.71,0.815,0.789,0.779,0.648,0.787,0.803,0.76,0.821,0.83,0.808,0.839,0.844,0.846,0.838,0.662,0.72,0.822,0.711,0.641,0.807,0.644,0.798,0.794,0.799,0.796,0.806,0.793,0.813,0.796,0.805,0.812,0.747,0.827,3134,Activity,RASH_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_abundance,0.648,0.626,0.679,0.672,0.665,0.676,0.605,0.623,0.62,0.634,0.644,0.606,0.624,0.658,0.683,0.663,0.631,0.61,0.603,0.665,0.633,0.671,0.69,0.699,0.64,0.677,0.711,0.677,0.738,0.683,0.605,0.584,0.607,0.6,0.662,0.706,0.636,0.682,0.721,0.667,0.685,0.694,0.616,0.655,0.531,0.621,0.659,0.664,0.688,0.662,0.627,0.62,0.645,0.637,0.652,0.637,0.646,0.634,0.623,0.639,0.671,0.661,26012,Expression,RASK_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.734,0.755,0.822,0.827,0.815,0.816,0.645,0.701,0.83,0.833,0.806,0.777,0.805,0.796,0.824,0.834,0.842,0.79,0.733,0.861,0.798,0.781,0.798,0.754,0.778,0.791,0.773,0.718,0.694,0.824,0.779,0.753,0.66,0.765,0.809,0.762,0.791,0.82,0.791,0.818,0.826,0.817,0.794,0.648,0.732,0.814,0.628,0.64,0.814,0.648,0.779,0.781,0.783,0.782,0.792,0.792,0.779,0.782,0.756,0.789,0.812,0.814,24873,Binding,RASK_HUMAN,High,Human
+RBP1_HUMAN_Tsuboyama_2023_2KWH,0.584,0.542,0.656,0.66,0.659,0.657,0.646,0.65,0.634,0.642,0.728,0.718,0.713,0.669,0.708,0.759,0.781,0.684,0.682,0.633,0.694,0.577,0.656,0.654,0.653,0.565,0.591,0.673,0.631,0.701,0.704,0.676,0.644,0.655,0.698,0.708,0.666,0.679,0.678,0.666,0.675,0.666,0.667,0.636,0.709,0.714,0.754,0.778,0.796,0.73,0.755,0.751,0.756,0.755,0.759,0.754,0.749,0.762,0.752,0.758,0.783,0.769,1332,Stability,RBP1_HUMAN,High,Human
+RCD1_ARATH_Tsuboyama_2023_5OAO,0.694,0.683,0.725,0.728,0.728,0.723,0.66,0.662,0.736,0.74,0.757,0.669,0.695,0.685,0.692,0.784,0.792,0.787,0.783,0.739,0.628,0.616,0.668,0.71,0.659,0.759,0.749,0.727,0.767,0.744,0.787,0.752,0.519,0.638,0.657,0.66,0.717,0.721,0.715,0.742,0.743,0.735,0.655,0.644,0.727,0.666,0.744,0.757,0.801,0.773,0.802,0.809,0.816,0.814,0.806,0.812,0.809,0.808,0.811,0.814,0.799,0.762,1261,Stability,RCD1_ARATH,Medium,Eukaryote
+RCRO_LAMBD_Tsuboyama_2023_1ORC,0.665,0.773,0.805,0.8,0.788,0.806,0.533,0.64,0.764,0.767,0.737,0.649,0.697,0.591,0.654,0.66,0.786,0.786,0.803,0.801,0.506,0.524,0.549,0.541,0.414,0.522,0.439,0.482,0.798,0.799,0.782,0.771,0.518,0.525,0.495,0.778,0.713,0.708,0.795,0.789,0.8,0.823,0.452,0.545,0.698,0.548,0.78,0.806,0.866,0.807,0.79,0.781,0.771,0.782,0.782,0.784,0.773,0.781,0.792,0.785,0.815,0.71,2278,Stability,RCRO_LAMBD,High,Virus
+RD23A_HUMAN_Tsuboyama_2023_1IFY,0.619,0.649,0.714,0.714,0.711,0.711,0.576,0.735,0.766,0.771,0.75,0.792,0.805,0.603,0.836,0.78,0.768,0.761,0.723,0.735,0.705,0.725,0.722,0.743,0.769,0.765,0.764,0.749,0.765,0.747,0.725,0.718,0.729,0.646,0.749,0.737,0.703,0.747,0.75,0.716,0.742,0.748,0.747,0.507,0.788,0.772,0.724,0.783,0.787,0.745,0.744,0.735,0.741,0.733,0.748,0.75,0.746,0.748,0.749,0.749,0.741,0.796,1019,Stability,RD23A_HUMAN,High,Human
+RDRP_I33A0_Li_2023,0.667,0.699,0.727,0.732,0.779,0.781,0.524,0.728,0.791,0.794,0.612,0.547,0.554,0.535,0.54,0.59,0.698,0.724,0.785,0.768,0.734,0.768,0.778,0.789,0.588,0.723,0.716,0.712,0.751,0.796,0.766,0.729,0.568,0.73,0.761,0.781,0.75,0.768,0.779,0.79,0.799,0.808,0.535,0.522,0.612,0.536,0.626,0.646,0.622,0.54,0.681,0.664,0.674,0.674,0.683,0.679,0.675,0.686,0.686,0.687,0.595,0.569,12003,OrganismalFitness,RDRP_I33A0,Low,Virus
+REV_HV1H2_Fernandes_2016,0.596,0.574,0.603,0.606,0.603,0.603,0.521,0.646,0.603,0.607,0.566,0.616,0.624,0.526,0.522,0.585,0.613,0.638,0.629,0.575,0.6,0.623,0.617,0.608,0.635,0.637,0.576,0.617,0.623,0.63,0.667,0.672,0.525,0.608,0.627,0.613,0.614,0.625,0.61,0.616,0.623,0.611,0.529,0.533,0.599,0.537,0.626,0.612,0.639,0.604,0.591,0.596,0.633,0.622,0.624,0.625,0.633,0.61,0.649,0.632,0.633,0.608,2147,OrganismalFitness,REV_HV1H2,Medium,Virus
+RFAH_ECOLI_Tsuboyama_2023_2LCL,0.507,0.596,0.607,0.618,0.618,0.616,0.487,0.62,0.613,0.627,0.622,0.571,0.585,0.469,0.529,0.511,0.649,0.637,0.632,0.635,0.454,0.519,0.559,0.553,0.464,0.565,0.577,0.552,0.576,0.61,0.636,0.61,0.5,0.498,0.558,0.576,0.543,0.572,0.583,0.61,0.61,0.611,0.547,0.447,0.585,0.54,0.649,0.622,0.689,0.658,0.668,0.657,0.649,0.668,0.66,0.655,0.66,0.656,0.668,0.663,0.652,0.593,1326,Stability,RFAH_ECOLI,High,Prokaryote
+RL20_AQUAE_Tsuboyama_2023_1GYZ,0.688,0.851,0.867,0.866,0.866,0.87,0.593,0.855,0.718,0.681,0.886,0.885,0.875,0.591,0.706,0.753,0.908,0.899,0.907,0.879,0.587,0.798,0.786,0.78,0.774,0.789,0.805,0.818,0.861,0.865,0.843,0.832,0.531,0.774,0.803,0.79,0.812,0.836,0.832,0.852,0.875,0.875,0.484,0.434,0.769,0.622,0.856,0.858,0.936,0.906,0.917,0.917,0.915,0.929,0.923,0.919,0.924,0.924,0.925,0.927,0.93,0.872,1461,Stability,RL20_AQUAE,High,Prokaryote
+RL40A_YEAST_Mavor_2016,0.654,0.683,0.685,0.713,0.695,0.708,0.556,0.728,0.744,0.746,0.623,0.653,0.674,0.547,0.738,0.769,0.788,0.744,0.797,0.73,0.706,0.789,0.774,0.745,0.775,0.75,0.751,0.739,0.724,0.672,0.7,0.705,0.532,0.715,0.757,0.709,0.726,0.76,0.726,0.738,0.759,0.721,0.653,0.518,0.653,0.668,0.553,0.587,0.627,0.503,0.767,0.755,0.776,0.767,0.778,0.765,0.762,0.777,0.77,0.781,0.697,0.698,1253,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2013,0.66,0.695,0.708,0.742,0.714,0.728,0.53,0.684,0.767,0.769,0.614,0.654,0.692,0.54,0.73,0.773,0.814,0.753,0.806,0.748,0.709,0.799,0.777,0.755,0.783,0.759,0.769,0.754,0.739,0.696,0.745,0.756,0.536,0.714,0.775,0.716,0.731,0.778,0.734,0.75,0.777,0.737,0.68,0.522,0.659,0.66,0.542,0.567,0.628,0.511,0.802,0.787,0.806,0.793,0.804,0.797,0.787,0.813,0.803,0.814,0.698,0.71,1195,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2014,0.731,0.684,0.73,0.753,0.72,0.743,0.596,0.761,0.745,0.759,0.595,0.66,0.676,0.628,0.816,0.813,0.828,0.765,0.791,0.744,0.714,0.796,0.776,0.764,0.781,0.752,0.748,0.723,0.719,0.733,0.714,0.719,0.616,0.731,0.778,0.73,0.757,0.794,0.767,0.765,0.783,0.756,0.663,0.576,0.659,0.672,0.63,0.64,0.685,0.588,0.816,0.796,0.802,0.83,0.828,0.825,0.801,0.811,0.8,0.828,0.69,0.746,1380,Activity,RL40A_YEAST,Medium,Eukaryote
+RNC_ECOLI_Weeks_2023,0.772,0.804,0.803,0.809,0.812,0.813,0.531,0.719,0.808,0.807,0.806,0.803,0.81,0.535,0.782,0.801,0.809,0.807,0.81,0.808,0.779,0.794,0.765,0.755,0.786,0.802,0.802,0.801,0.795,0.809,0.799,0.776,0.584,0.77,0.777,0.718,0.794,0.801,0.777,0.818,0.819,0.815,0.769,0.534,0.806,0.792,0.657,0.777,0.65,0.597,0.787,0.781,0.778,0.792,0.787,0.782,0.786,0.786,0.787,0.793,0.815,0.772,4277,Activity,RNC_ECOLI,Medium,Prokaryote
+RPC1_BP434_Tsuboyama_2023_1R69,0.818,0.857,0.864,0.875,0.833,0.856,0.841,0.829,0.85,0.87,0.891,0.898,0.902,0.876,0.89,0.901,0.894,0.88,0.85,0.884,0.865,0.886,0.889,0.871,0.882,0.894,0.89,0.891,0.87,0.897,0.869,0.834,0.836,0.826,0.881,0.881,0.874,0.888,0.887,0.873,0.873,0.869,0.888,0.854,0.898,0.887,0.875,0.847,0.925,0.889,0.879,0.872,0.88,0.887,0.884,0.881,0.89,0.886,0.892,0.89,0.913,0.906,1459,Stability,RPC1_BP434,High,Virus
+RPC1_LAMBD_Li_2019_high-expression,0.707,0.796,0.837,0.849,0.824,0.826,0.67,0.816,0.833,0.857,0.866,0.855,0.879,0.721,0.718,0.794,0.898,0.902,0.904,0.815,0.656,0.744,0.79,0.82,0.674,0.777,0.764,0.746,0.894,0.811,0.934,0.902,0.586,0.625,0.764,0.86,0.706,0.769,0.857,0.81,0.832,0.866,0.731,0.735,0.804,0.743,0.671,0.774,0.761,0.626,0.898,0.873,0.872,0.897,0.887,0.898,0.883,0.879,0.873,0.897,0.878,0.789,351,Activity,RPC1_LAMBD,High,Virus
+RPC1_LAMBD_Li_2019_low-expression,0.612,0.699,0.736,0.746,0.724,0.718,0.563,0.757,0.716,0.736,0.755,0.728,0.736,0.619,0.62,0.671,0.761,0.803,0.808,0.7,0.605,0.678,0.68,0.713,0.583,0.686,0.661,0.653,0.802,0.712,0.829,0.817,0.54,0.563,0.663,0.738,0.61,0.657,0.727,0.702,0.716,0.743,0.636,0.621,0.67,0.641,0.634,0.681,0.7,0.636,0.769,0.752,0.767,0.777,0.758,0.772,0.776,0.767,0.758,0.776,0.765,0.653,351,Activity,RPC1_LAMBD,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32,0.698,0.672,0.685,0.682,0.672,0.675,0.587,0.62,0.71,0.71,0.734,0.709,0.701,0.636,0.657,0.758,0.714,0.702,0.67,0.663,0.601,0.652,0.667,0.658,0.657,0.672,0.7,0.679,0.686,0.701,0.726,0.723,0.548,0.711,0.659,0.662,0.725,0.687,0.69,0.703,0.679,0.673,0.665,0.62,0.719,0.683,0.801,0.723,0.817,0.768,0.71,0.709,0.715,0.713,0.709,0.708,0.719,0.726,0.71,0.719,0.72,0.821,1195,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance,0.775,0.791,0.802,0.806,0.813,0.82,0.709,0.779,0.831,0.831,0.822,0.835,0.857,0.766,0.791,0.802,0.837,0.826,0.812,0.52,0.767,0.823,0.837,0.826,0.786,0.839,0.848,0.83,0.81,0.819,0.81,0.741,0.682,0.785,0.828,0.825,0.806,0.843,0.845,0.834,0.844,0.843,0.771,0.651,0.814,0.802,0.733,0.789,0.788,0.594,0.813,0.806,0.807,0.817,0.827,0.821,0.819,0.822,0.817,0.827,0.837,0.812,9803,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity,0.773,0.81,0.835,0.843,0.838,0.845,0.714,0.821,0.844,0.843,0.837,0.848,0.877,0.769,0.789,0.789,0.854,0.833,0.822,0.521,0.793,0.854,0.861,0.849,0.815,0.86,0.865,0.854,0.831,0.852,0.856,0.802,0.691,0.805,0.858,0.846,0.817,0.86,0.857,0.856,0.87,0.866,0.775,0.636,0.837,0.8,0.728,0.805,0.788,0.587,0.828,0.812,0.809,0.828,0.837,0.826,0.827,0.837,0.837,0.839,0.828,0.804,10094,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB,0.537,0.586,0.693,0.677,0.692,0.687,0.702,0.788,0.693,0.701,0.762,0.765,0.778,0.593,0.739,0.757,0.765,0.786,0.732,0.723,0.771,0.756,0.75,0.766,0.776,0.786,0.768,0.771,0.769,0.785,0.764,0.733,0.703,0.736,0.778,0.773,0.714,0.758,0.746,0.698,0.719,0.719,0.768,0.459,0.787,0.774,0.647,0.658,0.709,0.767,0.752,0.693,0.638,0.704,0.699,0.716,0.713,0.742,0.735,0.714,0.803,0.813,965,Stability,SAV1_MOUSE,High,Eukaryote
+SBI_STAAM_Tsuboyama_2023_2JVG,0.606,0.599,0.685,0.666,0.714,0.699,0.607,0.603,0.737,0.763,0.661,0.626,0.642,0.614,0.629,0.687,0.789,0.833,0.674,0.68,0.61,0.63,0.64,0.639,0.607,0.619,0.611,0.629,0.688,0.789,0.798,0.75,0.602,0.589,0.598,0.606,0.641,0.646,0.648,0.674,0.681,0.685,0.629,0.608,0.659,0.655,0.836,0.828,0.854,0.812,0.768,0.765,0.782,0.784,0.781,0.782,0.794,0.79,0.785,0.787,0.848,0.785,1025,Stability,SBI_STAAM,Medium,Prokaryote
+SC6A4_HUMAN_Young_2021,0.71,0.745,0.728,0.734,0.771,0.78,0.686,0.766,0.8,0.805,0.788,0.782,0.787,0.588,0.675,0.77,0.792,0.793,0.78,0.795,0.764,0.771,0.762,0.761,0.771,0.774,0.773,0.776,0.771,0.788,0.745,0.693,0.683,0.772,0.77,0.759,0.783,0.788,0.781,0.793,0.796,0.791,0.762,0.564,0.79,0.782,0.746,0.778,0.762,0.598,0.778,0.783,0.788,0.791,0.793,0.788,0.788,0.79,0.783,0.793,0.808,0.782,11576,Activity,SC6A4_HUMAN,Medium,Human
+SCIN_STAAR_Tsuboyama_2023_2QFF,0.535,0.56,0.633,0.63,0.654,0.641,0.552,0.53,0.632,0.63,0.62,0.618,0.607,0.585,0.625,0.613,0.638,0.67,0.675,0.634,0.538,0.563,0.549,0.555,0.564,0.576,0.56,0.578,0.585,0.629,0.638,0.602,0.49,0.522,0.534,0.565,0.566,0.564,0.574,0.626,0.62,0.628,0.592,0.566,0.623,0.607,0.763,0.758,0.763,0.733,0.692,0.699,0.699,0.72,0.686,0.715,0.689,0.688,0.667,0.702,0.797,0.733,1212,Stability,SCIN_STAAR,High,Prokaryote
+SCN5A_HUMAN_Glazer_2019,0.537,0.543,0.558,0.558,0.553,0.559,0.548,0.51,0.569,0.583,0.589,0.613,0.567,0.623,0.591,0.582,0.594,0.569,0.554,0.579,0.546,0.557,0.567,0.567,0.536,0.547,0.548,0.543,0.58,0.545,0.588,0.582,0.532,0.534,0.525,0.532,0.537,0.53,0.529,0.553,0.552,0.556,0.542,0.585,0.574,0.585,0.529,0.561,0.527,0.504,0.573,0.564,0.574,0.563,0.543,0.566,0.578,0.572,0.575,0.57,0.608,0.574,224,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+SDA_BACSU_Tsuboyama_2023_1PV0,0.944,0.971,0.971,0.969,0.973,0.973,0.59,0.86,0.959,0.962,0.969,0.939,0.952,0.72,0.734,0.947,0.955,0.952,0.957,0.954,0.628,0.652,0.652,0.77,0.643,0.799,0.565,0.776,0.862,0.967,0.965,0.962,0.353,0.579,0.738,0.808,0.934,0.936,0.908,0.962,0.96,0.947,0.576,0.497,0.818,0.562,0.841,0.827,0.961,0.952,0.973,0.97,0.969,0.974,0.972,0.972,0.974,0.971,0.975,0.974,0.961,0.967,2770,Stability,SDA_BACSU,Medium,Prokaryote
+SERC_HUMAN_Xie_2023,0.678,0.722,0.759,0.759,0.745,0.75,0.509,0.715,0.774,0.778,0.747,0.757,0.76,0.587,0.692,0.754,0.765,0.776,0.76,0.755,0.736,0.744,0.746,0.735,0.738,0.755,0.747,0.755,0.745,0.744,0.753,0.732,0.588,0.747,0.753,0.749,0.748,0.754,0.757,0.757,0.759,0.759,0.651,0.519,0.759,0.725,0.696,0.735,0.681,0.593,0.762,0.756,0.764,0.768,0.771,0.767,0.767,0.766,0.765,0.772,0.765,0.727,1914,OrganismalFitness,SERC_HUMAN,High,Human
+SHOC2_HUMAN_Kwon_2022,0.599,0.664,0.689,0.69,0.685,0.69,0.606,0.679,0.711,0.711,0.696,0.689,0.702,0.613,0.618,0.627,0.711,0.695,0.648,0.694,0.624,0.679,0.693,0.677,0.626,0.693,0.692,0.698,0.678,0.704,0.704,0.677,0.574,0.621,0.651,0.687,0.627,0.648,0.682,0.682,0.682,0.698,0.623,0.611,0.713,0.639,0.65,0.681,0.638,0.545,0.685,0.682,0.691,0.693,0.697,0.699,0.689,0.695,0.69,0.697,0.656,0.639,10972,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+SOX30_HUMAN_Tsuboyama_2023_7JJK,0.592,0.618,0.628,0.627,0.639,0.634,0.694,0.584,0.596,0.613,0.62,0.659,0.663,0.644,0.687,0.661,0.66,0.625,0.624,0.615,0.576,0.587,0.586,0.599,0.631,0.632,0.596,0.626,0.599,0.611,0.535,0.517,0.609,0.561,0.63,0.62,0.593,0.609,0.615,0.635,0.631,0.631,0.63,0.572,0.598,0.671,0.756,0.671,0.768,0.724,0.674,0.691,0.674,0.672,0.668,0.693,0.67,0.671,0.665,0.68,0.664,0.737,1010,Stability,SOX30_HUMAN,High,Human
+SPA_STAAU_Tsuboyama_2023_1LP1,0.748,0.788,0.799,0.788,0.822,0.822,0.411,0.796,0.744,0.744,0.7,0.434,0.456,0.436,0.438,0.427,0.457,0.447,0.468,0.794,0.386,0.48,0.586,0.546,0.428,0.447,0.504,0.674,0.733,0.811,0.736,0.75,0.476,0.461,0.471,0.505,0.744,0.738,0.737,0.82,0.818,0.812,0.387,0.418,0.411,0.445,0.753,0.72,0.871,0.785,0.761,0.756,0.743,0.766,0.753,0.755,0.75,0.745,0.757,0.755,0.768,0.687,2105,Stability,SPA_STAAU,Medium,Prokaryote
+SPG1_STRSG_Olson_2014,0.608,0.634,0.497,0.507,0.618,0.627,0.489,0.529,0.451,0.555,0.671,0.629,0.606,0.65,0.626,0.652,0.68,0.649,0.688,0.661,0.637,0.616,0.619,0.612,0.631,0.623,0.614,0.626,0.692,0.634,0.754,0.746,0.536,0.629,0.599,0.65,0.636,0.611,0.654,0.631,0.605,0.656,0.473,0.436,0.624,0.531,0.682,0.644,0.711,0.567,0.717,0.696,0.709,0.733,0.733,0.716,0.745,0.754,0.744,0.739,0.716,0.708,536962,Binding,SPG1_STRSG,Low,Prokaryote
+SPG1_STRSG_Wu_2016,0.517,0.605,0.612,0.621,0.614,0.622,0.604,0.541,0.723,0.715,0.693,0.677,0.662,0.645,0.684,0.692,0.708,0.738,0.761,0.653,0.609,0.589,0.58,0.607,0.573,0.643,0.622,0.6,0.6,0.631,0.702,0.664,0.493,0.619,0.567,0.605,0.625,0.611,0.627,0.64,0.63,0.633,0.572,0.508,0.617,0.561,0.604,0.594,0.725,0.636,0.718,0.7,0.696,0.73,0.699,0.693,0.732,0.711,0.734,0.715,0.73,0.68,149360,Binding,SPG1_STRSG,Medium,Prokaryote
+SPG2_STRSG_Tsuboyama_2023_5UBS,0.767,0.811,0.804,0.823,0.84,0.839,0.704,0.705,0.818,0.793,0.785,0.752,0.744,0.719,0.756,0.769,0.792,0.789,0.81,0.8,0.736,0.757,0.744,0.739,0.705,0.771,0.712,0.772,0.796,0.866,0.842,0.835,0.416,0.722,0.766,0.778,0.697,0.718,0.743,0.833,0.836,0.842,0.647,0.564,0.66,0.642,0.761,0.715,0.902,0.837,0.838,0.839,0.851,0.85,0.835,0.853,0.848,0.861,0.847,0.85,0.853,0.827,1451,Stability,SPG2_STRSG,Medium,Prokaryote
+SPIKE_SARS2_Starr_2020_binding,0.57,0.595,0.531,0.576,0.624,0.628,0.454,0.682,0.684,0.686,0.475,0.445,0.459,0.454,0.46,0.47,0.471,0.477,0.519,0.659,0.644,0.671,0.665,0.671,0.684,0.655,0.641,0.677,0.644,0.625,0.648,0.67,0.592,0.653,0.652,0.673,0.649,0.649,0.649,0.665,0.662,0.666,0.464,0.479,0.467,0.474,0.766,0.748,0.702,0.584,0.602,0.606,0.602,0.638,0.613,0.637,0.619,0.63,0.599,0.627,0.656,0.56,3802,Binding,SPIKE_SARS2,Medium,Virus
+SPIKE_SARS2_Starr_2020_expression,0.607,0.664,0.592,0.651,0.743,0.741,0.476,0.682,0.721,0.735,0.511,0.469,0.484,0.473,0.481,0.492,0.493,0.502,0.538,0.68,0.662,0.693,0.694,0.705,0.696,0.689,0.666,0.703,0.666,0.694,0.654,0.675,0.589,0.672,0.677,0.69,0.681,0.686,0.685,0.732,0.737,0.746,0.49,0.497,0.494,0.502,0.782,0.77,0.736,0.597,0.658,0.668,0.656,0.705,0.68,0.699,0.681,0.691,0.671,0.694,0.735,0.621,3798,Expression,SPIKE_SARS2,Medium,Virus
+SPTN1_CHICK_Tsuboyama_2023_1TUD,0.794,0.805,0.81,0.778,0.798,0.787,0.605,0.745,0.766,0.788,0.797,0.816,0.807,0.444,0.797,0.801,0.796,0.857,0.822,0.788,0.782,0.808,0.789,0.782,0.786,0.789,0.814,0.767,0.778,0.787,0.791,0.792,0.711,0.746,0.777,0.766,0.808,0.813,0.804,0.792,0.795,0.791,0.736,0.402,0.723,0.736,0.702,0.693,0.809,0.802,0.824,0.819,0.825,0.822,0.828,0.824,0.818,0.816,0.825,0.825,0.83,0.767,3201,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,0.678,0.75,0.781,0.788,0.785,0.787,0.529,0.686,0.768,0.782,0.729,0.654,0.714,0.574,0.708,0.745,0.804,0.815,0.753,0.796,0.59,0.701,0.715,0.718,0.64,0.743,0.76,0.724,0.741,0.809,0.786,0.77,0.576,0.571,0.696,0.735,0.727,0.741,0.768,0.791,0.78,0.791,0.582,0.538,0.779,0.771,0.832,0.816,0.813,0.823,0.815,0.786,0.818,0.811,0.813,0.812,0.816,0.804,0.798,0.818,0.824,0.725,707,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88,0.82,0.856,0.869,0.869,0.873,0.87,0.382,0.779,0.844,0.831,0.878,0.842,0.851,0.347,0.853,0.878,0.863,0.877,0.864,0.843,0.419,0.682,0.687,0.762,0.76,0.801,0.825,0.819,0.823,0.865,0.866,0.848,0.686,0.504,0.575,0.678,0.853,0.851,0.855,0.868,0.868,0.865,0.794,0.321,0.871,0.825,0.567,0.81,0.859,0.864,0.867,0.864,0.861,0.874,0.878,0.867,0.862,0.862,0.87,0.87,0.917,0.891,1583,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,0.711,0.786,0.866,0.866,0.886,0.883,0.788,0.697,0.893,0.893,0.837,0.859,0.876,0.767,0.867,0.881,0.888,0.842,0.867,0.852,0.866,0.846,0.853,0.839,0.845,0.842,0.849,0.841,0.832,0.864,0.887,0.868,0.809,0.825,0.844,0.841,0.858,0.875,0.875,0.878,0.889,0.887,0.69,0.574,0.786,0.769,0.786,0.778,0.823,0.842,0.896,0.886,0.892,0.889,0.897,0.894,0.896,0.889,0.897,0.899,0.887,0.847,1556,Stability,SRBS1_HUMAN,High,Human
+SRC_HUMAN_Ahler_2019,0.825,0.825,0.796,0.797,0.82,0.825,0.833,0.78,0.837,0.848,0.813,0.861,0.875,0.775,0.797,0.806,0.856,0.833,0.814,0.815,0.78,0.769,0.766,0.74,0.792,0.804,0.785,0.766,0.715,0.847,0.865,0.855,0.77,0.764,0.771,0.721,0.816,0.819,0.806,0.831,0.832,0.827,0.855,0.749,0.82,0.845,0.697,0.684,0.754,0.562,0.826,0.81,0.813,0.825,0.831,0.826,0.833,0.834,0.832,0.836,0.826,0.797,3372,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.748,0.753,0.743,0.753,0.76,0.758,0.755,0.731,0.771,0.768,0.74,0.78,0.79,0.694,0.718,0.725,0.766,0.746,0.74,0.74,0.722,0.711,0.709,0.686,0.721,0.735,0.716,0.709,0.659,0.769,0.787,0.785,0.704,0.698,0.702,0.665,0.74,0.742,0.733,0.761,0.761,0.758,0.766,0.677,0.748,0.761,0.642,0.654,0.688,0.532,0.75,0.732,0.736,0.749,0.75,0.75,0.749,0.753,0.752,0.755,0.75,0.726,3637,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Nguyen_2022,0.725,0.734,0.733,0.738,0.738,0.736,0.737,0.721,0.695,0.732,0.732,0.764,0.777,0.674,0.696,0.706,0.755,0.738,0.731,0.726,0.705,0.696,0.699,0.678,0.709,0.723,0.705,0.698,0.655,0.749,0.769,0.76,0.7,0.689,0.693,0.657,0.722,0.724,0.715,0.741,0.741,0.737,0.757,0.658,0.74,0.756,0.627,0.642,0.671,0.537,0.741,0.723,0.726,0.738,0.741,0.739,0.741,0.745,0.741,0.745,0.739,0.703,3366,OrganismalFitness,SRC_HUMAN,Medium,Human
+SUMO1_HUMAN_Weile_2017,0.718,0.703,0.735,0.747,0.778,0.773,0.58,0.755,0.757,0.723,0.753,0.78,0.8,0.645,0.8,0.818,0.802,0.717,0.686,0.808,0.631,0.743,0.769,0.758,0.786,0.732,0.777,0.757,0.684,0.767,0.77,0.763,0.628,0.657,0.788,0.689,0.743,0.805,0.736,0.763,0.809,0.763,0.803,0.548,0.716,0.815,0.808,0.751,0.814,0.757,0.79,0.764,0.797,0.792,0.79,0.802,0.783,0.796,0.786,0.801,0.777,0.817,1700,OrganismalFitness,SUMO1_HUMAN,High,Human
+SYUA_HUMAN_Newberry_2020,0.61,0.647,0.668,0.676,0.671,0.676,0.667,0.719,0.703,0.697,0.774,0.775,0.772,0.669,0.7,0.699,0.702,0.748,0.753,0.753,0.683,0.735,0.698,0.697,0.654,0.717,0.731,0.689,0.687,0.755,0.736,0.712,0.579,0.694,0.755,0.72,0.658,0.724,0.692,0.679,0.717,0.697,0.665,0.617,0.719,0.697,0.495,0.634,0.519,0.472,0.648,0.64,0.638,0.657,0.687,0.659,0.678,0.671,0.692,0.67,0.574,0.544,2497,OrganismalFitness,SYUA_HUMAN,Medium,Human
+TADBP_HUMAN_Bolognesi_2019,0.555,0.538,0.554,0.552,0.547,0.545,0.62,0.503,0.544,0.542,0.512,0.525,0.524,0.552,0.517,0.516,0.447,0.502,0.533,0.474,0.568,0.532,0.504,0.502,0.582,0.551,0.517,0.525,0.498,0.525,0.554,0.57,0.471,0.595,0.619,0.567,0.571,0.59,0.569,0.556,0.566,0.563,0.538,0.584,0.499,0.528,0.638,0.524,0.59,0.529,0.568,0.508,0.503,0.521,0.503,0.523,0.532,0.502,0.469,0.517,0.518,0.549,1196,OrganismalFitness,TADBP_HUMAN,Low,Human
+TAT_HV1BR_Fernandes_2016,0.707,0.64,0.678,0.687,0.721,0.708,0.439,0.774,0.673,0.685,0.616,0.738,0.74,0.479,0.499,0.498,0.522,0.461,0.529,0.646,0.76,0.758,0.775,0.781,0.772,0.68,0.591,0.67,0.649,0.756,0.787,0.779,0.709,0.767,0.649,0.639,0.765,0.688,0.665,0.757,0.709,0.685,0.509,0.465,0.712,0.625,0.65,0.702,0.726,0.64,0.58,0.582,0.598,0.583,0.576,0.585,0.573,0.577,0.571,0.586,0.607,0.581,1577,OrganismalFitness,TAT_HV1BR,High,Virus
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,0.837,0.852,0.88,0.885,0.887,0.888,0.883,0.686,0.892,0.918,0.874,0.922,0.933,0.908,0.914,0.94,0.931,0.915,0.929,0.837,0.785,0.843,0.837,0.865,0.854,0.861,0.888,0.884,0.851,0.909,0.899,0.886,0.84,0.777,0.85,0.885,0.895,0.907,0.91,0.897,0.908,0.906,0.905,0.588,0.827,0.901,0.853,0.797,0.928,0.917,0.918,0.907,0.898,0.931,0.928,0.931,0.922,0.93,0.919,0.928,0.934,0.935,1058,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG,0.707,0.76,0.81,0.805,0.811,0.81,0.509,0.561,0.851,0.845,0.836,0.804,0.828,0.576,0.848,0.849,0.84,0.851,0.857,0.831,0.441,0.692,0.695,0.668,0.683,0.717,0.618,0.75,0.795,0.852,0.853,0.843,0.775,0.469,0.721,0.686,0.784,0.816,0.809,0.803,0.825,0.824,0.641,0.547,0.797,0.711,0.796,0.796,0.812,0.868,0.86,0.851,0.866,0.866,0.87,0.856,0.858,0.855,0.858,0.861,0.826,0.775,1279,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,0.712,0.742,0.774,0.767,0.769,0.769,0.422,0.71,0.795,0.795,0.757,0.762,0.765,0.505,0.76,0.791,0.778,0.776,0.785,0.752,0.643,0.691,0.709,0.707,0.696,0.726,0.69,0.704,0.719,0.779,0.753,0.732,0.602,0.536,0.67,0.682,0.767,0.766,0.763,0.786,0.775,0.774,0.653,0.465,0.781,0.735,0.733,0.813,0.839,0.816,0.772,0.783,0.776,0.772,0.772,0.767,0.774,0.77,0.772,0.777,0.814,0.8,1479,Stability,TNKS2_HUMAN,High,Human
+TPK1_HUMAN_Weile_2017,0.622,0.631,0.626,0.629,0.632,0.631,0.547,0.639,0.662,0.652,0.658,0.653,0.675,0.571,0.618,0.644,0.684,0.68,0.706,0.64,0.555,0.575,0.618,0.646,0.569,0.647,0.652,0.635,0.669,0.641,0.642,0.606,0.563,0.571,0.614,0.669,0.632,0.638,0.671,0.64,0.64,0.655,0.578,0.56,0.673,0.601,0.617,0.645,0.633,0.559,0.636,0.655,0.649,0.655,0.663,0.664,0.656,0.658,0.66,0.664,0.652,0.622,3181,OrganismalFitness,TPK1_HUMAN,Medium,Human
+TPMT_HUMAN_Matreyek_2018,0.722,0.754,0.773,0.782,0.778,0.787,0.656,0.761,0.786,0.787,0.812,0.785,0.802,0.706,0.751,0.814,0.807,0.756,0.749,0.779,0.694,0.725,0.761,0.766,0.742,0.777,0.757,0.746,0.736,0.805,0.802,0.779,0.708,0.698,0.768,0.719,0.785,0.797,0.765,0.799,0.805,0.783,0.736,0.637,0.797,0.777,0.779,0.788,0.807,0.641,0.782,0.785,0.791,0.776,0.795,0.793,0.795,0.79,0.798,0.798,0.827,0.778,3648,Expression,TPMT_HUMAN,Medium,Human
+TPOR_HUMAN_Bridgford_2020,0.723,0.695,0.677,0.655,0.658,0.667,0.765,0.752,0.718,0.721,0.739,0.732,0.739,0.694,0.748,0.704,0.685,0.736,0.678,0.677,0.681,0.737,0.714,0.726,0.749,0.634,0.806,0.78,0.708,0.741,0.751,0.63,0.747,0.756,0.73,0.751,0.776,0.768,0.773,0.75,0.739,0.757,0.709,0.712,0.717,0.738,0.578,0.69,0.681,0.511,0.702,0.676,0.722,0.701,0.684,0.699,0.739,0.643,0.672,0.699,0.749,0.752,562,OrganismalFitness,TPOR_HUMAN,Low,Human
+TRPC_SACS2_Chan_2017,0.832,0.841,0.818,0.827,0.829,0.834,0.63,0.804,0.873,0.88,0.858,0.849,0.869,0.689,0.817,0.858,0.864,0.86,0.847,0.835,0.758,0.832,0.8,0.832,0.792,0.798,0.817,0.803,0.799,0.808,0.846,0.808,0.62,0.804,0.829,0.827,0.834,0.848,0.844,0.841,0.854,0.847,0.763,0.539,0.85,0.8,0.674,0.788,0.793,0.564,0.842,0.833,0.838,0.867,0.851,0.85,0.853,0.86,0.847,0.863,0.858,0.836,1519,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+TRPC_THEMA_Chan_2017,0.768,0.78,0.76,0.764,0.76,0.762,0.691,0.748,0.794,0.79,0.772,0.797,0.807,0.661,0.796,0.797,0.794,0.793,0.811,0.773,0.728,0.76,0.762,0.783,0.756,0.763,0.746,0.744,0.787,0.753,0.765,0.732,0.603,0.731,0.784,0.775,0.762,0.784,0.779,0.77,0.783,0.777,0.757,0.558,0.79,0.754,0.64,0.721,0.73,0.52,0.784,0.751,0.765,0.784,0.784,0.757,0.777,0.777,0.765,0.783,0.814,0.794,1519,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+UBC9_HUMAN_Weile_2017,0.697,0.771,0.796,0.801,0.788,0.794,0.477,0.72,0.787,0.791,0.731,0.773,0.79,0.498,0.499,0.716,0.756,0.786,0.803,0.816,0.625,0.727,0.746,0.732,0.734,0.76,0.759,0.747,0.745,0.766,0.757,0.745,0.551,0.63,0.727,0.74,0.678,0.752,0.762,0.749,0.792,0.803,0.648,0.493,0.733,0.774,0.684,0.695,0.754,0.643,0.736,0.727,0.72,0.735,0.73,0.731,0.735,0.739,0.741,0.738,0.715,0.736,2563,OrganismalFitness,UBC9_HUMAN,Medium,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,0.703,0.811,0.829,0.828,0.834,0.833,0.628,0.78,0.81,0.824,0.815,0.845,0.837,0.751,0.81,0.854,0.843,0.847,0.858,0.818,0.66,0.792,0.795,0.801,0.784,0.803,0.787,0.789,0.807,0.83,0.863,0.852,0.813,0.612,0.615,0.757,0.772,0.769,0.784,0.833,0.831,0.823,0.763,0.484,0.72,0.726,0.805,0.763,0.845,0.755,0.826,0.843,0.838,0.845,0.845,0.84,0.839,0.843,0.842,0.843,0.861,0.813,3622,Stability,UBE4B_HUMAN,High,Human
+UBE4B_MOUSE_Starita_2013,0.779,0.799,0.824,0.831,0.842,0.844,0.536,0.594,0.796,0.805,0.762,0.838,0.846,0.817,0.838,0.847,0.834,0.786,0.752,0.798,0.59,0.764,0.749,0.732,0.837,0.8,0.786,0.775,0.731,0.826,0.845,0.825,0.534,0.492,0.58,0.698,0.782,0.749,0.776,0.834,0.828,0.842,0.815,0.499,0.814,0.826,0.694,0.773,0.469,0.54,0.814,0.798,0.832,0.836,0.82,0.834,0.813,0.829,0.826,0.833,0.814,0.791,899,Activity,UBE4B_MOUSE,Low,Eukaryote
+UBR5_HUMAN_Tsuboyama_2023_1I2T,0.735,0.832,0.832,0.834,0.837,0.845,0.601,0.665,0.798,0.816,0.813,0.742,0.791,0.627,0.635,0.587,0.588,0.84,0.835,0.795,0.754,0.749,0.777,0.804,0.765,0.784,0.795,0.789,0.801,0.815,0.784,0.717,0.669,0.668,0.758,0.742,0.76,0.786,0.784,0.845,0.846,0.845,0.555,0.552,0.707,0.589,0.784,0.716,0.854,0.82,0.837,0.853,0.858,0.86,0.86,0.864,0.852,0.858,0.849,0.858,0.871,0.826,1453,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8,0.583,0.623,0.591,0.609,0.612,0.609,0.618,0.467,0.688,0.681,0.831,0.785,0.79,0.674,0.761,0.808,0.844,0.817,0.85,0.566,0.701,0.765,0.747,0.813,0.715,0.736,0.729,0.752,0.815,0.766,0.782,0.715,0.487,0.652,0.76,0.78,0.659,0.7,0.753,0.655,0.684,0.709,0.737,0.637,0.781,0.752,0.742,0.783,0.792,0.759,0.84,0.821,0.842,0.844,0.842,0.86,0.822,0.84,0.857,0.849,0.819,0.809,723,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5,0.687,0.772,0.869,0.894,0.896,0.897,0.552,0.811,0.857,0.874,0.914,0.886,0.896,0.695,0.618,0.904,0.912,0.88,0.817,0.862,0.668,0.757,0.842,0.812,0.793,0.826,0.812,0.792,0.843,0.88,0.919,0.913,0.692,0.683,0.791,0.789,0.812,0.84,0.831,0.894,0.898,0.888,0.626,0.66,0.79,0.795,0.816,0.793,0.922,0.861,0.92,0.921,0.925,0.925,0.919,0.927,0.919,0.922,0.924,0.925,0.909,0.891,2568,Stability,VILI_CHICK,High,Eukaryote
+VKOR1_HUMAN_Chiasson_2020_abundance,0.704,0.711,0.712,0.723,0.734,0.736,0.601,0.71,0.764,0.772,0.738,0.742,0.753,0.604,0.707,0.754,0.762,0.759,0.757,0.75,0.626,0.636,0.705,0.732,0.672,0.743,0.74,0.724,0.726,0.752,0.715,0.666,0.554,0.601,0.664,0.764,0.727,0.738,0.784,0.747,0.754,0.779,0.595,0.57,0.749,0.679,0.734,0.744,0.762,0.644,0.779,0.765,0.788,0.795,0.789,0.782,0.783,0.772,0.78,0.789,0.777,0.742,2695,Expression,VKOR1_HUMAN,Medium,Human
+VKOR1_HUMAN_Chiasson_2020_activity,0.705,0.72,0.729,0.739,0.742,0.751,0.525,0.72,0.75,0.757,0.749,0.755,0.766,0.534,0.687,0.712,0.754,0.76,0.754,0.747,0.554,0.568,0.664,0.684,0.592,0.73,0.722,0.717,0.733,0.748,0.744,0.718,0.601,0.534,0.598,0.721,0.715,0.721,0.742,0.747,0.753,0.758,0.563,0.54,0.746,0.646,0.621,0.688,0.711,0.541,0.742,0.709,0.721,0.73,0.734,0.727,0.732,0.732,0.729,0.736,0.748,0.693,697,Activity,VKOR1_HUMAN,Medium,Human
+VRPI_BPT7_Tsuboyama_2023_2WNM,0.422,0.514,0.587,0.615,0.55,0.572,0.548,0.603,0.75,0.748,0.801,0.722,0.733,0.669,0.755,0.811,0.86,0.874,0.813,0.616,0.488,0.567,0.575,0.636,0.613,0.622,0.57,0.622,0.664,0.563,0.809,0.769,0.638,0.514,0.514,0.553,0.468,0.474,0.456,0.528,0.537,0.49,0.656,0.521,0.819,0.728,0.833,0.812,0.84,0.842,0.854,0.845,0.862,0.861,0.857,0.869,0.858,0.872,0.853,0.868,0.878,0.837,1047,Stability,VRPI_BPT7,Medium,Virus
+YAIA_ECOLI_Tsuboyama_2023_2KVT,0.65,0.816,0.795,0.809,0.807,0.803,0.38,0.783,0.781,0.804,0.798,0.618,0.733,0.432,0.574,0.791,0.813,0.844,0.851,0.8,0.459,0.404,0.435,0.633,0.431,0.409,0.373,0.498,0.771,0.803,0.854,0.84,0.471,0.363,0.451,0.75,0.677,0.682,0.766,0.791,0.788,0.806,0.459,0.363,0.582,0.479,0.709,0.691,0.813,0.767,0.794,0.819,0.794,0.813,0.808,0.819,0.818,0.835,0.844,0.82,0.805,0.743,1890,Stability,YAIA_ECOLI,Medium,Prokaryote
+YAP1_HUMAN_Araya_2012,0.726,0.674,0.743,0.742,0.735,0.738,0.673,0.654,0.524,0.534,0.677,0.649,0.654,0.716,0.736,0.734,0.74,0.698,0.667,0.607,0.598,0.597,0.588,0.592,0.657,0.623,0.637,0.612,0.586,0.677,0.729,0.74,0.569,0.66,0.605,0.618,0.712,0.672,0.691,0.728,0.712,0.731,0.75,0.454,0.673,0.75,0.679,0.685,0.7,0.603,0.734,0.699,0.725,0.721,0.718,0.713,0.716,0.729,0.732,0.728,0.698,0.72,10075,Binding,YAP1_HUMAN,Low,Human
+YNZC_BACSU_Tsuboyama_2023_2JVD,0.904,0.927,0.955,0.954,0.948,0.945,0.819,0.853,0.939,0.941,0.969,0.967,0.971,0.874,0.883,0.973,0.974,0.961,0.97,0.942,0.848,0.912,0.929,0.933,0.923,0.93,0.886,0.934,0.952,0.923,0.942,0.919,0.911,0.856,0.907,0.897,0.922,0.926,0.923,0.942,0.938,0.944,0.732,0.658,0.826,0.739,0.775,0.825,0.98,0.932,0.971,0.964,0.97,0.971,0.97,0.973,0.97,0.967,0.971,0.971,0.959,0.964,2300,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.html b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.html
new file mode 100644
index 0000000..35ee671
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_DMS_level.html
@@ -0,0 +1,15196 @@
+
+
+
+ score |
+ Site-Independent |
+ EVmutation |
+ DeepSequence (single) |
+ DeepSequence (ensemble) |
+ EVE (single) |
+ EVE (ensemble) |
+ Unirep |
+ Unirep evotuned |
+ MSA Transformer (single) |
+ MSA Transformer (ensemble) |
+ ESM-1b |
+ ESM-1v (single) |
+ ESM-1v (ensemble) |
+ ESM2 (8M) |
+ ESM2 (35M) |
+ ESM2 (150M) |
+ ESM2 (650M) |
+ ESM2 (3B) |
+ ESM2 (15B) |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ GEMME |
+ VESPA |
+ VESPAl |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ CARP (38M) |
+ CARP (600K) |
+ CARP (640M) |
+ CARP (76M) |
+ MIF |
+ MIF-ST |
+ ESM-IF1 |
+ ProteinMPNN |
+ ProtSSN (k=10 h=512) |
+ ProtSSN (k=10 h=768) |
+ ProtSSN (k=10 h=1280) |
+ ProtSSN (k=20 h=512) |
+ ProtSSN (k=20 h=768) |
+ ProtSSN (k=20 h=1280) |
+ ProtSSN (k=30 h=512) |
+ ProtSSN (k=30 h=768) |
+ ProtSSN (k=30 h=1280) |
+ ProtSSN (ensemble) |
+ SaProt (650M) |
+ SaProt (35M) |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A0A140D2T1_ZIKV_Sourisseau_2019 |
+ 0.690 |
+ 0.674 |
+ 0.560 |
+ 0.544 |
+ 0.692 |
+ 0.698 |
+ 0.426 |
+ 0.532 |
+ 0.718 |
+ 0.721 |
+ 0.485 |
+ 0.467 |
+ 0.494 |
+ 0.454 |
+ 0.464 |
+ 0.460 |
+ 0.597 |
+ 0.678 |
+ 0.685 |
+ 0.612 |
+ 0.674 |
+ 0.651 |
+ 0.656 |
+ 0.651 |
+ 0.658 |
+ 0.669 |
+ 0.660 |
+ 0.653 |
+ 0.644 |
+ 0.715 |
+ 0.653 |
+ 0.640 |
+ 0.499 |
+ 0.648 |
+ 0.659 |
+ 0.633 |
+ 0.677 |
+ 0.676 |
+ 0.672 |
+ 0.675 |
+ 0.668 |
+ 0.682 |
+ 0.459 |
+ 0.453 |
+ 0.555 |
+ 0.459 |
+ 0.637 |
+ 0.642 |
+ 0.642 |
+ 0.565 |
+ 0.618 |
+ 0.617 |
+ 0.633 |
+ 0.642 |
+ 0.631 |
+ 0.628 |
+ 0.623 |
+ 0.628 |
+ 0.626 |
+ 0.631 |
+ 0.598 |
+ 0.556 |
+ 9576 |
+ OrganismalFitness |
+ A0A140D2T1_ZIKV |
+ Medium |
+ Virus |
+
+
+ A0A192B1T2_9HIV1_Haddox_2018 |
+ 0.761 |
+ 0.724 |
+ 0.727 |
+ 0.739 |
+ 0.776 |
+ 0.779 |
+ 0.502 |
+ 0.771 |
+ 0.776 |
+ 0.778 |
+ 0.744 |
+ 0.767 |
+ 0.778 |
+ 0.502 |
+ 0.510 |
+ 0.519 |
+ 0.539 |
+ 0.562 |
+ 0.578 |
+ 0.752 |
+ 0.767 |
+ 0.775 |
+ 0.775 |
+ 0.774 |
+ 0.769 |
+ 0.770 |
+ 0.754 |
+ 0.765 |
+ 0.767 |
+ 0.769 |
+ 0.796 |
+ 0.779 |
+ 0.681 |
+ 0.763 |
+ 0.764 |
+ 0.778 |
+ 0.774 |
+ 0.775 |
+ 0.779 |
+ 0.783 |
+ 0.782 |
+ 0.786 |
+ 0.721 |
+ 0.492 |
+ 0.767 |
+ 0.730 |
+ 0.685 |
+ 0.748 |
+ 0.629 |
+ 0.571 |
+ 0.608 |
+ 0.626 |
+ 0.638 |
+ 0.644 |
+ 0.616 |
+ 0.625 |
+ 0.627 |
+ 0.610 |
+ 0.599 |
+ 0.631 |
+ 0.592 |
+ 0.551 |
+ 12577 |
+ OrganismalFitness |
+ A0A192B1T2_9HIV1 |
+ Medium |
+ Virus |
+
+
+ A0A1I9GEU1_NEIME_Kennouche_2019 |
+ 0.491 |
+ 0.518 |
+ 0.556 |
+ 0.547 |
+ 0.528 |
+ 0.528 |
+ 0.488 |
+ 0.537 |
+ 0.541 |
+ 0.537 |
+ 0.521 |
+ 0.521 |
+ 0.525 |
+ 0.477 |
+ 0.468 |
+ 0.490 |
+ 0.516 |
+ 0.512 |
+ 0.516 |
+ 0.538 |
+ 0.491 |
+ 0.518 |
+ 0.531 |
+ 0.540 |
+ 0.517 |
+ 0.537 |
+ 0.537 |
+ 0.539 |
+ 0.542 |
+ 0.524 |
+ 0.510 |
+ 0.504 |
+ 0.513 |
+ 0.521 |
+ 0.527 |
+ 0.550 |
+ 0.516 |
+ 0.521 |
+ 0.531 |
+ 0.528 |
+ 0.530 |
+ 0.538 |
+ 0.473 |
+ 0.470 |
+ 0.524 |
+ 0.475 |
+ 0.516 |
+ 0.532 |
+ 0.513 |
+ 0.517 |
+ 0.528 |
+ 0.524 |
+ 0.522 |
+ 0.517 |
+ 0.520 |
+ 0.529 |
+ 0.519 |
+ 0.530 |
+ 0.523 |
+ 0.525 |
+ 0.517 |
+ 0.491 |
+ 922 |
+ Activity |
+ A0A1I9GEU1_NEIME |
+ Medium |
+ Prokaryote |
+
+
+ A0A247D711_LISMN_Stadelmann_2021 |
+ 0.727 |
+ 0.733 |
+ 0.568 |
+ 0.534 |
+ 0.716 |
+ 0.717 |
+ 0.515 |
+ 0.526 |
+ 0.655 |
+ 0.665 |
+ 0.556 |
+ 0.558 |
+ 0.558 |
+ 0.542 |
+ 0.553 |
+ 0.564 |
+ 0.549 |
+ 0.557 |
+ 0.555 |
+ 0.534 |
+ 0.518 |
+ 0.516 |
+ 0.535 |
+ 0.547 |
+ 0.501 |
+ 0.543 |
+ 0.514 |
+ 0.535 |
+ 0.561 |
+ 0.730 |
+ 0.668 |
+ 0.672 |
+ 0.537 |
+ 0.532 |
+ 0.522 |
+ 0.538 |
+ 0.661 |
+ 0.657 |
+ 0.666 |
+ 0.636 |
+ 0.631 |
+ 0.641 |
+ 0.536 |
+ 0.514 |
+ 0.550 |
+ 0.556 |
+ 0.713 |
+ 0.724 |
+ 0.738 |
+ 0.683 |
+ 0.640 |
+ 0.668 |
+ 0.656 |
+ 0.673 |
+ 0.663 |
+ 0.681 |
+ 0.657 |
+ 0.669 |
+ 0.632 |
+ 0.670 |
+ 0.718 |
+ 0.645 |
+ 1653 |
+ Activity |
+ A0A247D711_LISMN |
+ High |
+ Prokaryote |
+
+
+ A0A2Z5U3Z0_9INFA_Doud_2016 |
+ 0.745 |
+ 0.744 |
+ 0.751 |
+ 0.769 |
+ 0.782 |
+ 0.783 |
+ 0.499 |
+ 0.758 |
+ 0.755 |
+ 0.755 |
+ 0.559 |
+ 0.769 |
+ 0.786 |
+ 0.495 |
+ 0.504 |
+ 0.529 |
+ 0.766 |
+ 0.759 |
+ 0.758 |
+ 0.708 |
+ 0.742 |
+ 0.777 |
+ 0.768 |
+ 0.782 |
+ 0.721 |
+ 0.769 |
+ 0.765 |
+ 0.757 |
+ 0.762 |
+ 0.774 |
+ 0.760 |
+ 0.715 |
+ 0.540 |
+ 0.741 |
+ 0.771 |
+ 0.770 |
+ 0.767 |
+ 0.783 |
+ 0.777 |
+ 0.787 |
+ 0.797 |
+ 0.796 |
+ 0.502 |
+ 0.500 |
+ 0.673 |
+ 0.507 |
+ 0.723 |
+ 0.751 |
+ 0.739 |
+ 0.591 |
+ 0.758 |
+ 0.752 |
+ 0.781 |
+ 0.777 |
+ 0.778 |
+ 0.778 |
+ 0.784 |
+ 0.776 |
+ 0.788 |
+ 0.785 |
+ 0.605 |
+ 0.579 |
+ 10715 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A0A2Z5U3Z0_9INFA_Wu_2014 |
+ 0.753 |
+ 0.766 |
+ 0.734 |
+ 0.749 |
+ 0.769 |
+ 0.777 |
+ 0.497 |
+ 0.725 |
+ 0.754 |
+ 0.761 |
+ 0.579 |
+ 0.741 |
+ 0.766 |
+ 0.520 |
+ 0.531 |
+ 0.532 |
+ 0.742 |
+ 0.752 |
+ 0.754 |
+ 0.735 |
+ 0.729 |
+ 0.760 |
+ 0.755 |
+ 0.768 |
+ 0.681 |
+ 0.738 |
+ 0.747 |
+ 0.731 |
+ 0.748 |
+ 0.767 |
+ 0.731 |
+ 0.697 |
+ 0.547 |
+ 0.730 |
+ 0.758 |
+ 0.771 |
+ 0.766 |
+ 0.781 |
+ 0.784 |
+ 0.787 |
+ 0.794 |
+ 0.794 |
+ 0.519 |
+ 0.520 |
+ 0.626 |
+ 0.528 |
+ 0.688 |
+ 0.700 |
+ 0.689 |
+ 0.576 |
+ 0.733 |
+ 0.726 |
+ 0.743 |
+ 0.740 |
+ 0.741 |
+ 0.745 |
+ 0.750 |
+ 0.743 |
+ 0.744 |
+ 0.750 |
+ 0.607 |
+ 0.588 |
+ 2350 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A4_HUMAN_Seuma_2022 |
+ 0.714 |
+ 0.707 |
+ 0.735 |
+ 0.722 |
+ 0.673 |
+ 0.676 |
+ 0.693 |
+ 0.577 |
+ 0.673 |
+ 0.679 |
+ 0.666 |
+ 0.666 |
+ 0.711 |
+ 0.709 |
+ 0.700 |
+ 0.703 |
+ 0.736 |
+ 0.758 |
+ 0.688 |
+ 0.632 |
+ 0.672 |
+ 0.645 |
+ 0.666 |
+ 0.659 |
+ 0.718 |
+ 0.654 |
+ 0.667 |
+ 0.664 |
+ 0.662 |
+ 0.757 |
+ 0.622 |
+ 0.588 |
+ 0.766 |
+ 0.713 |
+ 0.639 |
+ 0.692 |
+ 0.740 |
+ 0.695 |
+ 0.743 |
+ 0.739 |
+ 0.716 |
+ 0.728 |
+ 0.703 |
+ 0.699 |
+ 0.730 |
+ 0.696 |
+ 0.650 |
+ 0.697 |
+ 0.405 |
+ 0.527 |
+ 0.739 |
+ 0.734 |
+ 0.726 |
+ 0.720 |
+ 0.722 |
+ 0.703 |
+ 0.730 |
+ 0.687 |
+ 0.698 |
+ 0.725 |
+ 0.722 |
+ 0.709 |
+ 14811 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ A4D664_9INFA_Soh_2019 |
+ 0.714 |
+ 0.681 |
+ 0.713 |
+ 0.711 |
+ 0.715 |
+ 0.714 |
+ 0.513 |
+ 0.692 |
+ 0.668 |
+ 0.663 |
+ 0.521 |
+ 0.512 |
+ 0.514 |
+ 0.510 |
+ 0.509 |
+ 0.513 |
+ 0.570 |
+ 0.603 |
+ 0.636 |
+ 0.635 |
+ 0.679 |
+ 0.705 |
+ 0.712 |
+ 0.708 |
+ 0.533 |
+ 0.643 |
+ 0.655 |
+ 0.634 |
+ 0.676 |
+ 0.733 |
+ 0.666 |
+ 0.658 |
+ 0.525 |
+ 0.686 |
+ 0.697 |
+ 0.711 |
+ 0.694 |
+ 0.701 |
+ 0.704 |
+ 0.730 |
+ 0.732 |
+ 0.741 |
+ 0.510 |
+ 0.499 |
+ 0.539 |
+ 0.501 |
+ 0.640 |
+ 0.633 |
+ 0.566 |
+ 0.559 |
+ 0.592 |
+ 0.589 |
+ 0.596 |
+ 0.609 |
+ 0.605 |
+ 0.603 |
+ 0.600 |
+ 0.600 |
+ 0.597 |
+ 0.602 |
+ 0.584 |
+ 0.551 |
+ 14421 |
+ OrganismalFitness |
+ A4D664_9INFA |
+ Medium |
+ Virus |
+
+
+ A4GRB6_PSEAI_Chen_2020 |
+ 0.673 |
+ 0.772 |
+ 0.855 |
+ 0.857 |
+ 0.838 |
+ 0.847 |
+ 0.697 |
+ 0.781 |
+ 0.847 |
+ 0.869 |
+ 0.875 |
+ 0.853 |
+ 0.867 |
+ 0.736 |
+ 0.791 |
+ 0.858 |
+ 0.905 |
+ 0.891 |
+ 0.838 |
+ 0.806 |
+ 0.716 |
+ 0.792 |
+ 0.805 |
+ 0.836 |
+ 0.785 |
+ 0.835 |
+ 0.845 |
+ 0.843 |
+ 0.874 |
+ 0.865 |
+ 0.908 |
+ 0.871 |
+ 0.627 |
+ 0.730 |
+ 0.802 |
+ 0.840 |
+ 0.806 |
+ 0.824 |
+ 0.856 |
+ 0.858 |
+ 0.861 |
+ 0.870 |
+ 0.767 |
+ 0.547 |
+ 0.865 |
+ 0.830 |
+ 0.845 |
+ 0.879 |
+ 0.837 |
+ 0.723 |
+ 0.890 |
+ 0.877 |
+ 0.875 |
+ 0.878 |
+ 0.885 |
+ 0.887 |
+ 0.889 |
+ 0.892 |
+ 0.893 |
+ 0.895 |
+ 0.887 |
+ 0.772 |
+ 5004 |
+ OrganismalFitness |
+ A4GRB6_PSEAI |
+ High |
+ Prokaryote |
+
+
+ AACC1_PSEAI_Dandage_2018 |
+ 0.642 |
+ 0.744 |
+ 0.665 |
+ 0.715 |
+ 0.752 |
+ 0.753 |
+ 0.596 |
+ 0.605 |
+ 0.753 |
+ 0.755 |
+ 0.707 |
+ 0.741 |
+ 0.748 |
+ 0.600 |
+ 0.621 |
+ 0.621 |
+ 0.748 |
+ 0.754 |
+ 0.760 |
+ 0.701 |
+ 0.597 |
+ 0.613 |
+ 0.635 |
+ 0.652 |
+ 0.637 |
+ 0.715 |
+ 0.704 |
+ 0.713 |
+ 0.724 |
+ 0.739 |
+ 0.757 |
+ 0.733 |
+ 0.509 |
+ 0.628 |
+ 0.622 |
+ 0.708 |
+ 0.720 |
+ 0.714 |
+ 0.736 |
+ 0.743 |
+ 0.741 |
+ 0.753 |
+ 0.609 |
+ 0.579 |
+ 0.682 |
+ 0.619 |
+ 0.632 |
+ 0.695 |
+ 0.680 |
+ 0.588 |
+ 0.748 |
+ 0.750 |
+ 0.747 |
+ 0.747 |
+ 0.752 |
+ 0.746 |
+ 0.749 |
+ 0.743 |
+ 0.749 |
+ 0.754 |
+ 0.735 |
+ 0.636 |
+ 1801 |
+ OrganismalFitness |
+ AACC1_PSEAI |
+ High |
+ Prokaryote |
+
+
+ ACE2_HUMAN_Chan_2020 |
+ 0.611 |
+ 0.588 |
+ 0.553 |
+ 0.562 |
+ 0.593 |
+ 0.586 |
+ 0.472 |
+ 0.502 |
+ 0.592 |
+ 0.599 |
+ 0.612 |
+ 0.555 |
+ 0.570 |
+ 0.483 |
+ 0.522 |
+ 0.603 |
+ 0.606 |
+ 0.562 |
+ 0.554 |
+ 0.595 |
+ 0.499 |
+ 0.527 |
+ 0.558 |
+ 0.589 |
+ 0.481 |
+ 0.545 |
+ 0.574 |
+ 0.572 |
+ 0.616 |
+ 0.576 |
+ 0.547 |
+ 0.534 |
+ 0.534 |
+ 0.501 |
+ 0.527 |
+ 0.526 |
+ 0.586 |
+ 0.565 |
+ 0.563 |
+ 0.580 |
+ 0.564 |
+ 0.564 |
+ 0.497 |
+ 0.516 |
+ 0.623 |
+ 0.528 |
+ 0.694 |
+ 0.660 |
+ 0.675 |
+ 0.559 |
+ 0.603 |
+ 0.625 |
+ 0.608 |
+ 0.607 |
+ 0.612 |
+ 0.621 |
+ 0.603 |
+ 0.602 |
+ 0.602 |
+ 0.611 |
+ 0.656 |
+ 0.591 |
+ 2223 |
+ Binding |
+ ACE2_HUMAN |
+ Medium |
+ Human |
+
+
+ ADRB2_HUMAN_Jones_2020 |
+ 0.672 |
+ 0.718 |
+ 0.759 |
+ 0.763 |
+ 0.754 |
+ 0.764 |
+ 0.738 |
+ 0.755 |
+ 0.771 |
+ 0.771 |
+ 0.773 |
+ 0.770 |
+ 0.775 |
+ 0.712 |
+ 0.729 |
+ 0.745 |
+ 0.755 |
+ 0.760 |
+ 0.766 |
+ 0.768 |
+ 0.763 |
+ 0.768 |
+ 0.765 |
+ 0.754 |
+ 0.770 |
+ 0.772 |
+ 0.776 |
+ 0.773 |
+ 0.758 |
+ 0.779 |
+ 0.759 |
+ 0.715 |
+ 0.649 |
+ 0.773 |
+ 0.771 |
+ 0.759 |
+ 0.773 |
+ 0.777 |
+ 0.774 |
+ 0.773 |
+ 0.775 |
+ 0.774 |
+ 0.745 |
+ 0.594 |
+ 0.771 |
+ 0.764 |
+ 0.716 |
+ 0.748 |
+ 0.736 |
+ 0.616 |
+ 0.752 |
+ 0.753 |
+ 0.761 |
+ 0.762 |
+ 0.759 |
+ 0.759 |
+ 0.757 |
+ 0.764 |
+ 0.756 |
+ 0.764 |
+ 0.788 |
+ 0.768 |
+ 7800 |
+ Activity |
+ ADRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ AICDA_HUMAN_Gajula_2014_3cycles |
+ 0.470 |
+ 0.697 |
+ 0.728 |
+ 0.740 |
+ 0.727 |
+ 0.740 |
+ 0.361 |
+ 0.622 |
+ 0.672 |
+ 0.709 |
+ 0.680 |
+ 0.665 |
+ 0.714 |
+ 0.339 |
+ 0.373 |
+ 0.572 |
+ 0.650 |
+ 0.649 |
+ 0.621 |
+ 0.776 |
+ 0.392 |
+ 0.557 |
+ 0.708 |
+ 0.696 |
+ 0.448 |
+ 0.672 |
+ 0.713 |
+ 0.704 |
+ 0.631 |
+ 0.635 |
+ 0.710 |
+ 0.711 |
+ 0.563 |
+ 0.471 |
+ 0.496 |
+ 0.660 |
+ 0.589 |
+ 0.591 |
+ 0.682 |
+ 0.716 |
+ 0.711 |
+ 0.737 |
+ 0.368 |
+ 0.368 |
+ 0.657 |
+ 0.568 |
+ 0.735 |
+ 0.717 |
+ 0.719 |
+ 0.689 |
+ 0.672 |
+ 0.716 |
+ 0.653 |
+ 0.733 |
+ 0.658 |
+ 0.659 |
+ 0.659 |
+ 0.660 |
+ 0.644 |
+ 0.667 |
+ 0.651 |
+ 0.486 |
+ 209 |
+ Activity |
+ AICDA_HUMAN |
+ Medium |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O |
+ 0.667 |
+ 0.692 |
+ 0.693 |
+ 0.687 |
+ 0.676 |
+ 0.682 |
+ 0.330 |
+ 0.535 |
+ 0.588 |
+ 0.598 |
+ 0.670 |
+ 0.542 |
+ 0.699 |
+ 0.394 |
+ 0.639 |
+ 0.636 |
+ 0.609 |
+ 0.628 |
+ 0.610 |
+ 0.504 |
+ 0.485 |
+ 0.385 |
+ 0.440 |
+ 0.447 |
+ 0.317 |
+ 0.419 |
+ 0.444 |
+ 0.416 |
+ 0.494 |
+ 0.682 |
+ 0.610 |
+ 0.598 |
+ 0.524 |
+ 0.356 |
+ 0.428 |
+ 0.334 |
+ 0.684 |
+ 0.688 |
+ 0.661 |
+ 0.697 |
+ 0.695 |
+ 0.669 |
+ 0.409 |
+ 0.407 |
+ 0.556 |
+ 0.533 |
+ 0.603 |
+ 0.534 |
+ 0.689 |
+ 0.650 |
+ 0.705 |
+ 0.642 |
+ 0.576 |
+ 0.657 |
+ 0.649 |
+ 0.654 |
+ 0.653 |
+ 0.648 |
+ 0.660 |
+ 0.651 |
+ 0.652 |
+ 0.674 |
+ 2972 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ AMIE_PSEAE_Wrenbeck_2017 |
+ 0.624 |
+ 0.750 |
+ 0.779 |
+ 0.788 |
+ 0.769 |
+ 0.763 |
+ 0.534 |
+ 0.741 |
+ 0.713 |
+ 0.825 |
+ 0.790 |
+ 0.833 |
+ 0.862 |
+ 0.653 |
+ 0.693 |
+ 0.702 |
+ 0.780 |
+ 0.858 |
+ 0.837 |
+ 0.759 |
+ 0.794 |
+ 0.794 |
+ 0.796 |
+ 0.800 |
+ 0.800 |
+ 0.808 |
+ 0.797 |
+ 0.802 |
+ 0.788 |
+ 0.814 |
+ 0.824 |
+ 0.794 |
+ 0.635 |
+ 0.821 |
+ 0.764 |
+ 0.735 |
+ 0.814 |
+ 0.765 |
+ 0.745 |
+ 0.823 |
+ 0.787 |
+ 0.775 |
+ 0.668 |
+ 0.528 |
+ 0.735 |
+ 0.695 |
+ 0.683 |
+ 0.735 |
+ 0.709 |
+ 0.608 |
+ 0.783 |
+ 0.768 |
+ 0.771 |
+ 0.774 |
+ 0.769 |
+ 0.779 |
+ 0.764 |
+ 0.772 |
+ 0.785 |
+ 0.783 |
+ 0.808 |
+ 0.696 |
+ 6227 |
+ Activity |
+ AMIE_PSEAE |
+ High |
+ Prokaryote |
+
+
+ ANCSZ_Hobbs_2022 |
+ 0.768 |
+ 0.753 |
+ 0.715 |
+ 0.730 |
+ 0.754 |
+ 0.756 |
+ 0.754 |
+ 0.727 |
+ 0.744 |
+ 0.756 |
+ 0.761 |
+ 0.756 |
+ 0.768 |
+ 0.752 |
+ 0.783 |
+ 0.786 |
+ 0.791 |
+ 0.789 |
+ 0.781 |
+ 0.508 |
+ 0.723 |
+ 0.744 |
+ 0.751 |
+ 0.747 |
+ 0.747 |
+ 0.756 |
+ 0.752 |
+ 0.732 |
+ 0.740 |
+ 0.777 |
+ 0.766 |
+ 0.758 |
+ 0.649 |
+ 0.732 |
+ 0.751 |
+ 0.736 |
+ 0.760 |
+ 0.772 |
+ 0.767 |
+ 0.764 |
+ 0.773 |
+ 0.766 |
+ 0.763 |
+ 0.697 |
+ 0.739 |
+ 0.750 |
+ 0.731 |
+ 0.728 |
+ 0.706 |
+ 0.587 |
+ 0.771 |
+ 0.767 |
+ 0.780 |
+ 0.779 |
+ 0.775 |
+ 0.781 |
+ 0.773 |
+ 0.778 |
+ 0.775 |
+ 0.783 |
+ 0.783 |
+ 0.787 |
+ 4670 |
+ Activity |
+ ANCSZ |
+ Medium |
+ Eukaryote |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY |
+ 0.614 |
+ 0.697 |
+ 0.700 |
+ 0.711 |
+ 0.710 |
+ 0.704 |
+ 0.623 |
+ 0.677 |
+ 0.715 |
+ 0.735 |
+ 0.688 |
+ 0.638 |
+ 0.691 |
+ 0.638 |
+ 0.619 |
+ 0.739 |
+ 0.741 |
+ 0.748 |
+ 0.706 |
+ 0.706 |
+ 0.667 |
+ 0.720 |
+ 0.726 |
+ 0.699 |
+ 0.657 |
+ 0.711 |
+ 0.726 |
+ 0.727 |
+ 0.705 |
+ 0.723 |
+ 0.703 |
+ 0.656 |
+ 0.525 |
+ 0.677 |
+ 0.686 |
+ 0.709 |
+ 0.678 |
+ 0.686 |
+ 0.703 |
+ 0.719 |
+ 0.717 |
+ 0.725 |
+ 0.618 |
+ 0.604 |
+ 0.716 |
+ 0.694 |
+ 0.871 |
+ 0.824 |
+ 0.879 |
+ 0.828 |
+ 0.749 |
+ 0.758 |
+ 0.756 |
+ 0.762 |
+ 0.750 |
+ 0.749 |
+ 0.737 |
+ 0.759 |
+ 0.735 |
+ 0.759 |
+ 0.797 |
+ 0.741 |
+ 1287 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B2L11_HUMAN_Dutta_2010_binding-Mcl-1 |
+ 0.853 |
+ 0.659 |
+ 0.823 |
+ 0.827 |
+ 0.764 |
+ 0.813 |
+ 0.645 |
+ 0.769 |
+ 0.619 |
+ 0.612 |
+ 0.642 |
+ 0.519 |
+ 0.685 |
+ 0.607 |
+ 0.586 |
+ 0.606 |
+ 0.670 |
+ 0.686 |
+ 0.674 |
+ 0.664 |
+ 0.647 |
+ 0.568 |
+ 0.670 |
+ 0.788 |
+ 0.563 |
+ 0.646 |
+ 0.711 |
+ 0.713 |
+ 0.714 |
+ 0.890 |
+ 0.702 |
+ 0.690 |
+ 0.647 |
+ 0.648 |
+ 0.635 |
+ 0.681 |
+ 0.791 |
+ 0.760 |
+ 0.709 |
+ 0.831 |
+ 0.816 |
+ 0.712 |
+ 0.616 |
+ 0.591 |
+ 0.632 |
+ 0.605 |
+ 0.649 |
+ 0.655 |
+ 0.709 |
+ 0.509 |
+ 0.695 |
+ 0.686 |
+ 0.729 |
+ 0.665 |
+ 0.627 |
+ 0.757 |
+ 0.662 |
+ 0.667 |
+ 0.738 |
+ 0.706 |
+ 0.635 |
+ 0.621 |
+ 170 |
+ Binding |
+ B2L11_HUMAN |
+ Low |
+ Human |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0 |
+ 0.605 |
+ 0.688 |
+ 0.722 |
+ 0.725 |
+ 0.715 |
+ 0.723 |
+ 0.607 |
+ 0.650 |
+ 0.724 |
+ 0.735 |
+ 0.736 |
+ 0.724 |
+ 0.739 |
+ 0.693 |
+ 0.724 |
+ 0.716 |
+ 0.754 |
+ 0.744 |
+ 0.768 |
+ 0.685 |
+ 0.687 |
+ 0.688 |
+ 0.706 |
+ 0.691 |
+ 0.560 |
+ 0.711 |
+ 0.718 |
+ 0.709 |
+ 0.678 |
+ 0.688 |
+ 0.698 |
+ 0.684 |
+ 0.645 |
+ 0.684 |
+ 0.636 |
+ 0.653 |
+ 0.700 |
+ 0.675 |
+ 0.687 |
+ 0.729 |
+ 0.710 |
+ 0.721 |
+ 0.592 |
+ 0.577 |
+ 0.671 |
+ 0.617 |
+ 0.801 |
+ 0.748 |
+ 0.857 |
+ 0.824 |
+ 0.743 |
+ 0.752 |
+ 0.756 |
+ 0.764 |
+ 0.757 |
+ 0.758 |
+ 0.758 |
+ 0.754 |
+ 0.764 |
+ 0.758 |
+ 0.789 |
+ 0.807 |
+ 2069 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU |
+ 0.698 |
+ 0.731 |
+ 0.700 |
+ 0.705 |
+ 0.704 |
+ 0.711 |
+ 0.563 |
+ 0.637 |
+ 0.704 |
+ 0.772 |
+ 0.714 |
+ 0.677 |
+ 0.717 |
+ 0.591 |
+ 0.603 |
+ 0.662 |
+ 0.766 |
+ 0.778 |
+ 0.732 |
+ 0.725 |
+ 0.569 |
+ 0.693 |
+ 0.623 |
+ 0.707 |
+ 0.608 |
+ 0.733 |
+ 0.655 |
+ 0.716 |
+ 0.737 |
+ 0.827 |
+ 0.796 |
+ 0.770 |
+ 0.524 |
+ 0.634 |
+ 0.660 |
+ 0.734 |
+ 0.734 |
+ 0.734 |
+ 0.759 |
+ 0.717 |
+ 0.718 |
+ 0.739 |
+ 0.563 |
+ 0.563 |
+ 0.609 |
+ 0.598 |
+ 0.784 |
+ 0.759 |
+ 0.845 |
+ 0.790 |
+ 0.788 |
+ 0.792 |
+ 0.797 |
+ 0.805 |
+ 0.798 |
+ 0.807 |
+ 0.780 |
+ 0.794 |
+ 0.789 |
+ 0.797 |
+ 0.769 |
+ 0.648 |
+ 1572 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Deng_2012 |
+ 0.667 |
+ 0.763 |
+ 0.764 |
+ 0.773 |
+ 0.759 |
+ 0.764 |
+ 0.542 |
+ 0.653 |
+ 0.764 |
+ 0.784 |
+ 0.762 |
+ 0.751 |
+ 0.761 |
+ 0.628 |
+ 0.683 |
+ 0.737 |
+ 0.772 |
+ 0.735 |
+ 0.686 |
+ 0.733 |
+ 0.691 |
+ 0.698 |
+ 0.706 |
+ 0.704 |
+ 0.700 |
+ 0.730 |
+ 0.732 |
+ 0.714 |
+ 0.678 |
+ 0.742 |
+ 0.770 |
+ 0.751 |
+ 0.552 |
+ 0.698 |
+ 0.708 |
+ 0.693 |
+ 0.735 |
+ 0.747 |
+ 0.748 |
+ 0.766 |
+ 0.775 |
+ 0.773 |
+ 0.676 |
+ 0.520 |
+ 0.749 |
+ 0.710 |
+ 0.718 |
+ 0.755 |
+ 0.741 |
+ 0.630 |
+ 0.770 |
+ 0.774 |
+ 0.776 |
+ 0.784 |
+ 0.781 |
+ 0.779 |
+ 0.781 |
+ 0.771 |
+ 0.776 |
+ 0.785 |
+ 0.781 |
+ 0.715 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Firnberg_2014 |
+ 0.723 |
+ 0.859 |
+ 0.868 |
+ 0.876 |
+ 0.852 |
+ 0.863 |
+ 0.562 |
+ 0.729 |
+ 0.859 |
+ 0.877 |
+ 0.858 |
+ 0.836 |
+ 0.856 |
+ 0.697 |
+ 0.773 |
+ 0.830 |
+ 0.876 |
+ 0.795 |
+ 0.719 |
+ 0.832 |
+ 0.784 |
+ 0.779 |
+ 0.760 |
+ 0.758 |
+ 0.799 |
+ 0.827 |
+ 0.831 |
+ 0.790 |
+ 0.721 |
+ 0.845 |
+ 0.889 |
+ 0.888 |
+ 0.588 |
+ 0.778 |
+ 0.761 |
+ 0.732 |
+ 0.822 |
+ 0.815 |
+ 0.811 |
+ 0.865 |
+ 0.864 |
+ 0.864 |
+ 0.769 |
+ 0.519 |
+ 0.855 |
+ 0.799 |
+ 0.810 |
+ 0.863 |
+ 0.851 |
+ 0.667 |
+ 0.871 |
+ 0.867 |
+ 0.870 |
+ 0.882 |
+ 0.882 |
+ 0.882 |
+ 0.885 |
+ 0.870 |
+ 0.880 |
+ 0.888 |
+ 0.884 |
+ 0.796 |
+ 4783 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Jacquier_2013 |
+ 0.727 |
+ 0.853 |
+ 0.857 |
+ 0.869 |
+ 0.855 |
+ 0.868 |
+ 0.551 |
+ 0.716 |
+ 0.843 |
+ 0.863 |
+ 0.835 |
+ 0.833 |
+ 0.843 |
+ 0.682 |
+ 0.755 |
+ 0.819 |
+ 0.858 |
+ 0.801 |
+ 0.749 |
+ 0.822 |
+ 0.781 |
+ 0.768 |
+ 0.759 |
+ 0.758 |
+ 0.784 |
+ 0.816 |
+ 0.811 |
+ 0.784 |
+ 0.729 |
+ 0.791 |
+ 0.866 |
+ 0.851 |
+ 0.577 |
+ 0.757 |
+ 0.766 |
+ 0.747 |
+ 0.807 |
+ 0.816 |
+ 0.821 |
+ 0.860 |
+ 0.867 |
+ 0.871 |
+ 0.739 |
+ 0.524 |
+ 0.837 |
+ 0.783 |
+ 0.771 |
+ 0.830 |
+ 0.813 |
+ 0.655 |
+ 0.837 |
+ 0.842 |
+ 0.845 |
+ 0.847 |
+ 0.845 |
+ 0.847 |
+ 0.855 |
+ 0.842 |
+ 0.840 |
+ 0.854 |
+ 0.862 |
+ 0.790 |
+ 989 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Stiffler_2015 |
+ 0.728 |
+ 0.862 |
+ 0.875 |
+ 0.882 |
+ 0.857 |
+ 0.868 |
+ 0.563 |
+ 0.730 |
+ 0.863 |
+ 0.881 |
+ 0.865 |
+ 0.846 |
+ 0.867 |
+ 0.704 |
+ 0.782 |
+ 0.839 |
+ 0.883 |
+ 0.802 |
+ 0.724 |
+ 0.837 |
+ 0.793 |
+ 0.786 |
+ 0.757 |
+ 0.761 |
+ 0.809 |
+ 0.835 |
+ 0.838 |
+ 0.794 |
+ 0.724 |
+ 0.851 |
+ 0.897 |
+ 0.895 |
+ 0.587 |
+ 0.787 |
+ 0.758 |
+ 0.729 |
+ 0.830 |
+ 0.815 |
+ 0.812 |
+ 0.871 |
+ 0.868 |
+ 0.869 |
+ 0.776 |
+ 0.518 |
+ 0.863 |
+ 0.807 |
+ 0.816 |
+ 0.867 |
+ 0.854 |
+ 0.675 |
+ 0.875 |
+ 0.866 |
+ 0.872 |
+ 0.882 |
+ 0.882 |
+ 0.884 |
+ 0.886 |
+ 0.873 |
+ 0.882 |
+ 0.890 |
+ 0.891 |
+ 0.805 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BRCA1_HUMAN_Findlay_2018 |
+ 0.824 |
+ 0.756 |
+ 0.788 |
+ 0.774 |
+ 0.776 |
+ 0.800 |
+ 0.566 |
+ 0.683 |
+ 0.760 |
+ 0.772 |
+ 0.836 |
+ 0.775 |
+ 0.805 |
+ 0.582 |
+ 0.767 |
+ 0.823 |
+ 0.860 |
+ 0.845 |
+ 0.788 |
+ 0.699 |
+ 0.577 |
+ 0.762 |
+ 0.778 |
+ 0.774 |
+ 0.705 |
+ 0.820 |
+ 0.814 |
+ 0.801 |
+ 0.815 |
+ 0.776 |
+ 0.860 |
+ 0.832 |
+ 0.658 |
+ 0.603 |
+ 0.621 |
+ 0.790 |
+ 0.792 |
+ 0.792 |
+ 0.846 |
+ 0.806 |
+ 0.806 |
+ 0.855 |
+ 0.642 |
+ 0.580 |
+ 0.865 |
+ 0.815 |
+ 0.804 |
+ 0.809 |
+ 0.570 |
+ 0.587 |
+ 0.856 |
+ 0.850 |
+ 0.860 |
+ 0.852 |
+ 0.861 |
+ 0.862 |
+ 0.858 |
+ 0.857 |
+ 0.853 |
+ 0.864 |
+ 0.864 |
+ 0.832 |
+ 1837 |
+ OrganismalFitness |
+ BRCA1_HUMAN |
+ Low |
+ Human |
+
+
+ BRCA2_HUMAN_Erwood_2022_HEK293T |
+ 0.908 |
+ 0.816 |
+ 0.866 |
+ 0.861 |
+ 0.850 |
+ 0.875 |
+ 0.588 |
+ 0.893 |
+ 0.702 |
+ 0.556 |
+ 0.906 |
+ 0.593 |
+ 0.526 |
+ 0.529 |
+ 0.528 |
+ 0.914 |
+ 0.940 |
+ 0.904 |
+ 0.899 |
+ 0.584 |
+ 0.515 |
+ 0.921 |
+ 0.923 |
+ 0.861 |
+ 0.928 |
+ 0.888 |
+ 0.896 |
+ 0.887 |
+ 0.587 |
+ 0.860 |
+ 0.785 |
+ 0.850 |
+ 0.558 |
+ 0.549 |
+ 0.581 |
+ 0.590 |
+ 0.822 |
+ 0.817 |
+ 0.825 |
+ 0.833 |
+ 0.819 |
+ 0.837 |
+ 0.590 |
+ 0.553 |
+ 0.892 |
+ 0.492 |
+ 0.480 |
+ 0.503 |
+ 0.279 |
+ 0.760 |
+ 0.871 |
+ 0.899 |
+ 0.896 |
+ 0.878 |
+ 0.876 |
+ 0.874 |
+ 0.868 |
+ 0.839 |
+ 0.841 |
+ 0.885 |
+ 0.500 |
+ 0.507 |
+ 265 |
+ OrganismalFitness |
+ BRCA2_HUMAN |
+ NaN |
+ Human |
+
+
+ C6KNH7_9INFA_Lee_2018 |
+ 0.697 |
+ 0.687 |
+ 0.676 |
+ 0.680 |
+ 0.719 |
+ 0.720 |
+ 0.485 |
+ 0.731 |
+ 0.686 |
+ 0.689 |
+ 0.526 |
+ 0.717 |
+ 0.747 |
+ 0.490 |
+ 0.488 |
+ 0.495 |
+ 0.750 |
+ 0.705 |
+ 0.716 |
+ 0.674 |
+ 0.701 |
+ 0.685 |
+ 0.693 |
+ 0.678 |
+ 0.621 |
+ 0.734 |
+ 0.715 |
+ 0.736 |
+ 0.690 |
+ 0.742 |
+ 0.731 |
+ 0.681 |
+ 0.559 |
+ 0.683 |
+ 0.691 |
+ 0.701 |
+ 0.710 |
+ 0.713 |
+ 0.719 |
+ 0.724 |
+ 0.724 |
+ 0.729 |
+ 0.487 |
+ 0.481 |
+ 0.644 |
+ 0.498 |
+ 0.761 |
+ 0.776 |
+ 0.777 |
+ 0.619 |
+ 0.756 |
+ 0.756 |
+ 0.767 |
+ 0.769 |
+ 0.774 |
+ 0.763 |
+ 0.772 |
+ 0.768 |
+ 0.768 |
+ 0.773 |
+ 0.657 |
+ 0.593 |
+ 10754 |
+ OrganismalFitness |
+ C6KNH7_9INFA |
+ Medium |
+ Virus |
+
+
+ CALM1_HUMAN_Weile_2017 |
+ 0.594 |
+ 0.631 |
+ 0.630 |
+ 0.627 |
+ 0.630 |
+ 0.631 |
+ 0.588 |
+ 0.610 |
+ 0.635 |
+ 0.644 |
+ 0.646 |
+ 0.629 |
+ 0.650 |
+ 0.580 |
+ 0.602 |
+ 0.602 |
+ 0.615 |
+ 0.623 |
+ 0.627 |
+ 0.630 |
+ 0.599 |
+ 0.629 |
+ 0.636 |
+ 0.644 |
+ 0.631 |
+ 0.662 |
+ 0.660 |
+ 0.676 |
+ 0.684 |
+ 0.641 |
+ 0.619 |
+ 0.580 |
+ 0.553 |
+ 0.621 |
+ 0.648 |
+ 0.674 |
+ 0.621 |
+ 0.639 |
+ 0.653 |
+ 0.634 |
+ 0.638 |
+ 0.639 |
+ 0.619 |
+ 0.580 |
+ 0.658 |
+ 0.643 |
+ 0.552 |
+ 0.580 |
+ 0.592 |
+ 0.548 |
+ 0.598 |
+ 0.598 |
+ 0.599 |
+ 0.593 |
+ 0.600 |
+ 0.594 |
+ 0.596 |
+ 0.602 |
+ 0.593 |
+ 0.600 |
+ 0.659 |
+ 0.639 |
+ 1813 |
+ OrganismalFitness |
+ CALM1_HUMAN |
+ High |
+ Human |
+
+
+ CAPSD_AAV2S_Sinai_2021 |
+ 0.698 |
+ 0.670 |
+ 0.665 |
+ 0.688 |
+ 0.670 |
+ 0.662 |
+ 0.703 |
+ 0.724 |
+ 0.660 |
+ 0.679 |
+ 0.588 |
+ 0.609 |
+ 0.610 |
+ 0.643 |
+ 0.659 |
+ 0.610 |
+ 0.649 |
+ 0.594 |
+ 0.553 |
+ 0.630 |
+ 0.606 |
+ 0.641 |
+ 0.648 |
+ 0.650 |
+ 0.614 |
+ 0.610 |
+ 0.644 |
+ 0.614 |
+ 0.717 |
+ 0.719 |
+ 0.586 |
+ 0.582 |
+ 0.560 |
+ 0.612 |
+ 0.644 |
+ 0.757 |
+ 0.686 |
+ 0.689 |
+ 0.740 |
+ 0.668 |
+ 0.670 |
+ 0.716 |
+ 0.558 |
+ 0.594 |
+ 0.644 |
+ 0.564 |
+ 0.713 |
+ 0.703 |
+ 0.673 |
+ 0.666 |
+ 0.614 |
+ 0.610 |
+ 0.603 |
+ 0.615 |
+ 0.604 |
+ 0.605 |
+ 0.607 |
+ 0.602 |
+ 0.613 |
+ 0.609 |
+ 0.668 |
+ 0.629 |
+ 42328 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAR11_HUMAN_Meitlis_2020_gof |
+ 0.551 |
+ 0.464 |
+ 0.464 |
+ 0.467 |
+ 0.459 |
+ 0.466 |
+ 0.591 |
+ 0.403 |
+ 0.519 |
+ 0.527 |
+ 0.522 |
+ 0.589 |
+ 0.548 |
+ 0.545 |
+ 0.556 |
+ 0.653 |
+ 0.570 |
+ 0.570 |
+ 0.583 |
+ 0.545 |
+ 0.564 |
+ 0.475 |
+ 0.511 |
+ 0.525 |
+ 0.365 |
+ 0.478 |
+ 0.498 |
+ 0.476 |
+ 0.518 |
+ 0.451 |
+ 0.512 |
+ 0.514 |
+ 0.593 |
+ 0.524 |
+ 0.471 |
+ 0.425 |
+ 0.518 |
+ 0.492 |
+ 0.450 |
+ 0.488 |
+ 0.475 |
+ 0.442 |
+ 0.557 |
+ 0.516 |
+ 0.517 |
+ 0.623 |
+ 0.589 |
+ 0.524 |
+ 0.592 |
+ 0.548 |
+ 0.597 |
+ 0.593 |
+ 0.595 |
+ 0.574 |
+ 0.576 |
+ 0.581 |
+ 0.575 |
+ 0.584 |
+ 0.589 |
+ 0.587 |
+ 0.654 |
+ 0.643 |
+ 2374 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAR11_HUMAN_Meitlis_2020_lof |
+ 0.752 |
+ 0.711 |
+ 0.689 |
+ 0.679 |
+ 0.713 |
+ 0.721 |
+ 0.541 |
+ 0.509 |
+ 0.654 |
+ 0.652 |
+ 0.761 |
+ 0.770 |
+ 0.765 |
+ 0.524 |
+ 0.552 |
+ 0.740 |
+ 0.783 |
+ 0.794 |
+ 0.802 |
+ 0.604 |
+ 0.572 |
+ 0.698 |
+ 0.676 |
+ 0.658 |
+ 0.605 |
+ 0.686 |
+ 0.690 |
+ 0.727 |
+ 0.598 |
+ 0.707 |
+ 0.750 |
+ 0.722 |
+ 0.700 |
+ 0.490 |
+ 0.675 |
+ 0.644 |
+ 0.678 |
+ 0.707 |
+ 0.684 |
+ 0.711 |
+ 0.717 |
+ 0.697 |
+ 0.539 |
+ 0.512 |
+ 0.746 |
+ 0.661 |
+ 0.725 |
+ 0.757 |
+ 0.742 |
+ 0.582 |
+ 0.789 |
+ 0.785 |
+ 0.787 |
+ 0.786 |
+ 0.781 |
+ 0.792 |
+ 0.779 |
+ 0.787 |
+ 0.795 |
+ 0.793 |
+ 0.803 |
+ 0.716 |
+ 2395 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAS9_STRP1_Spencer_2017_positive |
+ 0.580 |
+ 0.594 |
+ 0.587 |
+ 0.589 |
+ 0.590 |
+ 0.593 |
+ 0.516 |
+ 0.549 |
+ 0.593 |
+ 0.595 |
+ 0.588 |
+ 0.532 |
+ 0.533 |
+ 0.528 |
+ 0.544 |
+ 0.578 |
+ 0.594 |
+ 0.597 |
+ 0.600 |
+ 0.504 |
+ 0.519 |
+ 0.537 |
+ 0.585 |
+ 0.560 |
+ 0.517 |
+ 0.599 |
+ 0.592 |
+ 0.589 |
+ 0.534 |
+ 0.596 |
+ 0.603 |
+ 0.593 |
+ 0.513 |
+ 0.514 |
+ 0.521 |
+ 0.582 |
+ 0.584 |
+ 0.584 |
+ 0.593 |
+ 0.595 |
+ 0.595 |
+ 0.598 |
+ 0.528 |
+ 0.512 |
+ 0.592 |
+ 0.551 |
+ 0.551 |
+ 0.592 |
+ 0.513 |
+ 0.517 |
+ 0.590 |
+ 0.591 |
+ 0.591 |
+ 0.592 |
+ 0.591 |
+ 0.592 |
+ 0.590 |
+ 0.590 |
+ 0.591 |
+ 0.593 |
+ 0.587 |
+ 0.556 |
+ 8117 |
+ Activity |
+ CAS9_STRP1 |
+ Medium |
+ Prokaryote |
+
+
+ CASP3_HUMAN_Roychowdhury_2020 |
+ 0.694 |
+ 0.776 |
+ 0.814 |
+ 0.818 |
+ 0.822 |
+ 0.825 |
+ 0.532 |
+ 0.735 |
+ 0.825 |
+ 0.833 |
+ 0.799 |
+ 0.805 |
+ 0.818 |
+ 0.637 |
+ 0.792 |
+ 0.828 |
+ 0.824 |
+ 0.783 |
+ 0.751 |
+ 0.813 |
+ 0.610 |
+ 0.748 |
+ 0.757 |
+ 0.779 |
+ 0.756 |
+ 0.781 |
+ 0.785 |
+ 0.757 |
+ 0.784 |
+ 0.809 |
+ 0.804 |
+ 0.773 |
+ 0.621 |
+ 0.537 |
+ 0.739 |
+ 0.765 |
+ 0.768 |
+ 0.782 |
+ 0.797 |
+ 0.821 |
+ 0.816 |
+ 0.824 |
+ 0.709 |
+ 0.492 |
+ 0.813 |
+ 0.784 |
+ 0.707 |
+ 0.780 |
+ 0.751 |
+ 0.638 |
+ 0.781 |
+ 0.787 |
+ 0.791 |
+ 0.787 |
+ 0.791 |
+ 0.785 |
+ 0.797 |
+ 0.785 |
+ 0.787 |
+ 0.797 |
+ 0.833 |
+ 0.762 |
+ 1567 |
+ Activity |
+ CASP3_HUMAN |
+ High |
+ Human |
+
+
+ CASP7_HUMAN_Roychowdhury_2020 |
+ 0.713 |
+ 0.782 |
+ 0.835 |
+ 0.835 |
+ 0.831 |
+ 0.833 |
+ 0.537 |
+ 0.767 |
+ 0.828 |
+ 0.842 |
+ 0.815 |
+ 0.835 |
+ 0.845 |
+ 0.669 |
+ 0.826 |
+ 0.838 |
+ 0.840 |
+ 0.816 |
+ 0.803 |
+ 0.832 |
+ 0.613 |
+ 0.799 |
+ 0.796 |
+ 0.811 |
+ 0.798 |
+ 0.819 |
+ 0.822 |
+ 0.816 |
+ 0.810 |
+ 0.850 |
+ 0.830 |
+ 0.800 |
+ 0.682 |
+ 0.552 |
+ 0.804 |
+ 0.787 |
+ 0.782 |
+ 0.831 |
+ 0.822 |
+ 0.825 |
+ 0.847 |
+ 0.842 |
+ 0.760 |
+ 0.509 |
+ 0.848 |
+ 0.825 |
+ 0.763 |
+ 0.835 |
+ 0.817 |
+ 0.674 |
+ 0.823 |
+ 0.816 |
+ 0.810 |
+ 0.827 |
+ 0.836 |
+ 0.833 |
+ 0.827 |
+ 0.817 |
+ 0.828 |
+ 0.835 |
+ 0.847 |
+ 0.793 |
+ 1680 |
+ Activity |
+ CASP7_HUMAN |
+ Medium |
+ Human |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI |
+ 0.796 |
+ 0.813 |
+ 0.850 |
+ 0.853 |
+ 0.860 |
+ 0.863 |
+ 0.870 |
+ 0.818 |
+ 0.810 |
+ 0.791 |
+ 0.847 |
+ 0.852 |
+ 0.864 |
+ 0.798 |
+ 0.869 |
+ 0.870 |
+ 0.866 |
+ 0.837 |
+ 0.862 |
+ 0.521 |
+ 0.844 |
+ 0.835 |
+ 0.841 |
+ 0.836 |
+ 0.857 |
+ 0.852 |
+ 0.835 |
+ 0.817 |
+ 0.829 |
+ 0.866 |
+ 0.846 |
+ 0.848 |
+ 0.734 |
+ 0.851 |
+ 0.836 |
+ 0.853 |
+ 0.862 |
+ 0.853 |
+ 0.869 |
+ 0.856 |
+ 0.854 |
+ 0.864 |
+ 0.748 |
+ 0.732 |
+ 0.691 |
+ 0.758 |
+ 0.760 |
+ 0.744 |
+ 0.873 |
+ 0.799 |
+ 0.868 |
+ 0.866 |
+ 0.878 |
+ 0.872 |
+ 0.883 |
+ 0.868 |
+ 0.869 |
+ 0.862 |
+ 0.865 |
+ 0.874 |
+ 0.851 |
+ 0.874 |
+ 1903 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X |
+ 0.854 |
+ 0.892 |
+ 0.904 |
+ 0.906 |
+ 0.891 |
+ 0.899 |
+ 0.748 |
+ 0.832 |
+ 0.918 |
+ 0.921 |
+ 0.911 |
+ 0.893 |
+ 0.888 |
+ 0.785 |
+ 0.860 |
+ 0.898 |
+ 0.897 |
+ 0.905 |
+ 0.898 |
+ 0.912 |
+ 0.769 |
+ 0.778 |
+ 0.866 |
+ 0.867 |
+ 0.790 |
+ 0.852 |
+ 0.851 |
+ 0.852 |
+ 0.884 |
+ 0.902 |
+ 0.899 |
+ 0.868 |
+ 0.516 |
+ 0.767 |
+ 0.834 |
+ 0.869 |
+ 0.877 |
+ 0.881 |
+ 0.895 |
+ 0.903 |
+ 0.900 |
+ 0.908 |
+ 0.772 |
+ 0.663 |
+ 0.828 |
+ 0.843 |
+ 0.905 |
+ 0.893 |
+ 0.959 |
+ 0.929 |
+ 0.929 |
+ 0.928 |
+ 0.930 |
+ 0.927 |
+ 0.933 |
+ 0.931 |
+ 0.930 |
+ 0.928 |
+ 0.926 |
+ 0.932 |
+ 0.902 |
+ 0.951 |
+ 2068 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBS_HUMAN_Sun_2020 |
+ 0.687 |
+ 0.701 |
+ 0.707 |
+ 0.717 |
+ 0.713 |
+ 0.717 |
+ 0.613 |
+ 0.644 |
+ 0.710 |
+ 0.712 |
+ 0.696 |
+ 0.696 |
+ 0.710 |
+ 0.545 |
+ 0.627 |
+ 0.689 |
+ 0.692 |
+ 0.684 |
+ 0.690 |
+ 0.696 |
+ 0.699 |
+ 0.646 |
+ 0.659 |
+ 0.662 |
+ 0.696 |
+ 0.652 |
+ 0.662 |
+ 0.652 |
+ 0.674 |
+ 0.714 |
+ 0.711 |
+ 0.698 |
+ 0.611 |
+ 0.702 |
+ 0.656 |
+ 0.648 |
+ 0.715 |
+ 0.685 |
+ 0.683 |
+ 0.724 |
+ 0.705 |
+ 0.706 |
+ 0.673 |
+ 0.534 |
+ 0.714 |
+ 0.711 |
+ 0.633 |
+ 0.655 |
+ 0.676 |
+ 0.547 |
+ 0.683 |
+ 0.684 |
+ 0.688 |
+ 0.691 |
+ 0.691 |
+ 0.690 |
+ 0.687 |
+ 0.688 |
+ 0.689 |
+ 0.693 |
+ 0.720 |
+ 0.664 |
+ 7217 |
+ OrganismalFitness |
+ CBS_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28 |
+ 0.752 |
+ 0.773 |
+ 0.844 |
+ 0.865 |
+ 0.811 |
+ 0.847 |
+ 0.339 |
+ 0.819 |
+ 0.812 |
+ 0.808 |
+ 0.812 |
+ 0.829 |
+ 0.828 |
+ 0.344 |
+ 0.878 |
+ 0.855 |
+ 0.844 |
+ 0.801 |
+ 0.802 |
+ 0.828 |
+ 0.753 |
+ 0.803 |
+ 0.797 |
+ 0.797 |
+ 0.783 |
+ 0.793 |
+ 0.799 |
+ 0.767 |
+ 0.793 |
+ 0.815 |
+ 0.845 |
+ 0.840 |
+ 0.719 |
+ 0.714 |
+ 0.750 |
+ 0.765 |
+ 0.802 |
+ 0.799 |
+ 0.810 |
+ 0.844 |
+ 0.856 |
+ 0.859 |
+ 0.816 |
+ 0.317 |
+ 0.682 |
+ 0.786 |
+ 0.540 |
+ 0.561 |
+ 0.796 |
+ 0.783 |
+ 0.853 |
+ 0.809 |
+ 0.850 |
+ 0.838 |
+ 0.861 |
+ 0.839 |
+ 0.839 |
+ 0.871 |
+ 0.835 |
+ 0.852 |
+ 0.826 |
+ 0.888 |
+ 2282 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CCDB_ECOLI_Adkar_2012 |
+ 0.705 |
+ 0.749 |
+ 0.757 |
+ 0.777 |
+ 0.767 |
+ 0.775 |
+ 0.470 |
+ 0.619 |
+ 0.706 |
+ 0.715 |
+ 0.745 |
+ 0.701 |
+ 0.749 |
+ 0.484 |
+ 0.477 |
+ 0.723 |
+ 0.761 |
+ 0.746 |
+ 0.629 |
+ 0.756 |
+ 0.514 |
+ 0.471 |
+ 0.431 |
+ 0.571 |
+ 0.396 |
+ 0.538 |
+ 0.484 |
+ 0.500 |
+ 0.713 |
+ 0.743 |
+ 0.817 |
+ 0.807 |
+ 0.558 |
+ 0.489 |
+ 0.507 |
+ 0.657 |
+ 0.743 |
+ 0.729 |
+ 0.750 |
+ 0.763 |
+ 0.753 |
+ 0.771 |
+ 0.497 |
+ 0.468 |
+ 0.747 |
+ 0.495 |
+ 0.650 |
+ 0.741 |
+ 0.670 |
+ 0.648 |
+ 0.743 |
+ 0.728 |
+ 0.733 |
+ 0.743 |
+ 0.743 |
+ 0.743 |
+ 0.743 |
+ 0.755 |
+ 0.768 |
+ 0.751 |
+ 0.746 |
+ 0.610 |
+ 1176 |
+ Activity |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCDB_ECOLI_Tripathi_2016 |
+ 0.809 |
+ 0.861 |
+ 0.872 |
+ 0.886 |
+ 0.867 |
+ 0.876 |
+ 0.496 |
+ 0.709 |
+ 0.786 |
+ 0.807 |
+ 0.851 |
+ 0.808 |
+ 0.843 |
+ 0.498 |
+ 0.502 |
+ 0.826 |
+ 0.865 |
+ 0.871 |
+ 0.757 |
+ 0.870 |
+ 0.528 |
+ 0.512 |
+ 0.447 |
+ 0.624 |
+ 0.425 |
+ 0.567 |
+ 0.482 |
+ 0.553 |
+ 0.841 |
+ 0.861 |
+ 0.891 |
+ 0.870 |
+ 0.577 |
+ 0.505 |
+ 0.597 |
+ 0.803 |
+ 0.844 |
+ 0.845 |
+ 0.880 |
+ 0.869 |
+ 0.870 |
+ 0.890 |
+ 0.507 |
+ 0.462 |
+ 0.850 |
+ 0.517 |
+ 0.727 |
+ 0.806 |
+ 0.740 |
+ 0.666 |
+ 0.847 |
+ 0.822 |
+ 0.822 |
+ 0.844 |
+ 0.844 |
+ 0.843 |
+ 0.837 |
+ 0.848 |
+ 0.869 |
+ 0.851 |
+ 0.849 |
+ 0.734 |
+ 1663 |
+ OrganismalFitness |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCR5_HUMAN_Gill_2023 |
+ 0.634 |
+ 0.637 |
+ 0.638 |
+ 0.639 |
+ 0.634 |
+ 0.638 |
+ 0.649 |
+ 0.644 |
+ 0.662 |
+ 0.668 |
+ 0.675 |
+ 0.664 |
+ 0.670 |
+ 0.651 |
+ 0.670 |
+ 0.674 |
+ 0.668 |
+ 0.662 |
+ 0.664 |
+ 0.670 |
+ 0.675 |
+ 0.673 |
+ 0.657 |
+ 0.659 |
+ 0.675 |
+ 0.670 |
+ 0.665 |
+ 0.673 |
+ 0.660 |
+ 0.674 |
+ 0.644 |
+ 0.619 |
+ 0.557 |
+ 0.675 |
+ 0.676 |
+ 0.678 |
+ 0.678 |
+ 0.682 |
+ 0.685 |
+ 0.658 |
+ 0.658 |
+ 0.659 |
+ 0.668 |
+ 0.590 |
+ 0.676 |
+ 0.670 |
+ 0.633 |
+ 0.647 |
+ 0.638 |
+ 0.585 |
+ 0.650 |
+ 0.647 |
+ 0.652 |
+ 0.652 |
+ 0.649 |
+ 0.652 |
+ 0.657 |
+ 0.657 |
+ 0.657 |
+ 0.657 |
+ 0.668 |
+ 0.671 |
+ 6137 |
+ Binding |
+ CCR5_HUMAN |
+ High |
+ Human |
+
+
+ CD19_HUMAN_Klesmith_2019_FMC_singles |
+ 0.623 |
+ 0.617 |
+ 0.612 |
+ 0.621 |
+ 0.633 |
+ 0.632 |
+ 0.574 |
+ 0.580 |
+ 0.597 |
+ 0.599 |
+ 0.591 |
+ 0.598 |
+ 0.613 |
+ 0.603 |
+ 0.620 |
+ 0.600 |
+ 0.599 |
+ 0.607 |
+ 0.589 |
+ 0.492 |
+ 0.597 |
+ 0.600 |
+ 0.628 |
+ 0.617 |
+ 0.593 |
+ 0.648 |
+ 0.616 |
+ 0.607 |
+ 0.630 |
+ 0.647 |
+ 0.599 |
+ 0.574 |
+ 0.581 |
+ 0.606 |
+ 0.631 |
+ 0.609 |
+ 0.630 |
+ 0.636 |
+ 0.628 |
+ 0.642 |
+ 0.644 |
+ 0.639 |
+ 0.610 |
+ 0.568 |
+ 0.594 |
+ 0.613 |
+ 0.728 |
+ 0.683 |
+ 0.720 |
+ 0.608 |
+ 0.644 |
+ 0.648 |
+ 0.643 |
+ 0.662 |
+ 0.654 |
+ 0.653 |
+ 0.658 |
+ 0.647 |
+ 0.653 |
+ 0.659 |
+ 0.755 |
+ 0.710 |
+ 3761 |
+ Binding |
+ CD19_HUMAN |
+ Low |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_abundance |
+ 0.748 |
+ 0.788 |
+ 0.800 |
+ 0.810 |
+ 0.800 |
+ 0.808 |
+ 0.763 |
+ 0.783 |
+ 0.800 |
+ 0.808 |
+ 0.758 |
+ 0.795 |
+ 0.814 |
+ 0.767 |
+ 0.803 |
+ 0.816 |
+ 0.822 |
+ 0.817 |
+ 0.800 |
+ 0.807 |
+ 0.783 |
+ 0.783 |
+ 0.800 |
+ 0.777 |
+ 0.792 |
+ 0.795 |
+ 0.796 |
+ 0.790 |
+ 0.787 |
+ 0.806 |
+ 0.792 |
+ 0.762 |
+ 0.611 |
+ 0.797 |
+ 0.787 |
+ 0.786 |
+ 0.811 |
+ 0.811 |
+ 0.811 |
+ 0.820 |
+ 0.818 |
+ 0.819 |
+ 0.790 |
+ 0.549 |
+ 0.768 |
+ 0.806 |
+ 0.782 |
+ 0.775 |
+ 0.810 |
+ 0.611 |
+ 0.806 |
+ 0.804 |
+ 0.812 |
+ 0.823 |
+ 0.821 |
+ 0.817 |
+ 0.817 |
+ 0.813 |
+ 0.814 |
+ 0.824 |
+ 0.822 |
+ 0.811 |
+ 6370 |
+ Expression |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_activity |
+ 0.758 |
+ 0.787 |
+ 0.804 |
+ 0.814 |
+ 0.797 |
+ 0.808 |
+ 0.794 |
+ 0.805 |
+ 0.800 |
+ 0.816 |
+ 0.752 |
+ 0.819 |
+ 0.831 |
+ 0.784 |
+ 0.825 |
+ 0.839 |
+ 0.835 |
+ 0.833 |
+ 0.804 |
+ 0.822 |
+ 0.791 |
+ 0.796 |
+ 0.808 |
+ 0.789 |
+ 0.806 |
+ 0.794 |
+ 0.796 |
+ 0.799 |
+ 0.791 |
+ 0.812 |
+ 0.800 |
+ 0.774 |
+ 0.609 |
+ 0.814 |
+ 0.798 |
+ 0.787 |
+ 0.829 |
+ 0.825 |
+ 0.820 |
+ 0.826 |
+ 0.822 |
+ 0.818 |
+ 0.813 |
+ 0.543 |
+ 0.775 |
+ 0.832 |
+ 0.802 |
+ 0.791 |
+ 0.833 |
+ 0.628 |
+ 0.817 |
+ 0.821 |
+ 0.824 |
+ 0.829 |
+ 0.833 |
+ 0.835 |
+ 0.831 |
+ 0.824 |
+ 0.829 |
+ 0.837 |
+ 0.834 |
+ 0.833 |
+ 6142 |
+ Binding |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM |
+ 0.695 |
+ 0.831 |
+ 0.803 |
+ 0.810 |
+ 0.797 |
+ 0.805 |
+ 0.653 |
+ 0.791 |
+ 0.817 |
+ 0.821 |
+ 0.842 |
+ 0.859 |
+ 0.879 |
+ 0.728 |
+ 0.886 |
+ 0.828 |
+ 0.781 |
+ 0.821 |
+ 0.820 |
+ 0.790 |
+ 0.756 |
+ 0.775 |
+ 0.836 |
+ 0.845 |
+ 0.741 |
+ 0.857 |
+ 0.849 |
+ 0.834 |
+ 0.835 |
+ 0.833 |
+ 0.818 |
+ 0.811 |
+ 0.670 |
+ 0.704 |
+ 0.784 |
+ 0.830 |
+ 0.779 |
+ 0.798 |
+ 0.831 |
+ 0.815 |
+ 0.828 |
+ 0.840 |
+ 0.674 |
+ 0.621 |
+ 0.754 |
+ 0.767 |
+ 0.785 |
+ 0.776 |
+ 0.839 |
+ 0.860 |
+ 0.846 |
+ 0.846 |
+ 0.842 |
+ 0.858 |
+ 0.860 |
+ 0.861 |
+ 0.853 |
+ 0.868 |
+ 0.842 |
+ 0.857 |
+ 0.848 |
+ 0.873 |
+ 3295 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX |
+ 0.666 |
+ 0.715 |
+ 0.699 |
+ 0.700 |
+ 0.725 |
+ 0.709 |
+ 0.551 |
+ 0.702 |
+ 0.755 |
+ 0.764 |
+ 0.736 |
+ 0.694 |
+ 0.672 |
+ 0.563 |
+ 0.695 |
+ 0.717 |
+ 0.763 |
+ 0.740 |
+ 0.730 |
+ 0.652 |
+ 0.564 |
+ 0.592 |
+ 0.597 |
+ 0.522 |
+ 0.538 |
+ 0.588 |
+ 0.601 |
+ 0.600 |
+ 0.690 |
+ 0.702 |
+ 0.733 |
+ 0.707 |
+ 0.620 |
+ 0.562 |
+ 0.556 |
+ 0.587 |
+ 0.678 |
+ 0.682 |
+ 0.684 |
+ 0.704 |
+ 0.705 |
+ 0.709 |
+ 0.562 |
+ 0.510 |
+ 0.713 |
+ 0.603 |
+ 0.748 |
+ 0.743 |
+ 0.813 |
+ 0.728 |
+ 0.769 |
+ 0.749 |
+ 0.746 |
+ 0.772 |
+ 0.753 |
+ 0.771 |
+ 0.765 |
+ 0.759 |
+ 0.755 |
+ 0.765 |
+ 0.790 |
+ 0.771 |
+ 1580 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ D7PM05_CLYGR_Somermeyer_2022 |
+ 0.781 |
+ 0.830 |
+ 0.828 |
+ 0.810 |
+ 0.835 |
+ 0.835 |
+ 0.531 |
+ 0.750 |
+ 0.849 |
+ 0.851 |
+ 0.732 |
+ 0.530 |
+ 0.529 |
+ 0.530 |
+ 0.522 |
+ 0.512 |
+ 0.520 |
+ 0.531 |
+ 0.575 |
+ 0.735 |
+ 0.514 |
+ 0.519 |
+ 0.554 |
+ 0.541 |
+ 0.500 |
+ 0.521 |
+ 0.535 |
+ 0.601 |
+ 0.690 |
+ 0.831 |
+ 0.805 |
+ 0.813 |
+ 0.514 |
+ 0.549 |
+ 0.563 |
+ 0.580 |
+ 0.783 |
+ 0.783 |
+ 0.781 |
+ 0.834 |
+ 0.833 |
+ 0.828 |
+ 0.496 |
+ 0.506 |
+ 0.498 |
+ 0.508 |
+ 0.636 |
+ 0.652 |
+ 0.760 |
+ 0.706 |
+ 0.759 |
+ 0.756 |
+ 0.758 |
+ 0.764 |
+ 0.756 |
+ 0.761 |
+ 0.758 |
+ 0.756 |
+ 0.755 |
+ 0.760 |
+ 0.668 |
+ 0.600 |
+ 24515 |
+ Activity |
+ D7PM05_CLYGR |
+ Low |
+ Eukaryote |
+
+
+ DLG4_HUMAN_Faure_2021 |
+ 0.877 |
+ 0.822 |
+ 0.834 |
+ 0.816 |
+ 0.834 |
+ 0.838 |
+ 0.904 |
+ 0.853 |
+ 0.791 |
+ 0.795 |
+ 0.781 |
+ 0.801 |
+ 0.834 |
+ 0.909 |
+ 0.921 |
+ 0.905 |
+ 0.822 |
+ 0.764 |
+ 0.741 |
+ 0.854 |
+ 0.812 |
+ 0.818 |
+ 0.809 |
+ 0.791 |
+ 0.834 |
+ 0.832 |
+ 0.809 |
+ 0.804 |
+ 0.768 |
+ 0.839 |
+ 0.855 |
+ 0.848 |
+ 0.777 |
+ 0.816 |
+ 0.852 |
+ 0.817 |
+ 0.861 |
+ 0.888 |
+ 0.867 |
+ 0.854 |
+ 0.865 |
+ 0.849 |
+ 0.827 |
+ 0.613 |
+ 0.708 |
+ 0.793 |
+ 0.839 |
+ 0.741 |
+ 0.868 |
+ 0.668 |
+ 0.768 |
+ 0.700 |
+ 0.798 |
+ 0.774 |
+ 0.786 |
+ 0.770 |
+ 0.782 |
+ 0.770 |
+ 0.779 |
+ 0.778 |
+ 0.783 |
+ 0.913 |
+ 6976 |
+ OrganismalFitness |
+ DLG4_HUMAN |
+ Low |
+ Human |
+
+
+ DLG4_RAT_McLaughlin_2012 |
+ 0.830 |
+ 0.771 |
+ 0.789 |
+ 0.801 |
+ 0.840 |
+ 0.848 |
+ 0.818 |
+ 0.794 |
+ 0.824 |
+ 0.847 |
+ 0.806 |
+ 0.869 |
+ 0.880 |
+ 0.796 |
+ 0.883 |
+ 0.884 |
+ 0.857 |
+ 0.823 |
+ 0.796 |
+ 0.800 |
+ 0.764 |
+ 0.769 |
+ 0.767 |
+ 0.752 |
+ 0.789 |
+ 0.795 |
+ 0.781 |
+ 0.755 |
+ 0.756 |
+ 0.834 |
+ 0.848 |
+ 0.833 |
+ 0.685 |
+ 0.759 |
+ 0.763 |
+ 0.724 |
+ 0.821 |
+ 0.816 |
+ 0.804 |
+ 0.855 |
+ 0.853 |
+ 0.854 |
+ 0.847 |
+ 0.552 |
+ 0.777 |
+ 0.834 |
+ 0.791 |
+ 0.709 |
+ 0.814 |
+ 0.593 |
+ 0.791 |
+ 0.760 |
+ 0.801 |
+ 0.771 |
+ 0.804 |
+ 0.790 |
+ 0.775 |
+ 0.797 |
+ 0.803 |
+ 0.810 |
+ 0.820 |
+ 0.865 |
+ 1576 |
+ Binding |
+ DLG4_RAT |
+ Low |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC |
+ 0.574 |
+ 0.580 |
+ 0.601 |
+ 0.610 |
+ 0.601 |
+ 0.625 |
+ 0.424 |
+ 0.668 |
+ 0.660 |
+ 0.672 |
+ 0.496 |
+ 0.539 |
+ 0.531 |
+ 0.502 |
+ 0.567 |
+ 0.581 |
+ 0.630 |
+ 0.647 |
+ 0.681 |
+ 0.604 |
+ 0.440 |
+ 0.500 |
+ 0.493 |
+ 0.508 |
+ 0.498 |
+ 0.533 |
+ 0.500 |
+ 0.511 |
+ 0.590 |
+ 0.647 |
+ 0.722 |
+ 0.699 |
+ 0.522 |
+ 0.469 |
+ 0.467 |
+ 0.486 |
+ 0.593 |
+ 0.589 |
+ 0.593 |
+ 0.614 |
+ 0.606 |
+ 0.608 |
+ 0.522 |
+ 0.451 |
+ 0.600 |
+ 0.530 |
+ 0.787 |
+ 0.782 |
+ 0.837 |
+ 0.783 |
+ 0.683 |
+ 0.675 |
+ 0.738 |
+ 0.730 |
+ 0.720 |
+ 0.718 |
+ 0.709 |
+ 0.689 |
+ 0.682 |
+ 0.716 |
+ 0.759 |
+ 0.723 |
+ 1008 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1 |
+ 0.928 |
+ 0.937 |
+ 0.937 |
+ 0.937 |
+ 0.944 |
+ 0.943 |
+ 0.887 |
+ 0.916 |
+ 0.938 |
+ 0.945 |
+ 0.946 |
+ 0.937 |
+ 0.949 |
+ 0.950 |
+ 0.955 |
+ 0.955 |
+ 0.946 |
+ 0.935 |
+ 0.953 |
+ 0.937 |
+ 0.862 |
+ 0.906 |
+ 0.924 |
+ 0.914 |
+ 0.929 |
+ 0.922 |
+ 0.930 |
+ 0.896 |
+ 0.929 |
+ 0.938 |
+ 0.920 |
+ 0.909 |
+ 0.864 |
+ 0.928 |
+ 0.929 |
+ 0.943 |
+ 0.954 |
+ 0.955 |
+ 0.960 |
+ 0.948 |
+ 0.950 |
+ 0.953 |
+ 0.846 |
+ 0.582 |
+ 0.711 |
+ 0.850 |
+ 0.822 |
+ 0.766 |
+ 0.928 |
+ 0.933 |
+ 0.932 |
+ 0.936 |
+ 0.934 |
+ 0.948 |
+ 0.945 |
+ 0.950 |
+ 0.948 |
+ 0.944 |
+ 0.949 |
+ 0.946 |
+ 0.926 |
+ 0.946 |
+ 2264 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y |
+ 0.714 |
+ 0.738 |
+ 0.718 |
+ 0.729 |
+ 0.757 |
+ 0.761 |
+ 0.521 |
+ 0.720 |
+ 0.686 |
+ 0.687 |
+ 0.749 |
+ 0.727 |
+ 0.741 |
+ 0.607 |
+ 0.719 |
+ 0.736 |
+ 0.753 |
+ 0.744 |
+ 0.751 |
+ 0.764 |
+ 0.654 |
+ 0.696 |
+ 0.678 |
+ 0.685 |
+ 0.645 |
+ 0.677 |
+ 0.659 |
+ 0.646 |
+ 0.673 |
+ 0.721 |
+ 0.719 |
+ 0.720 |
+ 0.591 |
+ 0.597 |
+ 0.652 |
+ 0.612 |
+ 0.735 |
+ 0.743 |
+ 0.739 |
+ 0.751 |
+ 0.752 |
+ 0.758 |
+ 0.666 |
+ 0.400 |
+ 0.741 |
+ 0.711 |
+ 0.668 |
+ 0.672 |
+ 0.703 |
+ 0.707 |
+ 0.743 |
+ 0.734 |
+ 0.742 |
+ 0.746 |
+ 0.738 |
+ 0.748 |
+ 0.729 |
+ 0.740 |
+ 0.747 |
+ 0.742 |
+ 0.767 |
+ 0.752 |
+ 2915 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ DYR_ECOLI_Nguyen_2023 |
+ 0.757 |
+ 0.851 |
+ 0.863 |
+ 0.869 |
+ 0.868 |
+ 0.872 |
+ 0.497 |
+ 0.834 |
+ 0.861 |
+ 0.855 |
+ 0.874 |
+ 0.879 |
+ 0.887 |
+ 0.564 |
+ 0.854 |
+ 0.874 |
+ 0.886 |
+ 0.886 |
+ 0.878 |
+ 0.873 |
+ 0.739 |
+ 0.805 |
+ 0.732 |
+ 0.778 |
+ 0.828 |
+ 0.831 |
+ 0.843 |
+ 0.855 |
+ 0.817 |
+ 0.876 |
+ 0.886 |
+ 0.873 |
+ 0.624 |
+ 0.842 |
+ 0.758 |
+ 0.747 |
+ 0.865 |
+ 0.811 |
+ 0.821 |
+ 0.880 |
+ 0.856 |
+ 0.860 |
+ 0.839 |
+ 0.499 |
+ 0.877 |
+ 0.855 |
+ 0.665 |
+ 0.793 |
+ 0.748 |
+ 0.576 |
+ 0.841 |
+ 0.817 |
+ 0.826 |
+ 0.833 |
+ 0.836 |
+ 0.837 |
+ 0.843 |
+ 0.843 |
+ 0.859 |
+ 0.849 |
+ 0.861 |
+ 0.808 |
+ 2916 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ High |
+ Prokaryote |
+
+
+ DYR_ECOLI_Thompson_2019 |
+ 0.745 |
+ 0.792 |
+ 0.790 |
+ 0.792 |
+ 0.796 |
+ 0.797 |
+ 0.457 |
+ 0.720 |
+ 0.788 |
+ 0.806 |
+ 0.782 |
+ 0.764 |
+ 0.772 |
+ 0.532 |
+ 0.724 |
+ 0.773 |
+ 0.789 |
+ 0.807 |
+ 0.806 |
+ 0.792 |
+ 0.644 |
+ 0.746 |
+ 0.667 |
+ 0.703 |
+ 0.734 |
+ 0.758 |
+ 0.789 |
+ 0.778 |
+ 0.766 |
+ 0.790 |
+ 0.794 |
+ 0.780 |
+ 0.605 |
+ 0.749 |
+ 0.712 |
+ 0.711 |
+ 0.773 |
+ 0.757 |
+ 0.765 |
+ 0.797 |
+ 0.795 |
+ 0.800 |
+ 0.718 |
+ 0.480 |
+ 0.770 |
+ 0.751 |
+ 0.621 |
+ 0.728 |
+ 0.681 |
+ 0.554 |
+ 0.763 |
+ 0.739 |
+ 0.744 |
+ 0.742 |
+ 0.757 |
+ 0.752 |
+ 0.764 |
+ 0.759 |
+ 0.771 |
+ 0.763 |
+ 0.792 |
+ 0.722 |
+ 2363 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ ENV_HV1B9_DuenasDecamp_2016 |
+ 0.747 |
+ 0.744 |
+ 0.688 |
+ 0.742 |
+ 0.765 |
+ 0.761 |
+ 0.503 |
+ 0.704 |
+ 0.732 |
+ 0.751 |
+ 0.722 |
+ 0.791 |
+ 0.777 |
+ 0.533 |
+ 0.515 |
+ 0.496 |
+ 0.470 |
+ 0.507 |
+ 0.539 |
+ 0.751 |
+ 0.753 |
+ 0.740 |
+ 0.768 |
+ 0.771 |
+ 0.707 |
+ 0.783 |
+ 0.769 |
+ 0.746 |
+ 0.747 |
+ 0.767 |
+ 0.768 |
+ 0.757 |
+ 0.674 |
+ 0.767 |
+ 0.767 |
+ 0.771 |
+ 0.774 |
+ 0.776 |
+ 0.772 |
+ 0.772 |
+ 0.773 |
+ 0.774 |
+ 0.719 |
+ 0.451 |
+ 0.748 |
+ 0.759 |
+ 0.736 |
+ 0.713 |
+ 0.731 |
+ 0.651 |
+ 0.643 |
+ 0.695 |
+ 0.704 |
+ 0.676 |
+ 0.655 |
+ 0.653 |
+ 0.667 |
+ 0.659 |
+ 0.644 |
+ 0.682 |
+ 0.607 |
+ 0.587 |
+ 375 |
+ OrganismalFitness |
+ ENV_HV1B9 |
+ Medium |
+ Virus |
+
+
+ ENV_HV1BR_Haddox_2016 |
+ 0.669 |
+ 0.652 |
+ 0.660 |
+ 0.661 |
+ 0.671 |
+ 0.673 |
+ 0.499 |
+ 0.659 |
+ 0.674 |
+ 0.673 |
+ 0.649 |
+ 0.659 |
+ 0.669 |
+ 0.495 |
+ 0.495 |
+ 0.497 |
+ 0.519 |
+ 0.529 |
+ 0.577 |
+ 0.667 |
+ 0.676 |
+ 0.680 |
+ 0.687 |
+ 0.682 |
+ 0.673 |
+ 0.680 |
+ 0.680 |
+ 0.676 |
+ 0.682 |
+ 0.677 |
+ 0.665 |
+ 0.649 |
+ 0.599 |
+ 0.673 |
+ 0.682 |
+ 0.681 |
+ 0.681 |
+ 0.685 |
+ 0.684 |
+ 0.685 |
+ 0.687 |
+ 0.684 |
+ 0.615 |
+ 0.490 |
+ 0.660 |
+ 0.649 |
+ 0.597 |
+ 0.616 |
+ 0.553 |
+ 0.534 |
+ 0.585 |
+ 0.581 |
+ 0.594 |
+ 0.607 |
+ 0.587 |
+ 0.594 |
+ 0.595 |
+ 0.590 |
+ 0.589 |
+ 0.598 |
+ 0.577 |
+ 0.541 |
+ 12863 |
+ OrganismalFitness |
+ ENV_HV1BR |
+ Medium |
+ Virus |
+
+
+ ENVZ_ECOLI_Ghose_2023 |
+ 0.571 |
+ 0.569 |
+ 0.610 |
+ 0.610 |
+ 0.619 |
+ 0.618 |
+ 0.503 |
+ 0.616 |
+ 0.608 |
+ 0.614 |
+ 0.606 |
+ 0.625 |
+ 0.636 |
+ 0.607 |
+ 0.615 |
+ 0.618 |
+ 0.613 |
+ 0.600 |
+ 0.584 |
+ 0.614 |
+ 0.605 |
+ 0.601 |
+ 0.632 |
+ 0.625 |
+ 0.606 |
+ 0.611 |
+ 0.626 |
+ 0.621 |
+ 0.615 |
+ 0.618 |
+ 0.598 |
+ 0.595 |
+ 0.559 |
+ 0.602 |
+ 0.612 |
+ 0.621 |
+ 0.618 |
+ 0.623 |
+ 0.630 |
+ 0.621 |
+ 0.625 |
+ 0.625 |
+ 0.611 |
+ 0.566 |
+ 0.612 |
+ 0.610 |
+ 0.528 |
+ 0.596 |
+ 0.582 |
+ 0.525 |
+ 0.623 |
+ 0.620 |
+ 0.604 |
+ 0.617 |
+ 0.639 |
+ 0.625 |
+ 0.615 |
+ 0.625 |
+ 0.627 |
+ 0.627 |
+ 0.594 |
+ 0.637 |
+ 1121 |
+ Activity |
+ ENVZ_ECOLI |
+ High |
+ Prokaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M |
+ 0.839 |
+ 0.898 |
+ 0.899 |
+ 0.902 |
+ 0.914 |
+ 0.915 |
+ 0.309 |
+ 0.906 |
+ 0.948 |
+ 0.952 |
+ 0.944 |
+ 0.946 |
+ 0.953 |
+ 0.286 |
+ 0.941 |
+ 0.954 |
+ 0.963 |
+ 0.945 |
+ 0.940 |
+ 0.923 |
+ 0.831 |
+ 0.860 |
+ 0.899 |
+ 0.889 |
+ 0.880 |
+ 0.898 |
+ 0.905 |
+ 0.913 |
+ 0.920 |
+ 0.954 |
+ 0.940 |
+ 0.936 |
+ 0.886 |
+ 0.860 |
+ 0.859 |
+ 0.879 |
+ 0.908 |
+ 0.908 |
+ 0.916 |
+ 0.919 |
+ 0.918 |
+ 0.917 |
+ 0.850 |
+ 0.341 |
+ 0.857 |
+ 0.861 |
+ 0.830 |
+ 0.828 |
+ 0.943 |
+ 0.909 |
+ 0.957 |
+ 0.954 |
+ 0.956 |
+ 0.955 |
+ 0.961 |
+ 0.958 |
+ 0.958 |
+ 0.958 |
+ 0.955 |
+ 0.960 |
+ 0.949 |
+ 0.944 |
+ 1960 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ ERBB2_HUMAN_Elazar_2016 |
+ 0.697 |
+ 0.691 |
+ 0.641 |
+ 0.642 |
+ 0.634 |
+ 0.645 |
+ 0.717 |
+ 0.694 |
+ 0.702 |
+ 0.715 |
+ 0.719 |
+ 0.730 |
+ 0.738 |
+ 0.737 |
+ 0.732 |
+ 0.735 |
+ 0.706 |
+ 0.692 |
+ 0.652 |
+ 0.525 |
+ 0.721 |
+ 0.729 |
+ 0.752 |
+ 0.725 |
+ 0.740 |
+ 0.747 |
+ 0.778 |
+ 0.782 |
+ 0.734 |
+ 0.692 |
+ 0.716 |
+ 0.543 |
+ 0.732 |
+ 0.768 |
+ 0.728 |
+ 0.736 |
+ 0.747 |
+ 0.731 |
+ 0.737 |
+ 0.728 |
+ 0.723 |
+ 0.720 |
+ 0.739 |
+ 0.735 |
+ 0.764 |
+ 0.734 |
+ 0.718 |
+ 0.730 |
+ 0.407 |
+ 0.527 |
+ 0.698 |
+ 0.693 |
+ 0.720 |
+ 0.702 |
+ 0.699 |
+ 0.699 |
+ 0.709 |
+ 0.717 |
+ 0.721 |
+ 0.714 |
+ 0.756 |
+ 0.741 |
+ 326 |
+ Expression |
+ ERBB2_HUMAN |
+ Low |
+ Human |
+
+
+ ESTA_BACSU_Nutschel_2020 |
+ 0.637 |
+ 0.694 |
+ 0.702 |
+ 0.711 |
+ 0.696 |
+ 0.694 |
+ 0.577 |
+ 0.665 |
+ 0.669 |
+ 0.707 |
+ 0.659 |
+ 0.645 |
+ 0.662 |
+ 0.576 |
+ 0.621 |
+ 0.629 |
+ 0.645 |
+ 0.641 |
+ 0.664 |
+ 0.654 |
+ 0.548 |
+ 0.591 |
+ 0.634 |
+ 0.648 |
+ 0.628 |
+ 0.620 |
+ 0.653 |
+ 0.634 |
+ 0.696 |
+ 0.690 |
+ 0.724 |
+ 0.711 |
+ 0.534 |
+ 0.579 |
+ 0.630 |
+ 0.641 |
+ 0.655 |
+ 0.665 |
+ 0.673 |
+ 0.700 |
+ 0.703 |
+ 0.703 |
+ 0.588 |
+ 0.552 |
+ 0.637 |
+ 0.623 |
+ 0.783 |
+ 0.743 |
+ 0.774 |
+ 0.681 |
+ 0.677 |
+ 0.664 |
+ 0.673 |
+ 0.666 |
+ 0.682 |
+ 0.670 |
+ 0.666 |
+ 0.677 |
+ 0.675 |
+ 0.679 |
+ 0.691 |
+ 0.594 |
+ 2172 |
+ Stability |
+ ESTA_BACSU |
+ High |
+ Prokaryote |
+
+
+ F7YBW8_MESOW_Aakre_2015 |
+ 0.537 |
+ 0.668 |
+ 0.668 |
+ 0.689 |
+ 0.683 |
+ 0.684 |
+ 0.517 |
+ 0.633 |
+ 0.668 |
+ 0.672 |
+ 0.689 |
+ 0.669 |
+ 0.674 |
+ 0.475 |
+ 0.513 |
+ 0.531 |
+ 0.671 |
+ 0.665 |
+ 0.687 |
+ 0.670 |
+ 0.479 |
+ 0.457 |
+ 0.458 |
+ 0.503 |
+ 0.555 |
+ 0.559 |
+ 0.501 |
+ 0.633 |
+ 0.677 |
+ 0.659 |
+ 0.698 |
+ 0.687 |
+ 0.518 |
+ 0.498 |
+ 0.460 |
+ 0.685 |
+ 0.537 |
+ 0.515 |
+ 0.684 |
+ 0.664 |
+ 0.658 |
+ 0.691 |
+ 0.488 |
+ 0.479 |
+ 0.599 |
+ 0.492 |
+ 0.498 |
+ 0.616 |
+ 0.528 |
+ 0.519 |
+ 0.687 |
+ 0.692 |
+ 0.680 |
+ 0.700 |
+ 0.696 |
+ 0.697 |
+ 0.686 |
+ 0.692 |
+ 0.695 |
+ 0.694 |
+ 0.626 |
+ 0.458 |
+ 9192 |
+ OrganismalFitness |
+ F7YBW8_MESOW |
+ High |
+ Prokaryote |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U |
+ 0.705 |
+ 0.727 |
+ 0.746 |
+ 0.751 |
+ 0.746 |
+ 0.742 |
+ 0.510 |
+ 0.677 |
+ 0.701 |
+ 0.745 |
+ 0.775 |
+ 0.637 |
+ 0.727 |
+ 0.510 |
+ 0.735 |
+ 0.784 |
+ 0.780 |
+ 0.764 |
+ 0.711 |
+ 0.746 |
+ 0.577 |
+ 0.648 |
+ 0.697 |
+ 0.663 |
+ 0.581 |
+ 0.754 |
+ 0.743 |
+ 0.641 |
+ 0.717 |
+ 0.779 |
+ 0.719 |
+ 0.684 |
+ 0.657 |
+ 0.526 |
+ 0.562 |
+ 0.577 |
+ 0.692 |
+ 0.687 |
+ 0.665 |
+ 0.729 |
+ 0.727 |
+ 0.716 |
+ 0.701 |
+ 0.500 |
+ 0.756 |
+ 0.729 |
+ 0.801 |
+ 0.783 |
+ 0.816 |
+ 0.781 |
+ 0.783 |
+ 0.761 |
+ 0.776 |
+ 0.775 |
+ 0.769 |
+ 0.787 |
+ 0.768 |
+ 0.763 |
+ 0.770 |
+ 0.777 |
+ 0.814 |
+ 0.811 |
+ 1886 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ FKBP3_HUMAN_Tsuboyama_2023_2KFV |
+ 0.727 |
+ 0.707 |
+ 0.759 |
+ 0.760 |
+ 0.763 |
+ 0.771 |
+ 0.589 |
+ 0.675 |
+ 0.630 |
+ 0.629 |
+ 0.602 |
+ 0.594 |
+ 0.591 |
+ 0.593 |
+ 0.592 |
+ 0.583 |
+ 0.600 |
+ 0.640 |
+ 0.694 |
+ 0.661 |
+ 0.622 |
+ 0.622 |
+ 0.627 |
+ 0.632 |
+ 0.585 |
+ 0.622 |
+ 0.599 |
+ 0.625 |
+ 0.628 |
+ 0.752 |
+ 0.673 |
+ 0.676 |
+ 0.464 |
+ 0.566 |
+ 0.602 |
+ 0.624 |
+ 0.718 |
+ 0.721 |
+ 0.688 |
+ 0.765 |
+ 0.767 |
+ 0.730 |
+ 0.599 |
+ 0.602 |
+ 0.595 |
+ 0.613 |
+ 0.856 |
+ 0.824 |
+ 0.855 |
+ 0.819 |
+ 0.660 |
+ 0.716 |
+ 0.738 |
+ 0.711 |
+ 0.710 |
+ 0.716 |
+ 0.693 |
+ 0.697 |
+ 0.650 |
+ 0.712 |
+ 0.798 |
+ 0.684 |
+ 1237 |
+ Stability |
+ FKBP3_HUMAN |
+ Medium |
+ Human |
+
+
+ GAL4_YEAST_Kitzman_2015 |
+ 0.646 |
+ 0.711 |
+ 0.712 |
+ 0.748 |
+ 0.733 |
+ 0.749 |
+ 0.677 |
+ 0.496 |
+ 0.745 |
+ 0.771 |
+ 0.782 |
+ 0.740 |
+ 0.743 |
+ 0.685 |
+ 0.712 |
+ 0.759 |
+ 0.795 |
+ 0.793 |
+ 0.788 |
+ 0.749 |
+ 0.672 |
+ 0.691 |
+ 0.695 |
+ 0.695 |
+ 0.691 |
+ 0.716 |
+ 0.741 |
+ 0.727 |
+ 0.771 |
+ 0.792 |
+ 0.792 |
+ 0.763 |
+ 0.636 |
+ 0.655 |
+ 0.669 |
+ 0.670 |
+ 0.757 |
+ 0.752 |
+ 0.754 |
+ 0.742 |
+ 0.739 |
+ 0.737 |
+ 0.708 |
+ 0.653 |
+ 0.777 |
+ 0.731 |
+ 0.649 |
+ 0.763 |
+ 0.670 |
+ 0.568 |
+ 0.773 |
+ 0.769 |
+ 0.767 |
+ 0.767 |
+ 0.779 |
+ 0.778 |
+ 0.770 |
+ 0.768 |
+ 0.767 |
+ 0.777 |
+ 0.767 |
+ 0.719 |
+ 1195 |
+ OrganismalFitness |
+ GAL4_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ GCN4_YEAST_Staller_2018 |
+ 0.632 |
+ 0.633 |
+ 0.629 |
+ 0.630 |
+ 0.628 |
+ 0.628 |
+ 0.584 |
+ 0.611 |
+ 0.631 |
+ 0.631 |
+ 0.627 |
+ 0.644 |
+ 0.647 |
+ 0.662 |
+ 0.644 |
+ 0.638 |
+ 0.649 |
+ 0.638 |
+ 0.631 |
+ 0.599 |
+ 0.574 |
+ 0.581 |
+ 0.572 |
+ 0.572 |
+ 0.563 |
+ 0.551 |
+ 0.532 |
+ 0.559 |
+ 0.577 |
+ 0.622 |
+ 0.637 |
+ 0.634 |
+ 0.519 |
+ 0.573 |
+ 0.571 |
+ 0.641 |
+ 0.635 |
+ 0.635 |
+ 0.647 |
+ 0.635 |
+ 0.635 |
+ 0.645 |
+ 0.549 |
+ 0.589 |
+ 0.572 |
+ 0.556 |
+ 0.568 |
+ 0.591 |
+ 0.612 |
+ 0.587 |
+ 0.624 |
+ 0.620 |
+ 0.620 |
+ 0.617 |
+ 0.623 |
+ 0.622 |
+ 0.621 |
+ 0.621 |
+ 0.623 |
+ 0.622 |
+ 0.624 |
+ 0.627 |
+ 2638 |
+ Binding |
+ GCN4_YEAST |
+ Low |
+ Eukaryote |
+
+
+ GDIA_HUMAN_Silverstein_2021 |
+ 0.719 |
+ 0.720 |
+ 0.726 |
+ 0.724 |
+ 0.727 |
+ 0.727 |
+ 0.591 |
+ 0.696 |
+ 0.739 |
+ 0.736 |
+ 0.694 |
+ 0.706 |
+ 0.726 |
+ 0.582 |
+ 0.618 |
+ 0.716 |
+ 0.698 |
+ 0.684 |
+ 0.704 |
+ 0.701 |
+ 0.626 |
+ 0.688 |
+ 0.686 |
+ 0.701 |
+ 0.659 |
+ 0.718 |
+ 0.708 |
+ 0.685 |
+ 0.696 |
+ 0.704 |
+ 0.712 |
+ 0.692 |
+ 0.627 |
+ 0.657 |
+ 0.690 |
+ 0.671 |
+ 0.715 |
+ 0.719 |
+ 0.713 |
+ 0.726 |
+ 0.733 |
+ 0.729 |
+ 0.616 |
+ 0.560 |
+ 0.717 |
+ 0.620 |
+ 0.671 |
+ 0.709 |
+ 0.682 |
+ 0.544 |
+ 0.695 |
+ 0.669 |
+ 0.697 |
+ 0.684 |
+ 0.691 |
+ 0.700 |
+ 0.688 |
+ 0.696 |
+ 0.690 |
+ 0.701 |
+ 0.729 |
+ 0.659 |
+ 1154 |
+ OrganismalFitness |
+ GDIA_HUMAN |
+ Low |
+ Human |
+
+
+ GFP_AEQVI_Sarkisyan_2016 |
+ 0.889 |
+ 0.884 |
+ 0.900 |
+ 0.901 |
+ 0.903 |
+ 0.903 |
+ 0.540 |
+ 0.883 |
+ 0.894 |
+ 0.891 |
+ 0.816 |
+ 0.565 |
+ 0.565 |
+ 0.552 |
+ 0.583 |
+ 0.564 |
+ 0.572 |
+ 0.596 |
+ 0.679 |
+ 0.858 |
+ 0.555 |
+ 0.572 |
+ 0.614 |
+ 0.565 |
+ 0.532 |
+ 0.615 |
+ 0.684 |
+ 0.884 |
+ 0.886 |
+ 0.904 |
+ 0.865 |
+ 0.870 |
+ 0.541 |
+ 0.545 |
+ 0.616 |
+ 0.878 |
+ 0.889 |
+ 0.890 |
+ 0.902 |
+ 0.906 |
+ 0.907 |
+ 0.919 |
+ 0.515 |
+ 0.495 |
+ 0.525 |
+ 0.525 |
+ 0.796 |
+ 0.792 |
+ 0.925 |
+ 0.860 |
+ 0.856 |
+ 0.860 |
+ 0.866 |
+ 0.871 |
+ 0.867 |
+ 0.866 |
+ 0.864 |
+ 0.865 |
+ 0.854 |
+ 0.865 |
+ 0.872 |
+ 0.769 |
+ 51714 |
+ Activity |
+ GFP_AEQVI |
+ Low |
+ Eukaryote |
+
+
+ GLPA_HUMAN_Elazar_2016 |
+ 0.602 |
+ 0.546 |
+ 0.603 |
+ 0.601 |
+ 0.594 |
+ 0.583 |
+ 0.711 |
+ 0.799 |
+ 0.682 |
+ 0.672 |
+ 0.741 |
+ 0.756 |
+ 0.754 |
+ 0.728 |
+ 0.746 |
+ 0.770 |
+ 0.751 |
+ 0.727 |
+ 0.757 |
+ 0.583 |
+ 0.743 |
+ 0.755 |
+ 0.737 |
+ 0.761 |
+ 0.739 |
+ 0.752 |
+ 0.761 |
+ 0.765 |
+ 0.798 |
+ 0.672 |
+ 0.746 |
+ 0.638 |
+ 0.638 |
+ 0.761 |
+ 0.737 |
+ 0.761 |
+ 0.750 |
+ 0.736 |
+ 0.748 |
+ 0.732 |
+ 0.717 |
+ 0.740 |
+ 0.758 |
+ 0.749 |
+ 0.762 |
+ 0.727 |
+ 0.722 |
+ 0.767 |
+ 0.706 |
+ 0.617 |
+ 0.790 |
+ 0.779 |
+ 0.782 |
+ 0.765 |
+ 0.776 |
+ 0.747 |
+ 0.776 |
+ 0.748 |
+ 0.770 |
+ 0.781 |
+ 0.775 |
+ 0.766 |
+ 245 |
+ Expression |
+ GLPA_HUMAN |
+ Low |
+ Human |
+
+
+ GRB2_HUMAN_Faure_2021 |
+ 0.727 |
+ 0.787 |
+ 0.784 |
+ 0.801 |
+ 0.799 |
+ 0.802 |
+ 0.777 |
+ 0.748 |
+ 0.796 |
+ 0.768 |
+ 0.792 |
+ 0.747 |
+ 0.781 |
+ 0.797 |
+ 0.832 |
+ 0.847 |
+ 0.858 |
+ 0.792 |
+ 0.816 |
+ 0.799 |
+ 0.807 |
+ 0.790 |
+ 0.786 |
+ 0.752 |
+ 0.801 |
+ 0.785 |
+ 0.753 |
+ 0.793 |
+ 0.751 |
+ 0.780 |
+ 0.741 |
+ 0.741 |
+ 0.757 |
+ 0.801 |
+ 0.786 |
+ 0.727 |
+ 0.790 |
+ 0.784 |
+ 0.745 |
+ 0.812 |
+ 0.812 |
+ 0.796 |
+ 0.826 |
+ 0.673 |
+ 0.789 |
+ 0.816 |
+ 0.879 |
+ 0.790 |
+ 0.890 |
+ 0.785 |
+ 0.845 |
+ 0.860 |
+ 0.855 |
+ 0.856 |
+ 0.860 |
+ 0.849 |
+ 0.849 |
+ 0.859 |
+ 0.848 |
+ 0.861 |
+ 0.806 |
+ 0.837 |
+ 63366 |
+ OrganismalFitness |
+ GRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q |
+ 0.652 |
+ 0.715 |
+ 0.572 |
+ 0.586 |
+ 0.679 |
+ 0.682 |
+ 0.629 |
+ 0.707 |
+ 0.726 |
+ 0.749 |
+ 0.829 |
+ 0.724 |
+ 0.747 |
+ 0.644 |
+ 0.704 |
+ 0.783 |
+ 0.835 |
+ 0.790 |
+ 0.801 |
+ 0.620 |
+ 0.644 |
+ 0.652 |
+ 0.670 |
+ 0.677 |
+ 0.660 |
+ 0.669 |
+ 0.605 |
+ 0.666 |
+ 0.765 |
+ 0.802 |
+ 0.824 |
+ 0.758 |
+ 0.579 |
+ 0.633 |
+ 0.643 |
+ 0.753 |
+ 0.711 |
+ 0.716 |
+ 0.773 |
+ 0.687 |
+ 0.714 |
+ 0.765 |
+ 0.653 |
+ 0.643 |
+ 0.810 |
+ 0.686 |
+ 0.829 |
+ 0.861 |
+ 0.864 |
+ 0.786 |
+ 0.840 |
+ 0.846 |
+ 0.846 |
+ 0.843 |
+ 0.851 |
+ 0.853 |
+ 0.844 |
+ 0.854 |
+ 0.853 |
+ 0.857 |
+ 0.878 |
+ 0.799 |
+ 1040 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM |
+ 0.846 |
+ 0.873 |
+ 0.829 |
+ 0.844 |
+ 0.885 |
+ 0.887 |
+ 0.667 |
+ 0.823 |
+ 0.858 |
+ 0.859 |
+ 0.830 |
+ 0.696 |
+ 0.720 |
+ 0.642 |
+ 0.656 |
+ 0.917 |
+ 0.884 |
+ 0.860 |
+ 0.846 |
+ 0.869 |
+ 0.674 |
+ 0.815 |
+ 0.866 |
+ 0.859 |
+ 0.792 |
+ 0.863 |
+ 0.856 |
+ 0.841 |
+ 0.848 |
+ 0.887 |
+ 0.873 |
+ 0.871 |
+ 0.620 |
+ 0.675 |
+ 0.697 |
+ 0.743 |
+ 0.907 |
+ 0.886 |
+ 0.910 |
+ 0.895 |
+ 0.895 |
+ 0.901 |
+ 0.596 |
+ 0.581 |
+ 0.902 |
+ 0.668 |
+ 0.755 |
+ 0.869 |
+ 0.857 |
+ 0.772 |
+ 0.868 |
+ 0.880 |
+ 0.899 |
+ 0.889 |
+ 0.893 |
+ 0.914 |
+ 0.887 |
+ 0.897 |
+ 0.888 |
+ 0.896 |
+ 0.892 |
+ 0.749 |
+ 5586 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HEM3_HUMAN_Loggerenberg_2023 |
+ 0.712 |
+ 0.707 |
+ 0.701 |
+ 0.702 |
+ 0.710 |
+ 0.709 |
+ 0.560 |
+ 0.566 |
+ 0.716 |
+ 0.722 |
+ 0.694 |
+ 0.688 |
+ 0.696 |
+ 0.569 |
+ 0.690 |
+ 0.702 |
+ 0.694 |
+ 0.713 |
+ 0.720 |
+ 0.562 |
+ 0.669 |
+ 0.688 |
+ 0.692 |
+ 0.700 |
+ 0.676 |
+ 0.694 |
+ 0.691 |
+ 0.693 |
+ 0.709 |
+ 0.720 |
+ 0.715 |
+ 0.711 |
+ 0.547 |
+ 0.689 |
+ 0.694 |
+ 0.703 |
+ 0.713 |
+ 0.719 |
+ 0.731 |
+ 0.715 |
+ 0.719 |
+ 0.726 |
+ 0.660 |
+ 0.573 |
+ 0.690 |
+ 0.685 |
+ 0.666 |
+ 0.684 |
+ 0.659 |
+ 0.580 |
+ 0.678 |
+ 0.669 |
+ 0.676 |
+ 0.681 |
+ 0.688 |
+ 0.686 |
+ 0.686 |
+ 0.686 |
+ 0.683 |
+ 0.688 |
+ 0.716 |
+ 0.709 |
+ 5689 |
+ Activity |
+ HEM3_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019 |
+ 0.768 |
+ 0.809 |
+ 0.825 |
+ 0.822 |
+ 0.801 |
+ 0.800 |
+ 0.589 |
+ 0.645 |
+ 0.762 |
+ 0.785 |
+ 0.761 |
+ 0.734 |
+ 0.773 |
+ 0.534 |
+ 0.568 |
+ 0.749 |
+ 0.728 |
+ 0.763 |
+ 0.769 |
+ 0.755 |
+ 0.686 |
+ 0.734 |
+ 0.750 |
+ 0.777 |
+ 0.724 |
+ 0.779 |
+ 0.771 |
+ 0.802 |
+ 0.810 |
+ 0.804 |
+ 0.724 |
+ 0.714 |
+ 0.583 |
+ 0.734 |
+ 0.758 |
+ 0.849 |
+ 0.783 |
+ 0.783 |
+ 0.865 |
+ 0.829 |
+ 0.812 |
+ 0.835 |
+ 0.587 |
+ 0.524 |
+ 0.666 |
+ 0.507 |
+ 0.729 |
+ 0.750 |
+ 0.820 |
+ 0.722 |
+ 0.746 |
+ 0.685 |
+ 0.718 |
+ 0.737 |
+ 0.744 |
+ 0.723 |
+ 0.726 |
+ 0.748 |
+ 0.738 |
+ 0.732 |
+ 0.795 |
+ 0.632 |
+ 496137 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HMDH_HUMAN_Jiang_2019 |
+ 0.741 |
+ 0.700 |
+ 0.683 |
+ 0.688 |
+ 0.720 |
+ 0.718 |
+ 0.575 |
+ 0.627 |
+ 0.625 |
+ 0.621 |
+ 0.626 |
+ 0.664 |
+ 0.649 |
+ 0.514 |
+ 0.498 |
+ 0.654 |
+ 0.755 |
+ 0.682 |
+ 0.683 |
+ 0.640 |
+ 0.727 |
+ 0.566 |
+ 0.572 |
+ 0.595 |
+ 0.719 |
+ 0.588 |
+ 0.572 |
+ 0.594 |
+ 0.592 |
+ 0.735 |
+ 0.699 |
+ 0.694 |
+ 0.610 |
+ 0.655 |
+ 0.617 |
+ 0.600 |
+ 0.719 |
+ 0.664 |
+ 0.657 |
+ 0.714 |
+ 0.672 |
+ 0.670 |
+ 0.581 |
+ 0.526 |
+ 0.735 |
+ 0.750 |
+ 0.660 |
+ 0.721 |
+ 0.397 |
+ 0.555 |
+ 0.745 |
+ 0.745 |
+ 0.741 |
+ 0.747 |
+ 0.743 |
+ 0.744 |
+ 0.736 |
+ 0.749 |
+ 0.740 |
+ 0.748 |
+ 0.765 |
+ 0.716 |
+ 16853 |
+ OrganismalFitness |
+ HMDH_HUMAN |
+ Low |
+ Human |
+
+
+ HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2 |
+ 0.656 |
+ 0.674 |
+ 0.680 |
+ 0.680 |
+ 0.677 |
+ 0.676 |
+ 0.606 |
+ 0.684 |
+ 0.713 |
+ 0.717 |
+ 0.677 |
+ 0.706 |
+ 0.725 |
+ 0.482 |
+ 0.550 |
+ 0.619 |
+ 0.622 |
+ 0.644 |
+ 0.643 |
+ 0.508 |
+ 0.681 |
+ 0.683 |
+ 0.694 |
+ 0.687 |
+ 0.663 |
+ 0.673 |
+ 0.699 |
+ 0.700 |
+ 0.703 |
+ 0.692 |
+ 0.711 |
+ 0.685 |
+ 0.611 |
+ 0.664 |
+ 0.660 |
+ 0.666 |
+ 0.677 |
+ 0.674 |
+ 0.677 |
+ 0.685 |
+ 0.683 |
+ 0.686 |
+ 0.508 |
+ 0.485 |
+ 0.691 |
+ 0.693 |
+ 0.523 |
+ 0.631 |
+ 0.542 |
+ 0.493 |
+ 0.614 |
+ 0.625 |
+ 0.614 |
+ 0.620 |
+ 0.619 |
+ 0.615 |
+ 0.627 |
+ 0.617 |
+ 0.611 |
+ 0.621 |
+ 0.683 |
+ 0.651 |
+ 2252 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Flynn_2019 |
+ 0.758 |
+ 0.772 |
+ 0.796 |
+ 0.803 |
+ 0.795 |
+ 0.795 |
+ 0.620 |
+ 0.749 |
+ 0.804 |
+ 0.796 |
+ 0.772 |
+ 0.790 |
+ 0.807 |
+ 0.515 |
+ 0.570 |
+ 0.634 |
+ 0.709 |
+ 0.765 |
+ 0.767 |
+ 0.786 |
+ 0.780 |
+ 0.790 |
+ 0.781 |
+ 0.794 |
+ 0.788 |
+ 0.798 |
+ 0.806 |
+ 0.781 |
+ 0.811 |
+ 0.806 |
+ 0.826 |
+ 0.814 |
+ 0.647 |
+ 0.778 |
+ 0.781 |
+ 0.781 |
+ 0.795 |
+ 0.799 |
+ 0.798 |
+ 0.807 |
+ 0.811 |
+ 0.810 |
+ 0.596 |
+ 0.451 |
+ 0.773 |
+ 0.725 |
+ 0.627 |
+ 0.726 |
+ 0.574 |
+ 0.538 |
+ 0.712 |
+ 0.705 |
+ 0.714 |
+ 0.722 |
+ 0.712 |
+ 0.716 |
+ 0.719 |
+ 0.716 |
+ 0.705 |
+ 0.719 |
+ 0.780 |
+ 0.740 |
+ 13294 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Mishra_2016 |
+ 0.882 |
+ 0.815 |
+ 0.891 |
+ 0.900 |
+ 0.897 |
+ 0.899 |
+ 0.794 |
+ 0.812 |
+ 0.837 |
+ 0.839 |
+ 0.859 |
+ 0.886 |
+ 0.893 |
+ 0.688 |
+ 0.780 |
+ 0.819 |
+ 0.873 |
+ 0.878 |
+ 0.867 |
+ 0.842 |
+ 0.850 |
+ 0.843 |
+ 0.839 |
+ 0.866 |
+ 0.866 |
+ 0.859 |
+ 0.872 |
+ 0.831 |
+ 0.858 |
+ 0.893 |
+ 0.903 |
+ 0.895 |
+ 0.717 |
+ 0.845 |
+ 0.844 |
+ 0.851 |
+ 0.881 |
+ 0.880 |
+ 0.882 |
+ 0.898 |
+ 0.898 |
+ 0.899 |
+ 0.849 |
+ 0.455 |
+ 0.843 |
+ 0.880 |
+ 0.623 |
+ 0.753 |
+ 0.645 |
+ 0.533 |
+ 0.855 |
+ 0.838 |
+ 0.829 |
+ 0.860 |
+ 0.860 |
+ 0.853 |
+ 0.859 |
+ 0.865 |
+ 0.855 |
+ 0.862 |
+ 0.842 |
+ 0.841 |
+ 4323 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HXK4_HUMAN_Gersing_2022_activity |
+ 0.762 |
+ 0.780 |
+ 0.759 |
+ 0.766 |
+ 0.761 |
+ 0.766 |
+ 0.618 |
+ 0.709 |
+ 0.779 |
+ 0.788 |
+ 0.748 |
+ 0.758 |
+ 0.771 |
+ 0.590 |
+ 0.649 |
+ 0.767 |
+ 0.776 |
+ 0.775 |
+ 0.760 |
+ 0.522 |
+ 0.746 |
+ 0.750 |
+ 0.743 |
+ 0.732 |
+ 0.749 |
+ 0.756 |
+ 0.746 |
+ 0.747 |
+ 0.722 |
+ 0.776 |
+ 0.770 |
+ 0.744 |
+ 0.601 |
+ 0.754 |
+ 0.734 |
+ 0.702 |
+ 0.770 |
+ 0.762 |
+ 0.743 |
+ 0.777 |
+ 0.775 |
+ 0.767 |
+ 0.664 |
+ 0.510 |
+ 0.775 |
+ 0.756 |
+ 0.684 |
+ 0.738 |
+ 0.704 |
+ 0.588 |
+ 0.771 |
+ 0.768 |
+ 0.760 |
+ 0.766 |
+ 0.766 |
+ 0.772 |
+ 0.768 |
+ 0.772 |
+ 0.771 |
+ 0.774 |
+ 0.780 |
+ 0.751 |
+ 8570 |
+ OrganismalFitness |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ HXK4_HUMAN_Gersing_2023_abundance |
+ 0.694 |
+ 0.719 |
+ 0.697 |
+ 0.713 |
+ 0.708 |
+ 0.711 |
+ 0.534 |
+ 0.700 |
+ 0.734 |
+ 0.746 |
+ 0.703 |
+ 0.697 |
+ 0.712 |
+ 0.580 |
+ 0.610 |
+ 0.701 |
+ 0.724 |
+ 0.731 |
+ 0.713 |
+ 0.740 |
+ 0.669 |
+ 0.671 |
+ 0.686 |
+ 0.686 |
+ 0.668 |
+ 0.681 |
+ 0.688 |
+ 0.686 |
+ 0.690 |
+ 0.723 |
+ 0.681 |
+ 0.659 |
+ 0.595 |
+ 0.681 |
+ 0.675 |
+ 0.686 |
+ 0.705 |
+ 0.705 |
+ 0.713 |
+ 0.711 |
+ 0.716 |
+ 0.723 |
+ 0.601 |
+ 0.547 |
+ 0.712 |
+ 0.703 |
+ 0.734 |
+ 0.728 |
+ 0.732 |
+ 0.594 |
+ 0.719 |
+ 0.727 |
+ 0.725 |
+ 0.725 |
+ 0.730 |
+ 0.727 |
+ 0.728 |
+ 0.724 |
+ 0.724 |
+ 0.729 |
+ 0.757 |
+ 0.701 |
+ 8396 |
+ Expression |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ I6TAH8_I68A0_Doud_2015 |
+ 0.677 |
+ 0.664 |
+ 0.639 |
+ 0.638 |
+ 0.685 |
+ 0.683 |
+ 0.495 |
+ 0.659 |
+ 0.645 |
+ 0.663 |
+ 0.503 |
+ 0.507 |
+ 0.506 |
+ 0.509 |
+ 0.510 |
+ 0.501 |
+ 0.507 |
+ 0.503 |
+ 0.543 |
+ 0.608 |
+ 0.663 |
+ 0.669 |
+ 0.691 |
+ 0.691 |
+ 0.504 |
+ 0.508 |
+ 0.554 |
+ 0.497 |
+ 0.655 |
+ 0.695 |
+ 0.627 |
+ 0.630 |
+ 0.572 |
+ 0.652 |
+ 0.666 |
+ 0.673 |
+ 0.665 |
+ 0.676 |
+ 0.677 |
+ 0.693 |
+ 0.701 |
+ 0.704 |
+ 0.502 |
+ 0.511 |
+ 0.506 |
+ 0.509 |
+ 0.608 |
+ 0.604 |
+ 0.623 |
+ 0.549 |
+ 0.533 |
+ 0.552 |
+ 0.564 |
+ 0.563 |
+ 0.554 |
+ 0.551 |
+ 0.542 |
+ 0.545 |
+ 0.527 |
+ 0.551 |
+ 0.521 |
+ 0.504 |
+ 9462 |
+ OrganismalFitness |
+ I6TAH8_I68A0 |
+ Medium |
+ Virus |
+
+
+ IF1_ECOLI_Kelsic_2016 |
+ 0.695 |
+ 0.775 |
+ 0.823 |
+ 0.823 |
+ 0.816 |
+ 0.820 |
+ 0.578 |
+ 0.712 |
+ 0.649 |
+ 0.641 |
+ 0.803 |
+ 0.815 |
+ 0.826 |
+ 0.600 |
+ 0.756 |
+ 0.817 |
+ 0.832 |
+ 0.818 |
+ 0.800 |
+ 0.793 |
+ 0.673 |
+ 0.729 |
+ 0.695 |
+ 0.714 |
+ 0.746 |
+ 0.748 |
+ 0.751 |
+ 0.735 |
+ 0.745 |
+ 0.722 |
+ 0.784 |
+ 0.754 |
+ 0.643 |
+ 0.756 |
+ 0.774 |
+ 0.787 |
+ 0.773 |
+ 0.780 |
+ 0.785 |
+ 0.812 |
+ 0.816 |
+ 0.820 |
+ 0.693 |
+ 0.563 |
+ 0.821 |
+ 0.782 |
+ 0.692 |
+ 0.797 |
+ 0.781 |
+ 0.645 |
+ 0.798 |
+ 0.790 |
+ 0.808 |
+ 0.814 |
+ 0.804 |
+ 0.809 |
+ 0.789 |
+ 0.813 |
+ 0.813 |
+ 0.819 |
+ 0.852 |
+ 0.783 |
+ 1367 |
+ OrganismalFitness |
+ IF1_ECOLI |
+ High |
+ Prokaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33 |
+ 0.718 |
+ 0.758 |
+ 0.827 |
+ 0.821 |
+ 0.811 |
+ 0.814 |
+ 0.604 |
+ 0.741 |
+ 0.797 |
+ 0.813 |
+ 0.747 |
+ 0.757 |
+ 0.772 |
+ 0.718 |
+ 0.775 |
+ 0.838 |
+ 0.722 |
+ 0.710 |
+ 0.649 |
+ 0.837 |
+ 0.621 |
+ 0.646 |
+ 0.685 |
+ 0.726 |
+ 0.735 |
+ 0.672 |
+ 0.751 |
+ 0.696 |
+ 0.697 |
+ 0.820 |
+ 0.773 |
+ 0.758 |
+ 0.529 |
+ 0.614 |
+ 0.670 |
+ 0.738 |
+ 0.747 |
+ 0.757 |
+ 0.787 |
+ 0.792 |
+ 0.783 |
+ 0.821 |
+ 0.755 |
+ 0.620 |
+ 0.745 |
+ 0.813 |
+ 0.834 |
+ 0.790 |
+ 0.851 |
+ 0.752 |
+ 0.693 |
+ 0.670 |
+ 0.727 |
+ 0.705 |
+ 0.726 |
+ 0.758 |
+ 0.733 |
+ 0.708 |
+ 0.779 |
+ 0.741 |
+ 0.737 |
+ 0.783 |
+ 1329 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ ISDH_STAAW_Tsuboyama_2023_2LHR |
+ 0.540 |
+ 0.548 |
+ 0.599 |
+ 0.594 |
+ 0.612 |
+ 0.612 |
+ 0.662 |
+ 0.571 |
+ 0.680 |
+ 0.679 |
+ 0.738 |
+ 0.710 |
+ 0.719 |
+ 0.687 |
+ 0.695 |
+ 0.724 |
+ 0.718 |
+ 0.740 |
+ 0.724 |
+ 0.589 |
+ 0.653 |
+ 0.601 |
+ 0.614 |
+ 0.629 |
+ 0.649 |
+ 0.660 |
+ 0.655 |
+ 0.642 |
+ 0.685 |
+ 0.669 |
+ 0.718 |
+ 0.691 |
+ 0.622 |
+ 0.636 |
+ 0.620 |
+ 0.636 |
+ 0.625 |
+ 0.605 |
+ 0.616 |
+ 0.629 |
+ 0.606 |
+ 0.616 |
+ 0.639 |
+ 0.611 |
+ 0.665 |
+ 0.652 |
+ 0.788 |
+ 0.743 |
+ 0.773 |
+ 0.752 |
+ 0.725 |
+ 0.717 |
+ 0.728 |
+ 0.724 |
+ 0.726 |
+ 0.728 |
+ 0.715 |
+ 0.720 |
+ 0.724 |
+ 0.726 |
+ 0.791 |
+ 0.749 |
+ 1944 |
+ Stability |
+ ISDH_STAAW |
+ High |
+ Prokaryote |
+
+
+ KCNE1_HUMAN_Muhammad_2023_expression |
+ 0.641 |
+ 0.648 |
+ 0.622 |
+ 0.628 |
+ 0.636 |
+ 0.620 |
+ 0.670 |
+ 0.569 |
+ 0.634 |
+ 0.636 |
+ 0.691 |
+ 0.679 |
+ 0.666 |
+ 0.663 |
+ 0.651 |
+ 0.682 |
+ 0.652 |
+ 0.635 |
+ 0.657 |
+ 0.649 |
+ 0.680 |
+ 0.694 |
+ 0.625 |
+ 0.606 |
+ 0.690 |
+ 0.628 |
+ 0.672 |
+ 0.633 |
+ 0.643 |
+ 0.647 |
+ 0.636 |
+ 0.603 |
+ 0.393 |
+ 0.656 |
+ 0.708 |
+ 0.659 |
+ 0.662 |
+ 0.673 |
+ 0.670 |
+ 0.657 |
+ 0.662 |
+ 0.666 |
+ 0.673 |
+ 0.652 |
+ 0.667 |
+ 0.693 |
+ 0.649 |
+ 0.650 |
+ 0.644 |
+ 0.553 |
+ 0.637 |
+ 0.625 |
+ 0.638 |
+ 0.614 |
+ 0.617 |
+ 0.612 |
+ 0.642 |
+ 0.636 |
+ 0.639 |
+ 0.633 |
+ 0.660 |
+ 0.678 |
+ 2339 |
+ Expression |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNE1_HUMAN_Muhammad_2023_function |
+ 0.693 |
+ 0.743 |
+ 0.739 |
+ 0.750 |
+ 0.735 |
+ 0.756 |
+ 0.599 |
+ 0.750 |
+ 0.752 |
+ 0.752 |
+ 0.667 |
+ 0.728 |
+ 0.786 |
+ 0.588 |
+ 0.567 |
+ 0.797 |
+ 0.759 |
+ 0.721 |
+ 0.684 |
+ 0.607 |
+ 0.620 |
+ 0.638 |
+ 0.719 |
+ 0.770 |
+ 0.629 |
+ 0.811 |
+ 0.793 |
+ 0.796 |
+ 0.791 |
+ 0.740 |
+ 0.793 |
+ 0.749 |
+ 0.495 |
+ 0.575 |
+ 0.680 |
+ 0.816 |
+ 0.722 |
+ 0.741 |
+ 0.812 |
+ 0.756 |
+ 0.770 |
+ 0.821 |
+ 0.590 |
+ 0.586 |
+ 0.766 |
+ 0.577 |
+ 0.582 |
+ 0.712 |
+ 0.583 |
+ 0.538 |
+ 0.766 |
+ 0.766 |
+ 0.757 |
+ 0.760 |
+ 0.764 |
+ 0.755 |
+ 0.768 |
+ 0.769 |
+ 0.770 |
+ 0.773 |
+ 0.819 |
+ 0.603 |
+ 2315 |
+ Activity |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNH2_HUMAN_Kozek_2020 |
+ 0.738 |
+ 0.730 |
+ 0.671 |
+ 0.670 |
+ 0.636 |
+ 0.635 |
+ 0.755 |
+ 0.588 |
+ 0.667 |
+ 0.678 |
+ 0.644 |
+ 0.607 |
+ 0.609 |
+ 0.619 |
+ 0.593 |
+ 0.608 |
+ 0.623 |
+ 0.625 |
+ 0.622 |
+ 0.739 |
+ 0.741 |
+ 0.766 |
+ 0.751 |
+ 0.729 |
+ 0.762 |
+ 0.748 |
+ 0.736 |
+ 0.739 |
+ 0.735 |
+ 0.758 |
+ 0.693 |
+ 0.630 |
+ 0.678 |
+ 0.738 |
+ 0.767 |
+ 0.749 |
+ 0.750 |
+ 0.782 |
+ 0.771 |
+ 0.739 |
+ 0.767 |
+ 0.759 |
+ 0.728 |
+ 0.599 |
+ 0.590 |
+ 0.652 |
+ 0.605 |
+ 0.477 |
+ 0.426 |
+ 0.547 |
+ 0.798 |
+ 0.719 |
+ 0.766 |
+ 0.748 |
+ 0.738 |
+ 0.739 |
+ 0.760 |
+ 0.746 |
+ 0.749 |
+ 0.762 |
+ 0.648 |
+ 0.738 |
+ 200 |
+ Activity |
+ KCNH2_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_function |
+ 0.639 |
+ 0.675 |
+ 0.675 |
+ 0.682 |
+ 0.686 |
+ 0.689 |
+ 0.531 |
+ 0.625 |
+ 0.668 |
+ 0.665 |
+ 0.696 |
+ 0.685 |
+ 0.691 |
+ 0.531 |
+ 0.652 |
+ 0.700 |
+ 0.702 |
+ 0.704 |
+ 0.706 |
+ 0.586 |
+ 0.680 |
+ 0.657 |
+ 0.631 |
+ 0.627 |
+ 0.683 |
+ 0.674 |
+ 0.662 |
+ 0.678 |
+ 0.618 |
+ 0.698 |
+ 0.694 |
+ 0.686 |
+ 0.578 |
+ 0.669 |
+ 0.652 |
+ 0.621 |
+ 0.683 |
+ 0.672 |
+ 0.652 |
+ 0.691 |
+ 0.686 |
+ 0.677 |
+ 0.610 |
+ 0.493 |
+ 0.675 |
+ 0.682 |
+ 0.601 |
+ 0.650 |
+ 0.623 |
+ 0.540 |
+ 0.694 |
+ 0.686 |
+ 0.692 |
+ 0.691 |
+ 0.693 |
+ 0.689 |
+ 0.694 |
+ 0.692 |
+ 0.693 |
+ 0.696 |
+ 0.696 |
+ 0.640 |
+ 6963 |
+ Activity |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_surface |
+ 0.677 |
+ 0.655 |
+ 0.614 |
+ 0.620 |
+ 0.654 |
+ 0.655 |
+ 0.557 |
+ 0.602 |
+ 0.624 |
+ 0.624 |
+ 0.639 |
+ 0.623 |
+ 0.630 |
+ 0.590 |
+ 0.669 |
+ 0.663 |
+ 0.652 |
+ 0.651 |
+ 0.658 |
+ 0.609 |
+ 0.604 |
+ 0.606 |
+ 0.607 |
+ 0.622 |
+ 0.614 |
+ 0.621 |
+ 0.619 |
+ 0.622 |
+ 0.621 |
+ 0.666 |
+ 0.632 |
+ 0.617 |
+ 0.575 |
+ 0.602 |
+ 0.613 |
+ 0.614 |
+ 0.642 |
+ 0.650 |
+ 0.656 |
+ 0.646 |
+ 0.654 |
+ 0.658 |
+ 0.625 |
+ 0.534 |
+ 0.615 |
+ 0.637 |
+ 0.623 |
+ 0.612 |
+ 0.622 |
+ 0.567 |
+ 0.637 |
+ 0.630 |
+ 0.641 |
+ 0.637 |
+ 0.644 |
+ 0.635 |
+ 0.646 |
+ 0.637 |
+ 0.634 |
+ 0.641 |
+ 0.649 |
+ 0.655 |
+ 6917 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KKA2_KLEPN_Melnikov_2014 |
+ 0.658 |
+ 0.806 |
+ 0.756 |
+ 0.859 |
+ 0.850 |
+ 0.856 |
+ 0.623 |
+ 0.756 |
+ 0.813 |
+ 0.854 |
+ 0.827 |
+ 0.842 |
+ 0.858 |
+ 0.632 |
+ 0.655 |
+ 0.782 |
+ 0.844 |
+ 0.870 |
+ 0.876 |
+ 0.794 |
+ 0.674 |
+ 0.748 |
+ 0.805 |
+ 0.814 |
+ 0.706 |
+ 0.837 |
+ 0.835 |
+ 0.832 |
+ 0.869 |
+ 0.868 |
+ 0.870 |
+ 0.845 |
+ 0.590 |
+ 0.650 |
+ 0.787 |
+ 0.839 |
+ 0.764 |
+ 0.809 |
+ 0.843 |
+ 0.842 |
+ 0.855 |
+ 0.868 |
+ 0.653 |
+ 0.551 |
+ 0.826 |
+ 0.733 |
+ 0.749 |
+ 0.829 |
+ 0.806 |
+ 0.640 |
+ 0.838 |
+ 0.828 |
+ 0.834 |
+ 0.830 |
+ 0.830 |
+ 0.835 |
+ 0.825 |
+ 0.837 |
+ 0.842 |
+ 0.843 |
+ 0.868 |
+ 0.685 |
+ 4960 |
+ OrganismalFitness |
+ KKA2_KLEPN |
+ High |
+ Prokaryote |
+
+
+ LGK_LIPST_Klesmith_2015 |
+ 0.640 |
+ 0.706 |
+ 0.730 |
+ 0.734 |
+ 0.730 |
+ 0.737 |
+ 0.560 |
+ 0.717 |
+ 0.728 |
+ 0.772 |
+ 0.715 |
+ 0.756 |
+ 0.771 |
+ 0.588 |
+ 0.654 |
+ 0.685 |
+ 0.750 |
+ 0.782 |
+ 0.779 |
+ 0.742 |
+ 0.639 |
+ 0.693 |
+ 0.722 |
+ 0.736 |
+ 0.677 |
+ 0.734 |
+ 0.751 |
+ 0.729 |
+ 0.765 |
+ 0.755 |
+ 0.756 |
+ 0.718 |
+ 0.565 |
+ 0.649 |
+ 0.690 |
+ 0.767 |
+ 0.680 |
+ 0.696 |
+ 0.755 |
+ 0.730 |
+ 0.736 |
+ 0.759 |
+ 0.584 |
+ 0.530 |
+ 0.732 |
+ 0.676 |
+ 0.689 |
+ 0.743 |
+ 0.735 |
+ 0.570 |
+ 0.747 |
+ 0.742 |
+ 0.747 |
+ 0.746 |
+ 0.747 |
+ 0.752 |
+ 0.751 |
+ 0.749 |
+ 0.751 |
+ 0.755 |
+ 0.699 |
+ 0.662 |
+ 7890 |
+ Activity |
+ LGK_LIPST |
+ Medium |
+ Eukaryote |
+
+
+ LYAM1_HUMAN_Elazar_2016 |
+ 0.669 |
+ 0.667 |
+ 0.610 |
+ 0.594 |
+ 0.649 |
+ 0.653 |
+ 0.675 |
+ 0.580 |
+ 0.698 |
+ 0.701 |
+ 0.698 |
+ 0.673 |
+ 0.670 |
+ 0.643 |
+ 0.672 |
+ 0.660 |
+ 0.637 |
+ 0.691 |
+ 0.732 |
+ 0.653 |
+ 0.681 |
+ 0.687 |
+ 0.682 |
+ 0.675 |
+ 0.674 |
+ 0.679 |
+ 0.669 |
+ 0.642 |
+ 0.660 |
+ 0.688 |
+ 0.641 |
+ 0.583 |
+ 0.598 |
+ 0.672 |
+ 0.705 |
+ 0.664 |
+ 0.684 |
+ 0.705 |
+ 0.679 |
+ 0.677 |
+ 0.691 |
+ 0.667 |
+ 0.642 |
+ 0.658 |
+ 0.644 |
+ 0.665 |
+ 0.586 |
+ 0.631 |
+ 0.590 |
+ 0.556 |
+ 0.608 |
+ 0.653 |
+ 0.696 |
+ 0.664 |
+ 0.635 |
+ 0.646 |
+ 0.675 |
+ 0.672 |
+ 0.630 |
+ 0.662 |
+ 0.709 |
+ 0.690 |
+ 359 |
+ Expression |
+ LYAM1_HUMAN |
+ Medium |
+ Human |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V |
+ 0.854 |
+ 0.874 |
+ 0.866 |
+ 0.866 |
+ 0.861 |
+ 0.861 |
+ 0.740 |
+ 0.904 |
+ 0.875 |
+ 0.866 |
+ 0.900 |
+ 0.749 |
+ 0.886 |
+ 0.790 |
+ 0.795 |
+ 0.823 |
+ 0.829 |
+ 0.859 |
+ 0.762 |
+ 0.868 |
+ 0.752 |
+ 0.837 |
+ 0.848 |
+ 0.872 |
+ 0.817 |
+ 0.858 |
+ 0.830 |
+ 0.859 |
+ 0.839 |
+ 0.835 |
+ 0.863 |
+ 0.855 |
+ 0.776 |
+ 0.686 |
+ 0.851 |
+ 0.855 |
+ 0.844 |
+ 0.885 |
+ 0.905 |
+ 0.869 |
+ 0.894 |
+ 0.911 |
+ 0.718 |
+ 0.525 |
+ 0.781 |
+ 0.676 |
+ 0.859 |
+ 0.728 |
+ 0.890 |
+ 0.905 |
+ 0.917 |
+ 0.903 |
+ 0.907 |
+ 0.916 |
+ 0.910 |
+ 0.915 |
+ 0.912 |
+ 0.916 |
+ 0.911 |
+ 0.915 |
+ 0.922 |
+ 0.934 |
+ 1429 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV |
+ 0.808 |
+ 0.904 |
+ 0.885 |
+ 0.895 |
+ 0.917 |
+ 0.921 |
+ 0.466 |
+ 0.816 |
+ 0.870 |
+ 0.908 |
+ 0.907 |
+ 0.757 |
+ 0.805 |
+ 0.428 |
+ 0.539 |
+ 0.915 |
+ 0.891 |
+ 0.882 |
+ 0.910 |
+ 0.902 |
+ 0.494 |
+ 0.556 |
+ 0.847 |
+ 0.864 |
+ 0.626 |
+ 0.836 |
+ 0.824 |
+ 0.842 |
+ 0.911 |
+ 0.923 |
+ 0.906 |
+ 0.878 |
+ 0.840 |
+ 0.333 |
+ 0.554 |
+ 0.544 |
+ 0.823 |
+ 0.872 |
+ 0.860 |
+ 0.899 |
+ 0.919 |
+ 0.913 |
+ 0.483 |
+ 0.395 |
+ 0.858 |
+ 0.749 |
+ 0.750 |
+ 0.848 |
+ 0.903 |
+ 0.792 |
+ 0.925 |
+ 0.910 |
+ 0.924 |
+ 0.910 |
+ 0.911 |
+ 0.908 |
+ 0.912 |
+ 0.919 |
+ 0.923 |
+ 0.922 |
+ 0.927 |
+ 0.884 |
+ 2116 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MET_HUMAN_Estevam_2023 |
+ 0.789 |
+ 0.845 |
+ 0.839 |
+ 0.856 |
+ 0.868 |
+ 0.869 |
+ 0.827 |
+ 0.825 |
+ 0.846 |
+ 0.855 |
+ 0.869 |
+ 0.847 |
+ 0.853 |
+ 0.789 |
+ 0.826 |
+ 0.843 |
+ 0.871 |
+ 0.876 |
+ 0.875 |
+ 0.870 |
+ 0.832 |
+ 0.819 |
+ 0.797 |
+ 0.811 |
+ 0.834 |
+ 0.829 |
+ 0.825 |
+ 0.808 |
+ 0.786 |
+ 0.849 |
+ 0.860 |
+ 0.833 |
+ 0.677 |
+ 0.798 |
+ 0.814 |
+ 0.828 |
+ 0.826 |
+ 0.840 |
+ 0.855 |
+ 0.855 |
+ 0.859 |
+ 0.863 |
+ 0.842 |
+ 0.723 |
+ 0.859 |
+ 0.863 |
+ 0.728 |
+ 0.768 |
+ 0.815 |
+ 0.624 |
+ 0.838 |
+ 0.845 |
+ 0.842 |
+ 0.844 |
+ 0.852 |
+ 0.847 |
+ 0.850 |
+ 0.852 |
+ 0.848 |
+ 0.854 |
+ 0.864 |
+ 0.837 |
+ 5393 |
+ Activity |
+ MET_HUMAN |
+ Medium |
+ Human |
+
+
+ MK01_HUMAN_Brenan_2016 |
+ 0.614 |
+ 0.622 |
+ 0.638 |
+ 0.641 |
+ 0.634 |
+ 0.636 |
+ 0.627 |
+ 0.605 |
+ 0.625 |
+ 0.618 |
+ 0.542 |
+ 0.607 |
+ 0.617 |
+ 0.605 |
+ 0.621 |
+ 0.622 |
+ 0.615 |
+ 0.621 |
+ 0.597 |
+ 0.614 |
+ 0.631 |
+ 0.582 |
+ 0.562 |
+ 0.530 |
+ 0.613 |
+ 0.569 |
+ 0.552 |
+ 0.561 |
+ 0.486 |
+ 0.630 |
+ 0.613 |
+ 0.633 |
+ 0.568 |
+ 0.610 |
+ 0.554 |
+ 0.524 |
+ 0.623 |
+ 0.586 |
+ 0.572 |
+ 0.637 |
+ 0.627 |
+ 0.626 |
+ 0.621 |
+ 0.579 |
+ 0.594 |
+ 0.614 |
+ 0.541 |
+ 0.514 |
+ 0.582 |
+ 0.502 |
+ 0.602 |
+ 0.601 |
+ 0.594 |
+ 0.604 |
+ 0.608 |
+ 0.603 |
+ 0.603 |
+ 0.606 |
+ 0.610 |
+ 0.608 |
+ 0.602 |
+ 0.616 |
+ 6809 |
+ OrganismalFitness |
+ MK01_HUMAN |
+ Medium |
+ Human |
+
+
+ MLAC_ECOLI_MacRae_2023 |
+ 0.607 |
+ 0.686 |
+ 0.723 |
+ 0.728 |
+ 0.720 |
+ 0.721 |
+ 0.489 |
+ 0.684 |
+ 0.716 |
+ 0.720 |
+ 0.708 |
+ 0.720 |
+ 0.728 |
+ 0.489 |
+ 0.614 |
+ 0.631 |
+ 0.694 |
+ 0.691 |
+ 0.706 |
+ 0.708 |
+ 0.538 |
+ 0.692 |
+ 0.697 |
+ 0.710 |
+ 0.659 |
+ 0.706 |
+ 0.708 |
+ 0.705 |
+ 0.725 |
+ 0.716 |
+ 0.737 |
+ 0.726 |
+ 0.513 |
+ 0.566 |
+ 0.662 |
+ 0.651 |
+ 0.634 |
+ 0.667 |
+ 0.660 |
+ 0.698 |
+ 0.708 |
+ 0.707 |
+ 0.584 |
+ 0.479 |
+ 0.662 |
+ 0.618 |
+ 0.514 |
+ 0.605 |
+ 0.571 |
+ 0.491 |
+ 0.666 |
+ 0.646 |
+ 0.654 |
+ 0.655 |
+ 0.665 |
+ 0.664 |
+ 0.654 |
+ 0.660 |
+ 0.674 |
+ 0.664 |
+ 0.610 |
+ 0.599 |
+ 4007 |
+ OrganismalFitness |
+ MLAC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ MSH2_HUMAN_Jia_2020 |
+ 0.827 |
+ 0.861 |
+ 0.833 |
+ 0.841 |
+ 0.852 |
+ 0.854 |
+ 0.696 |
+ 0.794 |
+ 0.861 |
+ 0.870 |
+ 0.809 |
+ 0.839 |
+ 0.857 |
+ 0.703 |
+ 0.824 |
+ 0.858 |
+ 0.814 |
+ 0.775 |
+ 0.678 |
+ 0.843 |
+ 0.762 |
+ 0.792 |
+ 0.775 |
+ 0.762 |
+ 0.788 |
+ 0.804 |
+ 0.815 |
+ 0.804 |
+ 0.785 |
+ 0.859 |
+ 0.828 |
+ 0.804 |
+ 0.697 |
+ 0.753 |
+ 0.795 |
+ 0.773 |
+ 0.821 |
+ 0.837 |
+ 0.829 |
+ 0.849 |
+ 0.856 |
+ 0.856 |
+ 0.761 |
+ 0.599 |
+ 0.843 |
+ 0.811 |
+ 0.784 |
+ 0.815 |
+ 0.562 |
+ 0.594 |
+ 0.780 |
+ 0.755 |
+ 0.769 |
+ 0.785 |
+ 0.796 |
+ 0.776 |
+ 0.771 |
+ 0.779 |
+ 0.802 |
+ 0.795 |
+ 0.865 |
+ 0.842 |
+ 16749 |
+ OrganismalFitness |
+ MSH2_HUMAN |
+ Medium |
+ Human |
+
+
+ MTH3_HAEAE_RockahShmuel_2015 |
+ 0.706 |
+ 0.800 |
+ 0.836 |
+ 0.842 |
+ 0.830 |
+ 0.835 |
+ 0.686 |
+ 0.823 |
+ 0.834 |
+ 0.845 |
+ 0.811 |
+ 0.854 |
+ 0.856 |
+ 0.631 |
+ 0.697 |
+ 0.717 |
+ 0.775 |
+ 0.822 |
+ 0.835 |
+ 0.840 |
+ 0.680 |
+ 0.747 |
+ 0.818 |
+ 0.838 |
+ 0.764 |
+ 0.836 |
+ 0.855 |
+ 0.843 |
+ 0.862 |
+ 0.830 |
+ 0.847 |
+ 0.828 |
+ 0.671 |
+ 0.692 |
+ 0.764 |
+ 0.832 |
+ 0.722 |
+ 0.763 |
+ 0.824 |
+ 0.815 |
+ 0.829 |
+ 0.847 |
+ 0.692 |
+ 0.520 |
+ 0.756 |
+ 0.692 |
+ 0.713 |
+ 0.786 |
+ 0.777 |
+ 0.600 |
+ 0.786 |
+ 0.770 |
+ 0.781 |
+ 0.788 |
+ 0.783 |
+ 0.790 |
+ 0.777 |
+ 0.785 |
+ 0.784 |
+ 0.790 |
+ 0.801 |
+ 0.707 |
+ 1777 |
+ OrganismalFitness |
+ MTH3_HAEAE |
+ Medium |
+ Prokaryote |
+
+
+ MTHR_HUMAN_Weile_2021 |
+ 0.607 |
+ 0.618 |
+ 0.608 |
+ 0.611 |
+ 0.611 |
+ 0.610 |
+ 0.588 |
+ 0.575 |
+ 0.632 |
+ 0.632 |
+ 0.676 |
+ 0.629 |
+ 0.639 |
+ 0.575 |
+ 0.685 |
+ 0.724 |
+ 0.661 |
+ 0.625 |
+ 0.631 |
+ 0.614 |
+ 0.707 |
+ 0.564 |
+ 0.597 |
+ 0.621 |
+ 0.700 |
+ 0.601 |
+ 0.619 |
+ 0.604 |
+ 0.639 |
+ 0.629 |
+ 0.592 |
+ 0.577 |
+ 0.559 |
+ 0.710 |
+ 0.641 |
+ 0.595 |
+ 0.668 |
+ 0.641 |
+ 0.612 |
+ 0.660 |
+ 0.636 |
+ 0.618 |
+ 0.620 |
+ 0.547 |
+ 0.669 |
+ 0.683 |
+ 0.608 |
+ 0.653 |
+ 0.674 |
+ 0.543 |
+ 0.647 |
+ 0.631 |
+ 0.636 |
+ 0.635 |
+ 0.639 |
+ 0.645 |
+ 0.653 |
+ 0.645 |
+ 0.660 |
+ 0.648 |
+ 0.647 |
+ 0.670 |
+ 12464 |
+ OrganismalFitness |
+ MTHR_HUMAN |
+ Low |
+ Human |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT |
+ 0.584 |
+ 0.635 |
+ 0.689 |
+ 0.700 |
+ 0.711 |
+ 0.727 |
+ 0.574 |
+ 0.623 |
+ 0.732 |
+ 0.726 |
+ 0.783 |
+ 0.760 |
+ 0.761 |
+ 0.592 |
+ 0.726 |
+ 0.815 |
+ 0.821 |
+ 0.774 |
+ 0.645 |
+ 0.675 |
+ 0.675 |
+ 0.618 |
+ 0.665 |
+ 0.657 |
+ 0.676 |
+ 0.647 |
+ 0.699 |
+ 0.664 |
+ 0.684 |
+ 0.650 |
+ 0.665 |
+ 0.613 |
+ 0.580 |
+ 0.721 |
+ 0.674 |
+ 0.644 |
+ 0.704 |
+ 0.677 |
+ 0.659 |
+ 0.723 |
+ 0.719 |
+ 0.711 |
+ 0.566 |
+ 0.526 |
+ 0.720 |
+ 0.654 |
+ 0.715 |
+ 0.737 |
+ 0.770 |
+ 0.696 |
+ 0.794 |
+ 0.779 |
+ 0.803 |
+ 0.788 |
+ 0.793 |
+ 0.794 |
+ 0.785 |
+ 0.791 |
+ 0.786 |
+ 0.794 |
+ 0.805 |
+ 0.777 |
+ 3297 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NCAP_I34A1_Doud_2015 |
+ 0.687 |
+ 0.670 |
+ 0.673 |
+ 0.672 |
+ 0.685 |
+ 0.686 |
+ 0.508 |
+ 0.668 |
+ 0.686 |
+ 0.677 |
+ 0.514 |
+ 0.517 |
+ 0.518 |
+ 0.519 |
+ 0.520 |
+ 0.516 |
+ 0.521 |
+ 0.522 |
+ 0.553 |
+ 0.646 |
+ 0.677 |
+ 0.687 |
+ 0.702 |
+ 0.704 |
+ 0.517 |
+ 0.529 |
+ 0.557 |
+ 0.520 |
+ 0.677 |
+ 0.700 |
+ 0.641 |
+ 0.645 |
+ 0.565 |
+ 0.680 |
+ 0.686 |
+ 0.710 |
+ 0.699 |
+ 0.704 |
+ 0.717 |
+ 0.716 |
+ 0.715 |
+ 0.724 |
+ 0.517 |
+ 0.515 |
+ 0.520 |
+ 0.519 |
+ 0.631 |
+ 0.635 |
+ 0.639 |
+ 0.559 |
+ 0.565 |
+ 0.585 |
+ 0.588 |
+ 0.594 |
+ 0.588 |
+ 0.587 |
+ 0.580 |
+ 0.583 |
+ 0.566 |
+ 0.587 |
+ 0.568 |
+ 0.543 |
+ 9462 |
+ OrganismalFitness |
+ NCAP_I34A1 |
+ Medium |
+ Virus |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R |
+ 0.689 |
+ 0.800 |
+ 0.859 |
+ 0.858 |
+ 0.870 |
+ 0.860 |
+ 0.816 |
+ 0.861 |
+ 0.867 |
+ 0.873 |
+ 0.864 |
+ 0.874 |
+ 0.870 |
+ 0.857 |
+ 0.824 |
+ 0.868 |
+ 0.858 |
+ 0.833 |
+ 0.856 |
+ 0.846 |
+ 0.858 |
+ 0.842 |
+ 0.844 |
+ 0.842 |
+ 0.841 |
+ 0.835 |
+ 0.852 |
+ 0.840 |
+ 0.861 |
+ 0.876 |
+ 0.828 |
+ 0.814 |
+ 0.847 |
+ 0.817 |
+ 0.833 |
+ 0.853 |
+ 0.841 |
+ 0.851 |
+ 0.866 |
+ 0.867 |
+ 0.869 |
+ 0.875 |
+ 0.770 |
+ 0.717 |
+ 0.705 |
+ 0.751 |
+ 0.823 |
+ 0.721 |
+ 0.872 |
+ 0.868 |
+ 0.879 |
+ 0.882 |
+ 0.885 |
+ 0.888 |
+ 0.889 |
+ 0.879 |
+ 0.885 |
+ 0.883 |
+ 0.878 |
+ 0.886 |
+ 0.866 |
+ 0.873 |
+ 2482 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_HEK293T |
+ 0.863 |
+ 0.876 |
+ 0.844 |
+ 0.848 |
+ 0.880 |
+ 0.878 |
+ 0.605 |
+ 0.761 |
+ 0.898 |
+ 0.891 |
+ 0.871 |
+ 0.671 |
+ 0.742 |
+ 0.592 |
+ 0.716 |
+ 0.818 |
+ 0.873 |
+ 0.876 |
+ 0.874 |
+ 0.554 |
+ 0.617 |
+ 0.730 |
+ 0.707 |
+ 0.707 |
+ 0.727 |
+ 0.758 |
+ 0.835 |
+ 0.739 |
+ 0.759 |
+ 0.889 |
+ 0.861 |
+ 0.803 |
+ 0.544 |
+ 0.603 |
+ 0.787 |
+ 0.799 |
+ 0.836 |
+ 0.859 |
+ 0.867 |
+ 0.871 |
+ 0.879 |
+ 0.885 |
+ 0.652 |
+ 0.551 |
+ 0.874 |
+ 0.799 |
+ 0.752 |
+ 0.871 |
+ 0.515 |
+ 0.601 |
+ 0.875 |
+ 0.862 |
+ 0.867 |
+ 0.867 |
+ 0.867 |
+ 0.858 |
+ 0.870 |
+ 0.882 |
+ 0.872 |
+ 0.877 |
+ 0.864 |
+ 0.712 |
+ 637 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_RPE1 |
+ 0.870 |
+ 0.773 |
+ 0.682 |
+ 0.685 |
+ 0.800 |
+ 0.810 |
+ 0.663 |
+ 0.721 |
+ 0.763 |
+ 0.806 |
+ 0.833 |
+ 0.538 |
+ 0.582 |
+ 0.581 |
+ 0.610 |
+ 0.763 |
+ 0.789 |
+ 0.775 |
+ 0.811 |
+ 0.675 |
+ 0.795 |
+ 0.687 |
+ 0.715 |
+ 0.648 |
+ 0.690 |
+ 0.689 |
+ 0.689 |
+ 0.635 |
+ 0.710 |
+ 0.795 |
+ 0.883 |
+ 0.795 |
+ 0.780 |
+ 0.688 |
+ 0.584 |
+ 0.742 |
+ 0.867 |
+ 0.770 |
+ 0.847 |
+ 0.838 |
+ 0.779 |
+ 0.811 |
+ 0.673 |
+ 0.681 |
+ 0.912 |
+ 0.609 |
+ 0.731 |
+ 0.817 |
+ 0.440 |
+ 0.602 |
+ 0.770 |
+ 0.755 |
+ 0.771 |
+ 0.707 |
+ 0.767 |
+ 0.756 |
+ 0.758 |
+ 0.807 |
+ 0.799 |
+ 0.769 |
+ 0.855 |
+ 0.720 |
+ 63 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NRAM_I33A0_Jiang_2016 |
+ 0.802 |
+ 0.775 |
+ 0.727 |
+ 0.717 |
+ 0.799 |
+ 0.795 |
+ 0.530 |
+ 0.718 |
+ 0.809 |
+ 0.814 |
+ 0.474 |
+ 0.598 |
+ 0.745 |
+ 0.450 |
+ 0.444 |
+ 0.505 |
+ 0.615 |
+ 0.752 |
+ 0.795 |
+ 0.686 |
+ 0.769 |
+ 0.802 |
+ 0.781 |
+ 0.777 |
+ 0.549 |
+ 0.774 |
+ 0.809 |
+ 0.747 |
+ 0.810 |
+ 0.833 |
+ 0.729 |
+ 0.732 |
+ 0.456 |
+ 0.747 |
+ 0.767 |
+ 0.772 |
+ 0.803 |
+ 0.822 |
+ 0.823 |
+ 0.820 |
+ 0.828 |
+ 0.823 |
+ 0.428 |
+ 0.432 |
+ 0.425 |
+ 0.423 |
+ 0.725 |
+ 0.706 |
+ 0.731 |
+ 0.592 |
+ 0.636 |
+ 0.648 |
+ 0.653 |
+ 0.677 |
+ 0.663 |
+ 0.679 |
+ 0.666 |
+ 0.660 |
+ 0.634 |
+ 0.662 |
+ 0.644 |
+ 0.567 |
+ 298 |
+ OrganismalFitness |
+ NRAM_I33A0 |
+ Low |
+ Virus |
+
+
+ NUD15_HUMAN_Suiter_2020 |
+ 0.644 |
+ 0.750 |
+ 0.797 |
+ 0.813 |
+ 0.809 |
+ 0.811 |
+ 0.504 |
+ 0.711 |
+ 0.832 |
+ 0.866 |
+ 0.838 |
+ 0.839 |
+ 0.863 |
+ 0.677 |
+ 0.735 |
+ 0.753 |
+ 0.801 |
+ 0.833 |
+ 0.840 |
+ 0.777 |
+ 0.671 |
+ 0.750 |
+ 0.808 |
+ 0.798 |
+ 0.736 |
+ 0.828 |
+ 0.826 |
+ 0.810 |
+ 0.788 |
+ 0.813 |
+ 0.841 |
+ 0.811 |
+ 0.594 |
+ 0.698 |
+ 0.739 |
+ 0.816 |
+ 0.728 |
+ 0.744 |
+ 0.830 |
+ 0.809 |
+ 0.809 |
+ 0.838 |
+ 0.724 |
+ 0.511 |
+ 0.831 |
+ 0.744 |
+ 0.753 |
+ 0.805 |
+ 0.779 |
+ 0.667 |
+ 0.806 |
+ 0.785 |
+ 0.798 |
+ 0.813 |
+ 0.804 |
+ 0.814 |
+ 0.817 |
+ 0.813 |
+ 0.813 |
+ 0.817 |
+ 0.882 |
+ 0.815 |
+ 2844 |
+ Expression |
+ NUD15_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL |
+ 0.692 |
+ 0.804 |
+ 0.802 |
+ 0.796 |
+ 0.814 |
+ 0.821 |
+ 0.636 |
+ 0.730 |
+ 0.850 |
+ 0.821 |
+ 0.793 |
+ 0.654 |
+ 0.667 |
+ 0.656 |
+ 0.698 |
+ 0.701 |
+ 0.744 |
+ 0.767 |
+ 0.758 |
+ 0.798 |
+ 0.705 |
+ 0.796 |
+ 0.810 |
+ 0.812 |
+ 0.657 |
+ 0.762 |
+ 0.729 |
+ 0.788 |
+ 0.807 |
+ 0.839 |
+ 0.865 |
+ 0.894 |
+ 0.587 |
+ 0.682 |
+ 0.703 |
+ 0.707 |
+ 0.758 |
+ 0.746 |
+ 0.756 |
+ 0.832 |
+ 0.810 |
+ 0.825 |
+ 0.679 |
+ 0.609 |
+ 0.683 |
+ 0.670 |
+ 0.851 |
+ 0.828 |
+ 0.881 |
+ 0.844 |
+ 0.888 |
+ 0.885 |
+ 0.884 |
+ 0.897 |
+ 0.898 |
+ 0.907 |
+ 0.890 |
+ 0.889 |
+ 0.896 |
+ 0.897 |
+ 0.852 |
+ 0.780 |
+ 2028 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6 |
+ 0.752 |
+ 0.760 |
+ 0.767 |
+ 0.774 |
+ 0.783 |
+ 0.774 |
+ 0.653 |
+ 0.758 |
+ 0.782 |
+ 0.776 |
+ 0.749 |
+ 0.722 |
+ 0.746 |
+ 0.693 |
+ 0.796 |
+ 0.763 |
+ 0.784 |
+ 0.789 |
+ 0.788 |
+ 0.761 |
+ 0.682 |
+ 0.705 |
+ 0.700 |
+ 0.727 |
+ 0.730 |
+ 0.722 |
+ 0.737 |
+ 0.699 |
+ 0.733 |
+ 0.803 |
+ 0.741 |
+ 0.739 |
+ 0.660 |
+ 0.657 |
+ 0.669 |
+ 0.745 |
+ 0.743 |
+ 0.744 |
+ 0.771 |
+ 0.763 |
+ 0.765 |
+ 0.775 |
+ 0.681 |
+ 0.559 |
+ 0.674 |
+ 0.743 |
+ 0.815 |
+ 0.789 |
+ 0.886 |
+ 0.862 |
+ 0.798 |
+ 0.788 |
+ 0.777 |
+ 0.798 |
+ 0.794 |
+ 0.792 |
+ 0.796 |
+ 0.800 |
+ 0.798 |
+ 0.797 |
+ 0.762 |
+ 0.808 |
+ 1380 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C |
+ 0.758 |
+ 0.860 |
+ 0.916 |
+ 0.928 |
+ 0.917 |
+ 0.923 |
+ 0.644 |
+ 0.809 |
+ 0.917 |
+ 0.915 |
+ 0.921 |
+ 0.869 |
+ 0.887 |
+ 0.649 |
+ 0.900 |
+ 0.923 |
+ 0.928 |
+ 0.924 |
+ 0.919 |
+ 0.916 |
+ 0.909 |
+ 0.899 |
+ 0.898 |
+ 0.887 |
+ 0.854 |
+ 0.902 |
+ 0.893 |
+ 0.882 |
+ 0.877 |
+ 0.909 |
+ 0.927 |
+ 0.922 |
+ 0.758 |
+ 0.618 |
+ 0.670 |
+ 0.762 |
+ 0.874 |
+ 0.860 |
+ 0.869 |
+ 0.927 |
+ 0.917 |
+ 0.920 |
+ 0.735 |
+ 0.484 |
+ 0.730 |
+ 0.787 |
+ 0.837 |
+ 0.787 |
+ 0.916 |
+ 0.918 |
+ 0.943 |
+ 0.937 |
+ 0.944 |
+ 0.940 |
+ 0.947 |
+ 0.944 |
+ 0.941 |
+ 0.940 |
+ 0.947 |
+ 0.945 |
+ 0.932 |
+ 0.890 |
+ 3197 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G |
+ 0.372 |
+ 0.393 |
+ 0.643 |
+ 0.562 |
+ 0.637 |
+ 0.636 |
+ 0.404 |
+ 0.518 |
+ 0.558 |
+ 0.564 |
+ 0.500 |
+ 0.432 |
+ 0.426 |
+ 0.448 |
+ 0.415 |
+ 0.532 |
+ 0.569 |
+ 0.514 |
+ 0.571 |
+ 0.589 |
+ 0.559 |
+ 0.555 |
+ 0.546 |
+ 0.560 |
+ 0.554 |
+ 0.561 |
+ 0.552 |
+ 0.564 |
+ 0.581 |
+ 0.642 |
+ 0.504 |
+ 0.493 |
+ 0.614 |
+ 0.453 |
+ 0.490 |
+ 0.479 |
+ 0.566 |
+ 0.560 |
+ 0.560 |
+ 0.610 |
+ 0.600 |
+ 0.618 |
+ 0.599 |
+ 0.452 |
+ 0.632 |
+ 0.625 |
+ 0.694 |
+ 0.678 |
+ 0.681 |
+ 0.626 |
+ 0.525 |
+ 0.505 |
+ 0.508 |
+ 0.529 |
+ 0.501 |
+ 0.528 |
+ 0.520 |
+ 0.523 |
+ 0.512 |
+ 0.518 |
+ 0.608 |
+ 0.551 |
+ 1134 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OPSD_HUMAN_Wan_2019 |
+ 0.611 |
+ 0.750 |
+ 0.755 |
+ 0.796 |
+ 0.758 |
+ 0.760 |
+ 0.701 |
+ 0.808 |
+ 0.787 |
+ 0.806 |
+ 0.692 |
+ 0.788 |
+ 0.808 |
+ 0.659 |
+ 0.777 |
+ 0.767 |
+ 0.784 |
+ 0.751 |
+ 0.769 |
+ 0.784 |
+ 0.782 |
+ 0.778 |
+ 0.772 |
+ 0.796 |
+ 0.804 |
+ 0.812 |
+ 0.822 |
+ 0.815 |
+ 0.801 |
+ 0.749 |
+ 0.740 |
+ 0.694 |
+ 0.604 |
+ 0.761 |
+ 0.785 |
+ 0.768 |
+ 0.780 |
+ 0.783 |
+ 0.773 |
+ 0.763 |
+ 0.765 |
+ 0.767 |
+ 0.702 |
+ 0.557 |
+ 0.762 |
+ 0.752 |
+ 0.874 |
+ 0.789 |
+ 0.882 |
+ 0.611 |
+ 0.703 |
+ 0.670 |
+ 0.699 |
+ 0.696 |
+ 0.745 |
+ 0.754 |
+ 0.682 |
+ 0.701 |
+ 0.729 |
+ 0.729 |
+ 0.814 |
+ 0.785 |
+ 165 |
+ Expression |
+ OPSD_HUMAN |
+ High |
+ Human |
+
+
+ OTC_HUMAN_Lo_2023 |
+ 0.757 |
+ 0.786 |
+ 0.748 |
+ 0.769 |
+ 0.778 |
+ 0.780 |
+ 0.547 |
+ 0.710 |
+ 0.791 |
+ 0.793 |
+ 0.773 |
+ 0.779 |
+ 0.785 |
+ 0.552 |
+ 0.690 |
+ 0.739 |
+ 0.753 |
+ 0.749 |
+ 0.763 |
+ 0.782 |
+ 0.713 |
+ 0.731 |
+ 0.768 |
+ 0.782 |
+ 0.735 |
+ 0.783 |
+ 0.769 |
+ 0.777 |
+ 0.798 |
+ 0.806 |
+ 0.745 |
+ 0.716 |
+ 0.533 |
+ 0.703 |
+ 0.761 |
+ 0.798 |
+ 0.753 |
+ 0.787 |
+ 0.815 |
+ 0.786 |
+ 0.798 |
+ 0.807 |
+ 0.619 |
+ 0.524 |
+ 0.752 |
+ 0.720 |
+ 0.830 |
+ 0.797 |
+ 0.837 |
+ 0.687 |
+ 0.755 |
+ 0.766 |
+ 0.764 |
+ 0.781 |
+ 0.766 |
+ 0.775 |
+ 0.770 |
+ 0.766 |
+ 0.763 |
+ 0.773 |
+ 0.799 |
+ 0.734 |
+ 1570 |
+ Activity |
+ OTC_HUMAN |
+ Medium |
+ Human |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D |
+ 0.520 |
+ 0.593 |
+ 0.582 |
+ 0.590 |
+ 0.602 |
+ 0.604 |
+ 0.599 |
+ 0.574 |
+ 0.560 |
+ 0.564 |
+ 0.747 |
+ 0.862 |
+ 0.858 |
+ 0.593 |
+ 0.835 |
+ 0.847 |
+ 0.726 |
+ 0.741 |
+ 0.728 |
+ 0.579 |
+ 0.578 |
+ 0.665 |
+ 0.691 |
+ 0.713 |
+ 0.731 |
+ 0.731 |
+ 0.759 |
+ 0.732 |
+ 0.615 |
+ 0.671 |
+ 0.815 |
+ 0.793 |
+ 0.562 |
+ 0.566 |
+ 0.639 |
+ 0.734 |
+ 0.596 |
+ 0.602 |
+ 0.688 |
+ 0.612 |
+ 0.603 |
+ 0.636 |
+ 0.688 |
+ 0.577 |
+ 0.836 |
+ 0.812 |
+ 0.829 |
+ 0.842 |
+ 0.865 |
+ 0.787 |
+ 0.769 |
+ 0.717 |
+ 0.731 |
+ 0.762 |
+ 0.773 |
+ 0.782 |
+ 0.735 |
+ 0.764 |
+ 0.782 |
+ 0.766 |
+ 0.854 |
+ 0.855 |
+ 635 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ OXDA_RHOTO_Vanella_2023_activity |
+ 0.570 |
+ 0.623 |
+ 0.649 |
+ 0.650 |
+ 0.638 |
+ 0.646 |
+ 0.591 |
+ 0.620 |
+ 0.613 |
+ 0.618 |
+ 0.667 |
+ 0.653 |
+ 0.657 |
+ 0.562 |
+ 0.609 |
+ 0.639 |
+ 0.659 |
+ 0.665 |
+ 0.670 |
+ 0.653 |
+ 0.569 |
+ 0.604 |
+ 0.643 |
+ 0.647 |
+ 0.622 |
+ 0.657 |
+ 0.665 |
+ 0.650 |
+ 0.692 |
+ 0.643 |
+ 0.687 |
+ 0.678 |
+ 0.519 |
+ 0.590 |
+ 0.614 |
+ 0.639 |
+ 0.624 |
+ 0.629 |
+ 0.639 |
+ 0.642 |
+ 0.645 |
+ 0.648 |
+ 0.595 |
+ 0.529 |
+ 0.667 |
+ 0.632 |
+ 0.630 |
+ 0.667 |
+ 0.646 |
+ 0.537 |
+ 0.655 |
+ 0.659 |
+ 0.665 |
+ 0.654 |
+ 0.658 |
+ 0.669 |
+ 0.658 |
+ 0.661 |
+ 0.660 |
+ 0.665 |
+ 0.665 |
+ 0.601 |
+ 6396 |
+ Activity |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ OXDA_RHOTO_Vanella_2023_expression |
+ 0.635 |
+ 0.669 |
+ 0.681 |
+ 0.681 |
+ 0.676 |
+ 0.677 |
+ 0.652 |
+ 0.637 |
+ 0.689 |
+ 0.694 |
+ 0.705 |
+ 0.700 |
+ 0.699 |
+ 0.661 |
+ 0.679 |
+ 0.694 |
+ 0.699 |
+ 0.719 |
+ 0.709 |
+ 0.646 |
+ 0.638 |
+ 0.668 |
+ 0.670 |
+ 0.662 |
+ 0.670 |
+ 0.674 |
+ 0.694 |
+ 0.683 |
+ 0.697 |
+ 0.693 |
+ 0.701 |
+ 0.678 |
+ 0.552 |
+ 0.670 |
+ 0.654 |
+ 0.670 |
+ 0.684 |
+ 0.678 |
+ 0.682 |
+ 0.688 |
+ 0.684 |
+ 0.684 |
+ 0.676 |
+ 0.624 |
+ 0.698 |
+ 0.676 |
+ 0.715 |
+ 0.708 |
+ 0.675 |
+ 0.550 |
+ 0.701 |
+ 0.690 |
+ 0.688 |
+ 0.694 |
+ 0.696 |
+ 0.688 |
+ 0.699 |
+ 0.696 |
+ 0.695 |
+ 0.700 |
+ 0.736 |
+ 0.694 |
+ 6769 |
+ Expression |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Etoposide |
+ 0.831 |
+ 0.831 |
+ 0.765 |
+ 0.777 |
+ 0.826 |
+ 0.829 |
+ 0.450 |
+ 0.764 |
+ 0.763 |
+ 0.772 |
+ 0.826 |
+ 0.827 |
+ 0.855 |
+ 0.417 |
+ 0.436 |
+ 0.764 |
+ 0.835 |
+ 0.849 |
+ 0.869 |
+ 0.627 |
+ 0.722 |
+ 0.788 |
+ 0.804 |
+ 0.796 |
+ 0.778 |
+ 0.818 |
+ 0.822 |
+ 0.818 |
+ 0.783 |
+ 0.830 |
+ 0.853 |
+ 0.836 |
+ 0.611 |
+ 0.701 |
+ 0.800 |
+ 0.784 |
+ 0.819 |
+ 0.839 |
+ 0.821 |
+ 0.833 |
+ 0.847 |
+ 0.829 |
+ 0.419 |
+ 0.412 |
+ 0.840 |
+ 0.500 |
+ 0.771 |
+ 0.833 |
+ 0.784 |
+ 0.637 |
+ 0.834 |
+ 0.836 |
+ 0.838 |
+ 0.834 |
+ 0.839 |
+ 0.847 |
+ 0.839 |
+ 0.843 |
+ 0.847 |
+ 0.848 |
+ 0.859 |
+ 0.660 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Nutlin |
+ 0.670 |
+ 0.694 |
+ 0.649 |
+ 0.654 |
+ 0.688 |
+ 0.689 |
+ 0.455 |
+ 0.623 |
+ 0.645 |
+ 0.647 |
+ 0.726 |
+ 0.713 |
+ 0.738 |
+ 0.418 |
+ 0.426 |
+ 0.629 |
+ 0.685 |
+ 0.697 |
+ 0.711 |
+ 0.560 |
+ 0.636 |
+ 0.712 |
+ 0.718 |
+ 0.707 |
+ 0.680 |
+ 0.731 |
+ 0.738 |
+ 0.749 |
+ 0.660 |
+ 0.706 |
+ 0.713 |
+ 0.678 |
+ 0.598 |
+ 0.606 |
+ 0.713 |
+ 0.677 |
+ 0.669 |
+ 0.705 |
+ 0.684 |
+ 0.682 |
+ 0.708 |
+ 0.688 |
+ 0.425 |
+ 0.422 |
+ 0.719 |
+ 0.464 |
+ 0.680 |
+ 0.723 |
+ 0.689 |
+ 0.586 |
+ 0.692 |
+ 0.694 |
+ 0.696 |
+ 0.699 |
+ 0.699 |
+ 0.704 |
+ 0.694 |
+ 0.698 |
+ 0.702 |
+ 0.703 |
+ 0.717 |
+ 0.577 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_WT_Nutlin |
+ 0.855 |
+ 0.865 |
+ 0.782 |
+ 0.795 |
+ 0.851 |
+ 0.856 |
+ 0.453 |
+ 0.766 |
+ 0.774 |
+ 0.778 |
+ 0.883 |
+ 0.863 |
+ 0.891 |
+ 0.415 |
+ 0.440 |
+ 0.828 |
+ 0.869 |
+ 0.886 |
+ 0.889 |
+ 0.626 |
+ 0.780 |
+ 0.862 |
+ 0.860 |
+ 0.847 |
+ 0.835 |
+ 0.870 |
+ 0.867 |
+ 0.889 |
+ 0.761 |
+ 0.881 |
+ 0.887 |
+ 0.867 |
+ 0.634 |
+ 0.752 |
+ 0.881 |
+ 0.821 |
+ 0.855 |
+ 0.894 |
+ 0.850 |
+ 0.865 |
+ 0.896 |
+ 0.853 |
+ 0.423 |
+ 0.405 |
+ 0.894 |
+ 0.509 |
+ 0.832 |
+ 0.888 |
+ 0.833 |
+ 0.670 |
+ 0.873 |
+ 0.881 |
+ 0.879 |
+ 0.880 |
+ 0.884 |
+ 0.888 |
+ 0.881 |
+ 0.886 |
+ 0.886 |
+ 0.891 |
+ 0.893 |
+ 0.688 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018 |
+ 0.851 |
+ 0.828 |
+ 0.768 |
+ 0.805 |
+ 0.806 |
+ 0.819 |
+ 0.544 |
+ 0.772 |
+ 0.828 |
+ 0.836 |
+ 0.838 |
+ 0.810 |
+ 0.854 |
+ 0.527 |
+ 0.544 |
+ 0.850 |
+ 0.889 |
+ 0.900 |
+ 0.907 |
+ 0.747 |
+ 0.731 |
+ 0.766 |
+ 0.786 |
+ 0.782 |
+ 0.761 |
+ 0.789 |
+ 0.782 |
+ 0.789 |
+ 0.791 |
+ 0.831 |
+ 0.860 |
+ 0.837 |
+ 0.532 |
+ 0.724 |
+ 0.776 |
+ 0.746 |
+ 0.838 |
+ 0.832 |
+ 0.820 |
+ 0.837 |
+ 0.833 |
+ 0.829 |
+ 0.527 |
+ 0.502 |
+ 0.868 |
+ 0.608 |
+ 0.765 |
+ 0.835 |
+ 0.775 |
+ 0.643 |
+ 0.869 |
+ 0.879 |
+ 0.875 |
+ 0.872 |
+ 0.874 |
+ 0.878 |
+ 0.888 |
+ 0.875 |
+ 0.884 |
+ 0.887 |
+ 0.906 |
+ 0.706 |
+ 1048 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P84126_THETH_Chan_2017 |
+ 0.790 |
+ 0.827 |
+ 0.844 |
+ 0.852 |
+ 0.818 |
+ 0.827 |
+ 0.661 |
+ 0.785 |
+ 0.860 |
+ 0.847 |
+ 0.818 |
+ 0.808 |
+ 0.832 |
+ 0.634 |
+ 0.809 |
+ 0.817 |
+ 0.831 |
+ 0.838 |
+ 0.811 |
+ 0.846 |
+ 0.737 |
+ 0.785 |
+ 0.781 |
+ 0.813 |
+ 0.792 |
+ 0.828 |
+ 0.837 |
+ 0.819 |
+ 0.870 |
+ 0.801 |
+ 0.817 |
+ 0.773 |
+ 0.699 |
+ 0.769 |
+ 0.775 |
+ 0.798 |
+ 0.795 |
+ 0.798 |
+ 0.811 |
+ 0.821 |
+ 0.822 |
+ 0.831 |
+ 0.754 |
+ 0.488 |
+ 0.797 |
+ 0.774 |
+ 0.661 |
+ 0.727 |
+ 0.768 |
+ 0.560 |
+ 0.795 |
+ 0.782 |
+ 0.748 |
+ 0.796 |
+ 0.799 |
+ 0.777 |
+ 0.787 |
+ 0.799 |
+ 0.802 |
+ 0.802 |
+ 0.835 |
+ 0.828 |
+ 1519 |
+ OrganismalFitness |
+ P84126_THETH |
+ Medium |
+ Prokaryote |
+
+
+ PA_I34A1_Wu_2015 |
+ 0.772 |
+ 0.773 |
+ 0.761 |
+ 0.766 |
+ 0.782 |
+ 0.783 |
+ 0.535 |
+ 0.694 |
+ 0.589 |
+ 0.589 |
+ 0.526 |
+ 0.537 |
+ 0.569 |
+ 0.518 |
+ 0.520 |
+ 0.521 |
+ 0.525 |
+ 0.525 |
+ 0.681 |
+ 0.667 |
+ 0.736 |
+ 0.755 |
+ 0.777 |
+ 0.778 |
+ 0.607 |
+ 0.710 |
+ 0.733 |
+ 0.725 |
+ 0.728 |
+ 0.805 |
+ 0.689 |
+ 0.693 |
+ 0.566 |
+ 0.722 |
+ 0.747 |
+ 0.777 |
+ 0.782 |
+ 0.793 |
+ 0.796 |
+ 0.804 |
+ 0.808 |
+ 0.803 |
+ 0.522 |
+ 0.515 |
+ 0.521 |
+ 0.519 |
+ 0.627 |
+ 0.616 |
+ 0.599 |
+ 0.540 |
+ 0.581 |
+ 0.584 |
+ 0.583 |
+ 0.594 |
+ 0.584 |
+ 0.591 |
+ 0.583 |
+ 0.588 |
+ 0.568 |
+ 0.590 |
+ 0.612 |
+ 0.577 |
+ 1820 |
+ OrganismalFitness |
+ PA_I34A1 |
+ Medium |
+ Virus |
+
+
+ PABP_YEAST_Melamed_2013 |
+ 0.855 |
+ 0.827 |
+ 0.787 |
+ 0.791 |
+ 0.846 |
+ 0.843 |
+ 0.752 |
+ 0.805 |
+ 0.838 |
+ 0.852 |
+ 0.866 |
+ 0.851 |
+ 0.861 |
+ 0.758 |
+ 0.805 |
+ 0.856 |
+ 0.898 |
+ 0.875 |
+ 0.882 |
+ 0.787 |
+ 0.847 |
+ 0.858 |
+ 0.863 |
+ 0.874 |
+ 0.846 |
+ 0.879 |
+ 0.879 |
+ 0.866 |
+ 0.860 |
+ 0.859 |
+ 0.884 |
+ 0.846 |
+ 0.649 |
+ 0.849 |
+ 0.850 |
+ 0.847 |
+ 0.874 |
+ 0.874 |
+ 0.872 |
+ 0.866 |
+ 0.868 |
+ 0.866 |
+ 0.825 |
+ 0.469 |
+ 0.851 |
+ 0.852 |
+ 0.691 |
+ 0.817 |
+ 0.779 |
+ 0.604 |
+ 0.870 |
+ 0.865 |
+ 0.872 |
+ 0.888 |
+ 0.880 |
+ 0.883 |
+ 0.884 |
+ 0.884 |
+ 0.901 |
+ 0.892 |
+ 0.885 |
+ 0.844 |
+ 37708 |
+ OrganismalFitness |
+ PABP_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ PAI1_HUMAN_Huttinger_2021 |
+ 0.693 |
+ 0.693 |
+ 0.686 |
+ 0.693 |
+ 0.695 |
+ 0.702 |
+ 0.526 |
+ 0.656 |
+ 0.705 |
+ 0.711 |
+ 0.708 |
+ 0.693 |
+ 0.704 |
+ 0.545 |
+ 0.688 |
+ 0.705 |
+ 0.715 |
+ 0.662 |
+ 0.636 |
+ 0.704 |
+ 0.570 |
+ 0.686 |
+ 0.677 |
+ 0.668 |
+ 0.670 |
+ 0.690 |
+ 0.693 |
+ 0.681 |
+ 0.665 |
+ 0.705 |
+ 0.692 |
+ 0.676 |
+ 0.561 |
+ 0.583 |
+ 0.678 |
+ 0.681 |
+ 0.683 |
+ 0.693 |
+ 0.697 |
+ 0.702 |
+ 0.706 |
+ 0.706 |
+ 0.669 |
+ 0.533 |
+ 0.705 |
+ 0.695 |
+ 0.690 |
+ 0.723 |
+ 0.703 |
+ 0.576 |
+ 0.712 |
+ 0.712 |
+ 0.712 |
+ 0.718 |
+ 0.714 |
+ 0.716 |
+ 0.715 |
+ 0.717 |
+ 0.713 |
+ 0.720 |
+ 0.728 |
+ 0.692 |
+ 5345 |
+ Activity |
+ PAI1_HUMAN |
+ NaN |
+ Human |
+
+
+ PHOT_CHLRE_Chen_2023 |
+ 0.592 |
+ 0.708 |
+ 0.858 |
+ 0.846 |
+ 0.660 |
+ 0.652 |
+ 0.821 |
+ 0.770 |
+ 0.850 |
+ 0.855 |
+ 0.819 |
+ 0.873 |
+ 0.888 |
+ 0.878 |
+ 0.911 |
+ 0.852 |
+ 0.870 |
+ 0.851 |
+ 0.876 |
+ 0.819 |
+ 0.836 |
+ 0.835 |
+ 0.790 |
+ 0.841 |
+ 0.754 |
+ 0.800 |
+ 0.788 |
+ 0.822 |
+ 0.791 |
+ 0.783 |
+ 0.781 |
+ 0.740 |
+ 0.647 |
+ 0.774 |
+ 0.744 |
+ 0.795 |
+ 0.786 |
+ 0.769 |
+ 0.793 |
+ 0.696 |
+ 0.710 |
+ 0.700 |
+ 0.808 |
+ 0.687 |
+ 0.796 |
+ 0.805 |
+ 0.599 |
+ 0.747 |
+ 0.830 |
+ 0.700 |
+ 0.797 |
+ 0.765 |
+ 0.770 |
+ 0.769 |
+ 0.772 |
+ 0.779 |
+ 0.778 |
+ 0.781 |
+ 0.786 |
+ 0.779 |
+ 0.865 |
+ 0.884 |
+ 167529 |
+ Activity |
+ PHOT_CHLRE |
+ High |
+ Eukaryote |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C |
+ 0.608 |
+ 0.662 |
+ 0.858 |
+ 0.837 |
+ 0.829 |
+ 0.850 |
+ 0.800 |
+ 0.753 |
+ 0.851 |
+ 0.872 |
+ 0.850 |
+ 0.799 |
+ 0.858 |
+ 0.837 |
+ 0.832 |
+ 0.846 |
+ 0.848 |
+ 0.761 |
+ 0.790 |
+ 0.837 |
+ 0.722 |
+ 0.859 |
+ 0.834 |
+ 0.825 |
+ 0.789 |
+ 0.857 |
+ 0.869 |
+ 0.852 |
+ 0.859 |
+ 0.873 |
+ 0.879 |
+ 0.861 |
+ 0.799 |
+ 0.766 |
+ 0.834 |
+ 0.859 |
+ 0.806 |
+ 0.850 |
+ 0.884 |
+ 0.856 |
+ 0.872 |
+ 0.888 |
+ 0.800 |
+ 0.381 |
+ 0.836 |
+ 0.777 |
+ 0.786 |
+ 0.808 |
+ 0.871 |
+ 0.862 |
+ 0.846 |
+ 0.824 |
+ 0.752 |
+ 0.782 |
+ 0.789 |
+ 0.808 |
+ 0.812 |
+ 0.835 |
+ 0.821 |
+ 0.817 |
+ 0.889 |
+ 0.895 |
+ 802 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M |
+ 0.795 |
+ 0.773 |
+ 0.787 |
+ 0.791 |
+ 0.787 |
+ 0.790 |
+ 0.760 |
+ 0.754 |
+ 0.781 |
+ 0.793 |
+ 0.745 |
+ 0.732 |
+ 0.726 |
+ 0.786 |
+ 0.836 |
+ 0.847 |
+ 0.827 |
+ 0.807 |
+ 0.765 |
+ 0.790 |
+ 0.771 |
+ 0.743 |
+ 0.752 |
+ 0.738 |
+ 0.793 |
+ 0.766 |
+ 0.744 |
+ 0.730 |
+ 0.752 |
+ 0.780 |
+ 0.712 |
+ 0.703 |
+ 0.729 |
+ 0.797 |
+ 0.771 |
+ 0.761 |
+ 0.813 |
+ 0.788 |
+ 0.784 |
+ 0.801 |
+ 0.782 |
+ 0.784 |
+ 0.748 |
+ 0.718 |
+ 0.646 |
+ 0.669 |
+ 0.715 |
+ 0.653 |
+ 0.850 |
+ 0.820 |
+ 0.794 |
+ 0.798 |
+ 0.795 |
+ 0.805 |
+ 0.812 |
+ 0.816 |
+ 0.806 |
+ 0.808 |
+ 0.810 |
+ 0.809 |
+ 0.750 |
+ 0.812 |
+ 1824 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF |
+ 0.623 |
+ 0.605 |
+ 0.662 |
+ 0.671 |
+ 0.690 |
+ 0.694 |
+ 0.612 |
+ 0.658 |
+ 0.639 |
+ 0.661 |
+ 0.680 |
+ 0.706 |
+ 0.737 |
+ 0.716 |
+ 0.733 |
+ 0.814 |
+ 0.721 |
+ 0.678 |
+ 0.676 |
+ 0.718 |
+ 0.702 |
+ 0.705 |
+ 0.680 |
+ 0.712 |
+ 0.710 |
+ 0.708 |
+ 0.711 |
+ 0.704 |
+ 0.712 |
+ 0.717 |
+ 0.752 |
+ 0.750 |
+ 0.555 |
+ 0.721 |
+ 0.665 |
+ 0.708 |
+ 0.690 |
+ 0.686 |
+ 0.714 |
+ 0.713 |
+ 0.697 |
+ 0.712 |
+ 0.684 |
+ 0.674 |
+ 0.705 |
+ 0.692 |
+ 0.835 |
+ 0.777 |
+ 0.830 |
+ 0.790 |
+ 0.685 |
+ 0.703 |
+ 0.724 |
+ 0.689 |
+ 0.707 |
+ 0.706 |
+ 0.692 |
+ 0.707 |
+ 0.713 |
+ 0.709 |
+ 0.728 |
+ 0.825 |
+ 1301 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_CXB3N_Mattenberger_2021 |
+ 0.709 |
+ 0.687 |
+ 0.681 |
+ 0.699 |
+ 0.726 |
+ 0.731 |
+ 0.478 |
+ 0.672 |
+ 0.749 |
+ 0.749 |
+ 0.651 |
+ 0.468 |
+ 0.516 |
+ 0.457 |
+ 0.469 |
+ 0.595 |
+ 0.702 |
+ 0.704 |
+ 0.717 |
+ 0.677 |
+ 0.664 |
+ 0.694 |
+ 0.689 |
+ 0.688 |
+ 0.561 |
+ 0.691 |
+ 0.688 |
+ 0.683 |
+ 0.694 |
+ 0.743 |
+ 0.694 |
+ 0.660 |
+ 0.501 |
+ 0.524 |
+ 0.631 |
+ 0.677 |
+ 0.667 |
+ 0.688 |
+ 0.706 |
+ 0.688 |
+ 0.709 |
+ 0.727 |
+ 0.465 |
+ 0.465 |
+ 0.677 |
+ 0.472 |
+ 0.596 |
+ 0.673 |
+ 0.559 |
+ 0.523 |
+ 0.673 |
+ 0.683 |
+ 0.685 |
+ 0.687 |
+ 0.684 |
+ 0.686 |
+ 0.685 |
+ 0.684 |
+ 0.690 |
+ 0.689 |
+ 0.558 |
+ 0.536 |
+ 15711 |
+ OrganismalFitness |
+ POLG_CXB3N |
+ Medium |
+ Virus |
+
+
+ POLG_DEN26_Suphatrakul_2023 |
+ 0.760 |
+ 0.794 |
+ 0.633 |
+ 0.633 |
+ 0.772 |
+ 0.772 |
+ 0.485 |
+ 0.722 |
+ 0.848 |
+ 0.850 |
+ 0.669 |
+ 0.507 |
+ 0.512 |
+ 0.481 |
+ 0.513 |
+ 0.547 |
+ 0.587 |
+ 0.645 |
+ 0.690 |
+ 0.715 |
+ 0.732 |
+ 0.741 |
+ 0.733 |
+ 0.732 |
+ 0.725 |
+ 0.754 |
+ 0.756 |
+ 0.747 |
+ 0.745 |
+ 0.816 |
+ 0.809 |
+ 0.771 |
+ 0.522 |
+ 0.484 |
+ 0.577 |
+ 0.743 |
+ 0.729 |
+ 0.709 |
+ 0.781 |
+ 0.745 |
+ 0.714 |
+ 0.790 |
+ 0.478 |
+ 0.473 |
+ 0.709 |
+ 0.504 |
+ 0.690 |
+ 0.771 |
+ 0.572 |
+ 0.561 |
+ 0.632 |
+ 0.624 |
+ 0.631 |
+ 0.643 |
+ 0.626 |
+ 0.643 |
+ 0.633 |
+ 0.637 |
+ 0.627 |
+ 0.640 |
+ 0.592 |
+ 0.524 |
+ 16897 |
+ OrganismalFitness |
+ POLG_DEN26 |
+ Low |
+ Virus |
+
+
+ POLG_HCVJF_Qi_2014 |
+ 0.824 |
+ 0.784 |
+ 0.712 |
+ 0.716 |
+ 0.823 |
+ 0.828 |
+ 0.470 |
+ 0.598 |
+ 0.792 |
+ 0.798 |
+ 0.588 |
+ 0.842 |
+ 0.839 |
+ 0.552 |
+ 0.565 |
+ 0.556 |
+ 0.556 |
+ 0.543 |
+ 0.535 |
+ 0.631 |
+ 0.714 |
+ 0.744 |
+ 0.741 |
+ 0.761 |
+ 0.721 |
+ 0.756 |
+ 0.669 |
+ 0.720 |
+ 0.780 |
+ 0.840 |
+ 0.821 |
+ 0.772 |
+ 0.599 |
+ 0.747 |
+ 0.769 |
+ 0.780 |
+ 0.768 |
+ 0.790 |
+ 0.809 |
+ 0.748 |
+ 0.780 |
+ 0.798 |
+ 0.555 |
+ 0.520 |
+ 0.763 |
+ 0.554 |
+ 0.483 |
+ 0.672 |
+ 0.869 |
+ 0.703 |
+ 0.673 |
+ 0.714 |
+ 0.700 |
+ 0.715 |
+ 0.698 |
+ 0.657 |
+ 0.683 |
+ 0.685 |
+ 0.679 |
+ 0.712 |
+ 0.596 |
+ 0.584 |
+ 1630 |
+ OrganismalFitness |
+ POLG_HCVJF |
+ Medium |
+ Virus |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD |
+ 0.638 |
+ 0.766 |
+ 0.695 |
+ 0.720 |
+ 0.740 |
+ 0.740 |
+ 0.525 |
+ 0.722 |
+ 0.679 |
+ 0.735 |
+ 0.737 |
+ 0.523 |
+ 0.558 |
+ 0.525 |
+ 0.556 |
+ 0.515 |
+ 0.567 |
+ 0.553 |
+ 0.530 |
+ 0.818 |
+ 0.558 |
+ 0.530 |
+ 0.507 |
+ 0.574 |
+ 0.542 |
+ 0.470 |
+ 0.486 |
+ 0.463 |
+ 0.578 |
+ 0.761 |
+ 0.822 |
+ 0.826 |
+ 0.575 |
+ 0.498 |
+ 0.503 |
+ 0.500 |
+ 0.702 |
+ 0.706 |
+ 0.702 |
+ 0.734 |
+ 0.741 |
+ 0.733 |
+ 0.479 |
+ 0.448 |
+ 0.491 |
+ 0.469 |
+ 0.778 |
+ 0.748 |
+ 0.882 |
+ 0.841 |
+ 0.859 |
+ 0.863 |
+ 0.848 |
+ 0.892 |
+ 0.894 |
+ 0.894 |
+ 0.867 |
+ 0.896 |
+ 0.894 |
+ 0.888 |
+ 0.907 |
+ 0.830 |
+ 5130 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PPARG_HUMAN_Majithia_2016 |
+ 0.758 |
+ 0.814 |
+ 0.837 |
+ 0.842 |
+ 0.838 |
+ 0.845 |
+ 0.625 |
+ 0.731 |
+ 0.810 |
+ 0.821 |
+ 0.812 |
+ 0.841 |
+ 0.843 |
+ 0.533 |
+ 0.612 |
+ 0.808 |
+ 0.865 |
+ 0.893 |
+ 0.869 |
+ 0.858 |
+ 0.831 |
+ 0.827 |
+ 0.697 |
+ 0.719 |
+ 0.810 |
+ 0.837 |
+ 0.822 |
+ 0.838 |
+ 0.734 |
+ 0.871 |
+ 0.852 |
+ 0.843 |
+ 0.732 |
+ 0.855 |
+ 0.816 |
+ 0.788 |
+ 0.862 |
+ 0.845 |
+ 0.833 |
+ 0.871 |
+ 0.861 |
+ 0.859 |
+ 0.679 |
+ 0.524 |
+ 0.791 |
+ 0.779 |
+ 0.807 |
+ 0.788 |
+ 0.834 |
+ 0.668 |
+ 0.860 |
+ 0.862 |
+ 0.867 |
+ 0.865 |
+ 0.865 |
+ 0.866 |
+ 0.864 |
+ 0.865 |
+ 0.865 |
+ 0.871 |
+ 0.861 |
+ 0.811 |
+ 9576 |
+ Activity |
+ PPARG_HUMAN |
+ Medium |
+ Human |
+
+
+ PPM1D_HUMAN_Miller_2022 |
+ 0.783 |
+ 0.779 |
+ 0.779 |
+ 0.782 |
+ 0.816 |
+ 0.817 |
+ 0.513 |
+ 0.694 |
+ 0.741 |
+ 0.770 |
+ 0.797 |
+ 0.805 |
+ 0.816 |
+ 0.639 |
+ 0.692 |
+ 0.734 |
+ 0.805 |
+ 0.820 |
+ 0.817 |
+ 0.697 |
+ 0.726 |
+ 0.767 |
+ 0.776 |
+ 0.719 |
+ 0.763 |
+ 0.795 |
+ 0.790 |
+ 0.787 |
+ 0.714 |
+ 0.813 |
+ 0.804 |
+ 0.777 |
+ 0.614 |
+ 0.718 |
+ 0.769 |
+ 0.775 |
+ 0.801 |
+ 0.805 |
+ 0.805 |
+ 0.819 |
+ 0.817 |
+ 0.818 |
+ 0.677 |
+ 0.473 |
+ 0.794 |
+ 0.763 |
+ 0.756 |
+ 0.798 |
+ 0.783 |
+ 0.627 |
+ 0.791 |
+ 0.793 |
+ 0.789 |
+ 0.796 |
+ 0.801 |
+ 0.794 |
+ 0.797 |
+ 0.797 |
+ 0.799 |
+ 0.802 |
+ 0.824 |
+ 0.757 |
+ 7889 |
+ OrganismalFitness |
+ PPM1D_HUMAN |
+ Low |
+ Human |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC |
+ 0.845 |
+ 0.882 |
+ 0.926 |
+ 0.921 |
+ 0.939 |
+ 0.940 |
+ 0.771 |
+ 0.771 |
+ 0.936 |
+ 0.939 |
+ 0.913 |
+ 0.850 |
+ 0.881 |
+ 0.783 |
+ 0.779 |
+ 0.951 |
+ 0.945 |
+ 0.921 |
+ 0.912 |
+ 0.904 |
+ 0.829 |
+ 0.885 |
+ 0.886 |
+ 0.898 |
+ 0.853 |
+ 0.888 |
+ 0.907 |
+ 0.898 |
+ 0.912 |
+ 0.946 |
+ 0.944 |
+ 0.942 |
+ 0.856 |
+ 0.828 |
+ 0.800 |
+ 0.824 |
+ 0.917 |
+ 0.920 |
+ 0.909 |
+ 0.932 |
+ 0.943 |
+ 0.933 |
+ 0.718 |
+ 0.611 |
+ 0.635 |
+ 0.719 |
+ 0.798 |
+ 0.664 |
+ 0.902 |
+ 0.913 |
+ 0.945 |
+ 0.947 |
+ 0.947 |
+ 0.949 |
+ 0.954 |
+ 0.954 |
+ 0.949 |
+ 0.953 |
+ 0.953 |
+ 0.954 |
+ 0.940 |
+ 0.937 |
+ 2033 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PRKN_HUMAN_Clausen_2023 |
+ 0.839 |
+ 0.840 |
+ 0.835 |
+ 0.827 |
+ 0.848 |
+ 0.844 |
+ 0.593 |
+ 0.757 |
+ 0.797 |
+ 0.807 |
+ 0.816 |
+ 0.839 |
+ 0.854 |
+ 0.615 |
+ 0.652 |
+ 0.689 |
+ 0.769 |
+ 0.861 |
+ 0.865 |
+ 0.753 |
+ 0.652 |
+ 0.803 |
+ 0.828 |
+ 0.819 |
+ 0.740 |
+ 0.839 |
+ 0.838 |
+ 0.835 |
+ 0.799 |
+ 0.846 |
+ 0.832 |
+ 0.809 |
+ 0.652 |
+ 0.628 |
+ 0.801 |
+ 0.817 |
+ 0.816 |
+ 0.842 |
+ 0.848 |
+ 0.842 |
+ 0.855 |
+ 0.860 |
+ 0.659 |
+ 0.540 |
+ 0.821 |
+ 0.690 |
+ 0.845 |
+ 0.831 |
+ 0.873 |
+ 0.656 |
+ 0.788 |
+ 0.791 |
+ 0.793 |
+ 0.810 |
+ 0.800 |
+ 0.805 |
+ 0.807 |
+ 0.806 |
+ 0.800 |
+ 0.808 |
+ 0.866 |
+ 0.768 |
+ 8756 |
+ Expression |
+ PRKN_HUMAN |
+ Low |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE |
+ 0.859 |
+ 0.864 |
+ 0.856 |
+ 0.855 |
+ 0.860 |
+ 0.857 |
+ 0.712 |
+ 0.785 |
+ 0.824 |
+ 0.824 |
+ 0.904 |
+ 0.871 |
+ 0.880 |
+ 0.811 |
+ 0.842 |
+ 0.895 |
+ 0.909 |
+ 0.888 |
+ 0.853 |
+ 0.806 |
+ 0.706 |
+ 0.656 |
+ 0.794 |
+ 0.739 |
+ 0.793 |
+ 0.809 |
+ 0.388 |
+ 0.802 |
+ 0.834 |
+ 0.846 |
+ 0.853 |
+ 0.825 |
+ 0.554 |
+ 0.717 |
+ 0.755 |
+ 0.780 |
+ 0.845 |
+ 0.841 |
+ 0.820 |
+ 0.866 |
+ 0.866 |
+ 0.843 |
+ 0.748 |
+ 0.650 |
+ 0.864 |
+ 0.784 |
+ 0.828 |
+ 0.854 |
+ 0.907 |
+ 0.873 |
+ 0.902 |
+ 0.896 |
+ 0.898 |
+ 0.903 |
+ 0.901 |
+ 0.907 |
+ 0.895 |
+ 0.901 |
+ 0.901 |
+ 0.905 |
+ 0.909 |
+ 0.906 |
+ 1579 |
+ Stability |
+ PSAE_PICP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Matreyek_2021 |
+ 0.681 |
+ 0.695 |
+ 0.681 |
+ 0.691 |
+ 0.695 |
+ 0.700 |
+ 0.579 |
+ 0.683 |
+ 0.714 |
+ 0.736 |
+ 0.721 |
+ 0.704 |
+ 0.730 |
+ 0.587 |
+ 0.635 |
+ 0.727 |
+ 0.732 |
+ 0.642 |
+ 0.654 |
+ 0.739 |
+ 0.595 |
+ 0.730 |
+ 0.712 |
+ 0.702 |
+ 0.633 |
+ 0.679 |
+ 0.670 |
+ 0.696 |
+ 0.657 |
+ 0.737 |
+ 0.698 |
+ 0.695 |
+ 0.567 |
+ 0.636 |
+ 0.717 |
+ 0.680 |
+ 0.688 |
+ 0.730 |
+ 0.711 |
+ 0.706 |
+ 0.731 |
+ 0.725 |
+ 0.622 |
+ 0.526 |
+ 0.719 |
+ 0.669 |
+ 0.736 |
+ 0.734 |
+ 0.736 |
+ 0.611 |
+ 0.719 |
+ 0.714 |
+ 0.729 |
+ 0.718 |
+ 0.717 |
+ 0.723 |
+ 0.725 |
+ 0.726 |
+ 0.721 |
+ 0.727 |
+ 0.748 |
+ 0.725 |
+ 5083 |
+ Expression |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ PTEN_HUMAN_Mighell_2018 |
+ 0.815 |
+ 0.826 |
+ 0.827 |
+ 0.829 |
+ 0.844 |
+ 0.848 |
+ 0.619 |
+ 0.734 |
+ 0.828 |
+ 0.833 |
+ 0.805 |
+ 0.816 |
+ 0.833 |
+ 0.639 |
+ 0.772 |
+ 0.850 |
+ 0.832 |
+ 0.695 |
+ 0.683 |
+ 0.831 |
+ 0.730 |
+ 0.774 |
+ 0.720 |
+ 0.693 |
+ 0.770 |
+ 0.696 |
+ 0.689 |
+ 0.714 |
+ 0.660 |
+ 0.842 |
+ 0.819 |
+ 0.813 |
+ 0.541 |
+ 0.750 |
+ 0.767 |
+ 0.702 |
+ 0.829 |
+ 0.813 |
+ 0.781 |
+ 0.844 |
+ 0.844 |
+ 0.846 |
+ 0.747 |
+ 0.481 |
+ 0.832 |
+ 0.815 |
+ 0.771 |
+ 0.762 |
+ 0.805 |
+ 0.625 |
+ 0.816 |
+ 0.791 |
+ 0.811 |
+ 0.818 |
+ 0.826 |
+ 0.825 |
+ 0.816 |
+ 0.816 |
+ 0.826 |
+ 0.828 |
+ 0.845 |
+ 0.828 |
+ 7260 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q2N0S5_9HIV1_Haddox_2018 |
+ 0.758 |
+ 0.704 |
+ 0.690 |
+ 0.709 |
+ 0.760 |
+ 0.763 |
+ 0.510 |
+ 0.723 |
+ 0.770 |
+ 0.773 |
+ 0.745 |
+ 0.767 |
+ 0.781 |
+ 0.509 |
+ 0.509 |
+ 0.511 |
+ 0.532 |
+ 0.559 |
+ 0.586 |
+ 0.711 |
+ 0.770 |
+ 0.707 |
+ 0.698 |
+ 0.671 |
+ 0.769 |
+ 0.702 |
+ 0.702 |
+ 0.722 |
+ 0.683 |
+ 0.764 |
+ 0.751 |
+ 0.753 |
+ 0.644 |
+ 0.758 |
+ 0.716 |
+ 0.710 |
+ 0.772 |
+ 0.762 |
+ 0.762 |
+ 0.775 |
+ 0.769 |
+ 0.768 |
+ 0.720 |
+ 0.507 |
+ 0.765 |
+ 0.738 |
+ 0.695 |
+ 0.743 |
+ 0.634 |
+ 0.575 |
+ 0.615 |
+ 0.636 |
+ 0.658 |
+ 0.656 |
+ 0.631 |
+ 0.634 |
+ 0.635 |
+ 0.628 |
+ 0.615 |
+ 0.645 |
+ 0.610 |
+ 0.560 |
+ 12729 |
+ OrganismalFitness |
+ Q2N0S5_9HIV1 |
+ Medium |
+ Virus |
+
+
+ Q53Z42_HUMAN_McShan_2019_binding-TAPBPR |
+ 0.671 |
+ 0.658 |
+ 0.665 |
+ 0.666 |
+ 0.669 |
+ 0.667 |
+ 0.545 |
+ 0.608 |
+ 0.639 |
+ 0.645 |
+ 0.605 |
+ 0.636 |
+ 0.646 |
+ 0.605 |
+ 0.596 |
+ 0.671 |
+ 0.657 |
+ 0.630 |
+ 0.663 |
+ 0.651 |
+ 0.615 |
+ 0.619 |
+ 0.615 |
+ 0.593 |
+ 0.637 |
+ 0.636 |
+ 0.624 |
+ 0.642 |
+ 0.633 |
+ 0.638 |
+ 0.679 |
+ 0.666 |
+ 0.551 |
+ 0.610 |
+ 0.623 |
+ 0.576 |
+ 0.647 |
+ 0.656 |
+ 0.634 |
+ 0.667 |
+ 0.671 |
+ 0.662 |
+ 0.699 |
+ 0.609 |
+ 0.642 |
+ 0.653 |
+ 0.643 |
+ 0.664 |
+ 0.655 |
+ 0.590 |
+ 0.639 |
+ 0.645 |
+ 0.659 |
+ 0.656 |
+ 0.652 |
+ 0.652 |
+ 0.648 |
+ 0.646 |
+ 0.647 |
+ 0.654 |
+ 0.647 |
+ 0.717 |
+ 3344 |
+ Binding |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q53Z42_HUMAN_McShan_2019_expression |
+ 0.771 |
+ 0.755 |
+ 0.772 |
+ 0.779 |
+ 0.784 |
+ 0.784 |
+ 0.471 |
+ 0.721 |
+ 0.784 |
+ 0.791 |
+ 0.719 |
+ 0.747 |
+ 0.766 |
+ 0.533 |
+ 0.545 |
+ 0.745 |
+ 0.785 |
+ 0.785 |
+ 0.792 |
+ 0.787 |
+ 0.718 |
+ 0.752 |
+ 0.752 |
+ 0.746 |
+ 0.747 |
+ 0.765 |
+ 0.754 |
+ 0.767 |
+ 0.784 |
+ 0.785 |
+ 0.784 |
+ 0.755 |
+ 0.582 |
+ 0.721 |
+ 0.737 |
+ 0.728 |
+ 0.773 |
+ 0.784 |
+ 0.787 |
+ 0.788 |
+ 0.794 |
+ 0.795 |
+ 0.657 |
+ 0.539 |
+ 0.777 |
+ 0.746 |
+ 0.734 |
+ 0.761 |
+ 0.739 |
+ 0.604 |
+ 0.777 |
+ 0.781 |
+ 0.786 |
+ 0.783 |
+ 0.792 |
+ 0.786 |
+ 0.784 |
+ 0.787 |
+ 0.792 |
+ 0.793 |
+ 0.771 |
+ 0.812 |
+ 3344 |
+ Expression |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q59976_STRSQ_Romero_2015 |
+ 0.781 |
+ 0.831 |
+ 0.853 |
+ 0.857 |
+ 0.857 |
+ 0.864 |
+ 0.689 |
+ 0.776 |
+ 0.865 |
+ 0.869 |
+ 0.827 |
+ 0.786 |
+ 0.796 |
+ 0.574 |
+ 0.741 |
+ 0.775 |
+ 0.813 |
+ 0.812 |
+ 0.817 |
+ 0.850 |
+ 0.826 |
+ 0.850 |
+ 0.849 |
+ 0.857 |
+ 0.838 |
+ 0.859 |
+ 0.866 |
+ 0.865 |
+ 0.869 |
+ 0.872 |
+ 0.854 |
+ 0.827 |
+ 0.674 |
+ 0.827 |
+ 0.850 |
+ 0.842 |
+ 0.838 |
+ 0.856 |
+ 0.857 |
+ 0.864 |
+ 0.869 |
+ 0.869 |
+ 0.763 |
+ 0.492 |
+ 0.824 |
+ 0.784 |
+ 0.711 |
+ 0.783 |
+ 0.781 |
+ 0.569 |
+ 0.813 |
+ 0.792 |
+ 0.792 |
+ 0.803 |
+ 0.808 |
+ 0.798 |
+ 0.810 |
+ 0.797 |
+ 0.802 |
+ 0.809 |
+ 0.843 |
+ 0.790 |
+ 2999 |
+ Activity |
+ Q59976_STRSQ |
+ Medium |
+ Prokaryote |
+
+
+ Q6WV13_9MAXI_Somermeyer_2022 |
+ 0.634 |
+ 0.658 |
+ 0.605 |
+ 0.606 |
+ 0.640 |
+ 0.639 |
+ 0.496 |
+ 0.562 |
+ 0.661 |
+ 0.659 |
+ 0.599 |
+ 0.520 |
+ 0.512 |
+ 0.506 |
+ 0.519 |
+ 0.501 |
+ 0.505 |
+ 0.505 |
+ 0.493 |
+ 0.626 |
+ 0.518 |
+ 0.519 |
+ 0.502 |
+ 0.552 |
+ 0.513 |
+ 0.500 |
+ 0.512 |
+ 0.496 |
+ 0.493 |
+ 0.682 |
+ 0.636 |
+ 0.639 |
+ 0.514 |
+ 0.513 |
+ 0.518 |
+ 0.512 |
+ 0.599 |
+ 0.599 |
+ 0.599 |
+ 0.637 |
+ 0.637 |
+ 0.635 |
+ 0.505 |
+ 0.501 |
+ 0.496 |
+ 0.510 |
+ 0.611 |
+ 0.615 |
+ 0.664 |
+ 0.601 |
+ 0.610 |
+ 0.624 |
+ 0.614 |
+ 0.636 |
+ 0.628 |
+ 0.630 |
+ 0.621 |
+ 0.622 |
+ 0.621 |
+ 0.624 |
+ 0.554 |
+ 0.536 |
+ 31401 |
+ Activity |
+ Q6WV12_9MAXI |
+ Low |
+ Eukaryote |
+
+
+ Q837P4_ENTFA_Meier_2023 |
+ 0.705 |
+ 0.712 |
+ 0.718 |
+ 0.719 |
+ 0.725 |
+ 0.736 |
+ 0.709 |
+ 0.675 |
+ 0.711 |
+ 0.719 |
+ 0.765 |
+ 0.741 |
+ 0.755 |
+ 0.682 |
+ 0.716 |
+ 0.718 |
+ 0.724 |
+ 0.744 |
+ 0.729 |
+ 0.476 |
+ 0.733 |
+ 0.712 |
+ 0.721 |
+ 0.734 |
+ 0.742 |
+ 0.749 |
+ 0.760 |
+ 0.706 |
+ 0.754 |
+ 0.738 |
+ 0.738 |
+ 0.718 |
+ 0.561 |
+ 0.721 |
+ 0.747 |
+ 0.729 |
+ 0.719 |
+ 0.754 |
+ 0.749 |
+ 0.744 |
+ 0.752 |
+ 0.752 |
+ 0.704 |
+ 0.677 |
+ 0.755 |
+ 0.720 |
+ 0.630 |
+ 0.725 |
+ 0.685 |
+ 0.551 |
+ 0.726 |
+ 0.714 |
+ 0.718 |
+ 0.736 |
+ 0.740 |
+ 0.748 |
+ 0.741 |
+ 0.711 |
+ 0.736 |
+ 0.737 |
+ 0.772 |
+ 0.720 |
+ 697 |
+ Activity |
+ Q837P4_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q837P5_ENTFA_Meier_2023 |
+ 0.565 |
+ 0.677 |
+ 0.682 |
+ 0.698 |
+ 0.664 |
+ 0.668 |
+ 0.573 |
+ 0.601 |
+ 0.613 |
+ 0.618 |
+ 0.638 |
+ 0.646 |
+ 0.638 |
+ 0.526 |
+ 0.578 |
+ 0.629 |
+ 0.653 |
+ 0.691 |
+ 0.671 |
+ 0.636 |
+ 0.657 |
+ 0.676 |
+ 0.701 |
+ 0.698 |
+ 0.639 |
+ 0.684 |
+ 0.690 |
+ 0.735 |
+ 0.714 |
+ 0.644 |
+ 0.665 |
+ 0.658 |
+ 0.559 |
+ 0.680 |
+ 0.703 |
+ 0.741 |
+ 0.643 |
+ 0.674 |
+ 0.714 |
+ 0.679 |
+ 0.686 |
+ 0.699 |
+ 0.611 |
+ 0.557 |
+ 0.653 |
+ 0.600 |
+ 0.665 |
+ 0.642 |
+ 0.676 |
+ 0.563 |
+ 0.636 |
+ 0.641 |
+ 0.648 |
+ 0.629 |
+ 0.657 |
+ 0.650 |
+ 0.630 |
+ 0.647 |
+ 0.637 |
+ 0.648 |
+ 0.640 |
+ 0.629 |
+ 747 |
+ Activity |
+ Q837P5_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q8WTC7_9CNID_Somermeyer_2022 |
+ 0.630 |
+ 0.671 |
+ 0.603 |
+ 0.602 |
+ 0.652 |
+ 0.649 |
+ 0.495 |
+ 0.648 |
+ 0.638 |
+ 0.641 |
+ 0.592 |
+ 0.471 |
+ 0.463 |
+ 0.466 |
+ 0.454 |
+ 0.459 |
+ 0.465 |
+ 0.461 |
+ 0.484 |
+ 0.601 |
+ 0.476 |
+ 0.492 |
+ 0.516 |
+ 0.489 |
+ 0.482 |
+ 0.469 |
+ 0.487 |
+ 0.606 |
+ 0.613 |
+ 0.665 |
+ 0.646 |
+ 0.654 |
+ 0.477 |
+ 0.452 |
+ 0.490 |
+ 0.639 |
+ 0.596 |
+ 0.601 |
+ 0.642 |
+ 0.641 |
+ 0.646 |
+ 0.660 |
+ 0.472 |
+ 0.473 |
+ 0.460 |
+ 0.460 |
+ 0.510 |
+ 0.543 |
+ 0.628 |
+ 0.571 |
+ 0.604 |
+ 0.598 |
+ 0.603 |
+ 0.609 |
+ 0.598 |
+ 0.606 |
+ 0.605 |
+ 0.599 |
+ 0.599 |
+ 0.603 |
+ 0.537 |
+ 0.447 |
+ 33510 |
+ Activity |
+ Q8WTC7_9CNID |
+ Low |
+ Eukaryote |
+
+
+ R1AB_SARS2_Flynn_2022 |
+ 0.835 |
+ 0.829 |
+ 0.618 |
+ 0.633 |
+ 0.846 |
+ 0.848 |
+ 0.487 |
+ 0.675 |
+ 0.496 |
+ 0.496 |
+ 0.575 |
+ 0.497 |
+ 0.490 |
+ 0.513 |
+ 0.499 |
+ 0.555 |
+ 0.567 |
+ 0.792 |
+ 0.824 |
+ 0.663 |
+ 0.636 |
+ 0.667 |
+ 0.674 |
+ 0.676 |
+ 0.643 |
+ 0.643 |
+ 0.623 |
+ 0.634 |
+ 0.642 |
+ 0.825 |
+ 0.801 |
+ 0.745 |
+ 0.486 |
+ 0.607 |
+ 0.637 |
+ 0.630 |
+ 0.711 |
+ 0.735 |
+ 0.734 |
+ 0.823 |
+ 0.834 |
+ 0.832 |
+ 0.501 |
+ 0.486 |
+ 0.549 |
+ 0.495 |
+ 0.752 |
+ 0.733 |
+ 0.777 |
+ 0.626 |
+ 0.650 |
+ 0.636 |
+ 0.659 |
+ 0.675 |
+ 0.665 |
+ 0.661 |
+ 0.662 |
+ 0.660 |
+ 0.641 |
+ 0.664 |
+ 0.636 |
+ 0.589 |
+ 5725 |
+ OrganismalFitness |
+ R1AB_SARS2 |
+ Medium |
+ Virus |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ |
+ 0.605 |
+ 0.585 |
+ 0.660 |
+ 0.663 |
+ 0.675 |
+ 0.692 |
+ 0.735 |
+ 0.703 |
+ 0.853 |
+ 0.835 |
+ 0.794 |
+ 0.748 |
+ 0.769 |
+ 0.726 |
+ 0.846 |
+ 0.868 |
+ 0.761 |
+ 0.754 |
+ 0.797 |
+ 0.714 |
+ 0.788 |
+ 0.824 |
+ 0.803 |
+ 0.766 |
+ 0.816 |
+ 0.744 |
+ 0.808 |
+ 0.754 |
+ 0.747 |
+ 0.799 |
+ 0.725 |
+ 0.681 |
+ 0.649 |
+ 0.750 |
+ 0.834 |
+ 0.745 |
+ 0.745 |
+ 0.811 |
+ 0.744 |
+ 0.734 |
+ 0.762 |
+ 0.723 |
+ 0.811 |
+ 0.620 |
+ 0.625 |
+ 0.726 |
+ 0.761 |
+ 0.666 |
+ 0.865 |
+ 0.778 |
+ 0.786 |
+ 0.801 |
+ 0.804 |
+ 0.800 |
+ 0.812 |
+ 0.802 |
+ 0.799 |
+ 0.801 |
+ 0.797 |
+ 0.805 |
+ 0.719 |
+ 0.818 |
+ 912 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RAF1_HUMAN_Zinkus-Boltz_2019 |
+ 0.708 |
+ 0.719 |
+ 0.706 |
+ 0.712 |
+ 0.725 |
+ 0.725 |
+ 0.525 |
+ 0.684 |
+ 0.752 |
+ 0.763 |
+ 0.760 |
+ 0.732 |
+ 0.769 |
+ 0.523 |
+ 0.619 |
+ 0.740 |
+ 0.756 |
+ 0.744 |
+ 0.734 |
+ 0.678 |
+ 0.654 |
+ 0.715 |
+ 0.708 |
+ 0.696 |
+ 0.699 |
+ 0.712 |
+ 0.708 |
+ 0.696 |
+ 0.708 |
+ 0.727 |
+ 0.764 |
+ 0.730 |
+ 0.569 |
+ 0.653 |
+ 0.688 |
+ 0.702 |
+ 0.693 |
+ 0.703 |
+ 0.711 |
+ 0.717 |
+ 0.734 |
+ 0.737 |
+ 0.607 |
+ 0.529 |
+ 0.779 |
+ 0.707 |
+ 0.638 |
+ 0.725 |
+ 0.639 |
+ 0.665 |
+ 0.729 |
+ 0.726 |
+ 0.730 |
+ 0.752 |
+ 0.734 |
+ 0.747 |
+ 0.732 |
+ 0.729 |
+ 0.740 |
+ 0.745 |
+ 0.727 |
+ 0.590 |
+ 297 |
+ OrganismalFitness |
+ RAF1_HUMAN |
+ Low |
+ Human |
+
+
+ RASH_HUMAN_Bandaru_2017 |
+ 0.798 |
+ 0.811 |
+ 0.827 |
+ 0.842 |
+ 0.839 |
+ 0.844 |
+ 0.689 |
+ 0.772 |
+ 0.823 |
+ 0.837 |
+ 0.757 |
+ 0.787 |
+ 0.811 |
+ 0.807 |
+ 0.849 |
+ 0.835 |
+ 0.850 |
+ 0.817 |
+ 0.735 |
+ 0.859 |
+ 0.819 |
+ 0.805 |
+ 0.805 |
+ 0.791 |
+ 0.809 |
+ 0.791 |
+ 0.791 |
+ 0.767 |
+ 0.710 |
+ 0.815 |
+ 0.789 |
+ 0.779 |
+ 0.648 |
+ 0.787 |
+ 0.803 |
+ 0.760 |
+ 0.821 |
+ 0.830 |
+ 0.808 |
+ 0.839 |
+ 0.844 |
+ 0.846 |
+ 0.838 |
+ 0.662 |
+ 0.720 |
+ 0.822 |
+ 0.711 |
+ 0.641 |
+ 0.807 |
+ 0.644 |
+ 0.798 |
+ 0.794 |
+ 0.799 |
+ 0.796 |
+ 0.806 |
+ 0.793 |
+ 0.813 |
+ 0.796 |
+ 0.805 |
+ 0.812 |
+ 0.747 |
+ 0.827 |
+ 3134 |
+ Activity |
+ RASH_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_abundance |
+ 0.648 |
+ 0.626 |
+ 0.679 |
+ 0.672 |
+ 0.665 |
+ 0.676 |
+ 0.605 |
+ 0.623 |
+ 0.620 |
+ 0.634 |
+ 0.644 |
+ 0.606 |
+ 0.624 |
+ 0.658 |
+ 0.683 |
+ 0.663 |
+ 0.631 |
+ 0.610 |
+ 0.603 |
+ 0.665 |
+ 0.633 |
+ 0.671 |
+ 0.690 |
+ 0.699 |
+ 0.640 |
+ 0.677 |
+ 0.711 |
+ 0.677 |
+ 0.738 |
+ 0.683 |
+ 0.605 |
+ 0.584 |
+ 0.607 |
+ 0.600 |
+ 0.662 |
+ 0.706 |
+ 0.636 |
+ 0.682 |
+ 0.721 |
+ 0.667 |
+ 0.685 |
+ 0.694 |
+ 0.616 |
+ 0.655 |
+ 0.531 |
+ 0.621 |
+ 0.659 |
+ 0.664 |
+ 0.688 |
+ 0.662 |
+ 0.627 |
+ 0.620 |
+ 0.645 |
+ 0.637 |
+ 0.652 |
+ 0.637 |
+ 0.646 |
+ 0.634 |
+ 0.623 |
+ 0.639 |
+ 0.671 |
+ 0.661 |
+ 26012 |
+ Expression |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_binding-DARPin_K55 |
+ 0.734 |
+ 0.755 |
+ 0.822 |
+ 0.827 |
+ 0.815 |
+ 0.816 |
+ 0.645 |
+ 0.701 |
+ 0.830 |
+ 0.833 |
+ 0.806 |
+ 0.777 |
+ 0.805 |
+ 0.796 |
+ 0.824 |
+ 0.834 |
+ 0.842 |
+ 0.790 |
+ 0.733 |
+ 0.861 |
+ 0.798 |
+ 0.781 |
+ 0.798 |
+ 0.754 |
+ 0.778 |
+ 0.791 |
+ 0.773 |
+ 0.718 |
+ 0.694 |
+ 0.824 |
+ 0.779 |
+ 0.753 |
+ 0.660 |
+ 0.765 |
+ 0.809 |
+ 0.762 |
+ 0.791 |
+ 0.820 |
+ 0.791 |
+ 0.818 |
+ 0.826 |
+ 0.817 |
+ 0.794 |
+ 0.648 |
+ 0.732 |
+ 0.814 |
+ 0.628 |
+ 0.640 |
+ 0.814 |
+ 0.648 |
+ 0.779 |
+ 0.781 |
+ 0.783 |
+ 0.782 |
+ 0.792 |
+ 0.792 |
+ 0.779 |
+ 0.782 |
+ 0.756 |
+ 0.789 |
+ 0.812 |
+ 0.814 |
+ 24873 |
+ Binding |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RBP1_HUMAN_Tsuboyama_2023_2KWH |
+ 0.584 |
+ 0.542 |
+ 0.656 |
+ 0.660 |
+ 0.659 |
+ 0.657 |
+ 0.646 |
+ 0.650 |
+ 0.634 |
+ 0.642 |
+ 0.728 |
+ 0.718 |
+ 0.713 |
+ 0.669 |
+ 0.708 |
+ 0.759 |
+ 0.781 |
+ 0.684 |
+ 0.682 |
+ 0.633 |
+ 0.694 |
+ 0.577 |
+ 0.656 |
+ 0.654 |
+ 0.653 |
+ 0.565 |
+ 0.591 |
+ 0.673 |
+ 0.631 |
+ 0.701 |
+ 0.704 |
+ 0.676 |
+ 0.644 |
+ 0.655 |
+ 0.698 |
+ 0.708 |
+ 0.666 |
+ 0.679 |
+ 0.678 |
+ 0.666 |
+ 0.675 |
+ 0.666 |
+ 0.667 |
+ 0.636 |
+ 0.709 |
+ 0.714 |
+ 0.754 |
+ 0.778 |
+ 0.796 |
+ 0.730 |
+ 0.755 |
+ 0.751 |
+ 0.756 |
+ 0.755 |
+ 0.759 |
+ 0.754 |
+ 0.749 |
+ 0.762 |
+ 0.752 |
+ 0.758 |
+ 0.783 |
+ 0.769 |
+ 1332 |
+ Stability |
+ RBP1_HUMAN |
+ High |
+ Human |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO |
+ 0.694 |
+ 0.683 |
+ 0.725 |
+ 0.728 |
+ 0.728 |
+ 0.723 |
+ 0.660 |
+ 0.662 |
+ 0.736 |
+ 0.740 |
+ 0.757 |
+ 0.669 |
+ 0.695 |
+ 0.685 |
+ 0.692 |
+ 0.784 |
+ 0.792 |
+ 0.787 |
+ 0.783 |
+ 0.739 |
+ 0.628 |
+ 0.616 |
+ 0.668 |
+ 0.710 |
+ 0.659 |
+ 0.759 |
+ 0.749 |
+ 0.727 |
+ 0.767 |
+ 0.744 |
+ 0.787 |
+ 0.752 |
+ 0.519 |
+ 0.638 |
+ 0.657 |
+ 0.660 |
+ 0.717 |
+ 0.721 |
+ 0.715 |
+ 0.742 |
+ 0.743 |
+ 0.735 |
+ 0.655 |
+ 0.644 |
+ 0.727 |
+ 0.666 |
+ 0.744 |
+ 0.757 |
+ 0.801 |
+ 0.773 |
+ 0.802 |
+ 0.809 |
+ 0.816 |
+ 0.814 |
+ 0.806 |
+ 0.812 |
+ 0.809 |
+ 0.808 |
+ 0.811 |
+ 0.814 |
+ 0.799 |
+ 0.762 |
+ 1261 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RCRO_LAMBD_Tsuboyama_2023_1ORC |
+ 0.665 |
+ 0.773 |
+ 0.805 |
+ 0.800 |
+ 0.788 |
+ 0.806 |
+ 0.533 |
+ 0.640 |
+ 0.764 |
+ 0.767 |
+ 0.737 |
+ 0.649 |
+ 0.697 |
+ 0.591 |
+ 0.654 |
+ 0.660 |
+ 0.786 |
+ 0.786 |
+ 0.803 |
+ 0.801 |
+ 0.506 |
+ 0.524 |
+ 0.549 |
+ 0.541 |
+ 0.414 |
+ 0.522 |
+ 0.439 |
+ 0.482 |
+ 0.798 |
+ 0.799 |
+ 0.782 |
+ 0.771 |
+ 0.518 |
+ 0.525 |
+ 0.495 |
+ 0.778 |
+ 0.713 |
+ 0.708 |
+ 0.795 |
+ 0.789 |
+ 0.800 |
+ 0.823 |
+ 0.452 |
+ 0.545 |
+ 0.698 |
+ 0.548 |
+ 0.780 |
+ 0.806 |
+ 0.866 |
+ 0.807 |
+ 0.790 |
+ 0.781 |
+ 0.771 |
+ 0.782 |
+ 0.782 |
+ 0.784 |
+ 0.773 |
+ 0.781 |
+ 0.792 |
+ 0.785 |
+ 0.815 |
+ 0.710 |
+ 2278 |
+ Stability |
+ RCRO_LAMBD |
+ High |
+ Virus |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY |
+ 0.619 |
+ 0.649 |
+ 0.714 |
+ 0.714 |
+ 0.711 |
+ 0.711 |
+ 0.576 |
+ 0.735 |
+ 0.766 |
+ 0.771 |
+ 0.750 |
+ 0.792 |
+ 0.805 |
+ 0.603 |
+ 0.836 |
+ 0.780 |
+ 0.768 |
+ 0.761 |
+ 0.723 |
+ 0.735 |
+ 0.705 |
+ 0.725 |
+ 0.722 |
+ 0.743 |
+ 0.769 |
+ 0.765 |
+ 0.764 |
+ 0.749 |
+ 0.765 |
+ 0.747 |
+ 0.725 |
+ 0.718 |
+ 0.729 |
+ 0.646 |
+ 0.749 |
+ 0.737 |
+ 0.703 |
+ 0.747 |
+ 0.750 |
+ 0.716 |
+ 0.742 |
+ 0.748 |
+ 0.747 |
+ 0.507 |
+ 0.788 |
+ 0.772 |
+ 0.724 |
+ 0.783 |
+ 0.787 |
+ 0.745 |
+ 0.744 |
+ 0.735 |
+ 0.741 |
+ 0.733 |
+ 0.748 |
+ 0.750 |
+ 0.746 |
+ 0.748 |
+ 0.749 |
+ 0.749 |
+ 0.741 |
+ 0.796 |
+ 1019 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RDRP_I33A0_Li_2023 |
+ 0.667 |
+ 0.699 |
+ 0.727 |
+ 0.732 |
+ 0.779 |
+ 0.781 |
+ 0.524 |
+ 0.728 |
+ 0.791 |
+ 0.794 |
+ 0.612 |
+ 0.547 |
+ 0.554 |
+ 0.535 |
+ 0.540 |
+ 0.590 |
+ 0.698 |
+ 0.724 |
+ 0.785 |
+ 0.768 |
+ 0.734 |
+ 0.768 |
+ 0.778 |
+ 0.789 |
+ 0.588 |
+ 0.723 |
+ 0.716 |
+ 0.712 |
+ 0.751 |
+ 0.796 |
+ 0.766 |
+ 0.729 |
+ 0.568 |
+ 0.730 |
+ 0.761 |
+ 0.781 |
+ 0.750 |
+ 0.768 |
+ 0.779 |
+ 0.790 |
+ 0.799 |
+ 0.808 |
+ 0.535 |
+ 0.522 |
+ 0.612 |
+ 0.536 |
+ 0.626 |
+ 0.646 |
+ 0.622 |
+ 0.540 |
+ 0.681 |
+ 0.664 |
+ 0.674 |
+ 0.674 |
+ 0.683 |
+ 0.679 |
+ 0.675 |
+ 0.686 |
+ 0.686 |
+ 0.687 |
+ 0.595 |
+ 0.569 |
+ 12003 |
+ OrganismalFitness |
+ RDRP_I33A0 |
+ Low |
+ Virus |
+
+
+ REV_HV1H2_Fernandes_2016 |
+ 0.596 |
+ 0.574 |
+ 0.603 |
+ 0.606 |
+ 0.603 |
+ 0.603 |
+ 0.521 |
+ 0.646 |
+ 0.603 |
+ 0.607 |
+ 0.566 |
+ 0.616 |
+ 0.624 |
+ 0.526 |
+ 0.522 |
+ 0.585 |
+ 0.613 |
+ 0.638 |
+ 0.629 |
+ 0.575 |
+ 0.600 |
+ 0.623 |
+ 0.617 |
+ 0.608 |
+ 0.635 |
+ 0.637 |
+ 0.576 |
+ 0.617 |
+ 0.623 |
+ 0.630 |
+ 0.667 |
+ 0.672 |
+ 0.525 |
+ 0.608 |
+ 0.627 |
+ 0.613 |
+ 0.614 |
+ 0.625 |
+ 0.610 |
+ 0.616 |
+ 0.623 |
+ 0.611 |
+ 0.529 |
+ 0.533 |
+ 0.599 |
+ 0.537 |
+ 0.626 |
+ 0.612 |
+ 0.639 |
+ 0.604 |
+ 0.591 |
+ 0.596 |
+ 0.633 |
+ 0.622 |
+ 0.624 |
+ 0.625 |
+ 0.633 |
+ 0.610 |
+ 0.649 |
+ 0.632 |
+ 0.633 |
+ 0.608 |
+ 2147 |
+ OrganismalFitness |
+ REV_HV1H2 |
+ Medium |
+ Virus |
+
+
+ RFAH_ECOLI_Tsuboyama_2023_2LCL |
+ 0.507 |
+ 0.596 |
+ 0.607 |
+ 0.618 |
+ 0.618 |
+ 0.616 |
+ 0.487 |
+ 0.620 |
+ 0.613 |
+ 0.627 |
+ 0.622 |
+ 0.571 |
+ 0.585 |
+ 0.469 |
+ 0.529 |
+ 0.511 |
+ 0.649 |
+ 0.637 |
+ 0.632 |
+ 0.635 |
+ 0.454 |
+ 0.519 |
+ 0.559 |
+ 0.553 |
+ 0.464 |
+ 0.565 |
+ 0.577 |
+ 0.552 |
+ 0.576 |
+ 0.610 |
+ 0.636 |
+ 0.610 |
+ 0.500 |
+ 0.498 |
+ 0.558 |
+ 0.576 |
+ 0.543 |
+ 0.572 |
+ 0.583 |
+ 0.610 |
+ 0.610 |
+ 0.611 |
+ 0.547 |
+ 0.447 |
+ 0.585 |
+ 0.540 |
+ 0.649 |
+ 0.622 |
+ 0.689 |
+ 0.658 |
+ 0.668 |
+ 0.657 |
+ 0.649 |
+ 0.668 |
+ 0.660 |
+ 0.655 |
+ 0.660 |
+ 0.656 |
+ 0.668 |
+ 0.663 |
+ 0.652 |
+ 0.593 |
+ 1326 |
+ Stability |
+ RFAH_ECOLI |
+ High |
+ Prokaryote |
+
+
+ RL20_AQUAE_Tsuboyama_2023_1GYZ |
+ 0.688 |
+ 0.851 |
+ 0.867 |
+ 0.866 |
+ 0.866 |
+ 0.870 |
+ 0.593 |
+ 0.855 |
+ 0.718 |
+ 0.681 |
+ 0.886 |
+ 0.885 |
+ 0.875 |
+ 0.591 |
+ 0.706 |
+ 0.753 |
+ 0.908 |
+ 0.899 |
+ 0.907 |
+ 0.879 |
+ 0.587 |
+ 0.798 |
+ 0.786 |
+ 0.780 |
+ 0.774 |
+ 0.789 |
+ 0.805 |
+ 0.818 |
+ 0.861 |
+ 0.865 |
+ 0.843 |
+ 0.832 |
+ 0.531 |
+ 0.774 |
+ 0.803 |
+ 0.790 |
+ 0.812 |
+ 0.836 |
+ 0.832 |
+ 0.852 |
+ 0.875 |
+ 0.875 |
+ 0.484 |
+ 0.434 |
+ 0.769 |
+ 0.622 |
+ 0.856 |
+ 0.858 |
+ 0.936 |
+ 0.906 |
+ 0.917 |
+ 0.917 |
+ 0.915 |
+ 0.929 |
+ 0.923 |
+ 0.919 |
+ 0.924 |
+ 0.924 |
+ 0.925 |
+ 0.927 |
+ 0.930 |
+ 0.872 |
+ 1461 |
+ Stability |
+ RL20_AQUAE |
+ High |
+ Prokaryote |
+
+
+ RL40A_YEAST_Mavor_2016 |
+ 0.654 |
+ 0.683 |
+ 0.685 |
+ 0.713 |
+ 0.695 |
+ 0.708 |
+ 0.556 |
+ 0.728 |
+ 0.744 |
+ 0.746 |
+ 0.623 |
+ 0.653 |
+ 0.674 |
+ 0.547 |
+ 0.738 |
+ 0.769 |
+ 0.788 |
+ 0.744 |
+ 0.797 |
+ 0.730 |
+ 0.706 |
+ 0.789 |
+ 0.774 |
+ 0.745 |
+ 0.775 |
+ 0.750 |
+ 0.751 |
+ 0.739 |
+ 0.724 |
+ 0.672 |
+ 0.700 |
+ 0.705 |
+ 0.532 |
+ 0.715 |
+ 0.757 |
+ 0.709 |
+ 0.726 |
+ 0.760 |
+ 0.726 |
+ 0.738 |
+ 0.759 |
+ 0.721 |
+ 0.653 |
+ 0.518 |
+ 0.653 |
+ 0.668 |
+ 0.553 |
+ 0.587 |
+ 0.627 |
+ 0.503 |
+ 0.767 |
+ 0.755 |
+ 0.776 |
+ 0.767 |
+ 0.778 |
+ 0.765 |
+ 0.762 |
+ 0.777 |
+ 0.770 |
+ 0.781 |
+ 0.697 |
+ 0.698 |
+ 1253 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2013 |
+ 0.660 |
+ 0.695 |
+ 0.708 |
+ 0.742 |
+ 0.714 |
+ 0.728 |
+ 0.530 |
+ 0.684 |
+ 0.767 |
+ 0.769 |
+ 0.614 |
+ 0.654 |
+ 0.692 |
+ 0.540 |
+ 0.730 |
+ 0.773 |
+ 0.814 |
+ 0.753 |
+ 0.806 |
+ 0.748 |
+ 0.709 |
+ 0.799 |
+ 0.777 |
+ 0.755 |
+ 0.783 |
+ 0.759 |
+ 0.769 |
+ 0.754 |
+ 0.739 |
+ 0.696 |
+ 0.745 |
+ 0.756 |
+ 0.536 |
+ 0.714 |
+ 0.775 |
+ 0.716 |
+ 0.731 |
+ 0.778 |
+ 0.734 |
+ 0.750 |
+ 0.777 |
+ 0.737 |
+ 0.680 |
+ 0.522 |
+ 0.659 |
+ 0.660 |
+ 0.542 |
+ 0.567 |
+ 0.628 |
+ 0.511 |
+ 0.802 |
+ 0.787 |
+ 0.806 |
+ 0.793 |
+ 0.804 |
+ 0.797 |
+ 0.787 |
+ 0.813 |
+ 0.803 |
+ 0.814 |
+ 0.698 |
+ 0.710 |
+ 1195 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2014 |
+ 0.731 |
+ 0.684 |
+ 0.730 |
+ 0.753 |
+ 0.720 |
+ 0.743 |
+ 0.596 |
+ 0.761 |
+ 0.745 |
+ 0.759 |
+ 0.595 |
+ 0.660 |
+ 0.676 |
+ 0.628 |
+ 0.816 |
+ 0.813 |
+ 0.828 |
+ 0.765 |
+ 0.791 |
+ 0.744 |
+ 0.714 |
+ 0.796 |
+ 0.776 |
+ 0.764 |
+ 0.781 |
+ 0.752 |
+ 0.748 |
+ 0.723 |
+ 0.719 |
+ 0.733 |
+ 0.714 |
+ 0.719 |
+ 0.616 |
+ 0.731 |
+ 0.778 |
+ 0.730 |
+ 0.757 |
+ 0.794 |
+ 0.767 |
+ 0.765 |
+ 0.783 |
+ 0.756 |
+ 0.663 |
+ 0.576 |
+ 0.659 |
+ 0.672 |
+ 0.630 |
+ 0.640 |
+ 0.685 |
+ 0.588 |
+ 0.816 |
+ 0.796 |
+ 0.802 |
+ 0.830 |
+ 0.828 |
+ 0.825 |
+ 0.801 |
+ 0.811 |
+ 0.800 |
+ 0.828 |
+ 0.690 |
+ 0.746 |
+ 1380 |
+ Activity |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RNC_ECOLI_Weeks_2023 |
+ 0.772 |
+ 0.804 |
+ 0.803 |
+ 0.809 |
+ 0.812 |
+ 0.813 |
+ 0.531 |
+ 0.719 |
+ 0.808 |
+ 0.807 |
+ 0.806 |
+ 0.803 |
+ 0.810 |
+ 0.535 |
+ 0.782 |
+ 0.801 |
+ 0.809 |
+ 0.807 |
+ 0.810 |
+ 0.808 |
+ 0.779 |
+ 0.794 |
+ 0.765 |
+ 0.755 |
+ 0.786 |
+ 0.802 |
+ 0.802 |
+ 0.801 |
+ 0.795 |
+ 0.809 |
+ 0.799 |
+ 0.776 |
+ 0.584 |
+ 0.770 |
+ 0.777 |
+ 0.718 |
+ 0.794 |
+ 0.801 |
+ 0.777 |
+ 0.818 |
+ 0.819 |
+ 0.815 |
+ 0.769 |
+ 0.534 |
+ 0.806 |
+ 0.792 |
+ 0.657 |
+ 0.777 |
+ 0.650 |
+ 0.597 |
+ 0.787 |
+ 0.781 |
+ 0.778 |
+ 0.792 |
+ 0.787 |
+ 0.782 |
+ 0.786 |
+ 0.786 |
+ 0.787 |
+ 0.793 |
+ 0.815 |
+ 0.772 |
+ 4277 |
+ Activity |
+ RNC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69 |
+ 0.818 |
+ 0.857 |
+ 0.864 |
+ 0.875 |
+ 0.833 |
+ 0.856 |
+ 0.841 |
+ 0.829 |
+ 0.850 |
+ 0.870 |
+ 0.891 |
+ 0.898 |
+ 0.902 |
+ 0.876 |
+ 0.890 |
+ 0.901 |
+ 0.894 |
+ 0.880 |
+ 0.850 |
+ 0.884 |
+ 0.865 |
+ 0.886 |
+ 0.889 |
+ 0.871 |
+ 0.882 |
+ 0.894 |
+ 0.890 |
+ 0.891 |
+ 0.870 |
+ 0.897 |
+ 0.869 |
+ 0.834 |
+ 0.836 |
+ 0.826 |
+ 0.881 |
+ 0.881 |
+ 0.874 |
+ 0.888 |
+ 0.887 |
+ 0.873 |
+ 0.873 |
+ 0.869 |
+ 0.888 |
+ 0.854 |
+ 0.898 |
+ 0.887 |
+ 0.875 |
+ 0.847 |
+ 0.925 |
+ 0.889 |
+ 0.879 |
+ 0.872 |
+ 0.880 |
+ 0.887 |
+ 0.884 |
+ 0.881 |
+ 0.890 |
+ 0.886 |
+ 0.892 |
+ 0.890 |
+ 0.913 |
+ 0.906 |
+ 1459 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_high-expression |
+ 0.707 |
+ 0.796 |
+ 0.837 |
+ 0.849 |
+ 0.824 |
+ 0.826 |
+ 0.670 |
+ 0.816 |
+ 0.833 |
+ 0.857 |
+ 0.866 |
+ 0.855 |
+ 0.879 |
+ 0.721 |
+ 0.718 |
+ 0.794 |
+ 0.898 |
+ 0.902 |
+ 0.904 |
+ 0.815 |
+ 0.656 |
+ 0.744 |
+ 0.790 |
+ 0.820 |
+ 0.674 |
+ 0.777 |
+ 0.764 |
+ 0.746 |
+ 0.894 |
+ 0.811 |
+ 0.934 |
+ 0.902 |
+ 0.586 |
+ 0.625 |
+ 0.764 |
+ 0.860 |
+ 0.706 |
+ 0.769 |
+ 0.857 |
+ 0.810 |
+ 0.832 |
+ 0.866 |
+ 0.731 |
+ 0.735 |
+ 0.804 |
+ 0.743 |
+ 0.671 |
+ 0.774 |
+ 0.761 |
+ 0.626 |
+ 0.898 |
+ 0.873 |
+ 0.872 |
+ 0.897 |
+ 0.887 |
+ 0.898 |
+ 0.883 |
+ 0.879 |
+ 0.873 |
+ 0.897 |
+ 0.878 |
+ 0.789 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_low-expression |
+ 0.612 |
+ 0.699 |
+ 0.736 |
+ 0.746 |
+ 0.724 |
+ 0.718 |
+ 0.563 |
+ 0.757 |
+ 0.716 |
+ 0.736 |
+ 0.755 |
+ 0.728 |
+ 0.736 |
+ 0.619 |
+ 0.620 |
+ 0.671 |
+ 0.761 |
+ 0.803 |
+ 0.808 |
+ 0.700 |
+ 0.605 |
+ 0.678 |
+ 0.680 |
+ 0.713 |
+ 0.583 |
+ 0.686 |
+ 0.661 |
+ 0.653 |
+ 0.802 |
+ 0.712 |
+ 0.829 |
+ 0.817 |
+ 0.540 |
+ 0.563 |
+ 0.663 |
+ 0.738 |
+ 0.610 |
+ 0.657 |
+ 0.727 |
+ 0.702 |
+ 0.716 |
+ 0.743 |
+ 0.636 |
+ 0.621 |
+ 0.670 |
+ 0.641 |
+ 0.634 |
+ 0.681 |
+ 0.700 |
+ 0.636 |
+ 0.769 |
+ 0.752 |
+ 0.767 |
+ 0.777 |
+ 0.758 |
+ 0.772 |
+ 0.776 |
+ 0.767 |
+ 0.758 |
+ 0.776 |
+ 0.765 |
+ 0.653 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32 |
+ 0.698 |
+ 0.672 |
+ 0.685 |
+ 0.682 |
+ 0.672 |
+ 0.675 |
+ 0.587 |
+ 0.620 |
+ 0.710 |
+ 0.710 |
+ 0.734 |
+ 0.709 |
+ 0.701 |
+ 0.636 |
+ 0.657 |
+ 0.758 |
+ 0.714 |
+ 0.702 |
+ 0.670 |
+ 0.663 |
+ 0.601 |
+ 0.652 |
+ 0.667 |
+ 0.658 |
+ 0.657 |
+ 0.672 |
+ 0.700 |
+ 0.679 |
+ 0.686 |
+ 0.701 |
+ 0.726 |
+ 0.723 |
+ 0.548 |
+ 0.711 |
+ 0.659 |
+ 0.662 |
+ 0.725 |
+ 0.687 |
+ 0.690 |
+ 0.703 |
+ 0.679 |
+ 0.673 |
+ 0.665 |
+ 0.620 |
+ 0.719 |
+ 0.683 |
+ 0.801 |
+ 0.723 |
+ 0.817 |
+ 0.768 |
+ 0.710 |
+ 0.709 |
+ 0.715 |
+ 0.713 |
+ 0.709 |
+ 0.708 |
+ 0.719 |
+ 0.726 |
+ 0.710 |
+ 0.719 |
+ 0.720 |
+ 0.821 |
+ 1195 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance |
+ 0.775 |
+ 0.791 |
+ 0.802 |
+ 0.806 |
+ 0.813 |
+ 0.820 |
+ 0.709 |
+ 0.779 |
+ 0.831 |
+ 0.831 |
+ 0.822 |
+ 0.835 |
+ 0.857 |
+ 0.766 |
+ 0.791 |
+ 0.802 |
+ 0.837 |
+ 0.826 |
+ 0.812 |
+ 0.520 |
+ 0.767 |
+ 0.823 |
+ 0.837 |
+ 0.826 |
+ 0.786 |
+ 0.839 |
+ 0.848 |
+ 0.830 |
+ 0.810 |
+ 0.819 |
+ 0.810 |
+ 0.741 |
+ 0.682 |
+ 0.785 |
+ 0.828 |
+ 0.825 |
+ 0.806 |
+ 0.843 |
+ 0.845 |
+ 0.834 |
+ 0.844 |
+ 0.843 |
+ 0.771 |
+ 0.651 |
+ 0.814 |
+ 0.802 |
+ 0.733 |
+ 0.789 |
+ 0.788 |
+ 0.594 |
+ 0.813 |
+ 0.806 |
+ 0.807 |
+ 0.817 |
+ 0.827 |
+ 0.821 |
+ 0.819 |
+ 0.822 |
+ 0.817 |
+ 0.827 |
+ 0.837 |
+ 0.812 |
+ 9803 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity |
+ 0.773 |
+ 0.810 |
+ 0.835 |
+ 0.843 |
+ 0.838 |
+ 0.845 |
+ 0.714 |
+ 0.821 |
+ 0.844 |
+ 0.843 |
+ 0.837 |
+ 0.848 |
+ 0.877 |
+ 0.769 |
+ 0.789 |
+ 0.789 |
+ 0.854 |
+ 0.833 |
+ 0.822 |
+ 0.521 |
+ 0.793 |
+ 0.854 |
+ 0.861 |
+ 0.849 |
+ 0.815 |
+ 0.860 |
+ 0.865 |
+ 0.854 |
+ 0.831 |
+ 0.852 |
+ 0.856 |
+ 0.802 |
+ 0.691 |
+ 0.805 |
+ 0.858 |
+ 0.846 |
+ 0.817 |
+ 0.860 |
+ 0.857 |
+ 0.856 |
+ 0.870 |
+ 0.866 |
+ 0.775 |
+ 0.636 |
+ 0.837 |
+ 0.800 |
+ 0.728 |
+ 0.805 |
+ 0.788 |
+ 0.587 |
+ 0.828 |
+ 0.812 |
+ 0.809 |
+ 0.828 |
+ 0.837 |
+ 0.826 |
+ 0.827 |
+ 0.837 |
+ 0.837 |
+ 0.839 |
+ 0.828 |
+ 0.804 |
+ 10094 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB |
+ 0.537 |
+ 0.586 |
+ 0.693 |
+ 0.677 |
+ 0.692 |
+ 0.687 |
+ 0.702 |
+ 0.788 |
+ 0.693 |
+ 0.701 |
+ 0.762 |
+ 0.765 |
+ 0.778 |
+ 0.593 |
+ 0.739 |
+ 0.757 |
+ 0.765 |
+ 0.786 |
+ 0.732 |
+ 0.723 |
+ 0.771 |
+ 0.756 |
+ 0.750 |
+ 0.766 |
+ 0.776 |
+ 0.786 |
+ 0.768 |
+ 0.771 |
+ 0.769 |
+ 0.785 |
+ 0.764 |
+ 0.733 |
+ 0.703 |
+ 0.736 |
+ 0.778 |
+ 0.773 |
+ 0.714 |
+ 0.758 |
+ 0.746 |
+ 0.698 |
+ 0.719 |
+ 0.719 |
+ 0.768 |
+ 0.459 |
+ 0.787 |
+ 0.774 |
+ 0.647 |
+ 0.658 |
+ 0.709 |
+ 0.767 |
+ 0.752 |
+ 0.693 |
+ 0.638 |
+ 0.704 |
+ 0.699 |
+ 0.716 |
+ 0.713 |
+ 0.742 |
+ 0.735 |
+ 0.714 |
+ 0.803 |
+ 0.813 |
+ 965 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SBI_STAAM_Tsuboyama_2023_2JVG |
+ 0.606 |
+ 0.599 |
+ 0.685 |
+ 0.666 |
+ 0.714 |
+ 0.699 |
+ 0.607 |
+ 0.603 |
+ 0.737 |
+ 0.763 |
+ 0.661 |
+ 0.626 |
+ 0.642 |
+ 0.614 |
+ 0.629 |
+ 0.687 |
+ 0.789 |
+ 0.833 |
+ 0.674 |
+ 0.680 |
+ 0.610 |
+ 0.630 |
+ 0.640 |
+ 0.639 |
+ 0.607 |
+ 0.619 |
+ 0.611 |
+ 0.629 |
+ 0.688 |
+ 0.789 |
+ 0.798 |
+ 0.750 |
+ 0.602 |
+ 0.589 |
+ 0.598 |
+ 0.606 |
+ 0.641 |
+ 0.646 |
+ 0.648 |
+ 0.674 |
+ 0.681 |
+ 0.685 |
+ 0.629 |
+ 0.608 |
+ 0.659 |
+ 0.655 |
+ 0.836 |
+ 0.828 |
+ 0.854 |
+ 0.812 |
+ 0.768 |
+ 0.765 |
+ 0.782 |
+ 0.784 |
+ 0.781 |
+ 0.782 |
+ 0.794 |
+ 0.790 |
+ 0.785 |
+ 0.787 |
+ 0.848 |
+ 0.785 |
+ 1025 |
+ Stability |
+ SBI_STAAM |
+ Medium |
+ Prokaryote |
+
+
+ SC6A4_HUMAN_Young_2021 |
+ 0.710 |
+ 0.745 |
+ 0.728 |
+ 0.734 |
+ 0.771 |
+ 0.780 |
+ 0.686 |
+ 0.766 |
+ 0.800 |
+ 0.805 |
+ 0.788 |
+ 0.782 |
+ 0.787 |
+ 0.588 |
+ 0.675 |
+ 0.770 |
+ 0.792 |
+ 0.793 |
+ 0.780 |
+ 0.795 |
+ 0.764 |
+ 0.771 |
+ 0.762 |
+ 0.761 |
+ 0.771 |
+ 0.774 |
+ 0.773 |
+ 0.776 |
+ 0.771 |
+ 0.788 |
+ 0.745 |
+ 0.693 |
+ 0.683 |
+ 0.772 |
+ 0.770 |
+ 0.759 |
+ 0.783 |
+ 0.788 |
+ 0.781 |
+ 0.793 |
+ 0.796 |
+ 0.791 |
+ 0.762 |
+ 0.564 |
+ 0.790 |
+ 0.782 |
+ 0.746 |
+ 0.778 |
+ 0.762 |
+ 0.598 |
+ 0.778 |
+ 0.783 |
+ 0.788 |
+ 0.791 |
+ 0.793 |
+ 0.788 |
+ 0.788 |
+ 0.790 |
+ 0.783 |
+ 0.793 |
+ 0.808 |
+ 0.782 |
+ 11576 |
+ Activity |
+ SC6A4_HUMAN |
+ Medium |
+ Human |
+
+
+ SCIN_STAAR_Tsuboyama_2023_2QFF |
+ 0.535 |
+ 0.560 |
+ 0.633 |
+ 0.630 |
+ 0.654 |
+ 0.641 |
+ 0.552 |
+ 0.530 |
+ 0.632 |
+ 0.630 |
+ 0.620 |
+ 0.618 |
+ 0.607 |
+ 0.585 |
+ 0.625 |
+ 0.613 |
+ 0.638 |
+ 0.670 |
+ 0.675 |
+ 0.634 |
+ 0.538 |
+ 0.563 |
+ 0.549 |
+ 0.555 |
+ 0.564 |
+ 0.576 |
+ 0.560 |
+ 0.578 |
+ 0.585 |
+ 0.629 |
+ 0.638 |
+ 0.602 |
+ 0.490 |
+ 0.522 |
+ 0.534 |
+ 0.565 |
+ 0.566 |
+ 0.564 |
+ 0.574 |
+ 0.626 |
+ 0.620 |
+ 0.628 |
+ 0.592 |
+ 0.566 |
+ 0.623 |
+ 0.607 |
+ 0.763 |
+ 0.758 |
+ 0.763 |
+ 0.733 |
+ 0.692 |
+ 0.699 |
+ 0.699 |
+ 0.720 |
+ 0.686 |
+ 0.715 |
+ 0.689 |
+ 0.688 |
+ 0.667 |
+ 0.702 |
+ 0.797 |
+ 0.733 |
+ 1212 |
+ Stability |
+ SCIN_STAAR |
+ High |
+ Prokaryote |
+
+
+ SCN5A_HUMAN_Glazer_2019 |
+ 0.537 |
+ 0.543 |
+ 0.558 |
+ 0.558 |
+ 0.553 |
+ 0.559 |
+ 0.548 |
+ 0.510 |
+ 0.569 |
+ 0.583 |
+ 0.589 |
+ 0.613 |
+ 0.567 |
+ 0.623 |
+ 0.591 |
+ 0.582 |
+ 0.594 |
+ 0.569 |
+ 0.554 |
+ 0.579 |
+ 0.546 |
+ 0.557 |
+ 0.567 |
+ 0.567 |
+ 0.536 |
+ 0.547 |
+ 0.548 |
+ 0.543 |
+ 0.580 |
+ 0.545 |
+ 0.588 |
+ 0.582 |
+ 0.532 |
+ 0.534 |
+ 0.525 |
+ 0.532 |
+ 0.537 |
+ 0.530 |
+ 0.529 |
+ 0.553 |
+ 0.552 |
+ 0.556 |
+ 0.542 |
+ 0.585 |
+ 0.574 |
+ 0.585 |
+ 0.529 |
+ 0.561 |
+ 0.527 |
+ 0.504 |
+ 0.573 |
+ 0.564 |
+ 0.574 |
+ 0.563 |
+ 0.543 |
+ 0.566 |
+ 0.578 |
+ 0.572 |
+ 0.575 |
+ 0.570 |
+ 0.608 |
+ 0.574 |
+ 224 |
+ OrganismalFitness |
+ SCN5A_HUMAN |
+ Medium |
+ Human |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0 |
+ 0.944 |
+ 0.971 |
+ 0.971 |
+ 0.969 |
+ 0.973 |
+ 0.973 |
+ 0.590 |
+ 0.860 |
+ 0.959 |
+ 0.962 |
+ 0.969 |
+ 0.939 |
+ 0.952 |
+ 0.720 |
+ 0.734 |
+ 0.947 |
+ 0.955 |
+ 0.952 |
+ 0.957 |
+ 0.954 |
+ 0.628 |
+ 0.652 |
+ 0.652 |
+ 0.770 |
+ 0.643 |
+ 0.799 |
+ 0.565 |
+ 0.776 |
+ 0.862 |
+ 0.967 |
+ 0.965 |
+ 0.962 |
+ 0.353 |
+ 0.579 |
+ 0.738 |
+ 0.808 |
+ 0.934 |
+ 0.936 |
+ 0.908 |
+ 0.962 |
+ 0.960 |
+ 0.947 |
+ 0.576 |
+ 0.497 |
+ 0.818 |
+ 0.562 |
+ 0.841 |
+ 0.827 |
+ 0.961 |
+ 0.952 |
+ 0.973 |
+ 0.970 |
+ 0.969 |
+ 0.974 |
+ 0.972 |
+ 0.972 |
+ 0.974 |
+ 0.971 |
+ 0.975 |
+ 0.974 |
+ 0.961 |
+ 0.967 |
+ 2770 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SERC_HUMAN_Xie_2023 |
+ 0.678 |
+ 0.722 |
+ 0.759 |
+ 0.759 |
+ 0.745 |
+ 0.750 |
+ 0.509 |
+ 0.715 |
+ 0.774 |
+ 0.778 |
+ 0.747 |
+ 0.757 |
+ 0.760 |
+ 0.587 |
+ 0.692 |
+ 0.754 |
+ 0.765 |
+ 0.776 |
+ 0.760 |
+ 0.755 |
+ 0.736 |
+ 0.744 |
+ 0.746 |
+ 0.735 |
+ 0.738 |
+ 0.755 |
+ 0.747 |
+ 0.755 |
+ 0.745 |
+ 0.744 |
+ 0.753 |
+ 0.732 |
+ 0.588 |
+ 0.747 |
+ 0.753 |
+ 0.749 |
+ 0.748 |
+ 0.754 |
+ 0.757 |
+ 0.757 |
+ 0.759 |
+ 0.759 |
+ 0.651 |
+ 0.519 |
+ 0.759 |
+ 0.725 |
+ 0.696 |
+ 0.735 |
+ 0.681 |
+ 0.593 |
+ 0.762 |
+ 0.756 |
+ 0.764 |
+ 0.768 |
+ 0.771 |
+ 0.767 |
+ 0.767 |
+ 0.766 |
+ 0.765 |
+ 0.772 |
+ 0.765 |
+ 0.727 |
+ 1914 |
+ OrganismalFitness |
+ SERC_HUMAN |
+ High |
+ Human |
+
+
+ SHOC2_HUMAN_Kwon_2022 |
+ 0.599 |
+ 0.664 |
+ 0.689 |
+ 0.690 |
+ 0.685 |
+ 0.690 |
+ 0.606 |
+ 0.679 |
+ 0.711 |
+ 0.711 |
+ 0.696 |
+ 0.689 |
+ 0.702 |
+ 0.613 |
+ 0.618 |
+ 0.627 |
+ 0.711 |
+ 0.695 |
+ 0.648 |
+ 0.694 |
+ 0.624 |
+ 0.679 |
+ 0.693 |
+ 0.677 |
+ 0.626 |
+ 0.693 |
+ 0.692 |
+ 0.698 |
+ 0.678 |
+ 0.704 |
+ 0.704 |
+ 0.677 |
+ 0.574 |
+ 0.621 |
+ 0.651 |
+ 0.687 |
+ 0.627 |
+ 0.648 |
+ 0.682 |
+ 0.682 |
+ 0.682 |
+ 0.698 |
+ 0.623 |
+ 0.611 |
+ 0.713 |
+ 0.639 |
+ 0.650 |
+ 0.681 |
+ 0.638 |
+ 0.545 |
+ 0.685 |
+ 0.682 |
+ 0.691 |
+ 0.693 |
+ 0.697 |
+ 0.699 |
+ 0.689 |
+ 0.695 |
+ 0.690 |
+ 0.697 |
+ 0.656 |
+ 0.639 |
+ 10972 |
+ OrganismalFitness |
+ SHOC2_HUMAN |
+ Medium |
+ Human |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK |
+ 0.592 |
+ 0.618 |
+ 0.628 |
+ 0.627 |
+ 0.639 |
+ 0.634 |
+ 0.694 |
+ 0.584 |
+ 0.596 |
+ 0.613 |
+ 0.620 |
+ 0.659 |
+ 0.663 |
+ 0.644 |
+ 0.687 |
+ 0.661 |
+ 0.660 |
+ 0.625 |
+ 0.624 |
+ 0.615 |
+ 0.576 |
+ 0.587 |
+ 0.586 |
+ 0.599 |
+ 0.631 |
+ 0.632 |
+ 0.596 |
+ 0.626 |
+ 0.599 |
+ 0.611 |
+ 0.535 |
+ 0.517 |
+ 0.609 |
+ 0.561 |
+ 0.630 |
+ 0.620 |
+ 0.593 |
+ 0.609 |
+ 0.615 |
+ 0.635 |
+ 0.631 |
+ 0.631 |
+ 0.630 |
+ 0.572 |
+ 0.598 |
+ 0.671 |
+ 0.756 |
+ 0.671 |
+ 0.768 |
+ 0.724 |
+ 0.674 |
+ 0.691 |
+ 0.674 |
+ 0.672 |
+ 0.668 |
+ 0.693 |
+ 0.670 |
+ 0.671 |
+ 0.665 |
+ 0.680 |
+ 0.664 |
+ 0.737 |
+ 1010 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPA_STAAU_Tsuboyama_2023_1LP1 |
+ 0.748 |
+ 0.788 |
+ 0.799 |
+ 0.788 |
+ 0.822 |
+ 0.822 |
+ 0.411 |
+ 0.796 |
+ 0.744 |
+ 0.744 |
+ 0.700 |
+ 0.434 |
+ 0.456 |
+ 0.436 |
+ 0.438 |
+ 0.427 |
+ 0.457 |
+ 0.447 |
+ 0.468 |
+ 0.794 |
+ 0.386 |
+ 0.480 |
+ 0.586 |
+ 0.546 |
+ 0.428 |
+ 0.447 |
+ 0.504 |
+ 0.674 |
+ 0.733 |
+ 0.811 |
+ 0.736 |
+ 0.750 |
+ 0.476 |
+ 0.461 |
+ 0.471 |
+ 0.505 |
+ 0.744 |
+ 0.738 |
+ 0.737 |
+ 0.820 |
+ 0.818 |
+ 0.812 |
+ 0.387 |
+ 0.418 |
+ 0.411 |
+ 0.445 |
+ 0.753 |
+ 0.720 |
+ 0.871 |
+ 0.785 |
+ 0.761 |
+ 0.756 |
+ 0.743 |
+ 0.766 |
+ 0.753 |
+ 0.755 |
+ 0.750 |
+ 0.745 |
+ 0.757 |
+ 0.755 |
+ 0.768 |
+ 0.687 |
+ 2105 |
+ Stability |
+ SPA_STAAU |
+ Medium |
+ Prokaryote |
+
+
+ SPG1_STRSG_Olson_2014 |
+ 0.608 |
+ 0.634 |
+ 0.497 |
+ 0.507 |
+ 0.618 |
+ 0.627 |
+ 0.489 |
+ 0.529 |
+ 0.451 |
+ 0.555 |
+ 0.671 |
+ 0.629 |
+ 0.606 |
+ 0.650 |
+ 0.626 |
+ 0.652 |
+ 0.680 |
+ 0.649 |
+ 0.688 |
+ 0.661 |
+ 0.637 |
+ 0.616 |
+ 0.619 |
+ 0.612 |
+ 0.631 |
+ 0.623 |
+ 0.614 |
+ 0.626 |
+ 0.692 |
+ 0.634 |
+ 0.754 |
+ 0.746 |
+ 0.536 |
+ 0.629 |
+ 0.599 |
+ 0.650 |
+ 0.636 |
+ 0.611 |
+ 0.654 |
+ 0.631 |
+ 0.605 |
+ 0.656 |
+ 0.473 |
+ 0.436 |
+ 0.624 |
+ 0.531 |
+ 0.682 |
+ 0.644 |
+ 0.711 |
+ 0.567 |
+ 0.717 |
+ 0.696 |
+ 0.709 |
+ 0.733 |
+ 0.733 |
+ 0.716 |
+ 0.745 |
+ 0.754 |
+ 0.744 |
+ 0.739 |
+ 0.716 |
+ 0.708 |
+ 536962 |
+ Binding |
+ SPG1_STRSG |
+ Low |
+ Prokaryote |
+
+
+ SPG1_STRSG_Wu_2016 |
+ 0.517 |
+ 0.605 |
+ 0.612 |
+ 0.621 |
+ 0.614 |
+ 0.622 |
+ 0.604 |
+ 0.541 |
+ 0.723 |
+ 0.715 |
+ 0.693 |
+ 0.677 |
+ 0.662 |
+ 0.645 |
+ 0.684 |
+ 0.692 |
+ 0.708 |
+ 0.738 |
+ 0.761 |
+ 0.653 |
+ 0.609 |
+ 0.589 |
+ 0.580 |
+ 0.607 |
+ 0.573 |
+ 0.643 |
+ 0.622 |
+ 0.600 |
+ 0.600 |
+ 0.631 |
+ 0.702 |
+ 0.664 |
+ 0.493 |
+ 0.619 |
+ 0.567 |
+ 0.605 |
+ 0.625 |
+ 0.611 |
+ 0.627 |
+ 0.640 |
+ 0.630 |
+ 0.633 |
+ 0.572 |
+ 0.508 |
+ 0.617 |
+ 0.561 |
+ 0.604 |
+ 0.594 |
+ 0.725 |
+ 0.636 |
+ 0.718 |
+ 0.700 |
+ 0.696 |
+ 0.730 |
+ 0.699 |
+ 0.693 |
+ 0.732 |
+ 0.711 |
+ 0.734 |
+ 0.715 |
+ 0.730 |
+ 0.680 |
+ 149360 |
+ Binding |
+ SPG1_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS |
+ 0.767 |
+ 0.811 |
+ 0.804 |
+ 0.823 |
+ 0.840 |
+ 0.839 |
+ 0.704 |
+ 0.705 |
+ 0.818 |
+ 0.793 |
+ 0.785 |
+ 0.752 |
+ 0.744 |
+ 0.719 |
+ 0.756 |
+ 0.769 |
+ 0.792 |
+ 0.789 |
+ 0.810 |
+ 0.800 |
+ 0.736 |
+ 0.757 |
+ 0.744 |
+ 0.739 |
+ 0.705 |
+ 0.771 |
+ 0.712 |
+ 0.772 |
+ 0.796 |
+ 0.866 |
+ 0.842 |
+ 0.835 |
+ 0.416 |
+ 0.722 |
+ 0.766 |
+ 0.778 |
+ 0.697 |
+ 0.718 |
+ 0.743 |
+ 0.833 |
+ 0.836 |
+ 0.842 |
+ 0.647 |
+ 0.564 |
+ 0.660 |
+ 0.642 |
+ 0.761 |
+ 0.715 |
+ 0.902 |
+ 0.837 |
+ 0.838 |
+ 0.839 |
+ 0.851 |
+ 0.850 |
+ 0.835 |
+ 0.853 |
+ 0.848 |
+ 0.861 |
+ 0.847 |
+ 0.850 |
+ 0.853 |
+ 0.827 |
+ 1451 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPIKE_SARS2_Starr_2020_binding |
+ 0.570 |
+ 0.595 |
+ 0.531 |
+ 0.576 |
+ 0.624 |
+ 0.628 |
+ 0.454 |
+ 0.682 |
+ 0.684 |
+ 0.686 |
+ 0.475 |
+ 0.445 |
+ 0.459 |
+ 0.454 |
+ 0.460 |
+ 0.470 |
+ 0.471 |
+ 0.477 |
+ 0.519 |
+ 0.659 |
+ 0.644 |
+ 0.671 |
+ 0.665 |
+ 0.671 |
+ 0.684 |
+ 0.655 |
+ 0.641 |
+ 0.677 |
+ 0.644 |
+ 0.625 |
+ 0.648 |
+ 0.670 |
+ 0.592 |
+ 0.653 |
+ 0.652 |
+ 0.673 |
+ 0.649 |
+ 0.649 |
+ 0.649 |
+ 0.665 |
+ 0.662 |
+ 0.666 |
+ 0.464 |
+ 0.479 |
+ 0.467 |
+ 0.474 |
+ 0.766 |
+ 0.748 |
+ 0.702 |
+ 0.584 |
+ 0.602 |
+ 0.606 |
+ 0.602 |
+ 0.638 |
+ 0.613 |
+ 0.637 |
+ 0.619 |
+ 0.630 |
+ 0.599 |
+ 0.627 |
+ 0.656 |
+ 0.560 |
+ 3802 |
+ Binding |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPIKE_SARS2_Starr_2020_expression |
+ 0.607 |
+ 0.664 |
+ 0.592 |
+ 0.651 |
+ 0.743 |
+ 0.741 |
+ 0.476 |
+ 0.682 |
+ 0.721 |
+ 0.735 |
+ 0.511 |
+ 0.469 |
+ 0.484 |
+ 0.473 |
+ 0.481 |
+ 0.492 |
+ 0.493 |
+ 0.502 |
+ 0.538 |
+ 0.680 |
+ 0.662 |
+ 0.693 |
+ 0.694 |
+ 0.705 |
+ 0.696 |
+ 0.689 |
+ 0.666 |
+ 0.703 |
+ 0.666 |
+ 0.694 |
+ 0.654 |
+ 0.675 |
+ 0.589 |
+ 0.672 |
+ 0.677 |
+ 0.690 |
+ 0.681 |
+ 0.686 |
+ 0.685 |
+ 0.732 |
+ 0.737 |
+ 0.746 |
+ 0.490 |
+ 0.497 |
+ 0.494 |
+ 0.502 |
+ 0.782 |
+ 0.770 |
+ 0.736 |
+ 0.597 |
+ 0.658 |
+ 0.668 |
+ 0.656 |
+ 0.705 |
+ 0.680 |
+ 0.699 |
+ 0.681 |
+ 0.691 |
+ 0.671 |
+ 0.694 |
+ 0.735 |
+ 0.621 |
+ 3798 |
+ Expression |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD |
+ 0.794 |
+ 0.805 |
+ 0.810 |
+ 0.778 |
+ 0.798 |
+ 0.787 |
+ 0.605 |
+ 0.745 |
+ 0.766 |
+ 0.788 |
+ 0.797 |
+ 0.816 |
+ 0.807 |
+ 0.444 |
+ 0.797 |
+ 0.801 |
+ 0.796 |
+ 0.857 |
+ 0.822 |
+ 0.788 |
+ 0.782 |
+ 0.808 |
+ 0.789 |
+ 0.782 |
+ 0.786 |
+ 0.789 |
+ 0.814 |
+ 0.767 |
+ 0.778 |
+ 0.787 |
+ 0.791 |
+ 0.792 |
+ 0.711 |
+ 0.746 |
+ 0.777 |
+ 0.766 |
+ 0.808 |
+ 0.813 |
+ 0.804 |
+ 0.792 |
+ 0.795 |
+ 0.791 |
+ 0.736 |
+ 0.402 |
+ 0.723 |
+ 0.736 |
+ 0.702 |
+ 0.693 |
+ 0.809 |
+ 0.802 |
+ 0.824 |
+ 0.819 |
+ 0.825 |
+ 0.822 |
+ 0.828 |
+ 0.824 |
+ 0.818 |
+ 0.816 |
+ 0.825 |
+ 0.825 |
+ 0.830 |
+ 0.767 |
+ 3201 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU |
+ 0.678 |
+ 0.750 |
+ 0.781 |
+ 0.788 |
+ 0.785 |
+ 0.787 |
+ 0.529 |
+ 0.686 |
+ 0.768 |
+ 0.782 |
+ 0.729 |
+ 0.654 |
+ 0.714 |
+ 0.574 |
+ 0.708 |
+ 0.745 |
+ 0.804 |
+ 0.815 |
+ 0.753 |
+ 0.796 |
+ 0.590 |
+ 0.701 |
+ 0.715 |
+ 0.718 |
+ 0.640 |
+ 0.743 |
+ 0.760 |
+ 0.724 |
+ 0.741 |
+ 0.809 |
+ 0.786 |
+ 0.770 |
+ 0.576 |
+ 0.571 |
+ 0.696 |
+ 0.735 |
+ 0.727 |
+ 0.741 |
+ 0.768 |
+ 0.791 |
+ 0.780 |
+ 0.791 |
+ 0.582 |
+ 0.538 |
+ 0.779 |
+ 0.771 |
+ 0.832 |
+ 0.816 |
+ 0.813 |
+ 0.823 |
+ 0.815 |
+ 0.786 |
+ 0.818 |
+ 0.811 |
+ 0.813 |
+ 0.812 |
+ 0.816 |
+ 0.804 |
+ 0.798 |
+ 0.818 |
+ 0.824 |
+ 0.725 |
+ 707 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88 |
+ 0.820 |
+ 0.856 |
+ 0.869 |
+ 0.869 |
+ 0.873 |
+ 0.870 |
+ 0.382 |
+ 0.779 |
+ 0.844 |
+ 0.831 |
+ 0.878 |
+ 0.842 |
+ 0.851 |
+ 0.347 |
+ 0.853 |
+ 0.878 |
+ 0.863 |
+ 0.877 |
+ 0.864 |
+ 0.843 |
+ 0.419 |
+ 0.682 |
+ 0.687 |
+ 0.762 |
+ 0.760 |
+ 0.801 |
+ 0.825 |
+ 0.819 |
+ 0.823 |
+ 0.865 |
+ 0.866 |
+ 0.848 |
+ 0.686 |
+ 0.504 |
+ 0.575 |
+ 0.678 |
+ 0.853 |
+ 0.851 |
+ 0.855 |
+ 0.868 |
+ 0.868 |
+ 0.865 |
+ 0.794 |
+ 0.321 |
+ 0.871 |
+ 0.825 |
+ 0.567 |
+ 0.810 |
+ 0.859 |
+ 0.864 |
+ 0.867 |
+ 0.864 |
+ 0.861 |
+ 0.874 |
+ 0.878 |
+ 0.867 |
+ 0.862 |
+ 0.862 |
+ 0.870 |
+ 0.870 |
+ 0.917 |
+ 0.891 |
+ 1583 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W |
+ 0.711 |
+ 0.786 |
+ 0.866 |
+ 0.866 |
+ 0.886 |
+ 0.883 |
+ 0.788 |
+ 0.697 |
+ 0.893 |
+ 0.893 |
+ 0.837 |
+ 0.859 |
+ 0.876 |
+ 0.767 |
+ 0.867 |
+ 0.881 |
+ 0.888 |
+ 0.842 |
+ 0.867 |
+ 0.852 |
+ 0.866 |
+ 0.846 |
+ 0.853 |
+ 0.839 |
+ 0.845 |
+ 0.842 |
+ 0.849 |
+ 0.841 |
+ 0.832 |
+ 0.864 |
+ 0.887 |
+ 0.868 |
+ 0.809 |
+ 0.825 |
+ 0.844 |
+ 0.841 |
+ 0.858 |
+ 0.875 |
+ 0.875 |
+ 0.878 |
+ 0.889 |
+ 0.887 |
+ 0.690 |
+ 0.574 |
+ 0.786 |
+ 0.769 |
+ 0.786 |
+ 0.778 |
+ 0.823 |
+ 0.842 |
+ 0.896 |
+ 0.886 |
+ 0.892 |
+ 0.889 |
+ 0.897 |
+ 0.894 |
+ 0.896 |
+ 0.889 |
+ 0.897 |
+ 0.899 |
+ 0.887 |
+ 0.847 |
+ 1556 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ SRC_HUMAN_Ahler_2019 |
+ 0.825 |
+ 0.825 |
+ 0.796 |
+ 0.797 |
+ 0.820 |
+ 0.825 |
+ 0.833 |
+ 0.780 |
+ 0.837 |
+ 0.848 |
+ 0.813 |
+ 0.861 |
+ 0.875 |
+ 0.775 |
+ 0.797 |
+ 0.806 |
+ 0.856 |
+ 0.833 |
+ 0.814 |
+ 0.815 |
+ 0.780 |
+ 0.769 |
+ 0.766 |
+ 0.740 |
+ 0.792 |
+ 0.804 |
+ 0.785 |
+ 0.766 |
+ 0.715 |
+ 0.847 |
+ 0.865 |
+ 0.855 |
+ 0.770 |
+ 0.764 |
+ 0.771 |
+ 0.721 |
+ 0.816 |
+ 0.819 |
+ 0.806 |
+ 0.831 |
+ 0.832 |
+ 0.827 |
+ 0.855 |
+ 0.749 |
+ 0.820 |
+ 0.845 |
+ 0.697 |
+ 0.684 |
+ 0.754 |
+ 0.562 |
+ 0.826 |
+ 0.810 |
+ 0.813 |
+ 0.825 |
+ 0.831 |
+ 0.826 |
+ 0.833 |
+ 0.834 |
+ 0.832 |
+ 0.836 |
+ 0.826 |
+ 0.797 |
+ 3372 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM |
+ 0.748 |
+ 0.753 |
+ 0.743 |
+ 0.753 |
+ 0.760 |
+ 0.758 |
+ 0.755 |
+ 0.731 |
+ 0.771 |
+ 0.768 |
+ 0.740 |
+ 0.780 |
+ 0.790 |
+ 0.694 |
+ 0.718 |
+ 0.725 |
+ 0.766 |
+ 0.746 |
+ 0.740 |
+ 0.740 |
+ 0.722 |
+ 0.711 |
+ 0.709 |
+ 0.686 |
+ 0.721 |
+ 0.735 |
+ 0.716 |
+ 0.709 |
+ 0.659 |
+ 0.769 |
+ 0.787 |
+ 0.785 |
+ 0.704 |
+ 0.698 |
+ 0.702 |
+ 0.665 |
+ 0.740 |
+ 0.742 |
+ 0.733 |
+ 0.761 |
+ 0.761 |
+ 0.758 |
+ 0.766 |
+ 0.677 |
+ 0.748 |
+ 0.761 |
+ 0.642 |
+ 0.654 |
+ 0.688 |
+ 0.532 |
+ 0.750 |
+ 0.732 |
+ 0.736 |
+ 0.749 |
+ 0.750 |
+ 0.750 |
+ 0.749 |
+ 0.753 |
+ 0.752 |
+ 0.755 |
+ 0.750 |
+ 0.726 |
+ 3637 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Nguyen_2022 |
+ 0.725 |
+ 0.734 |
+ 0.733 |
+ 0.738 |
+ 0.738 |
+ 0.736 |
+ 0.737 |
+ 0.721 |
+ 0.695 |
+ 0.732 |
+ 0.732 |
+ 0.764 |
+ 0.777 |
+ 0.674 |
+ 0.696 |
+ 0.706 |
+ 0.755 |
+ 0.738 |
+ 0.731 |
+ 0.726 |
+ 0.705 |
+ 0.696 |
+ 0.699 |
+ 0.678 |
+ 0.709 |
+ 0.723 |
+ 0.705 |
+ 0.698 |
+ 0.655 |
+ 0.749 |
+ 0.769 |
+ 0.760 |
+ 0.700 |
+ 0.689 |
+ 0.693 |
+ 0.657 |
+ 0.722 |
+ 0.724 |
+ 0.715 |
+ 0.741 |
+ 0.741 |
+ 0.737 |
+ 0.757 |
+ 0.658 |
+ 0.740 |
+ 0.756 |
+ 0.627 |
+ 0.642 |
+ 0.671 |
+ 0.537 |
+ 0.741 |
+ 0.723 |
+ 0.726 |
+ 0.738 |
+ 0.741 |
+ 0.739 |
+ 0.741 |
+ 0.745 |
+ 0.741 |
+ 0.745 |
+ 0.739 |
+ 0.703 |
+ 3366 |
+ OrganismalFitness |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SUMO1_HUMAN_Weile_2017 |
+ 0.718 |
+ 0.703 |
+ 0.735 |
+ 0.747 |
+ 0.778 |
+ 0.773 |
+ 0.580 |
+ 0.755 |
+ 0.757 |
+ 0.723 |
+ 0.753 |
+ 0.780 |
+ 0.800 |
+ 0.645 |
+ 0.800 |
+ 0.818 |
+ 0.802 |
+ 0.717 |
+ 0.686 |
+ 0.808 |
+ 0.631 |
+ 0.743 |
+ 0.769 |
+ 0.758 |
+ 0.786 |
+ 0.732 |
+ 0.777 |
+ 0.757 |
+ 0.684 |
+ 0.767 |
+ 0.770 |
+ 0.763 |
+ 0.628 |
+ 0.657 |
+ 0.788 |
+ 0.689 |
+ 0.743 |
+ 0.805 |
+ 0.736 |
+ 0.763 |
+ 0.809 |
+ 0.763 |
+ 0.803 |
+ 0.548 |
+ 0.716 |
+ 0.815 |
+ 0.808 |
+ 0.751 |
+ 0.814 |
+ 0.757 |
+ 0.790 |
+ 0.764 |
+ 0.797 |
+ 0.792 |
+ 0.790 |
+ 0.802 |
+ 0.783 |
+ 0.796 |
+ 0.786 |
+ 0.801 |
+ 0.777 |
+ 0.817 |
+ 1700 |
+ OrganismalFitness |
+ SUMO1_HUMAN |
+ High |
+ Human |
+
+
+ SYUA_HUMAN_Newberry_2020 |
+ 0.610 |
+ 0.647 |
+ 0.668 |
+ 0.676 |
+ 0.671 |
+ 0.676 |
+ 0.667 |
+ 0.719 |
+ 0.703 |
+ 0.697 |
+ 0.774 |
+ 0.775 |
+ 0.772 |
+ 0.669 |
+ 0.700 |
+ 0.699 |
+ 0.702 |
+ 0.748 |
+ 0.753 |
+ 0.753 |
+ 0.683 |
+ 0.735 |
+ 0.698 |
+ 0.697 |
+ 0.654 |
+ 0.717 |
+ 0.731 |
+ 0.689 |
+ 0.687 |
+ 0.755 |
+ 0.736 |
+ 0.712 |
+ 0.579 |
+ 0.694 |
+ 0.755 |
+ 0.720 |
+ 0.658 |
+ 0.724 |
+ 0.692 |
+ 0.679 |
+ 0.717 |
+ 0.697 |
+ 0.665 |
+ 0.617 |
+ 0.719 |
+ 0.697 |
+ 0.495 |
+ 0.634 |
+ 0.519 |
+ 0.472 |
+ 0.648 |
+ 0.640 |
+ 0.638 |
+ 0.657 |
+ 0.687 |
+ 0.659 |
+ 0.678 |
+ 0.671 |
+ 0.692 |
+ 0.670 |
+ 0.574 |
+ 0.544 |
+ 2497 |
+ OrganismalFitness |
+ SYUA_HUMAN |
+ Medium |
+ Human |
+
+
+ TADBP_HUMAN_Bolognesi_2019 |
+ 0.555 |
+ 0.538 |
+ 0.554 |
+ 0.552 |
+ 0.547 |
+ 0.545 |
+ 0.620 |
+ 0.503 |
+ 0.544 |
+ 0.542 |
+ 0.512 |
+ 0.525 |
+ 0.524 |
+ 0.552 |
+ 0.517 |
+ 0.516 |
+ 0.447 |
+ 0.502 |
+ 0.533 |
+ 0.474 |
+ 0.568 |
+ 0.532 |
+ 0.504 |
+ 0.502 |
+ 0.582 |
+ 0.551 |
+ 0.517 |
+ 0.525 |
+ 0.498 |
+ 0.525 |
+ 0.554 |
+ 0.570 |
+ 0.471 |
+ 0.595 |
+ 0.619 |
+ 0.567 |
+ 0.571 |
+ 0.590 |
+ 0.569 |
+ 0.556 |
+ 0.566 |
+ 0.563 |
+ 0.538 |
+ 0.584 |
+ 0.499 |
+ 0.528 |
+ 0.638 |
+ 0.524 |
+ 0.590 |
+ 0.529 |
+ 0.568 |
+ 0.508 |
+ 0.503 |
+ 0.521 |
+ 0.503 |
+ 0.523 |
+ 0.532 |
+ 0.502 |
+ 0.469 |
+ 0.517 |
+ 0.518 |
+ 0.549 |
+ 1196 |
+ OrganismalFitness |
+ TADBP_HUMAN |
+ Low |
+ Human |
+
+
+ TAT_HV1BR_Fernandes_2016 |
+ 0.707 |
+ 0.640 |
+ 0.678 |
+ 0.687 |
+ 0.721 |
+ 0.708 |
+ 0.439 |
+ 0.774 |
+ 0.673 |
+ 0.685 |
+ 0.616 |
+ 0.738 |
+ 0.740 |
+ 0.479 |
+ 0.499 |
+ 0.498 |
+ 0.522 |
+ 0.461 |
+ 0.529 |
+ 0.646 |
+ 0.760 |
+ 0.758 |
+ 0.775 |
+ 0.781 |
+ 0.772 |
+ 0.680 |
+ 0.591 |
+ 0.670 |
+ 0.649 |
+ 0.756 |
+ 0.787 |
+ 0.779 |
+ 0.709 |
+ 0.767 |
+ 0.649 |
+ 0.639 |
+ 0.765 |
+ 0.688 |
+ 0.665 |
+ 0.757 |
+ 0.709 |
+ 0.685 |
+ 0.509 |
+ 0.465 |
+ 0.712 |
+ 0.625 |
+ 0.650 |
+ 0.702 |
+ 0.726 |
+ 0.640 |
+ 0.580 |
+ 0.582 |
+ 0.598 |
+ 0.583 |
+ 0.576 |
+ 0.585 |
+ 0.573 |
+ 0.577 |
+ 0.571 |
+ 0.586 |
+ 0.607 |
+ 0.581 |
+ 1577 |
+ OrganismalFitness |
+ TAT_HV1BR |
+ High |
+ Virus |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L |
+ 0.837 |
+ 0.852 |
+ 0.880 |
+ 0.885 |
+ 0.887 |
+ 0.888 |
+ 0.883 |
+ 0.686 |
+ 0.892 |
+ 0.918 |
+ 0.874 |
+ 0.922 |
+ 0.933 |
+ 0.908 |
+ 0.914 |
+ 0.940 |
+ 0.931 |
+ 0.915 |
+ 0.929 |
+ 0.837 |
+ 0.785 |
+ 0.843 |
+ 0.837 |
+ 0.865 |
+ 0.854 |
+ 0.861 |
+ 0.888 |
+ 0.884 |
+ 0.851 |
+ 0.909 |
+ 0.899 |
+ 0.886 |
+ 0.840 |
+ 0.777 |
+ 0.850 |
+ 0.885 |
+ 0.895 |
+ 0.907 |
+ 0.910 |
+ 0.897 |
+ 0.908 |
+ 0.906 |
+ 0.905 |
+ 0.588 |
+ 0.827 |
+ 0.901 |
+ 0.853 |
+ 0.797 |
+ 0.928 |
+ 0.917 |
+ 0.918 |
+ 0.907 |
+ 0.898 |
+ 0.931 |
+ 0.928 |
+ 0.931 |
+ 0.922 |
+ 0.930 |
+ 0.919 |
+ 0.928 |
+ 0.934 |
+ 0.935 |
+ 1058 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG |
+ 0.707 |
+ 0.760 |
+ 0.810 |
+ 0.805 |
+ 0.811 |
+ 0.810 |
+ 0.509 |
+ 0.561 |
+ 0.851 |
+ 0.845 |
+ 0.836 |
+ 0.804 |
+ 0.828 |
+ 0.576 |
+ 0.848 |
+ 0.849 |
+ 0.840 |
+ 0.851 |
+ 0.857 |
+ 0.831 |
+ 0.441 |
+ 0.692 |
+ 0.695 |
+ 0.668 |
+ 0.683 |
+ 0.717 |
+ 0.618 |
+ 0.750 |
+ 0.795 |
+ 0.852 |
+ 0.853 |
+ 0.843 |
+ 0.775 |
+ 0.469 |
+ 0.721 |
+ 0.686 |
+ 0.784 |
+ 0.816 |
+ 0.809 |
+ 0.803 |
+ 0.825 |
+ 0.824 |
+ 0.641 |
+ 0.547 |
+ 0.797 |
+ 0.711 |
+ 0.796 |
+ 0.796 |
+ 0.812 |
+ 0.868 |
+ 0.860 |
+ 0.851 |
+ 0.866 |
+ 0.866 |
+ 0.870 |
+ 0.856 |
+ 0.858 |
+ 0.855 |
+ 0.858 |
+ 0.861 |
+ 0.826 |
+ 0.775 |
+ 1279 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT |
+ 0.712 |
+ 0.742 |
+ 0.774 |
+ 0.767 |
+ 0.769 |
+ 0.769 |
+ 0.422 |
+ 0.710 |
+ 0.795 |
+ 0.795 |
+ 0.757 |
+ 0.762 |
+ 0.765 |
+ 0.505 |
+ 0.760 |
+ 0.791 |
+ 0.778 |
+ 0.776 |
+ 0.785 |
+ 0.752 |
+ 0.643 |
+ 0.691 |
+ 0.709 |
+ 0.707 |
+ 0.696 |
+ 0.726 |
+ 0.690 |
+ 0.704 |
+ 0.719 |
+ 0.779 |
+ 0.753 |
+ 0.732 |
+ 0.602 |
+ 0.536 |
+ 0.670 |
+ 0.682 |
+ 0.767 |
+ 0.766 |
+ 0.763 |
+ 0.786 |
+ 0.775 |
+ 0.774 |
+ 0.653 |
+ 0.465 |
+ 0.781 |
+ 0.735 |
+ 0.733 |
+ 0.813 |
+ 0.839 |
+ 0.816 |
+ 0.772 |
+ 0.783 |
+ 0.776 |
+ 0.772 |
+ 0.772 |
+ 0.767 |
+ 0.774 |
+ 0.770 |
+ 0.772 |
+ 0.777 |
+ 0.814 |
+ 0.800 |
+ 1479 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ TPK1_HUMAN_Weile_2017 |
+ 0.622 |
+ 0.631 |
+ 0.626 |
+ 0.629 |
+ 0.632 |
+ 0.631 |
+ 0.547 |
+ 0.639 |
+ 0.662 |
+ 0.652 |
+ 0.658 |
+ 0.653 |
+ 0.675 |
+ 0.571 |
+ 0.618 |
+ 0.644 |
+ 0.684 |
+ 0.680 |
+ 0.706 |
+ 0.640 |
+ 0.555 |
+ 0.575 |
+ 0.618 |
+ 0.646 |
+ 0.569 |
+ 0.647 |
+ 0.652 |
+ 0.635 |
+ 0.669 |
+ 0.641 |
+ 0.642 |
+ 0.606 |
+ 0.563 |
+ 0.571 |
+ 0.614 |
+ 0.669 |
+ 0.632 |
+ 0.638 |
+ 0.671 |
+ 0.640 |
+ 0.640 |
+ 0.655 |
+ 0.578 |
+ 0.560 |
+ 0.673 |
+ 0.601 |
+ 0.617 |
+ 0.645 |
+ 0.633 |
+ 0.559 |
+ 0.636 |
+ 0.655 |
+ 0.649 |
+ 0.655 |
+ 0.663 |
+ 0.664 |
+ 0.656 |
+ 0.658 |
+ 0.660 |
+ 0.664 |
+ 0.652 |
+ 0.622 |
+ 3181 |
+ OrganismalFitness |
+ TPK1_HUMAN |
+ Medium |
+ Human |
+
+
+ TPMT_HUMAN_Matreyek_2018 |
+ 0.722 |
+ 0.754 |
+ 0.773 |
+ 0.782 |
+ 0.778 |
+ 0.787 |
+ 0.656 |
+ 0.761 |
+ 0.786 |
+ 0.787 |
+ 0.812 |
+ 0.785 |
+ 0.802 |
+ 0.706 |
+ 0.751 |
+ 0.814 |
+ 0.807 |
+ 0.756 |
+ 0.749 |
+ 0.779 |
+ 0.694 |
+ 0.725 |
+ 0.761 |
+ 0.766 |
+ 0.742 |
+ 0.777 |
+ 0.757 |
+ 0.746 |
+ 0.736 |
+ 0.805 |
+ 0.802 |
+ 0.779 |
+ 0.708 |
+ 0.698 |
+ 0.768 |
+ 0.719 |
+ 0.785 |
+ 0.797 |
+ 0.765 |
+ 0.799 |
+ 0.805 |
+ 0.783 |
+ 0.736 |
+ 0.637 |
+ 0.797 |
+ 0.777 |
+ 0.779 |
+ 0.788 |
+ 0.807 |
+ 0.641 |
+ 0.782 |
+ 0.785 |
+ 0.791 |
+ 0.776 |
+ 0.795 |
+ 0.793 |
+ 0.795 |
+ 0.790 |
+ 0.798 |
+ 0.798 |
+ 0.827 |
+ 0.778 |
+ 3648 |
+ Expression |
+ TPMT_HUMAN |
+ Medium |
+ Human |
+
+
+ TPOR_HUMAN_Bridgford_2020 |
+ 0.723 |
+ 0.695 |
+ 0.677 |
+ 0.655 |
+ 0.658 |
+ 0.667 |
+ 0.765 |
+ 0.752 |
+ 0.718 |
+ 0.721 |
+ 0.739 |
+ 0.732 |
+ 0.739 |
+ 0.694 |
+ 0.748 |
+ 0.704 |
+ 0.685 |
+ 0.736 |
+ 0.678 |
+ 0.677 |
+ 0.681 |
+ 0.737 |
+ 0.714 |
+ 0.726 |
+ 0.749 |
+ 0.634 |
+ 0.806 |
+ 0.780 |
+ 0.708 |
+ 0.741 |
+ 0.751 |
+ 0.630 |
+ 0.747 |
+ 0.756 |
+ 0.730 |
+ 0.751 |
+ 0.776 |
+ 0.768 |
+ 0.773 |
+ 0.750 |
+ 0.739 |
+ 0.757 |
+ 0.709 |
+ 0.712 |
+ 0.717 |
+ 0.738 |
+ 0.578 |
+ 0.690 |
+ 0.681 |
+ 0.511 |
+ 0.702 |
+ 0.676 |
+ 0.722 |
+ 0.701 |
+ 0.684 |
+ 0.699 |
+ 0.739 |
+ 0.643 |
+ 0.672 |
+ 0.699 |
+ 0.749 |
+ 0.752 |
+ 562 |
+ OrganismalFitness |
+ TPOR_HUMAN |
+ Low |
+ Human |
+
+
+ TRPC_SACS2_Chan_2017 |
+ 0.832 |
+ 0.841 |
+ 0.818 |
+ 0.827 |
+ 0.829 |
+ 0.834 |
+ 0.630 |
+ 0.804 |
+ 0.873 |
+ 0.880 |
+ 0.858 |
+ 0.849 |
+ 0.869 |
+ 0.689 |
+ 0.817 |
+ 0.858 |
+ 0.864 |
+ 0.860 |
+ 0.847 |
+ 0.835 |
+ 0.758 |
+ 0.832 |
+ 0.800 |
+ 0.832 |
+ 0.792 |
+ 0.798 |
+ 0.817 |
+ 0.803 |
+ 0.799 |
+ 0.808 |
+ 0.846 |
+ 0.808 |
+ 0.620 |
+ 0.804 |
+ 0.829 |
+ 0.827 |
+ 0.834 |
+ 0.848 |
+ 0.844 |
+ 0.841 |
+ 0.854 |
+ 0.847 |
+ 0.763 |
+ 0.539 |
+ 0.850 |
+ 0.800 |
+ 0.674 |
+ 0.788 |
+ 0.793 |
+ 0.564 |
+ 0.842 |
+ 0.833 |
+ 0.838 |
+ 0.867 |
+ 0.851 |
+ 0.850 |
+ 0.853 |
+ 0.860 |
+ 0.847 |
+ 0.863 |
+ 0.858 |
+ 0.836 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ TRPC_THEMA_Chan_2017 |
+ 0.768 |
+ 0.780 |
+ 0.760 |
+ 0.764 |
+ 0.760 |
+ 0.762 |
+ 0.691 |
+ 0.748 |
+ 0.794 |
+ 0.790 |
+ 0.772 |
+ 0.797 |
+ 0.807 |
+ 0.661 |
+ 0.796 |
+ 0.797 |
+ 0.794 |
+ 0.793 |
+ 0.811 |
+ 0.773 |
+ 0.728 |
+ 0.760 |
+ 0.762 |
+ 0.783 |
+ 0.756 |
+ 0.763 |
+ 0.746 |
+ 0.744 |
+ 0.787 |
+ 0.753 |
+ 0.765 |
+ 0.732 |
+ 0.603 |
+ 0.731 |
+ 0.784 |
+ 0.775 |
+ 0.762 |
+ 0.784 |
+ 0.779 |
+ 0.770 |
+ 0.783 |
+ 0.777 |
+ 0.757 |
+ 0.558 |
+ 0.790 |
+ 0.754 |
+ 0.640 |
+ 0.721 |
+ 0.730 |
+ 0.520 |
+ 0.784 |
+ 0.751 |
+ 0.765 |
+ 0.784 |
+ 0.784 |
+ 0.757 |
+ 0.777 |
+ 0.777 |
+ 0.765 |
+ 0.783 |
+ 0.814 |
+ 0.794 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_THEMA |
+ Medium |
+ Prokaryote |
+
+
+ UBC9_HUMAN_Weile_2017 |
+ 0.697 |
+ 0.771 |
+ 0.796 |
+ 0.801 |
+ 0.788 |
+ 0.794 |
+ 0.477 |
+ 0.720 |
+ 0.787 |
+ 0.791 |
+ 0.731 |
+ 0.773 |
+ 0.790 |
+ 0.498 |
+ 0.499 |
+ 0.716 |
+ 0.756 |
+ 0.786 |
+ 0.803 |
+ 0.816 |
+ 0.625 |
+ 0.727 |
+ 0.746 |
+ 0.732 |
+ 0.734 |
+ 0.760 |
+ 0.759 |
+ 0.747 |
+ 0.745 |
+ 0.766 |
+ 0.757 |
+ 0.745 |
+ 0.551 |
+ 0.630 |
+ 0.727 |
+ 0.740 |
+ 0.678 |
+ 0.752 |
+ 0.762 |
+ 0.749 |
+ 0.792 |
+ 0.803 |
+ 0.648 |
+ 0.493 |
+ 0.733 |
+ 0.774 |
+ 0.684 |
+ 0.695 |
+ 0.754 |
+ 0.643 |
+ 0.736 |
+ 0.727 |
+ 0.720 |
+ 0.735 |
+ 0.730 |
+ 0.731 |
+ 0.735 |
+ 0.739 |
+ 0.741 |
+ 0.738 |
+ 0.715 |
+ 0.736 |
+ 2563 |
+ OrganismalFitness |
+ UBC9_HUMAN |
+ Medium |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X |
+ 0.703 |
+ 0.811 |
+ 0.829 |
+ 0.828 |
+ 0.834 |
+ 0.833 |
+ 0.628 |
+ 0.780 |
+ 0.810 |
+ 0.824 |
+ 0.815 |
+ 0.845 |
+ 0.837 |
+ 0.751 |
+ 0.810 |
+ 0.854 |
+ 0.843 |
+ 0.847 |
+ 0.858 |
+ 0.818 |
+ 0.660 |
+ 0.792 |
+ 0.795 |
+ 0.801 |
+ 0.784 |
+ 0.803 |
+ 0.787 |
+ 0.789 |
+ 0.807 |
+ 0.830 |
+ 0.863 |
+ 0.852 |
+ 0.813 |
+ 0.612 |
+ 0.615 |
+ 0.757 |
+ 0.772 |
+ 0.769 |
+ 0.784 |
+ 0.833 |
+ 0.831 |
+ 0.823 |
+ 0.763 |
+ 0.484 |
+ 0.720 |
+ 0.726 |
+ 0.805 |
+ 0.763 |
+ 0.845 |
+ 0.755 |
+ 0.826 |
+ 0.843 |
+ 0.838 |
+ 0.845 |
+ 0.845 |
+ 0.840 |
+ 0.839 |
+ 0.843 |
+ 0.842 |
+ 0.843 |
+ 0.861 |
+ 0.813 |
+ 3622 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_MOUSE_Starita_2013 |
+ 0.779 |
+ 0.799 |
+ 0.824 |
+ 0.831 |
+ 0.842 |
+ 0.844 |
+ 0.536 |
+ 0.594 |
+ 0.796 |
+ 0.805 |
+ 0.762 |
+ 0.838 |
+ 0.846 |
+ 0.817 |
+ 0.838 |
+ 0.847 |
+ 0.834 |
+ 0.786 |
+ 0.752 |
+ 0.798 |
+ 0.590 |
+ 0.764 |
+ 0.749 |
+ 0.732 |
+ 0.837 |
+ 0.800 |
+ 0.786 |
+ 0.775 |
+ 0.731 |
+ 0.826 |
+ 0.845 |
+ 0.825 |
+ 0.534 |
+ 0.492 |
+ 0.580 |
+ 0.698 |
+ 0.782 |
+ 0.749 |
+ 0.776 |
+ 0.834 |
+ 0.828 |
+ 0.842 |
+ 0.815 |
+ 0.499 |
+ 0.814 |
+ 0.826 |
+ 0.694 |
+ 0.773 |
+ 0.469 |
+ 0.540 |
+ 0.814 |
+ 0.798 |
+ 0.832 |
+ 0.836 |
+ 0.820 |
+ 0.834 |
+ 0.813 |
+ 0.829 |
+ 0.826 |
+ 0.833 |
+ 0.814 |
+ 0.791 |
+ 899 |
+ Activity |
+ UBE4B_MOUSE |
+ Low |
+ Eukaryote |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T |
+ 0.735 |
+ 0.832 |
+ 0.832 |
+ 0.834 |
+ 0.837 |
+ 0.845 |
+ 0.601 |
+ 0.665 |
+ 0.798 |
+ 0.816 |
+ 0.813 |
+ 0.742 |
+ 0.791 |
+ 0.627 |
+ 0.635 |
+ 0.587 |
+ 0.588 |
+ 0.840 |
+ 0.835 |
+ 0.795 |
+ 0.754 |
+ 0.749 |
+ 0.777 |
+ 0.804 |
+ 0.765 |
+ 0.784 |
+ 0.795 |
+ 0.789 |
+ 0.801 |
+ 0.815 |
+ 0.784 |
+ 0.717 |
+ 0.669 |
+ 0.668 |
+ 0.758 |
+ 0.742 |
+ 0.760 |
+ 0.786 |
+ 0.784 |
+ 0.845 |
+ 0.846 |
+ 0.845 |
+ 0.555 |
+ 0.552 |
+ 0.707 |
+ 0.589 |
+ 0.784 |
+ 0.716 |
+ 0.854 |
+ 0.820 |
+ 0.837 |
+ 0.853 |
+ 0.858 |
+ 0.860 |
+ 0.860 |
+ 0.864 |
+ 0.852 |
+ 0.858 |
+ 0.849 |
+ 0.858 |
+ 0.871 |
+ 0.826 |
+ 1453 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8 |
+ 0.583 |
+ 0.623 |
+ 0.591 |
+ 0.609 |
+ 0.612 |
+ 0.609 |
+ 0.618 |
+ 0.467 |
+ 0.688 |
+ 0.681 |
+ 0.831 |
+ 0.785 |
+ 0.790 |
+ 0.674 |
+ 0.761 |
+ 0.808 |
+ 0.844 |
+ 0.817 |
+ 0.850 |
+ 0.566 |
+ 0.701 |
+ 0.765 |
+ 0.747 |
+ 0.813 |
+ 0.715 |
+ 0.736 |
+ 0.729 |
+ 0.752 |
+ 0.815 |
+ 0.766 |
+ 0.782 |
+ 0.715 |
+ 0.487 |
+ 0.652 |
+ 0.760 |
+ 0.780 |
+ 0.659 |
+ 0.700 |
+ 0.753 |
+ 0.655 |
+ 0.684 |
+ 0.709 |
+ 0.737 |
+ 0.637 |
+ 0.781 |
+ 0.752 |
+ 0.742 |
+ 0.783 |
+ 0.792 |
+ 0.759 |
+ 0.840 |
+ 0.821 |
+ 0.842 |
+ 0.844 |
+ 0.842 |
+ 0.860 |
+ 0.822 |
+ 0.840 |
+ 0.857 |
+ 0.849 |
+ 0.819 |
+ 0.809 |
+ 723 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5 |
+ 0.687 |
+ 0.772 |
+ 0.869 |
+ 0.894 |
+ 0.896 |
+ 0.897 |
+ 0.552 |
+ 0.811 |
+ 0.857 |
+ 0.874 |
+ 0.914 |
+ 0.886 |
+ 0.896 |
+ 0.695 |
+ 0.618 |
+ 0.904 |
+ 0.912 |
+ 0.880 |
+ 0.817 |
+ 0.862 |
+ 0.668 |
+ 0.757 |
+ 0.842 |
+ 0.812 |
+ 0.793 |
+ 0.826 |
+ 0.812 |
+ 0.792 |
+ 0.843 |
+ 0.880 |
+ 0.919 |
+ 0.913 |
+ 0.692 |
+ 0.683 |
+ 0.791 |
+ 0.789 |
+ 0.812 |
+ 0.840 |
+ 0.831 |
+ 0.894 |
+ 0.898 |
+ 0.888 |
+ 0.626 |
+ 0.660 |
+ 0.790 |
+ 0.795 |
+ 0.816 |
+ 0.793 |
+ 0.922 |
+ 0.861 |
+ 0.920 |
+ 0.921 |
+ 0.925 |
+ 0.925 |
+ 0.919 |
+ 0.927 |
+ 0.919 |
+ 0.922 |
+ 0.924 |
+ 0.925 |
+ 0.909 |
+ 0.891 |
+ 2568 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VKOR1_HUMAN_Chiasson_2020_abundance |
+ 0.704 |
+ 0.711 |
+ 0.712 |
+ 0.723 |
+ 0.734 |
+ 0.736 |
+ 0.601 |
+ 0.710 |
+ 0.764 |
+ 0.772 |
+ 0.738 |
+ 0.742 |
+ 0.753 |
+ 0.604 |
+ 0.707 |
+ 0.754 |
+ 0.762 |
+ 0.759 |
+ 0.757 |
+ 0.750 |
+ 0.626 |
+ 0.636 |
+ 0.705 |
+ 0.732 |
+ 0.672 |
+ 0.743 |
+ 0.740 |
+ 0.724 |
+ 0.726 |
+ 0.752 |
+ 0.715 |
+ 0.666 |
+ 0.554 |
+ 0.601 |
+ 0.664 |
+ 0.764 |
+ 0.727 |
+ 0.738 |
+ 0.784 |
+ 0.747 |
+ 0.754 |
+ 0.779 |
+ 0.595 |
+ 0.570 |
+ 0.749 |
+ 0.679 |
+ 0.734 |
+ 0.744 |
+ 0.762 |
+ 0.644 |
+ 0.779 |
+ 0.765 |
+ 0.788 |
+ 0.795 |
+ 0.789 |
+ 0.782 |
+ 0.783 |
+ 0.772 |
+ 0.780 |
+ 0.789 |
+ 0.777 |
+ 0.742 |
+ 2695 |
+ Expression |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VKOR1_HUMAN_Chiasson_2020_activity |
+ 0.705 |
+ 0.720 |
+ 0.729 |
+ 0.739 |
+ 0.742 |
+ 0.751 |
+ 0.525 |
+ 0.720 |
+ 0.750 |
+ 0.757 |
+ 0.749 |
+ 0.755 |
+ 0.766 |
+ 0.534 |
+ 0.687 |
+ 0.712 |
+ 0.754 |
+ 0.760 |
+ 0.754 |
+ 0.747 |
+ 0.554 |
+ 0.568 |
+ 0.664 |
+ 0.684 |
+ 0.592 |
+ 0.730 |
+ 0.722 |
+ 0.717 |
+ 0.733 |
+ 0.748 |
+ 0.744 |
+ 0.718 |
+ 0.601 |
+ 0.534 |
+ 0.598 |
+ 0.721 |
+ 0.715 |
+ 0.721 |
+ 0.742 |
+ 0.747 |
+ 0.753 |
+ 0.758 |
+ 0.563 |
+ 0.540 |
+ 0.746 |
+ 0.646 |
+ 0.621 |
+ 0.688 |
+ 0.711 |
+ 0.541 |
+ 0.742 |
+ 0.709 |
+ 0.721 |
+ 0.730 |
+ 0.734 |
+ 0.727 |
+ 0.732 |
+ 0.732 |
+ 0.729 |
+ 0.736 |
+ 0.748 |
+ 0.693 |
+ 697 |
+ Activity |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM |
+ 0.422 |
+ 0.514 |
+ 0.587 |
+ 0.615 |
+ 0.550 |
+ 0.572 |
+ 0.548 |
+ 0.603 |
+ 0.750 |
+ 0.748 |
+ 0.801 |
+ 0.722 |
+ 0.733 |
+ 0.669 |
+ 0.755 |
+ 0.811 |
+ 0.860 |
+ 0.874 |
+ 0.813 |
+ 0.616 |
+ 0.488 |
+ 0.567 |
+ 0.575 |
+ 0.636 |
+ 0.613 |
+ 0.622 |
+ 0.570 |
+ 0.622 |
+ 0.664 |
+ 0.563 |
+ 0.809 |
+ 0.769 |
+ 0.638 |
+ 0.514 |
+ 0.514 |
+ 0.553 |
+ 0.468 |
+ 0.474 |
+ 0.456 |
+ 0.528 |
+ 0.537 |
+ 0.490 |
+ 0.656 |
+ 0.521 |
+ 0.819 |
+ 0.728 |
+ 0.833 |
+ 0.812 |
+ 0.840 |
+ 0.842 |
+ 0.854 |
+ 0.845 |
+ 0.862 |
+ 0.861 |
+ 0.857 |
+ 0.869 |
+ 0.858 |
+ 0.872 |
+ 0.853 |
+ 0.868 |
+ 0.878 |
+ 0.837 |
+ 1047 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YAIA_ECOLI_Tsuboyama_2023_2KVT |
+ 0.650 |
+ 0.816 |
+ 0.795 |
+ 0.809 |
+ 0.807 |
+ 0.803 |
+ 0.380 |
+ 0.783 |
+ 0.781 |
+ 0.804 |
+ 0.798 |
+ 0.618 |
+ 0.733 |
+ 0.432 |
+ 0.574 |
+ 0.791 |
+ 0.813 |
+ 0.844 |
+ 0.851 |
+ 0.800 |
+ 0.459 |
+ 0.404 |
+ 0.435 |
+ 0.633 |
+ 0.431 |
+ 0.409 |
+ 0.373 |
+ 0.498 |
+ 0.771 |
+ 0.803 |
+ 0.854 |
+ 0.840 |
+ 0.471 |
+ 0.363 |
+ 0.451 |
+ 0.750 |
+ 0.677 |
+ 0.682 |
+ 0.766 |
+ 0.791 |
+ 0.788 |
+ 0.806 |
+ 0.459 |
+ 0.363 |
+ 0.582 |
+ 0.479 |
+ 0.709 |
+ 0.691 |
+ 0.813 |
+ 0.767 |
+ 0.794 |
+ 0.819 |
+ 0.794 |
+ 0.813 |
+ 0.808 |
+ 0.819 |
+ 0.818 |
+ 0.835 |
+ 0.844 |
+ 0.820 |
+ 0.805 |
+ 0.743 |
+ 1890 |
+ Stability |
+ YAIA_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ YAP1_HUMAN_Araya_2012 |
+ 0.726 |
+ 0.674 |
+ 0.743 |
+ 0.742 |
+ 0.735 |
+ 0.738 |
+ 0.673 |
+ 0.654 |
+ 0.524 |
+ 0.534 |
+ 0.677 |
+ 0.649 |
+ 0.654 |
+ 0.716 |
+ 0.736 |
+ 0.734 |
+ 0.740 |
+ 0.698 |
+ 0.667 |
+ 0.607 |
+ 0.598 |
+ 0.597 |
+ 0.588 |
+ 0.592 |
+ 0.657 |
+ 0.623 |
+ 0.637 |
+ 0.612 |
+ 0.586 |
+ 0.677 |
+ 0.729 |
+ 0.740 |
+ 0.569 |
+ 0.660 |
+ 0.605 |
+ 0.618 |
+ 0.712 |
+ 0.672 |
+ 0.691 |
+ 0.728 |
+ 0.712 |
+ 0.731 |
+ 0.750 |
+ 0.454 |
+ 0.673 |
+ 0.750 |
+ 0.679 |
+ 0.685 |
+ 0.700 |
+ 0.603 |
+ 0.734 |
+ 0.699 |
+ 0.725 |
+ 0.721 |
+ 0.718 |
+ 0.713 |
+ 0.716 |
+ 0.729 |
+ 0.732 |
+ 0.728 |
+ 0.698 |
+ 0.720 |
+ 10075 |
+ Binding |
+ YAP1_HUMAN |
+ Low |
+ Human |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD |
+ 0.904 |
+ 0.927 |
+ 0.955 |
+ 0.954 |
+ 0.948 |
+ 0.945 |
+ 0.819 |
+ 0.853 |
+ 0.939 |
+ 0.941 |
+ 0.969 |
+ 0.967 |
+ 0.971 |
+ 0.874 |
+ 0.883 |
+ 0.973 |
+ 0.974 |
+ 0.961 |
+ 0.970 |
+ 0.942 |
+ 0.848 |
+ 0.912 |
+ 0.929 |
+ 0.933 |
+ 0.923 |
+ 0.930 |
+ 0.886 |
+ 0.934 |
+ 0.952 |
+ 0.923 |
+ 0.942 |
+ 0.919 |
+ 0.911 |
+ 0.856 |
+ 0.907 |
+ 0.897 |
+ 0.922 |
+ 0.926 |
+ 0.923 |
+ 0.942 |
+ 0.938 |
+ 0.944 |
+ 0.732 |
+ 0.658 |
+ 0.826 |
+ 0.739 |
+ 0.775 |
+ 0.825 |
+ 0.980 |
+ 0.932 |
+ 0.971 |
+ 0.964 |
+ 0.970 |
+ 0.971 |
+ 0.970 |
+ 0.973 |
+ 0.970 |
+ 0.967 |
+ 0.971 |
+ 0.971 |
+ 0.959 |
+ 0.964 |
+ 2300 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_Selection_Type_level.csv b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_Selection_Type_level.csv
new file mode 100644
index 0000000..91e9b18
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_Selection_Type_level.csv
@@ -0,0 +1,7 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type
+0.699,0.737,0.741,0.746,0.748,0.753,0.597,0.695,0.746,0.756,0.73,0.713,0.726,0.607,0.669,0.71,0.73,0.724,0.717,0.709,0.659,0.691,0.698,0.702,0.68,0.713,0.713,0.718,0.72,0.759,0.756,0.737,0.593,0.652,0.687,0.717,0.734,0.741,0.752,0.756,0.759,0.764,0.655,0.555,0.711,0.682,0.673,0.709,0.697,0.605,0.746,0.739,0.742,0.749,0.748,0.748,0.746,0.747,0.746,0.752,0.747,0.702,Activity
+0.684,0.666,0.68,0.689,0.696,0.702,0.617,0.657,0.666,0.675,0.662,0.652,0.674,0.65,0.668,0.685,0.691,0.675,0.668,0.673,0.652,0.649,0.66,0.665,0.652,0.666,0.664,0.662,0.662,0.704,0.695,0.683,0.585,0.657,0.656,0.659,0.7,0.694,0.688,0.712,0.707,0.701,0.652,0.551,0.649,0.656,0.685,0.674,0.716,0.591,0.691,0.686,0.696,0.694,0.691,0.702,0.692,0.695,0.698,0.702,0.711,0.702,Binding
+0.687,0.707,0.704,0.713,0.721,0.723,0.621,0.7,0.733,0.742,0.72,0.723,0.735,0.647,0.684,0.718,0.725,0.722,0.725,0.691,0.684,0.72,0.729,0.728,0.71,0.735,0.739,0.734,0.73,0.737,0.724,0.683,0.602,0.689,0.721,0.727,0.728,0.74,0.747,0.741,0.747,0.75,0.671,0.596,0.718,0.701,0.731,0.738,0.721,0.603,0.725,0.721,0.733,0.732,0.734,0.732,0.735,0.732,0.732,0.739,0.766,0.736,Expression
+0.714,0.727,0.721,0.729,0.745,0.748,0.577,0.695,0.729,0.732,0.699,0.705,0.718,0.577,0.622,0.674,0.708,0.712,0.715,0.706,0.68,0.709,0.711,0.715,0.689,0.715,0.714,0.713,0.718,0.752,0.747,0.728,0.595,0.679,0.702,0.715,0.73,0.736,0.742,0.75,0.753,0.756,0.623,0.531,0.707,0.653,0.663,0.701,0.674,0.59,0.711,0.706,0.714,0.719,0.716,0.715,0.715,0.714,0.714,0.722,0.703,0.662,OrganismalFitness
+0.695,0.741,0.768,0.769,0.774,0.777,0.61,0.714,0.769,0.777,0.782,0.744,0.764,0.638,0.734,0.779,0.789,0.786,0.775,0.757,0.66,0.698,0.718,0.727,0.695,0.726,0.712,0.726,0.757,0.792,0.79,0.771,0.639,0.645,0.687,0.714,0.749,0.756,0.764,0.779,0.78,0.783,0.661,0.549,0.722,0.697,0.775,0.762,0.838,0.805,0.811,0.806,0.809,0.814,0.814,0.817,0.81,0.814,0.813,0.817,0.826,0.807,Stability
+0.696,0.716,0.723,0.729,0.737,0.741,0.604,0.692,0.729,0.737,0.719,0.707,0.723,0.624,0.676,0.713,0.728,0.724,0.72,0.707,0.667,0.693,0.703,0.707,0.685,0.711,0.709,0.711,0.717,0.749,0.742,0.72,0.603,0.665,0.691,0.707,0.728,0.733,0.739,0.748,0.749,0.751,0.653,0.557,0.701,0.678,0.705,0.717,0.729,0.639,0.737,0.732,0.739,0.742,0.741,0.743,0.74,0.741,0.741,0.747,0.751,0.722,
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_level.csv b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_level.csv
new file mode 100644
index 0000000..8be547d
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/DMS_substitutions_AUC_Uniprot_level.csv
@@ -0,0 +1,223 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type,UniProt_ID,MSA_Neff_L_category,Taxon
+0.69,0.674,0.56,0.544,0.692,0.698,0.426,0.532,0.718,0.721,0.485,0.467,0.494,0.454,0.464,0.46,0.597,0.678,0.685,0.612,0.674,0.651,0.656,0.651,0.658,0.669,0.66,0.653,0.644,0.715,0.653,0.64,0.499,0.648,0.659,0.633,0.677,0.676,0.672,0.675,0.668,0.682,0.459,0.453,0.555,0.459,0.637,0.642,0.642,0.565,0.618,0.617,0.633,0.642,0.631,0.628,0.623,0.628,0.626,0.631,0.598,0.556,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+0.761,0.724,0.727,0.739,0.776,0.779,0.502,0.771,0.776,0.778,0.744,0.767,0.778,0.502,0.51,0.519,0.539,0.562,0.578,0.752,0.767,0.775,0.775,0.774,0.769,0.77,0.754,0.765,0.767,0.769,0.796,0.779,0.681,0.763,0.764,0.778,0.774,0.775,0.779,0.783,0.782,0.786,0.721,0.492,0.767,0.73,0.685,0.748,0.629,0.571,0.608,0.626,0.638,0.644,0.616,0.625,0.627,0.61,0.599,0.631,0.592,0.551,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+0.491,0.518,0.556,0.547,0.528,0.528,0.488,0.537,0.541,0.537,0.521,0.521,0.525,0.477,0.468,0.49,0.516,0.512,0.516,0.538,0.491,0.518,0.531,0.54,0.517,0.537,0.537,0.539,0.542,0.524,0.51,0.504,0.513,0.521,0.527,0.55,0.516,0.521,0.531,0.528,0.53,0.538,0.473,0.47,0.524,0.475,0.516,0.532,0.513,0.517,0.528,0.524,0.522,0.517,0.52,0.529,0.519,0.53,0.523,0.525,0.517,0.491,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+0.727,0.733,0.568,0.534,0.716,0.717,0.515,0.526,0.655,0.665,0.556,0.558,0.558,0.542,0.553,0.564,0.549,0.557,0.555,0.534,0.518,0.516,0.535,0.547,0.501,0.543,0.514,0.535,0.561,0.73,0.668,0.672,0.537,0.532,0.522,0.538,0.661,0.657,0.666,0.636,0.631,0.641,0.536,0.514,0.55,0.556,0.713,0.724,0.738,0.683,0.64,0.668,0.656,0.673,0.663,0.681,0.657,0.669,0.632,0.67,0.718,0.645,Activity,A0A247D711_LISMN,High,Prokaryote
+0.749,0.755,0.742,0.759,0.776,0.78,0.498,0.742,0.754,0.758,0.569,0.755,0.776,0.508,0.518,0.53,0.754,0.756,0.756,0.722,0.736,0.768,0.762,0.775,0.701,0.754,0.756,0.744,0.755,0.77,0.746,0.706,0.544,0.736,0.764,0.77,0.766,0.782,0.78,0.787,0.796,0.795,0.51,0.51,0.65,0.518,0.706,0.726,0.714,0.583,0.746,0.739,0.762,0.758,0.76,0.762,0.767,0.76,0.766,0.768,0.606,0.583,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.749,0.755,0.742,0.759,0.776,0.78,0.498,0.742,0.754,0.758,0.569,0.755,0.776,0.508,0.518,0.53,0.754,0.756,0.756,0.722,0.736,0.768,0.762,0.775,0.701,0.754,0.756,0.744,0.755,0.77,0.746,0.706,0.544,0.736,0.764,0.77,0.766,0.782,0.78,0.787,0.796,0.795,0.51,0.51,0.65,0.518,0.706,0.726,0.714,0.583,0.746,0.739,0.762,0.758,0.76,0.762,0.767,0.76,0.766,0.768,0.606,0.583,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.714,0.681,0.713,0.711,0.715,0.714,0.513,0.692,0.668,0.663,0.521,0.512,0.514,0.51,0.509,0.513,0.57,0.603,0.636,0.635,0.679,0.705,0.712,0.708,0.533,0.643,0.655,0.634,0.676,0.733,0.666,0.658,0.525,0.686,0.697,0.711,0.694,0.701,0.704,0.73,0.732,0.741,0.51,0.499,0.539,0.501,0.64,0.633,0.566,0.559,0.592,0.589,0.596,0.609,0.605,0.603,0.6,0.6,0.597,0.602,0.584,0.551,OrganismalFitness,A4D664_9INFA,Medium,Virus
+0.673,0.772,0.855,0.857,0.838,0.847,0.697,0.781,0.847,0.869,0.875,0.853,0.867,0.736,0.791,0.858,0.905,0.891,0.838,0.806,0.716,0.792,0.805,0.836,0.785,0.835,0.845,0.843,0.874,0.865,0.908,0.871,0.627,0.73,0.802,0.84,0.806,0.824,0.856,0.858,0.861,0.87,0.767,0.547,0.865,0.83,0.845,0.879,0.837,0.723,0.89,0.877,0.875,0.878,0.885,0.887,0.889,0.892,0.893,0.895,0.887,0.772,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+0.714,0.707,0.735,0.722,0.673,0.676,0.693,0.577,0.673,0.679,0.666,0.666,0.711,0.709,0.7,0.703,0.736,0.758,0.688,0.632,0.672,0.645,0.666,0.659,0.718,0.654,0.667,0.664,0.662,0.757,0.622,0.588,0.766,0.713,0.639,0.692,0.74,0.695,0.743,0.739,0.716,0.728,0.703,0.699,0.73,0.696,0.65,0.697,0.405,0.527,0.739,0.734,0.726,0.72,0.722,0.703,0.73,0.687,0.698,0.725,0.722,0.709,Stability,A4_HUMAN,Low,Human
+0.642,0.744,0.665,0.715,0.752,0.753,0.596,0.605,0.753,0.755,0.707,0.741,0.748,0.6,0.621,0.621,0.748,0.754,0.76,0.701,0.597,0.613,0.635,0.652,0.637,0.715,0.704,0.713,0.724,0.739,0.757,0.733,0.509,0.628,0.622,0.708,0.72,0.714,0.736,0.743,0.741,0.753,0.609,0.579,0.682,0.619,0.632,0.695,0.68,0.588,0.748,0.75,0.747,0.747,0.752,0.746,0.749,0.743,0.749,0.754,0.735,0.636,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+0.611,0.588,0.553,0.562,0.593,0.586,0.472,0.502,0.592,0.599,0.612,0.555,0.57,0.483,0.522,0.603,0.606,0.562,0.554,0.595,0.499,0.527,0.558,0.589,0.481,0.545,0.574,0.572,0.616,0.576,0.547,0.534,0.534,0.501,0.527,0.526,0.586,0.565,0.563,0.58,0.564,0.564,0.497,0.516,0.623,0.528,0.694,0.66,0.675,0.559,0.603,0.625,0.608,0.607,0.612,0.621,0.603,0.602,0.602,0.611,0.656,0.591,Binding,ACE2_HUMAN,Medium,Human
+0.672,0.718,0.759,0.763,0.754,0.764,0.738,0.755,0.771,0.771,0.773,0.77,0.775,0.712,0.729,0.745,0.755,0.76,0.766,0.768,0.763,0.768,0.765,0.754,0.77,0.772,0.776,0.773,0.758,0.779,0.759,0.715,0.649,0.773,0.771,0.759,0.773,0.777,0.774,0.773,0.775,0.774,0.745,0.594,0.771,0.764,0.716,0.748,0.736,0.616,0.752,0.753,0.761,0.762,0.759,0.759,0.757,0.764,0.756,0.764,0.788,0.768,Activity,ADRB2_HUMAN,Medium,Human
+0.47,0.697,0.728,0.74,0.727,0.74,0.361,0.622,0.672,0.709,0.68,0.665,0.714,0.339,0.373,0.572,0.65,0.649,0.621,0.776,0.392,0.557,0.708,0.696,0.448,0.672,0.713,0.704,0.631,0.635,0.71,0.711,0.563,0.471,0.496,0.66,0.589,0.591,0.682,0.716,0.711,0.737,0.368,0.368,0.657,0.568,0.735,0.717,0.719,0.689,0.672,0.716,0.653,0.733,0.658,0.659,0.659,0.66,0.644,0.667,0.651,0.486,Activity,AICDA_HUMAN,Medium,Human
+0.667,0.692,0.693,0.687,0.676,0.682,0.33,0.535,0.588,0.598,0.67,0.542,0.699,0.394,0.639,0.636,0.609,0.628,0.61,0.504,0.485,0.385,0.44,0.447,0.317,0.419,0.444,0.416,0.494,0.682,0.61,0.598,0.524,0.356,0.428,0.334,0.684,0.688,0.661,0.697,0.695,0.669,0.409,0.407,0.556,0.533,0.603,0.534,0.689,0.65,0.705,0.642,0.576,0.657,0.649,0.654,0.653,0.648,0.66,0.651,0.652,0.674,Stability,AMFR_HUMAN,Medium,Human
+0.624,0.75,0.779,0.788,0.769,0.763,0.534,0.741,0.713,0.825,0.79,0.833,0.862,0.653,0.693,0.702,0.78,0.858,0.837,0.759,0.794,0.794,0.796,0.8,0.8,0.808,0.797,0.802,0.788,0.814,0.824,0.794,0.635,0.821,0.764,0.735,0.814,0.765,0.745,0.823,0.787,0.775,0.668,0.528,0.735,0.695,0.683,0.735,0.709,0.608,0.783,0.768,0.771,0.774,0.769,0.779,0.764,0.772,0.785,0.783,0.808,0.696,Activity,AMIE_PSEAE,High,Prokaryote
+0.768,0.753,0.715,0.73,0.754,0.756,0.754,0.727,0.744,0.756,0.761,0.756,0.768,0.752,0.783,0.786,0.791,0.789,0.781,0.508,0.723,0.744,0.751,0.747,0.747,0.756,0.752,0.732,0.74,0.777,0.766,0.758,0.649,0.732,0.751,0.736,0.76,0.772,0.767,0.764,0.773,0.766,0.763,0.697,0.739,0.75,0.731,0.728,0.706,0.587,0.771,0.767,0.78,0.779,0.775,0.781,0.773,0.778,0.775,0.783,0.783,0.787,Activity,ANCSZ,Medium,Eukaryote
+0.614,0.697,0.7,0.711,0.71,0.704,0.623,0.677,0.715,0.735,0.688,0.638,0.691,0.638,0.619,0.739,0.741,0.748,0.706,0.706,0.667,0.72,0.726,0.699,0.657,0.711,0.726,0.727,0.705,0.723,0.703,0.656,0.525,0.677,0.686,0.709,0.678,0.686,0.703,0.719,0.717,0.725,0.618,0.604,0.716,0.694,0.871,0.824,0.879,0.828,0.749,0.758,0.756,0.762,0.75,0.749,0.737,0.759,0.735,0.759,0.797,0.741,Stability,ARGR_ECOLI,Medium,Prokaryote
+0.853,0.659,0.823,0.827,0.764,0.813,0.645,0.769,0.619,0.612,0.642,0.519,0.685,0.607,0.586,0.606,0.67,0.686,0.674,0.664,0.647,0.568,0.67,0.788,0.563,0.646,0.711,0.713,0.714,0.89,0.702,0.69,0.647,0.648,0.635,0.681,0.791,0.76,0.709,0.831,0.816,0.712,0.616,0.591,0.632,0.605,0.649,0.655,0.709,0.509,0.695,0.686,0.729,0.665,0.627,0.757,0.662,0.667,0.738,0.706,0.635,0.621,Binding,B2L11_HUMAN,Low,Human
+0.605,0.688,0.722,0.725,0.715,0.723,0.607,0.65,0.724,0.735,0.736,0.724,0.739,0.693,0.724,0.716,0.754,0.744,0.768,0.685,0.687,0.688,0.706,0.691,0.56,0.711,0.718,0.709,0.678,0.688,0.698,0.684,0.645,0.684,0.636,0.653,0.7,0.675,0.687,0.729,0.71,0.721,0.592,0.577,0.671,0.617,0.801,0.748,0.857,0.824,0.743,0.752,0.756,0.764,0.757,0.758,0.758,0.754,0.764,0.758,0.789,0.807,Stability,BBC1_YEAST,High,Eukaryote
+0.698,0.731,0.7,0.705,0.704,0.711,0.563,0.637,0.704,0.772,0.714,0.677,0.717,0.591,0.603,0.662,0.766,0.778,0.732,0.725,0.569,0.693,0.623,0.707,0.608,0.733,0.655,0.716,0.737,0.827,0.796,0.77,0.524,0.634,0.66,0.734,0.734,0.734,0.759,0.717,0.718,0.739,0.563,0.563,0.609,0.598,0.784,0.759,0.845,0.79,0.788,0.792,0.797,0.805,0.798,0.807,0.78,0.794,0.789,0.797,0.769,0.648,Stability,BCHB_CHLTE,Medium,Prokaryote
+0.711,0.834,0.841,0.85,0.831,0.841,0.554,0.707,0.832,0.851,0.83,0.816,0.832,0.678,0.748,0.806,0.847,0.783,0.72,0.806,0.762,0.758,0.746,0.745,0.773,0.802,0.803,0.77,0.713,0.807,0.856,0.846,0.576,0.755,0.748,0.725,0.798,0.798,0.798,0.84,0.844,0.844,0.74,0.52,0.826,0.775,0.779,0.829,0.815,0.657,0.838,0.837,0.841,0.849,0.848,0.848,0.852,0.839,0.844,0.854,0.854,0.776,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.711,0.834,0.841,0.85,0.831,0.841,0.554,0.707,0.832,0.851,0.83,0.816,0.832,0.678,0.748,0.806,0.847,0.783,0.72,0.806,0.762,0.758,0.746,0.745,0.773,0.802,0.803,0.77,0.713,0.807,0.856,0.846,0.576,0.755,0.748,0.725,0.798,0.798,0.798,0.84,0.844,0.844,0.74,0.52,0.826,0.775,0.779,0.829,0.815,0.657,0.838,0.837,0.841,0.849,0.848,0.848,0.852,0.839,0.844,0.854,0.854,0.776,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.711,0.834,0.841,0.85,0.831,0.841,0.554,0.707,0.832,0.851,0.83,0.816,0.832,0.678,0.748,0.806,0.847,0.783,0.72,0.806,0.762,0.758,0.746,0.745,0.773,0.802,0.803,0.77,0.713,0.807,0.856,0.846,0.576,0.755,0.748,0.725,0.798,0.798,0.798,0.84,0.844,0.844,0.74,0.52,0.826,0.775,0.779,0.829,0.815,0.657,0.838,0.837,0.841,0.849,0.848,0.848,0.852,0.839,0.844,0.854,0.854,0.776,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.711,0.834,0.841,0.85,0.831,0.841,0.554,0.707,0.832,0.851,0.83,0.816,0.832,0.678,0.748,0.806,0.847,0.783,0.72,0.806,0.762,0.758,0.746,0.745,0.773,0.802,0.803,0.77,0.713,0.807,0.856,0.846,0.576,0.755,0.748,0.725,0.798,0.798,0.798,0.84,0.844,0.844,0.74,0.52,0.826,0.775,0.779,0.829,0.815,0.657,0.838,0.837,0.841,0.849,0.848,0.848,0.852,0.839,0.844,0.854,0.854,0.776,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.824,0.756,0.788,0.774,0.776,0.8,0.566,0.683,0.76,0.772,0.836,0.775,0.805,0.582,0.767,0.823,0.86,0.845,0.788,0.699,0.577,0.762,0.778,0.774,0.705,0.82,0.814,0.801,0.815,0.776,0.86,0.832,0.658,0.603,0.621,0.79,0.792,0.792,0.846,0.806,0.806,0.855,0.642,0.58,0.865,0.815,0.804,0.809,0.57,0.587,0.856,0.85,0.86,0.852,0.861,0.862,0.858,0.857,0.853,0.864,0.864,0.832,OrganismalFitness,BRCA1_HUMAN,Low,Human
+0.908,0.816,0.866,0.861,0.85,0.875,0.588,0.893,0.702,0.556,0.906,0.593,0.526,0.529,0.528,0.914,0.94,0.904,0.899,0.584,0.515,0.921,0.923,0.861,0.928,0.888,0.896,0.887,0.587,0.86,0.785,0.85,0.558,0.549,0.581,0.59,0.822,0.817,0.825,0.833,0.819,0.837,0.59,0.553,0.892,0.492,0.48,0.503,0.279,0.76,0.871,0.899,0.896,0.878,0.876,0.874,0.868,0.839,0.841,0.885,0.5,0.507,OrganismalFitness,BRCA2_HUMAN,,Human
+0.697,0.687,0.676,0.68,0.719,0.72,0.485,0.731,0.686,0.689,0.526,0.717,0.747,0.49,0.488,0.495,0.75,0.705,0.716,0.674,0.701,0.685,0.693,0.678,0.621,0.734,0.715,0.736,0.69,0.742,0.731,0.681,0.559,0.683,0.691,0.701,0.71,0.713,0.719,0.724,0.724,0.729,0.487,0.481,0.644,0.498,0.761,0.776,0.777,0.619,0.756,0.756,0.767,0.769,0.774,0.763,0.772,0.768,0.768,0.773,0.657,0.593,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+0.594,0.631,0.63,0.627,0.63,0.631,0.588,0.61,0.635,0.644,0.646,0.629,0.65,0.58,0.602,0.602,0.615,0.623,0.627,0.63,0.599,0.629,0.636,0.644,0.631,0.662,0.66,0.676,0.684,0.641,0.619,0.58,0.553,0.621,0.648,0.674,0.621,0.639,0.653,0.634,0.638,0.639,0.619,0.58,0.658,0.643,0.552,0.58,0.592,0.548,0.598,0.598,0.599,0.593,0.6,0.594,0.596,0.602,0.593,0.6,0.659,0.639,OrganismalFitness,CALM1_HUMAN,High,Human
+0.698,0.67,0.665,0.688,0.67,0.662,0.703,0.724,0.66,0.679,0.588,0.609,0.61,0.643,0.659,0.61,0.649,0.594,0.553,0.63,0.606,0.641,0.648,0.65,0.614,0.61,0.644,0.614,0.717,0.719,0.586,0.582,0.56,0.612,0.644,0.757,0.686,0.689,0.74,0.668,0.67,0.716,0.558,0.594,0.644,0.564,0.713,0.703,0.673,0.666,0.614,0.61,0.603,0.615,0.604,0.605,0.607,0.602,0.613,0.609,0.668,0.629,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+0.652,0.588,0.576,0.573,0.586,0.594,0.566,0.456,0.586,0.59,0.642,0.68,0.657,0.534,0.554,0.696,0.676,0.682,0.692,0.574,0.568,0.586,0.594,0.592,0.485,0.582,0.594,0.601,0.558,0.579,0.631,0.618,0.646,0.507,0.573,0.534,0.598,0.599,0.567,0.599,0.596,0.57,0.548,0.514,0.632,0.642,0.657,0.641,0.667,0.565,0.693,0.689,0.691,0.68,0.678,0.686,0.677,0.686,0.692,0.69,0.728,0.68,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.652,0.588,0.576,0.573,0.586,0.594,0.566,0.456,0.586,0.59,0.642,0.68,0.657,0.534,0.554,0.696,0.676,0.682,0.692,0.574,0.568,0.586,0.594,0.592,0.485,0.582,0.594,0.601,0.558,0.579,0.631,0.618,0.646,0.507,0.573,0.534,0.598,0.599,0.567,0.599,0.596,0.57,0.548,0.514,0.632,0.642,0.657,0.641,0.667,0.565,0.693,0.689,0.691,0.68,0.678,0.686,0.677,0.686,0.692,0.69,0.728,0.68,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.58,0.594,0.587,0.589,0.59,0.593,0.516,0.549,0.593,0.595,0.588,0.532,0.533,0.528,0.544,0.578,0.594,0.597,0.6,0.504,0.519,0.537,0.585,0.56,0.517,0.599,0.592,0.589,0.534,0.596,0.603,0.593,0.513,0.514,0.521,0.582,0.584,0.584,0.593,0.595,0.595,0.598,0.528,0.512,0.592,0.551,0.551,0.592,0.513,0.517,0.59,0.591,0.591,0.592,0.591,0.592,0.59,0.59,0.591,0.593,0.587,0.556,Activity,CAS9_STRP1,Medium,Prokaryote
+0.694,0.776,0.814,0.818,0.822,0.825,0.532,0.735,0.825,0.833,0.799,0.805,0.818,0.637,0.792,0.828,0.824,0.783,0.751,0.813,0.61,0.748,0.757,0.779,0.756,0.781,0.785,0.757,0.784,0.809,0.804,0.773,0.621,0.537,0.739,0.765,0.768,0.782,0.797,0.821,0.816,0.824,0.709,0.492,0.813,0.784,0.707,0.78,0.751,0.638,0.781,0.787,0.791,0.787,0.791,0.785,0.797,0.785,0.787,0.797,0.833,0.762,Activity,CASP3_HUMAN,High,Human
+0.713,0.782,0.835,0.835,0.831,0.833,0.537,0.767,0.828,0.842,0.815,0.835,0.845,0.669,0.826,0.838,0.84,0.816,0.803,0.832,0.613,0.799,0.796,0.811,0.798,0.819,0.822,0.816,0.81,0.85,0.83,0.8,0.682,0.552,0.804,0.787,0.782,0.831,0.822,0.825,0.847,0.842,0.76,0.509,0.848,0.825,0.763,0.835,0.817,0.674,0.823,0.816,0.81,0.827,0.836,0.833,0.827,0.817,0.828,0.835,0.847,0.793,Activity,CASP7_HUMAN,Medium,Human
+0.796,0.813,0.85,0.853,0.86,0.863,0.87,0.818,0.81,0.791,0.847,0.852,0.864,0.798,0.869,0.87,0.866,0.837,0.862,0.521,0.844,0.835,0.841,0.836,0.857,0.852,0.835,0.817,0.829,0.866,0.846,0.848,0.734,0.851,0.836,0.853,0.862,0.853,0.869,0.856,0.854,0.864,0.748,0.732,0.691,0.758,0.76,0.744,0.873,0.799,0.868,0.866,0.878,0.872,0.883,0.868,0.869,0.862,0.865,0.874,0.851,0.874,Stability,CATR_CHLRE,High,Eukaryote
+0.854,0.892,0.904,0.906,0.891,0.899,0.748,0.832,0.918,0.921,0.911,0.893,0.888,0.785,0.86,0.898,0.897,0.905,0.898,0.912,0.769,0.778,0.866,0.867,0.79,0.852,0.851,0.852,0.884,0.902,0.899,0.868,0.516,0.767,0.834,0.869,0.877,0.881,0.895,0.903,0.9,0.908,0.772,0.663,0.828,0.843,0.905,0.893,0.959,0.929,0.929,0.928,0.93,0.927,0.933,0.931,0.93,0.928,0.926,0.932,0.902,0.951,Stability,CBPA2_HUMAN,Medium,Human
+0.687,0.701,0.707,0.717,0.713,0.717,0.613,0.644,0.71,0.712,0.696,0.696,0.71,0.545,0.627,0.689,0.692,0.684,0.69,0.696,0.699,0.646,0.659,0.662,0.696,0.652,0.662,0.652,0.674,0.714,0.711,0.698,0.611,0.702,0.656,0.648,0.715,0.685,0.683,0.724,0.705,0.706,0.673,0.534,0.714,0.711,0.633,0.655,0.676,0.547,0.683,0.684,0.688,0.691,0.691,0.69,0.687,0.688,0.689,0.693,0.72,0.664,OrganismalFitness,CBS_HUMAN,Medium,Human
+0.752,0.773,0.844,0.865,0.811,0.847,0.339,0.819,0.812,0.808,0.812,0.829,0.828,0.344,0.878,0.855,0.844,0.801,0.802,0.828,0.753,0.803,0.797,0.797,0.783,0.793,0.799,0.767,0.793,0.815,0.845,0.84,0.719,0.714,0.75,0.765,0.802,0.799,0.81,0.844,0.856,0.859,0.816,0.317,0.682,0.786,0.54,0.561,0.796,0.783,0.853,0.809,0.85,0.838,0.861,0.839,0.839,0.871,0.835,0.852,0.826,0.888,Stability,CBX4_HUMAN,High,Human
+0.757,0.805,0.814,0.832,0.817,0.826,0.483,0.664,0.746,0.761,0.798,0.754,0.796,0.491,0.49,0.774,0.813,0.808,0.693,0.813,0.521,0.492,0.439,0.597,0.41,0.552,0.483,0.526,0.777,0.802,0.854,0.838,0.568,0.497,0.552,0.73,0.794,0.787,0.815,0.816,0.812,0.83,0.502,0.465,0.798,0.506,0.688,0.774,0.705,0.657,0.795,0.775,0.778,0.794,0.794,0.793,0.79,0.802,0.818,0.801,0.798,0.672,Activity,CCDB_ECOLI,High,Prokaryote
+0.757,0.805,0.814,0.832,0.817,0.826,0.483,0.664,0.746,0.761,0.798,0.754,0.796,0.491,0.49,0.774,0.813,0.808,0.693,0.813,0.521,0.492,0.439,0.597,0.41,0.552,0.483,0.526,0.777,0.802,0.854,0.838,0.568,0.497,0.552,0.73,0.794,0.787,0.815,0.816,0.812,0.83,0.502,0.465,0.798,0.506,0.688,0.774,0.705,0.657,0.795,0.775,0.778,0.794,0.794,0.793,0.79,0.802,0.818,0.801,0.798,0.672,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+0.634,0.637,0.638,0.639,0.634,0.638,0.649,0.644,0.662,0.668,0.675,0.664,0.67,0.651,0.67,0.674,0.668,0.662,0.664,0.67,0.675,0.673,0.657,0.659,0.675,0.67,0.665,0.673,0.66,0.674,0.644,0.619,0.557,0.675,0.676,0.678,0.678,0.682,0.685,0.658,0.658,0.659,0.668,0.59,0.676,0.67,0.633,0.647,0.638,0.585,0.65,0.647,0.652,0.652,0.649,0.652,0.657,0.657,0.657,0.657,0.668,0.671,Binding,CCR5_HUMAN,High,Human
+0.623,0.617,0.612,0.621,0.633,0.632,0.574,0.58,0.597,0.599,0.591,0.598,0.613,0.603,0.62,0.6,0.599,0.607,0.589,0.492,0.597,0.6,0.628,0.617,0.593,0.648,0.616,0.607,0.63,0.647,0.599,0.574,0.581,0.606,0.631,0.609,0.63,0.636,0.628,0.642,0.644,0.639,0.61,0.568,0.594,0.613,0.728,0.683,0.72,0.608,0.644,0.648,0.643,0.662,0.654,0.653,0.658,0.647,0.653,0.659,0.755,0.71,Binding,CD19_HUMAN,Low,Human
+0.753,0.788,0.802,0.812,0.798,0.808,0.778,0.794,0.8,0.812,0.755,0.807,0.822,0.776,0.814,0.827,0.828,0.825,0.802,0.814,0.787,0.79,0.804,0.783,0.799,0.794,0.796,0.794,0.789,0.809,0.796,0.768,0.61,0.806,0.792,0.786,0.82,0.818,0.816,0.823,0.82,0.818,0.802,0.546,0.772,0.819,0.792,0.783,0.822,0.619,0.812,0.812,0.818,0.826,0.827,0.826,0.824,0.818,0.821,0.83,0.828,0.822,Expression,CP2C9_HUMAN,High,Human
+0.753,0.788,0.802,0.812,0.798,0.808,0.778,0.794,0.8,0.812,0.755,0.807,0.822,0.776,0.814,0.827,0.828,0.825,0.802,0.814,0.787,0.79,0.804,0.783,0.799,0.794,0.796,0.794,0.789,0.809,0.796,0.768,0.61,0.806,0.792,0.786,0.82,0.818,0.816,0.823,0.82,0.818,0.802,0.546,0.772,0.819,0.792,0.783,0.822,0.619,0.812,0.812,0.818,0.826,0.827,0.826,0.824,0.818,0.821,0.83,0.828,0.822,Binding,CP2C9_HUMAN,High,Human
+0.695,0.831,0.803,0.81,0.797,0.805,0.653,0.791,0.817,0.821,0.842,0.859,0.879,0.728,0.886,0.828,0.781,0.821,0.82,0.79,0.756,0.775,0.836,0.845,0.741,0.857,0.849,0.834,0.835,0.833,0.818,0.811,0.67,0.704,0.784,0.83,0.779,0.798,0.831,0.815,0.828,0.84,0.674,0.621,0.754,0.767,0.785,0.776,0.839,0.86,0.846,0.846,0.842,0.858,0.86,0.861,0.853,0.868,0.842,0.857,0.848,0.873,Stability,CSN4_MOUSE,Medium,Eukaryote
+0.666,0.715,0.699,0.7,0.725,0.709,0.551,0.702,0.755,0.764,0.736,0.694,0.672,0.563,0.695,0.717,0.763,0.74,0.73,0.652,0.564,0.592,0.597,0.522,0.538,0.588,0.601,0.6,0.69,0.702,0.733,0.707,0.62,0.562,0.556,0.587,0.678,0.682,0.684,0.704,0.705,0.709,0.562,0.51,0.713,0.603,0.748,0.743,0.813,0.728,0.769,0.749,0.746,0.772,0.753,0.771,0.765,0.759,0.755,0.765,0.79,0.771,Stability,CUE1_YEAST,Medium,Eukaryote
+0.781,0.83,0.828,0.81,0.835,0.835,0.531,0.75,0.849,0.851,0.732,0.53,0.529,0.53,0.522,0.512,0.52,0.531,0.575,0.735,0.514,0.519,0.554,0.541,0.5,0.521,0.535,0.601,0.69,0.831,0.805,0.813,0.514,0.549,0.563,0.58,0.783,0.783,0.781,0.834,0.833,0.828,0.496,0.506,0.498,0.508,0.636,0.652,0.76,0.706,0.759,0.756,0.758,0.764,0.756,0.761,0.758,0.756,0.755,0.76,0.668,0.6,Activity,D7PM05_CLYGR,Low,Eukaryote
+0.877,0.822,0.834,0.816,0.834,0.838,0.904,0.853,0.791,0.795,0.781,0.801,0.834,0.909,0.921,0.905,0.822,0.764,0.741,0.854,0.812,0.818,0.809,0.791,0.834,0.832,0.809,0.804,0.768,0.839,0.855,0.848,0.777,0.816,0.852,0.817,0.861,0.888,0.867,0.854,0.865,0.849,0.827,0.613,0.708,0.793,0.839,0.741,0.868,0.668,0.768,0.7,0.798,0.774,0.786,0.77,0.782,0.77,0.779,0.778,0.783,0.913,OrganismalFitness,DLG4_HUMAN,Low,Human
+0.83,0.771,0.789,0.801,0.84,0.848,0.818,0.794,0.824,0.847,0.806,0.869,0.88,0.796,0.883,0.884,0.857,0.823,0.796,0.8,0.764,0.769,0.767,0.752,0.789,0.795,0.781,0.755,0.756,0.834,0.848,0.833,0.685,0.759,0.763,0.724,0.821,0.816,0.804,0.855,0.853,0.854,0.847,0.552,0.777,0.834,0.791,0.709,0.814,0.593,0.791,0.76,0.801,0.771,0.804,0.79,0.775,0.797,0.803,0.81,0.82,0.865,Binding,DLG4_RAT,Low,Eukaryote
+0.574,0.58,0.601,0.61,0.601,0.625,0.424,0.668,0.66,0.672,0.496,0.539,0.531,0.502,0.567,0.581,0.63,0.647,0.681,0.604,0.44,0.5,0.493,0.508,0.498,0.533,0.5,0.511,0.59,0.647,0.722,0.699,0.522,0.469,0.467,0.486,0.593,0.589,0.593,0.614,0.606,0.608,0.522,0.451,0.6,0.53,0.787,0.782,0.837,0.783,0.683,0.675,0.738,0.73,0.72,0.718,0.709,0.689,0.682,0.716,0.759,0.723,Stability,DN7A_SACS2,Medium,Prokaryote
+0.928,0.937,0.937,0.937,0.944,0.943,0.887,0.916,0.938,0.945,0.946,0.937,0.949,0.95,0.955,0.955,0.946,0.935,0.953,0.937,0.862,0.906,0.924,0.914,0.929,0.922,0.93,0.896,0.929,0.938,0.92,0.909,0.864,0.928,0.929,0.943,0.954,0.955,0.96,0.948,0.95,0.953,0.846,0.582,0.711,0.85,0.822,0.766,0.928,0.933,0.932,0.936,0.934,0.948,0.945,0.95,0.948,0.944,0.949,0.946,0.926,0.946,Stability,DNJA1_HUMAN,High,Human
+0.714,0.738,0.718,0.729,0.757,0.761,0.521,0.72,0.686,0.687,0.749,0.727,0.741,0.607,0.719,0.736,0.753,0.744,0.751,0.764,0.654,0.696,0.678,0.685,0.645,0.677,0.659,0.646,0.673,0.721,0.719,0.72,0.591,0.597,0.652,0.612,0.735,0.743,0.739,0.751,0.752,0.758,0.666,0.4,0.741,0.711,0.668,0.672,0.703,0.707,0.743,0.734,0.742,0.746,0.738,0.748,0.729,0.74,0.747,0.742,0.767,0.752,Stability,DOCK1_MOUSE,High,Eukaryote
+0.751,0.822,0.826,0.83,0.832,0.834,0.477,0.777,0.824,0.83,0.828,0.822,0.83,0.548,0.789,0.824,0.838,0.846,0.842,0.832,0.692,0.776,0.7,0.74,0.781,0.794,0.816,0.816,0.792,0.833,0.84,0.826,0.614,0.796,0.735,0.729,0.819,0.784,0.793,0.838,0.826,0.83,0.778,0.49,0.824,0.803,0.643,0.76,0.714,0.565,0.802,0.778,0.785,0.788,0.796,0.794,0.804,0.801,0.815,0.806,0.826,0.765,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.751,0.822,0.826,0.83,0.832,0.834,0.477,0.777,0.824,0.83,0.828,0.822,0.83,0.548,0.789,0.824,0.838,0.846,0.842,0.832,0.692,0.776,0.7,0.74,0.781,0.794,0.816,0.816,0.792,0.833,0.84,0.826,0.614,0.796,0.735,0.729,0.819,0.784,0.793,0.838,0.826,0.83,0.778,0.49,0.824,0.803,0.643,0.76,0.714,0.565,0.802,0.778,0.785,0.788,0.796,0.794,0.804,0.801,0.815,0.806,0.826,0.765,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.751,0.822,0.826,0.83,0.832,0.834,0.477,0.777,0.824,0.83,0.828,0.822,0.83,0.548,0.789,0.824,0.838,0.846,0.842,0.832,0.692,0.776,0.7,0.74,0.781,0.794,0.816,0.816,0.792,0.833,0.84,0.826,0.614,0.796,0.735,0.729,0.819,0.784,0.793,0.838,0.826,0.83,0.778,0.49,0.824,0.803,0.643,0.76,0.714,0.565,0.802,0.778,0.785,0.788,0.796,0.794,0.804,0.801,0.815,0.806,0.826,0.765,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.751,0.822,0.826,0.83,0.832,0.834,0.477,0.777,0.824,0.83,0.828,0.822,0.83,0.548,0.789,0.824,0.838,0.846,0.842,0.832,0.692,0.776,0.7,0.74,0.781,0.794,0.816,0.816,0.792,0.833,0.84,0.826,0.614,0.796,0.735,0.729,0.819,0.784,0.793,0.838,0.826,0.83,0.778,0.49,0.824,0.803,0.643,0.76,0.714,0.565,0.802,0.778,0.785,0.788,0.796,0.794,0.804,0.801,0.815,0.806,0.826,0.765,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.571,0.569,0.61,0.61,0.619,0.618,0.503,0.616,0.608,0.614,0.606,0.625,0.636,0.607,0.615,0.618,0.613,0.6,0.584,0.614,0.605,0.601,0.632,0.625,0.606,0.611,0.626,0.621,0.615,0.618,0.598,0.595,0.559,0.602,0.612,0.621,0.618,0.623,0.63,0.621,0.625,0.625,0.611,0.566,0.612,0.61,0.528,0.596,0.582,0.525,0.623,0.62,0.604,0.617,0.639,0.625,0.615,0.625,0.627,0.627,0.594,0.637,Activity,ENVZ_ECOLI,High,Prokaryote
+0.747,0.744,0.688,0.742,0.765,0.761,0.503,0.704,0.732,0.751,0.722,0.791,0.777,0.533,0.515,0.496,0.47,0.507,0.539,0.751,0.753,0.74,0.768,0.771,0.707,0.783,0.769,0.746,0.747,0.767,0.768,0.757,0.674,0.767,0.767,0.771,0.774,0.776,0.772,0.772,0.773,0.774,0.719,0.451,0.748,0.759,0.736,0.713,0.731,0.651,0.643,0.695,0.704,0.676,0.655,0.653,0.667,0.659,0.644,0.682,0.607,0.587,OrganismalFitness,ENV_HV1B9,Medium,Virus
+0.669,0.652,0.66,0.661,0.671,0.673,0.499,0.659,0.674,0.673,0.649,0.659,0.669,0.495,0.495,0.497,0.519,0.529,0.577,0.667,0.676,0.68,0.687,0.682,0.673,0.68,0.68,0.676,0.682,0.677,0.665,0.649,0.599,0.673,0.682,0.681,0.681,0.685,0.684,0.685,0.687,0.684,0.615,0.49,0.66,0.649,0.597,0.616,0.553,0.534,0.585,0.581,0.594,0.607,0.587,0.594,0.595,0.59,0.589,0.598,0.577,0.541,OrganismalFitness,ENV_HV1BR,Medium,Virus
+0.839,0.898,0.899,0.902,0.914,0.915,0.309,0.906,0.948,0.952,0.944,0.946,0.953,0.286,0.941,0.954,0.963,0.945,0.94,0.923,0.831,0.86,0.899,0.889,0.88,0.898,0.905,0.913,0.92,0.954,0.94,0.936,0.886,0.86,0.859,0.879,0.908,0.908,0.916,0.919,0.918,0.917,0.85,0.341,0.857,0.861,0.83,0.828,0.943,0.909,0.957,0.954,0.956,0.955,0.961,0.958,0.958,0.958,0.955,0.96,0.949,0.944,Stability,EPHB2_HUMAN,High,Human
+0.697,0.691,0.641,0.642,0.634,0.645,0.717,0.694,0.702,0.715,0.719,0.73,0.738,0.737,0.732,0.735,0.706,0.692,0.652,0.525,0.721,0.729,0.752,0.725,0.74,0.747,0.778,0.782,0.734,0.692,0.716,0.543,0.732,0.768,0.728,0.736,0.747,0.731,0.737,0.728,0.723,0.72,0.739,0.735,0.764,0.734,0.718,0.73,0.407,0.527,0.698,0.693,0.72,0.702,0.699,0.699,0.709,0.717,0.721,0.714,0.756,0.741,Expression,ERBB2_HUMAN,Low,Human
+0.637,0.694,0.702,0.711,0.696,0.694,0.577,0.665,0.669,0.707,0.659,0.645,0.662,0.576,0.621,0.629,0.645,0.641,0.664,0.654,0.548,0.591,0.634,0.648,0.628,0.62,0.653,0.634,0.696,0.69,0.724,0.711,0.534,0.579,0.63,0.641,0.655,0.665,0.673,0.7,0.703,0.703,0.588,0.552,0.637,0.623,0.783,0.743,0.774,0.681,0.677,0.664,0.673,0.666,0.682,0.67,0.666,0.677,0.675,0.679,0.691,0.594,Stability,ESTA_BACSU,High,Prokaryote
+0.537,0.668,0.668,0.689,0.683,0.684,0.517,0.633,0.668,0.672,0.689,0.669,0.674,0.475,0.513,0.531,0.671,0.665,0.687,0.67,0.479,0.457,0.458,0.503,0.555,0.559,0.501,0.633,0.677,0.659,0.698,0.687,0.518,0.498,0.46,0.685,0.537,0.515,0.684,0.664,0.658,0.691,0.488,0.479,0.599,0.492,0.498,0.616,0.528,0.519,0.687,0.692,0.68,0.7,0.696,0.697,0.686,0.692,0.695,0.694,0.626,0.458,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.537,0.668,0.668,0.689,0.683,0.684,0.517,0.633,0.668,0.672,0.689,0.669,0.674,0.475,0.513,0.531,0.671,0.665,0.687,0.67,0.479,0.457,0.458,0.503,0.555,0.559,0.501,0.633,0.677,0.659,0.698,0.687,0.518,0.498,0.46,0.685,0.537,0.515,0.684,0.664,0.658,0.691,0.488,0.479,0.599,0.492,0.498,0.616,0.528,0.519,0.687,0.692,0.68,0.7,0.696,0.697,0.686,0.692,0.695,0.694,0.626,0.458,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.705,0.727,0.746,0.751,0.746,0.742,0.51,0.677,0.701,0.745,0.775,0.637,0.727,0.51,0.735,0.784,0.78,0.764,0.711,0.746,0.577,0.648,0.697,0.663,0.581,0.754,0.743,0.641,0.717,0.779,0.719,0.684,0.657,0.526,0.562,0.577,0.692,0.687,0.665,0.729,0.727,0.716,0.701,0.5,0.756,0.729,0.801,0.783,0.816,0.781,0.783,0.761,0.776,0.775,0.769,0.787,0.768,0.763,0.77,0.777,0.814,0.811,Stability,FECA_ECOLI,High,Prokaryote
+0.727,0.707,0.759,0.76,0.763,0.771,0.589,0.675,0.63,0.629,0.602,0.594,0.591,0.593,0.592,0.583,0.6,0.64,0.694,0.661,0.622,0.622,0.627,0.632,0.585,0.622,0.599,0.625,0.628,0.752,0.673,0.676,0.464,0.566,0.602,0.624,0.718,0.721,0.688,0.765,0.767,0.73,0.599,0.602,0.595,0.613,0.856,0.824,0.855,0.819,0.66,0.716,0.738,0.711,0.71,0.716,0.693,0.697,0.65,0.712,0.798,0.684,Stability,FKBP3_HUMAN,Medium,Human
+0.646,0.711,0.712,0.748,0.733,0.749,0.677,0.496,0.745,0.771,0.782,0.74,0.743,0.685,0.712,0.759,0.795,0.793,0.788,0.749,0.672,0.691,0.695,0.695,0.691,0.716,0.741,0.727,0.771,0.792,0.792,0.763,0.636,0.655,0.669,0.67,0.757,0.752,0.754,0.742,0.739,0.737,0.708,0.653,0.777,0.731,0.649,0.763,0.67,0.568,0.773,0.769,0.767,0.767,0.779,0.778,0.77,0.768,0.767,0.777,0.767,0.719,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+0.632,0.633,0.629,0.63,0.628,0.628,0.584,0.611,0.631,0.631,0.627,0.644,0.647,0.662,0.644,0.638,0.649,0.638,0.631,0.599,0.574,0.581,0.572,0.572,0.563,0.551,0.532,0.559,0.577,0.622,0.637,0.634,0.519,0.573,0.571,0.641,0.635,0.635,0.647,0.635,0.635,0.645,0.549,0.589,0.572,0.556,0.568,0.591,0.612,0.587,0.624,0.62,0.62,0.617,0.623,0.622,0.621,0.621,0.623,0.622,0.624,0.627,Binding,GCN4_YEAST,Low,Eukaryote
+0.719,0.72,0.726,0.724,0.727,0.727,0.591,0.696,0.739,0.736,0.694,0.706,0.726,0.582,0.618,0.716,0.698,0.684,0.704,0.701,0.626,0.688,0.686,0.701,0.659,0.718,0.708,0.685,0.696,0.704,0.712,0.692,0.627,0.657,0.69,0.671,0.715,0.719,0.713,0.726,0.733,0.729,0.616,0.56,0.717,0.62,0.671,0.709,0.682,0.544,0.695,0.669,0.697,0.684,0.691,0.7,0.688,0.696,0.69,0.701,0.729,0.659,OrganismalFitness,GDIA_HUMAN,Low,Human
+0.889,0.884,0.9,0.901,0.903,0.903,0.54,0.883,0.894,0.891,0.816,0.565,0.565,0.552,0.583,0.564,0.572,0.596,0.679,0.858,0.555,0.572,0.614,0.565,0.532,0.615,0.684,0.884,0.886,0.904,0.865,0.87,0.541,0.545,0.616,0.878,0.889,0.89,0.902,0.906,0.907,0.919,0.515,0.495,0.525,0.525,0.796,0.792,0.925,0.86,0.856,0.86,0.866,0.871,0.867,0.866,0.864,0.865,0.854,0.865,0.872,0.769,Activity,GFP_AEQVI,Low,Eukaryote
+0.602,0.546,0.603,0.601,0.594,0.583,0.711,0.799,0.682,0.672,0.741,0.756,0.754,0.728,0.746,0.77,0.751,0.727,0.757,0.583,0.743,0.755,0.737,0.761,0.739,0.752,0.761,0.765,0.798,0.672,0.746,0.638,0.638,0.761,0.737,0.761,0.75,0.736,0.748,0.732,0.717,0.74,0.758,0.749,0.762,0.727,0.722,0.767,0.706,0.617,0.79,0.779,0.782,0.765,0.776,0.747,0.776,0.748,0.77,0.781,0.775,0.766,Expression,GLPA_HUMAN,Low,Human
+0.727,0.787,0.784,0.801,0.799,0.802,0.777,0.748,0.796,0.768,0.792,0.747,0.781,0.797,0.832,0.847,0.858,0.792,0.816,0.799,0.807,0.79,0.786,0.752,0.801,0.785,0.753,0.793,0.751,0.78,0.741,0.741,0.757,0.801,0.786,0.727,0.79,0.784,0.745,0.812,0.812,0.796,0.826,0.673,0.789,0.816,0.879,0.79,0.89,0.785,0.845,0.86,0.855,0.856,0.86,0.849,0.849,0.859,0.848,0.861,0.806,0.837,OrganismalFitness,GRB2_HUMAN,Medium,Human
+0.652,0.715,0.572,0.586,0.679,0.682,0.629,0.707,0.726,0.749,0.829,0.724,0.747,0.644,0.704,0.783,0.835,0.79,0.801,0.62,0.644,0.652,0.67,0.677,0.66,0.669,0.605,0.666,0.765,0.802,0.824,0.758,0.579,0.633,0.643,0.753,0.711,0.716,0.773,0.687,0.714,0.765,0.653,0.643,0.81,0.686,0.829,0.861,0.864,0.786,0.84,0.846,0.846,0.843,0.851,0.853,0.844,0.854,0.853,0.857,0.878,0.799,Stability,HCP_LAMBD,Medium,Virus
+0.846,0.873,0.829,0.844,0.885,0.887,0.667,0.823,0.858,0.859,0.83,0.696,0.72,0.642,0.656,0.917,0.884,0.86,0.846,0.869,0.674,0.815,0.866,0.859,0.792,0.863,0.856,0.841,0.848,0.887,0.873,0.871,0.62,0.675,0.697,0.743,0.907,0.886,0.91,0.895,0.895,0.901,0.596,0.581,0.902,0.668,0.755,0.869,0.857,0.772,0.868,0.88,0.899,0.889,0.893,0.914,0.887,0.897,0.888,0.896,0.892,0.749,Stability,HECD1_HUMAN,Medium,Human
+0.712,0.707,0.701,0.702,0.71,0.709,0.56,0.566,0.716,0.722,0.694,0.688,0.696,0.569,0.69,0.702,0.694,0.713,0.72,0.562,0.669,0.688,0.692,0.7,0.676,0.694,0.691,0.693,0.709,0.72,0.715,0.711,0.547,0.689,0.694,0.703,0.713,0.719,0.731,0.715,0.719,0.726,0.66,0.573,0.69,0.685,0.666,0.684,0.659,0.58,0.678,0.669,0.676,0.681,0.688,0.686,0.686,0.686,0.683,0.688,0.716,0.709,Activity,HEM3_HUMAN,Medium,Human
+0.768,0.809,0.825,0.822,0.801,0.8,0.589,0.645,0.762,0.785,0.761,0.734,0.773,0.534,0.568,0.749,0.728,0.763,0.769,0.755,0.686,0.734,0.75,0.777,0.724,0.779,0.771,0.802,0.81,0.804,0.724,0.714,0.583,0.734,0.758,0.849,0.783,0.783,0.865,0.829,0.812,0.835,0.587,0.524,0.666,0.507,0.729,0.75,0.82,0.722,0.746,0.685,0.718,0.737,0.744,0.723,0.726,0.748,0.738,0.732,0.795,0.632,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+0.741,0.7,0.683,0.688,0.72,0.718,0.575,0.627,0.625,0.621,0.626,0.664,0.649,0.514,0.498,0.654,0.755,0.682,0.683,0.64,0.727,0.566,0.572,0.595,0.719,0.588,0.572,0.594,0.592,0.735,0.699,0.694,0.61,0.655,0.617,0.6,0.719,0.664,0.657,0.714,0.672,0.67,0.581,0.526,0.735,0.75,0.66,0.721,0.397,0.555,0.745,0.745,0.741,0.747,0.743,0.744,0.736,0.749,0.74,0.748,0.765,0.716,OrganismalFitness,HMDH_HUMAN,Low,Human
+0.765,0.754,0.789,0.794,0.79,0.79,0.673,0.748,0.785,0.784,0.769,0.794,0.808,0.562,0.633,0.691,0.735,0.762,0.759,0.712,0.77,0.772,0.771,0.782,0.772,0.777,0.792,0.771,0.791,0.797,0.813,0.798,0.658,0.762,0.762,0.766,0.784,0.784,0.786,0.797,0.797,0.798,0.651,0.464,0.769,0.766,0.591,0.703,0.587,0.521,0.727,0.723,0.719,0.734,0.73,0.728,0.735,0.733,0.724,0.734,0.768,0.744,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.765,0.754,0.789,0.794,0.79,0.79,0.673,0.748,0.785,0.784,0.769,0.794,0.808,0.562,0.633,0.691,0.735,0.762,0.759,0.712,0.77,0.772,0.771,0.782,0.772,0.777,0.792,0.771,0.791,0.797,0.813,0.798,0.658,0.762,0.762,0.766,0.784,0.784,0.786,0.797,0.797,0.798,0.651,0.464,0.769,0.766,0.591,0.703,0.587,0.521,0.727,0.723,0.719,0.734,0.73,0.728,0.735,0.733,0.724,0.734,0.768,0.744,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.765,0.754,0.789,0.794,0.79,0.79,0.673,0.748,0.785,0.784,0.769,0.794,0.808,0.562,0.633,0.691,0.735,0.762,0.759,0.712,0.77,0.772,0.771,0.782,0.772,0.777,0.792,0.771,0.791,0.797,0.813,0.798,0.658,0.762,0.762,0.766,0.784,0.784,0.786,0.797,0.797,0.798,0.651,0.464,0.769,0.766,0.591,0.703,0.587,0.521,0.727,0.723,0.719,0.734,0.73,0.728,0.735,0.733,0.724,0.734,0.768,0.744,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.728,0.75,0.728,0.74,0.734,0.738,0.576,0.704,0.756,0.767,0.726,0.728,0.742,0.585,0.63,0.734,0.75,0.753,0.736,0.631,0.708,0.71,0.714,0.709,0.708,0.718,0.717,0.716,0.706,0.75,0.726,0.702,0.598,0.718,0.704,0.694,0.738,0.734,0.728,0.744,0.746,0.745,0.633,0.528,0.744,0.73,0.709,0.733,0.718,0.591,0.745,0.748,0.742,0.746,0.748,0.75,0.748,0.748,0.748,0.752,0.768,0.726,OrganismalFitness,HXK4_HUMAN,Medium,Human
+0.728,0.75,0.728,0.74,0.734,0.738,0.576,0.704,0.756,0.767,0.726,0.728,0.742,0.585,0.63,0.734,0.75,0.753,0.736,0.631,0.708,0.71,0.714,0.709,0.708,0.718,0.717,0.716,0.706,0.75,0.726,0.702,0.598,0.718,0.704,0.694,0.738,0.734,0.728,0.744,0.746,0.745,0.633,0.528,0.744,0.73,0.709,0.733,0.718,0.591,0.745,0.748,0.742,0.746,0.748,0.75,0.748,0.748,0.748,0.752,0.768,0.726,Expression,HXK4_HUMAN,Medium,Human
+0.677,0.664,0.639,0.638,0.685,0.683,0.495,0.659,0.645,0.663,0.503,0.507,0.506,0.509,0.51,0.501,0.507,0.503,0.543,0.608,0.663,0.669,0.691,0.691,0.504,0.508,0.554,0.497,0.655,0.695,0.627,0.63,0.572,0.652,0.666,0.673,0.665,0.676,0.677,0.693,0.701,0.704,0.502,0.511,0.506,0.509,0.608,0.604,0.623,0.549,0.533,0.552,0.564,0.563,0.554,0.551,0.542,0.545,0.527,0.551,0.521,0.504,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+0.695,0.775,0.823,0.823,0.816,0.82,0.578,0.712,0.649,0.641,0.803,0.815,0.826,0.6,0.756,0.817,0.832,0.818,0.8,0.793,0.673,0.729,0.695,0.714,0.746,0.748,0.751,0.735,0.745,0.722,0.784,0.754,0.643,0.756,0.774,0.787,0.773,0.78,0.785,0.812,0.816,0.82,0.693,0.563,0.821,0.782,0.692,0.797,0.781,0.645,0.798,0.79,0.808,0.814,0.804,0.809,0.789,0.813,0.813,0.819,0.852,0.783,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+0.718,0.758,0.827,0.821,0.811,0.814,0.604,0.741,0.797,0.813,0.747,0.757,0.772,0.718,0.775,0.838,0.722,0.71,0.649,0.837,0.621,0.646,0.685,0.726,0.735,0.672,0.751,0.696,0.697,0.82,0.773,0.758,0.529,0.614,0.67,0.738,0.747,0.757,0.787,0.792,0.783,0.821,0.755,0.62,0.745,0.813,0.834,0.79,0.851,0.752,0.693,0.67,0.727,0.705,0.726,0.758,0.733,0.708,0.779,0.741,0.737,0.783,Stability,ILF3_HUMAN,High,Human
+0.54,0.548,0.599,0.594,0.612,0.612,0.662,0.571,0.68,0.679,0.738,0.71,0.719,0.687,0.695,0.724,0.718,0.74,0.724,0.589,0.653,0.601,0.614,0.629,0.649,0.66,0.655,0.642,0.685,0.669,0.718,0.691,0.622,0.636,0.62,0.636,0.625,0.605,0.616,0.629,0.606,0.616,0.639,0.611,0.665,0.652,0.788,0.743,0.773,0.752,0.725,0.717,0.728,0.724,0.726,0.728,0.715,0.72,0.724,0.726,0.791,0.749,Stability,ISDH_STAAW,High,Prokaryote
+0.667,0.696,0.68,0.689,0.686,0.688,0.635,0.66,0.693,0.694,0.679,0.704,0.726,0.626,0.609,0.74,0.706,0.678,0.671,0.628,0.65,0.666,0.672,0.688,0.66,0.72,0.732,0.714,0.717,0.694,0.714,0.676,0.444,0.615,0.694,0.738,0.692,0.707,0.741,0.706,0.716,0.744,0.632,0.619,0.716,0.635,0.615,0.681,0.613,0.546,0.702,0.696,0.698,0.687,0.69,0.684,0.705,0.702,0.704,0.703,0.74,0.641,Expression,KCNE1_HUMAN,Medium,Human
+0.667,0.696,0.68,0.689,0.686,0.688,0.635,0.66,0.693,0.694,0.679,0.704,0.726,0.626,0.609,0.74,0.706,0.678,0.671,0.628,0.65,0.666,0.672,0.688,0.66,0.72,0.732,0.714,0.717,0.694,0.714,0.676,0.444,0.615,0.694,0.738,0.692,0.707,0.741,0.706,0.716,0.744,0.632,0.619,0.716,0.635,0.615,0.681,0.613,0.546,0.702,0.696,0.698,0.687,0.69,0.684,0.705,0.702,0.704,0.703,0.74,0.641,Activity,KCNE1_HUMAN,Medium,Human
+0.738,0.73,0.671,0.67,0.636,0.635,0.755,0.588,0.667,0.678,0.644,0.607,0.609,0.619,0.593,0.608,0.623,0.625,0.622,0.739,0.741,0.766,0.751,0.729,0.762,0.748,0.736,0.739,0.735,0.758,0.693,0.63,0.678,0.738,0.767,0.749,0.75,0.782,0.771,0.739,0.767,0.759,0.728,0.599,0.59,0.652,0.605,0.477,0.426,0.547,0.798,0.719,0.766,0.748,0.738,0.739,0.76,0.746,0.749,0.762,0.648,0.738,Activity,KCNH2_HUMAN,Medium,Human
+0.658,0.665,0.645,0.651,0.67,0.672,0.544,0.613,0.646,0.645,0.668,0.654,0.66,0.56,0.661,0.682,0.677,0.678,0.682,0.597,0.642,0.632,0.619,0.624,0.649,0.648,0.641,0.65,0.619,0.682,0.663,0.652,0.576,0.636,0.633,0.617,0.663,0.661,0.654,0.668,0.67,0.668,0.617,0.514,0.645,0.66,0.612,0.631,0.622,0.554,0.666,0.658,0.666,0.664,0.668,0.662,0.67,0.664,0.664,0.668,0.672,0.648,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+0.658,0.665,0.645,0.651,0.67,0.672,0.544,0.613,0.646,0.645,0.668,0.654,0.66,0.56,0.661,0.682,0.677,0.678,0.682,0.597,0.642,0.632,0.619,0.624,0.649,0.648,0.641,0.65,0.619,0.682,0.663,0.652,0.576,0.636,0.633,0.617,0.663,0.661,0.654,0.668,0.67,0.668,0.617,0.514,0.645,0.66,0.612,0.631,0.622,0.554,0.666,0.658,0.666,0.664,0.668,0.662,0.67,0.664,0.664,0.668,0.672,0.648,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+0.658,0.806,0.756,0.859,0.85,0.856,0.623,0.756,0.813,0.854,0.827,0.842,0.858,0.632,0.655,0.782,0.844,0.87,0.876,0.794,0.674,0.748,0.805,0.814,0.706,0.837,0.835,0.832,0.869,0.868,0.87,0.845,0.59,0.65,0.787,0.839,0.764,0.809,0.843,0.842,0.855,0.868,0.653,0.551,0.826,0.733,0.749,0.829,0.806,0.64,0.838,0.828,0.834,0.83,0.83,0.835,0.825,0.837,0.842,0.843,0.868,0.685,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+0.64,0.706,0.73,0.734,0.73,0.737,0.56,0.717,0.728,0.772,0.715,0.756,0.771,0.588,0.654,0.685,0.75,0.782,0.779,0.742,0.639,0.693,0.722,0.736,0.677,0.734,0.751,0.729,0.765,0.755,0.756,0.718,0.565,0.649,0.69,0.767,0.68,0.696,0.755,0.73,0.736,0.759,0.584,0.53,0.732,0.676,0.689,0.743,0.735,0.57,0.747,0.742,0.747,0.746,0.747,0.752,0.751,0.749,0.751,0.755,0.699,0.662,Activity,LGK_LIPST,Medium,Eukaryote
+0.669,0.667,0.61,0.594,0.649,0.653,0.675,0.58,0.698,0.701,0.698,0.673,0.67,0.643,0.672,0.66,0.637,0.691,0.732,0.653,0.681,0.687,0.682,0.675,0.674,0.679,0.669,0.642,0.66,0.688,0.641,0.583,0.598,0.672,0.705,0.664,0.684,0.705,0.679,0.677,0.691,0.667,0.642,0.658,0.644,0.665,0.586,0.631,0.59,0.556,0.608,0.653,0.696,0.664,0.635,0.646,0.675,0.672,0.63,0.662,0.709,0.69,Expression,LYAM1_HUMAN,Medium,Human
+0.854,0.874,0.866,0.866,0.861,0.861,0.74,0.904,0.875,0.866,0.9,0.749,0.886,0.79,0.795,0.823,0.829,0.859,0.762,0.868,0.752,0.837,0.848,0.872,0.817,0.858,0.83,0.859,0.839,0.835,0.863,0.855,0.776,0.686,0.851,0.855,0.844,0.885,0.905,0.869,0.894,0.911,0.718,0.525,0.781,0.676,0.859,0.728,0.89,0.905,0.917,0.903,0.907,0.916,0.91,0.915,0.912,0.916,0.911,0.915,0.922,0.934,Stability,MAFG_MOUSE,Medium,Eukaryote
+0.808,0.904,0.885,0.895,0.917,0.921,0.466,0.816,0.87,0.908,0.907,0.757,0.805,0.428,0.539,0.915,0.891,0.882,0.91,0.902,0.494,0.556,0.847,0.864,0.626,0.836,0.824,0.842,0.911,0.923,0.906,0.878,0.84,0.333,0.554,0.544,0.823,0.872,0.86,0.899,0.919,0.913,0.483,0.395,0.858,0.749,0.75,0.848,0.903,0.792,0.925,0.91,0.924,0.91,0.911,0.908,0.912,0.919,0.923,0.922,0.927,0.884,Stability,MBD11_ARATH,Medium,Eukaryote
+0.789,0.845,0.839,0.856,0.868,0.869,0.827,0.825,0.846,0.855,0.869,0.847,0.853,0.789,0.826,0.843,0.871,0.876,0.875,0.87,0.832,0.819,0.797,0.811,0.834,0.829,0.825,0.808,0.786,0.849,0.86,0.833,0.677,0.798,0.814,0.828,0.826,0.84,0.855,0.855,0.859,0.863,0.842,0.723,0.859,0.863,0.728,0.768,0.815,0.624,0.838,0.845,0.842,0.844,0.852,0.847,0.85,0.852,0.848,0.854,0.864,0.837,Activity,MET_HUMAN,Medium,Human
+0.614,0.622,0.638,0.641,0.634,0.636,0.627,0.605,0.625,0.618,0.542,0.607,0.617,0.605,0.621,0.622,0.615,0.621,0.597,0.614,0.631,0.582,0.562,0.53,0.613,0.569,0.552,0.561,0.486,0.63,0.613,0.633,0.568,0.61,0.554,0.524,0.623,0.586,0.572,0.637,0.627,0.626,0.621,0.579,0.594,0.614,0.541,0.514,0.582,0.502,0.602,0.601,0.594,0.604,0.608,0.603,0.603,0.606,0.61,0.608,0.602,0.616,OrganismalFitness,MK01_HUMAN,Medium,Human
+0.607,0.686,0.723,0.728,0.72,0.721,0.489,0.684,0.716,0.72,0.708,0.72,0.728,0.489,0.614,0.631,0.694,0.691,0.706,0.708,0.538,0.692,0.697,0.71,0.659,0.706,0.708,0.705,0.725,0.716,0.737,0.726,0.513,0.566,0.662,0.651,0.634,0.667,0.66,0.698,0.708,0.707,0.584,0.479,0.662,0.618,0.514,0.605,0.571,0.491,0.666,0.646,0.654,0.655,0.665,0.664,0.654,0.66,0.674,0.664,0.61,0.599,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+0.827,0.861,0.833,0.841,0.852,0.854,0.696,0.794,0.861,0.87,0.809,0.839,0.857,0.703,0.824,0.858,0.814,0.775,0.678,0.843,0.762,0.792,0.775,0.762,0.788,0.804,0.815,0.804,0.785,0.859,0.828,0.804,0.697,0.753,0.795,0.773,0.821,0.837,0.829,0.849,0.856,0.856,0.761,0.599,0.843,0.811,0.784,0.815,0.562,0.594,0.78,0.755,0.769,0.785,0.796,0.776,0.771,0.779,0.802,0.795,0.865,0.842,OrganismalFitness,MSH2_HUMAN,Medium,Human
+0.706,0.8,0.836,0.842,0.83,0.835,0.686,0.823,0.834,0.845,0.811,0.854,0.856,0.631,0.697,0.717,0.775,0.822,0.835,0.84,0.68,0.747,0.818,0.838,0.764,0.836,0.855,0.843,0.862,0.83,0.847,0.828,0.671,0.692,0.764,0.832,0.722,0.763,0.824,0.815,0.829,0.847,0.692,0.52,0.756,0.692,0.713,0.786,0.777,0.6,0.786,0.77,0.781,0.788,0.783,0.79,0.777,0.785,0.784,0.79,0.801,0.707,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+0.607,0.618,0.608,0.611,0.611,0.61,0.588,0.575,0.632,0.632,0.676,0.629,0.639,0.575,0.685,0.724,0.661,0.625,0.631,0.614,0.707,0.564,0.597,0.621,0.7,0.601,0.619,0.604,0.639,0.629,0.592,0.577,0.559,0.71,0.641,0.595,0.668,0.641,0.612,0.66,0.636,0.618,0.62,0.547,0.669,0.683,0.608,0.653,0.674,0.543,0.647,0.631,0.636,0.635,0.639,0.645,0.653,0.645,0.66,0.648,0.647,0.67,OrganismalFitness,MTHR_HUMAN,Low,Human
+0.584,0.635,0.689,0.7,0.711,0.727,0.574,0.623,0.732,0.726,0.783,0.76,0.761,0.592,0.726,0.815,0.821,0.774,0.645,0.675,0.675,0.618,0.665,0.657,0.676,0.647,0.699,0.664,0.684,0.65,0.665,0.613,0.58,0.721,0.674,0.644,0.704,0.677,0.659,0.723,0.719,0.711,0.566,0.526,0.72,0.654,0.715,0.737,0.77,0.696,0.794,0.779,0.803,0.788,0.793,0.794,0.785,0.791,0.786,0.794,0.805,0.777,Stability,MYO3_YEAST,High,Eukaryote
+0.687,0.67,0.673,0.672,0.685,0.686,0.508,0.668,0.686,0.677,0.514,0.517,0.518,0.519,0.52,0.516,0.521,0.522,0.553,0.646,0.677,0.687,0.702,0.704,0.517,0.529,0.557,0.52,0.677,0.7,0.641,0.645,0.565,0.68,0.686,0.71,0.699,0.704,0.717,0.716,0.715,0.724,0.517,0.515,0.52,0.519,0.631,0.635,0.639,0.559,0.565,0.585,0.588,0.594,0.588,0.587,0.58,0.583,0.566,0.587,0.568,0.543,OrganismalFitness,NCAP_I34A1,Medium,Virus
+0.689,0.8,0.859,0.858,0.87,0.86,0.816,0.861,0.867,0.873,0.864,0.874,0.87,0.857,0.824,0.868,0.858,0.833,0.856,0.846,0.858,0.842,0.844,0.842,0.841,0.835,0.852,0.84,0.861,0.876,0.828,0.814,0.847,0.817,0.833,0.853,0.841,0.851,0.866,0.867,0.869,0.875,0.77,0.717,0.705,0.751,0.823,0.721,0.872,0.868,0.879,0.882,0.885,0.888,0.889,0.879,0.885,0.883,0.878,0.886,0.866,0.873,Stability,NKX31_HUMAN,High,Human
+0.866,0.824,0.763,0.766,0.84,0.844,0.634,0.741,0.83,0.848,0.852,0.604,0.662,0.586,0.663,0.79,0.831,0.826,0.842,0.614,0.706,0.708,0.711,0.678,0.708,0.724,0.762,0.687,0.734,0.842,0.872,0.799,0.662,0.646,0.686,0.77,0.851,0.814,0.857,0.854,0.829,0.848,0.663,0.616,0.893,0.704,0.742,0.844,0.478,0.601,0.822,0.808,0.819,0.787,0.817,0.807,0.814,0.844,0.836,0.823,0.859,0.716,Activity,NPC1_HUMAN,Low,Human
+0.866,0.824,0.763,0.766,0.84,0.844,0.634,0.741,0.83,0.848,0.852,0.604,0.662,0.586,0.663,0.79,0.831,0.826,0.842,0.614,0.706,0.708,0.711,0.678,0.708,0.724,0.762,0.687,0.734,0.842,0.872,0.799,0.662,0.646,0.686,0.77,0.851,0.814,0.857,0.854,0.829,0.848,0.663,0.616,0.893,0.704,0.742,0.844,0.478,0.601,0.822,0.808,0.819,0.787,0.817,0.807,0.814,0.844,0.836,0.823,0.859,0.716,Activity,NPC1_HUMAN,Low,Human
+0.802,0.775,0.727,0.717,0.799,0.795,0.53,0.718,0.809,0.814,0.474,0.598,0.745,0.45,0.444,0.505,0.615,0.752,0.795,0.686,0.769,0.802,0.781,0.777,0.549,0.774,0.809,0.747,0.81,0.833,0.729,0.732,0.456,0.747,0.767,0.772,0.803,0.822,0.823,0.82,0.828,0.823,0.428,0.432,0.425,0.423,0.725,0.706,0.731,0.592,0.636,0.648,0.653,0.677,0.663,0.679,0.666,0.66,0.634,0.662,0.644,0.567,OrganismalFitness,NRAM_I33A0,Low,Virus
+0.644,0.75,0.797,0.813,0.809,0.811,0.504,0.711,0.832,0.866,0.838,0.839,0.863,0.677,0.735,0.753,0.801,0.833,0.84,0.777,0.671,0.75,0.808,0.798,0.736,0.828,0.826,0.81,0.788,0.813,0.841,0.811,0.594,0.698,0.739,0.816,0.728,0.744,0.83,0.809,0.809,0.838,0.724,0.511,0.831,0.744,0.753,0.805,0.779,0.667,0.806,0.785,0.798,0.813,0.804,0.814,0.817,0.813,0.813,0.817,0.882,0.815,Expression,NUD15_HUMAN,High,Human
+0.692,0.804,0.802,0.796,0.814,0.821,0.636,0.73,0.85,0.821,0.793,0.654,0.667,0.656,0.698,0.701,0.744,0.767,0.758,0.798,0.705,0.796,0.81,0.812,0.657,0.762,0.729,0.788,0.807,0.839,0.865,0.894,0.587,0.682,0.703,0.707,0.758,0.746,0.756,0.832,0.81,0.825,0.679,0.609,0.683,0.67,0.851,0.828,0.881,0.844,0.888,0.885,0.884,0.897,0.898,0.907,0.89,0.889,0.896,0.897,0.852,0.78,Stability,NUSA_ECOLI,High,Prokaryote
+0.752,0.76,0.767,0.774,0.783,0.774,0.653,0.758,0.782,0.776,0.749,0.722,0.746,0.693,0.796,0.763,0.784,0.789,0.788,0.761,0.682,0.705,0.7,0.727,0.73,0.722,0.737,0.699,0.733,0.803,0.741,0.739,0.66,0.657,0.669,0.745,0.743,0.744,0.771,0.763,0.765,0.775,0.681,0.559,0.674,0.743,0.815,0.789,0.886,0.862,0.798,0.788,0.777,0.798,0.794,0.792,0.796,0.8,0.798,0.797,0.762,0.808,Stability,NUSG_MYCTU,High,Prokaryote
+0.758,0.86,0.916,0.928,0.917,0.923,0.644,0.809,0.917,0.915,0.921,0.869,0.887,0.649,0.9,0.923,0.928,0.924,0.919,0.916,0.909,0.899,0.898,0.887,0.854,0.902,0.893,0.882,0.877,0.909,0.927,0.922,0.758,0.618,0.67,0.762,0.874,0.86,0.869,0.927,0.917,0.92,0.735,0.484,0.73,0.787,0.837,0.787,0.916,0.918,0.943,0.937,0.944,0.94,0.947,0.944,0.941,0.94,0.947,0.945,0.932,0.89,Stability,OBSCN_HUMAN,High,Human
+0.372,0.393,0.643,0.562,0.637,0.636,0.404,0.518,0.558,0.564,0.5,0.432,0.426,0.448,0.415,0.532,0.569,0.514,0.571,0.589,0.559,0.555,0.546,0.56,0.554,0.561,0.552,0.564,0.581,0.642,0.504,0.493,0.614,0.453,0.49,0.479,0.566,0.56,0.56,0.61,0.6,0.618,0.599,0.452,0.632,0.625,0.694,0.678,0.681,0.626,0.525,0.505,0.508,0.529,0.501,0.528,0.52,0.523,0.512,0.518,0.608,0.551,Stability,ODP2_GEOSE,High,Prokaryote
+0.611,0.75,0.755,0.796,0.758,0.76,0.701,0.808,0.787,0.806,0.692,0.788,0.808,0.659,0.777,0.767,0.784,0.751,0.769,0.784,0.782,0.778,0.772,0.796,0.804,0.812,0.822,0.815,0.801,0.749,0.74,0.694,0.604,0.761,0.785,0.768,0.78,0.783,0.773,0.763,0.765,0.767,0.702,0.557,0.762,0.752,0.874,0.789,0.882,0.611,0.703,0.67,0.699,0.696,0.745,0.754,0.682,0.701,0.729,0.729,0.814,0.785,Expression,OPSD_HUMAN,High,Human
+0.757,0.786,0.748,0.769,0.778,0.78,0.547,0.71,0.791,0.793,0.773,0.779,0.785,0.552,0.69,0.739,0.753,0.749,0.763,0.782,0.713,0.731,0.768,0.782,0.735,0.783,0.769,0.777,0.798,0.806,0.745,0.716,0.533,0.703,0.761,0.798,0.753,0.787,0.815,0.786,0.798,0.807,0.619,0.524,0.752,0.72,0.83,0.797,0.837,0.687,0.755,0.766,0.764,0.781,0.766,0.775,0.77,0.766,0.763,0.773,0.799,0.734,Activity,OTC_HUMAN,Medium,Human
+0.52,0.593,0.582,0.59,0.602,0.604,0.599,0.574,0.56,0.564,0.747,0.862,0.858,0.593,0.835,0.847,0.726,0.741,0.728,0.579,0.578,0.665,0.691,0.713,0.731,0.731,0.759,0.732,0.615,0.671,0.815,0.793,0.562,0.566,0.639,0.734,0.596,0.602,0.688,0.612,0.603,0.636,0.688,0.577,0.836,0.812,0.829,0.842,0.865,0.787,0.769,0.717,0.731,0.762,0.773,0.782,0.735,0.764,0.782,0.766,0.854,0.855,Stability,OTU7A_HUMAN,High,Human
+0.602,0.646,0.665,0.666,0.657,0.662,0.621,0.629,0.651,0.656,0.686,0.676,0.678,0.612,0.644,0.666,0.679,0.692,0.69,0.65,0.603,0.636,0.657,0.655,0.646,0.666,0.68,0.667,0.694,0.668,0.694,0.678,0.536,0.63,0.634,0.655,0.654,0.654,0.661,0.665,0.665,0.666,0.636,0.576,0.682,0.654,0.672,0.688,0.661,0.544,0.678,0.674,0.676,0.674,0.677,0.678,0.678,0.678,0.678,0.682,0.7,0.648,Activity,OXDA_RHOTO,High,Eukaryote
+0.602,0.646,0.665,0.666,0.657,0.662,0.621,0.629,0.651,0.656,0.686,0.676,0.678,0.612,0.644,0.666,0.679,0.692,0.69,0.65,0.603,0.636,0.657,0.655,0.646,0.666,0.68,0.667,0.694,0.668,0.694,0.678,0.536,0.63,0.634,0.655,0.654,0.654,0.661,0.665,0.665,0.666,0.636,0.576,0.682,0.654,0.672,0.688,0.661,0.544,0.678,0.674,0.676,0.674,0.677,0.678,0.678,0.678,0.678,0.682,0.7,0.648,Expression,OXDA_RHOTO,High,Eukaryote
+0.802,0.804,0.741,0.758,0.793,0.798,0.476,0.731,0.752,0.758,0.818,0.803,0.834,0.444,0.462,0.768,0.82,0.833,0.844,0.64,0.717,0.782,0.792,0.783,0.764,0.802,0.802,0.811,0.749,0.812,0.828,0.804,0.594,0.696,0.792,0.757,0.795,0.818,0.794,0.804,0.821,0.8,0.448,0.435,0.83,0.52,0.762,0.82,0.77,0.634,0.817,0.822,0.822,0.821,0.824,0.829,0.826,0.826,0.83,0.832,0.844,0.658,OrganismalFitness,P53_HUMAN,Low,Human
+0.802,0.804,0.741,0.758,0.793,0.798,0.476,0.731,0.752,0.758,0.818,0.803,0.834,0.444,0.462,0.768,0.82,0.833,0.844,0.64,0.717,0.782,0.792,0.783,0.764,0.802,0.802,0.811,0.749,0.812,0.828,0.804,0.594,0.696,0.792,0.757,0.795,0.818,0.794,0.804,0.821,0.8,0.448,0.435,0.83,0.52,0.762,0.82,0.77,0.634,0.817,0.822,0.822,0.821,0.824,0.829,0.826,0.826,0.83,0.832,0.844,0.658,OrganismalFitness,P53_HUMAN,Low,Human
+0.802,0.804,0.741,0.758,0.793,0.798,0.476,0.731,0.752,0.758,0.818,0.803,0.834,0.444,0.462,0.768,0.82,0.833,0.844,0.64,0.717,0.782,0.792,0.783,0.764,0.802,0.802,0.811,0.749,0.812,0.828,0.804,0.594,0.696,0.792,0.757,0.795,0.818,0.794,0.804,0.821,0.8,0.448,0.435,0.83,0.52,0.762,0.82,0.77,0.634,0.817,0.822,0.822,0.821,0.824,0.829,0.826,0.826,0.83,0.832,0.844,0.658,OrganismalFitness,P53_HUMAN,Low,Human
+0.802,0.804,0.741,0.758,0.793,0.798,0.476,0.731,0.752,0.758,0.818,0.803,0.834,0.444,0.462,0.768,0.82,0.833,0.844,0.64,0.717,0.782,0.792,0.783,0.764,0.802,0.802,0.811,0.749,0.812,0.828,0.804,0.594,0.696,0.792,0.757,0.795,0.818,0.794,0.804,0.821,0.8,0.448,0.435,0.83,0.52,0.762,0.82,0.77,0.634,0.817,0.822,0.822,0.821,0.824,0.829,0.826,0.826,0.83,0.832,0.844,0.658,OrganismalFitness,P53_HUMAN,Low,Human
+0.79,0.827,0.844,0.852,0.818,0.827,0.661,0.785,0.86,0.847,0.818,0.808,0.832,0.634,0.809,0.817,0.831,0.838,0.811,0.846,0.737,0.785,0.781,0.813,0.792,0.828,0.837,0.819,0.87,0.801,0.817,0.773,0.699,0.769,0.775,0.798,0.795,0.798,0.811,0.821,0.822,0.831,0.754,0.488,0.797,0.774,0.661,0.727,0.768,0.56,0.795,0.782,0.748,0.796,0.799,0.777,0.787,0.799,0.802,0.802,0.835,0.828,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+0.855,0.827,0.787,0.791,0.846,0.843,0.752,0.805,0.838,0.852,0.866,0.851,0.861,0.758,0.805,0.856,0.898,0.875,0.882,0.787,0.847,0.858,0.863,0.874,0.846,0.879,0.879,0.866,0.86,0.859,0.884,0.846,0.649,0.849,0.85,0.847,0.874,0.874,0.872,0.866,0.868,0.866,0.825,0.469,0.851,0.852,0.691,0.817,0.779,0.604,0.87,0.865,0.872,0.888,0.88,0.883,0.884,0.884,0.901,0.892,0.885,0.844,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+0.693,0.693,0.686,0.693,0.695,0.702,0.526,0.656,0.705,0.711,0.708,0.693,0.704,0.545,0.688,0.705,0.715,0.662,0.636,0.704,0.57,0.686,0.677,0.668,0.67,0.69,0.693,0.681,0.665,0.705,0.692,0.676,0.561,0.583,0.678,0.681,0.683,0.693,0.697,0.702,0.706,0.706,0.669,0.533,0.705,0.695,0.69,0.723,0.703,0.576,0.712,0.712,0.712,0.718,0.714,0.716,0.715,0.717,0.713,0.72,0.728,0.692,Activity,PAI1_HUMAN,,Human
+0.772,0.773,0.761,0.766,0.782,0.783,0.535,0.694,0.589,0.589,0.526,0.537,0.569,0.518,0.52,0.521,0.525,0.525,0.681,0.667,0.736,0.755,0.777,0.778,0.607,0.71,0.733,0.725,0.728,0.805,0.689,0.693,0.566,0.722,0.747,0.777,0.782,0.793,0.796,0.804,0.808,0.803,0.522,0.515,0.521,0.519,0.627,0.616,0.599,0.54,0.581,0.584,0.583,0.594,0.584,0.591,0.583,0.588,0.568,0.59,0.612,0.577,OrganismalFitness,PA_I34A1,Medium,Virus
+0.592,0.708,0.858,0.846,0.66,0.652,0.821,0.77,0.85,0.855,0.819,0.873,0.888,0.878,0.911,0.852,0.87,0.851,0.876,0.819,0.836,0.835,0.79,0.841,0.754,0.8,0.788,0.822,0.791,0.783,0.781,0.74,0.647,0.774,0.744,0.795,0.786,0.769,0.793,0.696,0.71,0.7,0.808,0.687,0.796,0.805,0.599,0.747,0.83,0.7,0.797,0.765,0.77,0.769,0.772,0.779,0.778,0.781,0.786,0.779,0.865,0.884,Activity,PHOT_CHLRE,High,Eukaryote
+0.608,0.662,0.858,0.837,0.829,0.85,0.8,0.753,0.851,0.872,0.85,0.799,0.858,0.837,0.832,0.846,0.848,0.761,0.79,0.837,0.722,0.859,0.834,0.825,0.789,0.857,0.869,0.852,0.859,0.873,0.879,0.861,0.799,0.766,0.834,0.859,0.806,0.85,0.884,0.856,0.872,0.888,0.8,0.381,0.836,0.777,0.786,0.808,0.871,0.862,0.846,0.824,0.752,0.782,0.789,0.808,0.812,0.835,0.821,0.817,0.889,0.895,Stability,PIN1_HUMAN,High,Human
+0.795,0.773,0.787,0.791,0.787,0.79,0.76,0.754,0.781,0.793,0.745,0.732,0.726,0.786,0.836,0.847,0.827,0.807,0.765,0.79,0.771,0.743,0.752,0.738,0.793,0.766,0.744,0.73,0.752,0.78,0.712,0.703,0.729,0.797,0.771,0.761,0.813,0.788,0.784,0.801,0.782,0.784,0.748,0.718,0.646,0.669,0.715,0.653,0.85,0.82,0.794,0.798,0.795,0.805,0.812,0.816,0.806,0.808,0.81,0.809,0.75,0.812,Stability,PITX2_HUMAN,High,Human
+0.623,0.605,0.662,0.671,0.69,0.694,0.612,0.658,0.639,0.661,0.68,0.706,0.737,0.716,0.733,0.814,0.721,0.678,0.676,0.718,0.702,0.705,0.68,0.712,0.71,0.708,0.711,0.704,0.712,0.717,0.752,0.75,0.555,0.721,0.665,0.708,0.69,0.686,0.714,0.713,0.697,0.712,0.684,0.674,0.705,0.692,0.835,0.777,0.83,0.79,0.685,0.703,0.724,0.689,0.707,0.706,0.692,0.707,0.713,0.709,0.728,0.825,Stability,PKN1_HUMAN,High,Human
+0.709,0.687,0.681,0.699,0.726,0.731,0.478,0.672,0.749,0.749,0.651,0.468,0.516,0.457,0.469,0.595,0.702,0.704,0.717,0.677,0.664,0.694,0.689,0.688,0.561,0.691,0.688,0.683,0.694,0.743,0.694,0.66,0.501,0.524,0.631,0.677,0.667,0.688,0.706,0.688,0.709,0.727,0.465,0.465,0.677,0.472,0.596,0.673,0.559,0.523,0.673,0.683,0.685,0.687,0.684,0.686,0.685,0.684,0.69,0.689,0.558,0.536,OrganismalFitness,POLG_CXB3N,Medium,Virus
+0.76,0.794,0.633,0.633,0.772,0.772,0.485,0.722,0.848,0.85,0.669,0.507,0.512,0.481,0.513,0.547,0.587,0.645,0.69,0.715,0.732,0.741,0.733,0.732,0.725,0.754,0.756,0.747,0.745,0.816,0.809,0.771,0.522,0.484,0.577,0.743,0.729,0.709,0.781,0.745,0.714,0.79,0.478,0.473,0.709,0.504,0.69,0.771,0.572,0.561,0.632,0.624,0.631,0.643,0.626,0.643,0.633,0.637,0.627,0.64,0.592,0.524,OrganismalFitness,POLG_DEN26,Low,Virus
+0.824,0.784,0.712,0.716,0.823,0.828,0.47,0.598,0.792,0.798,0.588,0.842,0.839,0.552,0.565,0.556,0.556,0.543,0.535,0.631,0.714,0.744,0.741,0.761,0.721,0.756,0.669,0.72,0.78,0.84,0.821,0.772,0.599,0.747,0.769,0.78,0.768,0.79,0.809,0.748,0.78,0.798,0.555,0.52,0.763,0.554,0.483,0.672,0.869,0.703,0.673,0.714,0.7,0.715,0.698,0.657,0.683,0.685,0.679,0.712,0.596,0.584,OrganismalFitness,POLG_HCVJF,Medium,Virus
+0.638,0.766,0.695,0.72,0.74,0.74,0.525,0.722,0.679,0.735,0.737,0.523,0.558,0.525,0.556,0.515,0.567,0.553,0.53,0.818,0.558,0.53,0.507,0.574,0.542,0.47,0.486,0.463,0.578,0.761,0.822,0.826,0.575,0.498,0.503,0.5,0.702,0.706,0.702,0.734,0.741,0.733,0.479,0.448,0.491,0.469,0.778,0.748,0.882,0.841,0.859,0.863,0.848,0.892,0.894,0.894,0.867,0.896,0.894,0.888,0.907,0.83,Stability,POLG_PESV,Medium,Virus
+0.758,0.814,0.837,0.842,0.838,0.845,0.625,0.731,0.81,0.821,0.812,0.841,0.843,0.533,0.612,0.808,0.865,0.893,0.869,0.858,0.831,0.827,0.697,0.719,0.81,0.837,0.822,0.838,0.734,0.871,0.852,0.843,0.732,0.855,0.816,0.788,0.862,0.845,0.833,0.871,0.861,0.859,0.679,0.524,0.791,0.779,0.807,0.788,0.834,0.668,0.86,0.862,0.867,0.865,0.865,0.866,0.864,0.865,0.865,0.871,0.861,0.811,Activity,PPARG_HUMAN,Medium,Human
+0.783,0.779,0.779,0.782,0.816,0.817,0.513,0.694,0.741,0.77,0.797,0.805,0.816,0.639,0.692,0.734,0.805,0.82,0.817,0.697,0.726,0.767,0.776,0.719,0.763,0.795,0.79,0.787,0.714,0.813,0.804,0.777,0.614,0.718,0.769,0.775,0.801,0.805,0.805,0.819,0.817,0.818,0.677,0.473,0.794,0.763,0.756,0.798,0.783,0.627,0.791,0.793,0.789,0.796,0.801,0.794,0.797,0.797,0.799,0.802,0.824,0.757,OrganismalFitness,PPM1D_HUMAN,Low,Human
+0.845,0.882,0.926,0.921,0.939,0.94,0.771,0.771,0.936,0.939,0.913,0.85,0.881,0.783,0.779,0.951,0.945,0.921,0.912,0.904,0.829,0.885,0.886,0.898,0.853,0.888,0.907,0.898,0.912,0.946,0.944,0.942,0.856,0.828,0.8,0.824,0.917,0.92,0.909,0.932,0.943,0.933,0.718,0.611,0.635,0.719,0.798,0.664,0.902,0.913,0.945,0.947,0.947,0.949,0.954,0.954,0.949,0.953,0.953,0.954,0.94,0.937,Stability,PR40A_HUMAN,Medium,Human
+0.839,0.84,0.835,0.827,0.848,0.844,0.593,0.757,0.797,0.807,0.816,0.839,0.854,0.615,0.652,0.689,0.769,0.861,0.865,0.753,0.652,0.803,0.828,0.819,0.74,0.839,0.838,0.835,0.799,0.846,0.832,0.809,0.652,0.628,0.801,0.817,0.816,0.842,0.848,0.842,0.855,0.86,0.659,0.54,0.821,0.69,0.845,0.831,0.873,0.656,0.788,0.791,0.793,0.81,0.8,0.805,0.807,0.806,0.8,0.808,0.866,0.768,Expression,PRKN_HUMAN,Low,Human
+0.859,0.864,0.856,0.855,0.86,0.857,0.712,0.785,0.824,0.824,0.904,0.871,0.88,0.811,0.842,0.895,0.909,0.888,0.853,0.806,0.706,0.656,0.794,0.739,0.793,0.809,0.388,0.802,0.834,0.846,0.853,0.825,0.554,0.717,0.755,0.78,0.845,0.841,0.82,0.866,0.866,0.843,0.748,0.65,0.864,0.784,0.828,0.854,0.907,0.873,0.902,0.896,0.898,0.903,0.901,0.907,0.895,0.901,0.901,0.905,0.909,0.906,Stability,PSAE_PICP2,Medium,Prokaryote
+0.748,0.76,0.754,0.76,0.77,0.774,0.599,0.708,0.771,0.784,0.763,0.76,0.782,0.613,0.704,0.788,0.782,0.668,0.669,0.785,0.662,0.752,0.716,0.698,0.702,0.688,0.68,0.705,0.659,0.79,0.758,0.754,0.554,0.693,0.742,0.691,0.758,0.772,0.746,0.775,0.788,0.786,0.684,0.504,0.776,0.742,0.754,0.748,0.77,0.618,0.768,0.752,0.77,0.768,0.772,0.774,0.77,0.771,0.774,0.778,0.796,0.776,Expression,PTEN_HUMAN,Medium,Human
+0.748,0.76,0.754,0.76,0.77,0.774,0.599,0.708,0.771,0.784,0.763,0.76,0.782,0.613,0.704,0.788,0.782,0.668,0.669,0.785,0.662,0.752,0.716,0.698,0.702,0.688,0.68,0.705,0.659,0.79,0.758,0.754,0.554,0.693,0.742,0.691,0.758,0.772,0.746,0.775,0.788,0.786,0.684,0.504,0.776,0.742,0.754,0.748,0.77,0.618,0.768,0.752,0.77,0.768,0.772,0.774,0.77,0.771,0.774,0.778,0.796,0.776,Activity,PTEN_HUMAN,Medium,Human
+0.758,0.704,0.69,0.709,0.76,0.763,0.51,0.723,0.77,0.773,0.745,0.767,0.781,0.509,0.509,0.511,0.532,0.559,0.586,0.711,0.77,0.707,0.698,0.671,0.769,0.702,0.702,0.722,0.683,0.764,0.751,0.753,0.644,0.758,0.716,0.71,0.772,0.762,0.762,0.775,0.769,0.768,0.72,0.507,0.765,0.738,0.695,0.743,0.634,0.575,0.615,0.636,0.658,0.656,0.631,0.634,0.635,0.628,0.615,0.645,0.61,0.56,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+0.721,0.706,0.718,0.722,0.726,0.726,0.508,0.664,0.712,0.718,0.662,0.692,0.706,0.569,0.57,0.708,0.721,0.708,0.728,0.719,0.666,0.686,0.684,0.67,0.692,0.7,0.689,0.704,0.708,0.712,0.732,0.71,0.566,0.666,0.68,0.652,0.71,0.72,0.71,0.728,0.732,0.728,0.678,0.574,0.71,0.7,0.688,0.712,0.697,0.597,0.708,0.713,0.722,0.72,0.722,0.719,0.716,0.716,0.72,0.724,0.709,0.764,Binding,Q53Z42_HUMAN,Medium,Human
+0.721,0.706,0.718,0.722,0.726,0.726,0.508,0.664,0.712,0.718,0.662,0.692,0.706,0.569,0.57,0.708,0.721,0.708,0.728,0.719,0.666,0.686,0.684,0.67,0.692,0.7,0.689,0.704,0.708,0.712,0.732,0.71,0.566,0.666,0.68,0.652,0.71,0.72,0.71,0.728,0.732,0.728,0.678,0.574,0.71,0.7,0.688,0.712,0.697,0.597,0.708,0.713,0.722,0.72,0.722,0.719,0.716,0.716,0.72,0.724,0.709,0.764,Expression,Q53Z42_HUMAN,Medium,Human
+0.781,0.831,0.853,0.857,0.857,0.864,0.689,0.776,0.865,0.869,0.827,0.786,0.796,0.574,0.741,0.775,0.813,0.812,0.817,0.85,0.826,0.85,0.849,0.857,0.838,0.859,0.866,0.865,0.869,0.872,0.854,0.827,0.674,0.827,0.85,0.842,0.838,0.856,0.857,0.864,0.869,0.869,0.763,0.492,0.824,0.784,0.711,0.783,0.781,0.569,0.813,0.792,0.792,0.803,0.808,0.798,0.81,0.797,0.802,0.809,0.843,0.79,Activity,Q59976_STRSQ,Medium,Prokaryote
+0.634,0.658,0.605,0.606,0.64,0.639,0.496,0.562,0.661,0.659,0.599,0.52,0.512,0.506,0.519,0.501,0.505,0.505,0.493,0.626,0.518,0.519,0.502,0.552,0.513,0.5,0.512,0.496,0.493,0.682,0.636,0.639,0.514,0.513,0.518,0.512,0.599,0.599,0.599,0.637,0.637,0.635,0.505,0.501,0.496,0.51,0.611,0.615,0.664,0.601,0.61,0.624,0.614,0.636,0.628,0.63,0.621,0.622,0.621,0.624,0.554,0.536,Activity,Q6WV12_9MAXI,Low,Eukaryote
+0.705,0.712,0.718,0.719,0.725,0.736,0.709,0.675,0.711,0.719,0.765,0.741,0.755,0.682,0.716,0.718,0.724,0.744,0.729,0.476,0.733,0.712,0.721,0.734,0.742,0.749,0.76,0.706,0.754,0.738,0.738,0.718,0.561,0.721,0.747,0.729,0.719,0.754,0.749,0.744,0.752,0.752,0.704,0.677,0.755,0.72,0.63,0.725,0.685,0.551,0.726,0.714,0.718,0.736,0.74,0.748,0.741,0.711,0.736,0.737,0.772,0.72,Activity,Q837P4_ENTFA,Medium,Prokaryote
+0.565,0.677,0.682,0.698,0.664,0.668,0.573,0.601,0.613,0.618,0.638,0.646,0.638,0.526,0.578,0.629,0.653,0.691,0.671,0.636,0.657,0.676,0.701,0.698,0.639,0.684,0.69,0.735,0.714,0.644,0.665,0.658,0.559,0.68,0.703,0.741,0.643,0.674,0.714,0.679,0.686,0.699,0.611,0.557,0.653,0.6,0.665,0.642,0.676,0.563,0.636,0.641,0.648,0.629,0.657,0.65,0.63,0.647,0.637,0.648,0.64,0.629,Activity,Q837P5_ENTFA,Medium,Prokaryote
+0.63,0.671,0.603,0.602,0.652,0.649,0.495,0.648,0.638,0.641,0.592,0.471,0.463,0.466,0.454,0.459,0.465,0.461,0.484,0.601,0.476,0.492,0.516,0.489,0.482,0.469,0.487,0.606,0.613,0.665,0.646,0.654,0.477,0.452,0.49,0.639,0.596,0.601,0.642,0.641,0.646,0.66,0.472,0.473,0.46,0.46,0.51,0.543,0.628,0.571,0.604,0.598,0.603,0.609,0.598,0.606,0.605,0.599,0.599,0.603,0.537,0.447,Activity,Q8WTC7_9CNID,Low,Eukaryote
+0.835,0.829,0.618,0.633,0.846,0.848,0.487,0.675,0.496,0.496,0.575,0.497,0.49,0.513,0.499,0.555,0.567,0.792,0.824,0.663,0.636,0.667,0.674,0.676,0.643,0.643,0.623,0.634,0.642,0.825,0.801,0.745,0.486,0.607,0.637,0.63,0.711,0.735,0.734,0.823,0.834,0.832,0.501,0.486,0.549,0.495,0.752,0.733,0.777,0.626,0.65,0.636,0.659,0.675,0.665,0.661,0.662,0.66,0.641,0.664,0.636,0.589,OrganismalFitness,R1AB_SARS2,Medium,Virus
+0.605,0.585,0.66,0.663,0.675,0.692,0.735,0.703,0.853,0.835,0.794,0.748,0.769,0.726,0.846,0.868,0.761,0.754,0.797,0.714,0.788,0.824,0.803,0.766,0.816,0.744,0.808,0.754,0.747,0.799,0.725,0.681,0.649,0.75,0.834,0.745,0.745,0.811,0.744,0.734,0.762,0.723,0.811,0.62,0.625,0.726,0.761,0.666,0.865,0.778,0.786,0.801,0.804,0.8,0.812,0.802,0.799,0.801,0.797,0.805,0.719,0.818,Stability,RAD_ANTMA,High,Eukaryote
+0.708,0.719,0.706,0.712,0.725,0.725,0.525,0.684,0.752,0.763,0.76,0.732,0.769,0.523,0.619,0.74,0.756,0.744,0.734,0.678,0.654,0.715,0.708,0.696,0.699,0.712,0.708,0.696,0.708,0.727,0.764,0.73,0.569,0.653,0.688,0.702,0.693,0.703,0.711,0.717,0.734,0.737,0.607,0.529,0.779,0.707,0.638,0.725,0.639,0.665,0.729,0.726,0.73,0.752,0.734,0.747,0.732,0.729,0.74,0.745,0.727,0.59,OrganismalFitness,RAF1_HUMAN,Low,Human
+0.798,0.811,0.827,0.842,0.839,0.844,0.689,0.772,0.823,0.837,0.757,0.787,0.811,0.807,0.849,0.835,0.85,0.817,0.735,0.859,0.819,0.805,0.805,0.791,0.809,0.791,0.791,0.767,0.71,0.815,0.789,0.779,0.648,0.787,0.803,0.76,0.821,0.83,0.808,0.839,0.844,0.846,0.838,0.662,0.72,0.822,0.711,0.641,0.807,0.644,0.798,0.794,0.799,0.796,0.806,0.793,0.813,0.796,0.805,0.812,0.747,0.827,Activity,RASH_HUMAN,High,Human
+0.691,0.69,0.75,0.75,0.74,0.746,0.625,0.662,0.725,0.734,0.725,0.692,0.714,0.727,0.754,0.748,0.736,0.7,0.668,0.763,0.716,0.726,0.744,0.726,0.709,0.734,0.742,0.698,0.716,0.754,0.692,0.668,0.634,0.682,0.736,0.734,0.714,0.751,0.756,0.742,0.756,0.756,0.705,0.652,0.632,0.718,0.644,0.652,0.751,0.655,0.703,0.7,0.714,0.71,0.722,0.714,0.712,0.708,0.69,0.714,0.742,0.738,Expression,RASK_HUMAN,High,Human
+0.691,0.69,0.75,0.75,0.74,0.746,0.625,0.662,0.725,0.734,0.725,0.692,0.714,0.727,0.754,0.748,0.736,0.7,0.668,0.763,0.716,0.726,0.744,0.726,0.709,0.734,0.742,0.698,0.716,0.754,0.692,0.668,0.634,0.682,0.736,0.734,0.714,0.751,0.756,0.742,0.756,0.756,0.705,0.652,0.632,0.718,0.644,0.652,0.751,0.655,0.703,0.7,0.714,0.71,0.722,0.714,0.712,0.708,0.69,0.714,0.742,0.738,Binding,RASK_HUMAN,High,Human
+0.584,0.542,0.656,0.66,0.659,0.657,0.646,0.65,0.634,0.642,0.728,0.718,0.713,0.669,0.708,0.759,0.781,0.684,0.682,0.633,0.694,0.577,0.656,0.654,0.653,0.565,0.591,0.673,0.631,0.701,0.704,0.676,0.644,0.655,0.698,0.708,0.666,0.679,0.678,0.666,0.675,0.666,0.667,0.636,0.709,0.714,0.754,0.778,0.796,0.73,0.755,0.751,0.756,0.755,0.759,0.754,0.749,0.762,0.752,0.758,0.783,0.769,Stability,RBP1_HUMAN,High,Human
+0.694,0.683,0.725,0.728,0.728,0.723,0.66,0.662,0.736,0.74,0.757,0.669,0.695,0.685,0.692,0.784,0.792,0.787,0.783,0.739,0.628,0.616,0.668,0.71,0.659,0.759,0.749,0.727,0.767,0.744,0.787,0.752,0.519,0.638,0.657,0.66,0.717,0.721,0.715,0.742,0.743,0.735,0.655,0.644,0.727,0.666,0.744,0.757,0.801,0.773,0.802,0.809,0.816,0.814,0.806,0.812,0.809,0.808,0.811,0.814,0.799,0.762,Stability,RCD1_ARATH,Medium,Eukaryote
+0.665,0.773,0.805,0.8,0.788,0.806,0.533,0.64,0.764,0.767,0.737,0.649,0.697,0.591,0.654,0.66,0.786,0.786,0.803,0.801,0.506,0.524,0.549,0.541,0.414,0.522,0.439,0.482,0.798,0.799,0.782,0.771,0.518,0.525,0.495,0.778,0.713,0.708,0.795,0.789,0.8,0.823,0.452,0.545,0.698,0.548,0.78,0.806,0.866,0.807,0.79,0.781,0.771,0.782,0.782,0.784,0.773,0.781,0.792,0.785,0.815,0.71,Stability,RCRO_LAMBD,High,Virus
+0.619,0.649,0.714,0.714,0.711,0.711,0.576,0.735,0.766,0.771,0.75,0.792,0.805,0.603,0.836,0.78,0.768,0.761,0.723,0.735,0.705,0.725,0.722,0.743,0.769,0.765,0.764,0.749,0.765,0.747,0.725,0.718,0.729,0.646,0.749,0.737,0.703,0.747,0.75,0.716,0.742,0.748,0.747,0.507,0.788,0.772,0.724,0.783,0.787,0.745,0.744,0.735,0.741,0.733,0.748,0.75,0.746,0.748,0.749,0.749,0.741,0.796,Stability,RD23A_HUMAN,High,Human
+0.667,0.699,0.727,0.732,0.779,0.781,0.524,0.728,0.791,0.794,0.612,0.547,0.554,0.535,0.54,0.59,0.698,0.724,0.785,0.768,0.734,0.768,0.778,0.789,0.588,0.723,0.716,0.712,0.751,0.796,0.766,0.729,0.568,0.73,0.761,0.781,0.75,0.768,0.779,0.79,0.799,0.808,0.535,0.522,0.612,0.536,0.626,0.646,0.622,0.54,0.681,0.664,0.674,0.674,0.683,0.679,0.675,0.686,0.686,0.687,0.595,0.569,OrganismalFitness,RDRP_I33A0,Low,Virus
+0.596,0.574,0.603,0.606,0.603,0.603,0.521,0.646,0.603,0.607,0.566,0.616,0.624,0.526,0.522,0.585,0.613,0.638,0.629,0.575,0.6,0.623,0.617,0.608,0.635,0.637,0.576,0.617,0.623,0.63,0.667,0.672,0.525,0.608,0.627,0.613,0.614,0.625,0.61,0.616,0.623,0.611,0.529,0.533,0.599,0.537,0.626,0.612,0.639,0.604,0.591,0.596,0.633,0.622,0.624,0.625,0.633,0.61,0.649,0.632,0.633,0.608,OrganismalFitness,REV_HV1H2,Medium,Virus
+0.507,0.596,0.607,0.618,0.618,0.616,0.487,0.62,0.613,0.627,0.622,0.571,0.585,0.469,0.529,0.511,0.649,0.637,0.632,0.635,0.454,0.519,0.559,0.553,0.464,0.565,0.577,0.552,0.576,0.61,0.636,0.61,0.5,0.498,0.558,0.576,0.543,0.572,0.583,0.61,0.61,0.611,0.547,0.447,0.585,0.54,0.649,0.622,0.689,0.658,0.668,0.657,0.649,0.668,0.66,0.655,0.66,0.656,0.668,0.663,0.652,0.593,Stability,RFAH_ECOLI,High,Prokaryote
+0.688,0.851,0.867,0.866,0.866,0.87,0.593,0.855,0.718,0.681,0.886,0.885,0.875,0.591,0.706,0.753,0.908,0.899,0.907,0.879,0.587,0.798,0.786,0.78,0.774,0.789,0.805,0.818,0.861,0.865,0.843,0.832,0.531,0.774,0.803,0.79,0.812,0.836,0.832,0.852,0.875,0.875,0.484,0.434,0.769,0.622,0.856,0.858,0.936,0.906,0.917,0.917,0.915,0.929,0.923,0.919,0.924,0.924,0.925,0.927,0.93,0.872,Stability,RL20_AQUAE,High,Prokaryote
+0.682,0.687,0.708,0.736,0.71,0.726,0.561,0.724,0.752,0.758,0.611,0.656,0.681,0.572,0.761,0.785,0.81,0.754,0.798,0.741,0.71,0.795,0.776,0.755,0.78,0.754,0.756,0.739,0.727,0.7,0.72,0.727,0.561,0.72,0.77,0.718,0.738,0.777,0.742,0.751,0.773,0.738,0.665,0.539,0.657,0.667,0.575,0.598,0.647,0.534,0.795,0.779,0.795,0.797,0.803,0.796,0.783,0.8,0.791,0.808,0.695,0.718,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.682,0.687,0.708,0.736,0.71,0.726,0.561,0.724,0.752,0.758,0.611,0.656,0.681,0.572,0.761,0.785,0.81,0.754,0.798,0.741,0.71,0.795,0.776,0.755,0.78,0.754,0.756,0.739,0.727,0.7,0.72,0.727,0.561,0.72,0.77,0.718,0.738,0.777,0.742,0.751,0.773,0.738,0.665,0.539,0.657,0.667,0.575,0.598,0.647,0.534,0.795,0.779,0.795,0.797,0.803,0.796,0.783,0.8,0.791,0.808,0.695,0.718,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.682,0.687,0.708,0.736,0.71,0.726,0.561,0.724,0.752,0.758,0.611,0.656,0.681,0.572,0.761,0.785,0.81,0.754,0.798,0.741,0.71,0.795,0.776,0.755,0.78,0.754,0.756,0.739,0.727,0.7,0.72,0.727,0.561,0.72,0.77,0.718,0.738,0.777,0.742,0.751,0.773,0.738,0.665,0.539,0.657,0.667,0.575,0.598,0.647,0.534,0.795,0.779,0.795,0.797,0.803,0.796,0.783,0.8,0.791,0.808,0.695,0.718,Activity,RL40A_YEAST,Medium,Eukaryote
+0.772,0.804,0.803,0.809,0.812,0.813,0.531,0.719,0.808,0.807,0.806,0.803,0.81,0.535,0.782,0.801,0.809,0.807,0.81,0.808,0.779,0.794,0.765,0.755,0.786,0.802,0.802,0.801,0.795,0.809,0.799,0.776,0.584,0.77,0.777,0.718,0.794,0.801,0.777,0.818,0.819,0.815,0.769,0.534,0.806,0.792,0.657,0.777,0.65,0.597,0.787,0.781,0.778,0.792,0.787,0.782,0.786,0.786,0.787,0.793,0.815,0.772,Activity,RNC_ECOLI,Medium,Prokaryote
+0.818,0.857,0.864,0.875,0.833,0.856,0.841,0.829,0.85,0.87,0.891,0.898,0.902,0.876,0.89,0.901,0.894,0.88,0.85,0.884,0.865,0.886,0.889,0.871,0.882,0.894,0.89,0.891,0.87,0.897,0.869,0.834,0.836,0.826,0.881,0.881,0.874,0.888,0.887,0.873,0.873,0.869,0.888,0.854,0.898,0.887,0.875,0.847,0.925,0.889,0.879,0.872,0.88,0.887,0.884,0.881,0.89,0.886,0.892,0.89,0.913,0.906,Stability,RPC1_BP434,High,Virus
+0.66,0.748,0.786,0.798,0.774,0.772,0.616,0.786,0.774,0.796,0.81,0.792,0.808,0.67,0.669,0.732,0.83,0.852,0.856,0.758,0.631,0.711,0.735,0.766,0.629,0.732,0.712,0.7,0.848,0.762,0.882,0.859,0.563,0.594,0.714,0.799,0.658,0.713,0.792,0.756,0.774,0.804,0.684,0.678,0.737,0.692,0.653,0.728,0.73,0.631,0.834,0.812,0.82,0.837,0.822,0.835,0.83,0.823,0.816,0.836,0.822,0.721,Activity,RPC1_LAMBD,High,Virus
+0.66,0.748,0.786,0.798,0.774,0.772,0.616,0.786,0.774,0.796,0.81,0.792,0.808,0.67,0.669,0.732,0.83,0.852,0.856,0.758,0.631,0.711,0.735,0.766,0.629,0.732,0.712,0.7,0.848,0.762,0.882,0.859,0.563,0.594,0.714,0.799,0.658,0.713,0.792,0.756,0.774,0.804,0.684,0.678,0.737,0.692,0.653,0.728,0.73,0.631,0.834,0.812,0.82,0.837,0.822,0.835,0.83,0.823,0.816,0.836,0.822,0.721,Activity,RPC1_LAMBD,High,Virus
+0.698,0.672,0.685,0.682,0.672,0.675,0.587,0.62,0.71,0.71,0.734,0.709,0.701,0.636,0.657,0.758,0.714,0.702,0.67,0.663,0.601,0.652,0.667,0.658,0.657,0.672,0.7,0.679,0.686,0.701,0.726,0.723,0.548,0.711,0.659,0.662,0.725,0.687,0.69,0.703,0.679,0.673,0.665,0.62,0.719,0.683,0.801,0.723,0.817,0.768,0.71,0.709,0.715,0.713,0.709,0.708,0.719,0.726,0.71,0.719,0.72,0.821,Stability,RS15_GEOSE,Medium,Prokaryote
+0.774,0.8,0.818,0.824,0.825,0.832,0.712,0.8,0.837,0.837,0.829,0.841,0.867,0.768,0.79,0.796,0.845,0.829,0.817,0.52,0.78,0.838,0.849,0.837,0.8,0.849,0.856,0.842,0.82,0.835,0.833,0.772,0.686,0.795,0.843,0.835,0.812,0.851,0.851,0.845,0.857,0.854,0.773,0.644,0.825,0.801,0.73,0.797,0.788,0.59,0.82,0.809,0.808,0.822,0.832,0.823,0.823,0.829,0.827,0.833,0.832,0.808,Expression,S22A1_HUMAN,Medium,Human
+0.774,0.8,0.818,0.824,0.825,0.832,0.712,0.8,0.837,0.837,0.829,0.841,0.867,0.768,0.79,0.796,0.845,0.829,0.817,0.52,0.78,0.838,0.849,0.837,0.8,0.849,0.856,0.842,0.82,0.835,0.833,0.772,0.686,0.795,0.843,0.835,0.812,0.851,0.851,0.845,0.857,0.854,0.773,0.644,0.825,0.801,0.73,0.797,0.788,0.59,0.82,0.809,0.808,0.822,0.832,0.823,0.823,0.829,0.827,0.833,0.832,0.808,Activity,S22A1_HUMAN,Medium,Human
+0.537,0.586,0.693,0.677,0.692,0.687,0.702,0.788,0.693,0.701,0.762,0.765,0.778,0.593,0.739,0.757,0.765,0.786,0.732,0.723,0.771,0.756,0.75,0.766,0.776,0.786,0.768,0.771,0.769,0.785,0.764,0.733,0.703,0.736,0.778,0.773,0.714,0.758,0.746,0.698,0.719,0.719,0.768,0.459,0.787,0.774,0.647,0.658,0.709,0.767,0.752,0.693,0.638,0.704,0.699,0.716,0.713,0.742,0.735,0.714,0.803,0.813,Stability,SAV1_MOUSE,High,Eukaryote
+0.606,0.599,0.685,0.666,0.714,0.699,0.607,0.603,0.737,0.763,0.661,0.626,0.642,0.614,0.629,0.687,0.789,0.833,0.674,0.68,0.61,0.63,0.64,0.639,0.607,0.619,0.611,0.629,0.688,0.789,0.798,0.75,0.602,0.589,0.598,0.606,0.641,0.646,0.648,0.674,0.681,0.685,0.629,0.608,0.659,0.655,0.836,0.828,0.854,0.812,0.768,0.765,0.782,0.784,0.781,0.782,0.794,0.79,0.785,0.787,0.848,0.785,Stability,SBI_STAAM,Medium,Prokaryote
+0.71,0.745,0.728,0.734,0.771,0.78,0.686,0.766,0.8,0.805,0.788,0.782,0.787,0.588,0.675,0.77,0.792,0.793,0.78,0.795,0.764,0.771,0.762,0.761,0.771,0.774,0.773,0.776,0.771,0.788,0.745,0.693,0.683,0.772,0.77,0.759,0.783,0.788,0.781,0.793,0.796,0.791,0.762,0.564,0.79,0.782,0.746,0.778,0.762,0.598,0.778,0.783,0.788,0.791,0.793,0.788,0.788,0.79,0.783,0.793,0.808,0.782,Activity,SC6A4_HUMAN,Medium,Human
+0.535,0.56,0.633,0.63,0.654,0.641,0.552,0.53,0.632,0.63,0.62,0.618,0.607,0.585,0.625,0.613,0.638,0.67,0.675,0.634,0.538,0.563,0.549,0.555,0.564,0.576,0.56,0.578,0.585,0.629,0.638,0.602,0.49,0.522,0.534,0.565,0.566,0.564,0.574,0.626,0.62,0.628,0.592,0.566,0.623,0.607,0.763,0.758,0.763,0.733,0.692,0.699,0.699,0.72,0.686,0.715,0.689,0.688,0.667,0.702,0.797,0.733,Stability,SCIN_STAAR,High,Prokaryote
+0.537,0.543,0.558,0.558,0.553,0.559,0.548,0.51,0.569,0.583,0.589,0.613,0.567,0.623,0.591,0.582,0.594,0.569,0.554,0.579,0.546,0.557,0.567,0.567,0.536,0.547,0.548,0.543,0.58,0.545,0.588,0.582,0.532,0.534,0.525,0.532,0.537,0.53,0.529,0.553,0.552,0.556,0.542,0.585,0.574,0.585,0.529,0.561,0.527,0.504,0.573,0.564,0.574,0.563,0.543,0.566,0.578,0.572,0.575,0.57,0.608,0.574,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+0.944,0.971,0.971,0.969,0.973,0.973,0.59,0.86,0.959,0.962,0.969,0.939,0.952,0.72,0.734,0.947,0.955,0.952,0.957,0.954,0.628,0.652,0.652,0.77,0.643,0.799,0.565,0.776,0.862,0.967,0.965,0.962,0.353,0.579,0.738,0.808,0.934,0.936,0.908,0.962,0.96,0.947,0.576,0.497,0.818,0.562,0.841,0.827,0.961,0.952,0.973,0.97,0.969,0.974,0.972,0.972,0.974,0.971,0.975,0.974,0.961,0.967,Stability,SDA_BACSU,Medium,Prokaryote
+0.678,0.722,0.759,0.759,0.745,0.75,0.509,0.715,0.774,0.778,0.747,0.757,0.76,0.587,0.692,0.754,0.765,0.776,0.76,0.755,0.736,0.744,0.746,0.735,0.738,0.755,0.747,0.755,0.745,0.744,0.753,0.732,0.588,0.747,0.753,0.749,0.748,0.754,0.757,0.757,0.759,0.759,0.651,0.519,0.759,0.725,0.696,0.735,0.681,0.593,0.762,0.756,0.764,0.768,0.771,0.767,0.767,0.766,0.765,0.772,0.765,0.727,OrganismalFitness,SERC_HUMAN,High,Human
+0.599,0.664,0.689,0.69,0.685,0.69,0.606,0.679,0.711,0.711,0.696,0.689,0.702,0.613,0.618,0.627,0.711,0.695,0.648,0.694,0.624,0.679,0.693,0.677,0.626,0.693,0.692,0.698,0.678,0.704,0.704,0.677,0.574,0.621,0.651,0.687,0.627,0.648,0.682,0.682,0.682,0.698,0.623,0.611,0.713,0.639,0.65,0.681,0.638,0.545,0.685,0.682,0.691,0.693,0.697,0.699,0.689,0.695,0.69,0.697,0.656,0.639,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+0.592,0.618,0.628,0.627,0.639,0.634,0.694,0.584,0.596,0.613,0.62,0.659,0.663,0.644,0.687,0.661,0.66,0.625,0.624,0.615,0.576,0.587,0.586,0.599,0.631,0.632,0.596,0.626,0.599,0.611,0.535,0.517,0.609,0.561,0.63,0.62,0.593,0.609,0.615,0.635,0.631,0.631,0.63,0.572,0.598,0.671,0.756,0.671,0.768,0.724,0.674,0.691,0.674,0.672,0.668,0.693,0.67,0.671,0.665,0.68,0.664,0.737,Stability,SOX30_HUMAN,High,Human
+0.748,0.788,0.799,0.788,0.822,0.822,0.411,0.796,0.744,0.744,0.7,0.434,0.456,0.436,0.438,0.427,0.457,0.447,0.468,0.794,0.386,0.48,0.586,0.546,0.428,0.447,0.504,0.674,0.733,0.811,0.736,0.75,0.476,0.461,0.471,0.505,0.744,0.738,0.737,0.82,0.818,0.812,0.387,0.418,0.411,0.445,0.753,0.72,0.871,0.785,0.761,0.756,0.743,0.766,0.753,0.755,0.75,0.745,0.757,0.755,0.768,0.687,Stability,SPA_STAAU,Medium,Prokaryote
+0.562,0.619,0.554,0.564,0.616,0.624,0.546,0.535,0.587,0.635,0.682,0.653,0.634,0.648,0.655,0.672,0.694,0.694,0.724,0.657,0.623,0.602,0.599,0.609,0.602,0.633,0.618,0.613,0.646,0.633,0.728,0.705,0.514,0.624,0.583,0.628,0.631,0.611,0.641,0.636,0.617,0.645,0.522,0.472,0.62,0.546,0.643,0.619,0.718,0.601,0.718,0.698,0.702,0.732,0.716,0.704,0.738,0.732,0.739,0.727,0.723,0.694,Binding,SPG1_STRSG,Low,Prokaryote
+0.562,0.619,0.554,0.564,0.616,0.624,0.546,0.535,0.587,0.635,0.682,0.653,0.634,0.648,0.655,0.672,0.694,0.694,0.724,0.657,0.623,0.602,0.599,0.609,0.602,0.633,0.618,0.613,0.646,0.633,0.728,0.705,0.514,0.624,0.583,0.628,0.631,0.611,0.641,0.636,0.617,0.645,0.522,0.472,0.62,0.546,0.643,0.619,0.718,0.601,0.718,0.698,0.702,0.732,0.716,0.704,0.738,0.732,0.739,0.727,0.723,0.694,Binding,SPG1_STRSG,Low,Prokaryote
+0.562,0.619,0.554,0.564,0.616,0.624,0.546,0.535,0.587,0.635,0.682,0.653,0.634,0.648,0.655,0.672,0.694,0.694,0.724,0.657,0.623,0.602,0.599,0.609,0.602,0.633,0.618,0.613,0.646,0.633,0.728,0.705,0.514,0.624,0.583,0.628,0.631,0.611,0.641,0.636,0.617,0.645,0.522,0.472,0.62,0.546,0.643,0.619,0.718,0.601,0.718,0.698,0.702,0.732,0.716,0.704,0.738,0.732,0.739,0.727,0.723,0.694,Binding,SPG1_STRSG,Medium,Prokaryote
+0.562,0.619,0.554,0.564,0.616,0.624,0.546,0.535,0.587,0.635,0.682,0.653,0.634,0.648,0.655,0.672,0.694,0.694,0.724,0.657,0.623,0.602,0.599,0.609,0.602,0.633,0.618,0.613,0.646,0.633,0.728,0.705,0.514,0.624,0.583,0.628,0.631,0.611,0.641,0.636,0.617,0.645,0.522,0.472,0.62,0.546,0.643,0.619,0.718,0.601,0.718,0.698,0.702,0.732,0.716,0.704,0.738,0.732,0.739,0.727,0.723,0.694,Binding,SPG1_STRSG,Medium,Prokaryote
+0.767,0.811,0.804,0.823,0.84,0.839,0.704,0.705,0.818,0.793,0.785,0.752,0.744,0.719,0.756,0.769,0.792,0.789,0.81,0.8,0.736,0.757,0.744,0.739,0.705,0.771,0.712,0.772,0.796,0.866,0.842,0.835,0.416,0.722,0.766,0.778,0.697,0.718,0.743,0.833,0.836,0.842,0.647,0.564,0.66,0.642,0.761,0.715,0.902,0.837,0.838,0.839,0.851,0.85,0.835,0.853,0.848,0.861,0.847,0.85,0.853,0.827,Stability,SPG2_STRSG,Medium,Prokaryote
+0.588,0.63,0.562,0.613,0.684,0.684,0.465,0.682,0.702,0.71,0.493,0.457,0.472,0.464,0.471,0.481,0.482,0.49,0.528,0.67,0.653,0.682,0.68,0.688,0.69,0.672,0.654,0.69,0.655,0.66,0.651,0.673,0.59,0.663,0.665,0.682,0.665,0.668,0.667,0.698,0.7,0.706,0.477,0.488,0.481,0.488,0.774,0.759,0.719,0.59,0.63,0.637,0.629,0.672,0.647,0.668,0.65,0.66,0.635,0.66,0.696,0.59,Binding,SPIKE_SARS2,Medium,Virus
+0.588,0.63,0.562,0.613,0.684,0.684,0.465,0.682,0.702,0.71,0.493,0.457,0.472,0.464,0.471,0.481,0.482,0.49,0.528,0.67,0.653,0.682,0.68,0.688,0.69,0.672,0.654,0.69,0.655,0.66,0.651,0.673,0.59,0.663,0.665,0.682,0.665,0.668,0.667,0.698,0.7,0.706,0.477,0.488,0.481,0.488,0.774,0.759,0.719,0.59,0.63,0.637,0.629,0.672,0.647,0.668,0.65,0.66,0.635,0.66,0.696,0.59,Expression,SPIKE_SARS2,Medium,Virus
+0.794,0.805,0.81,0.778,0.798,0.787,0.605,0.745,0.766,0.788,0.797,0.816,0.807,0.444,0.797,0.801,0.796,0.857,0.822,0.788,0.782,0.808,0.789,0.782,0.786,0.789,0.814,0.767,0.778,0.787,0.791,0.792,0.711,0.746,0.777,0.766,0.808,0.813,0.804,0.792,0.795,0.791,0.736,0.402,0.723,0.736,0.702,0.693,0.809,0.802,0.824,0.819,0.825,0.822,0.828,0.824,0.818,0.816,0.825,0.825,0.83,0.767,Stability,SPTN1_CHICK,High,Eukaryote
+0.678,0.75,0.781,0.788,0.785,0.787,0.529,0.686,0.768,0.782,0.729,0.654,0.714,0.574,0.708,0.745,0.804,0.815,0.753,0.796,0.59,0.701,0.715,0.718,0.64,0.743,0.76,0.724,0.741,0.809,0.786,0.77,0.576,0.571,0.696,0.735,0.727,0.741,0.768,0.791,0.78,0.791,0.582,0.538,0.779,0.771,0.832,0.816,0.813,0.823,0.815,0.786,0.818,0.811,0.813,0.812,0.816,0.804,0.798,0.818,0.824,0.725,Stability,SQSTM_MOUSE,Medium,Eukaryote
+0.82,0.856,0.869,0.869,0.873,0.87,0.382,0.779,0.844,0.831,0.878,0.842,0.851,0.347,0.853,0.878,0.863,0.877,0.864,0.843,0.419,0.682,0.687,0.762,0.76,0.801,0.825,0.819,0.823,0.865,0.866,0.848,0.686,0.504,0.575,0.678,0.853,0.851,0.855,0.868,0.868,0.865,0.794,0.321,0.871,0.825,0.567,0.81,0.859,0.864,0.867,0.864,0.861,0.874,0.878,0.867,0.862,0.862,0.87,0.87,0.917,0.891,Stability,SR43C_ARATH,High,Eukaryote
+0.711,0.786,0.866,0.866,0.886,0.883,0.788,0.697,0.893,0.893,0.837,0.859,0.876,0.767,0.867,0.881,0.888,0.842,0.867,0.852,0.866,0.846,0.853,0.839,0.845,0.842,0.849,0.841,0.832,0.864,0.887,0.868,0.809,0.825,0.844,0.841,0.858,0.875,0.875,0.878,0.889,0.887,0.69,0.574,0.786,0.769,0.786,0.778,0.823,0.842,0.896,0.886,0.892,0.889,0.897,0.894,0.896,0.889,0.897,0.899,0.887,0.847,Stability,SRBS1_HUMAN,High,Human
+0.766,0.771,0.757,0.763,0.773,0.773,0.775,0.744,0.768,0.783,0.762,0.802,0.814,0.714,0.737,0.746,0.792,0.772,0.762,0.76,0.736,0.725,0.725,0.701,0.741,0.754,0.735,0.724,0.676,0.788,0.807,0.8,0.725,0.717,0.722,0.681,0.759,0.762,0.751,0.778,0.778,0.774,0.793,0.695,0.769,0.787,0.655,0.66,0.704,0.544,0.772,0.755,0.758,0.771,0.774,0.772,0.774,0.777,0.775,0.779,0.772,0.742,Activity,SRC_HUMAN,Medium,Human
+0.766,0.771,0.757,0.763,0.773,0.773,0.775,0.744,0.768,0.783,0.762,0.802,0.814,0.714,0.737,0.746,0.792,0.772,0.762,0.76,0.736,0.725,0.725,0.701,0.741,0.754,0.735,0.724,0.676,0.788,0.807,0.8,0.725,0.717,0.722,0.681,0.759,0.762,0.751,0.778,0.778,0.774,0.793,0.695,0.769,0.787,0.655,0.66,0.704,0.544,0.772,0.755,0.758,0.771,0.774,0.772,0.774,0.777,0.775,0.779,0.772,0.742,Activity,SRC_HUMAN,Medium,Human
+0.766,0.771,0.757,0.763,0.773,0.773,0.775,0.744,0.768,0.783,0.762,0.802,0.814,0.714,0.737,0.746,0.792,0.772,0.762,0.76,0.736,0.725,0.725,0.701,0.741,0.754,0.735,0.724,0.676,0.788,0.807,0.8,0.725,0.717,0.722,0.681,0.759,0.762,0.751,0.778,0.778,0.774,0.793,0.695,0.769,0.787,0.655,0.66,0.704,0.544,0.772,0.755,0.758,0.771,0.774,0.772,0.774,0.777,0.775,0.779,0.772,0.742,OrganismalFitness,SRC_HUMAN,Medium,Human
+0.718,0.703,0.735,0.747,0.778,0.773,0.58,0.755,0.757,0.723,0.753,0.78,0.8,0.645,0.8,0.818,0.802,0.717,0.686,0.808,0.631,0.743,0.769,0.758,0.786,0.732,0.777,0.757,0.684,0.767,0.77,0.763,0.628,0.657,0.788,0.689,0.743,0.805,0.736,0.763,0.809,0.763,0.803,0.548,0.716,0.815,0.808,0.751,0.814,0.757,0.79,0.764,0.797,0.792,0.79,0.802,0.783,0.796,0.786,0.801,0.777,0.817,OrganismalFitness,SUMO1_HUMAN,High,Human
+0.61,0.647,0.668,0.676,0.671,0.676,0.667,0.719,0.703,0.697,0.774,0.775,0.772,0.669,0.7,0.699,0.702,0.748,0.753,0.753,0.683,0.735,0.698,0.697,0.654,0.717,0.731,0.689,0.687,0.755,0.736,0.712,0.579,0.694,0.755,0.72,0.658,0.724,0.692,0.679,0.717,0.697,0.665,0.617,0.719,0.697,0.495,0.634,0.519,0.472,0.648,0.64,0.638,0.657,0.687,0.659,0.678,0.671,0.692,0.67,0.574,0.544,OrganismalFitness,SYUA_HUMAN,Medium,Human
+0.555,0.538,0.554,0.552,0.547,0.545,0.62,0.503,0.544,0.542,0.512,0.525,0.524,0.552,0.517,0.516,0.447,0.502,0.533,0.474,0.568,0.532,0.504,0.502,0.582,0.551,0.517,0.525,0.498,0.525,0.554,0.57,0.471,0.595,0.619,0.567,0.571,0.59,0.569,0.556,0.566,0.563,0.538,0.584,0.499,0.528,0.638,0.524,0.59,0.529,0.568,0.508,0.503,0.521,0.503,0.523,0.532,0.502,0.469,0.517,0.518,0.549,OrganismalFitness,TADBP_HUMAN,Low,Human
+0.707,0.64,0.678,0.687,0.721,0.708,0.439,0.774,0.673,0.685,0.616,0.738,0.74,0.479,0.499,0.498,0.522,0.461,0.529,0.646,0.76,0.758,0.775,0.781,0.772,0.68,0.591,0.67,0.649,0.756,0.787,0.779,0.709,0.767,0.649,0.639,0.765,0.688,0.665,0.757,0.709,0.685,0.509,0.465,0.712,0.625,0.65,0.702,0.726,0.64,0.58,0.582,0.598,0.583,0.576,0.585,0.573,0.577,0.571,0.586,0.607,0.581,OrganismalFitness,TAT_HV1BR,High,Virus
+0.837,0.852,0.88,0.885,0.887,0.888,0.883,0.686,0.892,0.918,0.874,0.922,0.933,0.908,0.914,0.94,0.931,0.915,0.929,0.837,0.785,0.843,0.837,0.865,0.854,0.861,0.888,0.884,0.851,0.909,0.899,0.886,0.84,0.777,0.85,0.885,0.895,0.907,0.91,0.897,0.908,0.906,0.905,0.588,0.827,0.901,0.853,0.797,0.928,0.917,0.918,0.907,0.898,0.931,0.928,0.931,0.922,0.93,0.919,0.928,0.934,0.935,Stability,TCRG1_MOUSE,Medium,Eukaryote
+0.707,0.76,0.81,0.805,0.811,0.81,0.509,0.561,0.851,0.845,0.836,0.804,0.828,0.576,0.848,0.849,0.84,0.851,0.857,0.831,0.441,0.692,0.695,0.668,0.683,0.717,0.618,0.75,0.795,0.852,0.853,0.843,0.775,0.469,0.721,0.686,0.784,0.816,0.809,0.803,0.825,0.824,0.641,0.547,0.797,0.711,0.796,0.796,0.812,0.868,0.86,0.851,0.866,0.866,0.87,0.856,0.858,0.855,0.858,0.861,0.826,0.775,Stability,THO1_YEAST,High,Eukaryote
+0.712,0.742,0.774,0.767,0.769,0.769,0.422,0.71,0.795,0.795,0.757,0.762,0.765,0.505,0.76,0.791,0.778,0.776,0.785,0.752,0.643,0.691,0.709,0.707,0.696,0.726,0.69,0.704,0.719,0.779,0.753,0.732,0.602,0.536,0.67,0.682,0.767,0.766,0.763,0.786,0.775,0.774,0.653,0.465,0.781,0.735,0.733,0.813,0.839,0.816,0.772,0.783,0.776,0.772,0.772,0.767,0.774,0.77,0.772,0.777,0.814,0.8,Stability,TNKS2_HUMAN,High,Human
+0.622,0.631,0.626,0.629,0.632,0.631,0.547,0.639,0.662,0.652,0.658,0.653,0.675,0.571,0.618,0.644,0.684,0.68,0.706,0.64,0.555,0.575,0.618,0.646,0.569,0.647,0.652,0.635,0.669,0.641,0.642,0.606,0.563,0.571,0.614,0.669,0.632,0.638,0.671,0.64,0.64,0.655,0.578,0.56,0.673,0.601,0.617,0.645,0.633,0.559,0.636,0.655,0.649,0.655,0.663,0.664,0.656,0.658,0.66,0.664,0.652,0.622,OrganismalFitness,TPK1_HUMAN,Medium,Human
+0.722,0.754,0.773,0.782,0.778,0.787,0.656,0.761,0.786,0.787,0.812,0.785,0.802,0.706,0.751,0.814,0.807,0.756,0.749,0.779,0.694,0.725,0.761,0.766,0.742,0.777,0.757,0.746,0.736,0.805,0.802,0.779,0.708,0.698,0.768,0.719,0.785,0.797,0.765,0.799,0.805,0.783,0.736,0.637,0.797,0.777,0.779,0.788,0.807,0.641,0.782,0.785,0.791,0.776,0.795,0.793,0.795,0.79,0.798,0.798,0.827,0.778,Expression,TPMT_HUMAN,Medium,Human
+0.723,0.695,0.677,0.655,0.658,0.667,0.765,0.752,0.718,0.721,0.739,0.732,0.739,0.694,0.748,0.704,0.685,0.736,0.678,0.677,0.681,0.737,0.714,0.726,0.749,0.634,0.806,0.78,0.708,0.741,0.751,0.63,0.747,0.756,0.73,0.751,0.776,0.768,0.773,0.75,0.739,0.757,0.709,0.712,0.717,0.738,0.578,0.69,0.681,0.511,0.702,0.676,0.722,0.701,0.684,0.699,0.739,0.643,0.672,0.699,0.749,0.752,OrganismalFitness,TPOR_HUMAN,Low,Human
+0.832,0.841,0.818,0.827,0.829,0.834,0.63,0.804,0.873,0.88,0.858,0.849,0.869,0.689,0.817,0.858,0.864,0.86,0.847,0.835,0.758,0.832,0.8,0.832,0.792,0.798,0.817,0.803,0.799,0.808,0.846,0.808,0.62,0.804,0.829,0.827,0.834,0.848,0.844,0.841,0.854,0.847,0.763,0.539,0.85,0.8,0.674,0.788,0.793,0.564,0.842,0.833,0.838,0.867,0.851,0.85,0.853,0.86,0.847,0.863,0.858,0.836,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+0.768,0.78,0.76,0.764,0.76,0.762,0.691,0.748,0.794,0.79,0.772,0.797,0.807,0.661,0.796,0.797,0.794,0.793,0.811,0.773,0.728,0.76,0.762,0.783,0.756,0.763,0.746,0.744,0.787,0.753,0.765,0.732,0.603,0.731,0.784,0.775,0.762,0.784,0.779,0.77,0.783,0.777,0.757,0.558,0.79,0.754,0.64,0.721,0.73,0.52,0.784,0.751,0.765,0.784,0.784,0.757,0.777,0.777,0.765,0.783,0.814,0.794,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+0.697,0.771,0.796,0.801,0.788,0.794,0.477,0.72,0.787,0.791,0.731,0.773,0.79,0.498,0.499,0.716,0.756,0.786,0.803,0.816,0.625,0.727,0.746,0.732,0.734,0.76,0.759,0.747,0.745,0.766,0.757,0.745,0.551,0.63,0.727,0.74,0.678,0.752,0.762,0.749,0.792,0.803,0.648,0.493,0.733,0.774,0.684,0.695,0.754,0.643,0.736,0.727,0.72,0.735,0.73,0.731,0.735,0.739,0.741,0.738,0.715,0.736,OrganismalFitness,UBC9_HUMAN,Medium,Human
+0.703,0.811,0.829,0.828,0.834,0.833,0.628,0.78,0.81,0.824,0.815,0.845,0.837,0.751,0.81,0.854,0.843,0.847,0.858,0.818,0.66,0.792,0.795,0.801,0.784,0.803,0.787,0.789,0.807,0.83,0.863,0.852,0.813,0.612,0.615,0.757,0.772,0.769,0.784,0.833,0.831,0.823,0.763,0.484,0.72,0.726,0.805,0.763,0.845,0.755,0.826,0.843,0.838,0.845,0.845,0.84,0.839,0.843,0.842,0.843,0.861,0.813,Stability,UBE4B_HUMAN,High,Human
+0.779,0.799,0.824,0.831,0.842,0.844,0.536,0.594,0.796,0.805,0.762,0.838,0.846,0.817,0.838,0.847,0.834,0.786,0.752,0.798,0.59,0.764,0.749,0.732,0.837,0.8,0.786,0.775,0.731,0.826,0.845,0.825,0.534,0.492,0.58,0.698,0.782,0.749,0.776,0.834,0.828,0.842,0.815,0.499,0.814,0.826,0.694,0.773,0.469,0.54,0.814,0.798,0.832,0.836,0.82,0.834,0.813,0.829,0.826,0.833,0.814,0.791,Activity,UBE4B_MOUSE,Low,Eukaryote
+0.735,0.832,0.832,0.834,0.837,0.845,0.601,0.665,0.798,0.816,0.813,0.742,0.791,0.627,0.635,0.587,0.588,0.84,0.835,0.795,0.754,0.749,0.777,0.804,0.765,0.784,0.795,0.789,0.801,0.815,0.784,0.717,0.669,0.668,0.758,0.742,0.76,0.786,0.784,0.845,0.846,0.845,0.555,0.552,0.707,0.589,0.784,0.716,0.854,0.82,0.837,0.853,0.858,0.86,0.86,0.864,0.852,0.858,0.849,0.858,0.871,0.826,Stability,UBR5_HUMAN,Medium,Human
+0.583,0.623,0.591,0.609,0.612,0.609,0.618,0.467,0.688,0.681,0.831,0.785,0.79,0.674,0.761,0.808,0.844,0.817,0.85,0.566,0.701,0.765,0.747,0.813,0.715,0.736,0.729,0.752,0.815,0.766,0.782,0.715,0.487,0.652,0.76,0.78,0.659,0.7,0.753,0.655,0.684,0.709,0.737,0.637,0.781,0.752,0.742,0.783,0.792,0.759,0.84,0.821,0.842,0.844,0.842,0.86,0.822,0.84,0.857,0.849,0.819,0.809,Stability,VG08_BPP22,High,Virus
+0.687,0.772,0.869,0.894,0.896,0.897,0.552,0.811,0.857,0.874,0.914,0.886,0.896,0.695,0.618,0.904,0.912,0.88,0.817,0.862,0.668,0.757,0.842,0.812,0.793,0.826,0.812,0.792,0.843,0.88,0.919,0.913,0.692,0.683,0.791,0.789,0.812,0.84,0.831,0.894,0.898,0.888,0.626,0.66,0.79,0.795,0.816,0.793,0.922,0.861,0.92,0.921,0.925,0.925,0.919,0.927,0.919,0.922,0.924,0.925,0.909,0.891,Stability,VILI_CHICK,High,Eukaryote
+0.704,0.716,0.72,0.731,0.738,0.744,0.563,0.715,0.757,0.764,0.744,0.748,0.76,0.569,0.697,0.733,0.758,0.76,0.756,0.748,0.59,0.602,0.684,0.708,0.632,0.736,0.731,0.72,0.73,0.75,0.73,0.692,0.578,0.568,0.631,0.742,0.721,0.73,0.763,0.747,0.754,0.768,0.579,0.555,0.748,0.663,0.678,0.716,0.736,0.592,0.76,0.737,0.754,0.762,0.762,0.754,0.758,0.752,0.754,0.762,0.762,0.718,Expression,VKOR1_HUMAN,Medium,Human
+0.704,0.716,0.72,0.731,0.738,0.744,0.563,0.715,0.757,0.764,0.744,0.748,0.76,0.569,0.697,0.733,0.758,0.76,0.756,0.748,0.59,0.602,0.684,0.708,0.632,0.736,0.731,0.72,0.73,0.75,0.73,0.692,0.578,0.568,0.631,0.742,0.721,0.73,0.763,0.747,0.754,0.768,0.579,0.555,0.748,0.663,0.678,0.716,0.736,0.592,0.76,0.737,0.754,0.762,0.762,0.754,0.758,0.752,0.754,0.762,0.762,0.718,Activity,VKOR1_HUMAN,Medium,Human
+0.422,0.514,0.587,0.615,0.55,0.572,0.548,0.603,0.75,0.748,0.801,0.722,0.733,0.669,0.755,0.811,0.86,0.874,0.813,0.616,0.488,0.567,0.575,0.636,0.613,0.622,0.57,0.622,0.664,0.563,0.809,0.769,0.638,0.514,0.514,0.553,0.468,0.474,0.456,0.528,0.537,0.49,0.656,0.521,0.819,0.728,0.833,0.812,0.84,0.842,0.854,0.845,0.862,0.861,0.857,0.869,0.858,0.872,0.853,0.868,0.878,0.837,Stability,VRPI_BPT7,Medium,Virus
+0.65,0.816,0.795,0.809,0.807,0.803,0.38,0.783,0.781,0.804,0.798,0.618,0.733,0.432,0.574,0.791,0.813,0.844,0.851,0.8,0.459,0.404,0.435,0.633,0.431,0.409,0.373,0.498,0.771,0.803,0.854,0.84,0.471,0.363,0.451,0.75,0.677,0.682,0.766,0.791,0.788,0.806,0.459,0.363,0.582,0.479,0.709,0.691,0.813,0.767,0.794,0.819,0.794,0.813,0.808,0.819,0.818,0.835,0.844,0.82,0.805,0.743,Stability,YAIA_ECOLI,Medium,Prokaryote
+0.726,0.674,0.743,0.742,0.735,0.738,0.673,0.654,0.524,0.534,0.677,0.649,0.654,0.716,0.736,0.734,0.74,0.698,0.667,0.607,0.598,0.597,0.588,0.592,0.657,0.623,0.637,0.612,0.586,0.677,0.729,0.74,0.569,0.66,0.605,0.618,0.712,0.672,0.691,0.728,0.712,0.731,0.75,0.454,0.673,0.75,0.679,0.685,0.7,0.603,0.734,0.699,0.725,0.721,0.718,0.713,0.716,0.729,0.732,0.728,0.698,0.72,Binding,YAP1_HUMAN,Low,Human
+0.904,0.927,0.955,0.954,0.948,0.945,0.819,0.853,0.939,0.941,0.969,0.967,0.971,0.874,0.883,0.973,0.974,0.961,0.97,0.942,0.848,0.912,0.929,0.933,0.923,0.93,0.886,0.934,0.952,0.923,0.942,0.919,0.911,0.856,0.907,0.897,0.922,0.926,0.923,0.942,0.938,0.944,0.732,0.658,0.826,0.739,0.775,0.825,0.98,0.932,0.971,0.964,0.97,0.971,0.97,0.973,0.97,0.967,0.971,0.971,0.959,0.964,Stability,YNZC_BACSU,Medium,Prokaryote
+0.702,0.73,0.736,0.742,0.751,0.755,0.594,0.699,0.743,0.751,0.735,0.721,0.737,0.61,0.674,0.722,0.744,0.741,0.738,0.72,0.67,0.701,0.709,0.714,0.691,0.719,0.715,0.719,0.729,0.762,0.761,0.741,0.608,0.665,0.694,0.713,0.736,0.741,0.749,0.759,0.761,0.763,0.646,0.545,0.714,0.677,0.706,0.725,0.736,0.658,0.754,0.748,0.753,0.758,0.757,0.758,0.756,0.757,0.757,0.762,0.761,0.725,,,,
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.csv b/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.csv
new file mode 100644
index 0000000..dfcbf9d
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.csv
@@ -0,0 +1,63 @@
+Model_rank,Model_name,Model type,Average_AUC,Bootstrap_standard_error_AUC,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,TranceptEVE L,Hybrid - Alignment & PLM,0.751,0.0,0.764,0.701,0.75,0.756,0.783,0.746,0.762,0.772,0.765,0.782,0.755,0.746,0.751,0.683,0.703,0.688,0.739,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+2,SaProt (650M),Hybrid - Structure & PLM,0.751,0.005,0.747,0.711,0.766,0.703,0.826,0.721,0.75,0.796,0.768,0.788,0.777,0.671,0.755,0.705,0.679,0.665,0.719,SaProt (650M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+3,TranceptEVE M,Hybrid - Alignment & PLM,0.749,0.002,0.759,0.707,0.747,0.753,0.78,0.74,0.761,0.768,0.765,0.781,0.75,0.737,0.749,0.681,0.674,0.678,0.732,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+4,GEMME,Alignment-based model,0.749,0.004,0.759,0.704,0.737,0.752,0.792,0.744,0.762,0.775,0.762,0.783,0.758,0.754,0.754,0.675,0.693,0.694,0.77,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+5,TranceptEVE S,Hybrid - Alignment & PLM,0.748,0.002,0.756,0.712,0.741,0.75,0.779,0.745,0.756,0.767,0.764,0.777,0.751,0.733,0.746,0.68,0.675,0.678,0.731,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,ProtSSN (ensemble),Hybrid - Structure & PLM,0.747,0.003,0.752,0.702,0.739,0.722,0.817,0.723,0.755,0.786,0.765,0.792,0.768,0.699,0.755,0.688,0.687,0.658,0.697,ProtSSN (ensemble of 9 models),"Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+7,ProtSSN (k=20 h=1280),Hybrid - Structure & PLM,0.743,0.003,0.748,0.702,0.732,0.715,0.817,0.72,0.75,0.785,0.761,0.789,0.765,0.695,0.75,0.688,0.684,0.654,0.698,"ProtSSN (k=20, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+8,VESPA,Protein language model,0.742,0.003,0.756,0.695,0.724,0.747,0.79,0.735,0.757,0.775,0.753,0.78,0.765,0.743,0.751,0.638,0.711,0.678,0.701,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+9,ProtSSN (k=20 h=512),Hybrid - Structure & PLM,0.742,0.004,0.749,0.694,0.732,0.719,0.814,0.717,0.753,0.78,0.758,0.789,0.766,0.701,0.749,0.685,0.693,0.666,0.703,"ProtSSN (k=20, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+10,ProtSSN (k=20 h=768),Hybrid - Structure & PLM,0.741,0.004,0.748,0.691,0.734,0.716,0.814,0.715,0.75,0.782,0.76,0.789,0.763,0.692,0.748,0.689,0.682,0.65,0.7,"ProtSSN (k=20, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+11,ProtSSN (k=30 h=1280),Hybrid - Structure & PLM,0.741,0.004,0.746,0.698,0.732,0.714,0.813,0.718,0.747,0.783,0.759,0.788,0.764,0.687,0.747,0.689,0.681,0.65,0.693,"ProtSSN (k=30, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+12,ProtSSN (k=30 h=768),Hybrid - Structure & PLM,0.741,0.004,0.747,0.695,0.732,0.714,0.814,0.715,0.75,0.781,0.758,0.789,0.764,0.692,0.748,0.685,0.678,0.65,0.694,"ProtSSN (k=30, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+13,EVE (ensemble),Alignment-based model,0.741,0.003,0.753,0.702,0.723,0.748,0.777,0.729,0.754,0.765,0.755,0.772,0.754,0.732,0.741,0.676,0.672,0.672,0.721,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+14,ProtSSN (k=30 h=512),Hybrid - Structure & PLM,0.74,0.004,0.746,0.692,0.735,0.715,0.81,0.719,0.749,0.778,0.759,0.785,0.761,0.691,0.747,0.678,0.676,0.655,0.693,"ProtSSN (k=30, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+15,Tranception L,Hybrid - Alignment & PLM,0.739,0.002,0.752,0.688,0.747,0.742,0.764,0.736,0.745,0.76,0.755,0.771,0.731,0.733,0.737,0.673,0.711,0.688,0.736,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,ProtSSN (k=10 h=1280),Hybrid - Structure & PLM,0.739,0.004,0.742,0.696,0.733,0.714,0.809,0.719,0.746,0.776,0.757,0.784,0.757,0.694,0.744,0.668,0.665,0.64,0.686,"ProtSSN (k=10, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+17,EVE (single),Alignment-based model,0.737,0.002,0.748,0.696,0.721,0.745,0.774,0.726,0.751,0.763,0.75,0.77,0.752,0.729,0.738,0.672,0.669,0.669,0.723,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+18,ProtSSN (k=10 h=512),Hybrid - Structure & PLM,0.737,0.004,0.746,0.691,0.725,0.711,0.811,0.718,0.744,0.78,0.757,0.788,0.761,0.682,0.743,0.681,0.673,0.649,0.692,"ProtSSN (k=10, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+19,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.737,0.005,0.756,0.675,0.742,0.732,0.777,0.716,0.753,0.764,0.746,0.781,0.748,0.724,0.739,0.644,0.71,0.719,0.758,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+20,Tranception M,Hybrid - Alignment & PLM,0.733,0.003,0.741,0.694,0.74,0.736,0.756,0.727,0.74,0.748,0.751,0.766,0.718,0.719,0.732,0.665,0.647,0.654,0.709,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+21,ProtSSN (k=10 h=768),Hybrid - Structure & PLM,0.732,0.004,0.739,0.686,0.721,0.706,0.806,0.707,0.742,0.772,0.751,0.777,0.754,0.686,0.737,0.672,0.675,0.643,0.685,"ProtSSN (k=10, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+22,DeepSequence (ensemble),Alignment-based model,0.729,0.004,0.746,0.689,0.713,0.729,0.769,0.705,0.74,0.761,0.748,0.767,0.746,0.691,0.728,0.673,0.69,0.689,0.732,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+23,ESM-IF1,Inverse folding model,0.729,0.007,0.697,0.716,0.721,0.674,0.838,0.661,0.738,0.793,0.728,0.767,0.766,0.71,0.741,0.721,0.697,0.684,0.743,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+24,MSA Transformer (single),Hybrid - Alignment & PLM,0.729,0.004,0.746,0.666,0.733,0.729,0.769,0.707,0.746,0.756,0.742,0.774,0.737,0.716,0.731,0.639,0.71,0.712,0.754,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+25,ESM2 (650M),Protein language model,0.728,0.006,0.73,0.691,0.725,0.708,0.789,0.691,0.725,0.782,0.758,0.766,0.748,0.639,0.737,0.665,0.632,0.605,0.62,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+26,Tranception S,Hybrid - Alignment & PLM,0.728,0.003,0.734,0.7,0.728,0.73,0.749,0.733,0.731,0.742,0.745,0.758,0.715,0.712,0.724,0.666,0.654,0.659,0.711,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+27,ESM2 (3B),Protein language model,0.724,0.006,0.724,0.675,0.722,0.712,0.786,0.693,0.732,0.77,0.749,0.765,0.752,0.655,0.732,0.645,0.615,0.605,0.621,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+28,ESM-1v (ensemble),Protein language model,0.723,0.006,0.726,0.674,0.735,0.718,0.764,0.682,0.725,0.776,0.754,0.756,0.728,0.659,0.724,0.644,0.616,0.598,0.622,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+29,DeepSequence (single),Alignment-based model,0.723,0.004,0.741,0.68,0.704,0.721,0.768,0.705,0.733,0.757,0.745,0.763,0.741,0.68,0.721,0.666,0.669,0.682,0.73,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+30,SaProt (35M),Hybrid - Structure & PLM,0.722,0.005,0.702,0.702,0.736,0.662,0.807,0.683,0.716,0.771,0.748,0.764,0.727,0.626,0.728,0.72,0.627,0.613,0.657,SaProt (35M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+31,VESPAl,Protein language model,0.72,0.004,0.737,0.683,0.683,0.728,0.771,0.709,0.736,0.756,0.729,0.765,0.746,0.723,0.728,0.615,0.693,0.657,0.696,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+32,ESM2 (15B),Protein language model,0.72,0.006,0.717,0.668,0.725,0.715,0.775,0.694,0.731,0.761,0.742,0.759,0.741,0.676,0.726,0.641,0.637,0.609,0.64,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+33,ESM-1b,Protein language model,0.719,0.006,0.73,0.662,0.72,0.699,0.782,0.695,0.724,0.767,0.747,0.771,0.741,0.642,0.715,0.644,0.616,0.596,0.662,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+34,Progen2 XL,Protein language model,0.717,0.004,0.72,0.662,0.73,0.718,0.757,0.696,0.73,0.749,0.722,0.752,0.736,0.718,0.718,0.642,0.68,0.647,0.691,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+35,MIF-ST,Hybrid - Structure & PLM,0.717,0.006,0.709,0.674,0.738,0.701,0.762,0.705,0.72,0.746,0.721,0.724,0.741,0.715,0.736,0.675,0.706,0.673,0.711,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+36,EVmutation,Alignment-based model,0.716,0.003,0.737,0.666,0.707,0.727,0.741,0.713,0.737,0.723,0.728,0.746,0.729,0.707,0.711,0.673,0.683,0.674,0.757,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+37,ESM2 (150M),Protein language model,0.713,0.007,0.71,0.685,0.718,0.674,0.779,0.675,0.701,0.767,0.754,0.761,0.712,0.582,0.717,0.655,0.588,0.592,0.611,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+38,Progen2 M,Protein language model,0.711,0.004,0.713,0.666,0.735,0.715,0.726,0.678,0.716,0.737,0.735,0.73,0.699,0.681,0.709,0.61,0.596,0.588,0.599,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+39,Progen2 L,Protein language model,0.711,0.004,0.718,0.662,0.734,0.713,0.726,0.693,0.715,0.733,0.733,0.739,0.703,0.672,0.708,0.615,0.649,0.63,0.669,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+40,Progen2 Base,Protein language model,0.709,0.005,0.713,0.664,0.739,0.714,0.712,0.689,0.706,0.733,0.739,0.733,0.678,0.668,0.707,0.597,0.598,0.593,0.613,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+41,RITA XL,Protein language model,0.707,0.005,0.702,0.665,0.728,0.715,0.727,0.674,0.716,0.732,0.727,0.721,0.693,0.711,0.703,0.616,0.605,0.602,0.634,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+42,ESM-1v (single),Protein language model,0.707,0.007,0.713,0.652,0.723,0.705,0.744,0.663,0.708,0.763,0.738,0.74,0.714,0.642,0.709,0.624,0.62,0.596,0.619,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+43,Wavenet,Alignment-based model,0.707,0.006,0.709,0.673,0.691,0.706,0.757,0.664,0.723,0.75,0.723,0.737,0.729,0.685,0.703,0.648,0.654,0.631,0.669,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+44,Tranception L no retrieval,Protein language model,0.707,0.004,0.717,0.659,0.727,0.715,0.714,0.697,0.708,0.73,0.719,0.717,0.699,0.715,0.703,0.621,0.695,0.657,0.704,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+45,MIF,Inverse folding model,0.705,0.007,0.673,0.685,0.731,0.663,0.775,0.69,0.704,0.734,0.717,0.704,0.716,0.695,0.723,0.673,0.67,0.645,0.689,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+46,RITA L,Protein language model,0.703,0.005,0.698,0.66,0.729,0.711,0.718,0.676,0.709,0.724,0.727,0.72,0.675,0.704,0.698,0.609,0.589,0.585,0.62,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+47,CARP (640M),Protein language model,0.701,0.007,0.711,0.649,0.718,0.707,0.722,0.676,0.705,0.732,0.732,0.715,0.704,0.653,0.718,0.638,0.609,0.595,0.595,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+48,Site-Independent,Alignment-based model,0.696,0.005,0.699,0.684,0.687,0.714,0.695,0.73,0.708,0.67,0.71,0.713,0.674,0.698,0.69,0.657,0.625,0.643,0.721,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+49,RITA M,Protein language model,0.693,0.005,0.691,0.649,0.72,0.709,0.698,0.668,0.696,0.716,0.716,0.702,0.665,0.699,0.69,0.591,0.586,0.591,0.614,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+50,Unirep evotuned,Hybrid - Alignment & PLM,0.692,0.004,0.695,0.657,0.7,0.695,0.714,0.68,0.697,0.71,0.704,0.711,0.69,0.688,0.684,0.634,0.653,0.638,0.699,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+51,Tranception M no retrieval,Protein language model,0.691,0.005,0.687,0.656,0.721,0.702,0.687,0.661,0.695,0.705,0.71,0.694,0.666,0.681,0.685,0.596,0.581,0.583,0.611,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+52,Progen2 S,Protein language model,0.685,0.006,0.68,0.652,0.71,0.689,0.695,0.659,0.679,0.714,0.718,0.698,0.657,0.643,0.683,0.584,0.572,0.569,0.587,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+53,CARP (76M),Protein language model,0.678,0.007,0.682,0.656,0.701,0.653,0.697,0.64,0.664,0.718,0.714,0.698,0.654,0.584,0.687,0.628,0.561,0.564,0.568,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+54,ESM2 (35M),Protein language model,0.676,0.008,0.669,0.668,0.684,0.622,0.734,0.637,0.652,0.739,0.706,0.722,0.664,0.562,0.669,0.649,0.576,0.581,0.613,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+55,RITA S,Protein language model,0.667,0.006,0.659,0.652,0.684,0.68,0.66,0.651,0.668,0.679,0.687,0.659,0.629,0.682,0.659,0.583,0.575,0.576,0.598,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+56,Tranception S no retrieval,Protein language model,0.665,0.006,0.652,0.657,0.689,0.679,0.645,0.641,0.664,0.672,0.678,0.648,0.642,0.661,0.654,0.587,0.579,0.582,0.597,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+57,CARP (38M),Protein language model,0.653,0.007,0.655,0.652,0.671,0.623,0.661,0.612,0.633,0.688,0.679,0.664,0.628,0.567,0.656,0.61,0.557,0.571,0.578,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+58,ProteinMPNN,Inverse folding model,0.639,0.006,0.605,0.591,0.603,0.59,0.805,0.596,0.655,0.729,0.658,0.71,0.684,0.64,0.657,0.673,0.604,0.61,0.695,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+59,ESM2 (8M),Protein language model,0.624,0.008,0.607,0.65,0.647,0.577,0.638,0.611,0.601,0.636,0.635,0.635,0.601,0.546,0.611,0.593,0.566,0.573,0.599,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+60,Unirep,Protein language model,0.604,0.008,0.597,0.617,0.621,0.577,0.61,0.602,0.591,0.603,0.622,0.62,0.576,0.532,0.595,0.557,0.566,0.582,0.592,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+61,ProtGPT2,Protein language model,0.603,0.006,0.593,0.585,0.602,0.595,0.639,0.598,0.597,0.637,0.636,0.631,0.57,0.576,0.601,0.602,0.528,0.509,0.534,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+62,CARP (600K),Protein language model,0.557,0.009,0.555,0.551,0.596,0.531,0.549,0.552,0.545,0.548,0.568,0.539,0.531,0.526,0.556,0.533,0.524,0.548,0.557,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.html b/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.html
new file mode 100644
index 0000000..2c4b1ea
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/AUC/Summary_performance_DMS_substitutions_AUC.html
@@ -0,0 +1,1670 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_AUC |
+ Bootstrap_standard_error_AUC |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Depth_1 |
+ Depth_2 |
+ Depth_3 |
+ Depth_4 |
+ Depth_5+ |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ TranceptEVE L |
+ Hybrid - Alignment & PLM |
+ 0.751 |
+ 0.000 |
+ 0.764 |
+ 0.701 |
+ 0.750 |
+ 0.756 |
+ 0.783 |
+ 0.746 |
+ 0.762 |
+ 0.772 |
+ 0.765 |
+ 0.782 |
+ 0.755 |
+ 0.746 |
+ 0.751 |
+ 0.683 |
+ 0.703 |
+ 0.688 |
+ 0.739 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 2 |
+ SaProt (650M) |
+ Hybrid - Structure & PLM |
+ 0.751 |
+ 0.005 |
+ 0.747 |
+ 0.711 |
+ 0.766 |
+ 0.703 |
+ 0.826 |
+ 0.721 |
+ 0.750 |
+ 0.796 |
+ 0.768 |
+ 0.788 |
+ 0.777 |
+ 0.671 |
+ 0.755 |
+ 0.705 |
+ 0.679 |
+ 0.665 |
+ 0.719 |
+ SaProt (650M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 3 |
+ TranceptEVE M |
+ Hybrid - Alignment & PLM |
+ 0.749 |
+ 0.002 |
+ 0.759 |
+ 0.707 |
+ 0.747 |
+ 0.753 |
+ 0.780 |
+ 0.740 |
+ 0.761 |
+ 0.768 |
+ 0.765 |
+ 0.781 |
+ 0.750 |
+ 0.737 |
+ 0.749 |
+ 0.681 |
+ 0.674 |
+ 0.678 |
+ 0.732 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 4 |
+ GEMME |
+ Alignment-based model |
+ 0.749 |
+ 0.004 |
+ 0.759 |
+ 0.704 |
+ 0.737 |
+ 0.752 |
+ 0.792 |
+ 0.744 |
+ 0.762 |
+ 0.775 |
+ 0.762 |
+ 0.783 |
+ 0.758 |
+ 0.754 |
+ 0.754 |
+ 0.675 |
+ 0.693 |
+ 0.694 |
+ 0.770 |
+ GEMME model |
+ <a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a> |
+
+
+ 5 |
+ TranceptEVE S |
+ Hybrid - Alignment & PLM |
+ 0.748 |
+ 0.002 |
+ 0.756 |
+ 0.712 |
+ 0.741 |
+ 0.750 |
+ 0.779 |
+ 0.745 |
+ 0.756 |
+ 0.767 |
+ 0.764 |
+ 0.777 |
+ 0.751 |
+ 0.733 |
+ 0.746 |
+ 0.680 |
+ 0.675 |
+ 0.678 |
+ 0.731 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 6 |
+ ProtSSN (ensemble) |
+ Hybrid - Structure & PLM |
+ 0.747 |
+ 0.003 |
+ 0.752 |
+ 0.702 |
+ 0.739 |
+ 0.722 |
+ 0.817 |
+ 0.723 |
+ 0.755 |
+ 0.786 |
+ 0.765 |
+ 0.792 |
+ 0.768 |
+ 0.699 |
+ 0.755 |
+ 0.688 |
+ 0.687 |
+ 0.658 |
+ 0.697 |
+ ProtSSN (ensemble of 9 models) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 7 |
+ ProtSSN (k=20 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.743 |
+ 0.003 |
+ 0.748 |
+ 0.702 |
+ 0.732 |
+ 0.715 |
+ 0.817 |
+ 0.720 |
+ 0.750 |
+ 0.785 |
+ 0.761 |
+ 0.789 |
+ 0.765 |
+ 0.695 |
+ 0.750 |
+ 0.688 |
+ 0.684 |
+ 0.654 |
+ 0.698 |
+ ProtSSN (k=20, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 8 |
+ VESPA |
+ Protein language model |
+ 0.742 |
+ 0.003 |
+ 0.756 |
+ 0.695 |
+ 0.724 |
+ 0.747 |
+ 0.790 |
+ 0.735 |
+ 0.757 |
+ 0.775 |
+ 0.753 |
+ 0.780 |
+ 0.765 |
+ 0.743 |
+ 0.751 |
+ 0.638 |
+ 0.711 |
+ 0.678 |
+ 0.701 |
+ VESPA model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 9 |
+ ProtSSN (k=20 h=512) |
+ Hybrid - Structure & PLM |
+ 0.742 |
+ 0.004 |
+ 0.749 |
+ 0.694 |
+ 0.732 |
+ 0.719 |
+ 0.814 |
+ 0.717 |
+ 0.753 |
+ 0.780 |
+ 0.758 |
+ 0.789 |
+ 0.766 |
+ 0.701 |
+ 0.749 |
+ 0.685 |
+ 0.693 |
+ 0.666 |
+ 0.703 |
+ ProtSSN (k=20, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 10 |
+ ProtSSN (k=20 h=768) |
+ Hybrid - Structure & PLM |
+ 0.741 |
+ 0.004 |
+ 0.748 |
+ 0.691 |
+ 0.734 |
+ 0.716 |
+ 0.814 |
+ 0.715 |
+ 0.750 |
+ 0.782 |
+ 0.760 |
+ 0.789 |
+ 0.763 |
+ 0.692 |
+ 0.748 |
+ 0.689 |
+ 0.682 |
+ 0.650 |
+ 0.700 |
+ ProtSSN (k=20, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 11 |
+ ProtSSN (k=30 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.741 |
+ 0.004 |
+ 0.746 |
+ 0.698 |
+ 0.732 |
+ 0.714 |
+ 0.813 |
+ 0.718 |
+ 0.747 |
+ 0.783 |
+ 0.759 |
+ 0.788 |
+ 0.764 |
+ 0.687 |
+ 0.747 |
+ 0.689 |
+ 0.681 |
+ 0.650 |
+ 0.693 |
+ ProtSSN (k=30, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 12 |
+ ProtSSN (k=30 h=768) |
+ Hybrid - Structure & PLM |
+ 0.741 |
+ 0.004 |
+ 0.747 |
+ 0.695 |
+ 0.732 |
+ 0.714 |
+ 0.814 |
+ 0.715 |
+ 0.750 |
+ 0.781 |
+ 0.758 |
+ 0.789 |
+ 0.764 |
+ 0.692 |
+ 0.748 |
+ 0.685 |
+ 0.678 |
+ 0.650 |
+ 0.694 |
+ ProtSSN (k=30, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 13 |
+ EVE (ensemble) |
+ Alignment-based model |
+ 0.741 |
+ 0.003 |
+ 0.753 |
+ 0.702 |
+ 0.723 |
+ 0.748 |
+ 0.777 |
+ 0.729 |
+ 0.754 |
+ 0.765 |
+ 0.755 |
+ 0.772 |
+ 0.754 |
+ 0.732 |
+ 0.741 |
+ 0.676 |
+ 0.672 |
+ 0.672 |
+ 0.721 |
+ EVE model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 14 |
+ ProtSSN (k=30 h=512) |
+ Hybrid - Structure & PLM |
+ 0.740 |
+ 0.004 |
+ 0.746 |
+ 0.692 |
+ 0.735 |
+ 0.715 |
+ 0.810 |
+ 0.719 |
+ 0.749 |
+ 0.778 |
+ 0.759 |
+ 0.785 |
+ 0.761 |
+ 0.691 |
+ 0.747 |
+ 0.678 |
+ 0.676 |
+ 0.655 |
+ 0.693 |
+ ProtSSN (k=30, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 15 |
+ Tranception L |
+ Hybrid - Alignment & PLM |
+ 0.739 |
+ 0.002 |
+ 0.752 |
+ 0.688 |
+ 0.747 |
+ 0.742 |
+ 0.764 |
+ 0.736 |
+ 0.745 |
+ 0.760 |
+ 0.755 |
+ 0.771 |
+ 0.731 |
+ 0.733 |
+ 0.737 |
+ 0.673 |
+ 0.711 |
+ 0.688 |
+ 0.736 |
+ Tranception Large model (700M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 16 |
+ ProtSSN (k=10 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.739 |
+ 0.004 |
+ 0.742 |
+ 0.696 |
+ 0.733 |
+ 0.714 |
+ 0.809 |
+ 0.719 |
+ 0.746 |
+ 0.776 |
+ 0.757 |
+ 0.784 |
+ 0.757 |
+ 0.694 |
+ 0.744 |
+ 0.668 |
+ 0.665 |
+ 0.640 |
+ 0.686 |
+ ProtSSN (k=10, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 17 |
+ EVE (single) |
+ Alignment-based model |
+ 0.737 |
+ 0.002 |
+ 0.748 |
+ 0.696 |
+ 0.721 |
+ 0.745 |
+ 0.774 |
+ 0.726 |
+ 0.751 |
+ 0.763 |
+ 0.750 |
+ 0.770 |
+ 0.752 |
+ 0.729 |
+ 0.738 |
+ 0.672 |
+ 0.669 |
+ 0.669 |
+ 0.723 |
+ EVE model (single seed) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 18 |
+ ProtSSN (k=10 h=512) |
+ Hybrid - Structure & PLM |
+ 0.737 |
+ 0.004 |
+ 0.746 |
+ 0.691 |
+ 0.725 |
+ 0.711 |
+ 0.811 |
+ 0.718 |
+ 0.744 |
+ 0.780 |
+ 0.757 |
+ 0.788 |
+ 0.761 |
+ 0.682 |
+ 0.743 |
+ 0.681 |
+ 0.673 |
+ 0.649 |
+ 0.692 |
+ ProtSSN (k=10, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 19 |
+ MSA Transformer (ensemble) |
+ Hybrid - Alignment & PLM |
+ 0.737 |
+ 0.005 |
+ 0.756 |
+ 0.675 |
+ 0.742 |
+ 0.732 |
+ 0.777 |
+ 0.716 |
+ 0.753 |
+ 0.764 |
+ 0.746 |
+ 0.781 |
+ 0.748 |
+ 0.724 |
+ 0.739 |
+ 0.644 |
+ 0.710 |
+ 0.719 |
+ 0.758 |
+ MSA Transformer (ensemble of 5 MSA samples) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 20 |
+ Tranception M |
+ Hybrid - Alignment & PLM |
+ 0.733 |
+ 0.003 |
+ 0.741 |
+ 0.694 |
+ 0.740 |
+ 0.736 |
+ 0.756 |
+ 0.727 |
+ 0.740 |
+ 0.748 |
+ 0.751 |
+ 0.766 |
+ 0.718 |
+ 0.719 |
+ 0.732 |
+ 0.665 |
+ 0.647 |
+ 0.654 |
+ 0.709 |
+ Tranception Medium model (300M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 21 |
+ ProtSSN (k=10 h=768) |
+ Hybrid - Structure & PLM |
+ 0.732 |
+ 0.004 |
+ 0.739 |
+ 0.686 |
+ 0.721 |
+ 0.706 |
+ 0.806 |
+ 0.707 |
+ 0.742 |
+ 0.772 |
+ 0.751 |
+ 0.777 |
+ 0.754 |
+ 0.686 |
+ 0.737 |
+ 0.672 |
+ 0.675 |
+ 0.643 |
+ 0.685 |
+ ProtSSN (k=10, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 22 |
+ DeepSequence (ensemble) |
+ Alignment-based model |
+ 0.729 |
+ 0.004 |
+ 0.746 |
+ 0.689 |
+ 0.713 |
+ 0.729 |
+ 0.769 |
+ 0.705 |
+ 0.740 |
+ 0.761 |
+ 0.748 |
+ 0.767 |
+ 0.746 |
+ 0.691 |
+ 0.728 |
+ 0.673 |
+ 0.690 |
+ 0.689 |
+ 0.732 |
+ DeepSequence model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 23 |
+ ESM-IF1 |
+ Inverse folding model |
+ 0.729 |
+ 0.007 |
+ 0.697 |
+ 0.716 |
+ 0.721 |
+ 0.674 |
+ 0.838 |
+ 0.661 |
+ 0.738 |
+ 0.793 |
+ 0.728 |
+ 0.767 |
+ 0.766 |
+ 0.710 |
+ 0.741 |
+ 0.721 |
+ 0.697 |
+ 0.684 |
+ 0.743 |
+ ESM-IF1 model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a> |
+
+
+ 24 |
+ MSA Transformer (single) |
+ Hybrid - Alignment & PLM |
+ 0.729 |
+ 0.004 |
+ 0.746 |
+ 0.666 |
+ 0.733 |
+ 0.729 |
+ 0.769 |
+ 0.707 |
+ 0.746 |
+ 0.756 |
+ 0.742 |
+ 0.774 |
+ 0.737 |
+ 0.716 |
+ 0.731 |
+ 0.639 |
+ 0.710 |
+ 0.712 |
+ 0.754 |
+ MSA Transformer (single MSA sample) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 25 |
+ ESM2 (650M) |
+ Protein language model |
+ 0.728 |
+ 0.006 |
+ 0.730 |
+ 0.691 |
+ 0.725 |
+ 0.708 |
+ 0.789 |
+ 0.691 |
+ 0.725 |
+ 0.782 |
+ 0.758 |
+ 0.766 |
+ 0.748 |
+ 0.639 |
+ 0.737 |
+ 0.665 |
+ 0.632 |
+ 0.605 |
+ 0.620 |
+ ESM2 model (650M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 26 |
+ Tranception S |
+ Hybrid - Alignment & PLM |
+ 0.728 |
+ 0.003 |
+ 0.734 |
+ 0.700 |
+ 0.728 |
+ 0.730 |
+ 0.749 |
+ 0.733 |
+ 0.731 |
+ 0.742 |
+ 0.745 |
+ 0.758 |
+ 0.715 |
+ 0.712 |
+ 0.724 |
+ 0.666 |
+ 0.654 |
+ 0.659 |
+ 0.711 |
+ Tranception Small model (85M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 27 |
+ ESM2 (3B) |
+ Protein language model |
+ 0.724 |
+ 0.006 |
+ 0.724 |
+ 0.675 |
+ 0.722 |
+ 0.712 |
+ 0.786 |
+ 0.693 |
+ 0.732 |
+ 0.770 |
+ 0.749 |
+ 0.765 |
+ 0.752 |
+ 0.655 |
+ 0.732 |
+ 0.645 |
+ 0.615 |
+ 0.605 |
+ 0.621 |
+ ESM2 model (3B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 28 |
+ ESM-1v (ensemble) |
+ Protein language model |
+ 0.723 |
+ 0.006 |
+ 0.726 |
+ 0.674 |
+ 0.735 |
+ 0.718 |
+ 0.764 |
+ 0.682 |
+ 0.725 |
+ 0.776 |
+ 0.754 |
+ 0.756 |
+ 0.728 |
+ 0.659 |
+ 0.724 |
+ 0.644 |
+ 0.616 |
+ 0.598 |
+ 0.622 |
+ ESM-1v (ensemble of 5 independently-trained models) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 29 |
+ DeepSequence (single) |
+ Alignment-based model |
+ 0.723 |
+ 0.004 |
+ 0.741 |
+ 0.680 |
+ 0.704 |
+ 0.721 |
+ 0.768 |
+ 0.705 |
+ 0.733 |
+ 0.757 |
+ 0.745 |
+ 0.763 |
+ 0.741 |
+ 0.680 |
+ 0.721 |
+ 0.666 |
+ 0.669 |
+ 0.682 |
+ 0.730 |
+ DeepSequence model (single seed) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 30 |
+ SaProt (35M) |
+ Hybrid - Structure & PLM |
+ 0.722 |
+ 0.005 |
+ 0.702 |
+ 0.702 |
+ 0.736 |
+ 0.662 |
+ 0.807 |
+ 0.683 |
+ 0.716 |
+ 0.771 |
+ 0.748 |
+ 0.764 |
+ 0.727 |
+ 0.626 |
+ 0.728 |
+ 0.720 |
+ 0.627 |
+ 0.613 |
+ 0.657 |
+ SaProt (35M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 31 |
+ VESPAl |
+ Protein language model |
+ 0.720 |
+ 0.004 |
+ 0.737 |
+ 0.683 |
+ 0.683 |
+ 0.728 |
+ 0.771 |
+ 0.709 |
+ 0.736 |
+ 0.756 |
+ 0.729 |
+ 0.765 |
+ 0.746 |
+ 0.723 |
+ 0.728 |
+ 0.615 |
+ 0.693 |
+ 0.657 |
+ 0.696 |
+ VESPAl model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 32 |
+ ESM2 (15B) |
+ Protein language model |
+ 0.720 |
+ 0.006 |
+ 0.717 |
+ 0.668 |
+ 0.725 |
+ 0.715 |
+ 0.775 |
+ 0.694 |
+ 0.731 |
+ 0.761 |
+ 0.742 |
+ 0.759 |
+ 0.741 |
+ 0.676 |
+ 0.726 |
+ 0.641 |
+ 0.637 |
+ 0.609 |
+ 0.640 |
+ ESM2 model (15B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 33 |
+ ESM-1b |
+ Protein language model |
+ 0.719 |
+ 0.006 |
+ 0.730 |
+ 0.662 |
+ 0.720 |
+ 0.699 |
+ 0.782 |
+ 0.695 |
+ 0.724 |
+ 0.767 |
+ 0.747 |
+ 0.771 |
+ 0.741 |
+ 0.642 |
+ 0.715 |
+ 0.644 |
+ 0.616 |
+ 0.596 |
+ 0.662 |
+ ESM-1b (w/ Brandes et al. extensions) |
+ [1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a> |
+
+
+ 34 |
+ Progen2 XL |
+ Protein language model |
+ 0.717 |
+ 0.004 |
+ 0.720 |
+ 0.662 |
+ 0.730 |
+ 0.718 |
+ 0.757 |
+ 0.696 |
+ 0.730 |
+ 0.749 |
+ 0.722 |
+ 0.752 |
+ 0.736 |
+ 0.718 |
+ 0.718 |
+ 0.642 |
+ 0.680 |
+ 0.647 |
+ 0.691 |
+ Progen2 xlarge model (6.4B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 35 |
+ MIF-ST |
+ Hybrid - Structure & PLM |
+ 0.717 |
+ 0.006 |
+ 0.709 |
+ 0.674 |
+ 0.738 |
+ 0.701 |
+ 0.762 |
+ 0.705 |
+ 0.720 |
+ 0.746 |
+ 0.721 |
+ 0.724 |
+ 0.741 |
+ 0.715 |
+ 0.736 |
+ 0.675 |
+ 0.706 |
+ 0.673 |
+ 0.711 |
+ MIF-ST model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 36 |
+ EVmutation |
+ Alignment-based model |
+ 0.716 |
+ 0.003 |
+ 0.737 |
+ 0.666 |
+ 0.707 |
+ 0.727 |
+ 0.741 |
+ 0.713 |
+ 0.737 |
+ 0.723 |
+ 0.728 |
+ 0.746 |
+ 0.729 |
+ 0.707 |
+ 0.711 |
+ 0.673 |
+ 0.683 |
+ 0.674 |
+ 0.757 |
+ EVmutation model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 37 |
+ ESM2 (150M) |
+ Protein language model |
+ 0.713 |
+ 0.007 |
+ 0.710 |
+ 0.685 |
+ 0.718 |
+ 0.674 |
+ 0.779 |
+ 0.675 |
+ 0.701 |
+ 0.767 |
+ 0.754 |
+ 0.761 |
+ 0.712 |
+ 0.582 |
+ 0.717 |
+ 0.655 |
+ 0.588 |
+ 0.592 |
+ 0.611 |
+ ESM2 model (150M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 38 |
+ Progen2 M |
+ Protein language model |
+ 0.711 |
+ 0.004 |
+ 0.713 |
+ 0.666 |
+ 0.735 |
+ 0.715 |
+ 0.726 |
+ 0.678 |
+ 0.716 |
+ 0.737 |
+ 0.735 |
+ 0.730 |
+ 0.699 |
+ 0.681 |
+ 0.709 |
+ 0.610 |
+ 0.596 |
+ 0.588 |
+ 0.599 |
+ Progen2 medium model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 39 |
+ Progen2 L |
+ Protein language model |
+ 0.711 |
+ 0.004 |
+ 0.718 |
+ 0.662 |
+ 0.734 |
+ 0.713 |
+ 0.726 |
+ 0.693 |
+ 0.715 |
+ 0.733 |
+ 0.733 |
+ 0.739 |
+ 0.703 |
+ 0.672 |
+ 0.708 |
+ 0.615 |
+ 0.649 |
+ 0.630 |
+ 0.669 |
+ Progen2 large model (2.7B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 40 |
+ Progen2 Base |
+ Protein language model |
+ 0.709 |
+ 0.005 |
+ 0.713 |
+ 0.664 |
+ 0.739 |
+ 0.714 |
+ 0.712 |
+ 0.689 |
+ 0.706 |
+ 0.733 |
+ 0.739 |
+ 0.733 |
+ 0.678 |
+ 0.668 |
+ 0.707 |
+ 0.597 |
+ 0.598 |
+ 0.593 |
+ 0.613 |
+ Progen2 base model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 41 |
+ RITA XL |
+ Protein language model |
+ 0.707 |
+ 0.005 |
+ 0.702 |
+ 0.665 |
+ 0.728 |
+ 0.715 |
+ 0.727 |
+ 0.674 |
+ 0.716 |
+ 0.732 |
+ 0.727 |
+ 0.721 |
+ 0.693 |
+ 0.711 |
+ 0.703 |
+ 0.616 |
+ 0.605 |
+ 0.602 |
+ 0.634 |
+ RITA xlarge model (1.2B params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 42 |
+ ESM-1v (single) |
+ Protein language model |
+ 0.707 |
+ 0.007 |
+ 0.713 |
+ 0.652 |
+ 0.723 |
+ 0.705 |
+ 0.744 |
+ 0.663 |
+ 0.708 |
+ 0.763 |
+ 0.738 |
+ 0.740 |
+ 0.714 |
+ 0.642 |
+ 0.709 |
+ 0.624 |
+ 0.620 |
+ 0.596 |
+ 0.619 |
+ ESM-1v (single seed) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 43 |
+ Wavenet |
+ Alignment-based model |
+ 0.707 |
+ 0.006 |
+ 0.709 |
+ 0.673 |
+ 0.691 |
+ 0.706 |
+ 0.757 |
+ 0.664 |
+ 0.723 |
+ 0.750 |
+ 0.723 |
+ 0.737 |
+ 0.729 |
+ 0.685 |
+ 0.703 |
+ 0.648 |
+ 0.654 |
+ 0.631 |
+ 0.669 |
+ Wavenet model |
+ <a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a> |
+
+
+ 44 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.707 |
+ 0.004 |
+ 0.717 |
+ 0.659 |
+ 0.727 |
+ 0.715 |
+ 0.714 |
+ 0.697 |
+ 0.708 |
+ 0.730 |
+ 0.719 |
+ 0.717 |
+ 0.699 |
+ 0.715 |
+ 0.703 |
+ 0.621 |
+ 0.695 |
+ 0.657 |
+ 0.704 |
+ Tranception Large model (700M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 45 |
+ MIF |
+ Inverse folding model |
+ 0.705 |
+ 0.007 |
+ 0.673 |
+ 0.685 |
+ 0.731 |
+ 0.663 |
+ 0.775 |
+ 0.690 |
+ 0.704 |
+ 0.734 |
+ 0.717 |
+ 0.704 |
+ 0.716 |
+ 0.695 |
+ 0.723 |
+ 0.673 |
+ 0.670 |
+ 0.645 |
+ 0.689 |
+ MIF model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 46 |
+ RITA L |
+ Protein language model |
+ 0.703 |
+ 0.005 |
+ 0.698 |
+ 0.660 |
+ 0.729 |
+ 0.711 |
+ 0.718 |
+ 0.676 |
+ 0.709 |
+ 0.724 |
+ 0.727 |
+ 0.720 |
+ 0.675 |
+ 0.704 |
+ 0.698 |
+ 0.609 |
+ 0.589 |
+ 0.585 |
+ 0.620 |
+ RITA large model (680M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 47 |
+ CARP (640M) |
+ Protein language model |
+ 0.701 |
+ 0.007 |
+ 0.711 |
+ 0.649 |
+ 0.718 |
+ 0.707 |
+ 0.722 |
+ 0.676 |
+ 0.705 |
+ 0.732 |
+ 0.732 |
+ 0.715 |
+ 0.704 |
+ 0.653 |
+ 0.718 |
+ 0.638 |
+ 0.609 |
+ 0.595 |
+ 0.595 |
+ CARP model (640M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 48 |
+ Site-Independent |
+ Alignment-based model |
+ 0.696 |
+ 0.005 |
+ 0.699 |
+ 0.684 |
+ 0.687 |
+ 0.714 |
+ 0.695 |
+ 0.730 |
+ 0.708 |
+ 0.670 |
+ 0.710 |
+ 0.713 |
+ 0.674 |
+ 0.698 |
+ 0.690 |
+ 0.657 |
+ 0.625 |
+ 0.643 |
+ 0.721 |
+ Site-Independent model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 49 |
+ RITA M |
+ Protein language model |
+ 0.693 |
+ 0.005 |
+ 0.691 |
+ 0.649 |
+ 0.720 |
+ 0.709 |
+ 0.698 |
+ 0.668 |
+ 0.696 |
+ 0.716 |
+ 0.716 |
+ 0.702 |
+ 0.665 |
+ 0.699 |
+ 0.690 |
+ 0.591 |
+ 0.586 |
+ 0.591 |
+ 0.614 |
+ RITA medium model (300M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 50 |
+ Unirep evotuned |
+ Hybrid - Alignment & PLM |
+ 0.692 |
+ 0.004 |
+ 0.695 |
+ 0.657 |
+ 0.700 |
+ 0.695 |
+ 0.714 |
+ 0.680 |
+ 0.697 |
+ 0.710 |
+ 0.704 |
+ 0.711 |
+ 0.690 |
+ 0.688 |
+ 0.684 |
+ 0.634 |
+ 0.653 |
+ 0.638 |
+ 0.699 |
+ Unirep model w/ evotuning |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 51 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.691 |
+ 0.005 |
+ 0.687 |
+ 0.656 |
+ 0.721 |
+ 0.702 |
+ 0.687 |
+ 0.661 |
+ 0.695 |
+ 0.705 |
+ 0.710 |
+ 0.694 |
+ 0.666 |
+ 0.681 |
+ 0.685 |
+ 0.596 |
+ 0.581 |
+ 0.583 |
+ 0.611 |
+ Tranception Medium model (300M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 52 |
+ Progen2 S |
+ Protein language model |
+ 0.685 |
+ 0.006 |
+ 0.680 |
+ 0.652 |
+ 0.710 |
+ 0.689 |
+ 0.695 |
+ 0.659 |
+ 0.679 |
+ 0.714 |
+ 0.718 |
+ 0.698 |
+ 0.657 |
+ 0.643 |
+ 0.683 |
+ 0.584 |
+ 0.572 |
+ 0.569 |
+ 0.587 |
+ Progen2 small model (150M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 53 |
+ CARP (76M) |
+ Protein language model |
+ 0.678 |
+ 0.007 |
+ 0.682 |
+ 0.656 |
+ 0.701 |
+ 0.653 |
+ 0.697 |
+ 0.640 |
+ 0.664 |
+ 0.718 |
+ 0.714 |
+ 0.698 |
+ 0.654 |
+ 0.584 |
+ 0.687 |
+ 0.628 |
+ 0.561 |
+ 0.564 |
+ 0.568 |
+ CARP model (76M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 54 |
+ ESM2 (35M) |
+ Protein language model |
+ 0.676 |
+ 0.008 |
+ 0.669 |
+ 0.668 |
+ 0.684 |
+ 0.622 |
+ 0.734 |
+ 0.637 |
+ 0.652 |
+ 0.739 |
+ 0.706 |
+ 0.722 |
+ 0.664 |
+ 0.562 |
+ 0.669 |
+ 0.649 |
+ 0.576 |
+ 0.581 |
+ 0.613 |
+ ESM2 model (35M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 55 |
+ RITA S |
+ Protein language model |
+ 0.667 |
+ 0.006 |
+ 0.659 |
+ 0.652 |
+ 0.684 |
+ 0.680 |
+ 0.660 |
+ 0.651 |
+ 0.668 |
+ 0.679 |
+ 0.687 |
+ 0.659 |
+ 0.629 |
+ 0.682 |
+ 0.659 |
+ 0.583 |
+ 0.575 |
+ 0.576 |
+ 0.598 |
+ RITA small model (85M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 56 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.665 |
+ 0.006 |
+ 0.652 |
+ 0.657 |
+ 0.689 |
+ 0.679 |
+ 0.645 |
+ 0.641 |
+ 0.664 |
+ 0.672 |
+ 0.678 |
+ 0.648 |
+ 0.642 |
+ 0.661 |
+ 0.654 |
+ 0.587 |
+ 0.579 |
+ 0.582 |
+ 0.597 |
+ Tranception Small model (85M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 57 |
+ CARP (38M) |
+ Protein language model |
+ 0.653 |
+ 0.007 |
+ 0.655 |
+ 0.652 |
+ 0.671 |
+ 0.623 |
+ 0.661 |
+ 0.612 |
+ 0.633 |
+ 0.688 |
+ 0.679 |
+ 0.664 |
+ 0.628 |
+ 0.567 |
+ 0.656 |
+ 0.610 |
+ 0.557 |
+ 0.571 |
+ 0.578 |
+ CARP model (38M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 58 |
+ ProteinMPNN |
+ Inverse folding model |
+ 0.639 |
+ 0.006 |
+ 0.605 |
+ 0.591 |
+ 0.603 |
+ 0.590 |
+ 0.805 |
+ 0.596 |
+ 0.655 |
+ 0.729 |
+ 0.658 |
+ 0.710 |
+ 0.684 |
+ 0.640 |
+ 0.657 |
+ 0.673 |
+ 0.604 |
+ 0.610 |
+ 0.695 |
+ ProteinMPNN model |
+ <a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a> |
+
+
+ 59 |
+ ESM2 (8M) |
+ Protein language model |
+ 0.624 |
+ 0.008 |
+ 0.607 |
+ 0.650 |
+ 0.647 |
+ 0.577 |
+ 0.638 |
+ 0.611 |
+ 0.601 |
+ 0.636 |
+ 0.635 |
+ 0.635 |
+ 0.601 |
+ 0.546 |
+ 0.611 |
+ 0.593 |
+ 0.566 |
+ 0.573 |
+ 0.599 |
+ ESM2 model (8M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 60 |
+ Unirep |
+ Protein language model |
+ 0.604 |
+ 0.008 |
+ 0.597 |
+ 0.617 |
+ 0.621 |
+ 0.577 |
+ 0.610 |
+ 0.602 |
+ 0.591 |
+ 0.603 |
+ 0.622 |
+ 0.620 |
+ 0.576 |
+ 0.532 |
+ 0.595 |
+ 0.557 |
+ 0.566 |
+ 0.582 |
+ 0.592 |
+ Unirep model |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 61 |
+ ProtGPT2 |
+ Protein language model |
+ 0.603 |
+ 0.006 |
+ 0.593 |
+ 0.585 |
+ 0.602 |
+ 0.595 |
+ 0.639 |
+ 0.598 |
+ 0.597 |
+ 0.637 |
+ 0.636 |
+ 0.631 |
+ 0.570 |
+ 0.576 |
+ 0.601 |
+ 0.602 |
+ 0.528 |
+ 0.509 |
+ 0.534 |
+ ProtGPT2 model |
+ <a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a> |
+
+
+ 62 |
+ CARP (600K) |
+ Protein language model |
+ 0.557 |
+ 0.009 |
+ 0.555 |
+ 0.551 |
+ 0.596 |
+ 0.531 |
+ 0.549 |
+ 0.552 |
+ 0.545 |
+ 0.548 |
+ 0.568 |
+ 0.539 |
+ 0.531 |
+ 0.526 |
+ 0.556 |
+ 0.533 |
+ 0.524 |
+ 0.548 |
+ 0.557 |
+ CARP model (600K params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.csv b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.csv
new file mode 100644
index 0000000..3e3d4a5
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.csv
@@ -0,0 +1,217 @@
+DMS ID,Site-Independent,EVmutation,DeepSequence (single),DeepSequence (ensemble),EVE (single),EVE (ensemble),Unirep,Unirep evotuned,MSA Transformer (single),MSA Transformer (ensemble),ESM-1b,ESM-1v (single),ESM-1v (ensemble),ESM2 (8M),ESM2 (35M),ESM2 (150M),ESM2 (650M),ESM2 (3B),ESM2 (15B),Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,GEMME,VESPA,VESPAl,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,CARP (38M),CARP (600K),CARP (640M),CARP (76M),MIF,MIF-ST,ESM-IF1,ProteinMPNN,ProtSSN (k=10 h=512),ProtSSN (k=10 h=768),ProtSSN (k=10 h=1280),ProtSSN (k=20 h=512),ProtSSN (k=20 h=768),ProtSSN (k=20 h=1280),ProtSSN (k=30 h=512),ProtSSN (k=30 h=768),ProtSSN (k=30 h=1280),ProtSSN (ensemble),SaProt (650M),SaProt (35M),Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A0A140D2T1_ZIKV_Sourisseau_2019,0.265,0.23,0.082,0.075,0.28,0.292,-0.124,0.063,0.334,0.335,-0.023,-0.041,0.005,-0.058,-0.045,-0.037,0.147,0.277,0.281,0.168,0.263,0.218,0.219,0.21,0.241,0.236,0.224,0.226,0.197,0.27,0.208,0.191,-0.005,0.217,0.234,0.193,0.264,0.263,0.248,0.258,0.254,0.271,-0.043,-0.061,0.097,-0.052,0.202,0.205,0.21,0.086,0.175,0.178,0.183,0.202,0.191,0.191,0.184,0.183,0.183,0.188,0.14,0.084,9576,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+A0A192B1T2_9HIV1_Haddox_2018,0.359,0.332,0.317,0.337,0.387,0.401,0.008,0.39,0.406,0.409,0.356,0.391,0.401,0.022,0.022,0.04,0.076,0.087,0.1,0.359,0.386,0.397,0.398,0.394,0.393,0.389,0.366,0.384,0.385,0.382,0.435,0.418,0.256,0.385,0.384,0.4,0.393,0.399,0.398,0.402,0.409,0.411,0.329,-0.002,0.378,0.332,0.284,0.359,0.187,0.102,0.167,0.19,0.205,0.208,0.168,0.186,0.187,0.163,0.145,0.197,0.119,0.063,12577,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+A0A1I9GEU1_NEIME_Kennouche_2019,-0.009,0.026,0.082,0.061,0.043,0.052,0.0,0.065,0.061,0.069,0.017,0.043,0.065,-0.026,-0.03,0.004,0.048,0.022,0.056,0.048,-0.004,0.022,0.052,0.043,0.03,0.061,0.052,0.069,0.065,0.03,0.0,-0.007,0.017,0.043,0.052,0.082,0.03,0.065,0.056,0.039,0.082,0.061,-0.013,-0.035,0.052,-0.03,0.022,0.052,0.035,0.056,0.065,0.052,0.065,0.056,0.065,0.078,0.039,0.069,0.061,0.082,0.043,0.004,922,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+A0A247D711_LISMN_Stadelmann_2021,0.347,0.359,0.096,0.041,0.312,0.316,0.028,0.038,0.234,0.258,0.065,0.082,0.087,0.07,0.079,0.084,0.06,0.07,0.084,0.043,0.067,0.055,0.053,0.094,0.014,0.065,0.033,0.07,0.089,0.351,0.244,0.234,0.004,0.053,0.041,0.055,0.212,0.21,0.215,0.193,0.183,0.193,0.062,0.043,0.067,0.074,0.312,0.353,0.328,0.263,0.21,0.237,0.224,0.246,0.241,0.28,0.208,0.244,0.191,0.237,0.316,0.212,1653,Activity,A0A247D711_LISMN,High,Prokaryote
+A0A2Z5U3Z0_9INFA_Doud_2016,0.357,0.355,0.359,0.39,0.417,0.413,-0.006,0.373,0.38,0.376,0.071,0.405,0.431,-0.022,-0.001,0.04,0.397,0.39,0.378,0.296,0.352,0.413,0.396,0.42,0.334,0.391,0.395,0.376,0.385,0.403,0.389,0.311,0.059,0.356,0.407,0.415,0.407,0.416,0.402,0.436,0.44,0.431,0.001,0.007,0.263,0.004,0.333,0.378,0.366,0.132,0.384,0.384,0.425,0.415,0.417,0.431,0.426,0.407,0.439,0.432,0.15,0.122,10715,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A0A2Z5U3Z0_9INFA_Wu_2014,0.413,0.423,0.365,0.375,0.413,0.431,-0.011,0.316,0.382,0.387,0.101,0.358,0.394,0.03,0.054,0.054,0.362,0.389,0.375,0.346,0.336,0.389,0.386,0.392,0.29,0.333,0.355,0.321,0.355,0.425,0.333,0.283,0.066,0.338,0.379,0.406,0.401,0.45,0.44,0.457,0.466,0.459,0.031,0.031,0.183,0.03,0.289,0.295,0.278,0.096,0.35,0.333,0.369,0.362,0.362,0.379,0.379,0.37,0.369,0.38,0.163,0.13,2350,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A4_HUMAN_Seuma_2022,0.363,0.323,0.35,0.323,0.246,0.243,0.292,0.11,0.246,0.25,0.234,0.218,0.295,0.338,0.307,0.305,0.359,0.371,0.269,0.185,0.217,0.171,0.204,0.198,0.28,0.192,0.212,0.202,0.201,0.388,0.191,0.128,0.412,0.28,0.161,0.236,0.333,0.266,0.361,0.332,0.288,0.31,0.316,0.311,0.333,0.304,0.232,0.315,-0.13,0.042,0.357,0.338,0.337,0.341,0.344,0.316,0.356,0.281,0.29,0.343,0.348,0.332,14811,Stability,A4_HUMAN,Low,Human
+A4D664_9INFA_Soh_2019,0.344,0.252,0.341,0.335,0.326,0.323,0.015,0.267,0.254,0.245,0.024,0.019,0.025,0.019,0.02,0.02,0.103,0.139,0.196,0.194,0.269,0.301,0.319,0.313,0.05,0.208,0.224,0.197,0.258,0.365,0.24,0.237,0.024,0.269,0.284,0.314,0.279,0.293,0.296,0.349,0.356,0.367,0.015,0.002,0.047,0.005,0.204,0.199,0.09,0.081,0.154,0.151,0.145,0.163,0.153,0.163,0.158,0.168,0.16,0.158,0.112,0.064,14421,OrganismalFitness,A4D664_9INFA,Medium,Virus
+A4GRB6_PSEAI_Chen_2020,0.304,0.469,0.554,0.557,0.514,0.536,0.29,0.427,0.526,0.58,0.585,0.546,0.572,0.354,0.441,0.551,0.639,0.614,0.513,0.465,0.336,0.448,0.463,0.525,0.441,0.514,0.53,0.527,0.593,0.568,0.643,0.573,0.206,0.34,0.464,0.521,0.46,0.493,0.539,0.547,0.556,0.567,0.412,0.08,0.56,0.504,0.546,0.597,0.536,0.325,0.618,0.603,0.596,0.59,0.597,0.607,0.617,0.617,0.616,0.615,0.61,0.417,5004,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+AACC1_PSEAI_Dandage_2018,0.192,0.363,0.249,0.336,0.376,0.374,0.12,0.196,0.389,0.38,0.287,0.345,0.371,0.127,0.147,0.154,0.363,0.398,0.414,0.278,0.112,0.156,0.174,0.209,0.178,0.32,0.298,0.327,0.354,0.363,0.4,0.349,0.003,0.163,0.163,0.311,0.32,0.303,0.334,0.349,0.343,0.385,0.154,0.076,0.243,0.163,0.187,0.265,0.245,0.114,0.363,0.349,0.36,0.349,0.356,0.34,0.351,0.349,0.347,0.358,0.36,0.2,1801,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+ACE2_HUMAN_Chan_2020,0.162,0.105,0.08,0.08,0.117,0.103,-0.028,0.013,0.161,0.17,0.18,0.087,0.126,-0.014,0.035,0.152,0.152,0.116,0.089,0.126,-0.005,0.033,0.094,0.141,-0.028,0.074,0.117,0.116,0.18,0.096,0.103,0.073,0.062,0.006,0.04,0.045,0.135,0.092,0.083,0.107,0.083,0.081,0.008,0.045,0.218,0.035,0.288,0.24,0.27,0.08,0.161,0.207,0.161,0.182,0.173,0.195,0.164,0.162,0.166,0.173,0.233,0.161,2223,Binding,ACE2_HUMAN,Medium,Human
+ADRB2_HUMAN_Jones_2020,0.277,0.364,0.41,0.411,0.397,0.407,0.364,0.396,0.432,0.426,0.43,0.428,0.428,0.313,0.341,0.373,0.392,0.402,0.405,0.412,0.425,0.427,0.415,0.39,0.431,0.421,0.421,0.425,0.391,0.429,0.406,0.311,0.25,0.435,0.421,0.401,0.428,0.43,0.427,0.423,0.43,0.428,0.375,0.124,0.431,0.411,0.336,0.396,0.368,0.169,0.386,0.39,0.412,0.402,0.409,0.402,0.395,0.414,0.403,0.417,0.457,0.411,7800,Activity,ADRB2_HUMAN,Medium,Human
+AICDA_HUMAN_Gajula_2014_3cycles,0.011,0.216,0.267,0.267,0.267,0.267,-0.169,0.139,0.164,0.216,0.216,0.216,0.267,-0.143,-0.143,0.011,0.19,0.19,0.113,0.395,-0.092,0.036,0.19,0.241,-0.066,0.139,0.293,0.216,0.19,0.164,0.293,0.293,0.036,-0.041,-0.015,0.216,0.036,0.113,0.19,0.216,0.241,0.216,-0.169,-0.169,0.164,0.036,0.293,0.241,0.267,0.139,0.216,0.241,0.216,0.216,0.164,0.216,0.216,0.216,0.19,0.19,0.19,-0.015,209,Activity,AICDA_HUMAN,Medium,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O,0.201,0.249,0.241,0.226,0.229,0.234,-0.227,0.013,0.085,0.096,0.188,0.07,0.271,-0.139,0.186,0.127,0.102,0.135,0.127,-0.015,-0.063,-0.157,-0.094,-0.087,-0.271,-0.124,-0.112,-0.176,0.005,0.203,0.125,0.128,0.007,-0.207,-0.108,-0.248,0.222,0.234,0.183,0.262,0.256,0.211,-0.16,-0.162,0.04,0.003,0.143,0.02,0.276,0.232,0.283,0.149,0.048,0.172,0.162,0.164,0.163,0.166,0.174,0.159,0.153,0.26,2972,Stability,AMFR_HUMAN,Medium,Human
+AMIE_PSEAE_Wrenbeck_2017,0.207,0.391,0.424,0.448,0.421,0.396,0.035,0.363,0.314,0.494,0.466,0.496,0.546,0.214,0.282,0.299,0.437,0.557,0.493,0.394,0.452,0.449,0.444,0.454,0.453,0.485,0.455,0.453,0.433,0.47,0.487,0.455,0.243,0.499,0.389,0.351,0.474,0.395,0.36,0.494,0.432,0.415,0.233,0.04,0.358,0.281,0.291,0.374,0.324,0.167,0.448,0.41,0.418,0.435,0.422,0.432,0.413,0.426,0.453,0.435,0.475,0.286,6227,Activity,AMIE_PSEAE,High,Prokaryote
+ANCSZ_Hobbs_2022,0.417,0.382,0.325,0.348,0.364,0.369,0.389,0.342,0.368,0.382,0.405,0.382,0.406,0.36,0.431,0.434,0.448,0.437,0.418,0.008,0.316,0.346,0.38,0.38,0.374,0.383,0.372,0.352,0.37,0.433,0.41,0.406,0.222,0.343,0.37,0.341,0.404,0.424,0.409,0.388,0.408,0.394,0.381,0.285,0.366,0.372,0.339,0.361,0.311,0.123,0.414,0.408,0.431,0.434,0.431,0.442,0.418,0.43,0.422,0.441,0.419,0.442,4670,Activity,ANCSZ,Medium,Eukaryote
+ARGR_ECOLI_Tsuboyama_2023_1AOY,0.212,0.24,0.293,0.296,0.281,0.281,0.181,0.249,0.315,0.352,0.284,0.203,0.262,0.209,0.181,0.355,0.34,0.377,0.293,0.296,0.225,0.327,0.33,0.277,0.231,0.296,0.29,0.324,0.302,0.336,0.305,0.239,0.066,0.253,0.265,0.324,0.212,0.215,0.281,0.268,0.277,0.299,0.162,0.144,0.33,0.296,0.585,0.498,0.601,0.495,0.371,0.392,0.377,0.386,0.358,0.38,0.355,0.368,0.34,0.377,0.43,0.368,1287,Stability,ARGR_ECOLI,Medium,Prokaryote
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.519,0.002,0.508,0.532,0.319,0.508,0.177,0.414,0.201,0.225,0.177,0.059,0.248,0.059,0.154,0.106,0.225,0.343,0.272,0.225,0.154,0.035,0.225,0.366,0.059,0.177,0.319,0.248,0.295,0.658,0.319,0.3,0.201,0.106,0.177,0.366,0.437,0.343,0.343,0.532,0.461,0.366,0.13,0.012,0.154,0.083,0.272,0.248,0.319,0.035,0.272,0.272,0.319,0.201,0.248,0.414,0.225,0.177,0.366,0.248,0.201,0.201,170,Binding,B2L11_HUMAN,Low,Human
+BBC1_YEAST_Tsuboyama_2023_1TG0,0.189,0.239,0.379,0.361,0.375,0.381,0.185,0.282,0.365,0.382,0.433,0.369,0.41,0.324,0.359,0.359,0.439,0.371,0.44,0.29,0.332,0.352,0.381,0.34,0.123,0.375,0.408,0.369,0.299,0.294,0.352,0.329,0.266,0.299,0.206,0.268,0.328,0.263,0.309,0.408,0.359,0.382,0.098,0.108,0.21,0.127,0.468,0.367,0.572,0.514,0.442,0.421,0.448,0.442,0.442,0.446,0.435,0.442,0.442,0.444,0.45,0.483,2069,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU,0.327,0.307,0.281,0.296,0.284,0.302,0.098,0.159,0.309,0.391,0.33,0.248,0.289,0.118,0.149,0.251,0.406,0.414,0.324,0.342,0.088,0.299,0.154,0.302,0.136,0.363,0.21,0.33,0.34,0.5,0.472,0.401,0.024,0.164,0.23,0.35,0.355,0.36,0.36,0.296,0.309,0.34,0.078,0.085,0.169,0.146,0.421,0.37,0.518,0.431,0.454,0.452,0.464,0.472,0.459,0.487,0.459,0.449,0.454,0.462,0.378,0.21,1572,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Deng_2012,0.261,0.406,0.402,0.422,0.402,0.414,0.067,0.24,0.419,0.448,0.413,0.394,0.418,0.189,0.281,0.373,0.443,0.345,0.259,0.365,0.293,0.297,0.303,0.309,0.31,0.356,0.361,0.315,0.271,0.387,0.428,0.396,0.091,0.286,0.296,0.261,0.355,0.356,0.359,0.414,0.429,0.426,0.273,0.028,0.389,0.34,0.335,0.387,0.364,0.193,0.43,0.443,0.438,0.453,0.45,0.442,0.45,0.435,0.442,0.466,0.44,0.321,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Firnberg_2014,0.348,0.564,0.589,0.604,0.549,0.573,0.104,0.365,0.58,0.608,0.579,0.53,0.575,0.284,0.413,0.514,0.62,0.429,0.3,0.523,0.42,0.408,0.381,0.385,0.451,0.5,0.506,0.418,0.335,0.555,0.614,0.63,0.139,0.41,0.366,0.323,0.48,0.464,0.462,0.572,0.574,0.576,0.397,0.032,0.571,0.469,0.471,0.572,0.529,0.244,0.599,0.591,0.59,0.617,0.611,0.612,0.616,0.584,0.603,0.632,0.605,0.448,4783,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Jacquier_2013,0.357,0.551,0.551,0.587,0.567,0.579,0.07,0.345,0.571,0.587,0.523,0.523,0.535,0.24,0.393,0.499,0.579,0.466,0.369,0.515,0.422,0.393,0.377,0.393,0.426,0.527,0.47,0.418,0.345,0.446,0.575,0.571,0.114,0.385,0.397,0.381,0.47,0.494,0.499,0.563,0.583,0.596,0.341,0.058,0.519,0.43,0.41,0.519,0.499,0.236,0.547,0.519,0.539,0.559,0.555,0.535,0.567,0.547,0.519,0.559,0.559,0.438,989,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Stiffler_2015,0.35,0.564,0.601,0.614,0.551,0.586,0.102,0.363,0.594,0.62,0.591,0.549,0.595,0.298,0.424,0.526,0.632,0.436,0.306,0.528,0.433,0.419,0.374,0.392,0.466,0.511,0.526,0.427,0.339,0.565,0.627,0.638,0.137,0.432,0.369,0.326,0.49,0.462,0.465,0.585,0.58,0.586,0.411,0.032,0.581,0.483,0.48,0.576,0.53,0.247,0.6,0.582,0.594,0.608,0.608,0.611,0.61,0.591,0.604,0.63,0.616,0.462,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BRCA1_HUMAN_Findlay_2018,0.436,0.298,0.423,0.426,0.403,0.408,0.048,0.227,0.334,0.331,0.413,0.299,0.341,0.077,0.286,0.379,0.413,0.426,0.361,0.237,0.065,0.321,0.341,0.349,0.239,0.403,0.391,0.379,0.398,0.399,0.436,0.416,0.212,0.09,0.127,0.371,0.406,0.406,0.436,0.441,0.431,0.448,0.142,0.068,0.446,0.361,0.369,0.391,0.044,0.122,0.431,0.423,0.428,0.411,0.421,0.423,0.428,0.413,0.408,0.423,0.436,0.398,1837,OrganismalFitness,BRCA1_HUMAN,Low,Human
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.312,0.309,0.309,0.309,0.309,0.309,0.068,0.283,0.122,-0.012,0.309,0.095,0.015,0.015,0.015,0.283,0.309,0.309,0.309,0.041,0.015,0.309,0.309,0.309,0.309,0.283,0.309,0.309,0.068,0.309,0.256,0.285,-0.012,0.041,0.068,0.068,0.309,0.309,0.309,0.309,0.309,0.309,0.015,0.015,0.309,-0.012,-0.039,0.015,-0.2,0.175,0.256,0.283,0.283,0.309,0.309,0.309,0.309,0.309,0.309,0.309,-0.039,0.015,265,OrganismalFitness,BRCA2_HUMAN,,Human
+C6KNH7_9INFA_Lee_2018,0.297,0.268,0.258,0.264,0.308,0.308,-0.011,0.333,0.267,0.272,0.024,0.32,0.37,-0.002,-0.007,-0.003,0.381,0.293,0.298,0.245,0.288,0.269,0.275,0.261,0.168,0.341,0.323,0.357,0.281,0.345,0.333,0.251,0.088,0.269,0.276,0.299,0.312,0.317,0.318,0.332,0.335,0.325,-0.009,-0.016,0.204,0.014,0.393,0.422,0.426,0.172,0.387,0.396,0.41,0.411,0.417,0.399,0.417,0.416,0.409,0.416,0.235,0.134,10754,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+CALM1_HUMAN_Weile_2017,0.135,0.201,0.197,0.204,0.186,0.192,0.124,0.135,0.197,0.221,0.232,0.19,0.234,0.115,0.153,0.153,0.166,0.166,0.168,0.188,0.131,0.184,0.215,0.21,0.188,0.243,0.239,0.25,0.276,0.206,0.166,0.098,0.062,0.173,0.226,0.27,0.175,0.197,0.226,0.204,0.215,0.204,0.164,0.115,0.256,0.212,0.06,0.124,0.137,0.067,0.124,0.144,0.151,0.155,0.153,0.14,0.142,0.133,0.137,0.151,0.252,0.192,1813,OrganismalFitness,CALM1_HUMAN,High,Human
+CAPSD_AAV2S_Sinai_2021,0.234,0.195,0.188,0.211,0.194,0.188,0.285,0.285,0.179,0.202,0.122,0.146,0.148,0.201,0.217,0.144,0.201,0.111,0.044,0.163,0.145,0.198,0.206,0.202,0.159,0.149,0.188,0.152,0.3,0.274,0.118,0.11,0.102,0.149,0.203,0.36,0.213,0.223,0.318,0.194,0.197,0.263,0.092,0.136,0.21,0.096,0.313,0.301,0.24,0.221,0.151,0.148,0.131,0.151,0.139,0.139,0.143,0.139,0.15,0.145,0.214,0.167,42328,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAR11_HUMAN_Meitlis_2020_gof,-0.01,-0.026,-0.027,-0.03,-0.055,-0.062,0.09,-0.095,0.025,0.027,0.012,0.082,0.035,0.047,0.05,0.145,0.057,0.065,0.1,0.045,0.075,-0.042,0.012,0.005,-0.14,-0.052,-0.035,-0.032,0.02,-0.019,0.022,0.012,0.097,0.042,-0.042,-0.09,0.032,-0.022,-0.07,-0.02,-0.062,-0.092,0.06,0.015,0.002,0.125,0.102,0.015,0.07,0.04,0.105,0.095,0.09,0.08,0.085,0.075,0.072,0.09,0.095,0.095,0.137,0.13,2374,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAR11_HUMAN_Meitlis_2020_lof,0.365,0.311,0.299,0.284,0.294,0.299,0.075,0.011,0.199,0.189,0.361,0.41,0.395,0.02,0.085,0.381,0.426,0.436,0.443,0.128,0.123,0.269,0.246,0.21,0.155,0.256,0.241,0.296,0.113,0.314,0.363,0.325,0.277,-0.009,0.224,0.205,0.313,0.267,0.237,0.329,0.276,0.252,0.072,0.026,0.334,0.241,0.353,0.375,0.348,0.108,0.443,0.431,0.431,0.443,0.428,0.443,0.42,0.436,0.453,0.44,0.465,0.323,2395,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAS9_STRP1_Spencer_2017_positive,0.107,0.141,0.123,0.126,0.135,0.139,0.014,0.079,0.131,0.14,0.123,0.046,0.039,0.027,0.056,0.101,0.133,0.137,0.134,0.01,0.027,0.048,0.116,0.091,0.026,0.144,0.138,0.133,0.048,0.128,0.145,0.129,0.016,0.022,0.033,0.12,0.132,0.134,0.132,0.129,0.131,0.139,0.039,0.011,0.129,0.074,0.074,0.129,0.015,0.02,0.123,0.123,0.121,0.125,0.126,0.13,0.127,0.128,0.121,0.128,0.119,0.077,8117,Activity,CAS9_STRP1,Medium,Prokaryote
+CASP3_HUMAN_Roychowdhury_2020,0.314,0.447,0.508,0.503,0.518,0.523,0.053,0.37,0.518,0.518,0.464,0.477,0.487,0.186,0.441,0.518,0.521,0.452,0.38,0.48,0.132,0.378,0.424,0.406,0.383,0.441,0.447,0.393,0.441,0.49,0.467,0.401,0.181,0.048,0.355,0.388,0.39,0.429,0.462,0.505,0.49,0.526,0.27,-0.029,0.48,0.424,0.316,0.429,0.395,0.188,0.459,0.457,0.459,0.462,0.482,0.47,0.467,0.447,0.457,0.487,0.531,0.372,1567,Activity,CASP3_HUMAN,High,Human
+CASP7_HUMAN_Roychowdhury_2020,0.335,0.461,0.509,0.521,0.502,0.502,0.051,0.392,0.487,0.533,0.487,0.502,0.53,0.225,0.516,0.518,0.535,0.499,0.478,0.504,0.18,0.459,0.461,0.464,0.475,0.497,0.511,0.485,0.48,0.549,0.511,0.459,0.299,0.101,0.487,0.425,0.442,0.506,0.509,0.497,0.549,0.533,0.371,0.023,0.549,0.514,0.397,0.523,0.521,0.249,0.514,0.502,0.471,0.492,0.518,0.521,0.495,0.495,0.514,0.511,0.556,0.433,1680,Activity,CASP7_HUMAN,Medium,Human
+CATR_CHLRE_Tsuboyama_2023_2AMI,0.47,0.473,0.567,0.563,0.58,0.578,0.607,0.487,0.449,0.422,0.498,0.538,0.563,0.481,0.594,0.586,0.557,0.508,0.573,0.088,0.506,0.515,0.51,0.508,0.54,0.519,0.498,0.46,0.477,0.576,0.536,0.544,0.357,0.527,0.502,0.529,0.569,0.542,0.561,0.58,0.563,0.576,0.395,0.38,0.294,0.424,0.393,0.386,0.576,0.439,0.565,0.544,0.578,0.557,0.584,0.563,0.557,0.536,0.555,0.559,0.512,0.565,1903,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,0.555,0.605,0.633,0.634,0.638,0.65,0.375,0.569,0.673,0.685,0.648,0.615,0.611,0.404,0.561,0.642,0.636,0.633,0.607,0.673,0.4,0.406,0.592,0.571,0.449,0.573,0.582,0.557,0.611,0.667,0.658,0.595,0.019,0.404,0.513,0.611,0.576,0.603,0.623,0.65,0.64,0.65,0.397,0.279,0.513,0.522,0.677,0.625,0.805,0.71,0.698,0.725,0.692,0.71,0.737,0.714,0.712,0.71,0.706,0.725,0.631,0.776,2068,Stability,CBPA2_HUMAN,Medium,Human
+CBS_HUMAN_Sun_2020,0.297,0.306,0.314,0.339,0.319,0.331,0.162,0.205,0.33,0.341,0.304,0.305,0.329,0.07,0.177,0.281,0.299,0.285,0.283,0.309,0.315,0.238,0.261,0.262,0.319,0.254,0.262,0.248,0.273,0.322,0.316,0.299,0.179,0.332,0.268,0.233,0.329,0.289,0.286,0.349,0.317,0.315,0.243,0.062,0.326,0.324,0.21,0.236,0.277,0.058,0.277,0.277,0.283,0.297,0.292,0.289,0.281,0.284,0.29,0.293,0.336,0.238,7217,OrganismalFitness,CBS_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28,0.389,0.45,0.493,0.505,0.463,0.507,-0.205,0.454,0.482,0.468,0.48,0.482,0.47,-0.21,0.545,0.514,0.503,0.428,0.442,0.5,0.312,0.41,0.377,0.394,0.372,0.361,0.366,0.31,0.394,0.489,0.507,0.502,0.349,0.277,0.305,0.31,0.415,0.41,0.396,0.496,0.517,0.503,0.465,-0.261,0.249,0.435,-0.002,0.067,0.452,0.421,0.528,0.458,0.535,0.526,0.545,0.515,0.519,0.552,0.519,0.526,0.493,0.615,2282,Stability,CBX4_HUMAN,High,Human
+CCDB_ECOLI_Adkar_2012,0.298,0.349,0.389,0.413,0.389,0.413,-0.019,0.165,0.315,0.308,0.379,0.304,0.386,0.015,-0.012,0.349,0.4,0.369,0.213,0.393,0.015,0.012,-0.08,0.094,-0.145,0.077,0.036,-0.005,0.328,0.41,0.495,0.481,0.117,-0.022,0.005,0.253,0.383,0.345,0.366,0.403,0.366,0.396,0.036,-0.009,0.379,0.015,0.25,0.349,0.27,0.219,0.379,0.321,0.349,0.366,0.342,0.359,0.352,0.379,0.406,0.383,0.359,0.138,1176,Activity,CCDB_ECOLI,High,Prokaryote
+CCDB_ECOLI_Tripathi_2016,0.326,0.384,0.414,0.436,0.402,0.414,0.026,0.217,0.314,0.341,0.393,0.357,0.378,0.023,0.029,0.357,0.393,0.423,0.311,0.411,0.038,0.032,-0.044,0.153,-0.074,0.093,0.011,0.059,0.384,0.387,0.448,0.439,0.069,0.02,0.105,0.332,0.384,0.372,0.417,0.393,0.393,0.414,0.026,-0.022,0.399,0.02,0.254,0.341,0.251,0.196,0.396,0.332,0.36,0.375,0.357,0.363,0.357,0.36,0.381,0.372,0.372,0.241,1663,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+CCR5_HUMAN_Gill_2023,0.208,0.215,0.206,0.203,0.203,0.21,0.221,0.227,0.239,0.263,0.268,0.258,0.268,0.228,0.255,0.263,0.256,0.241,0.258,0.268,0.271,0.257,0.232,0.23,0.271,0.255,0.255,0.273,0.233,0.259,0.216,0.182,0.086,0.272,0.268,0.264,0.261,0.27,0.28,0.234,0.233,0.227,0.267,0.121,0.268,0.264,0.19,0.23,0.203,0.112,0.246,0.234,0.246,0.25,0.245,0.24,0.252,0.254,0.256,0.256,0.271,0.26,6137,Binding,CCR5_HUMAN,High,Human
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.144,0.152,0.146,0.146,0.163,0.153,0.062,0.081,0.126,0.131,0.09,0.11,0.123,0.102,0.13,0.109,0.099,0.126,0.099,-0.018,0.103,0.109,0.14,0.126,0.094,0.159,0.131,0.095,0.155,0.168,0.099,0.073,0.078,0.119,0.147,0.126,0.136,0.135,0.132,0.164,0.161,0.156,0.11,0.07,0.094,0.11,0.264,0.209,0.252,0.132,0.164,0.182,0.169,0.193,0.176,0.169,0.179,0.176,0.182,0.193,0.305,0.26,3761,Binding,CD19_HUMAN,Low,Human
+CP2C9_HUMAN_Amorosi_2021_abundance,0.384,0.442,0.455,0.465,0.446,0.469,0.396,0.441,0.446,0.465,0.38,0.453,0.472,0.41,0.475,0.477,0.498,0.488,0.447,0.483,0.449,0.445,0.464,0.425,0.447,0.437,0.441,0.449,0.423,0.46,0.446,0.399,0.175,0.451,0.438,0.422,0.478,0.464,0.463,0.483,0.484,0.486,0.452,0.05,0.383,0.473,0.435,0.408,0.473,0.159,0.479,0.47,0.483,0.505,0.493,0.498,0.49,0.479,0.486,0.505,0.484,0.478,6370,Expression,CP2C9_HUMAN,High,Human
+CP2C9_HUMAN_Amorosi_2021_activity,0.371,0.438,0.446,0.459,0.434,0.463,0.437,0.456,0.452,0.48,0.367,0.479,0.497,0.414,0.503,0.511,0.513,0.5,0.463,0.495,0.445,0.455,0.461,0.437,0.45,0.439,0.442,0.46,0.439,0.471,0.448,0.402,0.181,0.484,0.44,0.426,0.5,0.489,0.484,0.495,0.484,0.482,0.48,0.045,0.408,0.503,0.465,0.44,0.505,0.186,0.483,0.489,0.499,0.505,0.513,0.515,0.502,0.493,0.504,0.514,0.508,0.504,6142,Binding,CP2C9_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM,0.255,0.474,0.392,0.409,0.356,0.369,0.213,0.367,0.421,0.434,0.455,0.455,0.502,0.305,0.494,0.418,0.365,0.436,0.429,0.378,0.328,0.375,0.455,0.465,0.342,0.464,0.459,0.453,0.442,0.448,0.428,0.419,0.259,0.274,0.364,0.433,0.347,0.359,0.421,0.417,0.421,0.457,0.233,0.167,0.306,0.329,0.345,0.329,0.426,0.467,0.468,0.459,0.445,0.471,0.479,0.471,0.47,0.476,0.438,0.478,0.456,0.482,3295,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX,0.252,0.308,0.277,0.282,0.31,0.292,0.072,0.32,0.366,0.376,0.368,0.297,0.265,0.105,0.32,0.361,0.434,0.361,0.373,0.257,0.115,0.141,0.158,0.062,0.085,0.153,0.148,0.148,0.232,0.295,0.305,0.271,0.189,0.113,0.11,0.135,0.257,0.247,0.247,0.315,0.315,0.315,0.108,0.032,0.31,0.151,0.396,0.381,0.454,0.346,0.404,0.373,0.381,0.409,0.404,0.419,0.416,0.416,0.411,0.404,0.449,0.404,1580,Stability,CUE1_YEAST,Medium,Eukaryote
+D7PM05_CLYGR_Somermeyer_2022,0.426,0.5,0.5,0.465,0.506,0.503,0.049,0.378,0.538,0.543,0.353,0.048,0.042,0.045,0.032,0.022,0.032,0.046,0.109,0.356,0.032,0.033,0.082,0.06,-0.003,0.035,0.05,0.145,0.281,0.506,0.444,0.461,0.024,0.078,0.094,0.133,0.418,0.417,0.415,0.505,0.504,0.493,-0.002,0.01,0.001,0.02,0.209,0.228,0.389,0.292,0.386,0.386,0.384,0.392,0.381,0.392,0.383,0.376,0.377,0.385,0.235,0.147,24515,Activity,D7PM05_CLYGR,Low,Eukaryote
+DLG4_HUMAN_Faure_2021,0.611,0.49,0.509,0.487,0.498,0.51,0.652,0.557,0.444,0.448,0.435,0.459,0.497,0.671,0.688,0.646,0.473,0.386,0.368,0.536,0.436,0.478,0.465,0.437,0.487,0.482,0.453,0.451,0.38,0.508,0.545,0.542,0.394,0.437,0.549,0.481,0.533,0.623,0.576,0.532,0.555,0.529,0.502,0.162,0.314,0.443,0.548,0.35,0.561,0.251,0.412,0.3,0.458,0.415,0.434,0.405,0.436,0.415,0.424,0.421,0.417,0.662,6976,OrganismalFitness,DLG4_HUMAN,Low,Human
+DLG4_RAT_McLaughlin_2012,0.407,0.404,0.398,0.423,0.435,0.447,0.338,0.359,0.383,0.401,0.371,0.453,0.462,0.334,0.438,0.447,0.432,0.398,0.347,0.35,0.298,0.31,0.31,0.292,0.304,0.383,0.341,0.301,0.334,0.371,0.426,0.417,0.207,0.286,0.304,0.252,0.389,0.392,0.371,0.438,0.444,0.441,0.404,0.049,0.328,0.407,0.334,0.258,0.371,0.119,0.341,0.31,0.347,0.331,0.353,0.338,0.307,0.347,0.359,0.362,0.404,0.42,1576,Binding,DLG4_RAT,Low,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC,0.161,0.117,0.188,0.173,0.185,0.208,-0.093,0.268,0.252,0.248,-0.014,0.038,0.034,0.006,0.093,0.089,0.216,0.212,0.256,0.177,-0.069,-0.002,-0.01,0.034,0.03,0.058,-0.006,0.03,0.121,0.204,0.3,0.272,0.01,-0.022,-0.026,-0.014,0.157,0.157,0.161,0.2,0.181,0.173,0.034,-0.077,0.133,0.042,0.45,0.438,0.522,0.423,0.276,0.232,0.343,0.331,0.308,0.315,0.288,0.268,0.272,0.308,0.387,0.335,1008,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,0.724,0.742,0.684,0.693,0.724,0.726,0.633,0.689,0.716,0.735,0.723,0.716,0.749,0.765,0.777,0.779,0.758,0.71,0.763,0.703,0.597,0.648,0.68,0.652,0.684,0.661,0.689,0.588,0.707,0.712,0.67,0.648,0.581,0.7,0.673,0.74,0.76,0.76,0.784,0.728,0.737,0.754,0.532,0.069,0.309,0.551,0.502,0.401,0.714,0.726,0.721,0.742,0.751,0.739,0.758,0.747,0.777,0.731,0.762,0.754,0.712,0.733,2264,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,0.321,0.337,0.321,0.327,0.36,0.359,0.046,0.317,0.268,0.28,0.309,0.332,0.354,0.184,0.337,0.359,0.35,0.272,0.335,0.342,0.241,0.298,0.262,0.272,0.247,0.268,0.214,0.21,0.208,0.319,0.261,0.274,0.153,0.214,0.249,0.227,0.372,0.386,0.352,0.35,0.37,0.354,0.224,-0.155,0.321,0.265,0.231,0.234,0.286,0.297,0.287,0.258,0.265,0.273,0.268,0.279,0.283,0.276,0.278,0.265,0.365,0.357,2915,Stability,DOCK1_MOUSE,High,Eukaryote
+DYR_ECOLI_Nguyen_2023,0.291,0.42,0.427,0.42,0.432,0.434,0.001,0.361,0.402,0.407,0.42,0.405,0.42,0.056,0.402,0.412,0.426,0.429,0.429,0.41,0.309,0.37,0.276,0.323,0.4,0.412,0.399,0.417,0.372,0.42,0.424,0.415,0.164,0.395,0.297,0.313,0.417,0.387,0.39,0.436,0.424,0.42,0.388,0.006,0.424,0.397,0.186,0.36,0.291,0.078,0.382,0.37,0.363,0.387,0.378,0.392,0.388,0.383,0.392,0.392,0.405,0.319,2916,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+DYR_ECOLI_Thompson_2019,0.336,0.396,0.391,0.387,0.406,0.404,-0.048,0.286,0.396,0.422,0.404,0.341,0.358,0.047,0.292,0.373,0.398,0.426,0.422,0.407,0.235,0.323,0.24,0.293,0.314,0.393,0.409,0.389,0.363,0.396,0.389,0.378,0.159,0.314,0.297,0.312,0.35,0.358,0.373,0.4,0.42,0.411,0.292,-0.025,0.369,0.336,0.2,0.358,0.27,0.084,0.363,0.336,0.334,0.349,0.367,0.358,0.376,0.354,0.385,0.373,0.407,0.268,2363,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+ENV_HV1B9_DuenasDecamp_2016,0.298,0.321,0.286,0.298,0.357,0.321,0.004,0.263,0.286,0.286,0.31,0.38,0.357,0.039,0.075,0.051,-0.008,0.039,0.004,0.298,0.357,0.321,0.321,0.321,0.298,0.357,0.333,0.286,0.321,0.38,0.368,0.353,0.251,0.345,0.31,0.31,0.392,0.345,0.345,0.345,0.333,0.333,0.274,0.004,0.31,0.333,0.333,0.286,0.274,0.216,0.18,0.274,0.298,0.204,0.157,0.216,0.216,0.204,0.157,0.216,0.122,0.086,375,OrganismalFitness,ENV_HV1B9,Medium,Virus
+ENV_HV1BR_Haddox_2016,0.251,0.241,0.235,0.237,0.251,0.256,-0.003,0.237,0.264,0.267,0.23,0.241,0.26,-0.006,-0.004,-0.001,0.027,0.043,0.107,0.251,0.271,0.269,0.275,0.279,0.256,0.272,0.268,0.263,0.271,0.277,0.248,0.223,0.158,0.26,0.27,0.27,0.277,0.279,0.274,0.274,0.277,0.27,0.163,-0.011,0.24,0.23,0.143,0.162,0.069,0.049,0.124,0.115,0.133,0.156,0.129,0.136,0.14,0.134,0.133,0.142,0.108,0.058,12863,OrganismalFitness,ENV_HV1BR,Medium,Virus
+ENVZ_ECOLI_Ghose_2023,0.138,0.131,0.123,0.142,0.187,0.187,0.001,0.157,0.135,0.127,0.131,0.183,0.194,0.146,0.164,0.157,0.161,0.142,0.12,0.153,0.138,0.149,0.168,0.176,0.135,0.153,0.202,0.19,0.19,0.164,0.157,0.135,0.064,0.123,0.153,0.194,0.176,0.179,0.198,0.183,0.202,0.202,0.127,0.068,0.164,0.168,0.049,0.146,0.101,0.042,0.164,0.172,0.131,0.161,0.138,0.164,0.157,0.153,0.176,0.164,0.112,0.183,1121,Activity,ENVZ_ECOLI,High,Prokaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,0.612,0.608,0.631,0.637,0.682,0.686,-0.261,0.655,0.749,0.753,0.755,0.745,0.773,-0.3,0.725,0.788,0.804,0.735,0.708,0.688,0.476,0.547,0.659,0.616,0.606,0.643,0.637,0.684,0.686,0.798,0.737,0.725,0.604,0.573,0.541,0.631,0.651,0.661,0.694,0.682,0.68,0.667,0.569,-0.253,0.553,0.541,0.492,0.537,0.741,0.657,0.794,0.778,0.788,0.776,0.8,0.796,0.802,0.794,0.786,0.802,0.751,0.745,1960,Stability,EPHB2_HUMAN,High,Human
+ERBB2_HUMAN_Elazar_2016,0.351,0.301,0.191,0.203,0.203,0.24,0.338,0.338,0.326,0.351,0.363,0.338,0.4,0.351,0.351,0.388,0.326,0.351,0.289,0.043,0.351,0.326,0.375,0.351,0.363,0.351,0.449,0.437,0.388,0.375,0.314,0.05,0.4,0.363,0.338,0.351,0.4,0.4,0.338,0.363,0.363,0.351,0.388,0.363,0.412,0.4,0.314,0.363,-0.105,0.043,0.351,0.301,0.363,0.289,0.301,0.314,0.338,0.314,0.338,0.351,0.412,0.4,326,Expression,ERBB2_HUMAN,Low,Human
+ESTA_BACSU_Nutschel_2020,0.216,0.286,0.307,0.314,0.29,0.288,0.13,0.237,0.248,0.305,0.237,0.218,0.248,0.113,0.18,0.2,0.224,0.235,0.27,0.231,0.054,0.137,0.209,0.227,0.181,0.205,0.227,0.205,0.301,0.279,0.355,0.298,0.064,0.113,0.191,0.202,0.238,0.244,0.238,0.312,0.301,0.31,0.126,0.1,0.207,0.185,0.425,0.367,0.426,0.272,0.272,0.259,0.253,0.242,0.262,0.248,0.25,0.273,0.272,0.266,0.318,0.152,2172,Stability,ESTA_BACSU,High,Prokaryote
+F7YBW8_MESOW_Aakre_2015,0.047,0.227,0.239,0.258,0.257,0.258,0.013,0.183,0.232,0.241,0.25,0.228,0.237,-0.043,0.009,0.049,0.248,0.237,0.264,0.225,-0.029,-0.07,-0.066,-0.014,0.076,0.071,-0.009,0.19,0.238,0.211,0.27,0.263,0.04,-0.006,-0.07,0.244,0.046,0.011,0.252,0.237,0.227,0.264,-0.006,-0.023,0.134,-0.008,-0.007,0.159,0.037,0.023,0.268,0.264,0.248,0.273,0.275,0.271,0.258,0.265,0.266,0.269,0.181,-0.079,9192,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+FECA_ECOLI_Tsuboyama_2023_2D1U,0.351,0.359,0.357,0.366,0.374,0.364,0.037,0.245,0.292,0.353,0.396,0.198,0.326,0.039,0.34,0.451,0.425,0.376,0.29,0.353,0.12,0.226,0.264,0.247,0.101,0.391,0.41,0.209,0.315,0.381,0.366,0.345,0.268,0.06,0.086,0.09,0.302,0.304,0.222,0.326,0.349,0.306,0.29,-0.022,0.362,0.343,0.436,0.391,0.485,0.419,0.44,0.374,0.423,0.4,0.393,0.432,0.425,0.391,0.41,0.419,0.476,0.483,1886,Stability,FECA_ECOLI,High,Prokaryote
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,0.3,0.271,0.374,0.378,0.378,0.387,0.135,0.248,0.18,0.177,0.196,0.164,0.164,0.171,0.148,0.154,0.151,0.216,0.277,0.216,0.209,0.203,0.196,0.167,0.141,0.158,0.18,0.177,0.184,0.374,0.235,0.263,-0.033,0.109,0.167,0.196,0.274,0.284,0.251,0.361,0.371,0.348,0.154,0.164,0.174,0.2,0.555,0.491,0.53,0.468,0.232,0.31,0.329,0.284,0.281,0.303,0.274,0.258,0.232,0.297,0.445,0.258,1237,Stability,FKBP3_HUMAN,Medium,Human
+GAL4_YEAST_Kitzman_2015,0.193,0.295,0.31,0.375,0.342,0.346,0.231,-0.007,0.342,0.426,0.426,0.335,0.353,0.255,0.27,0.382,0.466,0.451,0.441,0.346,0.204,0.241,0.284,0.284,0.237,0.291,0.375,0.331,0.397,0.451,0.451,0.399,0.186,0.171,0.201,0.197,0.342,0.351,0.375,0.331,0.313,0.321,0.288,0.193,0.419,0.339,0.186,0.368,0.259,0.102,0.404,0.415,0.39,0.397,0.426,0.386,0.397,0.393,0.408,0.419,0.422,0.306,1195,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+GCN4_YEAST_Staller_2018,0.192,0.175,0.178,0.184,0.16,0.158,0.119,0.157,0.18,0.175,0.169,0.212,0.216,0.246,0.215,0.201,0.215,0.187,0.169,0.166,0.113,0.136,0.107,0.105,0.096,0.089,0.061,0.095,0.122,0.14,0.183,0.186,0.033,0.114,0.116,0.231,0.195,0.199,0.228,0.193,0.19,0.215,0.037,0.084,0.087,0.046,0.105,0.125,0.172,0.16,0.154,0.145,0.143,0.139,0.151,0.149,0.143,0.149,0.158,0.148,0.171,0.183,2638,Binding,GCN4_YEAST,Low,Eukaryote
+GDIA_HUMAN_Silverstein_2021,0.338,0.341,0.366,0.348,0.338,0.338,0.102,0.279,0.373,0.369,0.293,0.307,0.328,0.113,0.161,0.293,0.289,0.258,0.317,0.289,0.182,0.279,0.289,0.307,0.213,0.341,0.31,0.258,0.279,0.31,0.348,0.314,0.185,0.217,0.248,0.255,0.317,0.307,0.331,0.338,0.355,0.359,0.175,0.088,0.317,0.172,0.217,0.289,0.248,0.095,0.272,0.248,0.293,0.272,0.276,0.289,0.279,0.272,0.262,0.314,0.334,0.213,1154,OrganismalFitness,GDIA_HUMAN,Low,Human
+GFP_AEQVI_Sarkisyan_2016,0.584,0.582,0.613,0.615,0.618,0.618,0.061,0.583,0.604,0.596,0.473,0.097,0.097,0.081,0.119,0.09,0.106,0.14,0.255,0.537,0.087,0.11,0.167,0.089,0.048,0.166,0.261,0.584,0.584,0.619,0.546,0.555,0.058,0.069,0.163,0.573,0.581,0.583,0.618,0.624,0.627,0.65,0.023,-0.006,0.036,0.036,0.456,0.445,0.67,0.535,0.529,0.537,0.546,0.554,0.546,0.545,0.542,0.543,0.525,0.541,0.562,0.402,51714,Activity,GFP_AEQVI,Low,Eukaryote
+GLPA_HUMAN_Elazar_2016,0.114,-0.045,0.102,0.135,0.102,0.07,0.331,0.413,0.282,0.25,0.446,0.446,0.446,0.315,0.446,0.462,0.495,0.381,0.43,0.152,0.381,0.397,0.43,0.413,0.348,0.413,0.462,0.43,0.479,0.209,0.364,0.136,0.168,0.43,0.364,0.43,0.397,0.381,0.397,0.299,0.25,0.364,0.413,0.397,0.462,0.397,0.331,0.495,0.266,0.168,0.413,0.381,0.495,0.43,0.446,0.397,0.462,0.397,0.413,0.479,0.413,0.413,245,Expression,GLPA_HUMAN,Low,Human
+GRB2_HUMAN_Faure_2021,0.314,0.392,0.384,0.411,0.404,0.406,0.403,0.338,0.405,0.365,0.39,0.337,0.378,0.43,0.464,0.474,0.496,0.396,0.429,0.407,0.419,0.396,0.4,0.344,0.419,0.397,0.341,0.409,0.342,0.376,0.308,0.308,0.361,0.418,0.385,0.31,0.39,0.381,0.331,0.421,0.421,0.397,0.451,0.235,0.399,0.435,0.54,0.401,0.548,0.396,0.484,0.518,0.507,0.505,0.511,0.494,0.496,0.512,0.491,0.514,0.421,0.468,63366,OrganismalFitness,GRB2_HUMAN,Medium,Human
+HCP_LAMBD_Tsuboyama_2023_2L6Q,0.238,0.312,0.092,0.123,0.269,0.281,0.188,0.288,0.346,0.342,0.458,0.35,0.377,0.238,0.315,0.419,0.473,0.412,0.431,0.192,0.212,0.219,0.273,0.288,0.254,0.273,0.162,0.265,0.392,0.408,0.477,0.36,0.115,0.227,0.223,0.369,0.312,0.315,0.373,0.25,0.323,0.377,0.223,0.235,0.458,0.308,0.492,0.527,0.573,0.408,0.519,0.531,0.523,0.523,0.535,0.531,0.504,0.535,0.523,0.535,0.577,0.45,1040,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM,0.297,0.324,0.238,0.244,0.344,0.353,0.172,0.294,0.286,0.284,0.253,0.209,0.231,0.169,0.175,0.37,0.326,0.277,0.272,0.303,0.201,0.261,0.333,0.312,0.252,0.32,0.307,0.288,0.302,0.331,0.305,0.308,0.132,0.186,0.215,0.24,0.368,0.345,0.372,0.351,0.335,0.356,0.105,0.079,0.354,0.141,0.222,0.336,0.331,0.266,0.272,0.32,0.351,0.332,0.335,0.372,0.327,0.342,0.329,0.348,0.34,0.29,5586,Stability,HECD1_HUMAN,Medium,Human
+HEM3_HUMAN_Loggerenberg_2023,0.298,0.313,0.295,0.304,0.313,0.311,0.1,0.096,0.32,0.33,0.311,0.283,0.306,0.105,0.273,0.299,0.31,0.321,0.337,0.115,0.239,0.292,0.307,0.308,0.254,0.297,0.295,0.305,0.321,0.318,0.313,0.312,0.061,0.277,0.291,0.313,0.306,0.328,0.347,0.307,0.325,0.344,0.22,0.106,0.292,0.26,0.228,0.28,0.224,0.115,0.274,0.268,0.271,0.288,0.294,0.285,0.297,0.292,0.293,0.301,0.339,0.303,5689,Activity,HEM3_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019,0.355,0.41,0.44,0.433,0.409,0.41,0.114,0.218,0.343,0.377,0.354,0.333,0.377,0.038,0.089,0.347,0.282,0.341,0.367,0.342,0.237,0.303,0.337,0.346,0.294,0.375,0.339,0.378,0.394,0.419,0.307,0.301,0.115,0.296,0.325,0.475,0.376,0.377,0.511,0.444,0.424,0.463,0.127,0.013,0.212,0.002,0.322,0.342,0.437,0.3,0.342,0.249,0.295,0.326,0.331,0.308,0.306,0.338,0.319,0.317,0.399,0.136,496137,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+HMDH_HUMAN_Jiang_2019,0.368,0.321,0.289,0.297,0.323,0.319,0.116,0.171,0.206,0.198,0.209,0.242,0.225,0.032,0.014,0.231,0.399,0.278,0.283,0.222,0.346,0.092,0.113,0.154,0.374,0.126,0.113,0.143,0.147,0.354,0.3,0.301,0.173,0.241,0.183,0.154,0.361,0.297,0.285,0.342,0.296,0.294,0.114,0.041,0.362,0.389,0.229,0.349,-0.175,0.074,0.378,0.376,0.367,0.378,0.379,0.373,0.37,0.382,0.374,0.382,0.42,0.327,16853,OrganismalFitness,HMDH_HUMAN,Low,Human
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.235,0.25,0.255,0.266,0.23,0.232,0.173,0.283,0.301,0.31,0.239,0.298,0.33,-0.003,0.1,0.17,0.171,0.216,0.214,-0.001,0.235,0.255,0.259,0.271,0.212,0.246,0.285,0.292,0.292,0.275,0.314,0.259,0.134,0.228,0.248,0.228,0.246,0.264,0.253,0.244,0.253,0.244,0.02,-0.01,0.282,0.285,0.035,0.186,0.06,-0.026,0.166,0.187,0.17,0.189,0.184,0.186,0.18,0.18,0.166,0.187,0.276,0.203,2252,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Flynn_2019,0.315,0.361,0.374,0.385,0.374,0.371,0.125,0.311,0.394,0.384,0.337,0.357,0.377,0.01,0.068,0.141,0.247,0.323,0.334,0.371,0.351,0.371,0.357,0.377,0.367,0.38,0.385,0.371,0.399,0.374,0.408,0.398,0.176,0.351,0.358,0.362,0.364,0.368,0.375,0.382,0.388,0.389,0.081,-0.056,0.35,0.27,0.16,0.295,0.09,0.05,0.245,0.248,0.262,0.264,0.254,0.259,0.265,0.255,0.242,0.258,0.355,0.295,13294,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Mishra_2016,0.394,0.4,0.405,0.405,0.397,0.394,0.27,0.309,0.374,0.381,0.394,0.411,0.416,0.187,0.275,0.305,0.368,0.369,0.37,0.376,0.38,0.366,0.368,0.395,0.385,0.385,0.401,0.366,0.391,0.399,0.401,0.396,0.238,0.363,0.378,0.386,0.403,0.413,0.41,0.403,0.408,0.41,0.358,-0.052,0.385,0.395,0.12,0.286,0.144,0.035,0.341,0.341,0.327,0.363,0.357,0.341,0.359,0.349,0.348,0.359,0.375,0.347,4323,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HXK4_HUMAN_Gersing_2022_activity,0.39,0.426,0.406,0.413,0.396,0.406,0.178,0.311,0.426,0.44,0.375,0.384,0.403,0.121,0.201,0.408,0.421,0.415,0.398,0.033,0.374,0.367,0.346,0.327,0.366,0.368,0.357,0.358,0.325,0.413,0.417,0.388,0.138,0.375,0.338,0.279,0.411,0.384,0.351,0.424,0.421,0.406,0.225,0.024,0.418,0.398,0.287,0.366,0.319,0.123,0.421,0.407,0.409,0.413,0.415,0.427,0.411,0.418,0.411,0.424,0.439,0.375,8570,OrganismalFitness,HXK4_HUMAN,Medium,Human
+HXK4_HUMAN_Gersing_2023_abundance,0.282,0.309,0.287,0.303,0.301,0.305,0.045,0.282,0.347,0.363,0.295,0.289,0.308,0.106,0.148,0.294,0.32,0.336,0.317,0.348,0.244,0.249,0.261,0.27,0.241,0.264,0.283,0.281,0.275,0.323,0.287,0.25,0.152,0.265,0.258,0.265,0.302,0.304,0.308,0.317,0.317,0.324,0.135,0.075,0.302,0.29,0.325,0.335,0.334,0.11,0.33,0.332,0.331,0.333,0.334,0.337,0.328,0.327,0.321,0.332,0.37,0.286,8396,Expression,HXK4_HUMAN,Medium,Human
+I6TAH8_I68A0_Doud_2015,0.287,0.24,0.213,0.208,0.274,0.277,-0.008,0.234,0.217,0.239,0.008,0.015,0.008,0.026,0.02,0.012,0.015,0.006,0.054,0.151,0.24,0.245,0.286,0.283,0.013,0.019,0.085,0.003,0.23,0.284,0.179,0.183,0.111,0.211,0.242,0.247,0.225,0.252,0.256,0.28,0.306,0.306,0.016,0.015,0.016,0.017,0.166,0.149,0.182,0.078,0.048,0.075,0.101,0.097,0.079,0.072,0.059,0.067,0.051,0.072,0.024,0.007,9462,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+IF1_ECOLI_Kelsic_2016,0.262,0.37,0.443,0.456,0.418,0.437,0.086,0.277,0.195,0.163,0.405,0.43,0.44,0.08,0.306,0.414,0.446,0.446,0.424,0.418,0.242,0.328,0.223,0.297,0.341,0.354,0.344,0.306,0.354,0.313,0.386,0.335,0.214,0.354,0.373,0.398,0.367,0.363,0.36,0.418,0.421,0.437,0.252,0.093,0.449,0.373,0.23,0.398,0.348,0.172,0.408,0.408,0.408,0.43,0.389,0.408,0.389,0.411,0.414,0.421,0.456,0.379,1367,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33,0.295,0.387,0.425,0.418,0.415,0.422,0.176,0.271,0.38,0.404,0.313,0.331,0.352,0.264,0.366,0.422,0.26,0.246,0.18,0.432,0.127,0.187,0.274,0.302,0.317,0.222,0.324,0.243,0.267,0.411,0.352,0.316,0.025,0.173,0.232,0.313,0.302,0.345,0.359,0.39,0.387,0.415,0.299,0.176,0.306,0.38,0.415,0.366,0.432,0.331,0.267,0.229,0.288,0.25,0.278,0.331,0.292,0.25,0.341,0.324,0.292,0.362,1329,Stability,ILF3_HUMAN,High,Human
+ISDH_STAAW_Tsuboyama_2023_2LHR,0.065,0.108,0.139,0.147,0.151,0.176,0.25,0.124,0.281,0.266,0.378,0.326,0.326,0.271,0.291,0.336,0.324,0.371,0.326,0.131,0.219,0.145,0.106,0.166,0.221,0.242,0.221,0.211,0.242,0.275,0.334,0.315,0.17,0.192,0.174,0.184,0.155,0.139,0.137,0.194,0.157,0.17,0.199,0.178,0.252,0.24,0.427,0.353,0.419,0.4,0.32,0.324,0.345,0.332,0.33,0.345,0.316,0.324,0.332,0.334,0.437,0.359,1944,Stability,ISDH_STAAW,High,Prokaryote
+KCNE1_HUMAN_Muhammad_2023_expression,0.195,0.2,0.195,0.177,0.181,0.171,0.179,0.108,0.161,0.171,0.226,0.21,0.195,0.179,0.175,0.226,0.181,0.173,0.179,0.197,0.195,0.204,0.161,0.11,0.212,0.148,0.208,0.191,0.161,0.181,0.175,0.128,-0.087,0.189,0.226,0.163,0.177,0.185,0.212,0.191,0.191,0.208,0.206,0.167,0.204,0.228,0.193,0.2,0.189,0.063,0.153,0.169,0.163,0.155,0.169,0.15,0.179,0.153,0.163,0.163,0.204,0.193,2339,Expression,KCNE1_HUMAN,Medium,Human
+KCNE1_HUMAN_Muhammad_2023_function,0.282,0.331,0.358,0.393,0.362,0.391,0.111,0.389,0.374,0.388,0.215,0.32,0.41,0.13,0.085,0.446,0.391,0.322,0.254,0.159,0.153,0.179,0.329,0.401,0.166,0.445,0.453,0.433,0.424,0.337,0.46,0.38,0.006,0.104,0.234,0.491,0.298,0.318,0.457,0.375,0.391,0.486,0.121,0.116,0.386,0.087,0.123,0.313,0.127,0.03,0.407,0.396,0.389,0.394,0.394,0.381,0.403,0.4,0.42,0.42,0.507,0.127,2315,Activity,KCNE1_HUMAN,Medium,Human
+KCNH2_HUMAN_Kozek_2020,0.36,0.4,0.18,0.18,0.18,0.16,0.36,0.04,0.26,0.26,0.2,0.18,0.16,0.18,0.12,0.16,0.16,0.16,0.12,0.36,0.36,0.36,0.34,0.28,0.38,0.32,0.32,0.28,0.28,0.281,0.3,0.14,0.28,0.32,0.36,0.36,0.36,0.4,0.4,0.3,0.4,0.36,0.32,0.14,0.16,0.22,0.12,0.0,-0.12,0.06,0.4,0.3,0.4,0.34,0.3,0.34,0.38,0.34,0.4,0.36,0.18,0.36,200,Activity,KCNH2_HUMAN,Medium,Human
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.215,0.273,0.26,0.277,0.285,0.288,0.049,0.183,0.256,0.246,0.306,0.288,0.297,0.035,0.216,0.308,0.322,0.326,0.326,0.126,0.283,0.222,0.187,0.179,0.28,0.268,0.241,0.274,0.172,0.299,0.303,0.291,0.118,0.261,0.238,0.168,0.286,0.267,0.227,0.299,0.291,0.27,0.154,-0.009,0.267,0.297,0.153,0.227,0.185,0.062,0.318,0.304,0.308,0.301,0.307,0.293,0.311,0.309,0.301,0.313,0.297,0.215,6963,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.261,0.225,0.165,0.183,0.214,0.217,0.073,0.143,0.186,0.188,0.207,0.186,0.198,0.11,0.23,0.225,0.221,0.222,0.243,0.158,0.143,0.158,0.161,0.178,0.157,0.179,0.172,0.176,0.176,0.236,0.19,0.152,0.114,0.143,0.17,0.161,0.199,0.223,0.224,0.203,0.224,0.224,0.166,0.043,0.172,0.195,0.169,0.157,0.168,0.094,0.217,0.198,0.214,0.199,0.208,0.194,0.209,0.201,0.21,0.21,0.22,0.22,6917,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+KKA2_KLEPN_Melnikov_2014,0.221,0.452,0.366,0.516,0.499,0.508,0.148,0.38,0.449,0.509,0.471,0.488,0.512,0.165,0.199,0.404,0.486,0.531,0.556,0.394,0.213,0.331,0.418,0.467,0.268,0.488,0.486,0.473,0.526,0.523,0.548,0.499,0.113,0.184,0.407,0.495,0.346,0.429,0.487,0.495,0.511,0.526,0.191,0.069,0.467,0.324,0.362,0.486,0.439,0.185,0.479,0.475,0.478,0.478,0.476,0.482,0.476,0.482,0.488,0.491,0.53,0.239,4960,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+LGK_LIPST_Klesmith_2015,0.233,0.292,0.34,0.345,0.332,0.344,0.089,0.352,0.348,0.414,0.336,0.394,0.424,0.125,0.241,0.3,0.391,0.433,0.42,0.381,0.228,0.316,0.366,0.378,0.292,0.39,0.402,0.367,0.409,0.39,0.406,0.338,0.114,0.253,0.312,0.422,0.28,0.31,0.389,0.339,0.357,0.393,0.128,0.047,0.368,0.278,0.29,0.371,0.358,0.106,0.387,0.366,0.383,0.374,0.375,0.395,0.393,0.392,0.395,0.397,0.301,0.251,7890,Activity,LGK_LIPST,Medium,Eukaryote
+LYAM1_HUMAN_Elazar_2016,0.26,0.26,0.148,0.159,0.215,0.226,0.26,0.137,0.304,0.327,0.315,0.26,0.293,0.193,0.237,0.193,0.226,0.26,0.349,0.193,0.237,0.26,0.304,0.282,0.215,0.304,0.271,0.182,0.182,0.289,0.204,0.144,0.137,0.282,0.315,0.215,0.282,0.304,0.282,0.204,0.249,0.204,0.215,0.215,0.237,0.249,0.103,0.148,0.115,0.025,0.182,0.249,0.26,0.226,0.193,0.204,0.237,0.271,0.193,0.249,0.293,0.226,359,Expression,LYAM1_HUMAN,Medium,Human
+MAFG_MOUSE_Tsuboyama_2023_1K1V,0.422,0.536,0.481,0.481,0.498,0.486,0.327,0.649,0.581,0.563,0.643,0.309,0.584,0.416,0.395,0.436,0.457,0.498,0.339,0.575,0.365,0.557,0.546,0.569,0.498,0.528,0.469,0.531,0.472,0.43,0.548,0.522,0.383,0.247,0.513,0.525,0.454,0.566,0.599,0.522,0.599,0.64,0.265,0.038,0.36,0.236,0.516,0.301,0.548,0.566,0.652,0.634,0.628,0.64,0.637,0.634,0.637,0.649,0.661,0.64,0.593,0.64,1429,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV,0.518,0.657,0.614,0.628,0.675,0.681,-0.026,0.471,0.612,0.675,0.658,0.333,0.461,-0.066,0.104,0.692,0.612,0.605,0.677,0.65,0.04,0.1,0.544,0.592,0.221,0.476,0.444,0.512,0.667,0.692,0.662,0.615,0.558,-0.242,0.104,0.093,0.488,0.578,0.565,0.63,0.667,0.658,0.03,-0.106,0.578,0.34,0.399,0.541,0.664,0.446,0.698,0.673,0.675,0.681,0.66,0.662,0.671,0.692,0.703,0.688,0.673,0.609,2116,Stability,MBD11_ARATH,Medium,Eukaryote
+MET_HUMAN_Estevam_2023,0.453,0.528,0.52,0.524,0.554,0.558,0.489,0.49,0.517,0.527,0.55,0.503,0.517,0.414,0.471,0.494,0.555,0.579,0.572,0.563,0.497,0.482,0.435,0.461,0.496,0.503,0.499,0.452,0.423,0.507,0.528,0.494,0.26,0.463,0.467,0.489,0.479,0.499,0.527,0.524,0.531,0.543,0.503,0.303,0.535,0.544,0.328,0.388,0.476,0.173,0.509,0.521,0.513,0.522,0.526,0.533,0.521,0.534,0.531,0.532,0.544,0.49,5393,Activity,MET_HUMAN,Medium,Human
+MK01_HUMAN_Brenan_2016,0.152,0.19,0.209,0.212,0.193,0.197,0.173,0.149,0.195,0.171,0.069,0.156,0.171,0.132,0.173,0.171,0.166,0.178,0.155,0.172,0.206,0.131,0.099,0.048,0.161,0.104,0.081,0.087,-0.021,0.19,0.171,0.198,0.119,0.163,0.093,0.054,0.176,0.138,0.125,0.202,0.185,0.179,0.149,0.1,0.146,0.162,0.068,0.016,0.122,0.004,0.158,0.155,0.153,0.171,0.169,0.156,0.163,0.168,0.165,0.167,0.168,0.166,6809,OrganismalFitness,MK01_HUMAN,Medium,Human
+MLAC_ECOLI_MacRae_2023,0.176,0.274,0.344,0.354,0.337,0.33,-0.022,0.271,0.346,0.359,0.302,0.343,0.354,-0.022,0.174,0.207,0.312,0.295,0.313,0.295,0.042,0.266,0.274,0.309,0.248,0.314,0.306,0.301,0.339,0.32,0.356,0.319,0.028,0.103,0.218,0.216,0.203,0.249,0.231,0.286,0.306,0.303,0.108,-0.05,0.244,0.174,0.027,0.154,0.112,-0.012,0.244,0.214,0.235,0.23,0.239,0.242,0.222,0.233,0.268,0.24,0.16,0.151,4007,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+MSH2_HUMAN_Jia_2020,0.308,0.343,0.326,0.335,0.335,0.338,0.177,0.276,0.342,0.348,0.308,0.328,0.34,0.184,0.287,0.338,0.324,0.268,0.181,0.335,0.245,0.302,0.283,0.277,0.28,0.315,0.33,0.306,0.289,0.34,0.327,0.309,0.176,0.235,0.29,0.277,0.299,0.326,0.326,0.337,0.343,0.34,0.222,0.095,0.332,0.283,0.258,0.317,0.044,0.089,0.279,0.262,0.272,0.279,0.297,0.275,0.277,0.275,0.298,0.288,0.343,0.306,16749,OrganismalFitness,MSH2_HUMAN,Medium,Human
+MTH3_HAEAE_RockahShmuel_2015,0.259,0.419,0.447,0.453,0.458,0.442,0.215,0.414,0.435,0.442,0.38,0.455,0.45,0.153,0.21,0.236,0.336,0.393,0.435,0.429,0.202,0.298,0.393,0.429,0.323,0.437,0.445,0.447,0.46,0.45,0.455,0.429,0.215,0.212,0.331,0.427,0.261,0.329,0.419,0.396,0.419,0.458,0.223,0.026,0.321,0.212,0.287,0.37,0.357,0.119,0.36,0.334,0.36,0.354,0.37,0.378,0.357,0.362,0.354,0.365,0.378,0.215,1777,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+MTHR_HUMAN_Weile_2021,0.169,0.177,0.167,0.173,0.175,0.17,0.113,0.111,0.2,0.204,0.258,0.2,0.213,0.108,0.261,0.335,0.255,0.191,0.203,0.178,0.311,0.105,0.141,0.182,0.297,0.156,0.18,0.17,0.221,0.203,0.149,0.139,0.087,0.309,0.219,0.133,0.233,0.203,0.158,0.219,0.193,0.171,0.174,0.07,0.251,0.262,0.159,0.237,0.27,0.066,0.232,0.207,0.211,0.225,0.223,0.234,0.249,0.245,0.254,0.238,0.223,0.245,12464,OrganismalFitness,MTHR_HUMAN,Low,Human
+MYO3_YEAST_Tsuboyama_2023_2BTT,0.156,0.169,0.255,0.261,0.262,0.298,0.104,0.207,0.33,0.321,0.394,0.376,0.39,0.136,0.323,0.442,0.44,0.393,0.206,0.244,0.249,0.18,0.244,0.213,0.244,0.176,0.251,0.19,0.222,0.189,0.225,0.13,0.114,0.309,0.216,0.221,0.287,0.26,0.246,0.302,0.298,0.283,0.1,-0.002,0.313,0.223,0.297,0.336,0.379,0.257,0.421,0.398,0.43,0.406,0.418,0.421,0.406,0.413,0.401,0.411,0.431,0.39,3297,Stability,MYO3_YEAST,High,Eukaryote
+NCAP_I34A1_Doud_2015,0.29,0.24,0.248,0.25,0.283,0.282,0.012,0.241,0.275,0.252,0.029,0.026,0.025,0.037,0.034,0.026,0.04,0.034,0.089,0.215,0.266,0.264,0.293,0.288,0.03,0.058,0.098,0.055,0.256,0.294,0.199,0.211,0.105,0.259,0.272,0.295,0.281,0.301,0.321,0.325,0.324,0.339,0.035,0.019,0.029,0.035,0.196,0.196,0.198,0.081,0.102,0.128,0.133,0.149,0.135,0.129,0.116,0.134,0.107,0.13,0.096,0.066,9462,OrganismalFitness,NCAP_I34A1,Medium,Virus
+NKX31_HUMAN_Tsuboyama_2023_2L9R,0.272,0.343,0.463,0.459,0.511,0.47,0.399,0.482,0.5,0.509,0.534,0.532,0.535,0.497,0.417,0.472,0.458,0.422,0.467,0.444,0.479,0.459,0.477,0.475,0.436,0.435,0.47,0.479,0.507,0.498,0.435,0.42,0.481,0.431,0.438,0.486,0.442,0.447,0.498,0.489,0.505,0.512,0.33,0.24,0.258,0.306,0.431,0.256,0.497,0.511,0.523,0.53,0.527,0.535,0.535,0.509,0.525,0.518,0.523,0.528,0.525,0.498,2482,Stability,NKX31_HUMAN,High,Human
+NPC1_HUMAN_Erwood_2022_HEK293T,0.526,0.572,0.481,0.513,0.578,0.565,0.183,0.358,0.611,0.598,0.572,0.274,0.371,0.151,0.306,0.494,0.572,0.578,0.578,0.054,0.196,0.358,0.3,0.306,0.332,0.403,0.475,0.358,0.365,0.572,0.565,0.468,0.131,0.157,0.41,0.416,0.488,0.501,0.546,0.539,0.546,0.578,0.235,0.041,0.559,0.488,0.378,0.533,0.028,0.119,0.572,0.565,0.572,0.585,0.578,0.552,0.559,0.572,0.578,0.572,0.533,0.287,637,Activity,NPC1_HUMAN,Low,Human
+NPC1_HUMAN_Erwood_2022_RPE1,0.651,0.397,0.079,0.142,0.397,0.46,0.269,0.333,0.333,0.397,0.46,0.015,0.142,0.079,0.206,0.524,0.46,0.397,0.397,0.206,0.524,0.333,0.397,0.142,0.333,0.333,0.269,0.206,0.206,0.333,0.651,0.426,0.397,0.269,0.079,0.333,0.587,0.397,0.524,0.587,0.397,0.46,0.333,0.206,0.651,0.142,0.269,0.397,0.142,0.142,0.397,0.46,0.46,0.397,0.46,0.46,0.397,0.397,0.397,0.46,0.587,0.333,63,Activity,NPC1_HUMAN,Low,Human
+NRAM_I33A0_Jiang_2016,0.477,0.45,0.356,0.329,0.45,0.463,0.06,0.289,0.477,0.477,-0.02,0.114,0.369,-0.06,-0.034,0.06,0.154,0.396,0.45,0.302,0.342,0.477,0.396,0.409,0.06,0.409,0.45,0.329,0.423,0.53,0.289,0.342,0.007,0.356,0.383,0.396,0.436,0.503,0.544,0.503,0.544,0.477,-0.034,-0.034,-0.06,-0.074,0.356,0.289,0.369,0.154,0.248,0.195,0.181,0.221,0.154,0.235,0.221,0.208,0.154,0.168,0.221,0.128,298,OrganismalFitness,NRAM_I33A0,Low,Virus
+NUD15_HUMAN_Suiter_2020,0.218,0.386,0.423,0.454,0.443,0.446,0.01,0.317,0.469,0.528,0.478,0.472,0.514,0.206,0.32,0.35,0.405,0.472,0.475,0.393,0.221,0.356,0.44,0.42,0.308,0.49,0.471,0.454,0.408,0.417,0.502,0.449,0.148,0.254,0.326,0.454,0.287,0.314,0.456,0.435,0.434,0.472,0.3,0.021,0.481,0.327,0.351,0.451,0.398,0.23,0.417,0.399,0.431,0.453,0.435,0.454,0.438,0.459,0.454,0.447,0.549,0.453,2844,Expression,NUD15_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL,0.352,0.417,0.455,0.449,0.486,0.484,0.204,0.324,0.512,0.453,0.411,0.23,0.24,0.232,0.291,0.287,0.356,0.403,0.388,0.423,0.246,0.427,0.464,0.461,0.263,0.386,0.39,0.386,0.441,0.486,0.526,0.618,0.129,0.244,0.322,0.307,0.364,0.372,0.386,0.496,0.464,0.486,0.257,0.179,0.267,0.248,0.528,0.49,0.591,0.512,0.61,0.597,0.61,0.628,0.644,0.654,0.634,0.616,0.64,0.642,0.536,0.455,2028,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6,0.353,0.326,0.338,0.341,0.369,0.356,0.225,0.372,0.36,0.347,0.356,0.298,0.32,0.289,0.396,0.35,0.375,0.399,0.402,0.375,0.234,0.289,0.286,0.304,0.344,0.28,0.283,0.243,0.32,0.427,0.341,0.346,0.2,0.216,0.234,0.344,0.335,0.326,0.372,0.338,0.347,0.356,0.237,0.115,0.216,0.311,0.439,0.39,0.571,0.531,0.405,0.363,0.35,0.399,0.412,0.396,0.399,0.415,0.412,0.399,0.338,0.399,1380,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,0.342,0.402,0.508,0.521,0.505,0.512,0.199,0.396,0.523,0.526,0.533,0.482,0.498,0.198,0.515,0.541,0.545,0.522,0.533,0.511,0.526,0.519,0.518,0.526,0.425,0.503,0.499,0.491,0.472,0.486,0.531,0.514,0.316,0.133,0.223,0.318,0.465,0.441,0.439,0.515,0.511,0.511,0.268,-0.044,0.261,0.319,0.406,0.349,0.508,0.513,0.571,0.562,0.565,0.562,0.569,0.568,0.559,0.555,0.575,0.566,0.535,0.493,3197,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G,-0.298,-0.201,0.189,0.083,0.189,0.185,-0.171,0.03,0.083,0.083,0.002,-0.108,-0.097,-0.079,-0.139,0.026,0.097,0.034,0.083,0.129,0.108,0.051,0.041,0.069,0.044,0.069,0.079,0.09,0.108,0.189,-0.034,-0.063,0.146,-0.108,0.009,-0.005,0.097,0.097,0.101,0.161,0.168,0.189,0.15,-0.079,0.157,0.168,0.245,0.228,0.256,0.185,0.026,-0.005,0.012,0.041,-0.002,0.041,0.037,0.044,0.019,0.026,0.15,0.076,1134,Stability,ODP2_GEOSE,High,Prokaryote
+OPSD_HUMAN_Wan_2019,0.139,0.382,0.358,0.503,0.406,0.382,0.309,0.43,0.43,0.43,0.212,0.406,0.43,0.188,0.479,0.43,0.406,0.358,0.43,0.503,0.479,0.455,0.455,0.455,0.503,0.455,0.503,0.455,0.43,0.333,0.382,0.261,0.188,0.406,0.43,0.382,0.455,0.43,0.406,0.43,0.358,0.382,0.261,0.042,0.406,0.406,0.624,0.406,0.576,0.212,0.333,0.309,0.309,0.261,0.43,0.406,0.236,0.309,0.382,0.358,0.503,0.406,165,Expression,OPSD_HUMAN,High,Human
+OTC_HUMAN_Lo_2023,0.404,0.432,0.376,0.414,0.404,0.419,0.062,0.34,0.455,0.442,0.424,0.424,0.452,0.057,0.32,0.394,0.417,0.376,0.424,0.424,0.355,0.33,0.396,0.427,0.386,0.445,0.417,0.427,0.439,0.447,0.366,0.312,0.055,0.327,0.389,0.429,0.396,0.432,0.48,0.434,0.452,0.462,0.149,0.045,0.419,0.345,0.511,0.468,0.516,0.266,0.434,0.422,0.434,0.439,0.445,0.437,0.437,0.427,0.419,0.452,0.473,0.353,1570,Activity,OTC_HUMAN,Medium,Human
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,0.089,0.218,0.177,0.184,0.204,0.211,0.157,0.13,0.15,0.137,0.354,0.51,0.496,0.198,0.462,0.482,0.347,0.313,0.333,0.15,0.123,0.191,0.259,0.259,0.32,0.313,0.367,0.34,0.137,0.232,0.381,0.381,0.062,0.137,0.211,0.306,0.218,0.211,0.286,0.232,0.177,0.225,0.266,0.13,0.449,0.442,0.435,0.476,0.489,0.394,0.394,0.286,0.32,0.36,0.374,0.388,0.313,0.374,0.442,0.374,0.51,0.469,635,Stability,OTU7A_HUMAN,High,Human
+OXDA_RHOTO_Vanella_2023_activity,0.06,0.082,0.085,0.094,0.08,0.085,0.056,0.074,0.068,0.072,0.105,0.098,0.086,0.04,0.072,0.074,0.088,0.101,0.109,0.094,0.045,0.071,0.101,0.098,0.076,0.1,0.109,0.089,0.121,0.085,0.123,0.106,0.011,0.056,0.074,0.088,0.082,0.089,0.089,0.085,0.085,0.08,0.072,0.028,0.097,0.082,0.074,0.105,0.083,0.036,0.098,0.098,0.095,0.095,0.092,0.106,0.094,0.095,0.088,0.098,0.1,0.059,6396,Activity,OXDA_RHOTO,High,Eukaryote
+OXDA_RHOTO_Vanella_2023_expression,0.156,0.208,0.215,0.215,0.192,0.192,0.156,0.155,0.209,0.209,0.247,0.231,0.226,0.164,0.188,0.223,0.241,0.253,0.247,0.158,0.154,0.181,0.187,0.188,0.175,0.204,0.221,0.213,0.226,0.204,0.259,0.227,0.066,0.168,0.163,0.184,0.19,0.185,0.204,0.209,0.204,0.206,0.188,0.122,0.231,0.179,0.262,0.248,0.204,0.059,0.245,0.223,0.224,0.233,0.23,0.229,0.231,0.236,0.227,0.237,0.272,0.216,6769,Expression,OXDA_RHOTO,High,Eukaryote
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.418,0.42,0.383,0.395,0.429,0.439,-0.046,0.319,0.323,0.346,0.411,0.415,0.45,-0.095,-0.072,0.294,0.402,0.423,0.457,0.157,0.261,0.364,0.414,0.403,0.355,0.423,0.429,0.426,0.371,0.416,0.449,0.431,0.153,0.236,0.39,0.36,0.416,0.434,0.424,0.436,0.455,0.431,-0.09,-0.096,0.417,-0.009,0.338,0.42,0.358,0.169,0.407,0.407,0.409,0.405,0.414,0.431,0.406,0.415,0.419,0.42,0.437,0.179,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.251,0.284,0.25,0.254,0.301,0.306,-0.067,0.181,0.206,0.21,0.349,0.341,0.363,-0.136,-0.118,0.165,0.278,0.305,0.338,0.09,0.194,0.343,0.357,0.344,0.289,0.369,0.374,0.386,0.245,0.304,0.301,0.268,0.164,0.142,0.354,0.255,0.257,0.306,0.297,0.283,0.335,0.299,-0.117,-0.115,0.349,-0.068,0.268,0.341,0.28,0.118,0.298,0.281,0.296,0.305,0.301,0.319,0.288,0.294,0.297,0.304,0.338,0.09,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.394,0.399,0.363,0.38,0.392,0.4,-0.034,0.305,0.311,0.31,0.438,0.409,0.434,-0.086,-0.066,0.334,0.39,0.411,0.422,0.14,0.3,0.402,0.434,0.418,0.369,0.439,0.44,0.441,0.317,0.43,0.426,0.41,0.155,0.268,0.435,0.374,0.404,0.431,0.418,0.411,0.438,0.411,-0.081,-0.104,0.434,0.004,0.367,0.431,0.363,0.187,0.398,0.407,0.399,0.407,0.415,0.421,0.4,0.407,0.403,0.415,0.429,0.189,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Kotler_2018,0.542,0.519,0.429,0.507,0.487,0.515,0.056,0.386,0.499,0.511,0.476,0.472,0.542,0.052,0.037,0.53,0.604,0.619,0.631,0.359,0.317,0.402,0.441,0.429,0.386,0.433,0.421,0.445,0.429,0.476,0.55,0.515,0.056,0.336,0.421,0.344,0.519,0.484,0.464,0.53,0.495,0.511,0.052,0.002,0.573,0.153,0.39,0.507,0.394,0.181,0.588,0.612,0.604,0.588,0.585,0.588,0.608,0.573,0.616,0.612,0.639,0.27,1048,OrganismalFitness,P53_HUMAN,Low,Human
+P84126_THETH_Chan_2017,0.484,0.516,0.534,0.534,0.489,0.521,0.272,0.407,0.539,0.545,0.481,0.489,0.531,0.193,0.484,0.492,0.524,0.486,0.463,0.524,0.383,0.436,0.434,0.486,0.481,0.518,0.51,0.518,0.563,0.486,0.471,0.401,0.322,0.423,0.436,0.471,0.492,0.494,0.516,0.518,0.524,0.537,0.404,-0.032,0.46,0.402,0.256,0.336,0.396,0.081,0.46,0.436,0.373,0.479,0.473,0.42,0.449,0.457,0.473,0.476,0.524,0.526,1519,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+PA_I34A1_Wu_2015,0.36,0.376,0.36,0.365,0.396,0.409,0.048,0.301,0.145,0.101,0.026,0.04,0.088,-0.002,0.024,0.026,0.024,0.018,0.275,0.248,0.336,0.382,0.4,0.407,0.171,0.297,0.332,0.336,0.327,0.516,0.284,0.264,0.086,0.308,0.347,0.396,0.433,0.44,0.435,0.462,0.455,0.433,0.024,0.031,0.037,0.013,0.18,0.178,0.149,0.046,0.132,0.141,0.147,0.149,0.134,0.152,0.134,0.132,0.101,0.143,0.167,0.121,1820,OrganismalFitness,PA_I34A1,Medium,Virus
+PABP_YEAST_Melamed_2013,0.521,0.474,0.41,0.418,0.495,0.493,0.35,0.429,0.491,0.512,0.545,0.514,0.534,0.357,0.429,0.515,0.597,0.549,0.563,0.408,0.508,0.528,0.531,0.555,0.506,0.558,0.57,0.538,0.525,0.522,0.568,0.503,0.191,0.511,0.504,0.503,0.556,0.556,0.551,0.531,0.53,0.529,0.465,-0.048,0.518,0.517,0.274,0.461,0.393,0.144,0.542,0.532,0.545,0.57,0.558,0.563,0.565,0.561,0.594,0.581,0.574,0.502,37708,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+PAI1_HUMAN_Huttinger_2021,0.301,0.28,0.263,0.275,0.283,0.298,0.046,0.238,0.316,0.325,0.335,0.312,0.312,0.061,0.298,0.322,0.344,0.24,0.205,0.316,0.103,0.277,0.256,0.24,0.277,0.293,0.289,0.271,0.218,0.311,0.3,0.268,0.125,0.121,0.264,0.274,0.284,0.295,0.299,0.301,0.31,0.306,0.252,0.053,0.322,0.306,0.297,0.343,0.305,0.108,0.334,0.344,0.337,0.348,0.339,0.348,0.342,0.347,0.341,0.353,0.36,0.302,5345,Activity,PAI1_HUMAN,,Human
+PHOT_CHLRE_Chen_2023,0.149,0.3,0.541,0.52,0.215,0.256,0.477,0.395,0.526,0.536,0.474,0.57,0.6,0.58,0.648,0.529,0.563,0.528,0.574,0.472,0.499,0.526,0.433,0.51,0.326,0.434,0.417,0.483,0.426,0.415,0.414,0.346,0.21,0.398,0.325,0.419,0.413,0.369,0.422,0.282,0.302,0.287,0.469,0.278,0.452,0.463,0.142,0.364,0.493,0.284,0.439,0.388,0.395,0.393,0.398,0.411,0.408,0.413,0.421,0.41,0.551,0.588,167529,Activity,PHOT_CHLRE,High,Eukaryote
+PIN1_HUMAN_Tsuboyama_2023_1I6C,0.2,0.234,0.549,0.499,0.489,0.544,0.489,0.384,0.514,0.579,0.524,0.449,0.569,0.529,0.494,0.524,0.549,0.394,0.464,0.499,0.354,0.524,0.509,0.439,0.424,0.519,0.574,0.544,0.509,0.594,0.594,0.549,0.434,0.399,0.494,0.529,0.434,0.499,0.564,0.549,0.564,0.564,0.474,-0.17,0.529,0.414,0.399,0.444,0.594,0.569,0.534,0.494,0.414,0.389,0.389,0.444,0.424,0.504,0.454,0.444,0.603,0.653,802,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M,0.461,0.364,0.333,0.342,0.336,0.351,0.349,0.36,0.412,0.41,0.36,0.366,0.384,0.415,0.524,0.577,0.474,0.447,0.373,0.375,0.371,0.338,0.369,0.344,0.423,0.364,0.344,0.331,0.353,0.349,0.349,0.35,0.338,0.415,0.358,0.373,0.472,0.423,0.404,0.406,0.358,0.36,0.349,0.303,0.178,0.235,0.294,0.2,0.537,0.498,0.377,0.388,0.393,0.406,0.399,0.419,0.395,0.384,0.393,0.39,0.39,0.443,1824,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF,0.177,0.184,0.232,0.246,0.288,0.285,0.177,0.246,0.201,0.229,0.226,0.281,0.299,0.327,0.334,0.435,0.26,0.226,0.229,0.292,0.292,0.278,0.222,0.281,0.281,0.299,0.299,0.278,0.281,0.295,0.32,0.32,0.062,0.309,0.271,0.306,0.246,0.292,0.323,0.306,0.288,0.306,0.274,0.267,0.274,0.288,0.431,0.337,0.428,0.376,0.232,0.264,0.292,0.243,0.257,0.271,0.236,0.257,0.281,0.257,0.292,0.424,1301,Stability,PKN1_HUMAN,High,Human
+POLG_CXB3N_Mattenberger_2021,0.301,0.281,0.26,0.299,0.34,0.348,-0.03,0.255,0.376,0.374,0.228,-0.036,0.001,-0.047,-0.03,0.124,0.317,0.324,0.346,0.259,0.25,0.283,0.272,0.264,0.074,0.287,0.277,0.261,0.273,0.365,0.295,0.229,-0.005,0.036,0.187,0.248,0.263,0.276,0.311,0.288,0.307,0.347,-0.035,-0.039,0.268,-0.03,0.147,0.268,0.077,0.04,0.259,0.286,0.289,0.291,0.286,0.288,0.286,0.285,0.294,0.29,0.068,0.051,15711,OrganismalFitness,POLG_CXB3N,Medium,Virus
+POLG_DEN26_Suphatrakul_2023,0.357,0.454,0.184,0.185,0.413,0.418,-0.014,0.33,0.535,0.542,0.241,0.011,0.02,-0.009,0.025,0.059,0.122,0.206,0.274,0.312,0.334,0.351,0.344,0.344,0.327,0.367,0.371,0.363,0.352,0.484,0.468,0.408,0.029,-0.025,0.087,0.352,0.331,0.337,0.422,0.364,0.315,0.43,-0.028,-0.025,0.319,0.017,0.282,0.42,0.099,0.084,0.195,0.189,0.198,0.216,0.184,0.217,0.197,0.199,0.19,0.206,0.115,0.023,16897,OrganismalFitness,POLG_DEN26,Low,Virus
+POLG_HCVJF_Qi_2014,0.448,0.432,0.336,0.346,0.46,0.472,-0.041,0.152,0.405,0.423,0.131,0.507,0.475,0.074,0.101,0.082,0.082,0.062,0.062,0.158,0.309,0.358,0.336,0.376,0.302,0.376,0.213,0.314,0.405,0.465,0.467,0.399,0.151,0.361,0.395,0.386,0.4,0.428,0.415,0.353,0.403,0.423,0.079,0.047,0.378,0.084,-0.035,0.26,0.578,0.311,0.245,0.292,0.282,0.297,0.299,0.227,0.269,0.299,0.267,0.309,0.119,0.143,1630,OrganismalFitness,POLG_HCVJF,Medium,Virus
+POLG_PESV_Tsuboyama_2023_2MXD,0.194,0.352,0.217,0.263,0.295,0.297,0.053,0.269,0.222,0.256,0.267,0.041,0.089,0.048,0.066,0.032,0.089,0.064,0.048,0.39,0.079,0.06,0.023,0.116,0.076,-0.032,-0.003,-0.042,0.105,0.309,0.37,0.377,0.125,0.023,0.018,0.009,0.242,0.249,0.249,0.302,0.306,0.305,-0.025,-0.054,-0.012,-0.035,0.37,0.297,0.486,0.434,0.456,0.462,0.437,0.518,0.525,0.529,0.462,0.53,0.53,0.511,0.545,0.466,5130,Stability,POLG_PESV,Medium,Virus
+PPARG_HUMAN_Majithia_2016,0.299,0.373,0.407,0.41,0.396,0.397,0.132,0.263,0.363,0.383,0.397,0.405,0.409,0.04,0.12,0.347,0.414,0.469,0.461,0.434,0.406,0.389,0.214,0.256,0.39,0.407,0.399,0.414,0.283,0.45,0.424,0.411,0.278,0.43,0.38,0.342,0.431,0.414,0.397,0.445,0.429,0.423,0.195,0.042,0.359,0.32,0.368,0.355,0.402,0.2,0.404,0.412,0.419,0.408,0.408,0.417,0.416,0.414,0.42,0.42,0.424,0.352,9576,Activity,PPARG_HUMAN,Medium,Human
+PPM1D_HUMAN_Miller_2022,0.438,0.462,0.454,0.465,0.508,0.512,0.005,0.298,0.34,0.4,0.462,0.487,0.495,0.184,0.258,0.327,0.494,0.511,0.509,0.286,0.327,0.434,0.412,0.323,0.421,0.448,0.448,0.439,0.307,0.492,0.488,0.461,0.216,0.312,0.427,0.414,0.478,0.493,0.504,0.528,0.509,0.512,0.229,-0.034,0.465,0.397,0.389,0.464,0.437,0.197,0.467,0.463,0.46,0.469,0.483,0.46,0.477,0.476,0.479,0.482,0.52,0.392,7889,OrganismalFitness,PPM1D_HUMAN,Low,Human
+PR40A_HUMAN_Tsuboyama_2023_1UZC,0.58,0.629,0.69,0.672,0.724,0.724,0.413,0.413,0.743,0.755,0.696,0.58,0.623,0.444,0.428,0.747,0.769,0.735,0.712,0.659,0.497,0.6,0.613,0.611,0.539,0.604,0.672,0.619,0.653,0.763,0.789,0.794,0.56,0.539,0.436,0.486,0.67,0.676,0.667,0.69,0.716,0.698,0.367,0.181,0.19,0.358,0.472,0.226,0.633,0.649,0.789,0.765,0.775,0.781,0.787,0.79,0.79,0.789,0.806,0.802,0.731,0.712,2033,Stability,PR40A_HUMAN,Medium,Human
+PRKN_HUMAN_Clausen_2023,0.505,0.501,0.505,0.49,0.509,0.512,0.104,0.367,0.425,0.443,0.458,0.507,0.527,0.137,0.183,0.25,0.389,0.544,0.553,0.357,0.207,0.44,0.478,0.458,0.359,0.502,0.493,0.477,0.424,0.523,0.482,0.462,0.217,0.17,0.43,0.45,0.49,0.522,0.52,0.517,0.526,0.527,0.206,0.048,0.472,0.246,0.511,0.482,0.552,0.224,0.416,0.427,0.436,0.452,0.44,0.459,0.453,0.447,0.449,0.459,0.556,0.371,8756,Expression,PRKN_HUMAN,Low,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE,0.518,0.542,0.508,0.511,0.526,0.521,0.335,0.407,0.428,0.436,0.602,0.532,0.529,0.428,0.516,0.609,0.627,0.563,0.49,0.397,0.311,0.241,0.433,0.418,0.407,0.41,-0.129,0.449,0.459,0.471,0.49,0.459,0.161,0.332,0.394,0.399,0.511,0.498,0.459,0.558,0.568,0.495,0.423,0.286,0.586,0.441,0.532,0.576,0.638,0.539,0.617,0.583,0.602,0.599,0.591,0.625,0.591,0.599,0.612,0.607,0.622,0.625,1579,Stability,PSAE_PICP2,Medium,Prokaryote
+PTEN_HUMAN_Matreyek_2021,0.298,0.304,0.314,0.304,0.315,0.32,0.105,0.263,0.327,0.368,0.335,0.31,0.355,0.117,0.2,0.38,0.359,0.195,0.21,0.348,0.128,0.339,0.308,0.298,0.216,0.264,0.241,0.28,0.224,0.342,0.318,0.323,0.114,0.195,0.328,0.272,0.271,0.346,0.314,0.324,0.354,0.339,0.179,0.033,0.344,0.255,0.369,0.348,0.375,0.157,0.35,0.346,0.357,0.354,0.344,0.362,0.358,0.365,0.351,0.365,0.402,0.366,5083,Expression,PTEN_HUMAN,Medium,Human
+PTEN_HUMAN_Mighell_2018,0.399,0.42,0.396,0.396,0.417,0.418,0.112,0.3,0.418,0.423,0.395,0.393,0.412,0.144,0.301,0.418,0.43,0.239,0.225,0.417,0.257,0.356,0.276,0.257,0.343,0.234,0.245,0.274,0.212,0.417,0.391,0.39,0.058,0.3,0.346,0.257,0.411,0.401,0.368,0.417,0.431,0.431,0.28,-0.032,0.415,0.379,0.334,0.338,0.398,0.153,0.405,0.395,0.405,0.412,0.419,0.414,0.412,0.407,0.42,0.425,0.428,0.388,7260,Activity,PTEN_HUMAN,Medium,Human
+Q2N0S5_9HIV1_Haddox_2018,0.361,0.321,0.279,0.304,0.378,0.376,0.024,0.327,0.396,0.401,0.35,0.391,0.414,0.034,0.03,0.024,0.054,0.078,0.104,0.303,0.394,0.301,0.289,0.244,0.392,0.287,0.284,0.327,0.26,0.379,0.378,0.369,0.222,0.371,0.318,0.295,0.39,0.384,0.379,0.397,0.394,0.387,0.34,0.02,0.385,0.345,0.292,0.348,0.199,0.106,0.166,0.207,0.23,0.226,0.188,0.193,0.194,0.185,0.167,0.218,0.151,0.079,12729,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.24,0.215,0.228,0.224,0.236,0.231,0.083,0.15,0.194,0.201,0.134,0.172,0.19,0.161,0.135,0.24,0.206,0.177,0.237,0.208,0.176,0.188,0.172,0.134,0.191,0.191,0.178,0.209,0.176,0.213,0.257,0.246,0.051,0.163,0.179,0.104,0.214,0.23,0.205,0.24,0.255,0.242,0.309,0.165,0.184,0.206,0.22,0.243,0.24,0.135,0.193,0.194,0.22,0.225,0.207,0.199,0.193,0.205,0.203,0.206,0.188,0.322,3344,Binding,Q53Z42_HUMAN,Medium,Human
+Q53Z42_HUMAN_McShan_2019_expression,0.411,0.375,0.416,0.423,0.425,0.425,-0.034,0.319,0.408,0.423,0.319,0.365,0.387,0.059,0.069,0.382,0.435,0.408,0.44,0.428,0.333,0.371,0.381,0.35,0.368,0.403,0.389,0.423,0.416,0.431,0.418,0.375,0.108,0.326,0.355,0.343,0.419,0.42,0.435,0.452,0.456,0.44,0.247,0.045,0.413,0.357,0.356,0.407,0.361,0.144,0.438,0.438,0.444,0.443,0.453,0.438,0.429,0.442,0.456,0.45,0.4,0.472,3344,Expression,Q53Z42_HUMAN,Medium,Human
+Q59976_STRSQ_Romero_2015,0.418,0.49,0.508,0.518,0.516,0.523,0.235,0.408,0.546,0.55,0.471,0.415,0.415,0.085,0.325,0.38,0.431,0.462,0.453,0.507,0.459,0.505,0.519,0.523,0.481,0.525,0.528,0.537,0.561,0.528,0.514,0.452,0.265,0.477,0.514,0.501,0.484,0.518,0.528,0.526,0.536,0.537,0.372,-0.004,0.456,0.398,0.317,0.411,0.407,0.093,0.456,0.422,0.409,0.429,0.435,0.426,0.433,0.419,0.438,0.433,0.488,0.415,2999,Activity,Q59976_STRSQ,Medium,Prokaryote
+Q6WV13_9MAXI_Somermeyer_2022,0.175,0.194,0.137,0.14,0.18,0.18,-0.009,0.096,0.2,0.196,0.14,0.027,0.018,0.008,0.024,0.006,0.007,0.013,-0.011,0.161,0.026,0.024,0.002,0.071,0.021,-0.0,0.02,0.0,-0.001,0.233,0.167,0.171,0.026,0.015,0.025,0.02,0.13,0.132,0.137,0.173,0.173,0.172,0.006,0.005,-0.002,0.012,0.138,0.15,0.215,0.129,0.153,0.164,0.151,0.175,0.168,0.164,0.16,0.167,0.163,0.164,0.068,0.047,31401,Activity,Q6WV12_9MAXI,Low,Eukaryote
+Q837P4_ENTFA_Meier_2023,0.257,0.303,0.28,0.252,0.28,0.326,0.315,0.24,0.269,0.297,0.355,0.32,0.338,0.274,0.309,0.309,0.286,0.355,0.315,-0.024,0.355,0.303,0.32,0.361,0.349,0.332,0.366,0.252,0.372,0.303,0.343,0.333,0.079,0.286,0.378,0.338,0.274,0.378,0.384,0.332,0.32,0.32,0.257,0.229,0.372,0.297,0.229,0.326,0.246,0.068,0.315,0.274,0.309,0.326,0.303,0.338,0.338,0.309,0.326,0.338,0.349,0.332,697,Activity,Q837P4_ENTFA,Medium,Prokaryote
+Q837P5_ENTFA_Meier_2023,0.082,0.264,0.291,0.291,0.232,0.232,0.055,0.125,0.173,0.2,0.168,0.2,0.2,0.076,0.125,0.2,0.216,0.291,0.259,0.205,0.248,0.27,0.275,0.253,0.216,0.253,0.264,0.345,0.302,0.2,0.248,0.227,0.06,0.248,0.302,0.334,0.184,0.248,0.312,0.259,0.27,0.28,0.146,0.135,0.227,0.119,0.253,0.221,0.275,0.098,0.189,0.205,0.237,0.216,0.237,0.237,0.189,0.232,0.243,0.232,0.211,0.189,747,Activity,Q837P5_ENTFA,Medium,Prokaryote
+Q8WTC7_9CNID_Somermeyer_2022,0.162,0.21,0.127,0.125,0.189,0.185,-0.004,0.187,0.169,0.173,0.111,-0.038,-0.048,-0.042,-0.059,-0.054,-0.048,-0.049,-0.023,0.129,-0.036,-0.005,0.022,-0.019,-0.024,-0.039,-0.02,0.138,0.144,0.203,0.187,0.194,-0.029,-0.064,-0.015,0.18,0.123,0.13,0.18,0.173,0.182,0.202,-0.035,-0.035,-0.053,-0.054,0.007,0.054,0.161,0.087,0.126,0.119,0.126,0.133,0.12,0.129,0.13,0.12,0.116,0.127,0.038,-0.07,33510,Activity,Q8WTC7_9CNID,Low,Eukaryote
+R1AB_SARS2_Flynn_2022,0.477,0.427,0.185,0.221,0.517,0.514,-0.011,0.227,0.013,0.013,0.094,0.01,0.002,0.038,0.015,0.067,0.096,0.395,0.456,0.228,0.186,0.239,0.258,0.233,0.211,0.225,0.173,0.177,0.215,0.46,0.414,0.331,-0.026,0.152,0.19,0.174,0.308,0.339,0.33,0.463,0.484,0.48,0.006,-0.005,0.063,-0.003,0.337,0.31,0.367,0.162,0.197,0.156,0.216,0.216,0.201,0.201,0.204,0.2,0.173,0.205,0.17,0.097,5725,OrganismalFitness,R1AB_SARS2,Medium,Virus
+RAD_ANTMA_Tsuboyama_2023_2CJJ,0.204,0.112,0.226,0.213,0.23,0.226,0.362,0.283,0.542,0.485,0.415,0.38,0.397,0.327,0.52,0.56,0.353,0.353,0.432,0.279,0.432,0.476,0.441,0.362,0.459,0.305,0.472,0.345,0.318,0.424,0.279,0.191,0.248,0.388,0.525,0.345,0.331,0.459,0.371,0.349,0.388,0.309,0.472,0.195,0.2,0.323,0.38,0.261,0.586,0.38,0.38,0.406,0.415,0.415,0.419,0.41,0.384,0.432,0.393,0.428,0.283,0.463,912,Stability,RAD_ANTMA,High,Eukaryote
+RAF1_HUMAN_Zinkus-Boltz_2019,0.394,0.367,0.367,0.367,0.354,0.367,0.044,0.286,0.354,0.367,0.394,0.354,0.421,0.017,0.192,0.354,0.394,0.381,0.34,0.246,0.205,0.313,0.34,0.3,0.286,0.313,0.286,0.259,0.259,0.34,0.394,0.3,0.111,0.246,0.232,0.286,0.34,0.3,0.313,0.354,0.367,0.394,0.165,0.071,0.421,0.327,0.232,0.354,0.192,0.286,0.34,0.354,0.3,0.34,0.3,0.407,0.34,0.354,0.367,0.327,0.354,0.111,297,OrganismalFitness,RAF1_HUMAN,Low,Human
+RASH_HUMAN_Bandaru_2017,0.397,0.442,0.439,0.467,0.454,0.465,0.288,0.376,0.464,0.479,0.368,0.41,0.442,0.417,0.464,0.443,0.489,0.453,0.317,0.497,0.451,0.403,0.403,0.388,0.414,0.4,0.407,0.347,0.277,0.422,0.399,0.378,0.199,0.375,0.407,0.338,0.449,0.45,0.425,0.455,0.461,0.475,0.48,0.186,0.278,0.454,0.289,0.177,0.414,0.198,0.419,0.415,0.43,0.429,0.422,0.422,0.436,0.421,0.432,0.449,0.349,0.444,3134,Activity,RASH_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_abundance,0.232,0.2,0.247,0.233,0.226,0.236,0.173,0.179,0.177,0.203,0.219,0.167,0.182,0.213,0.256,0.233,0.188,0.158,0.161,0.232,0.211,0.253,0.272,0.289,0.213,0.252,0.311,0.251,0.358,0.259,0.148,0.111,0.155,0.146,0.223,0.307,0.199,0.258,0.314,0.232,0.254,0.268,0.172,0.211,0.045,0.181,0.234,0.245,0.257,0.23,0.182,0.165,0.205,0.197,0.222,0.199,0.206,0.197,0.177,0.201,0.245,0.231,26012,Expression,RASK_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.39,0.388,0.484,0.489,0.468,0.472,0.253,0.315,0.483,0.493,0.469,0.414,0.46,0.439,0.482,0.504,0.499,0.417,0.332,0.541,0.421,0.424,0.446,0.377,0.394,0.439,0.404,0.3,0.276,0.493,0.408,0.359,0.24,0.382,0.443,0.391,0.42,0.469,0.44,0.467,0.482,0.475,0.438,0.219,0.337,0.475,0.18,0.208,0.466,0.212,0.412,0.411,0.421,0.421,0.446,0.444,0.41,0.423,0.365,0.431,0.476,0.476,24873,Binding,RASK_HUMAN,High,Human
+RBP1_HUMAN_Tsuboyama_2023_2KWH,0.165,0.072,0.264,0.264,0.267,0.267,0.231,0.243,0.21,0.21,0.363,0.318,0.318,0.255,0.321,0.378,0.426,0.3,0.273,0.18,0.282,0.06,0.228,0.213,0.207,0.111,0.117,0.279,0.192,0.339,0.318,0.288,0.264,0.204,0.309,0.312,0.255,0.27,0.279,0.264,0.264,0.264,0.255,0.186,0.327,0.333,0.333,0.378,0.462,0.345,0.378,0.378,0.411,0.384,0.393,0.399,0.387,0.393,0.381,0.39,0.432,0.417,1332,Stability,RBP1_HUMAN,High,Human
+RCD1_ARATH_Tsuboyama_2023_5OAO,0.277,0.274,0.331,0.34,0.331,0.331,0.213,0.217,0.328,0.34,0.372,0.197,0.236,0.258,0.226,0.461,0.464,0.451,0.429,0.315,0.178,0.182,0.242,0.305,0.239,0.41,0.416,0.359,0.423,0.385,0.467,0.391,0.042,0.185,0.239,0.232,0.299,0.305,0.283,0.362,0.369,0.337,0.229,0.194,0.356,0.229,0.388,0.391,0.464,0.439,0.461,0.461,0.47,0.505,0.467,0.496,0.47,0.48,0.499,0.483,0.464,0.381,1261,Stability,RCD1_ARATH,Medium,Eukaryote
+RCRO_LAMBD_Tsuboyama_2023_1ORC,0.286,0.414,0.49,0.471,0.45,0.474,0.074,0.23,0.406,0.414,0.4,0.256,0.334,0.146,0.228,0.205,0.42,0.432,0.432,0.488,0.018,0.065,0.095,0.091,-0.079,0.049,-0.06,-0.002,0.479,0.46,0.427,0.416,0.018,0.054,0.042,0.425,0.36,0.332,0.465,0.45,0.457,0.511,-0.109,0.053,0.234,0.042,0.367,0.442,0.543,0.432,0.427,0.411,0.414,0.404,0.406,0.409,0.411,0.409,0.418,0.418,0.453,0.32,2278,Stability,RCRO_LAMBD,High,Virus
+RD23A_HUMAN_Tsuboyama_2023_1IFY,0.235,0.168,0.266,0.282,0.305,0.309,0.121,0.344,0.419,0.419,0.384,0.45,0.462,0.156,0.521,0.458,0.423,0.399,0.286,0.344,0.301,0.321,0.313,0.356,0.403,0.372,0.372,0.38,0.388,0.364,0.356,0.339,0.305,0.231,0.38,0.38,0.29,0.321,0.368,0.305,0.317,0.329,0.384,0.023,0.415,0.466,0.372,0.392,0.447,0.376,0.36,0.341,0.36,0.344,0.38,0.384,0.368,0.368,0.38,0.38,0.368,0.423,1019,Stability,RD23A_HUMAN,High,Human
+RDRP_I33A0_Li_2023,0.221,0.252,0.281,0.286,0.362,0.367,0.042,0.289,0.381,0.385,0.146,0.072,0.079,0.064,0.058,0.111,0.278,0.292,0.368,0.33,0.286,0.331,0.341,0.355,0.125,0.267,0.263,0.259,0.307,0.357,0.336,0.292,0.086,0.278,0.321,0.347,0.317,0.349,0.358,0.368,0.38,0.392,0.052,0.035,0.15,0.054,0.17,0.191,0.181,0.041,0.233,0.215,0.227,0.229,0.24,0.242,0.229,0.247,0.253,0.248,0.13,0.096,12003,OrganismalFitness,RDRP_I33A0,Low,Virus
+REV_HV1H2_Fernandes_2016,0.132,0.128,0.141,0.152,0.156,0.154,0.033,0.219,0.132,0.149,0.104,0.178,0.195,0.029,0.018,0.109,0.152,0.206,0.184,0.115,0.141,0.18,0.171,0.141,0.206,0.19,0.106,0.175,0.19,0.191,0.245,0.266,0.024,0.143,0.182,0.173,0.178,0.182,0.162,0.182,0.19,0.167,0.031,0.037,0.145,0.044,0.177,0.16,0.201,0.139,0.141,0.143,0.203,0.188,0.177,0.204,0.206,0.134,0.219,0.193,0.208,0.136,2147,OrganismalFitness,REV_HV1H2,Medium,Virus
+RFAH_ECOLI_Tsuboyama_2023_2LCL,-0.026,0.176,0.24,0.264,0.222,0.231,-0.014,0.192,0.195,0.201,0.207,0.125,0.119,-0.053,0.023,0.005,0.24,0.261,0.204,0.222,-0.068,0.032,0.137,0.101,-0.053,0.116,0.152,0.125,0.149,0.164,0.27,0.247,0.014,0.002,0.113,0.152,0.053,0.14,0.131,0.192,0.179,0.195,0.053,-0.104,0.053,-0.014,0.173,0.137,0.252,0.234,0.279,0.261,0.252,0.267,0.234,0.255,0.255,0.276,0.255,0.264,0.198,0.104,1326,Stability,RFAH_ECOLI,High,Prokaryote
+RL20_AQUAE_Tsuboyama_2023_1GYZ,0.353,0.499,0.536,0.536,0.514,0.525,0.19,0.536,0.308,0.255,0.567,0.564,0.539,0.128,0.308,0.351,0.601,0.581,0.612,0.556,0.157,0.488,0.446,0.44,0.415,0.438,0.438,0.471,0.547,0.525,0.502,0.487,0.044,0.415,0.429,0.415,0.466,0.485,0.466,0.514,0.522,0.514,-0.043,-0.113,0.381,0.207,0.589,0.544,0.665,0.595,0.637,0.623,0.615,0.657,0.643,0.626,0.646,0.632,0.626,0.646,0.663,0.544,1461,Stability,RL20_AQUAE,High,Prokaryote
+RL40A_YEAST_Mavor_2016,0.167,0.253,0.257,0.302,0.264,0.298,0.095,0.298,0.333,0.322,0.171,0.202,0.229,0.023,0.343,0.377,0.405,0.308,0.425,0.333,0.253,0.415,0.384,0.322,0.367,0.312,0.329,0.322,0.288,0.222,0.291,0.31,0.016,0.277,0.339,0.295,0.312,0.333,0.295,0.346,0.343,0.291,0.216,-0.001,0.164,0.209,0.04,0.116,0.136,-0.005,0.374,0.357,0.388,0.377,0.374,0.381,0.353,0.384,0.384,0.388,0.247,0.26,1253,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2013,0.183,0.284,0.294,0.339,0.301,0.319,0.054,0.256,0.364,0.374,0.159,0.221,0.277,0.037,0.332,0.367,0.451,0.336,0.461,0.36,0.28,0.458,0.395,0.357,0.395,0.346,0.385,0.353,0.326,0.277,0.36,0.367,0.03,0.312,0.402,0.312,0.343,0.388,0.305,0.371,0.409,0.332,0.26,0.013,0.187,0.221,0.037,0.093,0.141,0.048,0.433,0.405,0.426,0.426,0.43,0.43,0.405,0.458,0.437,0.433,0.27,0.27,1195,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2014,0.246,0.216,0.286,0.332,0.279,0.303,0.116,0.332,0.299,0.309,0.113,0.199,0.209,0.126,0.396,0.386,0.426,0.313,0.399,0.289,0.253,0.409,0.356,0.329,0.396,0.313,0.319,0.259,0.259,0.283,0.273,0.263,0.123,0.316,0.359,0.289,0.329,0.376,0.329,0.336,0.366,0.316,0.186,0.08,0.163,0.223,0.146,0.176,0.213,0.113,0.432,0.372,0.409,0.442,0.416,0.426,0.389,0.406,0.416,0.436,0.233,0.309,1380,Activity,RL40A_YEAST,Medium,Eukaryote
+RNC_ECOLI_Weeks_2023,0.425,0.49,0.484,0.5,0.502,0.494,0.032,0.319,0.478,0.488,0.478,0.478,0.489,0.046,0.419,0.452,0.486,0.492,0.499,0.483,0.425,0.461,0.406,0.376,0.45,0.472,0.462,0.473,0.459,0.496,0.48,0.457,0.138,0.426,0.421,0.331,0.459,0.471,0.435,0.506,0.517,0.502,0.397,0.049,0.467,0.447,0.227,0.425,0.2,0.138,0.462,0.453,0.419,0.453,0.453,0.454,0.46,0.46,0.462,0.464,0.492,0.408,4277,Activity,RNC_ECOLI,Medium,Prokaryote
+RPC1_BP434_Tsuboyama_2023_1R69,0.557,0.576,0.549,0.571,0.491,0.533,0.538,0.524,0.557,0.576,0.623,0.667,0.67,0.629,0.661,0.678,0.623,0.598,0.505,0.596,0.609,0.62,0.637,0.579,0.629,0.626,0.626,0.626,0.587,0.664,0.615,0.555,0.502,0.516,0.623,0.631,0.596,0.642,0.642,0.596,0.585,0.593,0.626,0.59,0.664,0.642,0.615,0.557,0.711,0.607,0.607,0.579,0.601,0.612,0.62,0.604,0.612,0.607,0.626,0.612,0.694,0.683,1459,Stability,RPC1_BP434,High,Virus
+RPC1_LAMBD_Li_2019_high-expression,0.287,0.424,0.473,0.498,0.461,0.436,0.188,0.387,0.411,0.461,0.473,0.498,0.536,0.275,0.275,0.399,0.536,0.523,0.548,0.411,0.225,0.337,0.449,0.424,0.213,0.337,0.362,0.3,0.523,0.387,0.622,0.523,0.089,0.138,0.362,0.511,0.275,0.362,0.486,0.424,0.449,0.523,0.287,0.287,0.424,0.312,0.188,0.349,0.3,0.138,0.573,0.498,0.511,0.56,0.548,0.573,0.523,0.511,0.536,0.56,0.486,0.349,351,Activity,RPC1_LAMBD,High,Virus
+RPC1_LAMBD_Li_2019_low-expression,0.208,0.299,0.379,0.379,0.368,0.311,0.083,0.345,0.299,0.322,0.356,0.345,0.356,0.14,0.14,0.242,0.402,0.47,0.481,0.299,0.128,0.311,0.322,0.333,0.105,0.242,0.231,0.197,0.425,0.265,0.481,0.413,0.037,0.06,0.254,0.345,0.162,0.231,0.322,0.311,0.333,0.368,0.197,0.162,0.299,0.162,0.174,0.276,0.265,0.254,0.436,0.39,0.436,0.436,0.436,0.47,0.425,0.413,0.436,0.459,0.402,0.208,351,Activity,RPC1_LAMBD,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32,0.295,0.289,0.279,0.282,0.272,0.272,0.162,0.168,0.322,0.322,0.356,0.332,0.305,0.225,0.245,0.339,0.326,0.329,0.232,0.242,0.108,0.185,0.252,0.212,0.235,0.252,0.245,0.262,0.245,0.315,0.339,0.324,0.085,0.279,0.222,0.242,0.299,0.259,0.265,0.265,0.255,0.255,0.262,0.175,0.315,0.259,0.423,0.322,0.469,0.389,0.322,0.322,0.312,0.312,0.302,0.326,0.329,0.336,0.312,0.329,0.322,0.463,1195,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance,0.384,0.435,0.447,0.449,0.458,0.467,0.279,0.411,0.496,0.499,0.483,0.492,0.532,0.37,0.418,0.436,0.499,0.482,0.455,0.026,0.382,0.478,0.49,0.485,0.398,0.502,0.512,0.482,0.458,0.447,0.453,0.329,0.273,0.411,0.468,0.479,0.43,0.498,0.509,0.487,0.499,0.504,0.39,0.193,0.466,0.439,0.331,0.431,0.433,0.127,0.463,0.456,0.454,0.466,0.482,0.473,0.478,0.479,0.471,0.489,0.502,0.456,9803,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity,0.344,0.418,0.443,0.452,0.446,0.459,0.26,0.42,0.465,0.464,0.451,0.461,0.492,0.338,0.374,0.364,0.46,0.445,0.426,0.027,0.368,0.464,0.466,0.456,0.379,0.472,0.481,0.465,0.437,0.447,0.474,0.385,0.261,0.385,0.468,0.459,0.4,0.47,0.473,0.463,0.478,0.478,0.355,0.154,0.453,0.382,0.295,0.413,0.38,0.115,0.427,0.411,0.408,0.426,0.446,0.424,0.439,0.447,0.44,0.446,0.418,0.389,10094,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB,0.082,0.074,0.297,0.248,0.285,0.281,0.297,0.438,0.268,0.285,0.418,0.409,0.422,0.136,0.339,0.463,0.405,0.463,0.368,0.38,0.443,0.372,0.355,0.376,0.422,0.451,0.389,0.401,0.405,0.467,0.447,0.418,0.281,0.405,0.434,0.389,0.281,0.397,0.351,0.26,0.285,0.302,0.463,-0.072,0.463,0.451,0.19,0.206,0.289,0.397,0.438,0.281,0.181,0.285,0.264,0.322,0.339,0.434,0.372,0.347,0.455,0.48,965,Stability,SAV1_MOUSE,High,Eukaryote
+SBI_STAAM_Tsuboyama_2023_2JVG,0.218,0.179,0.311,0.272,0.315,0.315,0.182,0.128,0.378,0.381,0.253,0.233,0.241,0.186,0.206,0.311,0.463,0.502,0.233,0.276,0.175,0.175,0.214,0.233,0.194,0.206,0.167,0.218,0.311,0.42,0.409,0.358,0.194,0.14,0.155,0.19,0.186,0.194,0.19,0.276,0.292,0.3,0.221,0.182,0.264,0.233,0.526,0.475,0.538,0.479,0.389,0.417,0.424,0.428,0.432,0.436,0.448,0.44,0.405,0.436,0.538,0.444,1025,Stability,SBI_STAAM,Medium,Prokaryote
+SC6A4_HUMAN_Young_2021,0.359,0.4,0.352,0.372,0.438,0.452,0.273,0.405,0.483,0.49,0.454,0.467,0.467,0.135,0.255,0.44,0.477,0.478,0.449,0.47,0.43,0.418,0.399,0.394,0.435,0.419,0.43,0.428,0.421,0.456,0.384,0.317,0.283,0.436,0.416,0.386,0.459,0.455,0.432,0.475,0.475,0.461,0.419,0.1,0.461,0.453,0.383,0.43,0.406,0.132,0.455,0.454,0.468,0.47,0.478,0.469,0.475,0.477,0.466,0.482,0.491,0.452,11576,Activity,SC6A4_HUMAN,Medium,Human
+SCIN_STAAR_Tsuboyama_2023_2QFF,0.051,0.084,0.167,0.18,0.236,0.213,0.097,0.012,0.183,0.2,0.18,0.173,0.17,0.127,0.186,0.167,0.19,0.262,0.262,0.17,0.054,0.087,0.084,0.091,0.107,0.111,0.101,0.13,0.134,0.18,0.177,0.109,-0.025,0.035,0.051,0.107,0.114,0.104,0.12,0.19,0.173,0.18,0.13,0.111,0.2,0.163,0.375,0.391,0.381,0.325,0.256,0.276,0.269,0.292,0.259,0.295,0.272,0.249,0.206,0.276,0.447,0.355,1212,Stability,SCIN_STAAR,High,Prokaryote
+SCN5A_HUMAN_Glazer_2019,0.036,0.054,0.089,0.089,0.125,0.143,0.071,0.0,0.071,0.125,0.089,0.143,0.071,0.196,0.143,0.071,0.089,0.054,0.071,0.054,0.018,0.054,0.071,0.089,-0.018,0.018,0.018,0.036,0.107,0.027,0.161,0.143,0.036,0.0,0.0,0.089,-0.018,-0.018,-0.018,0.054,0.036,0.036,0.018,0.125,0.107,0.125,0.036,0.054,0.018,-0.018,0.089,0.054,0.071,0.036,0.054,0.089,0.143,0.107,0.036,0.089,0.125,0.018,224,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+SDA_BACSU_Tsuboyama_2023_1PV0,0.477,0.527,0.53,0.53,0.532,0.532,0.118,0.443,0.524,0.525,0.529,0.512,0.522,0.284,0.282,0.518,0.508,0.5,0.505,0.529,0.156,0.19,0.192,0.318,0.163,0.323,0.084,0.323,0.385,0.532,0.527,0.527,-0.24,0.103,0.287,0.342,0.488,0.496,0.46,0.512,0.512,0.508,0.096,-0.009,0.342,0.068,0.376,0.349,0.529,0.522,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.518,0.525,2770,Stability,SDA_BACSU,Medium,Prokaryote
+SERC_HUMAN_Xie_2023,0.273,0.354,0.394,0.396,0.392,0.394,0.018,0.333,0.417,0.421,0.371,0.379,0.369,0.116,0.281,0.388,0.396,0.436,0.398,0.379,0.346,0.371,0.381,0.369,0.352,0.386,0.369,0.396,0.371,0.354,0.365,0.315,0.152,0.361,0.373,0.379,0.365,0.375,0.379,0.4,0.413,0.415,0.202,0.047,0.39,0.335,0.277,0.375,0.292,0.112,0.419,0.396,0.411,0.427,0.417,0.421,0.398,0.4,0.404,0.423,0.396,0.35,1914,OrganismalFitness,SERC_HUMAN,High,Human
+SHOC2_HUMAN_Kwon_2022,0.133,0.265,0.294,0.29,0.285,0.296,0.129,0.26,0.314,0.319,0.298,0.278,0.304,0.144,0.152,0.166,0.323,0.295,0.213,0.294,0.161,0.282,0.289,0.262,0.166,0.296,0.3,0.304,0.27,0.308,0.303,0.254,0.099,0.15,0.234,0.277,0.152,0.201,0.258,0.26,0.275,0.303,0.156,0.149,0.317,0.191,0.216,0.277,0.202,0.064,0.295,0.296,0.303,0.308,0.306,0.306,0.29,0.307,0.296,0.307,0.217,0.187,10972,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+SOX30_HUMAN_Tsuboyama_2023_7JJK,0.17,0.21,0.25,0.253,0.242,0.238,0.289,0.158,0.166,0.202,0.174,0.234,0.218,0.21,0.25,0.226,0.222,0.17,0.186,0.158,0.103,0.115,0.111,0.131,0.19,0.182,0.103,0.194,0.103,0.17,0.04,0.026,0.158,0.079,0.162,0.166,0.147,0.158,0.154,0.234,0.23,0.214,0.186,0.127,0.147,0.246,0.38,0.253,0.412,0.356,0.293,0.277,0.242,0.277,0.273,0.281,0.257,0.269,0.246,0.285,0.25,0.364,1010,Stability,SOX30_HUMAN,High,Human
+SPA_STAAU_Tsuboyama_2023_1LP1,0.373,0.44,0.464,0.445,0.493,0.498,-0.114,0.46,0.407,0.409,0.367,-0.068,-0.028,-0.055,-0.058,-0.076,-0.057,-0.068,-0.022,0.491,-0.155,-0.022,0.111,0.054,-0.085,-0.055,0.042,0.23,0.312,0.436,0.424,0.412,-0.02,-0.009,-0.024,0.008,0.335,0.327,0.318,0.489,0.487,0.481,-0.159,-0.117,-0.123,-0.074,0.362,0.306,0.563,0.434,0.472,0.455,0.445,0.47,0.451,0.455,0.472,0.46,0.46,0.47,0.409,0.286,2105,Stability,SPA_STAAU,Medium,Prokaryote
+SPG1_STRSG_Olson_2014,0.169,0.186,-0.005,0.012,0.158,0.16,-0.006,0.044,-0.068,0.072,0.228,0.174,0.142,0.2,0.165,0.205,0.237,0.2,0.254,0.202,0.18,0.152,0.154,0.149,0.167,0.162,0.15,0.167,0.248,0.17,0.328,0.316,0.047,0.172,0.132,0.198,0.185,0.149,0.203,0.173,0.136,0.205,-0.029,-0.08,0.157,0.051,0.24,0.19,0.276,0.086,0.279,0.253,0.272,0.302,0.303,0.28,0.314,0.334,0.316,0.309,0.287,0.273,536962,Binding,SPG1_STRSG,Low,Prokaryote
+SPG1_STRSG_Wu_2016,0.01,0.079,0.087,0.091,0.084,0.093,0.076,0.028,0.162,0.158,0.131,0.117,0.108,0.096,0.125,0.129,0.138,0.163,0.182,0.098,0.071,0.056,0.048,0.069,0.044,0.096,0.079,0.063,0.063,0.095,0.125,0.093,-0.004,0.08,0.04,0.061,0.09,0.078,0.086,0.103,0.094,0.092,0.05,0.005,0.078,0.045,0.058,0.054,0.145,0.089,0.141,0.124,0.118,0.149,0.125,0.119,0.143,0.136,0.149,0.136,0.148,0.115,149360,Binding,SPG1_STRSG,Medium,Prokaryote
+SPG2_STRSG_Tsuboyama_2023_5UBS,0.442,0.458,0.461,0.483,0.555,0.519,0.312,0.301,0.489,0.414,0.417,0.384,0.354,0.326,0.376,0.403,0.436,0.425,0.489,0.425,0.343,0.376,0.378,0.354,0.298,0.381,0.301,0.403,0.45,0.571,0.525,0.507,-0.126,0.332,0.409,0.389,0.304,0.309,0.37,0.486,0.513,0.5,0.235,0.13,0.235,0.216,0.395,0.315,0.662,0.494,0.53,0.505,0.527,0.513,0.494,0.522,0.527,0.569,0.516,0.53,0.519,0.519,1451,Stability,SPG2_STRSG,Medium,Prokaryote
+SPIKE_SARS2_Starr_2020_binding,0.111,0.135,0.051,0.116,0.153,0.166,-0.059,0.259,0.27,0.257,-0.011,-0.075,-0.045,-0.048,-0.048,-0.033,-0.024,-0.023,0.016,0.255,0.223,0.264,0.256,0.264,0.284,0.236,0.209,0.272,0.227,0.163,0.185,0.238,0.161,0.227,0.215,0.262,0.209,0.21,0.214,0.249,0.242,0.238,-0.053,-0.016,-0.024,-0.018,0.413,0.379,0.32,0.114,0.137,0.152,0.137,0.181,0.153,0.183,0.156,0.187,0.158,0.174,0.216,0.095,3802,Binding,SPIKE_SARS2,Medium,Virus
+SPIKE_SARS2_Starr_2020_expression,0.161,0.223,0.122,0.215,0.375,0.352,-0.034,0.266,0.321,0.344,0.038,-0.032,-0.007,-0.022,-0.024,-0.006,0.001,0.019,0.055,0.271,0.24,0.288,0.293,0.308,0.289,0.277,0.236,0.302,0.249,0.267,0.207,0.269,0.144,0.25,0.241,0.274,0.259,0.27,0.273,0.339,0.352,0.367,-0.01,0.013,0.013,0.021,0.434,0.419,0.377,0.14,0.225,0.239,0.23,0.285,0.247,0.298,0.246,0.27,0.247,0.273,0.338,0.18,3798,Expression,SPIKE_SARS2,Medium,Virus
+SPTN1_CHICK_Tsuboyama_2023_1TUD,0.499,0.446,0.433,0.402,0.426,0.412,0.14,0.378,0.4,0.422,0.402,0.485,0.445,-0.069,0.443,0.437,0.442,0.544,0.489,0.392,0.425,0.417,0.435,0.4,0.458,0.412,0.46,0.396,0.382,0.385,0.405,0.431,0.33,0.373,0.418,0.391,0.463,0.465,0.471,0.406,0.42,0.415,0.367,-0.148,0.335,0.352,0.303,0.248,0.472,0.44,0.433,0.432,0.463,0.455,0.451,0.448,0.433,0.423,0.435,0.433,0.5,0.415,3201,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,0.324,0.409,0.42,0.437,0.426,0.437,0.064,0.233,0.369,0.414,0.318,0.233,0.324,0.143,0.318,0.386,0.437,0.494,0.369,0.431,0.143,0.296,0.369,0.296,0.182,0.392,0.42,0.347,0.358,0.426,0.414,0.378,0.075,0.149,0.284,0.397,0.358,0.324,0.42,0.471,0.386,0.437,0.16,0.115,0.454,0.403,0.528,0.494,0.528,0.528,0.477,0.431,0.516,0.494,0.482,0.477,0.488,0.471,0.482,0.494,0.482,0.341,707,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88,0.506,0.529,0.536,0.539,0.567,0.554,-0.141,0.37,0.546,0.544,0.572,0.564,0.559,-0.169,0.539,0.589,0.579,0.584,0.569,0.506,-0.111,0.319,0.231,0.375,0.347,0.42,0.458,0.483,0.435,0.564,0.577,0.574,0.263,0.049,0.15,0.319,0.534,0.519,0.511,0.562,0.559,0.526,0.466,-0.245,0.572,0.476,0.059,0.461,0.587,0.572,0.572,0.567,0.574,0.582,0.587,0.595,0.567,0.567,0.584,0.577,0.655,0.612,1583,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,0.413,0.418,0.559,0.549,0.598,0.585,0.444,0.336,0.618,0.621,0.474,0.536,0.567,0.379,0.575,0.613,0.593,0.544,0.541,0.541,0.549,0.513,0.51,0.495,0.495,0.515,0.523,0.503,0.503,0.539,0.59,0.559,0.485,0.48,0.487,0.539,0.513,0.554,0.559,0.577,0.587,0.585,0.248,0.117,0.456,0.433,0.438,0.446,0.497,0.51,0.621,0.593,0.6,0.6,0.613,0.608,0.598,0.59,0.613,0.608,0.582,0.521,1556,Stability,SRBS1_HUMAN,High,Human
+SRC_HUMAN_Ahler_2019,0.46,0.453,0.418,0.428,0.453,0.457,0.466,0.386,0.494,0.51,0.442,0.515,0.533,0.357,0.418,0.413,0.495,0.45,0.446,0.45,0.377,0.359,0.373,0.33,0.394,0.428,0.391,0.358,0.272,0.501,0.519,0.493,0.379,0.352,0.359,0.295,0.443,0.453,0.432,0.459,0.459,0.457,0.496,0.311,0.442,0.486,0.276,0.247,0.355,0.094,0.456,0.438,0.445,0.465,0.479,0.471,0.47,0.476,0.472,0.487,0.47,0.412,3372,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.385,0.378,0.38,0.383,0.391,0.384,0.387,0.333,0.417,0.407,0.361,0.423,0.439,0.268,0.335,0.332,0.425,0.352,0.376,0.362,0.329,0.304,0.322,0.277,0.328,0.357,0.316,0.313,0.22,0.411,0.438,0.435,0.299,0.282,0.292,0.226,0.362,0.358,0.345,0.38,0.378,0.384,0.41,0.238,0.36,0.393,0.219,0.224,0.29,0.035,0.393,0.372,0.373,0.396,0.407,0.41,0.393,0.412,0.396,0.414,0.389,0.346,3637,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Nguyen_2022,0.337,0.341,0.356,0.351,0.351,0.351,0.345,0.319,0.284,0.345,0.343,0.389,0.411,0.238,0.296,0.298,0.382,0.338,0.351,0.34,0.283,0.284,0.293,0.257,0.291,0.332,0.288,0.279,0.209,0.369,0.402,0.391,0.295,0.257,0.265,0.207,0.32,0.321,0.316,0.35,0.338,0.34,0.387,0.213,0.345,0.39,0.193,0.201,0.256,0.04,0.366,0.346,0.343,0.366,0.377,0.372,0.364,0.378,0.366,0.381,0.365,0.308,3366,OrganismalFitness,SRC_HUMAN,Medium,Human
+SUMO1_HUMAN_Weile_2017,0.339,0.293,0.339,0.363,0.392,0.378,0.123,0.37,0.378,0.303,0.341,0.402,0.431,0.213,0.412,0.467,0.433,0.307,0.281,0.448,0.196,0.322,0.378,0.337,0.402,0.295,0.402,0.349,0.269,0.375,0.399,0.426,0.201,0.223,0.416,0.252,0.339,0.46,0.322,0.351,0.465,0.344,0.431,0.056,0.293,0.45,0.438,0.358,0.445,0.37,0.431,0.37,0.429,0.429,0.419,0.453,0.402,0.436,0.412,0.443,0.397,0.453,1700,OrganismalFitness,SUMO1_HUMAN,High,Human
+SYUA_HUMAN_Newberry_2020,0.152,0.192,0.198,0.234,0.196,0.211,0.202,0.267,0.242,0.248,0.338,0.332,0.324,0.206,0.231,0.229,0.236,0.278,0.309,0.332,0.248,0.326,0.255,0.238,0.194,0.273,0.286,0.259,0.238,0.307,0.311,0.252,0.129,0.257,0.359,0.284,0.189,0.278,0.229,0.225,0.276,0.231,0.211,0.173,0.313,0.24,-0.006,0.206,0.051,-0.041,0.19,0.19,0.189,0.198,0.229,0.196,0.227,0.215,0.229,0.221,0.074,0.059,2497,OrganismalFitness,SYUA_HUMAN,Medium,Human
+TADBP_HUMAN_Bolognesi_2019,0.09,0.04,0.077,0.08,0.074,0.07,0.154,0.023,0.06,0.054,0.01,0.03,0.023,0.054,0.02,0.023,-0.07,0.013,0.054,-0.047,0.077,0.037,-0.003,-0.01,0.104,0.07,0.043,-0.017,-0.03,0.03,0.107,0.142,-0.043,0.12,0.147,0.084,0.09,0.11,0.087,0.1,0.117,0.084,0.054,0.107,-0.007,0.03,0.197,0.023,0.12,0.047,0.09,-0.033,-0.013,0.003,0.007,0.027,0.033,0.003,-0.06,0.023,0.01,0.057,1196,OrganismalFitness,TADBP_HUMAN,Low,Human
+TAT_HV1BR_Fernandes_2016,0.265,0.183,0.238,0.243,0.259,0.243,-0.076,0.396,0.227,0.257,0.159,0.338,0.322,-0.032,0.006,0.009,0.028,-0.057,0.044,0.186,0.393,0.415,0.417,0.423,0.36,0.257,0.12,0.229,0.199,0.314,0.401,0.403,0.319,0.371,0.175,0.227,0.374,0.254,0.229,0.341,0.27,0.257,0.02,-0.027,0.284,0.156,0.202,0.281,0.3,0.178,0.082,0.126,0.139,0.099,0.107,0.107,0.074,0.099,0.09,0.093,0.161,0.145,1577,OrganismalFitness,TAT_HV1BR,High,Virus
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,0.59,0.537,0.606,0.61,0.655,0.655,0.682,0.312,0.648,0.697,0.552,0.728,0.747,0.67,0.69,0.762,0.731,0.69,0.724,0.514,0.423,0.571,0.556,0.602,0.583,0.579,0.636,0.644,0.575,0.693,0.636,0.587,0.541,0.446,0.602,0.686,0.64,0.705,0.697,0.636,0.655,0.648,0.701,0.103,0.53,0.667,0.541,0.461,0.739,0.712,0.705,0.663,0.644,0.739,0.712,0.735,0.709,0.724,0.697,0.72,0.758,0.754,1058,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG,0.25,0.26,0.437,0.411,0.407,0.42,-0.002,0.122,0.535,0.525,0.512,0.42,0.483,0.099,0.525,0.532,0.496,0.515,0.545,0.496,-0.097,0.266,0.312,0.296,0.266,0.286,0.185,0.368,0.443,0.499,0.551,0.548,0.414,-0.042,0.339,0.306,0.397,0.463,0.473,0.411,0.489,0.483,0.188,0.076,0.424,0.306,0.414,0.401,0.443,0.568,0.542,0.529,0.568,0.587,0.571,0.555,0.555,0.558,0.548,0.561,0.479,0.404,1279,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,0.346,0.335,0.443,0.438,0.424,0.421,-0.109,0.318,0.511,0.492,0.405,0.416,0.419,-0.014,0.394,0.456,0.473,0.432,0.47,0.405,0.227,0.316,0.34,0.321,0.351,0.381,0.289,0.332,0.346,0.486,0.454,0.435,0.167,0.067,0.278,0.302,0.397,0.394,0.397,0.462,0.421,0.427,0.153,-0.076,0.4,0.337,0.335,0.473,0.559,0.508,0.459,0.459,0.467,0.47,0.451,0.443,0.44,0.432,0.446,0.454,0.586,0.467,1479,Stability,TNKS2_HUMAN,High,Human
+TPK1_HUMAN_Weile_2017,0.19,0.194,0.185,0.198,0.203,0.209,0.059,0.193,0.236,0.221,0.23,0.215,0.257,0.121,0.162,0.216,0.266,0.258,0.306,0.191,0.096,0.107,0.18,0.222,0.119,0.217,0.22,0.209,0.231,0.191,0.199,0.161,0.105,0.117,0.171,0.234,0.19,0.197,0.242,0.208,0.209,0.239,0.131,0.104,0.269,0.152,0.148,0.209,0.184,0.076,0.2,0.207,0.211,0.217,0.226,0.226,0.215,0.229,0.233,0.231,0.217,0.185,3181,OrganismalFitness,TPK1_HUMAN,Medium,Human
+TPMT_HUMAN_Matreyek_2018,0.263,0.306,0.313,0.323,0.322,0.331,0.173,0.302,0.322,0.329,0.349,0.33,0.344,0.225,0.265,0.344,0.349,0.295,0.291,0.341,0.186,0.263,0.301,0.299,0.265,0.304,0.301,0.268,0.295,0.351,0.359,0.342,0.223,0.195,0.302,0.29,0.302,0.337,0.331,0.349,0.334,0.344,0.244,0.148,0.345,0.316,0.322,0.347,0.36,0.162,0.345,0.342,0.349,0.342,0.348,0.356,0.359,0.344,0.355,0.356,0.36,0.324,3648,Expression,TPMT_HUMAN,Medium,Human
+TPOR_HUMAN_Bridgford_2020,0.273,0.222,0.23,0.188,0.162,0.213,0.299,0.316,0.265,0.29,0.29,0.265,0.282,0.247,0.307,0.265,0.179,0.282,0.205,0.213,0.213,0.299,0.239,0.265,0.307,0.128,0.35,0.41,0.239,0.358,0.282,0.133,0.324,0.307,0.29,0.333,0.299,0.299,0.367,0.316,0.307,0.35,0.247,0.247,0.213,0.265,0.119,0.23,0.213,0.0,0.205,0.196,0.273,0.23,0.205,0.273,0.29,0.111,0.188,0.239,0.324,0.299,562,OrganismalFitness,TPOR_HUMAN,Low,Human
+TRPC_SACS2_Chan_2017,0.516,0.527,0.462,0.479,0.487,0.508,0.188,0.465,0.557,0.562,0.546,0.543,0.557,0.258,0.506,0.532,0.541,0.543,0.524,0.462,0.368,0.519,0.465,0.516,0.436,0.471,0.444,0.46,0.441,0.441,0.527,0.462,0.196,0.487,0.492,0.508,0.511,0.527,0.527,0.506,0.516,0.516,0.395,0.045,0.543,0.465,0.25,0.384,0.387,0.099,0.541,0.495,0.503,0.557,0.53,0.524,0.543,0.546,0.524,0.551,0.535,0.511,1519,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+TRPC_THEMA_Chan_2017,0.365,0.371,0.348,0.371,0.365,0.363,0.237,0.333,0.383,0.383,0.371,0.395,0.406,0.19,0.377,0.386,0.392,0.383,0.389,0.354,0.283,0.324,0.33,0.339,0.354,0.354,0.313,0.333,0.36,0.348,0.339,0.304,0.146,0.333,0.392,0.365,0.363,0.38,0.363,0.368,0.365,0.357,0.33,0.093,0.377,0.333,0.169,0.263,0.275,0.003,0.383,0.316,0.357,0.377,0.377,0.33,0.36,0.36,0.351,0.377,0.409,0.383,1519,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+UBC9_HUMAN_Weile_2017,0.304,0.407,0.457,0.474,0.438,0.457,-0.027,0.327,0.43,0.44,0.34,0.396,0.432,0.018,0.025,0.326,0.379,0.426,0.49,0.477,0.168,0.337,0.379,0.346,0.348,0.396,0.388,0.359,0.354,0.399,0.371,0.372,0.103,0.168,0.346,0.355,0.243,0.366,0.394,0.371,0.437,0.455,0.209,0.001,0.316,0.423,0.273,0.277,0.385,0.218,0.327,0.346,0.316,0.352,0.338,0.346,0.327,0.343,0.351,0.351,0.293,0.357,2563,OrganismalFitness,UBC9_HUMAN,Medium,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,0.247,0.336,0.369,0.379,0.37,0.378,0.152,0.349,0.363,0.371,0.341,0.414,0.4,0.334,0.358,0.394,0.374,0.402,0.428,0.383,0.217,0.358,0.35,0.371,0.377,0.383,0.342,0.341,0.369,0.387,0.422,0.418,0.387,0.153,0.167,0.299,0.333,0.324,0.332,0.382,0.375,0.386,0.326,-0.02,0.252,0.259,0.369,0.292,0.394,0.305,0.35,0.385,0.369,0.37,0.374,0.366,0.374,0.375,0.385,0.374,0.411,0.39,3622,Stability,UBE4B_HUMAN,High,Human
+UBE4B_MOUSE_Starita_2013,0.306,0.312,0.348,0.342,0.348,0.354,0.04,0.046,0.318,0.342,0.276,0.33,0.348,0.318,0.336,0.36,0.33,0.318,0.288,0.318,0.082,0.276,0.27,0.252,0.348,0.3,0.288,0.288,0.252,0.33,0.36,0.3,0.058,-0.009,0.118,0.203,0.312,0.288,0.306,0.342,0.36,0.379,0.306,-0.003,0.33,0.348,0.191,0.27,-0.057,0.034,0.318,0.33,0.354,0.354,0.342,0.36,0.33,0.36,0.342,0.354,0.33,0.288,899,Activity,UBE4B_MOUSE,Low,Eukaryote
+UBR5_HUMAN_Tsuboyama_2023_1I2T,0.374,0.495,0.528,0.544,0.542,0.566,0.142,0.294,0.432,0.478,0.492,0.374,0.454,0.192,0.187,0.129,0.101,0.561,0.522,0.421,0.404,0.393,0.429,0.487,0.443,0.454,0.462,0.443,0.451,0.506,0.528,0.382,0.264,0.283,0.376,0.365,0.374,0.41,0.418,0.553,0.553,0.547,0.093,0.098,0.291,0.101,0.445,0.244,0.588,0.484,0.533,0.536,0.547,0.553,0.561,0.572,0.536,0.555,0.547,0.55,0.586,0.514,1453,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8,0.12,0.192,0.12,0.148,0.192,0.198,0.187,-0.057,0.264,0.242,0.519,0.441,0.463,0.286,0.419,0.491,0.519,0.491,0.546,0.109,0.297,0.402,0.358,0.485,0.32,0.342,0.331,0.369,0.447,0.441,0.43,0.295,0.048,0.237,0.358,0.397,0.198,0.281,0.342,0.22,0.248,0.281,0.347,0.209,0.441,0.38,0.386,0.425,0.43,0.375,0.53,0.48,0.508,0.53,0.508,0.568,0.474,0.535,0.563,0.535,0.497,0.48,723,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5,0.271,0.323,0.432,0.476,0.495,0.499,0.065,0.416,0.441,0.478,0.55,0.485,0.49,0.26,0.158,0.513,0.517,0.474,0.42,0.425,0.234,0.35,0.429,0.385,0.379,0.425,0.411,0.378,0.441,0.474,0.499,0.489,0.283,0.264,0.343,0.357,0.409,0.415,0.409,0.501,0.494,0.48,0.163,0.2,0.36,0.351,0.387,0.346,0.539,0.464,0.545,0.553,0.555,0.539,0.546,0.552,0.553,0.546,0.553,0.552,0.499,0.508,2568,Stability,VILI_CHICK,High,Eukaryote
+VKOR1_HUMAN_Chiasson_2020_abundance,0.323,0.34,0.351,0.357,0.387,0.386,0.124,0.317,0.446,0.449,0.396,0.375,0.411,0.139,0.307,0.381,0.443,0.426,0.424,0.393,0.175,0.188,0.332,0.359,0.236,0.411,0.384,0.384,0.342,0.393,0.345,0.264,0.089,0.111,0.221,0.402,0.331,0.342,0.452,0.386,0.396,0.437,0.108,0.077,0.405,0.265,0.371,0.383,0.409,0.191,0.437,0.421,0.47,0.475,0.478,0.445,0.464,0.437,0.458,0.47,0.461,0.377,2695,Expression,VKOR1_HUMAN,Medium,Human
+VKOR1_HUMAN_Chiasson_2020_activity,0.317,0.311,0.335,0.359,0.365,0.365,0.037,0.311,0.371,0.383,0.353,0.383,0.383,0.037,0.263,0.281,0.371,0.353,0.365,0.371,0.096,0.055,0.269,0.293,0.144,0.317,0.323,0.317,0.347,0.335,0.341,0.287,0.162,0.061,0.132,0.329,0.317,0.311,0.383,0.353,0.371,0.371,0.067,0.031,0.377,0.198,0.198,0.317,0.335,0.049,0.371,0.311,0.347,0.335,0.347,0.347,0.347,0.353,0.335,0.359,0.371,0.281,697,Activity,VKOR1_HUMAN,Medium,Human
+VRPI_BPT7_Tsuboyama_2023_2WNM,-0.076,0.011,0.102,0.144,0.081,0.11,0.094,0.11,0.342,0.351,0.363,0.264,0.28,0.214,0.342,0.384,0.463,0.496,0.421,0.135,0.007,0.139,0.115,0.222,0.197,0.185,0.139,0.189,0.247,0.069,0.355,0.338,0.231,0.04,0.011,0.073,-0.084,-0.055,-0.08,0.028,0.028,-0.043,0.218,0.036,0.425,0.293,0.463,0.392,0.442,0.442,0.467,0.446,0.446,0.446,0.454,0.454,0.438,0.458,0.434,0.467,0.492,0.417,1047,Stability,VRPI_BPT7,Medium,Virus
+YAIA_ECOLI_Tsuboyama_2023_2KVT,0.243,0.511,0.432,0.455,0.434,0.432,-0.144,0.461,0.497,0.504,0.483,0.148,0.309,-0.091,0.106,0.432,0.521,0.544,0.546,0.523,-0.091,-0.167,-0.091,0.188,-0.099,-0.142,-0.186,0.017,0.394,0.449,0.55,0.524,0.004,-0.197,-0.051,0.396,0.273,0.267,0.417,0.404,0.396,0.485,-0.059,-0.212,0.068,-0.04,0.326,0.271,0.432,0.387,0.523,0.531,0.529,0.527,0.529,0.533,0.525,0.54,0.525,0.527,0.459,0.372,1890,Stability,YAIA_ECOLI,Medium,Prokaryote
+YAP1_HUMAN_Araya_2012,0.338,0.26,0.356,0.359,0.347,0.358,0.24,0.25,0.055,0.067,0.263,0.208,0.218,0.342,0.361,0.369,0.381,0.3,0.246,0.149,0.129,0.134,0.117,0.114,0.23,0.171,0.201,0.157,0.107,0.245,0.353,0.378,0.093,0.23,0.146,0.151,0.309,0.259,0.273,0.342,0.312,0.337,0.38,-0.07,0.254,0.391,0.285,0.272,0.327,0.145,0.37,0.303,0.343,0.341,0.351,0.327,0.339,0.362,0.367,0.361,0.29,0.348,10075,Binding,YAP1_HUMAN,Low,Human
+YNZC_BACSU_Tsuboyama_2023_2JVD,0.471,0.55,0.553,0.553,0.55,0.553,0.393,0.448,0.526,0.522,0.559,0.553,0.553,0.436,0.438,0.55,0.548,0.55,0.546,0.542,0.421,0.486,0.506,0.506,0.5,0.5,0.464,0.5,0.54,0.497,0.516,0.493,0.502,0.434,0.476,0.446,0.512,0.51,0.506,0.52,0.516,0.514,0.24,0.186,0.359,0.264,0.315,0.381,0.563,0.534,0.559,0.55,0.555,0.557,0.555,0.561,0.553,0.557,0.553,0.555,0.544,0.555,2300,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.html b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.html
new file mode 100644
index 0000000..f764b15
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_DMS_level.html
@@ -0,0 +1,15196 @@
+
+
+
+ score |
+ Site-Independent |
+ EVmutation |
+ DeepSequence (single) |
+ DeepSequence (ensemble) |
+ EVE (single) |
+ EVE (ensemble) |
+ Unirep |
+ Unirep evotuned |
+ MSA Transformer (single) |
+ MSA Transformer (ensemble) |
+ ESM-1b |
+ ESM-1v (single) |
+ ESM-1v (ensemble) |
+ ESM2 (8M) |
+ ESM2 (35M) |
+ ESM2 (150M) |
+ ESM2 (650M) |
+ ESM2 (3B) |
+ ESM2 (15B) |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ GEMME |
+ VESPA |
+ VESPAl |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ CARP (38M) |
+ CARP (600K) |
+ CARP (640M) |
+ CARP (76M) |
+ MIF |
+ MIF-ST |
+ ESM-IF1 |
+ ProteinMPNN |
+ ProtSSN (k=10 h=512) |
+ ProtSSN (k=10 h=768) |
+ ProtSSN (k=10 h=1280) |
+ ProtSSN (k=20 h=512) |
+ ProtSSN (k=20 h=768) |
+ ProtSSN (k=20 h=1280) |
+ ProtSSN (k=30 h=512) |
+ ProtSSN (k=30 h=768) |
+ ProtSSN (k=30 h=1280) |
+ ProtSSN (ensemble) |
+ SaProt (650M) |
+ SaProt (35M) |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A0A140D2T1_ZIKV_Sourisseau_2019 |
+ 0.265 |
+ 0.230 |
+ 0.082 |
+ 0.075 |
+ 0.280 |
+ 0.292 |
+ -0.124 |
+ 0.063 |
+ 0.334 |
+ 0.335 |
+ -0.023 |
+ -0.041 |
+ 0.005 |
+ -0.058 |
+ -0.045 |
+ -0.037 |
+ 0.147 |
+ 0.277 |
+ 0.281 |
+ 0.168 |
+ 0.263 |
+ 0.218 |
+ 0.219 |
+ 0.210 |
+ 0.241 |
+ 0.236 |
+ 0.224 |
+ 0.226 |
+ 0.197 |
+ 0.270 |
+ 0.208 |
+ 0.191 |
+ -0.005 |
+ 0.217 |
+ 0.234 |
+ 0.193 |
+ 0.264 |
+ 0.263 |
+ 0.248 |
+ 0.258 |
+ 0.254 |
+ 0.271 |
+ -0.043 |
+ -0.061 |
+ 0.097 |
+ -0.052 |
+ 0.202 |
+ 0.205 |
+ 0.210 |
+ 0.086 |
+ 0.175 |
+ 0.178 |
+ 0.183 |
+ 0.202 |
+ 0.191 |
+ 0.191 |
+ 0.184 |
+ 0.183 |
+ 0.183 |
+ 0.188 |
+ 0.140 |
+ 0.084 |
+ 9576 |
+ OrganismalFitness |
+ A0A140D2T1_ZIKV |
+ Medium |
+ Virus |
+
+
+ A0A192B1T2_9HIV1_Haddox_2018 |
+ 0.359 |
+ 0.332 |
+ 0.317 |
+ 0.337 |
+ 0.387 |
+ 0.401 |
+ 0.008 |
+ 0.390 |
+ 0.406 |
+ 0.409 |
+ 0.356 |
+ 0.391 |
+ 0.401 |
+ 0.022 |
+ 0.022 |
+ 0.040 |
+ 0.076 |
+ 0.087 |
+ 0.100 |
+ 0.359 |
+ 0.386 |
+ 0.397 |
+ 0.398 |
+ 0.394 |
+ 0.393 |
+ 0.389 |
+ 0.366 |
+ 0.384 |
+ 0.385 |
+ 0.382 |
+ 0.435 |
+ 0.418 |
+ 0.256 |
+ 0.385 |
+ 0.384 |
+ 0.400 |
+ 0.393 |
+ 0.399 |
+ 0.398 |
+ 0.402 |
+ 0.409 |
+ 0.411 |
+ 0.329 |
+ -0.002 |
+ 0.378 |
+ 0.332 |
+ 0.284 |
+ 0.359 |
+ 0.187 |
+ 0.102 |
+ 0.167 |
+ 0.190 |
+ 0.205 |
+ 0.208 |
+ 0.168 |
+ 0.186 |
+ 0.187 |
+ 0.163 |
+ 0.145 |
+ 0.197 |
+ 0.119 |
+ 0.063 |
+ 12577 |
+ OrganismalFitness |
+ A0A192B1T2_9HIV1 |
+ Medium |
+ Virus |
+
+
+ A0A1I9GEU1_NEIME_Kennouche_2019 |
+ -0.009 |
+ 0.026 |
+ 0.082 |
+ 0.061 |
+ 0.043 |
+ 0.052 |
+ 0.000 |
+ 0.065 |
+ 0.061 |
+ 0.069 |
+ 0.017 |
+ 0.043 |
+ 0.065 |
+ -0.026 |
+ -0.030 |
+ 0.004 |
+ 0.048 |
+ 0.022 |
+ 0.056 |
+ 0.048 |
+ -0.004 |
+ 0.022 |
+ 0.052 |
+ 0.043 |
+ 0.030 |
+ 0.061 |
+ 0.052 |
+ 0.069 |
+ 0.065 |
+ 0.030 |
+ 0.000 |
+ -0.007 |
+ 0.017 |
+ 0.043 |
+ 0.052 |
+ 0.082 |
+ 0.030 |
+ 0.065 |
+ 0.056 |
+ 0.039 |
+ 0.082 |
+ 0.061 |
+ -0.013 |
+ -0.035 |
+ 0.052 |
+ -0.030 |
+ 0.022 |
+ 0.052 |
+ 0.035 |
+ 0.056 |
+ 0.065 |
+ 0.052 |
+ 0.065 |
+ 0.056 |
+ 0.065 |
+ 0.078 |
+ 0.039 |
+ 0.069 |
+ 0.061 |
+ 0.082 |
+ 0.043 |
+ 0.004 |
+ 922 |
+ Activity |
+ A0A1I9GEU1_NEIME |
+ Medium |
+ Prokaryote |
+
+
+ A0A247D711_LISMN_Stadelmann_2021 |
+ 0.347 |
+ 0.359 |
+ 0.096 |
+ 0.041 |
+ 0.312 |
+ 0.316 |
+ 0.028 |
+ 0.038 |
+ 0.234 |
+ 0.258 |
+ 0.065 |
+ 0.082 |
+ 0.087 |
+ 0.070 |
+ 0.079 |
+ 0.084 |
+ 0.060 |
+ 0.070 |
+ 0.084 |
+ 0.043 |
+ 0.067 |
+ 0.055 |
+ 0.053 |
+ 0.094 |
+ 0.014 |
+ 0.065 |
+ 0.033 |
+ 0.070 |
+ 0.089 |
+ 0.351 |
+ 0.244 |
+ 0.234 |
+ 0.004 |
+ 0.053 |
+ 0.041 |
+ 0.055 |
+ 0.212 |
+ 0.210 |
+ 0.215 |
+ 0.193 |
+ 0.183 |
+ 0.193 |
+ 0.062 |
+ 0.043 |
+ 0.067 |
+ 0.074 |
+ 0.312 |
+ 0.353 |
+ 0.328 |
+ 0.263 |
+ 0.210 |
+ 0.237 |
+ 0.224 |
+ 0.246 |
+ 0.241 |
+ 0.280 |
+ 0.208 |
+ 0.244 |
+ 0.191 |
+ 0.237 |
+ 0.316 |
+ 0.212 |
+ 1653 |
+ Activity |
+ A0A247D711_LISMN |
+ High |
+ Prokaryote |
+
+
+ A0A2Z5U3Z0_9INFA_Doud_2016 |
+ 0.357 |
+ 0.355 |
+ 0.359 |
+ 0.390 |
+ 0.417 |
+ 0.413 |
+ -0.006 |
+ 0.373 |
+ 0.380 |
+ 0.376 |
+ 0.071 |
+ 0.405 |
+ 0.431 |
+ -0.022 |
+ -0.001 |
+ 0.040 |
+ 0.397 |
+ 0.390 |
+ 0.378 |
+ 0.296 |
+ 0.352 |
+ 0.413 |
+ 0.396 |
+ 0.420 |
+ 0.334 |
+ 0.391 |
+ 0.395 |
+ 0.376 |
+ 0.385 |
+ 0.403 |
+ 0.389 |
+ 0.311 |
+ 0.059 |
+ 0.356 |
+ 0.407 |
+ 0.415 |
+ 0.407 |
+ 0.416 |
+ 0.402 |
+ 0.436 |
+ 0.440 |
+ 0.431 |
+ 0.001 |
+ 0.007 |
+ 0.263 |
+ 0.004 |
+ 0.333 |
+ 0.378 |
+ 0.366 |
+ 0.132 |
+ 0.384 |
+ 0.384 |
+ 0.425 |
+ 0.415 |
+ 0.417 |
+ 0.431 |
+ 0.426 |
+ 0.407 |
+ 0.439 |
+ 0.432 |
+ 0.150 |
+ 0.122 |
+ 10715 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A0A2Z5U3Z0_9INFA_Wu_2014 |
+ 0.413 |
+ 0.423 |
+ 0.365 |
+ 0.375 |
+ 0.413 |
+ 0.431 |
+ -0.011 |
+ 0.316 |
+ 0.382 |
+ 0.387 |
+ 0.101 |
+ 0.358 |
+ 0.394 |
+ 0.030 |
+ 0.054 |
+ 0.054 |
+ 0.362 |
+ 0.389 |
+ 0.375 |
+ 0.346 |
+ 0.336 |
+ 0.389 |
+ 0.386 |
+ 0.392 |
+ 0.290 |
+ 0.333 |
+ 0.355 |
+ 0.321 |
+ 0.355 |
+ 0.425 |
+ 0.333 |
+ 0.283 |
+ 0.066 |
+ 0.338 |
+ 0.379 |
+ 0.406 |
+ 0.401 |
+ 0.450 |
+ 0.440 |
+ 0.457 |
+ 0.466 |
+ 0.459 |
+ 0.031 |
+ 0.031 |
+ 0.183 |
+ 0.030 |
+ 0.289 |
+ 0.295 |
+ 0.278 |
+ 0.096 |
+ 0.350 |
+ 0.333 |
+ 0.369 |
+ 0.362 |
+ 0.362 |
+ 0.379 |
+ 0.379 |
+ 0.370 |
+ 0.369 |
+ 0.380 |
+ 0.163 |
+ 0.130 |
+ 2350 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A4_HUMAN_Seuma_2022 |
+ 0.363 |
+ 0.323 |
+ 0.350 |
+ 0.323 |
+ 0.246 |
+ 0.243 |
+ 0.292 |
+ 0.110 |
+ 0.246 |
+ 0.250 |
+ 0.234 |
+ 0.218 |
+ 0.295 |
+ 0.338 |
+ 0.307 |
+ 0.305 |
+ 0.359 |
+ 0.371 |
+ 0.269 |
+ 0.185 |
+ 0.217 |
+ 0.171 |
+ 0.204 |
+ 0.198 |
+ 0.280 |
+ 0.192 |
+ 0.212 |
+ 0.202 |
+ 0.201 |
+ 0.388 |
+ 0.191 |
+ 0.128 |
+ 0.412 |
+ 0.280 |
+ 0.161 |
+ 0.236 |
+ 0.333 |
+ 0.266 |
+ 0.361 |
+ 0.332 |
+ 0.288 |
+ 0.310 |
+ 0.316 |
+ 0.311 |
+ 0.333 |
+ 0.304 |
+ 0.232 |
+ 0.315 |
+ -0.130 |
+ 0.042 |
+ 0.357 |
+ 0.338 |
+ 0.337 |
+ 0.341 |
+ 0.344 |
+ 0.316 |
+ 0.356 |
+ 0.281 |
+ 0.290 |
+ 0.343 |
+ 0.348 |
+ 0.332 |
+ 14811 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ A4D664_9INFA_Soh_2019 |
+ 0.344 |
+ 0.252 |
+ 0.341 |
+ 0.335 |
+ 0.326 |
+ 0.323 |
+ 0.015 |
+ 0.267 |
+ 0.254 |
+ 0.245 |
+ 0.024 |
+ 0.019 |
+ 0.025 |
+ 0.019 |
+ 0.020 |
+ 0.020 |
+ 0.103 |
+ 0.139 |
+ 0.196 |
+ 0.194 |
+ 0.269 |
+ 0.301 |
+ 0.319 |
+ 0.313 |
+ 0.050 |
+ 0.208 |
+ 0.224 |
+ 0.197 |
+ 0.258 |
+ 0.365 |
+ 0.240 |
+ 0.237 |
+ 0.024 |
+ 0.269 |
+ 0.284 |
+ 0.314 |
+ 0.279 |
+ 0.293 |
+ 0.296 |
+ 0.349 |
+ 0.356 |
+ 0.367 |
+ 0.015 |
+ 0.002 |
+ 0.047 |
+ 0.005 |
+ 0.204 |
+ 0.199 |
+ 0.090 |
+ 0.081 |
+ 0.154 |
+ 0.151 |
+ 0.145 |
+ 0.163 |
+ 0.153 |
+ 0.163 |
+ 0.158 |
+ 0.168 |
+ 0.160 |
+ 0.158 |
+ 0.112 |
+ 0.064 |
+ 14421 |
+ OrganismalFitness |
+ A4D664_9INFA |
+ Medium |
+ Virus |
+
+
+ A4GRB6_PSEAI_Chen_2020 |
+ 0.304 |
+ 0.469 |
+ 0.554 |
+ 0.557 |
+ 0.514 |
+ 0.536 |
+ 0.290 |
+ 0.427 |
+ 0.526 |
+ 0.580 |
+ 0.585 |
+ 0.546 |
+ 0.572 |
+ 0.354 |
+ 0.441 |
+ 0.551 |
+ 0.639 |
+ 0.614 |
+ 0.513 |
+ 0.465 |
+ 0.336 |
+ 0.448 |
+ 0.463 |
+ 0.525 |
+ 0.441 |
+ 0.514 |
+ 0.530 |
+ 0.527 |
+ 0.593 |
+ 0.568 |
+ 0.643 |
+ 0.573 |
+ 0.206 |
+ 0.340 |
+ 0.464 |
+ 0.521 |
+ 0.460 |
+ 0.493 |
+ 0.539 |
+ 0.547 |
+ 0.556 |
+ 0.567 |
+ 0.412 |
+ 0.080 |
+ 0.560 |
+ 0.504 |
+ 0.546 |
+ 0.597 |
+ 0.536 |
+ 0.325 |
+ 0.618 |
+ 0.603 |
+ 0.596 |
+ 0.590 |
+ 0.597 |
+ 0.607 |
+ 0.617 |
+ 0.617 |
+ 0.616 |
+ 0.615 |
+ 0.610 |
+ 0.417 |
+ 5004 |
+ OrganismalFitness |
+ A4GRB6_PSEAI |
+ High |
+ Prokaryote |
+
+
+ AACC1_PSEAI_Dandage_2018 |
+ 0.192 |
+ 0.363 |
+ 0.249 |
+ 0.336 |
+ 0.376 |
+ 0.374 |
+ 0.120 |
+ 0.196 |
+ 0.389 |
+ 0.380 |
+ 0.287 |
+ 0.345 |
+ 0.371 |
+ 0.127 |
+ 0.147 |
+ 0.154 |
+ 0.363 |
+ 0.398 |
+ 0.414 |
+ 0.278 |
+ 0.112 |
+ 0.156 |
+ 0.174 |
+ 0.209 |
+ 0.178 |
+ 0.320 |
+ 0.298 |
+ 0.327 |
+ 0.354 |
+ 0.363 |
+ 0.400 |
+ 0.349 |
+ 0.003 |
+ 0.163 |
+ 0.163 |
+ 0.311 |
+ 0.320 |
+ 0.303 |
+ 0.334 |
+ 0.349 |
+ 0.343 |
+ 0.385 |
+ 0.154 |
+ 0.076 |
+ 0.243 |
+ 0.163 |
+ 0.187 |
+ 0.265 |
+ 0.245 |
+ 0.114 |
+ 0.363 |
+ 0.349 |
+ 0.360 |
+ 0.349 |
+ 0.356 |
+ 0.340 |
+ 0.351 |
+ 0.349 |
+ 0.347 |
+ 0.358 |
+ 0.360 |
+ 0.200 |
+ 1801 |
+ OrganismalFitness |
+ AACC1_PSEAI |
+ High |
+ Prokaryote |
+
+
+ ACE2_HUMAN_Chan_2020 |
+ 0.162 |
+ 0.105 |
+ 0.080 |
+ 0.080 |
+ 0.117 |
+ 0.103 |
+ -0.028 |
+ 0.013 |
+ 0.161 |
+ 0.170 |
+ 0.180 |
+ 0.087 |
+ 0.126 |
+ -0.014 |
+ 0.035 |
+ 0.152 |
+ 0.152 |
+ 0.116 |
+ 0.089 |
+ 0.126 |
+ -0.005 |
+ 0.033 |
+ 0.094 |
+ 0.141 |
+ -0.028 |
+ 0.074 |
+ 0.117 |
+ 0.116 |
+ 0.180 |
+ 0.096 |
+ 0.103 |
+ 0.073 |
+ 0.062 |
+ 0.006 |
+ 0.040 |
+ 0.045 |
+ 0.135 |
+ 0.092 |
+ 0.083 |
+ 0.107 |
+ 0.083 |
+ 0.081 |
+ 0.008 |
+ 0.045 |
+ 0.218 |
+ 0.035 |
+ 0.288 |
+ 0.240 |
+ 0.270 |
+ 0.080 |
+ 0.161 |
+ 0.207 |
+ 0.161 |
+ 0.182 |
+ 0.173 |
+ 0.195 |
+ 0.164 |
+ 0.162 |
+ 0.166 |
+ 0.173 |
+ 0.233 |
+ 0.161 |
+ 2223 |
+ Binding |
+ ACE2_HUMAN |
+ Medium |
+ Human |
+
+
+ ADRB2_HUMAN_Jones_2020 |
+ 0.277 |
+ 0.364 |
+ 0.410 |
+ 0.411 |
+ 0.397 |
+ 0.407 |
+ 0.364 |
+ 0.396 |
+ 0.432 |
+ 0.426 |
+ 0.430 |
+ 0.428 |
+ 0.428 |
+ 0.313 |
+ 0.341 |
+ 0.373 |
+ 0.392 |
+ 0.402 |
+ 0.405 |
+ 0.412 |
+ 0.425 |
+ 0.427 |
+ 0.415 |
+ 0.390 |
+ 0.431 |
+ 0.421 |
+ 0.421 |
+ 0.425 |
+ 0.391 |
+ 0.429 |
+ 0.406 |
+ 0.311 |
+ 0.250 |
+ 0.435 |
+ 0.421 |
+ 0.401 |
+ 0.428 |
+ 0.430 |
+ 0.427 |
+ 0.423 |
+ 0.430 |
+ 0.428 |
+ 0.375 |
+ 0.124 |
+ 0.431 |
+ 0.411 |
+ 0.336 |
+ 0.396 |
+ 0.368 |
+ 0.169 |
+ 0.386 |
+ 0.390 |
+ 0.412 |
+ 0.402 |
+ 0.409 |
+ 0.402 |
+ 0.395 |
+ 0.414 |
+ 0.403 |
+ 0.417 |
+ 0.457 |
+ 0.411 |
+ 7800 |
+ Activity |
+ ADRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ AICDA_HUMAN_Gajula_2014_3cycles |
+ 0.011 |
+ 0.216 |
+ 0.267 |
+ 0.267 |
+ 0.267 |
+ 0.267 |
+ -0.169 |
+ 0.139 |
+ 0.164 |
+ 0.216 |
+ 0.216 |
+ 0.216 |
+ 0.267 |
+ -0.143 |
+ -0.143 |
+ 0.011 |
+ 0.190 |
+ 0.190 |
+ 0.113 |
+ 0.395 |
+ -0.092 |
+ 0.036 |
+ 0.190 |
+ 0.241 |
+ -0.066 |
+ 0.139 |
+ 0.293 |
+ 0.216 |
+ 0.190 |
+ 0.164 |
+ 0.293 |
+ 0.293 |
+ 0.036 |
+ -0.041 |
+ -0.015 |
+ 0.216 |
+ 0.036 |
+ 0.113 |
+ 0.190 |
+ 0.216 |
+ 0.241 |
+ 0.216 |
+ -0.169 |
+ -0.169 |
+ 0.164 |
+ 0.036 |
+ 0.293 |
+ 0.241 |
+ 0.267 |
+ 0.139 |
+ 0.216 |
+ 0.241 |
+ 0.216 |
+ 0.216 |
+ 0.164 |
+ 0.216 |
+ 0.216 |
+ 0.216 |
+ 0.190 |
+ 0.190 |
+ 0.190 |
+ -0.015 |
+ 209 |
+ Activity |
+ AICDA_HUMAN |
+ Medium |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O |
+ 0.201 |
+ 0.249 |
+ 0.241 |
+ 0.226 |
+ 0.229 |
+ 0.234 |
+ -0.227 |
+ 0.013 |
+ 0.085 |
+ 0.096 |
+ 0.188 |
+ 0.070 |
+ 0.271 |
+ -0.139 |
+ 0.186 |
+ 0.127 |
+ 0.102 |
+ 0.135 |
+ 0.127 |
+ -0.015 |
+ -0.063 |
+ -0.157 |
+ -0.094 |
+ -0.087 |
+ -0.271 |
+ -0.124 |
+ -0.112 |
+ -0.176 |
+ 0.005 |
+ 0.203 |
+ 0.125 |
+ 0.128 |
+ 0.007 |
+ -0.207 |
+ -0.108 |
+ -0.248 |
+ 0.222 |
+ 0.234 |
+ 0.183 |
+ 0.262 |
+ 0.256 |
+ 0.211 |
+ -0.160 |
+ -0.162 |
+ 0.040 |
+ 0.003 |
+ 0.143 |
+ 0.020 |
+ 0.276 |
+ 0.232 |
+ 0.283 |
+ 0.149 |
+ 0.048 |
+ 0.172 |
+ 0.162 |
+ 0.164 |
+ 0.163 |
+ 0.166 |
+ 0.174 |
+ 0.159 |
+ 0.153 |
+ 0.260 |
+ 2972 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ AMIE_PSEAE_Wrenbeck_2017 |
+ 0.207 |
+ 0.391 |
+ 0.424 |
+ 0.448 |
+ 0.421 |
+ 0.396 |
+ 0.035 |
+ 0.363 |
+ 0.314 |
+ 0.494 |
+ 0.466 |
+ 0.496 |
+ 0.546 |
+ 0.214 |
+ 0.282 |
+ 0.299 |
+ 0.437 |
+ 0.557 |
+ 0.493 |
+ 0.394 |
+ 0.452 |
+ 0.449 |
+ 0.444 |
+ 0.454 |
+ 0.453 |
+ 0.485 |
+ 0.455 |
+ 0.453 |
+ 0.433 |
+ 0.470 |
+ 0.487 |
+ 0.455 |
+ 0.243 |
+ 0.499 |
+ 0.389 |
+ 0.351 |
+ 0.474 |
+ 0.395 |
+ 0.360 |
+ 0.494 |
+ 0.432 |
+ 0.415 |
+ 0.233 |
+ 0.040 |
+ 0.358 |
+ 0.281 |
+ 0.291 |
+ 0.374 |
+ 0.324 |
+ 0.167 |
+ 0.448 |
+ 0.410 |
+ 0.418 |
+ 0.435 |
+ 0.422 |
+ 0.432 |
+ 0.413 |
+ 0.426 |
+ 0.453 |
+ 0.435 |
+ 0.475 |
+ 0.286 |
+ 6227 |
+ Activity |
+ AMIE_PSEAE |
+ High |
+ Prokaryote |
+
+
+ ANCSZ_Hobbs_2022 |
+ 0.417 |
+ 0.382 |
+ 0.325 |
+ 0.348 |
+ 0.364 |
+ 0.369 |
+ 0.389 |
+ 0.342 |
+ 0.368 |
+ 0.382 |
+ 0.405 |
+ 0.382 |
+ 0.406 |
+ 0.360 |
+ 0.431 |
+ 0.434 |
+ 0.448 |
+ 0.437 |
+ 0.418 |
+ 0.008 |
+ 0.316 |
+ 0.346 |
+ 0.380 |
+ 0.380 |
+ 0.374 |
+ 0.383 |
+ 0.372 |
+ 0.352 |
+ 0.370 |
+ 0.433 |
+ 0.410 |
+ 0.406 |
+ 0.222 |
+ 0.343 |
+ 0.370 |
+ 0.341 |
+ 0.404 |
+ 0.424 |
+ 0.409 |
+ 0.388 |
+ 0.408 |
+ 0.394 |
+ 0.381 |
+ 0.285 |
+ 0.366 |
+ 0.372 |
+ 0.339 |
+ 0.361 |
+ 0.311 |
+ 0.123 |
+ 0.414 |
+ 0.408 |
+ 0.431 |
+ 0.434 |
+ 0.431 |
+ 0.442 |
+ 0.418 |
+ 0.430 |
+ 0.422 |
+ 0.441 |
+ 0.419 |
+ 0.442 |
+ 4670 |
+ Activity |
+ ANCSZ |
+ Medium |
+ Eukaryote |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY |
+ 0.212 |
+ 0.240 |
+ 0.293 |
+ 0.296 |
+ 0.281 |
+ 0.281 |
+ 0.181 |
+ 0.249 |
+ 0.315 |
+ 0.352 |
+ 0.284 |
+ 0.203 |
+ 0.262 |
+ 0.209 |
+ 0.181 |
+ 0.355 |
+ 0.340 |
+ 0.377 |
+ 0.293 |
+ 0.296 |
+ 0.225 |
+ 0.327 |
+ 0.330 |
+ 0.277 |
+ 0.231 |
+ 0.296 |
+ 0.290 |
+ 0.324 |
+ 0.302 |
+ 0.336 |
+ 0.305 |
+ 0.239 |
+ 0.066 |
+ 0.253 |
+ 0.265 |
+ 0.324 |
+ 0.212 |
+ 0.215 |
+ 0.281 |
+ 0.268 |
+ 0.277 |
+ 0.299 |
+ 0.162 |
+ 0.144 |
+ 0.330 |
+ 0.296 |
+ 0.585 |
+ 0.498 |
+ 0.601 |
+ 0.495 |
+ 0.371 |
+ 0.392 |
+ 0.377 |
+ 0.386 |
+ 0.358 |
+ 0.380 |
+ 0.355 |
+ 0.368 |
+ 0.340 |
+ 0.377 |
+ 0.430 |
+ 0.368 |
+ 1287 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B2L11_HUMAN_Dutta_2010_binding-Mcl-1 |
+ 0.519 |
+ 0.002 |
+ 0.508 |
+ 0.532 |
+ 0.319 |
+ 0.508 |
+ 0.177 |
+ 0.414 |
+ 0.201 |
+ 0.225 |
+ 0.177 |
+ 0.059 |
+ 0.248 |
+ 0.059 |
+ 0.154 |
+ 0.106 |
+ 0.225 |
+ 0.343 |
+ 0.272 |
+ 0.225 |
+ 0.154 |
+ 0.035 |
+ 0.225 |
+ 0.366 |
+ 0.059 |
+ 0.177 |
+ 0.319 |
+ 0.248 |
+ 0.295 |
+ 0.658 |
+ 0.319 |
+ 0.300 |
+ 0.201 |
+ 0.106 |
+ 0.177 |
+ 0.366 |
+ 0.437 |
+ 0.343 |
+ 0.343 |
+ 0.532 |
+ 0.461 |
+ 0.366 |
+ 0.130 |
+ 0.012 |
+ 0.154 |
+ 0.083 |
+ 0.272 |
+ 0.248 |
+ 0.319 |
+ 0.035 |
+ 0.272 |
+ 0.272 |
+ 0.319 |
+ 0.201 |
+ 0.248 |
+ 0.414 |
+ 0.225 |
+ 0.177 |
+ 0.366 |
+ 0.248 |
+ 0.201 |
+ 0.201 |
+ 170 |
+ Binding |
+ B2L11_HUMAN |
+ Low |
+ Human |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0 |
+ 0.189 |
+ 0.239 |
+ 0.379 |
+ 0.361 |
+ 0.375 |
+ 0.381 |
+ 0.185 |
+ 0.282 |
+ 0.365 |
+ 0.382 |
+ 0.433 |
+ 0.369 |
+ 0.410 |
+ 0.324 |
+ 0.359 |
+ 0.359 |
+ 0.439 |
+ 0.371 |
+ 0.440 |
+ 0.290 |
+ 0.332 |
+ 0.352 |
+ 0.381 |
+ 0.340 |
+ 0.123 |
+ 0.375 |
+ 0.408 |
+ 0.369 |
+ 0.299 |
+ 0.294 |
+ 0.352 |
+ 0.329 |
+ 0.266 |
+ 0.299 |
+ 0.206 |
+ 0.268 |
+ 0.328 |
+ 0.263 |
+ 0.309 |
+ 0.408 |
+ 0.359 |
+ 0.382 |
+ 0.098 |
+ 0.108 |
+ 0.210 |
+ 0.127 |
+ 0.468 |
+ 0.367 |
+ 0.572 |
+ 0.514 |
+ 0.442 |
+ 0.421 |
+ 0.448 |
+ 0.442 |
+ 0.442 |
+ 0.446 |
+ 0.435 |
+ 0.442 |
+ 0.442 |
+ 0.444 |
+ 0.450 |
+ 0.483 |
+ 2069 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU |
+ 0.327 |
+ 0.307 |
+ 0.281 |
+ 0.296 |
+ 0.284 |
+ 0.302 |
+ 0.098 |
+ 0.159 |
+ 0.309 |
+ 0.391 |
+ 0.330 |
+ 0.248 |
+ 0.289 |
+ 0.118 |
+ 0.149 |
+ 0.251 |
+ 0.406 |
+ 0.414 |
+ 0.324 |
+ 0.342 |
+ 0.088 |
+ 0.299 |
+ 0.154 |
+ 0.302 |
+ 0.136 |
+ 0.363 |
+ 0.210 |
+ 0.330 |
+ 0.340 |
+ 0.500 |
+ 0.472 |
+ 0.401 |
+ 0.024 |
+ 0.164 |
+ 0.230 |
+ 0.350 |
+ 0.355 |
+ 0.360 |
+ 0.360 |
+ 0.296 |
+ 0.309 |
+ 0.340 |
+ 0.078 |
+ 0.085 |
+ 0.169 |
+ 0.146 |
+ 0.421 |
+ 0.370 |
+ 0.518 |
+ 0.431 |
+ 0.454 |
+ 0.452 |
+ 0.464 |
+ 0.472 |
+ 0.459 |
+ 0.487 |
+ 0.459 |
+ 0.449 |
+ 0.454 |
+ 0.462 |
+ 0.378 |
+ 0.210 |
+ 1572 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Deng_2012 |
+ 0.261 |
+ 0.406 |
+ 0.402 |
+ 0.422 |
+ 0.402 |
+ 0.414 |
+ 0.067 |
+ 0.240 |
+ 0.419 |
+ 0.448 |
+ 0.413 |
+ 0.394 |
+ 0.418 |
+ 0.189 |
+ 0.281 |
+ 0.373 |
+ 0.443 |
+ 0.345 |
+ 0.259 |
+ 0.365 |
+ 0.293 |
+ 0.297 |
+ 0.303 |
+ 0.309 |
+ 0.310 |
+ 0.356 |
+ 0.361 |
+ 0.315 |
+ 0.271 |
+ 0.387 |
+ 0.428 |
+ 0.396 |
+ 0.091 |
+ 0.286 |
+ 0.296 |
+ 0.261 |
+ 0.355 |
+ 0.356 |
+ 0.359 |
+ 0.414 |
+ 0.429 |
+ 0.426 |
+ 0.273 |
+ 0.028 |
+ 0.389 |
+ 0.340 |
+ 0.335 |
+ 0.387 |
+ 0.364 |
+ 0.193 |
+ 0.430 |
+ 0.443 |
+ 0.438 |
+ 0.453 |
+ 0.450 |
+ 0.442 |
+ 0.450 |
+ 0.435 |
+ 0.442 |
+ 0.466 |
+ 0.440 |
+ 0.321 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Firnberg_2014 |
+ 0.348 |
+ 0.564 |
+ 0.589 |
+ 0.604 |
+ 0.549 |
+ 0.573 |
+ 0.104 |
+ 0.365 |
+ 0.580 |
+ 0.608 |
+ 0.579 |
+ 0.530 |
+ 0.575 |
+ 0.284 |
+ 0.413 |
+ 0.514 |
+ 0.620 |
+ 0.429 |
+ 0.300 |
+ 0.523 |
+ 0.420 |
+ 0.408 |
+ 0.381 |
+ 0.385 |
+ 0.451 |
+ 0.500 |
+ 0.506 |
+ 0.418 |
+ 0.335 |
+ 0.555 |
+ 0.614 |
+ 0.630 |
+ 0.139 |
+ 0.410 |
+ 0.366 |
+ 0.323 |
+ 0.480 |
+ 0.464 |
+ 0.462 |
+ 0.572 |
+ 0.574 |
+ 0.576 |
+ 0.397 |
+ 0.032 |
+ 0.571 |
+ 0.469 |
+ 0.471 |
+ 0.572 |
+ 0.529 |
+ 0.244 |
+ 0.599 |
+ 0.591 |
+ 0.590 |
+ 0.617 |
+ 0.611 |
+ 0.612 |
+ 0.616 |
+ 0.584 |
+ 0.603 |
+ 0.632 |
+ 0.605 |
+ 0.448 |
+ 4783 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Jacquier_2013 |
+ 0.357 |
+ 0.551 |
+ 0.551 |
+ 0.587 |
+ 0.567 |
+ 0.579 |
+ 0.070 |
+ 0.345 |
+ 0.571 |
+ 0.587 |
+ 0.523 |
+ 0.523 |
+ 0.535 |
+ 0.240 |
+ 0.393 |
+ 0.499 |
+ 0.579 |
+ 0.466 |
+ 0.369 |
+ 0.515 |
+ 0.422 |
+ 0.393 |
+ 0.377 |
+ 0.393 |
+ 0.426 |
+ 0.527 |
+ 0.470 |
+ 0.418 |
+ 0.345 |
+ 0.446 |
+ 0.575 |
+ 0.571 |
+ 0.114 |
+ 0.385 |
+ 0.397 |
+ 0.381 |
+ 0.470 |
+ 0.494 |
+ 0.499 |
+ 0.563 |
+ 0.583 |
+ 0.596 |
+ 0.341 |
+ 0.058 |
+ 0.519 |
+ 0.430 |
+ 0.410 |
+ 0.519 |
+ 0.499 |
+ 0.236 |
+ 0.547 |
+ 0.519 |
+ 0.539 |
+ 0.559 |
+ 0.555 |
+ 0.535 |
+ 0.567 |
+ 0.547 |
+ 0.519 |
+ 0.559 |
+ 0.559 |
+ 0.438 |
+ 989 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Stiffler_2015 |
+ 0.350 |
+ 0.564 |
+ 0.601 |
+ 0.614 |
+ 0.551 |
+ 0.586 |
+ 0.102 |
+ 0.363 |
+ 0.594 |
+ 0.620 |
+ 0.591 |
+ 0.549 |
+ 0.595 |
+ 0.298 |
+ 0.424 |
+ 0.526 |
+ 0.632 |
+ 0.436 |
+ 0.306 |
+ 0.528 |
+ 0.433 |
+ 0.419 |
+ 0.374 |
+ 0.392 |
+ 0.466 |
+ 0.511 |
+ 0.526 |
+ 0.427 |
+ 0.339 |
+ 0.565 |
+ 0.627 |
+ 0.638 |
+ 0.137 |
+ 0.432 |
+ 0.369 |
+ 0.326 |
+ 0.490 |
+ 0.462 |
+ 0.465 |
+ 0.585 |
+ 0.580 |
+ 0.586 |
+ 0.411 |
+ 0.032 |
+ 0.581 |
+ 0.483 |
+ 0.480 |
+ 0.576 |
+ 0.530 |
+ 0.247 |
+ 0.600 |
+ 0.582 |
+ 0.594 |
+ 0.608 |
+ 0.608 |
+ 0.611 |
+ 0.610 |
+ 0.591 |
+ 0.604 |
+ 0.630 |
+ 0.616 |
+ 0.462 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BRCA1_HUMAN_Findlay_2018 |
+ 0.436 |
+ 0.298 |
+ 0.423 |
+ 0.426 |
+ 0.403 |
+ 0.408 |
+ 0.048 |
+ 0.227 |
+ 0.334 |
+ 0.331 |
+ 0.413 |
+ 0.299 |
+ 0.341 |
+ 0.077 |
+ 0.286 |
+ 0.379 |
+ 0.413 |
+ 0.426 |
+ 0.361 |
+ 0.237 |
+ 0.065 |
+ 0.321 |
+ 0.341 |
+ 0.349 |
+ 0.239 |
+ 0.403 |
+ 0.391 |
+ 0.379 |
+ 0.398 |
+ 0.399 |
+ 0.436 |
+ 0.416 |
+ 0.212 |
+ 0.090 |
+ 0.127 |
+ 0.371 |
+ 0.406 |
+ 0.406 |
+ 0.436 |
+ 0.441 |
+ 0.431 |
+ 0.448 |
+ 0.142 |
+ 0.068 |
+ 0.446 |
+ 0.361 |
+ 0.369 |
+ 0.391 |
+ 0.044 |
+ 0.122 |
+ 0.431 |
+ 0.423 |
+ 0.428 |
+ 0.411 |
+ 0.421 |
+ 0.423 |
+ 0.428 |
+ 0.413 |
+ 0.408 |
+ 0.423 |
+ 0.436 |
+ 0.398 |
+ 1837 |
+ OrganismalFitness |
+ BRCA1_HUMAN |
+ Low |
+ Human |
+
+
+ BRCA2_HUMAN_Erwood_2022_HEK293T |
+ 0.312 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.068 |
+ 0.283 |
+ 0.122 |
+ -0.012 |
+ 0.309 |
+ 0.095 |
+ 0.015 |
+ 0.015 |
+ 0.015 |
+ 0.283 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.041 |
+ 0.015 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.283 |
+ 0.309 |
+ 0.309 |
+ 0.068 |
+ 0.309 |
+ 0.256 |
+ 0.285 |
+ -0.012 |
+ 0.041 |
+ 0.068 |
+ 0.068 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.015 |
+ 0.015 |
+ 0.309 |
+ -0.012 |
+ -0.039 |
+ 0.015 |
+ -0.200 |
+ 0.175 |
+ 0.256 |
+ 0.283 |
+ 0.283 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ 0.309 |
+ -0.039 |
+ 0.015 |
+ 265 |
+ OrganismalFitness |
+ BRCA2_HUMAN |
+ NaN |
+ Human |
+
+
+ C6KNH7_9INFA_Lee_2018 |
+ 0.297 |
+ 0.268 |
+ 0.258 |
+ 0.264 |
+ 0.308 |
+ 0.308 |
+ -0.011 |
+ 0.333 |
+ 0.267 |
+ 0.272 |
+ 0.024 |
+ 0.320 |
+ 0.370 |
+ -0.002 |
+ -0.007 |
+ -0.003 |
+ 0.381 |
+ 0.293 |
+ 0.298 |
+ 0.245 |
+ 0.288 |
+ 0.269 |
+ 0.275 |
+ 0.261 |
+ 0.168 |
+ 0.341 |
+ 0.323 |
+ 0.357 |
+ 0.281 |
+ 0.345 |
+ 0.333 |
+ 0.251 |
+ 0.088 |
+ 0.269 |
+ 0.276 |
+ 0.299 |
+ 0.312 |
+ 0.317 |
+ 0.318 |
+ 0.332 |
+ 0.335 |
+ 0.325 |
+ -0.009 |
+ -0.016 |
+ 0.204 |
+ 0.014 |
+ 0.393 |
+ 0.422 |
+ 0.426 |
+ 0.172 |
+ 0.387 |
+ 0.396 |
+ 0.410 |
+ 0.411 |
+ 0.417 |
+ 0.399 |
+ 0.417 |
+ 0.416 |
+ 0.409 |
+ 0.416 |
+ 0.235 |
+ 0.134 |
+ 10754 |
+ OrganismalFitness |
+ C6KNH7_9INFA |
+ Medium |
+ Virus |
+
+
+ CALM1_HUMAN_Weile_2017 |
+ 0.135 |
+ 0.201 |
+ 0.197 |
+ 0.204 |
+ 0.186 |
+ 0.192 |
+ 0.124 |
+ 0.135 |
+ 0.197 |
+ 0.221 |
+ 0.232 |
+ 0.190 |
+ 0.234 |
+ 0.115 |
+ 0.153 |
+ 0.153 |
+ 0.166 |
+ 0.166 |
+ 0.168 |
+ 0.188 |
+ 0.131 |
+ 0.184 |
+ 0.215 |
+ 0.210 |
+ 0.188 |
+ 0.243 |
+ 0.239 |
+ 0.250 |
+ 0.276 |
+ 0.206 |
+ 0.166 |
+ 0.098 |
+ 0.062 |
+ 0.173 |
+ 0.226 |
+ 0.270 |
+ 0.175 |
+ 0.197 |
+ 0.226 |
+ 0.204 |
+ 0.215 |
+ 0.204 |
+ 0.164 |
+ 0.115 |
+ 0.256 |
+ 0.212 |
+ 0.060 |
+ 0.124 |
+ 0.137 |
+ 0.067 |
+ 0.124 |
+ 0.144 |
+ 0.151 |
+ 0.155 |
+ 0.153 |
+ 0.140 |
+ 0.142 |
+ 0.133 |
+ 0.137 |
+ 0.151 |
+ 0.252 |
+ 0.192 |
+ 1813 |
+ OrganismalFitness |
+ CALM1_HUMAN |
+ High |
+ Human |
+
+
+ CAPSD_AAV2S_Sinai_2021 |
+ 0.234 |
+ 0.195 |
+ 0.188 |
+ 0.211 |
+ 0.194 |
+ 0.188 |
+ 0.285 |
+ 0.285 |
+ 0.179 |
+ 0.202 |
+ 0.122 |
+ 0.146 |
+ 0.148 |
+ 0.201 |
+ 0.217 |
+ 0.144 |
+ 0.201 |
+ 0.111 |
+ 0.044 |
+ 0.163 |
+ 0.145 |
+ 0.198 |
+ 0.206 |
+ 0.202 |
+ 0.159 |
+ 0.149 |
+ 0.188 |
+ 0.152 |
+ 0.300 |
+ 0.274 |
+ 0.118 |
+ 0.110 |
+ 0.102 |
+ 0.149 |
+ 0.203 |
+ 0.360 |
+ 0.213 |
+ 0.223 |
+ 0.318 |
+ 0.194 |
+ 0.197 |
+ 0.263 |
+ 0.092 |
+ 0.136 |
+ 0.210 |
+ 0.096 |
+ 0.313 |
+ 0.301 |
+ 0.240 |
+ 0.221 |
+ 0.151 |
+ 0.148 |
+ 0.131 |
+ 0.151 |
+ 0.139 |
+ 0.139 |
+ 0.143 |
+ 0.139 |
+ 0.150 |
+ 0.145 |
+ 0.214 |
+ 0.167 |
+ 42328 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAR11_HUMAN_Meitlis_2020_gof |
+ -0.010 |
+ -0.026 |
+ -0.027 |
+ -0.030 |
+ -0.055 |
+ -0.062 |
+ 0.090 |
+ -0.095 |
+ 0.025 |
+ 0.027 |
+ 0.012 |
+ 0.082 |
+ 0.035 |
+ 0.047 |
+ 0.050 |
+ 0.145 |
+ 0.057 |
+ 0.065 |
+ 0.100 |
+ 0.045 |
+ 0.075 |
+ -0.042 |
+ 0.012 |
+ 0.005 |
+ -0.140 |
+ -0.052 |
+ -0.035 |
+ -0.032 |
+ 0.020 |
+ -0.019 |
+ 0.022 |
+ 0.012 |
+ 0.097 |
+ 0.042 |
+ -0.042 |
+ -0.090 |
+ 0.032 |
+ -0.022 |
+ -0.070 |
+ -0.020 |
+ -0.062 |
+ -0.092 |
+ 0.060 |
+ 0.015 |
+ 0.002 |
+ 0.125 |
+ 0.102 |
+ 0.015 |
+ 0.070 |
+ 0.040 |
+ 0.105 |
+ 0.095 |
+ 0.090 |
+ 0.080 |
+ 0.085 |
+ 0.075 |
+ 0.072 |
+ 0.090 |
+ 0.095 |
+ 0.095 |
+ 0.137 |
+ 0.130 |
+ 2374 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAR11_HUMAN_Meitlis_2020_lof |
+ 0.365 |
+ 0.311 |
+ 0.299 |
+ 0.284 |
+ 0.294 |
+ 0.299 |
+ 0.075 |
+ 0.011 |
+ 0.199 |
+ 0.189 |
+ 0.361 |
+ 0.410 |
+ 0.395 |
+ 0.020 |
+ 0.085 |
+ 0.381 |
+ 0.426 |
+ 0.436 |
+ 0.443 |
+ 0.128 |
+ 0.123 |
+ 0.269 |
+ 0.246 |
+ 0.210 |
+ 0.155 |
+ 0.256 |
+ 0.241 |
+ 0.296 |
+ 0.113 |
+ 0.314 |
+ 0.363 |
+ 0.325 |
+ 0.277 |
+ -0.009 |
+ 0.224 |
+ 0.205 |
+ 0.313 |
+ 0.267 |
+ 0.237 |
+ 0.329 |
+ 0.276 |
+ 0.252 |
+ 0.072 |
+ 0.026 |
+ 0.334 |
+ 0.241 |
+ 0.353 |
+ 0.375 |
+ 0.348 |
+ 0.108 |
+ 0.443 |
+ 0.431 |
+ 0.431 |
+ 0.443 |
+ 0.428 |
+ 0.443 |
+ 0.420 |
+ 0.436 |
+ 0.453 |
+ 0.440 |
+ 0.465 |
+ 0.323 |
+ 2395 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAS9_STRP1_Spencer_2017_positive |
+ 0.107 |
+ 0.141 |
+ 0.123 |
+ 0.126 |
+ 0.135 |
+ 0.139 |
+ 0.014 |
+ 0.079 |
+ 0.131 |
+ 0.140 |
+ 0.123 |
+ 0.046 |
+ 0.039 |
+ 0.027 |
+ 0.056 |
+ 0.101 |
+ 0.133 |
+ 0.137 |
+ 0.134 |
+ 0.010 |
+ 0.027 |
+ 0.048 |
+ 0.116 |
+ 0.091 |
+ 0.026 |
+ 0.144 |
+ 0.138 |
+ 0.133 |
+ 0.048 |
+ 0.128 |
+ 0.145 |
+ 0.129 |
+ 0.016 |
+ 0.022 |
+ 0.033 |
+ 0.120 |
+ 0.132 |
+ 0.134 |
+ 0.132 |
+ 0.129 |
+ 0.131 |
+ 0.139 |
+ 0.039 |
+ 0.011 |
+ 0.129 |
+ 0.074 |
+ 0.074 |
+ 0.129 |
+ 0.015 |
+ 0.020 |
+ 0.123 |
+ 0.123 |
+ 0.121 |
+ 0.125 |
+ 0.126 |
+ 0.130 |
+ 0.127 |
+ 0.128 |
+ 0.121 |
+ 0.128 |
+ 0.119 |
+ 0.077 |
+ 8117 |
+ Activity |
+ CAS9_STRP1 |
+ Medium |
+ Prokaryote |
+
+
+ CASP3_HUMAN_Roychowdhury_2020 |
+ 0.314 |
+ 0.447 |
+ 0.508 |
+ 0.503 |
+ 0.518 |
+ 0.523 |
+ 0.053 |
+ 0.370 |
+ 0.518 |
+ 0.518 |
+ 0.464 |
+ 0.477 |
+ 0.487 |
+ 0.186 |
+ 0.441 |
+ 0.518 |
+ 0.521 |
+ 0.452 |
+ 0.380 |
+ 0.480 |
+ 0.132 |
+ 0.378 |
+ 0.424 |
+ 0.406 |
+ 0.383 |
+ 0.441 |
+ 0.447 |
+ 0.393 |
+ 0.441 |
+ 0.490 |
+ 0.467 |
+ 0.401 |
+ 0.181 |
+ 0.048 |
+ 0.355 |
+ 0.388 |
+ 0.390 |
+ 0.429 |
+ 0.462 |
+ 0.505 |
+ 0.490 |
+ 0.526 |
+ 0.270 |
+ -0.029 |
+ 0.480 |
+ 0.424 |
+ 0.316 |
+ 0.429 |
+ 0.395 |
+ 0.188 |
+ 0.459 |
+ 0.457 |
+ 0.459 |
+ 0.462 |
+ 0.482 |
+ 0.470 |
+ 0.467 |
+ 0.447 |
+ 0.457 |
+ 0.487 |
+ 0.531 |
+ 0.372 |
+ 1567 |
+ Activity |
+ CASP3_HUMAN |
+ High |
+ Human |
+
+
+ CASP7_HUMAN_Roychowdhury_2020 |
+ 0.335 |
+ 0.461 |
+ 0.509 |
+ 0.521 |
+ 0.502 |
+ 0.502 |
+ 0.051 |
+ 0.392 |
+ 0.487 |
+ 0.533 |
+ 0.487 |
+ 0.502 |
+ 0.530 |
+ 0.225 |
+ 0.516 |
+ 0.518 |
+ 0.535 |
+ 0.499 |
+ 0.478 |
+ 0.504 |
+ 0.180 |
+ 0.459 |
+ 0.461 |
+ 0.464 |
+ 0.475 |
+ 0.497 |
+ 0.511 |
+ 0.485 |
+ 0.480 |
+ 0.549 |
+ 0.511 |
+ 0.459 |
+ 0.299 |
+ 0.101 |
+ 0.487 |
+ 0.425 |
+ 0.442 |
+ 0.506 |
+ 0.509 |
+ 0.497 |
+ 0.549 |
+ 0.533 |
+ 0.371 |
+ 0.023 |
+ 0.549 |
+ 0.514 |
+ 0.397 |
+ 0.523 |
+ 0.521 |
+ 0.249 |
+ 0.514 |
+ 0.502 |
+ 0.471 |
+ 0.492 |
+ 0.518 |
+ 0.521 |
+ 0.495 |
+ 0.495 |
+ 0.514 |
+ 0.511 |
+ 0.556 |
+ 0.433 |
+ 1680 |
+ Activity |
+ CASP7_HUMAN |
+ Medium |
+ Human |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI |
+ 0.470 |
+ 0.473 |
+ 0.567 |
+ 0.563 |
+ 0.580 |
+ 0.578 |
+ 0.607 |
+ 0.487 |
+ 0.449 |
+ 0.422 |
+ 0.498 |
+ 0.538 |
+ 0.563 |
+ 0.481 |
+ 0.594 |
+ 0.586 |
+ 0.557 |
+ 0.508 |
+ 0.573 |
+ 0.088 |
+ 0.506 |
+ 0.515 |
+ 0.510 |
+ 0.508 |
+ 0.540 |
+ 0.519 |
+ 0.498 |
+ 0.460 |
+ 0.477 |
+ 0.576 |
+ 0.536 |
+ 0.544 |
+ 0.357 |
+ 0.527 |
+ 0.502 |
+ 0.529 |
+ 0.569 |
+ 0.542 |
+ 0.561 |
+ 0.580 |
+ 0.563 |
+ 0.576 |
+ 0.395 |
+ 0.380 |
+ 0.294 |
+ 0.424 |
+ 0.393 |
+ 0.386 |
+ 0.576 |
+ 0.439 |
+ 0.565 |
+ 0.544 |
+ 0.578 |
+ 0.557 |
+ 0.584 |
+ 0.563 |
+ 0.557 |
+ 0.536 |
+ 0.555 |
+ 0.559 |
+ 0.512 |
+ 0.565 |
+ 1903 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X |
+ 0.555 |
+ 0.605 |
+ 0.633 |
+ 0.634 |
+ 0.638 |
+ 0.650 |
+ 0.375 |
+ 0.569 |
+ 0.673 |
+ 0.685 |
+ 0.648 |
+ 0.615 |
+ 0.611 |
+ 0.404 |
+ 0.561 |
+ 0.642 |
+ 0.636 |
+ 0.633 |
+ 0.607 |
+ 0.673 |
+ 0.400 |
+ 0.406 |
+ 0.592 |
+ 0.571 |
+ 0.449 |
+ 0.573 |
+ 0.582 |
+ 0.557 |
+ 0.611 |
+ 0.667 |
+ 0.658 |
+ 0.595 |
+ 0.019 |
+ 0.404 |
+ 0.513 |
+ 0.611 |
+ 0.576 |
+ 0.603 |
+ 0.623 |
+ 0.650 |
+ 0.640 |
+ 0.650 |
+ 0.397 |
+ 0.279 |
+ 0.513 |
+ 0.522 |
+ 0.677 |
+ 0.625 |
+ 0.805 |
+ 0.710 |
+ 0.698 |
+ 0.725 |
+ 0.692 |
+ 0.710 |
+ 0.737 |
+ 0.714 |
+ 0.712 |
+ 0.710 |
+ 0.706 |
+ 0.725 |
+ 0.631 |
+ 0.776 |
+ 2068 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBS_HUMAN_Sun_2020 |
+ 0.297 |
+ 0.306 |
+ 0.314 |
+ 0.339 |
+ 0.319 |
+ 0.331 |
+ 0.162 |
+ 0.205 |
+ 0.330 |
+ 0.341 |
+ 0.304 |
+ 0.305 |
+ 0.329 |
+ 0.070 |
+ 0.177 |
+ 0.281 |
+ 0.299 |
+ 0.285 |
+ 0.283 |
+ 0.309 |
+ 0.315 |
+ 0.238 |
+ 0.261 |
+ 0.262 |
+ 0.319 |
+ 0.254 |
+ 0.262 |
+ 0.248 |
+ 0.273 |
+ 0.322 |
+ 0.316 |
+ 0.299 |
+ 0.179 |
+ 0.332 |
+ 0.268 |
+ 0.233 |
+ 0.329 |
+ 0.289 |
+ 0.286 |
+ 0.349 |
+ 0.317 |
+ 0.315 |
+ 0.243 |
+ 0.062 |
+ 0.326 |
+ 0.324 |
+ 0.210 |
+ 0.236 |
+ 0.277 |
+ 0.058 |
+ 0.277 |
+ 0.277 |
+ 0.283 |
+ 0.297 |
+ 0.292 |
+ 0.289 |
+ 0.281 |
+ 0.284 |
+ 0.290 |
+ 0.293 |
+ 0.336 |
+ 0.238 |
+ 7217 |
+ OrganismalFitness |
+ CBS_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28 |
+ 0.389 |
+ 0.450 |
+ 0.493 |
+ 0.505 |
+ 0.463 |
+ 0.507 |
+ -0.205 |
+ 0.454 |
+ 0.482 |
+ 0.468 |
+ 0.480 |
+ 0.482 |
+ 0.470 |
+ -0.210 |
+ 0.545 |
+ 0.514 |
+ 0.503 |
+ 0.428 |
+ 0.442 |
+ 0.500 |
+ 0.312 |
+ 0.410 |
+ 0.377 |
+ 0.394 |
+ 0.372 |
+ 0.361 |
+ 0.366 |
+ 0.310 |
+ 0.394 |
+ 0.489 |
+ 0.507 |
+ 0.502 |
+ 0.349 |
+ 0.277 |
+ 0.305 |
+ 0.310 |
+ 0.415 |
+ 0.410 |
+ 0.396 |
+ 0.496 |
+ 0.517 |
+ 0.503 |
+ 0.465 |
+ -0.261 |
+ 0.249 |
+ 0.435 |
+ -0.002 |
+ 0.067 |
+ 0.452 |
+ 0.421 |
+ 0.528 |
+ 0.458 |
+ 0.535 |
+ 0.526 |
+ 0.545 |
+ 0.515 |
+ 0.519 |
+ 0.552 |
+ 0.519 |
+ 0.526 |
+ 0.493 |
+ 0.615 |
+ 2282 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CCDB_ECOLI_Adkar_2012 |
+ 0.298 |
+ 0.349 |
+ 0.389 |
+ 0.413 |
+ 0.389 |
+ 0.413 |
+ -0.019 |
+ 0.165 |
+ 0.315 |
+ 0.308 |
+ 0.379 |
+ 0.304 |
+ 0.386 |
+ 0.015 |
+ -0.012 |
+ 0.349 |
+ 0.400 |
+ 0.369 |
+ 0.213 |
+ 0.393 |
+ 0.015 |
+ 0.012 |
+ -0.080 |
+ 0.094 |
+ -0.145 |
+ 0.077 |
+ 0.036 |
+ -0.005 |
+ 0.328 |
+ 0.410 |
+ 0.495 |
+ 0.481 |
+ 0.117 |
+ -0.022 |
+ 0.005 |
+ 0.253 |
+ 0.383 |
+ 0.345 |
+ 0.366 |
+ 0.403 |
+ 0.366 |
+ 0.396 |
+ 0.036 |
+ -0.009 |
+ 0.379 |
+ 0.015 |
+ 0.250 |
+ 0.349 |
+ 0.270 |
+ 0.219 |
+ 0.379 |
+ 0.321 |
+ 0.349 |
+ 0.366 |
+ 0.342 |
+ 0.359 |
+ 0.352 |
+ 0.379 |
+ 0.406 |
+ 0.383 |
+ 0.359 |
+ 0.138 |
+ 1176 |
+ Activity |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCDB_ECOLI_Tripathi_2016 |
+ 0.326 |
+ 0.384 |
+ 0.414 |
+ 0.436 |
+ 0.402 |
+ 0.414 |
+ 0.026 |
+ 0.217 |
+ 0.314 |
+ 0.341 |
+ 0.393 |
+ 0.357 |
+ 0.378 |
+ 0.023 |
+ 0.029 |
+ 0.357 |
+ 0.393 |
+ 0.423 |
+ 0.311 |
+ 0.411 |
+ 0.038 |
+ 0.032 |
+ -0.044 |
+ 0.153 |
+ -0.074 |
+ 0.093 |
+ 0.011 |
+ 0.059 |
+ 0.384 |
+ 0.387 |
+ 0.448 |
+ 0.439 |
+ 0.069 |
+ 0.020 |
+ 0.105 |
+ 0.332 |
+ 0.384 |
+ 0.372 |
+ 0.417 |
+ 0.393 |
+ 0.393 |
+ 0.414 |
+ 0.026 |
+ -0.022 |
+ 0.399 |
+ 0.020 |
+ 0.254 |
+ 0.341 |
+ 0.251 |
+ 0.196 |
+ 0.396 |
+ 0.332 |
+ 0.360 |
+ 0.375 |
+ 0.357 |
+ 0.363 |
+ 0.357 |
+ 0.360 |
+ 0.381 |
+ 0.372 |
+ 0.372 |
+ 0.241 |
+ 1663 |
+ OrganismalFitness |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCR5_HUMAN_Gill_2023 |
+ 0.208 |
+ 0.215 |
+ 0.206 |
+ 0.203 |
+ 0.203 |
+ 0.210 |
+ 0.221 |
+ 0.227 |
+ 0.239 |
+ 0.263 |
+ 0.268 |
+ 0.258 |
+ 0.268 |
+ 0.228 |
+ 0.255 |
+ 0.263 |
+ 0.256 |
+ 0.241 |
+ 0.258 |
+ 0.268 |
+ 0.271 |
+ 0.257 |
+ 0.232 |
+ 0.230 |
+ 0.271 |
+ 0.255 |
+ 0.255 |
+ 0.273 |
+ 0.233 |
+ 0.259 |
+ 0.216 |
+ 0.182 |
+ 0.086 |
+ 0.272 |
+ 0.268 |
+ 0.264 |
+ 0.261 |
+ 0.270 |
+ 0.280 |
+ 0.234 |
+ 0.233 |
+ 0.227 |
+ 0.267 |
+ 0.121 |
+ 0.268 |
+ 0.264 |
+ 0.190 |
+ 0.230 |
+ 0.203 |
+ 0.112 |
+ 0.246 |
+ 0.234 |
+ 0.246 |
+ 0.250 |
+ 0.245 |
+ 0.240 |
+ 0.252 |
+ 0.254 |
+ 0.256 |
+ 0.256 |
+ 0.271 |
+ 0.260 |
+ 6137 |
+ Binding |
+ CCR5_HUMAN |
+ High |
+ Human |
+
+
+ CD19_HUMAN_Klesmith_2019_FMC_singles |
+ 0.144 |
+ 0.152 |
+ 0.146 |
+ 0.146 |
+ 0.163 |
+ 0.153 |
+ 0.062 |
+ 0.081 |
+ 0.126 |
+ 0.131 |
+ 0.090 |
+ 0.110 |
+ 0.123 |
+ 0.102 |
+ 0.130 |
+ 0.109 |
+ 0.099 |
+ 0.126 |
+ 0.099 |
+ -0.018 |
+ 0.103 |
+ 0.109 |
+ 0.140 |
+ 0.126 |
+ 0.094 |
+ 0.159 |
+ 0.131 |
+ 0.095 |
+ 0.155 |
+ 0.168 |
+ 0.099 |
+ 0.073 |
+ 0.078 |
+ 0.119 |
+ 0.147 |
+ 0.126 |
+ 0.136 |
+ 0.135 |
+ 0.132 |
+ 0.164 |
+ 0.161 |
+ 0.156 |
+ 0.110 |
+ 0.070 |
+ 0.094 |
+ 0.110 |
+ 0.264 |
+ 0.209 |
+ 0.252 |
+ 0.132 |
+ 0.164 |
+ 0.182 |
+ 0.169 |
+ 0.193 |
+ 0.176 |
+ 0.169 |
+ 0.179 |
+ 0.176 |
+ 0.182 |
+ 0.193 |
+ 0.305 |
+ 0.260 |
+ 3761 |
+ Binding |
+ CD19_HUMAN |
+ Low |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_abundance |
+ 0.384 |
+ 0.442 |
+ 0.455 |
+ 0.465 |
+ 0.446 |
+ 0.469 |
+ 0.396 |
+ 0.441 |
+ 0.446 |
+ 0.465 |
+ 0.380 |
+ 0.453 |
+ 0.472 |
+ 0.410 |
+ 0.475 |
+ 0.477 |
+ 0.498 |
+ 0.488 |
+ 0.447 |
+ 0.483 |
+ 0.449 |
+ 0.445 |
+ 0.464 |
+ 0.425 |
+ 0.447 |
+ 0.437 |
+ 0.441 |
+ 0.449 |
+ 0.423 |
+ 0.460 |
+ 0.446 |
+ 0.399 |
+ 0.175 |
+ 0.451 |
+ 0.438 |
+ 0.422 |
+ 0.478 |
+ 0.464 |
+ 0.463 |
+ 0.483 |
+ 0.484 |
+ 0.486 |
+ 0.452 |
+ 0.050 |
+ 0.383 |
+ 0.473 |
+ 0.435 |
+ 0.408 |
+ 0.473 |
+ 0.159 |
+ 0.479 |
+ 0.470 |
+ 0.483 |
+ 0.505 |
+ 0.493 |
+ 0.498 |
+ 0.490 |
+ 0.479 |
+ 0.486 |
+ 0.505 |
+ 0.484 |
+ 0.478 |
+ 6370 |
+ Expression |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_activity |
+ 0.371 |
+ 0.438 |
+ 0.446 |
+ 0.459 |
+ 0.434 |
+ 0.463 |
+ 0.437 |
+ 0.456 |
+ 0.452 |
+ 0.480 |
+ 0.367 |
+ 0.479 |
+ 0.497 |
+ 0.414 |
+ 0.503 |
+ 0.511 |
+ 0.513 |
+ 0.500 |
+ 0.463 |
+ 0.495 |
+ 0.445 |
+ 0.455 |
+ 0.461 |
+ 0.437 |
+ 0.450 |
+ 0.439 |
+ 0.442 |
+ 0.460 |
+ 0.439 |
+ 0.471 |
+ 0.448 |
+ 0.402 |
+ 0.181 |
+ 0.484 |
+ 0.440 |
+ 0.426 |
+ 0.500 |
+ 0.489 |
+ 0.484 |
+ 0.495 |
+ 0.484 |
+ 0.482 |
+ 0.480 |
+ 0.045 |
+ 0.408 |
+ 0.503 |
+ 0.465 |
+ 0.440 |
+ 0.505 |
+ 0.186 |
+ 0.483 |
+ 0.489 |
+ 0.499 |
+ 0.505 |
+ 0.513 |
+ 0.515 |
+ 0.502 |
+ 0.493 |
+ 0.504 |
+ 0.514 |
+ 0.508 |
+ 0.504 |
+ 6142 |
+ Binding |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM |
+ 0.255 |
+ 0.474 |
+ 0.392 |
+ 0.409 |
+ 0.356 |
+ 0.369 |
+ 0.213 |
+ 0.367 |
+ 0.421 |
+ 0.434 |
+ 0.455 |
+ 0.455 |
+ 0.502 |
+ 0.305 |
+ 0.494 |
+ 0.418 |
+ 0.365 |
+ 0.436 |
+ 0.429 |
+ 0.378 |
+ 0.328 |
+ 0.375 |
+ 0.455 |
+ 0.465 |
+ 0.342 |
+ 0.464 |
+ 0.459 |
+ 0.453 |
+ 0.442 |
+ 0.448 |
+ 0.428 |
+ 0.419 |
+ 0.259 |
+ 0.274 |
+ 0.364 |
+ 0.433 |
+ 0.347 |
+ 0.359 |
+ 0.421 |
+ 0.417 |
+ 0.421 |
+ 0.457 |
+ 0.233 |
+ 0.167 |
+ 0.306 |
+ 0.329 |
+ 0.345 |
+ 0.329 |
+ 0.426 |
+ 0.467 |
+ 0.468 |
+ 0.459 |
+ 0.445 |
+ 0.471 |
+ 0.479 |
+ 0.471 |
+ 0.470 |
+ 0.476 |
+ 0.438 |
+ 0.478 |
+ 0.456 |
+ 0.482 |
+ 3295 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX |
+ 0.252 |
+ 0.308 |
+ 0.277 |
+ 0.282 |
+ 0.310 |
+ 0.292 |
+ 0.072 |
+ 0.320 |
+ 0.366 |
+ 0.376 |
+ 0.368 |
+ 0.297 |
+ 0.265 |
+ 0.105 |
+ 0.320 |
+ 0.361 |
+ 0.434 |
+ 0.361 |
+ 0.373 |
+ 0.257 |
+ 0.115 |
+ 0.141 |
+ 0.158 |
+ 0.062 |
+ 0.085 |
+ 0.153 |
+ 0.148 |
+ 0.148 |
+ 0.232 |
+ 0.295 |
+ 0.305 |
+ 0.271 |
+ 0.189 |
+ 0.113 |
+ 0.110 |
+ 0.135 |
+ 0.257 |
+ 0.247 |
+ 0.247 |
+ 0.315 |
+ 0.315 |
+ 0.315 |
+ 0.108 |
+ 0.032 |
+ 0.310 |
+ 0.151 |
+ 0.396 |
+ 0.381 |
+ 0.454 |
+ 0.346 |
+ 0.404 |
+ 0.373 |
+ 0.381 |
+ 0.409 |
+ 0.404 |
+ 0.419 |
+ 0.416 |
+ 0.416 |
+ 0.411 |
+ 0.404 |
+ 0.449 |
+ 0.404 |
+ 1580 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ D7PM05_CLYGR_Somermeyer_2022 |
+ 0.426 |
+ 0.500 |
+ 0.500 |
+ 0.465 |
+ 0.506 |
+ 0.503 |
+ 0.049 |
+ 0.378 |
+ 0.538 |
+ 0.543 |
+ 0.353 |
+ 0.048 |
+ 0.042 |
+ 0.045 |
+ 0.032 |
+ 0.022 |
+ 0.032 |
+ 0.046 |
+ 0.109 |
+ 0.356 |
+ 0.032 |
+ 0.033 |
+ 0.082 |
+ 0.060 |
+ -0.003 |
+ 0.035 |
+ 0.050 |
+ 0.145 |
+ 0.281 |
+ 0.506 |
+ 0.444 |
+ 0.461 |
+ 0.024 |
+ 0.078 |
+ 0.094 |
+ 0.133 |
+ 0.418 |
+ 0.417 |
+ 0.415 |
+ 0.505 |
+ 0.504 |
+ 0.493 |
+ -0.002 |
+ 0.010 |
+ 0.001 |
+ 0.020 |
+ 0.209 |
+ 0.228 |
+ 0.389 |
+ 0.292 |
+ 0.386 |
+ 0.386 |
+ 0.384 |
+ 0.392 |
+ 0.381 |
+ 0.392 |
+ 0.383 |
+ 0.376 |
+ 0.377 |
+ 0.385 |
+ 0.235 |
+ 0.147 |
+ 24515 |
+ Activity |
+ D7PM05_CLYGR |
+ Low |
+ Eukaryote |
+
+
+ DLG4_HUMAN_Faure_2021 |
+ 0.611 |
+ 0.490 |
+ 0.509 |
+ 0.487 |
+ 0.498 |
+ 0.510 |
+ 0.652 |
+ 0.557 |
+ 0.444 |
+ 0.448 |
+ 0.435 |
+ 0.459 |
+ 0.497 |
+ 0.671 |
+ 0.688 |
+ 0.646 |
+ 0.473 |
+ 0.386 |
+ 0.368 |
+ 0.536 |
+ 0.436 |
+ 0.478 |
+ 0.465 |
+ 0.437 |
+ 0.487 |
+ 0.482 |
+ 0.453 |
+ 0.451 |
+ 0.380 |
+ 0.508 |
+ 0.545 |
+ 0.542 |
+ 0.394 |
+ 0.437 |
+ 0.549 |
+ 0.481 |
+ 0.533 |
+ 0.623 |
+ 0.576 |
+ 0.532 |
+ 0.555 |
+ 0.529 |
+ 0.502 |
+ 0.162 |
+ 0.314 |
+ 0.443 |
+ 0.548 |
+ 0.350 |
+ 0.561 |
+ 0.251 |
+ 0.412 |
+ 0.300 |
+ 0.458 |
+ 0.415 |
+ 0.434 |
+ 0.405 |
+ 0.436 |
+ 0.415 |
+ 0.424 |
+ 0.421 |
+ 0.417 |
+ 0.662 |
+ 6976 |
+ OrganismalFitness |
+ DLG4_HUMAN |
+ Low |
+ Human |
+
+
+ DLG4_RAT_McLaughlin_2012 |
+ 0.407 |
+ 0.404 |
+ 0.398 |
+ 0.423 |
+ 0.435 |
+ 0.447 |
+ 0.338 |
+ 0.359 |
+ 0.383 |
+ 0.401 |
+ 0.371 |
+ 0.453 |
+ 0.462 |
+ 0.334 |
+ 0.438 |
+ 0.447 |
+ 0.432 |
+ 0.398 |
+ 0.347 |
+ 0.350 |
+ 0.298 |
+ 0.310 |
+ 0.310 |
+ 0.292 |
+ 0.304 |
+ 0.383 |
+ 0.341 |
+ 0.301 |
+ 0.334 |
+ 0.371 |
+ 0.426 |
+ 0.417 |
+ 0.207 |
+ 0.286 |
+ 0.304 |
+ 0.252 |
+ 0.389 |
+ 0.392 |
+ 0.371 |
+ 0.438 |
+ 0.444 |
+ 0.441 |
+ 0.404 |
+ 0.049 |
+ 0.328 |
+ 0.407 |
+ 0.334 |
+ 0.258 |
+ 0.371 |
+ 0.119 |
+ 0.341 |
+ 0.310 |
+ 0.347 |
+ 0.331 |
+ 0.353 |
+ 0.338 |
+ 0.307 |
+ 0.347 |
+ 0.359 |
+ 0.362 |
+ 0.404 |
+ 0.420 |
+ 1576 |
+ Binding |
+ DLG4_RAT |
+ Low |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC |
+ 0.161 |
+ 0.117 |
+ 0.188 |
+ 0.173 |
+ 0.185 |
+ 0.208 |
+ -0.093 |
+ 0.268 |
+ 0.252 |
+ 0.248 |
+ -0.014 |
+ 0.038 |
+ 0.034 |
+ 0.006 |
+ 0.093 |
+ 0.089 |
+ 0.216 |
+ 0.212 |
+ 0.256 |
+ 0.177 |
+ -0.069 |
+ -0.002 |
+ -0.010 |
+ 0.034 |
+ 0.030 |
+ 0.058 |
+ -0.006 |
+ 0.030 |
+ 0.121 |
+ 0.204 |
+ 0.300 |
+ 0.272 |
+ 0.010 |
+ -0.022 |
+ -0.026 |
+ -0.014 |
+ 0.157 |
+ 0.157 |
+ 0.161 |
+ 0.200 |
+ 0.181 |
+ 0.173 |
+ 0.034 |
+ -0.077 |
+ 0.133 |
+ 0.042 |
+ 0.450 |
+ 0.438 |
+ 0.522 |
+ 0.423 |
+ 0.276 |
+ 0.232 |
+ 0.343 |
+ 0.331 |
+ 0.308 |
+ 0.315 |
+ 0.288 |
+ 0.268 |
+ 0.272 |
+ 0.308 |
+ 0.387 |
+ 0.335 |
+ 1008 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1 |
+ 0.724 |
+ 0.742 |
+ 0.684 |
+ 0.693 |
+ 0.724 |
+ 0.726 |
+ 0.633 |
+ 0.689 |
+ 0.716 |
+ 0.735 |
+ 0.723 |
+ 0.716 |
+ 0.749 |
+ 0.765 |
+ 0.777 |
+ 0.779 |
+ 0.758 |
+ 0.710 |
+ 0.763 |
+ 0.703 |
+ 0.597 |
+ 0.648 |
+ 0.680 |
+ 0.652 |
+ 0.684 |
+ 0.661 |
+ 0.689 |
+ 0.588 |
+ 0.707 |
+ 0.712 |
+ 0.670 |
+ 0.648 |
+ 0.581 |
+ 0.700 |
+ 0.673 |
+ 0.740 |
+ 0.760 |
+ 0.760 |
+ 0.784 |
+ 0.728 |
+ 0.737 |
+ 0.754 |
+ 0.532 |
+ 0.069 |
+ 0.309 |
+ 0.551 |
+ 0.502 |
+ 0.401 |
+ 0.714 |
+ 0.726 |
+ 0.721 |
+ 0.742 |
+ 0.751 |
+ 0.739 |
+ 0.758 |
+ 0.747 |
+ 0.777 |
+ 0.731 |
+ 0.762 |
+ 0.754 |
+ 0.712 |
+ 0.733 |
+ 2264 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y |
+ 0.321 |
+ 0.337 |
+ 0.321 |
+ 0.327 |
+ 0.360 |
+ 0.359 |
+ 0.046 |
+ 0.317 |
+ 0.268 |
+ 0.280 |
+ 0.309 |
+ 0.332 |
+ 0.354 |
+ 0.184 |
+ 0.337 |
+ 0.359 |
+ 0.350 |
+ 0.272 |
+ 0.335 |
+ 0.342 |
+ 0.241 |
+ 0.298 |
+ 0.262 |
+ 0.272 |
+ 0.247 |
+ 0.268 |
+ 0.214 |
+ 0.210 |
+ 0.208 |
+ 0.319 |
+ 0.261 |
+ 0.274 |
+ 0.153 |
+ 0.214 |
+ 0.249 |
+ 0.227 |
+ 0.372 |
+ 0.386 |
+ 0.352 |
+ 0.350 |
+ 0.370 |
+ 0.354 |
+ 0.224 |
+ -0.155 |
+ 0.321 |
+ 0.265 |
+ 0.231 |
+ 0.234 |
+ 0.286 |
+ 0.297 |
+ 0.287 |
+ 0.258 |
+ 0.265 |
+ 0.273 |
+ 0.268 |
+ 0.279 |
+ 0.283 |
+ 0.276 |
+ 0.278 |
+ 0.265 |
+ 0.365 |
+ 0.357 |
+ 2915 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ DYR_ECOLI_Nguyen_2023 |
+ 0.291 |
+ 0.420 |
+ 0.427 |
+ 0.420 |
+ 0.432 |
+ 0.434 |
+ 0.001 |
+ 0.361 |
+ 0.402 |
+ 0.407 |
+ 0.420 |
+ 0.405 |
+ 0.420 |
+ 0.056 |
+ 0.402 |
+ 0.412 |
+ 0.426 |
+ 0.429 |
+ 0.429 |
+ 0.410 |
+ 0.309 |
+ 0.370 |
+ 0.276 |
+ 0.323 |
+ 0.400 |
+ 0.412 |
+ 0.399 |
+ 0.417 |
+ 0.372 |
+ 0.420 |
+ 0.424 |
+ 0.415 |
+ 0.164 |
+ 0.395 |
+ 0.297 |
+ 0.313 |
+ 0.417 |
+ 0.387 |
+ 0.390 |
+ 0.436 |
+ 0.424 |
+ 0.420 |
+ 0.388 |
+ 0.006 |
+ 0.424 |
+ 0.397 |
+ 0.186 |
+ 0.360 |
+ 0.291 |
+ 0.078 |
+ 0.382 |
+ 0.370 |
+ 0.363 |
+ 0.387 |
+ 0.378 |
+ 0.392 |
+ 0.388 |
+ 0.383 |
+ 0.392 |
+ 0.392 |
+ 0.405 |
+ 0.319 |
+ 2916 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ High |
+ Prokaryote |
+
+
+ DYR_ECOLI_Thompson_2019 |
+ 0.336 |
+ 0.396 |
+ 0.391 |
+ 0.387 |
+ 0.406 |
+ 0.404 |
+ -0.048 |
+ 0.286 |
+ 0.396 |
+ 0.422 |
+ 0.404 |
+ 0.341 |
+ 0.358 |
+ 0.047 |
+ 0.292 |
+ 0.373 |
+ 0.398 |
+ 0.426 |
+ 0.422 |
+ 0.407 |
+ 0.235 |
+ 0.323 |
+ 0.240 |
+ 0.293 |
+ 0.314 |
+ 0.393 |
+ 0.409 |
+ 0.389 |
+ 0.363 |
+ 0.396 |
+ 0.389 |
+ 0.378 |
+ 0.159 |
+ 0.314 |
+ 0.297 |
+ 0.312 |
+ 0.350 |
+ 0.358 |
+ 0.373 |
+ 0.400 |
+ 0.420 |
+ 0.411 |
+ 0.292 |
+ -0.025 |
+ 0.369 |
+ 0.336 |
+ 0.200 |
+ 0.358 |
+ 0.270 |
+ 0.084 |
+ 0.363 |
+ 0.336 |
+ 0.334 |
+ 0.349 |
+ 0.367 |
+ 0.358 |
+ 0.376 |
+ 0.354 |
+ 0.385 |
+ 0.373 |
+ 0.407 |
+ 0.268 |
+ 2363 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ ENV_HV1B9_DuenasDecamp_2016 |
+ 0.298 |
+ 0.321 |
+ 0.286 |
+ 0.298 |
+ 0.357 |
+ 0.321 |
+ 0.004 |
+ 0.263 |
+ 0.286 |
+ 0.286 |
+ 0.310 |
+ 0.380 |
+ 0.357 |
+ 0.039 |
+ 0.075 |
+ 0.051 |
+ -0.008 |
+ 0.039 |
+ 0.004 |
+ 0.298 |
+ 0.357 |
+ 0.321 |
+ 0.321 |
+ 0.321 |
+ 0.298 |
+ 0.357 |
+ 0.333 |
+ 0.286 |
+ 0.321 |
+ 0.380 |
+ 0.368 |
+ 0.353 |
+ 0.251 |
+ 0.345 |
+ 0.310 |
+ 0.310 |
+ 0.392 |
+ 0.345 |
+ 0.345 |
+ 0.345 |
+ 0.333 |
+ 0.333 |
+ 0.274 |
+ 0.004 |
+ 0.310 |
+ 0.333 |
+ 0.333 |
+ 0.286 |
+ 0.274 |
+ 0.216 |
+ 0.180 |
+ 0.274 |
+ 0.298 |
+ 0.204 |
+ 0.157 |
+ 0.216 |
+ 0.216 |
+ 0.204 |
+ 0.157 |
+ 0.216 |
+ 0.122 |
+ 0.086 |
+ 375 |
+ OrganismalFitness |
+ ENV_HV1B9 |
+ Medium |
+ Virus |
+
+
+ ENV_HV1BR_Haddox_2016 |
+ 0.251 |
+ 0.241 |
+ 0.235 |
+ 0.237 |
+ 0.251 |
+ 0.256 |
+ -0.003 |
+ 0.237 |
+ 0.264 |
+ 0.267 |
+ 0.230 |
+ 0.241 |
+ 0.260 |
+ -0.006 |
+ -0.004 |
+ -0.001 |
+ 0.027 |
+ 0.043 |
+ 0.107 |
+ 0.251 |
+ 0.271 |
+ 0.269 |
+ 0.275 |
+ 0.279 |
+ 0.256 |
+ 0.272 |
+ 0.268 |
+ 0.263 |
+ 0.271 |
+ 0.277 |
+ 0.248 |
+ 0.223 |
+ 0.158 |
+ 0.260 |
+ 0.270 |
+ 0.270 |
+ 0.277 |
+ 0.279 |
+ 0.274 |
+ 0.274 |
+ 0.277 |
+ 0.270 |
+ 0.163 |
+ -0.011 |
+ 0.240 |
+ 0.230 |
+ 0.143 |
+ 0.162 |
+ 0.069 |
+ 0.049 |
+ 0.124 |
+ 0.115 |
+ 0.133 |
+ 0.156 |
+ 0.129 |
+ 0.136 |
+ 0.140 |
+ 0.134 |
+ 0.133 |
+ 0.142 |
+ 0.108 |
+ 0.058 |
+ 12863 |
+ OrganismalFitness |
+ ENV_HV1BR |
+ Medium |
+ Virus |
+
+
+ ENVZ_ECOLI_Ghose_2023 |
+ 0.138 |
+ 0.131 |
+ 0.123 |
+ 0.142 |
+ 0.187 |
+ 0.187 |
+ 0.001 |
+ 0.157 |
+ 0.135 |
+ 0.127 |
+ 0.131 |
+ 0.183 |
+ 0.194 |
+ 0.146 |
+ 0.164 |
+ 0.157 |
+ 0.161 |
+ 0.142 |
+ 0.120 |
+ 0.153 |
+ 0.138 |
+ 0.149 |
+ 0.168 |
+ 0.176 |
+ 0.135 |
+ 0.153 |
+ 0.202 |
+ 0.190 |
+ 0.190 |
+ 0.164 |
+ 0.157 |
+ 0.135 |
+ 0.064 |
+ 0.123 |
+ 0.153 |
+ 0.194 |
+ 0.176 |
+ 0.179 |
+ 0.198 |
+ 0.183 |
+ 0.202 |
+ 0.202 |
+ 0.127 |
+ 0.068 |
+ 0.164 |
+ 0.168 |
+ 0.049 |
+ 0.146 |
+ 0.101 |
+ 0.042 |
+ 0.164 |
+ 0.172 |
+ 0.131 |
+ 0.161 |
+ 0.138 |
+ 0.164 |
+ 0.157 |
+ 0.153 |
+ 0.176 |
+ 0.164 |
+ 0.112 |
+ 0.183 |
+ 1121 |
+ Activity |
+ ENVZ_ECOLI |
+ High |
+ Prokaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M |
+ 0.612 |
+ 0.608 |
+ 0.631 |
+ 0.637 |
+ 0.682 |
+ 0.686 |
+ -0.261 |
+ 0.655 |
+ 0.749 |
+ 0.753 |
+ 0.755 |
+ 0.745 |
+ 0.773 |
+ -0.300 |
+ 0.725 |
+ 0.788 |
+ 0.804 |
+ 0.735 |
+ 0.708 |
+ 0.688 |
+ 0.476 |
+ 0.547 |
+ 0.659 |
+ 0.616 |
+ 0.606 |
+ 0.643 |
+ 0.637 |
+ 0.684 |
+ 0.686 |
+ 0.798 |
+ 0.737 |
+ 0.725 |
+ 0.604 |
+ 0.573 |
+ 0.541 |
+ 0.631 |
+ 0.651 |
+ 0.661 |
+ 0.694 |
+ 0.682 |
+ 0.680 |
+ 0.667 |
+ 0.569 |
+ -0.253 |
+ 0.553 |
+ 0.541 |
+ 0.492 |
+ 0.537 |
+ 0.741 |
+ 0.657 |
+ 0.794 |
+ 0.778 |
+ 0.788 |
+ 0.776 |
+ 0.800 |
+ 0.796 |
+ 0.802 |
+ 0.794 |
+ 0.786 |
+ 0.802 |
+ 0.751 |
+ 0.745 |
+ 1960 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ ERBB2_HUMAN_Elazar_2016 |
+ 0.351 |
+ 0.301 |
+ 0.191 |
+ 0.203 |
+ 0.203 |
+ 0.240 |
+ 0.338 |
+ 0.338 |
+ 0.326 |
+ 0.351 |
+ 0.363 |
+ 0.338 |
+ 0.400 |
+ 0.351 |
+ 0.351 |
+ 0.388 |
+ 0.326 |
+ 0.351 |
+ 0.289 |
+ 0.043 |
+ 0.351 |
+ 0.326 |
+ 0.375 |
+ 0.351 |
+ 0.363 |
+ 0.351 |
+ 0.449 |
+ 0.437 |
+ 0.388 |
+ 0.375 |
+ 0.314 |
+ 0.050 |
+ 0.400 |
+ 0.363 |
+ 0.338 |
+ 0.351 |
+ 0.400 |
+ 0.400 |
+ 0.338 |
+ 0.363 |
+ 0.363 |
+ 0.351 |
+ 0.388 |
+ 0.363 |
+ 0.412 |
+ 0.400 |
+ 0.314 |
+ 0.363 |
+ -0.105 |
+ 0.043 |
+ 0.351 |
+ 0.301 |
+ 0.363 |
+ 0.289 |
+ 0.301 |
+ 0.314 |
+ 0.338 |
+ 0.314 |
+ 0.338 |
+ 0.351 |
+ 0.412 |
+ 0.400 |
+ 326 |
+ Expression |
+ ERBB2_HUMAN |
+ Low |
+ Human |
+
+
+ ESTA_BACSU_Nutschel_2020 |
+ 0.216 |
+ 0.286 |
+ 0.307 |
+ 0.314 |
+ 0.290 |
+ 0.288 |
+ 0.130 |
+ 0.237 |
+ 0.248 |
+ 0.305 |
+ 0.237 |
+ 0.218 |
+ 0.248 |
+ 0.113 |
+ 0.180 |
+ 0.200 |
+ 0.224 |
+ 0.235 |
+ 0.270 |
+ 0.231 |
+ 0.054 |
+ 0.137 |
+ 0.209 |
+ 0.227 |
+ 0.181 |
+ 0.205 |
+ 0.227 |
+ 0.205 |
+ 0.301 |
+ 0.279 |
+ 0.355 |
+ 0.298 |
+ 0.064 |
+ 0.113 |
+ 0.191 |
+ 0.202 |
+ 0.238 |
+ 0.244 |
+ 0.238 |
+ 0.312 |
+ 0.301 |
+ 0.310 |
+ 0.126 |
+ 0.100 |
+ 0.207 |
+ 0.185 |
+ 0.425 |
+ 0.367 |
+ 0.426 |
+ 0.272 |
+ 0.272 |
+ 0.259 |
+ 0.253 |
+ 0.242 |
+ 0.262 |
+ 0.248 |
+ 0.250 |
+ 0.273 |
+ 0.272 |
+ 0.266 |
+ 0.318 |
+ 0.152 |
+ 2172 |
+ Stability |
+ ESTA_BACSU |
+ High |
+ Prokaryote |
+
+
+ F7YBW8_MESOW_Aakre_2015 |
+ 0.047 |
+ 0.227 |
+ 0.239 |
+ 0.258 |
+ 0.257 |
+ 0.258 |
+ 0.013 |
+ 0.183 |
+ 0.232 |
+ 0.241 |
+ 0.250 |
+ 0.228 |
+ 0.237 |
+ -0.043 |
+ 0.009 |
+ 0.049 |
+ 0.248 |
+ 0.237 |
+ 0.264 |
+ 0.225 |
+ -0.029 |
+ -0.070 |
+ -0.066 |
+ -0.014 |
+ 0.076 |
+ 0.071 |
+ -0.009 |
+ 0.190 |
+ 0.238 |
+ 0.211 |
+ 0.270 |
+ 0.263 |
+ 0.040 |
+ -0.006 |
+ -0.070 |
+ 0.244 |
+ 0.046 |
+ 0.011 |
+ 0.252 |
+ 0.237 |
+ 0.227 |
+ 0.264 |
+ -0.006 |
+ -0.023 |
+ 0.134 |
+ -0.008 |
+ -0.007 |
+ 0.159 |
+ 0.037 |
+ 0.023 |
+ 0.268 |
+ 0.264 |
+ 0.248 |
+ 0.273 |
+ 0.275 |
+ 0.271 |
+ 0.258 |
+ 0.265 |
+ 0.266 |
+ 0.269 |
+ 0.181 |
+ -0.079 |
+ 9192 |
+ OrganismalFitness |
+ F7YBW8_MESOW |
+ High |
+ Prokaryote |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U |
+ 0.351 |
+ 0.359 |
+ 0.357 |
+ 0.366 |
+ 0.374 |
+ 0.364 |
+ 0.037 |
+ 0.245 |
+ 0.292 |
+ 0.353 |
+ 0.396 |
+ 0.198 |
+ 0.326 |
+ 0.039 |
+ 0.340 |
+ 0.451 |
+ 0.425 |
+ 0.376 |
+ 0.290 |
+ 0.353 |
+ 0.120 |
+ 0.226 |
+ 0.264 |
+ 0.247 |
+ 0.101 |
+ 0.391 |
+ 0.410 |
+ 0.209 |
+ 0.315 |
+ 0.381 |
+ 0.366 |
+ 0.345 |
+ 0.268 |
+ 0.060 |
+ 0.086 |
+ 0.090 |
+ 0.302 |
+ 0.304 |
+ 0.222 |
+ 0.326 |
+ 0.349 |
+ 0.306 |
+ 0.290 |
+ -0.022 |
+ 0.362 |
+ 0.343 |
+ 0.436 |
+ 0.391 |
+ 0.485 |
+ 0.419 |
+ 0.440 |
+ 0.374 |
+ 0.423 |
+ 0.400 |
+ 0.393 |
+ 0.432 |
+ 0.425 |
+ 0.391 |
+ 0.410 |
+ 0.419 |
+ 0.476 |
+ 0.483 |
+ 1886 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ FKBP3_HUMAN_Tsuboyama_2023_2KFV |
+ 0.300 |
+ 0.271 |
+ 0.374 |
+ 0.378 |
+ 0.378 |
+ 0.387 |
+ 0.135 |
+ 0.248 |
+ 0.180 |
+ 0.177 |
+ 0.196 |
+ 0.164 |
+ 0.164 |
+ 0.171 |
+ 0.148 |
+ 0.154 |
+ 0.151 |
+ 0.216 |
+ 0.277 |
+ 0.216 |
+ 0.209 |
+ 0.203 |
+ 0.196 |
+ 0.167 |
+ 0.141 |
+ 0.158 |
+ 0.180 |
+ 0.177 |
+ 0.184 |
+ 0.374 |
+ 0.235 |
+ 0.263 |
+ -0.033 |
+ 0.109 |
+ 0.167 |
+ 0.196 |
+ 0.274 |
+ 0.284 |
+ 0.251 |
+ 0.361 |
+ 0.371 |
+ 0.348 |
+ 0.154 |
+ 0.164 |
+ 0.174 |
+ 0.200 |
+ 0.555 |
+ 0.491 |
+ 0.530 |
+ 0.468 |
+ 0.232 |
+ 0.310 |
+ 0.329 |
+ 0.284 |
+ 0.281 |
+ 0.303 |
+ 0.274 |
+ 0.258 |
+ 0.232 |
+ 0.297 |
+ 0.445 |
+ 0.258 |
+ 1237 |
+ Stability |
+ FKBP3_HUMAN |
+ Medium |
+ Human |
+
+
+ GAL4_YEAST_Kitzman_2015 |
+ 0.193 |
+ 0.295 |
+ 0.310 |
+ 0.375 |
+ 0.342 |
+ 0.346 |
+ 0.231 |
+ -0.007 |
+ 0.342 |
+ 0.426 |
+ 0.426 |
+ 0.335 |
+ 0.353 |
+ 0.255 |
+ 0.270 |
+ 0.382 |
+ 0.466 |
+ 0.451 |
+ 0.441 |
+ 0.346 |
+ 0.204 |
+ 0.241 |
+ 0.284 |
+ 0.284 |
+ 0.237 |
+ 0.291 |
+ 0.375 |
+ 0.331 |
+ 0.397 |
+ 0.451 |
+ 0.451 |
+ 0.399 |
+ 0.186 |
+ 0.171 |
+ 0.201 |
+ 0.197 |
+ 0.342 |
+ 0.351 |
+ 0.375 |
+ 0.331 |
+ 0.313 |
+ 0.321 |
+ 0.288 |
+ 0.193 |
+ 0.419 |
+ 0.339 |
+ 0.186 |
+ 0.368 |
+ 0.259 |
+ 0.102 |
+ 0.404 |
+ 0.415 |
+ 0.390 |
+ 0.397 |
+ 0.426 |
+ 0.386 |
+ 0.397 |
+ 0.393 |
+ 0.408 |
+ 0.419 |
+ 0.422 |
+ 0.306 |
+ 1195 |
+ OrganismalFitness |
+ GAL4_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ GCN4_YEAST_Staller_2018 |
+ 0.192 |
+ 0.175 |
+ 0.178 |
+ 0.184 |
+ 0.160 |
+ 0.158 |
+ 0.119 |
+ 0.157 |
+ 0.180 |
+ 0.175 |
+ 0.169 |
+ 0.212 |
+ 0.216 |
+ 0.246 |
+ 0.215 |
+ 0.201 |
+ 0.215 |
+ 0.187 |
+ 0.169 |
+ 0.166 |
+ 0.113 |
+ 0.136 |
+ 0.107 |
+ 0.105 |
+ 0.096 |
+ 0.089 |
+ 0.061 |
+ 0.095 |
+ 0.122 |
+ 0.140 |
+ 0.183 |
+ 0.186 |
+ 0.033 |
+ 0.114 |
+ 0.116 |
+ 0.231 |
+ 0.195 |
+ 0.199 |
+ 0.228 |
+ 0.193 |
+ 0.190 |
+ 0.215 |
+ 0.037 |
+ 0.084 |
+ 0.087 |
+ 0.046 |
+ 0.105 |
+ 0.125 |
+ 0.172 |
+ 0.160 |
+ 0.154 |
+ 0.145 |
+ 0.143 |
+ 0.139 |
+ 0.151 |
+ 0.149 |
+ 0.143 |
+ 0.149 |
+ 0.158 |
+ 0.148 |
+ 0.171 |
+ 0.183 |
+ 2638 |
+ Binding |
+ GCN4_YEAST |
+ Low |
+ Eukaryote |
+
+
+ GDIA_HUMAN_Silverstein_2021 |
+ 0.338 |
+ 0.341 |
+ 0.366 |
+ 0.348 |
+ 0.338 |
+ 0.338 |
+ 0.102 |
+ 0.279 |
+ 0.373 |
+ 0.369 |
+ 0.293 |
+ 0.307 |
+ 0.328 |
+ 0.113 |
+ 0.161 |
+ 0.293 |
+ 0.289 |
+ 0.258 |
+ 0.317 |
+ 0.289 |
+ 0.182 |
+ 0.279 |
+ 0.289 |
+ 0.307 |
+ 0.213 |
+ 0.341 |
+ 0.310 |
+ 0.258 |
+ 0.279 |
+ 0.310 |
+ 0.348 |
+ 0.314 |
+ 0.185 |
+ 0.217 |
+ 0.248 |
+ 0.255 |
+ 0.317 |
+ 0.307 |
+ 0.331 |
+ 0.338 |
+ 0.355 |
+ 0.359 |
+ 0.175 |
+ 0.088 |
+ 0.317 |
+ 0.172 |
+ 0.217 |
+ 0.289 |
+ 0.248 |
+ 0.095 |
+ 0.272 |
+ 0.248 |
+ 0.293 |
+ 0.272 |
+ 0.276 |
+ 0.289 |
+ 0.279 |
+ 0.272 |
+ 0.262 |
+ 0.314 |
+ 0.334 |
+ 0.213 |
+ 1154 |
+ OrganismalFitness |
+ GDIA_HUMAN |
+ Low |
+ Human |
+
+
+ GFP_AEQVI_Sarkisyan_2016 |
+ 0.584 |
+ 0.582 |
+ 0.613 |
+ 0.615 |
+ 0.618 |
+ 0.618 |
+ 0.061 |
+ 0.583 |
+ 0.604 |
+ 0.596 |
+ 0.473 |
+ 0.097 |
+ 0.097 |
+ 0.081 |
+ 0.119 |
+ 0.090 |
+ 0.106 |
+ 0.140 |
+ 0.255 |
+ 0.537 |
+ 0.087 |
+ 0.110 |
+ 0.167 |
+ 0.089 |
+ 0.048 |
+ 0.166 |
+ 0.261 |
+ 0.584 |
+ 0.584 |
+ 0.619 |
+ 0.546 |
+ 0.555 |
+ 0.058 |
+ 0.069 |
+ 0.163 |
+ 0.573 |
+ 0.581 |
+ 0.583 |
+ 0.618 |
+ 0.624 |
+ 0.627 |
+ 0.650 |
+ 0.023 |
+ -0.006 |
+ 0.036 |
+ 0.036 |
+ 0.456 |
+ 0.445 |
+ 0.670 |
+ 0.535 |
+ 0.529 |
+ 0.537 |
+ 0.546 |
+ 0.554 |
+ 0.546 |
+ 0.545 |
+ 0.542 |
+ 0.543 |
+ 0.525 |
+ 0.541 |
+ 0.562 |
+ 0.402 |
+ 51714 |
+ Activity |
+ GFP_AEQVI |
+ Low |
+ Eukaryote |
+
+
+ GLPA_HUMAN_Elazar_2016 |
+ 0.114 |
+ -0.045 |
+ 0.102 |
+ 0.135 |
+ 0.102 |
+ 0.070 |
+ 0.331 |
+ 0.413 |
+ 0.282 |
+ 0.250 |
+ 0.446 |
+ 0.446 |
+ 0.446 |
+ 0.315 |
+ 0.446 |
+ 0.462 |
+ 0.495 |
+ 0.381 |
+ 0.430 |
+ 0.152 |
+ 0.381 |
+ 0.397 |
+ 0.430 |
+ 0.413 |
+ 0.348 |
+ 0.413 |
+ 0.462 |
+ 0.430 |
+ 0.479 |
+ 0.209 |
+ 0.364 |
+ 0.136 |
+ 0.168 |
+ 0.430 |
+ 0.364 |
+ 0.430 |
+ 0.397 |
+ 0.381 |
+ 0.397 |
+ 0.299 |
+ 0.250 |
+ 0.364 |
+ 0.413 |
+ 0.397 |
+ 0.462 |
+ 0.397 |
+ 0.331 |
+ 0.495 |
+ 0.266 |
+ 0.168 |
+ 0.413 |
+ 0.381 |
+ 0.495 |
+ 0.430 |
+ 0.446 |
+ 0.397 |
+ 0.462 |
+ 0.397 |
+ 0.413 |
+ 0.479 |
+ 0.413 |
+ 0.413 |
+ 245 |
+ Expression |
+ GLPA_HUMAN |
+ Low |
+ Human |
+
+
+ GRB2_HUMAN_Faure_2021 |
+ 0.314 |
+ 0.392 |
+ 0.384 |
+ 0.411 |
+ 0.404 |
+ 0.406 |
+ 0.403 |
+ 0.338 |
+ 0.405 |
+ 0.365 |
+ 0.390 |
+ 0.337 |
+ 0.378 |
+ 0.430 |
+ 0.464 |
+ 0.474 |
+ 0.496 |
+ 0.396 |
+ 0.429 |
+ 0.407 |
+ 0.419 |
+ 0.396 |
+ 0.400 |
+ 0.344 |
+ 0.419 |
+ 0.397 |
+ 0.341 |
+ 0.409 |
+ 0.342 |
+ 0.376 |
+ 0.308 |
+ 0.308 |
+ 0.361 |
+ 0.418 |
+ 0.385 |
+ 0.310 |
+ 0.390 |
+ 0.381 |
+ 0.331 |
+ 0.421 |
+ 0.421 |
+ 0.397 |
+ 0.451 |
+ 0.235 |
+ 0.399 |
+ 0.435 |
+ 0.540 |
+ 0.401 |
+ 0.548 |
+ 0.396 |
+ 0.484 |
+ 0.518 |
+ 0.507 |
+ 0.505 |
+ 0.511 |
+ 0.494 |
+ 0.496 |
+ 0.512 |
+ 0.491 |
+ 0.514 |
+ 0.421 |
+ 0.468 |
+ 63366 |
+ OrganismalFitness |
+ GRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q |
+ 0.238 |
+ 0.312 |
+ 0.092 |
+ 0.123 |
+ 0.269 |
+ 0.281 |
+ 0.188 |
+ 0.288 |
+ 0.346 |
+ 0.342 |
+ 0.458 |
+ 0.350 |
+ 0.377 |
+ 0.238 |
+ 0.315 |
+ 0.419 |
+ 0.473 |
+ 0.412 |
+ 0.431 |
+ 0.192 |
+ 0.212 |
+ 0.219 |
+ 0.273 |
+ 0.288 |
+ 0.254 |
+ 0.273 |
+ 0.162 |
+ 0.265 |
+ 0.392 |
+ 0.408 |
+ 0.477 |
+ 0.360 |
+ 0.115 |
+ 0.227 |
+ 0.223 |
+ 0.369 |
+ 0.312 |
+ 0.315 |
+ 0.373 |
+ 0.250 |
+ 0.323 |
+ 0.377 |
+ 0.223 |
+ 0.235 |
+ 0.458 |
+ 0.308 |
+ 0.492 |
+ 0.527 |
+ 0.573 |
+ 0.408 |
+ 0.519 |
+ 0.531 |
+ 0.523 |
+ 0.523 |
+ 0.535 |
+ 0.531 |
+ 0.504 |
+ 0.535 |
+ 0.523 |
+ 0.535 |
+ 0.577 |
+ 0.450 |
+ 1040 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM |
+ 0.297 |
+ 0.324 |
+ 0.238 |
+ 0.244 |
+ 0.344 |
+ 0.353 |
+ 0.172 |
+ 0.294 |
+ 0.286 |
+ 0.284 |
+ 0.253 |
+ 0.209 |
+ 0.231 |
+ 0.169 |
+ 0.175 |
+ 0.370 |
+ 0.326 |
+ 0.277 |
+ 0.272 |
+ 0.303 |
+ 0.201 |
+ 0.261 |
+ 0.333 |
+ 0.312 |
+ 0.252 |
+ 0.320 |
+ 0.307 |
+ 0.288 |
+ 0.302 |
+ 0.331 |
+ 0.305 |
+ 0.308 |
+ 0.132 |
+ 0.186 |
+ 0.215 |
+ 0.240 |
+ 0.368 |
+ 0.345 |
+ 0.372 |
+ 0.351 |
+ 0.335 |
+ 0.356 |
+ 0.105 |
+ 0.079 |
+ 0.354 |
+ 0.141 |
+ 0.222 |
+ 0.336 |
+ 0.331 |
+ 0.266 |
+ 0.272 |
+ 0.320 |
+ 0.351 |
+ 0.332 |
+ 0.335 |
+ 0.372 |
+ 0.327 |
+ 0.342 |
+ 0.329 |
+ 0.348 |
+ 0.340 |
+ 0.290 |
+ 5586 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HEM3_HUMAN_Loggerenberg_2023 |
+ 0.298 |
+ 0.313 |
+ 0.295 |
+ 0.304 |
+ 0.313 |
+ 0.311 |
+ 0.100 |
+ 0.096 |
+ 0.320 |
+ 0.330 |
+ 0.311 |
+ 0.283 |
+ 0.306 |
+ 0.105 |
+ 0.273 |
+ 0.299 |
+ 0.310 |
+ 0.321 |
+ 0.337 |
+ 0.115 |
+ 0.239 |
+ 0.292 |
+ 0.307 |
+ 0.308 |
+ 0.254 |
+ 0.297 |
+ 0.295 |
+ 0.305 |
+ 0.321 |
+ 0.318 |
+ 0.313 |
+ 0.312 |
+ 0.061 |
+ 0.277 |
+ 0.291 |
+ 0.313 |
+ 0.306 |
+ 0.328 |
+ 0.347 |
+ 0.307 |
+ 0.325 |
+ 0.344 |
+ 0.220 |
+ 0.106 |
+ 0.292 |
+ 0.260 |
+ 0.228 |
+ 0.280 |
+ 0.224 |
+ 0.115 |
+ 0.274 |
+ 0.268 |
+ 0.271 |
+ 0.288 |
+ 0.294 |
+ 0.285 |
+ 0.297 |
+ 0.292 |
+ 0.293 |
+ 0.301 |
+ 0.339 |
+ 0.303 |
+ 5689 |
+ Activity |
+ HEM3_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019 |
+ 0.355 |
+ 0.410 |
+ 0.440 |
+ 0.433 |
+ 0.409 |
+ 0.410 |
+ 0.114 |
+ 0.218 |
+ 0.343 |
+ 0.377 |
+ 0.354 |
+ 0.333 |
+ 0.377 |
+ 0.038 |
+ 0.089 |
+ 0.347 |
+ 0.282 |
+ 0.341 |
+ 0.367 |
+ 0.342 |
+ 0.237 |
+ 0.303 |
+ 0.337 |
+ 0.346 |
+ 0.294 |
+ 0.375 |
+ 0.339 |
+ 0.378 |
+ 0.394 |
+ 0.419 |
+ 0.307 |
+ 0.301 |
+ 0.115 |
+ 0.296 |
+ 0.325 |
+ 0.475 |
+ 0.376 |
+ 0.377 |
+ 0.511 |
+ 0.444 |
+ 0.424 |
+ 0.463 |
+ 0.127 |
+ 0.013 |
+ 0.212 |
+ 0.002 |
+ 0.322 |
+ 0.342 |
+ 0.437 |
+ 0.300 |
+ 0.342 |
+ 0.249 |
+ 0.295 |
+ 0.326 |
+ 0.331 |
+ 0.308 |
+ 0.306 |
+ 0.338 |
+ 0.319 |
+ 0.317 |
+ 0.399 |
+ 0.136 |
+ 496137 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HMDH_HUMAN_Jiang_2019 |
+ 0.368 |
+ 0.321 |
+ 0.289 |
+ 0.297 |
+ 0.323 |
+ 0.319 |
+ 0.116 |
+ 0.171 |
+ 0.206 |
+ 0.198 |
+ 0.209 |
+ 0.242 |
+ 0.225 |
+ 0.032 |
+ 0.014 |
+ 0.231 |
+ 0.399 |
+ 0.278 |
+ 0.283 |
+ 0.222 |
+ 0.346 |
+ 0.092 |
+ 0.113 |
+ 0.154 |
+ 0.374 |
+ 0.126 |
+ 0.113 |
+ 0.143 |
+ 0.147 |
+ 0.354 |
+ 0.300 |
+ 0.301 |
+ 0.173 |
+ 0.241 |
+ 0.183 |
+ 0.154 |
+ 0.361 |
+ 0.297 |
+ 0.285 |
+ 0.342 |
+ 0.296 |
+ 0.294 |
+ 0.114 |
+ 0.041 |
+ 0.362 |
+ 0.389 |
+ 0.229 |
+ 0.349 |
+ -0.175 |
+ 0.074 |
+ 0.378 |
+ 0.376 |
+ 0.367 |
+ 0.378 |
+ 0.379 |
+ 0.373 |
+ 0.370 |
+ 0.382 |
+ 0.374 |
+ 0.382 |
+ 0.420 |
+ 0.327 |
+ 16853 |
+ OrganismalFitness |
+ HMDH_HUMAN |
+ Low |
+ Human |
+
+
+ HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2 |
+ 0.235 |
+ 0.250 |
+ 0.255 |
+ 0.266 |
+ 0.230 |
+ 0.232 |
+ 0.173 |
+ 0.283 |
+ 0.301 |
+ 0.310 |
+ 0.239 |
+ 0.298 |
+ 0.330 |
+ -0.003 |
+ 0.100 |
+ 0.170 |
+ 0.171 |
+ 0.216 |
+ 0.214 |
+ -0.001 |
+ 0.235 |
+ 0.255 |
+ 0.259 |
+ 0.271 |
+ 0.212 |
+ 0.246 |
+ 0.285 |
+ 0.292 |
+ 0.292 |
+ 0.275 |
+ 0.314 |
+ 0.259 |
+ 0.134 |
+ 0.228 |
+ 0.248 |
+ 0.228 |
+ 0.246 |
+ 0.264 |
+ 0.253 |
+ 0.244 |
+ 0.253 |
+ 0.244 |
+ 0.020 |
+ -0.010 |
+ 0.282 |
+ 0.285 |
+ 0.035 |
+ 0.186 |
+ 0.060 |
+ -0.026 |
+ 0.166 |
+ 0.187 |
+ 0.170 |
+ 0.189 |
+ 0.184 |
+ 0.186 |
+ 0.180 |
+ 0.180 |
+ 0.166 |
+ 0.187 |
+ 0.276 |
+ 0.203 |
+ 2252 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Flynn_2019 |
+ 0.315 |
+ 0.361 |
+ 0.374 |
+ 0.385 |
+ 0.374 |
+ 0.371 |
+ 0.125 |
+ 0.311 |
+ 0.394 |
+ 0.384 |
+ 0.337 |
+ 0.357 |
+ 0.377 |
+ 0.010 |
+ 0.068 |
+ 0.141 |
+ 0.247 |
+ 0.323 |
+ 0.334 |
+ 0.371 |
+ 0.351 |
+ 0.371 |
+ 0.357 |
+ 0.377 |
+ 0.367 |
+ 0.380 |
+ 0.385 |
+ 0.371 |
+ 0.399 |
+ 0.374 |
+ 0.408 |
+ 0.398 |
+ 0.176 |
+ 0.351 |
+ 0.358 |
+ 0.362 |
+ 0.364 |
+ 0.368 |
+ 0.375 |
+ 0.382 |
+ 0.388 |
+ 0.389 |
+ 0.081 |
+ -0.056 |
+ 0.350 |
+ 0.270 |
+ 0.160 |
+ 0.295 |
+ 0.090 |
+ 0.050 |
+ 0.245 |
+ 0.248 |
+ 0.262 |
+ 0.264 |
+ 0.254 |
+ 0.259 |
+ 0.265 |
+ 0.255 |
+ 0.242 |
+ 0.258 |
+ 0.355 |
+ 0.295 |
+ 13294 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Mishra_2016 |
+ 0.394 |
+ 0.400 |
+ 0.405 |
+ 0.405 |
+ 0.397 |
+ 0.394 |
+ 0.270 |
+ 0.309 |
+ 0.374 |
+ 0.381 |
+ 0.394 |
+ 0.411 |
+ 0.416 |
+ 0.187 |
+ 0.275 |
+ 0.305 |
+ 0.368 |
+ 0.369 |
+ 0.370 |
+ 0.376 |
+ 0.380 |
+ 0.366 |
+ 0.368 |
+ 0.395 |
+ 0.385 |
+ 0.385 |
+ 0.401 |
+ 0.366 |
+ 0.391 |
+ 0.399 |
+ 0.401 |
+ 0.396 |
+ 0.238 |
+ 0.363 |
+ 0.378 |
+ 0.386 |
+ 0.403 |
+ 0.413 |
+ 0.410 |
+ 0.403 |
+ 0.408 |
+ 0.410 |
+ 0.358 |
+ -0.052 |
+ 0.385 |
+ 0.395 |
+ 0.120 |
+ 0.286 |
+ 0.144 |
+ 0.035 |
+ 0.341 |
+ 0.341 |
+ 0.327 |
+ 0.363 |
+ 0.357 |
+ 0.341 |
+ 0.359 |
+ 0.349 |
+ 0.348 |
+ 0.359 |
+ 0.375 |
+ 0.347 |
+ 4323 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HXK4_HUMAN_Gersing_2022_activity |
+ 0.390 |
+ 0.426 |
+ 0.406 |
+ 0.413 |
+ 0.396 |
+ 0.406 |
+ 0.178 |
+ 0.311 |
+ 0.426 |
+ 0.440 |
+ 0.375 |
+ 0.384 |
+ 0.403 |
+ 0.121 |
+ 0.201 |
+ 0.408 |
+ 0.421 |
+ 0.415 |
+ 0.398 |
+ 0.033 |
+ 0.374 |
+ 0.367 |
+ 0.346 |
+ 0.327 |
+ 0.366 |
+ 0.368 |
+ 0.357 |
+ 0.358 |
+ 0.325 |
+ 0.413 |
+ 0.417 |
+ 0.388 |
+ 0.138 |
+ 0.375 |
+ 0.338 |
+ 0.279 |
+ 0.411 |
+ 0.384 |
+ 0.351 |
+ 0.424 |
+ 0.421 |
+ 0.406 |
+ 0.225 |
+ 0.024 |
+ 0.418 |
+ 0.398 |
+ 0.287 |
+ 0.366 |
+ 0.319 |
+ 0.123 |
+ 0.421 |
+ 0.407 |
+ 0.409 |
+ 0.413 |
+ 0.415 |
+ 0.427 |
+ 0.411 |
+ 0.418 |
+ 0.411 |
+ 0.424 |
+ 0.439 |
+ 0.375 |
+ 8570 |
+ OrganismalFitness |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ HXK4_HUMAN_Gersing_2023_abundance |
+ 0.282 |
+ 0.309 |
+ 0.287 |
+ 0.303 |
+ 0.301 |
+ 0.305 |
+ 0.045 |
+ 0.282 |
+ 0.347 |
+ 0.363 |
+ 0.295 |
+ 0.289 |
+ 0.308 |
+ 0.106 |
+ 0.148 |
+ 0.294 |
+ 0.320 |
+ 0.336 |
+ 0.317 |
+ 0.348 |
+ 0.244 |
+ 0.249 |
+ 0.261 |
+ 0.270 |
+ 0.241 |
+ 0.264 |
+ 0.283 |
+ 0.281 |
+ 0.275 |
+ 0.323 |
+ 0.287 |
+ 0.250 |
+ 0.152 |
+ 0.265 |
+ 0.258 |
+ 0.265 |
+ 0.302 |
+ 0.304 |
+ 0.308 |
+ 0.317 |
+ 0.317 |
+ 0.324 |
+ 0.135 |
+ 0.075 |
+ 0.302 |
+ 0.290 |
+ 0.325 |
+ 0.335 |
+ 0.334 |
+ 0.110 |
+ 0.330 |
+ 0.332 |
+ 0.331 |
+ 0.333 |
+ 0.334 |
+ 0.337 |
+ 0.328 |
+ 0.327 |
+ 0.321 |
+ 0.332 |
+ 0.370 |
+ 0.286 |
+ 8396 |
+ Expression |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ I6TAH8_I68A0_Doud_2015 |
+ 0.287 |
+ 0.240 |
+ 0.213 |
+ 0.208 |
+ 0.274 |
+ 0.277 |
+ -0.008 |
+ 0.234 |
+ 0.217 |
+ 0.239 |
+ 0.008 |
+ 0.015 |
+ 0.008 |
+ 0.026 |
+ 0.020 |
+ 0.012 |
+ 0.015 |
+ 0.006 |
+ 0.054 |
+ 0.151 |
+ 0.240 |
+ 0.245 |
+ 0.286 |
+ 0.283 |
+ 0.013 |
+ 0.019 |
+ 0.085 |
+ 0.003 |
+ 0.230 |
+ 0.284 |
+ 0.179 |
+ 0.183 |
+ 0.111 |
+ 0.211 |
+ 0.242 |
+ 0.247 |
+ 0.225 |
+ 0.252 |
+ 0.256 |
+ 0.280 |
+ 0.306 |
+ 0.306 |
+ 0.016 |
+ 0.015 |
+ 0.016 |
+ 0.017 |
+ 0.166 |
+ 0.149 |
+ 0.182 |
+ 0.078 |
+ 0.048 |
+ 0.075 |
+ 0.101 |
+ 0.097 |
+ 0.079 |
+ 0.072 |
+ 0.059 |
+ 0.067 |
+ 0.051 |
+ 0.072 |
+ 0.024 |
+ 0.007 |
+ 9462 |
+ OrganismalFitness |
+ I6TAH8_I68A0 |
+ Medium |
+ Virus |
+
+
+ IF1_ECOLI_Kelsic_2016 |
+ 0.262 |
+ 0.370 |
+ 0.443 |
+ 0.456 |
+ 0.418 |
+ 0.437 |
+ 0.086 |
+ 0.277 |
+ 0.195 |
+ 0.163 |
+ 0.405 |
+ 0.430 |
+ 0.440 |
+ 0.080 |
+ 0.306 |
+ 0.414 |
+ 0.446 |
+ 0.446 |
+ 0.424 |
+ 0.418 |
+ 0.242 |
+ 0.328 |
+ 0.223 |
+ 0.297 |
+ 0.341 |
+ 0.354 |
+ 0.344 |
+ 0.306 |
+ 0.354 |
+ 0.313 |
+ 0.386 |
+ 0.335 |
+ 0.214 |
+ 0.354 |
+ 0.373 |
+ 0.398 |
+ 0.367 |
+ 0.363 |
+ 0.360 |
+ 0.418 |
+ 0.421 |
+ 0.437 |
+ 0.252 |
+ 0.093 |
+ 0.449 |
+ 0.373 |
+ 0.230 |
+ 0.398 |
+ 0.348 |
+ 0.172 |
+ 0.408 |
+ 0.408 |
+ 0.408 |
+ 0.430 |
+ 0.389 |
+ 0.408 |
+ 0.389 |
+ 0.411 |
+ 0.414 |
+ 0.421 |
+ 0.456 |
+ 0.379 |
+ 1367 |
+ OrganismalFitness |
+ IF1_ECOLI |
+ High |
+ Prokaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33 |
+ 0.295 |
+ 0.387 |
+ 0.425 |
+ 0.418 |
+ 0.415 |
+ 0.422 |
+ 0.176 |
+ 0.271 |
+ 0.380 |
+ 0.404 |
+ 0.313 |
+ 0.331 |
+ 0.352 |
+ 0.264 |
+ 0.366 |
+ 0.422 |
+ 0.260 |
+ 0.246 |
+ 0.180 |
+ 0.432 |
+ 0.127 |
+ 0.187 |
+ 0.274 |
+ 0.302 |
+ 0.317 |
+ 0.222 |
+ 0.324 |
+ 0.243 |
+ 0.267 |
+ 0.411 |
+ 0.352 |
+ 0.316 |
+ 0.025 |
+ 0.173 |
+ 0.232 |
+ 0.313 |
+ 0.302 |
+ 0.345 |
+ 0.359 |
+ 0.390 |
+ 0.387 |
+ 0.415 |
+ 0.299 |
+ 0.176 |
+ 0.306 |
+ 0.380 |
+ 0.415 |
+ 0.366 |
+ 0.432 |
+ 0.331 |
+ 0.267 |
+ 0.229 |
+ 0.288 |
+ 0.250 |
+ 0.278 |
+ 0.331 |
+ 0.292 |
+ 0.250 |
+ 0.341 |
+ 0.324 |
+ 0.292 |
+ 0.362 |
+ 1329 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ ISDH_STAAW_Tsuboyama_2023_2LHR |
+ 0.065 |
+ 0.108 |
+ 0.139 |
+ 0.147 |
+ 0.151 |
+ 0.176 |
+ 0.250 |
+ 0.124 |
+ 0.281 |
+ 0.266 |
+ 0.378 |
+ 0.326 |
+ 0.326 |
+ 0.271 |
+ 0.291 |
+ 0.336 |
+ 0.324 |
+ 0.371 |
+ 0.326 |
+ 0.131 |
+ 0.219 |
+ 0.145 |
+ 0.106 |
+ 0.166 |
+ 0.221 |
+ 0.242 |
+ 0.221 |
+ 0.211 |
+ 0.242 |
+ 0.275 |
+ 0.334 |
+ 0.315 |
+ 0.170 |
+ 0.192 |
+ 0.174 |
+ 0.184 |
+ 0.155 |
+ 0.139 |
+ 0.137 |
+ 0.194 |
+ 0.157 |
+ 0.170 |
+ 0.199 |
+ 0.178 |
+ 0.252 |
+ 0.240 |
+ 0.427 |
+ 0.353 |
+ 0.419 |
+ 0.400 |
+ 0.320 |
+ 0.324 |
+ 0.345 |
+ 0.332 |
+ 0.330 |
+ 0.345 |
+ 0.316 |
+ 0.324 |
+ 0.332 |
+ 0.334 |
+ 0.437 |
+ 0.359 |
+ 1944 |
+ Stability |
+ ISDH_STAAW |
+ High |
+ Prokaryote |
+
+
+ KCNE1_HUMAN_Muhammad_2023_expression |
+ 0.195 |
+ 0.200 |
+ 0.195 |
+ 0.177 |
+ 0.181 |
+ 0.171 |
+ 0.179 |
+ 0.108 |
+ 0.161 |
+ 0.171 |
+ 0.226 |
+ 0.210 |
+ 0.195 |
+ 0.179 |
+ 0.175 |
+ 0.226 |
+ 0.181 |
+ 0.173 |
+ 0.179 |
+ 0.197 |
+ 0.195 |
+ 0.204 |
+ 0.161 |
+ 0.110 |
+ 0.212 |
+ 0.148 |
+ 0.208 |
+ 0.191 |
+ 0.161 |
+ 0.181 |
+ 0.175 |
+ 0.128 |
+ -0.087 |
+ 0.189 |
+ 0.226 |
+ 0.163 |
+ 0.177 |
+ 0.185 |
+ 0.212 |
+ 0.191 |
+ 0.191 |
+ 0.208 |
+ 0.206 |
+ 0.167 |
+ 0.204 |
+ 0.228 |
+ 0.193 |
+ 0.200 |
+ 0.189 |
+ 0.063 |
+ 0.153 |
+ 0.169 |
+ 0.163 |
+ 0.155 |
+ 0.169 |
+ 0.150 |
+ 0.179 |
+ 0.153 |
+ 0.163 |
+ 0.163 |
+ 0.204 |
+ 0.193 |
+ 2339 |
+ Expression |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNE1_HUMAN_Muhammad_2023_function |
+ 0.282 |
+ 0.331 |
+ 0.358 |
+ 0.393 |
+ 0.362 |
+ 0.391 |
+ 0.111 |
+ 0.389 |
+ 0.374 |
+ 0.388 |
+ 0.215 |
+ 0.320 |
+ 0.410 |
+ 0.130 |
+ 0.085 |
+ 0.446 |
+ 0.391 |
+ 0.322 |
+ 0.254 |
+ 0.159 |
+ 0.153 |
+ 0.179 |
+ 0.329 |
+ 0.401 |
+ 0.166 |
+ 0.445 |
+ 0.453 |
+ 0.433 |
+ 0.424 |
+ 0.337 |
+ 0.460 |
+ 0.380 |
+ 0.006 |
+ 0.104 |
+ 0.234 |
+ 0.491 |
+ 0.298 |
+ 0.318 |
+ 0.457 |
+ 0.375 |
+ 0.391 |
+ 0.486 |
+ 0.121 |
+ 0.116 |
+ 0.386 |
+ 0.087 |
+ 0.123 |
+ 0.313 |
+ 0.127 |
+ 0.030 |
+ 0.407 |
+ 0.396 |
+ 0.389 |
+ 0.394 |
+ 0.394 |
+ 0.381 |
+ 0.403 |
+ 0.400 |
+ 0.420 |
+ 0.420 |
+ 0.507 |
+ 0.127 |
+ 2315 |
+ Activity |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNH2_HUMAN_Kozek_2020 |
+ 0.360 |
+ 0.400 |
+ 0.180 |
+ 0.180 |
+ 0.180 |
+ 0.160 |
+ 0.360 |
+ 0.040 |
+ 0.260 |
+ 0.260 |
+ 0.200 |
+ 0.180 |
+ 0.160 |
+ 0.180 |
+ 0.120 |
+ 0.160 |
+ 0.160 |
+ 0.160 |
+ 0.120 |
+ 0.360 |
+ 0.360 |
+ 0.360 |
+ 0.340 |
+ 0.280 |
+ 0.380 |
+ 0.320 |
+ 0.320 |
+ 0.280 |
+ 0.280 |
+ 0.281 |
+ 0.300 |
+ 0.140 |
+ 0.280 |
+ 0.320 |
+ 0.360 |
+ 0.360 |
+ 0.360 |
+ 0.400 |
+ 0.400 |
+ 0.300 |
+ 0.400 |
+ 0.360 |
+ 0.320 |
+ 0.140 |
+ 0.160 |
+ 0.220 |
+ 0.120 |
+ 0.000 |
+ -0.120 |
+ 0.060 |
+ 0.400 |
+ 0.300 |
+ 0.400 |
+ 0.340 |
+ 0.300 |
+ 0.340 |
+ 0.380 |
+ 0.340 |
+ 0.400 |
+ 0.360 |
+ 0.180 |
+ 0.360 |
+ 200 |
+ Activity |
+ KCNH2_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_function |
+ 0.215 |
+ 0.273 |
+ 0.260 |
+ 0.277 |
+ 0.285 |
+ 0.288 |
+ 0.049 |
+ 0.183 |
+ 0.256 |
+ 0.246 |
+ 0.306 |
+ 0.288 |
+ 0.297 |
+ 0.035 |
+ 0.216 |
+ 0.308 |
+ 0.322 |
+ 0.326 |
+ 0.326 |
+ 0.126 |
+ 0.283 |
+ 0.222 |
+ 0.187 |
+ 0.179 |
+ 0.280 |
+ 0.268 |
+ 0.241 |
+ 0.274 |
+ 0.172 |
+ 0.299 |
+ 0.303 |
+ 0.291 |
+ 0.118 |
+ 0.261 |
+ 0.238 |
+ 0.168 |
+ 0.286 |
+ 0.267 |
+ 0.227 |
+ 0.299 |
+ 0.291 |
+ 0.270 |
+ 0.154 |
+ -0.009 |
+ 0.267 |
+ 0.297 |
+ 0.153 |
+ 0.227 |
+ 0.185 |
+ 0.062 |
+ 0.318 |
+ 0.304 |
+ 0.308 |
+ 0.301 |
+ 0.307 |
+ 0.293 |
+ 0.311 |
+ 0.309 |
+ 0.301 |
+ 0.313 |
+ 0.297 |
+ 0.215 |
+ 6963 |
+ Activity |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_surface |
+ 0.261 |
+ 0.225 |
+ 0.165 |
+ 0.183 |
+ 0.214 |
+ 0.217 |
+ 0.073 |
+ 0.143 |
+ 0.186 |
+ 0.188 |
+ 0.207 |
+ 0.186 |
+ 0.198 |
+ 0.110 |
+ 0.230 |
+ 0.225 |
+ 0.221 |
+ 0.222 |
+ 0.243 |
+ 0.158 |
+ 0.143 |
+ 0.158 |
+ 0.161 |
+ 0.178 |
+ 0.157 |
+ 0.179 |
+ 0.172 |
+ 0.176 |
+ 0.176 |
+ 0.236 |
+ 0.190 |
+ 0.152 |
+ 0.114 |
+ 0.143 |
+ 0.170 |
+ 0.161 |
+ 0.199 |
+ 0.223 |
+ 0.224 |
+ 0.203 |
+ 0.224 |
+ 0.224 |
+ 0.166 |
+ 0.043 |
+ 0.172 |
+ 0.195 |
+ 0.169 |
+ 0.157 |
+ 0.168 |
+ 0.094 |
+ 0.217 |
+ 0.198 |
+ 0.214 |
+ 0.199 |
+ 0.208 |
+ 0.194 |
+ 0.209 |
+ 0.201 |
+ 0.210 |
+ 0.210 |
+ 0.220 |
+ 0.220 |
+ 6917 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KKA2_KLEPN_Melnikov_2014 |
+ 0.221 |
+ 0.452 |
+ 0.366 |
+ 0.516 |
+ 0.499 |
+ 0.508 |
+ 0.148 |
+ 0.380 |
+ 0.449 |
+ 0.509 |
+ 0.471 |
+ 0.488 |
+ 0.512 |
+ 0.165 |
+ 0.199 |
+ 0.404 |
+ 0.486 |
+ 0.531 |
+ 0.556 |
+ 0.394 |
+ 0.213 |
+ 0.331 |
+ 0.418 |
+ 0.467 |
+ 0.268 |
+ 0.488 |
+ 0.486 |
+ 0.473 |
+ 0.526 |
+ 0.523 |
+ 0.548 |
+ 0.499 |
+ 0.113 |
+ 0.184 |
+ 0.407 |
+ 0.495 |
+ 0.346 |
+ 0.429 |
+ 0.487 |
+ 0.495 |
+ 0.511 |
+ 0.526 |
+ 0.191 |
+ 0.069 |
+ 0.467 |
+ 0.324 |
+ 0.362 |
+ 0.486 |
+ 0.439 |
+ 0.185 |
+ 0.479 |
+ 0.475 |
+ 0.478 |
+ 0.478 |
+ 0.476 |
+ 0.482 |
+ 0.476 |
+ 0.482 |
+ 0.488 |
+ 0.491 |
+ 0.530 |
+ 0.239 |
+ 4960 |
+ OrganismalFitness |
+ KKA2_KLEPN |
+ High |
+ Prokaryote |
+
+
+ LGK_LIPST_Klesmith_2015 |
+ 0.233 |
+ 0.292 |
+ 0.340 |
+ 0.345 |
+ 0.332 |
+ 0.344 |
+ 0.089 |
+ 0.352 |
+ 0.348 |
+ 0.414 |
+ 0.336 |
+ 0.394 |
+ 0.424 |
+ 0.125 |
+ 0.241 |
+ 0.300 |
+ 0.391 |
+ 0.433 |
+ 0.420 |
+ 0.381 |
+ 0.228 |
+ 0.316 |
+ 0.366 |
+ 0.378 |
+ 0.292 |
+ 0.390 |
+ 0.402 |
+ 0.367 |
+ 0.409 |
+ 0.390 |
+ 0.406 |
+ 0.338 |
+ 0.114 |
+ 0.253 |
+ 0.312 |
+ 0.422 |
+ 0.280 |
+ 0.310 |
+ 0.389 |
+ 0.339 |
+ 0.357 |
+ 0.393 |
+ 0.128 |
+ 0.047 |
+ 0.368 |
+ 0.278 |
+ 0.290 |
+ 0.371 |
+ 0.358 |
+ 0.106 |
+ 0.387 |
+ 0.366 |
+ 0.383 |
+ 0.374 |
+ 0.375 |
+ 0.395 |
+ 0.393 |
+ 0.392 |
+ 0.395 |
+ 0.397 |
+ 0.301 |
+ 0.251 |
+ 7890 |
+ Activity |
+ LGK_LIPST |
+ Medium |
+ Eukaryote |
+
+
+ LYAM1_HUMAN_Elazar_2016 |
+ 0.260 |
+ 0.260 |
+ 0.148 |
+ 0.159 |
+ 0.215 |
+ 0.226 |
+ 0.260 |
+ 0.137 |
+ 0.304 |
+ 0.327 |
+ 0.315 |
+ 0.260 |
+ 0.293 |
+ 0.193 |
+ 0.237 |
+ 0.193 |
+ 0.226 |
+ 0.260 |
+ 0.349 |
+ 0.193 |
+ 0.237 |
+ 0.260 |
+ 0.304 |
+ 0.282 |
+ 0.215 |
+ 0.304 |
+ 0.271 |
+ 0.182 |
+ 0.182 |
+ 0.289 |
+ 0.204 |
+ 0.144 |
+ 0.137 |
+ 0.282 |
+ 0.315 |
+ 0.215 |
+ 0.282 |
+ 0.304 |
+ 0.282 |
+ 0.204 |
+ 0.249 |
+ 0.204 |
+ 0.215 |
+ 0.215 |
+ 0.237 |
+ 0.249 |
+ 0.103 |
+ 0.148 |
+ 0.115 |
+ 0.025 |
+ 0.182 |
+ 0.249 |
+ 0.260 |
+ 0.226 |
+ 0.193 |
+ 0.204 |
+ 0.237 |
+ 0.271 |
+ 0.193 |
+ 0.249 |
+ 0.293 |
+ 0.226 |
+ 359 |
+ Expression |
+ LYAM1_HUMAN |
+ Medium |
+ Human |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V |
+ 0.422 |
+ 0.536 |
+ 0.481 |
+ 0.481 |
+ 0.498 |
+ 0.486 |
+ 0.327 |
+ 0.649 |
+ 0.581 |
+ 0.563 |
+ 0.643 |
+ 0.309 |
+ 0.584 |
+ 0.416 |
+ 0.395 |
+ 0.436 |
+ 0.457 |
+ 0.498 |
+ 0.339 |
+ 0.575 |
+ 0.365 |
+ 0.557 |
+ 0.546 |
+ 0.569 |
+ 0.498 |
+ 0.528 |
+ 0.469 |
+ 0.531 |
+ 0.472 |
+ 0.430 |
+ 0.548 |
+ 0.522 |
+ 0.383 |
+ 0.247 |
+ 0.513 |
+ 0.525 |
+ 0.454 |
+ 0.566 |
+ 0.599 |
+ 0.522 |
+ 0.599 |
+ 0.640 |
+ 0.265 |
+ 0.038 |
+ 0.360 |
+ 0.236 |
+ 0.516 |
+ 0.301 |
+ 0.548 |
+ 0.566 |
+ 0.652 |
+ 0.634 |
+ 0.628 |
+ 0.640 |
+ 0.637 |
+ 0.634 |
+ 0.637 |
+ 0.649 |
+ 0.661 |
+ 0.640 |
+ 0.593 |
+ 0.640 |
+ 1429 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV |
+ 0.518 |
+ 0.657 |
+ 0.614 |
+ 0.628 |
+ 0.675 |
+ 0.681 |
+ -0.026 |
+ 0.471 |
+ 0.612 |
+ 0.675 |
+ 0.658 |
+ 0.333 |
+ 0.461 |
+ -0.066 |
+ 0.104 |
+ 0.692 |
+ 0.612 |
+ 0.605 |
+ 0.677 |
+ 0.650 |
+ 0.040 |
+ 0.100 |
+ 0.544 |
+ 0.592 |
+ 0.221 |
+ 0.476 |
+ 0.444 |
+ 0.512 |
+ 0.667 |
+ 0.692 |
+ 0.662 |
+ 0.615 |
+ 0.558 |
+ -0.242 |
+ 0.104 |
+ 0.093 |
+ 0.488 |
+ 0.578 |
+ 0.565 |
+ 0.630 |
+ 0.667 |
+ 0.658 |
+ 0.030 |
+ -0.106 |
+ 0.578 |
+ 0.340 |
+ 0.399 |
+ 0.541 |
+ 0.664 |
+ 0.446 |
+ 0.698 |
+ 0.673 |
+ 0.675 |
+ 0.681 |
+ 0.660 |
+ 0.662 |
+ 0.671 |
+ 0.692 |
+ 0.703 |
+ 0.688 |
+ 0.673 |
+ 0.609 |
+ 2116 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MET_HUMAN_Estevam_2023 |
+ 0.453 |
+ 0.528 |
+ 0.520 |
+ 0.524 |
+ 0.554 |
+ 0.558 |
+ 0.489 |
+ 0.490 |
+ 0.517 |
+ 0.527 |
+ 0.550 |
+ 0.503 |
+ 0.517 |
+ 0.414 |
+ 0.471 |
+ 0.494 |
+ 0.555 |
+ 0.579 |
+ 0.572 |
+ 0.563 |
+ 0.497 |
+ 0.482 |
+ 0.435 |
+ 0.461 |
+ 0.496 |
+ 0.503 |
+ 0.499 |
+ 0.452 |
+ 0.423 |
+ 0.507 |
+ 0.528 |
+ 0.494 |
+ 0.260 |
+ 0.463 |
+ 0.467 |
+ 0.489 |
+ 0.479 |
+ 0.499 |
+ 0.527 |
+ 0.524 |
+ 0.531 |
+ 0.543 |
+ 0.503 |
+ 0.303 |
+ 0.535 |
+ 0.544 |
+ 0.328 |
+ 0.388 |
+ 0.476 |
+ 0.173 |
+ 0.509 |
+ 0.521 |
+ 0.513 |
+ 0.522 |
+ 0.526 |
+ 0.533 |
+ 0.521 |
+ 0.534 |
+ 0.531 |
+ 0.532 |
+ 0.544 |
+ 0.490 |
+ 5393 |
+ Activity |
+ MET_HUMAN |
+ Medium |
+ Human |
+
+
+ MK01_HUMAN_Brenan_2016 |
+ 0.152 |
+ 0.190 |
+ 0.209 |
+ 0.212 |
+ 0.193 |
+ 0.197 |
+ 0.173 |
+ 0.149 |
+ 0.195 |
+ 0.171 |
+ 0.069 |
+ 0.156 |
+ 0.171 |
+ 0.132 |
+ 0.173 |
+ 0.171 |
+ 0.166 |
+ 0.178 |
+ 0.155 |
+ 0.172 |
+ 0.206 |
+ 0.131 |
+ 0.099 |
+ 0.048 |
+ 0.161 |
+ 0.104 |
+ 0.081 |
+ 0.087 |
+ -0.021 |
+ 0.190 |
+ 0.171 |
+ 0.198 |
+ 0.119 |
+ 0.163 |
+ 0.093 |
+ 0.054 |
+ 0.176 |
+ 0.138 |
+ 0.125 |
+ 0.202 |
+ 0.185 |
+ 0.179 |
+ 0.149 |
+ 0.100 |
+ 0.146 |
+ 0.162 |
+ 0.068 |
+ 0.016 |
+ 0.122 |
+ 0.004 |
+ 0.158 |
+ 0.155 |
+ 0.153 |
+ 0.171 |
+ 0.169 |
+ 0.156 |
+ 0.163 |
+ 0.168 |
+ 0.165 |
+ 0.167 |
+ 0.168 |
+ 0.166 |
+ 6809 |
+ OrganismalFitness |
+ MK01_HUMAN |
+ Medium |
+ Human |
+
+
+ MLAC_ECOLI_MacRae_2023 |
+ 0.176 |
+ 0.274 |
+ 0.344 |
+ 0.354 |
+ 0.337 |
+ 0.330 |
+ -0.022 |
+ 0.271 |
+ 0.346 |
+ 0.359 |
+ 0.302 |
+ 0.343 |
+ 0.354 |
+ -0.022 |
+ 0.174 |
+ 0.207 |
+ 0.312 |
+ 0.295 |
+ 0.313 |
+ 0.295 |
+ 0.042 |
+ 0.266 |
+ 0.274 |
+ 0.309 |
+ 0.248 |
+ 0.314 |
+ 0.306 |
+ 0.301 |
+ 0.339 |
+ 0.320 |
+ 0.356 |
+ 0.319 |
+ 0.028 |
+ 0.103 |
+ 0.218 |
+ 0.216 |
+ 0.203 |
+ 0.249 |
+ 0.231 |
+ 0.286 |
+ 0.306 |
+ 0.303 |
+ 0.108 |
+ -0.050 |
+ 0.244 |
+ 0.174 |
+ 0.027 |
+ 0.154 |
+ 0.112 |
+ -0.012 |
+ 0.244 |
+ 0.214 |
+ 0.235 |
+ 0.230 |
+ 0.239 |
+ 0.242 |
+ 0.222 |
+ 0.233 |
+ 0.268 |
+ 0.240 |
+ 0.160 |
+ 0.151 |
+ 4007 |
+ OrganismalFitness |
+ MLAC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ MSH2_HUMAN_Jia_2020 |
+ 0.308 |
+ 0.343 |
+ 0.326 |
+ 0.335 |
+ 0.335 |
+ 0.338 |
+ 0.177 |
+ 0.276 |
+ 0.342 |
+ 0.348 |
+ 0.308 |
+ 0.328 |
+ 0.340 |
+ 0.184 |
+ 0.287 |
+ 0.338 |
+ 0.324 |
+ 0.268 |
+ 0.181 |
+ 0.335 |
+ 0.245 |
+ 0.302 |
+ 0.283 |
+ 0.277 |
+ 0.280 |
+ 0.315 |
+ 0.330 |
+ 0.306 |
+ 0.289 |
+ 0.340 |
+ 0.327 |
+ 0.309 |
+ 0.176 |
+ 0.235 |
+ 0.290 |
+ 0.277 |
+ 0.299 |
+ 0.326 |
+ 0.326 |
+ 0.337 |
+ 0.343 |
+ 0.340 |
+ 0.222 |
+ 0.095 |
+ 0.332 |
+ 0.283 |
+ 0.258 |
+ 0.317 |
+ 0.044 |
+ 0.089 |
+ 0.279 |
+ 0.262 |
+ 0.272 |
+ 0.279 |
+ 0.297 |
+ 0.275 |
+ 0.277 |
+ 0.275 |
+ 0.298 |
+ 0.288 |
+ 0.343 |
+ 0.306 |
+ 16749 |
+ OrganismalFitness |
+ MSH2_HUMAN |
+ Medium |
+ Human |
+
+
+ MTH3_HAEAE_RockahShmuel_2015 |
+ 0.259 |
+ 0.419 |
+ 0.447 |
+ 0.453 |
+ 0.458 |
+ 0.442 |
+ 0.215 |
+ 0.414 |
+ 0.435 |
+ 0.442 |
+ 0.380 |
+ 0.455 |
+ 0.450 |
+ 0.153 |
+ 0.210 |
+ 0.236 |
+ 0.336 |
+ 0.393 |
+ 0.435 |
+ 0.429 |
+ 0.202 |
+ 0.298 |
+ 0.393 |
+ 0.429 |
+ 0.323 |
+ 0.437 |
+ 0.445 |
+ 0.447 |
+ 0.460 |
+ 0.450 |
+ 0.455 |
+ 0.429 |
+ 0.215 |
+ 0.212 |
+ 0.331 |
+ 0.427 |
+ 0.261 |
+ 0.329 |
+ 0.419 |
+ 0.396 |
+ 0.419 |
+ 0.458 |
+ 0.223 |
+ 0.026 |
+ 0.321 |
+ 0.212 |
+ 0.287 |
+ 0.370 |
+ 0.357 |
+ 0.119 |
+ 0.360 |
+ 0.334 |
+ 0.360 |
+ 0.354 |
+ 0.370 |
+ 0.378 |
+ 0.357 |
+ 0.362 |
+ 0.354 |
+ 0.365 |
+ 0.378 |
+ 0.215 |
+ 1777 |
+ OrganismalFitness |
+ MTH3_HAEAE |
+ Medium |
+ Prokaryote |
+
+
+ MTHR_HUMAN_Weile_2021 |
+ 0.169 |
+ 0.177 |
+ 0.167 |
+ 0.173 |
+ 0.175 |
+ 0.170 |
+ 0.113 |
+ 0.111 |
+ 0.200 |
+ 0.204 |
+ 0.258 |
+ 0.200 |
+ 0.213 |
+ 0.108 |
+ 0.261 |
+ 0.335 |
+ 0.255 |
+ 0.191 |
+ 0.203 |
+ 0.178 |
+ 0.311 |
+ 0.105 |
+ 0.141 |
+ 0.182 |
+ 0.297 |
+ 0.156 |
+ 0.180 |
+ 0.170 |
+ 0.221 |
+ 0.203 |
+ 0.149 |
+ 0.139 |
+ 0.087 |
+ 0.309 |
+ 0.219 |
+ 0.133 |
+ 0.233 |
+ 0.203 |
+ 0.158 |
+ 0.219 |
+ 0.193 |
+ 0.171 |
+ 0.174 |
+ 0.070 |
+ 0.251 |
+ 0.262 |
+ 0.159 |
+ 0.237 |
+ 0.270 |
+ 0.066 |
+ 0.232 |
+ 0.207 |
+ 0.211 |
+ 0.225 |
+ 0.223 |
+ 0.234 |
+ 0.249 |
+ 0.245 |
+ 0.254 |
+ 0.238 |
+ 0.223 |
+ 0.245 |
+ 12464 |
+ OrganismalFitness |
+ MTHR_HUMAN |
+ Low |
+ Human |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT |
+ 0.156 |
+ 0.169 |
+ 0.255 |
+ 0.261 |
+ 0.262 |
+ 0.298 |
+ 0.104 |
+ 0.207 |
+ 0.330 |
+ 0.321 |
+ 0.394 |
+ 0.376 |
+ 0.390 |
+ 0.136 |
+ 0.323 |
+ 0.442 |
+ 0.440 |
+ 0.393 |
+ 0.206 |
+ 0.244 |
+ 0.249 |
+ 0.180 |
+ 0.244 |
+ 0.213 |
+ 0.244 |
+ 0.176 |
+ 0.251 |
+ 0.190 |
+ 0.222 |
+ 0.189 |
+ 0.225 |
+ 0.130 |
+ 0.114 |
+ 0.309 |
+ 0.216 |
+ 0.221 |
+ 0.287 |
+ 0.260 |
+ 0.246 |
+ 0.302 |
+ 0.298 |
+ 0.283 |
+ 0.100 |
+ -0.002 |
+ 0.313 |
+ 0.223 |
+ 0.297 |
+ 0.336 |
+ 0.379 |
+ 0.257 |
+ 0.421 |
+ 0.398 |
+ 0.430 |
+ 0.406 |
+ 0.418 |
+ 0.421 |
+ 0.406 |
+ 0.413 |
+ 0.401 |
+ 0.411 |
+ 0.431 |
+ 0.390 |
+ 3297 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NCAP_I34A1_Doud_2015 |
+ 0.290 |
+ 0.240 |
+ 0.248 |
+ 0.250 |
+ 0.283 |
+ 0.282 |
+ 0.012 |
+ 0.241 |
+ 0.275 |
+ 0.252 |
+ 0.029 |
+ 0.026 |
+ 0.025 |
+ 0.037 |
+ 0.034 |
+ 0.026 |
+ 0.040 |
+ 0.034 |
+ 0.089 |
+ 0.215 |
+ 0.266 |
+ 0.264 |
+ 0.293 |
+ 0.288 |
+ 0.030 |
+ 0.058 |
+ 0.098 |
+ 0.055 |
+ 0.256 |
+ 0.294 |
+ 0.199 |
+ 0.211 |
+ 0.105 |
+ 0.259 |
+ 0.272 |
+ 0.295 |
+ 0.281 |
+ 0.301 |
+ 0.321 |
+ 0.325 |
+ 0.324 |
+ 0.339 |
+ 0.035 |
+ 0.019 |
+ 0.029 |
+ 0.035 |
+ 0.196 |
+ 0.196 |
+ 0.198 |
+ 0.081 |
+ 0.102 |
+ 0.128 |
+ 0.133 |
+ 0.149 |
+ 0.135 |
+ 0.129 |
+ 0.116 |
+ 0.134 |
+ 0.107 |
+ 0.130 |
+ 0.096 |
+ 0.066 |
+ 9462 |
+ OrganismalFitness |
+ NCAP_I34A1 |
+ Medium |
+ Virus |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R |
+ 0.272 |
+ 0.343 |
+ 0.463 |
+ 0.459 |
+ 0.511 |
+ 0.470 |
+ 0.399 |
+ 0.482 |
+ 0.500 |
+ 0.509 |
+ 0.534 |
+ 0.532 |
+ 0.535 |
+ 0.497 |
+ 0.417 |
+ 0.472 |
+ 0.458 |
+ 0.422 |
+ 0.467 |
+ 0.444 |
+ 0.479 |
+ 0.459 |
+ 0.477 |
+ 0.475 |
+ 0.436 |
+ 0.435 |
+ 0.470 |
+ 0.479 |
+ 0.507 |
+ 0.498 |
+ 0.435 |
+ 0.420 |
+ 0.481 |
+ 0.431 |
+ 0.438 |
+ 0.486 |
+ 0.442 |
+ 0.447 |
+ 0.498 |
+ 0.489 |
+ 0.505 |
+ 0.512 |
+ 0.330 |
+ 0.240 |
+ 0.258 |
+ 0.306 |
+ 0.431 |
+ 0.256 |
+ 0.497 |
+ 0.511 |
+ 0.523 |
+ 0.530 |
+ 0.527 |
+ 0.535 |
+ 0.535 |
+ 0.509 |
+ 0.525 |
+ 0.518 |
+ 0.523 |
+ 0.528 |
+ 0.525 |
+ 0.498 |
+ 2482 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_HEK293T |
+ 0.526 |
+ 0.572 |
+ 0.481 |
+ 0.513 |
+ 0.578 |
+ 0.565 |
+ 0.183 |
+ 0.358 |
+ 0.611 |
+ 0.598 |
+ 0.572 |
+ 0.274 |
+ 0.371 |
+ 0.151 |
+ 0.306 |
+ 0.494 |
+ 0.572 |
+ 0.578 |
+ 0.578 |
+ 0.054 |
+ 0.196 |
+ 0.358 |
+ 0.300 |
+ 0.306 |
+ 0.332 |
+ 0.403 |
+ 0.475 |
+ 0.358 |
+ 0.365 |
+ 0.572 |
+ 0.565 |
+ 0.468 |
+ 0.131 |
+ 0.157 |
+ 0.410 |
+ 0.416 |
+ 0.488 |
+ 0.501 |
+ 0.546 |
+ 0.539 |
+ 0.546 |
+ 0.578 |
+ 0.235 |
+ 0.041 |
+ 0.559 |
+ 0.488 |
+ 0.378 |
+ 0.533 |
+ 0.028 |
+ 0.119 |
+ 0.572 |
+ 0.565 |
+ 0.572 |
+ 0.585 |
+ 0.578 |
+ 0.552 |
+ 0.559 |
+ 0.572 |
+ 0.578 |
+ 0.572 |
+ 0.533 |
+ 0.287 |
+ 637 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_RPE1 |
+ 0.651 |
+ 0.397 |
+ 0.079 |
+ 0.142 |
+ 0.397 |
+ 0.460 |
+ 0.269 |
+ 0.333 |
+ 0.333 |
+ 0.397 |
+ 0.460 |
+ 0.015 |
+ 0.142 |
+ 0.079 |
+ 0.206 |
+ 0.524 |
+ 0.460 |
+ 0.397 |
+ 0.397 |
+ 0.206 |
+ 0.524 |
+ 0.333 |
+ 0.397 |
+ 0.142 |
+ 0.333 |
+ 0.333 |
+ 0.269 |
+ 0.206 |
+ 0.206 |
+ 0.333 |
+ 0.651 |
+ 0.426 |
+ 0.397 |
+ 0.269 |
+ 0.079 |
+ 0.333 |
+ 0.587 |
+ 0.397 |
+ 0.524 |
+ 0.587 |
+ 0.397 |
+ 0.460 |
+ 0.333 |
+ 0.206 |
+ 0.651 |
+ 0.142 |
+ 0.269 |
+ 0.397 |
+ 0.142 |
+ 0.142 |
+ 0.397 |
+ 0.460 |
+ 0.460 |
+ 0.397 |
+ 0.460 |
+ 0.460 |
+ 0.397 |
+ 0.397 |
+ 0.397 |
+ 0.460 |
+ 0.587 |
+ 0.333 |
+ 63 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NRAM_I33A0_Jiang_2016 |
+ 0.477 |
+ 0.450 |
+ 0.356 |
+ 0.329 |
+ 0.450 |
+ 0.463 |
+ 0.060 |
+ 0.289 |
+ 0.477 |
+ 0.477 |
+ -0.020 |
+ 0.114 |
+ 0.369 |
+ -0.060 |
+ -0.034 |
+ 0.060 |
+ 0.154 |
+ 0.396 |
+ 0.450 |
+ 0.302 |
+ 0.342 |
+ 0.477 |
+ 0.396 |
+ 0.409 |
+ 0.060 |
+ 0.409 |
+ 0.450 |
+ 0.329 |
+ 0.423 |
+ 0.530 |
+ 0.289 |
+ 0.342 |
+ 0.007 |
+ 0.356 |
+ 0.383 |
+ 0.396 |
+ 0.436 |
+ 0.503 |
+ 0.544 |
+ 0.503 |
+ 0.544 |
+ 0.477 |
+ -0.034 |
+ -0.034 |
+ -0.060 |
+ -0.074 |
+ 0.356 |
+ 0.289 |
+ 0.369 |
+ 0.154 |
+ 0.248 |
+ 0.195 |
+ 0.181 |
+ 0.221 |
+ 0.154 |
+ 0.235 |
+ 0.221 |
+ 0.208 |
+ 0.154 |
+ 0.168 |
+ 0.221 |
+ 0.128 |
+ 298 |
+ OrganismalFitness |
+ NRAM_I33A0 |
+ Low |
+ Virus |
+
+
+ NUD15_HUMAN_Suiter_2020 |
+ 0.218 |
+ 0.386 |
+ 0.423 |
+ 0.454 |
+ 0.443 |
+ 0.446 |
+ 0.010 |
+ 0.317 |
+ 0.469 |
+ 0.528 |
+ 0.478 |
+ 0.472 |
+ 0.514 |
+ 0.206 |
+ 0.320 |
+ 0.350 |
+ 0.405 |
+ 0.472 |
+ 0.475 |
+ 0.393 |
+ 0.221 |
+ 0.356 |
+ 0.440 |
+ 0.420 |
+ 0.308 |
+ 0.490 |
+ 0.471 |
+ 0.454 |
+ 0.408 |
+ 0.417 |
+ 0.502 |
+ 0.449 |
+ 0.148 |
+ 0.254 |
+ 0.326 |
+ 0.454 |
+ 0.287 |
+ 0.314 |
+ 0.456 |
+ 0.435 |
+ 0.434 |
+ 0.472 |
+ 0.300 |
+ 0.021 |
+ 0.481 |
+ 0.327 |
+ 0.351 |
+ 0.451 |
+ 0.398 |
+ 0.230 |
+ 0.417 |
+ 0.399 |
+ 0.431 |
+ 0.453 |
+ 0.435 |
+ 0.454 |
+ 0.438 |
+ 0.459 |
+ 0.454 |
+ 0.447 |
+ 0.549 |
+ 0.453 |
+ 2844 |
+ Expression |
+ NUD15_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL |
+ 0.352 |
+ 0.417 |
+ 0.455 |
+ 0.449 |
+ 0.486 |
+ 0.484 |
+ 0.204 |
+ 0.324 |
+ 0.512 |
+ 0.453 |
+ 0.411 |
+ 0.230 |
+ 0.240 |
+ 0.232 |
+ 0.291 |
+ 0.287 |
+ 0.356 |
+ 0.403 |
+ 0.388 |
+ 0.423 |
+ 0.246 |
+ 0.427 |
+ 0.464 |
+ 0.461 |
+ 0.263 |
+ 0.386 |
+ 0.390 |
+ 0.386 |
+ 0.441 |
+ 0.486 |
+ 0.526 |
+ 0.618 |
+ 0.129 |
+ 0.244 |
+ 0.322 |
+ 0.307 |
+ 0.364 |
+ 0.372 |
+ 0.386 |
+ 0.496 |
+ 0.464 |
+ 0.486 |
+ 0.257 |
+ 0.179 |
+ 0.267 |
+ 0.248 |
+ 0.528 |
+ 0.490 |
+ 0.591 |
+ 0.512 |
+ 0.610 |
+ 0.597 |
+ 0.610 |
+ 0.628 |
+ 0.644 |
+ 0.654 |
+ 0.634 |
+ 0.616 |
+ 0.640 |
+ 0.642 |
+ 0.536 |
+ 0.455 |
+ 2028 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6 |
+ 0.353 |
+ 0.326 |
+ 0.338 |
+ 0.341 |
+ 0.369 |
+ 0.356 |
+ 0.225 |
+ 0.372 |
+ 0.360 |
+ 0.347 |
+ 0.356 |
+ 0.298 |
+ 0.320 |
+ 0.289 |
+ 0.396 |
+ 0.350 |
+ 0.375 |
+ 0.399 |
+ 0.402 |
+ 0.375 |
+ 0.234 |
+ 0.289 |
+ 0.286 |
+ 0.304 |
+ 0.344 |
+ 0.280 |
+ 0.283 |
+ 0.243 |
+ 0.320 |
+ 0.427 |
+ 0.341 |
+ 0.346 |
+ 0.200 |
+ 0.216 |
+ 0.234 |
+ 0.344 |
+ 0.335 |
+ 0.326 |
+ 0.372 |
+ 0.338 |
+ 0.347 |
+ 0.356 |
+ 0.237 |
+ 0.115 |
+ 0.216 |
+ 0.311 |
+ 0.439 |
+ 0.390 |
+ 0.571 |
+ 0.531 |
+ 0.405 |
+ 0.363 |
+ 0.350 |
+ 0.399 |
+ 0.412 |
+ 0.396 |
+ 0.399 |
+ 0.415 |
+ 0.412 |
+ 0.399 |
+ 0.338 |
+ 0.399 |
+ 1380 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C |
+ 0.342 |
+ 0.402 |
+ 0.508 |
+ 0.521 |
+ 0.505 |
+ 0.512 |
+ 0.199 |
+ 0.396 |
+ 0.523 |
+ 0.526 |
+ 0.533 |
+ 0.482 |
+ 0.498 |
+ 0.198 |
+ 0.515 |
+ 0.541 |
+ 0.545 |
+ 0.522 |
+ 0.533 |
+ 0.511 |
+ 0.526 |
+ 0.519 |
+ 0.518 |
+ 0.526 |
+ 0.425 |
+ 0.503 |
+ 0.499 |
+ 0.491 |
+ 0.472 |
+ 0.486 |
+ 0.531 |
+ 0.514 |
+ 0.316 |
+ 0.133 |
+ 0.223 |
+ 0.318 |
+ 0.465 |
+ 0.441 |
+ 0.439 |
+ 0.515 |
+ 0.511 |
+ 0.511 |
+ 0.268 |
+ -0.044 |
+ 0.261 |
+ 0.319 |
+ 0.406 |
+ 0.349 |
+ 0.508 |
+ 0.513 |
+ 0.571 |
+ 0.562 |
+ 0.565 |
+ 0.562 |
+ 0.569 |
+ 0.568 |
+ 0.559 |
+ 0.555 |
+ 0.575 |
+ 0.566 |
+ 0.535 |
+ 0.493 |
+ 3197 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G |
+ -0.298 |
+ -0.201 |
+ 0.189 |
+ 0.083 |
+ 0.189 |
+ 0.185 |
+ -0.171 |
+ 0.030 |
+ 0.083 |
+ 0.083 |
+ 0.002 |
+ -0.108 |
+ -0.097 |
+ -0.079 |
+ -0.139 |
+ 0.026 |
+ 0.097 |
+ 0.034 |
+ 0.083 |
+ 0.129 |
+ 0.108 |
+ 0.051 |
+ 0.041 |
+ 0.069 |
+ 0.044 |
+ 0.069 |
+ 0.079 |
+ 0.090 |
+ 0.108 |
+ 0.189 |
+ -0.034 |
+ -0.063 |
+ 0.146 |
+ -0.108 |
+ 0.009 |
+ -0.005 |
+ 0.097 |
+ 0.097 |
+ 0.101 |
+ 0.161 |
+ 0.168 |
+ 0.189 |
+ 0.150 |
+ -0.079 |
+ 0.157 |
+ 0.168 |
+ 0.245 |
+ 0.228 |
+ 0.256 |
+ 0.185 |
+ 0.026 |
+ -0.005 |
+ 0.012 |
+ 0.041 |
+ -0.002 |
+ 0.041 |
+ 0.037 |
+ 0.044 |
+ 0.019 |
+ 0.026 |
+ 0.150 |
+ 0.076 |
+ 1134 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OPSD_HUMAN_Wan_2019 |
+ 0.139 |
+ 0.382 |
+ 0.358 |
+ 0.503 |
+ 0.406 |
+ 0.382 |
+ 0.309 |
+ 0.430 |
+ 0.430 |
+ 0.430 |
+ 0.212 |
+ 0.406 |
+ 0.430 |
+ 0.188 |
+ 0.479 |
+ 0.430 |
+ 0.406 |
+ 0.358 |
+ 0.430 |
+ 0.503 |
+ 0.479 |
+ 0.455 |
+ 0.455 |
+ 0.455 |
+ 0.503 |
+ 0.455 |
+ 0.503 |
+ 0.455 |
+ 0.430 |
+ 0.333 |
+ 0.382 |
+ 0.261 |
+ 0.188 |
+ 0.406 |
+ 0.430 |
+ 0.382 |
+ 0.455 |
+ 0.430 |
+ 0.406 |
+ 0.430 |
+ 0.358 |
+ 0.382 |
+ 0.261 |
+ 0.042 |
+ 0.406 |
+ 0.406 |
+ 0.624 |
+ 0.406 |
+ 0.576 |
+ 0.212 |
+ 0.333 |
+ 0.309 |
+ 0.309 |
+ 0.261 |
+ 0.430 |
+ 0.406 |
+ 0.236 |
+ 0.309 |
+ 0.382 |
+ 0.358 |
+ 0.503 |
+ 0.406 |
+ 165 |
+ Expression |
+ OPSD_HUMAN |
+ High |
+ Human |
+
+
+ OTC_HUMAN_Lo_2023 |
+ 0.404 |
+ 0.432 |
+ 0.376 |
+ 0.414 |
+ 0.404 |
+ 0.419 |
+ 0.062 |
+ 0.340 |
+ 0.455 |
+ 0.442 |
+ 0.424 |
+ 0.424 |
+ 0.452 |
+ 0.057 |
+ 0.320 |
+ 0.394 |
+ 0.417 |
+ 0.376 |
+ 0.424 |
+ 0.424 |
+ 0.355 |
+ 0.330 |
+ 0.396 |
+ 0.427 |
+ 0.386 |
+ 0.445 |
+ 0.417 |
+ 0.427 |
+ 0.439 |
+ 0.447 |
+ 0.366 |
+ 0.312 |
+ 0.055 |
+ 0.327 |
+ 0.389 |
+ 0.429 |
+ 0.396 |
+ 0.432 |
+ 0.480 |
+ 0.434 |
+ 0.452 |
+ 0.462 |
+ 0.149 |
+ 0.045 |
+ 0.419 |
+ 0.345 |
+ 0.511 |
+ 0.468 |
+ 0.516 |
+ 0.266 |
+ 0.434 |
+ 0.422 |
+ 0.434 |
+ 0.439 |
+ 0.445 |
+ 0.437 |
+ 0.437 |
+ 0.427 |
+ 0.419 |
+ 0.452 |
+ 0.473 |
+ 0.353 |
+ 1570 |
+ Activity |
+ OTC_HUMAN |
+ Medium |
+ Human |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D |
+ 0.089 |
+ 0.218 |
+ 0.177 |
+ 0.184 |
+ 0.204 |
+ 0.211 |
+ 0.157 |
+ 0.130 |
+ 0.150 |
+ 0.137 |
+ 0.354 |
+ 0.510 |
+ 0.496 |
+ 0.198 |
+ 0.462 |
+ 0.482 |
+ 0.347 |
+ 0.313 |
+ 0.333 |
+ 0.150 |
+ 0.123 |
+ 0.191 |
+ 0.259 |
+ 0.259 |
+ 0.320 |
+ 0.313 |
+ 0.367 |
+ 0.340 |
+ 0.137 |
+ 0.232 |
+ 0.381 |
+ 0.381 |
+ 0.062 |
+ 0.137 |
+ 0.211 |
+ 0.306 |
+ 0.218 |
+ 0.211 |
+ 0.286 |
+ 0.232 |
+ 0.177 |
+ 0.225 |
+ 0.266 |
+ 0.130 |
+ 0.449 |
+ 0.442 |
+ 0.435 |
+ 0.476 |
+ 0.489 |
+ 0.394 |
+ 0.394 |
+ 0.286 |
+ 0.320 |
+ 0.360 |
+ 0.374 |
+ 0.388 |
+ 0.313 |
+ 0.374 |
+ 0.442 |
+ 0.374 |
+ 0.510 |
+ 0.469 |
+ 635 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ OXDA_RHOTO_Vanella_2023_activity |
+ 0.060 |
+ 0.082 |
+ 0.085 |
+ 0.094 |
+ 0.080 |
+ 0.085 |
+ 0.056 |
+ 0.074 |
+ 0.068 |
+ 0.072 |
+ 0.105 |
+ 0.098 |
+ 0.086 |
+ 0.040 |
+ 0.072 |
+ 0.074 |
+ 0.088 |
+ 0.101 |
+ 0.109 |
+ 0.094 |
+ 0.045 |
+ 0.071 |
+ 0.101 |
+ 0.098 |
+ 0.076 |
+ 0.100 |
+ 0.109 |
+ 0.089 |
+ 0.121 |
+ 0.085 |
+ 0.123 |
+ 0.106 |
+ 0.011 |
+ 0.056 |
+ 0.074 |
+ 0.088 |
+ 0.082 |
+ 0.089 |
+ 0.089 |
+ 0.085 |
+ 0.085 |
+ 0.080 |
+ 0.072 |
+ 0.028 |
+ 0.097 |
+ 0.082 |
+ 0.074 |
+ 0.105 |
+ 0.083 |
+ 0.036 |
+ 0.098 |
+ 0.098 |
+ 0.095 |
+ 0.095 |
+ 0.092 |
+ 0.106 |
+ 0.094 |
+ 0.095 |
+ 0.088 |
+ 0.098 |
+ 0.100 |
+ 0.059 |
+ 6396 |
+ Activity |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ OXDA_RHOTO_Vanella_2023_expression |
+ 0.156 |
+ 0.208 |
+ 0.215 |
+ 0.215 |
+ 0.192 |
+ 0.192 |
+ 0.156 |
+ 0.155 |
+ 0.209 |
+ 0.209 |
+ 0.247 |
+ 0.231 |
+ 0.226 |
+ 0.164 |
+ 0.188 |
+ 0.223 |
+ 0.241 |
+ 0.253 |
+ 0.247 |
+ 0.158 |
+ 0.154 |
+ 0.181 |
+ 0.187 |
+ 0.188 |
+ 0.175 |
+ 0.204 |
+ 0.221 |
+ 0.213 |
+ 0.226 |
+ 0.204 |
+ 0.259 |
+ 0.227 |
+ 0.066 |
+ 0.168 |
+ 0.163 |
+ 0.184 |
+ 0.190 |
+ 0.185 |
+ 0.204 |
+ 0.209 |
+ 0.204 |
+ 0.206 |
+ 0.188 |
+ 0.122 |
+ 0.231 |
+ 0.179 |
+ 0.262 |
+ 0.248 |
+ 0.204 |
+ 0.059 |
+ 0.245 |
+ 0.223 |
+ 0.224 |
+ 0.233 |
+ 0.230 |
+ 0.229 |
+ 0.231 |
+ 0.236 |
+ 0.227 |
+ 0.237 |
+ 0.272 |
+ 0.216 |
+ 6769 |
+ Expression |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Etoposide |
+ 0.418 |
+ 0.420 |
+ 0.383 |
+ 0.395 |
+ 0.429 |
+ 0.439 |
+ -0.046 |
+ 0.319 |
+ 0.323 |
+ 0.346 |
+ 0.411 |
+ 0.415 |
+ 0.450 |
+ -0.095 |
+ -0.072 |
+ 0.294 |
+ 0.402 |
+ 0.423 |
+ 0.457 |
+ 0.157 |
+ 0.261 |
+ 0.364 |
+ 0.414 |
+ 0.403 |
+ 0.355 |
+ 0.423 |
+ 0.429 |
+ 0.426 |
+ 0.371 |
+ 0.416 |
+ 0.449 |
+ 0.431 |
+ 0.153 |
+ 0.236 |
+ 0.390 |
+ 0.360 |
+ 0.416 |
+ 0.434 |
+ 0.424 |
+ 0.436 |
+ 0.455 |
+ 0.431 |
+ -0.090 |
+ -0.096 |
+ 0.417 |
+ -0.009 |
+ 0.338 |
+ 0.420 |
+ 0.358 |
+ 0.169 |
+ 0.407 |
+ 0.407 |
+ 0.409 |
+ 0.405 |
+ 0.414 |
+ 0.431 |
+ 0.406 |
+ 0.415 |
+ 0.419 |
+ 0.420 |
+ 0.437 |
+ 0.179 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Nutlin |
+ 0.251 |
+ 0.284 |
+ 0.250 |
+ 0.254 |
+ 0.301 |
+ 0.306 |
+ -0.067 |
+ 0.181 |
+ 0.206 |
+ 0.210 |
+ 0.349 |
+ 0.341 |
+ 0.363 |
+ -0.136 |
+ -0.118 |
+ 0.165 |
+ 0.278 |
+ 0.305 |
+ 0.338 |
+ 0.090 |
+ 0.194 |
+ 0.343 |
+ 0.357 |
+ 0.344 |
+ 0.289 |
+ 0.369 |
+ 0.374 |
+ 0.386 |
+ 0.245 |
+ 0.304 |
+ 0.301 |
+ 0.268 |
+ 0.164 |
+ 0.142 |
+ 0.354 |
+ 0.255 |
+ 0.257 |
+ 0.306 |
+ 0.297 |
+ 0.283 |
+ 0.335 |
+ 0.299 |
+ -0.117 |
+ -0.115 |
+ 0.349 |
+ -0.068 |
+ 0.268 |
+ 0.341 |
+ 0.280 |
+ 0.118 |
+ 0.298 |
+ 0.281 |
+ 0.296 |
+ 0.305 |
+ 0.301 |
+ 0.319 |
+ 0.288 |
+ 0.294 |
+ 0.297 |
+ 0.304 |
+ 0.338 |
+ 0.090 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_WT_Nutlin |
+ 0.394 |
+ 0.399 |
+ 0.363 |
+ 0.380 |
+ 0.392 |
+ 0.400 |
+ -0.034 |
+ 0.305 |
+ 0.311 |
+ 0.310 |
+ 0.438 |
+ 0.409 |
+ 0.434 |
+ -0.086 |
+ -0.066 |
+ 0.334 |
+ 0.390 |
+ 0.411 |
+ 0.422 |
+ 0.140 |
+ 0.300 |
+ 0.402 |
+ 0.434 |
+ 0.418 |
+ 0.369 |
+ 0.439 |
+ 0.440 |
+ 0.441 |
+ 0.317 |
+ 0.430 |
+ 0.426 |
+ 0.410 |
+ 0.155 |
+ 0.268 |
+ 0.435 |
+ 0.374 |
+ 0.404 |
+ 0.431 |
+ 0.418 |
+ 0.411 |
+ 0.438 |
+ 0.411 |
+ -0.081 |
+ -0.104 |
+ 0.434 |
+ 0.004 |
+ 0.367 |
+ 0.431 |
+ 0.363 |
+ 0.187 |
+ 0.398 |
+ 0.407 |
+ 0.399 |
+ 0.407 |
+ 0.415 |
+ 0.421 |
+ 0.400 |
+ 0.407 |
+ 0.403 |
+ 0.415 |
+ 0.429 |
+ 0.189 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018 |
+ 0.542 |
+ 0.519 |
+ 0.429 |
+ 0.507 |
+ 0.487 |
+ 0.515 |
+ 0.056 |
+ 0.386 |
+ 0.499 |
+ 0.511 |
+ 0.476 |
+ 0.472 |
+ 0.542 |
+ 0.052 |
+ 0.037 |
+ 0.530 |
+ 0.604 |
+ 0.619 |
+ 0.631 |
+ 0.359 |
+ 0.317 |
+ 0.402 |
+ 0.441 |
+ 0.429 |
+ 0.386 |
+ 0.433 |
+ 0.421 |
+ 0.445 |
+ 0.429 |
+ 0.476 |
+ 0.550 |
+ 0.515 |
+ 0.056 |
+ 0.336 |
+ 0.421 |
+ 0.344 |
+ 0.519 |
+ 0.484 |
+ 0.464 |
+ 0.530 |
+ 0.495 |
+ 0.511 |
+ 0.052 |
+ 0.002 |
+ 0.573 |
+ 0.153 |
+ 0.390 |
+ 0.507 |
+ 0.394 |
+ 0.181 |
+ 0.588 |
+ 0.612 |
+ 0.604 |
+ 0.588 |
+ 0.585 |
+ 0.588 |
+ 0.608 |
+ 0.573 |
+ 0.616 |
+ 0.612 |
+ 0.639 |
+ 0.270 |
+ 1048 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P84126_THETH_Chan_2017 |
+ 0.484 |
+ 0.516 |
+ 0.534 |
+ 0.534 |
+ 0.489 |
+ 0.521 |
+ 0.272 |
+ 0.407 |
+ 0.539 |
+ 0.545 |
+ 0.481 |
+ 0.489 |
+ 0.531 |
+ 0.193 |
+ 0.484 |
+ 0.492 |
+ 0.524 |
+ 0.486 |
+ 0.463 |
+ 0.524 |
+ 0.383 |
+ 0.436 |
+ 0.434 |
+ 0.486 |
+ 0.481 |
+ 0.518 |
+ 0.510 |
+ 0.518 |
+ 0.563 |
+ 0.486 |
+ 0.471 |
+ 0.401 |
+ 0.322 |
+ 0.423 |
+ 0.436 |
+ 0.471 |
+ 0.492 |
+ 0.494 |
+ 0.516 |
+ 0.518 |
+ 0.524 |
+ 0.537 |
+ 0.404 |
+ -0.032 |
+ 0.460 |
+ 0.402 |
+ 0.256 |
+ 0.336 |
+ 0.396 |
+ 0.081 |
+ 0.460 |
+ 0.436 |
+ 0.373 |
+ 0.479 |
+ 0.473 |
+ 0.420 |
+ 0.449 |
+ 0.457 |
+ 0.473 |
+ 0.476 |
+ 0.524 |
+ 0.526 |
+ 1519 |
+ OrganismalFitness |
+ P84126_THETH |
+ Medium |
+ Prokaryote |
+
+
+ PA_I34A1_Wu_2015 |
+ 0.360 |
+ 0.376 |
+ 0.360 |
+ 0.365 |
+ 0.396 |
+ 0.409 |
+ 0.048 |
+ 0.301 |
+ 0.145 |
+ 0.101 |
+ 0.026 |
+ 0.040 |
+ 0.088 |
+ -0.002 |
+ 0.024 |
+ 0.026 |
+ 0.024 |
+ 0.018 |
+ 0.275 |
+ 0.248 |
+ 0.336 |
+ 0.382 |
+ 0.400 |
+ 0.407 |
+ 0.171 |
+ 0.297 |
+ 0.332 |
+ 0.336 |
+ 0.327 |
+ 0.516 |
+ 0.284 |
+ 0.264 |
+ 0.086 |
+ 0.308 |
+ 0.347 |
+ 0.396 |
+ 0.433 |
+ 0.440 |
+ 0.435 |
+ 0.462 |
+ 0.455 |
+ 0.433 |
+ 0.024 |
+ 0.031 |
+ 0.037 |
+ 0.013 |
+ 0.180 |
+ 0.178 |
+ 0.149 |
+ 0.046 |
+ 0.132 |
+ 0.141 |
+ 0.147 |
+ 0.149 |
+ 0.134 |
+ 0.152 |
+ 0.134 |
+ 0.132 |
+ 0.101 |
+ 0.143 |
+ 0.167 |
+ 0.121 |
+ 1820 |
+ OrganismalFitness |
+ PA_I34A1 |
+ Medium |
+ Virus |
+
+
+ PABP_YEAST_Melamed_2013 |
+ 0.521 |
+ 0.474 |
+ 0.410 |
+ 0.418 |
+ 0.495 |
+ 0.493 |
+ 0.350 |
+ 0.429 |
+ 0.491 |
+ 0.512 |
+ 0.545 |
+ 0.514 |
+ 0.534 |
+ 0.357 |
+ 0.429 |
+ 0.515 |
+ 0.597 |
+ 0.549 |
+ 0.563 |
+ 0.408 |
+ 0.508 |
+ 0.528 |
+ 0.531 |
+ 0.555 |
+ 0.506 |
+ 0.558 |
+ 0.570 |
+ 0.538 |
+ 0.525 |
+ 0.522 |
+ 0.568 |
+ 0.503 |
+ 0.191 |
+ 0.511 |
+ 0.504 |
+ 0.503 |
+ 0.556 |
+ 0.556 |
+ 0.551 |
+ 0.531 |
+ 0.530 |
+ 0.529 |
+ 0.465 |
+ -0.048 |
+ 0.518 |
+ 0.517 |
+ 0.274 |
+ 0.461 |
+ 0.393 |
+ 0.144 |
+ 0.542 |
+ 0.532 |
+ 0.545 |
+ 0.570 |
+ 0.558 |
+ 0.563 |
+ 0.565 |
+ 0.561 |
+ 0.594 |
+ 0.581 |
+ 0.574 |
+ 0.502 |
+ 37708 |
+ OrganismalFitness |
+ PABP_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ PAI1_HUMAN_Huttinger_2021 |
+ 0.301 |
+ 0.280 |
+ 0.263 |
+ 0.275 |
+ 0.283 |
+ 0.298 |
+ 0.046 |
+ 0.238 |
+ 0.316 |
+ 0.325 |
+ 0.335 |
+ 0.312 |
+ 0.312 |
+ 0.061 |
+ 0.298 |
+ 0.322 |
+ 0.344 |
+ 0.240 |
+ 0.205 |
+ 0.316 |
+ 0.103 |
+ 0.277 |
+ 0.256 |
+ 0.240 |
+ 0.277 |
+ 0.293 |
+ 0.289 |
+ 0.271 |
+ 0.218 |
+ 0.311 |
+ 0.300 |
+ 0.268 |
+ 0.125 |
+ 0.121 |
+ 0.264 |
+ 0.274 |
+ 0.284 |
+ 0.295 |
+ 0.299 |
+ 0.301 |
+ 0.310 |
+ 0.306 |
+ 0.252 |
+ 0.053 |
+ 0.322 |
+ 0.306 |
+ 0.297 |
+ 0.343 |
+ 0.305 |
+ 0.108 |
+ 0.334 |
+ 0.344 |
+ 0.337 |
+ 0.348 |
+ 0.339 |
+ 0.348 |
+ 0.342 |
+ 0.347 |
+ 0.341 |
+ 0.353 |
+ 0.360 |
+ 0.302 |
+ 5345 |
+ Activity |
+ PAI1_HUMAN |
+ NaN |
+ Human |
+
+
+ PHOT_CHLRE_Chen_2023 |
+ 0.149 |
+ 0.300 |
+ 0.541 |
+ 0.520 |
+ 0.215 |
+ 0.256 |
+ 0.477 |
+ 0.395 |
+ 0.526 |
+ 0.536 |
+ 0.474 |
+ 0.570 |
+ 0.600 |
+ 0.580 |
+ 0.648 |
+ 0.529 |
+ 0.563 |
+ 0.528 |
+ 0.574 |
+ 0.472 |
+ 0.499 |
+ 0.526 |
+ 0.433 |
+ 0.510 |
+ 0.326 |
+ 0.434 |
+ 0.417 |
+ 0.483 |
+ 0.426 |
+ 0.415 |
+ 0.414 |
+ 0.346 |
+ 0.210 |
+ 0.398 |
+ 0.325 |
+ 0.419 |
+ 0.413 |
+ 0.369 |
+ 0.422 |
+ 0.282 |
+ 0.302 |
+ 0.287 |
+ 0.469 |
+ 0.278 |
+ 0.452 |
+ 0.463 |
+ 0.142 |
+ 0.364 |
+ 0.493 |
+ 0.284 |
+ 0.439 |
+ 0.388 |
+ 0.395 |
+ 0.393 |
+ 0.398 |
+ 0.411 |
+ 0.408 |
+ 0.413 |
+ 0.421 |
+ 0.410 |
+ 0.551 |
+ 0.588 |
+ 167529 |
+ Activity |
+ PHOT_CHLRE |
+ High |
+ Eukaryote |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C |
+ 0.200 |
+ 0.234 |
+ 0.549 |
+ 0.499 |
+ 0.489 |
+ 0.544 |
+ 0.489 |
+ 0.384 |
+ 0.514 |
+ 0.579 |
+ 0.524 |
+ 0.449 |
+ 0.569 |
+ 0.529 |
+ 0.494 |
+ 0.524 |
+ 0.549 |
+ 0.394 |
+ 0.464 |
+ 0.499 |
+ 0.354 |
+ 0.524 |
+ 0.509 |
+ 0.439 |
+ 0.424 |
+ 0.519 |
+ 0.574 |
+ 0.544 |
+ 0.509 |
+ 0.594 |
+ 0.594 |
+ 0.549 |
+ 0.434 |
+ 0.399 |
+ 0.494 |
+ 0.529 |
+ 0.434 |
+ 0.499 |
+ 0.564 |
+ 0.549 |
+ 0.564 |
+ 0.564 |
+ 0.474 |
+ -0.170 |
+ 0.529 |
+ 0.414 |
+ 0.399 |
+ 0.444 |
+ 0.594 |
+ 0.569 |
+ 0.534 |
+ 0.494 |
+ 0.414 |
+ 0.389 |
+ 0.389 |
+ 0.444 |
+ 0.424 |
+ 0.504 |
+ 0.454 |
+ 0.444 |
+ 0.603 |
+ 0.653 |
+ 802 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M |
+ 0.461 |
+ 0.364 |
+ 0.333 |
+ 0.342 |
+ 0.336 |
+ 0.351 |
+ 0.349 |
+ 0.360 |
+ 0.412 |
+ 0.410 |
+ 0.360 |
+ 0.366 |
+ 0.384 |
+ 0.415 |
+ 0.524 |
+ 0.577 |
+ 0.474 |
+ 0.447 |
+ 0.373 |
+ 0.375 |
+ 0.371 |
+ 0.338 |
+ 0.369 |
+ 0.344 |
+ 0.423 |
+ 0.364 |
+ 0.344 |
+ 0.331 |
+ 0.353 |
+ 0.349 |
+ 0.349 |
+ 0.350 |
+ 0.338 |
+ 0.415 |
+ 0.358 |
+ 0.373 |
+ 0.472 |
+ 0.423 |
+ 0.404 |
+ 0.406 |
+ 0.358 |
+ 0.360 |
+ 0.349 |
+ 0.303 |
+ 0.178 |
+ 0.235 |
+ 0.294 |
+ 0.200 |
+ 0.537 |
+ 0.498 |
+ 0.377 |
+ 0.388 |
+ 0.393 |
+ 0.406 |
+ 0.399 |
+ 0.419 |
+ 0.395 |
+ 0.384 |
+ 0.393 |
+ 0.390 |
+ 0.390 |
+ 0.443 |
+ 1824 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF |
+ 0.177 |
+ 0.184 |
+ 0.232 |
+ 0.246 |
+ 0.288 |
+ 0.285 |
+ 0.177 |
+ 0.246 |
+ 0.201 |
+ 0.229 |
+ 0.226 |
+ 0.281 |
+ 0.299 |
+ 0.327 |
+ 0.334 |
+ 0.435 |
+ 0.260 |
+ 0.226 |
+ 0.229 |
+ 0.292 |
+ 0.292 |
+ 0.278 |
+ 0.222 |
+ 0.281 |
+ 0.281 |
+ 0.299 |
+ 0.299 |
+ 0.278 |
+ 0.281 |
+ 0.295 |
+ 0.320 |
+ 0.320 |
+ 0.062 |
+ 0.309 |
+ 0.271 |
+ 0.306 |
+ 0.246 |
+ 0.292 |
+ 0.323 |
+ 0.306 |
+ 0.288 |
+ 0.306 |
+ 0.274 |
+ 0.267 |
+ 0.274 |
+ 0.288 |
+ 0.431 |
+ 0.337 |
+ 0.428 |
+ 0.376 |
+ 0.232 |
+ 0.264 |
+ 0.292 |
+ 0.243 |
+ 0.257 |
+ 0.271 |
+ 0.236 |
+ 0.257 |
+ 0.281 |
+ 0.257 |
+ 0.292 |
+ 0.424 |
+ 1301 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_CXB3N_Mattenberger_2021 |
+ 0.301 |
+ 0.281 |
+ 0.260 |
+ 0.299 |
+ 0.340 |
+ 0.348 |
+ -0.030 |
+ 0.255 |
+ 0.376 |
+ 0.374 |
+ 0.228 |
+ -0.036 |
+ 0.001 |
+ -0.047 |
+ -0.030 |
+ 0.124 |
+ 0.317 |
+ 0.324 |
+ 0.346 |
+ 0.259 |
+ 0.250 |
+ 0.283 |
+ 0.272 |
+ 0.264 |
+ 0.074 |
+ 0.287 |
+ 0.277 |
+ 0.261 |
+ 0.273 |
+ 0.365 |
+ 0.295 |
+ 0.229 |
+ -0.005 |
+ 0.036 |
+ 0.187 |
+ 0.248 |
+ 0.263 |
+ 0.276 |
+ 0.311 |
+ 0.288 |
+ 0.307 |
+ 0.347 |
+ -0.035 |
+ -0.039 |
+ 0.268 |
+ -0.030 |
+ 0.147 |
+ 0.268 |
+ 0.077 |
+ 0.040 |
+ 0.259 |
+ 0.286 |
+ 0.289 |
+ 0.291 |
+ 0.286 |
+ 0.288 |
+ 0.286 |
+ 0.285 |
+ 0.294 |
+ 0.290 |
+ 0.068 |
+ 0.051 |
+ 15711 |
+ OrganismalFitness |
+ POLG_CXB3N |
+ Medium |
+ Virus |
+
+
+ POLG_DEN26_Suphatrakul_2023 |
+ 0.357 |
+ 0.454 |
+ 0.184 |
+ 0.185 |
+ 0.413 |
+ 0.418 |
+ -0.014 |
+ 0.330 |
+ 0.535 |
+ 0.542 |
+ 0.241 |
+ 0.011 |
+ 0.020 |
+ -0.009 |
+ 0.025 |
+ 0.059 |
+ 0.122 |
+ 0.206 |
+ 0.274 |
+ 0.312 |
+ 0.334 |
+ 0.351 |
+ 0.344 |
+ 0.344 |
+ 0.327 |
+ 0.367 |
+ 0.371 |
+ 0.363 |
+ 0.352 |
+ 0.484 |
+ 0.468 |
+ 0.408 |
+ 0.029 |
+ -0.025 |
+ 0.087 |
+ 0.352 |
+ 0.331 |
+ 0.337 |
+ 0.422 |
+ 0.364 |
+ 0.315 |
+ 0.430 |
+ -0.028 |
+ -0.025 |
+ 0.319 |
+ 0.017 |
+ 0.282 |
+ 0.420 |
+ 0.099 |
+ 0.084 |
+ 0.195 |
+ 0.189 |
+ 0.198 |
+ 0.216 |
+ 0.184 |
+ 0.217 |
+ 0.197 |
+ 0.199 |
+ 0.190 |
+ 0.206 |
+ 0.115 |
+ 0.023 |
+ 16897 |
+ OrganismalFitness |
+ POLG_DEN26 |
+ Low |
+ Virus |
+
+
+ POLG_HCVJF_Qi_2014 |
+ 0.448 |
+ 0.432 |
+ 0.336 |
+ 0.346 |
+ 0.460 |
+ 0.472 |
+ -0.041 |
+ 0.152 |
+ 0.405 |
+ 0.423 |
+ 0.131 |
+ 0.507 |
+ 0.475 |
+ 0.074 |
+ 0.101 |
+ 0.082 |
+ 0.082 |
+ 0.062 |
+ 0.062 |
+ 0.158 |
+ 0.309 |
+ 0.358 |
+ 0.336 |
+ 0.376 |
+ 0.302 |
+ 0.376 |
+ 0.213 |
+ 0.314 |
+ 0.405 |
+ 0.465 |
+ 0.467 |
+ 0.399 |
+ 0.151 |
+ 0.361 |
+ 0.395 |
+ 0.386 |
+ 0.400 |
+ 0.428 |
+ 0.415 |
+ 0.353 |
+ 0.403 |
+ 0.423 |
+ 0.079 |
+ 0.047 |
+ 0.378 |
+ 0.084 |
+ -0.035 |
+ 0.260 |
+ 0.578 |
+ 0.311 |
+ 0.245 |
+ 0.292 |
+ 0.282 |
+ 0.297 |
+ 0.299 |
+ 0.227 |
+ 0.269 |
+ 0.299 |
+ 0.267 |
+ 0.309 |
+ 0.119 |
+ 0.143 |
+ 1630 |
+ OrganismalFitness |
+ POLG_HCVJF |
+ Medium |
+ Virus |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD |
+ 0.194 |
+ 0.352 |
+ 0.217 |
+ 0.263 |
+ 0.295 |
+ 0.297 |
+ 0.053 |
+ 0.269 |
+ 0.222 |
+ 0.256 |
+ 0.267 |
+ 0.041 |
+ 0.089 |
+ 0.048 |
+ 0.066 |
+ 0.032 |
+ 0.089 |
+ 0.064 |
+ 0.048 |
+ 0.390 |
+ 0.079 |
+ 0.060 |
+ 0.023 |
+ 0.116 |
+ 0.076 |
+ -0.032 |
+ -0.003 |
+ -0.042 |
+ 0.105 |
+ 0.309 |
+ 0.370 |
+ 0.377 |
+ 0.125 |
+ 0.023 |
+ 0.018 |
+ 0.009 |
+ 0.242 |
+ 0.249 |
+ 0.249 |
+ 0.302 |
+ 0.306 |
+ 0.305 |
+ -0.025 |
+ -0.054 |
+ -0.012 |
+ -0.035 |
+ 0.370 |
+ 0.297 |
+ 0.486 |
+ 0.434 |
+ 0.456 |
+ 0.462 |
+ 0.437 |
+ 0.518 |
+ 0.525 |
+ 0.529 |
+ 0.462 |
+ 0.530 |
+ 0.530 |
+ 0.511 |
+ 0.545 |
+ 0.466 |
+ 5130 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PPARG_HUMAN_Majithia_2016 |
+ 0.299 |
+ 0.373 |
+ 0.407 |
+ 0.410 |
+ 0.396 |
+ 0.397 |
+ 0.132 |
+ 0.263 |
+ 0.363 |
+ 0.383 |
+ 0.397 |
+ 0.405 |
+ 0.409 |
+ 0.040 |
+ 0.120 |
+ 0.347 |
+ 0.414 |
+ 0.469 |
+ 0.461 |
+ 0.434 |
+ 0.406 |
+ 0.389 |
+ 0.214 |
+ 0.256 |
+ 0.390 |
+ 0.407 |
+ 0.399 |
+ 0.414 |
+ 0.283 |
+ 0.450 |
+ 0.424 |
+ 0.411 |
+ 0.278 |
+ 0.430 |
+ 0.380 |
+ 0.342 |
+ 0.431 |
+ 0.414 |
+ 0.397 |
+ 0.445 |
+ 0.429 |
+ 0.423 |
+ 0.195 |
+ 0.042 |
+ 0.359 |
+ 0.320 |
+ 0.368 |
+ 0.355 |
+ 0.402 |
+ 0.200 |
+ 0.404 |
+ 0.412 |
+ 0.419 |
+ 0.408 |
+ 0.408 |
+ 0.417 |
+ 0.416 |
+ 0.414 |
+ 0.420 |
+ 0.420 |
+ 0.424 |
+ 0.352 |
+ 9576 |
+ Activity |
+ PPARG_HUMAN |
+ Medium |
+ Human |
+
+
+ PPM1D_HUMAN_Miller_2022 |
+ 0.438 |
+ 0.462 |
+ 0.454 |
+ 0.465 |
+ 0.508 |
+ 0.512 |
+ 0.005 |
+ 0.298 |
+ 0.340 |
+ 0.400 |
+ 0.462 |
+ 0.487 |
+ 0.495 |
+ 0.184 |
+ 0.258 |
+ 0.327 |
+ 0.494 |
+ 0.511 |
+ 0.509 |
+ 0.286 |
+ 0.327 |
+ 0.434 |
+ 0.412 |
+ 0.323 |
+ 0.421 |
+ 0.448 |
+ 0.448 |
+ 0.439 |
+ 0.307 |
+ 0.492 |
+ 0.488 |
+ 0.461 |
+ 0.216 |
+ 0.312 |
+ 0.427 |
+ 0.414 |
+ 0.478 |
+ 0.493 |
+ 0.504 |
+ 0.528 |
+ 0.509 |
+ 0.512 |
+ 0.229 |
+ -0.034 |
+ 0.465 |
+ 0.397 |
+ 0.389 |
+ 0.464 |
+ 0.437 |
+ 0.197 |
+ 0.467 |
+ 0.463 |
+ 0.460 |
+ 0.469 |
+ 0.483 |
+ 0.460 |
+ 0.477 |
+ 0.476 |
+ 0.479 |
+ 0.482 |
+ 0.520 |
+ 0.392 |
+ 7889 |
+ OrganismalFitness |
+ PPM1D_HUMAN |
+ Low |
+ Human |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC |
+ 0.580 |
+ 0.629 |
+ 0.690 |
+ 0.672 |
+ 0.724 |
+ 0.724 |
+ 0.413 |
+ 0.413 |
+ 0.743 |
+ 0.755 |
+ 0.696 |
+ 0.580 |
+ 0.623 |
+ 0.444 |
+ 0.428 |
+ 0.747 |
+ 0.769 |
+ 0.735 |
+ 0.712 |
+ 0.659 |
+ 0.497 |
+ 0.600 |
+ 0.613 |
+ 0.611 |
+ 0.539 |
+ 0.604 |
+ 0.672 |
+ 0.619 |
+ 0.653 |
+ 0.763 |
+ 0.789 |
+ 0.794 |
+ 0.560 |
+ 0.539 |
+ 0.436 |
+ 0.486 |
+ 0.670 |
+ 0.676 |
+ 0.667 |
+ 0.690 |
+ 0.716 |
+ 0.698 |
+ 0.367 |
+ 0.181 |
+ 0.190 |
+ 0.358 |
+ 0.472 |
+ 0.226 |
+ 0.633 |
+ 0.649 |
+ 0.789 |
+ 0.765 |
+ 0.775 |
+ 0.781 |
+ 0.787 |
+ 0.790 |
+ 0.790 |
+ 0.789 |
+ 0.806 |
+ 0.802 |
+ 0.731 |
+ 0.712 |
+ 2033 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PRKN_HUMAN_Clausen_2023 |
+ 0.505 |
+ 0.501 |
+ 0.505 |
+ 0.490 |
+ 0.509 |
+ 0.512 |
+ 0.104 |
+ 0.367 |
+ 0.425 |
+ 0.443 |
+ 0.458 |
+ 0.507 |
+ 0.527 |
+ 0.137 |
+ 0.183 |
+ 0.250 |
+ 0.389 |
+ 0.544 |
+ 0.553 |
+ 0.357 |
+ 0.207 |
+ 0.440 |
+ 0.478 |
+ 0.458 |
+ 0.359 |
+ 0.502 |
+ 0.493 |
+ 0.477 |
+ 0.424 |
+ 0.523 |
+ 0.482 |
+ 0.462 |
+ 0.217 |
+ 0.170 |
+ 0.430 |
+ 0.450 |
+ 0.490 |
+ 0.522 |
+ 0.520 |
+ 0.517 |
+ 0.526 |
+ 0.527 |
+ 0.206 |
+ 0.048 |
+ 0.472 |
+ 0.246 |
+ 0.511 |
+ 0.482 |
+ 0.552 |
+ 0.224 |
+ 0.416 |
+ 0.427 |
+ 0.436 |
+ 0.452 |
+ 0.440 |
+ 0.459 |
+ 0.453 |
+ 0.447 |
+ 0.449 |
+ 0.459 |
+ 0.556 |
+ 0.371 |
+ 8756 |
+ Expression |
+ PRKN_HUMAN |
+ Low |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE |
+ 0.518 |
+ 0.542 |
+ 0.508 |
+ 0.511 |
+ 0.526 |
+ 0.521 |
+ 0.335 |
+ 0.407 |
+ 0.428 |
+ 0.436 |
+ 0.602 |
+ 0.532 |
+ 0.529 |
+ 0.428 |
+ 0.516 |
+ 0.609 |
+ 0.627 |
+ 0.563 |
+ 0.490 |
+ 0.397 |
+ 0.311 |
+ 0.241 |
+ 0.433 |
+ 0.418 |
+ 0.407 |
+ 0.410 |
+ -0.129 |
+ 0.449 |
+ 0.459 |
+ 0.471 |
+ 0.490 |
+ 0.459 |
+ 0.161 |
+ 0.332 |
+ 0.394 |
+ 0.399 |
+ 0.511 |
+ 0.498 |
+ 0.459 |
+ 0.558 |
+ 0.568 |
+ 0.495 |
+ 0.423 |
+ 0.286 |
+ 0.586 |
+ 0.441 |
+ 0.532 |
+ 0.576 |
+ 0.638 |
+ 0.539 |
+ 0.617 |
+ 0.583 |
+ 0.602 |
+ 0.599 |
+ 0.591 |
+ 0.625 |
+ 0.591 |
+ 0.599 |
+ 0.612 |
+ 0.607 |
+ 0.622 |
+ 0.625 |
+ 1579 |
+ Stability |
+ PSAE_PICP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Matreyek_2021 |
+ 0.298 |
+ 0.304 |
+ 0.314 |
+ 0.304 |
+ 0.315 |
+ 0.320 |
+ 0.105 |
+ 0.263 |
+ 0.327 |
+ 0.368 |
+ 0.335 |
+ 0.310 |
+ 0.355 |
+ 0.117 |
+ 0.200 |
+ 0.380 |
+ 0.359 |
+ 0.195 |
+ 0.210 |
+ 0.348 |
+ 0.128 |
+ 0.339 |
+ 0.308 |
+ 0.298 |
+ 0.216 |
+ 0.264 |
+ 0.241 |
+ 0.280 |
+ 0.224 |
+ 0.342 |
+ 0.318 |
+ 0.323 |
+ 0.114 |
+ 0.195 |
+ 0.328 |
+ 0.272 |
+ 0.271 |
+ 0.346 |
+ 0.314 |
+ 0.324 |
+ 0.354 |
+ 0.339 |
+ 0.179 |
+ 0.033 |
+ 0.344 |
+ 0.255 |
+ 0.369 |
+ 0.348 |
+ 0.375 |
+ 0.157 |
+ 0.350 |
+ 0.346 |
+ 0.357 |
+ 0.354 |
+ 0.344 |
+ 0.362 |
+ 0.358 |
+ 0.365 |
+ 0.351 |
+ 0.365 |
+ 0.402 |
+ 0.366 |
+ 5083 |
+ Expression |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ PTEN_HUMAN_Mighell_2018 |
+ 0.399 |
+ 0.420 |
+ 0.396 |
+ 0.396 |
+ 0.417 |
+ 0.418 |
+ 0.112 |
+ 0.300 |
+ 0.418 |
+ 0.423 |
+ 0.395 |
+ 0.393 |
+ 0.412 |
+ 0.144 |
+ 0.301 |
+ 0.418 |
+ 0.430 |
+ 0.239 |
+ 0.225 |
+ 0.417 |
+ 0.257 |
+ 0.356 |
+ 0.276 |
+ 0.257 |
+ 0.343 |
+ 0.234 |
+ 0.245 |
+ 0.274 |
+ 0.212 |
+ 0.417 |
+ 0.391 |
+ 0.390 |
+ 0.058 |
+ 0.300 |
+ 0.346 |
+ 0.257 |
+ 0.411 |
+ 0.401 |
+ 0.368 |
+ 0.417 |
+ 0.431 |
+ 0.431 |
+ 0.280 |
+ -0.032 |
+ 0.415 |
+ 0.379 |
+ 0.334 |
+ 0.338 |
+ 0.398 |
+ 0.153 |
+ 0.405 |
+ 0.395 |
+ 0.405 |
+ 0.412 |
+ 0.419 |
+ 0.414 |
+ 0.412 |
+ 0.407 |
+ 0.420 |
+ 0.425 |
+ 0.428 |
+ 0.388 |
+ 7260 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q2N0S5_9HIV1_Haddox_2018 |
+ 0.361 |
+ 0.321 |
+ 0.279 |
+ 0.304 |
+ 0.378 |
+ 0.376 |
+ 0.024 |
+ 0.327 |
+ 0.396 |
+ 0.401 |
+ 0.350 |
+ 0.391 |
+ 0.414 |
+ 0.034 |
+ 0.030 |
+ 0.024 |
+ 0.054 |
+ 0.078 |
+ 0.104 |
+ 0.303 |
+ 0.394 |
+ 0.301 |
+ 0.289 |
+ 0.244 |
+ 0.392 |
+ 0.287 |
+ 0.284 |
+ 0.327 |
+ 0.260 |
+ 0.379 |
+ 0.378 |
+ 0.369 |
+ 0.222 |
+ 0.371 |
+ 0.318 |
+ 0.295 |
+ 0.390 |
+ 0.384 |
+ 0.379 |
+ 0.397 |
+ 0.394 |
+ 0.387 |
+ 0.340 |
+ 0.020 |
+ 0.385 |
+ 0.345 |
+ 0.292 |
+ 0.348 |
+ 0.199 |
+ 0.106 |
+ 0.166 |
+ 0.207 |
+ 0.230 |
+ 0.226 |
+ 0.188 |
+ 0.193 |
+ 0.194 |
+ 0.185 |
+ 0.167 |
+ 0.218 |
+ 0.151 |
+ 0.079 |
+ 12729 |
+ OrganismalFitness |
+ Q2N0S5_9HIV1 |
+ Medium |
+ Virus |
+
+
+ Q53Z42_HUMAN_McShan_2019_binding-TAPBPR |
+ 0.240 |
+ 0.215 |
+ 0.228 |
+ 0.224 |
+ 0.236 |
+ 0.231 |
+ 0.083 |
+ 0.150 |
+ 0.194 |
+ 0.201 |
+ 0.134 |
+ 0.172 |
+ 0.190 |
+ 0.161 |
+ 0.135 |
+ 0.240 |
+ 0.206 |
+ 0.177 |
+ 0.237 |
+ 0.208 |
+ 0.176 |
+ 0.188 |
+ 0.172 |
+ 0.134 |
+ 0.191 |
+ 0.191 |
+ 0.178 |
+ 0.209 |
+ 0.176 |
+ 0.213 |
+ 0.257 |
+ 0.246 |
+ 0.051 |
+ 0.163 |
+ 0.179 |
+ 0.104 |
+ 0.214 |
+ 0.230 |
+ 0.205 |
+ 0.240 |
+ 0.255 |
+ 0.242 |
+ 0.309 |
+ 0.165 |
+ 0.184 |
+ 0.206 |
+ 0.220 |
+ 0.243 |
+ 0.240 |
+ 0.135 |
+ 0.193 |
+ 0.194 |
+ 0.220 |
+ 0.225 |
+ 0.207 |
+ 0.199 |
+ 0.193 |
+ 0.205 |
+ 0.203 |
+ 0.206 |
+ 0.188 |
+ 0.322 |
+ 3344 |
+ Binding |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q53Z42_HUMAN_McShan_2019_expression |
+ 0.411 |
+ 0.375 |
+ 0.416 |
+ 0.423 |
+ 0.425 |
+ 0.425 |
+ -0.034 |
+ 0.319 |
+ 0.408 |
+ 0.423 |
+ 0.319 |
+ 0.365 |
+ 0.387 |
+ 0.059 |
+ 0.069 |
+ 0.382 |
+ 0.435 |
+ 0.408 |
+ 0.440 |
+ 0.428 |
+ 0.333 |
+ 0.371 |
+ 0.381 |
+ 0.350 |
+ 0.368 |
+ 0.403 |
+ 0.389 |
+ 0.423 |
+ 0.416 |
+ 0.431 |
+ 0.418 |
+ 0.375 |
+ 0.108 |
+ 0.326 |
+ 0.355 |
+ 0.343 |
+ 0.419 |
+ 0.420 |
+ 0.435 |
+ 0.452 |
+ 0.456 |
+ 0.440 |
+ 0.247 |
+ 0.045 |
+ 0.413 |
+ 0.357 |
+ 0.356 |
+ 0.407 |
+ 0.361 |
+ 0.144 |
+ 0.438 |
+ 0.438 |
+ 0.444 |
+ 0.443 |
+ 0.453 |
+ 0.438 |
+ 0.429 |
+ 0.442 |
+ 0.456 |
+ 0.450 |
+ 0.400 |
+ 0.472 |
+ 3344 |
+ Expression |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q59976_STRSQ_Romero_2015 |
+ 0.418 |
+ 0.490 |
+ 0.508 |
+ 0.518 |
+ 0.516 |
+ 0.523 |
+ 0.235 |
+ 0.408 |
+ 0.546 |
+ 0.550 |
+ 0.471 |
+ 0.415 |
+ 0.415 |
+ 0.085 |
+ 0.325 |
+ 0.380 |
+ 0.431 |
+ 0.462 |
+ 0.453 |
+ 0.507 |
+ 0.459 |
+ 0.505 |
+ 0.519 |
+ 0.523 |
+ 0.481 |
+ 0.525 |
+ 0.528 |
+ 0.537 |
+ 0.561 |
+ 0.528 |
+ 0.514 |
+ 0.452 |
+ 0.265 |
+ 0.477 |
+ 0.514 |
+ 0.501 |
+ 0.484 |
+ 0.518 |
+ 0.528 |
+ 0.526 |
+ 0.536 |
+ 0.537 |
+ 0.372 |
+ -0.004 |
+ 0.456 |
+ 0.398 |
+ 0.317 |
+ 0.411 |
+ 0.407 |
+ 0.093 |
+ 0.456 |
+ 0.422 |
+ 0.409 |
+ 0.429 |
+ 0.435 |
+ 0.426 |
+ 0.433 |
+ 0.419 |
+ 0.438 |
+ 0.433 |
+ 0.488 |
+ 0.415 |
+ 2999 |
+ Activity |
+ Q59976_STRSQ |
+ Medium |
+ Prokaryote |
+
+
+ Q6WV13_9MAXI_Somermeyer_2022 |
+ 0.175 |
+ 0.194 |
+ 0.137 |
+ 0.140 |
+ 0.180 |
+ 0.180 |
+ -0.009 |
+ 0.096 |
+ 0.200 |
+ 0.196 |
+ 0.140 |
+ 0.027 |
+ 0.018 |
+ 0.008 |
+ 0.024 |
+ 0.006 |
+ 0.007 |
+ 0.013 |
+ -0.011 |
+ 0.161 |
+ 0.026 |
+ 0.024 |
+ 0.002 |
+ 0.071 |
+ 0.021 |
+ -0.000 |
+ 0.020 |
+ 0.000 |
+ -0.001 |
+ 0.233 |
+ 0.167 |
+ 0.171 |
+ 0.026 |
+ 0.015 |
+ 0.025 |
+ 0.020 |
+ 0.130 |
+ 0.132 |
+ 0.137 |
+ 0.173 |
+ 0.173 |
+ 0.172 |
+ 0.006 |
+ 0.005 |
+ -0.002 |
+ 0.012 |
+ 0.138 |
+ 0.150 |
+ 0.215 |
+ 0.129 |
+ 0.153 |
+ 0.164 |
+ 0.151 |
+ 0.175 |
+ 0.168 |
+ 0.164 |
+ 0.160 |
+ 0.167 |
+ 0.163 |
+ 0.164 |
+ 0.068 |
+ 0.047 |
+ 31401 |
+ Activity |
+ Q6WV12_9MAXI |
+ Low |
+ Eukaryote |
+
+
+ Q837P4_ENTFA_Meier_2023 |
+ 0.257 |
+ 0.303 |
+ 0.280 |
+ 0.252 |
+ 0.280 |
+ 0.326 |
+ 0.315 |
+ 0.240 |
+ 0.269 |
+ 0.297 |
+ 0.355 |
+ 0.320 |
+ 0.338 |
+ 0.274 |
+ 0.309 |
+ 0.309 |
+ 0.286 |
+ 0.355 |
+ 0.315 |
+ -0.024 |
+ 0.355 |
+ 0.303 |
+ 0.320 |
+ 0.361 |
+ 0.349 |
+ 0.332 |
+ 0.366 |
+ 0.252 |
+ 0.372 |
+ 0.303 |
+ 0.343 |
+ 0.333 |
+ 0.079 |
+ 0.286 |
+ 0.378 |
+ 0.338 |
+ 0.274 |
+ 0.378 |
+ 0.384 |
+ 0.332 |
+ 0.320 |
+ 0.320 |
+ 0.257 |
+ 0.229 |
+ 0.372 |
+ 0.297 |
+ 0.229 |
+ 0.326 |
+ 0.246 |
+ 0.068 |
+ 0.315 |
+ 0.274 |
+ 0.309 |
+ 0.326 |
+ 0.303 |
+ 0.338 |
+ 0.338 |
+ 0.309 |
+ 0.326 |
+ 0.338 |
+ 0.349 |
+ 0.332 |
+ 697 |
+ Activity |
+ Q837P4_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q837P5_ENTFA_Meier_2023 |
+ 0.082 |
+ 0.264 |
+ 0.291 |
+ 0.291 |
+ 0.232 |
+ 0.232 |
+ 0.055 |
+ 0.125 |
+ 0.173 |
+ 0.200 |
+ 0.168 |
+ 0.200 |
+ 0.200 |
+ 0.076 |
+ 0.125 |
+ 0.200 |
+ 0.216 |
+ 0.291 |
+ 0.259 |
+ 0.205 |
+ 0.248 |
+ 0.270 |
+ 0.275 |
+ 0.253 |
+ 0.216 |
+ 0.253 |
+ 0.264 |
+ 0.345 |
+ 0.302 |
+ 0.200 |
+ 0.248 |
+ 0.227 |
+ 0.060 |
+ 0.248 |
+ 0.302 |
+ 0.334 |
+ 0.184 |
+ 0.248 |
+ 0.312 |
+ 0.259 |
+ 0.270 |
+ 0.280 |
+ 0.146 |
+ 0.135 |
+ 0.227 |
+ 0.119 |
+ 0.253 |
+ 0.221 |
+ 0.275 |
+ 0.098 |
+ 0.189 |
+ 0.205 |
+ 0.237 |
+ 0.216 |
+ 0.237 |
+ 0.237 |
+ 0.189 |
+ 0.232 |
+ 0.243 |
+ 0.232 |
+ 0.211 |
+ 0.189 |
+ 747 |
+ Activity |
+ Q837P5_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q8WTC7_9CNID_Somermeyer_2022 |
+ 0.162 |
+ 0.210 |
+ 0.127 |
+ 0.125 |
+ 0.189 |
+ 0.185 |
+ -0.004 |
+ 0.187 |
+ 0.169 |
+ 0.173 |
+ 0.111 |
+ -0.038 |
+ -0.048 |
+ -0.042 |
+ -0.059 |
+ -0.054 |
+ -0.048 |
+ -0.049 |
+ -0.023 |
+ 0.129 |
+ -0.036 |
+ -0.005 |
+ 0.022 |
+ -0.019 |
+ -0.024 |
+ -0.039 |
+ -0.020 |
+ 0.138 |
+ 0.144 |
+ 0.203 |
+ 0.187 |
+ 0.194 |
+ -0.029 |
+ -0.064 |
+ -0.015 |
+ 0.180 |
+ 0.123 |
+ 0.130 |
+ 0.180 |
+ 0.173 |
+ 0.182 |
+ 0.202 |
+ -0.035 |
+ -0.035 |
+ -0.053 |
+ -0.054 |
+ 0.007 |
+ 0.054 |
+ 0.161 |
+ 0.087 |
+ 0.126 |
+ 0.119 |
+ 0.126 |
+ 0.133 |
+ 0.120 |
+ 0.129 |
+ 0.130 |
+ 0.120 |
+ 0.116 |
+ 0.127 |
+ 0.038 |
+ -0.070 |
+ 33510 |
+ Activity |
+ Q8WTC7_9CNID |
+ Low |
+ Eukaryote |
+
+
+ R1AB_SARS2_Flynn_2022 |
+ 0.477 |
+ 0.427 |
+ 0.185 |
+ 0.221 |
+ 0.517 |
+ 0.514 |
+ -0.011 |
+ 0.227 |
+ 0.013 |
+ 0.013 |
+ 0.094 |
+ 0.010 |
+ 0.002 |
+ 0.038 |
+ 0.015 |
+ 0.067 |
+ 0.096 |
+ 0.395 |
+ 0.456 |
+ 0.228 |
+ 0.186 |
+ 0.239 |
+ 0.258 |
+ 0.233 |
+ 0.211 |
+ 0.225 |
+ 0.173 |
+ 0.177 |
+ 0.215 |
+ 0.460 |
+ 0.414 |
+ 0.331 |
+ -0.026 |
+ 0.152 |
+ 0.190 |
+ 0.174 |
+ 0.308 |
+ 0.339 |
+ 0.330 |
+ 0.463 |
+ 0.484 |
+ 0.480 |
+ 0.006 |
+ -0.005 |
+ 0.063 |
+ -0.003 |
+ 0.337 |
+ 0.310 |
+ 0.367 |
+ 0.162 |
+ 0.197 |
+ 0.156 |
+ 0.216 |
+ 0.216 |
+ 0.201 |
+ 0.201 |
+ 0.204 |
+ 0.200 |
+ 0.173 |
+ 0.205 |
+ 0.170 |
+ 0.097 |
+ 5725 |
+ OrganismalFitness |
+ R1AB_SARS2 |
+ Medium |
+ Virus |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ |
+ 0.204 |
+ 0.112 |
+ 0.226 |
+ 0.213 |
+ 0.230 |
+ 0.226 |
+ 0.362 |
+ 0.283 |
+ 0.542 |
+ 0.485 |
+ 0.415 |
+ 0.380 |
+ 0.397 |
+ 0.327 |
+ 0.520 |
+ 0.560 |
+ 0.353 |
+ 0.353 |
+ 0.432 |
+ 0.279 |
+ 0.432 |
+ 0.476 |
+ 0.441 |
+ 0.362 |
+ 0.459 |
+ 0.305 |
+ 0.472 |
+ 0.345 |
+ 0.318 |
+ 0.424 |
+ 0.279 |
+ 0.191 |
+ 0.248 |
+ 0.388 |
+ 0.525 |
+ 0.345 |
+ 0.331 |
+ 0.459 |
+ 0.371 |
+ 0.349 |
+ 0.388 |
+ 0.309 |
+ 0.472 |
+ 0.195 |
+ 0.200 |
+ 0.323 |
+ 0.380 |
+ 0.261 |
+ 0.586 |
+ 0.380 |
+ 0.380 |
+ 0.406 |
+ 0.415 |
+ 0.415 |
+ 0.419 |
+ 0.410 |
+ 0.384 |
+ 0.432 |
+ 0.393 |
+ 0.428 |
+ 0.283 |
+ 0.463 |
+ 912 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RAF1_HUMAN_Zinkus-Boltz_2019 |
+ 0.394 |
+ 0.367 |
+ 0.367 |
+ 0.367 |
+ 0.354 |
+ 0.367 |
+ 0.044 |
+ 0.286 |
+ 0.354 |
+ 0.367 |
+ 0.394 |
+ 0.354 |
+ 0.421 |
+ 0.017 |
+ 0.192 |
+ 0.354 |
+ 0.394 |
+ 0.381 |
+ 0.340 |
+ 0.246 |
+ 0.205 |
+ 0.313 |
+ 0.340 |
+ 0.300 |
+ 0.286 |
+ 0.313 |
+ 0.286 |
+ 0.259 |
+ 0.259 |
+ 0.340 |
+ 0.394 |
+ 0.300 |
+ 0.111 |
+ 0.246 |
+ 0.232 |
+ 0.286 |
+ 0.340 |
+ 0.300 |
+ 0.313 |
+ 0.354 |
+ 0.367 |
+ 0.394 |
+ 0.165 |
+ 0.071 |
+ 0.421 |
+ 0.327 |
+ 0.232 |
+ 0.354 |
+ 0.192 |
+ 0.286 |
+ 0.340 |
+ 0.354 |
+ 0.300 |
+ 0.340 |
+ 0.300 |
+ 0.407 |
+ 0.340 |
+ 0.354 |
+ 0.367 |
+ 0.327 |
+ 0.354 |
+ 0.111 |
+ 297 |
+ OrganismalFitness |
+ RAF1_HUMAN |
+ Low |
+ Human |
+
+
+ RASH_HUMAN_Bandaru_2017 |
+ 0.397 |
+ 0.442 |
+ 0.439 |
+ 0.467 |
+ 0.454 |
+ 0.465 |
+ 0.288 |
+ 0.376 |
+ 0.464 |
+ 0.479 |
+ 0.368 |
+ 0.410 |
+ 0.442 |
+ 0.417 |
+ 0.464 |
+ 0.443 |
+ 0.489 |
+ 0.453 |
+ 0.317 |
+ 0.497 |
+ 0.451 |
+ 0.403 |
+ 0.403 |
+ 0.388 |
+ 0.414 |
+ 0.400 |
+ 0.407 |
+ 0.347 |
+ 0.277 |
+ 0.422 |
+ 0.399 |
+ 0.378 |
+ 0.199 |
+ 0.375 |
+ 0.407 |
+ 0.338 |
+ 0.449 |
+ 0.450 |
+ 0.425 |
+ 0.455 |
+ 0.461 |
+ 0.475 |
+ 0.480 |
+ 0.186 |
+ 0.278 |
+ 0.454 |
+ 0.289 |
+ 0.177 |
+ 0.414 |
+ 0.198 |
+ 0.419 |
+ 0.415 |
+ 0.430 |
+ 0.429 |
+ 0.422 |
+ 0.422 |
+ 0.436 |
+ 0.421 |
+ 0.432 |
+ 0.449 |
+ 0.349 |
+ 0.444 |
+ 3134 |
+ Activity |
+ RASH_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_abundance |
+ 0.232 |
+ 0.200 |
+ 0.247 |
+ 0.233 |
+ 0.226 |
+ 0.236 |
+ 0.173 |
+ 0.179 |
+ 0.177 |
+ 0.203 |
+ 0.219 |
+ 0.167 |
+ 0.182 |
+ 0.213 |
+ 0.256 |
+ 0.233 |
+ 0.188 |
+ 0.158 |
+ 0.161 |
+ 0.232 |
+ 0.211 |
+ 0.253 |
+ 0.272 |
+ 0.289 |
+ 0.213 |
+ 0.252 |
+ 0.311 |
+ 0.251 |
+ 0.358 |
+ 0.259 |
+ 0.148 |
+ 0.111 |
+ 0.155 |
+ 0.146 |
+ 0.223 |
+ 0.307 |
+ 0.199 |
+ 0.258 |
+ 0.314 |
+ 0.232 |
+ 0.254 |
+ 0.268 |
+ 0.172 |
+ 0.211 |
+ 0.045 |
+ 0.181 |
+ 0.234 |
+ 0.245 |
+ 0.257 |
+ 0.230 |
+ 0.182 |
+ 0.165 |
+ 0.205 |
+ 0.197 |
+ 0.222 |
+ 0.199 |
+ 0.206 |
+ 0.197 |
+ 0.177 |
+ 0.201 |
+ 0.245 |
+ 0.231 |
+ 26012 |
+ Expression |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_binding-DARPin_K55 |
+ 0.390 |
+ 0.388 |
+ 0.484 |
+ 0.489 |
+ 0.468 |
+ 0.472 |
+ 0.253 |
+ 0.315 |
+ 0.483 |
+ 0.493 |
+ 0.469 |
+ 0.414 |
+ 0.460 |
+ 0.439 |
+ 0.482 |
+ 0.504 |
+ 0.499 |
+ 0.417 |
+ 0.332 |
+ 0.541 |
+ 0.421 |
+ 0.424 |
+ 0.446 |
+ 0.377 |
+ 0.394 |
+ 0.439 |
+ 0.404 |
+ 0.300 |
+ 0.276 |
+ 0.493 |
+ 0.408 |
+ 0.359 |
+ 0.240 |
+ 0.382 |
+ 0.443 |
+ 0.391 |
+ 0.420 |
+ 0.469 |
+ 0.440 |
+ 0.467 |
+ 0.482 |
+ 0.475 |
+ 0.438 |
+ 0.219 |
+ 0.337 |
+ 0.475 |
+ 0.180 |
+ 0.208 |
+ 0.466 |
+ 0.212 |
+ 0.412 |
+ 0.411 |
+ 0.421 |
+ 0.421 |
+ 0.446 |
+ 0.444 |
+ 0.410 |
+ 0.423 |
+ 0.365 |
+ 0.431 |
+ 0.476 |
+ 0.476 |
+ 24873 |
+ Binding |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RBP1_HUMAN_Tsuboyama_2023_2KWH |
+ 0.165 |
+ 0.072 |
+ 0.264 |
+ 0.264 |
+ 0.267 |
+ 0.267 |
+ 0.231 |
+ 0.243 |
+ 0.210 |
+ 0.210 |
+ 0.363 |
+ 0.318 |
+ 0.318 |
+ 0.255 |
+ 0.321 |
+ 0.378 |
+ 0.426 |
+ 0.300 |
+ 0.273 |
+ 0.180 |
+ 0.282 |
+ 0.060 |
+ 0.228 |
+ 0.213 |
+ 0.207 |
+ 0.111 |
+ 0.117 |
+ 0.279 |
+ 0.192 |
+ 0.339 |
+ 0.318 |
+ 0.288 |
+ 0.264 |
+ 0.204 |
+ 0.309 |
+ 0.312 |
+ 0.255 |
+ 0.270 |
+ 0.279 |
+ 0.264 |
+ 0.264 |
+ 0.264 |
+ 0.255 |
+ 0.186 |
+ 0.327 |
+ 0.333 |
+ 0.333 |
+ 0.378 |
+ 0.462 |
+ 0.345 |
+ 0.378 |
+ 0.378 |
+ 0.411 |
+ 0.384 |
+ 0.393 |
+ 0.399 |
+ 0.387 |
+ 0.393 |
+ 0.381 |
+ 0.390 |
+ 0.432 |
+ 0.417 |
+ 1332 |
+ Stability |
+ RBP1_HUMAN |
+ High |
+ Human |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO |
+ 0.277 |
+ 0.274 |
+ 0.331 |
+ 0.340 |
+ 0.331 |
+ 0.331 |
+ 0.213 |
+ 0.217 |
+ 0.328 |
+ 0.340 |
+ 0.372 |
+ 0.197 |
+ 0.236 |
+ 0.258 |
+ 0.226 |
+ 0.461 |
+ 0.464 |
+ 0.451 |
+ 0.429 |
+ 0.315 |
+ 0.178 |
+ 0.182 |
+ 0.242 |
+ 0.305 |
+ 0.239 |
+ 0.410 |
+ 0.416 |
+ 0.359 |
+ 0.423 |
+ 0.385 |
+ 0.467 |
+ 0.391 |
+ 0.042 |
+ 0.185 |
+ 0.239 |
+ 0.232 |
+ 0.299 |
+ 0.305 |
+ 0.283 |
+ 0.362 |
+ 0.369 |
+ 0.337 |
+ 0.229 |
+ 0.194 |
+ 0.356 |
+ 0.229 |
+ 0.388 |
+ 0.391 |
+ 0.464 |
+ 0.439 |
+ 0.461 |
+ 0.461 |
+ 0.470 |
+ 0.505 |
+ 0.467 |
+ 0.496 |
+ 0.470 |
+ 0.480 |
+ 0.499 |
+ 0.483 |
+ 0.464 |
+ 0.381 |
+ 1261 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RCRO_LAMBD_Tsuboyama_2023_1ORC |
+ 0.286 |
+ 0.414 |
+ 0.490 |
+ 0.471 |
+ 0.450 |
+ 0.474 |
+ 0.074 |
+ 0.230 |
+ 0.406 |
+ 0.414 |
+ 0.400 |
+ 0.256 |
+ 0.334 |
+ 0.146 |
+ 0.228 |
+ 0.205 |
+ 0.420 |
+ 0.432 |
+ 0.432 |
+ 0.488 |
+ 0.018 |
+ 0.065 |
+ 0.095 |
+ 0.091 |
+ -0.079 |
+ 0.049 |
+ -0.060 |
+ -0.002 |
+ 0.479 |
+ 0.460 |
+ 0.427 |
+ 0.416 |
+ 0.018 |
+ 0.054 |
+ 0.042 |
+ 0.425 |
+ 0.360 |
+ 0.332 |
+ 0.465 |
+ 0.450 |
+ 0.457 |
+ 0.511 |
+ -0.109 |
+ 0.053 |
+ 0.234 |
+ 0.042 |
+ 0.367 |
+ 0.442 |
+ 0.543 |
+ 0.432 |
+ 0.427 |
+ 0.411 |
+ 0.414 |
+ 0.404 |
+ 0.406 |
+ 0.409 |
+ 0.411 |
+ 0.409 |
+ 0.418 |
+ 0.418 |
+ 0.453 |
+ 0.320 |
+ 2278 |
+ Stability |
+ RCRO_LAMBD |
+ High |
+ Virus |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY |
+ 0.235 |
+ 0.168 |
+ 0.266 |
+ 0.282 |
+ 0.305 |
+ 0.309 |
+ 0.121 |
+ 0.344 |
+ 0.419 |
+ 0.419 |
+ 0.384 |
+ 0.450 |
+ 0.462 |
+ 0.156 |
+ 0.521 |
+ 0.458 |
+ 0.423 |
+ 0.399 |
+ 0.286 |
+ 0.344 |
+ 0.301 |
+ 0.321 |
+ 0.313 |
+ 0.356 |
+ 0.403 |
+ 0.372 |
+ 0.372 |
+ 0.380 |
+ 0.388 |
+ 0.364 |
+ 0.356 |
+ 0.339 |
+ 0.305 |
+ 0.231 |
+ 0.380 |
+ 0.380 |
+ 0.290 |
+ 0.321 |
+ 0.368 |
+ 0.305 |
+ 0.317 |
+ 0.329 |
+ 0.384 |
+ 0.023 |
+ 0.415 |
+ 0.466 |
+ 0.372 |
+ 0.392 |
+ 0.447 |
+ 0.376 |
+ 0.360 |
+ 0.341 |
+ 0.360 |
+ 0.344 |
+ 0.380 |
+ 0.384 |
+ 0.368 |
+ 0.368 |
+ 0.380 |
+ 0.380 |
+ 0.368 |
+ 0.423 |
+ 1019 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RDRP_I33A0_Li_2023 |
+ 0.221 |
+ 0.252 |
+ 0.281 |
+ 0.286 |
+ 0.362 |
+ 0.367 |
+ 0.042 |
+ 0.289 |
+ 0.381 |
+ 0.385 |
+ 0.146 |
+ 0.072 |
+ 0.079 |
+ 0.064 |
+ 0.058 |
+ 0.111 |
+ 0.278 |
+ 0.292 |
+ 0.368 |
+ 0.330 |
+ 0.286 |
+ 0.331 |
+ 0.341 |
+ 0.355 |
+ 0.125 |
+ 0.267 |
+ 0.263 |
+ 0.259 |
+ 0.307 |
+ 0.357 |
+ 0.336 |
+ 0.292 |
+ 0.086 |
+ 0.278 |
+ 0.321 |
+ 0.347 |
+ 0.317 |
+ 0.349 |
+ 0.358 |
+ 0.368 |
+ 0.380 |
+ 0.392 |
+ 0.052 |
+ 0.035 |
+ 0.150 |
+ 0.054 |
+ 0.170 |
+ 0.191 |
+ 0.181 |
+ 0.041 |
+ 0.233 |
+ 0.215 |
+ 0.227 |
+ 0.229 |
+ 0.240 |
+ 0.242 |
+ 0.229 |
+ 0.247 |
+ 0.253 |
+ 0.248 |
+ 0.130 |
+ 0.096 |
+ 12003 |
+ OrganismalFitness |
+ RDRP_I33A0 |
+ Low |
+ Virus |
+
+
+ REV_HV1H2_Fernandes_2016 |
+ 0.132 |
+ 0.128 |
+ 0.141 |
+ 0.152 |
+ 0.156 |
+ 0.154 |
+ 0.033 |
+ 0.219 |
+ 0.132 |
+ 0.149 |
+ 0.104 |
+ 0.178 |
+ 0.195 |
+ 0.029 |
+ 0.018 |
+ 0.109 |
+ 0.152 |
+ 0.206 |
+ 0.184 |
+ 0.115 |
+ 0.141 |
+ 0.180 |
+ 0.171 |
+ 0.141 |
+ 0.206 |
+ 0.190 |
+ 0.106 |
+ 0.175 |
+ 0.190 |
+ 0.191 |
+ 0.245 |
+ 0.266 |
+ 0.024 |
+ 0.143 |
+ 0.182 |
+ 0.173 |
+ 0.178 |
+ 0.182 |
+ 0.162 |
+ 0.182 |
+ 0.190 |
+ 0.167 |
+ 0.031 |
+ 0.037 |
+ 0.145 |
+ 0.044 |
+ 0.177 |
+ 0.160 |
+ 0.201 |
+ 0.139 |
+ 0.141 |
+ 0.143 |
+ 0.203 |
+ 0.188 |
+ 0.177 |
+ 0.204 |
+ 0.206 |
+ 0.134 |
+ 0.219 |
+ 0.193 |
+ 0.208 |
+ 0.136 |
+ 2147 |
+ OrganismalFitness |
+ REV_HV1H2 |
+ Medium |
+ Virus |
+
+
+ RFAH_ECOLI_Tsuboyama_2023_2LCL |
+ -0.026 |
+ 0.176 |
+ 0.240 |
+ 0.264 |
+ 0.222 |
+ 0.231 |
+ -0.014 |
+ 0.192 |
+ 0.195 |
+ 0.201 |
+ 0.207 |
+ 0.125 |
+ 0.119 |
+ -0.053 |
+ 0.023 |
+ 0.005 |
+ 0.240 |
+ 0.261 |
+ 0.204 |
+ 0.222 |
+ -0.068 |
+ 0.032 |
+ 0.137 |
+ 0.101 |
+ -0.053 |
+ 0.116 |
+ 0.152 |
+ 0.125 |
+ 0.149 |
+ 0.164 |
+ 0.270 |
+ 0.247 |
+ 0.014 |
+ 0.002 |
+ 0.113 |
+ 0.152 |
+ 0.053 |
+ 0.140 |
+ 0.131 |
+ 0.192 |
+ 0.179 |
+ 0.195 |
+ 0.053 |
+ -0.104 |
+ 0.053 |
+ -0.014 |
+ 0.173 |
+ 0.137 |
+ 0.252 |
+ 0.234 |
+ 0.279 |
+ 0.261 |
+ 0.252 |
+ 0.267 |
+ 0.234 |
+ 0.255 |
+ 0.255 |
+ 0.276 |
+ 0.255 |
+ 0.264 |
+ 0.198 |
+ 0.104 |
+ 1326 |
+ Stability |
+ RFAH_ECOLI |
+ High |
+ Prokaryote |
+
+
+ RL20_AQUAE_Tsuboyama_2023_1GYZ |
+ 0.353 |
+ 0.499 |
+ 0.536 |
+ 0.536 |
+ 0.514 |
+ 0.525 |
+ 0.190 |
+ 0.536 |
+ 0.308 |
+ 0.255 |
+ 0.567 |
+ 0.564 |
+ 0.539 |
+ 0.128 |
+ 0.308 |
+ 0.351 |
+ 0.601 |
+ 0.581 |
+ 0.612 |
+ 0.556 |
+ 0.157 |
+ 0.488 |
+ 0.446 |
+ 0.440 |
+ 0.415 |
+ 0.438 |
+ 0.438 |
+ 0.471 |
+ 0.547 |
+ 0.525 |
+ 0.502 |
+ 0.487 |
+ 0.044 |
+ 0.415 |
+ 0.429 |
+ 0.415 |
+ 0.466 |
+ 0.485 |
+ 0.466 |
+ 0.514 |
+ 0.522 |
+ 0.514 |
+ -0.043 |
+ -0.113 |
+ 0.381 |
+ 0.207 |
+ 0.589 |
+ 0.544 |
+ 0.665 |
+ 0.595 |
+ 0.637 |
+ 0.623 |
+ 0.615 |
+ 0.657 |
+ 0.643 |
+ 0.626 |
+ 0.646 |
+ 0.632 |
+ 0.626 |
+ 0.646 |
+ 0.663 |
+ 0.544 |
+ 1461 |
+ Stability |
+ RL20_AQUAE |
+ High |
+ Prokaryote |
+
+
+ RL40A_YEAST_Mavor_2016 |
+ 0.167 |
+ 0.253 |
+ 0.257 |
+ 0.302 |
+ 0.264 |
+ 0.298 |
+ 0.095 |
+ 0.298 |
+ 0.333 |
+ 0.322 |
+ 0.171 |
+ 0.202 |
+ 0.229 |
+ 0.023 |
+ 0.343 |
+ 0.377 |
+ 0.405 |
+ 0.308 |
+ 0.425 |
+ 0.333 |
+ 0.253 |
+ 0.415 |
+ 0.384 |
+ 0.322 |
+ 0.367 |
+ 0.312 |
+ 0.329 |
+ 0.322 |
+ 0.288 |
+ 0.222 |
+ 0.291 |
+ 0.310 |
+ 0.016 |
+ 0.277 |
+ 0.339 |
+ 0.295 |
+ 0.312 |
+ 0.333 |
+ 0.295 |
+ 0.346 |
+ 0.343 |
+ 0.291 |
+ 0.216 |
+ -0.001 |
+ 0.164 |
+ 0.209 |
+ 0.040 |
+ 0.116 |
+ 0.136 |
+ -0.005 |
+ 0.374 |
+ 0.357 |
+ 0.388 |
+ 0.377 |
+ 0.374 |
+ 0.381 |
+ 0.353 |
+ 0.384 |
+ 0.384 |
+ 0.388 |
+ 0.247 |
+ 0.260 |
+ 1253 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2013 |
+ 0.183 |
+ 0.284 |
+ 0.294 |
+ 0.339 |
+ 0.301 |
+ 0.319 |
+ 0.054 |
+ 0.256 |
+ 0.364 |
+ 0.374 |
+ 0.159 |
+ 0.221 |
+ 0.277 |
+ 0.037 |
+ 0.332 |
+ 0.367 |
+ 0.451 |
+ 0.336 |
+ 0.461 |
+ 0.360 |
+ 0.280 |
+ 0.458 |
+ 0.395 |
+ 0.357 |
+ 0.395 |
+ 0.346 |
+ 0.385 |
+ 0.353 |
+ 0.326 |
+ 0.277 |
+ 0.360 |
+ 0.367 |
+ 0.030 |
+ 0.312 |
+ 0.402 |
+ 0.312 |
+ 0.343 |
+ 0.388 |
+ 0.305 |
+ 0.371 |
+ 0.409 |
+ 0.332 |
+ 0.260 |
+ 0.013 |
+ 0.187 |
+ 0.221 |
+ 0.037 |
+ 0.093 |
+ 0.141 |
+ 0.048 |
+ 0.433 |
+ 0.405 |
+ 0.426 |
+ 0.426 |
+ 0.430 |
+ 0.430 |
+ 0.405 |
+ 0.458 |
+ 0.437 |
+ 0.433 |
+ 0.270 |
+ 0.270 |
+ 1195 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2014 |
+ 0.246 |
+ 0.216 |
+ 0.286 |
+ 0.332 |
+ 0.279 |
+ 0.303 |
+ 0.116 |
+ 0.332 |
+ 0.299 |
+ 0.309 |
+ 0.113 |
+ 0.199 |
+ 0.209 |
+ 0.126 |
+ 0.396 |
+ 0.386 |
+ 0.426 |
+ 0.313 |
+ 0.399 |
+ 0.289 |
+ 0.253 |
+ 0.409 |
+ 0.356 |
+ 0.329 |
+ 0.396 |
+ 0.313 |
+ 0.319 |
+ 0.259 |
+ 0.259 |
+ 0.283 |
+ 0.273 |
+ 0.263 |
+ 0.123 |
+ 0.316 |
+ 0.359 |
+ 0.289 |
+ 0.329 |
+ 0.376 |
+ 0.329 |
+ 0.336 |
+ 0.366 |
+ 0.316 |
+ 0.186 |
+ 0.080 |
+ 0.163 |
+ 0.223 |
+ 0.146 |
+ 0.176 |
+ 0.213 |
+ 0.113 |
+ 0.432 |
+ 0.372 |
+ 0.409 |
+ 0.442 |
+ 0.416 |
+ 0.426 |
+ 0.389 |
+ 0.406 |
+ 0.416 |
+ 0.436 |
+ 0.233 |
+ 0.309 |
+ 1380 |
+ Activity |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RNC_ECOLI_Weeks_2023 |
+ 0.425 |
+ 0.490 |
+ 0.484 |
+ 0.500 |
+ 0.502 |
+ 0.494 |
+ 0.032 |
+ 0.319 |
+ 0.478 |
+ 0.488 |
+ 0.478 |
+ 0.478 |
+ 0.489 |
+ 0.046 |
+ 0.419 |
+ 0.452 |
+ 0.486 |
+ 0.492 |
+ 0.499 |
+ 0.483 |
+ 0.425 |
+ 0.461 |
+ 0.406 |
+ 0.376 |
+ 0.450 |
+ 0.472 |
+ 0.462 |
+ 0.473 |
+ 0.459 |
+ 0.496 |
+ 0.480 |
+ 0.457 |
+ 0.138 |
+ 0.426 |
+ 0.421 |
+ 0.331 |
+ 0.459 |
+ 0.471 |
+ 0.435 |
+ 0.506 |
+ 0.517 |
+ 0.502 |
+ 0.397 |
+ 0.049 |
+ 0.467 |
+ 0.447 |
+ 0.227 |
+ 0.425 |
+ 0.200 |
+ 0.138 |
+ 0.462 |
+ 0.453 |
+ 0.419 |
+ 0.453 |
+ 0.453 |
+ 0.454 |
+ 0.460 |
+ 0.460 |
+ 0.462 |
+ 0.464 |
+ 0.492 |
+ 0.408 |
+ 4277 |
+ Activity |
+ RNC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69 |
+ 0.557 |
+ 0.576 |
+ 0.549 |
+ 0.571 |
+ 0.491 |
+ 0.533 |
+ 0.538 |
+ 0.524 |
+ 0.557 |
+ 0.576 |
+ 0.623 |
+ 0.667 |
+ 0.670 |
+ 0.629 |
+ 0.661 |
+ 0.678 |
+ 0.623 |
+ 0.598 |
+ 0.505 |
+ 0.596 |
+ 0.609 |
+ 0.620 |
+ 0.637 |
+ 0.579 |
+ 0.629 |
+ 0.626 |
+ 0.626 |
+ 0.626 |
+ 0.587 |
+ 0.664 |
+ 0.615 |
+ 0.555 |
+ 0.502 |
+ 0.516 |
+ 0.623 |
+ 0.631 |
+ 0.596 |
+ 0.642 |
+ 0.642 |
+ 0.596 |
+ 0.585 |
+ 0.593 |
+ 0.626 |
+ 0.590 |
+ 0.664 |
+ 0.642 |
+ 0.615 |
+ 0.557 |
+ 0.711 |
+ 0.607 |
+ 0.607 |
+ 0.579 |
+ 0.601 |
+ 0.612 |
+ 0.620 |
+ 0.604 |
+ 0.612 |
+ 0.607 |
+ 0.626 |
+ 0.612 |
+ 0.694 |
+ 0.683 |
+ 1459 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_high-expression |
+ 0.287 |
+ 0.424 |
+ 0.473 |
+ 0.498 |
+ 0.461 |
+ 0.436 |
+ 0.188 |
+ 0.387 |
+ 0.411 |
+ 0.461 |
+ 0.473 |
+ 0.498 |
+ 0.536 |
+ 0.275 |
+ 0.275 |
+ 0.399 |
+ 0.536 |
+ 0.523 |
+ 0.548 |
+ 0.411 |
+ 0.225 |
+ 0.337 |
+ 0.449 |
+ 0.424 |
+ 0.213 |
+ 0.337 |
+ 0.362 |
+ 0.300 |
+ 0.523 |
+ 0.387 |
+ 0.622 |
+ 0.523 |
+ 0.089 |
+ 0.138 |
+ 0.362 |
+ 0.511 |
+ 0.275 |
+ 0.362 |
+ 0.486 |
+ 0.424 |
+ 0.449 |
+ 0.523 |
+ 0.287 |
+ 0.287 |
+ 0.424 |
+ 0.312 |
+ 0.188 |
+ 0.349 |
+ 0.300 |
+ 0.138 |
+ 0.573 |
+ 0.498 |
+ 0.511 |
+ 0.560 |
+ 0.548 |
+ 0.573 |
+ 0.523 |
+ 0.511 |
+ 0.536 |
+ 0.560 |
+ 0.486 |
+ 0.349 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_low-expression |
+ 0.208 |
+ 0.299 |
+ 0.379 |
+ 0.379 |
+ 0.368 |
+ 0.311 |
+ 0.083 |
+ 0.345 |
+ 0.299 |
+ 0.322 |
+ 0.356 |
+ 0.345 |
+ 0.356 |
+ 0.140 |
+ 0.140 |
+ 0.242 |
+ 0.402 |
+ 0.470 |
+ 0.481 |
+ 0.299 |
+ 0.128 |
+ 0.311 |
+ 0.322 |
+ 0.333 |
+ 0.105 |
+ 0.242 |
+ 0.231 |
+ 0.197 |
+ 0.425 |
+ 0.265 |
+ 0.481 |
+ 0.413 |
+ 0.037 |
+ 0.060 |
+ 0.254 |
+ 0.345 |
+ 0.162 |
+ 0.231 |
+ 0.322 |
+ 0.311 |
+ 0.333 |
+ 0.368 |
+ 0.197 |
+ 0.162 |
+ 0.299 |
+ 0.162 |
+ 0.174 |
+ 0.276 |
+ 0.265 |
+ 0.254 |
+ 0.436 |
+ 0.390 |
+ 0.436 |
+ 0.436 |
+ 0.436 |
+ 0.470 |
+ 0.425 |
+ 0.413 |
+ 0.436 |
+ 0.459 |
+ 0.402 |
+ 0.208 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32 |
+ 0.295 |
+ 0.289 |
+ 0.279 |
+ 0.282 |
+ 0.272 |
+ 0.272 |
+ 0.162 |
+ 0.168 |
+ 0.322 |
+ 0.322 |
+ 0.356 |
+ 0.332 |
+ 0.305 |
+ 0.225 |
+ 0.245 |
+ 0.339 |
+ 0.326 |
+ 0.329 |
+ 0.232 |
+ 0.242 |
+ 0.108 |
+ 0.185 |
+ 0.252 |
+ 0.212 |
+ 0.235 |
+ 0.252 |
+ 0.245 |
+ 0.262 |
+ 0.245 |
+ 0.315 |
+ 0.339 |
+ 0.324 |
+ 0.085 |
+ 0.279 |
+ 0.222 |
+ 0.242 |
+ 0.299 |
+ 0.259 |
+ 0.265 |
+ 0.265 |
+ 0.255 |
+ 0.255 |
+ 0.262 |
+ 0.175 |
+ 0.315 |
+ 0.259 |
+ 0.423 |
+ 0.322 |
+ 0.469 |
+ 0.389 |
+ 0.322 |
+ 0.322 |
+ 0.312 |
+ 0.312 |
+ 0.302 |
+ 0.326 |
+ 0.329 |
+ 0.336 |
+ 0.312 |
+ 0.329 |
+ 0.322 |
+ 0.463 |
+ 1195 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance |
+ 0.384 |
+ 0.435 |
+ 0.447 |
+ 0.449 |
+ 0.458 |
+ 0.467 |
+ 0.279 |
+ 0.411 |
+ 0.496 |
+ 0.499 |
+ 0.483 |
+ 0.492 |
+ 0.532 |
+ 0.370 |
+ 0.418 |
+ 0.436 |
+ 0.499 |
+ 0.482 |
+ 0.455 |
+ 0.026 |
+ 0.382 |
+ 0.478 |
+ 0.490 |
+ 0.485 |
+ 0.398 |
+ 0.502 |
+ 0.512 |
+ 0.482 |
+ 0.458 |
+ 0.447 |
+ 0.453 |
+ 0.329 |
+ 0.273 |
+ 0.411 |
+ 0.468 |
+ 0.479 |
+ 0.430 |
+ 0.498 |
+ 0.509 |
+ 0.487 |
+ 0.499 |
+ 0.504 |
+ 0.390 |
+ 0.193 |
+ 0.466 |
+ 0.439 |
+ 0.331 |
+ 0.431 |
+ 0.433 |
+ 0.127 |
+ 0.463 |
+ 0.456 |
+ 0.454 |
+ 0.466 |
+ 0.482 |
+ 0.473 |
+ 0.478 |
+ 0.479 |
+ 0.471 |
+ 0.489 |
+ 0.502 |
+ 0.456 |
+ 9803 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity |
+ 0.344 |
+ 0.418 |
+ 0.443 |
+ 0.452 |
+ 0.446 |
+ 0.459 |
+ 0.260 |
+ 0.420 |
+ 0.465 |
+ 0.464 |
+ 0.451 |
+ 0.461 |
+ 0.492 |
+ 0.338 |
+ 0.374 |
+ 0.364 |
+ 0.460 |
+ 0.445 |
+ 0.426 |
+ 0.027 |
+ 0.368 |
+ 0.464 |
+ 0.466 |
+ 0.456 |
+ 0.379 |
+ 0.472 |
+ 0.481 |
+ 0.465 |
+ 0.437 |
+ 0.447 |
+ 0.474 |
+ 0.385 |
+ 0.261 |
+ 0.385 |
+ 0.468 |
+ 0.459 |
+ 0.400 |
+ 0.470 |
+ 0.473 |
+ 0.463 |
+ 0.478 |
+ 0.478 |
+ 0.355 |
+ 0.154 |
+ 0.453 |
+ 0.382 |
+ 0.295 |
+ 0.413 |
+ 0.380 |
+ 0.115 |
+ 0.427 |
+ 0.411 |
+ 0.408 |
+ 0.426 |
+ 0.446 |
+ 0.424 |
+ 0.439 |
+ 0.447 |
+ 0.440 |
+ 0.446 |
+ 0.418 |
+ 0.389 |
+ 10094 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB |
+ 0.082 |
+ 0.074 |
+ 0.297 |
+ 0.248 |
+ 0.285 |
+ 0.281 |
+ 0.297 |
+ 0.438 |
+ 0.268 |
+ 0.285 |
+ 0.418 |
+ 0.409 |
+ 0.422 |
+ 0.136 |
+ 0.339 |
+ 0.463 |
+ 0.405 |
+ 0.463 |
+ 0.368 |
+ 0.380 |
+ 0.443 |
+ 0.372 |
+ 0.355 |
+ 0.376 |
+ 0.422 |
+ 0.451 |
+ 0.389 |
+ 0.401 |
+ 0.405 |
+ 0.467 |
+ 0.447 |
+ 0.418 |
+ 0.281 |
+ 0.405 |
+ 0.434 |
+ 0.389 |
+ 0.281 |
+ 0.397 |
+ 0.351 |
+ 0.260 |
+ 0.285 |
+ 0.302 |
+ 0.463 |
+ -0.072 |
+ 0.463 |
+ 0.451 |
+ 0.190 |
+ 0.206 |
+ 0.289 |
+ 0.397 |
+ 0.438 |
+ 0.281 |
+ 0.181 |
+ 0.285 |
+ 0.264 |
+ 0.322 |
+ 0.339 |
+ 0.434 |
+ 0.372 |
+ 0.347 |
+ 0.455 |
+ 0.480 |
+ 965 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SBI_STAAM_Tsuboyama_2023_2JVG |
+ 0.218 |
+ 0.179 |
+ 0.311 |
+ 0.272 |
+ 0.315 |
+ 0.315 |
+ 0.182 |
+ 0.128 |
+ 0.378 |
+ 0.381 |
+ 0.253 |
+ 0.233 |
+ 0.241 |
+ 0.186 |
+ 0.206 |
+ 0.311 |
+ 0.463 |
+ 0.502 |
+ 0.233 |
+ 0.276 |
+ 0.175 |
+ 0.175 |
+ 0.214 |
+ 0.233 |
+ 0.194 |
+ 0.206 |
+ 0.167 |
+ 0.218 |
+ 0.311 |
+ 0.420 |
+ 0.409 |
+ 0.358 |
+ 0.194 |
+ 0.140 |
+ 0.155 |
+ 0.190 |
+ 0.186 |
+ 0.194 |
+ 0.190 |
+ 0.276 |
+ 0.292 |
+ 0.300 |
+ 0.221 |
+ 0.182 |
+ 0.264 |
+ 0.233 |
+ 0.526 |
+ 0.475 |
+ 0.538 |
+ 0.479 |
+ 0.389 |
+ 0.417 |
+ 0.424 |
+ 0.428 |
+ 0.432 |
+ 0.436 |
+ 0.448 |
+ 0.440 |
+ 0.405 |
+ 0.436 |
+ 0.538 |
+ 0.444 |
+ 1025 |
+ Stability |
+ SBI_STAAM |
+ Medium |
+ Prokaryote |
+
+
+ SC6A4_HUMAN_Young_2021 |
+ 0.359 |
+ 0.400 |
+ 0.352 |
+ 0.372 |
+ 0.438 |
+ 0.452 |
+ 0.273 |
+ 0.405 |
+ 0.483 |
+ 0.490 |
+ 0.454 |
+ 0.467 |
+ 0.467 |
+ 0.135 |
+ 0.255 |
+ 0.440 |
+ 0.477 |
+ 0.478 |
+ 0.449 |
+ 0.470 |
+ 0.430 |
+ 0.418 |
+ 0.399 |
+ 0.394 |
+ 0.435 |
+ 0.419 |
+ 0.430 |
+ 0.428 |
+ 0.421 |
+ 0.456 |
+ 0.384 |
+ 0.317 |
+ 0.283 |
+ 0.436 |
+ 0.416 |
+ 0.386 |
+ 0.459 |
+ 0.455 |
+ 0.432 |
+ 0.475 |
+ 0.475 |
+ 0.461 |
+ 0.419 |
+ 0.100 |
+ 0.461 |
+ 0.453 |
+ 0.383 |
+ 0.430 |
+ 0.406 |
+ 0.132 |
+ 0.455 |
+ 0.454 |
+ 0.468 |
+ 0.470 |
+ 0.478 |
+ 0.469 |
+ 0.475 |
+ 0.477 |
+ 0.466 |
+ 0.482 |
+ 0.491 |
+ 0.452 |
+ 11576 |
+ Activity |
+ SC6A4_HUMAN |
+ Medium |
+ Human |
+
+
+ SCIN_STAAR_Tsuboyama_2023_2QFF |
+ 0.051 |
+ 0.084 |
+ 0.167 |
+ 0.180 |
+ 0.236 |
+ 0.213 |
+ 0.097 |
+ 0.012 |
+ 0.183 |
+ 0.200 |
+ 0.180 |
+ 0.173 |
+ 0.170 |
+ 0.127 |
+ 0.186 |
+ 0.167 |
+ 0.190 |
+ 0.262 |
+ 0.262 |
+ 0.170 |
+ 0.054 |
+ 0.087 |
+ 0.084 |
+ 0.091 |
+ 0.107 |
+ 0.111 |
+ 0.101 |
+ 0.130 |
+ 0.134 |
+ 0.180 |
+ 0.177 |
+ 0.109 |
+ -0.025 |
+ 0.035 |
+ 0.051 |
+ 0.107 |
+ 0.114 |
+ 0.104 |
+ 0.120 |
+ 0.190 |
+ 0.173 |
+ 0.180 |
+ 0.130 |
+ 0.111 |
+ 0.200 |
+ 0.163 |
+ 0.375 |
+ 0.391 |
+ 0.381 |
+ 0.325 |
+ 0.256 |
+ 0.276 |
+ 0.269 |
+ 0.292 |
+ 0.259 |
+ 0.295 |
+ 0.272 |
+ 0.249 |
+ 0.206 |
+ 0.276 |
+ 0.447 |
+ 0.355 |
+ 1212 |
+ Stability |
+ SCIN_STAAR |
+ High |
+ Prokaryote |
+
+
+ SCN5A_HUMAN_Glazer_2019 |
+ 0.036 |
+ 0.054 |
+ 0.089 |
+ 0.089 |
+ 0.125 |
+ 0.143 |
+ 0.071 |
+ 0.000 |
+ 0.071 |
+ 0.125 |
+ 0.089 |
+ 0.143 |
+ 0.071 |
+ 0.196 |
+ 0.143 |
+ 0.071 |
+ 0.089 |
+ 0.054 |
+ 0.071 |
+ 0.054 |
+ 0.018 |
+ 0.054 |
+ 0.071 |
+ 0.089 |
+ -0.018 |
+ 0.018 |
+ 0.018 |
+ 0.036 |
+ 0.107 |
+ 0.027 |
+ 0.161 |
+ 0.143 |
+ 0.036 |
+ 0.000 |
+ 0.000 |
+ 0.089 |
+ -0.018 |
+ -0.018 |
+ -0.018 |
+ 0.054 |
+ 0.036 |
+ 0.036 |
+ 0.018 |
+ 0.125 |
+ 0.107 |
+ 0.125 |
+ 0.036 |
+ 0.054 |
+ 0.018 |
+ -0.018 |
+ 0.089 |
+ 0.054 |
+ 0.071 |
+ 0.036 |
+ 0.054 |
+ 0.089 |
+ 0.143 |
+ 0.107 |
+ 0.036 |
+ 0.089 |
+ 0.125 |
+ 0.018 |
+ 224 |
+ OrganismalFitness |
+ SCN5A_HUMAN |
+ Medium |
+ Human |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0 |
+ 0.477 |
+ 0.527 |
+ 0.530 |
+ 0.530 |
+ 0.532 |
+ 0.532 |
+ 0.118 |
+ 0.443 |
+ 0.524 |
+ 0.525 |
+ 0.529 |
+ 0.512 |
+ 0.522 |
+ 0.284 |
+ 0.282 |
+ 0.518 |
+ 0.508 |
+ 0.500 |
+ 0.505 |
+ 0.529 |
+ 0.156 |
+ 0.190 |
+ 0.192 |
+ 0.318 |
+ 0.163 |
+ 0.323 |
+ 0.084 |
+ 0.323 |
+ 0.385 |
+ 0.532 |
+ 0.527 |
+ 0.527 |
+ -0.240 |
+ 0.103 |
+ 0.287 |
+ 0.342 |
+ 0.488 |
+ 0.496 |
+ 0.460 |
+ 0.512 |
+ 0.512 |
+ 0.508 |
+ 0.096 |
+ -0.009 |
+ 0.342 |
+ 0.068 |
+ 0.376 |
+ 0.349 |
+ 0.529 |
+ 0.522 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.529 |
+ 0.518 |
+ 0.525 |
+ 2770 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SERC_HUMAN_Xie_2023 |
+ 0.273 |
+ 0.354 |
+ 0.394 |
+ 0.396 |
+ 0.392 |
+ 0.394 |
+ 0.018 |
+ 0.333 |
+ 0.417 |
+ 0.421 |
+ 0.371 |
+ 0.379 |
+ 0.369 |
+ 0.116 |
+ 0.281 |
+ 0.388 |
+ 0.396 |
+ 0.436 |
+ 0.398 |
+ 0.379 |
+ 0.346 |
+ 0.371 |
+ 0.381 |
+ 0.369 |
+ 0.352 |
+ 0.386 |
+ 0.369 |
+ 0.396 |
+ 0.371 |
+ 0.354 |
+ 0.365 |
+ 0.315 |
+ 0.152 |
+ 0.361 |
+ 0.373 |
+ 0.379 |
+ 0.365 |
+ 0.375 |
+ 0.379 |
+ 0.400 |
+ 0.413 |
+ 0.415 |
+ 0.202 |
+ 0.047 |
+ 0.390 |
+ 0.335 |
+ 0.277 |
+ 0.375 |
+ 0.292 |
+ 0.112 |
+ 0.419 |
+ 0.396 |
+ 0.411 |
+ 0.427 |
+ 0.417 |
+ 0.421 |
+ 0.398 |
+ 0.400 |
+ 0.404 |
+ 0.423 |
+ 0.396 |
+ 0.350 |
+ 1914 |
+ OrganismalFitness |
+ SERC_HUMAN |
+ High |
+ Human |
+
+
+ SHOC2_HUMAN_Kwon_2022 |
+ 0.133 |
+ 0.265 |
+ 0.294 |
+ 0.290 |
+ 0.285 |
+ 0.296 |
+ 0.129 |
+ 0.260 |
+ 0.314 |
+ 0.319 |
+ 0.298 |
+ 0.278 |
+ 0.304 |
+ 0.144 |
+ 0.152 |
+ 0.166 |
+ 0.323 |
+ 0.295 |
+ 0.213 |
+ 0.294 |
+ 0.161 |
+ 0.282 |
+ 0.289 |
+ 0.262 |
+ 0.166 |
+ 0.296 |
+ 0.300 |
+ 0.304 |
+ 0.270 |
+ 0.308 |
+ 0.303 |
+ 0.254 |
+ 0.099 |
+ 0.150 |
+ 0.234 |
+ 0.277 |
+ 0.152 |
+ 0.201 |
+ 0.258 |
+ 0.260 |
+ 0.275 |
+ 0.303 |
+ 0.156 |
+ 0.149 |
+ 0.317 |
+ 0.191 |
+ 0.216 |
+ 0.277 |
+ 0.202 |
+ 0.064 |
+ 0.295 |
+ 0.296 |
+ 0.303 |
+ 0.308 |
+ 0.306 |
+ 0.306 |
+ 0.290 |
+ 0.307 |
+ 0.296 |
+ 0.307 |
+ 0.217 |
+ 0.187 |
+ 10972 |
+ OrganismalFitness |
+ SHOC2_HUMAN |
+ Medium |
+ Human |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK |
+ 0.170 |
+ 0.210 |
+ 0.250 |
+ 0.253 |
+ 0.242 |
+ 0.238 |
+ 0.289 |
+ 0.158 |
+ 0.166 |
+ 0.202 |
+ 0.174 |
+ 0.234 |
+ 0.218 |
+ 0.210 |
+ 0.250 |
+ 0.226 |
+ 0.222 |
+ 0.170 |
+ 0.186 |
+ 0.158 |
+ 0.103 |
+ 0.115 |
+ 0.111 |
+ 0.131 |
+ 0.190 |
+ 0.182 |
+ 0.103 |
+ 0.194 |
+ 0.103 |
+ 0.170 |
+ 0.040 |
+ 0.026 |
+ 0.158 |
+ 0.079 |
+ 0.162 |
+ 0.166 |
+ 0.147 |
+ 0.158 |
+ 0.154 |
+ 0.234 |
+ 0.230 |
+ 0.214 |
+ 0.186 |
+ 0.127 |
+ 0.147 |
+ 0.246 |
+ 0.380 |
+ 0.253 |
+ 0.412 |
+ 0.356 |
+ 0.293 |
+ 0.277 |
+ 0.242 |
+ 0.277 |
+ 0.273 |
+ 0.281 |
+ 0.257 |
+ 0.269 |
+ 0.246 |
+ 0.285 |
+ 0.250 |
+ 0.364 |
+ 1010 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPA_STAAU_Tsuboyama_2023_1LP1 |
+ 0.373 |
+ 0.440 |
+ 0.464 |
+ 0.445 |
+ 0.493 |
+ 0.498 |
+ -0.114 |
+ 0.460 |
+ 0.407 |
+ 0.409 |
+ 0.367 |
+ -0.068 |
+ -0.028 |
+ -0.055 |
+ -0.058 |
+ -0.076 |
+ -0.057 |
+ -0.068 |
+ -0.022 |
+ 0.491 |
+ -0.155 |
+ -0.022 |
+ 0.111 |
+ 0.054 |
+ -0.085 |
+ -0.055 |
+ 0.042 |
+ 0.230 |
+ 0.312 |
+ 0.436 |
+ 0.424 |
+ 0.412 |
+ -0.020 |
+ -0.009 |
+ -0.024 |
+ 0.008 |
+ 0.335 |
+ 0.327 |
+ 0.318 |
+ 0.489 |
+ 0.487 |
+ 0.481 |
+ -0.159 |
+ -0.117 |
+ -0.123 |
+ -0.074 |
+ 0.362 |
+ 0.306 |
+ 0.563 |
+ 0.434 |
+ 0.472 |
+ 0.455 |
+ 0.445 |
+ 0.470 |
+ 0.451 |
+ 0.455 |
+ 0.472 |
+ 0.460 |
+ 0.460 |
+ 0.470 |
+ 0.409 |
+ 0.286 |
+ 2105 |
+ Stability |
+ SPA_STAAU |
+ Medium |
+ Prokaryote |
+
+
+ SPG1_STRSG_Olson_2014 |
+ 0.169 |
+ 0.186 |
+ -0.005 |
+ 0.012 |
+ 0.158 |
+ 0.160 |
+ -0.006 |
+ 0.044 |
+ -0.068 |
+ 0.072 |
+ 0.228 |
+ 0.174 |
+ 0.142 |
+ 0.200 |
+ 0.165 |
+ 0.205 |
+ 0.237 |
+ 0.200 |
+ 0.254 |
+ 0.202 |
+ 0.180 |
+ 0.152 |
+ 0.154 |
+ 0.149 |
+ 0.167 |
+ 0.162 |
+ 0.150 |
+ 0.167 |
+ 0.248 |
+ 0.170 |
+ 0.328 |
+ 0.316 |
+ 0.047 |
+ 0.172 |
+ 0.132 |
+ 0.198 |
+ 0.185 |
+ 0.149 |
+ 0.203 |
+ 0.173 |
+ 0.136 |
+ 0.205 |
+ -0.029 |
+ -0.080 |
+ 0.157 |
+ 0.051 |
+ 0.240 |
+ 0.190 |
+ 0.276 |
+ 0.086 |
+ 0.279 |
+ 0.253 |
+ 0.272 |
+ 0.302 |
+ 0.303 |
+ 0.280 |
+ 0.314 |
+ 0.334 |
+ 0.316 |
+ 0.309 |
+ 0.287 |
+ 0.273 |
+ 536962 |
+ Binding |
+ SPG1_STRSG |
+ Low |
+ Prokaryote |
+
+
+ SPG1_STRSG_Wu_2016 |
+ 0.010 |
+ 0.079 |
+ 0.087 |
+ 0.091 |
+ 0.084 |
+ 0.093 |
+ 0.076 |
+ 0.028 |
+ 0.162 |
+ 0.158 |
+ 0.131 |
+ 0.117 |
+ 0.108 |
+ 0.096 |
+ 0.125 |
+ 0.129 |
+ 0.138 |
+ 0.163 |
+ 0.182 |
+ 0.098 |
+ 0.071 |
+ 0.056 |
+ 0.048 |
+ 0.069 |
+ 0.044 |
+ 0.096 |
+ 0.079 |
+ 0.063 |
+ 0.063 |
+ 0.095 |
+ 0.125 |
+ 0.093 |
+ -0.004 |
+ 0.080 |
+ 0.040 |
+ 0.061 |
+ 0.090 |
+ 0.078 |
+ 0.086 |
+ 0.103 |
+ 0.094 |
+ 0.092 |
+ 0.050 |
+ 0.005 |
+ 0.078 |
+ 0.045 |
+ 0.058 |
+ 0.054 |
+ 0.145 |
+ 0.089 |
+ 0.141 |
+ 0.124 |
+ 0.118 |
+ 0.149 |
+ 0.125 |
+ 0.119 |
+ 0.143 |
+ 0.136 |
+ 0.149 |
+ 0.136 |
+ 0.148 |
+ 0.115 |
+ 149360 |
+ Binding |
+ SPG1_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS |
+ 0.442 |
+ 0.458 |
+ 0.461 |
+ 0.483 |
+ 0.555 |
+ 0.519 |
+ 0.312 |
+ 0.301 |
+ 0.489 |
+ 0.414 |
+ 0.417 |
+ 0.384 |
+ 0.354 |
+ 0.326 |
+ 0.376 |
+ 0.403 |
+ 0.436 |
+ 0.425 |
+ 0.489 |
+ 0.425 |
+ 0.343 |
+ 0.376 |
+ 0.378 |
+ 0.354 |
+ 0.298 |
+ 0.381 |
+ 0.301 |
+ 0.403 |
+ 0.450 |
+ 0.571 |
+ 0.525 |
+ 0.507 |
+ -0.126 |
+ 0.332 |
+ 0.409 |
+ 0.389 |
+ 0.304 |
+ 0.309 |
+ 0.370 |
+ 0.486 |
+ 0.513 |
+ 0.500 |
+ 0.235 |
+ 0.130 |
+ 0.235 |
+ 0.216 |
+ 0.395 |
+ 0.315 |
+ 0.662 |
+ 0.494 |
+ 0.530 |
+ 0.505 |
+ 0.527 |
+ 0.513 |
+ 0.494 |
+ 0.522 |
+ 0.527 |
+ 0.569 |
+ 0.516 |
+ 0.530 |
+ 0.519 |
+ 0.519 |
+ 1451 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPIKE_SARS2_Starr_2020_binding |
+ 0.111 |
+ 0.135 |
+ 0.051 |
+ 0.116 |
+ 0.153 |
+ 0.166 |
+ -0.059 |
+ 0.259 |
+ 0.270 |
+ 0.257 |
+ -0.011 |
+ -0.075 |
+ -0.045 |
+ -0.048 |
+ -0.048 |
+ -0.033 |
+ -0.024 |
+ -0.023 |
+ 0.016 |
+ 0.255 |
+ 0.223 |
+ 0.264 |
+ 0.256 |
+ 0.264 |
+ 0.284 |
+ 0.236 |
+ 0.209 |
+ 0.272 |
+ 0.227 |
+ 0.163 |
+ 0.185 |
+ 0.238 |
+ 0.161 |
+ 0.227 |
+ 0.215 |
+ 0.262 |
+ 0.209 |
+ 0.210 |
+ 0.214 |
+ 0.249 |
+ 0.242 |
+ 0.238 |
+ -0.053 |
+ -0.016 |
+ -0.024 |
+ -0.018 |
+ 0.413 |
+ 0.379 |
+ 0.320 |
+ 0.114 |
+ 0.137 |
+ 0.152 |
+ 0.137 |
+ 0.181 |
+ 0.153 |
+ 0.183 |
+ 0.156 |
+ 0.187 |
+ 0.158 |
+ 0.174 |
+ 0.216 |
+ 0.095 |
+ 3802 |
+ Binding |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPIKE_SARS2_Starr_2020_expression |
+ 0.161 |
+ 0.223 |
+ 0.122 |
+ 0.215 |
+ 0.375 |
+ 0.352 |
+ -0.034 |
+ 0.266 |
+ 0.321 |
+ 0.344 |
+ 0.038 |
+ -0.032 |
+ -0.007 |
+ -0.022 |
+ -0.024 |
+ -0.006 |
+ 0.001 |
+ 0.019 |
+ 0.055 |
+ 0.271 |
+ 0.240 |
+ 0.288 |
+ 0.293 |
+ 0.308 |
+ 0.289 |
+ 0.277 |
+ 0.236 |
+ 0.302 |
+ 0.249 |
+ 0.267 |
+ 0.207 |
+ 0.269 |
+ 0.144 |
+ 0.250 |
+ 0.241 |
+ 0.274 |
+ 0.259 |
+ 0.270 |
+ 0.273 |
+ 0.339 |
+ 0.352 |
+ 0.367 |
+ -0.010 |
+ 0.013 |
+ 0.013 |
+ 0.021 |
+ 0.434 |
+ 0.419 |
+ 0.377 |
+ 0.140 |
+ 0.225 |
+ 0.239 |
+ 0.230 |
+ 0.285 |
+ 0.247 |
+ 0.298 |
+ 0.246 |
+ 0.270 |
+ 0.247 |
+ 0.273 |
+ 0.338 |
+ 0.180 |
+ 3798 |
+ Expression |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD |
+ 0.499 |
+ 0.446 |
+ 0.433 |
+ 0.402 |
+ 0.426 |
+ 0.412 |
+ 0.140 |
+ 0.378 |
+ 0.400 |
+ 0.422 |
+ 0.402 |
+ 0.485 |
+ 0.445 |
+ -0.069 |
+ 0.443 |
+ 0.437 |
+ 0.442 |
+ 0.544 |
+ 0.489 |
+ 0.392 |
+ 0.425 |
+ 0.417 |
+ 0.435 |
+ 0.400 |
+ 0.458 |
+ 0.412 |
+ 0.460 |
+ 0.396 |
+ 0.382 |
+ 0.385 |
+ 0.405 |
+ 0.431 |
+ 0.330 |
+ 0.373 |
+ 0.418 |
+ 0.391 |
+ 0.463 |
+ 0.465 |
+ 0.471 |
+ 0.406 |
+ 0.420 |
+ 0.415 |
+ 0.367 |
+ -0.148 |
+ 0.335 |
+ 0.352 |
+ 0.303 |
+ 0.248 |
+ 0.472 |
+ 0.440 |
+ 0.433 |
+ 0.432 |
+ 0.463 |
+ 0.455 |
+ 0.451 |
+ 0.448 |
+ 0.433 |
+ 0.423 |
+ 0.435 |
+ 0.433 |
+ 0.500 |
+ 0.415 |
+ 3201 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU |
+ 0.324 |
+ 0.409 |
+ 0.420 |
+ 0.437 |
+ 0.426 |
+ 0.437 |
+ 0.064 |
+ 0.233 |
+ 0.369 |
+ 0.414 |
+ 0.318 |
+ 0.233 |
+ 0.324 |
+ 0.143 |
+ 0.318 |
+ 0.386 |
+ 0.437 |
+ 0.494 |
+ 0.369 |
+ 0.431 |
+ 0.143 |
+ 0.296 |
+ 0.369 |
+ 0.296 |
+ 0.182 |
+ 0.392 |
+ 0.420 |
+ 0.347 |
+ 0.358 |
+ 0.426 |
+ 0.414 |
+ 0.378 |
+ 0.075 |
+ 0.149 |
+ 0.284 |
+ 0.397 |
+ 0.358 |
+ 0.324 |
+ 0.420 |
+ 0.471 |
+ 0.386 |
+ 0.437 |
+ 0.160 |
+ 0.115 |
+ 0.454 |
+ 0.403 |
+ 0.528 |
+ 0.494 |
+ 0.528 |
+ 0.528 |
+ 0.477 |
+ 0.431 |
+ 0.516 |
+ 0.494 |
+ 0.482 |
+ 0.477 |
+ 0.488 |
+ 0.471 |
+ 0.482 |
+ 0.494 |
+ 0.482 |
+ 0.341 |
+ 707 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88 |
+ 0.506 |
+ 0.529 |
+ 0.536 |
+ 0.539 |
+ 0.567 |
+ 0.554 |
+ -0.141 |
+ 0.370 |
+ 0.546 |
+ 0.544 |
+ 0.572 |
+ 0.564 |
+ 0.559 |
+ -0.169 |
+ 0.539 |
+ 0.589 |
+ 0.579 |
+ 0.584 |
+ 0.569 |
+ 0.506 |
+ -0.111 |
+ 0.319 |
+ 0.231 |
+ 0.375 |
+ 0.347 |
+ 0.420 |
+ 0.458 |
+ 0.483 |
+ 0.435 |
+ 0.564 |
+ 0.577 |
+ 0.574 |
+ 0.263 |
+ 0.049 |
+ 0.150 |
+ 0.319 |
+ 0.534 |
+ 0.519 |
+ 0.511 |
+ 0.562 |
+ 0.559 |
+ 0.526 |
+ 0.466 |
+ -0.245 |
+ 0.572 |
+ 0.476 |
+ 0.059 |
+ 0.461 |
+ 0.587 |
+ 0.572 |
+ 0.572 |
+ 0.567 |
+ 0.574 |
+ 0.582 |
+ 0.587 |
+ 0.595 |
+ 0.567 |
+ 0.567 |
+ 0.584 |
+ 0.577 |
+ 0.655 |
+ 0.612 |
+ 1583 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W |
+ 0.413 |
+ 0.418 |
+ 0.559 |
+ 0.549 |
+ 0.598 |
+ 0.585 |
+ 0.444 |
+ 0.336 |
+ 0.618 |
+ 0.621 |
+ 0.474 |
+ 0.536 |
+ 0.567 |
+ 0.379 |
+ 0.575 |
+ 0.613 |
+ 0.593 |
+ 0.544 |
+ 0.541 |
+ 0.541 |
+ 0.549 |
+ 0.513 |
+ 0.510 |
+ 0.495 |
+ 0.495 |
+ 0.515 |
+ 0.523 |
+ 0.503 |
+ 0.503 |
+ 0.539 |
+ 0.590 |
+ 0.559 |
+ 0.485 |
+ 0.480 |
+ 0.487 |
+ 0.539 |
+ 0.513 |
+ 0.554 |
+ 0.559 |
+ 0.577 |
+ 0.587 |
+ 0.585 |
+ 0.248 |
+ 0.117 |
+ 0.456 |
+ 0.433 |
+ 0.438 |
+ 0.446 |
+ 0.497 |
+ 0.510 |
+ 0.621 |
+ 0.593 |
+ 0.600 |
+ 0.600 |
+ 0.613 |
+ 0.608 |
+ 0.598 |
+ 0.590 |
+ 0.613 |
+ 0.608 |
+ 0.582 |
+ 0.521 |
+ 1556 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ SRC_HUMAN_Ahler_2019 |
+ 0.460 |
+ 0.453 |
+ 0.418 |
+ 0.428 |
+ 0.453 |
+ 0.457 |
+ 0.466 |
+ 0.386 |
+ 0.494 |
+ 0.510 |
+ 0.442 |
+ 0.515 |
+ 0.533 |
+ 0.357 |
+ 0.418 |
+ 0.413 |
+ 0.495 |
+ 0.450 |
+ 0.446 |
+ 0.450 |
+ 0.377 |
+ 0.359 |
+ 0.373 |
+ 0.330 |
+ 0.394 |
+ 0.428 |
+ 0.391 |
+ 0.358 |
+ 0.272 |
+ 0.501 |
+ 0.519 |
+ 0.493 |
+ 0.379 |
+ 0.352 |
+ 0.359 |
+ 0.295 |
+ 0.443 |
+ 0.453 |
+ 0.432 |
+ 0.459 |
+ 0.459 |
+ 0.457 |
+ 0.496 |
+ 0.311 |
+ 0.442 |
+ 0.486 |
+ 0.276 |
+ 0.247 |
+ 0.355 |
+ 0.094 |
+ 0.456 |
+ 0.438 |
+ 0.445 |
+ 0.465 |
+ 0.479 |
+ 0.471 |
+ 0.470 |
+ 0.476 |
+ 0.472 |
+ 0.487 |
+ 0.470 |
+ 0.412 |
+ 3372 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM |
+ 0.385 |
+ 0.378 |
+ 0.380 |
+ 0.383 |
+ 0.391 |
+ 0.384 |
+ 0.387 |
+ 0.333 |
+ 0.417 |
+ 0.407 |
+ 0.361 |
+ 0.423 |
+ 0.439 |
+ 0.268 |
+ 0.335 |
+ 0.332 |
+ 0.425 |
+ 0.352 |
+ 0.376 |
+ 0.362 |
+ 0.329 |
+ 0.304 |
+ 0.322 |
+ 0.277 |
+ 0.328 |
+ 0.357 |
+ 0.316 |
+ 0.313 |
+ 0.220 |
+ 0.411 |
+ 0.438 |
+ 0.435 |
+ 0.299 |
+ 0.282 |
+ 0.292 |
+ 0.226 |
+ 0.362 |
+ 0.358 |
+ 0.345 |
+ 0.380 |
+ 0.378 |
+ 0.384 |
+ 0.410 |
+ 0.238 |
+ 0.360 |
+ 0.393 |
+ 0.219 |
+ 0.224 |
+ 0.290 |
+ 0.035 |
+ 0.393 |
+ 0.372 |
+ 0.373 |
+ 0.396 |
+ 0.407 |
+ 0.410 |
+ 0.393 |
+ 0.412 |
+ 0.396 |
+ 0.414 |
+ 0.389 |
+ 0.346 |
+ 3637 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Nguyen_2022 |
+ 0.337 |
+ 0.341 |
+ 0.356 |
+ 0.351 |
+ 0.351 |
+ 0.351 |
+ 0.345 |
+ 0.319 |
+ 0.284 |
+ 0.345 |
+ 0.343 |
+ 0.389 |
+ 0.411 |
+ 0.238 |
+ 0.296 |
+ 0.298 |
+ 0.382 |
+ 0.338 |
+ 0.351 |
+ 0.340 |
+ 0.283 |
+ 0.284 |
+ 0.293 |
+ 0.257 |
+ 0.291 |
+ 0.332 |
+ 0.288 |
+ 0.279 |
+ 0.209 |
+ 0.369 |
+ 0.402 |
+ 0.391 |
+ 0.295 |
+ 0.257 |
+ 0.265 |
+ 0.207 |
+ 0.320 |
+ 0.321 |
+ 0.316 |
+ 0.350 |
+ 0.338 |
+ 0.340 |
+ 0.387 |
+ 0.213 |
+ 0.345 |
+ 0.390 |
+ 0.193 |
+ 0.201 |
+ 0.256 |
+ 0.040 |
+ 0.366 |
+ 0.346 |
+ 0.343 |
+ 0.366 |
+ 0.377 |
+ 0.372 |
+ 0.364 |
+ 0.378 |
+ 0.366 |
+ 0.381 |
+ 0.365 |
+ 0.308 |
+ 3366 |
+ OrganismalFitness |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SUMO1_HUMAN_Weile_2017 |
+ 0.339 |
+ 0.293 |
+ 0.339 |
+ 0.363 |
+ 0.392 |
+ 0.378 |
+ 0.123 |
+ 0.370 |
+ 0.378 |
+ 0.303 |
+ 0.341 |
+ 0.402 |
+ 0.431 |
+ 0.213 |
+ 0.412 |
+ 0.467 |
+ 0.433 |
+ 0.307 |
+ 0.281 |
+ 0.448 |
+ 0.196 |
+ 0.322 |
+ 0.378 |
+ 0.337 |
+ 0.402 |
+ 0.295 |
+ 0.402 |
+ 0.349 |
+ 0.269 |
+ 0.375 |
+ 0.399 |
+ 0.426 |
+ 0.201 |
+ 0.223 |
+ 0.416 |
+ 0.252 |
+ 0.339 |
+ 0.460 |
+ 0.322 |
+ 0.351 |
+ 0.465 |
+ 0.344 |
+ 0.431 |
+ 0.056 |
+ 0.293 |
+ 0.450 |
+ 0.438 |
+ 0.358 |
+ 0.445 |
+ 0.370 |
+ 0.431 |
+ 0.370 |
+ 0.429 |
+ 0.429 |
+ 0.419 |
+ 0.453 |
+ 0.402 |
+ 0.436 |
+ 0.412 |
+ 0.443 |
+ 0.397 |
+ 0.453 |
+ 1700 |
+ OrganismalFitness |
+ SUMO1_HUMAN |
+ High |
+ Human |
+
+
+ SYUA_HUMAN_Newberry_2020 |
+ 0.152 |
+ 0.192 |
+ 0.198 |
+ 0.234 |
+ 0.196 |
+ 0.211 |
+ 0.202 |
+ 0.267 |
+ 0.242 |
+ 0.248 |
+ 0.338 |
+ 0.332 |
+ 0.324 |
+ 0.206 |
+ 0.231 |
+ 0.229 |
+ 0.236 |
+ 0.278 |
+ 0.309 |
+ 0.332 |
+ 0.248 |
+ 0.326 |
+ 0.255 |
+ 0.238 |
+ 0.194 |
+ 0.273 |
+ 0.286 |
+ 0.259 |
+ 0.238 |
+ 0.307 |
+ 0.311 |
+ 0.252 |
+ 0.129 |
+ 0.257 |
+ 0.359 |
+ 0.284 |
+ 0.189 |
+ 0.278 |
+ 0.229 |
+ 0.225 |
+ 0.276 |
+ 0.231 |
+ 0.211 |
+ 0.173 |
+ 0.313 |
+ 0.240 |
+ -0.006 |
+ 0.206 |
+ 0.051 |
+ -0.041 |
+ 0.190 |
+ 0.190 |
+ 0.189 |
+ 0.198 |
+ 0.229 |
+ 0.196 |
+ 0.227 |
+ 0.215 |
+ 0.229 |
+ 0.221 |
+ 0.074 |
+ 0.059 |
+ 2497 |
+ OrganismalFitness |
+ SYUA_HUMAN |
+ Medium |
+ Human |
+
+
+ TADBP_HUMAN_Bolognesi_2019 |
+ 0.090 |
+ 0.040 |
+ 0.077 |
+ 0.080 |
+ 0.074 |
+ 0.070 |
+ 0.154 |
+ 0.023 |
+ 0.060 |
+ 0.054 |
+ 0.010 |
+ 0.030 |
+ 0.023 |
+ 0.054 |
+ 0.020 |
+ 0.023 |
+ -0.070 |
+ 0.013 |
+ 0.054 |
+ -0.047 |
+ 0.077 |
+ 0.037 |
+ -0.003 |
+ -0.010 |
+ 0.104 |
+ 0.070 |
+ 0.043 |
+ -0.017 |
+ -0.030 |
+ 0.030 |
+ 0.107 |
+ 0.142 |
+ -0.043 |
+ 0.120 |
+ 0.147 |
+ 0.084 |
+ 0.090 |
+ 0.110 |
+ 0.087 |
+ 0.100 |
+ 0.117 |
+ 0.084 |
+ 0.054 |
+ 0.107 |
+ -0.007 |
+ 0.030 |
+ 0.197 |
+ 0.023 |
+ 0.120 |
+ 0.047 |
+ 0.090 |
+ -0.033 |
+ -0.013 |
+ 0.003 |
+ 0.007 |
+ 0.027 |
+ 0.033 |
+ 0.003 |
+ -0.060 |
+ 0.023 |
+ 0.010 |
+ 0.057 |
+ 1196 |
+ OrganismalFitness |
+ TADBP_HUMAN |
+ Low |
+ Human |
+
+
+ TAT_HV1BR_Fernandes_2016 |
+ 0.265 |
+ 0.183 |
+ 0.238 |
+ 0.243 |
+ 0.259 |
+ 0.243 |
+ -0.076 |
+ 0.396 |
+ 0.227 |
+ 0.257 |
+ 0.159 |
+ 0.338 |
+ 0.322 |
+ -0.032 |
+ 0.006 |
+ 0.009 |
+ 0.028 |
+ -0.057 |
+ 0.044 |
+ 0.186 |
+ 0.393 |
+ 0.415 |
+ 0.417 |
+ 0.423 |
+ 0.360 |
+ 0.257 |
+ 0.120 |
+ 0.229 |
+ 0.199 |
+ 0.314 |
+ 0.401 |
+ 0.403 |
+ 0.319 |
+ 0.371 |
+ 0.175 |
+ 0.227 |
+ 0.374 |
+ 0.254 |
+ 0.229 |
+ 0.341 |
+ 0.270 |
+ 0.257 |
+ 0.020 |
+ -0.027 |
+ 0.284 |
+ 0.156 |
+ 0.202 |
+ 0.281 |
+ 0.300 |
+ 0.178 |
+ 0.082 |
+ 0.126 |
+ 0.139 |
+ 0.099 |
+ 0.107 |
+ 0.107 |
+ 0.074 |
+ 0.099 |
+ 0.090 |
+ 0.093 |
+ 0.161 |
+ 0.145 |
+ 1577 |
+ OrganismalFitness |
+ TAT_HV1BR |
+ High |
+ Virus |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L |
+ 0.590 |
+ 0.537 |
+ 0.606 |
+ 0.610 |
+ 0.655 |
+ 0.655 |
+ 0.682 |
+ 0.312 |
+ 0.648 |
+ 0.697 |
+ 0.552 |
+ 0.728 |
+ 0.747 |
+ 0.670 |
+ 0.690 |
+ 0.762 |
+ 0.731 |
+ 0.690 |
+ 0.724 |
+ 0.514 |
+ 0.423 |
+ 0.571 |
+ 0.556 |
+ 0.602 |
+ 0.583 |
+ 0.579 |
+ 0.636 |
+ 0.644 |
+ 0.575 |
+ 0.693 |
+ 0.636 |
+ 0.587 |
+ 0.541 |
+ 0.446 |
+ 0.602 |
+ 0.686 |
+ 0.640 |
+ 0.705 |
+ 0.697 |
+ 0.636 |
+ 0.655 |
+ 0.648 |
+ 0.701 |
+ 0.103 |
+ 0.530 |
+ 0.667 |
+ 0.541 |
+ 0.461 |
+ 0.739 |
+ 0.712 |
+ 0.705 |
+ 0.663 |
+ 0.644 |
+ 0.739 |
+ 0.712 |
+ 0.735 |
+ 0.709 |
+ 0.724 |
+ 0.697 |
+ 0.720 |
+ 0.758 |
+ 0.754 |
+ 1058 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG |
+ 0.250 |
+ 0.260 |
+ 0.437 |
+ 0.411 |
+ 0.407 |
+ 0.420 |
+ -0.002 |
+ 0.122 |
+ 0.535 |
+ 0.525 |
+ 0.512 |
+ 0.420 |
+ 0.483 |
+ 0.099 |
+ 0.525 |
+ 0.532 |
+ 0.496 |
+ 0.515 |
+ 0.545 |
+ 0.496 |
+ -0.097 |
+ 0.266 |
+ 0.312 |
+ 0.296 |
+ 0.266 |
+ 0.286 |
+ 0.185 |
+ 0.368 |
+ 0.443 |
+ 0.499 |
+ 0.551 |
+ 0.548 |
+ 0.414 |
+ -0.042 |
+ 0.339 |
+ 0.306 |
+ 0.397 |
+ 0.463 |
+ 0.473 |
+ 0.411 |
+ 0.489 |
+ 0.483 |
+ 0.188 |
+ 0.076 |
+ 0.424 |
+ 0.306 |
+ 0.414 |
+ 0.401 |
+ 0.443 |
+ 0.568 |
+ 0.542 |
+ 0.529 |
+ 0.568 |
+ 0.587 |
+ 0.571 |
+ 0.555 |
+ 0.555 |
+ 0.558 |
+ 0.548 |
+ 0.561 |
+ 0.479 |
+ 0.404 |
+ 1279 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT |
+ 0.346 |
+ 0.335 |
+ 0.443 |
+ 0.438 |
+ 0.424 |
+ 0.421 |
+ -0.109 |
+ 0.318 |
+ 0.511 |
+ 0.492 |
+ 0.405 |
+ 0.416 |
+ 0.419 |
+ -0.014 |
+ 0.394 |
+ 0.456 |
+ 0.473 |
+ 0.432 |
+ 0.470 |
+ 0.405 |
+ 0.227 |
+ 0.316 |
+ 0.340 |
+ 0.321 |
+ 0.351 |
+ 0.381 |
+ 0.289 |
+ 0.332 |
+ 0.346 |
+ 0.486 |
+ 0.454 |
+ 0.435 |
+ 0.167 |
+ 0.067 |
+ 0.278 |
+ 0.302 |
+ 0.397 |
+ 0.394 |
+ 0.397 |
+ 0.462 |
+ 0.421 |
+ 0.427 |
+ 0.153 |
+ -0.076 |
+ 0.400 |
+ 0.337 |
+ 0.335 |
+ 0.473 |
+ 0.559 |
+ 0.508 |
+ 0.459 |
+ 0.459 |
+ 0.467 |
+ 0.470 |
+ 0.451 |
+ 0.443 |
+ 0.440 |
+ 0.432 |
+ 0.446 |
+ 0.454 |
+ 0.586 |
+ 0.467 |
+ 1479 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ TPK1_HUMAN_Weile_2017 |
+ 0.190 |
+ 0.194 |
+ 0.185 |
+ 0.198 |
+ 0.203 |
+ 0.209 |
+ 0.059 |
+ 0.193 |
+ 0.236 |
+ 0.221 |
+ 0.230 |
+ 0.215 |
+ 0.257 |
+ 0.121 |
+ 0.162 |
+ 0.216 |
+ 0.266 |
+ 0.258 |
+ 0.306 |
+ 0.191 |
+ 0.096 |
+ 0.107 |
+ 0.180 |
+ 0.222 |
+ 0.119 |
+ 0.217 |
+ 0.220 |
+ 0.209 |
+ 0.231 |
+ 0.191 |
+ 0.199 |
+ 0.161 |
+ 0.105 |
+ 0.117 |
+ 0.171 |
+ 0.234 |
+ 0.190 |
+ 0.197 |
+ 0.242 |
+ 0.208 |
+ 0.209 |
+ 0.239 |
+ 0.131 |
+ 0.104 |
+ 0.269 |
+ 0.152 |
+ 0.148 |
+ 0.209 |
+ 0.184 |
+ 0.076 |
+ 0.200 |
+ 0.207 |
+ 0.211 |
+ 0.217 |
+ 0.226 |
+ 0.226 |
+ 0.215 |
+ 0.229 |
+ 0.233 |
+ 0.231 |
+ 0.217 |
+ 0.185 |
+ 3181 |
+ OrganismalFitness |
+ TPK1_HUMAN |
+ Medium |
+ Human |
+
+
+ TPMT_HUMAN_Matreyek_2018 |
+ 0.263 |
+ 0.306 |
+ 0.313 |
+ 0.323 |
+ 0.322 |
+ 0.331 |
+ 0.173 |
+ 0.302 |
+ 0.322 |
+ 0.329 |
+ 0.349 |
+ 0.330 |
+ 0.344 |
+ 0.225 |
+ 0.265 |
+ 0.344 |
+ 0.349 |
+ 0.295 |
+ 0.291 |
+ 0.341 |
+ 0.186 |
+ 0.263 |
+ 0.301 |
+ 0.299 |
+ 0.265 |
+ 0.304 |
+ 0.301 |
+ 0.268 |
+ 0.295 |
+ 0.351 |
+ 0.359 |
+ 0.342 |
+ 0.223 |
+ 0.195 |
+ 0.302 |
+ 0.290 |
+ 0.302 |
+ 0.337 |
+ 0.331 |
+ 0.349 |
+ 0.334 |
+ 0.344 |
+ 0.244 |
+ 0.148 |
+ 0.345 |
+ 0.316 |
+ 0.322 |
+ 0.347 |
+ 0.360 |
+ 0.162 |
+ 0.345 |
+ 0.342 |
+ 0.349 |
+ 0.342 |
+ 0.348 |
+ 0.356 |
+ 0.359 |
+ 0.344 |
+ 0.355 |
+ 0.356 |
+ 0.360 |
+ 0.324 |
+ 3648 |
+ Expression |
+ TPMT_HUMAN |
+ Medium |
+ Human |
+
+
+ TPOR_HUMAN_Bridgford_2020 |
+ 0.273 |
+ 0.222 |
+ 0.230 |
+ 0.188 |
+ 0.162 |
+ 0.213 |
+ 0.299 |
+ 0.316 |
+ 0.265 |
+ 0.290 |
+ 0.290 |
+ 0.265 |
+ 0.282 |
+ 0.247 |
+ 0.307 |
+ 0.265 |
+ 0.179 |
+ 0.282 |
+ 0.205 |
+ 0.213 |
+ 0.213 |
+ 0.299 |
+ 0.239 |
+ 0.265 |
+ 0.307 |
+ 0.128 |
+ 0.350 |
+ 0.410 |
+ 0.239 |
+ 0.358 |
+ 0.282 |
+ 0.133 |
+ 0.324 |
+ 0.307 |
+ 0.290 |
+ 0.333 |
+ 0.299 |
+ 0.299 |
+ 0.367 |
+ 0.316 |
+ 0.307 |
+ 0.350 |
+ 0.247 |
+ 0.247 |
+ 0.213 |
+ 0.265 |
+ 0.119 |
+ 0.230 |
+ 0.213 |
+ 0.000 |
+ 0.205 |
+ 0.196 |
+ 0.273 |
+ 0.230 |
+ 0.205 |
+ 0.273 |
+ 0.290 |
+ 0.111 |
+ 0.188 |
+ 0.239 |
+ 0.324 |
+ 0.299 |
+ 562 |
+ OrganismalFitness |
+ TPOR_HUMAN |
+ Low |
+ Human |
+
+
+ TRPC_SACS2_Chan_2017 |
+ 0.516 |
+ 0.527 |
+ 0.462 |
+ 0.479 |
+ 0.487 |
+ 0.508 |
+ 0.188 |
+ 0.465 |
+ 0.557 |
+ 0.562 |
+ 0.546 |
+ 0.543 |
+ 0.557 |
+ 0.258 |
+ 0.506 |
+ 0.532 |
+ 0.541 |
+ 0.543 |
+ 0.524 |
+ 0.462 |
+ 0.368 |
+ 0.519 |
+ 0.465 |
+ 0.516 |
+ 0.436 |
+ 0.471 |
+ 0.444 |
+ 0.460 |
+ 0.441 |
+ 0.441 |
+ 0.527 |
+ 0.462 |
+ 0.196 |
+ 0.487 |
+ 0.492 |
+ 0.508 |
+ 0.511 |
+ 0.527 |
+ 0.527 |
+ 0.506 |
+ 0.516 |
+ 0.516 |
+ 0.395 |
+ 0.045 |
+ 0.543 |
+ 0.465 |
+ 0.250 |
+ 0.384 |
+ 0.387 |
+ 0.099 |
+ 0.541 |
+ 0.495 |
+ 0.503 |
+ 0.557 |
+ 0.530 |
+ 0.524 |
+ 0.543 |
+ 0.546 |
+ 0.524 |
+ 0.551 |
+ 0.535 |
+ 0.511 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ TRPC_THEMA_Chan_2017 |
+ 0.365 |
+ 0.371 |
+ 0.348 |
+ 0.371 |
+ 0.365 |
+ 0.363 |
+ 0.237 |
+ 0.333 |
+ 0.383 |
+ 0.383 |
+ 0.371 |
+ 0.395 |
+ 0.406 |
+ 0.190 |
+ 0.377 |
+ 0.386 |
+ 0.392 |
+ 0.383 |
+ 0.389 |
+ 0.354 |
+ 0.283 |
+ 0.324 |
+ 0.330 |
+ 0.339 |
+ 0.354 |
+ 0.354 |
+ 0.313 |
+ 0.333 |
+ 0.360 |
+ 0.348 |
+ 0.339 |
+ 0.304 |
+ 0.146 |
+ 0.333 |
+ 0.392 |
+ 0.365 |
+ 0.363 |
+ 0.380 |
+ 0.363 |
+ 0.368 |
+ 0.365 |
+ 0.357 |
+ 0.330 |
+ 0.093 |
+ 0.377 |
+ 0.333 |
+ 0.169 |
+ 0.263 |
+ 0.275 |
+ 0.003 |
+ 0.383 |
+ 0.316 |
+ 0.357 |
+ 0.377 |
+ 0.377 |
+ 0.330 |
+ 0.360 |
+ 0.360 |
+ 0.351 |
+ 0.377 |
+ 0.409 |
+ 0.383 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_THEMA |
+ Medium |
+ Prokaryote |
+
+
+ UBC9_HUMAN_Weile_2017 |
+ 0.304 |
+ 0.407 |
+ 0.457 |
+ 0.474 |
+ 0.438 |
+ 0.457 |
+ -0.027 |
+ 0.327 |
+ 0.430 |
+ 0.440 |
+ 0.340 |
+ 0.396 |
+ 0.432 |
+ 0.018 |
+ 0.025 |
+ 0.326 |
+ 0.379 |
+ 0.426 |
+ 0.490 |
+ 0.477 |
+ 0.168 |
+ 0.337 |
+ 0.379 |
+ 0.346 |
+ 0.348 |
+ 0.396 |
+ 0.388 |
+ 0.359 |
+ 0.354 |
+ 0.399 |
+ 0.371 |
+ 0.372 |
+ 0.103 |
+ 0.168 |
+ 0.346 |
+ 0.355 |
+ 0.243 |
+ 0.366 |
+ 0.394 |
+ 0.371 |
+ 0.437 |
+ 0.455 |
+ 0.209 |
+ 0.001 |
+ 0.316 |
+ 0.423 |
+ 0.273 |
+ 0.277 |
+ 0.385 |
+ 0.218 |
+ 0.327 |
+ 0.346 |
+ 0.316 |
+ 0.352 |
+ 0.338 |
+ 0.346 |
+ 0.327 |
+ 0.343 |
+ 0.351 |
+ 0.351 |
+ 0.293 |
+ 0.357 |
+ 2563 |
+ OrganismalFitness |
+ UBC9_HUMAN |
+ Medium |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X |
+ 0.247 |
+ 0.336 |
+ 0.369 |
+ 0.379 |
+ 0.370 |
+ 0.378 |
+ 0.152 |
+ 0.349 |
+ 0.363 |
+ 0.371 |
+ 0.341 |
+ 0.414 |
+ 0.400 |
+ 0.334 |
+ 0.358 |
+ 0.394 |
+ 0.374 |
+ 0.402 |
+ 0.428 |
+ 0.383 |
+ 0.217 |
+ 0.358 |
+ 0.350 |
+ 0.371 |
+ 0.377 |
+ 0.383 |
+ 0.342 |
+ 0.341 |
+ 0.369 |
+ 0.387 |
+ 0.422 |
+ 0.418 |
+ 0.387 |
+ 0.153 |
+ 0.167 |
+ 0.299 |
+ 0.333 |
+ 0.324 |
+ 0.332 |
+ 0.382 |
+ 0.375 |
+ 0.386 |
+ 0.326 |
+ -0.020 |
+ 0.252 |
+ 0.259 |
+ 0.369 |
+ 0.292 |
+ 0.394 |
+ 0.305 |
+ 0.350 |
+ 0.385 |
+ 0.369 |
+ 0.370 |
+ 0.374 |
+ 0.366 |
+ 0.374 |
+ 0.375 |
+ 0.385 |
+ 0.374 |
+ 0.411 |
+ 0.390 |
+ 3622 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_MOUSE_Starita_2013 |
+ 0.306 |
+ 0.312 |
+ 0.348 |
+ 0.342 |
+ 0.348 |
+ 0.354 |
+ 0.040 |
+ 0.046 |
+ 0.318 |
+ 0.342 |
+ 0.276 |
+ 0.330 |
+ 0.348 |
+ 0.318 |
+ 0.336 |
+ 0.360 |
+ 0.330 |
+ 0.318 |
+ 0.288 |
+ 0.318 |
+ 0.082 |
+ 0.276 |
+ 0.270 |
+ 0.252 |
+ 0.348 |
+ 0.300 |
+ 0.288 |
+ 0.288 |
+ 0.252 |
+ 0.330 |
+ 0.360 |
+ 0.300 |
+ 0.058 |
+ -0.009 |
+ 0.118 |
+ 0.203 |
+ 0.312 |
+ 0.288 |
+ 0.306 |
+ 0.342 |
+ 0.360 |
+ 0.379 |
+ 0.306 |
+ -0.003 |
+ 0.330 |
+ 0.348 |
+ 0.191 |
+ 0.270 |
+ -0.057 |
+ 0.034 |
+ 0.318 |
+ 0.330 |
+ 0.354 |
+ 0.354 |
+ 0.342 |
+ 0.360 |
+ 0.330 |
+ 0.360 |
+ 0.342 |
+ 0.354 |
+ 0.330 |
+ 0.288 |
+ 899 |
+ Activity |
+ UBE4B_MOUSE |
+ Low |
+ Eukaryote |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T |
+ 0.374 |
+ 0.495 |
+ 0.528 |
+ 0.544 |
+ 0.542 |
+ 0.566 |
+ 0.142 |
+ 0.294 |
+ 0.432 |
+ 0.478 |
+ 0.492 |
+ 0.374 |
+ 0.454 |
+ 0.192 |
+ 0.187 |
+ 0.129 |
+ 0.101 |
+ 0.561 |
+ 0.522 |
+ 0.421 |
+ 0.404 |
+ 0.393 |
+ 0.429 |
+ 0.487 |
+ 0.443 |
+ 0.454 |
+ 0.462 |
+ 0.443 |
+ 0.451 |
+ 0.506 |
+ 0.528 |
+ 0.382 |
+ 0.264 |
+ 0.283 |
+ 0.376 |
+ 0.365 |
+ 0.374 |
+ 0.410 |
+ 0.418 |
+ 0.553 |
+ 0.553 |
+ 0.547 |
+ 0.093 |
+ 0.098 |
+ 0.291 |
+ 0.101 |
+ 0.445 |
+ 0.244 |
+ 0.588 |
+ 0.484 |
+ 0.533 |
+ 0.536 |
+ 0.547 |
+ 0.553 |
+ 0.561 |
+ 0.572 |
+ 0.536 |
+ 0.555 |
+ 0.547 |
+ 0.550 |
+ 0.586 |
+ 0.514 |
+ 1453 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8 |
+ 0.120 |
+ 0.192 |
+ 0.120 |
+ 0.148 |
+ 0.192 |
+ 0.198 |
+ 0.187 |
+ -0.057 |
+ 0.264 |
+ 0.242 |
+ 0.519 |
+ 0.441 |
+ 0.463 |
+ 0.286 |
+ 0.419 |
+ 0.491 |
+ 0.519 |
+ 0.491 |
+ 0.546 |
+ 0.109 |
+ 0.297 |
+ 0.402 |
+ 0.358 |
+ 0.485 |
+ 0.320 |
+ 0.342 |
+ 0.331 |
+ 0.369 |
+ 0.447 |
+ 0.441 |
+ 0.430 |
+ 0.295 |
+ 0.048 |
+ 0.237 |
+ 0.358 |
+ 0.397 |
+ 0.198 |
+ 0.281 |
+ 0.342 |
+ 0.220 |
+ 0.248 |
+ 0.281 |
+ 0.347 |
+ 0.209 |
+ 0.441 |
+ 0.380 |
+ 0.386 |
+ 0.425 |
+ 0.430 |
+ 0.375 |
+ 0.530 |
+ 0.480 |
+ 0.508 |
+ 0.530 |
+ 0.508 |
+ 0.568 |
+ 0.474 |
+ 0.535 |
+ 0.563 |
+ 0.535 |
+ 0.497 |
+ 0.480 |
+ 723 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5 |
+ 0.271 |
+ 0.323 |
+ 0.432 |
+ 0.476 |
+ 0.495 |
+ 0.499 |
+ 0.065 |
+ 0.416 |
+ 0.441 |
+ 0.478 |
+ 0.550 |
+ 0.485 |
+ 0.490 |
+ 0.260 |
+ 0.158 |
+ 0.513 |
+ 0.517 |
+ 0.474 |
+ 0.420 |
+ 0.425 |
+ 0.234 |
+ 0.350 |
+ 0.429 |
+ 0.385 |
+ 0.379 |
+ 0.425 |
+ 0.411 |
+ 0.378 |
+ 0.441 |
+ 0.474 |
+ 0.499 |
+ 0.489 |
+ 0.283 |
+ 0.264 |
+ 0.343 |
+ 0.357 |
+ 0.409 |
+ 0.415 |
+ 0.409 |
+ 0.501 |
+ 0.494 |
+ 0.480 |
+ 0.163 |
+ 0.200 |
+ 0.360 |
+ 0.351 |
+ 0.387 |
+ 0.346 |
+ 0.539 |
+ 0.464 |
+ 0.545 |
+ 0.553 |
+ 0.555 |
+ 0.539 |
+ 0.546 |
+ 0.552 |
+ 0.553 |
+ 0.546 |
+ 0.553 |
+ 0.552 |
+ 0.499 |
+ 0.508 |
+ 2568 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VKOR1_HUMAN_Chiasson_2020_abundance |
+ 0.323 |
+ 0.340 |
+ 0.351 |
+ 0.357 |
+ 0.387 |
+ 0.386 |
+ 0.124 |
+ 0.317 |
+ 0.446 |
+ 0.449 |
+ 0.396 |
+ 0.375 |
+ 0.411 |
+ 0.139 |
+ 0.307 |
+ 0.381 |
+ 0.443 |
+ 0.426 |
+ 0.424 |
+ 0.393 |
+ 0.175 |
+ 0.188 |
+ 0.332 |
+ 0.359 |
+ 0.236 |
+ 0.411 |
+ 0.384 |
+ 0.384 |
+ 0.342 |
+ 0.393 |
+ 0.345 |
+ 0.264 |
+ 0.089 |
+ 0.111 |
+ 0.221 |
+ 0.402 |
+ 0.331 |
+ 0.342 |
+ 0.452 |
+ 0.386 |
+ 0.396 |
+ 0.437 |
+ 0.108 |
+ 0.077 |
+ 0.405 |
+ 0.265 |
+ 0.371 |
+ 0.383 |
+ 0.409 |
+ 0.191 |
+ 0.437 |
+ 0.421 |
+ 0.470 |
+ 0.475 |
+ 0.478 |
+ 0.445 |
+ 0.464 |
+ 0.437 |
+ 0.458 |
+ 0.470 |
+ 0.461 |
+ 0.377 |
+ 2695 |
+ Expression |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VKOR1_HUMAN_Chiasson_2020_activity |
+ 0.317 |
+ 0.311 |
+ 0.335 |
+ 0.359 |
+ 0.365 |
+ 0.365 |
+ 0.037 |
+ 0.311 |
+ 0.371 |
+ 0.383 |
+ 0.353 |
+ 0.383 |
+ 0.383 |
+ 0.037 |
+ 0.263 |
+ 0.281 |
+ 0.371 |
+ 0.353 |
+ 0.365 |
+ 0.371 |
+ 0.096 |
+ 0.055 |
+ 0.269 |
+ 0.293 |
+ 0.144 |
+ 0.317 |
+ 0.323 |
+ 0.317 |
+ 0.347 |
+ 0.335 |
+ 0.341 |
+ 0.287 |
+ 0.162 |
+ 0.061 |
+ 0.132 |
+ 0.329 |
+ 0.317 |
+ 0.311 |
+ 0.383 |
+ 0.353 |
+ 0.371 |
+ 0.371 |
+ 0.067 |
+ 0.031 |
+ 0.377 |
+ 0.198 |
+ 0.198 |
+ 0.317 |
+ 0.335 |
+ 0.049 |
+ 0.371 |
+ 0.311 |
+ 0.347 |
+ 0.335 |
+ 0.347 |
+ 0.347 |
+ 0.347 |
+ 0.353 |
+ 0.335 |
+ 0.359 |
+ 0.371 |
+ 0.281 |
+ 697 |
+ Activity |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM |
+ -0.076 |
+ 0.011 |
+ 0.102 |
+ 0.144 |
+ 0.081 |
+ 0.110 |
+ 0.094 |
+ 0.110 |
+ 0.342 |
+ 0.351 |
+ 0.363 |
+ 0.264 |
+ 0.280 |
+ 0.214 |
+ 0.342 |
+ 0.384 |
+ 0.463 |
+ 0.496 |
+ 0.421 |
+ 0.135 |
+ 0.007 |
+ 0.139 |
+ 0.115 |
+ 0.222 |
+ 0.197 |
+ 0.185 |
+ 0.139 |
+ 0.189 |
+ 0.247 |
+ 0.069 |
+ 0.355 |
+ 0.338 |
+ 0.231 |
+ 0.040 |
+ 0.011 |
+ 0.073 |
+ -0.084 |
+ -0.055 |
+ -0.080 |
+ 0.028 |
+ 0.028 |
+ -0.043 |
+ 0.218 |
+ 0.036 |
+ 0.425 |
+ 0.293 |
+ 0.463 |
+ 0.392 |
+ 0.442 |
+ 0.442 |
+ 0.467 |
+ 0.446 |
+ 0.446 |
+ 0.446 |
+ 0.454 |
+ 0.454 |
+ 0.438 |
+ 0.458 |
+ 0.434 |
+ 0.467 |
+ 0.492 |
+ 0.417 |
+ 1047 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YAIA_ECOLI_Tsuboyama_2023_2KVT |
+ 0.243 |
+ 0.511 |
+ 0.432 |
+ 0.455 |
+ 0.434 |
+ 0.432 |
+ -0.144 |
+ 0.461 |
+ 0.497 |
+ 0.504 |
+ 0.483 |
+ 0.148 |
+ 0.309 |
+ -0.091 |
+ 0.106 |
+ 0.432 |
+ 0.521 |
+ 0.544 |
+ 0.546 |
+ 0.523 |
+ -0.091 |
+ -0.167 |
+ -0.091 |
+ 0.188 |
+ -0.099 |
+ -0.142 |
+ -0.186 |
+ 0.017 |
+ 0.394 |
+ 0.449 |
+ 0.550 |
+ 0.524 |
+ 0.004 |
+ -0.197 |
+ -0.051 |
+ 0.396 |
+ 0.273 |
+ 0.267 |
+ 0.417 |
+ 0.404 |
+ 0.396 |
+ 0.485 |
+ -0.059 |
+ -0.212 |
+ 0.068 |
+ -0.040 |
+ 0.326 |
+ 0.271 |
+ 0.432 |
+ 0.387 |
+ 0.523 |
+ 0.531 |
+ 0.529 |
+ 0.527 |
+ 0.529 |
+ 0.533 |
+ 0.525 |
+ 0.540 |
+ 0.525 |
+ 0.527 |
+ 0.459 |
+ 0.372 |
+ 1890 |
+ Stability |
+ YAIA_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ YAP1_HUMAN_Araya_2012 |
+ 0.338 |
+ 0.260 |
+ 0.356 |
+ 0.359 |
+ 0.347 |
+ 0.358 |
+ 0.240 |
+ 0.250 |
+ 0.055 |
+ 0.067 |
+ 0.263 |
+ 0.208 |
+ 0.218 |
+ 0.342 |
+ 0.361 |
+ 0.369 |
+ 0.381 |
+ 0.300 |
+ 0.246 |
+ 0.149 |
+ 0.129 |
+ 0.134 |
+ 0.117 |
+ 0.114 |
+ 0.230 |
+ 0.171 |
+ 0.201 |
+ 0.157 |
+ 0.107 |
+ 0.245 |
+ 0.353 |
+ 0.378 |
+ 0.093 |
+ 0.230 |
+ 0.146 |
+ 0.151 |
+ 0.309 |
+ 0.259 |
+ 0.273 |
+ 0.342 |
+ 0.312 |
+ 0.337 |
+ 0.380 |
+ -0.070 |
+ 0.254 |
+ 0.391 |
+ 0.285 |
+ 0.272 |
+ 0.327 |
+ 0.145 |
+ 0.370 |
+ 0.303 |
+ 0.343 |
+ 0.341 |
+ 0.351 |
+ 0.327 |
+ 0.339 |
+ 0.362 |
+ 0.367 |
+ 0.361 |
+ 0.290 |
+ 0.348 |
+ 10075 |
+ Binding |
+ YAP1_HUMAN |
+ Low |
+ Human |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD |
+ 0.471 |
+ 0.550 |
+ 0.553 |
+ 0.553 |
+ 0.550 |
+ 0.553 |
+ 0.393 |
+ 0.448 |
+ 0.526 |
+ 0.522 |
+ 0.559 |
+ 0.553 |
+ 0.553 |
+ 0.436 |
+ 0.438 |
+ 0.550 |
+ 0.548 |
+ 0.550 |
+ 0.546 |
+ 0.542 |
+ 0.421 |
+ 0.486 |
+ 0.506 |
+ 0.506 |
+ 0.500 |
+ 0.500 |
+ 0.464 |
+ 0.500 |
+ 0.540 |
+ 0.497 |
+ 0.516 |
+ 0.493 |
+ 0.502 |
+ 0.434 |
+ 0.476 |
+ 0.446 |
+ 0.512 |
+ 0.510 |
+ 0.506 |
+ 0.520 |
+ 0.516 |
+ 0.514 |
+ 0.240 |
+ 0.186 |
+ 0.359 |
+ 0.264 |
+ 0.315 |
+ 0.381 |
+ 0.563 |
+ 0.534 |
+ 0.559 |
+ 0.550 |
+ 0.555 |
+ 0.557 |
+ 0.555 |
+ 0.561 |
+ 0.553 |
+ 0.557 |
+ 0.553 |
+ 0.555 |
+ 0.544 |
+ 0.555 |
+ 2300 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_Selection_Type_level.csv b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_Selection_Type_level.csv
new file mode 100644
index 0000000..225ca8b
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_Selection_Type_level.csv
@@ -0,0 +1,7 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type
+0.291,0.342,0.338,0.345,0.351,0.357,0.134,0.274,0.351,0.367,0.328,0.304,0.323,0.147,0.238,0.296,0.331,0.322,0.311,0.298,0.231,0.273,0.283,0.284,0.257,0.3,0.306,0.307,0.311,0.362,0.369,0.331,0.135,0.216,0.266,0.309,0.33,0.343,0.36,0.361,0.368,0.373,0.213,0.076,0.304,0.256,0.243,0.299,0.283,0.143,0.356,0.344,0.351,0.357,0.355,0.361,0.353,0.356,0.358,0.366,0.354,0.283,Activity
+0.264,0.218,0.26,0.272,0.263,0.283,0.156,0.226,0.233,0.248,0.221,0.21,0.241,0.201,0.234,0.253,0.262,0.247,0.229,0.243,0.204,0.204,0.222,0.225,0.204,0.228,0.231,0.22,0.225,0.284,0.269,0.255,0.118,0.21,0.213,0.229,0.279,0.267,0.266,0.3,0.289,0.284,0.21,0.057,0.202,0.212,0.264,0.248,0.305,0.126,0.262,0.257,0.267,0.266,0.269,0.281,0.258,0.264,0.276,0.274,0.29,0.285,Binding
+0.274,0.297,0.292,0.311,0.318,0.319,0.166,0.288,0.338,0.352,0.32,0.322,0.345,0.192,0.262,0.315,0.332,0.323,0.333,0.279,0.262,0.314,0.338,0.33,0.295,0.342,0.353,0.341,0.329,0.336,0.325,0.26,0.155,0.264,0.311,0.325,0.326,0.344,0.358,0.346,0.347,0.358,0.237,0.126,0.322,0.29,0.335,0.348,0.319,0.141,0.332,0.326,0.345,0.339,0.347,0.345,0.341,0.34,0.342,0.355,0.388,0.337,Expression
+0.299,0.322,0.314,0.327,0.345,0.35,0.103,0.268,0.321,0.327,0.274,0.287,0.305,0.104,0.169,0.237,0.29,0.293,0.301,0.282,0.249,0.292,0.293,0.299,0.261,0.3,0.295,0.296,0.302,0.354,0.348,0.321,0.136,0.247,0.28,0.301,0.32,0.33,0.34,0.353,0.357,0.359,0.169,0.046,0.288,0.215,0.23,0.285,0.246,0.121,0.296,0.287,0.298,0.305,0.3,0.303,0.301,0.3,0.297,0.309,0.283,0.219,OrganismalFitness
+0.301,0.342,0.383,0.384,0.397,0.401,0.171,0.311,0.395,0.403,0.41,0.357,0.387,0.209,0.344,0.41,0.42,0.415,0.398,0.369,0.232,0.286,0.317,0.328,0.285,0.324,0.308,0.327,0.365,0.423,0.422,0.395,0.21,0.218,0.274,0.315,0.354,0.366,0.378,0.402,0.403,0.406,0.235,0.074,0.313,0.282,0.395,0.368,0.5,0.445,0.46,0.445,0.452,0.459,0.458,0.466,0.455,0.46,0.459,0.464,0.476,0.453,Stability
+0.286,0.305,0.318,0.328,0.335,0.342,0.146,0.274,0.328,0.339,0.311,0.296,0.32,0.171,0.249,0.302,0.327,0.32,0.314,0.294,0.236,0.274,0.291,0.293,0.261,0.299,0.299,0.298,0.306,0.352,0.346,0.312,0.151,0.231,0.269,0.296,0.322,0.33,0.34,0.352,0.353,0.356,0.213,0.076,0.286,0.251,0.294,0.31,0.331,0.195,0.341,0.332,0.343,0.345,0.346,0.351,0.342,0.344,0.347,0.354,0.358,0.316,
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_level.csv b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_level.csv
new file mode 100644
index 0000000..2b790f9
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/DMS_substitutions_MCC_Uniprot_level.csv
@@ -0,0 +1,223 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type,UniProt_ID,MSA_Neff_L_category,Taxon
+0.265,0.23,0.082,0.075,0.28,0.292,-0.124,0.063,0.334,0.335,-0.023,-0.041,0.005,-0.058,-0.045,-0.037,0.147,0.277,0.281,0.168,0.263,0.218,0.219,0.21,0.241,0.236,0.224,0.226,0.197,0.27,0.208,0.191,-0.005,0.217,0.234,0.193,0.264,0.263,0.248,0.258,0.254,0.271,-0.043,-0.061,0.097,-0.052,0.202,0.205,0.21,0.086,0.175,0.178,0.183,0.202,0.191,0.191,0.184,0.183,0.183,0.188,0.14,0.084,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+0.359,0.332,0.317,0.337,0.387,0.401,0.008,0.39,0.406,0.409,0.356,0.391,0.401,0.022,0.022,0.04,0.076,0.087,0.1,0.359,0.386,0.397,0.398,0.394,0.393,0.389,0.366,0.384,0.385,0.382,0.435,0.418,0.256,0.385,0.384,0.4,0.393,0.399,0.398,0.402,0.409,0.411,0.329,-0.002,0.378,0.332,0.284,0.359,0.187,0.102,0.167,0.19,0.205,0.208,0.168,0.186,0.187,0.163,0.145,0.197,0.119,0.063,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+-0.009,0.026,0.082,0.061,0.043,0.052,0.0,0.065,0.061,0.069,0.017,0.043,0.065,-0.026,-0.03,0.004,0.048,0.022,0.056,0.048,-0.004,0.022,0.052,0.043,0.03,0.061,0.052,0.069,0.065,0.03,0.0,-0.007,0.017,0.043,0.052,0.082,0.03,0.065,0.056,0.039,0.082,0.061,-0.013,-0.035,0.052,-0.03,0.022,0.052,0.035,0.056,0.065,0.052,0.065,0.056,0.065,0.078,0.039,0.069,0.061,0.082,0.043,0.004,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+0.347,0.359,0.096,0.041,0.312,0.316,0.028,0.038,0.234,0.258,0.065,0.082,0.087,0.07,0.079,0.084,0.06,0.07,0.084,0.043,0.067,0.055,0.053,0.094,0.014,0.065,0.033,0.07,0.089,0.351,0.244,0.234,0.004,0.053,0.041,0.055,0.212,0.21,0.215,0.193,0.183,0.193,0.062,0.043,0.067,0.074,0.312,0.353,0.328,0.263,0.21,0.237,0.224,0.246,0.241,0.28,0.208,0.244,0.191,0.237,0.316,0.212,Activity,A0A247D711_LISMN,High,Prokaryote
+0.385,0.389,0.362,0.382,0.415,0.422,-0.008,0.344,0.381,0.382,0.086,0.382,0.412,0.004,0.026,0.047,0.38,0.39,0.376,0.321,0.344,0.401,0.391,0.406,0.312,0.362,0.375,0.349,0.37,0.414,0.361,0.297,0.062,0.347,0.393,0.41,0.404,0.433,0.421,0.446,0.453,0.445,0.016,0.019,0.223,0.017,0.311,0.336,0.322,0.114,0.367,0.359,0.397,0.388,0.389,0.405,0.402,0.388,0.404,0.406,0.156,0.126,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.385,0.389,0.362,0.382,0.415,0.422,-0.008,0.344,0.381,0.382,0.086,0.382,0.412,0.004,0.026,0.047,0.38,0.39,0.376,0.321,0.344,0.401,0.391,0.406,0.312,0.362,0.375,0.349,0.37,0.414,0.361,0.297,0.062,0.347,0.393,0.41,0.404,0.433,0.421,0.446,0.453,0.445,0.016,0.019,0.223,0.017,0.311,0.336,0.322,0.114,0.367,0.359,0.397,0.388,0.389,0.405,0.402,0.388,0.404,0.406,0.156,0.126,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.344,0.252,0.341,0.335,0.326,0.323,0.015,0.267,0.254,0.245,0.024,0.019,0.025,0.019,0.02,0.02,0.103,0.139,0.196,0.194,0.269,0.301,0.319,0.313,0.05,0.208,0.224,0.197,0.258,0.365,0.24,0.237,0.024,0.269,0.284,0.314,0.279,0.293,0.296,0.349,0.356,0.367,0.015,0.002,0.047,0.005,0.204,0.199,0.09,0.081,0.154,0.151,0.145,0.163,0.153,0.163,0.158,0.168,0.16,0.158,0.112,0.064,OrganismalFitness,A4D664_9INFA,Medium,Virus
+0.304,0.469,0.554,0.557,0.514,0.536,0.29,0.427,0.526,0.58,0.585,0.546,0.572,0.354,0.441,0.551,0.639,0.614,0.513,0.465,0.336,0.448,0.463,0.525,0.441,0.514,0.53,0.527,0.593,0.568,0.643,0.573,0.206,0.34,0.464,0.521,0.46,0.493,0.539,0.547,0.556,0.567,0.412,0.08,0.56,0.504,0.546,0.597,0.536,0.325,0.618,0.603,0.596,0.59,0.597,0.607,0.617,0.617,0.616,0.615,0.61,0.417,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+0.363,0.323,0.35,0.323,0.246,0.243,0.292,0.11,0.246,0.25,0.234,0.218,0.295,0.338,0.307,0.305,0.359,0.371,0.269,0.185,0.217,0.171,0.204,0.198,0.28,0.192,0.212,0.202,0.201,0.388,0.191,0.128,0.412,0.28,0.161,0.236,0.333,0.266,0.361,0.332,0.288,0.31,0.316,0.311,0.333,0.304,0.232,0.315,-0.13,0.042,0.357,0.338,0.337,0.341,0.344,0.316,0.356,0.281,0.29,0.343,0.348,0.332,Stability,A4_HUMAN,Low,Human
+0.192,0.363,0.249,0.336,0.376,0.374,0.12,0.196,0.389,0.38,0.287,0.345,0.371,0.127,0.147,0.154,0.363,0.398,0.414,0.278,0.112,0.156,0.174,0.209,0.178,0.32,0.298,0.327,0.354,0.363,0.4,0.349,0.003,0.163,0.163,0.311,0.32,0.303,0.334,0.349,0.343,0.385,0.154,0.076,0.243,0.163,0.187,0.265,0.245,0.114,0.363,0.349,0.36,0.349,0.356,0.34,0.351,0.349,0.347,0.358,0.36,0.2,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+0.162,0.105,0.08,0.08,0.117,0.103,-0.028,0.013,0.161,0.17,0.18,0.087,0.126,-0.014,0.035,0.152,0.152,0.116,0.089,0.126,-0.005,0.033,0.094,0.141,-0.028,0.074,0.117,0.116,0.18,0.096,0.103,0.073,0.062,0.006,0.04,0.045,0.135,0.092,0.083,0.107,0.083,0.081,0.008,0.045,0.218,0.035,0.288,0.24,0.27,0.08,0.161,0.207,0.161,0.182,0.173,0.195,0.164,0.162,0.166,0.173,0.233,0.161,Binding,ACE2_HUMAN,Medium,Human
+0.277,0.364,0.41,0.411,0.397,0.407,0.364,0.396,0.432,0.426,0.43,0.428,0.428,0.313,0.341,0.373,0.392,0.402,0.405,0.412,0.425,0.427,0.415,0.39,0.431,0.421,0.421,0.425,0.391,0.429,0.406,0.311,0.25,0.435,0.421,0.401,0.428,0.43,0.427,0.423,0.43,0.428,0.375,0.124,0.431,0.411,0.336,0.396,0.368,0.169,0.386,0.39,0.412,0.402,0.409,0.402,0.395,0.414,0.403,0.417,0.457,0.411,Activity,ADRB2_HUMAN,Medium,Human
+0.011,0.216,0.267,0.267,0.267,0.267,-0.169,0.139,0.164,0.216,0.216,0.216,0.267,-0.143,-0.143,0.011,0.19,0.19,0.113,0.395,-0.092,0.036,0.19,0.241,-0.066,0.139,0.293,0.216,0.19,0.164,0.293,0.293,0.036,-0.041,-0.015,0.216,0.036,0.113,0.19,0.216,0.241,0.216,-0.169,-0.169,0.164,0.036,0.293,0.241,0.267,0.139,0.216,0.241,0.216,0.216,0.164,0.216,0.216,0.216,0.19,0.19,0.19,-0.015,Activity,AICDA_HUMAN,Medium,Human
+0.201,0.249,0.241,0.226,0.229,0.234,-0.227,0.013,0.085,0.096,0.188,0.07,0.271,-0.139,0.186,0.127,0.102,0.135,0.127,-0.015,-0.063,-0.157,-0.094,-0.087,-0.271,-0.124,-0.112,-0.176,0.005,0.203,0.125,0.128,0.007,-0.207,-0.108,-0.248,0.222,0.234,0.183,0.262,0.256,0.211,-0.16,-0.162,0.04,0.003,0.143,0.02,0.276,0.232,0.283,0.149,0.048,0.172,0.162,0.164,0.163,0.166,0.174,0.159,0.153,0.26,Stability,AMFR_HUMAN,Medium,Human
+0.207,0.391,0.424,0.448,0.421,0.396,0.035,0.363,0.314,0.494,0.466,0.496,0.546,0.214,0.282,0.299,0.437,0.557,0.493,0.394,0.452,0.449,0.444,0.454,0.453,0.485,0.455,0.453,0.433,0.47,0.487,0.455,0.243,0.499,0.389,0.351,0.474,0.395,0.36,0.494,0.432,0.415,0.233,0.04,0.358,0.281,0.291,0.374,0.324,0.167,0.448,0.41,0.418,0.435,0.422,0.432,0.413,0.426,0.453,0.435,0.475,0.286,Activity,AMIE_PSEAE,High,Prokaryote
+0.417,0.382,0.325,0.348,0.364,0.369,0.389,0.342,0.368,0.382,0.405,0.382,0.406,0.36,0.431,0.434,0.448,0.437,0.418,0.008,0.316,0.346,0.38,0.38,0.374,0.383,0.372,0.352,0.37,0.433,0.41,0.406,0.222,0.343,0.37,0.341,0.404,0.424,0.409,0.388,0.408,0.394,0.381,0.285,0.366,0.372,0.339,0.361,0.311,0.123,0.414,0.408,0.431,0.434,0.431,0.442,0.418,0.43,0.422,0.441,0.419,0.442,Activity,ANCSZ,Medium,Eukaryote
+0.212,0.24,0.293,0.296,0.281,0.281,0.181,0.249,0.315,0.352,0.284,0.203,0.262,0.209,0.181,0.355,0.34,0.377,0.293,0.296,0.225,0.327,0.33,0.277,0.231,0.296,0.29,0.324,0.302,0.336,0.305,0.239,0.066,0.253,0.265,0.324,0.212,0.215,0.281,0.268,0.277,0.299,0.162,0.144,0.33,0.296,0.585,0.498,0.601,0.495,0.371,0.392,0.377,0.386,0.358,0.38,0.355,0.368,0.34,0.377,0.43,0.368,Stability,ARGR_ECOLI,Medium,Prokaryote
+0.519,0.002,0.508,0.532,0.319,0.508,0.177,0.414,0.201,0.225,0.177,0.059,0.248,0.059,0.154,0.106,0.225,0.343,0.272,0.225,0.154,0.035,0.225,0.366,0.059,0.177,0.319,0.248,0.295,0.658,0.319,0.3,0.201,0.106,0.177,0.366,0.437,0.343,0.343,0.532,0.461,0.366,0.13,0.012,0.154,0.083,0.272,0.248,0.319,0.035,0.272,0.272,0.319,0.201,0.248,0.414,0.225,0.177,0.366,0.248,0.201,0.201,Binding,B2L11_HUMAN,Low,Human
+0.189,0.239,0.379,0.361,0.375,0.381,0.185,0.282,0.365,0.382,0.433,0.369,0.41,0.324,0.359,0.359,0.439,0.371,0.44,0.29,0.332,0.352,0.381,0.34,0.123,0.375,0.408,0.369,0.299,0.294,0.352,0.329,0.266,0.299,0.206,0.268,0.328,0.263,0.309,0.408,0.359,0.382,0.098,0.108,0.21,0.127,0.468,0.367,0.572,0.514,0.442,0.421,0.448,0.442,0.442,0.446,0.435,0.442,0.442,0.444,0.45,0.483,Stability,BBC1_YEAST,High,Eukaryote
+0.327,0.307,0.281,0.296,0.284,0.302,0.098,0.159,0.309,0.391,0.33,0.248,0.289,0.118,0.149,0.251,0.406,0.414,0.324,0.342,0.088,0.299,0.154,0.302,0.136,0.363,0.21,0.33,0.34,0.5,0.472,0.401,0.024,0.164,0.23,0.35,0.355,0.36,0.36,0.296,0.309,0.34,0.078,0.085,0.169,0.146,0.421,0.37,0.518,0.431,0.454,0.452,0.464,0.472,0.459,0.487,0.459,0.449,0.454,0.462,0.378,0.21,Stability,BCHB_CHLTE,Medium,Prokaryote
+0.329,0.521,0.536,0.557,0.517,0.538,0.086,0.328,0.541,0.566,0.526,0.499,0.531,0.253,0.378,0.478,0.568,0.419,0.308,0.483,0.392,0.379,0.359,0.37,0.413,0.474,0.466,0.394,0.322,0.488,0.561,0.559,0.12,0.378,0.357,0.323,0.449,0.444,0.446,0.534,0.542,0.546,0.356,0.038,0.515,0.43,0.424,0.514,0.481,0.23,0.544,0.534,0.54,0.559,0.556,0.55,0.561,0.539,0.542,0.572,0.555,0.417,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.329,0.521,0.536,0.557,0.517,0.538,0.086,0.328,0.541,0.566,0.526,0.499,0.531,0.253,0.378,0.478,0.568,0.419,0.308,0.483,0.392,0.379,0.359,0.37,0.413,0.474,0.466,0.394,0.322,0.488,0.561,0.559,0.12,0.378,0.357,0.323,0.449,0.444,0.446,0.534,0.542,0.546,0.356,0.038,0.515,0.43,0.424,0.514,0.481,0.23,0.544,0.534,0.54,0.559,0.556,0.55,0.561,0.539,0.542,0.572,0.555,0.417,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.329,0.521,0.536,0.557,0.517,0.538,0.086,0.328,0.541,0.566,0.526,0.499,0.531,0.253,0.378,0.478,0.568,0.419,0.308,0.483,0.392,0.379,0.359,0.37,0.413,0.474,0.466,0.394,0.322,0.488,0.561,0.559,0.12,0.378,0.357,0.323,0.449,0.444,0.446,0.534,0.542,0.546,0.356,0.038,0.515,0.43,0.424,0.514,0.481,0.23,0.544,0.534,0.54,0.559,0.556,0.55,0.561,0.539,0.542,0.572,0.555,0.417,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.329,0.521,0.536,0.557,0.517,0.538,0.086,0.328,0.541,0.566,0.526,0.499,0.531,0.253,0.378,0.478,0.568,0.419,0.308,0.483,0.392,0.379,0.359,0.37,0.413,0.474,0.466,0.394,0.322,0.488,0.561,0.559,0.12,0.378,0.357,0.323,0.449,0.444,0.446,0.534,0.542,0.546,0.356,0.038,0.515,0.43,0.424,0.514,0.481,0.23,0.544,0.534,0.54,0.559,0.556,0.55,0.561,0.539,0.542,0.572,0.555,0.417,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.436,0.298,0.423,0.426,0.403,0.408,0.048,0.227,0.334,0.331,0.413,0.299,0.341,0.077,0.286,0.379,0.413,0.426,0.361,0.237,0.065,0.321,0.341,0.349,0.239,0.403,0.391,0.379,0.398,0.399,0.436,0.416,0.212,0.09,0.127,0.371,0.406,0.406,0.436,0.441,0.431,0.448,0.142,0.068,0.446,0.361,0.369,0.391,0.044,0.122,0.431,0.423,0.428,0.411,0.421,0.423,0.428,0.413,0.408,0.423,0.436,0.398,OrganismalFitness,BRCA1_HUMAN,Low,Human
+0.312,0.309,0.309,0.309,0.309,0.309,0.068,0.283,0.122,-0.012,0.309,0.095,0.015,0.015,0.015,0.283,0.309,0.309,0.309,0.041,0.015,0.309,0.309,0.309,0.309,0.283,0.309,0.309,0.068,0.309,0.256,0.285,-0.012,0.041,0.068,0.068,0.309,0.309,0.309,0.309,0.309,0.309,0.015,0.015,0.309,-0.012,-0.039,0.015,-0.2,0.175,0.256,0.283,0.283,0.309,0.309,0.309,0.309,0.309,0.309,0.309,-0.039,0.015,OrganismalFitness,BRCA2_HUMAN,,Human
+0.297,0.268,0.258,0.264,0.308,0.308,-0.011,0.333,0.267,0.272,0.024,0.32,0.37,-0.002,-0.007,-0.003,0.381,0.293,0.298,0.245,0.288,0.269,0.275,0.261,0.168,0.341,0.323,0.357,0.281,0.345,0.333,0.251,0.088,0.269,0.276,0.299,0.312,0.317,0.318,0.332,0.335,0.325,-0.009,-0.016,0.204,0.014,0.393,0.422,0.426,0.172,0.387,0.396,0.41,0.411,0.417,0.399,0.417,0.416,0.409,0.416,0.235,0.134,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+0.135,0.201,0.197,0.204,0.186,0.192,0.124,0.135,0.197,0.221,0.232,0.19,0.234,0.115,0.153,0.153,0.166,0.166,0.168,0.188,0.131,0.184,0.215,0.21,0.188,0.243,0.239,0.25,0.276,0.206,0.166,0.098,0.062,0.173,0.226,0.27,0.175,0.197,0.226,0.204,0.215,0.204,0.164,0.115,0.256,0.212,0.06,0.124,0.137,0.067,0.124,0.144,0.151,0.155,0.153,0.14,0.142,0.133,0.137,0.151,0.252,0.192,OrganismalFitness,CALM1_HUMAN,High,Human
+0.234,0.195,0.188,0.211,0.194,0.188,0.285,0.285,0.179,0.202,0.122,0.146,0.148,0.201,0.217,0.144,0.201,0.111,0.044,0.163,0.145,0.198,0.206,0.202,0.159,0.149,0.188,0.152,0.3,0.274,0.118,0.11,0.102,0.149,0.203,0.36,0.213,0.223,0.318,0.194,0.197,0.263,0.092,0.136,0.21,0.096,0.313,0.301,0.24,0.221,0.151,0.148,0.131,0.151,0.139,0.139,0.143,0.139,0.15,0.145,0.214,0.167,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+0.178,0.142,0.136,0.127,0.12,0.118,0.082,-0.042,0.112,0.108,0.186,0.246,0.215,0.034,0.068,0.263,0.242,0.25,0.272,0.086,0.099,0.114,0.129,0.108,0.007,0.102,0.103,0.132,0.066,0.148,0.192,0.168,0.187,0.016,0.091,0.057,0.172,0.123,0.083,0.154,0.107,0.08,0.066,0.02,0.168,0.183,0.227,0.195,0.209,0.074,0.274,0.263,0.26,0.262,0.256,0.259,0.246,0.263,0.274,0.268,0.301,0.226,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.178,0.142,0.136,0.127,0.12,0.118,0.082,-0.042,0.112,0.108,0.186,0.246,0.215,0.034,0.068,0.263,0.242,0.25,0.272,0.086,0.099,0.114,0.129,0.108,0.007,0.102,0.103,0.132,0.066,0.148,0.192,0.168,0.187,0.016,0.091,0.057,0.172,0.123,0.083,0.154,0.107,0.08,0.066,0.02,0.168,0.183,0.227,0.195,0.209,0.074,0.274,0.263,0.26,0.262,0.256,0.259,0.246,0.263,0.274,0.268,0.301,0.226,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.107,0.141,0.123,0.126,0.135,0.139,0.014,0.079,0.131,0.14,0.123,0.046,0.039,0.027,0.056,0.101,0.133,0.137,0.134,0.01,0.027,0.048,0.116,0.091,0.026,0.144,0.138,0.133,0.048,0.128,0.145,0.129,0.016,0.022,0.033,0.12,0.132,0.134,0.132,0.129,0.131,0.139,0.039,0.011,0.129,0.074,0.074,0.129,0.015,0.02,0.123,0.123,0.121,0.125,0.126,0.13,0.127,0.128,0.121,0.128,0.119,0.077,Activity,CAS9_STRP1,Medium,Prokaryote
+0.314,0.447,0.508,0.503,0.518,0.523,0.053,0.37,0.518,0.518,0.464,0.477,0.487,0.186,0.441,0.518,0.521,0.452,0.38,0.48,0.132,0.378,0.424,0.406,0.383,0.441,0.447,0.393,0.441,0.49,0.467,0.401,0.181,0.048,0.355,0.388,0.39,0.429,0.462,0.505,0.49,0.526,0.27,-0.029,0.48,0.424,0.316,0.429,0.395,0.188,0.459,0.457,0.459,0.462,0.482,0.47,0.467,0.447,0.457,0.487,0.531,0.372,Activity,CASP3_HUMAN,High,Human
+0.335,0.461,0.509,0.521,0.502,0.502,0.051,0.392,0.487,0.533,0.487,0.502,0.53,0.225,0.516,0.518,0.535,0.499,0.478,0.504,0.18,0.459,0.461,0.464,0.475,0.497,0.511,0.485,0.48,0.549,0.511,0.459,0.299,0.101,0.487,0.425,0.442,0.506,0.509,0.497,0.549,0.533,0.371,0.023,0.549,0.514,0.397,0.523,0.521,0.249,0.514,0.502,0.471,0.492,0.518,0.521,0.495,0.495,0.514,0.511,0.556,0.433,Activity,CASP7_HUMAN,Medium,Human
+0.47,0.473,0.567,0.563,0.58,0.578,0.607,0.487,0.449,0.422,0.498,0.538,0.563,0.481,0.594,0.586,0.557,0.508,0.573,0.088,0.506,0.515,0.51,0.508,0.54,0.519,0.498,0.46,0.477,0.576,0.536,0.544,0.357,0.527,0.502,0.529,0.569,0.542,0.561,0.58,0.563,0.576,0.395,0.38,0.294,0.424,0.393,0.386,0.576,0.439,0.565,0.544,0.578,0.557,0.584,0.563,0.557,0.536,0.555,0.559,0.512,0.565,Stability,CATR_CHLRE,High,Eukaryote
+0.555,0.605,0.633,0.634,0.638,0.65,0.375,0.569,0.673,0.685,0.648,0.615,0.611,0.404,0.561,0.642,0.636,0.633,0.607,0.673,0.4,0.406,0.592,0.571,0.449,0.573,0.582,0.557,0.611,0.667,0.658,0.595,0.019,0.404,0.513,0.611,0.576,0.603,0.623,0.65,0.64,0.65,0.397,0.279,0.513,0.522,0.677,0.625,0.805,0.71,0.698,0.725,0.692,0.71,0.737,0.714,0.712,0.71,0.706,0.725,0.631,0.776,Stability,CBPA2_HUMAN,Medium,Human
+0.297,0.306,0.314,0.339,0.319,0.331,0.162,0.205,0.33,0.341,0.304,0.305,0.329,0.07,0.177,0.281,0.299,0.285,0.283,0.309,0.315,0.238,0.261,0.262,0.319,0.254,0.262,0.248,0.273,0.322,0.316,0.299,0.179,0.332,0.268,0.233,0.329,0.289,0.286,0.349,0.317,0.315,0.243,0.062,0.326,0.324,0.21,0.236,0.277,0.058,0.277,0.277,0.283,0.297,0.292,0.289,0.281,0.284,0.29,0.293,0.336,0.238,OrganismalFitness,CBS_HUMAN,Medium,Human
+0.389,0.45,0.493,0.505,0.463,0.507,-0.205,0.454,0.482,0.468,0.48,0.482,0.47,-0.21,0.545,0.514,0.503,0.428,0.442,0.5,0.312,0.41,0.377,0.394,0.372,0.361,0.366,0.31,0.394,0.489,0.507,0.502,0.349,0.277,0.305,0.31,0.415,0.41,0.396,0.496,0.517,0.503,0.465,-0.261,0.249,0.435,-0.002,0.067,0.452,0.421,0.528,0.458,0.535,0.526,0.545,0.515,0.519,0.552,0.519,0.526,0.493,0.615,Stability,CBX4_HUMAN,High,Human
+0.312,0.366,0.401,0.424,0.396,0.414,0.003,0.191,0.314,0.324,0.386,0.33,0.382,0.019,0.008,0.353,0.396,0.396,0.262,0.402,0.026,0.022,-0.062,0.124,-0.109,0.085,0.024,0.027,0.356,0.398,0.472,0.46,0.093,-0.001,0.055,0.292,0.384,0.358,0.391,0.398,0.38,0.405,0.031,-0.016,0.389,0.018,0.252,0.345,0.26,0.208,0.388,0.326,0.354,0.37,0.35,0.361,0.354,0.37,0.394,0.378,0.366,0.19,Activity,CCDB_ECOLI,High,Prokaryote
+0.312,0.366,0.401,0.424,0.396,0.414,0.003,0.191,0.314,0.324,0.386,0.33,0.382,0.019,0.008,0.353,0.396,0.396,0.262,0.402,0.026,0.022,-0.062,0.124,-0.109,0.085,0.024,0.027,0.356,0.398,0.472,0.46,0.093,-0.001,0.055,0.292,0.384,0.358,0.391,0.398,0.38,0.405,0.031,-0.016,0.389,0.018,0.252,0.345,0.26,0.208,0.388,0.326,0.354,0.37,0.35,0.361,0.354,0.37,0.394,0.378,0.366,0.19,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+0.208,0.215,0.206,0.203,0.203,0.21,0.221,0.227,0.239,0.263,0.268,0.258,0.268,0.228,0.255,0.263,0.256,0.241,0.258,0.268,0.271,0.257,0.232,0.23,0.271,0.255,0.255,0.273,0.233,0.259,0.216,0.182,0.086,0.272,0.268,0.264,0.261,0.27,0.28,0.234,0.233,0.227,0.267,0.121,0.268,0.264,0.19,0.23,0.203,0.112,0.246,0.234,0.246,0.25,0.245,0.24,0.252,0.254,0.256,0.256,0.271,0.26,Binding,CCR5_HUMAN,High,Human
+0.144,0.152,0.146,0.146,0.163,0.153,0.062,0.081,0.126,0.131,0.09,0.11,0.123,0.102,0.13,0.109,0.099,0.126,0.099,-0.018,0.103,0.109,0.14,0.126,0.094,0.159,0.131,0.095,0.155,0.168,0.099,0.073,0.078,0.119,0.147,0.126,0.136,0.135,0.132,0.164,0.161,0.156,0.11,0.07,0.094,0.11,0.264,0.209,0.252,0.132,0.164,0.182,0.169,0.193,0.176,0.169,0.179,0.176,0.182,0.193,0.305,0.26,Binding,CD19_HUMAN,Low,Human
+0.378,0.44,0.45,0.462,0.44,0.466,0.416,0.448,0.449,0.473,0.374,0.466,0.484,0.412,0.489,0.494,0.506,0.494,0.455,0.489,0.447,0.45,0.462,0.431,0.448,0.438,0.442,0.454,0.431,0.466,0.447,0.4,0.178,0.468,0.439,0.424,0.489,0.477,0.474,0.489,0.484,0.484,0.466,0.048,0.395,0.488,0.45,0.424,0.489,0.172,0.481,0.48,0.491,0.505,0.503,0.506,0.496,0.486,0.495,0.51,0.496,0.491,Expression,CP2C9_HUMAN,High,Human
+0.378,0.44,0.45,0.462,0.44,0.466,0.416,0.448,0.449,0.473,0.374,0.466,0.484,0.412,0.489,0.494,0.506,0.494,0.455,0.489,0.447,0.45,0.462,0.431,0.448,0.438,0.442,0.454,0.431,0.466,0.447,0.4,0.178,0.468,0.439,0.424,0.489,0.477,0.474,0.489,0.484,0.484,0.466,0.048,0.395,0.488,0.45,0.424,0.489,0.172,0.481,0.48,0.491,0.505,0.503,0.506,0.496,0.486,0.495,0.51,0.496,0.491,Binding,CP2C9_HUMAN,High,Human
+0.255,0.474,0.392,0.409,0.356,0.369,0.213,0.367,0.421,0.434,0.455,0.455,0.502,0.305,0.494,0.418,0.365,0.436,0.429,0.378,0.328,0.375,0.455,0.465,0.342,0.464,0.459,0.453,0.442,0.448,0.428,0.419,0.259,0.274,0.364,0.433,0.347,0.359,0.421,0.417,0.421,0.457,0.233,0.167,0.306,0.329,0.345,0.329,0.426,0.467,0.468,0.459,0.445,0.471,0.479,0.471,0.47,0.476,0.438,0.478,0.456,0.482,Stability,CSN4_MOUSE,Medium,Eukaryote
+0.252,0.308,0.277,0.282,0.31,0.292,0.072,0.32,0.366,0.376,0.368,0.297,0.265,0.105,0.32,0.361,0.434,0.361,0.373,0.257,0.115,0.141,0.158,0.062,0.085,0.153,0.148,0.148,0.232,0.295,0.305,0.271,0.189,0.113,0.11,0.135,0.257,0.247,0.247,0.315,0.315,0.315,0.108,0.032,0.31,0.151,0.396,0.381,0.454,0.346,0.404,0.373,0.381,0.409,0.404,0.419,0.416,0.416,0.411,0.404,0.449,0.404,Stability,CUE1_YEAST,Medium,Eukaryote
+0.426,0.5,0.5,0.465,0.506,0.503,0.049,0.378,0.538,0.543,0.353,0.048,0.042,0.045,0.032,0.022,0.032,0.046,0.109,0.356,0.032,0.033,0.082,0.06,-0.003,0.035,0.05,0.145,0.281,0.506,0.444,0.461,0.024,0.078,0.094,0.133,0.418,0.417,0.415,0.505,0.504,0.493,-0.002,0.01,0.001,0.02,0.209,0.228,0.389,0.292,0.386,0.386,0.384,0.392,0.381,0.392,0.383,0.376,0.377,0.385,0.235,0.147,Activity,D7PM05_CLYGR,Low,Eukaryote
+0.611,0.49,0.509,0.487,0.498,0.51,0.652,0.557,0.444,0.448,0.435,0.459,0.497,0.671,0.688,0.646,0.473,0.386,0.368,0.536,0.436,0.478,0.465,0.437,0.487,0.482,0.453,0.451,0.38,0.508,0.545,0.542,0.394,0.437,0.549,0.481,0.533,0.623,0.576,0.532,0.555,0.529,0.502,0.162,0.314,0.443,0.548,0.35,0.561,0.251,0.412,0.3,0.458,0.415,0.434,0.405,0.436,0.415,0.424,0.421,0.417,0.662,OrganismalFitness,DLG4_HUMAN,Low,Human
+0.407,0.404,0.398,0.423,0.435,0.447,0.338,0.359,0.383,0.401,0.371,0.453,0.462,0.334,0.438,0.447,0.432,0.398,0.347,0.35,0.298,0.31,0.31,0.292,0.304,0.383,0.341,0.301,0.334,0.371,0.426,0.417,0.207,0.286,0.304,0.252,0.389,0.392,0.371,0.438,0.444,0.441,0.404,0.049,0.328,0.407,0.334,0.258,0.371,0.119,0.341,0.31,0.347,0.331,0.353,0.338,0.307,0.347,0.359,0.362,0.404,0.42,Binding,DLG4_RAT,Low,Eukaryote
+0.161,0.117,0.188,0.173,0.185,0.208,-0.093,0.268,0.252,0.248,-0.014,0.038,0.034,0.006,0.093,0.089,0.216,0.212,0.256,0.177,-0.069,-0.002,-0.01,0.034,0.03,0.058,-0.006,0.03,0.121,0.204,0.3,0.272,0.01,-0.022,-0.026,-0.014,0.157,0.157,0.161,0.2,0.181,0.173,0.034,-0.077,0.133,0.042,0.45,0.438,0.522,0.423,0.276,0.232,0.343,0.331,0.308,0.315,0.288,0.268,0.272,0.308,0.387,0.335,Stability,DN7A_SACS2,Medium,Prokaryote
+0.724,0.742,0.684,0.693,0.724,0.726,0.633,0.689,0.716,0.735,0.723,0.716,0.749,0.765,0.777,0.779,0.758,0.71,0.763,0.703,0.597,0.648,0.68,0.652,0.684,0.661,0.689,0.588,0.707,0.712,0.67,0.648,0.581,0.7,0.673,0.74,0.76,0.76,0.784,0.728,0.737,0.754,0.532,0.069,0.309,0.551,0.502,0.401,0.714,0.726,0.721,0.742,0.751,0.739,0.758,0.747,0.777,0.731,0.762,0.754,0.712,0.733,Stability,DNJA1_HUMAN,High,Human
+0.321,0.337,0.321,0.327,0.36,0.359,0.046,0.317,0.268,0.28,0.309,0.332,0.354,0.184,0.337,0.359,0.35,0.272,0.335,0.342,0.241,0.298,0.262,0.272,0.247,0.268,0.214,0.21,0.208,0.319,0.261,0.274,0.153,0.214,0.249,0.227,0.372,0.386,0.352,0.35,0.37,0.354,0.224,-0.155,0.321,0.265,0.231,0.234,0.286,0.297,0.287,0.258,0.265,0.273,0.268,0.279,0.283,0.276,0.278,0.265,0.365,0.357,Stability,DOCK1_MOUSE,High,Eukaryote
+0.314,0.408,0.409,0.403,0.419,0.419,-0.024,0.324,0.399,0.414,0.412,0.373,0.389,0.052,0.347,0.392,0.412,0.428,0.426,0.408,0.272,0.347,0.258,0.308,0.357,0.402,0.404,0.403,0.368,0.408,0.406,0.396,0.162,0.355,0.297,0.312,0.383,0.372,0.382,0.418,0.422,0.416,0.34,-0.01,0.396,0.367,0.193,0.359,0.28,0.081,0.372,0.353,0.349,0.368,0.372,0.375,0.382,0.368,0.388,0.382,0.406,0.294,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.314,0.408,0.409,0.403,0.419,0.419,-0.024,0.324,0.399,0.414,0.412,0.373,0.389,0.052,0.347,0.392,0.412,0.428,0.426,0.408,0.272,0.347,0.258,0.308,0.357,0.402,0.404,0.403,0.368,0.408,0.406,0.396,0.162,0.355,0.297,0.312,0.383,0.372,0.382,0.418,0.422,0.416,0.34,-0.01,0.396,0.367,0.193,0.359,0.28,0.081,0.372,0.353,0.349,0.368,0.372,0.375,0.382,0.368,0.388,0.382,0.406,0.294,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.314,0.408,0.409,0.403,0.419,0.419,-0.024,0.324,0.399,0.414,0.412,0.373,0.389,0.052,0.347,0.392,0.412,0.428,0.426,0.408,0.272,0.347,0.258,0.308,0.357,0.402,0.404,0.403,0.368,0.408,0.406,0.396,0.162,0.355,0.297,0.312,0.383,0.372,0.382,0.418,0.422,0.416,0.34,-0.01,0.396,0.367,0.193,0.359,0.28,0.081,0.372,0.353,0.349,0.368,0.372,0.375,0.382,0.368,0.388,0.382,0.406,0.294,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.314,0.408,0.409,0.403,0.419,0.419,-0.024,0.324,0.399,0.414,0.412,0.373,0.389,0.052,0.347,0.392,0.412,0.428,0.426,0.408,0.272,0.347,0.258,0.308,0.357,0.402,0.404,0.403,0.368,0.408,0.406,0.396,0.162,0.355,0.297,0.312,0.383,0.372,0.382,0.418,0.422,0.416,0.34,-0.01,0.396,0.367,0.193,0.359,0.28,0.081,0.372,0.353,0.349,0.368,0.372,0.375,0.382,0.368,0.388,0.382,0.406,0.294,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.138,0.131,0.123,0.142,0.187,0.187,0.001,0.157,0.135,0.127,0.131,0.183,0.194,0.146,0.164,0.157,0.161,0.142,0.12,0.153,0.138,0.149,0.168,0.176,0.135,0.153,0.202,0.19,0.19,0.164,0.157,0.135,0.064,0.123,0.153,0.194,0.176,0.179,0.198,0.183,0.202,0.202,0.127,0.068,0.164,0.168,0.049,0.146,0.101,0.042,0.164,0.172,0.131,0.161,0.138,0.164,0.157,0.153,0.176,0.164,0.112,0.183,Activity,ENVZ_ECOLI,High,Prokaryote
+0.298,0.321,0.286,0.298,0.357,0.321,0.004,0.263,0.286,0.286,0.31,0.38,0.357,0.039,0.075,0.051,-0.008,0.039,0.004,0.298,0.357,0.321,0.321,0.321,0.298,0.357,0.333,0.286,0.321,0.38,0.368,0.353,0.251,0.345,0.31,0.31,0.392,0.345,0.345,0.345,0.333,0.333,0.274,0.004,0.31,0.333,0.333,0.286,0.274,0.216,0.18,0.274,0.298,0.204,0.157,0.216,0.216,0.204,0.157,0.216,0.122,0.086,OrganismalFitness,ENV_HV1B9,Medium,Virus
+0.251,0.241,0.235,0.237,0.251,0.256,-0.003,0.237,0.264,0.267,0.23,0.241,0.26,-0.006,-0.004,-0.001,0.027,0.043,0.107,0.251,0.271,0.269,0.275,0.279,0.256,0.272,0.268,0.263,0.271,0.277,0.248,0.223,0.158,0.26,0.27,0.27,0.277,0.279,0.274,0.274,0.277,0.27,0.163,-0.011,0.24,0.23,0.143,0.162,0.069,0.049,0.124,0.115,0.133,0.156,0.129,0.136,0.14,0.134,0.133,0.142,0.108,0.058,OrganismalFitness,ENV_HV1BR,Medium,Virus
+0.612,0.608,0.631,0.637,0.682,0.686,-0.261,0.655,0.749,0.753,0.755,0.745,0.773,-0.3,0.725,0.788,0.804,0.735,0.708,0.688,0.476,0.547,0.659,0.616,0.606,0.643,0.637,0.684,0.686,0.798,0.737,0.725,0.604,0.573,0.541,0.631,0.651,0.661,0.694,0.682,0.68,0.667,0.569,-0.253,0.553,0.541,0.492,0.537,0.741,0.657,0.794,0.778,0.788,0.776,0.8,0.796,0.802,0.794,0.786,0.802,0.751,0.745,Stability,EPHB2_HUMAN,High,Human
+0.351,0.301,0.191,0.203,0.203,0.24,0.338,0.338,0.326,0.351,0.363,0.338,0.4,0.351,0.351,0.388,0.326,0.351,0.289,0.043,0.351,0.326,0.375,0.351,0.363,0.351,0.449,0.437,0.388,0.375,0.314,0.05,0.4,0.363,0.338,0.351,0.4,0.4,0.338,0.363,0.363,0.351,0.388,0.363,0.412,0.4,0.314,0.363,-0.105,0.043,0.351,0.301,0.363,0.289,0.301,0.314,0.338,0.314,0.338,0.351,0.412,0.4,Expression,ERBB2_HUMAN,Low,Human
+0.216,0.286,0.307,0.314,0.29,0.288,0.13,0.237,0.248,0.305,0.237,0.218,0.248,0.113,0.18,0.2,0.224,0.235,0.27,0.231,0.054,0.137,0.209,0.227,0.181,0.205,0.227,0.205,0.301,0.279,0.355,0.298,0.064,0.113,0.191,0.202,0.238,0.244,0.238,0.312,0.301,0.31,0.126,0.1,0.207,0.185,0.425,0.367,0.426,0.272,0.272,0.259,0.253,0.242,0.262,0.248,0.25,0.273,0.272,0.266,0.318,0.152,Stability,ESTA_BACSU,High,Prokaryote
+0.047,0.227,0.239,0.258,0.257,0.258,0.013,0.183,0.232,0.241,0.25,0.228,0.237,-0.043,0.009,0.049,0.248,0.237,0.264,0.225,-0.029,-0.07,-0.066,-0.014,0.076,0.071,-0.009,0.19,0.238,0.211,0.27,0.263,0.04,-0.006,-0.07,0.244,0.046,0.011,0.252,0.237,0.227,0.264,-0.006,-0.023,0.134,-0.008,-0.007,0.159,0.037,0.023,0.268,0.264,0.248,0.273,0.275,0.271,0.258,0.265,0.266,0.269,0.181,-0.079,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.047,0.227,0.239,0.258,0.257,0.258,0.013,0.183,0.232,0.241,0.25,0.228,0.237,-0.043,0.009,0.049,0.248,0.237,0.264,0.225,-0.029,-0.07,-0.066,-0.014,0.076,0.071,-0.009,0.19,0.238,0.211,0.27,0.263,0.04,-0.006,-0.07,0.244,0.046,0.011,0.252,0.237,0.227,0.264,-0.006,-0.023,0.134,-0.008,-0.007,0.159,0.037,0.023,0.268,0.264,0.248,0.273,0.275,0.271,0.258,0.265,0.266,0.269,0.181,-0.079,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.351,0.359,0.357,0.366,0.374,0.364,0.037,0.245,0.292,0.353,0.396,0.198,0.326,0.039,0.34,0.451,0.425,0.376,0.29,0.353,0.12,0.226,0.264,0.247,0.101,0.391,0.41,0.209,0.315,0.381,0.366,0.345,0.268,0.06,0.086,0.09,0.302,0.304,0.222,0.326,0.349,0.306,0.29,-0.022,0.362,0.343,0.436,0.391,0.485,0.419,0.44,0.374,0.423,0.4,0.393,0.432,0.425,0.391,0.41,0.419,0.476,0.483,Stability,FECA_ECOLI,High,Prokaryote
+0.3,0.271,0.374,0.378,0.378,0.387,0.135,0.248,0.18,0.177,0.196,0.164,0.164,0.171,0.148,0.154,0.151,0.216,0.277,0.216,0.209,0.203,0.196,0.167,0.141,0.158,0.18,0.177,0.184,0.374,0.235,0.263,-0.033,0.109,0.167,0.196,0.274,0.284,0.251,0.361,0.371,0.348,0.154,0.164,0.174,0.2,0.555,0.491,0.53,0.468,0.232,0.31,0.329,0.284,0.281,0.303,0.274,0.258,0.232,0.297,0.445,0.258,Stability,FKBP3_HUMAN,Medium,Human
+0.193,0.295,0.31,0.375,0.342,0.346,0.231,-0.007,0.342,0.426,0.426,0.335,0.353,0.255,0.27,0.382,0.466,0.451,0.441,0.346,0.204,0.241,0.284,0.284,0.237,0.291,0.375,0.331,0.397,0.451,0.451,0.399,0.186,0.171,0.201,0.197,0.342,0.351,0.375,0.331,0.313,0.321,0.288,0.193,0.419,0.339,0.186,0.368,0.259,0.102,0.404,0.415,0.39,0.397,0.426,0.386,0.397,0.393,0.408,0.419,0.422,0.306,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+0.192,0.175,0.178,0.184,0.16,0.158,0.119,0.157,0.18,0.175,0.169,0.212,0.216,0.246,0.215,0.201,0.215,0.187,0.169,0.166,0.113,0.136,0.107,0.105,0.096,0.089,0.061,0.095,0.122,0.14,0.183,0.186,0.033,0.114,0.116,0.231,0.195,0.199,0.228,0.193,0.19,0.215,0.037,0.084,0.087,0.046,0.105,0.125,0.172,0.16,0.154,0.145,0.143,0.139,0.151,0.149,0.143,0.149,0.158,0.148,0.171,0.183,Binding,GCN4_YEAST,Low,Eukaryote
+0.338,0.341,0.366,0.348,0.338,0.338,0.102,0.279,0.373,0.369,0.293,0.307,0.328,0.113,0.161,0.293,0.289,0.258,0.317,0.289,0.182,0.279,0.289,0.307,0.213,0.341,0.31,0.258,0.279,0.31,0.348,0.314,0.185,0.217,0.248,0.255,0.317,0.307,0.331,0.338,0.355,0.359,0.175,0.088,0.317,0.172,0.217,0.289,0.248,0.095,0.272,0.248,0.293,0.272,0.276,0.289,0.279,0.272,0.262,0.314,0.334,0.213,OrganismalFitness,GDIA_HUMAN,Low,Human
+0.584,0.582,0.613,0.615,0.618,0.618,0.061,0.583,0.604,0.596,0.473,0.097,0.097,0.081,0.119,0.09,0.106,0.14,0.255,0.537,0.087,0.11,0.167,0.089,0.048,0.166,0.261,0.584,0.584,0.619,0.546,0.555,0.058,0.069,0.163,0.573,0.581,0.583,0.618,0.624,0.627,0.65,0.023,-0.006,0.036,0.036,0.456,0.445,0.67,0.535,0.529,0.537,0.546,0.554,0.546,0.545,0.542,0.543,0.525,0.541,0.562,0.402,Activity,GFP_AEQVI,Low,Eukaryote
+0.114,-0.045,0.102,0.135,0.102,0.07,0.331,0.413,0.282,0.25,0.446,0.446,0.446,0.315,0.446,0.462,0.495,0.381,0.43,0.152,0.381,0.397,0.43,0.413,0.348,0.413,0.462,0.43,0.479,0.209,0.364,0.136,0.168,0.43,0.364,0.43,0.397,0.381,0.397,0.299,0.25,0.364,0.413,0.397,0.462,0.397,0.331,0.495,0.266,0.168,0.413,0.381,0.495,0.43,0.446,0.397,0.462,0.397,0.413,0.479,0.413,0.413,Expression,GLPA_HUMAN,Low,Human
+0.314,0.392,0.384,0.411,0.404,0.406,0.403,0.338,0.405,0.365,0.39,0.337,0.378,0.43,0.464,0.474,0.496,0.396,0.429,0.407,0.419,0.396,0.4,0.344,0.419,0.397,0.341,0.409,0.342,0.376,0.308,0.308,0.361,0.418,0.385,0.31,0.39,0.381,0.331,0.421,0.421,0.397,0.451,0.235,0.399,0.435,0.54,0.401,0.548,0.396,0.484,0.518,0.507,0.505,0.511,0.494,0.496,0.512,0.491,0.514,0.421,0.468,OrganismalFitness,GRB2_HUMAN,Medium,Human
+0.238,0.312,0.092,0.123,0.269,0.281,0.188,0.288,0.346,0.342,0.458,0.35,0.377,0.238,0.315,0.419,0.473,0.412,0.431,0.192,0.212,0.219,0.273,0.288,0.254,0.273,0.162,0.265,0.392,0.408,0.477,0.36,0.115,0.227,0.223,0.369,0.312,0.315,0.373,0.25,0.323,0.377,0.223,0.235,0.458,0.308,0.492,0.527,0.573,0.408,0.519,0.531,0.523,0.523,0.535,0.531,0.504,0.535,0.523,0.535,0.577,0.45,Stability,HCP_LAMBD,Medium,Virus
+0.297,0.324,0.238,0.244,0.344,0.353,0.172,0.294,0.286,0.284,0.253,0.209,0.231,0.169,0.175,0.37,0.326,0.277,0.272,0.303,0.201,0.261,0.333,0.312,0.252,0.32,0.307,0.288,0.302,0.331,0.305,0.308,0.132,0.186,0.215,0.24,0.368,0.345,0.372,0.351,0.335,0.356,0.105,0.079,0.354,0.141,0.222,0.336,0.331,0.266,0.272,0.32,0.351,0.332,0.335,0.372,0.327,0.342,0.329,0.348,0.34,0.29,Stability,HECD1_HUMAN,Medium,Human
+0.298,0.313,0.295,0.304,0.313,0.311,0.1,0.096,0.32,0.33,0.311,0.283,0.306,0.105,0.273,0.299,0.31,0.321,0.337,0.115,0.239,0.292,0.307,0.308,0.254,0.297,0.295,0.305,0.321,0.318,0.313,0.312,0.061,0.277,0.291,0.313,0.306,0.328,0.347,0.307,0.325,0.344,0.22,0.106,0.292,0.26,0.228,0.28,0.224,0.115,0.274,0.268,0.271,0.288,0.294,0.285,0.297,0.292,0.293,0.301,0.339,0.303,Activity,HEM3_HUMAN,Medium,Human
+0.355,0.41,0.44,0.433,0.409,0.41,0.114,0.218,0.343,0.377,0.354,0.333,0.377,0.038,0.089,0.347,0.282,0.341,0.367,0.342,0.237,0.303,0.337,0.346,0.294,0.375,0.339,0.378,0.394,0.419,0.307,0.301,0.115,0.296,0.325,0.475,0.376,0.377,0.511,0.444,0.424,0.463,0.127,0.013,0.212,0.002,0.322,0.342,0.437,0.3,0.342,0.249,0.295,0.326,0.331,0.308,0.306,0.338,0.319,0.317,0.399,0.136,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+0.368,0.321,0.289,0.297,0.323,0.319,0.116,0.171,0.206,0.198,0.209,0.242,0.225,0.032,0.014,0.231,0.399,0.278,0.283,0.222,0.346,0.092,0.113,0.154,0.374,0.126,0.113,0.143,0.147,0.354,0.3,0.301,0.173,0.241,0.183,0.154,0.361,0.297,0.285,0.342,0.296,0.294,0.114,0.041,0.362,0.389,0.229,0.349,-0.175,0.074,0.378,0.376,0.367,0.378,0.379,0.373,0.37,0.382,0.374,0.382,0.42,0.327,OrganismalFitness,HMDH_HUMAN,Low,Human
+0.315,0.337,0.345,0.352,0.334,0.332,0.189,0.301,0.356,0.358,0.323,0.355,0.374,0.065,0.148,0.205,0.262,0.303,0.306,0.249,0.322,0.331,0.328,0.348,0.321,0.337,0.357,0.343,0.361,0.349,0.374,0.351,0.183,0.314,0.328,0.325,0.338,0.348,0.346,0.343,0.35,0.348,0.153,-0.039,0.339,0.317,0.105,0.256,0.098,0.02,0.251,0.259,0.253,0.272,0.265,0.262,0.268,0.261,0.252,0.268,0.335,0.282,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.315,0.337,0.345,0.352,0.334,0.332,0.189,0.301,0.356,0.358,0.323,0.355,0.374,0.065,0.148,0.205,0.262,0.303,0.306,0.249,0.322,0.331,0.328,0.348,0.321,0.337,0.357,0.343,0.361,0.349,0.374,0.351,0.183,0.314,0.328,0.325,0.338,0.348,0.346,0.343,0.35,0.348,0.153,-0.039,0.339,0.317,0.105,0.256,0.098,0.02,0.251,0.259,0.253,0.272,0.265,0.262,0.268,0.261,0.252,0.268,0.335,0.282,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.315,0.337,0.345,0.352,0.334,0.332,0.189,0.301,0.356,0.358,0.323,0.355,0.374,0.065,0.148,0.205,0.262,0.303,0.306,0.249,0.322,0.331,0.328,0.348,0.321,0.337,0.357,0.343,0.361,0.349,0.374,0.351,0.183,0.314,0.328,0.325,0.338,0.348,0.346,0.343,0.35,0.348,0.153,-0.039,0.339,0.317,0.105,0.256,0.098,0.02,0.251,0.259,0.253,0.272,0.265,0.262,0.268,0.261,0.252,0.268,0.335,0.282,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.336,0.368,0.347,0.358,0.349,0.356,0.111,0.296,0.386,0.401,0.335,0.336,0.356,0.113,0.174,0.351,0.37,0.376,0.358,0.19,0.309,0.308,0.304,0.298,0.304,0.316,0.32,0.32,0.3,0.368,0.352,0.319,0.145,0.32,0.298,0.272,0.356,0.344,0.33,0.37,0.369,0.365,0.18,0.05,0.36,0.344,0.306,0.351,0.326,0.116,0.376,0.37,0.37,0.373,0.374,0.382,0.37,0.372,0.366,0.378,0.404,0.33,OrganismalFitness,HXK4_HUMAN,Medium,Human
+0.336,0.368,0.347,0.358,0.349,0.356,0.111,0.296,0.386,0.401,0.335,0.336,0.356,0.113,0.174,0.351,0.37,0.376,0.358,0.19,0.309,0.308,0.304,0.298,0.304,0.316,0.32,0.32,0.3,0.368,0.352,0.319,0.145,0.32,0.298,0.272,0.356,0.344,0.33,0.37,0.369,0.365,0.18,0.05,0.36,0.344,0.306,0.351,0.326,0.116,0.376,0.37,0.37,0.373,0.374,0.382,0.37,0.372,0.366,0.378,0.404,0.33,Expression,HXK4_HUMAN,Medium,Human
+0.287,0.24,0.213,0.208,0.274,0.277,-0.008,0.234,0.217,0.239,0.008,0.015,0.008,0.026,0.02,0.012,0.015,0.006,0.054,0.151,0.24,0.245,0.286,0.283,0.013,0.019,0.085,0.003,0.23,0.284,0.179,0.183,0.111,0.211,0.242,0.247,0.225,0.252,0.256,0.28,0.306,0.306,0.016,0.015,0.016,0.017,0.166,0.149,0.182,0.078,0.048,0.075,0.101,0.097,0.079,0.072,0.059,0.067,0.051,0.072,0.024,0.007,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+0.262,0.37,0.443,0.456,0.418,0.437,0.086,0.277,0.195,0.163,0.405,0.43,0.44,0.08,0.306,0.414,0.446,0.446,0.424,0.418,0.242,0.328,0.223,0.297,0.341,0.354,0.344,0.306,0.354,0.313,0.386,0.335,0.214,0.354,0.373,0.398,0.367,0.363,0.36,0.418,0.421,0.437,0.252,0.093,0.449,0.373,0.23,0.398,0.348,0.172,0.408,0.408,0.408,0.43,0.389,0.408,0.389,0.411,0.414,0.421,0.456,0.379,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+0.295,0.387,0.425,0.418,0.415,0.422,0.176,0.271,0.38,0.404,0.313,0.331,0.352,0.264,0.366,0.422,0.26,0.246,0.18,0.432,0.127,0.187,0.274,0.302,0.317,0.222,0.324,0.243,0.267,0.411,0.352,0.316,0.025,0.173,0.232,0.313,0.302,0.345,0.359,0.39,0.387,0.415,0.299,0.176,0.306,0.38,0.415,0.366,0.432,0.331,0.267,0.229,0.288,0.25,0.278,0.331,0.292,0.25,0.341,0.324,0.292,0.362,Stability,ILF3_HUMAN,High,Human
+0.065,0.108,0.139,0.147,0.151,0.176,0.25,0.124,0.281,0.266,0.378,0.326,0.326,0.271,0.291,0.336,0.324,0.371,0.326,0.131,0.219,0.145,0.106,0.166,0.221,0.242,0.221,0.211,0.242,0.275,0.334,0.315,0.17,0.192,0.174,0.184,0.155,0.139,0.137,0.194,0.157,0.17,0.199,0.178,0.252,0.24,0.427,0.353,0.419,0.4,0.32,0.324,0.345,0.332,0.33,0.345,0.316,0.324,0.332,0.334,0.437,0.359,Stability,ISDH_STAAW,High,Prokaryote
+0.238,0.266,0.276,0.285,0.271,0.281,0.145,0.248,0.268,0.28,0.22,0.265,0.302,0.154,0.13,0.336,0.286,0.248,0.216,0.178,0.174,0.192,0.245,0.256,0.189,0.296,0.33,0.312,0.292,0.259,0.318,0.254,-0.04,0.146,0.23,0.327,0.238,0.252,0.334,0.283,0.291,0.347,0.163,0.142,0.295,0.158,0.158,0.256,0.158,0.046,0.28,0.283,0.276,0.274,0.282,0.266,0.291,0.276,0.292,0.292,0.356,0.16,Expression,KCNE1_HUMAN,Medium,Human
+0.238,0.266,0.276,0.285,0.271,0.281,0.145,0.248,0.268,0.28,0.22,0.265,0.302,0.154,0.13,0.336,0.286,0.248,0.216,0.178,0.174,0.192,0.245,0.256,0.189,0.296,0.33,0.312,0.292,0.259,0.318,0.254,-0.04,0.146,0.23,0.327,0.238,0.252,0.334,0.283,0.291,0.347,0.163,0.142,0.295,0.158,0.158,0.256,0.158,0.046,0.28,0.283,0.276,0.274,0.282,0.266,0.291,0.276,0.292,0.292,0.356,0.16,Activity,KCNE1_HUMAN,Medium,Human
+0.36,0.4,0.18,0.18,0.18,0.16,0.36,0.04,0.26,0.26,0.2,0.18,0.16,0.18,0.12,0.16,0.16,0.16,0.12,0.36,0.36,0.36,0.34,0.28,0.38,0.32,0.32,0.28,0.28,0.281,0.3,0.14,0.28,0.32,0.36,0.36,0.36,0.4,0.4,0.3,0.4,0.36,0.32,0.14,0.16,0.22,0.12,0.0,-0.12,0.06,0.4,0.3,0.4,0.34,0.3,0.34,0.38,0.34,0.4,0.36,0.18,0.36,Activity,KCNH2_HUMAN,Medium,Human
+0.238,0.249,0.213,0.23,0.25,0.252,0.061,0.163,0.221,0.217,0.256,0.237,0.248,0.073,0.223,0.266,0.272,0.274,0.284,0.142,0.213,0.19,0.174,0.178,0.219,0.224,0.206,0.225,0.174,0.267,0.246,0.221,0.116,0.202,0.204,0.164,0.242,0.245,0.226,0.251,0.258,0.247,0.16,0.017,0.22,0.246,0.161,0.192,0.176,0.078,0.268,0.251,0.261,0.25,0.258,0.244,0.26,0.255,0.256,0.262,0.258,0.218,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+0.238,0.249,0.213,0.23,0.25,0.252,0.061,0.163,0.221,0.217,0.256,0.237,0.248,0.073,0.223,0.266,0.272,0.274,0.284,0.142,0.213,0.19,0.174,0.178,0.219,0.224,0.206,0.225,0.174,0.267,0.246,0.221,0.116,0.202,0.204,0.164,0.242,0.245,0.226,0.251,0.258,0.247,0.16,0.017,0.22,0.246,0.161,0.192,0.176,0.078,0.268,0.251,0.261,0.25,0.258,0.244,0.26,0.255,0.256,0.262,0.258,0.218,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+0.221,0.452,0.366,0.516,0.499,0.508,0.148,0.38,0.449,0.509,0.471,0.488,0.512,0.165,0.199,0.404,0.486,0.531,0.556,0.394,0.213,0.331,0.418,0.467,0.268,0.488,0.486,0.473,0.526,0.523,0.548,0.499,0.113,0.184,0.407,0.495,0.346,0.429,0.487,0.495,0.511,0.526,0.191,0.069,0.467,0.324,0.362,0.486,0.439,0.185,0.479,0.475,0.478,0.478,0.476,0.482,0.476,0.482,0.488,0.491,0.53,0.239,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+0.233,0.292,0.34,0.345,0.332,0.344,0.089,0.352,0.348,0.414,0.336,0.394,0.424,0.125,0.241,0.3,0.391,0.433,0.42,0.381,0.228,0.316,0.366,0.378,0.292,0.39,0.402,0.367,0.409,0.39,0.406,0.338,0.114,0.253,0.312,0.422,0.28,0.31,0.389,0.339,0.357,0.393,0.128,0.047,0.368,0.278,0.29,0.371,0.358,0.106,0.387,0.366,0.383,0.374,0.375,0.395,0.393,0.392,0.395,0.397,0.301,0.251,Activity,LGK_LIPST,Medium,Eukaryote
+0.26,0.26,0.148,0.159,0.215,0.226,0.26,0.137,0.304,0.327,0.315,0.26,0.293,0.193,0.237,0.193,0.226,0.26,0.349,0.193,0.237,0.26,0.304,0.282,0.215,0.304,0.271,0.182,0.182,0.289,0.204,0.144,0.137,0.282,0.315,0.215,0.282,0.304,0.282,0.204,0.249,0.204,0.215,0.215,0.237,0.249,0.103,0.148,0.115,0.025,0.182,0.249,0.26,0.226,0.193,0.204,0.237,0.271,0.193,0.249,0.293,0.226,Expression,LYAM1_HUMAN,Medium,Human
+0.422,0.536,0.481,0.481,0.498,0.486,0.327,0.649,0.581,0.563,0.643,0.309,0.584,0.416,0.395,0.436,0.457,0.498,0.339,0.575,0.365,0.557,0.546,0.569,0.498,0.528,0.469,0.531,0.472,0.43,0.548,0.522,0.383,0.247,0.513,0.525,0.454,0.566,0.599,0.522,0.599,0.64,0.265,0.038,0.36,0.236,0.516,0.301,0.548,0.566,0.652,0.634,0.628,0.64,0.637,0.634,0.637,0.649,0.661,0.64,0.593,0.64,Stability,MAFG_MOUSE,Medium,Eukaryote
+0.518,0.657,0.614,0.628,0.675,0.681,-0.026,0.471,0.612,0.675,0.658,0.333,0.461,-0.066,0.104,0.692,0.612,0.605,0.677,0.65,0.04,0.1,0.544,0.592,0.221,0.476,0.444,0.512,0.667,0.692,0.662,0.615,0.558,-0.242,0.104,0.093,0.488,0.578,0.565,0.63,0.667,0.658,0.03,-0.106,0.578,0.34,0.399,0.541,0.664,0.446,0.698,0.673,0.675,0.681,0.66,0.662,0.671,0.692,0.703,0.688,0.673,0.609,Stability,MBD11_ARATH,Medium,Eukaryote
+0.453,0.528,0.52,0.524,0.554,0.558,0.489,0.49,0.517,0.527,0.55,0.503,0.517,0.414,0.471,0.494,0.555,0.579,0.572,0.563,0.497,0.482,0.435,0.461,0.496,0.503,0.499,0.452,0.423,0.507,0.528,0.494,0.26,0.463,0.467,0.489,0.479,0.499,0.527,0.524,0.531,0.543,0.503,0.303,0.535,0.544,0.328,0.388,0.476,0.173,0.509,0.521,0.513,0.522,0.526,0.533,0.521,0.534,0.531,0.532,0.544,0.49,Activity,MET_HUMAN,Medium,Human
+0.152,0.19,0.209,0.212,0.193,0.197,0.173,0.149,0.195,0.171,0.069,0.156,0.171,0.132,0.173,0.171,0.166,0.178,0.155,0.172,0.206,0.131,0.099,0.048,0.161,0.104,0.081,0.087,-0.021,0.19,0.171,0.198,0.119,0.163,0.093,0.054,0.176,0.138,0.125,0.202,0.185,0.179,0.149,0.1,0.146,0.162,0.068,0.016,0.122,0.004,0.158,0.155,0.153,0.171,0.169,0.156,0.163,0.168,0.165,0.167,0.168,0.166,OrganismalFitness,MK01_HUMAN,Medium,Human
+0.176,0.274,0.344,0.354,0.337,0.33,-0.022,0.271,0.346,0.359,0.302,0.343,0.354,-0.022,0.174,0.207,0.312,0.295,0.313,0.295,0.042,0.266,0.274,0.309,0.248,0.314,0.306,0.301,0.339,0.32,0.356,0.319,0.028,0.103,0.218,0.216,0.203,0.249,0.231,0.286,0.306,0.303,0.108,-0.05,0.244,0.174,0.027,0.154,0.112,-0.012,0.244,0.214,0.235,0.23,0.239,0.242,0.222,0.233,0.268,0.24,0.16,0.151,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+0.308,0.343,0.326,0.335,0.335,0.338,0.177,0.276,0.342,0.348,0.308,0.328,0.34,0.184,0.287,0.338,0.324,0.268,0.181,0.335,0.245,0.302,0.283,0.277,0.28,0.315,0.33,0.306,0.289,0.34,0.327,0.309,0.176,0.235,0.29,0.277,0.299,0.326,0.326,0.337,0.343,0.34,0.222,0.095,0.332,0.283,0.258,0.317,0.044,0.089,0.279,0.262,0.272,0.279,0.297,0.275,0.277,0.275,0.298,0.288,0.343,0.306,OrganismalFitness,MSH2_HUMAN,Medium,Human
+0.259,0.419,0.447,0.453,0.458,0.442,0.215,0.414,0.435,0.442,0.38,0.455,0.45,0.153,0.21,0.236,0.336,0.393,0.435,0.429,0.202,0.298,0.393,0.429,0.323,0.437,0.445,0.447,0.46,0.45,0.455,0.429,0.215,0.212,0.331,0.427,0.261,0.329,0.419,0.396,0.419,0.458,0.223,0.026,0.321,0.212,0.287,0.37,0.357,0.119,0.36,0.334,0.36,0.354,0.37,0.378,0.357,0.362,0.354,0.365,0.378,0.215,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+0.169,0.177,0.167,0.173,0.175,0.17,0.113,0.111,0.2,0.204,0.258,0.2,0.213,0.108,0.261,0.335,0.255,0.191,0.203,0.178,0.311,0.105,0.141,0.182,0.297,0.156,0.18,0.17,0.221,0.203,0.149,0.139,0.087,0.309,0.219,0.133,0.233,0.203,0.158,0.219,0.193,0.171,0.174,0.07,0.251,0.262,0.159,0.237,0.27,0.066,0.232,0.207,0.211,0.225,0.223,0.234,0.249,0.245,0.254,0.238,0.223,0.245,OrganismalFitness,MTHR_HUMAN,Low,Human
+0.156,0.169,0.255,0.261,0.262,0.298,0.104,0.207,0.33,0.321,0.394,0.376,0.39,0.136,0.323,0.442,0.44,0.393,0.206,0.244,0.249,0.18,0.244,0.213,0.244,0.176,0.251,0.19,0.222,0.189,0.225,0.13,0.114,0.309,0.216,0.221,0.287,0.26,0.246,0.302,0.298,0.283,0.1,-0.002,0.313,0.223,0.297,0.336,0.379,0.257,0.421,0.398,0.43,0.406,0.418,0.421,0.406,0.413,0.401,0.411,0.431,0.39,Stability,MYO3_YEAST,High,Eukaryote
+0.29,0.24,0.248,0.25,0.283,0.282,0.012,0.241,0.275,0.252,0.029,0.026,0.025,0.037,0.034,0.026,0.04,0.034,0.089,0.215,0.266,0.264,0.293,0.288,0.03,0.058,0.098,0.055,0.256,0.294,0.199,0.211,0.105,0.259,0.272,0.295,0.281,0.301,0.321,0.325,0.324,0.339,0.035,0.019,0.029,0.035,0.196,0.196,0.198,0.081,0.102,0.128,0.133,0.149,0.135,0.129,0.116,0.134,0.107,0.13,0.096,0.066,OrganismalFitness,NCAP_I34A1,Medium,Virus
+0.272,0.343,0.463,0.459,0.511,0.47,0.399,0.482,0.5,0.509,0.534,0.532,0.535,0.497,0.417,0.472,0.458,0.422,0.467,0.444,0.479,0.459,0.477,0.475,0.436,0.435,0.47,0.479,0.507,0.498,0.435,0.42,0.481,0.431,0.438,0.486,0.442,0.447,0.498,0.489,0.505,0.512,0.33,0.24,0.258,0.306,0.431,0.256,0.497,0.511,0.523,0.53,0.527,0.535,0.535,0.509,0.525,0.518,0.523,0.528,0.525,0.498,Stability,NKX31_HUMAN,High,Human
+0.588,0.484,0.28,0.328,0.488,0.512,0.226,0.346,0.472,0.498,0.516,0.145,0.256,0.115,0.256,0.509,0.516,0.488,0.488,0.13,0.36,0.346,0.349,0.224,0.332,0.368,0.372,0.282,0.286,0.452,0.608,0.447,0.264,0.213,0.244,0.374,0.538,0.449,0.535,0.563,0.472,0.519,0.284,0.124,0.605,0.315,0.324,0.465,0.085,0.13,0.484,0.512,0.516,0.491,0.519,0.506,0.478,0.484,0.488,0.516,0.56,0.31,Activity,NPC1_HUMAN,Low,Human
+0.588,0.484,0.28,0.328,0.488,0.512,0.226,0.346,0.472,0.498,0.516,0.145,0.256,0.115,0.256,0.509,0.516,0.488,0.488,0.13,0.36,0.346,0.349,0.224,0.332,0.368,0.372,0.282,0.286,0.452,0.608,0.447,0.264,0.213,0.244,0.374,0.538,0.449,0.535,0.563,0.472,0.519,0.284,0.124,0.605,0.315,0.324,0.465,0.085,0.13,0.484,0.512,0.516,0.491,0.519,0.506,0.478,0.484,0.488,0.516,0.56,0.31,Activity,NPC1_HUMAN,Low,Human
+0.477,0.45,0.356,0.329,0.45,0.463,0.06,0.289,0.477,0.477,-0.02,0.114,0.369,-0.06,-0.034,0.06,0.154,0.396,0.45,0.302,0.342,0.477,0.396,0.409,0.06,0.409,0.45,0.329,0.423,0.53,0.289,0.342,0.007,0.356,0.383,0.396,0.436,0.503,0.544,0.503,0.544,0.477,-0.034,-0.034,-0.06,-0.074,0.356,0.289,0.369,0.154,0.248,0.195,0.181,0.221,0.154,0.235,0.221,0.208,0.154,0.168,0.221,0.128,OrganismalFitness,NRAM_I33A0,Low,Virus
+0.218,0.386,0.423,0.454,0.443,0.446,0.01,0.317,0.469,0.528,0.478,0.472,0.514,0.206,0.32,0.35,0.405,0.472,0.475,0.393,0.221,0.356,0.44,0.42,0.308,0.49,0.471,0.454,0.408,0.417,0.502,0.449,0.148,0.254,0.326,0.454,0.287,0.314,0.456,0.435,0.434,0.472,0.3,0.021,0.481,0.327,0.351,0.451,0.398,0.23,0.417,0.399,0.431,0.453,0.435,0.454,0.438,0.459,0.454,0.447,0.549,0.453,Expression,NUD15_HUMAN,High,Human
+0.352,0.417,0.455,0.449,0.486,0.484,0.204,0.324,0.512,0.453,0.411,0.23,0.24,0.232,0.291,0.287,0.356,0.403,0.388,0.423,0.246,0.427,0.464,0.461,0.263,0.386,0.39,0.386,0.441,0.486,0.526,0.618,0.129,0.244,0.322,0.307,0.364,0.372,0.386,0.496,0.464,0.486,0.257,0.179,0.267,0.248,0.528,0.49,0.591,0.512,0.61,0.597,0.61,0.628,0.644,0.654,0.634,0.616,0.64,0.642,0.536,0.455,Stability,NUSA_ECOLI,High,Prokaryote
+0.353,0.326,0.338,0.341,0.369,0.356,0.225,0.372,0.36,0.347,0.356,0.298,0.32,0.289,0.396,0.35,0.375,0.399,0.402,0.375,0.234,0.289,0.286,0.304,0.344,0.28,0.283,0.243,0.32,0.427,0.341,0.346,0.2,0.216,0.234,0.344,0.335,0.326,0.372,0.338,0.347,0.356,0.237,0.115,0.216,0.311,0.439,0.39,0.571,0.531,0.405,0.363,0.35,0.399,0.412,0.396,0.399,0.415,0.412,0.399,0.338,0.399,Stability,NUSG_MYCTU,High,Prokaryote
+0.342,0.402,0.508,0.521,0.505,0.512,0.199,0.396,0.523,0.526,0.533,0.482,0.498,0.198,0.515,0.541,0.545,0.522,0.533,0.511,0.526,0.519,0.518,0.526,0.425,0.503,0.499,0.491,0.472,0.486,0.531,0.514,0.316,0.133,0.223,0.318,0.465,0.441,0.439,0.515,0.511,0.511,0.268,-0.044,0.261,0.319,0.406,0.349,0.508,0.513,0.571,0.562,0.565,0.562,0.569,0.568,0.559,0.555,0.575,0.566,0.535,0.493,Stability,OBSCN_HUMAN,High,Human
+-0.298,-0.201,0.189,0.083,0.189,0.185,-0.171,0.03,0.083,0.083,0.002,-0.108,-0.097,-0.079,-0.139,0.026,0.097,0.034,0.083,0.129,0.108,0.051,0.041,0.069,0.044,0.069,0.079,0.09,0.108,0.189,-0.034,-0.063,0.146,-0.108,0.009,-0.005,0.097,0.097,0.101,0.161,0.168,0.189,0.15,-0.079,0.157,0.168,0.245,0.228,0.256,0.185,0.026,-0.005,0.012,0.041,-0.002,0.041,0.037,0.044,0.019,0.026,0.15,0.076,Stability,ODP2_GEOSE,High,Prokaryote
+0.139,0.382,0.358,0.503,0.406,0.382,0.309,0.43,0.43,0.43,0.212,0.406,0.43,0.188,0.479,0.43,0.406,0.358,0.43,0.503,0.479,0.455,0.455,0.455,0.503,0.455,0.503,0.455,0.43,0.333,0.382,0.261,0.188,0.406,0.43,0.382,0.455,0.43,0.406,0.43,0.358,0.382,0.261,0.042,0.406,0.406,0.624,0.406,0.576,0.212,0.333,0.309,0.309,0.261,0.43,0.406,0.236,0.309,0.382,0.358,0.503,0.406,Expression,OPSD_HUMAN,High,Human
+0.404,0.432,0.376,0.414,0.404,0.419,0.062,0.34,0.455,0.442,0.424,0.424,0.452,0.057,0.32,0.394,0.417,0.376,0.424,0.424,0.355,0.33,0.396,0.427,0.386,0.445,0.417,0.427,0.439,0.447,0.366,0.312,0.055,0.327,0.389,0.429,0.396,0.432,0.48,0.434,0.452,0.462,0.149,0.045,0.419,0.345,0.511,0.468,0.516,0.266,0.434,0.422,0.434,0.439,0.445,0.437,0.437,0.427,0.419,0.452,0.473,0.353,Activity,OTC_HUMAN,Medium,Human
+0.089,0.218,0.177,0.184,0.204,0.211,0.157,0.13,0.15,0.137,0.354,0.51,0.496,0.198,0.462,0.482,0.347,0.313,0.333,0.15,0.123,0.191,0.259,0.259,0.32,0.313,0.367,0.34,0.137,0.232,0.381,0.381,0.062,0.137,0.211,0.306,0.218,0.211,0.286,0.232,0.177,0.225,0.266,0.13,0.449,0.442,0.435,0.476,0.489,0.394,0.394,0.286,0.32,0.36,0.374,0.388,0.313,0.374,0.442,0.374,0.51,0.469,Stability,OTU7A_HUMAN,High,Human
+0.108,0.145,0.15,0.154,0.136,0.138,0.106,0.114,0.138,0.14,0.176,0.164,0.156,0.102,0.13,0.148,0.164,0.177,0.178,0.126,0.1,0.126,0.144,0.143,0.126,0.152,0.165,0.151,0.174,0.144,0.191,0.166,0.038,0.112,0.118,0.136,0.136,0.137,0.146,0.147,0.144,0.143,0.13,0.075,0.164,0.13,0.168,0.176,0.144,0.048,0.172,0.16,0.16,0.164,0.161,0.168,0.162,0.165,0.158,0.167,0.186,0.138,Activity,OXDA_RHOTO,High,Eukaryote
+0.108,0.145,0.15,0.154,0.136,0.138,0.106,0.114,0.138,0.14,0.176,0.164,0.156,0.102,0.13,0.148,0.164,0.177,0.178,0.126,0.1,0.126,0.144,0.143,0.126,0.152,0.165,0.151,0.174,0.144,0.191,0.166,0.038,0.112,0.118,0.136,0.136,0.137,0.146,0.147,0.144,0.143,0.13,0.075,0.164,0.13,0.168,0.176,0.144,0.048,0.172,0.16,0.16,0.164,0.161,0.168,0.162,0.165,0.158,0.167,0.186,0.138,Expression,OXDA_RHOTO,High,Eukaryote
+0.401,0.406,0.356,0.384,0.402,0.415,-0.023,0.298,0.335,0.344,0.418,0.409,0.447,-0.066,-0.055,0.331,0.418,0.44,0.462,0.186,0.268,0.378,0.412,0.398,0.35,0.416,0.416,0.424,0.34,0.406,0.432,0.406,0.132,0.246,0.4,0.333,0.399,0.414,0.401,0.415,0.431,0.413,-0.059,-0.078,0.443,0.02,0.341,0.425,0.349,0.164,0.423,0.427,0.427,0.426,0.429,0.44,0.426,0.422,0.434,0.438,0.461,0.182,OrganismalFitness,P53_HUMAN,Low,Human
+0.401,0.406,0.356,0.384,0.402,0.415,-0.023,0.298,0.335,0.344,0.418,0.409,0.447,-0.066,-0.055,0.331,0.418,0.44,0.462,0.186,0.268,0.378,0.412,0.398,0.35,0.416,0.416,0.424,0.34,0.406,0.432,0.406,0.132,0.246,0.4,0.333,0.399,0.414,0.401,0.415,0.431,0.413,-0.059,-0.078,0.443,0.02,0.341,0.425,0.349,0.164,0.423,0.427,0.427,0.426,0.429,0.44,0.426,0.422,0.434,0.438,0.461,0.182,OrganismalFitness,P53_HUMAN,Low,Human
+0.401,0.406,0.356,0.384,0.402,0.415,-0.023,0.298,0.335,0.344,0.418,0.409,0.447,-0.066,-0.055,0.331,0.418,0.44,0.462,0.186,0.268,0.378,0.412,0.398,0.35,0.416,0.416,0.424,0.34,0.406,0.432,0.406,0.132,0.246,0.4,0.333,0.399,0.414,0.401,0.415,0.431,0.413,-0.059,-0.078,0.443,0.02,0.341,0.425,0.349,0.164,0.423,0.427,0.427,0.426,0.429,0.44,0.426,0.422,0.434,0.438,0.461,0.182,OrganismalFitness,P53_HUMAN,Low,Human
+0.401,0.406,0.356,0.384,0.402,0.415,-0.023,0.298,0.335,0.344,0.418,0.409,0.447,-0.066,-0.055,0.331,0.418,0.44,0.462,0.186,0.268,0.378,0.412,0.398,0.35,0.416,0.416,0.424,0.34,0.406,0.432,0.406,0.132,0.246,0.4,0.333,0.399,0.414,0.401,0.415,0.431,0.413,-0.059,-0.078,0.443,0.02,0.341,0.425,0.349,0.164,0.423,0.427,0.427,0.426,0.429,0.44,0.426,0.422,0.434,0.438,0.461,0.182,OrganismalFitness,P53_HUMAN,Low,Human
+0.484,0.516,0.534,0.534,0.489,0.521,0.272,0.407,0.539,0.545,0.481,0.489,0.531,0.193,0.484,0.492,0.524,0.486,0.463,0.524,0.383,0.436,0.434,0.486,0.481,0.518,0.51,0.518,0.563,0.486,0.471,0.401,0.322,0.423,0.436,0.471,0.492,0.494,0.516,0.518,0.524,0.537,0.404,-0.032,0.46,0.402,0.256,0.336,0.396,0.081,0.46,0.436,0.373,0.479,0.473,0.42,0.449,0.457,0.473,0.476,0.524,0.526,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+0.521,0.474,0.41,0.418,0.495,0.493,0.35,0.429,0.491,0.512,0.545,0.514,0.534,0.357,0.429,0.515,0.597,0.549,0.563,0.408,0.508,0.528,0.531,0.555,0.506,0.558,0.57,0.538,0.525,0.522,0.568,0.503,0.191,0.511,0.504,0.503,0.556,0.556,0.551,0.531,0.53,0.529,0.465,-0.048,0.518,0.517,0.274,0.461,0.393,0.144,0.542,0.532,0.545,0.57,0.558,0.563,0.565,0.561,0.594,0.581,0.574,0.502,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+0.301,0.28,0.263,0.275,0.283,0.298,0.046,0.238,0.316,0.325,0.335,0.312,0.312,0.061,0.298,0.322,0.344,0.24,0.205,0.316,0.103,0.277,0.256,0.24,0.277,0.293,0.289,0.271,0.218,0.311,0.3,0.268,0.125,0.121,0.264,0.274,0.284,0.295,0.299,0.301,0.31,0.306,0.252,0.053,0.322,0.306,0.297,0.343,0.305,0.108,0.334,0.344,0.337,0.348,0.339,0.348,0.342,0.347,0.341,0.353,0.36,0.302,Activity,PAI1_HUMAN,,Human
+0.36,0.376,0.36,0.365,0.396,0.409,0.048,0.301,0.145,0.101,0.026,0.04,0.088,-0.002,0.024,0.026,0.024,0.018,0.275,0.248,0.336,0.382,0.4,0.407,0.171,0.297,0.332,0.336,0.327,0.516,0.284,0.264,0.086,0.308,0.347,0.396,0.433,0.44,0.435,0.462,0.455,0.433,0.024,0.031,0.037,0.013,0.18,0.178,0.149,0.046,0.132,0.141,0.147,0.149,0.134,0.152,0.134,0.132,0.101,0.143,0.167,0.121,OrganismalFitness,PA_I34A1,Medium,Virus
+0.149,0.3,0.541,0.52,0.215,0.256,0.477,0.395,0.526,0.536,0.474,0.57,0.6,0.58,0.648,0.529,0.563,0.528,0.574,0.472,0.499,0.526,0.433,0.51,0.326,0.434,0.417,0.483,0.426,0.415,0.414,0.346,0.21,0.398,0.325,0.419,0.413,0.369,0.422,0.282,0.302,0.287,0.469,0.278,0.452,0.463,0.142,0.364,0.493,0.284,0.439,0.388,0.395,0.393,0.398,0.411,0.408,0.413,0.421,0.41,0.551,0.588,Activity,PHOT_CHLRE,High,Eukaryote
+0.2,0.234,0.549,0.499,0.489,0.544,0.489,0.384,0.514,0.579,0.524,0.449,0.569,0.529,0.494,0.524,0.549,0.394,0.464,0.499,0.354,0.524,0.509,0.439,0.424,0.519,0.574,0.544,0.509,0.594,0.594,0.549,0.434,0.399,0.494,0.529,0.434,0.499,0.564,0.549,0.564,0.564,0.474,-0.17,0.529,0.414,0.399,0.444,0.594,0.569,0.534,0.494,0.414,0.389,0.389,0.444,0.424,0.504,0.454,0.444,0.603,0.653,Stability,PIN1_HUMAN,High,Human
+0.461,0.364,0.333,0.342,0.336,0.351,0.349,0.36,0.412,0.41,0.36,0.366,0.384,0.415,0.524,0.577,0.474,0.447,0.373,0.375,0.371,0.338,0.369,0.344,0.423,0.364,0.344,0.331,0.353,0.349,0.349,0.35,0.338,0.415,0.358,0.373,0.472,0.423,0.404,0.406,0.358,0.36,0.349,0.303,0.178,0.235,0.294,0.2,0.537,0.498,0.377,0.388,0.393,0.406,0.399,0.419,0.395,0.384,0.393,0.39,0.39,0.443,Stability,PITX2_HUMAN,High,Human
+0.177,0.184,0.232,0.246,0.288,0.285,0.177,0.246,0.201,0.229,0.226,0.281,0.299,0.327,0.334,0.435,0.26,0.226,0.229,0.292,0.292,0.278,0.222,0.281,0.281,0.299,0.299,0.278,0.281,0.295,0.32,0.32,0.062,0.309,0.271,0.306,0.246,0.292,0.323,0.306,0.288,0.306,0.274,0.267,0.274,0.288,0.431,0.337,0.428,0.376,0.232,0.264,0.292,0.243,0.257,0.271,0.236,0.257,0.281,0.257,0.292,0.424,Stability,PKN1_HUMAN,High,Human
+0.301,0.281,0.26,0.299,0.34,0.348,-0.03,0.255,0.376,0.374,0.228,-0.036,0.001,-0.047,-0.03,0.124,0.317,0.324,0.346,0.259,0.25,0.283,0.272,0.264,0.074,0.287,0.277,0.261,0.273,0.365,0.295,0.229,-0.005,0.036,0.187,0.248,0.263,0.276,0.311,0.288,0.307,0.347,-0.035,-0.039,0.268,-0.03,0.147,0.268,0.077,0.04,0.259,0.286,0.289,0.291,0.286,0.288,0.286,0.285,0.294,0.29,0.068,0.051,OrganismalFitness,POLG_CXB3N,Medium,Virus
+0.357,0.454,0.184,0.185,0.413,0.418,-0.014,0.33,0.535,0.542,0.241,0.011,0.02,-0.009,0.025,0.059,0.122,0.206,0.274,0.312,0.334,0.351,0.344,0.344,0.327,0.367,0.371,0.363,0.352,0.484,0.468,0.408,0.029,-0.025,0.087,0.352,0.331,0.337,0.422,0.364,0.315,0.43,-0.028,-0.025,0.319,0.017,0.282,0.42,0.099,0.084,0.195,0.189,0.198,0.216,0.184,0.217,0.197,0.199,0.19,0.206,0.115,0.023,OrganismalFitness,POLG_DEN26,Low,Virus
+0.448,0.432,0.336,0.346,0.46,0.472,-0.041,0.152,0.405,0.423,0.131,0.507,0.475,0.074,0.101,0.082,0.082,0.062,0.062,0.158,0.309,0.358,0.336,0.376,0.302,0.376,0.213,0.314,0.405,0.465,0.467,0.399,0.151,0.361,0.395,0.386,0.4,0.428,0.415,0.353,0.403,0.423,0.079,0.047,0.378,0.084,-0.035,0.26,0.578,0.311,0.245,0.292,0.282,0.297,0.299,0.227,0.269,0.299,0.267,0.309,0.119,0.143,OrganismalFitness,POLG_HCVJF,Medium,Virus
+0.194,0.352,0.217,0.263,0.295,0.297,0.053,0.269,0.222,0.256,0.267,0.041,0.089,0.048,0.066,0.032,0.089,0.064,0.048,0.39,0.079,0.06,0.023,0.116,0.076,-0.032,-0.003,-0.042,0.105,0.309,0.37,0.377,0.125,0.023,0.018,0.009,0.242,0.249,0.249,0.302,0.306,0.305,-0.025,-0.054,-0.012,-0.035,0.37,0.297,0.486,0.434,0.456,0.462,0.437,0.518,0.525,0.529,0.462,0.53,0.53,0.511,0.545,0.466,Stability,POLG_PESV,Medium,Virus
+0.299,0.373,0.407,0.41,0.396,0.397,0.132,0.263,0.363,0.383,0.397,0.405,0.409,0.04,0.12,0.347,0.414,0.469,0.461,0.434,0.406,0.389,0.214,0.256,0.39,0.407,0.399,0.414,0.283,0.45,0.424,0.411,0.278,0.43,0.38,0.342,0.431,0.414,0.397,0.445,0.429,0.423,0.195,0.042,0.359,0.32,0.368,0.355,0.402,0.2,0.404,0.412,0.419,0.408,0.408,0.417,0.416,0.414,0.42,0.42,0.424,0.352,Activity,PPARG_HUMAN,Medium,Human
+0.438,0.462,0.454,0.465,0.508,0.512,0.005,0.298,0.34,0.4,0.462,0.487,0.495,0.184,0.258,0.327,0.494,0.511,0.509,0.286,0.327,0.434,0.412,0.323,0.421,0.448,0.448,0.439,0.307,0.492,0.488,0.461,0.216,0.312,0.427,0.414,0.478,0.493,0.504,0.528,0.509,0.512,0.229,-0.034,0.465,0.397,0.389,0.464,0.437,0.197,0.467,0.463,0.46,0.469,0.483,0.46,0.477,0.476,0.479,0.482,0.52,0.392,OrganismalFitness,PPM1D_HUMAN,Low,Human
+0.58,0.629,0.69,0.672,0.724,0.724,0.413,0.413,0.743,0.755,0.696,0.58,0.623,0.444,0.428,0.747,0.769,0.735,0.712,0.659,0.497,0.6,0.613,0.611,0.539,0.604,0.672,0.619,0.653,0.763,0.789,0.794,0.56,0.539,0.436,0.486,0.67,0.676,0.667,0.69,0.716,0.698,0.367,0.181,0.19,0.358,0.472,0.226,0.633,0.649,0.789,0.765,0.775,0.781,0.787,0.79,0.79,0.789,0.806,0.802,0.731,0.712,Stability,PR40A_HUMAN,Medium,Human
+0.505,0.501,0.505,0.49,0.509,0.512,0.104,0.367,0.425,0.443,0.458,0.507,0.527,0.137,0.183,0.25,0.389,0.544,0.553,0.357,0.207,0.44,0.478,0.458,0.359,0.502,0.493,0.477,0.424,0.523,0.482,0.462,0.217,0.17,0.43,0.45,0.49,0.522,0.52,0.517,0.526,0.527,0.206,0.048,0.472,0.246,0.511,0.482,0.552,0.224,0.416,0.427,0.436,0.452,0.44,0.459,0.453,0.447,0.449,0.459,0.556,0.371,Expression,PRKN_HUMAN,Low,Human
+0.518,0.542,0.508,0.511,0.526,0.521,0.335,0.407,0.428,0.436,0.602,0.532,0.529,0.428,0.516,0.609,0.627,0.563,0.49,0.397,0.311,0.241,0.433,0.418,0.407,0.41,-0.129,0.449,0.459,0.471,0.49,0.459,0.161,0.332,0.394,0.399,0.511,0.498,0.459,0.558,0.568,0.495,0.423,0.286,0.586,0.441,0.532,0.576,0.638,0.539,0.617,0.583,0.602,0.599,0.591,0.625,0.591,0.599,0.612,0.607,0.622,0.625,Stability,PSAE_PICP2,Medium,Prokaryote
+0.349,0.362,0.355,0.35,0.366,0.369,0.108,0.282,0.372,0.395,0.365,0.352,0.383,0.13,0.25,0.399,0.394,0.217,0.218,0.382,0.192,0.348,0.292,0.277,0.28,0.249,0.243,0.277,0.218,0.38,0.355,0.357,0.086,0.248,0.337,0.264,0.341,0.374,0.341,0.37,0.392,0.385,0.23,0.001,0.379,0.317,0.352,0.343,0.386,0.155,0.378,0.37,0.381,0.383,0.381,0.388,0.385,0.386,0.385,0.395,0.415,0.377,Expression,PTEN_HUMAN,Medium,Human
+0.349,0.362,0.355,0.35,0.366,0.369,0.108,0.282,0.372,0.395,0.365,0.352,0.383,0.13,0.25,0.399,0.394,0.217,0.218,0.382,0.192,0.348,0.292,0.277,0.28,0.249,0.243,0.277,0.218,0.38,0.355,0.357,0.086,0.248,0.337,0.264,0.341,0.374,0.341,0.37,0.392,0.385,0.23,0.001,0.379,0.317,0.352,0.343,0.386,0.155,0.378,0.37,0.381,0.383,0.381,0.388,0.385,0.386,0.385,0.395,0.415,0.377,Activity,PTEN_HUMAN,Medium,Human
+0.361,0.321,0.279,0.304,0.378,0.376,0.024,0.327,0.396,0.401,0.35,0.391,0.414,0.034,0.03,0.024,0.054,0.078,0.104,0.303,0.394,0.301,0.289,0.244,0.392,0.287,0.284,0.327,0.26,0.379,0.378,0.369,0.222,0.371,0.318,0.295,0.39,0.384,0.379,0.397,0.394,0.387,0.34,0.02,0.385,0.345,0.292,0.348,0.199,0.106,0.166,0.207,0.23,0.226,0.188,0.193,0.194,0.185,0.167,0.218,0.151,0.079,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+0.326,0.295,0.322,0.324,0.33,0.328,0.024,0.234,0.301,0.312,0.226,0.268,0.288,0.11,0.102,0.311,0.32,0.292,0.338,0.318,0.254,0.279,0.276,0.242,0.279,0.297,0.284,0.316,0.296,0.322,0.338,0.31,0.08,0.244,0.267,0.224,0.316,0.325,0.32,0.346,0.356,0.341,0.278,0.105,0.298,0.282,0.288,0.325,0.3,0.14,0.316,0.316,0.332,0.334,0.33,0.318,0.311,0.324,0.33,0.328,0.294,0.397,Binding,Q53Z42_HUMAN,Medium,Human
+0.326,0.295,0.322,0.324,0.33,0.328,0.024,0.234,0.301,0.312,0.226,0.268,0.288,0.11,0.102,0.311,0.32,0.292,0.338,0.318,0.254,0.279,0.276,0.242,0.279,0.297,0.284,0.316,0.296,0.322,0.338,0.31,0.08,0.244,0.267,0.224,0.316,0.325,0.32,0.346,0.356,0.341,0.278,0.105,0.298,0.282,0.288,0.325,0.3,0.14,0.316,0.316,0.332,0.334,0.33,0.318,0.311,0.324,0.33,0.328,0.294,0.397,Expression,Q53Z42_HUMAN,Medium,Human
+0.418,0.49,0.508,0.518,0.516,0.523,0.235,0.408,0.546,0.55,0.471,0.415,0.415,0.085,0.325,0.38,0.431,0.462,0.453,0.507,0.459,0.505,0.519,0.523,0.481,0.525,0.528,0.537,0.561,0.528,0.514,0.452,0.265,0.477,0.514,0.501,0.484,0.518,0.528,0.526,0.536,0.537,0.372,-0.004,0.456,0.398,0.317,0.411,0.407,0.093,0.456,0.422,0.409,0.429,0.435,0.426,0.433,0.419,0.438,0.433,0.488,0.415,Activity,Q59976_STRSQ,Medium,Prokaryote
+0.175,0.194,0.137,0.14,0.18,0.18,-0.009,0.096,0.2,0.196,0.14,0.027,0.018,0.008,0.024,0.006,0.007,0.013,-0.011,0.161,0.026,0.024,0.002,0.071,0.021,0.0,0.02,0.0,-0.001,0.233,0.167,0.171,0.026,0.015,0.025,0.02,0.13,0.132,0.137,0.173,0.173,0.172,0.006,0.005,-0.002,0.012,0.138,0.15,0.215,0.129,0.153,0.164,0.151,0.175,0.168,0.164,0.16,0.167,0.163,0.164,0.068,0.047,Activity,Q6WV12_9MAXI,Low,Eukaryote
+0.257,0.303,0.28,0.252,0.28,0.326,0.315,0.24,0.269,0.297,0.355,0.32,0.338,0.274,0.309,0.309,0.286,0.355,0.315,-0.024,0.355,0.303,0.32,0.361,0.349,0.332,0.366,0.252,0.372,0.303,0.343,0.333,0.079,0.286,0.378,0.338,0.274,0.378,0.384,0.332,0.32,0.32,0.257,0.229,0.372,0.297,0.229,0.326,0.246,0.068,0.315,0.274,0.309,0.326,0.303,0.338,0.338,0.309,0.326,0.338,0.349,0.332,Activity,Q837P4_ENTFA,Medium,Prokaryote
+0.082,0.264,0.291,0.291,0.232,0.232,0.055,0.125,0.173,0.2,0.168,0.2,0.2,0.076,0.125,0.2,0.216,0.291,0.259,0.205,0.248,0.27,0.275,0.253,0.216,0.253,0.264,0.345,0.302,0.2,0.248,0.227,0.06,0.248,0.302,0.334,0.184,0.248,0.312,0.259,0.27,0.28,0.146,0.135,0.227,0.119,0.253,0.221,0.275,0.098,0.189,0.205,0.237,0.216,0.237,0.237,0.189,0.232,0.243,0.232,0.211,0.189,Activity,Q837P5_ENTFA,Medium,Prokaryote
+0.162,0.21,0.127,0.125,0.189,0.185,-0.004,0.187,0.169,0.173,0.111,-0.038,-0.048,-0.042,-0.059,-0.054,-0.048,-0.049,-0.023,0.129,-0.036,-0.005,0.022,-0.019,-0.024,-0.039,-0.02,0.138,0.144,0.203,0.187,0.194,-0.029,-0.064,-0.015,0.18,0.123,0.13,0.18,0.173,0.182,0.202,-0.035,-0.035,-0.053,-0.054,0.007,0.054,0.161,0.087,0.126,0.119,0.126,0.133,0.12,0.129,0.13,0.12,0.116,0.127,0.038,-0.07,Activity,Q8WTC7_9CNID,Low,Eukaryote
+0.477,0.427,0.185,0.221,0.517,0.514,-0.011,0.227,0.013,0.013,0.094,0.01,0.002,0.038,0.015,0.067,0.096,0.395,0.456,0.228,0.186,0.239,0.258,0.233,0.211,0.225,0.173,0.177,0.215,0.46,0.414,0.331,-0.026,0.152,0.19,0.174,0.308,0.339,0.33,0.463,0.484,0.48,0.006,-0.005,0.063,-0.003,0.337,0.31,0.367,0.162,0.197,0.156,0.216,0.216,0.201,0.201,0.204,0.2,0.173,0.205,0.17,0.097,OrganismalFitness,R1AB_SARS2,Medium,Virus
+0.204,0.112,0.226,0.213,0.23,0.226,0.362,0.283,0.542,0.485,0.415,0.38,0.397,0.327,0.52,0.56,0.353,0.353,0.432,0.279,0.432,0.476,0.441,0.362,0.459,0.305,0.472,0.345,0.318,0.424,0.279,0.191,0.248,0.388,0.525,0.345,0.331,0.459,0.371,0.349,0.388,0.309,0.472,0.195,0.2,0.323,0.38,0.261,0.586,0.38,0.38,0.406,0.415,0.415,0.419,0.41,0.384,0.432,0.393,0.428,0.283,0.463,Stability,RAD_ANTMA,High,Eukaryote
+0.394,0.367,0.367,0.367,0.354,0.367,0.044,0.286,0.354,0.367,0.394,0.354,0.421,0.017,0.192,0.354,0.394,0.381,0.34,0.246,0.205,0.313,0.34,0.3,0.286,0.313,0.286,0.259,0.259,0.34,0.394,0.3,0.111,0.246,0.232,0.286,0.34,0.3,0.313,0.354,0.367,0.394,0.165,0.071,0.421,0.327,0.232,0.354,0.192,0.286,0.34,0.354,0.3,0.34,0.3,0.407,0.34,0.354,0.367,0.327,0.354,0.111,OrganismalFitness,RAF1_HUMAN,Low,Human
+0.397,0.442,0.439,0.467,0.454,0.465,0.288,0.376,0.464,0.479,0.368,0.41,0.442,0.417,0.464,0.443,0.489,0.453,0.317,0.497,0.451,0.403,0.403,0.388,0.414,0.4,0.407,0.347,0.277,0.422,0.399,0.378,0.199,0.375,0.407,0.338,0.449,0.45,0.425,0.455,0.461,0.475,0.48,0.186,0.278,0.454,0.289,0.177,0.414,0.198,0.419,0.415,0.43,0.429,0.422,0.422,0.436,0.421,0.432,0.449,0.349,0.444,Activity,RASH_HUMAN,High,Human
+0.311,0.294,0.366,0.361,0.347,0.354,0.213,0.247,0.33,0.348,0.344,0.29,0.321,0.326,0.369,0.368,0.344,0.288,0.246,0.386,0.316,0.338,0.359,0.333,0.304,0.346,0.358,0.275,0.317,0.376,0.278,0.235,0.198,0.264,0.333,0.349,0.31,0.364,0.377,0.35,0.368,0.372,0.305,0.215,0.191,0.328,0.207,0.226,0.362,0.221,0.297,0.288,0.313,0.309,0.334,0.322,0.308,0.31,0.271,0.316,0.36,0.354,Expression,RASK_HUMAN,High,Human
+0.311,0.294,0.366,0.361,0.347,0.354,0.213,0.247,0.33,0.348,0.344,0.29,0.321,0.326,0.369,0.368,0.344,0.288,0.246,0.386,0.316,0.338,0.359,0.333,0.304,0.346,0.358,0.275,0.317,0.376,0.278,0.235,0.198,0.264,0.333,0.349,0.31,0.364,0.377,0.35,0.368,0.372,0.305,0.215,0.191,0.328,0.207,0.226,0.362,0.221,0.297,0.288,0.313,0.309,0.334,0.322,0.308,0.31,0.271,0.316,0.36,0.354,Binding,RASK_HUMAN,High,Human
+0.165,0.072,0.264,0.264,0.267,0.267,0.231,0.243,0.21,0.21,0.363,0.318,0.318,0.255,0.321,0.378,0.426,0.3,0.273,0.18,0.282,0.06,0.228,0.213,0.207,0.111,0.117,0.279,0.192,0.339,0.318,0.288,0.264,0.204,0.309,0.312,0.255,0.27,0.279,0.264,0.264,0.264,0.255,0.186,0.327,0.333,0.333,0.378,0.462,0.345,0.378,0.378,0.411,0.384,0.393,0.399,0.387,0.393,0.381,0.39,0.432,0.417,Stability,RBP1_HUMAN,High,Human
+0.277,0.274,0.331,0.34,0.331,0.331,0.213,0.217,0.328,0.34,0.372,0.197,0.236,0.258,0.226,0.461,0.464,0.451,0.429,0.315,0.178,0.182,0.242,0.305,0.239,0.41,0.416,0.359,0.423,0.385,0.467,0.391,0.042,0.185,0.239,0.232,0.299,0.305,0.283,0.362,0.369,0.337,0.229,0.194,0.356,0.229,0.388,0.391,0.464,0.439,0.461,0.461,0.47,0.505,0.467,0.496,0.47,0.48,0.499,0.483,0.464,0.381,Stability,RCD1_ARATH,Medium,Eukaryote
+0.286,0.414,0.49,0.471,0.45,0.474,0.074,0.23,0.406,0.414,0.4,0.256,0.334,0.146,0.228,0.205,0.42,0.432,0.432,0.488,0.018,0.065,0.095,0.091,-0.079,0.049,-0.06,-0.002,0.479,0.46,0.427,0.416,0.018,0.054,0.042,0.425,0.36,0.332,0.465,0.45,0.457,0.511,-0.109,0.053,0.234,0.042,0.367,0.442,0.543,0.432,0.427,0.411,0.414,0.404,0.406,0.409,0.411,0.409,0.418,0.418,0.453,0.32,Stability,RCRO_LAMBD,High,Virus
+0.235,0.168,0.266,0.282,0.305,0.309,0.121,0.344,0.419,0.419,0.384,0.45,0.462,0.156,0.521,0.458,0.423,0.399,0.286,0.344,0.301,0.321,0.313,0.356,0.403,0.372,0.372,0.38,0.388,0.364,0.356,0.339,0.305,0.231,0.38,0.38,0.29,0.321,0.368,0.305,0.317,0.329,0.384,0.023,0.415,0.466,0.372,0.392,0.447,0.376,0.36,0.341,0.36,0.344,0.38,0.384,0.368,0.368,0.38,0.38,0.368,0.423,Stability,RD23A_HUMAN,High,Human
+0.221,0.252,0.281,0.286,0.362,0.367,0.042,0.289,0.381,0.385,0.146,0.072,0.079,0.064,0.058,0.111,0.278,0.292,0.368,0.33,0.286,0.331,0.341,0.355,0.125,0.267,0.263,0.259,0.307,0.357,0.336,0.292,0.086,0.278,0.321,0.347,0.317,0.349,0.358,0.368,0.38,0.392,0.052,0.035,0.15,0.054,0.17,0.191,0.181,0.041,0.233,0.215,0.227,0.229,0.24,0.242,0.229,0.247,0.253,0.248,0.13,0.096,OrganismalFitness,RDRP_I33A0,Low,Virus
+0.132,0.128,0.141,0.152,0.156,0.154,0.033,0.219,0.132,0.149,0.104,0.178,0.195,0.029,0.018,0.109,0.152,0.206,0.184,0.115,0.141,0.18,0.171,0.141,0.206,0.19,0.106,0.175,0.19,0.191,0.245,0.266,0.024,0.143,0.182,0.173,0.178,0.182,0.162,0.182,0.19,0.167,0.031,0.037,0.145,0.044,0.177,0.16,0.201,0.139,0.141,0.143,0.203,0.188,0.177,0.204,0.206,0.134,0.219,0.193,0.208,0.136,OrganismalFitness,REV_HV1H2,Medium,Virus
+-0.026,0.176,0.24,0.264,0.222,0.231,-0.014,0.192,0.195,0.201,0.207,0.125,0.119,-0.053,0.023,0.005,0.24,0.261,0.204,0.222,-0.068,0.032,0.137,0.101,-0.053,0.116,0.152,0.125,0.149,0.164,0.27,0.247,0.014,0.002,0.113,0.152,0.053,0.14,0.131,0.192,0.179,0.195,0.053,-0.104,0.053,-0.014,0.173,0.137,0.252,0.234,0.279,0.261,0.252,0.267,0.234,0.255,0.255,0.276,0.255,0.264,0.198,0.104,Stability,RFAH_ECOLI,High,Prokaryote
+0.353,0.499,0.536,0.536,0.514,0.525,0.19,0.536,0.308,0.255,0.567,0.564,0.539,0.128,0.308,0.351,0.601,0.581,0.612,0.556,0.157,0.488,0.446,0.44,0.415,0.438,0.438,0.471,0.547,0.525,0.502,0.487,0.044,0.415,0.429,0.415,0.466,0.485,0.466,0.514,0.522,0.514,-0.043,-0.113,0.381,0.207,0.589,0.544,0.665,0.595,0.637,0.623,0.615,0.657,0.643,0.626,0.646,0.632,0.626,0.646,0.663,0.544,Stability,RL20_AQUAE,High,Prokaryote
+0.199,0.251,0.279,0.324,0.281,0.307,0.088,0.295,0.332,0.335,0.148,0.207,0.238,0.062,0.357,0.377,0.427,0.319,0.428,0.327,0.262,0.427,0.378,0.336,0.386,0.324,0.344,0.311,0.291,0.261,0.308,0.313,0.056,0.302,0.367,0.299,0.328,0.366,0.31,0.351,0.373,0.313,0.221,0.031,0.171,0.218,0.074,0.128,0.163,0.052,0.413,0.378,0.408,0.415,0.407,0.412,0.382,0.416,0.412,0.419,0.25,0.28,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.199,0.251,0.279,0.324,0.281,0.307,0.088,0.295,0.332,0.335,0.148,0.207,0.238,0.062,0.357,0.377,0.427,0.319,0.428,0.327,0.262,0.427,0.378,0.336,0.386,0.324,0.344,0.311,0.291,0.261,0.308,0.313,0.056,0.302,0.367,0.299,0.328,0.366,0.31,0.351,0.373,0.313,0.221,0.031,0.171,0.218,0.074,0.128,0.163,0.052,0.413,0.378,0.408,0.415,0.407,0.412,0.382,0.416,0.412,0.419,0.25,0.28,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.199,0.251,0.279,0.324,0.281,0.307,0.088,0.295,0.332,0.335,0.148,0.207,0.238,0.062,0.357,0.377,0.427,0.319,0.428,0.327,0.262,0.427,0.378,0.336,0.386,0.324,0.344,0.311,0.291,0.261,0.308,0.313,0.056,0.302,0.367,0.299,0.328,0.366,0.31,0.351,0.373,0.313,0.221,0.031,0.171,0.218,0.074,0.128,0.163,0.052,0.413,0.378,0.408,0.415,0.407,0.412,0.382,0.416,0.412,0.419,0.25,0.28,Activity,RL40A_YEAST,Medium,Eukaryote
+0.425,0.49,0.484,0.5,0.502,0.494,0.032,0.319,0.478,0.488,0.478,0.478,0.489,0.046,0.419,0.452,0.486,0.492,0.499,0.483,0.425,0.461,0.406,0.376,0.45,0.472,0.462,0.473,0.459,0.496,0.48,0.457,0.138,0.426,0.421,0.331,0.459,0.471,0.435,0.506,0.517,0.502,0.397,0.049,0.467,0.447,0.227,0.425,0.2,0.138,0.462,0.453,0.419,0.453,0.453,0.454,0.46,0.46,0.462,0.464,0.492,0.408,Activity,RNC_ECOLI,Medium,Prokaryote
+0.557,0.576,0.549,0.571,0.491,0.533,0.538,0.524,0.557,0.576,0.623,0.667,0.67,0.629,0.661,0.678,0.623,0.598,0.505,0.596,0.609,0.62,0.637,0.579,0.629,0.626,0.626,0.626,0.587,0.664,0.615,0.555,0.502,0.516,0.623,0.631,0.596,0.642,0.642,0.596,0.585,0.593,0.626,0.59,0.664,0.642,0.615,0.557,0.711,0.607,0.607,0.579,0.601,0.612,0.62,0.604,0.612,0.607,0.626,0.612,0.694,0.683,Stability,RPC1_BP434,High,Virus
+0.248,0.362,0.426,0.438,0.414,0.374,0.136,0.366,0.355,0.392,0.414,0.422,0.446,0.208,0.208,0.32,0.469,0.496,0.514,0.355,0.176,0.324,0.386,0.378,0.159,0.29,0.296,0.248,0.474,0.326,0.552,0.468,0.063,0.099,0.308,0.428,0.219,0.296,0.404,0.368,0.391,0.446,0.242,0.224,0.362,0.237,0.181,0.312,0.282,0.196,0.504,0.444,0.474,0.498,0.492,0.522,0.474,0.462,0.486,0.51,0.444,0.278,Activity,RPC1_LAMBD,High,Virus
+0.248,0.362,0.426,0.438,0.414,0.374,0.136,0.366,0.355,0.392,0.414,0.422,0.446,0.208,0.208,0.32,0.469,0.496,0.514,0.355,0.176,0.324,0.386,0.378,0.159,0.29,0.296,0.248,0.474,0.326,0.552,0.468,0.063,0.099,0.308,0.428,0.219,0.296,0.404,0.368,0.391,0.446,0.242,0.224,0.362,0.237,0.181,0.312,0.282,0.196,0.504,0.444,0.474,0.498,0.492,0.522,0.474,0.462,0.486,0.51,0.444,0.278,Activity,RPC1_LAMBD,High,Virus
+0.295,0.289,0.279,0.282,0.272,0.272,0.162,0.168,0.322,0.322,0.356,0.332,0.305,0.225,0.245,0.339,0.326,0.329,0.232,0.242,0.108,0.185,0.252,0.212,0.235,0.252,0.245,0.262,0.245,0.315,0.339,0.324,0.085,0.279,0.222,0.242,0.299,0.259,0.265,0.265,0.255,0.255,0.262,0.175,0.315,0.259,0.423,0.322,0.469,0.389,0.322,0.322,0.312,0.312,0.302,0.326,0.329,0.336,0.312,0.329,0.322,0.463,Stability,RS15_GEOSE,Medium,Prokaryote
+0.364,0.426,0.445,0.45,0.452,0.463,0.27,0.416,0.481,0.482,0.467,0.477,0.512,0.354,0.396,0.4,0.48,0.464,0.44,0.026,0.375,0.471,0.478,0.471,0.388,0.487,0.496,0.474,0.448,0.447,0.464,0.357,0.267,0.398,0.468,0.469,0.415,0.484,0.491,0.475,0.488,0.491,0.372,0.174,0.46,0.41,0.313,0.422,0.406,0.121,0.445,0.434,0.431,0.446,0.464,0.448,0.458,0.463,0.456,0.468,0.46,0.422,Expression,S22A1_HUMAN,Medium,Human
+0.364,0.426,0.445,0.45,0.452,0.463,0.27,0.416,0.481,0.482,0.467,0.477,0.512,0.354,0.396,0.4,0.48,0.464,0.44,0.026,0.375,0.471,0.478,0.471,0.388,0.487,0.496,0.474,0.448,0.447,0.464,0.357,0.267,0.398,0.468,0.469,0.415,0.484,0.491,0.475,0.488,0.491,0.372,0.174,0.46,0.41,0.313,0.422,0.406,0.121,0.445,0.434,0.431,0.446,0.464,0.448,0.458,0.463,0.456,0.468,0.46,0.422,Activity,S22A1_HUMAN,Medium,Human
+0.082,0.074,0.297,0.248,0.285,0.281,0.297,0.438,0.268,0.285,0.418,0.409,0.422,0.136,0.339,0.463,0.405,0.463,0.368,0.38,0.443,0.372,0.355,0.376,0.422,0.451,0.389,0.401,0.405,0.467,0.447,0.418,0.281,0.405,0.434,0.389,0.281,0.397,0.351,0.26,0.285,0.302,0.463,-0.072,0.463,0.451,0.19,0.206,0.289,0.397,0.438,0.281,0.181,0.285,0.264,0.322,0.339,0.434,0.372,0.347,0.455,0.48,Stability,SAV1_MOUSE,High,Eukaryote
+0.218,0.179,0.311,0.272,0.315,0.315,0.182,0.128,0.378,0.381,0.253,0.233,0.241,0.186,0.206,0.311,0.463,0.502,0.233,0.276,0.175,0.175,0.214,0.233,0.194,0.206,0.167,0.218,0.311,0.42,0.409,0.358,0.194,0.14,0.155,0.19,0.186,0.194,0.19,0.276,0.292,0.3,0.221,0.182,0.264,0.233,0.526,0.475,0.538,0.479,0.389,0.417,0.424,0.428,0.432,0.436,0.448,0.44,0.405,0.436,0.538,0.444,Stability,SBI_STAAM,Medium,Prokaryote
+0.359,0.4,0.352,0.372,0.438,0.452,0.273,0.405,0.483,0.49,0.454,0.467,0.467,0.135,0.255,0.44,0.477,0.478,0.449,0.47,0.43,0.418,0.399,0.394,0.435,0.419,0.43,0.428,0.421,0.456,0.384,0.317,0.283,0.436,0.416,0.386,0.459,0.455,0.432,0.475,0.475,0.461,0.419,0.1,0.461,0.453,0.383,0.43,0.406,0.132,0.455,0.454,0.468,0.47,0.478,0.469,0.475,0.477,0.466,0.482,0.491,0.452,Activity,SC6A4_HUMAN,Medium,Human
+0.051,0.084,0.167,0.18,0.236,0.213,0.097,0.012,0.183,0.2,0.18,0.173,0.17,0.127,0.186,0.167,0.19,0.262,0.262,0.17,0.054,0.087,0.084,0.091,0.107,0.111,0.101,0.13,0.134,0.18,0.177,0.109,-0.025,0.035,0.051,0.107,0.114,0.104,0.12,0.19,0.173,0.18,0.13,0.111,0.2,0.163,0.375,0.391,0.381,0.325,0.256,0.276,0.269,0.292,0.259,0.295,0.272,0.249,0.206,0.276,0.447,0.355,Stability,SCIN_STAAR,High,Prokaryote
+0.036,0.054,0.089,0.089,0.125,0.143,0.071,0.0,0.071,0.125,0.089,0.143,0.071,0.196,0.143,0.071,0.089,0.054,0.071,0.054,0.018,0.054,0.071,0.089,-0.018,0.018,0.018,0.036,0.107,0.027,0.161,0.143,0.036,0.0,0.0,0.089,-0.018,-0.018,-0.018,0.054,0.036,0.036,0.018,0.125,0.107,0.125,0.036,0.054,0.018,-0.018,0.089,0.054,0.071,0.036,0.054,0.089,0.143,0.107,0.036,0.089,0.125,0.018,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+0.477,0.527,0.53,0.53,0.532,0.532,0.118,0.443,0.524,0.525,0.529,0.512,0.522,0.284,0.282,0.518,0.508,0.5,0.505,0.529,0.156,0.19,0.192,0.318,0.163,0.323,0.084,0.323,0.385,0.532,0.527,0.527,-0.24,0.103,0.287,0.342,0.488,0.496,0.46,0.512,0.512,0.508,0.096,-0.009,0.342,0.068,0.376,0.349,0.529,0.522,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.529,0.518,0.525,Stability,SDA_BACSU,Medium,Prokaryote
+0.273,0.354,0.394,0.396,0.392,0.394,0.018,0.333,0.417,0.421,0.371,0.379,0.369,0.116,0.281,0.388,0.396,0.436,0.398,0.379,0.346,0.371,0.381,0.369,0.352,0.386,0.369,0.396,0.371,0.354,0.365,0.315,0.152,0.361,0.373,0.379,0.365,0.375,0.379,0.4,0.413,0.415,0.202,0.047,0.39,0.335,0.277,0.375,0.292,0.112,0.419,0.396,0.411,0.427,0.417,0.421,0.398,0.4,0.404,0.423,0.396,0.35,OrganismalFitness,SERC_HUMAN,High,Human
+0.133,0.265,0.294,0.29,0.285,0.296,0.129,0.26,0.314,0.319,0.298,0.278,0.304,0.144,0.152,0.166,0.323,0.295,0.213,0.294,0.161,0.282,0.289,0.262,0.166,0.296,0.3,0.304,0.27,0.308,0.303,0.254,0.099,0.15,0.234,0.277,0.152,0.201,0.258,0.26,0.275,0.303,0.156,0.149,0.317,0.191,0.216,0.277,0.202,0.064,0.295,0.296,0.303,0.308,0.306,0.306,0.29,0.307,0.296,0.307,0.217,0.187,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+0.17,0.21,0.25,0.253,0.242,0.238,0.289,0.158,0.166,0.202,0.174,0.234,0.218,0.21,0.25,0.226,0.222,0.17,0.186,0.158,0.103,0.115,0.111,0.131,0.19,0.182,0.103,0.194,0.103,0.17,0.04,0.026,0.158,0.079,0.162,0.166,0.147,0.158,0.154,0.234,0.23,0.214,0.186,0.127,0.147,0.246,0.38,0.253,0.412,0.356,0.293,0.277,0.242,0.277,0.273,0.281,0.257,0.269,0.246,0.285,0.25,0.364,Stability,SOX30_HUMAN,High,Human
+0.373,0.44,0.464,0.445,0.493,0.498,-0.114,0.46,0.407,0.409,0.367,-0.068,-0.028,-0.055,-0.058,-0.076,-0.057,-0.068,-0.022,0.491,-0.155,-0.022,0.111,0.054,-0.085,-0.055,0.042,0.23,0.312,0.436,0.424,0.412,-0.02,-0.009,-0.024,0.008,0.335,0.327,0.318,0.489,0.487,0.481,-0.159,-0.117,-0.123,-0.074,0.362,0.306,0.563,0.434,0.472,0.455,0.445,0.47,0.451,0.455,0.472,0.46,0.46,0.47,0.409,0.286,Stability,SPA_STAAU,Medium,Prokaryote
+0.09,0.132,0.041,0.052,0.121,0.126,0.035,0.036,0.047,0.115,0.18,0.146,0.125,0.148,0.145,0.167,0.188,0.182,0.218,0.15,0.126,0.104,0.101,0.109,0.106,0.129,0.114,0.115,0.156,0.132,0.226,0.205,0.022,0.126,0.086,0.13,0.138,0.113,0.145,0.138,0.115,0.148,0.01,-0.038,0.118,0.048,0.149,0.122,0.211,0.088,0.21,0.188,0.195,0.225,0.214,0.2,0.228,0.235,0.232,0.222,0.217,0.194,Binding,SPG1_STRSG,Low,Prokaryote
+0.09,0.132,0.041,0.052,0.121,0.126,0.035,0.036,0.047,0.115,0.18,0.146,0.125,0.148,0.145,0.167,0.188,0.182,0.218,0.15,0.126,0.104,0.101,0.109,0.106,0.129,0.114,0.115,0.156,0.132,0.226,0.205,0.022,0.126,0.086,0.13,0.138,0.113,0.145,0.138,0.115,0.148,0.01,-0.038,0.118,0.048,0.149,0.122,0.211,0.088,0.21,0.188,0.195,0.225,0.214,0.2,0.228,0.235,0.232,0.222,0.217,0.194,Binding,SPG1_STRSG,Low,Prokaryote
+0.09,0.132,0.041,0.052,0.121,0.126,0.035,0.036,0.047,0.115,0.18,0.146,0.125,0.148,0.145,0.167,0.188,0.182,0.218,0.15,0.126,0.104,0.101,0.109,0.106,0.129,0.114,0.115,0.156,0.132,0.226,0.205,0.022,0.126,0.086,0.13,0.138,0.113,0.145,0.138,0.115,0.148,0.01,-0.038,0.118,0.048,0.149,0.122,0.211,0.088,0.21,0.188,0.195,0.225,0.214,0.2,0.228,0.235,0.232,0.222,0.217,0.194,Binding,SPG1_STRSG,Medium,Prokaryote
+0.09,0.132,0.041,0.052,0.121,0.126,0.035,0.036,0.047,0.115,0.18,0.146,0.125,0.148,0.145,0.167,0.188,0.182,0.218,0.15,0.126,0.104,0.101,0.109,0.106,0.129,0.114,0.115,0.156,0.132,0.226,0.205,0.022,0.126,0.086,0.13,0.138,0.113,0.145,0.138,0.115,0.148,0.01,-0.038,0.118,0.048,0.149,0.122,0.211,0.088,0.21,0.188,0.195,0.225,0.214,0.2,0.228,0.235,0.232,0.222,0.217,0.194,Binding,SPG1_STRSG,Medium,Prokaryote
+0.442,0.458,0.461,0.483,0.555,0.519,0.312,0.301,0.489,0.414,0.417,0.384,0.354,0.326,0.376,0.403,0.436,0.425,0.489,0.425,0.343,0.376,0.378,0.354,0.298,0.381,0.301,0.403,0.45,0.571,0.525,0.507,-0.126,0.332,0.409,0.389,0.304,0.309,0.37,0.486,0.513,0.5,0.235,0.13,0.235,0.216,0.395,0.315,0.662,0.494,0.53,0.505,0.527,0.513,0.494,0.522,0.527,0.569,0.516,0.53,0.519,0.519,Stability,SPG2_STRSG,Medium,Prokaryote
+0.136,0.179,0.086,0.166,0.264,0.259,-0.046,0.262,0.296,0.3,0.014,-0.054,-0.026,-0.035,-0.036,-0.02,-0.012,-0.002,0.036,0.263,0.231,0.276,0.274,0.286,0.286,0.256,0.222,0.287,0.238,0.215,0.196,0.254,0.152,0.238,0.228,0.268,0.234,0.24,0.244,0.294,0.297,0.302,-0.032,-0.002,-0.006,0.002,0.424,0.399,0.349,0.127,0.181,0.196,0.184,0.233,0.2,0.24,0.201,0.228,0.202,0.224,0.277,0.138,Binding,SPIKE_SARS2,Medium,Virus
+0.136,0.179,0.086,0.166,0.264,0.259,-0.046,0.262,0.296,0.3,0.014,-0.054,-0.026,-0.035,-0.036,-0.02,-0.012,-0.002,0.036,0.263,0.231,0.276,0.274,0.286,0.286,0.256,0.222,0.287,0.238,0.215,0.196,0.254,0.152,0.238,0.228,0.268,0.234,0.24,0.244,0.294,0.297,0.302,-0.032,-0.002,-0.006,0.002,0.424,0.399,0.349,0.127,0.181,0.196,0.184,0.233,0.2,0.24,0.201,0.228,0.202,0.224,0.277,0.138,Expression,SPIKE_SARS2,Medium,Virus
+0.499,0.446,0.433,0.402,0.426,0.412,0.14,0.378,0.4,0.422,0.402,0.485,0.445,-0.069,0.443,0.437,0.442,0.544,0.489,0.392,0.425,0.417,0.435,0.4,0.458,0.412,0.46,0.396,0.382,0.385,0.405,0.431,0.33,0.373,0.418,0.391,0.463,0.465,0.471,0.406,0.42,0.415,0.367,-0.148,0.335,0.352,0.303,0.248,0.472,0.44,0.433,0.432,0.463,0.455,0.451,0.448,0.433,0.423,0.435,0.433,0.5,0.415,Stability,SPTN1_CHICK,High,Eukaryote
+0.324,0.409,0.42,0.437,0.426,0.437,0.064,0.233,0.369,0.414,0.318,0.233,0.324,0.143,0.318,0.386,0.437,0.494,0.369,0.431,0.143,0.296,0.369,0.296,0.182,0.392,0.42,0.347,0.358,0.426,0.414,0.378,0.075,0.149,0.284,0.397,0.358,0.324,0.42,0.471,0.386,0.437,0.16,0.115,0.454,0.403,0.528,0.494,0.528,0.528,0.477,0.431,0.516,0.494,0.482,0.477,0.488,0.471,0.482,0.494,0.482,0.341,Stability,SQSTM_MOUSE,Medium,Eukaryote
+0.506,0.529,0.536,0.539,0.567,0.554,-0.141,0.37,0.546,0.544,0.572,0.564,0.559,-0.169,0.539,0.589,0.579,0.584,0.569,0.506,-0.111,0.319,0.231,0.375,0.347,0.42,0.458,0.483,0.435,0.564,0.577,0.574,0.263,0.049,0.15,0.319,0.534,0.519,0.511,0.562,0.559,0.526,0.466,-0.245,0.572,0.476,0.059,0.461,0.587,0.572,0.572,0.567,0.574,0.582,0.587,0.595,0.567,0.567,0.584,0.577,0.655,0.612,Stability,SR43C_ARATH,High,Eukaryote
+0.413,0.418,0.559,0.549,0.598,0.585,0.444,0.336,0.618,0.621,0.474,0.536,0.567,0.379,0.575,0.613,0.593,0.544,0.541,0.541,0.549,0.513,0.51,0.495,0.495,0.515,0.523,0.503,0.503,0.539,0.59,0.559,0.485,0.48,0.487,0.539,0.513,0.554,0.559,0.577,0.587,0.585,0.248,0.117,0.456,0.433,0.438,0.446,0.497,0.51,0.621,0.593,0.6,0.6,0.613,0.608,0.598,0.59,0.613,0.608,0.582,0.521,Stability,SRBS1_HUMAN,High,Human
+0.394,0.391,0.385,0.387,0.398,0.397,0.399,0.346,0.398,0.421,0.382,0.442,0.461,0.288,0.35,0.348,0.434,0.38,0.391,0.384,0.33,0.316,0.329,0.288,0.338,0.372,0.332,0.317,0.234,0.427,0.453,0.44,0.324,0.297,0.305,0.243,0.375,0.377,0.364,0.396,0.392,0.394,0.431,0.254,0.382,0.423,0.229,0.224,0.3,0.056,0.405,0.385,0.387,0.409,0.421,0.418,0.409,0.422,0.411,0.427,0.408,0.355,Activity,SRC_HUMAN,Medium,Human
+0.394,0.391,0.385,0.387,0.398,0.397,0.399,0.346,0.398,0.421,0.382,0.442,0.461,0.288,0.35,0.348,0.434,0.38,0.391,0.384,0.33,0.316,0.329,0.288,0.338,0.372,0.332,0.317,0.234,0.427,0.453,0.44,0.324,0.297,0.305,0.243,0.375,0.377,0.364,0.396,0.392,0.394,0.431,0.254,0.382,0.423,0.229,0.224,0.3,0.056,0.405,0.385,0.387,0.409,0.421,0.418,0.409,0.422,0.411,0.427,0.408,0.355,Activity,SRC_HUMAN,Medium,Human
+0.394,0.391,0.385,0.387,0.398,0.397,0.399,0.346,0.398,0.421,0.382,0.442,0.461,0.288,0.35,0.348,0.434,0.38,0.391,0.384,0.33,0.316,0.329,0.288,0.338,0.372,0.332,0.317,0.234,0.427,0.453,0.44,0.324,0.297,0.305,0.243,0.375,0.377,0.364,0.396,0.392,0.394,0.431,0.254,0.382,0.423,0.229,0.224,0.3,0.056,0.405,0.385,0.387,0.409,0.421,0.418,0.409,0.422,0.411,0.427,0.408,0.355,OrganismalFitness,SRC_HUMAN,Medium,Human
+0.339,0.293,0.339,0.363,0.392,0.378,0.123,0.37,0.378,0.303,0.341,0.402,0.431,0.213,0.412,0.467,0.433,0.307,0.281,0.448,0.196,0.322,0.378,0.337,0.402,0.295,0.402,0.349,0.269,0.375,0.399,0.426,0.201,0.223,0.416,0.252,0.339,0.46,0.322,0.351,0.465,0.344,0.431,0.056,0.293,0.45,0.438,0.358,0.445,0.37,0.431,0.37,0.429,0.429,0.419,0.453,0.402,0.436,0.412,0.443,0.397,0.453,OrganismalFitness,SUMO1_HUMAN,High,Human
+0.152,0.192,0.198,0.234,0.196,0.211,0.202,0.267,0.242,0.248,0.338,0.332,0.324,0.206,0.231,0.229,0.236,0.278,0.309,0.332,0.248,0.326,0.255,0.238,0.194,0.273,0.286,0.259,0.238,0.307,0.311,0.252,0.129,0.257,0.359,0.284,0.189,0.278,0.229,0.225,0.276,0.231,0.211,0.173,0.313,0.24,-0.006,0.206,0.051,-0.041,0.19,0.19,0.189,0.198,0.229,0.196,0.227,0.215,0.229,0.221,0.074,0.059,OrganismalFitness,SYUA_HUMAN,Medium,Human
+0.09,0.04,0.077,0.08,0.074,0.07,0.154,0.023,0.06,0.054,0.01,0.03,0.023,0.054,0.02,0.023,-0.07,0.013,0.054,-0.047,0.077,0.037,-0.003,-0.01,0.104,0.07,0.043,-0.017,-0.03,0.03,0.107,0.142,-0.043,0.12,0.147,0.084,0.09,0.11,0.087,0.1,0.117,0.084,0.054,0.107,-0.007,0.03,0.197,0.023,0.12,0.047,0.09,-0.033,-0.013,0.003,0.007,0.027,0.033,0.003,-0.06,0.023,0.01,0.057,OrganismalFitness,TADBP_HUMAN,Low,Human
+0.265,0.183,0.238,0.243,0.259,0.243,-0.076,0.396,0.227,0.257,0.159,0.338,0.322,-0.032,0.006,0.009,0.028,-0.057,0.044,0.186,0.393,0.415,0.417,0.423,0.36,0.257,0.12,0.229,0.199,0.314,0.401,0.403,0.319,0.371,0.175,0.227,0.374,0.254,0.229,0.341,0.27,0.257,0.02,-0.027,0.284,0.156,0.202,0.281,0.3,0.178,0.082,0.126,0.139,0.099,0.107,0.107,0.074,0.099,0.09,0.093,0.161,0.145,OrganismalFitness,TAT_HV1BR,High,Virus
+0.59,0.537,0.606,0.61,0.655,0.655,0.682,0.312,0.648,0.697,0.552,0.728,0.747,0.67,0.69,0.762,0.731,0.69,0.724,0.514,0.423,0.571,0.556,0.602,0.583,0.579,0.636,0.644,0.575,0.693,0.636,0.587,0.541,0.446,0.602,0.686,0.64,0.705,0.697,0.636,0.655,0.648,0.701,0.103,0.53,0.667,0.541,0.461,0.739,0.712,0.705,0.663,0.644,0.739,0.712,0.735,0.709,0.724,0.697,0.72,0.758,0.754,Stability,TCRG1_MOUSE,Medium,Eukaryote
+0.25,0.26,0.437,0.411,0.407,0.42,-0.002,0.122,0.535,0.525,0.512,0.42,0.483,0.099,0.525,0.532,0.496,0.515,0.545,0.496,-0.097,0.266,0.312,0.296,0.266,0.286,0.185,0.368,0.443,0.499,0.551,0.548,0.414,-0.042,0.339,0.306,0.397,0.463,0.473,0.411,0.489,0.483,0.188,0.076,0.424,0.306,0.414,0.401,0.443,0.568,0.542,0.529,0.568,0.587,0.571,0.555,0.555,0.558,0.548,0.561,0.479,0.404,Stability,THO1_YEAST,High,Eukaryote
+0.346,0.335,0.443,0.438,0.424,0.421,-0.109,0.318,0.511,0.492,0.405,0.416,0.419,-0.014,0.394,0.456,0.473,0.432,0.47,0.405,0.227,0.316,0.34,0.321,0.351,0.381,0.289,0.332,0.346,0.486,0.454,0.435,0.167,0.067,0.278,0.302,0.397,0.394,0.397,0.462,0.421,0.427,0.153,-0.076,0.4,0.337,0.335,0.473,0.559,0.508,0.459,0.459,0.467,0.47,0.451,0.443,0.44,0.432,0.446,0.454,0.586,0.467,Stability,TNKS2_HUMAN,High,Human
+0.19,0.194,0.185,0.198,0.203,0.209,0.059,0.193,0.236,0.221,0.23,0.215,0.257,0.121,0.162,0.216,0.266,0.258,0.306,0.191,0.096,0.107,0.18,0.222,0.119,0.217,0.22,0.209,0.231,0.191,0.199,0.161,0.105,0.117,0.171,0.234,0.19,0.197,0.242,0.208,0.209,0.239,0.131,0.104,0.269,0.152,0.148,0.209,0.184,0.076,0.2,0.207,0.211,0.217,0.226,0.226,0.215,0.229,0.233,0.231,0.217,0.185,OrganismalFitness,TPK1_HUMAN,Medium,Human
+0.263,0.306,0.313,0.323,0.322,0.331,0.173,0.302,0.322,0.329,0.349,0.33,0.344,0.225,0.265,0.344,0.349,0.295,0.291,0.341,0.186,0.263,0.301,0.299,0.265,0.304,0.301,0.268,0.295,0.351,0.359,0.342,0.223,0.195,0.302,0.29,0.302,0.337,0.331,0.349,0.334,0.344,0.244,0.148,0.345,0.316,0.322,0.347,0.36,0.162,0.345,0.342,0.349,0.342,0.348,0.356,0.359,0.344,0.355,0.356,0.36,0.324,Expression,TPMT_HUMAN,Medium,Human
+0.273,0.222,0.23,0.188,0.162,0.213,0.299,0.316,0.265,0.29,0.29,0.265,0.282,0.247,0.307,0.265,0.179,0.282,0.205,0.213,0.213,0.299,0.239,0.265,0.307,0.128,0.35,0.41,0.239,0.358,0.282,0.133,0.324,0.307,0.29,0.333,0.299,0.299,0.367,0.316,0.307,0.35,0.247,0.247,0.213,0.265,0.119,0.23,0.213,0.0,0.205,0.196,0.273,0.23,0.205,0.273,0.29,0.111,0.188,0.239,0.324,0.299,OrganismalFitness,TPOR_HUMAN,Low,Human
+0.516,0.527,0.462,0.479,0.487,0.508,0.188,0.465,0.557,0.562,0.546,0.543,0.557,0.258,0.506,0.532,0.541,0.543,0.524,0.462,0.368,0.519,0.465,0.516,0.436,0.471,0.444,0.46,0.441,0.441,0.527,0.462,0.196,0.487,0.492,0.508,0.511,0.527,0.527,0.506,0.516,0.516,0.395,0.045,0.543,0.465,0.25,0.384,0.387,0.099,0.541,0.495,0.503,0.557,0.53,0.524,0.543,0.546,0.524,0.551,0.535,0.511,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+0.365,0.371,0.348,0.371,0.365,0.363,0.237,0.333,0.383,0.383,0.371,0.395,0.406,0.19,0.377,0.386,0.392,0.383,0.389,0.354,0.283,0.324,0.33,0.339,0.354,0.354,0.313,0.333,0.36,0.348,0.339,0.304,0.146,0.333,0.392,0.365,0.363,0.38,0.363,0.368,0.365,0.357,0.33,0.093,0.377,0.333,0.169,0.263,0.275,0.003,0.383,0.316,0.357,0.377,0.377,0.33,0.36,0.36,0.351,0.377,0.409,0.383,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+0.304,0.407,0.457,0.474,0.438,0.457,-0.027,0.327,0.43,0.44,0.34,0.396,0.432,0.018,0.025,0.326,0.379,0.426,0.49,0.477,0.168,0.337,0.379,0.346,0.348,0.396,0.388,0.359,0.354,0.399,0.371,0.372,0.103,0.168,0.346,0.355,0.243,0.366,0.394,0.371,0.437,0.455,0.209,0.001,0.316,0.423,0.273,0.277,0.385,0.218,0.327,0.346,0.316,0.352,0.338,0.346,0.327,0.343,0.351,0.351,0.293,0.357,OrganismalFitness,UBC9_HUMAN,Medium,Human
+0.247,0.336,0.369,0.379,0.37,0.378,0.152,0.349,0.363,0.371,0.341,0.414,0.4,0.334,0.358,0.394,0.374,0.402,0.428,0.383,0.217,0.358,0.35,0.371,0.377,0.383,0.342,0.341,0.369,0.387,0.422,0.418,0.387,0.153,0.167,0.299,0.333,0.324,0.332,0.382,0.375,0.386,0.326,-0.02,0.252,0.259,0.369,0.292,0.394,0.305,0.35,0.385,0.369,0.37,0.374,0.366,0.374,0.375,0.385,0.374,0.411,0.39,Stability,UBE4B_HUMAN,High,Human
+0.306,0.312,0.348,0.342,0.348,0.354,0.04,0.046,0.318,0.342,0.276,0.33,0.348,0.318,0.336,0.36,0.33,0.318,0.288,0.318,0.082,0.276,0.27,0.252,0.348,0.3,0.288,0.288,0.252,0.33,0.36,0.3,0.058,-0.009,0.118,0.203,0.312,0.288,0.306,0.342,0.36,0.379,0.306,-0.003,0.33,0.348,0.191,0.27,-0.057,0.034,0.318,0.33,0.354,0.354,0.342,0.36,0.33,0.36,0.342,0.354,0.33,0.288,Activity,UBE4B_MOUSE,Low,Eukaryote
+0.374,0.495,0.528,0.544,0.542,0.566,0.142,0.294,0.432,0.478,0.492,0.374,0.454,0.192,0.187,0.129,0.101,0.561,0.522,0.421,0.404,0.393,0.429,0.487,0.443,0.454,0.462,0.443,0.451,0.506,0.528,0.382,0.264,0.283,0.376,0.365,0.374,0.41,0.418,0.553,0.553,0.547,0.093,0.098,0.291,0.101,0.445,0.244,0.588,0.484,0.533,0.536,0.547,0.553,0.561,0.572,0.536,0.555,0.547,0.55,0.586,0.514,Stability,UBR5_HUMAN,Medium,Human
+0.12,0.192,0.12,0.148,0.192,0.198,0.187,-0.057,0.264,0.242,0.519,0.441,0.463,0.286,0.419,0.491,0.519,0.491,0.546,0.109,0.297,0.402,0.358,0.485,0.32,0.342,0.331,0.369,0.447,0.441,0.43,0.295,0.048,0.237,0.358,0.397,0.198,0.281,0.342,0.22,0.248,0.281,0.347,0.209,0.441,0.38,0.386,0.425,0.43,0.375,0.53,0.48,0.508,0.53,0.508,0.568,0.474,0.535,0.563,0.535,0.497,0.48,Stability,VG08_BPP22,High,Virus
+0.271,0.323,0.432,0.476,0.495,0.499,0.065,0.416,0.441,0.478,0.55,0.485,0.49,0.26,0.158,0.513,0.517,0.474,0.42,0.425,0.234,0.35,0.429,0.385,0.379,0.425,0.411,0.378,0.441,0.474,0.499,0.489,0.283,0.264,0.343,0.357,0.409,0.415,0.409,0.501,0.494,0.48,0.163,0.2,0.36,0.351,0.387,0.346,0.539,0.464,0.545,0.553,0.555,0.539,0.546,0.552,0.553,0.546,0.553,0.552,0.499,0.508,Stability,VILI_CHICK,High,Eukaryote
+0.32,0.326,0.343,0.358,0.376,0.376,0.08,0.314,0.408,0.416,0.374,0.379,0.397,0.088,0.285,0.331,0.407,0.389,0.394,0.382,0.136,0.122,0.3,0.326,0.19,0.364,0.354,0.351,0.344,0.364,0.343,0.275,0.126,0.086,0.176,0.366,0.324,0.326,0.418,0.37,0.384,0.404,0.088,0.054,0.391,0.232,0.284,0.35,0.372,0.12,0.404,0.366,0.408,0.405,0.412,0.396,0.406,0.395,0.396,0.414,0.416,0.329,Expression,VKOR1_HUMAN,Medium,Human
+0.32,0.326,0.343,0.358,0.376,0.376,0.08,0.314,0.408,0.416,0.374,0.379,0.397,0.088,0.285,0.331,0.407,0.389,0.394,0.382,0.136,0.122,0.3,0.326,0.19,0.364,0.354,0.351,0.344,0.364,0.343,0.275,0.126,0.086,0.176,0.366,0.324,0.326,0.418,0.37,0.384,0.404,0.088,0.054,0.391,0.232,0.284,0.35,0.372,0.12,0.404,0.366,0.408,0.405,0.412,0.396,0.406,0.395,0.396,0.414,0.416,0.329,Activity,VKOR1_HUMAN,Medium,Human
+-0.076,0.011,0.102,0.144,0.081,0.11,0.094,0.11,0.342,0.351,0.363,0.264,0.28,0.214,0.342,0.384,0.463,0.496,0.421,0.135,0.007,0.139,0.115,0.222,0.197,0.185,0.139,0.189,0.247,0.069,0.355,0.338,0.231,0.04,0.011,0.073,-0.084,-0.055,-0.08,0.028,0.028,-0.043,0.218,0.036,0.425,0.293,0.463,0.392,0.442,0.442,0.467,0.446,0.446,0.446,0.454,0.454,0.438,0.458,0.434,0.467,0.492,0.417,Stability,VRPI_BPT7,Medium,Virus
+0.243,0.511,0.432,0.455,0.434,0.432,-0.144,0.461,0.497,0.504,0.483,0.148,0.309,-0.091,0.106,0.432,0.521,0.544,0.546,0.523,-0.091,-0.167,-0.091,0.188,-0.099,-0.142,-0.186,0.017,0.394,0.449,0.55,0.524,0.004,-0.197,-0.051,0.396,0.273,0.267,0.417,0.404,0.396,0.485,-0.059,-0.212,0.068,-0.04,0.326,0.271,0.432,0.387,0.523,0.531,0.529,0.527,0.529,0.533,0.525,0.54,0.525,0.527,0.459,0.372,Stability,YAIA_ECOLI,Medium,Prokaryote
+0.338,0.26,0.356,0.359,0.347,0.358,0.24,0.25,0.055,0.067,0.263,0.208,0.218,0.342,0.361,0.369,0.381,0.3,0.246,0.149,0.129,0.134,0.117,0.114,0.23,0.171,0.201,0.157,0.107,0.245,0.353,0.378,0.093,0.23,0.146,0.151,0.309,0.259,0.273,0.342,0.312,0.337,0.38,-0.07,0.254,0.391,0.285,0.272,0.327,0.145,0.37,0.303,0.343,0.341,0.351,0.327,0.339,0.362,0.367,0.361,0.29,0.348,Binding,YAP1_HUMAN,Low,Human
+0.471,0.55,0.553,0.553,0.55,0.553,0.393,0.448,0.526,0.522,0.559,0.553,0.553,0.436,0.438,0.55,0.548,0.55,0.546,0.542,0.421,0.486,0.506,0.506,0.5,0.5,0.464,0.5,0.54,0.497,0.516,0.493,0.502,0.434,0.476,0.446,0.512,0.51,0.506,0.52,0.516,0.514,0.24,0.186,0.359,0.264,0.315,0.381,0.563,0.534,0.559,0.55,0.555,0.557,0.555,0.561,0.553,0.557,0.553,0.555,0.544,0.555,Stability,YNZC_BACSU,Medium,Prokaryote
+0.293,0.327,0.335,0.344,0.356,0.361,0.134,0.28,0.346,0.357,0.332,0.315,0.338,0.155,0.247,0.313,0.347,0.341,0.337,0.309,0.24,0.284,0.297,0.302,0.269,0.309,0.305,0.308,0.32,0.37,0.373,0.341,0.157,0.232,0.274,0.304,0.331,0.34,0.353,0.367,0.369,0.372,0.205,0.064,0.301,0.249,0.292,0.318,0.339,0.223,0.364,0.353,0.362,0.368,0.367,0.372,0.365,0.367,0.367,0.375,0.371,0.317,,,,
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.csv b/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.csv
new file mode 100644
index 0000000..ea1e7ee
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.csv
@@ -0,0 +1,63 @@
+Model_rank,Model_name,Model type,Average_MCC,Bootstrap_standard_error_MCC,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,SaProt (650M),Hybrid - Structure & PLM,0.358,0.0,0.354,0.29,0.388,0.283,0.476,0.312,0.353,0.435,0.388,0.412,0.395,0.243,0.334,0.207,0.164,0.187,0.263,SaProt (650M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+2,TranceptEVE L,Hybrid - Alignment & PLM,0.356,0.007,0.373,0.284,0.358,0.359,0.406,0.351,0.37,0.391,0.377,0.4,0.362,0.353,0.31,0.188,0.226,0.221,0.257,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+3,ProtSSN (ensemble),Hybrid - Structure & PLM,0.354,0.005,0.366,0.274,0.355,0.309,0.464,0.312,0.368,0.416,0.382,0.419,0.387,0.284,0.3,0.134,0.065,0.059,0.104,ProtSSN (ensemble of 9 models),"Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+4,TranceptEVE M,Hybrid - Alignment & PLM,0.353,0.008,0.368,0.289,0.347,0.357,0.403,0.34,0.37,0.386,0.377,0.399,0.355,0.344,0.313,0.188,0.2,0.199,0.252,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+5,TranceptEVE S,Hybrid - Alignment & PLM,0.352,0.008,0.361,0.3,0.346,0.353,0.402,0.352,0.361,0.387,0.377,0.392,0.355,0.337,0.307,0.191,0.207,0.202,0.25,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,GEMME,Alignment-based model,0.352,0.01,0.362,0.284,0.336,0.354,0.423,0.348,0.37,0.396,0.373,0.4,0.366,0.366,0.324,0.154,0.214,0.198,0.241,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+7,ProtSSN (k=20 h=1280),Hybrid - Structure & PLM,0.351,0.006,0.361,0.281,0.345,0.303,0.466,0.312,0.362,0.417,0.379,0.416,0.383,0.284,0.298,0.135,0.07,0.059,0.107,"ProtSSN (k=20, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+8,ProtSSN (k=30 h=1280),Hybrid - Structure & PLM,0.347,0.006,0.358,0.276,0.342,0.297,0.459,0.303,0.358,0.413,0.375,0.416,0.379,0.271,0.295,0.118,0.049,0.063,0.111,"ProtSSN (k=30, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+9,VESPA,Protein language model,0.346,0.007,0.369,0.269,0.325,0.348,0.422,0.331,0.367,0.401,0.365,0.399,0.381,0.346,0.308,0.125,0.113,0.067,0.107,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+10,ProtSSN (k=20 h=768),Hybrid - Structure & PLM,0.346,0.005,0.355,0.269,0.347,0.3,0.458,0.301,0.359,0.41,0.374,0.414,0.376,0.274,0.293,0.136,0.072,0.058,0.107,"ProtSSN (k=20, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+11,ProtSSN (k=20 h=512),Hybrid - Structure & PLM,0.345,0.005,0.357,0.266,0.339,0.305,0.459,0.302,0.363,0.408,0.369,0.416,0.383,0.287,0.294,0.132,0.083,0.072,0.11,"ProtSSN (k=20, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+12,ProtSSN (k=30 h=768),Hybrid - Structure & PLM,0.344,0.005,0.356,0.264,0.34,0.3,0.46,0.296,0.362,0.409,0.37,0.418,0.381,0.277,0.293,0.131,0.058,0.058,0.111,"ProtSSN (k=30, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+13,ProtSSN (k=10 h=1280),Hybrid - Structure & PLM,0.343,0.006,0.351,0.267,0.345,0.298,0.452,0.307,0.355,0.403,0.371,0.406,0.37,0.28,0.289,0.115,0.067,0.047,0.098,"ProtSSN (k=10, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+14,EVE (ensemble),Alignment-based model,0.342,0.009,0.357,0.283,0.319,0.35,0.401,0.325,0.362,0.384,0.365,0.381,0.365,0.334,0.291,0.184,0.189,0.203,0.237,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+15,ProtSSN (k=30 h=512),Hybrid - Structure & PLM,0.342,0.006,0.353,0.258,0.341,0.301,0.455,0.307,0.359,0.403,0.372,0.41,0.377,0.272,0.292,0.129,0.063,0.065,0.103,"ProtSSN (k=30, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+16,ProtSSN (k=10 h=512),Hybrid - Structure & PLM,0.341,0.006,0.356,0.262,0.332,0.296,0.46,0.305,0.353,0.412,0.37,0.416,0.38,0.266,0.291,0.137,0.062,0.067,0.114,"ProtSSN (k=10, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+17,Tranception L,Hybrid - Alignment & PLM,0.34,0.007,0.36,0.266,0.358,0.34,0.378,0.34,0.347,0.373,0.363,0.389,0.325,0.336,0.312,0.179,0.221,0.225,0.272,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+18,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.339,0.007,0.367,0.248,0.352,0.327,0.403,0.304,0.361,0.385,0.357,0.405,0.353,0.317,0.295,0.155,0.203,0.222,0.244,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+19,EVE (single),Alignment-based model,0.335,0.008,0.351,0.263,0.318,0.345,0.397,0.314,0.357,0.379,0.357,0.377,0.361,0.331,0.286,0.179,0.198,0.197,0.235,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+20,ProtSSN (k=10 h=768),Hybrid - Structure & PLM,0.332,0.005,0.344,0.257,0.326,0.287,0.445,0.288,0.348,0.394,0.36,0.396,0.365,0.268,0.278,0.109,0.059,0.054,0.093,"ProtSSN (k=10, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+21,ESM-IF1,Inverse folding model,0.331,0.009,0.283,0.305,0.319,0.246,0.5,0.23,0.343,0.428,0.334,0.391,0.381,0.305,0.334,0.22,0.167,0.211,0.281,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+22,Tranception M,Hybrid - Alignment & PLM,0.33,0.007,0.343,0.267,0.344,0.33,0.366,0.324,0.338,0.358,0.354,0.379,0.309,0.318,0.307,0.169,0.166,0.182,0.241,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+23,DeepSequence (ensemble),Alignment-based model,0.328,0.01,0.345,0.272,0.311,0.327,0.384,0.296,0.34,0.377,0.355,0.376,0.351,0.273,0.272,0.163,0.203,0.216,0.271,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+24,MSA Transformer (single),Hybrid - Alignment & PLM,0.328,0.007,0.351,0.233,0.338,0.321,0.395,0.292,0.351,0.375,0.349,0.392,0.34,0.312,0.287,0.146,0.208,0.214,0.229,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+25,ESM2 (650M),Protein language model,0.327,0.009,0.331,0.262,0.332,0.29,0.42,0.269,0.322,0.41,0.369,0.383,0.355,0.204,0.309,0.173,0.11,0.114,0.153,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+26,Tranception S,Hybrid - Alignment & PLM,0.322,0.008,0.33,0.279,0.326,0.32,0.354,0.332,0.321,0.348,0.345,0.364,0.3,0.305,0.293,0.171,0.168,0.175,0.248,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+27,ESM-1v (ensemble),Protein language model,0.32,0.008,0.323,0.241,0.345,0.305,0.387,0.255,0.32,0.404,0.368,0.367,0.321,0.236,0.292,0.145,0.104,0.103,0.159,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+28,ESM2 (3B),Protein language model,0.32,0.008,0.322,0.247,0.323,0.293,0.415,0.277,0.329,0.391,0.355,0.379,0.362,0.226,0.299,0.144,0.066,0.085,0.149,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+29,DeepSequence (single),Alignment-based model,0.318,0.01,0.338,0.26,0.292,0.314,0.383,0.294,0.33,0.37,0.348,0.372,0.344,0.258,0.265,0.161,0.177,0.218,0.279,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+30,SaProt (35M),Hybrid - Structure & PLM,0.316,0.006,0.283,0.285,0.337,0.219,0.453,0.253,0.306,0.393,0.356,0.378,0.32,0.182,0.307,0.225,0.111,0.13,0.186,SaProt (35M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+31,ESM2 (15B),Protein language model,0.314,0.008,0.311,0.229,0.333,0.301,0.398,0.276,0.328,0.38,0.346,0.377,0.341,0.252,0.285,0.12,0.076,0.092,0.166,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+32,VESPAl,Protein language model,0.312,0.009,0.331,0.255,0.26,0.321,0.395,0.292,0.334,0.373,0.327,0.375,0.353,0.316,0.279,0.107,0.096,0.074,0.105,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+33,ESM-1b,Protein language model,0.311,0.008,0.328,0.221,0.32,0.274,0.41,0.275,0.317,0.387,0.352,0.388,0.343,0.203,0.256,0.127,0.021,0.046,0.124,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+34,MIF-ST,Hybrid - Structure & PLM,0.31,0.008,0.299,0.248,0.348,0.285,0.368,0.293,0.311,0.349,0.314,0.315,0.344,0.31,0.332,0.164,0.233,0.223,0.267,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+35,Progen2 XL,Protein language model,0.306,0.007,0.311,0.225,0.329,0.302,0.365,0.267,0.325,0.357,0.311,0.354,0.334,0.314,0.283,0.141,0.181,0.163,0.223,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+36,EVmutation,Alignment-based model,0.305,0.008,0.342,0.218,0.297,0.322,0.342,0.295,0.339,0.32,0.324,0.343,0.331,0.3,0.242,0.17,0.172,0.181,0.242,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+37,ESM2 (150M),Protein language model,0.302,0.009,0.296,0.253,0.315,0.237,0.41,0.245,0.282,0.392,0.36,0.38,0.297,0.124,0.287,0.167,0.078,0.109,0.148,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+38,Progen2 M,Protein language model,0.299,0.007,0.3,0.228,0.342,0.3,0.324,0.245,0.307,0.341,0.329,0.325,0.285,0.262,0.281,0.099,0.078,0.09,0.127,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+39,Progen2 Base,Protein language model,0.299,0.008,0.306,0.231,0.353,0.295,0.308,0.263,0.292,0.339,0.339,0.332,0.255,0.242,0.283,0.098,0.072,0.1,0.133,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+40,Progen2 L,Protein language model,0.298,0.007,0.307,0.22,0.341,0.296,0.327,0.264,0.306,0.335,0.326,0.342,0.291,0.249,0.279,0.115,0.144,0.146,0.195,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+41,ESM-1v (single),Protein language model,0.296,0.009,0.304,0.21,0.322,0.287,0.357,0.225,0.295,0.385,0.344,0.339,0.303,0.21,0.274,0.126,0.114,0.105,0.152,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+42,Tranception L no retrieval,Protein language model,0.296,0.008,0.309,0.229,0.325,0.301,0.315,0.278,0.297,0.335,0.313,0.315,0.283,0.31,0.27,0.126,0.182,0.191,0.247,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+43,Wavenet,Alignment-based model,0.294,0.011,0.298,0.243,0.279,0.282,0.369,0.225,0.314,0.36,0.317,0.335,0.324,0.262,0.257,0.154,0.148,0.149,0.139,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+44,MIF,Inverse folding model,0.294,0.01,0.243,0.264,0.335,0.23,0.395,0.27,0.292,0.331,0.308,0.286,0.313,0.284,0.315,0.177,0.198,0.191,0.231,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+45,RITA XL,Protein language model,0.293,0.008,0.284,0.225,0.33,0.299,0.328,0.238,0.307,0.334,0.316,0.315,0.275,0.307,0.272,0.115,0.094,0.106,0.166,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+46,RITA L,Protein language model,0.291,0.008,0.283,0.222,0.338,0.293,0.317,0.245,0.299,0.325,0.322,0.317,0.247,0.299,0.268,0.116,0.071,0.087,0.15,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+47,Site-Independent,Alignment-based model,0.286,0.011,0.291,0.264,0.274,0.299,0.301,0.333,0.3,0.263,0.31,0.311,0.256,0.29,0.248,0.177,0.148,0.172,0.24,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+48,CARP (640M),Protein language model,0.286,0.008,0.304,0.202,0.322,0.288,0.313,0.248,0.291,0.33,0.326,0.304,0.286,0.225,0.303,0.127,0.103,0.106,0.129,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+49,RITA M,Protein language model,0.274,0.009,0.273,0.204,0.314,0.292,0.286,0.232,0.278,0.315,0.301,0.294,0.235,0.297,0.26,0.091,0.057,0.09,0.155,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+50,Unirep evotuned,Hybrid - Alignment & PLM,0.274,0.009,0.274,0.226,0.288,0.268,0.311,0.254,0.276,0.307,0.289,0.302,0.27,0.266,0.244,0.141,0.185,0.184,0.228,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+51,Tranception M no retrieval,Protein language model,0.269,0.008,0.266,0.213,0.311,0.28,0.274,0.221,0.277,0.296,0.297,0.278,0.238,0.259,0.254,0.103,0.083,0.101,0.127,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+52,Progen2 S,Protein language model,0.261,0.009,0.257,0.204,0.295,0.261,0.285,0.219,0.257,0.309,0.306,0.28,0.226,0.214,0.251,0.091,0.071,0.087,0.103,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+53,CARP (76M),Protein language model,0.251,0.01,0.256,0.212,0.29,0.215,0.282,0.195,0.231,0.311,0.304,0.275,0.217,0.125,0.257,0.135,0.049,0.068,0.09,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+54,ESM2 (35M),Protein language model,0.249,0.011,0.238,0.234,0.262,0.169,0.344,0.191,0.213,0.349,0.296,0.316,0.23,0.099,0.23,0.155,0.074,0.095,0.151,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+55,RITA S,Protein language model,0.236,0.01,0.231,0.204,0.262,0.249,0.232,0.204,0.237,0.261,0.264,0.228,0.181,0.265,0.217,0.093,0.073,0.083,0.12,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+56,Tranception S no retrieval,Protein language model,0.231,0.01,0.216,0.21,0.264,0.247,0.218,0.189,0.233,0.251,0.251,0.216,0.2,0.232,0.212,0.093,0.074,0.089,0.12,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+57,CARP (38M),Protein language model,0.213,0.011,0.213,0.21,0.237,0.169,0.235,0.156,0.186,0.266,0.249,0.233,0.18,0.101,0.212,0.116,0.05,0.074,0.107,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+58,ProteinMPNN,Inverse folding model,0.195,0.009,0.143,0.126,0.141,0.121,0.445,0.137,0.216,0.333,0.223,0.307,0.262,0.196,0.206,0.176,0.119,0.127,0.224,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+59,ESM2 (8M),Protein language model,0.171,0.012,0.147,0.201,0.192,0.104,0.209,0.149,0.141,0.199,0.19,0.193,0.139,0.075,0.148,0.099,0.063,0.097,0.13,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+60,ProtGPT2,Protein language model,0.151,0.009,0.135,0.118,0.155,0.136,0.21,0.144,0.143,0.203,0.198,0.193,0.103,0.118,0.143,0.102,0.027,0.014,0.046,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+61,Unirep,Protein language model,0.146,0.012,0.134,0.156,0.166,0.103,0.171,0.139,0.128,0.157,0.174,0.173,0.11,0.051,0.132,0.064,0.074,0.093,0.119,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+62,CARP (600K),Protein language model,0.076,0.013,0.076,0.057,0.126,0.046,0.074,0.069,0.067,0.067,0.09,0.054,0.047,0.051,0.077,0.016,0.019,0.062,0.069,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.html b/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.html
new file mode 100644
index 0000000..96f9f7f
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/MCC/Summary_performance_DMS_substitutions_MCC.html
@@ -0,0 +1,1670 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_MCC |
+ Bootstrap_standard_error_MCC |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Depth_1 |
+ Depth_2 |
+ Depth_3 |
+ Depth_4 |
+ Depth_5+ |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ SaProt (650M) |
+ Hybrid - Structure & PLM |
+ 0.358 |
+ 0.000 |
+ 0.354 |
+ 0.290 |
+ 0.388 |
+ 0.283 |
+ 0.476 |
+ 0.312 |
+ 0.353 |
+ 0.435 |
+ 0.388 |
+ 0.412 |
+ 0.395 |
+ 0.243 |
+ 0.334 |
+ 0.207 |
+ 0.164 |
+ 0.187 |
+ 0.263 |
+ SaProt (650M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 2 |
+ TranceptEVE L |
+ Hybrid - Alignment & PLM |
+ 0.356 |
+ 0.007 |
+ 0.373 |
+ 0.284 |
+ 0.358 |
+ 0.359 |
+ 0.406 |
+ 0.351 |
+ 0.370 |
+ 0.391 |
+ 0.377 |
+ 0.400 |
+ 0.362 |
+ 0.353 |
+ 0.310 |
+ 0.188 |
+ 0.226 |
+ 0.221 |
+ 0.257 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 3 |
+ ProtSSN (ensemble) |
+ Hybrid - Structure & PLM |
+ 0.354 |
+ 0.005 |
+ 0.366 |
+ 0.274 |
+ 0.355 |
+ 0.309 |
+ 0.464 |
+ 0.312 |
+ 0.368 |
+ 0.416 |
+ 0.382 |
+ 0.419 |
+ 0.387 |
+ 0.284 |
+ 0.300 |
+ 0.134 |
+ 0.065 |
+ 0.059 |
+ 0.104 |
+ ProtSSN (ensemble of 9 models) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 4 |
+ TranceptEVE M |
+ Hybrid - Alignment & PLM |
+ 0.353 |
+ 0.008 |
+ 0.368 |
+ 0.289 |
+ 0.347 |
+ 0.357 |
+ 0.403 |
+ 0.340 |
+ 0.370 |
+ 0.386 |
+ 0.377 |
+ 0.399 |
+ 0.355 |
+ 0.344 |
+ 0.313 |
+ 0.188 |
+ 0.200 |
+ 0.199 |
+ 0.252 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 5 |
+ TranceptEVE S |
+ Hybrid - Alignment & PLM |
+ 0.352 |
+ 0.008 |
+ 0.361 |
+ 0.300 |
+ 0.346 |
+ 0.353 |
+ 0.402 |
+ 0.352 |
+ 0.361 |
+ 0.387 |
+ 0.377 |
+ 0.392 |
+ 0.355 |
+ 0.337 |
+ 0.307 |
+ 0.191 |
+ 0.207 |
+ 0.202 |
+ 0.250 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 6 |
+ GEMME |
+ Alignment-based model |
+ 0.352 |
+ 0.010 |
+ 0.362 |
+ 0.284 |
+ 0.336 |
+ 0.354 |
+ 0.423 |
+ 0.348 |
+ 0.370 |
+ 0.396 |
+ 0.373 |
+ 0.400 |
+ 0.366 |
+ 0.366 |
+ 0.324 |
+ 0.154 |
+ 0.214 |
+ 0.198 |
+ 0.241 |
+ GEMME model |
+ <a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a> |
+
+
+ 7 |
+ ProtSSN (k=20 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.351 |
+ 0.006 |
+ 0.361 |
+ 0.281 |
+ 0.345 |
+ 0.303 |
+ 0.466 |
+ 0.312 |
+ 0.362 |
+ 0.417 |
+ 0.379 |
+ 0.416 |
+ 0.383 |
+ 0.284 |
+ 0.298 |
+ 0.135 |
+ 0.070 |
+ 0.059 |
+ 0.107 |
+ ProtSSN (k=20, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 8 |
+ ProtSSN (k=30 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.347 |
+ 0.006 |
+ 0.358 |
+ 0.276 |
+ 0.342 |
+ 0.297 |
+ 0.459 |
+ 0.303 |
+ 0.358 |
+ 0.413 |
+ 0.375 |
+ 0.416 |
+ 0.379 |
+ 0.271 |
+ 0.295 |
+ 0.118 |
+ 0.049 |
+ 0.063 |
+ 0.111 |
+ ProtSSN (k=30, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 9 |
+ VESPA |
+ Protein language model |
+ 0.346 |
+ 0.007 |
+ 0.369 |
+ 0.269 |
+ 0.325 |
+ 0.348 |
+ 0.422 |
+ 0.331 |
+ 0.367 |
+ 0.401 |
+ 0.365 |
+ 0.399 |
+ 0.381 |
+ 0.346 |
+ 0.308 |
+ 0.125 |
+ 0.113 |
+ 0.067 |
+ 0.107 |
+ VESPA model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 10 |
+ ProtSSN (k=20 h=768) |
+ Hybrid - Structure & PLM |
+ 0.346 |
+ 0.005 |
+ 0.355 |
+ 0.269 |
+ 0.347 |
+ 0.300 |
+ 0.458 |
+ 0.301 |
+ 0.359 |
+ 0.410 |
+ 0.374 |
+ 0.414 |
+ 0.376 |
+ 0.274 |
+ 0.293 |
+ 0.136 |
+ 0.072 |
+ 0.058 |
+ 0.107 |
+ ProtSSN (k=20, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 11 |
+ ProtSSN (k=20 h=512) |
+ Hybrid - Structure & PLM |
+ 0.345 |
+ 0.005 |
+ 0.357 |
+ 0.266 |
+ 0.339 |
+ 0.305 |
+ 0.459 |
+ 0.302 |
+ 0.363 |
+ 0.408 |
+ 0.369 |
+ 0.416 |
+ 0.383 |
+ 0.287 |
+ 0.294 |
+ 0.132 |
+ 0.083 |
+ 0.072 |
+ 0.110 |
+ ProtSSN (k=20, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 12 |
+ ProtSSN (k=30 h=768) |
+ Hybrid - Structure & PLM |
+ 0.344 |
+ 0.005 |
+ 0.356 |
+ 0.264 |
+ 0.340 |
+ 0.300 |
+ 0.460 |
+ 0.296 |
+ 0.362 |
+ 0.409 |
+ 0.370 |
+ 0.418 |
+ 0.381 |
+ 0.277 |
+ 0.293 |
+ 0.131 |
+ 0.058 |
+ 0.058 |
+ 0.111 |
+ ProtSSN (k=30, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 13 |
+ ProtSSN (k=10 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.343 |
+ 0.006 |
+ 0.351 |
+ 0.267 |
+ 0.345 |
+ 0.298 |
+ 0.452 |
+ 0.307 |
+ 0.355 |
+ 0.403 |
+ 0.371 |
+ 0.406 |
+ 0.370 |
+ 0.280 |
+ 0.289 |
+ 0.115 |
+ 0.067 |
+ 0.047 |
+ 0.098 |
+ ProtSSN (k=10, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 14 |
+ EVE (ensemble) |
+ Alignment-based model |
+ 0.342 |
+ 0.009 |
+ 0.357 |
+ 0.283 |
+ 0.319 |
+ 0.350 |
+ 0.401 |
+ 0.325 |
+ 0.362 |
+ 0.384 |
+ 0.365 |
+ 0.381 |
+ 0.365 |
+ 0.334 |
+ 0.291 |
+ 0.184 |
+ 0.189 |
+ 0.203 |
+ 0.237 |
+ EVE model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 15 |
+ ProtSSN (k=30 h=512) |
+ Hybrid - Structure & PLM |
+ 0.342 |
+ 0.006 |
+ 0.353 |
+ 0.258 |
+ 0.341 |
+ 0.301 |
+ 0.455 |
+ 0.307 |
+ 0.359 |
+ 0.403 |
+ 0.372 |
+ 0.410 |
+ 0.377 |
+ 0.272 |
+ 0.292 |
+ 0.129 |
+ 0.063 |
+ 0.065 |
+ 0.103 |
+ ProtSSN (k=30, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 16 |
+ ProtSSN (k=10 h=512) |
+ Hybrid - Structure & PLM |
+ 0.341 |
+ 0.006 |
+ 0.356 |
+ 0.262 |
+ 0.332 |
+ 0.296 |
+ 0.460 |
+ 0.305 |
+ 0.353 |
+ 0.412 |
+ 0.370 |
+ 0.416 |
+ 0.380 |
+ 0.266 |
+ 0.291 |
+ 0.137 |
+ 0.062 |
+ 0.067 |
+ 0.114 |
+ ProtSSN (k=10, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 17 |
+ Tranception L |
+ Hybrid - Alignment & PLM |
+ 0.340 |
+ 0.007 |
+ 0.360 |
+ 0.266 |
+ 0.358 |
+ 0.340 |
+ 0.378 |
+ 0.340 |
+ 0.347 |
+ 0.373 |
+ 0.363 |
+ 0.389 |
+ 0.325 |
+ 0.336 |
+ 0.312 |
+ 0.179 |
+ 0.221 |
+ 0.225 |
+ 0.272 |
+ Tranception Large model (700M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 18 |
+ MSA Transformer (ensemble) |
+ Hybrid - Alignment & PLM |
+ 0.339 |
+ 0.007 |
+ 0.367 |
+ 0.248 |
+ 0.352 |
+ 0.327 |
+ 0.403 |
+ 0.304 |
+ 0.361 |
+ 0.385 |
+ 0.357 |
+ 0.405 |
+ 0.353 |
+ 0.317 |
+ 0.295 |
+ 0.155 |
+ 0.203 |
+ 0.222 |
+ 0.244 |
+ MSA Transformer (ensemble of 5 MSA samples) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 19 |
+ EVE (single) |
+ Alignment-based model |
+ 0.335 |
+ 0.008 |
+ 0.351 |
+ 0.263 |
+ 0.318 |
+ 0.345 |
+ 0.397 |
+ 0.314 |
+ 0.357 |
+ 0.379 |
+ 0.357 |
+ 0.377 |
+ 0.361 |
+ 0.331 |
+ 0.286 |
+ 0.179 |
+ 0.198 |
+ 0.197 |
+ 0.235 |
+ EVE model (single seed) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 20 |
+ ProtSSN (k=10 h=768) |
+ Hybrid - Structure & PLM |
+ 0.332 |
+ 0.005 |
+ 0.344 |
+ 0.257 |
+ 0.326 |
+ 0.287 |
+ 0.445 |
+ 0.288 |
+ 0.348 |
+ 0.394 |
+ 0.360 |
+ 0.396 |
+ 0.365 |
+ 0.268 |
+ 0.278 |
+ 0.109 |
+ 0.059 |
+ 0.054 |
+ 0.093 |
+ ProtSSN (k=10, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 21 |
+ ESM-IF1 |
+ Inverse folding model |
+ 0.331 |
+ 0.009 |
+ 0.283 |
+ 0.305 |
+ 0.319 |
+ 0.246 |
+ 0.500 |
+ 0.230 |
+ 0.343 |
+ 0.428 |
+ 0.334 |
+ 0.391 |
+ 0.381 |
+ 0.305 |
+ 0.334 |
+ 0.220 |
+ 0.167 |
+ 0.211 |
+ 0.281 |
+ ESM-IF1 model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a> |
+
+
+ 22 |
+ Tranception M |
+ Hybrid - Alignment & PLM |
+ 0.330 |
+ 0.007 |
+ 0.343 |
+ 0.267 |
+ 0.344 |
+ 0.330 |
+ 0.366 |
+ 0.324 |
+ 0.338 |
+ 0.358 |
+ 0.354 |
+ 0.379 |
+ 0.309 |
+ 0.318 |
+ 0.307 |
+ 0.169 |
+ 0.166 |
+ 0.182 |
+ 0.241 |
+ Tranception Medium model (300M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 23 |
+ DeepSequence (ensemble) |
+ Alignment-based model |
+ 0.328 |
+ 0.010 |
+ 0.345 |
+ 0.272 |
+ 0.311 |
+ 0.327 |
+ 0.384 |
+ 0.296 |
+ 0.340 |
+ 0.377 |
+ 0.355 |
+ 0.376 |
+ 0.351 |
+ 0.273 |
+ 0.272 |
+ 0.163 |
+ 0.203 |
+ 0.216 |
+ 0.271 |
+ DeepSequence model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 24 |
+ MSA Transformer (single) |
+ Hybrid - Alignment & PLM |
+ 0.328 |
+ 0.007 |
+ 0.351 |
+ 0.233 |
+ 0.338 |
+ 0.321 |
+ 0.395 |
+ 0.292 |
+ 0.351 |
+ 0.375 |
+ 0.349 |
+ 0.392 |
+ 0.340 |
+ 0.312 |
+ 0.287 |
+ 0.146 |
+ 0.208 |
+ 0.214 |
+ 0.229 |
+ MSA Transformer (single MSA sample) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 25 |
+ ESM2 (650M) |
+ Protein language model |
+ 0.327 |
+ 0.009 |
+ 0.331 |
+ 0.262 |
+ 0.332 |
+ 0.290 |
+ 0.420 |
+ 0.269 |
+ 0.322 |
+ 0.410 |
+ 0.369 |
+ 0.383 |
+ 0.355 |
+ 0.204 |
+ 0.309 |
+ 0.173 |
+ 0.110 |
+ 0.114 |
+ 0.153 |
+ ESM2 model (650M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 26 |
+ Tranception S |
+ Hybrid - Alignment & PLM |
+ 0.322 |
+ 0.008 |
+ 0.330 |
+ 0.279 |
+ 0.326 |
+ 0.320 |
+ 0.354 |
+ 0.332 |
+ 0.321 |
+ 0.348 |
+ 0.345 |
+ 0.364 |
+ 0.300 |
+ 0.305 |
+ 0.293 |
+ 0.171 |
+ 0.168 |
+ 0.175 |
+ 0.248 |
+ Tranception Small model (85M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 27 |
+ ESM-1v (ensemble) |
+ Protein language model |
+ 0.320 |
+ 0.008 |
+ 0.323 |
+ 0.241 |
+ 0.345 |
+ 0.305 |
+ 0.387 |
+ 0.255 |
+ 0.320 |
+ 0.404 |
+ 0.368 |
+ 0.367 |
+ 0.321 |
+ 0.236 |
+ 0.292 |
+ 0.145 |
+ 0.104 |
+ 0.103 |
+ 0.159 |
+ ESM-1v (ensemble of 5 independently-trained models) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 28 |
+ ESM2 (3B) |
+ Protein language model |
+ 0.320 |
+ 0.008 |
+ 0.322 |
+ 0.247 |
+ 0.323 |
+ 0.293 |
+ 0.415 |
+ 0.277 |
+ 0.329 |
+ 0.391 |
+ 0.355 |
+ 0.379 |
+ 0.362 |
+ 0.226 |
+ 0.299 |
+ 0.144 |
+ 0.066 |
+ 0.085 |
+ 0.149 |
+ ESM2 model (3B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 29 |
+ DeepSequence (single) |
+ Alignment-based model |
+ 0.318 |
+ 0.010 |
+ 0.338 |
+ 0.260 |
+ 0.292 |
+ 0.314 |
+ 0.383 |
+ 0.294 |
+ 0.330 |
+ 0.370 |
+ 0.348 |
+ 0.372 |
+ 0.344 |
+ 0.258 |
+ 0.265 |
+ 0.161 |
+ 0.177 |
+ 0.218 |
+ 0.279 |
+ DeepSequence model (single seed) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 30 |
+ SaProt (35M) |
+ Hybrid - Structure & PLM |
+ 0.316 |
+ 0.006 |
+ 0.283 |
+ 0.285 |
+ 0.337 |
+ 0.219 |
+ 0.453 |
+ 0.253 |
+ 0.306 |
+ 0.393 |
+ 0.356 |
+ 0.378 |
+ 0.320 |
+ 0.182 |
+ 0.307 |
+ 0.225 |
+ 0.111 |
+ 0.130 |
+ 0.186 |
+ SaProt (35M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 31 |
+ ESM2 (15B) |
+ Protein language model |
+ 0.314 |
+ 0.008 |
+ 0.311 |
+ 0.229 |
+ 0.333 |
+ 0.301 |
+ 0.398 |
+ 0.276 |
+ 0.328 |
+ 0.380 |
+ 0.346 |
+ 0.377 |
+ 0.341 |
+ 0.252 |
+ 0.285 |
+ 0.120 |
+ 0.076 |
+ 0.092 |
+ 0.166 |
+ ESM2 model (15B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 32 |
+ VESPAl |
+ Protein language model |
+ 0.312 |
+ 0.009 |
+ 0.331 |
+ 0.255 |
+ 0.260 |
+ 0.321 |
+ 0.395 |
+ 0.292 |
+ 0.334 |
+ 0.373 |
+ 0.327 |
+ 0.375 |
+ 0.353 |
+ 0.316 |
+ 0.279 |
+ 0.107 |
+ 0.096 |
+ 0.074 |
+ 0.105 |
+ VESPAl model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 33 |
+ ESM-1b |
+ Protein language model |
+ 0.311 |
+ 0.008 |
+ 0.328 |
+ 0.221 |
+ 0.320 |
+ 0.274 |
+ 0.410 |
+ 0.275 |
+ 0.317 |
+ 0.387 |
+ 0.352 |
+ 0.388 |
+ 0.343 |
+ 0.203 |
+ 0.256 |
+ 0.127 |
+ 0.021 |
+ 0.046 |
+ 0.124 |
+ ESM-1b (w/ Brandes et al. extensions) |
+ [1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a> |
+
+
+ 34 |
+ MIF-ST |
+ Hybrid - Structure & PLM |
+ 0.310 |
+ 0.008 |
+ 0.299 |
+ 0.248 |
+ 0.348 |
+ 0.285 |
+ 0.368 |
+ 0.293 |
+ 0.311 |
+ 0.349 |
+ 0.314 |
+ 0.315 |
+ 0.344 |
+ 0.310 |
+ 0.332 |
+ 0.164 |
+ 0.233 |
+ 0.223 |
+ 0.267 |
+ MIF-ST model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 35 |
+ Progen2 XL |
+ Protein language model |
+ 0.306 |
+ 0.007 |
+ 0.311 |
+ 0.225 |
+ 0.329 |
+ 0.302 |
+ 0.365 |
+ 0.267 |
+ 0.325 |
+ 0.357 |
+ 0.311 |
+ 0.354 |
+ 0.334 |
+ 0.314 |
+ 0.283 |
+ 0.141 |
+ 0.181 |
+ 0.163 |
+ 0.223 |
+ Progen2 xlarge model (6.4B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 36 |
+ EVmutation |
+ Alignment-based model |
+ 0.305 |
+ 0.008 |
+ 0.342 |
+ 0.218 |
+ 0.297 |
+ 0.322 |
+ 0.342 |
+ 0.295 |
+ 0.339 |
+ 0.320 |
+ 0.324 |
+ 0.343 |
+ 0.331 |
+ 0.300 |
+ 0.242 |
+ 0.170 |
+ 0.172 |
+ 0.181 |
+ 0.242 |
+ EVmutation model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 37 |
+ ESM2 (150M) |
+ Protein language model |
+ 0.302 |
+ 0.009 |
+ 0.296 |
+ 0.253 |
+ 0.315 |
+ 0.237 |
+ 0.410 |
+ 0.245 |
+ 0.282 |
+ 0.392 |
+ 0.360 |
+ 0.380 |
+ 0.297 |
+ 0.124 |
+ 0.287 |
+ 0.167 |
+ 0.078 |
+ 0.109 |
+ 0.148 |
+ ESM2 model (150M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 38 |
+ Progen2 M |
+ Protein language model |
+ 0.299 |
+ 0.007 |
+ 0.300 |
+ 0.228 |
+ 0.342 |
+ 0.300 |
+ 0.324 |
+ 0.245 |
+ 0.307 |
+ 0.341 |
+ 0.329 |
+ 0.325 |
+ 0.285 |
+ 0.262 |
+ 0.281 |
+ 0.099 |
+ 0.078 |
+ 0.090 |
+ 0.127 |
+ Progen2 medium model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 39 |
+ Progen2 Base |
+ Protein language model |
+ 0.299 |
+ 0.008 |
+ 0.306 |
+ 0.231 |
+ 0.353 |
+ 0.295 |
+ 0.308 |
+ 0.263 |
+ 0.292 |
+ 0.339 |
+ 0.339 |
+ 0.332 |
+ 0.255 |
+ 0.242 |
+ 0.283 |
+ 0.098 |
+ 0.072 |
+ 0.100 |
+ 0.133 |
+ Progen2 base model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 40 |
+ Progen2 L |
+ Protein language model |
+ 0.298 |
+ 0.007 |
+ 0.307 |
+ 0.220 |
+ 0.341 |
+ 0.296 |
+ 0.327 |
+ 0.264 |
+ 0.306 |
+ 0.335 |
+ 0.326 |
+ 0.342 |
+ 0.291 |
+ 0.249 |
+ 0.279 |
+ 0.115 |
+ 0.144 |
+ 0.146 |
+ 0.195 |
+ Progen2 large model (2.7B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 41 |
+ ESM-1v (single) |
+ Protein language model |
+ 0.296 |
+ 0.009 |
+ 0.304 |
+ 0.210 |
+ 0.322 |
+ 0.287 |
+ 0.357 |
+ 0.225 |
+ 0.295 |
+ 0.385 |
+ 0.344 |
+ 0.339 |
+ 0.303 |
+ 0.210 |
+ 0.274 |
+ 0.126 |
+ 0.114 |
+ 0.105 |
+ 0.152 |
+ ESM-1v (single seed) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 42 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.296 |
+ 0.008 |
+ 0.309 |
+ 0.229 |
+ 0.325 |
+ 0.301 |
+ 0.315 |
+ 0.278 |
+ 0.297 |
+ 0.335 |
+ 0.313 |
+ 0.315 |
+ 0.283 |
+ 0.310 |
+ 0.270 |
+ 0.126 |
+ 0.182 |
+ 0.191 |
+ 0.247 |
+ Tranception Large model (700M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 43 |
+ Wavenet |
+ Alignment-based model |
+ 0.294 |
+ 0.011 |
+ 0.298 |
+ 0.243 |
+ 0.279 |
+ 0.282 |
+ 0.369 |
+ 0.225 |
+ 0.314 |
+ 0.360 |
+ 0.317 |
+ 0.335 |
+ 0.324 |
+ 0.262 |
+ 0.257 |
+ 0.154 |
+ 0.148 |
+ 0.149 |
+ 0.139 |
+ Wavenet model |
+ <a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a> |
+
+
+ 44 |
+ MIF |
+ Inverse folding model |
+ 0.294 |
+ 0.010 |
+ 0.243 |
+ 0.264 |
+ 0.335 |
+ 0.230 |
+ 0.395 |
+ 0.270 |
+ 0.292 |
+ 0.331 |
+ 0.308 |
+ 0.286 |
+ 0.313 |
+ 0.284 |
+ 0.315 |
+ 0.177 |
+ 0.198 |
+ 0.191 |
+ 0.231 |
+ MIF model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 45 |
+ RITA XL |
+ Protein language model |
+ 0.293 |
+ 0.008 |
+ 0.284 |
+ 0.225 |
+ 0.330 |
+ 0.299 |
+ 0.328 |
+ 0.238 |
+ 0.307 |
+ 0.334 |
+ 0.316 |
+ 0.315 |
+ 0.275 |
+ 0.307 |
+ 0.272 |
+ 0.115 |
+ 0.094 |
+ 0.106 |
+ 0.166 |
+ RITA xlarge model (1.2B params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 46 |
+ RITA L |
+ Protein language model |
+ 0.291 |
+ 0.008 |
+ 0.283 |
+ 0.222 |
+ 0.338 |
+ 0.293 |
+ 0.317 |
+ 0.245 |
+ 0.299 |
+ 0.325 |
+ 0.322 |
+ 0.317 |
+ 0.247 |
+ 0.299 |
+ 0.268 |
+ 0.116 |
+ 0.071 |
+ 0.087 |
+ 0.150 |
+ RITA large model (680M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 47 |
+ Site-Independent |
+ Alignment-based model |
+ 0.286 |
+ 0.011 |
+ 0.291 |
+ 0.264 |
+ 0.274 |
+ 0.299 |
+ 0.301 |
+ 0.333 |
+ 0.300 |
+ 0.263 |
+ 0.310 |
+ 0.311 |
+ 0.256 |
+ 0.290 |
+ 0.248 |
+ 0.177 |
+ 0.148 |
+ 0.172 |
+ 0.240 |
+ Site-Independent model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 48 |
+ CARP (640M) |
+ Protein language model |
+ 0.286 |
+ 0.008 |
+ 0.304 |
+ 0.202 |
+ 0.322 |
+ 0.288 |
+ 0.313 |
+ 0.248 |
+ 0.291 |
+ 0.330 |
+ 0.326 |
+ 0.304 |
+ 0.286 |
+ 0.225 |
+ 0.303 |
+ 0.127 |
+ 0.103 |
+ 0.106 |
+ 0.129 |
+ CARP model (640M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 49 |
+ RITA M |
+ Protein language model |
+ 0.274 |
+ 0.009 |
+ 0.273 |
+ 0.204 |
+ 0.314 |
+ 0.292 |
+ 0.286 |
+ 0.232 |
+ 0.278 |
+ 0.315 |
+ 0.301 |
+ 0.294 |
+ 0.235 |
+ 0.297 |
+ 0.260 |
+ 0.091 |
+ 0.057 |
+ 0.090 |
+ 0.155 |
+ RITA medium model (300M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 50 |
+ Unirep evotuned |
+ Hybrid - Alignment & PLM |
+ 0.274 |
+ 0.009 |
+ 0.274 |
+ 0.226 |
+ 0.288 |
+ 0.268 |
+ 0.311 |
+ 0.254 |
+ 0.276 |
+ 0.307 |
+ 0.289 |
+ 0.302 |
+ 0.270 |
+ 0.266 |
+ 0.244 |
+ 0.141 |
+ 0.185 |
+ 0.184 |
+ 0.228 |
+ Unirep model w/ evotuning |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 51 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.269 |
+ 0.008 |
+ 0.266 |
+ 0.213 |
+ 0.311 |
+ 0.280 |
+ 0.274 |
+ 0.221 |
+ 0.277 |
+ 0.296 |
+ 0.297 |
+ 0.278 |
+ 0.238 |
+ 0.259 |
+ 0.254 |
+ 0.103 |
+ 0.083 |
+ 0.101 |
+ 0.127 |
+ Tranception Medium model (300M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 52 |
+ Progen2 S |
+ Protein language model |
+ 0.261 |
+ 0.009 |
+ 0.257 |
+ 0.204 |
+ 0.295 |
+ 0.261 |
+ 0.285 |
+ 0.219 |
+ 0.257 |
+ 0.309 |
+ 0.306 |
+ 0.280 |
+ 0.226 |
+ 0.214 |
+ 0.251 |
+ 0.091 |
+ 0.071 |
+ 0.087 |
+ 0.103 |
+ Progen2 small model (150M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 53 |
+ CARP (76M) |
+ Protein language model |
+ 0.251 |
+ 0.010 |
+ 0.256 |
+ 0.212 |
+ 0.290 |
+ 0.215 |
+ 0.282 |
+ 0.195 |
+ 0.231 |
+ 0.311 |
+ 0.304 |
+ 0.275 |
+ 0.217 |
+ 0.125 |
+ 0.257 |
+ 0.135 |
+ 0.049 |
+ 0.068 |
+ 0.090 |
+ CARP model (76M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 54 |
+ ESM2 (35M) |
+ Protein language model |
+ 0.249 |
+ 0.011 |
+ 0.238 |
+ 0.234 |
+ 0.262 |
+ 0.169 |
+ 0.344 |
+ 0.191 |
+ 0.213 |
+ 0.349 |
+ 0.296 |
+ 0.316 |
+ 0.230 |
+ 0.099 |
+ 0.230 |
+ 0.155 |
+ 0.074 |
+ 0.095 |
+ 0.151 |
+ ESM2 model (35M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 55 |
+ RITA S |
+ Protein language model |
+ 0.236 |
+ 0.010 |
+ 0.231 |
+ 0.204 |
+ 0.262 |
+ 0.249 |
+ 0.232 |
+ 0.204 |
+ 0.237 |
+ 0.261 |
+ 0.264 |
+ 0.228 |
+ 0.181 |
+ 0.265 |
+ 0.217 |
+ 0.093 |
+ 0.073 |
+ 0.083 |
+ 0.120 |
+ RITA small model (85M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 56 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.231 |
+ 0.010 |
+ 0.216 |
+ 0.210 |
+ 0.264 |
+ 0.247 |
+ 0.218 |
+ 0.189 |
+ 0.233 |
+ 0.251 |
+ 0.251 |
+ 0.216 |
+ 0.200 |
+ 0.232 |
+ 0.212 |
+ 0.093 |
+ 0.074 |
+ 0.089 |
+ 0.120 |
+ Tranception Small model (85M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 57 |
+ CARP (38M) |
+ Protein language model |
+ 0.213 |
+ 0.011 |
+ 0.213 |
+ 0.210 |
+ 0.237 |
+ 0.169 |
+ 0.235 |
+ 0.156 |
+ 0.186 |
+ 0.266 |
+ 0.249 |
+ 0.233 |
+ 0.180 |
+ 0.101 |
+ 0.212 |
+ 0.116 |
+ 0.050 |
+ 0.074 |
+ 0.107 |
+ CARP model (38M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 58 |
+ ProteinMPNN |
+ Inverse folding model |
+ 0.195 |
+ 0.009 |
+ 0.143 |
+ 0.126 |
+ 0.141 |
+ 0.121 |
+ 0.445 |
+ 0.137 |
+ 0.216 |
+ 0.333 |
+ 0.223 |
+ 0.307 |
+ 0.262 |
+ 0.196 |
+ 0.206 |
+ 0.176 |
+ 0.119 |
+ 0.127 |
+ 0.224 |
+ ProteinMPNN model |
+ <a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a> |
+
+
+ 59 |
+ ESM2 (8M) |
+ Protein language model |
+ 0.171 |
+ 0.012 |
+ 0.147 |
+ 0.201 |
+ 0.192 |
+ 0.104 |
+ 0.209 |
+ 0.149 |
+ 0.141 |
+ 0.199 |
+ 0.190 |
+ 0.193 |
+ 0.139 |
+ 0.075 |
+ 0.148 |
+ 0.099 |
+ 0.063 |
+ 0.097 |
+ 0.130 |
+ ESM2 model (8M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 60 |
+ ProtGPT2 |
+ Protein language model |
+ 0.151 |
+ 0.009 |
+ 0.135 |
+ 0.118 |
+ 0.155 |
+ 0.136 |
+ 0.210 |
+ 0.144 |
+ 0.143 |
+ 0.203 |
+ 0.198 |
+ 0.193 |
+ 0.103 |
+ 0.118 |
+ 0.143 |
+ 0.102 |
+ 0.027 |
+ 0.014 |
+ 0.046 |
+ ProtGPT2 model |
+ <a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a> |
+
+
+ 61 |
+ Unirep |
+ Protein language model |
+ 0.146 |
+ 0.012 |
+ 0.134 |
+ 0.156 |
+ 0.166 |
+ 0.103 |
+ 0.171 |
+ 0.139 |
+ 0.128 |
+ 0.157 |
+ 0.174 |
+ 0.173 |
+ 0.110 |
+ 0.051 |
+ 0.132 |
+ 0.064 |
+ 0.074 |
+ 0.093 |
+ 0.119 |
+ Unirep model |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 62 |
+ CARP (600K) |
+ Protein language model |
+ 0.076 |
+ 0.013 |
+ 0.076 |
+ 0.057 |
+ 0.126 |
+ 0.046 |
+ 0.074 |
+ 0.069 |
+ 0.067 |
+ 0.067 |
+ 0.090 |
+ 0.054 |
+ 0.047 |
+ 0.051 |
+ 0.077 |
+ 0.016 |
+ 0.019 |
+ 0.062 |
+ 0.069 |
+ CARP model (600K params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.csv b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.csv
new file mode 100644
index 0000000..a0a7479
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.csv
@@ -0,0 +1,217 @@
+DMS ID,Site-Independent,EVmutation,DeepSequence (single),DeepSequence (ensemble),EVE (single),EVE (ensemble),Unirep,Unirep evotuned,MSA Transformer (single),MSA Transformer (ensemble),ESM-1b,ESM-1v (single),ESM-1v (ensemble),ESM2 (8M),ESM2 (35M),ESM2 (150M),ESM2 (650M),ESM2 (3B),ESM2 (15B),Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,GEMME,VESPA,VESPAl,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,CARP (38M),CARP (600K),CARP (640M),CARP (76M),MIF,MIF-ST,ESM-IF1,ProteinMPNN,ProtSSN (k=10 h=512),ProtSSN (k=10 h=768),ProtSSN (k=10 h=1280),ProtSSN (k=20 h=512),ProtSSN (k=20 h=768),ProtSSN (k=20 h=1280),ProtSSN (k=30 h=512),ProtSSN (k=30 h=768),ProtSSN (k=30 h=1280),ProtSSN (ensemble),SaProt (650M),SaProt (35M),Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A0A140D2T1_ZIKV_Sourisseau_2019,0.278,0.29,0.231,0.219,0.283,0.286,0.1,0.099,0.327,0.333,0.149,0.128,0.164,0.097,0.122,0.107,0.144,0.257,0.271,0.189,0.274,0.277,0.279,0.298,0.254,0.267,0.283,0.248,0.273,0.299,0.262,0.231,0.125,0.259,0.252,0.239,0.28,0.283,0.279,0.281,0.293,0.285,0.123,0.121,0.184,0.131,0.186,0.202,0.163,0.166,0.198,0.207,0.221,0.218,0.22,0.199,0.178,0.199,0.184,0.209,0.155,0.154,9576,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+A0A192B1T2_9HIV1_Haddox_2018,0.878,0.871,0.873,0.876,0.879,0.881,0.653,0.874,0.884,0.883,0.852,0.87,0.873,0.681,0.677,0.676,0.689,0.707,0.722,0.879,0.874,0.877,0.881,0.879,0.875,0.877,0.877,0.876,0.881,0.884,0.875,0.856,0.82,0.873,0.878,0.881,0.879,0.881,0.883,0.882,0.881,0.882,0.822,0.645,0.873,0.866,0.742,0.866,0.697,0.74,0.703,0.715,0.729,0.726,0.717,0.726,0.719,0.717,0.713,0.72,0.728,0.706,12577,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+A0A1I9GEU1_NEIME_Kennouche_2019,0.691,0.755,0.76,0.766,0.758,0.761,0.711,0.773,0.764,0.759,0.746,0.764,0.756,0.73,0.736,0.747,0.725,0.719,0.709,0.771,0.752,0.761,0.76,0.761,0.76,0.763,0.768,0.771,0.756,0.759,0.763,0.771,0.757,0.743,0.764,0.764,0.751,0.745,0.761,0.757,0.761,0.761,0.712,0.716,0.755,0.709,0.768,0.771,0.734,0.752,0.752,0.741,0.732,0.734,0.723,0.745,0.747,0.754,0.733,0.727,0.731,0.72,922,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+A0A247D711_LISMN_Stadelmann_2021,0.893,0.897,0.805,0.8,0.891,0.886,0.695,0.708,0.901,0.895,0.772,0.766,0.768,0.744,0.741,0.758,0.751,0.766,0.736,0.873,0.687,0.688,0.697,0.688,0.667,0.72,0.741,0.686,0.725,0.901,0.856,0.857,0.833,0.709,0.689,0.696,0.881,0.88,0.882,0.87,0.875,0.873,0.726,0.685,0.729,0.739,0.848,0.839,0.89,0.869,0.826,0.847,0.84,0.809,0.792,0.819,0.809,0.821,0.796,0.809,0.84,0.805,1653,Activity,A0A247D711_LISMN,High,Prokaryote
+A0A2Z5U3Z0_9INFA_Doud_2016,0.755,0.798,0.783,0.788,0.792,0.79,0.497,0.765,0.791,0.795,0.552,0.779,0.8,0.52,0.525,0.534,0.767,0.773,0.778,0.778,0.769,0.796,0.794,0.803,0.7,0.783,0.789,0.776,0.779,0.799,0.754,0.718,0.554,0.77,0.789,0.79,0.788,0.794,0.794,0.796,0.799,0.799,0.526,0.513,0.641,0.524,0.659,0.722,0.674,0.632,0.74,0.739,0.748,0.746,0.739,0.746,0.746,0.742,0.748,0.743,0.614,0.595,10715,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A0A2Z5U3Z0_9INFA_Wu_2014,0.215,0.228,0.233,0.235,0.232,0.232,0.112,0.277,0.244,0.23,0.151,0.212,0.245,0.114,0.117,0.127,0.237,0.207,0.242,0.29,0.206,0.238,0.25,0.251,0.201,0.215,0.257,0.232,0.22,0.274,0.261,0.238,0.082,0.202,0.224,0.229,0.222,0.241,0.237,0.225,0.23,0.223,0.111,0.106,0.178,0.123,0.178,0.203,0.175,0.146,0.228,0.215,0.215,0.231,0.205,0.232,0.203,0.223,0.223,0.22,0.151,0.159,2350,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A4_HUMAN_Seuma_2022,0.662,0.735,0.761,0.761,0.725,0.733,0.639,0.632,0.743,0.745,0.684,0.708,0.741,0.611,0.609,0.633,0.657,0.753,0.725,0.72,0.685,0.67,0.691,0.711,0.724,0.671,0.687,0.686,0.692,0.762,0.7,0.695,0.758,0.708,0.693,0.749,0.711,0.704,0.738,0.736,0.733,0.746,0.614,0.614,0.747,0.621,0.677,0.714,0.435,0.612,0.709,0.736,0.709,0.683,0.688,0.687,0.695,0.648,0.684,0.696,0.63,0.608,14811,Stability,A4_HUMAN,Low,Human
+A4D664_9INFA_Soh_2019,0.43,0.43,0.442,0.436,0.444,0.445,0.239,0.421,0.375,0.372,0.249,0.237,0.245,0.246,0.252,0.246,0.288,0.288,0.383,0.407,0.384,0.415,0.416,0.396,0.245,0.356,0.357,0.355,0.389,0.432,0.383,0.356,0.233,0.397,0.42,0.424,0.434,0.444,0.443,0.455,0.456,0.453,0.248,0.245,0.275,0.258,0.273,0.285,0.2,0.288,0.275,0.292,0.274,0.305,0.283,0.299,0.288,0.301,0.293,0.288,0.281,0.249,14421,OrganismalFitness,A4D664_9INFA,Medium,Virus
+A4GRB6_PSEAI_Chen_2020,0.724,0.829,0.83,0.841,0.827,0.834,0.656,0.737,0.821,0.831,0.838,0.82,0.837,0.731,0.776,0.839,0.865,0.88,0.874,0.832,0.701,0.78,0.802,0.822,0.764,0.826,0.823,0.814,0.87,0.821,0.887,0.858,0.646,0.723,0.815,0.844,0.817,0.823,0.857,0.856,0.854,0.862,0.741,0.63,0.85,0.808,0.861,0.881,0.799,0.802,0.858,0.864,0.855,0.877,0.873,0.873,0.86,0.873,0.867,0.873,0.864,0.776,5004,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+AACC1_PSEAI_Dandage_2018,0.776,0.819,0.779,0.792,0.788,0.8,0.735,0.705,0.818,0.815,0.766,0.786,0.777,0.729,0.733,0.747,0.779,0.799,0.798,0.795,0.728,0.737,0.763,0.706,0.747,0.786,0.778,0.785,0.766,0.785,0.779,0.767,0.676,0.754,0.747,0.789,0.804,0.803,0.799,0.796,0.794,0.799,0.744,0.726,0.754,0.744,0.739,0.786,0.77,0.734,0.788,0.797,0.792,0.791,0.799,0.806,0.793,0.804,0.808,0.801,0.771,0.756,1801,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+ACE2_HUMAN_Chan_2020,0.705,0.686,0.71,0.7,0.687,0.685,0.622,0.675,0.698,0.7,0.674,0.698,0.699,0.616,0.652,0.676,0.672,0.668,0.69,0.697,0.641,0.698,0.721,0.697,0.634,0.705,0.704,0.703,0.707,0.689,0.673,0.677,0.681,0.66,0.688,0.657,0.707,0.696,0.693,0.693,0.687,0.685,0.634,0.615,0.664,0.659,0.683,0.676,0.695,0.664,0.677,0.681,0.672,0.662,0.679,0.674,0.676,0.669,0.678,0.669,0.679,0.679,2223,Binding,ACE2_HUMAN,Medium,Human
+ADRB2_HUMAN_Jones_2020,0.604,0.634,0.637,0.64,0.654,0.647,0.604,0.647,0.634,0.646,0.647,0.632,0.641,0.586,0.605,0.606,0.603,0.619,0.627,0.631,0.625,0.633,0.627,0.635,0.636,0.639,0.641,0.644,0.635,0.629,0.631,0.613,0.545,0.641,0.634,0.636,0.639,0.632,0.636,0.645,0.645,0.645,0.61,0.546,0.632,0.627,0.604,0.635,0.602,0.582,0.618,0.611,0.607,0.62,0.615,0.616,0.607,0.619,0.606,0.616,0.64,0.619,7800,Activity,ADRB2_HUMAN,Medium,Human
+AICDA_HUMAN_Gajula_2014_3cycles,0.104,0.126,0.185,0.106,0.148,0.163,0.036,0.149,0.201,0.147,0.229,0.173,0.168,0.059,0.016,0.226,0.279,0.24,0.171,0.32,0.047,0.066,0.285,0.29,0.037,0.248,0.301,0.282,0.133,0.104,0.185,0.159,0.119,0.084,0.076,0.14,0.361,0.327,0.332,0.164,0.164,0.147,0.043,0.047,0.146,0.022,0.237,0.159,0.108,0.387,0.23,0.237,0.197,0.231,0.157,0.203,0.192,0.187,0.186,0.207,0.053,0.048,209,Activity,AICDA_HUMAN,Medium,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O,0.813,0.856,0.859,0.858,0.851,0.862,0.418,0.831,0.846,0.855,0.879,0.567,0.815,0.44,0.603,0.879,0.867,0.872,0.865,0.846,0.587,0.46,0.809,0.804,0.43,0.808,0.784,0.673,0.844,0.846,0.862,0.837,0.823,0.454,0.546,0.468,0.832,0.837,0.841,0.864,0.867,0.865,0.621,0.593,0.854,0.788,0.756,0.801,0.837,0.834,0.87,0.878,0.869,0.863,0.876,0.867,0.866,0.859,0.873,0.871,0.888,0.674,2972,Stability,AMFR_HUMAN,Medium,Human
+AMIE_PSEAE_Wrenbeck_2017,0.796,0.882,0.875,0.877,0.874,0.875,0.733,0.857,0.828,0.883,0.843,0.882,0.886,0.781,0.803,0.806,0.836,0.891,0.882,0.845,0.876,0.886,0.887,0.885,0.866,0.879,0.888,0.883,0.882,0.873,0.882,0.874,0.776,0.886,0.883,0.881,0.878,0.878,0.876,0.885,0.883,0.882,0.79,0.759,0.816,0.802,0.818,0.848,0.828,0.816,0.85,0.86,0.858,0.854,0.852,0.855,0.854,0.85,0.857,0.859,0.864,0.813,6227,Activity,AMIE_PSEAE,High,Prokaryote
+ANCSZ_Hobbs_2022,0.82,0.817,0.811,0.81,0.818,0.818,0.787,0.808,0.814,0.819,0.806,0.82,0.822,0.818,0.834,0.829,0.829,0.829,0.826,0.676,0.834,0.807,0.811,0.808,0.84,0.815,0.817,0.8,0.797,0.819,0.804,0.794,0.754,0.804,0.8,0.811,0.819,0.816,0.821,0.822,0.82,0.822,0.82,0.795,0.81,0.812,0.808,0.825,0.785,0.745,0.827,0.831,0.819,0.814,0.825,0.822,0.828,0.829,0.827,0.828,0.825,0.823,4670,Activity,ANCSZ,Medium,Eukaryote
+ARGR_ECOLI_Tsuboyama_2023_1AOY,0.839,0.908,0.91,0.91,0.909,0.907,0.793,0.868,0.825,0.882,0.872,0.826,0.895,0.827,0.851,0.889,0.901,0.912,0.896,0.901,0.85,0.883,0.898,0.895,0.858,0.88,0.901,0.901,0.891,0.887,0.878,0.848,0.77,0.866,0.87,0.87,0.871,0.881,0.888,0.913,0.901,0.908,0.827,0.787,0.874,0.856,0.918,0.913,0.914,0.919,0.899,0.905,0.908,0.908,0.907,0.904,0.904,0.912,0.902,0.908,0.927,0.884,1287,Stability,ARGR_ECOLI,Medium,Prokaryote
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.594,0.518,0.605,0.598,0.595,0.604,0.425,0.567,0.492,0.524,0.18,0.158,0.552,0.424,0.352,0.38,0.474,0.494,0.563,0.479,0.363,0.272,0.479,0.529,0.195,0.493,0.536,0.386,0.536,0.564,0.458,0.389,0.371,0.49,0.224,0.457,0.597,0.605,0.637,0.617,0.624,0.628,0.355,0.402,0.518,0.445,0.386,0.502,0.449,0.158,0.403,0.498,0.489,0.53,0.428,0.46,0.491,0.504,0.511,0.522,0.424,0.223,170,Binding,B2L11_HUMAN,Low,Human
+BBC1_YEAST_Tsuboyama_2023_1TG0,0.681,0.749,0.713,0.716,0.718,0.69,0.612,0.654,0.754,0.759,0.771,0.758,0.766,0.514,0.735,0.759,0.814,0.812,0.815,0.755,0.661,0.732,0.742,0.746,0.592,0.765,0.709,0.734,0.716,0.668,0.711,0.681,0.616,0.7,0.658,0.699,0.719,0.694,0.727,0.709,0.678,0.693,0.676,0.529,0.762,0.732,0.789,0.772,0.811,0.789,0.788,0.8,0.798,0.822,0.803,0.823,0.807,0.809,0.8,0.809,0.811,0.797,2069,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU,0.877,0.893,0.888,0.887,0.878,0.887,0.727,0.844,0.861,0.901,0.817,0.858,0.873,0.773,0.791,0.825,0.89,0.888,0.894,0.9,0.737,0.838,0.831,0.851,0.811,0.874,0.813,0.875,0.905,0.893,0.879,0.86,0.679,0.813,0.823,0.866,0.872,0.872,0.889,0.884,0.882,0.888,0.808,0.801,0.838,0.834,0.884,0.907,0.92,0.896,0.885,0.906,0.895,0.897,0.899,0.893,0.896,0.892,0.891,0.898,0.913,0.788,1572,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Deng_2012,0.681,0.798,0.775,0.78,0.806,0.802,0.532,0.611,0.718,0.77,0.747,0.742,0.746,0.611,0.683,0.718,0.732,0.749,0.738,0.731,0.708,0.706,0.746,0.714,0.694,0.757,0.75,0.772,0.724,0.726,0.789,0.771,0.546,0.697,0.78,0.737,0.764,0.799,0.781,0.785,0.807,0.815,0.651,0.493,0.754,0.691,0.731,0.76,0.746,0.681,0.77,0.786,0.775,0.785,0.785,0.785,0.797,0.782,0.805,0.793,0.754,0.715,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Firnberg_2014,0.598,0.787,0.793,0.775,0.808,0.814,0.387,0.514,0.667,0.755,0.706,0.716,0.729,0.56,0.636,0.702,0.722,0.736,0.695,0.684,0.697,0.695,0.706,0.674,0.669,0.734,0.732,0.747,0.657,0.666,0.799,0.789,0.445,0.668,0.736,0.683,0.744,0.774,0.758,0.78,0.804,0.824,0.611,0.375,0.726,0.645,0.723,0.787,0.792,0.606,0.754,0.759,0.768,0.765,0.777,0.788,0.772,0.769,0.768,0.779,0.797,0.676,4783,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Jacquier_2013,0.815,0.937,0.918,0.9,0.921,0.924,0.621,0.76,0.858,0.894,0.869,0.894,0.898,0.802,0.825,0.837,0.863,0.899,0.897,0.85,0.895,0.887,0.912,0.906,0.838,0.882,0.912,0.922,0.903,0.771,0.916,0.916,0.728,0.871,0.918,0.906,0.904,0.921,0.918,0.898,0.908,0.922,0.819,0.571,0.883,0.853,0.823,0.908,0.902,0.84,0.892,0.894,0.901,0.909,0.892,0.901,0.909,0.895,0.903,0.897,0.919,0.872,989,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Stiffler_2015,0.83,0.95,0.94,0.931,0.964,0.962,0.633,0.769,0.86,0.922,0.891,0.907,0.909,0.806,0.856,0.897,0.901,0.933,0.912,0.873,0.897,0.904,0.922,0.905,0.863,0.916,0.929,0.943,0.895,0.877,0.967,0.963,0.683,0.882,0.939,0.906,0.932,0.957,0.946,0.942,0.957,0.973,0.84,0.615,0.913,0.863,0.91,0.953,0.951,0.815,0.933,0.93,0.941,0.936,0.94,0.952,0.941,0.935,0.945,0.944,0.956,0.885,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BRCA1_HUMAN_Findlay_2018,0.881,0.871,0.877,0.878,0.866,0.867,0.738,0.812,0.867,0.869,0.865,0.822,0.822,0.77,0.812,0.834,0.866,0.879,0.85,0.872,0.76,0.862,0.877,0.861,0.823,0.89,0.887,0.873,0.873,0.878,0.879,0.855,0.717,0.781,0.772,0.857,0.877,0.881,0.883,0.878,0.877,0.878,0.793,0.756,0.881,0.835,0.838,0.877,0.751,0.781,0.853,0.86,0.858,0.855,0.85,0.858,0.848,0.859,0.852,0.857,0.818,0.833,1837,OrganismalFitness,BRCA1_HUMAN,Low,Human
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.941,0.933,0.936,0.935,0.935,0.932,0.915,0.937,0.856,0.891,0.94,0.923,0.918,0.928,0.925,0.929,0.944,0.935,0.94,0.903,0.93,0.944,0.949,0.943,0.942,0.922,0.921,0.933,0.914,0.932,0.926,0.946,0.927,0.926,0.93,0.931,0.938,0.937,0.939,0.933,0.934,0.933,0.918,0.922,0.939,0.898,0.89,0.892,0.769,0.917,0.943,0.942,0.941,0.938,0.945,0.933,0.948,0.943,0.943,0.944,0.896,0.91,265,OrganismalFitness,BRCA2_HUMAN,,Human
+C6KNH7_9INFA_Lee_2018,0.771,0.811,0.803,0.8,0.812,0.812,0.609,0.792,0.809,0.812,0.627,0.77,0.822,0.601,0.604,0.616,0.784,0.81,0.82,0.792,0.798,0.788,0.796,0.786,0.66,0.793,0.796,0.808,0.778,0.828,0.8,0.767,0.667,0.786,0.796,0.787,0.801,0.805,0.801,0.815,0.818,0.817,0.603,0.59,0.698,0.608,0.75,0.795,0.785,0.719,0.789,0.796,0.795,0.793,0.793,0.783,0.794,0.794,0.783,0.795,0.688,0.658,10754,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+CALM1_HUMAN_Weile_2017,0.799,0.817,0.808,0.804,0.805,0.8,0.801,0.79,0.815,0.816,0.827,0.794,0.821,0.797,0.793,0.791,0.773,0.786,0.812,0.813,0.801,0.806,0.821,0.81,0.812,0.827,0.816,0.841,0.832,0.823,0.819,0.794,0.75,0.809,0.832,0.823,0.815,0.827,0.828,0.813,0.807,0.813,0.807,0.794,0.819,0.824,0.781,0.775,0.809,0.758,0.774,0.78,0.781,0.758,0.766,0.783,0.781,0.779,0.763,0.778,0.826,0.822,1813,OrganismalFitness,CALM1_HUMAN,High,Human
+CAPSD_AAV2S_Sinai_2021,0.791,0.789,0.786,0.811,0.781,0.784,0.758,0.841,0.75,0.777,0.641,0.666,0.682,0.701,0.729,0.69,0.733,0.665,0.606,0.703,0.683,0.722,0.696,0.712,0.68,0.691,0.729,0.698,0.794,0.798,0.629,0.619,0.616,0.699,0.721,0.83,0.771,0.772,0.821,0.785,0.787,0.817,0.663,0.718,0.776,0.688,0.768,0.753,0.686,0.723,0.673,0.651,0.654,0.656,0.637,0.645,0.659,0.652,0.688,0.663,0.723,0.665,42328,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAR11_HUMAN_Meitlis_2020_gof,0.635,0.638,0.639,0.639,0.641,0.639,0.653,0.607,0.648,0.645,0.656,0.621,0.631,0.61,0.596,0.642,0.642,0.635,0.643,0.634,0.616,0.651,0.652,0.649,0.605,0.655,0.654,0.65,0.634,0.639,0.644,0.629,0.623,0.64,0.622,0.621,0.634,0.641,0.638,0.639,0.646,0.646,0.617,0.594,0.642,0.633,0.614,0.632,0.634,0.628,0.65,0.653,0.637,0.651,0.654,0.641,0.652,0.648,0.642,0.646,0.661,0.649,2374,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAR11_HUMAN_Meitlis_2020_lof,0.789,0.79,0.794,0.791,0.785,0.784,0.769,0.736,0.804,0.804,0.818,0.765,0.784,0.738,0.712,0.757,0.786,0.805,0.809,0.789,0.7,0.811,0.808,0.8,0.737,0.823,0.813,0.831,0.765,0.802,0.818,0.784,0.741,0.755,0.777,0.766,0.784,0.804,0.794,0.786,0.8,0.795,0.728,0.72,0.818,0.764,0.733,0.806,0.782,0.753,0.806,0.798,0.787,0.8,0.803,0.793,0.799,0.795,0.796,0.798,0.807,0.765,2395,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAS9_STRP1_Spencer_2017_positive,0.836,0.838,0.839,0.839,0.839,0.839,0.831,0.822,0.84,0.84,0.836,0.829,0.829,0.829,0.829,0.837,0.839,0.842,0.841,0.828,0.832,0.831,0.839,0.845,0.831,0.84,0.838,0.837,0.839,0.844,0.842,0.842,0.834,0.825,0.828,0.84,0.838,0.839,0.842,0.839,0.839,0.841,0.829,0.826,0.838,0.829,0.829,0.838,0.829,0.828,0.84,0.837,0.841,0.838,0.837,0.837,0.837,0.84,0.839,0.839,0.842,0.837,8117,Activity,CAS9_STRP1,Medium,Prokaryote
+CASP3_HUMAN_Roychowdhury_2020,0.779,0.82,0.82,0.812,0.812,0.818,0.7,0.759,0.818,0.824,0.813,0.796,0.811,0.712,0.809,0.826,0.812,0.823,0.824,0.798,0.711,0.778,0.756,0.829,0.809,0.817,0.824,0.812,0.831,0.802,0.838,0.821,0.754,0.712,0.803,0.819,0.809,0.822,0.822,0.823,0.829,0.824,0.777,0.681,0.817,0.801,0.768,0.807,0.785,0.74,0.812,0.801,0.804,0.807,0.81,0.807,0.819,0.818,0.806,0.805,0.819,0.796,1567,Activity,CASP3_HUMAN,High,Human
+CASP7_HUMAN_Roychowdhury_2020,0.747,0.808,0.815,0.816,0.817,0.814,0.607,0.786,0.81,0.813,0.818,0.813,0.816,0.682,0.804,0.813,0.804,0.787,0.786,0.81,0.622,0.794,0.764,0.788,0.782,0.792,0.796,0.785,0.804,0.82,0.804,0.786,0.723,0.664,0.784,0.785,0.799,0.822,0.812,0.812,0.813,0.818,0.746,0.62,0.821,0.806,0.761,0.823,0.788,0.74,0.805,0.811,0.812,0.803,0.804,0.802,0.806,0.805,0.809,0.809,0.819,0.775,1680,Activity,CASP7_HUMAN,Medium,Human
+CATR_CHLRE_Tsuboyama_2023_2AMI,0.853,0.808,0.813,0.816,0.825,0.817,0.844,0.774,0.777,0.793,0.805,0.82,0.815,0.819,0.859,0.819,0.82,0.814,0.801,0.62,0.821,0.792,0.8,0.791,0.798,0.798,0.788,0.796,0.8,0.813,0.79,0.799,0.755,0.811,0.803,0.807,0.831,0.821,0.813,0.812,0.81,0.813,0.819,0.82,0.795,0.8,0.847,0.76,0.818,0.842,0.815,0.808,0.815,0.817,0.813,0.811,0.813,0.81,0.809,0.816,0.81,0.814,1903,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,0.91,0.895,0.916,0.91,0.91,0.912,0.848,0.896,0.909,0.916,0.912,0.9,0.903,0.854,0.901,0.902,0.908,0.915,0.913,0.917,0.861,0.845,0.884,0.891,0.889,0.837,0.835,0.823,0.913,0.906,0.914,0.899,0.733,0.868,0.886,0.905,0.912,0.915,0.916,0.913,0.912,0.918,0.898,0.888,0.901,0.902,0.928,0.922,0.925,0.928,0.908,0.916,0.922,0.915,0.917,0.909,0.911,0.911,0.91,0.917,0.918,0.928,2068,Stability,CBPA2_HUMAN,Medium,Human
+CBS_HUMAN_Sun_2020,0.472,0.468,0.477,0.478,0.477,0.48,0.323,0.435,0.478,0.455,0.451,0.463,0.45,0.331,0.394,0.427,0.421,0.447,0.435,0.455,0.465,0.419,0.446,0.452,0.427,0.433,0.446,0.443,0.454,0.47,0.494,0.478,0.386,0.465,0.43,0.431,0.478,0.479,0.49,0.49,0.486,0.474,0.431,0.326,0.472,0.461,0.42,0.453,0.442,0.372,0.432,0.444,0.447,0.439,0.427,0.439,0.448,0.444,0.434,0.445,0.468,0.413,7217,OrganismalFitness,CBS_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28,0.881,0.875,0.889,0.895,0.897,0.904,0.318,0.816,0.89,0.891,0.873,0.89,0.89,0.355,0.912,0.911,0.907,0.899,0.87,0.889,0.815,0.829,0.837,0.838,0.859,0.865,0.858,0.847,0.849,0.887,0.876,0.846,0.641,0.775,0.844,0.857,0.884,0.876,0.893,0.901,0.892,0.892,0.88,0.448,0.833,0.883,0.748,0.757,0.914,0.859,0.906,0.898,0.903,0.899,0.907,0.9,0.905,0.901,0.906,0.909,0.896,0.905,2282,Stability,CBX4_HUMAN,High,Human
+CCDB_ECOLI_Adkar_2012,0.782,0.824,0.845,0.844,0.829,0.835,0.563,0.702,0.686,0.776,0.753,0.623,0.641,0.56,0.577,0.655,0.808,0.849,0.813,0.733,0.532,0.533,0.483,0.689,0.493,0.655,0.533,0.668,0.846,0.789,0.877,0.873,0.735,0.572,0.56,0.774,0.724,0.701,0.817,0.828,0.814,0.845,0.59,0.52,0.691,0.587,0.655,0.789,0.646,0.692,0.788,0.737,0.724,0.766,0.826,0.806,0.757,0.822,0.815,0.799,0.801,0.707,1176,Activity,CCDB_ECOLI,High,Prokaryote
+CCDB_ECOLI_Tripathi_2016,0.946,0.994,0.991,0.993,0.989,0.988,0.783,0.954,0.89,0.955,0.946,0.866,0.89,0.773,0.78,0.893,0.967,0.991,1.0,0.939,0.763,0.765,0.71,0.914,0.714,0.874,0.746,0.882,0.996,0.953,0.996,0.996,0.917,0.765,0.782,0.988,0.913,0.892,0.99,0.98,0.981,0.993,0.796,0.741,0.905,0.78,0.89,0.983,0.899,0.89,0.974,0.953,0.935,0.957,0.99,0.978,0.967,0.991,0.983,0.977,0.964,0.955,1663,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+CCR5_HUMAN_Gill_2023,0.819,0.818,0.818,0.819,0.821,0.821,0.8,0.841,0.822,0.822,0.823,0.82,0.819,0.81,0.813,0.82,0.807,0.816,0.812,0.823,0.814,0.836,0.832,0.83,0.824,0.832,0.825,0.828,0.832,0.829,0.822,0.815,0.775,0.822,0.828,0.829,0.82,0.83,0.834,0.822,0.823,0.825,0.813,0.772,0.826,0.815,0.809,0.824,0.815,0.805,0.818,0.814,0.813,0.809,0.813,0.82,0.814,0.814,0.814,0.817,0.823,0.827,6137,Binding,CCR5_HUMAN,High,Human
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.566,0.555,0.578,0.578,0.566,0.565,0.519,0.523,0.526,0.529,0.523,0.51,0.543,0.54,0.566,0.522,0.517,0.558,0.577,0.491,0.575,0.569,0.584,0.577,0.511,0.554,0.536,0.586,0.565,0.576,0.572,0.548,0.527,0.608,0.586,0.531,0.598,0.587,0.578,0.578,0.574,0.573,0.552,0.529,0.573,0.562,0.693,0.67,0.683,0.553,0.574,0.564,0.591,0.6,0.58,0.595,0.614,0.573,0.61,0.6,0.687,0.621,3761,Binding,CD19_HUMAN,Low,Human
+CP2C9_HUMAN_Amorosi_2021_abundance,0.825,0.857,0.852,0.859,0.867,0.864,0.842,0.859,0.856,0.866,0.857,0.867,0.874,0.819,0.854,0.859,0.873,0.87,0.868,0.867,0.853,0.83,0.865,0.863,0.867,0.866,0.868,0.868,0.868,0.861,0.859,0.846,0.634,0.861,0.862,0.862,0.869,0.868,0.87,0.868,0.868,0.866,0.859,0.727,0.862,0.867,0.853,0.862,0.875,0.77,0.868,0.869,0.874,0.87,0.869,0.872,0.874,0.874,0.871,0.873,0.871,0.856,6370,Expression,CP2C9_HUMAN,High,Human
+CP2C9_HUMAN_Amorosi_2021_activity,0.753,0.814,0.815,0.827,0.832,0.83,0.796,0.834,0.801,0.833,0.814,0.839,0.857,0.753,0.827,0.84,0.867,0.852,0.829,0.847,0.81,0.773,0.835,0.814,0.841,0.827,0.834,0.821,0.825,0.81,0.8,0.753,0.501,0.834,0.833,0.836,0.842,0.845,0.853,0.845,0.841,0.837,0.829,0.589,0.825,0.856,0.81,0.817,0.842,0.653,0.857,0.849,0.851,0.852,0.857,0.861,0.854,0.854,0.859,0.863,0.852,0.827,6142,Binding,CP2C9_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM,0.824,0.841,0.847,0.852,0.851,0.849,0.672,0.807,0.858,0.848,0.838,0.87,0.874,0.751,0.895,0.86,0.789,0.835,0.822,0.845,0.766,0.745,0.815,0.841,0.726,0.866,0.871,0.847,0.842,0.842,0.857,0.843,0.678,0.729,0.833,0.824,0.853,0.867,0.87,0.852,0.864,0.865,0.756,0.745,0.844,0.862,0.872,0.847,0.902,0.887,0.84,0.839,0.843,0.847,0.842,0.849,0.845,0.851,0.841,0.847,0.855,0.873,3295,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX,0.746,0.847,0.798,0.8,0.821,0.815,0.646,0.781,0.863,0.867,0.777,0.671,0.673,0.617,0.637,0.679,0.728,0.797,0.793,0.768,0.626,0.618,0.643,0.563,0.544,0.564,0.623,0.622,0.796,0.779,0.802,0.768,0.676,0.617,0.597,0.629,0.742,0.739,0.769,0.808,0.806,0.812,0.626,0.633,0.731,0.644,0.781,0.787,0.87,0.835,0.711,0.782,0.764,0.76,0.721,0.76,0.747,0.739,0.747,0.747,0.824,0.75,1580,Stability,CUE1_YEAST,Medium,Eukaryote
+D7PM05_CLYGR_Somermeyer_2022,0.63,0.771,0.736,0.695,0.76,0.76,0.32,0.622,0.792,0.789,0.595,0.325,0.328,0.327,0.309,0.302,0.312,0.341,0.357,0.621,0.291,0.27,0.362,0.331,0.261,0.305,0.313,0.393,0.478,0.773,0.752,0.756,0.308,0.325,0.36,0.363,0.691,0.69,0.686,0.764,0.763,0.753,0.443,0.445,0.45,0.45,0.573,0.593,0.611,0.594,0.626,0.62,0.632,0.628,0.614,0.629,0.608,0.627,0.615,0.625,0.513,0.386,24515,Activity,D7PM05_CLYGR,Low,Eukaryote
+DLG4_HUMAN_Faure_2021,0.87,0.859,0.871,0.863,0.876,0.876,0.872,0.863,0.834,0.841,0.816,0.865,0.867,0.888,0.914,0.901,0.865,0.853,0.826,0.878,0.851,0.849,0.836,0.823,0.863,0.851,0.843,0.848,0.821,0.874,0.853,0.852,0.817,0.849,0.869,0.836,0.886,0.894,0.88,0.883,0.888,0.885,0.877,0.727,0.832,0.867,0.869,0.817,0.895,0.772,0.837,0.813,0.835,0.833,0.845,0.846,0.829,0.827,0.829,0.835,0.857,0.914,6976,OrganismalFitness,DLG4_HUMAN,Low,Human
+DLG4_RAT_McLaughlin_2012,0.921,0.93,0.925,0.925,0.931,0.932,0.926,0.923,0.929,0.929,0.928,0.927,0.927,0.912,0.914,0.936,0.928,0.927,0.928,0.921,0.922,0.921,0.924,0.924,0.923,0.926,0.922,0.921,0.927,0.921,0.935,0.936,0.879,0.917,0.925,0.911,0.922,0.924,0.92,0.927,0.928,0.932,0.929,0.8,0.927,0.922,0.907,0.91,0.906,0.879,0.927,0.882,0.919,0.921,0.923,0.92,0.923,0.922,0.925,0.923,0.926,0.92,1576,Binding,DLG4_RAT,Low,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC,0.841,0.877,0.86,0.848,0.831,0.855,0.751,0.876,0.898,0.906,0.799,0.808,0.789,0.77,0.802,0.801,0.823,0.819,0.902,0.894,0.738,0.784,0.759,0.765,0.763,0.74,0.746,0.782,0.854,0.891,0.906,0.883,0.778,0.77,0.748,0.764,0.839,0.829,0.83,0.858,0.858,0.845,0.787,0.774,0.818,0.785,0.917,0.896,0.926,0.918,0.893,0.899,0.906,0.892,0.899,0.885,0.872,0.879,0.892,0.889,0.854,0.85,1008,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,0.9,0.89,0.901,0.898,0.904,0.899,0.888,0.835,0.891,0.899,0.91,0.907,0.924,0.903,0.936,0.929,0.933,0.923,0.923,0.91,0.9,0.898,0.908,0.903,0.904,0.885,0.902,0.892,0.889,0.883,0.88,0.879,0.768,0.896,0.887,0.907,0.899,0.899,0.906,0.897,0.898,0.902,0.918,0.729,0.887,0.92,0.92,0.885,0.923,0.925,0.928,0.935,0.938,0.938,0.939,0.937,0.936,0.94,0.933,0.942,0.898,0.926,2264,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,0.774,0.795,0.789,0.776,0.761,0.785,0.546,0.742,0.671,0.644,0.791,0.674,0.707,0.521,0.654,0.679,0.787,0.795,0.78,0.828,0.73,0.796,0.793,0.774,0.648,0.766,0.65,0.743,0.788,0.688,0.808,0.782,0.696,0.571,0.586,0.561,0.779,0.775,0.795,0.798,0.783,0.801,0.61,0.533,0.821,0.699,0.797,0.828,0.822,0.847,0.811,0.814,0.806,0.82,0.819,0.817,0.817,0.819,0.812,0.821,0.838,0.804,2915,Stability,DOCK1_MOUSE,High,Eukaryote
+DYR_ECOLI_Nguyen_2023,0.961,0.962,0.961,0.962,0.962,0.96,0.893,0.967,0.962,0.958,0.96,0.967,0.964,0.921,0.963,0.961,0.966,0.963,0.963,0.963,0.934,0.961,0.964,0.964,0.965,0.959,0.965,0.961,0.965,0.96,0.964,0.962,0.901,0.961,0.962,0.964,0.959,0.963,0.961,0.96,0.96,0.96,0.962,0.903,0.962,0.964,0.936,0.963,0.966,0.937,0.968,0.963,0.964,0.963,0.966,0.967,0.966,0.964,0.966,0.966,0.965,0.962,2916,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+DYR_ECOLI_Thompson_2019,0.865,0.885,0.884,0.887,0.891,0.888,0.797,0.865,0.888,0.883,0.862,0.882,0.866,0.829,0.87,0.87,0.877,0.874,0.869,0.876,0.815,0.882,0.865,0.867,0.867,0.889,0.9,0.87,0.898,0.9,0.88,0.878,0.816,0.882,0.895,0.903,0.884,0.892,0.891,0.89,0.893,0.892,0.868,0.791,0.873,0.875,0.845,0.869,0.873,0.852,0.875,0.868,0.864,0.871,0.873,0.885,0.87,0.872,0.872,0.871,0.863,0.876,2363,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+ENV_HV1B9_DuenasDecamp_2016,0.643,0.63,0.538,0.601,0.635,0.631,0.249,0.662,0.628,0.634,0.538,0.623,0.624,0.299,0.278,0.305,0.301,0.298,0.469,0.663,0.59,0.645,0.662,0.66,0.564,0.677,0.587,0.655,0.669,0.577,0.709,0.647,0.564,0.618,0.637,0.668,0.642,0.631,0.653,0.639,0.643,0.658,0.566,0.244,0.629,0.623,0.549,0.578,0.497,0.41,0.431,0.527,0.47,0.431,0.528,0.488,0.495,0.51,0.378,0.487,0.353,0.319,375,OrganismalFitness,ENV_HV1B9,Medium,Virus
+ENV_HV1BR_Haddox_2016,0.304,0.315,0.33,0.343,0.337,0.336,0.145,0.317,0.302,0.307,0.258,0.287,0.289,0.165,0.174,0.17,0.18,0.182,0.209,0.342,0.303,0.313,0.331,0.339,0.294,0.325,0.337,0.318,0.342,0.267,0.302,0.271,0.216,0.3,0.314,0.324,0.312,0.314,0.321,0.335,0.337,0.341,0.268,0.143,0.294,0.275,0.225,0.293,0.177,0.199,0.211,0.207,0.225,0.225,0.223,0.22,0.216,0.209,0.212,0.222,0.206,0.183,12863,OrganismalFitness,ENV_HV1BR,Medium,Virus
+ENVZ_ECOLI_Ghose_2023,0.758,0.754,0.774,0.791,0.808,0.81,0.701,0.671,0.783,0.797,0.803,0.79,0.805,0.721,0.781,0.754,0.756,0.739,0.722,0.772,0.769,0.732,0.769,0.776,0.777,0.753,0.692,0.802,0.724,0.848,0.798,0.808,0.8,0.756,0.789,0.762,0.766,0.763,0.767,0.814,0.813,0.817,0.714,0.753,0.735,0.711,0.681,0.788,0.728,0.692,0.762,0.775,0.767,0.805,0.793,0.78,0.775,0.767,0.817,0.773,0.728,0.776,1121,Activity,ENVZ_ECOLI,High,Prokaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,0.917,0.935,0.937,0.939,0.935,0.939,0.363,0.912,0.938,0.939,0.912,0.917,0.922,0.391,0.915,0.919,0.937,0.94,0.936,0.938,0.921,0.924,0.917,0.92,0.926,0.925,0.93,0.932,0.922,0.912,0.929,0.91,0.883,0.93,0.913,0.931,0.946,0.934,0.942,0.943,0.937,0.941,0.886,0.551,0.937,0.906,0.947,0.931,0.949,0.942,0.938,0.939,0.938,0.945,0.944,0.941,0.937,0.941,0.942,0.941,0.948,0.946,1960,Stability,EPHB2_HUMAN,High,Human
+ERBB2_HUMAN_Elazar_2016,0.782,0.884,0.794,0.766,0.805,0.821,0.798,0.892,0.836,0.899,0.848,0.744,0.79,0.866,0.789,0.788,0.881,0.899,0.896,0.853,0.82,0.866,0.897,0.907,0.667,0.898,0.914,0.919,0.899,0.772,0.884,0.847,0.828,0.878,0.652,0.908,0.858,0.74,0.893,0.823,0.797,0.836,0.793,0.758,0.89,0.767,0.917,0.928,0.741,0.825,0.827,0.844,0.786,0.872,0.857,0.775,0.804,0.866,0.826,0.829,0.779,0.772,326,Expression,ERBB2_HUMAN,Low,Human
+ESTA_BACSU_Nutschel_2020,0.708,0.774,0.774,0.777,0.773,0.779,0.657,0.69,0.737,0.796,0.691,0.729,0.75,0.662,0.713,0.69,0.693,0.69,0.717,0.731,0.663,0.663,0.711,0.667,0.718,0.704,0.735,0.739,0.775,0.753,0.732,0.737,0.595,0.667,0.731,0.686,0.725,0.741,0.7,0.784,0.783,0.764,0.719,0.605,0.75,0.728,0.838,0.804,0.792,0.757,0.755,0.701,0.74,0.729,0.732,0.724,0.72,0.714,0.72,0.735,0.744,0.719,2172,Stability,ESTA_BACSU,High,Prokaryote
+F7YBW8_MESOW_Aakre_2015,0.143,0.824,0.791,0.884,0.845,0.859,0.159,0.66,0.675,0.712,0.837,0.703,0.726,0.123,0.122,0.192,0.616,0.59,0.818,0.776,0.12,0.112,0.121,0.154,0.17,0.455,0.121,0.671,0.852,0.797,0.914,0.892,0.161,0.119,0.117,0.921,0.144,0.127,0.879,0.727,0.7,0.893,0.175,0.16,0.466,0.196,0.187,0.52,0.2,0.138,0.846,0.87,0.825,0.891,0.872,0.882,0.834,0.845,0.857,0.868,0.471,0.111,9192,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+FECA_ECOLI_Tsuboyama_2023_2D1U,0.666,0.702,0.687,0.686,0.691,0.684,0.529,0.638,0.685,0.728,0.736,0.638,0.69,0.521,0.742,0.759,0.715,0.709,0.718,0.693,0.601,0.675,0.668,0.654,0.669,0.67,0.694,0.679,0.701,0.698,0.714,0.677,0.578,0.522,0.578,0.656,0.639,0.634,0.692,0.668,0.657,0.699,0.703,0.597,0.745,0.719,0.788,0.773,0.767,0.745,0.679,0.71,0.726,0.729,0.718,0.703,0.699,0.698,0.716,0.697,0.746,0.745,1886,Stability,FECA_ECOLI,High,Prokaryote
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,0.87,0.869,0.875,0.876,0.878,0.878,0.737,0.829,0.82,0.822,0.735,0.728,0.741,0.729,0.718,0.712,0.761,0.774,0.817,0.818,0.758,0.754,0.716,0.837,0.732,0.831,0.815,0.768,0.799,0.844,0.816,0.807,0.65,0.696,0.724,0.8,0.869,0.87,0.863,0.88,0.876,0.881,0.75,0.738,0.744,0.748,0.881,0.86,0.892,0.884,0.787,0.845,0.859,0.836,0.858,0.827,0.842,0.831,0.8,0.844,0.888,0.82,1237,Stability,FKBP3_HUMAN,Medium,Human
+GAL4_YEAST_Kitzman_2015,0.714,0.774,0.82,0.798,0.793,0.802,0.705,0.587,0.842,0.818,0.825,0.775,0.775,0.729,0.759,0.771,0.832,0.847,0.859,0.783,0.776,0.729,0.751,0.738,0.754,0.776,0.794,0.778,0.863,0.809,0.863,0.851,0.67,0.706,0.751,0.714,0.813,0.807,0.828,0.794,0.802,0.811,0.743,0.777,0.824,0.75,0.638,0.804,0.603,0.652,0.851,0.816,0.843,0.833,0.823,0.818,0.858,0.817,0.832,0.835,0.778,0.764,1195,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+GCN4_YEAST_Staller_2018,0.759,0.754,0.748,0.754,0.752,0.749,0.664,0.683,0.759,0.756,0.742,0.739,0.747,0.727,0.719,0.748,0.76,0.757,0.752,0.756,0.647,0.641,0.652,0.647,0.637,0.634,0.632,0.645,0.681,0.758,0.748,0.742,0.661,0.644,0.638,0.754,0.755,0.757,0.764,0.753,0.753,0.762,0.723,0.735,0.723,0.735,0.743,0.759,0.735,0.708,0.744,0.74,0.741,0.741,0.746,0.747,0.741,0.74,0.745,0.742,0.709,0.703,2638,Binding,GCN4_YEAST,Low,Eukaryote
+GDIA_HUMAN_Silverstein_2021,0.879,0.859,0.86,0.859,0.849,0.857,0.769,0.871,0.872,0.869,0.864,0.86,0.868,0.737,0.795,0.878,0.864,0.867,0.877,0.864,0.779,0.874,0.865,0.872,0.854,0.876,0.87,0.87,0.871,0.868,0.842,0.844,0.804,0.832,0.86,0.843,0.877,0.861,0.858,0.863,0.861,0.858,0.778,0.706,0.886,0.79,0.88,0.879,0.862,0.742,0.852,0.851,0.861,0.861,0.858,0.872,0.858,0.86,0.862,0.856,0.866,0.844,1154,OrganismalFitness,GDIA_HUMAN,Low,Human
+GFP_AEQVI_Sarkisyan_2016,0.917,0.907,0.913,0.913,0.913,0.913,0.416,0.912,0.916,0.915,0.812,0.47,0.472,0.464,0.493,0.449,0.456,0.5,0.562,0.878,0.443,0.457,0.514,0.471,0.403,0.524,0.63,0.912,0.918,0.923,0.899,0.899,0.47,0.435,0.543,0.902,0.924,0.925,0.922,0.919,0.92,0.931,0.65,0.598,0.653,0.658,0.906,0.898,0.923,0.883,0.875,0.882,0.89,0.895,0.889,0.889,0.887,0.889,0.878,0.89,0.871,0.748,51714,Activity,GFP_AEQVI,Low,Eukaryote
+GLPA_HUMAN_Elazar_2016,0.765,0.716,0.777,0.776,0.764,0.77,0.624,0.836,0.744,0.729,0.649,0.718,0.682,0.621,0.616,0.696,0.699,0.673,0.724,0.699,0.692,0.652,0.664,0.719,0.672,0.689,0.648,0.692,0.734,0.743,0.739,0.762,0.765,0.655,0.688,0.687,0.794,0.793,0.796,0.765,0.772,0.769,0.643,0.653,0.71,0.674,0.737,0.655,0.755,0.652,0.832,0.761,0.749,0.713,0.752,0.687,0.75,0.797,0.781,0.752,0.776,0.808,245,Expression,GLPA_HUMAN,Low,Human
+GRB2_HUMAN_Faure_2021,0.744,0.81,0.812,0.82,0.828,0.832,0.76,0.754,0.83,0.801,0.827,0.777,0.818,0.752,0.831,0.864,0.867,0.812,0.843,0.818,0.824,0.813,0.789,0.774,0.801,0.804,0.767,0.816,0.763,0.807,0.78,0.767,0.781,0.817,0.816,0.721,0.825,0.825,0.766,0.845,0.848,0.831,0.839,0.633,0.803,0.817,0.875,0.8,0.894,0.787,0.842,0.849,0.84,0.852,0.854,0.849,0.841,0.85,0.838,0.855,0.819,0.845,63366,OrganismalFitness,GRB2_HUMAN,Medium,Human
+HCP_LAMBD_Tsuboyama_2023_2L6Q,0.892,0.907,0.883,0.885,0.9,0.901,0.778,0.871,0.857,0.91,0.917,0.836,0.871,0.795,0.851,0.905,0.924,0.914,0.925,0.823,0.791,0.819,0.847,0.858,0.846,0.815,0.756,0.825,0.916,0.905,0.905,0.878,0.746,0.749,0.803,0.889,0.886,0.899,0.898,0.89,0.913,0.907,0.832,0.813,0.913,0.83,0.898,0.924,0.914,0.891,0.918,0.921,0.917,0.919,0.932,0.921,0.926,0.928,0.923,0.927,0.927,0.88,1040,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM,0.86,0.85,0.882,0.89,0.898,0.895,0.59,0.776,0.883,0.892,0.872,0.654,0.669,0.59,0.585,0.853,0.888,0.871,0.876,0.868,0.5,0.813,0.85,0.842,0.821,0.832,0.85,0.855,0.836,0.873,0.887,0.852,0.511,0.599,0.438,0.666,0.859,0.841,0.86,0.883,0.879,0.875,0.642,0.623,0.876,0.648,0.767,0.845,0.747,0.681,0.892,0.884,0.891,0.887,0.903,0.901,0.898,0.903,0.898,0.902,0.889,0.563,5586,Stability,HECD1_HUMAN,Medium,Human
+HEM3_HUMAN_Loggerenberg_2023,0.57,0.585,0.585,0.577,0.579,0.581,0.468,0.375,0.581,0.59,0.549,0.549,0.55,0.45,0.549,0.554,0.554,0.585,0.575,0.442,0.565,0.565,0.587,0.61,0.549,0.56,0.56,0.55,0.589,0.595,0.601,0.614,0.45,0.562,0.594,0.616,0.582,0.603,0.611,0.587,0.591,0.598,0.535,0.473,0.558,0.553,0.558,0.585,0.526,0.543,0.561,0.545,0.564,0.566,0.565,0.562,0.568,0.556,0.56,0.561,0.574,0.565,5689,Activity,HEM3_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019,0.826,0.882,0.873,0.874,0.864,0.861,0.667,0.385,0.852,0.863,0.86,0.812,0.839,0.549,0.645,0.744,0.867,0.88,0.882,0.8,0.811,0.857,0.871,0.875,0.852,0.867,0.858,0.862,0.859,0.865,0.841,0.837,0.575,0.844,0.859,0.881,0.85,0.842,0.875,0.88,0.872,0.882,0.699,0.676,0.811,0.596,0.854,0.884,0.869,0.825,0.848,0.812,0.837,0.838,0.842,0.838,0.835,0.842,0.843,0.84,0.874,0.574,496137,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+HMDH_HUMAN_Jiang_2019,0.635,0.631,0.633,0.629,0.633,0.632,0.594,0.69,0.626,0.627,0.644,0.644,0.645,0.535,0.536,0.581,0.652,0.638,0.644,0.63,0.627,0.626,0.635,0.637,0.647,0.635,0.632,0.638,0.636,0.638,0.635,0.629,0.561,0.608,0.646,0.644,0.64,0.646,0.644,0.632,0.634,0.633,0.544,0.522,0.668,0.634,0.624,0.655,0.505,0.571,0.647,0.642,0.652,0.654,0.644,0.652,0.646,0.651,0.64,0.643,0.663,0.626,16853,OrganismalFitness,HMDH_HUMAN,Low,Human
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.945,0.962,0.959,0.961,0.962,0.963,0.836,0.958,0.962,0.961,0.958,0.947,0.962,0.766,0.8,0.876,0.932,0.942,0.948,0.873,0.963,0.962,0.96,0.962,0.96,0.96,0.963,0.963,0.963,0.963,0.964,0.962,0.943,0.96,0.96,0.961,0.958,0.958,0.958,0.963,0.962,0.963,0.76,0.77,0.962,0.953,0.778,0.956,0.783,0.884,0.937,0.94,0.93,0.93,0.935,0.931,0.935,0.927,0.935,0.933,0.958,0.938,2252,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Flynn_2019,0.793,0.812,0.816,0.816,0.818,0.816,0.728,0.802,0.822,0.813,0.805,0.813,0.816,0.7,0.722,0.732,0.777,0.794,0.795,0.812,0.794,0.815,0.817,0.82,0.803,0.823,0.825,0.823,0.824,0.821,0.82,0.815,0.763,0.811,0.819,0.819,0.808,0.814,0.815,0.815,0.817,0.818,0.73,0.681,0.801,0.771,0.745,0.812,0.701,0.708,0.767,0.771,0.777,0.783,0.771,0.782,0.772,0.771,0.766,0.776,0.801,0.789,13294,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Mishra_2016,0.933,0.943,0.942,0.943,0.944,0.935,0.882,0.896,0.941,0.945,0.942,0.943,0.942,0.858,0.89,0.903,0.928,0.929,0.926,0.934,0.941,0.939,0.942,0.941,0.944,0.942,0.944,0.943,0.944,0.944,0.942,0.938,0.892,0.94,0.941,0.941,0.939,0.94,0.94,0.942,0.943,0.94,0.939,0.794,0.938,0.942,0.862,0.938,0.855,0.84,0.92,0.923,0.917,0.919,0.926,0.923,0.923,0.926,0.917,0.924,0.931,0.928,4323,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HXK4_HUMAN_Gersing_2022_activity,0.616,0.619,0.603,0.602,0.609,0.612,0.515,0.584,0.612,0.613,0.6,0.595,0.603,0.527,0.548,0.62,0.632,0.612,0.604,0.512,0.596,0.602,0.607,0.6,0.612,0.614,0.616,0.61,0.589,0.593,0.587,0.567,0.503,0.609,0.607,0.602,0.617,0.603,0.605,0.615,0.606,0.611,0.568,0.502,0.616,0.609,0.594,0.621,0.593,0.556,0.608,0.615,0.62,0.619,0.618,0.631,0.632,0.622,0.616,0.621,0.632,0.599,8570,OrganismalFitness,HXK4_HUMAN,Medium,Human
+HXK4_HUMAN_Gersing_2023_abundance,0.853,0.854,0.851,0.854,0.851,0.85,0.761,0.848,0.857,0.861,0.851,0.854,0.857,0.79,0.813,0.853,0.859,0.855,0.859,0.863,0.855,0.85,0.86,0.854,0.856,0.854,0.859,0.857,0.851,0.86,0.855,0.851,0.805,0.853,0.852,0.857,0.861,0.855,0.859,0.852,0.849,0.849,0.821,0.788,0.848,0.86,0.86,0.849,0.855,0.826,0.854,0.858,0.858,0.852,0.859,0.859,0.862,0.855,0.857,0.858,0.861,0.849,8396,Expression,HXK4_HUMAN,Medium,Human
+I6TAH8_I68A0_Doud_2015,0.772,0.763,0.754,0.753,0.776,0.776,0.619,0.744,0.756,0.771,0.617,0.612,0.613,0.611,0.608,0.613,0.625,0.617,0.645,0.731,0.727,0.754,0.761,0.771,0.615,0.626,0.635,0.615,0.755,0.785,0.718,0.713,0.666,0.759,0.755,0.754,0.774,0.769,0.768,0.779,0.776,0.772,0.608,0.613,0.618,0.616,0.677,0.679,0.674,0.661,0.609,0.643,0.646,0.635,0.627,0.615,0.618,0.615,0.606,0.624,0.645,0.625,9462,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+IF1_ECOLI_Kelsic_2016,0.907,0.943,0.934,0.932,0.934,0.935,0.862,0.907,0.892,0.888,0.944,0.936,0.937,0.875,0.936,0.942,0.94,0.932,0.939,0.938,0.932,0.942,0.941,0.942,0.942,0.947,0.947,0.943,0.941,0.886,0.931,0.929,0.857,0.935,0.939,0.949,0.934,0.937,0.945,0.936,0.937,0.937,0.919,0.868,0.945,0.939,0.873,0.931,0.939,0.911,0.919,0.938,0.946,0.939,0.945,0.943,0.933,0.944,0.946,0.946,0.938,0.925,1367,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33,0.788,0.811,0.827,0.821,0.82,0.819,0.751,0.815,0.805,0.809,0.831,0.809,0.804,0.781,0.782,0.796,0.803,0.832,0.825,0.812,0.836,0.819,0.83,0.835,0.868,0.845,0.859,0.843,0.841,0.825,0.822,0.808,0.812,0.742,0.787,0.857,0.79,0.834,0.85,0.8,0.803,0.82,0.788,0.767,0.829,0.799,0.811,0.833,0.812,0.806,0.838,0.809,0.824,0.838,0.84,0.823,0.832,0.827,0.838,0.836,0.843,0.795,1329,Stability,ILF3_HUMAN,High,Human
+ISDH_STAAW_Tsuboyama_2023_2LHR,0.765,0.722,0.784,0.781,0.785,0.786,0.773,0.758,0.831,0.799,0.8,0.811,0.821,0.828,0.814,0.876,0.838,0.842,0.854,0.77,0.786,0.771,0.797,0.811,0.819,0.823,0.845,0.781,0.825,0.793,0.818,0.797,0.7,0.765,0.751,0.793,0.765,0.758,0.779,0.781,0.771,0.785,0.811,0.783,0.821,0.809,0.908,0.894,0.887,0.865,0.84,0.837,0.858,0.841,0.831,0.845,0.839,0.844,0.821,0.847,0.882,0.833,1944,Stability,ISDH_STAAW,High,Prokaryote
+KCNE1_HUMAN_Muhammad_2023_expression,0.59,0.575,0.569,0.568,0.588,0.587,0.587,0.584,0.566,0.57,0.64,0.645,0.643,0.6,0.595,0.588,0.606,0.572,0.57,0.572,0.641,0.628,0.579,0.595,0.634,0.638,0.603,0.569,0.571,0.571,0.604,0.587,0.38,0.584,0.632,0.604,0.595,0.605,0.596,0.588,0.586,0.584,0.61,0.59,0.577,0.614,0.604,0.582,0.602,0.604,0.609,0.595,0.602,0.614,0.597,0.601,0.595,0.611,0.601,0.608,0.623,0.637,2339,Expression,KCNE1_HUMAN,Medium,Human
+KCNE1_HUMAN_Muhammad_2023_function,0.836,0.841,0.841,0.847,0.844,0.847,0.709,0.821,0.837,0.847,0.659,0.723,0.797,0.665,0.655,0.842,0.854,0.812,0.808,0.804,0.625,0.607,0.802,0.822,0.702,0.844,0.836,0.829,0.84,0.849,0.841,0.817,0.367,0.708,0.784,0.857,0.835,0.838,0.854,0.851,0.851,0.853,0.67,0.657,0.843,0.665,0.743,0.831,0.712,0.67,0.831,0.856,0.842,0.843,0.854,0.84,0.84,0.844,0.848,0.851,0.849,0.646,2315,Activity,KCNE1_HUMAN,Medium,Human
+KCNH2_HUMAN_Kozek_2020,0.639,0.641,0.639,0.639,0.64,0.64,0.625,0.521,0.614,0.633,0.678,0.549,0.537,0.556,0.525,0.582,0.52,0.558,0.606,0.697,0.652,0.648,0.698,0.679,0.683,0.693,0.63,0.661,0.643,0.678,0.601,0.641,0.593,0.633,0.644,0.661,0.628,0.647,0.68,0.63,0.648,0.646,0.709,0.513,0.68,0.646,0.645,0.461,0.27,0.52,0.702,0.682,0.686,0.717,0.697,0.718,0.676,0.678,0.703,0.702,0.698,0.66,200,Activity,KCNH2_HUMAN,Medium,Human
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.781,0.797,0.795,0.797,0.799,0.799,0.722,0.782,0.792,0.792,0.797,0.796,0.797,0.724,0.773,0.799,0.799,0.797,0.795,0.787,0.802,0.795,0.792,0.79,0.799,0.796,0.795,0.795,0.794,0.797,0.791,0.784,0.731,0.791,0.793,0.792,0.797,0.795,0.793,0.797,0.796,0.795,0.759,0.723,0.792,0.788,0.763,0.79,0.77,0.737,0.795,0.795,0.797,0.798,0.795,0.798,0.796,0.8,0.799,0.8,0.797,0.777,6963,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.83,0.82,0.823,0.822,0.823,0.824,0.715,0.803,0.821,0.821,0.829,0.822,0.824,0.746,0.834,0.842,0.84,0.838,0.841,0.812,0.826,0.804,0.806,0.818,0.824,0.815,0.815,0.821,0.812,0.821,0.824,0.817,0.763,0.821,0.792,0.812,0.823,0.812,0.813,0.823,0.818,0.819,0.815,0.714,0.821,0.831,0.817,0.813,0.81,0.796,0.832,0.823,0.828,0.827,0.832,0.826,0.831,0.827,0.828,0.829,0.832,0.831,6917,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+KKA2_KLEPN_Melnikov_2014,0.471,0.591,0.585,0.612,0.6,0.61,0.366,0.523,0.597,0.627,0.548,0.554,0.587,0.389,0.426,0.506,0.604,0.637,0.641,0.57,0.476,0.52,0.598,0.578,0.484,0.586,0.614,0.557,0.634,0.604,0.57,0.548,0.3,0.406,0.515,0.566,0.549,0.572,0.6,0.621,0.623,0.63,0.423,0.334,0.553,0.466,0.561,0.606,0.606,0.509,0.598,0.597,0.614,0.603,0.608,0.607,0.602,0.583,0.597,0.613,0.623,0.506,4960,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+LGK_LIPST_Klesmith_2015,0.853,0.937,0.935,0.936,0.937,0.937,0.794,0.896,0.881,0.928,0.859,0.916,0.926,0.802,0.826,0.844,0.903,0.929,0.931,0.901,0.826,0.868,0.887,0.908,0.847,0.908,0.933,0.894,0.937,0.945,0.935,0.921,0.802,0.82,0.865,0.948,0.847,0.864,0.938,0.933,0.936,0.944,0.794,0.783,0.876,0.832,0.853,0.907,0.901,0.832,0.907,0.917,0.909,0.918,0.911,0.913,0.912,0.91,0.916,0.918,0.876,0.841,7890,Activity,LGK_LIPST,Medium,Eukaryote
+LYAM1_HUMAN_Elazar_2016,0.679,0.693,0.65,0.608,0.666,0.678,0.669,0.682,0.76,0.754,0.732,0.604,0.65,0.687,0.675,0.712,0.636,0.649,0.755,0.747,0.712,0.721,0.706,0.643,0.685,0.63,0.663,0.755,0.666,0.709,0.663,0.616,0.614,0.677,0.712,0.633,0.668,0.711,0.639,0.679,0.704,0.67,0.658,0.724,0.632,0.673,0.75,0.673,0.684,0.652,0.617,0.702,0.775,0.627,0.657,0.676,0.695,0.716,0.535,0.659,0.72,0.758,359,Expression,LYAM1_HUMAN,Medium,Human
+MAFG_MOUSE_Tsuboyama_2023_1K1V,0.881,0.88,0.876,0.877,0.868,0.867,0.848,0.846,0.86,0.859,0.857,0.796,0.88,0.858,0.853,0.851,0.867,0.896,0.774,0.844,0.775,0.778,0.836,0.856,0.84,0.846,0.849,0.849,0.843,0.856,0.85,0.858,0.749,0.831,0.857,0.843,0.879,0.889,0.888,0.875,0.886,0.887,0.848,0.705,0.871,0.814,0.9,0.825,0.908,0.908,0.904,0.897,0.912,0.914,0.895,0.917,0.904,0.908,0.903,0.915,0.916,0.899,1429,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV,0.826,0.874,0.876,0.875,0.879,0.878,0.369,0.795,0.829,0.871,0.853,0.739,0.751,0.306,0.273,0.817,0.827,0.798,0.891,0.85,0.289,0.436,0.792,0.768,0.471,0.86,0.819,0.8,0.858,0.846,0.863,0.825,0.79,0.397,0.288,0.34,0.808,0.842,0.846,0.874,0.875,0.875,0.477,0.365,0.868,0.786,0.796,0.879,0.857,0.803,0.847,0.865,0.851,0.834,0.827,0.83,0.835,0.83,0.81,0.837,0.905,0.799,2116,Stability,MBD11_ARATH,Medium,Eukaryote
+MET_HUMAN_Estevam_2023,0.859,0.914,0.897,0.912,0.917,0.918,0.867,0.894,0.855,0.874,0.923,0.879,0.879,0.829,0.869,0.888,0.906,0.91,0.92,0.899,0.91,0.926,0.913,0.914,0.927,0.923,0.929,0.926,0.927,0.922,0.917,0.906,0.821,0.856,0.908,0.926,0.875,0.904,0.917,0.91,0.912,0.903,0.879,0.802,0.918,0.903,0.835,0.912,0.876,0.812,0.881,0.896,0.897,0.901,0.9,0.898,0.9,0.899,0.897,0.896,0.884,0.875,5393,Activity,MET_HUMAN,Medium,Human
+MK01_HUMAN_Brenan_2016,0.759,0.776,0.777,0.78,0.776,0.773,0.755,0.741,0.762,0.759,0.741,0.759,0.754,0.759,0.752,0.758,0.753,0.759,0.756,0.758,0.761,0.745,0.741,0.736,0.751,0.744,0.747,0.751,0.739,0.762,0.732,0.741,0.765,0.754,0.745,0.733,0.75,0.742,0.733,0.773,0.773,0.767,0.762,0.746,0.758,0.769,0.731,0.747,0.738,0.745,0.766,0.767,0.767,0.763,0.764,0.761,0.758,0.756,0.761,0.764,0.764,0.754,6809,OrganismalFitness,MK01_HUMAN,Medium,Human
+MLAC_ECOLI_MacRae_2023,0.847,0.9,0.902,0.902,0.904,0.902,0.752,0.887,0.891,0.899,0.883,0.894,0.902,0.799,0.833,0.835,0.864,0.884,0.891,0.906,0.804,0.901,0.903,0.903,0.848,0.887,0.902,0.885,0.906,0.911,0.903,0.904,0.748,0.779,0.889,0.894,0.852,0.887,0.89,0.898,0.907,0.905,0.81,0.793,0.863,0.837,0.704,0.839,0.797,0.765,0.862,0.856,0.846,0.855,0.868,0.858,0.852,0.854,0.862,0.863,0.84,0.815,4007,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+MSH2_HUMAN_Jia_2020,0.817,0.841,0.831,0.831,0.831,0.833,0.784,0.814,0.837,0.836,0.832,0.834,0.837,0.78,0.818,0.833,0.831,0.838,0.825,0.829,0.804,0.829,0.831,0.832,0.814,0.835,0.833,0.819,0.837,0.83,0.828,0.821,0.797,0.808,0.84,0.831,0.823,0.839,0.832,0.836,0.841,0.838,0.799,0.747,0.839,0.815,0.809,0.835,0.736,0.786,0.828,0.826,0.829,0.826,0.829,0.829,0.832,0.83,0.829,0.83,0.834,0.82,16749,OrganismalFitness,MSH2_HUMAN,Medium,Human
+MTH3_HAEAE_RockahShmuel_2015,0.529,0.78,0.784,0.792,0.789,0.784,0.455,0.766,0.73,0.736,0.621,0.73,0.756,0.389,0.484,0.519,0.588,0.612,0.658,0.718,0.505,0.537,0.656,0.73,0.558,0.74,0.776,0.715,0.764,0.771,0.785,0.765,0.547,0.529,0.566,0.747,0.523,0.566,0.725,0.778,0.776,0.782,0.464,0.288,0.523,0.461,0.564,0.688,0.704,0.55,0.621,0.645,0.632,0.601,0.594,0.644,0.633,0.637,0.608,0.63,0.664,0.538,1777,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+MTHR_HUMAN_Weile_2021,0.79,0.798,0.79,0.798,0.787,0.788,0.677,0.78,0.806,0.805,0.807,0.789,0.788,0.711,0.751,0.795,0.8,0.797,0.808,0.769,0.781,0.786,0.789,0.809,0.785,0.783,0.788,0.777,0.817,0.794,0.791,0.782,0.622,0.791,0.792,0.795,0.792,0.793,0.798,0.798,0.798,0.801,0.735,0.699,0.805,0.756,0.746,0.793,0.801,0.725,0.796,0.8,0.795,0.801,0.796,0.796,0.8,0.796,0.801,0.801,0.796,0.766,12464,OrganismalFitness,MTHR_HUMAN,Low,Human
+MYO3_YEAST_Tsuboyama_2023_2BTT,0.595,0.686,0.682,0.669,0.69,0.695,0.482,0.482,0.703,0.689,0.75,0.724,0.723,0.569,0.699,0.786,0.836,0.75,0.648,0.685,0.558,0.515,0.576,0.598,0.629,0.686,0.729,0.704,0.751,0.631,0.674,0.63,0.6,0.642,0.587,0.552,0.695,0.61,0.574,0.703,0.691,0.685,0.627,0.517,0.707,0.669,0.695,0.731,0.779,0.706,0.752,0.738,0.741,0.746,0.746,0.752,0.745,0.764,0.757,0.756,0.732,0.762,3297,Stability,MYO3_YEAST,High,Eukaryote
+NCAP_I34A1_Doud_2015,0.745,0.737,0.741,0.742,0.739,0.742,0.562,0.726,0.741,0.738,0.559,0.551,0.55,0.552,0.551,0.543,0.556,0.562,0.587,0.708,0.718,0.738,0.748,0.753,0.563,0.566,0.563,0.557,0.731,0.758,0.69,0.69,0.609,0.733,0.74,0.763,0.752,0.755,0.76,0.753,0.757,0.756,0.554,0.558,0.561,0.553,0.634,0.64,0.64,0.614,0.577,0.598,0.597,0.584,0.584,0.575,0.579,0.571,0.555,0.581,0.599,0.58,9462,OrganismalFitness,NCAP_I34A1,Medium,Virus
+NKX31_HUMAN_Tsuboyama_2023_2L9R,0.899,0.909,0.913,0.914,0.906,0.913,0.886,0.894,0.897,0.9,0.86,0.885,0.874,0.901,0.886,0.914,0.925,0.921,0.9,0.885,0.903,0.897,0.881,0.878,0.896,0.87,0.898,0.857,0.874,0.902,0.887,0.881,0.874,0.903,0.895,0.872,0.914,0.906,0.903,0.91,0.91,0.912,0.895,0.898,0.87,0.885,0.9,0.874,0.905,0.905,0.922,0.922,0.922,0.922,0.925,0.927,0.921,0.927,0.92,0.926,0.882,0.909,2482,Stability,NKX31_HUMAN,High,Human
+NPC1_HUMAN_Erwood_2022_HEK293T,0.932,0.942,0.927,0.939,0.94,0.937,0.635,0.889,0.956,0.955,0.925,0.763,0.794,0.685,0.721,0.858,0.929,0.932,0.915,0.612,0.687,0.838,0.771,0.809,0.789,0.798,0.946,0.893,0.904,0.947,0.94,0.906,0.669,0.716,0.909,0.923,0.908,0.946,0.939,0.927,0.953,0.947,0.717,0.607,0.935,0.806,0.783,0.945,0.576,0.716,0.927,0.925,0.92,0.905,0.902,0.906,0.918,0.912,0.939,0.921,0.916,0.719,637,Activity,NPC1_HUMAN,Low,Human
+NPC1_HUMAN_Erwood_2022_RPE1,0.966,0.943,0.944,0.969,0.943,0.969,0.951,0.96,0.982,0.985,0.961,0.691,0.678,0.589,0.717,0.724,0.741,0.847,0.978,0.92,0.901,0.976,0.957,0.955,0.86,0.857,0.974,0.958,0.896,0.968,0.955,0.937,0.887,0.919,0.907,0.904,0.969,0.971,0.96,0.971,0.97,0.944,0.806,0.92,0.936,0.565,0.945,0.926,0.776,0.837,0.701,0.846,0.671,0.798,0.72,0.714,0.732,0.943,0.95,0.741,0.939,0.778,63,Activity,NPC1_HUMAN,Low,Human
+NRAM_I33A0_Jiang_2016,0.879,0.947,0.933,0.933,0.952,0.953,0.606,0.919,0.957,0.963,0.575,0.684,0.922,0.568,0.539,0.564,0.7,0.914,0.917,0.872,0.95,0.942,0.924,0.912,0.537,0.898,0.895,0.889,0.945,0.944,0.831,0.841,0.482,0.941,0.93,0.946,0.914,0.923,0.934,0.955,0.953,0.953,0.535,0.568,0.549,0.568,0.729,0.766,0.75,0.644,0.658,0.688,0.713,0.717,0.748,0.773,0.703,0.764,0.641,0.757,0.649,0.561,298,OrganismalFitness,NRAM_I33A0,Low,Virus
+NUD15_HUMAN_Suiter_2020,0.693,0.82,0.817,0.8,0.82,0.822,0.522,0.699,0.808,0.841,0.821,0.806,0.836,0.615,0.637,0.658,0.77,0.804,0.806,0.747,0.616,0.697,0.8,0.823,0.674,0.851,0.819,0.812,0.836,0.829,0.848,0.818,0.573,0.642,0.713,0.846,0.716,0.745,0.841,0.818,0.824,0.841,0.666,0.545,0.811,0.685,0.685,0.823,0.71,0.728,0.753,0.759,0.738,0.764,0.754,0.774,0.779,0.78,0.772,0.777,0.826,0.742,2844,Expression,NUD15_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL,0.656,0.85,0.817,0.811,0.813,0.826,0.644,0.75,0.848,0.831,0.753,0.477,0.515,0.636,0.592,0.59,0.651,0.674,0.683,0.81,0.796,0.832,0.844,0.859,0.569,0.79,0.599,0.866,0.862,0.824,0.837,0.838,0.534,0.728,0.621,0.685,0.704,0.634,0.689,0.833,0.808,0.836,0.671,0.723,0.666,0.672,0.899,0.836,0.859,0.877,0.832,0.869,0.851,0.87,0.865,0.863,0.858,0.843,0.856,0.86,0.842,0.672,2028,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6,0.869,0.869,0.88,0.886,0.884,0.885,0.763,0.843,0.901,0.889,0.84,0.86,0.892,0.747,0.862,0.885,0.877,0.871,0.864,0.858,0.841,0.862,0.849,0.848,0.874,0.86,0.862,0.85,0.847,0.88,0.864,0.832,0.816,0.842,0.808,0.858,0.876,0.871,0.881,0.876,0.878,0.885,0.861,0.708,0.868,0.897,0.899,0.883,0.917,0.895,0.874,0.894,0.888,0.877,0.88,0.882,0.891,0.897,0.888,0.89,0.872,0.899,1380,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,0.815,0.901,0.922,0.927,0.924,0.922,0.619,0.81,0.896,0.896,0.887,0.875,0.882,0.627,0.863,0.893,0.913,0.903,0.894,0.902,0.878,0.867,0.867,0.86,0.873,0.867,0.88,0.86,0.85,0.897,0.903,0.895,0.788,0.624,0.674,0.811,0.875,0.876,0.891,0.922,0.918,0.921,0.832,0.619,0.843,0.853,0.906,0.847,0.934,0.928,0.926,0.929,0.933,0.931,0.932,0.927,0.92,0.922,0.925,0.935,0.934,0.882,3197,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G,0.833,0.832,0.851,0.852,0.815,0.832,0.831,0.824,0.843,0.845,0.857,0.808,0.79,0.825,0.817,0.862,0.881,0.833,0.857,0.864,0.847,0.845,0.864,0.835,0.821,0.843,0.865,0.852,0.852,0.865,0.838,0.83,0.824,0.817,0.816,0.823,0.853,0.82,0.826,0.851,0.825,0.83,0.855,0.822,0.867,0.869,0.881,0.891,0.883,0.911,0.866,0.863,0.866,0.876,0.872,0.867,0.873,0.871,0.862,0.871,0.88,0.836,1134,Stability,ODP2_GEOSE,High,Prokaryote
+OPSD_HUMAN_Wan_2019,0.585,0.704,0.757,0.773,0.781,0.78,0.439,0.77,0.627,0.8,0.527,0.67,0.703,0.551,0.646,0.533,0.602,0.648,0.574,0.686,0.749,0.72,0.643,0.647,0.729,0.66,0.674,0.719,0.659,0.763,0.639,0.468,0.166,0.682,0.651,0.538,0.758,0.686,0.686,0.779,0.757,0.77,0.506,0.317,0.665,0.622,0.615,0.697,0.541,0.383,0.536,0.472,0.449,0.472,0.471,0.512,0.527,0.559,0.522,0.516,0.764,0.677,165,Expression,OPSD_HUMAN,High,Human
+OTC_HUMAN_Lo_2023,0.669,0.803,0.772,0.774,0.78,0.789,0.4,0.551,0.775,0.797,0.729,0.768,0.778,0.423,0.573,0.626,0.695,0.676,0.7,0.721,0.649,0.687,0.746,0.778,0.659,0.764,0.757,0.756,0.779,0.792,0.763,0.707,0.474,0.63,0.736,0.771,0.685,0.739,0.778,0.788,0.783,0.796,0.509,0.384,0.675,0.581,0.751,0.736,0.809,0.637,0.702,0.717,0.739,0.713,0.71,0.707,0.706,0.739,0.693,0.715,0.807,0.677,1570,Activity,OTC_HUMAN,Medium,Human
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,0.61,0.755,0.632,0.652,0.663,0.665,0.583,0.707,0.595,0.599,0.838,0.796,0.797,0.581,0.806,0.806,0.727,0.753,0.804,0.64,0.577,0.715,0.746,0.756,0.729,0.734,0.766,0.742,0.692,0.748,0.764,0.721,0.734,0.607,0.674,0.75,0.628,0.638,0.75,0.666,0.655,0.707,0.651,0.603,0.78,0.827,0.821,0.792,0.84,0.807,0.759,0.769,0.771,0.766,0.772,0.759,0.767,0.73,0.738,0.762,0.869,0.844,635,Stability,OTU7A_HUMAN,High,Human
+OXDA_RHOTO_Vanella_2023_activity,0.453,0.585,0.59,0.595,0.597,0.606,0.445,0.451,0.481,0.494,0.547,0.518,0.528,0.435,0.45,0.52,0.542,0.585,0.59,0.494,0.438,0.475,0.504,0.488,0.458,0.502,0.534,0.508,0.577,0.607,0.594,0.572,0.391,0.46,0.467,0.477,0.492,0.49,0.489,0.597,0.6,0.595,0.447,0.421,0.536,0.478,0.541,0.585,0.585,0.499,0.549,0.569,0.562,0.563,0.554,0.577,0.563,0.569,0.546,0.574,0.575,0.49,6396,Activity,OXDA_RHOTO,High,Eukaryote
+OXDA_RHOTO_Vanella_2023_expression,0.664,0.737,0.73,0.732,0.738,0.746,0.68,0.673,0.694,0.693,0.727,0.708,0.718,0.666,0.686,0.716,0.728,0.744,0.75,0.65,0.642,0.696,0.711,0.693,0.676,0.716,0.715,0.7,0.73,0.746,0.727,0.718,0.636,0.675,0.674,0.687,0.691,0.683,0.688,0.739,0.741,0.743,0.667,0.657,0.722,0.688,0.748,0.755,0.726,0.63,0.739,0.73,0.729,0.726,0.728,0.736,0.728,0.731,0.727,0.735,0.746,0.71,6769,Expression,OXDA_RHOTO,High,Eukaryote
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.828,0.833,0.83,0.829,0.828,0.827,0.746,0.829,0.829,0.826,0.841,0.833,0.847,0.741,0.738,0.796,0.825,0.83,0.841,0.822,0.797,0.813,0.836,0.814,0.812,0.846,0.861,0.84,0.838,0.832,0.849,0.837,0.73,0.77,0.815,0.821,0.827,0.836,0.826,0.827,0.832,0.83,0.746,0.727,0.83,0.752,0.815,0.839,0.812,0.811,0.836,0.835,0.83,0.827,0.833,0.838,0.827,0.833,0.829,0.832,0.831,0.774,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.592,0.597,0.589,0.59,0.589,0.59,0.539,0.602,0.596,0.597,0.647,0.617,0.646,0.529,0.519,0.573,0.598,0.602,0.619,0.609,0.575,0.612,0.657,0.614,0.594,0.654,0.701,0.683,0.621,0.615,0.654,0.623,0.548,0.552,0.616,0.597,0.598,0.614,0.592,0.59,0.597,0.592,0.54,0.522,0.622,0.532,0.62,0.639,0.624,0.6,0.628,0.619,0.615,0.604,0.611,0.612,0.606,0.615,0.613,0.611,0.618,0.565,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.688,0.697,0.698,0.699,0.691,0.689,0.607,0.691,0.691,0.691,0.728,0.698,0.723,0.626,0.634,0.675,0.677,0.692,0.695,0.651,0.663,0.708,0.729,0.711,0.69,0.726,0.744,0.745,0.719,0.699,0.715,0.689,0.572,0.623,0.719,0.721,0.684,0.709,0.708,0.69,0.7,0.703,0.621,0.601,0.705,0.623,0.692,0.727,0.694,0.687,0.698,0.706,0.703,0.704,0.698,0.703,0.689,0.688,0.681,0.698,0.7,0.656,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Kotler_2018,0.859,0.841,0.843,0.839,0.817,0.821,0.587,0.829,0.835,0.839,0.838,0.825,0.835,0.623,0.61,0.773,0.817,0.829,0.831,0.804,0.776,0.798,0.803,0.81,0.793,0.811,0.804,0.804,0.787,0.851,0.821,0.813,0.566,0.727,0.802,0.788,0.863,0.855,0.855,0.833,0.844,0.84,0.622,0.588,0.837,0.637,0.713,0.832,0.764,0.744,0.795,0.827,0.812,0.811,0.808,0.814,0.801,0.808,0.806,0.807,0.851,0.728,1048,OrganismalFitness,P53_HUMAN,Low,Human
+P84126_THETH_Chan_2017,0.749,0.861,0.905,0.905,0.874,0.876,0.642,0.855,0.906,0.879,0.891,0.856,0.857,0.721,0.86,0.864,0.901,0.889,0.905,0.897,0.773,0.823,0.827,0.86,0.796,0.858,0.908,0.855,0.945,0.803,0.905,0.877,0.782,0.801,0.755,0.837,0.775,0.744,0.779,0.847,0.844,0.86,0.757,0.582,0.824,0.828,0.748,0.902,0.893,0.698,0.861,0.878,0.856,0.863,0.888,0.892,0.886,0.886,0.899,0.891,0.896,0.895,1519,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+PA_I34A1_Wu_2015,0.463,0.46,0.464,0.453,0.457,0.453,0.228,0.429,0.245,0.258,0.231,0.236,0.245,0.22,0.218,0.22,0.215,0.215,0.401,0.391,0.417,0.422,0.41,0.455,0.241,0.387,0.407,0.381,0.409,0.431,0.396,0.417,0.266,0.413,0.435,0.439,0.46,0.454,0.449,0.455,0.459,0.451,0.226,0.223,0.23,0.232,0.265,0.257,0.205,0.273,0.209,0.209,0.222,0.212,0.215,0.225,0.211,0.216,0.215,0.229,0.254,0.213,1820,OrganismalFitness,PA_I34A1,Medium,Virus
+PABP_YEAST_Melamed_2013,0.755,0.745,0.735,0.738,0.764,0.76,0.603,0.734,0.753,0.764,0.759,0.765,0.764,0.609,0.634,0.671,0.722,0.717,0.739,0.727,0.731,0.753,0.754,0.761,0.741,0.756,0.764,0.759,0.745,0.776,0.767,0.744,0.524,0.72,0.74,0.741,0.745,0.757,0.751,0.76,0.766,0.763,0.7,0.447,0.759,0.744,0.474,0.75,0.681,0.532,0.709,0.701,0.711,0.709,0.709,0.709,0.712,0.716,0.718,0.713,0.75,0.684,37708,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+PAI1_HUMAN_Huttinger_2021,0.797,0.83,0.823,0.826,0.823,0.823,0.694,0.775,0.807,0.823,0.81,0.797,0.809,0.696,0.8,0.806,0.812,0.816,0.803,0.811,0.701,0.796,0.798,0.806,0.793,0.813,0.813,0.814,0.812,0.816,0.805,0.792,0.721,0.735,0.804,0.81,0.795,0.808,0.811,0.828,0.827,0.829,0.784,0.688,0.805,0.802,0.781,0.831,0.814,0.751,0.815,0.808,0.813,0.812,0.813,0.816,0.812,0.812,0.814,0.817,0.825,0.798,5345,Activity,PAI1_HUMAN,,Human
+PHOT_CHLRE_Chen_2023,0.707,0.809,0.89,0.882,0.781,0.78,0.843,0.835,0.886,0.883,0.851,0.877,0.889,0.89,0.903,0.885,0.897,0.882,0.889,0.827,0.845,0.875,0.848,0.872,0.836,0.859,0.871,0.868,0.833,0.827,0.841,0.81,0.756,0.798,0.782,0.867,0.839,0.824,0.866,0.788,0.798,0.791,0.875,0.783,0.872,0.884,0.703,0.848,0.835,0.735,0.838,0.815,0.816,0.819,0.819,0.827,0.826,0.828,0.83,0.826,0.882,0.883,167529,Activity,PHOT_CHLRE,High,Eukaryote
+PIN1_HUMAN_Tsuboyama_2023_1I6C,0.873,0.868,0.9,0.898,0.889,0.896,0.847,0.821,0.875,0.896,0.888,0.848,0.89,0.862,0.878,0.892,0.895,0.819,0.806,0.895,0.752,0.885,0.89,0.871,0.826,0.859,0.883,0.878,0.864,0.883,0.889,0.863,0.869,0.812,0.87,0.891,0.869,0.894,0.899,0.89,0.9,0.905,0.851,0.644,0.907,0.839,0.889,0.897,0.889,0.868,0.884,0.895,0.88,0.869,0.865,0.89,0.879,0.888,0.887,0.888,0.903,0.901,802,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M,0.888,0.864,0.868,0.867,0.874,0.87,0.863,0.804,0.873,0.899,0.812,0.85,0.85,0.86,0.887,0.897,0.9,0.885,0.863,0.867,0.826,0.812,0.826,0.822,0.851,0.838,0.841,0.82,0.82,0.858,0.828,0.822,0.806,0.841,0.859,0.797,0.86,0.874,0.836,0.866,0.877,0.861,0.858,0.838,0.791,0.852,0.832,0.773,0.898,0.857,0.892,0.894,0.904,0.9,0.906,0.891,0.898,0.879,0.893,0.897,0.807,0.888,1824,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF,0.866,0.888,0.844,0.835,0.846,0.866,0.864,0.821,0.803,0.807,0.867,0.86,0.871,0.857,0.859,0.894,0.863,0.866,0.865,0.862,0.881,0.89,0.875,0.875,0.887,0.872,0.881,0.902,0.864,0.829,0.87,0.865,0.783,0.874,0.898,0.904,0.872,0.882,0.873,0.877,0.87,0.859,0.848,0.853,0.871,0.843,0.887,0.872,0.876,0.894,0.859,0.861,0.872,0.868,0.863,0.861,0.862,0.868,0.865,0.866,0.89,0.877,1301,Stability,PKN1_HUMAN,High,Human
+POLG_CXB3N_Mattenberger_2021,0.768,0.817,0.798,0.816,0.821,0.824,0.574,0.733,0.815,0.821,0.658,0.581,0.596,0.583,0.577,0.624,0.705,0.727,0.754,0.757,0.738,0.788,0.795,0.79,0.632,0.764,0.772,0.765,0.801,0.82,0.759,0.75,0.598,0.599,0.718,0.773,0.756,0.775,0.792,0.793,0.809,0.816,0.58,0.576,0.682,0.585,0.627,0.692,0.596,0.618,0.683,0.686,0.688,0.687,0.684,0.678,0.691,0.69,0.69,0.69,0.597,0.588,15711,OrganismalFitness,POLG_CXB3N,Medium,Virus
+POLG_DEN26_Suphatrakul_2023,0.773,0.829,0.767,0.781,0.813,0.816,0.472,0.761,0.896,0.898,0.568,0.46,0.474,0.45,0.459,0.487,0.504,0.547,0.595,0.784,0.761,0.761,0.763,0.715,0.77,0.783,0.8,0.798,0.81,0.87,0.854,0.794,0.512,0.455,0.571,0.795,0.75,0.776,0.834,0.826,0.83,0.856,0.467,0.448,0.629,0.464,0.603,0.727,0.453,0.555,0.523,0.526,0.56,0.534,0.534,0.539,0.529,0.536,0.528,0.548,0.498,0.495,16897,OrganismalFitness,POLG_DEN26,Low,Virus
+POLG_HCVJF_Qi_2014,0.679,0.708,0.686,0.692,0.705,0.712,0.186,0.284,0.703,0.7,0.283,0.645,0.643,0.186,0.202,0.196,0.229,0.212,0.237,0.484,0.4,0.556,0.631,0.596,0.542,0.588,0.467,0.5,0.615,0.732,0.601,0.498,0.286,0.646,0.654,0.619,0.667,0.667,0.669,0.682,0.668,0.672,0.197,0.177,0.596,0.228,0.195,0.435,0.517,0.406,0.335,0.4,0.284,0.408,0.33,0.299,0.413,0.298,0.299,0.362,0.276,0.248,1630,OrganismalFitness,POLG_HCVJF,Medium,Virus
+POLG_PESV_Tsuboyama_2023_2MXD,0.582,0.867,0.806,0.816,0.825,0.828,0.544,0.815,0.703,0.845,0.818,0.581,0.589,0.584,0.59,0.567,0.613,0.606,0.589,0.891,0.518,0.5,0.496,0.547,0.542,0.512,0.526,0.462,0.577,0.876,0.891,0.895,0.517,0.529,0.506,0.553,0.704,0.706,0.698,0.81,0.813,0.805,0.648,0.595,0.643,0.623,0.845,0.827,0.902,0.884,0.896,0.898,0.887,0.905,0.904,0.917,0.903,0.922,0.904,0.913,0.883,0.784,5130,Stability,POLG_PESV,Medium,Virus
+PPARG_HUMAN_Majithia_2016,0.785,0.863,0.872,0.866,0.872,0.867,0.773,0.875,0.865,0.877,0.861,0.879,0.888,0.741,0.766,0.78,0.807,0.855,0.889,0.882,0.868,0.902,0.863,0.866,0.842,0.916,0.917,0.917,0.861,0.897,0.849,0.8,0.783,0.884,0.904,0.905,0.871,0.905,0.9,0.874,0.889,0.885,0.757,0.709,0.873,0.778,0.83,0.888,0.848,0.793,0.825,0.832,0.822,0.816,0.831,0.821,0.825,0.819,0.81,0.821,0.841,0.789,9576,Activity,PPARG_HUMAN,Medium,Human
+PPM1D_HUMAN_Miller_2022,0.805,0.807,0.807,0.809,0.807,0.807,0.728,0.774,0.81,0.807,0.808,0.817,0.815,0.744,0.754,0.784,0.804,0.814,0.813,0.801,0.776,0.783,0.818,0.812,0.801,0.818,0.815,0.818,0.813,0.81,0.807,0.787,0.735,0.778,0.803,0.82,0.807,0.812,0.815,0.808,0.809,0.809,0.766,0.717,0.807,0.79,0.792,0.817,0.804,0.772,0.801,0.8,0.798,0.799,0.798,0.798,0.796,0.794,0.8,0.8,0.819,0.783,7889,OrganismalFitness,PPM1D_HUMAN,Low,Human
+PR40A_HUMAN_Tsuboyama_2023_1UZC,0.857,0.872,0.916,0.913,0.907,0.91,0.76,0.854,0.917,0.922,0.924,0.805,0.847,0.765,0.771,0.894,0.917,0.915,0.898,0.901,0.864,0.911,0.918,0.902,0.895,0.911,0.917,0.917,0.905,0.901,0.903,0.878,0.811,0.814,0.818,0.858,0.884,0.888,0.895,0.908,0.914,0.909,0.784,0.766,0.884,0.78,0.899,0.862,0.917,0.901,0.919,0.924,0.924,0.922,0.92,0.923,0.923,0.922,0.919,0.926,0.92,0.906,2033,Stability,PR40A_HUMAN,Medium,Human
+PRKN_HUMAN_Clausen_2023,0.863,0.87,0.876,0.878,0.874,0.874,0.652,0.823,0.867,0.87,0.874,0.861,0.878,0.683,0.697,0.72,0.764,0.86,0.877,0.852,0.708,0.855,0.877,0.871,0.767,0.876,0.871,0.881,0.862,0.874,0.858,0.828,0.627,0.696,0.85,0.873,0.857,0.871,0.873,0.873,0.875,0.874,0.711,0.584,0.881,0.717,0.855,0.873,0.868,0.773,0.774,0.803,0.815,0.821,0.796,0.791,0.8,0.799,0.799,0.804,0.868,0.787,8756,Expression,PRKN_HUMAN,Low,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE,0.767,0.759,0.726,0.731,0.753,0.754,0.666,0.709,0.722,0.72,0.863,0.796,0.801,0.681,0.71,0.823,0.839,0.806,0.803,0.774,0.661,0.627,0.722,0.735,0.734,0.754,0.511,0.741,0.762,0.737,0.776,0.727,0.658,0.657,0.715,0.742,0.773,0.764,0.759,0.755,0.752,0.765,0.691,0.659,0.813,0.673,0.838,0.811,0.844,0.821,0.841,0.825,0.822,0.823,0.829,0.821,0.83,0.835,0.825,0.835,0.806,0.815,1579,Stability,PSAE_PICP2,Medium,Prokaryote
+PTEN_HUMAN_Matreyek_2021,0.766,0.79,0.786,0.78,0.779,0.783,0.686,0.772,0.791,0.799,0.789,0.791,0.805,0.671,0.732,0.775,0.796,0.764,0.767,0.797,0.692,0.807,0.799,0.801,0.727,0.783,0.785,0.799,0.779,0.812,0.783,0.779,0.701,0.73,0.806,0.791,0.765,0.806,0.801,0.788,0.798,0.797,0.701,0.672,0.799,0.751,0.769,0.797,0.78,0.754,0.795,0.79,0.795,0.798,0.791,0.79,0.794,0.801,0.799,0.798,0.785,0.769,5083,Expression,PTEN_HUMAN,Medium,Human
+PTEN_HUMAN_Mighell_2018,0.858,0.856,0.858,0.864,0.865,0.863,0.755,0.842,0.855,0.854,0.856,0.857,0.864,0.781,0.831,0.855,0.851,0.847,0.85,0.859,0.806,0.841,0.855,0.841,0.83,0.845,0.845,0.836,0.848,0.853,0.85,0.843,0.718,0.826,0.853,0.843,0.854,0.855,0.856,0.861,0.861,0.86,0.822,0.749,0.85,0.852,0.833,0.851,0.851,0.824,0.855,0.853,0.852,0.853,0.855,0.855,0.852,0.853,0.848,0.851,0.853,0.842,7260,Activity,PTEN_HUMAN,Medium,Human
+Q2N0S5_9HIV1_Haddox_2018,0.869,0.849,0.83,0.84,0.856,0.858,0.597,0.822,0.87,0.87,0.847,0.867,0.872,0.623,0.618,0.608,0.635,0.652,0.705,0.843,0.869,0.829,0.818,0.803,0.87,0.828,0.822,0.831,0.824,0.872,0.833,0.83,0.737,0.865,0.837,0.84,0.869,0.858,0.863,0.865,0.857,0.858,0.783,0.58,0.868,0.858,0.723,0.866,0.641,0.712,0.676,0.69,0.716,0.697,0.693,0.698,0.695,0.692,0.68,0.698,0.682,0.649,12729,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.791,0.806,0.807,0.807,0.807,0.806,0.707,0.764,0.808,0.809,0.803,0.806,0.81,0.705,0.717,0.784,0.792,0.808,0.803,0.804,0.784,0.768,0.77,0.768,0.811,0.798,0.796,0.798,0.779,0.775,0.783,0.768,0.737,0.773,0.766,0.763,0.789,0.781,0.775,0.804,0.802,0.796,0.787,0.693,0.806,0.794,0.786,0.806,0.77,0.777,0.784,0.791,0.807,0.793,0.804,0.8,0.804,0.805,0.789,0.799,0.818,0.814,3344,Binding,Q53Z42_HUMAN,Medium,Human
+Q53Z42_HUMAN_McShan_2019_expression,0.774,0.778,0.785,0.784,0.789,0.786,0.483,0.735,0.783,0.792,0.775,0.759,0.776,0.532,0.559,0.724,0.762,0.803,0.802,0.798,0.736,0.749,0.745,0.741,0.783,0.789,0.784,0.786,0.791,0.76,0.769,0.754,0.619,0.763,0.742,0.74,0.796,0.783,0.777,0.791,0.793,0.791,0.673,0.517,0.783,0.745,0.721,0.786,0.719,0.644,0.749,0.767,0.79,0.777,0.785,0.794,0.795,0.796,0.775,0.782,0.781,0.827,3344,Expression,Q53Z42_HUMAN,Medium,Human
+Q59976_STRSQ_Romero_2015,0.845,0.877,0.873,0.872,0.88,0.875,0.757,0.813,0.88,0.88,0.857,0.814,0.829,0.728,0.798,0.826,0.842,0.85,0.854,0.879,0.862,0.883,0.886,0.887,0.868,0.888,0.892,0.885,0.883,0.879,0.868,0.851,0.768,0.86,0.885,0.881,0.861,0.877,0.879,0.876,0.877,0.879,0.813,0.714,0.861,0.833,0.829,0.862,0.86,0.772,0.85,0.844,0.857,0.848,0.852,0.841,0.854,0.847,0.845,0.855,0.861,0.834,2999,Activity,Q59976_STRSQ,Medium,Prokaryote
+Q6WV13_9MAXI_Somermeyer_2022,0.658,0.724,0.619,0.623,0.66,0.658,0.429,0.526,0.731,0.728,0.622,0.487,0.473,0.465,0.484,0.465,0.458,0.471,0.441,0.67,0.443,0.496,0.464,0.506,0.445,0.441,0.464,0.448,0.448,0.728,0.698,0.701,0.482,0.484,0.462,0.499,0.63,0.624,0.631,0.658,0.657,0.659,0.545,0.515,0.534,0.533,0.66,0.605,0.643,0.595,0.585,0.594,0.604,0.659,0.645,0.648,0.641,0.638,0.638,0.635,0.524,0.513,31401,Activity,Q6WV12_9MAXI,Low,Eukaryote
+Q837P4_ENTFA_Meier_2023,0.898,0.921,0.918,0.93,0.928,0.942,0.838,0.891,0.923,0.927,0.938,0.92,0.94,0.873,0.885,0.901,0.926,0.92,0.926,0.807,0.93,0.934,0.931,0.936,0.922,0.932,0.933,0.939,0.937,0.927,0.923,0.906,0.859,0.928,0.937,0.937,0.926,0.938,0.931,0.94,0.94,0.939,0.896,0.882,0.914,0.897,0.862,0.925,0.886,0.847,0.934,0.931,0.928,0.936,0.933,0.93,0.935,0.927,0.925,0.94,0.948,0.888,697,Activity,Q837P4_ENTFA,Medium,Prokaryote
+Q837P5_ENTFA_Meier_2023,0.813,0.919,0.915,0.912,0.918,0.927,0.867,0.883,0.792,0.813,0.861,0.867,0.862,0.767,0.842,0.839,0.844,0.882,0.873,0.85,0.858,0.892,0.924,0.918,0.836,0.897,0.911,0.917,0.935,0.897,0.888,0.908,0.803,0.856,0.895,0.916,0.861,0.886,0.911,0.917,0.919,0.918,0.852,0.777,0.877,0.848,0.84,0.894,0.903,0.796,0.864,0.859,0.85,0.842,0.854,0.831,0.86,0.862,0.842,0.857,0.836,0.846,747,Activity,Q837P5_ENTFA,Medium,Prokaryote
+Q8WTC7_9CNID_Somermeyer_2022,0.592,0.684,0.598,0.598,0.646,0.645,0.481,0.666,0.665,0.67,0.594,0.487,0.484,0.478,0.47,0.479,0.484,0.492,0.508,0.612,0.5,0.499,0.511,0.499,0.479,0.478,0.495,0.633,0.64,0.685,0.656,0.661,0.477,0.464,0.478,0.675,0.584,0.592,0.673,0.641,0.649,0.685,0.535,0.538,0.537,0.535,0.604,0.606,0.679,0.605,0.594,0.579,0.598,0.599,0.591,0.609,0.598,0.598,0.595,0.597,0.539,0.45,33510,Activity,Q8WTC7_9CNID,Low,Eukaryote
+R1AB_SARS2_Flynn_2022,0.928,0.931,0.916,0.914,0.943,0.946,0.596,0.889,0.621,0.621,0.669,0.626,0.609,0.625,0.647,0.648,0.637,0.889,0.937,0.89,0.838,0.839,0.859,0.854,0.845,0.808,0.828,0.828,0.818,0.935,0.897,0.876,0.602,0.769,0.828,0.792,0.912,0.922,0.913,0.936,0.945,0.939,0.623,0.583,0.68,0.623,0.824,0.843,0.871,0.792,0.736,0.75,0.771,0.778,0.787,0.758,0.763,0.749,0.75,0.778,0.721,0.718,5725,OrganismalFitness,R1AB_SARS2,Medium,Virus
+RAD_ANTMA_Tsuboyama_2023_2CJJ,0.797,0.754,0.752,0.754,0.76,0.769,0.739,0.762,0.795,0.807,0.797,0.716,0.761,0.777,0.825,0.835,0.806,0.822,0.75,0.806,0.856,0.8,0.771,0.786,0.808,0.779,0.758,0.773,0.756,0.809,0.748,0.711,0.649,0.772,0.8,0.772,0.819,0.813,0.794,0.784,0.788,0.795,0.802,0.717,0.731,0.786,0.805,0.697,0.831,0.821,0.789,0.82,0.804,0.808,0.821,0.802,0.8,0.807,0.812,0.81,0.77,0.833,912,Stability,RAD_ANTMA,High,Eukaryote
+RAF1_HUMAN_Zinkus-Boltz_2019,0.749,0.785,0.775,0.783,0.788,0.789,0.704,0.82,0.782,0.789,0.769,0.805,0.806,0.688,0.713,0.795,0.763,0.786,0.801,0.788,0.806,0.792,0.793,0.796,0.794,0.815,0.814,0.801,0.799,0.802,0.773,0.792,0.738,0.805,0.804,0.791,0.791,0.788,0.787,0.797,0.784,0.788,0.726,0.712,0.747,0.804,0.715,0.749,0.704,0.735,0.764,0.763,0.773,0.751,0.745,0.759,0.763,0.742,0.73,0.753,0.76,0.658,297,OrganismalFitness,RAF1_HUMAN,Low,Human
+RASH_HUMAN_Bandaru_2017,0.73,0.717,0.729,0.723,0.718,0.72,0.534,0.654,0.709,0.716,0.714,0.707,0.704,0.723,0.746,0.741,0.726,0.723,0.71,0.74,0.731,0.72,0.71,0.707,0.723,0.704,0.701,0.701,0.686,0.711,0.707,0.705,0.625,0.73,0.722,0.705,0.743,0.733,0.726,0.722,0.721,0.722,0.733,0.685,0.712,0.731,0.674,0.652,0.723,0.658,0.713,0.721,0.712,0.717,0.712,0.718,0.716,0.714,0.708,0.716,0.715,0.729,3134,Activity,RASH_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_abundance,0.743,0.724,0.738,0.736,0.735,0.74,0.696,0.696,0.731,0.745,0.724,0.718,0.722,0.763,0.752,0.756,0.736,0.731,0.726,0.733,0.691,0.737,0.736,0.745,0.721,0.748,0.751,0.741,0.75,0.712,0.725,0.711,0.666,0.684,0.732,0.744,0.696,0.737,0.757,0.735,0.743,0.746,0.731,0.741,0.687,0.718,0.727,0.72,0.763,0.726,0.738,0.733,0.735,0.736,0.74,0.733,0.741,0.735,0.736,0.739,0.737,0.746,26012,Expression,RASK_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.802,0.909,0.917,0.918,0.916,0.915,0.619,0.809,0.923,0.931,0.897,0.903,0.908,0.893,0.918,0.919,0.925,0.92,0.883,0.927,0.918,0.911,0.907,0.893,0.926,0.915,0.898,0.898,0.873,0.885,0.903,0.898,0.805,0.917,0.915,0.89,0.914,0.909,0.888,0.919,0.918,0.916,0.905,0.772,0.891,0.908,0.782,0.808,0.916,0.782,0.9,0.888,0.888,0.893,0.896,0.893,0.894,0.897,0.896,0.897,0.895,0.901,24873,Binding,RASK_HUMAN,High,Human
+RBP1_HUMAN_Tsuboyama_2023_2KWH,0.824,0.791,0.819,0.828,0.822,0.828,0.829,0.752,0.787,0.813,0.86,0.838,0.848,0.841,0.843,0.858,0.87,0.809,0.819,0.826,0.838,0.779,0.786,0.783,0.839,0.735,0.753,0.789,0.798,0.807,0.819,0.795,0.716,0.825,0.829,0.833,0.83,0.826,0.828,0.826,0.827,0.824,0.854,0.845,0.853,0.848,0.852,0.862,0.863,0.835,0.869,0.858,0.859,0.857,0.866,0.864,0.86,0.856,0.861,0.864,0.849,0.842,1332,Stability,RBP1_HUMAN,High,Human
+RCD1_ARATH_Tsuboyama_2023_5OAO,0.787,0.765,0.784,0.784,0.782,0.782,0.704,0.714,0.759,0.767,0.815,0.775,0.793,0.76,0.799,0.809,0.822,0.782,0.815,0.779,0.728,0.617,0.727,0.759,0.715,0.776,0.771,0.746,0.794,0.757,0.765,0.725,0.482,0.745,0.75,0.769,0.799,0.805,0.794,0.797,0.791,0.784,0.775,0.779,0.805,0.777,0.814,0.836,0.821,0.814,0.814,0.812,0.808,0.806,0.806,0.807,0.813,0.805,0.803,0.813,0.806,0.842,1261,Stability,RCD1_ARATH,Medium,Eukaryote
+RCRO_LAMBD_Tsuboyama_2023_1ORC,0.777,0.868,0.899,0.903,0.9,0.909,0.667,0.676,0.9,0.91,0.874,0.814,0.878,0.75,0.812,0.846,0.874,0.857,0.866,0.866,0.709,0.752,0.75,0.735,0.614,0.781,0.591,0.718,0.877,0.871,0.902,0.889,0.635,0.665,0.611,0.84,0.789,0.784,0.871,0.905,0.901,0.901,0.786,0.783,0.856,0.776,0.954,0.933,0.941,0.929,0.913,0.913,0.925,0.93,0.924,0.92,0.92,0.921,0.922,0.924,0.928,0.724,2278,Stability,RCRO_LAMBD,High,Virus
+RD23A_HUMAN_Tsuboyama_2023_1IFY,0.897,0.871,0.897,0.894,0.898,0.895,0.783,0.855,0.839,0.888,0.861,0.855,0.872,0.792,0.895,0.857,0.862,0.872,0.855,0.883,0.845,0.879,0.859,0.874,0.883,0.885,0.881,0.886,0.873,0.873,0.849,0.828,0.865,0.773,0.857,0.851,0.884,0.896,0.886,0.904,0.905,0.897,0.834,0.709,0.878,0.854,0.886,0.9,0.908,0.911,0.874,0.876,0.879,0.879,0.885,0.88,0.879,0.881,0.873,0.881,0.892,0.888,1019,Stability,RD23A_HUMAN,High,Human
+RDRP_I33A0_Li_2023,0.616,0.687,0.779,0.782,0.798,0.797,0.494,0.759,0.754,0.755,0.535,0.516,0.521,0.513,0.513,0.534,0.63,0.726,0.763,0.81,0.744,0.763,0.78,0.787,0.561,0.757,0.746,0.74,0.781,0.82,0.764,0.747,0.539,0.76,0.775,0.793,0.768,0.781,0.787,0.808,0.811,0.811,0.518,0.518,0.562,0.526,0.564,0.596,0.518,0.55,0.646,0.638,0.647,0.638,0.641,0.639,0.641,0.644,0.638,0.643,0.539,0.523,12003,OrganismalFitness,RDRP_I33A0,Low,Virus
+REV_HV1H2_Fernandes_2016,0.703,0.694,0.71,0.706,0.709,0.705,0.59,0.703,0.7,0.702,0.602,0.695,0.695,0.61,0.606,0.627,0.704,0.713,0.703,0.688,0.687,0.695,0.686,0.71,0.69,0.715,0.635,0.693,0.703,0.705,0.701,0.706,0.581,0.699,0.703,0.696,0.701,0.705,0.692,0.705,0.702,0.704,0.604,0.594,0.678,0.607,0.67,0.657,0.687,0.693,0.695,0.696,0.697,0.703,0.706,0.699,0.7,0.694,0.705,0.701,0.668,0.682,2147,OrganismalFitness,REV_HV1H2,Medium,Virus
+RFAH_ECOLI_Tsuboyama_2023_2LCL,0.631,0.641,0.633,0.625,0.62,0.626,0.588,0.518,0.583,0.66,0.719,0.651,0.687,0.588,0.62,0.636,0.711,0.621,0.576,0.575,0.549,0.622,0.58,0.578,0.658,0.629,0.639,0.598,0.554,0.565,0.598,0.537,0.421,0.595,0.58,0.57,0.618,0.555,0.618,0.641,0.602,0.629,0.633,0.588,0.693,0.636,0.681,0.693,0.765,0.697,0.693,0.698,0.707,0.699,0.682,0.689,0.722,0.698,0.721,0.712,0.729,0.635,1326,Stability,RFAH_ECOLI,High,Prokaryote
+RL20_AQUAE_Tsuboyama_2023_1GYZ,0.892,0.944,0.939,0.934,0.935,0.938,0.801,0.928,0.892,0.879,0.941,0.948,0.947,0.818,0.865,0.922,0.939,0.935,0.931,0.935,0.891,0.924,0.913,0.901,0.928,0.902,0.917,0.935,0.916,0.903,0.936,0.935,0.756,0.913,0.936,0.898,0.92,0.935,0.913,0.937,0.941,0.932,0.896,0.823,0.936,0.928,0.956,0.941,0.953,0.948,0.944,0.946,0.947,0.947,0.942,0.944,0.94,0.944,0.943,0.944,0.945,0.945,1461,Stability,RL20_AQUAE,High,Prokaryote
+RL40A_YEAST_Mavor_2016,0.808,0.845,0.848,0.863,0.855,0.855,0.526,0.829,0.875,0.876,0.788,0.83,0.859,0.674,0.775,0.809,0.853,0.863,0.861,0.855,0.85,0.852,0.849,0.858,0.869,0.855,0.858,0.856,0.864,0.854,0.855,0.861,0.64,0.844,0.834,0.84,0.85,0.857,0.854,0.874,0.877,0.868,0.832,0.627,0.834,0.826,0.527,0.703,0.67,0.542,0.856,0.87,0.858,0.845,0.867,0.854,0.868,0.868,0.865,0.864,0.868,0.773,1253,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2013,0.926,0.939,0.94,0.951,0.941,0.944,0.718,0.927,0.955,0.955,0.912,0.929,0.953,0.821,0.905,0.891,0.952,0.944,0.952,0.952,0.94,0.946,0.95,0.949,0.959,0.947,0.952,0.948,0.961,0.954,0.959,0.96,0.832,0.933,0.947,0.937,0.942,0.955,0.947,0.958,0.961,0.954,0.944,0.792,0.94,0.934,0.715,0.857,0.843,0.728,0.945,0.96,0.951,0.947,0.955,0.947,0.953,0.953,0.951,0.949,0.956,0.912,1195,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2014,0.952,0.942,0.95,0.954,0.95,0.95,0.87,0.947,0.941,0.955,0.927,0.941,0.948,0.902,0.953,0.894,0.961,0.948,0.957,0.953,0.956,0.958,0.957,0.956,0.96,0.951,0.951,0.955,0.959,0.959,0.952,0.955,0.937,0.956,0.954,0.941,0.952,0.954,0.949,0.952,0.957,0.954,0.946,0.899,0.949,0.94,0.858,0.932,0.893,0.891,0.964,0.964,0.965,0.967,0.966,0.966,0.965,0.965,0.965,0.966,0.954,0.919,1380,Activity,RL40A_YEAST,Medium,Eukaryote
+RNC_ECOLI_Weeks_2023,0.968,0.969,0.938,0.943,0.938,0.956,0.865,0.925,0.967,0.962,0.969,0.967,0.967,0.89,0.964,0.967,0.967,0.969,0.972,0.939,0.97,0.969,0.966,0.969,0.97,0.966,0.97,0.966,0.969,0.938,0.969,0.969,0.853,0.965,0.966,0.965,0.965,0.966,0.967,0.961,0.962,0.962,0.966,0.882,0.967,0.965,0.913,0.969,0.882,0.893,0.968,0.966,0.965,0.968,0.966,0.969,0.966,0.965,0.966,0.967,0.969,0.962,4277,Activity,RNC_ECOLI,Medium,Prokaryote
+RPC1_BP434_Tsuboyama_2023_1R69,0.848,0.884,0.874,0.879,0.873,0.884,0.834,0.835,0.823,0.817,0.885,0.876,0.879,0.853,0.86,0.868,0.879,0.894,0.888,0.891,0.823,0.888,0.882,0.886,0.849,0.891,0.865,0.887,0.893,0.872,0.883,0.839,0.856,0.777,0.865,0.863,0.822,0.866,0.853,0.886,0.888,0.882,0.848,0.796,0.827,0.825,0.901,0.801,0.903,0.887,0.857,0.863,0.869,0.877,0.874,0.867,0.877,0.866,0.866,0.872,0.885,0.85,1459,Stability,RPC1_BP434,High,Virus
+RPC1_LAMBD_Li_2019_high-expression,0.85,0.911,0.91,0.913,0.915,0.913,0.851,0.897,0.904,0.921,0.91,0.922,0.924,0.855,0.879,0.91,0.926,0.912,0.911,0.928,0.794,0.872,0.9,0.898,0.814,0.892,0.889,0.82,0.914,0.889,0.889,0.885,0.746,0.807,0.908,0.905,0.866,0.923,0.925,0.915,0.914,0.917,0.847,0.901,0.886,0.885,0.665,0.905,0.864,0.806,0.927,0.927,0.931,0.926,0.926,0.928,0.929,0.927,0.929,0.933,0.926,0.872,351,Activity,RPC1_LAMBD,High,Virus
+RPC1_LAMBD_Li_2019_low-expression,0.69,0.873,0.848,0.855,0.843,0.856,0.739,0.874,0.808,0.827,0.839,0.758,0.829,0.704,0.756,0.754,0.794,0.844,0.858,0.817,0.598,0.733,0.714,0.796,0.637,0.74,0.736,0.68,0.902,0.854,0.889,0.883,0.641,0.668,0.741,0.86,0.712,0.751,0.827,0.84,0.843,0.864,0.68,0.726,0.753,0.742,0.628,0.719,0.79,0.623,0.791,0.797,0.779,0.795,0.805,0.81,0.809,0.77,0.801,0.791,0.806,0.699,351,Activity,RPC1_LAMBD,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32,0.879,0.865,0.867,0.867,0.867,0.869,0.799,0.837,0.867,0.866,0.86,0.859,0.857,0.813,0.831,0.884,0.864,0.868,0.861,0.856,0.86,0.867,0.849,0.849,0.86,0.852,0.884,0.862,0.86,0.86,0.858,0.849,0.823,0.875,0.862,0.852,0.881,0.87,0.865,0.876,0.868,0.864,0.831,0.82,0.878,0.858,0.9,0.887,0.888,0.893,0.866,0.863,0.866,0.87,0.866,0.856,0.865,0.865,0.872,0.868,0.866,0.915,1195,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance,0.862,0.9,0.905,0.91,0.906,0.906,0.823,0.858,0.901,0.904,0.896,0.909,0.911,0.846,0.851,0.88,0.904,0.907,0.902,0.768,0.88,0.907,0.901,0.896,0.891,0.908,0.907,0.905,0.896,0.901,0.909,0.878,0.807,0.88,0.905,0.903,0.896,0.91,0.909,0.91,0.908,0.909,0.853,0.789,0.896,0.863,0.875,0.911,0.878,0.823,0.903,0.9,0.907,0.91,0.907,0.908,0.906,0.908,0.907,0.91,0.901,0.876,9803,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity,0.739,0.803,0.811,0.814,0.812,0.815,0.682,0.755,0.807,0.807,0.8,0.818,0.816,0.718,0.71,0.747,0.803,0.811,0.804,0.635,0.781,0.81,0.802,0.798,0.796,0.81,0.811,0.81,0.796,0.807,0.815,0.79,0.701,0.777,0.809,0.808,0.786,0.814,0.813,0.818,0.817,0.818,0.711,0.636,0.801,0.726,0.773,0.824,0.768,0.687,0.808,0.801,0.809,0.812,0.806,0.811,0.807,0.812,0.812,0.812,0.789,0.751,10094,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB,0.874,0.833,0.842,0.863,0.854,0.869,0.766,0.808,0.844,0.857,0.816,0.829,0.826,0.659,0.847,0.866,0.805,0.842,0.77,0.765,0.784,0.819,0.817,0.822,0.781,0.829,0.816,0.818,0.816,0.804,0.813,0.756,0.76,0.749,0.798,0.801,0.836,0.842,0.849,0.853,0.873,0.863,0.824,0.671,0.854,0.829,0.827,0.869,0.842,0.869,0.834,0.772,0.83,0.807,0.837,0.831,0.809,0.824,0.824,0.827,0.796,0.893,965,Stability,SAV1_MOUSE,High,Eukaryote
+SBI_STAAM_Tsuboyama_2023_2JVG,0.761,0.773,0.795,0.794,0.788,0.779,0.648,0.686,0.8,0.823,0.722,0.676,0.679,0.624,0.655,0.711,0.826,0.842,0.79,0.754,0.647,0.745,0.77,0.727,0.671,0.645,0.688,0.656,0.754,0.777,0.831,0.784,0.622,0.596,0.618,0.62,0.728,0.744,0.747,0.751,0.777,0.778,0.659,0.657,0.644,0.702,0.837,0.803,0.843,0.846,0.685,0.705,0.709,0.72,0.726,0.691,0.746,0.77,0.813,0.743,0.851,0.801,1025,Stability,SBI_STAAM,Medium,Prokaryote
+SC6A4_HUMAN_Young_2021,0.868,0.888,0.882,0.883,0.887,0.886,0.829,0.901,0.885,0.889,0.89,0.872,0.881,0.783,0.817,0.856,0.868,0.871,0.87,0.891,0.888,0.905,0.904,0.9,0.887,0.904,0.903,0.902,0.902,0.89,0.881,0.85,0.854,0.892,0.902,0.901,0.884,0.892,0.893,0.888,0.89,0.888,0.864,0.781,0.891,0.871,0.868,0.893,0.868,0.823,0.867,0.875,0.875,0.876,0.873,0.875,0.871,0.873,0.867,0.874,0.889,0.87,11576,Activity,SC6A4_HUMAN,Medium,Human
+SCIN_STAAR_Tsuboyama_2023_2QFF,0.74,0.766,0.776,0.779,0.781,0.788,0.763,0.722,0.793,0.784,0.783,0.782,0.776,0.738,0.774,0.8,0.803,0.793,0.817,0.759,0.714,0.749,0.755,0.755,0.764,0.767,0.784,0.782,0.76,0.758,0.768,0.746,0.617,0.712,0.686,0.746,0.757,0.758,0.766,0.78,0.789,0.793,0.787,0.769,0.771,0.783,0.793,0.851,0.835,0.803,0.801,0.8,0.808,0.818,0.813,0.836,0.816,0.835,0.81,0.823,0.887,0.851,1212,Stability,SCIN_STAAR,High,Prokaryote
+SCN5A_HUMAN_Glazer_2019,0.753,0.728,0.754,0.754,0.754,0.754,0.794,0.603,0.791,0.798,0.753,0.731,0.765,0.77,0.75,0.763,0.778,0.77,0.748,0.76,0.743,0.724,0.739,0.723,0.732,0.748,0.743,0.714,0.795,0.739,0.741,0.738,0.639,0.675,0.697,0.758,0.756,0.784,0.816,0.77,0.787,0.772,0.752,0.714,0.8,0.693,0.637,0.657,0.71,0.606,0.703,0.734,0.727,0.741,0.728,0.736,0.72,0.705,0.72,0.726,0.713,0.737,224,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+SDA_BACSU_Tsuboyama_2023_1PV0,0.931,0.927,0.931,0.931,0.936,0.936,0.599,0.877,0.925,0.934,0.922,0.899,0.906,0.688,0.743,0.914,0.922,0.935,0.933,0.917,0.591,0.637,0.618,0.76,0.67,0.886,0.568,0.806,0.92,0.94,0.92,0.9,0.522,0.593,0.743,0.886,0.932,0.93,0.924,0.935,0.935,0.933,0.703,0.654,0.913,0.728,0.901,0.898,0.94,0.935,0.932,0.932,0.934,0.939,0.935,0.937,0.938,0.936,0.94,0.94,0.944,0.945,2770,Stability,SDA_BACSU,Medium,Prokaryote
+SERC_HUMAN_Xie_2023,0.882,0.903,0.893,0.898,0.897,0.903,0.661,0.892,0.896,0.898,0.899,0.898,0.9,0.764,0.867,0.89,0.897,0.892,0.889,0.899,0.903,0.9,0.903,0.904,0.903,0.901,0.906,0.905,0.899,0.899,0.903,0.901,0.805,0.898,0.906,0.904,0.894,0.904,0.903,0.903,0.903,0.902,0.857,0.737,0.896,0.879,0.871,0.903,0.787,0.85,0.897,0.901,0.896,0.901,0.905,0.897,0.899,0.897,0.894,0.902,0.901,0.88,1914,OrganismalFitness,SERC_HUMAN,High,Human
+SHOC2_HUMAN_Kwon_2022,0.82,0.841,0.836,0.835,0.839,0.838,0.802,0.839,0.85,0.844,0.83,0.827,0.83,0.805,0.806,0.807,0.826,0.838,0.834,0.831,0.81,0.815,0.845,0.838,0.807,0.82,0.821,0.823,0.844,0.844,0.845,0.838,0.806,0.811,0.822,0.842,0.818,0.828,0.84,0.838,0.836,0.843,0.809,0.806,0.84,0.806,0.82,0.84,0.814,0.81,0.832,0.832,0.836,0.834,0.837,0.838,0.832,0.831,0.826,0.833,0.805,0.809,10972,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+SOX30_HUMAN_Tsuboyama_2023_7JJK,0.83,0.855,0.835,0.834,0.841,0.838,0.821,0.809,0.818,0.838,0.845,0.84,0.847,0.832,0.84,0.86,0.855,0.835,0.839,0.85,0.835,0.812,0.829,0.823,0.842,0.846,0.839,0.839,0.838,0.835,0.816,0.804,0.829,0.754,0.823,0.839,0.829,0.827,0.842,0.835,0.839,0.838,0.846,0.76,0.837,0.855,0.845,0.836,0.859,0.845,0.849,0.867,0.866,0.859,0.855,0.862,0.848,0.848,0.846,0.86,0.853,0.851,1010,Stability,SOX30_HUMAN,High,Human
+SPA_STAAU_Tsuboyama_2023_1LP1,0.777,0.8,0.792,0.794,0.805,0.802,0.545,0.768,0.725,0.727,0.739,0.543,0.54,0.516,0.547,0.513,0.59,0.602,0.588,0.793,0.482,0.583,0.743,0.637,0.548,0.509,0.567,0.746,0.786,0.752,0.738,0.729,0.493,0.548,0.518,0.602,0.779,0.788,0.779,0.806,0.816,0.803,0.566,0.588,0.638,0.63,0.771,0.762,0.818,0.803,0.793,0.785,0.787,0.827,0.763,0.794,0.778,0.801,0.785,0.801,0.755,0.761,2105,Stability,SPA_STAAU,Medium,Prokaryote
+SPG1_STRSG_Olson_2014,0.652,0.658,0.475,0.483,0.658,0.668,0.439,0.486,0.455,0.621,0.708,0.675,0.663,0.716,0.683,0.681,0.696,0.677,0.697,0.651,0.711,0.681,0.679,0.683,0.691,0.684,0.687,0.685,0.73,0.682,0.735,0.724,0.489,0.674,0.655,0.661,0.658,0.65,0.661,0.684,0.672,0.688,0.433,0.418,0.555,0.485,0.659,0.613,0.679,0.567,0.726,0.685,0.704,0.718,0.715,0.695,0.724,0.726,0.715,0.721,0.704,0.715,536962,Binding,SPG1_STRSG,Low,Prokaryote
+SPG1_STRSG_Wu_2016,0.137,0.172,0.168,0.19,0.184,0.186,0.157,0.151,0.252,0.239,0.249,0.261,0.238,0.232,0.246,0.26,0.282,0.297,0.315,0.236,0.197,0.186,0.196,0.199,0.17,0.214,0.213,0.198,0.198,0.152,0.303,0.285,0.111,0.203,0.168,0.218,0.169,0.159,0.188,0.196,0.191,0.204,0.183,0.137,0.231,0.141,0.238,0.243,0.301,0.203,0.301,0.302,0.306,0.316,0.299,0.3,0.33,0.309,0.317,0.309,0.324,0.276,149360,Binding,SPG1_STRSG,Medium,Prokaryote
+SPG2_STRSG_Tsuboyama_2023_5UBS,0.784,0.792,0.779,0.771,0.775,0.774,0.708,0.778,0.81,0.824,0.784,0.788,0.776,0.72,0.765,0.753,0.794,0.796,0.766,0.804,0.724,0.778,0.693,0.702,0.668,0.731,0.71,0.778,0.769,0.826,0.803,0.787,0.47,0.652,0.762,0.799,0.734,0.761,0.743,0.804,0.807,0.81,0.755,0.656,0.778,0.727,0.828,0.787,0.833,0.817,0.789,0.8,0.815,0.818,0.812,0.812,0.81,0.804,0.791,0.813,0.835,0.755,1451,Stability,SPG2_STRSG,Medium,Prokaryote
+SPIKE_SARS2_Starr_2020_binding,0.831,0.909,0.816,0.865,0.898,0.896,0.739,0.938,0.949,0.948,0.747,0.741,0.739,0.748,0.774,0.758,0.772,0.775,0.793,0.925,0.928,0.929,0.926,0.931,0.931,0.925,0.913,0.937,0.917,0.939,0.935,0.932,0.864,0.929,0.928,0.915,0.918,0.913,0.921,0.927,0.921,0.919,0.752,0.751,0.748,0.776,0.919,0.863,0.745,0.861,0.845,0.841,0.828,0.849,0.849,0.867,0.847,0.823,0.789,0.855,0.838,0.793,3802,Binding,SPIKE_SARS2,Medium,Virus
+SPIKE_SARS2_Starr_2020_expression,0.759,0.882,0.797,0.855,0.882,0.88,0.643,0.847,0.848,0.887,0.633,0.62,0.636,0.634,0.662,0.643,0.635,0.683,0.688,0.864,0.824,0.845,0.836,0.843,0.844,0.846,0.821,0.844,0.829,0.889,0.849,0.849,0.747,0.841,0.842,0.841,0.833,0.834,0.847,0.89,0.891,0.89,0.622,0.622,0.637,0.652,0.899,0.856,0.749,0.756,0.811,0.819,0.824,0.822,0.832,0.835,0.82,0.805,0.812,0.834,0.828,0.716,3798,Expression,SPIKE_SARS2,Medium,Virus
+SPTN1_CHICK_Tsuboyama_2023_1TUD,0.832,0.869,0.807,0.774,0.806,0.797,0.613,0.706,0.79,0.799,0.865,0.815,0.838,0.358,0.811,0.819,0.804,0.887,0.809,0.836,0.766,0.813,0.76,0.788,0.826,0.841,0.81,0.834,0.833,0.827,0.877,0.88,0.581,0.78,0.755,0.776,0.834,0.846,0.83,0.803,0.803,0.801,0.751,0.421,0.785,0.737,0.765,0.775,0.902,0.893,0.892,0.887,0.898,0.886,0.887,0.889,0.886,0.894,0.88,0.895,0.883,0.767,3201,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,0.799,0.851,0.859,0.873,0.878,0.872,0.493,0.81,0.866,0.864,0.84,0.764,0.832,0.567,0.701,0.795,0.865,0.846,0.837,0.84,0.615,0.837,0.755,0.855,0.823,0.834,0.84,0.817,0.831,0.823,0.823,0.804,0.762,0.616,0.841,0.847,0.801,0.873,0.865,0.885,0.879,0.867,0.603,0.547,0.824,0.853,0.879,0.835,0.824,0.83,0.851,0.861,0.863,0.849,0.846,0.863,0.854,0.841,0.84,0.859,0.854,0.772,707,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88,0.862,0.858,0.853,0.851,0.856,0.855,0.416,0.76,0.843,0.843,0.889,0.851,0.883,0.362,0.844,0.903,0.893,0.904,0.913,0.836,0.482,0.703,0.757,0.722,0.734,0.783,0.799,0.819,0.866,0.817,0.883,0.835,0.627,0.676,0.629,0.748,0.887,0.875,0.875,0.875,0.882,0.87,0.798,0.459,0.884,0.854,0.753,0.893,0.916,0.907,0.872,0.883,0.877,0.896,0.874,0.886,0.881,0.889,0.883,0.882,0.914,0.891,1583,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,0.869,0.885,0.906,0.903,0.886,0.883,0.804,0.782,0.885,0.891,0.92,0.912,0.923,0.811,0.9,0.911,0.909,0.898,0.924,0.893,0.905,0.895,0.913,0.89,0.907,0.909,0.905,0.897,0.9,0.913,0.904,0.881,0.822,0.833,0.878,0.867,0.888,0.902,0.906,0.886,0.898,0.892,0.889,0.723,0.922,0.903,0.891,0.891,0.92,0.916,0.911,0.919,0.916,0.919,0.912,0.921,0.91,0.915,0.914,0.918,0.905,0.911,1556,Stability,SRBS1_HUMAN,High,Human
+SRC_HUMAN_Ahler_2019,0.561,0.57,0.554,0.563,0.553,0.557,0.557,0.523,0.575,0.57,0.571,0.568,0.572,0.538,0.545,0.516,0.538,0.549,0.542,0.555,0.554,0.545,0.552,0.541,0.542,0.552,0.556,0.544,0.543,0.571,0.55,0.561,0.548,0.547,0.539,0.532,0.546,0.553,0.552,0.556,0.561,0.556,0.573,0.517,0.558,0.57,0.499,0.532,0.499,0.436,0.54,0.542,0.529,0.564,0.55,0.544,0.532,0.549,0.538,0.538,0.556,0.542,3372,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.626,0.633,0.63,0.637,0.633,0.627,0.626,0.616,0.62,0.627,0.643,0.632,0.639,0.614,0.618,0.602,0.605,0.609,0.614,0.623,0.617,0.61,0.617,0.608,0.614,0.621,0.62,0.609,0.616,0.636,0.617,0.625,0.641,0.615,0.605,0.604,0.617,0.615,0.615,0.626,0.625,0.625,0.63,0.597,0.621,0.629,0.592,0.616,0.595,0.577,0.61,0.608,0.602,0.626,0.617,0.607,0.607,0.618,0.609,0.611,0.624,0.614,3637,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Nguyen_2022,0.669,0.685,0.686,0.683,0.674,0.674,0.667,0.67,0.668,0.682,0.69,0.682,0.685,0.661,0.662,0.647,0.664,0.675,0.676,0.681,0.69,0.681,0.683,0.677,0.675,0.684,0.685,0.675,0.677,0.694,0.677,0.68,0.67,0.678,0.671,0.671,0.67,0.672,0.672,0.68,0.675,0.675,0.678,0.65,0.686,0.68,0.636,0.667,0.638,0.629,0.661,0.663,0.661,0.673,0.664,0.664,0.66,0.665,0.658,0.66,0.681,0.661,3366,OrganismalFitness,SRC_HUMAN,Medium,Human
+SUMO1_HUMAN_Weile_2017,0.707,0.76,0.779,0.766,0.769,0.771,0.637,0.763,0.758,0.749,0.773,0.775,0.788,0.658,0.723,0.75,0.785,0.73,0.763,0.774,0.67,0.759,0.77,0.778,0.768,0.775,0.763,0.79,0.765,0.717,0.777,0.779,0.621,0.672,0.759,0.771,0.739,0.754,0.772,0.764,0.764,0.772,0.737,0.639,0.772,0.749,0.735,0.774,0.729,0.723,0.768,0.763,0.778,0.77,0.767,0.769,0.78,0.78,0.77,0.777,0.776,0.732,1700,OrganismalFitness,SUMO1_HUMAN,High,Human
+SYUA_HUMAN_Newberry_2020,0.826,0.836,0.818,0.814,0.827,0.829,0.809,0.878,0.838,0.845,0.836,0.868,0.861,0.809,0.811,0.824,0.812,0.849,0.86,0.86,0.807,0.867,0.872,0.87,0.811,0.862,0.868,0.86,0.867,0.874,0.877,0.877,0.696,0.824,0.865,0.868,0.836,0.855,0.856,0.835,0.851,0.85,0.812,0.8,0.857,0.808,0.761,0.834,0.809,0.77,0.813,0.795,0.793,0.805,0.806,0.807,0.816,0.824,0.832,0.804,0.834,0.802,2497,OrganismalFitness,SYUA_HUMAN,Medium,Human
+TADBP_HUMAN_Bolognesi_2019,0.594,0.591,0.589,0.587,0.59,0.587,0.716,0.592,0.577,0.58,0.595,0.607,0.593,0.643,0.66,0.637,0.557,0.579,0.612,0.59,0.677,0.576,0.595,0.596,0.693,0.608,0.584,0.592,0.593,0.601,0.599,0.622,0.576,0.698,0.732,0.592,0.609,0.616,0.6,0.594,0.594,0.59,0.652,0.697,0.607,0.653,0.696,0.598,0.699,0.61,0.66,0.621,0.613,0.643,0.621,0.634,0.608,0.612,0.59,0.62,0.62,0.657,1196,OrganismalFitness,TADBP_HUMAN,Low,Human
+TAT_HV1BR_Fernandes_2016,0.686,0.699,0.694,0.695,0.697,0.698,0.457,0.707,0.701,0.698,0.609,0.707,0.704,0.599,0.615,0.583,0.598,0.578,0.578,0.698,0.701,0.714,0.704,0.707,0.695,0.729,0.609,0.698,0.718,0.695,0.707,0.718,0.618,0.702,0.681,0.712,0.706,0.689,0.694,0.702,0.701,0.702,0.539,0.495,0.686,0.557,0.601,0.697,0.644,0.585,0.54,0.615,0.594,0.623,0.593,0.617,0.565,0.582,0.581,0.58,0.53,0.537,1577,OrganismalFitness,TAT_HV1BR,High,Virus
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,0.896,0.882,0.888,0.891,0.881,0.882,0.825,0.799,0.887,0.871,0.85,0.877,0.879,0.885,0.863,0.889,0.904,0.87,0.868,0.824,0.848,0.845,0.826,0.835,0.857,0.848,0.844,0.85,0.841,0.848,0.828,0.798,0.861,0.846,0.863,0.889,0.88,0.881,0.894,0.879,0.883,0.879,0.88,0.691,0.852,0.876,0.849,0.846,0.877,0.879,0.869,0.863,0.864,0.882,0.864,0.88,0.875,0.869,0.874,0.871,0.899,0.88,1058,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG,0.771,0.767,0.79,0.808,0.808,0.823,0.583,0.59,0.799,0.818,0.766,0.816,0.8,0.488,0.791,0.794,0.788,0.816,0.799,0.802,0.538,0.661,0.58,0.519,0.642,0.653,0.515,0.669,0.778,0.836,0.812,0.776,0.747,0.53,0.722,0.605,0.777,0.824,0.777,0.79,0.822,0.808,0.71,0.624,0.81,0.745,0.83,0.835,0.814,0.874,0.85,0.805,0.842,0.84,0.854,0.827,0.834,0.834,0.827,0.841,0.794,0.767,1279,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,0.848,0.885,0.878,0.88,0.871,0.877,0.577,0.79,0.878,0.891,0.851,0.839,0.852,0.715,0.827,0.848,0.871,0.885,0.886,0.868,0.811,0.887,0.888,0.887,0.829,0.88,0.868,0.889,0.876,0.864,0.898,0.883,0.729,0.62,0.821,0.829,0.844,0.863,0.867,0.879,0.88,0.878,0.832,0.724,0.87,0.829,0.82,0.891,0.913,0.916,0.863,0.889,0.885,0.864,0.879,0.879,0.886,0.878,0.868,0.887,0.905,0.819,1479,Stability,TNKS2_HUMAN,High,Human
+TPK1_HUMAN_Weile_2017,0.619,0.642,0.624,0.621,0.62,0.626,0.554,0.644,0.626,0.621,0.63,0.645,0.651,0.551,0.607,0.625,0.654,0.652,0.665,0.631,0.558,0.599,0.605,0.619,0.558,0.649,0.665,0.619,0.667,0.672,0.667,0.643,0.557,0.564,0.623,0.672,0.625,0.631,0.676,0.627,0.638,0.648,0.569,0.55,0.637,0.581,0.632,0.644,0.649,0.598,0.648,0.656,0.648,0.659,0.649,0.657,0.665,0.651,0.645,0.649,0.628,0.6,3181,OrganismalFitness,TPK1_HUMAN,Medium,Human
+TPMT_HUMAN_Matreyek_2018,0.805,0.817,0.818,0.816,0.819,0.816,0.735,0.815,0.817,0.819,0.82,0.818,0.826,0.764,0.773,0.804,0.813,0.814,0.807,0.82,0.771,0.761,0.803,0.828,0.801,0.798,0.809,0.799,0.81,0.826,0.829,0.819,0.771,0.787,0.8,0.807,0.81,0.811,0.812,0.82,0.819,0.817,0.783,0.733,0.822,0.803,0.8,0.823,0.824,0.789,0.803,0.804,0.809,0.802,0.809,0.811,0.812,0.815,0.812,0.81,0.817,0.8,3648,Expression,TPMT_HUMAN,Medium,Human
+TPOR_HUMAN_Bridgford_2020,0.668,0.597,0.695,0.635,0.627,0.625,0.69,0.641,0.663,0.656,0.678,0.643,0.671,0.589,0.664,0.575,0.571,0.61,0.649,0.589,0.619,0.589,0.659,0.652,0.638,0.546,0.742,0.7,0.676,0.618,0.611,0.618,0.708,0.64,0.633,0.617,0.67,0.682,0.63,0.651,0.667,0.607,0.58,0.627,0.624,0.643,0.486,0.635,0.653,0.56,0.703,0.606,0.625,0.629,0.621,0.634,0.613,0.668,0.664,0.624,0.637,0.691,562,OrganismalFitness,TPOR_HUMAN,Low,Human
+TRPC_SACS2_Chan_2017,0.879,0.917,0.91,0.915,0.911,0.913,0.744,0.899,0.91,0.918,0.9,0.888,0.892,0.788,0.826,0.877,0.919,0.921,0.909,0.922,0.781,0.849,0.859,0.867,0.879,0.861,0.909,0.883,0.93,0.912,0.912,0.887,0.735,0.822,0.877,0.846,0.845,0.888,0.874,0.887,0.912,0.909,0.769,0.75,0.884,0.825,0.798,0.896,0.914,0.746,0.898,0.903,0.898,0.911,0.906,0.9,0.904,0.911,0.902,0.912,0.924,0.866,1519,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+TRPC_THEMA_Chan_2017,0.692,0.732,0.652,0.688,0.719,0.712,0.607,0.639,0.705,0.737,0.726,0.732,0.747,0.622,0.719,0.755,0.725,0.743,0.751,0.742,0.67,0.716,0.733,0.756,0.698,0.736,0.689,0.751,0.762,0.657,0.734,0.684,0.554,0.672,0.747,0.738,0.716,0.723,0.723,0.733,0.73,0.723,0.689,0.501,0.765,0.718,0.651,0.758,0.726,0.581,0.718,0.72,0.691,0.73,0.725,0.707,0.705,0.701,0.69,0.713,0.754,0.753,1519,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+UBC9_HUMAN_Weile_2017,0.534,0.615,0.61,0.61,0.607,0.618,0.369,0.531,0.575,0.591,0.582,0.589,0.587,0.405,0.403,0.47,0.522,0.58,0.596,0.606,0.49,0.54,0.538,0.535,0.547,0.579,0.553,0.568,0.574,0.555,0.556,0.536,0.353,0.49,0.558,0.576,0.522,0.597,0.583,0.597,0.62,0.626,0.463,0.398,0.577,0.579,0.489,0.553,0.512,0.486,0.504,0.505,0.523,0.532,0.536,0.534,0.531,0.521,0.519,0.519,0.579,0.484,2563,OrganismalFitness,UBC9_HUMAN,Medium,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,0.66,0.786,0.785,0.782,0.795,0.798,0.528,0.728,0.773,0.802,0.838,0.799,0.804,0.609,0.784,0.848,0.84,0.829,0.821,0.798,0.542,0.74,0.754,0.777,0.723,0.756,0.752,0.763,0.784,0.765,0.831,0.792,0.734,0.516,0.541,0.792,0.684,0.697,0.797,0.797,0.792,0.818,0.657,0.52,0.683,0.715,0.768,0.749,0.836,0.731,0.848,0.856,0.855,0.854,0.856,0.846,0.842,0.855,0.849,0.854,0.833,0.744,3622,Stability,UBE4B_HUMAN,High,Human
+UBE4B_MOUSE_Starita_2013,0.672,0.659,0.658,0.658,0.648,0.656,0.536,0.626,0.647,0.644,0.656,0.687,0.666,0.575,0.656,0.66,0.641,0.635,0.617,0.63,0.547,0.628,0.646,0.633,0.66,0.649,0.649,0.673,0.648,0.636,0.644,0.632,0.549,0.544,0.566,0.629,0.674,0.691,0.682,0.658,0.663,0.664,0.641,0.528,0.626,0.664,0.644,0.625,0.491,0.6,0.667,0.647,0.658,0.648,0.645,0.657,0.639,0.636,0.632,0.651,0.653,0.644,899,Activity,UBE4B_MOUSE,Low,Eukaryote
+UBR5_HUMAN_Tsuboyama_2023_1I2T,0.867,0.877,0.845,0.856,0.872,0.867,0.775,0.8,0.862,0.869,0.859,0.818,0.842,0.792,0.795,0.781,0.773,0.863,0.859,0.831,0.852,0.869,0.884,0.884,0.878,0.883,0.889,0.891,0.885,0.88,0.873,0.848,0.813,0.83,0.863,0.852,0.864,0.872,0.878,0.871,0.869,0.879,0.804,0.783,0.885,0.82,0.881,0.885,0.885,0.885,0.859,0.877,0.887,0.882,0.886,0.878,0.874,0.881,0.874,0.881,0.886,0.861,1453,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8,0.74,0.718,0.729,0.729,0.737,0.735,0.671,0.679,0.738,0.767,0.753,0.765,0.759,0.72,0.746,0.766,0.769,0.772,0.75,0.708,0.706,0.732,0.705,0.746,0.721,0.719,0.731,0.716,0.757,0.743,0.74,0.719,0.572,0.716,0.746,0.745,0.73,0.766,0.748,0.74,0.746,0.731,0.727,0.723,0.743,0.696,0.707,0.738,0.753,0.752,0.776,0.776,0.778,0.783,0.795,0.788,0.784,0.775,0.791,0.783,0.785,0.79,723,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5,0.791,0.92,0.936,0.935,0.934,0.93,0.684,0.84,0.924,0.926,0.93,0.914,0.931,0.782,0.753,0.937,0.953,0.932,0.883,0.93,0.732,0.821,0.853,0.849,0.846,0.877,0.864,0.86,0.92,0.933,0.944,0.924,0.766,0.653,0.888,0.887,0.858,0.906,0.901,0.928,0.931,0.931,0.809,0.806,0.934,0.904,0.912,0.922,0.947,0.895,0.953,0.947,0.954,0.952,0.948,0.957,0.951,0.954,0.95,0.955,0.952,0.917,2568,Stability,VILI_CHICK,High,Eukaryote
+VKOR1_HUMAN_Chiasson_2020_abundance,0.841,0.829,0.839,0.843,0.835,0.841,0.717,0.816,0.839,0.854,0.822,0.843,0.836,0.7,0.801,0.834,0.84,0.841,0.825,0.852,0.755,0.749,0.771,0.825,0.768,0.777,0.806,0.771,0.814,0.845,0.847,0.82,0.668,0.729,0.768,0.81,0.844,0.844,0.847,0.84,0.843,0.854,0.718,0.661,0.83,0.789,0.805,0.819,0.828,0.784,0.832,0.822,0.84,0.847,0.854,0.855,0.848,0.839,0.84,0.847,0.839,0.833,2695,Expression,VKOR1_HUMAN,Medium,Human
+VKOR1_HUMAN_Chiasson_2020_activity,0.817,0.834,0.812,0.817,0.797,0.81,0.693,0.838,0.847,0.843,0.836,0.824,0.815,0.716,0.792,0.829,0.816,0.819,0.841,0.835,0.778,0.768,0.836,0.822,0.736,0.848,0.832,0.825,0.834,0.857,0.828,0.823,0.819,0.725,0.75,0.836,0.788,0.79,0.825,0.795,0.801,0.81,0.737,0.73,0.829,0.749,0.774,0.827,0.832,0.791,0.824,0.81,0.817,0.832,0.832,0.817,0.828,0.811,0.814,0.823,0.811,0.82,697,Activity,VKOR1_HUMAN,Medium,Human
+VRPI_BPT7_Tsuboyama_2023_2WNM,0.584,0.739,0.781,0.778,0.767,0.783,0.655,0.728,0.799,0.765,0.785,0.75,0.737,0.697,0.749,0.79,0.803,0.822,0.797,0.752,0.598,0.694,0.676,0.704,0.703,0.647,0.675,0.692,0.763,0.753,0.802,0.788,0.693,0.671,0.74,0.687,0.638,0.649,0.645,0.754,0.757,0.739,0.729,0.672,0.756,0.734,0.803,0.798,0.801,0.851,0.85,0.836,0.863,0.856,0.847,0.861,0.857,0.861,0.853,0.869,0.869,0.808,1047,Stability,VRPI_BPT7,Medium,Virus
+YAIA_ECOLI_Tsuboyama_2023_2KVT,0.667,0.852,0.846,0.86,0.856,0.853,0.453,0.803,0.832,0.859,0.789,0.626,0.795,0.577,0.644,0.85,0.873,0.919,0.904,0.875,0.503,0.53,0.57,0.735,0.562,0.573,0.521,0.553,0.841,0.875,0.877,0.867,0.521,0.452,0.564,0.831,0.702,0.727,0.816,0.839,0.842,0.855,0.685,0.564,0.717,0.694,0.876,0.837,0.912,0.862,0.889,0.903,0.88,0.891,0.901,0.894,0.89,0.894,0.897,0.897,0.877,0.749,1890,Stability,YAIA_ECOLI,Medium,Prokaryote
+YAP1_HUMAN_Araya_2012,0.464,0.386,0.471,0.468,0.466,0.463,0.374,0.312,0.295,0.288,0.409,0.362,0.357,0.409,0.419,0.441,0.453,0.412,0.39,0.327,0.308,0.301,0.293,0.322,0.332,0.3,0.311,0.322,0.296,0.37,0.382,0.397,0.34,0.358,0.302,0.323,0.427,0.368,0.396,0.45,0.42,0.439,0.484,0.181,0.422,0.48,0.359,0.418,0.332,0.372,0.355,0.348,0.365,0.37,0.347,0.358,0.369,0.373,0.386,0.361,0.478,0.41,10075,Binding,YAP1_HUMAN,Low,Human
+YNZC_BACSU_Tsuboyama_2023_2JVD,0.855,0.869,0.855,0.855,0.857,0.856,0.767,0.793,0.876,0.877,0.881,0.872,0.887,0.865,0.883,0.887,0.891,0.887,0.881,0.85,0.845,0.893,0.896,0.896,0.883,0.882,0.882,0.875,0.878,0.863,0.875,0.854,0.845,0.842,0.872,0.868,0.876,0.863,0.865,0.867,0.859,0.865,0.848,0.802,0.867,0.845,0.829,0.872,0.911,0.894,0.894,0.874,0.88,0.877,0.878,0.886,0.878,0.879,0.879,0.881,0.884,0.923,2300,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.html b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.html
new file mode 100644
index 0000000..582d970
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_DMS_level.html
@@ -0,0 +1,15196 @@
+
+
+
+ score |
+ Site-Independent |
+ EVmutation |
+ DeepSequence (single) |
+ DeepSequence (ensemble) |
+ EVE (single) |
+ EVE (ensemble) |
+ Unirep |
+ Unirep evotuned |
+ MSA Transformer (single) |
+ MSA Transformer (ensemble) |
+ ESM-1b |
+ ESM-1v (single) |
+ ESM-1v (ensemble) |
+ ESM2 (8M) |
+ ESM2 (35M) |
+ ESM2 (150M) |
+ ESM2 (650M) |
+ ESM2 (3B) |
+ ESM2 (15B) |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ GEMME |
+ VESPA |
+ VESPAl |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ CARP (38M) |
+ CARP (600K) |
+ CARP (640M) |
+ CARP (76M) |
+ MIF |
+ MIF-ST |
+ ESM-IF1 |
+ ProteinMPNN |
+ ProtSSN (k=10 h=512) |
+ ProtSSN (k=10 h=768) |
+ ProtSSN (k=10 h=1280) |
+ ProtSSN (k=20 h=512) |
+ ProtSSN (k=20 h=768) |
+ ProtSSN (k=20 h=1280) |
+ ProtSSN (k=30 h=512) |
+ ProtSSN (k=30 h=768) |
+ ProtSSN (k=30 h=1280) |
+ ProtSSN (ensemble) |
+ SaProt (650M) |
+ SaProt (35M) |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A0A140D2T1_ZIKV_Sourisseau_2019 |
+ 0.278 |
+ 0.290 |
+ 0.231 |
+ 0.219 |
+ 0.283 |
+ 0.286 |
+ 0.100 |
+ 0.099 |
+ 0.327 |
+ 0.333 |
+ 0.149 |
+ 0.128 |
+ 0.164 |
+ 0.097 |
+ 0.122 |
+ 0.107 |
+ 0.144 |
+ 0.257 |
+ 0.271 |
+ 0.189 |
+ 0.274 |
+ 0.277 |
+ 0.279 |
+ 0.298 |
+ 0.254 |
+ 0.267 |
+ 0.283 |
+ 0.248 |
+ 0.273 |
+ 0.299 |
+ 0.262 |
+ 0.231 |
+ 0.125 |
+ 0.259 |
+ 0.252 |
+ 0.239 |
+ 0.280 |
+ 0.283 |
+ 0.279 |
+ 0.281 |
+ 0.293 |
+ 0.285 |
+ 0.123 |
+ 0.121 |
+ 0.184 |
+ 0.131 |
+ 0.186 |
+ 0.202 |
+ 0.163 |
+ 0.166 |
+ 0.198 |
+ 0.207 |
+ 0.221 |
+ 0.218 |
+ 0.220 |
+ 0.199 |
+ 0.178 |
+ 0.199 |
+ 0.184 |
+ 0.209 |
+ 0.155 |
+ 0.154 |
+ 9576 |
+ OrganismalFitness |
+ A0A140D2T1_ZIKV |
+ Medium |
+ Virus |
+
+
+ A0A192B1T2_9HIV1_Haddox_2018 |
+ 0.878 |
+ 0.871 |
+ 0.873 |
+ 0.876 |
+ 0.879 |
+ 0.881 |
+ 0.653 |
+ 0.874 |
+ 0.884 |
+ 0.883 |
+ 0.852 |
+ 0.870 |
+ 0.873 |
+ 0.681 |
+ 0.677 |
+ 0.676 |
+ 0.689 |
+ 0.707 |
+ 0.722 |
+ 0.879 |
+ 0.874 |
+ 0.877 |
+ 0.881 |
+ 0.879 |
+ 0.875 |
+ 0.877 |
+ 0.877 |
+ 0.876 |
+ 0.881 |
+ 0.884 |
+ 0.875 |
+ 0.856 |
+ 0.820 |
+ 0.873 |
+ 0.878 |
+ 0.881 |
+ 0.879 |
+ 0.881 |
+ 0.883 |
+ 0.882 |
+ 0.881 |
+ 0.882 |
+ 0.822 |
+ 0.645 |
+ 0.873 |
+ 0.866 |
+ 0.742 |
+ 0.866 |
+ 0.697 |
+ 0.740 |
+ 0.703 |
+ 0.715 |
+ 0.729 |
+ 0.726 |
+ 0.717 |
+ 0.726 |
+ 0.719 |
+ 0.717 |
+ 0.713 |
+ 0.720 |
+ 0.728 |
+ 0.706 |
+ 12577 |
+ OrganismalFitness |
+ A0A192B1T2_9HIV1 |
+ Medium |
+ Virus |
+
+
+ A0A1I9GEU1_NEIME_Kennouche_2019 |
+ 0.691 |
+ 0.755 |
+ 0.760 |
+ 0.766 |
+ 0.758 |
+ 0.761 |
+ 0.711 |
+ 0.773 |
+ 0.764 |
+ 0.759 |
+ 0.746 |
+ 0.764 |
+ 0.756 |
+ 0.730 |
+ 0.736 |
+ 0.747 |
+ 0.725 |
+ 0.719 |
+ 0.709 |
+ 0.771 |
+ 0.752 |
+ 0.761 |
+ 0.760 |
+ 0.761 |
+ 0.760 |
+ 0.763 |
+ 0.768 |
+ 0.771 |
+ 0.756 |
+ 0.759 |
+ 0.763 |
+ 0.771 |
+ 0.757 |
+ 0.743 |
+ 0.764 |
+ 0.764 |
+ 0.751 |
+ 0.745 |
+ 0.761 |
+ 0.757 |
+ 0.761 |
+ 0.761 |
+ 0.712 |
+ 0.716 |
+ 0.755 |
+ 0.709 |
+ 0.768 |
+ 0.771 |
+ 0.734 |
+ 0.752 |
+ 0.752 |
+ 0.741 |
+ 0.732 |
+ 0.734 |
+ 0.723 |
+ 0.745 |
+ 0.747 |
+ 0.754 |
+ 0.733 |
+ 0.727 |
+ 0.731 |
+ 0.720 |
+ 922 |
+ Activity |
+ A0A1I9GEU1_NEIME |
+ Medium |
+ Prokaryote |
+
+
+ A0A247D711_LISMN_Stadelmann_2021 |
+ 0.893 |
+ 0.897 |
+ 0.805 |
+ 0.800 |
+ 0.891 |
+ 0.886 |
+ 0.695 |
+ 0.708 |
+ 0.901 |
+ 0.895 |
+ 0.772 |
+ 0.766 |
+ 0.768 |
+ 0.744 |
+ 0.741 |
+ 0.758 |
+ 0.751 |
+ 0.766 |
+ 0.736 |
+ 0.873 |
+ 0.687 |
+ 0.688 |
+ 0.697 |
+ 0.688 |
+ 0.667 |
+ 0.720 |
+ 0.741 |
+ 0.686 |
+ 0.725 |
+ 0.901 |
+ 0.856 |
+ 0.857 |
+ 0.833 |
+ 0.709 |
+ 0.689 |
+ 0.696 |
+ 0.881 |
+ 0.880 |
+ 0.882 |
+ 0.870 |
+ 0.875 |
+ 0.873 |
+ 0.726 |
+ 0.685 |
+ 0.729 |
+ 0.739 |
+ 0.848 |
+ 0.839 |
+ 0.890 |
+ 0.869 |
+ 0.826 |
+ 0.847 |
+ 0.840 |
+ 0.809 |
+ 0.792 |
+ 0.819 |
+ 0.809 |
+ 0.821 |
+ 0.796 |
+ 0.809 |
+ 0.840 |
+ 0.805 |
+ 1653 |
+ Activity |
+ A0A247D711_LISMN |
+ High |
+ Prokaryote |
+
+
+ A0A2Z5U3Z0_9INFA_Doud_2016 |
+ 0.755 |
+ 0.798 |
+ 0.783 |
+ 0.788 |
+ 0.792 |
+ 0.790 |
+ 0.497 |
+ 0.765 |
+ 0.791 |
+ 0.795 |
+ 0.552 |
+ 0.779 |
+ 0.800 |
+ 0.520 |
+ 0.525 |
+ 0.534 |
+ 0.767 |
+ 0.773 |
+ 0.778 |
+ 0.778 |
+ 0.769 |
+ 0.796 |
+ 0.794 |
+ 0.803 |
+ 0.700 |
+ 0.783 |
+ 0.789 |
+ 0.776 |
+ 0.779 |
+ 0.799 |
+ 0.754 |
+ 0.718 |
+ 0.554 |
+ 0.770 |
+ 0.789 |
+ 0.790 |
+ 0.788 |
+ 0.794 |
+ 0.794 |
+ 0.796 |
+ 0.799 |
+ 0.799 |
+ 0.526 |
+ 0.513 |
+ 0.641 |
+ 0.524 |
+ 0.659 |
+ 0.722 |
+ 0.674 |
+ 0.632 |
+ 0.740 |
+ 0.739 |
+ 0.748 |
+ 0.746 |
+ 0.739 |
+ 0.746 |
+ 0.746 |
+ 0.742 |
+ 0.748 |
+ 0.743 |
+ 0.614 |
+ 0.595 |
+ 10715 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A0A2Z5U3Z0_9INFA_Wu_2014 |
+ 0.215 |
+ 0.228 |
+ 0.233 |
+ 0.235 |
+ 0.232 |
+ 0.232 |
+ 0.112 |
+ 0.277 |
+ 0.244 |
+ 0.230 |
+ 0.151 |
+ 0.212 |
+ 0.245 |
+ 0.114 |
+ 0.117 |
+ 0.127 |
+ 0.237 |
+ 0.207 |
+ 0.242 |
+ 0.290 |
+ 0.206 |
+ 0.238 |
+ 0.250 |
+ 0.251 |
+ 0.201 |
+ 0.215 |
+ 0.257 |
+ 0.232 |
+ 0.220 |
+ 0.274 |
+ 0.261 |
+ 0.238 |
+ 0.082 |
+ 0.202 |
+ 0.224 |
+ 0.229 |
+ 0.222 |
+ 0.241 |
+ 0.237 |
+ 0.225 |
+ 0.230 |
+ 0.223 |
+ 0.111 |
+ 0.106 |
+ 0.178 |
+ 0.123 |
+ 0.178 |
+ 0.203 |
+ 0.175 |
+ 0.146 |
+ 0.228 |
+ 0.215 |
+ 0.215 |
+ 0.231 |
+ 0.205 |
+ 0.232 |
+ 0.203 |
+ 0.223 |
+ 0.223 |
+ 0.220 |
+ 0.151 |
+ 0.159 |
+ 2350 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A4_HUMAN_Seuma_2022 |
+ 0.662 |
+ 0.735 |
+ 0.761 |
+ 0.761 |
+ 0.725 |
+ 0.733 |
+ 0.639 |
+ 0.632 |
+ 0.743 |
+ 0.745 |
+ 0.684 |
+ 0.708 |
+ 0.741 |
+ 0.611 |
+ 0.609 |
+ 0.633 |
+ 0.657 |
+ 0.753 |
+ 0.725 |
+ 0.720 |
+ 0.685 |
+ 0.670 |
+ 0.691 |
+ 0.711 |
+ 0.724 |
+ 0.671 |
+ 0.687 |
+ 0.686 |
+ 0.692 |
+ 0.762 |
+ 0.700 |
+ 0.695 |
+ 0.758 |
+ 0.708 |
+ 0.693 |
+ 0.749 |
+ 0.711 |
+ 0.704 |
+ 0.738 |
+ 0.736 |
+ 0.733 |
+ 0.746 |
+ 0.614 |
+ 0.614 |
+ 0.747 |
+ 0.621 |
+ 0.677 |
+ 0.714 |
+ 0.435 |
+ 0.612 |
+ 0.709 |
+ 0.736 |
+ 0.709 |
+ 0.683 |
+ 0.688 |
+ 0.687 |
+ 0.695 |
+ 0.648 |
+ 0.684 |
+ 0.696 |
+ 0.630 |
+ 0.608 |
+ 14811 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ A4D664_9INFA_Soh_2019 |
+ 0.430 |
+ 0.430 |
+ 0.442 |
+ 0.436 |
+ 0.444 |
+ 0.445 |
+ 0.239 |
+ 0.421 |
+ 0.375 |
+ 0.372 |
+ 0.249 |
+ 0.237 |
+ 0.245 |
+ 0.246 |
+ 0.252 |
+ 0.246 |
+ 0.288 |
+ 0.288 |
+ 0.383 |
+ 0.407 |
+ 0.384 |
+ 0.415 |
+ 0.416 |
+ 0.396 |
+ 0.245 |
+ 0.356 |
+ 0.357 |
+ 0.355 |
+ 0.389 |
+ 0.432 |
+ 0.383 |
+ 0.356 |
+ 0.233 |
+ 0.397 |
+ 0.420 |
+ 0.424 |
+ 0.434 |
+ 0.444 |
+ 0.443 |
+ 0.455 |
+ 0.456 |
+ 0.453 |
+ 0.248 |
+ 0.245 |
+ 0.275 |
+ 0.258 |
+ 0.273 |
+ 0.285 |
+ 0.200 |
+ 0.288 |
+ 0.275 |
+ 0.292 |
+ 0.274 |
+ 0.305 |
+ 0.283 |
+ 0.299 |
+ 0.288 |
+ 0.301 |
+ 0.293 |
+ 0.288 |
+ 0.281 |
+ 0.249 |
+ 14421 |
+ OrganismalFitness |
+ A4D664_9INFA |
+ Medium |
+ Virus |
+
+
+ A4GRB6_PSEAI_Chen_2020 |
+ 0.724 |
+ 0.829 |
+ 0.830 |
+ 0.841 |
+ 0.827 |
+ 0.834 |
+ 0.656 |
+ 0.737 |
+ 0.821 |
+ 0.831 |
+ 0.838 |
+ 0.820 |
+ 0.837 |
+ 0.731 |
+ 0.776 |
+ 0.839 |
+ 0.865 |
+ 0.880 |
+ 0.874 |
+ 0.832 |
+ 0.701 |
+ 0.780 |
+ 0.802 |
+ 0.822 |
+ 0.764 |
+ 0.826 |
+ 0.823 |
+ 0.814 |
+ 0.870 |
+ 0.821 |
+ 0.887 |
+ 0.858 |
+ 0.646 |
+ 0.723 |
+ 0.815 |
+ 0.844 |
+ 0.817 |
+ 0.823 |
+ 0.857 |
+ 0.856 |
+ 0.854 |
+ 0.862 |
+ 0.741 |
+ 0.630 |
+ 0.850 |
+ 0.808 |
+ 0.861 |
+ 0.881 |
+ 0.799 |
+ 0.802 |
+ 0.858 |
+ 0.864 |
+ 0.855 |
+ 0.877 |
+ 0.873 |
+ 0.873 |
+ 0.860 |
+ 0.873 |
+ 0.867 |
+ 0.873 |
+ 0.864 |
+ 0.776 |
+ 5004 |
+ OrganismalFitness |
+ A4GRB6_PSEAI |
+ High |
+ Prokaryote |
+
+
+ AACC1_PSEAI_Dandage_2018 |
+ 0.776 |
+ 0.819 |
+ 0.779 |
+ 0.792 |
+ 0.788 |
+ 0.800 |
+ 0.735 |
+ 0.705 |
+ 0.818 |
+ 0.815 |
+ 0.766 |
+ 0.786 |
+ 0.777 |
+ 0.729 |
+ 0.733 |
+ 0.747 |
+ 0.779 |
+ 0.799 |
+ 0.798 |
+ 0.795 |
+ 0.728 |
+ 0.737 |
+ 0.763 |
+ 0.706 |
+ 0.747 |
+ 0.786 |
+ 0.778 |
+ 0.785 |
+ 0.766 |
+ 0.785 |
+ 0.779 |
+ 0.767 |
+ 0.676 |
+ 0.754 |
+ 0.747 |
+ 0.789 |
+ 0.804 |
+ 0.803 |
+ 0.799 |
+ 0.796 |
+ 0.794 |
+ 0.799 |
+ 0.744 |
+ 0.726 |
+ 0.754 |
+ 0.744 |
+ 0.739 |
+ 0.786 |
+ 0.770 |
+ 0.734 |
+ 0.788 |
+ 0.797 |
+ 0.792 |
+ 0.791 |
+ 0.799 |
+ 0.806 |
+ 0.793 |
+ 0.804 |
+ 0.808 |
+ 0.801 |
+ 0.771 |
+ 0.756 |
+ 1801 |
+ OrganismalFitness |
+ AACC1_PSEAI |
+ High |
+ Prokaryote |
+
+
+ ACE2_HUMAN_Chan_2020 |
+ 0.705 |
+ 0.686 |
+ 0.710 |
+ 0.700 |
+ 0.687 |
+ 0.685 |
+ 0.622 |
+ 0.675 |
+ 0.698 |
+ 0.700 |
+ 0.674 |
+ 0.698 |
+ 0.699 |
+ 0.616 |
+ 0.652 |
+ 0.676 |
+ 0.672 |
+ 0.668 |
+ 0.690 |
+ 0.697 |
+ 0.641 |
+ 0.698 |
+ 0.721 |
+ 0.697 |
+ 0.634 |
+ 0.705 |
+ 0.704 |
+ 0.703 |
+ 0.707 |
+ 0.689 |
+ 0.673 |
+ 0.677 |
+ 0.681 |
+ 0.660 |
+ 0.688 |
+ 0.657 |
+ 0.707 |
+ 0.696 |
+ 0.693 |
+ 0.693 |
+ 0.687 |
+ 0.685 |
+ 0.634 |
+ 0.615 |
+ 0.664 |
+ 0.659 |
+ 0.683 |
+ 0.676 |
+ 0.695 |
+ 0.664 |
+ 0.677 |
+ 0.681 |
+ 0.672 |
+ 0.662 |
+ 0.679 |
+ 0.674 |
+ 0.676 |
+ 0.669 |
+ 0.678 |
+ 0.669 |
+ 0.679 |
+ 0.679 |
+ 2223 |
+ Binding |
+ ACE2_HUMAN |
+ Medium |
+ Human |
+
+
+ ADRB2_HUMAN_Jones_2020 |
+ 0.604 |
+ 0.634 |
+ 0.637 |
+ 0.640 |
+ 0.654 |
+ 0.647 |
+ 0.604 |
+ 0.647 |
+ 0.634 |
+ 0.646 |
+ 0.647 |
+ 0.632 |
+ 0.641 |
+ 0.586 |
+ 0.605 |
+ 0.606 |
+ 0.603 |
+ 0.619 |
+ 0.627 |
+ 0.631 |
+ 0.625 |
+ 0.633 |
+ 0.627 |
+ 0.635 |
+ 0.636 |
+ 0.639 |
+ 0.641 |
+ 0.644 |
+ 0.635 |
+ 0.629 |
+ 0.631 |
+ 0.613 |
+ 0.545 |
+ 0.641 |
+ 0.634 |
+ 0.636 |
+ 0.639 |
+ 0.632 |
+ 0.636 |
+ 0.645 |
+ 0.645 |
+ 0.645 |
+ 0.610 |
+ 0.546 |
+ 0.632 |
+ 0.627 |
+ 0.604 |
+ 0.635 |
+ 0.602 |
+ 0.582 |
+ 0.618 |
+ 0.611 |
+ 0.607 |
+ 0.620 |
+ 0.615 |
+ 0.616 |
+ 0.607 |
+ 0.619 |
+ 0.606 |
+ 0.616 |
+ 0.640 |
+ 0.619 |
+ 7800 |
+ Activity |
+ ADRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ AICDA_HUMAN_Gajula_2014_3cycles |
+ 0.104 |
+ 0.126 |
+ 0.185 |
+ 0.106 |
+ 0.148 |
+ 0.163 |
+ 0.036 |
+ 0.149 |
+ 0.201 |
+ 0.147 |
+ 0.229 |
+ 0.173 |
+ 0.168 |
+ 0.059 |
+ 0.016 |
+ 0.226 |
+ 0.279 |
+ 0.240 |
+ 0.171 |
+ 0.320 |
+ 0.047 |
+ 0.066 |
+ 0.285 |
+ 0.290 |
+ 0.037 |
+ 0.248 |
+ 0.301 |
+ 0.282 |
+ 0.133 |
+ 0.104 |
+ 0.185 |
+ 0.159 |
+ 0.119 |
+ 0.084 |
+ 0.076 |
+ 0.140 |
+ 0.361 |
+ 0.327 |
+ 0.332 |
+ 0.164 |
+ 0.164 |
+ 0.147 |
+ 0.043 |
+ 0.047 |
+ 0.146 |
+ 0.022 |
+ 0.237 |
+ 0.159 |
+ 0.108 |
+ 0.387 |
+ 0.230 |
+ 0.237 |
+ 0.197 |
+ 0.231 |
+ 0.157 |
+ 0.203 |
+ 0.192 |
+ 0.187 |
+ 0.186 |
+ 0.207 |
+ 0.053 |
+ 0.048 |
+ 209 |
+ Activity |
+ AICDA_HUMAN |
+ Medium |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O |
+ 0.813 |
+ 0.856 |
+ 0.859 |
+ 0.858 |
+ 0.851 |
+ 0.862 |
+ 0.418 |
+ 0.831 |
+ 0.846 |
+ 0.855 |
+ 0.879 |
+ 0.567 |
+ 0.815 |
+ 0.440 |
+ 0.603 |
+ 0.879 |
+ 0.867 |
+ 0.872 |
+ 0.865 |
+ 0.846 |
+ 0.587 |
+ 0.460 |
+ 0.809 |
+ 0.804 |
+ 0.430 |
+ 0.808 |
+ 0.784 |
+ 0.673 |
+ 0.844 |
+ 0.846 |
+ 0.862 |
+ 0.837 |
+ 0.823 |
+ 0.454 |
+ 0.546 |
+ 0.468 |
+ 0.832 |
+ 0.837 |
+ 0.841 |
+ 0.864 |
+ 0.867 |
+ 0.865 |
+ 0.621 |
+ 0.593 |
+ 0.854 |
+ 0.788 |
+ 0.756 |
+ 0.801 |
+ 0.837 |
+ 0.834 |
+ 0.870 |
+ 0.878 |
+ 0.869 |
+ 0.863 |
+ 0.876 |
+ 0.867 |
+ 0.866 |
+ 0.859 |
+ 0.873 |
+ 0.871 |
+ 0.888 |
+ 0.674 |
+ 2972 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ AMIE_PSEAE_Wrenbeck_2017 |
+ 0.796 |
+ 0.882 |
+ 0.875 |
+ 0.877 |
+ 0.874 |
+ 0.875 |
+ 0.733 |
+ 0.857 |
+ 0.828 |
+ 0.883 |
+ 0.843 |
+ 0.882 |
+ 0.886 |
+ 0.781 |
+ 0.803 |
+ 0.806 |
+ 0.836 |
+ 0.891 |
+ 0.882 |
+ 0.845 |
+ 0.876 |
+ 0.886 |
+ 0.887 |
+ 0.885 |
+ 0.866 |
+ 0.879 |
+ 0.888 |
+ 0.883 |
+ 0.882 |
+ 0.873 |
+ 0.882 |
+ 0.874 |
+ 0.776 |
+ 0.886 |
+ 0.883 |
+ 0.881 |
+ 0.878 |
+ 0.878 |
+ 0.876 |
+ 0.885 |
+ 0.883 |
+ 0.882 |
+ 0.790 |
+ 0.759 |
+ 0.816 |
+ 0.802 |
+ 0.818 |
+ 0.848 |
+ 0.828 |
+ 0.816 |
+ 0.850 |
+ 0.860 |
+ 0.858 |
+ 0.854 |
+ 0.852 |
+ 0.855 |
+ 0.854 |
+ 0.850 |
+ 0.857 |
+ 0.859 |
+ 0.864 |
+ 0.813 |
+ 6227 |
+ Activity |
+ AMIE_PSEAE |
+ High |
+ Prokaryote |
+
+
+ ANCSZ_Hobbs_2022 |
+ 0.820 |
+ 0.817 |
+ 0.811 |
+ 0.810 |
+ 0.818 |
+ 0.818 |
+ 0.787 |
+ 0.808 |
+ 0.814 |
+ 0.819 |
+ 0.806 |
+ 0.820 |
+ 0.822 |
+ 0.818 |
+ 0.834 |
+ 0.829 |
+ 0.829 |
+ 0.829 |
+ 0.826 |
+ 0.676 |
+ 0.834 |
+ 0.807 |
+ 0.811 |
+ 0.808 |
+ 0.840 |
+ 0.815 |
+ 0.817 |
+ 0.800 |
+ 0.797 |
+ 0.819 |
+ 0.804 |
+ 0.794 |
+ 0.754 |
+ 0.804 |
+ 0.800 |
+ 0.811 |
+ 0.819 |
+ 0.816 |
+ 0.821 |
+ 0.822 |
+ 0.820 |
+ 0.822 |
+ 0.820 |
+ 0.795 |
+ 0.810 |
+ 0.812 |
+ 0.808 |
+ 0.825 |
+ 0.785 |
+ 0.745 |
+ 0.827 |
+ 0.831 |
+ 0.819 |
+ 0.814 |
+ 0.825 |
+ 0.822 |
+ 0.828 |
+ 0.829 |
+ 0.827 |
+ 0.828 |
+ 0.825 |
+ 0.823 |
+ 4670 |
+ Activity |
+ ANCSZ |
+ Medium |
+ Eukaryote |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY |
+ 0.839 |
+ 0.908 |
+ 0.910 |
+ 0.910 |
+ 0.909 |
+ 0.907 |
+ 0.793 |
+ 0.868 |
+ 0.825 |
+ 0.882 |
+ 0.872 |
+ 0.826 |
+ 0.895 |
+ 0.827 |
+ 0.851 |
+ 0.889 |
+ 0.901 |
+ 0.912 |
+ 0.896 |
+ 0.901 |
+ 0.850 |
+ 0.883 |
+ 0.898 |
+ 0.895 |
+ 0.858 |
+ 0.880 |
+ 0.901 |
+ 0.901 |
+ 0.891 |
+ 0.887 |
+ 0.878 |
+ 0.848 |
+ 0.770 |
+ 0.866 |
+ 0.870 |
+ 0.870 |
+ 0.871 |
+ 0.881 |
+ 0.888 |
+ 0.913 |
+ 0.901 |
+ 0.908 |
+ 0.827 |
+ 0.787 |
+ 0.874 |
+ 0.856 |
+ 0.918 |
+ 0.913 |
+ 0.914 |
+ 0.919 |
+ 0.899 |
+ 0.905 |
+ 0.908 |
+ 0.908 |
+ 0.907 |
+ 0.904 |
+ 0.904 |
+ 0.912 |
+ 0.902 |
+ 0.908 |
+ 0.927 |
+ 0.884 |
+ 1287 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B2L11_HUMAN_Dutta_2010_binding-Mcl-1 |
+ 0.594 |
+ 0.518 |
+ 0.605 |
+ 0.598 |
+ 0.595 |
+ 0.604 |
+ 0.425 |
+ 0.567 |
+ 0.492 |
+ 0.524 |
+ 0.180 |
+ 0.158 |
+ 0.552 |
+ 0.424 |
+ 0.352 |
+ 0.380 |
+ 0.474 |
+ 0.494 |
+ 0.563 |
+ 0.479 |
+ 0.363 |
+ 0.272 |
+ 0.479 |
+ 0.529 |
+ 0.195 |
+ 0.493 |
+ 0.536 |
+ 0.386 |
+ 0.536 |
+ 0.564 |
+ 0.458 |
+ 0.389 |
+ 0.371 |
+ 0.490 |
+ 0.224 |
+ 0.457 |
+ 0.597 |
+ 0.605 |
+ 0.637 |
+ 0.617 |
+ 0.624 |
+ 0.628 |
+ 0.355 |
+ 0.402 |
+ 0.518 |
+ 0.445 |
+ 0.386 |
+ 0.502 |
+ 0.449 |
+ 0.158 |
+ 0.403 |
+ 0.498 |
+ 0.489 |
+ 0.530 |
+ 0.428 |
+ 0.460 |
+ 0.491 |
+ 0.504 |
+ 0.511 |
+ 0.522 |
+ 0.424 |
+ 0.223 |
+ 170 |
+ Binding |
+ B2L11_HUMAN |
+ Low |
+ Human |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0 |
+ 0.681 |
+ 0.749 |
+ 0.713 |
+ 0.716 |
+ 0.718 |
+ 0.690 |
+ 0.612 |
+ 0.654 |
+ 0.754 |
+ 0.759 |
+ 0.771 |
+ 0.758 |
+ 0.766 |
+ 0.514 |
+ 0.735 |
+ 0.759 |
+ 0.814 |
+ 0.812 |
+ 0.815 |
+ 0.755 |
+ 0.661 |
+ 0.732 |
+ 0.742 |
+ 0.746 |
+ 0.592 |
+ 0.765 |
+ 0.709 |
+ 0.734 |
+ 0.716 |
+ 0.668 |
+ 0.711 |
+ 0.681 |
+ 0.616 |
+ 0.700 |
+ 0.658 |
+ 0.699 |
+ 0.719 |
+ 0.694 |
+ 0.727 |
+ 0.709 |
+ 0.678 |
+ 0.693 |
+ 0.676 |
+ 0.529 |
+ 0.762 |
+ 0.732 |
+ 0.789 |
+ 0.772 |
+ 0.811 |
+ 0.789 |
+ 0.788 |
+ 0.800 |
+ 0.798 |
+ 0.822 |
+ 0.803 |
+ 0.823 |
+ 0.807 |
+ 0.809 |
+ 0.800 |
+ 0.809 |
+ 0.811 |
+ 0.797 |
+ 2069 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU |
+ 0.877 |
+ 0.893 |
+ 0.888 |
+ 0.887 |
+ 0.878 |
+ 0.887 |
+ 0.727 |
+ 0.844 |
+ 0.861 |
+ 0.901 |
+ 0.817 |
+ 0.858 |
+ 0.873 |
+ 0.773 |
+ 0.791 |
+ 0.825 |
+ 0.890 |
+ 0.888 |
+ 0.894 |
+ 0.900 |
+ 0.737 |
+ 0.838 |
+ 0.831 |
+ 0.851 |
+ 0.811 |
+ 0.874 |
+ 0.813 |
+ 0.875 |
+ 0.905 |
+ 0.893 |
+ 0.879 |
+ 0.860 |
+ 0.679 |
+ 0.813 |
+ 0.823 |
+ 0.866 |
+ 0.872 |
+ 0.872 |
+ 0.889 |
+ 0.884 |
+ 0.882 |
+ 0.888 |
+ 0.808 |
+ 0.801 |
+ 0.838 |
+ 0.834 |
+ 0.884 |
+ 0.907 |
+ 0.920 |
+ 0.896 |
+ 0.885 |
+ 0.906 |
+ 0.895 |
+ 0.897 |
+ 0.899 |
+ 0.893 |
+ 0.896 |
+ 0.892 |
+ 0.891 |
+ 0.898 |
+ 0.913 |
+ 0.788 |
+ 1572 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Deng_2012 |
+ 0.681 |
+ 0.798 |
+ 0.775 |
+ 0.780 |
+ 0.806 |
+ 0.802 |
+ 0.532 |
+ 0.611 |
+ 0.718 |
+ 0.770 |
+ 0.747 |
+ 0.742 |
+ 0.746 |
+ 0.611 |
+ 0.683 |
+ 0.718 |
+ 0.732 |
+ 0.749 |
+ 0.738 |
+ 0.731 |
+ 0.708 |
+ 0.706 |
+ 0.746 |
+ 0.714 |
+ 0.694 |
+ 0.757 |
+ 0.750 |
+ 0.772 |
+ 0.724 |
+ 0.726 |
+ 0.789 |
+ 0.771 |
+ 0.546 |
+ 0.697 |
+ 0.780 |
+ 0.737 |
+ 0.764 |
+ 0.799 |
+ 0.781 |
+ 0.785 |
+ 0.807 |
+ 0.815 |
+ 0.651 |
+ 0.493 |
+ 0.754 |
+ 0.691 |
+ 0.731 |
+ 0.760 |
+ 0.746 |
+ 0.681 |
+ 0.770 |
+ 0.786 |
+ 0.775 |
+ 0.785 |
+ 0.785 |
+ 0.785 |
+ 0.797 |
+ 0.782 |
+ 0.805 |
+ 0.793 |
+ 0.754 |
+ 0.715 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Firnberg_2014 |
+ 0.598 |
+ 0.787 |
+ 0.793 |
+ 0.775 |
+ 0.808 |
+ 0.814 |
+ 0.387 |
+ 0.514 |
+ 0.667 |
+ 0.755 |
+ 0.706 |
+ 0.716 |
+ 0.729 |
+ 0.560 |
+ 0.636 |
+ 0.702 |
+ 0.722 |
+ 0.736 |
+ 0.695 |
+ 0.684 |
+ 0.697 |
+ 0.695 |
+ 0.706 |
+ 0.674 |
+ 0.669 |
+ 0.734 |
+ 0.732 |
+ 0.747 |
+ 0.657 |
+ 0.666 |
+ 0.799 |
+ 0.789 |
+ 0.445 |
+ 0.668 |
+ 0.736 |
+ 0.683 |
+ 0.744 |
+ 0.774 |
+ 0.758 |
+ 0.780 |
+ 0.804 |
+ 0.824 |
+ 0.611 |
+ 0.375 |
+ 0.726 |
+ 0.645 |
+ 0.723 |
+ 0.787 |
+ 0.792 |
+ 0.606 |
+ 0.754 |
+ 0.759 |
+ 0.768 |
+ 0.765 |
+ 0.777 |
+ 0.788 |
+ 0.772 |
+ 0.769 |
+ 0.768 |
+ 0.779 |
+ 0.797 |
+ 0.676 |
+ 4783 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Jacquier_2013 |
+ 0.815 |
+ 0.937 |
+ 0.918 |
+ 0.900 |
+ 0.921 |
+ 0.924 |
+ 0.621 |
+ 0.760 |
+ 0.858 |
+ 0.894 |
+ 0.869 |
+ 0.894 |
+ 0.898 |
+ 0.802 |
+ 0.825 |
+ 0.837 |
+ 0.863 |
+ 0.899 |
+ 0.897 |
+ 0.850 |
+ 0.895 |
+ 0.887 |
+ 0.912 |
+ 0.906 |
+ 0.838 |
+ 0.882 |
+ 0.912 |
+ 0.922 |
+ 0.903 |
+ 0.771 |
+ 0.916 |
+ 0.916 |
+ 0.728 |
+ 0.871 |
+ 0.918 |
+ 0.906 |
+ 0.904 |
+ 0.921 |
+ 0.918 |
+ 0.898 |
+ 0.908 |
+ 0.922 |
+ 0.819 |
+ 0.571 |
+ 0.883 |
+ 0.853 |
+ 0.823 |
+ 0.908 |
+ 0.902 |
+ 0.840 |
+ 0.892 |
+ 0.894 |
+ 0.901 |
+ 0.909 |
+ 0.892 |
+ 0.901 |
+ 0.909 |
+ 0.895 |
+ 0.903 |
+ 0.897 |
+ 0.919 |
+ 0.872 |
+ 989 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Stiffler_2015 |
+ 0.830 |
+ 0.950 |
+ 0.940 |
+ 0.931 |
+ 0.964 |
+ 0.962 |
+ 0.633 |
+ 0.769 |
+ 0.860 |
+ 0.922 |
+ 0.891 |
+ 0.907 |
+ 0.909 |
+ 0.806 |
+ 0.856 |
+ 0.897 |
+ 0.901 |
+ 0.933 |
+ 0.912 |
+ 0.873 |
+ 0.897 |
+ 0.904 |
+ 0.922 |
+ 0.905 |
+ 0.863 |
+ 0.916 |
+ 0.929 |
+ 0.943 |
+ 0.895 |
+ 0.877 |
+ 0.967 |
+ 0.963 |
+ 0.683 |
+ 0.882 |
+ 0.939 |
+ 0.906 |
+ 0.932 |
+ 0.957 |
+ 0.946 |
+ 0.942 |
+ 0.957 |
+ 0.973 |
+ 0.840 |
+ 0.615 |
+ 0.913 |
+ 0.863 |
+ 0.910 |
+ 0.953 |
+ 0.951 |
+ 0.815 |
+ 0.933 |
+ 0.930 |
+ 0.941 |
+ 0.936 |
+ 0.940 |
+ 0.952 |
+ 0.941 |
+ 0.935 |
+ 0.945 |
+ 0.944 |
+ 0.956 |
+ 0.885 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BRCA1_HUMAN_Findlay_2018 |
+ 0.881 |
+ 0.871 |
+ 0.877 |
+ 0.878 |
+ 0.866 |
+ 0.867 |
+ 0.738 |
+ 0.812 |
+ 0.867 |
+ 0.869 |
+ 0.865 |
+ 0.822 |
+ 0.822 |
+ 0.770 |
+ 0.812 |
+ 0.834 |
+ 0.866 |
+ 0.879 |
+ 0.850 |
+ 0.872 |
+ 0.760 |
+ 0.862 |
+ 0.877 |
+ 0.861 |
+ 0.823 |
+ 0.890 |
+ 0.887 |
+ 0.873 |
+ 0.873 |
+ 0.878 |
+ 0.879 |
+ 0.855 |
+ 0.717 |
+ 0.781 |
+ 0.772 |
+ 0.857 |
+ 0.877 |
+ 0.881 |
+ 0.883 |
+ 0.878 |
+ 0.877 |
+ 0.878 |
+ 0.793 |
+ 0.756 |
+ 0.881 |
+ 0.835 |
+ 0.838 |
+ 0.877 |
+ 0.751 |
+ 0.781 |
+ 0.853 |
+ 0.860 |
+ 0.858 |
+ 0.855 |
+ 0.850 |
+ 0.858 |
+ 0.848 |
+ 0.859 |
+ 0.852 |
+ 0.857 |
+ 0.818 |
+ 0.833 |
+ 1837 |
+ OrganismalFitness |
+ BRCA1_HUMAN |
+ Low |
+ Human |
+
+
+ BRCA2_HUMAN_Erwood_2022_HEK293T |
+ 0.941 |
+ 0.933 |
+ 0.936 |
+ 0.935 |
+ 0.935 |
+ 0.932 |
+ 0.915 |
+ 0.937 |
+ 0.856 |
+ 0.891 |
+ 0.940 |
+ 0.923 |
+ 0.918 |
+ 0.928 |
+ 0.925 |
+ 0.929 |
+ 0.944 |
+ 0.935 |
+ 0.940 |
+ 0.903 |
+ 0.930 |
+ 0.944 |
+ 0.949 |
+ 0.943 |
+ 0.942 |
+ 0.922 |
+ 0.921 |
+ 0.933 |
+ 0.914 |
+ 0.932 |
+ 0.926 |
+ 0.946 |
+ 0.927 |
+ 0.926 |
+ 0.930 |
+ 0.931 |
+ 0.938 |
+ 0.937 |
+ 0.939 |
+ 0.933 |
+ 0.934 |
+ 0.933 |
+ 0.918 |
+ 0.922 |
+ 0.939 |
+ 0.898 |
+ 0.890 |
+ 0.892 |
+ 0.769 |
+ 0.917 |
+ 0.943 |
+ 0.942 |
+ 0.941 |
+ 0.938 |
+ 0.945 |
+ 0.933 |
+ 0.948 |
+ 0.943 |
+ 0.943 |
+ 0.944 |
+ 0.896 |
+ 0.910 |
+ 265 |
+ OrganismalFitness |
+ BRCA2_HUMAN |
+ NaN |
+ Human |
+
+
+ C6KNH7_9INFA_Lee_2018 |
+ 0.771 |
+ 0.811 |
+ 0.803 |
+ 0.800 |
+ 0.812 |
+ 0.812 |
+ 0.609 |
+ 0.792 |
+ 0.809 |
+ 0.812 |
+ 0.627 |
+ 0.770 |
+ 0.822 |
+ 0.601 |
+ 0.604 |
+ 0.616 |
+ 0.784 |
+ 0.810 |
+ 0.820 |
+ 0.792 |
+ 0.798 |
+ 0.788 |
+ 0.796 |
+ 0.786 |
+ 0.660 |
+ 0.793 |
+ 0.796 |
+ 0.808 |
+ 0.778 |
+ 0.828 |
+ 0.800 |
+ 0.767 |
+ 0.667 |
+ 0.786 |
+ 0.796 |
+ 0.787 |
+ 0.801 |
+ 0.805 |
+ 0.801 |
+ 0.815 |
+ 0.818 |
+ 0.817 |
+ 0.603 |
+ 0.590 |
+ 0.698 |
+ 0.608 |
+ 0.750 |
+ 0.795 |
+ 0.785 |
+ 0.719 |
+ 0.789 |
+ 0.796 |
+ 0.795 |
+ 0.793 |
+ 0.793 |
+ 0.783 |
+ 0.794 |
+ 0.794 |
+ 0.783 |
+ 0.795 |
+ 0.688 |
+ 0.658 |
+ 10754 |
+ OrganismalFitness |
+ C6KNH7_9INFA |
+ Medium |
+ Virus |
+
+
+ CALM1_HUMAN_Weile_2017 |
+ 0.799 |
+ 0.817 |
+ 0.808 |
+ 0.804 |
+ 0.805 |
+ 0.800 |
+ 0.801 |
+ 0.790 |
+ 0.815 |
+ 0.816 |
+ 0.827 |
+ 0.794 |
+ 0.821 |
+ 0.797 |
+ 0.793 |
+ 0.791 |
+ 0.773 |
+ 0.786 |
+ 0.812 |
+ 0.813 |
+ 0.801 |
+ 0.806 |
+ 0.821 |
+ 0.810 |
+ 0.812 |
+ 0.827 |
+ 0.816 |
+ 0.841 |
+ 0.832 |
+ 0.823 |
+ 0.819 |
+ 0.794 |
+ 0.750 |
+ 0.809 |
+ 0.832 |
+ 0.823 |
+ 0.815 |
+ 0.827 |
+ 0.828 |
+ 0.813 |
+ 0.807 |
+ 0.813 |
+ 0.807 |
+ 0.794 |
+ 0.819 |
+ 0.824 |
+ 0.781 |
+ 0.775 |
+ 0.809 |
+ 0.758 |
+ 0.774 |
+ 0.780 |
+ 0.781 |
+ 0.758 |
+ 0.766 |
+ 0.783 |
+ 0.781 |
+ 0.779 |
+ 0.763 |
+ 0.778 |
+ 0.826 |
+ 0.822 |
+ 1813 |
+ OrganismalFitness |
+ CALM1_HUMAN |
+ High |
+ Human |
+
+
+ CAPSD_AAV2S_Sinai_2021 |
+ 0.791 |
+ 0.789 |
+ 0.786 |
+ 0.811 |
+ 0.781 |
+ 0.784 |
+ 0.758 |
+ 0.841 |
+ 0.750 |
+ 0.777 |
+ 0.641 |
+ 0.666 |
+ 0.682 |
+ 0.701 |
+ 0.729 |
+ 0.690 |
+ 0.733 |
+ 0.665 |
+ 0.606 |
+ 0.703 |
+ 0.683 |
+ 0.722 |
+ 0.696 |
+ 0.712 |
+ 0.680 |
+ 0.691 |
+ 0.729 |
+ 0.698 |
+ 0.794 |
+ 0.798 |
+ 0.629 |
+ 0.619 |
+ 0.616 |
+ 0.699 |
+ 0.721 |
+ 0.830 |
+ 0.771 |
+ 0.772 |
+ 0.821 |
+ 0.785 |
+ 0.787 |
+ 0.817 |
+ 0.663 |
+ 0.718 |
+ 0.776 |
+ 0.688 |
+ 0.768 |
+ 0.753 |
+ 0.686 |
+ 0.723 |
+ 0.673 |
+ 0.651 |
+ 0.654 |
+ 0.656 |
+ 0.637 |
+ 0.645 |
+ 0.659 |
+ 0.652 |
+ 0.688 |
+ 0.663 |
+ 0.723 |
+ 0.665 |
+ 42328 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAR11_HUMAN_Meitlis_2020_gof |
+ 0.635 |
+ 0.638 |
+ 0.639 |
+ 0.639 |
+ 0.641 |
+ 0.639 |
+ 0.653 |
+ 0.607 |
+ 0.648 |
+ 0.645 |
+ 0.656 |
+ 0.621 |
+ 0.631 |
+ 0.610 |
+ 0.596 |
+ 0.642 |
+ 0.642 |
+ 0.635 |
+ 0.643 |
+ 0.634 |
+ 0.616 |
+ 0.651 |
+ 0.652 |
+ 0.649 |
+ 0.605 |
+ 0.655 |
+ 0.654 |
+ 0.650 |
+ 0.634 |
+ 0.639 |
+ 0.644 |
+ 0.629 |
+ 0.623 |
+ 0.640 |
+ 0.622 |
+ 0.621 |
+ 0.634 |
+ 0.641 |
+ 0.638 |
+ 0.639 |
+ 0.646 |
+ 0.646 |
+ 0.617 |
+ 0.594 |
+ 0.642 |
+ 0.633 |
+ 0.614 |
+ 0.632 |
+ 0.634 |
+ 0.628 |
+ 0.650 |
+ 0.653 |
+ 0.637 |
+ 0.651 |
+ 0.654 |
+ 0.641 |
+ 0.652 |
+ 0.648 |
+ 0.642 |
+ 0.646 |
+ 0.661 |
+ 0.649 |
+ 2374 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAR11_HUMAN_Meitlis_2020_lof |
+ 0.789 |
+ 0.790 |
+ 0.794 |
+ 0.791 |
+ 0.785 |
+ 0.784 |
+ 0.769 |
+ 0.736 |
+ 0.804 |
+ 0.804 |
+ 0.818 |
+ 0.765 |
+ 0.784 |
+ 0.738 |
+ 0.712 |
+ 0.757 |
+ 0.786 |
+ 0.805 |
+ 0.809 |
+ 0.789 |
+ 0.700 |
+ 0.811 |
+ 0.808 |
+ 0.800 |
+ 0.737 |
+ 0.823 |
+ 0.813 |
+ 0.831 |
+ 0.765 |
+ 0.802 |
+ 0.818 |
+ 0.784 |
+ 0.741 |
+ 0.755 |
+ 0.777 |
+ 0.766 |
+ 0.784 |
+ 0.804 |
+ 0.794 |
+ 0.786 |
+ 0.800 |
+ 0.795 |
+ 0.728 |
+ 0.720 |
+ 0.818 |
+ 0.764 |
+ 0.733 |
+ 0.806 |
+ 0.782 |
+ 0.753 |
+ 0.806 |
+ 0.798 |
+ 0.787 |
+ 0.800 |
+ 0.803 |
+ 0.793 |
+ 0.799 |
+ 0.795 |
+ 0.796 |
+ 0.798 |
+ 0.807 |
+ 0.765 |
+ 2395 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAS9_STRP1_Spencer_2017_positive |
+ 0.836 |
+ 0.838 |
+ 0.839 |
+ 0.839 |
+ 0.839 |
+ 0.839 |
+ 0.831 |
+ 0.822 |
+ 0.840 |
+ 0.840 |
+ 0.836 |
+ 0.829 |
+ 0.829 |
+ 0.829 |
+ 0.829 |
+ 0.837 |
+ 0.839 |
+ 0.842 |
+ 0.841 |
+ 0.828 |
+ 0.832 |
+ 0.831 |
+ 0.839 |
+ 0.845 |
+ 0.831 |
+ 0.840 |
+ 0.838 |
+ 0.837 |
+ 0.839 |
+ 0.844 |
+ 0.842 |
+ 0.842 |
+ 0.834 |
+ 0.825 |
+ 0.828 |
+ 0.840 |
+ 0.838 |
+ 0.839 |
+ 0.842 |
+ 0.839 |
+ 0.839 |
+ 0.841 |
+ 0.829 |
+ 0.826 |
+ 0.838 |
+ 0.829 |
+ 0.829 |
+ 0.838 |
+ 0.829 |
+ 0.828 |
+ 0.840 |
+ 0.837 |
+ 0.841 |
+ 0.838 |
+ 0.837 |
+ 0.837 |
+ 0.837 |
+ 0.840 |
+ 0.839 |
+ 0.839 |
+ 0.842 |
+ 0.837 |
+ 8117 |
+ Activity |
+ CAS9_STRP1 |
+ Medium |
+ Prokaryote |
+
+
+ CASP3_HUMAN_Roychowdhury_2020 |
+ 0.779 |
+ 0.820 |
+ 0.820 |
+ 0.812 |
+ 0.812 |
+ 0.818 |
+ 0.700 |
+ 0.759 |
+ 0.818 |
+ 0.824 |
+ 0.813 |
+ 0.796 |
+ 0.811 |
+ 0.712 |
+ 0.809 |
+ 0.826 |
+ 0.812 |
+ 0.823 |
+ 0.824 |
+ 0.798 |
+ 0.711 |
+ 0.778 |
+ 0.756 |
+ 0.829 |
+ 0.809 |
+ 0.817 |
+ 0.824 |
+ 0.812 |
+ 0.831 |
+ 0.802 |
+ 0.838 |
+ 0.821 |
+ 0.754 |
+ 0.712 |
+ 0.803 |
+ 0.819 |
+ 0.809 |
+ 0.822 |
+ 0.822 |
+ 0.823 |
+ 0.829 |
+ 0.824 |
+ 0.777 |
+ 0.681 |
+ 0.817 |
+ 0.801 |
+ 0.768 |
+ 0.807 |
+ 0.785 |
+ 0.740 |
+ 0.812 |
+ 0.801 |
+ 0.804 |
+ 0.807 |
+ 0.810 |
+ 0.807 |
+ 0.819 |
+ 0.818 |
+ 0.806 |
+ 0.805 |
+ 0.819 |
+ 0.796 |
+ 1567 |
+ Activity |
+ CASP3_HUMAN |
+ High |
+ Human |
+
+
+ CASP7_HUMAN_Roychowdhury_2020 |
+ 0.747 |
+ 0.808 |
+ 0.815 |
+ 0.816 |
+ 0.817 |
+ 0.814 |
+ 0.607 |
+ 0.786 |
+ 0.810 |
+ 0.813 |
+ 0.818 |
+ 0.813 |
+ 0.816 |
+ 0.682 |
+ 0.804 |
+ 0.813 |
+ 0.804 |
+ 0.787 |
+ 0.786 |
+ 0.810 |
+ 0.622 |
+ 0.794 |
+ 0.764 |
+ 0.788 |
+ 0.782 |
+ 0.792 |
+ 0.796 |
+ 0.785 |
+ 0.804 |
+ 0.820 |
+ 0.804 |
+ 0.786 |
+ 0.723 |
+ 0.664 |
+ 0.784 |
+ 0.785 |
+ 0.799 |
+ 0.822 |
+ 0.812 |
+ 0.812 |
+ 0.813 |
+ 0.818 |
+ 0.746 |
+ 0.620 |
+ 0.821 |
+ 0.806 |
+ 0.761 |
+ 0.823 |
+ 0.788 |
+ 0.740 |
+ 0.805 |
+ 0.811 |
+ 0.812 |
+ 0.803 |
+ 0.804 |
+ 0.802 |
+ 0.806 |
+ 0.805 |
+ 0.809 |
+ 0.809 |
+ 0.819 |
+ 0.775 |
+ 1680 |
+ Activity |
+ CASP7_HUMAN |
+ Medium |
+ Human |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI |
+ 0.853 |
+ 0.808 |
+ 0.813 |
+ 0.816 |
+ 0.825 |
+ 0.817 |
+ 0.844 |
+ 0.774 |
+ 0.777 |
+ 0.793 |
+ 0.805 |
+ 0.820 |
+ 0.815 |
+ 0.819 |
+ 0.859 |
+ 0.819 |
+ 0.820 |
+ 0.814 |
+ 0.801 |
+ 0.620 |
+ 0.821 |
+ 0.792 |
+ 0.800 |
+ 0.791 |
+ 0.798 |
+ 0.798 |
+ 0.788 |
+ 0.796 |
+ 0.800 |
+ 0.813 |
+ 0.790 |
+ 0.799 |
+ 0.755 |
+ 0.811 |
+ 0.803 |
+ 0.807 |
+ 0.831 |
+ 0.821 |
+ 0.813 |
+ 0.812 |
+ 0.810 |
+ 0.813 |
+ 0.819 |
+ 0.820 |
+ 0.795 |
+ 0.800 |
+ 0.847 |
+ 0.760 |
+ 0.818 |
+ 0.842 |
+ 0.815 |
+ 0.808 |
+ 0.815 |
+ 0.817 |
+ 0.813 |
+ 0.811 |
+ 0.813 |
+ 0.810 |
+ 0.809 |
+ 0.816 |
+ 0.810 |
+ 0.814 |
+ 1903 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X |
+ 0.910 |
+ 0.895 |
+ 0.916 |
+ 0.910 |
+ 0.910 |
+ 0.912 |
+ 0.848 |
+ 0.896 |
+ 0.909 |
+ 0.916 |
+ 0.912 |
+ 0.900 |
+ 0.903 |
+ 0.854 |
+ 0.901 |
+ 0.902 |
+ 0.908 |
+ 0.915 |
+ 0.913 |
+ 0.917 |
+ 0.861 |
+ 0.845 |
+ 0.884 |
+ 0.891 |
+ 0.889 |
+ 0.837 |
+ 0.835 |
+ 0.823 |
+ 0.913 |
+ 0.906 |
+ 0.914 |
+ 0.899 |
+ 0.733 |
+ 0.868 |
+ 0.886 |
+ 0.905 |
+ 0.912 |
+ 0.915 |
+ 0.916 |
+ 0.913 |
+ 0.912 |
+ 0.918 |
+ 0.898 |
+ 0.888 |
+ 0.901 |
+ 0.902 |
+ 0.928 |
+ 0.922 |
+ 0.925 |
+ 0.928 |
+ 0.908 |
+ 0.916 |
+ 0.922 |
+ 0.915 |
+ 0.917 |
+ 0.909 |
+ 0.911 |
+ 0.911 |
+ 0.910 |
+ 0.917 |
+ 0.918 |
+ 0.928 |
+ 2068 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBS_HUMAN_Sun_2020 |
+ 0.472 |
+ 0.468 |
+ 0.477 |
+ 0.478 |
+ 0.477 |
+ 0.480 |
+ 0.323 |
+ 0.435 |
+ 0.478 |
+ 0.455 |
+ 0.451 |
+ 0.463 |
+ 0.450 |
+ 0.331 |
+ 0.394 |
+ 0.427 |
+ 0.421 |
+ 0.447 |
+ 0.435 |
+ 0.455 |
+ 0.465 |
+ 0.419 |
+ 0.446 |
+ 0.452 |
+ 0.427 |
+ 0.433 |
+ 0.446 |
+ 0.443 |
+ 0.454 |
+ 0.470 |
+ 0.494 |
+ 0.478 |
+ 0.386 |
+ 0.465 |
+ 0.430 |
+ 0.431 |
+ 0.478 |
+ 0.479 |
+ 0.490 |
+ 0.490 |
+ 0.486 |
+ 0.474 |
+ 0.431 |
+ 0.326 |
+ 0.472 |
+ 0.461 |
+ 0.420 |
+ 0.453 |
+ 0.442 |
+ 0.372 |
+ 0.432 |
+ 0.444 |
+ 0.447 |
+ 0.439 |
+ 0.427 |
+ 0.439 |
+ 0.448 |
+ 0.444 |
+ 0.434 |
+ 0.445 |
+ 0.468 |
+ 0.413 |
+ 7217 |
+ OrganismalFitness |
+ CBS_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28 |
+ 0.881 |
+ 0.875 |
+ 0.889 |
+ 0.895 |
+ 0.897 |
+ 0.904 |
+ 0.318 |
+ 0.816 |
+ 0.890 |
+ 0.891 |
+ 0.873 |
+ 0.890 |
+ 0.890 |
+ 0.355 |
+ 0.912 |
+ 0.911 |
+ 0.907 |
+ 0.899 |
+ 0.870 |
+ 0.889 |
+ 0.815 |
+ 0.829 |
+ 0.837 |
+ 0.838 |
+ 0.859 |
+ 0.865 |
+ 0.858 |
+ 0.847 |
+ 0.849 |
+ 0.887 |
+ 0.876 |
+ 0.846 |
+ 0.641 |
+ 0.775 |
+ 0.844 |
+ 0.857 |
+ 0.884 |
+ 0.876 |
+ 0.893 |
+ 0.901 |
+ 0.892 |
+ 0.892 |
+ 0.880 |
+ 0.448 |
+ 0.833 |
+ 0.883 |
+ 0.748 |
+ 0.757 |
+ 0.914 |
+ 0.859 |
+ 0.906 |
+ 0.898 |
+ 0.903 |
+ 0.899 |
+ 0.907 |
+ 0.900 |
+ 0.905 |
+ 0.901 |
+ 0.906 |
+ 0.909 |
+ 0.896 |
+ 0.905 |
+ 2282 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CCDB_ECOLI_Adkar_2012 |
+ 0.782 |
+ 0.824 |
+ 0.845 |
+ 0.844 |
+ 0.829 |
+ 0.835 |
+ 0.563 |
+ 0.702 |
+ 0.686 |
+ 0.776 |
+ 0.753 |
+ 0.623 |
+ 0.641 |
+ 0.560 |
+ 0.577 |
+ 0.655 |
+ 0.808 |
+ 0.849 |
+ 0.813 |
+ 0.733 |
+ 0.532 |
+ 0.533 |
+ 0.483 |
+ 0.689 |
+ 0.493 |
+ 0.655 |
+ 0.533 |
+ 0.668 |
+ 0.846 |
+ 0.789 |
+ 0.877 |
+ 0.873 |
+ 0.735 |
+ 0.572 |
+ 0.560 |
+ 0.774 |
+ 0.724 |
+ 0.701 |
+ 0.817 |
+ 0.828 |
+ 0.814 |
+ 0.845 |
+ 0.590 |
+ 0.520 |
+ 0.691 |
+ 0.587 |
+ 0.655 |
+ 0.789 |
+ 0.646 |
+ 0.692 |
+ 0.788 |
+ 0.737 |
+ 0.724 |
+ 0.766 |
+ 0.826 |
+ 0.806 |
+ 0.757 |
+ 0.822 |
+ 0.815 |
+ 0.799 |
+ 0.801 |
+ 0.707 |
+ 1176 |
+ Activity |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCDB_ECOLI_Tripathi_2016 |
+ 0.946 |
+ 0.994 |
+ 0.991 |
+ 0.993 |
+ 0.989 |
+ 0.988 |
+ 0.783 |
+ 0.954 |
+ 0.890 |
+ 0.955 |
+ 0.946 |
+ 0.866 |
+ 0.890 |
+ 0.773 |
+ 0.780 |
+ 0.893 |
+ 0.967 |
+ 0.991 |
+ 1.000 |
+ 0.939 |
+ 0.763 |
+ 0.765 |
+ 0.710 |
+ 0.914 |
+ 0.714 |
+ 0.874 |
+ 0.746 |
+ 0.882 |
+ 0.996 |
+ 0.953 |
+ 0.996 |
+ 0.996 |
+ 0.917 |
+ 0.765 |
+ 0.782 |
+ 0.988 |
+ 0.913 |
+ 0.892 |
+ 0.990 |
+ 0.980 |
+ 0.981 |
+ 0.993 |
+ 0.796 |
+ 0.741 |
+ 0.905 |
+ 0.780 |
+ 0.890 |
+ 0.983 |
+ 0.899 |
+ 0.890 |
+ 0.974 |
+ 0.953 |
+ 0.935 |
+ 0.957 |
+ 0.990 |
+ 0.978 |
+ 0.967 |
+ 0.991 |
+ 0.983 |
+ 0.977 |
+ 0.964 |
+ 0.955 |
+ 1663 |
+ OrganismalFitness |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCR5_HUMAN_Gill_2023 |
+ 0.819 |
+ 0.818 |
+ 0.818 |
+ 0.819 |
+ 0.821 |
+ 0.821 |
+ 0.800 |
+ 0.841 |
+ 0.822 |
+ 0.822 |
+ 0.823 |
+ 0.820 |
+ 0.819 |
+ 0.810 |
+ 0.813 |
+ 0.820 |
+ 0.807 |
+ 0.816 |
+ 0.812 |
+ 0.823 |
+ 0.814 |
+ 0.836 |
+ 0.832 |
+ 0.830 |
+ 0.824 |
+ 0.832 |
+ 0.825 |
+ 0.828 |
+ 0.832 |
+ 0.829 |
+ 0.822 |
+ 0.815 |
+ 0.775 |
+ 0.822 |
+ 0.828 |
+ 0.829 |
+ 0.820 |
+ 0.830 |
+ 0.834 |
+ 0.822 |
+ 0.823 |
+ 0.825 |
+ 0.813 |
+ 0.772 |
+ 0.826 |
+ 0.815 |
+ 0.809 |
+ 0.824 |
+ 0.815 |
+ 0.805 |
+ 0.818 |
+ 0.814 |
+ 0.813 |
+ 0.809 |
+ 0.813 |
+ 0.820 |
+ 0.814 |
+ 0.814 |
+ 0.814 |
+ 0.817 |
+ 0.823 |
+ 0.827 |
+ 6137 |
+ Binding |
+ CCR5_HUMAN |
+ High |
+ Human |
+
+
+ CD19_HUMAN_Klesmith_2019_FMC_singles |
+ 0.566 |
+ 0.555 |
+ 0.578 |
+ 0.578 |
+ 0.566 |
+ 0.565 |
+ 0.519 |
+ 0.523 |
+ 0.526 |
+ 0.529 |
+ 0.523 |
+ 0.510 |
+ 0.543 |
+ 0.540 |
+ 0.566 |
+ 0.522 |
+ 0.517 |
+ 0.558 |
+ 0.577 |
+ 0.491 |
+ 0.575 |
+ 0.569 |
+ 0.584 |
+ 0.577 |
+ 0.511 |
+ 0.554 |
+ 0.536 |
+ 0.586 |
+ 0.565 |
+ 0.576 |
+ 0.572 |
+ 0.548 |
+ 0.527 |
+ 0.608 |
+ 0.586 |
+ 0.531 |
+ 0.598 |
+ 0.587 |
+ 0.578 |
+ 0.578 |
+ 0.574 |
+ 0.573 |
+ 0.552 |
+ 0.529 |
+ 0.573 |
+ 0.562 |
+ 0.693 |
+ 0.670 |
+ 0.683 |
+ 0.553 |
+ 0.574 |
+ 0.564 |
+ 0.591 |
+ 0.600 |
+ 0.580 |
+ 0.595 |
+ 0.614 |
+ 0.573 |
+ 0.610 |
+ 0.600 |
+ 0.687 |
+ 0.621 |
+ 3761 |
+ Binding |
+ CD19_HUMAN |
+ Low |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_abundance |
+ 0.825 |
+ 0.857 |
+ 0.852 |
+ 0.859 |
+ 0.867 |
+ 0.864 |
+ 0.842 |
+ 0.859 |
+ 0.856 |
+ 0.866 |
+ 0.857 |
+ 0.867 |
+ 0.874 |
+ 0.819 |
+ 0.854 |
+ 0.859 |
+ 0.873 |
+ 0.870 |
+ 0.868 |
+ 0.867 |
+ 0.853 |
+ 0.830 |
+ 0.865 |
+ 0.863 |
+ 0.867 |
+ 0.866 |
+ 0.868 |
+ 0.868 |
+ 0.868 |
+ 0.861 |
+ 0.859 |
+ 0.846 |
+ 0.634 |
+ 0.861 |
+ 0.862 |
+ 0.862 |
+ 0.869 |
+ 0.868 |
+ 0.870 |
+ 0.868 |
+ 0.868 |
+ 0.866 |
+ 0.859 |
+ 0.727 |
+ 0.862 |
+ 0.867 |
+ 0.853 |
+ 0.862 |
+ 0.875 |
+ 0.770 |
+ 0.868 |
+ 0.869 |
+ 0.874 |
+ 0.870 |
+ 0.869 |
+ 0.872 |
+ 0.874 |
+ 0.874 |
+ 0.871 |
+ 0.873 |
+ 0.871 |
+ 0.856 |
+ 6370 |
+ Expression |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_activity |
+ 0.753 |
+ 0.814 |
+ 0.815 |
+ 0.827 |
+ 0.832 |
+ 0.830 |
+ 0.796 |
+ 0.834 |
+ 0.801 |
+ 0.833 |
+ 0.814 |
+ 0.839 |
+ 0.857 |
+ 0.753 |
+ 0.827 |
+ 0.840 |
+ 0.867 |
+ 0.852 |
+ 0.829 |
+ 0.847 |
+ 0.810 |
+ 0.773 |
+ 0.835 |
+ 0.814 |
+ 0.841 |
+ 0.827 |
+ 0.834 |
+ 0.821 |
+ 0.825 |
+ 0.810 |
+ 0.800 |
+ 0.753 |
+ 0.501 |
+ 0.834 |
+ 0.833 |
+ 0.836 |
+ 0.842 |
+ 0.845 |
+ 0.853 |
+ 0.845 |
+ 0.841 |
+ 0.837 |
+ 0.829 |
+ 0.589 |
+ 0.825 |
+ 0.856 |
+ 0.810 |
+ 0.817 |
+ 0.842 |
+ 0.653 |
+ 0.857 |
+ 0.849 |
+ 0.851 |
+ 0.852 |
+ 0.857 |
+ 0.861 |
+ 0.854 |
+ 0.854 |
+ 0.859 |
+ 0.863 |
+ 0.852 |
+ 0.827 |
+ 6142 |
+ Binding |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM |
+ 0.824 |
+ 0.841 |
+ 0.847 |
+ 0.852 |
+ 0.851 |
+ 0.849 |
+ 0.672 |
+ 0.807 |
+ 0.858 |
+ 0.848 |
+ 0.838 |
+ 0.870 |
+ 0.874 |
+ 0.751 |
+ 0.895 |
+ 0.860 |
+ 0.789 |
+ 0.835 |
+ 0.822 |
+ 0.845 |
+ 0.766 |
+ 0.745 |
+ 0.815 |
+ 0.841 |
+ 0.726 |
+ 0.866 |
+ 0.871 |
+ 0.847 |
+ 0.842 |
+ 0.842 |
+ 0.857 |
+ 0.843 |
+ 0.678 |
+ 0.729 |
+ 0.833 |
+ 0.824 |
+ 0.853 |
+ 0.867 |
+ 0.870 |
+ 0.852 |
+ 0.864 |
+ 0.865 |
+ 0.756 |
+ 0.745 |
+ 0.844 |
+ 0.862 |
+ 0.872 |
+ 0.847 |
+ 0.902 |
+ 0.887 |
+ 0.840 |
+ 0.839 |
+ 0.843 |
+ 0.847 |
+ 0.842 |
+ 0.849 |
+ 0.845 |
+ 0.851 |
+ 0.841 |
+ 0.847 |
+ 0.855 |
+ 0.873 |
+ 3295 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX |
+ 0.746 |
+ 0.847 |
+ 0.798 |
+ 0.800 |
+ 0.821 |
+ 0.815 |
+ 0.646 |
+ 0.781 |
+ 0.863 |
+ 0.867 |
+ 0.777 |
+ 0.671 |
+ 0.673 |
+ 0.617 |
+ 0.637 |
+ 0.679 |
+ 0.728 |
+ 0.797 |
+ 0.793 |
+ 0.768 |
+ 0.626 |
+ 0.618 |
+ 0.643 |
+ 0.563 |
+ 0.544 |
+ 0.564 |
+ 0.623 |
+ 0.622 |
+ 0.796 |
+ 0.779 |
+ 0.802 |
+ 0.768 |
+ 0.676 |
+ 0.617 |
+ 0.597 |
+ 0.629 |
+ 0.742 |
+ 0.739 |
+ 0.769 |
+ 0.808 |
+ 0.806 |
+ 0.812 |
+ 0.626 |
+ 0.633 |
+ 0.731 |
+ 0.644 |
+ 0.781 |
+ 0.787 |
+ 0.870 |
+ 0.835 |
+ 0.711 |
+ 0.782 |
+ 0.764 |
+ 0.760 |
+ 0.721 |
+ 0.760 |
+ 0.747 |
+ 0.739 |
+ 0.747 |
+ 0.747 |
+ 0.824 |
+ 0.750 |
+ 1580 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ D7PM05_CLYGR_Somermeyer_2022 |
+ 0.630 |
+ 0.771 |
+ 0.736 |
+ 0.695 |
+ 0.760 |
+ 0.760 |
+ 0.320 |
+ 0.622 |
+ 0.792 |
+ 0.789 |
+ 0.595 |
+ 0.325 |
+ 0.328 |
+ 0.327 |
+ 0.309 |
+ 0.302 |
+ 0.312 |
+ 0.341 |
+ 0.357 |
+ 0.621 |
+ 0.291 |
+ 0.270 |
+ 0.362 |
+ 0.331 |
+ 0.261 |
+ 0.305 |
+ 0.313 |
+ 0.393 |
+ 0.478 |
+ 0.773 |
+ 0.752 |
+ 0.756 |
+ 0.308 |
+ 0.325 |
+ 0.360 |
+ 0.363 |
+ 0.691 |
+ 0.690 |
+ 0.686 |
+ 0.764 |
+ 0.763 |
+ 0.753 |
+ 0.443 |
+ 0.445 |
+ 0.450 |
+ 0.450 |
+ 0.573 |
+ 0.593 |
+ 0.611 |
+ 0.594 |
+ 0.626 |
+ 0.620 |
+ 0.632 |
+ 0.628 |
+ 0.614 |
+ 0.629 |
+ 0.608 |
+ 0.627 |
+ 0.615 |
+ 0.625 |
+ 0.513 |
+ 0.386 |
+ 24515 |
+ Activity |
+ D7PM05_CLYGR |
+ Low |
+ Eukaryote |
+
+
+ DLG4_HUMAN_Faure_2021 |
+ 0.870 |
+ 0.859 |
+ 0.871 |
+ 0.863 |
+ 0.876 |
+ 0.876 |
+ 0.872 |
+ 0.863 |
+ 0.834 |
+ 0.841 |
+ 0.816 |
+ 0.865 |
+ 0.867 |
+ 0.888 |
+ 0.914 |
+ 0.901 |
+ 0.865 |
+ 0.853 |
+ 0.826 |
+ 0.878 |
+ 0.851 |
+ 0.849 |
+ 0.836 |
+ 0.823 |
+ 0.863 |
+ 0.851 |
+ 0.843 |
+ 0.848 |
+ 0.821 |
+ 0.874 |
+ 0.853 |
+ 0.852 |
+ 0.817 |
+ 0.849 |
+ 0.869 |
+ 0.836 |
+ 0.886 |
+ 0.894 |
+ 0.880 |
+ 0.883 |
+ 0.888 |
+ 0.885 |
+ 0.877 |
+ 0.727 |
+ 0.832 |
+ 0.867 |
+ 0.869 |
+ 0.817 |
+ 0.895 |
+ 0.772 |
+ 0.837 |
+ 0.813 |
+ 0.835 |
+ 0.833 |
+ 0.845 |
+ 0.846 |
+ 0.829 |
+ 0.827 |
+ 0.829 |
+ 0.835 |
+ 0.857 |
+ 0.914 |
+ 6976 |
+ OrganismalFitness |
+ DLG4_HUMAN |
+ Low |
+ Human |
+
+
+ DLG4_RAT_McLaughlin_2012 |
+ 0.921 |
+ 0.930 |
+ 0.925 |
+ 0.925 |
+ 0.931 |
+ 0.932 |
+ 0.926 |
+ 0.923 |
+ 0.929 |
+ 0.929 |
+ 0.928 |
+ 0.927 |
+ 0.927 |
+ 0.912 |
+ 0.914 |
+ 0.936 |
+ 0.928 |
+ 0.927 |
+ 0.928 |
+ 0.921 |
+ 0.922 |
+ 0.921 |
+ 0.924 |
+ 0.924 |
+ 0.923 |
+ 0.926 |
+ 0.922 |
+ 0.921 |
+ 0.927 |
+ 0.921 |
+ 0.935 |
+ 0.936 |
+ 0.879 |
+ 0.917 |
+ 0.925 |
+ 0.911 |
+ 0.922 |
+ 0.924 |
+ 0.920 |
+ 0.927 |
+ 0.928 |
+ 0.932 |
+ 0.929 |
+ 0.800 |
+ 0.927 |
+ 0.922 |
+ 0.907 |
+ 0.910 |
+ 0.906 |
+ 0.879 |
+ 0.927 |
+ 0.882 |
+ 0.919 |
+ 0.921 |
+ 0.923 |
+ 0.920 |
+ 0.923 |
+ 0.922 |
+ 0.925 |
+ 0.923 |
+ 0.926 |
+ 0.920 |
+ 1576 |
+ Binding |
+ DLG4_RAT |
+ Low |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC |
+ 0.841 |
+ 0.877 |
+ 0.860 |
+ 0.848 |
+ 0.831 |
+ 0.855 |
+ 0.751 |
+ 0.876 |
+ 0.898 |
+ 0.906 |
+ 0.799 |
+ 0.808 |
+ 0.789 |
+ 0.770 |
+ 0.802 |
+ 0.801 |
+ 0.823 |
+ 0.819 |
+ 0.902 |
+ 0.894 |
+ 0.738 |
+ 0.784 |
+ 0.759 |
+ 0.765 |
+ 0.763 |
+ 0.740 |
+ 0.746 |
+ 0.782 |
+ 0.854 |
+ 0.891 |
+ 0.906 |
+ 0.883 |
+ 0.778 |
+ 0.770 |
+ 0.748 |
+ 0.764 |
+ 0.839 |
+ 0.829 |
+ 0.830 |
+ 0.858 |
+ 0.858 |
+ 0.845 |
+ 0.787 |
+ 0.774 |
+ 0.818 |
+ 0.785 |
+ 0.917 |
+ 0.896 |
+ 0.926 |
+ 0.918 |
+ 0.893 |
+ 0.899 |
+ 0.906 |
+ 0.892 |
+ 0.899 |
+ 0.885 |
+ 0.872 |
+ 0.879 |
+ 0.892 |
+ 0.889 |
+ 0.854 |
+ 0.850 |
+ 1008 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1 |
+ 0.900 |
+ 0.890 |
+ 0.901 |
+ 0.898 |
+ 0.904 |
+ 0.899 |
+ 0.888 |
+ 0.835 |
+ 0.891 |
+ 0.899 |
+ 0.910 |
+ 0.907 |
+ 0.924 |
+ 0.903 |
+ 0.936 |
+ 0.929 |
+ 0.933 |
+ 0.923 |
+ 0.923 |
+ 0.910 |
+ 0.900 |
+ 0.898 |
+ 0.908 |
+ 0.903 |
+ 0.904 |
+ 0.885 |
+ 0.902 |
+ 0.892 |
+ 0.889 |
+ 0.883 |
+ 0.880 |
+ 0.879 |
+ 0.768 |
+ 0.896 |
+ 0.887 |
+ 0.907 |
+ 0.899 |
+ 0.899 |
+ 0.906 |
+ 0.897 |
+ 0.898 |
+ 0.902 |
+ 0.918 |
+ 0.729 |
+ 0.887 |
+ 0.920 |
+ 0.920 |
+ 0.885 |
+ 0.923 |
+ 0.925 |
+ 0.928 |
+ 0.935 |
+ 0.938 |
+ 0.938 |
+ 0.939 |
+ 0.937 |
+ 0.936 |
+ 0.940 |
+ 0.933 |
+ 0.942 |
+ 0.898 |
+ 0.926 |
+ 2264 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y |
+ 0.774 |
+ 0.795 |
+ 0.789 |
+ 0.776 |
+ 0.761 |
+ 0.785 |
+ 0.546 |
+ 0.742 |
+ 0.671 |
+ 0.644 |
+ 0.791 |
+ 0.674 |
+ 0.707 |
+ 0.521 |
+ 0.654 |
+ 0.679 |
+ 0.787 |
+ 0.795 |
+ 0.780 |
+ 0.828 |
+ 0.730 |
+ 0.796 |
+ 0.793 |
+ 0.774 |
+ 0.648 |
+ 0.766 |
+ 0.650 |
+ 0.743 |
+ 0.788 |
+ 0.688 |
+ 0.808 |
+ 0.782 |
+ 0.696 |
+ 0.571 |
+ 0.586 |
+ 0.561 |
+ 0.779 |
+ 0.775 |
+ 0.795 |
+ 0.798 |
+ 0.783 |
+ 0.801 |
+ 0.610 |
+ 0.533 |
+ 0.821 |
+ 0.699 |
+ 0.797 |
+ 0.828 |
+ 0.822 |
+ 0.847 |
+ 0.811 |
+ 0.814 |
+ 0.806 |
+ 0.820 |
+ 0.819 |
+ 0.817 |
+ 0.817 |
+ 0.819 |
+ 0.812 |
+ 0.821 |
+ 0.838 |
+ 0.804 |
+ 2915 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ DYR_ECOLI_Nguyen_2023 |
+ 0.961 |
+ 0.962 |
+ 0.961 |
+ 0.962 |
+ 0.962 |
+ 0.960 |
+ 0.893 |
+ 0.967 |
+ 0.962 |
+ 0.958 |
+ 0.960 |
+ 0.967 |
+ 0.964 |
+ 0.921 |
+ 0.963 |
+ 0.961 |
+ 0.966 |
+ 0.963 |
+ 0.963 |
+ 0.963 |
+ 0.934 |
+ 0.961 |
+ 0.964 |
+ 0.964 |
+ 0.965 |
+ 0.959 |
+ 0.965 |
+ 0.961 |
+ 0.965 |
+ 0.960 |
+ 0.964 |
+ 0.962 |
+ 0.901 |
+ 0.961 |
+ 0.962 |
+ 0.964 |
+ 0.959 |
+ 0.963 |
+ 0.961 |
+ 0.960 |
+ 0.960 |
+ 0.960 |
+ 0.962 |
+ 0.903 |
+ 0.962 |
+ 0.964 |
+ 0.936 |
+ 0.963 |
+ 0.966 |
+ 0.937 |
+ 0.968 |
+ 0.963 |
+ 0.964 |
+ 0.963 |
+ 0.966 |
+ 0.967 |
+ 0.966 |
+ 0.964 |
+ 0.966 |
+ 0.966 |
+ 0.965 |
+ 0.962 |
+ 2916 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ High |
+ Prokaryote |
+
+
+ DYR_ECOLI_Thompson_2019 |
+ 0.865 |
+ 0.885 |
+ 0.884 |
+ 0.887 |
+ 0.891 |
+ 0.888 |
+ 0.797 |
+ 0.865 |
+ 0.888 |
+ 0.883 |
+ 0.862 |
+ 0.882 |
+ 0.866 |
+ 0.829 |
+ 0.870 |
+ 0.870 |
+ 0.877 |
+ 0.874 |
+ 0.869 |
+ 0.876 |
+ 0.815 |
+ 0.882 |
+ 0.865 |
+ 0.867 |
+ 0.867 |
+ 0.889 |
+ 0.900 |
+ 0.870 |
+ 0.898 |
+ 0.900 |
+ 0.880 |
+ 0.878 |
+ 0.816 |
+ 0.882 |
+ 0.895 |
+ 0.903 |
+ 0.884 |
+ 0.892 |
+ 0.891 |
+ 0.890 |
+ 0.893 |
+ 0.892 |
+ 0.868 |
+ 0.791 |
+ 0.873 |
+ 0.875 |
+ 0.845 |
+ 0.869 |
+ 0.873 |
+ 0.852 |
+ 0.875 |
+ 0.868 |
+ 0.864 |
+ 0.871 |
+ 0.873 |
+ 0.885 |
+ 0.870 |
+ 0.872 |
+ 0.872 |
+ 0.871 |
+ 0.863 |
+ 0.876 |
+ 2363 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ ENV_HV1B9_DuenasDecamp_2016 |
+ 0.643 |
+ 0.630 |
+ 0.538 |
+ 0.601 |
+ 0.635 |
+ 0.631 |
+ 0.249 |
+ 0.662 |
+ 0.628 |
+ 0.634 |
+ 0.538 |
+ 0.623 |
+ 0.624 |
+ 0.299 |
+ 0.278 |
+ 0.305 |
+ 0.301 |
+ 0.298 |
+ 0.469 |
+ 0.663 |
+ 0.590 |
+ 0.645 |
+ 0.662 |
+ 0.660 |
+ 0.564 |
+ 0.677 |
+ 0.587 |
+ 0.655 |
+ 0.669 |
+ 0.577 |
+ 0.709 |
+ 0.647 |
+ 0.564 |
+ 0.618 |
+ 0.637 |
+ 0.668 |
+ 0.642 |
+ 0.631 |
+ 0.653 |
+ 0.639 |
+ 0.643 |
+ 0.658 |
+ 0.566 |
+ 0.244 |
+ 0.629 |
+ 0.623 |
+ 0.549 |
+ 0.578 |
+ 0.497 |
+ 0.410 |
+ 0.431 |
+ 0.527 |
+ 0.470 |
+ 0.431 |
+ 0.528 |
+ 0.488 |
+ 0.495 |
+ 0.510 |
+ 0.378 |
+ 0.487 |
+ 0.353 |
+ 0.319 |
+ 375 |
+ OrganismalFitness |
+ ENV_HV1B9 |
+ Medium |
+ Virus |
+
+
+ ENV_HV1BR_Haddox_2016 |
+ 0.304 |
+ 0.315 |
+ 0.330 |
+ 0.343 |
+ 0.337 |
+ 0.336 |
+ 0.145 |
+ 0.317 |
+ 0.302 |
+ 0.307 |
+ 0.258 |
+ 0.287 |
+ 0.289 |
+ 0.165 |
+ 0.174 |
+ 0.170 |
+ 0.180 |
+ 0.182 |
+ 0.209 |
+ 0.342 |
+ 0.303 |
+ 0.313 |
+ 0.331 |
+ 0.339 |
+ 0.294 |
+ 0.325 |
+ 0.337 |
+ 0.318 |
+ 0.342 |
+ 0.267 |
+ 0.302 |
+ 0.271 |
+ 0.216 |
+ 0.300 |
+ 0.314 |
+ 0.324 |
+ 0.312 |
+ 0.314 |
+ 0.321 |
+ 0.335 |
+ 0.337 |
+ 0.341 |
+ 0.268 |
+ 0.143 |
+ 0.294 |
+ 0.275 |
+ 0.225 |
+ 0.293 |
+ 0.177 |
+ 0.199 |
+ 0.211 |
+ 0.207 |
+ 0.225 |
+ 0.225 |
+ 0.223 |
+ 0.220 |
+ 0.216 |
+ 0.209 |
+ 0.212 |
+ 0.222 |
+ 0.206 |
+ 0.183 |
+ 12863 |
+ OrganismalFitness |
+ ENV_HV1BR |
+ Medium |
+ Virus |
+
+
+ ENVZ_ECOLI_Ghose_2023 |
+ 0.758 |
+ 0.754 |
+ 0.774 |
+ 0.791 |
+ 0.808 |
+ 0.810 |
+ 0.701 |
+ 0.671 |
+ 0.783 |
+ 0.797 |
+ 0.803 |
+ 0.790 |
+ 0.805 |
+ 0.721 |
+ 0.781 |
+ 0.754 |
+ 0.756 |
+ 0.739 |
+ 0.722 |
+ 0.772 |
+ 0.769 |
+ 0.732 |
+ 0.769 |
+ 0.776 |
+ 0.777 |
+ 0.753 |
+ 0.692 |
+ 0.802 |
+ 0.724 |
+ 0.848 |
+ 0.798 |
+ 0.808 |
+ 0.800 |
+ 0.756 |
+ 0.789 |
+ 0.762 |
+ 0.766 |
+ 0.763 |
+ 0.767 |
+ 0.814 |
+ 0.813 |
+ 0.817 |
+ 0.714 |
+ 0.753 |
+ 0.735 |
+ 0.711 |
+ 0.681 |
+ 0.788 |
+ 0.728 |
+ 0.692 |
+ 0.762 |
+ 0.775 |
+ 0.767 |
+ 0.805 |
+ 0.793 |
+ 0.780 |
+ 0.775 |
+ 0.767 |
+ 0.817 |
+ 0.773 |
+ 0.728 |
+ 0.776 |
+ 1121 |
+ Activity |
+ ENVZ_ECOLI |
+ High |
+ Prokaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M |
+ 0.917 |
+ 0.935 |
+ 0.937 |
+ 0.939 |
+ 0.935 |
+ 0.939 |
+ 0.363 |
+ 0.912 |
+ 0.938 |
+ 0.939 |
+ 0.912 |
+ 0.917 |
+ 0.922 |
+ 0.391 |
+ 0.915 |
+ 0.919 |
+ 0.937 |
+ 0.940 |
+ 0.936 |
+ 0.938 |
+ 0.921 |
+ 0.924 |
+ 0.917 |
+ 0.920 |
+ 0.926 |
+ 0.925 |
+ 0.930 |
+ 0.932 |
+ 0.922 |
+ 0.912 |
+ 0.929 |
+ 0.910 |
+ 0.883 |
+ 0.930 |
+ 0.913 |
+ 0.931 |
+ 0.946 |
+ 0.934 |
+ 0.942 |
+ 0.943 |
+ 0.937 |
+ 0.941 |
+ 0.886 |
+ 0.551 |
+ 0.937 |
+ 0.906 |
+ 0.947 |
+ 0.931 |
+ 0.949 |
+ 0.942 |
+ 0.938 |
+ 0.939 |
+ 0.938 |
+ 0.945 |
+ 0.944 |
+ 0.941 |
+ 0.937 |
+ 0.941 |
+ 0.942 |
+ 0.941 |
+ 0.948 |
+ 0.946 |
+ 1960 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ ERBB2_HUMAN_Elazar_2016 |
+ 0.782 |
+ 0.884 |
+ 0.794 |
+ 0.766 |
+ 0.805 |
+ 0.821 |
+ 0.798 |
+ 0.892 |
+ 0.836 |
+ 0.899 |
+ 0.848 |
+ 0.744 |
+ 0.790 |
+ 0.866 |
+ 0.789 |
+ 0.788 |
+ 0.881 |
+ 0.899 |
+ 0.896 |
+ 0.853 |
+ 0.820 |
+ 0.866 |
+ 0.897 |
+ 0.907 |
+ 0.667 |
+ 0.898 |
+ 0.914 |
+ 0.919 |
+ 0.899 |
+ 0.772 |
+ 0.884 |
+ 0.847 |
+ 0.828 |
+ 0.878 |
+ 0.652 |
+ 0.908 |
+ 0.858 |
+ 0.740 |
+ 0.893 |
+ 0.823 |
+ 0.797 |
+ 0.836 |
+ 0.793 |
+ 0.758 |
+ 0.890 |
+ 0.767 |
+ 0.917 |
+ 0.928 |
+ 0.741 |
+ 0.825 |
+ 0.827 |
+ 0.844 |
+ 0.786 |
+ 0.872 |
+ 0.857 |
+ 0.775 |
+ 0.804 |
+ 0.866 |
+ 0.826 |
+ 0.829 |
+ 0.779 |
+ 0.772 |
+ 326 |
+ Expression |
+ ERBB2_HUMAN |
+ Low |
+ Human |
+
+
+ ESTA_BACSU_Nutschel_2020 |
+ 0.708 |
+ 0.774 |
+ 0.774 |
+ 0.777 |
+ 0.773 |
+ 0.779 |
+ 0.657 |
+ 0.690 |
+ 0.737 |
+ 0.796 |
+ 0.691 |
+ 0.729 |
+ 0.750 |
+ 0.662 |
+ 0.713 |
+ 0.690 |
+ 0.693 |
+ 0.690 |
+ 0.717 |
+ 0.731 |
+ 0.663 |
+ 0.663 |
+ 0.711 |
+ 0.667 |
+ 0.718 |
+ 0.704 |
+ 0.735 |
+ 0.739 |
+ 0.775 |
+ 0.753 |
+ 0.732 |
+ 0.737 |
+ 0.595 |
+ 0.667 |
+ 0.731 |
+ 0.686 |
+ 0.725 |
+ 0.741 |
+ 0.700 |
+ 0.784 |
+ 0.783 |
+ 0.764 |
+ 0.719 |
+ 0.605 |
+ 0.750 |
+ 0.728 |
+ 0.838 |
+ 0.804 |
+ 0.792 |
+ 0.757 |
+ 0.755 |
+ 0.701 |
+ 0.740 |
+ 0.729 |
+ 0.732 |
+ 0.724 |
+ 0.720 |
+ 0.714 |
+ 0.720 |
+ 0.735 |
+ 0.744 |
+ 0.719 |
+ 2172 |
+ Stability |
+ ESTA_BACSU |
+ High |
+ Prokaryote |
+
+
+ F7YBW8_MESOW_Aakre_2015 |
+ 0.143 |
+ 0.824 |
+ 0.791 |
+ 0.884 |
+ 0.845 |
+ 0.859 |
+ 0.159 |
+ 0.660 |
+ 0.675 |
+ 0.712 |
+ 0.837 |
+ 0.703 |
+ 0.726 |
+ 0.123 |
+ 0.122 |
+ 0.192 |
+ 0.616 |
+ 0.590 |
+ 0.818 |
+ 0.776 |
+ 0.120 |
+ 0.112 |
+ 0.121 |
+ 0.154 |
+ 0.170 |
+ 0.455 |
+ 0.121 |
+ 0.671 |
+ 0.852 |
+ 0.797 |
+ 0.914 |
+ 0.892 |
+ 0.161 |
+ 0.119 |
+ 0.117 |
+ 0.921 |
+ 0.144 |
+ 0.127 |
+ 0.879 |
+ 0.727 |
+ 0.700 |
+ 0.893 |
+ 0.175 |
+ 0.160 |
+ 0.466 |
+ 0.196 |
+ 0.187 |
+ 0.520 |
+ 0.200 |
+ 0.138 |
+ 0.846 |
+ 0.870 |
+ 0.825 |
+ 0.891 |
+ 0.872 |
+ 0.882 |
+ 0.834 |
+ 0.845 |
+ 0.857 |
+ 0.868 |
+ 0.471 |
+ 0.111 |
+ 9192 |
+ OrganismalFitness |
+ F7YBW8_MESOW |
+ High |
+ Prokaryote |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U |
+ 0.666 |
+ 0.702 |
+ 0.687 |
+ 0.686 |
+ 0.691 |
+ 0.684 |
+ 0.529 |
+ 0.638 |
+ 0.685 |
+ 0.728 |
+ 0.736 |
+ 0.638 |
+ 0.690 |
+ 0.521 |
+ 0.742 |
+ 0.759 |
+ 0.715 |
+ 0.709 |
+ 0.718 |
+ 0.693 |
+ 0.601 |
+ 0.675 |
+ 0.668 |
+ 0.654 |
+ 0.669 |
+ 0.670 |
+ 0.694 |
+ 0.679 |
+ 0.701 |
+ 0.698 |
+ 0.714 |
+ 0.677 |
+ 0.578 |
+ 0.522 |
+ 0.578 |
+ 0.656 |
+ 0.639 |
+ 0.634 |
+ 0.692 |
+ 0.668 |
+ 0.657 |
+ 0.699 |
+ 0.703 |
+ 0.597 |
+ 0.745 |
+ 0.719 |
+ 0.788 |
+ 0.773 |
+ 0.767 |
+ 0.745 |
+ 0.679 |
+ 0.710 |
+ 0.726 |
+ 0.729 |
+ 0.718 |
+ 0.703 |
+ 0.699 |
+ 0.698 |
+ 0.716 |
+ 0.697 |
+ 0.746 |
+ 0.745 |
+ 1886 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ FKBP3_HUMAN_Tsuboyama_2023_2KFV |
+ 0.870 |
+ 0.869 |
+ 0.875 |
+ 0.876 |
+ 0.878 |
+ 0.878 |
+ 0.737 |
+ 0.829 |
+ 0.820 |
+ 0.822 |
+ 0.735 |
+ 0.728 |
+ 0.741 |
+ 0.729 |
+ 0.718 |
+ 0.712 |
+ 0.761 |
+ 0.774 |
+ 0.817 |
+ 0.818 |
+ 0.758 |
+ 0.754 |
+ 0.716 |
+ 0.837 |
+ 0.732 |
+ 0.831 |
+ 0.815 |
+ 0.768 |
+ 0.799 |
+ 0.844 |
+ 0.816 |
+ 0.807 |
+ 0.650 |
+ 0.696 |
+ 0.724 |
+ 0.800 |
+ 0.869 |
+ 0.870 |
+ 0.863 |
+ 0.880 |
+ 0.876 |
+ 0.881 |
+ 0.750 |
+ 0.738 |
+ 0.744 |
+ 0.748 |
+ 0.881 |
+ 0.860 |
+ 0.892 |
+ 0.884 |
+ 0.787 |
+ 0.845 |
+ 0.859 |
+ 0.836 |
+ 0.858 |
+ 0.827 |
+ 0.842 |
+ 0.831 |
+ 0.800 |
+ 0.844 |
+ 0.888 |
+ 0.820 |
+ 1237 |
+ Stability |
+ FKBP3_HUMAN |
+ Medium |
+ Human |
+
+
+ GAL4_YEAST_Kitzman_2015 |
+ 0.714 |
+ 0.774 |
+ 0.820 |
+ 0.798 |
+ 0.793 |
+ 0.802 |
+ 0.705 |
+ 0.587 |
+ 0.842 |
+ 0.818 |
+ 0.825 |
+ 0.775 |
+ 0.775 |
+ 0.729 |
+ 0.759 |
+ 0.771 |
+ 0.832 |
+ 0.847 |
+ 0.859 |
+ 0.783 |
+ 0.776 |
+ 0.729 |
+ 0.751 |
+ 0.738 |
+ 0.754 |
+ 0.776 |
+ 0.794 |
+ 0.778 |
+ 0.863 |
+ 0.809 |
+ 0.863 |
+ 0.851 |
+ 0.670 |
+ 0.706 |
+ 0.751 |
+ 0.714 |
+ 0.813 |
+ 0.807 |
+ 0.828 |
+ 0.794 |
+ 0.802 |
+ 0.811 |
+ 0.743 |
+ 0.777 |
+ 0.824 |
+ 0.750 |
+ 0.638 |
+ 0.804 |
+ 0.603 |
+ 0.652 |
+ 0.851 |
+ 0.816 |
+ 0.843 |
+ 0.833 |
+ 0.823 |
+ 0.818 |
+ 0.858 |
+ 0.817 |
+ 0.832 |
+ 0.835 |
+ 0.778 |
+ 0.764 |
+ 1195 |
+ OrganismalFitness |
+ GAL4_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ GCN4_YEAST_Staller_2018 |
+ 0.759 |
+ 0.754 |
+ 0.748 |
+ 0.754 |
+ 0.752 |
+ 0.749 |
+ 0.664 |
+ 0.683 |
+ 0.759 |
+ 0.756 |
+ 0.742 |
+ 0.739 |
+ 0.747 |
+ 0.727 |
+ 0.719 |
+ 0.748 |
+ 0.760 |
+ 0.757 |
+ 0.752 |
+ 0.756 |
+ 0.647 |
+ 0.641 |
+ 0.652 |
+ 0.647 |
+ 0.637 |
+ 0.634 |
+ 0.632 |
+ 0.645 |
+ 0.681 |
+ 0.758 |
+ 0.748 |
+ 0.742 |
+ 0.661 |
+ 0.644 |
+ 0.638 |
+ 0.754 |
+ 0.755 |
+ 0.757 |
+ 0.764 |
+ 0.753 |
+ 0.753 |
+ 0.762 |
+ 0.723 |
+ 0.735 |
+ 0.723 |
+ 0.735 |
+ 0.743 |
+ 0.759 |
+ 0.735 |
+ 0.708 |
+ 0.744 |
+ 0.740 |
+ 0.741 |
+ 0.741 |
+ 0.746 |
+ 0.747 |
+ 0.741 |
+ 0.740 |
+ 0.745 |
+ 0.742 |
+ 0.709 |
+ 0.703 |
+ 2638 |
+ Binding |
+ GCN4_YEAST |
+ Low |
+ Eukaryote |
+
+
+ GDIA_HUMAN_Silverstein_2021 |
+ 0.879 |
+ 0.859 |
+ 0.860 |
+ 0.859 |
+ 0.849 |
+ 0.857 |
+ 0.769 |
+ 0.871 |
+ 0.872 |
+ 0.869 |
+ 0.864 |
+ 0.860 |
+ 0.868 |
+ 0.737 |
+ 0.795 |
+ 0.878 |
+ 0.864 |
+ 0.867 |
+ 0.877 |
+ 0.864 |
+ 0.779 |
+ 0.874 |
+ 0.865 |
+ 0.872 |
+ 0.854 |
+ 0.876 |
+ 0.870 |
+ 0.870 |
+ 0.871 |
+ 0.868 |
+ 0.842 |
+ 0.844 |
+ 0.804 |
+ 0.832 |
+ 0.860 |
+ 0.843 |
+ 0.877 |
+ 0.861 |
+ 0.858 |
+ 0.863 |
+ 0.861 |
+ 0.858 |
+ 0.778 |
+ 0.706 |
+ 0.886 |
+ 0.790 |
+ 0.880 |
+ 0.879 |
+ 0.862 |
+ 0.742 |
+ 0.852 |
+ 0.851 |
+ 0.861 |
+ 0.861 |
+ 0.858 |
+ 0.872 |
+ 0.858 |
+ 0.860 |
+ 0.862 |
+ 0.856 |
+ 0.866 |
+ 0.844 |
+ 1154 |
+ OrganismalFitness |
+ GDIA_HUMAN |
+ Low |
+ Human |
+
+
+ GFP_AEQVI_Sarkisyan_2016 |
+ 0.917 |
+ 0.907 |
+ 0.913 |
+ 0.913 |
+ 0.913 |
+ 0.913 |
+ 0.416 |
+ 0.912 |
+ 0.916 |
+ 0.915 |
+ 0.812 |
+ 0.470 |
+ 0.472 |
+ 0.464 |
+ 0.493 |
+ 0.449 |
+ 0.456 |
+ 0.500 |
+ 0.562 |
+ 0.878 |
+ 0.443 |
+ 0.457 |
+ 0.514 |
+ 0.471 |
+ 0.403 |
+ 0.524 |
+ 0.630 |
+ 0.912 |
+ 0.918 |
+ 0.923 |
+ 0.899 |
+ 0.899 |
+ 0.470 |
+ 0.435 |
+ 0.543 |
+ 0.902 |
+ 0.924 |
+ 0.925 |
+ 0.922 |
+ 0.919 |
+ 0.920 |
+ 0.931 |
+ 0.650 |
+ 0.598 |
+ 0.653 |
+ 0.658 |
+ 0.906 |
+ 0.898 |
+ 0.923 |
+ 0.883 |
+ 0.875 |
+ 0.882 |
+ 0.890 |
+ 0.895 |
+ 0.889 |
+ 0.889 |
+ 0.887 |
+ 0.889 |
+ 0.878 |
+ 0.890 |
+ 0.871 |
+ 0.748 |
+ 51714 |
+ Activity |
+ GFP_AEQVI |
+ Low |
+ Eukaryote |
+
+
+ GLPA_HUMAN_Elazar_2016 |
+ 0.765 |
+ 0.716 |
+ 0.777 |
+ 0.776 |
+ 0.764 |
+ 0.770 |
+ 0.624 |
+ 0.836 |
+ 0.744 |
+ 0.729 |
+ 0.649 |
+ 0.718 |
+ 0.682 |
+ 0.621 |
+ 0.616 |
+ 0.696 |
+ 0.699 |
+ 0.673 |
+ 0.724 |
+ 0.699 |
+ 0.692 |
+ 0.652 |
+ 0.664 |
+ 0.719 |
+ 0.672 |
+ 0.689 |
+ 0.648 |
+ 0.692 |
+ 0.734 |
+ 0.743 |
+ 0.739 |
+ 0.762 |
+ 0.765 |
+ 0.655 |
+ 0.688 |
+ 0.687 |
+ 0.794 |
+ 0.793 |
+ 0.796 |
+ 0.765 |
+ 0.772 |
+ 0.769 |
+ 0.643 |
+ 0.653 |
+ 0.710 |
+ 0.674 |
+ 0.737 |
+ 0.655 |
+ 0.755 |
+ 0.652 |
+ 0.832 |
+ 0.761 |
+ 0.749 |
+ 0.713 |
+ 0.752 |
+ 0.687 |
+ 0.750 |
+ 0.797 |
+ 0.781 |
+ 0.752 |
+ 0.776 |
+ 0.808 |
+ 245 |
+ Expression |
+ GLPA_HUMAN |
+ Low |
+ Human |
+
+
+ GRB2_HUMAN_Faure_2021 |
+ 0.744 |
+ 0.810 |
+ 0.812 |
+ 0.820 |
+ 0.828 |
+ 0.832 |
+ 0.760 |
+ 0.754 |
+ 0.830 |
+ 0.801 |
+ 0.827 |
+ 0.777 |
+ 0.818 |
+ 0.752 |
+ 0.831 |
+ 0.864 |
+ 0.867 |
+ 0.812 |
+ 0.843 |
+ 0.818 |
+ 0.824 |
+ 0.813 |
+ 0.789 |
+ 0.774 |
+ 0.801 |
+ 0.804 |
+ 0.767 |
+ 0.816 |
+ 0.763 |
+ 0.807 |
+ 0.780 |
+ 0.767 |
+ 0.781 |
+ 0.817 |
+ 0.816 |
+ 0.721 |
+ 0.825 |
+ 0.825 |
+ 0.766 |
+ 0.845 |
+ 0.848 |
+ 0.831 |
+ 0.839 |
+ 0.633 |
+ 0.803 |
+ 0.817 |
+ 0.875 |
+ 0.800 |
+ 0.894 |
+ 0.787 |
+ 0.842 |
+ 0.849 |
+ 0.840 |
+ 0.852 |
+ 0.854 |
+ 0.849 |
+ 0.841 |
+ 0.850 |
+ 0.838 |
+ 0.855 |
+ 0.819 |
+ 0.845 |
+ 63366 |
+ OrganismalFitness |
+ GRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q |
+ 0.892 |
+ 0.907 |
+ 0.883 |
+ 0.885 |
+ 0.900 |
+ 0.901 |
+ 0.778 |
+ 0.871 |
+ 0.857 |
+ 0.910 |
+ 0.917 |
+ 0.836 |
+ 0.871 |
+ 0.795 |
+ 0.851 |
+ 0.905 |
+ 0.924 |
+ 0.914 |
+ 0.925 |
+ 0.823 |
+ 0.791 |
+ 0.819 |
+ 0.847 |
+ 0.858 |
+ 0.846 |
+ 0.815 |
+ 0.756 |
+ 0.825 |
+ 0.916 |
+ 0.905 |
+ 0.905 |
+ 0.878 |
+ 0.746 |
+ 0.749 |
+ 0.803 |
+ 0.889 |
+ 0.886 |
+ 0.899 |
+ 0.898 |
+ 0.890 |
+ 0.913 |
+ 0.907 |
+ 0.832 |
+ 0.813 |
+ 0.913 |
+ 0.830 |
+ 0.898 |
+ 0.924 |
+ 0.914 |
+ 0.891 |
+ 0.918 |
+ 0.921 |
+ 0.917 |
+ 0.919 |
+ 0.932 |
+ 0.921 |
+ 0.926 |
+ 0.928 |
+ 0.923 |
+ 0.927 |
+ 0.927 |
+ 0.880 |
+ 1040 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM |
+ 0.860 |
+ 0.850 |
+ 0.882 |
+ 0.890 |
+ 0.898 |
+ 0.895 |
+ 0.590 |
+ 0.776 |
+ 0.883 |
+ 0.892 |
+ 0.872 |
+ 0.654 |
+ 0.669 |
+ 0.590 |
+ 0.585 |
+ 0.853 |
+ 0.888 |
+ 0.871 |
+ 0.876 |
+ 0.868 |
+ 0.500 |
+ 0.813 |
+ 0.850 |
+ 0.842 |
+ 0.821 |
+ 0.832 |
+ 0.850 |
+ 0.855 |
+ 0.836 |
+ 0.873 |
+ 0.887 |
+ 0.852 |
+ 0.511 |
+ 0.599 |
+ 0.438 |
+ 0.666 |
+ 0.859 |
+ 0.841 |
+ 0.860 |
+ 0.883 |
+ 0.879 |
+ 0.875 |
+ 0.642 |
+ 0.623 |
+ 0.876 |
+ 0.648 |
+ 0.767 |
+ 0.845 |
+ 0.747 |
+ 0.681 |
+ 0.892 |
+ 0.884 |
+ 0.891 |
+ 0.887 |
+ 0.903 |
+ 0.901 |
+ 0.898 |
+ 0.903 |
+ 0.898 |
+ 0.902 |
+ 0.889 |
+ 0.563 |
+ 5586 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HEM3_HUMAN_Loggerenberg_2023 |
+ 0.570 |
+ 0.585 |
+ 0.585 |
+ 0.577 |
+ 0.579 |
+ 0.581 |
+ 0.468 |
+ 0.375 |
+ 0.581 |
+ 0.590 |
+ 0.549 |
+ 0.549 |
+ 0.550 |
+ 0.450 |
+ 0.549 |
+ 0.554 |
+ 0.554 |
+ 0.585 |
+ 0.575 |
+ 0.442 |
+ 0.565 |
+ 0.565 |
+ 0.587 |
+ 0.610 |
+ 0.549 |
+ 0.560 |
+ 0.560 |
+ 0.550 |
+ 0.589 |
+ 0.595 |
+ 0.601 |
+ 0.614 |
+ 0.450 |
+ 0.562 |
+ 0.594 |
+ 0.616 |
+ 0.582 |
+ 0.603 |
+ 0.611 |
+ 0.587 |
+ 0.591 |
+ 0.598 |
+ 0.535 |
+ 0.473 |
+ 0.558 |
+ 0.553 |
+ 0.558 |
+ 0.585 |
+ 0.526 |
+ 0.543 |
+ 0.561 |
+ 0.545 |
+ 0.564 |
+ 0.566 |
+ 0.565 |
+ 0.562 |
+ 0.568 |
+ 0.556 |
+ 0.560 |
+ 0.561 |
+ 0.574 |
+ 0.565 |
+ 5689 |
+ Activity |
+ HEM3_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019 |
+ 0.826 |
+ 0.882 |
+ 0.873 |
+ 0.874 |
+ 0.864 |
+ 0.861 |
+ 0.667 |
+ 0.385 |
+ 0.852 |
+ 0.863 |
+ 0.860 |
+ 0.812 |
+ 0.839 |
+ 0.549 |
+ 0.645 |
+ 0.744 |
+ 0.867 |
+ 0.880 |
+ 0.882 |
+ 0.800 |
+ 0.811 |
+ 0.857 |
+ 0.871 |
+ 0.875 |
+ 0.852 |
+ 0.867 |
+ 0.858 |
+ 0.862 |
+ 0.859 |
+ 0.865 |
+ 0.841 |
+ 0.837 |
+ 0.575 |
+ 0.844 |
+ 0.859 |
+ 0.881 |
+ 0.850 |
+ 0.842 |
+ 0.875 |
+ 0.880 |
+ 0.872 |
+ 0.882 |
+ 0.699 |
+ 0.676 |
+ 0.811 |
+ 0.596 |
+ 0.854 |
+ 0.884 |
+ 0.869 |
+ 0.825 |
+ 0.848 |
+ 0.812 |
+ 0.837 |
+ 0.838 |
+ 0.842 |
+ 0.838 |
+ 0.835 |
+ 0.842 |
+ 0.843 |
+ 0.840 |
+ 0.874 |
+ 0.574 |
+ 496137 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HMDH_HUMAN_Jiang_2019 |
+ 0.635 |
+ 0.631 |
+ 0.633 |
+ 0.629 |
+ 0.633 |
+ 0.632 |
+ 0.594 |
+ 0.690 |
+ 0.626 |
+ 0.627 |
+ 0.644 |
+ 0.644 |
+ 0.645 |
+ 0.535 |
+ 0.536 |
+ 0.581 |
+ 0.652 |
+ 0.638 |
+ 0.644 |
+ 0.630 |
+ 0.627 |
+ 0.626 |
+ 0.635 |
+ 0.637 |
+ 0.647 |
+ 0.635 |
+ 0.632 |
+ 0.638 |
+ 0.636 |
+ 0.638 |
+ 0.635 |
+ 0.629 |
+ 0.561 |
+ 0.608 |
+ 0.646 |
+ 0.644 |
+ 0.640 |
+ 0.646 |
+ 0.644 |
+ 0.632 |
+ 0.634 |
+ 0.633 |
+ 0.544 |
+ 0.522 |
+ 0.668 |
+ 0.634 |
+ 0.624 |
+ 0.655 |
+ 0.505 |
+ 0.571 |
+ 0.647 |
+ 0.642 |
+ 0.652 |
+ 0.654 |
+ 0.644 |
+ 0.652 |
+ 0.646 |
+ 0.651 |
+ 0.640 |
+ 0.643 |
+ 0.663 |
+ 0.626 |
+ 16853 |
+ OrganismalFitness |
+ HMDH_HUMAN |
+ Low |
+ Human |
+
+
+ HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2 |
+ 0.945 |
+ 0.962 |
+ 0.959 |
+ 0.961 |
+ 0.962 |
+ 0.963 |
+ 0.836 |
+ 0.958 |
+ 0.962 |
+ 0.961 |
+ 0.958 |
+ 0.947 |
+ 0.962 |
+ 0.766 |
+ 0.800 |
+ 0.876 |
+ 0.932 |
+ 0.942 |
+ 0.948 |
+ 0.873 |
+ 0.963 |
+ 0.962 |
+ 0.960 |
+ 0.962 |
+ 0.960 |
+ 0.960 |
+ 0.963 |
+ 0.963 |
+ 0.963 |
+ 0.963 |
+ 0.964 |
+ 0.962 |
+ 0.943 |
+ 0.960 |
+ 0.960 |
+ 0.961 |
+ 0.958 |
+ 0.958 |
+ 0.958 |
+ 0.963 |
+ 0.962 |
+ 0.963 |
+ 0.760 |
+ 0.770 |
+ 0.962 |
+ 0.953 |
+ 0.778 |
+ 0.956 |
+ 0.783 |
+ 0.884 |
+ 0.937 |
+ 0.940 |
+ 0.930 |
+ 0.930 |
+ 0.935 |
+ 0.931 |
+ 0.935 |
+ 0.927 |
+ 0.935 |
+ 0.933 |
+ 0.958 |
+ 0.938 |
+ 2252 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Flynn_2019 |
+ 0.793 |
+ 0.812 |
+ 0.816 |
+ 0.816 |
+ 0.818 |
+ 0.816 |
+ 0.728 |
+ 0.802 |
+ 0.822 |
+ 0.813 |
+ 0.805 |
+ 0.813 |
+ 0.816 |
+ 0.700 |
+ 0.722 |
+ 0.732 |
+ 0.777 |
+ 0.794 |
+ 0.795 |
+ 0.812 |
+ 0.794 |
+ 0.815 |
+ 0.817 |
+ 0.820 |
+ 0.803 |
+ 0.823 |
+ 0.825 |
+ 0.823 |
+ 0.824 |
+ 0.821 |
+ 0.820 |
+ 0.815 |
+ 0.763 |
+ 0.811 |
+ 0.819 |
+ 0.819 |
+ 0.808 |
+ 0.814 |
+ 0.815 |
+ 0.815 |
+ 0.817 |
+ 0.818 |
+ 0.730 |
+ 0.681 |
+ 0.801 |
+ 0.771 |
+ 0.745 |
+ 0.812 |
+ 0.701 |
+ 0.708 |
+ 0.767 |
+ 0.771 |
+ 0.777 |
+ 0.783 |
+ 0.771 |
+ 0.782 |
+ 0.772 |
+ 0.771 |
+ 0.766 |
+ 0.776 |
+ 0.801 |
+ 0.789 |
+ 13294 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Mishra_2016 |
+ 0.933 |
+ 0.943 |
+ 0.942 |
+ 0.943 |
+ 0.944 |
+ 0.935 |
+ 0.882 |
+ 0.896 |
+ 0.941 |
+ 0.945 |
+ 0.942 |
+ 0.943 |
+ 0.942 |
+ 0.858 |
+ 0.890 |
+ 0.903 |
+ 0.928 |
+ 0.929 |
+ 0.926 |
+ 0.934 |
+ 0.941 |
+ 0.939 |
+ 0.942 |
+ 0.941 |
+ 0.944 |
+ 0.942 |
+ 0.944 |
+ 0.943 |
+ 0.944 |
+ 0.944 |
+ 0.942 |
+ 0.938 |
+ 0.892 |
+ 0.940 |
+ 0.941 |
+ 0.941 |
+ 0.939 |
+ 0.940 |
+ 0.940 |
+ 0.942 |
+ 0.943 |
+ 0.940 |
+ 0.939 |
+ 0.794 |
+ 0.938 |
+ 0.942 |
+ 0.862 |
+ 0.938 |
+ 0.855 |
+ 0.840 |
+ 0.920 |
+ 0.923 |
+ 0.917 |
+ 0.919 |
+ 0.926 |
+ 0.923 |
+ 0.923 |
+ 0.926 |
+ 0.917 |
+ 0.924 |
+ 0.931 |
+ 0.928 |
+ 4323 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HXK4_HUMAN_Gersing_2022_activity |
+ 0.616 |
+ 0.619 |
+ 0.603 |
+ 0.602 |
+ 0.609 |
+ 0.612 |
+ 0.515 |
+ 0.584 |
+ 0.612 |
+ 0.613 |
+ 0.600 |
+ 0.595 |
+ 0.603 |
+ 0.527 |
+ 0.548 |
+ 0.620 |
+ 0.632 |
+ 0.612 |
+ 0.604 |
+ 0.512 |
+ 0.596 |
+ 0.602 |
+ 0.607 |
+ 0.600 |
+ 0.612 |
+ 0.614 |
+ 0.616 |
+ 0.610 |
+ 0.589 |
+ 0.593 |
+ 0.587 |
+ 0.567 |
+ 0.503 |
+ 0.609 |
+ 0.607 |
+ 0.602 |
+ 0.617 |
+ 0.603 |
+ 0.605 |
+ 0.615 |
+ 0.606 |
+ 0.611 |
+ 0.568 |
+ 0.502 |
+ 0.616 |
+ 0.609 |
+ 0.594 |
+ 0.621 |
+ 0.593 |
+ 0.556 |
+ 0.608 |
+ 0.615 |
+ 0.620 |
+ 0.619 |
+ 0.618 |
+ 0.631 |
+ 0.632 |
+ 0.622 |
+ 0.616 |
+ 0.621 |
+ 0.632 |
+ 0.599 |
+ 8570 |
+ OrganismalFitness |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ HXK4_HUMAN_Gersing_2023_abundance |
+ 0.853 |
+ 0.854 |
+ 0.851 |
+ 0.854 |
+ 0.851 |
+ 0.850 |
+ 0.761 |
+ 0.848 |
+ 0.857 |
+ 0.861 |
+ 0.851 |
+ 0.854 |
+ 0.857 |
+ 0.790 |
+ 0.813 |
+ 0.853 |
+ 0.859 |
+ 0.855 |
+ 0.859 |
+ 0.863 |
+ 0.855 |
+ 0.850 |
+ 0.860 |
+ 0.854 |
+ 0.856 |
+ 0.854 |
+ 0.859 |
+ 0.857 |
+ 0.851 |
+ 0.860 |
+ 0.855 |
+ 0.851 |
+ 0.805 |
+ 0.853 |
+ 0.852 |
+ 0.857 |
+ 0.861 |
+ 0.855 |
+ 0.859 |
+ 0.852 |
+ 0.849 |
+ 0.849 |
+ 0.821 |
+ 0.788 |
+ 0.848 |
+ 0.860 |
+ 0.860 |
+ 0.849 |
+ 0.855 |
+ 0.826 |
+ 0.854 |
+ 0.858 |
+ 0.858 |
+ 0.852 |
+ 0.859 |
+ 0.859 |
+ 0.862 |
+ 0.855 |
+ 0.857 |
+ 0.858 |
+ 0.861 |
+ 0.849 |
+ 8396 |
+ Expression |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ I6TAH8_I68A0_Doud_2015 |
+ 0.772 |
+ 0.763 |
+ 0.754 |
+ 0.753 |
+ 0.776 |
+ 0.776 |
+ 0.619 |
+ 0.744 |
+ 0.756 |
+ 0.771 |
+ 0.617 |
+ 0.612 |
+ 0.613 |
+ 0.611 |
+ 0.608 |
+ 0.613 |
+ 0.625 |
+ 0.617 |
+ 0.645 |
+ 0.731 |
+ 0.727 |
+ 0.754 |
+ 0.761 |
+ 0.771 |
+ 0.615 |
+ 0.626 |
+ 0.635 |
+ 0.615 |
+ 0.755 |
+ 0.785 |
+ 0.718 |
+ 0.713 |
+ 0.666 |
+ 0.759 |
+ 0.755 |
+ 0.754 |
+ 0.774 |
+ 0.769 |
+ 0.768 |
+ 0.779 |
+ 0.776 |
+ 0.772 |
+ 0.608 |
+ 0.613 |
+ 0.618 |
+ 0.616 |
+ 0.677 |
+ 0.679 |
+ 0.674 |
+ 0.661 |
+ 0.609 |
+ 0.643 |
+ 0.646 |
+ 0.635 |
+ 0.627 |
+ 0.615 |
+ 0.618 |
+ 0.615 |
+ 0.606 |
+ 0.624 |
+ 0.645 |
+ 0.625 |
+ 9462 |
+ OrganismalFitness |
+ I6TAH8_I68A0 |
+ Medium |
+ Virus |
+
+
+ IF1_ECOLI_Kelsic_2016 |
+ 0.907 |
+ 0.943 |
+ 0.934 |
+ 0.932 |
+ 0.934 |
+ 0.935 |
+ 0.862 |
+ 0.907 |
+ 0.892 |
+ 0.888 |
+ 0.944 |
+ 0.936 |
+ 0.937 |
+ 0.875 |
+ 0.936 |
+ 0.942 |
+ 0.940 |
+ 0.932 |
+ 0.939 |
+ 0.938 |
+ 0.932 |
+ 0.942 |
+ 0.941 |
+ 0.942 |
+ 0.942 |
+ 0.947 |
+ 0.947 |
+ 0.943 |
+ 0.941 |
+ 0.886 |
+ 0.931 |
+ 0.929 |
+ 0.857 |
+ 0.935 |
+ 0.939 |
+ 0.949 |
+ 0.934 |
+ 0.937 |
+ 0.945 |
+ 0.936 |
+ 0.937 |
+ 0.937 |
+ 0.919 |
+ 0.868 |
+ 0.945 |
+ 0.939 |
+ 0.873 |
+ 0.931 |
+ 0.939 |
+ 0.911 |
+ 0.919 |
+ 0.938 |
+ 0.946 |
+ 0.939 |
+ 0.945 |
+ 0.943 |
+ 0.933 |
+ 0.944 |
+ 0.946 |
+ 0.946 |
+ 0.938 |
+ 0.925 |
+ 1367 |
+ OrganismalFitness |
+ IF1_ECOLI |
+ High |
+ Prokaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33 |
+ 0.788 |
+ 0.811 |
+ 0.827 |
+ 0.821 |
+ 0.820 |
+ 0.819 |
+ 0.751 |
+ 0.815 |
+ 0.805 |
+ 0.809 |
+ 0.831 |
+ 0.809 |
+ 0.804 |
+ 0.781 |
+ 0.782 |
+ 0.796 |
+ 0.803 |
+ 0.832 |
+ 0.825 |
+ 0.812 |
+ 0.836 |
+ 0.819 |
+ 0.830 |
+ 0.835 |
+ 0.868 |
+ 0.845 |
+ 0.859 |
+ 0.843 |
+ 0.841 |
+ 0.825 |
+ 0.822 |
+ 0.808 |
+ 0.812 |
+ 0.742 |
+ 0.787 |
+ 0.857 |
+ 0.790 |
+ 0.834 |
+ 0.850 |
+ 0.800 |
+ 0.803 |
+ 0.820 |
+ 0.788 |
+ 0.767 |
+ 0.829 |
+ 0.799 |
+ 0.811 |
+ 0.833 |
+ 0.812 |
+ 0.806 |
+ 0.838 |
+ 0.809 |
+ 0.824 |
+ 0.838 |
+ 0.840 |
+ 0.823 |
+ 0.832 |
+ 0.827 |
+ 0.838 |
+ 0.836 |
+ 0.843 |
+ 0.795 |
+ 1329 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ ISDH_STAAW_Tsuboyama_2023_2LHR |
+ 0.765 |
+ 0.722 |
+ 0.784 |
+ 0.781 |
+ 0.785 |
+ 0.786 |
+ 0.773 |
+ 0.758 |
+ 0.831 |
+ 0.799 |
+ 0.800 |
+ 0.811 |
+ 0.821 |
+ 0.828 |
+ 0.814 |
+ 0.876 |
+ 0.838 |
+ 0.842 |
+ 0.854 |
+ 0.770 |
+ 0.786 |
+ 0.771 |
+ 0.797 |
+ 0.811 |
+ 0.819 |
+ 0.823 |
+ 0.845 |
+ 0.781 |
+ 0.825 |
+ 0.793 |
+ 0.818 |
+ 0.797 |
+ 0.700 |
+ 0.765 |
+ 0.751 |
+ 0.793 |
+ 0.765 |
+ 0.758 |
+ 0.779 |
+ 0.781 |
+ 0.771 |
+ 0.785 |
+ 0.811 |
+ 0.783 |
+ 0.821 |
+ 0.809 |
+ 0.908 |
+ 0.894 |
+ 0.887 |
+ 0.865 |
+ 0.840 |
+ 0.837 |
+ 0.858 |
+ 0.841 |
+ 0.831 |
+ 0.845 |
+ 0.839 |
+ 0.844 |
+ 0.821 |
+ 0.847 |
+ 0.882 |
+ 0.833 |
+ 1944 |
+ Stability |
+ ISDH_STAAW |
+ High |
+ Prokaryote |
+
+
+ KCNE1_HUMAN_Muhammad_2023_expression |
+ 0.590 |
+ 0.575 |
+ 0.569 |
+ 0.568 |
+ 0.588 |
+ 0.587 |
+ 0.587 |
+ 0.584 |
+ 0.566 |
+ 0.570 |
+ 0.640 |
+ 0.645 |
+ 0.643 |
+ 0.600 |
+ 0.595 |
+ 0.588 |
+ 0.606 |
+ 0.572 |
+ 0.570 |
+ 0.572 |
+ 0.641 |
+ 0.628 |
+ 0.579 |
+ 0.595 |
+ 0.634 |
+ 0.638 |
+ 0.603 |
+ 0.569 |
+ 0.571 |
+ 0.571 |
+ 0.604 |
+ 0.587 |
+ 0.380 |
+ 0.584 |
+ 0.632 |
+ 0.604 |
+ 0.595 |
+ 0.605 |
+ 0.596 |
+ 0.588 |
+ 0.586 |
+ 0.584 |
+ 0.610 |
+ 0.590 |
+ 0.577 |
+ 0.614 |
+ 0.604 |
+ 0.582 |
+ 0.602 |
+ 0.604 |
+ 0.609 |
+ 0.595 |
+ 0.602 |
+ 0.614 |
+ 0.597 |
+ 0.601 |
+ 0.595 |
+ 0.611 |
+ 0.601 |
+ 0.608 |
+ 0.623 |
+ 0.637 |
+ 2339 |
+ Expression |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNE1_HUMAN_Muhammad_2023_function |
+ 0.836 |
+ 0.841 |
+ 0.841 |
+ 0.847 |
+ 0.844 |
+ 0.847 |
+ 0.709 |
+ 0.821 |
+ 0.837 |
+ 0.847 |
+ 0.659 |
+ 0.723 |
+ 0.797 |
+ 0.665 |
+ 0.655 |
+ 0.842 |
+ 0.854 |
+ 0.812 |
+ 0.808 |
+ 0.804 |
+ 0.625 |
+ 0.607 |
+ 0.802 |
+ 0.822 |
+ 0.702 |
+ 0.844 |
+ 0.836 |
+ 0.829 |
+ 0.840 |
+ 0.849 |
+ 0.841 |
+ 0.817 |
+ 0.367 |
+ 0.708 |
+ 0.784 |
+ 0.857 |
+ 0.835 |
+ 0.838 |
+ 0.854 |
+ 0.851 |
+ 0.851 |
+ 0.853 |
+ 0.670 |
+ 0.657 |
+ 0.843 |
+ 0.665 |
+ 0.743 |
+ 0.831 |
+ 0.712 |
+ 0.670 |
+ 0.831 |
+ 0.856 |
+ 0.842 |
+ 0.843 |
+ 0.854 |
+ 0.840 |
+ 0.840 |
+ 0.844 |
+ 0.848 |
+ 0.851 |
+ 0.849 |
+ 0.646 |
+ 2315 |
+ Activity |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNH2_HUMAN_Kozek_2020 |
+ 0.639 |
+ 0.641 |
+ 0.639 |
+ 0.639 |
+ 0.640 |
+ 0.640 |
+ 0.625 |
+ 0.521 |
+ 0.614 |
+ 0.633 |
+ 0.678 |
+ 0.549 |
+ 0.537 |
+ 0.556 |
+ 0.525 |
+ 0.582 |
+ 0.520 |
+ 0.558 |
+ 0.606 |
+ 0.697 |
+ 0.652 |
+ 0.648 |
+ 0.698 |
+ 0.679 |
+ 0.683 |
+ 0.693 |
+ 0.630 |
+ 0.661 |
+ 0.643 |
+ 0.678 |
+ 0.601 |
+ 0.641 |
+ 0.593 |
+ 0.633 |
+ 0.644 |
+ 0.661 |
+ 0.628 |
+ 0.647 |
+ 0.680 |
+ 0.630 |
+ 0.648 |
+ 0.646 |
+ 0.709 |
+ 0.513 |
+ 0.680 |
+ 0.646 |
+ 0.645 |
+ 0.461 |
+ 0.270 |
+ 0.520 |
+ 0.702 |
+ 0.682 |
+ 0.686 |
+ 0.717 |
+ 0.697 |
+ 0.718 |
+ 0.676 |
+ 0.678 |
+ 0.703 |
+ 0.702 |
+ 0.698 |
+ 0.660 |
+ 200 |
+ Activity |
+ KCNH2_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_function |
+ 0.781 |
+ 0.797 |
+ 0.795 |
+ 0.797 |
+ 0.799 |
+ 0.799 |
+ 0.722 |
+ 0.782 |
+ 0.792 |
+ 0.792 |
+ 0.797 |
+ 0.796 |
+ 0.797 |
+ 0.724 |
+ 0.773 |
+ 0.799 |
+ 0.799 |
+ 0.797 |
+ 0.795 |
+ 0.787 |
+ 0.802 |
+ 0.795 |
+ 0.792 |
+ 0.790 |
+ 0.799 |
+ 0.796 |
+ 0.795 |
+ 0.795 |
+ 0.794 |
+ 0.797 |
+ 0.791 |
+ 0.784 |
+ 0.731 |
+ 0.791 |
+ 0.793 |
+ 0.792 |
+ 0.797 |
+ 0.795 |
+ 0.793 |
+ 0.797 |
+ 0.796 |
+ 0.795 |
+ 0.759 |
+ 0.723 |
+ 0.792 |
+ 0.788 |
+ 0.763 |
+ 0.790 |
+ 0.770 |
+ 0.737 |
+ 0.795 |
+ 0.795 |
+ 0.797 |
+ 0.798 |
+ 0.795 |
+ 0.798 |
+ 0.796 |
+ 0.800 |
+ 0.799 |
+ 0.800 |
+ 0.797 |
+ 0.777 |
+ 6963 |
+ Activity |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_surface |
+ 0.830 |
+ 0.820 |
+ 0.823 |
+ 0.822 |
+ 0.823 |
+ 0.824 |
+ 0.715 |
+ 0.803 |
+ 0.821 |
+ 0.821 |
+ 0.829 |
+ 0.822 |
+ 0.824 |
+ 0.746 |
+ 0.834 |
+ 0.842 |
+ 0.840 |
+ 0.838 |
+ 0.841 |
+ 0.812 |
+ 0.826 |
+ 0.804 |
+ 0.806 |
+ 0.818 |
+ 0.824 |
+ 0.815 |
+ 0.815 |
+ 0.821 |
+ 0.812 |
+ 0.821 |
+ 0.824 |
+ 0.817 |
+ 0.763 |
+ 0.821 |
+ 0.792 |
+ 0.812 |
+ 0.823 |
+ 0.812 |
+ 0.813 |
+ 0.823 |
+ 0.818 |
+ 0.819 |
+ 0.815 |
+ 0.714 |
+ 0.821 |
+ 0.831 |
+ 0.817 |
+ 0.813 |
+ 0.810 |
+ 0.796 |
+ 0.832 |
+ 0.823 |
+ 0.828 |
+ 0.827 |
+ 0.832 |
+ 0.826 |
+ 0.831 |
+ 0.827 |
+ 0.828 |
+ 0.829 |
+ 0.832 |
+ 0.831 |
+ 6917 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KKA2_KLEPN_Melnikov_2014 |
+ 0.471 |
+ 0.591 |
+ 0.585 |
+ 0.612 |
+ 0.600 |
+ 0.610 |
+ 0.366 |
+ 0.523 |
+ 0.597 |
+ 0.627 |
+ 0.548 |
+ 0.554 |
+ 0.587 |
+ 0.389 |
+ 0.426 |
+ 0.506 |
+ 0.604 |
+ 0.637 |
+ 0.641 |
+ 0.570 |
+ 0.476 |
+ 0.520 |
+ 0.598 |
+ 0.578 |
+ 0.484 |
+ 0.586 |
+ 0.614 |
+ 0.557 |
+ 0.634 |
+ 0.604 |
+ 0.570 |
+ 0.548 |
+ 0.300 |
+ 0.406 |
+ 0.515 |
+ 0.566 |
+ 0.549 |
+ 0.572 |
+ 0.600 |
+ 0.621 |
+ 0.623 |
+ 0.630 |
+ 0.423 |
+ 0.334 |
+ 0.553 |
+ 0.466 |
+ 0.561 |
+ 0.606 |
+ 0.606 |
+ 0.509 |
+ 0.598 |
+ 0.597 |
+ 0.614 |
+ 0.603 |
+ 0.608 |
+ 0.607 |
+ 0.602 |
+ 0.583 |
+ 0.597 |
+ 0.613 |
+ 0.623 |
+ 0.506 |
+ 4960 |
+ OrganismalFitness |
+ KKA2_KLEPN |
+ High |
+ Prokaryote |
+
+
+ LGK_LIPST_Klesmith_2015 |
+ 0.853 |
+ 0.937 |
+ 0.935 |
+ 0.936 |
+ 0.937 |
+ 0.937 |
+ 0.794 |
+ 0.896 |
+ 0.881 |
+ 0.928 |
+ 0.859 |
+ 0.916 |
+ 0.926 |
+ 0.802 |
+ 0.826 |
+ 0.844 |
+ 0.903 |
+ 0.929 |
+ 0.931 |
+ 0.901 |
+ 0.826 |
+ 0.868 |
+ 0.887 |
+ 0.908 |
+ 0.847 |
+ 0.908 |
+ 0.933 |
+ 0.894 |
+ 0.937 |
+ 0.945 |
+ 0.935 |
+ 0.921 |
+ 0.802 |
+ 0.820 |
+ 0.865 |
+ 0.948 |
+ 0.847 |
+ 0.864 |
+ 0.938 |
+ 0.933 |
+ 0.936 |
+ 0.944 |
+ 0.794 |
+ 0.783 |
+ 0.876 |
+ 0.832 |
+ 0.853 |
+ 0.907 |
+ 0.901 |
+ 0.832 |
+ 0.907 |
+ 0.917 |
+ 0.909 |
+ 0.918 |
+ 0.911 |
+ 0.913 |
+ 0.912 |
+ 0.910 |
+ 0.916 |
+ 0.918 |
+ 0.876 |
+ 0.841 |
+ 7890 |
+ Activity |
+ LGK_LIPST |
+ Medium |
+ Eukaryote |
+
+
+ LYAM1_HUMAN_Elazar_2016 |
+ 0.679 |
+ 0.693 |
+ 0.650 |
+ 0.608 |
+ 0.666 |
+ 0.678 |
+ 0.669 |
+ 0.682 |
+ 0.760 |
+ 0.754 |
+ 0.732 |
+ 0.604 |
+ 0.650 |
+ 0.687 |
+ 0.675 |
+ 0.712 |
+ 0.636 |
+ 0.649 |
+ 0.755 |
+ 0.747 |
+ 0.712 |
+ 0.721 |
+ 0.706 |
+ 0.643 |
+ 0.685 |
+ 0.630 |
+ 0.663 |
+ 0.755 |
+ 0.666 |
+ 0.709 |
+ 0.663 |
+ 0.616 |
+ 0.614 |
+ 0.677 |
+ 0.712 |
+ 0.633 |
+ 0.668 |
+ 0.711 |
+ 0.639 |
+ 0.679 |
+ 0.704 |
+ 0.670 |
+ 0.658 |
+ 0.724 |
+ 0.632 |
+ 0.673 |
+ 0.750 |
+ 0.673 |
+ 0.684 |
+ 0.652 |
+ 0.617 |
+ 0.702 |
+ 0.775 |
+ 0.627 |
+ 0.657 |
+ 0.676 |
+ 0.695 |
+ 0.716 |
+ 0.535 |
+ 0.659 |
+ 0.720 |
+ 0.758 |
+ 359 |
+ Expression |
+ LYAM1_HUMAN |
+ Medium |
+ Human |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V |
+ 0.881 |
+ 0.880 |
+ 0.876 |
+ 0.877 |
+ 0.868 |
+ 0.867 |
+ 0.848 |
+ 0.846 |
+ 0.860 |
+ 0.859 |
+ 0.857 |
+ 0.796 |
+ 0.880 |
+ 0.858 |
+ 0.853 |
+ 0.851 |
+ 0.867 |
+ 0.896 |
+ 0.774 |
+ 0.844 |
+ 0.775 |
+ 0.778 |
+ 0.836 |
+ 0.856 |
+ 0.840 |
+ 0.846 |
+ 0.849 |
+ 0.849 |
+ 0.843 |
+ 0.856 |
+ 0.850 |
+ 0.858 |
+ 0.749 |
+ 0.831 |
+ 0.857 |
+ 0.843 |
+ 0.879 |
+ 0.889 |
+ 0.888 |
+ 0.875 |
+ 0.886 |
+ 0.887 |
+ 0.848 |
+ 0.705 |
+ 0.871 |
+ 0.814 |
+ 0.900 |
+ 0.825 |
+ 0.908 |
+ 0.908 |
+ 0.904 |
+ 0.897 |
+ 0.912 |
+ 0.914 |
+ 0.895 |
+ 0.917 |
+ 0.904 |
+ 0.908 |
+ 0.903 |
+ 0.915 |
+ 0.916 |
+ 0.899 |
+ 1429 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV |
+ 0.826 |
+ 0.874 |
+ 0.876 |
+ 0.875 |
+ 0.879 |
+ 0.878 |
+ 0.369 |
+ 0.795 |
+ 0.829 |
+ 0.871 |
+ 0.853 |
+ 0.739 |
+ 0.751 |
+ 0.306 |
+ 0.273 |
+ 0.817 |
+ 0.827 |
+ 0.798 |
+ 0.891 |
+ 0.850 |
+ 0.289 |
+ 0.436 |
+ 0.792 |
+ 0.768 |
+ 0.471 |
+ 0.860 |
+ 0.819 |
+ 0.800 |
+ 0.858 |
+ 0.846 |
+ 0.863 |
+ 0.825 |
+ 0.790 |
+ 0.397 |
+ 0.288 |
+ 0.340 |
+ 0.808 |
+ 0.842 |
+ 0.846 |
+ 0.874 |
+ 0.875 |
+ 0.875 |
+ 0.477 |
+ 0.365 |
+ 0.868 |
+ 0.786 |
+ 0.796 |
+ 0.879 |
+ 0.857 |
+ 0.803 |
+ 0.847 |
+ 0.865 |
+ 0.851 |
+ 0.834 |
+ 0.827 |
+ 0.830 |
+ 0.835 |
+ 0.830 |
+ 0.810 |
+ 0.837 |
+ 0.905 |
+ 0.799 |
+ 2116 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MET_HUMAN_Estevam_2023 |
+ 0.859 |
+ 0.914 |
+ 0.897 |
+ 0.912 |
+ 0.917 |
+ 0.918 |
+ 0.867 |
+ 0.894 |
+ 0.855 |
+ 0.874 |
+ 0.923 |
+ 0.879 |
+ 0.879 |
+ 0.829 |
+ 0.869 |
+ 0.888 |
+ 0.906 |
+ 0.910 |
+ 0.920 |
+ 0.899 |
+ 0.910 |
+ 0.926 |
+ 0.913 |
+ 0.914 |
+ 0.927 |
+ 0.923 |
+ 0.929 |
+ 0.926 |
+ 0.927 |
+ 0.922 |
+ 0.917 |
+ 0.906 |
+ 0.821 |
+ 0.856 |
+ 0.908 |
+ 0.926 |
+ 0.875 |
+ 0.904 |
+ 0.917 |
+ 0.910 |
+ 0.912 |
+ 0.903 |
+ 0.879 |
+ 0.802 |
+ 0.918 |
+ 0.903 |
+ 0.835 |
+ 0.912 |
+ 0.876 |
+ 0.812 |
+ 0.881 |
+ 0.896 |
+ 0.897 |
+ 0.901 |
+ 0.900 |
+ 0.898 |
+ 0.900 |
+ 0.899 |
+ 0.897 |
+ 0.896 |
+ 0.884 |
+ 0.875 |
+ 5393 |
+ Activity |
+ MET_HUMAN |
+ Medium |
+ Human |
+
+
+ MK01_HUMAN_Brenan_2016 |
+ 0.759 |
+ 0.776 |
+ 0.777 |
+ 0.780 |
+ 0.776 |
+ 0.773 |
+ 0.755 |
+ 0.741 |
+ 0.762 |
+ 0.759 |
+ 0.741 |
+ 0.759 |
+ 0.754 |
+ 0.759 |
+ 0.752 |
+ 0.758 |
+ 0.753 |
+ 0.759 |
+ 0.756 |
+ 0.758 |
+ 0.761 |
+ 0.745 |
+ 0.741 |
+ 0.736 |
+ 0.751 |
+ 0.744 |
+ 0.747 |
+ 0.751 |
+ 0.739 |
+ 0.762 |
+ 0.732 |
+ 0.741 |
+ 0.765 |
+ 0.754 |
+ 0.745 |
+ 0.733 |
+ 0.750 |
+ 0.742 |
+ 0.733 |
+ 0.773 |
+ 0.773 |
+ 0.767 |
+ 0.762 |
+ 0.746 |
+ 0.758 |
+ 0.769 |
+ 0.731 |
+ 0.747 |
+ 0.738 |
+ 0.745 |
+ 0.766 |
+ 0.767 |
+ 0.767 |
+ 0.763 |
+ 0.764 |
+ 0.761 |
+ 0.758 |
+ 0.756 |
+ 0.761 |
+ 0.764 |
+ 0.764 |
+ 0.754 |
+ 6809 |
+ OrganismalFitness |
+ MK01_HUMAN |
+ Medium |
+ Human |
+
+
+ MLAC_ECOLI_MacRae_2023 |
+ 0.847 |
+ 0.900 |
+ 0.902 |
+ 0.902 |
+ 0.904 |
+ 0.902 |
+ 0.752 |
+ 0.887 |
+ 0.891 |
+ 0.899 |
+ 0.883 |
+ 0.894 |
+ 0.902 |
+ 0.799 |
+ 0.833 |
+ 0.835 |
+ 0.864 |
+ 0.884 |
+ 0.891 |
+ 0.906 |
+ 0.804 |
+ 0.901 |
+ 0.903 |
+ 0.903 |
+ 0.848 |
+ 0.887 |
+ 0.902 |
+ 0.885 |
+ 0.906 |
+ 0.911 |
+ 0.903 |
+ 0.904 |
+ 0.748 |
+ 0.779 |
+ 0.889 |
+ 0.894 |
+ 0.852 |
+ 0.887 |
+ 0.890 |
+ 0.898 |
+ 0.907 |
+ 0.905 |
+ 0.810 |
+ 0.793 |
+ 0.863 |
+ 0.837 |
+ 0.704 |
+ 0.839 |
+ 0.797 |
+ 0.765 |
+ 0.862 |
+ 0.856 |
+ 0.846 |
+ 0.855 |
+ 0.868 |
+ 0.858 |
+ 0.852 |
+ 0.854 |
+ 0.862 |
+ 0.863 |
+ 0.840 |
+ 0.815 |
+ 4007 |
+ OrganismalFitness |
+ MLAC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ MSH2_HUMAN_Jia_2020 |
+ 0.817 |
+ 0.841 |
+ 0.831 |
+ 0.831 |
+ 0.831 |
+ 0.833 |
+ 0.784 |
+ 0.814 |
+ 0.837 |
+ 0.836 |
+ 0.832 |
+ 0.834 |
+ 0.837 |
+ 0.780 |
+ 0.818 |
+ 0.833 |
+ 0.831 |
+ 0.838 |
+ 0.825 |
+ 0.829 |
+ 0.804 |
+ 0.829 |
+ 0.831 |
+ 0.832 |
+ 0.814 |
+ 0.835 |
+ 0.833 |
+ 0.819 |
+ 0.837 |
+ 0.830 |
+ 0.828 |
+ 0.821 |
+ 0.797 |
+ 0.808 |
+ 0.840 |
+ 0.831 |
+ 0.823 |
+ 0.839 |
+ 0.832 |
+ 0.836 |
+ 0.841 |
+ 0.838 |
+ 0.799 |
+ 0.747 |
+ 0.839 |
+ 0.815 |
+ 0.809 |
+ 0.835 |
+ 0.736 |
+ 0.786 |
+ 0.828 |
+ 0.826 |
+ 0.829 |
+ 0.826 |
+ 0.829 |
+ 0.829 |
+ 0.832 |
+ 0.830 |
+ 0.829 |
+ 0.830 |
+ 0.834 |
+ 0.820 |
+ 16749 |
+ OrganismalFitness |
+ MSH2_HUMAN |
+ Medium |
+ Human |
+
+
+ MTH3_HAEAE_RockahShmuel_2015 |
+ 0.529 |
+ 0.780 |
+ 0.784 |
+ 0.792 |
+ 0.789 |
+ 0.784 |
+ 0.455 |
+ 0.766 |
+ 0.730 |
+ 0.736 |
+ 0.621 |
+ 0.730 |
+ 0.756 |
+ 0.389 |
+ 0.484 |
+ 0.519 |
+ 0.588 |
+ 0.612 |
+ 0.658 |
+ 0.718 |
+ 0.505 |
+ 0.537 |
+ 0.656 |
+ 0.730 |
+ 0.558 |
+ 0.740 |
+ 0.776 |
+ 0.715 |
+ 0.764 |
+ 0.771 |
+ 0.785 |
+ 0.765 |
+ 0.547 |
+ 0.529 |
+ 0.566 |
+ 0.747 |
+ 0.523 |
+ 0.566 |
+ 0.725 |
+ 0.778 |
+ 0.776 |
+ 0.782 |
+ 0.464 |
+ 0.288 |
+ 0.523 |
+ 0.461 |
+ 0.564 |
+ 0.688 |
+ 0.704 |
+ 0.550 |
+ 0.621 |
+ 0.645 |
+ 0.632 |
+ 0.601 |
+ 0.594 |
+ 0.644 |
+ 0.633 |
+ 0.637 |
+ 0.608 |
+ 0.630 |
+ 0.664 |
+ 0.538 |
+ 1777 |
+ OrganismalFitness |
+ MTH3_HAEAE |
+ Medium |
+ Prokaryote |
+
+
+ MTHR_HUMAN_Weile_2021 |
+ 0.790 |
+ 0.798 |
+ 0.790 |
+ 0.798 |
+ 0.787 |
+ 0.788 |
+ 0.677 |
+ 0.780 |
+ 0.806 |
+ 0.805 |
+ 0.807 |
+ 0.789 |
+ 0.788 |
+ 0.711 |
+ 0.751 |
+ 0.795 |
+ 0.800 |
+ 0.797 |
+ 0.808 |
+ 0.769 |
+ 0.781 |
+ 0.786 |
+ 0.789 |
+ 0.809 |
+ 0.785 |
+ 0.783 |
+ 0.788 |
+ 0.777 |
+ 0.817 |
+ 0.794 |
+ 0.791 |
+ 0.782 |
+ 0.622 |
+ 0.791 |
+ 0.792 |
+ 0.795 |
+ 0.792 |
+ 0.793 |
+ 0.798 |
+ 0.798 |
+ 0.798 |
+ 0.801 |
+ 0.735 |
+ 0.699 |
+ 0.805 |
+ 0.756 |
+ 0.746 |
+ 0.793 |
+ 0.801 |
+ 0.725 |
+ 0.796 |
+ 0.800 |
+ 0.795 |
+ 0.801 |
+ 0.796 |
+ 0.796 |
+ 0.800 |
+ 0.796 |
+ 0.801 |
+ 0.801 |
+ 0.796 |
+ 0.766 |
+ 12464 |
+ OrganismalFitness |
+ MTHR_HUMAN |
+ Low |
+ Human |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT |
+ 0.595 |
+ 0.686 |
+ 0.682 |
+ 0.669 |
+ 0.690 |
+ 0.695 |
+ 0.482 |
+ 0.482 |
+ 0.703 |
+ 0.689 |
+ 0.750 |
+ 0.724 |
+ 0.723 |
+ 0.569 |
+ 0.699 |
+ 0.786 |
+ 0.836 |
+ 0.750 |
+ 0.648 |
+ 0.685 |
+ 0.558 |
+ 0.515 |
+ 0.576 |
+ 0.598 |
+ 0.629 |
+ 0.686 |
+ 0.729 |
+ 0.704 |
+ 0.751 |
+ 0.631 |
+ 0.674 |
+ 0.630 |
+ 0.600 |
+ 0.642 |
+ 0.587 |
+ 0.552 |
+ 0.695 |
+ 0.610 |
+ 0.574 |
+ 0.703 |
+ 0.691 |
+ 0.685 |
+ 0.627 |
+ 0.517 |
+ 0.707 |
+ 0.669 |
+ 0.695 |
+ 0.731 |
+ 0.779 |
+ 0.706 |
+ 0.752 |
+ 0.738 |
+ 0.741 |
+ 0.746 |
+ 0.746 |
+ 0.752 |
+ 0.745 |
+ 0.764 |
+ 0.757 |
+ 0.756 |
+ 0.732 |
+ 0.762 |
+ 3297 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NCAP_I34A1_Doud_2015 |
+ 0.745 |
+ 0.737 |
+ 0.741 |
+ 0.742 |
+ 0.739 |
+ 0.742 |
+ 0.562 |
+ 0.726 |
+ 0.741 |
+ 0.738 |
+ 0.559 |
+ 0.551 |
+ 0.550 |
+ 0.552 |
+ 0.551 |
+ 0.543 |
+ 0.556 |
+ 0.562 |
+ 0.587 |
+ 0.708 |
+ 0.718 |
+ 0.738 |
+ 0.748 |
+ 0.753 |
+ 0.563 |
+ 0.566 |
+ 0.563 |
+ 0.557 |
+ 0.731 |
+ 0.758 |
+ 0.690 |
+ 0.690 |
+ 0.609 |
+ 0.733 |
+ 0.740 |
+ 0.763 |
+ 0.752 |
+ 0.755 |
+ 0.760 |
+ 0.753 |
+ 0.757 |
+ 0.756 |
+ 0.554 |
+ 0.558 |
+ 0.561 |
+ 0.553 |
+ 0.634 |
+ 0.640 |
+ 0.640 |
+ 0.614 |
+ 0.577 |
+ 0.598 |
+ 0.597 |
+ 0.584 |
+ 0.584 |
+ 0.575 |
+ 0.579 |
+ 0.571 |
+ 0.555 |
+ 0.581 |
+ 0.599 |
+ 0.580 |
+ 9462 |
+ OrganismalFitness |
+ NCAP_I34A1 |
+ Medium |
+ Virus |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R |
+ 0.899 |
+ 0.909 |
+ 0.913 |
+ 0.914 |
+ 0.906 |
+ 0.913 |
+ 0.886 |
+ 0.894 |
+ 0.897 |
+ 0.900 |
+ 0.860 |
+ 0.885 |
+ 0.874 |
+ 0.901 |
+ 0.886 |
+ 0.914 |
+ 0.925 |
+ 0.921 |
+ 0.900 |
+ 0.885 |
+ 0.903 |
+ 0.897 |
+ 0.881 |
+ 0.878 |
+ 0.896 |
+ 0.870 |
+ 0.898 |
+ 0.857 |
+ 0.874 |
+ 0.902 |
+ 0.887 |
+ 0.881 |
+ 0.874 |
+ 0.903 |
+ 0.895 |
+ 0.872 |
+ 0.914 |
+ 0.906 |
+ 0.903 |
+ 0.910 |
+ 0.910 |
+ 0.912 |
+ 0.895 |
+ 0.898 |
+ 0.870 |
+ 0.885 |
+ 0.900 |
+ 0.874 |
+ 0.905 |
+ 0.905 |
+ 0.922 |
+ 0.922 |
+ 0.922 |
+ 0.922 |
+ 0.925 |
+ 0.927 |
+ 0.921 |
+ 0.927 |
+ 0.920 |
+ 0.926 |
+ 0.882 |
+ 0.909 |
+ 2482 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_HEK293T |
+ 0.932 |
+ 0.942 |
+ 0.927 |
+ 0.939 |
+ 0.940 |
+ 0.937 |
+ 0.635 |
+ 0.889 |
+ 0.956 |
+ 0.955 |
+ 0.925 |
+ 0.763 |
+ 0.794 |
+ 0.685 |
+ 0.721 |
+ 0.858 |
+ 0.929 |
+ 0.932 |
+ 0.915 |
+ 0.612 |
+ 0.687 |
+ 0.838 |
+ 0.771 |
+ 0.809 |
+ 0.789 |
+ 0.798 |
+ 0.946 |
+ 0.893 |
+ 0.904 |
+ 0.947 |
+ 0.940 |
+ 0.906 |
+ 0.669 |
+ 0.716 |
+ 0.909 |
+ 0.923 |
+ 0.908 |
+ 0.946 |
+ 0.939 |
+ 0.927 |
+ 0.953 |
+ 0.947 |
+ 0.717 |
+ 0.607 |
+ 0.935 |
+ 0.806 |
+ 0.783 |
+ 0.945 |
+ 0.576 |
+ 0.716 |
+ 0.927 |
+ 0.925 |
+ 0.920 |
+ 0.905 |
+ 0.902 |
+ 0.906 |
+ 0.918 |
+ 0.912 |
+ 0.939 |
+ 0.921 |
+ 0.916 |
+ 0.719 |
+ 637 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_RPE1 |
+ 0.966 |
+ 0.943 |
+ 0.944 |
+ 0.969 |
+ 0.943 |
+ 0.969 |
+ 0.951 |
+ 0.960 |
+ 0.982 |
+ 0.985 |
+ 0.961 |
+ 0.691 |
+ 0.678 |
+ 0.589 |
+ 0.717 |
+ 0.724 |
+ 0.741 |
+ 0.847 |
+ 0.978 |
+ 0.920 |
+ 0.901 |
+ 0.976 |
+ 0.957 |
+ 0.955 |
+ 0.860 |
+ 0.857 |
+ 0.974 |
+ 0.958 |
+ 0.896 |
+ 0.968 |
+ 0.955 |
+ 0.937 |
+ 0.887 |
+ 0.919 |
+ 0.907 |
+ 0.904 |
+ 0.969 |
+ 0.971 |
+ 0.960 |
+ 0.971 |
+ 0.970 |
+ 0.944 |
+ 0.806 |
+ 0.920 |
+ 0.936 |
+ 0.565 |
+ 0.945 |
+ 0.926 |
+ 0.776 |
+ 0.837 |
+ 0.701 |
+ 0.846 |
+ 0.671 |
+ 0.798 |
+ 0.720 |
+ 0.714 |
+ 0.732 |
+ 0.943 |
+ 0.950 |
+ 0.741 |
+ 0.939 |
+ 0.778 |
+ 63 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NRAM_I33A0_Jiang_2016 |
+ 0.879 |
+ 0.947 |
+ 0.933 |
+ 0.933 |
+ 0.952 |
+ 0.953 |
+ 0.606 |
+ 0.919 |
+ 0.957 |
+ 0.963 |
+ 0.575 |
+ 0.684 |
+ 0.922 |
+ 0.568 |
+ 0.539 |
+ 0.564 |
+ 0.700 |
+ 0.914 |
+ 0.917 |
+ 0.872 |
+ 0.950 |
+ 0.942 |
+ 0.924 |
+ 0.912 |
+ 0.537 |
+ 0.898 |
+ 0.895 |
+ 0.889 |
+ 0.945 |
+ 0.944 |
+ 0.831 |
+ 0.841 |
+ 0.482 |
+ 0.941 |
+ 0.930 |
+ 0.946 |
+ 0.914 |
+ 0.923 |
+ 0.934 |
+ 0.955 |
+ 0.953 |
+ 0.953 |
+ 0.535 |
+ 0.568 |
+ 0.549 |
+ 0.568 |
+ 0.729 |
+ 0.766 |
+ 0.750 |
+ 0.644 |
+ 0.658 |
+ 0.688 |
+ 0.713 |
+ 0.717 |
+ 0.748 |
+ 0.773 |
+ 0.703 |
+ 0.764 |
+ 0.641 |
+ 0.757 |
+ 0.649 |
+ 0.561 |
+ 298 |
+ OrganismalFitness |
+ NRAM_I33A0 |
+ Low |
+ Virus |
+
+
+ NUD15_HUMAN_Suiter_2020 |
+ 0.693 |
+ 0.820 |
+ 0.817 |
+ 0.800 |
+ 0.820 |
+ 0.822 |
+ 0.522 |
+ 0.699 |
+ 0.808 |
+ 0.841 |
+ 0.821 |
+ 0.806 |
+ 0.836 |
+ 0.615 |
+ 0.637 |
+ 0.658 |
+ 0.770 |
+ 0.804 |
+ 0.806 |
+ 0.747 |
+ 0.616 |
+ 0.697 |
+ 0.800 |
+ 0.823 |
+ 0.674 |
+ 0.851 |
+ 0.819 |
+ 0.812 |
+ 0.836 |
+ 0.829 |
+ 0.848 |
+ 0.818 |
+ 0.573 |
+ 0.642 |
+ 0.713 |
+ 0.846 |
+ 0.716 |
+ 0.745 |
+ 0.841 |
+ 0.818 |
+ 0.824 |
+ 0.841 |
+ 0.666 |
+ 0.545 |
+ 0.811 |
+ 0.685 |
+ 0.685 |
+ 0.823 |
+ 0.710 |
+ 0.728 |
+ 0.753 |
+ 0.759 |
+ 0.738 |
+ 0.764 |
+ 0.754 |
+ 0.774 |
+ 0.779 |
+ 0.780 |
+ 0.772 |
+ 0.777 |
+ 0.826 |
+ 0.742 |
+ 2844 |
+ Expression |
+ NUD15_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL |
+ 0.656 |
+ 0.850 |
+ 0.817 |
+ 0.811 |
+ 0.813 |
+ 0.826 |
+ 0.644 |
+ 0.750 |
+ 0.848 |
+ 0.831 |
+ 0.753 |
+ 0.477 |
+ 0.515 |
+ 0.636 |
+ 0.592 |
+ 0.590 |
+ 0.651 |
+ 0.674 |
+ 0.683 |
+ 0.810 |
+ 0.796 |
+ 0.832 |
+ 0.844 |
+ 0.859 |
+ 0.569 |
+ 0.790 |
+ 0.599 |
+ 0.866 |
+ 0.862 |
+ 0.824 |
+ 0.837 |
+ 0.838 |
+ 0.534 |
+ 0.728 |
+ 0.621 |
+ 0.685 |
+ 0.704 |
+ 0.634 |
+ 0.689 |
+ 0.833 |
+ 0.808 |
+ 0.836 |
+ 0.671 |
+ 0.723 |
+ 0.666 |
+ 0.672 |
+ 0.899 |
+ 0.836 |
+ 0.859 |
+ 0.877 |
+ 0.832 |
+ 0.869 |
+ 0.851 |
+ 0.870 |
+ 0.865 |
+ 0.863 |
+ 0.858 |
+ 0.843 |
+ 0.856 |
+ 0.860 |
+ 0.842 |
+ 0.672 |
+ 2028 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6 |
+ 0.869 |
+ 0.869 |
+ 0.880 |
+ 0.886 |
+ 0.884 |
+ 0.885 |
+ 0.763 |
+ 0.843 |
+ 0.901 |
+ 0.889 |
+ 0.840 |
+ 0.860 |
+ 0.892 |
+ 0.747 |
+ 0.862 |
+ 0.885 |
+ 0.877 |
+ 0.871 |
+ 0.864 |
+ 0.858 |
+ 0.841 |
+ 0.862 |
+ 0.849 |
+ 0.848 |
+ 0.874 |
+ 0.860 |
+ 0.862 |
+ 0.850 |
+ 0.847 |
+ 0.880 |
+ 0.864 |
+ 0.832 |
+ 0.816 |
+ 0.842 |
+ 0.808 |
+ 0.858 |
+ 0.876 |
+ 0.871 |
+ 0.881 |
+ 0.876 |
+ 0.878 |
+ 0.885 |
+ 0.861 |
+ 0.708 |
+ 0.868 |
+ 0.897 |
+ 0.899 |
+ 0.883 |
+ 0.917 |
+ 0.895 |
+ 0.874 |
+ 0.894 |
+ 0.888 |
+ 0.877 |
+ 0.880 |
+ 0.882 |
+ 0.891 |
+ 0.897 |
+ 0.888 |
+ 0.890 |
+ 0.872 |
+ 0.899 |
+ 1380 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C |
+ 0.815 |
+ 0.901 |
+ 0.922 |
+ 0.927 |
+ 0.924 |
+ 0.922 |
+ 0.619 |
+ 0.810 |
+ 0.896 |
+ 0.896 |
+ 0.887 |
+ 0.875 |
+ 0.882 |
+ 0.627 |
+ 0.863 |
+ 0.893 |
+ 0.913 |
+ 0.903 |
+ 0.894 |
+ 0.902 |
+ 0.878 |
+ 0.867 |
+ 0.867 |
+ 0.860 |
+ 0.873 |
+ 0.867 |
+ 0.880 |
+ 0.860 |
+ 0.850 |
+ 0.897 |
+ 0.903 |
+ 0.895 |
+ 0.788 |
+ 0.624 |
+ 0.674 |
+ 0.811 |
+ 0.875 |
+ 0.876 |
+ 0.891 |
+ 0.922 |
+ 0.918 |
+ 0.921 |
+ 0.832 |
+ 0.619 |
+ 0.843 |
+ 0.853 |
+ 0.906 |
+ 0.847 |
+ 0.934 |
+ 0.928 |
+ 0.926 |
+ 0.929 |
+ 0.933 |
+ 0.931 |
+ 0.932 |
+ 0.927 |
+ 0.920 |
+ 0.922 |
+ 0.925 |
+ 0.935 |
+ 0.934 |
+ 0.882 |
+ 3197 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G |
+ 0.833 |
+ 0.832 |
+ 0.851 |
+ 0.852 |
+ 0.815 |
+ 0.832 |
+ 0.831 |
+ 0.824 |
+ 0.843 |
+ 0.845 |
+ 0.857 |
+ 0.808 |
+ 0.790 |
+ 0.825 |
+ 0.817 |
+ 0.862 |
+ 0.881 |
+ 0.833 |
+ 0.857 |
+ 0.864 |
+ 0.847 |
+ 0.845 |
+ 0.864 |
+ 0.835 |
+ 0.821 |
+ 0.843 |
+ 0.865 |
+ 0.852 |
+ 0.852 |
+ 0.865 |
+ 0.838 |
+ 0.830 |
+ 0.824 |
+ 0.817 |
+ 0.816 |
+ 0.823 |
+ 0.853 |
+ 0.820 |
+ 0.826 |
+ 0.851 |
+ 0.825 |
+ 0.830 |
+ 0.855 |
+ 0.822 |
+ 0.867 |
+ 0.869 |
+ 0.881 |
+ 0.891 |
+ 0.883 |
+ 0.911 |
+ 0.866 |
+ 0.863 |
+ 0.866 |
+ 0.876 |
+ 0.872 |
+ 0.867 |
+ 0.873 |
+ 0.871 |
+ 0.862 |
+ 0.871 |
+ 0.880 |
+ 0.836 |
+ 1134 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OPSD_HUMAN_Wan_2019 |
+ 0.585 |
+ 0.704 |
+ 0.757 |
+ 0.773 |
+ 0.781 |
+ 0.780 |
+ 0.439 |
+ 0.770 |
+ 0.627 |
+ 0.800 |
+ 0.527 |
+ 0.670 |
+ 0.703 |
+ 0.551 |
+ 0.646 |
+ 0.533 |
+ 0.602 |
+ 0.648 |
+ 0.574 |
+ 0.686 |
+ 0.749 |
+ 0.720 |
+ 0.643 |
+ 0.647 |
+ 0.729 |
+ 0.660 |
+ 0.674 |
+ 0.719 |
+ 0.659 |
+ 0.763 |
+ 0.639 |
+ 0.468 |
+ 0.166 |
+ 0.682 |
+ 0.651 |
+ 0.538 |
+ 0.758 |
+ 0.686 |
+ 0.686 |
+ 0.779 |
+ 0.757 |
+ 0.770 |
+ 0.506 |
+ 0.317 |
+ 0.665 |
+ 0.622 |
+ 0.615 |
+ 0.697 |
+ 0.541 |
+ 0.383 |
+ 0.536 |
+ 0.472 |
+ 0.449 |
+ 0.472 |
+ 0.471 |
+ 0.512 |
+ 0.527 |
+ 0.559 |
+ 0.522 |
+ 0.516 |
+ 0.764 |
+ 0.677 |
+ 165 |
+ Expression |
+ OPSD_HUMAN |
+ High |
+ Human |
+
+
+ OTC_HUMAN_Lo_2023 |
+ 0.669 |
+ 0.803 |
+ 0.772 |
+ 0.774 |
+ 0.780 |
+ 0.789 |
+ 0.400 |
+ 0.551 |
+ 0.775 |
+ 0.797 |
+ 0.729 |
+ 0.768 |
+ 0.778 |
+ 0.423 |
+ 0.573 |
+ 0.626 |
+ 0.695 |
+ 0.676 |
+ 0.700 |
+ 0.721 |
+ 0.649 |
+ 0.687 |
+ 0.746 |
+ 0.778 |
+ 0.659 |
+ 0.764 |
+ 0.757 |
+ 0.756 |
+ 0.779 |
+ 0.792 |
+ 0.763 |
+ 0.707 |
+ 0.474 |
+ 0.630 |
+ 0.736 |
+ 0.771 |
+ 0.685 |
+ 0.739 |
+ 0.778 |
+ 0.788 |
+ 0.783 |
+ 0.796 |
+ 0.509 |
+ 0.384 |
+ 0.675 |
+ 0.581 |
+ 0.751 |
+ 0.736 |
+ 0.809 |
+ 0.637 |
+ 0.702 |
+ 0.717 |
+ 0.739 |
+ 0.713 |
+ 0.710 |
+ 0.707 |
+ 0.706 |
+ 0.739 |
+ 0.693 |
+ 0.715 |
+ 0.807 |
+ 0.677 |
+ 1570 |
+ Activity |
+ OTC_HUMAN |
+ Medium |
+ Human |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D |
+ 0.610 |
+ 0.755 |
+ 0.632 |
+ 0.652 |
+ 0.663 |
+ 0.665 |
+ 0.583 |
+ 0.707 |
+ 0.595 |
+ 0.599 |
+ 0.838 |
+ 0.796 |
+ 0.797 |
+ 0.581 |
+ 0.806 |
+ 0.806 |
+ 0.727 |
+ 0.753 |
+ 0.804 |
+ 0.640 |
+ 0.577 |
+ 0.715 |
+ 0.746 |
+ 0.756 |
+ 0.729 |
+ 0.734 |
+ 0.766 |
+ 0.742 |
+ 0.692 |
+ 0.748 |
+ 0.764 |
+ 0.721 |
+ 0.734 |
+ 0.607 |
+ 0.674 |
+ 0.750 |
+ 0.628 |
+ 0.638 |
+ 0.750 |
+ 0.666 |
+ 0.655 |
+ 0.707 |
+ 0.651 |
+ 0.603 |
+ 0.780 |
+ 0.827 |
+ 0.821 |
+ 0.792 |
+ 0.840 |
+ 0.807 |
+ 0.759 |
+ 0.769 |
+ 0.771 |
+ 0.766 |
+ 0.772 |
+ 0.759 |
+ 0.767 |
+ 0.730 |
+ 0.738 |
+ 0.762 |
+ 0.869 |
+ 0.844 |
+ 635 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ OXDA_RHOTO_Vanella_2023_activity |
+ 0.453 |
+ 0.585 |
+ 0.590 |
+ 0.595 |
+ 0.597 |
+ 0.606 |
+ 0.445 |
+ 0.451 |
+ 0.481 |
+ 0.494 |
+ 0.547 |
+ 0.518 |
+ 0.528 |
+ 0.435 |
+ 0.450 |
+ 0.520 |
+ 0.542 |
+ 0.585 |
+ 0.590 |
+ 0.494 |
+ 0.438 |
+ 0.475 |
+ 0.504 |
+ 0.488 |
+ 0.458 |
+ 0.502 |
+ 0.534 |
+ 0.508 |
+ 0.577 |
+ 0.607 |
+ 0.594 |
+ 0.572 |
+ 0.391 |
+ 0.460 |
+ 0.467 |
+ 0.477 |
+ 0.492 |
+ 0.490 |
+ 0.489 |
+ 0.597 |
+ 0.600 |
+ 0.595 |
+ 0.447 |
+ 0.421 |
+ 0.536 |
+ 0.478 |
+ 0.541 |
+ 0.585 |
+ 0.585 |
+ 0.499 |
+ 0.549 |
+ 0.569 |
+ 0.562 |
+ 0.563 |
+ 0.554 |
+ 0.577 |
+ 0.563 |
+ 0.569 |
+ 0.546 |
+ 0.574 |
+ 0.575 |
+ 0.490 |
+ 6396 |
+ Activity |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ OXDA_RHOTO_Vanella_2023_expression |
+ 0.664 |
+ 0.737 |
+ 0.730 |
+ 0.732 |
+ 0.738 |
+ 0.746 |
+ 0.680 |
+ 0.673 |
+ 0.694 |
+ 0.693 |
+ 0.727 |
+ 0.708 |
+ 0.718 |
+ 0.666 |
+ 0.686 |
+ 0.716 |
+ 0.728 |
+ 0.744 |
+ 0.750 |
+ 0.650 |
+ 0.642 |
+ 0.696 |
+ 0.711 |
+ 0.693 |
+ 0.676 |
+ 0.716 |
+ 0.715 |
+ 0.700 |
+ 0.730 |
+ 0.746 |
+ 0.727 |
+ 0.718 |
+ 0.636 |
+ 0.675 |
+ 0.674 |
+ 0.687 |
+ 0.691 |
+ 0.683 |
+ 0.688 |
+ 0.739 |
+ 0.741 |
+ 0.743 |
+ 0.667 |
+ 0.657 |
+ 0.722 |
+ 0.688 |
+ 0.748 |
+ 0.755 |
+ 0.726 |
+ 0.630 |
+ 0.739 |
+ 0.730 |
+ 0.729 |
+ 0.726 |
+ 0.728 |
+ 0.736 |
+ 0.728 |
+ 0.731 |
+ 0.727 |
+ 0.735 |
+ 0.746 |
+ 0.710 |
+ 6769 |
+ Expression |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Etoposide |
+ 0.828 |
+ 0.833 |
+ 0.830 |
+ 0.829 |
+ 0.828 |
+ 0.827 |
+ 0.746 |
+ 0.829 |
+ 0.829 |
+ 0.826 |
+ 0.841 |
+ 0.833 |
+ 0.847 |
+ 0.741 |
+ 0.738 |
+ 0.796 |
+ 0.825 |
+ 0.830 |
+ 0.841 |
+ 0.822 |
+ 0.797 |
+ 0.813 |
+ 0.836 |
+ 0.814 |
+ 0.812 |
+ 0.846 |
+ 0.861 |
+ 0.840 |
+ 0.838 |
+ 0.832 |
+ 0.849 |
+ 0.837 |
+ 0.730 |
+ 0.770 |
+ 0.815 |
+ 0.821 |
+ 0.827 |
+ 0.836 |
+ 0.826 |
+ 0.827 |
+ 0.832 |
+ 0.830 |
+ 0.746 |
+ 0.727 |
+ 0.830 |
+ 0.752 |
+ 0.815 |
+ 0.839 |
+ 0.812 |
+ 0.811 |
+ 0.836 |
+ 0.835 |
+ 0.830 |
+ 0.827 |
+ 0.833 |
+ 0.838 |
+ 0.827 |
+ 0.833 |
+ 0.829 |
+ 0.832 |
+ 0.831 |
+ 0.774 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Nutlin |
+ 0.592 |
+ 0.597 |
+ 0.589 |
+ 0.590 |
+ 0.589 |
+ 0.590 |
+ 0.539 |
+ 0.602 |
+ 0.596 |
+ 0.597 |
+ 0.647 |
+ 0.617 |
+ 0.646 |
+ 0.529 |
+ 0.519 |
+ 0.573 |
+ 0.598 |
+ 0.602 |
+ 0.619 |
+ 0.609 |
+ 0.575 |
+ 0.612 |
+ 0.657 |
+ 0.614 |
+ 0.594 |
+ 0.654 |
+ 0.701 |
+ 0.683 |
+ 0.621 |
+ 0.615 |
+ 0.654 |
+ 0.623 |
+ 0.548 |
+ 0.552 |
+ 0.616 |
+ 0.597 |
+ 0.598 |
+ 0.614 |
+ 0.592 |
+ 0.590 |
+ 0.597 |
+ 0.592 |
+ 0.540 |
+ 0.522 |
+ 0.622 |
+ 0.532 |
+ 0.620 |
+ 0.639 |
+ 0.624 |
+ 0.600 |
+ 0.628 |
+ 0.619 |
+ 0.615 |
+ 0.604 |
+ 0.611 |
+ 0.612 |
+ 0.606 |
+ 0.615 |
+ 0.613 |
+ 0.611 |
+ 0.618 |
+ 0.565 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_WT_Nutlin |
+ 0.688 |
+ 0.697 |
+ 0.698 |
+ 0.699 |
+ 0.691 |
+ 0.689 |
+ 0.607 |
+ 0.691 |
+ 0.691 |
+ 0.691 |
+ 0.728 |
+ 0.698 |
+ 0.723 |
+ 0.626 |
+ 0.634 |
+ 0.675 |
+ 0.677 |
+ 0.692 |
+ 0.695 |
+ 0.651 |
+ 0.663 |
+ 0.708 |
+ 0.729 |
+ 0.711 |
+ 0.690 |
+ 0.726 |
+ 0.744 |
+ 0.745 |
+ 0.719 |
+ 0.699 |
+ 0.715 |
+ 0.689 |
+ 0.572 |
+ 0.623 |
+ 0.719 |
+ 0.721 |
+ 0.684 |
+ 0.709 |
+ 0.708 |
+ 0.690 |
+ 0.700 |
+ 0.703 |
+ 0.621 |
+ 0.601 |
+ 0.705 |
+ 0.623 |
+ 0.692 |
+ 0.727 |
+ 0.694 |
+ 0.687 |
+ 0.698 |
+ 0.706 |
+ 0.703 |
+ 0.704 |
+ 0.698 |
+ 0.703 |
+ 0.689 |
+ 0.688 |
+ 0.681 |
+ 0.698 |
+ 0.700 |
+ 0.656 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018 |
+ 0.859 |
+ 0.841 |
+ 0.843 |
+ 0.839 |
+ 0.817 |
+ 0.821 |
+ 0.587 |
+ 0.829 |
+ 0.835 |
+ 0.839 |
+ 0.838 |
+ 0.825 |
+ 0.835 |
+ 0.623 |
+ 0.610 |
+ 0.773 |
+ 0.817 |
+ 0.829 |
+ 0.831 |
+ 0.804 |
+ 0.776 |
+ 0.798 |
+ 0.803 |
+ 0.810 |
+ 0.793 |
+ 0.811 |
+ 0.804 |
+ 0.804 |
+ 0.787 |
+ 0.851 |
+ 0.821 |
+ 0.813 |
+ 0.566 |
+ 0.727 |
+ 0.802 |
+ 0.788 |
+ 0.863 |
+ 0.855 |
+ 0.855 |
+ 0.833 |
+ 0.844 |
+ 0.840 |
+ 0.622 |
+ 0.588 |
+ 0.837 |
+ 0.637 |
+ 0.713 |
+ 0.832 |
+ 0.764 |
+ 0.744 |
+ 0.795 |
+ 0.827 |
+ 0.812 |
+ 0.811 |
+ 0.808 |
+ 0.814 |
+ 0.801 |
+ 0.808 |
+ 0.806 |
+ 0.807 |
+ 0.851 |
+ 0.728 |
+ 1048 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P84126_THETH_Chan_2017 |
+ 0.749 |
+ 0.861 |
+ 0.905 |
+ 0.905 |
+ 0.874 |
+ 0.876 |
+ 0.642 |
+ 0.855 |
+ 0.906 |
+ 0.879 |
+ 0.891 |
+ 0.856 |
+ 0.857 |
+ 0.721 |
+ 0.860 |
+ 0.864 |
+ 0.901 |
+ 0.889 |
+ 0.905 |
+ 0.897 |
+ 0.773 |
+ 0.823 |
+ 0.827 |
+ 0.860 |
+ 0.796 |
+ 0.858 |
+ 0.908 |
+ 0.855 |
+ 0.945 |
+ 0.803 |
+ 0.905 |
+ 0.877 |
+ 0.782 |
+ 0.801 |
+ 0.755 |
+ 0.837 |
+ 0.775 |
+ 0.744 |
+ 0.779 |
+ 0.847 |
+ 0.844 |
+ 0.860 |
+ 0.757 |
+ 0.582 |
+ 0.824 |
+ 0.828 |
+ 0.748 |
+ 0.902 |
+ 0.893 |
+ 0.698 |
+ 0.861 |
+ 0.878 |
+ 0.856 |
+ 0.863 |
+ 0.888 |
+ 0.892 |
+ 0.886 |
+ 0.886 |
+ 0.899 |
+ 0.891 |
+ 0.896 |
+ 0.895 |
+ 1519 |
+ OrganismalFitness |
+ P84126_THETH |
+ Medium |
+ Prokaryote |
+
+
+ PA_I34A1_Wu_2015 |
+ 0.463 |
+ 0.460 |
+ 0.464 |
+ 0.453 |
+ 0.457 |
+ 0.453 |
+ 0.228 |
+ 0.429 |
+ 0.245 |
+ 0.258 |
+ 0.231 |
+ 0.236 |
+ 0.245 |
+ 0.220 |
+ 0.218 |
+ 0.220 |
+ 0.215 |
+ 0.215 |
+ 0.401 |
+ 0.391 |
+ 0.417 |
+ 0.422 |
+ 0.410 |
+ 0.455 |
+ 0.241 |
+ 0.387 |
+ 0.407 |
+ 0.381 |
+ 0.409 |
+ 0.431 |
+ 0.396 |
+ 0.417 |
+ 0.266 |
+ 0.413 |
+ 0.435 |
+ 0.439 |
+ 0.460 |
+ 0.454 |
+ 0.449 |
+ 0.455 |
+ 0.459 |
+ 0.451 |
+ 0.226 |
+ 0.223 |
+ 0.230 |
+ 0.232 |
+ 0.265 |
+ 0.257 |
+ 0.205 |
+ 0.273 |
+ 0.209 |
+ 0.209 |
+ 0.222 |
+ 0.212 |
+ 0.215 |
+ 0.225 |
+ 0.211 |
+ 0.216 |
+ 0.215 |
+ 0.229 |
+ 0.254 |
+ 0.213 |
+ 1820 |
+ OrganismalFitness |
+ PA_I34A1 |
+ Medium |
+ Virus |
+
+
+ PABP_YEAST_Melamed_2013 |
+ 0.755 |
+ 0.745 |
+ 0.735 |
+ 0.738 |
+ 0.764 |
+ 0.760 |
+ 0.603 |
+ 0.734 |
+ 0.753 |
+ 0.764 |
+ 0.759 |
+ 0.765 |
+ 0.764 |
+ 0.609 |
+ 0.634 |
+ 0.671 |
+ 0.722 |
+ 0.717 |
+ 0.739 |
+ 0.727 |
+ 0.731 |
+ 0.753 |
+ 0.754 |
+ 0.761 |
+ 0.741 |
+ 0.756 |
+ 0.764 |
+ 0.759 |
+ 0.745 |
+ 0.776 |
+ 0.767 |
+ 0.744 |
+ 0.524 |
+ 0.720 |
+ 0.740 |
+ 0.741 |
+ 0.745 |
+ 0.757 |
+ 0.751 |
+ 0.760 |
+ 0.766 |
+ 0.763 |
+ 0.700 |
+ 0.447 |
+ 0.759 |
+ 0.744 |
+ 0.474 |
+ 0.750 |
+ 0.681 |
+ 0.532 |
+ 0.709 |
+ 0.701 |
+ 0.711 |
+ 0.709 |
+ 0.709 |
+ 0.709 |
+ 0.712 |
+ 0.716 |
+ 0.718 |
+ 0.713 |
+ 0.750 |
+ 0.684 |
+ 37708 |
+ OrganismalFitness |
+ PABP_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ PAI1_HUMAN_Huttinger_2021 |
+ 0.797 |
+ 0.830 |
+ 0.823 |
+ 0.826 |
+ 0.823 |
+ 0.823 |
+ 0.694 |
+ 0.775 |
+ 0.807 |
+ 0.823 |
+ 0.810 |
+ 0.797 |
+ 0.809 |
+ 0.696 |
+ 0.800 |
+ 0.806 |
+ 0.812 |
+ 0.816 |
+ 0.803 |
+ 0.811 |
+ 0.701 |
+ 0.796 |
+ 0.798 |
+ 0.806 |
+ 0.793 |
+ 0.813 |
+ 0.813 |
+ 0.814 |
+ 0.812 |
+ 0.816 |
+ 0.805 |
+ 0.792 |
+ 0.721 |
+ 0.735 |
+ 0.804 |
+ 0.810 |
+ 0.795 |
+ 0.808 |
+ 0.811 |
+ 0.828 |
+ 0.827 |
+ 0.829 |
+ 0.784 |
+ 0.688 |
+ 0.805 |
+ 0.802 |
+ 0.781 |
+ 0.831 |
+ 0.814 |
+ 0.751 |
+ 0.815 |
+ 0.808 |
+ 0.813 |
+ 0.812 |
+ 0.813 |
+ 0.816 |
+ 0.812 |
+ 0.812 |
+ 0.814 |
+ 0.817 |
+ 0.825 |
+ 0.798 |
+ 5345 |
+ Activity |
+ PAI1_HUMAN |
+ NaN |
+ Human |
+
+
+ PHOT_CHLRE_Chen_2023 |
+ 0.707 |
+ 0.809 |
+ 0.890 |
+ 0.882 |
+ 0.781 |
+ 0.780 |
+ 0.843 |
+ 0.835 |
+ 0.886 |
+ 0.883 |
+ 0.851 |
+ 0.877 |
+ 0.889 |
+ 0.890 |
+ 0.903 |
+ 0.885 |
+ 0.897 |
+ 0.882 |
+ 0.889 |
+ 0.827 |
+ 0.845 |
+ 0.875 |
+ 0.848 |
+ 0.872 |
+ 0.836 |
+ 0.859 |
+ 0.871 |
+ 0.868 |
+ 0.833 |
+ 0.827 |
+ 0.841 |
+ 0.810 |
+ 0.756 |
+ 0.798 |
+ 0.782 |
+ 0.867 |
+ 0.839 |
+ 0.824 |
+ 0.866 |
+ 0.788 |
+ 0.798 |
+ 0.791 |
+ 0.875 |
+ 0.783 |
+ 0.872 |
+ 0.884 |
+ 0.703 |
+ 0.848 |
+ 0.835 |
+ 0.735 |
+ 0.838 |
+ 0.815 |
+ 0.816 |
+ 0.819 |
+ 0.819 |
+ 0.827 |
+ 0.826 |
+ 0.828 |
+ 0.830 |
+ 0.826 |
+ 0.882 |
+ 0.883 |
+ 167529 |
+ Activity |
+ PHOT_CHLRE |
+ High |
+ Eukaryote |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C |
+ 0.873 |
+ 0.868 |
+ 0.900 |
+ 0.898 |
+ 0.889 |
+ 0.896 |
+ 0.847 |
+ 0.821 |
+ 0.875 |
+ 0.896 |
+ 0.888 |
+ 0.848 |
+ 0.890 |
+ 0.862 |
+ 0.878 |
+ 0.892 |
+ 0.895 |
+ 0.819 |
+ 0.806 |
+ 0.895 |
+ 0.752 |
+ 0.885 |
+ 0.890 |
+ 0.871 |
+ 0.826 |
+ 0.859 |
+ 0.883 |
+ 0.878 |
+ 0.864 |
+ 0.883 |
+ 0.889 |
+ 0.863 |
+ 0.869 |
+ 0.812 |
+ 0.870 |
+ 0.891 |
+ 0.869 |
+ 0.894 |
+ 0.899 |
+ 0.890 |
+ 0.900 |
+ 0.905 |
+ 0.851 |
+ 0.644 |
+ 0.907 |
+ 0.839 |
+ 0.889 |
+ 0.897 |
+ 0.889 |
+ 0.868 |
+ 0.884 |
+ 0.895 |
+ 0.880 |
+ 0.869 |
+ 0.865 |
+ 0.890 |
+ 0.879 |
+ 0.888 |
+ 0.887 |
+ 0.888 |
+ 0.903 |
+ 0.901 |
+ 802 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M |
+ 0.888 |
+ 0.864 |
+ 0.868 |
+ 0.867 |
+ 0.874 |
+ 0.870 |
+ 0.863 |
+ 0.804 |
+ 0.873 |
+ 0.899 |
+ 0.812 |
+ 0.850 |
+ 0.850 |
+ 0.860 |
+ 0.887 |
+ 0.897 |
+ 0.900 |
+ 0.885 |
+ 0.863 |
+ 0.867 |
+ 0.826 |
+ 0.812 |
+ 0.826 |
+ 0.822 |
+ 0.851 |
+ 0.838 |
+ 0.841 |
+ 0.820 |
+ 0.820 |
+ 0.858 |
+ 0.828 |
+ 0.822 |
+ 0.806 |
+ 0.841 |
+ 0.859 |
+ 0.797 |
+ 0.860 |
+ 0.874 |
+ 0.836 |
+ 0.866 |
+ 0.877 |
+ 0.861 |
+ 0.858 |
+ 0.838 |
+ 0.791 |
+ 0.852 |
+ 0.832 |
+ 0.773 |
+ 0.898 |
+ 0.857 |
+ 0.892 |
+ 0.894 |
+ 0.904 |
+ 0.900 |
+ 0.906 |
+ 0.891 |
+ 0.898 |
+ 0.879 |
+ 0.893 |
+ 0.897 |
+ 0.807 |
+ 0.888 |
+ 1824 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF |
+ 0.866 |
+ 0.888 |
+ 0.844 |
+ 0.835 |
+ 0.846 |
+ 0.866 |
+ 0.864 |
+ 0.821 |
+ 0.803 |
+ 0.807 |
+ 0.867 |
+ 0.860 |
+ 0.871 |
+ 0.857 |
+ 0.859 |
+ 0.894 |
+ 0.863 |
+ 0.866 |
+ 0.865 |
+ 0.862 |
+ 0.881 |
+ 0.890 |
+ 0.875 |
+ 0.875 |
+ 0.887 |
+ 0.872 |
+ 0.881 |
+ 0.902 |
+ 0.864 |
+ 0.829 |
+ 0.870 |
+ 0.865 |
+ 0.783 |
+ 0.874 |
+ 0.898 |
+ 0.904 |
+ 0.872 |
+ 0.882 |
+ 0.873 |
+ 0.877 |
+ 0.870 |
+ 0.859 |
+ 0.848 |
+ 0.853 |
+ 0.871 |
+ 0.843 |
+ 0.887 |
+ 0.872 |
+ 0.876 |
+ 0.894 |
+ 0.859 |
+ 0.861 |
+ 0.872 |
+ 0.868 |
+ 0.863 |
+ 0.861 |
+ 0.862 |
+ 0.868 |
+ 0.865 |
+ 0.866 |
+ 0.890 |
+ 0.877 |
+ 1301 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_CXB3N_Mattenberger_2021 |
+ 0.768 |
+ 0.817 |
+ 0.798 |
+ 0.816 |
+ 0.821 |
+ 0.824 |
+ 0.574 |
+ 0.733 |
+ 0.815 |
+ 0.821 |
+ 0.658 |
+ 0.581 |
+ 0.596 |
+ 0.583 |
+ 0.577 |
+ 0.624 |
+ 0.705 |
+ 0.727 |
+ 0.754 |
+ 0.757 |
+ 0.738 |
+ 0.788 |
+ 0.795 |
+ 0.790 |
+ 0.632 |
+ 0.764 |
+ 0.772 |
+ 0.765 |
+ 0.801 |
+ 0.820 |
+ 0.759 |
+ 0.750 |
+ 0.598 |
+ 0.599 |
+ 0.718 |
+ 0.773 |
+ 0.756 |
+ 0.775 |
+ 0.792 |
+ 0.793 |
+ 0.809 |
+ 0.816 |
+ 0.580 |
+ 0.576 |
+ 0.682 |
+ 0.585 |
+ 0.627 |
+ 0.692 |
+ 0.596 |
+ 0.618 |
+ 0.683 |
+ 0.686 |
+ 0.688 |
+ 0.687 |
+ 0.684 |
+ 0.678 |
+ 0.691 |
+ 0.690 |
+ 0.690 |
+ 0.690 |
+ 0.597 |
+ 0.588 |
+ 15711 |
+ OrganismalFitness |
+ POLG_CXB3N |
+ Medium |
+ Virus |
+
+
+ POLG_DEN26_Suphatrakul_2023 |
+ 0.773 |
+ 0.829 |
+ 0.767 |
+ 0.781 |
+ 0.813 |
+ 0.816 |
+ 0.472 |
+ 0.761 |
+ 0.896 |
+ 0.898 |
+ 0.568 |
+ 0.460 |
+ 0.474 |
+ 0.450 |
+ 0.459 |
+ 0.487 |
+ 0.504 |
+ 0.547 |
+ 0.595 |
+ 0.784 |
+ 0.761 |
+ 0.761 |
+ 0.763 |
+ 0.715 |
+ 0.770 |
+ 0.783 |
+ 0.800 |
+ 0.798 |
+ 0.810 |
+ 0.870 |
+ 0.854 |
+ 0.794 |
+ 0.512 |
+ 0.455 |
+ 0.571 |
+ 0.795 |
+ 0.750 |
+ 0.776 |
+ 0.834 |
+ 0.826 |
+ 0.830 |
+ 0.856 |
+ 0.467 |
+ 0.448 |
+ 0.629 |
+ 0.464 |
+ 0.603 |
+ 0.727 |
+ 0.453 |
+ 0.555 |
+ 0.523 |
+ 0.526 |
+ 0.560 |
+ 0.534 |
+ 0.534 |
+ 0.539 |
+ 0.529 |
+ 0.536 |
+ 0.528 |
+ 0.548 |
+ 0.498 |
+ 0.495 |
+ 16897 |
+ OrganismalFitness |
+ POLG_DEN26 |
+ Low |
+ Virus |
+
+
+ POLG_HCVJF_Qi_2014 |
+ 0.679 |
+ 0.708 |
+ 0.686 |
+ 0.692 |
+ 0.705 |
+ 0.712 |
+ 0.186 |
+ 0.284 |
+ 0.703 |
+ 0.700 |
+ 0.283 |
+ 0.645 |
+ 0.643 |
+ 0.186 |
+ 0.202 |
+ 0.196 |
+ 0.229 |
+ 0.212 |
+ 0.237 |
+ 0.484 |
+ 0.400 |
+ 0.556 |
+ 0.631 |
+ 0.596 |
+ 0.542 |
+ 0.588 |
+ 0.467 |
+ 0.500 |
+ 0.615 |
+ 0.732 |
+ 0.601 |
+ 0.498 |
+ 0.286 |
+ 0.646 |
+ 0.654 |
+ 0.619 |
+ 0.667 |
+ 0.667 |
+ 0.669 |
+ 0.682 |
+ 0.668 |
+ 0.672 |
+ 0.197 |
+ 0.177 |
+ 0.596 |
+ 0.228 |
+ 0.195 |
+ 0.435 |
+ 0.517 |
+ 0.406 |
+ 0.335 |
+ 0.400 |
+ 0.284 |
+ 0.408 |
+ 0.330 |
+ 0.299 |
+ 0.413 |
+ 0.298 |
+ 0.299 |
+ 0.362 |
+ 0.276 |
+ 0.248 |
+ 1630 |
+ OrganismalFitness |
+ POLG_HCVJF |
+ Medium |
+ Virus |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD |
+ 0.582 |
+ 0.867 |
+ 0.806 |
+ 0.816 |
+ 0.825 |
+ 0.828 |
+ 0.544 |
+ 0.815 |
+ 0.703 |
+ 0.845 |
+ 0.818 |
+ 0.581 |
+ 0.589 |
+ 0.584 |
+ 0.590 |
+ 0.567 |
+ 0.613 |
+ 0.606 |
+ 0.589 |
+ 0.891 |
+ 0.518 |
+ 0.500 |
+ 0.496 |
+ 0.547 |
+ 0.542 |
+ 0.512 |
+ 0.526 |
+ 0.462 |
+ 0.577 |
+ 0.876 |
+ 0.891 |
+ 0.895 |
+ 0.517 |
+ 0.529 |
+ 0.506 |
+ 0.553 |
+ 0.704 |
+ 0.706 |
+ 0.698 |
+ 0.810 |
+ 0.813 |
+ 0.805 |
+ 0.648 |
+ 0.595 |
+ 0.643 |
+ 0.623 |
+ 0.845 |
+ 0.827 |
+ 0.902 |
+ 0.884 |
+ 0.896 |
+ 0.898 |
+ 0.887 |
+ 0.905 |
+ 0.904 |
+ 0.917 |
+ 0.903 |
+ 0.922 |
+ 0.904 |
+ 0.913 |
+ 0.883 |
+ 0.784 |
+ 5130 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PPARG_HUMAN_Majithia_2016 |
+ 0.785 |
+ 0.863 |
+ 0.872 |
+ 0.866 |
+ 0.872 |
+ 0.867 |
+ 0.773 |
+ 0.875 |
+ 0.865 |
+ 0.877 |
+ 0.861 |
+ 0.879 |
+ 0.888 |
+ 0.741 |
+ 0.766 |
+ 0.780 |
+ 0.807 |
+ 0.855 |
+ 0.889 |
+ 0.882 |
+ 0.868 |
+ 0.902 |
+ 0.863 |
+ 0.866 |
+ 0.842 |
+ 0.916 |
+ 0.917 |
+ 0.917 |
+ 0.861 |
+ 0.897 |
+ 0.849 |
+ 0.800 |
+ 0.783 |
+ 0.884 |
+ 0.904 |
+ 0.905 |
+ 0.871 |
+ 0.905 |
+ 0.900 |
+ 0.874 |
+ 0.889 |
+ 0.885 |
+ 0.757 |
+ 0.709 |
+ 0.873 |
+ 0.778 |
+ 0.830 |
+ 0.888 |
+ 0.848 |
+ 0.793 |
+ 0.825 |
+ 0.832 |
+ 0.822 |
+ 0.816 |
+ 0.831 |
+ 0.821 |
+ 0.825 |
+ 0.819 |
+ 0.810 |
+ 0.821 |
+ 0.841 |
+ 0.789 |
+ 9576 |
+ Activity |
+ PPARG_HUMAN |
+ Medium |
+ Human |
+
+
+ PPM1D_HUMAN_Miller_2022 |
+ 0.805 |
+ 0.807 |
+ 0.807 |
+ 0.809 |
+ 0.807 |
+ 0.807 |
+ 0.728 |
+ 0.774 |
+ 0.810 |
+ 0.807 |
+ 0.808 |
+ 0.817 |
+ 0.815 |
+ 0.744 |
+ 0.754 |
+ 0.784 |
+ 0.804 |
+ 0.814 |
+ 0.813 |
+ 0.801 |
+ 0.776 |
+ 0.783 |
+ 0.818 |
+ 0.812 |
+ 0.801 |
+ 0.818 |
+ 0.815 |
+ 0.818 |
+ 0.813 |
+ 0.810 |
+ 0.807 |
+ 0.787 |
+ 0.735 |
+ 0.778 |
+ 0.803 |
+ 0.820 |
+ 0.807 |
+ 0.812 |
+ 0.815 |
+ 0.808 |
+ 0.809 |
+ 0.809 |
+ 0.766 |
+ 0.717 |
+ 0.807 |
+ 0.790 |
+ 0.792 |
+ 0.817 |
+ 0.804 |
+ 0.772 |
+ 0.801 |
+ 0.800 |
+ 0.798 |
+ 0.799 |
+ 0.798 |
+ 0.798 |
+ 0.796 |
+ 0.794 |
+ 0.800 |
+ 0.800 |
+ 0.819 |
+ 0.783 |
+ 7889 |
+ OrganismalFitness |
+ PPM1D_HUMAN |
+ Low |
+ Human |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC |
+ 0.857 |
+ 0.872 |
+ 0.916 |
+ 0.913 |
+ 0.907 |
+ 0.910 |
+ 0.760 |
+ 0.854 |
+ 0.917 |
+ 0.922 |
+ 0.924 |
+ 0.805 |
+ 0.847 |
+ 0.765 |
+ 0.771 |
+ 0.894 |
+ 0.917 |
+ 0.915 |
+ 0.898 |
+ 0.901 |
+ 0.864 |
+ 0.911 |
+ 0.918 |
+ 0.902 |
+ 0.895 |
+ 0.911 |
+ 0.917 |
+ 0.917 |
+ 0.905 |
+ 0.901 |
+ 0.903 |
+ 0.878 |
+ 0.811 |
+ 0.814 |
+ 0.818 |
+ 0.858 |
+ 0.884 |
+ 0.888 |
+ 0.895 |
+ 0.908 |
+ 0.914 |
+ 0.909 |
+ 0.784 |
+ 0.766 |
+ 0.884 |
+ 0.780 |
+ 0.899 |
+ 0.862 |
+ 0.917 |
+ 0.901 |
+ 0.919 |
+ 0.924 |
+ 0.924 |
+ 0.922 |
+ 0.920 |
+ 0.923 |
+ 0.923 |
+ 0.922 |
+ 0.919 |
+ 0.926 |
+ 0.920 |
+ 0.906 |
+ 2033 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PRKN_HUMAN_Clausen_2023 |
+ 0.863 |
+ 0.870 |
+ 0.876 |
+ 0.878 |
+ 0.874 |
+ 0.874 |
+ 0.652 |
+ 0.823 |
+ 0.867 |
+ 0.870 |
+ 0.874 |
+ 0.861 |
+ 0.878 |
+ 0.683 |
+ 0.697 |
+ 0.720 |
+ 0.764 |
+ 0.860 |
+ 0.877 |
+ 0.852 |
+ 0.708 |
+ 0.855 |
+ 0.877 |
+ 0.871 |
+ 0.767 |
+ 0.876 |
+ 0.871 |
+ 0.881 |
+ 0.862 |
+ 0.874 |
+ 0.858 |
+ 0.828 |
+ 0.627 |
+ 0.696 |
+ 0.850 |
+ 0.873 |
+ 0.857 |
+ 0.871 |
+ 0.873 |
+ 0.873 |
+ 0.875 |
+ 0.874 |
+ 0.711 |
+ 0.584 |
+ 0.881 |
+ 0.717 |
+ 0.855 |
+ 0.873 |
+ 0.868 |
+ 0.773 |
+ 0.774 |
+ 0.803 |
+ 0.815 |
+ 0.821 |
+ 0.796 |
+ 0.791 |
+ 0.800 |
+ 0.799 |
+ 0.799 |
+ 0.804 |
+ 0.868 |
+ 0.787 |
+ 8756 |
+ Expression |
+ PRKN_HUMAN |
+ Low |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE |
+ 0.767 |
+ 0.759 |
+ 0.726 |
+ 0.731 |
+ 0.753 |
+ 0.754 |
+ 0.666 |
+ 0.709 |
+ 0.722 |
+ 0.720 |
+ 0.863 |
+ 0.796 |
+ 0.801 |
+ 0.681 |
+ 0.710 |
+ 0.823 |
+ 0.839 |
+ 0.806 |
+ 0.803 |
+ 0.774 |
+ 0.661 |
+ 0.627 |
+ 0.722 |
+ 0.735 |
+ 0.734 |
+ 0.754 |
+ 0.511 |
+ 0.741 |
+ 0.762 |
+ 0.737 |
+ 0.776 |
+ 0.727 |
+ 0.658 |
+ 0.657 |
+ 0.715 |
+ 0.742 |
+ 0.773 |
+ 0.764 |
+ 0.759 |
+ 0.755 |
+ 0.752 |
+ 0.765 |
+ 0.691 |
+ 0.659 |
+ 0.813 |
+ 0.673 |
+ 0.838 |
+ 0.811 |
+ 0.844 |
+ 0.821 |
+ 0.841 |
+ 0.825 |
+ 0.822 |
+ 0.823 |
+ 0.829 |
+ 0.821 |
+ 0.830 |
+ 0.835 |
+ 0.825 |
+ 0.835 |
+ 0.806 |
+ 0.815 |
+ 1579 |
+ Stability |
+ PSAE_PICP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Matreyek_2021 |
+ 0.766 |
+ 0.790 |
+ 0.786 |
+ 0.780 |
+ 0.779 |
+ 0.783 |
+ 0.686 |
+ 0.772 |
+ 0.791 |
+ 0.799 |
+ 0.789 |
+ 0.791 |
+ 0.805 |
+ 0.671 |
+ 0.732 |
+ 0.775 |
+ 0.796 |
+ 0.764 |
+ 0.767 |
+ 0.797 |
+ 0.692 |
+ 0.807 |
+ 0.799 |
+ 0.801 |
+ 0.727 |
+ 0.783 |
+ 0.785 |
+ 0.799 |
+ 0.779 |
+ 0.812 |
+ 0.783 |
+ 0.779 |
+ 0.701 |
+ 0.730 |
+ 0.806 |
+ 0.791 |
+ 0.765 |
+ 0.806 |
+ 0.801 |
+ 0.788 |
+ 0.798 |
+ 0.797 |
+ 0.701 |
+ 0.672 |
+ 0.799 |
+ 0.751 |
+ 0.769 |
+ 0.797 |
+ 0.780 |
+ 0.754 |
+ 0.795 |
+ 0.790 |
+ 0.795 |
+ 0.798 |
+ 0.791 |
+ 0.790 |
+ 0.794 |
+ 0.801 |
+ 0.799 |
+ 0.798 |
+ 0.785 |
+ 0.769 |
+ 5083 |
+ Expression |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ PTEN_HUMAN_Mighell_2018 |
+ 0.858 |
+ 0.856 |
+ 0.858 |
+ 0.864 |
+ 0.865 |
+ 0.863 |
+ 0.755 |
+ 0.842 |
+ 0.855 |
+ 0.854 |
+ 0.856 |
+ 0.857 |
+ 0.864 |
+ 0.781 |
+ 0.831 |
+ 0.855 |
+ 0.851 |
+ 0.847 |
+ 0.850 |
+ 0.859 |
+ 0.806 |
+ 0.841 |
+ 0.855 |
+ 0.841 |
+ 0.830 |
+ 0.845 |
+ 0.845 |
+ 0.836 |
+ 0.848 |
+ 0.853 |
+ 0.850 |
+ 0.843 |
+ 0.718 |
+ 0.826 |
+ 0.853 |
+ 0.843 |
+ 0.854 |
+ 0.855 |
+ 0.856 |
+ 0.861 |
+ 0.861 |
+ 0.860 |
+ 0.822 |
+ 0.749 |
+ 0.850 |
+ 0.852 |
+ 0.833 |
+ 0.851 |
+ 0.851 |
+ 0.824 |
+ 0.855 |
+ 0.853 |
+ 0.852 |
+ 0.853 |
+ 0.855 |
+ 0.855 |
+ 0.852 |
+ 0.853 |
+ 0.848 |
+ 0.851 |
+ 0.853 |
+ 0.842 |
+ 7260 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q2N0S5_9HIV1_Haddox_2018 |
+ 0.869 |
+ 0.849 |
+ 0.830 |
+ 0.840 |
+ 0.856 |
+ 0.858 |
+ 0.597 |
+ 0.822 |
+ 0.870 |
+ 0.870 |
+ 0.847 |
+ 0.867 |
+ 0.872 |
+ 0.623 |
+ 0.618 |
+ 0.608 |
+ 0.635 |
+ 0.652 |
+ 0.705 |
+ 0.843 |
+ 0.869 |
+ 0.829 |
+ 0.818 |
+ 0.803 |
+ 0.870 |
+ 0.828 |
+ 0.822 |
+ 0.831 |
+ 0.824 |
+ 0.872 |
+ 0.833 |
+ 0.830 |
+ 0.737 |
+ 0.865 |
+ 0.837 |
+ 0.840 |
+ 0.869 |
+ 0.858 |
+ 0.863 |
+ 0.865 |
+ 0.857 |
+ 0.858 |
+ 0.783 |
+ 0.580 |
+ 0.868 |
+ 0.858 |
+ 0.723 |
+ 0.866 |
+ 0.641 |
+ 0.712 |
+ 0.676 |
+ 0.690 |
+ 0.716 |
+ 0.697 |
+ 0.693 |
+ 0.698 |
+ 0.695 |
+ 0.692 |
+ 0.680 |
+ 0.698 |
+ 0.682 |
+ 0.649 |
+ 12729 |
+ OrganismalFitness |
+ Q2N0S5_9HIV1 |
+ Medium |
+ Virus |
+
+
+ Q53Z42_HUMAN_McShan_2019_binding-TAPBPR |
+ 0.791 |
+ 0.806 |
+ 0.807 |
+ 0.807 |
+ 0.807 |
+ 0.806 |
+ 0.707 |
+ 0.764 |
+ 0.808 |
+ 0.809 |
+ 0.803 |
+ 0.806 |
+ 0.810 |
+ 0.705 |
+ 0.717 |
+ 0.784 |
+ 0.792 |
+ 0.808 |
+ 0.803 |
+ 0.804 |
+ 0.784 |
+ 0.768 |
+ 0.770 |
+ 0.768 |
+ 0.811 |
+ 0.798 |
+ 0.796 |
+ 0.798 |
+ 0.779 |
+ 0.775 |
+ 0.783 |
+ 0.768 |
+ 0.737 |
+ 0.773 |
+ 0.766 |
+ 0.763 |
+ 0.789 |
+ 0.781 |
+ 0.775 |
+ 0.804 |
+ 0.802 |
+ 0.796 |
+ 0.787 |
+ 0.693 |
+ 0.806 |
+ 0.794 |
+ 0.786 |
+ 0.806 |
+ 0.770 |
+ 0.777 |
+ 0.784 |
+ 0.791 |
+ 0.807 |
+ 0.793 |
+ 0.804 |
+ 0.800 |
+ 0.804 |
+ 0.805 |
+ 0.789 |
+ 0.799 |
+ 0.818 |
+ 0.814 |
+ 3344 |
+ Binding |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q53Z42_HUMAN_McShan_2019_expression |
+ 0.774 |
+ 0.778 |
+ 0.785 |
+ 0.784 |
+ 0.789 |
+ 0.786 |
+ 0.483 |
+ 0.735 |
+ 0.783 |
+ 0.792 |
+ 0.775 |
+ 0.759 |
+ 0.776 |
+ 0.532 |
+ 0.559 |
+ 0.724 |
+ 0.762 |
+ 0.803 |
+ 0.802 |
+ 0.798 |
+ 0.736 |
+ 0.749 |
+ 0.745 |
+ 0.741 |
+ 0.783 |
+ 0.789 |
+ 0.784 |
+ 0.786 |
+ 0.791 |
+ 0.760 |
+ 0.769 |
+ 0.754 |
+ 0.619 |
+ 0.763 |
+ 0.742 |
+ 0.740 |
+ 0.796 |
+ 0.783 |
+ 0.777 |
+ 0.791 |
+ 0.793 |
+ 0.791 |
+ 0.673 |
+ 0.517 |
+ 0.783 |
+ 0.745 |
+ 0.721 |
+ 0.786 |
+ 0.719 |
+ 0.644 |
+ 0.749 |
+ 0.767 |
+ 0.790 |
+ 0.777 |
+ 0.785 |
+ 0.794 |
+ 0.795 |
+ 0.796 |
+ 0.775 |
+ 0.782 |
+ 0.781 |
+ 0.827 |
+ 3344 |
+ Expression |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q59976_STRSQ_Romero_2015 |
+ 0.845 |
+ 0.877 |
+ 0.873 |
+ 0.872 |
+ 0.880 |
+ 0.875 |
+ 0.757 |
+ 0.813 |
+ 0.880 |
+ 0.880 |
+ 0.857 |
+ 0.814 |
+ 0.829 |
+ 0.728 |
+ 0.798 |
+ 0.826 |
+ 0.842 |
+ 0.850 |
+ 0.854 |
+ 0.879 |
+ 0.862 |
+ 0.883 |
+ 0.886 |
+ 0.887 |
+ 0.868 |
+ 0.888 |
+ 0.892 |
+ 0.885 |
+ 0.883 |
+ 0.879 |
+ 0.868 |
+ 0.851 |
+ 0.768 |
+ 0.860 |
+ 0.885 |
+ 0.881 |
+ 0.861 |
+ 0.877 |
+ 0.879 |
+ 0.876 |
+ 0.877 |
+ 0.879 |
+ 0.813 |
+ 0.714 |
+ 0.861 |
+ 0.833 |
+ 0.829 |
+ 0.862 |
+ 0.860 |
+ 0.772 |
+ 0.850 |
+ 0.844 |
+ 0.857 |
+ 0.848 |
+ 0.852 |
+ 0.841 |
+ 0.854 |
+ 0.847 |
+ 0.845 |
+ 0.855 |
+ 0.861 |
+ 0.834 |
+ 2999 |
+ Activity |
+ Q59976_STRSQ |
+ Medium |
+ Prokaryote |
+
+
+ Q6WV13_9MAXI_Somermeyer_2022 |
+ 0.658 |
+ 0.724 |
+ 0.619 |
+ 0.623 |
+ 0.660 |
+ 0.658 |
+ 0.429 |
+ 0.526 |
+ 0.731 |
+ 0.728 |
+ 0.622 |
+ 0.487 |
+ 0.473 |
+ 0.465 |
+ 0.484 |
+ 0.465 |
+ 0.458 |
+ 0.471 |
+ 0.441 |
+ 0.670 |
+ 0.443 |
+ 0.496 |
+ 0.464 |
+ 0.506 |
+ 0.445 |
+ 0.441 |
+ 0.464 |
+ 0.448 |
+ 0.448 |
+ 0.728 |
+ 0.698 |
+ 0.701 |
+ 0.482 |
+ 0.484 |
+ 0.462 |
+ 0.499 |
+ 0.630 |
+ 0.624 |
+ 0.631 |
+ 0.658 |
+ 0.657 |
+ 0.659 |
+ 0.545 |
+ 0.515 |
+ 0.534 |
+ 0.533 |
+ 0.660 |
+ 0.605 |
+ 0.643 |
+ 0.595 |
+ 0.585 |
+ 0.594 |
+ 0.604 |
+ 0.659 |
+ 0.645 |
+ 0.648 |
+ 0.641 |
+ 0.638 |
+ 0.638 |
+ 0.635 |
+ 0.524 |
+ 0.513 |
+ 31401 |
+ Activity |
+ Q6WV12_9MAXI |
+ Low |
+ Eukaryote |
+
+
+ Q837P4_ENTFA_Meier_2023 |
+ 0.898 |
+ 0.921 |
+ 0.918 |
+ 0.930 |
+ 0.928 |
+ 0.942 |
+ 0.838 |
+ 0.891 |
+ 0.923 |
+ 0.927 |
+ 0.938 |
+ 0.920 |
+ 0.940 |
+ 0.873 |
+ 0.885 |
+ 0.901 |
+ 0.926 |
+ 0.920 |
+ 0.926 |
+ 0.807 |
+ 0.930 |
+ 0.934 |
+ 0.931 |
+ 0.936 |
+ 0.922 |
+ 0.932 |
+ 0.933 |
+ 0.939 |
+ 0.937 |
+ 0.927 |
+ 0.923 |
+ 0.906 |
+ 0.859 |
+ 0.928 |
+ 0.937 |
+ 0.937 |
+ 0.926 |
+ 0.938 |
+ 0.931 |
+ 0.940 |
+ 0.940 |
+ 0.939 |
+ 0.896 |
+ 0.882 |
+ 0.914 |
+ 0.897 |
+ 0.862 |
+ 0.925 |
+ 0.886 |
+ 0.847 |
+ 0.934 |
+ 0.931 |
+ 0.928 |
+ 0.936 |
+ 0.933 |
+ 0.930 |
+ 0.935 |
+ 0.927 |
+ 0.925 |
+ 0.940 |
+ 0.948 |
+ 0.888 |
+ 697 |
+ Activity |
+ Q837P4_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q837P5_ENTFA_Meier_2023 |
+ 0.813 |
+ 0.919 |
+ 0.915 |
+ 0.912 |
+ 0.918 |
+ 0.927 |
+ 0.867 |
+ 0.883 |
+ 0.792 |
+ 0.813 |
+ 0.861 |
+ 0.867 |
+ 0.862 |
+ 0.767 |
+ 0.842 |
+ 0.839 |
+ 0.844 |
+ 0.882 |
+ 0.873 |
+ 0.850 |
+ 0.858 |
+ 0.892 |
+ 0.924 |
+ 0.918 |
+ 0.836 |
+ 0.897 |
+ 0.911 |
+ 0.917 |
+ 0.935 |
+ 0.897 |
+ 0.888 |
+ 0.908 |
+ 0.803 |
+ 0.856 |
+ 0.895 |
+ 0.916 |
+ 0.861 |
+ 0.886 |
+ 0.911 |
+ 0.917 |
+ 0.919 |
+ 0.918 |
+ 0.852 |
+ 0.777 |
+ 0.877 |
+ 0.848 |
+ 0.840 |
+ 0.894 |
+ 0.903 |
+ 0.796 |
+ 0.864 |
+ 0.859 |
+ 0.850 |
+ 0.842 |
+ 0.854 |
+ 0.831 |
+ 0.860 |
+ 0.862 |
+ 0.842 |
+ 0.857 |
+ 0.836 |
+ 0.846 |
+ 747 |
+ Activity |
+ Q837P5_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q8WTC7_9CNID_Somermeyer_2022 |
+ 0.592 |
+ 0.684 |
+ 0.598 |
+ 0.598 |
+ 0.646 |
+ 0.645 |
+ 0.481 |
+ 0.666 |
+ 0.665 |
+ 0.670 |
+ 0.594 |
+ 0.487 |
+ 0.484 |
+ 0.478 |
+ 0.470 |
+ 0.479 |
+ 0.484 |
+ 0.492 |
+ 0.508 |
+ 0.612 |
+ 0.500 |
+ 0.499 |
+ 0.511 |
+ 0.499 |
+ 0.479 |
+ 0.478 |
+ 0.495 |
+ 0.633 |
+ 0.640 |
+ 0.685 |
+ 0.656 |
+ 0.661 |
+ 0.477 |
+ 0.464 |
+ 0.478 |
+ 0.675 |
+ 0.584 |
+ 0.592 |
+ 0.673 |
+ 0.641 |
+ 0.649 |
+ 0.685 |
+ 0.535 |
+ 0.538 |
+ 0.537 |
+ 0.535 |
+ 0.604 |
+ 0.606 |
+ 0.679 |
+ 0.605 |
+ 0.594 |
+ 0.579 |
+ 0.598 |
+ 0.599 |
+ 0.591 |
+ 0.609 |
+ 0.598 |
+ 0.598 |
+ 0.595 |
+ 0.597 |
+ 0.539 |
+ 0.450 |
+ 33510 |
+ Activity |
+ Q8WTC7_9CNID |
+ Low |
+ Eukaryote |
+
+
+ R1AB_SARS2_Flynn_2022 |
+ 0.928 |
+ 0.931 |
+ 0.916 |
+ 0.914 |
+ 0.943 |
+ 0.946 |
+ 0.596 |
+ 0.889 |
+ 0.621 |
+ 0.621 |
+ 0.669 |
+ 0.626 |
+ 0.609 |
+ 0.625 |
+ 0.647 |
+ 0.648 |
+ 0.637 |
+ 0.889 |
+ 0.937 |
+ 0.890 |
+ 0.838 |
+ 0.839 |
+ 0.859 |
+ 0.854 |
+ 0.845 |
+ 0.808 |
+ 0.828 |
+ 0.828 |
+ 0.818 |
+ 0.935 |
+ 0.897 |
+ 0.876 |
+ 0.602 |
+ 0.769 |
+ 0.828 |
+ 0.792 |
+ 0.912 |
+ 0.922 |
+ 0.913 |
+ 0.936 |
+ 0.945 |
+ 0.939 |
+ 0.623 |
+ 0.583 |
+ 0.680 |
+ 0.623 |
+ 0.824 |
+ 0.843 |
+ 0.871 |
+ 0.792 |
+ 0.736 |
+ 0.750 |
+ 0.771 |
+ 0.778 |
+ 0.787 |
+ 0.758 |
+ 0.763 |
+ 0.749 |
+ 0.750 |
+ 0.778 |
+ 0.721 |
+ 0.718 |
+ 5725 |
+ OrganismalFitness |
+ R1AB_SARS2 |
+ Medium |
+ Virus |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ |
+ 0.797 |
+ 0.754 |
+ 0.752 |
+ 0.754 |
+ 0.760 |
+ 0.769 |
+ 0.739 |
+ 0.762 |
+ 0.795 |
+ 0.807 |
+ 0.797 |
+ 0.716 |
+ 0.761 |
+ 0.777 |
+ 0.825 |
+ 0.835 |
+ 0.806 |
+ 0.822 |
+ 0.750 |
+ 0.806 |
+ 0.856 |
+ 0.800 |
+ 0.771 |
+ 0.786 |
+ 0.808 |
+ 0.779 |
+ 0.758 |
+ 0.773 |
+ 0.756 |
+ 0.809 |
+ 0.748 |
+ 0.711 |
+ 0.649 |
+ 0.772 |
+ 0.800 |
+ 0.772 |
+ 0.819 |
+ 0.813 |
+ 0.794 |
+ 0.784 |
+ 0.788 |
+ 0.795 |
+ 0.802 |
+ 0.717 |
+ 0.731 |
+ 0.786 |
+ 0.805 |
+ 0.697 |
+ 0.831 |
+ 0.821 |
+ 0.789 |
+ 0.820 |
+ 0.804 |
+ 0.808 |
+ 0.821 |
+ 0.802 |
+ 0.800 |
+ 0.807 |
+ 0.812 |
+ 0.810 |
+ 0.770 |
+ 0.833 |
+ 912 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RAF1_HUMAN_Zinkus-Boltz_2019 |
+ 0.749 |
+ 0.785 |
+ 0.775 |
+ 0.783 |
+ 0.788 |
+ 0.789 |
+ 0.704 |
+ 0.820 |
+ 0.782 |
+ 0.789 |
+ 0.769 |
+ 0.805 |
+ 0.806 |
+ 0.688 |
+ 0.713 |
+ 0.795 |
+ 0.763 |
+ 0.786 |
+ 0.801 |
+ 0.788 |
+ 0.806 |
+ 0.792 |
+ 0.793 |
+ 0.796 |
+ 0.794 |
+ 0.815 |
+ 0.814 |
+ 0.801 |
+ 0.799 |
+ 0.802 |
+ 0.773 |
+ 0.792 |
+ 0.738 |
+ 0.805 |
+ 0.804 |
+ 0.791 |
+ 0.791 |
+ 0.788 |
+ 0.787 |
+ 0.797 |
+ 0.784 |
+ 0.788 |
+ 0.726 |
+ 0.712 |
+ 0.747 |
+ 0.804 |
+ 0.715 |
+ 0.749 |
+ 0.704 |
+ 0.735 |
+ 0.764 |
+ 0.763 |
+ 0.773 |
+ 0.751 |
+ 0.745 |
+ 0.759 |
+ 0.763 |
+ 0.742 |
+ 0.730 |
+ 0.753 |
+ 0.760 |
+ 0.658 |
+ 297 |
+ OrganismalFitness |
+ RAF1_HUMAN |
+ Low |
+ Human |
+
+
+ RASH_HUMAN_Bandaru_2017 |
+ 0.730 |
+ 0.717 |
+ 0.729 |
+ 0.723 |
+ 0.718 |
+ 0.720 |
+ 0.534 |
+ 0.654 |
+ 0.709 |
+ 0.716 |
+ 0.714 |
+ 0.707 |
+ 0.704 |
+ 0.723 |
+ 0.746 |
+ 0.741 |
+ 0.726 |
+ 0.723 |
+ 0.710 |
+ 0.740 |
+ 0.731 |
+ 0.720 |
+ 0.710 |
+ 0.707 |
+ 0.723 |
+ 0.704 |
+ 0.701 |
+ 0.701 |
+ 0.686 |
+ 0.711 |
+ 0.707 |
+ 0.705 |
+ 0.625 |
+ 0.730 |
+ 0.722 |
+ 0.705 |
+ 0.743 |
+ 0.733 |
+ 0.726 |
+ 0.722 |
+ 0.721 |
+ 0.722 |
+ 0.733 |
+ 0.685 |
+ 0.712 |
+ 0.731 |
+ 0.674 |
+ 0.652 |
+ 0.723 |
+ 0.658 |
+ 0.713 |
+ 0.721 |
+ 0.712 |
+ 0.717 |
+ 0.712 |
+ 0.718 |
+ 0.716 |
+ 0.714 |
+ 0.708 |
+ 0.716 |
+ 0.715 |
+ 0.729 |
+ 3134 |
+ Activity |
+ RASH_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_abundance |
+ 0.743 |
+ 0.724 |
+ 0.738 |
+ 0.736 |
+ 0.735 |
+ 0.740 |
+ 0.696 |
+ 0.696 |
+ 0.731 |
+ 0.745 |
+ 0.724 |
+ 0.718 |
+ 0.722 |
+ 0.763 |
+ 0.752 |
+ 0.756 |
+ 0.736 |
+ 0.731 |
+ 0.726 |
+ 0.733 |
+ 0.691 |
+ 0.737 |
+ 0.736 |
+ 0.745 |
+ 0.721 |
+ 0.748 |
+ 0.751 |
+ 0.741 |
+ 0.750 |
+ 0.712 |
+ 0.725 |
+ 0.711 |
+ 0.666 |
+ 0.684 |
+ 0.732 |
+ 0.744 |
+ 0.696 |
+ 0.737 |
+ 0.757 |
+ 0.735 |
+ 0.743 |
+ 0.746 |
+ 0.731 |
+ 0.741 |
+ 0.687 |
+ 0.718 |
+ 0.727 |
+ 0.720 |
+ 0.763 |
+ 0.726 |
+ 0.738 |
+ 0.733 |
+ 0.735 |
+ 0.736 |
+ 0.740 |
+ 0.733 |
+ 0.741 |
+ 0.735 |
+ 0.736 |
+ 0.739 |
+ 0.737 |
+ 0.746 |
+ 26012 |
+ Expression |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_binding-DARPin_K55 |
+ 0.802 |
+ 0.909 |
+ 0.917 |
+ 0.918 |
+ 0.916 |
+ 0.915 |
+ 0.619 |
+ 0.809 |
+ 0.923 |
+ 0.931 |
+ 0.897 |
+ 0.903 |
+ 0.908 |
+ 0.893 |
+ 0.918 |
+ 0.919 |
+ 0.925 |
+ 0.920 |
+ 0.883 |
+ 0.927 |
+ 0.918 |
+ 0.911 |
+ 0.907 |
+ 0.893 |
+ 0.926 |
+ 0.915 |
+ 0.898 |
+ 0.898 |
+ 0.873 |
+ 0.885 |
+ 0.903 |
+ 0.898 |
+ 0.805 |
+ 0.917 |
+ 0.915 |
+ 0.890 |
+ 0.914 |
+ 0.909 |
+ 0.888 |
+ 0.919 |
+ 0.918 |
+ 0.916 |
+ 0.905 |
+ 0.772 |
+ 0.891 |
+ 0.908 |
+ 0.782 |
+ 0.808 |
+ 0.916 |
+ 0.782 |
+ 0.900 |
+ 0.888 |
+ 0.888 |
+ 0.893 |
+ 0.896 |
+ 0.893 |
+ 0.894 |
+ 0.897 |
+ 0.896 |
+ 0.897 |
+ 0.895 |
+ 0.901 |
+ 24873 |
+ Binding |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RBP1_HUMAN_Tsuboyama_2023_2KWH |
+ 0.824 |
+ 0.791 |
+ 0.819 |
+ 0.828 |
+ 0.822 |
+ 0.828 |
+ 0.829 |
+ 0.752 |
+ 0.787 |
+ 0.813 |
+ 0.860 |
+ 0.838 |
+ 0.848 |
+ 0.841 |
+ 0.843 |
+ 0.858 |
+ 0.870 |
+ 0.809 |
+ 0.819 |
+ 0.826 |
+ 0.838 |
+ 0.779 |
+ 0.786 |
+ 0.783 |
+ 0.839 |
+ 0.735 |
+ 0.753 |
+ 0.789 |
+ 0.798 |
+ 0.807 |
+ 0.819 |
+ 0.795 |
+ 0.716 |
+ 0.825 |
+ 0.829 |
+ 0.833 |
+ 0.830 |
+ 0.826 |
+ 0.828 |
+ 0.826 |
+ 0.827 |
+ 0.824 |
+ 0.854 |
+ 0.845 |
+ 0.853 |
+ 0.848 |
+ 0.852 |
+ 0.862 |
+ 0.863 |
+ 0.835 |
+ 0.869 |
+ 0.858 |
+ 0.859 |
+ 0.857 |
+ 0.866 |
+ 0.864 |
+ 0.860 |
+ 0.856 |
+ 0.861 |
+ 0.864 |
+ 0.849 |
+ 0.842 |
+ 1332 |
+ Stability |
+ RBP1_HUMAN |
+ High |
+ Human |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO |
+ 0.787 |
+ 0.765 |
+ 0.784 |
+ 0.784 |
+ 0.782 |
+ 0.782 |
+ 0.704 |
+ 0.714 |
+ 0.759 |
+ 0.767 |
+ 0.815 |
+ 0.775 |
+ 0.793 |
+ 0.760 |
+ 0.799 |
+ 0.809 |
+ 0.822 |
+ 0.782 |
+ 0.815 |
+ 0.779 |
+ 0.728 |
+ 0.617 |
+ 0.727 |
+ 0.759 |
+ 0.715 |
+ 0.776 |
+ 0.771 |
+ 0.746 |
+ 0.794 |
+ 0.757 |
+ 0.765 |
+ 0.725 |
+ 0.482 |
+ 0.745 |
+ 0.750 |
+ 0.769 |
+ 0.799 |
+ 0.805 |
+ 0.794 |
+ 0.797 |
+ 0.791 |
+ 0.784 |
+ 0.775 |
+ 0.779 |
+ 0.805 |
+ 0.777 |
+ 0.814 |
+ 0.836 |
+ 0.821 |
+ 0.814 |
+ 0.814 |
+ 0.812 |
+ 0.808 |
+ 0.806 |
+ 0.806 |
+ 0.807 |
+ 0.813 |
+ 0.805 |
+ 0.803 |
+ 0.813 |
+ 0.806 |
+ 0.842 |
+ 1261 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RCRO_LAMBD_Tsuboyama_2023_1ORC |
+ 0.777 |
+ 0.868 |
+ 0.899 |
+ 0.903 |
+ 0.900 |
+ 0.909 |
+ 0.667 |
+ 0.676 |
+ 0.900 |
+ 0.910 |
+ 0.874 |
+ 0.814 |
+ 0.878 |
+ 0.750 |
+ 0.812 |
+ 0.846 |
+ 0.874 |
+ 0.857 |
+ 0.866 |
+ 0.866 |
+ 0.709 |
+ 0.752 |
+ 0.750 |
+ 0.735 |
+ 0.614 |
+ 0.781 |
+ 0.591 |
+ 0.718 |
+ 0.877 |
+ 0.871 |
+ 0.902 |
+ 0.889 |
+ 0.635 |
+ 0.665 |
+ 0.611 |
+ 0.840 |
+ 0.789 |
+ 0.784 |
+ 0.871 |
+ 0.905 |
+ 0.901 |
+ 0.901 |
+ 0.786 |
+ 0.783 |
+ 0.856 |
+ 0.776 |
+ 0.954 |
+ 0.933 |
+ 0.941 |
+ 0.929 |
+ 0.913 |
+ 0.913 |
+ 0.925 |
+ 0.930 |
+ 0.924 |
+ 0.920 |
+ 0.920 |
+ 0.921 |
+ 0.922 |
+ 0.924 |
+ 0.928 |
+ 0.724 |
+ 2278 |
+ Stability |
+ RCRO_LAMBD |
+ High |
+ Virus |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY |
+ 0.897 |
+ 0.871 |
+ 0.897 |
+ 0.894 |
+ 0.898 |
+ 0.895 |
+ 0.783 |
+ 0.855 |
+ 0.839 |
+ 0.888 |
+ 0.861 |
+ 0.855 |
+ 0.872 |
+ 0.792 |
+ 0.895 |
+ 0.857 |
+ 0.862 |
+ 0.872 |
+ 0.855 |
+ 0.883 |
+ 0.845 |
+ 0.879 |
+ 0.859 |
+ 0.874 |
+ 0.883 |
+ 0.885 |
+ 0.881 |
+ 0.886 |
+ 0.873 |
+ 0.873 |
+ 0.849 |
+ 0.828 |
+ 0.865 |
+ 0.773 |
+ 0.857 |
+ 0.851 |
+ 0.884 |
+ 0.896 |
+ 0.886 |
+ 0.904 |
+ 0.905 |
+ 0.897 |
+ 0.834 |
+ 0.709 |
+ 0.878 |
+ 0.854 |
+ 0.886 |
+ 0.900 |
+ 0.908 |
+ 0.911 |
+ 0.874 |
+ 0.876 |
+ 0.879 |
+ 0.879 |
+ 0.885 |
+ 0.880 |
+ 0.879 |
+ 0.881 |
+ 0.873 |
+ 0.881 |
+ 0.892 |
+ 0.888 |
+ 1019 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RDRP_I33A0_Li_2023 |
+ 0.616 |
+ 0.687 |
+ 0.779 |
+ 0.782 |
+ 0.798 |
+ 0.797 |
+ 0.494 |
+ 0.759 |
+ 0.754 |
+ 0.755 |
+ 0.535 |
+ 0.516 |
+ 0.521 |
+ 0.513 |
+ 0.513 |
+ 0.534 |
+ 0.630 |
+ 0.726 |
+ 0.763 |
+ 0.810 |
+ 0.744 |
+ 0.763 |
+ 0.780 |
+ 0.787 |
+ 0.561 |
+ 0.757 |
+ 0.746 |
+ 0.740 |
+ 0.781 |
+ 0.820 |
+ 0.764 |
+ 0.747 |
+ 0.539 |
+ 0.760 |
+ 0.775 |
+ 0.793 |
+ 0.768 |
+ 0.781 |
+ 0.787 |
+ 0.808 |
+ 0.811 |
+ 0.811 |
+ 0.518 |
+ 0.518 |
+ 0.562 |
+ 0.526 |
+ 0.564 |
+ 0.596 |
+ 0.518 |
+ 0.550 |
+ 0.646 |
+ 0.638 |
+ 0.647 |
+ 0.638 |
+ 0.641 |
+ 0.639 |
+ 0.641 |
+ 0.644 |
+ 0.638 |
+ 0.643 |
+ 0.539 |
+ 0.523 |
+ 12003 |
+ OrganismalFitness |
+ RDRP_I33A0 |
+ Low |
+ Virus |
+
+
+ REV_HV1H2_Fernandes_2016 |
+ 0.703 |
+ 0.694 |
+ 0.710 |
+ 0.706 |
+ 0.709 |
+ 0.705 |
+ 0.590 |
+ 0.703 |
+ 0.700 |
+ 0.702 |
+ 0.602 |
+ 0.695 |
+ 0.695 |
+ 0.610 |
+ 0.606 |
+ 0.627 |
+ 0.704 |
+ 0.713 |
+ 0.703 |
+ 0.688 |
+ 0.687 |
+ 0.695 |
+ 0.686 |
+ 0.710 |
+ 0.690 |
+ 0.715 |
+ 0.635 |
+ 0.693 |
+ 0.703 |
+ 0.705 |
+ 0.701 |
+ 0.706 |
+ 0.581 |
+ 0.699 |
+ 0.703 |
+ 0.696 |
+ 0.701 |
+ 0.705 |
+ 0.692 |
+ 0.705 |
+ 0.702 |
+ 0.704 |
+ 0.604 |
+ 0.594 |
+ 0.678 |
+ 0.607 |
+ 0.670 |
+ 0.657 |
+ 0.687 |
+ 0.693 |
+ 0.695 |
+ 0.696 |
+ 0.697 |
+ 0.703 |
+ 0.706 |
+ 0.699 |
+ 0.700 |
+ 0.694 |
+ 0.705 |
+ 0.701 |
+ 0.668 |
+ 0.682 |
+ 2147 |
+ OrganismalFitness |
+ REV_HV1H2 |
+ Medium |
+ Virus |
+
+
+ RFAH_ECOLI_Tsuboyama_2023_2LCL |
+ 0.631 |
+ 0.641 |
+ 0.633 |
+ 0.625 |
+ 0.620 |
+ 0.626 |
+ 0.588 |
+ 0.518 |
+ 0.583 |
+ 0.660 |
+ 0.719 |
+ 0.651 |
+ 0.687 |
+ 0.588 |
+ 0.620 |
+ 0.636 |
+ 0.711 |
+ 0.621 |
+ 0.576 |
+ 0.575 |
+ 0.549 |
+ 0.622 |
+ 0.580 |
+ 0.578 |
+ 0.658 |
+ 0.629 |
+ 0.639 |
+ 0.598 |
+ 0.554 |
+ 0.565 |
+ 0.598 |
+ 0.537 |
+ 0.421 |
+ 0.595 |
+ 0.580 |
+ 0.570 |
+ 0.618 |
+ 0.555 |
+ 0.618 |
+ 0.641 |
+ 0.602 |
+ 0.629 |
+ 0.633 |
+ 0.588 |
+ 0.693 |
+ 0.636 |
+ 0.681 |
+ 0.693 |
+ 0.765 |
+ 0.697 |
+ 0.693 |
+ 0.698 |
+ 0.707 |
+ 0.699 |
+ 0.682 |
+ 0.689 |
+ 0.722 |
+ 0.698 |
+ 0.721 |
+ 0.712 |
+ 0.729 |
+ 0.635 |
+ 1326 |
+ Stability |
+ RFAH_ECOLI |
+ High |
+ Prokaryote |
+
+
+ RL20_AQUAE_Tsuboyama_2023_1GYZ |
+ 0.892 |
+ 0.944 |
+ 0.939 |
+ 0.934 |
+ 0.935 |
+ 0.938 |
+ 0.801 |
+ 0.928 |
+ 0.892 |
+ 0.879 |
+ 0.941 |
+ 0.948 |
+ 0.947 |
+ 0.818 |
+ 0.865 |
+ 0.922 |
+ 0.939 |
+ 0.935 |
+ 0.931 |
+ 0.935 |
+ 0.891 |
+ 0.924 |
+ 0.913 |
+ 0.901 |
+ 0.928 |
+ 0.902 |
+ 0.917 |
+ 0.935 |
+ 0.916 |
+ 0.903 |
+ 0.936 |
+ 0.935 |
+ 0.756 |
+ 0.913 |
+ 0.936 |
+ 0.898 |
+ 0.920 |
+ 0.935 |
+ 0.913 |
+ 0.937 |
+ 0.941 |
+ 0.932 |
+ 0.896 |
+ 0.823 |
+ 0.936 |
+ 0.928 |
+ 0.956 |
+ 0.941 |
+ 0.953 |
+ 0.948 |
+ 0.944 |
+ 0.946 |
+ 0.947 |
+ 0.947 |
+ 0.942 |
+ 0.944 |
+ 0.940 |
+ 0.944 |
+ 0.943 |
+ 0.944 |
+ 0.945 |
+ 0.945 |
+ 1461 |
+ Stability |
+ RL20_AQUAE |
+ High |
+ Prokaryote |
+
+
+ RL40A_YEAST_Mavor_2016 |
+ 0.808 |
+ 0.845 |
+ 0.848 |
+ 0.863 |
+ 0.855 |
+ 0.855 |
+ 0.526 |
+ 0.829 |
+ 0.875 |
+ 0.876 |
+ 0.788 |
+ 0.830 |
+ 0.859 |
+ 0.674 |
+ 0.775 |
+ 0.809 |
+ 0.853 |
+ 0.863 |
+ 0.861 |
+ 0.855 |
+ 0.850 |
+ 0.852 |
+ 0.849 |
+ 0.858 |
+ 0.869 |
+ 0.855 |
+ 0.858 |
+ 0.856 |
+ 0.864 |
+ 0.854 |
+ 0.855 |
+ 0.861 |
+ 0.640 |
+ 0.844 |
+ 0.834 |
+ 0.840 |
+ 0.850 |
+ 0.857 |
+ 0.854 |
+ 0.874 |
+ 0.877 |
+ 0.868 |
+ 0.832 |
+ 0.627 |
+ 0.834 |
+ 0.826 |
+ 0.527 |
+ 0.703 |
+ 0.670 |
+ 0.542 |
+ 0.856 |
+ 0.870 |
+ 0.858 |
+ 0.845 |
+ 0.867 |
+ 0.854 |
+ 0.868 |
+ 0.868 |
+ 0.865 |
+ 0.864 |
+ 0.868 |
+ 0.773 |
+ 1253 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2013 |
+ 0.926 |
+ 0.939 |
+ 0.940 |
+ 0.951 |
+ 0.941 |
+ 0.944 |
+ 0.718 |
+ 0.927 |
+ 0.955 |
+ 0.955 |
+ 0.912 |
+ 0.929 |
+ 0.953 |
+ 0.821 |
+ 0.905 |
+ 0.891 |
+ 0.952 |
+ 0.944 |
+ 0.952 |
+ 0.952 |
+ 0.940 |
+ 0.946 |
+ 0.950 |
+ 0.949 |
+ 0.959 |
+ 0.947 |
+ 0.952 |
+ 0.948 |
+ 0.961 |
+ 0.954 |
+ 0.959 |
+ 0.960 |
+ 0.832 |
+ 0.933 |
+ 0.947 |
+ 0.937 |
+ 0.942 |
+ 0.955 |
+ 0.947 |
+ 0.958 |
+ 0.961 |
+ 0.954 |
+ 0.944 |
+ 0.792 |
+ 0.940 |
+ 0.934 |
+ 0.715 |
+ 0.857 |
+ 0.843 |
+ 0.728 |
+ 0.945 |
+ 0.960 |
+ 0.951 |
+ 0.947 |
+ 0.955 |
+ 0.947 |
+ 0.953 |
+ 0.953 |
+ 0.951 |
+ 0.949 |
+ 0.956 |
+ 0.912 |
+ 1195 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2014 |
+ 0.952 |
+ 0.942 |
+ 0.950 |
+ 0.954 |
+ 0.950 |
+ 0.950 |
+ 0.870 |
+ 0.947 |
+ 0.941 |
+ 0.955 |
+ 0.927 |
+ 0.941 |
+ 0.948 |
+ 0.902 |
+ 0.953 |
+ 0.894 |
+ 0.961 |
+ 0.948 |
+ 0.957 |
+ 0.953 |
+ 0.956 |
+ 0.958 |
+ 0.957 |
+ 0.956 |
+ 0.960 |
+ 0.951 |
+ 0.951 |
+ 0.955 |
+ 0.959 |
+ 0.959 |
+ 0.952 |
+ 0.955 |
+ 0.937 |
+ 0.956 |
+ 0.954 |
+ 0.941 |
+ 0.952 |
+ 0.954 |
+ 0.949 |
+ 0.952 |
+ 0.957 |
+ 0.954 |
+ 0.946 |
+ 0.899 |
+ 0.949 |
+ 0.940 |
+ 0.858 |
+ 0.932 |
+ 0.893 |
+ 0.891 |
+ 0.964 |
+ 0.964 |
+ 0.965 |
+ 0.967 |
+ 0.966 |
+ 0.966 |
+ 0.965 |
+ 0.965 |
+ 0.965 |
+ 0.966 |
+ 0.954 |
+ 0.919 |
+ 1380 |
+ Activity |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RNC_ECOLI_Weeks_2023 |
+ 0.968 |
+ 0.969 |
+ 0.938 |
+ 0.943 |
+ 0.938 |
+ 0.956 |
+ 0.865 |
+ 0.925 |
+ 0.967 |
+ 0.962 |
+ 0.969 |
+ 0.967 |
+ 0.967 |
+ 0.890 |
+ 0.964 |
+ 0.967 |
+ 0.967 |
+ 0.969 |
+ 0.972 |
+ 0.939 |
+ 0.970 |
+ 0.969 |
+ 0.966 |
+ 0.969 |
+ 0.970 |
+ 0.966 |
+ 0.970 |
+ 0.966 |
+ 0.969 |
+ 0.938 |
+ 0.969 |
+ 0.969 |
+ 0.853 |
+ 0.965 |
+ 0.966 |
+ 0.965 |
+ 0.965 |
+ 0.966 |
+ 0.967 |
+ 0.961 |
+ 0.962 |
+ 0.962 |
+ 0.966 |
+ 0.882 |
+ 0.967 |
+ 0.965 |
+ 0.913 |
+ 0.969 |
+ 0.882 |
+ 0.893 |
+ 0.968 |
+ 0.966 |
+ 0.965 |
+ 0.968 |
+ 0.966 |
+ 0.969 |
+ 0.966 |
+ 0.965 |
+ 0.966 |
+ 0.967 |
+ 0.969 |
+ 0.962 |
+ 4277 |
+ Activity |
+ RNC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69 |
+ 0.848 |
+ 0.884 |
+ 0.874 |
+ 0.879 |
+ 0.873 |
+ 0.884 |
+ 0.834 |
+ 0.835 |
+ 0.823 |
+ 0.817 |
+ 0.885 |
+ 0.876 |
+ 0.879 |
+ 0.853 |
+ 0.860 |
+ 0.868 |
+ 0.879 |
+ 0.894 |
+ 0.888 |
+ 0.891 |
+ 0.823 |
+ 0.888 |
+ 0.882 |
+ 0.886 |
+ 0.849 |
+ 0.891 |
+ 0.865 |
+ 0.887 |
+ 0.893 |
+ 0.872 |
+ 0.883 |
+ 0.839 |
+ 0.856 |
+ 0.777 |
+ 0.865 |
+ 0.863 |
+ 0.822 |
+ 0.866 |
+ 0.853 |
+ 0.886 |
+ 0.888 |
+ 0.882 |
+ 0.848 |
+ 0.796 |
+ 0.827 |
+ 0.825 |
+ 0.901 |
+ 0.801 |
+ 0.903 |
+ 0.887 |
+ 0.857 |
+ 0.863 |
+ 0.869 |
+ 0.877 |
+ 0.874 |
+ 0.867 |
+ 0.877 |
+ 0.866 |
+ 0.866 |
+ 0.872 |
+ 0.885 |
+ 0.850 |
+ 1459 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_high-expression |
+ 0.850 |
+ 0.911 |
+ 0.910 |
+ 0.913 |
+ 0.915 |
+ 0.913 |
+ 0.851 |
+ 0.897 |
+ 0.904 |
+ 0.921 |
+ 0.910 |
+ 0.922 |
+ 0.924 |
+ 0.855 |
+ 0.879 |
+ 0.910 |
+ 0.926 |
+ 0.912 |
+ 0.911 |
+ 0.928 |
+ 0.794 |
+ 0.872 |
+ 0.900 |
+ 0.898 |
+ 0.814 |
+ 0.892 |
+ 0.889 |
+ 0.820 |
+ 0.914 |
+ 0.889 |
+ 0.889 |
+ 0.885 |
+ 0.746 |
+ 0.807 |
+ 0.908 |
+ 0.905 |
+ 0.866 |
+ 0.923 |
+ 0.925 |
+ 0.915 |
+ 0.914 |
+ 0.917 |
+ 0.847 |
+ 0.901 |
+ 0.886 |
+ 0.885 |
+ 0.665 |
+ 0.905 |
+ 0.864 |
+ 0.806 |
+ 0.927 |
+ 0.927 |
+ 0.931 |
+ 0.926 |
+ 0.926 |
+ 0.928 |
+ 0.929 |
+ 0.927 |
+ 0.929 |
+ 0.933 |
+ 0.926 |
+ 0.872 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_low-expression |
+ 0.690 |
+ 0.873 |
+ 0.848 |
+ 0.855 |
+ 0.843 |
+ 0.856 |
+ 0.739 |
+ 0.874 |
+ 0.808 |
+ 0.827 |
+ 0.839 |
+ 0.758 |
+ 0.829 |
+ 0.704 |
+ 0.756 |
+ 0.754 |
+ 0.794 |
+ 0.844 |
+ 0.858 |
+ 0.817 |
+ 0.598 |
+ 0.733 |
+ 0.714 |
+ 0.796 |
+ 0.637 |
+ 0.740 |
+ 0.736 |
+ 0.680 |
+ 0.902 |
+ 0.854 |
+ 0.889 |
+ 0.883 |
+ 0.641 |
+ 0.668 |
+ 0.741 |
+ 0.860 |
+ 0.712 |
+ 0.751 |
+ 0.827 |
+ 0.840 |
+ 0.843 |
+ 0.864 |
+ 0.680 |
+ 0.726 |
+ 0.753 |
+ 0.742 |
+ 0.628 |
+ 0.719 |
+ 0.790 |
+ 0.623 |
+ 0.791 |
+ 0.797 |
+ 0.779 |
+ 0.795 |
+ 0.805 |
+ 0.810 |
+ 0.809 |
+ 0.770 |
+ 0.801 |
+ 0.791 |
+ 0.806 |
+ 0.699 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32 |
+ 0.879 |
+ 0.865 |
+ 0.867 |
+ 0.867 |
+ 0.867 |
+ 0.869 |
+ 0.799 |
+ 0.837 |
+ 0.867 |
+ 0.866 |
+ 0.860 |
+ 0.859 |
+ 0.857 |
+ 0.813 |
+ 0.831 |
+ 0.884 |
+ 0.864 |
+ 0.868 |
+ 0.861 |
+ 0.856 |
+ 0.860 |
+ 0.867 |
+ 0.849 |
+ 0.849 |
+ 0.860 |
+ 0.852 |
+ 0.884 |
+ 0.862 |
+ 0.860 |
+ 0.860 |
+ 0.858 |
+ 0.849 |
+ 0.823 |
+ 0.875 |
+ 0.862 |
+ 0.852 |
+ 0.881 |
+ 0.870 |
+ 0.865 |
+ 0.876 |
+ 0.868 |
+ 0.864 |
+ 0.831 |
+ 0.820 |
+ 0.878 |
+ 0.858 |
+ 0.900 |
+ 0.887 |
+ 0.888 |
+ 0.893 |
+ 0.866 |
+ 0.863 |
+ 0.866 |
+ 0.870 |
+ 0.866 |
+ 0.856 |
+ 0.865 |
+ 0.865 |
+ 0.872 |
+ 0.868 |
+ 0.866 |
+ 0.915 |
+ 1195 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance |
+ 0.862 |
+ 0.900 |
+ 0.905 |
+ 0.910 |
+ 0.906 |
+ 0.906 |
+ 0.823 |
+ 0.858 |
+ 0.901 |
+ 0.904 |
+ 0.896 |
+ 0.909 |
+ 0.911 |
+ 0.846 |
+ 0.851 |
+ 0.880 |
+ 0.904 |
+ 0.907 |
+ 0.902 |
+ 0.768 |
+ 0.880 |
+ 0.907 |
+ 0.901 |
+ 0.896 |
+ 0.891 |
+ 0.908 |
+ 0.907 |
+ 0.905 |
+ 0.896 |
+ 0.901 |
+ 0.909 |
+ 0.878 |
+ 0.807 |
+ 0.880 |
+ 0.905 |
+ 0.903 |
+ 0.896 |
+ 0.910 |
+ 0.909 |
+ 0.910 |
+ 0.908 |
+ 0.909 |
+ 0.853 |
+ 0.789 |
+ 0.896 |
+ 0.863 |
+ 0.875 |
+ 0.911 |
+ 0.878 |
+ 0.823 |
+ 0.903 |
+ 0.900 |
+ 0.907 |
+ 0.910 |
+ 0.907 |
+ 0.908 |
+ 0.906 |
+ 0.908 |
+ 0.907 |
+ 0.910 |
+ 0.901 |
+ 0.876 |
+ 9803 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity |
+ 0.739 |
+ 0.803 |
+ 0.811 |
+ 0.814 |
+ 0.812 |
+ 0.815 |
+ 0.682 |
+ 0.755 |
+ 0.807 |
+ 0.807 |
+ 0.800 |
+ 0.818 |
+ 0.816 |
+ 0.718 |
+ 0.710 |
+ 0.747 |
+ 0.803 |
+ 0.811 |
+ 0.804 |
+ 0.635 |
+ 0.781 |
+ 0.810 |
+ 0.802 |
+ 0.798 |
+ 0.796 |
+ 0.810 |
+ 0.811 |
+ 0.810 |
+ 0.796 |
+ 0.807 |
+ 0.815 |
+ 0.790 |
+ 0.701 |
+ 0.777 |
+ 0.809 |
+ 0.808 |
+ 0.786 |
+ 0.814 |
+ 0.813 |
+ 0.818 |
+ 0.817 |
+ 0.818 |
+ 0.711 |
+ 0.636 |
+ 0.801 |
+ 0.726 |
+ 0.773 |
+ 0.824 |
+ 0.768 |
+ 0.687 |
+ 0.808 |
+ 0.801 |
+ 0.809 |
+ 0.812 |
+ 0.806 |
+ 0.811 |
+ 0.807 |
+ 0.812 |
+ 0.812 |
+ 0.812 |
+ 0.789 |
+ 0.751 |
+ 10094 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB |
+ 0.874 |
+ 0.833 |
+ 0.842 |
+ 0.863 |
+ 0.854 |
+ 0.869 |
+ 0.766 |
+ 0.808 |
+ 0.844 |
+ 0.857 |
+ 0.816 |
+ 0.829 |
+ 0.826 |
+ 0.659 |
+ 0.847 |
+ 0.866 |
+ 0.805 |
+ 0.842 |
+ 0.770 |
+ 0.765 |
+ 0.784 |
+ 0.819 |
+ 0.817 |
+ 0.822 |
+ 0.781 |
+ 0.829 |
+ 0.816 |
+ 0.818 |
+ 0.816 |
+ 0.804 |
+ 0.813 |
+ 0.756 |
+ 0.760 |
+ 0.749 |
+ 0.798 |
+ 0.801 |
+ 0.836 |
+ 0.842 |
+ 0.849 |
+ 0.853 |
+ 0.873 |
+ 0.863 |
+ 0.824 |
+ 0.671 |
+ 0.854 |
+ 0.829 |
+ 0.827 |
+ 0.869 |
+ 0.842 |
+ 0.869 |
+ 0.834 |
+ 0.772 |
+ 0.830 |
+ 0.807 |
+ 0.837 |
+ 0.831 |
+ 0.809 |
+ 0.824 |
+ 0.824 |
+ 0.827 |
+ 0.796 |
+ 0.893 |
+ 965 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SBI_STAAM_Tsuboyama_2023_2JVG |
+ 0.761 |
+ 0.773 |
+ 0.795 |
+ 0.794 |
+ 0.788 |
+ 0.779 |
+ 0.648 |
+ 0.686 |
+ 0.800 |
+ 0.823 |
+ 0.722 |
+ 0.676 |
+ 0.679 |
+ 0.624 |
+ 0.655 |
+ 0.711 |
+ 0.826 |
+ 0.842 |
+ 0.790 |
+ 0.754 |
+ 0.647 |
+ 0.745 |
+ 0.770 |
+ 0.727 |
+ 0.671 |
+ 0.645 |
+ 0.688 |
+ 0.656 |
+ 0.754 |
+ 0.777 |
+ 0.831 |
+ 0.784 |
+ 0.622 |
+ 0.596 |
+ 0.618 |
+ 0.620 |
+ 0.728 |
+ 0.744 |
+ 0.747 |
+ 0.751 |
+ 0.777 |
+ 0.778 |
+ 0.659 |
+ 0.657 |
+ 0.644 |
+ 0.702 |
+ 0.837 |
+ 0.803 |
+ 0.843 |
+ 0.846 |
+ 0.685 |
+ 0.705 |
+ 0.709 |
+ 0.720 |
+ 0.726 |
+ 0.691 |
+ 0.746 |
+ 0.770 |
+ 0.813 |
+ 0.743 |
+ 0.851 |
+ 0.801 |
+ 1025 |
+ Stability |
+ SBI_STAAM |
+ Medium |
+ Prokaryote |
+
+
+ SC6A4_HUMAN_Young_2021 |
+ 0.868 |
+ 0.888 |
+ 0.882 |
+ 0.883 |
+ 0.887 |
+ 0.886 |
+ 0.829 |
+ 0.901 |
+ 0.885 |
+ 0.889 |
+ 0.890 |
+ 0.872 |
+ 0.881 |
+ 0.783 |
+ 0.817 |
+ 0.856 |
+ 0.868 |
+ 0.871 |
+ 0.870 |
+ 0.891 |
+ 0.888 |
+ 0.905 |
+ 0.904 |
+ 0.900 |
+ 0.887 |
+ 0.904 |
+ 0.903 |
+ 0.902 |
+ 0.902 |
+ 0.890 |
+ 0.881 |
+ 0.850 |
+ 0.854 |
+ 0.892 |
+ 0.902 |
+ 0.901 |
+ 0.884 |
+ 0.892 |
+ 0.893 |
+ 0.888 |
+ 0.890 |
+ 0.888 |
+ 0.864 |
+ 0.781 |
+ 0.891 |
+ 0.871 |
+ 0.868 |
+ 0.893 |
+ 0.868 |
+ 0.823 |
+ 0.867 |
+ 0.875 |
+ 0.875 |
+ 0.876 |
+ 0.873 |
+ 0.875 |
+ 0.871 |
+ 0.873 |
+ 0.867 |
+ 0.874 |
+ 0.889 |
+ 0.870 |
+ 11576 |
+ Activity |
+ SC6A4_HUMAN |
+ Medium |
+ Human |
+
+
+ SCIN_STAAR_Tsuboyama_2023_2QFF |
+ 0.740 |
+ 0.766 |
+ 0.776 |
+ 0.779 |
+ 0.781 |
+ 0.788 |
+ 0.763 |
+ 0.722 |
+ 0.793 |
+ 0.784 |
+ 0.783 |
+ 0.782 |
+ 0.776 |
+ 0.738 |
+ 0.774 |
+ 0.800 |
+ 0.803 |
+ 0.793 |
+ 0.817 |
+ 0.759 |
+ 0.714 |
+ 0.749 |
+ 0.755 |
+ 0.755 |
+ 0.764 |
+ 0.767 |
+ 0.784 |
+ 0.782 |
+ 0.760 |
+ 0.758 |
+ 0.768 |
+ 0.746 |
+ 0.617 |
+ 0.712 |
+ 0.686 |
+ 0.746 |
+ 0.757 |
+ 0.758 |
+ 0.766 |
+ 0.780 |
+ 0.789 |
+ 0.793 |
+ 0.787 |
+ 0.769 |
+ 0.771 |
+ 0.783 |
+ 0.793 |
+ 0.851 |
+ 0.835 |
+ 0.803 |
+ 0.801 |
+ 0.800 |
+ 0.808 |
+ 0.818 |
+ 0.813 |
+ 0.836 |
+ 0.816 |
+ 0.835 |
+ 0.810 |
+ 0.823 |
+ 0.887 |
+ 0.851 |
+ 1212 |
+ Stability |
+ SCIN_STAAR |
+ High |
+ Prokaryote |
+
+
+ SCN5A_HUMAN_Glazer_2019 |
+ 0.753 |
+ 0.728 |
+ 0.754 |
+ 0.754 |
+ 0.754 |
+ 0.754 |
+ 0.794 |
+ 0.603 |
+ 0.791 |
+ 0.798 |
+ 0.753 |
+ 0.731 |
+ 0.765 |
+ 0.770 |
+ 0.750 |
+ 0.763 |
+ 0.778 |
+ 0.770 |
+ 0.748 |
+ 0.760 |
+ 0.743 |
+ 0.724 |
+ 0.739 |
+ 0.723 |
+ 0.732 |
+ 0.748 |
+ 0.743 |
+ 0.714 |
+ 0.795 |
+ 0.739 |
+ 0.741 |
+ 0.738 |
+ 0.639 |
+ 0.675 |
+ 0.697 |
+ 0.758 |
+ 0.756 |
+ 0.784 |
+ 0.816 |
+ 0.770 |
+ 0.787 |
+ 0.772 |
+ 0.752 |
+ 0.714 |
+ 0.800 |
+ 0.693 |
+ 0.637 |
+ 0.657 |
+ 0.710 |
+ 0.606 |
+ 0.703 |
+ 0.734 |
+ 0.727 |
+ 0.741 |
+ 0.728 |
+ 0.736 |
+ 0.720 |
+ 0.705 |
+ 0.720 |
+ 0.726 |
+ 0.713 |
+ 0.737 |
+ 224 |
+ OrganismalFitness |
+ SCN5A_HUMAN |
+ Medium |
+ Human |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0 |
+ 0.931 |
+ 0.927 |
+ 0.931 |
+ 0.931 |
+ 0.936 |
+ 0.936 |
+ 0.599 |
+ 0.877 |
+ 0.925 |
+ 0.934 |
+ 0.922 |
+ 0.899 |
+ 0.906 |
+ 0.688 |
+ 0.743 |
+ 0.914 |
+ 0.922 |
+ 0.935 |
+ 0.933 |
+ 0.917 |
+ 0.591 |
+ 0.637 |
+ 0.618 |
+ 0.760 |
+ 0.670 |
+ 0.886 |
+ 0.568 |
+ 0.806 |
+ 0.920 |
+ 0.940 |
+ 0.920 |
+ 0.900 |
+ 0.522 |
+ 0.593 |
+ 0.743 |
+ 0.886 |
+ 0.932 |
+ 0.930 |
+ 0.924 |
+ 0.935 |
+ 0.935 |
+ 0.933 |
+ 0.703 |
+ 0.654 |
+ 0.913 |
+ 0.728 |
+ 0.901 |
+ 0.898 |
+ 0.940 |
+ 0.935 |
+ 0.932 |
+ 0.932 |
+ 0.934 |
+ 0.939 |
+ 0.935 |
+ 0.937 |
+ 0.938 |
+ 0.936 |
+ 0.940 |
+ 0.940 |
+ 0.944 |
+ 0.945 |
+ 2770 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SERC_HUMAN_Xie_2023 |
+ 0.882 |
+ 0.903 |
+ 0.893 |
+ 0.898 |
+ 0.897 |
+ 0.903 |
+ 0.661 |
+ 0.892 |
+ 0.896 |
+ 0.898 |
+ 0.899 |
+ 0.898 |
+ 0.900 |
+ 0.764 |
+ 0.867 |
+ 0.890 |
+ 0.897 |
+ 0.892 |
+ 0.889 |
+ 0.899 |
+ 0.903 |
+ 0.900 |
+ 0.903 |
+ 0.904 |
+ 0.903 |
+ 0.901 |
+ 0.906 |
+ 0.905 |
+ 0.899 |
+ 0.899 |
+ 0.903 |
+ 0.901 |
+ 0.805 |
+ 0.898 |
+ 0.906 |
+ 0.904 |
+ 0.894 |
+ 0.904 |
+ 0.903 |
+ 0.903 |
+ 0.903 |
+ 0.902 |
+ 0.857 |
+ 0.737 |
+ 0.896 |
+ 0.879 |
+ 0.871 |
+ 0.903 |
+ 0.787 |
+ 0.850 |
+ 0.897 |
+ 0.901 |
+ 0.896 |
+ 0.901 |
+ 0.905 |
+ 0.897 |
+ 0.899 |
+ 0.897 |
+ 0.894 |
+ 0.902 |
+ 0.901 |
+ 0.880 |
+ 1914 |
+ OrganismalFitness |
+ SERC_HUMAN |
+ High |
+ Human |
+
+
+ SHOC2_HUMAN_Kwon_2022 |
+ 0.820 |
+ 0.841 |
+ 0.836 |
+ 0.835 |
+ 0.839 |
+ 0.838 |
+ 0.802 |
+ 0.839 |
+ 0.850 |
+ 0.844 |
+ 0.830 |
+ 0.827 |
+ 0.830 |
+ 0.805 |
+ 0.806 |
+ 0.807 |
+ 0.826 |
+ 0.838 |
+ 0.834 |
+ 0.831 |
+ 0.810 |
+ 0.815 |
+ 0.845 |
+ 0.838 |
+ 0.807 |
+ 0.820 |
+ 0.821 |
+ 0.823 |
+ 0.844 |
+ 0.844 |
+ 0.845 |
+ 0.838 |
+ 0.806 |
+ 0.811 |
+ 0.822 |
+ 0.842 |
+ 0.818 |
+ 0.828 |
+ 0.840 |
+ 0.838 |
+ 0.836 |
+ 0.843 |
+ 0.809 |
+ 0.806 |
+ 0.840 |
+ 0.806 |
+ 0.820 |
+ 0.840 |
+ 0.814 |
+ 0.810 |
+ 0.832 |
+ 0.832 |
+ 0.836 |
+ 0.834 |
+ 0.837 |
+ 0.838 |
+ 0.832 |
+ 0.831 |
+ 0.826 |
+ 0.833 |
+ 0.805 |
+ 0.809 |
+ 10972 |
+ OrganismalFitness |
+ SHOC2_HUMAN |
+ Medium |
+ Human |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK |
+ 0.830 |
+ 0.855 |
+ 0.835 |
+ 0.834 |
+ 0.841 |
+ 0.838 |
+ 0.821 |
+ 0.809 |
+ 0.818 |
+ 0.838 |
+ 0.845 |
+ 0.840 |
+ 0.847 |
+ 0.832 |
+ 0.840 |
+ 0.860 |
+ 0.855 |
+ 0.835 |
+ 0.839 |
+ 0.850 |
+ 0.835 |
+ 0.812 |
+ 0.829 |
+ 0.823 |
+ 0.842 |
+ 0.846 |
+ 0.839 |
+ 0.839 |
+ 0.838 |
+ 0.835 |
+ 0.816 |
+ 0.804 |
+ 0.829 |
+ 0.754 |
+ 0.823 |
+ 0.839 |
+ 0.829 |
+ 0.827 |
+ 0.842 |
+ 0.835 |
+ 0.839 |
+ 0.838 |
+ 0.846 |
+ 0.760 |
+ 0.837 |
+ 0.855 |
+ 0.845 |
+ 0.836 |
+ 0.859 |
+ 0.845 |
+ 0.849 |
+ 0.867 |
+ 0.866 |
+ 0.859 |
+ 0.855 |
+ 0.862 |
+ 0.848 |
+ 0.848 |
+ 0.846 |
+ 0.860 |
+ 0.853 |
+ 0.851 |
+ 1010 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPA_STAAU_Tsuboyama_2023_1LP1 |
+ 0.777 |
+ 0.800 |
+ 0.792 |
+ 0.794 |
+ 0.805 |
+ 0.802 |
+ 0.545 |
+ 0.768 |
+ 0.725 |
+ 0.727 |
+ 0.739 |
+ 0.543 |
+ 0.540 |
+ 0.516 |
+ 0.547 |
+ 0.513 |
+ 0.590 |
+ 0.602 |
+ 0.588 |
+ 0.793 |
+ 0.482 |
+ 0.583 |
+ 0.743 |
+ 0.637 |
+ 0.548 |
+ 0.509 |
+ 0.567 |
+ 0.746 |
+ 0.786 |
+ 0.752 |
+ 0.738 |
+ 0.729 |
+ 0.493 |
+ 0.548 |
+ 0.518 |
+ 0.602 |
+ 0.779 |
+ 0.788 |
+ 0.779 |
+ 0.806 |
+ 0.816 |
+ 0.803 |
+ 0.566 |
+ 0.588 |
+ 0.638 |
+ 0.630 |
+ 0.771 |
+ 0.762 |
+ 0.818 |
+ 0.803 |
+ 0.793 |
+ 0.785 |
+ 0.787 |
+ 0.827 |
+ 0.763 |
+ 0.794 |
+ 0.778 |
+ 0.801 |
+ 0.785 |
+ 0.801 |
+ 0.755 |
+ 0.761 |
+ 2105 |
+ Stability |
+ SPA_STAAU |
+ Medium |
+ Prokaryote |
+
+
+ SPG1_STRSG_Olson_2014 |
+ 0.652 |
+ 0.658 |
+ 0.475 |
+ 0.483 |
+ 0.658 |
+ 0.668 |
+ 0.439 |
+ 0.486 |
+ 0.455 |
+ 0.621 |
+ 0.708 |
+ 0.675 |
+ 0.663 |
+ 0.716 |
+ 0.683 |
+ 0.681 |
+ 0.696 |
+ 0.677 |
+ 0.697 |
+ 0.651 |
+ 0.711 |
+ 0.681 |
+ 0.679 |
+ 0.683 |
+ 0.691 |
+ 0.684 |
+ 0.687 |
+ 0.685 |
+ 0.730 |
+ 0.682 |
+ 0.735 |
+ 0.724 |
+ 0.489 |
+ 0.674 |
+ 0.655 |
+ 0.661 |
+ 0.658 |
+ 0.650 |
+ 0.661 |
+ 0.684 |
+ 0.672 |
+ 0.688 |
+ 0.433 |
+ 0.418 |
+ 0.555 |
+ 0.485 |
+ 0.659 |
+ 0.613 |
+ 0.679 |
+ 0.567 |
+ 0.726 |
+ 0.685 |
+ 0.704 |
+ 0.718 |
+ 0.715 |
+ 0.695 |
+ 0.724 |
+ 0.726 |
+ 0.715 |
+ 0.721 |
+ 0.704 |
+ 0.715 |
+ 536962 |
+ Binding |
+ SPG1_STRSG |
+ Low |
+ Prokaryote |
+
+
+ SPG1_STRSG_Wu_2016 |
+ 0.137 |
+ 0.172 |
+ 0.168 |
+ 0.190 |
+ 0.184 |
+ 0.186 |
+ 0.157 |
+ 0.151 |
+ 0.252 |
+ 0.239 |
+ 0.249 |
+ 0.261 |
+ 0.238 |
+ 0.232 |
+ 0.246 |
+ 0.260 |
+ 0.282 |
+ 0.297 |
+ 0.315 |
+ 0.236 |
+ 0.197 |
+ 0.186 |
+ 0.196 |
+ 0.199 |
+ 0.170 |
+ 0.214 |
+ 0.213 |
+ 0.198 |
+ 0.198 |
+ 0.152 |
+ 0.303 |
+ 0.285 |
+ 0.111 |
+ 0.203 |
+ 0.168 |
+ 0.218 |
+ 0.169 |
+ 0.159 |
+ 0.188 |
+ 0.196 |
+ 0.191 |
+ 0.204 |
+ 0.183 |
+ 0.137 |
+ 0.231 |
+ 0.141 |
+ 0.238 |
+ 0.243 |
+ 0.301 |
+ 0.203 |
+ 0.301 |
+ 0.302 |
+ 0.306 |
+ 0.316 |
+ 0.299 |
+ 0.300 |
+ 0.330 |
+ 0.309 |
+ 0.317 |
+ 0.309 |
+ 0.324 |
+ 0.276 |
+ 149360 |
+ Binding |
+ SPG1_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS |
+ 0.784 |
+ 0.792 |
+ 0.779 |
+ 0.771 |
+ 0.775 |
+ 0.774 |
+ 0.708 |
+ 0.778 |
+ 0.810 |
+ 0.824 |
+ 0.784 |
+ 0.788 |
+ 0.776 |
+ 0.720 |
+ 0.765 |
+ 0.753 |
+ 0.794 |
+ 0.796 |
+ 0.766 |
+ 0.804 |
+ 0.724 |
+ 0.778 |
+ 0.693 |
+ 0.702 |
+ 0.668 |
+ 0.731 |
+ 0.710 |
+ 0.778 |
+ 0.769 |
+ 0.826 |
+ 0.803 |
+ 0.787 |
+ 0.470 |
+ 0.652 |
+ 0.762 |
+ 0.799 |
+ 0.734 |
+ 0.761 |
+ 0.743 |
+ 0.804 |
+ 0.807 |
+ 0.810 |
+ 0.755 |
+ 0.656 |
+ 0.778 |
+ 0.727 |
+ 0.828 |
+ 0.787 |
+ 0.833 |
+ 0.817 |
+ 0.789 |
+ 0.800 |
+ 0.815 |
+ 0.818 |
+ 0.812 |
+ 0.812 |
+ 0.810 |
+ 0.804 |
+ 0.791 |
+ 0.813 |
+ 0.835 |
+ 0.755 |
+ 1451 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPIKE_SARS2_Starr_2020_binding |
+ 0.831 |
+ 0.909 |
+ 0.816 |
+ 0.865 |
+ 0.898 |
+ 0.896 |
+ 0.739 |
+ 0.938 |
+ 0.949 |
+ 0.948 |
+ 0.747 |
+ 0.741 |
+ 0.739 |
+ 0.748 |
+ 0.774 |
+ 0.758 |
+ 0.772 |
+ 0.775 |
+ 0.793 |
+ 0.925 |
+ 0.928 |
+ 0.929 |
+ 0.926 |
+ 0.931 |
+ 0.931 |
+ 0.925 |
+ 0.913 |
+ 0.937 |
+ 0.917 |
+ 0.939 |
+ 0.935 |
+ 0.932 |
+ 0.864 |
+ 0.929 |
+ 0.928 |
+ 0.915 |
+ 0.918 |
+ 0.913 |
+ 0.921 |
+ 0.927 |
+ 0.921 |
+ 0.919 |
+ 0.752 |
+ 0.751 |
+ 0.748 |
+ 0.776 |
+ 0.919 |
+ 0.863 |
+ 0.745 |
+ 0.861 |
+ 0.845 |
+ 0.841 |
+ 0.828 |
+ 0.849 |
+ 0.849 |
+ 0.867 |
+ 0.847 |
+ 0.823 |
+ 0.789 |
+ 0.855 |
+ 0.838 |
+ 0.793 |
+ 3802 |
+ Binding |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPIKE_SARS2_Starr_2020_expression |
+ 0.759 |
+ 0.882 |
+ 0.797 |
+ 0.855 |
+ 0.882 |
+ 0.880 |
+ 0.643 |
+ 0.847 |
+ 0.848 |
+ 0.887 |
+ 0.633 |
+ 0.620 |
+ 0.636 |
+ 0.634 |
+ 0.662 |
+ 0.643 |
+ 0.635 |
+ 0.683 |
+ 0.688 |
+ 0.864 |
+ 0.824 |
+ 0.845 |
+ 0.836 |
+ 0.843 |
+ 0.844 |
+ 0.846 |
+ 0.821 |
+ 0.844 |
+ 0.829 |
+ 0.889 |
+ 0.849 |
+ 0.849 |
+ 0.747 |
+ 0.841 |
+ 0.842 |
+ 0.841 |
+ 0.833 |
+ 0.834 |
+ 0.847 |
+ 0.890 |
+ 0.891 |
+ 0.890 |
+ 0.622 |
+ 0.622 |
+ 0.637 |
+ 0.652 |
+ 0.899 |
+ 0.856 |
+ 0.749 |
+ 0.756 |
+ 0.811 |
+ 0.819 |
+ 0.824 |
+ 0.822 |
+ 0.832 |
+ 0.835 |
+ 0.820 |
+ 0.805 |
+ 0.812 |
+ 0.834 |
+ 0.828 |
+ 0.716 |
+ 3798 |
+ Expression |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD |
+ 0.832 |
+ 0.869 |
+ 0.807 |
+ 0.774 |
+ 0.806 |
+ 0.797 |
+ 0.613 |
+ 0.706 |
+ 0.790 |
+ 0.799 |
+ 0.865 |
+ 0.815 |
+ 0.838 |
+ 0.358 |
+ 0.811 |
+ 0.819 |
+ 0.804 |
+ 0.887 |
+ 0.809 |
+ 0.836 |
+ 0.766 |
+ 0.813 |
+ 0.760 |
+ 0.788 |
+ 0.826 |
+ 0.841 |
+ 0.810 |
+ 0.834 |
+ 0.833 |
+ 0.827 |
+ 0.877 |
+ 0.880 |
+ 0.581 |
+ 0.780 |
+ 0.755 |
+ 0.776 |
+ 0.834 |
+ 0.846 |
+ 0.830 |
+ 0.803 |
+ 0.803 |
+ 0.801 |
+ 0.751 |
+ 0.421 |
+ 0.785 |
+ 0.737 |
+ 0.765 |
+ 0.775 |
+ 0.902 |
+ 0.893 |
+ 0.892 |
+ 0.887 |
+ 0.898 |
+ 0.886 |
+ 0.887 |
+ 0.889 |
+ 0.886 |
+ 0.894 |
+ 0.880 |
+ 0.895 |
+ 0.883 |
+ 0.767 |
+ 3201 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU |
+ 0.799 |
+ 0.851 |
+ 0.859 |
+ 0.873 |
+ 0.878 |
+ 0.872 |
+ 0.493 |
+ 0.810 |
+ 0.866 |
+ 0.864 |
+ 0.840 |
+ 0.764 |
+ 0.832 |
+ 0.567 |
+ 0.701 |
+ 0.795 |
+ 0.865 |
+ 0.846 |
+ 0.837 |
+ 0.840 |
+ 0.615 |
+ 0.837 |
+ 0.755 |
+ 0.855 |
+ 0.823 |
+ 0.834 |
+ 0.840 |
+ 0.817 |
+ 0.831 |
+ 0.823 |
+ 0.823 |
+ 0.804 |
+ 0.762 |
+ 0.616 |
+ 0.841 |
+ 0.847 |
+ 0.801 |
+ 0.873 |
+ 0.865 |
+ 0.885 |
+ 0.879 |
+ 0.867 |
+ 0.603 |
+ 0.547 |
+ 0.824 |
+ 0.853 |
+ 0.879 |
+ 0.835 |
+ 0.824 |
+ 0.830 |
+ 0.851 |
+ 0.861 |
+ 0.863 |
+ 0.849 |
+ 0.846 |
+ 0.863 |
+ 0.854 |
+ 0.841 |
+ 0.840 |
+ 0.859 |
+ 0.854 |
+ 0.772 |
+ 707 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88 |
+ 0.862 |
+ 0.858 |
+ 0.853 |
+ 0.851 |
+ 0.856 |
+ 0.855 |
+ 0.416 |
+ 0.760 |
+ 0.843 |
+ 0.843 |
+ 0.889 |
+ 0.851 |
+ 0.883 |
+ 0.362 |
+ 0.844 |
+ 0.903 |
+ 0.893 |
+ 0.904 |
+ 0.913 |
+ 0.836 |
+ 0.482 |
+ 0.703 |
+ 0.757 |
+ 0.722 |
+ 0.734 |
+ 0.783 |
+ 0.799 |
+ 0.819 |
+ 0.866 |
+ 0.817 |
+ 0.883 |
+ 0.835 |
+ 0.627 |
+ 0.676 |
+ 0.629 |
+ 0.748 |
+ 0.887 |
+ 0.875 |
+ 0.875 |
+ 0.875 |
+ 0.882 |
+ 0.870 |
+ 0.798 |
+ 0.459 |
+ 0.884 |
+ 0.854 |
+ 0.753 |
+ 0.893 |
+ 0.916 |
+ 0.907 |
+ 0.872 |
+ 0.883 |
+ 0.877 |
+ 0.896 |
+ 0.874 |
+ 0.886 |
+ 0.881 |
+ 0.889 |
+ 0.883 |
+ 0.882 |
+ 0.914 |
+ 0.891 |
+ 1583 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W |
+ 0.869 |
+ 0.885 |
+ 0.906 |
+ 0.903 |
+ 0.886 |
+ 0.883 |
+ 0.804 |
+ 0.782 |
+ 0.885 |
+ 0.891 |
+ 0.920 |
+ 0.912 |
+ 0.923 |
+ 0.811 |
+ 0.900 |
+ 0.911 |
+ 0.909 |
+ 0.898 |
+ 0.924 |
+ 0.893 |
+ 0.905 |
+ 0.895 |
+ 0.913 |
+ 0.890 |
+ 0.907 |
+ 0.909 |
+ 0.905 |
+ 0.897 |
+ 0.900 |
+ 0.913 |
+ 0.904 |
+ 0.881 |
+ 0.822 |
+ 0.833 |
+ 0.878 |
+ 0.867 |
+ 0.888 |
+ 0.902 |
+ 0.906 |
+ 0.886 |
+ 0.898 |
+ 0.892 |
+ 0.889 |
+ 0.723 |
+ 0.922 |
+ 0.903 |
+ 0.891 |
+ 0.891 |
+ 0.920 |
+ 0.916 |
+ 0.911 |
+ 0.919 |
+ 0.916 |
+ 0.919 |
+ 0.912 |
+ 0.921 |
+ 0.910 |
+ 0.915 |
+ 0.914 |
+ 0.918 |
+ 0.905 |
+ 0.911 |
+ 1556 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ SRC_HUMAN_Ahler_2019 |
+ 0.561 |
+ 0.570 |
+ 0.554 |
+ 0.563 |
+ 0.553 |
+ 0.557 |
+ 0.557 |
+ 0.523 |
+ 0.575 |
+ 0.570 |
+ 0.571 |
+ 0.568 |
+ 0.572 |
+ 0.538 |
+ 0.545 |
+ 0.516 |
+ 0.538 |
+ 0.549 |
+ 0.542 |
+ 0.555 |
+ 0.554 |
+ 0.545 |
+ 0.552 |
+ 0.541 |
+ 0.542 |
+ 0.552 |
+ 0.556 |
+ 0.544 |
+ 0.543 |
+ 0.571 |
+ 0.550 |
+ 0.561 |
+ 0.548 |
+ 0.547 |
+ 0.539 |
+ 0.532 |
+ 0.546 |
+ 0.553 |
+ 0.552 |
+ 0.556 |
+ 0.561 |
+ 0.556 |
+ 0.573 |
+ 0.517 |
+ 0.558 |
+ 0.570 |
+ 0.499 |
+ 0.532 |
+ 0.499 |
+ 0.436 |
+ 0.540 |
+ 0.542 |
+ 0.529 |
+ 0.564 |
+ 0.550 |
+ 0.544 |
+ 0.532 |
+ 0.549 |
+ 0.538 |
+ 0.538 |
+ 0.556 |
+ 0.542 |
+ 3372 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM |
+ 0.626 |
+ 0.633 |
+ 0.630 |
+ 0.637 |
+ 0.633 |
+ 0.627 |
+ 0.626 |
+ 0.616 |
+ 0.620 |
+ 0.627 |
+ 0.643 |
+ 0.632 |
+ 0.639 |
+ 0.614 |
+ 0.618 |
+ 0.602 |
+ 0.605 |
+ 0.609 |
+ 0.614 |
+ 0.623 |
+ 0.617 |
+ 0.610 |
+ 0.617 |
+ 0.608 |
+ 0.614 |
+ 0.621 |
+ 0.620 |
+ 0.609 |
+ 0.616 |
+ 0.636 |
+ 0.617 |
+ 0.625 |
+ 0.641 |
+ 0.615 |
+ 0.605 |
+ 0.604 |
+ 0.617 |
+ 0.615 |
+ 0.615 |
+ 0.626 |
+ 0.625 |
+ 0.625 |
+ 0.630 |
+ 0.597 |
+ 0.621 |
+ 0.629 |
+ 0.592 |
+ 0.616 |
+ 0.595 |
+ 0.577 |
+ 0.610 |
+ 0.608 |
+ 0.602 |
+ 0.626 |
+ 0.617 |
+ 0.607 |
+ 0.607 |
+ 0.618 |
+ 0.609 |
+ 0.611 |
+ 0.624 |
+ 0.614 |
+ 3637 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Nguyen_2022 |
+ 0.669 |
+ 0.685 |
+ 0.686 |
+ 0.683 |
+ 0.674 |
+ 0.674 |
+ 0.667 |
+ 0.670 |
+ 0.668 |
+ 0.682 |
+ 0.690 |
+ 0.682 |
+ 0.685 |
+ 0.661 |
+ 0.662 |
+ 0.647 |
+ 0.664 |
+ 0.675 |
+ 0.676 |
+ 0.681 |
+ 0.690 |
+ 0.681 |
+ 0.683 |
+ 0.677 |
+ 0.675 |
+ 0.684 |
+ 0.685 |
+ 0.675 |
+ 0.677 |
+ 0.694 |
+ 0.677 |
+ 0.680 |
+ 0.670 |
+ 0.678 |
+ 0.671 |
+ 0.671 |
+ 0.670 |
+ 0.672 |
+ 0.672 |
+ 0.680 |
+ 0.675 |
+ 0.675 |
+ 0.678 |
+ 0.650 |
+ 0.686 |
+ 0.680 |
+ 0.636 |
+ 0.667 |
+ 0.638 |
+ 0.629 |
+ 0.661 |
+ 0.663 |
+ 0.661 |
+ 0.673 |
+ 0.664 |
+ 0.664 |
+ 0.660 |
+ 0.665 |
+ 0.658 |
+ 0.660 |
+ 0.681 |
+ 0.661 |
+ 3366 |
+ OrganismalFitness |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SUMO1_HUMAN_Weile_2017 |
+ 0.707 |
+ 0.760 |
+ 0.779 |
+ 0.766 |
+ 0.769 |
+ 0.771 |
+ 0.637 |
+ 0.763 |
+ 0.758 |
+ 0.749 |
+ 0.773 |
+ 0.775 |
+ 0.788 |
+ 0.658 |
+ 0.723 |
+ 0.750 |
+ 0.785 |
+ 0.730 |
+ 0.763 |
+ 0.774 |
+ 0.670 |
+ 0.759 |
+ 0.770 |
+ 0.778 |
+ 0.768 |
+ 0.775 |
+ 0.763 |
+ 0.790 |
+ 0.765 |
+ 0.717 |
+ 0.777 |
+ 0.779 |
+ 0.621 |
+ 0.672 |
+ 0.759 |
+ 0.771 |
+ 0.739 |
+ 0.754 |
+ 0.772 |
+ 0.764 |
+ 0.764 |
+ 0.772 |
+ 0.737 |
+ 0.639 |
+ 0.772 |
+ 0.749 |
+ 0.735 |
+ 0.774 |
+ 0.729 |
+ 0.723 |
+ 0.768 |
+ 0.763 |
+ 0.778 |
+ 0.770 |
+ 0.767 |
+ 0.769 |
+ 0.780 |
+ 0.780 |
+ 0.770 |
+ 0.777 |
+ 0.776 |
+ 0.732 |
+ 1700 |
+ OrganismalFitness |
+ SUMO1_HUMAN |
+ High |
+ Human |
+
+
+ SYUA_HUMAN_Newberry_2020 |
+ 0.826 |
+ 0.836 |
+ 0.818 |
+ 0.814 |
+ 0.827 |
+ 0.829 |
+ 0.809 |
+ 0.878 |
+ 0.838 |
+ 0.845 |
+ 0.836 |
+ 0.868 |
+ 0.861 |
+ 0.809 |
+ 0.811 |
+ 0.824 |
+ 0.812 |
+ 0.849 |
+ 0.860 |
+ 0.860 |
+ 0.807 |
+ 0.867 |
+ 0.872 |
+ 0.870 |
+ 0.811 |
+ 0.862 |
+ 0.868 |
+ 0.860 |
+ 0.867 |
+ 0.874 |
+ 0.877 |
+ 0.877 |
+ 0.696 |
+ 0.824 |
+ 0.865 |
+ 0.868 |
+ 0.836 |
+ 0.855 |
+ 0.856 |
+ 0.835 |
+ 0.851 |
+ 0.850 |
+ 0.812 |
+ 0.800 |
+ 0.857 |
+ 0.808 |
+ 0.761 |
+ 0.834 |
+ 0.809 |
+ 0.770 |
+ 0.813 |
+ 0.795 |
+ 0.793 |
+ 0.805 |
+ 0.806 |
+ 0.807 |
+ 0.816 |
+ 0.824 |
+ 0.832 |
+ 0.804 |
+ 0.834 |
+ 0.802 |
+ 2497 |
+ OrganismalFitness |
+ SYUA_HUMAN |
+ Medium |
+ Human |
+
+
+ TADBP_HUMAN_Bolognesi_2019 |
+ 0.594 |
+ 0.591 |
+ 0.589 |
+ 0.587 |
+ 0.590 |
+ 0.587 |
+ 0.716 |
+ 0.592 |
+ 0.577 |
+ 0.580 |
+ 0.595 |
+ 0.607 |
+ 0.593 |
+ 0.643 |
+ 0.660 |
+ 0.637 |
+ 0.557 |
+ 0.579 |
+ 0.612 |
+ 0.590 |
+ 0.677 |
+ 0.576 |
+ 0.595 |
+ 0.596 |
+ 0.693 |
+ 0.608 |
+ 0.584 |
+ 0.592 |
+ 0.593 |
+ 0.601 |
+ 0.599 |
+ 0.622 |
+ 0.576 |
+ 0.698 |
+ 0.732 |
+ 0.592 |
+ 0.609 |
+ 0.616 |
+ 0.600 |
+ 0.594 |
+ 0.594 |
+ 0.590 |
+ 0.652 |
+ 0.697 |
+ 0.607 |
+ 0.653 |
+ 0.696 |
+ 0.598 |
+ 0.699 |
+ 0.610 |
+ 0.660 |
+ 0.621 |
+ 0.613 |
+ 0.643 |
+ 0.621 |
+ 0.634 |
+ 0.608 |
+ 0.612 |
+ 0.590 |
+ 0.620 |
+ 0.620 |
+ 0.657 |
+ 1196 |
+ OrganismalFitness |
+ TADBP_HUMAN |
+ Low |
+ Human |
+
+
+ TAT_HV1BR_Fernandes_2016 |
+ 0.686 |
+ 0.699 |
+ 0.694 |
+ 0.695 |
+ 0.697 |
+ 0.698 |
+ 0.457 |
+ 0.707 |
+ 0.701 |
+ 0.698 |
+ 0.609 |
+ 0.707 |
+ 0.704 |
+ 0.599 |
+ 0.615 |
+ 0.583 |
+ 0.598 |
+ 0.578 |
+ 0.578 |
+ 0.698 |
+ 0.701 |
+ 0.714 |
+ 0.704 |
+ 0.707 |
+ 0.695 |
+ 0.729 |
+ 0.609 |
+ 0.698 |
+ 0.718 |
+ 0.695 |
+ 0.707 |
+ 0.718 |
+ 0.618 |
+ 0.702 |
+ 0.681 |
+ 0.712 |
+ 0.706 |
+ 0.689 |
+ 0.694 |
+ 0.702 |
+ 0.701 |
+ 0.702 |
+ 0.539 |
+ 0.495 |
+ 0.686 |
+ 0.557 |
+ 0.601 |
+ 0.697 |
+ 0.644 |
+ 0.585 |
+ 0.540 |
+ 0.615 |
+ 0.594 |
+ 0.623 |
+ 0.593 |
+ 0.617 |
+ 0.565 |
+ 0.582 |
+ 0.581 |
+ 0.580 |
+ 0.530 |
+ 0.537 |
+ 1577 |
+ OrganismalFitness |
+ TAT_HV1BR |
+ High |
+ Virus |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L |
+ 0.896 |
+ 0.882 |
+ 0.888 |
+ 0.891 |
+ 0.881 |
+ 0.882 |
+ 0.825 |
+ 0.799 |
+ 0.887 |
+ 0.871 |
+ 0.850 |
+ 0.877 |
+ 0.879 |
+ 0.885 |
+ 0.863 |
+ 0.889 |
+ 0.904 |
+ 0.870 |
+ 0.868 |
+ 0.824 |
+ 0.848 |
+ 0.845 |
+ 0.826 |
+ 0.835 |
+ 0.857 |
+ 0.848 |
+ 0.844 |
+ 0.850 |
+ 0.841 |
+ 0.848 |
+ 0.828 |
+ 0.798 |
+ 0.861 |
+ 0.846 |
+ 0.863 |
+ 0.889 |
+ 0.880 |
+ 0.881 |
+ 0.894 |
+ 0.879 |
+ 0.883 |
+ 0.879 |
+ 0.880 |
+ 0.691 |
+ 0.852 |
+ 0.876 |
+ 0.849 |
+ 0.846 |
+ 0.877 |
+ 0.879 |
+ 0.869 |
+ 0.863 |
+ 0.864 |
+ 0.882 |
+ 0.864 |
+ 0.880 |
+ 0.875 |
+ 0.869 |
+ 0.874 |
+ 0.871 |
+ 0.899 |
+ 0.880 |
+ 1058 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG |
+ 0.771 |
+ 0.767 |
+ 0.790 |
+ 0.808 |
+ 0.808 |
+ 0.823 |
+ 0.583 |
+ 0.590 |
+ 0.799 |
+ 0.818 |
+ 0.766 |
+ 0.816 |
+ 0.800 |
+ 0.488 |
+ 0.791 |
+ 0.794 |
+ 0.788 |
+ 0.816 |
+ 0.799 |
+ 0.802 |
+ 0.538 |
+ 0.661 |
+ 0.580 |
+ 0.519 |
+ 0.642 |
+ 0.653 |
+ 0.515 |
+ 0.669 |
+ 0.778 |
+ 0.836 |
+ 0.812 |
+ 0.776 |
+ 0.747 |
+ 0.530 |
+ 0.722 |
+ 0.605 |
+ 0.777 |
+ 0.824 |
+ 0.777 |
+ 0.790 |
+ 0.822 |
+ 0.808 |
+ 0.710 |
+ 0.624 |
+ 0.810 |
+ 0.745 |
+ 0.830 |
+ 0.835 |
+ 0.814 |
+ 0.874 |
+ 0.850 |
+ 0.805 |
+ 0.842 |
+ 0.840 |
+ 0.854 |
+ 0.827 |
+ 0.834 |
+ 0.834 |
+ 0.827 |
+ 0.841 |
+ 0.794 |
+ 0.767 |
+ 1279 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT |
+ 0.848 |
+ 0.885 |
+ 0.878 |
+ 0.880 |
+ 0.871 |
+ 0.877 |
+ 0.577 |
+ 0.790 |
+ 0.878 |
+ 0.891 |
+ 0.851 |
+ 0.839 |
+ 0.852 |
+ 0.715 |
+ 0.827 |
+ 0.848 |
+ 0.871 |
+ 0.885 |
+ 0.886 |
+ 0.868 |
+ 0.811 |
+ 0.887 |
+ 0.888 |
+ 0.887 |
+ 0.829 |
+ 0.880 |
+ 0.868 |
+ 0.889 |
+ 0.876 |
+ 0.864 |
+ 0.898 |
+ 0.883 |
+ 0.729 |
+ 0.620 |
+ 0.821 |
+ 0.829 |
+ 0.844 |
+ 0.863 |
+ 0.867 |
+ 0.879 |
+ 0.880 |
+ 0.878 |
+ 0.832 |
+ 0.724 |
+ 0.870 |
+ 0.829 |
+ 0.820 |
+ 0.891 |
+ 0.913 |
+ 0.916 |
+ 0.863 |
+ 0.889 |
+ 0.885 |
+ 0.864 |
+ 0.879 |
+ 0.879 |
+ 0.886 |
+ 0.878 |
+ 0.868 |
+ 0.887 |
+ 0.905 |
+ 0.819 |
+ 1479 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ TPK1_HUMAN_Weile_2017 |
+ 0.619 |
+ 0.642 |
+ 0.624 |
+ 0.621 |
+ 0.620 |
+ 0.626 |
+ 0.554 |
+ 0.644 |
+ 0.626 |
+ 0.621 |
+ 0.630 |
+ 0.645 |
+ 0.651 |
+ 0.551 |
+ 0.607 |
+ 0.625 |
+ 0.654 |
+ 0.652 |
+ 0.665 |
+ 0.631 |
+ 0.558 |
+ 0.599 |
+ 0.605 |
+ 0.619 |
+ 0.558 |
+ 0.649 |
+ 0.665 |
+ 0.619 |
+ 0.667 |
+ 0.672 |
+ 0.667 |
+ 0.643 |
+ 0.557 |
+ 0.564 |
+ 0.623 |
+ 0.672 |
+ 0.625 |
+ 0.631 |
+ 0.676 |
+ 0.627 |
+ 0.638 |
+ 0.648 |
+ 0.569 |
+ 0.550 |
+ 0.637 |
+ 0.581 |
+ 0.632 |
+ 0.644 |
+ 0.649 |
+ 0.598 |
+ 0.648 |
+ 0.656 |
+ 0.648 |
+ 0.659 |
+ 0.649 |
+ 0.657 |
+ 0.665 |
+ 0.651 |
+ 0.645 |
+ 0.649 |
+ 0.628 |
+ 0.600 |
+ 3181 |
+ OrganismalFitness |
+ TPK1_HUMAN |
+ Medium |
+ Human |
+
+
+ TPMT_HUMAN_Matreyek_2018 |
+ 0.805 |
+ 0.817 |
+ 0.818 |
+ 0.816 |
+ 0.819 |
+ 0.816 |
+ 0.735 |
+ 0.815 |
+ 0.817 |
+ 0.819 |
+ 0.820 |
+ 0.818 |
+ 0.826 |
+ 0.764 |
+ 0.773 |
+ 0.804 |
+ 0.813 |
+ 0.814 |
+ 0.807 |
+ 0.820 |
+ 0.771 |
+ 0.761 |
+ 0.803 |
+ 0.828 |
+ 0.801 |
+ 0.798 |
+ 0.809 |
+ 0.799 |
+ 0.810 |
+ 0.826 |
+ 0.829 |
+ 0.819 |
+ 0.771 |
+ 0.787 |
+ 0.800 |
+ 0.807 |
+ 0.810 |
+ 0.811 |
+ 0.812 |
+ 0.820 |
+ 0.819 |
+ 0.817 |
+ 0.783 |
+ 0.733 |
+ 0.822 |
+ 0.803 |
+ 0.800 |
+ 0.823 |
+ 0.824 |
+ 0.789 |
+ 0.803 |
+ 0.804 |
+ 0.809 |
+ 0.802 |
+ 0.809 |
+ 0.811 |
+ 0.812 |
+ 0.815 |
+ 0.812 |
+ 0.810 |
+ 0.817 |
+ 0.800 |
+ 3648 |
+ Expression |
+ TPMT_HUMAN |
+ Medium |
+ Human |
+
+
+ TPOR_HUMAN_Bridgford_2020 |
+ 0.668 |
+ 0.597 |
+ 0.695 |
+ 0.635 |
+ 0.627 |
+ 0.625 |
+ 0.690 |
+ 0.641 |
+ 0.663 |
+ 0.656 |
+ 0.678 |
+ 0.643 |
+ 0.671 |
+ 0.589 |
+ 0.664 |
+ 0.575 |
+ 0.571 |
+ 0.610 |
+ 0.649 |
+ 0.589 |
+ 0.619 |
+ 0.589 |
+ 0.659 |
+ 0.652 |
+ 0.638 |
+ 0.546 |
+ 0.742 |
+ 0.700 |
+ 0.676 |
+ 0.618 |
+ 0.611 |
+ 0.618 |
+ 0.708 |
+ 0.640 |
+ 0.633 |
+ 0.617 |
+ 0.670 |
+ 0.682 |
+ 0.630 |
+ 0.651 |
+ 0.667 |
+ 0.607 |
+ 0.580 |
+ 0.627 |
+ 0.624 |
+ 0.643 |
+ 0.486 |
+ 0.635 |
+ 0.653 |
+ 0.560 |
+ 0.703 |
+ 0.606 |
+ 0.625 |
+ 0.629 |
+ 0.621 |
+ 0.634 |
+ 0.613 |
+ 0.668 |
+ 0.664 |
+ 0.624 |
+ 0.637 |
+ 0.691 |
+ 562 |
+ OrganismalFitness |
+ TPOR_HUMAN |
+ Low |
+ Human |
+
+
+ TRPC_SACS2_Chan_2017 |
+ 0.879 |
+ 0.917 |
+ 0.910 |
+ 0.915 |
+ 0.911 |
+ 0.913 |
+ 0.744 |
+ 0.899 |
+ 0.910 |
+ 0.918 |
+ 0.900 |
+ 0.888 |
+ 0.892 |
+ 0.788 |
+ 0.826 |
+ 0.877 |
+ 0.919 |
+ 0.921 |
+ 0.909 |
+ 0.922 |
+ 0.781 |
+ 0.849 |
+ 0.859 |
+ 0.867 |
+ 0.879 |
+ 0.861 |
+ 0.909 |
+ 0.883 |
+ 0.930 |
+ 0.912 |
+ 0.912 |
+ 0.887 |
+ 0.735 |
+ 0.822 |
+ 0.877 |
+ 0.846 |
+ 0.845 |
+ 0.888 |
+ 0.874 |
+ 0.887 |
+ 0.912 |
+ 0.909 |
+ 0.769 |
+ 0.750 |
+ 0.884 |
+ 0.825 |
+ 0.798 |
+ 0.896 |
+ 0.914 |
+ 0.746 |
+ 0.898 |
+ 0.903 |
+ 0.898 |
+ 0.911 |
+ 0.906 |
+ 0.900 |
+ 0.904 |
+ 0.911 |
+ 0.902 |
+ 0.912 |
+ 0.924 |
+ 0.866 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ TRPC_THEMA_Chan_2017 |
+ 0.692 |
+ 0.732 |
+ 0.652 |
+ 0.688 |
+ 0.719 |
+ 0.712 |
+ 0.607 |
+ 0.639 |
+ 0.705 |
+ 0.737 |
+ 0.726 |
+ 0.732 |
+ 0.747 |
+ 0.622 |
+ 0.719 |
+ 0.755 |
+ 0.725 |
+ 0.743 |
+ 0.751 |
+ 0.742 |
+ 0.670 |
+ 0.716 |
+ 0.733 |
+ 0.756 |
+ 0.698 |
+ 0.736 |
+ 0.689 |
+ 0.751 |
+ 0.762 |
+ 0.657 |
+ 0.734 |
+ 0.684 |
+ 0.554 |
+ 0.672 |
+ 0.747 |
+ 0.738 |
+ 0.716 |
+ 0.723 |
+ 0.723 |
+ 0.733 |
+ 0.730 |
+ 0.723 |
+ 0.689 |
+ 0.501 |
+ 0.765 |
+ 0.718 |
+ 0.651 |
+ 0.758 |
+ 0.726 |
+ 0.581 |
+ 0.718 |
+ 0.720 |
+ 0.691 |
+ 0.730 |
+ 0.725 |
+ 0.707 |
+ 0.705 |
+ 0.701 |
+ 0.690 |
+ 0.713 |
+ 0.754 |
+ 0.753 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_THEMA |
+ Medium |
+ Prokaryote |
+
+
+ UBC9_HUMAN_Weile_2017 |
+ 0.534 |
+ 0.615 |
+ 0.610 |
+ 0.610 |
+ 0.607 |
+ 0.618 |
+ 0.369 |
+ 0.531 |
+ 0.575 |
+ 0.591 |
+ 0.582 |
+ 0.589 |
+ 0.587 |
+ 0.405 |
+ 0.403 |
+ 0.470 |
+ 0.522 |
+ 0.580 |
+ 0.596 |
+ 0.606 |
+ 0.490 |
+ 0.540 |
+ 0.538 |
+ 0.535 |
+ 0.547 |
+ 0.579 |
+ 0.553 |
+ 0.568 |
+ 0.574 |
+ 0.555 |
+ 0.556 |
+ 0.536 |
+ 0.353 |
+ 0.490 |
+ 0.558 |
+ 0.576 |
+ 0.522 |
+ 0.597 |
+ 0.583 |
+ 0.597 |
+ 0.620 |
+ 0.626 |
+ 0.463 |
+ 0.398 |
+ 0.577 |
+ 0.579 |
+ 0.489 |
+ 0.553 |
+ 0.512 |
+ 0.486 |
+ 0.504 |
+ 0.505 |
+ 0.523 |
+ 0.532 |
+ 0.536 |
+ 0.534 |
+ 0.531 |
+ 0.521 |
+ 0.519 |
+ 0.519 |
+ 0.579 |
+ 0.484 |
+ 2563 |
+ OrganismalFitness |
+ UBC9_HUMAN |
+ Medium |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X |
+ 0.660 |
+ 0.786 |
+ 0.785 |
+ 0.782 |
+ 0.795 |
+ 0.798 |
+ 0.528 |
+ 0.728 |
+ 0.773 |
+ 0.802 |
+ 0.838 |
+ 0.799 |
+ 0.804 |
+ 0.609 |
+ 0.784 |
+ 0.848 |
+ 0.840 |
+ 0.829 |
+ 0.821 |
+ 0.798 |
+ 0.542 |
+ 0.740 |
+ 0.754 |
+ 0.777 |
+ 0.723 |
+ 0.756 |
+ 0.752 |
+ 0.763 |
+ 0.784 |
+ 0.765 |
+ 0.831 |
+ 0.792 |
+ 0.734 |
+ 0.516 |
+ 0.541 |
+ 0.792 |
+ 0.684 |
+ 0.697 |
+ 0.797 |
+ 0.797 |
+ 0.792 |
+ 0.818 |
+ 0.657 |
+ 0.520 |
+ 0.683 |
+ 0.715 |
+ 0.768 |
+ 0.749 |
+ 0.836 |
+ 0.731 |
+ 0.848 |
+ 0.856 |
+ 0.855 |
+ 0.854 |
+ 0.856 |
+ 0.846 |
+ 0.842 |
+ 0.855 |
+ 0.849 |
+ 0.854 |
+ 0.833 |
+ 0.744 |
+ 3622 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_MOUSE_Starita_2013 |
+ 0.672 |
+ 0.659 |
+ 0.658 |
+ 0.658 |
+ 0.648 |
+ 0.656 |
+ 0.536 |
+ 0.626 |
+ 0.647 |
+ 0.644 |
+ 0.656 |
+ 0.687 |
+ 0.666 |
+ 0.575 |
+ 0.656 |
+ 0.660 |
+ 0.641 |
+ 0.635 |
+ 0.617 |
+ 0.630 |
+ 0.547 |
+ 0.628 |
+ 0.646 |
+ 0.633 |
+ 0.660 |
+ 0.649 |
+ 0.649 |
+ 0.673 |
+ 0.648 |
+ 0.636 |
+ 0.644 |
+ 0.632 |
+ 0.549 |
+ 0.544 |
+ 0.566 |
+ 0.629 |
+ 0.674 |
+ 0.691 |
+ 0.682 |
+ 0.658 |
+ 0.663 |
+ 0.664 |
+ 0.641 |
+ 0.528 |
+ 0.626 |
+ 0.664 |
+ 0.644 |
+ 0.625 |
+ 0.491 |
+ 0.600 |
+ 0.667 |
+ 0.647 |
+ 0.658 |
+ 0.648 |
+ 0.645 |
+ 0.657 |
+ 0.639 |
+ 0.636 |
+ 0.632 |
+ 0.651 |
+ 0.653 |
+ 0.644 |
+ 899 |
+ Activity |
+ UBE4B_MOUSE |
+ Low |
+ Eukaryote |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T |
+ 0.867 |
+ 0.877 |
+ 0.845 |
+ 0.856 |
+ 0.872 |
+ 0.867 |
+ 0.775 |
+ 0.800 |
+ 0.862 |
+ 0.869 |
+ 0.859 |
+ 0.818 |
+ 0.842 |
+ 0.792 |
+ 0.795 |
+ 0.781 |
+ 0.773 |
+ 0.863 |
+ 0.859 |
+ 0.831 |
+ 0.852 |
+ 0.869 |
+ 0.884 |
+ 0.884 |
+ 0.878 |
+ 0.883 |
+ 0.889 |
+ 0.891 |
+ 0.885 |
+ 0.880 |
+ 0.873 |
+ 0.848 |
+ 0.813 |
+ 0.830 |
+ 0.863 |
+ 0.852 |
+ 0.864 |
+ 0.872 |
+ 0.878 |
+ 0.871 |
+ 0.869 |
+ 0.879 |
+ 0.804 |
+ 0.783 |
+ 0.885 |
+ 0.820 |
+ 0.881 |
+ 0.885 |
+ 0.885 |
+ 0.885 |
+ 0.859 |
+ 0.877 |
+ 0.887 |
+ 0.882 |
+ 0.886 |
+ 0.878 |
+ 0.874 |
+ 0.881 |
+ 0.874 |
+ 0.881 |
+ 0.886 |
+ 0.861 |
+ 1453 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8 |
+ 0.740 |
+ 0.718 |
+ 0.729 |
+ 0.729 |
+ 0.737 |
+ 0.735 |
+ 0.671 |
+ 0.679 |
+ 0.738 |
+ 0.767 |
+ 0.753 |
+ 0.765 |
+ 0.759 |
+ 0.720 |
+ 0.746 |
+ 0.766 |
+ 0.769 |
+ 0.772 |
+ 0.750 |
+ 0.708 |
+ 0.706 |
+ 0.732 |
+ 0.705 |
+ 0.746 |
+ 0.721 |
+ 0.719 |
+ 0.731 |
+ 0.716 |
+ 0.757 |
+ 0.743 |
+ 0.740 |
+ 0.719 |
+ 0.572 |
+ 0.716 |
+ 0.746 |
+ 0.745 |
+ 0.730 |
+ 0.766 |
+ 0.748 |
+ 0.740 |
+ 0.746 |
+ 0.731 |
+ 0.727 |
+ 0.723 |
+ 0.743 |
+ 0.696 |
+ 0.707 |
+ 0.738 |
+ 0.753 |
+ 0.752 |
+ 0.776 |
+ 0.776 |
+ 0.778 |
+ 0.783 |
+ 0.795 |
+ 0.788 |
+ 0.784 |
+ 0.775 |
+ 0.791 |
+ 0.783 |
+ 0.785 |
+ 0.790 |
+ 723 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5 |
+ 0.791 |
+ 0.920 |
+ 0.936 |
+ 0.935 |
+ 0.934 |
+ 0.930 |
+ 0.684 |
+ 0.840 |
+ 0.924 |
+ 0.926 |
+ 0.930 |
+ 0.914 |
+ 0.931 |
+ 0.782 |
+ 0.753 |
+ 0.937 |
+ 0.953 |
+ 0.932 |
+ 0.883 |
+ 0.930 |
+ 0.732 |
+ 0.821 |
+ 0.853 |
+ 0.849 |
+ 0.846 |
+ 0.877 |
+ 0.864 |
+ 0.860 |
+ 0.920 |
+ 0.933 |
+ 0.944 |
+ 0.924 |
+ 0.766 |
+ 0.653 |
+ 0.888 |
+ 0.887 |
+ 0.858 |
+ 0.906 |
+ 0.901 |
+ 0.928 |
+ 0.931 |
+ 0.931 |
+ 0.809 |
+ 0.806 |
+ 0.934 |
+ 0.904 |
+ 0.912 |
+ 0.922 |
+ 0.947 |
+ 0.895 |
+ 0.953 |
+ 0.947 |
+ 0.954 |
+ 0.952 |
+ 0.948 |
+ 0.957 |
+ 0.951 |
+ 0.954 |
+ 0.950 |
+ 0.955 |
+ 0.952 |
+ 0.917 |
+ 2568 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VKOR1_HUMAN_Chiasson_2020_abundance |
+ 0.841 |
+ 0.829 |
+ 0.839 |
+ 0.843 |
+ 0.835 |
+ 0.841 |
+ 0.717 |
+ 0.816 |
+ 0.839 |
+ 0.854 |
+ 0.822 |
+ 0.843 |
+ 0.836 |
+ 0.700 |
+ 0.801 |
+ 0.834 |
+ 0.840 |
+ 0.841 |
+ 0.825 |
+ 0.852 |
+ 0.755 |
+ 0.749 |
+ 0.771 |
+ 0.825 |
+ 0.768 |
+ 0.777 |
+ 0.806 |
+ 0.771 |
+ 0.814 |
+ 0.845 |
+ 0.847 |
+ 0.820 |
+ 0.668 |
+ 0.729 |
+ 0.768 |
+ 0.810 |
+ 0.844 |
+ 0.844 |
+ 0.847 |
+ 0.840 |
+ 0.843 |
+ 0.854 |
+ 0.718 |
+ 0.661 |
+ 0.830 |
+ 0.789 |
+ 0.805 |
+ 0.819 |
+ 0.828 |
+ 0.784 |
+ 0.832 |
+ 0.822 |
+ 0.840 |
+ 0.847 |
+ 0.854 |
+ 0.855 |
+ 0.848 |
+ 0.839 |
+ 0.840 |
+ 0.847 |
+ 0.839 |
+ 0.833 |
+ 2695 |
+ Expression |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VKOR1_HUMAN_Chiasson_2020_activity |
+ 0.817 |
+ 0.834 |
+ 0.812 |
+ 0.817 |
+ 0.797 |
+ 0.810 |
+ 0.693 |
+ 0.838 |
+ 0.847 |
+ 0.843 |
+ 0.836 |
+ 0.824 |
+ 0.815 |
+ 0.716 |
+ 0.792 |
+ 0.829 |
+ 0.816 |
+ 0.819 |
+ 0.841 |
+ 0.835 |
+ 0.778 |
+ 0.768 |
+ 0.836 |
+ 0.822 |
+ 0.736 |
+ 0.848 |
+ 0.832 |
+ 0.825 |
+ 0.834 |
+ 0.857 |
+ 0.828 |
+ 0.823 |
+ 0.819 |
+ 0.725 |
+ 0.750 |
+ 0.836 |
+ 0.788 |
+ 0.790 |
+ 0.825 |
+ 0.795 |
+ 0.801 |
+ 0.810 |
+ 0.737 |
+ 0.730 |
+ 0.829 |
+ 0.749 |
+ 0.774 |
+ 0.827 |
+ 0.832 |
+ 0.791 |
+ 0.824 |
+ 0.810 |
+ 0.817 |
+ 0.832 |
+ 0.832 |
+ 0.817 |
+ 0.828 |
+ 0.811 |
+ 0.814 |
+ 0.823 |
+ 0.811 |
+ 0.820 |
+ 697 |
+ Activity |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM |
+ 0.584 |
+ 0.739 |
+ 0.781 |
+ 0.778 |
+ 0.767 |
+ 0.783 |
+ 0.655 |
+ 0.728 |
+ 0.799 |
+ 0.765 |
+ 0.785 |
+ 0.750 |
+ 0.737 |
+ 0.697 |
+ 0.749 |
+ 0.790 |
+ 0.803 |
+ 0.822 |
+ 0.797 |
+ 0.752 |
+ 0.598 |
+ 0.694 |
+ 0.676 |
+ 0.704 |
+ 0.703 |
+ 0.647 |
+ 0.675 |
+ 0.692 |
+ 0.763 |
+ 0.753 |
+ 0.802 |
+ 0.788 |
+ 0.693 |
+ 0.671 |
+ 0.740 |
+ 0.687 |
+ 0.638 |
+ 0.649 |
+ 0.645 |
+ 0.754 |
+ 0.757 |
+ 0.739 |
+ 0.729 |
+ 0.672 |
+ 0.756 |
+ 0.734 |
+ 0.803 |
+ 0.798 |
+ 0.801 |
+ 0.851 |
+ 0.850 |
+ 0.836 |
+ 0.863 |
+ 0.856 |
+ 0.847 |
+ 0.861 |
+ 0.857 |
+ 0.861 |
+ 0.853 |
+ 0.869 |
+ 0.869 |
+ 0.808 |
+ 1047 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YAIA_ECOLI_Tsuboyama_2023_2KVT |
+ 0.667 |
+ 0.852 |
+ 0.846 |
+ 0.860 |
+ 0.856 |
+ 0.853 |
+ 0.453 |
+ 0.803 |
+ 0.832 |
+ 0.859 |
+ 0.789 |
+ 0.626 |
+ 0.795 |
+ 0.577 |
+ 0.644 |
+ 0.850 |
+ 0.873 |
+ 0.919 |
+ 0.904 |
+ 0.875 |
+ 0.503 |
+ 0.530 |
+ 0.570 |
+ 0.735 |
+ 0.562 |
+ 0.573 |
+ 0.521 |
+ 0.553 |
+ 0.841 |
+ 0.875 |
+ 0.877 |
+ 0.867 |
+ 0.521 |
+ 0.452 |
+ 0.564 |
+ 0.831 |
+ 0.702 |
+ 0.727 |
+ 0.816 |
+ 0.839 |
+ 0.842 |
+ 0.855 |
+ 0.685 |
+ 0.564 |
+ 0.717 |
+ 0.694 |
+ 0.876 |
+ 0.837 |
+ 0.912 |
+ 0.862 |
+ 0.889 |
+ 0.903 |
+ 0.880 |
+ 0.891 |
+ 0.901 |
+ 0.894 |
+ 0.890 |
+ 0.894 |
+ 0.897 |
+ 0.897 |
+ 0.877 |
+ 0.749 |
+ 1890 |
+ Stability |
+ YAIA_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ YAP1_HUMAN_Araya_2012 |
+ 0.464 |
+ 0.386 |
+ 0.471 |
+ 0.468 |
+ 0.466 |
+ 0.463 |
+ 0.374 |
+ 0.312 |
+ 0.295 |
+ 0.288 |
+ 0.409 |
+ 0.362 |
+ 0.357 |
+ 0.409 |
+ 0.419 |
+ 0.441 |
+ 0.453 |
+ 0.412 |
+ 0.390 |
+ 0.327 |
+ 0.308 |
+ 0.301 |
+ 0.293 |
+ 0.322 |
+ 0.332 |
+ 0.300 |
+ 0.311 |
+ 0.322 |
+ 0.296 |
+ 0.370 |
+ 0.382 |
+ 0.397 |
+ 0.340 |
+ 0.358 |
+ 0.302 |
+ 0.323 |
+ 0.427 |
+ 0.368 |
+ 0.396 |
+ 0.450 |
+ 0.420 |
+ 0.439 |
+ 0.484 |
+ 0.181 |
+ 0.422 |
+ 0.480 |
+ 0.359 |
+ 0.418 |
+ 0.332 |
+ 0.372 |
+ 0.355 |
+ 0.348 |
+ 0.365 |
+ 0.370 |
+ 0.347 |
+ 0.358 |
+ 0.369 |
+ 0.373 |
+ 0.386 |
+ 0.361 |
+ 0.478 |
+ 0.410 |
+ 10075 |
+ Binding |
+ YAP1_HUMAN |
+ Low |
+ Human |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD |
+ 0.855 |
+ 0.869 |
+ 0.855 |
+ 0.855 |
+ 0.857 |
+ 0.856 |
+ 0.767 |
+ 0.793 |
+ 0.876 |
+ 0.877 |
+ 0.881 |
+ 0.872 |
+ 0.887 |
+ 0.865 |
+ 0.883 |
+ 0.887 |
+ 0.891 |
+ 0.887 |
+ 0.881 |
+ 0.850 |
+ 0.845 |
+ 0.893 |
+ 0.896 |
+ 0.896 |
+ 0.883 |
+ 0.882 |
+ 0.882 |
+ 0.875 |
+ 0.878 |
+ 0.863 |
+ 0.875 |
+ 0.854 |
+ 0.845 |
+ 0.842 |
+ 0.872 |
+ 0.868 |
+ 0.876 |
+ 0.863 |
+ 0.865 |
+ 0.867 |
+ 0.859 |
+ 0.865 |
+ 0.848 |
+ 0.802 |
+ 0.867 |
+ 0.845 |
+ 0.829 |
+ 0.872 |
+ 0.911 |
+ 0.894 |
+ 0.894 |
+ 0.874 |
+ 0.880 |
+ 0.877 |
+ 0.878 |
+ 0.886 |
+ 0.878 |
+ 0.879 |
+ 0.879 |
+ 0.881 |
+ 0.884 |
+ 0.923 |
+ 2300 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_Selection_Type_level.csv b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_Selection_Type_level.csv
new file mode 100644
index 0000000..1db609d
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_Selection_Type_level.csv
@@ -0,0 +1,7 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type
+0.751,0.792,0.786,0.785,0.789,0.791,0.659,0.741,0.781,0.789,0.767,0.734,0.74,0.665,0.702,0.724,0.739,0.748,0.75,0.759,0.701,0.724,0.741,0.75,0.712,0.747,0.752,0.763,0.771,0.795,0.789,0.78,0.67,0.704,0.732,0.768,0.774,0.781,0.794,0.791,0.793,0.794,0.709,0.659,0.75,0.718,0.736,0.769,0.738,0.716,0.769,0.77,0.767,0.773,0.769,0.772,0.769,0.774,0.771,0.772,0.767,0.726,Activity
+0.7,0.708,0.711,0.716,0.724,0.724,0.624,0.682,0.696,0.708,0.668,0.664,0.701,0.668,0.678,0.691,0.705,0.706,0.71,0.703,0.68,0.671,0.697,0.698,0.666,0.696,0.696,0.691,0.7,0.711,0.711,0.697,0.62,0.699,0.67,0.692,0.725,0.718,0.724,0.731,0.727,0.73,0.673,0.593,0.693,0.689,0.694,0.707,0.698,0.633,0.7,0.699,0.706,0.711,0.702,0.708,0.713,0.708,0.71,0.714,0.72,0.684,Binding
+0.76,0.792,0.787,0.787,0.796,0.798,0.671,0.778,0.786,0.806,0.767,0.764,0.776,0.697,0.721,0.743,0.764,0.775,0.78,0.777,0.748,0.771,0.778,0.784,0.755,0.786,0.784,0.791,0.786,0.794,0.79,0.765,0.654,0.747,0.76,0.775,0.785,0.783,0.795,0.799,0.799,0.801,0.713,0.655,0.771,0.74,0.78,0.79,0.762,0.717,0.771,0.77,0.772,0.769,0.772,0.769,0.776,0.784,0.767,0.776,0.797,0.777,Expression
+0.72,0.757,0.754,0.758,0.761,0.762,0.603,0.714,0.745,0.749,0.7,0.708,0.721,0.61,0.636,0.658,0.694,0.709,0.728,0.74,0.701,0.721,0.731,0.734,0.692,0.734,0.727,0.735,0.757,0.756,0.752,0.739,0.616,0.7,0.723,0.749,0.734,0.741,0.76,0.761,0.764,0.767,0.643,0.585,0.709,0.663,0.662,0.72,0.676,0.644,0.702,0.706,0.706,0.709,0.709,0.71,0.707,0.707,0.701,0.71,0.693,0.658,OrganismalFitness
+0.802,0.835,0.834,0.835,0.835,0.838,0.68,0.778,0.823,0.835,0.832,0.791,0.811,0.693,0.777,0.821,0.834,0.835,0.827,0.825,0.731,0.768,0.786,0.792,0.764,0.794,0.779,0.795,0.823,0.828,0.835,0.814,0.706,0.717,0.745,0.776,0.813,0.816,0.824,0.838,0.837,0.839,0.765,0.687,0.818,0.79,0.844,0.839,0.864,0.854,0.849,0.853,0.856,0.856,0.855,0.855,0.854,0.854,0.853,0.858,0.861,0.826,Stability
+0.747,0.777,0.774,0.776,0.781,0.783,0.647,0.739,0.766,0.777,0.747,0.732,0.75,0.667,0.703,0.728,0.747,0.755,0.759,0.761,0.712,0.731,0.747,0.752,0.718,0.751,0.748,0.755,0.768,0.777,0.775,0.759,0.653,0.713,0.726,0.752,0.766,0.768,0.779,0.784,0.784,0.786,0.701,0.636,0.748,0.72,0.743,0.765,0.748,0.713,0.758,0.76,0.761,0.764,0.761,0.762,0.763,0.765,0.76,0.766,0.768,0.734,
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_level.csv b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_level.csv
new file mode 100644
index 0000000..235f865
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/DMS_substitutions_NDCG_Uniprot_level.csv
@@ -0,0 +1,223 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type,UniProt_ID,MSA_Neff_L_category,Taxon
+0.278,0.29,0.231,0.219,0.283,0.286,0.1,0.099,0.327,0.333,0.149,0.128,0.164,0.097,0.122,0.107,0.144,0.257,0.271,0.189,0.274,0.277,0.279,0.298,0.254,0.267,0.283,0.248,0.273,0.299,0.262,0.231,0.125,0.259,0.252,0.239,0.28,0.283,0.279,0.281,0.293,0.285,0.123,0.121,0.184,0.131,0.186,0.202,0.163,0.166,0.198,0.207,0.221,0.218,0.22,0.199,0.178,0.199,0.184,0.209,0.155,0.154,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+0.878,0.871,0.873,0.876,0.879,0.881,0.653,0.874,0.884,0.883,0.852,0.87,0.873,0.681,0.677,0.676,0.689,0.707,0.722,0.879,0.874,0.877,0.881,0.879,0.875,0.877,0.877,0.876,0.881,0.884,0.875,0.856,0.82,0.873,0.878,0.881,0.879,0.881,0.883,0.882,0.881,0.882,0.822,0.645,0.873,0.866,0.742,0.866,0.697,0.74,0.703,0.715,0.729,0.726,0.717,0.726,0.719,0.717,0.713,0.72,0.728,0.706,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+0.691,0.755,0.76,0.766,0.758,0.761,0.711,0.773,0.764,0.759,0.746,0.764,0.756,0.73,0.736,0.747,0.725,0.719,0.709,0.771,0.752,0.761,0.76,0.761,0.76,0.763,0.768,0.771,0.756,0.759,0.763,0.771,0.757,0.743,0.764,0.764,0.751,0.745,0.761,0.757,0.761,0.761,0.712,0.716,0.755,0.709,0.768,0.771,0.734,0.752,0.752,0.741,0.732,0.734,0.723,0.745,0.747,0.754,0.733,0.727,0.731,0.72,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+0.893,0.897,0.805,0.8,0.891,0.886,0.695,0.708,0.901,0.895,0.772,0.766,0.768,0.744,0.741,0.758,0.751,0.766,0.736,0.873,0.687,0.688,0.697,0.688,0.667,0.72,0.741,0.686,0.725,0.901,0.856,0.857,0.833,0.709,0.689,0.696,0.881,0.88,0.882,0.87,0.875,0.873,0.726,0.685,0.729,0.739,0.848,0.839,0.89,0.869,0.826,0.847,0.84,0.809,0.792,0.819,0.809,0.821,0.796,0.809,0.84,0.805,Activity,A0A247D711_LISMN,High,Prokaryote
+0.485,0.513,0.508,0.512,0.512,0.511,0.304,0.521,0.518,0.513,0.352,0.496,0.522,0.317,0.321,0.33,0.502,0.49,0.51,0.534,0.488,0.517,0.522,0.527,0.45,0.499,0.523,0.504,0.5,0.536,0.508,0.478,0.318,0.486,0.507,0.51,0.505,0.518,0.516,0.511,0.515,0.511,0.318,0.31,0.41,0.324,0.418,0.462,0.424,0.389,0.484,0.477,0.482,0.488,0.472,0.489,0.475,0.482,0.486,0.482,0.382,0.377,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.485,0.513,0.508,0.512,0.512,0.511,0.304,0.521,0.518,0.513,0.352,0.496,0.522,0.317,0.321,0.33,0.502,0.49,0.51,0.534,0.488,0.517,0.522,0.527,0.45,0.499,0.523,0.504,0.5,0.536,0.508,0.478,0.318,0.486,0.507,0.51,0.505,0.518,0.516,0.511,0.515,0.511,0.318,0.31,0.41,0.324,0.418,0.462,0.424,0.389,0.484,0.477,0.482,0.488,0.472,0.489,0.475,0.482,0.486,0.482,0.382,0.377,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.43,0.43,0.442,0.436,0.444,0.445,0.239,0.421,0.375,0.372,0.249,0.237,0.245,0.246,0.252,0.246,0.288,0.288,0.383,0.407,0.384,0.415,0.416,0.396,0.245,0.356,0.357,0.355,0.389,0.432,0.383,0.356,0.233,0.397,0.42,0.424,0.434,0.444,0.443,0.455,0.456,0.453,0.248,0.245,0.275,0.258,0.273,0.285,0.2,0.288,0.275,0.292,0.274,0.305,0.283,0.299,0.288,0.301,0.293,0.288,0.281,0.249,OrganismalFitness,A4D664_9INFA,Medium,Virus
+0.724,0.829,0.83,0.841,0.827,0.834,0.656,0.737,0.821,0.831,0.838,0.82,0.837,0.731,0.776,0.839,0.865,0.88,0.874,0.832,0.701,0.78,0.802,0.822,0.764,0.826,0.823,0.814,0.87,0.821,0.887,0.858,0.646,0.723,0.815,0.844,0.817,0.823,0.857,0.856,0.854,0.862,0.741,0.63,0.85,0.808,0.861,0.881,0.799,0.802,0.858,0.864,0.855,0.877,0.873,0.873,0.86,0.873,0.867,0.873,0.864,0.776,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+0.662,0.735,0.761,0.761,0.725,0.733,0.639,0.632,0.743,0.745,0.684,0.708,0.741,0.611,0.609,0.633,0.657,0.753,0.725,0.72,0.685,0.67,0.691,0.711,0.724,0.671,0.687,0.686,0.692,0.762,0.7,0.695,0.758,0.708,0.693,0.749,0.711,0.704,0.738,0.736,0.733,0.746,0.614,0.614,0.747,0.621,0.677,0.714,0.435,0.612,0.709,0.736,0.709,0.683,0.688,0.687,0.695,0.648,0.684,0.696,0.63,0.608,Stability,A4_HUMAN,Low,Human
+0.776,0.819,0.779,0.792,0.788,0.8,0.735,0.705,0.818,0.815,0.766,0.786,0.777,0.729,0.733,0.747,0.779,0.799,0.798,0.795,0.728,0.737,0.763,0.706,0.747,0.786,0.778,0.785,0.766,0.785,0.779,0.767,0.676,0.754,0.747,0.789,0.804,0.803,0.799,0.796,0.794,0.799,0.744,0.726,0.754,0.744,0.739,0.786,0.77,0.734,0.788,0.797,0.792,0.791,0.799,0.806,0.793,0.804,0.808,0.801,0.771,0.756,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+0.705,0.686,0.71,0.7,0.687,0.685,0.622,0.675,0.698,0.7,0.674,0.698,0.699,0.616,0.652,0.676,0.672,0.668,0.69,0.697,0.641,0.698,0.721,0.697,0.634,0.705,0.704,0.703,0.707,0.689,0.673,0.677,0.681,0.66,0.688,0.657,0.707,0.696,0.693,0.693,0.687,0.685,0.634,0.615,0.664,0.659,0.683,0.676,0.695,0.664,0.677,0.681,0.672,0.662,0.679,0.674,0.676,0.669,0.678,0.669,0.679,0.679,Binding,ACE2_HUMAN,Medium,Human
+0.604,0.634,0.637,0.64,0.654,0.647,0.604,0.647,0.634,0.646,0.647,0.632,0.641,0.586,0.605,0.606,0.603,0.619,0.627,0.631,0.625,0.633,0.627,0.635,0.636,0.639,0.641,0.644,0.635,0.629,0.631,0.613,0.545,0.641,0.634,0.636,0.639,0.632,0.636,0.645,0.645,0.645,0.61,0.546,0.632,0.627,0.604,0.635,0.602,0.582,0.618,0.611,0.607,0.62,0.615,0.616,0.607,0.619,0.606,0.616,0.64,0.619,Activity,ADRB2_HUMAN,Medium,Human
+0.104,0.126,0.185,0.106,0.148,0.163,0.036,0.149,0.201,0.147,0.229,0.173,0.168,0.059,0.016,0.226,0.279,0.24,0.171,0.32,0.047,0.066,0.285,0.29,0.037,0.248,0.301,0.282,0.133,0.104,0.185,0.159,0.119,0.084,0.076,0.14,0.361,0.327,0.332,0.164,0.164,0.147,0.043,0.047,0.146,0.022,0.237,0.159,0.108,0.387,0.23,0.237,0.197,0.231,0.157,0.203,0.192,0.187,0.186,0.207,0.053,0.048,Activity,AICDA_HUMAN,Medium,Human
+0.813,0.856,0.859,0.858,0.851,0.862,0.418,0.831,0.846,0.855,0.879,0.567,0.815,0.44,0.603,0.879,0.867,0.872,0.865,0.846,0.587,0.46,0.809,0.804,0.43,0.808,0.784,0.673,0.844,0.846,0.862,0.837,0.823,0.454,0.546,0.468,0.832,0.837,0.841,0.864,0.867,0.865,0.621,0.593,0.854,0.788,0.756,0.801,0.837,0.834,0.87,0.878,0.869,0.863,0.876,0.867,0.866,0.859,0.873,0.871,0.888,0.674,Stability,AMFR_HUMAN,Medium,Human
+0.796,0.882,0.875,0.877,0.874,0.875,0.733,0.857,0.828,0.883,0.843,0.882,0.886,0.781,0.803,0.806,0.836,0.891,0.882,0.845,0.876,0.886,0.887,0.885,0.866,0.879,0.888,0.883,0.882,0.873,0.882,0.874,0.776,0.886,0.883,0.881,0.878,0.878,0.876,0.885,0.883,0.882,0.79,0.759,0.816,0.802,0.818,0.848,0.828,0.816,0.85,0.86,0.858,0.854,0.852,0.855,0.854,0.85,0.857,0.859,0.864,0.813,Activity,AMIE_PSEAE,High,Prokaryote
+0.82,0.817,0.811,0.81,0.818,0.818,0.787,0.808,0.814,0.819,0.806,0.82,0.822,0.818,0.834,0.829,0.829,0.829,0.826,0.676,0.834,0.807,0.811,0.808,0.84,0.815,0.817,0.8,0.797,0.819,0.804,0.794,0.754,0.804,0.8,0.811,0.819,0.816,0.821,0.822,0.82,0.822,0.82,0.795,0.81,0.812,0.808,0.825,0.785,0.745,0.827,0.831,0.819,0.814,0.825,0.822,0.828,0.829,0.827,0.828,0.825,0.823,Activity,ANCSZ,Medium,Eukaryote
+0.839,0.908,0.91,0.91,0.909,0.907,0.793,0.868,0.825,0.882,0.872,0.826,0.895,0.827,0.851,0.889,0.901,0.912,0.896,0.901,0.85,0.883,0.898,0.895,0.858,0.88,0.901,0.901,0.891,0.887,0.878,0.848,0.77,0.866,0.87,0.87,0.871,0.881,0.888,0.913,0.901,0.908,0.827,0.787,0.874,0.856,0.918,0.913,0.914,0.919,0.899,0.905,0.908,0.908,0.907,0.904,0.904,0.912,0.902,0.908,0.927,0.884,Stability,ARGR_ECOLI,Medium,Prokaryote
+0.594,0.518,0.605,0.598,0.595,0.604,0.425,0.567,0.492,0.524,0.18,0.158,0.552,0.424,0.352,0.38,0.474,0.494,0.563,0.479,0.363,0.272,0.479,0.529,0.195,0.493,0.536,0.386,0.536,0.564,0.458,0.389,0.371,0.49,0.224,0.457,0.597,0.605,0.637,0.617,0.624,0.628,0.355,0.402,0.518,0.445,0.386,0.502,0.449,0.158,0.403,0.498,0.489,0.53,0.428,0.46,0.491,0.504,0.511,0.522,0.424,0.223,Binding,B2L11_HUMAN,Low,Human
+0.681,0.749,0.713,0.716,0.718,0.69,0.612,0.654,0.754,0.759,0.771,0.758,0.766,0.514,0.735,0.759,0.814,0.812,0.815,0.755,0.661,0.732,0.742,0.746,0.592,0.765,0.709,0.734,0.716,0.668,0.711,0.681,0.616,0.7,0.658,0.699,0.719,0.694,0.727,0.709,0.678,0.693,0.676,0.529,0.762,0.732,0.789,0.772,0.811,0.789,0.788,0.8,0.798,0.822,0.803,0.823,0.807,0.809,0.8,0.809,0.811,0.797,Stability,BBC1_YEAST,High,Eukaryote
+0.877,0.893,0.888,0.887,0.878,0.887,0.727,0.844,0.861,0.901,0.817,0.858,0.873,0.773,0.791,0.825,0.89,0.888,0.894,0.9,0.737,0.838,0.831,0.851,0.811,0.874,0.813,0.875,0.905,0.893,0.879,0.86,0.679,0.813,0.823,0.866,0.872,0.872,0.889,0.884,0.882,0.888,0.808,0.801,0.838,0.834,0.884,0.907,0.92,0.896,0.885,0.906,0.895,0.897,0.899,0.893,0.896,0.892,0.891,0.898,0.913,0.788,Stability,BCHB_CHLTE,Medium,Prokaryote
+0.731,0.868,0.856,0.846,0.875,0.876,0.543,0.664,0.776,0.835,0.803,0.815,0.82,0.695,0.75,0.788,0.804,0.829,0.81,0.784,0.799,0.798,0.822,0.8,0.766,0.822,0.831,0.846,0.795,0.76,0.868,0.86,0.6,0.78,0.843,0.808,0.836,0.863,0.851,0.851,0.869,0.884,0.73,0.514,0.819,0.763,0.797,0.852,0.848,0.736,0.837,0.842,0.846,0.849,0.848,0.856,0.855,0.845,0.855,0.853,0.856,0.787,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.731,0.868,0.856,0.846,0.875,0.876,0.543,0.664,0.776,0.835,0.803,0.815,0.82,0.695,0.75,0.788,0.804,0.829,0.81,0.784,0.799,0.798,0.822,0.8,0.766,0.822,0.831,0.846,0.795,0.76,0.868,0.86,0.6,0.78,0.843,0.808,0.836,0.863,0.851,0.851,0.869,0.884,0.73,0.514,0.819,0.763,0.797,0.852,0.848,0.736,0.837,0.842,0.846,0.849,0.848,0.856,0.855,0.845,0.855,0.853,0.856,0.787,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.731,0.868,0.856,0.846,0.875,0.876,0.543,0.664,0.776,0.835,0.803,0.815,0.82,0.695,0.75,0.788,0.804,0.829,0.81,0.784,0.799,0.798,0.822,0.8,0.766,0.822,0.831,0.846,0.795,0.76,0.868,0.86,0.6,0.78,0.843,0.808,0.836,0.863,0.851,0.851,0.869,0.884,0.73,0.514,0.819,0.763,0.797,0.852,0.848,0.736,0.837,0.842,0.846,0.849,0.848,0.856,0.855,0.845,0.855,0.853,0.856,0.787,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.731,0.868,0.856,0.846,0.875,0.876,0.543,0.664,0.776,0.835,0.803,0.815,0.82,0.695,0.75,0.788,0.804,0.829,0.81,0.784,0.799,0.798,0.822,0.8,0.766,0.822,0.831,0.846,0.795,0.76,0.868,0.86,0.6,0.78,0.843,0.808,0.836,0.863,0.851,0.851,0.869,0.884,0.73,0.514,0.819,0.763,0.797,0.852,0.848,0.736,0.837,0.842,0.846,0.849,0.848,0.856,0.855,0.845,0.855,0.853,0.856,0.787,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.881,0.871,0.877,0.878,0.866,0.867,0.738,0.812,0.867,0.869,0.865,0.822,0.822,0.77,0.812,0.834,0.866,0.879,0.85,0.872,0.76,0.862,0.877,0.861,0.823,0.89,0.887,0.873,0.873,0.878,0.879,0.855,0.717,0.781,0.772,0.857,0.877,0.881,0.883,0.878,0.877,0.878,0.793,0.756,0.881,0.835,0.838,0.877,0.751,0.781,0.853,0.86,0.858,0.855,0.85,0.858,0.848,0.859,0.852,0.857,0.818,0.833,OrganismalFitness,BRCA1_HUMAN,Low,Human
+0.941,0.933,0.936,0.935,0.935,0.932,0.915,0.937,0.856,0.891,0.94,0.923,0.918,0.928,0.925,0.929,0.944,0.935,0.94,0.903,0.93,0.944,0.949,0.943,0.942,0.922,0.921,0.933,0.914,0.932,0.926,0.946,0.927,0.926,0.93,0.931,0.938,0.937,0.939,0.933,0.934,0.933,0.918,0.922,0.939,0.898,0.89,0.892,0.769,0.917,0.943,0.942,0.941,0.938,0.945,0.933,0.948,0.943,0.943,0.944,0.896,0.91,OrganismalFitness,BRCA2_HUMAN,,Human
+0.771,0.811,0.803,0.8,0.812,0.812,0.609,0.792,0.809,0.812,0.627,0.77,0.822,0.601,0.604,0.616,0.784,0.81,0.82,0.792,0.798,0.788,0.796,0.786,0.66,0.793,0.796,0.808,0.778,0.828,0.8,0.767,0.667,0.786,0.796,0.787,0.801,0.805,0.801,0.815,0.818,0.817,0.603,0.59,0.698,0.608,0.75,0.795,0.785,0.719,0.789,0.796,0.795,0.793,0.793,0.783,0.794,0.794,0.783,0.795,0.688,0.658,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+0.799,0.817,0.808,0.804,0.805,0.8,0.801,0.79,0.815,0.816,0.827,0.794,0.821,0.797,0.793,0.791,0.773,0.786,0.812,0.813,0.801,0.806,0.821,0.81,0.812,0.827,0.816,0.841,0.832,0.823,0.819,0.794,0.75,0.809,0.832,0.823,0.815,0.827,0.828,0.813,0.807,0.813,0.807,0.794,0.819,0.824,0.781,0.775,0.809,0.758,0.774,0.78,0.781,0.758,0.766,0.783,0.781,0.779,0.763,0.778,0.826,0.822,OrganismalFitness,CALM1_HUMAN,High,Human
+0.791,0.789,0.786,0.811,0.781,0.784,0.758,0.841,0.75,0.777,0.641,0.666,0.682,0.701,0.729,0.69,0.733,0.665,0.606,0.703,0.683,0.722,0.696,0.712,0.68,0.691,0.729,0.698,0.794,0.798,0.629,0.619,0.616,0.699,0.721,0.83,0.771,0.772,0.821,0.785,0.787,0.817,0.663,0.718,0.776,0.688,0.768,0.753,0.686,0.723,0.673,0.651,0.654,0.656,0.637,0.645,0.659,0.652,0.688,0.663,0.723,0.665,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+0.712,0.714,0.716,0.715,0.713,0.712,0.711,0.672,0.726,0.724,0.737,0.693,0.708,0.674,0.654,0.7,0.714,0.72,0.726,0.712,0.658,0.731,0.73,0.724,0.671,0.739,0.734,0.74,0.7,0.72,0.731,0.706,0.682,0.698,0.7,0.694,0.709,0.722,0.716,0.712,0.723,0.72,0.672,0.657,0.73,0.698,0.674,0.719,0.708,0.69,0.728,0.726,0.712,0.726,0.728,0.717,0.726,0.722,0.719,0.722,0.734,0.707,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.712,0.714,0.716,0.715,0.713,0.712,0.711,0.672,0.726,0.724,0.737,0.693,0.708,0.674,0.654,0.7,0.714,0.72,0.726,0.712,0.658,0.731,0.73,0.724,0.671,0.739,0.734,0.74,0.7,0.72,0.731,0.706,0.682,0.698,0.7,0.694,0.709,0.722,0.716,0.712,0.723,0.72,0.672,0.657,0.73,0.698,0.674,0.719,0.708,0.69,0.728,0.726,0.712,0.726,0.728,0.717,0.726,0.722,0.719,0.722,0.734,0.707,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.836,0.838,0.839,0.839,0.839,0.839,0.831,0.822,0.84,0.84,0.836,0.829,0.829,0.829,0.829,0.837,0.839,0.842,0.841,0.828,0.832,0.831,0.839,0.845,0.831,0.84,0.838,0.837,0.839,0.844,0.842,0.842,0.834,0.825,0.828,0.84,0.838,0.839,0.842,0.839,0.839,0.841,0.829,0.826,0.838,0.829,0.829,0.838,0.829,0.828,0.84,0.837,0.841,0.838,0.837,0.837,0.837,0.84,0.839,0.839,0.842,0.837,Activity,CAS9_STRP1,Medium,Prokaryote
+0.779,0.82,0.82,0.812,0.812,0.818,0.7,0.759,0.818,0.824,0.813,0.796,0.811,0.712,0.809,0.826,0.812,0.823,0.824,0.798,0.711,0.778,0.756,0.829,0.809,0.817,0.824,0.812,0.831,0.802,0.838,0.821,0.754,0.712,0.803,0.819,0.809,0.822,0.822,0.823,0.829,0.824,0.777,0.681,0.817,0.801,0.768,0.807,0.785,0.74,0.812,0.801,0.804,0.807,0.81,0.807,0.819,0.818,0.806,0.805,0.819,0.796,Activity,CASP3_HUMAN,High,Human
+0.747,0.808,0.815,0.816,0.817,0.814,0.607,0.786,0.81,0.813,0.818,0.813,0.816,0.682,0.804,0.813,0.804,0.787,0.786,0.81,0.622,0.794,0.764,0.788,0.782,0.792,0.796,0.785,0.804,0.82,0.804,0.786,0.723,0.664,0.784,0.785,0.799,0.822,0.812,0.812,0.813,0.818,0.746,0.62,0.821,0.806,0.761,0.823,0.788,0.74,0.805,0.811,0.812,0.803,0.804,0.802,0.806,0.805,0.809,0.809,0.819,0.775,Activity,CASP7_HUMAN,Medium,Human
+0.853,0.808,0.813,0.816,0.825,0.817,0.844,0.774,0.777,0.793,0.805,0.82,0.815,0.819,0.859,0.819,0.82,0.814,0.801,0.62,0.821,0.792,0.8,0.791,0.798,0.798,0.788,0.796,0.8,0.813,0.79,0.799,0.755,0.811,0.803,0.807,0.831,0.821,0.813,0.812,0.81,0.813,0.819,0.82,0.795,0.8,0.847,0.76,0.818,0.842,0.815,0.808,0.815,0.817,0.813,0.811,0.813,0.81,0.809,0.816,0.81,0.814,Stability,CATR_CHLRE,High,Eukaryote
+0.91,0.895,0.916,0.91,0.91,0.912,0.848,0.896,0.909,0.916,0.912,0.9,0.903,0.854,0.901,0.902,0.908,0.915,0.913,0.917,0.861,0.845,0.884,0.891,0.889,0.837,0.835,0.823,0.913,0.906,0.914,0.899,0.733,0.868,0.886,0.905,0.912,0.915,0.916,0.913,0.912,0.918,0.898,0.888,0.901,0.902,0.928,0.922,0.925,0.928,0.908,0.916,0.922,0.915,0.917,0.909,0.911,0.911,0.91,0.917,0.918,0.928,Stability,CBPA2_HUMAN,Medium,Human
+0.472,0.468,0.477,0.478,0.477,0.48,0.323,0.435,0.478,0.455,0.451,0.463,0.45,0.331,0.394,0.427,0.421,0.447,0.435,0.455,0.465,0.419,0.446,0.452,0.427,0.433,0.446,0.443,0.454,0.47,0.494,0.478,0.386,0.465,0.43,0.431,0.478,0.479,0.49,0.49,0.486,0.474,0.431,0.326,0.472,0.461,0.42,0.453,0.442,0.372,0.432,0.444,0.447,0.439,0.427,0.439,0.448,0.444,0.434,0.445,0.468,0.413,OrganismalFitness,CBS_HUMAN,Medium,Human
+0.881,0.875,0.889,0.895,0.897,0.904,0.318,0.816,0.89,0.891,0.873,0.89,0.89,0.355,0.912,0.911,0.907,0.899,0.87,0.889,0.815,0.829,0.837,0.838,0.859,0.865,0.858,0.847,0.849,0.887,0.876,0.846,0.641,0.775,0.844,0.857,0.884,0.876,0.893,0.901,0.892,0.892,0.88,0.448,0.833,0.883,0.748,0.757,0.914,0.859,0.906,0.898,0.903,0.899,0.907,0.9,0.905,0.901,0.906,0.909,0.896,0.905,Stability,CBX4_HUMAN,High,Human
+0.864,0.909,0.918,0.918,0.909,0.912,0.673,0.828,0.788,0.865,0.849,0.744,0.766,0.667,0.678,0.774,0.888,0.92,0.906,0.836,0.648,0.649,0.596,0.802,0.603,0.764,0.64,0.775,0.921,0.871,0.936,0.934,0.826,0.668,0.671,0.881,0.818,0.796,0.904,0.904,0.898,0.919,0.693,0.631,0.798,0.684,0.772,0.886,0.772,0.791,0.881,0.845,0.83,0.861,0.908,0.892,0.862,0.906,0.899,0.888,0.883,0.831,Activity,CCDB_ECOLI,High,Prokaryote
+0.864,0.909,0.918,0.918,0.909,0.912,0.673,0.828,0.788,0.865,0.849,0.744,0.766,0.667,0.678,0.774,0.888,0.92,0.906,0.836,0.648,0.649,0.596,0.802,0.603,0.764,0.64,0.775,0.921,0.871,0.936,0.934,0.826,0.668,0.671,0.881,0.818,0.796,0.904,0.904,0.898,0.919,0.693,0.631,0.798,0.684,0.772,0.886,0.772,0.791,0.881,0.845,0.83,0.861,0.908,0.892,0.862,0.906,0.899,0.888,0.883,0.831,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+0.819,0.818,0.818,0.819,0.821,0.821,0.8,0.841,0.822,0.822,0.823,0.82,0.819,0.81,0.813,0.82,0.807,0.816,0.812,0.823,0.814,0.836,0.832,0.83,0.824,0.832,0.825,0.828,0.832,0.829,0.822,0.815,0.775,0.822,0.828,0.829,0.82,0.83,0.834,0.822,0.823,0.825,0.813,0.772,0.826,0.815,0.809,0.824,0.815,0.805,0.818,0.814,0.813,0.809,0.813,0.82,0.814,0.814,0.814,0.817,0.823,0.827,Binding,CCR5_HUMAN,High,Human
+0.566,0.555,0.578,0.578,0.566,0.565,0.519,0.523,0.526,0.529,0.523,0.51,0.543,0.54,0.566,0.522,0.517,0.558,0.577,0.491,0.575,0.569,0.584,0.577,0.511,0.554,0.536,0.586,0.565,0.576,0.572,0.548,0.527,0.608,0.586,0.531,0.598,0.587,0.578,0.578,0.574,0.573,0.552,0.529,0.573,0.562,0.693,0.67,0.683,0.553,0.574,0.564,0.591,0.6,0.58,0.595,0.614,0.573,0.61,0.6,0.687,0.621,Binding,CD19_HUMAN,Low,Human
+0.789,0.835,0.833,0.843,0.849,0.847,0.819,0.846,0.828,0.849,0.835,0.853,0.865,0.786,0.84,0.849,0.87,0.861,0.848,0.857,0.832,0.802,0.85,0.838,0.854,0.846,0.851,0.844,0.846,0.836,0.83,0.8,0.568,0.847,0.847,0.849,0.855,0.856,0.861,0.856,0.854,0.851,0.844,0.658,0.843,0.861,0.832,0.839,0.858,0.712,0.862,0.859,0.862,0.861,0.863,0.866,0.864,0.864,0.865,0.868,0.861,0.841,Expression,CP2C9_HUMAN,High,Human
+0.789,0.835,0.833,0.843,0.849,0.847,0.819,0.846,0.828,0.849,0.835,0.853,0.865,0.786,0.84,0.849,0.87,0.861,0.848,0.857,0.832,0.802,0.85,0.838,0.854,0.846,0.851,0.844,0.846,0.836,0.83,0.8,0.568,0.847,0.847,0.849,0.855,0.856,0.861,0.856,0.854,0.851,0.844,0.658,0.843,0.861,0.832,0.839,0.858,0.712,0.862,0.859,0.862,0.861,0.863,0.866,0.864,0.864,0.865,0.868,0.861,0.841,Binding,CP2C9_HUMAN,High,Human
+0.824,0.841,0.847,0.852,0.851,0.849,0.672,0.807,0.858,0.848,0.838,0.87,0.874,0.751,0.895,0.86,0.789,0.835,0.822,0.845,0.766,0.745,0.815,0.841,0.726,0.866,0.871,0.847,0.842,0.842,0.857,0.843,0.678,0.729,0.833,0.824,0.853,0.867,0.87,0.852,0.864,0.865,0.756,0.745,0.844,0.862,0.872,0.847,0.902,0.887,0.84,0.839,0.843,0.847,0.842,0.849,0.845,0.851,0.841,0.847,0.855,0.873,Stability,CSN4_MOUSE,Medium,Eukaryote
+0.746,0.847,0.798,0.8,0.821,0.815,0.646,0.781,0.863,0.867,0.777,0.671,0.673,0.617,0.637,0.679,0.728,0.797,0.793,0.768,0.626,0.618,0.643,0.563,0.544,0.564,0.623,0.622,0.796,0.779,0.802,0.768,0.676,0.617,0.597,0.629,0.742,0.739,0.769,0.808,0.806,0.812,0.626,0.633,0.731,0.644,0.781,0.787,0.87,0.835,0.711,0.782,0.764,0.76,0.721,0.76,0.747,0.739,0.747,0.747,0.824,0.75,Stability,CUE1_YEAST,Medium,Eukaryote
+0.63,0.771,0.736,0.695,0.76,0.76,0.32,0.622,0.792,0.789,0.595,0.325,0.328,0.327,0.309,0.302,0.312,0.341,0.357,0.621,0.291,0.27,0.362,0.331,0.261,0.305,0.313,0.393,0.478,0.773,0.752,0.756,0.308,0.325,0.36,0.363,0.691,0.69,0.686,0.764,0.763,0.753,0.443,0.445,0.45,0.45,0.573,0.593,0.611,0.594,0.626,0.62,0.632,0.628,0.614,0.629,0.608,0.627,0.615,0.625,0.513,0.386,Activity,D7PM05_CLYGR,Low,Eukaryote
+0.87,0.859,0.871,0.863,0.876,0.876,0.872,0.863,0.834,0.841,0.816,0.865,0.867,0.888,0.914,0.901,0.865,0.853,0.826,0.878,0.851,0.849,0.836,0.823,0.863,0.851,0.843,0.848,0.821,0.874,0.853,0.852,0.817,0.849,0.869,0.836,0.886,0.894,0.88,0.883,0.888,0.885,0.877,0.727,0.832,0.867,0.869,0.817,0.895,0.772,0.837,0.813,0.835,0.833,0.845,0.846,0.829,0.827,0.829,0.835,0.857,0.914,OrganismalFitness,DLG4_HUMAN,Low,Human
+0.921,0.93,0.925,0.925,0.931,0.932,0.926,0.923,0.929,0.929,0.928,0.927,0.927,0.912,0.914,0.936,0.928,0.927,0.928,0.921,0.922,0.921,0.924,0.924,0.923,0.926,0.922,0.921,0.927,0.921,0.935,0.936,0.879,0.917,0.925,0.911,0.922,0.924,0.92,0.927,0.928,0.932,0.929,0.8,0.927,0.922,0.907,0.91,0.906,0.879,0.927,0.882,0.919,0.921,0.923,0.92,0.923,0.922,0.925,0.923,0.926,0.92,Binding,DLG4_RAT,Low,Eukaryote
+0.841,0.877,0.86,0.848,0.831,0.855,0.751,0.876,0.898,0.906,0.799,0.808,0.789,0.77,0.802,0.801,0.823,0.819,0.902,0.894,0.738,0.784,0.759,0.765,0.763,0.74,0.746,0.782,0.854,0.891,0.906,0.883,0.778,0.77,0.748,0.764,0.839,0.829,0.83,0.858,0.858,0.845,0.787,0.774,0.818,0.785,0.917,0.896,0.926,0.918,0.893,0.899,0.906,0.892,0.899,0.885,0.872,0.879,0.892,0.889,0.854,0.85,Stability,DN7A_SACS2,Medium,Prokaryote
+0.9,0.89,0.901,0.898,0.904,0.899,0.888,0.835,0.891,0.899,0.91,0.907,0.924,0.903,0.936,0.929,0.933,0.923,0.923,0.91,0.9,0.898,0.908,0.903,0.904,0.885,0.902,0.892,0.889,0.883,0.88,0.879,0.768,0.896,0.887,0.907,0.899,0.899,0.906,0.897,0.898,0.902,0.918,0.729,0.887,0.92,0.92,0.885,0.923,0.925,0.928,0.935,0.938,0.938,0.939,0.937,0.936,0.94,0.933,0.942,0.898,0.926,Stability,DNJA1_HUMAN,High,Human
+0.774,0.795,0.789,0.776,0.761,0.785,0.546,0.742,0.671,0.644,0.791,0.674,0.707,0.521,0.654,0.679,0.787,0.795,0.78,0.828,0.73,0.796,0.793,0.774,0.648,0.766,0.65,0.743,0.788,0.688,0.808,0.782,0.696,0.571,0.586,0.561,0.779,0.775,0.795,0.798,0.783,0.801,0.61,0.533,0.821,0.699,0.797,0.828,0.822,0.847,0.811,0.814,0.806,0.82,0.819,0.817,0.817,0.819,0.812,0.821,0.838,0.804,Stability,DOCK1_MOUSE,High,Eukaryote
+0.913,0.924,0.922,0.924,0.926,0.924,0.845,0.916,0.925,0.92,0.911,0.924,0.915,0.875,0.916,0.916,0.922,0.918,0.916,0.92,0.874,0.922,0.914,0.916,0.916,0.924,0.932,0.916,0.932,0.93,0.922,0.92,0.858,0.922,0.928,0.934,0.922,0.928,0.926,0.925,0.926,0.926,0.915,0.847,0.918,0.92,0.891,0.916,0.92,0.895,0.922,0.916,0.914,0.917,0.92,0.926,0.918,0.918,0.919,0.918,0.914,0.919,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.913,0.924,0.922,0.924,0.926,0.924,0.845,0.916,0.925,0.92,0.911,0.924,0.915,0.875,0.916,0.916,0.922,0.918,0.916,0.92,0.874,0.922,0.914,0.916,0.916,0.924,0.932,0.916,0.932,0.93,0.922,0.92,0.858,0.922,0.928,0.934,0.922,0.928,0.926,0.925,0.926,0.926,0.915,0.847,0.918,0.92,0.891,0.916,0.92,0.895,0.922,0.916,0.914,0.917,0.92,0.926,0.918,0.918,0.919,0.918,0.914,0.919,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.913,0.924,0.922,0.924,0.926,0.924,0.845,0.916,0.925,0.92,0.911,0.924,0.915,0.875,0.916,0.916,0.922,0.918,0.916,0.92,0.874,0.922,0.914,0.916,0.916,0.924,0.932,0.916,0.932,0.93,0.922,0.92,0.858,0.922,0.928,0.934,0.922,0.928,0.926,0.925,0.926,0.926,0.915,0.847,0.918,0.92,0.891,0.916,0.92,0.895,0.922,0.916,0.914,0.917,0.92,0.926,0.918,0.918,0.919,0.918,0.914,0.919,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.913,0.924,0.922,0.924,0.926,0.924,0.845,0.916,0.925,0.92,0.911,0.924,0.915,0.875,0.916,0.916,0.922,0.918,0.916,0.92,0.874,0.922,0.914,0.916,0.916,0.924,0.932,0.916,0.932,0.93,0.922,0.92,0.858,0.922,0.928,0.934,0.922,0.928,0.926,0.925,0.926,0.926,0.915,0.847,0.918,0.92,0.891,0.916,0.92,0.895,0.922,0.916,0.914,0.917,0.92,0.926,0.918,0.918,0.919,0.918,0.914,0.919,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.758,0.754,0.774,0.791,0.808,0.81,0.701,0.671,0.783,0.797,0.803,0.79,0.805,0.721,0.781,0.754,0.756,0.739,0.722,0.772,0.769,0.732,0.769,0.776,0.777,0.753,0.692,0.802,0.724,0.848,0.798,0.808,0.8,0.756,0.789,0.762,0.766,0.763,0.767,0.814,0.813,0.817,0.714,0.753,0.735,0.711,0.681,0.788,0.728,0.692,0.762,0.775,0.767,0.805,0.793,0.78,0.775,0.767,0.817,0.773,0.728,0.776,Activity,ENVZ_ECOLI,High,Prokaryote
+0.643,0.63,0.538,0.601,0.635,0.631,0.249,0.662,0.628,0.634,0.538,0.623,0.624,0.299,0.278,0.305,0.301,0.298,0.469,0.663,0.59,0.645,0.662,0.66,0.564,0.677,0.587,0.655,0.669,0.577,0.709,0.647,0.564,0.618,0.637,0.668,0.642,0.631,0.653,0.639,0.643,0.658,0.566,0.244,0.629,0.623,0.549,0.578,0.497,0.41,0.431,0.527,0.47,0.431,0.528,0.488,0.495,0.51,0.378,0.487,0.353,0.319,OrganismalFitness,ENV_HV1B9,Medium,Virus
+0.304,0.315,0.33,0.343,0.337,0.336,0.145,0.317,0.302,0.307,0.258,0.287,0.289,0.165,0.174,0.17,0.18,0.182,0.209,0.342,0.303,0.313,0.331,0.339,0.294,0.325,0.337,0.318,0.342,0.267,0.302,0.271,0.216,0.3,0.314,0.324,0.312,0.314,0.321,0.335,0.337,0.341,0.268,0.143,0.294,0.275,0.225,0.293,0.177,0.199,0.211,0.207,0.225,0.225,0.223,0.22,0.216,0.209,0.212,0.222,0.206,0.183,OrganismalFitness,ENV_HV1BR,Medium,Virus
+0.917,0.935,0.937,0.939,0.935,0.939,0.363,0.912,0.938,0.939,0.912,0.917,0.922,0.391,0.915,0.919,0.937,0.94,0.936,0.938,0.921,0.924,0.917,0.92,0.926,0.925,0.93,0.932,0.922,0.912,0.929,0.91,0.883,0.93,0.913,0.931,0.946,0.934,0.942,0.943,0.937,0.941,0.886,0.551,0.937,0.906,0.947,0.931,0.949,0.942,0.938,0.939,0.938,0.945,0.944,0.941,0.937,0.941,0.942,0.941,0.948,0.946,Stability,EPHB2_HUMAN,High,Human
+0.782,0.884,0.794,0.766,0.805,0.821,0.798,0.892,0.836,0.899,0.848,0.744,0.79,0.866,0.789,0.788,0.881,0.899,0.896,0.853,0.82,0.866,0.897,0.907,0.667,0.898,0.914,0.919,0.899,0.772,0.884,0.847,0.828,0.878,0.652,0.908,0.858,0.74,0.893,0.823,0.797,0.836,0.793,0.758,0.89,0.767,0.917,0.928,0.741,0.825,0.827,0.844,0.786,0.872,0.857,0.775,0.804,0.866,0.826,0.829,0.779,0.772,Expression,ERBB2_HUMAN,Low,Human
+0.708,0.774,0.774,0.777,0.773,0.779,0.657,0.69,0.737,0.796,0.691,0.729,0.75,0.662,0.713,0.69,0.693,0.69,0.717,0.731,0.663,0.663,0.711,0.667,0.718,0.704,0.735,0.739,0.775,0.753,0.732,0.737,0.595,0.667,0.731,0.686,0.725,0.741,0.7,0.784,0.783,0.764,0.719,0.605,0.75,0.728,0.838,0.804,0.792,0.757,0.755,0.701,0.74,0.729,0.732,0.724,0.72,0.714,0.72,0.735,0.744,0.719,Stability,ESTA_BACSU,High,Prokaryote
+0.143,0.824,0.791,0.884,0.845,0.859,0.159,0.66,0.675,0.712,0.837,0.703,0.726,0.123,0.122,0.192,0.616,0.59,0.818,0.776,0.12,0.112,0.121,0.154,0.17,0.455,0.121,0.671,0.852,0.797,0.914,0.892,0.161,0.119,0.117,0.921,0.144,0.127,0.879,0.727,0.7,0.893,0.175,0.16,0.466,0.196,0.187,0.52,0.2,0.138,0.846,0.87,0.825,0.891,0.872,0.882,0.834,0.845,0.857,0.868,0.471,0.111,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.143,0.824,0.791,0.884,0.845,0.859,0.159,0.66,0.675,0.712,0.837,0.703,0.726,0.123,0.122,0.192,0.616,0.59,0.818,0.776,0.12,0.112,0.121,0.154,0.17,0.455,0.121,0.671,0.852,0.797,0.914,0.892,0.161,0.119,0.117,0.921,0.144,0.127,0.879,0.727,0.7,0.893,0.175,0.16,0.466,0.196,0.187,0.52,0.2,0.138,0.846,0.87,0.825,0.891,0.872,0.882,0.834,0.845,0.857,0.868,0.471,0.111,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.666,0.702,0.687,0.686,0.691,0.684,0.529,0.638,0.685,0.728,0.736,0.638,0.69,0.521,0.742,0.759,0.715,0.709,0.718,0.693,0.601,0.675,0.668,0.654,0.669,0.67,0.694,0.679,0.701,0.698,0.714,0.677,0.578,0.522,0.578,0.656,0.639,0.634,0.692,0.668,0.657,0.699,0.703,0.597,0.745,0.719,0.788,0.773,0.767,0.745,0.679,0.71,0.726,0.729,0.718,0.703,0.699,0.698,0.716,0.697,0.746,0.745,Stability,FECA_ECOLI,High,Prokaryote
+0.87,0.869,0.875,0.876,0.878,0.878,0.737,0.829,0.82,0.822,0.735,0.728,0.741,0.729,0.718,0.712,0.761,0.774,0.817,0.818,0.758,0.754,0.716,0.837,0.732,0.831,0.815,0.768,0.799,0.844,0.816,0.807,0.65,0.696,0.724,0.8,0.869,0.87,0.863,0.88,0.876,0.881,0.75,0.738,0.744,0.748,0.881,0.86,0.892,0.884,0.787,0.845,0.859,0.836,0.858,0.827,0.842,0.831,0.8,0.844,0.888,0.82,Stability,FKBP3_HUMAN,Medium,Human
+0.714,0.774,0.82,0.798,0.793,0.802,0.705,0.587,0.842,0.818,0.825,0.775,0.775,0.729,0.759,0.771,0.832,0.847,0.859,0.783,0.776,0.729,0.751,0.738,0.754,0.776,0.794,0.778,0.863,0.809,0.863,0.851,0.67,0.706,0.751,0.714,0.813,0.807,0.828,0.794,0.802,0.811,0.743,0.777,0.824,0.75,0.638,0.804,0.603,0.652,0.851,0.816,0.843,0.833,0.823,0.818,0.858,0.817,0.832,0.835,0.778,0.764,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+0.759,0.754,0.748,0.754,0.752,0.749,0.664,0.683,0.759,0.756,0.742,0.739,0.747,0.727,0.719,0.748,0.76,0.757,0.752,0.756,0.647,0.641,0.652,0.647,0.637,0.634,0.632,0.645,0.681,0.758,0.748,0.742,0.661,0.644,0.638,0.754,0.755,0.757,0.764,0.753,0.753,0.762,0.723,0.735,0.723,0.735,0.743,0.759,0.735,0.708,0.744,0.74,0.741,0.741,0.746,0.747,0.741,0.74,0.745,0.742,0.709,0.703,Binding,GCN4_YEAST,Low,Eukaryote
+0.879,0.859,0.86,0.859,0.849,0.857,0.769,0.871,0.872,0.869,0.864,0.86,0.868,0.737,0.795,0.878,0.864,0.867,0.877,0.864,0.779,0.874,0.865,0.872,0.854,0.876,0.87,0.87,0.871,0.868,0.842,0.844,0.804,0.832,0.86,0.843,0.877,0.861,0.858,0.863,0.861,0.858,0.778,0.706,0.886,0.79,0.88,0.879,0.862,0.742,0.852,0.851,0.861,0.861,0.858,0.872,0.858,0.86,0.862,0.856,0.866,0.844,OrganismalFitness,GDIA_HUMAN,Low,Human
+0.917,0.907,0.913,0.913,0.913,0.913,0.416,0.912,0.916,0.915,0.812,0.47,0.472,0.464,0.493,0.449,0.456,0.5,0.562,0.878,0.443,0.457,0.514,0.471,0.403,0.524,0.63,0.912,0.918,0.923,0.899,0.899,0.47,0.435,0.543,0.902,0.924,0.925,0.922,0.919,0.92,0.931,0.65,0.598,0.653,0.658,0.906,0.898,0.923,0.883,0.875,0.882,0.89,0.895,0.889,0.889,0.887,0.889,0.878,0.89,0.871,0.748,Activity,GFP_AEQVI,Low,Eukaryote
+0.765,0.716,0.777,0.776,0.764,0.77,0.624,0.836,0.744,0.729,0.649,0.718,0.682,0.621,0.616,0.696,0.699,0.673,0.724,0.699,0.692,0.652,0.664,0.719,0.672,0.689,0.648,0.692,0.734,0.743,0.739,0.762,0.765,0.655,0.688,0.687,0.794,0.793,0.796,0.765,0.772,0.769,0.643,0.653,0.71,0.674,0.737,0.655,0.755,0.652,0.832,0.761,0.749,0.713,0.752,0.687,0.75,0.797,0.781,0.752,0.776,0.808,Expression,GLPA_HUMAN,Low,Human
+0.744,0.81,0.812,0.82,0.828,0.832,0.76,0.754,0.83,0.801,0.827,0.777,0.818,0.752,0.831,0.864,0.867,0.812,0.843,0.818,0.824,0.813,0.789,0.774,0.801,0.804,0.767,0.816,0.763,0.807,0.78,0.767,0.781,0.817,0.816,0.721,0.825,0.825,0.766,0.845,0.848,0.831,0.839,0.633,0.803,0.817,0.875,0.8,0.894,0.787,0.842,0.849,0.84,0.852,0.854,0.849,0.841,0.85,0.838,0.855,0.819,0.845,OrganismalFitness,GRB2_HUMAN,Medium,Human
+0.892,0.907,0.883,0.885,0.9,0.901,0.778,0.871,0.857,0.91,0.917,0.836,0.871,0.795,0.851,0.905,0.924,0.914,0.925,0.823,0.791,0.819,0.847,0.858,0.846,0.815,0.756,0.825,0.916,0.905,0.905,0.878,0.746,0.749,0.803,0.889,0.886,0.899,0.898,0.89,0.913,0.907,0.832,0.813,0.913,0.83,0.898,0.924,0.914,0.891,0.918,0.921,0.917,0.919,0.932,0.921,0.926,0.928,0.923,0.927,0.927,0.88,Stability,HCP_LAMBD,Medium,Virus
+0.86,0.85,0.882,0.89,0.898,0.895,0.59,0.776,0.883,0.892,0.872,0.654,0.669,0.59,0.585,0.853,0.888,0.871,0.876,0.868,0.5,0.813,0.85,0.842,0.821,0.832,0.85,0.855,0.836,0.873,0.887,0.852,0.511,0.599,0.438,0.666,0.859,0.841,0.86,0.883,0.879,0.875,0.642,0.623,0.876,0.648,0.767,0.845,0.747,0.681,0.892,0.884,0.891,0.887,0.903,0.901,0.898,0.903,0.898,0.902,0.889,0.563,Stability,HECD1_HUMAN,Medium,Human
+0.57,0.585,0.585,0.577,0.579,0.581,0.468,0.375,0.581,0.59,0.549,0.549,0.55,0.45,0.549,0.554,0.554,0.585,0.575,0.442,0.565,0.565,0.587,0.61,0.549,0.56,0.56,0.55,0.589,0.595,0.601,0.614,0.45,0.562,0.594,0.616,0.582,0.603,0.611,0.587,0.591,0.598,0.535,0.473,0.558,0.553,0.558,0.585,0.526,0.543,0.561,0.545,0.564,0.566,0.565,0.562,0.568,0.556,0.56,0.561,0.574,0.565,Activity,HEM3_HUMAN,Medium,Human
+0.826,0.882,0.873,0.874,0.864,0.861,0.667,0.385,0.852,0.863,0.86,0.812,0.839,0.549,0.645,0.744,0.867,0.88,0.882,0.8,0.811,0.857,0.871,0.875,0.852,0.867,0.858,0.862,0.859,0.865,0.841,0.837,0.575,0.844,0.859,0.881,0.85,0.842,0.875,0.88,0.872,0.882,0.699,0.676,0.811,0.596,0.854,0.884,0.869,0.825,0.848,0.812,0.837,0.838,0.842,0.838,0.835,0.842,0.843,0.84,0.874,0.574,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+0.635,0.631,0.633,0.629,0.633,0.632,0.594,0.69,0.626,0.627,0.644,0.644,0.645,0.535,0.536,0.581,0.652,0.638,0.644,0.63,0.627,0.626,0.635,0.637,0.647,0.635,0.632,0.638,0.636,0.638,0.635,0.629,0.561,0.608,0.646,0.644,0.64,0.646,0.644,0.632,0.634,0.633,0.544,0.522,0.668,0.634,0.624,0.655,0.505,0.571,0.647,0.642,0.652,0.654,0.644,0.652,0.646,0.651,0.64,0.643,0.663,0.626,OrganismalFitness,HMDH_HUMAN,Low,Human
+0.89,0.906,0.906,0.907,0.908,0.905,0.815,0.885,0.908,0.906,0.902,0.901,0.907,0.775,0.804,0.837,0.879,0.888,0.89,0.873,0.899,0.905,0.906,0.908,0.902,0.908,0.911,0.91,0.91,0.909,0.909,0.905,0.866,0.904,0.907,0.907,0.902,0.904,0.904,0.907,0.907,0.907,0.81,0.748,0.9,0.889,0.795,0.902,0.78,0.811,0.875,0.878,0.875,0.877,0.877,0.879,0.877,0.875,0.873,0.878,0.897,0.885,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.89,0.906,0.906,0.907,0.908,0.905,0.815,0.885,0.908,0.906,0.902,0.901,0.907,0.775,0.804,0.837,0.879,0.888,0.89,0.873,0.899,0.905,0.906,0.908,0.902,0.908,0.911,0.91,0.91,0.909,0.909,0.905,0.866,0.904,0.907,0.907,0.902,0.904,0.904,0.907,0.907,0.907,0.81,0.748,0.9,0.889,0.795,0.902,0.78,0.811,0.875,0.878,0.875,0.877,0.877,0.879,0.877,0.875,0.873,0.878,0.897,0.885,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.89,0.906,0.906,0.907,0.908,0.905,0.815,0.885,0.908,0.906,0.902,0.901,0.907,0.775,0.804,0.837,0.879,0.888,0.89,0.873,0.899,0.905,0.906,0.908,0.902,0.908,0.911,0.91,0.91,0.909,0.909,0.905,0.866,0.904,0.907,0.907,0.902,0.904,0.904,0.907,0.907,0.907,0.81,0.748,0.9,0.889,0.795,0.902,0.78,0.811,0.875,0.878,0.875,0.877,0.877,0.879,0.877,0.875,0.873,0.878,0.897,0.885,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.734,0.736,0.727,0.728,0.73,0.731,0.638,0.716,0.734,0.737,0.726,0.724,0.73,0.659,0.68,0.736,0.746,0.734,0.732,0.688,0.726,0.726,0.734,0.727,0.734,0.734,0.738,0.734,0.72,0.726,0.721,0.709,0.654,0.731,0.73,0.73,0.739,0.729,0.732,0.734,0.728,0.73,0.694,0.645,0.732,0.734,0.727,0.735,0.724,0.691,0.731,0.736,0.739,0.736,0.738,0.745,0.747,0.738,0.736,0.74,0.746,0.724,OrganismalFitness,HXK4_HUMAN,Medium,Human
+0.734,0.736,0.727,0.728,0.73,0.731,0.638,0.716,0.734,0.737,0.726,0.724,0.73,0.659,0.68,0.736,0.746,0.734,0.732,0.688,0.726,0.726,0.734,0.727,0.734,0.734,0.738,0.734,0.72,0.726,0.721,0.709,0.654,0.731,0.73,0.73,0.739,0.729,0.732,0.734,0.728,0.73,0.694,0.645,0.732,0.734,0.727,0.735,0.724,0.691,0.731,0.736,0.739,0.736,0.738,0.745,0.747,0.738,0.736,0.74,0.746,0.724,Expression,HXK4_HUMAN,Medium,Human
+0.772,0.763,0.754,0.753,0.776,0.776,0.619,0.744,0.756,0.771,0.617,0.612,0.613,0.611,0.608,0.613,0.625,0.617,0.645,0.731,0.727,0.754,0.761,0.771,0.615,0.626,0.635,0.615,0.755,0.785,0.718,0.713,0.666,0.759,0.755,0.754,0.774,0.769,0.768,0.779,0.776,0.772,0.608,0.613,0.618,0.616,0.677,0.679,0.674,0.661,0.609,0.643,0.646,0.635,0.627,0.615,0.618,0.615,0.606,0.624,0.645,0.625,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+0.907,0.943,0.934,0.932,0.934,0.935,0.862,0.907,0.892,0.888,0.944,0.936,0.937,0.875,0.936,0.942,0.94,0.932,0.939,0.938,0.932,0.942,0.941,0.942,0.942,0.947,0.947,0.943,0.941,0.886,0.931,0.929,0.857,0.935,0.939,0.949,0.934,0.937,0.945,0.936,0.937,0.937,0.919,0.868,0.945,0.939,0.873,0.931,0.939,0.911,0.919,0.938,0.946,0.939,0.945,0.943,0.933,0.944,0.946,0.946,0.938,0.925,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+0.788,0.811,0.827,0.821,0.82,0.819,0.751,0.815,0.805,0.809,0.831,0.809,0.804,0.781,0.782,0.796,0.803,0.832,0.825,0.812,0.836,0.819,0.83,0.835,0.868,0.845,0.859,0.843,0.841,0.825,0.822,0.808,0.812,0.742,0.787,0.857,0.79,0.834,0.85,0.8,0.803,0.82,0.788,0.767,0.829,0.799,0.811,0.833,0.812,0.806,0.838,0.809,0.824,0.838,0.84,0.823,0.832,0.827,0.838,0.836,0.843,0.795,Stability,ILF3_HUMAN,High,Human
+0.765,0.722,0.784,0.781,0.785,0.786,0.773,0.758,0.831,0.799,0.8,0.811,0.821,0.828,0.814,0.876,0.838,0.842,0.854,0.77,0.786,0.771,0.797,0.811,0.819,0.823,0.845,0.781,0.825,0.793,0.818,0.797,0.7,0.765,0.751,0.793,0.765,0.758,0.779,0.781,0.771,0.785,0.811,0.783,0.821,0.809,0.908,0.894,0.887,0.865,0.84,0.837,0.858,0.841,0.831,0.845,0.839,0.844,0.821,0.847,0.882,0.833,Stability,ISDH_STAAW,High,Prokaryote
+0.713,0.708,0.705,0.708,0.716,0.717,0.648,0.702,0.702,0.708,0.65,0.684,0.72,0.633,0.625,0.715,0.73,0.692,0.689,0.688,0.633,0.617,0.69,0.708,0.668,0.741,0.72,0.699,0.706,0.71,0.722,0.702,0.374,0.646,0.708,0.73,0.715,0.722,0.725,0.72,0.718,0.718,0.64,0.624,0.71,0.64,0.674,0.706,0.657,0.637,0.72,0.726,0.722,0.728,0.726,0.72,0.718,0.728,0.724,0.73,0.736,0.642,Expression,KCNE1_HUMAN,Medium,Human
+0.713,0.708,0.705,0.708,0.716,0.717,0.648,0.702,0.702,0.708,0.65,0.684,0.72,0.633,0.625,0.715,0.73,0.692,0.689,0.688,0.633,0.617,0.69,0.708,0.668,0.741,0.72,0.699,0.706,0.71,0.722,0.702,0.374,0.646,0.708,0.73,0.715,0.722,0.725,0.72,0.718,0.718,0.64,0.624,0.71,0.64,0.674,0.706,0.657,0.637,0.72,0.726,0.722,0.728,0.726,0.72,0.718,0.728,0.724,0.73,0.736,0.642,Activity,KCNE1_HUMAN,Medium,Human
+0.639,0.641,0.639,0.639,0.64,0.64,0.625,0.521,0.614,0.633,0.678,0.549,0.537,0.556,0.525,0.582,0.52,0.558,0.606,0.697,0.652,0.648,0.698,0.679,0.683,0.693,0.63,0.661,0.643,0.678,0.601,0.641,0.593,0.633,0.644,0.661,0.628,0.647,0.68,0.63,0.648,0.646,0.709,0.513,0.68,0.646,0.645,0.461,0.27,0.52,0.702,0.682,0.686,0.717,0.697,0.718,0.676,0.678,0.703,0.702,0.698,0.66,Activity,KCNH2_HUMAN,Medium,Human
+0.806,0.808,0.809,0.81,0.811,0.812,0.718,0.792,0.806,0.806,0.813,0.809,0.81,0.735,0.804,0.82,0.82,0.818,0.818,0.8,0.814,0.8,0.799,0.804,0.812,0.806,0.805,0.808,0.803,0.809,0.808,0.8,0.747,0.806,0.792,0.802,0.81,0.804,0.803,0.81,0.807,0.807,0.787,0.718,0.806,0.81,0.79,0.802,0.79,0.766,0.814,0.809,0.812,0.812,0.814,0.812,0.814,0.814,0.814,0.814,0.814,0.804,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+0.806,0.808,0.809,0.81,0.811,0.812,0.718,0.792,0.806,0.806,0.813,0.809,0.81,0.735,0.804,0.82,0.82,0.818,0.818,0.8,0.814,0.8,0.799,0.804,0.812,0.806,0.805,0.808,0.803,0.809,0.808,0.8,0.747,0.806,0.792,0.802,0.81,0.804,0.803,0.81,0.807,0.807,0.787,0.718,0.806,0.81,0.79,0.802,0.79,0.766,0.814,0.809,0.812,0.812,0.814,0.812,0.814,0.814,0.814,0.814,0.814,0.804,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+0.471,0.591,0.585,0.612,0.6,0.61,0.366,0.523,0.597,0.627,0.548,0.554,0.587,0.389,0.426,0.506,0.604,0.637,0.641,0.57,0.476,0.52,0.598,0.578,0.484,0.586,0.614,0.557,0.634,0.604,0.57,0.548,0.3,0.406,0.515,0.566,0.549,0.572,0.6,0.621,0.623,0.63,0.423,0.334,0.553,0.466,0.561,0.606,0.606,0.509,0.598,0.597,0.614,0.603,0.608,0.607,0.602,0.583,0.597,0.613,0.623,0.506,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+0.853,0.937,0.935,0.936,0.937,0.937,0.794,0.896,0.881,0.928,0.859,0.916,0.926,0.802,0.826,0.844,0.903,0.929,0.931,0.901,0.826,0.868,0.887,0.908,0.847,0.908,0.933,0.894,0.937,0.945,0.935,0.921,0.802,0.82,0.865,0.948,0.847,0.864,0.938,0.933,0.936,0.944,0.794,0.783,0.876,0.832,0.853,0.907,0.901,0.832,0.907,0.917,0.909,0.918,0.911,0.913,0.912,0.91,0.916,0.918,0.876,0.841,Activity,LGK_LIPST,Medium,Eukaryote
+0.679,0.693,0.65,0.608,0.666,0.678,0.669,0.682,0.76,0.754,0.732,0.604,0.65,0.687,0.675,0.712,0.636,0.649,0.755,0.747,0.712,0.721,0.706,0.643,0.685,0.63,0.663,0.755,0.666,0.709,0.663,0.616,0.614,0.677,0.712,0.633,0.668,0.711,0.639,0.679,0.704,0.67,0.658,0.724,0.632,0.673,0.75,0.673,0.684,0.652,0.617,0.702,0.775,0.627,0.657,0.676,0.695,0.716,0.535,0.659,0.72,0.758,Expression,LYAM1_HUMAN,Medium,Human
+0.881,0.88,0.876,0.877,0.868,0.867,0.848,0.846,0.86,0.859,0.857,0.796,0.88,0.858,0.853,0.851,0.867,0.896,0.774,0.844,0.775,0.778,0.836,0.856,0.84,0.846,0.849,0.849,0.843,0.856,0.85,0.858,0.749,0.831,0.857,0.843,0.879,0.889,0.888,0.875,0.886,0.887,0.848,0.705,0.871,0.814,0.9,0.825,0.908,0.908,0.904,0.897,0.912,0.914,0.895,0.917,0.904,0.908,0.903,0.915,0.916,0.899,Stability,MAFG_MOUSE,Medium,Eukaryote
+0.826,0.874,0.876,0.875,0.879,0.878,0.369,0.795,0.829,0.871,0.853,0.739,0.751,0.306,0.273,0.817,0.827,0.798,0.891,0.85,0.289,0.436,0.792,0.768,0.471,0.86,0.819,0.8,0.858,0.846,0.863,0.825,0.79,0.397,0.288,0.34,0.808,0.842,0.846,0.874,0.875,0.875,0.477,0.365,0.868,0.786,0.796,0.879,0.857,0.803,0.847,0.865,0.851,0.834,0.827,0.83,0.835,0.83,0.81,0.837,0.905,0.799,Stability,MBD11_ARATH,Medium,Eukaryote
+0.859,0.914,0.897,0.912,0.917,0.918,0.867,0.894,0.855,0.874,0.923,0.879,0.879,0.829,0.869,0.888,0.906,0.91,0.92,0.899,0.91,0.926,0.913,0.914,0.927,0.923,0.929,0.926,0.927,0.922,0.917,0.906,0.821,0.856,0.908,0.926,0.875,0.904,0.917,0.91,0.912,0.903,0.879,0.802,0.918,0.903,0.835,0.912,0.876,0.812,0.881,0.896,0.897,0.901,0.9,0.898,0.9,0.899,0.897,0.896,0.884,0.875,Activity,MET_HUMAN,Medium,Human
+0.759,0.776,0.777,0.78,0.776,0.773,0.755,0.741,0.762,0.759,0.741,0.759,0.754,0.759,0.752,0.758,0.753,0.759,0.756,0.758,0.761,0.745,0.741,0.736,0.751,0.744,0.747,0.751,0.739,0.762,0.732,0.741,0.765,0.754,0.745,0.733,0.75,0.742,0.733,0.773,0.773,0.767,0.762,0.746,0.758,0.769,0.731,0.747,0.738,0.745,0.766,0.767,0.767,0.763,0.764,0.761,0.758,0.756,0.761,0.764,0.764,0.754,OrganismalFitness,MK01_HUMAN,Medium,Human
+0.847,0.9,0.902,0.902,0.904,0.902,0.752,0.887,0.891,0.899,0.883,0.894,0.902,0.799,0.833,0.835,0.864,0.884,0.891,0.906,0.804,0.901,0.903,0.903,0.848,0.887,0.902,0.885,0.906,0.911,0.903,0.904,0.748,0.779,0.889,0.894,0.852,0.887,0.89,0.898,0.907,0.905,0.81,0.793,0.863,0.837,0.704,0.839,0.797,0.765,0.862,0.856,0.846,0.855,0.868,0.858,0.852,0.854,0.862,0.863,0.84,0.815,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+0.817,0.841,0.831,0.831,0.831,0.833,0.784,0.814,0.837,0.836,0.832,0.834,0.837,0.78,0.818,0.833,0.831,0.838,0.825,0.829,0.804,0.829,0.831,0.832,0.814,0.835,0.833,0.819,0.837,0.83,0.828,0.821,0.797,0.808,0.84,0.831,0.823,0.839,0.832,0.836,0.841,0.838,0.799,0.747,0.839,0.815,0.809,0.835,0.736,0.786,0.828,0.826,0.829,0.826,0.829,0.829,0.832,0.83,0.829,0.83,0.834,0.82,OrganismalFitness,MSH2_HUMAN,Medium,Human
+0.529,0.78,0.784,0.792,0.789,0.784,0.455,0.766,0.73,0.736,0.621,0.73,0.756,0.389,0.484,0.519,0.588,0.612,0.658,0.718,0.505,0.537,0.656,0.73,0.558,0.74,0.776,0.715,0.764,0.771,0.785,0.765,0.547,0.529,0.566,0.747,0.523,0.566,0.725,0.778,0.776,0.782,0.464,0.288,0.523,0.461,0.564,0.688,0.704,0.55,0.621,0.645,0.632,0.601,0.594,0.644,0.633,0.637,0.608,0.63,0.664,0.538,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+0.79,0.798,0.79,0.798,0.787,0.788,0.677,0.78,0.806,0.805,0.807,0.789,0.788,0.711,0.751,0.795,0.8,0.797,0.808,0.769,0.781,0.786,0.789,0.809,0.785,0.783,0.788,0.777,0.817,0.794,0.791,0.782,0.622,0.791,0.792,0.795,0.792,0.793,0.798,0.798,0.798,0.801,0.735,0.699,0.805,0.756,0.746,0.793,0.801,0.725,0.796,0.8,0.795,0.801,0.796,0.796,0.8,0.796,0.801,0.801,0.796,0.766,OrganismalFitness,MTHR_HUMAN,Low,Human
+0.595,0.686,0.682,0.669,0.69,0.695,0.482,0.482,0.703,0.689,0.75,0.724,0.723,0.569,0.699,0.786,0.836,0.75,0.648,0.685,0.558,0.515,0.576,0.598,0.629,0.686,0.729,0.704,0.751,0.631,0.674,0.63,0.6,0.642,0.587,0.552,0.695,0.61,0.574,0.703,0.691,0.685,0.627,0.517,0.707,0.669,0.695,0.731,0.779,0.706,0.752,0.738,0.741,0.746,0.746,0.752,0.745,0.764,0.757,0.756,0.732,0.762,Stability,MYO3_YEAST,High,Eukaryote
+0.745,0.737,0.741,0.742,0.739,0.742,0.562,0.726,0.741,0.738,0.559,0.551,0.55,0.552,0.551,0.543,0.556,0.562,0.587,0.708,0.718,0.738,0.748,0.753,0.563,0.566,0.563,0.557,0.731,0.758,0.69,0.69,0.609,0.733,0.74,0.763,0.752,0.755,0.76,0.753,0.757,0.756,0.554,0.558,0.561,0.553,0.634,0.64,0.64,0.614,0.577,0.598,0.597,0.584,0.584,0.575,0.579,0.571,0.555,0.581,0.599,0.58,OrganismalFitness,NCAP_I34A1,Medium,Virus
+0.899,0.909,0.913,0.914,0.906,0.913,0.886,0.894,0.897,0.9,0.86,0.885,0.874,0.901,0.886,0.914,0.925,0.921,0.9,0.885,0.903,0.897,0.881,0.878,0.896,0.87,0.898,0.857,0.874,0.902,0.887,0.881,0.874,0.903,0.895,0.872,0.914,0.906,0.903,0.91,0.91,0.912,0.895,0.898,0.87,0.885,0.9,0.874,0.905,0.905,0.922,0.922,0.922,0.922,0.925,0.927,0.921,0.927,0.92,0.926,0.882,0.909,Stability,NKX31_HUMAN,High,Human
+0.949,0.942,0.936,0.954,0.942,0.953,0.793,0.924,0.969,0.97,0.943,0.727,0.736,0.637,0.719,0.791,0.835,0.89,0.946,0.766,0.794,0.907,0.864,0.882,0.824,0.828,0.96,0.926,0.9,0.958,0.948,0.922,0.778,0.818,0.908,0.914,0.938,0.958,0.95,0.949,0.962,0.946,0.762,0.764,0.936,0.686,0.864,0.936,0.676,0.776,0.814,0.886,0.796,0.852,0.811,0.81,0.825,0.928,0.944,0.831,0.928,0.748,Activity,NPC1_HUMAN,Low,Human
+0.949,0.942,0.936,0.954,0.942,0.953,0.793,0.924,0.969,0.97,0.943,0.727,0.736,0.637,0.719,0.791,0.835,0.89,0.946,0.766,0.794,0.907,0.864,0.882,0.824,0.828,0.96,0.926,0.9,0.958,0.948,0.922,0.778,0.818,0.908,0.914,0.938,0.958,0.95,0.949,0.962,0.946,0.762,0.764,0.936,0.686,0.864,0.936,0.676,0.776,0.814,0.886,0.796,0.852,0.811,0.81,0.825,0.928,0.944,0.831,0.928,0.748,Activity,NPC1_HUMAN,Low,Human
+0.879,0.947,0.933,0.933,0.952,0.953,0.606,0.919,0.957,0.963,0.575,0.684,0.922,0.568,0.539,0.564,0.7,0.914,0.917,0.872,0.95,0.942,0.924,0.912,0.537,0.898,0.895,0.889,0.945,0.944,0.831,0.841,0.482,0.941,0.93,0.946,0.914,0.923,0.934,0.955,0.953,0.953,0.535,0.568,0.549,0.568,0.729,0.766,0.75,0.644,0.658,0.688,0.713,0.717,0.748,0.773,0.703,0.764,0.641,0.757,0.649,0.561,OrganismalFitness,NRAM_I33A0,Low,Virus
+0.693,0.82,0.817,0.8,0.82,0.822,0.522,0.699,0.808,0.841,0.821,0.806,0.836,0.615,0.637,0.658,0.77,0.804,0.806,0.747,0.616,0.697,0.8,0.823,0.674,0.851,0.819,0.812,0.836,0.829,0.848,0.818,0.573,0.642,0.713,0.846,0.716,0.745,0.841,0.818,0.824,0.841,0.666,0.545,0.811,0.685,0.685,0.823,0.71,0.728,0.753,0.759,0.738,0.764,0.754,0.774,0.779,0.78,0.772,0.777,0.826,0.742,Expression,NUD15_HUMAN,High,Human
+0.656,0.85,0.817,0.811,0.813,0.826,0.644,0.75,0.848,0.831,0.753,0.477,0.515,0.636,0.592,0.59,0.651,0.674,0.683,0.81,0.796,0.832,0.844,0.859,0.569,0.79,0.599,0.866,0.862,0.824,0.837,0.838,0.534,0.728,0.621,0.685,0.704,0.634,0.689,0.833,0.808,0.836,0.671,0.723,0.666,0.672,0.899,0.836,0.859,0.877,0.832,0.869,0.851,0.87,0.865,0.863,0.858,0.843,0.856,0.86,0.842,0.672,Stability,NUSA_ECOLI,High,Prokaryote
+0.869,0.869,0.88,0.886,0.884,0.885,0.763,0.843,0.901,0.889,0.84,0.86,0.892,0.747,0.862,0.885,0.877,0.871,0.864,0.858,0.841,0.862,0.849,0.848,0.874,0.86,0.862,0.85,0.847,0.88,0.864,0.832,0.816,0.842,0.808,0.858,0.876,0.871,0.881,0.876,0.878,0.885,0.861,0.708,0.868,0.897,0.899,0.883,0.917,0.895,0.874,0.894,0.888,0.877,0.88,0.882,0.891,0.897,0.888,0.89,0.872,0.899,Stability,NUSG_MYCTU,High,Prokaryote
+0.815,0.901,0.922,0.927,0.924,0.922,0.619,0.81,0.896,0.896,0.887,0.875,0.882,0.627,0.863,0.893,0.913,0.903,0.894,0.902,0.878,0.867,0.867,0.86,0.873,0.867,0.88,0.86,0.85,0.897,0.903,0.895,0.788,0.624,0.674,0.811,0.875,0.876,0.891,0.922,0.918,0.921,0.832,0.619,0.843,0.853,0.906,0.847,0.934,0.928,0.926,0.929,0.933,0.931,0.932,0.927,0.92,0.922,0.925,0.935,0.934,0.882,Stability,OBSCN_HUMAN,High,Human
+0.833,0.832,0.851,0.852,0.815,0.832,0.831,0.824,0.843,0.845,0.857,0.808,0.79,0.825,0.817,0.862,0.881,0.833,0.857,0.864,0.847,0.845,0.864,0.835,0.821,0.843,0.865,0.852,0.852,0.865,0.838,0.83,0.824,0.817,0.816,0.823,0.853,0.82,0.826,0.851,0.825,0.83,0.855,0.822,0.867,0.869,0.881,0.891,0.883,0.911,0.866,0.863,0.866,0.876,0.872,0.867,0.873,0.871,0.862,0.871,0.88,0.836,Stability,ODP2_GEOSE,High,Prokaryote
+0.585,0.704,0.757,0.773,0.781,0.78,0.439,0.77,0.627,0.8,0.527,0.67,0.703,0.551,0.646,0.533,0.602,0.648,0.574,0.686,0.749,0.72,0.643,0.647,0.729,0.66,0.674,0.719,0.659,0.763,0.639,0.468,0.166,0.682,0.651,0.538,0.758,0.686,0.686,0.779,0.757,0.77,0.506,0.317,0.665,0.622,0.615,0.697,0.541,0.383,0.536,0.472,0.449,0.472,0.471,0.512,0.527,0.559,0.522,0.516,0.764,0.677,Expression,OPSD_HUMAN,High,Human
+0.669,0.803,0.772,0.774,0.78,0.789,0.4,0.551,0.775,0.797,0.729,0.768,0.778,0.423,0.573,0.626,0.695,0.676,0.7,0.721,0.649,0.687,0.746,0.778,0.659,0.764,0.757,0.756,0.779,0.792,0.763,0.707,0.474,0.63,0.736,0.771,0.685,0.739,0.778,0.788,0.783,0.796,0.509,0.384,0.675,0.581,0.751,0.736,0.809,0.637,0.702,0.717,0.739,0.713,0.71,0.707,0.706,0.739,0.693,0.715,0.807,0.677,Activity,OTC_HUMAN,Medium,Human
+0.61,0.755,0.632,0.652,0.663,0.665,0.583,0.707,0.595,0.599,0.838,0.796,0.797,0.581,0.806,0.806,0.727,0.753,0.804,0.64,0.577,0.715,0.746,0.756,0.729,0.734,0.766,0.742,0.692,0.748,0.764,0.721,0.734,0.607,0.674,0.75,0.628,0.638,0.75,0.666,0.655,0.707,0.651,0.603,0.78,0.827,0.821,0.792,0.84,0.807,0.759,0.769,0.771,0.766,0.772,0.759,0.767,0.73,0.738,0.762,0.869,0.844,Stability,OTU7A_HUMAN,High,Human
+0.558,0.661,0.66,0.664,0.668,0.676,0.562,0.562,0.587,0.593,0.637,0.613,0.623,0.55,0.568,0.618,0.635,0.664,0.67,0.572,0.54,0.585,0.607,0.59,0.567,0.609,0.624,0.604,0.654,0.676,0.66,0.645,0.514,0.568,0.57,0.582,0.591,0.586,0.588,0.668,0.67,0.669,0.557,0.539,0.629,0.583,0.645,0.67,0.656,0.564,0.644,0.65,0.646,0.644,0.641,0.656,0.646,0.65,0.637,0.654,0.66,0.6,Activity,OXDA_RHOTO,High,Eukaryote
+0.558,0.661,0.66,0.664,0.668,0.676,0.562,0.562,0.587,0.593,0.637,0.613,0.623,0.55,0.568,0.618,0.635,0.664,0.67,0.572,0.54,0.585,0.607,0.59,0.567,0.609,0.624,0.604,0.654,0.676,0.66,0.645,0.514,0.568,0.57,0.582,0.591,0.586,0.588,0.668,0.67,0.669,0.557,0.539,0.629,0.583,0.645,0.67,0.656,0.564,0.644,0.65,0.646,0.644,0.641,0.656,0.646,0.65,0.637,0.654,0.66,0.6,Expression,OXDA_RHOTO,High,Eukaryote
+0.742,0.742,0.74,0.739,0.731,0.732,0.62,0.738,0.738,0.738,0.764,0.743,0.763,0.63,0.625,0.704,0.729,0.738,0.746,0.722,0.703,0.733,0.756,0.737,0.722,0.759,0.778,0.768,0.741,0.749,0.76,0.74,0.604,0.668,0.738,0.732,0.743,0.754,0.745,0.735,0.743,0.741,0.632,0.609,0.748,0.636,0.71,0.759,0.724,0.71,0.739,0.747,0.74,0.736,0.738,0.742,0.731,0.736,0.732,0.737,0.75,0.681,OrganismalFitness,P53_HUMAN,Low,Human
+0.742,0.742,0.74,0.739,0.731,0.732,0.62,0.738,0.738,0.738,0.764,0.743,0.763,0.63,0.625,0.704,0.729,0.738,0.746,0.722,0.703,0.733,0.756,0.737,0.722,0.759,0.778,0.768,0.741,0.749,0.76,0.74,0.604,0.668,0.738,0.732,0.743,0.754,0.745,0.735,0.743,0.741,0.632,0.609,0.748,0.636,0.71,0.759,0.724,0.71,0.739,0.747,0.74,0.736,0.738,0.742,0.731,0.736,0.732,0.737,0.75,0.681,OrganismalFitness,P53_HUMAN,Low,Human
+0.742,0.742,0.74,0.739,0.731,0.732,0.62,0.738,0.738,0.738,0.764,0.743,0.763,0.63,0.625,0.704,0.729,0.738,0.746,0.722,0.703,0.733,0.756,0.737,0.722,0.759,0.778,0.768,0.741,0.749,0.76,0.74,0.604,0.668,0.738,0.732,0.743,0.754,0.745,0.735,0.743,0.741,0.632,0.609,0.748,0.636,0.71,0.759,0.724,0.71,0.739,0.747,0.74,0.736,0.738,0.742,0.731,0.736,0.732,0.737,0.75,0.681,OrganismalFitness,P53_HUMAN,Low,Human
+0.742,0.742,0.74,0.739,0.731,0.732,0.62,0.738,0.738,0.738,0.764,0.743,0.763,0.63,0.625,0.704,0.729,0.738,0.746,0.722,0.703,0.733,0.756,0.737,0.722,0.759,0.778,0.768,0.741,0.749,0.76,0.74,0.604,0.668,0.738,0.732,0.743,0.754,0.745,0.735,0.743,0.741,0.632,0.609,0.748,0.636,0.71,0.759,0.724,0.71,0.739,0.747,0.74,0.736,0.738,0.742,0.731,0.736,0.732,0.737,0.75,0.681,OrganismalFitness,P53_HUMAN,Low,Human
+0.749,0.861,0.905,0.905,0.874,0.876,0.642,0.855,0.906,0.879,0.891,0.856,0.857,0.721,0.86,0.864,0.901,0.889,0.905,0.897,0.773,0.823,0.827,0.86,0.796,0.858,0.908,0.855,0.945,0.803,0.905,0.877,0.782,0.801,0.755,0.837,0.775,0.744,0.779,0.847,0.844,0.86,0.757,0.582,0.824,0.828,0.748,0.902,0.893,0.698,0.861,0.878,0.856,0.863,0.888,0.892,0.886,0.886,0.899,0.891,0.896,0.895,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+0.755,0.745,0.735,0.738,0.764,0.76,0.603,0.734,0.753,0.764,0.759,0.765,0.764,0.609,0.634,0.671,0.722,0.717,0.739,0.727,0.731,0.753,0.754,0.761,0.741,0.756,0.764,0.759,0.745,0.776,0.767,0.744,0.524,0.72,0.74,0.741,0.745,0.757,0.751,0.76,0.766,0.763,0.7,0.447,0.759,0.744,0.474,0.75,0.681,0.532,0.709,0.701,0.711,0.709,0.709,0.709,0.712,0.716,0.718,0.713,0.75,0.684,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+0.797,0.83,0.823,0.826,0.823,0.823,0.694,0.775,0.807,0.823,0.81,0.797,0.809,0.696,0.8,0.806,0.812,0.816,0.803,0.811,0.701,0.796,0.798,0.806,0.793,0.813,0.813,0.814,0.812,0.816,0.805,0.792,0.721,0.735,0.804,0.81,0.795,0.808,0.811,0.828,0.827,0.829,0.784,0.688,0.805,0.802,0.781,0.831,0.814,0.751,0.815,0.808,0.813,0.812,0.813,0.816,0.812,0.812,0.814,0.817,0.825,0.798,Activity,PAI1_HUMAN,,Human
+0.463,0.46,0.464,0.453,0.457,0.453,0.228,0.429,0.245,0.258,0.231,0.236,0.245,0.22,0.218,0.22,0.215,0.215,0.401,0.391,0.417,0.422,0.41,0.455,0.241,0.387,0.407,0.381,0.409,0.431,0.396,0.417,0.266,0.413,0.435,0.439,0.46,0.454,0.449,0.455,0.459,0.451,0.226,0.223,0.23,0.232,0.265,0.257,0.205,0.273,0.209,0.209,0.222,0.212,0.215,0.225,0.211,0.216,0.215,0.229,0.254,0.213,OrganismalFitness,PA_I34A1,Medium,Virus
+0.707,0.809,0.89,0.882,0.781,0.78,0.843,0.835,0.886,0.883,0.851,0.877,0.889,0.89,0.903,0.885,0.897,0.882,0.889,0.827,0.845,0.875,0.848,0.872,0.836,0.859,0.871,0.868,0.833,0.827,0.841,0.81,0.756,0.798,0.782,0.867,0.839,0.824,0.866,0.788,0.798,0.791,0.875,0.783,0.872,0.884,0.703,0.848,0.835,0.735,0.838,0.815,0.816,0.819,0.819,0.827,0.826,0.828,0.83,0.826,0.882,0.883,Activity,PHOT_CHLRE,High,Eukaryote
+0.873,0.868,0.9,0.898,0.889,0.896,0.847,0.821,0.875,0.896,0.888,0.848,0.89,0.862,0.878,0.892,0.895,0.819,0.806,0.895,0.752,0.885,0.89,0.871,0.826,0.859,0.883,0.878,0.864,0.883,0.889,0.863,0.869,0.812,0.87,0.891,0.869,0.894,0.899,0.89,0.9,0.905,0.851,0.644,0.907,0.839,0.889,0.897,0.889,0.868,0.884,0.895,0.88,0.869,0.865,0.89,0.879,0.888,0.887,0.888,0.903,0.901,Stability,PIN1_HUMAN,High,Human
+0.888,0.864,0.868,0.867,0.874,0.87,0.863,0.804,0.873,0.899,0.812,0.85,0.85,0.86,0.887,0.897,0.9,0.885,0.863,0.867,0.826,0.812,0.826,0.822,0.851,0.838,0.841,0.82,0.82,0.858,0.828,0.822,0.806,0.841,0.859,0.797,0.86,0.874,0.836,0.866,0.877,0.861,0.858,0.838,0.791,0.852,0.832,0.773,0.898,0.857,0.892,0.894,0.904,0.9,0.906,0.891,0.898,0.879,0.893,0.897,0.807,0.888,Stability,PITX2_HUMAN,High,Human
+0.866,0.888,0.844,0.835,0.846,0.866,0.864,0.821,0.803,0.807,0.867,0.86,0.871,0.857,0.859,0.894,0.863,0.866,0.865,0.862,0.881,0.89,0.875,0.875,0.887,0.872,0.881,0.902,0.864,0.829,0.87,0.865,0.783,0.874,0.898,0.904,0.872,0.882,0.873,0.877,0.87,0.859,0.848,0.853,0.871,0.843,0.887,0.872,0.876,0.894,0.859,0.861,0.872,0.868,0.863,0.861,0.862,0.868,0.865,0.866,0.89,0.877,Stability,PKN1_HUMAN,High,Human
+0.768,0.817,0.798,0.816,0.821,0.824,0.574,0.733,0.815,0.821,0.658,0.581,0.596,0.583,0.577,0.624,0.705,0.727,0.754,0.757,0.738,0.788,0.795,0.79,0.632,0.764,0.772,0.765,0.801,0.82,0.759,0.75,0.598,0.599,0.718,0.773,0.756,0.775,0.792,0.793,0.809,0.816,0.58,0.576,0.682,0.585,0.627,0.692,0.596,0.618,0.683,0.686,0.688,0.687,0.684,0.678,0.691,0.69,0.69,0.69,0.597,0.588,OrganismalFitness,POLG_CXB3N,Medium,Virus
+0.773,0.829,0.767,0.781,0.813,0.816,0.472,0.761,0.896,0.898,0.568,0.46,0.474,0.45,0.459,0.487,0.504,0.547,0.595,0.784,0.761,0.761,0.763,0.715,0.77,0.783,0.8,0.798,0.81,0.87,0.854,0.794,0.512,0.455,0.571,0.795,0.75,0.776,0.834,0.826,0.83,0.856,0.467,0.448,0.629,0.464,0.603,0.727,0.453,0.555,0.523,0.526,0.56,0.534,0.534,0.539,0.529,0.536,0.528,0.548,0.498,0.495,OrganismalFitness,POLG_DEN26,Low,Virus
+0.679,0.708,0.686,0.692,0.705,0.712,0.186,0.284,0.703,0.7,0.283,0.645,0.643,0.186,0.202,0.196,0.229,0.212,0.237,0.484,0.4,0.556,0.631,0.596,0.542,0.588,0.467,0.5,0.615,0.732,0.601,0.498,0.286,0.646,0.654,0.619,0.667,0.667,0.669,0.682,0.668,0.672,0.197,0.177,0.596,0.228,0.195,0.435,0.517,0.406,0.335,0.4,0.284,0.408,0.33,0.299,0.413,0.298,0.299,0.362,0.276,0.248,OrganismalFitness,POLG_HCVJF,Medium,Virus
+0.582,0.867,0.806,0.816,0.825,0.828,0.544,0.815,0.703,0.845,0.818,0.581,0.589,0.584,0.59,0.567,0.613,0.606,0.589,0.891,0.518,0.5,0.496,0.547,0.542,0.512,0.526,0.462,0.577,0.876,0.891,0.895,0.517,0.529,0.506,0.553,0.704,0.706,0.698,0.81,0.813,0.805,0.648,0.595,0.643,0.623,0.845,0.827,0.902,0.884,0.896,0.898,0.887,0.905,0.904,0.917,0.903,0.922,0.904,0.913,0.883,0.784,Stability,POLG_PESV,Medium,Virus
+0.785,0.863,0.872,0.866,0.872,0.867,0.773,0.875,0.865,0.877,0.861,0.879,0.888,0.741,0.766,0.78,0.807,0.855,0.889,0.882,0.868,0.902,0.863,0.866,0.842,0.916,0.917,0.917,0.861,0.897,0.849,0.8,0.783,0.884,0.904,0.905,0.871,0.905,0.9,0.874,0.889,0.885,0.757,0.709,0.873,0.778,0.83,0.888,0.848,0.793,0.825,0.832,0.822,0.816,0.831,0.821,0.825,0.819,0.81,0.821,0.841,0.789,Activity,PPARG_HUMAN,Medium,Human
+0.805,0.807,0.807,0.809,0.807,0.807,0.728,0.774,0.81,0.807,0.808,0.817,0.815,0.744,0.754,0.784,0.804,0.814,0.813,0.801,0.776,0.783,0.818,0.812,0.801,0.818,0.815,0.818,0.813,0.81,0.807,0.787,0.735,0.778,0.803,0.82,0.807,0.812,0.815,0.808,0.809,0.809,0.766,0.717,0.807,0.79,0.792,0.817,0.804,0.772,0.801,0.8,0.798,0.799,0.798,0.798,0.796,0.794,0.8,0.8,0.819,0.783,OrganismalFitness,PPM1D_HUMAN,Low,Human
+0.857,0.872,0.916,0.913,0.907,0.91,0.76,0.854,0.917,0.922,0.924,0.805,0.847,0.765,0.771,0.894,0.917,0.915,0.898,0.901,0.864,0.911,0.918,0.902,0.895,0.911,0.917,0.917,0.905,0.901,0.903,0.878,0.811,0.814,0.818,0.858,0.884,0.888,0.895,0.908,0.914,0.909,0.784,0.766,0.884,0.78,0.899,0.862,0.917,0.901,0.919,0.924,0.924,0.922,0.92,0.923,0.923,0.922,0.919,0.926,0.92,0.906,Stability,PR40A_HUMAN,Medium,Human
+0.863,0.87,0.876,0.878,0.874,0.874,0.652,0.823,0.867,0.87,0.874,0.861,0.878,0.683,0.697,0.72,0.764,0.86,0.877,0.852,0.708,0.855,0.877,0.871,0.767,0.876,0.871,0.881,0.862,0.874,0.858,0.828,0.627,0.696,0.85,0.873,0.857,0.871,0.873,0.873,0.875,0.874,0.711,0.584,0.881,0.717,0.855,0.873,0.868,0.773,0.774,0.803,0.815,0.821,0.796,0.791,0.8,0.799,0.799,0.804,0.868,0.787,Expression,PRKN_HUMAN,Low,Human
+0.767,0.759,0.726,0.731,0.753,0.754,0.666,0.709,0.722,0.72,0.863,0.796,0.801,0.681,0.71,0.823,0.839,0.806,0.803,0.774,0.661,0.627,0.722,0.735,0.734,0.754,0.511,0.741,0.762,0.737,0.776,0.727,0.658,0.657,0.715,0.742,0.773,0.764,0.759,0.755,0.752,0.765,0.691,0.659,0.813,0.673,0.838,0.811,0.844,0.821,0.841,0.825,0.822,0.823,0.829,0.821,0.83,0.835,0.825,0.835,0.806,0.815,Stability,PSAE_PICP2,Medium,Prokaryote
+0.812,0.823,0.822,0.822,0.822,0.823,0.72,0.807,0.823,0.826,0.822,0.824,0.834,0.726,0.782,0.815,0.824,0.806,0.808,0.828,0.749,0.824,0.827,0.821,0.778,0.814,0.815,0.818,0.814,0.832,0.816,0.811,0.71,0.778,0.83,0.817,0.81,0.83,0.828,0.824,0.83,0.828,0.762,0.71,0.824,0.802,0.801,0.824,0.816,0.789,0.825,0.822,0.824,0.826,0.823,0.822,0.823,0.827,0.824,0.824,0.819,0.806,Expression,PTEN_HUMAN,Medium,Human
+0.812,0.823,0.822,0.822,0.822,0.823,0.72,0.807,0.823,0.826,0.822,0.824,0.834,0.726,0.782,0.815,0.824,0.806,0.808,0.828,0.749,0.824,0.827,0.821,0.778,0.814,0.815,0.818,0.814,0.832,0.816,0.811,0.71,0.778,0.83,0.817,0.81,0.83,0.828,0.824,0.83,0.828,0.762,0.71,0.824,0.802,0.801,0.824,0.816,0.789,0.825,0.822,0.824,0.826,0.823,0.822,0.823,0.827,0.824,0.824,0.819,0.806,Activity,PTEN_HUMAN,Medium,Human
+0.869,0.849,0.83,0.84,0.856,0.858,0.597,0.822,0.87,0.87,0.847,0.867,0.872,0.623,0.618,0.608,0.635,0.652,0.705,0.843,0.869,0.829,0.818,0.803,0.87,0.828,0.822,0.831,0.824,0.872,0.833,0.83,0.737,0.865,0.837,0.84,0.869,0.858,0.863,0.865,0.857,0.858,0.783,0.58,0.868,0.858,0.723,0.866,0.641,0.712,0.676,0.69,0.716,0.697,0.693,0.698,0.695,0.692,0.68,0.698,0.682,0.649,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+0.782,0.792,0.796,0.796,0.798,0.796,0.595,0.75,0.796,0.8,0.789,0.782,0.793,0.618,0.638,0.754,0.777,0.806,0.802,0.801,0.76,0.758,0.758,0.754,0.797,0.794,0.79,0.792,0.785,0.768,0.776,0.761,0.678,0.768,0.754,0.752,0.792,0.782,0.776,0.798,0.798,0.794,0.73,0.605,0.794,0.77,0.754,0.796,0.744,0.71,0.766,0.779,0.798,0.785,0.794,0.797,0.8,0.8,0.782,0.79,0.8,0.82,Binding,Q53Z42_HUMAN,Medium,Human
+0.782,0.792,0.796,0.796,0.798,0.796,0.595,0.75,0.796,0.8,0.789,0.782,0.793,0.618,0.638,0.754,0.777,0.806,0.802,0.801,0.76,0.758,0.758,0.754,0.797,0.794,0.79,0.792,0.785,0.768,0.776,0.761,0.678,0.768,0.754,0.752,0.792,0.782,0.776,0.798,0.798,0.794,0.73,0.605,0.794,0.77,0.754,0.796,0.744,0.71,0.766,0.779,0.798,0.785,0.794,0.797,0.8,0.8,0.782,0.79,0.8,0.82,Expression,Q53Z42_HUMAN,Medium,Human
+0.845,0.877,0.873,0.872,0.88,0.875,0.757,0.813,0.88,0.88,0.857,0.814,0.829,0.728,0.798,0.826,0.842,0.85,0.854,0.879,0.862,0.883,0.886,0.887,0.868,0.888,0.892,0.885,0.883,0.879,0.868,0.851,0.768,0.86,0.885,0.881,0.861,0.877,0.879,0.876,0.877,0.879,0.813,0.714,0.861,0.833,0.829,0.862,0.86,0.772,0.85,0.844,0.857,0.848,0.852,0.841,0.854,0.847,0.845,0.855,0.861,0.834,Activity,Q59976_STRSQ,Medium,Prokaryote
+0.658,0.724,0.619,0.623,0.66,0.658,0.429,0.526,0.731,0.728,0.622,0.487,0.473,0.465,0.484,0.465,0.458,0.471,0.441,0.67,0.443,0.496,0.464,0.506,0.445,0.441,0.464,0.448,0.448,0.728,0.698,0.701,0.482,0.484,0.462,0.499,0.63,0.624,0.631,0.658,0.657,0.659,0.545,0.515,0.534,0.533,0.66,0.605,0.643,0.595,0.585,0.594,0.604,0.659,0.645,0.648,0.641,0.638,0.638,0.635,0.524,0.513,Activity,Q6WV12_9MAXI,Low,Eukaryote
+0.898,0.921,0.918,0.93,0.928,0.942,0.838,0.891,0.923,0.927,0.938,0.92,0.94,0.873,0.885,0.901,0.926,0.92,0.926,0.807,0.93,0.934,0.931,0.936,0.922,0.932,0.933,0.939,0.937,0.927,0.923,0.906,0.859,0.928,0.937,0.937,0.926,0.938,0.931,0.94,0.94,0.939,0.896,0.882,0.914,0.897,0.862,0.925,0.886,0.847,0.934,0.931,0.928,0.936,0.933,0.93,0.935,0.927,0.925,0.94,0.948,0.888,Activity,Q837P4_ENTFA,Medium,Prokaryote
+0.813,0.919,0.915,0.912,0.918,0.927,0.867,0.883,0.792,0.813,0.861,0.867,0.862,0.767,0.842,0.839,0.844,0.882,0.873,0.85,0.858,0.892,0.924,0.918,0.836,0.897,0.911,0.917,0.935,0.897,0.888,0.908,0.803,0.856,0.895,0.916,0.861,0.886,0.911,0.917,0.919,0.918,0.852,0.777,0.877,0.848,0.84,0.894,0.903,0.796,0.864,0.859,0.85,0.842,0.854,0.831,0.86,0.862,0.842,0.857,0.836,0.846,Activity,Q837P5_ENTFA,Medium,Prokaryote
+0.592,0.684,0.598,0.598,0.646,0.645,0.481,0.666,0.665,0.67,0.594,0.487,0.484,0.478,0.47,0.479,0.484,0.492,0.508,0.612,0.5,0.499,0.511,0.499,0.479,0.478,0.495,0.633,0.64,0.685,0.656,0.661,0.477,0.464,0.478,0.675,0.584,0.592,0.673,0.641,0.649,0.685,0.535,0.538,0.537,0.535,0.604,0.606,0.679,0.605,0.594,0.579,0.598,0.599,0.591,0.609,0.598,0.598,0.595,0.597,0.539,0.45,Activity,Q8WTC7_9CNID,Low,Eukaryote
+0.928,0.931,0.916,0.914,0.943,0.946,0.596,0.889,0.621,0.621,0.669,0.626,0.609,0.625,0.647,0.648,0.637,0.889,0.937,0.89,0.838,0.839,0.859,0.854,0.845,0.808,0.828,0.828,0.818,0.935,0.897,0.876,0.602,0.769,0.828,0.792,0.912,0.922,0.913,0.936,0.945,0.939,0.623,0.583,0.68,0.623,0.824,0.843,0.871,0.792,0.736,0.75,0.771,0.778,0.787,0.758,0.763,0.749,0.75,0.778,0.721,0.718,OrganismalFitness,R1AB_SARS2,Medium,Virus
+0.797,0.754,0.752,0.754,0.76,0.769,0.739,0.762,0.795,0.807,0.797,0.716,0.761,0.777,0.825,0.835,0.806,0.822,0.75,0.806,0.856,0.8,0.771,0.786,0.808,0.779,0.758,0.773,0.756,0.809,0.748,0.711,0.649,0.772,0.8,0.772,0.819,0.813,0.794,0.784,0.788,0.795,0.802,0.717,0.731,0.786,0.805,0.697,0.831,0.821,0.789,0.82,0.804,0.808,0.821,0.802,0.8,0.807,0.812,0.81,0.77,0.833,Stability,RAD_ANTMA,High,Eukaryote
+0.749,0.785,0.775,0.783,0.788,0.789,0.704,0.82,0.782,0.789,0.769,0.805,0.806,0.688,0.713,0.795,0.763,0.786,0.801,0.788,0.806,0.792,0.793,0.796,0.794,0.815,0.814,0.801,0.799,0.802,0.773,0.792,0.738,0.805,0.804,0.791,0.791,0.788,0.787,0.797,0.784,0.788,0.726,0.712,0.747,0.804,0.715,0.749,0.704,0.735,0.764,0.763,0.773,0.751,0.745,0.759,0.763,0.742,0.73,0.753,0.76,0.658,OrganismalFitness,RAF1_HUMAN,Low,Human
+0.73,0.717,0.729,0.723,0.718,0.72,0.534,0.654,0.709,0.716,0.714,0.707,0.704,0.723,0.746,0.741,0.726,0.723,0.71,0.74,0.731,0.72,0.71,0.707,0.723,0.704,0.701,0.701,0.686,0.711,0.707,0.705,0.625,0.73,0.722,0.705,0.743,0.733,0.726,0.722,0.721,0.722,0.733,0.685,0.712,0.731,0.674,0.652,0.723,0.658,0.713,0.721,0.712,0.717,0.712,0.718,0.716,0.714,0.708,0.716,0.715,0.729,Activity,RASH_HUMAN,High,Human
+0.772,0.816,0.828,0.827,0.826,0.828,0.658,0.752,0.827,0.838,0.81,0.81,0.815,0.828,0.835,0.838,0.83,0.826,0.804,0.83,0.804,0.824,0.822,0.819,0.824,0.832,0.824,0.82,0.812,0.798,0.814,0.804,0.736,0.8,0.824,0.817,0.805,0.823,0.822,0.827,0.83,0.831,0.818,0.756,0.789,0.813,0.754,0.764,0.84,0.754,0.819,0.81,0.812,0.814,0.818,0.813,0.818,0.816,0.816,0.818,0.816,0.824,Expression,RASK_HUMAN,High,Human
+0.772,0.816,0.828,0.827,0.826,0.828,0.658,0.752,0.827,0.838,0.81,0.81,0.815,0.828,0.835,0.838,0.83,0.826,0.804,0.83,0.804,0.824,0.822,0.819,0.824,0.832,0.824,0.82,0.812,0.798,0.814,0.804,0.736,0.8,0.824,0.817,0.805,0.823,0.822,0.827,0.83,0.831,0.818,0.756,0.789,0.813,0.754,0.764,0.84,0.754,0.819,0.81,0.812,0.814,0.818,0.813,0.818,0.816,0.816,0.818,0.816,0.824,Binding,RASK_HUMAN,High,Human
+0.824,0.791,0.819,0.828,0.822,0.828,0.829,0.752,0.787,0.813,0.86,0.838,0.848,0.841,0.843,0.858,0.87,0.809,0.819,0.826,0.838,0.779,0.786,0.783,0.839,0.735,0.753,0.789,0.798,0.807,0.819,0.795,0.716,0.825,0.829,0.833,0.83,0.826,0.828,0.826,0.827,0.824,0.854,0.845,0.853,0.848,0.852,0.862,0.863,0.835,0.869,0.858,0.859,0.857,0.866,0.864,0.86,0.856,0.861,0.864,0.849,0.842,Stability,RBP1_HUMAN,High,Human
+0.787,0.765,0.784,0.784,0.782,0.782,0.704,0.714,0.759,0.767,0.815,0.775,0.793,0.76,0.799,0.809,0.822,0.782,0.815,0.779,0.728,0.617,0.727,0.759,0.715,0.776,0.771,0.746,0.794,0.757,0.765,0.725,0.482,0.745,0.75,0.769,0.799,0.805,0.794,0.797,0.791,0.784,0.775,0.779,0.805,0.777,0.814,0.836,0.821,0.814,0.814,0.812,0.808,0.806,0.806,0.807,0.813,0.805,0.803,0.813,0.806,0.842,Stability,RCD1_ARATH,Medium,Eukaryote
+0.777,0.868,0.899,0.903,0.9,0.909,0.667,0.676,0.9,0.91,0.874,0.814,0.878,0.75,0.812,0.846,0.874,0.857,0.866,0.866,0.709,0.752,0.75,0.735,0.614,0.781,0.591,0.718,0.877,0.871,0.902,0.889,0.635,0.665,0.611,0.84,0.789,0.784,0.871,0.905,0.901,0.901,0.786,0.783,0.856,0.776,0.954,0.933,0.941,0.929,0.913,0.913,0.925,0.93,0.924,0.92,0.92,0.921,0.922,0.924,0.928,0.724,Stability,RCRO_LAMBD,High,Virus
+0.897,0.871,0.897,0.894,0.898,0.895,0.783,0.855,0.839,0.888,0.861,0.855,0.872,0.792,0.895,0.857,0.862,0.872,0.855,0.883,0.845,0.879,0.859,0.874,0.883,0.885,0.881,0.886,0.873,0.873,0.849,0.828,0.865,0.773,0.857,0.851,0.884,0.896,0.886,0.904,0.905,0.897,0.834,0.709,0.878,0.854,0.886,0.9,0.908,0.911,0.874,0.876,0.879,0.879,0.885,0.88,0.879,0.881,0.873,0.881,0.892,0.888,Stability,RD23A_HUMAN,High,Human
+0.616,0.687,0.779,0.782,0.798,0.797,0.494,0.759,0.754,0.755,0.535,0.516,0.521,0.513,0.513,0.534,0.63,0.726,0.763,0.81,0.744,0.763,0.78,0.787,0.561,0.757,0.746,0.74,0.781,0.82,0.764,0.747,0.539,0.76,0.775,0.793,0.768,0.781,0.787,0.808,0.811,0.811,0.518,0.518,0.562,0.526,0.564,0.596,0.518,0.55,0.646,0.638,0.647,0.638,0.641,0.639,0.641,0.644,0.638,0.643,0.539,0.523,OrganismalFitness,RDRP_I33A0,Low,Virus
+0.703,0.694,0.71,0.706,0.709,0.705,0.59,0.703,0.7,0.702,0.602,0.695,0.695,0.61,0.606,0.627,0.704,0.713,0.703,0.688,0.687,0.695,0.686,0.71,0.69,0.715,0.635,0.693,0.703,0.705,0.701,0.706,0.581,0.699,0.703,0.696,0.701,0.705,0.692,0.705,0.702,0.704,0.604,0.594,0.678,0.607,0.67,0.657,0.687,0.693,0.695,0.696,0.697,0.703,0.706,0.699,0.7,0.694,0.705,0.701,0.668,0.682,OrganismalFitness,REV_HV1H2,Medium,Virus
+0.631,0.641,0.633,0.625,0.62,0.626,0.588,0.518,0.583,0.66,0.719,0.651,0.687,0.588,0.62,0.636,0.711,0.621,0.576,0.575,0.549,0.622,0.58,0.578,0.658,0.629,0.639,0.598,0.554,0.565,0.598,0.537,0.421,0.595,0.58,0.57,0.618,0.555,0.618,0.641,0.602,0.629,0.633,0.588,0.693,0.636,0.681,0.693,0.765,0.697,0.693,0.698,0.707,0.699,0.682,0.689,0.722,0.698,0.721,0.712,0.729,0.635,Stability,RFAH_ECOLI,High,Prokaryote
+0.892,0.944,0.939,0.934,0.935,0.938,0.801,0.928,0.892,0.879,0.941,0.948,0.947,0.818,0.865,0.922,0.939,0.935,0.931,0.935,0.891,0.924,0.913,0.901,0.928,0.902,0.917,0.935,0.916,0.903,0.936,0.935,0.756,0.913,0.936,0.898,0.92,0.935,0.913,0.937,0.941,0.932,0.896,0.823,0.936,0.928,0.956,0.941,0.953,0.948,0.944,0.946,0.947,0.947,0.942,0.944,0.94,0.944,0.943,0.944,0.945,0.945,Stability,RL20_AQUAE,High,Prokaryote
+0.895,0.909,0.913,0.923,0.915,0.916,0.705,0.901,0.924,0.929,0.876,0.9,0.92,0.799,0.878,0.865,0.922,0.918,0.923,0.92,0.915,0.919,0.919,0.921,0.929,0.918,0.92,0.92,0.928,0.922,0.922,0.925,0.803,0.911,0.912,0.906,0.915,0.922,0.917,0.928,0.932,0.925,0.907,0.773,0.908,0.9,0.7,0.831,0.802,0.72,0.922,0.931,0.925,0.92,0.929,0.922,0.929,0.929,0.927,0.926,0.926,0.868,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.895,0.909,0.913,0.923,0.915,0.916,0.705,0.901,0.924,0.929,0.876,0.9,0.92,0.799,0.878,0.865,0.922,0.918,0.923,0.92,0.915,0.919,0.919,0.921,0.929,0.918,0.92,0.92,0.928,0.922,0.922,0.925,0.803,0.911,0.912,0.906,0.915,0.922,0.917,0.928,0.932,0.925,0.907,0.773,0.908,0.9,0.7,0.831,0.802,0.72,0.922,0.931,0.925,0.92,0.929,0.922,0.929,0.929,0.927,0.926,0.926,0.868,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.895,0.909,0.913,0.923,0.915,0.916,0.705,0.901,0.924,0.929,0.876,0.9,0.92,0.799,0.878,0.865,0.922,0.918,0.923,0.92,0.915,0.919,0.919,0.921,0.929,0.918,0.92,0.92,0.928,0.922,0.922,0.925,0.803,0.911,0.912,0.906,0.915,0.922,0.917,0.928,0.932,0.925,0.907,0.773,0.908,0.9,0.7,0.831,0.802,0.72,0.922,0.931,0.925,0.92,0.929,0.922,0.929,0.929,0.927,0.926,0.926,0.868,Activity,RL40A_YEAST,Medium,Eukaryote
+0.968,0.969,0.938,0.943,0.938,0.956,0.865,0.925,0.967,0.962,0.969,0.967,0.967,0.89,0.964,0.967,0.967,0.969,0.972,0.939,0.97,0.969,0.966,0.969,0.97,0.966,0.97,0.966,0.969,0.938,0.969,0.969,0.853,0.965,0.966,0.965,0.965,0.966,0.967,0.961,0.962,0.962,0.966,0.882,0.967,0.965,0.913,0.969,0.882,0.893,0.968,0.966,0.965,0.968,0.966,0.969,0.966,0.965,0.966,0.967,0.969,0.962,Activity,RNC_ECOLI,Medium,Prokaryote
+0.848,0.884,0.874,0.879,0.873,0.884,0.834,0.835,0.823,0.817,0.885,0.876,0.879,0.853,0.86,0.868,0.879,0.894,0.888,0.891,0.823,0.888,0.882,0.886,0.849,0.891,0.865,0.887,0.893,0.872,0.883,0.839,0.856,0.777,0.865,0.863,0.822,0.866,0.853,0.886,0.888,0.882,0.848,0.796,0.827,0.825,0.901,0.801,0.903,0.887,0.857,0.863,0.869,0.877,0.874,0.867,0.877,0.866,0.866,0.872,0.885,0.85,Stability,RPC1_BP434,High,Virus
+0.77,0.892,0.879,0.884,0.879,0.885,0.795,0.886,0.856,0.874,0.874,0.84,0.876,0.78,0.818,0.832,0.86,0.878,0.885,0.872,0.696,0.802,0.807,0.847,0.726,0.816,0.812,0.75,0.908,0.871,0.889,0.884,0.694,0.738,0.824,0.883,0.789,0.837,0.876,0.878,0.879,0.891,0.764,0.814,0.82,0.814,0.647,0.812,0.827,0.714,0.859,0.862,0.855,0.86,0.866,0.869,0.869,0.848,0.865,0.862,0.866,0.786,Activity,RPC1_LAMBD,High,Virus
+0.77,0.892,0.879,0.884,0.879,0.885,0.795,0.886,0.856,0.874,0.874,0.84,0.876,0.78,0.818,0.832,0.86,0.878,0.885,0.872,0.696,0.802,0.807,0.847,0.726,0.816,0.812,0.75,0.908,0.871,0.889,0.884,0.694,0.738,0.824,0.883,0.789,0.837,0.876,0.878,0.879,0.891,0.764,0.814,0.82,0.814,0.647,0.812,0.827,0.714,0.859,0.862,0.855,0.86,0.866,0.869,0.869,0.848,0.865,0.862,0.866,0.786,Activity,RPC1_LAMBD,High,Virus
+0.879,0.865,0.867,0.867,0.867,0.869,0.799,0.837,0.867,0.866,0.86,0.859,0.857,0.813,0.831,0.884,0.864,0.868,0.861,0.856,0.86,0.867,0.849,0.849,0.86,0.852,0.884,0.862,0.86,0.86,0.858,0.849,0.823,0.875,0.862,0.852,0.881,0.87,0.865,0.876,0.868,0.864,0.831,0.82,0.878,0.858,0.9,0.887,0.888,0.893,0.866,0.863,0.866,0.87,0.866,0.856,0.865,0.865,0.872,0.868,0.866,0.915,Stability,RS15_GEOSE,Medium,Prokaryote
+0.8,0.852,0.858,0.862,0.859,0.86,0.752,0.806,0.854,0.856,0.848,0.863,0.863,0.782,0.78,0.814,0.854,0.859,0.853,0.702,0.83,0.858,0.852,0.847,0.844,0.859,0.859,0.858,0.846,0.854,0.862,0.834,0.754,0.828,0.857,0.856,0.841,0.862,0.861,0.864,0.862,0.863,0.782,0.712,0.848,0.794,0.824,0.867,0.823,0.755,0.856,0.85,0.858,0.861,0.856,0.86,0.856,0.86,0.86,0.861,0.845,0.814,Expression,S22A1_HUMAN,Medium,Human
+0.8,0.852,0.858,0.862,0.859,0.86,0.752,0.806,0.854,0.856,0.848,0.863,0.863,0.782,0.78,0.814,0.854,0.859,0.853,0.702,0.83,0.858,0.852,0.847,0.844,0.859,0.859,0.858,0.846,0.854,0.862,0.834,0.754,0.828,0.857,0.856,0.841,0.862,0.861,0.864,0.862,0.863,0.782,0.712,0.848,0.794,0.824,0.867,0.823,0.755,0.856,0.85,0.858,0.861,0.856,0.86,0.856,0.86,0.86,0.861,0.845,0.814,Activity,S22A1_HUMAN,Medium,Human
+0.874,0.833,0.842,0.863,0.854,0.869,0.766,0.808,0.844,0.857,0.816,0.829,0.826,0.659,0.847,0.866,0.805,0.842,0.77,0.765,0.784,0.819,0.817,0.822,0.781,0.829,0.816,0.818,0.816,0.804,0.813,0.756,0.76,0.749,0.798,0.801,0.836,0.842,0.849,0.853,0.873,0.863,0.824,0.671,0.854,0.829,0.827,0.869,0.842,0.869,0.834,0.772,0.83,0.807,0.837,0.831,0.809,0.824,0.824,0.827,0.796,0.893,Stability,SAV1_MOUSE,High,Eukaryote
+0.761,0.773,0.795,0.794,0.788,0.779,0.648,0.686,0.8,0.823,0.722,0.676,0.679,0.624,0.655,0.711,0.826,0.842,0.79,0.754,0.647,0.745,0.77,0.727,0.671,0.645,0.688,0.656,0.754,0.777,0.831,0.784,0.622,0.596,0.618,0.62,0.728,0.744,0.747,0.751,0.777,0.778,0.659,0.657,0.644,0.702,0.837,0.803,0.843,0.846,0.685,0.705,0.709,0.72,0.726,0.691,0.746,0.77,0.813,0.743,0.851,0.801,Stability,SBI_STAAM,Medium,Prokaryote
+0.868,0.888,0.882,0.883,0.887,0.886,0.829,0.901,0.885,0.889,0.89,0.872,0.881,0.783,0.817,0.856,0.868,0.871,0.87,0.891,0.888,0.905,0.904,0.9,0.887,0.904,0.903,0.902,0.902,0.89,0.881,0.85,0.854,0.892,0.902,0.901,0.884,0.892,0.893,0.888,0.89,0.888,0.864,0.781,0.891,0.871,0.868,0.893,0.868,0.823,0.867,0.875,0.875,0.876,0.873,0.875,0.871,0.873,0.867,0.874,0.889,0.87,Activity,SC6A4_HUMAN,Medium,Human
+0.74,0.766,0.776,0.779,0.781,0.788,0.763,0.722,0.793,0.784,0.783,0.782,0.776,0.738,0.774,0.8,0.803,0.793,0.817,0.759,0.714,0.749,0.755,0.755,0.764,0.767,0.784,0.782,0.76,0.758,0.768,0.746,0.617,0.712,0.686,0.746,0.757,0.758,0.766,0.78,0.789,0.793,0.787,0.769,0.771,0.783,0.793,0.851,0.835,0.803,0.801,0.8,0.808,0.818,0.813,0.836,0.816,0.835,0.81,0.823,0.887,0.851,Stability,SCIN_STAAR,High,Prokaryote
+0.753,0.728,0.754,0.754,0.754,0.754,0.794,0.603,0.791,0.798,0.753,0.731,0.765,0.77,0.75,0.763,0.778,0.77,0.748,0.76,0.743,0.724,0.739,0.723,0.732,0.748,0.743,0.714,0.795,0.739,0.741,0.738,0.639,0.675,0.697,0.758,0.756,0.784,0.816,0.77,0.787,0.772,0.752,0.714,0.8,0.693,0.637,0.657,0.71,0.606,0.703,0.734,0.727,0.741,0.728,0.736,0.72,0.705,0.72,0.726,0.713,0.737,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+0.931,0.927,0.931,0.931,0.936,0.936,0.599,0.877,0.925,0.934,0.922,0.899,0.906,0.688,0.743,0.914,0.922,0.935,0.933,0.917,0.591,0.637,0.618,0.76,0.67,0.886,0.568,0.806,0.92,0.94,0.92,0.9,0.522,0.593,0.743,0.886,0.932,0.93,0.924,0.935,0.935,0.933,0.703,0.654,0.913,0.728,0.901,0.898,0.94,0.935,0.932,0.932,0.934,0.939,0.935,0.937,0.938,0.936,0.94,0.94,0.944,0.945,Stability,SDA_BACSU,Medium,Prokaryote
+0.882,0.903,0.893,0.898,0.897,0.903,0.661,0.892,0.896,0.898,0.899,0.898,0.9,0.764,0.867,0.89,0.897,0.892,0.889,0.899,0.903,0.9,0.903,0.904,0.903,0.901,0.906,0.905,0.899,0.899,0.903,0.901,0.805,0.898,0.906,0.904,0.894,0.904,0.903,0.903,0.903,0.902,0.857,0.737,0.896,0.879,0.871,0.903,0.787,0.85,0.897,0.901,0.896,0.901,0.905,0.897,0.899,0.897,0.894,0.902,0.901,0.88,OrganismalFitness,SERC_HUMAN,High,Human
+0.82,0.841,0.836,0.835,0.839,0.838,0.802,0.839,0.85,0.844,0.83,0.827,0.83,0.805,0.806,0.807,0.826,0.838,0.834,0.831,0.81,0.815,0.845,0.838,0.807,0.82,0.821,0.823,0.844,0.844,0.845,0.838,0.806,0.811,0.822,0.842,0.818,0.828,0.84,0.838,0.836,0.843,0.809,0.806,0.84,0.806,0.82,0.84,0.814,0.81,0.832,0.832,0.836,0.834,0.837,0.838,0.832,0.831,0.826,0.833,0.805,0.809,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+0.83,0.855,0.835,0.834,0.841,0.838,0.821,0.809,0.818,0.838,0.845,0.84,0.847,0.832,0.84,0.86,0.855,0.835,0.839,0.85,0.835,0.812,0.829,0.823,0.842,0.846,0.839,0.839,0.838,0.835,0.816,0.804,0.829,0.754,0.823,0.839,0.829,0.827,0.842,0.835,0.839,0.838,0.846,0.76,0.837,0.855,0.845,0.836,0.859,0.845,0.849,0.867,0.866,0.859,0.855,0.862,0.848,0.848,0.846,0.86,0.853,0.851,Stability,SOX30_HUMAN,High,Human
+0.777,0.8,0.792,0.794,0.805,0.802,0.545,0.768,0.725,0.727,0.739,0.543,0.54,0.516,0.547,0.513,0.59,0.602,0.588,0.793,0.482,0.583,0.743,0.637,0.548,0.509,0.567,0.746,0.786,0.752,0.738,0.729,0.493,0.548,0.518,0.602,0.779,0.788,0.779,0.806,0.816,0.803,0.566,0.588,0.638,0.63,0.771,0.762,0.818,0.803,0.793,0.785,0.787,0.827,0.763,0.794,0.778,0.801,0.785,0.801,0.755,0.761,Stability,SPA_STAAU,Medium,Prokaryote
+0.394,0.415,0.322,0.336,0.421,0.427,0.298,0.318,0.354,0.43,0.478,0.468,0.45,0.474,0.464,0.471,0.489,0.487,0.506,0.444,0.454,0.434,0.438,0.441,0.43,0.449,0.45,0.442,0.464,0.417,0.519,0.504,0.3,0.438,0.412,0.44,0.414,0.404,0.424,0.44,0.432,0.446,0.308,0.277,0.393,0.313,0.448,0.428,0.49,0.385,0.514,0.494,0.505,0.517,0.507,0.497,0.527,0.518,0.516,0.515,0.514,0.496,Binding,SPG1_STRSG,Low,Prokaryote
+0.394,0.415,0.322,0.336,0.421,0.427,0.298,0.318,0.354,0.43,0.478,0.468,0.45,0.474,0.464,0.471,0.489,0.487,0.506,0.444,0.454,0.434,0.438,0.441,0.43,0.449,0.45,0.442,0.464,0.417,0.519,0.504,0.3,0.438,0.412,0.44,0.414,0.404,0.424,0.44,0.432,0.446,0.308,0.277,0.393,0.313,0.448,0.428,0.49,0.385,0.514,0.494,0.505,0.517,0.507,0.497,0.527,0.518,0.516,0.515,0.514,0.496,Binding,SPG1_STRSG,Low,Prokaryote
+0.394,0.415,0.322,0.336,0.421,0.427,0.298,0.318,0.354,0.43,0.478,0.468,0.45,0.474,0.464,0.471,0.489,0.487,0.506,0.444,0.454,0.434,0.438,0.441,0.43,0.449,0.45,0.442,0.464,0.417,0.519,0.504,0.3,0.438,0.412,0.44,0.414,0.404,0.424,0.44,0.432,0.446,0.308,0.277,0.393,0.313,0.448,0.428,0.49,0.385,0.514,0.494,0.505,0.517,0.507,0.497,0.527,0.518,0.516,0.515,0.514,0.496,Binding,SPG1_STRSG,Medium,Prokaryote
+0.394,0.415,0.322,0.336,0.421,0.427,0.298,0.318,0.354,0.43,0.478,0.468,0.45,0.474,0.464,0.471,0.489,0.487,0.506,0.444,0.454,0.434,0.438,0.441,0.43,0.449,0.45,0.442,0.464,0.417,0.519,0.504,0.3,0.438,0.412,0.44,0.414,0.404,0.424,0.44,0.432,0.446,0.308,0.277,0.393,0.313,0.448,0.428,0.49,0.385,0.514,0.494,0.505,0.517,0.507,0.497,0.527,0.518,0.516,0.515,0.514,0.496,Binding,SPG1_STRSG,Medium,Prokaryote
+0.784,0.792,0.779,0.771,0.775,0.774,0.708,0.778,0.81,0.824,0.784,0.788,0.776,0.72,0.765,0.753,0.794,0.796,0.766,0.804,0.724,0.778,0.693,0.702,0.668,0.731,0.71,0.778,0.769,0.826,0.803,0.787,0.47,0.652,0.762,0.799,0.734,0.761,0.743,0.804,0.807,0.81,0.755,0.656,0.778,0.727,0.828,0.787,0.833,0.817,0.789,0.8,0.815,0.818,0.812,0.812,0.81,0.804,0.791,0.813,0.835,0.755,Stability,SPG2_STRSG,Medium,Prokaryote
+0.795,0.896,0.806,0.86,0.89,0.888,0.691,0.892,0.898,0.918,0.69,0.68,0.688,0.691,0.718,0.7,0.704,0.729,0.74,0.895,0.876,0.887,0.881,0.887,0.888,0.886,0.867,0.891,0.873,0.914,0.892,0.891,0.806,0.885,0.885,0.878,0.876,0.874,0.884,0.909,0.906,0.905,0.687,0.686,0.692,0.714,0.909,0.859,0.747,0.808,0.828,0.83,0.826,0.835,0.84,0.851,0.833,0.814,0.8,0.844,0.833,0.754,Binding,SPIKE_SARS2,Medium,Virus
+0.795,0.896,0.806,0.86,0.89,0.888,0.691,0.892,0.898,0.918,0.69,0.68,0.688,0.691,0.718,0.7,0.704,0.729,0.74,0.895,0.876,0.887,0.881,0.887,0.888,0.886,0.867,0.891,0.873,0.914,0.892,0.891,0.806,0.885,0.885,0.878,0.876,0.874,0.884,0.909,0.906,0.905,0.687,0.686,0.692,0.714,0.909,0.859,0.747,0.808,0.828,0.83,0.826,0.835,0.84,0.851,0.833,0.814,0.8,0.844,0.833,0.754,Expression,SPIKE_SARS2,Medium,Virus
+0.832,0.869,0.807,0.774,0.806,0.797,0.613,0.706,0.79,0.799,0.865,0.815,0.838,0.358,0.811,0.819,0.804,0.887,0.809,0.836,0.766,0.813,0.76,0.788,0.826,0.841,0.81,0.834,0.833,0.827,0.877,0.88,0.581,0.78,0.755,0.776,0.834,0.846,0.83,0.803,0.803,0.801,0.751,0.421,0.785,0.737,0.765,0.775,0.902,0.893,0.892,0.887,0.898,0.886,0.887,0.889,0.886,0.894,0.88,0.895,0.883,0.767,Stability,SPTN1_CHICK,High,Eukaryote
+0.799,0.851,0.859,0.873,0.878,0.872,0.493,0.81,0.866,0.864,0.84,0.764,0.832,0.567,0.701,0.795,0.865,0.846,0.837,0.84,0.615,0.837,0.755,0.855,0.823,0.834,0.84,0.817,0.831,0.823,0.823,0.804,0.762,0.616,0.841,0.847,0.801,0.873,0.865,0.885,0.879,0.867,0.603,0.547,0.824,0.853,0.879,0.835,0.824,0.83,0.851,0.861,0.863,0.849,0.846,0.863,0.854,0.841,0.84,0.859,0.854,0.772,Stability,SQSTM_MOUSE,Medium,Eukaryote
+0.862,0.858,0.853,0.851,0.856,0.855,0.416,0.76,0.843,0.843,0.889,0.851,0.883,0.362,0.844,0.903,0.893,0.904,0.913,0.836,0.482,0.703,0.757,0.722,0.734,0.783,0.799,0.819,0.866,0.817,0.883,0.835,0.627,0.676,0.629,0.748,0.887,0.875,0.875,0.875,0.882,0.87,0.798,0.459,0.884,0.854,0.753,0.893,0.916,0.907,0.872,0.883,0.877,0.896,0.874,0.886,0.881,0.889,0.883,0.882,0.914,0.891,Stability,SR43C_ARATH,High,Eukaryote
+0.869,0.885,0.906,0.903,0.886,0.883,0.804,0.782,0.885,0.891,0.92,0.912,0.923,0.811,0.9,0.911,0.909,0.898,0.924,0.893,0.905,0.895,0.913,0.89,0.907,0.909,0.905,0.897,0.9,0.913,0.904,0.881,0.822,0.833,0.878,0.867,0.888,0.902,0.906,0.886,0.898,0.892,0.889,0.723,0.922,0.903,0.891,0.891,0.92,0.916,0.911,0.919,0.916,0.919,0.912,0.921,0.91,0.915,0.914,0.918,0.905,0.911,Stability,SRBS1_HUMAN,High,Human
+0.619,0.629,0.623,0.628,0.62,0.619,0.617,0.603,0.621,0.626,0.635,0.627,0.632,0.604,0.608,0.588,0.602,0.611,0.611,0.62,0.62,0.612,0.617,0.609,0.61,0.619,0.62,0.609,0.612,0.634,0.615,0.622,0.62,0.613,0.605,0.602,0.611,0.613,0.613,0.621,0.62,0.619,0.627,0.588,0.622,0.626,0.576,0.605,0.577,0.547,0.604,0.604,0.597,0.621,0.61,0.605,0.6,0.611,0.602,0.603,0.62,0.606,Activity,SRC_HUMAN,Medium,Human
+0.619,0.629,0.623,0.628,0.62,0.619,0.617,0.603,0.621,0.626,0.635,0.627,0.632,0.604,0.608,0.588,0.602,0.611,0.611,0.62,0.62,0.612,0.617,0.609,0.61,0.619,0.62,0.609,0.612,0.634,0.615,0.622,0.62,0.613,0.605,0.602,0.611,0.613,0.613,0.621,0.62,0.619,0.627,0.588,0.622,0.626,0.576,0.605,0.577,0.547,0.604,0.604,0.597,0.621,0.61,0.605,0.6,0.611,0.602,0.603,0.62,0.606,Activity,SRC_HUMAN,Medium,Human
+0.619,0.629,0.623,0.628,0.62,0.619,0.617,0.603,0.621,0.626,0.635,0.627,0.632,0.604,0.608,0.588,0.602,0.611,0.611,0.62,0.62,0.612,0.617,0.609,0.61,0.619,0.62,0.609,0.612,0.634,0.615,0.622,0.62,0.613,0.605,0.602,0.611,0.613,0.613,0.621,0.62,0.619,0.627,0.588,0.622,0.626,0.576,0.605,0.577,0.547,0.604,0.604,0.597,0.621,0.61,0.605,0.6,0.611,0.602,0.603,0.62,0.606,OrganismalFitness,SRC_HUMAN,Medium,Human
+0.707,0.76,0.779,0.766,0.769,0.771,0.637,0.763,0.758,0.749,0.773,0.775,0.788,0.658,0.723,0.75,0.785,0.73,0.763,0.774,0.67,0.759,0.77,0.778,0.768,0.775,0.763,0.79,0.765,0.717,0.777,0.779,0.621,0.672,0.759,0.771,0.739,0.754,0.772,0.764,0.764,0.772,0.737,0.639,0.772,0.749,0.735,0.774,0.729,0.723,0.768,0.763,0.778,0.77,0.767,0.769,0.78,0.78,0.77,0.777,0.776,0.732,OrganismalFitness,SUMO1_HUMAN,High,Human
+0.826,0.836,0.818,0.814,0.827,0.829,0.809,0.878,0.838,0.845,0.836,0.868,0.861,0.809,0.811,0.824,0.812,0.849,0.86,0.86,0.807,0.867,0.872,0.87,0.811,0.862,0.868,0.86,0.867,0.874,0.877,0.877,0.696,0.824,0.865,0.868,0.836,0.855,0.856,0.835,0.851,0.85,0.812,0.8,0.857,0.808,0.761,0.834,0.809,0.77,0.813,0.795,0.793,0.805,0.806,0.807,0.816,0.824,0.832,0.804,0.834,0.802,OrganismalFitness,SYUA_HUMAN,Medium,Human
+0.594,0.591,0.589,0.587,0.59,0.587,0.716,0.592,0.577,0.58,0.595,0.607,0.593,0.643,0.66,0.637,0.557,0.579,0.612,0.59,0.677,0.576,0.595,0.596,0.693,0.608,0.584,0.592,0.593,0.601,0.599,0.622,0.576,0.698,0.732,0.592,0.609,0.616,0.6,0.594,0.594,0.59,0.652,0.697,0.607,0.653,0.696,0.598,0.699,0.61,0.66,0.621,0.613,0.643,0.621,0.634,0.608,0.612,0.59,0.62,0.62,0.657,OrganismalFitness,TADBP_HUMAN,Low,Human
+0.686,0.699,0.694,0.695,0.697,0.698,0.457,0.707,0.701,0.698,0.609,0.707,0.704,0.599,0.615,0.583,0.598,0.578,0.578,0.698,0.701,0.714,0.704,0.707,0.695,0.729,0.609,0.698,0.718,0.695,0.707,0.718,0.618,0.702,0.681,0.712,0.706,0.689,0.694,0.702,0.701,0.702,0.539,0.495,0.686,0.557,0.601,0.697,0.644,0.585,0.54,0.615,0.594,0.623,0.593,0.617,0.565,0.582,0.581,0.58,0.53,0.537,OrganismalFitness,TAT_HV1BR,High,Virus
+0.896,0.882,0.888,0.891,0.881,0.882,0.825,0.799,0.887,0.871,0.85,0.877,0.879,0.885,0.863,0.889,0.904,0.87,0.868,0.824,0.848,0.845,0.826,0.835,0.857,0.848,0.844,0.85,0.841,0.848,0.828,0.798,0.861,0.846,0.863,0.889,0.88,0.881,0.894,0.879,0.883,0.879,0.88,0.691,0.852,0.876,0.849,0.846,0.877,0.879,0.869,0.863,0.864,0.882,0.864,0.88,0.875,0.869,0.874,0.871,0.899,0.88,Stability,TCRG1_MOUSE,Medium,Eukaryote
+0.771,0.767,0.79,0.808,0.808,0.823,0.583,0.59,0.799,0.818,0.766,0.816,0.8,0.488,0.791,0.794,0.788,0.816,0.799,0.802,0.538,0.661,0.58,0.519,0.642,0.653,0.515,0.669,0.778,0.836,0.812,0.776,0.747,0.53,0.722,0.605,0.777,0.824,0.777,0.79,0.822,0.808,0.71,0.624,0.81,0.745,0.83,0.835,0.814,0.874,0.85,0.805,0.842,0.84,0.854,0.827,0.834,0.834,0.827,0.841,0.794,0.767,Stability,THO1_YEAST,High,Eukaryote
+0.848,0.885,0.878,0.88,0.871,0.877,0.577,0.79,0.878,0.891,0.851,0.839,0.852,0.715,0.827,0.848,0.871,0.885,0.886,0.868,0.811,0.887,0.888,0.887,0.829,0.88,0.868,0.889,0.876,0.864,0.898,0.883,0.729,0.62,0.821,0.829,0.844,0.863,0.867,0.879,0.88,0.878,0.832,0.724,0.87,0.829,0.82,0.891,0.913,0.916,0.863,0.889,0.885,0.864,0.879,0.879,0.886,0.878,0.868,0.887,0.905,0.819,Stability,TNKS2_HUMAN,High,Human
+0.619,0.642,0.624,0.621,0.62,0.626,0.554,0.644,0.626,0.621,0.63,0.645,0.651,0.551,0.607,0.625,0.654,0.652,0.665,0.631,0.558,0.599,0.605,0.619,0.558,0.649,0.665,0.619,0.667,0.672,0.667,0.643,0.557,0.564,0.623,0.672,0.625,0.631,0.676,0.627,0.638,0.648,0.569,0.55,0.637,0.581,0.632,0.644,0.649,0.598,0.648,0.656,0.648,0.659,0.649,0.657,0.665,0.651,0.645,0.649,0.628,0.6,OrganismalFitness,TPK1_HUMAN,Medium,Human
+0.805,0.817,0.818,0.816,0.819,0.816,0.735,0.815,0.817,0.819,0.82,0.818,0.826,0.764,0.773,0.804,0.813,0.814,0.807,0.82,0.771,0.761,0.803,0.828,0.801,0.798,0.809,0.799,0.81,0.826,0.829,0.819,0.771,0.787,0.8,0.807,0.81,0.811,0.812,0.82,0.819,0.817,0.783,0.733,0.822,0.803,0.8,0.823,0.824,0.789,0.803,0.804,0.809,0.802,0.809,0.811,0.812,0.815,0.812,0.81,0.817,0.8,Expression,TPMT_HUMAN,Medium,Human
+0.668,0.597,0.695,0.635,0.627,0.625,0.69,0.641,0.663,0.656,0.678,0.643,0.671,0.589,0.664,0.575,0.571,0.61,0.649,0.589,0.619,0.589,0.659,0.652,0.638,0.546,0.742,0.7,0.676,0.618,0.611,0.618,0.708,0.64,0.633,0.617,0.67,0.682,0.63,0.651,0.667,0.607,0.58,0.627,0.624,0.643,0.486,0.635,0.653,0.56,0.703,0.606,0.625,0.629,0.621,0.634,0.613,0.668,0.664,0.624,0.637,0.691,OrganismalFitness,TPOR_HUMAN,Low,Human
+0.879,0.917,0.91,0.915,0.911,0.913,0.744,0.899,0.91,0.918,0.9,0.888,0.892,0.788,0.826,0.877,0.919,0.921,0.909,0.922,0.781,0.849,0.859,0.867,0.879,0.861,0.909,0.883,0.93,0.912,0.912,0.887,0.735,0.822,0.877,0.846,0.845,0.888,0.874,0.887,0.912,0.909,0.769,0.75,0.884,0.825,0.798,0.896,0.914,0.746,0.898,0.903,0.898,0.911,0.906,0.9,0.904,0.911,0.902,0.912,0.924,0.866,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+0.692,0.732,0.652,0.688,0.719,0.712,0.607,0.639,0.705,0.737,0.726,0.732,0.747,0.622,0.719,0.755,0.725,0.743,0.751,0.742,0.67,0.716,0.733,0.756,0.698,0.736,0.689,0.751,0.762,0.657,0.734,0.684,0.554,0.672,0.747,0.738,0.716,0.723,0.723,0.733,0.73,0.723,0.689,0.501,0.765,0.718,0.651,0.758,0.726,0.581,0.718,0.72,0.691,0.73,0.725,0.707,0.705,0.701,0.69,0.713,0.754,0.753,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+0.534,0.615,0.61,0.61,0.607,0.618,0.369,0.531,0.575,0.591,0.582,0.589,0.587,0.405,0.403,0.47,0.522,0.58,0.596,0.606,0.49,0.54,0.538,0.535,0.547,0.579,0.553,0.568,0.574,0.555,0.556,0.536,0.353,0.49,0.558,0.576,0.522,0.597,0.583,0.597,0.62,0.626,0.463,0.398,0.577,0.579,0.489,0.553,0.512,0.486,0.504,0.505,0.523,0.532,0.536,0.534,0.531,0.521,0.519,0.519,0.579,0.484,OrganismalFitness,UBC9_HUMAN,Medium,Human
+0.66,0.786,0.785,0.782,0.795,0.798,0.528,0.728,0.773,0.802,0.838,0.799,0.804,0.609,0.784,0.848,0.84,0.829,0.821,0.798,0.542,0.74,0.754,0.777,0.723,0.756,0.752,0.763,0.784,0.765,0.831,0.792,0.734,0.516,0.541,0.792,0.684,0.697,0.797,0.797,0.792,0.818,0.657,0.52,0.683,0.715,0.768,0.749,0.836,0.731,0.848,0.856,0.855,0.854,0.856,0.846,0.842,0.855,0.849,0.854,0.833,0.744,Stability,UBE4B_HUMAN,High,Human
+0.672,0.659,0.658,0.658,0.648,0.656,0.536,0.626,0.647,0.644,0.656,0.687,0.666,0.575,0.656,0.66,0.641,0.635,0.617,0.63,0.547,0.628,0.646,0.633,0.66,0.649,0.649,0.673,0.648,0.636,0.644,0.632,0.549,0.544,0.566,0.629,0.674,0.691,0.682,0.658,0.663,0.664,0.641,0.528,0.626,0.664,0.644,0.625,0.491,0.6,0.667,0.647,0.658,0.648,0.645,0.657,0.639,0.636,0.632,0.651,0.653,0.644,Activity,UBE4B_MOUSE,Low,Eukaryote
+0.867,0.877,0.845,0.856,0.872,0.867,0.775,0.8,0.862,0.869,0.859,0.818,0.842,0.792,0.795,0.781,0.773,0.863,0.859,0.831,0.852,0.869,0.884,0.884,0.878,0.883,0.889,0.891,0.885,0.88,0.873,0.848,0.813,0.83,0.863,0.852,0.864,0.872,0.878,0.871,0.869,0.879,0.804,0.783,0.885,0.82,0.881,0.885,0.885,0.885,0.859,0.877,0.887,0.882,0.886,0.878,0.874,0.881,0.874,0.881,0.886,0.861,Stability,UBR5_HUMAN,Medium,Human
+0.74,0.718,0.729,0.729,0.737,0.735,0.671,0.679,0.738,0.767,0.753,0.765,0.759,0.72,0.746,0.766,0.769,0.772,0.75,0.708,0.706,0.732,0.705,0.746,0.721,0.719,0.731,0.716,0.757,0.743,0.74,0.719,0.572,0.716,0.746,0.745,0.73,0.766,0.748,0.74,0.746,0.731,0.727,0.723,0.743,0.696,0.707,0.738,0.753,0.752,0.776,0.776,0.778,0.783,0.795,0.788,0.784,0.775,0.791,0.783,0.785,0.79,Stability,VG08_BPP22,High,Virus
+0.791,0.92,0.936,0.935,0.934,0.93,0.684,0.84,0.924,0.926,0.93,0.914,0.931,0.782,0.753,0.937,0.953,0.932,0.883,0.93,0.732,0.821,0.853,0.849,0.846,0.877,0.864,0.86,0.92,0.933,0.944,0.924,0.766,0.653,0.888,0.887,0.858,0.906,0.901,0.928,0.931,0.931,0.809,0.806,0.934,0.904,0.912,0.922,0.947,0.895,0.953,0.947,0.954,0.952,0.948,0.957,0.951,0.954,0.95,0.955,0.952,0.917,Stability,VILI_CHICK,High,Eukaryote
+0.829,0.831,0.826,0.83,0.816,0.826,0.705,0.827,0.843,0.848,0.829,0.833,0.825,0.708,0.796,0.831,0.828,0.83,0.833,0.843,0.766,0.758,0.804,0.823,0.752,0.812,0.819,0.798,0.824,0.851,0.837,0.821,0.744,0.727,0.759,0.823,0.816,0.817,0.836,0.818,0.822,0.832,0.728,0.696,0.829,0.769,0.79,0.823,0.83,0.788,0.828,0.816,0.828,0.839,0.843,0.836,0.838,0.825,0.827,0.835,0.825,0.826,Expression,VKOR1_HUMAN,Medium,Human
+0.829,0.831,0.826,0.83,0.816,0.826,0.705,0.827,0.843,0.848,0.829,0.833,0.825,0.708,0.796,0.831,0.828,0.83,0.833,0.843,0.766,0.758,0.804,0.823,0.752,0.812,0.819,0.798,0.824,0.851,0.837,0.821,0.744,0.727,0.759,0.823,0.816,0.817,0.836,0.818,0.822,0.832,0.728,0.696,0.829,0.769,0.79,0.823,0.83,0.788,0.828,0.816,0.828,0.839,0.843,0.836,0.838,0.825,0.827,0.835,0.825,0.826,Activity,VKOR1_HUMAN,Medium,Human
+0.584,0.739,0.781,0.778,0.767,0.783,0.655,0.728,0.799,0.765,0.785,0.75,0.737,0.697,0.749,0.79,0.803,0.822,0.797,0.752,0.598,0.694,0.676,0.704,0.703,0.647,0.675,0.692,0.763,0.753,0.802,0.788,0.693,0.671,0.74,0.687,0.638,0.649,0.645,0.754,0.757,0.739,0.729,0.672,0.756,0.734,0.803,0.798,0.801,0.851,0.85,0.836,0.863,0.856,0.847,0.861,0.857,0.861,0.853,0.869,0.869,0.808,Stability,VRPI_BPT7,Medium,Virus
+0.667,0.852,0.846,0.86,0.856,0.853,0.453,0.803,0.832,0.859,0.789,0.626,0.795,0.577,0.644,0.85,0.873,0.919,0.904,0.875,0.503,0.53,0.57,0.735,0.562,0.573,0.521,0.553,0.841,0.875,0.877,0.867,0.521,0.452,0.564,0.831,0.702,0.727,0.816,0.839,0.842,0.855,0.685,0.564,0.717,0.694,0.876,0.837,0.912,0.862,0.889,0.903,0.88,0.891,0.901,0.894,0.89,0.894,0.897,0.897,0.877,0.749,Stability,YAIA_ECOLI,Medium,Prokaryote
+0.464,0.386,0.471,0.468,0.466,0.463,0.374,0.312,0.295,0.288,0.409,0.362,0.357,0.409,0.419,0.441,0.453,0.412,0.39,0.327,0.308,0.301,0.293,0.322,0.332,0.3,0.311,0.322,0.296,0.37,0.382,0.397,0.34,0.358,0.302,0.323,0.427,0.368,0.396,0.45,0.42,0.439,0.484,0.181,0.422,0.48,0.359,0.418,0.332,0.372,0.355,0.348,0.365,0.37,0.347,0.358,0.369,0.373,0.386,0.361,0.478,0.41,Binding,YAP1_HUMAN,Low,Human
+0.855,0.869,0.855,0.855,0.857,0.856,0.767,0.793,0.876,0.877,0.881,0.872,0.887,0.865,0.883,0.887,0.891,0.887,0.881,0.85,0.845,0.893,0.896,0.896,0.883,0.882,0.882,0.875,0.878,0.863,0.875,0.854,0.845,0.842,0.872,0.868,0.876,0.863,0.865,0.867,0.859,0.865,0.848,0.802,0.867,0.845,0.829,0.872,0.911,0.894,0.894,0.874,0.88,0.877,0.878,0.886,0.878,0.879,0.879,0.881,0.884,0.923,Stability,YNZC_BACSU,Medium,Prokaryote
+0.75,0.787,0.783,0.785,0.789,0.791,0.642,0.741,0.774,0.785,0.761,0.743,0.757,0.658,0.702,0.731,0.755,0.763,0.769,0.77,0.712,0.736,0.75,0.755,0.721,0.757,0.751,0.762,0.779,0.785,0.788,0.772,0.655,0.709,0.731,0.76,0.767,0.772,0.787,0.791,0.792,0.795,0.699,0.635,0.755,0.718,0.742,0.771,0.754,0.726,0.769,0.772,0.772,0.775,0.773,0.774,0.773,0.775,0.771,0.776,0.772,0.735,,,,
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.csv b/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.csv
new file mode 100644
index 0000000..23c7d93
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.csv
@@ -0,0 +1,63 @@
+Model_rank,Model_name,Model type,Average_NDCG,Bootstrap_standard_error_NDCG,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,TranceptEVE L,Hybrid - Alignment & PLM,0.786,0.0,0.794,0.73,0.801,0.767,0.839,0.762,0.781,0.829,0.787,0.816,0.833,0.743,0.8,0.715,0.694,0.67,0.681,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+2,TranceptEVE S,Hybrid - Alignment & PLM,0.784,0.001,0.791,0.731,0.799,0.761,0.838,0.759,0.779,0.824,0.786,0.813,0.826,0.74,0.797,0.711,0.671,0.662,0.673,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+3,TranceptEVE M,Hybrid - Alignment & PLM,0.784,0.001,0.793,0.727,0.799,0.764,0.837,0.759,0.782,0.822,0.787,0.814,0.824,0.743,0.798,0.707,0.669,0.664,0.674,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+4,EVE (ensemble),Alignment-based model,0.783,0.001,0.791,0.724,0.798,0.762,0.838,0.754,0.779,0.826,0.784,0.81,0.829,0.742,0.796,0.708,0.688,0.674,0.666,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+5,EVE (single),Alignment-based model,0.781,0.001,0.789,0.724,0.796,0.761,0.835,0.752,0.778,0.823,0.782,0.81,0.826,0.74,0.794,0.71,0.685,0.671,0.668,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+6,Tranception L,Hybrid - Alignment & PLM,0.779,0.002,0.794,0.724,0.795,0.76,0.824,0.759,0.773,0.819,0.788,0.807,0.813,0.727,0.793,0.705,0.68,0.653,0.668,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+7,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.777,0.004,0.789,0.708,0.806,0.749,0.835,0.752,0.773,0.819,0.78,0.812,0.823,0.723,0.791,0.693,0.699,0.696,0.696,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+8,GEMME,Alignment-based model,0.777,0.002,0.795,0.711,0.794,0.756,0.828,0.758,0.775,0.813,0.779,0.805,0.817,0.743,0.789,0.69,0.674,0.674,0.708,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+9,EVmutation,Alignment-based model,0.777,0.002,0.792,0.708,0.792,0.757,0.835,0.748,0.776,0.821,0.778,0.813,0.826,0.734,0.793,0.717,0.684,0.674,0.697,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+10,DeepSequence (ensemble),Alignment-based model,0.776,0.003,0.785,0.716,0.787,0.758,0.835,0.743,0.773,0.822,0.781,0.806,0.823,0.73,0.791,0.705,0.671,0.654,0.668,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+11,VESPA,Protein language model,0.775,0.004,0.789,0.711,0.79,0.752,0.835,0.741,0.773,0.824,0.776,0.81,0.833,0.719,0.789,0.664,0.714,0.659,0.645,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+12,DeepSequence (single),Alignment-based model,0.774,0.003,0.786,0.711,0.787,0.754,0.834,0.745,0.771,0.82,0.783,0.807,0.818,0.723,0.791,0.703,0.659,0.653,0.67,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+13,Tranception M,Hybrid - Alignment & PLM,0.768,0.004,0.781,0.718,0.783,0.741,0.816,0.748,0.766,0.795,0.781,0.801,0.781,0.718,0.781,0.688,0.624,0.635,0.648,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+14,SaProt (650M),Hybrid - Structure & PLM,0.768,0.006,0.767,0.72,0.797,0.693,0.861,0.701,0.752,0.832,0.784,0.805,0.827,0.613,0.779,0.705,0.621,0.608,0.606,SaProt (650M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+15,Progen2 XL,Protein language model,0.768,0.004,0.771,0.7,0.786,0.757,0.823,0.73,0.768,0.815,0.772,0.791,0.824,0.718,0.786,0.669,0.661,0.602,0.584,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+16,Tranception S,Hybrid - Alignment & PLM,0.766,0.004,0.774,0.725,0.785,0.734,0.813,0.749,0.758,0.794,0.776,0.797,0.78,0.71,0.776,0.689,0.626,0.636,0.65,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+17,MSA Transformer (single),Hybrid - Alignment & PLM,0.766,0.004,0.781,0.696,0.786,0.745,0.823,0.743,0.767,0.805,0.771,0.809,0.807,0.712,0.782,0.68,0.695,0.695,0.694,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+18,ProtSSN (ensemble),Hybrid - Structure & PLM,0.766,0.005,0.772,0.714,0.776,0.71,0.858,0.711,0.754,0.833,0.776,0.812,0.832,0.645,0.778,0.698,0.624,0.596,0.594,ProtSSN (ensemble of 9 models),"Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+19,ProtSSN (k=30 h=768),Hybrid - Structure & PLM,0.765,0.005,0.774,0.708,0.784,0.707,0.854,0.714,0.752,0.83,0.776,0.809,0.83,0.638,0.776,0.694,0.616,0.59,0.587,"ProtSSN (k=30, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+20,MIF-ST,Hybrid - Structure & PLM,0.765,0.004,0.769,0.707,0.79,0.72,0.839,0.724,0.755,0.817,0.769,0.799,0.827,0.671,0.781,0.684,0.662,0.623,0.645,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+21,ProtSSN (k=20 h=512),Hybrid - Structure & PLM,0.764,0.005,0.773,0.711,0.769,0.709,0.856,0.712,0.753,0.831,0.775,0.811,0.83,0.644,0.777,0.694,0.638,0.614,0.604,"ProtSSN (k=20, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+22,ProtSSN (k=30 h=512),Hybrid - Structure & PLM,0.763,0.005,0.769,0.713,0.776,0.707,0.854,0.705,0.753,0.829,0.774,0.808,0.827,0.639,0.775,0.689,0.615,0.599,0.59,"ProtSSN (k=30, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+23,ProtSSN (k=20 h=1280),Hybrid - Structure & PLM,0.762,0.005,0.772,0.708,0.769,0.71,0.855,0.705,0.751,0.832,0.773,0.812,0.828,0.641,0.776,0.697,0.63,0.6,0.604,"ProtSSN (k=20, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+24,ProtSSN (k=20 h=768),Hybrid - Structure & PLM,0.761,0.006,0.769,0.702,0.772,0.709,0.855,0.704,0.751,0.831,0.772,0.807,0.828,0.643,0.775,0.697,0.617,0.587,0.594,"ProtSSN (k=20, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+25,ProtSSN (k=10 h=1280),Hybrid - Structure & PLM,0.761,0.006,0.767,0.706,0.772,0.706,0.856,0.706,0.752,0.828,0.774,0.809,0.825,0.639,0.775,0.679,0.599,0.578,0.574,"ProtSSN (k=10, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+26,Wavenet,Alignment-based model,0.761,0.005,0.759,0.703,0.777,0.74,0.825,0.72,0.761,0.809,0.769,0.782,0.813,0.709,0.777,0.692,0.645,0.597,0.58,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+27,ProtSSN (k=30 h=1280),Hybrid - Structure & PLM,0.76,0.005,0.771,0.71,0.767,0.701,0.853,0.71,0.745,0.829,0.772,0.806,0.83,0.627,0.773,0.698,0.622,0.597,0.588,"ProtSSN (k=30, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+28,ProtSSN (k=10 h=768),Hybrid - Structure & PLM,0.76,0.005,0.77,0.699,0.77,0.706,0.853,0.703,0.752,0.827,0.774,0.803,0.826,0.64,0.774,0.68,0.61,0.576,0.566,"ProtSSN (k=10, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+29,ESM2 (15B),Protein language model,0.759,0.006,0.75,0.71,0.78,0.728,0.827,0.703,0.753,0.814,0.778,0.775,0.816,0.652,0.777,0.675,0.599,0.553,0.516,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+30,VESPAl,Protein language model,0.759,0.005,0.78,0.697,0.765,0.739,0.814,0.731,0.756,0.804,0.76,0.793,0.817,0.7,0.771,0.643,0.699,0.645,0.632,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+31,ProtSSN (k=10 h=512),Hybrid - Structure & PLM,0.758,0.006,0.769,0.7,0.771,0.702,0.849,0.707,0.746,0.826,0.771,0.807,0.824,0.627,0.771,0.691,0.603,0.574,0.567,"ProtSSN (k=10, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+32,Progen2 L,Protein language model,0.755,0.006,0.763,0.691,0.791,0.735,0.795,0.718,0.744,0.802,0.773,0.768,0.795,0.668,0.771,0.65,0.628,0.589,0.561,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+33,ESM2 (3B),Protein language model,0.755,0.006,0.748,0.706,0.775,0.709,0.835,0.693,0.743,0.819,0.774,0.785,0.811,0.626,0.772,0.682,0.592,0.566,0.511,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+34,Tranception L no retrieval,Protein language model,0.752,0.005,0.768,0.692,0.775,0.749,0.776,0.723,0.742,0.793,0.761,0.742,0.797,0.712,0.772,0.663,0.665,0.607,0.599,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+35,RITA XL,Protein language model,0.752,0.006,0.75,0.698,0.784,0.734,0.792,0.699,0.753,0.784,0.773,0.737,0.777,0.702,0.768,0.646,0.559,0.555,0.518,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+36,Progen2 M,Protein language model,0.751,0.006,0.747,0.696,0.786,0.734,0.794,0.69,0.746,0.798,0.772,0.751,0.781,0.679,0.77,0.634,0.57,0.539,0.485,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+37,ESM-1v (ensemble),Protein language model,0.75,0.006,0.74,0.701,0.776,0.721,0.811,0.677,0.739,0.814,0.77,0.768,0.797,0.641,0.768,0.67,0.595,0.553,0.498,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+38,CARP (640M),Protein language model,0.748,0.006,0.75,0.693,0.771,0.709,0.818,0.695,0.74,0.803,0.774,0.78,0.785,0.633,0.768,0.676,0.593,0.554,0.522,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+39,ESM-IF1,Inverse folding model,0.748,0.007,0.738,0.698,0.762,0.676,0.864,0.681,0.74,0.824,0.754,0.801,0.822,0.629,0.761,0.716,0.619,0.616,0.648,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+40,Progen2 Base,Protein language model,0.748,0.006,0.752,0.696,0.784,0.727,0.779,0.707,0.74,0.781,0.776,0.749,0.76,0.66,0.766,0.64,0.563,0.55,0.506,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+41,ESM2 (650M),Protein language model,0.747,0.007,0.739,0.705,0.764,0.694,0.834,0.67,0.733,0.819,0.768,0.778,0.807,0.599,0.764,0.686,0.601,0.554,0.514,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+42,ESM-1b,Protein language model,0.747,0.009,0.767,0.668,0.767,0.7,0.832,0.691,0.741,0.817,0.771,0.798,0.809,0.608,0.761,0.679,0.579,0.542,0.544,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+43,Site-Independent,Alignment-based model,0.747,0.005,0.751,0.7,0.76,0.72,0.802,0.736,0.744,0.772,0.759,0.781,0.764,0.695,0.764,0.687,0.623,0.637,0.644,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+44,RITA L,Protein language model,0.747,0.006,0.741,0.697,0.778,0.731,0.786,0.697,0.746,0.781,0.769,0.737,0.767,0.697,0.763,0.643,0.552,0.552,0.496,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+45,MIF,Inverse folding model,0.743,0.007,0.736,0.694,0.78,0.662,0.844,0.706,0.727,0.796,0.755,0.771,0.794,0.632,0.757,0.682,0.623,0.619,0.637,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+46,Unirep evotuned,Hybrid - Alignment & PLM,0.739,0.005,0.741,0.682,0.778,0.714,0.778,0.725,0.73,0.761,0.747,0.736,0.771,0.692,0.753,0.623,0.619,0.588,0.554,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+47,SaProt (35M),Hybrid - Structure & PLM,0.734,0.009,0.726,0.684,0.777,0.658,0.826,0.66,0.716,0.8,0.753,0.769,0.785,0.575,0.753,0.714,0.564,0.554,0.507,SaProt (35M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+48,ESM-1v (single),Protein language model,0.732,0.01,0.734,0.664,0.764,0.708,0.791,0.652,0.726,0.802,0.753,0.756,0.784,0.621,0.755,0.659,0.596,0.552,0.493,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+49,RITA M,Protein language model,0.731,0.008,0.724,0.671,0.771,0.721,0.768,0.68,0.725,0.776,0.751,0.716,0.754,0.694,0.752,0.625,0.542,0.54,0.499,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+50,ESM2 (150M),Protein language model,0.728,0.008,0.724,0.691,0.743,0.658,0.821,0.65,0.71,0.8,0.758,0.759,0.775,0.559,0.746,0.681,0.544,0.535,0.485,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+51,Tranception M no retrieval,Protein language model,0.726,0.009,0.732,0.67,0.76,0.723,0.745,0.673,0.724,0.757,0.743,0.712,0.746,0.683,0.748,0.634,0.553,0.542,0.49,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+52,CARP (76M),Protein language model,0.72,0.007,0.718,0.689,0.74,0.663,0.79,0.654,0.7,0.78,0.743,0.755,0.749,0.579,0.734,0.655,0.535,0.523,0.489,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+53,Progen2 S,Protein language model,0.718,0.009,0.712,0.666,0.755,0.692,0.764,0.646,0.709,0.769,0.747,0.711,0.743,0.624,0.739,0.624,0.535,0.527,0.474,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+54,Tranception S no retrieval,Protein language model,0.713,0.007,0.704,0.699,0.747,0.7,0.717,0.668,0.7,0.731,0.722,0.687,0.723,0.66,0.729,0.62,0.542,0.532,0.475,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+55,ProteinMPNN,Inverse folding model,0.713,0.009,0.716,0.633,0.717,0.644,0.854,0.655,0.715,0.795,0.729,0.778,0.782,0.623,0.74,0.692,0.599,0.593,0.602,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+56,RITA S,Protein language model,0.712,0.008,0.701,0.68,0.748,0.701,0.731,0.665,0.698,0.746,0.729,0.687,0.724,0.666,0.731,0.623,0.531,0.529,0.482,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+57,ESM2 (35M),Protein language model,0.703,0.008,0.702,0.678,0.721,0.636,0.777,0.633,0.671,0.78,0.728,0.722,0.742,0.552,0.722,0.674,0.542,0.532,0.488,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+58,CARP (38M),Protein language model,0.701,0.008,0.709,0.673,0.713,0.643,0.765,0.637,0.679,0.761,0.724,0.722,0.733,0.567,0.716,0.643,0.534,0.527,0.496,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+59,ESM2 (8M),Protein language model,0.667,0.008,0.665,0.668,0.697,0.61,0.693,0.622,0.639,0.694,0.679,0.641,0.701,0.54,0.684,0.626,0.538,0.522,0.467,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+60,ProtGPT2,Protein language model,0.653,0.01,0.67,0.62,0.654,0.616,0.706,0.614,0.644,0.693,0.683,0.666,0.672,0.556,0.68,0.618,0.536,0.516,0.426,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+61,Unirep,Protein language model,0.647,0.009,0.659,0.624,0.671,0.603,0.68,0.619,0.628,0.673,0.668,0.642,0.678,0.524,0.672,0.589,0.521,0.528,0.468,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+62,CARP (600K),Protein language model,0.636,0.009,0.659,0.593,0.655,0.585,0.687,0.605,0.622,0.67,0.654,0.64,0.676,0.53,0.658,0.583,0.526,0.522,0.472,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.html b/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.html
new file mode 100644
index 0000000..ce9eaba
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/NDCG/Summary_performance_DMS_substitutions_NDCG.html
@@ -0,0 +1,1670 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_NDCG |
+ Bootstrap_standard_error_NDCG |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Depth_1 |
+ Depth_2 |
+ Depth_3 |
+ Depth_4 |
+ Depth_5+ |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ TranceptEVE L |
+ Hybrid - Alignment & PLM |
+ 0.786 |
+ 0.000 |
+ 0.794 |
+ 0.730 |
+ 0.801 |
+ 0.767 |
+ 0.839 |
+ 0.762 |
+ 0.781 |
+ 0.829 |
+ 0.787 |
+ 0.816 |
+ 0.833 |
+ 0.743 |
+ 0.800 |
+ 0.715 |
+ 0.694 |
+ 0.670 |
+ 0.681 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 2 |
+ TranceptEVE S |
+ Hybrid - Alignment & PLM |
+ 0.784 |
+ 0.001 |
+ 0.791 |
+ 0.731 |
+ 0.799 |
+ 0.761 |
+ 0.838 |
+ 0.759 |
+ 0.779 |
+ 0.824 |
+ 0.786 |
+ 0.813 |
+ 0.826 |
+ 0.740 |
+ 0.797 |
+ 0.711 |
+ 0.671 |
+ 0.662 |
+ 0.673 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 3 |
+ TranceptEVE M |
+ Hybrid - Alignment & PLM |
+ 0.784 |
+ 0.001 |
+ 0.793 |
+ 0.727 |
+ 0.799 |
+ 0.764 |
+ 0.837 |
+ 0.759 |
+ 0.782 |
+ 0.822 |
+ 0.787 |
+ 0.814 |
+ 0.824 |
+ 0.743 |
+ 0.798 |
+ 0.707 |
+ 0.669 |
+ 0.664 |
+ 0.674 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 4 |
+ EVE (ensemble) |
+ Alignment-based model |
+ 0.783 |
+ 0.001 |
+ 0.791 |
+ 0.724 |
+ 0.798 |
+ 0.762 |
+ 0.838 |
+ 0.754 |
+ 0.779 |
+ 0.826 |
+ 0.784 |
+ 0.810 |
+ 0.829 |
+ 0.742 |
+ 0.796 |
+ 0.708 |
+ 0.688 |
+ 0.674 |
+ 0.666 |
+ EVE model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 5 |
+ EVE (single) |
+ Alignment-based model |
+ 0.781 |
+ 0.001 |
+ 0.789 |
+ 0.724 |
+ 0.796 |
+ 0.761 |
+ 0.835 |
+ 0.752 |
+ 0.778 |
+ 0.823 |
+ 0.782 |
+ 0.810 |
+ 0.826 |
+ 0.740 |
+ 0.794 |
+ 0.710 |
+ 0.685 |
+ 0.671 |
+ 0.668 |
+ EVE model (single seed) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 6 |
+ Tranception L |
+ Hybrid - Alignment & PLM |
+ 0.779 |
+ 0.002 |
+ 0.794 |
+ 0.724 |
+ 0.795 |
+ 0.760 |
+ 0.824 |
+ 0.759 |
+ 0.773 |
+ 0.819 |
+ 0.788 |
+ 0.807 |
+ 0.813 |
+ 0.727 |
+ 0.793 |
+ 0.705 |
+ 0.680 |
+ 0.653 |
+ 0.668 |
+ Tranception Large model (700M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 7 |
+ MSA Transformer (ensemble) |
+ Hybrid - Alignment & PLM |
+ 0.777 |
+ 0.004 |
+ 0.789 |
+ 0.708 |
+ 0.806 |
+ 0.749 |
+ 0.835 |
+ 0.752 |
+ 0.773 |
+ 0.819 |
+ 0.780 |
+ 0.812 |
+ 0.823 |
+ 0.723 |
+ 0.791 |
+ 0.693 |
+ 0.699 |
+ 0.696 |
+ 0.696 |
+ MSA Transformer (ensemble of 5 MSA samples) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 8 |
+ GEMME |
+ Alignment-based model |
+ 0.777 |
+ 0.002 |
+ 0.795 |
+ 0.711 |
+ 0.794 |
+ 0.756 |
+ 0.828 |
+ 0.758 |
+ 0.775 |
+ 0.813 |
+ 0.779 |
+ 0.805 |
+ 0.817 |
+ 0.743 |
+ 0.789 |
+ 0.690 |
+ 0.674 |
+ 0.674 |
+ 0.708 |
+ GEMME model |
+ <a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a> |
+
+
+ 9 |
+ EVmutation |
+ Alignment-based model |
+ 0.777 |
+ 0.002 |
+ 0.792 |
+ 0.708 |
+ 0.792 |
+ 0.757 |
+ 0.835 |
+ 0.748 |
+ 0.776 |
+ 0.821 |
+ 0.778 |
+ 0.813 |
+ 0.826 |
+ 0.734 |
+ 0.793 |
+ 0.717 |
+ 0.684 |
+ 0.674 |
+ 0.697 |
+ EVmutation model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 10 |
+ DeepSequence (ensemble) |
+ Alignment-based model |
+ 0.776 |
+ 0.003 |
+ 0.785 |
+ 0.716 |
+ 0.787 |
+ 0.758 |
+ 0.835 |
+ 0.743 |
+ 0.773 |
+ 0.822 |
+ 0.781 |
+ 0.806 |
+ 0.823 |
+ 0.730 |
+ 0.791 |
+ 0.705 |
+ 0.671 |
+ 0.654 |
+ 0.668 |
+ DeepSequence model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 11 |
+ VESPA |
+ Protein language model |
+ 0.775 |
+ 0.004 |
+ 0.789 |
+ 0.711 |
+ 0.790 |
+ 0.752 |
+ 0.835 |
+ 0.741 |
+ 0.773 |
+ 0.824 |
+ 0.776 |
+ 0.810 |
+ 0.833 |
+ 0.719 |
+ 0.789 |
+ 0.664 |
+ 0.714 |
+ 0.659 |
+ 0.645 |
+ VESPA model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 12 |
+ DeepSequence (single) |
+ Alignment-based model |
+ 0.774 |
+ 0.003 |
+ 0.786 |
+ 0.711 |
+ 0.787 |
+ 0.754 |
+ 0.834 |
+ 0.745 |
+ 0.771 |
+ 0.820 |
+ 0.783 |
+ 0.807 |
+ 0.818 |
+ 0.723 |
+ 0.791 |
+ 0.703 |
+ 0.659 |
+ 0.653 |
+ 0.670 |
+ DeepSequence model (single seed) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 13 |
+ Tranception M |
+ Hybrid - Alignment & PLM |
+ 0.768 |
+ 0.004 |
+ 0.781 |
+ 0.718 |
+ 0.783 |
+ 0.741 |
+ 0.816 |
+ 0.748 |
+ 0.766 |
+ 0.795 |
+ 0.781 |
+ 0.801 |
+ 0.781 |
+ 0.718 |
+ 0.781 |
+ 0.688 |
+ 0.624 |
+ 0.635 |
+ 0.648 |
+ Tranception Medium model (300M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 14 |
+ SaProt (650M) |
+ Hybrid - Structure & PLM |
+ 0.768 |
+ 0.006 |
+ 0.767 |
+ 0.720 |
+ 0.797 |
+ 0.693 |
+ 0.861 |
+ 0.701 |
+ 0.752 |
+ 0.832 |
+ 0.784 |
+ 0.805 |
+ 0.827 |
+ 0.613 |
+ 0.779 |
+ 0.705 |
+ 0.621 |
+ 0.608 |
+ 0.606 |
+ SaProt (650M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 15 |
+ Progen2 XL |
+ Protein language model |
+ 0.768 |
+ 0.004 |
+ 0.771 |
+ 0.700 |
+ 0.786 |
+ 0.757 |
+ 0.823 |
+ 0.730 |
+ 0.768 |
+ 0.815 |
+ 0.772 |
+ 0.791 |
+ 0.824 |
+ 0.718 |
+ 0.786 |
+ 0.669 |
+ 0.661 |
+ 0.602 |
+ 0.584 |
+ Progen2 xlarge model (6.4B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 16 |
+ Tranception S |
+ Hybrid - Alignment & PLM |
+ 0.766 |
+ 0.004 |
+ 0.774 |
+ 0.725 |
+ 0.785 |
+ 0.734 |
+ 0.813 |
+ 0.749 |
+ 0.758 |
+ 0.794 |
+ 0.776 |
+ 0.797 |
+ 0.780 |
+ 0.710 |
+ 0.776 |
+ 0.689 |
+ 0.626 |
+ 0.636 |
+ 0.650 |
+ Tranception Small model (85M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 17 |
+ MSA Transformer (single) |
+ Hybrid - Alignment & PLM |
+ 0.766 |
+ 0.004 |
+ 0.781 |
+ 0.696 |
+ 0.786 |
+ 0.745 |
+ 0.823 |
+ 0.743 |
+ 0.767 |
+ 0.805 |
+ 0.771 |
+ 0.809 |
+ 0.807 |
+ 0.712 |
+ 0.782 |
+ 0.680 |
+ 0.695 |
+ 0.695 |
+ 0.694 |
+ MSA Transformer (single MSA sample) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 18 |
+ ProtSSN (ensemble) |
+ Hybrid - Structure & PLM |
+ 0.766 |
+ 0.005 |
+ 0.772 |
+ 0.714 |
+ 0.776 |
+ 0.710 |
+ 0.858 |
+ 0.711 |
+ 0.754 |
+ 0.833 |
+ 0.776 |
+ 0.812 |
+ 0.832 |
+ 0.645 |
+ 0.778 |
+ 0.698 |
+ 0.624 |
+ 0.596 |
+ 0.594 |
+ ProtSSN (ensemble of 9 models) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 19 |
+ ProtSSN (k=30 h=768) |
+ Hybrid - Structure & PLM |
+ 0.765 |
+ 0.005 |
+ 0.774 |
+ 0.708 |
+ 0.784 |
+ 0.707 |
+ 0.854 |
+ 0.714 |
+ 0.752 |
+ 0.830 |
+ 0.776 |
+ 0.809 |
+ 0.830 |
+ 0.638 |
+ 0.776 |
+ 0.694 |
+ 0.616 |
+ 0.590 |
+ 0.587 |
+ ProtSSN (k=30, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 20 |
+ MIF-ST |
+ Hybrid - Structure & PLM |
+ 0.765 |
+ 0.004 |
+ 0.769 |
+ 0.707 |
+ 0.790 |
+ 0.720 |
+ 0.839 |
+ 0.724 |
+ 0.755 |
+ 0.817 |
+ 0.769 |
+ 0.799 |
+ 0.827 |
+ 0.671 |
+ 0.781 |
+ 0.684 |
+ 0.662 |
+ 0.623 |
+ 0.645 |
+ MIF-ST model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 21 |
+ ProtSSN (k=20 h=512) |
+ Hybrid - Structure & PLM |
+ 0.764 |
+ 0.005 |
+ 0.773 |
+ 0.711 |
+ 0.769 |
+ 0.709 |
+ 0.856 |
+ 0.712 |
+ 0.753 |
+ 0.831 |
+ 0.775 |
+ 0.811 |
+ 0.830 |
+ 0.644 |
+ 0.777 |
+ 0.694 |
+ 0.638 |
+ 0.614 |
+ 0.604 |
+ ProtSSN (k=20, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 22 |
+ ProtSSN (k=30 h=512) |
+ Hybrid - Structure & PLM |
+ 0.763 |
+ 0.005 |
+ 0.769 |
+ 0.713 |
+ 0.776 |
+ 0.707 |
+ 0.854 |
+ 0.705 |
+ 0.753 |
+ 0.829 |
+ 0.774 |
+ 0.808 |
+ 0.827 |
+ 0.639 |
+ 0.775 |
+ 0.689 |
+ 0.615 |
+ 0.599 |
+ 0.590 |
+ ProtSSN (k=30, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 23 |
+ ProtSSN (k=20 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.762 |
+ 0.005 |
+ 0.772 |
+ 0.708 |
+ 0.769 |
+ 0.710 |
+ 0.855 |
+ 0.705 |
+ 0.751 |
+ 0.832 |
+ 0.773 |
+ 0.812 |
+ 0.828 |
+ 0.641 |
+ 0.776 |
+ 0.697 |
+ 0.630 |
+ 0.600 |
+ 0.604 |
+ ProtSSN (k=20, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 24 |
+ ProtSSN (k=20 h=768) |
+ Hybrid - Structure & PLM |
+ 0.761 |
+ 0.006 |
+ 0.769 |
+ 0.702 |
+ 0.772 |
+ 0.709 |
+ 0.855 |
+ 0.704 |
+ 0.751 |
+ 0.831 |
+ 0.772 |
+ 0.807 |
+ 0.828 |
+ 0.643 |
+ 0.775 |
+ 0.697 |
+ 0.617 |
+ 0.587 |
+ 0.594 |
+ ProtSSN (k=20, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 25 |
+ ProtSSN (k=10 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.761 |
+ 0.006 |
+ 0.767 |
+ 0.706 |
+ 0.772 |
+ 0.706 |
+ 0.856 |
+ 0.706 |
+ 0.752 |
+ 0.828 |
+ 0.774 |
+ 0.809 |
+ 0.825 |
+ 0.639 |
+ 0.775 |
+ 0.679 |
+ 0.599 |
+ 0.578 |
+ 0.574 |
+ ProtSSN (k=10, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 26 |
+ Wavenet |
+ Alignment-based model |
+ 0.761 |
+ 0.005 |
+ 0.759 |
+ 0.703 |
+ 0.777 |
+ 0.740 |
+ 0.825 |
+ 0.720 |
+ 0.761 |
+ 0.809 |
+ 0.769 |
+ 0.782 |
+ 0.813 |
+ 0.709 |
+ 0.777 |
+ 0.692 |
+ 0.645 |
+ 0.597 |
+ 0.580 |
+ Wavenet model |
+ <a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a> |
+
+
+ 27 |
+ ProtSSN (k=30 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.760 |
+ 0.005 |
+ 0.771 |
+ 0.710 |
+ 0.767 |
+ 0.701 |
+ 0.853 |
+ 0.710 |
+ 0.745 |
+ 0.829 |
+ 0.772 |
+ 0.806 |
+ 0.830 |
+ 0.627 |
+ 0.773 |
+ 0.698 |
+ 0.622 |
+ 0.597 |
+ 0.588 |
+ ProtSSN (k=30, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 28 |
+ ProtSSN (k=10 h=768) |
+ Hybrid - Structure & PLM |
+ 0.760 |
+ 0.005 |
+ 0.770 |
+ 0.699 |
+ 0.770 |
+ 0.706 |
+ 0.853 |
+ 0.703 |
+ 0.752 |
+ 0.827 |
+ 0.774 |
+ 0.803 |
+ 0.826 |
+ 0.640 |
+ 0.774 |
+ 0.680 |
+ 0.610 |
+ 0.576 |
+ 0.566 |
+ ProtSSN (k=10, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 29 |
+ ESM2 (15B) |
+ Protein language model |
+ 0.759 |
+ 0.006 |
+ 0.750 |
+ 0.710 |
+ 0.780 |
+ 0.728 |
+ 0.827 |
+ 0.703 |
+ 0.753 |
+ 0.814 |
+ 0.778 |
+ 0.775 |
+ 0.816 |
+ 0.652 |
+ 0.777 |
+ 0.675 |
+ 0.599 |
+ 0.553 |
+ 0.516 |
+ ESM2 model (15B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 30 |
+ VESPAl |
+ Protein language model |
+ 0.759 |
+ 0.005 |
+ 0.780 |
+ 0.697 |
+ 0.765 |
+ 0.739 |
+ 0.814 |
+ 0.731 |
+ 0.756 |
+ 0.804 |
+ 0.760 |
+ 0.793 |
+ 0.817 |
+ 0.700 |
+ 0.771 |
+ 0.643 |
+ 0.699 |
+ 0.645 |
+ 0.632 |
+ VESPAl model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 31 |
+ ProtSSN (k=10 h=512) |
+ Hybrid - Structure & PLM |
+ 0.758 |
+ 0.006 |
+ 0.769 |
+ 0.700 |
+ 0.771 |
+ 0.702 |
+ 0.849 |
+ 0.707 |
+ 0.746 |
+ 0.826 |
+ 0.771 |
+ 0.807 |
+ 0.824 |
+ 0.627 |
+ 0.771 |
+ 0.691 |
+ 0.603 |
+ 0.574 |
+ 0.567 |
+ ProtSSN (k=10, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 32 |
+ Progen2 L |
+ Protein language model |
+ 0.755 |
+ 0.006 |
+ 0.763 |
+ 0.691 |
+ 0.791 |
+ 0.735 |
+ 0.795 |
+ 0.718 |
+ 0.744 |
+ 0.802 |
+ 0.773 |
+ 0.768 |
+ 0.795 |
+ 0.668 |
+ 0.771 |
+ 0.650 |
+ 0.628 |
+ 0.589 |
+ 0.561 |
+ Progen2 large model (2.7B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 33 |
+ ESM2 (3B) |
+ Protein language model |
+ 0.755 |
+ 0.006 |
+ 0.748 |
+ 0.706 |
+ 0.775 |
+ 0.709 |
+ 0.835 |
+ 0.693 |
+ 0.743 |
+ 0.819 |
+ 0.774 |
+ 0.785 |
+ 0.811 |
+ 0.626 |
+ 0.772 |
+ 0.682 |
+ 0.592 |
+ 0.566 |
+ 0.511 |
+ ESM2 model (3B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 34 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.752 |
+ 0.005 |
+ 0.768 |
+ 0.692 |
+ 0.775 |
+ 0.749 |
+ 0.776 |
+ 0.723 |
+ 0.742 |
+ 0.793 |
+ 0.761 |
+ 0.742 |
+ 0.797 |
+ 0.712 |
+ 0.772 |
+ 0.663 |
+ 0.665 |
+ 0.607 |
+ 0.599 |
+ Tranception Large model (700M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 35 |
+ RITA XL |
+ Protein language model |
+ 0.752 |
+ 0.006 |
+ 0.750 |
+ 0.698 |
+ 0.784 |
+ 0.734 |
+ 0.792 |
+ 0.699 |
+ 0.753 |
+ 0.784 |
+ 0.773 |
+ 0.737 |
+ 0.777 |
+ 0.702 |
+ 0.768 |
+ 0.646 |
+ 0.559 |
+ 0.555 |
+ 0.518 |
+ RITA xlarge model (1.2B params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 36 |
+ Progen2 M |
+ Protein language model |
+ 0.751 |
+ 0.006 |
+ 0.747 |
+ 0.696 |
+ 0.786 |
+ 0.734 |
+ 0.794 |
+ 0.690 |
+ 0.746 |
+ 0.798 |
+ 0.772 |
+ 0.751 |
+ 0.781 |
+ 0.679 |
+ 0.770 |
+ 0.634 |
+ 0.570 |
+ 0.539 |
+ 0.485 |
+ Progen2 medium model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 37 |
+ ESM-1v (ensemble) |
+ Protein language model |
+ 0.750 |
+ 0.006 |
+ 0.740 |
+ 0.701 |
+ 0.776 |
+ 0.721 |
+ 0.811 |
+ 0.677 |
+ 0.739 |
+ 0.814 |
+ 0.770 |
+ 0.768 |
+ 0.797 |
+ 0.641 |
+ 0.768 |
+ 0.670 |
+ 0.595 |
+ 0.553 |
+ 0.498 |
+ ESM-1v (ensemble of 5 independently-trained models) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 38 |
+ CARP (640M) |
+ Protein language model |
+ 0.748 |
+ 0.006 |
+ 0.750 |
+ 0.693 |
+ 0.771 |
+ 0.709 |
+ 0.818 |
+ 0.695 |
+ 0.740 |
+ 0.803 |
+ 0.774 |
+ 0.780 |
+ 0.785 |
+ 0.633 |
+ 0.768 |
+ 0.676 |
+ 0.593 |
+ 0.554 |
+ 0.522 |
+ CARP model (640M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 39 |
+ ESM-IF1 |
+ Inverse folding model |
+ 0.748 |
+ 0.007 |
+ 0.738 |
+ 0.698 |
+ 0.762 |
+ 0.676 |
+ 0.864 |
+ 0.681 |
+ 0.740 |
+ 0.824 |
+ 0.754 |
+ 0.801 |
+ 0.822 |
+ 0.629 |
+ 0.761 |
+ 0.716 |
+ 0.619 |
+ 0.616 |
+ 0.648 |
+ ESM-IF1 model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a> |
+
+
+ 40 |
+ Progen2 Base |
+ Protein language model |
+ 0.748 |
+ 0.006 |
+ 0.752 |
+ 0.696 |
+ 0.784 |
+ 0.727 |
+ 0.779 |
+ 0.707 |
+ 0.740 |
+ 0.781 |
+ 0.776 |
+ 0.749 |
+ 0.760 |
+ 0.660 |
+ 0.766 |
+ 0.640 |
+ 0.563 |
+ 0.550 |
+ 0.506 |
+ Progen2 base model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 41 |
+ ESM2 (650M) |
+ Protein language model |
+ 0.747 |
+ 0.007 |
+ 0.739 |
+ 0.705 |
+ 0.764 |
+ 0.694 |
+ 0.834 |
+ 0.670 |
+ 0.733 |
+ 0.819 |
+ 0.768 |
+ 0.778 |
+ 0.807 |
+ 0.599 |
+ 0.764 |
+ 0.686 |
+ 0.601 |
+ 0.554 |
+ 0.514 |
+ ESM2 model (650M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 42 |
+ ESM-1b |
+ Protein language model |
+ 0.747 |
+ 0.009 |
+ 0.767 |
+ 0.668 |
+ 0.767 |
+ 0.700 |
+ 0.832 |
+ 0.691 |
+ 0.741 |
+ 0.817 |
+ 0.771 |
+ 0.798 |
+ 0.809 |
+ 0.608 |
+ 0.761 |
+ 0.679 |
+ 0.579 |
+ 0.542 |
+ 0.544 |
+ ESM-1b (w/ Brandes et al. extensions) |
+ [1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a> |
+
+
+ 43 |
+ Site-Independent |
+ Alignment-based model |
+ 0.747 |
+ 0.005 |
+ 0.751 |
+ 0.700 |
+ 0.760 |
+ 0.720 |
+ 0.802 |
+ 0.736 |
+ 0.744 |
+ 0.772 |
+ 0.759 |
+ 0.781 |
+ 0.764 |
+ 0.695 |
+ 0.764 |
+ 0.687 |
+ 0.623 |
+ 0.637 |
+ 0.644 |
+ Site-Independent model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 44 |
+ RITA L |
+ Protein language model |
+ 0.747 |
+ 0.006 |
+ 0.741 |
+ 0.697 |
+ 0.778 |
+ 0.731 |
+ 0.786 |
+ 0.697 |
+ 0.746 |
+ 0.781 |
+ 0.769 |
+ 0.737 |
+ 0.767 |
+ 0.697 |
+ 0.763 |
+ 0.643 |
+ 0.552 |
+ 0.552 |
+ 0.496 |
+ RITA large model (680M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 45 |
+ MIF |
+ Inverse folding model |
+ 0.743 |
+ 0.007 |
+ 0.736 |
+ 0.694 |
+ 0.780 |
+ 0.662 |
+ 0.844 |
+ 0.706 |
+ 0.727 |
+ 0.796 |
+ 0.755 |
+ 0.771 |
+ 0.794 |
+ 0.632 |
+ 0.757 |
+ 0.682 |
+ 0.623 |
+ 0.619 |
+ 0.637 |
+ MIF model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 46 |
+ Unirep evotuned |
+ Hybrid - Alignment & PLM |
+ 0.739 |
+ 0.005 |
+ 0.741 |
+ 0.682 |
+ 0.778 |
+ 0.714 |
+ 0.778 |
+ 0.725 |
+ 0.730 |
+ 0.761 |
+ 0.747 |
+ 0.736 |
+ 0.771 |
+ 0.692 |
+ 0.753 |
+ 0.623 |
+ 0.619 |
+ 0.588 |
+ 0.554 |
+ Unirep model w/ evotuning |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 47 |
+ SaProt (35M) |
+ Hybrid - Structure & PLM |
+ 0.734 |
+ 0.009 |
+ 0.726 |
+ 0.684 |
+ 0.777 |
+ 0.658 |
+ 0.826 |
+ 0.660 |
+ 0.716 |
+ 0.800 |
+ 0.753 |
+ 0.769 |
+ 0.785 |
+ 0.575 |
+ 0.753 |
+ 0.714 |
+ 0.564 |
+ 0.554 |
+ 0.507 |
+ SaProt (35M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 48 |
+ ESM-1v (single) |
+ Protein language model |
+ 0.732 |
+ 0.010 |
+ 0.734 |
+ 0.664 |
+ 0.764 |
+ 0.708 |
+ 0.791 |
+ 0.652 |
+ 0.726 |
+ 0.802 |
+ 0.753 |
+ 0.756 |
+ 0.784 |
+ 0.621 |
+ 0.755 |
+ 0.659 |
+ 0.596 |
+ 0.552 |
+ 0.493 |
+ ESM-1v (single seed) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 49 |
+ RITA M |
+ Protein language model |
+ 0.731 |
+ 0.008 |
+ 0.724 |
+ 0.671 |
+ 0.771 |
+ 0.721 |
+ 0.768 |
+ 0.680 |
+ 0.725 |
+ 0.776 |
+ 0.751 |
+ 0.716 |
+ 0.754 |
+ 0.694 |
+ 0.752 |
+ 0.625 |
+ 0.542 |
+ 0.540 |
+ 0.499 |
+ RITA medium model (300M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 50 |
+ ESM2 (150M) |
+ Protein language model |
+ 0.728 |
+ 0.008 |
+ 0.724 |
+ 0.691 |
+ 0.743 |
+ 0.658 |
+ 0.821 |
+ 0.650 |
+ 0.710 |
+ 0.800 |
+ 0.758 |
+ 0.759 |
+ 0.775 |
+ 0.559 |
+ 0.746 |
+ 0.681 |
+ 0.544 |
+ 0.535 |
+ 0.485 |
+ ESM2 model (150M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 51 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.726 |
+ 0.009 |
+ 0.732 |
+ 0.670 |
+ 0.760 |
+ 0.723 |
+ 0.745 |
+ 0.673 |
+ 0.724 |
+ 0.757 |
+ 0.743 |
+ 0.712 |
+ 0.746 |
+ 0.683 |
+ 0.748 |
+ 0.634 |
+ 0.553 |
+ 0.542 |
+ 0.490 |
+ Tranception Medium model (300M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 52 |
+ CARP (76M) |
+ Protein language model |
+ 0.720 |
+ 0.007 |
+ 0.718 |
+ 0.689 |
+ 0.740 |
+ 0.663 |
+ 0.790 |
+ 0.654 |
+ 0.700 |
+ 0.780 |
+ 0.743 |
+ 0.755 |
+ 0.749 |
+ 0.579 |
+ 0.734 |
+ 0.655 |
+ 0.535 |
+ 0.523 |
+ 0.489 |
+ CARP model (76M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 53 |
+ Progen2 S |
+ Protein language model |
+ 0.718 |
+ 0.009 |
+ 0.712 |
+ 0.666 |
+ 0.755 |
+ 0.692 |
+ 0.764 |
+ 0.646 |
+ 0.709 |
+ 0.769 |
+ 0.747 |
+ 0.711 |
+ 0.743 |
+ 0.624 |
+ 0.739 |
+ 0.624 |
+ 0.535 |
+ 0.527 |
+ 0.474 |
+ Progen2 small model (150M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 54 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.713 |
+ 0.007 |
+ 0.704 |
+ 0.699 |
+ 0.747 |
+ 0.700 |
+ 0.717 |
+ 0.668 |
+ 0.700 |
+ 0.731 |
+ 0.722 |
+ 0.687 |
+ 0.723 |
+ 0.660 |
+ 0.729 |
+ 0.620 |
+ 0.542 |
+ 0.532 |
+ 0.475 |
+ Tranception Small model (85M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 55 |
+ ProteinMPNN |
+ Inverse folding model |
+ 0.713 |
+ 0.009 |
+ 0.716 |
+ 0.633 |
+ 0.717 |
+ 0.644 |
+ 0.854 |
+ 0.655 |
+ 0.715 |
+ 0.795 |
+ 0.729 |
+ 0.778 |
+ 0.782 |
+ 0.623 |
+ 0.740 |
+ 0.692 |
+ 0.599 |
+ 0.593 |
+ 0.602 |
+ ProteinMPNN model |
+ <a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a> |
+
+
+ 56 |
+ RITA S |
+ Protein language model |
+ 0.712 |
+ 0.008 |
+ 0.701 |
+ 0.680 |
+ 0.748 |
+ 0.701 |
+ 0.731 |
+ 0.665 |
+ 0.698 |
+ 0.746 |
+ 0.729 |
+ 0.687 |
+ 0.724 |
+ 0.666 |
+ 0.731 |
+ 0.623 |
+ 0.531 |
+ 0.529 |
+ 0.482 |
+ RITA small model (85M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 57 |
+ ESM2 (35M) |
+ Protein language model |
+ 0.703 |
+ 0.008 |
+ 0.702 |
+ 0.678 |
+ 0.721 |
+ 0.636 |
+ 0.777 |
+ 0.633 |
+ 0.671 |
+ 0.780 |
+ 0.728 |
+ 0.722 |
+ 0.742 |
+ 0.552 |
+ 0.722 |
+ 0.674 |
+ 0.542 |
+ 0.532 |
+ 0.488 |
+ ESM2 model (35M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 58 |
+ CARP (38M) |
+ Protein language model |
+ 0.701 |
+ 0.008 |
+ 0.709 |
+ 0.673 |
+ 0.713 |
+ 0.643 |
+ 0.765 |
+ 0.637 |
+ 0.679 |
+ 0.761 |
+ 0.724 |
+ 0.722 |
+ 0.733 |
+ 0.567 |
+ 0.716 |
+ 0.643 |
+ 0.534 |
+ 0.527 |
+ 0.496 |
+ CARP model (38M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 59 |
+ ESM2 (8M) |
+ Protein language model |
+ 0.667 |
+ 0.008 |
+ 0.665 |
+ 0.668 |
+ 0.697 |
+ 0.610 |
+ 0.693 |
+ 0.622 |
+ 0.639 |
+ 0.694 |
+ 0.679 |
+ 0.641 |
+ 0.701 |
+ 0.540 |
+ 0.684 |
+ 0.626 |
+ 0.538 |
+ 0.522 |
+ 0.467 |
+ ESM2 model (8M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 60 |
+ ProtGPT2 |
+ Protein language model |
+ 0.653 |
+ 0.010 |
+ 0.670 |
+ 0.620 |
+ 0.654 |
+ 0.616 |
+ 0.706 |
+ 0.614 |
+ 0.644 |
+ 0.693 |
+ 0.683 |
+ 0.666 |
+ 0.672 |
+ 0.556 |
+ 0.680 |
+ 0.618 |
+ 0.536 |
+ 0.516 |
+ 0.426 |
+ ProtGPT2 model |
+ <a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a> |
+
+
+ 61 |
+ Unirep |
+ Protein language model |
+ 0.647 |
+ 0.009 |
+ 0.659 |
+ 0.624 |
+ 0.671 |
+ 0.603 |
+ 0.680 |
+ 0.619 |
+ 0.628 |
+ 0.673 |
+ 0.668 |
+ 0.642 |
+ 0.678 |
+ 0.524 |
+ 0.672 |
+ 0.589 |
+ 0.521 |
+ 0.528 |
+ 0.468 |
+ Unirep model |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 62 |
+ CARP (600K) |
+ Protein language model |
+ 0.636 |
+ 0.009 |
+ 0.659 |
+ 0.593 |
+ 0.655 |
+ 0.585 |
+ 0.687 |
+ 0.605 |
+ 0.622 |
+ 0.670 |
+ 0.654 |
+ 0.640 |
+ 0.676 |
+ 0.530 |
+ 0.658 |
+ 0.583 |
+ 0.526 |
+ 0.522 |
+ 0.472 |
+ CARP model (600K params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv
new file mode 100644
index 0000000..db931e3
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.csv
@@ -0,0 +1,217 @@
+DMS ID,Site-Independent,EVmutation,DeepSequence (single),DeepSequence (ensemble),EVE (single),EVE (ensemble),Unirep,Unirep evotuned,MSA Transformer (single),MSA Transformer (ensemble),ESM-1b,ESM-1v (single),ESM-1v (ensemble),ESM2 (8M),ESM2 (35M),ESM2 (150M),ESM2 (650M),ESM2 (3B),ESM2 (15B),Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,GEMME,VESPA,VESPAl,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,CARP (38M),CARP (600K),CARP (640M),CARP (76M),MIF,MIF-ST,ESM-IF1,ProteinMPNN,ProtSSN (k=10 h=512),ProtSSN (k=10 h=768),ProtSSN (k=10 h=1280),ProtSSN (k=20 h=512),ProtSSN (k=20 h=768),ProtSSN (k=20 h=1280),ProtSSN (k=30 h=512),ProtSSN (k=30 h=768),ProtSSN (k=30 h=1280),ProtSSN (ensemble),SaProt (650M),SaProt (35M),Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A0A140D2T1_ZIKV_Sourisseau_2019,0.383,0.354,0.131,0.103,0.394,0.405,-0.133,0.062,0.436,0.439,-0.001,-0.048,0.015,-0.073,-0.054,-0.058,0.213,0.363,0.375,0.216,0.361,0.309,0.317,0.305,0.329,0.342,0.328,0.312,0.293,0.43,0.319,0.296,0.005,0.304,0.331,0.272,0.362,0.366,0.351,0.361,0.358,0.373,-0.06,-0.073,0.131,-0.061,0.273,0.29,0.287,0.129,0.253,0.251,0.282,0.296,0.277,0.269,0.263,0.269,0.265,0.278,0.199,0.116,9576,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+A0A192B1T2_9HIV1_Haddox_2018,0.481,0.407,0.413,0.432,0.509,0.516,0.0,0.513,0.511,0.515,0.456,0.492,0.516,-0.003,0.015,0.035,0.08,0.132,0.164,0.465,0.496,0.507,0.509,0.505,0.497,0.501,0.463,0.49,0.484,0.496,0.541,0.507,0.327,0.492,0.483,0.514,0.509,0.503,0.513,0.524,0.518,0.528,0.422,-0.021,0.497,0.429,0.329,0.451,0.206,0.136,0.202,0.23,0.258,0.263,0.21,0.226,0.228,0.2,0.183,0.239,0.173,0.084,12577,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+A0A1I9GEU1_NEIME_Kennouche_2019,-0.011,0.044,0.107,0.098,0.053,0.054,-0.024,0.084,0.082,0.077,0.04,0.068,0.068,-0.037,-0.047,-0.016,0.03,0.027,0.025,0.067,-0.01,0.047,0.071,0.088,0.05,0.088,0.08,0.089,0.095,0.045,0.046,0.036,0.03,0.036,0.055,0.099,0.031,0.041,0.057,0.058,0.061,0.075,-0.055,-0.056,0.039,-0.043,0.051,0.055,0.039,0.04,0.054,0.042,0.039,0.035,0.037,0.052,0.039,0.058,0.041,0.047,0.04,-0.005,922,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+A0A247D711_LISMN_Stadelmann_2021,0.436,0.459,0.109,0.041,0.428,0.428,0.003,0.026,0.299,0.32,0.087,0.088,0.093,0.064,0.084,0.097,0.066,0.084,0.084,0.059,0.021,0.015,0.04,0.078,-0.029,0.067,0.01,0.037,0.106,0.473,0.337,0.351,0.093,0.041,0.024,0.057,0.301,0.292,0.311,0.253,0.242,0.263,0.047,0.002,0.074,0.084,0.418,0.441,0.476,0.364,0.274,0.326,0.314,0.344,0.313,0.36,0.312,0.331,0.265,0.335,0.427,0.28,1653,Activity,A0A247D711_LISMN,High,Prokaryote
+A0A2Z5U3Z0_9INFA_Doud_2016,0.478,0.473,0.484,0.517,0.545,0.548,0.009,0.492,0.495,0.496,0.124,0.516,0.55,-0.004,0.013,0.064,0.507,0.498,0.501,0.413,0.467,0.534,0.52,0.542,0.433,0.52,0.513,0.494,0.504,0.538,0.49,0.4,0.076,0.47,0.525,0.523,0.518,0.548,0.537,0.557,0.576,0.573,0.006,0.0,0.339,0.017,0.43,0.487,0.462,0.174,0.489,0.474,0.529,0.522,0.526,0.524,0.535,0.516,0.545,0.537,0.199,0.152,10715,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A0A2Z5U3Z0_9INFA_Wu_2014,0.48,0.506,0.454,0.478,0.506,0.521,0.021,0.436,0.495,0.509,0.155,0.461,0.509,0.048,0.073,0.073,0.464,0.481,0.485,0.458,0.45,0.503,0.484,0.511,0.345,0.46,0.482,0.447,0.475,0.513,0.442,0.375,0.085,0.442,0.493,0.519,0.506,0.534,0.541,0.541,0.556,0.558,0.052,0.051,0.253,0.069,0.355,0.38,0.362,0.147,0.432,0.429,0.455,0.451,0.454,0.461,0.468,0.449,0.462,0.468,0.229,0.197,2350,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A4_HUMAN_Seuma_2022,0.394,0.385,0.449,0.425,0.312,0.316,0.345,0.167,0.337,0.351,0.302,0.316,0.404,0.382,0.366,0.372,0.431,0.48,0.352,0.261,0.327,0.281,0.318,0.306,0.406,0.297,0.32,0.313,0.314,0.478,0.251,0.186,0.487,0.393,0.273,0.37,0.441,0.372,0.458,0.436,0.4,0.416,0.366,0.362,0.421,0.356,0.279,0.358,-0.191,0.06,0.449,0.44,0.417,0.41,0.406,0.38,0.423,0.346,0.372,0.417,0.411,0.388,14811,Stability,A4_HUMAN,Low,Human
+A4D664_9INFA_Soh_2019,0.411,0.34,0.406,0.403,0.409,0.408,0.029,0.364,0.297,0.287,0.044,0.026,0.033,0.022,0.02,0.027,0.149,0.211,0.27,0.26,0.329,0.386,0.404,0.398,0.064,0.26,0.285,0.244,0.33,0.443,0.318,0.296,0.055,0.348,0.371,0.404,0.366,0.38,0.393,0.434,0.439,0.461,0.019,-0.0,0.08,-0.001,0.262,0.254,0.126,0.105,0.183,0.18,0.191,0.216,0.209,0.207,0.2,0.198,0.193,0.204,0.148,0.092,14421,OrganismalFitness,A4D664_9INFA,Medium,Virus
+A4GRB6_PSEAI_Chen_2020,0.317,0.492,0.657,0.664,0.623,0.641,0.344,0.529,0.641,0.681,0.68,0.647,0.668,0.431,0.528,0.655,0.738,0.712,0.627,0.567,0.409,0.533,0.564,0.624,0.526,0.622,0.639,0.638,0.705,0.671,0.742,0.673,0.242,0.423,0.552,0.629,0.557,0.588,0.652,0.653,0.66,0.679,0.481,0.071,0.671,0.601,0.664,0.713,0.629,0.422,0.728,0.718,0.711,0.719,0.729,0.733,0.728,0.733,0.735,0.745,0.722,0.513,5004,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+AACC1_PSEAI_Dandage_2018,0.282,0.474,0.331,0.414,0.487,0.491,0.196,0.191,0.501,0.504,0.413,0.483,0.489,0.196,0.248,0.256,0.49,0.511,0.518,0.391,0.206,0.232,0.276,0.306,0.273,0.428,0.407,0.413,0.438,0.465,0.507,0.455,0.021,0.259,0.246,0.417,0.427,0.418,0.466,0.471,0.466,0.495,0.216,0.17,0.354,0.243,0.257,0.374,0.357,0.183,0.481,0.492,0.483,0.488,0.495,0.486,0.486,0.477,0.494,0.5,0.461,0.282,1801,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+ACE2_HUMAN_Chan_2020,0.254,0.237,0.187,0.2,0.249,0.242,-0.049,0.077,0.236,0.251,0.244,0.173,0.213,-0.053,0.062,0.216,0.225,0.175,0.178,0.227,0.033,0.129,0.175,0.207,0.042,0.162,0.207,0.208,0.261,0.217,0.164,0.138,0.08,0.044,0.12,0.134,0.217,0.19,0.196,0.228,0.2,0.205,0.005,0.005,0.244,0.078,0.336,0.292,0.317,0.106,0.217,0.255,0.226,0.222,0.236,0.249,0.223,0.221,0.214,0.234,0.289,0.168,2223,Binding,ACE2_HUMAN,Medium,Human
+ADRB2_HUMAN_Jones_2020,0.331,0.423,0.506,0.514,0.497,0.517,0.463,0.5,0.531,0.53,0.534,0.526,0.539,0.408,0.444,0.474,0.493,0.505,0.515,0.521,0.509,0.524,0.517,0.493,0.524,0.531,0.538,0.533,0.502,0.545,0.5,0.414,0.298,0.528,0.53,0.506,0.528,0.541,0.533,0.533,0.538,0.537,0.471,0.181,0.528,0.51,0.419,0.477,0.451,0.223,0.486,0.486,0.5,0.508,0.501,0.498,0.493,0.506,0.494,0.509,0.554,0.516,7800,Activity,ADRB2_HUMAN,Medium,Human
+AICDA_HUMAN_Gajula_2014_3cycles,0.138,0.396,0.474,0.464,0.429,0.445,-0.287,0.327,0.32,0.385,0.415,0.376,0.478,-0.245,-0.264,0.196,0.328,0.347,0.262,0.479,-0.219,0.14,0.388,0.395,0.013,0.321,0.417,0.413,0.282,0.376,0.45,0.405,0.057,-0.039,0.057,0.339,0.206,0.218,0.367,0.399,0.402,0.444,-0.228,-0.238,0.379,0.258,0.328,0.401,0.328,0.243,0.337,0.404,0.344,0.447,0.359,0.365,0.341,0.343,0.312,0.359,0.257,-0.036,209,Activity,AICDA_HUMAN,Medium,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O,0.358,0.418,0.419,0.407,0.386,0.398,-0.301,0.13,0.22,0.24,0.374,0.07,0.423,-0.209,0.286,0.321,0.261,0.297,0.26,0.062,-0.052,-0.218,-0.051,-0.038,-0.317,-0.089,-0.05,-0.121,0.056,0.396,0.254,0.228,0.09,-0.287,-0.139,-0.275,0.392,0.402,0.356,0.424,0.422,0.374,-0.185,-0.194,0.165,0.114,0.251,0.112,0.433,0.353,0.446,0.326,0.204,0.349,0.342,0.349,0.343,0.339,0.361,0.343,0.347,0.397,2972,Stability,AMFR_HUMAN,Medium,Human
+AMIE_PSEAE_Wrenbeck_2017,0.246,0.443,0.496,0.506,0.477,0.464,0.093,0.436,0.401,0.605,0.57,0.613,0.662,0.326,0.408,0.428,0.557,0.674,0.613,0.506,0.555,0.525,0.532,0.542,0.566,0.561,0.534,0.559,0.508,0.558,0.583,0.528,0.265,0.601,0.47,0.405,0.585,0.479,0.438,0.58,0.506,0.481,0.356,0.096,0.483,0.412,0.368,0.469,0.407,0.213,0.555,0.519,0.53,0.537,0.529,0.541,0.523,0.54,0.564,0.556,0.607,0.41,6227,Activity,AMIE_PSEAE,High,Prokaryote
+ANCSZ_Hobbs_2022,0.562,0.521,0.445,0.472,0.524,0.528,0.525,0.452,0.503,0.524,0.527,0.522,0.543,0.535,0.592,0.598,0.609,0.598,0.57,0.021,0.462,0.493,0.509,0.5,0.497,0.519,0.513,0.469,0.494,0.557,0.542,0.525,0.318,0.483,0.503,0.475,0.537,0.549,0.538,0.545,0.556,0.543,0.549,0.42,0.496,0.523,0.471,0.458,0.422,0.165,0.566,0.555,0.572,0.578,0.572,0.584,0.569,0.583,0.577,0.589,0.585,0.601,4670,Activity,ANCSZ,Medium,Eukaryote
+ARGR_ECOLI_Tsuboyama_2023_1AOY,0.277,0.413,0.411,0.43,0.431,0.423,0.306,0.386,0.43,0.463,0.409,0.296,0.405,0.332,0.297,0.511,0.497,0.499,0.435,0.434,0.372,0.439,0.448,0.403,0.362,0.437,0.473,0.461,0.418,0.46,0.409,0.31,0.073,0.394,0.396,0.43,0.405,0.415,0.439,0.465,0.46,0.471,0.285,0.267,0.45,0.408,0.725,0.651,0.738,0.636,0.513,0.532,0.527,0.529,0.517,0.508,0.503,0.526,0.494,0.533,0.604,0.511,1287,Stability,ARGR_ECOLI,Medium,Prokaryote
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.688,0.27,0.668,0.661,0.53,0.646,0.216,0.529,0.188,0.164,0.206,-0.019,0.396,0.174,0.123,0.151,0.27,0.367,0.315,0.336,0.239,0.076,0.304,0.568,0.067,0.251,0.362,0.372,0.366,0.734,0.358,0.338,0.295,0.266,0.213,0.341,0.544,0.469,0.388,0.615,0.57,0.402,0.146,0.138,0.207,0.17,0.287,0.307,0.432,-0.005,0.352,0.367,0.419,0.283,0.255,0.47,0.295,0.288,0.421,0.374,0.242,0.204,170,Binding,B2L11_HUMAN,Low,Human
+BBC1_YEAST_Tsuboyama_2023_1TG0,0.216,0.332,0.395,0.399,0.394,0.403,0.212,0.256,0.438,0.454,0.425,0.425,0.451,0.331,0.421,0.423,0.479,0.458,0.499,0.332,0.335,0.346,0.376,0.343,0.144,0.391,0.403,0.382,0.307,0.338,0.345,0.314,0.273,0.354,0.261,0.299,0.392,0.338,0.365,0.421,0.386,0.405,0.21,0.113,0.314,0.242,0.558,0.451,0.644,0.566,0.434,0.444,0.457,0.472,0.463,0.463,0.46,0.458,0.471,0.461,0.54,0.579,2069,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU,0.364,0.418,0.357,0.364,0.365,0.382,0.1,0.248,0.383,0.501,0.394,0.329,0.406,0.166,0.182,0.301,0.51,0.521,0.421,0.407,0.122,0.362,0.22,0.376,0.207,0.421,0.281,0.402,0.435,0.607,0.549,0.496,0.07,0.245,0.294,0.431,0.44,0.44,0.483,0.392,0.396,0.436,0.116,0.109,0.205,0.175,0.579,0.537,0.683,0.564,0.549,0.554,0.563,0.579,0.561,0.581,0.529,0.556,0.543,0.562,0.516,0.296,1572,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Deng_2012,0.322,0.504,0.508,0.521,0.507,0.508,0.075,0.312,0.513,0.55,0.517,0.501,0.518,0.243,0.354,0.461,0.528,0.461,0.352,0.455,0.375,0.378,0.4,0.383,0.391,0.445,0.453,0.406,0.343,0.464,0.519,0.487,0.107,0.392,0.402,0.373,0.457,0.471,0.473,0.513,0.526,0.522,0.33,0.011,0.48,0.406,0.423,0.496,0.46,0.252,0.53,0.535,0.538,0.552,0.548,0.542,0.544,0.53,0.541,0.555,0.539,0.406,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Firnberg_2014,0.471,0.708,0.732,0.747,0.709,0.729,0.134,0.459,0.701,0.741,0.703,0.665,0.707,0.411,0.555,0.658,0.737,0.573,0.424,0.662,0.566,0.563,0.537,0.531,0.595,0.646,0.654,0.577,0.452,0.683,0.758,0.752,0.186,0.551,0.529,0.48,0.646,0.638,0.637,0.731,0.734,0.736,0.54,0.046,0.7,0.595,0.616,0.703,0.696,0.321,0.718,0.699,0.703,0.729,0.739,0.73,0.736,0.72,0.729,0.745,0.754,0.605,4783,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Jacquier_2013,0.456,0.689,0.698,0.72,0.703,0.723,0.113,0.441,0.665,0.712,0.668,0.656,0.684,0.362,0.52,0.626,0.704,0.588,0.503,0.635,0.564,0.537,0.542,0.528,0.573,0.632,0.625,0.578,0.462,0.567,0.719,0.691,0.143,0.54,0.549,0.508,0.628,0.642,0.647,0.716,0.728,0.734,0.482,0.04,0.663,0.562,0.527,0.64,0.603,0.29,0.677,0.671,0.672,0.691,0.686,0.686,0.702,0.677,0.684,0.703,0.709,0.573,989,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Stiffler_2015,0.466,0.707,0.732,0.743,0.711,0.727,0.115,0.459,0.688,0.731,0.698,0.668,0.707,0.399,0.557,0.661,0.731,0.589,0.43,0.655,0.568,0.57,0.528,0.525,0.597,0.651,0.661,0.577,0.46,0.679,0.767,0.761,0.158,0.558,0.522,0.475,0.645,0.63,0.629,0.73,0.731,0.733,0.532,0.03,0.703,0.598,0.604,0.693,0.682,0.324,0.715,0.692,0.698,0.72,0.725,0.724,0.729,0.709,0.724,0.737,0.748,0.593,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BRCA1_HUMAN_Findlay_2018,0.501,0.385,0.441,0.432,0.424,0.466,0.125,0.189,0.387,0.407,0.494,0.404,0.447,0.134,0.392,0.455,0.515,0.497,0.394,0.296,0.145,0.379,0.405,0.361,0.305,0.473,0.457,0.442,0.423,0.438,0.539,0.482,0.22,0.151,0.179,0.409,0.456,0.456,0.514,0.475,0.476,0.533,0.229,0.132,0.534,0.457,0.44,0.469,0.116,0.12,0.515,0.501,0.506,0.502,0.512,0.504,0.513,0.512,0.504,0.519,0.539,0.481,1837,OrganismalFitness,BRCA1_HUMAN,Low,Human
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.481,0.346,0.44,0.445,0.447,0.481,0.095,0.477,0.095,-0.03,0.48,0.183,0.103,0.088,0.103,0.416,0.51,0.464,0.473,0.033,0.09,0.494,0.534,0.482,0.533,0.483,0.496,0.499,0.014,0.4,0.297,0.328,0.14,0.119,0.185,0.138,0.42,0.425,0.426,0.419,0.415,0.414,0.163,0.093,0.497,0.047,0.052,0.053,-0.091,0.285,0.436,0.477,0.473,0.424,0.455,0.436,0.456,0.438,0.453,0.465,0.0,0.016,265,OrganismalFitness,BRCA2_HUMAN,,Human
+C6KNH7_9INFA_Lee_2018,0.393,0.371,0.351,0.357,0.433,0.436,-0.025,0.451,0.374,0.378,0.06,0.428,0.492,-0.014,-0.022,-0.007,0.483,0.405,0.426,0.343,0.396,0.363,0.378,0.349,0.254,0.454,0.428,0.465,0.37,0.475,0.457,0.36,0.11,0.354,0.374,0.392,0.412,0.419,0.433,0.439,0.441,0.452,-0.024,-0.033,0.294,-0.002,0.513,0.539,0.544,0.227,0.498,0.497,0.519,0.52,0.531,0.51,0.528,0.519,0.521,0.529,0.313,0.185,10754,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+CALM1_HUMAN_Weile_2017,0.175,0.233,0.238,0.233,0.238,0.236,0.174,0.187,0.236,0.253,0.25,0.235,0.264,0.161,0.199,0.195,0.212,0.218,0.225,0.229,0.173,0.227,0.242,0.254,0.232,0.276,0.279,0.3,0.313,0.239,0.21,0.146,0.087,0.216,0.253,0.291,0.218,0.242,0.266,0.238,0.244,0.246,0.213,0.156,0.268,0.251,0.106,0.143,0.164,0.088,0.177,0.182,0.188,0.171,0.189,0.176,0.18,0.192,0.178,0.188,0.287,0.238,1813,OrganismalFitness,CALM1_HUMAN,High,Human
+CAPSD_AAV2S_Sinai_2021,0.407,0.345,0.317,0.372,0.346,0.33,0.374,0.424,0.329,0.363,0.183,0.196,0.199,0.254,0.286,0.203,0.277,0.183,0.124,0.258,0.191,0.25,0.269,0.279,0.203,0.198,0.266,0.204,0.399,0.445,0.183,0.177,0.132,0.202,0.262,0.492,0.38,0.386,0.473,0.338,0.343,0.426,0.092,0.16,0.26,0.112,0.402,0.393,0.347,0.319,0.235,0.225,0.216,0.235,0.214,0.215,0.22,0.208,0.228,0.223,0.301,0.222,42328,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAR11_HUMAN_Meitlis_2020_gof,0.265,0.174,0.164,0.15,0.169,0.184,0.1,0.009,0.218,0.245,0.304,0.324,0.302,0.068,0.095,0.337,0.326,0.345,0.357,0.165,0.151,0.198,0.219,0.216,0.037,0.215,0.229,0.226,0.167,0.179,0.326,0.296,0.189,0.005,0.148,0.124,0.203,0.208,0.182,0.204,0.201,0.182,0.089,0.043,0.261,0.215,0.318,0.294,0.335,0.142,0.343,0.337,0.335,0.323,0.33,0.331,0.317,0.344,0.346,0.342,0.408,0.333,2374,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAR11_HUMAN_Meitlis_2020_lof,0.421,0.313,0.277,0.258,0.31,0.327,0.108,-0.014,0.299,0.293,0.446,0.49,0.461,0.072,0.115,0.462,0.501,0.533,0.54,0.239,0.141,0.312,0.3,0.277,0.107,0.303,0.306,0.358,0.191,0.338,0.433,0.378,0.334,-0.003,0.264,0.195,0.29,0.325,0.271,0.32,0.326,0.281,0.087,0.041,0.417,0.311,0.417,0.445,0.451,0.175,0.518,0.521,0.515,0.508,0.5,0.518,0.503,0.512,0.526,0.525,0.579,0.425,2395,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAS9_STRP1_Spencer_2017_positive,0.159,0.179,0.162,0.166,0.169,0.177,0.04,0.1,0.18,0.183,0.175,0.069,0.073,0.063,0.09,0.154,0.182,0.188,0.194,0.006,0.043,0.081,0.173,0.12,0.043,0.194,0.182,0.179,0.068,0.187,0.199,0.179,0.027,0.035,0.047,0.166,0.17,0.172,0.189,0.184,0.184,0.194,0.061,0.028,0.181,0.106,0.106,0.181,0.036,0.037,0.176,0.177,0.178,0.185,0.18,0.181,0.177,0.178,0.177,0.183,0.177,0.114,8117,Activity,CAS9_STRP1,Medium,Prokaryote
+CASP3_HUMAN_Roychowdhury_2020,0.382,0.543,0.61,0.622,0.626,0.632,0.054,0.478,0.644,0.653,0.59,0.595,0.624,0.264,0.574,0.639,0.638,0.554,0.514,0.603,0.233,0.512,0.517,0.532,0.505,0.556,0.554,0.526,0.567,0.608,0.599,0.542,0.217,0.092,0.482,0.52,0.533,0.562,0.582,0.628,0.619,0.628,0.422,-0.006,0.616,0.563,0.414,0.558,0.499,0.263,0.553,0.571,0.571,0.57,0.575,0.567,0.585,0.571,0.576,0.59,0.652,0.513,1567,Activity,CASP3_HUMAN,High,Human
+CASP7_HUMAN_Roychowdhury_2020,0.372,0.516,0.619,0.617,0.609,0.613,0.057,0.47,0.6,0.627,0.585,0.611,0.632,0.306,0.598,0.632,0.622,0.572,0.562,0.611,0.206,0.532,0.529,0.558,0.529,0.579,0.589,0.563,0.572,0.642,0.596,0.538,0.332,0.103,0.543,0.521,0.517,0.594,0.586,0.599,0.628,0.628,0.461,0.019,0.646,0.596,0.479,0.612,0.573,0.329,0.585,0.57,0.562,0.587,0.608,0.602,0.591,0.578,0.596,0.605,0.643,0.533,1680,Activity,CASP7_HUMAN,Medium,Human
+CATR_CHLRE_Tsuboyama_2023_2AMI,0.533,0.559,0.599,0.606,0.627,0.63,0.68,0.558,0.565,0.528,0.614,0.638,0.657,0.564,0.689,0.688,0.658,0.615,0.644,0.041,0.621,0.603,0.619,0.602,0.641,0.627,0.593,0.561,0.587,0.649,0.611,0.619,0.439,0.626,0.611,0.633,0.649,0.642,0.665,0.622,0.62,0.638,0.44,0.427,0.315,0.46,0.475,0.4,0.679,0.572,0.663,0.652,0.676,0.668,0.687,0.662,0.67,0.653,0.663,0.674,0.62,0.686,1903,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,0.644,0.718,0.744,0.745,0.716,0.729,0.482,0.577,0.745,0.754,0.77,0.752,0.736,0.529,0.688,0.736,0.702,0.727,0.718,0.739,0.486,0.522,0.699,0.692,0.547,0.665,0.673,0.674,0.71,0.732,0.71,0.654,0.104,0.509,0.642,0.703,0.699,0.706,0.732,0.746,0.736,0.748,0.51,0.302,0.631,0.646,0.701,0.689,0.82,0.773,0.772,0.768,0.778,0.768,0.782,0.78,0.77,0.772,0.769,0.779,0.716,0.835,2068,Stability,CBPA2_HUMAN,Medium,Human
+CBS_HUMAN_Sun_2020,0.334,0.358,0.364,0.383,0.371,0.379,0.202,0.251,0.374,0.377,0.347,0.346,0.372,0.085,0.226,0.335,0.342,0.329,0.339,0.35,0.356,0.26,0.285,0.288,0.349,0.273,0.29,0.273,0.31,0.381,0.37,0.344,0.197,0.361,0.283,0.266,0.384,0.332,0.327,0.394,0.363,0.363,0.304,0.068,0.376,0.373,0.237,0.268,0.309,0.079,0.324,0.324,0.331,0.337,0.337,0.335,0.332,0.332,0.335,0.34,0.386,0.289,7217,OrganismalFitness,CBS_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28,0.498,0.548,0.674,0.706,0.633,0.685,-0.266,0.611,0.623,0.613,0.615,0.65,0.648,-0.271,0.735,0.699,0.682,0.609,0.606,0.648,0.486,0.589,0.581,0.587,0.553,0.577,0.589,0.531,0.569,0.633,0.667,0.653,0.423,0.425,0.505,0.54,0.601,0.6,0.623,0.676,0.698,0.701,0.621,-0.32,0.369,0.561,0.167,0.155,0.614,0.588,0.695,0.626,0.69,0.676,0.709,0.668,0.677,0.718,0.664,0.695,0.642,0.751,2282,Stability,CBX4_HUMAN,High,Human
+CCDB_ECOLI_Adkar_2012,0.357,0.486,0.52,0.547,0.528,0.531,-0.031,0.185,0.341,0.361,0.424,0.351,0.432,-0.006,-0.018,0.406,0.466,0.463,0.264,0.457,0.027,-0.015,-0.115,0.142,-0.139,0.053,-0.029,0.034,0.425,0.453,0.554,0.555,0.124,0.006,0.056,0.31,0.442,0.426,0.458,0.507,0.494,0.522,0.018,-0.05,0.444,0.02,0.285,0.426,0.343,0.242,0.436,0.401,0.414,0.443,0.437,0.454,0.436,0.464,0.486,0.453,0.438,0.194,1176,Activity,CCDB_ECOLI,High,Prokaryote
+CCDB_ECOLI_Tripathi_2016,0.434,0.506,0.524,0.541,0.517,0.528,-0.008,0.294,0.403,0.431,0.49,0.435,0.483,-0.007,-0.001,0.462,0.511,0.522,0.362,0.518,0.034,0.014,-0.074,0.171,-0.107,0.094,-0.029,0.074,0.476,0.505,0.542,0.51,0.112,0.009,0.129,0.424,0.48,0.48,0.532,0.518,0.517,0.548,0.005,-0.055,0.495,0.017,0.317,0.433,0.335,0.23,0.486,0.449,0.449,0.482,0.483,0.48,0.473,0.49,0.517,0.492,0.49,0.326,1663,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+CCR5_HUMAN_Gill_2023,0.271,0.279,0.282,0.289,0.274,0.282,0.316,0.309,0.333,0.343,0.358,0.344,0.356,0.325,0.358,0.362,0.347,0.339,0.341,0.352,0.367,0.366,0.323,0.326,0.369,0.356,0.346,0.36,0.322,0.364,0.3,0.251,0.13,0.367,0.365,0.363,0.367,0.375,0.376,0.324,0.324,0.323,0.354,0.175,0.362,0.361,0.278,0.306,0.291,0.173,0.314,0.305,0.316,0.318,0.313,0.317,0.328,0.323,0.327,0.327,0.348,0.356,6137,Binding,CCR5_HUMAN,High,Human
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.189,0.247,0.199,0.211,0.234,0.232,0.051,0.233,0.257,0.261,0.046,0.05,0.072,0.09,0.093,0.054,0.05,0.123,0.217,-0.013,0.096,0.124,0.118,0.119,0.042,0.12,0.112,0.093,0.244,0.283,0.194,0.159,0.077,0.075,0.12,0.093,0.203,0.21,0.202,0.233,0.235,0.231,0.083,0.05,0.104,0.086,0.316,0.258,0.301,0.174,0.158,0.18,0.173,0.196,0.186,0.195,0.204,0.17,0.185,0.195,0.312,0.244,3761,Binding,CD19_HUMAN,Low,Human
+CP2C9_HUMAN_Amorosi_2021_abundance,0.505,0.572,0.594,0.611,0.596,0.61,0.529,0.555,0.601,0.615,0.513,0.588,0.621,0.531,0.605,0.625,0.635,0.626,0.591,0.611,0.553,0.548,0.587,0.558,0.571,0.585,0.59,0.572,0.572,0.605,0.573,0.511,0.208,0.581,0.563,0.569,0.618,0.618,0.624,0.634,0.63,0.632,0.579,0.116,0.534,0.607,0.555,0.538,0.607,0.218,0.6,0.598,0.606,0.629,0.631,0.623,0.622,0.611,0.617,0.635,0.637,0.618,6370,Expression,CP2C9_HUMAN,High,Human
+CP2C9_HUMAN_Amorosi_2021_activity,0.526,0.59,0.626,0.645,0.616,0.635,0.583,0.615,0.606,0.635,0.519,0.642,0.666,0.57,0.655,0.679,0.679,0.678,0.62,0.652,0.587,0.592,0.619,0.573,0.618,0.587,0.593,0.592,0.582,0.637,0.603,0.544,0.225,0.638,0.605,0.572,0.666,0.658,0.638,0.668,0.659,0.65,0.638,0.094,0.557,0.666,0.603,0.577,0.656,0.263,0.645,0.649,0.652,0.663,0.674,0.672,0.668,0.656,0.662,0.681,0.673,0.667,6142,Binding,CP2C9_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM,0.389,0.504,0.432,0.445,0.47,0.477,0.26,0.4,0.461,0.461,0.481,0.546,0.605,0.449,0.686,0.518,0.484,0.423,0.415,0.406,0.449,0.455,0.502,0.514,0.443,0.587,0.53,0.504,0.501,0.532,0.401,0.389,0.244,0.341,0.474,0.546,0.497,0.536,0.576,0.468,0.513,0.53,0.416,0.326,0.494,0.538,0.562,0.541,0.653,0.62,0.531,0.52,0.532,0.557,0.58,0.556,0.539,0.58,0.548,0.556,0.575,0.72,3295,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX,0.357,0.451,0.435,0.438,0.474,0.445,0.154,0.453,0.516,0.547,0.483,0.427,0.398,0.153,0.428,0.478,0.547,0.498,0.467,0.304,0.174,0.206,0.225,0.102,0.117,0.224,0.255,0.248,0.434,0.42,0.469,0.414,0.231,0.16,0.146,0.213,0.38,0.387,0.393,0.438,0.441,0.448,0.154,0.064,0.462,0.256,0.515,0.501,0.654,0.482,0.549,0.508,0.494,0.552,0.517,0.551,0.538,0.525,0.517,0.538,0.6,0.571,1580,Stability,CUE1_YEAST,Medium,Eukaryote
+D7PM05_CLYGR_Somermeyer_2022,0.53,0.625,0.613,0.582,0.629,0.631,0.073,0.475,0.649,0.657,0.439,0.066,0.064,0.068,0.053,0.036,0.05,0.072,0.152,0.438,0.039,0.042,0.114,0.082,0.009,0.047,0.068,0.194,0.361,0.623,0.563,0.574,0.016,0.096,0.125,0.156,0.536,0.536,0.532,0.63,0.629,0.618,0.004,0.024,0.009,0.028,0.292,0.316,0.505,0.396,0.495,0.491,0.496,0.505,0.494,0.502,0.494,0.49,0.487,0.498,0.345,0.208,24515,Activity,D7PM05_CLYGR,Low,Eukaryote
+DLG4_HUMAN_Faure_2021,0.679,0.584,0.607,0.577,0.609,0.616,0.723,0.629,0.524,0.535,0.52,0.55,0.608,0.743,0.764,0.728,0.584,0.487,0.443,0.65,0.574,0.583,0.569,0.538,0.605,0.607,0.564,0.562,0.495,0.614,0.634,0.621,0.511,0.576,0.638,0.58,0.653,0.7,0.667,0.643,0.661,0.636,0.581,0.233,0.372,0.528,0.637,0.453,0.672,0.315,0.483,0.37,0.539,0.499,0.52,0.492,0.508,0.486,0.504,0.504,0.521,0.745,6976,OrganismalFitness,DLG4_HUMAN,Low,Human
+DLG4_RAT_McLaughlin_2012,0.486,0.442,0.487,0.491,0.526,0.539,0.49,0.437,0.483,0.522,0.469,0.557,0.588,0.411,0.543,0.581,0.543,0.478,0.471,0.444,0.378,0.389,0.373,0.371,0.407,0.42,0.407,0.367,0.395,0.49,0.556,0.537,0.246,0.367,0.392,0.307,0.47,0.478,0.443,0.538,0.544,0.54,0.527,0.076,0.443,0.5,0.439,0.308,0.459,0.135,0.442,0.375,0.444,0.433,0.444,0.43,0.43,0.464,0.46,0.469,0.504,0.535,1576,Binding,DLG4_RAT,Low,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC,0.171,0.198,0.218,0.23,0.2,0.259,-0.071,0.317,0.368,0.386,0.085,0.126,0.131,0.074,0.209,0.255,0.337,0.337,0.37,0.235,-0.068,0.045,0.033,0.066,0.041,0.107,0.057,0.071,0.235,0.298,0.431,0.351,0.054,-0.013,-0.023,0.031,0.212,0.2,0.211,0.243,0.227,0.231,0.106,-0.048,0.299,0.129,0.568,0.56,0.66,0.556,0.424,0.436,0.516,0.507,0.491,0.476,0.472,0.444,0.436,0.49,0.556,0.481,1008,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,0.738,0.75,0.759,0.757,0.767,0.764,0.756,0.732,0.776,0.79,0.773,0.756,0.785,0.827,0.816,0.817,0.803,0.753,0.798,0.781,0.669,0.736,0.776,0.75,0.782,0.765,0.774,0.721,0.764,0.775,0.729,0.714,0.66,0.78,0.776,0.805,0.808,0.816,0.823,0.774,0.787,0.787,0.67,0.164,0.443,0.684,0.643,0.528,0.779,0.785,0.757,0.761,0.76,0.793,0.789,0.803,0.789,0.786,0.794,0.786,0.726,0.801,2264,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,0.418,0.453,0.421,0.439,0.494,0.501,0.057,0.432,0.406,0.403,0.525,0.472,0.501,0.224,0.458,0.486,0.531,0.508,0.531,0.518,0.34,0.418,0.402,0.408,0.315,0.402,0.365,0.348,0.395,0.448,0.471,0.469,0.216,0.202,0.321,0.252,0.473,0.49,0.494,0.489,0.491,0.507,0.359,-0.177,0.513,0.439,0.38,0.394,0.419,0.439,0.492,0.481,0.497,0.507,0.491,0.512,0.477,0.493,0.504,0.497,0.567,0.531,2915,Stability,DOCK1_MOUSE,High,Eukaryote
+DYR_ECOLI_Nguyen_2023,0.34,0.424,0.448,0.453,0.451,0.454,0.004,0.425,0.505,0.481,0.523,0.535,0.545,0.133,0.5,0.525,0.544,0.529,0.511,0.502,0.373,0.421,0.281,0.342,0.435,0.388,0.387,0.454,0.333,0.48,0.502,0.477,0.148,0.405,0.235,0.225,0.46,0.33,0.356,0.459,0.406,0.418,0.469,0.013,0.519,0.499,0.264,0.428,0.374,0.157,0.501,0.459,0.47,0.483,0.492,0.49,0.503,0.495,0.511,0.505,0.511,0.471,2916,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+DYR_ECOLI_Thompson_2019,0.384,0.484,0.469,0.472,0.476,0.474,-0.018,0.321,0.47,0.501,0.469,0.416,0.436,0.055,0.349,0.455,0.48,0.506,0.512,0.485,0.199,0.36,0.266,0.31,0.342,0.436,0.476,0.441,0.418,0.451,0.451,0.408,0.1,0.355,0.348,0.347,0.407,0.415,0.423,0.459,0.476,0.481,0.318,-0.019,0.442,0.39,0.248,0.403,0.321,0.111,0.44,0.411,0.415,0.418,0.435,0.422,0.441,0.442,0.455,0.445,0.491,0.387,2363,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+ENV_HV1B9_DuenasDecamp_2016,0.369,0.397,0.238,0.367,0.388,0.377,0.056,0.353,0.341,0.366,0.359,0.415,0.389,0.11,0.055,0.035,0.012,0.048,0.095,0.388,0.38,0.358,0.408,0.419,0.263,0.394,0.401,0.391,0.374,0.389,0.355,0.322,0.253,0.368,0.373,0.404,0.392,0.396,0.407,0.39,0.392,0.392,0.343,-0.04,0.334,0.395,0.355,0.325,0.364,0.231,0.215,0.261,0.274,0.24,0.214,0.233,0.241,0.258,0.24,0.263,0.15,0.102,375,OrganismalFitness,ENV_HV1B9,Medium,Virus
+ENV_HV1BR_Haddox_2016,0.338,0.303,0.322,0.323,0.339,0.345,-0.001,0.322,0.345,0.344,0.298,0.32,0.336,-0.009,-0.006,0.004,0.046,0.069,0.16,0.337,0.35,0.358,0.371,0.364,0.345,0.358,0.358,0.354,0.362,0.35,0.325,0.29,0.189,0.344,0.36,0.358,0.359,0.366,0.362,0.367,0.369,0.365,0.233,-0.013,0.321,0.293,0.212,0.245,0.119,0.072,0.18,0.179,0.203,0.226,0.188,0.202,0.2,0.192,0.193,0.209,0.167,0.098,12863,OrganismalFitness,ENV_HV1BR,Medium,Virus
+ENVZ_ECOLI_Ghose_2023,0.115,0.095,0.18,0.178,0.187,0.188,0.04,0.19,0.19,0.198,0.154,0.204,0.209,0.194,0.207,0.195,0.172,0.154,0.113,0.183,0.164,0.152,0.201,0.183,0.176,0.164,0.199,0.183,0.163,0.178,0.133,0.135,0.142,0.18,0.184,0.193,0.207,0.213,0.215,0.195,0.201,0.199,0.194,0.134,0.187,0.185,0.055,0.149,0.114,0.036,0.189,0.186,0.168,0.188,0.22,0.197,0.183,0.19,0.2,0.199,0.148,0.224,1121,Activity,ENVZ_ECOLI,High,Prokaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,0.615,0.706,0.705,0.708,0.732,0.733,-0.32,0.684,0.781,0.789,0.762,0.787,0.789,-0.378,0.768,0.803,0.81,0.769,0.744,0.741,0.555,0.599,0.685,0.656,0.641,0.664,0.684,0.705,0.735,0.795,0.752,0.738,0.677,0.612,0.603,0.644,0.716,0.711,0.728,0.741,0.737,0.735,0.625,-0.253,0.618,0.63,0.665,0.604,0.828,0.76,0.805,0.801,0.806,0.811,0.82,0.809,0.802,0.806,0.8,0.812,0.799,0.805,1960,Stability,EPHB2_HUMAN,High,Human
+ERBB2_HUMAN_Elazar_2016,0.386,0.368,0.254,0.25,0.244,0.268,0.449,0.415,0.405,0.416,0.449,0.48,0.49,0.481,0.463,0.469,0.423,0.373,0.295,0.047,0.457,0.499,0.55,0.471,0.505,0.536,0.596,0.598,0.506,0.381,0.424,0.066,0.489,0.553,0.471,0.521,0.494,0.465,0.492,0.447,0.441,0.439,0.468,0.459,0.537,0.463,0.461,0.491,-0.139,0.022,0.41,0.406,0.451,0.44,0.415,0.406,0.432,0.452,0.438,0.446,0.525,0.488,326,Expression,ERBB2_HUMAN,Low,Human
+ESTA_BACSU_Nutschel_2020,0.258,0.399,0.389,0.415,0.387,0.387,0.187,0.315,0.345,0.413,0.336,0.305,0.335,0.169,0.27,0.267,0.3,0.281,0.323,0.32,0.121,0.196,0.268,0.286,0.27,0.254,0.311,0.272,0.379,0.379,0.434,0.406,0.075,0.184,0.272,0.261,0.31,0.329,0.326,0.403,0.407,0.397,0.193,0.121,0.289,0.262,0.557,0.489,0.545,0.368,0.366,0.334,0.347,0.343,0.378,0.347,0.347,0.365,0.369,0.37,0.391,0.212,2172,Stability,ESTA_BACSU,High,Prokaryote
+F7YBW8_MESOW_Aakre_2015,0.06,0.395,0.395,0.44,0.428,0.43,0.016,0.32,0.382,0.395,0.437,0.382,0.393,-0.091,-0.001,0.049,0.383,0.366,0.434,0.391,-0.075,-0.122,-0.104,-0.005,0.098,0.125,-0.021,0.298,0.403,0.375,0.461,0.441,0.037,-0.036,-0.124,0.433,0.062,0.011,0.425,0.38,0.367,0.444,-0.064,-0.081,0.199,-0.048,-0.008,0.246,0.069,0.035,0.431,0.447,0.423,0.46,0.453,0.457,0.43,0.443,0.447,0.448,0.27,-0.126,9192,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+FECA_ECOLI_Tsuboyama_2023_2D1U,0.38,0.431,0.449,0.458,0.449,0.441,0.023,0.297,0.372,0.456,0.509,0.269,0.433,0.059,0.449,0.536,0.524,0.48,0.386,0.464,0.169,0.272,0.343,0.288,0.156,0.477,0.451,0.292,0.394,0.52,0.409,0.336,0.276,0.065,0.121,0.148,0.355,0.348,0.302,0.415,0.415,0.389,0.395,0.008,0.501,0.447,0.582,0.551,0.588,0.525,0.527,0.483,0.514,0.512,0.5,0.528,0.499,0.489,0.506,0.515,0.595,0.585,1886,Stability,FECA_ECOLI,High,Prokaryote
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,0.423,0.394,0.479,0.482,0.487,0.498,0.18,0.342,0.237,0.237,0.171,0.17,0.164,0.172,0.163,0.147,0.188,0.265,0.378,0.287,0.221,0.213,0.218,0.279,0.169,0.229,0.193,0.25,0.22,0.478,0.315,0.319,-0.077,0.095,0.168,0.23,0.398,0.41,0.342,0.484,0.49,0.409,0.18,0.184,0.177,0.21,0.685,0.603,0.687,0.589,0.306,0.397,0.443,0.402,0.397,0.403,0.369,0.368,0.296,0.399,0.581,0.368,1237,Stability,FKBP3_HUMAN,Medium,Human
+GAL4_YEAST_Kitzman_2015,0.237,0.403,0.507,0.567,0.486,0.532,0.337,-0.004,0.513,0.575,0.623,0.458,0.462,0.34,0.392,0.528,0.668,0.67,0.655,0.507,0.319,0.355,0.383,0.372,0.364,0.424,0.493,0.451,0.579,0.628,0.651,0.563,0.303,0.279,0.331,0.325,0.56,0.555,0.557,0.508,0.509,0.503,0.395,0.301,0.604,0.446,0.287,0.568,0.308,0.114,0.627,0.634,0.613,0.625,0.632,0.634,0.62,0.617,0.619,0.638,0.558,0.408,1195,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+GCN4_YEAST_Staller_2018,0.253,0.25,0.245,0.248,0.242,0.241,0.182,0.221,0.24,0.24,0.238,0.272,0.284,0.318,0.278,0.266,0.281,0.261,0.248,0.186,0.175,0.189,0.168,0.167,0.158,0.118,0.081,0.134,0.164,0.225,0.255,0.247,0.025,0.175,0.169,0.265,0.258,0.257,0.278,0.258,0.257,0.274,0.107,0.192,0.157,0.126,0.145,0.183,0.201,0.167,0.227,0.22,0.217,0.214,0.226,0.224,0.219,0.221,0.224,0.222,0.221,0.245,2638,Binding,GCN4_YEAST,Low,Eukaryote
+GDIA_HUMAN_Silverstein_2021,0.442,0.429,0.439,0.446,0.451,0.452,0.205,0.398,0.466,0.462,0.391,0.423,0.46,0.159,0.247,0.448,0.397,0.356,0.411,0.401,0.253,0.385,0.38,0.409,0.326,0.439,0.425,0.384,0.397,0.415,0.423,0.393,0.276,0.321,0.394,0.356,0.432,0.448,0.437,0.451,0.47,0.461,0.233,0.121,0.432,0.26,0.357,0.421,0.387,0.101,0.389,0.335,0.402,0.378,0.391,0.399,0.377,0.391,0.375,0.405,0.461,0.338,1154,OrganismalFitness,GDIA_HUMAN,Low,Human
+GFP_AEQVI_Sarkisyan_2016,0.649,0.644,0.672,0.673,0.679,0.679,0.049,0.635,0.667,0.661,0.524,0.098,0.103,0.078,0.131,0.103,0.108,0.149,0.29,0.598,0.078,0.105,0.181,0.098,0.046,0.188,0.298,0.642,0.647,0.678,0.607,0.615,0.074,0.066,0.182,0.629,0.649,0.651,0.672,0.683,0.685,0.706,0.024,-0.014,0.041,0.036,0.512,0.508,0.713,0.602,0.591,0.6,0.61,0.617,0.609,0.611,0.605,0.606,0.587,0.607,0.623,0.449,51714,Activity,GFP_AEQVI,Low,Eukaryote
+GLPA_HUMAN_Elazar_2016,0.227,0.134,0.207,0.21,0.18,0.162,0.307,0.524,0.385,0.361,0.388,0.417,0.412,0.339,0.392,0.453,0.425,0.38,0.47,0.163,0.37,0.388,0.364,0.407,0.363,0.39,0.414,0.42,0.49,0.349,0.435,0.276,0.281,0.423,0.372,0.414,0.423,0.392,0.417,0.391,0.362,0.405,0.405,0.384,0.428,0.341,0.33,0.413,0.31,0.177,0.529,0.513,0.507,0.479,0.5,0.455,0.5,0.434,0.464,0.507,0.443,0.411,245,Expression,GLPA_HUMAN,Low,Human
+GRB2_HUMAN_Faure_2021,0.405,0.521,0.507,0.538,0.54,0.546,0.486,0.45,0.539,0.491,0.533,0.455,0.515,0.526,0.598,0.626,0.647,0.535,0.579,0.534,0.544,0.516,0.514,0.453,0.532,0.509,0.446,0.524,0.448,0.504,0.428,0.422,0.464,0.536,0.506,0.404,0.517,0.506,0.436,0.562,0.561,0.532,0.584,0.323,0.53,0.572,0.672,0.527,0.69,0.516,0.621,0.649,0.638,0.638,0.647,0.628,0.629,0.645,0.625,0.65,0.556,0.607,63366,OrganismalFitness,GRB2_HUMAN,Medium,Human
+HCP_LAMBD_Tsuboyama_2023_2L6Q,0.315,0.41,0.113,0.147,0.351,0.36,0.241,0.399,0.43,0.485,0.666,0.45,0.496,0.285,0.416,0.574,0.686,0.588,0.577,0.245,0.292,0.306,0.362,0.368,0.324,0.325,0.21,0.345,0.515,0.579,0.636,0.504,0.191,0.262,0.287,0.486,0.43,0.443,0.531,0.374,0.43,0.507,0.31,0.285,0.614,0.376,0.667,0.724,0.73,0.601,0.692,0.698,0.71,0.687,0.714,0.711,0.696,0.714,0.719,0.723,0.768,0.609,1040,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM,0.291,0.37,0.324,0.324,0.332,0.336,0.055,0.234,0.359,0.37,0.303,0.303,0.293,0.134,0.206,0.474,0.387,0.34,0.385,0.337,-0.024,0.232,0.302,0.264,0.16,0.294,0.308,0.247,0.214,0.312,0.332,0.307,0.103,0.135,0.101,0.229,0.327,0.298,0.353,0.327,0.341,0.359,0.089,0.038,0.433,0.138,0.252,0.26,0.38,0.321,0.368,0.408,0.345,0.392,0.411,0.436,0.442,0.428,0.451,0.417,0.414,0.219,5586,Stability,HECD1_HUMAN,Medium,Human
+HEM3_HUMAN_Loggerenberg_2023,0.418,0.409,0.403,0.407,0.42,0.418,0.126,0.103,0.432,0.442,0.388,0.383,0.396,0.142,0.387,0.408,0.393,0.418,0.429,0.108,0.349,0.373,0.37,0.375,0.358,0.387,0.376,0.385,0.402,0.435,0.42,0.41,0.093,0.377,0.381,0.377,0.42,0.43,0.438,0.427,0.434,0.44,0.323,0.135,0.384,0.378,0.314,0.358,0.301,0.146,0.357,0.337,0.353,0.363,0.377,0.374,0.371,0.373,0.368,0.377,0.431,0.415,5689,Activity,HEM3_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019,0.472,0.542,0.558,0.557,0.533,0.531,0.146,0.273,0.468,0.508,0.472,0.411,0.477,0.057,0.113,0.44,0.411,0.475,0.48,0.454,0.325,0.402,0.433,0.478,0.404,0.485,0.464,0.516,0.539,0.524,0.4,0.374,0.143,0.397,0.453,0.585,0.488,0.496,0.616,0.573,0.548,0.582,0.14,0.04,0.299,0.02,0.378,0.424,0.529,0.392,0.444,0.35,0.404,0.427,0.44,0.407,0.418,0.45,0.431,0.424,0.521,0.222,496137,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+HMDH_HUMAN_Jiang_2019,0.491,0.421,0.382,0.394,0.456,0.451,0.158,0.282,0.282,0.274,0.275,0.341,0.315,0.046,0.01,0.317,0.509,0.374,0.378,0.302,0.451,0.162,0.175,0.221,0.435,0.197,0.168,0.209,0.213,0.476,0.413,0.401,0.228,0.308,0.247,0.219,0.44,0.34,0.33,0.437,0.361,0.36,0.172,0.069,0.477,0.496,0.323,0.444,-0.184,0.105,0.493,0.492,0.486,0.498,0.489,0.492,0.48,0.5,0.485,0.499,0.529,0.438,16853,OrganismalFitness,HMDH_HUMAN,Low,Human
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.316,0.36,0.371,0.376,0.364,0.365,0.194,0.394,0.427,0.437,0.362,0.416,0.453,-0.073,0.071,0.234,0.257,0.309,0.293,0.02,0.374,0.398,0.399,0.386,0.338,0.366,0.411,0.415,0.42,0.397,0.436,0.395,0.225,0.34,0.34,0.34,0.363,0.362,0.361,0.382,0.381,0.381,0.01,-0.056,0.383,0.4,0.068,0.267,0.097,-0.015,0.241,0.257,0.243,0.258,0.255,0.243,0.269,0.25,0.236,0.256,0.367,0.301,2252,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Flynn_2019,0.349,0.367,0.398,0.409,0.401,0.401,0.159,0.347,0.417,0.406,0.367,0.392,0.417,0.018,0.094,0.174,0.283,0.359,0.37,0.391,0.384,0.397,0.389,0.407,0.398,0.419,0.423,0.397,0.434,0.425,0.454,0.441,0.194,0.38,0.394,0.392,0.403,0.415,0.411,0.418,0.427,0.424,0.133,-0.065,0.371,0.313,0.188,0.308,0.11,0.062,0.291,0.283,0.295,0.305,0.293,0.296,0.301,0.296,0.282,0.302,0.386,0.329,13294,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Mishra_2016,0.487,0.461,0.541,0.553,0.543,0.543,0.385,0.399,0.472,0.49,0.491,0.547,0.556,0.178,0.291,0.372,0.461,0.473,0.429,0.485,0.501,0.504,0.494,0.528,0.532,0.51,0.54,0.479,0.515,0.557,0.558,0.55,0.338,0.496,0.51,0.505,0.525,0.534,0.528,0.552,0.558,0.552,0.495,-0.104,0.485,0.533,0.105,0.352,0.096,0.04,0.433,0.416,0.417,0.446,0.442,0.44,0.451,0.46,0.436,0.45,0.458,0.445,4323,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HXK4_HUMAN_Gersing_2022_activity,0.489,0.516,0.477,0.49,0.484,0.492,0.229,0.387,0.511,0.527,0.446,0.464,0.491,0.183,0.292,0.51,0.517,0.508,0.476,0.051,0.453,0.449,0.442,0.421,0.449,0.465,0.452,0.448,0.402,0.503,0.492,0.45,0.179,0.472,0.426,0.37,0.503,0.482,0.446,0.513,0.508,0.493,0.316,0.032,0.501,0.478,0.356,0.439,0.394,0.168,0.507,0.499,0.488,0.497,0.498,0.509,0.502,0.511,0.508,0.513,0.52,0.481,8570,OrganismalFitness,HXK4_HUMAN,Medium,Human
+HXK4_HUMAN_Gersing_2023_abundance,0.322,0.362,0.325,0.351,0.341,0.348,0.057,0.329,0.386,0.407,0.338,0.331,0.354,0.136,0.185,0.328,0.367,0.376,0.358,0.401,0.282,0.29,0.314,0.316,0.285,0.307,0.316,0.315,0.32,0.374,0.295,0.253,0.17,0.305,0.304,0.317,0.34,0.349,0.359,0.348,0.362,0.369,0.168,0.071,0.351,0.336,0.399,0.382,0.391,0.16,0.359,0.374,0.371,0.368,0.379,0.376,0.375,0.368,0.371,0.378,0.424,0.329,8396,Expression,HXK4_HUMAN,Medium,Human
+I6TAH8_I68A0_Doud_2015,0.347,0.317,0.268,0.264,0.364,0.361,-0.005,0.297,0.299,0.336,0.013,0.018,0.014,0.018,0.025,0.01,0.02,0.013,0.094,0.212,0.308,0.328,0.374,0.377,0.004,0.011,0.097,0.002,0.302,0.368,0.25,0.253,0.171,0.304,0.332,0.337,0.329,0.354,0.348,0.383,0.399,0.401,0.007,0.02,0.016,0.019,0.209,0.209,0.231,0.1,0.075,0.114,0.126,0.135,0.111,0.106,0.089,0.098,0.067,0.109,0.056,0.018,9462,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+IF1_ECOLI_Kelsic_2016,0.328,0.499,0.541,0.539,0.527,0.537,0.177,0.387,0.261,0.255,0.534,0.54,0.565,0.193,0.477,0.55,0.599,0.556,0.515,0.49,0.361,0.438,0.366,0.411,0.451,0.458,0.482,0.443,0.459,0.401,0.52,0.463,0.238,0.458,0.477,0.527,0.463,0.47,0.495,0.529,0.532,0.539,0.371,0.15,0.562,0.516,0.321,0.518,0.475,0.244,0.545,0.534,0.555,0.575,0.577,0.559,0.551,0.571,0.56,0.583,0.616,0.506,1367,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33,0.233,0.336,0.444,0.438,0.437,0.436,0.022,0.315,0.371,0.416,0.353,0.352,0.35,0.192,0.324,0.435,0.318,0.282,0.167,0.48,0.206,0.214,0.257,0.312,0.347,0.233,0.372,0.255,0.259,0.457,0.349,0.313,0.097,0.127,0.282,0.362,0.282,0.349,0.401,0.382,0.389,0.445,0.279,0.045,0.339,0.395,0.418,0.368,0.475,0.342,0.304,0.274,0.343,0.33,0.37,0.395,0.33,0.312,0.414,0.37,0.319,0.349,1329,Stability,ILF3_HUMAN,High,Human
+ISDH_STAAW_Tsuboyama_2023_2LHR,0.094,0.101,0.206,0.197,0.231,0.231,0.326,0.128,0.326,0.356,0.453,0.413,0.434,0.377,0.394,0.449,0.429,0.479,0.45,0.185,0.326,0.199,0.218,0.228,0.318,0.335,0.329,0.298,0.36,0.304,0.396,0.334,0.236,0.291,0.249,0.298,0.259,0.219,0.245,0.263,0.221,0.244,0.3,0.248,0.349,0.329,0.592,0.526,0.519,0.49,0.42,0.407,0.429,0.428,0.427,0.429,0.409,0.414,0.419,0.426,0.608,0.522,1944,Stability,ISDH_STAAW,High,Prokaryote
+KCNE1_HUMAN_Muhammad_2023_expression,0.13,0.116,0.093,0.128,0.124,0.103,0.209,0.024,0.041,0.053,0.187,0.159,0.134,0.163,0.177,0.129,0.106,0.062,0.073,0.116,0.181,0.183,0.092,0.039,0.191,0.111,0.108,0.055,0.046,0.096,0.089,0.057,-0.17,0.147,0.225,0.097,0.138,0.152,0.115,0.138,0.141,0.121,0.199,0.166,0.093,0.194,0.179,0.116,0.139,0.086,0.088,0.087,0.091,0.071,0.067,0.065,0.105,0.1,0.082,0.086,0.11,0.193,2339,Expression,KCNE1_HUMAN,Medium,Human
+KCNE1_HUMAN_Muhammad_2023_function,0.417,0.498,0.479,0.496,0.479,0.514,0.215,0.52,0.514,0.52,0.368,0.474,0.584,0.209,0.162,0.599,0.525,0.465,0.38,0.241,0.272,0.293,0.448,0.539,0.285,0.614,0.58,0.603,0.575,0.488,0.568,0.475,-0.034,0.189,0.376,0.616,0.464,0.5,0.618,0.524,0.551,0.634,0.212,0.208,0.528,0.198,0.202,0.425,0.182,0.093,0.54,0.54,0.523,0.528,0.537,0.513,0.548,0.545,0.548,0.554,0.642,0.238,2315,Activity,KCNE1_HUMAN,Medium,Human
+KCNH2_HUMAN_Kozek_2020,0.449,0.421,0.298,0.292,0.211,0.21,0.454,0.151,0.303,0.306,0.307,0.216,0.234,0.253,0.197,0.229,0.28,0.258,0.248,0.382,0.468,0.513,0.495,0.463,0.502,0.491,0.455,0.477,0.481,0.46,0.433,0.318,0.344,0.471,0.525,0.511,0.499,0.554,0.538,0.472,0.517,0.509,0.394,0.223,0.193,0.291,0.219,-0.079,-0.202,0.086,0.53,0.39,0.481,0.435,0.444,0.442,0.476,0.441,0.435,0.475,0.271,0.411,200,Activity,KCNH2_HUMAN,Medium,Human
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.256,0.329,0.334,0.344,0.347,0.353,0.059,0.238,0.315,0.309,0.37,0.348,0.361,0.058,0.289,0.379,0.382,0.385,0.388,0.161,0.345,0.305,0.252,0.241,0.351,0.335,0.316,0.345,0.224,0.372,0.366,0.352,0.148,0.327,0.294,0.235,0.35,0.329,0.289,0.364,0.354,0.335,0.213,-0.022,0.328,0.343,0.189,0.286,0.227,0.084,0.374,0.355,0.365,0.365,0.369,0.361,0.371,0.365,0.365,0.373,0.371,0.266,6963,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.381,0.341,0.254,0.265,0.338,0.34,0.119,0.224,0.28,0.277,0.323,0.287,0.297,0.176,0.36,0.37,0.352,0.343,0.354,0.218,0.239,0.228,0.217,0.245,0.265,0.265,0.26,0.273,0.251,0.368,0.284,0.255,0.174,0.243,0.252,0.235,0.324,0.331,0.33,0.328,0.338,0.34,0.266,0.065,0.263,0.309,0.305,0.262,0.306,0.167,0.322,0.311,0.33,0.32,0.339,0.321,0.343,0.321,0.317,0.334,0.351,0.349,6917,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+KKA2_KLEPN_Melnikov_2014,0.25,0.53,0.437,0.622,0.599,0.603,0.199,0.423,0.538,0.606,0.566,0.597,0.621,0.212,0.261,0.487,0.601,0.645,0.662,0.498,0.285,0.42,0.518,0.54,0.353,0.578,0.577,0.568,0.638,0.627,0.637,0.586,0.148,0.243,0.485,0.586,0.445,0.524,0.588,0.586,0.609,0.63,0.253,0.076,0.559,0.386,0.456,0.582,0.551,0.267,0.587,0.572,0.588,0.578,0.579,0.59,0.574,0.583,0.591,0.599,0.641,0.329,4960,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+LGK_LIPST_Klesmith_2015,0.28,0.426,0.485,0.494,0.481,0.498,0.104,0.465,0.467,0.559,0.444,0.537,0.564,0.179,0.327,0.382,0.515,0.58,0.578,0.499,0.276,0.396,0.462,0.495,0.365,0.492,0.525,0.479,0.552,0.542,0.537,0.46,0.139,0.307,0.398,0.555,0.37,0.404,0.527,0.482,0.495,0.54,0.176,0.051,0.483,0.367,0.398,0.507,0.492,0.145,0.51,0.505,0.511,0.51,0.513,0.52,0.518,0.517,0.52,0.527,0.411,0.334,7890,Activity,LGK_LIPST,Medium,Eukaryote
+LYAM1_HUMAN_Elazar_2016,0.314,0.303,0.189,0.158,0.258,0.266,0.361,0.172,0.413,0.416,0.436,0.37,0.371,0.307,0.363,0.336,0.31,0.426,0.454,0.336,0.382,0.391,0.386,0.385,0.362,0.369,0.373,0.298,0.339,0.349,0.313,0.179,0.203,0.353,0.437,0.343,0.362,0.41,0.348,0.343,0.372,0.313,0.307,0.346,0.318,0.35,0.17,0.259,0.163,0.103,0.235,0.317,0.398,0.33,0.279,0.295,0.352,0.348,0.272,0.332,0.419,0.39,359,Expression,LYAM1_HUMAN,Medium,Human
+MAFG_MOUSE_Tsuboyama_2023_1K1V,0.613,0.636,0.633,0.635,0.621,0.623,0.405,0.643,0.633,0.626,0.619,0.355,0.6,0.453,0.429,0.471,0.48,0.551,0.413,0.627,0.448,0.494,0.522,0.566,0.46,0.515,0.447,0.536,0.49,0.602,0.621,0.609,0.392,0.329,0.565,0.509,0.589,0.647,0.657,0.629,0.661,0.683,0.312,0.079,0.425,0.246,0.596,0.327,0.638,0.611,0.646,0.615,0.617,0.638,0.623,0.638,0.636,0.642,0.651,0.639,0.719,0.727,1429,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV,0.551,0.723,0.708,0.727,0.758,0.763,-0.058,0.572,0.658,0.744,0.727,0.422,0.512,-0.146,0.025,0.742,0.699,0.685,0.74,0.715,-0.032,0.058,0.576,0.615,0.193,0.561,0.537,0.568,0.714,0.748,0.729,0.67,0.649,-0.282,0.063,0.033,0.586,0.658,0.634,0.729,0.751,0.742,-0.064,-0.203,0.629,0.433,0.521,0.66,0.773,0.576,0.793,0.76,0.792,0.745,0.758,0.741,0.763,0.776,0.774,0.78,0.813,0.72,2116,Stability,MBD11_ARATH,Medium,Eukaryote
+MET_HUMAN_Estevam_2023,0.454,0.544,0.538,0.563,0.582,0.584,0.517,0.52,0.55,0.562,0.585,0.55,0.562,0.452,0.515,0.542,0.59,0.596,0.599,0.583,0.526,0.506,0.466,0.492,0.53,0.519,0.519,0.488,0.454,0.548,0.57,0.53,0.286,0.471,0.501,0.525,0.512,0.539,0.563,0.558,0.565,0.573,0.542,0.347,0.571,0.575,0.35,0.42,0.498,0.192,0.54,0.548,0.548,0.55,0.562,0.554,0.557,0.561,0.556,0.566,0.575,0.529,5393,Activity,MET_HUMAN,Medium,Human
+MK01_HUMAN_Brenan_2016,0.176,0.198,0.237,0.241,0.223,0.227,0.209,0.163,0.198,0.184,0.033,0.167,0.182,0.17,0.199,0.195,0.178,0.189,0.141,0.184,0.209,0.117,0.069,0.016,0.183,0.089,0.057,0.076,-0.067,0.218,0.176,0.222,0.106,0.169,0.058,0.004,0.193,0.119,0.093,0.227,0.205,0.202,0.196,0.125,0.135,0.184,0.059,-0.0,0.133,0.006,0.161,0.162,0.148,0.166,0.17,0.163,0.163,0.169,0.177,0.171,0.15,0.186,6809,OrganismalFitness,MK01_HUMAN,Medium,Human
+MLAC_ECOLI_MacRae_2023,0.225,0.352,0.419,0.424,0.409,0.411,-0.014,0.36,0.412,0.419,0.399,0.416,0.432,-0.011,0.236,0.273,0.382,0.382,0.412,0.387,0.078,0.366,0.38,0.402,0.313,0.383,0.392,0.382,0.424,0.4,0.44,0.41,0.039,0.131,0.315,0.296,0.265,0.327,0.315,0.374,0.396,0.395,0.177,-0.036,0.329,0.235,0.006,0.193,0.123,-0.026,0.319,0.28,0.288,0.293,0.315,0.308,0.295,0.305,0.332,0.313,0.23,0.203,4007,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+MSH2_HUMAN_Jia_2020,0.352,0.399,0.367,0.376,0.383,0.39,0.204,0.339,0.395,0.402,0.349,0.38,0.4,0.213,0.349,0.397,0.339,0.296,0.204,0.364,0.296,0.31,0.264,0.257,0.326,0.317,0.32,0.326,0.301,0.394,0.351,0.324,0.208,0.277,0.351,0.281,0.35,0.388,0.346,0.387,0.405,0.383,0.284,0.103,0.4,0.355,0.293,0.363,0.062,0.099,0.305,0.279,0.291,0.309,0.317,0.3,0.295,0.308,0.339,0.321,0.394,0.366,16749,OrganismalFitness,MSH2_HUMAN,Medium,Human
+MTH3_HAEAE_RockahShmuel_2015,0.371,0.612,0.709,0.718,0.696,0.704,0.314,0.644,0.673,0.688,0.581,0.692,0.708,0.19,0.352,0.405,0.527,0.608,0.644,0.658,0.311,0.454,0.6,0.662,0.488,0.663,0.711,0.684,0.727,0.679,0.697,0.652,0.291,0.353,0.495,0.665,0.422,0.496,0.638,0.657,0.679,0.714,0.324,-0.031,0.477,0.341,0.449,0.574,0.583,0.225,0.544,0.531,0.559,0.569,0.561,0.573,0.551,0.571,0.557,0.572,0.58,0.378,1777,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+MTHR_HUMAN_Weile_2021,0.25,0.271,0.249,0.257,0.258,0.257,0.205,0.194,0.292,0.293,0.387,0.299,0.32,0.169,0.414,0.482,0.348,0.276,0.285,0.253,0.443,0.162,0.217,0.265,0.437,0.239,0.264,0.245,0.295,0.296,0.22,0.189,0.143,0.451,0.307,0.212,0.378,0.315,0.254,0.363,0.309,0.269,0.278,0.111,0.379,0.411,0.239,0.338,0.377,0.092,0.324,0.289,0.299,0.297,0.305,0.316,0.335,0.319,0.347,0.325,0.332,0.376,12464,OrganismalFitness,MTHR_HUMAN,Low,Human
+MYO3_YEAST_Tsuboyama_2023_2BTT,0.143,0.248,0.334,0.362,0.381,0.412,0.155,0.251,0.428,0.419,0.519,0.482,0.478,0.171,0.411,0.583,0.588,0.507,0.247,0.321,0.351,0.247,0.336,0.311,0.342,0.287,0.364,0.313,0.336,0.283,0.279,0.178,0.126,0.419,0.332,0.282,0.382,0.329,0.292,0.411,0.401,0.382,0.112,0.045,0.431,0.294,0.392,0.427,0.479,0.359,0.517,0.495,0.545,0.512,0.525,0.529,0.513,0.519,0.506,0.525,0.571,0.5,3297,Stability,MYO3_YEAST,High,Eukaryote
+NCAP_I34A1_Doud_2015,0.364,0.328,0.334,0.333,0.363,0.364,0.002,0.335,0.373,0.348,0.015,0.019,0.02,0.023,0.026,0.02,0.028,0.031,0.107,0.269,0.352,0.382,0.408,0.413,0.018,0.042,0.108,0.03,0.352,0.377,0.27,0.279,0.126,0.356,0.373,0.415,0.39,0.402,0.424,0.425,0.426,0.441,0.019,0.015,0.027,0.025,0.259,0.27,0.278,0.123,0.129,0.171,0.166,0.184,0.172,0.172,0.155,0.163,0.129,0.171,0.135,0.081,9462,OrganismalFitness,NCAP_I34A1,Medium,Virus
+NKX31_HUMAN_Tsuboyama_2023_2L9R,0.428,0.548,0.586,0.584,0.607,0.619,0.582,0.553,0.584,0.608,0.56,0.568,0.572,0.627,0.638,0.674,0.651,0.582,0.602,0.558,0.564,0.557,0.556,0.564,0.576,0.579,0.606,0.54,0.577,0.64,0.525,0.506,0.551,0.494,0.565,0.577,0.578,0.624,0.628,0.601,0.618,0.621,0.454,0.395,0.342,0.408,0.589,0.401,0.643,0.642,0.634,0.626,0.639,0.648,0.669,0.641,0.635,0.629,0.642,0.645,0.566,0.596,2482,Stability,NKX31_HUMAN,High,Human
+NPC1_HUMAN_Erwood_2022_HEK293T,0.683,0.7,0.627,0.632,0.7,0.7,0.22,0.529,0.739,0.734,0.701,0.325,0.458,0.221,0.394,0.614,0.694,0.721,0.719,0.147,0.197,0.46,0.412,0.419,0.421,0.491,0.631,0.447,0.495,0.71,0.659,0.548,0.095,0.231,0.547,0.579,0.639,0.675,0.696,0.691,0.708,0.722,0.3,0.134,0.702,0.568,0.499,0.683,0.06,0.211,0.681,0.664,0.687,0.679,0.678,0.664,0.693,0.701,0.685,0.697,0.7,0.392,637,Activity,NPC1_HUMAN,Low,Human
+NPC1_HUMAN_Erwood_2022_RPE1,0.779,0.688,0.534,0.554,0.716,0.731,0.417,0.535,0.584,0.707,0.704,0.298,0.357,0.314,0.394,0.629,0.661,0.64,0.709,0.405,0.631,0.49,0.543,0.417,0.408,0.468,0.536,0.373,0.531,0.633,0.74,0.568,0.475,0.466,0.377,0.612,0.787,0.672,0.774,0.783,0.7,0.751,0.506,0.442,0.786,0.391,0.472,0.557,-0.107,0.157,0.574,0.537,0.559,0.482,0.569,0.56,0.571,0.646,0.64,0.595,0.692,0.482,63,Activity,NPC1_HUMAN,Low,Human
+NRAM_I33A0_Jiang_2016,0.569,0.565,0.501,0.49,0.584,0.584,0.035,0.39,0.625,0.638,-0.076,0.162,0.448,-0.075,-0.098,0.005,0.161,0.541,0.575,0.343,0.583,0.633,0.584,0.571,0.047,0.53,0.627,0.462,0.654,0.635,0.404,0.441,-0.169,0.512,0.532,0.551,0.592,0.615,0.621,0.628,0.638,0.632,-0.105,-0.127,-0.111,-0.111,0.422,0.414,0.448,0.178,0.19,0.228,0.235,0.297,0.253,0.29,0.263,0.247,0.199,0.254,0.292,0.145,298,OrganismalFitness,NRAM_I33A0,Low,Virus
+NUD15_HUMAN_Suiter_2020,0.271,0.453,0.564,0.596,0.591,0.594,-0.005,0.389,0.602,0.665,0.603,0.6,0.645,0.299,0.419,0.46,0.526,0.583,0.599,0.501,0.301,0.454,0.546,0.518,0.412,0.579,0.577,0.549,0.542,0.587,0.623,0.553,0.147,0.36,0.423,0.575,0.433,0.456,0.604,0.587,0.586,0.635,0.386,0.015,0.603,0.429,0.483,0.581,0.534,0.317,0.55,0.521,0.544,0.568,0.556,0.571,0.567,0.559,0.556,0.573,0.679,0.564,2844,Expression,NUD15_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL,0.392,0.589,0.58,0.571,0.602,0.613,0.285,0.465,0.638,0.599,0.56,0.323,0.346,0.322,0.39,0.405,0.494,0.532,0.533,0.56,0.394,0.549,0.572,0.589,0.325,0.497,0.44,0.525,0.572,0.638,0.67,0.706,0.183,0.365,0.419,0.425,0.505,0.493,0.508,0.628,0.601,0.622,0.375,0.246,0.381,0.357,0.682,0.641,0.73,0.658,0.709,0.695,0.699,0.721,0.726,0.741,0.716,0.716,0.719,0.723,0.685,0.575,2028,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6,0.405,0.404,0.417,0.431,0.451,0.434,0.23,0.423,0.485,0.461,0.398,0.361,0.421,0.389,0.561,0.51,0.518,0.525,0.477,0.521,0.264,0.339,0.315,0.347,0.38,0.385,0.461,0.334,0.378,0.502,0.394,0.387,0.321,0.269,0.275,0.372,0.395,0.399,0.426,0.413,0.417,0.424,0.445,0.144,0.395,0.531,0.676,0.56,0.759,0.64,0.487,0.457,0.454,0.507,0.487,0.485,0.485,0.486,0.483,0.488,0.41,0.588,1380,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,0.54,0.697,0.759,0.777,0.745,0.757,0.384,0.455,0.751,0.763,0.691,0.735,0.759,0.392,0.779,0.819,0.797,0.786,0.755,0.723,0.688,0.689,0.663,0.687,0.589,0.667,0.694,0.726,0.604,0.735,0.694,0.679,0.549,0.262,0.461,0.575,0.687,0.695,0.704,0.755,0.76,0.764,0.543,0.115,0.507,0.633,0.63,0.54,0.782,0.747,0.762,0.76,0.761,0.755,0.772,0.76,0.773,0.767,0.776,0.771,0.825,0.772,3197,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G,-0.211,-0.17,0.339,0.188,0.332,0.332,-0.149,0.093,0.175,0.19,0.064,-0.1,-0.115,-0.069,-0.151,0.14,0.221,0.089,0.21,0.233,0.189,0.174,0.159,0.177,0.171,0.195,0.166,0.192,0.233,0.327,0.062,0.023,0.245,-0.05,0.025,0.012,0.19,0.181,0.178,0.271,0.26,0.29,0.268,-0.07,0.339,0.327,0.452,0.424,0.437,0.322,0.118,0.075,0.085,0.129,0.073,0.127,0.114,0.116,0.094,0.107,0.286,0.171,1134,Stability,ODP2_GEOSE,High,Prokaryote
+OPSD_HUMAN_Wan_2019,0.17,0.433,0.445,0.517,0.463,0.466,0.347,0.569,0.563,0.6,0.397,0.499,0.56,0.29,0.508,0.518,0.552,0.449,0.482,0.555,0.528,0.549,0.538,0.578,0.552,0.594,0.598,0.577,0.57,0.511,0.422,0.326,0.17,0.503,0.53,0.485,0.537,0.532,0.503,0.485,0.482,0.488,0.386,0.08,0.493,0.472,0.632,0.518,0.632,0.224,0.402,0.353,0.412,0.421,0.482,0.487,0.418,0.421,0.49,0.472,0.582,0.522,165,Expression,OPSD_HUMAN,High,Human
+OTC_HUMAN_Lo_2023,0.524,0.564,0.484,0.522,0.55,0.552,0.127,0.404,0.59,0.592,0.546,0.573,0.581,0.142,0.411,0.512,0.531,0.51,0.534,0.565,0.453,0.461,0.522,0.545,0.476,0.555,0.526,0.54,0.574,0.599,0.475,0.409,0.085,0.429,0.511,0.569,0.522,0.57,0.614,0.567,0.584,0.599,0.279,0.092,0.524,0.463,0.649,0.606,0.674,0.364,0.526,0.547,0.544,0.574,0.556,0.567,0.56,0.548,0.541,0.564,0.616,0.49,1570,Activity,OTC_HUMAN,Medium,Human
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,0.116,0.243,0.184,0.201,0.224,0.227,0.192,0.134,0.141,0.146,0.434,0.604,0.581,0.205,0.563,0.588,0.319,0.362,0.408,0.155,0.184,0.178,0.221,0.275,0.345,0.26,0.38,0.307,0.148,0.259,0.406,0.388,0.132,0.168,0.225,0.349,0.201,0.198,0.308,0.242,0.22,0.262,0.365,0.184,0.543,0.548,0.532,0.533,0.61,0.537,0.395,0.342,0.348,0.401,0.422,0.421,0.374,0.409,0.414,0.404,0.642,0.632,635,Stability,OTU7A_HUMAN,High,Human
+OXDA_RHOTO_Vanella_2023_activity,0.104,0.292,0.364,0.373,0.356,0.356,0.195,0.251,0.278,0.285,0.383,0.348,0.359,0.122,0.219,0.326,0.37,0.39,0.401,0.259,0.125,0.214,0.303,0.305,0.219,0.343,0.356,0.329,0.407,0.378,0.429,0.408,0.102,0.164,0.213,0.293,0.267,0.275,0.305,0.342,0.344,0.355,0.153,0.014,0.369,0.247,0.295,0.386,0.361,0.139,0.367,0.36,0.355,0.361,0.365,0.374,0.367,0.376,0.358,0.377,0.396,0.242,6396,Activity,OXDA_RHOTO,High,Eukaryote
+OXDA_RHOTO_Vanella_2023_expression,0.18,0.253,0.29,0.287,0.276,0.272,0.19,0.193,0.296,0.305,0.327,0.31,0.311,0.221,0.257,0.298,0.315,0.351,0.34,0.175,0.169,0.243,0.252,0.233,0.217,0.264,0.295,0.275,0.31,0.308,0.289,0.241,0.082,0.227,0.211,0.246,0.268,0.259,0.267,0.284,0.278,0.278,0.243,0.144,0.304,0.234,0.385,0.356,0.333,0.142,0.329,0.318,0.308,0.317,0.323,0.308,0.321,0.316,0.311,0.326,0.408,0.321,6769,Expression,OXDA_RHOTO,High,Eukaryote
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.415,0.424,0.329,0.344,0.432,0.427,-0.088,0.319,0.283,0.291,0.467,0.457,0.509,-0.154,-0.146,0.314,0.432,0.469,0.497,0.116,0.312,0.438,0.453,0.409,0.39,0.472,0.484,0.493,0.368,0.423,0.465,0.41,0.208,0.258,0.449,0.355,0.403,0.462,0.4,0.427,0.469,0.413,-0.151,-0.155,0.475,-0.051,0.389,0.471,0.414,0.186,0.449,0.45,0.451,0.45,0.452,0.47,0.448,0.454,0.452,0.464,0.477,0.186,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.35,0.395,0.294,0.304,0.383,0.38,-0.087,0.269,0.287,0.289,0.482,0.447,0.498,-0.158,-0.145,0.29,0.404,0.431,0.459,0.126,0.294,0.447,0.463,0.43,0.375,0.487,0.5,0.523,0.329,0.42,0.45,0.375,0.213,0.239,0.453,0.36,0.354,0.431,0.379,0.372,0.43,0.383,-0.144,-0.153,0.471,-0.063,0.403,0.48,0.426,0.188,0.423,0.427,0.427,0.433,0.431,0.44,0.426,0.43,0.434,0.44,0.469,0.182,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.445,0.462,0.342,0.366,0.479,0.481,-0.143,0.3,0.258,0.259,0.551,0.496,0.554,-0.194,-0.174,0.362,0.4,0.446,0.467,0.028,0.41,0.563,0.521,0.48,0.482,0.535,0.55,0.616,0.285,0.507,0.493,0.449,0.225,0.326,0.612,0.408,0.448,0.568,0.433,0.475,0.567,0.438,-0.185,-0.202,0.544,-0.08,0.452,0.553,0.461,0.223,0.438,0.45,0.449,0.452,0.461,0.471,0.437,0.461,0.438,0.463,0.482,0.166,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Kotler_2018,0.629,0.588,0.496,0.556,0.555,0.576,0.109,0.498,0.573,0.585,0.6,0.538,0.622,0.095,0.108,0.628,0.678,0.698,0.702,0.432,0.429,0.461,0.478,0.463,0.47,0.48,0.465,0.47,0.493,0.593,0.631,0.585,0.039,0.398,0.468,0.42,0.611,0.584,0.564,0.604,0.588,0.581,0.09,0.04,0.638,0.221,0.467,0.583,0.479,0.252,0.638,0.659,0.658,0.642,0.645,0.655,0.673,0.658,0.663,0.672,0.705,0.396,1048,OrganismalFitness,P53_HUMAN,Low,Human
+P84126_THETH_Chan_2017,0.508,0.58,0.603,0.617,0.564,0.578,0.364,0.526,0.644,0.623,0.569,0.552,0.588,0.319,0.551,0.559,0.605,0.596,0.571,0.618,0.419,0.507,0.478,0.558,0.521,0.584,0.589,0.571,0.653,0.522,0.553,0.469,0.359,0.474,0.499,0.536,0.515,0.526,0.547,0.56,0.568,0.581,0.465,0.032,0.543,0.51,0.337,0.452,0.512,0.125,0.543,0.527,0.487,0.549,0.551,0.519,0.526,0.553,0.561,0.562,0.622,0.61,1519,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+PA_I34A1_Wu_2015,0.518,0.519,0.499,0.508,0.539,0.543,0.041,0.358,0.152,0.165,0.037,0.054,0.101,0.024,0.029,0.025,0.038,0.041,0.356,0.325,0.456,0.493,0.533,0.538,0.219,0.408,0.444,0.429,0.438,0.584,0.384,0.374,0.107,0.436,0.475,0.541,0.546,0.561,0.572,0.588,0.592,0.584,0.028,0.019,0.031,0.02,0.244,0.232,0.172,0.095,0.154,0.173,0.184,0.187,0.168,0.178,0.158,0.173,0.125,0.181,0.202,0.132,1820,OrganismalFitness,PA_I34A1,Medium,Virus
+PABP_YEAST_Melamed_2013,0.663,0.617,0.541,0.55,0.654,0.648,0.474,0.569,0.637,0.663,0.688,0.662,0.678,0.476,0.566,0.648,0.716,0.684,0.695,0.541,0.638,0.665,0.666,0.692,0.638,0.698,0.7,0.676,0.666,0.674,0.703,0.635,0.261,0.638,0.648,0.64,0.689,0.692,0.688,0.683,0.687,0.684,0.605,-0.027,0.67,0.654,0.361,0.612,0.514,0.197,0.664,0.66,0.672,0.693,0.685,0.689,0.691,0.69,0.718,0.705,0.715,0.622,37708,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+PAI1_HUMAN_Huttinger_2021,0.403,0.395,0.377,0.392,0.402,0.413,0.054,0.314,0.428,0.438,0.435,0.408,0.43,0.083,0.391,0.434,0.448,0.335,0.27,0.422,0.132,0.381,0.364,0.345,0.351,0.395,0.399,0.372,0.33,0.423,0.393,0.356,0.141,0.171,0.37,0.379,0.384,0.404,0.412,0.414,0.424,0.426,0.35,0.049,0.43,0.407,0.379,0.45,0.404,0.152,0.441,0.437,0.439,0.448,0.448,0.445,0.449,0.449,0.446,0.457,0.468,0.391,5345,Activity,PAI1_HUMAN,,Human
+PHOT_CHLRE_Chen_2023,0.211,0.44,0.731,0.706,0.352,0.335,0.654,0.552,0.711,0.721,0.647,0.757,0.783,0.768,0.823,0.714,0.749,0.714,0.758,0.652,0.685,0.678,0.596,0.699,0.533,0.616,0.596,0.656,0.602,0.587,0.572,0.49,0.303,0.565,0.511,0.602,0.597,0.564,0.606,0.416,0.445,0.422,0.623,0.395,0.599,0.621,0.2,0.502,0.669,0.41,0.603,0.543,0.553,0.551,0.555,0.57,0.568,0.574,0.583,0.569,0.737,0.775,167529,Activity,PHOT_CHLRE,High,Eukaryote
+PIN1_HUMAN_Tsuboyama_2023_1I6C,0.222,0.327,0.68,0.645,0.631,0.661,0.585,0.5,0.67,0.707,0.654,0.576,0.674,0.66,0.646,0.665,0.67,0.52,0.548,0.656,0.456,0.671,0.63,0.61,0.559,0.672,0.681,0.647,0.658,0.689,0.692,0.644,0.531,0.539,0.626,0.667,0.607,0.671,0.72,0.677,0.705,0.73,0.556,-0.189,0.623,0.51,0.547,0.574,0.691,0.669,0.659,0.627,0.498,0.563,0.575,0.604,0.608,0.649,0.626,0.62,0.74,0.742,802,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M,0.579,0.52,0.548,0.555,0.549,0.553,0.506,0.47,0.538,0.561,0.467,0.446,0.436,0.557,0.652,0.675,0.626,0.586,0.507,0.542,0.513,0.454,0.477,0.45,0.547,0.493,0.465,0.428,0.468,0.536,0.403,0.386,0.45,0.558,0.508,0.481,0.596,0.548,0.538,0.573,0.537,0.538,0.468,0.412,0.282,0.333,0.453,0.295,0.693,0.63,0.572,0.579,0.572,0.59,0.602,0.609,0.593,0.594,0.598,0.598,0.47,0.593,1824,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF,0.179,0.188,0.218,0.234,0.249,0.254,0.198,0.187,0.165,0.206,0.253,0.278,0.31,0.314,0.347,0.464,0.298,0.236,0.238,0.277,0.31,0.289,0.278,0.325,0.3,0.291,0.31,0.292,0.295,0.29,0.326,0.317,0.067,0.335,0.278,0.311,0.262,0.285,0.308,0.288,0.28,0.293,0.266,0.279,0.26,0.3,0.512,0.379,0.493,0.469,0.258,0.303,0.317,0.26,0.284,0.287,0.286,0.299,0.314,0.299,0.324,0.464,1301,Stability,PKN1_HUMAN,High,Human
+POLG_CXB3N_Mattenberger_2021,0.423,0.39,0.377,0.411,0.46,0.473,-0.036,0.336,0.5,0.499,0.292,-0.059,0.042,-0.079,-0.056,0.177,0.395,0.404,0.426,0.356,0.339,0.39,0.381,0.377,0.138,0.388,0.383,0.369,0.393,0.495,0.386,0.319,0.007,0.049,0.274,0.355,0.342,0.385,0.413,0.387,0.43,0.458,-0.065,-0.066,0.342,-0.054,0.193,0.339,0.113,0.046,0.342,0.361,0.364,0.369,0.364,0.365,0.365,0.365,0.374,0.374,0.11,0.068,15711,OrganismalFitness,POLG_CXB3N,Medium,Virus
+POLG_DEN26_Suphatrakul_2023,0.5,0.578,0.289,0.286,0.532,0.533,-0.044,0.434,0.67,0.675,0.318,-0.006,0.0,-0.044,0.007,0.077,0.153,0.275,0.366,0.409,0.453,0.473,0.449,0.456,0.47,0.501,0.493,0.488,0.461,0.622,0.603,0.522,0.035,-0.064,0.115,0.471,0.432,0.383,0.542,0.469,0.397,0.563,-0.055,-0.059,0.391,-0.005,0.36,0.51,0.101,0.116,0.241,0.229,0.244,0.26,0.227,0.261,0.24,0.246,0.223,0.254,0.161,0.032,16897,OrganismalFitness,POLG_DEN26,Low,Virus
+POLG_HCVJF_Qi_2014,0.605,0.547,0.41,0.413,0.605,0.614,-0.039,0.196,0.565,0.577,0.178,0.637,0.635,0.101,0.128,0.114,0.116,0.09,0.078,0.26,0.4,0.443,0.452,0.492,0.422,0.475,0.32,0.41,0.517,0.63,0.587,0.485,0.182,0.474,0.505,0.522,0.515,0.547,0.578,0.48,0.528,0.56,0.114,0.053,0.506,0.109,-0.037,0.349,0.643,0.35,0.308,0.378,0.344,0.379,0.342,0.283,0.323,0.318,0.314,0.372,0.173,0.159,1630,OrganismalFitness,POLG_HCVJF,Medium,Virus
+POLG_PESV_Tsuboyama_2023_2MXD,0.251,0.455,0.373,0.407,0.471,0.463,0.07,0.42,0.39,0.514,0.375,0.062,0.157,0.132,0.14,0.031,0.16,0.114,0.084,0.592,0.079,0.058,0.016,0.157,0.106,-0.059,0.008,-0.084,0.17,0.507,0.575,0.58,0.131,0.003,-0.033,0.053,0.382,0.383,0.384,0.453,0.463,0.452,0.006,-0.037,0.033,0.026,0.448,0.443,0.65,0.576,0.658,0.664,0.615,0.736,0.705,0.718,0.692,0.731,0.718,0.718,0.716,0.686,5130,Stability,POLG_PESV,Medium,Virus
+PPARG_HUMAN_Majithia_2016,0.353,0.539,0.629,0.655,0.612,0.632,0.187,0.339,0.557,0.571,0.623,0.696,0.695,0.028,0.126,0.456,0.594,0.736,0.768,0.717,0.675,0.692,0.25,0.288,0.592,0.722,0.692,0.719,0.299,0.712,0.583,0.509,0.384,0.709,0.683,0.587,0.673,0.7,0.637,0.698,0.706,0.68,0.266,0.03,0.585,0.45,0.594,0.607,0.631,0.302,0.58,0.588,0.593,0.575,0.599,0.583,0.602,0.598,0.585,0.603,0.657,0.499,9576,Activity,PPARG_HUMAN,Medium,Human
+PPM1D_HUMAN_Miller_2022,0.55,0.523,0.531,0.536,0.609,0.61,0.035,0.378,0.46,0.517,0.578,0.599,0.618,0.281,0.386,0.47,0.602,0.628,0.618,0.387,0.45,0.526,0.537,0.43,0.516,0.577,0.567,0.557,0.419,0.601,0.594,0.546,0.234,0.428,0.528,0.537,0.582,0.593,0.591,0.615,0.614,0.616,0.361,-0.05,0.575,0.515,0.503,0.579,0.563,0.236,0.574,0.575,0.57,0.586,0.593,0.58,0.584,0.585,0.59,0.595,0.633,0.509,7889,OrganismalFitness,PPM1D_HUMAN,Low,Human
+PR40A_HUMAN_Tsuboyama_2023_1UZC,0.562,0.655,0.745,0.736,0.772,0.774,0.518,0.493,0.774,0.783,0.727,0.592,0.681,0.54,0.535,0.821,0.805,0.74,0.727,0.715,0.552,0.694,0.7,0.725,0.634,0.716,0.733,0.723,0.744,0.795,0.777,0.756,0.637,0.593,0.556,0.611,0.735,0.746,0.738,0.764,0.784,0.775,0.438,0.281,0.326,0.431,0.639,0.423,0.794,0.783,0.792,0.809,0.804,0.812,0.828,0.823,0.819,0.816,0.816,0.821,0.801,0.835,2033,Stability,PR40A_HUMAN,Medium,Human
+PRKN_HUMAN_Clausen_2023,0.605,0.619,0.6,0.586,0.622,0.614,0.182,0.459,0.525,0.54,0.582,0.608,0.638,0.233,0.292,0.363,0.501,0.658,0.663,0.447,0.281,0.557,0.609,0.576,0.447,0.632,0.623,0.623,0.52,0.622,0.568,0.526,0.266,0.241,0.55,0.593,0.569,0.621,0.64,0.611,0.638,0.654,0.309,0.093,0.591,0.352,0.649,0.618,0.696,0.298,0.542,0.555,0.561,0.583,0.567,0.571,0.579,0.573,0.561,0.583,0.672,0.519,8756,Expression,PRKN_HUMAN,Low,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE,0.572,0.576,0.566,0.566,0.576,0.572,0.3,0.442,0.491,0.497,0.692,0.601,0.615,0.498,0.57,0.678,0.708,0.627,0.583,0.477,0.304,0.242,0.467,0.391,0.462,0.492,-0.202,0.471,0.529,0.553,0.558,0.505,0.098,0.328,0.409,0.43,0.546,0.543,0.498,0.586,0.585,0.541,0.422,0.237,0.635,0.482,0.586,0.626,0.699,0.643,0.7,0.684,0.687,0.685,0.686,0.692,0.681,0.687,0.689,0.699,0.682,0.674,1579,Stability,PSAE_PICP2,Medium,Prokaryote
+PTEN_HUMAN_Matreyek_2021,0.372,0.393,0.368,0.39,0.397,0.408,0.153,0.358,0.426,0.471,0.449,0.409,0.454,0.165,0.274,0.462,0.465,0.265,0.29,0.477,0.191,0.46,0.408,0.391,0.27,0.338,0.323,0.377,0.294,0.479,0.39,0.388,0.127,0.275,0.427,0.343,0.383,0.457,0.412,0.419,0.463,0.447,0.242,0.027,0.442,0.347,0.482,0.473,0.482,0.229,0.446,0.428,0.461,0.441,0.44,0.451,0.456,0.455,0.447,0.46,0.499,0.454,5083,Expression,PTEN_HUMAN,Medium,Human
+PTEN_HUMAN_Mighell_2018,0.48,0.504,0.511,0.516,0.536,0.54,0.177,0.347,0.504,0.511,0.475,0.469,0.499,0.206,0.407,0.546,0.519,0.308,0.291,0.514,0.335,0.413,0.328,0.279,0.41,0.291,0.278,0.317,0.228,0.525,0.485,0.475,0.058,0.374,0.404,0.299,0.499,0.474,0.418,0.532,0.527,0.531,0.371,-0.009,0.51,0.484,0.421,0.416,0.48,0.202,0.488,0.459,0.488,0.495,0.511,0.505,0.495,0.495,0.509,0.51,0.539,0.51,7260,Activity,PTEN_HUMAN,Medium,Human
+Q2N0S5_9HIV1_Haddox_2018,0.493,0.379,0.352,0.393,0.495,0.502,0.003,0.437,0.51,0.515,0.47,0.509,0.537,-0.005,-0.003,-0.002,0.044,0.093,0.168,0.403,0.518,0.403,0.39,0.337,0.517,0.401,0.394,0.436,0.354,0.507,0.478,0.483,0.291,0.494,0.419,0.406,0.52,0.502,0.501,0.526,0.515,0.513,0.413,-0.016,0.502,0.456,0.396,0.463,0.263,0.145,0.226,0.266,0.307,0.301,0.25,0.257,0.26,0.245,0.223,0.28,0.232,0.116,12729,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.356,0.341,0.348,0.353,0.358,0.355,0.081,0.245,0.314,0.324,0.235,0.301,0.325,0.192,0.186,0.364,0.33,0.287,0.347,0.329,0.249,0.267,0.257,0.215,0.292,0.297,0.268,0.306,0.299,0.303,0.371,0.341,0.098,0.238,0.268,0.182,0.311,0.334,0.294,0.352,0.365,0.348,0.402,0.202,0.321,0.328,0.311,0.341,0.338,0.192,0.291,0.302,0.333,0.334,0.321,0.321,0.316,0.312,0.312,0.325,0.33,0.465,3344,Binding,Q53Z42_HUMAN,Medium,Human
+Q53Z42_HUMAN_McShan_2019_expression,0.524,0.485,0.526,0.538,0.55,0.548,-0.062,0.415,0.541,0.556,0.413,0.467,0.507,0.088,0.106,0.489,0.554,0.55,0.558,0.546,0.415,0.47,0.459,0.448,0.476,0.514,0.492,0.511,0.534,0.533,0.553,0.505,0.183,0.417,0.44,0.405,0.52,0.538,0.533,0.555,0.564,0.563,0.308,0.093,0.526,0.483,0.454,0.49,0.458,0.207,0.533,0.546,0.555,0.551,0.565,0.556,0.557,0.56,0.564,0.57,0.513,0.622,3344,Expression,Q53Z42_HUMAN,Medium,Human
+Q59976_STRSQ_Romero_2015,0.475,0.593,0.634,0.643,0.654,0.662,0.363,0.535,0.672,0.679,0.588,0.519,0.543,0.144,0.448,0.516,0.572,0.56,0.57,0.649,0.598,0.651,0.652,0.663,0.622,0.662,0.677,0.673,0.68,0.684,0.633,0.575,0.304,0.606,0.652,0.634,0.616,0.653,0.657,0.666,0.672,0.675,0.49,0.008,0.587,0.524,0.408,0.533,0.533,0.155,0.56,0.537,0.533,0.556,0.557,0.542,0.562,0.545,0.553,0.563,0.634,0.54,2999,Activity,Q59976_STRSQ,Medium,Prokaryote
+Q6WV13_9MAXI_Somermeyer_2022,0.327,0.393,0.26,0.264,0.333,0.334,-0.031,0.115,0.413,0.407,0.242,0.036,0.022,0.014,0.046,0.011,0.01,0.01,-0.022,0.294,0.013,0.045,0.003,0.102,0.004,-0.002,0.03,-0.005,-0.013,0.426,0.344,0.354,0.043,0.031,0.022,0.014,0.248,0.246,0.245,0.33,0.327,0.324,0.014,-0.015,-0.004,0.019,0.161,0.156,0.299,0.169,0.228,0.248,0.237,0.287,0.267,0.272,0.261,0.263,0.264,0.261,0.072,0.031,31401,Activity,Q6WV12_9MAXI,Low,Eukaryote
+Q837P4_ENTFA_Meier_2023,0.451,0.465,0.489,0.507,0.503,0.517,0.454,0.404,0.479,0.495,0.563,0.545,0.566,0.42,0.475,0.492,0.515,0.545,0.523,-0.019,0.521,0.445,0.441,0.462,0.552,0.535,0.541,0.449,0.521,0.524,0.527,0.477,0.19,0.491,0.516,0.454,0.491,0.538,0.512,0.54,0.549,0.544,0.458,0.37,0.545,0.49,0.231,0.454,0.373,0.114,0.511,0.488,0.499,0.511,0.522,0.529,0.536,0.491,0.524,0.531,0.589,0.483,697,Activity,Q837P4_ENTFA,Medium,Prokaryote
+Q837P5_ENTFA_Meier_2023,0.196,0.416,0.421,0.459,0.39,0.395,0.222,0.263,0.282,0.294,0.326,0.353,0.342,0.129,0.24,0.334,0.374,0.436,0.392,0.321,0.356,0.399,0.435,0.41,0.319,0.408,0.428,0.511,0.455,0.336,0.378,0.352,0.186,0.399,0.427,0.51,0.342,0.391,0.477,0.417,0.428,0.457,0.274,0.186,0.356,0.271,0.353,0.296,0.391,0.098,0.326,0.344,0.359,0.313,0.383,0.348,0.318,0.353,0.32,0.355,0.341,0.319,747,Activity,Q837P5_ENTFA,Medium,Prokaryote
+Q8WTC7_9CNID_Somermeyer_2022,0.286,0.368,0.217,0.215,0.31,0.305,0.025,0.303,0.316,0.322,0.223,-0.009,-0.018,-0.027,-0.035,-0.023,-0.025,-0.007,0.034,0.23,0.013,0.037,0.057,0.013,-0.005,-0.001,0.037,0.252,0.267,0.373,0.297,0.307,-0.018,-0.026,0.009,0.303,0.231,0.237,0.318,0.304,0.309,0.345,-0.013,-0.008,-0.024,-0.027,0.097,0.143,0.292,0.189,0.242,0.234,0.246,0.26,0.24,0.254,0.249,0.243,0.238,0.247,0.137,-0.001,33510,Activity,Q8WTC7_9CNID,Low,Eukaryote
+R1AB_SARS2_Flynn_2022,0.577,0.561,0.212,0.227,0.6,0.605,-0.049,0.292,-0.037,-0.037,0.103,-0.03,-0.04,-0.009,-0.026,0.079,0.105,0.498,0.577,0.266,0.214,0.259,0.274,0.289,0.242,0.236,0.203,0.21,0.224,0.558,0.507,0.408,-0.056,0.157,0.22,0.216,0.35,0.399,0.401,0.546,0.567,0.565,-0.031,-0.051,0.073,-0.038,0.453,0.415,0.496,0.232,0.251,0.228,0.265,0.298,0.275,0.274,0.273,0.268,0.237,0.277,0.241,0.137,5725,OrganismalFitness,R1AB_SARS2,Medium,Virus
+RAD_ANTMA_Tsuboyama_2023_2CJJ,0.28,0.183,0.286,0.299,0.324,0.356,0.508,0.373,0.654,0.624,0.507,0.403,0.45,0.458,0.687,0.704,0.493,0.467,0.542,0.412,0.541,0.575,0.536,0.454,0.552,0.419,0.546,0.444,0.417,0.598,0.422,0.365,0.258,0.477,0.601,0.439,0.497,0.588,0.467,0.436,0.486,0.41,0.573,0.222,0.147,0.37,0.46,0.233,0.656,0.548,0.544,0.578,0.577,0.577,0.606,0.581,0.571,0.575,0.573,0.585,0.373,0.582,912,Stability,RAD_ANTMA,High,Eukaryote
+RAF1_HUMAN_Zinkus-Boltz_2019,0.405,0.425,0.382,0.389,0.408,0.408,0.045,0.339,0.445,0.439,0.471,0.423,0.482,0.034,0.242,0.442,0.473,0.433,0.399,0.311,0.285,0.378,0.378,0.339,0.377,0.395,0.378,0.355,0.359,0.439,0.473,0.421,0.12,0.287,0.339,0.363,0.372,0.38,0.396,0.403,0.43,0.441,0.221,0.039,0.48,0.387,0.271,0.383,0.264,0.234,0.456,0.43,0.437,0.483,0.451,0.475,0.439,0.453,0.46,0.472,0.432,0.16,297,OrganismalFitness,RAF1_HUMAN,Low,Human
+RASH_HUMAN_Bandaru_2017,0.447,0.436,0.444,0.476,0.466,0.48,0.313,0.353,0.426,0.446,0.318,0.36,0.405,0.457,0.514,0.487,0.498,0.44,0.313,0.514,0.437,0.414,0.42,0.396,0.433,0.403,0.401,0.374,0.305,0.434,0.338,0.319,0.18,0.399,0.439,0.377,0.454,0.478,0.45,0.472,0.485,0.487,0.478,0.326,0.322,0.453,0.386,0.252,0.458,0.255,0.431,0.423,0.443,0.428,0.456,0.43,0.452,0.43,0.439,0.455,0.373,0.474,3134,Activity,RASH_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_abundance,0.299,0.244,0.356,0.346,0.332,0.353,0.211,0.244,0.244,0.271,0.276,0.21,0.245,0.336,0.367,0.335,0.271,0.231,0.21,0.331,0.266,0.333,0.372,0.384,0.279,0.347,0.402,0.345,0.47,0.357,0.21,0.167,0.208,0.198,0.325,0.398,0.275,0.368,0.436,0.334,0.37,0.388,0.237,0.335,0.058,0.243,0.334,0.327,0.377,0.326,0.264,0.245,0.294,0.279,0.31,0.281,0.298,0.28,0.26,0.287,0.339,0.331,26012,Expression,RASK_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.419,0.47,0.602,0.617,0.598,0.598,0.268,0.35,0.641,0.639,0.585,0.524,0.566,0.573,0.605,0.621,0.652,0.563,0.446,0.677,0.568,0.511,0.549,0.458,0.537,0.534,0.479,0.396,0.358,0.569,0.529,0.49,0.289,0.515,0.589,0.499,0.555,0.601,0.543,0.605,0.616,0.597,0.56,0.298,0.464,0.597,0.239,0.269,0.585,0.281,0.545,0.549,0.542,0.552,0.556,0.554,0.544,0.543,0.51,0.56,0.601,0.573,24873,Binding,RASK_HUMAN,High,Human
+RBP1_HUMAN_Tsuboyama_2023_2KWH,0.19,0.108,0.302,0.307,0.305,0.303,0.271,0.277,0.218,0.233,0.43,0.409,0.404,0.321,0.394,0.491,0.552,0.328,0.335,0.259,0.366,0.108,0.267,0.263,0.297,0.078,0.16,0.293,0.202,0.365,0.373,0.318,0.239,0.291,0.364,0.384,0.318,0.34,0.339,0.317,0.334,0.318,0.319,0.255,0.415,0.406,0.487,0.545,0.578,0.459,0.494,0.479,0.491,0.485,0.492,0.486,0.476,0.493,0.48,0.493,0.539,0.526,1332,Stability,RBP1_HUMAN,High,Human
+RCD1_ARATH_Tsuboyama_2023_5OAO,0.304,0.284,0.362,0.37,0.367,0.355,0.303,0.255,0.412,0.421,0.486,0.342,0.393,0.368,0.392,0.498,0.505,0.482,0.478,0.391,0.267,0.243,0.33,0.395,0.32,0.481,0.454,0.412,0.465,0.398,0.478,0.406,0.046,0.293,0.308,0.324,0.36,0.361,0.352,0.403,0.402,0.386,0.304,0.285,0.371,0.327,0.442,0.432,0.56,0.532,0.541,0.555,0.559,0.559,0.545,0.555,0.551,0.549,0.556,0.56,0.531,0.501,1261,Stability,RCD1_ARATH,Medium,Eukaryote
+RCRO_LAMBD_Tsuboyama_2023_1ORC,0.293,0.518,0.584,0.577,0.557,0.588,0.167,0.263,0.552,0.556,0.558,0.38,0.492,0.226,0.422,0.451,0.596,0.573,0.639,0.573,0.129,0.17,0.225,0.175,-0.041,0.132,-0.003,0.087,0.577,0.59,0.569,0.537,0.096,0.178,0.127,0.567,0.44,0.45,0.58,0.568,0.598,0.621,0.028,0.139,0.471,0.187,0.635,0.646,0.782,0.69,0.592,0.58,0.56,0.585,0.574,0.583,0.568,0.569,0.6,0.584,0.646,0.541,2278,Stability,RCRO_LAMBD,High,Virus
+RD23A_HUMAN_Tsuboyama_2023_1IFY,0.33,0.333,0.466,0.466,0.479,0.479,0.196,0.434,0.512,0.517,0.457,0.55,0.592,0.254,0.675,0.533,0.513,0.464,0.39,0.462,0.422,0.422,0.395,0.453,0.545,0.513,0.51,0.477,0.494,0.51,0.414,0.399,0.429,0.288,0.531,0.475,0.484,0.568,0.54,0.482,0.537,0.536,0.512,0.074,0.556,0.548,0.502,0.548,0.582,0.505,0.47,0.454,0.467,0.449,0.478,0.475,0.475,0.474,0.478,0.481,0.482,0.631,1019,Stability,RD23A_HUMAN,High,Human
+RDRP_I33A0_Li_2023,0.313,0.367,0.376,0.387,0.482,0.484,0.029,0.355,0.52,0.525,0.166,0.045,0.057,0.033,0.041,0.128,0.322,0.389,0.491,0.427,0.385,0.436,0.454,0.475,0.125,0.358,0.342,0.345,0.413,0.52,0.45,0.395,0.075,0.377,0.423,0.459,0.425,0.452,0.474,0.494,0.508,0.526,0.03,0.02,0.171,0.033,0.215,0.23,0.188,0.055,0.3,0.271,0.279,0.282,0.303,0.296,0.283,0.304,0.31,0.307,0.149,0.11,12003,OrganismalFitness,RDRP_I33A0,Low,Virus
+REV_HV1H2_Fernandes_2016,0.206,0.159,0.221,0.227,0.216,0.216,0.038,0.316,0.222,0.232,0.128,0.245,0.267,0.046,0.046,0.173,0.24,0.281,0.274,0.17,0.216,0.259,0.238,0.24,0.29,0.294,0.16,0.253,0.255,0.282,0.35,0.353,0.06,0.231,0.273,0.24,0.245,0.269,0.236,0.246,0.261,0.235,0.053,0.06,0.208,0.071,0.258,0.234,0.304,0.224,0.193,0.221,0.276,0.246,0.267,0.255,0.281,0.241,0.31,0.279,0.274,0.229,2147,OrganismalFitness,REV_HV1H2,Medium,Virus
+RFAH_ECOLI_Tsuboyama_2023_2LCL,0.064,0.21,0.212,0.233,0.23,0.227,-0.015,0.238,0.232,0.258,0.248,0.175,0.2,-0.032,0.078,0.055,0.299,0.261,0.246,0.249,-0.067,0.059,0.121,0.115,-0.043,0.13,0.151,0.114,0.158,0.214,0.268,0.213,-0.009,0.009,0.102,0.133,0.107,0.144,0.156,0.22,0.216,0.208,0.115,-0.075,0.193,0.107,0.304,0.248,0.384,0.328,0.331,0.31,0.291,0.333,0.314,0.3,0.314,0.309,0.327,0.319,0.318,0.22,1326,Stability,RFAH_ECOLI,High,Prokaryote
+RL20_AQUAE_Tsuboyama_2023_1GYZ,0.341,0.588,0.607,0.601,0.603,0.61,0.224,0.577,0.405,0.352,0.675,0.67,0.657,0.235,0.421,0.487,0.699,0.68,0.676,0.613,0.151,0.493,0.47,0.475,0.473,0.481,0.534,0.545,0.622,0.636,0.563,0.544,0.078,0.481,0.522,0.506,0.54,0.579,0.574,0.588,0.624,0.625,0.041,-0.048,0.467,0.283,0.684,0.64,0.816,0.769,0.716,0.715,0.72,0.726,0.713,0.718,0.723,0.724,0.732,0.728,0.738,0.71,1461,Stability,RL20_AQUAE,High,Prokaryote
+RL40A_YEAST_Mavor_2016,0.283,0.37,0.372,0.417,0.38,0.399,0.122,0.372,0.441,0.448,0.192,0.285,0.315,0.104,0.397,0.482,0.518,0.446,0.527,0.43,0.382,0.512,0.48,0.422,0.493,0.433,0.433,0.43,0.4,0.319,0.406,0.408,0.044,0.387,0.451,0.368,0.407,0.456,0.398,0.439,0.469,0.411,0.286,0.071,0.282,0.294,0.068,0.091,0.177,-0.026,0.487,0.457,0.481,0.47,0.499,0.476,0.474,0.498,0.498,0.503,0.338,0.345,1253,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2013,0.313,0.402,0.418,0.47,0.43,0.454,0.086,0.367,0.528,0.528,0.218,0.307,0.367,0.122,0.423,0.534,0.599,0.479,0.556,0.488,0.424,0.553,0.549,0.467,0.537,0.49,0.503,0.487,0.46,0.391,0.523,0.538,0.139,0.437,0.522,0.427,0.459,0.526,0.449,0.494,0.533,0.467,0.357,0.076,0.317,0.319,0.106,0.116,0.253,0.026,0.569,0.529,0.554,0.532,0.566,0.558,0.542,0.582,0.58,0.583,0.381,0.391,1195,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2014,0.371,0.285,0.351,0.387,0.331,0.368,0.161,0.381,0.419,0.442,0.163,0.24,0.267,0.227,0.521,0.529,0.53,0.427,0.438,0.394,0.37,0.466,0.433,0.405,0.439,0.38,0.373,0.336,0.325,0.359,0.35,0.361,0.236,0.39,0.431,0.351,0.425,0.455,0.408,0.42,0.428,0.383,0.292,0.154,0.268,0.295,0.317,0.247,0.38,0.207,0.527,0.511,0.488,0.544,0.547,0.543,0.491,0.517,0.516,0.544,0.302,0.401,1380,Activity,RL40A_YEAST,Medium,Eukaryote
+RNC_ECOLI_Weeks_2023,0.544,0.592,0.57,0.584,0.594,0.595,0.058,0.423,0.591,0.591,0.59,0.579,0.593,0.064,0.557,0.588,0.596,0.595,0.592,0.586,0.554,0.57,0.509,0.499,0.557,0.584,0.583,0.586,0.571,0.594,0.578,0.539,0.187,0.538,0.54,0.431,0.581,0.587,0.545,0.617,0.616,0.606,0.529,0.06,0.588,0.567,0.299,0.524,0.281,0.177,0.541,0.53,0.52,0.553,0.544,0.533,0.548,0.543,0.544,0.554,0.608,0.533,4277,Activity,RNC_ECOLI,Medium,Prokaryote
+RPC1_BP434_Tsuboyama_2023_1R69,0.627,0.681,0.655,0.673,0.606,0.644,0.67,0.593,0.617,0.659,0.711,0.744,0.75,0.722,0.741,0.752,0.708,0.687,0.63,0.697,0.699,0.75,0.735,0.682,0.742,0.731,0.718,0.723,0.66,0.743,0.666,0.609,0.621,0.633,0.738,0.716,0.72,0.747,0.737,0.699,0.693,0.69,0.749,0.674,0.735,0.728,0.696,0.64,0.793,0.723,0.702,0.672,0.687,0.7,0.699,0.69,0.711,0.702,0.711,0.708,0.756,0.764,1459,Stability,RPC1_BP434,High,Virus
+RPC1_LAMBD_Li_2019_high-expression,0.288,0.398,0.484,0.493,0.45,0.447,0.24,0.361,0.457,0.47,0.442,0.461,0.514,0.29,0.289,0.401,0.549,0.52,0.509,0.434,0.176,0.29,0.351,0.386,0.223,0.349,0.323,0.305,0.467,0.383,0.535,0.472,0.11,0.149,0.338,0.457,0.278,0.362,0.464,0.416,0.449,0.491,0.297,0.314,0.418,0.328,0.25,0.407,0.376,0.176,0.544,0.52,0.514,0.532,0.547,0.555,0.533,0.531,0.538,0.553,0.523,0.398,351,Activity,RPC1_LAMBD,High,Virus
+RPC1_LAMBD_Li_2019_low-expression,0.246,0.412,0.475,0.498,0.462,0.455,0.165,0.466,0.45,0.468,0.488,0.466,0.481,0.253,0.263,0.363,0.523,0.579,0.591,0.408,0.195,0.338,0.38,0.415,0.182,0.366,0.321,0.299,0.562,0.447,0.635,0.611,0.119,0.135,0.322,0.463,0.235,0.323,0.454,0.42,0.447,0.498,0.269,0.271,0.357,0.28,0.262,0.375,0.393,0.266,0.52,0.488,0.51,0.519,0.503,0.53,0.522,0.511,0.494,0.528,0.518,0.344,351,Activity,RPC1_LAMBD,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32,0.393,0.31,0.336,0.328,0.321,0.326,0.178,0.188,0.387,0.389,0.468,0.394,0.39,0.289,0.326,0.51,0.409,0.372,0.33,0.291,0.199,0.33,0.317,0.323,0.297,0.34,0.416,0.362,0.367,0.375,0.413,0.396,0.117,0.423,0.325,0.306,0.453,0.377,0.365,0.395,0.348,0.32,0.333,0.262,0.443,0.381,0.582,0.44,0.607,0.55,0.396,0.403,0.403,0.399,0.407,0.377,0.415,0.42,0.395,0.413,0.433,0.646,1195,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance,0.507,0.55,0.564,0.573,0.584,0.597,0.38,0.511,0.608,0.607,0.597,0.621,0.655,0.484,0.537,0.558,0.626,0.599,0.566,0.038,0.492,0.596,0.606,0.59,0.526,0.611,0.629,0.6,0.557,0.582,0.571,0.444,0.33,0.522,0.596,0.59,0.566,0.627,0.631,0.618,0.633,0.633,0.504,0.274,0.584,0.555,0.44,0.547,0.534,0.176,0.582,0.571,0.574,0.594,0.607,0.597,0.594,0.599,0.592,0.609,0.63,0.58,9803,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity,0.45,0.517,0.557,0.57,0.553,0.566,0.347,0.508,0.562,0.557,0.553,0.572,0.614,0.448,0.48,0.477,0.575,0.547,0.525,0.03,0.488,0.579,0.57,0.561,0.526,0.584,0.597,0.567,0.533,0.578,0.596,0.508,0.299,0.504,0.577,0.568,0.527,0.59,0.591,0.586,0.604,0.601,0.458,0.238,0.561,0.501,0.408,0.527,0.503,0.169,0.542,0.52,0.514,0.539,0.56,0.542,0.542,0.559,0.552,0.56,0.542,0.5,10094,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB,0.137,0.201,0.394,0.373,0.407,0.399,0.426,0.503,0.41,0.43,0.481,0.472,0.508,0.238,0.496,0.496,0.512,0.52,0.406,0.43,0.491,0.458,0.449,0.479,0.524,0.519,0.476,0.491,0.482,0.518,0.469,0.419,0.353,0.424,0.497,0.499,0.431,0.498,0.489,0.411,0.443,0.444,0.503,-0.042,0.545,0.531,0.336,0.332,0.443,0.502,0.454,0.348,0.234,0.367,0.362,0.395,0.392,0.445,0.426,0.387,0.575,0.607,965,Stability,SAV1_MOUSE,High,Eukaryote
+SBI_STAAM_Tsuboyama_2023_2JVG,0.232,0.225,0.38,0.345,0.44,0.403,0.205,0.178,0.481,0.533,0.323,0.268,0.297,0.23,0.266,0.376,0.573,0.658,0.318,0.359,0.228,0.263,0.259,0.235,0.21,0.228,0.221,0.245,0.339,0.549,0.582,0.487,0.169,0.195,0.2,0.225,0.297,0.305,0.314,0.355,0.368,0.382,0.262,0.228,0.316,0.305,0.664,0.627,0.687,0.616,0.529,0.526,0.561,0.55,0.545,0.547,0.569,0.565,0.558,0.561,0.683,0.571,1025,Stability,SBI_STAAM,Medium,Prokaryote
+SC6A4_HUMAN_Young_2021,0.387,0.456,0.423,0.433,0.504,0.522,0.352,0.507,0.562,0.574,0.545,0.531,0.542,0.159,0.327,0.502,0.543,0.548,0.524,0.555,0.498,0.511,0.5,0.496,0.512,0.52,0.518,0.525,0.511,0.541,0.463,0.36,0.342,0.513,0.512,0.491,0.53,0.542,0.53,0.549,0.555,0.545,0.489,0.112,0.549,0.531,0.467,0.527,0.501,0.182,0.516,0.526,0.535,0.543,0.545,0.536,0.536,0.538,0.526,0.546,0.578,0.528,11576,Activity,SC6A4_HUMAN,Medium,Human
+SCIN_STAAR_Tsuboyama_2023_2QFF,0.083,0.14,0.261,0.264,0.3,0.272,0.154,0.065,0.271,0.258,0.299,0.298,0.269,0.208,0.305,0.279,0.346,0.406,0.414,0.25,0.106,0.162,0.134,0.144,0.155,0.194,0.152,0.2,0.215,0.229,0.296,0.208,-0.012,0.072,0.104,0.187,0.132,0.132,0.156,0.245,0.235,0.255,0.231,0.177,0.3,0.269,0.525,0.527,0.517,0.477,0.438,0.442,0.437,0.48,0.422,0.478,0.43,0.43,0.391,0.454,0.62,0.513,1212,Stability,SCIN_STAAR,High,Prokaryote
+SCN5A_HUMAN_Glazer_2019,0.13,0.131,0.162,0.162,0.153,0.158,0.13,-0.014,0.163,0.18,0.141,0.217,0.135,0.216,0.177,0.156,0.183,0.152,0.125,0.126,0.106,0.143,0.151,0.127,0.124,0.106,0.093,0.107,0.167,0.144,0.186,0.181,0.095,0.075,0.068,0.069,0.095,0.086,0.086,0.14,0.14,0.152,0.086,0.137,0.165,0.154,0.03,0.092,0.053,0.014,0.127,0.16,0.157,0.127,0.105,0.135,0.143,0.136,0.169,0.146,0.232,0.168,224,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+SDA_BACSU_Tsuboyama_2023_1PV0,0.631,0.646,0.615,0.619,0.647,0.65,0.001,0.303,0.456,0.472,0.606,0.418,0.466,0.249,0.274,0.608,0.594,0.598,0.596,0.53,0.026,0.067,0.003,0.238,0.081,0.123,0.045,0.119,0.214,0.64,0.604,0.581,-0.165,0.01,0.26,0.107,0.607,0.613,0.368,0.652,0.65,0.486,0.037,-0.079,0.336,0.011,0.434,0.41,0.608,0.597,0.65,0.641,0.588,0.647,0.63,0.613,0.622,0.623,0.653,0.633,0.639,0.636,2770,Stability,SDA_BACSU,Medium,Prokaryote
+SERC_HUMAN_Xie_2023,0.375,0.483,0.544,0.544,0.52,0.53,0.013,0.456,0.57,0.576,0.518,0.541,0.547,0.183,0.407,0.529,0.558,0.577,0.543,0.535,0.492,0.511,0.513,0.484,0.5,0.53,0.518,0.534,0.504,0.506,0.516,0.469,0.192,0.513,0.528,0.519,0.528,0.54,0.544,0.548,0.551,0.552,0.319,0.031,0.54,0.46,0.382,0.461,0.352,0.189,0.539,0.524,0.54,0.551,0.558,0.548,0.553,0.55,0.543,0.561,0.562,0.478,1914,OrganismalFitness,SERC_HUMAN,High,Human
+SHOC2_HUMAN_Kwon_2022,0.208,0.328,0.378,0.383,0.37,0.379,0.226,0.36,0.424,0.422,0.395,0.384,0.41,0.242,0.249,0.267,0.426,0.388,0.29,0.388,0.262,0.363,0.385,0.355,0.267,0.394,0.39,0.4,0.358,0.407,0.416,0.366,0.153,0.254,0.308,0.376,0.264,0.302,0.367,0.367,0.367,0.396,0.262,0.236,0.421,0.291,0.303,0.358,0.286,0.095,0.373,0.367,0.387,0.39,0.396,0.398,0.381,0.392,0.384,0.398,0.322,0.286,10972,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+SOX30_HUMAN_Tsuboyama_2023_7JJK,0.225,0.241,0.235,0.242,0.282,0.273,0.369,0.17,0.21,0.246,0.254,0.335,0.336,0.298,0.396,0.333,0.322,0.262,0.261,0.223,0.166,0.167,0.162,0.19,0.264,0.247,0.202,0.253,0.189,0.231,0.087,0.06,0.222,0.133,0.267,0.229,0.217,0.241,0.234,0.275,0.274,0.262,0.271,0.15,0.208,0.356,0.526,0.346,0.53,0.464,0.333,0.38,0.348,0.333,0.326,0.385,0.333,0.334,0.334,0.356,0.333,0.466,1010,Stability,SOX30_HUMAN,High,Human
+SPA_STAAU_Tsuboyama_2023_1LP1,0.432,0.509,0.518,0.501,0.552,0.549,-0.093,0.495,0.409,0.408,0.393,-0.041,0.001,-0.032,-0.033,-0.064,0.005,-0.01,0.02,0.514,-0.154,0.032,0.205,0.105,-0.081,-0.041,0.057,0.338,0.428,0.564,0.432,0.442,-0.034,-0.011,0.001,0.066,0.437,0.43,0.427,0.548,0.547,0.536,-0.137,-0.087,-0.071,-0.009,0.512,0.453,0.705,0.56,0.52,0.511,0.493,0.534,0.498,0.515,0.51,0.501,0.504,0.513,0.546,0.414,2105,Stability,SPA_STAAU,Medium,Prokaryote
+SPG1_STRSG_Olson_2014,0.239,0.282,-0.004,0.006,0.247,0.272,-0.041,0.017,-0.038,0.171,0.305,0.237,0.192,0.28,0.232,0.264,0.301,0.251,0.337,0.278,0.252,0.216,0.214,0.208,0.232,0.222,0.218,0.239,0.354,0.283,0.478,0.494,0.111,0.223,0.175,0.279,0.243,0.204,0.289,0.229,0.188,0.29,-0.092,-0.112,0.158,0.004,0.362,0.306,0.423,0.147,0.38,0.315,0.364,0.414,0.414,0.36,0.418,0.417,0.41,0.407,0.387,0.374,536962,Binding,SPG1_STRSG,Low,Prokaryote
+SPG1_STRSG_Wu_2016,-0.051,0.025,0.048,0.048,0.036,0.042,0.036,-0.048,0.17,0.171,0.121,0.086,0.059,0.03,0.097,0.118,0.128,0.165,0.199,0.083,-0.012,-0.033,-0.051,-0.023,-0.061,0.016,-0.014,-0.031,-0.031,0.102,0.146,0.125,-0.083,0.001,-0.085,-0.024,0.039,0.013,0.021,0.044,0.027,0.027,-0.013,-0.099,0.04,-0.026,0.091,0.023,0.18,0.089,0.144,0.136,0.132,0.176,0.156,0.131,0.194,0.167,0.177,0.159,0.171,0.114,149360,Binding,SPG1_STRSG,Medium,Prokaryote
+SPG2_STRSG_Tsuboyama_2023_5UBS,0.439,0.517,0.496,0.539,0.569,0.565,0.325,0.309,0.543,0.478,0.501,0.437,0.409,0.366,0.432,0.451,0.5,0.504,0.528,0.507,0.388,0.433,0.436,0.392,0.343,0.465,0.356,0.485,0.501,0.614,0.561,0.536,-0.192,0.356,0.44,0.462,0.294,0.335,0.382,0.561,0.562,0.569,0.253,0.089,0.286,0.24,0.447,0.369,0.704,0.57,0.564,0.566,0.584,0.592,0.569,0.595,0.588,0.61,0.584,0.589,0.598,0.549,1451,Stability,SPG2_STRSG,Medium,Prokaryote
+SPIKE_SARS2_Starr_2020_binding,0.158,0.205,0.062,0.151,0.249,0.252,-0.046,0.369,0.356,0.36,-0.002,-0.064,-0.039,-0.049,-0.036,-0.022,-0.018,-0.008,0.038,0.337,0.305,0.368,0.357,0.364,0.384,0.347,0.315,0.378,0.319,0.247,0.324,0.357,0.171,0.338,0.328,0.355,0.334,0.325,0.317,0.356,0.346,0.341,-0.036,-0.013,-0.022,-0.008,0.517,0.483,0.405,0.16,0.224,0.241,0.226,0.297,0.251,0.301,0.258,0.279,0.228,0.279,0.357,0.159,3802,Binding,SPIKE_SARS2,Medium,Virus
+SPIKE_SARS2_Starr_2020_expression,0.2,0.314,0.165,0.279,0.445,0.45,-0.016,0.352,0.413,0.446,0.05,-0.023,0.004,-0.026,0.001,0.01,0.012,0.018,0.072,0.355,0.317,0.382,0.374,0.387,0.379,0.367,0.322,0.397,0.328,0.359,0.313,0.346,0.178,0.344,0.346,0.383,0.361,0.361,0.367,0.451,0.455,0.475,0.005,0.022,0.024,0.041,0.561,0.533,0.465,0.187,0.321,0.354,0.33,0.422,0.379,0.413,0.384,0.392,0.363,0.405,0.498,0.266,3798,Expression,SPIKE_SARS2,Medium,Virus
+SPTN1_CHICK_Tsuboyama_2023_1TUD,0.572,0.613,0.647,0.599,0.632,0.615,0.23,0.524,0.552,0.593,0.637,0.649,0.641,-0.133,0.619,0.627,0.637,0.726,0.657,0.624,0.578,0.643,0.604,0.589,0.589,0.613,0.652,0.563,0.601,0.621,0.631,0.632,0.457,0.508,0.57,0.548,0.629,0.641,0.621,0.623,0.628,0.618,0.488,-0.221,0.465,0.498,0.421,0.401,0.62,0.623,0.693,0.679,0.683,0.685,0.696,0.685,0.676,0.674,0.696,0.691,0.667,0.526,3201,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,0.399,0.522,0.597,0.61,0.604,0.604,0.086,0.449,0.565,0.592,0.514,0.373,0.503,0.186,0.442,0.529,0.618,0.64,0.527,0.607,0.218,0.474,0.49,0.505,0.345,0.544,0.569,0.506,0.538,0.623,0.566,0.526,0.155,0.194,0.444,0.518,0.502,0.54,0.589,0.614,0.604,0.625,0.209,0.119,0.582,0.583,0.651,0.626,0.629,0.619,0.626,0.582,0.635,0.62,0.634,0.628,0.634,0.61,0.603,0.639,0.682,0.514,707,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88,0.596,0.658,0.684,0.681,0.693,0.684,-0.19,0.508,0.638,0.61,0.702,0.639,0.654,-0.267,0.662,0.702,0.675,0.698,0.681,0.634,-0.109,0.377,0.382,0.504,0.493,0.575,0.607,0.608,0.61,0.665,0.674,0.635,0.359,0.039,0.189,0.368,0.667,0.669,0.673,0.685,0.689,0.68,0.575,-0.296,0.703,0.618,0.214,0.637,0.731,0.716,0.677,0.673,0.67,0.692,0.696,0.68,0.668,0.669,0.685,0.684,0.788,0.744,1583,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,0.36,0.511,0.696,0.694,0.723,0.718,0.549,0.362,0.743,0.742,0.628,0.664,0.7,0.498,0.713,0.731,0.734,0.673,0.694,0.653,0.679,0.649,0.653,0.62,0.642,0.637,0.659,0.63,0.607,0.694,0.696,0.658,0.577,0.597,0.637,0.645,0.66,0.697,0.705,0.701,0.727,0.728,0.388,0.084,0.538,0.532,0.579,0.538,0.64,0.661,0.751,0.731,0.734,0.736,0.746,0.741,0.735,0.735,0.74,0.75,0.707,0.689,1556,Stability,SRBS1_HUMAN,High,Human
+SRC_HUMAN_Ahler_2019,0.526,0.508,0.465,0.465,0.496,0.507,0.532,0.44,0.514,0.532,0.469,0.561,0.585,0.447,0.486,0.49,0.555,0.515,0.467,0.484,0.439,0.413,0.421,0.373,0.437,0.46,0.438,0.422,0.33,0.542,0.578,0.574,0.431,0.41,0.411,0.348,0.502,0.502,0.492,0.521,0.52,0.516,0.567,0.397,0.487,0.546,0.309,0.296,0.409,0.098,0.517,0.495,0.492,0.517,0.524,0.521,0.517,0.527,0.524,0.532,0.506,0.479,3372,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.435,0.427,0.41,0.433,0.441,0.437,0.454,0.401,0.458,0.454,0.403,0.486,0.506,0.358,0.39,0.393,0.465,0.426,0.409,0.397,0.372,0.35,0.354,0.309,0.37,0.395,0.364,0.35,0.257,0.456,0.498,0.495,0.36,0.339,0.335,0.28,0.414,0.411,0.401,0.443,0.441,0.437,0.475,0.316,0.423,0.458,0.238,0.246,0.324,0.046,0.434,0.408,0.412,0.437,0.441,0.438,0.432,0.442,0.447,0.447,0.431,0.391,3637,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Nguyen_2022,0.428,0.426,0.413,0.428,0.429,0.425,0.442,0.405,0.357,0.411,0.401,0.483,0.506,0.342,0.378,0.387,0.471,0.443,0.415,0.396,0.373,0.344,0.35,0.306,0.381,0.397,0.372,0.358,0.269,0.453,0.493,0.484,0.355,0.348,0.343,0.282,0.416,0.412,0.399,0.44,0.437,0.43,0.477,0.304,0.427,0.466,0.221,0.227,0.31,0.063,0.439,0.405,0.409,0.432,0.441,0.433,0.434,0.444,0.441,0.445,0.43,0.382,3366,OrganismalFitness,SRC_HUMAN,Medium,Human
+SUMO1_HUMAN_Weile_2017,0.369,0.373,0.419,0.438,0.48,0.478,0.13,0.425,0.449,0.386,0.433,0.467,0.51,0.25,0.496,0.533,0.509,0.38,0.34,0.517,0.217,0.414,0.443,0.43,0.468,0.386,0.46,0.431,0.333,0.438,0.46,0.445,0.212,0.245,0.473,0.329,0.405,0.511,0.41,0.452,0.522,0.463,0.49,0.086,0.395,0.527,0.497,0.428,0.509,0.412,0.492,0.453,0.5,0.5,0.493,0.517,0.479,0.498,0.491,0.511,0.479,0.524,1700,OrganismalFitness,SUMO1_HUMAN,High,Human
+SYUA_HUMAN_Newberry_2020,0.103,0.111,0.119,0.132,0.131,0.139,0.131,0.212,0.171,0.162,0.234,0.242,0.233,0.105,0.142,0.15,0.137,0.206,0.205,0.22,0.138,0.203,0.155,0.141,0.086,0.151,0.167,0.105,0.136,0.227,0.205,0.189,0.005,0.137,0.197,0.181,0.124,0.176,0.159,0.134,0.166,0.156,0.115,0.068,0.16,0.155,-0.052,0.046,-0.06,-0.074,0.069,0.056,0.053,0.088,0.129,0.098,0.127,0.102,0.144,0.098,-0.011,-0.045,2497,OrganismalFitness,SYUA_HUMAN,Medium,Human
+TADBP_HUMAN_Bolognesi_2019,0.097,0.061,0.1,0.096,0.08,0.08,0.291,-0.018,0.077,0.072,0.013,0.051,0.048,0.121,0.047,0.021,-0.12,-0.028,0.053,-0.075,0.158,0.063,-0.011,-0.006,0.206,0.082,0.006,0.023,-0.014,0.036,0.088,0.112,-0.051,0.226,0.273,0.121,0.142,0.185,0.121,0.105,0.129,0.109,0.121,0.212,-0.002,0.078,0.271,0.033,0.223,0.059,0.105,0.013,0.004,0.04,0.002,0.034,0.045,-0.011,-0.079,0.021,0.054,0.121,1196,OrganismalFitness,TADBP_HUMAN,Low,Human
+TAT_HV1BR_Fernandes_2016,0.293,0.201,0.255,0.268,0.319,0.3,-0.09,0.397,0.274,0.288,0.185,0.343,0.342,-0.034,-0.008,-0.005,0.017,-0.057,0.044,0.203,0.379,0.364,0.396,0.398,0.394,0.269,0.136,0.246,0.223,0.368,0.405,0.387,0.283,0.385,0.211,0.203,0.379,0.263,0.237,0.366,0.295,0.268,0.005,-0.054,0.292,0.181,0.23,0.284,0.344,0.207,0.114,0.134,0.142,0.137,0.116,0.133,0.12,0.117,0.117,0.134,0.157,0.116,1577,OrganismalFitness,TAT_HV1BR,High,Virus
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,0.576,0.615,0.644,0.655,0.66,0.662,0.696,0.292,0.677,0.708,0.591,0.739,0.75,0.724,0.707,0.784,0.769,0.707,0.745,0.577,0.49,0.552,0.57,0.609,0.604,0.613,0.66,0.648,0.597,0.68,0.64,0.595,0.642,0.485,0.619,0.669,0.688,0.715,0.71,0.68,0.712,0.695,0.753,0.184,0.629,0.752,0.68,0.596,0.774,0.759,0.721,0.697,0.657,0.757,0.733,0.763,0.723,0.738,0.71,0.735,0.79,0.774,1058,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG,0.309,0.397,0.482,0.473,0.483,0.481,0.072,0.061,0.523,0.52,0.511,0.502,0.532,0.186,0.557,0.548,0.513,0.532,0.546,0.515,-0.047,0.332,0.349,0.305,0.351,0.383,0.279,0.423,0.534,0.543,0.526,0.512,0.48,0.003,0.403,0.337,0.446,0.505,0.495,0.472,0.51,0.511,0.301,0.114,0.463,0.389,0.504,0.484,0.561,0.597,0.555,0.546,0.566,0.573,0.584,0.548,0.561,0.551,0.545,0.561,0.502,0.483,1279,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,0.464,0.473,0.506,0.495,0.507,0.502,-0.099,0.392,0.573,0.576,0.472,0.538,0.526,0.044,0.547,0.589,0.525,0.529,0.533,0.466,0.262,0.361,0.429,0.394,0.395,0.455,0.385,0.395,0.436,0.528,0.484,0.451,0.25,0.061,0.312,0.333,0.535,0.523,0.517,0.541,0.518,0.518,0.343,-0.03,0.572,0.492,0.526,0.649,0.675,0.624,0.51,0.53,0.516,0.512,0.517,0.511,0.521,0.512,0.513,0.524,0.597,0.604,1479,Stability,TNKS2_HUMAN,High,Human
+TPK1_HUMAN_Weile_2017,0.217,0.236,0.219,0.228,0.23,0.229,0.067,0.253,0.286,0.27,0.285,0.27,0.318,0.108,0.199,0.253,0.339,0.328,0.371,0.249,0.088,0.142,0.228,0.272,0.115,0.267,0.273,0.253,0.306,0.255,0.253,0.188,0.117,0.115,0.214,0.301,0.232,0.252,0.305,0.243,0.252,0.277,0.12,0.089,0.309,0.17,0.206,0.267,0.245,0.116,0.253,0.277,0.275,0.278,0.293,0.291,0.281,0.291,0.292,0.298,0.268,0.203,3181,OrganismalFitness,TPK1_HUMAN,Medium,Human
+TPMT_HUMAN_Matreyek_2018,0.372,0.456,0.489,0.509,0.499,0.513,0.241,0.445,0.497,0.5,0.546,0.517,0.547,0.308,0.433,0.531,0.539,0.476,0.448,0.511,0.333,0.42,0.481,0.496,0.447,0.506,0.481,0.466,0.424,0.547,0.542,0.489,0.334,0.33,0.471,0.411,0.478,0.513,0.478,0.517,0.537,0.515,0.371,0.204,0.539,0.495,0.476,0.534,0.54,0.238,0.522,0.519,0.53,0.514,0.541,0.535,0.541,0.529,0.534,0.543,0.568,0.469,3648,Expression,TPMT_HUMAN,Medium,Human
+TPOR_HUMAN_Bridgford_2020,0.376,0.365,0.283,0.263,0.288,0.296,0.37,0.455,0.454,0.456,0.407,0.368,0.37,0.279,0.378,0.294,0.29,0.422,0.362,0.362,0.25,0.338,0.313,0.377,0.367,0.2,0.471,0.451,0.34,0.45,0.436,0.274,0.395,0.372,0.321,0.419,0.429,0.408,0.451,0.408,0.382,0.437,0.295,0.303,0.353,0.348,0.126,0.357,0.309,0.035,0.344,0.308,0.344,0.354,0.322,0.345,0.396,0.257,0.293,0.341,0.389,0.372,562,OrganismalFitness,TPOR_HUMAN,Low,Human
+TRPC_SACS2_Chan_2017,0.575,0.606,0.558,0.574,0.575,0.585,0.173,0.499,0.655,0.671,0.621,0.611,0.643,0.325,0.539,0.618,0.653,0.632,0.601,0.592,0.421,0.564,0.508,0.575,0.498,0.499,0.558,0.521,0.524,0.529,0.59,0.52,0.221,0.52,0.555,0.558,0.575,0.598,0.591,0.59,0.612,0.601,0.434,0.072,0.612,0.498,0.321,0.512,0.507,0.122,0.593,0.594,0.582,0.646,0.614,0.615,0.621,0.63,0.614,0.636,0.626,0.578,1519,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+TRPC_THEMA_Chan_2017,0.405,0.45,0.394,0.411,0.416,0.417,0.395,0.434,0.471,0.47,0.439,0.478,0.506,0.327,0.47,0.503,0.464,0.47,0.499,0.43,0.336,0.399,0.395,0.455,0.437,0.421,0.369,0.387,0.478,0.39,0.427,0.365,0.14,0.329,0.46,0.448,0.39,0.442,0.438,0.418,0.445,0.442,0.397,0.14,0.461,0.395,0.253,0.386,0.393,0.075,0.459,0.41,0.431,0.462,0.463,0.409,0.439,0.441,0.418,0.454,0.504,0.491,1519,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+UBC9_HUMAN_Weile_2017,0.371,0.496,0.521,0.531,0.508,0.522,-0.042,0.403,0.512,0.519,0.42,0.477,0.509,0.001,0.004,0.403,0.473,0.518,0.538,0.557,0.22,0.405,0.445,0.419,0.42,0.479,0.471,0.449,0.442,0.484,0.454,0.432,0.071,0.233,0.403,0.428,0.327,0.452,0.475,0.443,0.517,0.54,0.277,-0.008,0.427,0.492,0.322,0.35,0.446,0.243,0.44,0.417,0.408,0.433,0.425,0.427,0.433,0.44,0.442,0.439,0.37,0.426,2563,OrganismalFitness,UBC9_HUMAN,Medium,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,0.325,0.443,0.464,0.467,0.466,0.466,0.051,0.346,0.379,0.415,0.391,0.453,0.449,0.347,0.42,0.49,0.46,0.467,0.477,0.421,0.142,0.353,0.357,0.364,0.345,0.374,0.343,0.322,0.355,0.486,0.475,0.453,0.441,0.079,0.11,0.255,0.383,0.385,0.329,0.472,0.477,0.424,0.359,-0.095,0.348,0.344,0.488,0.409,0.52,0.351,0.411,0.436,0.424,0.447,0.448,0.428,0.407,0.429,0.42,0.432,0.5,0.407,3622,Stability,UBE4B_HUMAN,High,Human
+UBE4B_MOUSE_Starita_2013,0.412,0.417,0.454,0.468,0.463,0.47,0.078,0.089,0.392,0.394,0.361,0.447,0.471,0.399,0.463,0.459,0.459,0.351,0.349,0.383,0.122,0.324,0.337,0.3,0.444,0.376,0.395,0.376,0.289,0.433,0.437,0.396,0.091,0.026,0.096,0.262,0.419,0.351,0.39,0.467,0.454,0.472,0.409,0.041,0.42,0.458,0.307,0.393,-0.017,0.118,0.426,0.409,0.456,0.458,0.438,0.458,0.432,0.449,0.443,0.454,0.44,0.412,899,Activity,UBE4B_MOUSE,Low,Eukaryote
+UBR5_HUMAN_Tsuboyama_2023_1I2T,0.442,0.55,0.54,0.538,0.566,0.577,0.152,0.251,0.498,0.528,0.528,0.425,0.511,0.226,0.226,0.158,0.197,0.591,0.601,0.476,0.422,0.414,0.467,0.514,0.473,0.462,0.495,0.488,0.492,0.557,0.497,0.388,0.288,0.304,0.455,0.401,0.484,0.519,0.502,0.592,0.594,0.583,0.133,0.121,0.422,0.209,0.557,0.439,0.637,0.574,0.56,0.588,0.597,0.609,0.599,0.612,0.589,0.605,0.585,0.6,0.661,0.597,1453,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8,0.228,0.314,0.228,0.246,0.266,0.255,0.256,0.064,0.371,0.392,0.623,0.544,0.562,0.36,0.514,0.601,0.655,0.611,0.653,0.167,0.395,0.474,0.458,0.531,0.419,0.425,0.41,0.467,0.516,0.487,0.51,0.386,0.038,0.307,0.473,0.464,0.343,0.415,0.464,0.336,0.403,0.409,0.469,0.295,0.552,0.483,0.458,0.561,0.543,0.505,0.652,0.626,0.661,0.661,0.664,0.691,0.634,0.658,0.684,0.674,0.619,0.6,723,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5,0.284,0.422,0.594,0.63,0.634,0.637,0.162,0.54,0.553,0.59,0.673,0.63,0.673,0.371,0.316,0.652,0.662,0.589,0.478,0.592,0.293,0.431,0.601,0.524,0.424,0.575,0.494,0.482,0.579,0.599,0.701,0.694,0.322,0.346,0.515,0.521,0.487,0.564,0.563,0.634,0.659,0.642,0.272,0.283,0.426,0.421,0.542,0.433,0.775,0.702,0.672,0.658,0.666,0.658,0.653,0.665,0.649,0.656,0.652,0.662,0.685,0.798,2568,Stability,VILI_CHICK,High,Eukaryote
+VKOR1_HUMAN_Chiasson_2020_abundance,0.409,0.416,0.402,0.422,0.437,0.437,0.245,0.396,0.501,0.52,0.438,0.447,0.474,0.251,0.443,0.5,0.493,0.484,0.472,0.464,0.286,0.298,0.396,0.43,0.368,0.471,0.463,0.437,0.442,0.481,0.383,0.288,0.09,0.256,0.364,0.505,0.462,0.483,0.548,0.476,0.492,0.538,0.236,0.18,0.464,0.384,0.446,0.441,0.49,0.279,0.52,0.49,0.53,0.543,0.53,0.513,0.526,0.508,0.525,0.533,0.494,0.479,2695,Expression,VKOR1_HUMAN,Medium,Human
+VKOR1_HUMAN_Chiasson_2020_activity,0.329,0.368,0.399,0.41,0.413,0.428,0.0,0.386,0.423,0.441,0.419,0.427,0.448,0.016,0.298,0.369,0.43,0.446,0.445,0.415,0.051,0.073,0.272,0.31,0.132,0.384,0.384,0.36,0.385,0.421,0.431,0.388,0.179,0.022,0.135,0.37,0.339,0.35,0.401,0.409,0.42,0.436,0.084,0.021,0.416,0.226,0.225,0.334,0.366,0.1,0.404,0.358,0.369,0.389,0.406,0.389,0.391,0.394,0.394,0.401,0.427,0.322,697,Activity,VKOR1_HUMAN,Medium,Human
+VRPI_BPT7_Tsuboyama_2023_2WNM,-0.119,0.043,0.164,0.194,0.078,0.119,0.082,0.147,0.366,0.356,0.498,0.368,0.374,0.252,0.41,0.507,0.587,0.612,0.468,0.128,-0.02,0.135,0.133,0.222,0.215,0.202,0.131,0.199,0.291,0.084,0.499,0.429,0.242,0.065,0.059,0.122,-0.039,-0.028,-0.057,0.064,0.083,0.006,0.254,0.056,0.501,0.357,0.589,0.557,0.593,0.606,0.612,0.592,0.633,0.639,0.622,0.641,0.615,0.63,0.604,0.64,0.662,0.582,1047,Stability,VRPI_BPT7,Medium,Virus
+YAIA_ECOLI_Tsuboyama_2023_2KVT,0.291,0.602,0.572,0.596,0.599,0.591,-0.221,0.533,0.559,0.598,0.593,0.27,0.481,-0.107,0.173,0.572,0.625,0.684,0.684,0.591,-0.091,-0.182,-0.13,0.264,-0.125,-0.166,-0.231,-0.023,0.52,0.605,0.681,0.643,-0.06,-0.234,-0.064,0.502,0.349,0.359,0.528,0.572,0.569,0.604,-0.046,-0.261,0.208,-0.013,0.449,0.425,0.651,0.544,0.587,0.627,0.585,0.619,0.612,0.627,0.625,0.657,0.677,0.632,0.616,0.498,1890,Stability,YAIA_ECOLI,Medium,Prokaryote
+YAP1_HUMAN_Araya_2012,0.428,0.321,0.464,0.46,0.449,0.455,0.329,0.289,0.018,0.036,0.333,0.28,0.285,0.411,0.457,0.451,0.466,0.382,0.313,0.189,0.18,0.176,0.158,0.167,0.299,0.226,0.257,0.207,0.15,0.332,0.425,0.451,0.139,0.302,0.196,0.217,0.402,0.326,0.359,0.435,0.405,0.439,0.485,-0.104,0.335,0.486,0.34,0.362,0.383,0.197,0.446,0.379,0.432,0.426,0.417,0.412,0.415,0.441,0.444,0.438,0.386,0.422,10075,Binding,YAP1_HUMAN,Low,Human
+YNZC_BACSU_Tsuboyama_2023_2JVD,0.596,0.61,0.595,0.618,0.604,0.609,0.492,0.394,0.591,0.583,0.673,0.649,0.667,0.584,0.621,0.65,0.658,0.631,0.63,0.525,0.488,0.534,0.612,0.604,0.594,0.611,0.587,0.59,0.643,0.594,0.586,0.547,0.568,0.573,0.567,0.544,0.665,0.634,0.621,0.669,0.624,0.63,0.446,0.353,0.536,0.448,0.483,0.562,0.68,0.65,0.701,0.649,0.665,0.687,0.7,0.691,0.687,0.676,0.679,0.69,0.638,0.76,2300,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.html b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.html
new file mode 100644
index 0000000..0b57558
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_DMS_level.html
@@ -0,0 +1,15196 @@
+
+
+
+ score |
+ Site-Independent |
+ EVmutation |
+ DeepSequence (single) |
+ DeepSequence (ensemble) |
+ EVE (single) |
+ EVE (ensemble) |
+ Unirep |
+ Unirep evotuned |
+ MSA Transformer (single) |
+ MSA Transformer (ensemble) |
+ ESM-1b |
+ ESM-1v (single) |
+ ESM-1v (ensemble) |
+ ESM2 (8M) |
+ ESM2 (35M) |
+ ESM2 (150M) |
+ ESM2 (650M) |
+ ESM2 (3B) |
+ ESM2 (15B) |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ GEMME |
+ VESPA |
+ VESPAl |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ CARP (38M) |
+ CARP (600K) |
+ CARP (640M) |
+ CARP (76M) |
+ MIF |
+ MIF-ST |
+ ESM-IF1 |
+ ProteinMPNN |
+ ProtSSN (k=10 h=512) |
+ ProtSSN (k=10 h=768) |
+ ProtSSN (k=10 h=1280) |
+ ProtSSN (k=20 h=512) |
+ ProtSSN (k=20 h=768) |
+ ProtSSN (k=20 h=1280) |
+ ProtSSN (k=30 h=512) |
+ ProtSSN (k=30 h=768) |
+ ProtSSN (k=30 h=1280) |
+ ProtSSN (ensemble) |
+ SaProt (650M) |
+ SaProt (35M) |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A0A140D2T1_ZIKV_Sourisseau_2019 |
+ 0.383 |
+ 0.354 |
+ 0.131 |
+ 0.103 |
+ 0.394 |
+ 0.405 |
+ -0.133 |
+ 0.062 |
+ 0.436 |
+ 0.439 |
+ -0.001 |
+ -0.048 |
+ 0.015 |
+ -0.073 |
+ -0.054 |
+ -0.058 |
+ 0.213 |
+ 0.363 |
+ 0.375 |
+ 0.216 |
+ 0.361 |
+ 0.309 |
+ 0.317 |
+ 0.305 |
+ 0.329 |
+ 0.342 |
+ 0.328 |
+ 0.312 |
+ 0.293 |
+ 0.430 |
+ 0.319 |
+ 0.296 |
+ 0.005 |
+ 0.304 |
+ 0.331 |
+ 0.272 |
+ 0.362 |
+ 0.366 |
+ 0.351 |
+ 0.361 |
+ 0.358 |
+ 0.373 |
+ -0.060 |
+ -0.073 |
+ 0.131 |
+ -0.061 |
+ 0.273 |
+ 0.290 |
+ 0.287 |
+ 0.129 |
+ 0.253 |
+ 0.251 |
+ 0.282 |
+ 0.296 |
+ 0.277 |
+ 0.269 |
+ 0.263 |
+ 0.269 |
+ 0.265 |
+ 0.278 |
+ 0.199 |
+ 0.116 |
+ 9576 |
+ OrganismalFitness |
+ A0A140D2T1_ZIKV |
+ Medium |
+ Virus |
+
+
+ A0A192B1T2_9HIV1_Haddox_2018 |
+ 0.481 |
+ 0.407 |
+ 0.413 |
+ 0.432 |
+ 0.509 |
+ 0.516 |
+ 0.000 |
+ 0.513 |
+ 0.511 |
+ 0.515 |
+ 0.456 |
+ 0.492 |
+ 0.516 |
+ -0.003 |
+ 0.015 |
+ 0.035 |
+ 0.080 |
+ 0.132 |
+ 0.164 |
+ 0.465 |
+ 0.496 |
+ 0.507 |
+ 0.509 |
+ 0.505 |
+ 0.497 |
+ 0.501 |
+ 0.463 |
+ 0.490 |
+ 0.484 |
+ 0.496 |
+ 0.541 |
+ 0.507 |
+ 0.327 |
+ 0.492 |
+ 0.483 |
+ 0.514 |
+ 0.509 |
+ 0.503 |
+ 0.513 |
+ 0.524 |
+ 0.518 |
+ 0.528 |
+ 0.422 |
+ -0.021 |
+ 0.497 |
+ 0.429 |
+ 0.329 |
+ 0.451 |
+ 0.206 |
+ 0.136 |
+ 0.202 |
+ 0.230 |
+ 0.258 |
+ 0.263 |
+ 0.210 |
+ 0.226 |
+ 0.228 |
+ 0.200 |
+ 0.183 |
+ 0.239 |
+ 0.173 |
+ 0.084 |
+ 12577 |
+ OrganismalFitness |
+ A0A192B1T2_9HIV1 |
+ Medium |
+ Virus |
+
+
+ A0A1I9GEU1_NEIME_Kennouche_2019 |
+ -0.011 |
+ 0.044 |
+ 0.107 |
+ 0.098 |
+ 0.053 |
+ 0.054 |
+ -0.024 |
+ 0.084 |
+ 0.082 |
+ 0.077 |
+ 0.040 |
+ 0.068 |
+ 0.068 |
+ -0.037 |
+ -0.047 |
+ -0.016 |
+ 0.030 |
+ 0.027 |
+ 0.025 |
+ 0.067 |
+ -0.010 |
+ 0.047 |
+ 0.071 |
+ 0.088 |
+ 0.050 |
+ 0.088 |
+ 0.080 |
+ 0.089 |
+ 0.095 |
+ 0.045 |
+ 0.046 |
+ 0.036 |
+ 0.030 |
+ 0.036 |
+ 0.055 |
+ 0.099 |
+ 0.031 |
+ 0.041 |
+ 0.057 |
+ 0.058 |
+ 0.061 |
+ 0.075 |
+ -0.055 |
+ -0.056 |
+ 0.039 |
+ -0.043 |
+ 0.051 |
+ 0.055 |
+ 0.039 |
+ 0.040 |
+ 0.054 |
+ 0.042 |
+ 0.039 |
+ 0.035 |
+ 0.037 |
+ 0.052 |
+ 0.039 |
+ 0.058 |
+ 0.041 |
+ 0.047 |
+ 0.040 |
+ -0.005 |
+ 922 |
+ Activity |
+ A0A1I9GEU1_NEIME |
+ Medium |
+ Prokaryote |
+
+
+ A0A247D711_LISMN_Stadelmann_2021 |
+ 0.436 |
+ 0.459 |
+ 0.109 |
+ 0.041 |
+ 0.428 |
+ 0.428 |
+ 0.003 |
+ 0.026 |
+ 0.299 |
+ 0.320 |
+ 0.087 |
+ 0.088 |
+ 0.093 |
+ 0.064 |
+ 0.084 |
+ 0.097 |
+ 0.066 |
+ 0.084 |
+ 0.084 |
+ 0.059 |
+ 0.021 |
+ 0.015 |
+ 0.040 |
+ 0.078 |
+ -0.029 |
+ 0.067 |
+ 0.010 |
+ 0.037 |
+ 0.106 |
+ 0.473 |
+ 0.337 |
+ 0.351 |
+ 0.093 |
+ 0.041 |
+ 0.024 |
+ 0.057 |
+ 0.301 |
+ 0.292 |
+ 0.311 |
+ 0.253 |
+ 0.242 |
+ 0.263 |
+ 0.047 |
+ 0.002 |
+ 0.074 |
+ 0.084 |
+ 0.418 |
+ 0.441 |
+ 0.476 |
+ 0.364 |
+ 0.274 |
+ 0.326 |
+ 0.314 |
+ 0.344 |
+ 0.313 |
+ 0.360 |
+ 0.312 |
+ 0.331 |
+ 0.265 |
+ 0.335 |
+ 0.427 |
+ 0.280 |
+ 1653 |
+ Activity |
+ A0A247D711_LISMN |
+ High |
+ Prokaryote |
+
+
+ A0A2Z5U3Z0_9INFA_Doud_2016 |
+ 0.478 |
+ 0.473 |
+ 0.484 |
+ 0.517 |
+ 0.545 |
+ 0.548 |
+ 0.009 |
+ 0.492 |
+ 0.495 |
+ 0.496 |
+ 0.124 |
+ 0.516 |
+ 0.550 |
+ -0.004 |
+ 0.013 |
+ 0.064 |
+ 0.507 |
+ 0.498 |
+ 0.501 |
+ 0.413 |
+ 0.467 |
+ 0.534 |
+ 0.520 |
+ 0.542 |
+ 0.433 |
+ 0.520 |
+ 0.513 |
+ 0.494 |
+ 0.504 |
+ 0.538 |
+ 0.490 |
+ 0.400 |
+ 0.076 |
+ 0.470 |
+ 0.525 |
+ 0.523 |
+ 0.518 |
+ 0.548 |
+ 0.537 |
+ 0.557 |
+ 0.576 |
+ 0.573 |
+ 0.006 |
+ 0.000 |
+ 0.339 |
+ 0.017 |
+ 0.430 |
+ 0.487 |
+ 0.462 |
+ 0.174 |
+ 0.489 |
+ 0.474 |
+ 0.529 |
+ 0.522 |
+ 0.526 |
+ 0.524 |
+ 0.535 |
+ 0.516 |
+ 0.545 |
+ 0.537 |
+ 0.199 |
+ 0.152 |
+ 10715 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A0A2Z5U3Z0_9INFA_Wu_2014 |
+ 0.480 |
+ 0.506 |
+ 0.454 |
+ 0.478 |
+ 0.506 |
+ 0.521 |
+ 0.021 |
+ 0.436 |
+ 0.495 |
+ 0.509 |
+ 0.155 |
+ 0.461 |
+ 0.509 |
+ 0.048 |
+ 0.073 |
+ 0.073 |
+ 0.464 |
+ 0.481 |
+ 0.485 |
+ 0.458 |
+ 0.450 |
+ 0.503 |
+ 0.484 |
+ 0.511 |
+ 0.345 |
+ 0.460 |
+ 0.482 |
+ 0.447 |
+ 0.475 |
+ 0.513 |
+ 0.442 |
+ 0.375 |
+ 0.085 |
+ 0.442 |
+ 0.493 |
+ 0.519 |
+ 0.506 |
+ 0.534 |
+ 0.541 |
+ 0.541 |
+ 0.556 |
+ 0.558 |
+ 0.052 |
+ 0.051 |
+ 0.253 |
+ 0.069 |
+ 0.355 |
+ 0.380 |
+ 0.362 |
+ 0.147 |
+ 0.432 |
+ 0.429 |
+ 0.455 |
+ 0.451 |
+ 0.454 |
+ 0.461 |
+ 0.468 |
+ 0.449 |
+ 0.462 |
+ 0.468 |
+ 0.229 |
+ 0.197 |
+ 2350 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A4_HUMAN_Seuma_2022 |
+ 0.394 |
+ 0.385 |
+ 0.449 |
+ 0.425 |
+ 0.312 |
+ 0.316 |
+ 0.345 |
+ 0.167 |
+ 0.337 |
+ 0.351 |
+ 0.302 |
+ 0.316 |
+ 0.404 |
+ 0.382 |
+ 0.366 |
+ 0.372 |
+ 0.431 |
+ 0.480 |
+ 0.352 |
+ 0.261 |
+ 0.327 |
+ 0.281 |
+ 0.318 |
+ 0.306 |
+ 0.406 |
+ 0.297 |
+ 0.320 |
+ 0.313 |
+ 0.314 |
+ 0.478 |
+ 0.251 |
+ 0.186 |
+ 0.487 |
+ 0.393 |
+ 0.273 |
+ 0.370 |
+ 0.441 |
+ 0.372 |
+ 0.458 |
+ 0.436 |
+ 0.400 |
+ 0.416 |
+ 0.366 |
+ 0.362 |
+ 0.421 |
+ 0.356 |
+ 0.279 |
+ 0.358 |
+ -0.191 |
+ 0.060 |
+ 0.449 |
+ 0.440 |
+ 0.417 |
+ 0.410 |
+ 0.406 |
+ 0.380 |
+ 0.423 |
+ 0.346 |
+ 0.372 |
+ 0.417 |
+ 0.411 |
+ 0.388 |
+ 14811 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ A4D664_9INFA_Soh_2019 |
+ 0.411 |
+ 0.340 |
+ 0.406 |
+ 0.403 |
+ 0.409 |
+ 0.408 |
+ 0.029 |
+ 0.364 |
+ 0.297 |
+ 0.287 |
+ 0.044 |
+ 0.026 |
+ 0.033 |
+ 0.022 |
+ 0.020 |
+ 0.027 |
+ 0.149 |
+ 0.211 |
+ 0.270 |
+ 0.260 |
+ 0.329 |
+ 0.386 |
+ 0.404 |
+ 0.398 |
+ 0.064 |
+ 0.260 |
+ 0.285 |
+ 0.244 |
+ 0.330 |
+ 0.443 |
+ 0.318 |
+ 0.296 |
+ 0.055 |
+ 0.348 |
+ 0.371 |
+ 0.404 |
+ 0.366 |
+ 0.380 |
+ 0.393 |
+ 0.434 |
+ 0.439 |
+ 0.461 |
+ 0.019 |
+ -0.000 |
+ 0.080 |
+ -0.001 |
+ 0.262 |
+ 0.254 |
+ 0.126 |
+ 0.105 |
+ 0.183 |
+ 0.180 |
+ 0.191 |
+ 0.216 |
+ 0.209 |
+ 0.207 |
+ 0.200 |
+ 0.198 |
+ 0.193 |
+ 0.204 |
+ 0.148 |
+ 0.092 |
+ 14421 |
+ OrganismalFitness |
+ A4D664_9INFA |
+ Medium |
+ Virus |
+
+
+ A4GRB6_PSEAI_Chen_2020 |
+ 0.317 |
+ 0.492 |
+ 0.657 |
+ 0.664 |
+ 0.623 |
+ 0.641 |
+ 0.344 |
+ 0.529 |
+ 0.641 |
+ 0.681 |
+ 0.680 |
+ 0.647 |
+ 0.668 |
+ 0.431 |
+ 0.528 |
+ 0.655 |
+ 0.738 |
+ 0.712 |
+ 0.627 |
+ 0.567 |
+ 0.409 |
+ 0.533 |
+ 0.564 |
+ 0.624 |
+ 0.526 |
+ 0.622 |
+ 0.639 |
+ 0.638 |
+ 0.705 |
+ 0.671 |
+ 0.742 |
+ 0.673 |
+ 0.242 |
+ 0.423 |
+ 0.552 |
+ 0.629 |
+ 0.557 |
+ 0.588 |
+ 0.652 |
+ 0.653 |
+ 0.660 |
+ 0.679 |
+ 0.481 |
+ 0.071 |
+ 0.671 |
+ 0.601 |
+ 0.664 |
+ 0.713 |
+ 0.629 |
+ 0.422 |
+ 0.728 |
+ 0.718 |
+ 0.711 |
+ 0.719 |
+ 0.729 |
+ 0.733 |
+ 0.728 |
+ 0.733 |
+ 0.735 |
+ 0.745 |
+ 0.722 |
+ 0.513 |
+ 5004 |
+ OrganismalFitness |
+ A4GRB6_PSEAI |
+ High |
+ Prokaryote |
+
+
+ AACC1_PSEAI_Dandage_2018 |
+ 0.282 |
+ 0.474 |
+ 0.331 |
+ 0.414 |
+ 0.487 |
+ 0.491 |
+ 0.196 |
+ 0.191 |
+ 0.501 |
+ 0.504 |
+ 0.413 |
+ 0.483 |
+ 0.489 |
+ 0.196 |
+ 0.248 |
+ 0.256 |
+ 0.490 |
+ 0.511 |
+ 0.518 |
+ 0.391 |
+ 0.206 |
+ 0.232 |
+ 0.276 |
+ 0.306 |
+ 0.273 |
+ 0.428 |
+ 0.407 |
+ 0.413 |
+ 0.438 |
+ 0.465 |
+ 0.507 |
+ 0.455 |
+ 0.021 |
+ 0.259 |
+ 0.246 |
+ 0.417 |
+ 0.427 |
+ 0.418 |
+ 0.466 |
+ 0.471 |
+ 0.466 |
+ 0.495 |
+ 0.216 |
+ 0.170 |
+ 0.354 |
+ 0.243 |
+ 0.257 |
+ 0.374 |
+ 0.357 |
+ 0.183 |
+ 0.481 |
+ 0.492 |
+ 0.483 |
+ 0.488 |
+ 0.495 |
+ 0.486 |
+ 0.486 |
+ 0.477 |
+ 0.494 |
+ 0.500 |
+ 0.461 |
+ 0.282 |
+ 1801 |
+ OrganismalFitness |
+ AACC1_PSEAI |
+ High |
+ Prokaryote |
+
+
+ ACE2_HUMAN_Chan_2020 |
+ 0.254 |
+ 0.237 |
+ 0.187 |
+ 0.200 |
+ 0.249 |
+ 0.242 |
+ -0.049 |
+ 0.077 |
+ 0.236 |
+ 0.251 |
+ 0.244 |
+ 0.173 |
+ 0.213 |
+ -0.053 |
+ 0.062 |
+ 0.216 |
+ 0.225 |
+ 0.175 |
+ 0.178 |
+ 0.227 |
+ 0.033 |
+ 0.129 |
+ 0.175 |
+ 0.207 |
+ 0.042 |
+ 0.162 |
+ 0.207 |
+ 0.208 |
+ 0.261 |
+ 0.217 |
+ 0.164 |
+ 0.138 |
+ 0.080 |
+ 0.044 |
+ 0.120 |
+ 0.134 |
+ 0.217 |
+ 0.190 |
+ 0.196 |
+ 0.228 |
+ 0.200 |
+ 0.205 |
+ 0.005 |
+ 0.005 |
+ 0.244 |
+ 0.078 |
+ 0.336 |
+ 0.292 |
+ 0.317 |
+ 0.106 |
+ 0.217 |
+ 0.255 |
+ 0.226 |
+ 0.222 |
+ 0.236 |
+ 0.249 |
+ 0.223 |
+ 0.221 |
+ 0.214 |
+ 0.234 |
+ 0.289 |
+ 0.168 |
+ 2223 |
+ Binding |
+ ACE2_HUMAN |
+ Medium |
+ Human |
+
+
+ ADRB2_HUMAN_Jones_2020 |
+ 0.331 |
+ 0.423 |
+ 0.506 |
+ 0.514 |
+ 0.497 |
+ 0.517 |
+ 0.463 |
+ 0.500 |
+ 0.531 |
+ 0.530 |
+ 0.534 |
+ 0.526 |
+ 0.539 |
+ 0.408 |
+ 0.444 |
+ 0.474 |
+ 0.493 |
+ 0.505 |
+ 0.515 |
+ 0.521 |
+ 0.509 |
+ 0.524 |
+ 0.517 |
+ 0.493 |
+ 0.524 |
+ 0.531 |
+ 0.538 |
+ 0.533 |
+ 0.502 |
+ 0.545 |
+ 0.500 |
+ 0.414 |
+ 0.298 |
+ 0.528 |
+ 0.530 |
+ 0.506 |
+ 0.528 |
+ 0.541 |
+ 0.533 |
+ 0.533 |
+ 0.538 |
+ 0.537 |
+ 0.471 |
+ 0.181 |
+ 0.528 |
+ 0.510 |
+ 0.419 |
+ 0.477 |
+ 0.451 |
+ 0.223 |
+ 0.486 |
+ 0.486 |
+ 0.500 |
+ 0.508 |
+ 0.501 |
+ 0.498 |
+ 0.493 |
+ 0.506 |
+ 0.494 |
+ 0.509 |
+ 0.554 |
+ 0.516 |
+ 7800 |
+ Activity |
+ ADRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ AICDA_HUMAN_Gajula_2014_3cycles |
+ 0.138 |
+ 0.396 |
+ 0.474 |
+ 0.464 |
+ 0.429 |
+ 0.445 |
+ -0.287 |
+ 0.327 |
+ 0.320 |
+ 0.385 |
+ 0.415 |
+ 0.376 |
+ 0.478 |
+ -0.245 |
+ -0.264 |
+ 0.196 |
+ 0.328 |
+ 0.347 |
+ 0.262 |
+ 0.479 |
+ -0.219 |
+ 0.140 |
+ 0.388 |
+ 0.395 |
+ 0.013 |
+ 0.321 |
+ 0.417 |
+ 0.413 |
+ 0.282 |
+ 0.376 |
+ 0.450 |
+ 0.405 |
+ 0.057 |
+ -0.039 |
+ 0.057 |
+ 0.339 |
+ 0.206 |
+ 0.218 |
+ 0.367 |
+ 0.399 |
+ 0.402 |
+ 0.444 |
+ -0.228 |
+ -0.238 |
+ 0.379 |
+ 0.258 |
+ 0.328 |
+ 0.401 |
+ 0.328 |
+ 0.243 |
+ 0.337 |
+ 0.404 |
+ 0.344 |
+ 0.447 |
+ 0.359 |
+ 0.365 |
+ 0.341 |
+ 0.343 |
+ 0.312 |
+ 0.359 |
+ 0.257 |
+ -0.036 |
+ 209 |
+ Activity |
+ AICDA_HUMAN |
+ Medium |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O |
+ 0.358 |
+ 0.418 |
+ 0.419 |
+ 0.407 |
+ 0.386 |
+ 0.398 |
+ -0.301 |
+ 0.130 |
+ 0.220 |
+ 0.240 |
+ 0.374 |
+ 0.070 |
+ 0.423 |
+ -0.209 |
+ 0.286 |
+ 0.321 |
+ 0.261 |
+ 0.297 |
+ 0.260 |
+ 0.062 |
+ -0.052 |
+ -0.218 |
+ -0.051 |
+ -0.038 |
+ -0.317 |
+ -0.089 |
+ -0.050 |
+ -0.121 |
+ 0.056 |
+ 0.396 |
+ 0.254 |
+ 0.228 |
+ 0.090 |
+ -0.287 |
+ -0.139 |
+ -0.275 |
+ 0.392 |
+ 0.402 |
+ 0.356 |
+ 0.424 |
+ 0.422 |
+ 0.374 |
+ -0.185 |
+ -0.194 |
+ 0.165 |
+ 0.114 |
+ 0.251 |
+ 0.112 |
+ 0.433 |
+ 0.353 |
+ 0.446 |
+ 0.326 |
+ 0.204 |
+ 0.349 |
+ 0.342 |
+ 0.349 |
+ 0.343 |
+ 0.339 |
+ 0.361 |
+ 0.343 |
+ 0.347 |
+ 0.397 |
+ 2972 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ AMIE_PSEAE_Wrenbeck_2017 |
+ 0.246 |
+ 0.443 |
+ 0.496 |
+ 0.506 |
+ 0.477 |
+ 0.464 |
+ 0.093 |
+ 0.436 |
+ 0.401 |
+ 0.605 |
+ 0.570 |
+ 0.613 |
+ 0.662 |
+ 0.326 |
+ 0.408 |
+ 0.428 |
+ 0.557 |
+ 0.674 |
+ 0.613 |
+ 0.506 |
+ 0.555 |
+ 0.525 |
+ 0.532 |
+ 0.542 |
+ 0.566 |
+ 0.561 |
+ 0.534 |
+ 0.559 |
+ 0.508 |
+ 0.558 |
+ 0.583 |
+ 0.528 |
+ 0.265 |
+ 0.601 |
+ 0.470 |
+ 0.405 |
+ 0.585 |
+ 0.479 |
+ 0.438 |
+ 0.580 |
+ 0.506 |
+ 0.481 |
+ 0.356 |
+ 0.096 |
+ 0.483 |
+ 0.412 |
+ 0.368 |
+ 0.469 |
+ 0.407 |
+ 0.213 |
+ 0.555 |
+ 0.519 |
+ 0.530 |
+ 0.537 |
+ 0.529 |
+ 0.541 |
+ 0.523 |
+ 0.540 |
+ 0.564 |
+ 0.556 |
+ 0.607 |
+ 0.410 |
+ 6227 |
+ Activity |
+ AMIE_PSEAE |
+ High |
+ Prokaryote |
+
+
+ ANCSZ_Hobbs_2022 |
+ 0.562 |
+ 0.521 |
+ 0.445 |
+ 0.472 |
+ 0.524 |
+ 0.528 |
+ 0.525 |
+ 0.452 |
+ 0.503 |
+ 0.524 |
+ 0.527 |
+ 0.522 |
+ 0.543 |
+ 0.535 |
+ 0.592 |
+ 0.598 |
+ 0.609 |
+ 0.598 |
+ 0.570 |
+ 0.021 |
+ 0.462 |
+ 0.493 |
+ 0.509 |
+ 0.500 |
+ 0.497 |
+ 0.519 |
+ 0.513 |
+ 0.469 |
+ 0.494 |
+ 0.557 |
+ 0.542 |
+ 0.525 |
+ 0.318 |
+ 0.483 |
+ 0.503 |
+ 0.475 |
+ 0.537 |
+ 0.549 |
+ 0.538 |
+ 0.545 |
+ 0.556 |
+ 0.543 |
+ 0.549 |
+ 0.420 |
+ 0.496 |
+ 0.523 |
+ 0.471 |
+ 0.458 |
+ 0.422 |
+ 0.165 |
+ 0.566 |
+ 0.555 |
+ 0.572 |
+ 0.578 |
+ 0.572 |
+ 0.584 |
+ 0.569 |
+ 0.583 |
+ 0.577 |
+ 0.589 |
+ 0.585 |
+ 0.601 |
+ 4670 |
+ Activity |
+ ANCSZ |
+ Medium |
+ Eukaryote |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY |
+ 0.277 |
+ 0.413 |
+ 0.411 |
+ 0.430 |
+ 0.431 |
+ 0.423 |
+ 0.306 |
+ 0.386 |
+ 0.430 |
+ 0.463 |
+ 0.409 |
+ 0.296 |
+ 0.405 |
+ 0.332 |
+ 0.297 |
+ 0.511 |
+ 0.497 |
+ 0.499 |
+ 0.435 |
+ 0.434 |
+ 0.372 |
+ 0.439 |
+ 0.448 |
+ 0.403 |
+ 0.362 |
+ 0.437 |
+ 0.473 |
+ 0.461 |
+ 0.418 |
+ 0.460 |
+ 0.409 |
+ 0.310 |
+ 0.073 |
+ 0.394 |
+ 0.396 |
+ 0.430 |
+ 0.405 |
+ 0.415 |
+ 0.439 |
+ 0.465 |
+ 0.460 |
+ 0.471 |
+ 0.285 |
+ 0.267 |
+ 0.450 |
+ 0.408 |
+ 0.725 |
+ 0.651 |
+ 0.738 |
+ 0.636 |
+ 0.513 |
+ 0.532 |
+ 0.527 |
+ 0.529 |
+ 0.517 |
+ 0.508 |
+ 0.503 |
+ 0.526 |
+ 0.494 |
+ 0.533 |
+ 0.604 |
+ 0.511 |
+ 1287 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B2L11_HUMAN_Dutta_2010_binding-Mcl-1 |
+ 0.688 |
+ 0.270 |
+ 0.668 |
+ 0.661 |
+ 0.530 |
+ 0.646 |
+ 0.216 |
+ 0.529 |
+ 0.188 |
+ 0.164 |
+ 0.206 |
+ -0.019 |
+ 0.396 |
+ 0.174 |
+ 0.123 |
+ 0.151 |
+ 0.270 |
+ 0.367 |
+ 0.315 |
+ 0.336 |
+ 0.239 |
+ 0.076 |
+ 0.304 |
+ 0.568 |
+ 0.067 |
+ 0.251 |
+ 0.362 |
+ 0.372 |
+ 0.366 |
+ 0.734 |
+ 0.358 |
+ 0.338 |
+ 0.295 |
+ 0.266 |
+ 0.213 |
+ 0.341 |
+ 0.544 |
+ 0.469 |
+ 0.388 |
+ 0.615 |
+ 0.570 |
+ 0.402 |
+ 0.146 |
+ 0.138 |
+ 0.207 |
+ 0.170 |
+ 0.287 |
+ 0.307 |
+ 0.432 |
+ -0.005 |
+ 0.352 |
+ 0.367 |
+ 0.419 |
+ 0.283 |
+ 0.255 |
+ 0.470 |
+ 0.295 |
+ 0.288 |
+ 0.421 |
+ 0.374 |
+ 0.242 |
+ 0.204 |
+ 170 |
+ Binding |
+ B2L11_HUMAN |
+ Low |
+ Human |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0 |
+ 0.216 |
+ 0.332 |
+ 0.395 |
+ 0.399 |
+ 0.394 |
+ 0.403 |
+ 0.212 |
+ 0.256 |
+ 0.438 |
+ 0.454 |
+ 0.425 |
+ 0.425 |
+ 0.451 |
+ 0.331 |
+ 0.421 |
+ 0.423 |
+ 0.479 |
+ 0.458 |
+ 0.499 |
+ 0.332 |
+ 0.335 |
+ 0.346 |
+ 0.376 |
+ 0.343 |
+ 0.144 |
+ 0.391 |
+ 0.403 |
+ 0.382 |
+ 0.307 |
+ 0.338 |
+ 0.345 |
+ 0.314 |
+ 0.273 |
+ 0.354 |
+ 0.261 |
+ 0.299 |
+ 0.392 |
+ 0.338 |
+ 0.365 |
+ 0.421 |
+ 0.386 |
+ 0.405 |
+ 0.210 |
+ 0.113 |
+ 0.314 |
+ 0.242 |
+ 0.558 |
+ 0.451 |
+ 0.644 |
+ 0.566 |
+ 0.434 |
+ 0.444 |
+ 0.457 |
+ 0.472 |
+ 0.463 |
+ 0.463 |
+ 0.460 |
+ 0.458 |
+ 0.471 |
+ 0.461 |
+ 0.540 |
+ 0.579 |
+ 2069 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU |
+ 0.364 |
+ 0.418 |
+ 0.357 |
+ 0.364 |
+ 0.365 |
+ 0.382 |
+ 0.100 |
+ 0.248 |
+ 0.383 |
+ 0.501 |
+ 0.394 |
+ 0.329 |
+ 0.406 |
+ 0.166 |
+ 0.182 |
+ 0.301 |
+ 0.510 |
+ 0.521 |
+ 0.421 |
+ 0.407 |
+ 0.122 |
+ 0.362 |
+ 0.220 |
+ 0.376 |
+ 0.207 |
+ 0.421 |
+ 0.281 |
+ 0.402 |
+ 0.435 |
+ 0.607 |
+ 0.549 |
+ 0.496 |
+ 0.070 |
+ 0.245 |
+ 0.294 |
+ 0.431 |
+ 0.440 |
+ 0.440 |
+ 0.483 |
+ 0.392 |
+ 0.396 |
+ 0.436 |
+ 0.116 |
+ 0.109 |
+ 0.205 |
+ 0.175 |
+ 0.579 |
+ 0.537 |
+ 0.683 |
+ 0.564 |
+ 0.549 |
+ 0.554 |
+ 0.563 |
+ 0.579 |
+ 0.561 |
+ 0.581 |
+ 0.529 |
+ 0.556 |
+ 0.543 |
+ 0.562 |
+ 0.516 |
+ 0.296 |
+ 1572 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Deng_2012 |
+ 0.322 |
+ 0.504 |
+ 0.508 |
+ 0.521 |
+ 0.507 |
+ 0.508 |
+ 0.075 |
+ 0.312 |
+ 0.513 |
+ 0.550 |
+ 0.517 |
+ 0.501 |
+ 0.518 |
+ 0.243 |
+ 0.354 |
+ 0.461 |
+ 0.528 |
+ 0.461 |
+ 0.352 |
+ 0.455 |
+ 0.375 |
+ 0.378 |
+ 0.400 |
+ 0.383 |
+ 0.391 |
+ 0.445 |
+ 0.453 |
+ 0.406 |
+ 0.343 |
+ 0.464 |
+ 0.519 |
+ 0.487 |
+ 0.107 |
+ 0.392 |
+ 0.402 |
+ 0.373 |
+ 0.457 |
+ 0.471 |
+ 0.473 |
+ 0.513 |
+ 0.526 |
+ 0.522 |
+ 0.330 |
+ 0.011 |
+ 0.480 |
+ 0.406 |
+ 0.423 |
+ 0.496 |
+ 0.460 |
+ 0.252 |
+ 0.530 |
+ 0.535 |
+ 0.538 |
+ 0.552 |
+ 0.548 |
+ 0.542 |
+ 0.544 |
+ 0.530 |
+ 0.541 |
+ 0.555 |
+ 0.539 |
+ 0.406 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Firnberg_2014 |
+ 0.471 |
+ 0.708 |
+ 0.732 |
+ 0.747 |
+ 0.709 |
+ 0.729 |
+ 0.134 |
+ 0.459 |
+ 0.701 |
+ 0.741 |
+ 0.703 |
+ 0.665 |
+ 0.707 |
+ 0.411 |
+ 0.555 |
+ 0.658 |
+ 0.737 |
+ 0.573 |
+ 0.424 |
+ 0.662 |
+ 0.566 |
+ 0.563 |
+ 0.537 |
+ 0.531 |
+ 0.595 |
+ 0.646 |
+ 0.654 |
+ 0.577 |
+ 0.452 |
+ 0.683 |
+ 0.758 |
+ 0.752 |
+ 0.186 |
+ 0.551 |
+ 0.529 |
+ 0.480 |
+ 0.646 |
+ 0.638 |
+ 0.637 |
+ 0.731 |
+ 0.734 |
+ 0.736 |
+ 0.540 |
+ 0.046 |
+ 0.700 |
+ 0.595 |
+ 0.616 |
+ 0.703 |
+ 0.696 |
+ 0.321 |
+ 0.718 |
+ 0.699 |
+ 0.703 |
+ 0.729 |
+ 0.739 |
+ 0.730 |
+ 0.736 |
+ 0.720 |
+ 0.729 |
+ 0.745 |
+ 0.754 |
+ 0.605 |
+ 4783 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Jacquier_2013 |
+ 0.456 |
+ 0.689 |
+ 0.698 |
+ 0.720 |
+ 0.703 |
+ 0.723 |
+ 0.113 |
+ 0.441 |
+ 0.665 |
+ 0.712 |
+ 0.668 |
+ 0.656 |
+ 0.684 |
+ 0.362 |
+ 0.520 |
+ 0.626 |
+ 0.704 |
+ 0.588 |
+ 0.503 |
+ 0.635 |
+ 0.564 |
+ 0.537 |
+ 0.542 |
+ 0.528 |
+ 0.573 |
+ 0.632 |
+ 0.625 |
+ 0.578 |
+ 0.462 |
+ 0.567 |
+ 0.719 |
+ 0.691 |
+ 0.143 |
+ 0.540 |
+ 0.549 |
+ 0.508 |
+ 0.628 |
+ 0.642 |
+ 0.647 |
+ 0.716 |
+ 0.728 |
+ 0.734 |
+ 0.482 |
+ 0.040 |
+ 0.663 |
+ 0.562 |
+ 0.527 |
+ 0.640 |
+ 0.603 |
+ 0.290 |
+ 0.677 |
+ 0.671 |
+ 0.672 |
+ 0.691 |
+ 0.686 |
+ 0.686 |
+ 0.702 |
+ 0.677 |
+ 0.684 |
+ 0.703 |
+ 0.709 |
+ 0.573 |
+ 989 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Stiffler_2015 |
+ 0.466 |
+ 0.707 |
+ 0.732 |
+ 0.743 |
+ 0.711 |
+ 0.727 |
+ 0.115 |
+ 0.459 |
+ 0.688 |
+ 0.731 |
+ 0.698 |
+ 0.668 |
+ 0.707 |
+ 0.399 |
+ 0.557 |
+ 0.661 |
+ 0.731 |
+ 0.589 |
+ 0.430 |
+ 0.655 |
+ 0.568 |
+ 0.570 |
+ 0.528 |
+ 0.525 |
+ 0.597 |
+ 0.651 |
+ 0.661 |
+ 0.577 |
+ 0.460 |
+ 0.679 |
+ 0.767 |
+ 0.761 |
+ 0.158 |
+ 0.558 |
+ 0.522 |
+ 0.475 |
+ 0.645 |
+ 0.630 |
+ 0.629 |
+ 0.730 |
+ 0.731 |
+ 0.733 |
+ 0.532 |
+ 0.030 |
+ 0.703 |
+ 0.598 |
+ 0.604 |
+ 0.693 |
+ 0.682 |
+ 0.324 |
+ 0.715 |
+ 0.692 |
+ 0.698 |
+ 0.720 |
+ 0.725 |
+ 0.724 |
+ 0.729 |
+ 0.709 |
+ 0.724 |
+ 0.737 |
+ 0.748 |
+ 0.593 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BRCA1_HUMAN_Findlay_2018 |
+ 0.501 |
+ 0.385 |
+ 0.441 |
+ 0.432 |
+ 0.424 |
+ 0.466 |
+ 0.125 |
+ 0.189 |
+ 0.387 |
+ 0.407 |
+ 0.494 |
+ 0.404 |
+ 0.447 |
+ 0.134 |
+ 0.392 |
+ 0.455 |
+ 0.515 |
+ 0.497 |
+ 0.394 |
+ 0.296 |
+ 0.145 |
+ 0.379 |
+ 0.405 |
+ 0.361 |
+ 0.305 |
+ 0.473 |
+ 0.457 |
+ 0.442 |
+ 0.423 |
+ 0.438 |
+ 0.539 |
+ 0.482 |
+ 0.220 |
+ 0.151 |
+ 0.179 |
+ 0.409 |
+ 0.456 |
+ 0.456 |
+ 0.514 |
+ 0.475 |
+ 0.476 |
+ 0.533 |
+ 0.229 |
+ 0.132 |
+ 0.534 |
+ 0.457 |
+ 0.440 |
+ 0.469 |
+ 0.116 |
+ 0.120 |
+ 0.515 |
+ 0.501 |
+ 0.506 |
+ 0.502 |
+ 0.512 |
+ 0.504 |
+ 0.513 |
+ 0.512 |
+ 0.504 |
+ 0.519 |
+ 0.539 |
+ 0.481 |
+ 1837 |
+ OrganismalFitness |
+ BRCA1_HUMAN |
+ Low |
+ Human |
+
+
+ BRCA2_HUMAN_Erwood_2022_HEK293T |
+ 0.481 |
+ 0.346 |
+ 0.440 |
+ 0.445 |
+ 0.447 |
+ 0.481 |
+ 0.095 |
+ 0.477 |
+ 0.095 |
+ -0.030 |
+ 0.480 |
+ 0.183 |
+ 0.103 |
+ 0.088 |
+ 0.103 |
+ 0.416 |
+ 0.510 |
+ 0.464 |
+ 0.473 |
+ 0.033 |
+ 0.090 |
+ 0.494 |
+ 0.534 |
+ 0.482 |
+ 0.533 |
+ 0.483 |
+ 0.496 |
+ 0.499 |
+ 0.014 |
+ 0.400 |
+ 0.297 |
+ 0.328 |
+ 0.140 |
+ 0.119 |
+ 0.185 |
+ 0.138 |
+ 0.420 |
+ 0.425 |
+ 0.426 |
+ 0.419 |
+ 0.415 |
+ 0.414 |
+ 0.163 |
+ 0.093 |
+ 0.497 |
+ 0.047 |
+ 0.052 |
+ 0.053 |
+ -0.091 |
+ 0.285 |
+ 0.436 |
+ 0.477 |
+ 0.473 |
+ 0.424 |
+ 0.455 |
+ 0.436 |
+ 0.456 |
+ 0.438 |
+ 0.453 |
+ 0.465 |
+ 0.000 |
+ 0.016 |
+ 265 |
+ OrganismalFitness |
+ BRCA2_HUMAN |
+ NaN |
+ Human |
+
+
+ C6KNH7_9INFA_Lee_2018 |
+ 0.393 |
+ 0.371 |
+ 0.351 |
+ 0.357 |
+ 0.433 |
+ 0.436 |
+ -0.025 |
+ 0.451 |
+ 0.374 |
+ 0.378 |
+ 0.060 |
+ 0.428 |
+ 0.492 |
+ -0.014 |
+ -0.022 |
+ -0.007 |
+ 0.483 |
+ 0.405 |
+ 0.426 |
+ 0.343 |
+ 0.396 |
+ 0.363 |
+ 0.378 |
+ 0.349 |
+ 0.254 |
+ 0.454 |
+ 0.428 |
+ 0.465 |
+ 0.370 |
+ 0.475 |
+ 0.457 |
+ 0.360 |
+ 0.110 |
+ 0.354 |
+ 0.374 |
+ 0.392 |
+ 0.412 |
+ 0.419 |
+ 0.433 |
+ 0.439 |
+ 0.441 |
+ 0.452 |
+ -0.024 |
+ -0.033 |
+ 0.294 |
+ -0.002 |
+ 0.513 |
+ 0.539 |
+ 0.544 |
+ 0.227 |
+ 0.498 |
+ 0.497 |
+ 0.519 |
+ 0.520 |
+ 0.531 |
+ 0.510 |
+ 0.528 |
+ 0.519 |
+ 0.521 |
+ 0.529 |
+ 0.313 |
+ 0.185 |
+ 10754 |
+ OrganismalFitness |
+ C6KNH7_9INFA |
+ Medium |
+ Virus |
+
+
+ CALM1_HUMAN_Weile_2017 |
+ 0.175 |
+ 0.233 |
+ 0.238 |
+ 0.233 |
+ 0.238 |
+ 0.236 |
+ 0.174 |
+ 0.187 |
+ 0.236 |
+ 0.253 |
+ 0.250 |
+ 0.235 |
+ 0.264 |
+ 0.161 |
+ 0.199 |
+ 0.195 |
+ 0.212 |
+ 0.218 |
+ 0.225 |
+ 0.229 |
+ 0.173 |
+ 0.227 |
+ 0.242 |
+ 0.254 |
+ 0.232 |
+ 0.276 |
+ 0.279 |
+ 0.300 |
+ 0.313 |
+ 0.239 |
+ 0.210 |
+ 0.146 |
+ 0.087 |
+ 0.216 |
+ 0.253 |
+ 0.291 |
+ 0.218 |
+ 0.242 |
+ 0.266 |
+ 0.238 |
+ 0.244 |
+ 0.246 |
+ 0.213 |
+ 0.156 |
+ 0.268 |
+ 0.251 |
+ 0.106 |
+ 0.143 |
+ 0.164 |
+ 0.088 |
+ 0.177 |
+ 0.182 |
+ 0.188 |
+ 0.171 |
+ 0.189 |
+ 0.176 |
+ 0.180 |
+ 0.192 |
+ 0.178 |
+ 0.188 |
+ 0.287 |
+ 0.238 |
+ 1813 |
+ OrganismalFitness |
+ CALM1_HUMAN |
+ High |
+ Human |
+
+
+ CAPSD_AAV2S_Sinai_2021 |
+ 0.407 |
+ 0.345 |
+ 0.317 |
+ 0.372 |
+ 0.346 |
+ 0.330 |
+ 0.374 |
+ 0.424 |
+ 0.329 |
+ 0.363 |
+ 0.183 |
+ 0.196 |
+ 0.199 |
+ 0.254 |
+ 0.286 |
+ 0.203 |
+ 0.277 |
+ 0.183 |
+ 0.124 |
+ 0.258 |
+ 0.191 |
+ 0.250 |
+ 0.269 |
+ 0.279 |
+ 0.203 |
+ 0.198 |
+ 0.266 |
+ 0.204 |
+ 0.399 |
+ 0.445 |
+ 0.183 |
+ 0.177 |
+ 0.132 |
+ 0.202 |
+ 0.262 |
+ 0.492 |
+ 0.380 |
+ 0.386 |
+ 0.473 |
+ 0.338 |
+ 0.343 |
+ 0.426 |
+ 0.092 |
+ 0.160 |
+ 0.260 |
+ 0.112 |
+ 0.402 |
+ 0.393 |
+ 0.347 |
+ 0.319 |
+ 0.235 |
+ 0.225 |
+ 0.216 |
+ 0.235 |
+ 0.214 |
+ 0.215 |
+ 0.220 |
+ 0.208 |
+ 0.228 |
+ 0.223 |
+ 0.301 |
+ 0.222 |
+ 42328 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAR11_HUMAN_Meitlis_2020_gof |
+ 0.265 |
+ 0.174 |
+ 0.164 |
+ 0.150 |
+ 0.169 |
+ 0.184 |
+ 0.100 |
+ 0.009 |
+ 0.218 |
+ 0.245 |
+ 0.304 |
+ 0.324 |
+ 0.302 |
+ 0.068 |
+ 0.095 |
+ 0.337 |
+ 0.326 |
+ 0.345 |
+ 0.357 |
+ 0.165 |
+ 0.151 |
+ 0.198 |
+ 0.219 |
+ 0.216 |
+ 0.037 |
+ 0.215 |
+ 0.229 |
+ 0.226 |
+ 0.167 |
+ 0.179 |
+ 0.326 |
+ 0.296 |
+ 0.189 |
+ 0.005 |
+ 0.148 |
+ 0.124 |
+ 0.203 |
+ 0.208 |
+ 0.182 |
+ 0.204 |
+ 0.201 |
+ 0.182 |
+ 0.089 |
+ 0.043 |
+ 0.261 |
+ 0.215 |
+ 0.318 |
+ 0.294 |
+ 0.335 |
+ 0.142 |
+ 0.343 |
+ 0.337 |
+ 0.335 |
+ 0.323 |
+ 0.330 |
+ 0.331 |
+ 0.317 |
+ 0.344 |
+ 0.346 |
+ 0.342 |
+ 0.408 |
+ 0.333 |
+ 2374 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAR11_HUMAN_Meitlis_2020_lof |
+ 0.421 |
+ 0.313 |
+ 0.277 |
+ 0.258 |
+ 0.310 |
+ 0.327 |
+ 0.108 |
+ -0.014 |
+ 0.299 |
+ 0.293 |
+ 0.446 |
+ 0.490 |
+ 0.461 |
+ 0.072 |
+ 0.115 |
+ 0.462 |
+ 0.501 |
+ 0.533 |
+ 0.540 |
+ 0.239 |
+ 0.141 |
+ 0.312 |
+ 0.300 |
+ 0.277 |
+ 0.107 |
+ 0.303 |
+ 0.306 |
+ 0.358 |
+ 0.191 |
+ 0.338 |
+ 0.433 |
+ 0.378 |
+ 0.334 |
+ -0.003 |
+ 0.264 |
+ 0.195 |
+ 0.290 |
+ 0.325 |
+ 0.271 |
+ 0.320 |
+ 0.326 |
+ 0.281 |
+ 0.087 |
+ 0.041 |
+ 0.417 |
+ 0.311 |
+ 0.417 |
+ 0.445 |
+ 0.451 |
+ 0.175 |
+ 0.518 |
+ 0.521 |
+ 0.515 |
+ 0.508 |
+ 0.500 |
+ 0.518 |
+ 0.503 |
+ 0.512 |
+ 0.526 |
+ 0.525 |
+ 0.579 |
+ 0.425 |
+ 2395 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAS9_STRP1_Spencer_2017_positive |
+ 0.159 |
+ 0.179 |
+ 0.162 |
+ 0.166 |
+ 0.169 |
+ 0.177 |
+ 0.040 |
+ 0.100 |
+ 0.180 |
+ 0.183 |
+ 0.175 |
+ 0.069 |
+ 0.073 |
+ 0.063 |
+ 0.090 |
+ 0.154 |
+ 0.182 |
+ 0.188 |
+ 0.194 |
+ 0.006 |
+ 0.043 |
+ 0.081 |
+ 0.173 |
+ 0.120 |
+ 0.043 |
+ 0.194 |
+ 0.182 |
+ 0.179 |
+ 0.068 |
+ 0.187 |
+ 0.199 |
+ 0.179 |
+ 0.027 |
+ 0.035 |
+ 0.047 |
+ 0.166 |
+ 0.170 |
+ 0.172 |
+ 0.189 |
+ 0.184 |
+ 0.184 |
+ 0.194 |
+ 0.061 |
+ 0.028 |
+ 0.181 |
+ 0.106 |
+ 0.106 |
+ 0.181 |
+ 0.036 |
+ 0.037 |
+ 0.176 |
+ 0.177 |
+ 0.178 |
+ 0.185 |
+ 0.180 |
+ 0.181 |
+ 0.177 |
+ 0.178 |
+ 0.177 |
+ 0.183 |
+ 0.177 |
+ 0.114 |
+ 8117 |
+ Activity |
+ CAS9_STRP1 |
+ Medium |
+ Prokaryote |
+
+
+ CASP3_HUMAN_Roychowdhury_2020 |
+ 0.382 |
+ 0.543 |
+ 0.610 |
+ 0.622 |
+ 0.626 |
+ 0.632 |
+ 0.054 |
+ 0.478 |
+ 0.644 |
+ 0.653 |
+ 0.590 |
+ 0.595 |
+ 0.624 |
+ 0.264 |
+ 0.574 |
+ 0.639 |
+ 0.638 |
+ 0.554 |
+ 0.514 |
+ 0.603 |
+ 0.233 |
+ 0.512 |
+ 0.517 |
+ 0.532 |
+ 0.505 |
+ 0.556 |
+ 0.554 |
+ 0.526 |
+ 0.567 |
+ 0.608 |
+ 0.599 |
+ 0.542 |
+ 0.217 |
+ 0.092 |
+ 0.482 |
+ 0.520 |
+ 0.533 |
+ 0.562 |
+ 0.582 |
+ 0.628 |
+ 0.619 |
+ 0.628 |
+ 0.422 |
+ -0.006 |
+ 0.616 |
+ 0.563 |
+ 0.414 |
+ 0.558 |
+ 0.499 |
+ 0.263 |
+ 0.553 |
+ 0.571 |
+ 0.571 |
+ 0.570 |
+ 0.575 |
+ 0.567 |
+ 0.585 |
+ 0.571 |
+ 0.576 |
+ 0.590 |
+ 0.652 |
+ 0.513 |
+ 1567 |
+ Activity |
+ CASP3_HUMAN |
+ High |
+ Human |
+
+
+ CASP7_HUMAN_Roychowdhury_2020 |
+ 0.372 |
+ 0.516 |
+ 0.619 |
+ 0.617 |
+ 0.609 |
+ 0.613 |
+ 0.057 |
+ 0.470 |
+ 0.600 |
+ 0.627 |
+ 0.585 |
+ 0.611 |
+ 0.632 |
+ 0.306 |
+ 0.598 |
+ 0.632 |
+ 0.622 |
+ 0.572 |
+ 0.562 |
+ 0.611 |
+ 0.206 |
+ 0.532 |
+ 0.529 |
+ 0.558 |
+ 0.529 |
+ 0.579 |
+ 0.589 |
+ 0.563 |
+ 0.572 |
+ 0.642 |
+ 0.596 |
+ 0.538 |
+ 0.332 |
+ 0.103 |
+ 0.543 |
+ 0.521 |
+ 0.517 |
+ 0.594 |
+ 0.586 |
+ 0.599 |
+ 0.628 |
+ 0.628 |
+ 0.461 |
+ 0.019 |
+ 0.646 |
+ 0.596 |
+ 0.479 |
+ 0.612 |
+ 0.573 |
+ 0.329 |
+ 0.585 |
+ 0.570 |
+ 0.562 |
+ 0.587 |
+ 0.608 |
+ 0.602 |
+ 0.591 |
+ 0.578 |
+ 0.596 |
+ 0.605 |
+ 0.643 |
+ 0.533 |
+ 1680 |
+ Activity |
+ CASP7_HUMAN |
+ Medium |
+ Human |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI |
+ 0.533 |
+ 0.559 |
+ 0.599 |
+ 0.606 |
+ 0.627 |
+ 0.630 |
+ 0.680 |
+ 0.558 |
+ 0.565 |
+ 0.528 |
+ 0.614 |
+ 0.638 |
+ 0.657 |
+ 0.564 |
+ 0.689 |
+ 0.688 |
+ 0.658 |
+ 0.615 |
+ 0.644 |
+ 0.041 |
+ 0.621 |
+ 0.603 |
+ 0.619 |
+ 0.602 |
+ 0.641 |
+ 0.627 |
+ 0.593 |
+ 0.561 |
+ 0.587 |
+ 0.649 |
+ 0.611 |
+ 0.619 |
+ 0.439 |
+ 0.626 |
+ 0.611 |
+ 0.633 |
+ 0.649 |
+ 0.642 |
+ 0.665 |
+ 0.622 |
+ 0.620 |
+ 0.638 |
+ 0.440 |
+ 0.427 |
+ 0.315 |
+ 0.460 |
+ 0.475 |
+ 0.400 |
+ 0.679 |
+ 0.572 |
+ 0.663 |
+ 0.652 |
+ 0.676 |
+ 0.668 |
+ 0.687 |
+ 0.662 |
+ 0.670 |
+ 0.653 |
+ 0.663 |
+ 0.674 |
+ 0.620 |
+ 0.686 |
+ 1903 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X |
+ 0.644 |
+ 0.718 |
+ 0.744 |
+ 0.745 |
+ 0.716 |
+ 0.729 |
+ 0.482 |
+ 0.577 |
+ 0.745 |
+ 0.754 |
+ 0.770 |
+ 0.752 |
+ 0.736 |
+ 0.529 |
+ 0.688 |
+ 0.736 |
+ 0.702 |
+ 0.727 |
+ 0.718 |
+ 0.739 |
+ 0.486 |
+ 0.522 |
+ 0.699 |
+ 0.692 |
+ 0.547 |
+ 0.665 |
+ 0.673 |
+ 0.674 |
+ 0.710 |
+ 0.732 |
+ 0.710 |
+ 0.654 |
+ 0.104 |
+ 0.509 |
+ 0.642 |
+ 0.703 |
+ 0.699 |
+ 0.706 |
+ 0.732 |
+ 0.746 |
+ 0.736 |
+ 0.748 |
+ 0.510 |
+ 0.302 |
+ 0.631 |
+ 0.646 |
+ 0.701 |
+ 0.689 |
+ 0.820 |
+ 0.773 |
+ 0.772 |
+ 0.768 |
+ 0.778 |
+ 0.768 |
+ 0.782 |
+ 0.780 |
+ 0.770 |
+ 0.772 |
+ 0.769 |
+ 0.779 |
+ 0.716 |
+ 0.835 |
+ 2068 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBS_HUMAN_Sun_2020 |
+ 0.334 |
+ 0.358 |
+ 0.364 |
+ 0.383 |
+ 0.371 |
+ 0.379 |
+ 0.202 |
+ 0.251 |
+ 0.374 |
+ 0.377 |
+ 0.347 |
+ 0.346 |
+ 0.372 |
+ 0.085 |
+ 0.226 |
+ 0.335 |
+ 0.342 |
+ 0.329 |
+ 0.339 |
+ 0.350 |
+ 0.356 |
+ 0.260 |
+ 0.285 |
+ 0.288 |
+ 0.349 |
+ 0.273 |
+ 0.290 |
+ 0.273 |
+ 0.310 |
+ 0.381 |
+ 0.370 |
+ 0.344 |
+ 0.197 |
+ 0.361 |
+ 0.283 |
+ 0.266 |
+ 0.384 |
+ 0.332 |
+ 0.327 |
+ 0.394 |
+ 0.363 |
+ 0.363 |
+ 0.304 |
+ 0.068 |
+ 0.376 |
+ 0.373 |
+ 0.237 |
+ 0.268 |
+ 0.309 |
+ 0.079 |
+ 0.324 |
+ 0.324 |
+ 0.331 |
+ 0.337 |
+ 0.337 |
+ 0.335 |
+ 0.332 |
+ 0.332 |
+ 0.335 |
+ 0.340 |
+ 0.386 |
+ 0.289 |
+ 7217 |
+ OrganismalFitness |
+ CBS_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28 |
+ 0.498 |
+ 0.548 |
+ 0.674 |
+ 0.706 |
+ 0.633 |
+ 0.685 |
+ -0.266 |
+ 0.611 |
+ 0.623 |
+ 0.613 |
+ 0.615 |
+ 0.650 |
+ 0.648 |
+ -0.271 |
+ 0.735 |
+ 0.699 |
+ 0.682 |
+ 0.609 |
+ 0.606 |
+ 0.648 |
+ 0.486 |
+ 0.589 |
+ 0.581 |
+ 0.587 |
+ 0.553 |
+ 0.577 |
+ 0.589 |
+ 0.531 |
+ 0.569 |
+ 0.633 |
+ 0.667 |
+ 0.653 |
+ 0.423 |
+ 0.425 |
+ 0.505 |
+ 0.540 |
+ 0.601 |
+ 0.600 |
+ 0.623 |
+ 0.676 |
+ 0.698 |
+ 0.701 |
+ 0.621 |
+ -0.320 |
+ 0.369 |
+ 0.561 |
+ 0.167 |
+ 0.155 |
+ 0.614 |
+ 0.588 |
+ 0.695 |
+ 0.626 |
+ 0.690 |
+ 0.676 |
+ 0.709 |
+ 0.668 |
+ 0.677 |
+ 0.718 |
+ 0.664 |
+ 0.695 |
+ 0.642 |
+ 0.751 |
+ 2282 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CCDB_ECOLI_Adkar_2012 |
+ 0.357 |
+ 0.486 |
+ 0.520 |
+ 0.547 |
+ 0.528 |
+ 0.531 |
+ -0.031 |
+ 0.185 |
+ 0.341 |
+ 0.361 |
+ 0.424 |
+ 0.351 |
+ 0.432 |
+ -0.006 |
+ -0.018 |
+ 0.406 |
+ 0.466 |
+ 0.463 |
+ 0.264 |
+ 0.457 |
+ 0.027 |
+ -0.015 |
+ -0.115 |
+ 0.142 |
+ -0.139 |
+ 0.053 |
+ -0.029 |
+ 0.034 |
+ 0.425 |
+ 0.453 |
+ 0.554 |
+ 0.555 |
+ 0.124 |
+ 0.006 |
+ 0.056 |
+ 0.310 |
+ 0.442 |
+ 0.426 |
+ 0.458 |
+ 0.507 |
+ 0.494 |
+ 0.522 |
+ 0.018 |
+ -0.050 |
+ 0.444 |
+ 0.020 |
+ 0.285 |
+ 0.426 |
+ 0.343 |
+ 0.242 |
+ 0.436 |
+ 0.401 |
+ 0.414 |
+ 0.443 |
+ 0.437 |
+ 0.454 |
+ 0.436 |
+ 0.464 |
+ 0.486 |
+ 0.453 |
+ 0.438 |
+ 0.194 |
+ 1176 |
+ Activity |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCDB_ECOLI_Tripathi_2016 |
+ 0.434 |
+ 0.506 |
+ 0.524 |
+ 0.541 |
+ 0.517 |
+ 0.528 |
+ -0.008 |
+ 0.294 |
+ 0.403 |
+ 0.431 |
+ 0.490 |
+ 0.435 |
+ 0.483 |
+ -0.007 |
+ -0.001 |
+ 0.462 |
+ 0.511 |
+ 0.522 |
+ 0.362 |
+ 0.518 |
+ 0.034 |
+ 0.014 |
+ -0.074 |
+ 0.171 |
+ -0.107 |
+ 0.094 |
+ -0.029 |
+ 0.074 |
+ 0.476 |
+ 0.505 |
+ 0.542 |
+ 0.510 |
+ 0.112 |
+ 0.009 |
+ 0.129 |
+ 0.424 |
+ 0.480 |
+ 0.480 |
+ 0.532 |
+ 0.518 |
+ 0.517 |
+ 0.548 |
+ 0.005 |
+ -0.055 |
+ 0.495 |
+ 0.017 |
+ 0.317 |
+ 0.433 |
+ 0.335 |
+ 0.230 |
+ 0.486 |
+ 0.449 |
+ 0.449 |
+ 0.482 |
+ 0.483 |
+ 0.480 |
+ 0.473 |
+ 0.490 |
+ 0.517 |
+ 0.492 |
+ 0.490 |
+ 0.326 |
+ 1663 |
+ OrganismalFitness |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCR5_HUMAN_Gill_2023 |
+ 0.271 |
+ 0.279 |
+ 0.282 |
+ 0.289 |
+ 0.274 |
+ 0.282 |
+ 0.316 |
+ 0.309 |
+ 0.333 |
+ 0.343 |
+ 0.358 |
+ 0.344 |
+ 0.356 |
+ 0.325 |
+ 0.358 |
+ 0.362 |
+ 0.347 |
+ 0.339 |
+ 0.341 |
+ 0.352 |
+ 0.367 |
+ 0.366 |
+ 0.323 |
+ 0.326 |
+ 0.369 |
+ 0.356 |
+ 0.346 |
+ 0.360 |
+ 0.322 |
+ 0.364 |
+ 0.300 |
+ 0.251 |
+ 0.130 |
+ 0.367 |
+ 0.365 |
+ 0.363 |
+ 0.367 |
+ 0.375 |
+ 0.376 |
+ 0.324 |
+ 0.324 |
+ 0.323 |
+ 0.354 |
+ 0.175 |
+ 0.362 |
+ 0.361 |
+ 0.278 |
+ 0.306 |
+ 0.291 |
+ 0.173 |
+ 0.314 |
+ 0.305 |
+ 0.316 |
+ 0.318 |
+ 0.313 |
+ 0.317 |
+ 0.328 |
+ 0.323 |
+ 0.327 |
+ 0.327 |
+ 0.348 |
+ 0.356 |
+ 6137 |
+ Binding |
+ CCR5_HUMAN |
+ High |
+ Human |
+
+
+ CD19_HUMAN_Klesmith_2019_FMC_singles |
+ 0.189 |
+ 0.247 |
+ 0.199 |
+ 0.211 |
+ 0.234 |
+ 0.232 |
+ 0.051 |
+ 0.233 |
+ 0.257 |
+ 0.261 |
+ 0.046 |
+ 0.050 |
+ 0.072 |
+ 0.090 |
+ 0.093 |
+ 0.054 |
+ 0.050 |
+ 0.123 |
+ 0.217 |
+ -0.013 |
+ 0.096 |
+ 0.124 |
+ 0.118 |
+ 0.119 |
+ 0.042 |
+ 0.120 |
+ 0.112 |
+ 0.093 |
+ 0.244 |
+ 0.283 |
+ 0.194 |
+ 0.159 |
+ 0.077 |
+ 0.075 |
+ 0.120 |
+ 0.093 |
+ 0.203 |
+ 0.210 |
+ 0.202 |
+ 0.233 |
+ 0.235 |
+ 0.231 |
+ 0.083 |
+ 0.050 |
+ 0.104 |
+ 0.086 |
+ 0.316 |
+ 0.258 |
+ 0.301 |
+ 0.174 |
+ 0.158 |
+ 0.180 |
+ 0.173 |
+ 0.196 |
+ 0.186 |
+ 0.195 |
+ 0.204 |
+ 0.170 |
+ 0.185 |
+ 0.195 |
+ 0.312 |
+ 0.244 |
+ 3761 |
+ Binding |
+ CD19_HUMAN |
+ Low |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_abundance |
+ 0.505 |
+ 0.572 |
+ 0.594 |
+ 0.611 |
+ 0.596 |
+ 0.610 |
+ 0.529 |
+ 0.555 |
+ 0.601 |
+ 0.615 |
+ 0.513 |
+ 0.588 |
+ 0.621 |
+ 0.531 |
+ 0.605 |
+ 0.625 |
+ 0.635 |
+ 0.626 |
+ 0.591 |
+ 0.611 |
+ 0.553 |
+ 0.548 |
+ 0.587 |
+ 0.558 |
+ 0.571 |
+ 0.585 |
+ 0.590 |
+ 0.572 |
+ 0.572 |
+ 0.605 |
+ 0.573 |
+ 0.511 |
+ 0.208 |
+ 0.581 |
+ 0.563 |
+ 0.569 |
+ 0.618 |
+ 0.618 |
+ 0.624 |
+ 0.634 |
+ 0.630 |
+ 0.632 |
+ 0.579 |
+ 0.116 |
+ 0.534 |
+ 0.607 |
+ 0.555 |
+ 0.538 |
+ 0.607 |
+ 0.218 |
+ 0.600 |
+ 0.598 |
+ 0.606 |
+ 0.629 |
+ 0.631 |
+ 0.623 |
+ 0.622 |
+ 0.611 |
+ 0.617 |
+ 0.635 |
+ 0.637 |
+ 0.618 |
+ 6370 |
+ Expression |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_activity |
+ 0.526 |
+ 0.590 |
+ 0.626 |
+ 0.645 |
+ 0.616 |
+ 0.635 |
+ 0.583 |
+ 0.615 |
+ 0.606 |
+ 0.635 |
+ 0.519 |
+ 0.642 |
+ 0.666 |
+ 0.570 |
+ 0.655 |
+ 0.679 |
+ 0.679 |
+ 0.678 |
+ 0.620 |
+ 0.652 |
+ 0.587 |
+ 0.592 |
+ 0.619 |
+ 0.573 |
+ 0.618 |
+ 0.587 |
+ 0.593 |
+ 0.592 |
+ 0.582 |
+ 0.637 |
+ 0.603 |
+ 0.544 |
+ 0.225 |
+ 0.638 |
+ 0.605 |
+ 0.572 |
+ 0.666 |
+ 0.658 |
+ 0.638 |
+ 0.668 |
+ 0.659 |
+ 0.650 |
+ 0.638 |
+ 0.094 |
+ 0.557 |
+ 0.666 |
+ 0.603 |
+ 0.577 |
+ 0.656 |
+ 0.263 |
+ 0.645 |
+ 0.649 |
+ 0.652 |
+ 0.663 |
+ 0.674 |
+ 0.672 |
+ 0.668 |
+ 0.656 |
+ 0.662 |
+ 0.681 |
+ 0.673 |
+ 0.667 |
+ 6142 |
+ Binding |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM |
+ 0.389 |
+ 0.504 |
+ 0.432 |
+ 0.445 |
+ 0.470 |
+ 0.477 |
+ 0.260 |
+ 0.400 |
+ 0.461 |
+ 0.461 |
+ 0.481 |
+ 0.546 |
+ 0.605 |
+ 0.449 |
+ 0.686 |
+ 0.518 |
+ 0.484 |
+ 0.423 |
+ 0.415 |
+ 0.406 |
+ 0.449 |
+ 0.455 |
+ 0.502 |
+ 0.514 |
+ 0.443 |
+ 0.587 |
+ 0.530 |
+ 0.504 |
+ 0.501 |
+ 0.532 |
+ 0.401 |
+ 0.389 |
+ 0.244 |
+ 0.341 |
+ 0.474 |
+ 0.546 |
+ 0.497 |
+ 0.536 |
+ 0.576 |
+ 0.468 |
+ 0.513 |
+ 0.530 |
+ 0.416 |
+ 0.326 |
+ 0.494 |
+ 0.538 |
+ 0.562 |
+ 0.541 |
+ 0.653 |
+ 0.620 |
+ 0.531 |
+ 0.520 |
+ 0.532 |
+ 0.557 |
+ 0.580 |
+ 0.556 |
+ 0.539 |
+ 0.580 |
+ 0.548 |
+ 0.556 |
+ 0.575 |
+ 0.720 |
+ 3295 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX |
+ 0.357 |
+ 0.451 |
+ 0.435 |
+ 0.438 |
+ 0.474 |
+ 0.445 |
+ 0.154 |
+ 0.453 |
+ 0.516 |
+ 0.547 |
+ 0.483 |
+ 0.427 |
+ 0.398 |
+ 0.153 |
+ 0.428 |
+ 0.478 |
+ 0.547 |
+ 0.498 |
+ 0.467 |
+ 0.304 |
+ 0.174 |
+ 0.206 |
+ 0.225 |
+ 0.102 |
+ 0.117 |
+ 0.224 |
+ 0.255 |
+ 0.248 |
+ 0.434 |
+ 0.420 |
+ 0.469 |
+ 0.414 |
+ 0.231 |
+ 0.160 |
+ 0.146 |
+ 0.213 |
+ 0.380 |
+ 0.387 |
+ 0.393 |
+ 0.438 |
+ 0.441 |
+ 0.448 |
+ 0.154 |
+ 0.064 |
+ 0.462 |
+ 0.256 |
+ 0.515 |
+ 0.501 |
+ 0.654 |
+ 0.482 |
+ 0.549 |
+ 0.508 |
+ 0.494 |
+ 0.552 |
+ 0.517 |
+ 0.551 |
+ 0.538 |
+ 0.525 |
+ 0.517 |
+ 0.538 |
+ 0.600 |
+ 0.571 |
+ 1580 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ D7PM05_CLYGR_Somermeyer_2022 |
+ 0.530 |
+ 0.625 |
+ 0.613 |
+ 0.582 |
+ 0.629 |
+ 0.631 |
+ 0.073 |
+ 0.475 |
+ 0.649 |
+ 0.657 |
+ 0.439 |
+ 0.066 |
+ 0.064 |
+ 0.068 |
+ 0.053 |
+ 0.036 |
+ 0.050 |
+ 0.072 |
+ 0.152 |
+ 0.438 |
+ 0.039 |
+ 0.042 |
+ 0.114 |
+ 0.082 |
+ 0.009 |
+ 0.047 |
+ 0.068 |
+ 0.194 |
+ 0.361 |
+ 0.623 |
+ 0.563 |
+ 0.574 |
+ 0.016 |
+ 0.096 |
+ 0.125 |
+ 0.156 |
+ 0.536 |
+ 0.536 |
+ 0.532 |
+ 0.630 |
+ 0.629 |
+ 0.618 |
+ 0.004 |
+ 0.024 |
+ 0.009 |
+ 0.028 |
+ 0.292 |
+ 0.316 |
+ 0.505 |
+ 0.396 |
+ 0.495 |
+ 0.491 |
+ 0.496 |
+ 0.505 |
+ 0.494 |
+ 0.502 |
+ 0.494 |
+ 0.490 |
+ 0.487 |
+ 0.498 |
+ 0.345 |
+ 0.208 |
+ 24515 |
+ Activity |
+ D7PM05_CLYGR |
+ Low |
+ Eukaryote |
+
+
+ DLG4_HUMAN_Faure_2021 |
+ 0.679 |
+ 0.584 |
+ 0.607 |
+ 0.577 |
+ 0.609 |
+ 0.616 |
+ 0.723 |
+ 0.629 |
+ 0.524 |
+ 0.535 |
+ 0.520 |
+ 0.550 |
+ 0.608 |
+ 0.743 |
+ 0.764 |
+ 0.728 |
+ 0.584 |
+ 0.487 |
+ 0.443 |
+ 0.650 |
+ 0.574 |
+ 0.583 |
+ 0.569 |
+ 0.538 |
+ 0.605 |
+ 0.607 |
+ 0.564 |
+ 0.562 |
+ 0.495 |
+ 0.614 |
+ 0.634 |
+ 0.621 |
+ 0.511 |
+ 0.576 |
+ 0.638 |
+ 0.580 |
+ 0.653 |
+ 0.700 |
+ 0.667 |
+ 0.643 |
+ 0.661 |
+ 0.636 |
+ 0.581 |
+ 0.233 |
+ 0.372 |
+ 0.528 |
+ 0.637 |
+ 0.453 |
+ 0.672 |
+ 0.315 |
+ 0.483 |
+ 0.370 |
+ 0.539 |
+ 0.499 |
+ 0.520 |
+ 0.492 |
+ 0.508 |
+ 0.486 |
+ 0.504 |
+ 0.504 |
+ 0.521 |
+ 0.745 |
+ 6976 |
+ OrganismalFitness |
+ DLG4_HUMAN |
+ Low |
+ Human |
+
+
+ DLG4_RAT_McLaughlin_2012 |
+ 0.486 |
+ 0.442 |
+ 0.487 |
+ 0.491 |
+ 0.526 |
+ 0.539 |
+ 0.490 |
+ 0.437 |
+ 0.483 |
+ 0.522 |
+ 0.469 |
+ 0.557 |
+ 0.588 |
+ 0.411 |
+ 0.543 |
+ 0.581 |
+ 0.543 |
+ 0.478 |
+ 0.471 |
+ 0.444 |
+ 0.378 |
+ 0.389 |
+ 0.373 |
+ 0.371 |
+ 0.407 |
+ 0.420 |
+ 0.407 |
+ 0.367 |
+ 0.395 |
+ 0.490 |
+ 0.556 |
+ 0.537 |
+ 0.246 |
+ 0.367 |
+ 0.392 |
+ 0.307 |
+ 0.470 |
+ 0.478 |
+ 0.443 |
+ 0.538 |
+ 0.544 |
+ 0.540 |
+ 0.527 |
+ 0.076 |
+ 0.443 |
+ 0.500 |
+ 0.439 |
+ 0.308 |
+ 0.459 |
+ 0.135 |
+ 0.442 |
+ 0.375 |
+ 0.444 |
+ 0.433 |
+ 0.444 |
+ 0.430 |
+ 0.430 |
+ 0.464 |
+ 0.460 |
+ 0.469 |
+ 0.504 |
+ 0.535 |
+ 1576 |
+ Binding |
+ DLG4_RAT |
+ Low |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC |
+ 0.171 |
+ 0.198 |
+ 0.218 |
+ 0.230 |
+ 0.200 |
+ 0.259 |
+ -0.071 |
+ 0.317 |
+ 0.368 |
+ 0.386 |
+ 0.085 |
+ 0.126 |
+ 0.131 |
+ 0.074 |
+ 0.209 |
+ 0.255 |
+ 0.337 |
+ 0.337 |
+ 0.370 |
+ 0.235 |
+ -0.068 |
+ 0.045 |
+ 0.033 |
+ 0.066 |
+ 0.041 |
+ 0.107 |
+ 0.057 |
+ 0.071 |
+ 0.235 |
+ 0.298 |
+ 0.431 |
+ 0.351 |
+ 0.054 |
+ -0.013 |
+ -0.023 |
+ 0.031 |
+ 0.212 |
+ 0.200 |
+ 0.211 |
+ 0.243 |
+ 0.227 |
+ 0.231 |
+ 0.106 |
+ -0.048 |
+ 0.299 |
+ 0.129 |
+ 0.568 |
+ 0.560 |
+ 0.660 |
+ 0.556 |
+ 0.424 |
+ 0.436 |
+ 0.516 |
+ 0.507 |
+ 0.491 |
+ 0.476 |
+ 0.472 |
+ 0.444 |
+ 0.436 |
+ 0.490 |
+ 0.556 |
+ 0.481 |
+ 1008 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1 |
+ 0.738 |
+ 0.750 |
+ 0.759 |
+ 0.757 |
+ 0.767 |
+ 0.764 |
+ 0.756 |
+ 0.732 |
+ 0.776 |
+ 0.790 |
+ 0.773 |
+ 0.756 |
+ 0.785 |
+ 0.827 |
+ 0.816 |
+ 0.817 |
+ 0.803 |
+ 0.753 |
+ 0.798 |
+ 0.781 |
+ 0.669 |
+ 0.736 |
+ 0.776 |
+ 0.750 |
+ 0.782 |
+ 0.765 |
+ 0.774 |
+ 0.721 |
+ 0.764 |
+ 0.775 |
+ 0.729 |
+ 0.714 |
+ 0.660 |
+ 0.780 |
+ 0.776 |
+ 0.805 |
+ 0.808 |
+ 0.816 |
+ 0.823 |
+ 0.774 |
+ 0.787 |
+ 0.787 |
+ 0.670 |
+ 0.164 |
+ 0.443 |
+ 0.684 |
+ 0.643 |
+ 0.528 |
+ 0.779 |
+ 0.785 |
+ 0.757 |
+ 0.761 |
+ 0.760 |
+ 0.793 |
+ 0.789 |
+ 0.803 |
+ 0.789 |
+ 0.786 |
+ 0.794 |
+ 0.786 |
+ 0.726 |
+ 0.801 |
+ 2264 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y |
+ 0.418 |
+ 0.453 |
+ 0.421 |
+ 0.439 |
+ 0.494 |
+ 0.501 |
+ 0.057 |
+ 0.432 |
+ 0.406 |
+ 0.403 |
+ 0.525 |
+ 0.472 |
+ 0.501 |
+ 0.224 |
+ 0.458 |
+ 0.486 |
+ 0.531 |
+ 0.508 |
+ 0.531 |
+ 0.518 |
+ 0.340 |
+ 0.418 |
+ 0.402 |
+ 0.408 |
+ 0.315 |
+ 0.402 |
+ 0.365 |
+ 0.348 |
+ 0.395 |
+ 0.448 |
+ 0.471 |
+ 0.469 |
+ 0.216 |
+ 0.202 |
+ 0.321 |
+ 0.252 |
+ 0.473 |
+ 0.490 |
+ 0.494 |
+ 0.489 |
+ 0.491 |
+ 0.507 |
+ 0.359 |
+ -0.177 |
+ 0.513 |
+ 0.439 |
+ 0.380 |
+ 0.394 |
+ 0.419 |
+ 0.439 |
+ 0.492 |
+ 0.481 |
+ 0.497 |
+ 0.507 |
+ 0.491 |
+ 0.512 |
+ 0.477 |
+ 0.493 |
+ 0.504 |
+ 0.497 |
+ 0.567 |
+ 0.531 |
+ 2915 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ DYR_ECOLI_Nguyen_2023 |
+ 0.340 |
+ 0.424 |
+ 0.448 |
+ 0.453 |
+ 0.451 |
+ 0.454 |
+ 0.004 |
+ 0.425 |
+ 0.505 |
+ 0.481 |
+ 0.523 |
+ 0.535 |
+ 0.545 |
+ 0.133 |
+ 0.500 |
+ 0.525 |
+ 0.544 |
+ 0.529 |
+ 0.511 |
+ 0.502 |
+ 0.373 |
+ 0.421 |
+ 0.281 |
+ 0.342 |
+ 0.435 |
+ 0.388 |
+ 0.387 |
+ 0.454 |
+ 0.333 |
+ 0.480 |
+ 0.502 |
+ 0.477 |
+ 0.148 |
+ 0.405 |
+ 0.235 |
+ 0.225 |
+ 0.460 |
+ 0.330 |
+ 0.356 |
+ 0.459 |
+ 0.406 |
+ 0.418 |
+ 0.469 |
+ 0.013 |
+ 0.519 |
+ 0.499 |
+ 0.264 |
+ 0.428 |
+ 0.374 |
+ 0.157 |
+ 0.501 |
+ 0.459 |
+ 0.470 |
+ 0.483 |
+ 0.492 |
+ 0.490 |
+ 0.503 |
+ 0.495 |
+ 0.511 |
+ 0.505 |
+ 0.511 |
+ 0.471 |
+ 2916 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ High |
+ Prokaryote |
+
+
+ DYR_ECOLI_Thompson_2019 |
+ 0.384 |
+ 0.484 |
+ 0.469 |
+ 0.472 |
+ 0.476 |
+ 0.474 |
+ -0.018 |
+ 0.321 |
+ 0.470 |
+ 0.501 |
+ 0.469 |
+ 0.416 |
+ 0.436 |
+ 0.055 |
+ 0.349 |
+ 0.455 |
+ 0.480 |
+ 0.506 |
+ 0.512 |
+ 0.485 |
+ 0.199 |
+ 0.360 |
+ 0.266 |
+ 0.310 |
+ 0.342 |
+ 0.436 |
+ 0.476 |
+ 0.441 |
+ 0.418 |
+ 0.451 |
+ 0.451 |
+ 0.408 |
+ 0.100 |
+ 0.355 |
+ 0.348 |
+ 0.347 |
+ 0.407 |
+ 0.415 |
+ 0.423 |
+ 0.459 |
+ 0.476 |
+ 0.481 |
+ 0.318 |
+ -0.019 |
+ 0.442 |
+ 0.390 |
+ 0.248 |
+ 0.403 |
+ 0.321 |
+ 0.111 |
+ 0.440 |
+ 0.411 |
+ 0.415 |
+ 0.418 |
+ 0.435 |
+ 0.422 |
+ 0.441 |
+ 0.442 |
+ 0.455 |
+ 0.445 |
+ 0.491 |
+ 0.387 |
+ 2363 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ ENV_HV1B9_DuenasDecamp_2016 |
+ 0.369 |
+ 0.397 |
+ 0.238 |
+ 0.367 |
+ 0.388 |
+ 0.377 |
+ 0.056 |
+ 0.353 |
+ 0.341 |
+ 0.366 |
+ 0.359 |
+ 0.415 |
+ 0.389 |
+ 0.110 |
+ 0.055 |
+ 0.035 |
+ 0.012 |
+ 0.048 |
+ 0.095 |
+ 0.388 |
+ 0.380 |
+ 0.358 |
+ 0.408 |
+ 0.419 |
+ 0.263 |
+ 0.394 |
+ 0.401 |
+ 0.391 |
+ 0.374 |
+ 0.389 |
+ 0.355 |
+ 0.322 |
+ 0.253 |
+ 0.368 |
+ 0.373 |
+ 0.404 |
+ 0.392 |
+ 0.396 |
+ 0.407 |
+ 0.390 |
+ 0.392 |
+ 0.392 |
+ 0.343 |
+ -0.040 |
+ 0.334 |
+ 0.395 |
+ 0.355 |
+ 0.325 |
+ 0.364 |
+ 0.231 |
+ 0.215 |
+ 0.261 |
+ 0.274 |
+ 0.240 |
+ 0.214 |
+ 0.233 |
+ 0.241 |
+ 0.258 |
+ 0.240 |
+ 0.263 |
+ 0.150 |
+ 0.102 |
+ 375 |
+ OrganismalFitness |
+ ENV_HV1B9 |
+ Medium |
+ Virus |
+
+
+ ENV_HV1BR_Haddox_2016 |
+ 0.338 |
+ 0.303 |
+ 0.322 |
+ 0.323 |
+ 0.339 |
+ 0.345 |
+ -0.001 |
+ 0.322 |
+ 0.345 |
+ 0.344 |
+ 0.298 |
+ 0.320 |
+ 0.336 |
+ -0.009 |
+ -0.006 |
+ 0.004 |
+ 0.046 |
+ 0.069 |
+ 0.160 |
+ 0.337 |
+ 0.350 |
+ 0.358 |
+ 0.371 |
+ 0.364 |
+ 0.345 |
+ 0.358 |
+ 0.358 |
+ 0.354 |
+ 0.362 |
+ 0.350 |
+ 0.325 |
+ 0.290 |
+ 0.189 |
+ 0.344 |
+ 0.360 |
+ 0.358 |
+ 0.359 |
+ 0.366 |
+ 0.362 |
+ 0.367 |
+ 0.369 |
+ 0.365 |
+ 0.233 |
+ -0.013 |
+ 0.321 |
+ 0.293 |
+ 0.212 |
+ 0.245 |
+ 0.119 |
+ 0.072 |
+ 0.180 |
+ 0.179 |
+ 0.203 |
+ 0.226 |
+ 0.188 |
+ 0.202 |
+ 0.200 |
+ 0.192 |
+ 0.193 |
+ 0.209 |
+ 0.167 |
+ 0.098 |
+ 12863 |
+ OrganismalFitness |
+ ENV_HV1BR |
+ Medium |
+ Virus |
+
+
+ ENVZ_ECOLI_Ghose_2023 |
+ 0.115 |
+ 0.095 |
+ 0.180 |
+ 0.178 |
+ 0.187 |
+ 0.188 |
+ 0.040 |
+ 0.190 |
+ 0.190 |
+ 0.198 |
+ 0.154 |
+ 0.204 |
+ 0.209 |
+ 0.194 |
+ 0.207 |
+ 0.195 |
+ 0.172 |
+ 0.154 |
+ 0.113 |
+ 0.183 |
+ 0.164 |
+ 0.152 |
+ 0.201 |
+ 0.183 |
+ 0.176 |
+ 0.164 |
+ 0.199 |
+ 0.183 |
+ 0.163 |
+ 0.178 |
+ 0.133 |
+ 0.135 |
+ 0.142 |
+ 0.180 |
+ 0.184 |
+ 0.193 |
+ 0.207 |
+ 0.213 |
+ 0.215 |
+ 0.195 |
+ 0.201 |
+ 0.199 |
+ 0.194 |
+ 0.134 |
+ 0.187 |
+ 0.185 |
+ 0.055 |
+ 0.149 |
+ 0.114 |
+ 0.036 |
+ 0.189 |
+ 0.186 |
+ 0.168 |
+ 0.188 |
+ 0.220 |
+ 0.197 |
+ 0.183 |
+ 0.190 |
+ 0.200 |
+ 0.199 |
+ 0.148 |
+ 0.224 |
+ 1121 |
+ Activity |
+ ENVZ_ECOLI |
+ High |
+ Prokaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M |
+ 0.615 |
+ 0.706 |
+ 0.705 |
+ 0.708 |
+ 0.732 |
+ 0.733 |
+ -0.320 |
+ 0.684 |
+ 0.781 |
+ 0.789 |
+ 0.762 |
+ 0.787 |
+ 0.789 |
+ -0.378 |
+ 0.768 |
+ 0.803 |
+ 0.810 |
+ 0.769 |
+ 0.744 |
+ 0.741 |
+ 0.555 |
+ 0.599 |
+ 0.685 |
+ 0.656 |
+ 0.641 |
+ 0.664 |
+ 0.684 |
+ 0.705 |
+ 0.735 |
+ 0.795 |
+ 0.752 |
+ 0.738 |
+ 0.677 |
+ 0.612 |
+ 0.603 |
+ 0.644 |
+ 0.716 |
+ 0.711 |
+ 0.728 |
+ 0.741 |
+ 0.737 |
+ 0.735 |
+ 0.625 |
+ -0.253 |
+ 0.618 |
+ 0.630 |
+ 0.665 |
+ 0.604 |
+ 0.828 |
+ 0.760 |
+ 0.805 |
+ 0.801 |
+ 0.806 |
+ 0.811 |
+ 0.820 |
+ 0.809 |
+ 0.802 |
+ 0.806 |
+ 0.800 |
+ 0.812 |
+ 0.799 |
+ 0.805 |
+ 1960 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ ERBB2_HUMAN_Elazar_2016 |
+ 0.386 |
+ 0.368 |
+ 0.254 |
+ 0.250 |
+ 0.244 |
+ 0.268 |
+ 0.449 |
+ 0.415 |
+ 0.405 |
+ 0.416 |
+ 0.449 |
+ 0.480 |
+ 0.490 |
+ 0.481 |
+ 0.463 |
+ 0.469 |
+ 0.423 |
+ 0.373 |
+ 0.295 |
+ 0.047 |
+ 0.457 |
+ 0.499 |
+ 0.550 |
+ 0.471 |
+ 0.505 |
+ 0.536 |
+ 0.596 |
+ 0.598 |
+ 0.506 |
+ 0.381 |
+ 0.424 |
+ 0.066 |
+ 0.489 |
+ 0.553 |
+ 0.471 |
+ 0.521 |
+ 0.494 |
+ 0.465 |
+ 0.492 |
+ 0.447 |
+ 0.441 |
+ 0.439 |
+ 0.468 |
+ 0.459 |
+ 0.537 |
+ 0.463 |
+ 0.461 |
+ 0.491 |
+ -0.139 |
+ 0.022 |
+ 0.410 |
+ 0.406 |
+ 0.451 |
+ 0.440 |
+ 0.415 |
+ 0.406 |
+ 0.432 |
+ 0.452 |
+ 0.438 |
+ 0.446 |
+ 0.525 |
+ 0.488 |
+ 326 |
+ Expression |
+ ERBB2_HUMAN |
+ Low |
+ Human |
+
+
+ ESTA_BACSU_Nutschel_2020 |
+ 0.258 |
+ 0.399 |
+ 0.389 |
+ 0.415 |
+ 0.387 |
+ 0.387 |
+ 0.187 |
+ 0.315 |
+ 0.345 |
+ 0.413 |
+ 0.336 |
+ 0.305 |
+ 0.335 |
+ 0.169 |
+ 0.270 |
+ 0.267 |
+ 0.300 |
+ 0.281 |
+ 0.323 |
+ 0.320 |
+ 0.121 |
+ 0.196 |
+ 0.268 |
+ 0.286 |
+ 0.270 |
+ 0.254 |
+ 0.311 |
+ 0.272 |
+ 0.379 |
+ 0.379 |
+ 0.434 |
+ 0.406 |
+ 0.075 |
+ 0.184 |
+ 0.272 |
+ 0.261 |
+ 0.310 |
+ 0.329 |
+ 0.326 |
+ 0.403 |
+ 0.407 |
+ 0.397 |
+ 0.193 |
+ 0.121 |
+ 0.289 |
+ 0.262 |
+ 0.557 |
+ 0.489 |
+ 0.545 |
+ 0.368 |
+ 0.366 |
+ 0.334 |
+ 0.347 |
+ 0.343 |
+ 0.378 |
+ 0.347 |
+ 0.347 |
+ 0.365 |
+ 0.369 |
+ 0.370 |
+ 0.391 |
+ 0.212 |
+ 2172 |
+ Stability |
+ ESTA_BACSU |
+ High |
+ Prokaryote |
+
+
+ F7YBW8_MESOW_Aakre_2015 |
+ 0.060 |
+ 0.395 |
+ 0.395 |
+ 0.440 |
+ 0.428 |
+ 0.430 |
+ 0.016 |
+ 0.320 |
+ 0.382 |
+ 0.395 |
+ 0.437 |
+ 0.382 |
+ 0.393 |
+ -0.091 |
+ -0.001 |
+ 0.049 |
+ 0.383 |
+ 0.366 |
+ 0.434 |
+ 0.391 |
+ -0.075 |
+ -0.122 |
+ -0.104 |
+ -0.005 |
+ 0.098 |
+ 0.125 |
+ -0.021 |
+ 0.298 |
+ 0.403 |
+ 0.375 |
+ 0.461 |
+ 0.441 |
+ 0.037 |
+ -0.036 |
+ -0.124 |
+ 0.433 |
+ 0.062 |
+ 0.011 |
+ 0.425 |
+ 0.380 |
+ 0.367 |
+ 0.444 |
+ -0.064 |
+ -0.081 |
+ 0.199 |
+ -0.048 |
+ -0.008 |
+ 0.246 |
+ 0.069 |
+ 0.035 |
+ 0.431 |
+ 0.447 |
+ 0.423 |
+ 0.460 |
+ 0.453 |
+ 0.457 |
+ 0.430 |
+ 0.443 |
+ 0.447 |
+ 0.448 |
+ 0.270 |
+ -0.126 |
+ 9192 |
+ OrganismalFitness |
+ F7YBW8_MESOW |
+ High |
+ Prokaryote |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U |
+ 0.380 |
+ 0.431 |
+ 0.449 |
+ 0.458 |
+ 0.449 |
+ 0.441 |
+ 0.023 |
+ 0.297 |
+ 0.372 |
+ 0.456 |
+ 0.509 |
+ 0.269 |
+ 0.433 |
+ 0.059 |
+ 0.449 |
+ 0.536 |
+ 0.524 |
+ 0.480 |
+ 0.386 |
+ 0.464 |
+ 0.169 |
+ 0.272 |
+ 0.343 |
+ 0.288 |
+ 0.156 |
+ 0.477 |
+ 0.451 |
+ 0.292 |
+ 0.394 |
+ 0.520 |
+ 0.409 |
+ 0.336 |
+ 0.276 |
+ 0.065 |
+ 0.121 |
+ 0.148 |
+ 0.355 |
+ 0.348 |
+ 0.302 |
+ 0.415 |
+ 0.415 |
+ 0.389 |
+ 0.395 |
+ 0.008 |
+ 0.501 |
+ 0.447 |
+ 0.582 |
+ 0.551 |
+ 0.588 |
+ 0.525 |
+ 0.527 |
+ 0.483 |
+ 0.514 |
+ 0.512 |
+ 0.500 |
+ 0.528 |
+ 0.499 |
+ 0.489 |
+ 0.506 |
+ 0.515 |
+ 0.595 |
+ 0.585 |
+ 1886 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ FKBP3_HUMAN_Tsuboyama_2023_2KFV |
+ 0.423 |
+ 0.394 |
+ 0.479 |
+ 0.482 |
+ 0.487 |
+ 0.498 |
+ 0.180 |
+ 0.342 |
+ 0.237 |
+ 0.237 |
+ 0.171 |
+ 0.170 |
+ 0.164 |
+ 0.172 |
+ 0.163 |
+ 0.147 |
+ 0.188 |
+ 0.265 |
+ 0.378 |
+ 0.287 |
+ 0.221 |
+ 0.213 |
+ 0.218 |
+ 0.279 |
+ 0.169 |
+ 0.229 |
+ 0.193 |
+ 0.250 |
+ 0.220 |
+ 0.478 |
+ 0.315 |
+ 0.319 |
+ -0.077 |
+ 0.095 |
+ 0.168 |
+ 0.230 |
+ 0.398 |
+ 0.410 |
+ 0.342 |
+ 0.484 |
+ 0.490 |
+ 0.409 |
+ 0.180 |
+ 0.184 |
+ 0.177 |
+ 0.210 |
+ 0.685 |
+ 0.603 |
+ 0.687 |
+ 0.589 |
+ 0.306 |
+ 0.397 |
+ 0.443 |
+ 0.402 |
+ 0.397 |
+ 0.403 |
+ 0.369 |
+ 0.368 |
+ 0.296 |
+ 0.399 |
+ 0.581 |
+ 0.368 |
+ 1237 |
+ Stability |
+ FKBP3_HUMAN |
+ Medium |
+ Human |
+
+
+ GAL4_YEAST_Kitzman_2015 |
+ 0.237 |
+ 0.403 |
+ 0.507 |
+ 0.567 |
+ 0.486 |
+ 0.532 |
+ 0.337 |
+ -0.004 |
+ 0.513 |
+ 0.575 |
+ 0.623 |
+ 0.458 |
+ 0.462 |
+ 0.340 |
+ 0.392 |
+ 0.528 |
+ 0.668 |
+ 0.670 |
+ 0.655 |
+ 0.507 |
+ 0.319 |
+ 0.355 |
+ 0.383 |
+ 0.372 |
+ 0.364 |
+ 0.424 |
+ 0.493 |
+ 0.451 |
+ 0.579 |
+ 0.628 |
+ 0.651 |
+ 0.563 |
+ 0.303 |
+ 0.279 |
+ 0.331 |
+ 0.325 |
+ 0.560 |
+ 0.555 |
+ 0.557 |
+ 0.508 |
+ 0.509 |
+ 0.503 |
+ 0.395 |
+ 0.301 |
+ 0.604 |
+ 0.446 |
+ 0.287 |
+ 0.568 |
+ 0.308 |
+ 0.114 |
+ 0.627 |
+ 0.634 |
+ 0.613 |
+ 0.625 |
+ 0.632 |
+ 0.634 |
+ 0.620 |
+ 0.617 |
+ 0.619 |
+ 0.638 |
+ 0.558 |
+ 0.408 |
+ 1195 |
+ OrganismalFitness |
+ GAL4_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ GCN4_YEAST_Staller_2018 |
+ 0.253 |
+ 0.250 |
+ 0.245 |
+ 0.248 |
+ 0.242 |
+ 0.241 |
+ 0.182 |
+ 0.221 |
+ 0.240 |
+ 0.240 |
+ 0.238 |
+ 0.272 |
+ 0.284 |
+ 0.318 |
+ 0.278 |
+ 0.266 |
+ 0.281 |
+ 0.261 |
+ 0.248 |
+ 0.186 |
+ 0.175 |
+ 0.189 |
+ 0.168 |
+ 0.167 |
+ 0.158 |
+ 0.118 |
+ 0.081 |
+ 0.134 |
+ 0.164 |
+ 0.225 |
+ 0.255 |
+ 0.247 |
+ 0.025 |
+ 0.175 |
+ 0.169 |
+ 0.265 |
+ 0.258 |
+ 0.257 |
+ 0.278 |
+ 0.258 |
+ 0.257 |
+ 0.274 |
+ 0.107 |
+ 0.192 |
+ 0.157 |
+ 0.126 |
+ 0.145 |
+ 0.183 |
+ 0.201 |
+ 0.167 |
+ 0.227 |
+ 0.220 |
+ 0.217 |
+ 0.214 |
+ 0.226 |
+ 0.224 |
+ 0.219 |
+ 0.221 |
+ 0.224 |
+ 0.222 |
+ 0.221 |
+ 0.245 |
+ 2638 |
+ Binding |
+ GCN4_YEAST |
+ Low |
+ Eukaryote |
+
+
+ GDIA_HUMAN_Silverstein_2021 |
+ 0.442 |
+ 0.429 |
+ 0.439 |
+ 0.446 |
+ 0.451 |
+ 0.452 |
+ 0.205 |
+ 0.398 |
+ 0.466 |
+ 0.462 |
+ 0.391 |
+ 0.423 |
+ 0.460 |
+ 0.159 |
+ 0.247 |
+ 0.448 |
+ 0.397 |
+ 0.356 |
+ 0.411 |
+ 0.401 |
+ 0.253 |
+ 0.385 |
+ 0.380 |
+ 0.409 |
+ 0.326 |
+ 0.439 |
+ 0.425 |
+ 0.384 |
+ 0.397 |
+ 0.415 |
+ 0.423 |
+ 0.393 |
+ 0.276 |
+ 0.321 |
+ 0.394 |
+ 0.356 |
+ 0.432 |
+ 0.448 |
+ 0.437 |
+ 0.451 |
+ 0.470 |
+ 0.461 |
+ 0.233 |
+ 0.121 |
+ 0.432 |
+ 0.260 |
+ 0.357 |
+ 0.421 |
+ 0.387 |
+ 0.101 |
+ 0.389 |
+ 0.335 |
+ 0.402 |
+ 0.378 |
+ 0.391 |
+ 0.399 |
+ 0.377 |
+ 0.391 |
+ 0.375 |
+ 0.405 |
+ 0.461 |
+ 0.338 |
+ 1154 |
+ OrganismalFitness |
+ GDIA_HUMAN |
+ Low |
+ Human |
+
+
+ GFP_AEQVI_Sarkisyan_2016 |
+ 0.649 |
+ 0.644 |
+ 0.672 |
+ 0.673 |
+ 0.679 |
+ 0.679 |
+ 0.049 |
+ 0.635 |
+ 0.667 |
+ 0.661 |
+ 0.524 |
+ 0.098 |
+ 0.103 |
+ 0.078 |
+ 0.131 |
+ 0.103 |
+ 0.108 |
+ 0.149 |
+ 0.290 |
+ 0.598 |
+ 0.078 |
+ 0.105 |
+ 0.181 |
+ 0.098 |
+ 0.046 |
+ 0.188 |
+ 0.298 |
+ 0.642 |
+ 0.647 |
+ 0.678 |
+ 0.607 |
+ 0.615 |
+ 0.074 |
+ 0.066 |
+ 0.182 |
+ 0.629 |
+ 0.649 |
+ 0.651 |
+ 0.672 |
+ 0.683 |
+ 0.685 |
+ 0.706 |
+ 0.024 |
+ -0.014 |
+ 0.041 |
+ 0.036 |
+ 0.512 |
+ 0.508 |
+ 0.713 |
+ 0.602 |
+ 0.591 |
+ 0.600 |
+ 0.610 |
+ 0.617 |
+ 0.609 |
+ 0.611 |
+ 0.605 |
+ 0.606 |
+ 0.587 |
+ 0.607 |
+ 0.623 |
+ 0.449 |
+ 51714 |
+ Activity |
+ GFP_AEQVI |
+ Low |
+ Eukaryote |
+
+
+ GLPA_HUMAN_Elazar_2016 |
+ 0.227 |
+ 0.134 |
+ 0.207 |
+ 0.210 |
+ 0.180 |
+ 0.162 |
+ 0.307 |
+ 0.524 |
+ 0.385 |
+ 0.361 |
+ 0.388 |
+ 0.417 |
+ 0.412 |
+ 0.339 |
+ 0.392 |
+ 0.453 |
+ 0.425 |
+ 0.380 |
+ 0.470 |
+ 0.163 |
+ 0.370 |
+ 0.388 |
+ 0.364 |
+ 0.407 |
+ 0.363 |
+ 0.390 |
+ 0.414 |
+ 0.420 |
+ 0.490 |
+ 0.349 |
+ 0.435 |
+ 0.276 |
+ 0.281 |
+ 0.423 |
+ 0.372 |
+ 0.414 |
+ 0.423 |
+ 0.392 |
+ 0.417 |
+ 0.391 |
+ 0.362 |
+ 0.405 |
+ 0.405 |
+ 0.384 |
+ 0.428 |
+ 0.341 |
+ 0.330 |
+ 0.413 |
+ 0.310 |
+ 0.177 |
+ 0.529 |
+ 0.513 |
+ 0.507 |
+ 0.479 |
+ 0.500 |
+ 0.455 |
+ 0.500 |
+ 0.434 |
+ 0.464 |
+ 0.507 |
+ 0.443 |
+ 0.411 |
+ 245 |
+ Expression |
+ GLPA_HUMAN |
+ Low |
+ Human |
+
+
+ GRB2_HUMAN_Faure_2021 |
+ 0.405 |
+ 0.521 |
+ 0.507 |
+ 0.538 |
+ 0.540 |
+ 0.546 |
+ 0.486 |
+ 0.450 |
+ 0.539 |
+ 0.491 |
+ 0.533 |
+ 0.455 |
+ 0.515 |
+ 0.526 |
+ 0.598 |
+ 0.626 |
+ 0.647 |
+ 0.535 |
+ 0.579 |
+ 0.534 |
+ 0.544 |
+ 0.516 |
+ 0.514 |
+ 0.453 |
+ 0.532 |
+ 0.509 |
+ 0.446 |
+ 0.524 |
+ 0.448 |
+ 0.504 |
+ 0.428 |
+ 0.422 |
+ 0.464 |
+ 0.536 |
+ 0.506 |
+ 0.404 |
+ 0.517 |
+ 0.506 |
+ 0.436 |
+ 0.562 |
+ 0.561 |
+ 0.532 |
+ 0.584 |
+ 0.323 |
+ 0.530 |
+ 0.572 |
+ 0.672 |
+ 0.527 |
+ 0.690 |
+ 0.516 |
+ 0.621 |
+ 0.649 |
+ 0.638 |
+ 0.638 |
+ 0.647 |
+ 0.628 |
+ 0.629 |
+ 0.645 |
+ 0.625 |
+ 0.650 |
+ 0.556 |
+ 0.607 |
+ 63366 |
+ OrganismalFitness |
+ GRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q |
+ 0.315 |
+ 0.410 |
+ 0.113 |
+ 0.147 |
+ 0.351 |
+ 0.360 |
+ 0.241 |
+ 0.399 |
+ 0.430 |
+ 0.485 |
+ 0.666 |
+ 0.450 |
+ 0.496 |
+ 0.285 |
+ 0.416 |
+ 0.574 |
+ 0.686 |
+ 0.588 |
+ 0.577 |
+ 0.245 |
+ 0.292 |
+ 0.306 |
+ 0.362 |
+ 0.368 |
+ 0.324 |
+ 0.325 |
+ 0.210 |
+ 0.345 |
+ 0.515 |
+ 0.579 |
+ 0.636 |
+ 0.504 |
+ 0.191 |
+ 0.262 |
+ 0.287 |
+ 0.486 |
+ 0.430 |
+ 0.443 |
+ 0.531 |
+ 0.374 |
+ 0.430 |
+ 0.507 |
+ 0.310 |
+ 0.285 |
+ 0.614 |
+ 0.376 |
+ 0.667 |
+ 0.724 |
+ 0.730 |
+ 0.601 |
+ 0.692 |
+ 0.698 |
+ 0.710 |
+ 0.687 |
+ 0.714 |
+ 0.711 |
+ 0.696 |
+ 0.714 |
+ 0.719 |
+ 0.723 |
+ 0.768 |
+ 0.609 |
+ 1040 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM |
+ 0.291 |
+ 0.370 |
+ 0.324 |
+ 0.324 |
+ 0.332 |
+ 0.336 |
+ 0.055 |
+ 0.234 |
+ 0.359 |
+ 0.370 |
+ 0.303 |
+ 0.303 |
+ 0.293 |
+ 0.134 |
+ 0.206 |
+ 0.474 |
+ 0.387 |
+ 0.340 |
+ 0.385 |
+ 0.337 |
+ -0.024 |
+ 0.232 |
+ 0.302 |
+ 0.264 |
+ 0.160 |
+ 0.294 |
+ 0.308 |
+ 0.247 |
+ 0.214 |
+ 0.312 |
+ 0.332 |
+ 0.307 |
+ 0.103 |
+ 0.135 |
+ 0.101 |
+ 0.229 |
+ 0.327 |
+ 0.298 |
+ 0.353 |
+ 0.327 |
+ 0.341 |
+ 0.359 |
+ 0.089 |
+ 0.038 |
+ 0.433 |
+ 0.138 |
+ 0.252 |
+ 0.260 |
+ 0.380 |
+ 0.321 |
+ 0.368 |
+ 0.408 |
+ 0.345 |
+ 0.392 |
+ 0.411 |
+ 0.436 |
+ 0.442 |
+ 0.428 |
+ 0.451 |
+ 0.417 |
+ 0.414 |
+ 0.219 |
+ 5586 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HEM3_HUMAN_Loggerenberg_2023 |
+ 0.418 |
+ 0.409 |
+ 0.403 |
+ 0.407 |
+ 0.420 |
+ 0.418 |
+ 0.126 |
+ 0.103 |
+ 0.432 |
+ 0.442 |
+ 0.388 |
+ 0.383 |
+ 0.396 |
+ 0.142 |
+ 0.387 |
+ 0.408 |
+ 0.393 |
+ 0.418 |
+ 0.429 |
+ 0.108 |
+ 0.349 |
+ 0.373 |
+ 0.370 |
+ 0.375 |
+ 0.358 |
+ 0.387 |
+ 0.376 |
+ 0.385 |
+ 0.402 |
+ 0.435 |
+ 0.420 |
+ 0.410 |
+ 0.093 |
+ 0.377 |
+ 0.381 |
+ 0.377 |
+ 0.420 |
+ 0.430 |
+ 0.438 |
+ 0.427 |
+ 0.434 |
+ 0.440 |
+ 0.323 |
+ 0.135 |
+ 0.384 |
+ 0.378 |
+ 0.314 |
+ 0.358 |
+ 0.301 |
+ 0.146 |
+ 0.357 |
+ 0.337 |
+ 0.353 |
+ 0.363 |
+ 0.377 |
+ 0.374 |
+ 0.371 |
+ 0.373 |
+ 0.368 |
+ 0.377 |
+ 0.431 |
+ 0.415 |
+ 5689 |
+ Activity |
+ HEM3_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019 |
+ 0.472 |
+ 0.542 |
+ 0.558 |
+ 0.557 |
+ 0.533 |
+ 0.531 |
+ 0.146 |
+ 0.273 |
+ 0.468 |
+ 0.508 |
+ 0.472 |
+ 0.411 |
+ 0.477 |
+ 0.057 |
+ 0.113 |
+ 0.440 |
+ 0.411 |
+ 0.475 |
+ 0.480 |
+ 0.454 |
+ 0.325 |
+ 0.402 |
+ 0.433 |
+ 0.478 |
+ 0.404 |
+ 0.485 |
+ 0.464 |
+ 0.516 |
+ 0.539 |
+ 0.524 |
+ 0.400 |
+ 0.374 |
+ 0.143 |
+ 0.397 |
+ 0.453 |
+ 0.585 |
+ 0.488 |
+ 0.496 |
+ 0.616 |
+ 0.573 |
+ 0.548 |
+ 0.582 |
+ 0.140 |
+ 0.040 |
+ 0.299 |
+ 0.020 |
+ 0.378 |
+ 0.424 |
+ 0.529 |
+ 0.392 |
+ 0.444 |
+ 0.350 |
+ 0.404 |
+ 0.427 |
+ 0.440 |
+ 0.407 |
+ 0.418 |
+ 0.450 |
+ 0.431 |
+ 0.424 |
+ 0.521 |
+ 0.222 |
+ 496137 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HMDH_HUMAN_Jiang_2019 |
+ 0.491 |
+ 0.421 |
+ 0.382 |
+ 0.394 |
+ 0.456 |
+ 0.451 |
+ 0.158 |
+ 0.282 |
+ 0.282 |
+ 0.274 |
+ 0.275 |
+ 0.341 |
+ 0.315 |
+ 0.046 |
+ 0.010 |
+ 0.317 |
+ 0.509 |
+ 0.374 |
+ 0.378 |
+ 0.302 |
+ 0.451 |
+ 0.162 |
+ 0.175 |
+ 0.221 |
+ 0.435 |
+ 0.197 |
+ 0.168 |
+ 0.209 |
+ 0.213 |
+ 0.476 |
+ 0.413 |
+ 0.401 |
+ 0.228 |
+ 0.308 |
+ 0.247 |
+ 0.219 |
+ 0.440 |
+ 0.340 |
+ 0.330 |
+ 0.437 |
+ 0.361 |
+ 0.360 |
+ 0.172 |
+ 0.069 |
+ 0.477 |
+ 0.496 |
+ 0.323 |
+ 0.444 |
+ -0.184 |
+ 0.105 |
+ 0.493 |
+ 0.492 |
+ 0.486 |
+ 0.498 |
+ 0.489 |
+ 0.492 |
+ 0.480 |
+ 0.500 |
+ 0.485 |
+ 0.499 |
+ 0.529 |
+ 0.438 |
+ 16853 |
+ OrganismalFitness |
+ HMDH_HUMAN |
+ Low |
+ Human |
+
+
+ HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2 |
+ 0.316 |
+ 0.360 |
+ 0.371 |
+ 0.376 |
+ 0.364 |
+ 0.365 |
+ 0.194 |
+ 0.394 |
+ 0.427 |
+ 0.437 |
+ 0.362 |
+ 0.416 |
+ 0.453 |
+ -0.073 |
+ 0.071 |
+ 0.234 |
+ 0.257 |
+ 0.309 |
+ 0.293 |
+ 0.020 |
+ 0.374 |
+ 0.398 |
+ 0.399 |
+ 0.386 |
+ 0.338 |
+ 0.366 |
+ 0.411 |
+ 0.415 |
+ 0.420 |
+ 0.397 |
+ 0.436 |
+ 0.395 |
+ 0.225 |
+ 0.340 |
+ 0.340 |
+ 0.340 |
+ 0.363 |
+ 0.362 |
+ 0.361 |
+ 0.382 |
+ 0.381 |
+ 0.381 |
+ 0.010 |
+ -0.056 |
+ 0.383 |
+ 0.400 |
+ 0.068 |
+ 0.267 |
+ 0.097 |
+ -0.015 |
+ 0.241 |
+ 0.257 |
+ 0.243 |
+ 0.258 |
+ 0.255 |
+ 0.243 |
+ 0.269 |
+ 0.250 |
+ 0.236 |
+ 0.256 |
+ 0.367 |
+ 0.301 |
+ 2252 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Flynn_2019 |
+ 0.349 |
+ 0.367 |
+ 0.398 |
+ 0.409 |
+ 0.401 |
+ 0.401 |
+ 0.159 |
+ 0.347 |
+ 0.417 |
+ 0.406 |
+ 0.367 |
+ 0.392 |
+ 0.417 |
+ 0.018 |
+ 0.094 |
+ 0.174 |
+ 0.283 |
+ 0.359 |
+ 0.370 |
+ 0.391 |
+ 0.384 |
+ 0.397 |
+ 0.389 |
+ 0.407 |
+ 0.398 |
+ 0.419 |
+ 0.423 |
+ 0.397 |
+ 0.434 |
+ 0.425 |
+ 0.454 |
+ 0.441 |
+ 0.194 |
+ 0.380 |
+ 0.394 |
+ 0.392 |
+ 0.403 |
+ 0.415 |
+ 0.411 |
+ 0.418 |
+ 0.427 |
+ 0.424 |
+ 0.133 |
+ -0.065 |
+ 0.371 |
+ 0.313 |
+ 0.188 |
+ 0.308 |
+ 0.110 |
+ 0.062 |
+ 0.291 |
+ 0.283 |
+ 0.295 |
+ 0.305 |
+ 0.293 |
+ 0.296 |
+ 0.301 |
+ 0.296 |
+ 0.282 |
+ 0.302 |
+ 0.386 |
+ 0.329 |
+ 13294 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Mishra_2016 |
+ 0.487 |
+ 0.461 |
+ 0.541 |
+ 0.553 |
+ 0.543 |
+ 0.543 |
+ 0.385 |
+ 0.399 |
+ 0.472 |
+ 0.490 |
+ 0.491 |
+ 0.547 |
+ 0.556 |
+ 0.178 |
+ 0.291 |
+ 0.372 |
+ 0.461 |
+ 0.473 |
+ 0.429 |
+ 0.485 |
+ 0.501 |
+ 0.504 |
+ 0.494 |
+ 0.528 |
+ 0.532 |
+ 0.510 |
+ 0.540 |
+ 0.479 |
+ 0.515 |
+ 0.557 |
+ 0.558 |
+ 0.550 |
+ 0.338 |
+ 0.496 |
+ 0.510 |
+ 0.505 |
+ 0.525 |
+ 0.534 |
+ 0.528 |
+ 0.552 |
+ 0.558 |
+ 0.552 |
+ 0.495 |
+ -0.104 |
+ 0.485 |
+ 0.533 |
+ 0.105 |
+ 0.352 |
+ 0.096 |
+ 0.040 |
+ 0.433 |
+ 0.416 |
+ 0.417 |
+ 0.446 |
+ 0.442 |
+ 0.440 |
+ 0.451 |
+ 0.460 |
+ 0.436 |
+ 0.450 |
+ 0.458 |
+ 0.445 |
+ 4323 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HXK4_HUMAN_Gersing_2022_activity |
+ 0.489 |
+ 0.516 |
+ 0.477 |
+ 0.490 |
+ 0.484 |
+ 0.492 |
+ 0.229 |
+ 0.387 |
+ 0.511 |
+ 0.527 |
+ 0.446 |
+ 0.464 |
+ 0.491 |
+ 0.183 |
+ 0.292 |
+ 0.510 |
+ 0.517 |
+ 0.508 |
+ 0.476 |
+ 0.051 |
+ 0.453 |
+ 0.449 |
+ 0.442 |
+ 0.421 |
+ 0.449 |
+ 0.465 |
+ 0.452 |
+ 0.448 |
+ 0.402 |
+ 0.503 |
+ 0.492 |
+ 0.450 |
+ 0.179 |
+ 0.472 |
+ 0.426 |
+ 0.370 |
+ 0.503 |
+ 0.482 |
+ 0.446 |
+ 0.513 |
+ 0.508 |
+ 0.493 |
+ 0.316 |
+ 0.032 |
+ 0.501 |
+ 0.478 |
+ 0.356 |
+ 0.439 |
+ 0.394 |
+ 0.168 |
+ 0.507 |
+ 0.499 |
+ 0.488 |
+ 0.497 |
+ 0.498 |
+ 0.509 |
+ 0.502 |
+ 0.511 |
+ 0.508 |
+ 0.513 |
+ 0.520 |
+ 0.481 |
+ 8570 |
+ OrganismalFitness |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ HXK4_HUMAN_Gersing_2023_abundance |
+ 0.322 |
+ 0.362 |
+ 0.325 |
+ 0.351 |
+ 0.341 |
+ 0.348 |
+ 0.057 |
+ 0.329 |
+ 0.386 |
+ 0.407 |
+ 0.338 |
+ 0.331 |
+ 0.354 |
+ 0.136 |
+ 0.185 |
+ 0.328 |
+ 0.367 |
+ 0.376 |
+ 0.358 |
+ 0.401 |
+ 0.282 |
+ 0.290 |
+ 0.314 |
+ 0.316 |
+ 0.285 |
+ 0.307 |
+ 0.316 |
+ 0.315 |
+ 0.320 |
+ 0.374 |
+ 0.295 |
+ 0.253 |
+ 0.170 |
+ 0.305 |
+ 0.304 |
+ 0.317 |
+ 0.340 |
+ 0.349 |
+ 0.359 |
+ 0.348 |
+ 0.362 |
+ 0.369 |
+ 0.168 |
+ 0.071 |
+ 0.351 |
+ 0.336 |
+ 0.399 |
+ 0.382 |
+ 0.391 |
+ 0.160 |
+ 0.359 |
+ 0.374 |
+ 0.371 |
+ 0.368 |
+ 0.379 |
+ 0.376 |
+ 0.375 |
+ 0.368 |
+ 0.371 |
+ 0.378 |
+ 0.424 |
+ 0.329 |
+ 8396 |
+ Expression |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ I6TAH8_I68A0_Doud_2015 |
+ 0.347 |
+ 0.317 |
+ 0.268 |
+ 0.264 |
+ 0.364 |
+ 0.361 |
+ -0.005 |
+ 0.297 |
+ 0.299 |
+ 0.336 |
+ 0.013 |
+ 0.018 |
+ 0.014 |
+ 0.018 |
+ 0.025 |
+ 0.010 |
+ 0.020 |
+ 0.013 |
+ 0.094 |
+ 0.212 |
+ 0.308 |
+ 0.328 |
+ 0.374 |
+ 0.377 |
+ 0.004 |
+ 0.011 |
+ 0.097 |
+ 0.002 |
+ 0.302 |
+ 0.368 |
+ 0.250 |
+ 0.253 |
+ 0.171 |
+ 0.304 |
+ 0.332 |
+ 0.337 |
+ 0.329 |
+ 0.354 |
+ 0.348 |
+ 0.383 |
+ 0.399 |
+ 0.401 |
+ 0.007 |
+ 0.020 |
+ 0.016 |
+ 0.019 |
+ 0.209 |
+ 0.209 |
+ 0.231 |
+ 0.100 |
+ 0.075 |
+ 0.114 |
+ 0.126 |
+ 0.135 |
+ 0.111 |
+ 0.106 |
+ 0.089 |
+ 0.098 |
+ 0.067 |
+ 0.109 |
+ 0.056 |
+ 0.018 |
+ 9462 |
+ OrganismalFitness |
+ I6TAH8_I68A0 |
+ Medium |
+ Virus |
+
+
+ IF1_ECOLI_Kelsic_2016 |
+ 0.328 |
+ 0.499 |
+ 0.541 |
+ 0.539 |
+ 0.527 |
+ 0.537 |
+ 0.177 |
+ 0.387 |
+ 0.261 |
+ 0.255 |
+ 0.534 |
+ 0.540 |
+ 0.565 |
+ 0.193 |
+ 0.477 |
+ 0.550 |
+ 0.599 |
+ 0.556 |
+ 0.515 |
+ 0.490 |
+ 0.361 |
+ 0.438 |
+ 0.366 |
+ 0.411 |
+ 0.451 |
+ 0.458 |
+ 0.482 |
+ 0.443 |
+ 0.459 |
+ 0.401 |
+ 0.520 |
+ 0.463 |
+ 0.238 |
+ 0.458 |
+ 0.477 |
+ 0.527 |
+ 0.463 |
+ 0.470 |
+ 0.495 |
+ 0.529 |
+ 0.532 |
+ 0.539 |
+ 0.371 |
+ 0.150 |
+ 0.562 |
+ 0.516 |
+ 0.321 |
+ 0.518 |
+ 0.475 |
+ 0.244 |
+ 0.545 |
+ 0.534 |
+ 0.555 |
+ 0.575 |
+ 0.577 |
+ 0.559 |
+ 0.551 |
+ 0.571 |
+ 0.560 |
+ 0.583 |
+ 0.616 |
+ 0.506 |
+ 1367 |
+ OrganismalFitness |
+ IF1_ECOLI |
+ High |
+ Prokaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33 |
+ 0.233 |
+ 0.336 |
+ 0.444 |
+ 0.438 |
+ 0.437 |
+ 0.436 |
+ 0.022 |
+ 0.315 |
+ 0.371 |
+ 0.416 |
+ 0.353 |
+ 0.352 |
+ 0.350 |
+ 0.192 |
+ 0.324 |
+ 0.435 |
+ 0.318 |
+ 0.282 |
+ 0.167 |
+ 0.480 |
+ 0.206 |
+ 0.214 |
+ 0.257 |
+ 0.312 |
+ 0.347 |
+ 0.233 |
+ 0.372 |
+ 0.255 |
+ 0.259 |
+ 0.457 |
+ 0.349 |
+ 0.313 |
+ 0.097 |
+ 0.127 |
+ 0.282 |
+ 0.362 |
+ 0.282 |
+ 0.349 |
+ 0.401 |
+ 0.382 |
+ 0.389 |
+ 0.445 |
+ 0.279 |
+ 0.045 |
+ 0.339 |
+ 0.395 |
+ 0.418 |
+ 0.368 |
+ 0.475 |
+ 0.342 |
+ 0.304 |
+ 0.274 |
+ 0.343 |
+ 0.330 |
+ 0.370 |
+ 0.395 |
+ 0.330 |
+ 0.312 |
+ 0.414 |
+ 0.370 |
+ 0.319 |
+ 0.349 |
+ 1329 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ ISDH_STAAW_Tsuboyama_2023_2LHR |
+ 0.094 |
+ 0.101 |
+ 0.206 |
+ 0.197 |
+ 0.231 |
+ 0.231 |
+ 0.326 |
+ 0.128 |
+ 0.326 |
+ 0.356 |
+ 0.453 |
+ 0.413 |
+ 0.434 |
+ 0.377 |
+ 0.394 |
+ 0.449 |
+ 0.429 |
+ 0.479 |
+ 0.450 |
+ 0.185 |
+ 0.326 |
+ 0.199 |
+ 0.218 |
+ 0.228 |
+ 0.318 |
+ 0.335 |
+ 0.329 |
+ 0.298 |
+ 0.360 |
+ 0.304 |
+ 0.396 |
+ 0.334 |
+ 0.236 |
+ 0.291 |
+ 0.249 |
+ 0.298 |
+ 0.259 |
+ 0.219 |
+ 0.245 |
+ 0.263 |
+ 0.221 |
+ 0.244 |
+ 0.300 |
+ 0.248 |
+ 0.349 |
+ 0.329 |
+ 0.592 |
+ 0.526 |
+ 0.519 |
+ 0.490 |
+ 0.420 |
+ 0.407 |
+ 0.429 |
+ 0.428 |
+ 0.427 |
+ 0.429 |
+ 0.409 |
+ 0.414 |
+ 0.419 |
+ 0.426 |
+ 0.608 |
+ 0.522 |
+ 1944 |
+ Stability |
+ ISDH_STAAW |
+ High |
+ Prokaryote |
+
+
+ KCNE1_HUMAN_Muhammad_2023_expression |
+ 0.130 |
+ 0.116 |
+ 0.093 |
+ 0.128 |
+ 0.124 |
+ 0.103 |
+ 0.209 |
+ 0.024 |
+ 0.041 |
+ 0.053 |
+ 0.187 |
+ 0.159 |
+ 0.134 |
+ 0.163 |
+ 0.177 |
+ 0.129 |
+ 0.106 |
+ 0.062 |
+ 0.073 |
+ 0.116 |
+ 0.181 |
+ 0.183 |
+ 0.092 |
+ 0.039 |
+ 0.191 |
+ 0.111 |
+ 0.108 |
+ 0.055 |
+ 0.046 |
+ 0.096 |
+ 0.089 |
+ 0.057 |
+ -0.170 |
+ 0.147 |
+ 0.225 |
+ 0.097 |
+ 0.138 |
+ 0.152 |
+ 0.115 |
+ 0.138 |
+ 0.141 |
+ 0.121 |
+ 0.199 |
+ 0.166 |
+ 0.093 |
+ 0.194 |
+ 0.179 |
+ 0.116 |
+ 0.139 |
+ 0.086 |
+ 0.088 |
+ 0.087 |
+ 0.091 |
+ 0.071 |
+ 0.067 |
+ 0.065 |
+ 0.105 |
+ 0.100 |
+ 0.082 |
+ 0.086 |
+ 0.110 |
+ 0.193 |
+ 2339 |
+ Expression |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNE1_HUMAN_Muhammad_2023_function |
+ 0.417 |
+ 0.498 |
+ 0.479 |
+ 0.496 |
+ 0.479 |
+ 0.514 |
+ 0.215 |
+ 0.520 |
+ 0.514 |
+ 0.520 |
+ 0.368 |
+ 0.474 |
+ 0.584 |
+ 0.209 |
+ 0.162 |
+ 0.599 |
+ 0.525 |
+ 0.465 |
+ 0.380 |
+ 0.241 |
+ 0.272 |
+ 0.293 |
+ 0.448 |
+ 0.539 |
+ 0.285 |
+ 0.614 |
+ 0.580 |
+ 0.603 |
+ 0.575 |
+ 0.488 |
+ 0.568 |
+ 0.475 |
+ -0.034 |
+ 0.189 |
+ 0.376 |
+ 0.616 |
+ 0.464 |
+ 0.500 |
+ 0.618 |
+ 0.524 |
+ 0.551 |
+ 0.634 |
+ 0.212 |
+ 0.208 |
+ 0.528 |
+ 0.198 |
+ 0.202 |
+ 0.425 |
+ 0.182 |
+ 0.093 |
+ 0.540 |
+ 0.540 |
+ 0.523 |
+ 0.528 |
+ 0.537 |
+ 0.513 |
+ 0.548 |
+ 0.545 |
+ 0.548 |
+ 0.554 |
+ 0.642 |
+ 0.238 |
+ 2315 |
+ Activity |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNH2_HUMAN_Kozek_2020 |
+ 0.449 |
+ 0.421 |
+ 0.298 |
+ 0.292 |
+ 0.211 |
+ 0.210 |
+ 0.454 |
+ 0.151 |
+ 0.303 |
+ 0.306 |
+ 0.307 |
+ 0.216 |
+ 0.234 |
+ 0.253 |
+ 0.197 |
+ 0.229 |
+ 0.280 |
+ 0.258 |
+ 0.248 |
+ 0.382 |
+ 0.468 |
+ 0.513 |
+ 0.495 |
+ 0.463 |
+ 0.502 |
+ 0.491 |
+ 0.455 |
+ 0.477 |
+ 0.481 |
+ 0.460 |
+ 0.433 |
+ 0.318 |
+ 0.344 |
+ 0.471 |
+ 0.525 |
+ 0.511 |
+ 0.499 |
+ 0.554 |
+ 0.538 |
+ 0.472 |
+ 0.517 |
+ 0.509 |
+ 0.394 |
+ 0.223 |
+ 0.193 |
+ 0.291 |
+ 0.219 |
+ -0.079 |
+ -0.202 |
+ 0.086 |
+ 0.530 |
+ 0.390 |
+ 0.481 |
+ 0.435 |
+ 0.444 |
+ 0.442 |
+ 0.476 |
+ 0.441 |
+ 0.435 |
+ 0.475 |
+ 0.271 |
+ 0.411 |
+ 200 |
+ Activity |
+ KCNH2_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_function |
+ 0.256 |
+ 0.329 |
+ 0.334 |
+ 0.344 |
+ 0.347 |
+ 0.353 |
+ 0.059 |
+ 0.238 |
+ 0.315 |
+ 0.309 |
+ 0.370 |
+ 0.348 |
+ 0.361 |
+ 0.058 |
+ 0.289 |
+ 0.379 |
+ 0.382 |
+ 0.385 |
+ 0.388 |
+ 0.161 |
+ 0.345 |
+ 0.305 |
+ 0.252 |
+ 0.241 |
+ 0.351 |
+ 0.335 |
+ 0.316 |
+ 0.345 |
+ 0.224 |
+ 0.372 |
+ 0.366 |
+ 0.352 |
+ 0.148 |
+ 0.327 |
+ 0.294 |
+ 0.235 |
+ 0.350 |
+ 0.329 |
+ 0.289 |
+ 0.364 |
+ 0.354 |
+ 0.335 |
+ 0.213 |
+ -0.022 |
+ 0.328 |
+ 0.343 |
+ 0.189 |
+ 0.286 |
+ 0.227 |
+ 0.084 |
+ 0.374 |
+ 0.355 |
+ 0.365 |
+ 0.365 |
+ 0.369 |
+ 0.361 |
+ 0.371 |
+ 0.365 |
+ 0.365 |
+ 0.373 |
+ 0.371 |
+ 0.266 |
+ 6963 |
+ Activity |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_surface |
+ 0.381 |
+ 0.341 |
+ 0.254 |
+ 0.265 |
+ 0.338 |
+ 0.340 |
+ 0.119 |
+ 0.224 |
+ 0.280 |
+ 0.277 |
+ 0.323 |
+ 0.287 |
+ 0.297 |
+ 0.176 |
+ 0.360 |
+ 0.370 |
+ 0.352 |
+ 0.343 |
+ 0.354 |
+ 0.218 |
+ 0.239 |
+ 0.228 |
+ 0.217 |
+ 0.245 |
+ 0.265 |
+ 0.265 |
+ 0.260 |
+ 0.273 |
+ 0.251 |
+ 0.368 |
+ 0.284 |
+ 0.255 |
+ 0.174 |
+ 0.243 |
+ 0.252 |
+ 0.235 |
+ 0.324 |
+ 0.331 |
+ 0.330 |
+ 0.328 |
+ 0.338 |
+ 0.340 |
+ 0.266 |
+ 0.065 |
+ 0.263 |
+ 0.309 |
+ 0.305 |
+ 0.262 |
+ 0.306 |
+ 0.167 |
+ 0.322 |
+ 0.311 |
+ 0.330 |
+ 0.320 |
+ 0.339 |
+ 0.321 |
+ 0.343 |
+ 0.321 |
+ 0.317 |
+ 0.334 |
+ 0.351 |
+ 0.349 |
+ 6917 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KKA2_KLEPN_Melnikov_2014 |
+ 0.250 |
+ 0.530 |
+ 0.437 |
+ 0.622 |
+ 0.599 |
+ 0.603 |
+ 0.199 |
+ 0.423 |
+ 0.538 |
+ 0.606 |
+ 0.566 |
+ 0.597 |
+ 0.621 |
+ 0.212 |
+ 0.261 |
+ 0.487 |
+ 0.601 |
+ 0.645 |
+ 0.662 |
+ 0.498 |
+ 0.285 |
+ 0.420 |
+ 0.518 |
+ 0.540 |
+ 0.353 |
+ 0.578 |
+ 0.577 |
+ 0.568 |
+ 0.638 |
+ 0.627 |
+ 0.637 |
+ 0.586 |
+ 0.148 |
+ 0.243 |
+ 0.485 |
+ 0.586 |
+ 0.445 |
+ 0.524 |
+ 0.588 |
+ 0.586 |
+ 0.609 |
+ 0.630 |
+ 0.253 |
+ 0.076 |
+ 0.559 |
+ 0.386 |
+ 0.456 |
+ 0.582 |
+ 0.551 |
+ 0.267 |
+ 0.587 |
+ 0.572 |
+ 0.588 |
+ 0.578 |
+ 0.579 |
+ 0.590 |
+ 0.574 |
+ 0.583 |
+ 0.591 |
+ 0.599 |
+ 0.641 |
+ 0.329 |
+ 4960 |
+ OrganismalFitness |
+ KKA2_KLEPN |
+ High |
+ Prokaryote |
+
+
+ LGK_LIPST_Klesmith_2015 |
+ 0.280 |
+ 0.426 |
+ 0.485 |
+ 0.494 |
+ 0.481 |
+ 0.498 |
+ 0.104 |
+ 0.465 |
+ 0.467 |
+ 0.559 |
+ 0.444 |
+ 0.537 |
+ 0.564 |
+ 0.179 |
+ 0.327 |
+ 0.382 |
+ 0.515 |
+ 0.580 |
+ 0.578 |
+ 0.499 |
+ 0.276 |
+ 0.396 |
+ 0.462 |
+ 0.495 |
+ 0.365 |
+ 0.492 |
+ 0.525 |
+ 0.479 |
+ 0.552 |
+ 0.542 |
+ 0.537 |
+ 0.460 |
+ 0.139 |
+ 0.307 |
+ 0.398 |
+ 0.555 |
+ 0.370 |
+ 0.404 |
+ 0.527 |
+ 0.482 |
+ 0.495 |
+ 0.540 |
+ 0.176 |
+ 0.051 |
+ 0.483 |
+ 0.367 |
+ 0.398 |
+ 0.507 |
+ 0.492 |
+ 0.145 |
+ 0.510 |
+ 0.505 |
+ 0.511 |
+ 0.510 |
+ 0.513 |
+ 0.520 |
+ 0.518 |
+ 0.517 |
+ 0.520 |
+ 0.527 |
+ 0.411 |
+ 0.334 |
+ 7890 |
+ Activity |
+ LGK_LIPST |
+ Medium |
+ Eukaryote |
+
+
+ LYAM1_HUMAN_Elazar_2016 |
+ 0.314 |
+ 0.303 |
+ 0.189 |
+ 0.158 |
+ 0.258 |
+ 0.266 |
+ 0.361 |
+ 0.172 |
+ 0.413 |
+ 0.416 |
+ 0.436 |
+ 0.370 |
+ 0.371 |
+ 0.307 |
+ 0.363 |
+ 0.336 |
+ 0.310 |
+ 0.426 |
+ 0.454 |
+ 0.336 |
+ 0.382 |
+ 0.391 |
+ 0.386 |
+ 0.385 |
+ 0.362 |
+ 0.369 |
+ 0.373 |
+ 0.298 |
+ 0.339 |
+ 0.349 |
+ 0.313 |
+ 0.179 |
+ 0.203 |
+ 0.353 |
+ 0.437 |
+ 0.343 |
+ 0.362 |
+ 0.410 |
+ 0.348 |
+ 0.343 |
+ 0.372 |
+ 0.313 |
+ 0.307 |
+ 0.346 |
+ 0.318 |
+ 0.350 |
+ 0.170 |
+ 0.259 |
+ 0.163 |
+ 0.103 |
+ 0.235 |
+ 0.317 |
+ 0.398 |
+ 0.330 |
+ 0.279 |
+ 0.295 |
+ 0.352 |
+ 0.348 |
+ 0.272 |
+ 0.332 |
+ 0.419 |
+ 0.390 |
+ 359 |
+ Expression |
+ LYAM1_HUMAN |
+ Medium |
+ Human |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V |
+ 0.613 |
+ 0.636 |
+ 0.633 |
+ 0.635 |
+ 0.621 |
+ 0.623 |
+ 0.405 |
+ 0.643 |
+ 0.633 |
+ 0.626 |
+ 0.619 |
+ 0.355 |
+ 0.600 |
+ 0.453 |
+ 0.429 |
+ 0.471 |
+ 0.480 |
+ 0.551 |
+ 0.413 |
+ 0.627 |
+ 0.448 |
+ 0.494 |
+ 0.522 |
+ 0.566 |
+ 0.460 |
+ 0.515 |
+ 0.447 |
+ 0.536 |
+ 0.490 |
+ 0.602 |
+ 0.621 |
+ 0.609 |
+ 0.392 |
+ 0.329 |
+ 0.565 |
+ 0.509 |
+ 0.589 |
+ 0.647 |
+ 0.657 |
+ 0.629 |
+ 0.661 |
+ 0.683 |
+ 0.312 |
+ 0.079 |
+ 0.425 |
+ 0.246 |
+ 0.596 |
+ 0.327 |
+ 0.638 |
+ 0.611 |
+ 0.646 |
+ 0.615 |
+ 0.617 |
+ 0.638 |
+ 0.623 |
+ 0.638 |
+ 0.636 |
+ 0.642 |
+ 0.651 |
+ 0.639 |
+ 0.719 |
+ 0.727 |
+ 1429 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV |
+ 0.551 |
+ 0.723 |
+ 0.708 |
+ 0.727 |
+ 0.758 |
+ 0.763 |
+ -0.058 |
+ 0.572 |
+ 0.658 |
+ 0.744 |
+ 0.727 |
+ 0.422 |
+ 0.512 |
+ -0.146 |
+ 0.025 |
+ 0.742 |
+ 0.699 |
+ 0.685 |
+ 0.740 |
+ 0.715 |
+ -0.032 |
+ 0.058 |
+ 0.576 |
+ 0.615 |
+ 0.193 |
+ 0.561 |
+ 0.537 |
+ 0.568 |
+ 0.714 |
+ 0.748 |
+ 0.729 |
+ 0.670 |
+ 0.649 |
+ -0.282 |
+ 0.063 |
+ 0.033 |
+ 0.586 |
+ 0.658 |
+ 0.634 |
+ 0.729 |
+ 0.751 |
+ 0.742 |
+ -0.064 |
+ -0.203 |
+ 0.629 |
+ 0.433 |
+ 0.521 |
+ 0.660 |
+ 0.773 |
+ 0.576 |
+ 0.793 |
+ 0.760 |
+ 0.792 |
+ 0.745 |
+ 0.758 |
+ 0.741 |
+ 0.763 |
+ 0.776 |
+ 0.774 |
+ 0.780 |
+ 0.813 |
+ 0.720 |
+ 2116 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MET_HUMAN_Estevam_2023 |
+ 0.454 |
+ 0.544 |
+ 0.538 |
+ 0.563 |
+ 0.582 |
+ 0.584 |
+ 0.517 |
+ 0.520 |
+ 0.550 |
+ 0.562 |
+ 0.585 |
+ 0.550 |
+ 0.562 |
+ 0.452 |
+ 0.515 |
+ 0.542 |
+ 0.590 |
+ 0.596 |
+ 0.599 |
+ 0.583 |
+ 0.526 |
+ 0.506 |
+ 0.466 |
+ 0.492 |
+ 0.530 |
+ 0.519 |
+ 0.519 |
+ 0.488 |
+ 0.454 |
+ 0.548 |
+ 0.570 |
+ 0.530 |
+ 0.286 |
+ 0.471 |
+ 0.501 |
+ 0.525 |
+ 0.512 |
+ 0.539 |
+ 0.563 |
+ 0.558 |
+ 0.565 |
+ 0.573 |
+ 0.542 |
+ 0.347 |
+ 0.571 |
+ 0.575 |
+ 0.350 |
+ 0.420 |
+ 0.498 |
+ 0.192 |
+ 0.540 |
+ 0.548 |
+ 0.548 |
+ 0.550 |
+ 0.562 |
+ 0.554 |
+ 0.557 |
+ 0.561 |
+ 0.556 |
+ 0.566 |
+ 0.575 |
+ 0.529 |
+ 5393 |
+ Activity |
+ MET_HUMAN |
+ Medium |
+ Human |
+
+
+ MK01_HUMAN_Brenan_2016 |
+ 0.176 |
+ 0.198 |
+ 0.237 |
+ 0.241 |
+ 0.223 |
+ 0.227 |
+ 0.209 |
+ 0.163 |
+ 0.198 |
+ 0.184 |
+ 0.033 |
+ 0.167 |
+ 0.182 |
+ 0.170 |
+ 0.199 |
+ 0.195 |
+ 0.178 |
+ 0.189 |
+ 0.141 |
+ 0.184 |
+ 0.209 |
+ 0.117 |
+ 0.069 |
+ 0.016 |
+ 0.183 |
+ 0.089 |
+ 0.057 |
+ 0.076 |
+ -0.067 |
+ 0.218 |
+ 0.176 |
+ 0.222 |
+ 0.106 |
+ 0.169 |
+ 0.058 |
+ 0.004 |
+ 0.193 |
+ 0.119 |
+ 0.093 |
+ 0.227 |
+ 0.205 |
+ 0.202 |
+ 0.196 |
+ 0.125 |
+ 0.135 |
+ 0.184 |
+ 0.059 |
+ -0.000 |
+ 0.133 |
+ 0.006 |
+ 0.161 |
+ 0.162 |
+ 0.148 |
+ 0.166 |
+ 0.170 |
+ 0.163 |
+ 0.163 |
+ 0.169 |
+ 0.177 |
+ 0.171 |
+ 0.150 |
+ 0.186 |
+ 6809 |
+ OrganismalFitness |
+ MK01_HUMAN |
+ Medium |
+ Human |
+
+
+ MLAC_ECOLI_MacRae_2023 |
+ 0.225 |
+ 0.352 |
+ 0.419 |
+ 0.424 |
+ 0.409 |
+ 0.411 |
+ -0.014 |
+ 0.360 |
+ 0.412 |
+ 0.419 |
+ 0.399 |
+ 0.416 |
+ 0.432 |
+ -0.011 |
+ 0.236 |
+ 0.273 |
+ 0.382 |
+ 0.382 |
+ 0.412 |
+ 0.387 |
+ 0.078 |
+ 0.366 |
+ 0.380 |
+ 0.402 |
+ 0.313 |
+ 0.383 |
+ 0.392 |
+ 0.382 |
+ 0.424 |
+ 0.400 |
+ 0.440 |
+ 0.410 |
+ 0.039 |
+ 0.131 |
+ 0.315 |
+ 0.296 |
+ 0.265 |
+ 0.327 |
+ 0.315 |
+ 0.374 |
+ 0.396 |
+ 0.395 |
+ 0.177 |
+ -0.036 |
+ 0.329 |
+ 0.235 |
+ 0.006 |
+ 0.193 |
+ 0.123 |
+ -0.026 |
+ 0.319 |
+ 0.280 |
+ 0.288 |
+ 0.293 |
+ 0.315 |
+ 0.308 |
+ 0.295 |
+ 0.305 |
+ 0.332 |
+ 0.313 |
+ 0.230 |
+ 0.203 |
+ 4007 |
+ OrganismalFitness |
+ MLAC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ MSH2_HUMAN_Jia_2020 |
+ 0.352 |
+ 0.399 |
+ 0.367 |
+ 0.376 |
+ 0.383 |
+ 0.390 |
+ 0.204 |
+ 0.339 |
+ 0.395 |
+ 0.402 |
+ 0.349 |
+ 0.380 |
+ 0.400 |
+ 0.213 |
+ 0.349 |
+ 0.397 |
+ 0.339 |
+ 0.296 |
+ 0.204 |
+ 0.364 |
+ 0.296 |
+ 0.310 |
+ 0.264 |
+ 0.257 |
+ 0.326 |
+ 0.317 |
+ 0.320 |
+ 0.326 |
+ 0.301 |
+ 0.394 |
+ 0.351 |
+ 0.324 |
+ 0.208 |
+ 0.277 |
+ 0.351 |
+ 0.281 |
+ 0.350 |
+ 0.388 |
+ 0.346 |
+ 0.387 |
+ 0.405 |
+ 0.383 |
+ 0.284 |
+ 0.103 |
+ 0.400 |
+ 0.355 |
+ 0.293 |
+ 0.363 |
+ 0.062 |
+ 0.099 |
+ 0.305 |
+ 0.279 |
+ 0.291 |
+ 0.309 |
+ 0.317 |
+ 0.300 |
+ 0.295 |
+ 0.308 |
+ 0.339 |
+ 0.321 |
+ 0.394 |
+ 0.366 |
+ 16749 |
+ OrganismalFitness |
+ MSH2_HUMAN |
+ Medium |
+ Human |
+
+
+ MTH3_HAEAE_RockahShmuel_2015 |
+ 0.371 |
+ 0.612 |
+ 0.709 |
+ 0.718 |
+ 0.696 |
+ 0.704 |
+ 0.314 |
+ 0.644 |
+ 0.673 |
+ 0.688 |
+ 0.581 |
+ 0.692 |
+ 0.708 |
+ 0.190 |
+ 0.352 |
+ 0.405 |
+ 0.527 |
+ 0.608 |
+ 0.644 |
+ 0.658 |
+ 0.311 |
+ 0.454 |
+ 0.600 |
+ 0.662 |
+ 0.488 |
+ 0.663 |
+ 0.711 |
+ 0.684 |
+ 0.727 |
+ 0.679 |
+ 0.697 |
+ 0.652 |
+ 0.291 |
+ 0.353 |
+ 0.495 |
+ 0.665 |
+ 0.422 |
+ 0.496 |
+ 0.638 |
+ 0.657 |
+ 0.679 |
+ 0.714 |
+ 0.324 |
+ -0.031 |
+ 0.477 |
+ 0.341 |
+ 0.449 |
+ 0.574 |
+ 0.583 |
+ 0.225 |
+ 0.544 |
+ 0.531 |
+ 0.559 |
+ 0.569 |
+ 0.561 |
+ 0.573 |
+ 0.551 |
+ 0.571 |
+ 0.557 |
+ 0.572 |
+ 0.580 |
+ 0.378 |
+ 1777 |
+ OrganismalFitness |
+ MTH3_HAEAE |
+ Medium |
+ Prokaryote |
+
+
+ MTHR_HUMAN_Weile_2021 |
+ 0.250 |
+ 0.271 |
+ 0.249 |
+ 0.257 |
+ 0.258 |
+ 0.257 |
+ 0.205 |
+ 0.194 |
+ 0.292 |
+ 0.293 |
+ 0.387 |
+ 0.299 |
+ 0.320 |
+ 0.169 |
+ 0.414 |
+ 0.482 |
+ 0.348 |
+ 0.276 |
+ 0.285 |
+ 0.253 |
+ 0.443 |
+ 0.162 |
+ 0.217 |
+ 0.265 |
+ 0.437 |
+ 0.239 |
+ 0.264 |
+ 0.245 |
+ 0.295 |
+ 0.296 |
+ 0.220 |
+ 0.189 |
+ 0.143 |
+ 0.451 |
+ 0.307 |
+ 0.212 |
+ 0.378 |
+ 0.315 |
+ 0.254 |
+ 0.363 |
+ 0.309 |
+ 0.269 |
+ 0.278 |
+ 0.111 |
+ 0.379 |
+ 0.411 |
+ 0.239 |
+ 0.338 |
+ 0.377 |
+ 0.092 |
+ 0.324 |
+ 0.289 |
+ 0.299 |
+ 0.297 |
+ 0.305 |
+ 0.316 |
+ 0.335 |
+ 0.319 |
+ 0.347 |
+ 0.325 |
+ 0.332 |
+ 0.376 |
+ 12464 |
+ OrganismalFitness |
+ MTHR_HUMAN |
+ Low |
+ Human |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT |
+ 0.143 |
+ 0.248 |
+ 0.334 |
+ 0.362 |
+ 0.381 |
+ 0.412 |
+ 0.155 |
+ 0.251 |
+ 0.428 |
+ 0.419 |
+ 0.519 |
+ 0.482 |
+ 0.478 |
+ 0.171 |
+ 0.411 |
+ 0.583 |
+ 0.588 |
+ 0.507 |
+ 0.247 |
+ 0.321 |
+ 0.351 |
+ 0.247 |
+ 0.336 |
+ 0.311 |
+ 0.342 |
+ 0.287 |
+ 0.364 |
+ 0.313 |
+ 0.336 |
+ 0.283 |
+ 0.279 |
+ 0.178 |
+ 0.126 |
+ 0.419 |
+ 0.332 |
+ 0.282 |
+ 0.382 |
+ 0.329 |
+ 0.292 |
+ 0.411 |
+ 0.401 |
+ 0.382 |
+ 0.112 |
+ 0.045 |
+ 0.431 |
+ 0.294 |
+ 0.392 |
+ 0.427 |
+ 0.479 |
+ 0.359 |
+ 0.517 |
+ 0.495 |
+ 0.545 |
+ 0.512 |
+ 0.525 |
+ 0.529 |
+ 0.513 |
+ 0.519 |
+ 0.506 |
+ 0.525 |
+ 0.571 |
+ 0.500 |
+ 3297 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NCAP_I34A1_Doud_2015 |
+ 0.364 |
+ 0.328 |
+ 0.334 |
+ 0.333 |
+ 0.363 |
+ 0.364 |
+ 0.002 |
+ 0.335 |
+ 0.373 |
+ 0.348 |
+ 0.015 |
+ 0.019 |
+ 0.020 |
+ 0.023 |
+ 0.026 |
+ 0.020 |
+ 0.028 |
+ 0.031 |
+ 0.107 |
+ 0.269 |
+ 0.352 |
+ 0.382 |
+ 0.408 |
+ 0.413 |
+ 0.018 |
+ 0.042 |
+ 0.108 |
+ 0.030 |
+ 0.352 |
+ 0.377 |
+ 0.270 |
+ 0.279 |
+ 0.126 |
+ 0.356 |
+ 0.373 |
+ 0.415 |
+ 0.390 |
+ 0.402 |
+ 0.424 |
+ 0.425 |
+ 0.426 |
+ 0.441 |
+ 0.019 |
+ 0.015 |
+ 0.027 |
+ 0.025 |
+ 0.259 |
+ 0.270 |
+ 0.278 |
+ 0.123 |
+ 0.129 |
+ 0.171 |
+ 0.166 |
+ 0.184 |
+ 0.172 |
+ 0.172 |
+ 0.155 |
+ 0.163 |
+ 0.129 |
+ 0.171 |
+ 0.135 |
+ 0.081 |
+ 9462 |
+ OrganismalFitness |
+ NCAP_I34A1 |
+ Medium |
+ Virus |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R |
+ 0.428 |
+ 0.548 |
+ 0.586 |
+ 0.584 |
+ 0.607 |
+ 0.619 |
+ 0.582 |
+ 0.553 |
+ 0.584 |
+ 0.608 |
+ 0.560 |
+ 0.568 |
+ 0.572 |
+ 0.627 |
+ 0.638 |
+ 0.674 |
+ 0.651 |
+ 0.582 |
+ 0.602 |
+ 0.558 |
+ 0.564 |
+ 0.557 |
+ 0.556 |
+ 0.564 |
+ 0.576 |
+ 0.579 |
+ 0.606 |
+ 0.540 |
+ 0.577 |
+ 0.640 |
+ 0.525 |
+ 0.506 |
+ 0.551 |
+ 0.494 |
+ 0.565 |
+ 0.577 |
+ 0.578 |
+ 0.624 |
+ 0.628 |
+ 0.601 |
+ 0.618 |
+ 0.621 |
+ 0.454 |
+ 0.395 |
+ 0.342 |
+ 0.408 |
+ 0.589 |
+ 0.401 |
+ 0.643 |
+ 0.642 |
+ 0.634 |
+ 0.626 |
+ 0.639 |
+ 0.648 |
+ 0.669 |
+ 0.641 |
+ 0.635 |
+ 0.629 |
+ 0.642 |
+ 0.645 |
+ 0.566 |
+ 0.596 |
+ 2482 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_HEK293T |
+ 0.683 |
+ 0.700 |
+ 0.627 |
+ 0.632 |
+ 0.700 |
+ 0.700 |
+ 0.220 |
+ 0.529 |
+ 0.739 |
+ 0.734 |
+ 0.701 |
+ 0.325 |
+ 0.458 |
+ 0.221 |
+ 0.394 |
+ 0.614 |
+ 0.694 |
+ 0.721 |
+ 0.719 |
+ 0.147 |
+ 0.197 |
+ 0.460 |
+ 0.412 |
+ 0.419 |
+ 0.421 |
+ 0.491 |
+ 0.631 |
+ 0.447 |
+ 0.495 |
+ 0.710 |
+ 0.659 |
+ 0.548 |
+ 0.095 |
+ 0.231 |
+ 0.547 |
+ 0.579 |
+ 0.639 |
+ 0.675 |
+ 0.696 |
+ 0.691 |
+ 0.708 |
+ 0.722 |
+ 0.300 |
+ 0.134 |
+ 0.702 |
+ 0.568 |
+ 0.499 |
+ 0.683 |
+ 0.060 |
+ 0.211 |
+ 0.681 |
+ 0.664 |
+ 0.687 |
+ 0.679 |
+ 0.678 |
+ 0.664 |
+ 0.693 |
+ 0.701 |
+ 0.685 |
+ 0.697 |
+ 0.700 |
+ 0.392 |
+ 637 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_RPE1 |
+ 0.779 |
+ 0.688 |
+ 0.534 |
+ 0.554 |
+ 0.716 |
+ 0.731 |
+ 0.417 |
+ 0.535 |
+ 0.584 |
+ 0.707 |
+ 0.704 |
+ 0.298 |
+ 0.357 |
+ 0.314 |
+ 0.394 |
+ 0.629 |
+ 0.661 |
+ 0.640 |
+ 0.709 |
+ 0.405 |
+ 0.631 |
+ 0.490 |
+ 0.543 |
+ 0.417 |
+ 0.408 |
+ 0.468 |
+ 0.536 |
+ 0.373 |
+ 0.531 |
+ 0.633 |
+ 0.740 |
+ 0.568 |
+ 0.475 |
+ 0.466 |
+ 0.377 |
+ 0.612 |
+ 0.787 |
+ 0.672 |
+ 0.774 |
+ 0.783 |
+ 0.700 |
+ 0.751 |
+ 0.506 |
+ 0.442 |
+ 0.786 |
+ 0.391 |
+ 0.472 |
+ 0.557 |
+ -0.107 |
+ 0.157 |
+ 0.574 |
+ 0.537 |
+ 0.559 |
+ 0.482 |
+ 0.569 |
+ 0.560 |
+ 0.571 |
+ 0.646 |
+ 0.640 |
+ 0.595 |
+ 0.692 |
+ 0.482 |
+ 63 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NRAM_I33A0_Jiang_2016 |
+ 0.569 |
+ 0.565 |
+ 0.501 |
+ 0.490 |
+ 0.584 |
+ 0.584 |
+ 0.035 |
+ 0.390 |
+ 0.625 |
+ 0.638 |
+ -0.076 |
+ 0.162 |
+ 0.448 |
+ -0.075 |
+ -0.098 |
+ 0.005 |
+ 0.161 |
+ 0.541 |
+ 0.575 |
+ 0.343 |
+ 0.583 |
+ 0.633 |
+ 0.584 |
+ 0.571 |
+ 0.047 |
+ 0.530 |
+ 0.627 |
+ 0.462 |
+ 0.654 |
+ 0.635 |
+ 0.404 |
+ 0.441 |
+ -0.169 |
+ 0.512 |
+ 0.532 |
+ 0.551 |
+ 0.592 |
+ 0.615 |
+ 0.621 |
+ 0.628 |
+ 0.638 |
+ 0.632 |
+ -0.105 |
+ -0.127 |
+ -0.111 |
+ -0.111 |
+ 0.422 |
+ 0.414 |
+ 0.448 |
+ 0.178 |
+ 0.190 |
+ 0.228 |
+ 0.235 |
+ 0.297 |
+ 0.253 |
+ 0.290 |
+ 0.263 |
+ 0.247 |
+ 0.199 |
+ 0.254 |
+ 0.292 |
+ 0.145 |
+ 298 |
+ OrganismalFitness |
+ NRAM_I33A0 |
+ Low |
+ Virus |
+
+
+ NUD15_HUMAN_Suiter_2020 |
+ 0.271 |
+ 0.453 |
+ 0.564 |
+ 0.596 |
+ 0.591 |
+ 0.594 |
+ -0.005 |
+ 0.389 |
+ 0.602 |
+ 0.665 |
+ 0.603 |
+ 0.600 |
+ 0.645 |
+ 0.299 |
+ 0.419 |
+ 0.460 |
+ 0.526 |
+ 0.583 |
+ 0.599 |
+ 0.501 |
+ 0.301 |
+ 0.454 |
+ 0.546 |
+ 0.518 |
+ 0.412 |
+ 0.579 |
+ 0.577 |
+ 0.549 |
+ 0.542 |
+ 0.587 |
+ 0.623 |
+ 0.553 |
+ 0.147 |
+ 0.360 |
+ 0.423 |
+ 0.575 |
+ 0.433 |
+ 0.456 |
+ 0.604 |
+ 0.587 |
+ 0.586 |
+ 0.635 |
+ 0.386 |
+ 0.015 |
+ 0.603 |
+ 0.429 |
+ 0.483 |
+ 0.581 |
+ 0.534 |
+ 0.317 |
+ 0.550 |
+ 0.521 |
+ 0.544 |
+ 0.568 |
+ 0.556 |
+ 0.571 |
+ 0.567 |
+ 0.559 |
+ 0.556 |
+ 0.573 |
+ 0.679 |
+ 0.564 |
+ 2844 |
+ Expression |
+ NUD15_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL |
+ 0.392 |
+ 0.589 |
+ 0.580 |
+ 0.571 |
+ 0.602 |
+ 0.613 |
+ 0.285 |
+ 0.465 |
+ 0.638 |
+ 0.599 |
+ 0.560 |
+ 0.323 |
+ 0.346 |
+ 0.322 |
+ 0.390 |
+ 0.405 |
+ 0.494 |
+ 0.532 |
+ 0.533 |
+ 0.560 |
+ 0.394 |
+ 0.549 |
+ 0.572 |
+ 0.589 |
+ 0.325 |
+ 0.497 |
+ 0.440 |
+ 0.525 |
+ 0.572 |
+ 0.638 |
+ 0.670 |
+ 0.706 |
+ 0.183 |
+ 0.365 |
+ 0.419 |
+ 0.425 |
+ 0.505 |
+ 0.493 |
+ 0.508 |
+ 0.628 |
+ 0.601 |
+ 0.622 |
+ 0.375 |
+ 0.246 |
+ 0.381 |
+ 0.357 |
+ 0.682 |
+ 0.641 |
+ 0.730 |
+ 0.658 |
+ 0.709 |
+ 0.695 |
+ 0.699 |
+ 0.721 |
+ 0.726 |
+ 0.741 |
+ 0.716 |
+ 0.716 |
+ 0.719 |
+ 0.723 |
+ 0.685 |
+ 0.575 |
+ 2028 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6 |
+ 0.405 |
+ 0.404 |
+ 0.417 |
+ 0.431 |
+ 0.451 |
+ 0.434 |
+ 0.230 |
+ 0.423 |
+ 0.485 |
+ 0.461 |
+ 0.398 |
+ 0.361 |
+ 0.421 |
+ 0.389 |
+ 0.561 |
+ 0.510 |
+ 0.518 |
+ 0.525 |
+ 0.477 |
+ 0.521 |
+ 0.264 |
+ 0.339 |
+ 0.315 |
+ 0.347 |
+ 0.380 |
+ 0.385 |
+ 0.461 |
+ 0.334 |
+ 0.378 |
+ 0.502 |
+ 0.394 |
+ 0.387 |
+ 0.321 |
+ 0.269 |
+ 0.275 |
+ 0.372 |
+ 0.395 |
+ 0.399 |
+ 0.426 |
+ 0.413 |
+ 0.417 |
+ 0.424 |
+ 0.445 |
+ 0.144 |
+ 0.395 |
+ 0.531 |
+ 0.676 |
+ 0.560 |
+ 0.759 |
+ 0.640 |
+ 0.487 |
+ 0.457 |
+ 0.454 |
+ 0.507 |
+ 0.487 |
+ 0.485 |
+ 0.485 |
+ 0.486 |
+ 0.483 |
+ 0.488 |
+ 0.410 |
+ 0.588 |
+ 1380 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C |
+ 0.540 |
+ 0.697 |
+ 0.759 |
+ 0.777 |
+ 0.745 |
+ 0.757 |
+ 0.384 |
+ 0.455 |
+ 0.751 |
+ 0.763 |
+ 0.691 |
+ 0.735 |
+ 0.759 |
+ 0.392 |
+ 0.779 |
+ 0.819 |
+ 0.797 |
+ 0.786 |
+ 0.755 |
+ 0.723 |
+ 0.688 |
+ 0.689 |
+ 0.663 |
+ 0.687 |
+ 0.589 |
+ 0.667 |
+ 0.694 |
+ 0.726 |
+ 0.604 |
+ 0.735 |
+ 0.694 |
+ 0.679 |
+ 0.549 |
+ 0.262 |
+ 0.461 |
+ 0.575 |
+ 0.687 |
+ 0.695 |
+ 0.704 |
+ 0.755 |
+ 0.760 |
+ 0.764 |
+ 0.543 |
+ 0.115 |
+ 0.507 |
+ 0.633 |
+ 0.630 |
+ 0.540 |
+ 0.782 |
+ 0.747 |
+ 0.762 |
+ 0.760 |
+ 0.761 |
+ 0.755 |
+ 0.772 |
+ 0.760 |
+ 0.773 |
+ 0.767 |
+ 0.776 |
+ 0.771 |
+ 0.825 |
+ 0.772 |
+ 3197 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G |
+ -0.211 |
+ -0.170 |
+ 0.339 |
+ 0.188 |
+ 0.332 |
+ 0.332 |
+ -0.149 |
+ 0.093 |
+ 0.175 |
+ 0.190 |
+ 0.064 |
+ -0.100 |
+ -0.115 |
+ -0.069 |
+ -0.151 |
+ 0.140 |
+ 0.221 |
+ 0.089 |
+ 0.210 |
+ 0.233 |
+ 0.189 |
+ 0.174 |
+ 0.159 |
+ 0.177 |
+ 0.171 |
+ 0.195 |
+ 0.166 |
+ 0.192 |
+ 0.233 |
+ 0.327 |
+ 0.062 |
+ 0.023 |
+ 0.245 |
+ -0.050 |
+ 0.025 |
+ 0.012 |
+ 0.190 |
+ 0.181 |
+ 0.178 |
+ 0.271 |
+ 0.260 |
+ 0.290 |
+ 0.268 |
+ -0.070 |
+ 0.339 |
+ 0.327 |
+ 0.452 |
+ 0.424 |
+ 0.437 |
+ 0.322 |
+ 0.118 |
+ 0.075 |
+ 0.085 |
+ 0.129 |
+ 0.073 |
+ 0.127 |
+ 0.114 |
+ 0.116 |
+ 0.094 |
+ 0.107 |
+ 0.286 |
+ 0.171 |
+ 1134 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OPSD_HUMAN_Wan_2019 |
+ 0.170 |
+ 0.433 |
+ 0.445 |
+ 0.517 |
+ 0.463 |
+ 0.466 |
+ 0.347 |
+ 0.569 |
+ 0.563 |
+ 0.600 |
+ 0.397 |
+ 0.499 |
+ 0.560 |
+ 0.290 |
+ 0.508 |
+ 0.518 |
+ 0.552 |
+ 0.449 |
+ 0.482 |
+ 0.555 |
+ 0.528 |
+ 0.549 |
+ 0.538 |
+ 0.578 |
+ 0.552 |
+ 0.594 |
+ 0.598 |
+ 0.577 |
+ 0.570 |
+ 0.511 |
+ 0.422 |
+ 0.326 |
+ 0.170 |
+ 0.503 |
+ 0.530 |
+ 0.485 |
+ 0.537 |
+ 0.532 |
+ 0.503 |
+ 0.485 |
+ 0.482 |
+ 0.488 |
+ 0.386 |
+ 0.080 |
+ 0.493 |
+ 0.472 |
+ 0.632 |
+ 0.518 |
+ 0.632 |
+ 0.224 |
+ 0.402 |
+ 0.353 |
+ 0.412 |
+ 0.421 |
+ 0.482 |
+ 0.487 |
+ 0.418 |
+ 0.421 |
+ 0.490 |
+ 0.472 |
+ 0.582 |
+ 0.522 |
+ 165 |
+ Expression |
+ OPSD_HUMAN |
+ High |
+ Human |
+
+
+ OTC_HUMAN_Lo_2023 |
+ 0.524 |
+ 0.564 |
+ 0.484 |
+ 0.522 |
+ 0.550 |
+ 0.552 |
+ 0.127 |
+ 0.404 |
+ 0.590 |
+ 0.592 |
+ 0.546 |
+ 0.573 |
+ 0.581 |
+ 0.142 |
+ 0.411 |
+ 0.512 |
+ 0.531 |
+ 0.510 |
+ 0.534 |
+ 0.565 |
+ 0.453 |
+ 0.461 |
+ 0.522 |
+ 0.545 |
+ 0.476 |
+ 0.555 |
+ 0.526 |
+ 0.540 |
+ 0.574 |
+ 0.599 |
+ 0.475 |
+ 0.409 |
+ 0.085 |
+ 0.429 |
+ 0.511 |
+ 0.569 |
+ 0.522 |
+ 0.570 |
+ 0.614 |
+ 0.567 |
+ 0.584 |
+ 0.599 |
+ 0.279 |
+ 0.092 |
+ 0.524 |
+ 0.463 |
+ 0.649 |
+ 0.606 |
+ 0.674 |
+ 0.364 |
+ 0.526 |
+ 0.547 |
+ 0.544 |
+ 0.574 |
+ 0.556 |
+ 0.567 |
+ 0.560 |
+ 0.548 |
+ 0.541 |
+ 0.564 |
+ 0.616 |
+ 0.490 |
+ 1570 |
+ Activity |
+ OTC_HUMAN |
+ Medium |
+ Human |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D |
+ 0.116 |
+ 0.243 |
+ 0.184 |
+ 0.201 |
+ 0.224 |
+ 0.227 |
+ 0.192 |
+ 0.134 |
+ 0.141 |
+ 0.146 |
+ 0.434 |
+ 0.604 |
+ 0.581 |
+ 0.205 |
+ 0.563 |
+ 0.588 |
+ 0.319 |
+ 0.362 |
+ 0.408 |
+ 0.155 |
+ 0.184 |
+ 0.178 |
+ 0.221 |
+ 0.275 |
+ 0.345 |
+ 0.260 |
+ 0.380 |
+ 0.307 |
+ 0.148 |
+ 0.259 |
+ 0.406 |
+ 0.388 |
+ 0.132 |
+ 0.168 |
+ 0.225 |
+ 0.349 |
+ 0.201 |
+ 0.198 |
+ 0.308 |
+ 0.242 |
+ 0.220 |
+ 0.262 |
+ 0.365 |
+ 0.184 |
+ 0.543 |
+ 0.548 |
+ 0.532 |
+ 0.533 |
+ 0.610 |
+ 0.537 |
+ 0.395 |
+ 0.342 |
+ 0.348 |
+ 0.401 |
+ 0.422 |
+ 0.421 |
+ 0.374 |
+ 0.409 |
+ 0.414 |
+ 0.404 |
+ 0.642 |
+ 0.632 |
+ 635 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ OXDA_RHOTO_Vanella_2023_activity |
+ 0.104 |
+ 0.292 |
+ 0.364 |
+ 0.373 |
+ 0.356 |
+ 0.356 |
+ 0.195 |
+ 0.251 |
+ 0.278 |
+ 0.285 |
+ 0.383 |
+ 0.348 |
+ 0.359 |
+ 0.122 |
+ 0.219 |
+ 0.326 |
+ 0.370 |
+ 0.390 |
+ 0.401 |
+ 0.259 |
+ 0.125 |
+ 0.214 |
+ 0.303 |
+ 0.305 |
+ 0.219 |
+ 0.343 |
+ 0.356 |
+ 0.329 |
+ 0.407 |
+ 0.378 |
+ 0.429 |
+ 0.408 |
+ 0.102 |
+ 0.164 |
+ 0.213 |
+ 0.293 |
+ 0.267 |
+ 0.275 |
+ 0.305 |
+ 0.342 |
+ 0.344 |
+ 0.355 |
+ 0.153 |
+ 0.014 |
+ 0.369 |
+ 0.247 |
+ 0.295 |
+ 0.386 |
+ 0.361 |
+ 0.139 |
+ 0.367 |
+ 0.360 |
+ 0.355 |
+ 0.361 |
+ 0.365 |
+ 0.374 |
+ 0.367 |
+ 0.376 |
+ 0.358 |
+ 0.377 |
+ 0.396 |
+ 0.242 |
+ 6396 |
+ Activity |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ OXDA_RHOTO_Vanella_2023_expression |
+ 0.180 |
+ 0.253 |
+ 0.290 |
+ 0.287 |
+ 0.276 |
+ 0.272 |
+ 0.190 |
+ 0.193 |
+ 0.296 |
+ 0.305 |
+ 0.327 |
+ 0.310 |
+ 0.311 |
+ 0.221 |
+ 0.257 |
+ 0.298 |
+ 0.315 |
+ 0.351 |
+ 0.340 |
+ 0.175 |
+ 0.169 |
+ 0.243 |
+ 0.252 |
+ 0.233 |
+ 0.217 |
+ 0.264 |
+ 0.295 |
+ 0.275 |
+ 0.310 |
+ 0.308 |
+ 0.289 |
+ 0.241 |
+ 0.082 |
+ 0.227 |
+ 0.211 |
+ 0.246 |
+ 0.268 |
+ 0.259 |
+ 0.267 |
+ 0.284 |
+ 0.278 |
+ 0.278 |
+ 0.243 |
+ 0.144 |
+ 0.304 |
+ 0.234 |
+ 0.385 |
+ 0.356 |
+ 0.333 |
+ 0.142 |
+ 0.329 |
+ 0.318 |
+ 0.308 |
+ 0.317 |
+ 0.323 |
+ 0.308 |
+ 0.321 |
+ 0.316 |
+ 0.311 |
+ 0.326 |
+ 0.408 |
+ 0.321 |
+ 6769 |
+ Expression |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Etoposide |
+ 0.415 |
+ 0.424 |
+ 0.329 |
+ 0.344 |
+ 0.432 |
+ 0.427 |
+ -0.088 |
+ 0.319 |
+ 0.283 |
+ 0.291 |
+ 0.467 |
+ 0.457 |
+ 0.509 |
+ -0.154 |
+ -0.146 |
+ 0.314 |
+ 0.432 |
+ 0.469 |
+ 0.497 |
+ 0.116 |
+ 0.312 |
+ 0.438 |
+ 0.453 |
+ 0.409 |
+ 0.390 |
+ 0.472 |
+ 0.484 |
+ 0.493 |
+ 0.368 |
+ 0.423 |
+ 0.465 |
+ 0.410 |
+ 0.208 |
+ 0.258 |
+ 0.449 |
+ 0.355 |
+ 0.403 |
+ 0.462 |
+ 0.400 |
+ 0.427 |
+ 0.469 |
+ 0.413 |
+ -0.151 |
+ -0.155 |
+ 0.475 |
+ -0.051 |
+ 0.389 |
+ 0.471 |
+ 0.414 |
+ 0.186 |
+ 0.449 |
+ 0.450 |
+ 0.451 |
+ 0.450 |
+ 0.452 |
+ 0.470 |
+ 0.448 |
+ 0.454 |
+ 0.452 |
+ 0.464 |
+ 0.477 |
+ 0.186 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Nutlin |
+ 0.350 |
+ 0.395 |
+ 0.294 |
+ 0.304 |
+ 0.383 |
+ 0.380 |
+ -0.087 |
+ 0.269 |
+ 0.287 |
+ 0.289 |
+ 0.482 |
+ 0.447 |
+ 0.498 |
+ -0.158 |
+ -0.145 |
+ 0.290 |
+ 0.404 |
+ 0.431 |
+ 0.459 |
+ 0.126 |
+ 0.294 |
+ 0.447 |
+ 0.463 |
+ 0.430 |
+ 0.375 |
+ 0.487 |
+ 0.500 |
+ 0.523 |
+ 0.329 |
+ 0.420 |
+ 0.450 |
+ 0.375 |
+ 0.213 |
+ 0.239 |
+ 0.453 |
+ 0.360 |
+ 0.354 |
+ 0.431 |
+ 0.379 |
+ 0.372 |
+ 0.430 |
+ 0.383 |
+ -0.144 |
+ -0.153 |
+ 0.471 |
+ -0.063 |
+ 0.403 |
+ 0.480 |
+ 0.426 |
+ 0.188 |
+ 0.423 |
+ 0.427 |
+ 0.427 |
+ 0.433 |
+ 0.431 |
+ 0.440 |
+ 0.426 |
+ 0.430 |
+ 0.434 |
+ 0.440 |
+ 0.469 |
+ 0.182 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_WT_Nutlin |
+ 0.445 |
+ 0.462 |
+ 0.342 |
+ 0.366 |
+ 0.479 |
+ 0.481 |
+ -0.143 |
+ 0.300 |
+ 0.258 |
+ 0.259 |
+ 0.551 |
+ 0.496 |
+ 0.554 |
+ -0.194 |
+ -0.174 |
+ 0.362 |
+ 0.400 |
+ 0.446 |
+ 0.467 |
+ 0.028 |
+ 0.410 |
+ 0.563 |
+ 0.521 |
+ 0.480 |
+ 0.482 |
+ 0.535 |
+ 0.550 |
+ 0.616 |
+ 0.285 |
+ 0.507 |
+ 0.493 |
+ 0.449 |
+ 0.225 |
+ 0.326 |
+ 0.612 |
+ 0.408 |
+ 0.448 |
+ 0.568 |
+ 0.433 |
+ 0.475 |
+ 0.567 |
+ 0.438 |
+ -0.185 |
+ -0.202 |
+ 0.544 |
+ -0.080 |
+ 0.452 |
+ 0.553 |
+ 0.461 |
+ 0.223 |
+ 0.438 |
+ 0.450 |
+ 0.449 |
+ 0.452 |
+ 0.461 |
+ 0.471 |
+ 0.437 |
+ 0.461 |
+ 0.438 |
+ 0.463 |
+ 0.482 |
+ 0.166 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018 |
+ 0.629 |
+ 0.588 |
+ 0.496 |
+ 0.556 |
+ 0.555 |
+ 0.576 |
+ 0.109 |
+ 0.498 |
+ 0.573 |
+ 0.585 |
+ 0.600 |
+ 0.538 |
+ 0.622 |
+ 0.095 |
+ 0.108 |
+ 0.628 |
+ 0.678 |
+ 0.698 |
+ 0.702 |
+ 0.432 |
+ 0.429 |
+ 0.461 |
+ 0.478 |
+ 0.463 |
+ 0.470 |
+ 0.480 |
+ 0.465 |
+ 0.470 |
+ 0.493 |
+ 0.593 |
+ 0.631 |
+ 0.585 |
+ 0.039 |
+ 0.398 |
+ 0.468 |
+ 0.420 |
+ 0.611 |
+ 0.584 |
+ 0.564 |
+ 0.604 |
+ 0.588 |
+ 0.581 |
+ 0.090 |
+ 0.040 |
+ 0.638 |
+ 0.221 |
+ 0.467 |
+ 0.583 |
+ 0.479 |
+ 0.252 |
+ 0.638 |
+ 0.659 |
+ 0.658 |
+ 0.642 |
+ 0.645 |
+ 0.655 |
+ 0.673 |
+ 0.658 |
+ 0.663 |
+ 0.672 |
+ 0.705 |
+ 0.396 |
+ 1048 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P84126_THETH_Chan_2017 |
+ 0.508 |
+ 0.580 |
+ 0.603 |
+ 0.617 |
+ 0.564 |
+ 0.578 |
+ 0.364 |
+ 0.526 |
+ 0.644 |
+ 0.623 |
+ 0.569 |
+ 0.552 |
+ 0.588 |
+ 0.319 |
+ 0.551 |
+ 0.559 |
+ 0.605 |
+ 0.596 |
+ 0.571 |
+ 0.618 |
+ 0.419 |
+ 0.507 |
+ 0.478 |
+ 0.558 |
+ 0.521 |
+ 0.584 |
+ 0.589 |
+ 0.571 |
+ 0.653 |
+ 0.522 |
+ 0.553 |
+ 0.469 |
+ 0.359 |
+ 0.474 |
+ 0.499 |
+ 0.536 |
+ 0.515 |
+ 0.526 |
+ 0.547 |
+ 0.560 |
+ 0.568 |
+ 0.581 |
+ 0.465 |
+ 0.032 |
+ 0.543 |
+ 0.510 |
+ 0.337 |
+ 0.452 |
+ 0.512 |
+ 0.125 |
+ 0.543 |
+ 0.527 |
+ 0.487 |
+ 0.549 |
+ 0.551 |
+ 0.519 |
+ 0.526 |
+ 0.553 |
+ 0.561 |
+ 0.562 |
+ 0.622 |
+ 0.610 |
+ 1519 |
+ OrganismalFitness |
+ P84126_THETH |
+ Medium |
+ Prokaryote |
+
+
+ PA_I34A1_Wu_2015 |
+ 0.518 |
+ 0.519 |
+ 0.499 |
+ 0.508 |
+ 0.539 |
+ 0.543 |
+ 0.041 |
+ 0.358 |
+ 0.152 |
+ 0.165 |
+ 0.037 |
+ 0.054 |
+ 0.101 |
+ 0.024 |
+ 0.029 |
+ 0.025 |
+ 0.038 |
+ 0.041 |
+ 0.356 |
+ 0.325 |
+ 0.456 |
+ 0.493 |
+ 0.533 |
+ 0.538 |
+ 0.219 |
+ 0.408 |
+ 0.444 |
+ 0.429 |
+ 0.438 |
+ 0.584 |
+ 0.384 |
+ 0.374 |
+ 0.107 |
+ 0.436 |
+ 0.475 |
+ 0.541 |
+ 0.546 |
+ 0.561 |
+ 0.572 |
+ 0.588 |
+ 0.592 |
+ 0.584 |
+ 0.028 |
+ 0.019 |
+ 0.031 |
+ 0.020 |
+ 0.244 |
+ 0.232 |
+ 0.172 |
+ 0.095 |
+ 0.154 |
+ 0.173 |
+ 0.184 |
+ 0.187 |
+ 0.168 |
+ 0.178 |
+ 0.158 |
+ 0.173 |
+ 0.125 |
+ 0.181 |
+ 0.202 |
+ 0.132 |
+ 1820 |
+ OrganismalFitness |
+ PA_I34A1 |
+ Medium |
+ Virus |
+
+
+ PABP_YEAST_Melamed_2013 |
+ 0.663 |
+ 0.617 |
+ 0.541 |
+ 0.550 |
+ 0.654 |
+ 0.648 |
+ 0.474 |
+ 0.569 |
+ 0.637 |
+ 0.663 |
+ 0.688 |
+ 0.662 |
+ 0.678 |
+ 0.476 |
+ 0.566 |
+ 0.648 |
+ 0.716 |
+ 0.684 |
+ 0.695 |
+ 0.541 |
+ 0.638 |
+ 0.665 |
+ 0.666 |
+ 0.692 |
+ 0.638 |
+ 0.698 |
+ 0.700 |
+ 0.676 |
+ 0.666 |
+ 0.674 |
+ 0.703 |
+ 0.635 |
+ 0.261 |
+ 0.638 |
+ 0.648 |
+ 0.640 |
+ 0.689 |
+ 0.692 |
+ 0.688 |
+ 0.683 |
+ 0.687 |
+ 0.684 |
+ 0.605 |
+ -0.027 |
+ 0.670 |
+ 0.654 |
+ 0.361 |
+ 0.612 |
+ 0.514 |
+ 0.197 |
+ 0.664 |
+ 0.660 |
+ 0.672 |
+ 0.693 |
+ 0.685 |
+ 0.689 |
+ 0.691 |
+ 0.690 |
+ 0.718 |
+ 0.705 |
+ 0.715 |
+ 0.622 |
+ 37708 |
+ OrganismalFitness |
+ PABP_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ PAI1_HUMAN_Huttinger_2021 |
+ 0.403 |
+ 0.395 |
+ 0.377 |
+ 0.392 |
+ 0.402 |
+ 0.413 |
+ 0.054 |
+ 0.314 |
+ 0.428 |
+ 0.438 |
+ 0.435 |
+ 0.408 |
+ 0.430 |
+ 0.083 |
+ 0.391 |
+ 0.434 |
+ 0.448 |
+ 0.335 |
+ 0.270 |
+ 0.422 |
+ 0.132 |
+ 0.381 |
+ 0.364 |
+ 0.345 |
+ 0.351 |
+ 0.395 |
+ 0.399 |
+ 0.372 |
+ 0.330 |
+ 0.423 |
+ 0.393 |
+ 0.356 |
+ 0.141 |
+ 0.171 |
+ 0.370 |
+ 0.379 |
+ 0.384 |
+ 0.404 |
+ 0.412 |
+ 0.414 |
+ 0.424 |
+ 0.426 |
+ 0.350 |
+ 0.049 |
+ 0.430 |
+ 0.407 |
+ 0.379 |
+ 0.450 |
+ 0.404 |
+ 0.152 |
+ 0.441 |
+ 0.437 |
+ 0.439 |
+ 0.448 |
+ 0.448 |
+ 0.445 |
+ 0.449 |
+ 0.449 |
+ 0.446 |
+ 0.457 |
+ 0.468 |
+ 0.391 |
+ 5345 |
+ Activity |
+ PAI1_HUMAN |
+ NaN |
+ Human |
+
+
+ PHOT_CHLRE_Chen_2023 |
+ 0.211 |
+ 0.440 |
+ 0.731 |
+ 0.706 |
+ 0.352 |
+ 0.335 |
+ 0.654 |
+ 0.552 |
+ 0.711 |
+ 0.721 |
+ 0.647 |
+ 0.757 |
+ 0.783 |
+ 0.768 |
+ 0.823 |
+ 0.714 |
+ 0.749 |
+ 0.714 |
+ 0.758 |
+ 0.652 |
+ 0.685 |
+ 0.678 |
+ 0.596 |
+ 0.699 |
+ 0.533 |
+ 0.616 |
+ 0.596 |
+ 0.656 |
+ 0.602 |
+ 0.587 |
+ 0.572 |
+ 0.490 |
+ 0.303 |
+ 0.565 |
+ 0.511 |
+ 0.602 |
+ 0.597 |
+ 0.564 |
+ 0.606 |
+ 0.416 |
+ 0.445 |
+ 0.422 |
+ 0.623 |
+ 0.395 |
+ 0.599 |
+ 0.621 |
+ 0.200 |
+ 0.502 |
+ 0.669 |
+ 0.410 |
+ 0.603 |
+ 0.543 |
+ 0.553 |
+ 0.551 |
+ 0.555 |
+ 0.570 |
+ 0.568 |
+ 0.574 |
+ 0.583 |
+ 0.569 |
+ 0.737 |
+ 0.775 |
+ 167529 |
+ Activity |
+ PHOT_CHLRE |
+ High |
+ Eukaryote |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C |
+ 0.222 |
+ 0.327 |
+ 0.680 |
+ 0.645 |
+ 0.631 |
+ 0.661 |
+ 0.585 |
+ 0.500 |
+ 0.670 |
+ 0.707 |
+ 0.654 |
+ 0.576 |
+ 0.674 |
+ 0.660 |
+ 0.646 |
+ 0.665 |
+ 0.670 |
+ 0.520 |
+ 0.548 |
+ 0.656 |
+ 0.456 |
+ 0.671 |
+ 0.630 |
+ 0.610 |
+ 0.559 |
+ 0.672 |
+ 0.681 |
+ 0.647 |
+ 0.658 |
+ 0.689 |
+ 0.692 |
+ 0.644 |
+ 0.531 |
+ 0.539 |
+ 0.626 |
+ 0.667 |
+ 0.607 |
+ 0.671 |
+ 0.720 |
+ 0.677 |
+ 0.705 |
+ 0.730 |
+ 0.556 |
+ -0.189 |
+ 0.623 |
+ 0.510 |
+ 0.547 |
+ 0.574 |
+ 0.691 |
+ 0.669 |
+ 0.659 |
+ 0.627 |
+ 0.498 |
+ 0.563 |
+ 0.575 |
+ 0.604 |
+ 0.608 |
+ 0.649 |
+ 0.626 |
+ 0.620 |
+ 0.740 |
+ 0.742 |
+ 802 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M |
+ 0.579 |
+ 0.520 |
+ 0.548 |
+ 0.555 |
+ 0.549 |
+ 0.553 |
+ 0.506 |
+ 0.470 |
+ 0.538 |
+ 0.561 |
+ 0.467 |
+ 0.446 |
+ 0.436 |
+ 0.557 |
+ 0.652 |
+ 0.675 |
+ 0.626 |
+ 0.586 |
+ 0.507 |
+ 0.542 |
+ 0.513 |
+ 0.454 |
+ 0.477 |
+ 0.450 |
+ 0.547 |
+ 0.493 |
+ 0.465 |
+ 0.428 |
+ 0.468 |
+ 0.536 |
+ 0.403 |
+ 0.386 |
+ 0.450 |
+ 0.558 |
+ 0.508 |
+ 0.481 |
+ 0.596 |
+ 0.548 |
+ 0.538 |
+ 0.573 |
+ 0.537 |
+ 0.538 |
+ 0.468 |
+ 0.412 |
+ 0.282 |
+ 0.333 |
+ 0.453 |
+ 0.295 |
+ 0.693 |
+ 0.630 |
+ 0.572 |
+ 0.579 |
+ 0.572 |
+ 0.590 |
+ 0.602 |
+ 0.609 |
+ 0.593 |
+ 0.594 |
+ 0.598 |
+ 0.598 |
+ 0.470 |
+ 0.593 |
+ 1824 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF |
+ 0.179 |
+ 0.188 |
+ 0.218 |
+ 0.234 |
+ 0.249 |
+ 0.254 |
+ 0.198 |
+ 0.187 |
+ 0.165 |
+ 0.206 |
+ 0.253 |
+ 0.278 |
+ 0.310 |
+ 0.314 |
+ 0.347 |
+ 0.464 |
+ 0.298 |
+ 0.236 |
+ 0.238 |
+ 0.277 |
+ 0.310 |
+ 0.289 |
+ 0.278 |
+ 0.325 |
+ 0.300 |
+ 0.291 |
+ 0.310 |
+ 0.292 |
+ 0.295 |
+ 0.290 |
+ 0.326 |
+ 0.317 |
+ 0.067 |
+ 0.335 |
+ 0.278 |
+ 0.311 |
+ 0.262 |
+ 0.285 |
+ 0.308 |
+ 0.288 |
+ 0.280 |
+ 0.293 |
+ 0.266 |
+ 0.279 |
+ 0.260 |
+ 0.300 |
+ 0.512 |
+ 0.379 |
+ 0.493 |
+ 0.469 |
+ 0.258 |
+ 0.303 |
+ 0.317 |
+ 0.260 |
+ 0.284 |
+ 0.287 |
+ 0.286 |
+ 0.299 |
+ 0.314 |
+ 0.299 |
+ 0.324 |
+ 0.464 |
+ 1301 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_CXB3N_Mattenberger_2021 |
+ 0.423 |
+ 0.390 |
+ 0.377 |
+ 0.411 |
+ 0.460 |
+ 0.473 |
+ -0.036 |
+ 0.336 |
+ 0.500 |
+ 0.499 |
+ 0.292 |
+ -0.059 |
+ 0.042 |
+ -0.079 |
+ -0.056 |
+ 0.177 |
+ 0.395 |
+ 0.404 |
+ 0.426 |
+ 0.356 |
+ 0.339 |
+ 0.390 |
+ 0.381 |
+ 0.377 |
+ 0.138 |
+ 0.388 |
+ 0.383 |
+ 0.369 |
+ 0.393 |
+ 0.495 |
+ 0.386 |
+ 0.319 |
+ 0.007 |
+ 0.049 |
+ 0.274 |
+ 0.355 |
+ 0.342 |
+ 0.385 |
+ 0.413 |
+ 0.387 |
+ 0.430 |
+ 0.458 |
+ -0.065 |
+ -0.066 |
+ 0.342 |
+ -0.054 |
+ 0.193 |
+ 0.339 |
+ 0.113 |
+ 0.046 |
+ 0.342 |
+ 0.361 |
+ 0.364 |
+ 0.369 |
+ 0.364 |
+ 0.365 |
+ 0.365 |
+ 0.365 |
+ 0.374 |
+ 0.374 |
+ 0.110 |
+ 0.068 |
+ 15711 |
+ OrganismalFitness |
+ POLG_CXB3N |
+ Medium |
+ Virus |
+
+
+ POLG_DEN26_Suphatrakul_2023 |
+ 0.500 |
+ 0.578 |
+ 0.289 |
+ 0.286 |
+ 0.532 |
+ 0.533 |
+ -0.044 |
+ 0.434 |
+ 0.670 |
+ 0.675 |
+ 0.318 |
+ -0.006 |
+ 0.000 |
+ -0.044 |
+ 0.007 |
+ 0.077 |
+ 0.153 |
+ 0.275 |
+ 0.366 |
+ 0.409 |
+ 0.453 |
+ 0.473 |
+ 0.449 |
+ 0.456 |
+ 0.470 |
+ 0.501 |
+ 0.493 |
+ 0.488 |
+ 0.461 |
+ 0.622 |
+ 0.603 |
+ 0.522 |
+ 0.035 |
+ -0.064 |
+ 0.115 |
+ 0.471 |
+ 0.432 |
+ 0.383 |
+ 0.542 |
+ 0.469 |
+ 0.397 |
+ 0.563 |
+ -0.055 |
+ -0.059 |
+ 0.391 |
+ -0.005 |
+ 0.360 |
+ 0.510 |
+ 0.101 |
+ 0.116 |
+ 0.241 |
+ 0.229 |
+ 0.244 |
+ 0.260 |
+ 0.227 |
+ 0.261 |
+ 0.240 |
+ 0.246 |
+ 0.223 |
+ 0.254 |
+ 0.161 |
+ 0.032 |
+ 16897 |
+ OrganismalFitness |
+ POLG_DEN26 |
+ Low |
+ Virus |
+
+
+ POLG_HCVJF_Qi_2014 |
+ 0.605 |
+ 0.547 |
+ 0.410 |
+ 0.413 |
+ 0.605 |
+ 0.614 |
+ -0.039 |
+ 0.196 |
+ 0.565 |
+ 0.577 |
+ 0.178 |
+ 0.637 |
+ 0.635 |
+ 0.101 |
+ 0.128 |
+ 0.114 |
+ 0.116 |
+ 0.090 |
+ 0.078 |
+ 0.260 |
+ 0.400 |
+ 0.443 |
+ 0.452 |
+ 0.492 |
+ 0.422 |
+ 0.475 |
+ 0.320 |
+ 0.410 |
+ 0.517 |
+ 0.630 |
+ 0.587 |
+ 0.485 |
+ 0.182 |
+ 0.474 |
+ 0.505 |
+ 0.522 |
+ 0.515 |
+ 0.547 |
+ 0.578 |
+ 0.480 |
+ 0.528 |
+ 0.560 |
+ 0.114 |
+ 0.053 |
+ 0.506 |
+ 0.109 |
+ -0.037 |
+ 0.349 |
+ 0.643 |
+ 0.350 |
+ 0.308 |
+ 0.378 |
+ 0.344 |
+ 0.379 |
+ 0.342 |
+ 0.283 |
+ 0.323 |
+ 0.318 |
+ 0.314 |
+ 0.372 |
+ 0.173 |
+ 0.159 |
+ 1630 |
+ OrganismalFitness |
+ POLG_HCVJF |
+ Medium |
+ Virus |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD |
+ 0.251 |
+ 0.455 |
+ 0.373 |
+ 0.407 |
+ 0.471 |
+ 0.463 |
+ 0.070 |
+ 0.420 |
+ 0.390 |
+ 0.514 |
+ 0.375 |
+ 0.062 |
+ 0.157 |
+ 0.132 |
+ 0.140 |
+ 0.031 |
+ 0.160 |
+ 0.114 |
+ 0.084 |
+ 0.592 |
+ 0.079 |
+ 0.058 |
+ 0.016 |
+ 0.157 |
+ 0.106 |
+ -0.059 |
+ 0.008 |
+ -0.084 |
+ 0.170 |
+ 0.507 |
+ 0.575 |
+ 0.580 |
+ 0.131 |
+ 0.003 |
+ -0.033 |
+ 0.053 |
+ 0.382 |
+ 0.383 |
+ 0.384 |
+ 0.453 |
+ 0.463 |
+ 0.452 |
+ 0.006 |
+ -0.037 |
+ 0.033 |
+ 0.026 |
+ 0.448 |
+ 0.443 |
+ 0.650 |
+ 0.576 |
+ 0.658 |
+ 0.664 |
+ 0.615 |
+ 0.736 |
+ 0.705 |
+ 0.718 |
+ 0.692 |
+ 0.731 |
+ 0.718 |
+ 0.718 |
+ 0.716 |
+ 0.686 |
+ 5130 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PPARG_HUMAN_Majithia_2016 |
+ 0.353 |
+ 0.539 |
+ 0.629 |
+ 0.655 |
+ 0.612 |
+ 0.632 |
+ 0.187 |
+ 0.339 |
+ 0.557 |
+ 0.571 |
+ 0.623 |
+ 0.696 |
+ 0.695 |
+ 0.028 |
+ 0.126 |
+ 0.456 |
+ 0.594 |
+ 0.736 |
+ 0.768 |
+ 0.717 |
+ 0.675 |
+ 0.692 |
+ 0.250 |
+ 0.288 |
+ 0.592 |
+ 0.722 |
+ 0.692 |
+ 0.719 |
+ 0.299 |
+ 0.712 |
+ 0.583 |
+ 0.509 |
+ 0.384 |
+ 0.709 |
+ 0.683 |
+ 0.587 |
+ 0.673 |
+ 0.700 |
+ 0.637 |
+ 0.698 |
+ 0.706 |
+ 0.680 |
+ 0.266 |
+ 0.030 |
+ 0.585 |
+ 0.450 |
+ 0.594 |
+ 0.607 |
+ 0.631 |
+ 0.302 |
+ 0.580 |
+ 0.588 |
+ 0.593 |
+ 0.575 |
+ 0.599 |
+ 0.583 |
+ 0.602 |
+ 0.598 |
+ 0.585 |
+ 0.603 |
+ 0.657 |
+ 0.499 |
+ 9576 |
+ Activity |
+ PPARG_HUMAN |
+ Medium |
+ Human |
+
+
+ PPM1D_HUMAN_Miller_2022 |
+ 0.550 |
+ 0.523 |
+ 0.531 |
+ 0.536 |
+ 0.609 |
+ 0.610 |
+ 0.035 |
+ 0.378 |
+ 0.460 |
+ 0.517 |
+ 0.578 |
+ 0.599 |
+ 0.618 |
+ 0.281 |
+ 0.386 |
+ 0.470 |
+ 0.602 |
+ 0.628 |
+ 0.618 |
+ 0.387 |
+ 0.450 |
+ 0.526 |
+ 0.537 |
+ 0.430 |
+ 0.516 |
+ 0.577 |
+ 0.567 |
+ 0.557 |
+ 0.419 |
+ 0.601 |
+ 0.594 |
+ 0.546 |
+ 0.234 |
+ 0.428 |
+ 0.528 |
+ 0.537 |
+ 0.582 |
+ 0.593 |
+ 0.591 |
+ 0.615 |
+ 0.614 |
+ 0.616 |
+ 0.361 |
+ -0.050 |
+ 0.575 |
+ 0.515 |
+ 0.503 |
+ 0.579 |
+ 0.563 |
+ 0.236 |
+ 0.574 |
+ 0.575 |
+ 0.570 |
+ 0.586 |
+ 0.593 |
+ 0.580 |
+ 0.584 |
+ 0.585 |
+ 0.590 |
+ 0.595 |
+ 0.633 |
+ 0.509 |
+ 7889 |
+ OrganismalFitness |
+ PPM1D_HUMAN |
+ Low |
+ Human |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC |
+ 0.562 |
+ 0.655 |
+ 0.745 |
+ 0.736 |
+ 0.772 |
+ 0.774 |
+ 0.518 |
+ 0.493 |
+ 0.774 |
+ 0.783 |
+ 0.727 |
+ 0.592 |
+ 0.681 |
+ 0.540 |
+ 0.535 |
+ 0.821 |
+ 0.805 |
+ 0.740 |
+ 0.727 |
+ 0.715 |
+ 0.552 |
+ 0.694 |
+ 0.700 |
+ 0.725 |
+ 0.634 |
+ 0.716 |
+ 0.733 |
+ 0.723 |
+ 0.744 |
+ 0.795 |
+ 0.777 |
+ 0.756 |
+ 0.637 |
+ 0.593 |
+ 0.556 |
+ 0.611 |
+ 0.735 |
+ 0.746 |
+ 0.738 |
+ 0.764 |
+ 0.784 |
+ 0.775 |
+ 0.438 |
+ 0.281 |
+ 0.326 |
+ 0.431 |
+ 0.639 |
+ 0.423 |
+ 0.794 |
+ 0.783 |
+ 0.792 |
+ 0.809 |
+ 0.804 |
+ 0.812 |
+ 0.828 |
+ 0.823 |
+ 0.819 |
+ 0.816 |
+ 0.816 |
+ 0.821 |
+ 0.801 |
+ 0.835 |
+ 2033 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PRKN_HUMAN_Clausen_2023 |
+ 0.605 |
+ 0.619 |
+ 0.600 |
+ 0.586 |
+ 0.622 |
+ 0.614 |
+ 0.182 |
+ 0.459 |
+ 0.525 |
+ 0.540 |
+ 0.582 |
+ 0.608 |
+ 0.638 |
+ 0.233 |
+ 0.292 |
+ 0.363 |
+ 0.501 |
+ 0.658 |
+ 0.663 |
+ 0.447 |
+ 0.281 |
+ 0.557 |
+ 0.609 |
+ 0.576 |
+ 0.447 |
+ 0.632 |
+ 0.623 |
+ 0.623 |
+ 0.520 |
+ 0.622 |
+ 0.568 |
+ 0.526 |
+ 0.266 |
+ 0.241 |
+ 0.550 |
+ 0.593 |
+ 0.569 |
+ 0.621 |
+ 0.640 |
+ 0.611 |
+ 0.638 |
+ 0.654 |
+ 0.309 |
+ 0.093 |
+ 0.591 |
+ 0.352 |
+ 0.649 |
+ 0.618 |
+ 0.696 |
+ 0.298 |
+ 0.542 |
+ 0.555 |
+ 0.561 |
+ 0.583 |
+ 0.567 |
+ 0.571 |
+ 0.579 |
+ 0.573 |
+ 0.561 |
+ 0.583 |
+ 0.672 |
+ 0.519 |
+ 8756 |
+ Expression |
+ PRKN_HUMAN |
+ Low |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE |
+ 0.572 |
+ 0.576 |
+ 0.566 |
+ 0.566 |
+ 0.576 |
+ 0.572 |
+ 0.300 |
+ 0.442 |
+ 0.491 |
+ 0.497 |
+ 0.692 |
+ 0.601 |
+ 0.615 |
+ 0.498 |
+ 0.570 |
+ 0.678 |
+ 0.708 |
+ 0.627 |
+ 0.583 |
+ 0.477 |
+ 0.304 |
+ 0.242 |
+ 0.467 |
+ 0.391 |
+ 0.462 |
+ 0.492 |
+ -0.202 |
+ 0.471 |
+ 0.529 |
+ 0.553 |
+ 0.558 |
+ 0.505 |
+ 0.098 |
+ 0.328 |
+ 0.409 |
+ 0.430 |
+ 0.546 |
+ 0.543 |
+ 0.498 |
+ 0.586 |
+ 0.585 |
+ 0.541 |
+ 0.422 |
+ 0.237 |
+ 0.635 |
+ 0.482 |
+ 0.586 |
+ 0.626 |
+ 0.699 |
+ 0.643 |
+ 0.700 |
+ 0.684 |
+ 0.687 |
+ 0.685 |
+ 0.686 |
+ 0.692 |
+ 0.681 |
+ 0.687 |
+ 0.689 |
+ 0.699 |
+ 0.682 |
+ 0.674 |
+ 1579 |
+ Stability |
+ PSAE_PICP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Matreyek_2021 |
+ 0.372 |
+ 0.393 |
+ 0.368 |
+ 0.390 |
+ 0.397 |
+ 0.408 |
+ 0.153 |
+ 0.358 |
+ 0.426 |
+ 0.471 |
+ 0.449 |
+ 0.409 |
+ 0.454 |
+ 0.165 |
+ 0.274 |
+ 0.462 |
+ 0.465 |
+ 0.265 |
+ 0.290 |
+ 0.477 |
+ 0.191 |
+ 0.460 |
+ 0.408 |
+ 0.391 |
+ 0.270 |
+ 0.338 |
+ 0.323 |
+ 0.377 |
+ 0.294 |
+ 0.479 |
+ 0.390 |
+ 0.388 |
+ 0.127 |
+ 0.275 |
+ 0.427 |
+ 0.343 |
+ 0.383 |
+ 0.457 |
+ 0.412 |
+ 0.419 |
+ 0.463 |
+ 0.447 |
+ 0.242 |
+ 0.027 |
+ 0.442 |
+ 0.347 |
+ 0.482 |
+ 0.473 |
+ 0.482 |
+ 0.229 |
+ 0.446 |
+ 0.428 |
+ 0.461 |
+ 0.441 |
+ 0.440 |
+ 0.451 |
+ 0.456 |
+ 0.455 |
+ 0.447 |
+ 0.460 |
+ 0.499 |
+ 0.454 |
+ 5083 |
+ Expression |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ PTEN_HUMAN_Mighell_2018 |
+ 0.480 |
+ 0.504 |
+ 0.511 |
+ 0.516 |
+ 0.536 |
+ 0.540 |
+ 0.177 |
+ 0.347 |
+ 0.504 |
+ 0.511 |
+ 0.475 |
+ 0.469 |
+ 0.499 |
+ 0.206 |
+ 0.407 |
+ 0.546 |
+ 0.519 |
+ 0.308 |
+ 0.291 |
+ 0.514 |
+ 0.335 |
+ 0.413 |
+ 0.328 |
+ 0.279 |
+ 0.410 |
+ 0.291 |
+ 0.278 |
+ 0.317 |
+ 0.228 |
+ 0.525 |
+ 0.485 |
+ 0.475 |
+ 0.058 |
+ 0.374 |
+ 0.404 |
+ 0.299 |
+ 0.499 |
+ 0.474 |
+ 0.418 |
+ 0.532 |
+ 0.527 |
+ 0.531 |
+ 0.371 |
+ -0.009 |
+ 0.510 |
+ 0.484 |
+ 0.421 |
+ 0.416 |
+ 0.480 |
+ 0.202 |
+ 0.488 |
+ 0.459 |
+ 0.488 |
+ 0.495 |
+ 0.511 |
+ 0.505 |
+ 0.495 |
+ 0.495 |
+ 0.509 |
+ 0.510 |
+ 0.539 |
+ 0.510 |
+ 7260 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q2N0S5_9HIV1_Haddox_2018 |
+ 0.493 |
+ 0.379 |
+ 0.352 |
+ 0.393 |
+ 0.495 |
+ 0.502 |
+ 0.003 |
+ 0.437 |
+ 0.510 |
+ 0.515 |
+ 0.470 |
+ 0.509 |
+ 0.537 |
+ -0.005 |
+ -0.003 |
+ -0.002 |
+ 0.044 |
+ 0.093 |
+ 0.168 |
+ 0.403 |
+ 0.518 |
+ 0.403 |
+ 0.390 |
+ 0.337 |
+ 0.517 |
+ 0.401 |
+ 0.394 |
+ 0.436 |
+ 0.354 |
+ 0.507 |
+ 0.478 |
+ 0.483 |
+ 0.291 |
+ 0.494 |
+ 0.419 |
+ 0.406 |
+ 0.520 |
+ 0.502 |
+ 0.501 |
+ 0.526 |
+ 0.515 |
+ 0.513 |
+ 0.413 |
+ -0.016 |
+ 0.502 |
+ 0.456 |
+ 0.396 |
+ 0.463 |
+ 0.263 |
+ 0.145 |
+ 0.226 |
+ 0.266 |
+ 0.307 |
+ 0.301 |
+ 0.250 |
+ 0.257 |
+ 0.260 |
+ 0.245 |
+ 0.223 |
+ 0.280 |
+ 0.232 |
+ 0.116 |
+ 12729 |
+ OrganismalFitness |
+ Q2N0S5_9HIV1 |
+ Medium |
+ Virus |
+
+
+ Q53Z42_HUMAN_McShan_2019_binding-TAPBPR |
+ 0.356 |
+ 0.341 |
+ 0.348 |
+ 0.353 |
+ 0.358 |
+ 0.355 |
+ 0.081 |
+ 0.245 |
+ 0.314 |
+ 0.324 |
+ 0.235 |
+ 0.301 |
+ 0.325 |
+ 0.192 |
+ 0.186 |
+ 0.364 |
+ 0.330 |
+ 0.287 |
+ 0.347 |
+ 0.329 |
+ 0.249 |
+ 0.267 |
+ 0.257 |
+ 0.215 |
+ 0.292 |
+ 0.297 |
+ 0.268 |
+ 0.306 |
+ 0.299 |
+ 0.303 |
+ 0.371 |
+ 0.341 |
+ 0.098 |
+ 0.238 |
+ 0.268 |
+ 0.182 |
+ 0.311 |
+ 0.334 |
+ 0.294 |
+ 0.352 |
+ 0.365 |
+ 0.348 |
+ 0.402 |
+ 0.202 |
+ 0.321 |
+ 0.328 |
+ 0.311 |
+ 0.341 |
+ 0.338 |
+ 0.192 |
+ 0.291 |
+ 0.302 |
+ 0.333 |
+ 0.334 |
+ 0.321 |
+ 0.321 |
+ 0.316 |
+ 0.312 |
+ 0.312 |
+ 0.325 |
+ 0.330 |
+ 0.465 |
+ 3344 |
+ Binding |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q53Z42_HUMAN_McShan_2019_expression |
+ 0.524 |
+ 0.485 |
+ 0.526 |
+ 0.538 |
+ 0.550 |
+ 0.548 |
+ -0.062 |
+ 0.415 |
+ 0.541 |
+ 0.556 |
+ 0.413 |
+ 0.467 |
+ 0.507 |
+ 0.088 |
+ 0.106 |
+ 0.489 |
+ 0.554 |
+ 0.550 |
+ 0.558 |
+ 0.546 |
+ 0.415 |
+ 0.470 |
+ 0.459 |
+ 0.448 |
+ 0.476 |
+ 0.514 |
+ 0.492 |
+ 0.511 |
+ 0.534 |
+ 0.533 |
+ 0.553 |
+ 0.505 |
+ 0.183 |
+ 0.417 |
+ 0.440 |
+ 0.405 |
+ 0.520 |
+ 0.538 |
+ 0.533 |
+ 0.555 |
+ 0.564 |
+ 0.563 |
+ 0.308 |
+ 0.093 |
+ 0.526 |
+ 0.483 |
+ 0.454 |
+ 0.490 |
+ 0.458 |
+ 0.207 |
+ 0.533 |
+ 0.546 |
+ 0.555 |
+ 0.551 |
+ 0.565 |
+ 0.556 |
+ 0.557 |
+ 0.560 |
+ 0.564 |
+ 0.570 |
+ 0.513 |
+ 0.622 |
+ 3344 |
+ Expression |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q59976_STRSQ_Romero_2015 |
+ 0.475 |
+ 0.593 |
+ 0.634 |
+ 0.643 |
+ 0.654 |
+ 0.662 |
+ 0.363 |
+ 0.535 |
+ 0.672 |
+ 0.679 |
+ 0.588 |
+ 0.519 |
+ 0.543 |
+ 0.144 |
+ 0.448 |
+ 0.516 |
+ 0.572 |
+ 0.560 |
+ 0.570 |
+ 0.649 |
+ 0.598 |
+ 0.651 |
+ 0.652 |
+ 0.663 |
+ 0.622 |
+ 0.662 |
+ 0.677 |
+ 0.673 |
+ 0.680 |
+ 0.684 |
+ 0.633 |
+ 0.575 |
+ 0.304 |
+ 0.606 |
+ 0.652 |
+ 0.634 |
+ 0.616 |
+ 0.653 |
+ 0.657 |
+ 0.666 |
+ 0.672 |
+ 0.675 |
+ 0.490 |
+ 0.008 |
+ 0.587 |
+ 0.524 |
+ 0.408 |
+ 0.533 |
+ 0.533 |
+ 0.155 |
+ 0.560 |
+ 0.537 |
+ 0.533 |
+ 0.556 |
+ 0.557 |
+ 0.542 |
+ 0.562 |
+ 0.545 |
+ 0.553 |
+ 0.563 |
+ 0.634 |
+ 0.540 |
+ 2999 |
+ Activity |
+ Q59976_STRSQ |
+ Medium |
+ Prokaryote |
+
+
+ Q6WV13_9MAXI_Somermeyer_2022 |
+ 0.327 |
+ 0.393 |
+ 0.260 |
+ 0.264 |
+ 0.333 |
+ 0.334 |
+ -0.031 |
+ 0.115 |
+ 0.413 |
+ 0.407 |
+ 0.242 |
+ 0.036 |
+ 0.022 |
+ 0.014 |
+ 0.046 |
+ 0.011 |
+ 0.010 |
+ 0.010 |
+ -0.022 |
+ 0.294 |
+ 0.013 |
+ 0.045 |
+ 0.003 |
+ 0.102 |
+ 0.004 |
+ -0.002 |
+ 0.030 |
+ -0.005 |
+ -0.013 |
+ 0.426 |
+ 0.344 |
+ 0.354 |
+ 0.043 |
+ 0.031 |
+ 0.022 |
+ 0.014 |
+ 0.248 |
+ 0.246 |
+ 0.245 |
+ 0.330 |
+ 0.327 |
+ 0.324 |
+ 0.014 |
+ -0.015 |
+ -0.004 |
+ 0.019 |
+ 0.161 |
+ 0.156 |
+ 0.299 |
+ 0.169 |
+ 0.228 |
+ 0.248 |
+ 0.237 |
+ 0.287 |
+ 0.267 |
+ 0.272 |
+ 0.261 |
+ 0.263 |
+ 0.264 |
+ 0.261 |
+ 0.072 |
+ 0.031 |
+ 31401 |
+ Activity |
+ Q6WV12_9MAXI |
+ Low |
+ Eukaryote |
+
+
+ Q837P4_ENTFA_Meier_2023 |
+ 0.451 |
+ 0.465 |
+ 0.489 |
+ 0.507 |
+ 0.503 |
+ 0.517 |
+ 0.454 |
+ 0.404 |
+ 0.479 |
+ 0.495 |
+ 0.563 |
+ 0.545 |
+ 0.566 |
+ 0.420 |
+ 0.475 |
+ 0.492 |
+ 0.515 |
+ 0.545 |
+ 0.523 |
+ -0.019 |
+ 0.521 |
+ 0.445 |
+ 0.441 |
+ 0.462 |
+ 0.552 |
+ 0.535 |
+ 0.541 |
+ 0.449 |
+ 0.521 |
+ 0.524 |
+ 0.527 |
+ 0.477 |
+ 0.190 |
+ 0.491 |
+ 0.516 |
+ 0.454 |
+ 0.491 |
+ 0.538 |
+ 0.512 |
+ 0.540 |
+ 0.549 |
+ 0.544 |
+ 0.458 |
+ 0.370 |
+ 0.545 |
+ 0.490 |
+ 0.231 |
+ 0.454 |
+ 0.373 |
+ 0.114 |
+ 0.511 |
+ 0.488 |
+ 0.499 |
+ 0.511 |
+ 0.522 |
+ 0.529 |
+ 0.536 |
+ 0.491 |
+ 0.524 |
+ 0.531 |
+ 0.589 |
+ 0.483 |
+ 697 |
+ Activity |
+ Q837P4_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q837P5_ENTFA_Meier_2023 |
+ 0.196 |
+ 0.416 |
+ 0.421 |
+ 0.459 |
+ 0.390 |
+ 0.395 |
+ 0.222 |
+ 0.263 |
+ 0.282 |
+ 0.294 |
+ 0.326 |
+ 0.353 |
+ 0.342 |
+ 0.129 |
+ 0.240 |
+ 0.334 |
+ 0.374 |
+ 0.436 |
+ 0.392 |
+ 0.321 |
+ 0.356 |
+ 0.399 |
+ 0.435 |
+ 0.410 |
+ 0.319 |
+ 0.408 |
+ 0.428 |
+ 0.511 |
+ 0.455 |
+ 0.336 |
+ 0.378 |
+ 0.352 |
+ 0.186 |
+ 0.399 |
+ 0.427 |
+ 0.510 |
+ 0.342 |
+ 0.391 |
+ 0.477 |
+ 0.417 |
+ 0.428 |
+ 0.457 |
+ 0.274 |
+ 0.186 |
+ 0.356 |
+ 0.271 |
+ 0.353 |
+ 0.296 |
+ 0.391 |
+ 0.098 |
+ 0.326 |
+ 0.344 |
+ 0.359 |
+ 0.313 |
+ 0.383 |
+ 0.348 |
+ 0.318 |
+ 0.353 |
+ 0.320 |
+ 0.355 |
+ 0.341 |
+ 0.319 |
+ 747 |
+ Activity |
+ Q837P5_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q8WTC7_9CNID_Somermeyer_2022 |
+ 0.286 |
+ 0.368 |
+ 0.217 |
+ 0.215 |
+ 0.310 |
+ 0.305 |
+ 0.025 |
+ 0.303 |
+ 0.316 |
+ 0.322 |
+ 0.223 |
+ -0.009 |
+ -0.018 |
+ -0.027 |
+ -0.035 |
+ -0.023 |
+ -0.025 |
+ -0.007 |
+ 0.034 |
+ 0.230 |
+ 0.013 |
+ 0.037 |
+ 0.057 |
+ 0.013 |
+ -0.005 |
+ -0.001 |
+ 0.037 |
+ 0.252 |
+ 0.267 |
+ 0.373 |
+ 0.297 |
+ 0.307 |
+ -0.018 |
+ -0.026 |
+ 0.009 |
+ 0.303 |
+ 0.231 |
+ 0.237 |
+ 0.318 |
+ 0.304 |
+ 0.309 |
+ 0.345 |
+ -0.013 |
+ -0.008 |
+ -0.024 |
+ -0.027 |
+ 0.097 |
+ 0.143 |
+ 0.292 |
+ 0.189 |
+ 0.242 |
+ 0.234 |
+ 0.246 |
+ 0.260 |
+ 0.240 |
+ 0.254 |
+ 0.249 |
+ 0.243 |
+ 0.238 |
+ 0.247 |
+ 0.137 |
+ -0.001 |
+ 33510 |
+ Activity |
+ Q8WTC7_9CNID |
+ Low |
+ Eukaryote |
+
+
+ R1AB_SARS2_Flynn_2022 |
+ 0.577 |
+ 0.561 |
+ 0.212 |
+ 0.227 |
+ 0.600 |
+ 0.605 |
+ -0.049 |
+ 0.292 |
+ -0.037 |
+ -0.037 |
+ 0.103 |
+ -0.030 |
+ -0.040 |
+ -0.009 |
+ -0.026 |
+ 0.079 |
+ 0.105 |
+ 0.498 |
+ 0.577 |
+ 0.266 |
+ 0.214 |
+ 0.259 |
+ 0.274 |
+ 0.289 |
+ 0.242 |
+ 0.236 |
+ 0.203 |
+ 0.210 |
+ 0.224 |
+ 0.558 |
+ 0.507 |
+ 0.408 |
+ -0.056 |
+ 0.157 |
+ 0.220 |
+ 0.216 |
+ 0.350 |
+ 0.399 |
+ 0.401 |
+ 0.546 |
+ 0.567 |
+ 0.565 |
+ -0.031 |
+ -0.051 |
+ 0.073 |
+ -0.038 |
+ 0.453 |
+ 0.415 |
+ 0.496 |
+ 0.232 |
+ 0.251 |
+ 0.228 |
+ 0.265 |
+ 0.298 |
+ 0.275 |
+ 0.274 |
+ 0.273 |
+ 0.268 |
+ 0.237 |
+ 0.277 |
+ 0.241 |
+ 0.137 |
+ 5725 |
+ OrganismalFitness |
+ R1AB_SARS2 |
+ Medium |
+ Virus |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ |
+ 0.280 |
+ 0.183 |
+ 0.286 |
+ 0.299 |
+ 0.324 |
+ 0.356 |
+ 0.508 |
+ 0.373 |
+ 0.654 |
+ 0.624 |
+ 0.507 |
+ 0.403 |
+ 0.450 |
+ 0.458 |
+ 0.687 |
+ 0.704 |
+ 0.493 |
+ 0.467 |
+ 0.542 |
+ 0.412 |
+ 0.541 |
+ 0.575 |
+ 0.536 |
+ 0.454 |
+ 0.552 |
+ 0.419 |
+ 0.546 |
+ 0.444 |
+ 0.417 |
+ 0.598 |
+ 0.422 |
+ 0.365 |
+ 0.258 |
+ 0.477 |
+ 0.601 |
+ 0.439 |
+ 0.497 |
+ 0.588 |
+ 0.467 |
+ 0.436 |
+ 0.486 |
+ 0.410 |
+ 0.573 |
+ 0.222 |
+ 0.147 |
+ 0.370 |
+ 0.460 |
+ 0.233 |
+ 0.656 |
+ 0.548 |
+ 0.544 |
+ 0.578 |
+ 0.577 |
+ 0.577 |
+ 0.606 |
+ 0.581 |
+ 0.571 |
+ 0.575 |
+ 0.573 |
+ 0.585 |
+ 0.373 |
+ 0.582 |
+ 912 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RAF1_HUMAN_Zinkus-Boltz_2019 |
+ 0.405 |
+ 0.425 |
+ 0.382 |
+ 0.389 |
+ 0.408 |
+ 0.408 |
+ 0.045 |
+ 0.339 |
+ 0.445 |
+ 0.439 |
+ 0.471 |
+ 0.423 |
+ 0.482 |
+ 0.034 |
+ 0.242 |
+ 0.442 |
+ 0.473 |
+ 0.433 |
+ 0.399 |
+ 0.311 |
+ 0.285 |
+ 0.378 |
+ 0.378 |
+ 0.339 |
+ 0.377 |
+ 0.395 |
+ 0.378 |
+ 0.355 |
+ 0.359 |
+ 0.439 |
+ 0.473 |
+ 0.421 |
+ 0.120 |
+ 0.287 |
+ 0.339 |
+ 0.363 |
+ 0.372 |
+ 0.380 |
+ 0.396 |
+ 0.403 |
+ 0.430 |
+ 0.441 |
+ 0.221 |
+ 0.039 |
+ 0.480 |
+ 0.387 |
+ 0.271 |
+ 0.383 |
+ 0.264 |
+ 0.234 |
+ 0.456 |
+ 0.430 |
+ 0.437 |
+ 0.483 |
+ 0.451 |
+ 0.475 |
+ 0.439 |
+ 0.453 |
+ 0.460 |
+ 0.472 |
+ 0.432 |
+ 0.160 |
+ 297 |
+ OrganismalFitness |
+ RAF1_HUMAN |
+ Low |
+ Human |
+
+
+ RASH_HUMAN_Bandaru_2017 |
+ 0.447 |
+ 0.436 |
+ 0.444 |
+ 0.476 |
+ 0.466 |
+ 0.480 |
+ 0.313 |
+ 0.353 |
+ 0.426 |
+ 0.446 |
+ 0.318 |
+ 0.360 |
+ 0.405 |
+ 0.457 |
+ 0.514 |
+ 0.487 |
+ 0.498 |
+ 0.440 |
+ 0.313 |
+ 0.514 |
+ 0.437 |
+ 0.414 |
+ 0.420 |
+ 0.396 |
+ 0.433 |
+ 0.403 |
+ 0.401 |
+ 0.374 |
+ 0.305 |
+ 0.434 |
+ 0.338 |
+ 0.319 |
+ 0.180 |
+ 0.399 |
+ 0.439 |
+ 0.377 |
+ 0.454 |
+ 0.478 |
+ 0.450 |
+ 0.472 |
+ 0.485 |
+ 0.487 |
+ 0.478 |
+ 0.326 |
+ 0.322 |
+ 0.453 |
+ 0.386 |
+ 0.252 |
+ 0.458 |
+ 0.255 |
+ 0.431 |
+ 0.423 |
+ 0.443 |
+ 0.428 |
+ 0.456 |
+ 0.430 |
+ 0.452 |
+ 0.430 |
+ 0.439 |
+ 0.455 |
+ 0.373 |
+ 0.474 |
+ 3134 |
+ Activity |
+ RASH_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_abundance |
+ 0.299 |
+ 0.244 |
+ 0.356 |
+ 0.346 |
+ 0.332 |
+ 0.353 |
+ 0.211 |
+ 0.244 |
+ 0.244 |
+ 0.271 |
+ 0.276 |
+ 0.210 |
+ 0.245 |
+ 0.336 |
+ 0.367 |
+ 0.335 |
+ 0.271 |
+ 0.231 |
+ 0.210 |
+ 0.331 |
+ 0.266 |
+ 0.333 |
+ 0.372 |
+ 0.384 |
+ 0.279 |
+ 0.347 |
+ 0.402 |
+ 0.345 |
+ 0.470 |
+ 0.357 |
+ 0.210 |
+ 0.167 |
+ 0.208 |
+ 0.198 |
+ 0.325 |
+ 0.398 |
+ 0.275 |
+ 0.368 |
+ 0.436 |
+ 0.334 |
+ 0.370 |
+ 0.388 |
+ 0.237 |
+ 0.335 |
+ 0.058 |
+ 0.243 |
+ 0.334 |
+ 0.327 |
+ 0.377 |
+ 0.326 |
+ 0.264 |
+ 0.245 |
+ 0.294 |
+ 0.279 |
+ 0.310 |
+ 0.281 |
+ 0.298 |
+ 0.280 |
+ 0.260 |
+ 0.287 |
+ 0.339 |
+ 0.331 |
+ 26012 |
+ Expression |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_binding-DARPin_K55 |
+ 0.419 |
+ 0.470 |
+ 0.602 |
+ 0.617 |
+ 0.598 |
+ 0.598 |
+ 0.268 |
+ 0.350 |
+ 0.641 |
+ 0.639 |
+ 0.585 |
+ 0.524 |
+ 0.566 |
+ 0.573 |
+ 0.605 |
+ 0.621 |
+ 0.652 |
+ 0.563 |
+ 0.446 |
+ 0.677 |
+ 0.568 |
+ 0.511 |
+ 0.549 |
+ 0.458 |
+ 0.537 |
+ 0.534 |
+ 0.479 |
+ 0.396 |
+ 0.358 |
+ 0.569 |
+ 0.529 |
+ 0.490 |
+ 0.289 |
+ 0.515 |
+ 0.589 |
+ 0.499 |
+ 0.555 |
+ 0.601 |
+ 0.543 |
+ 0.605 |
+ 0.616 |
+ 0.597 |
+ 0.560 |
+ 0.298 |
+ 0.464 |
+ 0.597 |
+ 0.239 |
+ 0.269 |
+ 0.585 |
+ 0.281 |
+ 0.545 |
+ 0.549 |
+ 0.542 |
+ 0.552 |
+ 0.556 |
+ 0.554 |
+ 0.544 |
+ 0.543 |
+ 0.510 |
+ 0.560 |
+ 0.601 |
+ 0.573 |
+ 24873 |
+ Binding |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RBP1_HUMAN_Tsuboyama_2023_2KWH |
+ 0.190 |
+ 0.108 |
+ 0.302 |
+ 0.307 |
+ 0.305 |
+ 0.303 |
+ 0.271 |
+ 0.277 |
+ 0.218 |
+ 0.233 |
+ 0.430 |
+ 0.409 |
+ 0.404 |
+ 0.321 |
+ 0.394 |
+ 0.491 |
+ 0.552 |
+ 0.328 |
+ 0.335 |
+ 0.259 |
+ 0.366 |
+ 0.108 |
+ 0.267 |
+ 0.263 |
+ 0.297 |
+ 0.078 |
+ 0.160 |
+ 0.293 |
+ 0.202 |
+ 0.365 |
+ 0.373 |
+ 0.318 |
+ 0.239 |
+ 0.291 |
+ 0.364 |
+ 0.384 |
+ 0.318 |
+ 0.340 |
+ 0.339 |
+ 0.317 |
+ 0.334 |
+ 0.318 |
+ 0.319 |
+ 0.255 |
+ 0.415 |
+ 0.406 |
+ 0.487 |
+ 0.545 |
+ 0.578 |
+ 0.459 |
+ 0.494 |
+ 0.479 |
+ 0.491 |
+ 0.485 |
+ 0.492 |
+ 0.486 |
+ 0.476 |
+ 0.493 |
+ 0.480 |
+ 0.493 |
+ 0.539 |
+ 0.526 |
+ 1332 |
+ Stability |
+ RBP1_HUMAN |
+ High |
+ Human |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO |
+ 0.304 |
+ 0.284 |
+ 0.362 |
+ 0.370 |
+ 0.367 |
+ 0.355 |
+ 0.303 |
+ 0.255 |
+ 0.412 |
+ 0.421 |
+ 0.486 |
+ 0.342 |
+ 0.393 |
+ 0.368 |
+ 0.392 |
+ 0.498 |
+ 0.505 |
+ 0.482 |
+ 0.478 |
+ 0.391 |
+ 0.267 |
+ 0.243 |
+ 0.330 |
+ 0.395 |
+ 0.320 |
+ 0.481 |
+ 0.454 |
+ 0.412 |
+ 0.465 |
+ 0.398 |
+ 0.478 |
+ 0.406 |
+ 0.046 |
+ 0.293 |
+ 0.308 |
+ 0.324 |
+ 0.360 |
+ 0.361 |
+ 0.352 |
+ 0.403 |
+ 0.402 |
+ 0.386 |
+ 0.304 |
+ 0.285 |
+ 0.371 |
+ 0.327 |
+ 0.442 |
+ 0.432 |
+ 0.560 |
+ 0.532 |
+ 0.541 |
+ 0.555 |
+ 0.559 |
+ 0.559 |
+ 0.545 |
+ 0.555 |
+ 0.551 |
+ 0.549 |
+ 0.556 |
+ 0.560 |
+ 0.531 |
+ 0.501 |
+ 1261 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RCRO_LAMBD_Tsuboyama_2023_1ORC |
+ 0.293 |
+ 0.518 |
+ 0.584 |
+ 0.577 |
+ 0.557 |
+ 0.588 |
+ 0.167 |
+ 0.263 |
+ 0.552 |
+ 0.556 |
+ 0.558 |
+ 0.380 |
+ 0.492 |
+ 0.226 |
+ 0.422 |
+ 0.451 |
+ 0.596 |
+ 0.573 |
+ 0.639 |
+ 0.573 |
+ 0.129 |
+ 0.170 |
+ 0.225 |
+ 0.175 |
+ -0.041 |
+ 0.132 |
+ -0.003 |
+ 0.087 |
+ 0.577 |
+ 0.590 |
+ 0.569 |
+ 0.537 |
+ 0.096 |
+ 0.178 |
+ 0.127 |
+ 0.567 |
+ 0.440 |
+ 0.450 |
+ 0.580 |
+ 0.568 |
+ 0.598 |
+ 0.621 |
+ 0.028 |
+ 0.139 |
+ 0.471 |
+ 0.187 |
+ 0.635 |
+ 0.646 |
+ 0.782 |
+ 0.690 |
+ 0.592 |
+ 0.580 |
+ 0.560 |
+ 0.585 |
+ 0.574 |
+ 0.583 |
+ 0.568 |
+ 0.569 |
+ 0.600 |
+ 0.584 |
+ 0.646 |
+ 0.541 |
+ 2278 |
+ Stability |
+ RCRO_LAMBD |
+ High |
+ Virus |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY |
+ 0.330 |
+ 0.333 |
+ 0.466 |
+ 0.466 |
+ 0.479 |
+ 0.479 |
+ 0.196 |
+ 0.434 |
+ 0.512 |
+ 0.517 |
+ 0.457 |
+ 0.550 |
+ 0.592 |
+ 0.254 |
+ 0.675 |
+ 0.533 |
+ 0.513 |
+ 0.464 |
+ 0.390 |
+ 0.462 |
+ 0.422 |
+ 0.422 |
+ 0.395 |
+ 0.453 |
+ 0.545 |
+ 0.513 |
+ 0.510 |
+ 0.477 |
+ 0.494 |
+ 0.510 |
+ 0.414 |
+ 0.399 |
+ 0.429 |
+ 0.288 |
+ 0.531 |
+ 0.475 |
+ 0.484 |
+ 0.568 |
+ 0.540 |
+ 0.482 |
+ 0.537 |
+ 0.536 |
+ 0.512 |
+ 0.074 |
+ 0.556 |
+ 0.548 |
+ 0.502 |
+ 0.548 |
+ 0.582 |
+ 0.505 |
+ 0.470 |
+ 0.454 |
+ 0.467 |
+ 0.449 |
+ 0.478 |
+ 0.475 |
+ 0.475 |
+ 0.474 |
+ 0.478 |
+ 0.481 |
+ 0.482 |
+ 0.631 |
+ 1019 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RDRP_I33A0_Li_2023 |
+ 0.313 |
+ 0.367 |
+ 0.376 |
+ 0.387 |
+ 0.482 |
+ 0.484 |
+ 0.029 |
+ 0.355 |
+ 0.520 |
+ 0.525 |
+ 0.166 |
+ 0.045 |
+ 0.057 |
+ 0.033 |
+ 0.041 |
+ 0.128 |
+ 0.322 |
+ 0.389 |
+ 0.491 |
+ 0.427 |
+ 0.385 |
+ 0.436 |
+ 0.454 |
+ 0.475 |
+ 0.125 |
+ 0.358 |
+ 0.342 |
+ 0.345 |
+ 0.413 |
+ 0.520 |
+ 0.450 |
+ 0.395 |
+ 0.075 |
+ 0.377 |
+ 0.423 |
+ 0.459 |
+ 0.425 |
+ 0.452 |
+ 0.474 |
+ 0.494 |
+ 0.508 |
+ 0.526 |
+ 0.030 |
+ 0.020 |
+ 0.171 |
+ 0.033 |
+ 0.215 |
+ 0.230 |
+ 0.188 |
+ 0.055 |
+ 0.300 |
+ 0.271 |
+ 0.279 |
+ 0.282 |
+ 0.303 |
+ 0.296 |
+ 0.283 |
+ 0.304 |
+ 0.310 |
+ 0.307 |
+ 0.149 |
+ 0.110 |
+ 12003 |
+ OrganismalFitness |
+ RDRP_I33A0 |
+ Low |
+ Virus |
+
+
+ REV_HV1H2_Fernandes_2016 |
+ 0.206 |
+ 0.159 |
+ 0.221 |
+ 0.227 |
+ 0.216 |
+ 0.216 |
+ 0.038 |
+ 0.316 |
+ 0.222 |
+ 0.232 |
+ 0.128 |
+ 0.245 |
+ 0.267 |
+ 0.046 |
+ 0.046 |
+ 0.173 |
+ 0.240 |
+ 0.281 |
+ 0.274 |
+ 0.170 |
+ 0.216 |
+ 0.259 |
+ 0.238 |
+ 0.240 |
+ 0.290 |
+ 0.294 |
+ 0.160 |
+ 0.253 |
+ 0.255 |
+ 0.282 |
+ 0.350 |
+ 0.353 |
+ 0.060 |
+ 0.231 |
+ 0.273 |
+ 0.240 |
+ 0.245 |
+ 0.269 |
+ 0.236 |
+ 0.246 |
+ 0.261 |
+ 0.235 |
+ 0.053 |
+ 0.060 |
+ 0.208 |
+ 0.071 |
+ 0.258 |
+ 0.234 |
+ 0.304 |
+ 0.224 |
+ 0.193 |
+ 0.221 |
+ 0.276 |
+ 0.246 |
+ 0.267 |
+ 0.255 |
+ 0.281 |
+ 0.241 |
+ 0.310 |
+ 0.279 |
+ 0.274 |
+ 0.229 |
+ 2147 |
+ OrganismalFitness |
+ REV_HV1H2 |
+ Medium |
+ Virus |
+
+
+ RFAH_ECOLI_Tsuboyama_2023_2LCL |
+ 0.064 |
+ 0.210 |
+ 0.212 |
+ 0.233 |
+ 0.230 |
+ 0.227 |
+ -0.015 |
+ 0.238 |
+ 0.232 |
+ 0.258 |
+ 0.248 |
+ 0.175 |
+ 0.200 |
+ -0.032 |
+ 0.078 |
+ 0.055 |
+ 0.299 |
+ 0.261 |
+ 0.246 |
+ 0.249 |
+ -0.067 |
+ 0.059 |
+ 0.121 |
+ 0.115 |
+ -0.043 |
+ 0.130 |
+ 0.151 |
+ 0.114 |
+ 0.158 |
+ 0.214 |
+ 0.268 |
+ 0.213 |
+ -0.009 |
+ 0.009 |
+ 0.102 |
+ 0.133 |
+ 0.107 |
+ 0.144 |
+ 0.156 |
+ 0.220 |
+ 0.216 |
+ 0.208 |
+ 0.115 |
+ -0.075 |
+ 0.193 |
+ 0.107 |
+ 0.304 |
+ 0.248 |
+ 0.384 |
+ 0.328 |
+ 0.331 |
+ 0.310 |
+ 0.291 |
+ 0.333 |
+ 0.314 |
+ 0.300 |
+ 0.314 |
+ 0.309 |
+ 0.327 |
+ 0.319 |
+ 0.318 |
+ 0.220 |
+ 1326 |
+ Stability |
+ RFAH_ECOLI |
+ High |
+ Prokaryote |
+
+
+ RL20_AQUAE_Tsuboyama_2023_1GYZ |
+ 0.341 |
+ 0.588 |
+ 0.607 |
+ 0.601 |
+ 0.603 |
+ 0.610 |
+ 0.224 |
+ 0.577 |
+ 0.405 |
+ 0.352 |
+ 0.675 |
+ 0.670 |
+ 0.657 |
+ 0.235 |
+ 0.421 |
+ 0.487 |
+ 0.699 |
+ 0.680 |
+ 0.676 |
+ 0.613 |
+ 0.151 |
+ 0.493 |
+ 0.470 |
+ 0.475 |
+ 0.473 |
+ 0.481 |
+ 0.534 |
+ 0.545 |
+ 0.622 |
+ 0.636 |
+ 0.563 |
+ 0.544 |
+ 0.078 |
+ 0.481 |
+ 0.522 |
+ 0.506 |
+ 0.540 |
+ 0.579 |
+ 0.574 |
+ 0.588 |
+ 0.624 |
+ 0.625 |
+ 0.041 |
+ -0.048 |
+ 0.467 |
+ 0.283 |
+ 0.684 |
+ 0.640 |
+ 0.816 |
+ 0.769 |
+ 0.716 |
+ 0.715 |
+ 0.720 |
+ 0.726 |
+ 0.713 |
+ 0.718 |
+ 0.723 |
+ 0.724 |
+ 0.732 |
+ 0.728 |
+ 0.738 |
+ 0.710 |
+ 1461 |
+ Stability |
+ RL20_AQUAE |
+ High |
+ Prokaryote |
+
+
+ RL40A_YEAST_Mavor_2016 |
+ 0.283 |
+ 0.370 |
+ 0.372 |
+ 0.417 |
+ 0.380 |
+ 0.399 |
+ 0.122 |
+ 0.372 |
+ 0.441 |
+ 0.448 |
+ 0.192 |
+ 0.285 |
+ 0.315 |
+ 0.104 |
+ 0.397 |
+ 0.482 |
+ 0.518 |
+ 0.446 |
+ 0.527 |
+ 0.430 |
+ 0.382 |
+ 0.512 |
+ 0.480 |
+ 0.422 |
+ 0.493 |
+ 0.433 |
+ 0.433 |
+ 0.430 |
+ 0.400 |
+ 0.319 |
+ 0.406 |
+ 0.408 |
+ 0.044 |
+ 0.387 |
+ 0.451 |
+ 0.368 |
+ 0.407 |
+ 0.456 |
+ 0.398 |
+ 0.439 |
+ 0.469 |
+ 0.411 |
+ 0.286 |
+ 0.071 |
+ 0.282 |
+ 0.294 |
+ 0.068 |
+ 0.091 |
+ 0.177 |
+ -0.026 |
+ 0.487 |
+ 0.457 |
+ 0.481 |
+ 0.470 |
+ 0.499 |
+ 0.476 |
+ 0.474 |
+ 0.498 |
+ 0.498 |
+ 0.503 |
+ 0.338 |
+ 0.345 |
+ 1253 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2013 |
+ 0.313 |
+ 0.402 |
+ 0.418 |
+ 0.470 |
+ 0.430 |
+ 0.454 |
+ 0.086 |
+ 0.367 |
+ 0.528 |
+ 0.528 |
+ 0.218 |
+ 0.307 |
+ 0.367 |
+ 0.122 |
+ 0.423 |
+ 0.534 |
+ 0.599 |
+ 0.479 |
+ 0.556 |
+ 0.488 |
+ 0.424 |
+ 0.553 |
+ 0.549 |
+ 0.467 |
+ 0.537 |
+ 0.490 |
+ 0.503 |
+ 0.487 |
+ 0.460 |
+ 0.391 |
+ 0.523 |
+ 0.538 |
+ 0.139 |
+ 0.437 |
+ 0.522 |
+ 0.427 |
+ 0.459 |
+ 0.526 |
+ 0.449 |
+ 0.494 |
+ 0.533 |
+ 0.467 |
+ 0.357 |
+ 0.076 |
+ 0.317 |
+ 0.319 |
+ 0.106 |
+ 0.116 |
+ 0.253 |
+ 0.026 |
+ 0.569 |
+ 0.529 |
+ 0.554 |
+ 0.532 |
+ 0.566 |
+ 0.558 |
+ 0.542 |
+ 0.582 |
+ 0.580 |
+ 0.583 |
+ 0.381 |
+ 0.391 |
+ 1195 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2014 |
+ 0.371 |
+ 0.285 |
+ 0.351 |
+ 0.387 |
+ 0.331 |
+ 0.368 |
+ 0.161 |
+ 0.381 |
+ 0.419 |
+ 0.442 |
+ 0.163 |
+ 0.240 |
+ 0.267 |
+ 0.227 |
+ 0.521 |
+ 0.529 |
+ 0.530 |
+ 0.427 |
+ 0.438 |
+ 0.394 |
+ 0.370 |
+ 0.466 |
+ 0.433 |
+ 0.405 |
+ 0.439 |
+ 0.380 |
+ 0.373 |
+ 0.336 |
+ 0.325 |
+ 0.359 |
+ 0.350 |
+ 0.361 |
+ 0.236 |
+ 0.390 |
+ 0.431 |
+ 0.351 |
+ 0.425 |
+ 0.455 |
+ 0.408 |
+ 0.420 |
+ 0.428 |
+ 0.383 |
+ 0.292 |
+ 0.154 |
+ 0.268 |
+ 0.295 |
+ 0.317 |
+ 0.247 |
+ 0.380 |
+ 0.207 |
+ 0.527 |
+ 0.511 |
+ 0.488 |
+ 0.544 |
+ 0.547 |
+ 0.543 |
+ 0.491 |
+ 0.517 |
+ 0.516 |
+ 0.544 |
+ 0.302 |
+ 0.401 |
+ 1380 |
+ Activity |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RNC_ECOLI_Weeks_2023 |
+ 0.544 |
+ 0.592 |
+ 0.570 |
+ 0.584 |
+ 0.594 |
+ 0.595 |
+ 0.058 |
+ 0.423 |
+ 0.591 |
+ 0.591 |
+ 0.590 |
+ 0.579 |
+ 0.593 |
+ 0.064 |
+ 0.557 |
+ 0.588 |
+ 0.596 |
+ 0.595 |
+ 0.592 |
+ 0.586 |
+ 0.554 |
+ 0.570 |
+ 0.509 |
+ 0.499 |
+ 0.557 |
+ 0.584 |
+ 0.583 |
+ 0.586 |
+ 0.571 |
+ 0.594 |
+ 0.578 |
+ 0.539 |
+ 0.187 |
+ 0.538 |
+ 0.540 |
+ 0.431 |
+ 0.581 |
+ 0.587 |
+ 0.545 |
+ 0.617 |
+ 0.616 |
+ 0.606 |
+ 0.529 |
+ 0.060 |
+ 0.588 |
+ 0.567 |
+ 0.299 |
+ 0.524 |
+ 0.281 |
+ 0.177 |
+ 0.541 |
+ 0.530 |
+ 0.520 |
+ 0.553 |
+ 0.544 |
+ 0.533 |
+ 0.548 |
+ 0.543 |
+ 0.544 |
+ 0.554 |
+ 0.608 |
+ 0.533 |
+ 4277 |
+ Activity |
+ RNC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69 |
+ 0.627 |
+ 0.681 |
+ 0.655 |
+ 0.673 |
+ 0.606 |
+ 0.644 |
+ 0.670 |
+ 0.593 |
+ 0.617 |
+ 0.659 |
+ 0.711 |
+ 0.744 |
+ 0.750 |
+ 0.722 |
+ 0.741 |
+ 0.752 |
+ 0.708 |
+ 0.687 |
+ 0.630 |
+ 0.697 |
+ 0.699 |
+ 0.750 |
+ 0.735 |
+ 0.682 |
+ 0.742 |
+ 0.731 |
+ 0.718 |
+ 0.723 |
+ 0.660 |
+ 0.743 |
+ 0.666 |
+ 0.609 |
+ 0.621 |
+ 0.633 |
+ 0.738 |
+ 0.716 |
+ 0.720 |
+ 0.747 |
+ 0.737 |
+ 0.699 |
+ 0.693 |
+ 0.690 |
+ 0.749 |
+ 0.674 |
+ 0.735 |
+ 0.728 |
+ 0.696 |
+ 0.640 |
+ 0.793 |
+ 0.723 |
+ 0.702 |
+ 0.672 |
+ 0.687 |
+ 0.700 |
+ 0.699 |
+ 0.690 |
+ 0.711 |
+ 0.702 |
+ 0.711 |
+ 0.708 |
+ 0.756 |
+ 0.764 |
+ 1459 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_high-expression |
+ 0.288 |
+ 0.398 |
+ 0.484 |
+ 0.493 |
+ 0.450 |
+ 0.447 |
+ 0.240 |
+ 0.361 |
+ 0.457 |
+ 0.470 |
+ 0.442 |
+ 0.461 |
+ 0.514 |
+ 0.290 |
+ 0.289 |
+ 0.401 |
+ 0.549 |
+ 0.520 |
+ 0.509 |
+ 0.434 |
+ 0.176 |
+ 0.290 |
+ 0.351 |
+ 0.386 |
+ 0.223 |
+ 0.349 |
+ 0.323 |
+ 0.305 |
+ 0.467 |
+ 0.383 |
+ 0.535 |
+ 0.472 |
+ 0.110 |
+ 0.149 |
+ 0.338 |
+ 0.457 |
+ 0.278 |
+ 0.362 |
+ 0.464 |
+ 0.416 |
+ 0.449 |
+ 0.491 |
+ 0.297 |
+ 0.314 |
+ 0.418 |
+ 0.328 |
+ 0.250 |
+ 0.407 |
+ 0.376 |
+ 0.176 |
+ 0.544 |
+ 0.520 |
+ 0.514 |
+ 0.532 |
+ 0.547 |
+ 0.555 |
+ 0.533 |
+ 0.531 |
+ 0.538 |
+ 0.553 |
+ 0.523 |
+ 0.398 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_low-expression |
+ 0.246 |
+ 0.412 |
+ 0.475 |
+ 0.498 |
+ 0.462 |
+ 0.455 |
+ 0.165 |
+ 0.466 |
+ 0.450 |
+ 0.468 |
+ 0.488 |
+ 0.466 |
+ 0.481 |
+ 0.253 |
+ 0.263 |
+ 0.363 |
+ 0.523 |
+ 0.579 |
+ 0.591 |
+ 0.408 |
+ 0.195 |
+ 0.338 |
+ 0.380 |
+ 0.415 |
+ 0.182 |
+ 0.366 |
+ 0.321 |
+ 0.299 |
+ 0.562 |
+ 0.447 |
+ 0.635 |
+ 0.611 |
+ 0.119 |
+ 0.135 |
+ 0.322 |
+ 0.463 |
+ 0.235 |
+ 0.323 |
+ 0.454 |
+ 0.420 |
+ 0.447 |
+ 0.498 |
+ 0.269 |
+ 0.271 |
+ 0.357 |
+ 0.280 |
+ 0.262 |
+ 0.375 |
+ 0.393 |
+ 0.266 |
+ 0.520 |
+ 0.488 |
+ 0.510 |
+ 0.519 |
+ 0.503 |
+ 0.530 |
+ 0.522 |
+ 0.511 |
+ 0.494 |
+ 0.528 |
+ 0.518 |
+ 0.344 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32 |
+ 0.393 |
+ 0.310 |
+ 0.336 |
+ 0.328 |
+ 0.321 |
+ 0.326 |
+ 0.178 |
+ 0.188 |
+ 0.387 |
+ 0.389 |
+ 0.468 |
+ 0.394 |
+ 0.390 |
+ 0.289 |
+ 0.326 |
+ 0.510 |
+ 0.409 |
+ 0.372 |
+ 0.330 |
+ 0.291 |
+ 0.199 |
+ 0.330 |
+ 0.317 |
+ 0.323 |
+ 0.297 |
+ 0.340 |
+ 0.416 |
+ 0.362 |
+ 0.367 |
+ 0.375 |
+ 0.413 |
+ 0.396 |
+ 0.117 |
+ 0.423 |
+ 0.325 |
+ 0.306 |
+ 0.453 |
+ 0.377 |
+ 0.365 |
+ 0.395 |
+ 0.348 |
+ 0.320 |
+ 0.333 |
+ 0.262 |
+ 0.443 |
+ 0.381 |
+ 0.582 |
+ 0.440 |
+ 0.607 |
+ 0.550 |
+ 0.396 |
+ 0.403 |
+ 0.403 |
+ 0.399 |
+ 0.407 |
+ 0.377 |
+ 0.415 |
+ 0.420 |
+ 0.395 |
+ 0.413 |
+ 0.433 |
+ 0.646 |
+ 1195 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance |
+ 0.507 |
+ 0.550 |
+ 0.564 |
+ 0.573 |
+ 0.584 |
+ 0.597 |
+ 0.380 |
+ 0.511 |
+ 0.608 |
+ 0.607 |
+ 0.597 |
+ 0.621 |
+ 0.655 |
+ 0.484 |
+ 0.537 |
+ 0.558 |
+ 0.626 |
+ 0.599 |
+ 0.566 |
+ 0.038 |
+ 0.492 |
+ 0.596 |
+ 0.606 |
+ 0.590 |
+ 0.526 |
+ 0.611 |
+ 0.629 |
+ 0.600 |
+ 0.557 |
+ 0.582 |
+ 0.571 |
+ 0.444 |
+ 0.330 |
+ 0.522 |
+ 0.596 |
+ 0.590 |
+ 0.566 |
+ 0.627 |
+ 0.631 |
+ 0.618 |
+ 0.633 |
+ 0.633 |
+ 0.504 |
+ 0.274 |
+ 0.584 |
+ 0.555 |
+ 0.440 |
+ 0.547 |
+ 0.534 |
+ 0.176 |
+ 0.582 |
+ 0.571 |
+ 0.574 |
+ 0.594 |
+ 0.607 |
+ 0.597 |
+ 0.594 |
+ 0.599 |
+ 0.592 |
+ 0.609 |
+ 0.630 |
+ 0.580 |
+ 9803 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity |
+ 0.450 |
+ 0.517 |
+ 0.557 |
+ 0.570 |
+ 0.553 |
+ 0.566 |
+ 0.347 |
+ 0.508 |
+ 0.562 |
+ 0.557 |
+ 0.553 |
+ 0.572 |
+ 0.614 |
+ 0.448 |
+ 0.480 |
+ 0.477 |
+ 0.575 |
+ 0.547 |
+ 0.525 |
+ 0.030 |
+ 0.488 |
+ 0.579 |
+ 0.570 |
+ 0.561 |
+ 0.526 |
+ 0.584 |
+ 0.597 |
+ 0.567 |
+ 0.533 |
+ 0.578 |
+ 0.596 |
+ 0.508 |
+ 0.299 |
+ 0.504 |
+ 0.577 |
+ 0.568 |
+ 0.527 |
+ 0.590 |
+ 0.591 |
+ 0.586 |
+ 0.604 |
+ 0.601 |
+ 0.458 |
+ 0.238 |
+ 0.561 |
+ 0.501 |
+ 0.408 |
+ 0.527 |
+ 0.503 |
+ 0.169 |
+ 0.542 |
+ 0.520 |
+ 0.514 |
+ 0.539 |
+ 0.560 |
+ 0.542 |
+ 0.542 |
+ 0.559 |
+ 0.552 |
+ 0.560 |
+ 0.542 |
+ 0.500 |
+ 10094 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB |
+ 0.137 |
+ 0.201 |
+ 0.394 |
+ 0.373 |
+ 0.407 |
+ 0.399 |
+ 0.426 |
+ 0.503 |
+ 0.410 |
+ 0.430 |
+ 0.481 |
+ 0.472 |
+ 0.508 |
+ 0.238 |
+ 0.496 |
+ 0.496 |
+ 0.512 |
+ 0.520 |
+ 0.406 |
+ 0.430 |
+ 0.491 |
+ 0.458 |
+ 0.449 |
+ 0.479 |
+ 0.524 |
+ 0.519 |
+ 0.476 |
+ 0.491 |
+ 0.482 |
+ 0.518 |
+ 0.469 |
+ 0.419 |
+ 0.353 |
+ 0.424 |
+ 0.497 |
+ 0.499 |
+ 0.431 |
+ 0.498 |
+ 0.489 |
+ 0.411 |
+ 0.443 |
+ 0.444 |
+ 0.503 |
+ -0.042 |
+ 0.545 |
+ 0.531 |
+ 0.336 |
+ 0.332 |
+ 0.443 |
+ 0.502 |
+ 0.454 |
+ 0.348 |
+ 0.234 |
+ 0.367 |
+ 0.362 |
+ 0.395 |
+ 0.392 |
+ 0.445 |
+ 0.426 |
+ 0.387 |
+ 0.575 |
+ 0.607 |
+ 965 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SBI_STAAM_Tsuboyama_2023_2JVG |
+ 0.232 |
+ 0.225 |
+ 0.380 |
+ 0.345 |
+ 0.440 |
+ 0.403 |
+ 0.205 |
+ 0.178 |
+ 0.481 |
+ 0.533 |
+ 0.323 |
+ 0.268 |
+ 0.297 |
+ 0.230 |
+ 0.266 |
+ 0.376 |
+ 0.573 |
+ 0.658 |
+ 0.318 |
+ 0.359 |
+ 0.228 |
+ 0.263 |
+ 0.259 |
+ 0.235 |
+ 0.210 |
+ 0.228 |
+ 0.221 |
+ 0.245 |
+ 0.339 |
+ 0.549 |
+ 0.582 |
+ 0.487 |
+ 0.169 |
+ 0.195 |
+ 0.200 |
+ 0.225 |
+ 0.297 |
+ 0.305 |
+ 0.314 |
+ 0.355 |
+ 0.368 |
+ 0.382 |
+ 0.262 |
+ 0.228 |
+ 0.316 |
+ 0.305 |
+ 0.664 |
+ 0.627 |
+ 0.687 |
+ 0.616 |
+ 0.529 |
+ 0.526 |
+ 0.561 |
+ 0.550 |
+ 0.545 |
+ 0.547 |
+ 0.569 |
+ 0.565 |
+ 0.558 |
+ 0.561 |
+ 0.683 |
+ 0.571 |
+ 1025 |
+ Stability |
+ SBI_STAAM |
+ Medium |
+ Prokaryote |
+
+
+ SC6A4_HUMAN_Young_2021 |
+ 0.387 |
+ 0.456 |
+ 0.423 |
+ 0.433 |
+ 0.504 |
+ 0.522 |
+ 0.352 |
+ 0.507 |
+ 0.562 |
+ 0.574 |
+ 0.545 |
+ 0.531 |
+ 0.542 |
+ 0.159 |
+ 0.327 |
+ 0.502 |
+ 0.543 |
+ 0.548 |
+ 0.524 |
+ 0.555 |
+ 0.498 |
+ 0.511 |
+ 0.500 |
+ 0.496 |
+ 0.512 |
+ 0.520 |
+ 0.518 |
+ 0.525 |
+ 0.511 |
+ 0.541 |
+ 0.463 |
+ 0.360 |
+ 0.342 |
+ 0.513 |
+ 0.512 |
+ 0.491 |
+ 0.530 |
+ 0.542 |
+ 0.530 |
+ 0.549 |
+ 0.555 |
+ 0.545 |
+ 0.489 |
+ 0.112 |
+ 0.549 |
+ 0.531 |
+ 0.467 |
+ 0.527 |
+ 0.501 |
+ 0.182 |
+ 0.516 |
+ 0.526 |
+ 0.535 |
+ 0.543 |
+ 0.545 |
+ 0.536 |
+ 0.536 |
+ 0.538 |
+ 0.526 |
+ 0.546 |
+ 0.578 |
+ 0.528 |
+ 11576 |
+ Activity |
+ SC6A4_HUMAN |
+ Medium |
+ Human |
+
+
+ SCIN_STAAR_Tsuboyama_2023_2QFF |
+ 0.083 |
+ 0.140 |
+ 0.261 |
+ 0.264 |
+ 0.300 |
+ 0.272 |
+ 0.154 |
+ 0.065 |
+ 0.271 |
+ 0.258 |
+ 0.299 |
+ 0.298 |
+ 0.269 |
+ 0.208 |
+ 0.305 |
+ 0.279 |
+ 0.346 |
+ 0.406 |
+ 0.414 |
+ 0.250 |
+ 0.106 |
+ 0.162 |
+ 0.134 |
+ 0.144 |
+ 0.155 |
+ 0.194 |
+ 0.152 |
+ 0.200 |
+ 0.215 |
+ 0.229 |
+ 0.296 |
+ 0.208 |
+ -0.012 |
+ 0.072 |
+ 0.104 |
+ 0.187 |
+ 0.132 |
+ 0.132 |
+ 0.156 |
+ 0.245 |
+ 0.235 |
+ 0.255 |
+ 0.231 |
+ 0.177 |
+ 0.300 |
+ 0.269 |
+ 0.525 |
+ 0.527 |
+ 0.517 |
+ 0.477 |
+ 0.438 |
+ 0.442 |
+ 0.437 |
+ 0.480 |
+ 0.422 |
+ 0.478 |
+ 0.430 |
+ 0.430 |
+ 0.391 |
+ 0.454 |
+ 0.620 |
+ 0.513 |
+ 1212 |
+ Stability |
+ SCIN_STAAR |
+ High |
+ Prokaryote |
+
+
+ SCN5A_HUMAN_Glazer_2019 |
+ 0.130 |
+ 0.131 |
+ 0.162 |
+ 0.162 |
+ 0.153 |
+ 0.158 |
+ 0.130 |
+ -0.014 |
+ 0.163 |
+ 0.180 |
+ 0.141 |
+ 0.217 |
+ 0.135 |
+ 0.216 |
+ 0.177 |
+ 0.156 |
+ 0.183 |
+ 0.152 |
+ 0.125 |
+ 0.126 |
+ 0.106 |
+ 0.143 |
+ 0.151 |
+ 0.127 |
+ 0.124 |
+ 0.106 |
+ 0.093 |
+ 0.107 |
+ 0.167 |
+ 0.144 |
+ 0.186 |
+ 0.181 |
+ 0.095 |
+ 0.075 |
+ 0.068 |
+ 0.069 |
+ 0.095 |
+ 0.086 |
+ 0.086 |
+ 0.140 |
+ 0.140 |
+ 0.152 |
+ 0.086 |
+ 0.137 |
+ 0.165 |
+ 0.154 |
+ 0.030 |
+ 0.092 |
+ 0.053 |
+ 0.014 |
+ 0.127 |
+ 0.160 |
+ 0.157 |
+ 0.127 |
+ 0.105 |
+ 0.135 |
+ 0.143 |
+ 0.136 |
+ 0.169 |
+ 0.146 |
+ 0.232 |
+ 0.168 |
+ 224 |
+ OrganismalFitness |
+ SCN5A_HUMAN |
+ Medium |
+ Human |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0 |
+ 0.631 |
+ 0.646 |
+ 0.615 |
+ 0.619 |
+ 0.647 |
+ 0.650 |
+ 0.001 |
+ 0.303 |
+ 0.456 |
+ 0.472 |
+ 0.606 |
+ 0.418 |
+ 0.466 |
+ 0.249 |
+ 0.274 |
+ 0.608 |
+ 0.594 |
+ 0.598 |
+ 0.596 |
+ 0.530 |
+ 0.026 |
+ 0.067 |
+ 0.003 |
+ 0.238 |
+ 0.081 |
+ 0.123 |
+ 0.045 |
+ 0.119 |
+ 0.214 |
+ 0.640 |
+ 0.604 |
+ 0.581 |
+ -0.165 |
+ 0.010 |
+ 0.260 |
+ 0.107 |
+ 0.607 |
+ 0.613 |
+ 0.368 |
+ 0.652 |
+ 0.650 |
+ 0.486 |
+ 0.037 |
+ -0.079 |
+ 0.336 |
+ 0.011 |
+ 0.434 |
+ 0.410 |
+ 0.608 |
+ 0.597 |
+ 0.650 |
+ 0.641 |
+ 0.588 |
+ 0.647 |
+ 0.630 |
+ 0.613 |
+ 0.622 |
+ 0.623 |
+ 0.653 |
+ 0.633 |
+ 0.639 |
+ 0.636 |
+ 2770 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SERC_HUMAN_Xie_2023 |
+ 0.375 |
+ 0.483 |
+ 0.544 |
+ 0.544 |
+ 0.520 |
+ 0.530 |
+ 0.013 |
+ 0.456 |
+ 0.570 |
+ 0.576 |
+ 0.518 |
+ 0.541 |
+ 0.547 |
+ 0.183 |
+ 0.407 |
+ 0.529 |
+ 0.558 |
+ 0.577 |
+ 0.543 |
+ 0.535 |
+ 0.492 |
+ 0.511 |
+ 0.513 |
+ 0.484 |
+ 0.500 |
+ 0.530 |
+ 0.518 |
+ 0.534 |
+ 0.504 |
+ 0.506 |
+ 0.516 |
+ 0.469 |
+ 0.192 |
+ 0.513 |
+ 0.528 |
+ 0.519 |
+ 0.528 |
+ 0.540 |
+ 0.544 |
+ 0.548 |
+ 0.551 |
+ 0.552 |
+ 0.319 |
+ 0.031 |
+ 0.540 |
+ 0.460 |
+ 0.382 |
+ 0.461 |
+ 0.352 |
+ 0.189 |
+ 0.539 |
+ 0.524 |
+ 0.540 |
+ 0.551 |
+ 0.558 |
+ 0.548 |
+ 0.553 |
+ 0.550 |
+ 0.543 |
+ 0.561 |
+ 0.562 |
+ 0.478 |
+ 1914 |
+ OrganismalFitness |
+ SERC_HUMAN |
+ High |
+ Human |
+
+
+ SHOC2_HUMAN_Kwon_2022 |
+ 0.208 |
+ 0.328 |
+ 0.378 |
+ 0.383 |
+ 0.370 |
+ 0.379 |
+ 0.226 |
+ 0.360 |
+ 0.424 |
+ 0.422 |
+ 0.395 |
+ 0.384 |
+ 0.410 |
+ 0.242 |
+ 0.249 |
+ 0.267 |
+ 0.426 |
+ 0.388 |
+ 0.290 |
+ 0.388 |
+ 0.262 |
+ 0.363 |
+ 0.385 |
+ 0.355 |
+ 0.267 |
+ 0.394 |
+ 0.390 |
+ 0.400 |
+ 0.358 |
+ 0.407 |
+ 0.416 |
+ 0.366 |
+ 0.153 |
+ 0.254 |
+ 0.308 |
+ 0.376 |
+ 0.264 |
+ 0.302 |
+ 0.367 |
+ 0.367 |
+ 0.367 |
+ 0.396 |
+ 0.262 |
+ 0.236 |
+ 0.421 |
+ 0.291 |
+ 0.303 |
+ 0.358 |
+ 0.286 |
+ 0.095 |
+ 0.373 |
+ 0.367 |
+ 0.387 |
+ 0.390 |
+ 0.396 |
+ 0.398 |
+ 0.381 |
+ 0.392 |
+ 0.384 |
+ 0.398 |
+ 0.322 |
+ 0.286 |
+ 10972 |
+ OrganismalFitness |
+ SHOC2_HUMAN |
+ Medium |
+ Human |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK |
+ 0.225 |
+ 0.241 |
+ 0.235 |
+ 0.242 |
+ 0.282 |
+ 0.273 |
+ 0.369 |
+ 0.170 |
+ 0.210 |
+ 0.246 |
+ 0.254 |
+ 0.335 |
+ 0.336 |
+ 0.298 |
+ 0.396 |
+ 0.333 |
+ 0.322 |
+ 0.262 |
+ 0.261 |
+ 0.223 |
+ 0.166 |
+ 0.167 |
+ 0.162 |
+ 0.190 |
+ 0.264 |
+ 0.247 |
+ 0.202 |
+ 0.253 |
+ 0.189 |
+ 0.231 |
+ 0.087 |
+ 0.060 |
+ 0.222 |
+ 0.133 |
+ 0.267 |
+ 0.229 |
+ 0.217 |
+ 0.241 |
+ 0.234 |
+ 0.275 |
+ 0.274 |
+ 0.262 |
+ 0.271 |
+ 0.150 |
+ 0.208 |
+ 0.356 |
+ 0.526 |
+ 0.346 |
+ 0.530 |
+ 0.464 |
+ 0.333 |
+ 0.380 |
+ 0.348 |
+ 0.333 |
+ 0.326 |
+ 0.385 |
+ 0.333 |
+ 0.334 |
+ 0.334 |
+ 0.356 |
+ 0.333 |
+ 0.466 |
+ 1010 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPA_STAAU_Tsuboyama_2023_1LP1 |
+ 0.432 |
+ 0.509 |
+ 0.518 |
+ 0.501 |
+ 0.552 |
+ 0.549 |
+ -0.093 |
+ 0.495 |
+ 0.409 |
+ 0.408 |
+ 0.393 |
+ -0.041 |
+ 0.001 |
+ -0.032 |
+ -0.033 |
+ -0.064 |
+ 0.005 |
+ -0.010 |
+ 0.020 |
+ 0.514 |
+ -0.154 |
+ 0.032 |
+ 0.205 |
+ 0.105 |
+ -0.081 |
+ -0.041 |
+ 0.057 |
+ 0.338 |
+ 0.428 |
+ 0.564 |
+ 0.432 |
+ 0.442 |
+ -0.034 |
+ -0.011 |
+ 0.001 |
+ 0.066 |
+ 0.437 |
+ 0.430 |
+ 0.427 |
+ 0.548 |
+ 0.547 |
+ 0.536 |
+ -0.137 |
+ -0.087 |
+ -0.071 |
+ -0.009 |
+ 0.512 |
+ 0.453 |
+ 0.705 |
+ 0.560 |
+ 0.520 |
+ 0.511 |
+ 0.493 |
+ 0.534 |
+ 0.498 |
+ 0.515 |
+ 0.510 |
+ 0.501 |
+ 0.504 |
+ 0.513 |
+ 0.546 |
+ 0.414 |
+ 2105 |
+ Stability |
+ SPA_STAAU |
+ Medium |
+ Prokaryote |
+
+
+ SPG1_STRSG_Olson_2014 |
+ 0.239 |
+ 0.282 |
+ -0.004 |
+ 0.006 |
+ 0.247 |
+ 0.272 |
+ -0.041 |
+ 0.017 |
+ -0.038 |
+ 0.171 |
+ 0.305 |
+ 0.237 |
+ 0.192 |
+ 0.280 |
+ 0.232 |
+ 0.264 |
+ 0.301 |
+ 0.251 |
+ 0.337 |
+ 0.278 |
+ 0.252 |
+ 0.216 |
+ 0.214 |
+ 0.208 |
+ 0.232 |
+ 0.222 |
+ 0.218 |
+ 0.239 |
+ 0.354 |
+ 0.283 |
+ 0.478 |
+ 0.494 |
+ 0.111 |
+ 0.223 |
+ 0.175 |
+ 0.279 |
+ 0.243 |
+ 0.204 |
+ 0.289 |
+ 0.229 |
+ 0.188 |
+ 0.290 |
+ -0.092 |
+ -0.112 |
+ 0.158 |
+ 0.004 |
+ 0.362 |
+ 0.306 |
+ 0.423 |
+ 0.147 |
+ 0.380 |
+ 0.315 |
+ 0.364 |
+ 0.414 |
+ 0.414 |
+ 0.360 |
+ 0.418 |
+ 0.417 |
+ 0.410 |
+ 0.407 |
+ 0.387 |
+ 0.374 |
+ 536962 |
+ Binding |
+ SPG1_STRSG |
+ Low |
+ Prokaryote |
+
+
+ SPG1_STRSG_Wu_2016 |
+ -0.051 |
+ 0.025 |
+ 0.048 |
+ 0.048 |
+ 0.036 |
+ 0.042 |
+ 0.036 |
+ -0.048 |
+ 0.170 |
+ 0.171 |
+ 0.121 |
+ 0.086 |
+ 0.059 |
+ 0.030 |
+ 0.097 |
+ 0.118 |
+ 0.128 |
+ 0.165 |
+ 0.199 |
+ 0.083 |
+ -0.012 |
+ -0.033 |
+ -0.051 |
+ -0.023 |
+ -0.061 |
+ 0.016 |
+ -0.014 |
+ -0.031 |
+ -0.031 |
+ 0.102 |
+ 0.146 |
+ 0.125 |
+ -0.083 |
+ 0.001 |
+ -0.085 |
+ -0.024 |
+ 0.039 |
+ 0.013 |
+ 0.021 |
+ 0.044 |
+ 0.027 |
+ 0.027 |
+ -0.013 |
+ -0.099 |
+ 0.040 |
+ -0.026 |
+ 0.091 |
+ 0.023 |
+ 0.180 |
+ 0.089 |
+ 0.144 |
+ 0.136 |
+ 0.132 |
+ 0.176 |
+ 0.156 |
+ 0.131 |
+ 0.194 |
+ 0.167 |
+ 0.177 |
+ 0.159 |
+ 0.171 |
+ 0.114 |
+ 149360 |
+ Binding |
+ SPG1_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS |
+ 0.439 |
+ 0.517 |
+ 0.496 |
+ 0.539 |
+ 0.569 |
+ 0.565 |
+ 0.325 |
+ 0.309 |
+ 0.543 |
+ 0.478 |
+ 0.501 |
+ 0.437 |
+ 0.409 |
+ 0.366 |
+ 0.432 |
+ 0.451 |
+ 0.500 |
+ 0.504 |
+ 0.528 |
+ 0.507 |
+ 0.388 |
+ 0.433 |
+ 0.436 |
+ 0.392 |
+ 0.343 |
+ 0.465 |
+ 0.356 |
+ 0.485 |
+ 0.501 |
+ 0.614 |
+ 0.561 |
+ 0.536 |
+ -0.192 |
+ 0.356 |
+ 0.440 |
+ 0.462 |
+ 0.294 |
+ 0.335 |
+ 0.382 |
+ 0.561 |
+ 0.562 |
+ 0.569 |
+ 0.253 |
+ 0.089 |
+ 0.286 |
+ 0.240 |
+ 0.447 |
+ 0.369 |
+ 0.704 |
+ 0.570 |
+ 0.564 |
+ 0.566 |
+ 0.584 |
+ 0.592 |
+ 0.569 |
+ 0.595 |
+ 0.588 |
+ 0.610 |
+ 0.584 |
+ 0.589 |
+ 0.598 |
+ 0.549 |
+ 1451 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPIKE_SARS2_Starr_2020_binding |
+ 0.158 |
+ 0.205 |
+ 0.062 |
+ 0.151 |
+ 0.249 |
+ 0.252 |
+ -0.046 |
+ 0.369 |
+ 0.356 |
+ 0.360 |
+ -0.002 |
+ -0.064 |
+ -0.039 |
+ -0.049 |
+ -0.036 |
+ -0.022 |
+ -0.018 |
+ -0.008 |
+ 0.038 |
+ 0.337 |
+ 0.305 |
+ 0.368 |
+ 0.357 |
+ 0.364 |
+ 0.384 |
+ 0.347 |
+ 0.315 |
+ 0.378 |
+ 0.319 |
+ 0.247 |
+ 0.324 |
+ 0.357 |
+ 0.171 |
+ 0.338 |
+ 0.328 |
+ 0.355 |
+ 0.334 |
+ 0.325 |
+ 0.317 |
+ 0.356 |
+ 0.346 |
+ 0.341 |
+ -0.036 |
+ -0.013 |
+ -0.022 |
+ -0.008 |
+ 0.517 |
+ 0.483 |
+ 0.405 |
+ 0.160 |
+ 0.224 |
+ 0.241 |
+ 0.226 |
+ 0.297 |
+ 0.251 |
+ 0.301 |
+ 0.258 |
+ 0.279 |
+ 0.228 |
+ 0.279 |
+ 0.357 |
+ 0.159 |
+ 3802 |
+ Binding |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPIKE_SARS2_Starr_2020_expression |
+ 0.200 |
+ 0.314 |
+ 0.165 |
+ 0.279 |
+ 0.445 |
+ 0.450 |
+ -0.016 |
+ 0.352 |
+ 0.413 |
+ 0.446 |
+ 0.050 |
+ -0.023 |
+ 0.004 |
+ -0.026 |
+ 0.001 |
+ 0.010 |
+ 0.012 |
+ 0.018 |
+ 0.072 |
+ 0.355 |
+ 0.317 |
+ 0.382 |
+ 0.374 |
+ 0.387 |
+ 0.379 |
+ 0.367 |
+ 0.322 |
+ 0.397 |
+ 0.328 |
+ 0.359 |
+ 0.313 |
+ 0.346 |
+ 0.178 |
+ 0.344 |
+ 0.346 |
+ 0.383 |
+ 0.361 |
+ 0.361 |
+ 0.367 |
+ 0.451 |
+ 0.455 |
+ 0.475 |
+ 0.005 |
+ 0.022 |
+ 0.024 |
+ 0.041 |
+ 0.561 |
+ 0.533 |
+ 0.465 |
+ 0.187 |
+ 0.321 |
+ 0.354 |
+ 0.330 |
+ 0.422 |
+ 0.379 |
+ 0.413 |
+ 0.384 |
+ 0.392 |
+ 0.363 |
+ 0.405 |
+ 0.498 |
+ 0.266 |
+ 3798 |
+ Expression |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD |
+ 0.572 |
+ 0.613 |
+ 0.647 |
+ 0.599 |
+ 0.632 |
+ 0.615 |
+ 0.230 |
+ 0.524 |
+ 0.552 |
+ 0.593 |
+ 0.637 |
+ 0.649 |
+ 0.641 |
+ -0.133 |
+ 0.619 |
+ 0.627 |
+ 0.637 |
+ 0.726 |
+ 0.657 |
+ 0.624 |
+ 0.578 |
+ 0.643 |
+ 0.604 |
+ 0.589 |
+ 0.589 |
+ 0.613 |
+ 0.652 |
+ 0.563 |
+ 0.601 |
+ 0.621 |
+ 0.631 |
+ 0.632 |
+ 0.457 |
+ 0.508 |
+ 0.570 |
+ 0.548 |
+ 0.629 |
+ 0.641 |
+ 0.621 |
+ 0.623 |
+ 0.628 |
+ 0.618 |
+ 0.488 |
+ -0.221 |
+ 0.465 |
+ 0.498 |
+ 0.421 |
+ 0.401 |
+ 0.620 |
+ 0.623 |
+ 0.693 |
+ 0.679 |
+ 0.683 |
+ 0.685 |
+ 0.696 |
+ 0.685 |
+ 0.676 |
+ 0.674 |
+ 0.696 |
+ 0.691 |
+ 0.667 |
+ 0.526 |
+ 3201 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU |
+ 0.399 |
+ 0.522 |
+ 0.597 |
+ 0.610 |
+ 0.604 |
+ 0.604 |
+ 0.086 |
+ 0.449 |
+ 0.565 |
+ 0.592 |
+ 0.514 |
+ 0.373 |
+ 0.503 |
+ 0.186 |
+ 0.442 |
+ 0.529 |
+ 0.618 |
+ 0.640 |
+ 0.527 |
+ 0.607 |
+ 0.218 |
+ 0.474 |
+ 0.490 |
+ 0.505 |
+ 0.345 |
+ 0.544 |
+ 0.569 |
+ 0.506 |
+ 0.538 |
+ 0.623 |
+ 0.566 |
+ 0.526 |
+ 0.155 |
+ 0.194 |
+ 0.444 |
+ 0.518 |
+ 0.502 |
+ 0.540 |
+ 0.589 |
+ 0.614 |
+ 0.604 |
+ 0.625 |
+ 0.209 |
+ 0.119 |
+ 0.582 |
+ 0.583 |
+ 0.651 |
+ 0.626 |
+ 0.629 |
+ 0.619 |
+ 0.626 |
+ 0.582 |
+ 0.635 |
+ 0.620 |
+ 0.634 |
+ 0.628 |
+ 0.634 |
+ 0.610 |
+ 0.603 |
+ 0.639 |
+ 0.682 |
+ 0.514 |
+ 707 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88 |
+ 0.596 |
+ 0.658 |
+ 0.684 |
+ 0.681 |
+ 0.693 |
+ 0.684 |
+ -0.190 |
+ 0.508 |
+ 0.638 |
+ 0.610 |
+ 0.702 |
+ 0.639 |
+ 0.654 |
+ -0.267 |
+ 0.662 |
+ 0.702 |
+ 0.675 |
+ 0.698 |
+ 0.681 |
+ 0.634 |
+ -0.109 |
+ 0.377 |
+ 0.382 |
+ 0.504 |
+ 0.493 |
+ 0.575 |
+ 0.607 |
+ 0.608 |
+ 0.610 |
+ 0.665 |
+ 0.674 |
+ 0.635 |
+ 0.359 |
+ 0.039 |
+ 0.189 |
+ 0.368 |
+ 0.667 |
+ 0.669 |
+ 0.673 |
+ 0.685 |
+ 0.689 |
+ 0.680 |
+ 0.575 |
+ -0.296 |
+ 0.703 |
+ 0.618 |
+ 0.214 |
+ 0.637 |
+ 0.731 |
+ 0.716 |
+ 0.677 |
+ 0.673 |
+ 0.670 |
+ 0.692 |
+ 0.696 |
+ 0.680 |
+ 0.668 |
+ 0.669 |
+ 0.685 |
+ 0.684 |
+ 0.788 |
+ 0.744 |
+ 1583 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W |
+ 0.360 |
+ 0.511 |
+ 0.696 |
+ 0.694 |
+ 0.723 |
+ 0.718 |
+ 0.549 |
+ 0.362 |
+ 0.743 |
+ 0.742 |
+ 0.628 |
+ 0.664 |
+ 0.700 |
+ 0.498 |
+ 0.713 |
+ 0.731 |
+ 0.734 |
+ 0.673 |
+ 0.694 |
+ 0.653 |
+ 0.679 |
+ 0.649 |
+ 0.653 |
+ 0.620 |
+ 0.642 |
+ 0.637 |
+ 0.659 |
+ 0.630 |
+ 0.607 |
+ 0.694 |
+ 0.696 |
+ 0.658 |
+ 0.577 |
+ 0.597 |
+ 0.637 |
+ 0.645 |
+ 0.660 |
+ 0.697 |
+ 0.705 |
+ 0.701 |
+ 0.727 |
+ 0.728 |
+ 0.388 |
+ 0.084 |
+ 0.538 |
+ 0.532 |
+ 0.579 |
+ 0.538 |
+ 0.640 |
+ 0.661 |
+ 0.751 |
+ 0.731 |
+ 0.734 |
+ 0.736 |
+ 0.746 |
+ 0.741 |
+ 0.735 |
+ 0.735 |
+ 0.740 |
+ 0.750 |
+ 0.707 |
+ 0.689 |
+ 1556 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ SRC_HUMAN_Ahler_2019 |
+ 0.526 |
+ 0.508 |
+ 0.465 |
+ 0.465 |
+ 0.496 |
+ 0.507 |
+ 0.532 |
+ 0.440 |
+ 0.514 |
+ 0.532 |
+ 0.469 |
+ 0.561 |
+ 0.585 |
+ 0.447 |
+ 0.486 |
+ 0.490 |
+ 0.555 |
+ 0.515 |
+ 0.467 |
+ 0.484 |
+ 0.439 |
+ 0.413 |
+ 0.421 |
+ 0.373 |
+ 0.437 |
+ 0.460 |
+ 0.438 |
+ 0.422 |
+ 0.330 |
+ 0.542 |
+ 0.578 |
+ 0.574 |
+ 0.431 |
+ 0.410 |
+ 0.411 |
+ 0.348 |
+ 0.502 |
+ 0.502 |
+ 0.492 |
+ 0.521 |
+ 0.520 |
+ 0.516 |
+ 0.567 |
+ 0.397 |
+ 0.487 |
+ 0.546 |
+ 0.309 |
+ 0.296 |
+ 0.409 |
+ 0.098 |
+ 0.517 |
+ 0.495 |
+ 0.492 |
+ 0.517 |
+ 0.524 |
+ 0.521 |
+ 0.517 |
+ 0.527 |
+ 0.524 |
+ 0.532 |
+ 0.506 |
+ 0.479 |
+ 3372 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM |
+ 0.435 |
+ 0.427 |
+ 0.410 |
+ 0.433 |
+ 0.441 |
+ 0.437 |
+ 0.454 |
+ 0.401 |
+ 0.458 |
+ 0.454 |
+ 0.403 |
+ 0.486 |
+ 0.506 |
+ 0.358 |
+ 0.390 |
+ 0.393 |
+ 0.465 |
+ 0.426 |
+ 0.409 |
+ 0.397 |
+ 0.372 |
+ 0.350 |
+ 0.354 |
+ 0.309 |
+ 0.370 |
+ 0.395 |
+ 0.364 |
+ 0.350 |
+ 0.257 |
+ 0.456 |
+ 0.498 |
+ 0.495 |
+ 0.360 |
+ 0.339 |
+ 0.335 |
+ 0.280 |
+ 0.414 |
+ 0.411 |
+ 0.401 |
+ 0.443 |
+ 0.441 |
+ 0.437 |
+ 0.475 |
+ 0.316 |
+ 0.423 |
+ 0.458 |
+ 0.238 |
+ 0.246 |
+ 0.324 |
+ 0.046 |
+ 0.434 |
+ 0.408 |
+ 0.412 |
+ 0.437 |
+ 0.441 |
+ 0.438 |
+ 0.432 |
+ 0.442 |
+ 0.447 |
+ 0.447 |
+ 0.431 |
+ 0.391 |
+ 3637 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Nguyen_2022 |
+ 0.428 |
+ 0.426 |
+ 0.413 |
+ 0.428 |
+ 0.429 |
+ 0.425 |
+ 0.442 |
+ 0.405 |
+ 0.357 |
+ 0.411 |
+ 0.401 |
+ 0.483 |
+ 0.506 |
+ 0.342 |
+ 0.378 |
+ 0.387 |
+ 0.471 |
+ 0.443 |
+ 0.415 |
+ 0.396 |
+ 0.373 |
+ 0.344 |
+ 0.350 |
+ 0.306 |
+ 0.381 |
+ 0.397 |
+ 0.372 |
+ 0.358 |
+ 0.269 |
+ 0.453 |
+ 0.493 |
+ 0.484 |
+ 0.355 |
+ 0.348 |
+ 0.343 |
+ 0.282 |
+ 0.416 |
+ 0.412 |
+ 0.399 |
+ 0.440 |
+ 0.437 |
+ 0.430 |
+ 0.477 |
+ 0.304 |
+ 0.427 |
+ 0.466 |
+ 0.221 |
+ 0.227 |
+ 0.310 |
+ 0.063 |
+ 0.439 |
+ 0.405 |
+ 0.409 |
+ 0.432 |
+ 0.441 |
+ 0.433 |
+ 0.434 |
+ 0.444 |
+ 0.441 |
+ 0.445 |
+ 0.430 |
+ 0.382 |
+ 3366 |
+ OrganismalFitness |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SUMO1_HUMAN_Weile_2017 |
+ 0.369 |
+ 0.373 |
+ 0.419 |
+ 0.438 |
+ 0.480 |
+ 0.478 |
+ 0.130 |
+ 0.425 |
+ 0.449 |
+ 0.386 |
+ 0.433 |
+ 0.467 |
+ 0.510 |
+ 0.250 |
+ 0.496 |
+ 0.533 |
+ 0.509 |
+ 0.380 |
+ 0.340 |
+ 0.517 |
+ 0.217 |
+ 0.414 |
+ 0.443 |
+ 0.430 |
+ 0.468 |
+ 0.386 |
+ 0.460 |
+ 0.431 |
+ 0.333 |
+ 0.438 |
+ 0.460 |
+ 0.445 |
+ 0.212 |
+ 0.245 |
+ 0.473 |
+ 0.329 |
+ 0.405 |
+ 0.511 |
+ 0.410 |
+ 0.452 |
+ 0.522 |
+ 0.463 |
+ 0.490 |
+ 0.086 |
+ 0.395 |
+ 0.527 |
+ 0.497 |
+ 0.428 |
+ 0.509 |
+ 0.412 |
+ 0.492 |
+ 0.453 |
+ 0.500 |
+ 0.500 |
+ 0.493 |
+ 0.517 |
+ 0.479 |
+ 0.498 |
+ 0.491 |
+ 0.511 |
+ 0.479 |
+ 0.524 |
+ 1700 |
+ OrganismalFitness |
+ SUMO1_HUMAN |
+ High |
+ Human |
+
+
+ SYUA_HUMAN_Newberry_2020 |
+ 0.103 |
+ 0.111 |
+ 0.119 |
+ 0.132 |
+ 0.131 |
+ 0.139 |
+ 0.131 |
+ 0.212 |
+ 0.171 |
+ 0.162 |
+ 0.234 |
+ 0.242 |
+ 0.233 |
+ 0.105 |
+ 0.142 |
+ 0.150 |
+ 0.137 |
+ 0.206 |
+ 0.205 |
+ 0.220 |
+ 0.138 |
+ 0.203 |
+ 0.155 |
+ 0.141 |
+ 0.086 |
+ 0.151 |
+ 0.167 |
+ 0.105 |
+ 0.136 |
+ 0.227 |
+ 0.205 |
+ 0.189 |
+ 0.005 |
+ 0.137 |
+ 0.197 |
+ 0.181 |
+ 0.124 |
+ 0.176 |
+ 0.159 |
+ 0.134 |
+ 0.166 |
+ 0.156 |
+ 0.115 |
+ 0.068 |
+ 0.160 |
+ 0.155 |
+ -0.052 |
+ 0.046 |
+ -0.060 |
+ -0.074 |
+ 0.069 |
+ 0.056 |
+ 0.053 |
+ 0.088 |
+ 0.129 |
+ 0.098 |
+ 0.127 |
+ 0.102 |
+ 0.144 |
+ 0.098 |
+ -0.011 |
+ -0.045 |
+ 2497 |
+ OrganismalFitness |
+ SYUA_HUMAN |
+ Medium |
+ Human |
+
+
+ TADBP_HUMAN_Bolognesi_2019 |
+ 0.097 |
+ 0.061 |
+ 0.100 |
+ 0.096 |
+ 0.080 |
+ 0.080 |
+ 0.291 |
+ -0.018 |
+ 0.077 |
+ 0.072 |
+ 0.013 |
+ 0.051 |
+ 0.048 |
+ 0.121 |
+ 0.047 |
+ 0.021 |
+ -0.120 |
+ -0.028 |
+ 0.053 |
+ -0.075 |
+ 0.158 |
+ 0.063 |
+ -0.011 |
+ -0.006 |
+ 0.206 |
+ 0.082 |
+ 0.006 |
+ 0.023 |
+ -0.014 |
+ 0.036 |
+ 0.088 |
+ 0.112 |
+ -0.051 |
+ 0.226 |
+ 0.273 |
+ 0.121 |
+ 0.142 |
+ 0.185 |
+ 0.121 |
+ 0.105 |
+ 0.129 |
+ 0.109 |
+ 0.121 |
+ 0.212 |
+ -0.002 |
+ 0.078 |
+ 0.271 |
+ 0.033 |
+ 0.223 |
+ 0.059 |
+ 0.105 |
+ 0.013 |
+ 0.004 |
+ 0.040 |
+ 0.002 |
+ 0.034 |
+ 0.045 |
+ -0.011 |
+ -0.079 |
+ 0.021 |
+ 0.054 |
+ 0.121 |
+ 1196 |
+ OrganismalFitness |
+ TADBP_HUMAN |
+ Low |
+ Human |
+
+
+ TAT_HV1BR_Fernandes_2016 |
+ 0.293 |
+ 0.201 |
+ 0.255 |
+ 0.268 |
+ 0.319 |
+ 0.300 |
+ -0.090 |
+ 0.397 |
+ 0.274 |
+ 0.288 |
+ 0.185 |
+ 0.343 |
+ 0.342 |
+ -0.034 |
+ -0.008 |
+ -0.005 |
+ 0.017 |
+ -0.057 |
+ 0.044 |
+ 0.203 |
+ 0.379 |
+ 0.364 |
+ 0.396 |
+ 0.398 |
+ 0.394 |
+ 0.269 |
+ 0.136 |
+ 0.246 |
+ 0.223 |
+ 0.368 |
+ 0.405 |
+ 0.387 |
+ 0.283 |
+ 0.385 |
+ 0.211 |
+ 0.203 |
+ 0.379 |
+ 0.263 |
+ 0.237 |
+ 0.366 |
+ 0.295 |
+ 0.268 |
+ 0.005 |
+ -0.054 |
+ 0.292 |
+ 0.181 |
+ 0.230 |
+ 0.284 |
+ 0.344 |
+ 0.207 |
+ 0.114 |
+ 0.134 |
+ 0.142 |
+ 0.137 |
+ 0.116 |
+ 0.133 |
+ 0.120 |
+ 0.117 |
+ 0.117 |
+ 0.134 |
+ 0.157 |
+ 0.116 |
+ 1577 |
+ OrganismalFitness |
+ TAT_HV1BR |
+ High |
+ Virus |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L |
+ 0.576 |
+ 0.615 |
+ 0.644 |
+ 0.655 |
+ 0.660 |
+ 0.662 |
+ 0.696 |
+ 0.292 |
+ 0.677 |
+ 0.708 |
+ 0.591 |
+ 0.739 |
+ 0.750 |
+ 0.724 |
+ 0.707 |
+ 0.784 |
+ 0.769 |
+ 0.707 |
+ 0.745 |
+ 0.577 |
+ 0.490 |
+ 0.552 |
+ 0.570 |
+ 0.609 |
+ 0.604 |
+ 0.613 |
+ 0.660 |
+ 0.648 |
+ 0.597 |
+ 0.680 |
+ 0.640 |
+ 0.595 |
+ 0.642 |
+ 0.485 |
+ 0.619 |
+ 0.669 |
+ 0.688 |
+ 0.715 |
+ 0.710 |
+ 0.680 |
+ 0.712 |
+ 0.695 |
+ 0.753 |
+ 0.184 |
+ 0.629 |
+ 0.752 |
+ 0.680 |
+ 0.596 |
+ 0.774 |
+ 0.759 |
+ 0.721 |
+ 0.697 |
+ 0.657 |
+ 0.757 |
+ 0.733 |
+ 0.763 |
+ 0.723 |
+ 0.738 |
+ 0.710 |
+ 0.735 |
+ 0.790 |
+ 0.774 |
+ 1058 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG |
+ 0.309 |
+ 0.397 |
+ 0.482 |
+ 0.473 |
+ 0.483 |
+ 0.481 |
+ 0.072 |
+ 0.061 |
+ 0.523 |
+ 0.520 |
+ 0.511 |
+ 0.502 |
+ 0.532 |
+ 0.186 |
+ 0.557 |
+ 0.548 |
+ 0.513 |
+ 0.532 |
+ 0.546 |
+ 0.515 |
+ -0.047 |
+ 0.332 |
+ 0.349 |
+ 0.305 |
+ 0.351 |
+ 0.383 |
+ 0.279 |
+ 0.423 |
+ 0.534 |
+ 0.543 |
+ 0.526 |
+ 0.512 |
+ 0.480 |
+ 0.003 |
+ 0.403 |
+ 0.337 |
+ 0.446 |
+ 0.505 |
+ 0.495 |
+ 0.472 |
+ 0.510 |
+ 0.511 |
+ 0.301 |
+ 0.114 |
+ 0.463 |
+ 0.389 |
+ 0.504 |
+ 0.484 |
+ 0.561 |
+ 0.597 |
+ 0.555 |
+ 0.546 |
+ 0.566 |
+ 0.573 |
+ 0.584 |
+ 0.548 |
+ 0.561 |
+ 0.551 |
+ 0.545 |
+ 0.561 |
+ 0.502 |
+ 0.483 |
+ 1279 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT |
+ 0.464 |
+ 0.473 |
+ 0.506 |
+ 0.495 |
+ 0.507 |
+ 0.502 |
+ -0.099 |
+ 0.392 |
+ 0.573 |
+ 0.576 |
+ 0.472 |
+ 0.538 |
+ 0.526 |
+ 0.044 |
+ 0.547 |
+ 0.589 |
+ 0.525 |
+ 0.529 |
+ 0.533 |
+ 0.466 |
+ 0.262 |
+ 0.361 |
+ 0.429 |
+ 0.394 |
+ 0.395 |
+ 0.455 |
+ 0.385 |
+ 0.395 |
+ 0.436 |
+ 0.528 |
+ 0.484 |
+ 0.451 |
+ 0.250 |
+ 0.061 |
+ 0.312 |
+ 0.333 |
+ 0.535 |
+ 0.523 |
+ 0.517 |
+ 0.541 |
+ 0.518 |
+ 0.518 |
+ 0.343 |
+ -0.030 |
+ 0.572 |
+ 0.492 |
+ 0.526 |
+ 0.649 |
+ 0.675 |
+ 0.624 |
+ 0.510 |
+ 0.530 |
+ 0.516 |
+ 0.512 |
+ 0.517 |
+ 0.511 |
+ 0.521 |
+ 0.512 |
+ 0.513 |
+ 0.524 |
+ 0.597 |
+ 0.604 |
+ 1479 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ TPK1_HUMAN_Weile_2017 |
+ 0.217 |
+ 0.236 |
+ 0.219 |
+ 0.228 |
+ 0.230 |
+ 0.229 |
+ 0.067 |
+ 0.253 |
+ 0.286 |
+ 0.270 |
+ 0.285 |
+ 0.270 |
+ 0.318 |
+ 0.108 |
+ 0.199 |
+ 0.253 |
+ 0.339 |
+ 0.328 |
+ 0.371 |
+ 0.249 |
+ 0.088 |
+ 0.142 |
+ 0.228 |
+ 0.272 |
+ 0.115 |
+ 0.267 |
+ 0.273 |
+ 0.253 |
+ 0.306 |
+ 0.255 |
+ 0.253 |
+ 0.188 |
+ 0.117 |
+ 0.115 |
+ 0.214 |
+ 0.301 |
+ 0.232 |
+ 0.252 |
+ 0.305 |
+ 0.243 |
+ 0.252 |
+ 0.277 |
+ 0.120 |
+ 0.089 |
+ 0.309 |
+ 0.170 |
+ 0.206 |
+ 0.267 |
+ 0.245 |
+ 0.116 |
+ 0.253 |
+ 0.277 |
+ 0.275 |
+ 0.278 |
+ 0.293 |
+ 0.291 |
+ 0.281 |
+ 0.291 |
+ 0.292 |
+ 0.298 |
+ 0.268 |
+ 0.203 |
+ 3181 |
+ OrganismalFitness |
+ TPK1_HUMAN |
+ Medium |
+ Human |
+
+
+ TPMT_HUMAN_Matreyek_2018 |
+ 0.372 |
+ 0.456 |
+ 0.489 |
+ 0.509 |
+ 0.499 |
+ 0.513 |
+ 0.241 |
+ 0.445 |
+ 0.497 |
+ 0.500 |
+ 0.546 |
+ 0.517 |
+ 0.547 |
+ 0.308 |
+ 0.433 |
+ 0.531 |
+ 0.539 |
+ 0.476 |
+ 0.448 |
+ 0.511 |
+ 0.333 |
+ 0.420 |
+ 0.481 |
+ 0.496 |
+ 0.447 |
+ 0.506 |
+ 0.481 |
+ 0.466 |
+ 0.424 |
+ 0.547 |
+ 0.542 |
+ 0.489 |
+ 0.334 |
+ 0.330 |
+ 0.471 |
+ 0.411 |
+ 0.478 |
+ 0.513 |
+ 0.478 |
+ 0.517 |
+ 0.537 |
+ 0.515 |
+ 0.371 |
+ 0.204 |
+ 0.539 |
+ 0.495 |
+ 0.476 |
+ 0.534 |
+ 0.540 |
+ 0.238 |
+ 0.522 |
+ 0.519 |
+ 0.530 |
+ 0.514 |
+ 0.541 |
+ 0.535 |
+ 0.541 |
+ 0.529 |
+ 0.534 |
+ 0.543 |
+ 0.568 |
+ 0.469 |
+ 3648 |
+ Expression |
+ TPMT_HUMAN |
+ Medium |
+ Human |
+
+
+ TPOR_HUMAN_Bridgford_2020 |
+ 0.376 |
+ 0.365 |
+ 0.283 |
+ 0.263 |
+ 0.288 |
+ 0.296 |
+ 0.370 |
+ 0.455 |
+ 0.454 |
+ 0.456 |
+ 0.407 |
+ 0.368 |
+ 0.370 |
+ 0.279 |
+ 0.378 |
+ 0.294 |
+ 0.290 |
+ 0.422 |
+ 0.362 |
+ 0.362 |
+ 0.250 |
+ 0.338 |
+ 0.313 |
+ 0.377 |
+ 0.367 |
+ 0.200 |
+ 0.471 |
+ 0.451 |
+ 0.340 |
+ 0.450 |
+ 0.436 |
+ 0.274 |
+ 0.395 |
+ 0.372 |
+ 0.321 |
+ 0.419 |
+ 0.429 |
+ 0.408 |
+ 0.451 |
+ 0.408 |
+ 0.382 |
+ 0.437 |
+ 0.295 |
+ 0.303 |
+ 0.353 |
+ 0.348 |
+ 0.126 |
+ 0.357 |
+ 0.309 |
+ 0.035 |
+ 0.344 |
+ 0.308 |
+ 0.344 |
+ 0.354 |
+ 0.322 |
+ 0.345 |
+ 0.396 |
+ 0.257 |
+ 0.293 |
+ 0.341 |
+ 0.389 |
+ 0.372 |
+ 562 |
+ OrganismalFitness |
+ TPOR_HUMAN |
+ Low |
+ Human |
+
+
+ TRPC_SACS2_Chan_2017 |
+ 0.575 |
+ 0.606 |
+ 0.558 |
+ 0.574 |
+ 0.575 |
+ 0.585 |
+ 0.173 |
+ 0.499 |
+ 0.655 |
+ 0.671 |
+ 0.621 |
+ 0.611 |
+ 0.643 |
+ 0.325 |
+ 0.539 |
+ 0.618 |
+ 0.653 |
+ 0.632 |
+ 0.601 |
+ 0.592 |
+ 0.421 |
+ 0.564 |
+ 0.508 |
+ 0.575 |
+ 0.498 |
+ 0.499 |
+ 0.558 |
+ 0.521 |
+ 0.524 |
+ 0.529 |
+ 0.590 |
+ 0.520 |
+ 0.221 |
+ 0.520 |
+ 0.555 |
+ 0.558 |
+ 0.575 |
+ 0.598 |
+ 0.591 |
+ 0.590 |
+ 0.612 |
+ 0.601 |
+ 0.434 |
+ 0.072 |
+ 0.612 |
+ 0.498 |
+ 0.321 |
+ 0.512 |
+ 0.507 |
+ 0.122 |
+ 0.593 |
+ 0.594 |
+ 0.582 |
+ 0.646 |
+ 0.614 |
+ 0.615 |
+ 0.621 |
+ 0.630 |
+ 0.614 |
+ 0.636 |
+ 0.626 |
+ 0.578 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ TRPC_THEMA_Chan_2017 |
+ 0.405 |
+ 0.450 |
+ 0.394 |
+ 0.411 |
+ 0.416 |
+ 0.417 |
+ 0.395 |
+ 0.434 |
+ 0.471 |
+ 0.470 |
+ 0.439 |
+ 0.478 |
+ 0.506 |
+ 0.327 |
+ 0.470 |
+ 0.503 |
+ 0.464 |
+ 0.470 |
+ 0.499 |
+ 0.430 |
+ 0.336 |
+ 0.399 |
+ 0.395 |
+ 0.455 |
+ 0.437 |
+ 0.421 |
+ 0.369 |
+ 0.387 |
+ 0.478 |
+ 0.390 |
+ 0.427 |
+ 0.365 |
+ 0.140 |
+ 0.329 |
+ 0.460 |
+ 0.448 |
+ 0.390 |
+ 0.442 |
+ 0.438 |
+ 0.418 |
+ 0.445 |
+ 0.442 |
+ 0.397 |
+ 0.140 |
+ 0.461 |
+ 0.395 |
+ 0.253 |
+ 0.386 |
+ 0.393 |
+ 0.075 |
+ 0.459 |
+ 0.410 |
+ 0.431 |
+ 0.462 |
+ 0.463 |
+ 0.409 |
+ 0.439 |
+ 0.441 |
+ 0.418 |
+ 0.454 |
+ 0.504 |
+ 0.491 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_THEMA |
+ Medium |
+ Prokaryote |
+
+
+ UBC9_HUMAN_Weile_2017 |
+ 0.371 |
+ 0.496 |
+ 0.521 |
+ 0.531 |
+ 0.508 |
+ 0.522 |
+ -0.042 |
+ 0.403 |
+ 0.512 |
+ 0.519 |
+ 0.420 |
+ 0.477 |
+ 0.509 |
+ 0.001 |
+ 0.004 |
+ 0.403 |
+ 0.473 |
+ 0.518 |
+ 0.538 |
+ 0.557 |
+ 0.220 |
+ 0.405 |
+ 0.445 |
+ 0.419 |
+ 0.420 |
+ 0.479 |
+ 0.471 |
+ 0.449 |
+ 0.442 |
+ 0.484 |
+ 0.454 |
+ 0.432 |
+ 0.071 |
+ 0.233 |
+ 0.403 |
+ 0.428 |
+ 0.327 |
+ 0.452 |
+ 0.475 |
+ 0.443 |
+ 0.517 |
+ 0.540 |
+ 0.277 |
+ -0.008 |
+ 0.427 |
+ 0.492 |
+ 0.322 |
+ 0.350 |
+ 0.446 |
+ 0.243 |
+ 0.440 |
+ 0.417 |
+ 0.408 |
+ 0.433 |
+ 0.425 |
+ 0.427 |
+ 0.433 |
+ 0.440 |
+ 0.442 |
+ 0.439 |
+ 0.370 |
+ 0.426 |
+ 2563 |
+ OrganismalFitness |
+ UBC9_HUMAN |
+ Medium |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X |
+ 0.325 |
+ 0.443 |
+ 0.464 |
+ 0.467 |
+ 0.466 |
+ 0.466 |
+ 0.051 |
+ 0.346 |
+ 0.379 |
+ 0.415 |
+ 0.391 |
+ 0.453 |
+ 0.449 |
+ 0.347 |
+ 0.420 |
+ 0.490 |
+ 0.460 |
+ 0.467 |
+ 0.477 |
+ 0.421 |
+ 0.142 |
+ 0.353 |
+ 0.357 |
+ 0.364 |
+ 0.345 |
+ 0.374 |
+ 0.343 |
+ 0.322 |
+ 0.355 |
+ 0.486 |
+ 0.475 |
+ 0.453 |
+ 0.441 |
+ 0.079 |
+ 0.110 |
+ 0.255 |
+ 0.383 |
+ 0.385 |
+ 0.329 |
+ 0.472 |
+ 0.477 |
+ 0.424 |
+ 0.359 |
+ -0.095 |
+ 0.348 |
+ 0.344 |
+ 0.488 |
+ 0.409 |
+ 0.520 |
+ 0.351 |
+ 0.411 |
+ 0.436 |
+ 0.424 |
+ 0.447 |
+ 0.448 |
+ 0.428 |
+ 0.407 |
+ 0.429 |
+ 0.420 |
+ 0.432 |
+ 0.500 |
+ 0.407 |
+ 3622 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_MOUSE_Starita_2013 |
+ 0.412 |
+ 0.417 |
+ 0.454 |
+ 0.468 |
+ 0.463 |
+ 0.470 |
+ 0.078 |
+ 0.089 |
+ 0.392 |
+ 0.394 |
+ 0.361 |
+ 0.447 |
+ 0.471 |
+ 0.399 |
+ 0.463 |
+ 0.459 |
+ 0.459 |
+ 0.351 |
+ 0.349 |
+ 0.383 |
+ 0.122 |
+ 0.324 |
+ 0.337 |
+ 0.300 |
+ 0.444 |
+ 0.376 |
+ 0.395 |
+ 0.376 |
+ 0.289 |
+ 0.433 |
+ 0.437 |
+ 0.396 |
+ 0.091 |
+ 0.026 |
+ 0.096 |
+ 0.262 |
+ 0.419 |
+ 0.351 |
+ 0.390 |
+ 0.467 |
+ 0.454 |
+ 0.472 |
+ 0.409 |
+ 0.041 |
+ 0.420 |
+ 0.458 |
+ 0.307 |
+ 0.393 |
+ -0.017 |
+ 0.118 |
+ 0.426 |
+ 0.409 |
+ 0.456 |
+ 0.458 |
+ 0.438 |
+ 0.458 |
+ 0.432 |
+ 0.449 |
+ 0.443 |
+ 0.454 |
+ 0.440 |
+ 0.412 |
+ 899 |
+ Activity |
+ UBE4B_MOUSE |
+ Low |
+ Eukaryote |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T |
+ 0.442 |
+ 0.550 |
+ 0.540 |
+ 0.538 |
+ 0.566 |
+ 0.577 |
+ 0.152 |
+ 0.251 |
+ 0.498 |
+ 0.528 |
+ 0.528 |
+ 0.425 |
+ 0.511 |
+ 0.226 |
+ 0.226 |
+ 0.158 |
+ 0.197 |
+ 0.591 |
+ 0.601 |
+ 0.476 |
+ 0.422 |
+ 0.414 |
+ 0.467 |
+ 0.514 |
+ 0.473 |
+ 0.462 |
+ 0.495 |
+ 0.488 |
+ 0.492 |
+ 0.557 |
+ 0.497 |
+ 0.388 |
+ 0.288 |
+ 0.304 |
+ 0.455 |
+ 0.401 |
+ 0.484 |
+ 0.519 |
+ 0.502 |
+ 0.592 |
+ 0.594 |
+ 0.583 |
+ 0.133 |
+ 0.121 |
+ 0.422 |
+ 0.209 |
+ 0.557 |
+ 0.439 |
+ 0.637 |
+ 0.574 |
+ 0.560 |
+ 0.588 |
+ 0.597 |
+ 0.609 |
+ 0.599 |
+ 0.612 |
+ 0.589 |
+ 0.605 |
+ 0.585 |
+ 0.600 |
+ 0.661 |
+ 0.597 |
+ 1453 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8 |
+ 0.228 |
+ 0.314 |
+ 0.228 |
+ 0.246 |
+ 0.266 |
+ 0.255 |
+ 0.256 |
+ 0.064 |
+ 0.371 |
+ 0.392 |
+ 0.623 |
+ 0.544 |
+ 0.562 |
+ 0.360 |
+ 0.514 |
+ 0.601 |
+ 0.655 |
+ 0.611 |
+ 0.653 |
+ 0.167 |
+ 0.395 |
+ 0.474 |
+ 0.458 |
+ 0.531 |
+ 0.419 |
+ 0.425 |
+ 0.410 |
+ 0.467 |
+ 0.516 |
+ 0.487 |
+ 0.510 |
+ 0.386 |
+ 0.038 |
+ 0.307 |
+ 0.473 |
+ 0.464 |
+ 0.343 |
+ 0.415 |
+ 0.464 |
+ 0.336 |
+ 0.403 |
+ 0.409 |
+ 0.469 |
+ 0.295 |
+ 0.552 |
+ 0.483 |
+ 0.458 |
+ 0.561 |
+ 0.543 |
+ 0.505 |
+ 0.652 |
+ 0.626 |
+ 0.661 |
+ 0.661 |
+ 0.664 |
+ 0.691 |
+ 0.634 |
+ 0.658 |
+ 0.684 |
+ 0.674 |
+ 0.619 |
+ 0.600 |
+ 723 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5 |
+ 0.284 |
+ 0.422 |
+ 0.594 |
+ 0.630 |
+ 0.634 |
+ 0.637 |
+ 0.162 |
+ 0.540 |
+ 0.553 |
+ 0.590 |
+ 0.673 |
+ 0.630 |
+ 0.673 |
+ 0.371 |
+ 0.316 |
+ 0.652 |
+ 0.662 |
+ 0.589 |
+ 0.478 |
+ 0.592 |
+ 0.293 |
+ 0.431 |
+ 0.601 |
+ 0.524 |
+ 0.424 |
+ 0.575 |
+ 0.494 |
+ 0.482 |
+ 0.579 |
+ 0.599 |
+ 0.701 |
+ 0.694 |
+ 0.322 |
+ 0.346 |
+ 0.515 |
+ 0.521 |
+ 0.487 |
+ 0.564 |
+ 0.563 |
+ 0.634 |
+ 0.659 |
+ 0.642 |
+ 0.272 |
+ 0.283 |
+ 0.426 |
+ 0.421 |
+ 0.542 |
+ 0.433 |
+ 0.775 |
+ 0.702 |
+ 0.672 |
+ 0.658 |
+ 0.666 |
+ 0.658 |
+ 0.653 |
+ 0.665 |
+ 0.649 |
+ 0.656 |
+ 0.652 |
+ 0.662 |
+ 0.685 |
+ 0.798 |
+ 2568 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VKOR1_HUMAN_Chiasson_2020_abundance |
+ 0.409 |
+ 0.416 |
+ 0.402 |
+ 0.422 |
+ 0.437 |
+ 0.437 |
+ 0.245 |
+ 0.396 |
+ 0.501 |
+ 0.520 |
+ 0.438 |
+ 0.447 |
+ 0.474 |
+ 0.251 |
+ 0.443 |
+ 0.500 |
+ 0.493 |
+ 0.484 |
+ 0.472 |
+ 0.464 |
+ 0.286 |
+ 0.298 |
+ 0.396 |
+ 0.430 |
+ 0.368 |
+ 0.471 |
+ 0.463 |
+ 0.437 |
+ 0.442 |
+ 0.481 |
+ 0.383 |
+ 0.288 |
+ 0.090 |
+ 0.256 |
+ 0.364 |
+ 0.505 |
+ 0.462 |
+ 0.483 |
+ 0.548 |
+ 0.476 |
+ 0.492 |
+ 0.538 |
+ 0.236 |
+ 0.180 |
+ 0.464 |
+ 0.384 |
+ 0.446 |
+ 0.441 |
+ 0.490 |
+ 0.279 |
+ 0.520 |
+ 0.490 |
+ 0.530 |
+ 0.543 |
+ 0.530 |
+ 0.513 |
+ 0.526 |
+ 0.508 |
+ 0.525 |
+ 0.533 |
+ 0.494 |
+ 0.479 |
+ 2695 |
+ Expression |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VKOR1_HUMAN_Chiasson_2020_activity |
+ 0.329 |
+ 0.368 |
+ 0.399 |
+ 0.410 |
+ 0.413 |
+ 0.428 |
+ 0.000 |
+ 0.386 |
+ 0.423 |
+ 0.441 |
+ 0.419 |
+ 0.427 |
+ 0.448 |
+ 0.016 |
+ 0.298 |
+ 0.369 |
+ 0.430 |
+ 0.446 |
+ 0.445 |
+ 0.415 |
+ 0.051 |
+ 0.073 |
+ 0.272 |
+ 0.310 |
+ 0.132 |
+ 0.384 |
+ 0.384 |
+ 0.360 |
+ 0.385 |
+ 0.421 |
+ 0.431 |
+ 0.388 |
+ 0.179 |
+ 0.022 |
+ 0.135 |
+ 0.370 |
+ 0.339 |
+ 0.350 |
+ 0.401 |
+ 0.409 |
+ 0.420 |
+ 0.436 |
+ 0.084 |
+ 0.021 |
+ 0.416 |
+ 0.226 |
+ 0.225 |
+ 0.334 |
+ 0.366 |
+ 0.100 |
+ 0.404 |
+ 0.358 |
+ 0.369 |
+ 0.389 |
+ 0.406 |
+ 0.389 |
+ 0.391 |
+ 0.394 |
+ 0.394 |
+ 0.401 |
+ 0.427 |
+ 0.322 |
+ 697 |
+ Activity |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM |
+ -0.119 |
+ 0.043 |
+ 0.164 |
+ 0.194 |
+ 0.078 |
+ 0.119 |
+ 0.082 |
+ 0.147 |
+ 0.366 |
+ 0.356 |
+ 0.498 |
+ 0.368 |
+ 0.374 |
+ 0.252 |
+ 0.410 |
+ 0.507 |
+ 0.587 |
+ 0.612 |
+ 0.468 |
+ 0.128 |
+ -0.020 |
+ 0.135 |
+ 0.133 |
+ 0.222 |
+ 0.215 |
+ 0.202 |
+ 0.131 |
+ 0.199 |
+ 0.291 |
+ 0.084 |
+ 0.499 |
+ 0.429 |
+ 0.242 |
+ 0.065 |
+ 0.059 |
+ 0.122 |
+ -0.039 |
+ -0.028 |
+ -0.057 |
+ 0.064 |
+ 0.083 |
+ 0.006 |
+ 0.254 |
+ 0.056 |
+ 0.501 |
+ 0.357 |
+ 0.589 |
+ 0.557 |
+ 0.593 |
+ 0.606 |
+ 0.612 |
+ 0.592 |
+ 0.633 |
+ 0.639 |
+ 0.622 |
+ 0.641 |
+ 0.615 |
+ 0.630 |
+ 0.604 |
+ 0.640 |
+ 0.662 |
+ 0.582 |
+ 1047 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YAIA_ECOLI_Tsuboyama_2023_2KVT |
+ 0.291 |
+ 0.602 |
+ 0.572 |
+ 0.596 |
+ 0.599 |
+ 0.591 |
+ -0.221 |
+ 0.533 |
+ 0.559 |
+ 0.598 |
+ 0.593 |
+ 0.270 |
+ 0.481 |
+ -0.107 |
+ 0.173 |
+ 0.572 |
+ 0.625 |
+ 0.684 |
+ 0.684 |
+ 0.591 |
+ -0.091 |
+ -0.182 |
+ -0.130 |
+ 0.264 |
+ -0.125 |
+ -0.166 |
+ -0.231 |
+ -0.023 |
+ 0.520 |
+ 0.605 |
+ 0.681 |
+ 0.643 |
+ -0.060 |
+ -0.234 |
+ -0.064 |
+ 0.502 |
+ 0.349 |
+ 0.359 |
+ 0.528 |
+ 0.572 |
+ 0.569 |
+ 0.604 |
+ -0.046 |
+ -0.261 |
+ 0.208 |
+ -0.013 |
+ 0.449 |
+ 0.425 |
+ 0.651 |
+ 0.544 |
+ 0.587 |
+ 0.627 |
+ 0.585 |
+ 0.619 |
+ 0.612 |
+ 0.627 |
+ 0.625 |
+ 0.657 |
+ 0.677 |
+ 0.632 |
+ 0.616 |
+ 0.498 |
+ 1890 |
+ Stability |
+ YAIA_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ YAP1_HUMAN_Araya_2012 |
+ 0.428 |
+ 0.321 |
+ 0.464 |
+ 0.460 |
+ 0.449 |
+ 0.455 |
+ 0.329 |
+ 0.289 |
+ 0.018 |
+ 0.036 |
+ 0.333 |
+ 0.280 |
+ 0.285 |
+ 0.411 |
+ 0.457 |
+ 0.451 |
+ 0.466 |
+ 0.382 |
+ 0.313 |
+ 0.189 |
+ 0.180 |
+ 0.176 |
+ 0.158 |
+ 0.167 |
+ 0.299 |
+ 0.226 |
+ 0.257 |
+ 0.207 |
+ 0.150 |
+ 0.332 |
+ 0.425 |
+ 0.451 |
+ 0.139 |
+ 0.302 |
+ 0.196 |
+ 0.217 |
+ 0.402 |
+ 0.326 |
+ 0.359 |
+ 0.435 |
+ 0.405 |
+ 0.439 |
+ 0.485 |
+ -0.104 |
+ 0.335 |
+ 0.486 |
+ 0.340 |
+ 0.362 |
+ 0.383 |
+ 0.197 |
+ 0.446 |
+ 0.379 |
+ 0.432 |
+ 0.426 |
+ 0.417 |
+ 0.412 |
+ 0.415 |
+ 0.441 |
+ 0.444 |
+ 0.438 |
+ 0.386 |
+ 0.422 |
+ 10075 |
+ Binding |
+ YAP1_HUMAN |
+ Low |
+ Human |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD |
+ 0.596 |
+ 0.610 |
+ 0.595 |
+ 0.618 |
+ 0.604 |
+ 0.609 |
+ 0.492 |
+ 0.394 |
+ 0.591 |
+ 0.583 |
+ 0.673 |
+ 0.649 |
+ 0.667 |
+ 0.584 |
+ 0.621 |
+ 0.650 |
+ 0.658 |
+ 0.631 |
+ 0.630 |
+ 0.525 |
+ 0.488 |
+ 0.534 |
+ 0.612 |
+ 0.604 |
+ 0.594 |
+ 0.611 |
+ 0.587 |
+ 0.590 |
+ 0.643 |
+ 0.594 |
+ 0.586 |
+ 0.547 |
+ 0.568 |
+ 0.573 |
+ 0.567 |
+ 0.544 |
+ 0.665 |
+ 0.634 |
+ 0.621 |
+ 0.669 |
+ 0.624 |
+ 0.630 |
+ 0.446 |
+ 0.353 |
+ 0.536 |
+ 0.448 |
+ 0.483 |
+ 0.562 |
+ 0.680 |
+ 0.650 |
+ 0.701 |
+ 0.649 |
+ 0.665 |
+ 0.687 |
+ 0.700 |
+ 0.691 |
+ 0.687 |
+ 0.676 |
+ 0.679 |
+ 0.690 |
+ 0.638 |
+ 0.760 |
+ 2300 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_Selection_Type_level.csv b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_Selection_Type_level.csv
new file mode 100644
index 0000000..82517d7
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_Selection_Type_level.csv
@@ -0,0 +1,7 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type
+0.369,0.44,0.447,0.455,0.458,0.464,0.182,0.355,0.455,0.473,0.428,0.396,0.42,0.201,0.314,0.391,0.425,0.417,0.405,0.379,0.294,0.352,0.359,0.366,0.333,0.393,0.396,0.406,0.402,0.482,0.468,0.429,0.176,0.288,0.349,0.401,0.436,0.448,0.465,0.475,0.479,0.487,0.285,0.112,0.395,0.342,0.327,0.39,0.368,0.197,0.451,0.441,0.446,0.457,0.459,0.458,0.455,0.457,0.454,0.466,0.458,0.372,Activity
+0.344,0.317,0.349,0.363,0.372,0.386,0.202,0.305,0.312,0.329,0.287,0.268,0.32,0.26,0.291,0.326,0.337,0.321,0.317,0.325,0.275,0.273,0.29,0.302,0.275,0.295,0.294,0.293,0.302,0.383,0.366,0.347,0.149,0.286,0.284,0.288,0.372,0.361,0.349,0.396,0.386,0.376,0.268,0.084,0.273,0.282,0.336,0.321,0.389,0.163,0.344,0.337,0.352,0.353,0.347,0.366,0.35,0.351,0.357,0.366,0.378,0.357,Binding
+0.343,0.378,0.371,0.39,0.404,0.408,0.216,0.365,0.429,0.446,0.406,0.405,0.429,0.266,0.343,0.402,0.415,0.403,0.405,0.35,0.336,0.405,0.42,0.414,0.384,0.433,0.437,0.427,0.418,0.438,0.404,0.326,0.193,0.349,0.406,0.413,0.42,0.441,0.45,0.443,0.452,0.457,0.312,0.171,0.397,0.369,0.43,0.438,0.407,0.198,0.42,0.417,0.436,0.437,0.439,0.435,0.443,0.435,0.431,0.449,0.488,0.439,Expression
+0.379,0.409,0.395,0.411,0.44,0.446,0.135,0.344,0.411,0.418,0.349,0.361,0.386,0.136,0.212,0.3,0.368,0.378,0.387,0.363,0.325,0.369,0.374,0.381,0.334,0.381,0.378,0.379,0.387,0.45,0.438,0.402,0.164,0.317,0.359,0.386,0.409,0.419,0.433,0.447,0.452,0.458,0.215,0.056,0.365,0.268,0.297,0.364,0.32,0.161,0.376,0.369,0.38,0.39,0.386,0.385,0.384,0.383,0.383,0.396,0.363,0.285,OrganismalFitness
+0.358,0.43,0.473,0.476,0.487,0.491,0.21,0.366,0.475,0.492,0.5,0.437,0.477,0.262,0.439,0.51,0.523,0.509,0.488,0.449,0.289,0.348,0.383,0.398,0.349,0.396,0.383,0.396,0.445,0.519,0.5,0.461,0.257,0.27,0.342,0.381,0.452,0.465,0.471,0.497,0.502,0.5,0.314,0.105,0.412,0.376,0.522,0.485,0.624,0.565,0.557,0.547,0.548,0.563,0.563,0.566,0.557,0.561,0.561,0.568,0.592,0.575,Stability
+0.359,0.395,0.407,0.419,0.432,0.439,0.189,0.347,0.416,0.432,0.394,0.374,0.406,0.225,0.32,0.386,0.414,0.405,0.4,0.373,0.304,0.349,0.365,0.372,0.335,0.379,0.377,0.38,0.391,0.454,0.435,0.393,0.188,0.302,0.348,0.374,0.418,0.427,0.434,0.452,0.454,0.456,0.279,0.106,0.368,0.327,0.382,0.4,0.422,0.257,0.429,0.422,0.433,0.44,0.439,0.442,0.438,0.437,0.437,0.449,0.456,0.406,
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_level.csv b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_level.csv
new file mode 100644
index 0000000..3a934a6
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/DMS_substitutions_Spearman_Uniprot_level.csv
@@ -0,0 +1,223 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type,UniProt_ID,MSA_Neff_L_category,Taxon
+0.383,0.354,0.131,0.103,0.394,0.405,-0.133,0.062,0.436,0.439,-0.001,-0.048,0.015,-0.073,-0.054,-0.058,0.213,0.363,0.375,0.216,0.361,0.309,0.317,0.305,0.329,0.342,0.328,0.312,0.293,0.43,0.319,0.296,0.005,0.304,0.331,0.272,0.362,0.366,0.351,0.361,0.358,0.373,-0.06,-0.073,0.131,-0.061,0.273,0.29,0.287,0.129,0.253,0.251,0.282,0.296,0.277,0.269,0.263,0.269,0.265,0.278,0.199,0.116,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+0.481,0.407,0.413,0.432,0.509,0.516,0.0,0.513,0.511,0.515,0.456,0.492,0.516,-0.003,0.015,0.035,0.08,0.132,0.164,0.465,0.496,0.507,0.509,0.505,0.497,0.501,0.463,0.49,0.484,0.496,0.541,0.507,0.327,0.492,0.483,0.514,0.509,0.503,0.513,0.524,0.518,0.528,0.422,-0.021,0.497,0.429,0.329,0.451,0.206,0.136,0.202,0.23,0.258,0.263,0.21,0.226,0.228,0.2,0.183,0.239,0.173,0.084,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+-0.011,0.044,0.107,0.098,0.053,0.054,-0.024,0.084,0.082,0.077,0.04,0.068,0.068,-0.037,-0.047,-0.016,0.03,0.027,0.025,0.067,-0.01,0.047,0.071,0.088,0.05,0.088,0.08,0.089,0.095,0.045,0.046,0.036,0.03,0.036,0.055,0.099,0.031,0.041,0.057,0.058,0.061,0.075,-0.055,-0.056,0.039,-0.043,0.051,0.055,0.039,0.04,0.054,0.042,0.039,0.035,0.037,0.052,0.039,0.058,0.041,0.047,0.04,-0.005,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+0.436,0.459,0.109,0.041,0.428,0.428,0.003,0.026,0.299,0.32,0.087,0.088,0.093,0.064,0.084,0.097,0.066,0.084,0.084,0.059,0.021,0.015,0.04,0.078,-0.029,0.067,0.01,0.037,0.106,0.473,0.337,0.351,0.093,0.041,0.024,0.057,0.301,0.292,0.311,0.253,0.242,0.263,0.047,0.002,0.074,0.084,0.418,0.441,0.476,0.364,0.274,0.326,0.314,0.344,0.313,0.36,0.312,0.331,0.265,0.335,0.427,0.28,Activity,A0A247D711_LISMN,High,Prokaryote
+0.479,0.49,0.469,0.498,0.526,0.534,0.015,0.464,0.495,0.502,0.14,0.489,0.53,0.022,0.043,0.068,0.486,0.49,0.493,0.436,0.458,0.518,0.502,0.526,0.389,0.49,0.498,0.471,0.49,0.526,0.466,0.388,0.08,0.456,0.509,0.521,0.512,0.541,0.539,0.549,0.566,0.566,0.029,0.026,0.296,0.043,0.392,0.434,0.412,0.16,0.46,0.452,0.492,0.487,0.49,0.493,0.502,0.483,0.504,0.503,0.214,0.174,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.479,0.49,0.469,0.498,0.526,0.534,0.015,0.464,0.495,0.502,0.14,0.489,0.53,0.022,0.043,0.068,0.486,0.49,0.493,0.436,0.458,0.518,0.502,0.526,0.389,0.49,0.498,0.471,0.49,0.526,0.466,0.388,0.08,0.456,0.509,0.521,0.512,0.541,0.539,0.549,0.566,0.566,0.029,0.026,0.296,0.043,0.392,0.434,0.412,0.16,0.46,0.452,0.492,0.487,0.49,0.493,0.502,0.483,0.504,0.503,0.214,0.174,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.411,0.34,0.406,0.403,0.409,0.408,0.029,0.364,0.297,0.287,0.044,0.026,0.033,0.022,0.02,0.027,0.149,0.211,0.27,0.26,0.329,0.386,0.404,0.398,0.064,0.26,0.285,0.244,0.33,0.443,0.318,0.296,0.055,0.348,0.371,0.404,0.366,0.38,0.393,0.434,0.439,0.461,0.019,0.0,0.08,-0.001,0.262,0.254,0.126,0.105,0.183,0.18,0.191,0.216,0.209,0.207,0.2,0.198,0.193,0.204,0.148,0.092,OrganismalFitness,A4D664_9INFA,Medium,Virus
+0.317,0.492,0.657,0.664,0.623,0.641,0.344,0.529,0.641,0.681,0.68,0.647,0.668,0.431,0.528,0.655,0.738,0.712,0.627,0.567,0.409,0.533,0.564,0.624,0.526,0.622,0.639,0.638,0.705,0.671,0.742,0.673,0.242,0.423,0.552,0.629,0.557,0.588,0.652,0.653,0.66,0.679,0.481,0.071,0.671,0.601,0.664,0.713,0.629,0.422,0.728,0.718,0.711,0.719,0.729,0.733,0.728,0.733,0.735,0.745,0.722,0.513,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+0.394,0.385,0.449,0.425,0.312,0.316,0.345,0.167,0.337,0.351,0.302,0.316,0.404,0.382,0.366,0.372,0.431,0.48,0.352,0.261,0.327,0.281,0.318,0.306,0.406,0.297,0.32,0.313,0.314,0.478,0.251,0.186,0.487,0.393,0.273,0.37,0.441,0.372,0.458,0.436,0.4,0.416,0.366,0.362,0.421,0.356,0.279,0.358,-0.191,0.06,0.449,0.44,0.417,0.41,0.406,0.38,0.423,0.346,0.372,0.417,0.411,0.388,Stability,A4_HUMAN,Low,Human
+0.282,0.474,0.331,0.414,0.487,0.491,0.196,0.191,0.501,0.504,0.413,0.483,0.489,0.196,0.248,0.256,0.49,0.511,0.518,0.391,0.206,0.232,0.276,0.306,0.273,0.428,0.407,0.413,0.438,0.465,0.507,0.455,0.021,0.259,0.246,0.417,0.427,0.418,0.466,0.471,0.466,0.495,0.216,0.17,0.354,0.243,0.257,0.374,0.357,0.183,0.481,0.492,0.483,0.488,0.495,0.486,0.486,0.477,0.494,0.5,0.461,0.282,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+0.254,0.237,0.187,0.2,0.249,0.242,-0.049,0.077,0.236,0.251,0.244,0.173,0.213,-0.053,0.062,0.216,0.225,0.175,0.178,0.227,0.033,0.129,0.175,0.207,0.042,0.162,0.207,0.208,0.261,0.217,0.164,0.138,0.08,0.044,0.12,0.134,0.217,0.19,0.196,0.228,0.2,0.205,0.005,0.005,0.244,0.078,0.336,0.292,0.317,0.106,0.217,0.255,0.226,0.222,0.236,0.249,0.223,0.221,0.214,0.234,0.289,0.168,Binding,ACE2_HUMAN,Medium,Human
+0.331,0.423,0.506,0.514,0.497,0.517,0.463,0.5,0.531,0.53,0.534,0.526,0.539,0.408,0.444,0.474,0.493,0.505,0.515,0.521,0.509,0.524,0.517,0.493,0.524,0.531,0.538,0.533,0.502,0.545,0.5,0.414,0.298,0.528,0.53,0.506,0.528,0.541,0.533,0.533,0.538,0.537,0.471,0.181,0.528,0.51,0.419,0.477,0.451,0.223,0.486,0.486,0.5,0.508,0.501,0.498,0.493,0.506,0.494,0.509,0.554,0.516,Activity,ADRB2_HUMAN,Medium,Human
+0.138,0.396,0.474,0.464,0.429,0.445,-0.287,0.327,0.32,0.385,0.415,0.376,0.478,-0.245,-0.264,0.196,0.328,0.347,0.262,0.479,-0.219,0.14,0.388,0.395,0.013,0.321,0.417,0.413,0.282,0.376,0.45,0.405,0.057,-0.039,0.057,0.339,0.206,0.218,0.367,0.399,0.402,0.444,-0.228,-0.238,0.379,0.258,0.328,0.401,0.328,0.243,0.337,0.404,0.344,0.447,0.359,0.365,0.341,0.343,0.312,0.359,0.257,-0.036,Activity,AICDA_HUMAN,Medium,Human
+0.358,0.418,0.419,0.407,0.386,0.398,-0.301,0.13,0.22,0.24,0.374,0.07,0.423,-0.209,0.286,0.321,0.261,0.297,0.26,0.062,-0.052,-0.218,-0.051,-0.038,-0.317,-0.089,-0.05,-0.121,0.056,0.396,0.254,0.228,0.09,-0.287,-0.139,-0.275,0.392,0.402,0.356,0.424,0.422,0.374,-0.185,-0.194,0.165,0.114,0.251,0.112,0.433,0.353,0.446,0.326,0.204,0.349,0.342,0.349,0.343,0.339,0.361,0.343,0.347,0.397,Stability,AMFR_HUMAN,Medium,Human
+0.246,0.443,0.496,0.506,0.477,0.464,0.093,0.436,0.401,0.605,0.57,0.613,0.662,0.326,0.408,0.428,0.557,0.674,0.613,0.506,0.555,0.525,0.532,0.542,0.566,0.561,0.534,0.559,0.508,0.558,0.583,0.528,0.265,0.601,0.47,0.405,0.585,0.479,0.438,0.58,0.506,0.481,0.356,0.096,0.483,0.412,0.368,0.469,0.407,0.213,0.555,0.519,0.53,0.537,0.529,0.541,0.523,0.54,0.564,0.556,0.607,0.41,Activity,AMIE_PSEAE,High,Prokaryote
+0.562,0.521,0.445,0.472,0.524,0.528,0.525,0.452,0.503,0.524,0.527,0.522,0.543,0.535,0.592,0.598,0.609,0.598,0.57,0.021,0.462,0.493,0.509,0.5,0.497,0.519,0.513,0.469,0.494,0.557,0.542,0.525,0.318,0.483,0.503,0.475,0.537,0.549,0.538,0.545,0.556,0.543,0.549,0.42,0.496,0.523,0.471,0.458,0.422,0.165,0.566,0.555,0.572,0.578,0.572,0.584,0.569,0.583,0.577,0.589,0.585,0.601,Activity,ANCSZ,Medium,Eukaryote
+0.277,0.413,0.411,0.43,0.431,0.423,0.306,0.386,0.43,0.463,0.409,0.296,0.405,0.332,0.297,0.511,0.497,0.499,0.435,0.434,0.372,0.439,0.448,0.403,0.362,0.437,0.473,0.461,0.418,0.46,0.409,0.31,0.073,0.394,0.396,0.43,0.405,0.415,0.439,0.465,0.46,0.471,0.285,0.267,0.45,0.408,0.725,0.651,0.738,0.636,0.513,0.532,0.527,0.529,0.517,0.508,0.503,0.526,0.494,0.533,0.604,0.511,Stability,ARGR_ECOLI,Medium,Prokaryote
+0.688,0.27,0.668,0.661,0.53,0.646,0.216,0.529,0.188,0.164,0.206,-0.019,0.396,0.174,0.123,0.151,0.27,0.367,0.315,0.336,0.239,0.076,0.304,0.568,0.067,0.251,0.362,0.372,0.366,0.734,0.358,0.338,0.295,0.266,0.213,0.341,0.544,0.469,0.388,0.615,0.57,0.402,0.146,0.138,0.207,0.17,0.287,0.307,0.432,-0.005,0.352,0.367,0.419,0.283,0.255,0.47,0.295,0.288,0.421,0.374,0.242,0.204,Binding,B2L11_HUMAN,Low,Human
+0.216,0.332,0.395,0.399,0.394,0.403,0.212,0.256,0.438,0.454,0.425,0.425,0.451,0.331,0.421,0.423,0.479,0.458,0.499,0.332,0.335,0.346,0.376,0.343,0.144,0.391,0.403,0.382,0.307,0.338,0.345,0.314,0.273,0.354,0.261,0.299,0.392,0.338,0.365,0.421,0.386,0.405,0.21,0.113,0.314,0.242,0.558,0.451,0.644,0.566,0.434,0.444,0.457,0.472,0.463,0.463,0.46,0.458,0.471,0.461,0.54,0.579,Stability,BBC1_YEAST,High,Eukaryote
+0.364,0.418,0.357,0.364,0.365,0.382,0.1,0.248,0.383,0.501,0.394,0.329,0.406,0.166,0.182,0.301,0.51,0.521,0.421,0.407,0.122,0.362,0.22,0.376,0.207,0.421,0.281,0.402,0.435,0.607,0.549,0.496,0.07,0.245,0.294,0.431,0.44,0.44,0.483,0.392,0.396,0.436,0.116,0.109,0.205,0.175,0.579,0.537,0.683,0.564,0.549,0.554,0.563,0.579,0.561,0.581,0.529,0.556,0.543,0.562,0.516,0.296,Stability,BCHB_CHLTE,Medium,Prokaryote
+0.429,0.652,0.668,0.683,0.658,0.672,0.109,0.418,0.642,0.684,0.646,0.622,0.654,0.354,0.497,0.602,0.675,0.553,0.427,0.602,0.518,0.512,0.502,0.492,0.539,0.594,0.598,0.534,0.429,0.598,0.691,0.673,0.148,0.51,0.501,0.459,0.594,0.595,0.596,0.672,0.68,0.681,0.471,0.032,0.636,0.54,0.542,0.633,0.61,0.297,0.66,0.649,0.653,0.673,0.674,0.67,0.678,0.659,0.67,0.685,0.688,0.544,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.429,0.652,0.668,0.683,0.658,0.672,0.109,0.418,0.642,0.684,0.646,0.622,0.654,0.354,0.497,0.602,0.675,0.553,0.427,0.602,0.518,0.512,0.502,0.492,0.539,0.594,0.598,0.534,0.429,0.598,0.691,0.673,0.148,0.51,0.501,0.459,0.594,0.595,0.596,0.672,0.68,0.681,0.471,0.032,0.636,0.54,0.542,0.633,0.61,0.297,0.66,0.649,0.653,0.673,0.674,0.67,0.678,0.659,0.67,0.685,0.688,0.544,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.429,0.652,0.668,0.683,0.658,0.672,0.109,0.418,0.642,0.684,0.646,0.622,0.654,0.354,0.497,0.602,0.675,0.553,0.427,0.602,0.518,0.512,0.502,0.492,0.539,0.594,0.598,0.534,0.429,0.598,0.691,0.673,0.148,0.51,0.501,0.459,0.594,0.595,0.596,0.672,0.68,0.681,0.471,0.032,0.636,0.54,0.542,0.633,0.61,0.297,0.66,0.649,0.653,0.673,0.674,0.67,0.678,0.659,0.67,0.685,0.688,0.544,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.429,0.652,0.668,0.683,0.658,0.672,0.109,0.418,0.642,0.684,0.646,0.622,0.654,0.354,0.497,0.602,0.675,0.553,0.427,0.602,0.518,0.512,0.502,0.492,0.539,0.594,0.598,0.534,0.429,0.598,0.691,0.673,0.148,0.51,0.501,0.459,0.594,0.595,0.596,0.672,0.68,0.681,0.471,0.032,0.636,0.54,0.542,0.633,0.61,0.297,0.66,0.649,0.653,0.673,0.674,0.67,0.678,0.659,0.67,0.685,0.688,0.544,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.501,0.385,0.441,0.432,0.424,0.466,0.125,0.189,0.387,0.407,0.494,0.404,0.447,0.134,0.392,0.455,0.515,0.497,0.394,0.296,0.145,0.379,0.405,0.361,0.305,0.473,0.457,0.442,0.423,0.438,0.539,0.482,0.22,0.151,0.179,0.409,0.456,0.456,0.514,0.475,0.476,0.533,0.229,0.132,0.534,0.457,0.44,0.469,0.116,0.12,0.515,0.501,0.506,0.502,0.512,0.504,0.513,0.512,0.504,0.519,0.539,0.481,OrganismalFitness,BRCA1_HUMAN,Low,Human
+0.481,0.346,0.44,0.445,0.447,0.481,0.095,0.477,0.095,-0.03,0.48,0.183,0.103,0.088,0.103,0.416,0.51,0.464,0.473,0.033,0.09,0.494,0.534,0.482,0.533,0.483,0.496,0.499,0.014,0.4,0.297,0.328,0.14,0.119,0.185,0.138,0.42,0.425,0.426,0.419,0.415,0.414,0.163,0.093,0.497,0.047,0.052,0.053,-0.091,0.285,0.436,0.477,0.473,0.424,0.455,0.436,0.456,0.438,0.453,0.465,0.0,0.016,OrganismalFitness,BRCA2_HUMAN,,Human
+0.393,0.371,0.351,0.357,0.433,0.436,-0.025,0.451,0.374,0.378,0.06,0.428,0.492,-0.014,-0.022,-0.007,0.483,0.405,0.426,0.343,0.396,0.363,0.378,0.349,0.254,0.454,0.428,0.465,0.37,0.475,0.457,0.36,0.11,0.354,0.374,0.392,0.412,0.419,0.433,0.439,0.441,0.452,-0.024,-0.033,0.294,-0.002,0.513,0.539,0.544,0.227,0.498,0.497,0.519,0.52,0.531,0.51,0.528,0.519,0.521,0.529,0.313,0.185,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+0.175,0.233,0.238,0.233,0.238,0.236,0.174,0.187,0.236,0.253,0.25,0.235,0.264,0.161,0.199,0.195,0.212,0.218,0.225,0.229,0.173,0.227,0.242,0.254,0.232,0.276,0.279,0.3,0.313,0.239,0.21,0.146,0.087,0.216,0.253,0.291,0.218,0.242,0.266,0.238,0.244,0.246,0.213,0.156,0.268,0.251,0.106,0.143,0.164,0.088,0.177,0.182,0.188,0.171,0.189,0.176,0.18,0.192,0.178,0.188,0.287,0.238,OrganismalFitness,CALM1_HUMAN,High,Human
+0.407,0.345,0.317,0.372,0.346,0.33,0.374,0.424,0.329,0.363,0.183,0.196,0.199,0.254,0.286,0.203,0.277,0.183,0.124,0.258,0.191,0.25,0.269,0.279,0.203,0.198,0.266,0.204,0.399,0.445,0.183,0.177,0.132,0.202,0.262,0.492,0.38,0.386,0.473,0.338,0.343,0.426,0.092,0.16,0.26,0.112,0.402,0.393,0.347,0.319,0.235,0.225,0.216,0.235,0.214,0.215,0.22,0.208,0.228,0.223,0.301,0.222,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+0.343,0.244,0.221,0.204,0.24,0.256,0.104,-0.003,0.258,0.269,0.375,0.407,0.382,0.07,0.105,0.4,0.414,0.439,0.448,0.202,0.146,0.255,0.26,0.246,0.072,0.259,0.268,0.292,0.179,0.258,0.38,0.337,0.262,0.001,0.206,0.16,0.246,0.266,0.226,0.262,0.264,0.232,0.088,0.042,0.339,0.263,0.368,0.37,0.393,0.158,0.43,0.429,0.425,0.416,0.415,0.424,0.41,0.428,0.436,0.434,0.493,0.379,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.343,0.244,0.221,0.204,0.24,0.256,0.104,-0.003,0.258,0.269,0.375,0.407,0.382,0.07,0.105,0.4,0.414,0.439,0.448,0.202,0.146,0.255,0.26,0.246,0.072,0.259,0.268,0.292,0.179,0.258,0.38,0.337,0.262,0.001,0.206,0.16,0.246,0.266,0.226,0.262,0.264,0.232,0.088,0.042,0.339,0.263,0.368,0.37,0.393,0.158,0.43,0.429,0.425,0.416,0.415,0.424,0.41,0.428,0.436,0.434,0.493,0.379,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.159,0.179,0.162,0.166,0.169,0.177,0.04,0.1,0.18,0.183,0.175,0.069,0.073,0.063,0.09,0.154,0.182,0.188,0.194,0.006,0.043,0.081,0.173,0.12,0.043,0.194,0.182,0.179,0.068,0.187,0.199,0.179,0.027,0.035,0.047,0.166,0.17,0.172,0.189,0.184,0.184,0.194,0.061,0.028,0.181,0.106,0.106,0.181,0.036,0.037,0.176,0.177,0.178,0.185,0.18,0.181,0.177,0.178,0.177,0.183,0.177,0.114,Activity,CAS9_STRP1,Medium,Prokaryote
+0.382,0.543,0.61,0.622,0.626,0.632,0.054,0.478,0.644,0.653,0.59,0.595,0.624,0.264,0.574,0.639,0.638,0.554,0.514,0.603,0.233,0.512,0.517,0.532,0.505,0.556,0.554,0.526,0.567,0.608,0.599,0.542,0.217,0.092,0.482,0.52,0.533,0.562,0.582,0.628,0.619,0.628,0.422,-0.006,0.616,0.563,0.414,0.558,0.499,0.263,0.553,0.571,0.571,0.57,0.575,0.567,0.585,0.571,0.576,0.59,0.652,0.513,Activity,CASP3_HUMAN,High,Human
+0.372,0.516,0.619,0.617,0.609,0.613,0.057,0.47,0.6,0.627,0.585,0.611,0.632,0.306,0.598,0.632,0.622,0.572,0.562,0.611,0.206,0.532,0.529,0.558,0.529,0.579,0.589,0.563,0.572,0.642,0.596,0.538,0.332,0.103,0.543,0.521,0.517,0.594,0.586,0.599,0.628,0.628,0.461,0.019,0.646,0.596,0.479,0.612,0.573,0.329,0.585,0.57,0.562,0.587,0.608,0.602,0.591,0.578,0.596,0.605,0.643,0.533,Activity,CASP7_HUMAN,Medium,Human
+0.533,0.559,0.599,0.606,0.627,0.63,0.68,0.558,0.565,0.528,0.614,0.638,0.657,0.564,0.689,0.688,0.658,0.615,0.644,0.041,0.621,0.603,0.619,0.602,0.641,0.627,0.593,0.561,0.587,0.649,0.611,0.619,0.439,0.626,0.611,0.633,0.649,0.642,0.665,0.622,0.62,0.638,0.44,0.427,0.315,0.46,0.475,0.4,0.679,0.572,0.663,0.652,0.676,0.668,0.687,0.662,0.67,0.653,0.663,0.674,0.62,0.686,Stability,CATR_CHLRE,High,Eukaryote
+0.644,0.718,0.744,0.745,0.716,0.729,0.482,0.577,0.745,0.754,0.77,0.752,0.736,0.529,0.688,0.736,0.702,0.727,0.718,0.739,0.486,0.522,0.699,0.692,0.547,0.665,0.673,0.674,0.71,0.732,0.71,0.654,0.104,0.509,0.642,0.703,0.699,0.706,0.732,0.746,0.736,0.748,0.51,0.302,0.631,0.646,0.701,0.689,0.82,0.773,0.772,0.768,0.778,0.768,0.782,0.78,0.77,0.772,0.769,0.779,0.716,0.835,Stability,CBPA2_HUMAN,Medium,Human
+0.334,0.358,0.364,0.383,0.371,0.379,0.202,0.251,0.374,0.377,0.347,0.346,0.372,0.085,0.226,0.335,0.342,0.329,0.339,0.35,0.356,0.26,0.285,0.288,0.349,0.273,0.29,0.273,0.31,0.381,0.37,0.344,0.197,0.361,0.283,0.266,0.384,0.332,0.327,0.394,0.363,0.363,0.304,0.068,0.376,0.373,0.237,0.268,0.309,0.079,0.324,0.324,0.331,0.337,0.337,0.335,0.332,0.332,0.335,0.34,0.386,0.289,OrganismalFitness,CBS_HUMAN,Medium,Human
+0.498,0.548,0.674,0.706,0.633,0.685,-0.266,0.611,0.623,0.613,0.615,0.65,0.648,-0.271,0.735,0.699,0.682,0.609,0.606,0.648,0.486,0.589,0.581,0.587,0.553,0.577,0.589,0.531,0.569,0.633,0.667,0.653,0.423,0.425,0.505,0.54,0.601,0.6,0.623,0.676,0.698,0.701,0.621,-0.32,0.369,0.561,0.167,0.155,0.614,0.588,0.695,0.626,0.69,0.676,0.709,0.668,0.677,0.718,0.664,0.695,0.642,0.751,Stability,CBX4_HUMAN,High,Human
+0.395,0.496,0.522,0.544,0.522,0.53,-0.02,0.24,0.372,0.396,0.457,0.393,0.458,-0.007,-0.01,0.434,0.489,0.493,0.313,0.488,0.03,-0.0,-0.094,0.156,-0.123,0.074,-0.029,0.054,0.45,0.479,0.548,0.532,0.118,0.008,0.092,0.367,0.461,0.453,0.495,0.512,0.506,0.535,0.012,-0.053,0.47,0.019,0.301,0.43,0.339,0.236,0.461,0.425,0.432,0.462,0.46,0.467,0.454,0.477,0.502,0.473,0.464,0.26,Activity,CCDB_ECOLI,High,Prokaryote
+0.395,0.496,0.522,0.544,0.522,0.53,-0.02,0.24,0.372,0.396,0.457,0.393,0.458,-0.007,-0.01,0.434,0.489,0.493,0.313,0.488,0.03,-0.0,-0.094,0.156,-0.123,0.074,-0.029,0.054,0.45,0.479,0.548,0.532,0.118,0.008,0.092,0.367,0.461,0.453,0.495,0.512,0.506,0.535,0.012,-0.053,0.47,0.019,0.301,0.43,0.339,0.236,0.461,0.425,0.432,0.462,0.46,0.467,0.454,0.477,0.502,0.473,0.464,0.26,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+0.271,0.279,0.282,0.289,0.274,0.282,0.316,0.309,0.333,0.343,0.358,0.344,0.356,0.325,0.358,0.362,0.347,0.339,0.341,0.352,0.367,0.366,0.323,0.326,0.369,0.356,0.346,0.36,0.322,0.364,0.3,0.251,0.13,0.367,0.365,0.363,0.367,0.375,0.376,0.324,0.324,0.323,0.354,0.175,0.362,0.361,0.278,0.306,0.291,0.173,0.314,0.305,0.316,0.318,0.313,0.317,0.328,0.323,0.327,0.327,0.348,0.356,Binding,CCR5_HUMAN,High,Human
+0.189,0.247,0.199,0.211,0.234,0.232,0.051,0.233,0.257,0.261,0.046,0.05,0.072,0.09,0.093,0.054,0.05,0.123,0.217,-0.013,0.096,0.124,0.118,0.119,0.042,0.12,0.112,0.093,0.244,0.283,0.194,0.159,0.077,0.075,0.12,0.093,0.203,0.21,0.202,0.233,0.235,0.231,0.083,0.05,0.104,0.086,0.316,0.258,0.301,0.174,0.158,0.18,0.173,0.196,0.186,0.195,0.204,0.17,0.185,0.195,0.312,0.244,Binding,CD19_HUMAN,Low,Human
+0.516,0.581,0.61,0.628,0.606,0.622,0.556,0.585,0.603,0.625,0.516,0.615,0.644,0.55,0.63,0.652,0.657,0.652,0.605,0.632,0.57,0.57,0.603,0.566,0.594,0.586,0.591,0.582,0.577,0.621,0.588,0.528,0.216,0.609,0.584,0.57,0.642,0.638,0.631,0.651,0.645,0.641,0.608,0.105,0.546,0.637,0.579,0.558,0.632,0.24,0.622,0.624,0.629,0.646,0.653,0.648,0.645,0.634,0.64,0.658,0.655,0.643,Expression,CP2C9_HUMAN,High,Human
+0.516,0.581,0.61,0.628,0.606,0.622,0.556,0.585,0.603,0.625,0.516,0.615,0.644,0.55,0.63,0.652,0.657,0.652,0.605,0.632,0.57,0.57,0.603,0.566,0.594,0.586,0.591,0.582,0.577,0.621,0.588,0.528,0.216,0.609,0.584,0.57,0.642,0.638,0.631,0.651,0.645,0.641,0.608,0.105,0.546,0.637,0.579,0.558,0.632,0.24,0.622,0.624,0.629,0.646,0.653,0.648,0.645,0.634,0.64,0.658,0.655,0.643,Binding,CP2C9_HUMAN,High,Human
+0.389,0.504,0.432,0.445,0.47,0.477,0.26,0.4,0.461,0.461,0.481,0.546,0.605,0.449,0.686,0.518,0.484,0.423,0.415,0.406,0.449,0.455,0.502,0.514,0.443,0.587,0.53,0.504,0.501,0.532,0.401,0.389,0.244,0.341,0.474,0.546,0.497,0.536,0.576,0.468,0.513,0.53,0.416,0.326,0.494,0.538,0.562,0.541,0.653,0.62,0.531,0.52,0.532,0.557,0.58,0.556,0.539,0.58,0.548,0.556,0.575,0.72,Stability,CSN4_MOUSE,Medium,Eukaryote
+0.357,0.451,0.435,0.438,0.474,0.445,0.154,0.453,0.516,0.547,0.483,0.427,0.398,0.153,0.428,0.478,0.547,0.498,0.467,0.304,0.174,0.206,0.225,0.102,0.117,0.224,0.255,0.248,0.434,0.42,0.469,0.414,0.231,0.16,0.146,0.213,0.38,0.387,0.393,0.438,0.441,0.448,0.154,0.064,0.462,0.256,0.515,0.501,0.654,0.482,0.549,0.508,0.494,0.552,0.517,0.551,0.538,0.525,0.517,0.538,0.6,0.571,Stability,CUE1_YEAST,Medium,Eukaryote
+0.53,0.625,0.613,0.582,0.629,0.631,0.073,0.475,0.649,0.657,0.439,0.066,0.064,0.068,0.053,0.036,0.05,0.072,0.152,0.438,0.039,0.042,0.114,0.082,0.009,0.047,0.068,0.194,0.361,0.623,0.563,0.574,0.016,0.096,0.125,0.156,0.536,0.536,0.532,0.63,0.629,0.618,0.004,0.024,0.009,0.028,0.292,0.316,0.505,0.396,0.495,0.491,0.496,0.505,0.494,0.502,0.494,0.49,0.487,0.498,0.345,0.208,Activity,D7PM05_CLYGR,Low,Eukaryote
+0.679,0.584,0.607,0.577,0.609,0.616,0.723,0.629,0.524,0.535,0.52,0.55,0.608,0.743,0.764,0.728,0.584,0.487,0.443,0.65,0.574,0.583,0.569,0.538,0.605,0.607,0.564,0.562,0.495,0.614,0.634,0.621,0.511,0.576,0.638,0.58,0.653,0.7,0.667,0.643,0.661,0.636,0.581,0.233,0.372,0.528,0.637,0.453,0.672,0.315,0.483,0.37,0.539,0.499,0.52,0.492,0.508,0.486,0.504,0.504,0.521,0.745,OrganismalFitness,DLG4_HUMAN,Low,Human
+0.486,0.442,0.487,0.491,0.526,0.539,0.49,0.437,0.483,0.522,0.469,0.557,0.588,0.411,0.543,0.581,0.543,0.478,0.471,0.444,0.378,0.389,0.373,0.371,0.407,0.42,0.407,0.367,0.395,0.49,0.556,0.537,0.246,0.367,0.392,0.307,0.47,0.478,0.443,0.538,0.544,0.54,0.527,0.076,0.443,0.5,0.439,0.308,0.459,0.135,0.442,0.375,0.444,0.433,0.444,0.43,0.43,0.464,0.46,0.469,0.504,0.535,Binding,DLG4_RAT,Low,Eukaryote
+0.171,0.198,0.218,0.23,0.2,0.259,-0.071,0.317,0.368,0.386,0.085,0.126,0.131,0.074,0.209,0.255,0.337,0.337,0.37,0.235,-0.068,0.045,0.033,0.066,0.041,0.107,0.057,0.071,0.235,0.298,0.431,0.351,0.054,-0.013,-0.023,0.031,0.212,0.2,0.211,0.243,0.227,0.231,0.106,-0.048,0.299,0.129,0.568,0.56,0.66,0.556,0.424,0.436,0.516,0.507,0.491,0.476,0.472,0.444,0.436,0.49,0.556,0.481,Stability,DN7A_SACS2,Medium,Prokaryote
+0.738,0.75,0.759,0.757,0.767,0.764,0.756,0.732,0.776,0.79,0.773,0.756,0.785,0.827,0.816,0.817,0.803,0.753,0.798,0.781,0.669,0.736,0.776,0.75,0.782,0.765,0.774,0.721,0.764,0.775,0.729,0.714,0.66,0.78,0.776,0.805,0.808,0.816,0.823,0.774,0.787,0.787,0.67,0.164,0.443,0.684,0.643,0.528,0.779,0.785,0.757,0.761,0.76,0.793,0.789,0.803,0.789,0.786,0.794,0.786,0.726,0.801,Stability,DNJA1_HUMAN,High,Human
+0.418,0.453,0.421,0.439,0.494,0.501,0.057,0.432,0.406,0.403,0.525,0.472,0.501,0.224,0.458,0.486,0.531,0.508,0.531,0.518,0.34,0.418,0.402,0.408,0.315,0.402,0.365,0.348,0.395,0.448,0.471,0.469,0.216,0.202,0.321,0.252,0.473,0.49,0.494,0.489,0.491,0.507,0.359,-0.177,0.513,0.439,0.38,0.394,0.419,0.439,0.492,0.481,0.497,0.507,0.491,0.512,0.477,0.493,0.504,0.497,0.567,0.531,Stability,DOCK1_MOUSE,High,Eukaryote
+0.362,0.454,0.458,0.462,0.464,0.464,-0.007,0.373,0.488,0.491,0.496,0.476,0.491,0.094,0.424,0.49,0.512,0.518,0.512,0.494,0.286,0.39,0.274,0.326,0.388,0.412,0.432,0.448,0.376,0.466,0.477,0.442,0.124,0.38,0.292,0.286,0.434,0.372,0.389,0.459,0.441,0.45,0.393,-0.003,0.481,0.444,0.256,0.416,0.348,0.134,0.471,0.435,0.442,0.45,0.464,0.456,0.472,0.468,0.483,0.475,0.501,0.429,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.362,0.454,0.458,0.462,0.464,0.464,-0.007,0.373,0.488,0.491,0.496,0.476,0.491,0.094,0.424,0.49,0.512,0.518,0.512,0.494,0.286,0.39,0.274,0.326,0.388,0.412,0.432,0.448,0.376,0.466,0.477,0.442,0.124,0.38,0.292,0.286,0.434,0.372,0.389,0.459,0.441,0.45,0.393,-0.003,0.481,0.444,0.256,0.416,0.348,0.134,0.471,0.435,0.442,0.45,0.464,0.456,0.472,0.468,0.483,0.475,0.501,0.429,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.362,0.454,0.458,0.462,0.464,0.464,-0.007,0.373,0.488,0.491,0.496,0.476,0.491,0.094,0.424,0.49,0.512,0.518,0.512,0.494,0.286,0.39,0.274,0.326,0.388,0.412,0.432,0.448,0.376,0.466,0.477,0.442,0.124,0.38,0.292,0.286,0.434,0.372,0.389,0.459,0.441,0.45,0.393,-0.003,0.481,0.444,0.256,0.416,0.348,0.134,0.471,0.435,0.442,0.45,0.464,0.456,0.472,0.468,0.483,0.475,0.501,0.429,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.362,0.454,0.458,0.462,0.464,0.464,-0.007,0.373,0.488,0.491,0.496,0.476,0.491,0.094,0.424,0.49,0.512,0.518,0.512,0.494,0.286,0.39,0.274,0.326,0.388,0.412,0.432,0.448,0.376,0.466,0.477,0.442,0.124,0.38,0.292,0.286,0.434,0.372,0.389,0.459,0.441,0.45,0.393,-0.003,0.481,0.444,0.256,0.416,0.348,0.134,0.471,0.435,0.442,0.45,0.464,0.456,0.472,0.468,0.483,0.475,0.501,0.429,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.115,0.095,0.18,0.178,0.187,0.188,0.04,0.19,0.19,0.198,0.154,0.204,0.209,0.194,0.207,0.195,0.172,0.154,0.113,0.183,0.164,0.152,0.201,0.183,0.176,0.164,0.199,0.183,0.163,0.178,0.133,0.135,0.142,0.18,0.184,0.193,0.207,0.213,0.215,0.195,0.201,0.199,0.194,0.134,0.187,0.185,0.055,0.149,0.114,0.036,0.189,0.186,0.168,0.188,0.22,0.197,0.183,0.19,0.2,0.199,0.148,0.224,Activity,ENVZ_ECOLI,High,Prokaryote
+0.369,0.397,0.238,0.367,0.388,0.377,0.056,0.353,0.341,0.366,0.359,0.415,0.389,0.11,0.055,0.035,0.012,0.048,0.095,0.388,0.38,0.358,0.408,0.419,0.263,0.394,0.401,0.391,0.374,0.389,0.355,0.322,0.253,0.368,0.373,0.404,0.392,0.396,0.407,0.39,0.392,0.392,0.343,-0.04,0.334,0.395,0.355,0.325,0.364,0.231,0.215,0.261,0.274,0.24,0.214,0.233,0.241,0.258,0.24,0.263,0.15,0.102,OrganismalFitness,ENV_HV1B9,Medium,Virus
+0.338,0.303,0.322,0.323,0.339,0.345,-0.001,0.322,0.345,0.344,0.298,0.32,0.336,-0.009,-0.006,0.004,0.046,0.069,0.16,0.337,0.35,0.358,0.371,0.364,0.345,0.358,0.358,0.354,0.362,0.35,0.325,0.29,0.189,0.344,0.36,0.358,0.359,0.366,0.362,0.367,0.369,0.365,0.233,-0.013,0.321,0.293,0.212,0.245,0.119,0.072,0.18,0.179,0.203,0.226,0.188,0.202,0.2,0.192,0.193,0.209,0.167,0.098,OrganismalFitness,ENV_HV1BR,Medium,Virus
+0.615,0.706,0.705,0.708,0.732,0.733,-0.32,0.684,0.781,0.789,0.762,0.787,0.789,-0.378,0.768,0.803,0.81,0.769,0.744,0.741,0.555,0.599,0.685,0.656,0.641,0.664,0.684,0.705,0.735,0.795,0.752,0.738,0.677,0.612,0.603,0.644,0.716,0.711,0.728,0.741,0.737,0.735,0.625,-0.253,0.618,0.63,0.665,0.604,0.828,0.76,0.805,0.801,0.806,0.811,0.82,0.809,0.802,0.806,0.8,0.812,0.799,0.805,Stability,EPHB2_HUMAN,High,Human
+0.386,0.368,0.254,0.25,0.244,0.268,0.449,0.415,0.405,0.416,0.449,0.48,0.49,0.481,0.463,0.469,0.423,0.373,0.295,0.047,0.457,0.499,0.55,0.471,0.505,0.536,0.596,0.598,0.506,0.381,0.424,0.066,0.489,0.553,0.471,0.521,0.494,0.465,0.492,0.447,0.441,0.439,0.468,0.459,0.537,0.463,0.461,0.491,-0.139,0.022,0.41,0.406,0.451,0.44,0.415,0.406,0.432,0.452,0.438,0.446,0.525,0.488,Expression,ERBB2_HUMAN,Low,Human
+0.258,0.399,0.389,0.415,0.387,0.387,0.187,0.315,0.345,0.413,0.336,0.305,0.335,0.169,0.27,0.267,0.3,0.281,0.323,0.32,0.121,0.196,0.268,0.286,0.27,0.254,0.311,0.272,0.379,0.379,0.434,0.406,0.075,0.184,0.272,0.261,0.31,0.329,0.326,0.403,0.407,0.397,0.193,0.121,0.289,0.262,0.557,0.489,0.545,0.368,0.366,0.334,0.347,0.343,0.378,0.347,0.347,0.365,0.369,0.37,0.391,0.212,Stability,ESTA_BACSU,High,Prokaryote
+0.06,0.395,0.395,0.44,0.428,0.43,0.016,0.32,0.382,0.395,0.437,0.382,0.393,-0.091,-0.001,0.049,0.383,0.366,0.434,0.391,-0.075,-0.122,-0.104,-0.005,0.098,0.125,-0.021,0.298,0.403,0.375,0.461,0.441,0.037,-0.036,-0.124,0.433,0.062,0.011,0.425,0.38,0.367,0.444,-0.064,-0.081,0.199,-0.048,-0.008,0.246,0.069,0.035,0.431,0.447,0.423,0.46,0.453,0.457,0.43,0.443,0.447,0.448,0.27,-0.126,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.06,0.395,0.395,0.44,0.428,0.43,0.016,0.32,0.382,0.395,0.437,0.382,0.393,-0.091,-0.001,0.049,0.383,0.366,0.434,0.391,-0.075,-0.122,-0.104,-0.005,0.098,0.125,-0.021,0.298,0.403,0.375,0.461,0.441,0.037,-0.036,-0.124,0.433,0.062,0.011,0.425,0.38,0.367,0.444,-0.064,-0.081,0.199,-0.048,-0.008,0.246,0.069,0.035,0.431,0.447,0.423,0.46,0.453,0.457,0.43,0.443,0.447,0.448,0.27,-0.126,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.38,0.431,0.449,0.458,0.449,0.441,0.023,0.297,0.372,0.456,0.509,0.269,0.433,0.059,0.449,0.536,0.524,0.48,0.386,0.464,0.169,0.272,0.343,0.288,0.156,0.477,0.451,0.292,0.394,0.52,0.409,0.336,0.276,0.065,0.121,0.148,0.355,0.348,0.302,0.415,0.415,0.389,0.395,0.008,0.501,0.447,0.582,0.551,0.588,0.525,0.527,0.483,0.514,0.512,0.5,0.528,0.499,0.489,0.506,0.515,0.595,0.585,Stability,FECA_ECOLI,High,Prokaryote
+0.423,0.394,0.479,0.482,0.487,0.498,0.18,0.342,0.237,0.237,0.171,0.17,0.164,0.172,0.163,0.147,0.188,0.265,0.378,0.287,0.221,0.213,0.218,0.279,0.169,0.229,0.193,0.25,0.22,0.478,0.315,0.319,-0.077,0.095,0.168,0.23,0.398,0.41,0.342,0.484,0.49,0.409,0.18,0.184,0.177,0.21,0.685,0.603,0.687,0.589,0.306,0.397,0.443,0.402,0.397,0.403,0.369,0.368,0.296,0.399,0.581,0.368,Stability,FKBP3_HUMAN,Medium,Human
+0.237,0.403,0.507,0.567,0.486,0.532,0.337,-0.004,0.513,0.575,0.623,0.458,0.462,0.34,0.392,0.528,0.668,0.67,0.655,0.507,0.319,0.355,0.383,0.372,0.364,0.424,0.493,0.451,0.579,0.628,0.651,0.563,0.303,0.279,0.331,0.325,0.56,0.555,0.557,0.508,0.509,0.503,0.395,0.301,0.604,0.446,0.287,0.568,0.308,0.114,0.627,0.634,0.613,0.625,0.632,0.634,0.62,0.617,0.619,0.638,0.558,0.408,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+0.253,0.25,0.245,0.248,0.242,0.241,0.182,0.221,0.24,0.24,0.238,0.272,0.284,0.318,0.278,0.266,0.281,0.261,0.248,0.186,0.175,0.189,0.168,0.167,0.158,0.118,0.081,0.134,0.164,0.225,0.255,0.247,0.025,0.175,0.169,0.265,0.258,0.257,0.278,0.258,0.257,0.274,0.107,0.192,0.157,0.126,0.145,0.183,0.201,0.167,0.227,0.22,0.217,0.214,0.226,0.224,0.219,0.221,0.224,0.222,0.221,0.245,Binding,GCN4_YEAST,Low,Eukaryote
+0.442,0.429,0.439,0.446,0.451,0.452,0.205,0.398,0.466,0.462,0.391,0.423,0.46,0.159,0.247,0.448,0.397,0.356,0.411,0.401,0.253,0.385,0.38,0.409,0.326,0.439,0.425,0.384,0.397,0.415,0.423,0.393,0.276,0.321,0.394,0.356,0.432,0.448,0.437,0.451,0.47,0.461,0.233,0.121,0.432,0.26,0.357,0.421,0.387,0.101,0.389,0.335,0.402,0.378,0.391,0.399,0.377,0.391,0.375,0.405,0.461,0.338,OrganismalFitness,GDIA_HUMAN,Low,Human
+0.649,0.644,0.672,0.673,0.679,0.679,0.049,0.635,0.667,0.661,0.524,0.098,0.103,0.078,0.131,0.103,0.108,0.149,0.29,0.598,0.078,0.105,0.181,0.098,0.046,0.188,0.298,0.642,0.647,0.678,0.607,0.615,0.074,0.066,0.182,0.629,0.649,0.651,0.672,0.683,0.685,0.706,0.024,-0.014,0.041,0.036,0.512,0.508,0.713,0.602,0.591,0.6,0.61,0.617,0.609,0.611,0.605,0.606,0.587,0.607,0.623,0.449,Activity,GFP_AEQVI,Low,Eukaryote
+0.227,0.134,0.207,0.21,0.18,0.162,0.307,0.524,0.385,0.361,0.388,0.417,0.412,0.339,0.392,0.453,0.425,0.38,0.47,0.163,0.37,0.388,0.364,0.407,0.363,0.39,0.414,0.42,0.49,0.349,0.435,0.276,0.281,0.423,0.372,0.414,0.423,0.392,0.417,0.391,0.362,0.405,0.405,0.384,0.428,0.341,0.33,0.413,0.31,0.177,0.529,0.513,0.507,0.479,0.5,0.455,0.5,0.434,0.464,0.507,0.443,0.411,Expression,GLPA_HUMAN,Low,Human
+0.405,0.521,0.507,0.538,0.54,0.546,0.486,0.45,0.539,0.491,0.533,0.455,0.515,0.526,0.598,0.626,0.647,0.535,0.579,0.534,0.544,0.516,0.514,0.453,0.532,0.509,0.446,0.524,0.448,0.504,0.428,0.422,0.464,0.536,0.506,0.404,0.517,0.506,0.436,0.562,0.561,0.532,0.584,0.323,0.53,0.572,0.672,0.527,0.69,0.516,0.621,0.649,0.638,0.638,0.647,0.628,0.629,0.645,0.625,0.65,0.556,0.607,OrganismalFitness,GRB2_HUMAN,Medium,Human
+0.315,0.41,0.113,0.147,0.351,0.36,0.241,0.399,0.43,0.485,0.666,0.45,0.496,0.285,0.416,0.574,0.686,0.588,0.577,0.245,0.292,0.306,0.362,0.368,0.324,0.325,0.21,0.345,0.515,0.579,0.636,0.504,0.191,0.262,0.287,0.486,0.43,0.443,0.531,0.374,0.43,0.507,0.31,0.285,0.614,0.376,0.667,0.724,0.73,0.601,0.692,0.698,0.71,0.687,0.714,0.711,0.696,0.714,0.719,0.723,0.768,0.609,Stability,HCP_LAMBD,Medium,Virus
+0.291,0.37,0.324,0.324,0.332,0.336,0.055,0.234,0.359,0.37,0.303,0.303,0.293,0.134,0.206,0.474,0.387,0.34,0.385,0.337,-0.024,0.232,0.302,0.264,0.16,0.294,0.308,0.247,0.214,0.312,0.332,0.307,0.103,0.135,0.101,0.229,0.327,0.298,0.353,0.327,0.341,0.359,0.089,0.038,0.433,0.138,0.252,0.26,0.38,0.321,0.368,0.408,0.345,0.392,0.411,0.436,0.442,0.428,0.451,0.417,0.414,0.219,Stability,HECD1_HUMAN,Medium,Human
+0.418,0.409,0.403,0.407,0.42,0.418,0.126,0.103,0.432,0.442,0.388,0.383,0.396,0.142,0.387,0.408,0.393,0.418,0.429,0.108,0.349,0.373,0.37,0.375,0.358,0.387,0.376,0.385,0.402,0.435,0.42,0.41,0.093,0.377,0.381,0.377,0.42,0.43,0.438,0.427,0.434,0.44,0.323,0.135,0.384,0.378,0.314,0.358,0.301,0.146,0.357,0.337,0.353,0.363,0.377,0.374,0.371,0.373,0.368,0.377,0.431,0.415,Activity,HEM3_HUMAN,Medium,Human
+0.472,0.542,0.558,0.557,0.533,0.531,0.146,0.273,0.468,0.508,0.472,0.411,0.477,0.057,0.113,0.44,0.411,0.475,0.48,0.454,0.325,0.402,0.433,0.478,0.404,0.485,0.464,0.516,0.539,0.524,0.4,0.374,0.143,0.397,0.453,0.585,0.488,0.496,0.616,0.573,0.548,0.582,0.14,0.04,0.299,0.02,0.378,0.424,0.529,0.392,0.444,0.35,0.404,0.427,0.44,0.407,0.418,0.45,0.431,0.424,0.521,0.222,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+0.491,0.421,0.382,0.394,0.456,0.451,0.158,0.282,0.282,0.274,0.275,0.341,0.315,0.046,0.01,0.317,0.509,0.374,0.378,0.302,0.451,0.162,0.175,0.221,0.435,0.197,0.168,0.209,0.213,0.476,0.413,0.401,0.228,0.308,0.247,0.219,0.44,0.34,0.33,0.437,0.361,0.36,0.172,0.069,0.477,0.496,0.323,0.444,-0.184,0.105,0.493,0.492,0.486,0.498,0.489,0.492,0.48,0.5,0.485,0.499,0.529,0.438,OrganismalFitness,HMDH_HUMAN,Low,Human
+0.384,0.396,0.437,0.446,0.436,0.436,0.246,0.38,0.439,0.444,0.407,0.452,0.475,0.041,0.152,0.26,0.334,0.38,0.364,0.299,0.42,0.433,0.427,0.44,0.423,0.432,0.458,0.43,0.456,0.46,0.483,0.462,0.252,0.405,0.415,0.412,0.43,0.437,0.433,0.451,0.455,0.452,0.213,-0.075,0.413,0.415,0.12,0.309,0.101,0.029,0.322,0.319,0.318,0.336,0.33,0.326,0.34,0.335,0.318,0.336,0.404,0.358,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.384,0.396,0.437,0.446,0.436,0.436,0.246,0.38,0.439,0.444,0.407,0.452,0.475,0.041,0.152,0.26,0.334,0.38,0.364,0.299,0.42,0.433,0.427,0.44,0.423,0.432,0.458,0.43,0.456,0.46,0.483,0.462,0.252,0.405,0.415,0.412,0.43,0.437,0.433,0.451,0.455,0.452,0.213,-0.075,0.413,0.415,0.12,0.309,0.101,0.029,0.322,0.319,0.318,0.336,0.33,0.326,0.34,0.335,0.318,0.336,0.404,0.358,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.384,0.396,0.437,0.446,0.436,0.436,0.246,0.38,0.439,0.444,0.407,0.452,0.475,0.041,0.152,0.26,0.334,0.38,0.364,0.299,0.42,0.433,0.427,0.44,0.423,0.432,0.458,0.43,0.456,0.46,0.483,0.462,0.252,0.405,0.415,0.412,0.43,0.437,0.433,0.451,0.455,0.452,0.213,-0.075,0.413,0.415,0.12,0.309,0.101,0.029,0.322,0.319,0.318,0.336,0.33,0.326,0.34,0.335,0.318,0.336,0.404,0.358,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.406,0.439,0.401,0.42,0.412,0.42,0.143,0.358,0.448,0.467,0.392,0.398,0.422,0.16,0.238,0.419,0.442,0.442,0.417,0.226,0.368,0.37,0.378,0.368,0.367,0.386,0.384,0.382,0.361,0.438,0.393,0.352,0.174,0.388,0.365,0.344,0.422,0.416,0.402,0.43,0.435,0.431,0.242,0.052,0.426,0.407,0.378,0.41,0.392,0.164,0.433,0.436,0.43,0.432,0.438,0.442,0.438,0.44,0.44,0.446,0.472,0.405,OrganismalFitness,HXK4_HUMAN,Medium,Human
+0.406,0.439,0.401,0.42,0.412,0.42,0.143,0.358,0.448,0.467,0.392,0.398,0.422,0.16,0.238,0.419,0.442,0.442,0.417,0.226,0.368,0.37,0.378,0.368,0.367,0.386,0.384,0.382,0.361,0.438,0.393,0.352,0.174,0.388,0.365,0.344,0.422,0.416,0.402,0.43,0.435,0.431,0.242,0.052,0.426,0.407,0.378,0.41,0.392,0.164,0.433,0.436,0.43,0.432,0.438,0.442,0.438,0.44,0.44,0.446,0.472,0.405,Expression,HXK4_HUMAN,Medium,Human
+0.347,0.317,0.268,0.264,0.364,0.361,-0.005,0.297,0.299,0.336,0.013,0.018,0.014,0.018,0.025,0.01,0.02,0.013,0.094,0.212,0.308,0.328,0.374,0.377,0.004,0.011,0.097,0.002,0.302,0.368,0.25,0.253,0.171,0.304,0.332,0.337,0.329,0.354,0.348,0.383,0.399,0.401,0.007,0.02,0.016,0.019,0.209,0.209,0.231,0.1,0.075,0.114,0.126,0.135,0.111,0.106,0.089,0.098,0.067,0.109,0.056,0.018,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+0.328,0.499,0.541,0.539,0.527,0.537,0.177,0.387,0.261,0.255,0.534,0.54,0.565,0.193,0.477,0.55,0.599,0.556,0.515,0.49,0.361,0.438,0.366,0.411,0.451,0.458,0.482,0.443,0.459,0.401,0.52,0.463,0.238,0.458,0.477,0.527,0.463,0.47,0.495,0.529,0.532,0.539,0.371,0.15,0.562,0.516,0.321,0.518,0.475,0.244,0.545,0.534,0.555,0.575,0.577,0.559,0.551,0.571,0.56,0.583,0.616,0.506,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+0.233,0.336,0.444,0.438,0.437,0.436,0.022,0.315,0.371,0.416,0.353,0.352,0.35,0.192,0.324,0.435,0.318,0.282,0.167,0.48,0.206,0.214,0.257,0.312,0.347,0.233,0.372,0.255,0.259,0.457,0.349,0.313,0.097,0.127,0.282,0.362,0.282,0.349,0.401,0.382,0.389,0.445,0.279,0.045,0.339,0.395,0.418,0.368,0.475,0.342,0.304,0.274,0.343,0.33,0.37,0.395,0.33,0.312,0.414,0.37,0.319,0.349,Stability,ILF3_HUMAN,High,Human
+0.094,0.101,0.206,0.197,0.231,0.231,0.326,0.128,0.326,0.356,0.453,0.413,0.434,0.377,0.394,0.449,0.429,0.479,0.45,0.185,0.326,0.199,0.218,0.228,0.318,0.335,0.329,0.298,0.36,0.304,0.396,0.334,0.236,0.291,0.249,0.298,0.259,0.219,0.245,0.263,0.221,0.244,0.3,0.248,0.349,0.329,0.592,0.526,0.519,0.49,0.42,0.407,0.429,0.428,0.427,0.429,0.409,0.414,0.419,0.426,0.608,0.522,Stability,ISDH_STAAW,High,Prokaryote
+0.273,0.307,0.286,0.312,0.302,0.308,0.212,0.272,0.278,0.287,0.277,0.316,0.359,0.186,0.169,0.364,0.316,0.264,0.226,0.178,0.226,0.238,0.27,0.289,0.238,0.362,0.344,0.329,0.31,0.292,0.328,0.266,-0.102,0.168,0.3,0.356,0.301,0.326,0.366,0.331,0.346,0.378,0.206,0.187,0.31,0.196,0.19,0.27,0.16,0.09,0.314,0.314,0.307,0.3,0.302,0.289,0.326,0.322,0.315,0.32,0.376,0.216,Expression,KCNE1_HUMAN,Medium,Human
+0.273,0.307,0.286,0.312,0.302,0.308,0.212,0.272,0.278,0.287,0.277,0.316,0.359,0.186,0.169,0.364,0.316,0.264,0.226,0.178,0.226,0.238,0.27,0.289,0.238,0.362,0.344,0.329,0.31,0.292,0.328,0.266,-0.102,0.168,0.3,0.356,0.301,0.326,0.366,0.331,0.346,0.378,0.206,0.187,0.31,0.196,0.19,0.27,0.16,0.09,0.314,0.314,0.307,0.3,0.302,0.289,0.326,0.322,0.315,0.32,0.376,0.216,Activity,KCNE1_HUMAN,Medium,Human
+0.449,0.421,0.298,0.292,0.211,0.21,0.454,0.151,0.303,0.306,0.307,0.216,0.234,0.253,0.197,0.229,0.28,0.258,0.248,0.382,0.468,0.513,0.495,0.463,0.502,0.491,0.455,0.477,0.481,0.46,0.433,0.318,0.344,0.471,0.525,0.511,0.499,0.554,0.538,0.472,0.517,0.509,0.394,0.223,0.193,0.291,0.219,-0.079,-0.202,0.086,0.53,0.39,0.481,0.435,0.444,0.442,0.476,0.441,0.435,0.475,0.271,0.411,Activity,KCNH2_HUMAN,Medium,Human
+0.318,0.335,0.294,0.304,0.342,0.347,0.089,0.231,0.298,0.293,0.347,0.318,0.329,0.117,0.324,0.374,0.367,0.364,0.371,0.19,0.292,0.266,0.234,0.243,0.308,0.3,0.288,0.309,0.238,0.37,0.325,0.304,0.161,0.285,0.273,0.235,0.337,0.33,0.31,0.346,0.346,0.338,0.24,0.022,0.296,0.326,0.247,0.274,0.266,0.126,0.348,0.333,0.348,0.342,0.354,0.341,0.357,0.343,0.341,0.354,0.361,0.308,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+0.318,0.335,0.294,0.304,0.342,0.347,0.089,0.231,0.298,0.293,0.347,0.318,0.329,0.117,0.324,0.374,0.367,0.364,0.371,0.19,0.292,0.266,0.234,0.243,0.308,0.3,0.288,0.309,0.238,0.37,0.325,0.304,0.161,0.285,0.273,0.235,0.337,0.33,0.31,0.346,0.346,0.338,0.24,0.022,0.296,0.326,0.247,0.274,0.266,0.126,0.348,0.333,0.348,0.342,0.354,0.341,0.357,0.343,0.341,0.354,0.361,0.308,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+0.25,0.53,0.437,0.622,0.599,0.603,0.199,0.423,0.538,0.606,0.566,0.597,0.621,0.212,0.261,0.487,0.601,0.645,0.662,0.498,0.285,0.42,0.518,0.54,0.353,0.578,0.577,0.568,0.638,0.627,0.637,0.586,0.148,0.243,0.485,0.586,0.445,0.524,0.588,0.586,0.609,0.63,0.253,0.076,0.559,0.386,0.456,0.582,0.551,0.267,0.587,0.572,0.588,0.578,0.579,0.59,0.574,0.583,0.591,0.599,0.641,0.329,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+0.28,0.426,0.485,0.494,0.481,0.498,0.104,0.465,0.467,0.559,0.444,0.537,0.564,0.179,0.327,0.382,0.515,0.58,0.578,0.499,0.276,0.396,0.462,0.495,0.365,0.492,0.525,0.479,0.552,0.542,0.537,0.46,0.139,0.307,0.398,0.555,0.37,0.404,0.527,0.482,0.495,0.54,0.176,0.051,0.483,0.367,0.398,0.507,0.492,0.145,0.51,0.505,0.511,0.51,0.513,0.52,0.518,0.517,0.52,0.527,0.411,0.334,Activity,LGK_LIPST,Medium,Eukaryote
+0.314,0.303,0.189,0.158,0.258,0.266,0.361,0.172,0.413,0.416,0.436,0.37,0.371,0.307,0.363,0.336,0.31,0.426,0.454,0.336,0.382,0.391,0.386,0.385,0.362,0.369,0.373,0.298,0.339,0.349,0.313,0.179,0.203,0.353,0.437,0.343,0.362,0.41,0.348,0.343,0.372,0.313,0.307,0.346,0.318,0.35,0.17,0.259,0.163,0.103,0.235,0.317,0.398,0.33,0.279,0.295,0.352,0.348,0.272,0.332,0.419,0.39,Expression,LYAM1_HUMAN,Medium,Human
+0.613,0.636,0.633,0.635,0.621,0.623,0.405,0.643,0.633,0.626,0.619,0.355,0.6,0.453,0.429,0.471,0.48,0.551,0.413,0.627,0.448,0.494,0.522,0.566,0.46,0.515,0.447,0.536,0.49,0.602,0.621,0.609,0.392,0.329,0.565,0.509,0.589,0.647,0.657,0.629,0.661,0.683,0.312,0.079,0.425,0.246,0.596,0.327,0.638,0.611,0.646,0.615,0.617,0.638,0.623,0.638,0.636,0.642,0.651,0.639,0.719,0.727,Stability,MAFG_MOUSE,Medium,Eukaryote
+0.551,0.723,0.708,0.727,0.758,0.763,-0.058,0.572,0.658,0.744,0.727,0.422,0.512,-0.146,0.025,0.742,0.699,0.685,0.74,0.715,-0.032,0.058,0.576,0.615,0.193,0.561,0.537,0.568,0.714,0.748,0.729,0.67,0.649,-0.282,0.063,0.033,0.586,0.658,0.634,0.729,0.751,0.742,-0.064,-0.203,0.629,0.433,0.521,0.66,0.773,0.576,0.793,0.76,0.792,0.745,0.758,0.741,0.763,0.776,0.774,0.78,0.813,0.72,Stability,MBD11_ARATH,Medium,Eukaryote
+0.454,0.544,0.538,0.563,0.582,0.584,0.517,0.52,0.55,0.562,0.585,0.55,0.562,0.452,0.515,0.542,0.59,0.596,0.599,0.583,0.526,0.506,0.466,0.492,0.53,0.519,0.519,0.488,0.454,0.548,0.57,0.53,0.286,0.471,0.501,0.525,0.512,0.539,0.563,0.558,0.565,0.573,0.542,0.347,0.571,0.575,0.35,0.42,0.498,0.192,0.54,0.548,0.548,0.55,0.562,0.554,0.557,0.561,0.556,0.566,0.575,0.529,Activity,MET_HUMAN,Medium,Human
+0.176,0.198,0.237,0.241,0.223,0.227,0.209,0.163,0.198,0.184,0.033,0.167,0.182,0.17,0.199,0.195,0.178,0.189,0.141,0.184,0.209,0.117,0.069,0.016,0.183,0.089,0.057,0.076,-0.067,0.218,0.176,0.222,0.106,0.169,0.058,0.004,0.193,0.119,0.093,0.227,0.205,0.202,0.196,0.125,0.135,0.184,0.059,0.0,0.133,0.006,0.161,0.162,0.148,0.166,0.17,0.163,0.163,0.169,0.177,0.171,0.15,0.186,OrganismalFitness,MK01_HUMAN,Medium,Human
+0.225,0.352,0.419,0.424,0.409,0.411,-0.014,0.36,0.412,0.419,0.399,0.416,0.432,-0.011,0.236,0.273,0.382,0.382,0.412,0.387,0.078,0.366,0.38,0.402,0.313,0.383,0.392,0.382,0.424,0.4,0.44,0.41,0.039,0.131,0.315,0.296,0.265,0.327,0.315,0.374,0.396,0.395,0.177,-0.036,0.329,0.235,0.006,0.193,0.123,-0.026,0.319,0.28,0.288,0.293,0.315,0.308,0.295,0.305,0.332,0.313,0.23,0.203,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+0.352,0.399,0.367,0.376,0.383,0.39,0.204,0.339,0.395,0.402,0.349,0.38,0.4,0.213,0.349,0.397,0.339,0.296,0.204,0.364,0.296,0.31,0.264,0.257,0.326,0.317,0.32,0.326,0.301,0.394,0.351,0.324,0.208,0.277,0.351,0.281,0.35,0.388,0.346,0.387,0.405,0.383,0.284,0.103,0.4,0.355,0.293,0.363,0.062,0.099,0.305,0.279,0.291,0.309,0.317,0.3,0.295,0.308,0.339,0.321,0.394,0.366,OrganismalFitness,MSH2_HUMAN,Medium,Human
+0.371,0.612,0.709,0.718,0.696,0.704,0.314,0.644,0.673,0.688,0.581,0.692,0.708,0.19,0.352,0.405,0.527,0.608,0.644,0.658,0.311,0.454,0.6,0.662,0.488,0.663,0.711,0.684,0.727,0.679,0.697,0.652,0.291,0.353,0.495,0.665,0.422,0.496,0.638,0.657,0.679,0.714,0.324,-0.031,0.477,0.341,0.449,0.574,0.583,0.225,0.544,0.531,0.559,0.569,0.561,0.573,0.551,0.571,0.557,0.572,0.58,0.378,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+0.25,0.271,0.249,0.257,0.258,0.257,0.205,0.194,0.292,0.293,0.387,0.299,0.32,0.169,0.414,0.482,0.348,0.276,0.285,0.253,0.443,0.162,0.217,0.265,0.437,0.239,0.264,0.245,0.295,0.296,0.22,0.189,0.143,0.451,0.307,0.212,0.378,0.315,0.254,0.363,0.309,0.269,0.278,0.111,0.379,0.411,0.239,0.338,0.377,0.092,0.324,0.289,0.299,0.297,0.305,0.316,0.335,0.319,0.347,0.325,0.332,0.376,OrganismalFitness,MTHR_HUMAN,Low,Human
+0.143,0.248,0.334,0.362,0.381,0.412,0.155,0.251,0.428,0.419,0.519,0.482,0.478,0.171,0.411,0.583,0.588,0.507,0.247,0.321,0.351,0.247,0.336,0.311,0.342,0.287,0.364,0.313,0.336,0.283,0.279,0.178,0.126,0.419,0.332,0.282,0.382,0.329,0.292,0.411,0.401,0.382,0.112,0.045,0.431,0.294,0.392,0.427,0.479,0.359,0.517,0.495,0.545,0.512,0.525,0.529,0.513,0.519,0.506,0.525,0.571,0.5,Stability,MYO3_YEAST,High,Eukaryote
+0.364,0.328,0.334,0.333,0.363,0.364,0.002,0.335,0.373,0.348,0.015,0.019,0.02,0.023,0.026,0.02,0.028,0.031,0.107,0.269,0.352,0.382,0.408,0.413,0.018,0.042,0.108,0.03,0.352,0.377,0.27,0.279,0.126,0.356,0.373,0.415,0.39,0.402,0.424,0.425,0.426,0.441,0.019,0.015,0.027,0.025,0.259,0.27,0.278,0.123,0.129,0.171,0.166,0.184,0.172,0.172,0.155,0.163,0.129,0.171,0.135,0.081,OrganismalFitness,NCAP_I34A1,Medium,Virus
+0.428,0.548,0.586,0.584,0.607,0.619,0.582,0.553,0.584,0.608,0.56,0.568,0.572,0.627,0.638,0.674,0.651,0.582,0.602,0.558,0.564,0.557,0.556,0.564,0.576,0.579,0.606,0.54,0.577,0.64,0.525,0.506,0.551,0.494,0.565,0.577,0.578,0.624,0.628,0.601,0.618,0.621,0.454,0.395,0.342,0.408,0.589,0.401,0.643,0.642,0.634,0.626,0.639,0.648,0.669,0.641,0.635,0.629,0.642,0.645,0.566,0.596,Stability,NKX31_HUMAN,High,Human
+0.731,0.694,0.58,0.593,0.708,0.716,0.318,0.532,0.662,0.72,0.702,0.312,0.408,0.268,0.394,0.621,0.678,0.68,0.714,0.276,0.414,0.475,0.478,0.418,0.414,0.48,0.584,0.41,0.513,0.672,0.7,0.558,0.285,0.349,0.462,0.595,0.713,0.674,0.735,0.737,0.704,0.736,0.403,0.288,0.744,0.48,0.486,0.62,-0.024,0.184,0.628,0.6,0.623,0.58,0.624,0.612,0.632,0.674,0.663,0.646,0.696,0.437,Activity,NPC1_HUMAN,Low,Human
+0.731,0.694,0.58,0.593,0.708,0.716,0.318,0.532,0.662,0.72,0.702,0.312,0.408,0.268,0.394,0.621,0.678,0.68,0.714,0.276,0.414,0.475,0.478,0.418,0.414,0.48,0.584,0.41,0.513,0.672,0.7,0.558,0.285,0.349,0.462,0.595,0.713,0.674,0.735,0.737,0.704,0.736,0.403,0.288,0.744,0.48,0.486,0.62,-0.024,0.184,0.628,0.6,0.623,0.58,0.624,0.612,0.632,0.674,0.663,0.646,0.696,0.437,Activity,NPC1_HUMAN,Low,Human
+0.569,0.565,0.501,0.49,0.584,0.584,0.035,0.39,0.625,0.638,-0.076,0.162,0.448,-0.075,-0.098,0.005,0.161,0.541,0.575,0.343,0.583,0.633,0.584,0.571,0.047,0.53,0.627,0.462,0.654,0.635,0.404,0.441,-0.169,0.512,0.532,0.551,0.592,0.615,0.621,0.628,0.638,0.632,-0.105,-0.127,-0.111,-0.111,0.422,0.414,0.448,0.178,0.19,0.228,0.235,0.297,0.253,0.29,0.263,0.247,0.199,0.254,0.292,0.145,OrganismalFitness,NRAM_I33A0,Low,Virus
+0.271,0.453,0.564,0.596,0.591,0.594,-0.005,0.389,0.602,0.665,0.603,0.6,0.645,0.299,0.419,0.46,0.526,0.583,0.599,0.501,0.301,0.454,0.546,0.518,0.412,0.579,0.577,0.549,0.542,0.587,0.623,0.553,0.147,0.36,0.423,0.575,0.433,0.456,0.604,0.587,0.586,0.635,0.386,0.015,0.603,0.429,0.483,0.581,0.534,0.317,0.55,0.521,0.544,0.568,0.556,0.571,0.567,0.559,0.556,0.573,0.679,0.564,Expression,NUD15_HUMAN,High,Human
+0.392,0.589,0.58,0.571,0.602,0.613,0.285,0.465,0.638,0.599,0.56,0.323,0.346,0.322,0.39,0.405,0.494,0.532,0.533,0.56,0.394,0.549,0.572,0.589,0.325,0.497,0.44,0.525,0.572,0.638,0.67,0.706,0.183,0.365,0.419,0.425,0.505,0.493,0.508,0.628,0.601,0.622,0.375,0.246,0.381,0.357,0.682,0.641,0.73,0.658,0.709,0.695,0.699,0.721,0.726,0.741,0.716,0.716,0.719,0.723,0.685,0.575,Stability,NUSA_ECOLI,High,Prokaryote
+0.405,0.404,0.417,0.431,0.451,0.434,0.23,0.423,0.485,0.461,0.398,0.361,0.421,0.389,0.561,0.51,0.518,0.525,0.477,0.521,0.264,0.339,0.315,0.347,0.38,0.385,0.461,0.334,0.378,0.502,0.394,0.387,0.321,0.269,0.275,0.372,0.395,0.399,0.426,0.413,0.417,0.424,0.445,0.144,0.395,0.531,0.676,0.56,0.759,0.64,0.487,0.457,0.454,0.507,0.487,0.485,0.485,0.486,0.483,0.488,0.41,0.588,Stability,NUSG_MYCTU,High,Prokaryote
+0.54,0.697,0.759,0.777,0.745,0.757,0.384,0.455,0.751,0.763,0.691,0.735,0.759,0.392,0.779,0.819,0.797,0.786,0.755,0.723,0.688,0.689,0.663,0.687,0.589,0.667,0.694,0.726,0.604,0.735,0.694,0.679,0.549,0.262,0.461,0.575,0.687,0.695,0.704,0.755,0.76,0.764,0.543,0.115,0.507,0.633,0.63,0.54,0.782,0.747,0.762,0.76,0.761,0.755,0.772,0.76,0.773,0.767,0.776,0.771,0.825,0.772,Stability,OBSCN_HUMAN,High,Human
+-0.211,-0.17,0.339,0.188,0.332,0.332,-0.149,0.093,0.175,0.19,0.064,-0.1,-0.115,-0.069,-0.151,0.14,0.221,0.089,0.21,0.233,0.189,0.174,0.159,0.177,0.171,0.195,0.166,0.192,0.233,0.327,0.062,0.023,0.245,-0.05,0.025,0.012,0.19,0.181,0.178,0.271,0.26,0.29,0.268,-0.07,0.339,0.327,0.452,0.424,0.437,0.322,0.118,0.075,0.085,0.129,0.073,0.127,0.114,0.116,0.094,0.107,0.286,0.171,Stability,ODP2_GEOSE,High,Prokaryote
+0.17,0.433,0.445,0.517,0.463,0.466,0.347,0.569,0.563,0.6,0.397,0.499,0.56,0.29,0.508,0.518,0.552,0.449,0.482,0.555,0.528,0.549,0.538,0.578,0.552,0.594,0.598,0.577,0.57,0.511,0.422,0.326,0.17,0.503,0.53,0.485,0.537,0.532,0.503,0.485,0.482,0.488,0.386,0.08,0.493,0.472,0.632,0.518,0.632,0.224,0.402,0.353,0.412,0.421,0.482,0.487,0.418,0.421,0.49,0.472,0.582,0.522,Expression,OPSD_HUMAN,High,Human
+0.524,0.564,0.484,0.522,0.55,0.552,0.127,0.404,0.59,0.592,0.546,0.573,0.581,0.142,0.411,0.512,0.531,0.51,0.534,0.565,0.453,0.461,0.522,0.545,0.476,0.555,0.526,0.54,0.574,0.599,0.475,0.409,0.085,0.429,0.511,0.569,0.522,0.57,0.614,0.567,0.584,0.599,0.279,0.092,0.524,0.463,0.649,0.606,0.674,0.364,0.526,0.547,0.544,0.574,0.556,0.567,0.56,0.548,0.541,0.564,0.616,0.49,Activity,OTC_HUMAN,Medium,Human
+0.116,0.243,0.184,0.201,0.224,0.227,0.192,0.134,0.141,0.146,0.434,0.604,0.581,0.205,0.563,0.588,0.319,0.362,0.408,0.155,0.184,0.178,0.221,0.275,0.345,0.26,0.38,0.307,0.148,0.259,0.406,0.388,0.132,0.168,0.225,0.349,0.201,0.198,0.308,0.242,0.22,0.262,0.365,0.184,0.543,0.548,0.532,0.533,0.61,0.537,0.395,0.342,0.348,0.401,0.422,0.421,0.374,0.409,0.414,0.404,0.642,0.632,Stability,OTU7A_HUMAN,High,Human
+0.142,0.272,0.327,0.33,0.316,0.314,0.192,0.222,0.287,0.295,0.355,0.329,0.335,0.172,0.238,0.312,0.342,0.37,0.371,0.217,0.147,0.228,0.277,0.269,0.218,0.304,0.326,0.302,0.358,0.343,0.359,0.324,0.092,0.196,0.212,0.269,0.268,0.267,0.286,0.313,0.311,0.316,0.198,0.079,0.336,0.24,0.34,0.371,0.347,0.14,0.348,0.339,0.332,0.339,0.344,0.341,0.344,0.346,0.334,0.352,0.402,0.282,Activity,OXDA_RHOTO,High,Eukaryote
+0.142,0.272,0.327,0.33,0.316,0.314,0.192,0.222,0.287,0.295,0.355,0.329,0.335,0.172,0.238,0.312,0.342,0.37,0.371,0.217,0.147,0.228,0.277,0.269,0.218,0.304,0.326,0.302,0.358,0.343,0.359,0.324,0.092,0.196,0.212,0.269,0.268,0.267,0.286,0.313,0.311,0.316,0.198,0.079,0.336,0.24,0.34,0.371,0.347,0.14,0.348,0.339,0.332,0.339,0.344,0.341,0.344,0.346,0.334,0.352,0.402,0.282,Expression,OXDA_RHOTO,High,Eukaryote
+0.46,0.467,0.365,0.392,0.462,0.466,-0.052,0.347,0.35,0.356,0.525,0.485,0.546,-0.103,-0.089,0.398,0.479,0.511,0.531,0.176,0.361,0.477,0.479,0.446,0.429,0.494,0.5,0.526,0.369,0.486,0.51,0.455,0.171,0.305,0.496,0.386,0.454,0.511,0.444,0.47,0.514,0.454,-0.098,-0.118,0.532,0.007,0.428,0.522,0.445,0.212,0.487,0.496,0.496,0.494,0.497,0.509,0.496,0.501,0.497,0.51,0.533,0.232,OrganismalFitness,P53_HUMAN,Low,Human
+0.46,0.467,0.365,0.392,0.462,0.466,-0.052,0.347,0.35,0.356,0.525,0.485,0.546,-0.103,-0.089,0.398,0.479,0.511,0.531,0.176,0.361,0.477,0.479,0.446,0.429,0.494,0.5,0.526,0.369,0.486,0.51,0.455,0.171,0.305,0.496,0.386,0.454,0.511,0.444,0.47,0.514,0.454,-0.098,-0.118,0.532,0.007,0.428,0.522,0.445,0.212,0.487,0.496,0.496,0.494,0.497,0.509,0.496,0.501,0.497,0.51,0.533,0.232,OrganismalFitness,P53_HUMAN,Low,Human
+0.46,0.467,0.365,0.392,0.462,0.466,-0.052,0.347,0.35,0.356,0.525,0.485,0.546,-0.103,-0.089,0.398,0.479,0.511,0.531,0.176,0.361,0.477,0.479,0.446,0.429,0.494,0.5,0.526,0.369,0.486,0.51,0.455,0.171,0.305,0.496,0.386,0.454,0.511,0.444,0.47,0.514,0.454,-0.098,-0.118,0.532,0.007,0.428,0.522,0.445,0.212,0.487,0.496,0.496,0.494,0.497,0.509,0.496,0.501,0.497,0.51,0.533,0.232,OrganismalFitness,P53_HUMAN,Low,Human
+0.46,0.467,0.365,0.392,0.462,0.466,-0.052,0.347,0.35,0.356,0.525,0.485,0.546,-0.103,-0.089,0.398,0.479,0.511,0.531,0.176,0.361,0.477,0.479,0.446,0.429,0.494,0.5,0.526,0.369,0.486,0.51,0.455,0.171,0.305,0.496,0.386,0.454,0.511,0.444,0.47,0.514,0.454,-0.098,-0.118,0.532,0.007,0.428,0.522,0.445,0.212,0.487,0.496,0.496,0.494,0.497,0.509,0.496,0.501,0.497,0.51,0.533,0.232,OrganismalFitness,P53_HUMAN,Low,Human
+0.508,0.58,0.603,0.617,0.564,0.578,0.364,0.526,0.644,0.623,0.569,0.552,0.588,0.319,0.551,0.559,0.605,0.596,0.571,0.618,0.419,0.507,0.478,0.558,0.521,0.584,0.589,0.571,0.653,0.522,0.553,0.469,0.359,0.474,0.499,0.536,0.515,0.526,0.547,0.56,0.568,0.581,0.465,0.032,0.543,0.51,0.337,0.452,0.512,0.125,0.543,0.527,0.487,0.549,0.551,0.519,0.526,0.553,0.561,0.562,0.622,0.61,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+0.663,0.617,0.541,0.55,0.654,0.648,0.474,0.569,0.637,0.663,0.688,0.662,0.678,0.476,0.566,0.648,0.716,0.684,0.695,0.541,0.638,0.665,0.666,0.692,0.638,0.698,0.7,0.676,0.666,0.674,0.703,0.635,0.261,0.638,0.648,0.64,0.689,0.692,0.688,0.683,0.687,0.684,0.605,-0.027,0.67,0.654,0.361,0.612,0.514,0.197,0.664,0.66,0.672,0.693,0.685,0.689,0.691,0.69,0.718,0.705,0.715,0.622,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+0.403,0.395,0.377,0.392,0.402,0.413,0.054,0.314,0.428,0.438,0.435,0.408,0.43,0.083,0.391,0.434,0.448,0.335,0.27,0.422,0.132,0.381,0.364,0.345,0.351,0.395,0.399,0.372,0.33,0.423,0.393,0.356,0.141,0.171,0.37,0.379,0.384,0.404,0.412,0.414,0.424,0.426,0.35,0.049,0.43,0.407,0.379,0.45,0.404,0.152,0.441,0.437,0.439,0.448,0.448,0.445,0.449,0.449,0.446,0.457,0.468,0.391,Activity,PAI1_HUMAN,,Human
+0.518,0.519,0.499,0.508,0.539,0.543,0.041,0.358,0.152,0.165,0.037,0.054,0.101,0.024,0.029,0.025,0.038,0.041,0.356,0.325,0.456,0.493,0.533,0.538,0.219,0.408,0.444,0.429,0.438,0.584,0.384,0.374,0.107,0.436,0.475,0.541,0.546,0.561,0.572,0.588,0.592,0.584,0.028,0.019,0.031,0.02,0.244,0.232,0.172,0.095,0.154,0.173,0.184,0.187,0.168,0.178,0.158,0.173,0.125,0.181,0.202,0.132,OrganismalFitness,PA_I34A1,Medium,Virus
+0.211,0.44,0.731,0.706,0.352,0.335,0.654,0.552,0.711,0.721,0.647,0.757,0.783,0.768,0.823,0.714,0.749,0.714,0.758,0.652,0.685,0.678,0.596,0.699,0.533,0.616,0.596,0.656,0.602,0.587,0.572,0.49,0.303,0.565,0.511,0.602,0.597,0.564,0.606,0.416,0.445,0.422,0.623,0.395,0.599,0.621,0.2,0.502,0.669,0.41,0.603,0.543,0.553,0.551,0.555,0.57,0.568,0.574,0.583,0.569,0.737,0.775,Activity,PHOT_CHLRE,High,Eukaryote
+0.222,0.327,0.68,0.645,0.631,0.661,0.585,0.5,0.67,0.707,0.654,0.576,0.674,0.66,0.646,0.665,0.67,0.52,0.548,0.656,0.456,0.671,0.63,0.61,0.559,0.672,0.681,0.647,0.658,0.689,0.692,0.644,0.531,0.539,0.626,0.667,0.607,0.671,0.72,0.677,0.705,0.73,0.556,-0.189,0.623,0.51,0.547,0.574,0.691,0.669,0.659,0.627,0.498,0.563,0.575,0.604,0.608,0.649,0.626,0.62,0.74,0.742,Stability,PIN1_HUMAN,High,Human
+0.579,0.52,0.548,0.555,0.549,0.553,0.506,0.47,0.538,0.561,0.467,0.446,0.436,0.557,0.652,0.675,0.626,0.586,0.507,0.542,0.513,0.454,0.477,0.45,0.547,0.493,0.465,0.428,0.468,0.536,0.403,0.386,0.45,0.558,0.508,0.481,0.596,0.548,0.538,0.573,0.537,0.538,0.468,0.412,0.282,0.333,0.453,0.295,0.693,0.63,0.572,0.579,0.572,0.59,0.602,0.609,0.593,0.594,0.598,0.598,0.47,0.593,Stability,PITX2_HUMAN,High,Human
+0.179,0.188,0.218,0.234,0.249,0.254,0.198,0.187,0.165,0.206,0.253,0.278,0.31,0.314,0.347,0.464,0.298,0.236,0.238,0.277,0.31,0.289,0.278,0.325,0.3,0.291,0.31,0.292,0.295,0.29,0.326,0.317,0.067,0.335,0.278,0.311,0.262,0.285,0.308,0.288,0.28,0.293,0.266,0.279,0.26,0.3,0.512,0.379,0.493,0.469,0.258,0.303,0.317,0.26,0.284,0.287,0.286,0.299,0.314,0.299,0.324,0.464,Stability,PKN1_HUMAN,High,Human
+0.423,0.39,0.377,0.411,0.46,0.473,-0.036,0.336,0.5,0.499,0.292,-0.059,0.042,-0.079,-0.056,0.177,0.395,0.404,0.426,0.356,0.339,0.39,0.381,0.377,0.138,0.388,0.383,0.369,0.393,0.495,0.386,0.319,0.007,0.049,0.274,0.355,0.342,0.385,0.413,0.387,0.43,0.458,-0.065,-0.066,0.342,-0.054,0.193,0.339,0.113,0.046,0.342,0.361,0.364,0.369,0.364,0.365,0.365,0.365,0.374,0.374,0.11,0.068,OrganismalFitness,POLG_CXB3N,Medium,Virus
+0.5,0.578,0.289,0.286,0.532,0.533,-0.044,0.434,0.67,0.675,0.318,-0.006,0.0,-0.044,0.007,0.077,0.153,0.275,0.366,0.409,0.453,0.473,0.449,0.456,0.47,0.501,0.493,0.488,0.461,0.622,0.603,0.522,0.035,-0.064,0.115,0.471,0.432,0.383,0.542,0.469,0.397,0.563,-0.055,-0.059,0.391,-0.005,0.36,0.51,0.101,0.116,0.241,0.229,0.244,0.26,0.227,0.261,0.24,0.246,0.223,0.254,0.161,0.032,OrganismalFitness,POLG_DEN26,Low,Virus
+0.605,0.547,0.41,0.413,0.605,0.614,-0.039,0.196,0.565,0.577,0.178,0.637,0.635,0.101,0.128,0.114,0.116,0.09,0.078,0.26,0.4,0.443,0.452,0.492,0.422,0.475,0.32,0.41,0.517,0.63,0.587,0.485,0.182,0.474,0.505,0.522,0.515,0.547,0.578,0.48,0.528,0.56,0.114,0.053,0.506,0.109,-0.037,0.349,0.643,0.35,0.308,0.378,0.344,0.379,0.342,0.283,0.323,0.318,0.314,0.372,0.173,0.159,OrganismalFitness,POLG_HCVJF,Medium,Virus
+0.251,0.455,0.373,0.407,0.471,0.463,0.07,0.42,0.39,0.514,0.375,0.062,0.157,0.132,0.14,0.031,0.16,0.114,0.084,0.592,0.079,0.058,0.016,0.157,0.106,-0.059,0.008,-0.084,0.17,0.507,0.575,0.58,0.131,0.003,-0.033,0.053,0.382,0.383,0.384,0.453,0.463,0.452,0.006,-0.037,0.033,0.026,0.448,0.443,0.65,0.576,0.658,0.664,0.615,0.736,0.705,0.718,0.692,0.731,0.718,0.718,0.716,0.686,Stability,POLG_PESV,Medium,Virus
+0.353,0.539,0.629,0.655,0.612,0.632,0.187,0.339,0.557,0.571,0.623,0.696,0.695,0.028,0.126,0.456,0.594,0.736,0.768,0.717,0.675,0.692,0.25,0.288,0.592,0.722,0.692,0.719,0.299,0.712,0.583,0.509,0.384,0.709,0.683,0.587,0.673,0.7,0.637,0.698,0.706,0.68,0.266,0.03,0.585,0.45,0.594,0.607,0.631,0.302,0.58,0.588,0.593,0.575,0.599,0.583,0.602,0.598,0.585,0.603,0.657,0.499,Activity,PPARG_HUMAN,Medium,Human
+0.55,0.523,0.531,0.536,0.609,0.61,0.035,0.378,0.46,0.517,0.578,0.599,0.618,0.281,0.386,0.47,0.602,0.628,0.618,0.387,0.45,0.526,0.537,0.43,0.516,0.577,0.567,0.557,0.419,0.601,0.594,0.546,0.234,0.428,0.528,0.537,0.582,0.593,0.591,0.615,0.614,0.616,0.361,-0.05,0.575,0.515,0.503,0.579,0.563,0.236,0.574,0.575,0.57,0.586,0.593,0.58,0.584,0.585,0.59,0.595,0.633,0.509,OrganismalFitness,PPM1D_HUMAN,Low,Human
+0.562,0.655,0.745,0.736,0.772,0.774,0.518,0.493,0.774,0.783,0.727,0.592,0.681,0.54,0.535,0.821,0.805,0.74,0.727,0.715,0.552,0.694,0.7,0.725,0.634,0.716,0.733,0.723,0.744,0.795,0.777,0.756,0.637,0.593,0.556,0.611,0.735,0.746,0.738,0.764,0.784,0.775,0.438,0.281,0.326,0.431,0.639,0.423,0.794,0.783,0.792,0.809,0.804,0.812,0.828,0.823,0.819,0.816,0.816,0.821,0.801,0.835,Stability,PR40A_HUMAN,Medium,Human
+0.605,0.619,0.6,0.586,0.622,0.614,0.182,0.459,0.525,0.54,0.582,0.608,0.638,0.233,0.292,0.363,0.501,0.658,0.663,0.447,0.281,0.557,0.609,0.576,0.447,0.632,0.623,0.623,0.52,0.622,0.568,0.526,0.266,0.241,0.55,0.593,0.569,0.621,0.64,0.611,0.638,0.654,0.309,0.093,0.591,0.352,0.649,0.618,0.696,0.298,0.542,0.555,0.561,0.583,0.567,0.571,0.579,0.573,0.561,0.583,0.672,0.519,Expression,PRKN_HUMAN,Low,Human
+0.572,0.576,0.566,0.566,0.576,0.572,0.3,0.442,0.491,0.497,0.692,0.601,0.615,0.498,0.57,0.678,0.708,0.627,0.583,0.477,0.304,0.242,0.467,0.391,0.462,0.492,-0.202,0.471,0.529,0.553,0.558,0.505,0.098,0.328,0.409,0.43,0.546,0.543,0.498,0.586,0.585,0.541,0.422,0.237,0.635,0.482,0.586,0.626,0.699,0.643,0.7,0.684,0.687,0.685,0.686,0.692,0.681,0.687,0.689,0.699,0.682,0.674,Stability,PSAE_PICP2,Medium,Prokaryote
+0.426,0.448,0.44,0.453,0.466,0.474,0.165,0.352,0.465,0.491,0.462,0.439,0.477,0.186,0.34,0.504,0.492,0.286,0.29,0.496,0.263,0.436,0.368,0.335,0.34,0.314,0.3,0.347,0.261,0.502,0.438,0.432,0.092,0.324,0.416,0.321,0.441,0.466,0.415,0.476,0.495,0.489,0.306,0.009,0.476,0.416,0.452,0.444,0.481,0.216,0.467,0.444,0.475,0.468,0.476,0.478,0.476,0.475,0.478,0.485,0.519,0.482,Expression,PTEN_HUMAN,Medium,Human
+0.426,0.448,0.44,0.453,0.466,0.474,0.165,0.352,0.465,0.491,0.462,0.439,0.477,0.186,0.34,0.504,0.492,0.286,0.29,0.496,0.263,0.436,0.368,0.335,0.34,0.314,0.3,0.347,0.261,0.502,0.438,0.432,0.092,0.324,0.416,0.321,0.441,0.466,0.415,0.476,0.495,0.489,0.306,0.009,0.476,0.416,0.452,0.444,0.481,0.216,0.467,0.444,0.475,0.468,0.476,0.478,0.476,0.475,0.478,0.485,0.519,0.482,Activity,PTEN_HUMAN,Medium,Human
+0.493,0.379,0.352,0.393,0.495,0.502,0.003,0.437,0.51,0.515,0.47,0.509,0.537,-0.005,-0.003,-0.002,0.044,0.093,0.168,0.403,0.518,0.403,0.39,0.337,0.517,0.401,0.394,0.436,0.354,0.507,0.478,0.483,0.291,0.494,0.419,0.406,0.52,0.502,0.501,0.526,0.515,0.513,0.413,-0.016,0.502,0.456,0.396,0.463,0.263,0.145,0.226,0.266,0.307,0.301,0.25,0.257,0.26,0.245,0.223,0.28,0.232,0.116,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+0.44,0.413,0.437,0.446,0.454,0.452,0.01,0.33,0.428,0.44,0.324,0.384,0.416,0.14,0.146,0.426,0.442,0.418,0.452,0.438,0.332,0.368,0.358,0.332,0.384,0.406,0.38,0.408,0.416,0.418,0.462,0.423,0.14,0.328,0.354,0.294,0.416,0.436,0.414,0.454,0.464,0.455,0.355,0.148,0.424,0.406,0.382,0.416,0.398,0.2,0.412,0.424,0.444,0.442,0.443,0.438,0.436,0.436,0.438,0.448,0.422,0.544,Binding,Q53Z42_HUMAN,Medium,Human
+0.44,0.413,0.437,0.446,0.454,0.452,0.01,0.33,0.428,0.44,0.324,0.384,0.416,0.14,0.146,0.426,0.442,0.418,0.452,0.438,0.332,0.368,0.358,0.332,0.384,0.406,0.38,0.408,0.416,0.418,0.462,0.423,0.14,0.328,0.354,0.294,0.416,0.436,0.414,0.454,0.464,0.455,0.355,0.148,0.424,0.406,0.382,0.416,0.398,0.2,0.412,0.424,0.444,0.442,0.443,0.438,0.436,0.436,0.438,0.448,0.422,0.544,Expression,Q53Z42_HUMAN,Medium,Human
+0.475,0.593,0.634,0.643,0.654,0.662,0.363,0.535,0.672,0.679,0.588,0.519,0.543,0.144,0.448,0.516,0.572,0.56,0.57,0.649,0.598,0.651,0.652,0.663,0.622,0.662,0.677,0.673,0.68,0.684,0.633,0.575,0.304,0.606,0.652,0.634,0.616,0.653,0.657,0.666,0.672,0.675,0.49,0.008,0.587,0.524,0.408,0.533,0.533,0.155,0.56,0.537,0.533,0.556,0.557,0.542,0.562,0.545,0.553,0.563,0.634,0.54,Activity,Q59976_STRSQ,Medium,Prokaryote
+0.327,0.393,0.26,0.264,0.333,0.334,-0.031,0.115,0.413,0.407,0.242,0.036,0.022,0.014,0.046,0.011,0.01,0.01,-0.022,0.294,0.013,0.045,0.003,0.102,0.004,-0.002,0.03,-0.005,-0.013,0.426,0.344,0.354,0.043,0.031,0.022,0.014,0.248,0.246,0.245,0.33,0.327,0.324,0.014,-0.015,-0.004,0.019,0.161,0.156,0.299,0.169,0.228,0.248,0.237,0.287,0.267,0.272,0.261,0.263,0.264,0.261,0.072,0.031,Activity,Q6WV12_9MAXI,Low,Eukaryote
+0.451,0.465,0.489,0.507,0.503,0.517,0.454,0.404,0.479,0.495,0.563,0.545,0.566,0.42,0.475,0.492,0.515,0.545,0.523,-0.019,0.521,0.445,0.441,0.462,0.552,0.535,0.541,0.449,0.521,0.524,0.527,0.477,0.19,0.491,0.516,0.454,0.491,0.538,0.512,0.54,0.549,0.544,0.458,0.37,0.545,0.49,0.231,0.454,0.373,0.114,0.511,0.488,0.499,0.511,0.522,0.529,0.536,0.491,0.524,0.531,0.589,0.483,Activity,Q837P4_ENTFA,Medium,Prokaryote
+0.196,0.416,0.421,0.459,0.39,0.395,0.222,0.263,0.282,0.294,0.326,0.353,0.342,0.129,0.24,0.334,0.374,0.436,0.392,0.321,0.356,0.399,0.435,0.41,0.319,0.408,0.428,0.511,0.455,0.336,0.378,0.352,0.186,0.399,0.427,0.51,0.342,0.391,0.477,0.417,0.428,0.457,0.274,0.186,0.356,0.271,0.353,0.296,0.391,0.098,0.326,0.344,0.359,0.313,0.383,0.348,0.318,0.353,0.32,0.355,0.341,0.319,Activity,Q837P5_ENTFA,Medium,Prokaryote
+0.286,0.368,0.217,0.215,0.31,0.305,0.025,0.303,0.316,0.322,0.223,-0.009,-0.018,-0.027,-0.035,-0.023,-0.025,-0.007,0.034,0.23,0.013,0.037,0.057,0.013,-0.005,-0.001,0.037,0.252,0.267,0.373,0.297,0.307,-0.018,-0.026,0.009,0.303,0.231,0.237,0.318,0.304,0.309,0.345,-0.013,-0.008,-0.024,-0.027,0.097,0.143,0.292,0.189,0.242,0.234,0.246,0.26,0.24,0.254,0.249,0.243,0.238,0.247,0.137,-0.001,Activity,Q8WTC7_9CNID,Low,Eukaryote
+0.577,0.561,0.212,0.227,0.6,0.605,-0.049,0.292,-0.037,-0.037,0.103,-0.03,-0.04,-0.009,-0.026,0.079,0.105,0.498,0.577,0.266,0.214,0.259,0.274,0.289,0.242,0.236,0.203,0.21,0.224,0.558,0.507,0.408,-0.056,0.157,0.22,0.216,0.35,0.399,0.401,0.546,0.567,0.565,-0.031,-0.051,0.073,-0.038,0.453,0.415,0.496,0.232,0.251,0.228,0.265,0.298,0.275,0.274,0.273,0.268,0.237,0.277,0.241,0.137,OrganismalFitness,R1AB_SARS2,Medium,Virus
+0.28,0.183,0.286,0.299,0.324,0.356,0.508,0.373,0.654,0.624,0.507,0.403,0.45,0.458,0.687,0.704,0.493,0.467,0.542,0.412,0.541,0.575,0.536,0.454,0.552,0.419,0.546,0.444,0.417,0.598,0.422,0.365,0.258,0.477,0.601,0.439,0.497,0.588,0.467,0.436,0.486,0.41,0.573,0.222,0.147,0.37,0.46,0.233,0.656,0.548,0.544,0.578,0.577,0.577,0.606,0.581,0.571,0.575,0.573,0.585,0.373,0.582,Stability,RAD_ANTMA,High,Eukaryote
+0.405,0.425,0.382,0.389,0.408,0.408,0.045,0.339,0.445,0.439,0.471,0.423,0.482,0.034,0.242,0.442,0.473,0.433,0.399,0.311,0.285,0.378,0.378,0.339,0.377,0.395,0.378,0.355,0.359,0.439,0.473,0.421,0.12,0.287,0.339,0.363,0.372,0.38,0.396,0.403,0.43,0.441,0.221,0.039,0.48,0.387,0.271,0.383,0.264,0.234,0.456,0.43,0.437,0.483,0.451,0.475,0.439,0.453,0.46,0.472,0.432,0.16,OrganismalFitness,RAF1_HUMAN,Low,Human
+0.447,0.436,0.444,0.476,0.466,0.48,0.313,0.353,0.426,0.446,0.318,0.36,0.405,0.457,0.514,0.487,0.498,0.44,0.313,0.514,0.437,0.414,0.42,0.396,0.433,0.403,0.401,0.374,0.305,0.434,0.338,0.319,0.18,0.399,0.439,0.377,0.454,0.478,0.45,0.472,0.485,0.487,0.478,0.326,0.322,0.453,0.386,0.252,0.458,0.255,0.431,0.423,0.443,0.428,0.456,0.43,0.452,0.43,0.439,0.455,0.373,0.474,Activity,RASH_HUMAN,High,Human
+0.359,0.357,0.479,0.482,0.465,0.476,0.24,0.297,0.442,0.455,0.43,0.367,0.406,0.454,0.486,0.478,0.462,0.397,0.328,0.504,0.417,0.422,0.46,0.421,0.408,0.44,0.44,0.37,0.414,0.463,0.37,0.328,0.248,0.357,0.457,0.448,0.415,0.484,0.49,0.47,0.493,0.492,0.398,0.316,0.261,0.42,0.286,0.298,0.481,0.304,0.404,0.397,0.418,0.416,0.433,0.418,0.421,0.412,0.385,0.424,0.47,0.452,Expression,RASK_HUMAN,High,Human
+0.359,0.357,0.479,0.482,0.465,0.476,0.24,0.297,0.442,0.455,0.43,0.367,0.406,0.454,0.486,0.478,0.462,0.397,0.328,0.504,0.417,0.422,0.46,0.421,0.408,0.44,0.44,0.37,0.414,0.463,0.37,0.328,0.248,0.357,0.457,0.448,0.415,0.484,0.49,0.47,0.493,0.492,0.398,0.316,0.261,0.42,0.286,0.298,0.481,0.304,0.404,0.397,0.418,0.416,0.433,0.418,0.421,0.412,0.385,0.424,0.47,0.452,Binding,RASK_HUMAN,High,Human
+0.19,0.108,0.302,0.307,0.305,0.303,0.271,0.277,0.218,0.233,0.43,0.409,0.404,0.321,0.394,0.491,0.552,0.328,0.335,0.259,0.366,0.108,0.267,0.263,0.297,0.078,0.16,0.293,0.202,0.365,0.373,0.318,0.239,0.291,0.364,0.384,0.318,0.34,0.339,0.317,0.334,0.318,0.319,0.255,0.415,0.406,0.487,0.545,0.578,0.459,0.494,0.479,0.491,0.485,0.492,0.486,0.476,0.493,0.48,0.493,0.539,0.526,Stability,RBP1_HUMAN,High,Human
+0.304,0.284,0.362,0.37,0.367,0.355,0.303,0.255,0.412,0.421,0.486,0.342,0.393,0.368,0.392,0.498,0.505,0.482,0.478,0.391,0.267,0.243,0.33,0.395,0.32,0.481,0.454,0.412,0.465,0.398,0.478,0.406,0.046,0.293,0.308,0.324,0.36,0.361,0.352,0.403,0.402,0.386,0.304,0.285,0.371,0.327,0.442,0.432,0.56,0.532,0.541,0.555,0.559,0.559,0.545,0.555,0.551,0.549,0.556,0.56,0.531,0.501,Stability,RCD1_ARATH,Medium,Eukaryote
+0.293,0.518,0.584,0.577,0.557,0.588,0.167,0.263,0.552,0.556,0.558,0.38,0.492,0.226,0.422,0.451,0.596,0.573,0.639,0.573,0.129,0.17,0.225,0.175,-0.041,0.132,-0.003,0.087,0.577,0.59,0.569,0.537,0.096,0.178,0.127,0.567,0.44,0.45,0.58,0.568,0.598,0.621,0.028,0.139,0.471,0.187,0.635,0.646,0.782,0.69,0.592,0.58,0.56,0.585,0.574,0.583,0.568,0.569,0.6,0.584,0.646,0.541,Stability,RCRO_LAMBD,High,Virus
+0.33,0.333,0.466,0.466,0.479,0.479,0.196,0.434,0.512,0.517,0.457,0.55,0.592,0.254,0.675,0.533,0.513,0.464,0.39,0.462,0.422,0.422,0.395,0.453,0.545,0.513,0.51,0.477,0.494,0.51,0.414,0.399,0.429,0.288,0.531,0.475,0.484,0.568,0.54,0.482,0.537,0.536,0.512,0.074,0.556,0.548,0.502,0.548,0.582,0.505,0.47,0.454,0.467,0.449,0.478,0.475,0.475,0.474,0.478,0.481,0.482,0.631,Stability,RD23A_HUMAN,High,Human
+0.313,0.367,0.376,0.387,0.482,0.484,0.029,0.355,0.52,0.525,0.166,0.045,0.057,0.033,0.041,0.128,0.322,0.389,0.491,0.427,0.385,0.436,0.454,0.475,0.125,0.358,0.342,0.345,0.413,0.52,0.45,0.395,0.075,0.377,0.423,0.459,0.425,0.452,0.474,0.494,0.508,0.526,0.03,0.02,0.171,0.033,0.215,0.23,0.188,0.055,0.3,0.271,0.279,0.282,0.303,0.296,0.283,0.304,0.31,0.307,0.149,0.11,OrganismalFitness,RDRP_I33A0,Low,Virus
+0.206,0.159,0.221,0.227,0.216,0.216,0.038,0.316,0.222,0.232,0.128,0.245,0.267,0.046,0.046,0.173,0.24,0.281,0.274,0.17,0.216,0.259,0.238,0.24,0.29,0.294,0.16,0.253,0.255,0.282,0.35,0.353,0.06,0.231,0.273,0.24,0.245,0.269,0.236,0.246,0.261,0.235,0.053,0.06,0.208,0.071,0.258,0.234,0.304,0.224,0.193,0.221,0.276,0.246,0.267,0.255,0.281,0.241,0.31,0.279,0.274,0.229,OrganismalFitness,REV_HV1H2,Medium,Virus
+0.064,0.21,0.212,0.233,0.23,0.227,-0.015,0.238,0.232,0.258,0.248,0.175,0.2,-0.032,0.078,0.055,0.299,0.261,0.246,0.249,-0.067,0.059,0.121,0.115,-0.043,0.13,0.151,0.114,0.158,0.214,0.268,0.213,-0.009,0.009,0.102,0.133,0.107,0.144,0.156,0.22,0.216,0.208,0.115,-0.075,0.193,0.107,0.304,0.248,0.384,0.328,0.331,0.31,0.291,0.333,0.314,0.3,0.314,0.309,0.327,0.319,0.318,0.22,Stability,RFAH_ECOLI,High,Prokaryote
+0.341,0.588,0.607,0.601,0.603,0.61,0.224,0.577,0.405,0.352,0.675,0.67,0.657,0.235,0.421,0.487,0.699,0.68,0.676,0.613,0.151,0.493,0.47,0.475,0.473,0.481,0.534,0.545,0.622,0.636,0.563,0.544,0.078,0.481,0.522,0.506,0.54,0.579,0.574,0.588,0.624,0.625,0.041,-0.048,0.467,0.283,0.684,0.64,0.816,0.769,0.716,0.715,0.72,0.726,0.713,0.718,0.723,0.724,0.732,0.728,0.738,0.71,Stability,RL20_AQUAE,High,Prokaryote
+0.322,0.352,0.38,0.425,0.38,0.407,0.123,0.373,0.463,0.473,0.191,0.277,0.316,0.151,0.447,0.515,0.549,0.451,0.507,0.437,0.392,0.51,0.487,0.431,0.49,0.434,0.436,0.418,0.395,0.356,0.426,0.436,0.14,0.405,0.468,0.382,0.43,0.479,0.418,0.451,0.477,0.42,0.312,0.1,0.289,0.303,0.164,0.151,0.27,0.069,0.528,0.499,0.508,0.515,0.537,0.526,0.502,0.532,0.531,0.543,0.34,0.379,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.322,0.352,0.38,0.425,0.38,0.407,0.123,0.373,0.463,0.473,0.191,0.277,0.316,0.151,0.447,0.515,0.549,0.451,0.507,0.437,0.392,0.51,0.487,0.431,0.49,0.434,0.436,0.418,0.395,0.356,0.426,0.436,0.14,0.405,0.468,0.382,0.43,0.479,0.418,0.451,0.477,0.42,0.312,0.1,0.289,0.303,0.164,0.151,0.27,0.069,0.528,0.499,0.508,0.515,0.537,0.526,0.502,0.532,0.531,0.543,0.34,0.379,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.322,0.352,0.38,0.425,0.38,0.407,0.123,0.373,0.463,0.473,0.191,0.277,0.316,0.151,0.447,0.515,0.549,0.451,0.507,0.437,0.392,0.51,0.487,0.431,0.49,0.434,0.436,0.418,0.395,0.356,0.426,0.436,0.14,0.405,0.468,0.382,0.43,0.479,0.418,0.451,0.477,0.42,0.312,0.1,0.289,0.303,0.164,0.151,0.27,0.069,0.528,0.499,0.508,0.515,0.537,0.526,0.502,0.532,0.531,0.543,0.34,0.379,Activity,RL40A_YEAST,Medium,Eukaryote
+0.544,0.592,0.57,0.584,0.594,0.595,0.058,0.423,0.591,0.591,0.59,0.579,0.593,0.064,0.557,0.588,0.596,0.595,0.592,0.586,0.554,0.57,0.509,0.499,0.557,0.584,0.583,0.586,0.571,0.594,0.578,0.539,0.187,0.538,0.54,0.431,0.581,0.587,0.545,0.617,0.616,0.606,0.529,0.06,0.588,0.567,0.299,0.524,0.281,0.177,0.541,0.53,0.52,0.553,0.544,0.533,0.548,0.543,0.544,0.554,0.608,0.533,Activity,RNC_ECOLI,Medium,Prokaryote
+0.627,0.681,0.655,0.673,0.606,0.644,0.67,0.593,0.617,0.659,0.711,0.744,0.75,0.722,0.741,0.752,0.708,0.687,0.63,0.697,0.699,0.75,0.735,0.682,0.742,0.731,0.718,0.723,0.66,0.743,0.666,0.609,0.621,0.633,0.738,0.716,0.72,0.747,0.737,0.699,0.693,0.69,0.749,0.674,0.735,0.728,0.696,0.64,0.793,0.723,0.702,0.672,0.687,0.7,0.699,0.69,0.711,0.702,0.711,0.708,0.756,0.764,Stability,RPC1_BP434,High,Virus
+0.267,0.405,0.48,0.496,0.456,0.451,0.202,0.414,0.454,0.469,0.465,0.464,0.498,0.271,0.276,0.382,0.536,0.55,0.55,0.421,0.186,0.314,0.366,0.4,0.202,0.358,0.322,0.302,0.515,0.415,0.585,0.542,0.114,0.142,0.33,0.46,0.256,0.342,0.459,0.418,0.448,0.494,0.283,0.292,0.387,0.304,0.256,0.391,0.384,0.221,0.532,0.504,0.512,0.526,0.525,0.542,0.528,0.521,0.516,0.54,0.52,0.371,Activity,RPC1_LAMBD,High,Virus
+0.267,0.405,0.48,0.496,0.456,0.451,0.202,0.414,0.454,0.469,0.465,0.464,0.498,0.271,0.276,0.382,0.536,0.55,0.55,0.421,0.186,0.314,0.366,0.4,0.202,0.358,0.322,0.302,0.515,0.415,0.585,0.542,0.114,0.142,0.33,0.46,0.256,0.342,0.459,0.418,0.448,0.494,0.283,0.292,0.387,0.304,0.256,0.391,0.384,0.221,0.532,0.504,0.512,0.526,0.525,0.542,0.528,0.521,0.516,0.54,0.52,0.371,Activity,RPC1_LAMBD,High,Virus
+0.393,0.31,0.336,0.328,0.321,0.326,0.178,0.188,0.387,0.389,0.468,0.394,0.39,0.289,0.326,0.51,0.409,0.372,0.33,0.291,0.199,0.33,0.317,0.323,0.297,0.34,0.416,0.362,0.367,0.375,0.413,0.396,0.117,0.423,0.325,0.306,0.453,0.377,0.365,0.395,0.348,0.32,0.333,0.262,0.443,0.381,0.582,0.44,0.607,0.55,0.396,0.403,0.403,0.399,0.407,0.377,0.415,0.42,0.395,0.413,0.433,0.646,Stability,RS15_GEOSE,Medium,Prokaryote
+0.479,0.534,0.56,0.571,0.568,0.581,0.364,0.51,0.585,0.582,0.575,0.596,0.635,0.466,0.508,0.518,0.6,0.573,0.546,0.034,0.49,0.587,0.588,0.576,0.526,0.597,0.613,0.583,0.545,0.58,0.583,0.476,0.314,0.513,0.586,0.579,0.546,0.608,0.611,0.602,0.618,0.617,0.481,0.256,0.572,0.528,0.424,0.537,0.518,0.172,0.562,0.546,0.544,0.566,0.584,0.57,0.568,0.579,0.572,0.584,0.586,0.54,Expression,S22A1_HUMAN,Medium,Human
+0.479,0.534,0.56,0.571,0.568,0.581,0.364,0.51,0.585,0.582,0.575,0.596,0.635,0.466,0.508,0.518,0.6,0.573,0.546,0.034,0.49,0.587,0.588,0.576,0.526,0.597,0.613,0.583,0.545,0.58,0.583,0.476,0.314,0.513,0.586,0.579,0.546,0.608,0.611,0.602,0.618,0.617,0.481,0.256,0.572,0.528,0.424,0.537,0.518,0.172,0.562,0.546,0.544,0.566,0.584,0.57,0.568,0.579,0.572,0.584,0.586,0.54,Activity,S22A1_HUMAN,Medium,Human
+0.137,0.201,0.394,0.373,0.407,0.399,0.426,0.503,0.41,0.43,0.481,0.472,0.508,0.238,0.496,0.496,0.512,0.52,0.406,0.43,0.491,0.458,0.449,0.479,0.524,0.519,0.476,0.491,0.482,0.518,0.469,0.419,0.353,0.424,0.497,0.499,0.431,0.498,0.489,0.411,0.443,0.444,0.503,-0.042,0.545,0.531,0.336,0.332,0.443,0.502,0.454,0.348,0.234,0.367,0.362,0.395,0.392,0.445,0.426,0.387,0.575,0.607,Stability,SAV1_MOUSE,High,Eukaryote
+0.232,0.225,0.38,0.345,0.44,0.403,0.205,0.178,0.481,0.533,0.323,0.268,0.297,0.23,0.266,0.376,0.573,0.658,0.318,0.359,0.228,0.263,0.259,0.235,0.21,0.228,0.221,0.245,0.339,0.549,0.582,0.487,0.169,0.195,0.2,0.225,0.297,0.305,0.314,0.355,0.368,0.382,0.262,0.228,0.316,0.305,0.664,0.627,0.687,0.616,0.529,0.526,0.561,0.55,0.545,0.547,0.569,0.565,0.558,0.561,0.683,0.571,Stability,SBI_STAAM,Medium,Prokaryote
+0.387,0.456,0.423,0.433,0.504,0.522,0.352,0.507,0.562,0.574,0.545,0.531,0.542,0.159,0.327,0.502,0.543,0.548,0.524,0.555,0.498,0.511,0.5,0.496,0.512,0.52,0.518,0.525,0.511,0.541,0.463,0.36,0.342,0.513,0.512,0.491,0.53,0.542,0.53,0.549,0.555,0.545,0.489,0.112,0.549,0.531,0.467,0.527,0.501,0.182,0.516,0.526,0.535,0.543,0.545,0.536,0.536,0.538,0.526,0.546,0.578,0.528,Activity,SC6A4_HUMAN,Medium,Human
+0.083,0.14,0.261,0.264,0.3,0.272,0.154,0.065,0.271,0.258,0.299,0.298,0.269,0.208,0.305,0.279,0.346,0.406,0.414,0.25,0.106,0.162,0.134,0.144,0.155,0.194,0.152,0.2,0.215,0.229,0.296,0.208,-0.012,0.072,0.104,0.187,0.132,0.132,0.156,0.245,0.235,0.255,0.231,0.177,0.3,0.269,0.525,0.527,0.517,0.477,0.438,0.442,0.437,0.48,0.422,0.478,0.43,0.43,0.391,0.454,0.62,0.513,Stability,SCIN_STAAR,High,Prokaryote
+0.13,0.131,0.162,0.162,0.153,0.158,0.13,-0.014,0.163,0.18,0.141,0.217,0.135,0.216,0.177,0.156,0.183,0.152,0.125,0.126,0.106,0.143,0.151,0.127,0.124,0.106,0.093,0.107,0.167,0.144,0.186,0.181,0.095,0.075,0.068,0.069,0.095,0.086,0.086,0.14,0.14,0.152,0.086,0.137,0.165,0.154,0.03,0.092,0.053,0.014,0.127,0.16,0.157,0.127,0.105,0.135,0.143,0.136,0.169,0.146,0.232,0.168,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+0.631,0.646,0.615,0.619,0.647,0.65,0.001,0.303,0.456,0.472,0.606,0.418,0.466,0.249,0.274,0.608,0.594,0.598,0.596,0.53,0.026,0.067,0.003,0.238,0.081,0.123,0.045,0.119,0.214,0.64,0.604,0.581,-0.165,0.01,0.26,0.107,0.607,0.613,0.368,0.652,0.65,0.486,0.037,-0.079,0.336,0.011,0.434,0.41,0.608,0.597,0.65,0.641,0.588,0.647,0.63,0.613,0.622,0.623,0.653,0.633,0.639,0.636,Stability,SDA_BACSU,Medium,Prokaryote
+0.375,0.483,0.544,0.544,0.52,0.53,0.013,0.456,0.57,0.576,0.518,0.541,0.547,0.183,0.407,0.529,0.558,0.577,0.543,0.535,0.492,0.511,0.513,0.484,0.5,0.53,0.518,0.534,0.504,0.506,0.516,0.469,0.192,0.513,0.528,0.519,0.528,0.54,0.544,0.548,0.551,0.552,0.319,0.031,0.54,0.46,0.382,0.461,0.352,0.189,0.539,0.524,0.54,0.551,0.558,0.548,0.553,0.55,0.543,0.561,0.562,0.478,OrganismalFitness,SERC_HUMAN,High,Human
+0.208,0.328,0.378,0.383,0.37,0.379,0.226,0.36,0.424,0.422,0.395,0.384,0.41,0.242,0.249,0.267,0.426,0.388,0.29,0.388,0.262,0.363,0.385,0.355,0.267,0.394,0.39,0.4,0.358,0.407,0.416,0.366,0.153,0.254,0.308,0.376,0.264,0.302,0.367,0.367,0.367,0.396,0.262,0.236,0.421,0.291,0.303,0.358,0.286,0.095,0.373,0.367,0.387,0.39,0.396,0.398,0.381,0.392,0.384,0.398,0.322,0.286,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+0.225,0.241,0.235,0.242,0.282,0.273,0.369,0.17,0.21,0.246,0.254,0.335,0.336,0.298,0.396,0.333,0.322,0.262,0.261,0.223,0.166,0.167,0.162,0.19,0.264,0.247,0.202,0.253,0.189,0.231,0.087,0.06,0.222,0.133,0.267,0.229,0.217,0.241,0.234,0.275,0.274,0.262,0.271,0.15,0.208,0.356,0.526,0.346,0.53,0.464,0.333,0.38,0.348,0.333,0.326,0.385,0.333,0.334,0.334,0.356,0.333,0.466,Stability,SOX30_HUMAN,High,Human
+0.432,0.509,0.518,0.501,0.552,0.549,-0.093,0.495,0.409,0.408,0.393,-0.041,0.001,-0.032,-0.033,-0.064,0.005,-0.01,0.02,0.514,-0.154,0.032,0.205,0.105,-0.081,-0.041,0.057,0.338,0.428,0.564,0.432,0.442,-0.034,-0.011,0.001,0.066,0.437,0.43,0.427,0.548,0.547,0.536,-0.137,-0.087,-0.071,-0.009,0.512,0.453,0.705,0.56,0.52,0.511,0.493,0.534,0.498,0.515,0.51,0.501,0.504,0.513,0.546,0.414,Stability,SPA_STAAU,Medium,Prokaryote
+0.094,0.154,0.022,0.027,0.142,0.157,-0.003,-0.016,0.066,0.171,0.213,0.161,0.126,0.155,0.164,0.191,0.214,0.208,0.268,0.181,0.12,0.092,0.082,0.092,0.086,0.119,0.102,0.104,0.161,0.192,0.312,0.31,0.014,0.112,0.045,0.128,0.141,0.108,0.155,0.136,0.108,0.158,-0.052,-0.106,0.099,-0.011,0.226,0.164,0.302,0.118,0.262,0.226,0.248,0.295,0.285,0.246,0.306,0.292,0.294,0.283,0.279,0.244,Binding,SPG1_STRSG,Low,Prokaryote
+0.094,0.154,0.022,0.027,0.142,0.157,-0.003,-0.016,0.066,0.171,0.213,0.161,0.126,0.155,0.164,0.191,0.214,0.208,0.268,0.181,0.12,0.092,0.082,0.092,0.086,0.119,0.102,0.104,0.161,0.192,0.312,0.31,0.014,0.112,0.045,0.128,0.141,0.108,0.155,0.136,0.108,0.158,-0.052,-0.106,0.099,-0.011,0.226,0.164,0.302,0.118,0.262,0.226,0.248,0.295,0.285,0.246,0.306,0.292,0.294,0.283,0.279,0.244,Binding,SPG1_STRSG,Low,Prokaryote
+0.094,0.154,0.022,0.027,0.142,0.157,-0.003,-0.016,0.066,0.171,0.213,0.161,0.126,0.155,0.164,0.191,0.214,0.208,0.268,0.181,0.12,0.092,0.082,0.092,0.086,0.119,0.102,0.104,0.161,0.192,0.312,0.31,0.014,0.112,0.045,0.128,0.141,0.108,0.155,0.136,0.108,0.158,-0.052,-0.106,0.099,-0.011,0.226,0.164,0.302,0.118,0.262,0.226,0.248,0.295,0.285,0.246,0.306,0.292,0.294,0.283,0.279,0.244,Binding,SPG1_STRSG,Medium,Prokaryote
+0.094,0.154,0.022,0.027,0.142,0.157,-0.003,-0.016,0.066,0.171,0.213,0.161,0.126,0.155,0.164,0.191,0.214,0.208,0.268,0.181,0.12,0.092,0.082,0.092,0.086,0.119,0.102,0.104,0.161,0.192,0.312,0.31,0.014,0.112,0.045,0.128,0.141,0.108,0.155,0.136,0.108,0.158,-0.052,-0.106,0.099,-0.011,0.226,0.164,0.302,0.118,0.262,0.226,0.248,0.295,0.285,0.246,0.306,0.292,0.294,0.283,0.279,0.244,Binding,SPG1_STRSG,Medium,Prokaryote
+0.439,0.517,0.496,0.539,0.569,0.565,0.325,0.309,0.543,0.478,0.501,0.437,0.409,0.366,0.432,0.451,0.5,0.504,0.528,0.507,0.388,0.433,0.436,0.392,0.343,0.465,0.356,0.485,0.501,0.614,0.561,0.536,-0.192,0.356,0.44,0.462,0.294,0.335,0.382,0.561,0.562,0.569,0.253,0.089,0.286,0.24,0.447,0.369,0.704,0.57,0.564,0.566,0.584,0.592,0.569,0.595,0.588,0.61,0.584,0.589,0.598,0.549,Stability,SPG2_STRSG,Medium,Prokaryote
+0.179,0.26,0.114,0.215,0.347,0.351,-0.031,0.36,0.384,0.403,0.024,-0.044,-0.018,-0.038,-0.017,-0.006,-0.003,0.005,0.055,0.346,0.311,0.375,0.366,0.376,0.382,0.357,0.318,0.388,0.324,0.303,0.318,0.352,0.174,0.341,0.337,0.369,0.348,0.343,0.342,0.403,0.4,0.408,-0.015,0.004,0.001,0.016,0.539,0.508,0.435,0.174,0.272,0.298,0.278,0.36,0.315,0.357,0.321,0.336,0.296,0.342,0.428,0.213,Binding,SPIKE_SARS2,Medium,Virus
+0.179,0.26,0.114,0.215,0.347,0.351,-0.031,0.36,0.384,0.403,0.024,-0.044,-0.018,-0.038,-0.017,-0.006,-0.003,0.005,0.055,0.346,0.311,0.375,0.366,0.376,0.382,0.357,0.318,0.388,0.324,0.303,0.318,0.352,0.174,0.341,0.337,0.369,0.348,0.343,0.342,0.403,0.4,0.408,-0.015,0.004,0.001,0.016,0.539,0.508,0.435,0.174,0.272,0.298,0.278,0.36,0.315,0.357,0.321,0.336,0.296,0.342,0.428,0.213,Expression,SPIKE_SARS2,Medium,Virus
+0.572,0.613,0.647,0.599,0.632,0.615,0.23,0.524,0.552,0.593,0.637,0.649,0.641,-0.133,0.619,0.627,0.637,0.726,0.657,0.624,0.578,0.643,0.604,0.589,0.589,0.613,0.652,0.563,0.601,0.621,0.631,0.632,0.457,0.508,0.57,0.548,0.629,0.641,0.621,0.623,0.628,0.618,0.488,-0.221,0.465,0.498,0.421,0.401,0.62,0.623,0.693,0.679,0.683,0.685,0.696,0.685,0.676,0.674,0.696,0.691,0.667,0.526,Stability,SPTN1_CHICK,High,Eukaryote
+0.399,0.522,0.597,0.61,0.604,0.604,0.086,0.449,0.565,0.592,0.514,0.373,0.503,0.186,0.442,0.529,0.618,0.64,0.527,0.607,0.218,0.474,0.49,0.505,0.345,0.544,0.569,0.506,0.538,0.623,0.566,0.526,0.155,0.194,0.444,0.518,0.502,0.54,0.589,0.614,0.604,0.625,0.209,0.119,0.582,0.583,0.651,0.626,0.629,0.619,0.626,0.582,0.635,0.62,0.634,0.628,0.634,0.61,0.603,0.639,0.682,0.514,Stability,SQSTM_MOUSE,Medium,Eukaryote
+0.596,0.658,0.684,0.681,0.693,0.684,-0.19,0.508,0.638,0.61,0.702,0.639,0.654,-0.267,0.662,0.702,0.675,0.698,0.681,0.634,-0.109,0.377,0.382,0.504,0.493,0.575,0.607,0.608,0.61,0.665,0.674,0.635,0.359,0.039,0.189,0.368,0.667,0.669,0.673,0.685,0.689,0.68,0.575,-0.296,0.703,0.618,0.214,0.637,0.731,0.716,0.677,0.673,0.67,0.692,0.696,0.68,0.668,0.669,0.685,0.684,0.788,0.744,Stability,SR43C_ARATH,High,Eukaryote
+0.36,0.511,0.696,0.694,0.723,0.718,0.549,0.362,0.743,0.742,0.628,0.664,0.7,0.498,0.713,0.731,0.734,0.673,0.694,0.653,0.679,0.649,0.653,0.62,0.642,0.637,0.659,0.63,0.607,0.694,0.696,0.658,0.577,0.597,0.637,0.645,0.66,0.697,0.705,0.701,0.727,0.728,0.388,0.084,0.538,0.532,0.579,0.538,0.64,0.661,0.751,0.731,0.734,0.736,0.746,0.741,0.735,0.735,0.74,0.75,0.707,0.689,Stability,SRBS1_HUMAN,High,Human
+0.463,0.454,0.429,0.442,0.455,0.456,0.476,0.415,0.443,0.466,0.424,0.51,0.532,0.382,0.418,0.423,0.497,0.461,0.43,0.426,0.395,0.369,0.375,0.329,0.396,0.417,0.391,0.377,0.285,0.484,0.523,0.518,0.382,0.366,0.363,0.303,0.444,0.442,0.431,0.468,0.466,0.461,0.506,0.339,0.446,0.49,0.256,0.256,0.348,0.069,0.463,0.436,0.438,0.462,0.469,0.464,0.461,0.471,0.471,0.475,0.456,0.417,Activity,SRC_HUMAN,Medium,Human
+0.463,0.454,0.429,0.442,0.455,0.456,0.476,0.415,0.443,0.466,0.424,0.51,0.532,0.382,0.418,0.423,0.497,0.461,0.43,0.426,0.395,0.369,0.375,0.329,0.396,0.417,0.391,0.377,0.285,0.484,0.523,0.518,0.382,0.366,0.363,0.303,0.444,0.442,0.431,0.468,0.466,0.461,0.506,0.339,0.446,0.49,0.256,0.256,0.348,0.069,0.463,0.436,0.438,0.462,0.469,0.464,0.461,0.471,0.471,0.475,0.456,0.417,Activity,SRC_HUMAN,Medium,Human
+0.463,0.454,0.429,0.442,0.455,0.456,0.476,0.415,0.443,0.466,0.424,0.51,0.532,0.382,0.418,0.423,0.497,0.461,0.43,0.426,0.395,0.369,0.375,0.329,0.396,0.417,0.391,0.377,0.285,0.484,0.523,0.518,0.382,0.366,0.363,0.303,0.444,0.442,0.431,0.468,0.466,0.461,0.506,0.339,0.446,0.49,0.256,0.256,0.348,0.069,0.463,0.436,0.438,0.462,0.469,0.464,0.461,0.471,0.471,0.475,0.456,0.417,OrganismalFitness,SRC_HUMAN,Medium,Human
+0.369,0.373,0.419,0.438,0.48,0.478,0.13,0.425,0.449,0.386,0.433,0.467,0.51,0.25,0.496,0.533,0.509,0.38,0.34,0.517,0.217,0.414,0.443,0.43,0.468,0.386,0.46,0.431,0.333,0.438,0.46,0.445,0.212,0.245,0.473,0.329,0.405,0.511,0.41,0.452,0.522,0.463,0.49,0.086,0.395,0.527,0.497,0.428,0.509,0.412,0.492,0.453,0.5,0.5,0.493,0.517,0.479,0.498,0.491,0.511,0.479,0.524,OrganismalFitness,SUMO1_HUMAN,High,Human
+0.103,0.111,0.119,0.132,0.131,0.139,0.131,0.212,0.171,0.162,0.234,0.242,0.233,0.105,0.142,0.15,0.137,0.206,0.205,0.22,0.138,0.203,0.155,0.141,0.086,0.151,0.167,0.105,0.136,0.227,0.205,0.189,0.005,0.137,0.197,0.181,0.124,0.176,0.159,0.134,0.166,0.156,0.115,0.068,0.16,0.155,-0.052,0.046,-0.06,-0.074,0.069,0.056,0.053,0.088,0.129,0.098,0.127,0.102,0.144,0.098,-0.011,-0.045,OrganismalFitness,SYUA_HUMAN,Medium,Human
+0.097,0.061,0.1,0.096,0.08,0.08,0.291,-0.018,0.077,0.072,0.013,0.051,0.048,0.121,0.047,0.021,-0.12,-0.028,0.053,-0.075,0.158,0.063,-0.011,-0.006,0.206,0.082,0.006,0.023,-0.014,0.036,0.088,0.112,-0.051,0.226,0.273,0.121,0.142,0.185,0.121,0.105,0.129,0.109,0.121,0.212,-0.002,0.078,0.271,0.033,0.223,0.059,0.105,0.013,0.004,0.04,0.002,0.034,0.045,-0.011,-0.079,0.021,0.054,0.121,OrganismalFitness,TADBP_HUMAN,Low,Human
+0.293,0.201,0.255,0.268,0.319,0.3,-0.09,0.397,0.274,0.288,0.185,0.343,0.342,-0.034,-0.008,-0.005,0.017,-0.057,0.044,0.203,0.379,0.364,0.396,0.398,0.394,0.269,0.136,0.246,0.223,0.368,0.405,0.387,0.283,0.385,0.211,0.203,0.379,0.263,0.237,0.366,0.295,0.268,0.005,-0.054,0.292,0.181,0.23,0.284,0.344,0.207,0.114,0.134,0.142,0.137,0.116,0.133,0.12,0.117,0.117,0.134,0.157,0.116,OrganismalFitness,TAT_HV1BR,High,Virus
+0.576,0.615,0.644,0.655,0.66,0.662,0.696,0.292,0.677,0.708,0.591,0.739,0.75,0.724,0.707,0.784,0.769,0.707,0.745,0.577,0.49,0.552,0.57,0.609,0.604,0.613,0.66,0.648,0.597,0.68,0.64,0.595,0.642,0.485,0.619,0.669,0.688,0.715,0.71,0.68,0.712,0.695,0.753,0.184,0.629,0.752,0.68,0.596,0.774,0.759,0.721,0.697,0.657,0.757,0.733,0.763,0.723,0.738,0.71,0.735,0.79,0.774,Stability,TCRG1_MOUSE,Medium,Eukaryote
+0.309,0.397,0.482,0.473,0.483,0.481,0.072,0.061,0.523,0.52,0.511,0.502,0.532,0.186,0.557,0.548,0.513,0.532,0.546,0.515,-0.047,0.332,0.349,0.305,0.351,0.383,0.279,0.423,0.534,0.543,0.526,0.512,0.48,0.003,0.403,0.337,0.446,0.505,0.495,0.472,0.51,0.511,0.301,0.114,0.463,0.389,0.504,0.484,0.561,0.597,0.555,0.546,0.566,0.573,0.584,0.548,0.561,0.551,0.545,0.561,0.502,0.483,Stability,THO1_YEAST,High,Eukaryote
+0.464,0.473,0.506,0.495,0.507,0.502,-0.099,0.392,0.573,0.576,0.472,0.538,0.526,0.044,0.547,0.589,0.525,0.529,0.533,0.466,0.262,0.361,0.429,0.394,0.395,0.455,0.385,0.395,0.436,0.528,0.484,0.451,0.25,0.061,0.312,0.333,0.535,0.523,0.517,0.541,0.518,0.518,0.343,-0.03,0.572,0.492,0.526,0.649,0.675,0.624,0.51,0.53,0.516,0.512,0.517,0.511,0.521,0.512,0.513,0.524,0.597,0.604,Stability,TNKS2_HUMAN,High,Human
+0.217,0.236,0.219,0.228,0.23,0.229,0.067,0.253,0.286,0.27,0.285,0.27,0.318,0.108,0.199,0.253,0.339,0.328,0.371,0.249,0.088,0.142,0.228,0.272,0.115,0.267,0.273,0.253,0.306,0.255,0.253,0.188,0.117,0.115,0.214,0.301,0.232,0.252,0.305,0.243,0.252,0.277,0.12,0.089,0.309,0.17,0.206,0.267,0.245,0.116,0.253,0.277,0.275,0.278,0.293,0.291,0.281,0.291,0.292,0.298,0.268,0.203,OrganismalFitness,TPK1_HUMAN,Medium,Human
+0.372,0.456,0.489,0.509,0.499,0.513,0.241,0.445,0.497,0.5,0.546,0.517,0.547,0.308,0.433,0.531,0.539,0.476,0.448,0.511,0.333,0.42,0.481,0.496,0.447,0.506,0.481,0.466,0.424,0.547,0.542,0.489,0.334,0.33,0.471,0.411,0.478,0.513,0.478,0.517,0.537,0.515,0.371,0.204,0.539,0.495,0.476,0.534,0.54,0.238,0.522,0.519,0.53,0.514,0.541,0.535,0.541,0.529,0.534,0.543,0.568,0.469,Expression,TPMT_HUMAN,Medium,Human
+0.376,0.365,0.283,0.263,0.288,0.296,0.37,0.455,0.454,0.456,0.407,0.368,0.37,0.279,0.378,0.294,0.29,0.422,0.362,0.362,0.25,0.338,0.313,0.377,0.367,0.2,0.471,0.451,0.34,0.45,0.436,0.274,0.395,0.372,0.321,0.419,0.429,0.408,0.451,0.408,0.382,0.437,0.295,0.303,0.353,0.348,0.126,0.357,0.309,0.035,0.344,0.308,0.344,0.354,0.322,0.345,0.396,0.257,0.293,0.341,0.389,0.372,OrganismalFitness,TPOR_HUMAN,Low,Human
+0.575,0.606,0.558,0.574,0.575,0.585,0.173,0.499,0.655,0.671,0.621,0.611,0.643,0.325,0.539,0.618,0.653,0.632,0.601,0.592,0.421,0.564,0.508,0.575,0.498,0.499,0.558,0.521,0.524,0.529,0.59,0.52,0.221,0.52,0.555,0.558,0.575,0.598,0.591,0.59,0.612,0.601,0.434,0.072,0.612,0.498,0.321,0.512,0.507,0.122,0.593,0.594,0.582,0.646,0.614,0.615,0.621,0.63,0.614,0.636,0.626,0.578,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+0.405,0.45,0.394,0.411,0.416,0.417,0.395,0.434,0.471,0.47,0.439,0.478,0.506,0.327,0.47,0.503,0.464,0.47,0.499,0.43,0.336,0.399,0.395,0.455,0.437,0.421,0.369,0.387,0.478,0.39,0.427,0.365,0.14,0.329,0.46,0.448,0.39,0.442,0.438,0.418,0.445,0.442,0.397,0.14,0.461,0.395,0.253,0.386,0.393,0.075,0.459,0.41,0.431,0.462,0.463,0.409,0.439,0.441,0.418,0.454,0.504,0.491,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+0.371,0.496,0.521,0.531,0.508,0.522,-0.042,0.403,0.512,0.519,0.42,0.477,0.509,0.001,0.004,0.403,0.473,0.518,0.538,0.557,0.22,0.405,0.445,0.419,0.42,0.479,0.471,0.449,0.442,0.484,0.454,0.432,0.071,0.233,0.403,0.428,0.327,0.452,0.475,0.443,0.517,0.54,0.277,-0.008,0.427,0.492,0.322,0.35,0.446,0.243,0.44,0.417,0.408,0.433,0.425,0.427,0.433,0.44,0.442,0.439,0.37,0.426,OrganismalFitness,UBC9_HUMAN,Medium,Human
+0.325,0.443,0.464,0.467,0.466,0.466,0.051,0.346,0.379,0.415,0.391,0.453,0.449,0.347,0.42,0.49,0.46,0.467,0.477,0.421,0.142,0.353,0.357,0.364,0.345,0.374,0.343,0.322,0.355,0.486,0.475,0.453,0.441,0.079,0.11,0.255,0.383,0.385,0.329,0.472,0.477,0.424,0.359,-0.095,0.348,0.344,0.488,0.409,0.52,0.351,0.411,0.436,0.424,0.447,0.448,0.428,0.407,0.429,0.42,0.432,0.5,0.407,Stability,UBE4B_HUMAN,High,Human
+0.412,0.417,0.454,0.468,0.463,0.47,0.078,0.089,0.392,0.394,0.361,0.447,0.471,0.399,0.463,0.459,0.459,0.351,0.349,0.383,0.122,0.324,0.337,0.3,0.444,0.376,0.395,0.376,0.289,0.433,0.437,0.396,0.091,0.026,0.096,0.262,0.419,0.351,0.39,0.467,0.454,0.472,0.409,0.041,0.42,0.458,0.307,0.393,-0.017,0.118,0.426,0.409,0.456,0.458,0.438,0.458,0.432,0.449,0.443,0.454,0.44,0.412,Activity,UBE4B_MOUSE,Low,Eukaryote
+0.442,0.55,0.54,0.538,0.566,0.577,0.152,0.251,0.498,0.528,0.528,0.425,0.511,0.226,0.226,0.158,0.197,0.591,0.601,0.476,0.422,0.414,0.467,0.514,0.473,0.462,0.495,0.488,0.492,0.557,0.497,0.388,0.288,0.304,0.455,0.401,0.484,0.519,0.502,0.592,0.594,0.583,0.133,0.121,0.422,0.209,0.557,0.439,0.637,0.574,0.56,0.588,0.597,0.609,0.599,0.612,0.589,0.605,0.585,0.6,0.661,0.597,Stability,UBR5_HUMAN,Medium,Human
+0.228,0.314,0.228,0.246,0.266,0.255,0.256,0.064,0.371,0.392,0.623,0.544,0.562,0.36,0.514,0.601,0.655,0.611,0.653,0.167,0.395,0.474,0.458,0.531,0.419,0.425,0.41,0.467,0.516,0.487,0.51,0.386,0.038,0.307,0.473,0.464,0.343,0.415,0.464,0.336,0.403,0.409,0.469,0.295,0.552,0.483,0.458,0.561,0.543,0.505,0.652,0.626,0.661,0.661,0.664,0.691,0.634,0.658,0.684,0.674,0.619,0.6,Stability,VG08_BPP22,High,Virus
+0.284,0.422,0.594,0.63,0.634,0.637,0.162,0.54,0.553,0.59,0.673,0.63,0.673,0.371,0.316,0.652,0.662,0.589,0.478,0.592,0.293,0.431,0.601,0.524,0.424,0.575,0.494,0.482,0.579,0.599,0.701,0.694,0.322,0.346,0.515,0.521,0.487,0.564,0.563,0.634,0.659,0.642,0.272,0.283,0.426,0.421,0.542,0.433,0.775,0.702,0.672,0.658,0.666,0.658,0.653,0.665,0.649,0.656,0.652,0.662,0.685,0.798,Stability,VILI_CHICK,High,Eukaryote
+0.369,0.392,0.4,0.416,0.425,0.432,0.122,0.391,0.462,0.481,0.428,0.437,0.461,0.134,0.37,0.434,0.462,0.465,0.458,0.44,0.168,0.186,0.334,0.37,0.25,0.428,0.424,0.398,0.414,0.451,0.407,0.338,0.134,0.139,0.25,0.438,0.4,0.416,0.475,0.442,0.456,0.487,0.16,0.1,0.44,0.305,0.336,0.388,0.428,0.19,0.462,0.424,0.45,0.466,0.468,0.451,0.458,0.451,0.46,0.467,0.46,0.4,Expression,VKOR1_HUMAN,Medium,Human
+0.369,0.392,0.4,0.416,0.425,0.432,0.122,0.391,0.462,0.481,0.428,0.437,0.461,0.134,0.37,0.434,0.462,0.465,0.458,0.44,0.168,0.186,0.334,0.37,0.25,0.428,0.424,0.398,0.414,0.451,0.407,0.338,0.134,0.139,0.25,0.438,0.4,0.416,0.475,0.442,0.456,0.487,0.16,0.1,0.44,0.305,0.336,0.388,0.428,0.19,0.462,0.424,0.45,0.466,0.468,0.451,0.458,0.451,0.46,0.467,0.46,0.4,Activity,VKOR1_HUMAN,Medium,Human
+-0.119,0.043,0.164,0.194,0.078,0.119,0.082,0.147,0.366,0.356,0.498,0.368,0.374,0.252,0.41,0.507,0.587,0.612,0.468,0.128,-0.02,0.135,0.133,0.222,0.215,0.202,0.131,0.199,0.291,0.084,0.499,0.429,0.242,0.065,0.059,0.122,-0.039,-0.028,-0.057,0.064,0.083,0.006,0.254,0.056,0.501,0.357,0.589,0.557,0.593,0.606,0.612,0.592,0.633,0.639,0.622,0.641,0.615,0.63,0.604,0.64,0.662,0.582,Stability,VRPI_BPT7,Medium,Virus
+0.291,0.602,0.572,0.596,0.599,0.591,-0.221,0.533,0.559,0.598,0.593,0.27,0.481,-0.107,0.173,0.572,0.625,0.684,0.684,0.591,-0.091,-0.182,-0.13,0.264,-0.125,-0.166,-0.231,-0.023,0.52,0.605,0.681,0.643,-0.06,-0.234,-0.064,0.502,0.349,0.359,0.528,0.572,0.569,0.604,-0.046,-0.261,0.208,-0.013,0.449,0.425,0.651,0.544,0.587,0.627,0.585,0.619,0.612,0.627,0.625,0.657,0.677,0.632,0.616,0.498,Stability,YAIA_ECOLI,Medium,Prokaryote
+0.428,0.321,0.464,0.46,0.449,0.455,0.329,0.289,0.018,0.036,0.333,0.28,0.285,0.411,0.457,0.451,0.466,0.382,0.313,0.189,0.18,0.176,0.158,0.167,0.299,0.226,0.257,0.207,0.15,0.332,0.425,0.451,0.139,0.302,0.196,0.217,0.402,0.326,0.359,0.435,0.405,0.439,0.485,-0.104,0.335,0.486,0.34,0.362,0.383,0.197,0.446,0.379,0.432,0.426,0.417,0.412,0.415,0.441,0.444,0.438,0.386,0.422,Binding,YAP1_HUMAN,Low,Human
+0.596,0.61,0.595,0.618,0.604,0.609,0.492,0.394,0.591,0.583,0.673,0.649,0.667,0.584,0.621,0.65,0.658,0.631,0.63,0.525,0.488,0.534,0.612,0.604,0.594,0.611,0.587,0.59,0.643,0.594,0.586,0.547,0.568,0.573,0.567,0.544,0.665,0.634,0.621,0.669,0.624,0.63,0.446,0.353,0.536,0.448,0.483,0.562,0.68,0.65,0.701,0.649,0.665,0.687,0.7,0.691,0.687,0.676,0.679,0.69,0.638,0.76,Stability,YNZC_BACSU,Medium,Prokaryote
+0.364,0.416,0.424,0.434,0.451,0.457,0.172,0.35,0.435,0.45,0.418,0.395,0.426,0.202,0.316,0.397,0.437,0.43,0.424,0.388,0.306,0.357,0.371,0.379,0.34,0.388,0.385,0.39,0.403,0.471,0.461,0.422,0.194,0.298,0.35,0.381,0.425,0.435,0.446,0.464,0.468,0.471,0.266,0.088,0.387,0.323,0.384,0.411,0.432,0.29,0.454,0.445,0.453,0.463,0.463,0.463,0.46,0.461,0.46,0.471,0.47,0.409,,,,
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv b/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
new file mode 100644
index 0000000..5d07eb1
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
@@ -0,0 +1,63 @@
+Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,SaProt (650M),Hybrid - Structure & PLM,0.456,0.0,0.458,0.378,0.488,0.363,0.592,0.395,0.45,0.543,0.478,0.529,0.51,0.32,0.458,0.311,0.266,0.25,0.318,SaProt (650M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+2,TranceptEVE L,Hybrid - Alignment & PLM,0.456,0.008,0.487,0.376,0.457,0.458,0.5,0.451,0.467,0.49,0.473,0.513,0.453,0.461,0.447,0.282,0.363,0.322,0.38,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+3,GEMME,Alignment-based model,0.454,0.011,0.482,0.383,0.438,0.45,0.519,0.455,0.47,0.496,0.469,0.516,0.465,0.472,0.448,0.279,0.335,0.334,0.422,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+4,TranceptEVE M,Hybrid - Alignment & PLM,0.454,0.009,0.479,0.386,0.452,0.452,0.502,0.44,0.468,0.486,0.474,0.513,0.448,0.446,0.442,0.283,0.316,0.306,0.368,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+5,TranceptEVE S,Hybrid - Alignment & PLM,0.452,0.009,0.475,0.396,0.443,0.447,0.497,0.449,0.46,0.483,0.47,0.504,0.451,0.438,0.437,0.278,0.318,0.306,0.367,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+6,ProtSSN (ensemble),Hybrid - Structure & PLM,0.449,0.006,0.466,0.366,0.449,0.396,0.568,0.401,0.458,0.521,0.47,0.528,0.491,0.37,0.459,0.293,0.297,0.237,0.293,ProtSSN (ensemble of 9 models),"Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+7,ProtSSN (k=20 h=1280),Hybrid - Structure & PLM,0.442,0.007,0.458,0.366,0.435,0.385,0.566,0.395,0.448,0.519,0.464,0.525,0.482,0.363,0.45,0.283,0.298,0.235,0.294,"ProtSSN (k=20, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+8,ProtSSN (k=20 h=512),Hybrid - Structure & PLM,0.44,0.006,0.457,0.353,0.437,0.39,0.563,0.394,0.453,0.512,0.458,0.524,0.487,0.373,0.45,0.293,0.315,0.254,0.303,"ProtSSN (k=20, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+9,EVE (ensemble),Alignment-based model,0.439,0.01,0.464,0.386,0.408,0.446,0.491,0.425,0.453,0.48,0.454,0.495,0.455,0.434,0.429,0.277,0.326,0.304,0.352,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+10,ProtSSN (k=20 h=768),Hybrid - Structure & PLM,0.439,0.006,0.459,0.347,0.439,0.386,0.563,0.387,0.45,0.516,0.462,0.525,0.483,0.357,0.448,0.296,0.286,0.228,0.294,"ProtSSN (k=20, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+11,ProtSSN (k=30 h=512),Hybrid - Structure & PLM,0.438,0.006,0.455,0.35,0.443,0.384,0.557,0.393,0.447,0.508,0.461,0.519,0.478,0.356,0.445,0.277,0.279,0.233,0.29,"ProtSSN (k=30, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+12,ProtSSN (k=30 h=768),Hybrid - Structure & PLM,0.437,0.006,0.457,0.351,0.435,0.383,0.561,0.386,0.449,0.512,0.458,0.525,0.482,0.356,0.447,0.285,0.282,0.226,0.291,"ProtSSN (k=30, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+13,ProtSSN (k=30 h=1280),Hybrid - Structure & PLM,0.437,0.007,0.454,0.357,0.431,0.383,0.561,0.388,0.445,0.515,0.46,0.522,0.481,0.35,0.444,0.3,0.287,0.221,0.291,"ProtSSN (k=30, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+14,VESPA,Protein language model,0.435,0.007,0.468,0.366,0.404,0.438,0.5,0.427,0.455,0.483,0.44,0.503,0.472,0.443,0.435,0.183,0.359,0.298,0.323,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+15,Tranception L,Hybrid - Alignment & PLM,0.434,0.008,0.465,0.349,0.45,0.433,0.471,0.432,0.438,0.471,0.455,0.497,0.412,0.438,0.423,0.256,0.354,0.31,0.381,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,ProtSSN (k=10 h=1280),Hybrid - Structure & PLM,0.433,0.007,0.446,0.352,0.436,0.38,0.548,0.391,0.44,0.502,0.455,0.512,0.469,0.358,0.441,0.246,0.258,0.209,0.277,"ProtSSN (k=10, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+17,EVE (single),Alignment-based model,0.432,0.009,0.458,0.372,0.404,0.44,0.487,0.417,0.448,0.476,0.446,0.491,0.452,0.43,0.423,0.275,0.321,0.3,0.355,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+18,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.432,0.009,0.473,0.329,0.446,0.418,0.492,0.403,0.45,0.482,0.439,0.516,0.445,0.421,0.428,0.225,0.371,0.366,0.412,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+19,ProtSSN (k=10 h=512),Hybrid - Structure & PLM,0.429,0.007,0.451,0.344,0.42,0.376,0.557,0.391,0.436,0.509,0.454,0.521,0.477,0.338,0.437,0.283,0.272,0.214,0.289,"ProtSSN (k=10, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+20,Tranception M,Hybrid - Alignment & PLM,0.427,0.009,0.448,0.361,0.441,0.419,0.465,0.417,0.432,0.454,0.452,0.489,0.39,0.414,0.411,0.248,0.241,0.257,0.339,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+21,ProtSSN (k=10 h=768),Hybrid - Structure & PLM,0.422,0.007,0.441,0.337,0.417,0.369,0.547,0.372,0.434,0.496,0.446,0.503,0.466,0.346,0.431,0.271,0.276,0.208,0.272,"ProtSSN (k=10, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+22,ESM-IF1,Inverse folding model,0.422,0.011,0.368,0.389,0.407,0.32,0.624,0.3,0.431,0.544,0.417,0.502,0.492,0.389,0.439,0.346,0.285,0.27,0.353,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+23,DeepSequence (ensemble),Alignment-based model,0.419,0.012,0.455,0.363,0.39,0.411,0.476,0.383,0.428,0.471,0.443,0.486,0.438,0.358,0.405,0.268,0.332,0.316,0.376,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+24,Tranception S,Hybrid - Alignment & PLM,0.418,0.009,0.436,0.372,0.42,0.409,0.452,0.428,0.415,0.443,0.44,0.476,0.385,0.401,0.398,0.242,0.255,0.267,0.342,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+25,MSA Transformer (single),Hybrid - Alignment & PLM,0.416,0.009,0.455,0.312,0.429,0.411,0.475,0.388,0.436,0.465,0.43,0.501,0.425,0.406,0.412,0.211,0.372,0.358,0.403,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+26,ESM2 (650M),Protein language model,0.414,0.011,0.425,0.337,0.415,0.368,0.523,0.335,0.406,0.516,0.457,0.486,0.457,0.261,0.422,0.25,0.209,0.16,0.213,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+27,DeepSequence (single),Alignment-based model,0.407,0.012,0.447,0.349,0.371,0.395,0.473,0.382,0.415,0.463,0.437,0.479,0.428,0.338,0.392,0.256,0.295,0.302,0.368,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+28,ESM-1v (ensemble),Protein language model,0.406,0.012,0.42,0.32,0.429,0.386,0.477,0.326,0.405,0.5,0.458,0.464,0.413,0.294,0.403,0.218,0.178,0.151,0.215,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+29,SaProt (35M),Hybrid - Structure & PLM,0.406,0.008,0.372,0.357,0.439,0.285,0.575,0.323,0.392,0.506,0.443,0.492,0.425,0.242,0.409,0.354,0.178,0.172,0.231,SaProt (35M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+30,ESM2 (3B),Protein language model,0.405,0.01,0.417,0.321,0.403,0.378,0.509,0.348,0.415,0.49,0.442,0.477,0.457,0.294,0.414,0.216,0.194,0.165,0.212,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+31,ESM2 (15B),Protein language model,0.4,0.01,0.405,0.317,0.405,0.387,0.488,0.357,0.414,0.472,0.432,0.467,0.436,0.333,0.405,0.205,0.226,0.169,0.228,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+32,MIF-ST,Hybrid - Structure & PLM,0.4,0.009,0.39,0.321,0.438,0.364,0.485,0.376,0.403,0.455,0.399,0.413,0.455,0.405,0.431,0.263,0.333,0.282,0.307,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+33,EVmutation,Alignment-based model,0.395,0.008,0.44,0.317,0.378,0.409,0.43,0.403,0.423,0.408,0.41,0.448,0.413,0.393,0.376,0.277,0.344,0.31,0.395,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+34,ESM-1b,Protein language model,0.394,0.009,0.428,0.287,0.406,0.349,0.5,0.35,0.398,0.482,0.435,0.491,0.438,0.26,0.384,0.229,0.184,0.148,0.268,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+35,VESPAl,Protein language model,0.393,0.009,0.429,0.347,0.326,0.402,0.461,0.382,0.412,0.447,0.394,0.472,0.433,0.403,0.386,0.143,0.332,0.272,0.31,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+36,Progen2 XL,Protein language model,0.391,0.009,0.402,0.302,0.418,0.387,0.445,0.354,0.405,0.445,0.386,0.458,0.418,0.402,0.385,0.187,0.283,0.235,0.289,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+37,ESM2 (150M),Protein language model,0.386,0.012,0.391,0.326,0.402,0.3,0.51,0.306,0.358,0.495,0.45,0.475,0.392,0.157,0.386,0.242,0.122,0.141,0.2,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+38,MIF,Inverse folding model,0.382,0.01,0.327,0.336,0.43,0.297,0.522,0.349,0.375,0.446,0.398,0.387,0.413,0.365,0.411,0.268,0.261,0.229,0.251,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+39,Progen2 L,Protein language model,0.38,0.009,0.406,0.293,0.427,0.379,0.396,0.348,0.381,0.42,0.411,0.432,0.363,0.323,0.372,0.146,0.23,0.205,0.265,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+40,Progen2 M,Protein language model,0.379,0.009,0.393,0.295,0.433,0.381,0.396,0.318,0.382,0.426,0.413,0.417,0.353,0.335,0.373,0.132,0.144,0.133,0.174,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+41,Progen2 Base,Protein language model,0.377,0.01,0.396,0.294,0.437,0.378,0.383,0.342,0.368,0.423,0.422,0.422,0.328,0.316,0.37,0.128,0.136,0.144,0.196,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+42,Tranception L no retrieval,Protein language model,0.374,0.01,0.401,0.288,0.413,0.386,0.381,0.358,0.371,0.416,0.391,0.394,0.353,0.404,0.363,0.174,0.305,0.247,0.325,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+43,ESM-1v (single),Protein language model,0.374,0.011,0.396,0.268,0.405,0.361,0.437,0.287,0.371,0.476,0.427,0.433,0.385,0.258,0.376,0.192,0.184,0.146,0.207,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+44,Wavenet,Alignment-based model,0.373,0.013,0.379,0.325,0.35,0.363,0.449,0.299,0.389,0.451,0.393,0.425,0.405,0.34,0.358,0.207,0.241,0.209,0.29,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+45,RITA XL,Protein language model,0.372,0.011,0.366,0.302,0.414,0.381,0.398,0.315,0.382,0.413,0.396,0.399,0.342,0.394,0.358,0.141,0.144,0.157,0.23,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+46,CARP (640M),Protein language model,0.368,0.01,0.395,0.273,0.397,0.365,0.412,0.314,0.374,0.428,0.417,0.395,0.38,0.284,0.391,0.213,0.173,0.151,0.171,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+47,RITA L,Protein language model,0.365,0.01,0.359,0.29,0.42,0.374,0.383,0.316,0.369,0.403,0.396,0.399,0.309,0.384,0.348,0.136,0.13,0.138,0.204,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+48,Site-Independent,Alignment-based model,0.359,0.014,0.369,0.344,0.343,0.379,0.358,0.426,0.373,0.316,0.38,0.389,0.312,0.375,0.337,0.238,0.232,0.261,0.338,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+49,RITA M,Protein language model,0.349,0.01,0.352,0.273,0.405,0.369,0.348,0.304,0.349,0.388,0.379,0.367,0.295,0.375,0.337,0.112,0.124,0.143,0.2,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+50,Tranception M no retrieval,Protein language model,0.348,0.01,0.349,0.284,0.406,0.359,0.342,0.293,0.349,0.376,0.38,0.357,0.295,0.34,0.332,0.14,0.094,0.12,0.182,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+51,Unirep evotuned,Hybrid - Alignment & PLM,0.347,0.012,0.355,0.305,0.365,0.344,0.366,0.33,0.344,0.371,0.357,0.378,0.33,0.348,0.319,0.156,0.247,0.228,0.295,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+52,Progen2 S,Protein language model,0.335,0.011,0.333,0.275,0.384,0.334,0.349,0.283,0.321,0.389,0.385,0.358,0.286,0.271,0.327,0.111,0.106,0.111,0.162,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+53,CARP (76M),Protein language model,0.327,0.012,0.342,0.282,0.369,0.268,0.376,0.247,0.301,0.406,0.388,0.366,0.296,0.157,0.336,0.204,0.096,0.109,0.117,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+54,ESM2 (35M),Protein language model,0.32,0.013,0.314,0.291,0.343,0.212,0.439,0.239,0.271,0.448,0.371,0.408,0.308,0.12,0.305,0.24,0.113,0.131,0.198,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+55,RITA S,Protein language model,0.304,0.012,0.294,0.275,0.336,0.325,0.289,0.276,0.297,0.332,0.331,0.299,0.232,0.344,0.287,0.111,0.09,0.112,0.173,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+56,Tranception S no retrieval,Protein language model,0.302,0.012,0.288,0.286,0.349,0.317,0.27,0.258,0.295,0.317,0.318,0.283,0.257,0.304,0.279,0.113,0.096,0.117,0.167,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+57,CARP (38M),Protein language model,0.279,0.014,0.285,0.268,0.312,0.215,0.314,0.196,0.239,0.356,0.322,0.307,0.248,0.127,0.279,0.168,0.083,0.111,0.133,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+58,ProteinMPNN,Inverse folding model,0.257,0.011,0.197,0.163,0.198,0.161,0.565,0.173,0.279,0.433,0.284,0.394,0.343,0.262,0.292,0.259,0.167,0.17,0.273,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+59,ESM2 (8M),Protein language model,0.225,0.014,0.201,0.26,0.266,0.136,0.262,0.194,0.179,0.261,0.241,0.248,0.194,0.09,0.203,0.135,0.087,0.115,0.176,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+60,Unirep,Protein language model,0.189,0.016,0.182,0.202,0.216,0.135,0.21,0.181,0.161,0.205,0.215,0.233,0.146,0.066,0.175,0.07,0.098,0.123,0.171,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+61,ProtGPT2,Protein language model,0.188,0.012,0.176,0.149,0.193,0.164,0.257,0.177,0.173,0.253,0.244,0.242,0.132,0.138,0.181,0.139,0.041,0.036,0.064,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+62,CARP (600K),Protein language model,0.106,0.016,0.112,0.084,0.171,0.056,0.105,0.095,0.087,0.098,0.122,0.086,0.072,0.055,0.108,0.025,0.026,0.068,0.1,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.html b/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.html
new file mode 100644
index 0000000..0b62c5e
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.html
@@ -0,0 +1,1670 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_Spearman |
+ Bootstrap_standard_error_Spearman |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Depth_1 |
+ Depth_2 |
+ Depth_3 |
+ Depth_4 |
+ Depth_5+ |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ SaProt (650M) |
+ Hybrid - Structure & PLM |
+ 0.456 |
+ 0.000 |
+ 0.458 |
+ 0.378 |
+ 0.488 |
+ 0.363 |
+ 0.592 |
+ 0.395 |
+ 0.450 |
+ 0.543 |
+ 0.478 |
+ 0.529 |
+ 0.510 |
+ 0.320 |
+ 0.458 |
+ 0.311 |
+ 0.266 |
+ 0.250 |
+ 0.318 |
+ SaProt (650M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 2 |
+ TranceptEVE L |
+ Hybrid - Alignment & PLM |
+ 0.456 |
+ 0.008 |
+ 0.487 |
+ 0.376 |
+ 0.457 |
+ 0.458 |
+ 0.500 |
+ 0.451 |
+ 0.467 |
+ 0.490 |
+ 0.473 |
+ 0.513 |
+ 0.453 |
+ 0.461 |
+ 0.447 |
+ 0.282 |
+ 0.363 |
+ 0.322 |
+ 0.380 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 3 |
+ GEMME |
+ Alignment-based model |
+ 0.454 |
+ 0.011 |
+ 0.482 |
+ 0.383 |
+ 0.438 |
+ 0.450 |
+ 0.519 |
+ 0.455 |
+ 0.470 |
+ 0.496 |
+ 0.469 |
+ 0.516 |
+ 0.465 |
+ 0.472 |
+ 0.448 |
+ 0.279 |
+ 0.335 |
+ 0.334 |
+ 0.422 |
+ GEMME model |
+ <a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a> |
+
+
+ 4 |
+ TranceptEVE M |
+ Hybrid - Alignment & PLM |
+ 0.454 |
+ 0.009 |
+ 0.479 |
+ 0.386 |
+ 0.452 |
+ 0.452 |
+ 0.502 |
+ 0.440 |
+ 0.468 |
+ 0.486 |
+ 0.474 |
+ 0.513 |
+ 0.448 |
+ 0.446 |
+ 0.442 |
+ 0.283 |
+ 0.316 |
+ 0.306 |
+ 0.368 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 5 |
+ TranceptEVE S |
+ Hybrid - Alignment & PLM |
+ 0.452 |
+ 0.009 |
+ 0.475 |
+ 0.396 |
+ 0.443 |
+ 0.447 |
+ 0.497 |
+ 0.449 |
+ 0.460 |
+ 0.483 |
+ 0.470 |
+ 0.504 |
+ 0.451 |
+ 0.438 |
+ 0.437 |
+ 0.278 |
+ 0.318 |
+ 0.306 |
+ 0.367 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 6 |
+ ProtSSN (ensemble) |
+ Hybrid - Structure & PLM |
+ 0.449 |
+ 0.006 |
+ 0.466 |
+ 0.366 |
+ 0.449 |
+ 0.396 |
+ 0.568 |
+ 0.401 |
+ 0.458 |
+ 0.521 |
+ 0.470 |
+ 0.528 |
+ 0.491 |
+ 0.370 |
+ 0.459 |
+ 0.293 |
+ 0.297 |
+ 0.237 |
+ 0.293 |
+ ProtSSN (ensemble of 9 models) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 7 |
+ ProtSSN (k=20 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.442 |
+ 0.007 |
+ 0.458 |
+ 0.366 |
+ 0.435 |
+ 0.385 |
+ 0.566 |
+ 0.395 |
+ 0.448 |
+ 0.519 |
+ 0.464 |
+ 0.525 |
+ 0.482 |
+ 0.363 |
+ 0.450 |
+ 0.283 |
+ 0.298 |
+ 0.235 |
+ 0.294 |
+ ProtSSN (k=20, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 8 |
+ ProtSSN (k=20 h=512) |
+ Hybrid - Structure & PLM |
+ 0.440 |
+ 0.006 |
+ 0.457 |
+ 0.353 |
+ 0.437 |
+ 0.390 |
+ 0.563 |
+ 0.394 |
+ 0.453 |
+ 0.512 |
+ 0.458 |
+ 0.524 |
+ 0.487 |
+ 0.373 |
+ 0.450 |
+ 0.293 |
+ 0.315 |
+ 0.254 |
+ 0.303 |
+ ProtSSN (k=20, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 9 |
+ EVE (ensemble) |
+ Alignment-based model |
+ 0.439 |
+ 0.010 |
+ 0.464 |
+ 0.386 |
+ 0.408 |
+ 0.446 |
+ 0.491 |
+ 0.425 |
+ 0.453 |
+ 0.480 |
+ 0.454 |
+ 0.495 |
+ 0.455 |
+ 0.434 |
+ 0.429 |
+ 0.277 |
+ 0.326 |
+ 0.304 |
+ 0.352 |
+ EVE model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 10 |
+ ProtSSN (k=20 h=768) |
+ Hybrid - Structure & PLM |
+ 0.439 |
+ 0.006 |
+ 0.459 |
+ 0.347 |
+ 0.439 |
+ 0.386 |
+ 0.563 |
+ 0.387 |
+ 0.450 |
+ 0.516 |
+ 0.462 |
+ 0.525 |
+ 0.483 |
+ 0.357 |
+ 0.448 |
+ 0.296 |
+ 0.286 |
+ 0.228 |
+ 0.294 |
+ ProtSSN (k=20, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 11 |
+ ProtSSN (k=30 h=512) |
+ Hybrid - Structure & PLM |
+ 0.438 |
+ 0.006 |
+ 0.455 |
+ 0.350 |
+ 0.443 |
+ 0.384 |
+ 0.557 |
+ 0.393 |
+ 0.447 |
+ 0.508 |
+ 0.461 |
+ 0.519 |
+ 0.478 |
+ 0.356 |
+ 0.445 |
+ 0.277 |
+ 0.279 |
+ 0.233 |
+ 0.290 |
+ ProtSSN (k=30, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 12 |
+ ProtSSN (k=30 h=768) |
+ Hybrid - Structure & PLM |
+ 0.437 |
+ 0.006 |
+ 0.457 |
+ 0.351 |
+ 0.435 |
+ 0.383 |
+ 0.561 |
+ 0.386 |
+ 0.449 |
+ 0.512 |
+ 0.458 |
+ 0.525 |
+ 0.482 |
+ 0.356 |
+ 0.447 |
+ 0.285 |
+ 0.282 |
+ 0.226 |
+ 0.291 |
+ ProtSSN (k=30, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 13 |
+ ProtSSN (k=30 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.437 |
+ 0.007 |
+ 0.454 |
+ 0.357 |
+ 0.431 |
+ 0.383 |
+ 0.561 |
+ 0.388 |
+ 0.445 |
+ 0.515 |
+ 0.460 |
+ 0.522 |
+ 0.481 |
+ 0.350 |
+ 0.444 |
+ 0.300 |
+ 0.287 |
+ 0.221 |
+ 0.291 |
+ ProtSSN (k=30, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 14 |
+ VESPA |
+ Protein language model |
+ 0.435 |
+ 0.007 |
+ 0.468 |
+ 0.366 |
+ 0.404 |
+ 0.438 |
+ 0.500 |
+ 0.427 |
+ 0.455 |
+ 0.483 |
+ 0.440 |
+ 0.503 |
+ 0.472 |
+ 0.443 |
+ 0.435 |
+ 0.183 |
+ 0.359 |
+ 0.298 |
+ 0.323 |
+ VESPA model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 15 |
+ Tranception L |
+ Hybrid - Alignment & PLM |
+ 0.434 |
+ 0.008 |
+ 0.465 |
+ 0.349 |
+ 0.450 |
+ 0.433 |
+ 0.471 |
+ 0.432 |
+ 0.438 |
+ 0.471 |
+ 0.455 |
+ 0.497 |
+ 0.412 |
+ 0.438 |
+ 0.423 |
+ 0.256 |
+ 0.354 |
+ 0.310 |
+ 0.381 |
+ Tranception Large model (700M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 16 |
+ ProtSSN (k=10 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.433 |
+ 0.007 |
+ 0.446 |
+ 0.352 |
+ 0.436 |
+ 0.380 |
+ 0.548 |
+ 0.391 |
+ 0.440 |
+ 0.502 |
+ 0.455 |
+ 0.512 |
+ 0.469 |
+ 0.358 |
+ 0.441 |
+ 0.246 |
+ 0.258 |
+ 0.209 |
+ 0.277 |
+ ProtSSN (k=10, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 17 |
+ EVE (single) |
+ Alignment-based model |
+ 0.432 |
+ 0.009 |
+ 0.458 |
+ 0.372 |
+ 0.404 |
+ 0.440 |
+ 0.487 |
+ 0.417 |
+ 0.448 |
+ 0.476 |
+ 0.446 |
+ 0.491 |
+ 0.452 |
+ 0.430 |
+ 0.423 |
+ 0.275 |
+ 0.321 |
+ 0.300 |
+ 0.355 |
+ EVE model (single seed) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 18 |
+ MSA Transformer (ensemble) |
+ Hybrid - Alignment & PLM |
+ 0.432 |
+ 0.009 |
+ 0.473 |
+ 0.329 |
+ 0.446 |
+ 0.418 |
+ 0.492 |
+ 0.403 |
+ 0.450 |
+ 0.482 |
+ 0.439 |
+ 0.516 |
+ 0.445 |
+ 0.421 |
+ 0.428 |
+ 0.225 |
+ 0.371 |
+ 0.366 |
+ 0.412 |
+ MSA Transformer (ensemble of 5 MSA samples) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 19 |
+ ProtSSN (k=10 h=512) |
+ Hybrid - Structure & PLM |
+ 0.429 |
+ 0.007 |
+ 0.451 |
+ 0.344 |
+ 0.420 |
+ 0.376 |
+ 0.557 |
+ 0.391 |
+ 0.436 |
+ 0.509 |
+ 0.454 |
+ 0.521 |
+ 0.477 |
+ 0.338 |
+ 0.437 |
+ 0.283 |
+ 0.272 |
+ 0.214 |
+ 0.289 |
+ ProtSSN (k=10, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 20 |
+ Tranception M |
+ Hybrid - Alignment & PLM |
+ 0.427 |
+ 0.009 |
+ 0.448 |
+ 0.361 |
+ 0.441 |
+ 0.419 |
+ 0.465 |
+ 0.417 |
+ 0.432 |
+ 0.454 |
+ 0.452 |
+ 0.489 |
+ 0.390 |
+ 0.414 |
+ 0.411 |
+ 0.248 |
+ 0.241 |
+ 0.257 |
+ 0.339 |
+ Tranception Medium model (300M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 21 |
+ ProtSSN (k=10 h=768) |
+ Hybrid - Structure & PLM |
+ 0.422 |
+ 0.007 |
+ 0.441 |
+ 0.337 |
+ 0.417 |
+ 0.369 |
+ 0.547 |
+ 0.372 |
+ 0.434 |
+ 0.496 |
+ 0.446 |
+ 0.503 |
+ 0.466 |
+ 0.346 |
+ 0.431 |
+ 0.271 |
+ 0.276 |
+ 0.208 |
+ 0.272 |
+ ProtSSN (k=10, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 22 |
+ ESM-IF1 |
+ Inverse folding model |
+ 0.422 |
+ 0.011 |
+ 0.368 |
+ 0.389 |
+ 0.407 |
+ 0.320 |
+ 0.624 |
+ 0.300 |
+ 0.431 |
+ 0.544 |
+ 0.417 |
+ 0.502 |
+ 0.492 |
+ 0.389 |
+ 0.439 |
+ 0.346 |
+ 0.285 |
+ 0.270 |
+ 0.353 |
+ ESM-IF1 model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a> |
+
+
+ 23 |
+ DeepSequence (ensemble) |
+ Alignment-based model |
+ 0.419 |
+ 0.012 |
+ 0.455 |
+ 0.363 |
+ 0.390 |
+ 0.411 |
+ 0.476 |
+ 0.383 |
+ 0.428 |
+ 0.471 |
+ 0.443 |
+ 0.486 |
+ 0.438 |
+ 0.358 |
+ 0.405 |
+ 0.268 |
+ 0.332 |
+ 0.316 |
+ 0.376 |
+ DeepSequence model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 24 |
+ Tranception S |
+ Hybrid - Alignment & PLM |
+ 0.418 |
+ 0.009 |
+ 0.436 |
+ 0.372 |
+ 0.420 |
+ 0.409 |
+ 0.452 |
+ 0.428 |
+ 0.415 |
+ 0.443 |
+ 0.440 |
+ 0.476 |
+ 0.385 |
+ 0.401 |
+ 0.398 |
+ 0.242 |
+ 0.255 |
+ 0.267 |
+ 0.342 |
+ Tranception Small model (85M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 25 |
+ MSA Transformer (single) |
+ Hybrid - Alignment & PLM |
+ 0.416 |
+ 0.009 |
+ 0.455 |
+ 0.312 |
+ 0.429 |
+ 0.411 |
+ 0.475 |
+ 0.388 |
+ 0.436 |
+ 0.465 |
+ 0.430 |
+ 0.501 |
+ 0.425 |
+ 0.406 |
+ 0.412 |
+ 0.211 |
+ 0.372 |
+ 0.358 |
+ 0.403 |
+ MSA Transformer (single MSA sample) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 26 |
+ ESM2 (650M) |
+ Protein language model |
+ 0.414 |
+ 0.011 |
+ 0.425 |
+ 0.337 |
+ 0.415 |
+ 0.368 |
+ 0.523 |
+ 0.335 |
+ 0.406 |
+ 0.516 |
+ 0.457 |
+ 0.486 |
+ 0.457 |
+ 0.261 |
+ 0.422 |
+ 0.250 |
+ 0.209 |
+ 0.160 |
+ 0.213 |
+ ESM2 model (650M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 27 |
+ DeepSequence (single) |
+ Alignment-based model |
+ 0.407 |
+ 0.012 |
+ 0.447 |
+ 0.349 |
+ 0.371 |
+ 0.395 |
+ 0.473 |
+ 0.382 |
+ 0.415 |
+ 0.463 |
+ 0.437 |
+ 0.479 |
+ 0.428 |
+ 0.338 |
+ 0.392 |
+ 0.256 |
+ 0.295 |
+ 0.302 |
+ 0.368 |
+ DeepSequence model (single seed) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 28 |
+ ESM-1v (ensemble) |
+ Protein language model |
+ 0.406 |
+ 0.012 |
+ 0.420 |
+ 0.320 |
+ 0.429 |
+ 0.386 |
+ 0.477 |
+ 0.326 |
+ 0.405 |
+ 0.500 |
+ 0.458 |
+ 0.464 |
+ 0.413 |
+ 0.294 |
+ 0.403 |
+ 0.218 |
+ 0.178 |
+ 0.151 |
+ 0.215 |
+ ESM-1v (ensemble of 5 independently-trained models) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 29 |
+ SaProt (35M) |
+ Hybrid - Structure & PLM |
+ 0.406 |
+ 0.008 |
+ 0.372 |
+ 0.357 |
+ 0.439 |
+ 0.285 |
+ 0.575 |
+ 0.323 |
+ 0.392 |
+ 0.506 |
+ 0.443 |
+ 0.492 |
+ 0.425 |
+ 0.242 |
+ 0.409 |
+ 0.354 |
+ 0.178 |
+ 0.172 |
+ 0.231 |
+ SaProt (35M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 30 |
+ ESM2 (3B) |
+ Protein language model |
+ 0.405 |
+ 0.010 |
+ 0.417 |
+ 0.321 |
+ 0.403 |
+ 0.378 |
+ 0.509 |
+ 0.348 |
+ 0.415 |
+ 0.490 |
+ 0.442 |
+ 0.477 |
+ 0.457 |
+ 0.294 |
+ 0.414 |
+ 0.216 |
+ 0.194 |
+ 0.165 |
+ 0.212 |
+ ESM2 model (3B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 31 |
+ ESM2 (15B) |
+ Protein language model |
+ 0.400 |
+ 0.010 |
+ 0.405 |
+ 0.317 |
+ 0.405 |
+ 0.387 |
+ 0.488 |
+ 0.357 |
+ 0.414 |
+ 0.472 |
+ 0.432 |
+ 0.467 |
+ 0.436 |
+ 0.333 |
+ 0.405 |
+ 0.205 |
+ 0.226 |
+ 0.169 |
+ 0.228 |
+ ESM2 model (15B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 32 |
+ MIF-ST |
+ Hybrid - Structure & PLM |
+ 0.400 |
+ 0.009 |
+ 0.390 |
+ 0.321 |
+ 0.438 |
+ 0.364 |
+ 0.485 |
+ 0.376 |
+ 0.403 |
+ 0.455 |
+ 0.399 |
+ 0.413 |
+ 0.455 |
+ 0.405 |
+ 0.431 |
+ 0.263 |
+ 0.333 |
+ 0.282 |
+ 0.307 |
+ MIF-ST model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 33 |
+ EVmutation |
+ Alignment-based model |
+ 0.395 |
+ 0.008 |
+ 0.440 |
+ 0.317 |
+ 0.378 |
+ 0.409 |
+ 0.430 |
+ 0.403 |
+ 0.423 |
+ 0.408 |
+ 0.410 |
+ 0.448 |
+ 0.413 |
+ 0.393 |
+ 0.376 |
+ 0.277 |
+ 0.344 |
+ 0.310 |
+ 0.395 |
+ EVmutation model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 34 |
+ ESM-1b |
+ Protein language model |
+ 0.394 |
+ 0.009 |
+ 0.428 |
+ 0.287 |
+ 0.406 |
+ 0.349 |
+ 0.500 |
+ 0.350 |
+ 0.398 |
+ 0.482 |
+ 0.435 |
+ 0.491 |
+ 0.438 |
+ 0.260 |
+ 0.384 |
+ 0.229 |
+ 0.184 |
+ 0.148 |
+ 0.268 |
+ ESM-1b (w/ Brandes et al. extensions) |
+ [1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a> |
+
+
+ 35 |
+ VESPAl |
+ Protein language model |
+ 0.393 |
+ 0.009 |
+ 0.429 |
+ 0.347 |
+ 0.326 |
+ 0.402 |
+ 0.461 |
+ 0.382 |
+ 0.412 |
+ 0.447 |
+ 0.394 |
+ 0.472 |
+ 0.433 |
+ 0.403 |
+ 0.386 |
+ 0.143 |
+ 0.332 |
+ 0.272 |
+ 0.310 |
+ VESPAl model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 36 |
+ Progen2 XL |
+ Protein language model |
+ 0.391 |
+ 0.009 |
+ 0.402 |
+ 0.302 |
+ 0.418 |
+ 0.387 |
+ 0.445 |
+ 0.354 |
+ 0.405 |
+ 0.445 |
+ 0.386 |
+ 0.458 |
+ 0.418 |
+ 0.402 |
+ 0.385 |
+ 0.187 |
+ 0.283 |
+ 0.235 |
+ 0.289 |
+ Progen2 xlarge model (6.4B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 37 |
+ ESM2 (150M) |
+ Protein language model |
+ 0.386 |
+ 0.012 |
+ 0.391 |
+ 0.326 |
+ 0.402 |
+ 0.300 |
+ 0.510 |
+ 0.306 |
+ 0.358 |
+ 0.495 |
+ 0.450 |
+ 0.475 |
+ 0.392 |
+ 0.157 |
+ 0.386 |
+ 0.242 |
+ 0.122 |
+ 0.141 |
+ 0.200 |
+ ESM2 model (150M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 38 |
+ MIF |
+ Inverse folding model |
+ 0.382 |
+ 0.010 |
+ 0.327 |
+ 0.336 |
+ 0.430 |
+ 0.297 |
+ 0.522 |
+ 0.349 |
+ 0.375 |
+ 0.446 |
+ 0.398 |
+ 0.387 |
+ 0.413 |
+ 0.365 |
+ 0.411 |
+ 0.268 |
+ 0.261 |
+ 0.229 |
+ 0.251 |
+ MIF model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 39 |
+ Progen2 L |
+ Protein language model |
+ 0.380 |
+ 0.009 |
+ 0.406 |
+ 0.293 |
+ 0.427 |
+ 0.379 |
+ 0.396 |
+ 0.348 |
+ 0.381 |
+ 0.420 |
+ 0.411 |
+ 0.432 |
+ 0.363 |
+ 0.323 |
+ 0.372 |
+ 0.146 |
+ 0.230 |
+ 0.205 |
+ 0.265 |
+ Progen2 large model (2.7B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 40 |
+ Progen2 M |
+ Protein language model |
+ 0.379 |
+ 0.009 |
+ 0.393 |
+ 0.295 |
+ 0.433 |
+ 0.381 |
+ 0.396 |
+ 0.318 |
+ 0.382 |
+ 0.426 |
+ 0.413 |
+ 0.417 |
+ 0.353 |
+ 0.335 |
+ 0.373 |
+ 0.132 |
+ 0.144 |
+ 0.133 |
+ 0.174 |
+ Progen2 medium model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 41 |
+ Progen2 Base |
+ Protein language model |
+ 0.377 |
+ 0.010 |
+ 0.396 |
+ 0.294 |
+ 0.437 |
+ 0.378 |
+ 0.383 |
+ 0.342 |
+ 0.368 |
+ 0.423 |
+ 0.422 |
+ 0.422 |
+ 0.328 |
+ 0.316 |
+ 0.370 |
+ 0.128 |
+ 0.136 |
+ 0.144 |
+ 0.196 |
+ Progen2 base model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 42 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.374 |
+ 0.010 |
+ 0.401 |
+ 0.288 |
+ 0.413 |
+ 0.386 |
+ 0.381 |
+ 0.358 |
+ 0.371 |
+ 0.416 |
+ 0.391 |
+ 0.394 |
+ 0.353 |
+ 0.404 |
+ 0.363 |
+ 0.174 |
+ 0.305 |
+ 0.247 |
+ 0.325 |
+ Tranception Large model (700M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 43 |
+ ESM-1v (single) |
+ Protein language model |
+ 0.374 |
+ 0.011 |
+ 0.396 |
+ 0.268 |
+ 0.405 |
+ 0.361 |
+ 0.437 |
+ 0.287 |
+ 0.371 |
+ 0.476 |
+ 0.427 |
+ 0.433 |
+ 0.385 |
+ 0.258 |
+ 0.376 |
+ 0.192 |
+ 0.184 |
+ 0.146 |
+ 0.207 |
+ ESM-1v (single seed) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 44 |
+ Wavenet |
+ Alignment-based model |
+ 0.373 |
+ 0.013 |
+ 0.379 |
+ 0.325 |
+ 0.350 |
+ 0.363 |
+ 0.449 |
+ 0.299 |
+ 0.389 |
+ 0.451 |
+ 0.393 |
+ 0.425 |
+ 0.405 |
+ 0.340 |
+ 0.358 |
+ 0.207 |
+ 0.241 |
+ 0.209 |
+ 0.290 |
+ Wavenet model |
+ <a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a> |
+
+
+ 45 |
+ RITA XL |
+ Protein language model |
+ 0.372 |
+ 0.011 |
+ 0.366 |
+ 0.302 |
+ 0.414 |
+ 0.381 |
+ 0.398 |
+ 0.315 |
+ 0.382 |
+ 0.413 |
+ 0.396 |
+ 0.399 |
+ 0.342 |
+ 0.394 |
+ 0.358 |
+ 0.141 |
+ 0.144 |
+ 0.157 |
+ 0.230 |
+ RITA xlarge model (1.2B params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 46 |
+ CARP (640M) |
+ Protein language model |
+ 0.368 |
+ 0.010 |
+ 0.395 |
+ 0.273 |
+ 0.397 |
+ 0.365 |
+ 0.412 |
+ 0.314 |
+ 0.374 |
+ 0.428 |
+ 0.417 |
+ 0.395 |
+ 0.380 |
+ 0.284 |
+ 0.391 |
+ 0.213 |
+ 0.173 |
+ 0.151 |
+ 0.171 |
+ CARP model (640M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 47 |
+ RITA L |
+ Protein language model |
+ 0.365 |
+ 0.010 |
+ 0.359 |
+ 0.290 |
+ 0.420 |
+ 0.374 |
+ 0.383 |
+ 0.316 |
+ 0.369 |
+ 0.403 |
+ 0.396 |
+ 0.399 |
+ 0.309 |
+ 0.384 |
+ 0.348 |
+ 0.136 |
+ 0.130 |
+ 0.138 |
+ 0.204 |
+ RITA large model (680M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 48 |
+ Site-Independent |
+ Alignment-based model |
+ 0.359 |
+ 0.014 |
+ 0.369 |
+ 0.344 |
+ 0.343 |
+ 0.379 |
+ 0.358 |
+ 0.426 |
+ 0.373 |
+ 0.316 |
+ 0.380 |
+ 0.389 |
+ 0.312 |
+ 0.375 |
+ 0.337 |
+ 0.238 |
+ 0.232 |
+ 0.261 |
+ 0.338 |
+ Site-Independent model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 49 |
+ RITA M |
+ Protein language model |
+ 0.349 |
+ 0.010 |
+ 0.352 |
+ 0.273 |
+ 0.405 |
+ 0.369 |
+ 0.348 |
+ 0.304 |
+ 0.349 |
+ 0.388 |
+ 0.379 |
+ 0.367 |
+ 0.295 |
+ 0.375 |
+ 0.337 |
+ 0.112 |
+ 0.124 |
+ 0.143 |
+ 0.200 |
+ RITA medium model (300M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 50 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.348 |
+ 0.010 |
+ 0.349 |
+ 0.284 |
+ 0.406 |
+ 0.359 |
+ 0.342 |
+ 0.293 |
+ 0.349 |
+ 0.376 |
+ 0.380 |
+ 0.357 |
+ 0.295 |
+ 0.340 |
+ 0.332 |
+ 0.140 |
+ 0.094 |
+ 0.120 |
+ 0.182 |
+ Tranception Medium model (300M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 51 |
+ Unirep evotuned |
+ Hybrid - Alignment & PLM |
+ 0.347 |
+ 0.012 |
+ 0.355 |
+ 0.305 |
+ 0.365 |
+ 0.344 |
+ 0.366 |
+ 0.330 |
+ 0.344 |
+ 0.371 |
+ 0.357 |
+ 0.378 |
+ 0.330 |
+ 0.348 |
+ 0.319 |
+ 0.156 |
+ 0.247 |
+ 0.228 |
+ 0.295 |
+ Unirep model w/ evotuning |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 52 |
+ Progen2 S |
+ Protein language model |
+ 0.335 |
+ 0.011 |
+ 0.333 |
+ 0.275 |
+ 0.384 |
+ 0.334 |
+ 0.349 |
+ 0.283 |
+ 0.321 |
+ 0.389 |
+ 0.385 |
+ 0.358 |
+ 0.286 |
+ 0.271 |
+ 0.327 |
+ 0.111 |
+ 0.106 |
+ 0.111 |
+ 0.162 |
+ Progen2 small model (150M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 53 |
+ CARP (76M) |
+ Protein language model |
+ 0.327 |
+ 0.012 |
+ 0.342 |
+ 0.282 |
+ 0.369 |
+ 0.268 |
+ 0.376 |
+ 0.247 |
+ 0.301 |
+ 0.406 |
+ 0.388 |
+ 0.366 |
+ 0.296 |
+ 0.157 |
+ 0.336 |
+ 0.204 |
+ 0.096 |
+ 0.109 |
+ 0.117 |
+ CARP model (76M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 54 |
+ ESM2 (35M) |
+ Protein language model |
+ 0.320 |
+ 0.013 |
+ 0.314 |
+ 0.291 |
+ 0.343 |
+ 0.212 |
+ 0.439 |
+ 0.239 |
+ 0.271 |
+ 0.448 |
+ 0.371 |
+ 0.408 |
+ 0.308 |
+ 0.120 |
+ 0.305 |
+ 0.240 |
+ 0.113 |
+ 0.131 |
+ 0.198 |
+ ESM2 model (35M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 55 |
+ RITA S |
+ Protein language model |
+ 0.304 |
+ 0.012 |
+ 0.294 |
+ 0.275 |
+ 0.336 |
+ 0.325 |
+ 0.289 |
+ 0.276 |
+ 0.297 |
+ 0.332 |
+ 0.331 |
+ 0.299 |
+ 0.232 |
+ 0.344 |
+ 0.287 |
+ 0.111 |
+ 0.090 |
+ 0.112 |
+ 0.173 |
+ RITA small model (85M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 56 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.302 |
+ 0.012 |
+ 0.288 |
+ 0.286 |
+ 0.349 |
+ 0.317 |
+ 0.270 |
+ 0.258 |
+ 0.295 |
+ 0.317 |
+ 0.318 |
+ 0.283 |
+ 0.257 |
+ 0.304 |
+ 0.279 |
+ 0.113 |
+ 0.096 |
+ 0.117 |
+ 0.167 |
+ Tranception Small model (85M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 57 |
+ CARP (38M) |
+ Protein language model |
+ 0.279 |
+ 0.014 |
+ 0.285 |
+ 0.268 |
+ 0.312 |
+ 0.215 |
+ 0.314 |
+ 0.196 |
+ 0.239 |
+ 0.356 |
+ 0.322 |
+ 0.307 |
+ 0.248 |
+ 0.127 |
+ 0.279 |
+ 0.168 |
+ 0.083 |
+ 0.111 |
+ 0.133 |
+ CARP model (38M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 58 |
+ ProteinMPNN |
+ Inverse folding model |
+ 0.257 |
+ 0.011 |
+ 0.197 |
+ 0.163 |
+ 0.198 |
+ 0.161 |
+ 0.565 |
+ 0.173 |
+ 0.279 |
+ 0.433 |
+ 0.284 |
+ 0.394 |
+ 0.343 |
+ 0.262 |
+ 0.292 |
+ 0.259 |
+ 0.167 |
+ 0.170 |
+ 0.273 |
+ ProteinMPNN model |
+ <a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a> |
+
+
+ 59 |
+ ESM2 (8M) |
+ Protein language model |
+ 0.225 |
+ 0.014 |
+ 0.201 |
+ 0.260 |
+ 0.266 |
+ 0.136 |
+ 0.262 |
+ 0.194 |
+ 0.179 |
+ 0.261 |
+ 0.241 |
+ 0.248 |
+ 0.194 |
+ 0.090 |
+ 0.203 |
+ 0.135 |
+ 0.087 |
+ 0.115 |
+ 0.176 |
+ ESM2 model (8M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 60 |
+ Unirep |
+ Protein language model |
+ 0.189 |
+ 0.016 |
+ 0.182 |
+ 0.202 |
+ 0.216 |
+ 0.135 |
+ 0.210 |
+ 0.181 |
+ 0.161 |
+ 0.205 |
+ 0.215 |
+ 0.233 |
+ 0.146 |
+ 0.066 |
+ 0.175 |
+ 0.070 |
+ 0.098 |
+ 0.123 |
+ 0.171 |
+ Unirep model |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 61 |
+ ProtGPT2 |
+ Protein language model |
+ 0.188 |
+ 0.012 |
+ 0.176 |
+ 0.149 |
+ 0.193 |
+ 0.164 |
+ 0.257 |
+ 0.177 |
+ 0.173 |
+ 0.253 |
+ 0.244 |
+ 0.242 |
+ 0.132 |
+ 0.138 |
+ 0.181 |
+ 0.139 |
+ 0.041 |
+ 0.036 |
+ 0.064 |
+ ProtGPT2 model |
+ <a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a> |
+
+
+ 62 |
+ CARP (600K) |
+ Protein language model |
+ 0.106 |
+ 0.016 |
+ 0.112 |
+ 0.084 |
+ 0.171 |
+ 0.056 |
+ 0.105 |
+ 0.095 |
+ 0.087 |
+ 0.098 |
+ 0.122 |
+ 0.086 |
+ 0.072 |
+ 0.055 |
+ 0.108 |
+ 0.025 |
+ 0.026 |
+ 0.068 |
+ 0.100 |
+ CARP model (600K params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.csv b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.csv
new file mode 100644
index 0000000..7b3dcde
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.csv
@@ -0,0 +1,217 @@
+DMS ID,Site-Independent,EVmutation,DeepSequence (single),DeepSequence (ensemble),EVE (single),EVE (ensemble),Unirep,Unirep evotuned,MSA Transformer (single),MSA Transformer (ensemble),ESM-1b,ESM-1v (single),ESM-1v (ensemble),ESM2 (8M),ESM2 (35M),ESM2 (150M),ESM2 (650M),ESM2 (3B),ESM2 (15B),Wavenet,RITA S,RITA M,RITA L,RITA XL,Progen2 S,Progen2 M,Progen2 Base,Progen2 L,Progen2 XL,GEMME,VESPA,VESPAl,ProtGPT2,Tranception S no retrieval,Tranception M no retrieval,Tranception L no retrieval,Tranception S,Tranception M,Tranception L,TranceptEVE S,TranceptEVE M,TranceptEVE L,CARP (38M),CARP (600K),CARP (640M),CARP (76M),MIF,MIF-ST,ESM-IF1,ProteinMPNN,ProtSSN (k=10 h=512),ProtSSN (k=10 h=768),ProtSSN (k=10 h=1280),ProtSSN (k=20 h=512),ProtSSN (k=20 h=768),ProtSSN (k=20 h=1280),ProtSSN (k=30 h=512),ProtSSN (k=30 h=768),ProtSSN (k=30 h=1280),ProtSSN (ensemble),SaProt (650M),SaProt (35M),Number of Mutants,Selection Type,UniProt ID,MSA_Neff_L_category,Taxon
+A0A140D2T1_ZIKV_Sourisseau_2019,0.278,0.279,0.204,0.205,0.273,0.28,0.093,0.086,0.315,0.318,0.117,0.1,0.124,0.081,0.096,0.086,0.127,0.237,0.262,0.178,0.268,0.28,0.285,0.278,0.251,0.267,0.286,0.254,0.277,0.291,0.257,0.219,0.099,0.223,0.232,0.241,0.265,0.267,0.286,0.268,0.269,0.285,0.104,0.103,0.148,0.101,0.169,0.201,0.151,0.153,0.18,0.168,0.183,0.183,0.181,0.166,0.148,0.172,0.159,0.171,0.16,0.135,9576,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+A0A192B1T2_9HIV1_Haddox_2018,0.276,0.27,0.269,0.273,0.277,0.28,0.095,0.269,0.293,0.293,0.234,0.259,0.266,0.103,0.094,0.095,0.103,0.11,0.117,0.281,0.279,0.28,0.286,0.291,0.273,0.289,0.279,0.282,0.293,0.293,0.263,0.248,0.221,0.274,0.284,0.289,0.281,0.284,0.286,0.286,0.282,0.285,0.192,0.072,0.267,0.261,0.138,0.268,0.121,0.145,0.129,0.148,0.154,0.138,0.143,0.141,0.144,0.126,0.121,0.143,0.137,0.115,12577,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+A0A1I9GEU1_NEIME_Kennouche_2019,0.086,0.097,0.108,0.108,0.086,0.097,0.075,0.129,0.108,0.075,0.118,0.129,0.108,0.086,0.097,0.118,0.075,0.086,0.108,0.065,0.108,0.097,0.086,0.086,0.086,0.097,0.086,0.108,0.075,0.065,0.108,0.097,0.183,0.086,0.108,0.097,0.108,0.086,0.097,0.086,0.097,0.097,0.075,0.032,0.097,0.065,0.129,0.108,0.14,0.118,0.129,0.097,0.118,0.14,0.075,0.118,0.097,0.151,0.097,0.097,0.14,0.054,922,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+A0A247D711_LISMN_Stadelmann_2021,0.139,0.151,0.12,0.114,0.163,0.151,0.078,0.096,0.163,0.169,0.12,0.12,0.127,0.09,0.09,0.114,0.133,0.12,0.127,0.157,0.09,0.096,0.114,0.114,0.066,0.127,0.133,0.078,0.114,0.163,0.157,0.187,0.06,0.09,0.108,0.084,0.133,0.139,0.151,0.157,0.157,0.157,0.114,0.102,0.12,0.108,0.139,0.145,0.193,0.175,0.163,0.181,0.12,0.12,0.145,0.12,0.12,0.127,0.139,0.145,0.133,0.102,1653,Activity,A0A247D711_LISMN,High,Prokaryote
+A0A2Z5U3Z0_9INFA_Doud_2016,0.257,0.327,0.31,0.316,0.317,0.313,0.089,0.271,0.324,0.326,0.109,0.293,0.329,0.103,0.104,0.104,0.271,0.301,0.297,0.299,0.305,0.322,0.324,0.329,0.203,0.311,0.332,0.315,0.303,0.314,0.244,0.19,0.105,0.294,0.319,0.308,0.31,0.313,0.31,0.329,0.327,0.324,0.091,0.095,0.166,0.1,0.176,0.218,0.194,0.162,0.247,0.261,0.257,0.264,0.255,0.264,0.252,0.265,0.267,0.258,0.178,0.166,10715,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A0A2Z5U3Z0_9INFA_Wu_2014,0.209,0.234,0.234,0.247,0.238,0.234,0.098,0.272,0.247,0.243,0.153,0.217,0.247,0.136,0.136,0.14,0.213,0.2,0.213,0.306,0.221,0.234,0.26,0.255,0.183,0.217,0.26,0.213,0.243,0.243,0.247,0.217,0.085,0.221,0.226,0.23,0.243,0.26,0.26,0.23,0.238,0.23,0.128,0.111,0.179,0.145,0.162,0.196,0.196,0.136,0.209,0.221,0.217,0.209,0.2,0.221,0.196,0.2,0.209,0.204,0.123,0.145,2350,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+A4_HUMAN_Seuma_2022,0.104,0.158,0.167,0.166,0.128,0.134,0.072,0.13,0.164,0.155,0.089,0.123,0.149,0.069,0.068,0.076,0.078,0.126,0.126,0.141,0.096,0.105,0.096,0.121,0.104,0.099,0.095,0.095,0.103,0.144,0.13,0.14,0.159,0.076,0.146,0.173,0.097,0.155,0.164,0.119,0.144,0.152,0.06,0.059,0.134,0.07,0.111,0.092,0.032,0.111,0.142,0.148,0.128,0.122,0.115,0.131,0.125,0.115,0.131,0.124,0.087,0.091,14811,Stability,A4_HUMAN,Low,Human
+A4D664_9INFA_Soh_2019,0.242,0.245,0.247,0.247,0.256,0.257,0.105,0.228,0.191,0.188,0.096,0.095,0.096,0.105,0.109,0.104,0.135,0.146,0.198,0.222,0.204,0.23,0.234,0.215,0.107,0.168,0.171,0.173,0.2,0.237,0.199,0.191,0.106,0.202,0.231,0.235,0.234,0.245,0.245,0.258,0.259,0.261,0.105,0.109,0.125,0.112,0.122,0.129,0.072,0.13,0.124,0.123,0.105,0.133,0.123,0.13,0.119,0.127,0.129,0.119,0.118,0.109,14421,OrganismalFitness,A4D664_9INFA,Medium,Virus
+A4GRB6_PSEAI_Chen_2020,0.18,0.204,0.198,0.22,0.184,0.184,0.138,0.176,0.244,0.226,0.257,0.234,0.253,0.146,0.216,0.263,0.309,0.261,0.271,0.23,0.136,0.172,0.194,0.232,0.184,0.23,0.238,0.234,0.242,0.192,0.253,0.21,0.054,0.174,0.204,0.216,0.238,0.236,0.214,0.226,0.214,0.212,0.138,0.092,0.305,0.21,0.283,0.309,0.248,0.222,0.329,0.307,0.293,0.337,0.331,0.323,0.317,0.343,0.329,0.331,0.271,0.234,5004,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+AACC1_PSEAI_Dandage_2018,0.177,0.26,0.188,0.204,0.193,0.193,0.144,0.094,0.249,0.238,0.204,0.21,0.199,0.133,0.155,0.166,0.215,0.21,0.238,0.227,0.149,0.166,0.199,0.138,0.16,0.204,0.193,0.193,0.21,0.276,0.16,0.122,0.083,0.199,0.177,0.227,0.227,0.204,0.21,0.21,0.21,0.215,0.138,0.127,0.16,0.166,0.227,0.238,0.227,0.188,0.249,0.221,0.221,0.232,0.232,0.265,0.215,0.238,0.265,0.249,0.171,0.166,1801,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+ACE2_HUMAN_Chan_2020,0.157,0.157,0.211,0.193,0.166,0.157,0.054,0.148,0.157,0.148,0.103,0.143,0.139,0.058,0.09,0.108,0.139,0.121,0.139,0.193,0.126,0.202,0.22,0.188,0.157,0.188,0.184,0.197,0.184,0.197,0.157,0.139,0.143,0.108,0.188,0.17,0.17,0.152,0.175,0.184,0.17,0.184,0.085,0.072,0.121,0.067,0.076,0.121,0.143,0.09,0.121,0.13,0.117,0.108,0.117,0.139,0.126,0.112,0.121,0.099,0.112,0.103,2223,Binding,ACE2_HUMAN,Medium,Human
+ADRB2_HUMAN_Jones_2020,0.142,0.168,0.168,0.174,0.201,0.206,0.142,0.206,0.171,0.188,0.178,0.156,0.177,0.146,0.15,0.158,0.158,0.167,0.172,0.159,0.168,0.174,0.172,0.171,0.188,0.179,0.183,0.181,0.159,0.162,0.155,0.133,0.121,0.186,0.168,0.182,0.165,0.165,0.174,0.191,0.192,0.204,0.163,0.118,0.172,0.177,0.156,0.181,0.167,0.127,0.155,0.16,0.135,0.165,0.16,0.153,0.15,0.154,0.145,0.154,0.171,0.16,7800,Activity,ADRB2_HUMAN,Medium,Human
+AICDA_HUMAN_Gajula_2014_3cycles,0.143,0.238,0.333,0.19,0.238,0.238,0.048,0.143,0.19,0.19,0.238,0.19,0.19,0.095,0.0,0.143,0.19,0.143,0.095,0.19,0.048,0.095,0.286,0.19,0.048,0.19,0.19,0.19,0.19,0.19,0.19,0.238,0.095,0.095,0.095,0.143,0.238,0.19,0.238,0.238,0.238,0.19,0.048,0.048,0.19,0.048,0.333,0.143,0.143,0.333,0.143,0.333,0.143,0.286,0.143,0.143,0.143,0.143,0.143,0.143,0.048,0.048,209,Activity,AICDA_HUMAN,Medium,Human
+AMFR_HUMAN_Tsuboyama_2023_4G3O,0.289,0.372,0.372,0.369,0.359,0.379,0.07,0.336,0.352,0.369,0.433,0.101,0.319,0.057,0.124,0.409,0.386,0.409,0.413,0.366,0.057,0.117,0.346,0.342,0.114,0.339,0.342,0.265,0.383,0.336,0.383,0.346,0.326,0.057,0.094,0.114,0.339,0.336,0.346,0.383,0.379,0.383,0.141,0.134,0.396,0.315,0.359,0.356,0.416,0.406,0.406,0.409,0.399,0.386,0.403,0.396,0.379,0.386,0.403,0.396,0.43,0.235,2972,Stability,AMFR_HUMAN,Medium,Human
+AMIE_PSEAE_Wrenbeck_2017,0.143,0.114,0.13,0.124,0.127,0.119,0.103,0.159,0.119,0.141,0.181,0.144,0.128,0.152,0.167,0.197,0.193,0.233,0.127,0.165,0.188,0.175,0.157,0.13,0.151,0.151,0.148,0.169,0.103,0.104,0.13,0.133,0.074,0.188,0.136,0.112,0.127,0.12,0.106,0.124,0.122,0.124,0.159,0.146,0.193,0.17,0.185,0.226,0.207,0.143,0.188,0.194,0.18,0.175,0.159,0.154,0.165,0.169,0.165,0.17,0.234,0.161,6227,Activity,AMIE_PSEAE,High,Prokaryote
+ANCSZ_Hobbs_2022,0.214,0.208,0.208,0.206,0.206,0.199,0.156,0.176,0.154,0.154,0.156,0.171,0.165,0.195,0.231,0.251,0.244,0.225,0.212,0.086,0.169,0.128,0.146,0.148,0.173,0.15,0.141,0.137,0.113,0.191,0.139,0.126,0.139,0.165,0.126,0.137,0.193,0.167,0.169,0.203,0.186,0.188,0.206,0.169,0.152,0.188,0.173,0.158,0.188,0.133,0.251,0.242,0.214,0.225,0.225,0.236,0.238,0.257,0.233,0.24,0.208,0.236,4670,Activity,ANCSZ,Medium,Eukaryote
+ARGR_ECOLI_Tsuboyama_2023_1AOY,0.209,0.194,0.248,0.24,0.24,0.217,0.155,0.132,0.171,0.202,0.233,0.178,0.233,0.194,0.233,0.287,0.264,0.24,0.202,0.178,0.217,0.132,0.14,0.14,0.217,0.116,0.155,0.147,0.147,0.163,0.109,0.062,0.031,0.178,0.132,0.132,0.194,0.217,0.186,0.248,0.209,0.194,0.194,0.155,0.233,0.248,0.372,0.31,0.302,0.279,0.24,0.279,0.264,0.264,0.256,0.256,0.233,0.256,0.271,0.256,0.295,0.31,1287,Stability,ARGR_ECOLI,Medium,Prokaryote
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,0.294,0.235,0.353,0.294,0.294,0.294,0.176,0.294,0.294,0.294,0.059,0.059,0.471,0.235,0.235,0.235,0.353,0.294,0.294,0.294,0.176,0.059,0.176,0.235,0.059,0.235,0.294,0.118,0.235,0.235,0.235,0.235,0.176,0.294,0.118,0.176,0.353,0.353,0.294,0.353,0.353,0.294,0.235,0.235,0.412,0.176,0.176,0.353,0.235,0.059,0.235,0.294,0.235,0.235,0.176,0.235,0.235,0.294,0.235,0.235,0.353,0.176,170,Binding,B2L11_HUMAN,Low,Human
+BBC1_YEAST_Tsuboyama_2023_1TG0,0.217,0.28,0.203,0.203,0.213,0.179,0.184,0.174,0.285,0.285,0.271,0.29,0.309,0.126,0.275,0.3,0.372,0.329,0.329,0.251,0.208,0.227,0.251,0.242,0.126,0.261,0.198,0.227,0.217,0.188,0.184,0.14,0.15,0.232,0.188,0.242,0.222,0.232,0.266,0.208,0.174,0.198,0.227,0.077,0.217,0.232,0.285,0.246,0.348,0.343,0.324,0.338,0.329,0.382,0.353,0.396,0.362,0.362,0.362,0.353,0.314,0.353,2069,Stability,BBC1_YEAST,High,Eukaryote
+BCHB_CHLTE_Tsuboyama_2023_2KRU,0.253,0.247,0.228,0.241,0.234,0.247,0.127,0.184,0.196,0.278,0.241,0.297,0.304,0.133,0.158,0.228,0.335,0.266,0.31,0.272,0.12,0.215,0.272,0.31,0.241,0.291,0.234,0.297,0.278,0.253,0.215,0.177,0.095,0.177,0.228,0.259,0.253,0.272,0.272,0.241,0.241,0.253,0.19,0.158,0.266,0.234,0.297,0.342,0.386,0.323,0.386,0.361,0.342,0.373,0.354,0.354,0.361,0.361,0.31,0.361,0.361,0.222,1572,Stability,BCHB_CHLTE,Medium,Prokaryote
+BLAT_ECOLX_Deng_2012,0.238,0.334,0.34,0.348,0.366,0.366,0.11,0.17,0.276,0.34,0.298,0.29,0.32,0.172,0.25,0.268,0.294,0.268,0.256,0.298,0.258,0.26,0.276,0.238,0.248,0.336,0.304,0.306,0.228,0.3,0.34,0.32,0.098,0.25,0.296,0.246,0.298,0.338,0.314,0.36,0.386,0.378,0.204,0.074,0.318,0.248,0.334,0.326,0.324,0.218,0.338,0.354,0.336,0.38,0.352,0.352,0.368,0.352,0.376,0.362,0.33,0.29,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Firnberg_2014,0.229,0.385,0.404,0.394,0.41,0.41,0.14,0.19,0.3,0.362,0.338,0.317,0.356,0.2,0.262,0.323,0.35,0.321,0.252,0.333,0.292,0.265,0.258,0.235,0.296,0.346,0.306,0.315,0.208,0.296,0.335,0.317,0.14,0.271,0.292,0.238,0.317,0.338,0.321,0.404,0.412,0.423,0.235,0.123,0.367,0.279,0.362,0.381,0.406,0.215,0.365,0.352,0.35,0.369,0.381,0.367,0.369,0.373,0.358,0.377,0.402,0.298,4783,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Jacquier_2013,0.131,0.225,0.214,0.217,0.217,0.219,0.099,0.139,0.174,0.201,0.182,0.187,0.193,0.134,0.144,0.155,0.187,0.179,0.179,0.168,0.179,0.176,0.195,0.171,0.166,0.179,0.195,0.203,0.174,0.142,0.187,0.195,0.115,0.182,0.193,0.195,0.19,0.203,0.201,0.214,0.209,0.219,0.128,0.078,0.179,0.152,0.15,0.193,0.182,0.155,0.19,0.201,0.206,0.195,0.184,0.198,0.198,0.184,0.201,0.195,0.19,0.155,989,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BLAT_ECOLX_Stiffler_2015,0.204,0.35,0.356,0.336,0.374,0.384,0.088,0.222,0.256,0.334,0.314,0.294,0.324,0.182,0.216,0.274,0.306,0.292,0.258,0.292,0.288,0.286,0.26,0.25,0.272,0.336,0.336,0.302,0.226,0.292,0.356,0.364,0.084,0.288,0.29,0.264,0.33,0.328,0.324,0.38,0.396,0.4,0.18,0.09,0.328,0.214,0.318,0.348,0.344,0.19,0.352,0.32,0.334,0.356,0.364,0.356,0.354,0.33,0.342,0.364,0.346,0.244,4996,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+BRCA1_HUMAN_Findlay_2018,0.223,0.196,0.234,0.245,0.217,0.239,0.136,0.136,0.201,0.207,0.168,0.152,0.163,0.152,0.12,0.147,0.168,0.212,0.13,0.158,0.136,0.212,0.223,0.158,0.207,0.228,0.239,0.185,0.179,0.196,0.212,0.19,0.12,0.13,0.147,0.179,0.207,0.217,0.223,0.255,0.25,0.212,0.141,0.168,0.228,0.179,0.185,0.228,0.13,0.141,0.174,0.185,0.168,0.163,0.163,0.152,0.158,0.179,0.163,0.163,0.147,0.168,1837,OrganismalFitness,BRCA1_HUMAN,Low,Human
+BRCA2_HUMAN_Erwood_2022_HEK293T,0.259,0.185,0.148,0.185,0.259,0.185,0.148,0.111,0.111,0.148,0.148,0.111,0.037,0.111,0.148,0.185,0.185,0.185,0.111,0.037,0.185,0.259,0.296,0.148,0.222,0.148,0.111,0.259,0.074,0.111,0.148,0.222,0.111,0.185,0.222,0.185,0.148,0.185,0.185,0.185,0.185,0.185,0.111,0.148,0.185,0.185,0.148,0.111,0.037,0.111,0.222,0.185,0.148,0.111,0.222,0.148,0.111,0.148,0.222,0.148,0.074,0.111,265,OrganismalFitness,BRCA2_HUMAN,,Human
+C6KNH7_9INFA_Lee_2018,0.255,0.295,0.288,0.283,0.302,0.305,0.088,0.247,0.298,0.301,0.098,0.27,0.331,0.089,0.092,0.097,0.25,0.271,0.305,0.251,0.262,0.254,0.266,0.242,0.143,0.264,0.273,0.294,0.242,0.3,0.271,0.225,0.125,0.243,0.269,0.257,0.28,0.287,0.285,0.302,0.309,0.304,0.097,0.092,0.172,0.099,0.236,0.294,0.276,0.177,0.262,0.261,0.269,0.267,0.262,0.246,0.259,0.265,0.253,0.266,0.181,0.157,10754,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+CALM1_HUMAN_Weile_2017,0.088,0.082,0.066,0.055,0.055,0.055,0.099,0.049,0.071,0.049,0.055,0.077,0.071,0.11,0.093,0.077,0.055,0.038,0.099,0.055,0.082,0.077,0.077,0.06,0.06,0.055,0.049,0.077,0.038,0.038,0.049,0.055,0.088,0.099,0.077,0.038,0.071,0.082,0.049,0.066,0.055,0.06,0.06,0.104,0.066,0.071,0.115,0.066,0.06,0.121,0.077,0.071,0.077,0.071,0.077,0.066,0.066,0.077,0.071,0.077,0.06,0.077,1813,OrganismalFitness,CALM1_HUMAN,High,Human
+CAPSD_AAV2S_Sinai_2021,0.263,0.258,0.246,0.279,0.258,0.262,0.174,0.265,0.233,0.254,0.136,0.121,0.129,0.131,0.147,0.134,0.165,0.148,0.112,0.174,0.122,0.129,0.133,0.153,0.119,0.122,0.154,0.126,0.18,0.272,0.13,0.12,0.098,0.133,0.142,0.263,0.23,0.231,0.275,0.253,0.255,0.273,0.128,0.163,0.19,0.147,0.176,0.188,0.145,0.156,0.162,0.145,0.133,0.145,0.125,0.133,0.142,0.14,0.175,0.147,0.152,0.118,42328,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+CAR11_HUMAN_Meitlis_2020_gof,0.076,0.071,0.071,0.071,0.08,0.071,0.223,0.055,0.071,0.071,0.071,0.101,0.088,0.122,0.109,0.168,0.092,0.084,0.088,0.092,0.113,0.071,0.097,0.076,0.038,0.076,0.08,0.067,0.097,0.059,0.071,0.059,0.097,0.151,0.08,0.055,0.084,0.084,0.071,0.08,0.084,0.076,0.118,0.092,0.092,0.151,0.088,0.071,0.113,0.118,0.16,0.134,0.13,0.139,0.143,0.126,0.143,0.139,0.13,0.13,0.168,0.172,2374,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAR11_HUMAN_Meitlis_2020_lof,0.062,0.075,0.071,0.067,0.062,0.071,0.204,0.042,0.092,0.092,0.067,0.146,0.112,0.121,0.104,0.183,0.112,0.129,0.1,0.125,0.125,0.058,0.096,0.075,0.062,0.071,0.088,0.062,0.121,0.05,0.075,0.062,0.067,0.167,0.079,0.075,0.067,0.067,0.05,0.075,0.075,0.075,0.121,0.092,0.088,0.183,0.108,0.083,0.154,0.129,0.158,0.121,0.133,0.108,0.142,0.133,0.146,0.138,0.129,0.125,0.208,0.188,2395,OrganismalFitness,CAR11_HUMAN,Low,Human
+CAS9_STRP1_Spencer_2017_positive,0.096,0.112,0.111,0.105,0.111,0.112,0.096,0.106,0.116,0.117,0.11,0.096,0.106,0.097,0.103,0.126,0.106,0.117,0.11,0.097,0.099,0.096,0.117,0.116,0.1,0.112,0.096,0.111,0.101,0.105,0.117,0.117,0.111,0.091,0.099,0.128,0.111,0.116,0.123,0.101,0.101,0.117,0.095,0.091,0.111,0.099,0.099,0.111,0.103,0.094,0.118,0.107,0.115,0.102,0.111,0.111,0.099,0.112,0.1,0.11,0.127,0.115,8117,Activity,CAS9_STRP1,Medium,Prokaryote
+CASP3_HUMAN_Roychowdhury_2020,0.159,0.204,0.153,0.115,0.134,0.153,0.089,0.159,0.21,0.223,0.146,0.115,0.153,0.089,0.146,0.166,0.178,0.146,0.21,0.159,0.146,0.159,0.159,0.153,0.14,0.159,0.178,0.172,0.204,0.146,0.229,0.223,0.115,0.102,0.14,0.172,0.134,0.166,0.166,0.146,0.172,0.159,0.096,0.076,0.159,0.14,0.159,0.166,0.127,0.102,0.185,0.178,0.191,0.14,0.172,0.166,0.185,0.166,0.172,0.166,0.146,0.134,1567,Activity,CASP3_HUMAN,High,Human
+CASP7_HUMAN_Roychowdhury_2020,0.149,0.202,0.238,0.214,0.25,0.244,0.06,0.155,0.226,0.25,0.268,0.19,0.232,0.107,0.208,0.274,0.208,0.196,0.19,0.238,0.095,0.214,0.155,0.167,0.202,0.179,0.179,0.167,0.196,0.226,0.167,0.155,0.107,0.083,0.19,0.137,0.208,0.226,0.19,0.22,0.232,0.244,0.185,0.06,0.262,0.274,0.238,0.28,0.244,0.19,0.196,0.214,0.244,0.232,0.238,0.226,0.214,0.22,0.226,0.22,0.226,0.208,1680,Activity,CASP7_HUMAN,Medium,Human
+CATR_CHLRE_Tsuboyama_2023_2AMI,0.272,0.147,0.147,0.152,0.183,0.162,0.236,0.094,0.131,0.136,0.115,0.173,0.147,0.225,0.257,0.209,0.152,0.157,0.131,0.089,0.12,0.126,0.11,0.105,0.073,0.089,0.089,0.073,0.105,0.152,0.126,0.173,0.183,0.131,0.11,0.099,0.199,0.173,0.126,0.168,0.152,0.157,0.194,0.183,0.084,0.126,0.314,0.131,0.183,0.293,0.183,0.115,0.141,0.152,0.141,0.147,0.157,0.141,0.141,0.157,0.099,0.257,1903,Stability,CATR_CHLRE,High,Eukaryote
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,0.227,0.159,0.227,0.227,0.237,0.237,0.222,0.184,0.174,0.198,0.304,0.242,0.256,0.227,0.246,0.251,0.222,0.227,0.246,0.237,0.174,0.193,0.261,0.251,0.29,0.261,0.285,0.246,0.213,0.184,0.184,0.14,0.155,0.237,0.29,0.285,0.227,0.242,0.246,0.261,0.237,0.261,0.29,0.261,0.275,0.29,0.295,0.309,0.242,0.285,0.179,0.242,0.246,0.217,0.237,0.232,0.184,0.203,0.203,0.232,0.237,0.309,2068,Stability,CBPA2_HUMAN,Medium,Human
+CBS_HUMAN_Sun_2020,0.168,0.161,0.169,0.166,0.176,0.175,0.1,0.141,0.173,0.162,0.163,0.15,0.162,0.098,0.127,0.127,0.132,0.145,0.157,0.161,0.143,0.133,0.166,0.173,0.147,0.152,0.154,0.163,0.152,0.163,0.145,0.13,0.109,0.15,0.141,0.151,0.158,0.177,0.194,0.172,0.18,0.179,0.13,0.102,0.158,0.15,0.147,0.147,0.169,0.116,0.139,0.136,0.147,0.141,0.145,0.151,0.15,0.154,0.155,0.151,0.147,0.127,7217,OrganismalFitness,CBS_HUMAN,Medium,Human
+CBX4_HUMAN_Tsuboyama_2023_2K28,0.437,0.406,0.441,0.445,0.472,0.48,0.013,0.31,0.445,0.424,0.428,0.397,0.402,0.022,0.533,0.533,0.48,0.445,0.41,0.445,0.306,0.341,0.328,0.332,0.376,0.358,0.371,0.341,0.332,0.445,0.432,0.376,0.192,0.301,0.328,0.341,0.45,0.402,0.441,0.48,0.424,0.441,0.437,0.096,0.319,0.406,0.371,0.262,0.52,0.467,0.507,0.48,0.485,0.48,0.52,0.459,0.472,0.472,0.476,0.507,0.437,0.507,2282,Stability,CBX4_HUMAN,High,Human
+CCDB_ECOLI_Adkar_2012,0.124,0.152,0.146,0.146,0.118,0.129,0.096,0.107,0.079,0.118,0.146,0.056,0.079,0.101,0.112,0.096,0.157,0.112,0.14,0.107,0.062,0.045,0.067,0.096,0.017,0.101,0.09,0.146,0.112,0.107,0.197,0.163,0.067,0.056,0.073,0.124,0.118,0.129,0.152,0.135,0.146,0.146,0.096,0.079,0.118,0.124,0.062,0.152,0.135,0.101,0.174,0.118,0.174,0.213,0.185,0.169,0.163,0.169,0.14,0.169,0.157,0.09,1176,Activity,CCDB_ECOLI,High,Prokaryote
+CCDB_ECOLI_Tripathi_2016,0.117,0.125,0.124,0.125,0.124,0.124,0.09,0.117,0.108,0.117,0.115,0.112,0.112,0.093,0.097,0.112,0.12,0.124,0.127,0.12,0.093,0.09,0.083,0.11,0.085,0.102,0.086,0.102,0.125,0.118,0.126,0.127,0.108,0.089,0.099,0.124,0.118,0.116,0.124,0.122,0.122,0.124,0.096,0.085,0.114,0.095,0.111,0.123,0.113,0.105,0.121,0.118,0.115,0.12,0.124,0.122,0.121,0.124,0.123,0.122,0.12,0.117,1663,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+CCR5_HUMAN_Gill_2023,0.146,0.143,0.156,0.164,0.156,0.14,0.114,0.231,0.164,0.165,0.169,0.146,0.154,0.124,0.135,0.148,0.141,0.154,0.138,0.17,0.13,0.196,0.167,0.161,0.157,0.196,0.18,0.196,0.178,0.178,0.141,0.125,0.119,0.167,0.175,0.167,0.156,0.159,0.169,0.144,0.148,0.148,0.135,0.079,0.183,0.138,0.133,0.177,0.14,0.132,0.148,0.148,0.143,0.128,0.127,0.138,0.135,0.144,0.125,0.135,0.172,0.151,6137,Binding,CCR5_HUMAN,High,Human
+CD19_HUMAN_Klesmith_2019_FMC_singles,0.167,0.127,0.164,0.162,0.146,0.138,0.143,0.122,0.103,0.109,0.18,0.167,0.196,0.186,0.228,0.175,0.167,0.183,0.138,0.09,0.194,0.162,0.207,0.162,0.149,0.225,0.159,0.204,0.149,0.162,0.13,0.101,0.127,0.241,0.252,0.167,0.191,0.18,0.156,0.164,0.154,0.151,0.215,0.183,0.218,0.212,0.34,0.321,0.324,0.138,0.228,0.191,0.202,0.215,0.218,0.218,0.236,0.21,0.228,0.218,0.353,0.302,3761,Binding,CD19_HUMAN,Low,Human
+CP2C9_HUMAN_Amorosi_2021_abundance,0.25,0.272,0.267,0.279,0.298,0.294,0.242,0.273,0.278,0.3,0.261,0.284,0.309,0.217,0.284,0.275,0.289,0.284,0.278,0.295,0.287,0.253,0.276,0.267,0.284,0.275,0.279,0.279,0.284,0.262,0.246,0.212,0.11,0.284,0.281,0.276,0.294,0.292,0.289,0.3,0.298,0.295,0.287,0.129,0.272,0.281,0.276,0.279,0.309,0.154,0.275,0.279,0.29,0.287,0.286,0.292,0.3,0.292,0.289,0.284,0.292,0.273,6370,Expression,CP2C9_HUMAN,High,Human
+CP2C9_HUMAN_Amorosi_2021_activity,0.285,0.332,0.345,0.359,0.369,0.371,0.309,0.319,0.311,0.359,0.32,0.372,0.38,0.285,0.364,0.377,0.397,0.376,0.354,0.395,0.346,0.307,0.332,0.291,0.359,0.327,0.324,0.302,0.306,0.328,0.247,0.198,0.117,0.367,0.359,0.335,0.379,0.364,0.372,0.379,0.374,0.367,0.371,0.14,0.346,0.385,0.314,0.328,0.353,0.202,0.403,0.363,0.366,0.38,0.379,0.39,0.366,0.379,0.385,0.387,0.387,0.354,6142,Binding,CP2C9_HUMAN,High,Human
+CSN4_MOUSE_Tsuboyama_2023_1UFM,0.358,0.33,0.327,0.339,0.339,0.342,0.176,0.309,0.379,0.361,0.355,0.394,0.418,0.27,0.464,0.397,0.248,0.336,0.324,0.327,0.318,0.339,0.348,0.355,0.224,0.376,0.409,0.352,0.333,0.33,0.348,0.321,0.203,0.236,0.358,0.358,0.385,0.385,0.373,0.355,0.367,0.358,0.248,0.224,0.336,0.394,0.421,0.33,0.467,0.455,0.342,0.352,0.339,0.336,0.333,0.339,0.342,0.336,0.318,0.336,0.367,0.452,3295,Stability,CSN4_MOUSE,Medium,Eukaryote
+CUE1_YEAST_Tsuboyama_2023_2MYX,0.297,0.323,0.316,0.316,0.342,0.329,0.158,0.278,0.361,0.361,0.304,0.278,0.209,0.146,0.19,0.259,0.291,0.304,0.304,0.234,0.133,0.146,0.171,0.108,0.101,0.114,0.184,0.184,0.31,0.259,0.278,0.209,0.165,0.108,0.12,0.177,0.297,0.291,0.316,0.329,0.335,0.342,0.196,0.127,0.259,0.152,0.329,0.329,0.354,0.348,0.278,0.342,0.31,0.342,0.278,0.297,0.316,0.285,0.31,0.31,0.361,0.304,1580,Stability,CUE1_YEAST,Medium,Eukaryote
+D7PM05_CLYGR_Somermeyer_2022,0.25,0.361,0.315,0.293,0.336,0.333,0.098,0.222,0.369,0.381,0.237,0.097,0.1,0.097,0.092,0.088,0.085,0.097,0.103,0.233,0.079,0.061,0.092,0.084,0.056,0.081,0.08,0.108,0.147,0.337,0.32,0.314,0.056,0.079,0.098,0.104,0.293,0.295,0.29,0.334,0.336,0.33,0.135,0.131,0.137,0.138,0.257,0.249,0.305,0.23,0.274,0.281,0.286,0.283,0.277,0.283,0.269,0.286,0.268,0.285,0.198,0.136,24515,Activity,D7PM05_CLYGR,Low,Eukaryote
+DLG4_HUMAN_Faure_2021,0.322,0.319,0.328,0.322,0.338,0.34,0.314,0.305,0.282,0.292,0.268,0.321,0.318,0.378,0.423,0.37,0.315,0.318,0.279,0.348,0.284,0.289,0.278,0.277,0.287,0.292,0.291,0.297,0.265,0.321,0.274,0.285,0.272,0.287,0.308,0.287,0.331,0.36,0.351,0.345,0.354,0.35,0.285,0.209,0.249,0.287,0.385,0.285,0.341,0.216,0.284,0.256,0.282,0.291,0.282,0.302,0.262,0.282,0.277,0.282,0.317,0.398,6976,OrganismalFitness,DLG4_HUMAN,Low,Human
+DLG4_RAT_McLaughlin_2012,0.158,0.215,0.158,0.139,0.165,0.171,0.184,0.133,0.127,0.165,0.184,0.133,0.127,0.108,0.139,0.177,0.101,0.127,0.133,0.101,0.108,0.108,0.101,0.114,0.089,0.127,0.095,0.108,0.127,0.133,0.203,0.234,0.12,0.114,0.133,0.108,0.133,0.133,0.12,0.133,0.133,0.158,0.158,0.089,0.146,0.133,0.19,0.127,0.108,0.133,0.108,0.076,0.063,0.089,0.089,0.082,0.076,0.095,0.095,0.082,0.095,0.133,1576,Binding,DLG4_RAT,Low,Eukaryote
+DN7A_SACS2_Tsuboyama_2023_1JIC,0.168,0.149,0.139,0.119,0.109,0.129,0.089,0.109,0.228,0.287,0.119,0.089,0.119,0.089,0.178,0.198,0.248,0.267,0.218,0.228,0.079,0.089,0.069,0.109,0.119,0.149,0.079,0.119,0.218,0.208,0.129,0.059,0.168,0.099,0.099,0.079,0.168,0.168,0.158,0.149,0.158,0.119,0.139,0.089,0.188,0.168,0.287,0.317,0.267,0.307,0.416,0.366,0.347,0.277,0.307,0.297,0.297,0.228,0.327,0.356,0.218,0.277,1008,Stability,DN7A_SACS2,Medium,Prokaryote
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,0.3,0.282,0.295,0.295,0.313,0.313,0.322,0.264,0.3,0.308,0.33,0.322,0.374,0.388,0.485,0.432,0.449,0.392,0.401,0.317,0.308,0.317,0.357,0.37,0.313,0.273,0.3,0.308,0.291,0.264,0.189,0.198,0.242,0.282,0.26,0.361,0.304,0.308,0.317,0.308,0.3,0.313,0.383,0.256,0.295,0.352,0.427,0.348,0.454,0.463,0.467,0.507,0.511,0.485,0.485,0.489,0.48,0.515,0.458,0.502,0.304,0.383,2264,Stability,DNJA1_HUMAN,High,Human
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,0.342,0.325,0.329,0.295,0.288,0.318,0.099,0.25,0.223,0.192,0.305,0.185,0.233,0.027,0.192,0.233,0.305,0.305,0.274,0.332,0.24,0.284,0.291,0.26,0.103,0.223,0.134,0.195,0.312,0.164,0.318,0.291,0.247,0.075,0.075,0.048,0.325,0.356,0.329,0.322,0.315,0.322,0.134,0.058,0.325,0.192,0.384,0.377,0.363,0.445,0.349,0.363,0.366,0.39,0.353,0.38,0.366,0.363,0.339,0.373,0.404,0.37,2915,Stability,DOCK1_MOUSE,High,Eukaryote
+DYR_ECOLI_Nguyen_2023,0.165,0.101,0.126,0.128,0.108,0.093,0.099,0.104,0.2,0.15,0.154,0.205,0.174,0.126,0.183,0.165,0.174,0.161,0.17,0.185,0.154,0.152,0.143,0.148,0.187,0.079,0.079,0.123,0.088,0.088,0.141,0.117,0.024,0.099,0.064,0.073,0.134,0.084,0.075,0.088,0.075,0.086,0.161,0.106,0.141,0.167,0.148,0.187,0.198,0.159,0.185,0.174,0.172,0.163,0.183,0.183,0.181,0.172,0.2,0.174,0.203,0.192,2916,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+DYR_ECOLI_Thompson_2019,0.131,0.127,0.101,0.101,0.118,0.105,0.127,0.127,0.11,0.131,0.152,0.143,0.143,0.105,0.139,0.16,0.165,0.169,0.143,0.152,0.118,0.068,0.127,0.084,0.11,0.152,0.169,0.127,0.131,0.114,0.122,0.089,0.038,0.084,0.118,0.194,0.127,0.143,0.169,0.114,0.118,0.118,0.131,0.089,0.156,0.152,0.148,0.127,0.135,0.122,0.143,0.173,0.152,0.165,0.165,0.194,0.173,0.173,0.186,0.177,0.143,0.194,2363,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+ENV_HV1B9_DuenasDecamp_2016,0.4,0.35,0.3,0.3,0.35,0.35,0.025,0.375,0.375,0.35,0.3,0.4,0.35,0.025,0.025,0.05,0.075,0.05,0.175,0.4,0.275,0.35,0.375,0.35,0.25,0.4,0.3,0.375,0.4,0.3,0.425,0.375,0.325,0.325,0.375,0.425,0.35,0.375,0.375,0.375,0.4,0.4,0.3,0.0,0.35,0.325,0.325,0.3,0.175,0.15,0.125,0.275,0.225,0.15,0.275,0.2,0.175,0.225,0.175,0.175,0.1,0.125,375,OrganismalFitness,ENV_HV1B9,Medium,Virus
+ENV_HV1BR_Haddox_2016,0.206,0.211,0.227,0.234,0.231,0.228,0.084,0.214,0.203,0.206,0.155,0.193,0.191,0.099,0.103,0.099,0.108,0.113,0.13,0.239,0.198,0.212,0.233,0.239,0.189,0.218,0.233,0.211,0.237,0.171,0.198,0.172,0.128,0.2,0.215,0.225,0.211,0.213,0.221,0.23,0.232,0.235,0.165,0.076,0.196,0.175,0.139,0.193,0.104,0.113,0.123,0.134,0.145,0.144,0.15,0.141,0.134,0.133,0.135,0.145,0.129,0.113,12863,OrganismalFitness,ENV_HV1BR,Medium,Virus
+ENVZ_ECOLI_Ghose_2023,0.133,0.115,0.124,0.106,0.115,0.097,0.097,0.115,0.124,0.115,0.08,0.088,0.071,0.097,0.124,0.08,0.071,0.088,0.08,0.08,0.097,0.053,0.106,0.062,0.133,0.062,0.088,0.08,0.062,0.088,0.08,0.115,0.133,0.106,0.097,0.062,0.106,0.097,0.088,0.115,0.106,0.106,0.124,0.088,0.115,0.053,0.062,0.097,0.071,0.08,0.088,0.088,0.088,0.08,0.08,0.062,0.097,0.08,0.08,0.08,0.071,0.088,1121,Activity,ENVZ_ECOLI,High,Prokaryote
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,0.321,0.327,0.383,0.367,0.388,0.388,0.005,0.296,0.372,0.372,0.311,0.327,0.301,0.015,0.327,0.347,0.372,0.388,0.347,0.357,0.347,0.347,0.306,0.321,0.327,0.316,0.372,0.352,0.337,0.281,0.311,0.235,0.235,0.357,0.301,0.357,0.398,0.352,0.383,0.403,0.362,0.378,0.26,0.112,0.327,0.327,0.444,0.378,0.423,0.408,0.383,0.398,0.388,0.429,0.403,0.403,0.388,0.398,0.408,0.393,0.423,0.429,1960,Stability,EPHB2_HUMAN,High,Human
+ERBB2_HUMAN_Elazar_2016,0.121,0.242,0.121,0.152,0.212,0.212,0.152,0.182,0.242,0.303,0.121,0.121,0.182,0.182,0.182,0.152,0.242,0.182,0.212,0.091,0.152,0.182,0.182,0.182,0.152,0.182,0.212,0.273,0.182,0.152,0.091,0.121,0.182,0.152,0.091,0.182,0.182,0.121,0.182,0.212,0.212,0.182,0.152,0.182,0.212,0.121,0.212,0.182,0.03,0.091,0.152,0.152,0.182,0.152,0.121,0.121,0.121,0.182,0.212,0.121,0.121,0.152,326,Expression,ERBB2_HUMAN,Low,Human
+ESTA_BACSU_Nutschel_2020,0.234,0.271,0.275,0.266,0.252,0.261,0.22,0.188,0.202,0.284,0.229,0.28,0.28,0.229,0.243,0.22,0.225,0.197,0.225,0.234,0.257,0.261,0.284,0.22,0.307,0.252,0.294,0.312,0.257,0.206,0.179,0.193,0.087,0.243,0.229,0.165,0.261,0.248,0.161,0.289,0.284,0.243,0.294,0.206,0.284,0.275,0.427,0.353,0.298,0.28,0.284,0.234,0.275,0.289,0.312,0.294,0.28,0.248,0.271,0.289,0.225,0.271,2172,Stability,ESTA_BACSU,High,Prokaryote
+F7YBW8_MESOW_Aakre_2015,0.087,0.609,0.564,0.672,0.626,0.642,0.095,0.448,0.492,0.523,0.62,0.503,0.528,0.063,0.067,0.112,0.48,0.399,0.618,0.567,0.065,0.028,0.062,0.093,0.103,0.253,0.063,0.426,0.625,0.554,0.699,0.659,0.068,0.062,0.053,0.69,0.092,0.072,0.647,0.517,0.493,0.68,0.1,0.084,0.316,0.109,0.096,0.383,0.136,0.08,0.624,0.657,0.601,0.677,0.649,0.667,0.601,0.614,0.639,0.647,0.323,0.028,9192,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+FECA_ECOLI_Tsuboyama_2023_2D1U,0.212,0.243,0.233,0.217,0.249,0.233,0.111,0.116,0.228,0.275,0.275,0.228,0.243,0.116,0.286,0.328,0.296,0.249,0.254,0.222,0.185,0.175,0.169,0.169,0.159,0.185,0.249,0.238,0.19,0.201,0.185,0.159,0.164,0.122,0.153,0.175,0.217,0.18,0.212,0.212,0.206,0.217,0.291,0.138,0.28,0.286,0.312,0.27,0.265,0.286,0.254,0.265,0.291,0.265,0.291,0.249,0.249,0.275,0.27,0.249,0.296,0.291,1886,Stability,FECA_ECOLI,High,Prokaryote
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,0.153,0.177,0.169,0.177,0.21,0.202,0.04,0.097,0.089,0.089,0.073,0.081,0.065,0.056,0.056,0.073,0.105,0.089,0.089,0.073,0.065,0.097,0.024,0.177,0.065,0.185,0.121,0.137,0.089,0.145,0.081,0.081,0.081,0.048,0.097,0.145,0.169,0.153,0.169,0.185,0.177,0.169,0.056,0.073,0.121,0.081,0.218,0.258,0.282,0.234,0.218,0.274,0.234,0.177,0.226,0.25,0.202,0.202,0.194,0.234,0.218,0.177,1237,Stability,FKBP3_HUMAN,Medium,Human
+GAL4_YEAST_Kitzman_2015,0.075,0.1,0.25,0.175,0.183,0.192,0.092,0.1,0.25,0.208,0.2,0.158,0.142,0.108,0.167,0.158,0.242,0.258,0.242,0.158,0.133,0.092,0.15,0.15,0.142,0.125,0.15,0.158,0.267,0.183,0.242,0.267,0.158,0.092,0.15,0.1,0.167,0.15,0.167,0.183,0.192,0.217,0.117,0.192,0.208,0.125,0.075,0.2,0.075,0.083,0.267,0.2,0.275,0.242,0.217,0.217,0.275,0.217,0.242,0.25,0.175,0.183,1195,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+GCN4_YEAST_Staller_2018,0.242,0.223,0.201,0.231,0.227,0.22,0.091,0.117,0.227,0.22,0.197,0.178,0.182,0.174,0.14,0.186,0.227,0.22,0.223,0.22,0.064,0.064,0.076,0.064,0.053,0.042,0.053,0.061,0.144,0.22,0.208,0.212,0.095,0.053,0.049,0.223,0.231,0.231,0.239,0.223,0.22,0.246,0.148,0.14,0.152,0.186,0.17,0.216,0.152,0.148,0.193,0.182,0.186,0.186,0.197,0.208,0.201,0.197,0.197,0.197,0.136,0.125,2638,Binding,GCN4_YEAST,Low,Eukaryote
+GDIA_HUMAN_Silverstein_2021,0.121,0.112,0.129,0.138,0.138,0.147,0.155,0.129,0.121,0.129,0.138,0.112,0.121,0.069,0.086,0.121,0.138,0.138,0.138,0.138,0.095,0.164,0.155,0.19,0.121,0.164,0.138,0.129,0.138,0.147,0.172,0.181,0.112,0.121,0.103,0.112,0.138,0.121,0.129,0.138,0.121,0.147,0.112,0.052,0.155,0.103,0.172,0.164,0.147,0.078,0.138,0.147,0.147,0.164,0.164,0.181,0.155,0.138,0.164,0.155,0.138,0.121,1154,OrganismalFitness,GDIA_HUMAN,Low,Human
+GFP_AEQVI_Sarkisyan_2016,0.277,0.275,0.28,0.279,0.287,0.287,0.061,0.256,0.297,0.298,0.188,0.083,0.086,0.085,0.092,0.079,0.078,0.09,0.112,0.241,0.074,0.073,0.089,0.08,0.066,0.098,0.118,0.259,0.268,0.296,0.238,0.239,0.087,0.069,0.093,0.249,0.285,0.283,0.279,0.288,0.289,0.304,0.128,0.11,0.134,0.134,0.279,0.28,0.308,0.254,0.235,0.239,0.25,0.253,0.243,0.251,0.241,0.245,0.233,0.247,0.262,0.188,51714,Activity,GFP_AEQVI,Low,Eukaryote
+GLPA_HUMAN_Elazar_2016,0.32,0.2,0.36,0.36,0.36,0.36,0.28,0.28,0.2,0.2,0.2,0.4,0.32,0.28,0.28,0.32,0.36,0.32,0.2,0.2,0.32,0.28,0.36,0.32,0.24,0.36,0.32,0.4,0.24,0.2,0.28,0.24,0.24,0.28,0.32,0.36,0.4,0.4,0.44,0.36,0.36,0.36,0.28,0.24,0.4,0.32,0.28,0.2,0.32,0.16,0.48,0.44,0.32,0.36,0.4,0.28,0.32,0.48,0.36,0.4,0.32,0.4,245,Expression,GLPA_HUMAN,Low,Human
+GRB2_HUMAN_Faure_2021,0.303,0.36,0.354,0.363,0.376,0.384,0.317,0.288,0.38,0.339,0.365,0.305,0.345,0.321,0.384,0.425,0.442,0.346,0.393,0.366,0.371,0.357,0.328,0.31,0.343,0.347,0.287,0.381,0.283,0.345,0.281,0.266,0.343,0.377,0.362,0.245,0.381,0.376,0.291,0.399,0.402,0.375,0.398,0.196,0.35,0.362,0.46,0.337,0.491,0.345,0.416,0.437,0.415,0.427,0.438,0.434,0.417,0.437,0.412,0.439,0.359,0.402,63366,OrganismalFitness,GRB2_HUMAN,Medium,Human
+HCP_LAMBD_Tsuboyama_2023_2L6Q,0.269,0.26,0.192,0.212,0.202,0.221,0.192,0.192,0.125,0.192,0.346,0.269,0.317,0.221,0.269,0.298,0.337,0.231,0.221,0.106,0.125,0.183,0.212,0.26,0.231,0.26,0.221,0.212,0.26,0.212,0.231,0.163,0.173,0.106,0.163,0.24,0.269,0.279,0.26,0.231,0.25,0.25,0.26,0.173,0.298,0.212,0.25,0.337,0.269,0.202,0.298,0.317,0.356,0.317,0.356,0.337,0.337,0.327,0.317,0.356,0.423,0.279,1040,Stability,HCP_LAMBD,Medium,Virus
+HECD1_HUMAN_Tsuboyama_2023_3DKM,0.581,0.547,0.619,0.633,0.649,0.651,0.222,0.469,0.606,0.623,0.599,0.174,0.229,0.143,0.152,0.608,0.617,0.605,0.614,0.581,0.191,0.503,0.538,0.538,0.501,0.549,0.544,0.565,0.533,0.596,0.642,0.581,0.109,0.199,0.168,0.333,0.606,0.574,0.606,0.63,0.617,0.623,0.243,0.231,0.608,0.343,0.46,0.592,0.474,0.345,0.639,0.606,0.628,0.615,0.644,0.653,0.63,0.639,0.646,0.644,0.673,0.22,5586,Stability,HECD1_HUMAN,Medium,Human
+HEM3_HUMAN_Loggerenberg_2023,0.165,0.174,0.172,0.172,0.174,0.174,0.121,0.063,0.181,0.178,0.155,0.167,0.146,0.09,0.172,0.183,0.155,0.169,0.163,0.072,0.174,0.174,0.174,0.167,0.16,0.172,0.163,0.16,0.176,0.183,0.158,0.156,0.109,0.172,0.193,0.165,0.163,0.19,0.158,0.185,0.181,0.172,0.121,0.098,0.165,0.158,0.163,0.188,0.139,0.144,0.167,0.172,0.169,0.167,0.165,0.17,0.172,0.172,0.17,0.17,0.167,0.178,5689,Activity,HEM3_HUMAN,Medium,Human
+HIS7_YEAST_Pokusaeva_2019,0.165,0.202,0.209,0.209,0.203,0.199,0.133,0.086,0.188,0.191,0.204,0.174,0.185,0.1,0.13,0.179,0.2,0.202,0.194,0.17,0.175,0.198,0.225,0.211,0.203,0.213,0.191,0.194,0.191,0.177,0.199,0.202,0.088,0.211,0.212,0.197,0.19,0.188,0.186,0.209,0.205,0.202,0.148,0.132,0.197,0.109,0.182,0.209,0.188,0.154,0.196,0.178,0.19,0.193,0.194,0.191,0.193,0.196,0.196,0.193,0.203,0.107,496137,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+HMDH_HUMAN_Jiang_2019,0.135,0.122,0.124,0.12,0.128,0.122,0.133,0.193,0.121,0.122,0.126,0.13,0.123,0.122,0.123,0.136,0.149,0.13,0.133,0.111,0.137,0.108,0.116,0.125,0.143,0.106,0.108,0.108,0.124,0.12,0.127,0.127,0.114,0.124,0.111,0.12,0.125,0.125,0.122,0.116,0.12,0.12,0.124,0.117,0.161,0.16,0.132,0.129,0.105,0.126,0.145,0.129,0.14,0.149,0.144,0.15,0.143,0.151,0.14,0.139,0.155,0.148,16853,OrganismalFitness,HMDH_HUMAN,Low,Human
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,0.102,0.142,0.142,0.155,0.137,0.146,0.119,0.168,0.133,0.137,0.137,0.137,0.146,0.08,0.066,0.115,0.124,0.15,0.111,0.088,0.164,0.168,0.173,0.159,0.15,0.159,0.15,0.173,0.164,0.159,0.19,0.19,0.084,0.159,0.168,0.155,0.119,0.137,0.133,0.155,0.15,0.155,0.106,0.075,0.159,0.142,0.119,0.155,0.115,0.111,0.142,0.137,0.119,0.111,0.133,0.133,0.137,0.155,0.142,0.133,0.142,0.159,2252,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Flynn_2019,0.125,0.092,0.108,0.109,0.109,0.111,0.106,0.129,0.11,0.108,0.115,0.111,0.107,0.094,0.114,0.111,0.156,0.15,0.139,0.126,0.114,0.108,0.116,0.11,0.128,0.128,0.108,0.119,0.105,0.097,0.126,0.115,0.095,0.12,0.129,0.12,0.119,0.132,0.126,0.111,0.117,0.109,0.108,0.095,0.115,0.119,0.117,0.112,0.086,0.12,0.15,0.14,0.146,0.145,0.146,0.149,0.143,0.138,0.135,0.15,0.109,0.138,13294,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HSP82_YEAST_Mishra_2016,0.159,0.182,0.178,0.194,0.185,0.187,0.129,0.127,0.182,0.199,0.18,0.194,0.192,0.074,0.088,0.111,0.162,0.159,0.132,0.155,0.215,0.192,0.206,0.182,0.21,0.196,0.199,0.196,0.187,0.18,0.196,0.206,0.118,0.182,0.176,0.192,0.189,0.178,0.173,0.182,0.189,0.189,0.189,0.051,0.173,0.199,0.104,0.176,0.079,0.127,0.129,0.125,0.125,0.143,0.176,0.15,0.141,0.141,0.148,0.15,0.171,0.164,4323,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+HXK4_HUMAN_Gersing_2022_activity,0.232,0.161,0.149,0.151,0.156,0.156,0.098,0.127,0.149,0.148,0.128,0.116,0.135,0.117,0.127,0.172,0.174,0.151,0.138,0.12,0.14,0.117,0.141,0.134,0.14,0.14,0.153,0.138,0.125,0.109,0.133,0.119,0.085,0.144,0.139,0.148,0.165,0.133,0.155,0.162,0.14,0.158,0.135,0.107,0.14,0.144,0.179,0.16,0.159,0.142,0.162,0.176,0.162,0.173,0.165,0.176,0.182,0.163,0.166,0.17,0.17,0.162,8570,OrganismalFitness,HXK4_HUMAN,Medium,Human
+HXK4_HUMAN_Gersing_2023_abundance,0.152,0.16,0.162,0.173,0.154,0.16,0.086,0.16,0.156,0.179,0.162,0.173,0.192,0.09,0.113,0.148,0.173,0.167,0.174,0.185,0.165,0.186,0.186,0.19,0.173,0.185,0.194,0.183,0.17,0.187,0.165,0.165,0.111,0.164,0.171,0.193,0.167,0.179,0.187,0.158,0.162,0.165,0.123,0.106,0.164,0.163,0.158,0.154,0.163,0.129,0.162,0.161,0.162,0.148,0.167,0.167,0.175,0.163,0.165,0.165,0.175,0.155,8396,Expression,HXK4_HUMAN,Medium,Human
+I6TAH8_I68A0_Doud_2015,0.277,0.262,0.234,0.233,0.285,0.282,0.083,0.225,0.247,0.275,0.092,0.086,0.087,0.082,0.077,0.088,0.1,0.091,0.124,0.216,0.23,0.253,0.261,0.263,0.077,0.094,0.119,0.083,0.238,0.284,0.185,0.188,0.143,0.261,0.251,0.252,0.27,0.266,0.26,0.283,0.28,0.268,0.08,0.088,0.092,0.086,0.186,0.185,0.186,0.146,0.103,0.136,0.13,0.136,0.121,0.117,0.105,0.109,0.1,0.12,0.124,0.11,9462,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+IF1_ECOLI_Kelsic_2016,0.13,0.217,0.196,0.232,0.21,0.21,0.196,0.159,0.087,0.087,0.254,0.181,0.196,0.101,0.188,0.261,0.239,0.196,0.203,0.196,0.188,0.232,0.203,0.203,0.225,0.217,0.261,0.225,0.152,0.304,0.203,0.159,0.145,0.21,0.188,0.21,0.181,0.152,0.203,0.203,0.196,0.203,0.145,0.13,0.261,0.232,0.109,0.232,0.217,0.174,0.217,0.268,0.239,0.239,0.232,0.246,0.232,0.21,0.254,0.239,0.188,0.188,1367,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+ILF3_HUMAN_Tsuboyama_2023_2L33,0.075,0.075,0.195,0.203,0.195,0.211,0.06,0.12,0.083,0.12,0.203,0.12,0.113,0.053,0.053,0.075,0.083,0.165,0.188,0.188,0.203,0.173,0.173,0.188,0.271,0.211,0.248,0.18,0.165,0.218,0.12,0.075,0.203,0.083,0.226,0.233,0.06,0.158,0.18,0.105,0.128,0.165,0.09,0.06,0.158,0.128,0.09,0.15,0.083,0.15,0.188,0.105,0.158,0.15,0.165,0.165,0.158,0.143,0.15,0.158,0.218,0.09,1329,Stability,ILF3_HUMAN,High,Human
+ISDH_STAAW_Tsuboyama_2023_2LHR,0.226,0.179,0.236,0.236,0.221,0.236,0.313,0.128,0.226,0.282,0.246,0.323,0.344,0.344,0.369,0.451,0.374,0.421,0.395,0.19,0.318,0.2,0.195,0.215,0.349,0.354,0.369,0.333,0.251,0.205,0.179,0.123,0.159,0.231,0.241,0.287,0.236,0.215,0.267,0.251,0.231,0.251,0.313,0.272,0.349,0.338,0.451,0.467,0.333,0.313,0.251,0.292,0.297,0.308,0.292,0.287,0.287,0.297,0.272,0.292,0.415,0.308,1944,Stability,ISDH_STAAW,High,Prokaryote
+KCNE1_HUMAN_Muhammad_2023_expression,0.043,0.03,0.021,0.017,0.047,0.047,0.077,0.141,0.03,0.034,0.085,0.09,0.073,0.073,0.077,0.047,0.073,0.043,0.017,0.026,0.073,0.09,0.081,0.12,0.085,0.162,0.068,0.034,0.034,0.034,0.038,0.038,0.111,0.064,0.107,0.094,0.064,0.073,0.073,0.051,0.047,0.056,0.081,0.098,0.013,0.081,0.068,0.021,0.085,0.132,0.068,0.051,0.064,0.068,0.056,0.064,0.073,0.06,0.06,0.06,0.051,0.085,2339,Expression,KCNE1_HUMAN,Medium,Human
+KCNE1_HUMAN_Muhammad_2023_function,0.194,0.224,0.203,0.207,0.22,0.22,0.19,0.155,0.22,0.237,0.194,0.233,0.25,0.19,0.194,0.246,0.207,0.22,0.172,0.168,0.198,0.177,0.185,0.177,0.207,0.19,0.25,0.228,0.19,0.194,0.216,0.185,0.034,0.203,0.241,0.216,0.19,0.203,0.198,0.224,0.22,0.211,0.185,0.159,0.203,0.185,0.224,0.181,0.172,0.134,0.198,0.22,0.216,0.177,0.228,0.228,0.233,0.198,0.211,0.207,0.297,0.177,2315,Activity,KCNE1_HUMAN,Medium,Human
+KCNH2_HUMAN_Kozek_2020,0.0,0.05,0.0,0.0,0.0,0.0,0.0,0.1,0.0,0.0,0.05,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.2,0.05,0.1,0.1,0.1,0.1,0.05,0.05,0.05,0.05,0.05,0.2,0.35,0.05,0.05,0.05,0.1,0.0,0.0,0.05,0.0,0.0,0.0,0.1,0.0,0.05,0.05,0.0,0.0,0.05,0.05,0.15,0.1,0.1,0.15,0.15,0.1,0.15,0.1,0.15,0.15,0.0,0.1,200,Activity,KCNH2_HUMAN,Medium,Human
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,0.146,0.156,0.136,0.155,0.162,0.158,0.108,0.125,0.139,0.136,0.166,0.154,0.156,0.106,0.149,0.166,0.168,0.164,0.161,0.129,0.162,0.161,0.149,0.128,0.161,0.158,0.154,0.155,0.148,0.149,0.141,0.131,0.102,0.159,0.165,0.138,0.156,0.158,0.145,0.154,0.158,0.146,0.129,0.113,0.136,0.155,0.142,0.143,0.149,0.113,0.158,0.162,0.166,0.158,0.162,0.152,0.148,0.171,0.161,0.164,0.136,0.159,6963,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,0.149,0.108,0.136,0.127,0.12,0.132,0.104,0.072,0.104,0.107,0.152,0.146,0.117,0.152,0.188,0.208,0.185,0.159,0.189,0.077,0.175,0.092,0.075,0.082,0.185,0.094,0.085,0.139,0.078,0.1,0.085,0.091,0.077,0.162,0.072,0.085,0.132,0.095,0.078,0.13,0.105,0.1,0.175,0.127,0.098,0.189,0.153,0.087,0.129,0.126,0.175,0.156,0.182,0.156,0.184,0.162,0.171,0.169,0.175,0.169,0.169,0.192,6917,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+KKA2_KLEPN_Melnikov_2014,0.159,0.22,0.216,0.24,0.222,0.224,0.107,0.169,0.208,0.238,0.212,0.19,0.226,0.113,0.123,0.169,0.226,0.24,0.238,0.2,0.151,0.188,0.22,0.216,0.165,0.204,0.228,0.188,0.242,0.218,0.169,0.149,0.077,0.125,0.167,0.194,0.185,0.19,0.21,0.232,0.23,0.236,0.123,0.101,0.202,0.143,0.22,0.23,0.228,0.171,0.224,0.214,0.228,0.216,0.216,0.222,0.22,0.208,0.216,0.226,0.236,0.181,4960,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+LGK_LIPST_Klesmith_2015,0.217,0.432,0.426,0.427,0.446,0.447,0.133,0.347,0.272,0.406,0.251,0.351,0.388,0.119,0.174,0.227,0.324,0.404,0.435,0.326,0.161,0.241,0.298,0.319,0.185,0.328,0.406,0.305,0.445,0.493,0.371,0.319,0.139,0.171,0.245,0.47,0.214,0.246,0.444,0.436,0.447,0.47,0.105,0.085,0.283,0.175,0.231,0.361,0.376,0.158,0.305,0.347,0.337,0.354,0.337,0.347,0.323,0.335,0.323,0.352,0.269,0.19,7890,Activity,LGK_LIPST,Medium,Eukaryote
+LYAM1_HUMAN_Elazar_2016,0.139,0.25,0.25,0.25,0.222,0.25,0.278,0.111,0.25,0.25,0.25,0.222,0.222,0.25,0.278,0.222,0.194,0.167,0.278,0.25,0.333,0.333,0.278,0.25,0.222,0.25,0.194,0.25,0.083,0.222,0.194,0.056,0.194,0.278,0.333,0.222,0.194,0.333,0.222,0.278,0.278,0.25,0.25,0.278,0.194,0.194,0.278,0.222,0.194,0.194,0.194,0.306,0.333,0.25,0.25,0.25,0.278,0.278,0.139,0.278,0.306,0.306,359,Expression,LYAM1_HUMAN,Medium,Human
+MAFG_MOUSE_Tsuboyama_2023_1K1V,0.315,0.315,0.35,0.35,0.343,0.336,0.343,0.245,0.287,0.28,0.294,0.322,0.406,0.385,0.378,0.378,0.413,0.455,0.273,0.231,0.224,0.217,0.259,0.301,0.28,0.273,0.294,0.273,0.273,0.252,0.21,0.217,0.161,0.231,0.301,0.231,0.301,0.315,0.294,0.343,0.315,0.322,0.385,0.259,0.35,0.357,0.413,0.315,0.364,0.385,0.413,0.441,0.448,0.469,0.427,0.476,0.462,0.462,0.434,0.462,0.462,0.483,1429,Stability,MAFG_MOUSE,Medium,Eukaryote
+MBD11_ARATH_Tsuboyama_2023_6ACV,0.288,0.382,0.33,0.33,0.335,0.335,0.057,0.151,0.274,0.302,0.264,0.255,0.226,0.028,0.028,0.264,0.259,0.264,0.377,0.241,0.024,0.061,0.193,0.179,0.094,0.264,0.274,0.236,0.288,0.274,0.325,0.297,0.316,0.028,0.042,0.047,0.269,0.264,0.283,0.316,0.33,0.316,0.09,0.057,0.307,0.25,0.288,0.335,0.288,0.288,0.34,0.387,0.344,0.344,0.316,0.335,0.33,0.311,0.274,0.33,0.382,0.264,2116,Stability,MBD11_ARATH,Medium,Eukaryote
+MET_HUMAN_Estevam_2023,0.148,0.185,0.163,0.174,0.163,0.178,0.163,0.174,0.156,0.167,0.191,0.178,0.174,0.143,0.148,0.172,0.183,0.172,0.187,0.163,0.185,0.187,0.159,0.193,0.193,0.181,0.183,0.191,0.2,0.193,0.17,0.172,0.12,0.161,0.178,0.196,0.167,0.174,0.198,0.176,0.17,0.176,0.17,0.139,0.187,0.178,0.15,0.183,0.169,0.119,0.161,0.163,0.165,0.167,0.156,0.176,0.172,0.169,0.187,0.159,0.185,0.167,5393,Activity,MET_HUMAN,Medium,Human
+MK01_HUMAN_Brenan_2016,0.091,0.134,0.134,0.144,0.138,0.122,0.084,0.075,0.116,0.095,0.076,0.119,0.107,0.104,0.103,0.09,0.072,0.085,0.098,0.097,0.101,0.094,0.082,0.09,0.095,0.109,0.103,0.109,0.101,0.122,0.082,0.095,0.109,0.087,0.101,0.09,0.078,0.098,0.088,0.125,0.125,0.106,0.097,0.095,0.113,0.12,0.06,0.109,0.07,0.097,0.084,0.104,0.095,0.088,0.085,0.079,0.09,0.081,0.081,0.088,0.098,0.104,6809,OrganismalFitness,MK01_HUMAN,Medium,Human
+MLAC_ECOLI_MacRae_2023,0.095,0.145,0.147,0.137,0.132,0.142,0.115,0.14,0.162,0.15,0.152,0.155,0.177,0.12,0.112,0.112,0.137,0.152,0.18,0.167,0.142,0.14,0.145,0.15,0.147,0.15,0.177,0.16,0.145,0.157,0.12,0.14,0.15,0.112,0.117,0.135,0.09,0.122,0.115,0.127,0.142,0.137,0.1,0.087,0.14,0.11,0.057,0.115,0.115,0.067,0.12,0.112,0.117,0.117,0.13,0.112,0.112,0.112,0.115,0.11,0.125,0.097,4007,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+MSH2_HUMAN_Jia_2020,0.133,0.162,0.143,0.147,0.147,0.149,0.1,0.1,0.156,0.153,0.146,0.164,0.164,0.105,0.131,0.153,0.152,0.159,0.146,0.152,0.114,0.127,0.137,0.146,0.138,0.144,0.146,0.134,0.162,0.143,0.147,0.135,0.132,0.128,0.153,0.155,0.134,0.153,0.152,0.159,0.167,0.157,0.122,0.093,0.156,0.134,0.13,0.152,0.101,0.118,0.149,0.144,0.15,0.142,0.148,0.159,0.154,0.153,0.154,0.158,0.154,0.134,16749,OrganismalFitness,MSH2_HUMAN,Medium,Human
+MTH3_HAEAE_RockahShmuel_2015,0.144,0.316,0.337,0.326,0.321,0.342,0.134,0.337,0.294,0.299,0.203,0.289,0.316,0.102,0.128,0.155,0.176,0.182,0.23,0.283,0.139,0.16,0.235,0.299,0.203,0.321,0.326,0.283,0.305,0.321,0.332,0.305,0.193,0.171,0.182,0.299,0.16,0.16,0.278,0.342,0.342,0.337,0.128,0.053,0.155,0.123,0.166,0.235,0.262,0.134,0.214,0.225,0.225,0.198,0.203,0.209,0.219,0.219,0.209,0.214,0.209,0.107,1777,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+MTHR_HUMAN_Weile_2021,0.122,0.123,0.113,0.123,0.113,0.108,0.096,0.121,0.133,0.138,0.123,0.126,0.126,0.118,0.124,0.135,0.116,0.132,0.138,0.122,0.121,0.118,0.126,0.149,0.13,0.12,0.131,0.119,0.162,0.137,0.139,0.148,0.075,0.127,0.126,0.127,0.11,0.118,0.125,0.109,0.116,0.123,0.122,0.11,0.123,0.126,0.112,0.122,0.127,0.118,0.12,0.115,0.121,0.118,0.117,0.118,0.12,0.118,0.126,0.117,0.132,0.116,12464,OrganismalFitness,MTHR_HUMAN,Low,Human
+MYO3_YEAST_Tsuboyama_2023_2BTT,0.227,0.206,0.276,0.212,0.236,0.245,0.073,0.061,0.297,0.285,0.294,0.261,0.264,0.109,0.23,0.355,0.421,0.318,0.179,0.233,0.139,0.091,0.164,0.155,0.188,0.261,0.255,0.227,0.291,0.209,0.209,0.167,0.17,0.212,0.139,0.127,0.291,0.221,0.2,0.276,0.255,0.261,0.194,0.073,0.27,0.2,0.248,0.315,0.339,0.27,0.303,0.306,0.297,0.312,0.303,0.306,0.291,0.303,0.3,0.303,0.297,0.348,3297,Stability,MYO3_YEAST,High,Eukaryote
+NCAP_I34A1_Doud_2015,0.258,0.246,0.246,0.247,0.259,0.263,0.111,0.22,0.266,0.261,0.106,0.099,0.094,0.101,0.093,0.096,0.099,0.102,0.129,0.182,0.235,0.264,0.264,0.263,0.098,0.111,0.109,0.107,0.242,0.268,0.199,0.194,0.148,0.249,0.252,0.285,0.26,0.271,0.27,0.28,0.282,0.278,0.094,0.099,0.106,0.093,0.161,0.166,0.178,0.144,0.103,0.118,0.127,0.122,0.114,0.114,0.115,0.124,0.102,0.115,0.139,0.13,9462,OrganismalFitness,NCAP_I34A1,Medium,Virus
+NKX31_HUMAN_Tsuboyama_2023_2L9R,0.434,0.426,0.434,0.454,0.442,0.474,0.353,0.361,0.378,0.406,0.305,0.398,0.369,0.402,0.394,0.478,0.51,0.486,0.458,0.406,0.386,0.365,0.329,0.329,0.378,0.317,0.357,0.301,0.321,0.373,0.349,0.369,0.305,0.39,0.386,0.337,0.43,0.402,0.402,0.442,0.446,0.454,0.382,0.406,0.321,0.345,0.386,0.341,0.418,0.418,0.482,0.426,0.474,0.45,0.482,0.478,0.454,0.494,0.438,0.466,0.349,0.41,2482,Stability,NKX31_HUMAN,High,Human
+NPC1_HUMAN_Erwood_2022_HEK293T,0.391,0.391,0.406,0.422,0.453,0.453,0.141,0.281,0.391,0.453,0.344,0.172,0.219,0.156,0.219,0.297,0.438,0.391,0.422,0.125,0.203,0.234,0.141,0.297,0.266,0.188,0.406,0.266,0.219,0.359,0.281,0.219,0.078,0.156,0.281,0.375,0.359,0.422,0.422,0.406,0.422,0.453,0.156,0.125,0.375,0.281,0.344,0.375,0.062,0.125,0.375,0.422,0.391,0.375,0.344,0.359,0.344,0.359,0.359,0.391,0.391,0.266,637,Activity,NPC1_HUMAN,Low,Human
+NPC1_HUMAN_Erwood_2022_RPE1,0.429,0.429,0.429,0.429,0.429,0.429,0.286,0.286,0.429,0.714,0.286,0.143,0.143,0.143,0.286,0.286,0.429,0.0,0.429,0.143,0.286,0.571,0.571,0.286,0.143,0.286,0.429,0.571,0.571,0.429,0.429,0.286,0.0,0.286,0.429,0.286,0.429,0.429,0.429,0.429,0.429,0.429,0.286,0.286,0.286,0.0,0.286,0.0,0.143,0.143,0.143,0.143,0.143,0.286,0.0,0.143,0.143,0.143,0.286,0.143,0.143,0.143,63,Activity,NPC1_HUMAN,Low,Human
+NRAM_I33A0_Jiang_2016,0.433,0.567,0.467,0.433,0.567,0.567,0.067,0.6,0.667,0.7,0.067,0.133,0.6,0.0,0.0,0.067,0.233,0.567,0.5,0.4,0.7,0.667,0.533,0.533,0.0,0.433,0.567,0.4,0.633,0.567,0.4,0.4,0.067,0.6,0.533,0.6,0.533,0.567,0.6,0.633,0.6,0.567,0.0,0.033,0.0,0.033,0.267,0.2,0.4,0.233,0.167,0.2,0.3,0.233,0.3,0.4,0.267,0.267,0.333,0.333,0.233,0.033,298,OrganismalFitness,NRAM_I33A0,Low,Virus
+NUD15_HUMAN_Suiter_2020,0.193,0.26,0.281,0.274,0.274,0.284,0.053,0.179,0.253,0.267,0.256,0.207,0.284,0.105,0.126,0.14,0.193,0.239,0.263,0.204,0.102,0.13,0.249,0.253,0.133,0.298,0.267,0.239,0.312,0.309,0.267,0.211,0.098,0.123,0.119,0.305,0.196,0.204,0.302,0.274,0.288,0.309,0.116,0.07,0.284,0.14,0.196,0.33,0.211,0.168,0.214,0.249,0.211,0.228,0.239,0.263,0.239,0.256,0.246,0.26,0.298,0.2,2844,Expression,NUD15_HUMAN,High,Human
+NUSA_ECOLI_Tsuboyama_2023_1WCL,0.123,0.256,0.163,0.167,0.172,0.172,0.133,0.128,0.202,0.138,0.177,0.03,0.059,0.103,0.103,0.084,0.148,0.167,0.217,0.163,0.172,0.202,0.197,0.217,0.094,0.143,0.113,0.207,0.251,0.138,0.217,0.217,0.074,0.163,0.143,0.148,0.158,0.118,0.143,0.172,0.163,0.192,0.172,0.138,0.103,0.133,0.374,0.305,0.276,0.345,0.246,0.291,0.266,0.261,0.286,0.246,0.246,0.246,0.241,0.261,0.271,0.158,2028,Stability,NUSA_ECOLI,High,Prokaryote
+NUSG_MYCTU_Tsuboyama_2023_2MI6,0.283,0.217,0.225,0.246,0.239,0.246,0.159,0.181,0.283,0.225,0.232,0.254,0.246,0.123,0.261,0.254,0.232,0.217,0.217,0.138,0.116,0.152,0.152,0.152,0.254,0.181,0.167,0.167,0.174,0.246,0.232,0.196,0.217,0.167,0.145,0.181,0.21,0.203,0.21,0.196,0.188,0.21,0.217,0.08,0.188,0.297,0.254,0.21,0.37,0.319,0.283,0.341,0.312,0.275,0.29,0.29,0.304,0.319,0.268,0.297,0.188,0.275,1380,Stability,NUSG_MYCTU,High,Prokaryote
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,0.372,0.484,0.547,0.559,0.575,0.541,0.119,0.378,0.5,0.494,0.509,0.469,0.478,0.131,0.453,0.497,0.569,0.519,0.512,0.55,0.447,0.434,0.431,0.428,0.438,0.416,0.45,0.412,0.403,0.494,0.497,0.509,0.328,0.169,0.122,0.319,0.466,0.475,0.491,0.538,0.534,0.531,0.381,0.144,0.406,0.431,0.562,0.438,0.669,0.578,0.578,0.591,0.588,0.6,0.606,0.603,0.553,0.578,0.6,0.609,0.631,0.5,3197,Stability,OBSCN_HUMAN,High,Human
+ODP2_GEOSE_Tsuboyama_2023_1W4G,0.184,0.132,0.237,0.167,0.246,0.132,0.193,0.149,0.114,0.114,0.193,0.184,0.184,0.175,0.158,0.211,0.175,0.184,0.167,0.193,0.105,0.114,0.184,0.096,0.088,0.114,0.167,0.105,0.158,0.202,0.123,0.096,0.123,0.167,0.044,0.167,0.158,0.044,0.079,0.158,0.114,0.132,0.175,0.175,0.184,0.184,0.237,0.193,0.246,0.307,0.228,0.211,0.246,0.254,0.263,0.237,0.263,0.263,0.228,0.246,0.202,0.193,1134,Stability,ODP2_GEOSE,High,Prokaryote
+OPSD_HUMAN_Wan_2019,0.176,0.412,0.412,0.412,0.471,0.412,0.176,0.412,0.294,0.412,0.353,0.294,0.294,0.294,0.235,0.235,0.294,0.412,0.176,0.412,0.412,0.353,0.235,0.294,0.471,0.353,0.353,0.412,0.235,0.294,0.235,0.235,0.0,0.353,0.235,0.118,0.412,0.294,0.294,0.412,0.412,0.412,0.353,0.059,0.353,0.412,0.176,0.353,0.176,0.176,0.235,0.235,0.176,0.176,0.176,0.294,0.235,0.235,0.235,0.176,0.412,0.294,165,Expression,OPSD_HUMAN,High,Human
+OTC_HUMAN_Lo_2023,0.316,0.342,0.297,0.291,0.31,0.304,0.139,0.152,0.316,0.297,0.291,0.323,0.335,0.158,0.291,0.342,0.335,0.297,0.297,0.304,0.323,0.272,0.272,0.278,0.278,0.348,0.304,0.297,0.234,0.304,0.241,0.171,0.133,0.266,0.278,0.278,0.329,0.304,0.323,0.31,0.297,0.316,0.209,0.127,0.316,0.259,0.367,0.304,0.291,0.259,0.304,0.348,0.342,0.361,0.329,0.348,0.348,0.342,0.342,0.342,0.38,0.342,1570,Activity,OTC_HUMAN,Medium,Human
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,0.188,0.266,0.203,0.203,0.219,0.203,0.078,0.125,0.109,0.141,0.25,0.297,0.234,0.109,0.281,0.328,0.156,0.172,0.188,0.156,0.109,0.062,0.078,0.078,0.109,0.062,0.078,0.094,0.047,0.031,0.109,0.109,0.062,0.141,0.125,0.141,0.188,0.156,0.125,0.203,0.172,0.188,0.172,0.141,0.25,0.344,0.25,0.203,0.328,0.281,0.219,0.219,0.219,0.188,0.219,0.203,0.234,0.172,0.172,0.219,0.359,0.328,635,Stability,OTU7A_HUMAN,High,Human
+OXDA_RHOTO_Vanella_2023_activity,0.127,0.192,0.231,0.223,0.238,0.233,0.114,0.131,0.141,0.145,0.181,0.162,0.172,0.112,0.114,0.156,0.18,0.209,0.217,0.15,0.125,0.142,0.156,0.155,0.119,0.161,0.181,0.162,0.225,0.238,0.191,0.177,0.072,0.133,0.131,0.134,0.138,0.128,0.136,0.223,0.233,0.234,0.122,0.095,0.177,0.136,0.195,0.242,0.248,0.162,0.184,0.216,0.202,0.205,0.197,0.223,0.208,0.223,0.194,0.216,0.222,0.148,6396,Activity,OXDA_RHOTO,High,Eukaryote
+OXDA_RHOTO_Vanella_2023_expression,0.096,0.14,0.168,0.161,0.155,0.164,0.149,0.123,0.133,0.12,0.149,0.133,0.134,0.096,0.12,0.151,0.161,0.173,0.18,0.081,0.092,0.145,0.136,0.134,0.111,0.162,0.152,0.14,0.152,0.177,0.127,0.121,0.09,0.108,0.114,0.134,0.109,0.09,0.114,0.155,0.149,0.158,0.111,0.096,0.145,0.114,0.176,0.189,0.186,0.146,0.164,0.146,0.142,0.139,0.16,0.158,0.161,0.16,0.168,0.158,0.183,0.157,6769,Expression,OXDA_RHOTO,High,Eukaryote
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,0.099,0.103,0.108,0.1,0.107,0.118,0.098,0.096,0.079,0.078,0.167,0.131,0.158,0.1,0.083,0.107,0.111,0.135,0.143,0.07,0.115,0.151,0.181,0.123,0.116,0.165,0.194,0.201,0.096,0.078,0.142,0.102,0.118,0.107,0.138,0.072,0.103,0.126,0.079,0.099,0.104,0.099,0.111,0.091,0.154,0.087,0.139,0.171,0.13,0.129,0.131,0.12,0.118,0.106,0.103,0.118,0.114,0.123,0.11,0.108,0.124,0.09,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,0.084,0.078,0.074,0.075,0.08,0.082,0.116,0.107,0.06,0.059,0.193,0.126,0.174,0.108,0.091,0.112,0.114,0.127,0.149,0.094,0.119,0.166,0.218,0.114,0.112,0.194,0.25,0.244,0.099,0.084,0.171,0.118,0.131,0.119,0.154,0.063,0.106,0.129,0.066,0.078,0.087,0.071,0.126,0.11,0.167,0.099,0.151,0.182,0.163,0.114,0.146,0.155,0.126,0.123,0.112,0.119,0.127,0.122,0.12,0.12,0.146,0.111,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,0.074,0.058,0.07,0.062,0.059,0.059,0.074,0.054,0.048,0.046,0.11,0.092,0.107,0.112,0.126,0.111,0.08,0.099,0.106,0.035,0.126,0.099,0.099,0.07,0.09,0.108,0.11,0.112,0.059,0.052,0.09,0.075,0.067,0.071,0.099,0.058,0.079,0.09,0.064,0.058,0.063,0.06,0.091,0.096,0.099,0.096,0.106,0.1,0.115,0.107,0.099,0.098,0.09,0.082,0.099,0.091,0.086,0.076,0.08,0.083,0.09,0.099,7467,OrganismalFitness,P53_HUMAN,Low,Human
+P53_HUMAN_Kotler_2018,0.324,0.257,0.276,0.286,0.238,0.229,0.124,0.267,0.238,0.257,0.286,0.219,0.248,0.143,0.133,0.238,0.267,0.276,0.267,0.219,0.2,0.267,0.219,0.229,0.21,0.2,0.21,0.2,0.162,0.286,0.267,0.219,0.038,0.162,0.229,0.229,0.324,0.305,0.314,0.276,0.286,0.257,0.143,0.114,0.2,0.114,0.19,0.229,0.21,0.181,0.21,0.248,0.248,0.229,0.238,0.238,0.21,0.229,0.238,0.248,0.286,0.219,1048,OrganismalFitness,P53_HUMAN,Low,Human
+P84126_THETH_Chan_2017,0.276,0.414,0.388,0.401,0.382,0.388,0.184,0.289,0.401,0.375,0.408,0.368,0.382,0.204,0.401,0.355,0.401,0.368,0.414,0.375,0.263,0.329,0.342,0.382,0.322,0.395,0.395,0.355,0.447,0.316,0.362,0.322,0.118,0.309,0.316,0.355,0.289,0.27,0.309,0.355,0.362,0.355,0.257,0.112,0.382,0.395,0.204,0.388,0.362,0.178,0.329,0.362,0.322,0.349,0.362,0.395,0.375,0.362,0.401,0.388,0.395,0.388,1519,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+PA_I34A1_Wu_2015,0.335,0.335,0.335,0.324,0.335,0.324,0.115,0.269,0.088,0.132,0.11,0.11,0.093,0.104,0.093,0.104,0.104,0.099,0.247,0.236,0.28,0.275,0.264,0.33,0.121,0.258,0.258,0.225,0.264,0.291,0.264,0.242,0.126,0.275,0.28,0.302,0.335,0.319,0.319,0.33,0.341,0.319,0.11,0.11,0.104,0.11,0.115,0.099,0.071,0.137,0.088,0.093,0.099,0.088,0.088,0.11,0.071,0.082,0.082,0.115,0.104,0.071,1820,OrganismalFitness,PA_I34A1,Medium,Virus
+PABP_YEAST_Melamed_2013,0.279,0.269,0.275,0.275,0.29,0.285,0.182,0.234,0.269,0.276,0.277,0.287,0.289,0.172,0.191,0.21,0.242,0.245,0.259,0.249,0.255,0.285,0.275,0.282,0.27,0.281,0.292,0.281,0.25,0.291,0.273,0.249,0.121,0.244,0.27,0.261,0.269,0.285,0.272,0.286,0.291,0.286,0.242,0.115,0.288,0.264,0.126,0.288,0.2,0.148,0.233,0.216,0.229,0.221,0.223,0.217,0.221,0.227,0.228,0.225,0.269,0.213,37708,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+PAI1_HUMAN_Huttinger_2021,0.207,0.215,0.207,0.211,0.211,0.211,0.086,0.172,0.2,0.222,0.196,0.198,0.217,0.121,0.196,0.209,0.2,0.193,0.179,0.202,0.146,0.17,0.151,0.2,0.178,0.179,0.181,0.183,0.168,0.168,0.138,0.121,0.129,0.142,0.196,0.183,0.194,0.215,0.198,0.224,0.217,0.224,0.185,0.133,0.196,0.183,0.217,0.247,0.219,0.155,0.213,0.202,0.219,0.213,0.211,0.226,0.224,0.221,0.221,0.224,0.232,0.206,5345,Activity,PAI1_HUMAN,,Human
+PHOT_CHLRE_Chen_2023,0.273,0.396,0.515,0.502,0.368,0.354,0.401,0.393,0.501,0.49,0.434,0.47,0.502,0.505,0.529,0.49,0.526,0.497,0.507,0.362,0.397,0.458,0.413,0.463,0.406,0.438,0.459,0.459,0.423,0.389,0.426,0.381,0.342,0.325,0.317,0.442,0.412,0.388,0.453,0.376,0.39,0.379,0.463,0.288,0.464,0.49,0.23,0.415,0.396,0.269,0.412,0.377,0.378,0.39,0.386,0.401,0.397,0.402,0.401,0.395,0.491,0.484,167529,Activity,PHOT_CHLRE,High,Eukaryote
+PIN1_HUMAN_Tsuboyama_2023_1I6C,0.333,0.383,0.42,0.444,0.407,0.395,0.346,0.185,0.333,0.432,0.321,0.296,0.358,0.37,0.358,0.395,0.42,0.296,0.346,0.346,0.296,0.222,0.16,0.111,0.21,0.185,0.123,0.099,0.086,0.198,0.247,0.185,0.198,0.321,0.247,0.21,0.395,0.37,0.296,0.383,0.346,0.309,0.321,0.148,0.358,0.358,0.272,0.321,0.284,0.259,0.296,0.383,0.383,0.321,0.309,0.358,0.346,0.333,0.346,0.37,0.407,0.395,802,Stability,PIN1_HUMAN,High,Human
+PITX2_HUMAN_Tsuboyama_2023_2L7M,0.366,0.339,0.333,0.322,0.333,0.339,0.295,0.213,0.273,0.35,0.219,0.29,0.273,0.415,0.383,0.415,0.393,0.361,0.328,0.322,0.213,0.186,0.186,0.186,0.273,0.219,0.235,0.18,0.191,0.311,0.24,0.273,0.23,0.246,0.301,0.164,0.295,0.35,0.251,0.328,0.344,0.322,0.295,0.311,0.18,0.257,0.388,0.202,0.443,0.377,0.377,0.41,0.421,0.41,0.41,0.404,0.388,0.388,0.41,0.404,0.18,0.339,1824,Stability,PITX2_HUMAN,High,Human
+PKN1_HUMAN_Tsuboyama_2023_1URF,0.145,0.282,0.145,0.137,0.122,0.153,0.229,0.046,0.107,0.099,0.069,0.13,0.099,0.183,0.16,0.191,0.092,0.046,0.061,0.176,0.176,0.16,0.198,0.153,0.153,0.137,0.16,0.183,0.069,0.046,0.092,0.099,0.099,0.221,0.191,0.191,0.176,0.176,0.168,0.183,0.168,0.153,0.153,0.214,0.13,0.183,0.252,0.153,0.16,0.214,0.115,0.115,0.122,0.115,0.107,0.107,0.099,0.122,0.115,0.115,0.198,0.153,1301,Stability,PKN1_HUMAN,High,Human
+POLG_CXB3N_Mattenberger_2021,0.309,0.38,0.361,0.387,0.392,0.399,0.088,0.261,0.394,0.389,0.176,0.09,0.113,0.097,0.091,0.136,0.231,0.252,0.288,0.279,0.288,0.348,0.352,0.328,0.145,0.316,0.319,0.31,0.372,0.391,0.289,0.284,0.096,0.099,0.251,0.316,0.286,0.321,0.342,0.349,0.373,0.375,0.098,0.092,0.196,0.1,0.151,0.215,0.101,0.121,0.197,0.211,0.211,0.212,0.199,0.202,0.214,0.223,0.212,0.218,0.109,0.102,15711,OrganismalFitness,POLG_CXB3N,Medium,Virus
+POLG_DEN26_Suphatrakul_2023,0.327,0.395,0.341,0.358,0.384,0.386,0.086,0.299,0.473,0.477,0.143,0.089,0.102,0.08,0.086,0.109,0.134,0.14,0.164,0.331,0.33,0.325,0.331,0.292,0.331,0.344,0.34,0.351,0.349,0.454,0.411,0.314,0.104,0.089,0.16,0.366,0.288,0.332,0.405,0.383,0.393,0.433,0.089,0.086,0.198,0.088,0.188,0.278,0.07,0.136,0.138,0.138,0.166,0.146,0.152,0.154,0.144,0.15,0.14,0.157,0.12,0.114,16897,OrganismalFitness,POLG_DEN26,Low,Virus
+POLG_HCVJF_Qi_2014,0.362,0.411,0.368,0.38,0.393,0.405,0.074,0.153,0.387,0.387,0.123,0.387,0.35,0.086,0.092,0.08,0.11,0.092,0.104,0.239,0.196,0.276,0.325,0.288,0.282,0.27,0.239,0.239,0.294,0.387,0.313,0.221,0.067,0.368,0.356,0.288,0.362,0.374,0.362,0.393,0.368,0.344,0.086,0.074,0.331,0.117,0.08,0.245,0.233,0.19,0.172,0.153,0.123,0.153,0.141,0.129,0.221,0.153,0.166,0.178,0.092,0.098,1630,OrganismalFitness,POLG_HCVJF,Medium,Virus
+POLG_PESV_Tsuboyama_2023_2MXD,0.164,0.519,0.435,0.454,0.464,0.464,0.088,0.48,0.333,0.497,0.489,0.113,0.129,0.099,0.123,0.117,0.156,0.15,0.127,0.595,0.097,0.078,0.086,0.09,0.109,0.049,0.08,0.051,0.131,0.558,0.608,0.62,0.113,0.105,0.094,0.088,0.296,0.294,0.281,0.437,0.437,0.429,0.185,0.127,0.189,0.154,0.55,0.507,0.561,0.581,0.622,0.63,0.63,0.651,0.653,0.663,0.634,0.69,0.637,0.663,0.569,0.326,5130,Stability,POLG_PESV,Medium,Virus
+PPARG_HUMAN_Majithia_2016,0.16,0.237,0.259,0.253,0.255,0.241,0.146,0.193,0.252,0.242,0.245,0.274,0.29,0.124,0.135,0.138,0.158,0.22,0.277,0.288,0.278,0.293,0.175,0.175,0.234,0.308,0.332,0.342,0.179,0.269,0.162,0.076,0.117,0.294,0.268,0.274,0.275,0.33,0.308,0.255,0.275,0.266,0.124,0.121,0.25,0.142,0.244,0.274,0.248,0.162,0.184,0.167,0.146,0.137,0.193,0.147,0.164,0.148,0.127,0.153,0.198,0.149,9576,Activity,PPARG_HUMAN,Medium,Human
+PPM1D_HUMAN_Miller_2022,0.18,0.176,0.18,0.187,0.18,0.177,0.128,0.154,0.176,0.173,0.183,0.21,0.2,0.131,0.14,0.161,0.174,0.188,0.201,0.163,0.172,0.174,0.205,0.181,0.177,0.202,0.192,0.214,0.174,0.181,0.172,0.135,0.113,0.163,0.188,0.206,0.191,0.192,0.205,0.187,0.185,0.186,0.142,0.115,0.191,0.171,0.169,0.201,0.188,0.135,0.177,0.18,0.176,0.167,0.181,0.168,0.163,0.168,0.173,0.181,0.215,0.158,7889,OrganismalFitness,PPM1D_HUMAN,Low,Human
+PR40A_HUMAN_Tsuboyama_2023_1UZC,0.191,0.23,0.343,0.343,0.299,0.284,0.216,0.216,0.392,0.392,0.397,0.191,0.265,0.221,0.225,0.358,0.387,0.402,0.265,0.299,0.363,0.358,0.387,0.333,0.353,0.392,0.363,0.353,0.363,0.382,0.299,0.25,0.221,0.206,0.245,0.235,0.294,0.275,0.245,0.319,0.314,0.324,0.216,0.221,0.299,0.24,0.338,0.338,0.426,0.319,0.382,0.412,0.382,0.387,0.397,0.407,0.402,0.392,0.407,0.402,0.402,0.431,2033,Stability,PR40A_HUMAN,Medium,Human
+PRKN_HUMAN_Clausen_2023,0.205,0.189,0.201,0.207,0.21,0.217,0.12,0.187,0.17,0.187,0.197,0.236,0.227,0.17,0.185,0.225,0.256,0.293,0.292,0.197,0.183,0.213,0.194,0.168,0.187,0.249,0.178,0.241,0.179,0.186,0.151,0.128,0.11,0.155,0.202,0.167,0.209,0.202,0.183,0.216,0.211,0.187,0.185,0.11,0.264,0.185,0.289,0.243,0.308,0.161,0.249,0.271,0.284,0.269,0.236,0.245,0.273,0.273,0.275,0.268,0.276,0.271,8756,Expression,PRKN_HUMAN,Low,Human
+PSAE_SYNP2_Tsuboyama_2023_1PSE,0.177,0.165,0.127,0.133,0.152,0.152,0.127,0.07,0.108,0.095,0.373,0.158,0.165,0.12,0.19,0.266,0.278,0.209,0.196,0.133,0.12,0.057,0.12,0.139,0.146,0.152,0.044,0.089,0.101,0.133,0.146,0.133,0.108,0.12,0.089,0.127,0.19,0.177,0.139,0.152,0.165,0.146,0.165,0.139,0.203,0.139,0.278,0.259,0.316,0.291,0.31,0.241,0.259,0.222,0.259,0.234,0.266,0.259,0.222,0.266,0.209,0.272,1579,Stability,PSAE_PICP2,Medium,Prokaryote
+PTEN_HUMAN_Matreyek_2021,0.157,0.183,0.171,0.165,0.153,0.163,0.102,0.159,0.181,0.2,0.185,0.191,0.212,0.104,0.136,0.157,0.183,0.147,0.169,0.181,0.11,0.204,0.191,0.204,0.151,0.169,0.173,0.181,0.179,0.208,0.167,0.165,0.118,0.12,0.208,0.183,0.145,0.2,0.194,0.165,0.193,0.185,0.118,0.094,0.194,0.149,0.173,0.204,0.159,0.13,0.2,0.198,0.193,0.179,0.189,0.181,0.179,0.181,0.193,0.2,0.194,0.169,5083,Expression,PTEN_HUMAN,Medium,Human
+PTEN_HUMAN_Mighell_2018,0.208,0.16,0.179,0.185,0.175,0.175,0.116,0.139,0.153,0.15,0.132,0.138,0.139,0.123,0.169,0.187,0.16,0.161,0.169,0.156,0.125,0.132,0.172,0.153,0.15,0.171,0.157,0.161,0.16,0.153,0.161,0.138,0.107,0.146,0.135,0.152,0.191,0.147,0.175,0.175,0.153,0.174,0.16,0.11,0.136,0.198,0.149,0.135,0.179,0.157,0.15,0.156,0.161,0.157,0.147,0.16,0.16,0.15,0.16,0.15,0.196,0.207,7260,Activity,PTEN_HUMAN,Medium,Human
+Q2N0S5_9HIV1_Haddox_2018,0.24,0.211,0.189,0.19,0.211,0.214,0.079,0.189,0.225,0.229,0.218,0.233,0.236,0.088,0.082,0.086,0.11,0.104,0.123,0.208,0.238,0.191,0.184,0.163,0.236,0.192,0.185,0.185,0.178,0.232,0.195,0.196,0.162,0.232,0.204,0.207,0.234,0.229,0.23,0.219,0.215,0.215,0.202,0.07,0.222,0.22,0.165,0.237,0.109,0.149,0.125,0.14,0.152,0.148,0.139,0.151,0.134,0.141,0.122,0.147,0.123,0.108,12729,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,0.15,0.168,0.171,0.165,0.168,0.168,0.112,0.1,0.179,0.174,0.153,0.165,0.179,0.147,0.176,0.174,0.168,0.188,0.159,0.165,0.135,0.097,0.097,0.094,0.179,0.171,0.15,0.144,0.121,0.115,0.126,0.115,0.144,0.121,0.088,0.079,0.135,0.126,0.1,0.168,0.162,0.147,0.168,0.124,0.176,0.15,0.2,0.191,0.153,0.147,0.141,0.135,0.165,0.15,0.162,0.156,0.156,0.153,0.144,0.147,0.197,0.185,3344,Binding,Q53Z42_HUMAN,Medium,Human
+Q53Z42_HUMAN_McShan_2019_expression,0.289,0.283,0.301,0.295,0.31,0.301,0.095,0.17,0.28,0.28,0.286,0.262,0.292,0.14,0.158,0.259,0.283,0.298,0.292,0.289,0.229,0.208,0.19,0.185,0.312,0.307,0.265,0.301,0.265,0.182,0.235,0.199,0.196,0.226,0.176,0.176,0.277,0.253,0.205,0.301,0.298,0.28,0.188,0.14,0.298,0.256,0.298,0.31,0.25,0.161,0.262,0.262,0.292,0.295,0.301,0.286,0.28,0.31,0.289,0.28,0.304,0.357,3344,Expression,Q53Z42_HUMAN,Medium,Human
+Q59976_STRSQ_Romero_2015,0.193,0.25,0.243,0.253,0.27,0.267,0.16,0.243,0.27,0.27,0.23,0.177,0.207,0.143,0.18,0.2,0.227,0.233,0.233,0.267,0.25,0.287,0.287,0.297,0.27,0.31,0.313,0.287,0.283,0.277,0.227,0.183,0.15,0.243,0.283,0.297,0.217,0.25,0.273,0.273,0.273,0.277,0.213,0.13,0.243,0.243,0.207,0.23,0.223,0.147,0.2,0.203,0.227,0.2,0.22,0.2,0.223,0.22,0.217,0.213,0.257,0.22,2999,Activity,Q59976_STRSQ,Medium,Prokaryote
+Q6WV13_9MAXI_Somermeyer_2022,0.136,0.145,0.111,0.11,0.127,0.124,0.08,0.097,0.132,0.132,0.1,0.092,0.079,0.074,0.078,0.072,0.07,0.072,0.072,0.119,0.088,0.109,0.082,0.116,0.093,0.07,0.074,0.062,0.077,0.149,0.117,0.117,0.087,0.095,0.089,0.115,0.112,0.117,0.112,0.123,0.128,0.125,0.099,0.079,0.084,0.092,0.186,0.152,0.18,0.148,0.131,0.138,0.114,0.149,0.144,0.145,0.141,0.133,0.135,0.141,0.11,0.102,31401,Activity,Q6WV12_9MAXI,Low,Eukaryote
+Q837P4_ENTFA_Meier_2023,0.2,0.229,0.214,0.2,0.257,0.257,0.086,0.129,0.243,0.271,0.257,0.229,0.3,0.129,0.186,0.186,0.214,0.286,0.271,0.1,0.271,0.257,0.257,0.357,0.286,0.271,0.286,0.3,0.3,0.271,0.286,0.257,0.1,0.329,0.271,0.3,0.314,0.271,0.286,0.257,0.229,0.243,0.186,0.071,0.243,0.2,0.129,0.229,0.2,0.086,0.3,0.243,0.243,0.271,0.229,0.2,0.271,0.271,0.271,0.286,0.3,0.214,697,Activity,Q837P4_ENTFA,Medium,Prokaryote
+Q837P5_ENTFA_Meier_2023,0.067,0.293,0.267,0.28,0.307,0.307,0.12,0.2,0.133,0.133,0.253,0.267,0.24,0.067,0.12,0.12,0.187,0.267,0.227,0.147,0.2,0.2,0.293,0.227,0.187,0.227,0.293,0.267,0.333,0.2,0.24,0.28,0.093,0.24,0.2,0.28,0.24,0.2,0.267,0.24,0.24,0.24,0.093,0.093,0.213,0.16,0.12,0.2,0.227,0.133,0.2,0.227,0.227,0.2,0.213,0.173,0.227,0.213,0.24,0.2,0.213,0.227,747,Activity,Q837P5_ENTFA,Medium,Prokaryote
+Q8WTC7_9CNID_Somermeyer_2022,0.183,0.207,0.173,0.179,0.189,0.189,0.097,0.15,0.195,0.199,0.124,0.107,0.107,0.094,0.101,0.104,0.095,0.115,0.11,0.141,0.108,0.11,0.093,0.11,0.097,0.108,0.116,0.185,0.189,0.198,0.12,0.123,0.09,0.104,0.101,0.191,0.179,0.18,0.195,0.192,0.191,0.206,0.108,0.113,0.117,0.112,0.184,0.179,0.201,0.163,0.141,0.138,0.144,0.148,0.144,0.152,0.14,0.158,0.141,0.15,0.132,0.119,33510,Activity,Q8WTC7_9CNID,Low,Eukaryote
+R1AB_SARS2_Flynn_2022,0.169,0.171,0.148,0.161,0.166,0.161,0.077,0.12,0.059,0.059,0.086,0.058,0.056,0.068,0.07,0.073,0.084,0.147,0.147,0.117,0.138,0.15,0.145,0.15,0.134,0.131,0.129,0.119,0.143,0.147,0.119,0.119,0.113,0.112,0.113,0.15,0.161,0.155,0.164,0.164,0.164,0.168,0.058,0.056,0.079,0.07,0.154,0.122,0.169,0.147,0.098,0.122,0.119,0.112,0.113,0.103,0.092,0.092,0.105,0.096,0.106,0.077,5725,OrganismalFitness,R1AB_SARS2,Medium,Virus
+RAD_ANTMA_Tsuboyama_2023_2CJJ,0.315,0.163,0.13,0.185,0.13,0.185,0.228,0.163,0.196,0.217,0.13,0.076,0.109,0.283,0.315,0.261,0.217,0.239,0.141,0.174,0.337,0.13,0.12,0.174,0.152,0.163,0.152,0.098,0.098,0.239,0.163,0.13,0.207,0.13,0.174,0.207,0.293,0.272,0.25,0.185,0.196,0.196,0.228,0.283,0.13,0.207,0.217,0.141,0.25,0.25,0.185,0.283,0.228,0.207,0.25,0.239,0.239,0.261,0.261,0.25,0.098,0.261,912,Stability,RAD_ANTMA,High,Eukaryote
+RAF1_HUMAN_Zinkus-Boltz_2019,0.133,0.167,0.133,0.167,0.2,0.2,0.033,0.2,0.167,0.167,0.2,0.133,0.2,0.1,0.167,0.133,0.133,0.167,0.167,0.167,0.133,0.2,0.167,0.133,0.2,0.2,0.2,0.2,0.133,0.167,0.167,0.167,0.067,0.167,0.2,0.133,0.133,0.167,0.133,0.167,0.133,0.133,0.133,0.067,0.167,0.133,0.067,0.2,0.067,0.067,0.167,0.133,0.2,0.133,0.1,0.133,0.133,0.1,0.133,0.133,0.1,0.067,297,OrganismalFitness,RAF1_HUMAN,Low,Human
+RASH_HUMAN_Bandaru_2017,0.166,0.092,0.086,0.092,0.08,0.089,0.073,0.057,0.092,0.086,0.086,0.076,0.061,0.162,0.182,0.156,0.111,0.099,0.105,0.127,0.096,0.099,0.07,0.08,0.086,0.08,0.08,0.067,0.067,0.108,0.054,0.073,0.064,0.115,0.086,0.08,0.124,0.105,0.111,0.092,0.089,0.089,0.134,0.229,0.115,0.115,0.191,0.111,0.121,0.194,0.118,0.115,0.127,0.108,0.111,0.118,0.115,0.115,0.089,0.118,0.083,0.153,3134,Activity,RASH_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_abundance,0.174,0.153,0.167,0.174,0.167,0.175,0.107,0.156,0.171,0.186,0.157,0.154,0.157,0.215,0.187,0.197,0.174,0.169,0.176,0.171,0.135,0.154,0.16,0.169,0.154,0.173,0.168,0.163,0.177,0.132,0.153,0.133,0.109,0.132,0.159,0.169,0.143,0.171,0.187,0.173,0.176,0.179,0.183,0.203,0.124,0.172,0.165,0.146,0.2,0.162,0.165,0.16,0.161,0.162,0.169,0.16,0.171,0.164,0.162,0.168,0.177,0.188,26012,Expression,RASK_HUMAN,High,Human
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,0.235,0.258,0.274,0.271,0.267,0.27,0.07,0.201,0.289,0.305,0.246,0.253,0.26,0.279,0.282,0.286,0.293,0.267,0.251,0.272,0.238,0.261,0.258,0.249,0.279,0.274,0.252,0.246,0.226,0.242,0.23,0.209,0.176,0.24,0.267,0.256,0.229,0.262,0.258,0.272,0.275,0.273,0.256,0.165,0.271,0.267,0.148,0.219,0.291,0.19,0.238,0.225,0.23,0.23,0.237,0.228,0.237,0.233,0.232,0.234,0.268,0.242,24873,Binding,RASK_HUMAN,High,Human
+RBP1_HUMAN_Tsuboyama_2023_2KWH,0.239,0.157,0.239,0.239,0.239,0.239,0.284,0.112,0.209,0.209,0.351,0.343,0.351,0.328,0.343,0.351,0.366,0.157,0.187,0.254,0.328,0.097,0.164,0.082,0.343,0.201,0.194,0.104,0.104,0.157,0.142,0.104,0.104,0.299,0.351,0.291,0.269,0.246,0.239,0.261,0.246,0.224,0.358,0.351,0.343,0.343,0.321,0.321,0.328,0.284,0.336,0.291,0.306,0.299,0.328,0.321,0.291,0.306,0.313,0.328,0.321,0.321,1332,Stability,RBP1_HUMAN,High,Human
+RCD1_ARATH_Tsuboyama_2023_5OAO,0.181,0.142,0.165,0.165,0.15,0.157,0.165,0.087,0.142,0.142,0.283,0.26,0.276,0.252,0.307,0.252,0.205,0.157,0.197,0.094,0.228,0.157,0.181,0.205,0.157,0.205,0.213,0.181,0.197,0.134,0.11,0.047,0.071,0.228,0.165,0.244,0.213,0.228,0.22,0.189,0.173,0.173,0.236,0.26,0.22,0.22,0.228,0.26,0.228,0.276,0.189,0.213,0.181,0.189,0.189,0.189,0.213,0.173,0.181,0.189,0.173,0.276,1261,Stability,RCD1_ARATH,Medium,Eukaryote
+RCRO_LAMBD_Tsuboyama_2023_1ORC,0.197,0.272,0.298,0.351,0.346,0.311,0.237,0.175,0.281,0.311,0.303,0.272,0.364,0.25,0.307,0.311,0.289,0.281,0.276,0.263,0.18,0.25,0.228,0.25,0.145,0.303,0.175,0.25,0.294,0.289,0.281,0.254,0.184,0.219,0.14,0.254,0.263,0.246,0.272,0.298,0.303,0.281,0.254,0.311,0.316,0.263,0.539,0.404,0.469,0.469,0.382,0.399,0.377,0.408,0.399,0.377,0.39,0.346,0.351,0.386,0.465,0.158,2278,Stability,RCRO_LAMBD,High,Virus
+RD23A_HUMAN_Tsuboyama_2023_1IFY,0.441,0.304,0.304,0.324,0.304,0.294,0.304,0.196,0.245,0.235,0.216,0.245,0.255,0.275,0.382,0.275,0.225,0.196,0.186,0.284,0.147,0.255,0.196,0.216,0.265,0.235,0.216,0.245,0.176,0.196,0.235,0.206,0.294,0.137,0.245,0.216,0.353,0.353,0.294,0.333,0.324,0.294,0.255,0.245,0.245,0.284,0.382,0.314,0.441,0.422,0.216,0.245,0.255,0.255,0.284,0.265,0.275,0.245,0.255,0.255,0.235,0.392,1019,Stability,RD23A_HUMAN,High,Human
+RDRP_I33A0_Li_2023,0.184,0.259,0.386,0.395,0.396,0.392,0.071,0.346,0.356,0.353,0.117,0.096,0.107,0.102,0.096,0.108,0.214,0.301,0.338,0.405,0.327,0.366,0.377,0.397,0.131,0.332,0.314,0.306,0.356,0.416,0.345,0.335,0.122,0.339,0.36,0.398,0.351,0.376,0.386,0.412,0.415,0.413,0.102,0.092,0.132,0.106,0.138,0.182,0.098,0.136,0.224,0.218,0.232,0.221,0.221,0.219,0.22,0.238,0.229,0.226,0.119,0.111,12003,OrganismalFitness,RDRP_I33A0,Low,Virus
+REV_HV1H2_Fernandes_2016,0.13,0.135,0.144,0.135,0.149,0.144,0.112,0.14,0.135,0.126,0.084,0.144,0.126,0.107,0.093,0.065,0.13,0.107,0.107,0.112,0.116,0.098,0.093,0.13,0.102,0.149,0.098,0.126,0.13,0.135,0.135,0.126,0.116,0.13,0.14,0.112,0.144,0.153,0.121,0.153,0.144,0.14,0.102,0.088,0.112,0.102,0.07,0.098,0.116,0.126,0.135,0.13,0.126,0.098,0.116,0.093,0.112,0.116,0.102,0.112,0.079,0.121,2147,OrganismalFitness,REV_HV1H2,Medium,Virus
+RFAH_ECOLI_Tsuboyama_2023_2LCL,0.233,0.248,0.18,0.173,0.195,0.165,0.165,0.075,0.105,0.173,0.248,0.173,0.226,0.18,0.158,0.211,0.241,0.12,0.068,0.09,0.143,0.165,0.18,0.158,0.256,0.158,0.195,0.15,0.083,0.083,0.135,0.105,0.053,0.203,0.083,0.068,0.211,0.09,0.158,0.195,0.135,0.173,0.226,0.18,0.233,0.241,0.241,0.173,0.301,0.286,0.188,0.188,0.195,0.18,0.188,0.188,0.241,0.188,0.195,0.195,0.278,0.226,1326,Stability,RFAH_ECOLI,High,Prokaryote
+RL20_AQUAE_Tsuboyama_2023_1GYZ,0.095,0.184,0.129,0.136,0.129,0.129,0.163,0.122,0.184,0.163,0.252,0.204,0.238,0.19,0.184,0.19,0.19,0.136,0.129,0.122,0.116,0.156,0.122,0.15,0.102,0.122,0.129,0.15,0.143,0.048,0.129,0.109,0.088,0.136,0.156,0.109,0.143,0.163,0.143,0.143,0.143,0.136,0.224,0.15,0.15,0.224,0.333,0.184,0.361,0.313,0.204,0.204,0.211,0.184,0.177,0.197,0.177,0.184,0.184,0.19,0.177,0.272,1461,Stability,RL20_AQUAE,High,Prokaryote
+RL40A_YEAST_Mavor_2016,0.159,0.167,0.167,0.19,0.19,0.183,0.095,0.143,0.175,0.19,0.135,0.19,0.159,0.095,0.143,0.19,0.151,0.151,0.206,0.19,0.175,0.135,0.127,0.111,0.183,0.167,0.151,0.19,0.135,0.159,0.143,0.167,0.087,0.119,0.119,0.119,0.159,0.167,0.167,0.183,0.198,0.167,0.143,0.143,0.135,0.111,0.103,0.056,0.127,0.04,0.198,0.167,0.175,0.167,0.175,0.151,0.151,0.183,0.183,0.183,0.111,0.135,1253,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2013,0.117,0.192,0.175,0.217,0.208,0.217,0.017,0.175,0.217,0.233,0.15,0.15,0.192,0.058,0.142,0.208,0.183,0.183,0.233,0.167,0.2,0.175,0.208,0.15,0.125,0.175,0.167,0.167,0.192,0.242,0.242,0.233,0.175,0.192,0.15,0.15,0.183,0.158,0.158,0.192,0.217,0.2,0.183,0.1,0.192,0.158,0.042,0.125,0.108,0.05,0.125,0.142,0.117,0.142,0.133,0.142,0.125,0.142,0.175,0.125,0.192,0.175,1195,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+RL40A_YEAST_Roscoe_2014,0.123,0.094,0.116,0.13,0.116,0.109,0.072,0.138,0.101,0.109,0.159,0.138,0.123,0.123,0.217,0.116,0.152,0.138,0.138,0.145,0.174,0.167,0.138,0.167,0.188,0.13,0.101,0.123,0.123,0.138,0.109,0.13,0.123,0.174,0.13,0.109,0.138,0.116,0.116,0.123,0.13,0.123,0.13,0.145,0.145,0.109,0.087,0.101,0.101,0.094,0.145,0.167,0.152,0.188,0.174,0.159,0.138,0.138,0.145,0.159,0.138,0.123,1380,Activity,RL40A_YEAST,Medium,Eukaryote
+RNC_ECOLI_Weeks_2023,0.147,0.196,0.117,0.121,0.129,0.131,0.077,0.143,0.138,0.133,0.147,0.136,0.154,0.103,0.147,0.136,0.152,0.171,0.173,0.131,0.189,0.199,0.161,0.182,0.182,0.152,0.168,0.152,0.182,0.143,0.173,0.173,0.119,0.161,0.173,0.15,0.15,0.166,0.168,0.14,0.136,0.15,0.173,0.093,0.138,0.131,0.152,0.168,0.131,0.121,0.143,0.161,0.152,0.152,0.154,0.164,0.147,0.147,0.164,0.147,0.166,0.129,4277,Activity,RNC_ECOLI,Medium,Prokaryote
+RPC1_BP434_Tsuboyama_2023_1R69,0.274,0.295,0.233,0.26,0.247,0.26,0.281,0.171,0.185,0.199,0.295,0.288,0.322,0.301,0.315,0.322,0.267,0.26,0.295,0.281,0.219,0.356,0.336,0.26,0.274,0.329,0.247,0.295,0.247,0.253,0.247,0.24,0.178,0.185,0.295,0.301,0.199,0.281,0.288,0.301,0.281,0.281,0.329,0.274,0.26,0.267,0.356,0.308,0.301,0.37,0.274,0.24,0.274,0.281,0.267,0.253,0.247,0.26,0.247,0.26,0.322,0.308,1459,Stability,RPC1_BP434,High,Virus
+RPC1_LAMBD_Li_2019_high-expression,0.139,0.083,0.111,0.111,0.111,0.111,0.167,0.028,0.139,0.139,0.083,0.139,0.139,0.194,0.167,0.194,0.167,0.083,0.139,0.194,0.139,0.139,0.222,0.139,0.194,0.139,0.139,0.167,0.111,0.194,0.028,0.0,0.083,0.139,0.139,0.111,0.139,0.139,0.167,0.139,0.111,0.111,0.167,0.222,0.139,0.139,0.194,0.25,0.139,0.167,0.139,0.111,0.167,0.139,0.139,0.167,0.139,0.167,0.139,0.167,0.278,0.25,351,Activity,RPC1_LAMBD,High,Virus
+RPC1_LAMBD_Li_2019_low-expression,0.194,0.25,0.222,0.25,0.222,0.222,0.139,0.111,0.306,0.25,0.167,0.194,0.25,0.222,0.194,0.194,0.278,0.222,0.167,0.222,0.111,0.25,0.25,0.167,0.083,0.194,0.222,0.194,0.222,0.222,0.194,0.167,0.139,0.222,0.25,0.194,0.194,0.194,0.167,0.222,0.222,0.222,0.167,0.194,0.194,0.222,0.111,0.25,0.222,0.056,0.25,0.25,0.194,0.25,0.278,0.25,0.278,0.25,0.25,0.278,0.194,0.111,351,Activity,RPC1_LAMBD,High,Virus
+RS15_GEOSE_Tsuboyama_2023_1A32,0.142,0.083,0.108,0.117,0.108,0.117,0.092,0.083,0.108,0.092,0.108,0.1,0.083,0.15,0.208,0.192,0.092,0.125,0.142,0.067,0.108,0.108,0.075,0.083,0.133,0.083,0.125,0.1,0.117,0.05,0.075,0.042,0.183,0.117,0.083,0.1,0.133,0.092,0.1,0.117,0.083,0.083,0.192,0.15,0.125,0.217,0.275,0.15,0.158,0.292,0.117,0.117,0.108,0.142,0.117,0.092,0.108,0.108,0.125,0.125,0.15,0.325,1195,Stability,RS15_GEOSE,Medium,Prokaryote
+S22A1_HUMAN_Yee_2023_abundance,0.191,0.244,0.262,0.276,0.256,0.255,0.177,0.193,0.262,0.261,0.257,0.262,0.272,0.193,0.204,0.238,0.288,0.269,0.255,0.109,0.242,0.267,0.238,0.229,0.234,0.25,0.248,0.266,0.227,0.254,0.266,0.206,0.13,0.223,0.262,0.254,0.253,0.288,0.272,0.269,0.265,0.269,0.216,0.151,0.272,0.221,0.214,0.288,0.22,0.149,0.276,0.268,0.288,0.291,0.286,0.28,0.284,0.276,0.279,0.29,0.269,0.231,9803,Expression,S22A1_HUMAN,Medium,Human
+S22A1_HUMAN_Yee_2023_activity,0.167,0.19,0.214,0.213,0.21,0.216,0.133,0.17,0.203,0.211,0.197,0.215,0.198,0.168,0.152,0.171,0.204,0.208,0.18,0.094,0.192,0.199,0.191,0.176,0.187,0.193,0.185,0.201,0.166,0.182,0.191,0.159,0.14,0.168,0.205,0.205,0.171,0.212,0.215,0.217,0.212,0.218,0.164,0.127,0.207,0.157,0.196,0.232,0.193,0.146,0.206,0.195,0.214,0.216,0.212,0.208,0.207,0.217,0.201,0.211,0.206,0.184,10094,Activity,S22A1_HUMAN,Medium,Human
+SAV1_MOUSE_Tsuboyama_2023_2YSB,0.454,0.381,0.423,0.433,0.464,0.474,0.309,0.103,0.454,0.474,0.258,0.268,0.258,0.196,0.423,0.412,0.258,0.278,0.175,0.206,0.103,0.103,0.072,0.093,0.268,0.134,0.103,0.093,0.103,0.062,0.216,0.206,0.144,0.216,0.196,0.165,0.402,0.412,0.392,0.433,0.443,0.433,0.361,0.216,0.34,0.32,0.381,0.371,0.392,0.412,0.351,0.289,0.361,0.268,0.371,0.361,0.309,0.351,0.33,0.351,0.227,0.412,965,Stability,SAV1_MOUSE,High,Eukaryote
+SBI_STAAM_Tsuboyama_2023_2JVG,0.204,0.194,0.223,0.214,0.233,0.214,0.194,0.107,0.214,0.282,0.262,0.184,0.204,0.155,0.175,0.223,0.32,0.272,0.223,0.126,0.194,0.204,0.184,0.165,0.175,0.184,0.204,0.165,0.184,0.165,0.291,0.184,0.117,0.146,0.146,0.126,0.175,0.194,0.204,0.214,0.223,0.223,0.184,0.223,0.214,0.184,0.311,0.33,0.311,0.34,0.291,0.311,0.34,0.32,0.311,0.282,0.272,0.359,0.359,0.34,0.301,0.272,1025,Stability,SBI_STAAM,Medium,Prokaryote
+SC6A4_HUMAN_Young_2021,0.164,0.22,0.201,0.202,0.218,0.204,0.159,0.298,0.217,0.238,0.242,0.212,0.222,0.106,0.135,0.177,0.196,0.213,0.217,0.262,0.244,0.306,0.304,0.286,0.251,0.294,0.285,0.273,0.269,0.226,0.195,0.139,0.187,0.257,0.294,0.289,0.215,0.241,0.251,0.209,0.221,0.217,0.205,0.096,0.247,0.213,0.232,0.266,0.219,0.142,0.193,0.206,0.221,0.219,0.211,0.215,0.21,0.199,0.187,0.201,0.221,0.205,11576,Activity,SC6A4_HUMAN,Medium,Human
+SCIN_STAAR_Tsuboyama_2023_2QFF,0.156,0.139,0.123,0.139,0.131,0.139,0.238,0.074,0.23,0.189,0.246,0.246,0.254,0.172,0.246,0.279,0.328,0.262,0.303,0.098,0.164,0.189,0.197,0.23,0.27,0.254,0.279,0.254,0.238,0.131,0.098,0.082,0.148,0.115,0.123,0.213,0.164,0.172,0.172,0.123,0.148,0.156,0.254,0.238,0.262,0.27,0.287,0.303,0.221,0.197,0.328,0.336,0.336,0.352,0.32,0.402,0.311,0.385,0.27,0.352,0.369,0.393,1212,Stability,SCIN_STAAR,High,Prokaryote
+SCN5A_HUMAN_Glazer_2019,0.13,0.087,0.13,0.13,0.13,0.13,0.217,0.174,0.174,0.174,0.13,0.13,0.174,0.174,0.13,0.087,0.13,0.13,0.174,0.174,0.13,0.174,0.13,0.13,0.13,0.13,0.13,0.087,0.13,0.13,0.087,0.174,0.174,0.087,0.13,0.174,0.043,0.13,0.174,0.087,0.13,0.13,0.174,0.174,0.174,0.087,0.043,0.043,0.13,0.043,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.043,0.13,224,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+SDA_BACSU_Tsuboyama_2023_1PV0,0.531,0.52,0.574,0.563,0.588,0.596,0.13,0.458,0.552,0.578,0.538,0.495,0.516,0.209,0.264,0.523,0.538,0.567,0.545,0.531,0.141,0.181,0.191,0.274,0.191,0.466,0.101,0.347,0.52,0.592,0.509,0.469,0.014,0.116,0.264,0.462,0.542,0.542,0.523,0.578,0.585,0.574,0.224,0.184,0.52,0.264,0.534,0.552,0.592,0.563,0.563,0.567,0.57,0.585,0.57,0.57,0.592,0.578,0.596,0.592,0.592,0.585,2770,Stability,SDA_BACSU,Medium,Prokaryote
+SERC_HUMAN_Xie_2023,0.13,0.156,0.135,0.146,0.141,0.156,0.089,0.146,0.135,0.13,0.135,0.156,0.12,0.073,0.12,0.141,0.12,0.146,0.125,0.135,0.13,0.13,0.151,0.141,0.109,0.109,0.13,0.13,0.156,0.151,0.13,0.161,0.115,0.135,0.167,0.156,0.146,0.135,0.151,0.146,0.141,0.141,0.125,0.073,0.125,0.12,0.135,0.125,0.141,0.141,0.141,0.125,0.13,0.135,0.125,0.109,0.12,0.141,0.12,0.125,0.151,0.135,1914,OrganismalFitness,SERC_HUMAN,High,Human
+SHOC2_HUMAN_Kwon_2022,0.14,0.148,0.156,0.156,0.156,0.154,0.123,0.143,0.16,0.15,0.139,0.145,0.147,0.124,0.12,0.117,0.138,0.127,0.117,0.144,0.124,0.125,0.138,0.123,0.109,0.127,0.134,0.141,0.137,0.143,0.15,0.153,0.111,0.12,0.142,0.144,0.132,0.148,0.136,0.149,0.153,0.157,0.127,0.126,0.138,0.119,0.139,0.139,0.119,0.119,0.143,0.151,0.166,0.146,0.152,0.158,0.154,0.142,0.138,0.148,0.124,0.123,10972,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+SOX30_HUMAN_Tsuboyama_2023_7JJK,0.168,0.149,0.089,0.109,0.119,0.129,0.139,0.089,0.089,0.139,0.139,0.158,0.129,0.178,0.149,0.208,0.188,0.178,0.109,0.099,0.089,0.05,0.099,0.069,0.099,0.089,0.079,0.099,0.119,0.099,0.099,0.089,0.099,0.099,0.05,0.079,0.149,0.099,0.099,0.149,0.119,0.109,0.149,0.099,0.109,0.178,0.139,0.139,0.139,0.178,0.178,0.218,0.228,0.208,0.198,0.208,0.178,0.188,0.178,0.218,0.178,0.158,1010,Stability,SOX30_HUMAN,High,Human
+SPA_STAAU_Tsuboyama_2023_1LP1,0.327,0.384,0.346,0.346,0.36,0.351,0.085,0.275,0.251,0.246,0.289,0.095,0.1,0.076,0.104,0.071,0.152,0.171,0.142,0.327,0.028,0.118,0.308,0.156,0.081,0.057,0.109,0.303,0.322,0.218,0.237,0.204,0.043,0.09,0.085,0.123,0.332,0.336,0.332,0.37,0.379,0.379,0.085,0.1,0.185,0.166,0.28,0.265,0.37,0.355,0.341,0.384,0.389,0.417,0.351,0.379,0.36,0.398,0.351,0.393,0.37,0.318,2105,Stability,SPA_STAAU,Medium,Prokaryote
+SPG1_STRSG_Olson_2014,0.208,0.214,0.094,0.098,0.204,0.206,0.089,0.081,0.108,0.191,0.221,0.2,0.199,0.237,0.212,0.204,0.205,0.203,0.217,0.17,0.221,0.206,0.204,0.213,0.212,0.205,0.215,0.214,0.22,0.219,0.239,0.239,0.141,0.193,0.187,0.186,0.203,0.198,0.195,0.204,0.199,0.201,0.073,0.087,0.101,0.089,0.201,0.189,0.216,0.155,0.247,0.199,0.218,0.215,0.213,0.209,0.226,0.225,0.224,0.227,0.202,0.209,536962,Binding,SPG1_STRSG,Low,Prokaryote
+SPG1_STRSG_Wu_2016,0.135,0.197,0.187,0.206,0.199,0.202,0.174,0.168,0.261,0.246,0.279,0.283,0.267,0.259,0.272,0.283,0.306,0.311,0.32,0.274,0.222,0.212,0.222,0.231,0.202,0.241,0.24,0.229,0.229,0.195,0.333,0.323,0.137,0.232,0.2,0.243,0.185,0.178,0.208,0.216,0.214,0.228,0.186,0.158,0.221,0.165,0.248,0.239,0.334,0.227,0.338,0.34,0.344,0.348,0.34,0.338,0.361,0.349,0.354,0.347,0.34,0.305,149360,Binding,SPG1_STRSG,Medium,Prokaryote
+SPG2_STRSG_Tsuboyama_2023_5UBS,0.151,0.096,0.13,0.11,0.144,0.13,0.219,0.151,0.137,0.205,0.233,0.24,0.205,0.171,0.219,0.199,0.226,0.274,0.233,0.192,0.185,0.253,0.199,0.144,0.123,0.205,0.178,0.26,0.082,0.212,0.144,0.103,0.021,0.164,0.301,0.137,0.212,0.233,0.171,0.164,0.158,0.158,0.205,0.158,0.199,0.178,0.205,0.212,0.212,0.185,0.185,0.192,0.26,0.212,0.219,0.233,0.219,0.192,0.185,0.212,0.288,0.205,1451,Stability,SPG2_STRSG,Medium,Prokaryote
+SPIKE_SARS2_Starr_2020_binding,0.155,0.227,0.181,0.196,0.208,0.213,0.099,0.227,0.229,0.232,0.104,0.111,0.114,0.109,0.126,0.116,0.111,0.111,0.097,0.229,0.205,0.22,0.222,0.21,0.21,0.215,0.2,0.215,0.191,0.227,0.213,0.213,0.159,0.208,0.22,0.227,0.22,0.217,0.222,0.232,0.234,0.232,0.101,0.106,0.109,0.116,0.29,0.227,0.196,0.159,0.155,0.2,0.217,0.203,0.198,0.208,0.184,0.176,0.167,0.2,0.215,0.15,3802,Binding,SPIKE_SARS2,Medium,Virus
+SPIKE_SARS2_Starr_2020_expression,0.141,0.234,0.139,0.193,0.219,0.224,0.098,0.195,0.216,0.244,0.095,0.108,0.108,0.103,0.134,0.108,0.111,0.08,0.072,0.229,0.167,0.18,0.162,0.162,0.198,0.172,0.159,0.18,0.167,0.237,0.201,0.201,0.129,0.172,0.175,0.211,0.165,0.185,0.203,0.211,0.226,0.226,0.103,0.09,0.129,0.118,0.357,0.365,0.326,0.165,0.237,0.278,0.283,0.27,0.303,0.275,0.283,0.254,0.283,0.293,0.332,0.208,3798,Expression,SPIKE_SARS2,Medium,Virus
+SPTN1_CHICK_Tsuboyama_2023_1TUD,0.399,0.486,0.396,0.374,0.433,0.396,0.231,0.318,0.371,0.368,0.495,0.386,0.449,0.065,0.417,0.436,0.386,0.517,0.399,0.427,0.38,0.452,0.38,0.393,0.399,0.467,0.421,0.449,0.47,0.427,0.47,0.486,0.1,0.374,0.33,0.346,0.411,0.433,0.396,0.396,0.396,0.399,0.283,0.125,0.346,0.312,0.374,0.383,0.567,0.558,0.558,0.548,0.558,0.539,0.558,0.548,0.551,0.567,0.536,0.567,0.533,0.349,3201,Stability,SPTN1_CHICK,High,Eukaryote
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,0.296,0.324,0.423,0.437,0.437,0.423,0.042,0.239,0.324,0.38,0.31,0.155,0.296,0.155,0.254,0.282,0.352,0.324,0.352,0.338,0.141,0.296,0.211,0.296,0.324,0.31,0.268,0.268,0.31,0.268,0.239,0.254,0.169,0.183,0.254,0.324,0.338,0.423,0.394,0.451,0.423,0.408,0.183,0.169,0.282,0.38,0.408,0.282,0.366,0.408,0.366,0.366,0.352,0.338,0.296,0.38,0.324,0.296,0.296,0.352,0.296,0.324,707,Stability,SQSTM_MOUSE,Medium,Eukaryote
+SR43C_ARATH_Tsuboyama_2023_2N88,0.308,0.308,0.302,0.283,0.296,0.302,0.057,0.157,0.252,0.264,0.377,0.327,0.34,0.038,0.346,0.371,0.384,0.409,0.409,0.308,0.044,0.189,0.22,0.17,0.245,0.233,0.277,0.296,0.302,0.189,0.327,0.264,0.126,0.189,0.132,0.208,0.352,0.289,0.296,0.321,0.314,0.296,0.277,0.057,0.358,0.333,0.233,0.384,0.428,0.453,0.346,0.34,0.358,0.39,0.352,0.327,0.365,0.39,0.365,0.358,0.44,0.365,1583,Stability,SR43C_ARATH,High,Eukaryote
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,0.231,0.179,0.378,0.333,0.256,0.301,0.218,0.237,0.276,0.314,0.308,0.288,0.301,0.179,0.333,0.301,0.263,0.288,0.333,0.199,0.269,0.244,0.269,0.205,0.231,0.237,0.263,0.192,0.212,0.301,0.186,0.167,0.212,0.237,0.231,0.269,0.256,0.256,0.276,0.308,0.321,0.295,0.231,0.083,0.269,0.288,0.301,0.218,0.378,0.353,0.327,0.327,0.314,0.308,0.314,0.333,0.321,0.327,0.295,0.333,0.237,0.327,1556,Stability,SRBS1_HUMAN,High,Human
+SRC_HUMAN_Ahler_2019,0.121,0.112,0.086,0.101,0.089,0.089,0.13,0.074,0.101,0.098,0.083,0.101,0.092,0.151,0.183,0.13,0.115,0.124,0.089,0.071,0.083,0.056,0.074,0.062,0.077,0.08,0.077,0.071,0.065,0.115,0.074,0.092,0.121,0.08,0.071,0.083,0.086,0.083,0.109,0.086,0.089,0.089,0.148,0.157,0.068,0.109,0.098,0.083,0.08,0.095,0.133,0.148,0.112,0.133,0.139,0.104,0.107,0.133,0.13,0.124,0.086,0.163,3372,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,0.11,0.091,0.096,0.102,0.102,0.082,0.118,0.069,0.055,0.077,0.08,0.091,0.091,0.137,0.132,0.104,0.088,0.099,0.091,0.06,0.052,0.038,0.074,0.049,0.058,0.06,0.077,0.049,0.063,0.124,0.066,0.085,0.121,0.066,0.058,0.06,0.074,0.074,0.077,0.082,0.069,0.077,0.107,0.118,0.052,0.093,0.096,0.074,0.066,0.088,0.107,0.115,0.093,0.113,0.121,0.091,0.099,0.124,0.104,0.107,0.077,0.137,3637,Activity,SRC_HUMAN,Medium,Human
+SRC_HUMAN_Nguyen_2022,0.107,0.083,0.095,0.08,0.065,0.065,0.092,0.08,0.089,0.077,0.074,0.086,0.086,0.131,0.131,0.104,0.101,0.107,0.086,0.08,0.104,0.083,0.101,0.074,0.092,0.086,0.089,0.077,0.065,0.107,0.071,0.098,0.101,0.089,0.077,0.083,0.083,0.086,0.089,0.083,0.071,0.077,0.122,0.119,0.08,0.089,0.11,0.083,0.077,0.092,0.101,0.116,0.095,0.095,0.104,0.086,0.08,0.101,0.095,0.098,0.092,0.131,3366,OrganismalFitness,SRC_HUMAN,Medium,Human
+SUMO1_HUMAN_Weile_2017,0.118,0.141,0.124,0.141,0.112,0.118,0.082,0.165,0.165,0.147,0.229,0.188,0.206,0.071,0.141,0.153,0.182,0.153,0.159,0.176,0.118,0.159,0.153,0.176,0.182,0.176,0.147,0.2,0.188,0.329,0.194,0.2,0.088,0.088,0.176,0.176,0.124,0.135,0.188,0.129,0.129,0.141,0.124,0.082,0.171,0.129,0.129,0.182,0.188,0.141,0.153,0.153,0.159,0.165,0.171,0.165,0.165,0.176,0.176,0.176,0.194,0.106,1700,OrganismalFitness,SUMO1_HUMAN,High,Human
+SYUA_HUMAN_Newberry_2020,0.064,0.032,0.036,0.024,0.036,0.024,0.08,0.044,0.052,0.036,0.028,0.04,0.024,0.036,0.06,0.044,0.032,0.032,0.032,0.04,0.056,0.044,0.02,0.04,0.048,0.028,0.032,0.012,0.036,0.06,0.044,0.06,0.032,0.028,0.036,0.036,0.02,0.032,0.024,0.028,0.028,0.02,0.052,0.064,0.024,0.044,0.064,0.024,0.04,0.076,0.036,0.024,0.032,0.016,0.036,0.04,0.048,0.036,0.044,0.028,0.052,0.048,2497,OrganismalFitness,SYUA_HUMAN,Medium,Human
+TADBP_HUMAN_Bolognesi_2019,0.008,0.017,0.008,0.008,0.008,0.008,0.267,0.0,0.008,0.008,0.067,0.092,0.042,0.125,0.133,0.1,0.05,0.017,0.008,0.008,0.158,0.017,0.008,0.008,0.217,0.033,0.0,0.008,0.008,0.008,0.0,0.0,0.017,0.2,0.267,0.0,0.05,0.092,0.008,0.008,0.017,0.0,0.183,0.25,0.067,0.083,0.2,0.067,0.208,0.117,0.142,0.1,0.1,0.108,0.092,0.117,0.058,0.083,0.1,0.108,0.142,0.158,1196,OrganismalFitness,TADBP_HUMAN,Low,Human
+TAT_HV1BR_Fernandes_2016,0.12,0.139,0.139,0.139,0.139,0.139,0.076,0.133,0.158,0.171,0.127,0.152,0.139,0.114,0.127,0.082,0.089,0.108,0.12,0.158,0.12,0.152,0.152,0.158,0.133,0.146,0.101,0.133,0.133,0.158,0.095,0.095,0.12,0.127,0.139,0.146,0.139,0.127,0.127,0.146,0.139,0.146,0.082,0.063,0.108,0.114,0.101,0.108,0.101,0.089,0.07,0.101,0.133,0.133,0.108,0.101,0.082,0.082,0.133,0.095,0.089,0.114,1577,OrganismalFitness,TAT_HV1BR,High,Virus
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,0.396,0.34,0.377,0.387,0.349,0.349,0.443,0.085,0.368,0.302,0.104,0.217,0.226,0.368,0.17,0.283,0.358,0.151,0.226,0.132,0.094,0.075,0.104,0.142,0.123,0.085,0.104,0.142,0.057,0.085,0.075,0.047,0.349,0.132,0.16,0.274,0.302,0.34,0.34,0.311,0.34,0.311,0.33,0.16,0.17,0.264,0.189,0.142,0.274,0.358,0.311,0.245,0.208,0.349,0.283,0.34,0.292,0.302,0.245,0.292,0.358,0.236,1058,Stability,TCRG1_MOUSE,Medium,Eukaryote
+THO1_YEAST_Tsuboyama_2023_2WQG,0.297,0.289,0.352,0.375,0.391,0.391,0.211,0.227,0.312,0.359,0.312,0.375,0.352,0.102,0.352,0.344,0.336,0.328,0.359,0.391,0.164,0.289,0.227,0.141,0.273,0.305,0.086,0.289,0.422,0.336,0.281,0.219,0.352,0.117,0.336,0.211,0.367,0.43,0.352,0.367,0.422,0.391,0.281,0.258,0.383,0.297,0.422,0.445,0.445,0.508,0.375,0.359,0.398,0.359,0.383,0.359,0.367,0.359,0.344,0.352,0.406,0.391,1279,Stability,THO1_YEAST,High,Eukaryote
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,0.297,0.277,0.243,0.243,0.216,0.216,0.034,0.196,0.297,0.338,0.25,0.23,0.25,0.061,0.27,0.291,0.257,0.25,0.243,0.196,0.209,0.291,0.257,0.277,0.216,0.284,0.203,0.264,0.23,0.23,0.297,0.284,0.142,0.135,0.209,0.216,0.277,0.291,0.27,0.236,0.23,0.236,0.25,0.162,0.351,0.236,0.345,0.345,0.372,0.419,0.223,0.297,0.27,0.264,0.27,0.291,0.297,0.277,0.284,0.264,0.338,0.311,1479,Stability,TNKS2_HUMAN,High,Human
+TPK1_HUMAN_Weile_2017,0.135,0.129,0.132,0.11,0.122,0.122,0.097,0.132,0.135,0.116,0.141,0.15,0.119,0.091,0.129,0.132,0.125,0.129,0.144,0.122,0.094,0.132,0.107,0.141,0.113,0.144,0.125,0.11,0.135,0.15,0.129,0.119,0.088,0.094,0.144,0.138,0.129,0.125,0.138,0.122,0.119,0.135,0.103,0.088,0.116,0.094,0.147,0.138,0.141,0.138,0.132,0.147,0.129,0.15,0.144,0.141,0.141,0.135,0.135,0.122,0.141,0.113,3181,OrganismalFitness,TPK1_HUMAN,Medium,Human
+TPMT_HUMAN_Matreyek_2018,0.181,0.219,0.216,0.197,0.195,0.197,0.107,0.189,0.189,0.192,0.205,0.197,0.2,0.132,0.156,0.205,0.203,0.2,0.184,0.208,0.137,0.164,0.205,0.205,0.167,0.186,0.208,0.192,0.205,0.216,0.244,0.2,0.132,0.129,0.2,0.175,0.175,0.205,0.175,0.205,0.214,0.208,0.132,0.09,0.205,0.173,0.178,0.238,0.222,0.173,0.175,0.173,0.178,0.175,0.181,0.178,0.173,0.197,0.178,0.178,0.197,0.181,3648,Expression,TPMT_HUMAN,Medium,Human
+TPOR_HUMAN_Bridgford_2020,0.368,0.211,0.404,0.281,0.228,0.228,0.351,0.228,0.281,0.228,0.298,0.281,0.298,0.211,0.263,0.263,0.193,0.228,0.246,0.175,0.246,0.281,0.228,0.263,0.298,0.193,0.351,0.351,0.298,0.175,0.175,0.123,0.386,0.246,0.263,0.281,0.368,0.351,0.263,0.281,0.298,0.228,0.228,0.246,0.211,0.298,0.105,0.281,0.298,0.193,0.263,0.246,0.263,0.263,0.281,0.281,0.281,0.263,0.281,0.281,0.281,0.368,562,OrganismalFitness,TPOR_HUMAN,Low,Human
+TRPC_SACS2_Chan_2017,0.382,0.513,0.461,0.474,0.48,0.474,0.125,0.414,0.48,0.48,0.454,0.428,0.434,0.191,0.329,0.401,0.48,0.5,0.447,0.48,0.178,0.355,0.388,0.414,0.395,0.322,0.401,0.421,0.487,0.513,0.474,0.395,0.158,0.322,0.355,0.342,0.375,0.414,0.408,0.461,0.454,0.454,0.237,0.145,0.388,0.296,0.283,0.434,0.454,0.151,0.428,0.441,0.428,0.461,0.421,0.447,0.421,0.447,0.441,0.467,0.513,0.329,1519,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+TRPC_THEMA_Chan_2017,0.303,0.355,0.276,0.316,0.342,0.329,0.197,0.243,0.289,0.336,0.316,0.309,0.336,0.211,0.316,0.336,0.368,0.349,0.336,0.355,0.23,0.283,0.316,0.382,0.329,0.329,0.316,0.362,0.368,0.224,0.336,0.257,0.092,0.27,0.349,0.362,0.309,0.329,0.322,0.336,0.342,0.336,0.316,0.086,0.349,0.303,0.309,0.368,0.316,0.191,0.336,0.349,0.329,0.362,0.342,0.322,0.342,0.322,0.289,0.336,0.382,0.375,1519,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+UBC9_HUMAN_Weile_2017,0.152,0.187,0.171,0.171,0.187,0.187,0.07,0.125,0.14,0.156,0.16,0.144,0.144,0.074,0.093,0.144,0.152,0.16,0.148,0.16,0.109,0.121,0.128,0.109,0.128,0.144,0.128,0.136,0.125,0.144,0.097,0.101,0.054,0.109,0.113,0.128,0.136,0.148,0.16,0.167,0.183,0.175,0.144,0.086,0.128,0.171,0.156,0.16,0.132,0.144,0.144,0.163,0.191,0.16,0.152,0.175,0.167,0.148,0.148,0.16,0.144,0.132,2563,OrganismalFitness,UBC9_HUMAN,Medium,Human
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,0.251,0.386,0.408,0.399,0.427,0.424,0.193,0.333,0.383,0.43,0.49,0.433,0.455,0.229,0.427,0.51,0.499,0.477,0.46,0.402,0.143,0.38,0.397,0.43,0.322,0.408,0.413,0.405,0.419,0.386,0.466,0.405,0.328,0.132,0.16,0.446,0.311,0.311,0.402,0.435,0.438,0.427,0.264,0.171,0.27,0.311,0.372,0.325,0.521,0.342,0.512,0.545,0.534,0.534,0.543,0.51,0.512,0.529,0.526,0.526,0.499,0.388,3622,Stability,UBE4B_HUMAN,High,Human
+UBE4B_MOUSE_Starita_2013,0.156,0.122,0.122,0.122,0.1,0.122,0.122,0.122,0.133,0.111,0.111,0.133,0.111,0.078,0.122,0.122,0.122,0.122,0.089,0.056,0.089,0.056,0.089,0.1,0.078,0.089,0.089,0.1,0.1,0.111,0.056,0.067,0.1,0.111,0.078,0.089,0.156,0.156,0.156,0.122,0.122,0.122,0.1,0.144,0.067,0.122,0.133,0.089,0.044,0.156,0.156,0.122,0.133,0.122,0.133,0.167,0.144,0.133,0.133,0.156,0.167,0.156,899,Activity,UBE4B_MOUSE,Low,Eukaryote
+UBR5_HUMAN_Tsuboyama_2023_1I2T,0.171,0.192,0.137,0.151,0.164,0.164,0.185,0.068,0.158,0.144,0.13,0.185,0.24,0.192,0.192,0.205,0.096,0.205,0.185,0.13,0.137,0.158,0.164,0.178,0.199,0.13,0.178,0.158,0.192,0.178,0.164,0.089,0.185,0.185,0.178,0.192,0.144,0.164,0.164,0.164,0.158,0.192,0.212,0.212,0.212,0.164,0.226,0.185,0.192,0.253,0.164,0.219,0.247,0.24,0.24,0.212,0.219,0.226,0.212,0.233,0.212,0.24,1453,Stability,UBR5_HUMAN,Medium,Human
+VG08_BPP22_Tsuboyama_2023_2GP8,0.288,0.178,0.164,0.164,0.192,0.205,0.233,0.11,0.178,0.26,0.178,0.274,0.247,0.274,0.219,0.192,0.247,0.178,0.123,0.123,0.233,0.219,0.11,0.11,0.205,0.096,0.151,0.123,0.082,0.205,0.096,0.068,0.123,0.192,0.26,0.082,0.233,0.329,0.164,0.205,0.205,0.164,0.205,0.26,0.219,0.164,0.082,0.068,0.096,0.205,0.205,0.151,0.233,0.205,0.233,0.233,0.205,0.247,0.274,0.219,0.26,0.192,723,Stability,VG08_BPP22,High,Virus
+VILI_CHICK_Tsuboyama_2023_1YU5,0.249,0.436,0.502,0.525,0.486,0.502,0.125,0.28,0.444,0.459,0.432,0.412,0.444,0.171,0.117,0.463,0.553,0.494,0.401,0.467,0.109,0.265,0.284,0.296,0.346,0.346,0.331,0.292,0.428,0.447,0.463,0.424,0.187,0.148,0.346,0.319,0.346,0.42,0.385,0.467,0.486,0.463,0.187,0.237,0.479,0.385,0.428,0.475,0.541,0.397,0.541,0.51,0.545,0.545,0.514,0.556,0.553,0.533,0.525,0.556,0.556,0.424,2568,Stability,VILI_CHICK,High,Eukaryote
+VKOR1_HUMAN_Chiasson_2020_abundance,0.252,0.174,0.215,0.219,0.196,0.196,0.137,0.163,0.211,0.204,0.167,0.193,0.181,0.126,0.215,0.241,0.204,0.222,0.178,0.207,0.181,0.152,0.248,0.215,0.211,0.144,0.167,0.148,0.137,0.204,0.167,0.133,0.126,0.159,0.196,0.141,0.23,0.219,0.185,0.196,0.189,0.215,0.137,0.104,0.196,0.215,0.204,0.174,0.2,0.163,0.211,0.181,0.219,0.215,0.222,0.219,0.207,0.181,0.215,0.207,0.156,0.23,2695,Expression,VKOR1_HUMAN,Medium,Human
+VKOR1_HUMAN_Chiasson_2020_activity,0.171,0.171,0.186,0.171,0.171,0.186,0.071,0.157,0.157,0.171,0.129,0.143,0.157,0.071,0.114,0.157,0.171,0.143,0.157,0.2,0.1,0.1,0.143,0.114,0.1,0.114,0.143,0.129,0.086,0.171,0.114,0.143,0.186,0.057,0.057,0.1,0.143,0.157,0.143,0.157,0.143,0.157,0.071,0.057,0.143,0.086,0.171,0.186,0.186,0.143,0.129,0.157,0.129,0.171,0.229,0.157,0.143,0.143,0.129,0.157,0.157,0.114,697,Activity,VKOR1_HUMAN,Medium,Human
+VRPI_BPT7_Tsuboyama_2023_2WNM,0.048,0.152,0.143,0.162,0.133,0.124,0.076,0.048,0.095,0.057,0.181,0.133,0.133,0.114,0.133,0.19,0.229,0.257,0.162,0.076,0.086,0.143,0.095,0.133,0.152,0.105,0.086,0.124,0.248,0.124,0.2,0.19,0.114,0.105,0.162,0.133,0.076,0.095,0.095,0.143,0.133,0.124,0.152,0.105,0.181,0.124,0.276,0.238,0.21,0.267,0.333,0.267,0.333,0.333,0.295,0.362,0.352,0.324,0.343,0.352,0.381,0.267,1047,Stability,VRPI_BPT7,Medium,Virus
+YAIA_ECOLI_Tsuboyama_2023_2KVT,0.18,0.275,0.302,0.302,0.296,0.296,0.069,0.249,0.36,0.349,0.265,0.122,0.312,0.079,0.095,0.344,0.392,0.487,0.439,0.317,0.048,0.058,0.048,0.212,0.063,0.074,0.053,0.074,0.36,0.328,0.307,0.265,0.053,0.048,0.116,0.392,0.18,0.243,0.333,0.296,0.312,0.339,0.164,0.079,0.18,0.175,0.439,0.392,0.434,0.365,0.429,0.444,0.381,0.423,0.429,0.45,0.439,0.429,0.402,0.444,0.439,0.222,1890,Stability,YAIA_ECOLI,Medium,Prokaryote
+YAP1_HUMAN_Araya_2012,0.22,0.155,0.232,0.231,0.233,0.228,0.176,0.073,0.065,0.057,0.175,0.135,0.116,0.181,0.202,0.225,0.209,0.191,0.152,0.093,0.08,0.068,0.065,0.086,0.096,0.06,0.066,0.089,0.062,0.136,0.125,0.136,0.138,0.133,0.067,0.092,0.188,0.127,0.15,0.209,0.184,0.204,0.259,0.034,0.178,0.263,0.148,0.182,0.125,0.162,0.115,0.123,0.138,0.151,0.107,0.13,0.149,0.147,0.166,0.137,0.234,0.195,10075,Binding,YAP1_HUMAN,Low,Human
+YNZC_BACSU_Tsuboyama_2023_2JVD,0.374,0.374,0.378,0.374,0.378,0.378,0.383,0.33,0.391,0.409,0.457,0.448,0.457,0.461,0.474,0.439,0.483,0.47,0.43,0.378,0.435,0.491,0.509,0.517,0.478,0.443,0.478,0.43,0.422,0.435,0.422,0.391,0.439,0.435,0.452,0.426,0.426,0.391,0.383,0.4,0.387,0.391,0.422,0.361,0.422,0.439,0.439,0.496,0.548,0.53,0.53,0.474,0.504,0.504,0.513,0.517,0.496,0.5,0.491,0.496,0.457,0.591,2300,Stability,YNZC_BACSU,Medium,Prokaryote
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.html b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.html
new file mode 100644
index 0000000..d385cf4
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_DMS_level.html
@@ -0,0 +1,15196 @@
+
+
+
+ score |
+ Site-Independent |
+ EVmutation |
+ DeepSequence (single) |
+ DeepSequence (ensemble) |
+ EVE (single) |
+ EVE (ensemble) |
+ Unirep |
+ Unirep evotuned |
+ MSA Transformer (single) |
+ MSA Transformer (ensemble) |
+ ESM-1b |
+ ESM-1v (single) |
+ ESM-1v (ensemble) |
+ ESM2 (8M) |
+ ESM2 (35M) |
+ ESM2 (150M) |
+ ESM2 (650M) |
+ ESM2 (3B) |
+ ESM2 (15B) |
+ Wavenet |
+ RITA S |
+ RITA M |
+ RITA L |
+ RITA XL |
+ Progen2 S |
+ Progen2 M |
+ Progen2 Base |
+ Progen2 L |
+ Progen2 XL |
+ GEMME |
+ VESPA |
+ VESPAl |
+ ProtGPT2 |
+ Tranception S no retrieval |
+ Tranception M no retrieval |
+ Tranception L no retrieval |
+ Tranception S |
+ Tranception M |
+ Tranception L |
+ TranceptEVE S |
+ TranceptEVE M |
+ TranceptEVE L |
+ CARP (38M) |
+ CARP (600K) |
+ CARP (640M) |
+ CARP (76M) |
+ MIF |
+ MIF-ST |
+ ESM-IF1 |
+ ProteinMPNN |
+ ProtSSN (k=10 h=512) |
+ ProtSSN (k=10 h=768) |
+ ProtSSN (k=10 h=1280) |
+ ProtSSN (k=20 h=512) |
+ ProtSSN (k=20 h=768) |
+ ProtSSN (k=20 h=1280) |
+ ProtSSN (k=30 h=512) |
+ ProtSSN (k=30 h=768) |
+ ProtSSN (k=30 h=1280) |
+ ProtSSN (ensemble) |
+ SaProt (650M) |
+ SaProt (35M) |
+ Number of Mutants |
+ Selection Type |
+ UniProt ID |
+ MSA_Neff_L_category |
+ Taxon |
+
+
+
+
+ A0A140D2T1_ZIKV_Sourisseau_2019 |
+ 0.278 |
+ 0.279 |
+ 0.204 |
+ 0.205 |
+ 0.273 |
+ 0.280 |
+ 0.093 |
+ 0.086 |
+ 0.315 |
+ 0.318 |
+ 0.117 |
+ 0.100 |
+ 0.124 |
+ 0.081 |
+ 0.096 |
+ 0.086 |
+ 0.127 |
+ 0.237 |
+ 0.262 |
+ 0.178 |
+ 0.268 |
+ 0.280 |
+ 0.285 |
+ 0.278 |
+ 0.251 |
+ 0.267 |
+ 0.286 |
+ 0.254 |
+ 0.277 |
+ 0.291 |
+ 0.257 |
+ 0.219 |
+ 0.099 |
+ 0.223 |
+ 0.232 |
+ 0.241 |
+ 0.265 |
+ 0.267 |
+ 0.286 |
+ 0.268 |
+ 0.269 |
+ 0.285 |
+ 0.104 |
+ 0.103 |
+ 0.148 |
+ 0.101 |
+ 0.169 |
+ 0.201 |
+ 0.151 |
+ 0.153 |
+ 0.180 |
+ 0.168 |
+ 0.183 |
+ 0.183 |
+ 0.181 |
+ 0.166 |
+ 0.148 |
+ 0.172 |
+ 0.159 |
+ 0.171 |
+ 0.160 |
+ 0.135 |
+ 9576 |
+ OrganismalFitness |
+ A0A140D2T1_ZIKV |
+ Medium |
+ Virus |
+
+
+ A0A192B1T2_9HIV1_Haddox_2018 |
+ 0.276 |
+ 0.270 |
+ 0.269 |
+ 0.273 |
+ 0.277 |
+ 0.280 |
+ 0.095 |
+ 0.269 |
+ 0.293 |
+ 0.293 |
+ 0.234 |
+ 0.259 |
+ 0.266 |
+ 0.103 |
+ 0.094 |
+ 0.095 |
+ 0.103 |
+ 0.110 |
+ 0.117 |
+ 0.281 |
+ 0.279 |
+ 0.280 |
+ 0.286 |
+ 0.291 |
+ 0.273 |
+ 0.289 |
+ 0.279 |
+ 0.282 |
+ 0.293 |
+ 0.293 |
+ 0.263 |
+ 0.248 |
+ 0.221 |
+ 0.274 |
+ 0.284 |
+ 0.289 |
+ 0.281 |
+ 0.284 |
+ 0.286 |
+ 0.286 |
+ 0.282 |
+ 0.285 |
+ 0.192 |
+ 0.072 |
+ 0.267 |
+ 0.261 |
+ 0.138 |
+ 0.268 |
+ 0.121 |
+ 0.145 |
+ 0.129 |
+ 0.148 |
+ 0.154 |
+ 0.138 |
+ 0.143 |
+ 0.141 |
+ 0.144 |
+ 0.126 |
+ 0.121 |
+ 0.143 |
+ 0.137 |
+ 0.115 |
+ 12577 |
+ OrganismalFitness |
+ A0A192B1T2_9HIV1 |
+ Medium |
+ Virus |
+
+
+ A0A1I9GEU1_NEIME_Kennouche_2019 |
+ 0.086 |
+ 0.097 |
+ 0.108 |
+ 0.108 |
+ 0.086 |
+ 0.097 |
+ 0.075 |
+ 0.129 |
+ 0.108 |
+ 0.075 |
+ 0.118 |
+ 0.129 |
+ 0.108 |
+ 0.086 |
+ 0.097 |
+ 0.118 |
+ 0.075 |
+ 0.086 |
+ 0.108 |
+ 0.065 |
+ 0.108 |
+ 0.097 |
+ 0.086 |
+ 0.086 |
+ 0.086 |
+ 0.097 |
+ 0.086 |
+ 0.108 |
+ 0.075 |
+ 0.065 |
+ 0.108 |
+ 0.097 |
+ 0.183 |
+ 0.086 |
+ 0.108 |
+ 0.097 |
+ 0.108 |
+ 0.086 |
+ 0.097 |
+ 0.086 |
+ 0.097 |
+ 0.097 |
+ 0.075 |
+ 0.032 |
+ 0.097 |
+ 0.065 |
+ 0.129 |
+ 0.108 |
+ 0.140 |
+ 0.118 |
+ 0.129 |
+ 0.097 |
+ 0.118 |
+ 0.140 |
+ 0.075 |
+ 0.118 |
+ 0.097 |
+ 0.151 |
+ 0.097 |
+ 0.097 |
+ 0.140 |
+ 0.054 |
+ 922 |
+ Activity |
+ A0A1I9GEU1_NEIME |
+ Medium |
+ Prokaryote |
+
+
+ A0A247D711_LISMN_Stadelmann_2021 |
+ 0.139 |
+ 0.151 |
+ 0.120 |
+ 0.114 |
+ 0.163 |
+ 0.151 |
+ 0.078 |
+ 0.096 |
+ 0.163 |
+ 0.169 |
+ 0.120 |
+ 0.120 |
+ 0.127 |
+ 0.090 |
+ 0.090 |
+ 0.114 |
+ 0.133 |
+ 0.120 |
+ 0.127 |
+ 0.157 |
+ 0.090 |
+ 0.096 |
+ 0.114 |
+ 0.114 |
+ 0.066 |
+ 0.127 |
+ 0.133 |
+ 0.078 |
+ 0.114 |
+ 0.163 |
+ 0.157 |
+ 0.187 |
+ 0.060 |
+ 0.090 |
+ 0.108 |
+ 0.084 |
+ 0.133 |
+ 0.139 |
+ 0.151 |
+ 0.157 |
+ 0.157 |
+ 0.157 |
+ 0.114 |
+ 0.102 |
+ 0.120 |
+ 0.108 |
+ 0.139 |
+ 0.145 |
+ 0.193 |
+ 0.175 |
+ 0.163 |
+ 0.181 |
+ 0.120 |
+ 0.120 |
+ 0.145 |
+ 0.120 |
+ 0.120 |
+ 0.127 |
+ 0.139 |
+ 0.145 |
+ 0.133 |
+ 0.102 |
+ 1653 |
+ Activity |
+ A0A247D711_LISMN |
+ High |
+ Prokaryote |
+
+
+ A0A2Z5U3Z0_9INFA_Doud_2016 |
+ 0.257 |
+ 0.327 |
+ 0.310 |
+ 0.316 |
+ 0.317 |
+ 0.313 |
+ 0.089 |
+ 0.271 |
+ 0.324 |
+ 0.326 |
+ 0.109 |
+ 0.293 |
+ 0.329 |
+ 0.103 |
+ 0.104 |
+ 0.104 |
+ 0.271 |
+ 0.301 |
+ 0.297 |
+ 0.299 |
+ 0.305 |
+ 0.322 |
+ 0.324 |
+ 0.329 |
+ 0.203 |
+ 0.311 |
+ 0.332 |
+ 0.315 |
+ 0.303 |
+ 0.314 |
+ 0.244 |
+ 0.190 |
+ 0.105 |
+ 0.294 |
+ 0.319 |
+ 0.308 |
+ 0.310 |
+ 0.313 |
+ 0.310 |
+ 0.329 |
+ 0.327 |
+ 0.324 |
+ 0.091 |
+ 0.095 |
+ 0.166 |
+ 0.100 |
+ 0.176 |
+ 0.218 |
+ 0.194 |
+ 0.162 |
+ 0.247 |
+ 0.261 |
+ 0.257 |
+ 0.264 |
+ 0.255 |
+ 0.264 |
+ 0.252 |
+ 0.265 |
+ 0.267 |
+ 0.258 |
+ 0.178 |
+ 0.166 |
+ 10715 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A0A2Z5U3Z0_9INFA_Wu_2014 |
+ 0.209 |
+ 0.234 |
+ 0.234 |
+ 0.247 |
+ 0.238 |
+ 0.234 |
+ 0.098 |
+ 0.272 |
+ 0.247 |
+ 0.243 |
+ 0.153 |
+ 0.217 |
+ 0.247 |
+ 0.136 |
+ 0.136 |
+ 0.140 |
+ 0.213 |
+ 0.200 |
+ 0.213 |
+ 0.306 |
+ 0.221 |
+ 0.234 |
+ 0.260 |
+ 0.255 |
+ 0.183 |
+ 0.217 |
+ 0.260 |
+ 0.213 |
+ 0.243 |
+ 0.243 |
+ 0.247 |
+ 0.217 |
+ 0.085 |
+ 0.221 |
+ 0.226 |
+ 0.230 |
+ 0.243 |
+ 0.260 |
+ 0.260 |
+ 0.230 |
+ 0.238 |
+ 0.230 |
+ 0.128 |
+ 0.111 |
+ 0.179 |
+ 0.145 |
+ 0.162 |
+ 0.196 |
+ 0.196 |
+ 0.136 |
+ 0.209 |
+ 0.221 |
+ 0.217 |
+ 0.209 |
+ 0.200 |
+ 0.221 |
+ 0.196 |
+ 0.200 |
+ 0.209 |
+ 0.204 |
+ 0.123 |
+ 0.145 |
+ 2350 |
+ OrganismalFitness |
+ A0A2Z5U3Z0_9INFA |
+ Medium |
+ Virus |
+
+
+ A4_HUMAN_Seuma_2022 |
+ 0.104 |
+ 0.158 |
+ 0.167 |
+ 0.166 |
+ 0.128 |
+ 0.134 |
+ 0.072 |
+ 0.130 |
+ 0.164 |
+ 0.155 |
+ 0.089 |
+ 0.123 |
+ 0.149 |
+ 0.069 |
+ 0.068 |
+ 0.076 |
+ 0.078 |
+ 0.126 |
+ 0.126 |
+ 0.141 |
+ 0.096 |
+ 0.105 |
+ 0.096 |
+ 0.121 |
+ 0.104 |
+ 0.099 |
+ 0.095 |
+ 0.095 |
+ 0.103 |
+ 0.144 |
+ 0.130 |
+ 0.140 |
+ 0.159 |
+ 0.076 |
+ 0.146 |
+ 0.173 |
+ 0.097 |
+ 0.155 |
+ 0.164 |
+ 0.119 |
+ 0.144 |
+ 0.152 |
+ 0.060 |
+ 0.059 |
+ 0.134 |
+ 0.070 |
+ 0.111 |
+ 0.092 |
+ 0.032 |
+ 0.111 |
+ 0.142 |
+ 0.148 |
+ 0.128 |
+ 0.122 |
+ 0.115 |
+ 0.131 |
+ 0.125 |
+ 0.115 |
+ 0.131 |
+ 0.124 |
+ 0.087 |
+ 0.091 |
+ 14811 |
+ Stability |
+ A4_HUMAN |
+ Low |
+ Human |
+
+
+ A4D664_9INFA_Soh_2019 |
+ 0.242 |
+ 0.245 |
+ 0.247 |
+ 0.247 |
+ 0.256 |
+ 0.257 |
+ 0.105 |
+ 0.228 |
+ 0.191 |
+ 0.188 |
+ 0.096 |
+ 0.095 |
+ 0.096 |
+ 0.105 |
+ 0.109 |
+ 0.104 |
+ 0.135 |
+ 0.146 |
+ 0.198 |
+ 0.222 |
+ 0.204 |
+ 0.230 |
+ 0.234 |
+ 0.215 |
+ 0.107 |
+ 0.168 |
+ 0.171 |
+ 0.173 |
+ 0.200 |
+ 0.237 |
+ 0.199 |
+ 0.191 |
+ 0.106 |
+ 0.202 |
+ 0.231 |
+ 0.235 |
+ 0.234 |
+ 0.245 |
+ 0.245 |
+ 0.258 |
+ 0.259 |
+ 0.261 |
+ 0.105 |
+ 0.109 |
+ 0.125 |
+ 0.112 |
+ 0.122 |
+ 0.129 |
+ 0.072 |
+ 0.130 |
+ 0.124 |
+ 0.123 |
+ 0.105 |
+ 0.133 |
+ 0.123 |
+ 0.130 |
+ 0.119 |
+ 0.127 |
+ 0.129 |
+ 0.119 |
+ 0.118 |
+ 0.109 |
+ 14421 |
+ OrganismalFitness |
+ A4D664_9INFA |
+ Medium |
+ Virus |
+
+
+ A4GRB6_PSEAI_Chen_2020 |
+ 0.180 |
+ 0.204 |
+ 0.198 |
+ 0.220 |
+ 0.184 |
+ 0.184 |
+ 0.138 |
+ 0.176 |
+ 0.244 |
+ 0.226 |
+ 0.257 |
+ 0.234 |
+ 0.253 |
+ 0.146 |
+ 0.216 |
+ 0.263 |
+ 0.309 |
+ 0.261 |
+ 0.271 |
+ 0.230 |
+ 0.136 |
+ 0.172 |
+ 0.194 |
+ 0.232 |
+ 0.184 |
+ 0.230 |
+ 0.238 |
+ 0.234 |
+ 0.242 |
+ 0.192 |
+ 0.253 |
+ 0.210 |
+ 0.054 |
+ 0.174 |
+ 0.204 |
+ 0.216 |
+ 0.238 |
+ 0.236 |
+ 0.214 |
+ 0.226 |
+ 0.214 |
+ 0.212 |
+ 0.138 |
+ 0.092 |
+ 0.305 |
+ 0.210 |
+ 0.283 |
+ 0.309 |
+ 0.248 |
+ 0.222 |
+ 0.329 |
+ 0.307 |
+ 0.293 |
+ 0.337 |
+ 0.331 |
+ 0.323 |
+ 0.317 |
+ 0.343 |
+ 0.329 |
+ 0.331 |
+ 0.271 |
+ 0.234 |
+ 5004 |
+ OrganismalFitness |
+ A4GRB6_PSEAI |
+ High |
+ Prokaryote |
+
+
+ AACC1_PSEAI_Dandage_2018 |
+ 0.177 |
+ 0.260 |
+ 0.188 |
+ 0.204 |
+ 0.193 |
+ 0.193 |
+ 0.144 |
+ 0.094 |
+ 0.249 |
+ 0.238 |
+ 0.204 |
+ 0.210 |
+ 0.199 |
+ 0.133 |
+ 0.155 |
+ 0.166 |
+ 0.215 |
+ 0.210 |
+ 0.238 |
+ 0.227 |
+ 0.149 |
+ 0.166 |
+ 0.199 |
+ 0.138 |
+ 0.160 |
+ 0.204 |
+ 0.193 |
+ 0.193 |
+ 0.210 |
+ 0.276 |
+ 0.160 |
+ 0.122 |
+ 0.083 |
+ 0.199 |
+ 0.177 |
+ 0.227 |
+ 0.227 |
+ 0.204 |
+ 0.210 |
+ 0.210 |
+ 0.210 |
+ 0.215 |
+ 0.138 |
+ 0.127 |
+ 0.160 |
+ 0.166 |
+ 0.227 |
+ 0.238 |
+ 0.227 |
+ 0.188 |
+ 0.249 |
+ 0.221 |
+ 0.221 |
+ 0.232 |
+ 0.232 |
+ 0.265 |
+ 0.215 |
+ 0.238 |
+ 0.265 |
+ 0.249 |
+ 0.171 |
+ 0.166 |
+ 1801 |
+ OrganismalFitness |
+ AACC1_PSEAI |
+ High |
+ Prokaryote |
+
+
+ ACE2_HUMAN_Chan_2020 |
+ 0.157 |
+ 0.157 |
+ 0.211 |
+ 0.193 |
+ 0.166 |
+ 0.157 |
+ 0.054 |
+ 0.148 |
+ 0.157 |
+ 0.148 |
+ 0.103 |
+ 0.143 |
+ 0.139 |
+ 0.058 |
+ 0.090 |
+ 0.108 |
+ 0.139 |
+ 0.121 |
+ 0.139 |
+ 0.193 |
+ 0.126 |
+ 0.202 |
+ 0.220 |
+ 0.188 |
+ 0.157 |
+ 0.188 |
+ 0.184 |
+ 0.197 |
+ 0.184 |
+ 0.197 |
+ 0.157 |
+ 0.139 |
+ 0.143 |
+ 0.108 |
+ 0.188 |
+ 0.170 |
+ 0.170 |
+ 0.152 |
+ 0.175 |
+ 0.184 |
+ 0.170 |
+ 0.184 |
+ 0.085 |
+ 0.072 |
+ 0.121 |
+ 0.067 |
+ 0.076 |
+ 0.121 |
+ 0.143 |
+ 0.090 |
+ 0.121 |
+ 0.130 |
+ 0.117 |
+ 0.108 |
+ 0.117 |
+ 0.139 |
+ 0.126 |
+ 0.112 |
+ 0.121 |
+ 0.099 |
+ 0.112 |
+ 0.103 |
+ 2223 |
+ Binding |
+ ACE2_HUMAN |
+ Medium |
+ Human |
+
+
+ ADRB2_HUMAN_Jones_2020 |
+ 0.142 |
+ 0.168 |
+ 0.168 |
+ 0.174 |
+ 0.201 |
+ 0.206 |
+ 0.142 |
+ 0.206 |
+ 0.171 |
+ 0.188 |
+ 0.178 |
+ 0.156 |
+ 0.177 |
+ 0.146 |
+ 0.150 |
+ 0.158 |
+ 0.158 |
+ 0.167 |
+ 0.172 |
+ 0.159 |
+ 0.168 |
+ 0.174 |
+ 0.172 |
+ 0.171 |
+ 0.188 |
+ 0.179 |
+ 0.183 |
+ 0.181 |
+ 0.159 |
+ 0.162 |
+ 0.155 |
+ 0.133 |
+ 0.121 |
+ 0.186 |
+ 0.168 |
+ 0.182 |
+ 0.165 |
+ 0.165 |
+ 0.174 |
+ 0.191 |
+ 0.192 |
+ 0.204 |
+ 0.163 |
+ 0.118 |
+ 0.172 |
+ 0.177 |
+ 0.156 |
+ 0.181 |
+ 0.167 |
+ 0.127 |
+ 0.155 |
+ 0.160 |
+ 0.135 |
+ 0.165 |
+ 0.160 |
+ 0.153 |
+ 0.150 |
+ 0.154 |
+ 0.145 |
+ 0.154 |
+ 0.171 |
+ 0.160 |
+ 7800 |
+ Activity |
+ ADRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ AICDA_HUMAN_Gajula_2014_3cycles |
+ 0.143 |
+ 0.238 |
+ 0.333 |
+ 0.190 |
+ 0.238 |
+ 0.238 |
+ 0.048 |
+ 0.143 |
+ 0.190 |
+ 0.190 |
+ 0.238 |
+ 0.190 |
+ 0.190 |
+ 0.095 |
+ 0.000 |
+ 0.143 |
+ 0.190 |
+ 0.143 |
+ 0.095 |
+ 0.190 |
+ 0.048 |
+ 0.095 |
+ 0.286 |
+ 0.190 |
+ 0.048 |
+ 0.190 |
+ 0.190 |
+ 0.190 |
+ 0.190 |
+ 0.190 |
+ 0.190 |
+ 0.238 |
+ 0.095 |
+ 0.095 |
+ 0.095 |
+ 0.143 |
+ 0.238 |
+ 0.190 |
+ 0.238 |
+ 0.238 |
+ 0.238 |
+ 0.190 |
+ 0.048 |
+ 0.048 |
+ 0.190 |
+ 0.048 |
+ 0.333 |
+ 0.143 |
+ 0.143 |
+ 0.333 |
+ 0.143 |
+ 0.333 |
+ 0.143 |
+ 0.286 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.048 |
+ 0.048 |
+ 209 |
+ Activity |
+ AICDA_HUMAN |
+ Medium |
+ Human |
+
+
+ AMFR_HUMAN_Tsuboyama_2023_4G3O |
+ 0.289 |
+ 0.372 |
+ 0.372 |
+ 0.369 |
+ 0.359 |
+ 0.379 |
+ 0.070 |
+ 0.336 |
+ 0.352 |
+ 0.369 |
+ 0.433 |
+ 0.101 |
+ 0.319 |
+ 0.057 |
+ 0.124 |
+ 0.409 |
+ 0.386 |
+ 0.409 |
+ 0.413 |
+ 0.366 |
+ 0.057 |
+ 0.117 |
+ 0.346 |
+ 0.342 |
+ 0.114 |
+ 0.339 |
+ 0.342 |
+ 0.265 |
+ 0.383 |
+ 0.336 |
+ 0.383 |
+ 0.346 |
+ 0.326 |
+ 0.057 |
+ 0.094 |
+ 0.114 |
+ 0.339 |
+ 0.336 |
+ 0.346 |
+ 0.383 |
+ 0.379 |
+ 0.383 |
+ 0.141 |
+ 0.134 |
+ 0.396 |
+ 0.315 |
+ 0.359 |
+ 0.356 |
+ 0.416 |
+ 0.406 |
+ 0.406 |
+ 0.409 |
+ 0.399 |
+ 0.386 |
+ 0.403 |
+ 0.396 |
+ 0.379 |
+ 0.386 |
+ 0.403 |
+ 0.396 |
+ 0.430 |
+ 0.235 |
+ 2972 |
+ Stability |
+ AMFR_HUMAN |
+ Medium |
+ Human |
+
+
+ AMIE_PSEAE_Wrenbeck_2017 |
+ 0.143 |
+ 0.114 |
+ 0.130 |
+ 0.124 |
+ 0.127 |
+ 0.119 |
+ 0.103 |
+ 0.159 |
+ 0.119 |
+ 0.141 |
+ 0.181 |
+ 0.144 |
+ 0.128 |
+ 0.152 |
+ 0.167 |
+ 0.197 |
+ 0.193 |
+ 0.233 |
+ 0.127 |
+ 0.165 |
+ 0.188 |
+ 0.175 |
+ 0.157 |
+ 0.130 |
+ 0.151 |
+ 0.151 |
+ 0.148 |
+ 0.169 |
+ 0.103 |
+ 0.104 |
+ 0.130 |
+ 0.133 |
+ 0.074 |
+ 0.188 |
+ 0.136 |
+ 0.112 |
+ 0.127 |
+ 0.120 |
+ 0.106 |
+ 0.124 |
+ 0.122 |
+ 0.124 |
+ 0.159 |
+ 0.146 |
+ 0.193 |
+ 0.170 |
+ 0.185 |
+ 0.226 |
+ 0.207 |
+ 0.143 |
+ 0.188 |
+ 0.194 |
+ 0.180 |
+ 0.175 |
+ 0.159 |
+ 0.154 |
+ 0.165 |
+ 0.169 |
+ 0.165 |
+ 0.170 |
+ 0.234 |
+ 0.161 |
+ 6227 |
+ Activity |
+ AMIE_PSEAE |
+ High |
+ Prokaryote |
+
+
+ ANCSZ_Hobbs_2022 |
+ 0.214 |
+ 0.208 |
+ 0.208 |
+ 0.206 |
+ 0.206 |
+ 0.199 |
+ 0.156 |
+ 0.176 |
+ 0.154 |
+ 0.154 |
+ 0.156 |
+ 0.171 |
+ 0.165 |
+ 0.195 |
+ 0.231 |
+ 0.251 |
+ 0.244 |
+ 0.225 |
+ 0.212 |
+ 0.086 |
+ 0.169 |
+ 0.128 |
+ 0.146 |
+ 0.148 |
+ 0.173 |
+ 0.150 |
+ 0.141 |
+ 0.137 |
+ 0.113 |
+ 0.191 |
+ 0.139 |
+ 0.126 |
+ 0.139 |
+ 0.165 |
+ 0.126 |
+ 0.137 |
+ 0.193 |
+ 0.167 |
+ 0.169 |
+ 0.203 |
+ 0.186 |
+ 0.188 |
+ 0.206 |
+ 0.169 |
+ 0.152 |
+ 0.188 |
+ 0.173 |
+ 0.158 |
+ 0.188 |
+ 0.133 |
+ 0.251 |
+ 0.242 |
+ 0.214 |
+ 0.225 |
+ 0.225 |
+ 0.236 |
+ 0.238 |
+ 0.257 |
+ 0.233 |
+ 0.240 |
+ 0.208 |
+ 0.236 |
+ 4670 |
+ Activity |
+ ANCSZ |
+ Medium |
+ Eukaryote |
+
+
+ ARGR_ECOLI_Tsuboyama_2023_1AOY |
+ 0.209 |
+ 0.194 |
+ 0.248 |
+ 0.240 |
+ 0.240 |
+ 0.217 |
+ 0.155 |
+ 0.132 |
+ 0.171 |
+ 0.202 |
+ 0.233 |
+ 0.178 |
+ 0.233 |
+ 0.194 |
+ 0.233 |
+ 0.287 |
+ 0.264 |
+ 0.240 |
+ 0.202 |
+ 0.178 |
+ 0.217 |
+ 0.132 |
+ 0.140 |
+ 0.140 |
+ 0.217 |
+ 0.116 |
+ 0.155 |
+ 0.147 |
+ 0.147 |
+ 0.163 |
+ 0.109 |
+ 0.062 |
+ 0.031 |
+ 0.178 |
+ 0.132 |
+ 0.132 |
+ 0.194 |
+ 0.217 |
+ 0.186 |
+ 0.248 |
+ 0.209 |
+ 0.194 |
+ 0.194 |
+ 0.155 |
+ 0.233 |
+ 0.248 |
+ 0.372 |
+ 0.310 |
+ 0.302 |
+ 0.279 |
+ 0.240 |
+ 0.279 |
+ 0.264 |
+ 0.264 |
+ 0.256 |
+ 0.256 |
+ 0.233 |
+ 0.256 |
+ 0.271 |
+ 0.256 |
+ 0.295 |
+ 0.310 |
+ 1287 |
+ Stability |
+ ARGR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ B2L11_HUMAN_Dutta_2010_binding-Mcl-1 |
+ 0.294 |
+ 0.235 |
+ 0.353 |
+ 0.294 |
+ 0.294 |
+ 0.294 |
+ 0.176 |
+ 0.294 |
+ 0.294 |
+ 0.294 |
+ 0.059 |
+ 0.059 |
+ 0.471 |
+ 0.235 |
+ 0.235 |
+ 0.235 |
+ 0.353 |
+ 0.294 |
+ 0.294 |
+ 0.294 |
+ 0.176 |
+ 0.059 |
+ 0.176 |
+ 0.235 |
+ 0.059 |
+ 0.235 |
+ 0.294 |
+ 0.118 |
+ 0.235 |
+ 0.235 |
+ 0.235 |
+ 0.235 |
+ 0.176 |
+ 0.294 |
+ 0.118 |
+ 0.176 |
+ 0.353 |
+ 0.353 |
+ 0.294 |
+ 0.353 |
+ 0.353 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.412 |
+ 0.176 |
+ 0.176 |
+ 0.353 |
+ 0.235 |
+ 0.059 |
+ 0.235 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.176 |
+ 0.235 |
+ 0.235 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.353 |
+ 0.176 |
+ 170 |
+ Binding |
+ B2L11_HUMAN |
+ Low |
+ Human |
+
+
+ BBC1_YEAST_Tsuboyama_2023_1TG0 |
+ 0.217 |
+ 0.280 |
+ 0.203 |
+ 0.203 |
+ 0.213 |
+ 0.179 |
+ 0.184 |
+ 0.174 |
+ 0.285 |
+ 0.285 |
+ 0.271 |
+ 0.290 |
+ 0.309 |
+ 0.126 |
+ 0.275 |
+ 0.300 |
+ 0.372 |
+ 0.329 |
+ 0.329 |
+ 0.251 |
+ 0.208 |
+ 0.227 |
+ 0.251 |
+ 0.242 |
+ 0.126 |
+ 0.261 |
+ 0.198 |
+ 0.227 |
+ 0.217 |
+ 0.188 |
+ 0.184 |
+ 0.140 |
+ 0.150 |
+ 0.232 |
+ 0.188 |
+ 0.242 |
+ 0.222 |
+ 0.232 |
+ 0.266 |
+ 0.208 |
+ 0.174 |
+ 0.198 |
+ 0.227 |
+ 0.077 |
+ 0.217 |
+ 0.232 |
+ 0.285 |
+ 0.246 |
+ 0.348 |
+ 0.343 |
+ 0.324 |
+ 0.338 |
+ 0.329 |
+ 0.382 |
+ 0.353 |
+ 0.396 |
+ 0.362 |
+ 0.362 |
+ 0.362 |
+ 0.353 |
+ 0.314 |
+ 0.353 |
+ 2069 |
+ Stability |
+ BBC1_YEAST |
+ High |
+ Eukaryote |
+
+
+ BCHB_CHLTE_Tsuboyama_2023_2KRU |
+ 0.253 |
+ 0.247 |
+ 0.228 |
+ 0.241 |
+ 0.234 |
+ 0.247 |
+ 0.127 |
+ 0.184 |
+ 0.196 |
+ 0.278 |
+ 0.241 |
+ 0.297 |
+ 0.304 |
+ 0.133 |
+ 0.158 |
+ 0.228 |
+ 0.335 |
+ 0.266 |
+ 0.310 |
+ 0.272 |
+ 0.120 |
+ 0.215 |
+ 0.272 |
+ 0.310 |
+ 0.241 |
+ 0.291 |
+ 0.234 |
+ 0.297 |
+ 0.278 |
+ 0.253 |
+ 0.215 |
+ 0.177 |
+ 0.095 |
+ 0.177 |
+ 0.228 |
+ 0.259 |
+ 0.253 |
+ 0.272 |
+ 0.272 |
+ 0.241 |
+ 0.241 |
+ 0.253 |
+ 0.190 |
+ 0.158 |
+ 0.266 |
+ 0.234 |
+ 0.297 |
+ 0.342 |
+ 0.386 |
+ 0.323 |
+ 0.386 |
+ 0.361 |
+ 0.342 |
+ 0.373 |
+ 0.354 |
+ 0.354 |
+ 0.361 |
+ 0.361 |
+ 0.310 |
+ 0.361 |
+ 0.361 |
+ 0.222 |
+ 1572 |
+ Stability |
+ BCHB_CHLTE |
+ Medium |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Deng_2012 |
+ 0.238 |
+ 0.334 |
+ 0.340 |
+ 0.348 |
+ 0.366 |
+ 0.366 |
+ 0.110 |
+ 0.170 |
+ 0.276 |
+ 0.340 |
+ 0.298 |
+ 0.290 |
+ 0.320 |
+ 0.172 |
+ 0.250 |
+ 0.268 |
+ 0.294 |
+ 0.268 |
+ 0.256 |
+ 0.298 |
+ 0.258 |
+ 0.260 |
+ 0.276 |
+ 0.238 |
+ 0.248 |
+ 0.336 |
+ 0.304 |
+ 0.306 |
+ 0.228 |
+ 0.300 |
+ 0.340 |
+ 0.320 |
+ 0.098 |
+ 0.250 |
+ 0.296 |
+ 0.246 |
+ 0.298 |
+ 0.338 |
+ 0.314 |
+ 0.360 |
+ 0.386 |
+ 0.378 |
+ 0.204 |
+ 0.074 |
+ 0.318 |
+ 0.248 |
+ 0.334 |
+ 0.326 |
+ 0.324 |
+ 0.218 |
+ 0.338 |
+ 0.354 |
+ 0.336 |
+ 0.380 |
+ 0.352 |
+ 0.352 |
+ 0.368 |
+ 0.352 |
+ 0.376 |
+ 0.362 |
+ 0.330 |
+ 0.290 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Firnberg_2014 |
+ 0.229 |
+ 0.385 |
+ 0.404 |
+ 0.394 |
+ 0.410 |
+ 0.410 |
+ 0.140 |
+ 0.190 |
+ 0.300 |
+ 0.362 |
+ 0.338 |
+ 0.317 |
+ 0.356 |
+ 0.200 |
+ 0.262 |
+ 0.323 |
+ 0.350 |
+ 0.321 |
+ 0.252 |
+ 0.333 |
+ 0.292 |
+ 0.265 |
+ 0.258 |
+ 0.235 |
+ 0.296 |
+ 0.346 |
+ 0.306 |
+ 0.315 |
+ 0.208 |
+ 0.296 |
+ 0.335 |
+ 0.317 |
+ 0.140 |
+ 0.271 |
+ 0.292 |
+ 0.238 |
+ 0.317 |
+ 0.338 |
+ 0.321 |
+ 0.404 |
+ 0.412 |
+ 0.423 |
+ 0.235 |
+ 0.123 |
+ 0.367 |
+ 0.279 |
+ 0.362 |
+ 0.381 |
+ 0.406 |
+ 0.215 |
+ 0.365 |
+ 0.352 |
+ 0.350 |
+ 0.369 |
+ 0.381 |
+ 0.367 |
+ 0.369 |
+ 0.373 |
+ 0.358 |
+ 0.377 |
+ 0.402 |
+ 0.298 |
+ 4783 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Jacquier_2013 |
+ 0.131 |
+ 0.225 |
+ 0.214 |
+ 0.217 |
+ 0.217 |
+ 0.219 |
+ 0.099 |
+ 0.139 |
+ 0.174 |
+ 0.201 |
+ 0.182 |
+ 0.187 |
+ 0.193 |
+ 0.134 |
+ 0.144 |
+ 0.155 |
+ 0.187 |
+ 0.179 |
+ 0.179 |
+ 0.168 |
+ 0.179 |
+ 0.176 |
+ 0.195 |
+ 0.171 |
+ 0.166 |
+ 0.179 |
+ 0.195 |
+ 0.203 |
+ 0.174 |
+ 0.142 |
+ 0.187 |
+ 0.195 |
+ 0.115 |
+ 0.182 |
+ 0.193 |
+ 0.195 |
+ 0.190 |
+ 0.203 |
+ 0.201 |
+ 0.214 |
+ 0.209 |
+ 0.219 |
+ 0.128 |
+ 0.078 |
+ 0.179 |
+ 0.152 |
+ 0.150 |
+ 0.193 |
+ 0.182 |
+ 0.155 |
+ 0.190 |
+ 0.201 |
+ 0.206 |
+ 0.195 |
+ 0.184 |
+ 0.198 |
+ 0.198 |
+ 0.184 |
+ 0.201 |
+ 0.195 |
+ 0.190 |
+ 0.155 |
+ 989 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BLAT_ECOLX_Stiffler_2015 |
+ 0.204 |
+ 0.350 |
+ 0.356 |
+ 0.336 |
+ 0.374 |
+ 0.384 |
+ 0.088 |
+ 0.222 |
+ 0.256 |
+ 0.334 |
+ 0.314 |
+ 0.294 |
+ 0.324 |
+ 0.182 |
+ 0.216 |
+ 0.274 |
+ 0.306 |
+ 0.292 |
+ 0.258 |
+ 0.292 |
+ 0.288 |
+ 0.286 |
+ 0.260 |
+ 0.250 |
+ 0.272 |
+ 0.336 |
+ 0.336 |
+ 0.302 |
+ 0.226 |
+ 0.292 |
+ 0.356 |
+ 0.364 |
+ 0.084 |
+ 0.288 |
+ 0.290 |
+ 0.264 |
+ 0.330 |
+ 0.328 |
+ 0.324 |
+ 0.380 |
+ 0.396 |
+ 0.400 |
+ 0.180 |
+ 0.090 |
+ 0.328 |
+ 0.214 |
+ 0.318 |
+ 0.348 |
+ 0.344 |
+ 0.190 |
+ 0.352 |
+ 0.320 |
+ 0.334 |
+ 0.356 |
+ 0.364 |
+ 0.356 |
+ 0.354 |
+ 0.330 |
+ 0.342 |
+ 0.364 |
+ 0.346 |
+ 0.244 |
+ 4996 |
+ OrganismalFitness |
+ BLAT_ECOLX |
+ High |
+ Prokaryote |
+
+
+ BRCA1_HUMAN_Findlay_2018 |
+ 0.223 |
+ 0.196 |
+ 0.234 |
+ 0.245 |
+ 0.217 |
+ 0.239 |
+ 0.136 |
+ 0.136 |
+ 0.201 |
+ 0.207 |
+ 0.168 |
+ 0.152 |
+ 0.163 |
+ 0.152 |
+ 0.120 |
+ 0.147 |
+ 0.168 |
+ 0.212 |
+ 0.130 |
+ 0.158 |
+ 0.136 |
+ 0.212 |
+ 0.223 |
+ 0.158 |
+ 0.207 |
+ 0.228 |
+ 0.239 |
+ 0.185 |
+ 0.179 |
+ 0.196 |
+ 0.212 |
+ 0.190 |
+ 0.120 |
+ 0.130 |
+ 0.147 |
+ 0.179 |
+ 0.207 |
+ 0.217 |
+ 0.223 |
+ 0.255 |
+ 0.250 |
+ 0.212 |
+ 0.141 |
+ 0.168 |
+ 0.228 |
+ 0.179 |
+ 0.185 |
+ 0.228 |
+ 0.130 |
+ 0.141 |
+ 0.174 |
+ 0.185 |
+ 0.168 |
+ 0.163 |
+ 0.163 |
+ 0.152 |
+ 0.158 |
+ 0.179 |
+ 0.163 |
+ 0.163 |
+ 0.147 |
+ 0.168 |
+ 1837 |
+ OrganismalFitness |
+ BRCA1_HUMAN |
+ Low |
+ Human |
+
+
+ BRCA2_HUMAN_Erwood_2022_HEK293T |
+ 0.259 |
+ 0.185 |
+ 0.148 |
+ 0.185 |
+ 0.259 |
+ 0.185 |
+ 0.148 |
+ 0.111 |
+ 0.111 |
+ 0.148 |
+ 0.148 |
+ 0.111 |
+ 0.037 |
+ 0.111 |
+ 0.148 |
+ 0.185 |
+ 0.185 |
+ 0.185 |
+ 0.111 |
+ 0.037 |
+ 0.185 |
+ 0.259 |
+ 0.296 |
+ 0.148 |
+ 0.222 |
+ 0.148 |
+ 0.111 |
+ 0.259 |
+ 0.074 |
+ 0.111 |
+ 0.148 |
+ 0.222 |
+ 0.111 |
+ 0.185 |
+ 0.222 |
+ 0.185 |
+ 0.148 |
+ 0.185 |
+ 0.185 |
+ 0.185 |
+ 0.185 |
+ 0.185 |
+ 0.111 |
+ 0.148 |
+ 0.185 |
+ 0.185 |
+ 0.148 |
+ 0.111 |
+ 0.037 |
+ 0.111 |
+ 0.222 |
+ 0.185 |
+ 0.148 |
+ 0.111 |
+ 0.222 |
+ 0.148 |
+ 0.111 |
+ 0.148 |
+ 0.222 |
+ 0.148 |
+ 0.074 |
+ 0.111 |
+ 265 |
+ OrganismalFitness |
+ BRCA2_HUMAN |
+ NaN |
+ Human |
+
+
+ C6KNH7_9INFA_Lee_2018 |
+ 0.255 |
+ 0.295 |
+ 0.288 |
+ 0.283 |
+ 0.302 |
+ 0.305 |
+ 0.088 |
+ 0.247 |
+ 0.298 |
+ 0.301 |
+ 0.098 |
+ 0.270 |
+ 0.331 |
+ 0.089 |
+ 0.092 |
+ 0.097 |
+ 0.250 |
+ 0.271 |
+ 0.305 |
+ 0.251 |
+ 0.262 |
+ 0.254 |
+ 0.266 |
+ 0.242 |
+ 0.143 |
+ 0.264 |
+ 0.273 |
+ 0.294 |
+ 0.242 |
+ 0.300 |
+ 0.271 |
+ 0.225 |
+ 0.125 |
+ 0.243 |
+ 0.269 |
+ 0.257 |
+ 0.280 |
+ 0.287 |
+ 0.285 |
+ 0.302 |
+ 0.309 |
+ 0.304 |
+ 0.097 |
+ 0.092 |
+ 0.172 |
+ 0.099 |
+ 0.236 |
+ 0.294 |
+ 0.276 |
+ 0.177 |
+ 0.262 |
+ 0.261 |
+ 0.269 |
+ 0.267 |
+ 0.262 |
+ 0.246 |
+ 0.259 |
+ 0.265 |
+ 0.253 |
+ 0.266 |
+ 0.181 |
+ 0.157 |
+ 10754 |
+ OrganismalFitness |
+ C6KNH7_9INFA |
+ Medium |
+ Virus |
+
+
+ CALM1_HUMAN_Weile_2017 |
+ 0.088 |
+ 0.082 |
+ 0.066 |
+ 0.055 |
+ 0.055 |
+ 0.055 |
+ 0.099 |
+ 0.049 |
+ 0.071 |
+ 0.049 |
+ 0.055 |
+ 0.077 |
+ 0.071 |
+ 0.110 |
+ 0.093 |
+ 0.077 |
+ 0.055 |
+ 0.038 |
+ 0.099 |
+ 0.055 |
+ 0.082 |
+ 0.077 |
+ 0.077 |
+ 0.060 |
+ 0.060 |
+ 0.055 |
+ 0.049 |
+ 0.077 |
+ 0.038 |
+ 0.038 |
+ 0.049 |
+ 0.055 |
+ 0.088 |
+ 0.099 |
+ 0.077 |
+ 0.038 |
+ 0.071 |
+ 0.082 |
+ 0.049 |
+ 0.066 |
+ 0.055 |
+ 0.060 |
+ 0.060 |
+ 0.104 |
+ 0.066 |
+ 0.071 |
+ 0.115 |
+ 0.066 |
+ 0.060 |
+ 0.121 |
+ 0.077 |
+ 0.071 |
+ 0.077 |
+ 0.071 |
+ 0.077 |
+ 0.066 |
+ 0.066 |
+ 0.077 |
+ 0.071 |
+ 0.077 |
+ 0.060 |
+ 0.077 |
+ 1813 |
+ OrganismalFitness |
+ CALM1_HUMAN |
+ High |
+ Human |
+
+
+ CAPSD_AAV2S_Sinai_2021 |
+ 0.263 |
+ 0.258 |
+ 0.246 |
+ 0.279 |
+ 0.258 |
+ 0.262 |
+ 0.174 |
+ 0.265 |
+ 0.233 |
+ 0.254 |
+ 0.136 |
+ 0.121 |
+ 0.129 |
+ 0.131 |
+ 0.147 |
+ 0.134 |
+ 0.165 |
+ 0.148 |
+ 0.112 |
+ 0.174 |
+ 0.122 |
+ 0.129 |
+ 0.133 |
+ 0.153 |
+ 0.119 |
+ 0.122 |
+ 0.154 |
+ 0.126 |
+ 0.180 |
+ 0.272 |
+ 0.130 |
+ 0.120 |
+ 0.098 |
+ 0.133 |
+ 0.142 |
+ 0.263 |
+ 0.230 |
+ 0.231 |
+ 0.275 |
+ 0.253 |
+ 0.255 |
+ 0.273 |
+ 0.128 |
+ 0.163 |
+ 0.190 |
+ 0.147 |
+ 0.176 |
+ 0.188 |
+ 0.145 |
+ 0.156 |
+ 0.162 |
+ 0.145 |
+ 0.133 |
+ 0.145 |
+ 0.125 |
+ 0.133 |
+ 0.142 |
+ 0.140 |
+ 0.175 |
+ 0.147 |
+ 0.152 |
+ 0.118 |
+ 42328 |
+ OrganismalFitness |
+ CAPSD_AAV2S |
+ Low |
+ Virus |
+
+
+ CAR11_HUMAN_Meitlis_2020_gof |
+ 0.076 |
+ 0.071 |
+ 0.071 |
+ 0.071 |
+ 0.080 |
+ 0.071 |
+ 0.223 |
+ 0.055 |
+ 0.071 |
+ 0.071 |
+ 0.071 |
+ 0.101 |
+ 0.088 |
+ 0.122 |
+ 0.109 |
+ 0.168 |
+ 0.092 |
+ 0.084 |
+ 0.088 |
+ 0.092 |
+ 0.113 |
+ 0.071 |
+ 0.097 |
+ 0.076 |
+ 0.038 |
+ 0.076 |
+ 0.080 |
+ 0.067 |
+ 0.097 |
+ 0.059 |
+ 0.071 |
+ 0.059 |
+ 0.097 |
+ 0.151 |
+ 0.080 |
+ 0.055 |
+ 0.084 |
+ 0.084 |
+ 0.071 |
+ 0.080 |
+ 0.084 |
+ 0.076 |
+ 0.118 |
+ 0.092 |
+ 0.092 |
+ 0.151 |
+ 0.088 |
+ 0.071 |
+ 0.113 |
+ 0.118 |
+ 0.160 |
+ 0.134 |
+ 0.130 |
+ 0.139 |
+ 0.143 |
+ 0.126 |
+ 0.143 |
+ 0.139 |
+ 0.130 |
+ 0.130 |
+ 0.168 |
+ 0.172 |
+ 2374 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAR11_HUMAN_Meitlis_2020_lof |
+ 0.062 |
+ 0.075 |
+ 0.071 |
+ 0.067 |
+ 0.062 |
+ 0.071 |
+ 0.204 |
+ 0.042 |
+ 0.092 |
+ 0.092 |
+ 0.067 |
+ 0.146 |
+ 0.112 |
+ 0.121 |
+ 0.104 |
+ 0.183 |
+ 0.112 |
+ 0.129 |
+ 0.100 |
+ 0.125 |
+ 0.125 |
+ 0.058 |
+ 0.096 |
+ 0.075 |
+ 0.062 |
+ 0.071 |
+ 0.088 |
+ 0.062 |
+ 0.121 |
+ 0.050 |
+ 0.075 |
+ 0.062 |
+ 0.067 |
+ 0.167 |
+ 0.079 |
+ 0.075 |
+ 0.067 |
+ 0.067 |
+ 0.050 |
+ 0.075 |
+ 0.075 |
+ 0.075 |
+ 0.121 |
+ 0.092 |
+ 0.088 |
+ 0.183 |
+ 0.108 |
+ 0.083 |
+ 0.154 |
+ 0.129 |
+ 0.158 |
+ 0.121 |
+ 0.133 |
+ 0.108 |
+ 0.142 |
+ 0.133 |
+ 0.146 |
+ 0.138 |
+ 0.129 |
+ 0.125 |
+ 0.208 |
+ 0.188 |
+ 2395 |
+ OrganismalFitness |
+ CAR11_HUMAN |
+ Low |
+ Human |
+
+
+ CAS9_STRP1_Spencer_2017_positive |
+ 0.096 |
+ 0.112 |
+ 0.111 |
+ 0.105 |
+ 0.111 |
+ 0.112 |
+ 0.096 |
+ 0.106 |
+ 0.116 |
+ 0.117 |
+ 0.110 |
+ 0.096 |
+ 0.106 |
+ 0.097 |
+ 0.103 |
+ 0.126 |
+ 0.106 |
+ 0.117 |
+ 0.110 |
+ 0.097 |
+ 0.099 |
+ 0.096 |
+ 0.117 |
+ 0.116 |
+ 0.100 |
+ 0.112 |
+ 0.096 |
+ 0.111 |
+ 0.101 |
+ 0.105 |
+ 0.117 |
+ 0.117 |
+ 0.111 |
+ 0.091 |
+ 0.099 |
+ 0.128 |
+ 0.111 |
+ 0.116 |
+ 0.123 |
+ 0.101 |
+ 0.101 |
+ 0.117 |
+ 0.095 |
+ 0.091 |
+ 0.111 |
+ 0.099 |
+ 0.099 |
+ 0.111 |
+ 0.103 |
+ 0.094 |
+ 0.118 |
+ 0.107 |
+ 0.115 |
+ 0.102 |
+ 0.111 |
+ 0.111 |
+ 0.099 |
+ 0.112 |
+ 0.100 |
+ 0.110 |
+ 0.127 |
+ 0.115 |
+ 8117 |
+ Activity |
+ CAS9_STRP1 |
+ Medium |
+ Prokaryote |
+
+
+ CASP3_HUMAN_Roychowdhury_2020 |
+ 0.159 |
+ 0.204 |
+ 0.153 |
+ 0.115 |
+ 0.134 |
+ 0.153 |
+ 0.089 |
+ 0.159 |
+ 0.210 |
+ 0.223 |
+ 0.146 |
+ 0.115 |
+ 0.153 |
+ 0.089 |
+ 0.146 |
+ 0.166 |
+ 0.178 |
+ 0.146 |
+ 0.210 |
+ 0.159 |
+ 0.146 |
+ 0.159 |
+ 0.159 |
+ 0.153 |
+ 0.140 |
+ 0.159 |
+ 0.178 |
+ 0.172 |
+ 0.204 |
+ 0.146 |
+ 0.229 |
+ 0.223 |
+ 0.115 |
+ 0.102 |
+ 0.140 |
+ 0.172 |
+ 0.134 |
+ 0.166 |
+ 0.166 |
+ 0.146 |
+ 0.172 |
+ 0.159 |
+ 0.096 |
+ 0.076 |
+ 0.159 |
+ 0.140 |
+ 0.159 |
+ 0.166 |
+ 0.127 |
+ 0.102 |
+ 0.185 |
+ 0.178 |
+ 0.191 |
+ 0.140 |
+ 0.172 |
+ 0.166 |
+ 0.185 |
+ 0.166 |
+ 0.172 |
+ 0.166 |
+ 0.146 |
+ 0.134 |
+ 1567 |
+ Activity |
+ CASP3_HUMAN |
+ High |
+ Human |
+
+
+ CASP7_HUMAN_Roychowdhury_2020 |
+ 0.149 |
+ 0.202 |
+ 0.238 |
+ 0.214 |
+ 0.250 |
+ 0.244 |
+ 0.060 |
+ 0.155 |
+ 0.226 |
+ 0.250 |
+ 0.268 |
+ 0.190 |
+ 0.232 |
+ 0.107 |
+ 0.208 |
+ 0.274 |
+ 0.208 |
+ 0.196 |
+ 0.190 |
+ 0.238 |
+ 0.095 |
+ 0.214 |
+ 0.155 |
+ 0.167 |
+ 0.202 |
+ 0.179 |
+ 0.179 |
+ 0.167 |
+ 0.196 |
+ 0.226 |
+ 0.167 |
+ 0.155 |
+ 0.107 |
+ 0.083 |
+ 0.190 |
+ 0.137 |
+ 0.208 |
+ 0.226 |
+ 0.190 |
+ 0.220 |
+ 0.232 |
+ 0.244 |
+ 0.185 |
+ 0.060 |
+ 0.262 |
+ 0.274 |
+ 0.238 |
+ 0.280 |
+ 0.244 |
+ 0.190 |
+ 0.196 |
+ 0.214 |
+ 0.244 |
+ 0.232 |
+ 0.238 |
+ 0.226 |
+ 0.214 |
+ 0.220 |
+ 0.226 |
+ 0.220 |
+ 0.226 |
+ 0.208 |
+ 1680 |
+ Activity |
+ CASP7_HUMAN |
+ Medium |
+ Human |
+
+
+ CATR_CHLRE_Tsuboyama_2023_2AMI |
+ 0.272 |
+ 0.147 |
+ 0.147 |
+ 0.152 |
+ 0.183 |
+ 0.162 |
+ 0.236 |
+ 0.094 |
+ 0.131 |
+ 0.136 |
+ 0.115 |
+ 0.173 |
+ 0.147 |
+ 0.225 |
+ 0.257 |
+ 0.209 |
+ 0.152 |
+ 0.157 |
+ 0.131 |
+ 0.089 |
+ 0.120 |
+ 0.126 |
+ 0.110 |
+ 0.105 |
+ 0.073 |
+ 0.089 |
+ 0.089 |
+ 0.073 |
+ 0.105 |
+ 0.152 |
+ 0.126 |
+ 0.173 |
+ 0.183 |
+ 0.131 |
+ 0.110 |
+ 0.099 |
+ 0.199 |
+ 0.173 |
+ 0.126 |
+ 0.168 |
+ 0.152 |
+ 0.157 |
+ 0.194 |
+ 0.183 |
+ 0.084 |
+ 0.126 |
+ 0.314 |
+ 0.131 |
+ 0.183 |
+ 0.293 |
+ 0.183 |
+ 0.115 |
+ 0.141 |
+ 0.152 |
+ 0.141 |
+ 0.147 |
+ 0.157 |
+ 0.141 |
+ 0.141 |
+ 0.157 |
+ 0.099 |
+ 0.257 |
+ 1903 |
+ Stability |
+ CATR_CHLRE |
+ High |
+ Eukaryote |
+
+
+ CBPA2_HUMAN_Tsuboyama_2023_1O6X |
+ 0.227 |
+ 0.159 |
+ 0.227 |
+ 0.227 |
+ 0.237 |
+ 0.237 |
+ 0.222 |
+ 0.184 |
+ 0.174 |
+ 0.198 |
+ 0.304 |
+ 0.242 |
+ 0.256 |
+ 0.227 |
+ 0.246 |
+ 0.251 |
+ 0.222 |
+ 0.227 |
+ 0.246 |
+ 0.237 |
+ 0.174 |
+ 0.193 |
+ 0.261 |
+ 0.251 |
+ 0.290 |
+ 0.261 |
+ 0.285 |
+ 0.246 |
+ 0.213 |
+ 0.184 |
+ 0.184 |
+ 0.140 |
+ 0.155 |
+ 0.237 |
+ 0.290 |
+ 0.285 |
+ 0.227 |
+ 0.242 |
+ 0.246 |
+ 0.261 |
+ 0.237 |
+ 0.261 |
+ 0.290 |
+ 0.261 |
+ 0.275 |
+ 0.290 |
+ 0.295 |
+ 0.309 |
+ 0.242 |
+ 0.285 |
+ 0.179 |
+ 0.242 |
+ 0.246 |
+ 0.217 |
+ 0.237 |
+ 0.232 |
+ 0.184 |
+ 0.203 |
+ 0.203 |
+ 0.232 |
+ 0.237 |
+ 0.309 |
+ 2068 |
+ Stability |
+ CBPA2_HUMAN |
+ Medium |
+ Human |
+
+
+ CBS_HUMAN_Sun_2020 |
+ 0.168 |
+ 0.161 |
+ 0.169 |
+ 0.166 |
+ 0.176 |
+ 0.175 |
+ 0.100 |
+ 0.141 |
+ 0.173 |
+ 0.162 |
+ 0.163 |
+ 0.150 |
+ 0.162 |
+ 0.098 |
+ 0.127 |
+ 0.127 |
+ 0.132 |
+ 0.145 |
+ 0.157 |
+ 0.161 |
+ 0.143 |
+ 0.133 |
+ 0.166 |
+ 0.173 |
+ 0.147 |
+ 0.152 |
+ 0.154 |
+ 0.163 |
+ 0.152 |
+ 0.163 |
+ 0.145 |
+ 0.130 |
+ 0.109 |
+ 0.150 |
+ 0.141 |
+ 0.151 |
+ 0.158 |
+ 0.177 |
+ 0.194 |
+ 0.172 |
+ 0.180 |
+ 0.179 |
+ 0.130 |
+ 0.102 |
+ 0.158 |
+ 0.150 |
+ 0.147 |
+ 0.147 |
+ 0.169 |
+ 0.116 |
+ 0.139 |
+ 0.136 |
+ 0.147 |
+ 0.141 |
+ 0.145 |
+ 0.151 |
+ 0.150 |
+ 0.154 |
+ 0.155 |
+ 0.151 |
+ 0.147 |
+ 0.127 |
+ 7217 |
+ OrganismalFitness |
+ CBS_HUMAN |
+ Medium |
+ Human |
+
+
+ CBX4_HUMAN_Tsuboyama_2023_2K28 |
+ 0.437 |
+ 0.406 |
+ 0.441 |
+ 0.445 |
+ 0.472 |
+ 0.480 |
+ 0.013 |
+ 0.310 |
+ 0.445 |
+ 0.424 |
+ 0.428 |
+ 0.397 |
+ 0.402 |
+ 0.022 |
+ 0.533 |
+ 0.533 |
+ 0.480 |
+ 0.445 |
+ 0.410 |
+ 0.445 |
+ 0.306 |
+ 0.341 |
+ 0.328 |
+ 0.332 |
+ 0.376 |
+ 0.358 |
+ 0.371 |
+ 0.341 |
+ 0.332 |
+ 0.445 |
+ 0.432 |
+ 0.376 |
+ 0.192 |
+ 0.301 |
+ 0.328 |
+ 0.341 |
+ 0.450 |
+ 0.402 |
+ 0.441 |
+ 0.480 |
+ 0.424 |
+ 0.441 |
+ 0.437 |
+ 0.096 |
+ 0.319 |
+ 0.406 |
+ 0.371 |
+ 0.262 |
+ 0.520 |
+ 0.467 |
+ 0.507 |
+ 0.480 |
+ 0.485 |
+ 0.480 |
+ 0.520 |
+ 0.459 |
+ 0.472 |
+ 0.472 |
+ 0.476 |
+ 0.507 |
+ 0.437 |
+ 0.507 |
+ 2282 |
+ Stability |
+ CBX4_HUMAN |
+ High |
+ Human |
+
+
+ CCDB_ECOLI_Adkar_2012 |
+ 0.124 |
+ 0.152 |
+ 0.146 |
+ 0.146 |
+ 0.118 |
+ 0.129 |
+ 0.096 |
+ 0.107 |
+ 0.079 |
+ 0.118 |
+ 0.146 |
+ 0.056 |
+ 0.079 |
+ 0.101 |
+ 0.112 |
+ 0.096 |
+ 0.157 |
+ 0.112 |
+ 0.140 |
+ 0.107 |
+ 0.062 |
+ 0.045 |
+ 0.067 |
+ 0.096 |
+ 0.017 |
+ 0.101 |
+ 0.090 |
+ 0.146 |
+ 0.112 |
+ 0.107 |
+ 0.197 |
+ 0.163 |
+ 0.067 |
+ 0.056 |
+ 0.073 |
+ 0.124 |
+ 0.118 |
+ 0.129 |
+ 0.152 |
+ 0.135 |
+ 0.146 |
+ 0.146 |
+ 0.096 |
+ 0.079 |
+ 0.118 |
+ 0.124 |
+ 0.062 |
+ 0.152 |
+ 0.135 |
+ 0.101 |
+ 0.174 |
+ 0.118 |
+ 0.174 |
+ 0.213 |
+ 0.185 |
+ 0.169 |
+ 0.163 |
+ 0.169 |
+ 0.140 |
+ 0.169 |
+ 0.157 |
+ 0.090 |
+ 1176 |
+ Activity |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCDB_ECOLI_Tripathi_2016 |
+ 0.117 |
+ 0.125 |
+ 0.124 |
+ 0.125 |
+ 0.124 |
+ 0.124 |
+ 0.090 |
+ 0.117 |
+ 0.108 |
+ 0.117 |
+ 0.115 |
+ 0.112 |
+ 0.112 |
+ 0.093 |
+ 0.097 |
+ 0.112 |
+ 0.120 |
+ 0.124 |
+ 0.127 |
+ 0.120 |
+ 0.093 |
+ 0.090 |
+ 0.083 |
+ 0.110 |
+ 0.085 |
+ 0.102 |
+ 0.086 |
+ 0.102 |
+ 0.125 |
+ 0.118 |
+ 0.126 |
+ 0.127 |
+ 0.108 |
+ 0.089 |
+ 0.099 |
+ 0.124 |
+ 0.118 |
+ 0.116 |
+ 0.124 |
+ 0.122 |
+ 0.122 |
+ 0.124 |
+ 0.096 |
+ 0.085 |
+ 0.114 |
+ 0.095 |
+ 0.111 |
+ 0.123 |
+ 0.113 |
+ 0.105 |
+ 0.121 |
+ 0.118 |
+ 0.115 |
+ 0.120 |
+ 0.124 |
+ 0.122 |
+ 0.121 |
+ 0.124 |
+ 0.123 |
+ 0.122 |
+ 0.120 |
+ 0.117 |
+ 1663 |
+ OrganismalFitness |
+ CCDB_ECOLI |
+ High |
+ Prokaryote |
+
+
+ CCR5_HUMAN_Gill_2023 |
+ 0.146 |
+ 0.143 |
+ 0.156 |
+ 0.164 |
+ 0.156 |
+ 0.140 |
+ 0.114 |
+ 0.231 |
+ 0.164 |
+ 0.165 |
+ 0.169 |
+ 0.146 |
+ 0.154 |
+ 0.124 |
+ 0.135 |
+ 0.148 |
+ 0.141 |
+ 0.154 |
+ 0.138 |
+ 0.170 |
+ 0.130 |
+ 0.196 |
+ 0.167 |
+ 0.161 |
+ 0.157 |
+ 0.196 |
+ 0.180 |
+ 0.196 |
+ 0.178 |
+ 0.178 |
+ 0.141 |
+ 0.125 |
+ 0.119 |
+ 0.167 |
+ 0.175 |
+ 0.167 |
+ 0.156 |
+ 0.159 |
+ 0.169 |
+ 0.144 |
+ 0.148 |
+ 0.148 |
+ 0.135 |
+ 0.079 |
+ 0.183 |
+ 0.138 |
+ 0.133 |
+ 0.177 |
+ 0.140 |
+ 0.132 |
+ 0.148 |
+ 0.148 |
+ 0.143 |
+ 0.128 |
+ 0.127 |
+ 0.138 |
+ 0.135 |
+ 0.144 |
+ 0.125 |
+ 0.135 |
+ 0.172 |
+ 0.151 |
+ 6137 |
+ Binding |
+ CCR5_HUMAN |
+ High |
+ Human |
+
+
+ CD19_HUMAN_Klesmith_2019_FMC_singles |
+ 0.167 |
+ 0.127 |
+ 0.164 |
+ 0.162 |
+ 0.146 |
+ 0.138 |
+ 0.143 |
+ 0.122 |
+ 0.103 |
+ 0.109 |
+ 0.180 |
+ 0.167 |
+ 0.196 |
+ 0.186 |
+ 0.228 |
+ 0.175 |
+ 0.167 |
+ 0.183 |
+ 0.138 |
+ 0.090 |
+ 0.194 |
+ 0.162 |
+ 0.207 |
+ 0.162 |
+ 0.149 |
+ 0.225 |
+ 0.159 |
+ 0.204 |
+ 0.149 |
+ 0.162 |
+ 0.130 |
+ 0.101 |
+ 0.127 |
+ 0.241 |
+ 0.252 |
+ 0.167 |
+ 0.191 |
+ 0.180 |
+ 0.156 |
+ 0.164 |
+ 0.154 |
+ 0.151 |
+ 0.215 |
+ 0.183 |
+ 0.218 |
+ 0.212 |
+ 0.340 |
+ 0.321 |
+ 0.324 |
+ 0.138 |
+ 0.228 |
+ 0.191 |
+ 0.202 |
+ 0.215 |
+ 0.218 |
+ 0.218 |
+ 0.236 |
+ 0.210 |
+ 0.228 |
+ 0.218 |
+ 0.353 |
+ 0.302 |
+ 3761 |
+ Binding |
+ CD19_HUMAN |
+ Low |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_abundance |
+ 0.250 |
+ 0.272 |
+ 0.267 |
+ 0.279 |
+ 0.298 |
+ 0.294 |
+ 0.242 |
+ 0.273 |
+ 0.278 |
+ 0.300 |
+ 0.261 |
+ 0.284 |
+ 0.309 |
+ 0.217 |
+ 0.284 |
+ 0.275 |
+ 0.289 |
+ 0.284 |
+ 0.278 |
+ 0.295 |
+ 0.287 |
+ 0.253 |
+ 0.276 |
+ 0.267 |
+ 0.284 |
+ 0.275 |
+ 0.279 |
+ 0.279 |
+ 0.284 |
+ 0.262 |
+ 0.246 |
+ 0.212 |
+ 0.110 |
+ 0.284 |
+ 0.281 |
+ 0.276 |
+ 0.294 |
+ 0.292 |
+ 0.289 |
+ 0.300 |
+ 0.298 |
+ 0.295 |
+ 0.287 |
+ 0.129 |
+ 0.272 |
+ 0.281 |
+ 0.276 |
+ 0.279 |
+ 0.309 |
+ 0.154 |
+ 0.275 |
+ 0.279 |
+ 0.290 |
+ 0.287 |
+ 0.286 |
+ 0.292 |
+ 0.300 |
+ 0.292 |
+ 0.289 |
+ 0.284 |
+ 0.292 |
+ 0.273 |
+ 6370 |
+ Expression |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CP2C9_HUMAN_Amorosi_2021_activity |
+ 0.285 |
+ 0.332 |
+ 0.345 |
+ 0.359 |
+ 0.369 |
+ 0.371 |
+ 0.309 |
+ 0.319 |
+ 0.311 |
+ 0.359 |
+ 0.320 |
+ 0.372 |
+ 0.380 |
+ 0.285 |
+ 0.364 |
+ 0.377 |
+ 0.397 |
+ 0.376 |
+ 0.354 |
+ 0.395 |
+ 0.346 |
+ 0.307 |
+ 0.332 |
+ 0.291 |
+ 0.359 |
+ 0.327 |
+ 0.324 |
+ 0.302 |
+ 0.306 |
+ 0.328 |
+ 0.247 |
+ 0.198 |
+ 0.117 |
+ 0.367 |
+ 0.359 |
+ 0.335 |
+ 0.379 |
+ 0.364 |
+ 0.372 |
+ 0.379 |
+ 0.374 |
+ 0.367 |
+ 0.371 |
+ 0.140 |
+ 0.346 |
+ 0.385 |
+ 0.314 |
+ 0.328 |
+ 0.353 |
+ 0.202 |
+ 0.403 |
+ 0.363 |
+ 0.366 |
+ 0.380 |
+ 0.379 |
+ 0.390 |
+ 0.366 |
+ 0.379 |
+ 0.385 |
+ 0.387 |
+ 0.387 |
+ 0.354 |
+ 6142 |
+ Binding |
+ CP2C9_HUMAN |
+ High |
+ Human |
+
+
+ CSN4_MOUSE_Tsuboyama_2023_1UFM |
+ 0.358 |
+ 0.330 |
+ 0.327 |
+ 0.339 |
+ 0.339 |
+ 0.342 |
+ 0.176 |
+ 0.309 |
+ 0.379 |
+ 0.361 |
+ 0.355 |
+ 0.394 |
+ 0.418 |
+ 0.270 |
+ 0.464 |
+ 0.397 |
+ 0.248 |
+ 0.336 |
+ 0.324 |
+ 0.327 |
+ 0.318 |
+ 0.339 |
+ 0.348 |
+ 0.355 |
+ 0.224 |
+ 0.376 |
+ 0.409 |
+ 0.352 |
+ 0.333 |
+ 0.330 |
+ 0.348 |
+ 0.321 |
+ 0.203 |
+ 0.236 |
+ 0.358 |
+ 0.358 |
+ 0.385 |
+ 0.385 |
+ 0.373 |
+ 0.355 |
+ 0.367 |
+ 0.358 |
+ 0.248 |
+ 0.224 |
+ 0.336 |
+ 0.394 |
+ 0.421 |
+ 0.330 |
+ 0.467 |
+ 0.455 |
+ 0.342 |
+ 0.352 |
+ 0.339 |
+ 0.336 |
+ 0.333 |
+ 0.339 |
+ 0.342 |
+ 0.336 |
+ 0.318 |
+ 0.336 |
+ 0.367 |
+ 0.452 |
+ 3295 |
+ Stability |
+ CSN4_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ CUE1_YEAST_Tsuboyama_2023_2MYX |
+ 0.297 |
+ 0.323 |
+ 0.316 |
+ 0.316 |
+ 0.342 |
+ 0.329 |
+ 0.158 |
+ 0.278 |
+ 0.361 |
+ 0.361 |
+ 0.304 |
+ 0.278 |
+ 0.209 |
+ 0.146 |
+ 0.190 |
+ 0.259 |
+ 0.291 |
+ 0.304 |
+ 0.304 |
+ 0.234 |
+ 0.133 |
+ 0.146 |
+ 0.171 |
+ 0.108 |
+ 0.101 |
+ 0.114 |
+ 0.184 |
+ 0.184 |
+ 0.310 |
+ 0.259 |
+ 0.278 |
+ 0.209 |
+ 0.165 |
+ 0.108 |
+ 0.120 |
+ 0.177 |
+ 0.297 |
+ 0.291 |
+ 0.316 |
+ 0.329 |
+ 0.335 |
+ 0.342 |
+ 0.196 |
+ 0.127 |
+ 0.259 |
+ 0.152 |
+ 0.329 |
+ 0.329 |
+ 0.354 |
+ 0.348 |
+ 0.278 |
+ 0.342 |
+ 0.310 |
+ 0.342 |
+ 0.278 |
+ 0.297 |
+ 0.316 |
+ 0.285 |
+ 0.310 |
+ 0.310 |
+ 0.361 |
+ 0.304 |
+ 1580 |
+ Stability |
+ CUE1_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ D7PM05_CLYGR_Somermeyer_2022 |
+ 0.250 |
+ 0.361 |
+ 0.315 |
+ 0.293 |
+ 0.336 |
+ 0.333 |
+ 0.098 |
+ 0.222 |
+ 0.369 |
+ 0.381 |
+ 0.237 |
+ 0.097 |
+ 0.100 |
+ 0.097 |
+ 0.092 |
+ 0.088 |
+ 0.085 |
+ 0.097 |
+ 0.103 |
+ 0.233 |
+ 0.079 |
+ 0.061 |
+ 0.092 |
+ 0.084 |
+ 0.056 |
+ 0.081 |
+ 0.080 |
+ 0.108 |
+ 0.147 |
+ 0.337 |
+ 0.320 |
+ 0.314 |
+ 0.056 |
+ 0.079 |
+ 0.098 |
+ 0.104 |
+ 0.293 |
+ 0.295 |
+ 0.290 |
+ 0.334 |
+ 0.336 |
+ 0.330 |
+ 0.135 |
+ 0.131 |
+ 0.137 |
+ 0.138 |
+ 0.257 |
+ 0.249 |
+ 0.305 |
+ 0.230 |
+ 0.274 |
+ 0.281 |
+ 0.286 |
+ 0.283 |
+ 0.277 |
+ 0.283 |
+ 0.269 |
+ 0.286 |
+ 0.268 |
+ 0.285 |
+ 0.198 |
+ 0.136 |
+ 24515 |
+ Activity |
+ D7PM05_CLYGR |
+ Low |
+ Eukaryote |
+
+
+ DLG4_HUMAN_Faure_2021 |
+ 0.322 |
+ 0.319 |
+ 0.328 |
+ 0.322 |
+ 0.338 |
+ 0.340 |
+ 0.314 |
+ 0.305 |
+ 0.282 |
+ 0.292 |
+ 0.268 |
+ 0.321 |
+ 0.318 |
+ 0.378 |
+ 0.423 |
+ 0.370 |
+ 0.315 |
+ 0.318 |
+ 0.279 |
+ 0.348 |
+ 0.284 |
+ 0.289 |
+ 0.278 |
+ 0.277 |
+ 0.287 |
+ 0.292 |
+ 0.291 |
+ 0.297 |
+ 0.265 |
+ 0.321 |
+ 0.274 |
+ 0.285 |
+ 0.272 |
+ 0.287 |
+ 0.308 |
+ 0.287 |
+ 0.331 |
+ 0.360 |
+ 0.351 |
+ 0.345 |
+ 0.354 |
+ 0.350 |
+ 0.285 |
+ 0.209 |
+ 0.249 |
+ 0.287 |
+ 0.385 |
+ 0.285 |
+ 0.341 |
+ 0.216 |
+ 0.284 |
+ 0.256 |
+ 0.282 |
+ 0.291 |
+ 0.282 |
+ 0.302 |
+ 0.262 |
+ 0.282 |
+ 0.277 |
+ 0.282 |
+ 0.317 |
+ 0.398 |
+ 6976 |
+ OrganismalFitness |
+ DLG4_HUMAN |
+ Low |
+ Human |
+
+
+ DLG4_RAT_McLaughlin_2012 |
+ 0.158 |
+ 0.215 |
+ 0.158 |
+ 0.139 |
+ 0.165 |
+ 0.171 |
+ 0.184 |
+ 0.133 |
+ 0.127 |
+ 0.165 |
+ 0.184 |
+ 0.133 |
+ 0.127 |
+ 0.108 |
+ 0.139 |
+ 0.177 |
+ 0.101 |
+ 0.127 |
+ 0.133 |
+ 0.101 |
+ 0.108 |
+ 0.108 |
+ 0.101 |
+ 0.114 |
+ 0.089 |
+ 0.127 |
+ 0.095 |
+ 0.108 |
+ 0.127 |
+ 0.133 |
+ 0.203 |
+ 0.234 |
+ 0.120 |
+ 0.114 |
+ 0.133 |
+ 0.108 |
+ 0.133 |
+ 0.133 |
+ 0.120 |
+ 0.133 |
+ 0.133 |
+ 0.158 |
+ 0.158 |
+ 0.089 |
+ 0.146 |
+ 0.133 |
+ 0.190 |
+ 0.127 |
+ 0.108 |
+ 0.133 |
+ 0.108 |
+ 0.076 |
+ 0.063 |
+ 0.089 |
+ 0.089 |
+ 0.082 |
+ 0.076 |
+ 0.095 |
+ 0.095 |
+ 0.082 |
+ 0.095 |
+ 0.133 |
+ 1576 |
+ Binding |
+ DLG4_RAT |
+ Low |
+ Eukaryote |
+
+
+ DN7A_SACS2_Tsuboyama_2023_1JIC |
+ 0.168 |
+ 0.149 |
+ 0.139 |
+ 0.119 |
+ 0.109 |
+ 0.129 |
+ 0.089 |
+ 0.109 |
+ 0.228 |
+ 0.287 |
+ 0.119 |
+ 0.089 |
+ 0.119 |
+ 0.089 |
+ 0.178 |
+ 0.198 |
+ 0.248 |
+ 0.267 |
+ 0.218 |
+ 0.228 |
+ 0.079 |
+ 0.089 |
+ 0.069 |
+ 0.109 |
+ 0.119 |
+ 0.149 |
+ 0.079 |
+ 0.119 |
+ 0.218 |
+ 0.208 |
+ 0.129 |
+ 0.059 |
+ 0.168 |
+ 0.099 |
+ 0.099 |
+ 0.079 |
+ 0.168 |
+ 0.168 |
+ 0.158 |
+ 0.149 |
+ 0.158 |
+ 0.119 |
+ 0.139 |
+ 0.089 |
+ 0.188 |
+ 0.168 |
+ 0.287 |
+ 0.317 |
+ 0.267 |
+ 0.307 |
+ 0.416 |
+ 0.366 |
+ 0.347 |
+ 0.277 |
+ 0.307 |
+ 0.297 |
+ 0.297 |
+ 0.228 |
+ 0.327 |
+ 0.356 |
+ 0.218 |
+ 0.277 |
+ 1008 |
+ Stability |
+ DN7A_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ DNJA1_HUMAN_Tsuboyama_2023_2LO1 |
+ 0.300 |
+ 0.282 |
+ 0.295 |
+ 0.295 |
+ 0.313 |
+ 0.313 |
+ 0.322 |
+ 0.264 |
+ 0.300 |
+ 0.308 |
+ 0.330 |
+ 0.322 |
+ 0.374 |
+ 0.388 |
+ 0.485 |
+ 0.432 |
+ 0.449 |
+ 0.392 |
+ 0.401 |
+ 0.317 |
+ 0.308 |
+ 0.317 |
+ 0.357 |
+ 0.370 |
+ 0.313 |
+ 0.273 |
+ 0.300 |
+ 0.308 |
+ 0.291 |
+ 0.264 |
+ 0.189 |
+ 0.198 |
+ 0.242 |
+ 0.282 |
+ 0.260 |
+ 0.361 |
+ 0.304 |
+ 0.308 |
+ 0.317 |
+ 0.308 |
+ 0.300 |
+ 0.313 |
+ 0.383 |
+ 0.256 |
+ 0.295 |
+ 0.352 |
+ 0.427 |
+ 0.348 |
+ 0.454 |
+ 0.463 |
+ 0.467 |
+ 0.507 |
+ 0.511 |
+ 0.485 |
+ 0.485 |
+ 0.489 |
+ 0.480 |
+ 0.515 |
+ 0.458 |
+ 0.502 |
+ 0.304 |
+ 0.383 |
+ 2264 |
+ Stability |
+ DNJA1_HUMAN |
+ High |
+ Human |
+
+
+ DOCK1_MOUSE_Tsuboyama_2023_2M0Y |
+ 0.342 |
+ 0.325 |
+ 0.329 |
+ 0.295 |
+ 0.288 |
+ 0.318 |
+ 0.099 |
+ 0.250 |
+ 0.223 |
+ 0.192 |
+ 0.305 |
+ 0.185 |
+ 0.233 |
+ 0.027 |
+ 0.192 |
+ 0.233 |
+ 0.305 |
+ 0.305 |
+ 0.274 |
+ 0.332 |
+ 0.240 |
+ 0.284 |
+ 0.291 |
+ 0.260 |
+ 0.103 |
+ 0.223 |
+ 0.134 |
+ 0.195 |
+ 0.312 |
+ 0.164 |
+ 0.318 |
+ 0.291 |
+ 0.247 |
+ 0.075 |
+ 0.075 |
+ 0.048 |
+ 0.325 |
+ 0.356 |
+ 0.329 |
+ 0.322 |
+ 0.315 |
+ 0.322 |
+ 0.134 |
+ 0.058 |
+ 0.325 |
+ 0.192 |
+ 0.384 |
+ 0.377 |
+ 0.363 |
+ 0.445 |
+ 0.349 |
+ 0.363 |
+ 0.366 |
+ 0.390 |
+ 0.353 |
+ 0.380 |
+ 0.366 |
+ 0.363 |
+ 0.339 |
+ 0.373 |
+ 0.404 |
+ 0.370 |
+ 2915 |
+ Stability |
+ DOCK1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ DYR_ECOLI_Nguyen_2023 |
+ 0.165 |
+ 0.101 |
+ 0.126 |
+ 0.128 |
+ 0.108 |
+ 0.093 |
+ 0.099 |
+ 0.104 |
+ 0.200 |
+ 0.150 |
+ 0.154 |
+ 0.205 |
+ 0.174 |
+ 0.126 |
+ 0.183 |
+ 0.165 |
+ 0.174 |
+ 0.161 |
+ 0.170 |
+ 0.185 |
+ 0.154 |
+ 0.152 |
+ 0.143 |
+ 0.148 |
+ 0.187 |
+ 0.079 |
+ 0.079 |
+ 0.123 |
+ 0.088 |
+ 0.088 |
+ 0.141 |
+ 0.117 |
+ 0.024 |
+ 0.099 |
+ 0.064 |
+ 0.073 |
+ 0.134 |
+ 0.084 |
+ 0.075 |
+ 0.088 |
+ 0.075 |
+ 0.086 |
+ 0.161 |
+ 0.106 |
+ 0.141 |
+ 0.167 |
+ 0.148 |
+ 0.187 |
+ 0.198 |
+ 0.159 |
+ 0.185 |
+ 0.174 |
+ 0.172 |
+ 0.163 |
+ 0.183 |
+ 0.183 |
+ 0.181 |
+ 0.172 |
+ 0.200 |
+ 0.174 |
+ 0.203 |
+ 0.192 |
+ 2916 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ High |
+ Prokaryote |
+
+
+ DYR_ECOLI_Thompson_2019 |
+ 0.131 |
+ 0.127 |
+ 0.101 |
+ 0.101 |
+ 0.118 |
+ 0.105 |
+ 0.127 |
+ 0.127 |
+ 0.110 |
+ 0.131 |
+ 0.152 |
+ 0.143 |
+ 0.143 |
+ 0.105 |
+ 0.139 |
+ 0.160 |
+ 0.165 |
+ 0.169 |
+ 0.143 |
+ 0.152 |
+ 0.118 |
+ 0.068 |
+ 0.127 |
+ 0.084 |
+ 0.110 |
+ 0.152 |
+ 0.169 |
+ 0.127 |
+ 0.131 |
+ 0.114 |
+ 0.122 |
+ 0.089 |
+ 0.038 |
+ 0.084 |
+ 0.118 |
+ 0.194 |
+ 0.127 |
+ 0.143 |
+ 0.169 |
+ 0.114 |
+ 0.118 |
+ 0.118 |
+ 0.131 |
+ 0.089 |
+ 0.156 |
+ 0.152 |
+ 0.148 |
+ 0.127 |
+ 0.135 |
+ 0.122 |
+ 0.143 |
+ 0.173 |
+ 0.152 |
+ 0.165 |
+ 0.165 |
+ 0.194 |
+ 0.173 |
+ 0.173 |
+ 0.186 |
+ 0.177 |
+ 0.143 |
+ 0.194 |
+ 2363 |
+ OrganismalFitness |
+ DYR_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ ENV_HV1B9_DuenasDecamp_2016 |
+ 0.400 |
+ 0.350 |
+ 0.300 |
+ 0.300 |
+ 0.350 |
+ 0.350 |
+ 0.025 |
+ 0.375 |
+ 0.375 |
+ 0.350 |
+ 0.300 |
+ 0.400 |
+ 0.350 |
+ 0.025 |
+ 0.025 |
+ 0.050 |
+ 0.075 |
+ 0.050 |
+ 0.175 |
+ 0.400 |
+ 0.275 |
+ 0.350 |
+ 0.375 |
+ 0.350 |
+ 0.250 |
+ 0.400 |
+ 0.300 |
+ 0.375 |
+ 0.400 |
+ 0.300 |
+ 0.425 |
+ 0.375 |
+ 0.325 |
+ 0.325 |
+ 0.375 |
+ 0.425 |
+ 0.350 |
+ 0.375 |
+ 0.375 |
+ 0.375 |
+ 0.400 |
+ 0.400 |
+ 0.300 |
+ 0.000 |
+ 0.350 |
+ 0.325 |
+ 0.325 |
+ 0.300 |
+ 0.175 |
+ 0.150 |
+ 0.125 |
+ 0.275 |
+ 0.225 |
+ 0.150 |
+ 0.275 |
+ 0.200 |
+ 0.175 |
+ 0.225 |
+ 0.175 |
+ 0.175 |
+ 0.100 |
+ 0.125 |
+ 375 |
+ OrganismalFitness |
+ ENV_HV1B9 |
+ Medium |
+ Virus |
+
+
+ ENV_HV1BR_Haddox_2016 |
+ 0.206 |
+ 0.211 |
+ 0.227 |
+ 0.234 |
+ 0.231 |
+ 0.228 |
+ 0.084 |
+ 0.214 |
+ 0.203 |
+ 0.206 |
+ 0.155 |
+ 0.193 |
+ 0.191 |
+ 0.099 |
+ 0.103 |
+ 0.099 |
+ 0.108 |
+ 0.113 |
+ 0.130 |
+ 0.239 |
+ 0.198 |
+ 0.212 |
+ 0.233 |
+ 0.239 |
+ 0.189 |
+ 0.218 |
+ 0.233 |
+ 0.211 |
+ 0.237 |
+ 0.171 |
+ 0.198 |
+ 0.172 |
+ 0.128 |
+ 0.200 |
+ 0.215 |
+ 0.225 |
+ 0.211 |
+ 0.213 |
+ 0.221 |
+ 0.230 |
+ 0.232 |
+ 0.235 |
+ 0.165 |
+ 0.076 |
+ 0.196 |
+ 0.175 |
+ 0.139 |
+ 0.193 |
+ 0.104 |
+ 0.113 |
+ 0.123 |
+ 0.134 |
+ 0.145 |
+ 0.144 |
+ 0.150 |
+ 0.141 |
+ 0.134 |
+ 0.133 |
+ 0.135 |
+ 0.145 |
+ 0.129 |
+ 0.113 |
+ 12863 |
+ OrganismalFitness |
+ ENV_HV1BR |
+ Medium |
+ Virus |
+
+
+ ENVZ_ECOLI_Ghose_2023 |
+ 0.133 |
+ 0.115 |
+ 0.124 |
+ 0.106 |
+ 0.115 |
+ 0.097 |
+ 0.097 |
+ 0.115 |
+ 0.124 |
+ 0.115 |
+ 0.080 |
+ 0.088 |
+ 0.071 |
+ 0.097 |
+ 0.124 |
+ 0.080 |
+ 0.071 |
+ 0.088 |
+ 0.080 |
+ 0.080 |
+ 0.097 |
+ 0.053 |
+ 0.106 |
+ 0.062 |
+ 0.133 |
+ 0.062 |
+ 0.088 |
+ 0.080 |
+ 0.062 |
+ 0.088 |
+ 0.080 |
+ 0.115 |
+ 0.133 |
+ 0.106 |
+ 0.097 |
+ 0.062 |
+ 0.106 |
+ 0.097 |
+ 0.088 |
+ 0.115 |
+ 0.106 |
+ 0.106 |
+ 0.124 |
+ 0.088 |
+ 0.115 |
+ 0.053 |
+ 0.062 |
+ 0.097 |
+ 0.071 |
+ 0.080 |
+ 0.088 |
+ 0.088 |
+ 0.088 |
+ 0.080 |
+ 0.080 |
+ 0.062 |
+ 0.097 |
+ 0.080 |
+ 0.080 |
+ 0.080 |
+ 0.071 |
+ 0.088 |
+ 1121 |
+ Activity |
+ ENVZ_ECOLI |
+ High |
+ Prokaryote |
+
+
+ EPHB2_HUMAN_Tsuboyama_2023_1F0M |
+ 0.321 |
+ 0.327 |
+ 0.383 |
+ 0.367 |
+ 0.388 |
+ 0.388 |
+ 0.005 |
+ 0.296 |
+ 0.372 |
+ 0.372 |
+ 0.311 |
+ 0.327 |
+ 0.301 |
+ 0.015 |
+ 0.327 |
+ 0.347 |
+ 0.372 |
+ 0.388 |
+ 0.347 |
+ 0.357 |
+ 0.347 |
+ 0.347 |
+ 0.306 |
+ 0.321 |
+ 0.327 |
+ 0.316 |
+ 0.372 |
+ 0.352 |
+ 0.337 |
+ 0.281 |
+ 0.311 |
+ 0.235 |
+ 0.235 |
+ 0.357 |
+ 0.301 |
+ 0.357 |
+ 0.398 |
+ 0.352 |
+ 0.383 |
+ 0.403 |
+ 0.362 |
+ 0.378 |
+ 0.260 |
+ 0.112 |
+ 0.327 |
+ 0.327 |
+ 0.444 |
+ 0.378 |
+ 0.423 |
+ 0.408 |
+ 0.383 |
+ 0.398 |
+ 0.388 |
+ 0.429 |
+ 0.403 |
+ 0.403 |
+ 0.388 |
+ 0.398 |
+ 0.408 |
+ 0.393 |
+ 0.423 |
+ 0.429 |
+ 1960 |
+ Stability |
+ EPHB2_HUMAN |
+ High |
+ Human |
+
+
+ ERBB2_HUMAN_Elazar_2016 |
+ 0.121 |
+ 0.242 |
+ 0.121 |
+ 0.152 |
+ 0.212 |
+ 0.212 |
+ 0.152 |
+ 0.182 |
+ 0.242 |
+ 0.303 |
+ 0.121 |
+ 0.121 |
+ 0.182 |
+ 0.182 |
+ 0.182 |
+ 0.152 |
+ 0.242 |
+ 0.182 |
+ 0.212 |
+ 0.091 |
+ 0.152 |
+ 0.182 |
+ 0.182 |
+ 0.182 |
+ 0.152 |
+ 0.182 |
+ 0.212 |
+ 0.273 |
+ 0.182 |
+ 0.152 |
+ 0.091 |
+ 0.121 |
+ 0.182 |
+ 0.152 |
+ 0.091 |
+ 0.182 |
+ 0.182 |
+ 0.121 |
+ 0.182 |
+ 0.212 |
+ 0.212 |
+ 0.182 |
+ 0.152 |
+ 0.182 |
+ 0.212 |
+ 0.121 |
+ 0.212 |
+ 0.182 |
+ 0.030 |
+ 0.091 |
+ 0.152 |
+ 0.152 |
+ 0.182 |
+ 0.152 |
+ 0.121 |
+ 0.121 |
+ 0.121 |
+ 0.182 |
+ 0.212 |
+ 0.121 |
+ 0.121 |
+ 0.152 |
+ 326 |
+ Expression |
+ ERBB2_HUMAN |
+ Low |
+ Human |
+
+
+ ESTA_BACSU_Nutschel_2020 |
+ 0.234 |
+ 0.271 |
+ 0.275 |
+ 0.266 |
+ 0.252 |
+ 0.261 |
+ 0.220 |
+ 0.188 |
+ 0.202 |
+ 0.284 |
+ 0.229 |
+ 0.280 |
+ 0.280 |
+ 0.229 |
+ 0.243 |
+ 0.220 |
+ 0.225 |
+ 0.197 |
+ 0.225 |
+ 0.234 |
+ 0.257 |
+ 0.261 |
+ 0.284 |
+ 0.220 |
+ 0.307 |
+ 0.252 |
+ 0.294 |
+ 0.312 |
+ 0.257 |
+ 0.206 |
+ 0.179 |
+ 0.193 |
+ 0.087 |
+ 0.243 |
+ 0.229 |
+ 0.165 |
+ 0.261 |
+ 0.248 |
+ 0.161 |
+ 0.289 |
+ 0.284 |
+ 0.243 |
+ 0.294 |
+ 0.206 |
+ 0.284 |
+ 0.275 |
+ 0.427 |
+ 0.353 |
+ 0.298 |
+ 0.280 |
+ 0.284 |
+ 0.234 |
+ 0.275 |
+ 0.289 |
+ 0.312 |
+ 0.294 |
+ 0.280 |
+ 0.248 |
+ 0.271 |
+ 0.289 |
+ 0.225 |
+ 0.271 |
+ 2172 |
+ Stability |
+ ESTA_BACSU |
+ High |
+ Prokaryote |
+
+
+ F7YBW8_MESOW_Aakre_2015 |
+ 0.087 |
+ 0.609 |
+ 0.564 |
+ 0.672 |
+ 0.626 |
+ 0.642 |
+ 0.095 |
+ 0.448 |
+ 0.492 |
+ 0.523 |
+ 0.620 |
+ 0.503 |
+ 0.528 |
+ 0.063 |
+ 0.067 |
+ 0.112 |
+ 0.480 |
+ 0.399 |
+ 0.618 |
+ 0.567 |
+ 0.065 |
+ 0.028 |
+ 0.062 |
+ 0.093 |
+ 0.103 |
+ 0.253 |
+ 0.063 |
+ 0.426 |
+ 0.625 |
+ 0.554 |
+ 0.699 |
+ 0.659 |
+ 0.068 |
+ 0.062 |
+ 0.053 |
+ 0.690 |
+ 0.092 |
+ 0.072 |
+ 0.647 |
+ 0.517 |
+ 0.493 |
+ 0.680 |
+ 0.100 |
+ 0.084 |
+ 0.316 |
+ 0.109 |
+ 0.096 |
+ 0.383 |
+ 0.136 |
+ 0.080 |
+ 0.624 |
+ 0.657 |
+ 0.601 |
+ 0.677 |
+ 0.649 |
+ 0.667 |
+ 0.601 |
+ 0.614 |
+ 0.639 |
+ 0.647 |
+ 0.323 |
+ 0.028 |
+ 9192 |
+ OrganismalFitness |
+ F7YBW8_MESOW |
+ High |
+ Prokaryote |
+
+
+ FECA_ECOLI_Tsuboyama_2023_2D1U |
+ 0.212 |
+ 0.243 |
+ 0.233 |
+ 0.217 |
+ 0.249 |
+ 0.233 |
+ 0.111 |
+ 0.116 |
+ 0.228 |
+ 0.275 |
+ 0.275 |
+ 0.228 |
+ 0.243 |
+ 0.116 |
+ 0.286 |
+ 0.328 |
+ 0.296 |
+ 0.249 |
+ 0.254 |
+ 0.222 |
+ 0.185 |
+ 0.175 |
+ 0.169 |
+ 0.169 |
+ 0.159 |
+ 0.185 |
+ 0.249 |
+ 0.238 |
+ 0.190 |
+ 0.201 |
+ 0.185 |
+ 0.159 |
+ 0.164 |
+ 0.122 |
+ 0.153 |
+ 0.175 |
+ 0.217 |
+ 0.180 |
+ 0.212 |
+ 0.212 |
+ 0.206 |
+ 0.217 |
+ 0.291 |
+ 0.138 |
+ 0.280 |
+ 0.286 |
+ 0.312 |
+ 0.270 |
+ 0.265 |
+ 0.286 |
+ 0.254 |
+ 0.265 |
+ 0.291 |
+ 0.265 |
+ 0.291 |
+ 0.249 |
+ 0.249 |
+ 0.275 |
+ 0.270 |
+ 0.249 |
+ 0.296 |
+ 0.291 |
+ 1886 |
+ Stability |
+ FECA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ FKBP3_HUMAN_Tsuboyama_2023_2KFV |
+ 0.153 |
+ 0.177 |
+ 0.169 |
+ 0.177 |
+ 0.210 |
+ 0.202 |
+ 0.040 |
+ 0.097 |
+ 0.089 |
+ 0.089 |
+ 0.073 |
+ 0.081 |
+ 0.065 |
+ 0.056 |
+ 0.056 |
+ 0.073 |
+ 0.105 |
+ 0.089 |
+ 0.089 |
+ 0.073 |
+ 0.065 |
+ 0.097 |
+ 0.024 |
+ 0.177 |
+ 0.065 |
+ 0.185 |
+ 0.121 |
+ 0.137 |
+ 0.089 |
+ 0.145 |
+ 0.081 |
+ 0.081 |
+ 0.081 |
+ 0.048 |
+ 0.097 |
+ 0.145 |
+ 0.169 |
+ 0.153 |
+ 0.169 |
+ 0.185 |
+ 0.177 |
+ 0.169 |
+ 0.056 |
+ 0.073 |
+ 0.121 |
+ 0.081 |
+ 0.218 |
+ 0.258 |
+ 0.282 |
+ 0.234 |
+ 0.218 |
+ 0.274 |
+ 0.234 |
+ 0.177 |
+ 0.226 |
+ 0.250 |
+ 0.202 |
+ 0.202 |
+ 0.194 |
+ 0.234 |
+ 0.218 |
+ 0.177 |
+ 1237 |
+ Stability |
+ FKBP3_HUMAN |
+ Medium |
+ Human |
+
+
+ GAL4_YEAST_Kitzman_2015 |
+ 0.075 |
+ 0.100 |
+ 0.250 |
+ 0.175 |
+ 0.183 |
+ 0.192 |
+ 0.092 |
+ 0.100 |
+ 0.250 |
+ 0.208 |
+ 0.200 |
+ 0.158 |
+ 0.142 |
+ 0.108 |
+ 0.167 |
+ 0.158 |
+ 0.242 |
+ 0.258 |
+ 0.242 |
+ 0.158 |
+ 0.133 |
+ 0.092 |
+ 0.150 |
+ 0.150 |
+ 0.142 |
+ 0.125 |
+ 0.150 |
+ 0.158 |
+ 0.267 |
+ 0.183 |
+ 0.242 |
+ 0.267 |
+ 0.158 |
+ 0.092 |
+ 0.150 |
+ 0.100 |
+ 0.167 |
+ 0.150 |
+ 0.167 |
+ 0.183 |
+ 0.192 |
+ 0.217 |
+ 0.117 |
+ 0.192 |
+ 0.208 |
+ 0.125 |
+ 0.075 |
+ 0.200 |
+ 0.075 |
+ 0.083 |
+ 0.267 |
+ 0.200 |
+ 0.275 |
+ 0.242 |
+ 0.217 |
+ 0.217 |
+ 0.275 |
+ 0.217 |
+ 0.242 |
+ 0.250 |
+ 0.175 |
+ 0.183 |
+ 1195 |
+ OrganismalFitness |
+ GAL4_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ GCN4_YEAST_Staller_2018 |
+ 0.242 |
+ 0.223 |
+ 0.201 |
+ 0.231 |
+ 0.227 |
+ 0.220 |
+ 0.091 |
+ 0.117 |
+ 0.227 |
+ 0.220 |
+ 0.197 |
+ 0.178 |
+ 0.182 |
+ 0.174 |
+ 0.140 |
+ 0.186 |
+ 0.227 |
+ 0.220 |
+ 0.223 |
+ 0.220 |
+ 0.064 |
+ 0.064 |
+ 0.076 |
+ 0.064 |
+ 0.053 |
+ 0.042 |
+ 0.053 |
+ 0.061 |
+ 0.144 |
+ 0.220 |
+ 0.208 |
+ 0.212 |
+ 0.095 |
+ 0.053 |
+ 0.049 |
+ 0.223 |
+ 0.231 |
+ 0.231 |
+ 0.239 |
+ 0.223 |
+ 0.220 |
+ 0.246 |
+ 0.148 |
+ 0.140 |
+ 0.152 |
+ 0.186 |
+ 0.170 |
+ 0.216 |
+ 0.152 |
+ 0.148 |
+ 0.193 |
+ 0.182 |
+ 0.186 |
+ 0.186 |
+ 0.197 |
+ 0.208 |
+ 0.201 |
+ 0.197 |
+ 0.197 |
+ 0.197 |
+ 0.136 |
+ 0.125 |
+ 2638 |
+ Binding |
+ GCN4_YEAST |
+ Low |
+ Eukaryote |
+
+
+ GDIA_HUMAN_Silverstein_2021 |
+ 0.121 |
+ 0.112 |
+ 0.129 |
+ 0.138 |
+ 0.138 |
+ 0.147 |
+ 0.155 |
+ 0.129 |
+ 0.121 |
+ 0.129 |
+ 0.138 |
+ 0.112 |
+ 0.121 |
+ 0.069 |
+ 0.086 |
+ 0.121 |
+ 0.138 |
+ 0.138 |
+ 0.138 |
+ 0.138 |
+ 0.095 |
+ 0.164 |
+ 0.155 |
+ 0.190 |
+ 0.121 |
+ 0.164 |
+ 0.138 |
+ 0.129 |
+ 0.138 |
+ 0.147 |
+ 0.172 |
+ 0.181 |
+ 0.112 |
+ 0.121 |
+ 0.103 |
+ 0.112 |
+ 0.138 |
+ 0.121 |
+ 0.129 |
+ 0.138 |
+ 0.121 |
+ 0.147 |
+ 0.112 |
+ 0.052 |
+ 0.155 |
+ 0.103 |
+ 0.172 |
+ 0.164 |
+ 0.147 |
+ 0.078 |
+ 0.138 |
+ 0.147 |
+ 0.147 |
+ 0.164 |
+ 0.164 |
+ 0.181 |
+ 0.155 |
+ 0.138 |
+ 0.164 |
+ 0.155 |
+ 0.138 |
+ 0.121 |
+ 1154 |
+ OrganismalFitness |
+ GDIA_HUMAN |
+ Low |
+ Human |
+
+
+ GFP_AEQVI_Sarkisyan_2016 |
+ 0.277 |
+ 0.275 |
+ 0.280 |
+ 0.279 |
+ 0.287 |
+ 0.287 |
+ 0.061 |
+ 0.256 |
+ 0.297 |
+ 0.298 |
+ 0.188 |
+ 0.083 |
+ 0.086 |
+ 0.085 |
+ 0.092 |
+ 0.079 |
+ 0.078 |
+ 0.090 |
+ 0.112 |
+ 0.241 |
+ 0.074 |
+ 0.073 |
+ 0.089 |
+ 0.080 |
+ 0.066 |
+ 0.098 |
+ 0.118 |
+ 0.259 |
+ 0.268 |
+ 0.296 |
+ 0.238 |
+ 0.239 |
+ 0.087 |
+ 0.069 |
+ 0.093 |
+ 0.249 |
+ 0.285 |
+ 0.283 |
+ 0.279 |
+ 0.288 |
+ 0.289 |
+ 0.304 |
+ 0.128 |
+ 0.110 |
+ 0.134 |
+ 0.134 |
+ 0.279 |
+ 0.280 |
+ 0.308 |
+ 0.254 |
+ 0.235 |
+ 0.239 |
+ 0.250 |
+ 0.253 |
+ 0.243 |
+ 0.251 |
+ 0.241 |
+ 0.245 |
+ 0.233 |
+ 0.247 |
+ 0.262 |
+ 0.188 |
+ 51714 |
+ Activity |
+ GFP_AEQVI |
+ Low |
+ Eukaryote |
+
+
+ GLPA_HUMAN_Elazar_2016 |
+ 0.320 |
+ 0.200 |
+ 0.360 |
+ 0.360 |
+ 0.360 |
+ 0.360 |
+ 0.280 |
+ 0.280 |
+ 0.200 |
+ 0.200 |
+ 0.200 |
+ 0.400 |
+ 0.320 |
+ 0.280 |
+ 0.280 |
+ 0.320 |
+ 0.360 |
+ 0.320 |
+ 0.200 |
+ 0.200 |
+ 0.320 |
+ 0.280 |
+ 0.360 |
+ 0.320 |
+ 0.240 |
+ 0.360 |
+ 0.320 |
+ 0.400 |
+ 0.240 |
+ 0.200 |
+ 0.280 |
+ 0.240 |
+ 0.240 |
+ 0.280 |
+ 0.320 |
+ 0.360 |
+ 0.400 |
+ 0.400 |
+ 0.440 |
+ 0.360 |
+ 0.360 |
+ 0.360 |
+ 0.280 |
+ 0.240 |
+ 0.400 |
+ 0.320 |
+ 0.280 |
+ 0.200 |
+ 0.320 |
+ 0.160 |
+ 0.480 |
+ 0.440 |
+ 0.320 |
+ 0.360 |
+ 0.400 |
+ 0.280 |
+ 0.320 |
+ 0.480 |
+ 0.360 |
+ 0.400 |
+ 0.320 |
+ 0.400 |
+ 245 |
+ Expression |
+ GLPA_HUMAN |
+ Low |
+ Human |
+
+
+ GRB2_HUMAN_Faure_2021 |
+ 0.303 |
+ 0.360 |
+ 0.354 |
+ 0.363 |
+ 0.376 |
+ 0.384 |
+ 0.317 |
+ 0.288 |
+ 0.380 |
+ 0.339 |
+ 0.365 |
+ 0.305 |
+ 0.345 |
+ 0.321 |
+ 0.384 |
+ 0.425 |
+ 0.442 |
+ 0.346 |
+ 0.393 |
+ 0.366 |
+ 0.371 |
+ 0.357 |
+ 0.328 |
+ 0.310 |
+ 0.343 |
+ 0.347 |
+ 0.287 |
+ 0.381 |
+ 0.283 |
+ 0.345 |
+ 0.281 |
+ 0.266 |
+ 0.343 |
+ 0.377 |
+ 0.362 |
+ 0.245 |
+ 0.381 |
+ 0.376 |
+ 0.291 |
+ 0.399 |
+ 0.402 |
+ 0.375 |
+ 0.398 |
+ 0.196 |
+ 0.350 |
+ 0.362 |
+ 0.460 |
+ 0.337 |
+ 0.491 |
+ 0.345 |
+ 0.416 |
+ 0.437 |
+ 0.415 |
+ 0.427 |
+ 0.438 |
+ 0.434 |
+ 0.417 |
+ 0.437 |
+ 0.412 |
+ 0.439 |
+ 0.359 |
+ 0.402 |
+ 63366 |
+ OrganismalFitness |
+ GRB2_HUMAN |
+ Medium |
+ Human |
+
+
+ HCP_LAMBD_Tsuboyama_2023_2L6Q |
+ 0.269 |
+ 0.260 |
+ 0.192 |
+ 0.212 |
+ 0.202 |
+ 0.221 |
+ 0.192 |
+ 0.192 |
+ 0.125 |
+ 0.192 |
+ 0.346 |
+ 0.269 |
+ 0.317 |
+ 0.221 |
+ 0.269 |
+ 0.298 |
+ 0.337 |
+ 0.231 |
+ 0.221 |
+ 0.106 |
+ 0.125 |
+ 0.183 |
+ 0.212 |
+ 0.260 |
+ 0.231 |
+ 0.260 |
+ 0.221 |
+ 0.212 |
+ 0.260 |
+ 0.212 |
+ 0.231 |
+ 0.163 |
+ 0.173 |
+ 0.106 |
+ 0.163 |
+ 0.240 |
+ 0.269 |
+ 0.279 |
+ 0.260 |
+ 0.231 |
+ 0.250 |
+ 0.250 |
+ 0.260 |
+ 0.173 |
+ 0.298 |
+ 0.212 |
+ 0.250 |
+ 0.337 |
+ 0.269 |
+ 0.202 |
+ 0.298 |
+ 0.317 |
+ 0.356 |
+ 0.317 |
+ 0.356 |
+ 0.337 |
+ 0.337 |
+ 0.327 |
+ 0.317 |
+ 0.356 |
+ 0.423 |
+ 0.279 |
+ 1040 |
+ Stability |
+ HCP_LAMBD |
+ Medium |
+ Virus |
+
+
+ HECD1_HUMAN_Tsuboyama_2023_3DKM |
+ 0.581 |
+ 0.547 |
+ 0.619 |
+ 0.633 |
+ 0.649 |
+ 0.651 |
+ 0.222 |
+ 0.469 |
+ 0.606 |
+ 0.623 |
+ 0.599 |
+ 0.174 |
+ 0.229 |
+ 0.143 |
+ 0.152 |
+ 0.608 |
+ 0.617 |
+ 0.605 |
+ 0.614 |
+ 0.581 |
+ 0.191 |
+ 0.503 |
+ 0.538 |
+ 0.538 |
+ 0.501 |
+ 0.549 |
+ 0.544 |
+ 0.565 |
+ 0.533 |
+ 0.596 |
+ 0.642 |
+ 0.581 |
+ 0.109 |
+ 0.199 |
+ 0.168 |
+ 0.333 |
+ 0.606 |
+ 0.574 |
+ 0.606 |
+ 0.630 |
+ 0.617 |
+ 0.623 |
+ 0.243 |
+ 0.231 |
+ 0.608 |
+ 0.343 |
+ 0.460 |
+ 0.592 |
+ 0.474 |
+ 0.345 |
+ 0.639 |
+ 0.606 |
+ 0.628 |
+ 0.615 |
+ 0.644 |
+ 0.653 |
+ 0.630 |
+ 0.639 |
+ 0.646 |
+ 0.644 |
+ 0.673 |
+ 0.220 |
+ 5586 |
+ Stability |
+ HECD1_HUMAN |
+ Medium |
+ Human |
+
+
+ HEM3_HUMAN_Loggerenberg_2023 |
+ 0.165 |
+ 0.174 |
+ 0.172 |
+ 0.172 |
+ 0.174 |
+ 0.174 |
+ 0.121 |
+ 0.063 |
+ 0.181 |
+ 0.178 |
+ 0.155 |
+ 0.167 |
+ 0.146 |
+ 0.090 |
+ 0.172 |
+ 0.183 |
+ 0.155 |
+ 0.169 |
+ 0.163 |
+ 0.072 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.167 |
+ 0.160 |
+ 0.172 |
+ 0.163 |
+ 0.160 |
+ 0.176 |
+ 0.183 |
+ 0.158 |
+ 0.156 |
+ 0.109 |
+ 0.172 |
+ 0.193 |
+ 0.165 |
+ 0.163 |
+ 0.190 |
+ 0.158 |
+ 0.185 |
+ 0.181 |
+ 0.172 |
+ 0.121 |
+ 0.098 |
+ 0.165 |
+ 0.158 |
+ 0.163 |
+ 0.188 |
+ 0.139 |
+ 0.144 |
+ 0.167 |
+ 0.172 |
+ 0.169 |
+ 0.167 |
+ 0.165 |
+ 0.170 |
+ 0.172 |
+ 0.172 |
+ 0.170 |
+ 0.170 |
+ 0.167 |
+ 0.178 |
+ 5689 |
+ Activity |
+ HEM3_HUMAN |
+ Medium |
+ Human |
+
+
+ HIS7_YEAST_Pokusaeva_2019 |
+ 0.165 |
+ 0.202 |
+ 0.209 |
+ 0.209 |
+ 0.203 |
+ 0.199 |
+ 0.133 |
+ 0.086 |
+ 0.188 |
+ 0.191 |
+ 0.204 |
+ 0.174 |
+ 0.185 |
+ 0.100 |
+ 0.130 |
+ 0.179 |
+ 0.200 |
+ 0.202 |
+ 0.194 |
+ 0.170 |
+ 0.175 |
+ 0.198 |
+ 0.225 |
+ 0.211 |
+ 0.203 |
+ 0.213 |
+ 0.191 |
+ 0.194 |
+ 0.191 |
+ 0.177 |
+ 0.199 |
+ 0.202 |
+ 0.088 |
+ 0.211 |
+ 0.212 |
+ 0.197 |
+ 0.190 |
+ 0.188 |
+ 0.186 |
+ 0.209 |
+ 0.205 |
+ 0.202 |
+ 0.148 |
+ 0.132 |
+ 0.197 |
+ 0.109 |
+ 0.182 |
+ 0.209 |
+ 0.188 |
+ 0.154 |
+ 0.196 |
+ 0.178 |
+ 0.190 |
+ 0.193 |
+ 0.194 |
+ 0.191 |
+ 0.193 |
+ 0.196 |
+ 0.196 |
+ 0.193 |
+ 0.203 |
+ 0.107 |
+ 496137 |
+ OrganismalFitness |
+ HIS7_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HMDH_HUMAN_Jiang_2019 |
+ 0.135 |
+ 0.122 |
+ 0.124 |
+ 0.120 |
+ 0.128 |
+ 0.122 |
+ 0.133 |
+ 0.193 |
+ 0.121 |
+ 0.122 |
+ 0.126 |
+ 0.130 |
+ 0.123 |
+ 0.122 |
+ 0.123 |
+ 0.136 |
+ 0.149 |
+ 0.130 |
+ 0.133 |
+ 0.111 |
+ 0.137 |
+ 0.108 |
+ 0.116 |
+ 0.125 |
+ 0.143 |
+ 0.106 |
+ 0.108 |
+ 0.108 |
+ 0.124 |
+ 0.120 |
+ 0.127 |
+ 0.127 |
+ 0.114 |
+ 0.124 |
+ 0.111 |
+ 0.120 |
+ 0.125 |
+ 0.125 |
+ 0.122 |
+ 0.116 |
+ 0.120 |
+ 0.120 |
+ 0.124 |
+ 0.117 |
+ 0.161 |
+ 0.160 |
+ 0.132 |
+ 0.129 |
+ 0.105 |
+ 0.126 |
+ 0.145 |
+ 0.129 |
+ 0.140 |
+ 0.149 |
+ 0.144 |
+ 0.150 |
+ 0.143 |
+ 0.151 |
+ 0.140 |
+ 0.139 |
+ 0.155 |
+ 0.148 |
+ 16853 |
+ OrganismalFitness |
+ HMDH_HUMAN |
+ Low |
+ Human |
+
+
+ HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2 |
+ 0.102 |
+ 0.142 |
+ 0.142 |
+ 0.155 |
+ 0.137 |
+ 0.146 |
+ 0.119 |
+ 0.168 |
+ 0.133 |
+ 0.137 |
+ 0.137 |
+ 0.137 |
+ 0.146 |
+ 0.080 |
+ 0.066 |
+ 0.115 |
+ 0.124 |
+ 0.150 |
+ 0.111 |
+ 0.088 |
+ 0.164 |
+ 0.168 |
+ 0.173 |
+ 0.159 |
+ 0.150 |
+ 0.159 |
+ 0.150 |
+ 0.173 |
+ 0.164 |
+ 0.159 |
+ 0.190 |
+ 0.190 |
+ 0.084 |
+ 0.159 |
+ 0.168 |
+ 0.155 |
+ 0.119 |
+ 0.137 |
+ 0.133 |
+ 0.155 |
+ 0.150 |
+ 0.155 |
+ 0.106 |
+ 0.075 |
+ 0.159 |
+ 0.142 |
+ 0.119 |
+ 0.155 |
+ 0.115 |
+ 0.111 |
+ 0.142 |
+ 0.137 |
+ 0.119 |
+ 0.111 |
+ 0.133 |
+ 0.133 |
+ 0.137 |
+ 0.155 |
+ 0.142 |
+ 0.133 |
+ 0.142 |
+ 0.159 |
+ 2252 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Flynn_2019 |
+ 0.125 |
+ 0.092 |
+ 0.108 |
+ 0.109 |
+ 0.109 |
+ 0.111 |
+ 0.106 |
+ 0.129 |
+ 0.110 |
+ 0.108 |
+ 0.115 |
+ 0.111 |
+ 0.107 |
+ 0.094 |
+ 0.114 |
+ 0.111 |
+ 0.156 |
+ 0.150 |
+ 0.139 |
+ 0.126 |
+ 0.114 |
+ 0.108 |
+ 0.116 |
+ 0.110 |
+ 0.128 |
+ 0.128 |
+ 0.108 |
+ 0.119 |
+ 0.105 |
+ 0.097 |
+ 0.126 |
+ 0.115 |
+ 0.095 |
+ 0.120 |
+ 0.129 |
+ 0.120 |
+ 0.119 |
+ 0.132 |
+ 0.126 |
+ 0.111 |
+ 0.117 |
+ 0.109 |
+ 0.108 |
+ 0.095 |
+ 0.115 |
+ 0.119 |
+ 0.117 |
+ 0.112 |
+ 0.086 |
+ 0.120 |
+ 0.150 |
+ 0.140 |
+ 0.146 |
+ 0.145 |
+ 0.146 |
+ 0.149 |
+ 0.143 |
+ 0.138 |
+ 0.135 |
+ 0.150 |
+ 0.109 |
+ 0.138 |
+ 13294 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HSP82_YEAST_Mishra_2016 |
+ 0.159 |
+ 0.182 |
+ 0.178 |
+ 0.194 |
+ 0.185 |
+ 0.187 |
+ 0.129 |
+ 0.127 |
+ 0.182 |
+ 0.199 |
+ 0.180 |
+ 0.194 |
+ 0.192 |
+ 0.074 |
+ 0.088 |
+ 0.111 |
+ 0.162 |
+ 0.159 |
+ 0.132 |
+ 0.155 |
+ 0.215 |
+ 0.192 |
+ 0.206 |
+ 0.182 |
+ 0.210 |
+ 0.196 |
+ 0.199 |
+ 0.196 |
+ 0.187 |
+ 0.180 |
+ 0.196 |
+ 0.206 |
+ 0.118 |
+ 0.182 |
+ 0.176 |
+ 0.192 |
+ 0.189 |
+ 0.178 |
+ 0.173 |
+ 0.182 |
+ 0.189 |
+ 0.189 |
+ 0.189 |
+ 0.051 |
+ 0.173 |
+ 0.199 |
+ 0.104 |
+ 0.176 |
+ 0.079 |
+ 0.127 |
+ 0.129 |
+ 0.125 |
+ 0.125 |
+ 0.143 |
+ 0.176 |
+ 0.150 |
+ 0.141 |
+ 0.141 |
+ 0.148 |
+ 0.150 |
+ 0.171 |
+ 0.164 |
+ 4323 |
+ OrganismalFitness |
+ HSP82_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ HXK4_HUMAN_Gersing_2022_activity |
+ 0.232 |
+ 0.161 |
+ 0.149 |
+ 0.151 |
+ 0.156 |
+ 0.156 |
+ 0.098 |
+ 0.127 |
+ 0.149 |
+ 0.148 |
+ 0.128 |
+ 0.116 |
+ 0.135 |
+ 0.117 |
+ 0.127 |
+ 0.172 |
+ 0.174 |
+ 0.151 |
+ 0.138 |
+ 0.120 |
+ 0.140 |
+ 0.117 |
+ 0.141 |
+ 0.134 |
+ 0.140 |
+ 0.140 |
+ 0.153 |
+ 0.138 |
+ 0.125 |
+ 0.109 |
+ 0.133 |
+ 0.119 |
+ 0.085 |
+ 0.144 |
+ 0.139 |
+ 0.148 |
+ 0.165 |
+ 0.133 |
+ 0.155 |
+ 0.162 |
+ 0.140 |
+ 0.158 |
+ 0.135 |
+ 0.107 |
+ 0.140 |
+ 0.144 |
+ 0.179 |
+ 0.160 |
+ 0.159 |
+ 0.142 |
+ 0.162 |
+ 0.176 |
+ 0.162 |
+ 0.173 |
+ 0.165 |
+ 0.176 |
+ 0.182 |
+ 0.163 |
+ 0.166 |
+ 0.170 |
+ 0.170 |
+ 0.162 |
+ 8570 |
+ OrganismalFitness |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ HXK4_HUMAN_Gersing_2023_abundance |
+ 0.152 |
+ 0.160 |
+ 0.162 |
+ 0.173 |
+ 0.154 |
+ 0.160 |
+ 0.086 |
+ 0.160 |
+ 0.156 |
+ 0.179 |
+ 0.162 |
+ 0.173 |
+ 0.192 |
+ 0.090 |
+ 0.113 |
+ 0.148 |
+ 0.173 |
+ 0.167 |
+ 0.174 |
+ 0.185 |
+ 0.165 |
+ 0.186 |
+ 0.186 |
+ 0.190 |
+ 0.173 |
+ 0.185 |
+ 0.194 |
+ 0.183 |
+ 0.170 |
+ 0.187 |
+ 0.165 |
+ 0.165 |
+ 0.111 |
+ 0.164 |
+ 0.171 |
+ 0.193 |
+ 0.167 |
+ 0.179 |
+ 0.187 |
+ 0.158 |
+ 0.162 |
+ 0.165 |
+ 0.123 |
+ 0.106 |
+ 0.164 |
+ 0.163 |
+ 0.158 |
+ 0.154 |
+ 0.163 |
+ 0.129 |
+ 0.162 |
+ 0.161 |
+ 0.162 |
+ 0.148 |
+ 0.167 |
+ 0.167 |
+ 0.175 |
+ 0.163 |
+ 0.165 |
+ 0.165 |
+ 0.175 |
+ 0.155 |
+ 8396 |
+ Expression |
+ HXK4_HUMAN |
+ Medium |
+ Human |
+
+
+ I6TAH8_I68A0_Doud_2015 |
+ 0.277 |
+ 0.262 |
+ 0.234 |
+ 0.233 |
+ 0.285 |
+ 0.282 |
+ 0.083 |
+ 0.225 |
+ 0.247 |
+ 0.275 |
+ 0.092 |
+ 0.086 |
+ 0.087 |
+ 0.082 |
+ 0.077 |
+ 0.088 |
+ 0.100 |
+ 0.091 |
+ 0.124 |
+ 0.216 |
+ 0.230 |
+ 0.253 |
+ 0.261 |
+ 0.263 |
+ 0.077 |
+ 0.094 |
+ 0.119 |
+ 0.083 |
+ 0.238 |
+ 0.284 |
+ 0.185 |
+ 0.188 |
+ 0.143 |
+ 0.261 |
+ 0.251 |
+ 0.252 |
+ 0.270 |
+ 0.266 |
+ 0.260 |
+ 0.283 |
+ 0.280 |
+ 0.268 |
+ 0.080 |
+ 0.088 |
+ 0.092 |
+ 0.086 |
+ 0.186 |
+ 0.185 |
+ 0.186 |
+ 0.146 |
+ 0.103 |
+ 0.136 |
+ 0.130 |
+ 0.136 |
+ 0.121 |
+ 0.117 |
+ 0.105 |
+ 0.109 |
+ 0.100 |
+ 0.120 |
+ 0.124 |
+ 0.110 |
+ 9462 |
+ OrganismalFitness |
+ I6TAH8_I68A0 |
+ Medium |
+ Virus |
+
+
+ IF1_ECOLI_Kelsic_2016 |
+ 0.130 |
+ 0.217 |
+ 0.196 |
+ 0.232 |
+ 0.210 |
+ 0.210 |
+ 0.196 |
+ 0.159 |
+ 0.087 |
+ 0.087 |
+ 0.254 |
+ 0.181 |
+ 0.196 |
+ 0.101 |
+ 0.188 |
+ 0.261 |
+ 0.239 |
+ 0.196 |
+ 0.203 |
+ 0.196 |
+ 0.188 |
+ 0.232 |
+ 0.203 |
+ 0.203 |
+ 0.225 |
+ 0.217 |
+ 0.261 |
+ 0.225 |
+ 0.152 |
+ 0.304 |
+ 0.203 |
+ 0.159 |
+ 0.145 |
+ 0.210 |
+ 0.188 |
+ 0.210 |
+ 0.181 |
+ 0.152 |
+ 0.203 |
+ 0.203 |
+ 0.196 |
+ 0.203 |
+ 0.145 |
+ 0.130 |
+ 0.261 |
+ 0.232 |
+ 0.109 |
+ 0.232 |
+ 0.217 |
+ 0.174 |
+ 0.217 |
+ 0.268 |
+ 0.239 |
+ 0.239 |
+ 0.232 |
+ 0.246 |
+ 0.232 |
+ 0.210 |
+ 0.254 |
+ 0.239 |
+ 0.188 |
+ 0.188 |
+ 1367 |
+ OrganismalFitness |
+ IF1_ECOLI |
+ High |
+ Prokaryote |
+
+
+ ILF3_HUMAN_Tsuboyama_2023_2L33 |
+ 0.075 |
+ 0.075 |
+ 0.195 |
+ 0.203 |
+ 0.195 |
+ 0.211 |
+ 0.060 |
+ 0.120 |
+ 0.083 |
+ 0.120 |
+ 0.203 |
+ 0.120 |
+ 0.113 |
+ 0.053 |
+ 0.053 |
+ 0.075 |
+ 0.083 |
+ 0.165 |
+ 0.188 |
+ 0.188 |
+ 0.203 |
+ 0.173 |
+ 0.173 |
+ 0.188 |
+ 0.271 |
+ 0.211 |
+ 0.248 |
+ 0.180 |
+ 0.165 |
+ 0.218 |
+ 0.120 |
+ 0.075 |
+ 0.203 |
+ 0.083 |
+ 0.226 |
+ 0.233 |
+ 0.060 |
+ 0.158 |
+ 0.180 |
+ 0.105 |
+ 0.128 |
+ 0.165 |
+ 0.090 |
+ 0.060 |
+ 0.158 |
+ 0.128 |
+ 0.090 |
+ 0.150 |
+ 0.083 |
+ 0.150 |
+ 0.188 |
+ 0.105 |
+ 0.158 |
+ 0.150 |
+ 0.165 |
+ 0.165 |
+ 0.158 |
+ 0.143 |
+ 0.150 |
+ 0.158 |
+ 0.218 |
+ 0.090 |
+ 1329 |
+ Stability |
+ ILF3_HUMAN |
+ High |
+ Human |
+
+
+ ISDH_STAAW_Tsuboyama_2023_2LHR |
+ 0.226 |
+ 0.179 |
+ 0.236 |
+ 0.236 |
+ 0.221 |
+ 0.236 |
+ 0.313 |
+ 0.128 |
+ 0.226 |
+ 0.282 |
+ 0.246 |
+ 0.323 |
+ 0.344 |
+ 0.344 |
+ 0.369 |
+ 0.451 |
+ 0.374 |
+ 0.421 |
+ 0.395 |
+ 0.190 |
+ 0.318 |
+ 0.200 |
+ 0.195 |
+ 0.215 |
+ 0.349 |
+ 0.354 |
+ 0.369 |
+ 0.333 |
+ 0.251 |
+ 0.205 |
+ 0.179 |
+ 0.123 |
+ 0.159 |
+ 0.231 |
+ 0.241 |
+ 0.287 |
+ 0.236 |
+ 0.215 |
+ 0.267 |
+ 0.251 |
+ 0.231 |
+ 0.251 |
+ 0.313 |
+ 0.272 |
+ 0.349 |
+ 0.338 |
+ 0.451 |
+ 0.467 |
+ 0.333 |
+ 0.313 |
+ 0.251 |
+ 0.292 |
+ 0.297 |
+ 0.308 |
+ 0.292 |
+ 0.287 |
+ 0.287 |
+ 0.297 |
+ 0.272 |
+ 0.292 |
+ 0.415 |
+ 0.308 |
+ 1944 |
+ Stability |
+ ISDH_STAAW |
+ High |
+ Prokaryote |
+
+
+ KCNE1_HUMAN_Muhammad_2023_expression |
+ 0.043 |
+ 0.030 |
+ 0.021 |
+ 0.017 |
+ 0.047 |
+ 0.047 |
+ 0.077 |
+ 0.141 |
+ 0.030 |
+ 0.034 |
+ 0.085 |
+ 0.090 |
+ 0.073 |
+ 0.073 |
+ 0.077 |
+ 0.047 |
+ 0.073 |
+ 0.043 |
+ 0.017 |
+ 0.026 |
+ 0.073 |
+ 0.090 |
+ 0.081 |
+ 0.120 |
+ 0.085 |
+ 0.162 |
+ 0.068 |
+ 0.034 |
+ 0.034 |
+ 0.034 |
+ 0.038 |
+ 0.038 |
+ 0.111 |
+ 0.064 |
+ 0.107 |
+ 0.094 |
+ 0.064 |
+ 0.073 |
+ 0.073 |
+ 0.051 |
+ 0.047 |
+ 0.056 |
+ 0.081 |
+ 0.098 |
+ 0.013 |
+ 0.081 |
+ 0.068 |
+ 0.021 |
+ 0.085 |
+ 0.132 |
+ 0.068 |
+ 0.051 |
+ 0.064 |
+ 0.068 |
+ 0.056 |
+ 0.064 |
+ 0.073 |
+ 0.060 |
+ 0.060 |
+ 0.060 |
+ 0.051 |
+ 0.085 |
+ 2339 |
+ Expression |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNE1_HUMAN_Muhammad_2023_function |
+ 0.194 |
+ 0.224 |
+ 0.203 |
+ 0.207 |
+ 0.220 |
+ 0.220 |
+ 0.190 |
+ 0.155 |
+ 0.220 |
+ 0.237 |
+ 0.194 |
+ 0.233 |
+ 0.250 |
+ 0.190 |
+ 0.194 |
+ 0.246 |
+ 0.207 |
+ 0.220 |
+ 0.172 |
+ 0.168 |
+ 0.198 |
+ 0.177 |
+ 0.185 |
+ 0.177 |
+ 0.207 |
+ 0.190 |
+ 0.250 |
+ 0.228 |
+ 0.190 |
+ 0.194 |
+ 0.216 |
+ 0.185 |
+ 0.034 |
+ 0.203 |
+ 0.241 |
+ 0.216 |
+ 0.190 |
+ 0.203 |
+ 0.198 |
+ 0.224 |
+ 0.220 |
+ 0.211 |
+ 0.185 |
+ 0.159 |
+ 0.203 |
+ 0.185 |
+ 0.224 |
+ 0.181 |
+ 0.172 |
+ 0.134 |
+ 0.198 |
+ 0.220 |
+ 0.216 |
+ 0.177 |
+ 0.228 |
+ 0.228 |
+ 0.233 |
+ 0.198 |
+ 0.211 |
+ 0.207 |
+ 0.297 |
+ 0.177 |
+ 2315 |
+ Activity |
+ KCNE1_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNH2_HUMAN_Kozek_2020 |
+ 0.000 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.100 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.200 |
+ 0.050 |
+ 0.100 |
+ 0.100 |
+ 0.100 |
+ 0.100 |
+ 0.050 |
+ 0.050 |
+ 0.050 |
+ 0.050 |
+ 0.050 |
+ 0.200 |
+ 0.350 |
+ 0.050 |
+ 0.050 |
+ 0.050 |
+ 0.100 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.000 |
+ 0.100 |
+ 0.000 |
+ 0.050 |
+ 0.050 |
+ 0.000 |
+ 0.000 |
+ 0.050 |
+ 0.050 |
+ 0.150 |
+ 0.100 |
+ 0.100 |
+ 0.150 |
+ 0.150 |
+ 0.100 |
+ 0.150 |
+ 0.100 |
+ 0.150 |
+ 0.150 |
+ 0.000 |
+ 0.100 |
+ 200 |
+ Activity |
+ KCNH2_HUMAN |
+ Medium |
+ Human |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_function |
+ 0.146 |
+ 0.156 |
+ 0.136 |
+ 0.155 |
+ 0.162 |
+ 0.158 |
+ 0.108 |
+ 0.125 |
+ 0.139 |
+ 0.136 |
+ 0.166 |
+ 0.154 |
+ 0.156 |
+ 0.106 |
+ 0.149 |
+ 0.166 |
+ 0.168 |
+ 0.164 |
+ 0.161 |
+ 0.129 |
+ 0.162 |
+ 0.161 |
+ 0.149 |
+ 0.128 |
+ 0.161 |
+ 0.158 |
+ 0.154 |
+ 0.155 |
+ 0.148 |
+ 0.149 |
+ 0.141 |
+ 0.131 |
+ 0.102 |
+ 0.159 |
+ 0.165 |
+ 0.138 |
+ 0.156 |
+ 0.158 |
+ 0.145 |
+ 0.154 |
+ 0.158 |
+ 0.146 |
+ 0.129 |
+ 0.113 |
+ 0.136 |
+ 0.155 |
+ 0.142 |
+ 0.143 |
+ 0.149 |
+ 0.113 |
+ 0.158 |
+ 0.162 |
+ 0.166 |
+ 0.158 |
+ 0.162 |
+ 0.152 |
+ 0.148 |
+ 0.171 |
+ 0.161 |
+ 0.164 |
+ 0.136 |
+ 0.159 |
+ 6963 |
+ Activity |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KCNJ2_MOUSE_Coyote-Maestas_2022_surface |
+ 0.149 |
+ 0.108 |
+ 0.136 |
+ 0.127 |
+ 0.120 |
+ 0.132 |
+ 0.104 |
+ 0.072 |
+ 0.104 |
+ 0.107 |
+ 0.152 |
+ 0.146 |
+ 0.117 |
+ 0.152 |
+ 0.188 |
+ 0.208 |
+ 0.185 |
+ 0.159 |
+ 0.189 |
+ 0.077 |
+ 0.175 |
+ 0.092 |
+ 0.075 |
+ 0.082 |
+ 0.185 |
+ 0.094 |
+ 0.085 |
+ 0.139 |
+ 0.078 |
+ 0.100 |
+ 0.085 |
+ 0.091 |
+ 0.077 |
+ 0.162 |
+ 0.072 |
+ 0.085 |
+ 0.132 |
+ 0.095 |
+ 0.078 |
+ 0.130 |
+ 0.105 |
+ 0.100 |
+ 0.175 |
+ 0.127 |
+ 0.098 |
+ 0.189 |
+ 0.153 |
+ 0.087 |
+ 0.129 |
+ 0.126 |
+ 0.175 |
+ 0.156 |
+ 0.182 |
+ 0.156 |
+ 0.184 |
+ 0.162 |
+ 0.171 |
+ 0.169 |
+ 0.175 |
+ 0.169 |
+ 0.169 |
+ 0.192 |
+ 6917 |
+ Expression |
+ KCNJ2_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ KKA2_KLEPN_Melnikov_2014 |
+ 0.159 |
+ 0.220 |
+ 0.216 |
+ 0.240 |
+ 0.222 |
+ 0.224 |
+ 0.107 |
+ 0.169 |
+ 0.208 |
+ 0.238 |
+ 0.212 |
+ 0.190 |
+ 0.226 |
+ 0.113 |
+ 0.123 |
+ 0.169 |
+ 0.226 |
+ 0.240 |
+ 0.238 |
+ 0.200 |
+ 0.151 |
+ 0.188 |
+ 0.220 |
+ 0.216 |
+ 0.165 |
+ 0.204 |
+ 0.228 |
+ 0.188 |
+ 0.242 |
+ 0.218 |
+ 0.169 |
+ 0.149 |
+ 0.077 |
+ 0.125 |
+ 0.167 |
+ 0.194 |
+ 0.185 |
+ 0.190 |
+ 0.210 |
+ 0.232 |
+ 0.230 |
+ 0.236 |
+ 0.123 |
+ 0.101 |
+ 0.202 |
+ 0.143 |
+ 0.220 |
+ 0.230 |
+ 0.228 |
+ 0.171 |
+ 0.224 |
+ 0.214 |
+ 0.228 |
+ 0.216 |
+ 0.216 |
+ 0.222 |
+ 0.220 |
+ 0.208 |
+ 0.216 |
+ 0.226 |
+ 0.236 |
+ 0.181 |
+ 4960 |
+ OrganismalFitness |
+ KKA2_KLEPN |
+ High |
+ Prokaryote |
+
+
+ LGK_LIPST_Klesmith_2015 |
+ 0.217 |
+ 0.432 |
+ 0.426 |
+ 0.427 |
+ 0.446 |
+ 0.447 |
+ 0.133 |
+ 0.347 |
+ 0.272 |
+ 0.406 |
+ 0.251 |
+ 0.351 |
+ 0.388 |
+ 0.119 |
+ 0.174 |
+ 0.227 |
+ 0.324 |
+ 0.404 |
+ 0.435 |
+ 0.326 |
+ 0.161 |
+ 0.241 |
+ 0.298 |
+ 0.319 |
+ 0.185 |
+ 0.328 |
+ 0.406 |
+ 0.305 |
+ 0.445 |
+ 0.493 |
+ 0.371 |
+ 0.319 |
+ 0.139 |
+ 0.171 |
+ 0.245 |
+ 0.470 |
+ 0.214 |
+ 0.246 |
+ 0.444 |
+ 0.436 |
+ 0.447 |
+ 0.470 |
+ 0.105 |
+ 0.085 |
+ 0.283 |
+ 0.175 |
+ 0.231 |
+ 0.361 |
+ 0.376 |
+ 0.158 |
+ 0.305 |
+ 0.347 |
+ 0.337 |
+ 0.354 |
+ 0.337 |
+ 0.347 |
+ 0.323 |
+ 0.335 |
+ 0.323 |
+ 0.352 |
+ 0.269 |
+ 0.190 |
+ 7890 |
+ Activity |
+ LGK_LIPST |
+ Medium |
+ Eukaryote |
+
+
+ LYAM1_HUMAN_Elazar_2016 |
+ 0.139 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.222 |
+ 0.250 |
+ 0.278 |
+ 0.111 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.222 |
+ 0.222 |
+ 0.250 |
+ 0.278 |
+ 0.222 |
+ 0.194 |
+ 0.167 |
+ 0.278 |
+ 0.250 |
+ 0.333 |
+ 0.333 |
+ 0.278 |
+ 0.250 |
+ 0.222 |
+ 0.250 |
+ 0.194 |
+ 0.250 |
+ 0.083 |
+ 0.222 |
+ 0.194 |
+ 0.056 |
+ 0.194 |
+ 0.278 |
+ 0.333 |
+ 0.222 |
+ 0.194 |
+ 0.333 |
+ 0.222 |
+ 0.278 |
+ 0.278 |
+ 0.250 |
+ 0.250 |
+ 0.278 |
+ 0.194 |
+ 0.194 |
+ 0.278 |
+ 0.222 |
+ 0.194 |
+ 0.194 |
+ 0.194 |
+ 0.306 |
+ 0.333 |
+ 0.250 |
+ 0.250 |
+ 0.250 |
+ 0.278 |
+ 0.278 |
+ 0.139 |
+ 0.278 |
+ 0.306 |
+ 0.306 |
+ 359 |
+ Expression |
+ LYAM1_HUMAN |
+ Medium |
+ Human |
+
+
+ MAFG_MOUSE_Tsuboyama_2023_1K1V |
+ 0.315 |
+ 0.315 |
+ 0.350 |
+ 0.350 |
+ 0.343 |
+ 0.336 |
+ 0.343 |
+ 0.245 |
+ 0.287 |
+ 0.280 |
+ 0.294 |
+ 0.322 |
+ 0.406 |
+ 0.385 |
+ 0.378 |
+ 0.378 |
+ 0.413 |
+ 0.455 |
+ 0.273 |
+ 0.231 |
+ 0.224 |
+ 0.217 |
+ 0.259 |
+ 0.301 |
+ 0.280 |
+ 0.273 |
+ 0.294 |
+ 0.273 |
+ 0.273 |
+ 0.252 |
+ 0.210 |
+ 0.217 |
+ 0.161 |
+ 0.231 |
+ 0.301 |
+ 0.231 |
+ 0.301 |
+ 0.315 |
+ 0.294 |
+ 0.343 |
+ 0.315 |
+ 0.322 |
+ 0.385 |
+ 0.259 |
+ 0.350 |
+ 0.357 |
+ 0.413 |
+ 0.315 |
+ 0.364 |
+ 0.385 |
+ 0.413 |
+ 0.441 |
+ 0.448 |
+ 0.469 |
+ 0.427 |
+ 0.476 |
+ 0.462 |
+ 0.462 |
+ 0.434 |
+ 0.462 |
+ 0.462 |
+ 0.483 |
+ 1429 |
+ Stability |
+ MAFG_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ MBD11_ARATH_Tsuboyama_2023_6ACV |
+ 0.288 |
+ 0.382 |
+ 0.330 |
+ 0.330 |
+ 0.335 |
+ 0.335 |
+ 0.057 |
+ 0.151 |
+ 0.274 |
+ 0.302 |
+ 0.264 |
+ 0.255 |
+ 0.226 |
+ 0.028 |
+ 0.028 |
+ 0.264 |
+ 0.259 |
+ 0.264 |
+ 0.377 |
+ 0.241 |
+ 0.024 |
+ 0.061 |
+ 0.193 |
+ 0.179 |
+ 0.094 |
+ 0.264 |
+ 0.274 |
+ 0.236 |
+ 0.288 |
+ 0.274 |
+ 0.325 |
+ 0.297 |
+ 0.316 |
+ 0.028 |
+ 0.042 |
+ 0.047 |
+ 0.269 |
+ 0.264 |
+ 0.283 |
+ 0.316 |
+ 0.330 |
+ 0.316 |
+ 0.090 |
+ 0.057 |
+ 0.307 |
+ 0.250 |
+ 0.288 |
+ 0.335 |
+ 0.288 |
+ 0.288 |
+ 0.340 |
+ 0.387 |
+ 0.344 |
+ 0.344 |
+ 0.316 |
+ 0.335 |
+ 0.330 |
+ 0.311 |
+ 0.274 |
+ 0.330 |
+ 0.382 |
+ 0.264 |
+ 2116 |
+ Stability |
+ MBD11_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ MET_HUMAN_Estevam_2023 |
+ 0.148 |
+ 0.185 |
+ 0.163 |
+ 0.174 |
+ 0.163 |
+ 0.178 |
+ 0.163 |
+ 0.174 |
+ 0.156 |
+ 0.167 |
+ 0.191 |
+ 0.178 |
+ 0.174 |
+ 0.143 |
+ 0.148 |
+ 0.172 |
+ 0.183 |
+ 0.172 |
+ 0.187 |
+ 0.163 |
+ 0.185 |
+ 0.187 |
+ 0.159 |
+ 0.193 |
+ 0.193 |
+ 0.181 |
+ 0.183 |
+ 0.191 |
+ 0.200 |
+ 0.193 |
+ 0.170 |
+ 0.172 |
+ 0.120 |
+ 0.161 |
+ 0.178 |
+ 0.196 |
+ 0.167 |
+ 0.174 |
+ 0.198 |
+ 0.176 |
+ 0.170 |
+ 0.176 |
+ 0.170 |
+ 0.139 |
+ 0.187 |
+ 0.178 |
+ 0.150 |
+ 0.183 |
+ 0.169 |
+ 0.119 |
+ 0.161 |
+ 0.163 |
+ 0.165 |
+ 0.167 |
+ 0.156 |
+ 0.176 |
+ 0.172 |
+ 0.169 |
+ 0.187 |
+ 0.159 |
+ 0.185 |
+ 0.167 |
+ 5393 |
+ Activity |
+ MET_HUMAN |
+ Medium |
+ Human |
+
+
+ MK01_HUMAN_Brenan_2016 |
+ 0.091 |
+ 0.134 |
+ 0.134 |
+ 0.144 |
+ 0.138 |
+ 0.122 |
+ 0.084 |
+ 0.075 |
+ 0.116 |
+ 0.095 |
+ 0.076 |
+ 0.119 |
+ 0.107 |
+ 0.104 |
+ 0.103 |
+ 0.090 |
+ 0.072 |
+ 0.085 |
+ 0.098 |
+ 0.097 |
+ 0.101 |
+ 0.094 |
+ 0.082 |
+ 0.090 |
+ 0.095 |
+ 0.109 |
+ 0.103 |
+ 0.109 |
+ 0.101 |
+ 0.122 |
+ 0.082 |
+ 0.095 |
+ 0.109 |
+ 0.087 |
+ 0.101 |
+ 0.090 |
+ 0.078 |
+ 0.098 |
+ 0.088 |
+ 0.125 |
+ 0.125 |
+ 0.106 |
+ 0.097 |
+ 0.095 |
+ 0.113 |
+ 0.120 |
+ 0.060 |
+ 0.109 |
+ 0.070 |
+ 0.097 |
+ 0.084 |
+ 0.104 |
+ 0.095 |
+ 0.088 |
+ 0.085 |
+ 0.079 |
+ 0.090 |
+ 0.081 |
+ 0.081 |
+ 0.088 |
+ 0.098 |
+ 0.104 |
+ 6809 |
+ OrganismalFitness |
+ MK01_HUMAN |
+ Medium |
+ Human |
+
+
+ MLAC_ECOLI_MacRae_2023 |
+ 0.095 |
+ 0.145 |
+ 0.147 |
+ 0.137 |
+ 0.132 |
+ 0.142 |
+ 0.115 |
+ 0.140 |
+ 0.162 |
+ 0.150 |
+ 0.152 |
+ 0.155 |
+ 0.177 |
+ 0.120 |
+ 0.112 |
+ 0.112 |
+ 0.137 |
+ 0.152 |
+ 0.180 |
+ 0.167 |
+ 0.142 |
+ 0.140 |
+ 0.145 |
+ 0.150 |
+ 0.147 |
+ 0.150 |
+ 0.177 |
+ 0.160 |
+ 0.145 |
+ 0.157 |
+ 0.120 |
+ 0.140 |
+ 0.150 |
+ 0.112 |
+ 0.117 |
+ 0.135 |
+ 0.090 |
+ 0.122 |
+ 0.115 |
+ 0.127 |
+ 0.142 |
+ 0.137 |
+ 0.100 |
+ 0.087 |
+ 0.140 |
+ 0.110 |
+ 0.057 |
+ 0.115 |
+ 0.115 |
+ 0.067 |
+ 0.120 |
+ 0.112 |
+ 0.117 |
+ 0.117 |
+ 0.130 |
+ 0.112 |
+ 0.112 |
+ 0.112 |
+ 0.115 |
+ 0.110 |
+ 0.125 |
+ 0.097 |
+ 4007 |
+ OrganismalFitness |
+ MLAC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ MSH2_HUMAN_Jia_2020 |
+ 0.133 |
+ 0.162 |
+ 0.143 |
+ 0.147 |
+ 0.147 |
+ 0.149 |
+ 0.100 |
+ 0.100 |
+ 0.156 |
+ 0.153 |
+ 0.146 |
+ 0.164 |
+ 0.164 |
+ 0.105 |
+ 0.131 |
+ 0.153 |
+ 0.152 |
+ 0.159 |
+ 0.146 |
+ 0.152 |
+ 0.114 |
+ 0.127 |
+ 0.137 |
+ 0.146 |
+ 0.138 |
+ 0.144 |
+ 0.146 |
+ 0.134 |
+ 0.162 |
+ 0.143 |
+ 0.147 |
+ 0.135 |
+ 0.132 |
+ 0.128 |
+ 0.153 |
+ 0.155 |
+ 0.134 |
+ 0.153 |
+ 0.152 |
+ 0.159 |
+ 0.167 |
+ 0.157 |
+ 0.122 |
+ 0.093 |
+ 0.156 |
+ 0.134 |
+ 0.130 |
+ 0.152 |
+ 0.101 |
+ 0.118 |
+ 0.149 |
+ 0.144 |
+ 0.150 |
+ 0.142 |
+ 0.148 |
+ 0.159 |
+ 0.154 |
+ 0.153 |
+ 0.154 |
+ 0.158 |
+ 0.154 |
+ 0.134 |
+ 16749 |
+ OrganismalFitness |
+ MSH2_HUMAN |
+ Medium |
+ Human |
+
+
+ MTH3_HAEAE_RockahShmuel_2015 |
+ 0.144 |
+ 0.316 |
+ 0.337 |
+ 0.326 |
+ 0.321 |
+ 0.342 |
+ 0.134 |
+ 0.337 |
+ 0.294 |
+ 0.299 |
+ 0.203 |
+ 0.289 |
+ 0.316 |
+ 0.102 |
+ 0.128 |
+ 0.155 |
+ 0.176 |
+ 0.182 |
+ 0.230 |
+ 0.283 |
+ 0.139 |
+ 0.160 |
+ 0.235 |
+ 0.299 |
+ 0.203 |
+ 0.321 |
+ 0.326 |
+ 0.283 |
+ 0.305 |
+ 0.321 |
+ 0.332 |
+ 0.305 |
+ 0.193 |
+ 0.171 |
+ 0.182 |
+ 0.299 |
+ 0.160 |
+ 0.160 |
+ 0.278 |
+ 0.342 |
+ 0.342 |
+ 0.337 |
+ 0.128 |
+ 0.053 |
+ 0.155 |
+ 0.123 |
+ 0.166 |
+ 0.235 |
+ 0.262 |
+ 0.134 |
+ 0.214 |
+ 0.225 |
+ 0.225 |
+ 0.198 |
+ 0.203 |
+ 0.209 |
+ 0.219 |
+ 0.219 |
+ 0.209 |
+ 0.214 |
+ 0.209 |
+ 0.107 |
+ 1777 |
+ OrganismalFitness |
+ MTH3_HAEAE |
+ Medium |
+ Prokaryote |
+
+
+ MTHR_HUMAN_Weile_2021 |
+ 0.122 |
+ 0.123 |
+ 0.113 |
+ 0.123 |
+ 0.113 |
+ 0.108 |
+ 0.096 |
+ 0.121 |
+ 0.133 |
+ 0.138 |
+ 0.123 |
+ 0.126 |
+ 0.126 |
+ 0.118 |
+ 0.124 |
+ 0.135 |
+ 0.116 |
+ 0.132 |
+ 0.138 |
+ 0.122 |
+ 0.121 |
+ 0.118 |
+ 0.126 |
+ 0.149 |
+ 0.130 |
+ 0.120 |
+ 0.131 |
+ 0.119 |
+ 0.162 |
+ 0.137 |
+ 0.139 |
+ 0.148 |
+ 0.075 |
+ 0.127 |
+ 0.126 |
+ 0.127 |
+ 0.110 |
+ 0.118 |
+ 0.125 |
+ 0.109 |
+ 0.116 |
+ 0.123 |
+ 0.122 |
+ 0.110 |
+ 0.123 |
+ 0.126 |
+ 0.112 |
+ 0.122 |
+ 0.127 |
+ 0.118 |
+ 0.120 |
+ 0.115 |
+ 0.121 |
+ 0.118 |
+ 0.117 |
+ 0.118 |
+ 0.120 |
+ 0.118 |
+ 0.126 |
+ 0.117 |
+ 0.132 |
+ 0.116 |
+ 12464 |
+ OrganismalFitness |
+ MTHR_HUMAN |
+ Low |
+ Human |
+
+
+ MYO3_YEAST_Tsuboyama_2023_2BTT |
+ 0.227 |
+ 0.206 |
+ 0.276 |
+ 0.212 |
+ 0.236 |
+ 0.245 |
+ 0.073 |
+ 0.061 |
+ 0.297 |
+ 0.285 |
+ 0.294 |
+ 0.261 |
+ 0.264 |
+ 0.109 |
+ 0.230 |
+ 0.355 |
+ 0.421 |
+ 0.318 |
+ 0.179 |
+ 0.233 |
+ 0.139 |
+ 0.091 |
+ 0.164 |
+ 0.155 |
+ 0.188 |
+ 0.261 |
+ 0.255 |
+ 0.227 |
+ 0.291 |
+ 0.209 |
+ 0.209 |
+ 0.167 |
+ 0.170 |
+ 0.212 |
+ 0.139 |
+ 0.127 |
+ 0.291 |
+ 0.221 |
+ 0.200 |
+ 0.276 |
+ 0.255 |
+ 0.261 |
+ 0.194 |
+ 0.073 |
+ 0.270 |
+ 0.200 |
+ 0.248 |
+ 0.315 |
+ 0.339 |
+ 0.270 |
+ 0.303 |
+ 0.306 |
+ 0.297 |
+ 0.312 |
+ 0.303 |
+ 0.306 |
+ 0.291 |
+ 0.303 |
+ 0.300 |
+ 0.303 |
+ 0.297 |
+ 0.348 |
+ 3297 |
+ Stability |
+ MYO3_YEAST |
+ High |
+ Eukaryote |
+
+
+ NCAP_I34A1_Doud_2015 |
+ 0.258 |
+ 0.246 |
+ 0.246 |
+ 0.247 |
+ 0.259 |
+ 0.263 |
+ 0.111 |
+ 0.220 |
+ 0.266 |
+ 0.261 |
+ 0.106 |
+ 0.099 |
+ 0.094 |
+ 0.101 |
+ 0.093 |
+ 0.096 |
+ 0.099 |
+ 0.102 |
+ 0.129 |
+ 0.182 |
+ 0.235 |
+ 0.264 |
+ 0.264 |
+ 0.263 |
+ 0.098 |
+ 0.111 |
+ 0.109 |
+ 0.107 |
+ 0.242 |
+ 0.268 |
+ 0.199 |
+ 0.194 |
+ 0.148 |
+ 0.249 |
+ 0.252 |
+ 0.285 |
+ 0.260 |
+ 0.271 |
+ 0.270 |
+ 0.280 |
+ 0.282 |
+ 0.278 |
+ 0.094 |
+ 0.099 |
+ 0.106 |
+ 0.093 |
+ 0.161 |
+ 0.166 |
+ 0.178 |
+ 0.144 |
+ 0.103 |
+ 0.118 |
+ 0.127 |
+ 0.122 |
+ 0.114 |
+ 0.114 |
+ 0.115 |
+ 0.124 |
+ 0.102 |
+ 0.115 |
+ 0.139 |
+ 0.130 |
+ 9462 |
+ OrganismalFitness |
+ NCAP_I34A1 |
+ Medium |
+ Virus |
+
+
+ NKX31_HUMAN_Tsuboyama_2023_2L9R |
+ 0.434 |
+ 0.426 |
+ 0.434 |
+ 0.454 |
+ 0.442 |
+ 0.474 |
+ 0.353 |
+ 0.361 |
+ 0.378 |
+ 0.406 |
+ 0.305 |
+ 0.398 |
+ 0.369 |
+ 0.402 |
+ 0.394 |
+ 0.478 |
+ 0.510 |
+ 0.486 |
+ 0.458 |
+ 0.406 |
+ 0.386 |
+ 0.365 |
+ 0.329 |
+ 0.329 |
+ 0.378 |
+ 0.317 |
+ 0.357 |
+ 0.301 |
+ 0.321 |
+ 0.373 |
+ 0.349 |
+ 0.369 |
+ 0.305 |
+ 0.390 |
+ 0.386 |
+ 0.337 |
+ 0.430 |
+ 0.402 |
+ 0.402 |
+ 0.442 |
+ 0.446 |
+ 0.454 |
+ 0.382 |
+ 0.406 |
+ 0.321 |
+ 0.345 |
+ 0.386 |
+ 0.341 |
+ 0.418 |
+ 0.418 |
+ 0.482 |
+ 0.426 |
+ 0.474 |
+ 0.450 |
+ 0.482 |
+ 0.478 |
+ 0.454 |
+ 0.494 |
+ 0.438 |
+ 0.466 |
+ 0.349 |
+ 0.410 |
+ 2482 |
+ Stability |
+ NKX31_HUMAN |
+ High |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_HEK293T |
+ 0.391 |
+ 0.391 |
+ 0.406 |
+ 0.422 |
+ 0.453 |
+ 0.453 |
+ 0.141 |
+ 0.281 |
+ 0.391 |
+ 0.453 |
+ 0.344 |
+ 0.172 |
+ 0.219 |
+ 0.156 |
+ 0.219 |
+ 0.297 |
+ 0.438 |
+ 0.391 |
+ 0.422 |
+ 0.125 |
+ 0.203 |
+ 0.234 |
+ 0.141 |
+ 0.297 |
+ 0.266 |
+ 0.188 |
+ 0.406 |
+ 0.266 |
+ 0.219 |
+ 0.359 |
+ 0.281 |
+ 0.219 |
+ 0.078 |
+ 0.156 |
+ 0.281 |
+ 0.375 |
+ 0.359 |
+ 0.422 |
+ 0.422 |
+ 0.406 |
+ 0.422 |
+ 0.453 |
+ 0.156 |
+ 0.125 |
+ 0.375 |
+ 0.281 |
+ 0.344 |
+ 0.375 |
+ 0.062 |
+ 0.125 |
+ 0.375 |
+ 0.422 |
+ 0.391 |
+ 0.375 |
+ 0.344 |
+ 0.359 |
+ 0.344 |
+ 0.359 |
+ 0.359 |
+ 0.391 |
+ 0.391 |
+ 0.266 |
+ 637 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NPC1_HUMAN_Erwood_2022_RPE1 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.286 |
+ 0.286 |
+ 0.429 |
+ 0.714 |
+ 0.286 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.286 |
+ 0.286 |
+ 0.429 |
+ 0.000 |
+ 0.429 |
+ 0.143 |
+ 0.286 |
+ 0.571 |
+ 0.571 |
+ 0.286 |
+ 0.143 |
+ 0.286 |
+ 0.429 |
+ 0.571 |
+ 0.571 |
+ 0.429 |
+ 0.429 |
+ 0.286 |
+ 0.000 |
+ 0.286 |
+ 0.429 |
+ 0.286 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.429 |
+ 0.286 |
+ 0.286 |
+ 0.286 |
+ 0.000 |
+ 0.286 |
+ 0.000 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.286 |
+ 0.000 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.286 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 63 |
+ Activity |
+ NPC1_HUMAN |
+ Low |
+ Human |
+
+
+ NRAM_I33A0_Jiang_2016 |
+ 0.433 |
+ 0.567 |
+ 0.467 |
+ 0.433 |
+ 0.567 |
+ 0.567 |
+ 0.067 |
+ 0.600 |
+ 0.667 |
+ 0.700 |
+ 0.067 |
+ 0.133 |
+ 0.600 |
+ 0.000 |
+ 0.000 |
+ 0.067 |
+ 0.233 |
+ 0.567 |
+ 0.500 |
+ 0.400 |
+ 0.700 |
+ 0.667 |
+ 0.533 |
+ 0.533 |
+ 0.000 |
+ 0.433 |
+ 0.567 |
+ 0.400 |
+ 0.633 |
+ 0.567 |
+ 0.400 |
+ 0.400 |
+ 0.067 |
+ 0.600 |
+ 0.533 |
+ 0.600 |
+ 0.533 |
+ 0.567 |
+ 0.600 |
+ 0.633 |
+ 0.600 |
+ 0.567 |
+ 0.000 |
+ 0.033 |
+ 0.000 |
+ 0.033 |
+ 0.267 |
+ 0.200 |
+ 0.400 |
+ 0.233 |
+ 0.167 |
+ 0.200 |
+ 0.300 |
+ 0.233 |
+ 0.300 |
+ 0.400 |
+ 0.267 |
+ 0.267 |
+ 0.333 |
+ 0.333 |
+ 0.233 |
+ 0.033 |
+ 298 |
+ OrganismalFitness |
+ NRAM_I33A0 |
+ Low |
+ Virus |
+
+
+ NUD15_HUMAN_Suiter_2020 |
+ 0.193 |
+ 0.260 |
+ 0.281 |
+ 0.274 |
+ 0.274 |
+ 0.284 |
+ 0.053 |
+ 0.179 |
+ 0.253 |
+ 0.267 |
+ 0.256 |
+ 0.207 |
+ 0.284 |
+ 0.105 |
+ 0.126 |
+ 0.140 |
+ 0.193 |
+ 0.239 |
+ 0.263 |
+ 0.204 |
+ 0.102 |
+ 0.130 |
+ 0.249 |
+ 0.253 |
+ 0.133 |
+ 0.298 |
+ 0.267 |
+ 0.239 |
+ 0.312 |
+ 0.309 |
+ 0.267 |
+ 0.211 |
+ 0.098 |
+ 0.123 |
+ 0.119 |
+ 0.305 |
+ 0.196 |
+ 0.204 |
+ 0.302 |
+ 0.274 |
+ 0.288 |
+ 0.309 |
+ 0.116 |
+ 0.070 |
+ 0.284 |
+ 0.140 |
+ 0.196 |
+ 0.330 |
+ 0.211 |
+ 0.168 |
+ 0.214 |
+ 0.249 |
+ 0.211 |
+ 0.228 |
+ 0.239 |
+ 0.263 |
+ 0.239 |
+ 0.256 |
+ 0.246 |
+ 0.260 |
+ 0.298 |
+ 0.200 |
+ 2844 |
+ Expression |
+ NUD15_HUMAN |
+ High |
+ Human |
+
+
+ NUSA_ECOLI_Tsuboyama_2023_1WCL |
+ 0.123 |
+ 0.256 |
+ 0.163 |
+ 0.167 |
+ 0.172 |
+ 0.172 |
+ 0.133 |
+ 0.128 |
+ 0.202 |
+ 0.138 |
+ 0.177 |
+ 0.030 |
+ 0.059 |
+ 0.103 |
+ 0.103 |
+ 0.084 |
+ 0.148 |
+ 0.167 |
+ 0.217 |
+ 0.163 |
+ 0.172 |
+ 0.202 |
+ 0.197 |
+ 0.217 |
+ 0.094 |
+ 0.143 |
+ 0.113 |
+ 0.207 |
+ 0.251 |
+ 0.138 |
+ 0.217 |
+ 0.217 |
+ 0.074 |
+ 0.163 |
+ 0.143 |
+ 0.148 |
+ 0.158 |
+ 0.118 |
+ 0.143 |
+ 0.172 |
+ 0.163 |
+ 0.192 |
+ 0.172 |
+ 0.138 |
+ 0.103 |
+ 0.133 |
+ 0.374 |
+ 0.305 |
+ 0.276 |
+ 0.345 |
+ 0.246 |
+ 0.291 |
+ 0.266 |
+ 0.261 |
+ 0.286 |
+ 0.246 |
+ 0.246 |
+ 0.246 |
+ 0.241 |
+ 0.261 |
+ 0.271 |
+ 0.158 |
+ 2028 |
+ Stability |
+ NUSA_ECOLI |
+ High |
+ Prokaryote |
+
+
+ NUSG_MYCTU_Tsuboyama_2023_2MI6 |
+ 0.283 |
+ 0.217 |
+ 0.225 |
+ 0.246 |
+ 0.239 |
+ 0.246 |
+ 0.159 |
+ 0.181 |
+ 0.283 |
+ 0.225 |
+ 0.232 |
+ 0.254 |
+ 0.246 |
+ 0.123 |
+ 0.261 |
+ 0.254 |
+ 0.232 |
+ 0.217 |
+ 0.217 |
+ 0.138 |
+ 0.116 |
+ 0.152 |
+ 0.152 |
+ 0.152 |
+ 0.254 |
+ 0.181 |
+ 0.167 |
+ 0.167 |
+ 0.174 |
+ 0.246 |
+ 0.232 |
+ 0.196 |
+ 0.217 |
+ 0.167 |
+ 0.145 |
+ 0.181 |
+ 0.210 |
+ 0.203 |
+ 0.210 |
+ 0.196 |
+ 0.188 |
+ 0.210 |
+ 0.217 |
+ 0.080 |
+ 0.188 |
+ 0.297 |
+ 0.254 |
+ 0.210 |
+ 0.370 |
+ 0.319 |
+ 0.283 |
+ 0.341 |
+ 0.312 |
+ 0.275 |
+ 0.290 |
+ 0.290 |
+ 0.304 |
+ 0.319 |
+ 0.268 |
+ 0.297 |
+ 0.188 |
+ 0.275 |
+ 1380 |
+ Stability |
+ NUSG_MYCTU |
+ High |
+ Prokaryote |
+
+
+ OBSCN_HUMAN_Tsuboyama_2023_1V1C |
+ 0.372 |
+ 0.484 |
+ 0.547 |
+ 0.559 |
+ 0.575 |
+ 0.541 |
+ 0.119 |
+ 0.378 |
+ 0.500 |
+ 0.494 |
+ 0.509 |
+ 0.469 |
+ 0.478 |
+ 0.131 |
+ 0.453 |
+ 0.497 |
+ 0.569 |
+ 0.519 |
+ 0.512 |
+ 0.550 |
+ 0.447 |
+ 0.434 |
+ 0.431 |
+ 0.428 |
+ 0.438 |
+ 0.416 |
+ 0.450 |
+ 0.412 |
+ 0.403 |
+ 0.494 |
+ 0.497 |
+ 0.509 |
+ 0.328 |
+ 0.169 |
+ 0.122 |
+ 0.319 |
+ 0.466 |
+ 0.475 |
+ 0.491 |
+ 0.538 |
+ 0.534 |
+ 0.531 |
+ 0.381 |
+ 0.144 |
+ 0.406 |
+ 0.431 |
+ 0.562 |
+ 0.438 |
+ 0.669 |
+ 0.578 |
+ 0.578 |
+ 0.591 |
+ 0.588 |
+ 0.600 |
+ 0.606 |
+ 0.603 |
+ 0.553 |
+ 0.578 |
+ 0.600 |
+ 0.609 |
+ 0.631 |
+ 0.500 |
+ 3197 |
+ Stability |
+ OBSCN_HUMAN |
+ High |
+ Human |
+
+
+ ODP2_GEOSE_Tsuboyama_2023_1W4G |
+ 0.184 |
+ 0.132 |
+ 0.237 |
+ 0.167 |
+ 0.246 |
+ 0.132 |
+ 0.193 |
+ 0.149 |
+ 0.114 |
+ 0.114 |
+ 0.193 |
+ 0.184 |
+ 0.184 |
+ 0.175 |
+ 0.158 |
+ 0.211 |
+ 0.175 |
+ 0.184 |
+ 0.167 |
+ 0.193 |
+ 0.105 |
+ 0.114 |
+ 0.184 |
+ 0.096 |
+ 0.088 |
+ 0.114 |
+ 0.167 |
+ 0.105 |
+ 0.158 |
+ 0.202 |
+ 0.123 |
+ 0.096 |
+ 0.123 |
+ 0.167 |
+ 0.044 |
+ 0.167 |
+ 0.158 |
+ 0.044 |
+ 0.079 |
+ 0.158 |
+ 0.114 |
+ 0.132 |
+ 0.175 |
+ 0.175 |
+ 0.184 |
+ 0.184 |
+ 0.237 |
+ 0.193 |
+ 0.246 |
+ 0.307 |
+ 0.228 |
+ 0.211 |
+ 0.246 |
+ 0.254 |
+ 0.263 |
+ 0.237 |
+ 0.263 |
+ 0.263 |
+ 0.228 |
+ 0.246 |
+ 0.202 |
+ 0.193 |
+ 1134 |
+ Stability |
+ ODP2_GEOSE |
+ High |
+ Prokaryote |
+
+
+ OPSD_HUMAN_Wan_2019 |
+ 0.176 |
+ 0.412 |
+ 0.412 |
+ 0.412 |
+ 0.471 |
+ 0.412 |
+ 0.176 |
+ 0.412 |
+ 0.294 |
+ 0.412 |
+ 0.353 |
+ 0.294 |
+ 0.294 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.294 |
+ 0.412 |
+ 0.176 |
+ 0.412 |
+ 0.412 |
+ 0.353 |
+ 0.235 |
+ 0.294 |
+ 0.471 |
+ 0.353 |
+ 0.353 |
+ 0.412 |
+ 0.235 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.000 |
+ 0.353 |
+ 0.235 |
+ 0.118 |
+ 0.412 |
+ 0.294 |
+ 0.294 |
+ 0.412 |
+ 0.412 |
+ 0.412 |
+ 0.353 |
+ 0.059 |
+ 0.353 |
+ 0.412 |
+ 0.176 |
+ 0.353 |
+ 0.176 |
+ 0.176 |
+ 0.235 |
+ 0.235 |
+ 0.176 |
+ 0.176 |
+ 0.176 |
+ 0.294 |
+ 0.235 |
+ 0.235 |
+ 0.235 |
+ 0.176 |
+ 0.412 |
+ 0.294 |
+ 165 |
+ Expression |
+ OPSD_HUMAN |
+ High |
+ Human |
+
+
+ OTC_HUMAN_Lo_2023 |
+ 0.316 |
+ 0.342 |
+ 0.297 |
+ 0.291 |
+ 0.310 |
+ 0.304 |
+ 0.139 |
+ 0.152 |
+ 0.316 |
+ 0.297 |
+ 0.291 |
+ 0.323 |
+ 0.335 |
+ 0.158 |
+ 0.291 |
+ 0.342 |
+ 0.335 |
+ 0.297 |
+ 0.297 |
+ 0.304 |
+ 0.323 |
+ 0.272 |
+ 0.272 |
+ 0.278 |
+ 0.278 |
+ 0.348 |
+ 0.304 |
+ 0.297 |
+ 0.234 |
+ 0.304 |
+ 0.241 |
+ 0.171 |
+ 0.133 |
+ 0.266 |
+ 0.278 |
+ 0.278 |
+ 0.329 |
+ 0.304 |
+ 0.323 |
+ 0.310 |
+ 0.297 |
+ 0.316 |
+ 0.209 |
+ 0.127 |
+ 0.316 |
+ 0.259 |
+ 0.367 |
+ 0.304 |
+ 0.291 |
+ 0.259 |
+ 0.304 |
+ 0.348 |
+ 0.342 |
+ 0.361 |
+ 0.329 |
+ 0.348 |
+ 0.348 |
+ 0.342 |
+ 0.342 |
+ 0.342 |
+ 0.380 |
+ 0.342 |
+ 1570 |
+ Activity |
+ OTC_HUMAN |
+ Medium |
+ Human |
+
+
+ OTU7A_HUMAN_Tsuboyama_2023_2L2D |
+ 0.188 |
+ 0.266 |
+ 0.203 |
+ 0.203 |
+ 0.219 |
+ 0.203 |
+ 0.078 |
+ 0.125 |
+ 0.109 |
+ 0.141 |
+ 0.250 |
+ 0.297 |
+ 0.234 |
+ 0.109 |
+ 0.281 |
+ 0.328 |
+ 0.156 |
+ 0.172 |
+ 0.188 |
+ 0.156 |
+ 0.109 |
+ 0.062 |
+ 0.078 |
+ 0.078 |
+ 0.109 |
+ 0.062 |
+ 0.078 |
+ 0.094 |
+ 0.047 |
+ 0.031 |
+ 0.109 |
+ 0.109 |
+ 0.062 |
+ 0.141 |
+ 0.125 |
+ 0.141 |
+ 0.188 |
+ 0.156 |
+ 0.125 |
+ 0.203 |
+ 0.172 |
+ 0.188 |
+ 0.172 |
+ 0.141 |
+ 0.250 |
+ 0.344 |
+ 0.250 |
+ 0.203 |
+ 0.328 |
+ 0.281 |
+ 0.219 |
+ 0.219 |
+ 0.219 |
+ 0.188 |
+ 0.219 |
+ 0.203 |
+ 0.234 |
+ 0.172 |
+ 0.172 |
+ 0.219 |
+ 0.359 |
+ 0.328 |
+ 635 |
+ Stability |
+ OTU7A_HUMAN |
+ High |
+ Human |
+
+
+ OXDA_RHOTO_Vanella_2023_activity |
+ 0.127 |
+ 0.192 |
+ 0.231 |
+ 0.223 |
+ 0.238 |
+ 0.233 |
+ 0.114 |
+ 0.131 |
+ 0.141 |
+ 0.145 |
+ 0.181 |
+ 0.162 |
+ 0.172 |
+ 0.112 |
+ 0.114 |
+ 0.156 |
+ 0.180 |
+ 0.209 |
+ 0.217 |
+ 0.150 |
+ 0.125 |
+ 0.142 |
+ 0.156 |
+ 0.155 |
+ 0.119 |
+ 0.161 |
+ 0.181 |
+ 0.162 |
+ 0.225 |
+ 0.238 |
+ 0.191 |
+ 0.177 |
+ 0.072 |
+ 0.133 |
+ 0.131 |
+ 0.134 |
+ 0.138 |
+ 0.128 |
+ 0.136 |
+ 0.223 |
+ 0.233 |
+ 0.234 |
+ 0.122 |
+ 0.095 |
+ 0.177 |
+ 0.136 |
+ 0.195 |
+ 0.242 |
+ 0.248 |
+ 0.162 |
+ 0.184 |
+ 0.216 |
+ 0.202 |
+ 0.205 |
+ 0.197 |
+ 0.223 |
+ 0.208 |
+ 0.223 |
+ 0.194 |
+ 0.216 |
+ 0.222 |
+ 0.148 |
+ 6396 |
+ Activity |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ OXDA_RHOTO_Vanella_2023_expression |
+ 0.096 |
+ 0.140 |
+ 0.168 |
+ 0.161 |
+ 0.155 |
+ 0.164 |
+ 0.149 |
+ 0.123 |
+ 0.133 |
+ 0.120 |
+ 0.149 |
+ 0.133 |
+ 0.134 |
+ 0.096 |
+ 0.120 |
+ 0.151 |
+ 0.161 |
+ 0.173 |
+ 0.180 |
+ 0.081 |
+ 0.092 |
+ 0.145 |
+ 0.136 |
+ 0.134 |
+ 0.111 |
+ 0.162 |
+ 0.152 |
+ 0.140 |
+ 0.152 |
+ 0.177 |
+ 0.127 |
+ 0.121 |
+ 0.090 |
+ 0.108 |
+ 0.114 |
+ 0.134 |
+ 0.109 |
+ 0.090 |
+ 0.114 |
+ 0.155 |
+ 0.149 |
+ 0.158 |
+ 0.111 |
+ 0.096 |
+ 0.145 |
+ 0.114 |
+ 0.176 |
+ 0.189 |
+ 0.186 |
+ 0.146 |
+ 0.164 |
+ 0.146 |
+ 0.142 |
+ 0.139 |
+ 0.160 |
+ 0.158 |
+ 0.161 |
+ 0.160 |
+ 0.168 |
+ 0.158 |
+ 0.183 |
+ 0.157 |
+ 6769 |
+ Expression |
+ OXDA_RHOTO |
+ High |
+ Eukaryote |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Etoposide |
+ 0.099 |
+ 0.103 |
+ 0.108 |
+ 0.100 |
+ 0.107 |
+ 0.118 |
+ 0.098 |
+ 0.096 |
+ 0.079 |
+ 0.078 |
+ 0.167 |
+ 0.131 |
+ 0.158 |
+ 0.100 |
+ 0.083 |
+ 0.107 |
+ 0.111 |
+ 0.135 |
+ 0.143 |
+ 0.070 |
+ 0.115 |
+ 0.151 |
+ 0.181 |
+ 0.123 |
+ 0.116 |
+ 0.165 |
+ 0.194 |
+ 0.201 |
+ 0.096 |
+ 0.078 |
+ 0.142 |
+ 0.102 |
+ 0.118 |
+ 0.107 |
+ 0.138 |
+ 0.072 |
+ 0.103 |
+ 0.126 |
+ 0.079 |
+ 0.099 |
+ 0.104 |
+ 0.099 |
+ 0.111 |
+ 0.091 |
+ 0.154 |
+ 0.087 |
+ 0.139 |
+ 0.171 |
+ 0.130 |
+ 0.129 |
+ 0.131 |
+ 0.120 |
+ 0.118 |
+ 0.106 |
+ 0.103 |
+ 0.118 |
+ 0.114 |
+ 0.123 |
+ 0.110 |
+ 0.108 |
+ 0.124 |
+ 0.090 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_Null_Nutlin |
+ 0.084 |
+ 0.078 |
+ 0.074 |
+ 0.075 |
+ 0.080 |
+ 0.082 |
+ 0.116 |
+ 0.107 |
+ 0.060 |
+ 0.059 |
+ 0.193 |
+ 0.126 |
+ 0.174 |
+ 0.108 |
+ 0.091 |
+ 0.112 |
+ 0.114 |
+ 0.127 |
+ 0.149 |
+ 0.094 |
+ 0.119 |
+ 0.166 |
+ 0.218 |
+ 0.114 |
+ 0.112 |
+ 0.194 |
+ 0.250 |
+ 0.244 |
+ 0.099 |
+ 0.084 |
+ 0.171 |
+ 0.118 |
+ 0.131 |
+ 0.119 |
+ 0.154 |
+ 0.063 |
+ 0.106 |
+ 0.129 |
+ 0.066 |
+ 0.078 |
+ 0.087 |
+ 0.071 |
+ 0.126 |
+ 0.110 |
+ 0.167 |
+ 0.099 |
+ 0.151 |
+ 0.182 |
+ 0.163 |
+ 0.114 |
+ 0.146 |
+ 0.155 |
+ 0.126 |
+ 0.123 |
+ 0.112 |
+ 0.119 |
+ 0.127 |
+ 0.122 |
+ 0.120 |
+ 0.120 |
+ 0.146 |
+ 0.111 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Giacomelli_2018_WT_Nutlin |
+ 0.074 |
+ 0.058 |
+ 0.070 |
+ 0.062 |
+ 0.059 |
+ 0.059 |
+ 0.074 |
+ 0.054 |
+ 0.048 |
+ 0.046 |
+ 0.110 |
+ 0.092 |
+ 0.107 |
+ 0.112 |
+ 0.126 |
+ 0.111 |
+ 0.080 |
+ 0.099 |
+ 0.106 |
+ 0.035 |
+ 0.126 |
+ 0.099 |
+ 0.099 |
+ 0.070 |
+ 0.090 |
+ 0.108 |
+ 0.110 |
+ 0.112 |
+ 0.059 |
+ 0.052 |
+ 0.090 |
+ 0.075 |
+ 0.067 |
+ 0.071 |
+ 0.099 |
+ 0.058 |
+ 0.079 |
+ 0.090 |
+ 0.064 |
+ 0.058 |
+ 0.063 |
+ 0.060 |
+ 0.091 |
+ 0.096 |
+ 0.099 |
+ 0.096 |
+ 0.106 |
+ 0.100 |
+ 0.115 |
+ 0.107 |
+ 0.099 |
+ 0.098 |
+ 0.090 |
+ 0.082 |
+ 0.099 |
+ 0.091 |
+ 0.086 |
+ 0.076 |
+ 0.080 |
+ 0.083 |
+ 0.090 |
+ 0.099 |
+ 7467 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P53_HUMAN_Kotler_2018 |
+ 0.324 |
+ 0.257 |
+ 0.276 |
+ 0.286 |
+ 0.238 |
+ 0.229 |
+ 0.124 |
+ 0.267 |
+ 0.238 |
+ 0.257 |
+ 0.286 |
+ 0.219 |
+ 0.248 |
+ 0.143 |
+ 0.133 |
+ 0.238 |
+ 0.267 |
+ 0.276 |
+ 0.267 |
+ 0.219 |
+ 0.200 |
+ 0.267 |
+ 0.219 |
+ 0.229 |
+ 0.210 |
+ 0.200 |
+ 0.210 |
+ 0.200 |
+ 0.162 |
+ 0.286 |
+ 0.267 |
+ 0.219 |
+ 0.038 |
+ 0.162 |
+ 0.229 |
+ 0.229 |
+ 0.324 |
+ 0.305 |
+ 0.314 |
+ 0.276 |
+ 0.286 |
+ 0.257 |
+ 0.143 |
+ 0.114 |
+ 0.200 |
+ 0.114 |
+ 0.190 |
+ 0.229 |
+ 0.210 |
+ 0.181 |
+ 0.210 |
+ 0.248 |
+ 0.248 |
+ 0.229 |
+ 0.238 |
+ 0.238 |
+ 0.210 |
+ 0.229 |
+ 0.238 |
+ 0.248 |
+ 0.286 |
+ 0.219 |
+ 1048 |
+ OrganismalFitness |
+ P53_HUMAN |
+ Low |
+ Human |
+
+
+ P84126_THETH_Chan_2017 |
+ 0.276 |
+ 0.414 |
+ 0.388 |
+ 0.401 |
+ 0.382 |
+ 0.388 |
+ 0.184 |
+ 0.289 |
+ 0.401 |
+ 0.375 |
+ 0.408 |
+ 0.368 |
+ 0.382 |
+ 0.204 |
+ 0.401 |
+ 0.355 |
+ 0.401 |
+ 0.368 |
+ 0.414 |
+ 0.375 |
+ 0.263 |
+ 0.329 |
+ 0.342 |
+ 0.382 |
+ 0.322 |
+ 0.395 |
+ 0.395 |
+ 0.355 |
+ 0.447 |
+ 0.316 |
+ 0.362 |
+ 0.322 |
+ 0.118 |
+ 0.309 |
+ 0.316 |
+ 0.355 |
+ 0.289 |
+ 0.270 |
+ 0.309 |
+ 0.355 |
+ 0.362 |
+ 0.355 |
+ 0.257 |
+ 0.112 |
+ 0.382 |
+ 0.395 |
+ 0.204 |
+ 0.388 |
+ 0.362 |
+ 0.178 |
+ 0.329 |
+ 0.362 |
+ 0.322 |
+ 0.349 |
+ 0.362 |
+ 0.395 |
+ 0.375 |
+ 0.362 |
+ 0.401 |
+ 0.388 |
+ 0.395 |
+ 0.388 |
+ 1519 |
+ OrganismalFitness |
+ P84126_THETH |
+ Medium |
+ Prokaryote |
+
+
+ PA_I34A1_Wu_2015 |
+ 0.335 |
+ 0.335 |
+ 0.335 |
+ 0.324 |
+ 0.335 |
+ 0.324 |
+ 0.115 |
+ 0.269 |
+ 0.088 |
+ 0.132 |
+ 0.110 |
+ 0.110 |
+ 0.093 |
+ 0.104 |
+ 0.093 |
+ 0.104 |
+ 0.104 |
+ 0.099 |
+ 0.247 |
+ 0.236 |
+ 0.280 |
+ 0.275 |
+ 0.264 |
+ 0.330 |
+ 0.121 |
+ 0.258 |
+ 0.258 |
+ 0.225 |
+ 0.264 |
+ 0.291 |
+ 0.264 |
+ 0.242 |
+ 0.126 |
+ 0.275 |
+ 0.280 |
+ 0.302 |
+ 0.335 |
+ 0.319 |
+ 0.319 |
+ 0.330 |
+ 0.341 |
+ 0.319 |
+ 0.110 |
+ 0.110 |
+ 0.104 |
+ 0.110 |
+ 0.115 |
+ 0.099 |
+ 0.071 |
+ 0.137 |
+ 0.088 |
+ 0.093 |
+ 0.099 |
+ 0.088 |
+ 0.088 |
+ 0.110 |
+ 0.071 |
+ 0.082 |
+ 0.082 |
+ 0.115 |
+ 0.104 |
+ 0.071 |
+ 1820 |
+ OrganismalFitness |
+ PA_I34A1 |
+ Medium |
+ Virus |
+
+
+ PABP_YEAST_Melamed_2013 |
+ 0.279 |
+ 0.269 |
+ 0.275 |
+ 0.275 |
+ 0.290 |
+ 0.285 |
+ 0.182 |
+ 0.234 |
+ 0.269 |
+ 0.276 |
+ 0.277 |
+ 0.287 |
+ 0.289 |
+ 0.172 |
+ 0.191 |
+ 0.210 |
+ 0.242 |
+ 0.245 |
+ 0.259 |
+ 0.249 |
+ 0.255 |
+ 0.285 |
+ 0.275 |
+ 0.282 |
+ 0.270 |
+ 0.281 |
+ 0.292 |
+ 0.281 |
+ 0.250 |
+ 0.291 |
+ 0.273 |
+ 0.249 |
+ 0.121 |
+ 0.244 |
+ 0.270 |
+ 0.261 |
+ 0.269 |
+ 0.285 |
+ 0.272 |
+ 0.286 |
+ 0.291 |
+ 0.286 |
+ 0.242 |
+ 0.115 |
+ 0.288 |
+ 0.264 |
+ 0.126 |
+ 0.288 |
+ 0.200 |
+ 0.148 |
+ 0.233 |
+ 0.216 |
+ 0.229 |
+ 0.221 |
+ 0.223 |
+ 0.217 |
+ 0.221 |
+ 0.227 |
+ 0.228 |
+ 0.225 |
+ 0.269 |
+ 0.213 |
+ 37708 |
+ OrganismalFitness |
+ PABP_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ PAI1_HUMAN_Huttinger_2021 |
+ 0.207 |
+ 0.215 |
+ 0.207 |
+ 0.211 |
+ 0.211 |
+ 0.211 |
+ 0.086 |
+ 0.172 |
+ 0.200 |
+ 0.222 |
+ 0.196 |
+ 0.198 |
+ 0.217 |
+ 0.121 |
+ 0.196 |
+ 0.209 |
+ 0.200 |
+ 0.193 |
+ 0.179 |
+ 0.202 |
+ 0.146 |
+ 0.170 |
+ 0.151 |
+ 0.200 |
+ 0.178 |
+ 0.179 |
+ 0.181 |
+ 0.183 |
+ 0.168 |
+ 0.168 |
+ 0.138 |
+ 0.121 |
+ 0.129 |
+ 0.142 |
+ 0.196 |
+ 0.183 |
+ 0.194 |
+ 0.215 |
+ 0.198 |
+ 0.224 |
+ 0.217 |
+ 0.224 |
+ 0.185 |
+ 0.133 |
+ 0.196 |
+ 0.183 |
+ 0.217 |
+ 0.247 |
+ 0.219 |
+ 0.155 |
+ 0.213 |
+ 0.202 |
+ 0.219 |
+ 0.213 |
+ 0.211 |
+ 0.226 |
+ 0.224 |
+ 0.221 |
+ 0.221 |
+ 0.224 |
+ 0.232 |
+ 0.206 |
+ 5345 |
+ Activity |
+ PAI1_HUMAN |
+ NaN |
+ Human |
+
+
+ PHOT_CHLRE_Chen_2023 |
+ 0.273 |
+ 0.396 |
+ 0.515 |
+ 0.502 |
+ 0.368 |
+ 0.354 |
+ 0.401 |
+ 0.393 |
+ 0.501 |
+ 0.490 |
+ 0.434 |
+ 0.470 |
+ 0.502 |
+ 0.505 |
+ 0.529 |
+ 0.490 |
+ 0.526 |
+ 0.497 |
+ 0.507 |
+ 0.362 |
+ 0.397 |
+ 0.458 |
+ 0.413 |
+ 0.463 |
+ 0.406 |
+ 0.438 |
+ 0.459 |
+ 0.459 |
+ 0.423 |
+ 0.389 |
+ 0.426 |
+ 0.381 |
+ 0.342 |
+ 0.325 |
+ 0.317 |
+ 0.442 |
+ 0.412 |
+ 0.388 |
+ 0.453 |
+ 0.376 |
+ 0.390 |
+ 0.379 |
+ 0.463 |
+ 0.288 |
+ 0.464 |
+ 0.490 |
+ 0.230 |
+ 0.415 |
+ 0.396 |
+ 0.269 |
+ 0.412 |
+ 0.377 |
+ 0.378 |
+ 0.390 |
+ 0.386 |
+ 0.401 |
+ 0.397 |
+ 0.402 |
+ 0.401 |
+ 0.395 |
+ 0.491 |
+ 0.484 |
+ 167529 |
+ Activity |
+ PHOT_CHLRE |
+ High |
+ Eukaryote |
+
+
+ PIN1_HUMAN_Tsuboyama_2023_1I6C |
+ 0.333 |
+ 0.383 |
+ 0.420 |
+ 0.444 |
+ 0.407 |
+ 0.395 |
+ 0.346 |
+ 0.185 |
+ 0.333 |
+ 0.432 |
+ 0.321 |
+ 0.296 |
+ 0.358 |
+ 0.370 |
+ 0.358 |
+ 0.395 |
+ 0.420 |
+ 0.296 |
+ 0.346 |
+ 0.346 |
+ 0.296 |
+ 0.222 |
+ 0.160 |
+ 0.111 |
+ 0.210 |
+ 0.185 |
+ 0.123 |
+ 0.099 |
+ 0.086 |
+ 0.198 |
+ 0.247 |
+ 0.185 |
+ 0.198 |
+ 0.321 |
+ 0.247 |
+ 0.210 |
+ 0.395 |
+ 0.370 |
+ 0.296 |
+ 0.383 |
+ 0.346 |
+ 0.309 |
+ 0.321 |
+ 0.148 |
+ 0.358 |
+ 0.358 |
+ 0.272 |
+ 0.321 |
+ 0.284 |
+ 0.259 |
+ 0.296 |
+ 0.383 |
+ 0.383 |
+ 0.321 |
+ 0.309 |
+ 0.358 |
+ 0.346 |
+ 0.333 |
+ 0.346 |
+ 0.370 |
+ 0.407 |
+ 0.395 |
+ 802 |
+ Stability |
+ PIN1_HUMAN |
+ High |
+ Human |
+
+
+ PITX2_HUMAN_Tsuboyama_2023_2L7M |
+ 0.366 |
+ 0.339 |
+ 0.333 |
+ 0.322 |
+ 0.333 |
+ 0.339 |
+ 0.295 |
+ 0.213 |
+ 0.273 |
+ 0.350 |
+ 0.219 |
+ 0.290 |
+ 0.273 |
+ 0.415 |
+ 0.383 |
+ 0.415 |
+ 0.393 |
+ 0.361 |
+ 0.328 |
+ 0.322 |
+ 0.213 |
+ 0.186 |
+ 0.186 |
+ 0.186 |
+ 0.273 |
+ 0.219 |
+ 0.235 |
+ 0.180 |
+ 0.191 |
+ 0.311 |
+ 0.240 |
+ 0.273 |
+ 0.230 |
+ 0.246 |
+ 0.301 |
+ 0.164 |
+ 0.295 |
+ 0.350 |
+ 0.251 |
+ 0.328 |
+ 0.344 |
+ 0.322 |
+ 0.295 |
+ 0.311 |
+ 0.180 |
+ 0.257 |
+ 0.388 |
+ 0.202 |
+ 0.443 |
+ 0.377 |
+ 0.377 |
+ 0.410 |
+ 0.421 |
+ 0.410 |
+ 0.410 |
+ 0.404 |
+ 0.388 |
+ 0.388 |
+ 0.410 |
+ 0.404 |
+ 0.180 |
+ 0.339 |
+ 1824 |
+ Stability |
+ PITX2_HUMAN |
+ High |
+ Human |
+
+
+ PKN1_HUMAN_Tsuboyama_2023_1URF |
+ 0.145 |
+ 0.282 |
+ 0.145 |
+ 0.137 |
+ 0.122 |
+ 0.153 |
+ 0.229 |
+ 0.046 |
+ 0.107 |
+ 0.099 |
+ 0.069 |
+ 0.130 |
+ 0.099 |
+ 0.183 |
+ 0.160 |
+ 0.191 |
+ 0.092 |
+ 0.046 |
+ 0.061 |
+ 0.176 |
+ 0.176 |
+ 0.160 |
+ 0.198 |
+ 0.153 |
+ 0.153 |
+ 0.137 |
+ 0.160 |
+ 0.183 |
+ 0.069 |
+ 0.046 |
+ 0.092 |
+ 0.099 |
+ 0.099 |
+ 0.221 |
+ 0.191 |
+ 0.191 |
+ 0.176 |
+ 0.176 |
+ 0.168 |
+ 0.183 |
+ 0.168 |
+ 0.153 |
+ 0.153 |
+ 0.214 |
+ 0.130 |
+ 0.183 |
+ 0.252 |
+ 0.153 |
+ 0.160 |
+ 0.214 |
+ 0.115 |
+ 0.115 |
+ 0.122 |
+ 0.115 |
+ 0.107 |
+ 0.107 |
+ 0.099 |
+ 0.122 |
+ 0.115 |
+ 0.115 |
+ 0.198 |
+ 0.153 |
+ 1301 |
+ Stability |
+ PKN1_HUMAN |
+ High |
+ Human |
+
+
+ POLG_CXB3N_Mattenberger_2021 |
+ 0.309 |
+ 0.380 |
+ 0.361 |
+ 0.387 |
+ 0.392 |
+ 0.399 |
+ 0.088 |
+ 0.261 |
+ 0.394 |
+ 0.389 |
+ 0.176 |
+ 0.090 |
+ 0.113 |
+ 0.097 |
+ 0.091 |
+ 0.136 |
+ 0.231 |
+ 0.252 |
+ 0.288 |
+ 0.279 |
+ 0.288 |
+ 0.348 |
+ 0.352 |
+ 0.328 |
+ 0.145 |
+ 0.316 |
+ 0.319 |
+ 0.310 |
+ 0.372 |
+ 0.391 |
+ 0.289 |
+ 0.284 |
+ 0.096 |
+ 0.099 |
+ 0.251 |
+ 0.316 |
+ 0.286 |
+ 0.321 |
+ 0.342 |
+ 0.349 |
+ 0.373 |
+ 0.375 |
+ 0.098 |
+ 0.092 |
+ 0.196 |
+ 0.100 |
+ 0.151 |
+ 0.215 |
+ 0.101 |
+ 0.121 |
+ 0.197 |
+ 0.211 |
+ 0.211 |
+ 0.212 |
+ 0.199 |
+ 0.202 |
+ 0.214 |
+ 0.223 |
+ 0.212 |
+ 0.218 |
+ 0.109 |
+ 0.102 |
+ 15711 |
+ OrganismalFitness |
+ POLG_CXB3N |
+ Medium |
+ Virus |
+
+
+ POLG_DEN26_Suphatrakul_2023 |
+ 0.327 |
+ 0.395 |
+ 0.341 |
+ 0.358 |
+ 0.384 |
+ 0.386 |
+ 0.086 |
+ 0.299 |
+ 0.473 |
+ 0.477 |
+ 0.143 |
+ 0.089 |
+ 0.102 |
+ 0.080 |
+ 0.086 |
+ 0.109 |
+ 0.134 |
+ 0.140 |
+ 0.164 |
+ 0.331 |
+ 0.330 |
+ 0.325 |
+ 0.331 |
+ 0.292 |
+ 0.331 |
+ 0.344 |
+ 0.340 |
+ 0.351 |
+ 0.349 |
+ 0.454 |
+ 0.411 |
+ 0.314 |
+ 0.104 |
+ 0.089 |
+ 0.160 |
+ 0.366 |
+ 0.288 |
+ 0.332 |
+ 0.405 |
+ 0.383 |
+ 0.393 |
+ 0.433 |
+ 0.089 |
+ 0.086 |
+ 0.198 |
+ 0.088 |
+ 0.188 |
+ 0.278 |
+ 0.070 |
+ 0.136 |
+ 0.138 |
+ 0.138 |
+ 0.166 |
+ 0.146 |
+ 0.152 |
+ 0.154 |
+ 0.144 |
+ 0.150 |
+ 0.140 |
+ 0.157 |
+ 0.120 |
+ 0.114 |
+ 16897 |
+ OrganismalFitness |
+ POLG_DEN26 |
+ Low |
+ Virus |
+
+
+ POLG_HCVJF_Qi_2014 |
+ 0.362 |
+ 0.411 |
+ 0.368 |
+ 0.380 |
+ 0.393 |
+ 0.405 |
+ 0.074 |
+ 0.153 |
+ 0.387 |
+ 0.387 |
+ 0.123 |
+ 0.387 |
+ 0.350 |
+ 0.086 |
+ 0.092 |
+ 0.080 |
+ 0.110 |
+ 0.092 |
+ 0.104 |
+ 0.239 |
+ 0.196 |
+ 0.276 |
+ 0.325 |
+ 0.288 |
+ 0.282 |
+ 0.270 |
+ 0.239 |
+ 0.239 |
+ 0.294 |
+ 0.387 |
+ 0.313 |
+ 0.221 |
+ 0.067 |
+ 0.368 |
+ 0.356 |
+ 0.288 |
+ 0.362 |
+ 0.374 |
+ 0.362 |
+ 0.393 |
+ 0.368 |
+ 0.344 |
+ 0.086 |
+ 0.074 |
+ 0.331 |
+ 0.117 |
+ 0.080 |
+ 0.245 |
+ 0.233 |
+ 0.190 |
+ 0.172 |
+ 0.153 |
+ 0.123 |
+ 0.153 |
+ 0.141 |
+ 0.129 |
+ 0.221 |
+ 0.153 |
+ 0.166 |
+ 0.178 |
+ 0.092 |
+ 0.098 |
+ 1630 |
+ OrganismalFitness |
+ POLG_HCVJF |
+ Medium |
+ Virus |
+
+
+ POLG_PESV_Tsuboyama_2023_2MXD |
+ 0.164 |
+ 0.519 |
+ 0.435 |
+ 0.454 |
+ 0.464 |
+ 0.464 |
+ 0.088 |
+ 0.480 |
+ 0.333 |
+ 0.497 |
+ 0.489 |
+ 0.113 |
+ 0.129 |
+ 0.099 |
+ 0.123 |
+ 0.117 |
+ 0.156 |
+ 0.150 |
+ 0.127 |
+ 0.595 |
+ 0.097 |
+ 0.078 |
+ 0.086 |
+ 0.090 |
+ 0.109 |
+ 0.049 |
+ 0.080 |
+ 0.051 |
+ 0.131 |
+ 0.558 |
+ 0.608 |
+ 0.620 |
+ 0.113 |
+ 0.105 |
+ 0.094 |
+ 0.088 |
+ 0.296 |
+ 0.294 |
+ 0.281 |
+ 0.437 |
+ 0.437 |
+ 0.429 |
+ 0.185 |
+ 0.127 |
+ 0.189 |
+ 0.154 |
+ 0.550 |
+ 0.507 |
+ 0.561 |
+ 0.581 |
+ 0.622 |
+ 0.630 |
+ 0.630 |
+ 0.651 |
+ 0.653 |
+ 0.663 |
+ 0.634 |
+ 0.690 |
+ 0.637 |
+ 0.663 |
+ 0.569 |
+ 0.326 |
+ 5130 |
+ Stability |
+ POLG_PESV |
+ Medium |
+ Virus |
+
+
+ PPARG_HUMAN_Majithia_2016 |
+ 0.160 |
+ 0.237 |
+ 0.259 |
+ 0.253 |
+ 0.255 |
+ 0.241 |
+ 0.146 |
+ 0.193 |
+ 0.252 |
+ 0.242 |
+ 0.245 |
+ 0.274 |
+ 0.290 |
+ 0.124 |
+ 0.135 |
+ 0.138 |
+ 0.158 |
+ 0.220 |
+ 0.277 |
+ 0.288 |
+ 0.278 |
+ 0.293 |
+ 0.175 |
+ 0.175 |
+ 0.234 |
+ 0.308 |
+ 0.332 |
+ 0.342 |
+ 0.179 |
+ 0.269 |
+ 0.162 |
+ 0.076 |
+ 0.117 |
+ 0.294 |
+ 0.268 |
+ 0.274 |
+ 0.275 |
+ 0.330 |
+ 0.308 |
+ 0.255 |
+ 0.275 |
+ 0.266 |
+ 0.124 |
+ 0.121 |
+ 0.250 |
+ 0.142 |
+ 0.244 |
+ 0.274 |
+ 0.248 |
+ 0.162 |
+ 0.184 |
+ 0.167 |
+ 0.146 |
+ 0.137 |
+ 0.193 |
+ 0.147 |
+ 0.164 |
+ 0.148 |
+ 0.127 |
+ 0.153 |
+ 0.198 |
+ 0.149 |
+ 9576 |
+ Activity |
+ PPARG_HUMAN |
+ Medium |
+ Human |
+
+
+ PPM1D_HUMAN_Miller_2022 |
+ 0.180 |
+ 0.176 |
+ 0.180 |
+ 0.187 |
+ 0.180 |
+ 0.177 |
+ 0.128 |
+ 0.154 |
+ 0.176 |
+ 0.173 |
+ 0.183 |
+ 0.210 |
+ 0.200 |
+ 0.131 |
+ 0.140 |
+ 0.161 |
+ 0.174 |
+ 0.188 |
+ 0.201 |
+ 0.163 |
+ 0.172 |
+ 0.174 |
+ 0.205 |
+ 0.181 |
+ 0.177 |
+ 0.202 |
+ 0.192 |
+ 0.214 |
+ 0.174 |
+ 0.181 |
+ 0.172 |
+ 0.135 |
+ 0.113 |
+ 0.163 |
+ 0.188 |
+ 0.206 |
+ 0.191 |
+ 0.192 |
+ 0.205 |
+ 0.187 |
+ 0.185 |
+ 0.186 |
+ 0.142 |
+ 0.115 |
+ 0.191 |
+ 0.171 |
+ 0.169 |
+ 0.201 |
+ 0.188 |
+ 0.135 |
+ 0.177 |
+ 0.180 |
+ 0.176 |
+ 0.167 |
+ 0.181 |
+ 0.168 |
+ 0.163 |
+ 0.168 |
+ 0.173 |
+ 0.181 |
+ 0.215 |
+ 0.158 |
+ 7889 |
+ OrganismalFitness |
+ PPM1D_HUMAN |
+ Low |
+ Human |
+
+
+ PR40A_HUMAN_Tsuboyama_2023_1UZC |
+ 0.191 |
+ 0.230 |
+ 0.343 |
+ 0.343 |
+ 0.299 |
+ 0.284 |
+ 0.216 |
+ 0.216 |
+ 0.392 |
+ 0.392 |
+ 0.397 |
+ 0.191 |
+ 0.265 |
+ 0.221 |
+ 0.225 |
+ 0.358 |
+ 0.387 |
+ 0.402 |
+ 0.265 |
+ 0.299 |
+ 0.363 |
+ 0.358 |
+ 0.387 |
+ 0.333 |
+ 0.353 |
+ 0.392 |
+ 0.363 |
+ 0.353 |
+ 0.363 |
+ 0.382 |
+ 0.299 |
+ 0.250 |
+ 0.221 |
+ 0.206 |
+ 0.245 |
+ 0.235 |
+ 0.294 |
+ 0.275 |
+ 0.245 |
+ 0.319 |
+ 0.314 |
+ 0.324 |
+ 0.216 |
+ 0.221 |
+ 0.299 |
+ 0.240 |
+ 0.338 |
+ 0.338 |
+ 0.426 |
+ 0.319 |
+ 0.382 |
+ 0.412 |
+ 0.382 |
+ 0.387 |
+ 0.397 |
+ 0.407 |
+ 0.402 |
+ 0.392 |
+ 0.407 |
+ 0.402 |
+ 0.402 |
+ 0.431 |
+ 2033 |
+ Stability |
+ PR40A_HUMAN |
+ Medium |
+ Human |
+
+
+ PRKN_HUMAN_Clausen_2023 |
+ 0.205 |
+ 0.189 |
+ 0.201 |
+ 0.207 |
+ 0.210 |
+ 0.217 |
+ 0.120 |
+ 0.187 |
+ 0.170 |
+ 0.187 |
+ 0.197 |
+ 0.236 |
+ 0.227 |
+ 0.170 |
+ 0.185 |
+ 0.225 |
+ 0.256 |
+ 0.293 |
+ 0.292 |
+ 0.197 |
+ 0.183 |
+ 0.213 |
+ 0.194 |
+ 0.168 |
+ 0.187 |
+ 0.249 |
+ 0.178 |
+ 0.241 |
+ 0.179 |
+ 0.186 |
+ 0.151 |
+ 0.128 |
+ 0.110 |
+ 0.155 |
+ 0.202 |
+ 0.167 |
+ 0.209 |
+ 0.202 |
+ 0.183 |
+ 0.216 |
+ 0.211 |
+ 0.187 |
+ 0.185 |
+ 0.110 |
+ 0.264 |
+ 0.185 |
+ 0.289 |
+ 0.243 |
+ 0.308 |
+ 0.161 |
+ 0.249 |
+ 0.271 |
+ 0.284 |
+ 0.269 |
+ 0.236 |
+ 0.245 |
+ 0.273 |
+ 0.273 |
+ 0.275 |
+ 0.268 |
+ 0.276 |
+ 0.271 |
+ 8756 |
+ Expression |
+ PRKN_HUMAN |
+ Low |
+ Human |
+
+
+ PSAE_SYNP2_Tsuboyama_2023_1PSE |
+ 0.177 |
+ 0.165 |
+ 0.127 |
+ 0.133 |
+ 0.152 |
+ 0.152 |
+ 0.127 |
+ 0.070 |
+ 0.108 |
+ 0.095 |
+ 0.373 |
+ 0.158 |
+ 0.165 |
+ 0.120 |
+ 0.190 |
+ 0.266 |
+ 0.278 |
+ 0.209 |
+ 0.196 |
+ 0.133 |
+ 0.120 |
+ 0.057 |
+ 0.120 |
+ 0.139 |
+ 0.146 |
+ 0.152 |
+ 0.044 |
+ 0.089 |
+ 0.101 |
+ 0.133 |
+ 0.146 |
+ 0.133 |
+ 0.108 |
+ 0.120 |
+ 0.089 |
+ 0.127 |
+ 0.190 |
+ 0.177 |
+ 0.139 |
+ 0.152 |
+ 0.165 |
+ 0.146 |
+ 0.165 |
+ 0.139 |
+ 0.203 |
+ 0.139 |
+ 0.278 |
+ 0.259 |
+ 0.316 |
+ 0.291 |
+ 0.310 |
+ 0.241 |
+ 0.259 |
+ 0.222 |
+ 0.259 |
+ 0.234 |
+ 0.266 |
+ 0.259 |
+ 0.222 |
+ 0.266 |
+ 0.209 |
+ 0.272 |
+ 1579 |
+ Stability |
+ PSAE_PICP2 |
+ Medium |
+ Prokaryote |
+
+
+ PTEN_HUMAN_Matreyek_2021 |
+ 0.157 |
+ 0.183 |
+ 0.171 |
+ 0.165 |
+ 0.153 |
+ 0.163 |
+ 0.102 |
+ 0.159 |
+ 0.181 |
+ 0.200 |
+ 0.185 |
+ 0.191 |
+ 0.212 |
+ 0.104 |
+ 0.136 |
+ 0.157 |
+ 0.183 |
+ 0.147 |
+ 0.169 |
+ 0.181 |
+ 0.110 |
+ 0.204 |
+ 0.191 |
+ 0.204 |
+ 0.151 |
+ 0.169 |
+ 0.173 |
+ 0.181 |
+ 0.179 |
+ 0.208 |
+ 0.167 |
+ 0.165 |
+ 0.118 |
+ 0.120 |
+ 0.208 |
+ 0.183 |
+ 0.145 |
+ 0.200 |
+ 0.194 |
+ 0.165 |
+ 0.193 |
+ 0.185 |
+ 0.118 |
+ 0.094 |
+ 0.194 |
+ 0.149 |
+ 0.173 |
+ 0.204 |
+ 0.159 |
+ 0.130 |
+ 0.200 |
+ 0.198 |
+ 0.193 |
+ 0.179 |
+ 0.189 |
+ 0.181 |
+ 0.179 |
+ 0.181 |
+ 0.193 |
+ 0.200 |
+ 0.194 |
+ 0.169 |
+ 5083 |
+ Expression |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ PTEN_HUMAN_Mighell_2018 |
+ 0.208 |
+ 0.160 |
+ 0.179 |
+ 0.185 |
+ 0.175 |
+ 0.175 |
+ 0.116 |
+ 0.139 |
+ 0.153 |
+ 0.150 |
+ 0.132 |
+ 0.138 |
+ 0.139 |
+ 0.123 |
+ 0.169 |
+ 0.187 |
+ 0.160 |
+ 0.161 |
+ 0.169 |
+ 0.156 |
+ 0.125 |
+ 0.132 |
+ 0.172 |
+ 0.153 |
+ 0.150 |
+ 0.171 |
+ 0.157 |
+ 0.161 |
+ 0.160 |
+ 0.153 |
+ 0.161 |
+ 0.138 |
+ 0.107 |
+ 0.146 |
+ 0.135 |
+ 0.152 |
+ 0.191 |
+ 0.147 |
+ 0.175 |
+ 0.175 |
+ 0.153 |
+ 0.174 |
+ 0.160 |
+ 0.110 |
+ 0.136 |
+ 0.198 |
+ 0.149 |
+ 0.135 |
+ 0.179 |
+ 0.157 |
+ 0.150 |
+ 0.156 |
+ 0.161 |
+ 0.157 |
+ 0.147 |
+ 0.160 |
+ 0.160 |
+ 0.150 |
+ 0.160 |
+ 0.150 |
+ 0.196 |
+ 0.207 |
+ 7260 |
+ Activity |
+ PTEN_HUMAN |
+ Medium |
+ Human |
+
+
+ Q2N0S5_9HIV1_Haddox_2018 |
+ 0.240 |
+ 0.211 |
+ 0.189 |
+ 0.190 |
+ 0.211 |
+ 0.214 |
+ 0.079 |
+ 0.189 |
+ 0.225 |
+ 0.229 |
+ 0.218 |
+ 0.233 |
+ 0.236 |
+ 0.088 |
+ 0.082 |
+ 0.086 |
+ 0.110 |
+ 0.104 |
+ 0.123 |
+ 0.208 |
+ 0.238 |
+ 0.191 |
+ 0.184 |
+ 0.163 |
+ 0.236 |
+ 0.192 |
+ 0.185 |
+ 0.185 |
+ 0.178 |
+ 0.232 |
+ 0.195 |
+ 0.196 |
+ 0.162 |
+ 0.232 |
+ 0.204 |
+ 0.207 |
+ 0.234 |
+ 0.229 |
+ 0.230 |
+ 0.219 |
+ 0.215 |
+ 0.215 |
+ 0.202 |
+ 0.070 |
+ 0.222 |
+ 0.220 |
+ 0.165 |
+ 0.237 |
+ 0.109 |
+ 0.149 |
+ 0.125 |
+ 0.140 |
+ 0.152 |
+ 0.148 |
+ 0.139 |
+ 0.151 |
+ 0.134 |
+ 0.141 |
+ 0.122 |
+ 0.147 |
+ 0.123 |
+ 0.108 |
+ 12729 |
+ OrganismalFitness |
+ Q2N0S5_9HIV1 |
+ Medium |
+ Virus |
+
+
+ Q53Z42_HUMAN_McShan_2019_binding-TAPBPR |
+ 0.150 |
+ 0.168 |
+ 0.171 |
+ 0.165 |
+ 0.168 |
+ 0.168 |
+ 0.112 |
+ 0.100 |
+ 0.179 |
+ 0.174 |
+ 0.153 |
+ 0.165 |
+ 0.179 |
+ 0.147 |
+ 0.176 |
+ 0.174 |
+ 0.168 |
+ 0.188 |
+ 0.159 |
+ 0.165 |
+ 0.135 |
+ 0.097 |
+ 0.097 |
+ 0.094 |
+ 0.179 |
+ 0.171 |
+ 0.150 |
+ 0.144 |
+ 0.121 |
+ 0.115 |
+ 0.126 |
+ 0.115 |
+ 0.144 |
+ 0.121 |
+ 0.088 |
+ 0.079 |
+ 0.135 |
+ 0.126 |
+ 0.100 |
+ 0.168 |
+ 0.162 |
+ 0.147 |
+ 0.168 |
+ 0.124 |
+ 0.176 |
+ 0.150 |
+ 0.200 |
+ 0.191 |
+ 0.153 |
+ 0.147 |
+ 0.141 |
+ 0.135 |
+ 0.165 |
+ 0.150 |
+ 0.162 |
+ 0.156 |
+ 0.156 |
+ 0.153 |
+ 0.144 |
+ 0.147 |
+ 0.197 |
+ 0.185 |
+ 3344 |
+ Binding |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q53Z42_HUMAN_McShan_2019_expression |
+ 0.289 |
+ 0.283 |
+ 0.301 |
+ 0.295 |
+ 0.310 |
+ 0.301 |
+ 0.095 |
+ 0.170 |
+ 0.280 |
+ 0.280 |
+ 0.286 |
+ 0.262 |
+ 0.292 |
+ 0.140 |
+ 0.158 |
+ 0.259 |
+ 0.283 |
+ 0.298 |
+ 0.292 |
+ 0.289 |
+ 0.229 |
+ 0.208 |
+ 0.190 |
+ 0.185 |
+ 0.312 |
+ 0.307 |
+ 0.265 |
+ 0.301 |
+ 0.265 |
+ 0.182 |
+ 0.235 |
+ 0.199 |
+ 0.196 |
+ 0.226 |
+ 0.176 |
+ 0.176 |
+ 0.277 |
+ 0.253 |
+ 0.205 |
+ 0.301 |
+ 0.298 |
+ 0.280 |
+ 0.188 |
+ 0.140 |
+ 0.298 |
+ 0.256 |
+ 0.298 |
+ 0.310 |
+ 0.250 |
+ 0.161 |
+ 0.262 |
+ 0.262 |
+ 0.292 |
+ 0.295 |
+ 0.301 |
+ 0.286 |
+ 0.280 |
+ 0.310 |
+ 0.289 |
+ 0.280 |
+ 0.304 |
+ 0.357 |
+ 3344 |
+ Expression |
+ Q53Z42_HUMAN |
+ Medium |
+ Human |
+
+
+ Q59976_STRSQ_Romero_2015 |
+ 0.193 |
+ 0.250 |
+ 0.243 |
+ 0.253 |
+ 0.270 |
+ 0.267 |
+ 0.160 |
+ 0.243 |
+ 0.270 |
+ 0.270 |
+ 0.230 |
+ 0.177 |
+ 0.207 |
+ 0.143 |
+ 0.180 |
+ 0.200 |
+ 0.227 |
+ 0.233 |
+ 0.233 |
+ 0.267 |
+ 0.250 |
+ 0.287 |
+ 0.287 |
+ 0.297 |
+ 0.270 |
+ 0.310 |
+ 0.313 |
+ 0.287 |
+ 0.283 |
+ 0.277 |
+ 0.227 |
+ 0.183 |
+ 0.150 |
+ 0.243 |
+ 0.283 |
+ 0.297 |
+ 0.217 |
+ 0.250 |
+ 0.273 |
+ 0.273 |
+ 0.273 |
+ 0.277 |
+ 0.213 |
+ 0.130 |
+ 0.243 |
+ 0.243 |
+ 0.207 |
+ 0.230 |
+ 0.223 |
+ 0.147 |
+ 0.200 |
+ 0.203 |
+ 0.227 |
+ 0.200 |
+ 0.220 |
+ 0.200 |
+ 0.223 |
+ 0.220 |
+ 0.217 |
+ 0.213 |
+ 0.257 |
+ 0.220 |
+ 2999 |
+ Activity |
+ Q59976_STRSQ |
+ Medium |
+ Prokaryote |
+
+
+ Q6WV13_9MAXI_Somermeyer_2022 |
+ 0.136 |
+ 0.145 |
+ 0.111 |
+ 0.110 |
+ 0.127 |
+ 0.124 |
+ 0.080 |
+ 0.097 |
+ 0.132 |
+ 0.132 |
+ 0.100 |
+ 0.092 |
+ 0.079 |
+ 0.074 |
+ 0.078 |
+ 0.072 |
+ 0.070 |
+ 0.072 |
+ 0.072 |
+ 0.119 |
+ 0.088 |
+ 0.109 |
+ 0.082 |
+ 0.116 |
+ 0.093 |
+ 0.070 |
+ 0.074 |
+ 0.062 |
+ 0.077 |
+ 0.149 |
+ 0.117 |
+ 0.117 |
+ 0.087 |
+ 0.095 |
+ 0.089 |
+ 0.115 |
+ 0.112 |
+ 0.117 |
+ 0.112 |
+ 0.123 |
+ 0.128 |
+ 0.125 |
+ 0.099 |
+ 0.079 |
+ 0.084 |
+ 0.092 |
+ 0.186 |
+ 0.152 |
+ 0.180 |
+ 0.148 |
+ 0.131 |
+ 0.138 |
+ 0.114 |
+ 0.149 |
+ 0.144 |
+ 0.145 |
+ 0.141 |
+ 0.133 |
+ 0.135 |
+ 0.141 |
+ 0.110 |
+ 0.102 |
+ 31401 |
+ Activity |
+ Q6WV12_9MAXI |
+ Low |
+ Eukaryote |
+
+
+ Q837P4_ENTFA_Meier_2023 |
+ 0.200 |
+ 0.229 |
+ 0.214 |
+ 0.200 |
+ 0.257 |
+ 0.257 |
+ 0.086 |
+ 0.129 |
+ 0.243 |
+ 0.271 |
+ 0.257 |
+ 0.229 |
+ 0.300 |
+ 0.129 |
+ 0.186 |
+ 0.186 |
+ 0.214 |
+ 0.286 |
+ 0.271 |
+ 0.100 |
+ 0.271 |
+ 0.257 |
+ 0.257 |
+ 0.357 |
+ 0.286 |
+ 0.271 |
+ 0.286 |
+ 0.300 |
+ 0.300 |
+ 0.271 |
+ 0.286 |
+ 0.257 |
+ 0.100 |
+ 0.329 |
+ 0.271 |
+ 0.300 |
+ 0.314 |
+ 0.271 |
+ 0.286 |
+ 0.257 |
+ 0.229 |
+ 0.243 |
+ 0.186 |
+ 0.071 |
+ 0.243 |
+ 0.200 |
+ 0.129 |
+ 0.229 |
+ 0.200 |
+ 0.086 |
+ 0.300 |
+ 0.243 |
+ 0.243 |
+ 0.271 |
+ 0.229 |
+ 0.200 |
+ 0.271 |
+ 0.271 |
+ 0.271 |
+ 0.286 |
+ 0.300 |
+ 0.214 |
+ 697 |
+ Activity |
+ Q837P4_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q837P5_ENTFA_Meier_2023 |
+ 0.067 |
+ 0.293 |
+ 0.267 |
+ 0.280 |
+ 0.307 |
+ 0.307 |
+ 0.120 |
+ 0.200 |
+ 0.133 |
+ 0.133 |
+ 0.253 |
+ 0.267 |
+ 0.240 |
+ 0.067 |
+ 0.120 |
+ 0.120 |
+ 0.187 |
+ 0.267 |
+ 0.227 |
+ 0.147 |
+ 0.200 |
+ 0.200 |
+ 0.293 |
+ 0.227 |
+ 0.187 |
+ 0.227 |
+ 0.293 |
+ 0.267 |
+ 0.333 |
+ 0.200 |
+ 0.240 |
+ 0.280 |
+ 0.093 |
+ 0.240 |
+ 0.200 |
+ 0.280 |
+ 0.240 |
+ 0.200 |
+ 0.267 |
+ 0.240 |
+ 0.240 |
+ 0.240 |
+ 0.093 |
+ 0.093 |
+ 0.213 |
+ 0.160 |
+ 0.120 |
+ 0.200 |
+ 0.227 |
+ 0.133 |
+ 0.200 |
+ 0.227 |
+ 0.227 |
+ 0.200 |
+ 0.213 |
+ 0.173 |
+ 0.227 |
+ 0.213 |
+ 0.240 |
+ 0.200 |
+ 0.213 |
+ 0.227 |
+ 747 |
+ Activity |
+ Q837P5_ENTFA |
+ Medium |
+ Prokaryote |
+
+
+ Q8WTC7_9CNID_Somermeyer_2022 |
+ 0.183 |
+ 0.207 |
+ 0.173 |
+ 0.179 |
+ 0.189 |
+ 0.189 |
+ 0.097 |
+ 0.150 |
+ 0.195 |
+ 0.199 |
+ 0.124 |
+ 0.107 |
+ 0.107 |
+ 0.094 |
+ 0.101 |
+ 0.104 |
+ 0.095 |
+ 0.115 |
+ 0.110 |
+ 0.141 |
+ 0.108 |
+ 0.110 |
+ 0.093 |
+ 0.110 |
+ 0.097 |
+ 0.108 |
+ 0.116 |
+ 0.185 |
+ 0.189 |
+ 0.198 |
+ 0.120 |
+ 0.123 |
+ 0.090 |
+ 0.104 |
+ 0.101 |
+ 0.191 |
+ 0.179 |
+ 0.180 |
+ 0.195 |
+ 0.192 |
+ 0.191 |
+ 0.206 |
+ 0.108 |
+ 0.113 |
+ 0.117 |
+ 0.112 |
+ 0.184 |
+ 0.179 |
+ 0.201 |
+ 0.163 |
+ 0.141 |
+ 0.138 |
+ 0.144 |
+ 0.148 |
+ 0.144 |
+ 0.152 |
+ 0.140 |
+ 0.158 |
+ 0.141 |
+ 0.150 |
+ 0.132 |
+ 0.119 |
+ 33510 |
+ Activity |
+ Q8WTC7_9CNID |
+ Low |
+ Eukaryote |
+
+
+ R1AB_SARS2_Flynn_2022 |
+ 0.169 |
+ 0.171 |
+ 0.148 |
+ 0.161 |
+ 0.166 |
+ 0.161 |
+ 0.077 |
+ 0.120 |
+ 0.059 |
+ 0.059 |
+ 0.086 |
+ 0.058 |
+ 0.056 |
+ 0.068 |
+ 0.070 |
+ 0.073 |
+ 0.084 |
+ 0.147 |
+ 0.147 |
+ 0.117 |
+ 0.138 |
+ 0.150 |
+ 0.145 |
+ 0.150 |
+ 0.134 |
+ 0.131 |
+ 0.129 |
+ 0.119 |
+ 0.143 |
+ 0.147 |
+ 0.119 |
+ 0.119 |
+ 0.113 |
+ 0.112 |
+ 0.113 |
+ 0.150 |
+ 0.161 |
+ 0.155 |
+ 0.164 |
+ 0.164 |
+ 0.164 |
+ 0.168 |
+ 0.058 |
+ 0.056 |
+ 0.079 |
+ 0.070 |
+ 0.154 |
+ 0.122 |
+ 0.169 |
+ 0.147 |
+ 0.098 |
+ 0.122 |
+ 0.119 |
+ 0.112 |
+ 0.113 |
+ 0.103 |
+ 0.092 |
+ 0.092 |
+ 0.105 |
+ 0.096 |
+ 0.106 |
+ 0.077 |
+ 5725 |
+ OrganismalFitness |
+ R1AB_SARS2 |
+ Medium |
+ Virus |
+
+
+ RAD_ANTMA_Tsuboyama_2023_2CJJ |
+ 0.315 |
+ 0.163 |
+ 0.130 |
+ 0.185 |
+ 0.130 |
+ 0.185 |
+ 0.228 |
+ 0.163 |
+ 0.196 |
+ 0.217 |
+ 0.130 |
+ 0.076 |
+ 0.109 |
+ 0.283 |
+ 0.315 |
+ 0.261 |
+ 0.217 |
+ 0.239 |
+ 0.141 |
+ 0.174 |
+ 0.337 |
+ 0.130 |
+ 0.120 |
+ 0.174 |
+ 0.152 |
+ 0.163 |
+ 0.152 |
+ 0.098 |
+ 0.098 |
+ 0.239 |
+ 0.163 |
+ 0.130 |
+ 0.207 |
+ 0.130 |
+ 0.174 |
+ 0.207 |
+ 0.293 |
+ 0.272 |
+ 0.250 |
+ 0.185 |
+ 0.196 |
+ 0.196 |
+ 0.228 |
+ 0.283 |
+ 0.130 |
+ 0.207 |
+ 0.217 |
+ 0.141 |
+ 0.250 |
+ 0.250 |
+ 0.185 |
+ 0.283 |
+ 0.228 |
+ 0.207 |
+ 0.250 |
+ 0.239 |
+ 0.239 |
+ 0.261 |
+ 0.261 |
+ 0.250 |
+ 0.098 |
+ 0.261 |
+ 912 |
+ Stability |
+ RAD_ANTMA |
+ High |
+ Eukaryote |
+
+
+ RAF1_HUMAN_Zinkus-Boltz_2019 |
+ 0.133 |
+ 0.167 |
+ 0.133 |
+ 0.167 |
+ 0.200 |
+ 0.200 |
+ 0.033 |
+ 0.200 |
+ 0.167 |
+ 0.167 |
+ 0.200 |
+ 0.133 |
+ 0.200 |
+ 0.100 |
+ 0.167 |
+ 0.133 |
+ 0.133 |
+ 0.167 |
+ 0.167 |
+ 0.167 |
+ 0.133 |
+ 0.200 |
+ 0.167 |
+ 0.133 |
+ 0.200 |
+ 0.200 |
+ 0.200 |
+ 0.200 |
+ 0.133 |
+ 0.167 |
+ 0.167 |
+ 0.167 |
+ 0.067 |
+ 0.167 |
+ 0.200 |
+ 0.133 |
+ 0.133 |
+ 0.167 |
+ 0.133 |
+ 0.167 |
+ 0.133 |
+ 0.133 |
+ 0.133 |
+ 0.067 |
+ 0.167 |
+ 0.133 |
+ 0.067 |
+ 0.200 |
+ 0.067 |
+ 0.067 |
+ 0.167 |
+ 0.133 |
+ 0.200 |
+ 0.133 |
+ 0.100 |
+ 0.133 |
+ 0.133 |
+ 0.100 |
+ 0.133 |
+ 0.133 |
+ 0.100 |
+ 0.067 |
+ 297 |
+ OrganismalFitness |
+ RAF1_HUMAN |
+ Low |
+ Human |
+
+
+ RASH_HUMAN_Bandaru_2017 |
+ 0.166 |
+ 0.092 |
+ 0.086 |
+ 0.092 |
+ 0.080 |
+ 0.089 |
+ 0.073 |
+ 0.057 |
+ 0.092 |
+ 0.086 |
+ 0.086 |
+ 0.076 |
+ 0.061 |
+ 0.162 |
+ 0.182 |
+ 0.156 |
+ 0.111 |
+ 0.099 |
+ 0.105 |
+ 0.127 |
+ 0.096 |
+ 0.099 |
+ 0.070 |
+ 0.080 |
+ 0.086 |
+ 0.080 |
+ 0.080 |
+ 0.067 |
+ 0.067 |
+ 0.108 |
+ 0.054 |
+ 0.073 |
+ 0.064 |
+ 0.115 |
+ 0.086 |
+ 0.080 |
+ 0.124 |
+ 0.105 |
+ 0.111 |
+ 0.092 |
+ 0.089 |
+ 0.089 |
+ 0.134 |
+ 0.229 |
+ 0.115 |
+ 0.115 |
+ 0.191 |
+ 0.111 |
+ 0.121 |
+ 0.194 |
+ 0.118 |
+ 0.115 |
+ 0.127 |
+ 0.108 |
+ 0.111 |
+ 0.118 |
+ 0.115 |
+ 0.115 |
+ 0.089 |
+ 0.118 |
+ 0.083 |
+ 0.153 |
+ 3134 |
+ Activity |
+ RASH_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_abundance |
+ 0.174 |
+ 0.153 |
+ 0.167 |
+ 0.174 |
+ 0.167 |
+ 0.175 |
+ 0.107 |
+ 0.156 |
+ 0.171 |
+ 0.186 |
+ 0.157 |
+ 0.154 |
+ 0.157 |
+ 0.215 |
+ 0.187 |
+ 0.197 |
+ 0.174 |
+ 0.169 |
+ 0.176 |
+ 0.171 |
+ 0.135 |
+ 0.154 |
+ 0.160 |
+ 0.169 |
+ 0.154 |
+ 0.173 |
+ 0.168 |
+ 0.163 |
+ 0.177 |
+ 0.132 |
+ 0.153 |
+ 0.133 |
+ 0.109 |
+ 0.132 |
+ 0.159 |
+ 0.169 |
+ 0.143 |
+ 0.171 |
+ 0.187 |
+ 0.173 |
+ 0.176 |
+ 0.179 |
+ 0.183 |
+ 0.203 |
+ 0.124 |
+ 0.172 |
+ 0.165 |
+ 0.146 |
+ 0.200 |
+ 0.162 |
+ 0.165 |
+ 0.160 |
+ 0.161 |
+ 0.162 |
+ 0.169 |
+ 0.160 |
+ 0.171 |
+ 0.164 |
+ 0.162 |
+ 0.168 |
+ 0.177 |
+ 0.188 |
+ 26012 |
+ Expression |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RASK_HUMAN_Weng_2022_binding-DARPin_K55 |
+ 0.235 |
+ 0.258 |
+ 0.274 |
+ 0.271 |
+ 0.267 |
+ 0.270 |
+ 0.070 |
+ 0.201 |
+ 0.289 |
+ 0.305 |
+ 0.246 |
+ 0.253 |
+ 0.260 |
+ 0.279 |
+ 0.282 |
+ 0.286 |
+ 0.293 |
+ 0.267 |
+ 0.251 |
+ 0.272 |
+ 0.238 |
+ 0.261 |
+ 0.258 |
+ 0.249 |
+ 0.279 |
+ 0.274 |
+ 0.252 |
+ 0.246 |
+ 0.226 |
+ 0.242 |
+ 0.230 |
+ 0.209 |
+ 0.176 |
+ 0.240 |
+ 0.267 |
+ 0.256 |
+ 0.229 |
+ 0.262 |
+ 0.258 |
+ 0.272 |
+ 0.275 |
+ 0.273 |
+ 0.256 |
+ 0.165 |
+ 0.271 |
+ 0.267 |
+ 0.148 |
+ 0.219 |
+ 0.291 |
+ 0.190 |
+ 0.238 |
+ 0.225 |
+ 0.230 |
+ 0.230 |
+ 0.237 |
+ 0.228 |
+ 0.237 |
+ 0.233 |
+ 0.232 |
+ 0.234 |
+ 0.268 |
+ 0.242 |
+ 24873 |
+ Binding |
+ RASK_HUMAN |
+ High |
+ Human |
+
+
+ RBP1_HUMAN_Tsuboyama_2023_2KWH |
+ 0.239 |
+ 0.157 |
+ 0.239 |
+ 0.239 |
+ 0.239 |
+ 0.239 |
+ 0.284 |
+ 0.112 |
+ 0.209 |
+ 0.209 |
+ 0.351 |
+ 0.343 |
+ 0.351 |
+ 0.328 |
+ 0.343 |
+ 0.351 |
+ 0.366 |
+ 0.157 |
+ 0.187 |
+ 0.254 |
+ 0.328 |
+ 0.097 |
+ 0.164 |
+ 0.082 |
+ 0.343 |
+ 0.201 |
+ 0.194 |
+ 0.104 |
+ 0.104 |
+ 0.157 |
+ 0.142 |
+ 0.104 |
+ 0.104 |
+ 0.299 |
+ 0.351 |
+ 0.291 |
+ 0.269 |
+ 0.246 |
+ 0.239 |
+ 0.261 |
+ 0.246 |
+ 0.224 |
+ 0.358 |
+ 0.351 |
+ 0.343 |
+ 0.343 |
+ 0.321 |
+ 0.321 |
+ 0.328 |
+ 0.284 |
+ 0.336 |
+ 0.291 |
+ 0.306 |
+ 0.299 |
+ 0.328 |
+ 0.321 |
+ 0.291 |
+ 0.306 |
+ 0.313 |
+ 0.328 |
+ 0.321 |
+ 0.321 |
+ 1332 |
+ Stability |
+ RBP1_HUMAN |
+ High |
+ Human |
+
+
+ RCD1_ARATH_Tsuboyama_2023_5OAO |
+ 0.181 |
+ 0.142 |
+ 0.165 |
+ 0.165 |
+ 0.150 |
+ 0.157 |
+ 0.165 |
+ 0.087 |
+ 0.142 |
+ 0.142 |
+ 0.283 |
+ 0.260 |
+ 0.276 |
+ 0.252 |
+ 0.307 |
+ 0.252 |
+ 0.205 |
+ 0.157 |
+ 0.197 |
+ 0.094 |
+ 0.228 |
+ 0.157 |
+ 0.181 |
+ 0.205 |
+ 0.157 |
+ 0.205 |
+ 0.213 |
+ 0.181 |
+ 0.197 |
+ 0.134 |
+ 0.110 |
+ 0.047 |
+ 0.071 |
+ 0.228 |
+ 0.165 |
+ 0.244 |
+ 0.213 |
+ 0.228 |
+ 0.220 |
+ 0.189 |
+ 0.173 |
+ 0.173 |
+ 0.236 |
+ 0.260 |
+ 0.220 |
+ 0.220 |
+ 0.228 |
+ 0.260 |
+ 0.228 |
+ 0.276 |
+ 0.189 |
+ 0.213 |
+ 0.181 |
+ 0.189 |
+ 0.189 |
+ 0.189 |
+ 0.213 |
+ 0.173 |
+ 0.181 |
+ 0.189 |
+ 0.173 |
+ 0.276 |
+ 1261 |
+ Stability |
+ RCD1_ARATH |
+ Medium |
+ Eukaryote |
+
+
+ RCRO_LAMBD_Tsuboyama_2023_1ORC |
+ 0.197 |
+ 0.272 |
+ 0.298 |
+ 0.351 |
+ 0.346 |
+ 0.311 |
+ 0.237 |
+ 0.175 |
+ 0.281 |
+ 0.311 |
+ 0.303 |
+ 0.272 |
+ 0.364 |
+ 0.250 |
+ 0.307 |
+ 0.311 |
+ 0.289 |
+ 0.281 |
+ 0.276 |
+ 0.263 |
+ 0.180 |
+ 0.250 |
+ 0.228 |
+ 0.250 |
+ 0.145 |
+ 0.303 |
+ 0.175 |
+ 0.250 |
+ 0.294 |
+ 0.289 |
+ 0.281 |
+ 0.254 |
+ 0.184 |
+ 0.219 |
+ 0.140 |
+ 0.254 |
+ 0.263 |
+ 0.246 |
+ 0.272 |
+ 0.298 |
+ 0.303 |
+ 0.281 |
+ 0.254 |
+ 0.311 |
+ 0.316 |
+ 0.263 |
+ 0.539 |
+ 0.404 |
+ 0.469 |
+ 0.469 |
+ 0.382 |
+ 0.399 |
+ 0.377 |
+ 0.408 |
+ 0.399 |
+ 0.377 |
+ 0.390 |
+ 0.346 |
+ 0.351 |
+ 0.386 |
+ 0.465 |
+ 0.158 |
+ 2278 |
+ Stability |
+ RCRO_LAMBD |
+ High |
+ Virus |
+
+
+ RD23A_HUMAN_Tsuboyama_2023_1IFY |
+ 0.441 |
+ 0.304 |
+ 0.304 |
+ 0.324 |
+ 0.304 |
+ 0.294 |
+ 0.304 |
+ 0.196 |
+ 0.245 |
+ 0.235 |
+ 0.216 |
+ 0.245 |
+ 0.255 |
+ 0.275 |
+ 0.382 |
+ 0.275 |
+ 0.225 |
+ 0.196 |
+ 0.186 |
+ 0.284 |
+ 0.147 |
+ 0.255 |
+ 0.196 |
+ 0.216 |
+ 0.265 |
+ 0.235 |
+ 0.216 |
+ 0.245 |
+ 0.176 |
+ 0.196 |
+ 0.235 |
+ 0.206 |
+ 0.294 |
+ 0.137 |
+ 0.245 |
+ 0.216 |
+ 0.353 |
+ 0.353 |
+ 0.294 |
+ 0.333 |
+ 0.324 |
+ 0.294 |
+ 0.255 |
+ 0.245 |
+ 0.245 |
+ 0.284 |
+ 0.382 |
+ 0.314 |
+ 0.441 |
+ 0.422 |
+ 0.216 |
+ 0.245 |
+ 0.255 |
+ 0.255 |
+ 0.284 |
+ 0.265 |
+ 0.275 |
+ 0.245 |
+ 0.255 |
+ 0.255 |
+ 0.235 |
+ 0.392 |
+ 1019 |
+ Stability |
+ RD23A_HUMAN |
+ High |
+ Human |
+
+
+ RDRP_I33A0_Li_2023 |
+ 0.184 |
+ 0.259 |
+ 0.386 |
+ 0.395 |
+ 0.396 |
+ 0.392 |
+ 0.071 |
+ 0.346 |
+ 0.356 |
+ 0.353 |
+ 0.117 |
+ 0.096 |
+ 0.107 |
+ 0.102 |
+ 0.096 |
+ 0.108 |
+ 0.214 |
+ 0.301 |
+ 0.338 |
+ 0.405 |
+ 0.327 |
+ 0.366 |
+ 0.377 |
+ 0.397 |
+ 0.131 |
+ 0.332 |
+ 0.314 |
+ 0.306 |
+ 0.356 |
+ 0.416 |
+ 0.345 |
+ 0.335 |
+ 0.122 |
+ 0.339 |
+ 0.360 |
+ 0.398 |
+ 0.351 |
+ 0.376 |
+ 0.386 |
+ 0.412 |
+ 0.415 |
+ 0.413 |
+ 0.102 |
+ 0.092 |
+ 0.132 |
+ 0.106 |
+ 0.138 |
+ 0.182 |
+ 0.098 |
+ 0.136 |
+ 0.224 |
+ 0.218 |
+ 0.232 |
+ 0.221 |
+ 0.221 |
+ 0.219 |
+ 0.220 |
+ 0.238 |
+ 0.229 |
+ 0.226 |
+ 0.119 |
+ 0.111 |
+ 12003 |
+ OrganismalFitness |
+ RDRP_I33A0 |
+ Low |
+ Virus |
+
+
+ REV_HV1H2_Fernandes_2016 |
+ 0.130 |
+ 0.135 |
+ 0.144 |
+ 0.135 |
+ 0.149 |
+ 0.144 |
+ 0.112 |
+ 0.140 |
+ 0.135 |
+ 0.126 |
+ 0.084 |
+ 0.144 |
+ 0.126 |
+ 0.107 |
+ 0.093 |
+ 0.065 |
+ 0.130 |
+ 0.107 |
+ 0.107 |
+ 0.112 |
+ 0.116 |
+ 0.098 |
+ 0.093 |
+ 0.130 |
+ 0.102 |
+ 0.149 |
+ 0.098 |
+ 0.126 |
+ 0.130 |
+ 0.135 |
+ 0.135 |
+ 0.126 |
+ 0.116 |
+ 0.130 |
+ 0.140 |
+ 0.112 |
+ 0.144 |
+ 0.153 |
+ 0.121 |
+ 0.153 |
+ 0.144 |
+ 0.140 |
+ 0.102 |
+ 0.088 |
+ 0.112 |
+ 0.102 |
+ 0.070 |
+ 0.098 |
+ 0.116 |
+ 0.126 |
+ 0.135 |
+ 0.130 |
+ 0.126 |
+ 0.098 |
+ 0.116 |
+ 0.093 |
+ 0.112 |
+ 0.116 |
+ 0.102 |
+ 0.112 |
+ 0.079 |
+ 0.121 |
+ 2147 |
+ OrganismalFitness |
+ REV_HV1H2 |
+ Medium |
+ Virus |
+
+
+ RFAH_ECOLI_Tsuboyama_2023_2LCL |
+ 0.233 |
+ 0.248 |
+ 0.180 |
+ 0.173 |
+ 0.195 |
+ 0.165 |
+ 0.165 |
+ 0.075 |
+ 0.105 |
+ 0.173 |
+ 0.248 |
+ 0.173 |
+ 0.226 |
+ 0.180 |
+ 0.158 |
+ 0.211 |
+ 0.241 |
+ 0.120 |
+ 0.068 |
+ 0.090 |
+ 0.143 |
+ 0.165 |
+ 0.180 |
+ 0.158 |
+ 0.256 |
+ 0.158 |
+ 0.195 |
+ 0.150 |
+ 0.083 |
+ 0.083 |
+ 0.135 |
+ 0.105 |
+ 0.053 |
+ 0.203 |
+ 0.083 |
+ 0.068 |
+ 0.211 |
+ 0.090 |
+ 0.158 |
+ 0.195 |
+ 0.135 |
+ 0.173 |
+ 0.226 |
+ 0.180 |
+ 0.233 |
+ 0.241 |
+ 0.241 |
+ 0.173 |
+ 0.301 |
+ 0.286 |
+ 0.188 |
+ 0.188 |
+ 0.195 |
+ 0.180 |
+ 0.188 |
+ 0.188 |
+ 0.241 |
+ 0.188 |
+ 0.195 |
+ 0.195 |
+ 0.278 |
+ 0.226 |
+ 1326 |
+ Stability |
+ RFAH_ECOLI |
+ High |
+ Prokaryote |
+
+
+ RL20_AQUAE_Tsuboyama_2023_1GYZ |
+ 0.095 |
+ 0.184 |
+ 0.129 |
+ 0.136 |
+ 0.129 |
+ 0.129 |
+ 0.163 |
+ 0.122 |
+ 0.184 |
+ 0.163 |
+ 0.252 |
+ 0.204 |
+ 0.238 |
+ 0.190 |
+ 0.184 |
+ 0.190 |
+ 0.190 |
+ 0.136 |
+ 0.129 |
+ 0.122 |
+ 0.116 |
+ 0.156 |
+ 0.122 |
+ 0.150 |
+ 0.102 |
+ 0.122 |
+ 0.129 |
+ 0.150 |
+ 0.143 |
+ 0.048 |
+ 0.129 |
+ 0.109 |
+ 0.088 |
+ 0.136 |
+ 0.156 |
+ 0.109 |
+ 0.143 |
+ 0.163 |
+ 0.143 |
+ 0.143 |
+ 0.143 |
+ 0.136 |
+ 0.224 |
+ 0.150 |
+ 0.150 |
+ 0.224 |
+ 0.333 |
+ 0.184 |
+ 0.361 |
+ 0.313 |
+ 0.204 |
+ 0.204 |
+ 0.211 |
+ 0.184 |
+ 0.177 |
+ 0.197 |
+ 0.177 |
+ 0.184 |
+ 0.184 |
+ 0.190 |
+ 0.177 |
+ 0.272 |
+ 1461 |
+ Stability |
+ RL20_AQUAE |
+ High |
+ Prokaryote |
+
+
+ RL40A_YEAST_Mavor_2016 |
+ 0.159 |
+ 0.167 |
+ 0.167 |
+ 0.190 |
+ 0.190 |
+ 0.183 |
+ 0.095 |
+ 0.143 |
+ 0.175 |
+ 0.190 |
+ 0.135 |
+ 0.190 |
+ 0.159 |
+ 0.095 |
+ 0.143 |
+ 0.190 |
+ 0.151 |
+ 0.151 |
+ 0.206 |
+ 0.190 |
+ 0.175 |
+ 0.135 |
+ 0.127 |
+ 0.111 |
+ 0.183 |
+ 0.167 |
+ 0.151 |
+ 0.190 |
+ 0.135 |
+ 0.159 |
+ 0.143 |
+ 0.167 |
+ 0.087 |
+ 0.119 |
+ 0.119 |
+ 0.119 |
+ 0.159 |
+ 0.167 |
+ 0.167 |
+ 0.183 |
+ 0.198 |
+ 0.167 |
+ 0.143 |
+ 0.143 |
+ 0.135 |
+ 0.111 |
+ 0.103 |
+ 0.056 |
+ 0.127 |
+ 0.040 |
+ 0.198 |
+ 0.167 |
+ 0.175 |
+ 0.167 |
+ 0.175 |
+ 0.151 |
+ 0.151 |
+ 0.183 |
+ 0.183 |
+ 0.183 |
+ 0.111 |
+ 0.135 |
+ 1253 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2013 |
+ 0.117 |
+ 0.192 |
+ 0.175 |
+ 0.217 |
+ 0.208 |
+ 0.217 |
+ 0.017 |
+ 0.175 |
+ 0.217 |
+ 0.233 |
+ 0.150 |
+ 0.150 |
+ 0.192 |
+ 0.058 |
+ 0.142 |
+ 0.208 |
+ 0.183 |
+ 0.183 |
+ 0.233 |
+ 0.167 |
+ 0.200 |
+ 0.175 |
+ 0.208 |
+ 0.150 |
+ 0.125 |
+ 0.175 |
+ 0.167 |
+ 0.167 |
+ 0.192 |
+ 0.242 |
+ 0.242 |
+ 0.233 |
+ 0.175 |
+ 0.192 |
+ 0.150 |
+ 0.150 |
+ 0.183 |
+ 0.158 |
+ 0.158 |
+ 0.192 |
+ 0.217 |
+ 0.200 |
+ 0.183 |
+ 0.100 |
+ 0.192 |
+ 0.158 |
+ 0.042 |
+ 0.125 |
+ 0.108 |
+ 0.050 |
+ 0.125 |
+ 0.142 |
+ 0.117 |
+ 0.142 |
+ 0.133 |
+ 0.142 |
+ 0.125 |
+ 0.142 |
+ 0.175 |
+ 0.125 |
+ 0.192 |
+ 0.175 |
+ 1195 |
+ OrganismalFitness |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RL40A_YEAST_Roscoe_2014 |
+ 0.123 |
+ 0.094 |
+ 0.116 |
+ 0.130 |
+ 0.116 |
+ 0.109 |
+ 0.072 |
+ 0.138 |
+ 0.101 |
+ 0.109 |
+ 0.159 |
+ 0.138 |
+ 0.123 |
+ 0.123 |
+ 0.217 |
+ 0.116 |
+ 0.152 |
+ 0.138 |
+ 0.138 |
+ 0.145 |
+ 0.174 |
+ 0.167 |
+ 0.138 |
+ 0.167 |
+ 0.188 |
+ 0.130 |
+ 0.101 |
+ 0.123 |
+ 0.123 |
+ 0.138 |
+ 0.109 |
+ 0.130 |
+ 0.123 |
+ 0.174 |
+ 0.130 |
+ 0.109 |
+ 0.138 |
+ 0.116 |
+ 0.116 |
+ 0.123 |
+ 0.130 |
+ 0.123 |
+ 0.130 |
+ 0.145 |
+ 0.145 |
+ 0.109 |
+ 0.087 |
+ 0.101 |
+ 0.101 |
+ 0.094 |
+ 0.145 |
+ 0.167 |
+ 0.152 |
+ 0.188 |
+ 0.174 |
+ 0.159 |
+ 0.138 |
+ 0.138 |
+ 0.145 |
+ 0.159 |
+ 0.138 |
+ 0.123 |
+ 1380 |
+ Activity |
+ RL40A_YEAST |
+ Medium |
+ Eukaryote |
+
+
+ RNC_ECOLI_Weeks_2023 |
+ 0.147 |
+ 0.196 |
+ 0.117 |
+ 0.121 |
+ 0.129 |
+ 0.131 |
+ 0.077 |
+ 0.143 |
+ 0.138 |
+ 0.133 |
+ 0.147 |
+ 0.136 |
+ 0.154 |
+ 0.103 |
+ 0.147 |
+ 0.136 |
+ 0.152 |
+ 0.171 |
+ 0.173 |
+ 0.131 |
+ 0.189 |
+ 0.199 |
+ 0.161 |
+ 0.182 |
+ 0.182 |
+ 0.152 |
+ 0.168 |
+ 0.152 |
+ 0.182 |
+ 0.143 |
+ 0.173 |
+ 0.173 |
+ 0.119 |
+ 0.161 |
+ 0.173 |
+ 0.150 |
+ 0.150 |
+ 0.166 |
+ 0.168 |
+ 0.140 |
+ 0.136 |
+ 0.150 |
+ 0.173 |
+ 0.093 |
+ 0.138 |
+ 0.131 |
+ 0.152 |
+ 0.168 |
+ 0.131 |
+ 0.121 |
+ 0.143 |
+ 0.161 |
+ 0.152 |
+ 0.152 |
+ 0.154 |
+ 0.164 |
+ 0.147 |
+ 0.147 |
+ 0.164 |
+ 0.147 |
+ 0.166 |
+ 0.129 |
+ 4277 |
+ Activity |
+ RNC_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ RPC1_BP434_Tsuboyama_2023_1R69 |
+ 0.274 |
+ 0.295 |
+ 0.233 |
+ 0.260 |
+ 0.247 |
+ 0.260 |
+ 0.281 |
+ 0.171 |
+ 0.185 |
+ 0.199 |
+ 0.295 |
+ 0.288 |
+ 0.322 |
+ 0.301 |
+ 0.315 |
+ 0.322 |
+ 0.267 |
+ 0.260 |
+ 0.295 |
+ 0.281 |
+ 0.219 |
+ 0.356 |
+ 0.336 |
+ 0.260 |
+ 0.274 |
+ 0.329 |
+ 0.247 |
+ 0.295 |
+ 0.247 |
+ 0.253 |
+ 0.247 |
+ 0.240 |
+ 0.178 |
+ 0.185 |
+ 0.295 |
+ 0.301 |
+ 0.199 |
+ 0.281 |
+ 0.288 |
+ 0.301 |
+ 0.281 |
+ 0.281 |
+ 0.329 |
+ 0.274 |
+ 0.260 |
+ 0.267 |
+ 0.356 |
+ 0.308 |
+ 0.301 |
+ 0.370 |
+ 0.274 |
+ 0.240 |
+ 0.274 |
+ 0.281 |
+ 0.267 |
+ 0.253 |
+ 0.247 |
+ 0.260 |
+ 0.247 |
+ 0.260 |
+ 0.322 |
+ 0.308 |
+ 1459 |
+ Stability |
+ RPC1_BP434 |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_high-expression |
+ 0.139 |
+ 0.083 |
+ 0.111 |
+ 0.111 |
+ 0.111 |
+ 0.111 |
+ 0.167 |
+ 0.028 |
+ 0.139 |
+ 0.139 |
+ 0.083 |
+ 0.139 |
+ 0.139 |
+ 0.194 |
+ 0.167 |
+ 0.194 |
+ 0.167 |
+ 0.083 |
+ 0.139 |
+ 0.194 |
+ 0.139 |
+ 0.139 |
+ 0.222 |
+ 0.139 |
+ 0.194 |
+ 0.139 |
+ 0.139 |
+ 0.167 |
+ 0.111 |
+ 0.194 |
+ 0.028 |
+ 0.000 |
+ 0.083 |
+ 0.139 |
+ 0.139 |
+ 0.111 |
+ 0.139 |
+ 0.139 |
+ 0.167 |
+ 0.139 |
+ 0.111 |
+ 0.111 |
+ 0.167 |
+ 0.222 |
+ 0.139 |
+ 0.139 |
+ 0.194 |
+ 0.250 |
+ 0.139 |
+ 0.167 |
+ 0.139 |
+ 0.111 |
+ 0.167 |
+ 0.139 |
+ 0.139 |
+ 0.167 |
+ 0.139 |
+ 0.167 |
+ 0.139 |
+ 0.167 |
+ 0.278 |
+ 0.250 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RPC1_LAMBD_Li_2019_low-expression |
+ 0.194 |
+ 0.250 |
+ 0.222 |
+ 0.250 |
+ 0.222 |
+ 0.222 |
+ 0.139 |
+ 0.111 |
+ 0.306 |
+ 0.250 |
+ 0.167 |
+ 0.194 |
+ 0.250 |
+ 0.222 |
+ 0.194 |
+ 0.194 |
+ 0.278 |
+ 0.222 |
+ 0.167 |
+ 0.222 |
+ 0.111 |
+ 0.250 |
+ 0.250 |
+ 0.167 |
+ 0.083 |
+ 0.194 |
+ 0.222 |
+ 0.194 |
+ 0.222 |
+ 0.222 |
+ 0.194 |
+ 0.167 |
+ 0.139 |
+ 0.222 |
+ 0.250 |
+ 0.194 |
+ 0.194 |
+ 0.194 |
+ 0.167 |
+ 0.222 |
+ 0.222 |
+ 0.222 |
+ 0.167 |
+ 0.194 |
+ 0.194 |
+ 0.222 |
+ 0.111 |
+ 0.250 |
+ 0.222 |
+ 0.056 |
+ 0.250 |
+ 0.250 |
+ 0.194 |
+ 0.250 |
+ 0.278 |
+ 0.250 |
+ 0.278 |
+ 0.250 |
+ 0.250 |
+ 0.278 |
+ 0.194 |
+ 0.111 |
+ 351 |
+ Activity |
+ RPC1_LAMBD |
+ High |
+ Virus |
+
+
+ RS15_GEOSE_Tsuboyama_2023_1A32 |
+ 0.142 |
+ 0.083 |
+ 0.108 |
+ 0.117 |
+ 0.108 |
+ 0.117 |
+ 0.092 |
+ 0.083 |
+ 0.108 |
+ 0.092 |
+ 0.108 |
+ 0.100 |
+ 0.083 |
+ 0.150 |
+ 0.208 |
+ 0.192 |
+ 0.092 |
+ 0.125 |
+ 0.142 |
+ 0.067 |
+ 0.108 |
+ 0.108 |
+ 0.075 |
+ 0.083 |
+ 0.133 |
+ 0.083 |
+ 0.125 |
+ 0.100 |
+ 0.117 |
+ 0.050 |
+ 0.075 |
+ 0.042 |
+ 0.183 |
+ 0.117 |
+ 0.083 |
+ 0.100 |
+ 0.133 |
+ 0.092 |
+ 0.100 |
+ 0.117 |
+ 0.083 |
+ 0.083 |
+ 0.192 |
+ 0.150 |
+ 0.125 |
+ 0.217 |
+ 0.275 |
+ 0.150 |
+ 0.158 |
+ 0.292 |
+ 0.117 |
+ 0.117 |
+ 0.108 |
+ 0.142 |
+ 0.117 |
+ 0.092 |
+ 0.108 |
+ 0.108 |
+ 0.125 |
+ 0.125 |
+ 0.150 |
+ 0.325 |
+ 1195 |
+ Stability |
+ RS15_GEOSE |
+ Medium |
+ Prokaryote |
+
+
+ S22A1_HUMAN_Yee_2023_abundance |
+ 0.191 |
+ 0.244 |
+ 0.262 |
+ 0.276 |
+ 0.256 |
+ 0.255 |
+ 0.177 |
+ 0.193 |
+ 0.262 |
+ 0.261 |
+ 0.257 |
+ 0.262 |
+ 0.272 |
+ 0.193 |
+ 0.204 |
+ 0.238 |
+ 0.288 |
+ 0.269 |
+ 0.255 |
+ 0.109 |
+ 0.242 |
+ 0.267 |
+ 0.238 |
+ 0.229 |
+ 0.234 |
+ 0.250 |
+ 0.248 |
+ 0.266 |
+ 0.227 |
+ 0.254 |
+ 0.266 |
+ 0.206 |
+ 0.130 |
+ 0.223 |
+ 0.262 |
+ 0.254 |
+ 0.253 |
+ 0.288 |
+ 0.272 |
+ 0.269 |
+ 0.265 |
+ 0.269 |
+ 0.216 |
+ 0.151 |
+ 0.272 |
+ 0.221 |
+ 0.214 |
+ 0.288 |
+ 0.220 |
+ 0.149 |
+ 0.276 |
+ 0.268 |
+ 0.288 |
+ 0.291 |
+ 0.286 |
+ 0.280 |
+ 0.284 |
+ 0.276 |
+ 0.279 |
+ 0.290 |
+ 0.269 |
+ 0.231 |
+ 9803 |
+ Expression |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ S22A1_HUMAN_Yee_2023_activity |
+ 0.167 |
+ 0.190 |
+ 0.214 |
+ 0.213 |
+ 0.210 |
+ 0.216 |
+ 0.133 |
+ 0.170 |
+ 0.203 |
+ 0.211 |
+ 0.197 |
+ 0.215 |
+ 0.198 |
+ 0.168 |
+ 0.152 |
+ 0.171 |
+ 0.204 |
+ 0.208 |
+ 0.180 |
+ 0.094 |
+ 0.192 |
+ 0.199 |
+ 0.191 |
+ 0.176 |
+ 0.187 |
+ 0.193 |
+ 0.185 |
+ 0.201 |
+ 0.166 |
+ 0.182 |
+ 0.191 |
+ 0.159 |
+ 0.140 |
+ 0.168 |
+ 0.205 |
+ 0.205 |
+ 0.171 |
+ 0.212 |
+ 0.215 |
+ 0.217 |
+ 0.212 |
+ 0.218 |
+ 0.164 |
+ 0.127 |
+ 0.207 |
+ 0.157 |
+ 0.196 |
+ 0.232 |
+ 0.193 |
+ 0.146 |
+ 0.206 |
+ 0.195 |
+ 0.214 |
+ 0.216 |
+ 0.212 |
+ 0.208 |
+ 0.207 |
+ 0.217 |
+ 0.201 |
+ 0.211 |
+ 0.206 |
+ 0.184 |
+ 10094 |
+ Activity |
+ S22A1_HUMAN |
+ Medium |
+ Human |
+
+
+ SAV1_MOUSE_Tsuboyama_2023_2YSB |
+ 0.454 |
+ 0.381 |
+ 0.423 |
+ 0.433 |
+ 0.464 |
+ 0.474 |
+ 0.309 |
+ 0.103 |
+ 0.454 |
+ 0.474 |
+ 0.258 |
+ 0.268 |
+ 0.258 |
+ 0.196 |
+ 0.423 |
+ 0.412 |
+ 0.258 |
+ 0.278 |
+ 0.175 |
+ 0.206 |
+ 0.103 |
+ 0.103 |
+ 0.072 |
+ 0.093 |
+ 0.268 |
+ 0.134 |
+ 0.103 |
+ 0.093 |
+ 0.103 |
+ 0.062 |
+ 0.216 |
+ 0.206 |
+ 0.144 |
+ 0.216 |
+ 0.196 |
+ 0.165 |
+ 0.402 |
+ 0.412 |
+ 0.392 |
+ 0.433 |
+ 0.443 |
+ 0.433 |
+ 0.361 |
+ 0.216 |
+ 0.340 |
+ 0.320 |
+ 0.381 |
+ 0.371 |
+ 0.392 |
+ 0.412 |
+ 0.351 |
+ 0.289 |
+ 0.361 |
+ 0.268 |
+ 0.371 |
+ 0.361 |
+ 0.309 |
+ 0.351 |
+ 0.330 |
+ 0.351 |
+ 0.227 |
+ 0.412 |
+ 965 |
+ Stability |
+ SAV1_MOUSE |
+ High |
+ Eukaryote |
+
+
+ SBI_STAAM_Tsuboyama_2023_2JVG |
+ 0.204 |
+ 0.194 |
+ 0.223 |
+ 0.214 |
+ 0.233 |
+ 0.214 |
+ 0.194 |
+ 0.107 |
+ 0.214 |
+ 0.282 |
+ 0.262 |
+ 0.184 |
+ 0.204 |
+ 0.155 |
+ 0.175 |
+ 0.223 |
+ 0.320 |
+ 0.272 |
+ 0.223 |
+ 0.126 |
+ 0.194 |
+ 0.204 |
+ 0.184 |
+ 0.165 |
+ 0.175 |
+ 0.184 |
+ 0.204 |
+ 0.165 |
+ 0.184 |
+ 0.165 |
+ 0.291 |
+ 0.184 |
+ 0.117 |
+ 0.146 |
+ 0.146 |
+ 0.126 |
+ 0.175 |
+ 0.194 |
+ 0.204 |
+ 0.214 |
+ 0.223 |
+ 0.223 |
+ 0.184 |
+ 0.223 |
+ 0.214 |
+ 0.184 |
+ 0.311 |
+ 0.330 |
+ 0.311 |
+ 0.340 |
+ 0.291 |
+ 0.311 |
+ 0.340 |
+ 0.320 |
+ 0.311 |
+ 0.282 |
+ 0.272 |
+ 0.359 |
+ 0.359 |
+ 0.340 |
+ 0.301 |
+ 0.272 |
+ 1025 |
+ Stability |
+ SBI_STAAM |
+ Medium |
+ Prokaryote |
+
+
+ SC6A4_HUMAN_Young_2021 |
+ 0.164 |
+ 0.220 |
+ 0.201 |
+ 0.202 |
+ 0.218 |
+ 0.204 |
+ 0.159 |
+ 0.298 |
+ 0.217 |
+ 0.238 |
+ 0.242 |
+ 0.212 |
+ 0.222 |
+ 0.106 |
+ 0.135 |
+ 0.177 |
+ 0.196 |
+ 0.213 |
+ 0.217 |
+ 0.262 |
+ 0.244 |
+ 0.306 |
+ 0.304 |
+ 0.286 |
+ 0.251 |
+ 0.294 |
+ 0.285 |
+ 0.273 |
+ 0.269 |
+ 0.226 |
+ 0.195 |
+ 0.139 |
+ 0.187 |
+ 0.257 |
+ 0.294 |
+ 0.289 |
+ 0.215 |
+ 0.241 |
+ 0.251 |
+ 0.209 |
+ 0.221 |
+ 0.217 |
+ 0.205 |
+ 0.096 |
+ 0.247 |
+ 0.213 |
+ 0.232 |
+ 0.266 |
+ 0.219 |
+ 0.142 |
+ 0.193 |
+ 0.206 |
+ 0.221 |
+ 0.219 |
+ 0.211 |
+ 0.215 |
+ 0.210 |
+ 0.199 |
+ 0.187 |
+ 0.201 |
+ 0.221 |
+ 0.205 |
+ 11576 |
+ Activity |
+ SC6A4_HUMAN |
+ Medium |
+ Human |
+
+
+ SCIN_STAAR_Tsuboyama_2023_2QFF |
+ 0.156 |
+ 0.139 |
+ 0.123 |
+ 0.139 |
+ 0.131 |
+ 0.139 |
+ 0.238 |
+ 0.074 |
+ 0.230 |
+ 0.189 |
+ 0.246 |
+ 0.246 |
+ 0.254 |
+ 0.172 |
+ 0.246 |
+ 0.279 |
+ 0.328 |
+ 0.262 |
+ 0.303 |
+ 0.098 |
+ 0.164 |
+ 0.189 |
+ 0.197 |
+ 0.230 |
+ 0.270 |
+ 0.254 |
+ 0.279 |
+ 0.254 |
+ 0.238 |
+ 0.131 |
+ 0.098 |
+ 0.082 |
+ 0.148 |
+ 0.115 |
+ 0.123 |
+ 0.213 |
+ 0.164 |
+ 0.172 |
+ 0.172 |
+ 0.123 |
+ 0.148 |
+ 0.156 |
+ 0.254 |
+ 0.238 |
+ 0.262 |
+ 0.270 |
+ 0.287 |
+ 0.303 |
+ 0.221 |
+ 0.197 |
+ 0.328 |
+ 0.336 |
+ 0.336 |
+ 0.352 |
+ 0.320 |
+ 0.402 |
+ 0.311 |
+ 0.385 |
+ 0.270 |
+ 0.352 |
+ 0.369 |
+ 0.393 |
+ 1212 |
+ Stability |
+ SCIN_STAAR |
+ High |
+ Prokaryote |
+
+
+ SCN5A_HUMAN_Glazer_2019 |
+ 0.130 |
+ 0.087 |
+ 0.130 |
+ 0.130 |
+ 0.130 |
+ 0.130 |
+ 0.217 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.130 |
+ 0.130 |
+ 0.174 |
+ 0.174 |
+ 0.130 |
+ 0.087 |
+ 0.130 |
+ 0.130 |
+ 0.174 |
+ 0.174 |
+ 0.130 |
+ 0.174 |
+ 0.130 |
+ 0.130 |
+ 0.130 |
+ 0.130 |
+ 0.130 |
+ 0.087 |
+ 0.130 |
+ 0.130 |
+ 0.087 |
+ 0.174 |
+ 0.174 |
+ 0.087 |
+ 0.130 |
+ 0.174 |
+ 0.043 |
+ 0.130 |
+ 0.174 |
+ 0.087 |
+ 0.130 |
+ 0.130 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.087 |
+ 0.043 |
+ 0.043 |
+ 0.130 |
+ 0.043 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.174 |
+ 0.043 |
+ 0.130 |
+ 224 |
+ OrganismalFitness |
+ SCN5A_HUMAN |
+ Medium |
+ Human |
+
+
+ SDA_BACSU_Tsuboyama_2023_1PV0 |
+ 0.531 |
+ 0.520 |
+ 0.574 |
+ 0.563 |
+ 0.588 |
+ 0.596 |
+ 0.130 |
+ 0.458 |
+ 0.552 |
+ 0.578 |
+ 0.538 |
+ 0.495 |
+ 0.516 |
+ 0.209 |
+ 0.264 |
+ 0.523 |
+ 0.538 |
+ 0.567 |
+ 0.545 |
+ 0.531 |
+ 0.141 |
+ 0.181 |
+ 0.191 |
+ 0.274 |
+ 0.191 |
+ 0.466 |
+ 0.101 |
+ 0.347 |
+ 0.520 |
+ 0.592 |
+ 0.509 |
+ 0.469 |
+ 0.014 |
+ 0.116 |
+ 0.264 |
+ 0.462 |
+ 0.542 |
+ 0.542 |
+ 0.523 |
+ 0.578 |
+ 0.585 |
+ 0.574 |
+ 0.224 |
+ 0.184 |
+ 0.520 |
+ 0.264 |
+ 0.534 |
+ 0.552 |
+ 0.592 |
+ 0.563 |
+ 0.563 |
+ 0.567 |
+ 0.570 |
+ 0.585 |
+ 0.570 |
+ 0.570 |
+ 0.592 |
+ 0.578 |
+ 0.596 |
+ 0.592 |
+ 0.592 |
+ 0.585 |
+ 2770 |
+ Stability |
+ SDA_BACSU |
+ Medium |
+ Prokaryote |
+
+
+ SERC_HUMAN_Xie_2023 |
+ 0.130 |
+ 0.156 |
+ 0.135 |
+ 0.146 |
+ 0.141 |
+ 0.156 |
+ 0.089 |
+ 0.146 |
+ 0.135 |
+ 0.130 |
+ 0.135 |
+ 0.156 |
+ 0.120 |
+ 0.073 |
+ 0.120 |
+ 0.141 |
+ 0.120 |
+ 0.146 |
+ 0.125 |
+ 0.135 |
+ 0.130 |
+ 0.130 |
+ 0.151 |
+ 0.141 |
+ 0.109 |
+ 0.109 |
+ 0.130 |
+ 0.130 |
+ 0.156 |
+ 0.151 |
+ 0.130 |
+ 0.161 |
+ 0.115 |
+ 0.135 |
+ 0.167 |
+ 0.156 |
+ 0.146 |
+ 0.135 |
+ 0.151 |
+ 0.146 |
+ 0.141 |
+ 0.141 |
+ 0.125 |
+ 0.073 |
+ 0.125 |
+ 0.120 |
+ 0.135 |
+ 0.125 |
+ 0.141 |
+ 0.141 |
+ 0.141 |
+ 0.125 |
+ 0.130 |
+ 0.135 |
+ 0.125 |
+ 0.109 |
+ 0.120 |
+ 0.141 |
+ 0.120 |
+ 0.125 |
+ 0.151 |
+ 0.135 |
+ 1914 |
+ OrganismalFitness |
+ SERC_HUMAN |
+ High |
+ Human |
+
+
+ SHOC2_HUMAN_Kwon_2022 |
+ 0.140 |
+ 0.148 |
+ 0.156 |
+ 0.156 |
+ 0.156 |
+ 0.154 |
+ 0.123 |
+ 0.143 |
+ 0.160 |
+ 0.150 |
+ 0.139 |
+ 0.145 |
+ 0.147 |
+ 0.124 |
+ 0.120 |
+ 0.117 |
+ 0.138 |
+ 0.127 |
+ 0.117 |
+ 0.144 |
+ 0.124 |
+ 0.125 |
+ 0.138 |
+ 0.123 |
+ 0.109 |
+ 0.127 |
+ 0.134 |
+ 0.141 |
+ 0.137 |
+ 0.143 |
+ 0.150 |
+ 0.153 |
+ 0.111 |
+ 0.120 |
+ 0.142 |
+ 0.144 |
+ 0.132 |
+ 0.148 |
+ 0.136 |
+ 0.149 |
+ 0.153 |
+ 0.157 |
+ 0.127 |
+ 0.126 |
+ 0.138 |
+ 0.119 |
+ 0.139 |
+ 0.139 |
+ 0.119 |
+ 0.119 |
+ 0.143 |
+ 0.151 |
+ 0.166 |
+ 0.146 |
+ 0.152 |
+ 0.158 |
+ 0.154 |
+ 0.142 |
+ 0.138 |
+ 0.148 |
+ 0.124 |
+ 0.123 |
+ 10972 |
+ OrganismalFitness |
+ SHOC2_HUMAN |
+ Medium |
+ Human |
+
+
+ SOX30_HUMAN_Tsuboyama_2023_7JJK |
+ 0.168 |
+ 0.149 |
+ 0.089 |
+ 0.109 |
+ 0.119 |
+ 0.129 |
+ 0.139 |
+ 0.089 |
+ 0.089 |
+ 0.139 |
+ 0.139 |
+ 0.158 |
+ 0.129 |
+ 0.178 |
+ 0.149 |
+ 0.208 |
+ 0.188 |
+ 0.178 |
+ 0.109 |
+ 0.099 |
+ 0.089 |
+ 0.050 |
+ 0.099 |
+ 0.069 |
+ 0.099 |
+ 0.089 |
+ 0.079 |
+ 0.099 |
+ 0.119 |
+ 0.099 |
+ 0.099 |
+ 0.089 |
+ 0.099 |
+ 0.099 |
+ 0.050 |
+ 0.079 |
+ 0.149 |
+ 0.099 |
+ 0.099 |
+ 0.149 |
+ 0.119 |
+ 0.109 |
+ 0.149 |
+ 0.099 |
+ 0.109 |
+ 0.178 |
+ 0.139 |
+ 0.139 |
+ 0.139 |
+ 0.178 |
+ 0.178 |
+ 0.218 |
+ 0.228 |
+ 0.208 |
+ 0.198 |
+ 0.208 |
+ 0.178 |
+ 0.188 |
+ 0.178 |
+ 0.218 |
+ 0.178 |
+ 0.158 |
+ 1010 |
+ Stability |
+ SOX30_HUMAN |
+ High |
+ Human |
+
+
+ SPA_STAAU_Tsuboyama_2023_1LP1 |
+ 0.327 |
+ 0.384 |
+ 0.346 |
+ 0.346 |
+ 0.360 |
+ 0.351 |
+ 0.085 |
+ 0.275 |
+ 0.251 |
+ 0.246 |
+ 0.289 |
+ 0.095 |
+ 0.100 |
+ 0.076 |
+ 0.104 |
+ 0.071 |
+ 0.152 |
+ 0.171 |
+ 0.142 |
+ 0.327 |
+ 0.028 |
+ 0.118 |
+ 0.308 |
+ 0.156 |
+ 0.081 |
+ 0.057 |
+ 0.109 |
+ 0.303 |
+ 0.322 |
+ 0.218 |
+ 0.237 |
+ 0.204 |
+ 0.043 |
+ 0.090 |
+ 0.085 |
+ 0.123 |
+ 0.332 |
+ 0.336 |
+ 0.332 |
+ 0.370 |
+ 0.379 |
+ 0.379 |
+ 0.085 |
+ 0.100 |
+ 0.185 |
+ 0.166 |
+ 0.280 |
+ 0.265 |
+ 0.370 |
+ 0.355 |
+ 0.341 |
+ 0.384 |
+ 0.389 |
+ 0.417 |
+ 0.351 |
+ 0.379 |
+ 0.360 |
+ 0.398 |
+ 0.351 |
+ 0.393 |
+ 0.370 |
+ 0.318 |
+ 2105 |
+ Stability |
+ SPA_STAAU |
+ Medium |
+ Prokaryote |
+
+
+ SPG1_STRSG_Olson_2014 |
+ 0.208 |
+ 0.214 |
+ 0.094 |
+ 0.098 |
+ 0.204 |
+ 0.206 |
+ 0.089 |
+ 0.081 |
+ 0.108 |
+ 0.191 |
+ 0.221 |
+ 0.200 |
+ 0.199 |
+ 0.237 |
+ 0.212 |
+ 0.204 |
+ 0.205 |
+ 0.203 |
+ 0.217 |
+ 0.170 |
+ 0.221 |
+ 0.206 |
+ 0.204 |
+ 0.213 |
+ 0.212 |
+ 0.205 |
+ 0.215 |
+ 0.214 |
+ 0.220 |
+ 0.219 |
+ 0.239 |
+ 0.239 |
+ 0.141 |
+ 0.193 |
+ 0.187 |
+ 0.186 |
+ 0.203 |
+ 0.198 |
+ 0.195 |
+ 0.204 |
+ 0.199 |
+ 0.201 |
+ 0.073 |
+ 0.087 |
+ 0.101 |
+ 0.089 |
+ 0.201 |
+ 0.189 |
+ 0.216 |
+ 0.155 |
+ 0.247 |
+ 0.199 |
+ 0.218 |
+ 0.215 |
+ 0.213 |
+ 0.209 |
+ 0.226 |
+ 0.225 |
+ 0.224 |
+ 0.227 |
+ 0.202 |
+ 0.209 |
+ 536962 |
+ Binding |
+ SPG1_STRSG |
+ Low |
+ Prokaryote |
+
+
+ SPG1_STRSG_Wu_2016 |
+ 0.135 |
+ 0.197 |
+ 0.187 |
+ 0.206 |
+ 0.199 |
+ 0.202 |
+ 0.174 |
+ 0.168 |
+ 0.261 |
+ 0.246 |
+ 0.279 |
+ 0.283 |
+ 0.267 |
+ 0.259 |
+ 0.272 |
+ 0.283 |
+ 0.306 |
+ 0.311 |
+ 0.320 |
+ 0.274 |
+ 0.222 |
+ 0.212 |
+ 0.222 |
+ 0.231 |
+ 0.202 |
+ 0.241 |
+ 0.240 |
+ 0.229 |
+ 0.229 |
+ 0.195 |
+ 0.333 |
+ 0.323 |
+ 0.137 |
+ 0.232 |
+ 0.200 |
+ 0.243 |
+ 0.185 |
+ 0.178 |
+ 0.208 |
+ 0.216 |
+ 0.214 |
+ 0.228 |
+ 0.186 |
+ 0.158 |
+ 0.221 |
+ 0.165 |
+ 0.248 |
+ 0.239 |
+ 0.334 |
+ 0.227 |
+ 0.338 |
+ 0.340 |
+ 0.344 |
+ 0.348 |
+ 0.340 |
+ 0.338 |
+ 0.361 |
+ 0.349 |
+ 0.354 |
+ 0.347 |
+ 0.340 |
+ 0.305 |
+ 149360 |
+ Binding |
+ SPG1_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPG2_STRSG_Tsuboyama_2023_5UBS |
+ 0.151 |
+ 0.096 |
+ 0.130 |
+ 0.110 |
+ 0.144 |
+ 0.130 |
+ 0.219 |
+ 0.151 |
+ 0.137 |
+ 0.205 |
+ 0.233 |
+ 0.240 |
+ 0.205 |
+ 0.171 |
+ 0.219 |
+ 0.199 |
+ 0.226 |
+ 0.274 |
+ 0.233 |
+ 0.192 |
+ 0.185 |
+ 0.253 |
+ 0.199 |
+ 0.144 |
+ 0.123 |
+ 0.205 |
+ 0.178 |
+ 0.260 |
+ 0.082 |
+ 0.212 |
+ 0.144 |
+ 0.103 |
+ 0.021 |
+ 0.164 |
+ 0.301 |
+ 0.137 |
+ 0.212 |
+ 0.233 |
+ 0.171 |
+ 0.164 |
+ 0.158 |
+ 0.158 |
+ 0.205 |
+ 0.158 |
+ 0.199 |
+ 0.178 |
+ 0.205 |
+ 0.212 |
+ 0.212 |
+ 0.185 |
+ 0.185 |
+ 0.192 |
+ 0.260 |
+ 0.212 |
+ 0.219 |
+ 0.233 |
+ 0.219 |
+ 0.192 |
+ 0.185 |
+ 0.212 |
+ 0.288 |
+ 0.205 |
+ 1451 |
+ Stability |
+ SPG2_STRSG |
+ Medium |
+ Prokaryote |
+
+
+ SPIKE_SARS2_Starr_2020_binding |
+ 0.155 |
+ 0.227 |
+ 0.181 |
+ 0.196 |
+ 0.208 |
+ 0.213 |
+ 0.099 |
+ 0.227 |
+ 0.229 |
+ 0.232 |
+ 0.104 |
+ 0.111 |
+ 0.114 |
+ 0.109 |
+ 0.126 |
+ 0.116 |
+ 0.111 |
+ 0.111 |
+ 0.097 |
+ 0.229 |
+ 0.205 |
+ 0.220 |
+ 0.222 |
+ 0.210 |
+ 0.210 |
+ 0.215 |
+ 0.200 |
+ 0.215 |
+ 0.191 |
+ 0.227 |
+ 0.213 |
+ 0.213 |
+ 0.159 |
+ 0.208 |
+ 0.220 |
+ 0.227 |
+ 0.220 |
+ 0.217 |
+ 0.222 |
+ 0.232 |
+ 0.234 |
+ 0.232 |
+ 0.101 |
+ 0.106 |
+ 0.109 |
+ 0.116 |
+ 0.290 |
+ 0.227 |
+ 0.196 |
+ 0.159 |
+ 0.155 |
+ 0.200 |
+ 0.217 |
+ 0.203 |
+ 0.198 |
+ 0.208 |
+ 0.184 |
+ 0.176 |
+ 0.167 |
+ 0.200 |
+ 0.215 |
+ 0.150 |
+ 3802 |
+ Binding |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPIKE_SARS2_Starr_2020_expression |
+ 0.141 |
+ 0.234 |
+ 0.139 |
+ 0.193 |
+ 0.219 |
+ 0.224 |
+ 0.098 |
+ 0.195 |
+ 0.216 |
+ 0.244 |
+ 0.095 |
+ 0.108 |
+ 0.108 |
+ 0.103 |
+ 0.134 |
+ 0.108 |
+ 0.111 |
+ 0.080 |
+ 0.072 |
+ 0.229 |
+ 0.167 |
+ 0.180 |
+ 0.162 |
+ 0.162 |
+ 0.198 |
+ 0.172 |
+ 0.159 |
+ 0.180 |
+ 0.167 |
+ 0.237 |
+ 0.201 |
+ 0.201 |
+ 0.129 |
+ 0.172 |
+ 0.175 |
+ 0.211 |
+ 0.165 |
+ 0.185 |
+ 0.203 |
+ 0.211 |
+ 0.226 |
+ 0.226 |
+ 0.103 |
+ 0.090 |
+ 0.129 |
+ 0.118 |
+ 0.357 |
+ 0.365 |
+ 0.326 |
+ 0.165 |
+ 0.237 |
+ 0.278 |
+ 0.283 |
+ 0.270 |
+ 0.303 |
+ 0.275 |
+ 0.283 |
+ 0.254 |
+ 0.283 |
+ 0.293 |
+ 0.332 |
+ 0.208 |
+ 3798 |
+ Expression |
+ SPIKE_SARS2 |
+ Medium |
+ Virus |
+
+
+ SPTN1_CHICK_Tsuboyama_2023_1TUD |
+ 0.399 |
+ 0.486 |
+ 0.396 |
+ 0.374 |
+ 0.433 |
+ 0.396 |
+ 0.231 |
+ 0.318 |
+ 0.371 |
+ 0.368 |
+ 0.495 |
+ 0.386 |
+ 0.449 |
+ 0.065 |
+ 0.417 |
+ 0.436 |
+ 0.386 |
+ 0.517 |
+ 0.399 |
+ 0.427 |
+ 0.380 |
+ 0.452 |
+ 0.380 |
+ 0.393 |
+ 0.399 |
+ 0.467 |
+ 0.421 |
+ 0.449 |
+ 0.470 |
+ 0.427 |
+ 0.470 |
+ 0.486 |
+ 0.100 |
+ 0.374 |
+ 0.330 |
+ 0.346 |
+ 0.411 |
+ 0.433 |
+ 0.396 |
+ 0.396 |
+ 0.396 |
+ 0.399 |
+ 0.283 |
+ 0.125 |
+ 0.346 |
+ 0.312 |
+ 0.374 |
+ 0.383 |
+ 0.567 |
+ 0.558 |
+ 0.558 |
+ 0.548 |
+ 0.558 |
+ 0.539 |
+ 0.558 |
+ 0.548 |
+ 0.551 |
+ 0.567 |
+ 0.536 |
+ 0.567 |
+ 0.533 |
+ 0.349 |
+ 3201 |
+ Stability |
+ SPTN1_CHICK |
+ High |
+ Eukaryote |
+
+
+ SQSTM_MOUSE_Tsuboyama_2023_2RRU |
+ 0.296 |
+ 0.324 |
+ 0.423 |
+ 0.437 |
+ 0.437 |
+ 0.423 |
+ 0.042 |
+ 0.239 |
+ 0.324 |
+ 0.380 |
+ 0.310 |
+ 0.155 |
+ 0.296 |
+ 0.155 |
+ 0.254 |
+ 0.282 |
+ 0.352 |
+ 0.324 |
+ 0.352 |
+ 0.338 |
+ 0.141 |
+ 0.296 |
+ 0.211 |
+ 0.296 |
+ 0.324 |
+ 0.310 |
+ 0.268 |
+ 0.268 |
+ 0.310 |
+ 0.268 |
+ 0.239 |
+ 0.254 |
+ 0.169 |
+ 0.183 |
+ 0.254 |
+ 0.324 |
+ 0.338 |
+ 0.423 |
+ 0.394 |
+ 0.451 |
+ 0.423 |
+ 0.408 |
+ 0.183 |
+ 0.169 |
+ 0.282 |
+ 0.380 |
+ 0.408 |
+ 0.282 |
+ 0.366 |
+ 0.408 |
+ 0.366 |
+ 0.366 |
+ 0.352 |
+ 0.338 |
+ 0.296 |
+ 0.380 |
+ 0.324 |
+ 0.296 |
+ 0.296 |
+ 0.352 |
+ 0.296 |
+ 0.324 |
+ 707 |
+ Stability |
+ SQSTM_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ SR43C_ARATH_Tsuboyama_2023_2N88 |
+ 0.308 |
+ 0.308 |
+ 0.302 |
+ 0.283 |
+ 0.296 |
+ 0.302 |
+ 0.057 |
+ 0.157 |
+ 0.252 |
+ 0.264 |
+ 0.377 |
+ 0.327 |
+ 0.340 |
+ 0.038 |
+ 0.346 |
+ 0.371 |
+ 0.384 |
+ 0.409 |
+ 0.409 |
+ 0.308 |
+ 0.044 |
+ 0.189 |
+ 0.220 |
+ 0.170 |
+ 0.245 |
+ 0.233 |
+ 0.277 |
+ 0.296 |
+ 0.302 |
+ 0.189 |
+ 0.327 |
+ 0.264 |
+ 0.126 |
+ 0.189 |
+ 0.132 |
+ 0.208 |
+ 0.352 |
+ 0.289 |
+ 0.296 |
+ 0.321 |
+ 0.314 |
+ 0.296 |
+ 0.277 |
+ 0.057 |
+ 0.358 |
+ 0.333 |
+ 0.233 |
+ 0.384 |
+ 0.428 |
+ 0.453 |
+ 0.346 |
+ 0.340 |
+ 0.358 |
+ 0.390 |
+ 0.352 |
+ 0.327 |
+ 0.365 |
+ 0.390 |
+ 0.365 |
+ 0.358 |
+ 0.440 |
+ 0.365 |
+ 1583 |
+ Stability |
+ SR43C_ARATH |
+ High |
+ Eukaryote |
+
+
+ SRBS1_HUMAN_Tsuboyama_2023_2O2W |
+ 0.231 |
+ 0.179 |
+ 0.378 |
+ 0.333 |
+ 0.256 |
+ 0.301 |
+ 0.218 |
+ 0.237 |
+ 0.276 |
+ 0.314 |
+ 0.308 |
+ 0.288 |
+ 0.301 |
+ 0.179 |
+ 0.333 |
+ 0.301 |
+ 0.263 |
+ 0.288 |
+ 0.333 |
+ 0.199 |
+ 0.269 |
+ 0.244 |
+ 0.269 |
+ 0.205 |
+ 0.231 |
+ 0.237 |
+ 0.263 |
+ 0.192 |
+ 0.212 |
+ 0.301 |
+ 0.186 |
+ 0.167 |
+ 0.212 |
+ 0.237 |
+ 0.231 |
+ 0.269 |
+ 0.256 |
+ 0.256 |
+ 0.276 |
+ 0.308 |
+ 0.321 |
+ 0.295 |
+ 0.231 |
+ 0.083 |
+ 0.269 |
+ 0.288 |
+ 0.301 |
+ 0.218 |
+ 0.378 |
+ 0.353 |
+ 0.327 |
+ 0.327 |
+ 0.314 |
+ 0.308 |
+ 0.314 |
+ 0.333 |
+ 0.321 |
+ 0.327 |
+ 0.295 |
+ 0.333 |
+ 0.237 |
+ 0.327 |
+ 1556 |
+ Stability |
+ SRBS1_HUMAN |
+ High |
+ Human |
+
+
+ SRC_HUMAN_Ahler_2019 |
+ 0.121 |
+ 0.112 |
+ 0.086 |
+ 0.101 |
+ 0.089 |
+ 0.089 |
+ 0.130 |
+ 0.074 |
+ 0.101 |
+ 0.098 |
+ 0.083 |
+ 0.101 |
+ 0.092 |
+ 0.151 |
+ 0.183 |
+ 0.130 |
+ 0.115 |
+ 0.124 |
+ 0.089 |
+ 0.071 |
+ 0.083 |
+ 0.056 |
+ 0.074 |
+ 0.062 |
+ 0.077 |
+ 0.080 |
+ 0.077 |
+ 0.071 |
+ 0.065 |
+ 0.115 |
+ 0.074 |
+ 0.092 |
+ 0.121 |
+ 0.080 |
+ 0.071 |
+ 0.083 |
+ 0.086 |
+ 0.083 |
+ 0.109 |
+ 0.086 |
+ 0.089 |
+ 0.089 |
+ 0.148 |
+ 0.157 |
+ 0.068 |
+ 0.109 |
+ 0.098 |
+ 0.083 |
+ 0.080 |
+ 0.095 |
+ 0.133 |
+ 0.148 |
+ 0.112 |
+ 0.133 |
+ 0.139 |
+ 0.104 |
+ 0.107 |
+ 0.133 |
+ 0.130 |
+ 0.124 |
+ 0.086 |
+ 0.163 |
+ 3372 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM |
+ 0.110 |
+ 0.091 |
+ 0.096 |
+ 0.102 |
+ 0.102 |
+ 0.082 |
+ 0.118 |
+ 0.069 |
+ 0.055 |
+ 0.077 |
+ 0.080 |
+ 0.091 |
+ 0.091 |
+ 0.137 |
+ 0.132 |
+ 0.104 |
+ 0.088 |
+ 0.099 |
+ 0.091 |
+ 0.060 |
+ 0.052 |
+ 0.038 |
+ 0.074 |
+ 0.049 |
+ 0.058 |
+ 0.060 |
+ 0.077 |
+ 0.049 |
+ 0.063 |
+ 0.124 |
+ 0.066 |
+ 0.085 |
+ 0.121 |
+ 0.066 |
+ 0.058 |
+ 0.060 |
+ 0.074 |
+ 0.074 |
+ 0.077 |
+ 0.082 |
+ 0.069 |
+ 0.077 |
+ 0.107 |
+ 0.118 |
+ 0.052 |
+ 0.093 |
+ 0.096 |
+ 0.074 |
+ 0.066 |
+ 0.088 |
+ 0.107 |
+ 0.115 |
+ 0.093 |
+ 0.113 |
+ 0.121 |
+ 0.091 |
+ 0.099 |
+ 0.124 |
+ 0.104 |
+ 0.107 |
+ 0.077 |
+ 0.137 |
+ 3637 |
+ Activity |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SRC_HUMAN_Nguyen_2022 |
+ 0.107 |
+ 0.083 |
+ 0.095 |
+ 0.080 |
+ 0.065 |
+ 0.065 |
+ 0.092 |
+ 0.080 |
+ 0.089 |
+ 0.077 |
+ 0.074 |
+ 0.086 |
+ 0.086 |
+ 0.131 |
+ 0.131 |
+ 0.104 |
+ 0.101 |
+ 0.107 |
+ 0.086 |
+ 0.080 |
+ 0.104 |
+ 0.083 |
+ 0.101 |
+ 0.074 |
+ 0.092 |
+ 0.086 |
+ 0.089 |
+ 0.077 |
+ 0.065 |
+ 0.107 |
+ 0.071 |
+ 0.098 |
+ 0.101 |
+ 0.089 |
+ 0.077 |
+ 0.083 |
+ 0.083 |
+ 0.086 |
+ 0.089 |
+ 0.083 |
+ 0.071 |
+ 0.077 |
+ 0.122 |
+ 0.119 |
+ 0.080 |
+ 0.089 |
+ 0.110 |
+ 0.083 |
+ 0.077 |
+ 0.092 |
+ 0.101 |
+ 0.116 |
+ 0.095 |
+ 0.095 |
+ 0.104 |
+ 0.086 |
+ 0.080 |
+ 0.101 |
+ 0.095 |
+ 0.098 |
+ 0.092 |
+ 0.131 |
+ 3366 |
+ OrganismalFitness |
+ SRC_HUMAN |
+ Medium |
+ Human |
+
+
+ SUMO1_HUMAN_Weile_2017 |
+ 0.118 |
+ 0.141 |
+ 0.124 |
+ 0.141 |
+ 0.112 |
+ 0.118 |
+ 0.082 |
+ 0.165 |
+ 0.165 |
+ 0.147 |
+ 0.229 |
+ 0.188 |
+ 0.206 |
+ 0.071 |
+ 0.141 |
+ 0.153 |
+ 0.182 |
+ 0.153 |
+ 0.159 |
+ 0.176 |
+ 0.118 |
+ 0.159 |
+ 0.153 |
+ 0.176 |
+ 0.182 |
+ 0.176 |
+ 0.147 |
+ 0.200 |
+ 0.188 |
+ 0.329 |
+ 0.194 |
+ 0.200 |
+ 0.088 |
+ 0.088 |
+ 0.176 |
+ 0.176 |
+ 0.124 |
+ 0.135 |
+ 0.188 |
+ 0.129 |
+ 0.129 |
+ 0.141 |
+ 0.124 |
+ 0.082 |
+ 0.171 |
+ 0.129 |
+ 0.129 |
+ 0.182 |
+ 0.188 |
+ 0.141 |
+ 0.153 |
+ 0.153 |
+ 0.159 |
+ 0.165 |
+ 0.171 |
+ 0.165 |
+ 0.165 |
+ 0.176 |
+ 0.176 |
+ 0.176 |
+ 0.194 |
+ 0.106 |
+ 1700 |
+ OrganismalFitness |
+ SUMO1_HUMAN |
+ High |
+ Human |
+
+
+ SYUA_HUMAN_Newberry_2020 |
+ 0.064 |
+ 0.032 |
+ 0.036 |
+ 0.024 |
+ 0.036 |
+ 0.024 |
+ 0.080 |
+ 0.044 |
+ 0.052 |
+ 0.036 |
+ 0.028 |
+ 0.040 |
+ 0.024 |
+ 0.036 |
+ 0.060 |
+ 0.044 |
+ 0.032 |
+ 0.032 |
+ 0.032 |
+ 0.040 |
+ 0.056 |
+ 0.044 |
+ 0.020 |
+ 0.040 |
+ 0.048 |
+ 0.028 |
+ 0.032 |
+ 0.012 |
+ 0.036 |
+ 0.060 |
+ 0.044 |
+ 0.060 |
+ 0.032 |
+ 0.028 |
+ 0.036 |
+ 0.036 |
+ 0.020 |
+ 0.032 |
+ 0.024 |
+ 0.028 |
+ 0.028 |
+ 0.020 |
+ 0.052 |
+ 0.064 |
+ 0.024 |
+ 0.044 |
+ 0.064 |
+ 0.024 |
+ 0.040 |
+ 0.076 |
+ 0.036 |
+ 0.024 |
+ 0.032 |
+ 0.016 |
+ 0.036 |
+ 0.040 |
+ 0.048 |
+ 0.036 |
+ 0.044 |
+ 0.028 |
+ 0.052 |
+ 0.048 |
+ 2497 |
+ OrganismalFitness |
+ SYUA_HUMAN |
+ Medium |
+ Human |
+
+
+ TADBP_HUMAN_Bolognesi_2019 |
+ 0.008 |
+ 0.017 |
+ 0.008 |
+ 0.008 |
+ 0.008 |
+ 0.008 |
+ 0.267 |
+ 0.000 |
+ 0.008 |
+ 0.008 |
+ 0.067 |
+ 0.092 |
+ 0.042 |
+ 0.125 |
+ 0.133 |
+ 0.100 |
+ 0.050 |
+ 0.017 |
+ 0.008 |
+ 0.008 |
+ 0.158 |
+ 0.017 |
+ 0.008 |
+ 0.008 |
+ 0.217 |
+ 0.033 |
+ 0.000 |
+ 0.008 |
+ 0.008 |
+ 0.008 |
+ 0.000 |
+ 0.000 |
+ 0.017 |
+ 0.200 |
+ 0.267 |
+ 0.000 |
+ 0.050 |
+ 0.092 |
+ 0.008 |
+ 0.008 |
+ 0.017 |
+ 0.000 |
+ 0.183 |
+ 0.250 |
+ 0.067 |
+ 0.083 |
+ 0.200 |
+ 0.067 |
+ 0.208 |
+ 0.117 |
+ 0.142 |
+ 0.100 |
+ 0.100 |
+ 0.108 |
+ 0.092 |
+ 0.117 |
+ 0.058 |
+ 0.083 |
+ 0.100 |
+ 0.108 |
+ 0.142 |
+ 0.158 |
+ 1196 |
+ OrganismalFitness |
+ TADBP_HUMAN |
+ Low |
+ Human |
+
+
+ TAT_HV1BR_Fernandes_2016 |
+ 0.120 |
+ 0.139 |
+ 0.139 |
+ 0.139 |
+ 0.139 |
+ 0.139 |
+ 0.076 |
+ 0.133 |
+ 0.158 |
+ 0.171 |
+ 0.127 |
+ 0.152 |
+ 0.139 |
+ 0.114 |
+ 0.127 |
+ 0.082 |
+ 0.089 |
+ 0.108 |
+ 0.120 |
+ 0.158 |
+ 0.120 |
+ 0.152 |
+ 0.152 |
+ 0.158 |
+ 0.133 |
+ 0.146 |
+ 0.101 |
+ 0.133 |
+ 0.133 |
+ 0.158 |
+ 0.095 |
+ 0.095 |
+ 0.120 |
+ 0.127 |
+ 0.139 |
+ 0.146 |
+ 0.139 |
+ 0.127 |
+ 0.127 |
+ 0.146 |
+ 0.139 |
+ 0.146 |
+ 0.082 |
+ 0.063 |
+ 0.108 |
+ 0.114 |
+ 0.101 |
+ 0.108 |
+ 0.101 |
+ 0.089 |
+ 0.070 |
+ 0.101 |
+ 0.133 |
+ 0.133 |
+ 0.108 |
+ 0.101 |
+ 0.082 |
+ 0.082 |
+ 0.133 |
+ 0.095 |
+ 0.089 |
+ 0.114 |
+ 1577 |
+ OrganismalFitness |
+ TAT_HV1BR |
+ High |
+ Virus |
+
+
+ TCRG1_MOUSE_Tsuboyama_2023_1E0L |
+ 0.396 |
+ 0.340 |
+ 0.377 |
+ 0.387 |
+ 0.349 |
+ 0.349 |
+ 0.443 |
+ 0.085 |
+ 0.368 |
+ 0.302 |
+ 0.104 |
+ 0.217 |
+ 0.226 |
+ 0.368 |
+ 0.170 |
+ 0.283 |
+ 0.358 |
+ 0.151 |
+ 0.226 |
+ 0.132 |
+ 0.094 |
+ 0.075 |
+ 0.104 |
+ 0.142 |
+ 0.123 |
+ 0.085 |
+ 0.104 |
+ 0.142 |
+ 0.057 |
+ 0.085 |
+ 0.075 |
+ 0.047 |
+ 0.349 |
+ 0.132 |
+ 0.160 |
+ 0.274 |
+ 0.302 |
+ 0.340 |
+ 0.340 |
+ 0.311 |
+ 0.340 |
+ 0.311 |
+ 0.330 |
+ 0.160 |
+ 0.170 |
+ 0.264 |
+ 0.189 |
+ 0.142 |
+ 0.274 |
+ 0.358 |
+ 0.311 |
+ 0.245 |
+ 0.208 |
+ 0.349 |
+ 0.283 |
+ 0.340 |
+ 0.292 |
+ 0.302 |
+ 0.245 |
+ 0.292 |
+ 0.358 |
+ 0.236 |
+ 1058 |
+ Stability |
+ TCRG1_MOUSE |
+ Medium |
+ Eukaryote |
+
+
+ THO1_YEAST_Tsuboyama_2023_2WQG |
+ 0.297 |
+ 0.289 |
+ 0.352 |
+ 0.375 |
+ 0.391 |
+ 0.391 |
+ 0.211 |
+ 0.227 |
+ 0.312 |
+ 0.359 |
+ 0.312 |
+ 0.375 |
+ 0.352 |
+ 0.102 |
+ 0.352 |
+ 0.344 |
+ 0.336 |
+ 0.328 |
+ 0.359 |
+ 0.391 |
+ 0.164 |
+ 0.289 |
+ 0.227 |
+ 0.141 |
+ 0.273 |
+ 0.305 |
+ 0.086 |
+ 0.289 |
+ 0.422 |
+ 0.336 |
+ 0.281 |
+ 0.219 |
+ 0.352 |
+ 0.117 |
+ 0.336 |
+ 0.211 |
+ 0.367 |
+ 0.430 |
+ 0.352 |
+ 0.367 |
+ 0.422 |
+ 0.391 |
+ 0.281 |
+ 0.258 |
+ 0.383 |
+ 0.297 |
+ 0.422 |
+ 0.445 |
+ 0.445 |
+ 0.508 |
+ 0.375 |
+ 0.359 |
+ 0.398 |
+ 0.359 |
+ 0.383 |
+ 0.359 |
+ 0.367 |
+ 0.359 |
+ 0.344 |
+ 0.352 |
+ 0.406 |
+ 0.391 |
+ 1279 |
+ Stability |
+ THO1_YEAST |
+ High |
+ Eukaryote |
+
+
+ TNKS2_HUMAN_Tsuboyama_2023_5JRT |
+ 0.297 |
+ 0.277 |
+ 0.243 |
+ 0.243 |
+ 0.216 |
+ 0.216 |
+ 0.034 |
+ 0.196 |
+ 0.297 |
+ 0.338 |
+ 0.250 |
+ 0.230 |
+ 0.250 |
+ 0.061 |
+ 0.270 |
+ 0.291 |
+ 0.257 |
+ 0.250 |
+ 0.243 |
+ 0.196 |
+ 0.209 |
+ 0.291 |
+ 0.257 |
+ 0.277 |
+ 0.216 |
+ 0.284 |
+ 0.203 |
+ 0.264 |
+ 0.230 |
+ 0.230 |
+ 0.297 |
+ 0.284 |
+ 0.142 |
+ 0.135 |
+ 0.209 |
+ 0.216 |
+ 0.277 |
+ 0.291 |
+ 0.270 |
+ 0.236 |
+ 0.230 |
+ 0.236 |
+ 0.250 |
+ 0.162 |
+ 0.351 |
+ 0.236 |
+ 0.345 |
+ 0.345 |
+ 0.372 |
+ 0.419 |
+ 0.223 |
+ 0.297 |
+ 0.270 |
+ 0.264 |
+ 0.270 |
+ 0.291 |
+ 0.297 |
+ 0.277 |
+ 0.284 |
+ 0.264 |
+ 0.338 |
+ 0.311 |
+ 1479 |
+ Stability |
+ TNKS2_HUMAN |
+ High |
+ Human |
+
+
+ TPK1_HUMAN_Weile_2017 |
+ 0.135 |
+ 0.129 |
+ 0.132 |
+ 0.110 |
+ 0.122 |
+ 0.122 |
+ 0.097 |
+ 0.132 |
+ 0.135 |
+ 0.116 |
+ 0.141 |
+ 0.150 |
+ 0.119 |
+ 0.091 |
+ 0.129 |
+ 0.132 |
+ 0.125 |
+ 0.129 |
+ 0.144 |
+ 0.122 |
+ 0.094 |
+ 0.132 |
+ 0.107 |
+ 0.141 |
+ 0.113 |
+ 0.144 |
+ 0.125 |
+ 0.110 |
+ 0.135 |
+ 0.150 |
+ 0.129 |
+ 0.119 |
+ 0.088 |
+ 0.094 |
+ 0.144 |
+ 0.138 |
+ 0.129 |
+ 0.125 |
+ 0.138 |
+ 0.122 |
+ 0.119 |
+ 0.135 |
+ 0.103 |
+ 0.088 |
+ 0.116 |
+ 0.094 |
+ 0.147 |
+ 0.138 |
+ 0.141 |
+ 0.138 |
+ 0.132 |
+ 0.147 |
+ 0.129 |
+ 0.150 |
+ 0.144 |
+ 0.141 |
+ 0.141 |
+ 0.135 |
+ 0.135 |
+ 0.122 |
+ 0.141 |
+ 0.113 |
+ 3181 |
+ OrganismalFitness |
+ TPK1_HUMAN |
+ Medium |
+ Human |
+
+
+ TPMT_HUMAN_Matreyek_2018 |
+ 0.181 |
+ 0.219 |
+ 0.216 |
+ 0.197 |
+ 0.195 |
+ 0.197 |
+ 0.107 |
+ 0.189 |
+ 0.189 |
+ 0.192 |
+ 0.205 |
+ 0.197 |
+ 0.200 |
+ 0.132 |
+ 0.156 |
+ 0.205 |
+ 0.203 |
+ 0.200 |
+ 0.184 |
+ 0.208 |
+ 0.137 |
+ 0.164 |
+ 0.205 |
+ 0.205 |
+ 0.167 |
+ 0.186 |
+ 0.208 |
+ 0.192 |
+ 0.205 |
+ 0.216 |
+ 0.244 |
+ 0.200 |
+ 0.132 |
+ 0.129 |
+ 0.200 |
+ 0.175 |
+ 0.175 |
+ 0.205 |
+ 0.175 |
+ 0.205 |
+ 0.214 |
+ 0.208 |
+ 0.132 |
+ 0.090 |
+ 0.205 |
+ 0.173 |
+ 0.178 |
+ 0.238 |
+ 0.222 |
+ 0.173 |
+ 0.175 |
+ 0.173 |
+ 0.178 |
+ 0.175 |
+ 0.181 |
+ 0.178 |
+ 0.173 |
+ 0.197 |
+ 0.178 |
+ 0.178 |
+ 0.197 |
+ 0.181 |
+ 3648 |
+ Expression |
+ TPMT_HUMAN |
+ Medium |
+ Human |
+
+
+ TPOR_HUMAN_Bridgford_2020 |
+ 0.368 |
+ 0.211 |
+ 0.404 |
+ 0.281 |
+ 0.228 |
+ 0.228 |
+ 0.351 |
+ 0.228 |
+ 0.281 |
+ 0.228 |
+ 0.298 |
+ 0.281 |
+ 0.298 |
+ 0.211 |
+ 0.263 |
+ 0.263 |
+ 0.193 |
+ 0.228 |
+ 0.246 |
+ 0.175 |
+ 0.246 |
+ 0.281 |
+ 0.228 |
+ 0.263 |
+ 0.298 |
+ 0.193 |
+ 0.351 |
+ 0.351 |
+ 0.298 |
+ 0.175 |
+ 0.175 |
+ 0.123 |
+ 0.386 |
+ 0.246 |
+ 0.263 |
+ 0.281 |
+ 0.368 |
+ 0.351 |
+ 0.263 |
+ 0.281 |
+ 0.298 |
+ 0.228 |
+ 0.228 |
+ 0.246 |
+ 0.211 |
+ 0.298 |
+ 0.105 |
+ 0.281 |
+ 0.298 |
+ 0.193 |
+ 0.263 |
+ 0.246 |
+ 0.263 |
+ 0.263 |
+ 0.281 |
+ 0.281 |
+ 0.281 |
+ 0.263 |
+ 0.281 |
+ 0.281 |
+ 0.281 |
+ 0.368 |
+ 562 |
+ OrganismalFitness |
+ TPOR_HUMAN |
+ Low |
+ Human |
+
+
+ TRPC_SACS2_Chan_2017 |
+ 0.382 |
+ 0.513 |
+ 0.461 |
+ 0.474 |
+ 0.480 |
+ 0.474 |
+ 0.125 |
+ 0.414 |
+ 0.480 |
+ 0.480 |
+ 0.454 |
+ 0.428 |
+ 0.434 |
+ 0.191 |
+ 0.329 |
+ 0.401 |
+ 0.480 |
+ 0.500 |
+ 0.447 |
+ 0.480 |
+ 0.178 |
+ 0.355 |
+ 0.388 |
+ 0.414 |
+ 0.395 |
+ 0.322 |
+ 0.401 |
+ 0.421 |
+ 0.487 |
+ 0.513 |
+ 0.474 |
+ 0.395 |
+ 0.158 |
+ 0.322 |
+ 0.355 |
+ 0.342 |
+ 0.375 |
+ 0.414 |
+ 0.408 |
+ 0.461 |
+ 0.454 |
+ 0.454 |
+ 0.237 |
+ 0.145 |
+ 0.388 |
+ 0.296 |
+ 0.283 |
+ 0.434 |
+ 0.454 |
+ 0.151 |
+ 0.428 |
+ 0.441 |
+ 0.428 |
+ 0.461 |
+ 0.421 |
+ 0.447 |
+ 0.421 |
+ 0.447 |
+ 0.441 |
+ 0.467 |
+ 0.513 |
+ 0.329 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_SACS2 |
+ Medium |
+ Prokaryote |
+
+
+ TRPC_THEMA_Chan_2017 |
+ 0.303 |
+ 0.355 |
+ 0.276 |
+ 0.316 |
+ 0.342 |
+ 0.329 |
+ 0.197 |
+ 0.243 |
+ 0.289 |
+ 0.336 |
+ 0.316 |
+ 0.309 |
+ 0.336 |
+ 0.211 |
+ 0.316 |
+ 0.336 |
+ 0.368 |
+ 0.349 |
+ 0.336 |
+ 0.355 |
+ 0.230 |
+ 0.283 |
+ 0.316 |
+ 0.382 |
+ 0.329 |
+ 0.329 |
+ 0.316 |
+ 0.362 |
+ 0.368 |
+ 0.224 |
+ 0.336 |
+ 0.257 |
+ 0.092 |
+ 0.270 |
+ 0.349 |
+ 0.362 |
+ 0.309 |
+ 0.329 |
+ 0.322 |
+ 0.336 |
+ 0.342 |
+ 0.336 |
+ 0.316 |
+ 0.086 |
+ 0.349 |
+ 0.303 |
+ 0.309 |
+ 0.368 |
+ 0.316 |
+ 0.191 |
+ 0.336 |
+ 0.349 |
+ 0.329 |
+ 0.362 |
+ 0.342 |
+ 0.322 |
+ 0.342 |
+ 0.322 |
+ 0.289 |
+ 0.336 |
+ 0.382 |
+ 0.375 |
+ 1519 |
+ OrganismalFitness |
+ TRPC_THEMA |
+ Medium |
+ Prokaryote |
+
+
+ UBC9_HUMAN_Weile_2017 |
+ 0.152 |
+ 0.187 |
+ 0.171 |
+ 0.171 |
+ 0.187 |
+ 0.187 |
+ 0.070 |
+ 0.125 |
+ 0.140 |
+ 0.156 |
+ 0.160 |
+ 0.144 |
+ 0.144 |
+ 0.074 |
+ 0.093 |
+ 0.144 |
+ 0.152 |
+ 0.160 |
+ 0.148 |
+ 0.160 |
+ 0.109 |
+ 0.121 |
+ 0.128 |
+ 0.109 |
+ 0.128 |
+ 0.144 |
+ 0.128 |
+ 0.136 |
+ 0.125 |
+ 0.144 |
+ 0.097 |
+ 0.101 |
+ 0.054 |
+ 0.109 |
+ 0.113 |
+ 0.128 |
+ 0.136 |
+ 0.148 |
+ 0.160 |
+ 0.167 |
+ 0.183 |
+ 0.175 |
+ 0.144 |
+ 0.086 |
+ 0.128 |
+ 0.171 |
+ 0.156 |
+ 0.160 |
+ 0.132 |
+ 0.144 |
+ 0.144 |
+ 0.163 |
+ 0.191 |
+ 0.160 |
+ 0.152 |
+ 0.175 |
+ 0.167 |
+ 0.148 |
+ 0.148 |
+ 0.160 |
+ 0.144 |
+ 0.132 |
+ 2563 |
+ OrganismalFitness |
+ UBC9_HUMAN |
+ Medium |
+ Human |
+
+
+ UBE4B_HUMAN_Tsuboyama_2023_3L1X |
+ 0.251 |
+ 0.386 |
+ 0.408 |
+ 0.399 |
+ 0.427 |
+ 0.424 |
+ 0.193 |
+ 0.333 |
+ 0.383 |
+ 0.430 |
+ 0.490 |
+ 0.433 |
+ 0.455 |
+ 0.229 |
+ 0.427 |
+ 0.510 |
+ 0.499 |
+ 0.477 |
+ 0.460 |
+ 0.402 |
+ 0.143 |
+ 0.380 |
+ 0.397 |
+ 0.430 |
+ 0.322 |
+ 0.408 |
+ 0.413 |
+ 0.405 |
+ 0.419 |
+ 0.386 |
+ 0.466 |
+ 0.405 |
+ 0.328 |
+ 0.132 |
+ 0.160 |
+ 0.446 |
+ 0.311 |
+ 0.311 |
+ 0.402 |
+ 0.435 |
+ 0.438 |
+ 0.427 |
+ 0.264 |
+ 0.171 |
+ 0.270 |
+ 0.311 |
+ 0.372 |
+ 0.325 |
+ 0.521 |
+ 0.342 |
+ 0.512 |
+ 0.545 |
+ 0.534 |
+ 0.534 |
+ 0.543 |
+ 0.510 |
+ 0.512 |
+ 0.529 |
+ 0.526 |
+ 0.526 |
+ 0.499 |
+ 0.388 |
+ 3622 |
+ Stability |
+ UBE4B_HUMAN |
+ High |
+ Human |
+
+
+ UBE4B_MOUSE_Starita_2013 |
+ 0.156 |
+ 0.122 |
+ 0.122 |
+ 0.122 |
+ 0.100 |
+ 0.122 |
+ 0.122 |
+ 0.122 |
+ 0.133 |
+ 0.111 |
+ 0.111 |
+ 0.133 |
+ 0.111 |
+ 0.078 |
+ 0.122 |
+ 0.122 |
+ 0.122 |
+ 0.122 |
+ 0.089 |
+ 0.056 |
+ 0.089 |
+ 0.056 |
+ 0.089 |
+ 0.100 |
+ 0.078 |
+ 0.089 |
+ 0.089 |
+ 0.100 |
+ 0.100 |
+ 0.111 |
+ 0.056 |
+ 0.067 |
+ 0.100 |
+ 0.111 |
+ 0.078 |
+ 0.089 |
+ 0.156 |
+ 0.156 |
+ 0.156 |
+ 0.122 |
+ 0.122 |
+ 0.122 |
+ 0.100 |
+ 0.144 |
+ 0.067 |
+ 0.122 |
+ 0.133 |
+ 0.089 |
+ 0.044 |
+ 0.156 |
+ 0.156 |
+ 0.122 |
+ 0.133 |
+ 0.122 |
+ 0.133 |
+ 0.167 |
+ 0.144 |
+ 0.133 |
+ 0.133 |
+ 0.156 |
+ 0.167 |
+ 0.156 |
+ 899 |
+ Activity |
+ UBE4B_MOUSE |
+ Low |
+ Eukaryote |
+
+
+ UBR5_HUMAN_Tsuboyama_2023_1I2T |
+ 0.171 |
+ 0.192 |
+ 0.137 |
+ 0.151 |
+ 0.164 |
+ 0.164 |
+ 0.185 |
+ 0.068 |
+ 0.158 |
+ 0.144 |
+ 0.130 |
+ 0.185 |
+ 0.240 |
+ 0.192 |
+ 0.192 |
+ 0.205 |
+ 0.096 |
+ 0.205 |
+ 0.185 |
+ 0.130 |
+ 0.137 |
+ 0.158 |
+ 0.164 |
+ 0.178 |
+ 0.199 |
+ 0.130 |
+ 0.178 |
+ 0.158 |
+ 0.192 |
+ 0.178 |
+ 0.164 |
+ 0.089 |
+ 0.185 |
+ 0.185 |
+ 0.178 |
+ 0.192 |
+ 0.144 |
+ 0.164 |
+ 0.164 |
+ 0.164 |
+ 0.158 |
+ 0.192 |
+ 0.212 |
+ 0.212 |
+ 0.212 |
+ 0.164 |
+ 0.226 |
+ 0.185 |
+ 0.192 |
+ 0.253 |
+ 0.164 |
+ 0.219 |
+ 0.247 |
+ 0.240 |
+ 0.240 |
+ 0.212 |
+ 0.219 |
+ 0.226 |
+ 0.212 |
+ 0.233 |
+ 0.212 |
+ 0.240 |
+ 1453 |
+ Stability |
+ UBR5_HUMAN |
+ Medium |
+ Human |
+
+
+ VG08_BPP22_Tsuboyama_2023_2GP8 |
+ 0.288 |
+ 0.178 |
+ 0.164 |
+ 0.164 |
+ 0.192 |
+ 0.205 |
+ 0.233 |
+ 0.110 |
+ 0.178 |
+ 0.260 |
+ 0.178 |
+ 0.274 |
+ 0.247 |
+ 0.274 |
+ 0.219 |
+ 0.192 |
+ 0.247 |
+ 0.178 |
+ 0.123 |
+ 0.123 |
+ 0.233 |
+ 0.219 |
+ 0.110 |
+ 0.110 |
+ 0.205 |
+ 0.096 |
+ 0.151 |
+ 0.123 |
+ 0.082 |
+ 0.205 |
+ 0.096 |
+ 0.068 |
+ 0.123 |
+ 0.192 |
+ 0.260 |
+ 0.082 |
+ 0.233 |
+ 0.329 |
+ 0.164 |
+ 0.205 |
+ 0.205 |
+ 0.164 |
+ 0.205 |
+ 0.260 |
+ 0.219 |
+ 0.164 |
+ 0.082 |
+ 0.068 |
+ 0.096 |
+ 0.205 |
+ 0.205 |
+ 0.151 |
+ 0.233 |
+ 0.205 |
+ 0.233 |
+ 0.233 |
+ 0.205 |
+ 0.247 |
+ 0.274 |
+ 0.219 |
+ 0.260 |
+ 0.192 |
+ 723 |
+ Stability |
+ VG08_BPP22 |
+ High |
+ Virus |
+
+
+ VILI_CHICK_Tsuboyama_2023_1YU5 |
+ 0.249 |
+ 0.436 |
+ 0.502 |
+ 0.525 |
+ 0.486 |
+ 0.502 |
+ 0.125 |
+ 0.280 |
+ 0.444 |
+ 0.459 |
+ 0.432 |
+ 0.412 |
+ 0.444 |
+ 0.171 |
+ 0.117 |
+ 0.463 |
+ 0.553 |
+ 0.494 |
+ 0.401 |
+ 0.467 |
+ 0.109 |
+ 0.265 |
+ 0.284 |
+ 0.296 |
+ 0.346 |
+ 0.346 |
+ 0.331 |
+ 0.292 |
+ 0.428 |
+ 0.447 |
+ 0.463 |
+ 0.424 |
+ 0.187 |
+ 0.148 |
+ 0.346 |
+ 0.319 |
+ 0.346 |
+ 0.420 |
+ 0.385 |
+ 0.467 |
+ 0.486 |
+ 0.463 |
+ 0.187 |
+ 0.237 |
+ 0.479 |
+ 0.385 |
+ 0.428 |
+ 0.475 |
+ 0.541 |
+ 0.397 |
+ 0.541 |
+ 0.510 |
+ 0.545 |
+ 0.545 |
+ 0.514 |
+ 0.556 |
+ 0.553 |
+ 0.533 |
+ 0.525 |
+ 0.556 |
+ 0.556 |
+ 0.424 |
+ 2568 |
+ Stability |
+ VILI_CHICK |
+ High |
+ Eukaryote |
+
+
+ VKOR1_HUMAN_Chiasson_2020_abundance |
+ 0.252 |
+ 0.174 |
+ 0.215 |
+ 0.219 |
+ 0.196 |
+ 0.196 |
+ 0.137 |
+ 0.163 |
+ 0.211 |
+ 0.204 |
+ 0.167 |
+ 0.193 |
+ 0.181 |
+ 0.126 |
+ 0.215 |
+ 0.241 |
+ 0.204 |
+ 0.222 |
+ 0.178 |
+ 0.207 |
+ 0.181 |
+ 0.152 |
+ 0.248 |
+ 0.215 |
+ 0.211 |
+ 0.144 |
+ 0.167 |
+ 0.148 |
+ 0.137 |
+ 0.204 |
+ 0.167 |
+ 0.133 |
+ 0.126 |
+ 0.159 |
+ 0.196 |
+ 0.141 |
+ 0.230 |
+ 0.219 |
+ 0.185 |
+ 0.196 |
+ 0.189 |
+ 0.215 |
+ 0.137 |
+ 0.104 |
+ 0.196 |
+ 0.215 |
+ 0.204 |
+ 0.174 |
+ 0.200 |
+ 0.163 |
+ 0.211 |
+ 0.181 |
+ 0.219 |
+ 0.215 |
+ 0.222 |
+ 0.219 |
+ 0.207 |
+ 0.181 |
+ 0.215 |
+ 0.207 |
+ 0.156 |
+ 0.230 |
+ 2695 |
+ Expression |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VKOR1_HUMAN_Chiasson_2020_activity |
+ 0.171 |
+ 0.171 |
+ 0.186 |
+ 0.171 |
+ 0.171 |
+ 0.186 |
+ 0.071 |
+ 0.157 |
+ 0.157 |
+ 0.171 |
+ 0.129 |
+ 0.143 |
+ 0.157 |
+ 0.071 |
+ 0.114 |
+ 0.157 |
+ 0.171 |
+ 0.143 |
+ 0.157 |
+ 0.200 |
+ 0.100 |
+ 0.100 |
+ 0.143 |
+ 0.114 |
+ 0.100 |
+ 0.114 |
+ 0.143 |
+ 0.129 |
+ 0.086 |
+ 0.171 |
+ 0.114 |
+ 0.143 |
+ 0.186 |
+ 0.057 |
+ 0.057 |
+ 0.100 |
+ 0.143 |
+ 0.157 |
+ 0.143 |
+ 0.157 |
+ 0.143 |
+ 0.157 |
+ 0.071 |
+ 0.057 |
+ 0.143 |
+ 0.086 |
+ 0.171 |
+ 0.186 |
+ 0.186 |
+ 0.143 |
+ 0.129 |
+ 0.157 |
+ 0.129 |
+ 0.171 |
+ 0.229 |
+ 0.157 |
+ 0.143 |
+ 0.143 |
+ 0.129 |
+ 0.157 |
+ 0.157 |
+ 0.114 |
+ 697 |
+ Activity |
+ VKOR1_HUMAN |
+ Medium |
+ Human |
+
+
+ VRPI_BPT7_Tsuboyama_2023_2WNM |
+ 0.048 |
+ 0.152 |
+ 0.143 |
+ 0.162 |
+ 0.133 |
+ 0.124 |
+ 0.076 |
+ 0.048 |
+ 0.095 |
+ 0.057 |
+ 0.181 |
+ 0.133 |
+ 0.133 |
+ 0.114 |
+ 0.133 |
+ 0.190 |
+ 0.229 |
+ 0.257 |
+ 0.162 |
+ 0.076 |
+ 0.086 |
+ 0.143 |
+ 0.095 |
+ 0.133 |
+ 0.152 |
+ 0.105 |
+ 0.086 |
+ 0.124 |
+ 0.248 |
+ 0.124 |
+ 0.200 |
+ 0.190 |
+ 0.114 |
+ 0.105 |
+ 0.162 |
+ 0.133 |
+ 0.076 |
+ 0.095 |
+ 0.095 |
+ 0.143 |
+ 0.133 |
+ 0.124 |
+ 0.152 |
+ 0.105 |
+ 0.181 |
+ 0.124 |
+ 0.276 |
+ 0.238 |
+ 0.210 |
+ 0.267 |
+ 0.333 |
+ 0.267 |
+ 0.333 |
+ 0.333 |
+ 0.295 |
+ 0.362 |
+ 0.352 |
+ 0.324 |
+ 0.343 |
+ 0.352 |
+ 0.381 |
+ 0.267 |
+ 1047 |
+ Stability |
+ VRPI_BPT7 |
+ Medium |
+ Virus |
+
+
+ YAIA_ECOLI_Tsuboyama_2023_2KVT |
+ 0.180 |
+ 0.275 |
+ 0.302 |
+ 0.302 |
+ 0.296 |
+ 0.296 |
+ 0.069 |
+ 0.249 |
+ 0.360 |
+ 0.349 |
+ 0.265 |
+ 0.122 |
+ 0.312 |
+ 0.079 |
+ 0.095 |
+ 0.344 |
+ 0.392 |
+ 0.487 |
+ 0.439 |
+ 0.317 |
+ 0.048 |
+ 0.058 |
+ 0.048 |
+ 0.212 |
+ 0.063 |
+ 0.074 |
+ 0.053 |
+ 0.074 |
+ 0.360 |
+ 0.328 |
+ 0.307 |
+ 0.265 |
+ 0.053 |
+ 0.048 |
+ 0.116 |
+ 0.392 |
+ 0.180 |
+ 0.243 |
+ 0.333 |
+ 0.296 |
+ 0.312 |
+ 0.339 |
+ 0.164 |
+ 0.079 |
+ 0.180 |
+ 0.175 |
+ 0.439 |
+ 0.392 |
+ 0.434 |
+ 0.365 |
+ 0.429 |
+ 0.444 |
+ 0.381 |
+ 0.423 |
+ 0.429 |
+ 0.450 |
+ 0.439 |
+ 0.429 |
+ 0.402 |
+ 0.444 |
+ 0.439 |
+ 0.222 |
+ 1890 |
+ Stability |
+ YAIA_ECOLI |
+ Medium |
+ Prokaryote |
+
+
+ YAP1_HUMAN_Araya_2012 |
+ 0.220 |
+ 0.155 |
+ 0.232 |
+ 0.231 |
+ 0.233 |
+ 0.228 |
+ 0.176 |
+ 0.073 |
+ 0.065 |
+ 0.057 |
+ 0.175 |
+ 0.135 |
+ 0.116 |
+ 0.181 |
+ 0.202 |
+ 0.225 |
+ 0.209 |
+ 0.191 |
+ 0.152 |
+ 0.093 |
+ 0.080 |
+ 0.068 |
+ 0.065 |
+ 0.086 |
+ 0.096 |
+ 0.060 |
+ 0.066 |
+ 0.089 |
+ 0.062 |
+ 0.136 |
+ 0.125 |
+ 0.136 |
+ 0.138 |
+ 0.133 |
+ 0.067 |
+ 0.092 |
+ 0.188 |
+ 0.127 |
+ 0.150 |
+ 0.209 |
+ 0.184 |
+ 0.204 |
+ 0.259 |
+ 0.034 |
+ 0.178 |
+ 0.263 |
+ 0.148 |
+ 0.182 |
+ 0.125 |
+ 0.162 |
+ 0.115 |
+ 0.123 |
+ 0.138 |
+ 0.151 |
+ 0.107 |
+ 0.130 |
+ 0.149 |
+ 0.147 |
+ 0.166 |
+ 0.137 |
+ 0.234 |
+ 0.195 |
+ 10075 |
+ Binding |
+ YAP1_HUMAN |
+ Low |
+ Human |
+
+
+ YNZC_BACSU_Tsuboyama_2023_2JVD |
+ 0.374 |
+ 0.374 |
+ 0.378 |
+ 0.374 |
+ 0.378 |
+ 0.378 |
+ 0.383 |
+ 0.330 |
+ 0.391 |
+ 0.409 |
+ 0.457 |
+ 0.448 |
+ 0.457 |
+ 0.461 |
+ 0.474 |
+ 0.439 |
+ 0.483 |
+ 0.470 |
+ 0.430 |
+ 0.378 |
+ 0.435 |
+ 0.491 |
+ 0.509 |
+ 0.517 |
+ 0.478 |
+ 0.443 |
+ 0.478 |
+ 0.430 |
+ 0.422 |
+ 0.435 |
+ 0.422 |
+ 0.391 |
+ 0.439 |
+ 0.435 |
+ 0.452 |
+ 0.426 |
+ 0.426 |
+ 0.391 |
+ 0.383 |
+ 0.400 |
+ 0.387 |
+ 0.391 |
+ 0.422 |
+ 0.361 |
+ 0.422 |
+ 0.439 |
+ 0.439 |
+ 0.496 |
+ 0.548 |
+ 0.530 |
+ 0.530 |
+ 0.474 |
+ 0.504 |
+ 0.504 |
+ 0.513 |
+ 0.517 |
+ 0.496 |
+ 0.500 |
+ 0.491 |
+ 0.496 |
+ 0.457 |
+ 0.591 |
+ 2300 |
+ Stability |
+ YNZC_BACSU |
+ Medium |
+ Prokaryote |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_Selection_Type_level.csv b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_Selection_Type_level.csv
new file mode 100644
index 0000000..d32dcbd
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_Selection_Type_level.csv
@@ -0,0 +1,7 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type
+0.17,0.201,0.199,0.194,0.199,0.199,0.116,0.163,0.19,0.2,0.18,0.166,0.173,0.126,0.155,0.168,0.178,0.176,0.18,0.167,0.155,0.168,0.173,0.17,0.155,0.172,0.183,0.185,0.181,0.196,0.18,0.172,0.113,0.154,0.163,0.181,0.186,0.188,0.2,0.198,0.198,0.2,0.147,0.116,0.176,0.155,0.178,0.191,0.184,0.149,0.188,0.193,0.187,0.196,0.189,0.187,0.189,0.19,0.187,0.191,0.191,0.168,Activity
+0.198,0.204,0.216,0.213,0.217,0.214,0.138,0.174,0.194,0.204,0.178,0.175,0.213,0.178,0.197,0.204,0.213,0.207,0.196,0.204,0.169,0.163,0.178,0.173,0.166,0.19,0.182,0.175,0.179,0.198,0.192,0.183,0.138,0.188,0.176,0.185,0.215,0.208,0.205,0.223,0.218,0.218,0.188,0.124,0.206,0.185,0.201,0.223,0.208,0.146,0.198,0.195,0.195,0.196,0.19,0.2,0.2,0.202,0.199,0.196,0.233,0.198,Binding
+0.179,0.209,0.214,0.218,0.223,0.225,0.141,0.186,0.201,0.218,0.197,0.204,0.21,0.162,0.181,0.196,0.215,0.212,0.199,0.19,0.194,0.199,0.203,0.202,0.204,0.221,0.205,0.223,0.184,0.198,0.184,0.159,0.126,0.182,0.19,0.191,0.208,0.211,0.21,0.226,0.227,0.224,0.177,0.132,0.212,0.195,0.214,0.221,0.205,0.152,0.216,0.22,0.22,0.212,0.218,0.215,0.218,0.228,0.218,0.22,0.235,0.225,Expression
+0.196,0.223,0.219,0.223,0.228,0.228,0.123,0.188,0.219,0.219,0.176,0.182,0.193,0.117,0.138,0.151,0.177,0.184,0.195,0.204,0.185,0.198,0.204,0.202,0.172,0.198,0.198,0.202,0.214,0.223,0.207,0.194,0.121,0.183,0.198,0.213,0.202,0.21,0.221,0.228,0.228,0.229,0.14,0.106,0.179,0.152,0.16,0.192,0.17,0.138,0.184,0.187,0.189,0.187,0.189,0.19,0.183,0.185,0.189,0.191,0.17,0.152,OrganismalFitness
+0.261,0.272,0.28,0.282,0.284,0.283,0.176,0.192,0.259,0.278,0.282,0.245,0.265,0.181,0.254,0.299,0.3,0.286,0.27,0.252,0.186,0.207,0.215,0.216,0.218,0.229,0.219,0.224,0.237,0.24,0.242,0.214,0.168,0.177,0.194,0.216,0.271,0.271,0.265,0.286,0.28,0.279,0.233,0.176,0.265,0.254,0.327,0.3,0.344,0.342,0.325,0.33,0.335,0.331,0.332,0.335,0.327,0.329,0.321,0.337,0.331,0.311,Stability
+0.201,0.222,0.225,0.226,0.23,0.23,0.139,0.181,0.213,0.224,0.203,0.194,0.211,0.153,0.185,0.204,0.217,0.213,0.208,0.203,0.178,0.187,0.195,0.193,0.183,0.202,0.197,0.202,0.199,0.211,0.201,0.184,0.133,0.177,0.184,0.197,0.216,0.218,0.22,0.232,0.23,0.23,0.177,0.131,0.208,0.188,0.216,0.225,0.222,0.186,0.222,0.225,0.225,0.224,0.224,0.226,0.223,0.227,0.223,0.227,0.232,0.211,
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_level.csv b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_level.csv
new file mode 100644
index 0000000..10c5e01
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/DMS_substitutions_Top_recall_Uniprot_level.csv
@@ -0,0 +1,223 @@
+Site_Independent,EVmutation,DeepSequence_single,DeepSequence_ensemble,EVE_single,EVE_ensemble,Unirep,Unirep_evotune,MSA_Transformer_single,MSA_Transformer_ensemble,ESM1b,ESM1v_single,ESM1v_ensemble,ESM2_8M,ESM2_35M,ESM2_150M,ESM2_650M,ESM2_3B,ESM2_15B,Wavenet,RITA_s,RITA_m,RITA_l,RITA_xl,Progen2_small,Progen2_medium,Progen2_base,Progen2_large,Progen2_xlarge,GEMME,VESPA,VESPAl,ProtGPT2,Tranception_S_no_retrieval,Tranception_M_no_retrieval,Tranception_L_no_retrieval,Tranception_S,Tranception_M,Tranception_L,TranceptEVE_S,TranceptEVE_M,TranceptEVE_L,CARP_38M,CARP_600K,CARP_640M,CARP_76M,MIF,MIFST,ESM-IF1,ProteinMPNN,ProtSSN_k10_h512,ProtSSN_k10_h768,ProtSSN_k10_h1280,ProtSSN_k20_h512,ProtSSN_k20_h768,ProtSSN_k20_h1280,ProtSSN_k30_h512,ProtSSN_k30_h768,ProtSSN_k30_h1280,ProtSSN_ensemble,SaProt_650M_AF2,SaProt_35M_AF2,Selection Type,UniProt_ID,MSA_Neff_L_category,Taxon
+0.278,0.279,0.204,0.205,0.273,0.28,0.093,0.086,0.315,0.318,0.117,0.1,0.124,0.081,0.096,0.086,0.127,0.237,0.262,0.178,0.268,0.28,0.285,0.278,0.251,0.267,0.286,0.254,0.277,0.291,0.257,0.219,0.099,0.223,0.232,0.241,0.265,0.267,0.286,0.268,0.269,0.285,0.104,0.103,0.148,0.101,0.169,0.201,0.151,0.153,0.18,0.168,0.183,0.183,0.181,0.166,0.148,0.172,0.159,0.171,0.16,0.135,OrganismalFitness,A0A140D2T1_ZIKV,Medium,Virus
+0.276,0.27,0.269,0.273,0.277,0.28,0.095,0.269,0.293,0.293,0.234,0.259,0.266,0.103,0.094,0.095,0.103,0.11,0.117,0.281,0.279,0.28,0.286,0.291,0.273,0.289,0.279,0.282,0.293,0.293,0.263,0.248,0.221,0.274,0.284,0.289,0.281,0.284,0.286,0.286,0.282,0.285,0.192,0.072,0.267,0.261,0.138,0.268,0.121,0.145,0.129,0.148,0.154,0.138,0.143,0.141,0.144,0.126,0.121,0.143,0.137,0.115,OrganismalFitness,A0A192B1T2_9HIV1,Medium,Virus
+0.086,0.097,0.108,0.108,0.086,0.097,0.075,0.129,0.108,0.075,0.118,0.129,0.108,0.086,0.097,0.118,0.075,0.086,0.108,0.065,0.108,0.097,0.086,0.086,0.086,0.097,0.086,0.108,0.075,0.065,0.108,0.097,0.183,0.086,0.108,0.097,0.108,0.086,0.097,0.086,0.097,0.097,0.075,0.032,0.097,0.065,0.129,0.108,0.14,0.118,0.129,0.097,0.118,0.14,0.075,0.118,0.097,0.151,0.097,0.097,0.14,0.054,Activity,A0A1I9GEU1_NEIME,Medium,Prokaryote
+0.139,0.151,0.12,0.114,0.163,0.151,0.078,0.096,0.163,0.169,0.12,0.12,0.127,0.09,0.09,0.114,0.133,0.12,0.127,0.157,0.09,0.096,0.114,0.114,0.066,0.127,0.133,0.078,0.114,0.163,0.157,0.187,0.06,0.09,0.108,0.084,0.133,0.139,0.151,0.157,0.157,0.157,0.114,0.102,0.12,0.108,0.139,0.145,0.193,0.175,0.163,0.181,0.12,0.12,0.145,0.12,0.12,0.127,0.139,0.145,0.133,0.102,Activity,A0A247D711_LISMN,High,Prokaryote
+0.233,0.28,0.272,0.282,0.277,0.274,0.094,0.272,0.286,0.284,0.131,0.255,0.288,0.12,0.12,0.122,0.242,0.25,0.255,0.302,0.263,0.278,0.292,0.292,0.193,0.264,0.296,0.264,0.273,0.278,0.246,0.204,0.095,0.258,0.272,0.269,0.276,0.286,0.285,0.28,0.282,0.277,0.11,0.103,0.172,0.122,0.169,0.207,0.195,0.149,0.228,0.241,0.237,0.236,0.228,0.242,0.224,0.232,0.238,0.231,0.15,0.156,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.233,0.28,0.272,0.282,0.277,0.274,0.094,0.272,0.286,0.284,0.131,0.255,0.288,0.12,0.12,0.122,0.242,0.25,0.255,0.302,0.263,0.278,0.292,0.292,0.193,0.264,0.296,0.264,0.273,0.278,0.246,0.204,0.095,0.258,0.272,0.269,0.276,0.286,0.285,0.28,0.282,0.277,0.11,0.103,0.172,0.122,0.169,0.207,0.195,0.149,0.228,0.241,0.237,0.236,0.228,0.242,0.224,0.232,0.238,0.231,0.15,0.156,OrganismalFitness,A0A2Z5U3Z0_9INFA,Medium,Virus
+0.242,0.245,0.247,0.247,0.256,0.257,0.105,0.228,0.191,0.188,0.096,0.095,0.096,0.105,0.109,0.104,0.135,0.146,0.198,0.222,0.204,0.23,0.234,0.215,0.107,0.168,0.171,0.173,0.2,0.237,0.199,0.191,0.106,0.202,0.231,0.235,0.234,0.245,0.245,0.258,0.259,0.261,0.105,0.109,0.125,0.112,0.122,0.129,0.072,0.13,0.124,0.123,0.105,0.133,0.123,0.13,0.119,0.127,0.129,0.119,0.118,0.109,OrganismalFitness,A4D664_9INFA,Medium,Virus
+0.18,0.204,0.198,0.22,0.184,0.184,0.138,0.176,0.244,0.226,0.257,0.234,0.253,0.146,0.216,0.263,0.309,0.261,0.271,0.23,0.136,0.172,0.194,0.232,0.184,0.23,0.238,0.234,0.242,0.192,0.253,0.21,0.054,0.174,0.204,0.216,0.238,0.236,0.214,0.226,0.214,0.212,0.138,0.092,0.305,0.21,0.283,0.309,0.248,0.222,0.329,0.307,0.293,0.337,0.331,0.323,0.317,0.343,0.329,0.331,0.271,0.234,OrganismalFitness,A4GRB6_PSEAI,High,Prokaryote
+0.104,0.158,0.167,0.166,0.128,0.134,0.072,0.13,0.164,0.155,0.089,0.123,0.149,0.069,0.068,0.076,0.078,0.126,0.126,0.141,0.096,0.105,0.096,0.121,0.104,0.099,0.095,0.095,0.103,0.144,0.13,0.14,0.159,0.076,0.146,0.173,0.097,0.155,0.164,0.119,0.144,0.152,0.06,0.059,0.134,0.07,0.111,0.092,0.032,0.111,0.142,0.148,0.128,0.122,0.115,0.131,0.125,0.115,0.131,0.124,0.087,0.091,Stability,A4_HUMAN,Low,Human
+0.177,0.26,0.188,0.204,0.193,0.193,0.144,0.094,0.249,0.238,0.204,0.21,0.199,0.133,0.155,0.166,0.215,0.21,0.238,0.227,0.149,0.166,0.199,0.138,0.16,0.204,0.193,0.193,0.21,0.276,0.16,0.122,0.083,0.199,0.177,0.227,0.227,0.204,0.21,0.21,0.21,0.215,0.138,0.127,0.16,0.166,0.227,0.238,0.227,0.188,0.249,0.221,0.221,0.232,0.232,0.265,0.215,0.238,0.265,0.249,0.171,0.166,OrganismalFitness,AACC1_PSEAI,High,Prokaryote
+0.157,0.157,0.211,0.193,0.166,0.157,0.054,0.148,0.157,0.148,0.103,0.143,0.139,0.058,0.09,0.108,0.139,0.121,0.139,0.193,0.126,0.202,0.22,0.188,0.157,0.188,0.184,0.197,0.184,0.197,0.157,0.139,0.143,0.108,0.188,0.17,0.17,0.152,0.175,0.184,0.17,0.184,0.085,0.072,0.121,0.067,0.076,0.121,0.143,0.09,0.121,0.13,0.117,0.108,0.117,0.139,0.126,0.112,0.121,0.099,0.112,0.103,Binding,ACE2_HUMAN,Medium,Human
+0.142,0.168,0.168,0.174,0.201,0.206,0.142,0.206,0.171,0.188,0.178,0.156,0.177,0.146,0.15,0.158,0.158,0.167,0.172,0.159,0.168,0.174,0.172,0.171,0.188,0.179,0.183,0.181,0.159,0.162,0.155,0.133,0.121,0.186,0.168,0.182,0.165,0.165,0.174,0.191,0.192,0.204,0.163,0.118,0.172,0.177,0.156,0.181,0.167,0.127,0.155,0.16,0.135,0.165,0.16,0.153,0.15,0.154,0.145,0.154,0.171,0.16,Activity,ADRB2_HUMAN,Medium,Human
+0.143,0.238,0.333,0.19,0.238,0.238,0.048,0.143,0.19,0.19,0.238,0.19,0.19,0.095,0.0,0.143,0.19,0.143,0.095,0.19,0.048,0.095,0.286,0.19,0.048,0.19,0.19,0.19,0.19,0.19,0.19,0.238,0.095,0.095,0.095,0.143,0.238,0.19,0.238,0.238,0.238,0.19,0.048,0.048,0.19,0.048,0.333,0.143,0.143,0.333,0.143,0.333,0.143,0.286,0.143,0.143,0.143,0.143,0.143,0.143,0.048,0.048,Activity,AICDA_HUMAN,Medium,Human
+0.289,0.372,0.372,0.369,0.359,0.379,0.07,0.336,0.352,0.369,0.433,0.101,0.319,0.057,0.124,0.409,0.386,0.409,0.413,0.366,0.057,0.117,0.346,0.342,0.114,0.339,0.342,0.265,0.383,0.336,0.383,0.346,0.326,0.057,0.094,0.114,0.339,0.336,0.346,0.383,0.379,0.383,0.141,0.134,0.396,0.315,0.359,0.356,0.416,0.406,0.406,0.409,0.399,0.386,0.403,0.396,0.379,0.386,0.403,0.396,0.43,0.235,Stability,AMFR_HUMAN,Medium,Human
+0.143,0.114,0.13,0.124,0.127,0.119,0.103,0.159,0.119,0.141,0.181,0.144,0.128,0.152,0.167,0.197,0.193,0.233,0.127,0.165,0.188,0.175,0.157,0.13,0.151,0.151,0.148,0.169,0.103,0.104,0.13,0.133,0.074,0.188,0.136,0.112,0.127,0.12,0.106,0.124,0.122,0.124,0.159,0.146,0.193,0.17,0.185,0.226,0.207,0.143,0.188,0.194,0.18,0.175,0.159,0.154,0.165,0.169,0.165,0.17,0.234,0.161,Activity,AMIE_PSEAE,High,Prokaryote
+0.214,0.208,0.208,0.206,0.206,0.199,0.156,0.176,0.154,0.154,0.156,0.171,0.165,0.195,0.231,0.251,0.244,0.225,0.212,0.086,0.169,0.128,0.146,0.148,0.173,0.15,0.141,0.137,0.113,0.191,0.139,0.126,0.139,0.165,0.126,0.137,0.193,0.167,0.169,0.203,0.186,0.188,0.206,0.169,0.152,0.188,0.173,0.158,0.188,0.133,0.251,0.242,0.214,0.225,0.225,0.236,0.238,0.257,0.233,0.24,0.208,0.236,Activity,ANCSZ,Medium,Eukaryote
+0.209,0.194,0.248,0.24,0.24,0.217,0.155,0.132,0.171,0.202,0.233,0.178,0.233,0.194,0.233,0.287,0.264,0.24,0.202,0.178,0.217,0.132,0.14,0.14,0.217,0.116,0.155,0.147,0.147,0.163,0.109,0.062,0.031,0.178,0.132,0.132,0.194,0.217,0.186,0.248,0.209,0.194,0.194,0.155,0.233,0.248,0.372,0.31,0.302,0.279,0.24,0.279,0.264,0.264,0.256,0.256,0.233,0.256,0.271,0.256,0.295,0.31,Stability,ARGR_ECOLI,Medium,Prokaryote
+0.294,0.235,0.353,0.294,0.294,0.294,0.176,0.294,0.294,0.294,0.059,0.059,0.471,0.235,0.235,0.235,0.353,0.294,0.294,0.294,0.176,0.059,0.176,0.235,0.059,0.235,0.294,0.118,0.235,0.235,0.235,0.235,0.176,0.294,0.118,0.176,0.353,0.353,0.294,0.353,0.353,0.294,0.235,0.235,0.412,0.176,0.176,0.353,0.235,0.059,0.235,0.294,0.235,0.235,0.176,0.235,0.235,0.294,0.235,0.235,0.353,0.176,Binding,B2L11_HUMAN,Low,Human
+0.217,0.28,0.203,0.203,0.213,0.179,0.184,0.174,0.285,0.285,0.271,0.29,0.309,0.126,0.275,0.3,0.372,0.329,0.329,0.251,0.208,0.227,0.251,0.242,0.126,0.261,0.198,0.227,0.217,0.188,0.184,0.14,0.15,0.232,0.188,0.242,0.222,0.232,0.266,0.208,0.174,0.198,0.227,0.077,0.217,0.232,0.285,0.246,0.348,0.343,0.324,0.338,0.329,0.382,0.353,0.396,0.362,0.362,0.362,0.353,0.314,0.353,Stability,BBC1_YEAST,High,Eukaryote
+0.253,0.247,0.228,0.241,0.234,0.247,0.127,0.184,0.196,0.278,0.241,0.297,0.304,0.133,0.158,0.228,0.335,0.266,0.31,0.272,0.12,0.215,0.272,0.31,0.241,0.291,0.234,0.297,0.278,0.253,0.215,0.177,0.095,0.177,0.228,0.259,0.253,0.272,0.272,0.241,0.241,0.253,0.19,0.158,0.266,0.234,0.297,0.342,0.386,0.323,0.386,0.361,0.342,0.373,0.354,0.354,0.361,0.361,0.31,0.361,0.361,0.222,Stability,BCHB_CHLTE,Medium,Prokaryote
+0.2,0.324,0.328,0.324,0.342,0.345,0.109,0.18,0.252,0.309,0.283,0.272,0.298,0.172,0.218,0.255,0.284,0.265,0.236,0.273,0.254,0.247,0.247,0.224,0.246,0.299,0.285,0.282,0.209,0.257,0.304,0.299,0.109,0.248,0.268,0.236,0.284,0.302,0.29,0.34,0.351,0.355,0.187,0.091,0.298,0.223,0.291,0.312,0.314,0.194,0.311,0.307,0.306,0.325,0.32,0.318,0.322,0.31,0.319,0.324,0.317,0.247,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.2,0.324,0.328,0.324,0.342,0.345,0.109,0.18,0.252,0.309,0.283,0.272,0.298,0.172,0.218,0.255,0.284,0.265,0.236,0.273,0.254,0.247,0.247,0.224,0.246,0.299,0.285,0.282,0.209,0.257,0.304,0.299,0.109,0.248,0.268,0.236,0.284,0.302,0.29,0.34,0.351,0.355,0.187,0.091,0.298,0.223,0.291,0.312,0.314,0.194,0.311,0.307,0.306,0.325,0.32,0.318,0.322,0.31,0.319,0.324,0.317,0.247,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.2,0.324,0.328,0.324,0.342,0.345,0.109,0.18,0.252,0.309,0.283,0.272,0.298,0.172,0.218,0.255,0.284,0.265,0.236,0.273,0.254,0.247,0.247,0.224,0.246,0.299,0.285,0.282,0.209,0.257,0.304,0.299,0.109,0.248,0.268,0.236,0.284,0.302,0.29,0.34,0.351,0.355,0.187,0.091,0.298,0.223,0.291,0.312,0.314,0.194,0.311,0.307,0.306,0.325,0.32,0.318,0.322,0.31,0.319,0.324,0.317,0.247,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.2,0.324,0.328,0.324,0.342,0.345,0.109,0.18,0.252,0.309,0.283,0.272,0.298,0.172,0.218,0.255,0.284,0.265,0.236,0.273,0.254,0.247,0.247,0.224,0.246,0.299,0.285,0.282,0.209,0.257,0.304,0.299,0.109,0.248,0.268,0.236,0.284,0.302,0.29,0.34,0.351,0.355,0.187,0.091,0.298,0.223,0.291,0.312,0.314,0.194,0.311,0.307,0.306,0.325,0.32,0.318,0.322,0.31,0.319,0.324,0.317,0.247,OrganismalFitness,BLAT_ECOLX,High,Prokaryote
+0.223,0.196,0.234,0.245,0.217,0.239,0.136,0.136,0.201,0.207,0.168,0.152,0.163,0.152,0.12,0.147,0.168,0.212,0.13,0.158,0.136,0.212,0.223,0.158,0.207,0.228,0.239,0.185,0.179,0.196,0.212,0.19,0.12,0.13,0.147,0.179,0.207,0.217,0.223,0.255,0.25,0.212,0.141,0.168,0.228,0.179,0.185,0.228,0.13,0.141,0.174,0.185,0.168,0.163,0.163,0.152,0.158,0.179,0.163,0.163,0.147,0.168,OrganismalFitness,BRCA1_HUMAN,Low,Human
+0.259,0.185,0.148,0.185,0.259,0.185,0.148,0.111,0.111,0.148,0.148,0.111,0.037,0.111,0.148,0.185,0.185,0.185,0.111,0.037,0.185,0.259,0.296,0.148,0.222,0.148,0.111,0.259,0.074,0.111,0.148,0.222,0.111,0.185,0.222,0.185,0.148,0.185,0.185,0.185,0.185,0.185,0.111,0.148,0.185,0.185,0.148,0.111,0.037,0.111,0.222,0.185,0.148,0.111,0.222,0.148,0.111,0.148,0.222,0.148,0.074,0.111,OrganismalFitness,BRCA2_HUMAN,,Human
+0.255,0.295,0.288,0.283,0.302,0.305,0.088,0.247,0.298,0.301,0.098,0.27,0.331,0.089,0.092,0.097,0.25,0.271,0.305,0.251,0.262,0.254,0.266,0.242,0.143,0.264,0.273,0.294,0.242,0.3,0.271,0.225,0.125,0.243,0.269,0.257,0.28,0.287,0.285,0.302,0.309,0.304,0.097,0.092,0.172,0.099,0.236,0.294,0.276,0.177,0.262,0.261,0.269,0.267,0.262,0.246,0.259,0.265,0.253,0.266,0.181,0.157,OrganismalFitness,C6KNH7_9INFA,Medium,Virus
+0.088,0.082,0.066,0.055,0.055,0.055,0.099,0.049,0.071,0.049,0.055,0.077,0.071,0.11,0.093,0.077,0.055,0.038,0.099,0.055,0.082,0.077,0.077,0.06,0.06,0.055,0.049,0.077,0.038,0.038,0.049,0.055,0.088,0.099,0.077,0.038,0.071,0.082,0.049,0.066,0.055,0.06,0.06,0.104,0.066,0.071,0.115,0.066,0.06,0.121,0.077,0.071,0.077,0.071,0.077,0.066,0.066,0.077,0.071,0.077,0.06,0.077,OrganismalFitness,CALM1_HUMAN,High,Human
+0.263,0.258,0.246,0.279,0.258,0.262,0.174,0.265,0.233,0.254,0.136,0.121,0.129,0.131,0.147,0.134,0.165,0.148,0.112,0.174,0.122,0.129,0.133,0.153,0.119,0.122,0.154,0.126,0.18,0.272,0.13,0.12,0.098,0.133,0.142,0.263,0.23,0.231,0.275,0.253,0.255,0.273,0.128,0.163,0.19,0.147,0.176,0.188,0.145,0.156,0.162,0.145,0.133,0.145,0.125,0.133,0.142,0.14,0.175,0.147,0.152,0.118,OrganismalFitness,CAPSD_AAV2S,Low,Virus
+0.069,0.073,0.071,0.069,0.071,0.071,0.214,0.048,0.081,0.081,0.069,0.124,0.1,0.122,0.106,0.176,0.102,0.107,0.094,0.108,0.119,0.064,0.096,0.076,0.05,0.074,0.084,0.064,0.109,0.054,0.073,0.06,0.082,0.159,0.08,0.065,0.076,0.076,0.06,0.078,0.08,0.076,0.12,0.092,0.09,0.167,0.098,0.077,0.134,0.124,0.159,0.128,0.132,0.124,0.142,0.13,0.144,0.138,0.13,0.128,0.188,0.18,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.069,0.073,0.071,0.069,0.071,0.071,0.214,0.048,0.081,0.081,0.069,0.124,0.1,0.122,0.106,0.176,0.102,0.107,0.094,0.108,0.119,0.064,0.096,0.076,0.05,0.074,0.084,0.064,0.109,0.054,0.073,0.06,0.082,0.159,0.08,0.065,0.076,0.076,0.06,0.078,0.08,0.076,0.12,0.092,0.09,0.167,0.098,0.077,0.134,0.124,0.159,0.128,0.132,0.124,0.142,0.13,0.144,0.138,0.13,0.128,0.188,0.18,OrganismalFitness,CAR11_HUMAN,Low,Human
+0.096,0.112,0.111,0.105,0.111,0.112,0.096,0.106,0.116,0.117,0.11,0.096,0.106,0.097,0.103,0.126,0.106,0.117,0.11,0.097,0.099,0.096,0.117,0.116,0.1,0.112,0.096,0.111,0.101,0.105,0.117,0.117,0.111,0.091,0.099,0.128,0.111,0.116,0.123,0.101,0.101,0.117,0.095,0.091,0.111,0.099,0.099,0.111,0.103,0.094,0.118,0.107,0.115,0.102,0.111,0.111,0.099,0.112,0.1,0.11,0.127,0.115,Activity,CAS9_STRP1,Medium,Prokaryote
+0.159,0.204,0.153,0.115,0.134,0.153,0.089,0.159,0.21,0.223,0.146,0.115,0.153,0.089,0.146,0.166,0.178,0.146,0.21,0.159,0.146,0.159,0.159,0.153,0.14,0.159,0.178,0.172,0.204,0.146,0.229,0.223,0.115,0.102,0.14,0.172,0.134,0.166,0.166,0.146,0.172,0.159,0.096,0.076,0.159,0.14,0.159,0.166,0.127,0.102,0.185,0.178,0.191,0.14,0.172,0.166,0.185,0.166,0.172,0.166,0.146,0.134,Activity,CASP3_HUMAN,High,Human
+0.149,0.202,0.238,0.214,0.25,0.244,0.06,0.155,0.226,0.25,0.268,0.19,0.232,0.107,0.208,0.274,0.208,0.196,0.19,0.238,0.095,0.214,0.155,0.167,0.202,0.179,0.179,0.167,0.196,0.226,0.167,0.155,0.107,0.083,0.19,0.137,0.208,0.226,0.19,0.22,0.232,0.244,0.185,0.06,0.262,0.274,0.238,0.28,0.244,0.19,0.196,0.214,0.244,0.232,0.238,0.226,0.214,0.22,0.226,0.22,0.226,0.208,Activity,CASP7_HUMAN,Medium,Human
+0.272,0.147,0.147,0.152,0.183,0.162,0.236,0.094,0.131,0.136,0.115,0.173,0.147,0.225,0.257,0.209,0.152,0.157,0.131,0.089,0.12,0.126,0.11,0.105,0.073,0.089,0.089,0.073,0.105,0.152,0.126,0.173,0.183,0.131,0.11,0.099,0.199,0.173,0.126,0.168,0.152,0.157,0.194,0.183,0.084,0.126,0.314,0.131,0.183,0.293,0.183,0.115,0.141,0.152,0.141,0.147,0.157,0.141,0.141,0.157,0.099,0.257,Stability,CATR_CHLRE,High,Eukaryote
+0.227,0.159,0.227,0.227,0.237,0.237,0.222,0.184,0.174,0.198,0.304,0.242,0.256,0.227,0.246,0.251,0.222,0.227,0.246,0.237,0.174,0.193,0.261,0.251,0.29,0.261,0.285,0.246,0.213,0.184,0.184,0.14,0.155,0.237,0.29,0.285,0.227,0.242,0.246,0.261,0.237,0.261,0.29,0.261,0.275,0.29,0.295,0.309,0.242,0.285,0.179,0.242,0.246,0.217,0.237,0.232,0.184,0.203,0.203,0.232,0.237,0.309,Stability,CBPA2_HUMAN,Medium,Human
+0.168,0.161,0.169,0.166,0.176,0.175,0.1,0.141,0.173,0.162,0.163,0.15,0.162,0.098,0.127,0.127,0.132,0.145,0.157,0.161,0.143,0.133,0.166,0.173,0.147,0.152,0.154,0.163,0.152,0.163,0.145,0.13,0.109,0.15,0.141,0.151,0.158,0.177,0.194,0.172,0.18,0.179,0.13,0.102,0.158,0.15,0.147,0.147,0.169,0.116,0.139,0.136,0.147,0.141,0.145,0.151,0.15,0.154,0.155,0.151,0.147,0.127,OrganismalFitness,CBS_HUMAN,Medium,Human
+0.437,0.406,0.441,0.445,0.472,0.48,0.013,0.31,0.445,0.424,0.428,0.397,0.402,0.022,0.533,0.533,0.48,0.445,0.41,0.445,0.306,0.341,0.328,0.332,0.376,0.358,0.371,0.341,0.332,0.445,0.432,0.376,0.192,0.301,0.328,0.341,0.45,0.402,0.441,0.48,0.424,0.441,0.437,0.096,0.319,0.406,0.371,0.262,0.52,0.467,0.507,0.48,0.485,0.48,0.52,0.459,0.472,0.472,0.476,0.507,0.437,0.507,Stability,CBX4_HUMAN,High,Human
+0.12,0.138,0.135,0.136,0.121,0.126,0.093,0.112,0.094,0.118,0.13,0.084,0.096,0.097,0.105,0.104,0.138,0.118,0.134,0.113,0.078,0.068,0.075,0.103,0.051,0.102,0.088,0.124,0.118,0.112,0.162,0.145,0.088,0.072,0.086,0.124,0.118,0.122,0.138,0.128,0.134,0.135,0.096,0.082,0.116,0.11,0.086,0.138,0.124,0.103,0.148,0.118,0.144,0.166,0.154,0.146,0.142,0.147,0.132,0.146,0.138,0.104,Activity,CCDB_ECOLI,High,Prokaryote
+0.12,0.138,0.135,0.136,0.121,0.126,0.093,0.112,0.094,0.118,0.13,0.084,0.096,0.097,0.105,0.104,0.138,0.118,0.134,0.113,0.078,0.068,0.075,0.103,0.051,0.102,0.088,0.124,0.118,0.112,0.162,0.145,0.088,0.072,0.086,0.124,0.118,0.122,0.138,0.128,0.134,0.135,0.096,0.082,0.116,0.11,0.086,0.138,0.124,0.103,0.148,0.118,0.144,0.166,0.154,0.146,0.142,0.147,0.132,0.146,0.138,0.104,OrganismalFitness,CCDB_ECOLI,High,Prokaryote
+0.146,0.143,0.156,0.164,0.156,0.14,0.114,0.231,0.164,0.165,0.169,0.146,0.154,0.124,0.135,0.148,0.141,0.154,0.138,0.17,0.13,0.196,0.167,0.161,0.157,0.196,0.18,0.196,0.178,0.178,0.141,0.125,0.119,0.167,0.175,0.167,0.156,0.159,0.169,0.144,0.148,0.148,0.135,0.079,0.183,0.138,0.133,0.177,0.14,0.132,0.148,0.148,0.143,0.128,0.127,0.138,0.135,0.144,0.125,0.135,0.172,0.151,Binding,CCR5_HUMAN,High,Human
+0.167,0.127,0.164,0.162,0.146,0.138,0.143,0.122,0.103,0.109,0.18,0.167,0.196,0.186,0.228,0.175,0.167,0.183,0.138,0.09,0.194,0.162,0.207,0.162,0.149,0.225,0.159,0.204,0.149,0.162,0.13,0.101,0.127,0.241,0.252,0.167,0.191,0.18,0.156,0.164,0.154,0.151,0.215,0.183,0.218,0.212,0.34,0.321,0.324,0.138,0.228,0.191,0.202,0.215,0.218,0.218,0.236,0.21,0.228,0.218,0.353,0.302,Binding,CD19_HUMAN,Low,Human
+0.267,0.302,0.306,0.319,0.334,0.332,0.275,0.296,0.294,0.33,0.29,0.328,0.344,0.251,0.324,0.326,0.343,0.33,0.316,0.345,0.316,0.28,0.304,0.279,0.322,0.301,0.302,0.29,0.295,0.295,0.246,0.205,0.114,0.326,0.32,0.306,0.336,0.328,0.33,0.34,0.336,0.331,0.329,0.134,0.309,0.333,0.295,0.304,0.331,0.178,0.339,0.321,0.328,0.334,0.332,0.341,0.333,0.336,0.337,0.336,0.34,0.314,Expression,CP2C9_HUMAN,High,Human
+0.267,0.302,0.306,0.319,0.334,0.332,0.275,0.296,0.294,0.33,0.29,0.328,0.344,0.251,0.324,0.326,0.343,0.33,0.316,0.345,0.316,0.28,0.304,0.279,0.322,0.301,0.302,0.29,0.295,0.295,0.246,0.205,0.114,0.326,0.32,0.306,0.336,0.328,0.33,0.34,0.336,0.331,0.329,0.134,0.309,0.333,0.295,0.304,0.331,0.178,0.339,0.321,0.328,0.334,0.332,0.341,0.333,0.336,0.337,0.336,0.34,0.314,Binding,CP2C9_HUMAN,High,Human
+0.358,0.33,0.327,0.339,0.339,0.342,0.176,0.309,0.379,0.361,0.355,0.394,0.418,0.27,0.464,0.397,0.248,0.336,0.324,0.327,0.318,0.339,0.348,0.355,0.224,0.376,0.409,0.352,0.333,0.33,0.348,0.321,0.203,0.236,0.358,0.358,0.385,0.385,0.373,0.355,0.367,0.358,0.248,0.224,0.336,0.394,0.421,0.33,0.467,0.455,0.342,0.352,0.339,0.336,0.333,0.339,0.342,0.336,0.318,0.336,0.367,0.452,Stability,CSN4_MOUSE,Medium,Eukaryote
+0.297,0.323,0.316,0.316,0.342,0.329,0.158,0.278,0.361,0.361,0.304,0.278,0.209,0.146,0.19,0.259,0.291,0.304,0.304,0.234,0.133,0.146,0.171,0.108,0.101,0.114,0.184,0.184,0.31,0.259,0.278,0.209,0.165,0.108,0.12,0.177,0.297,0.291,0.316,0.329,0.335,0.342,0.196,0.127,0.259,0.152,0.329,0.329,0.354,0.348,0.278,0.342,0.31,0.342,0.278,0.297,0.316,0.285,0.31,0.31,0.361,0.304,Stability,CUE1_YEAST,Medium,Eukaryote
+0.25,0.361,0.315,0.293,0.336,0.333,0.098,0.222,0.369,0.381,0.237,0.097,0.1,0.097,0.092,0.088,0.085,0.097,0.103,0.233,0.079,0.061,0.092,0.084,0.056,0.081,0.08,0.108,0.147,0.337,0.32,0.314,0.056,0.079,0.098,0.104,0.293,0.295,0.29,0.334,0.336,0.33,0.135,0.131,0.137,0.138,0.257,0.249,0.305,0.23,0.274,0.281,0.286,0.283,0.277,0.283,0.269,0.286,0.268,0.285,0.198,0.136,Activity,D7PM05_CLYGR,Low,Eukaryote
+0.322,0.319,0.328,0.322,0.338,0.34,0.314,0.305,0.282,0.292,0.268,0.321,0.318,0.378,0.423,0.37,0.315,0.318,0.279,0.348,0.284,0.289,0.278,0.277,0.287,0.292,0.291,0.297,0.265,0.321,0.274,0.285,0.272,0.287,0.308,0.287,0.331,0.36,0.351,0.345,0.354,0.35,0.285,0.209,0.249,0.287,0.385,0.285,0.341,0.216,0.284,0.256,0.282,0.291,0.282,0.302,0.262,0.282,0.277,0.282,0.317,0.398,OrganismalFitness,DLG4_HUMAN,Low,Human
+0.158,0.215,0.158,0.139,0.165,0.171,0.184,0.133,0.127,0.165,0.184,0.133,0.127,0.108,0.139,0.177,0.101,0.127,0.133,0.101,0.108,0.108,0.101,0.114,0.089,0.127,0.095,0.108,0.127,0.133,0.203,0.234,0.12,0.114,0.133,0.108,0.133,0.133,0.12,0.133,0.133,0.158,0.158,0.089,0.146,0.133,0.19,0.127,0.108,0.133,0.108,0.076,0.063,0.089,0.089,0.082,0.076,0.095,0.095,0.082,0.095,0.133,Binding,DLG4_RAT,Low,Eukaryote
+0.168,0.149,0.139,0.119,0.109,0.129,0.089,0.109,0.228,0.287,0.119,0.089,0.119,0.089,0.178,0.198,0.248,0.267,0.218,0.228,0.079,0.089,0.069,0.109,0.119,0.149,0.079,0.119,0.218,0.208,0.129,0.059,0.168,0.099,0.099,0.079,0.168,0.168,0.158,0.149,0.158,0.119,0.139,0.089,0.188,0.168,0.287,0.317,0.267,0.307,0.416,0.366,0.347,0.277,0.307,0.297,0.297,0.228,0.327,0.356,0.218,0.277,Stability,DN7A_SACS2,Medium,Prokaryote
+0.3,0.282,0.295,0.295,0.313,0.313,0.322,0.264,0.3,0.308,0.33,0.322,0.374,0.388,0.485,0.432,0.449,0.392,0.401,0.317,0.308,0.317,0.357,0.37,0.313,0.273,0.3,0.308,0.291,0.264,0.189,0.198,0.242,0.282,0.26,0.361,0.304,0.308,0.317,0.308,0.3,0.313,0.383,0.256,0.295,0.352,0.427,0.348,0.454,0.463,0.467,0.507,0.511,0.485,0.485,0.489,0.48,0.515,0.458,0.502,0.304,0.383,Stability,DNJA1_HUMAN,High,Human
+0.342,0.325,0.329,0.295,0.288,0.318,0.099,0.25,0.223,0.192,0.305,0.185,0.233,0.027,0.192,0.233,0.305,0.305,0.274,0.332,0.24,0.284,0.291,0.26,0.103,0.223,0.134,0.195,0.312,0.164,0.318,0.291,0.247,0.075,0.075,0.048,0.325,0.356,0.329,0.322,0.315,0.322,0.134,0.058,0.325,0.192,0.384,0.377,0.363,0.445,0.349,0.363,0.366,0.39,0.353,0.38,0.366,0.363,0.339,0.373,0.404,0.37,Stability,DOCK1_MOUSE,High,Eukaryote
+0.148,0.114,0.114,0.114,0.113,0.099,0.113,0.115,0.155,0.14,0.153,0.174,0.158,0.115,0.161,0.162,0.169,0.165,0.156,0.168,0.136,0.11,0.135,0.116,0.148,0.115,0.124,0.125,0.11,0.101,0.132,0.103,0.031,0.092,0.091,0.134,0.13,0.113,0.122,0.101,0.096,0.102,0.146,0.098,0.148,0.16,0.148,0.157,0.166,0.14,0.164,0.174,0.162,0.164,0.174,0.188,0.177,0.172,0.193,0.176,0.173,0.193,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.148,0.114,0.114,0.114,0.113,0.099,0.113,0.115,0.155,0.14,0.153,0.174,0.158,0.115,0.161,0.162,0.169,0.165,0.156,0.168,0.136,0.11,0.135,0.116,0.148,0.115,0.124,0.125,0.11,0.101,0.132,0.103,0.031,0.092,0.091,0.134,0.13,0.113,0.122,0.101,0.096,0.102,0.146,0.098,0.148,0.16,0.148,0.157,0.166,0.14,0.164,0.174,0.162,0.164,0.174,0.188,0.177,0.172,0.193,0.176,0.173,0.193,OrganismalFitness,DYR_ECOLI,High,Prokaryote
+0.148,0.114,0.114,0.114,0.113,0.099,0.113,0.115,0.155,0.14,0.153,0.174,0.158,0.115,0.161,0.162,0.169,0.165,0.156,0.168,0.136,0.11,0.135,0.116,0.148,0.115,0.124,0.125,0.11,0.101,0.132,0.103,0.031,0.092,0.091,0.134,0.13,0.113,0.122,0.101,0.096,0.102,0.146,0.098,0.148,0.16,0.148,0.157,0.166,0.14,0.164,0.174,0.162,0.164,0.174,0.188,0.177,0.172,0.193,0.176,0.173,0.193,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.148,0.114,0.114,0.114,0.113,0.099,0.113,0.115,0.155,0.14,0.153,0.174,0.158,0.115,0.161,0.162,0.169,0.165,0.156,0.168,0.136,0.11,0.135,0.116,0.148,0.115,0.124,0.125,0.11,0.101,0.132,0.103,0.031,0.092,0.091,0.134,0.13,0.113,0.122,0.101,0.096,0.102,0.146,0.098,0.148,0.16,0.148,0.157,0.166,0.14,0.164,0.174,0.162,0.164,0.174,0.188,0.177,0.172,0.193,0.176,0.173,0.193,OrganismalFitness,DYR_ECOLI,Medium,Prokaryote
+0.133,0.115,0.124,0.106,0.115,0.097,0.097,0.115,0.124,0.115,0.08,0.088,0.071,0.097,0.124,0.08,0.071,0.088,0.08,0.08,0.097,0.053,0.106,0.062,0.133,0.062,0.088,0.08,0.062,0.088,0.08,0.115,0.133,0.106,0.097,0.062,0.106,0.097,0.088,0.115,0.106,0.106,0.124,0.088,0.115,0.053,0.062,0.097,0.071,0.08,0.088,0.088,0.088,0.08,0.08,0.062,0.097,0.08,0.08,0.08,0.071,0.088,Activity,ENVZ_ECOLI,High,Prokaryote
+0.4,0.35,0.3,0.3,0.35,0.35,0.025,0.375,0.375,0.35,0.3,0.4,0.35,0.025,0.025,0.05,0.075,0.05,0.175,0.4,0.275,0.35,0.375,0.35,0.25,0.4,0.3,0.375,0.4,0.3,0.425,0.375,0.325,0.325,0.375,0.425,0.35,0.375,0.375,0.375,0.4,0.4,0.3,0.0,0.35,0.325,0.325,0.3,0.175,0.15,0.125,0.275,0.225,0.15,0.275,0.2,0.175,0.225,0.175,0.175,0.1,0.125,OrganismalFitness,ENV_HV1B9,Medium,Virus
+0.206,0.211,0.227,0.234,0.231,0.228,0.084,0.214,0.203,0.206,0.155,0.193,0.191,0.099,0.103,0.099,0.108,0.113,0.13,0.239,0.198,0.212,0.233,0.239,0.189,0.218,0.233,0.211,0.237,0.171,0.198,0.172,0.128,0.2,0.215,0.225,0.211,0.213,0.221,0.23,0.232,0.235,0.165,0.076,0.196,0.175,0.139,0.193,0.104,0.113,0.123,0.134,0.145,0.144,0.15,0.141,0.134,0.133,0.135,0.145,0.129,0.113,OrganismalFitness,ENV_HV1BR,Medium,Virus
+0.321,0.327,0.383,0.367,0.388,0.388,0.005,0.296,0.372,0.372,0.311,0.327,0.301,0.015,0.327,0.347,0.372,0.388,0.347,0.357,0.347,0.347,0.306,0.321,0.327,0.316,0.372,0.352,0.337,0.281,0.311,0.235,0.235,0.357,0.301,0.357,0.398,0.352,0.383,0.403,0.362,0.378,0.26,0.112,0.327,0.327,0.444,0.378,0.423,0.408,0.383,0.398,0.388,0.429,0.403,0.403,0.388,0.398,0.408,0.393,0.423,0.429,Stability,EPHB2_HUMAN,High,Human
+0.121,0.242,0.121,0.152,0.212,0.212,0.152,0.182,0.242,0.303,0.121,0.121,0.182,0.182,0.182,0.152,0.242,0.182,0.212,0.091,0.152,0.182,0.182,0.182,0.152,0.182,0.212,0.273,0.182,0.152,0.091,0.121,0.182,0.152,0.091,0.182,0.182,0.121,0.182,0.212,0.212,0.182,0.152,0.182,0.212,0.121,0.212,0.182,0.03,0.091,0.152,0.152,0.182,0.152,0.121,0.121,0.121,0.182,0.212,0.121,0.121,0.152,Expression,ERBB2_HUMAN,Low,Human
+0.234,0.271,0.275,0.266,0.252,0.261,0.22,0.188,0.202,0.284,0.229,0.28,0.28,0.229,0.243,0.22,0.225,0.197,0.225,0.234,0.257,0.261,0.284,0.22,0.307,0.252,0.294,0.312,0.257,0.206,0.179,0.193,0.087,0.243,0.229,0.165,0.261,0.248,0.161,0.289,0.284,0.243,0.294,0.206,0.284,0.275,0.427,0.353,0.298,0.28,0.284,0.234,0.275,0.289,0.312,0.294,0.28,0.248,0.271,0.289,0.225,0.271,Stability,ESTA_BACSU,High,Prokaryote
+0.087,0.609,0.564,0.672,0.626,0.642,0.095,0.448,0.492,0.523,0.62,0.503,0.528,0.063,0.067,0.112,0.48,0.399,0.618,0.567,0.065,0.028,0.062,0.093,0.103,0.253,0.063,0.426,0.625,0.554,0.699,0.659,0.068,0.062,0.053,0.69,0.092,0.072,0.647,0.517,0.493,0.68,0.1,0.084,0.316,0.109,0.096,0.383,0.136,0.08,0.624,0.657,0.601,0.677,0.649,0.667,0.601,0.614,0.639,0.647,0.323,0.028,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.087,0.609,0.564,0.672,0.626,0.642,0.095,0.448,0.492,0.523,0.62,0.503,0.528,0.063,0.067,0.112,0.48,0.399,0.618,0.567,0.065,0.028,0.062,0.093,0.103,0.253,0.063,0.426,0.625,0.554,0.699,0.659,0.068,0.062,0.053,0.69,0.092,0.072,0.647,0.517,0.493,0.68,0.1,0.084,0.316,0.109,0.096,0.383,0.136,0.08,0.624,0.657,0.601,0.677,0.649,0.667,0.601,0.614,0.639,0.647,0.323,0.028,OrganismalFitness,F7YBW8_MESOW,High,Prokaryote
+0.212,0.243,0.233,0.217,0.249,0.233,0.111,0.116,0.228,0.275,0.275,0.228,0.243,0.116,0.286,0.328,0.296,0.249,0.254,0.222,0.185,0.175,0.169,0.169,0.159,0.185,0.249,0.238,0.19,0.201,0.185,0.159,0.164,0.122,0.153,0.175,0.217,0.18,0.212,0.212,0.206,0.217,0.291,0.138,0.28,0.286,0.312,0.27,0.265,0.286,0.254,0.265,0.291,0.265,0.291,0.249,0.249,0.275,0.27,0.249,0.296,0.291,Stability,FECA_ECOLI,High,Prokaryote
+0.153,0.177,0.169,0.177,0.21,0.202,0.04,0.097,0.089,0.089,0.073,0.081,0.065,0.056,0.056,0.073,0.105,0.089,0.089,0.073,0.065,0.097,0.024,0.177,0.065,0.185,0.121,0.137,0.089,0.145,0.081,0.081,0.081,0.048,0.097,0.145,0.169,0.153,0.169,0.185,0.177,0.169,0.056,0.073,0.121,0.081,0.218,0.258,0.282,0.234,0.218,0.274,0.234,0.177,0.226,0.25,0.202,0.202,0.194,0.234,0.218,0.177,Stability,FKBP3_HUMAN,Medium,Human
+0.075,0.1,0.25,0.175,0.183,0.192,0.092,0.1,0.25,0.208,0.2,0.158,0.142,0.108,0.167,0.158,0.242,0.258,0.242,0.158,0.133,0.092,0.15,0.15,0.142,0.125,0.15,0.158,0.267,0.183,0.242,0.267,0.158,0.092,0.15,0.1,0.167,0.15,0.167,0.183,0.192,0.217,0.117,0.192,0.208,0.125,0.075,0.2,0.075,0.083,0.267,0.2,0.275,0.242,0.217,0.217,0.275,0.217,0.242,0.25,0.175,0.183,OrganismalFitness,GAL4_YEAST,Medium,Eukaryote
+0.242,0.223,0.201,0.231,0.227,0.22,0.091,0.117,0.227,0.22,0.197,0.178,0.182,0.174,0.14,0.186,0.227,0.22,0.223,0.22,0.064,0.064,0.076,0.064,0.053,0.042,0.053,0.061,0.144,0.22,0.208,0.212,0.095,0.053,0.049,0.223,0.231,0.231,0.239,0.223,0.22,0.246,0.148,0.14,0.152,0.186,0.17,0.216,0.152,0.148,0.193,0.182,0.186,0.186,0.197,0.208,0.201,0.197,0.197,0.197,0.136,0.125,Binding,GCN4_YEAST,Low,Eukaryote
+0.121,0.112,0.129,0.138,0.138,0.147,0.155,0.129,0.121,0.129,0.138,0.112,0.121,0.069,0.086,0.121,0.138,0.138,0.138,0.138,0.095,0.164,0.155,0.19,0.121,0.164,0.138,0.129,0.138,0.147,0.172,0.181,0.112,0.121,0.103,0.112,0.138,0.121,0.129,0.138,0.121,0.147,0.112,0.052,0.155,0.103,0.172,0.164,0.147,0.078,0.138,0.147,0.147,0.164,0.164,0.181,0.155,0.138,0.164,0.155,0.138,0.121,OrganismalFitness,GDIA_HUMAN,Low,Human
+0.277,0.275,0.28,0.279,0.287,0.287,0.061,0.256,0.297,0.298,0.188,0.083,0.086,0.085,0.092,0.079,0.078,0.09,0.112,0.241,0.074,0.073,0.089,0.08,0.066,0.098,0.118,0.259,0.268,0.296,0.238,0.239,0.087,0.069,0.093,0.249,0.285,0.283,0.279,0.288,0.289,0.304,0.128,0.11,0.134,0.134,0.279,0.28,0.308,0.254,0.235,0.239,0.25,0.253,0.243,0.251,0.241,0.245,0.233,0.247,0.262,0.188,Activity,GFP_AEQVI,Low,Eukaryote
+0.32,0.2,0.36,0.36,0.36,0.36,0.28,0.28,0.2,0.2,0.2,0.4,0.32,0.28,0.28,0.32,0.36,0.32,0.2,0.2,0.32,0.28,0.36,0.32,0.24,0.36,0.32,0.4,0.24,0.2,0.28,0.24,0.24,0.28,0.32,0.36,0.4,0.4,0.44,0.36,0.36,0.36,0.28,0.24,0.4,0.32,0.28,0.2,0.32,0.16,0.48,0.44,0.32,0.36,0.4,0.28,0.32,0.48,0.36,0.4,0.32,0.4,Expression,GLPA_HUMAN,Low,Human
+0.303,0.36,0.354,0.363,0.376,0.384,0.317,0.288,0.38,0.339,0.365,0.305,0.345,0.321,0.384,0.425,0.442,0.346,0.393,0.366,0.371,0.357,0.328,0.31,0.343,0.347,0.287,0.381,0.283,0.345,0.281,0.266,0.343,0.377,0.362,0.245,0.381,0.376,0.291,0.399,0.402,0.375,0.398,0.196,0.35,0.362,0.46,0.337,0.491,0.345,0.416,0.437,0.415,0.427,0.438,0.434,0.417,0.437,0.412,0.439,0.359,0.402,OrganismalFitness,GRB2_HUMAN,Medium,Human
+0.269,0.26,0.192,0.212,0.202,0.221,0.192,0.192,0.125,0.192,0.346,0.269,0.317,0.221,0.269,0.298,0.337,0.231,0.221,0.106,0.125,0.183,0.212,0.26,0.231,0.26,0.221,0.212,0.26,0.212,0.231,0.163,0.173,0.106,0.163,0.24,0.269,0.279,0.26,0.231,0.25,0.25,0.26,0.173,0.298,0.212,0.25,0.337,0.269,0.202,0.298,0.317,0.356,0.317,0.356,0.337,0.337,0.327,0.317,0.356,0.423,0.279,Stability,HCP_LAMBD,Medium,Virus
+0.581,0.547,0.619,0.633,0.649,0.651,0.222,0.469,0.606,0.623,0.599,0.174,0.229,0.143,0.152,0.608,0.617,0.605,0.614,0.581,0.191,0.503,0.538,0.538,0.501,0.549,0.544,0.565,0.533,0.596,0.642,0.581,0.109,0.199,0.168,0.333,0.606,0.574,0.606,0.63,0.617,0.623,0.243,0.231,0.608,0.343,0.46,0.592,0.474,0.345,0.639,0.606,0.628,0.615,0.644,0.653,0.63,0.639,0.646,0.644,0.673,0.22,Stability,HECD1_HUMAN,Medium,Human
+0.165,0.174,0.172,0.172,0.174,0.174,0.121,0.063,0.181,0.178,0.155,0.167,0.146,0.09,0.172,0.183,0.155,0.169,0.163,0.072,0.174,0.174,0.174,0.167,0.16,0.172,0.163,0.16,0.176,0.183,0.158,0.156,0.109,0.172,0.193,0.165,0.163,0.19,0.158,0.185,0.181,0.172,0.121,0.098,0.165,0.158,0.163,0.188,0.139,0.144,0.167,0.172,0.169,0.167,0.165,0.17,0.172,0.172,0.17,0.17,0.167,0.178,Activity,HEM3_HUMAN,Medium,Human
+0.165,0.202,0.209,0.209,0.203,0.199,0.133,0.086,0.188,0.191,0.204,0.174,0.185,0.1,0.13,0.179,0.2,0.202,0.194,0.17,0.175,0.198,0.225,0.211,0.203,0.213,0.191,0.194,0.191,0.177,0.199,0.202,0.088,0.211,0.212,0.197,0.19,0.188,0.186,0.209,0.205,0.202,0.148,0.132,0.197,0.109,0.182,0.209,0.188,0.154,0.196,0.178,0.19,0.193,0.194,0.191,0.193,0.196,0.196,0.193,0.203,0.107,OrganismalFitness,HIS7_YEAST,Medium,Eukaryote
+0.135,0.122,0.124,0.12,0.128,0.122,0.133,0.193,0.121,0.122,0.126,0.13,0.123,0.122,0.123,0.136,0.149,0.13,0.133,0.111,0.137,0.108,0.116,0.125,0.143,0.106,0.108,0.108,0.124,0.12,0.127,0.127,0.114,0.124,0.111,0.12,0.125,0.125,0.122,0.116,0.12,0.12,0.124,0.117,0.161,0.16,0.132,0.129,0.105,0.126,0.145,0.129,0.14,0.149,0.144,0.15,0.143,0.151,0.14,0.139,0.155,0.148,OrganismalFitness,HMDH_HUMAN,Low,Human
+0.129,0.139,0.143,0.153,0.144,0.148,0.118,0.141,0.142,0.148,0.144,0.147,0.148,0.083,0.089,0.112,0.147,0.153,0.127,0.123,0.164,0.156,0.165,0.15,0.163,0.161,0.152,0.163,0.152,0.145,0.171,0.17,0.099,0.154,0.158,0.156,0.142,0.149,0.144,0.149,0.152,0.151,0.134,0.074,0.149,0.153,0.113,0.148,0.093,0.119,0.14,0.134,0.13,0.133,0.152,0.144,0.14,0.145,0.142,0.144,0.141,0.154,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.129,0.139,0.143,0.153,0.144,0.148,0.118,0.141,0.142,0.148,0.144,0.147,0.148,0.083,0.089,0.112,0.147,0.153,0.127,0.123,0.164,0.156,0.165,0.15,0.163,0.161,0.152,0.163,0.152,0.145,0.171,0.17,0.099,0.154,0.158,0.156,0.142,0.149,0.144,0.149,0.152,0.151,0.134,0.074,0.149,0.153,0.113,0.148,0.093,0.119,0.14,0.134,0.13,0.133,0.152,0.144,0.14,0.145,0.142,0.144,0.141,0.154,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.129,0.139,0.143,0.153,0.144,0.148,0.118,0.141,0.142,0.148,0.144,0.147,0.148,0.083,0.089,0.112,0.147,0.153,0.127,0.123,0.164,0.156,0.165,0.15,0.163,0.161,0.152,0.163,0.152,0.145,0.171,0.17,0.099,0.154,0.158,0.156,0.142,0.149,0.144,0.149,0.152,0.151,0.134,0.074,0.149,0.153,0.113,0.148,0.093,0.119,0.14,0.134,0.13,0.133,0.152,0.144,0.14,0.145,0.142,0.144,0.141,0.154,OrganismalFitness,HSP82_YEAST,Medium,Eukaryote
+0.192,0.16,0.156,0.162,0.155,0.158,0.092,0.144,0.152,0.163,0.145,0.144,0.164,0.104,0.12,0.16,0.174,0.159,0.156,0.152,0.153,0.152,0.163,0.162,0.156,0.162,0.174,0.16,0.148,0.148,0.149,0.142,0.098,0.154,0.155,0.17,0.166,0.156,0.171,0.16,0.151,0.162,0.129,0.106,0.152,0.154,0.168,0.157,0.161,0.136,0.162,0.168,0.162,0.16,0.166,0.172,0.178,0.163,0.166,0.168,0.172,0.158,OrganismalFitness,HXK4_HUMAN,Medium,Human
+0.192,0.16,0.156,0.162,0.155,0.158,0.092,0.144,0.152,0.163,0.145,0.144,0.164,0.104,0.12,0.16,0.174,0.159,0.156,0.152,0.153,0.152,0.163,0.162,0.156,0.162,0.174,0.16,0.148,0.148,0.149,0.142,0.098,0.154,0.155,0.17,0.166,0.156,0.171,0.16,0.151,0.162,0.129,0.106,0.152,0.154,0.168,0.157,0.161,0.136,0.162,0.168,0.162,0.16,0.166,0.172,0.178,0.163,0.166,0.168,0.172,0.158,Expression,HXK4_HUMAN,Medium,Human
+0.277,0.262,0.234,0.233,0.285,0.282,0.083,0.225,0.247,0.275,0.092,0.086,0.087,0.082,0.077,0.088,0.1,0.091,0.124,0.216,0.23,0.253,0.261,0.263,0.077,0.094,0.119,0.083,0.238,0.284,0.185,0.188,0.143,0.261,0.251,0.252,0.27,0.266,0.26,0.283,0.28,0.268,0.08,0.088,0.092,0.086,0.186,0.185,0.186,0.146,0.103,0.136,0.13,0.136,0.121,0.117,0.105,0.109,0.1,0.12,0.124,0.11,OrganismalFitness,I6TAH8_I68A0,Medium,Virus
+0.13,0.217,0.196,0.232,0.21,0.21,0.196,0.159,0.087,0.087,0.254,0.181,0.196,0.101,0.188,0.261,0.239,0.196,0.203,0.196,0.188,0.232,0.203,0.203,0.225,0.217,0.261,0.225,0.152,0.304,0.203,0.159,0.145,0.21,0.188,0.21,0.181,0.152,0.203,0.203,0.196,0.203,0.145,0.13,0.261,0.232,0.109,0.232,0.217,0.174,0.217,0.268,0.239,0.239,0.232,0.246,0.232,0.21,0.254,0.239,0.188,0.188,OrganismalFitness,IF1_ECOLI,High,Prokaryote
+0.075,0.075,0.195,0.203,0.195,0.211,0.06,0.12,0.083,0.12,0.203,0.12,0.113,0.053,0.053,0.075,0.083,0.165,0.188,0.188,0.203,0.173,0.173,0.188,0.271,0.211,0.248,0.18,0.165,0.218,0.12,0.075,0.203,0.083,0.226,0.233,0.06,0.158,0.18,0.105,0.128,0.165,0.09,0.06,0.158,0.128,0.09,0.15,0.083,0.15,0.188,0.105,0.158,0.15,0.165,0.165,0.158,0.143,0.15,0.158,0.218,0.09,Stability,ILF3_HUMAN,High,Human
+0.226,0.179,0.236,0.236,0.221,0.236,0.313,0.128,0.226,0.282,0.246,0.323,0.344,0.344,0.369,0.451,0.374,0.421,0.395,0.19,0.318,0.2,0.195,0.215,0.349,0.354,0.369,0.333,0.251,0.205,0.179,0.123,0.159,0.231,0.241,0.287,0.236,0.215,0.267,0.251,0.231,0.251,0.313,0.272,0.349,0.338,0.451,0.467,0.333,0.313,0.251,0.292,0.297,0.308,0.292,0.287,0.287,0.297,0.272,0.292,0.415,0.308,Stability,ISDH_STAAW,High,Prokaryote
+0.118,0.127,0.112,0.112,0.134,0.134,0.134,0.148,0.125,0.136,0.14,0.162,0.162,0.132,0.136,0.146,0.14,0.132,0.094,0.097,0.136,0.134,0.133,0.148,0.146,0.176,0.159,0.131,0.112,0.114,0.127,0.112,0.073,0.134,0.174,0.155,0.127,0.138,0.136,0.138,0.134,0.134,0.133,0.128,0.108,0.133,0.146,0.101,0.128,0.133,0.133,0.136,0.14,0.122,0.142,0.146,0.153,0.129,0.136,0.134,0.174,0.131,Expression,KCNE1_HUMAN,Medium,Human
+0.118,0.127,0.112,0.112,0.134,0.134,0.134,0.148,0.125,0.136,0.14,0.162,0.162,0.132,0.136,0.146,0.14,0.132,0.094,0.097,0.136,0.134,0.133,0.148,0.146,0.176,0.159,0.131,0.112,0.114,0.127,0.112,0.073,0.134,0.174,0.155,0.127,0.138,0.136,0.138,0.134,0.134,0.133,0.128,0.108,0.133,0.146,0.101,0.128,0.133,0.133,0.136,0.14,0.122,0.142,0.146,0.153,0.129,0.136,0.134,0.174,0.131,Activity,KCNE1_HUMAN,Medium,Human
+0.0,0.05,0.0,0.0,0.0,0.0,0.0,0.1,0.0,0.0,0.05,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.2,0.05,0.1,0.1,0.1,0.1,0.05,0.05,0.05,0.05,0.05,0.2,0.35,0.05,0.05,0.05,0.1,0.0,0.0,0.05,0.0,0.0,0.0,0.1,0.0,0.05,0.05,0.0,0.0,0.05,0.05,0.15,0.1,0.1,0.15,0.15,0.1,0.15,0.1,0.15,0.15,0.0,0.1,Activity,KCNH2_HUMAN,Medium,Human
+0.148,0.132,0.136,0.141,0.141,0.145,0.106,0.098,0.122,0.122,0.159,0.15,0.136,0.129,0.168,0.187,0.176,0.162,0.175,0.103,0.168,0.126,0.112,0.105,0.173,0.126,0.12,0.147,0.113,0.124,0.113,0.111,0.09,0.16,0.118,0.112,0.144,0.126,0.111,0.142,0.132,0.123,0.152,0.12,0.117,0.172,0.148,0.115,0.139,0.12,0.166,0.159,0.174,0.157,0.173,0.157,0.16,0.17,0.168,0.166,0.153,0.176,Activity,KCNJ2_MOUSE,Medium,Eukaryote
+0.148,0.132,0.136,0.141,0.141,0.145,0.106,0.098,0.122,0.122,0.159,0.15,0.136,0.129,0.168,0.187,0.176,0.162,0.175,0.103,0.168,0.126,0.112,0.105,0.173,0.126,0.12,0.147,0.113,0.124,0.113,0.111,0.09,0.16,0.118,0.112,0.144,0.126,0.111,0.142,0.132,0.123,0.152,0.12,0.117,0.172,0.148,0.115,0.139,0.12,0.166,0.159,0.174,0.157,0.173,0.157,0.16,0.17,0.168,0.166,0.153,0.176,Expression,KCNJ2_MOUSE,Medium,Eukaryote
+0.159,0.22,0.216,0.24,0.222,0.224,0.107,0.169,0.208,0.238,0.212,0.19,0.226,0.113,0.123,0.169,0.226,0.24,0.238,0.2,0.151,0.188,0.22,0.216,0.165,0.204,0.228,0.188,0.242,0.218,0.169,0.149,0.077,0.125,0.167,0.194,0.185,0.19,0.21,0.232,0.23,0.236,0.123,0.101,0.202,0.143,0.22,0.23,0.228,0.171,0.224,0.214,0.228,0.216,0.216,0.222,0.22,0.208,0.216,0.226,0.236,0.181,OrganismalFitness,KKA2_KLEPN,High,Prokaryote
+0.217,0.432,0.426,0.427,0.446,0.447,0.133,0.347,0.272,0.406,0.251,0.351,0.388,0.119,0.174,0.227,0.324,0.404,0.435,0.326,0.161,0.241,0.298,0.319,0.185,0.328,0.406,0.305,0.445,0.493,0.371,0.319,0.139,0.171,0.245,0.47,0.214,0.246,0.444,0.436,0.447,0.47,0.105,0.085,0.283,0.175,0.231,0.361,0.376,0.158,0.305,0.347,0.337,0.354,0.337,0.347,0.323,0.335,0.323,0.352,0.269,0.19,Activity,LGK_LIPST,Medium,Eukaryote
+0.139,0.25,0.25,0.25,0.222,0.25,0.278,0.111,0.25,0.25,0.25,0.222,0.222,0.25,0.278,0.222,0.194,0.167,0.278,0.25,0.333,0.333,0.278,0.25,0.222,0.25,0.194,0.25,0.083,0.222,0.194,0.056,0.194,0.278,0.333,0.222,0.194,0.333,0.222,0.278,0.278,0.25,0.25,0.278,0.194,0.194,0.278,0.222,0.194,0.194,0.194,0.306,0.333,0.25,0.25,0.25,0.278,0.278,0.139,0.278,0.306,0.306,Expression,LYAM1_HUMAN,Medium,Human
+0.315,0.315,0.35,0.35,0.343,0.336,0.343,0.245,0.287,0.28,0.294,0.322,0.406,0.385,0.378,0.378,0.413,0.455,0.273,0.231,0.224,0.217,0.259,0.301,0.28,0.273,0.294,0.273,0.273,0.252,0.21,0.217,0.161,0.231,0.301,0.231,0.301,0.315,0.294,0.343,0.315,0.322,0.385,0.259,0.35,0.357,0.413,0.315,0.364,0.385,0.413,0.441,0.448,0.469,0.427,0.476,0.462,0.462,0.434,0.462,0.462,0.483,Stability,MAFG_MOUSE,Medium,Eukaryote
+0.288,0.382,0.33,0.33,0.335,0.335,0.057,0.151,0.274,0.302,0.264,0.255,0.226,0.028,0.028,0.264,0.259,0.264,0.377,0.241,0.024,0.061,0.193,0.179,0.094,0.264,0.274,0.236,0.288,0.274,0.325,0.297,0.316,0.028,0.042,0.047,0.269,0.264,0.283,0.316,0.33,0.316,0.09,0.057,0.307,0.25,0.288,0.335,0.288,0.288,0.34,0.387,0.344,0.344,0.316,0.335,0.33,0.311,0.274,0.33,0.382,0.264,Stability,MBD11_ARATH,Medium,Eukaryote
+0.148,0.185,0.163,0.174,0.163,0.178,0.163,0.174,0.156,0.167,0.191,0.178,0.174,0.143,0.148,0.172,0.183,0.172,0.187,0.163,0.185,0.187,0.159,0.193,0.193,0.181,0.183,0.191,0.2,0.193,0.17,0.172,0.12,0.161,0.178,0.196,0.167,0.174,0.198,0.176,0.17,0.176,0.17,0.139,0.187,0.178,0.15,0.183,0.169,0.119,0.161,0.163,0.165,0.167,0.156,0.176,0.172,0.169,0.187,0.159,0.185,0.167,Activity,MET_HUMAN,Medium,Human
+0.091,0.134,0.134,0.144,0.138,0.122,0.084,0.075,0.116,0.095,0.076,0.119,0.107,0.104,0.103,0.09,0.072,0.085,0.098,0.097,0.101,0.094,0.082,0.09,0.095,0.109,0.103,0.109,0.101,0.122,0.082,0.095,0.109,0.087,0.101,0.09,0.078,0.098,0.088,0.125,0.125,0.106,0.097,0.095,0.113,0.12,0.06,0.109,0.07,0.097,0.084,0.104,0.095,0.088,0.085,0.079,0.09,0.081,0.081,0.088,0.098,0.104,OrganismalFitness,MK01_HUMAN,Medium,Human
+0.095,0.145,0.147,0.137,0.132,0.142,0.115,0.14,0.162,0.15,0.152,0.155,0.177,0.12,0.112,0.112,0.137,0.152,0.18,0.167,0.142,0.14,0.145,0.15,0.147,0.15,0.177,0.16,0.145,0.157,0.12,0.14,0.15,0.112,0.117,0.135,0.09,0.122,0.115,0.127,0.142,0.137,0.1,0.087,0.14,0.11,0.057,0.115,0.115,0.067,0.12,0.112,0.117,0.117,0.13,0.112,0.112,0.112,0.115,0.11,0.125,0.097,OrganismalFitness,MLAC_ECOLI,Medium,Prokaryote
+0.133,0.162,0.143,0.147,0.147,0.149,0.1,0.1,0.156,0.153,0.146,0.164,0.164,0.105,0.131,0.153,0.152,0.159,0.146,0.152,0.114,0.127,0.137,0.146,0.138,0.144,0.146,0.134,0.162,0.143,0.147,0.135,0.132,0.128,0.153,0.155,0.134,0.153,0.152,0.159,0.167,0.157,0.122,0.093,0.156,0.134,0.13,0.152,0.101,0.118,0.149,0.144,0.15,0.142,0.148,0.159,0.154,0.153,0.154,0.158,0.154,0.134,OrganismalFitness,MSH2_HUMAN,Medium,Human
+0.144,0.316,0.337,0.326,0.321,0.342,0.134,0.337,0.294,0.299,0.203,0.289,0.316,0.102,0.128,0.155,0.176,0.182,0.23,0.283,0.139,0.16,0.235,0.299,0.203,0.321,0.326,0.283,0.305,0.321,0.332,0.305,0.193,0.171,0.182,0.299,0.16,0.16,0.278,0.342,0.342,0.337,0.128,0.053,0.155,0.123,0.166,0.235,0.262,0.134,0.214,0.225,0.225,0.198,0.203,0.209,0.219,0.219,0.209,0.214,0.209,0.107,OrganismalFitness,MTH3_HAEAE,Medium,Prokaryote
+0.122,0.123,0.113,0.123,0.113,0.108,0.096,0.121,0.133,0.138,0.123,0.126,0.126,0.118,0.124,0.135,0.116,0.132,0.138,0.122,0.121,0.118,0.126,0.149,0.13,0.12,0.131,0.119,0.162,0.137,0.139,0.148,0.075,0.127,0.126,0.127,0.11,0.118,0.125,0.109,0.116,0.123,0.122,0.11,0.123,0.126,0.112,0.122,0.127,0.118,0.12,0.115,0.121,0.118,0.117,0.118,0.12,0.118,0.126,0.117,0.132,0.116,OrganismalFitness,MTHR_HUMAN,Low,Human
+0.227,0.206,0.276,0.212,0.236,0.245,0.073,0.061,0.297,0.285,0.294,0.261,0.264,0.109,0.23,0.355,0.421,0.318,0.179,0.233,0.139,0.091,0.164,0.155,0.188,0.261,0.255,0.227,0.291,0.209,0.209,0.167,0.17,0.212,0.139,0.127,0.291,0.221,0.2,0.276,0.255,0.261,0.194,0.073,0.27,0.2,0.248,0.315,0.339,0.27,0.303,0.306,0.297,0.312,0.303,0.306,0.291,0.303,0.3,0.303,0.297,0.348,Stability,MYO3_YEAST,High,Eukaryote
+0.258,0.246,0.246,0.247,0.259,0.263,0.111,0.22,0.266,0.261,0.106,0.099,0.094,0.101,0.093,0.096,0.099,0.102,0.129,0.182,0.235,0.264,0.264,0.263,0.098,0.111,0.109,0.107,0.242,0.268,0.199,0.194,0.148,0.249,0.252,0.285,0.26,0.271,0.27,0.28,0.282,0.278,0.094,0.099,0.106,0.093,0.161,0.166,0.178,0.144,0.103,0.118,0.127,0.122,0.114,0.114,0.115,0.124,0.102,0.115,0.139,0.13,OrganismalFitness,NCAP_I34A1,Medium,Virus
+0.434,0.426,0.434,0.454,0.442,0.474,0.353,0.361,0.378,0.406,0.305,0.398,0.369,0.402,0.394,0.478,0.51,0.486,0.458,0.406,0.386,0.365,0.329,0.329,0.378,0.317,0.357,0.301,0.321,0.373,0.349,0.369,0.305,0.39,0.386,0.337,0.43,0.402,0.402,0.442,0.446,0.454,0.382,0.406,0.321,0.345,0.386,0.341,0.418,0.418,0.482,0.426,0.474,0.45,0.482,0.478,0.454,0.494,0.438,0.466,0.349,0.41,Stability,NKX31_HUMAN,High,Human
+0.41,0.41,0.418,0.426,0.441,0.441,0.213,0.284,0.41,0.584,0.315,0.157,0.181,0.15,0.252,0.292,0.434,0.196,0.426,0.134,0.244,0.402,0.356,0.292,0.205,0.237,0.418,0.418,0.395,0.394,0.355,0.252,0.039,0.221,0.355,0.33,0.394,0.426,0.426,0.418,0.426,0.441,0.221,0.206,0.33,0.14,0.315,0.188,0.102,0.134,0.259,0.282,0.267,0.33,0.172,0.251,0.244,0.251,0.322,0.267,0.267,0.205,Activity,NPC1_HUMAN,Low,Human
+0.41,0.41,0.418,0.426,0.441,0.441,0.213,0.284,0.41,0.584,0.315,0.157,0.181,0.15,0.252,0.292,0.434,0.196,0.426,0.134,0.244,0.402,0.356,0.292,0.205,0.237,0.418,0.418,0.395,0.394,0.355,0.252,0.039,0.221,0.355,0.33,0.394,0.426,0.426,0.418,0.426,0.441,0.221,0.206,0.33,0.14,0.315,0.188,0.102,0.134,0.259,0.282,0.267,0.33,0.172,0.251,0.244,0.251,0.322,0.267,0.267,0.205,Activity,NPC1_HUMAN,Low,Human
+0.433,0.567,0.467,0.433,0.567,0.567,0.067,0.6,0.667,0.7,0.067,0.133,0.6,0.0,0.0,0.067,0.233,0.567,0.5,0.4,0.7,0.667,0.533,0.533,0.0,0.433,0.567,0.4,0.633,0.567,0.4,0.4,0.067,0.6,0.533,0.6,0.533,0.567,0.6,0.633,0.6,0.567,0.0,0.033,0.0,0.033,0.267,0.2,0.4,0.233,0.167,0.2,0.3,0.233,0.3,0.4,0.267,0.267,0.333,0.333,0.233,0.033,OrganismalFitness,NRAM_I33A0,Low,Virus
+0.193,0.26,0.281,0.274,0.274,0.284,0.053,0.179,0.253,0.267,0.256,0.207,0.284,0.105,0.126,0.14,0.193,0.239,0.263,0.204,0.102,0.13,0.249,0.253,0.133,0.298,0.267,0.239,0.312,0.309,0.267,0.211,0.098,0.123,0.119,0.305,0.196,0.204,0.302,0.274,0.288,0.309,0.116,0.07,0.284,0.14,0.196,0.33,0.211,0.168,0.214,0.249,0.211,0.228,0.239,0.263,0.239,0.256,0.246,0.26,0.298,0.2,Expression,NUD15_HUMAN,High,Human
+0.123,0.256,0.163,0.167,0.172,0.172,0.133,0.128,0.202,0.138,0.177,0.03,0.059,0.103,0.103,0.084,0.148,0.167,0.217,0.163,0.172,0.202,0.197,0.217,0.094,0.143,0.113,0.207,0.251,0.138,0.217,0.217,0.074,0.163,0.143,0.148,0.158,0.118,0.143,0.172,0.163,0.192,0.172,0.138,0.103,0.133,0.374,0.305,0.276,0.345,0.246,0.291,0.266,0.261,0.286,0.246,0.246,0.246,0.241,0.261,0.271,0.158,Stability,NUSA_ECOLI,High,Prokaryote
+0.283,0.217,0.225,0.246,0.239,0.246,0.159,0.181,0.283,0.225,0.232,0.254,0.246,0.123,0.261,0.254,0.232,0.217,0.217,0.138,0.116,0.152,0.152,0.152,0.254,0.181,0.167,0.167,0.174,0.246,0.232,0.196,0.217,0.167,0.145,0.181,0.21,0.203,0.21,0.196,0.188,0.21,0.217,0.08,0.188,0.297,0.254,0.21,0.37,0.319,0.283,0.341,0.312,0.275,0.29,0.29,0.304,0.319,0.268,0.297,0.188,0.275,Stability,NUSG_MYCTU,High,Prokaryote
+0.372,0.484,0.547,0.559,0.575,0.541,0.119,0.378,0.5,0.494,0.509,0.469,0.478,0.131,0.453,0.497,0.569,0.519,0.512,0.55,0.447,0.434,0.431,0.428,0.438,0.416,0.45,0.412,0.403,0.494,0.497,0.509,0.328,0.169,0.122,0.319,0.466,0.475,0.491,0.538,0.534,0.531,0.381,0.144,0.406,0.431,0.562,0.438,0.669,0.578,0.578,0.591,0.588,0.6,0.606,0.603,0.553,0.578,0.6,0.609,0.631,0.5,Stability,OBSCN_HUMAN,High,Human
+0.184,0.132,0.237,0.167,0.246,0.132,0.193,0.149,0.114,0.114,0.193,0.184,0.184,0.175,0.158,0.211,0.175,0.184,0.167,0.193,0.105,0.114,0.184,0.096,0.088,0.114,0.167,0.105,0.158,0.202,0.123,0.096,0.123,0.167,0.044,0.167,0.158,0.044,0.079,0.158,0.114,0.132,0.175,0.175,0.184,0.184,0.237,0.193,0.246,0.307,0.228,0.211,0.246,0.254,0.263,0.237,0.263,0.263,0.228,0.246,0.202,0.193,Stability,ODP2_GEOSE,High,Prokaryote
+0.176,0.412,0.412,0.412,0.471,0.412,0.176,0.412,0.294,0.412,0.353,0.294,0.294,0.294,0.235,0.235,0.294,0.412,0.176,0.412,0.412,0.353,0.235,0.294,0.471,0.353,0.353,0.412,0.235,0.294,0.235,0.235,0.0,0.353,0.235,0.118,0.412,0.294,0.294,0.412,0.412,0.412,0.353,0.059,0.353,0.412,0.176,0.353,0.176,0.176,0.235,0.235,0.176,0.176,0.176,0.294,0.235,0.235,0.235,0.176,0.412,0.294,Expression,OPSD_HUMAN,High,Human
+0.316,0.342,0.297,0.291,0.31,0.304,0.139,0.152,0.316,0.297,0.291,0.323,0.335,0.158,0.291,0.342,0.335,0.297,0.297,0.304,0.323,0.272,0.272,0.278,0.278,0.348,0.304,0.297,0.234,0.304,0.241,0.171,0.133,0.266,0.278,0.278,0.329,0.304,0.323,0.31,0.297,0.316,0.209,0.127,0.316,0.259,0.367,0.304,0.291,0.259,0.304,0.348,0.342,0.361,0.329,0.348,0.348,0.342,0.342,0.342,0.38,0.342,Activity,OTC_HUMAN,Medium,Human
+0.188,0.266,0.203,0.203,0.219,0.203,0.078,0.125,0.109,0.141,0.25,0.297,0.234,0.109,0.281,0.328,0.156,0.172,0.188,0.156,0.109,0.062,0.078,0.078,0.109,0.062,0.078,0.094,0.047,0.031,0.109,0.109,0.062,0.141,0.125,0.141,0.188,0.156,0.125,0.203,0.172,0.188,0.172,0.141,0.25,0.344,0.25,0.203,0.328,0.281,0.219,0.219,0.219,0.188,0.219,0.203,0.234,0.172,0.172,0.219,0.359,0.328,Stability,OTU7A_HUMAN,High,Human
+0.112,0.166,0.2,0.192,0.196,0.198,0.132,0.127,0.137,0.132,0.165,0.148,0.153,0.104,0.117,0.154,0.17,0.191,0.198,0.115,0.108,0.144,0.146,0.145,0.115,0.162,0.166,0.151,0.188,0.208,0.159,0.149,0.081,0.12,0.122,0.134,0.124,0.109,0.125,0.189,0.191,0.196,0.116,0.096,0.161,0.125,0.186,0.216,0.217,0.154,0.174,0.181,0.172,0.172,0.178,0.19,0.184,0.192,0.181,0.187,0.202,0.152,Activity,OXDA_RHOTO,High,Eukaryote
+0.112,0.166,0.2,0.192,0.196,0.198,0.132,0.127,0.137,0.132,0.165,0.148,0.153,0.104,0.117,0.154,0.17,0.191,0.198,0.115,0.108,0.144,0.146,0.145,0.115,0.162,0.166,0.151,0.188,0.208,0.159,0.149,0.081,0.12,0.122,0.134,0.124,0.109,0.125,0.189,0.191,0.196,0.116,0.096,0.161,0.125,0.186,0.216,0.217,0.154,0.174,0.181,0.172,0.172,0.178,0.19,0.184,0.192,0.181,0.187,0.202,0.152,Expression,OXDA_RHOTO,High,Eukaryote
+0.145,0.124,0.132,0.131,0.121,0.122,0.103,0.131,0.106,0.11,0.189,0.142,0.172,0.116,0.108,0.142,0.143,0.159,0.166,0.105,0.14,0.171,0.179,0.134,0.132,0.167,0.191,0.189,0.104,0.125,0.168,0.128,0.088,0.115,0.155,0.106,0.153,0.162,0.131,0.128,0.135,0.122,0.118,0.103,0.155,0.099,0.147,0.17,0.154,0.133,0.146,0.155,0.146,0.135,0.138,0.142,0.134,0.138,0.137,0.14,0.161,0.13,OrganismalFitness,P53_HUMAN,Low,Human
+0.145,0.124,0.132,0.131,0.121,0.122,0.103,0.131,0.106,0.11,0.189,0.142,0.172,0.116,0.108,0.142,0.143,0.159,0.166,0.105,0.14,0.171,0.179,0.134,0.132,0.167,0.191,0.189,0.104,0.125,0.168,0.128,0.088,0.115,0.155,0.106,0.153,0.162,0.131,0.128,0.135,0.122,0.118,0.103,0.155,0.099,0.147,0.17,0.154,0.133,0.146,0.155,0.146,0.135,0.138,0.142,0.134,0.138,0.137,0.14,0.161,0.13,OrganismalFitness,P53_HUMAN,Low,Human
+0.145,0.124,0.132,0.131,0.121,0.122,0.103,0.131,0.106,0.11,0.189,0.142,0.172,0.116,0.108,0.142,0.143,0.159,0.166,0.105,0.14,0.171,0.179,0.134,0.132,0.167,0.191,0.189,0.104,0.125,0.168,0.128,0.088,0.115,0.155,0.106,0.153,0.162,0.131,0.128,0.135,0.122,0.118,0.103,0.155,0.099,0.147,0.17,0.154,0.133,0.146,0.155,0.146,0.135,0.138,0.142,0.134,0.138,0.137,0.14,0.161,0.13,OrganismalFitness,P53_HUMAN,Low,Human
+0.145,0.124,0.132,0.131,0.121,0.122,0.103,0.131,0.106,0.11,0.189,0.142,0.172,0.116,0.108,0.142,0.143,0.159,0.166,0.105,0.14,0.171,0.179,0.134,0.132,0.167,0.191,0.189,0.104,0.125,0.168,0.128,0.088,0.115,0.155,0.106,0.153,0.162,0.131,0.128,0.135,0.122,0.118,0.103,0.155,0.099,0.147,0.17,0.154,0.133,0.146,0.155,0.146,0.135,0.138,0.142,0.134,0.138,0.137,0.14,0.161,0.13,OrganismalFitness,P53_HUMAN,Low,Human
+0.276,0.414,0.388,0.401,0.382,0.388,0.184,0.289,0.401,0.375,0.408,0.368,0.382,0.204,0.401,0.355,0.401,0.368,0.414,0.375,0.263,0.329,0.342,0.382,0.322,0.395,0.395,0.355,0.447,0.316,0.362,0.322,0.118,0.309,0.316,0.355,0.289,0.27,0.309,0.355,0.362,0.355,0.257,0.112,0.382,0.395,0.204,0.388,0.362,0.178,0.329,0.362,0.322,0.349,0.362,0.395,0.375,0.362,0.401,0.388,0.395,0.388,OrganismalFitness,P84126_THETH,Medium,Prokaryote
+0.279,0.269,0.275,0.275,0.29,0.285,0.182,0.234,0.269,0.276,0.277,0.287,0.289,0.172,0.191,0.21,0.242,0.245,0.259,0.249,0.255,0.285,0.275,0.282,0.27,0.281,0.292,0.281,0.25,0.291,0.273,0.249,0.121,0.244,0.27,0.261,0.269,0.285,0.272,0.286,0.291,0.286,0.242,0.115,0.288,0.264,0.126,0.288,0.2,0.148,0.233,0.216,0.229,0.221,0.223,0.217,0.221,0.227,0.228,0.225,0.269,0.213,OrganismalFitness,PABP_YEAST,Medium,Eukaryote
+0.207,0.215,0.207,0.211,0.211,0.211,0.086,0.172,0.2,0.222,0.196,0.198,0.217,0.121,0.196,0.209,0.2,0.193,0.179,0.202,0.146,0.17,0.151,0.2,0.178,0.179,0.181,0.183,0.168,0.168,0.138,0.121,0.129,0.142,0.196,0.183,0.194,0.215,0.198,0.224,0.217,0.224,0.185,0.133,0.196,0.183,0.217,0.247,0.219,0.155,0.213,0.202,0.219,0.213,0.211,0.226,0.224,0.221,0.221,0.224,0.232,0.206,Activity,PAI1_HUMAN,,Human
+0.335,0.335,0.335,0.324,0.335,0.324,0.115,0.269,0.088,0.132,0.11,0.11,0.093,0.104,0.093,0.104,0.104,0.099,0.247,0.236,0.28,0.275,0.264,0.33,0.121,0.258,0.258,0.225,0.264,0.291,0.264,0.242,0.126,0.275,0.28,0.302,0.335,0.319,0.319,0.33,0.341,0.319,0.11,0.11,0.104,0.11,0.115,0.099,0.071,0.137,0.088,0.093,0.099,0.088,0.088,0.11,0.071,0.082,0.082,0.115,0.104,0.071,OrganismalFitness,PA_I34A1,Medium,Virus
+0.273,0.396,0.515,0.502,0.368,0.354,0.401,0.393,0.501,0.49,0.434,0.47,0.502,0.505,0.529,0.49,0.526,0.497,0.507,0.362,0.397,0.458,0.413,0.463,0.406,0.438,0.459,0.459,0.423,0.389,0.426,0.381,0.342,0.325,0.317,0.442,0.412,0.388,0.453,0.376,0.39,0.379,0.463,0.288,0.464,0.49,0.23,0.415,0.396,0.269,0.412,0.377,0.378,0.39,0.386,0.401,0.397,0.402,0.401,0.395,0.491,0.484,Activity,PHOT_CHLRE,High,Eukaryote
+0.333,0.383,0.42,0.444,0.407,0.395,0.346,0.185,0.333,0.432,0.321,0.296,0.358,0.37,0.358,0.395,0.42,0.296,0.346,0.346,0.296,0.222,0.16,0.111,0.21,0.185,0.123,0.099,0.086,0.198,0.247,0.185,0.198,0.321,0.247,0.21,0.395,0.37,0.296,0.383,0.346,0.309,0.321,0.148,0.358,0.358,0.272,0.321,0.284,0.259,0.296,0.383,0.383,0.321,0.309,0.358,0.346,0.333,0.346,0.37,0.407,0.395,Stability,PIN1_HUMAN,High,Human
+0.366,0.339,0.333,0.322,0.333,0.339,0.295,0.213,0.273,0.35,0.219,0.29,0.273,0.415,0.383,0.415,0.393,0.361,0.328,0.322,0.213,0.186,0.186,0.186,0.273,0.219,0.235,0.18,0.191,0.311,0.24,0.273,0.23,0.246,0.301,0.164,0.295,0.35,0.251,0.328,0.344,0.322,0.295,0.311,0.18,0.257,0.388,0.202,0.443,0.377,0.377,0.41,0.421,0.41,0.41,0.404,0.388,0.388,0.41,0.404,0.18,0.339,Stability,PITX2_HUMAN,High,Human
+0.145,0.282,0.145,0.137,0.122,0.153,0.229,0.046,0.107,0.099,0.069,0.13,0.099,0.183,0.16,0.191,0.092,0.046,0.061,0.176,0.176,0.16,0.198,0.153,0.153,0.137,0.16,0.183,0.069,0.046,0.092,0.099,0.099,0.221,0.191,0.191,0.176,0.176,0.168,0.183,0.168,0.153,0.153,0.214,0.13,0.183,0.252,0.153,0.16,0.214,0.115,0.115,0.122,0.115,0.107,0.107,0.099,0.122,0.115,0.115,0.198,0.153,Stability,PKN1_HUMAN,High,Human
+0.309,0.38,0.361,0.387,0.392,0.399,0.088,0.261,0.394,0.389,0.176,0.09,0.113,0.097,0.091,0.136,0.231,0.252,0.288,0.279,0.288,0.348,0.352,0.328,0.145,0.316,0.319,0.31,0.372,0.391,0.289,0.284,0.096,0.099,0.251,0.316,0.286,0.321,0.342,0.349,0.373,0.375,0.098,0.092,0.196,0.1,0.151,0.215,0.101,0.121,0.197,0.211,0.211,0.212,0.199,0.202,0.214,0.223,0.212,0.218,0.109,0.102,OrganismalFitness,POLG_CXB3N,Medium,Virus
+0.327,0.395,0.341,0.358,0.384,0.386,0.086,0.299,0.473,0.477,0.143,0.089,0.102,0.08,0.086,0.109,0.134,0.14,0.164,0.331,0.33,0.325,0.331,0.292,0.331,0.344,0.34,0.351,0.349,0.454,0.411,0.314,0.104,0.089,0.16,0.366,0.288,0.332,0.405,0.383,0.393,0.433,0.089,0.086,0.198,0.088,0.188,0.278,0.07,0.136,0.138,0.138,0.166,0.146,0.152,0.154,0.144,0.15,0.14,0.157,0.12,0.114,OrganismalFitness,POLG_DEN26,Low,Virus
+0.362,0.411,0.368,0.38,0.393,0.405,0.074,0.153,0.387,0.387,0.123,0.387,0.35,0.086,0.092,0.08,0.11,0.092,0.104,0.239,0.196,0.276,0.325,0.288,0.282,0.27,0.239,0.239,0.294,0.387,0.313,0.221,0.067,0.368,0.356,0.288,0.362,0.374,0.362,0.393,0.368,0.344,0.086,0.074,0.331,0.117,0.08,0.245,0.233,0.19,0.172,0.153,0.123,0.153,0.141,0.129,0.221,0.153,0.166,0.178,0.092,0.098,OrganismalFitness,POLG_HCVJF,Medium,Virus
+0.164,0.519,0.435,0.454,0.464,0.464,0.088,0.48,0.333,0.497,0.489,0.113,0.129,0.099,0.123,0.117,0.156,0.15,0.127,0.595,0.097,0.078,0.086,0.09,0.109,0.049,0.08,0.051,0.131,0.558,0.608,0.62,0.113,0.105,0.094,0.088,0.296,0.294,0.281,0.437,0.437,0.429,0.185,0.127,0.189,0.154,0.55,0.507,0.561,0.581,0.622,0.63,0.63,0.651,0.653,0.663,0.634,0.69,0.637,0.663,0.569,0.326,Stability,POLG_PESV,Medium,Virus
+0.16,0.237,0.259,0.253,0.255,0.241,0.146,0.193,0.252,0.242,0.245,0.274,0.29,0.124,0.135,0.138,0.158,0.22,0.277,0.288,0.278,0.293,0.175,0.175,0.234,0.308,0.332,0.342,0.179,0.269,0.162,0.076,0.117,0.294,0.268,0.274,0.275,0.33,0.308,0.255,0.275,0.266,0.124,0.121,0.25,0.142,0.244,0.274,0.248,0.162,0.184,0.167,0.146,0.137,0.193,0.147,0.164,0.148,0.127,0.153,0.198,0.149,Activity,PPARG_HUMAN,Medium,Human
+0.18,0.176,0.18,0.187,0.18,0.177,0.128,0.154,0.176,0.173,0.183,0.21,0.2,0.131,0.14,0.161,0.174,0.188,0.201,0.163,0.172,0.174,0.205,0.181,0.177,0.202,0.192,0.214,0.174,0.181,0.172,0.135,0.113,0.163,0.188,0.206,0.191,0.192,0.205,0.187,0.185,0.186,0.142,0.115,0.191,0.171,0.169,0.201,0.188,0.135,0.177,0.18,0.176,0.167,0.181,0.168,0.163,0.168,0.173,0.181,0.215,0.158,OrganismalFitness,PPM1D_HUMAN,Low,Human
+0.191,0.23,0.343,0.343,0.299,0.284,0.216,0.216,0.392,0.392,0.397,0.191,0.265,0.221,0.225,0.358,0.387,0.402,0.265,0.299,0.363,0.358,0.387,0.333,0.353,0.392,0.363,0.353,0.363,0.382,0.299,0.25,0.221,0.206,0.245,0.235,0.294,0.275,0.245,0.319,0.314,0.324,0.216,0.221,0.299,0.24,0.338,0.338,0.426,0.319,0.382,0.412,0.382,0.387,0.397,0.407,0.402,0.392,0.407,0.402,0.402,0.431,Stability,PR40A_HUMAN,Medium,Human
+0.205,0.189,0.201,0.207,0.21,0.217,0.12,0.187,0.17,0.187,0.197,0.236,0.227,0.17,0.185,0.225,0.256,0.293,0.292,0.197,0.183,0.213,0.194,0.168,0.187,0.249,0.178,0.241,0.179,0.186,0.151,0.128,0.11,0.155,0.202,0.167,0.209,0.202,0.183,0.216,0.211,0.187,0.185,0.11,0.264,0.185,0.289,0.243,0.308,0.161,0.249,0.271,0.284,0.269,0.236,0.245,0.273,0.273,0.275,0.268,0.276,0.271,Expression,PRKN_HUMAN,Low,Human
+0.177,0.165,0.127,0.133,0.152,0.152,0.127,0.07,0.108,0.095,0.373,0.158,0.165,0.12,0.19,0.266,0.278,0.209,0.196,0.133,0.12,0.057,0.12,0.139,0.146,0.152,0.044,0.089,0.101,0.133,0.146,0.133,0.108,0.12,0.089,0.127,0.19,0.177,0.139,0.152,0.165,0.146,0.165,0.139,0.203,0.139,0.278,0.259,0.316,0.291,0.31,0.241,0.259,0.222,0.259,0.234,0.266,0.259,0.222,0.266,0.209,0.272,Stability,PSAE_PICP2,Medium,Prokaryote
+0.182,0.172,0.175,0.175,0.164,0.169,0.109,0.149,0.167,0.175,0.158,0.164,0.176,0.113,0.153,0.172,0.172,0.154,0.169,0.168,0.118,0.168,0.182,0.178,0.15,0.17,0.165,0.171,0.169,0.18,0.164,0.152,0.112,0.133,0.172,0.167,0.168,0.174,0.184,0.17,0.173,0.18,0.139,0.102,0.165,0.174,0.161,0.169,0.169,0.144,0.175,0.177,0.177,0.168,0.168,0.17,0.169,0.165,0.176,0.175,0.195,0.188,Expression,PTEN_HUMAN,Medium,Human
+0.182,0.172,0.175,0.175,0.164,0.169,0.109,0.149,0.167,0.175,0.158,0.164,0.176,0.113,0.153,0.172,0.172,0.154,0.169,0.168,0.118,0.168,0.182,0.178,0.15,0.17,0.165,0.171,0.169,0.18,0.164,0.152,0.112,0.133,0.172,0.167,0.168,0.174,0.184,0.17,0.173,0.18,0.139,0.102,0.165,0.174,0.161,0.169,0.169,0.144,0.175,0.177,0.177,0.168,0.168,0.17,0.169,0.165,0.176,0.175,0.195,0.188,Activity,PTEN_HUMAN,Medium,Human
+0.24,0.211,0.189,0.19,0.211,0.214,0.079,0.189,0.225,0.229,0.218,0.233,0.236,0.088,0.082,0.086,0.11,0.104,0.123,0.208,0.238,0.191,0.184,0.163,0.236,0.192,0.185,0.185,0.178,0.232,0.195,0.196,0.162,0.232,0.204,0.207,0.234,0.229,0.23,0.219,0.215,0.215,0.202,0.07,0.222,0.22,0.165,0.237,0.109,0.149,0.125,0.14,0.152,0.148,0.139,0.151,0.134,0.141,0.122,0.147,0.123,0.108,OrganismalFitness,Q2N0S5_9HIV1,Medium,Virus
+0.219,0.225,0.236,0.23,0.239,0.234,0.104,0.135,0.23,0.227,0.219,0.214,0.236,0.144,0.167,0.216,0.225,0.243,0.225,0.227,0.182,0.152,0.144,0.14,0.246,0.239,0.208,0.222,0.193,0.148,0.18,0.157,0.17,0.174,0.132,0.128,0.206,0.19,0.152,0.234,0.23,0.214,0.178,0.132,0.237,0.203,0.249,0.25,0.202,0.154,0.202,0.198,0.228,0.222,0.231,0.221,0.218,0.231,0.216,0.214,0.25,0.271,Binding,Q53Z42_HUMAN,Medium,Human
+0.219,0.225,0.236,0.23,0.239,0.234,0.104,0.135,0.23,0.227,0.219,0.214,0.236,0.144,0.167,0.216,0.225,0.243,0.225,0.227,0.182,0.152,0.144,0.14,0.246,0.239,0.208,0.222,0.193,0.148,0.18,0.157,0.17,0.174,0.132,0.128,0.206,0.19,0.152,0.234,0.23,0.214,0.178,0.132,0.237,0.203,0.249,0.25,0.202,0.154,0.202,0.198,0.228,0.222,0.231,0.221,0.218,0.231,0.216,0.214,0.25,0.271,Expression,Q53Z42_HUMAN,Medium,Human
+0.193,0.25,0.243,0.253,0.27,0.267,0.16,0.243,0.27,0.27,0.23,0.177,0.207,0.143,0.18,0.2,0.227,0.233,0.233,0.267,0.25,0.287,0.287,0.297,0.27,0.31,0.313,0.287,0.283,0.277,0.227,0.183,0.15,0.243,0.283,0.297,0.217,0.25,0.273,0.273,0.273,0.277,0.213,0.13,0.243,0.243,0.207,0.23,0.223,0.147,0.2,0.203,0.227,0.2,0.22,0.2,0.223,0.22,0.217,0.213,0.257,0.22,Activity,Q59976_STRSQ,Medium,Prokaryote
+0.136,0.145,0.111,0.11,0.127,0.124,0.08,0.097,0.132,0.132,0.1,0.092,0.079,0.074,0.078,0.072,0.07,0.072,0.072,0.119,0.088,0.109,0.082,0.116,0.093,0.07,0.074,0.062,0.077,0.149,0.117,0.117,0.087,0.095,0.089,0.115,0.112,0.117,0.112,0.123,0.128,0.125,0.099,0.079,0.084,0.092,0.186,0.152,0.18,0.148,0.131,0.138,0.114,0.149,0.144,0.145,0.141,0.133,0.135,0.141,0.11,0.102,Activity,Q6WV12_9MAXI,Low,Eukaryote
+0.2,0.229,0.214,0.2,0.257,0.257,0.086,0.129,0.243,0.271,0.257,0.229,0.3,0.129,0.186,0.186,0.214,0.286,0.271,0.1,0.271,0.257,0.257,0.357,0.286,0.271,0.286,0.3,0.3,0.271,0.286,0.257,0.1,0.329,0.271,0.3,0.314,0.271,0.286,0.257,0.229,0.243,0.186,0.071,0.243,0.2,0.129,0.229,0.2,0.086,0.3,0.243,0.243,0.271,0.229,0.2,0.271,0.271,0.271,0.286,0.3,0.214,Activity,Q837P4_ENTFA,Medium,Prokaryote
+0.067,0.293,0.267,0.28,0.307,0.307,0.12,0.2,0.133,0.133,0.253,0.267,0.24,0.067,0.12,0.12,0.187,0.267,0.227,0.147,0.2,0.2,0.293,0.227,0.187,0.227,0.293,0.267,0.333,0.2,0.24,0.28,0.093,0.24,0.2,0.28,0.24,0.2,0.267,0.24,0.24,0.24,0.093,0.093,0.213,0.16,0.12,0.2,0.227,0.133,0.2,0.227,0.227,0.2,0.213,0.173,0.227,0.213,0.24,0.2,0.213,0.227,Activity,Q837P5_ENTFA,Medium,Prokaryote
+0.183,0.207,0.173,0.179,0.189,0.189,0.097,0.15,0.195,0.199,0.124,0.107,0.107,0.094,0.101,0.104,0.095,0.115,0.11,0.141,0.108,0.11,0.093,0.11,0.097,0.108,0.116,0.185,0.189,0.198,0.12,0.123,0.09,0.104,0.101,0.191,0.179,0.18,0.195,0.192,0.191,0.206,0.108,0.113,0.117,0.112,0.184,0.179,0.201,0.163,0.141,0.138,0.144,0.148,0.144,0.152,0.14,0.158,0.141,0.15,0.132,0.119,Activity,Q8WTC7_9CNID,Low,Eukaryote
+0.169,0.171,0.148,0.161,0.166,0.161,0.077,0.12,0.059,0.059,0.086,0.058,0.056,0.068,0.07,0.073,0.084,0.147,0.147,0.117,0.138,0.15,0.145,0.15,0.134,0.131,0.129,0.119,0.143,0.147,0.119,0.119,0.113,0.112,0.113,0.15,0.161,0.155,0.164,0.164,0.164,0.168,0.058,0.056,0.079,0.07,0.154,0.122,0.169,0.147,0.098,0.122,0.119,0.112,0.113,0.103,0.092,0.092,0.105,0.096,0.106,0.077,OrganismalFitness,R1AB_SARS2,Medium,Virus
+0.315,0.163,0.13,0.185,0.13,0.185,0.228,0.163,0.196,0.217,0.13,0.076,0.109,0.283,0.315,0.261,0.217,0.239,0.141,0.174,0.337,0.13,0.12,0.174,0.152,0.163,0.152,0.098,0.098,0.239,0.163,0.13,0.207,0.13,0.174,0.207,0.293,0.272,0.25,0.185,0.196,0.196,0.228,0.283,0.13,0.207,0.217,0.141,0.25,0.25,0.185,0.283,0.228,0.207,0.25,0.239,0.239,0.261,0.261,0.25,0.098,0.261,Stability,RAD_ANTMA,High,Eukaryote
+0.133,0.167,0.133,0.167,0.2,0.2,0.033,0.2,0.167,0.167,0.2,0.133,0.2,0.1,0.167,0.133,0.133,0.167,0.167,0.167,0.133,0.2,0.167,0.133,0.2,0.2,0.2,0.2,0.133,0.167,0.167,0.167,0.067,0.167,0.2,0.133,0.133,0.167,0.133,0.167,0.133,0.133,0.133,0.067,0.167,0.133,0.067,0.2,0.067,0.067,0.167,0.133,0.2,0.133,0.1,0.133,0.133,0.1,0.133,0.133,0.1,0.067,OrganismalFitness,RAF1_HUMAN,Low,Human
+0.166,0.092,0.086,0.092,0.08,0.089,0.073,0.057,0.092,0.086,0.086,0.076,0.061,0.162,0.182,0.156,0.111,0.099,0.105,0.127,0.096,0.099,0.07,0.08,0.086,0.08,0.08,0.067,0.067,0.108,0.054,0.073,0.064,0.115,0.086,0.08,0.124,0.105,0.111,0.092,0.089,0.089,0.134,0.229,0.115,0.115,0.191,0.111,0.121,0.194,0.118,0.115,0.127,0.108,0.111,0.118,0.115,0.115,0.089,0.118,0.083,0.153,Activity,RASH_HUMAN,High,Human
+0.204,0.206,0.221,0.222,0.217,0.222,0.088,0.178,0.23,0.246,0.202,0.204,0.209,0.247,0.234,0.242,0.233,0.218,0.214,0.222,0.186,0.208,0.209,0.209,0.217,0.224,0.21,0.205,0.202,0.187,0.192,0.171,0.142,0.186,0.213,0.213,0.186,0.217,0.222,0.222,0.226,0.226,0.22,0.184,0.198,0.22,0.156,0.182,0.246,0.176,0.202,0.192,0.196,0.196,0.203,0.194,0.204,0.198,0.197,0.201,0.222,0.215,Expression,RASK_HUMAN,High,Human
+0.204,0.206,0.221,0.222,0.217,0.222,0.088,0.178,0.23,0.246,0.202,0.204,0.209,0.247,0.234,0.242,0.233,0.218,0.214,0.222,0.186,0.208,0.209,0.209,0.217,0.224,0.21,0.205,0.202,0.187,0.192,0.171,0.142,0.186,0.213,0.213,0.186,0.217,0.222,0.222,0.226,0.226,0.22,0.184,0.198,0.22,0.156,0.182,0.246,0.176,0.202,0.192,0.196,0.196,0.203,0.194,0.204,0.198,0.197,0.201,0.222,0.215,Binding,RASK_HUMAN,High,Human
+0.239,0.157,0.239,0.239,0.239,0.239,0.284,0.112,0.209,0.209,0.351,0.343,0.351,0.328,0.343,0.351,0.366,0.157,0.187,0.254,0.328,0.097,0.164,0.082,0.343,0.201,0.194,0.104,0.104,0.157,0.142,0.104,0.104,0.299,0.351,0.291,0.269,0.246,0.239,0.261,0.246,0.224,0.358,0.351,0.343,0.343,0.321,0.321,0.328,0.284,0.336,0.291,0.306,0.299,0.328,0.321,0.291,0.306,0.313,0.328,0.321,0.321,Stability,RBP1_HUMAN,High,Human
+0.181,0.142,0.165,0.165,0.15,0.157,0.165,0.087,0.142,0.142,0.283,0.26,0.276,0.252,0.307,0.252,0.205,0.157,0.197,0.094,0.228,0.157,0.181,0.205,0.157,0.205,0.213,0.181,0.197,0.134,0.11,0.047,0.071,0.228,0.165,0.244,0.213,0.228,0.22,0.189,0.173,0.173,0.236,0.26,0.22,0.22,0.228,0.26,0.228,0.276,0.189,0.213,0.181,0.189,0.189,0.189,0.213,0.173,0.181,0.189,0.173,0.276,Stability,RCD1_ARATH,Medium,Eukaryote
+0.197,0.272,0.298,0.351,0.346,0.311,0.237,0.175,0.281,0.311,0.303,0.272,0.364,0.25,0.307,0.311,0.289,0.281,0.276,0.263,0.18,0.25,0.228,0.25,0.145,0.303,0.175,0.25,0.294,0.289,0.281,0.254,0.184,0.219,0.14,0.254,0.263,0.246,0.272,0.298,0.303,0.281,0.254,0.311,0.316,0.263,0.539,0.404,0.469,0.469,0.382,0.399,0.377,0.408,0.399,0.377,0.39,0.346,0.351,0.386,0.465,0.158,Stability,RCRO_LAMBD,High,Virus
+0.441,0.304,0.304,0.324,0.304,0.294,0.304,0.196,0.245,0.235,0.216,0.245,0.255,0.275,0.382,0.275,0.225,0.196,0.186,0.284,0.147,0.255,0.196,0.216,0.265,0.235,0.216,0.245,0.176,0.196,0.235,0.206,0.294,0.137,0.245,0.216,0.353,0.353,0.294,0.333,0.324,0.294,0.255,0.245,0.245,0.284,0.382,0.314,0.441,0.422,0.216,0.245,0.255,0.255,0.284,0.265,0.275,0.245,0.255,0.255,0.235,0.392,Stability,RD23A_HUMAN,High,Human
+0.184,0.259,0.386,0.395,0.396,0.392,0.071,0.346,0.356,0.353,0.117,0.096,0.107,0.102,0.096,0.108,0.214,0.301,0.338,0.405,0.327,0.366,0.377,0.397,0.131,0.332,0.314,0.306,0.356,0.416,0.345,0.335,0.122,0.339,0.36,0.398,0.351,0.376,0.386,0.412,0.415,0.413,0.102,0.092,0.132,0.106,0.138,0.182,0.098,0.136,0.224,0.218,0.232,0.221,0.221,0.219,0.22,0.238,0.229,0.226,0.119,0.111,OrganismalFitness,RDRP_I33A0,Low,Virus
+0.13,0.135,0.144,0.135,0.149,0.144,0.112,0.14,0.135,0.126,0.084,0.144,0.126,0.107,0.093,0.065,0.13,0.107,0.107,0.112,0.116,0.098,0.093,0.13,0.102,0.149,0.098,0.126,0.13,0.135,0.135,0.126,0.116,0.13,0.14,0.112,0.144,0.153,0.121,0.153,0.144,0.14,0.102,0.088,0.112,0.102,0.07,0.098,0.116,0.126,0.135,0.13,0.126,0.098,0.116,0.093,0.112,0.116,0.102,0.112,0.079,0.121,OrganismalFitness,REV_HV1H2,Medium,Virus
+0.233,0.248,0.18,0.173,0.195,0.165,0.165,0.075,0.105,0.173,0.248,0.173,0.226,0.18,0.158,0.211,0.241,0.12,0.068,0.09,0.143,0.165,0.18,0.158,0.256,0.158,0.195,0.15,0.083,0.083,0.135,0.105,0.053,0.203,0.083,0.068,0.211,0.09,0.158,0.195,0.135,0.173,0.226,0.18,0.233,0.241,0.241,0.173,0.301,0.286,0.188,0.188,0.195,0.18,0.188,0.188,0.241,0.188,0.195,0.195,0.278,0.226,Stability,RFAH_ECOLI,High,Prokaryote
+0.095,0.184,0.129,0.136,0.129,0.129,0.163,0.122,0.184,0.163,0.252,0.204,0.238,0.19,0.184,0.19,0.19,0.136,0.129,0.122,0.116,0.156,0.122,0.15,0.102,0.122,0.129,0.15,0.143,0.048,0.129,0.109,0.088,0.136,0.156,0.109,0.143,0.163,0.143,0.143,0.143,0.136,0.224,0.15,0.15,0.224,0.333,0.184,0.361,0.313,0.204,0.204,0.211,0.184,0.177,0.197,0.177,0.184,0.184,0.19,0.177,0.272,Stability,RL20_AQUAE,High,Prokaryote
+0.133,0.151,0.153,0.179,0.171,0.17,0.061,0.152,0.164,0.177,0.148,0.159,0.158,0.092,0.167,0.171,0.162,0.157,0.192,0.167,0.183,0.159,0.158,0.143,0.165,0.157,0.14,0.16,0.15,0.18,0.165,0.177,0.128,0.162,0.133,0.126,0.16,0.147,0.147,0.166,0.182,0.163,0.152,0.129,0.157,0.126,0.077,0.094,0.112,0.061,0.156,0.159,0.148,0.166,0.161,0.151,0.138,0.154,0.168,0.156,0.147,0.144,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.133,0.151,0.153,0.179,0.171,0.17,0.061,0.152,0.164,0.177,0.148,0.159,0.158,0.092,0.167,0.171,0.162,0.157,0.192,0.167,0.183,0.159,0.158,0.143,0.165,0.157,0.14,0.16,0.15,0.18,0.165,0.177,0.128,0.162,0.133,0.126,0.16,0.147,0.147,0.166,0.182,0.163,0.152,0.129,0.157,0.126,0.077,0.094,0.112,0.061,0.156,0.159,0.148,0.166,0.161,0.151,0.138,0.154,0.168,0.156,0.147,0.144,OrganismalFitness,RL40A_YEAST,Medium,Eukaryote
+0.133,0.151,0.153,0.179,0.171,0.17,0.061,0.152,0.164,0.177,0.148,0.159,0.158,0.092,0.167,0.171,0.162,0.157,0.192,0.167,0.183,0.159,0.158,0.143,0.165,0.157,0.14,0.16,0.15,0.18,0.165,0.177,0.128,0.162,0.133,0.126,0.16,0.147,0.147,0.166,0.182,0.163,0.152,0.129,0.157,0.126,0.077,0.094,0.112,0.061,0.156,0.159,0.148,0.166,0.161,0.151,0.138,0.154,0.168,0.156,0.147,0.144,Activity,RL40A_YEAST,Medium,Eukaryote
+0.147,0.196,0.117,0.121,0.129,0.131,0.077,0.143,0.138,0.133,0.147,0.136,0.154,0.103,0.147,0.136,0.152,0.171,0.173,0.131,0.189,0.199,0.161,0.182,0.182,0.152,0.168,0.152,0.182,0.143,0.173,0.173,0.119,0.161,0.173,0.15,0.15,0.166,0.168,0.14,0.136,0.15,0.173,0.093,0.138,0.131,0.152,0.168,0.131,0.121,0.143,0.161,0.152,0.152,0.154,0.164,0.147,0.147,0.164,0.147,0.166,0.129,Activity,RNC_ECOLI,Medium,Prokaryote
+0.274,0.295,0.233,0.26,0.247,0.26,0.281,0.171,0.185,0.199,0.295,0.288,0.322,0.301,0.315,0.322,0.267,0.26,0.295,0.281,0.219,0.356,0.336,0.26,0.274,0.329,0.247,0.295,0.247,0.253,0.247,0.24,0.178,0.185,0.295,0.301,0.199,0.281,0.288,0.301,0.281,0.281,0.329,0.274,0.26,0.267,0.356,0.308,0.301,0.37,0.274,0.24,0.274,0.281,0.267,0.253,0.247,0.26,0.247,0.26,0.322,0.308,Stability,RPC1_BP434,High,Virus
+0.166,0.166,0.166,0.18,0.166,0.166,0.153,0.07,0.222,0.194,0.125,0.166,0.194,0.208,0.18,0.194,0.223,0.152,0.153,0.208,0.125,0.194,0.236,0.153,0.138,0.166,0.18,0.18,0.166,0.208,0.111,0.084,0.111,0.18,0.194,0.152,0.166,0.166,0.167,0.18,0.166,0.166,0.167,0.208,0.166,0.18,0.152,0.25,0.18,0.112,0.194,0.18,0.18,0.194,0.209,0.209,0.209,0.209,0.194,0.223,0.236,0.18,Activity,RPC1_LAMBD,High,Virus
+0.166,0.166,0.166,0.18,0.166,0.166,0.153,0.07,0.222,0.194,0.125,0.166,0.194,0.208,0.18,0.194,0.223,0.152,0.153,0.208,0.125,0.194,0.236,0.153,0.138,0.166,0.18,0.18,0.166,0.208,0.111,0.084,0.111,0.18,0.194,0.152,0.166,0.166,0.167,0.18,0.166,0.166,0.167,0.208,0.166,0.18,0.152,0.25,0.18,0.112,0.194,0.18,0.18,0.194,0.209,0.209,0.209,0.209,0.194,0.223,0.236,0.18,Activity,RPC1_LAMBD,High,Virus
+0.142,0.083,0.108,0.117,0.108,0.117,0.092,0.083,0.108,0.092,0.108,0.1,0.083,0.15,0.208,0.192,0.092,0.125,0.142,0.067,0.108,0.108,0.075,0.083,0.133,0.083,0.125,0.1,0.117,0.05,0.075,0.042,0.183,0.117,0.083,0.1,0.133,0.092,0.1,0.117,0.083,0.083,0.192,0.15,0.125,0.217,0.275,0.15,0.158,0.292,0.117,0.117,0.108,0.142,0.117,0.092,0.108,0.108,0.125,0.125,0.15,0.325,Stability,RS15_GEOSE,Medium,Prokaryote
+0.179,0.217,0.238,0.244,0.233,0.236,0.155,0.182,0.232,0.236,0.227,0.238,0.235,0.18,0.178,0.205,0.246,0.238,0.218,0.102,0.217,0.233,0.214,0.202,0.211,0.222,0.216,0.234,0.196,0.218,0.228,0.182,0.135,0.196,0.233,0.229,0.212,0.25,0.244,0.243,0.238,0.244,0.19,0.139,0.24,0.189,0.205,0.26,0.207,0.148,0.241,0.232,0.251,0.254,0.249,0.244,0.246,0.246,0.24,0.25,0.238,0.208,Expression,S22A1_HUMAN,Medium,Human
+0.179,0.217,0.238,0.244,0.233,0.236,0.155,0.182,0.232,0.236,0.227,0.238,0.235,0.18,0.178,0.205,0.246,0.238,0.218,0.102,0.217,0.233,0.214,0.202,0.211,0.222,0.216,0.234,0.196,0.218,0.228,0.182,0.135,0.196,0.233,0.229,0.212,0.25,0.244,0.243,0.238,0.244,0.19,0.139,0.24,0.189,0.205,0.26,0.207,0.148,0.241,0.232,0.251,0.254,0.249,0.244,0.246,0.246,0.24,0.25,0.238,0.208,Activity,S22A1_HUMAN,Medium,Human
+0.454,0.381,0.423,0.433,0.464,0.474,0.309,0.103,0.454,0.474,0.258,0.268,0.258,0.196,0.423,0.412,0.258,0.278,0.175,0.206,0.103,0.103,0.072,0.093,0.268,0.134,0.103,0.093,0.103,0.062,0.216,0.206,0.144,0.216,0.196,0.165,0.402,0.412,0.392,0.433,0.443,0.433,0.361,0.216,0.34,0.32,0.381,0.371,0.392,0.412,0.351,0.289,0.361,0.268,0.371,0.361,0.309,0.351,0.33,0.351,0.227,0.412,Stability,SAV1_MOUSE,High,Eukaryote
+0.204,0.194,0.223,0.214,0.233,0.214,0.194,0.107,0.214,0.282,0.262,0.184,0.204,0.155,0.175,0.223,0.32,0.272,0.223,0.126,0.194,0.204,0.184,0.165,0.175,0.184,0.204,0.165,0.184,0.165,0.291,0.184,0.117,0.146,0.146,0.126,0.175,0.194,0.204,0.214,0.223,0.223,0.184,0.223,0.214,0.184,0.311,0.33,0.311,0.34,0.291,0.311,0.34,0.32,0.311,0.282,0.272,0.359,0.359,0.34,0.301,0.272,Stability,SBI_STAAM,Medium,Prokaryote
+0.164,0.22,0.201,0.202,0.218,0.204,0.159,0.298,0.217,0.238,0.242,0.212,0.222,0.106,0.135,0.177,0.196,0.213,0.217,0.262,0.244,0.306,0.304,0.286,0.251,0.294,0.285,0.273,0.269,0.226,0.195,0.139,0.187,0.257,0.294,0.289,0.215,0.241,0.251,0.209,0.221,0.217,0.205,0.096,0.247,0.213,0.232,0.266,0.219,0.142,0.193,0.206,0.221,0.219,0.211,0.215,0.21,0.199,0.187,0.201,0.221,0.205,Activity,SC6A4_HUMAN,Medium,Human
+0.156,0.139,0.123,0.139,0.131,0.139,0.238,0.074,0.23,0.189,0.246,0.246,0.254,0.172,0.246,0.279,0.328,0.262,0.303,0.098,0.164,0.189,0.197,0.23,0.27,0.254,0.279,0.254,0.238,0.131,0.098,0.082,0.148,0.115,0.123,0.213,0.164,0.172,0.172,0.123,0.148,0.156,0.254,0.238,0.262,0.27,0.287,0.303,0.221,0.197,0.328,0.336,0.336,0.352,0.32,0.402,0.311,0.385,0.27,0.352,0.369,0.393,Stability,SCIN_STAAR,High,Prokaryote
+0.13,0.087,0.13,0.13,0.13,0.13,0.217,0.174,0.174,0.174,0.13,0.13,0.174,0.174,0.13,0.087,0.13,0.13,0.174,0.174,0.13,0.174,0.13,0.13,0.13,0.13,0.13,0.087,0.13,0.13,0.087,0.174,0.174,0.087,0.13,0.174,0.043,0.13,0.174,0.087,0.13,0.13,0.174,0.174,0.174,0.087,0.043,0.043,0.13,0.043,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.174,0.043,0.13,OrganismalFitness,SCN5A_HUMAN,Medium,Human
+0.531,0.52,0.574,0.563,0.588,0.596,0.13,0.458,0.552,0.578,0.538,0.495,0.516,0.209,0.264,0.523,0.538,0.567,0.545,0.531,0.141,0.181,0.191,0.274,0.191,0.466,0.101,0.347,0.52,0.592,0.509,0.469,0.014,0.116,0.264,0.462,0.542,0.542,0.523,0.578,0.585,0.574,0.224,0.184,0.52,0.264,0.534,0.552,0.592,0.563,0.563,0.567,0.57,0.585,0.57,0.57,0.592,0.578,0.596,0.592,0.592,0.585,Stability,SDA_BACSU,Medium,Prokaryote
+0.13,0.156,0.135,0.146,0.141,0.156,0.089,0.146,0.135,0.13,0.135,0.156,0.12,0.073,0.12,0.141,0.12,0.146,0.125,0.135,0.13,0.13,0.151,0.141,0.109,0.109,0.13,0.13,0.156,0.151,0.13,0.161,0.115,0.135,0.167,0.156,0.146,0.135,0.151,0.146,0.141,0.141,0.125,0.073,0.125,0.12,0.135,0.125,0.141,0.141,0.141,0.125,0.13,0.135,0.125,0.109,0.12,0.141,0.12,0.125,0.151,0.135,OrganismalFitness,SERC_HUMAN,High,Human
+0.14,0.148,0.156,0.156,0.156,0.154,0.123,0.143,0.16,0.15,0.139,0.145,0.147,0.124,0.12,0.117,0.138,0.127,0.117,0.144,0.124,0.125,0.138,0.123,0.109,0.127,0.134,0.141,0.137,0.143,0.15,0.153,0.111,0.12,0.142,0.144,0.132,0.148,0.136,0.149,0.153,0.157,0.127,0.126,0.138,0.119,0.139,0.139,0.119,0.119,0.143,0.151,0.166,0.146,0.152,0.158,0.154,0.142,0.138,0.148,0.124,0.123,OrganismalFitness,SHOC2_HUMAN,Medium,Human
+0.168,0.149,0.089,0.109,0.119,0.129,0.139,0.089,0.089,0.139,0.139,0.158,0.129,0.178,0.149,0.208,0.188,0.178,0.109,0.099,0.089,0.05,0.099,0.069,0.099,0.089,0.079,0.099,0.119,0.099,0.099,0.089,0.099,0.099,0.05,0.079,0.149,0.099,0.099,0.149,0.119,0.109,0.149,0.099,0.109,0.178,0.139,0.139,0.139,0.178,0.178,0.218,0.228,0.208,0.198,0.208,0.178,0.188,0.178,0.218,0.178,0.158,Stability,SOX30_HUMAN,High,Human
+0.327,0.384,0.346,0.346,0.36,0.351,0.085,0.275,0.251,0.246,0.289,0.095,0.1,0.076,0.104,0.071,0.152,0.171,0.142,0.327,0.028,0.118,0.308,0.156,0.081,0.057,0.109,0.303,0.322,0.218,0.237,0.204,0.043,0.09,0.085,0.123,0.332,0.336,0.332,0.37,0.379,0.379,0.085,0.1,0.185,0.166,0.28,0.265,0.37,0.355,0.341,0.384,0.389,0.417,0.351,0.379,0.36,0.398,0.351,0.393,0.37,0.318,Stability,SPA_STAAU,Medium,Prokaryote
+0.172,0.206,0.14,0.152,0.202,0.204,0.132,0.124,0.184,0.218,0.25,0.242,0.233,0.248,0.242,0.244,0.256,0.257,0.268,0.222,0.222,0.209,0.213,0.222,0.207,0.223,0.227,0.222,0.224,0.207,0.286,0.281,0.139,0.213,0.194,0.214,0.194,0.188,0.202,0.21,0.207,0.215,0.13,0.122,0.161,0.127,0.224,0.214,0.275,0.191,0.292,0.27,0.281,0.282,0.276,0.274,0.294,0.287,0.289,0.287,0.271,0.257,Binding,SPG1_STRSG,Low,Prokaryote
+0.172,0.206,0.14,0.152,0.202,0.204,0.132,0.124,0.184,0.218,0.25,0.242,0.233,0.248,0.242,0.244,0.256,0.257,0.268,0.222,0.222,0.209,0.213,0.222,0.207,0.223,0.227,0.222,0.224,0.207,0.286,0.281,0.139,0.213,0.194,0.214,0.194,0.188,0.202,0.21,0.207,0.215,0.13,0.122,0.161,0.127,0.224,0.214,0.275,0.191,0.292,0.27,0.281,0.282,0.276,0.274,0.294,0.287,0.289,0.287,0.271,0.257,Binding,SPG1_STRSG,Low,Prokaryote
+0.172,0.206,0.14,0.152,0.202,0.204,0.132,0.124,0.184,0.218,0.25,0.242,0.233,0.248,0.242,0.244,0.256,0.257,0.268,0.222,0.222,0.209,0.213,0.222,0.207,0.223,0.227,0.222,0.224,0.207,0.286,0.281,0.139,0.213,0.194,0.214,0.194,0.188,0.202,0.21,0.207,0.215,0.13,0.122,0.161,0.127,0.224,0.214,0.275,0.191,0.292,0.27,0.281,0.282,0.276,0.274,0.294,0.287,0.289,0.287,0.271,0.257,Binding,SPG1_STRSG,Medium,Prokaryote
+0.172,0.206,0.14,0.152,0.202,0.204,0.132,0.124,0.184,0.218,0.25,0.242,0.233,0.248,0.242,0.244,0.256,0.257,0.268,0.222,0.222,0.209,0.213,0.222,0.207,0.223,0.227,0.222,0.224,0.207,0.286,0.281,0.139,0.213,0.194,0.214,0.194,0.188,0.202,0.21,0.207,0.215,0.13,0.122,0.161,0.127,0.224,0.214,0.275,0.191,0.292,0.27,0.281,0.282,0.276,0.274,0.294,0.287,0.289,0.287,0.271,0.257,Binding,SPG1_STRSG,Medium,Prokaryote
+0.151,0.096,0.13,0.11,0.144,0.13,0.219,0.151,0.137,0.205,0.233,0.24,0.205,0.171,0.219,0.199,0.226,0.274,0.233,0.192,0.185,0.253,0.199,0.144,0.123,0.205,0.178,0.26,0.082,0.212,0.144,0.103,0.021,0.164,0.301,0.137,0.212,0.233,0.171,0.164,0.158,0.158,0.205,0.158,0.199,0.178,0.205,0.212,0.212,0.185,0.185,0.192,0.26,0.212,0.219,0.233,0.219,0.192,0.185,0.212,0.288,0.205,Stability,SPG2_STRSG,Medium,Prokaryote
+0.148,0.23,0.16,0.194,0.214,0.218,0.098,0.211,0.222,0.238,0.1,0.11,0.111,0.106,0.13,0.112,0.111,0.096,0.084,0.229,0.186,0.2,0.192,0.186,0.204,0.194,0.18,0.198,0.179,0.232,0.207,0.207,0.144,0.19,0.198,0.219,0.192,0.201,0.213,0.222,0.23,0.229,0.102,0.098,0.119,0.117,0.324,0.296,0.261,0.162,0.196,0.239,0.25,0.237,0.25,0.242,0.233,0.215,0.225,0.246,0.274,0.179,Binding,SPIKE_SARS2,Medium,Virus
+0.148,0.23,0.16,0.194,0.214,0.218,0.098,0.211,0.222,0.238,0.1,0.11,0.111,0.106,0.13,0.112,0.111,0.096,0.084,0.229,0.186,0.2,0.192,0.186,0.204,0.194,0.18,0.198,0.179,0.232,0.207,0.207,0.144,0.19,0.198,0.219,0.192,0.201,0.213,0.222,0.23,0.229,0.102,0.098,0.119,0.117,0.324,0.296,0.261,0.162,0.196,0.239,0.25,0.237,0.25,0.242,0.233,0.215,0.225,0.246,0.274,0.179,Expression,SPIKE_SARS2,Medium,Virus
+0.399,0.486,0.396,0.374,0.433,0.396,0.231,0.318,0.371,0.368,0.495,0.386,0.449,0.065,0.417,0.436,0.386,0.517,0.399,0.427,0.38,0.452,0.38,0.393,0.399,0.467,0.421,0.449,0.47,0.427,0.47,0.486,0.1,0.374,0.33,0.346,0.411,0.433,0.396,0.396,0.396,0.399,0.283,0.125,0.346,0.312,0.374,0.383,0.567,0.558,0.558,0.548,0.558,0.539,0.558,0.548,0.551,0.567,0.536,0.567,0.533,0.349,Stability,SPTN1_CHICK,High,Eukaryote
+0.296,0.324,0.423,0.437,0.437,0.423,0.042,0.239,0.324,0.38,0.31,0.155,0.296,0.155,0.254,0.282,0.352,0.324,0.352,0.338,0.141,0.296,0.211,0.296,0.324,0.31,0.268,0.268,0.31,0.268,0.239,0.254,0.169,0.183,0.254,0.324,0.338,0.423,0.394,0.451,0.423,0.408,0.183,0.169,0.282,0.38,0.408,0.282,0.366,0.408,0.366,0.366,0.352,0.338,0.296,0.38,0.324,0.296,0.296,0.352,0.296,0.324,Stability,SQSTM_MOUSE,Medium,Eukaryote
+0.308,0.308,0.302,0.283,0.296,0.302,0.057,0.157,0.252,0.264,0.377,0.327,0.34,0.038,0.346,0.371,0.384,0.409,0.409,0.308,0.044,0.189,0.22,0.17,0.245,0.233,0.277,0.296,0.302,0.189,0.327,0.264,0.126,0.189,0.132,0.208,0.352,0.289,0.296,0.321,0.314,0.296,0.277,0.057,0.358,0.333,0.233,0.384,0.428,0.453,0.346,0.34,0.358,0.39,0.352,0.327,0.365,0.39,0.365,0.358,0.44,0.365,Stability,SR43C_ARATH,High,Eukaryote
+0.231,0.179,0.378,0.333,0.256,0.301,0.218,0.237,0.276,0.314,0.308,0.288,0.301,0.179,0.333,0.301,0.263,0.288,0.333,0.199,0.269,0.244,0.269,0.205,0.231,0.237,0.263,0.192,0.212,0.301,0.186,0.167,0.212,0.237,0.231,0.269,0.256,0.256,0.276,0.308,0.321,0.295,0.231,0.083,0.269,0.288,0.301,0.218,0.378,0.353,0.327,0.327,0.314,0.308,0.314,0.333,0.321,0.327,0.295,0.333,0.237,0.327,Stability,SRBS1_HUMAN,High,Human
+0.113,0.095,0.092,0.094,0.085,0.079,0.113,0.074,0.082,0.084,0.079,0.093,0.09,0.14,0.149,0.113,0.101,0.11,0.089,0.07,0.08,0.059,0.083,0.062,0.076,0.075,0.081,0.066,0.064,0.115,0.07,0.092,0.114,0.078,0.069,0.075,0.081,0.081,0.092,0.084,0.076,0.081,0.126,0.131,0.067,0.097,0.101,0.08,0.074,0.092,0.114,0.126,0.1,0.114,0.121,0.094,0.095,0.119,0.11,0.11,0.085,0.144,Activity,SRC_HUMAN,Medium,Human
+0.113,0.095,0.092,0.094,0.085,0.079,0.113,0.074,0.082,0.084,0.079,0.093,0.09,0.14,0.149,0.113,0.101,0.11,0.089,0.07,0.08,0.059,0.083,0.062,0.076,0.075,0.081,0.066,0.064,0.115,0.07,0.092,0.114,0.078,0.069,0.075,0.081,0.081,0.092,0.084,0.076,0.081,0.126,0.131,0.067,0.097,0.101,0.08,0.074,0.092,0.114,0.126,0.1,0.114,0.121,0.094,0.095,0.119,0.11,0.11,0.085,0.144,Activity,SRC_HUMAN,Medium,Human
+0.113,0.095,0.092,0.094,0.085,0.079,0.113,0.074,0.082,0.084,0.079,0.093,0.09,0.14,0.149,0.113,0.101,0.11,0.089,0.07,0.08,0.059,0.083,0.062,0.076,0.075,0.081,0.066,0.064,0.115,0.07,0.092,0.114,0.078,0.069,0.075,0.081,0.081,0.092,0.084,0.076,0.081,0.126,0.131,0.067,0.097,0.101,0.08,0.074,0.092,0.114,0.126,0.1,0.114,0.121,0.094,0.095,0.119,0.11,0.11,0.085,0.144,OrganismalFitness,SRC_HUMAN,Medium,Human
+0.118,0.141,0.124,0.141,0.112,0.118,0.082,0.165,0.165,0.147,0.229,0.188,0.206,0.071,0.141,0.153,0.182,0.153,0.159,0.176,0.118,0.159,0.153,0.176,0.182,0.176,0.147,0.2,0.188,0.329,0.194,0.2,0.088,0.088,0.176,0.176,0.124,0.135,0.188,0.129,0.129,0.141,0.124,0.082,0.171,0.129,0.129,0.182,0.188,0.141,0.153,0.153,0.159,0.165,0.171,0.165,0.165,0.176,0.176,0.176,0.194,0.106,OrganismalFitness,SUMO1_HUMAN,High,Human
+0.064,0.032,0.036,0.024,0.036,0.024,0.08,0.044,0.052,0.036,0.028,0.04,0.024,0.036,0.06,0.044,0.032,0.032,0.032,0.04,0.056,0.044,0.02,0.04,0.048,0.028,0.032,0.012,0.036,0.06,0.044,0.06,0.032,0.028,0.036,0.036,0.02,0.032,0.024,0.028,0.028,0.02,0.052,0.064,0.024,0.044,0.064,0.024,0.04,0.076,0.036,0.024,0.032,0.016,0.036,0.04,0.048,0.036,0.044,0.028,0.052,0.048,OrganismalFitness,SYUA_HUMAN,Medium,Human
+0.008,0.017,0.008,0.008,0.008,0.008,0.267,0.0,0.008,0.008,0.067,0.092,0.042,0.125,0.133,0.1,0.05,0.017,0.008,0.008,0.158,0.017,0.008,0.008,0.217,0.033,0.0,0.008,0.008,0.008,0.0,0.0,0.017,0.2,0.267,0.0,0.05,0.092,0.008,0.008,0.017,0.0,0.183,0.25,0.067,0.083,0.2,0.067,0.208,0.117,0.142,0.1,0.1,0.108,0.092,0.117,0.058,0.083,0.1,0.108,0.142,0.158,OrganismalFitness,TADBP_HUMAN,Low,Human
+0.12,0.139,0.139,0.139,0.139,0.139,0.076,0.133,0.158,0.171,0.127,0.152,0.139,0.114,0.127,0.082,0.089,0.108,0.12,0.158,0.12,0.152,0.152,0.158,0.133,0.146,0.101,0.133,0.133,0.158,0.095,0.095,0.12,0.127,0.139,0.146,0.139,0.127,0.127,0.146,0.139,0.146,0.082,0.063,0.108,0.114,0.101,0.108,0.101,0.089,0.07,0.101,0.133,0.133,0.108,0.101,0.082,0.082,0.133,0.095,0.089,0.114,OrganismalFitness,TAT_HV1BR,High,Virus
+0.396,0.34,0.377,0.387,0.349,0.349,0.443,0.085,0.368,0.302,0.104,0.217,0.226,0.368,0.17,0.283,0.358,0.151,0.226,0.132,0.094,0.075,0.104,0.142,0.123,0.085,0.104,0.142,0.057,0.085,0.075,0.047,0.349,0.132,0.16,0.274,0.302,0.34,0.34,0.311,0.34,0.311,0.33,0.16,0.17,0.264,0.189,0.142,0.274,0.358,0.311,0.245,0.208,0.349,0.283,0.34,0.292,0.302,0.245,0.292,0.358,0.236,Stability,TCRG1_MOUSE,Medium,Eukaryote
+0.297,0.289,0.352,0.375,0.391,0.391,0.211,0.227,0.312,0.359,0.312,0.375,0.352,0.102,0.352,0.344,0.336,0.328,0.359,0.391,0.164,0.289,0.227,0.141,0.273,0.305,0.086,0.289,0.422,0.336,0.281,0.219,0.352,0.117,0.336,0.211,0.367,0.43,0.352,0.367,0.422,0.391,0.281,0.258,0.383,0.297,0.422,0.445,0.445,0.508,0.375,0.359,0.398,0.359,0.383,0.359,0.367,0.359,0.344,0.352,0.406,0.391,Stability,THO1_YEAST,High,Eukaryote
+0.297,0.277,0.243,0.243,0.216,0.216,0.034,0.196,0.297,0.338,0.25,0.23,0.25,0.061,0.27,0.291,0.257,0.25,0.243,0.196,0.209,0.291,0.257,0.277,0.216,0.284,0.203,0.264,0.23,0.23,0.297,0.284,0.142,0.135,0.209,0.216,0.277,0.291,0.27,0.236,0.23,0.236,0.25,0.162,0.351,0.236,0.345,0.345,0.372,0.419,0.223,0.297,0.27,0.264,0.27,0.291,0.297,0.277,0.284,0.264,0.338,0.311,Stability,TNKS2_HUMAN,High,Human
+0.135,0.129,0.132,0.11,0.122,0.122,0.097,0.132,0.135,0.116,0.141,0.15,0.119,0.091,0.129,0.132,0.125,0.129,0.144,0.122,0.094,0.132,0.107,0.141,0.113,0.144,0.125,0.11,0.135,0.15,0.129,0.119,0.088,0.094,0.144,0.138,0.129,0.125,0.138,0.122,0.119,0.135,0.103,0.088,0.116,0.094,0.147,0.138,0.141,0.138,0.132,0.147,0.129,0.15,0.144,0.141,0.141,0.135,0.135,0.122,0.141,0.113,OrganismalFitness,TPK1_HUMAN,Medium,Human
+0.181,0.219,0.216,0.197,0.195,0.197,0.107,0.189,0.189,0.192,0.205,0.197,0.2,0.132,0.156,0.205,0.203,0.2,0.184,0.208,0.137,0.164,0.205,0.205,0.167,0.186,0.208,0.192,0.205,0.216,0.244,0.2,0.132,0.129,0.2,0.175,0.175,0.205,0.175,0.205,0.214,0.208,0.132,0.09,0.205,0.173,0.178,0.238,0.222,0.173,0.175,0.173,0.178,0.175,0.181,0.178,0.173,0.197,0.178,0.178,0.197,0.181,Expression,TPMT_HUMAN,Medium,Human
+0.368,0.211,0.404,0.281,0.228,0.228,0.351,0.228,0.281,0.228,0.298,0.281,0.298,0.211,0.263,0.263,0.193,0.228,0.246,0.175,0.246,0.281,0.228,0.263,0.298,0.193,0.351,0.351,0.298,0.175,0.175,0.123,0.386,0.246,0.263,0.281,0.368,0.351,0.263,0.281,0.298,0.228,0.228,0.246,0.211,0.298,0.105,0.281,0.298,0.193,0.263,0.246,0.263,0.263,0.281,0.281,0.281,0.263,0.281,0.281,0.281,0.368,OrganismalFitness,TPOR_HUMAN,Low,Human
+0.382,0.513,0.461,0.474,0.48,0.474,0.125,0.414,0.48,0.48,0.454,0.428,0.434,0.191,0.329,0.401,0.48,0.5,0.447,0.48,0.178,0.355,0.388,0.414,0.395,0.322,0.401,0.421,0.487,0.513,0.474,0.395,0.158,0.322,0.355,0.342,0.375,0.414,0.408,0.461,0.454,0.454,0.237,0.145,0.388,0.296,0.283,0.434,0.454,0.151,0.428,0.441,0.428,0.461,0.421,0.447,0.421,0.447,0.441,0.467,0.513,0.329,OrganismalFitness,TRPC_SACS2,Medium,Prokaryote
+0.303,0.355,0.276,0.316,0.342,0.329,0.197,0.243,0.289,0.336,0.316,0.309,0.336,0.211,0.316,0.336,0.368,0.349,0.336,0.355,0.23,0.283,0.316,0.382,0.329,0.329,0.316,0.362,0.368,0.224,0.336,0.257,0.092,0.27,0.349,0.362,0.309,0.329,0.322,0.336,0.342,0.336,0.316,0.086,0.349,0.303,0.309,0.368,0.316,0.191,0.336,0.349,0.329,0.362,0.342,0.322,0.342,0.322,0.289,0.336,0.382,0.375,OrganismalFitness,TRPC_THEMA,Medium,Prokaryote
+0.152,0.187,0.171,0.171,0.187,0.187,0.07,0.125,0.14,0.156,0.16,0.144,0.144,0.074,0.093,0.144,0.152,0.16,0.148,0.16,0.109,0.121,0.128,0.109,0.128,0.144,0.128,0.136,0.125,0.144,0.097,0.101,0.054,0.109,0.113,0.128,0.136,0.148,0.16,0.167,0.183,0.175,0.144,0.086,0.128,0.171,0.156,0.16,0.132,0.144,0.144,0.163,0.191,0.16,0.152,0.175,0.167,0.148,0.148,0.16,0.144,0.132,OrganismalFitness,UBC9_HUMAN,Medium,Human
+0.251,0.386,0.408,0.399,0.427,0.424,0.193,0.333,0.383,0.43,0.49,0.433,0.455,0.229,0.427,0.51,0.499,0.477,0.46,0.402,0.143,0.38,0.397,0.43,0.322,0.408,0.413,0.405,0.419,0.386,0.466,0.405,0.328,0.132,0.16,0.446,0.311,0.311,0.402,0.435,0.438,0.427,0.264,0.171,0.27,0.311,0.372,0.325,0.521,0.342,0.512,0.545,0.534,0.534,0.543,0.51,0.512,0.529,0.526,0.526,0.499,0.388,Stability,UBE4B_HUMAN,High,Human
+0.156,0.122,0.122,0.122,0.1,0.122,0.122,0.122,0.133,0.111,0.111,0.133,0.111,0.078,0.122,0.122,0.122,0.122,0.089,0.056,0.089,0.056,0.089,0.1,0.078,0.089,0.089,0.1,0.1,0.111,0.056,0.067,0.1,0.111,0.078,0.089,0.156,0.156,0.156,0.122,0.122,0.122,0.1,0.144,0.067,0.122,0.133,0.089,0.044,0.156,0.156,0.122,0.133,0.122,0.133,0.167,0.144,0.133,0.133,0.156,0.167,0.156,Activity,UBE4B_MOUSE,Low,Eukaryote
+0.171,0.192,0.137,0.151,0.164,0.164,0.185,0.068,0.158,0.144,0.13,0.185,0.24,0.192,0.192,0.205,0.096,0.205,0.185,0.13,0.137,0.158,0.164,0.178,0.199,0.13,0.178,0.158,0.192,0.178,0.164,0.089,0.185,0.185,0.178,0.192,0.144,0.164,0.164,0.164,0.158,0.192,0.212,0.212,0.212,0.164,0.226,0.185,0.192,0.253,0.164,0.219,0.247,0.24,0.24,0.212,0.219,0.226,0.212,0.233,0.212,0.24,Stability,UBR5_HUMAN,Medium,Human
+0.288,0.178,0.164,0.164,0.192,0.205,0.233,0.11,0.178,0.26,0.178,0.274,0.247,0.274,0.219,0.192,0.247,0.178,0.123,0.123,0.233,0.219,0.11,0.11,0.205,0.096,0.151,0.123,0.082,0.205,0.096,0.068,0.123,0.192,0.26,0.082,0.233,0.329,0.164,0.205,0.205,0.164,0.205,0.26,0.219,0.164,0.082,0.068,0.096,0.205,0.205,0.151,0.233,0.205,0.233,0.233,0.205,0.247,0.274,0.219,0.26,0.192,Stability,VG08_BPP22,High,Virus
+0.249,0.436,0.502,0.525,0.486,0.502,0.125,0.28,0.444,0.459,0.432,0.412,0.444,0.171,0.117,0.463,0.553,0.494,0.401,0.467,0.109,0.265,0.284,0.296,0.346,0.346,0.331,0.292,0.428,0.447,0.463,0.424,0.187,0.148,0.346,0.319,0.346,0.42,0.385,0.467,0.486,0.463,0.187,0.237,0.479,0.385,0.428,0.475,0.541,0.397,0.541,0.51,0.545,0.545,0.514,0.556,0.553,0.533,0.525,0.556,0.556,0.424,Stability,VILI_CHICK,High,Eukaryote
+0.212,0.172,0.2,0.195,0.184,0.191,0.104,0.16,0.184,0.188,0.148,0.168,0.169,0.098,0.164,0.199,0.188,0.182,0.167,0.204,0.14,0.126,0.196,0.164,0.156,0.129,0.155,0.138,0.112,0.188,0.14,0.138,0.156,0.108,0.126,0.12,0.186,0.188,0.164,0.176,0.166,0.186,0.104,0.08,0.169,0.15,0.188,0.18,0.193,0.153,0.17,0.169,0.174,0.193,0.226,0.188,0.175,0.162,0.172,0.182,0.156,0.172,Expression,VKOR1_HUMAN,Medium,Human
+0.212,0.172,0.2,0.195,0.184,0.191,0.104,0.16,0.184,0.188,0.148,0.168,0.169,0.098,0.164,0.199,0.188,0.182,0.167,0.204,0.14,0.126,0.196,0.164,0.156,0.129,0.155,0.138,0.112,0.188,0.14,0.138,0.156,0.108,0.126,0.12,0.186,0.188,0.164,0.176,0.166,0.186,0.104,0.08,0.169,0.15,0.188,0.18,0.193,0.153,0.17,0.169,0.174,0.193,0.226,0.188,0.175,0.162,0.172,0.182,0.156,0.172,Activity,VKOR1_HUMAN,Medium,Human
+0.048,0.152,0.143,0.162,0.133,0.124,0.076,0.048,0.095,0.057,0.181,0.133,0.133,0.114,0.133,0.19,0.229,0.257,0.162,0.076,0.086,0.143,0.095,0.133,0.152,0.105,0.086,0.124,0.248,0.124,0.2,0.19,0.114,0.105,0.162,0.133,0.076,0.095,0.095,0.143,0.133,0.124,0.152,0.105,0.181,0.124,0.276,0.238,0.21,0.267,0.333,0.267,0.333,0.333,0.295,0.362,0.352,0.324,0.343,0.352,0.381,0.267,Stability,VRPI_BPT7,Medium,Virus
+0.18,0.275,0.302,0.302,0.296,0.296,0.069,0.249,0.36,0.349,0.265,0.122,0.312,0.079,0.095,0.344,0.392,0.487,0.439,0.317,0.048,0.058,0.048,0.212,0.063,0.074,0.053,0.074,0.36,0.328,0.307,0.265,0.053,0.048,0.116,0.392,0.18,0.243,0.333,0.296,0.312,0.339,0.164,0.079,0.18,0.175,0.439,0.392,0.434,0.365,0.429,0.444,0.381,0.423,0.429,0.45,0.439,0.429,0.402,0.444,0.439,0.222,Stability,YAIA_ECOLI,Medium,Prokaryote
+0.22,0.155,0.232,0.231,0.233,0.228,0.176,0.073,0.065,0.057,0.175,0.135,0.116,0.181,0.202,0.225,0.209,0.191,0.152,0.093,0.08,0.068,0.065,0.086,0.096,0.06,0.066,0.089,0.062,0.136,0.125,0.136,0.138,0.133,0.067,0.092,0.188,0.127,0.15,0.209,0.184,0.204,0.259,0.034,0.178,0.263,0.148,0.182,0.125,0.162,0.115,0.123,0.138,0.151,0.107,0.13,0.149,0.147,0.166,0.137,0.234,0.195,Binding,YAP1_HUMAN,Low,Human
+0.374,0.374,0.378,0.374,0.378,0.378,0.383,0.33,0.391,0.409,0.457,0.448,0.457,0.461,0.474,0.439,0.483,0.47,0.43,0.378,0.435,0.491,0.509,0.517,0.478,0.443,0.478,0.43,0.422,0.435,0.422,0.391,0.439,0.435,0.452,0.426,0.426,0.391,0.383,0.4,0.387,0.391,0.422,0.361,0.422,0.439,0.439,0.496,0.548,0.53,0.53,0.474,0.504,0.504,0.513,0.517,0.496,0.5,0.491,0.496,0.457,0.591,Stability,YNZC_BACSU,Medium,Prokaryote
+0.207,0.23,0.231,0.233,0.237,0.237,0.14,0.181,0.221,0.232,0.213,0.201,0.215,0.147,0.185,0.208,0.223,0.218,0.217,0.21,0.179,0.191,0.199,0.196,0.184,0.203,0.201,0.206,0.209,0.218,0.212,0.194,0.132,0.174,0.186,0.203,0.219,0.222,0.227,0.237,0.236,0.237,0.175,0.132,0.209,0.188,0.221,0.228,0.23,0.203,0.235,0.238,0.238,0.238,0.238,0.239,0.235,0.237,0.235,0.241,0.235,0.214,,,,
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.csv b/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.csv
new file mode 100644
index 0000000..00ea548
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.csv
@@ -0,0 +1,63 @@
+Model_rank,Model_name,Model type,Average_Top_recall,Bootstrap_standard_error_Top_recall,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,SaProt (650M),Hybrid - Structure & PLM,0.232,0.0,0.191,0.233,0.235,0.17,0.331,0.191,0.232,0.278,0.23,0.269,0.266,0.196,0.207,0.261,0.189,0.185,0.244,SaProt (650M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+2,TranceptEVE S,Hybrid - Alignment & PLM,0.232,0.006,0.198,0.223,0.226,0.228,0.286,0.231,0.242,0.254,0.226,0.27,0.231,0.286,0.21,0.258,0.213,0.208,0.278,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+3,EVE (single),Alignment-based model,0.23,0.006,0.199,0.217,0.223,0.228,0.284,0.226,0.241,0.254,0.225,0.27,0.235,0.281,0.208,0.258,0.239,0.217,0.279,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+4,TranceptEVE M,Hybrid - Alignment & PLM,0.23,0.006,0.198,0.218,0.227,0.228,0.28,0.23,0.242,0.248,0.224,0.271,0.227,0.286,0.209,0.252,0.213,0.212,0.278,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+5,TranceptEVE L,Hybrid - Alignment & PLM,0.23,0.006,0.2,0.218,0.224,0.229,0.279,0.227,0.241,0.251,0.222,0.271,0.234,0.283,0.208,0.257,0.23,0.204,0.277,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+6,EVE (ensemble),Alignment-based model,0.23,0.006,0.199,0.214,0.225,0.228,0.283,0.227,0.241,0.253,0.225,0.271,0.232,0.281,0.209,0.259,0.244,0.222,0.272,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+7,ProtSSN (ensemble),Hybrid - Structure & PLM,0.227,0.005,0.191,0.196,0.22,0.191,0.337,0.192,0.239,0.284,0.228,0.278,0.274,0.215,0.214,0.248,0.21,0.186,0.232,ProtSSN (ensemble of 9 models),"Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+8,ProtSSN (k=30 h=768),Hybrid - Structure & PLM,0.227,0.005,0.19,0.202,0.228,0.185,0.329,0.193,0.233,0.281,0.227,0.275,0.269,0.208,0.209,0.241,0.195,0.181,0.227,"ProtSSN (k=30, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+9,DeepSequence (ensemble),Alignment-based model,0.226,0.006,0.194,0.213,0.218,0.223,0.282,0.216,0.235,0.255,0.223,0.27,0.229,0.267,0.203,0.257,0.222,0.2,0.276,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+10,ProtSSN (k=20 h=1280),Hybrid - Structure & PLM,0.226,0.005,0.187,0.2,0.215,0.19,0.335,0.19,0.236,0.285,0.228,0.28,0.267,0.213,0.21,0.252,0.208,0.191,0.242,"ProtSSN (k=20, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+11,DeepSequence (single),Alignment-based model,0.225,0.006,0.199,0.216,0.214,0.219,0.28,0.218,0.235,0.252,0.225,0.271,0.226,0.257,0.204,0.256,0.196,0.198,0.272,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+12,MIF-ST,Hybrid - Structure & PLM,0.225,0.004,0.191,0.223,0.221,0.192,0.3,0.194,0.232,0.261,0.215,0.258,0.264,0.226,0.206,0.23,0.22,0.181,0.267,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+13,ProtSSN (k=10 h=1280),Hybrid - Structure & PLM,0.225,0.005,0.187,0.195,0.22,0.189,0.335,0.189,0.237,0.282,0.229,0.27,0.269,0.214,0.212,0.232,0.183,0.164,0.216,"ProtSSN (k=10, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+14,ProtSSN (k=10 h=768),Hybrid - Structure & PLM,0.225,0.005,0.193,0.195,0.22,0.187,0.33,0.186,0.239,0.278,0.231,0.27,0.27,0.203,0.208,0.233,0.205,0.167,0.215,"ProtSSN (k=10, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+15,ProtSSN (k=20 h=512),Hybrid - Structure & PLM,0.224,0.005,0.196,0.196,0.212,0.187,0.331,0.188,0.237,0.279,0.226,0.276,0.271,0.208,0.208,0.247,0.215,0.192,0.238,"ProtSSN (k=20, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+16,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.224,0.007,0.2,0.204,0.218,0.219,0.278,0.227,0.232,0.251,0.216,0.264,0.233,0.275,0.204,0.232,0.223,0.235,0.289,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+17,ProtSSN (k=20 h=768),Hybrid - Structure & PLM,0.224,0.006,0.189,0.19,0.218,0.189,0.332,0.18,0.235,0.283,0.227,0.27,0.268,0.213,0.209,0.254,0.19,0.177,0.24,"ProtSSN (k=20, h=768)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+18,ProtSSN (k=30 h=512),Hybrid - Structure & PLM,0.223,0.004,0.189,0.2,0.218,0.183,0.327,0.183,0.234,0.278,0.223,0.274,0.266,0.205,0.205,0.243,0.191,0.176,0.222,"ProtSSN (k=30, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+19,ProtSSN (k=30 h=1280),Hybrid - Structure & PLM,0.223,0.005,0.187,0.199,0.218,0.189,0.321,0.195,0.229,0.276,0.225,0.266,0.265,0.207,0.208,0.248,0.194,0.178,0.226,"ProtSSN (k=30, h=1280)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+20,ProtSSN (k=10 h=512),Hybrid - Structure & PLM,0.222,0.005,0.188,0.198,0.216,0.184,0.325,0.192,0.23,0.277,0.224,0.273,0.269,0.195,0.205,0.244,0.192,0.166,0.214,"ProtSSN (k=10, h=512)","Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv."
+21,ESM-IF1,Inverse folding model,0.222,0.005,0.184,0.208,0.205,0.17,0.344,0.181,0.228,0.288,0.224,0.279,0.268,0.194,0.209,0.27,0.176,0.172,0.257,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+22,EVmutation,Alignment-based model,0.222,0.007,0.201,0.204,0.209,0.223,0.272,0.213,0.235,0.249,0.212,0.262,0.236,0.278,0.2,0.257,0.228,0.219,0.296,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+23,Tranception L,Hybrid - Alignment & PLM,0.22,0.007,0.2,0.205,0.21,0.221,0.265,0.226,0.232,0.239,0.216,0.259,0.223,0.271,0.202,0.251,0.208,0.197,0.268,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+24,Tranception M,Hybrid - Alignment & PLM,0.218,0.006,0.188,0.208,0.211,0.21,0.271,0.227,0.228,0.231,0.218,0.26,0.2,0.27,0.199,0.246,0.164,0.19,0.261,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+25,ESM2 (650M),Protein language model,0.217,0.006,0.178,0.213,0.215,0.177,0.3,0.178,0.213,0.27,0.221,0.257,0.251,0.168,0.196,0.233,0.179,0.152,0.179,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+26,Tranception S,Hybrid - Alignment & PLM,0.216,0.006,0.186,0.215,0.208,0.202,0.271,0.222,0.22,0.238,0.214,0.256,0.208,0.257,0.197,0.243,0.168,0.19,0.262,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+27,MIF,Inverse folding model,0.216,0.007,0.178,0.201,0.214,0.16,0.327,0.194,0.218,0.265,0.221,0.25,0.246,0.206,0.21,0.244,0.189,0.16,0.241,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+28,ESM2 (3B),Protein language model,0.213,0.006,0.176,0.207,0.212,0.184,0.286,0.187,0.215,0.255,0.213,0.256,0.246,0.18,0.192,0.231,0.17,0.168,0.177,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+29,MSA Transformer (single),Hybrid - Alignment & PLM,0.213,0.007,0.19,0.194,0.201,0.219,0.259,0.214,0.225,0.237,0.206,0.258,0.221,0.26,0.192,0.216,0.23,0.234,0.29,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+30,GEMME,Alignment-based model,0.211,0.007,0.196,0.198,0.198,0.223,0.24,0.213,0.228,0.222,0.202,0.231,0.218,0.284,0.186,0.228,0.188,0.211,0.296,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+31,ESM-1v (ensemble),Protein language model,0.211,0.006,0.173,0.213,0.21,0.193,0.265,0.182,0.21,0.251,0.209,0.236,0.234,0.203,0.193,0.213,0.17,0.151,0.17,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+32,SaProt (35M),Hybrid - Structure & PLM,0.211,0.005,0.168,0.198,0.225,0.152,0.311,0.172,0.209,0.26,0.218,0.261,0.237,0.147,0.203,0.274,0.15,0.166,0.186,SaProt (35M),"Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+33,ESM2 (15B),Protein language model,0.208,0.007,0.18,0.196,0.199,0.195,0.27,0.184,0.217,0.245,0.209,0.242,0.244,0.192,0.189,0.222,0.172,0.156,0.179,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+34,CARP (640M),Protein language model,0.208,0.006,0.176,0.206,0.212,0.179,0.265,0.174,0.209,0.241,0.215,0.231,0.224,0.18,0.188,0.215,0.164,0.151,0.182,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+35,ESM2 (150M),Protein language model,0.204,0.006,0.168,0.204,0.196,0.151,0.299,0.162,0.195,0.264,0.222,0.249,0.223,0.129,0.186,0.234,0.127,0.14,0.165,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+36,Wavenet,Alignment-based model,0.203,0.007,0.167,0.204,0.19,0.204,0.252,0.175,0.215,0.237,0.203,0.219,0.215,0.243,0.182,0.239,0.18,0.158,0.202,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+37,ESM-1b,Protein language model,0.203,0.007,0.18,0.178,0.197,0.176,0.282,0.161,0.214,0.251,0.209,0.241,0.247,0.169,0.182,0.215,0.165,0.146,0.196,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+38,Progen2 M,Protein language model,0.202,0.006,0.172,0.19,0.221,0.198,0.229,0.177,0.21,0.218,0.204,0.202,0.206,0.224,0.185,0.192,0.151,0.134,0.143,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+39,Progen2 L,Protein language model,0.202,0.007,0.185,0.175,0.223,0.202,0.224,0.193,0.208,0.216,0.204,0.203,0.219,0.214,0.183,0.195,0.183,0.158,0.191,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+40,VESPA,Protein language model,0.201,0.007,0.18,0.192,0.184,0.207,0.242,0.192,0.218,0.22,0.19,0.231,0.223,0.247,0.176,0.207,0.216,0.204,0.248,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+41,Site-Independent,Alignment-based model,0.201,0.007,0.17,0.198,0.179,0.196,0.261,0.209,0.207,0.224,0.202,0.246,0.192,0.245,0.191,0.248,0.173,0.192,0.261,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+42,Progen2 XL,Protein language model,0.199,0.007,0.181,0.179,0.184,0.214,0.237,0.194,0.221,0.215,0.185,0.231,0.231,0.251,0.18,0.219,0.209,0.166,0.215,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+43,Progen2 Base,Protein language model,0.197,0.006,0.183,0.182,0.205,0.198,0.219,0.19,0.206,0.21,0.205,0.195,0.201,0.218,0.183,0.189,0.146,0.14,0.15,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+44,Tranception L no retrieval,Protein language model,0.197,0.008,0.181,0.185,0.191,0.213,0.216,0.199,0.209,0.208,0.191,0.201,0.212,0.253,0.181,0.21,0.204,0.165,0.223,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+45,RITA L,Protein language model,0.195,0.007,0.173,0.178,0.203,0.204,0.215,0.181,0.208,0.204,0.199,0.185,0.194,0.246,0.177,0.195,0.13,0.141,0.149,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+46,ESM-1v (single),Protein language model,0.194,0.007,0.166,0.175,0.204,0.182,0.245,0.151,0.196,0.241,0.195,0.226,0.217,0.178,0.182,0.202,0.162,0.148,0.162,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+47,RITA XL,Protein language model,0.193,0.007,0.17,0.173,0.202,0.202,0.216,0.178,0.212,0.197,0.194,0.188,0.2,0.243,0.175,0.202,0.139,0.147,0.179,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+48,CARP (76M),Protein language model,0.188,0.006,0.155,0.185,0.195,0.152,0.254,0.151,0.181,0.234,0.197,0.22,0.2,0.145,0.176,0.2,0.133,0.142,0.176,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+49,RITA M,Protein language model,0.187,0.008,0.168,0.163,0.199,0.198,0.207,0.179,0.194,0.204,0.194,0.177,0.178,0.25,0.175,0.182,0.126,0.133,0.154,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+50,ProteinMPNN,Inverse folding model,0.186,0.008,0.149,0.146,0.152,0.138,0.342,0.146,0.201,0.271,0.196,0.267,0.227,0.189,0.186,0.242,0.173,0.142,0.22,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+51,ESM2 (35M),Protein language model,0.185,0.006,0.155,0.197,0.181,0.138,0.254,0.153,0.163,0.245,0.197,0.219,0.193,0.124,0.174,0.223,0.127,0.139,0.17,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+52,VESPAl,Protein language model,0.184,0.008,0.172,0.183,0.159,0.194,0.214,0.181,0.197,0.201,0.176,0.216,0.197,0.225,0.155,0.189,0.195,0.188,0.238,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+53,Tranception M no retrieval,Protein language model,0.184,0.008,0.163,0.176,0.19,0.198,0.194,0.178,0.194,0.187,0.188,0.174,0.175,0.234,0.173,0.179,0.124,0.133,0.141,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+54,Progen2 S,Protein language model,0.183,0.007,0.155,0.166,0.204,0.172,0.218,0.144,0.184,0.212,0.196,0.177,0.191,0.17,0.169,0.178,0.129,0.141,0.134,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+55,Unirep evotuned,Hybrid - Alignment & PLM,0.181,0.007,0.163,0.174,0.186,0.188,0.192,0.19,0.188,0.18,0.177,0.182,0.177,0.227,0.158,0.176,0.157,0.152,0.208,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+56,RITA S,Protein language model,0.178,0.007,0.155,0.169,0.194,0.185,0.186,0.177,0.174,0.191,0.181,0.163,0.164,0.226,0.166,0.168,0.114,0.13,0.146,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+57,CARP (38M),Protein language model,0.177,0.006,0.147,0.188,0.177,0.14,0.233,0.148,0.165,0.218,0.185,0.198,0.181,0.141,0.165,0.188,0.131,0.147,0.169,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+58,Tranception S no retrieval,Protein language model,0.177,0.006,0.154,0.188,0.182,0.183,0.177,0.173,0.174,0.183,0.174,0.159,0.171,0.215,0.166,0.166,0.119,0.133,0.134,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+59,ESM2 (8M),Protein language model,0.153,0.007,0.126,0.178,0.162,0.117,0.181,0.141,0.136,0.173,0.155,0.155,0.15,0.12,0.144,0.178,0.123,0.132,0.159,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+60,Unirep,Protein language model,0.139,0.008,0.116,0.138,0.141,0.123,0.176,0.145,0.129,0.164,0.148,0.158,0.143,0.113,0.138,0.158,0.108,0.13,0.142,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+61,ProtGPT2,Protein language model,0.133,0.009,0.113,0.138,0.126,0.121,0.168,0.122,0.136,0.147,0.144,0.156,0.114,0.133,0.125,0.173,0.111,0.112,0.114,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+62,CARP (600K),Protein language model,0.131,0.008,0.116,0.124,0.132,0.106,0.176,0.13,0.122,0.157,0.14,0.148,0.131,0.115,0.131,0.143,0.102,0.125,0.137,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.html b/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.html
new file mode 100644
index 0000000..2b227ad
--- /dev/null
+++ b/benchmarks/DMS_zero_shot/substitutions/Top_recall/Summary_performance_DMS_substitutions_Top_recall.html
@@ -0,0 +1,1670 @@
+
+
+
+ |
+ Model_name |
+ Model type |
+ Average_Top_recall |
+ Bootstrap_standard_error_Top_recall |
+ Function_Activity |
+ Function_Binding |
+ Function_Expression |
+ Function_OrganismalFitness |
+ Function_Stability |
+ Low_MSA_depth |
+ Medium_MSA_depth |
+ High_MSA_depth |
+ Taxa_Human |
+ Taxa_Other_Eukaryote |
+ Taxa_Prokaryote |
+ Taxa_Virus |
+ Depth_1 |
+ Depth_2 |
+ Depth_3 |
+ Depth_4 |
+ Depth_5+ |
+ Model details |
+ References |
+
+
+ Model_rank |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+
+
+
+
+ 1 |
+ SaProt (650M) |
+ Hybrid - Structure & PLM |
+ 0.232 |
+ 0.000 |
+ 0.191 |
+ 0.233 |
+ 0.235 |
+ 0.170 |
+ 0.331 |
+ 0.191 |
+ 0.232 |
+ 0.278 |
+ 0.230 |
+ 0.269 |
+ 0.266 |
+ 0.196 |
+ 0.207 |
+ 0.261 |
+ 0.189 |
+ 0.185 |
+ 0.244 |
+ SaProt (650M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 2 |
+ TranceptEVE S |
+ Hybrid - Alignment & PLM |
+ 0.232 |
+ 0.006 |
+ 0.198 |
+ 0.223 |
+ 0.226 |
+ 0.228 |
+ 0.286 |
+ 0.231 |
+ 0.242 |
+ 0.254 |
+ 0.226 |
+ 0.270 |
+ 0.231 |
+ 0.286 |
+ 0.210 |
+ 0.258 |
+ 0.213 |
+ 0.208 |
+ 0.278 |
+ TranceptEVE Small model (Tranception Small & retrieved EVE model) |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 3 |
+ EVE (single) |
+ Alignment-based model |
+ 0.230 |
+ 0.006 |
+ 0.199 |
+ 0.217 |
+ 0.223 |
+ 0.228 |
+ 0.284 |
+ 0.226 |
+ 0.241 |
+ 0.254 |
+ 0.225 |
+ 0.270 |
+ 0.235 |
+ 0.281 |
+ 0.208 |
+ 0.258 |
+ 0.239 |
+ 0.217 |
+ 0.279 |
+ EVE model (single seed) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 4 |
+ TranceptEVE M |
+ Hybrid - Alignment & PLM |
+ 0.230 |
+ 0.006 |
+ 0.198 |
+ 0.218 |
+ 0.227 |
+ 0.228 |
+ 0.280 |
+ 0.230 |
+ 0.242 |
+ 0.248 |
+ 0.224 |
+ 0.271 |
+ 0.227 |
+ 0.286 |
+ 0.209 |
+ 0.252 |
+ 0.213 |
+ 0.212 |
+ 0.278 |
+ TranceptEVE Medium model (Tranception Medium & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 5 |
+ TranceptEVE L |
+ Hybrid - Alignment & PLM |
+ 0.230 |
+ 0.006 |
+ 0.200 |
+ 0.218 |
+ 0.224 |
+ 0.229 |
+ 0.279 |
+ 0.227 |
+ 0.241 |
+ 0.251 |
+ 0.222 |
+ 0.271 |
+ 0.234 |
+ 0.283 |
+ 0.208 |
+ 0.257 |
+ 0.230 |
+ 0.204 |
+ 0.277 |
+ TranceptEVE Large model (Tranception Large & retrieved EVE model) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a> |
+
+
+ 6 |
+ EVE (ensemble) |
+ Alignment-based model |
+ 0.230 |
+ 0.006 |
+ 0.199 |
+ 0.214 |
+ 0.225 |
+ 0.228 |
+ 0.283 |
+ 0.227 |
+ 0.241 |
+ 0.253 |
+ 0.225 |
+ 0.271 |
+ 0.232 |
+ 0.281 |
+ 0.209 |
+ 0.259 |
+ 0.244 |
+ 0.222 |
+ 0.272 |
+ EVE model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a> |
+
+
+ 7 |
+ ProtSSN (ensemble) |
+ Hybrid - Structure & PLM |
+ 0.227 |
+ 0.005 |
+ 0.191 |
+ 0.196 |
+ 0.220 |
+ 0.191 |
+ 0.337 |
+ 0.192 |
+ 0.239 |
+ 0.284 |
+ 0.228 |
+ 0.278 |
+ 0.274 |
+ 0.215 |
+ 0.214 |
+ 0.248 |
+ 0.210 |
+ 0.186 |
+ 0.232 |
+ ProtSSN (ensemble of 9 models) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 8 |
+ ProtSSN (k=30 h=768) |
+ Hybrid - Structure & PLM |
+ 0.227 |
+ 0.005 |
+ 0.190 |
+ 0.202 |
+ 0.228 |
+ 0.185 |
+ 0.329 |
+ 0.193 |
+ 0.233 |
+ 0.281 |
+ 0.227 |
+ 0.275 |
+ 0.269 |
+ 0.208 |
+ 0.209 |
+ 0.241 |
+ 0.195 |
+ 0.181 |
+ 0.227 |
+ ProtSSN (k=30, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 9 |
+ DeepSequence (ensemble) |
+ Alignment-based model |
+ 0.226 |
+ 0.006 |
+ 0.194 |
+ 0.213 |
+ 0.218 |
+ 0.223 |
+ 0.282 |
+ 0.216 |
+ 0.235 |
+ 0.255 |
+ 0.223 |
+ 0.270 |
+ 0.229 |
+ 0.267 |
+ 0.203 |
+ 0.257 |
+ 0.222 |
+ 0.200 |
+ 0.276 |
+ DeepSequence model (ensemble of 5 independently-trained models) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 10 |
+ ProtSSN (k=20 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.226 |
+ 0.005 |
+ 0.187 |
+ 0.200 |
+ 0.215 |
+ 0.190 |
+ 0.335 |
+ 0.190 |
+ 0.236 |
+ 0.285 |
+ 0.228 |
+ 0.280 |
+ 0.267 |
+ 0.213 |
+ 0.210 |
+ 0.252 |
+ 0.208 |
+ 0.191 |
+ 0.242 |
+ ProtSSN (k=20, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 11 |
+ DeepSequence (single) |
+ Alignment-based model |
+ 0.225 |
+ 0.006 |
+ 0.199 |
+ 0.216 |
+ 0.214 |
+ 0.219 |
+ 0.280 |
+ 0.218 |
+ 0.235 |
+ 0.252 |
+ 0.225 |
+ 0.271 |
+ 0.226 |
+ 0.257 |
+ 0.204 |
+ 0.256 |
+ 0.196 |
+ 0.198 |
+ 0.272 |
+ DeepSequence model (single seed) |
+ <a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a> |
+
+
+ 12 |
+ MIF-ST |
+ Hybrid - Structure & PLM |
+ 0.225 |
+ 0.004 |
+ 0.191 |
+ 0.223 |
+ 0.221 |
+ 0.192 |
+ 0.300 |
+ 0.194 |
+ 0.232 |
+ 0.261 |
+ 0.215 |
+ 0.258 |
+ 0.264 |
+ 0.226 |
+ 0.206 |
+ 0.230 |
+ 0.220 |
+ 0.181 |
+ 0.267 |
+ MIF-ST model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 13 |
+ ProtSSN (k=10 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.225 |
+ 0.005 |
+ 0.187 |
+ 0.195 |
+ 0.220 |
+ 0.189 |
+ 0.335 |
+ 0.189 |
+ 0.237 |
+ 0.282 |
+ 0.229 |
+ 0.270 |
+ 0.269 |
+ 0.214 |
+ 0.212 |
+ 0.232 |
+ 0.183 |
+ 0.164 |
+ 0.216 |
+ ProtSSN (k=10, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 14 |
+ ProtSSN (k=10 h=768) |
+ Hybrid - Structure & PLM |
+ 0.225 |
+ 0.005 |
+ 0.193 |
+ 0.195 |
+ 0.220 |
+ 0.187 |
+ 0.330 |
+ 0.186 |
+ 0.239 |
+ 0.278 |
+ 0.231 |
+ 0.270 |
+ 0.270 |
+ 0.203 |
+ 0.208 |
+ 0.233 |
+ 0.205 |
+ 0.167 |
+ 0.215 |
+ ProtSSN (k=10, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 15 |
+ ProtSSN (k=20 h=512) |
+ Hybrid - Structure & PLM |
+ 0.224 |
+ 0.005 |
+ 0.196 |
+ 0.196 |
+ 0.212 |
+ 0.187 |
+ 0.331 |
+ 0.188 |
+ 0.237 |
+ 0.279 |
+ 0.226 |
+ 0.276 |
+ 0.271 |
+ 0.208 |
+ 0.208 |
+ 0.247 |
+ 0.215 |
+ 0.192 |
+ 0.238 |
+ ProtSSN (k=20, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 16 |
+ MSA Transformer (ensemble) |
+ Hybrid - Alignment & PLM |
+ 0.224 |
+ 0.007 |
+ 0.200 |
+ 0.204 |
+ 0.218 |
+ 0.219 |
+ 0.278 |
+ 0.227 |
+ 0.232 |
+ 0.251 |
+ 0.216 |
+ 0.264 |
+ 0.233 |
+ 0.275 |
+ 0.204 |
+ 0.232 |
+ 0.223 |
+ 0.235 |
+ 0.289 |
+ MSA Transformer (ensemble of 5 MSA samples) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 17 |
+ ProtSSN (k=20 h=768) |
+ Hybrid - Structure & PLM |
+ 0.224 |
+ 0.006 |
+ 0.189 |
+ 0.190 |
+ 0.218 |
+ 0.189 |
+ 0.332 |
+ 0.180 |
+ 0.235 |
+ 0.283 |
+ 0.227 |
+ 0.270 |
+ 0.268 |
+ 0.213 |
+ 0.209 |
+ 0.254 |
+ 0.190 |
+ 0.177 |
+ 0.240 |
+ ProtSSN (k=20, h=768) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 18 |
+ ProtSSN (k=30 h=512) |
+ Hybrid - Structure & PLM |
+ 0.223 |
+ 0.004 |
+ 0.189 |
+ 0.200 |
+ 0.218 |
+ 0.183 |
+ 0.327 |
+ 0.183 |
+ 0.234 |
+ 0.278 |
+ 0.223 |
+ 0.274 |
+ 0.266 |
+ 0.205 |
+ 0.205 |
+ 0.243 |
+ 0.191 |
+ 0.176 |
+ 0.222 |
+ ProtSSN (k=30, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 19 |
+ ProtSSN (k=30 h=1280) |
+ Hybrid - Structure & PLM |
+ 0.223 |
+ 0.005 |
+ 0.187 |
+ 0.199 |
+ 0.218 |
+ 0.189 |
+ 0.321 |
+ 0.195 |
+ 0.229 |
+ 0.276 |
+ 0.225 |
+ 0.266 |
+ 0.265 |
+ 0.207 |
+ 0.208 |
+ 0.248 |
+ 0.194 |
+ 0.178 |
+ 0.226 |
+ ProtSSN (k=30, h=1280) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 20 |
+ ProtSSN (k=10 h=512) |
+ Hybrid - Structure & PLM |
+ 0.222 |
+ 0.005 |
+ 0.188 |
+ 0.198 |
+ 0.216 |
+ 0.184 |
+ 0.325 |
+ 0.192 |
+ 0.230 |
+ 0.277 |
+ 0.224 |
+ 0.273 |
+ 0.269 |
+ 0.195 |
+ 0.205 |
+ 0.244 |
+ 0.192 |
+ 0.166 |
+ 0.214 |
+ ProtSSN (k=10, h=512) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.12.01.569522v1'>Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.</a> |
+
+
+ 21 |
+ ESM-IF1 |
+ Inverse folding model |
+ 0.222 |
+ 0.005 |
+ 0.184 |
+ 0.208 |
+ 0.205 |
+ 0.170 |
+ 0.344 |
+ 0.181 |
+ 0.228 |
+ 0.288 |
+ 0.224 |
+ 0.279 |
+ 0.268 |
+ 0.194 |
+ 0.209 |
+ 0.270 |
+ 0.176 |
+ 0.172 |
+ 0.257 |
+ ESM-IF1 model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a> |
+
+
+ 22 |
+ EVmutation |
+ Alignment-based model |
+ 0.222 |
+ 0.007 |
+ 0.201 |
+ 0.204 |
+ 0.209 |
+ 0.223 |
+ 0.272 |
+ 0.213 |
+ 0.235 |
+ 0.249 |
+ 0.212 |
+ 0.262 |
+ 0.236 |
+ 0.278 |
+ 0.200 |
+ 0.257 |
+ 0.228 |
+ 0.219 |
+ 0.296 |
+ EVmutation model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 23 |
+ Tranception L |
+ Hybrid - Alignment & PLM |
+ 0.220 |
+ 0.007 |
+ 0.200 |
+ 0.205 |
+ 0.210 |
+ 0.221 |
+ 0.265 |
+ 0.226 |
+ 0.232 |
+ 0.239 |
+ 0.216 |
+ 0.259 |
+ 0.223 |
+ 0.271 |
+ 0.202 |
+ 0.251 |
+ 0.208 |
+ 0.197 |
+ 0.268 |
+ Tranception Large model (700M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 24 |
+ Tranception M |
+ Hybrid - Alignment & PLM |
+ 0.218 |
+ 0.006 |
+ 0.188 |
+ 0.208 |
+ 0.211 |
+ 0.210 |
+ 0.271 |
+ 0.227 |
+ 0.228 |
+ 0.231 |
+ 0.218 |
+ 0.260 |
+ 0.200 |
+ 0.270 |
+ 0.199 |
+ 0.246 |
+ 0.164 |
+ 0.190 |
+ 0.261 |
+ Tranception Medium model (300M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 25 |
+ ESM2 (650M) |
+ Protein language model |
+ 0.217 |
+ 0.006 |
+ 0.178 |
+ 0.213 |
+ 0.215 |
+ 0.177 |
+ 0.300 |
+ 0.178 |
+ 0.213 |
+ 0.270 |
+ 0.221 |
+ 0.257 |
+ 0.251 |
+ 0.168 |
+ 0.196 |
+ 0.233 |
+ 0.179 |
+ 0.152 |
+ 0.179 |
+ ESM2 model (650M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 26 |
+ Tranception S |
+ Hybrid - Alignment & PLM |
+ 0.216 |
+ 0.006 |
+ 0.186 |
+ 0.215 |
+ 0.208 |
+ 0.202 |
+ 0.271 |
+ 0.222 |
+ 0.220 |
+ 0.238 |
+ 0.214 |
+ 0.256 |
+ 0.208 |
+ 0.257 |
+ 0.197 |
+ 0.243 |
+ 0.168 |
+ 0.190 |
+ 0.262 |
+ Tranception Small model (85M params) with retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 27 |
+ MIF |
+ Inverse folding model |
+ 0.216 |
+ 0.007 |
+ 0.178 |
+ 0.201 |
+ 0.214 |
+ 0.160 |
+ 0.327 |
+ 0.194 |
+ 0.218 |
+ 0.265 |
+ 0.221 |
+ 0.250 |
+ 0.246 |
+ 0.206 |
+ 0.210 |
+ 0.244 |
+ 0.189 |
+ 0.160 |
+ 0.241 |
+ MIF model |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a> |
+
+
+ 28 |
+ ESM2 (3B) |
+ Protein language model |
+ 0.213 |
+ 0.006 |
+ 0.176 |
+ 0.207 |
+ 0.212 |
+ 0.184 |
+ 0.286 |
+ 0.187 |
+ 0.215 |
+ 0.255 |
+ 0.213 |
+ 0.256 |
+ 0.246 |
+ 0.180 |
+ 0.192 |
+ 0.231 |
+ 0.170 |
+ 0.168 |
+ 0.177 |
+ ESM2 model (3B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 29 |
+ MSA Transformer (single) |
+ Hybrid - Alignment & PLM |
+ 0.213 |
+ 0.007 |
+ 0.190 |
+ 0.194 |
+ 0.201 |
+ 0.219 |
+ 0.259 |
+ 0.214 |
+ 0.225 |
+ 0.237 |
+ 0.206 |
+ 0.258 |
+ 0.221 |
+ 0.260 |
+ 0.192 |
+ 0.216 |
+ 0.230 |
+ 0.234 |
+ 0.290 |
+ MSA Transformer (single MSA sample) |
+ <a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a> |
+
+
+ 30 |
+ GEMME |
+ Alignment-based model |
+ 0.211 |
+ 0.007 |
+ 0.196 |
+ 0.198 |
+ 0.198 |
+ 0.223 |
+ 0.240 |
+ 0.213 |
+ 0.228 |
+ 0.222 |
+ 0.202 |
+ 0.231 |
+ 0.218 |
+ 0.284 |
+ 0.186 |
+ 0.228 |
+ 0.188 |
+ 0.211 |
+ 0.296 |
+ GEMME model |
+ <a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a> |
+
+
+ 31 |
+ ESM-1v (ensemble) |
+ Protein language model |
+ 0.211 |
+ 0.006 |
+ 0.173 |
+ 0.213 |
+ 0.210 |
+ 0.193 |
+ 0.265 |
+ 0.182 |
+ 0.210 |
+ 0.251 |
+ 0.209 |
+ 0.236 |
+ 0.234 |
+ 0.203 |
+ 0.193 |
+ 0.213 |
+ 0.170 |
+ 0.151 |
+ 0.170 |
+ ESM-1v (ensemble of 5 independently-trained models) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 32 |
+ SaProt (35M) |
+ Hybrid - Structure & PLM |
+ 0.211 |
+ 0.005 |
+ 0.168 |
+ 0.198 |
+ 0.225 |
+ 0.152 |
+ 0.311 |
+ 0.172 |
+ 0.209 |
+ 0.260 |
+ 0.218 |
+ 0.261 |
+ 0.237 |
+ 0.147 |
+ 0.203 |
+ 0.274 |
+ 0.150 |
+ 0.166 |
+ 0.186 |
+ SaProt (35M) |
+ <a href='https://www.biorxiv.org/content/10.1101/2023.10.01.560349v5'>Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.</a> |
+
+
+ 33 |
+ ESM2 (15B) |
+ Protein language model |
+ 0.208 |
+ 0.007 |
+ 0.180 |
+ 0.196 |
+ 0.199 |
+ 0.195 |
+ 0.270 |
+ 0.184 |
+ 0.217 |
+ 0.245 |
+ 0.209 |
+ 0.242 |
+ 0.244 |
+ 0.192 |
+ 0.189 |
+ 0.222 |
+ 0.172 |
+ 0.156 |
+ 0.179 |
+ ESM2 model (15B params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 34 |
+ CARP (640M) |
+ Protein language model |
+ 0.208 |
+ 0.006 |
+ 0.176 |
+ 0.206 |
+ 0.212 |
+ 0.179 |
+ 0.265 |
+ 0.174 |
+ 0.209 |
+ 0.241 |
+ 0.215 |
+ 0.231 |
+ 0.224 |
+ 0.180 |
+ 0.188 |
+ 0.215 |
+ 0.164 |
+ 0.151 |
+ 0.182 |
+ CARP model (640M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 35 |
+ ESM2 (150M) |
+ Protein language model |
+ 0.204 |
+ 0.006 |
+ 0.168 |
+ 0.204 |
+ 0.196 |
+ 0.151 |
+ 0.299 |
+ 0.162 |
+ 0.195 |
+ 0.264 |
+ 0.222 |
+ 0.249 |
+ 0.223 |
+ 0.129 |
+ 0.186 |
+ 0.234 |
+ 0.127 |
+ 0.140 |
+ 0.165 |
+ ESM2 model (150M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 36 |
+ Wavenet |
+ Alignment-based model |
+ 0.203 |
+ 0.007 |
+ 0.167 |
+ 0.204 |
+ 0.190 |
+ 0.204 |
+ 0.252 |
+ 0.175 |
+ 0.215 |
+ 0.237 |
+ 0.203 |
+ 0.219 |
+ 0.215 |
+ 0.243 |
+ 0.182 |
+ 0.239 |
+ 0.180 |
+ 0.158 |
+ 0.202 |
+ Wavenet model |
+ <a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a> |
+
+
+ 37 |
+ ESM-1b |
+ Protein language model |
+ 0.203 |
+ 0.007 |
+ 0.180 |
+ 0.178 |
+ 0.197 |
+ 0.176 |
+ 0.282 |
+ 0.161 |
+ 0.214 |
+ 0.251 |
+ 0.209 |
+ 0.241 |
+ 0.247 |
+ 0.169 |
+ 0.182 |
+ 0.215 |
+ 0.165 |
+ 0.146 |
+ 0.196 |
+ ESM-1b (w/ Brandes et al. extensions) |
+ [1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a> |
+
+
+ 38 |
+ Progen2 M |
+ Protein language model |
+ 0.202 |
+ 0.006 |
+ 0.172 |
+ 0.190 |
+ 0.221 |
+ 0.198 |
+ 0.229 |
+ 0.177 |
+ 0.210 |
+ 0.218 |
+ 0.204 |
+ 0.202 |
+ 0.206 |
+ 0.224 |
+ 0.185 |
+ 0.192 |
+ 0.151 |
+ 0.134 |
+ 0.143 |
+ Progen2 medium model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 39 |
+ Progen2 L |
+ Protein language model |
+ 0.202 |
+ 0.007 |
+ 0.185 |
+ 0.175 |
+ 0.223 |
+ 0.202 |
+ 0.224 |
+ 0.193 |
+ 0.208 |
+ 0.216 |
+ 0.204 |
+ 0.203 |
+ 0.219 |
+ 0.214 |
+ 0.183 |
+ 0.195 |
+ 0.183 |
+ 0.158 |
+ 0.191 |
+ Progen2 large model (2.7B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 40 |
+ VESPA |
+ Protein language model |
+ 0.201 |
+ 0.007 |
+ 0.180 |
+ 0.192 |
+ 0.184 |
+ 0.207 |
+ 0.242 |
+ 0.192 |
+ 0.218 |
+ 0.220 |
+ 0.190 |
+ 0.231 |
+ 0.223 |
+ 0.247 |
+ 0.176 |
+ 0.207 |
+ 0.216 |
+ 0.204 |
+ 0.248 |
+ VESPA model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 41 |
+ Site-Independent |
+ Alignment-based model |
+ 0.201 |
+ 0.007 |
+ 0.170 |
+ 0.198 |
+ 0.179 |
+ 0.196 |
+ 0.261 |
+ 0.209 |
+ 0.207 |
+ 0.224 |
+ 0.202 |
+ 0.246 |
+ 0.192 |
+ 0.245 |
+ 0.191 |
+ 0.248 |
+ 0.173 |
+ 0.192 |
+ 0.261 |
+ Site-Independent model |
+ <a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a> |
+
+
+ 42 |
+ Progen2 XL |
+ Protein language model |
+ 0.199 |
+ 0.007 |
+ 0.181 |
+ 0.179 |
+ 0.184 |
+ 0.214 |
+ 0.237 |
+ 0.194 |
+ 0.221 |
+ 0.215 |
+ 0.185 |
+ 0.231 |
+ 0.231 |
+ 0.251 |
+ 0.180 |
+ 0.219 |
+ 0.209 |
+ 0.166 |
+ 0.215 |
+ Progen2 xlarge model (6.4B params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 43 |
+ Progen2 Base |
+ Protein language model |
+ 0.197 |
+ 0.006 |
+ 0.183 |
+ 0.182 |
+ 0.205 |
+ 0.198 |
+ 0.219 |
+ 0.190 |
+ 0.206 |
+ 0.210 |
+ 0.205 |
+ 0.195 |
+ 0.201 |
+ 0.218 |
+ 0.183 |
+ 0.189 |
+ 0.146 |
+ 0.140 |
+ 0.150 |
+ Progen2 base model (760M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 44 |
+ Tranception L no retrieval |
+ Protein language model |
+ 0.197 |
+ 0.008 |
+ 0.181 |
+ 0.185 |
+ 0.191 |
+ 0.213 |
+ 0.216 |
+ 0.199 |
+ 0.209 |
+ 0.208 |
+ 0.191 |
+ 0.201 |
+ 0.212 |
+ 0.253 |
+ 0.181 |
+ 0.210 |
+ 0.204 |
+ 0.165 |
+ 0.223 |
+ Tranception Large model (700M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 45 |
+ RITA L |
+ Protein language model |
+ 0.195 |
+ 0.007 |
+ 0.173 |
+ 0.178 |
+ 0.203 |
+ 0.204 |
+ 0.215 |
+ 0.181 |
+ 0.208 |
+ 0.204 |
+ 0.199 |
+ 0.185 |
+ 0.194 |
+ 0.246 |
+ 0.177 |
+ 0.195 |
+ 0.130 |
+ 0.141 |
+ 0.149 |
+ RITA large model (680M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 46 |
+ ESM-1v (single) |
+ Protein language model |
+ 0.194 |
+ 0.007 |
+ 0.166 |
+ 0.175 |
+ 0.204 |
+ 0.182 |
+ 0.245 |
+ 0.151 |
+ 0.196 |
+ 0.241 |
+ 0.195 |
+ 0.226 |
+ 0.217 |
+ 0.178 |
+ 0.182 |
+ 0.202 |
+ 0.162 |
+ 0.148 |
+ 0.162 |
+ ESM-1v (single seed) |
+ <a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a> |
+
+
+ 47 |
+ RITA XL |
+ Protein language model |
+ 0.193 |
+ 0.007 |
+ 0.170 |
+ 0.173 |
+ 0.202 |
+ 0.202 |
+ 0.216 |
+ 0.178 |
+ 0.212 |
+ 0.197 |
+ 0.194 |
+ 0.188 |
+ 0.200 |
+ 0.243 |
+ 0.175 |
+ 0.202 |
+ 0.139 |
+ 0.147 |
+ 0.179 |
+ RITA xlarge model (1.2B params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 48 |
+ CARP (76M) |
+ Protein language model |
+ 0.188 |
+ 0.006 |
+ 0.155 |
+ 0.185 |
+ 0.195 |
+ 0.152 |
+ 0.254 |
+ 0.151 |
+ 0.181 |
+ 0.234 |
+ 0.197 |
+ 0.220 |
+ 0.200 |
+ 0.145 |
+ 0.176 |
+ 0.200 |
+ 0.133 |
+ 0.142 |
+ 0.176 |
+ CARP model (76M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 49 |
+ RITA M |
+ Protein language model |
+ 0.187 |
+ 0.008 |
+ 0.168 |
+ 0.163 |
+ 0.199 |
+ 0.198 |
+ 0.207 |
+ 0.179 |
+ 0.194 |
+ 0.204 |
+ 0.194 |
+ 0.177 |
+ 0.178 |
+ 0.250 |
+ 0.175 |
+ 0.182 |
+ 0.126 |
+ 0.133 |
+ 0.154 |
+ RITA medium model (300M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 50 |
+ ProteinMPNN |
+ Inverse folding model |
+ 0.186 |
+ 0.008 |
+ 0.149 |
+ 0.146 |
+ 0.152 |
+ 0.138 |
+ 0.342 |
+ 0.146 |
+ 0.201 |
+ 0.271 |
+ 0.196 |
+ 0.267 |
+ 0.227 |
+ 0.189 |
+ 0.186 |
+ 0.242 |
+ 0.173 |
+ 0.142 |
+ 0.220 |
+ ProteinMPNN model |
+ <a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a> |
+
+
+ 51 |
+ ESM2 (35M) |
+ Protein language model |
+ 0.185 |
+ 0.006 |
+ 0.155 |
+ 0.197 |
+ 0.181 |
+ 0.138 |
+ 0.254 |
+ 0.153 |
+ 0.163 |
+ 0.245 |
+ 0.197 |
+ 0.219 |
+ 0.193 |
+ 0.124 |
+ 0.174 |
+ 0.223 |
+ 0.127 |
+ 0.139 |
+ 0.170 |
+ ESM2 model (35M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 52 |
+ VESPAl |
+ Protein language model |
+ 0.184 |
+ 0.008 |
+ 0.172 |
+ 0.183 |
+ 0.159 |
+ 0.194 |
+ 0.214 |
+ 0.181 |
+ 0.197 |
+ 0.201 |
+ 0.176 |
+ 0.216 |
+ 0.197 |
+ 0.225 |
+ 0.155 |
+ 0.189 |
+ 0.195 |
+ 0.188 |
+ 0.238 |
+ VESPAl model |
+ <a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a> |
+
+
+ 53 |
+ Tranception M no retrieval |
+ Protein language model |
+ 0.184 |
+ 0.008 |
+ 0.163 |
+ 0.176 |
+ 0.190 |
+ 0.198 |
+ 0.194 |
+ 0.178 |
+ 0.194 |
+ 0.187 |
+ 0.188 |
+ 0.174 |
+ 0.175 |
+ 0.234 |
+ 0.173 |
+ 0.179 |
+ 0.124 |
+ 0.133 |
+ 0.141 |
+ Tranception Medium model (300M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 54 |
+ Progen2 S |
+ Protein language model |
+ 0.183 |
+ 0.007 |
+ 0.155 |
+ 0.166 |
+ 0.204 |
+ 0.172 |
+ 0.218 |
+ 0.144 |
+ 0.184 |
+ 0.212 |
+ 0.196 |
+ 0.177 |
+ 0.191 |
+ 0.170 |
+ 0.169 |
+ 0.178 |
+ 0.129 |
+ 0.141 |
+ 0.134 |
+ Progen2 small model (150M params) |
+ <a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a> |
+
+
+ 55 |
+ Unirep evotuned |
+ Hybrid - Alignment & PLM |
+ 0.181 |
+ 0.007 |
+ 0.163 |
+ 0.174 |
+ 0.186 |
+ 0.188 |
+ 0.192 |
+ 0.190 |
+ 0.188 |
+ 0.180 |
+ 0.177 |
+ 0.182 |
+ 0.177 |
+ 0.227 |
+ 0.158 |
+ 0.176 |
+ 0.157 |
+ 0.152 |
+ 0.208 |
+ Unirep model w/ evotuning |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 56 |
+ RITA S |
+ Protein language model |
+ 0.178 |
+ 0.007 |
+ 0.155 |
+ 0.169 |
+ 0.194 |
+ 0.185 |
+ 0.186 |
+ 0.177 |
+ 0.174 |
+ 0.191 |
+ 0.181 |
+ 0.163 |
+ 0.164 |
+ 0.226 |
+ 0.166 |
+ 0.168 |
+ 0.114 |
+ 0.130 |
+ 0.146 |
+ RITA small model (85M params) |
+ <a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a> |
+
+
+ 57 |
+ CARP (38M) |
+ Protein language model |
+ 0.177 |
+ 0.006 |
+ 0.147 |
+ 0.188 |
+ 0.177 |
+ 0.140 |
+ 0.233 |
+ 0.148 |
+ 0.165 |
+ 0.218 |
+ 0.185 |
+ 0.198 |
+ 0.181 |
+ 0.141 |
+ 0.165 |
+ 0.188 |
+ 0.131 |
+ 0.147 |
+ 0.169 |
+ CARP model (38M params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+ 58 |
+ Tranception S no retrieval |
+ Protein language model |
+ 0.177 |
+ 0.006 |
+ 0.154 |
+ 0.188 |
+ 0.182 |
+ 0.183 |
+ 0.177 |
+ 0.173 |
+ 0.174 |
+ 0.183 |
+ 0.174 |
+ 0.159 |
+ 0.171 |
+ 0.215 |
+ 0.166 |
+ 0.166 |
+ 0.119 |
+ 0.133 |
+ 0.134 |
+ Tranception Small model (85M params) without retrieval |
+ <a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a> |
+
+
+ 59 |
+ ESM2 (8M) |
+ Protein language model |
+ 0.153 |
+ 0.007 |
+ 0.126 |
+ 0.178 |
+ 0.162 |
+ 0.117 |
+ 0.181 |
+ 0.141 |
+ 0.136 |
+ 0.173 |
+ 0.155 |
+ 0.155 |
+ 0.150 |
+ 0.120 |
+ 0.144 |
+ 0.178 |
+ 0.123 |
+ 0.132 |
+ 0.159 |
+ ESM2 model (8M params) |
+ <a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a> |
+
+
+ 60 |
+ Unirep |
+ Protein language model |
+ 0.139 |
+ 0.008 |
+ 0.116 |
+ 0.138 |
+ 0.141 |
+ 0.123 |
+ 0.176 |
+ 0.145 |
+ 0.129 |
+ 0.164 |
+ 0.148 |
+ 0.158 |
+ 0.143 |
+ 0.113 |
+ 0.138 |
+ 0.158 |
+ 0.108 |
+ 0.130 |
+ 0.142 |
+ Unirep model |
+ <a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a> |
+
+
+ 61 |
+ ProtGPT2 |
+ Protein language model |
+ 0.133 |
+ 0.009 |
+ 0.113 |
+ 0.138 |
+ 0.126 |
+ 0.121 |
+ 0.168 |
+ 0.122 |
+ 0.136 |
+ 0.147 |
+ 0.144 |
+ 0.156 |
+ 0.114 |
+ 0.133 |
+ 0.125 |
+ 0.173 |
+ 0.111 |
+ 0.112 |
+ 0.114 |
+ ProtGPT2 model |
+ <a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a> |
+
+
+ 62 |
+ CARP (600K) |
+ Protein language model |
+ 0.131 |
+ 0.008 |
+ 0.116 |
+ 0.124 |
+ 0.132 |
+ 0.106 |
+ 0.176 |
+ 0.130 |
+ 0.122 |
+ 0.157 |
+ 0.140 |
+ 0.148 |
+ 0.131 |
+ 0.115 |
+ 0.131 |
+ 0.143 |
+ 0.102 |
+ 0.125 |
+ 0.137 |
+ CARP model (600K params) |
+ <a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a> |
+
+
+
\ No newline at end of file
diff --git a/benchmarks/clinical_supervised/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv b/benchmarks/clinical_supervised/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv
new file mode 100644
index 0000000..a818eaa
--- /dev/null
+++ b/benchmarks/clinical_supervised/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv
@@ -0,0 +1,18 @@
+Model_rank,Model_name,Model type,Average_AUC,Bootstrap_standard_error_AUC
+1,ClinPred,Supervised,0.981,0.0
+2,MetaRNN,Supervised,0.977,0.001
+3,BayesDel (addAF),Supervised,0.972,0.001
+4,VEST4,Supervised,0.929,0.003
+5,REVEL,Supervised,0.928,0.003
+6,BayesDel (noAF),Supervised,0.925,0.003
+7,VARITY (R),Supervised,0.921,0.003
+8,VARITY (ER),Supervised,0.918,0.003
+9,gMVP,Supervised,0.914,0.004
+10,CADD,Supervised,0.905,0.003
+11,PolyPhen2 (HVAR),Supervised,0.896,0.003
+12,DEOGEN2,Supervised,0.894,0.004
+13,MPC,Supervised,0.881,0.004
+14,PolyPhen2 (HDIV),Supervised,0.879,0.004
+15,MutationTaster,Supervised,0.816,0.004
+16,DANN,Supervised,0.812,0.004
+17,FATHMM,Supervised,0.723,0.006
diff --git a/benchmarks/clinical_supervised/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv b/benchmarks/clinical_supervised/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv
new file mode 100644
index 0000000..f6a9370
--- /dev/null
+++ b/benchmarks/clinical_supervised/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv
@@ -0,0 +1,2526 @@
+RefSeq ID,MetaRNN,REVEL,ClinPred,gMVP,VEST4,CADD,DEOGEN2,MPC,DANN,FATHMM,MutationTaster,BayesDel (addAF),BayesDel (noAF),VARITY (R),VARITY (ER),PolyPhen2 (HDIV),PolyPhen2 (HVAR)
+NP_001291646.4,1.0,0.993,1.0,1.0,1.0,1.0,1.0,1.0,0.979,0.69,0.986,1.0,0.951,1.0,1.0,0.993,0.993
+NP_001104262.1,0.977,0.954,0.986,0.972,0.933,0.934,,,0.919,0.931,0.841,0.96,0.935,,,0.87,0.903
+NP_004357.3,1.0,0.975,1.0,0.988,0.962,0.95,1.0,0.912,0.95,0.875,0.6,1.0,0.962,0.95,0.962,0.95,0.925
+NP_004270.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055134.2,0.929,0.393,0.893,0.5,0.607,0.321,0.179,,0.357,0.071,0.625,0.786,0.429,0.5,0.464,0.357,0.464
+NP_036342.2,1.0,1.0,1.0,0.667,1.0,0.667,0.333,0.333,0.667,0.333,0.333,1.0,1.0,0.667,0.667,0.667,0.667
+NP_001907.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005912.1,1.0,0.991,0.983,0.957,1.0,0.939,0.974,0.939,0.835,0.87,0.822,0.965,0.957,0.922,0.87,0.874,0.904
+NP_003095.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.917,1.0,1.0,1.0,1.0,0.917,0.833
+NP_060767.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.925,0.775,0.775,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004993.1,1.0,0.75,1.0,0.875,0.875,0.875,0.75,0.5,1.0,0.5,0.875,1.0,0.625,0.875,0.75,0.875,0.75
+NP_004542.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_808807.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,,,1.0,1.0
+NP_001075019.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.167,0.833,1.0,1.0,1.0,0.958,1.0,1.0
+NP_004484.1,1.0,0.975,0.975,,1.0,0.95,,,0.675,0.5,0.625,1.0,0.95,,,1.0,1.0
+NP_079033.4,0.991,0.989,0.989,0.995,0.989,0.899,0.948,0.96,0.693,0.84,0.927,0.995,0.993,0.995,0.997,0.944,0.979
+NP_000371.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055040.2,0.934,0.901,0.973,0.885,0.89,0.879,0.962,0.967,0.731,0.415,0.577,0.956,0.929,0.945,0.94,0.885,0.912
+NP_056110.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.593,0.815,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_149975.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0
+NP_004665.2,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0
+NP_036382.2,1.0,1.0,1.0,1.0,0.625,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004105.1,1.0,1.0,1.0,,1.0,0.0,1.0,,0.0,1.0,0.5,1.0,1.0,1.0,0.0,1.0,1.0
+NP_066360.1,0.656,0.531,0.688,0.406,0.75,0.672,0.719,0.688,0.688,0.016,0.594,0.625,0.531,0.625,0.594,0.797,0.656
+NP_001098717.1,1.0,,1.0,,0.75,0.5,0.5,,0.75,0.833,0.708,1.0,1.0,0.5,0.5,0.792,0.917
+NP_055066.1,0.975,0.975,0.975,,0.95,1.0,0.95,,0.975,0.75,0.725,0.975,0.95,0.925,0.925,0.95,0.975
+NP_000008.1,0.983,0.992,1.0,0.933,0.975,0.925,0.917,0.933,0.933,0.658,0.917,0.983,0.858,0.933,0.858,0.992,0.992
+NP_000523.2,0.991,0.861,0.935,,0.958,0.877,0.782,,0.852,0.715,0.583,0.921,0.792,0.931,0.949,0.907,0.889
+NP_057021.2,1.0,1.0,1.0,1.0,1.0,0.683,0.933,0.867,0.733,0.967,0.6,1.0,1.0,0.8,0.8,0.9,0.933
+NP_001243423.1,0.0,,1.0,,1.0,1.0,,,1.0,0.0,1.0,1.0,1.0,,,,
+NP_477352.3,0.667,0.667,0.667,0.889,1.0,0.778,0.667,0.667,0.889,0.333,0.833,0.667,0.667,0.778,1.0,,
+NP_694984.5,0.929,0.905,0.929,0.857,0.952,0.94,0.905,0.679,0.905,0.405,0.762,0.857,0.81,0.905,0.952,0.857,0.952
+NP_777362.1,1.0,,1.0,,,1.0,0.5,,0.667,,0.833,1.0,0.889,1.0,1.0,0.889,0.944
+NP_291028.3,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001106279.3,1.0,0.981,1.0,,0.963,0.759,,,0.741,0.778,0.731,1.0,0.926,,,,
+NP_004586.2,1.0,1.0,1.0,1.0,1.0,0.962,0.923,0.962,0.577,0.635,0.712,1.0,1.0,0.962,0.962,1.0,0.962
+NP_005622.1,1.0,1.0,1.0,1.0,1.0,1.0,0.958,0.875,0.75,1.0,0.917,1.0,1.0,1.0,1.0,0.896,0.875
+NP_064551.3,0.993,0.958,0.979,0.888,0.937,0.864,0.923,0.902,0.86,0.521,0.864,0.986,0.951,0.937,0.93,0.92,0.916
+NP_061947.1,1.0,,1.0,,,1.0,1.0,,0.818,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000209.2,1.0,0.994,1.0,0.995,0.996,0.992,0.99,0.987,0.957,0.347,0.993,0.997,0.995,0.995,0.995,0.99,0.996
+NP_078925.3,1.0,0.844,0.906,0.812,1.0,0.906,0.719,0.5,0.156,0.625,0.5,1.0,0.781,0.812,0.75,0.547,0.703
+NP_001843.1,1.0,1.0,1.0,1.0,1.0,0.947,0.895,1.0,0.684,0.947,0.974,1.0,1.0,1.0,1.0,0.895,0.974
+NP_006386.1,0.5,1.0,0.5,,0.5,1.0,,,1.0,1.0,0.5,1.0,1.0,,,0.5,0.5
+NP_000276.2,1.0,0.846,1.0,,0.942,0.788,0.981,0.798,0.769,0.894,0.692,1.0,0.827,0.981,0.981,0.788,0.827
+NP_001689.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_919255.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057303.2,1.0,1.0,1.0,1.0,1.0,0.979,1.0,0.979,0.958,0.958,0.917,1.0,1.0,1.0,1.0,0.958,1.0
+NP_149100.2,1.0,1.0,1.0,0.974,0.987,0.961,0.87,0.961,0.792,0.714,0.987,1.0,1.0,0.987,0.974,0.909,0.87
+NP_079353.3,1.0,0.95,1.0,0.9,0.95,0.9,1.0,0.95,0.8,0.0,0.65,0.95,0.95,0.95,0.95,0.8,0.8
+NP_004861.2,1.0,0.8,1.0,0.8,0.75,0.833,0.7,0.8,0.333,0.75,0.75,1.0,0.833,0.7,0.8,0.5,0.5
+NP_940988.2,1.0,1.0,1.0,1.0,1.0,0.95,0.95,1.0,0.7,0.6,1.0,1.0,0.95,1.0,0.95,1.0,1.0
+NP_057090.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,0.9,0.8
+NP_003014.3,1.0,0.992,1.0,,0.706,0.63,0.849,,0.664,1.0,0.987,1.0,0.966,0.975,0.983,0.849,0.899
+NP_963883.2,0.5,0.5,0.5,0.0,0.0,0.5,0.5,,0.5,1.0,0.0,0.5,0.5,0.5,0.5,0.0,0.0
+NP_005452.2,1.0,1.0,1.0,1.0,1.0,0.967,1.0,,1.0,1.0,0.983,1.0,1.0,1.0,0.967,1.0,0.967
+NP_201569.1,1.0,1.0,1.0,1.0,1.0,0.962,,1.0,0.904,0.923,0.615,1.0,1.0,,,1.0,1.0
+NP_055862.1,1.0,1.0,1.0,0.875,1.0,1.0,0.958,0.896,0.792,0.792,0.969,1.0,0.958,0.958,1.0,1.0,1.0
+NP_005111.2,0.969,0.913,0.995,1.0,0.929,0.943,0.714,0.607,0.872,0.612,0.906,0.964,0.941,1.0,1.0,0.908,0.932
+NP_060289.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001004127.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000007.1,0.571,0.726,0.882,,0.744,0.688,0.688,,0.847,0.359,0.441,0.694,0.794,0.5,0.512,0.559,0.532
+NP_000017.1,0.85,,0.96,0.638,0.725,0.7,0.69,,0.73,,0.74,0.88,0.82,0.712,0.75,0.706,0.675
+NP_015556.1,0.943,0.924,0.919,0.867,0.962,0.883,0.871,0.767,0.781,0.712,0.683,0.962,0.943,0.905,0.91,0.886,0.888
+NP_115909.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061929.2,0.917,0.833,0.917,0.5,0.917,0.833,1.0,0.5,0.833,,0.792,1.0,1.0,1.0,1.0,0.833,0.833
+NP_005778.1,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,0.8,0.45,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079375.3,0.971,0.867,0.943,0.686,0.905,0.914,0.743,0.762,0.933,0.71,0.876,0.962,0.895,0.676,0.638,0.876,0.843
+NP_550438.1,1.0,1.0,1.0,0.875,1.0,0.833,1.0,0.875,0.917,0.25,0.667,1.0,0.917,1.0,1.0,0.688,0.812
+NP_004971.2,1.0,1.0,1.0,,1.0,0.969,,,0.938,1.0,0.875,1.0,1.0,,,,
+NP_036413.1,1.0,1.0,1.0,1.0,1.0,0.741,1.0,0.926,0.667,1.0,0.889,1.0,1.0,0.944,0.944,0.88,0.926
+NP_078846.2,1.0,0.929,0.929,0.964,1.0,0.821,0.857,0.786,0.679,0.821,0.571,0.964,0.964,0.821,0.75,0.821,0.786
+NP_064632.2,1.0,0.978,1.0,0.991,0.996,0.959,0.983,0.996,0.896,0.515,0.818,0.952,0.97,1.0,1.0,0.974,0.983
+NP_001275880.1,1.0,1.0,1.0,1.0,1.0,0.956,0.923,1.0,0.947,0.895,0.868,1.0,1.0,1.0,1.0,0.962,1.0
+NP_001351.2,1.0,1.0,1.0,,0.994,1.0,0.997,,0.945,0.884,0.997,1.0,0.997,0.991,0.994,0.997,1.0
+NP_066264.4,1.0,0.99,1.0,,0.92,0.842,,,0.683,0.772,0.84,1.0,0.973,,,,
+NP_115733.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.667,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005042.1,1.0,1.0,1.0,0.722,1.0,0.889,0.889,1.0,0.778,0.889,0.833,1.0,0.944,0.944,0.944,0.917,1.0
+NP_000775.1,1.0,0.994,1.0,0.982,0.959,0.977,0.93,0.868,0.883,0.868,0.898,1.0,0.994,0.936,0.936,0.842,0.886
+NP_060272.3,1.0,0.935,1.0,1.0,0.957,0.913,0.913,1.0,0.674,0.511,0.978,0.978,0.957,0.957,0.978,0.902,0.913
+NP_000427.1,1.0,1.0,1.0,1.0,0.938,1.0,0.938,1.0,1.0,0.438,0.75,1.0,0.938,1.0,0.75,1.0,1.0
+NP_063940.1,1.0,0.75,1.0,0.5,0.75,1.0,0.75,0.75,1.0,0.5,0.5,1.0,0.75,0.75,0.75,0.875,1.0
+NP_002992.1,1.0,1.0,1.0,1.0,1.0,0.889,0.889,1.0,0.778,0.889,0.444,1.0,1.0,0.889,0.889,0.778,0.889
+NP_001077083.1,1.0,0.908,1.0,,0.985,0.938,0.985,0.985,0.954,0.254,0.938,1.0,0.938,0.985,0.969,0.938,0.954
+NP_573566.2,1.0,1.0,1.0,0.979,0.979,0.968,1.0,0.979,0.884,0.589,0.932,1.0,1.0,0.979,0.979,0.963,0.932
+NP_001944.1,1.0,1.0,1.0,,0.921,0.993,0.938,,0.948,0.876,0.838,1.0,0.976,1.0,0.986,1.0,1.0
+NP_004541.1,1.0,1.0,1.0,1.0,1.0,1.0,0.85,0.95,1.0,0.775,0.75,1.0,1.0,0.9,0.9,1.0,1.0
+NP_001368914.1,1.0,1.0,1.0,,1.0,1.0,0.917,,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004083.3,1.0,0.989,1.0,1.0,1.0,1.0,0.967,0.944,0.989,0.622,0.967,1.0,1.0,1.0,1.0,0.967,0.956
+NP_057457.1,0.954,,1.0,0.657,0.764,0.917,0.657,,0.676,0.37,0.431,0.889,0.657,0.889,0.889,0.75,0.741
+NP_004324.2,0.994,,0.98,0.985,,0.92,,,0.731,,0.892,0.998,0.98,,,,
+NP_683695.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_076984.2,1.0,1.0,1.0,0.958,1.0,0.833,0.917,0.917,0.792,0.833,0.917,1.0,1.0,1.0,0.958,0.896,0.896
+NP_001017420.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000166.2,1.0,,1.0,,,0.944,0.833,,0.889,1.0,0.792,1.0,0.861,0.375,0.542,0.875,0.667
+NP_004332.2,1.0,0.542,1.0,0.625,0.792,0.792,0.542,0.708,0.75,0.417,0.708,0.958,0.625,0.833,0.792,0.625,0.5
+NP_001374.4,,,,,,,,,,,,,,,,,
+NP_001366127.1,0.962,0.904,0.988,0.985,0.981,0.984,,,0.751,0.884,0.464,0.949,0.923,,,,
+NP_000182.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115693.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005393.2,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.806,0.667,1.0,0.5,0.944,0.944,0.722,0.667,1.0,1.0
+NP_001189472.1,1.0,0.833,1.0,,0.833,0.769,,,0.769,0.125,0.231,1.0,1.0,,,0.833,0.833
+NP_056082.2,1.0,1.0,0.944,1.0,0.944,1.0,1.0,1.0,1.0,0.944,0.944,1.0,1.0,1.0,1.0,0.972,1.0
+NP_077310.1,1.0,0.941,1.0,0.824,0.941,1.0,0.647,0.765,0.941,0.412,0.882,1.0,0.941,0.824,0.765,0.912,0.824
+NP_877437.2,1.0,1.0,1.0,0.988,1.0,1.0,0.988,0.94,0.893,1.0,0.917,1.0,0.976,0.988,1.0,0.94,0.958
+NP_116145.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056241.3,1.0,1.0,1.0,1.0,1.0,0.714,0.929,1.0,0.786,0.857,0.857,1.0,0.929,0.929,0.929,0.714,0.857
+NP_004563.2,1.0,0.957,1.0,0.929,0.979,0.982,0.857,0.976,0.971,0.4,0.957,1.0,0.979,0.952,0.929,0.971,0.964
+NP_005253.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.8,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_940916.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000432.1,0.999,0.912,0.994,0.852,0.919,0.877,0.889,0.865,0.882,0.652,0.868,0.99,0.942,0.935,0.938,0.915,0.91
+NP_000294.1,1.0,0.991,1.0,0.991,0.991,0.922,0.862,1.0,0.879,0.793,0.642,1.0,1.0,0.983,0.957,1.0,0.974
+NP_001308971.2,1.0,1.0,1.0,0.971,0.99,1.0,1.0,,0.942,0.885,0.976,1.0,1.0,1.0,1.0,0.973,0.967
+NP_001035526.1,1.0,0.9,1.0,0.95,1.0,0.85,0.85,1.0,0.85,0.55,0.875,1.0,1.0,0.95,0.95,0.9,0.95
+NP_004555.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000149.4,0.975,0.971,1.0,0.983,0.975,0.945,0.79,0.924,0.824,0.779,0.891,0.992,0.954,0.983,0.966,0.899,0.88
+NP_037379.1,1.0,0.75,1.0,0.5,0.75,1.0,0.75,0.5,1.0,0.75,0.875,1.0,0.75,0.75,0.75,0.75,0.75
+NP_002426.1,1.0,0.906,1.0,0.969,0.719,0.906,0.906,0.812,0.844,0.938,0.625,0.906,0.875,0.906,0.906,0.781,0.875
+NP_001376.1,1.0,1.0,1.0,1.0,1.0,0.9,0.95,1.0,0.85,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078807.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.458,1.0,1.0,1.0,1.0,1.0,0.938,1.0
+NP_003898.2,1.0,1.0,0.992,1.0,1.0,0.975,0.958,0.985,0.985,0.481,0.936,0.992,1.0,0.977,0.955,0.943,0.949
+NP_892003.2,1.0,,1.0,,0.833,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001157750.1,1.0,,,1.0,1.0,1.0,0.98,1.0,0.98,,,1.0,0.987,,,,
+NP_658986.1,0.933,0.833,0.9,0.733,0.733,0.633,0.5,,0.733,0.567,0.5,0.933,0.867,0.667,0.6,0.633,0.733
+NP_006006.3,0.978,0.962,0.972,0.964,0.97,0.912,0.883,0.812,0.695,0.94,0.771,0.994,0.985,0.972,0.968,0.844,0.879
+NP_055006.1,0.914,0.915,0.903,0.938,0.896,0.806,0.856,0.801,0.584,0.863,0.726,0.902,0.888,0.87,0.873,0.701,0.768
+NP_001076585.1,1.0,0.947,1.0,,1.0,0.988,0.895,,0.765,0.211,0.926,1.0,0.975,1.0,0.895,0.789,0.947
+NP_872282.1,1.0,1.0,1.0,1.0,0.917,1.0,1.0,0.75,0.917,1.0,0.583,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001001563.2,1.0,,1.0,,0.933,1.0,1.0,,1.0,,0.833,0.933,0.867,0.867,1.0,1.0,1.0
+NP_689629.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.4,0.6,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061982.3,1.0,1.0,1.0,0.844,0.964,0.969,0.951,0.897,0.987,0.545,0.902,1.0,1.0,0.996,0.991,0.969,0.964
+NP_000439.2,0.996,0.992,0.993,0.936,0.986,0.969,0.958,0.931,0.871,0.801,0.984,1.0,0.994,0.964,0.986,0.917,0.878
+NP_000532.2,1.0,1.0,1.0,,,1.0,,,0.75,0.75,0.875,1.0,1.0,,,,
+NP_057049.5,1.0,0.0,1.0,1.0,1.0,0.0,0.0,0.0,1.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0
+NP_258412.1,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,0.75,,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060297.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003841.1,1.0,,1.0,,,1.0,0.956,,0.922,,0.65,1.0,0.967,1.0,1.0,1.0,1.0
+NP_001294.2,1.0,,1.0,,1.0,0.9,1.0,,0.9,1.0,0.55,1.0,1.0,,,,
+NP_976035.1,1.0,0.985,1.0,0.985,1.0,0.985,0.862,0.892,0.785,0.6,0.969,1.0,1.0,1.0,1.0,0.938,0.938
+NP_001091738.1,1.0,0.982,1.0,,0.991,1.0,0.973,,0.864,0.818,0.927,1.0,0.982,1.0,0.991,1.0,1.0
+NP_037451.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.8,0.8,1.0,1.0,1.0,1.0
+NP_001243371.1,1.0,1.0,1.0,,0.992,0.995,0.987,,0.76,0.896,0.865,1.0,0.973,1.0,1.0,0.877,0.954
+NP_004784.2,0.953,0.734,0.945,0.867,0.82,0.812,0.828,0.875,0.625,0.551,0.711,0.836,0.672,0.93,0.922,0.859,0.855
+NP_065796.2,0.992,0.947,0.988,0.903,0.899,0.846,0.895,0.83,0.765,0.943,0.636,0.984,0.935,0.883,0.883,0.791,0.822
+NP_003468.2,1.0,1.0,1.0,1.0,0.85,0.8,0.9,0.8,0.9,1.0,0.85,0.9,0.9,0.85,0.95,0.9,0.95
+NP_001002755.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003163.1,1.0,0.958,1.0,0.917,0.917,0.958,0.917,0.917,0.917,0.583,1.0,1.0,0.958,0.958,1.0,0.917,0.958
+NP_938149.2,1.0,1.0,1.0,0.95,1.0,1.0,,0.85,1.0,0.775,0.625,1.0,1.0,,,1.0,1.0
+NP_056451.3,0.933,1.0,1.0,,0.867,1.0,,,0.667,0.667,1.0,1.0,1.0,,,0.8,0.8
+NP_073741.3,1.0,1.0,1.0,,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0,1.0,1.0,,
+NP_001035963.1,0.96,0.978,0.978,,0.964,0.918,0.975,0.927,0.847,0.795,0.651,0.971,0.967,0.989,0.985,0.92,0.951
+NP_001077430.1,1.0,1.0,1.0,0.983,1.0,0.978,,0.965,0.926,0.65,1.0,0.991,0.987,,,0.957,0.957
+NP_000052.1,0.993,1.0,0.997,,0.988,0.982,0.977,1.0,0.866,1.0,0.804,0.992,0.987,1.0,1.0,1.0,1.0
+NP_001202.5,1.0,1.0,1.0,0.947,0.982,1.0,1.0,0.982,0.912,0.807,0.868,1.0,1.0,0.982,0.982,0.965,0.982
+NP_612486.2,1.0,0.982,1.0,0.964,0.964,0.893,0.875,0.83,0.777,0.737,0.955,1.0,1.0,0.964,0.955,0.955,0.982
+NP_000048.1,1.0,1.0,1.0,1.0,1.0,1.0,0.991,1.0,0.866,0.723,0.893,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001120.3,1.0,1.0,1.0,1.0,1.0,1.0,0.944,1.0,0.944,0.222,1.0,1.0,1.0,1.0,1.0,0.944,0.944
+NP_001909.4,0.995,1.0,1.0,0.981,1.0,0.964,,0.971,0.899,0.933,0.885,1.0,0.995,,,,
+NP_001166939.1,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.75,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002486.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079005.3,0.977,0.825,1.0,,0.919,0.663,0.625,,0.767,0.867,0.831,1.0,0.977,0.65,0.625,0.593,0.64
+NP_056303.3,0.933,0.733,0.956,0.8,0.8,0.889,0.533,0.733,0.778,0.4,0.378,0.978,0.844,0.8,0.8,0.8,0.756
+NP_000503.1,0.998,0.965,0.986,0.954,0.958,0.877,0.855,0.913,0.754,0.669,0.873,0.993,0.94,0.931,0.928,0.891,0.91
+NP_002991.2,1.0,0.987,0.995,0.992,0.995,0.985,0.987,0.985,0.894,0.689,0.826,0.992,0.985,0.987,0.985,0.985,0.992
+NP_006259.1,1.0,1.0,1.0,,1.0,0.938,0.958,1.0,0.562,0.625,1.0,0.854,0.812,1.0,1.0,0.896,0.896
+NP_055975.1,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.8,0.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_064716.2,1.0,1.0,1.0,0.974,1.0,1.0,0.987,1.0,0.923,0.737,1.0,1.0,1.0,0.987,0.987,1.0,1.0
+NP_005205.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001063.2,1.0,1.0,1.0,,,1.0,1.0,,1.0,0.818,1.0,1.0,1.0,,,,
+NP_002632.1,0.976,0.956,0.984,0.893,0.948,0.915,0.901,0.921,0.905,0.71,0.861,0.94,0.917,0.889,0.897,0.935,0.909
+NP_036546.2,1.0,1.0,1.0,0.933,0.933,0.967,1.0,1.0,0.967,0.933,1.0,1.0,0.867,1.0,1.0,1.0,1.0
+NP_005493.2,1.0,0.996,0.972,0.948,0.972,0.982,0.968,0.825,0.889,0.756,0.883,0.865,0.813,0.964,0.96,0.938,0.96
+NP_057688.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.897,1.0,1.0,0.808,1.0,1.0,1.0,1.0,1.0,0.974
+NP_054745.1,0.9,0.9,0.8,1.0,0.6,0.6,0.7,0.7,1.0,0.5,0.7,0.7,0.7,0.6,0.7,0.7,0.8
+NP_000183.2,0.969,0.948,0.995,0.928,0.918,0.856,0.863,0.93,0.704,0.886,0.769,0.959,0.894,0.913,0.906,0.847,0.877
+NP_001275676.1,0.923,,1.0,,0.712,0.558,,,0.558,,0.769,0.942,0.846,,,,
+NP_003679.2,0.774,0.833,0.911,0.633,,0.752,,,0.752,,0.5,0.863,0.759,,,0.681,0.672
+NP_112180.4,1.0,1.0,1.0,,0.75,1.0,,1.0,1.0,1.0,0.75,1.0,1.0,,,1.0,1.0
+NP_777576.1,1.0,1.0,1.0,0.977,1.0,0.955,0.932,1.0,0.75,0.955,0.705,1.0,1.0,1.0,1.0,0.955,1.0
+NP_476517.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061154.1,1.0,1.0,1.0,0.958,1.0,1.0,1.0,1.0,0.942,,0.462,1.0,1.0,1.0,1.0,0.625,0.688
+NP_055180.1,1.0,0.95,1.0,0.8,1.0,0.875,0.95,0.9,0.65,1.0,1.0,1.0,0.9,0.9,0.9,0.8,0.85
+NP_000354.4,1.0,0.826,1.0,,0.992,0.769,0.811,0.894,0.773,0.784,0.682,1.0,0.879,0.939,0.917,0.701,0.678
+NP_006070.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003968.3,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.5,1.0,1.0,1.0,,,,
+NP_078974.1,1.0,1.0,1.0,0.75,1.0,1.0,1.0,0.75,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0
+NP_055331.1,1.0,1.0,1.0,,1.0,0.921,1.0,,0.794,0.857,0.984,1.0,1.0,1.0,1.0,0.968,0.984
+NP_004512.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004513.1,1.0,0.969,1.0,1.0,0.938,0.938,1.0,0.906,1.0,0.625,0.875,0.969,0.906,1.0,1.0,0.953,0.875
+NP_000498.2,1.0,1.0,1.0,,0.983,0.992,1.0,,1.0,0.95,0.892,1.0,0.967,0.967,0.917,1.0,1.0
+NP_000378.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079495.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.833,0.833,1.0,1.0,0.833,0.833
+NP_005850.1,0.925,0.877,0.928,0.934,0.937,0.904,0.913,0.934,0.811,0.479,0.808,0.88,0.871,0.907,0.916,0.892,0.875
+NP_079469.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,0.917,1.0,1.0,,,,
+NP_000476.1,1.0,0.857,1.0,1.0,0.952,0.857,0.905,0.81,0.762,0.143,0.643,1.0,0.905,1.0,1.0,1.0,1.0
+NP_067638.3,0.95,0.77,0.973,0.805,0.93,0.853,0.687,0.834,0.75,0.564,0.74,0.994,0.889,0.775,0.765,0.823,0.809
+NP_001014437.1,1.0,0.833,1.0,1.0,0.917,0.917,,0.917,1.0,0.5,0.917,1.0,0.917,,,,
+NP_055915.2,0.982,0.862,0.976,0.803,0.982,0.913,0.946,0.889,0.853,0.565,0.79,0.94,0.919,0.876,0.889,0.842,0.838
+NP_000381.1,0.931,0.69,1.0,0.862,0.776,0.922,0.879,0.759,0.759,0.448,0.672,0.879,0.759,0.862,0.879,0.793,0.845
+NP_078805.3,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_054733.2,0.968,0.937,0.942,0.868,0.934,0.879,0.858,0.816,0.853,0.584,0.553,0.974,0.916,0.963,0.932,0.958,0.979
+NP_001867.2,0.995,0.952,1.0,,0.952,0.963,,,0.904,0.775,0.872,1.0,0.968,,,1.0,0.989
+NP_001129524.1,1.0,1.0,1.0,,0.923,0.846,,,0.692,1.0,0.846,1.0,1.0,,,0.923,1.0
+NP_004691.2,0.993,,0.987,,,0.883,0.967,,0.76,,0.8,0.987,0.973,0.973,0.98,0.87,0.92
+NP_001128715.1,0.993,,1.0,,,1.0,,,0.732,,0.972,1.0,1.0,,,,
+NP_000361.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079541.1,1.0,1.0,1.0,,,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,,,,
+NP_001121775.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055112.2,1.0,1.0,1.0,0.875,1.0,1.0,0.917,1.0,1.0,0.958,0.708,1.0,1.0,1.0,1.0,1.0,1.0
+NP_570824.1,0.88,0.889,0.741,0.852,0.852,0.718,0.917,0.852,0.5,,0.5,0.944,0.926,0.917,0.917,0.713,0.769
+NP_003989.2,1.0,1.0,1.0,0.933,0.933,0.967,,1.0,0.833,0.783,0.95,0.983,0.967,,,0.942,0.983
+NP_001020774.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071398.5,,,,,,,,,,,,,,,,,
+NP_000332.2,1.0,1.0,0.955,1.0,1.0,1.0,0.818,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_808880.1,1.0,1.0,1.0,1.0,0.667,0.667,1.0,0.667,1.0,0.667,1.0,1.0,1.0,1.0,1.0,0.833,1.0
+NP_005632.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.5,1.0,0.5,1.0,1.0,1.0,1.0,0.25,0.75
+NP_008872.1,1.0,1.0,0.984,1.0,0.984,0.984,1.0,0.952,0.829,0.716,0.5,1.0,0.996,1.0,1.0,0.956,0.988
+NP_073752.6,0.95,,1.0,1.0,1.0,0.85,0.9,,0.8,1.0,0.9,1.0,1.0,0.85,1.0,0.9,0.9
+NP_001307594.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.333,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000277.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001440.2,1.0,,1.0,,1.0,0.875,,,0.0,,0.729,1.0,1.0,,,,
+NP_001171582.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000178.2,1.0,0.944,1.0,1.0,0.987,0.946,0.97,0.987,0.857,0.89,0.697,1.0,0.987,0.97,0.961,0.996,0.987
+NP_001119580.2,1.0,0.989,0.997,0.983,0.962,0.939,0.905,0.978,0.943,0.75,0.913,0.997,0.988,0.968,0.968,0.941,0.967
+NP_001976.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.75,0.875,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001244074.1,0.833,0.881,1.0,,0.929,0.845,,,0.81,0.548,0.714,0.976,0.881,,,0.867,0.667
+NP_647593.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.625,1.0,1.0,1.0,,,1.0,1.0
+NP_038472.2,1.0,0.944,1.0,0.611,1.0,0.944,0.611,0.833,0.778,1.0,0.778,1.0,1.0,0.944,0.833,0.889,0.778
+NP_002258.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057126.2,1.0,1.0,1.0,1.0,1.0,0.905,0.952,0.81,0.762,1.0,0.667,1.0,1.0,1.0,1.0,0.762,0.81
+NP_060420.2,0.923,0.462,0.731,,0.269,0.5,,,0.0,0.231,0.288,0.769,0.077,,,,
+NP_005901.2,1.0,0.986,1.0,,0.972,1.0,,,0.931,0.931,0.854,1.0,1.0,,,0.896,0.931
+NP_689549.3,0.75,,1.0,1.0,,1.0,,1.0,1.0,,1.0,1.0,1.0,,,,
+NP_004067.1,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,0.5,0.667,0.667,1.0,1.0,1.0,1.0,0.833,0.667
+NP_008909.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.833,1.0,1.0,1.0,1.0,0.917,1.0
+NP_001905.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006281.1,1.0,,1.0,1.0,1.0,1.0,0.96,,0.6,,0.92,0.96,0.92,1.0,1.0,0.94,1.0
+NP_002100.2,0.944,0.833,0.972,,0.833,0.889,1.0,,0.667,0.667,0.792,0.972,0.861,1.0,1.0,0.944,0.889
+NP_060496.2,0.991,0.954,0.991,0.917,0.963,0.944,0.87,0.648,0.889,0.639,0.907,0.972,0.963,0.954,0.889,0.968,0.949
+NP_065979.1,0.88,0.92,0.99,0.83,0.83,0.81,0.94,0.875,0.69,0.635,0.8,0.9,0.79,0.96,0.95,0.796,0.832
+NP_001094383.2,0.722,0.709,0.772,,0.823,0.943,0.775,,0.987,0.143,0.69,0.835,0.785,0.75,0.8,0.506,0.38
+NP_064502.9,0.973,0.853,1.0,0.76,0.827,0.833,0.667,0.733,0.853,0.6,0.8,0.96,0.827,0.853,0.813,0.893,0.84
+NP_001258928.1,1.0,1.0,1.0,1.0,1.0,1.0,,,0.444,0.444,0.722,1.0,1.0,,,1.0,1.0
+NP_000533.4,1.0,1.0,1.0,,1.0,1.0,0.889,,0.75,,1.0,1.0,0.972,0.972,1.0,0.944,0.944
+NP_004517.2,1.0,0.929,1.0,0.929,0.857,0.643,0.5,0.429,0.214,0.643,0.714,1.0,1.0,1.0,0.929,0.214,0.286
+NP_000014.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.757,0.693,1.0,0.986,1.0,1.0,1.0,0.986
+NP_004975.2,0.953,0.947,0.957,0.952,0.939,0.901,0.914,0.962,0.699,0.68,0.858,0.971,0.937,0.955,0.957,0.946,0.951
+NP_055612.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.25,0.875,0.875,1.0,1.0,1.0,0.875,0.875,0.875
+NP_060247.2,1.0,1.0,1.0,1.0,1.0,0.8,0.8,1.0,0.8,0.8,1.0,1.0,1.0,1.0,1.0,0.8,0.8
+NP_085055.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.833,1.0,1.0,1.0,1.0,0.917,1.0
+NP_653259.3,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.5,1.0,1.0,1.0,,,1.0,1.0
+NP_000144.2,1.0,0.998,1.0,0.976,0.998,0.957,0.945,0.909,0.742,0.668,0.868,1.0,0.988,1.0,0.988,0.915,0.925
+NP_006614.2,1.0,,0.889,0.778,,0.667,0.778,,0.889,,0.5,1.0,0.778,0.556,0.444,0.722,0.667
+NP_068587.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.75,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001153705.1,1.0,0.962,1.0,,0.962,0.875,0.912,,0.6,0.694,0.75,0.988,0.95,0.988,0.975,0.925,0.962
+NP_005016.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.844,1.0,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_212134.3,1.0,,1.0,,,1.0,1.0,,0.875,,1.0,1.0,1.0,1.0,1.0,0.888,0.9
+NP_003192.1,1.0,0.667,1.0,0.333,1.0,0.333,0.333,0.0,0.667,0.333,0.667,0.333,0.667,0.0,0.0,1.0,1.0
+NP_001278933.1,1.0,0.933,0.989,,0.944,0.867,0.922,,0.811,0.883,0.778,0.956,0.811,,,,
+NP_653186.2,1.0,1.0,1.0,1.0,1.0,1.0,0.625,1.0,1.0,0.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001290185.1,1.0,0.96,1.0,0.982,1.0,0.971,0.916,,0.96,0.919,0.996,1.0,0.982,0.982,0.993,0.945,0.955
+NP_001171634.3,1.0,,0.667,0.667,1.0,0.667,,0.333,1.0,0.667,0.667,0.667,0.667,,,,
+NP_001171536.2,1.0,0.99,1.0,0.931,1.0,1.0,0.691,0.99,0.979,0.641,0.921,1.0,0.993,0.951,0.984,1.0,1.0
+NP_477511.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057411.1,1.0,1.0,1.0,1.0,0.667,1.0,0.333,1.0,0.667,0.667,0.667,1.0,0.667,1.0,1.0,0.833,1.0
+NP_000259.1,1.0,1.0,0.917,0.889,0.917,1.0,1.0,0.889,0.75,0.667,0.625,1.0,1.0,0.667,0.889,1.0,1.0
+NP_057182.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.25,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056007.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.0,1.0,1.0,1.0,,,1.0,1.0
+NP_006603.2,1.0,0.992,0.985,0.923,0.985,0.8,0.938,0.738,0.692,0.815,0.923,0.985,0.954,0.938,0.938,0.769,0.777
+NP_004312.2,0.982,,0.974,,,0.845,0.983,,0.661,,0.865,0.983,0.974,0.979,0.976,0.815,0.839
+NP_004288.1,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,0.0,0.0,0.5,1.0,1.0,1.0,1.0,0.5,1.0
+NP_001254507.1,1.0,,1.0,1.0,0.857,0.571,,,1.0,,1.0,1.0,1.0,,,,
+NP_001129123.1,0.625,0.625,0.875,0.875,0.625,0.625,0.5,0.625,0.375,0.75,0.438,0.875,0.875,0.125,0.25,0.625,0.625
+NP_000881.3,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,0.667,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057386.2,1.0,,1.0,,,0.75,,,0.5,,1.0,1.0,1.0,,,,
+NP_004809.2,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.5,1.0,1.0,0.5,0.5
+NP_000466.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.837,0.779,0.957,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004811.1,1.0,1.0,1.0,0.818,1.0,1.0,1.0,1.0,1.0,0.909,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000273.2,1.0,0.986,1.0,1.0,0.995,0.968,1.0,1.0,0.838,0.847,0.715,0.981,0.991,1.0,1.0,1.0,1.0
+NP_060586.2,1.0,,0.958,,,0.958,,,0.875,,0.792,0.833,0.833,,,,
+NP_127497.1,1.0,0.885,1.0,,0.926,0.929,,,0.929,0.333,0.443,0.986,0.843,,,1.0,1.0
+NP_001304919.1,1.0,1.0,1.0,0.0,1.0,1.0,1.0,,1.0,0.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_004429.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_945345.2,1.0,0.854,0.896,1.0,0.979,0.854,1.0,0.833,0.938,0.938,0.75,0.979,0.875,0.875,0.896,0.854,0.844
+NP_071882.1,1.0,,1.0,1.0,,1.0,1.0,,0.714,,0.214,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000272.1,1.0,1.0,1.0,1.0,0.5,1.0,0.5,1.0,0.5,0.5,1.0,1.0,1.0,1.0,1.0,0.5,1.0
+NP_077010.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.933,1.0,1.0,1.0,1.0,0.8,0.8
+NP_219487.3,1.0,1.0,1.0,1.0,0.964,0.946,1.0,1.0,0.75,,0.625,1.0,1.0,1.0,1.0,0.86,0.9
+NP_149059.1,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002730.1,1.0,1.0,1.0,1.0,1.0,1.0,0.958,0.986,0.764,0.847,0.799,0.986,0.986,1.0,1.0,0.986,0.986
+NP_644807.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.833,0.833,1.0,0.833,1.0,1.0,1.0,1.0
+NP_689991.1,1.0,,1.0,0.778,,1.0,0.833,,0.778,,0.722,0.944,0.778,0.889,0.833,0.833,0.917
+NP_851853.1,1.0,0.9,1.0,1.0,1.0,0.8,1.0,0.8,0.6,1.0,1.0,1.0,1.0,0.9,0.9,1.0,1.0
+NP_001265868.1,0.985,0.864,1.0,,0.848,0.879,0.636,,0.894,0.854,0.773,1.0,0.879,,,0.889,0.889
+NP_060547.2,1.0,0.0,1.0,0.333,0.0,0.778,0.0,0.0,0.0,0.444,0.444,1.0,0.444,0.333,0.333,0.444,0.556
+NP_997235.3,1.0,0.979,0.99,0.99,1.0,0.979,0.979,0.979,0.854,0.927,0.891,0.99,0.979,0.948,0.958,0.958,0.979
+NP_003611.1,1.0,,0.929,,0.786,0.929,,,0.786,,0.786,0.857,0.786,,,,
+NP_005179.2,1.0,0.969,1.0,0.918,0.954,0.974,0.964,0.9,0.687,0.9,0.849,1.0,0.99,0.99,0.964,0.808,0.885
+NP_005521.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005018.2,0.924,0.924,0.933,0.873,0.933,0.958,0.93,0.933,0.776,0.702,0.9,0.945,0.945,0.952,0.958,0.861,0.918
+NP_006758.2,1.0,0.886,0.99,0.924,0.886,0.857,0.8,0.99,0.771,0.371,0.867,1.0,0.924,0.971,0.962,0.848,0.895
+NP_115979.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001005731.1,1.0,,1.0,,0.978,0.901,,,0.653,0.685,0.817,1.0,0.97,,,,
+NP_055771.4,1.0,1.0,1.0,,,1.0,,,0.929,0.964,0.982,1.0,1.0,,,,
+NP_036318.1,1.0,0.958,1.0,1.0,0.979,0.875,0.979,,0.688,1.0,0.927,1.0,1.0,0.979,0.958,0.844,0.958
+NP_940978.2,1.0,0.954,1.0,0.765,0.922,0.889,,0.824,0.719,0.637,0.791,0.98,0.863,,,,
+NP_387510.1,1.0,1.0,1.0,,1.0,0.955,1.0,,1.0,0.545,0.955,1.0,1.0,1.0,1.0,1.0,1.0
+NP_722541.1,1.0,1.0,1.0,,1.0,1.0,,,0.714,0.476,0.881,1.0,1.0,,,,
+NP_001374178.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,1.0,1.0
+NP_004475.1,1.0,0.887,0.903,,0.726,0.548,0.484,,0.71,0.25,0.048,1.0,0.968,0.516,0.5,0.952,0.992
+NP_001036068.1,1.0,0.875,1.0,0.625,1.0,0.75,0.375,0.25,0.75,0.875,0.875,1.0,1.0,0.75,0.375,0.812,0.875
+NP_005013.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_671729.2,1.0,,1.0,,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002159.2,1.0,0.75,1.0,0.875,0.875,0.875,0.875,0.75,0.625,0.75,0.625,1.0,0.75,0.625,0.625,0.812,0.75
+NP_004473.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006787.2,1.0,0.955,1.0,0.977,0.955,0.818,0.879,0.917,0.841,0.189,0.818,1.0,0.924,0.992,0.992,0.92,0.955
+NP_000026.2,0.952,0.905,0.857,0.81,0.952,0.762,0.905,0.905,0.81,0.857,0.429,0.905,0.81,0.714,0.667,0.714,0.762
+NP_001243805.1,0.923,0.462,0.974,,0.744,0.744,0.41,,0.744,0.59,0.859,0.897,0.769,0.718,0.667,0.769,0.718
+NP_071731.1,1.0,0.909,1.0,0.795,0.882,0.832,0.846,0.872,0.7,0.732,0.85,1.0,0.945,0.692,0.692,0.782,0.808
+NP_000497.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.875
+NP_004144.2,1.0,1.0,1.0,0.964,0.945,0.964,0.945,1.0,1.0,1.0,0.891,1.0,0.982,0.855,0.855,0.964,0.964
+NP_037468.1,1.0,0.333,1.0,0.667,1.0,0.333,0.333,1.0,0.0,,0.5,0.667,0.667,0.333,0.333,0.333,0.333
+NP_001412.1,1.0,0.5,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066124.1,0.998,0.993,0.996,0.986,0.995,0.997,0.977,0.736,0.999,0.885,0.976,0.995,0.989,0.972,0.973,0.997,0.997
+NP_001186181.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,1.0,1.0
+NP_001289.1,1.0,0.999,1.0,0.998,0.989,0.97,0.978,0.979,0.947,0.647,0.97,1.0,1.0,0.998,0.998,0.959,0.971
+NP_005620.1,0.989,0.977,0.999,0.923,0.987,0.927,0.956,0.825,0.788,0.908,0.866,0.956,0.934,0.96,0.974,0.93,0.949
+NP_002231.1,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056999.2,0.898,0.713,1.0,,0.741,0.671,,,0.593,0.361,0.727,0.991,0.852,,,,
+NP_002854.3,0.97,0.864,1.0,0.75,0.879,0.83,1.0,1.0,0.576,0.742,0.682,0.932,0.826,0.75,0.75,0.881,0.905
+NP_000539.2,0.966,0.948,0.972,0.962,0.93,0.887,0.948,,0.717,0.824,0.795,0.973,0.94,0.962,0.955,0.758,0.8
+NP_000924.3,1.0,0.994,1.0,,1.0,0.956,1.0,,0.888,0.909,0.941,1.0,0.982,1.0,0.955,0.992,1.0
+NP_060124.2,0.95,,0.964,,0.79,0.796,0.72,,0.792,,0.698,0.972,0.841,0.728,0.742,0.764,0.726
+NP_004449.1,1.0,1.0,1.0,,0.833,0.833,,,0.667,0.5,0.75,1.0,1.0,,,,
+NP_036425.1,1.0,1.0,1.0,1.0,1.0,1.0,0.857,0.571,0.929,0.5,0.929,1.0,1.0,1.0,1.0,0.714,0.643
+NP_003874.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004729.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_775831.2,0.949,0.923,1.0,1.0,0.949,0.795,1.0,1.0,0.897,0.846,0.731,0.897,0.923,1.0,1.0,0.795,0.872
+NP_001153695.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.5,0.5,1.0,1.0,,,1.0,1.0
+NP_996991.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.0,1.0,1.0,1.0,,,1.0,1.0
+NP_775960.4,1.0,0.981,1.0,,0.878,0.949,,,0.724,0.974,0.958,1.0,0.968,,,0.942,0.946
+NP_000278.3,1.0,0.932,1.0,0.765,0.837,0.86,0.796,0.835,0.719,0.699,0.731,1.0,0.973,0.914,0.91,0.86,0.857
+NP_113663.2,1.0,,1.0,0.81,,0.976,0.81,,1.0,,0.81,0.952,0.952,0.952,0.952,0.762,0.857
+NP_003990.1,1.0,1.0,1.0,0.8,0.875,0.917,0.9,1.0,0.167,0.583,0.917,1.0,1.0,0.7,0.6,0.958,0.917
+NP_000349.1,1.0,0.694,1.0,0.597,0.653,0.715,0.583,0.764,0.431,0.5,0.569,0.944,0.653,0.667,0.667,0.597,0.583
+NP_689606.2,1.0,1.0,1.0,1.0,0.98,0.96,1.0,0.92,0.82,0.74,0.83,1.0,1.0,1.0,1.0,0.94,0.98
+NP_004637.1,0.998,0.964,0.999,0.965,0.951,0.923,0.922,0.872,0.816,0.64,0.807,0.987,0.961,0.953,0.932,0.933,0.952
+NP_006138.1,1.0,1.0,1.0,1.0,0.988,1.0,0.964,1.0,0.833,0.988,0.976,1.0,1.0,0.964,0.976,0.988,1.0
+NP_056651.1,1.0,0.952,0.976,1.0,0.905,0.905,1.0,0.5,0.619,0.857,0.893,0.976,0.976,1.0,1.0,0.583,0.833
+NP_055742.2,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_059488.2,0.0,1.0,1.0,,0.0,1.0,0.0,0.0,1.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0
+NP_005907.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_957705.1,0.947,0.847,0.95,1.0,0.853,0.865,1.0,1.0,0.776,0.719,0.79,0.95,0.899,1.0,1.0,1.0,1.0
+NP_938143.1,0.875,1.0,0.875,,0.875,0.625,,,0.875,0.75,0.75,1.0,1.0,,,,
+NP_004290.2,1.0,,1.0,0.953,,0.888,,,0.894,,0.853,0.988,0.906,,,0.935,0.929
+NP_683720.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.25,0.5,1.0,1.0,,,,
+NP_000407.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.4,0.7,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001186136.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.9,0.8,1.0,0.9,1.0,1.0,,,,
+NP_002071.2,1.0,1.0,1.0,1.0,1.0,1.0,0.926,0.926,0.704,0.778,0.722,1.0,0.926,0.963,0.889,0.926,0.926
+NP_919224.1,0.994,0.952,0.996,0.965,0.968,0.861,0.942,0.901,0.756,0.673,0.733,0.996,0.992,0.942,0.935,0.858,0.871
+NP_000313.2,1.0,0.971,1.0,0.921,0.899,0.948,,0.861,0.733,0.907,0.96,1.0,0.973,,,,
+NP_001073882.3,0.867,0.8,1.0,0.8,0.333,0.867,0.8,0.8,0.867,0.733,0.367,0.867,0.733,0.7,0.6,0.6,0.6
+NP_000136.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.75,1.0,1.0,1.0,,,,
+NP_066382.1,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036610.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.667,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006168.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003986.2,1.0,1.0,1.0,0.972,0.944,0.722,0.861,0.944,0.722,0.917,0.931,1.0,0.944,0.778,0.778,0.875,0.917
+NP_001759.3,1.0,1.0,1.0,,1.0,0.5,,,1.0,0.0,1.0,1.0,1.0,,,0.5,0.75
+NP_055139.4,1.0,1.0,1.0,,1.0,1.0,1.0,,0.875,0.875,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000088.3,1.0,1.0,0.917,0.875,1.0,0.917,1.0,0.958,1.0,0.833,0.833,1.0,1.0,1.0,0.958,0.938,0.938
+NP_006508.2,0.982,,0.984,0.987,0.974,0.937,0.908,0.895,0.8,0.504,0.908,0.984,0.947,0.968,0.963,0.872,0.926
+NP_000488.3,1.0,0.974,0.997,0.878,0.997,0.974,0.974,0.964,0.974,0.748,0.933,0.99,0.98,0.987,0.97,0.974,0.975
+NP_000489.3,0.977,0.985,0.992,0.815,0.977,0.923,0.869,0.931,0.877,0.588,0.831,1.0,0.985,0.954,0.969,0.942,0.95
+NP_001684.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.933,0.333,0.833,1.0,1.0,1.0,1.0,1.0,0.967
+NP_000150.1,1.0,0.882,0.994,,0.777,0.851,0.828,0.813,0.815,0.646,0.692,0.974,0.875,0.806,0.806,0.814,0.867
+NP_004961.1,1.0,1.0,1.0,,1.0,0.875,0.875,,0.75,0.75,0.5,0.875,0.625,1.0,1.0,0.812,0.75
+NP_004535.1,1.0,0.6,1.0,0.4,0.8,0.6,0.6,0.8,0.2,0.1,0.6,0.8,0.8,0.8,0.8,0.6,0.6
+NP_116190.3,1.0,1.0,1.0,1.0,1.0,0.875,1.0,0.75,1.0,0.5,0.625,1.0,1.0,1.0,1.0,0.875,1.0
+NP_001687.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005984.3,1.0,0.987,0.954,0.987,0.967,0.934,0.974,0.901,0.947,0.819,0.836,1.0,1.0,0.888,0.928,0.891,0.829
+NP_003045.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004689.1,1.0,1.0,1.0,0.833,0.958,1.0,1.0,1.0,0.917,1.0,0.625,1.0,0.917,1.0,1.0,1.0,1.0
+NP_714544.1,1.0,1.0,0.985,0.908,0.954,1.0,0.908,,0.785,0.938,0.692,0.985,0.938,0.938,0.954,0.954,0.954
+NP_060083.1,1.0,0.963,1.0,1.0,0.981,0.944,0.975,1.0,1.0,,0.787,0.981,0.981,0.917,1.0,1.0,1.0
+NP_005248.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0
+NP_001034302.2,1.0,,1.0,0.417,0.75,0.667,0.333,0.833,0.667,0.583,0.75,0.917,0.667,0.75,0.583,,
+NP_001813.1,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.5,0.667,0.667,1.0,1.0,1.0,1.0
+NP_057097.2,1.0,1.0,1.0,0.8,1.0,1.0,0.8,0.8,1.0,1.0,0.85,1.0,1.0,0.9,0.9,0.85,0.95
+NP_001093326.2,0.958,,0.917,0.958,0.708,0.625,,,0.458,,1.0,0.917,0.833,,,0.458,0.521
+NP_006008.1,1.0,0.786,0.964,,0.75,0.786,0.714,,0.75,0.607,0.625,0.786,0.714,0.786,0.786,0.804,0.768
+NP_001035835.1,1.0,1.0,1.0,0.929,1.0,1.0,1.0,1.0,1.0,0.286,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000198.1,1.0,1.0,1.0,0.955,1.0,1.0,1.0,1.0,1.0,0.273,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002428.1,0.917,0.917,1.0,,0.917,0.917,0.917,,0.75,0.083,0.5,1.0,0.917,0.833,0.833,0.417,0.583
+NP_079426.2,1.0,1.0,1.0,1.0,1.0,1.0,0.857,1.0,1.0,,0.812,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055529.2,0.998,0.914,0.983,0.917,0.937,0.856,0.833,0.88,0.828,0.592,0.857,0.987,0.935,0.87,0.863,0.882,0.889
+NP_006740.1,0.986,0.958,0.986,,0.972,0.847,0.917,0.889,0.792,0.84,0.931,0.958,0.917,0.972,0.972,0.875,0.889
+NP_001683.2,1.0,0.944,1.0,1.0,0.944,1.0,0.944,1.0,0.944,0.778,0.75,1.0,0.889,0.833,0.833,1.0,1.0
+NP_001352465.1,0.956,0.918,0.99,0.933,0.874,0.817,0.846,0.8,0.772,0.869,0.807,0.971,0.883,0.86,0.814,,
+NP_004580.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002866.2,1.0,0.812,1.0,,0.75,0.875,1.0,,1.0,0.438,0.5,1.0,0.812,,,1.0,1.0
+NP_001124497.1,1.0,1.0,1.0,,1.0,0.667,1.0,,0.333,0.5,0.833,0.667,0.667,1.0,1.0,1.0,1.0
+NP_005140.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005645.1,0.985,,0.985,,,0.992,0.985,,0.969,,0.897,0.99,0.985,0.995,0.99,0.959,0.969
+NP_001121370.1,1.0,0.955,1.0,0.966,1.0,0.943,,0.989,0.739,0.693,0.818,1.0,0.943,,,0.864,0.864
+NP_932173.1,1.0,0.995,0.996,1.0,0.979,0.911,,0.994,0.885,0.942,0.914,0.998,0.988,,,0.925,0.925
+NP_002238.2,1.0,,1.0,,,0.967,,,0.9,,1.0,0.967,0.967,,,,
+NP_861448.2,1.0,1.0,1.0,,,0.889,1.0,,0.667,0.889,0.944,1.0,1.0,,,,
+NP_000302.1,1.0,1.0,1.0,0.872,0.949,0.91,0.795,0.936,0.872,0.551,0.859,1.0,0.936,0.846,0.782,0.846,0.808
+NP_036475.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.857,0.571,0.714,1.0,1.0,1.0,1.0,1.0,1.0
+NP_683763.2,1.0,0.833,1.0,0.944,0.944,1.0,0.889,0.778,0.444,0.556,0.778,1.0,0.889,0.944,0.944,0.722,0.833
+NP_689854.2,1.0,0.941,1.0,0.971,0.971,0.941,0.324,0.941,0.559,0.897,0.721,1.0,0.912,1.0,0.971,0.765,0.765
+NP_056517.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.667,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001278.1,1.0,0.98,0.996,0.967,0.991,0.946,0.974,0.972,0.869,0.778,0.817,0.976,0.93,1.0,1.0,0.925,0.958
+NP_722540.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_005947.3,1.0,1.0,1.0,,1.0,1.0,,1.0,0.889,,0.778,1.0,0.963,,,,
+NP_001369746.2,1.0,0.933,1.0,0.8,0.933,1.0,0.667,0.933,1.0,0.4,0.867,1.0,0.867,0.867,0.8,0.9,0.867
+NP_077282.3,0.98,0.98,0.98,0.97,0.97,0.99,0.97,0.929,0.99,0.682,0.722,1.0,0.99,0.939,0.96,0.99,0.97
+NP_076968.2,1.0,0.812,1.0,0.75,0.625,0.562,0.438,0.0,0.188,0.938,0.688,0.938,0.75,0.812,0.75,0.812,0.812
+NP_004035.2,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,0.75,0.583,0.833,1.0,1.0,1.0,1.0,0.958,1.0
+NP_071437.3,1.0,0.875,1.0,0.0,0.75,0.688,0.0,0.0,0.438,0.938,0.75,1.0,0.875,0.0,0.0,0.0,0.0
+NP_000168.1,1.0,0.989,1.0,0.989,0.954,0.851,0.701,0.805,0.586,0.925,0.759,0.966,0.759,0.977,0.977,0.816,0.874
+NP_003156.1,0.944,0.941,0.935,0.938,0.928,0.852,,0.786,0.645,0.753,0.626,0.93,0.928,,,0.822,0.858
+NP_524144.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,,
+NP_996879.1,1.0,0.909,1.0,1.0,1.0,1.0,0.909,1.0,0.818,0.909,1.0,1.0,1.0,0.909,0.909,0.955,0.909
+NP_000249.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.25,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001002841.1,1.0,,1.0,,0.0,0.0,0.0,,0.0,0.5,0.0,1.0,0.0,0.0,0.0,0.0,0.0
+NP_001116293.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5,1.0,0.5,1.0,1.0,1.0,1.0
+NP_002001.1,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.5,0.25,1.0,1.0,1.0,0.5,1.0,1.0,1.0
+NP_064576.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000206.2,1.0,0.865,1.0,0.719,0.99,1.0,0.74,0.934,0.837,0.564,0.969,1.0,0.893,0.852,0.857,0.913,0.908
+NP_056170.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057390.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003830.1,1.0,,0.967,,,0.933,,,0.9,,0.8,1.0,1.0,,,,
+NP_001366429.1,1.0,1.0,1.0,,1.0,0.854,,,0.488,1.0,0.829,1.0,1.0,,,0.878,0.963
+NP_001188356.1,0.667,,1.0,,0.333,1.0,1.0,,1.0,,0.667,1.0,1.0,,,,
+NP_008880.2,1.0,1.0,1.0,1.0,1.0,0.982,1.0,0.952,0.964,1.0,0.875,1.0,1.0,1.0,1.0,0.929,0.976
+NP_000968.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001616.1,1.0,0.944,1.0,,1.0,1.0,,,0.964,1.0,0.643,0.929,0.964,,,0.722,0.833
+NP_064569.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_058633.2,0.743,,0.771,,0.257,0.514,,,0.771,,0.514,0.886,0.571,,,,
+NP_000101.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.25,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078804.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078868.1,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060033.3,1.0,0.9,1.0,1.0,0.9,0.9,0.5,0.8,0.9,0.9,0.55,0.7,0.7,0.9,0.6,1.0,0.9
+NP_003784.2,1.0,0.929,1.0,1.0,1.0,0.857,1.0,0.857,0.857,0.5,0.893,1.0,1.0,1.0,1.0,0.893,0.929
+NP_004937.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.818,1.0,1.0,1.0,0.955,0.977,0.955
+NP_001089.1,1.0,0.5,1.0,0.778,0.833,0.444,0.778,0.778,0.722,0.667,0.611,0.944,0.611,0.556,0.444,0.472,0.444
+NP_001269862.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.762,1.0,0.881,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008853.3,0.974,0.953,0.982,,0.93,0.903,0.949,,0.76,0.981,0.855,0.991,0.964,0.952,0.944,0.903,0.94
+NP_001307679.1,1.0,0.952,1.0,1.0,1.0,0.929,0.905,0.714,0.833,0.667,0.81,1.0,1.0,1.0,1.0,0.929,0.976
+NP_000127.2,1.0,0.972,0.972,0.833,0.972,0.917,1.0,0.972,0.806,0.875,0.75,1.0,0.917,0.944,0.917,0.875,0.861
+NP_006484.2,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001182192.1,1.0,,1.0,,,0.967,0.917,,0.933,,0.75,1.0,0.917,0.983,0.967,0.833,0.917
+NP_963925.2,1.0,1.0,0.938,1.0,1.0,0.917,0.917,0.958,0.917,1.0,0.948,1.0,1.0,1.0,1.0,0.938,1.0
+NP_001157281.1,1.0,1.0,1.0,1.0,0.962,0.923,0.962,0.885,0.923,0.462,0.808,1.0,0.923,0.962,0.962,,
+NP_055908.1,1.0,,1.0,,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001275582.1,1.0,,1.0,0.857,0.857,0.786,0.857,,0.5,,0.5,1.0,0.857,,,,
+NP_009029.3,1.0,,0.889,,0.833,0.778,,,0.833,,0.861,0.944,0.889,,,0.667,0.611
+NP_001035202.1,1.0,0.971,1.0,0.95,0.971,0.942,,0.867,0.899,0.384,0.717,1.0,0.986,,,,
+NP_060764.3,1.0,0.933,1.0,0.933,0.933,0.933,0.933,0.8,0.733,0.8,0.833,1.0,0.8,0.867,0.8,0.867,0.933
+NP_066925.1,1.0,0.905,1.0,0.952,1.0,0.952,0.857,0.905,0.762,0.857,1.0,0.905,0.952,0.905,0.905,0.929,1.0
+NP_002464.1,0.984,0.95,0.99,0.908,0.971,0.901,0.91,0.925,0.73,0.62,0.894,0.979,0.942,0.958,0.956,0.894,0.896
+NP_000359.1,0.994,0.989,0.999,0.986,0.99,0.977,0.977,0.968,0.717,0.779,0.713,0.99,0.974,0.981,0.978,0.923,0.952
+NP_064445.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.5,1.0,1.0,1.0,,,,
+NP_005955.3,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_733774.1,1.0,,1.0,1.0,0.95,0.9,1.0,1.0,0.8,1.0,0.5,1.0,1.0,1.0,1.0,0.95,1.0
+NP_004073.2,0.994,0.925,1.0,0.919,0.957,0.922,0.919,0.938,0.764,0.733,0.724,0.981,0.913,0.938,0.919,0.87,0.913
+NP_066287.2,0.985,0.991,0.976,,0.909,0.905,0.962,,0.799,0.908,0.843,0.983,0.923,0.958,0.901,0.768,0.763
+NP_443108.1,0.9,0.885,0.947,0.802,0.823,0.9,0.867,0.823,0.953,0.3,0.853,0.82,0.767,0.75,0.729,0.938,0.885
+NP_000013.2,1.0,1.0,1.0,0.956,1.0,1.0,1.0,1.0,0.985,0.835,0.786,1.0,0.995,1.0,1.0,1.0,1.0
+NP_001013860.1,1.0,0.993,1.0,0.947,0.947,0.776,1.0,0.974,0.487,0.947,0.77,1.0,0.961,1.0,0.961,0.934,0.974
+NP_005410.1,1.0,0.857,1.0,1.0,0.857,1.0,0.857,1.0,1.0,0.714,0.929,1.0,0.857,1.0,1.0,0.929,1.0
+NP_644805.1,0.925,0.929,0.971,0.805,0.887,0.814,0.852,0.844,0.655,0.621,0.61,0.931,0.832,0.91,0.904,0.738,0.756
+NP_061950.2,1.0,1.0,1.0,0.974,1.0,1.0,1.0,,1.0,1.0,0.987,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000410.2,1.0,0.98,1.0,0.972,0.988,0.945,0.955,0.857,0.871,0.901,0.886,1.0,0.987,0.967,0.952,0.842,0.862
+NP_000071.1,1.0,0.984,1.0,,0.962,0.879,0.978,0.846,0.951,0.83,0.777,1.0,1.0,0.929,0.901,0.964,0.956
+NP_055713.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036470.1,1.0,1.0,1.0,1.0,1.0,0.917,1.0,0.833,0.833,0.917,0.833,1.0,1.0,0.917,0.833,0.833,0.833
+NP_001120693.1,0.992,,0.987,,1.0,0.946,0.957,,0.885,,0.959,0.995,0.984,,,,
+NP_000712.2,0.969,0.978,0.97,,0.963,0.847,,,0.6,0.891,0.825,0.976,0.952,,,0.798,0.858
+NP_001439.2,0.75,0.25,1.0,0.25,0.0,0.25,0.0,0.0,0.0,0.0,0.25,0.75,0.25,0.5,0.25,0.0,0.0
+NP_861454.2,0.969,0.875,1.0,1.0,0.969,0.844,0.688,1.0,0.844,0.344,0.734,0.969,0.938,1.0,1.0,0.906,0.906
+NP_006293.2,1.0,,1.0,,,1.0,1.0,,1.0,,0.714,1.0,0.929,1.0,1.0,0.643,0.714
+NP_714928.1,1.0,0.97,0.939,1.0,1.0,0.364,0.909,,0.606,0.636,0.242,1.0,1.0,0.97,1.0,0.864,0.909
+NP_001070984.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_009051.1,1.0,1.0,1.0,0.955,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001094832.1,0.865,0.567,0.865,0.786,0.913,0.726,0.468,0.841,0.73,0.258,0.635,0.786,0.667,0.651,0.675,0.659,0.619
+NP_004551.2,0.96,0.831,0.943,0.811,0.823,0.829,0.754,0.749,0.829,0.609,0.64,0.897,0.834,0.829,0.84,0.623,0.697
+NP_000172.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.692,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001032.2,1.0,0.983,0.983,0.767,1.0,0.983,0.933,0.833,0.95,0.733,0.983,0.983,1.0,0.967,0.95,0.892,0.917
+NP_001287829.1,1.0,1.0,1.0,1.0,1.0,1.0,0.625,,1.0,0.625,0.875,1.0,1.0,,,0.875,0.75
+NP_064425.2,1.0,1.0,1.0,0.962,0.962,1.0,0.846,0.962,0.923,0.923,0.846,0.962,1.0,0.923,0.923,0.846,0.923
+NP_004619.3,1.0,1.0,0.938,1.0,1.0,0.875,0.812,0.875,0.875,0.75,0.938,1.0,1.0,0.938,0.938,0.969,0.969
+NP_000018.2,1.0,1.0,1.0,0.854,0.979,0.979,1.0,0.958,0.958,0.812,0.812,1.0,1.0,1.0,1.0,1.0,0.979
+NP_001008701.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112224.1,1.0,1.0,1.0,0.857,1.0,1.0,1.0,1.0,1.0,0.571,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002963.2,0.909,0.091,0.364,0.636,0.364,0.091,,,0.182,0.455,0.182,0.545,0.091,,,,
+NP_000751.1,1.0,0.667,1.0,,0.818,0.697,0.727,,0.697,0.697,0.606,0.939,0.848,0.667,0.636,0.788,0.758
+NP_705690.2,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,0.167,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006615.2,0.898,0.864,1.0,,0.764,0.898,0.739,0.875,0.795,0.667,,0.932,0.852,0.736,0.875,,
+NP_054772.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,0.375,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004420.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.978,0.778,1.0,1.0,1.0,1.0,1.0,0.978,0.978
+NP_001233.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000834.2,1.0,0.976,1.0,,0.976,0.878,0.935,0.81,0.792,0.827,0.866,0.982,0.952,0.958,0.946,0.815,0.866
+NP_000325.4,1.0,0.985,1.0,0.934,0.979,0.968,0.985,0.961,0.818,0.906,0.906,1.0,0.982,0.969,0.958,0.895,0.919
+NP_015564.5,0.945,,0.945,0.85,0.836,0.927,0.98,,0.873,,0.8,0.836,0.818,0.95,0.95,0.636,0.761
+NP_000739.1,0.95,0.883,0.967,0.9,0.983,0.867,0.817,0.933,0.55,0.55,0.675,0.983,0.883,0.9,0.9,0.767,0.783
+NP_006350.1,1.0,0.986,1.0,1.0,1.0,0.938,0.667,0.917,0.861,0.417,0.806,0.986,0.958,1.0,1.0,0.972,1.0
+NP_001230106.1,1.0,,1.0,1.0,1.0,0.5,,,0.5,0.5,0.5,1.0,1.0,,,,
+NP_001934.2,1.0,0.987,1.0,0.967,0.953,0.973,0.92,0.953,0.813,0.96,0.907,1.0,0.98,0.867,0.873,0.98,0.987
+NP_061949.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002171.2,0.996,0.966,0.994,0.996,0.982,0.899,0.994,0.933,0.841,0.767,0.923,0.994,0.974,0.982,0.966,0.911,0.949
+NP_078875.4,0.96,1.0,1.0,0.84,1.0,0.82,0.86,0.94,0.7,1.0,0.91,1.0,1.0,0.98,1.0,0.94,1.0
+NP_000255.2,0.982,0.971,0.995,0.99,0.964,0.958,0.971,0.957,0.717,0.657,0.807,0.989,0.977,0.987,0.991,0.928,0.949
+NP_000243.1,0.994,0.989,0.997,0.987,0.99,0.982,0.987,0.98,0.866,0.738,0.709,0.996,0.989,0.989,0.989,0.981,0.981
+NP_061176.4,1.0,0.529,1.0,0.765,0.941,0.853,0.353,1.0,0.882,0.265,0.941,0.882,0.647,0.588,0.412,1.0,0.882
+NP_758440.1,1.0,1.0,1.0,0.955,0.955,0.955,0.864,0.909,0.773,0.773,0.591,1.0,0.955,1.0,1.0,0.795,0.818
+NP_005325.2,0.993,0.943,1.0,0.993,0.943,0.975,1.0,0.929,0.943,1.0,0.907,1.0,1.0,0.986,0.957,0.929,0.95
+NP_055569.1,0.722,0.833,0.722,0.611,0.694,0.694,0.667,0.722,0.417,0.583,0.708,0.833,0.639,0.583,0.611,0.667,0.694
+NP_000487.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001745.2,1.0,0.998,1.0,,0.986,0.942,,,0.817,1.0,0.833,1.0,1.0,,,0.845,0.888
+NP_005351.2,1.0,,1.0,,0.934,0.956,,,0.835,0.703,0.835,1.0,0.989,,,,
+NP_000039.2,0.994,0.994,0.987,0.985,0.958,0.981,0.952,0.912,0.923,0.62,0.996,1.0,0.987,0.981,0.975,0.937,0.95
+NP_009057.1,1.0,1.0,1.0,1.0,0.96,1.0,1.0,1.0,1.0,1.0,0.94,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003394.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001264.2,0.918,0.964,0.961,0.97,0.889,0.846,0.9,0.889,0.797,1.0,0.813,0.971,0.946,0.904,0.874,0.554,0.65
+NP_075066.1,1.0,0.75,1.0,1.0,1.0,1.0,0.5,1.0,0.75,0.5,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003089.1,0.857,0.857,1.0,1.0,1.0,0.857,1.0,0.857,1.0,0.857,0.929,0.857,0.857,0.857,0.857,1.0,0.857
+NP_000021.1,1.0,0.998,1.0,0.955,0.945,0.962,0.939,0.968,0.912,0.766,0.952,1.0,0.991,0.992,0.991,0.936,0.913
+NP_000362.1,0.996,,0.979,,,0.728,0.872,,0.663,,0.728,0.967,0.778,0.984,0.955,0.702,0.724
+NP_036450.1,1.0,1.0,1.0,1.0,1.0,1.0,0.929,0.929,1.0,,0.964,1.0,1.0,0.929,0.929,1.0,1.0
+NP_009193.2,1.0,,1.0,,,0.889,0.778,,0.889,0.778,0.778,1.0,0.778,0.889,0.889,0.889,0.889
+NP_573400.3,1.0,0.714,1.0,,0.429,0.571,1.0,1.0,0.571,0.0,0.286,0.857,0.286,0.0,0.0,0.0,0.0
+NP_079535.4,1.0,0.5,1.0,,1.0,1.0,,,1.0,0.5,,1.0,0.5,,,1.0,1.0
+NP_004764.1,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003891.1,0.983,0.967,1.0,0.833,0.833,0.867,0.95,0.8,0.8,0.983,0.8,1.0,0.983,0.933,0.883,0.883,0.9
+NP_570901.3,1.0,0.882,1.0,0.967,0.933,0.872,0.878,0.929,0.835,0.529,0.659,1.0,0.929,0.694,0.647,0.835,0.882
+NP_036205.1,1.0,1.0,1.0,0.5,1.0,0.0,0.5,0.5,0.5,1.0,0.5,1.0,0.5,0.5,0.0,0.75,0.5
+NP_065737.2,1.0,1.0,1.0,,1.0,1.0,0.905,,0.905,0.667,0.833,1.0,1.0,0.952,0.952,1.0,1.0
+NP_004951.1,0.991,0.944,0.991,0.87,0.583,0.819,0.926,0.898,0.685,0.769,0.5,1.0,0.917,0.944,0.963,0.537,0.546
+NP_001092692.1,0.917,0.833,0.875,,0.625,0.667,0.375,,0.667,0.542,1.0,0.917,0.833,0.625,0.75,0.5,0.375
+NP_001019801.3,1.0,1.0,1.0,,0.968,0.992,0.0,,0.556,1.0,0.667,1.0,1.0,0.0,0.0,0.5,0.5
+NP_000301.1,1.0,,1.0,0.939,,0.985,1.0,,0.894,,0.977,1.0,1.0,1.0,1.0,0.939,0.924
+NP_001005271.2,0.951,0.936,0.986,0.93,0.96,0.881,,0.86,0.852,0.615,0.737,0.962,0.914,,,,
+NP_001158233.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000264.2,0.959,0.929,0.98,0.939,0.959,1.0,0.867,0.847,0.908,0.786,0.918,0.98,0.888,0.898,0.898,0.939,0.929
+NP_005449.5,0.863,0.851,0.904,0.922,0.827,0.796,0.872,0.65,0.64,0.747,0.73,0.838,0.811,0.943,0.916,0.761,0.85
+NP_002440.2,1.0,1.0,1.0,0.833,1.0,0.917,1.0,1.0,0.833,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055282.1,1.0,0.778,1.0,0.889,0.778,1.0,0.444,1.0,0.889,0.0,0.889,1.0,0.889,0.889,0.778,0.944,0.889
+NP_004514.2,0.963,0.904,0.978,0.993,0.949,0.868,0.904,0.882,0.691,0.548,0.842,0.963,0.912,0.926,0.949,0.897,0.893
+NP_115881.3,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,0.667,,1.0,1.0,1.0,1.0,1.0,1.0,0.833
+NP_001244941.2,1.0,0.972,1.0,0.963,0.981,0.926,,0.917,0.5,0.944,0.694,0.972,0.944,,,,
+NP_001165558.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000053.2,1.0,1.0,1.0,0.895,0.991,0.963,0.978,0.969,0.906,0.65,0.883,1.0,0.997,1.0,1.0,0.892,0.94
+NP_065811.1,0.992,0.98,0.995,0.98,0.91,0.964,0.983,0.989,0.855,0.939,0.93,0.987,0.965,0.977,0.986,0.945,0.976
+NP_003168.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.333,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689677.1,1.0,0.4,1.0,0.9,1.0,0.4,0.8,0.8,0.0,0.4,0.15,1.0,0.6,0.8,0.9,0.8,0.7
+NP_060595.3,1.0,1.0,1.0,1.0,0.667,1.0,0.667,0.667,0.667,1.0,0.833,1.0,0.667,1.0,1.0,1.0,1.0
+NP_004688.2,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,0.75,0.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000584.3,,,,,,,,,,,,,,,,,
+NP_858058.1,0.833,0.667,0.917,1.0,0.833,0.75,0.833,0.833,0.25,0.75,0.5,0.833,0.833,0.667,0.917,0.833,0.667
+NP_652763.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004162.2,0.82,0.777,0.847,0.782,0.803,0.692,0.741,0.777,0.592,0.697,0.573,0.85,0.81,0.789,0.827,0.741,0.776
+NP_078789.2,1.0,1.0,0.974,,0.974,1.0,,,0.923,0.692,1.0,1.0,1.0,,,,
+NP_005546.2,1.0,0.985,1.0,0.955,1.0,0.955,1.0,0.833,0.864,0.894,0.924,1.0,0.985,0.985,0.985,0.955,0.97
+NP_775109.2,1.0,1.0,1.0,1.0,1.0,0.714,1.0,0.714,0.857,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005545.1,1.0,0.987,1.0,1.0,1.0,0.933,0.981,0.788,0.776,0.968,0.846,1.0,0.949,0.987,0.974,0.885,0.955
+NP_060116.2,1.0,1.0,1.0,1.0,1.0,1.0,0.889,1.0,0.917,0.917,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056508.2,1.0,1.0,1.0,1.0,0.846,1.0,1.0,0.923,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.923,0.923
+NP_056155.1,0.992,0.914,1.0,,0.992,0.949,0.961,,0.891,0.742,0.723,0.969,0.953,0.938,0.93,0.914,0.969
+NP_001355738.1,,,,,,,,,,,,,,,,,
+NP_001034639.1,1.0,1.0,1.0,0.875,0.875,0.875,0.625,1.0,0.625,0.875,1.0,1.0,1.0,1.0,1.0,0.625,0.75
+NP_001010867.1,1.0,1.0,1.0,0.81,1.0,1.0,0.524,1.0,0.905,0.571,0.905,1.0,1.0,0.952,1.0,1.0,1.0
+NP_000298.3,1.0,1.0,0.917,1.0,1.0,0.917,,0.833,0.667,1.0,0.75,1.0,1.0,,,,
+NP_115584.1,1.0,0.833,1.0,1.0,0.833,0.833,1.0,0.333,0.667,0.833,0.833,1.0,0.833,1.0,1.0,0.75,0.833
+NP_006554.1,1.0,0.889,1.0,1.0,1.0,1.0,0.889,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056146.1,0.833,0.0,0.667,0.333,0.333,0.333,0.0,0.667,0.667,0.083,0.917,0.667,0.333,0.167,0.167,0.333,0.667
+NP_694551.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001091981.1,0.854,0.812,0.906,,0.885,0.74,0.75,,0.833,0.359,0.536,0.823,0.708,0.767,0.75,0.839,0.812
+NP_000684.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.925,0.925,0.95,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056496.1,1.0,0.5,1.0,1.0,0.75,0.5,0.5,0.0,0.125,0.5,0.688,0.875,0.75,1.0,1.0,0.25,0.5
+NP_006406.1,1.0,0.933,1.0,0.883,0.917,0.958,0.85,1.0,0.667,0.533,0.75,0.95,0.917,0.867,0.817,0.917,0.9
+NP_510965.1,0.944,0.917,1.0,1.0,0.944,0.944,0.944,0.972,0.611,0.861,0.833,0.972,0.944,1.0,1.0,0.889,0.972
+NP_005564.1,1.0,1.0,1.0,0.95,0.95,1.0,0.95,0.95,0.6,0.6,0.75,1.0,0.95,0.95,1.0,1.0,1.0
+NP_001341530.2,,,,,,,,,,,,,,,,,
+NP_001071.1,1.0,0.945,1.0,,0.953,0.903,0.965,,0.847,0.771,0.831,0.996,0.951,0.988,0.982,0.916,0.953
+NP_000374.1,1.0,0.995,1.0,0.963,0.963,0.975,0.989,0.984,0.904,0.66,0.873,1.0,0.992,0.976,0.979,0.96,0.971
+NP_065857.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_982301.1,0.947,0.917,0.992,1.0,0.909,0.932,0.918,0.857,0.606,0.364,0.727,0.955,0.909,0.898,0.918,0.841,0.894
+NP_000199.2,1.0,0.906,1.0,0.8,0.922,0.766,0.8,0.767,0.969,0.797,0.742,0.938,0.938,0.933,0.933,0.828,0.797
+NP_056092.2,0.888,0.862,0.938,,0.862,0.872,,,0.8,0.772,0.544,0.875,0.819,,,0.784,0.788
+NP_061951.1,1.0,1.0,1.0,0.909,1.0,1.0,1.0,1.0,1.0,0.939,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061966.1,1.0,1.0,1.0,0.864,1.0,1.0,0.773,1.0,0.955,0.909,0.568,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000506.2,0.944,0.667,0.944,0.5,0.722,0.722,0.667,0.167,0.778,0.667,0.556,0.889,0.611,0.167,0.833,0.167,0.167
+NP_003854.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_741960.1,1.0,1.0,1.0,,0.938,0.938,0.938,,0.688,0.75,0.5,1.0,1.0,1.0,1.0,0.875,0.875
+NP_005557.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.5,1.0,1.0,0.0,0.0,0.5,1.0
+NP_057119.3,1.0,0.973,1.0,1.0,0.964,0.973,0.982,1.0,0.911,0.821,0.991,1.0,0.964,0.964,0.982,0.875,0.875
+NP_542119.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_954712.1,1.0,0.583,1.0,0.778,0.722,0.833,0.694,0.75,0.5,0.458,0.694,0.972,0.75,0.667,0.639,0.667,0.667
+NP_001050.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_982272.2,0.981,0.896,0.981,0.885,0.962,0.981,0.712,,0.692,,0.635,1.0,0.942,0.692,0.673,0.817,0.837
+NP_004437.2,1.0,1.0,1.0,0.944,1.0,0.792,0.917,0.889,0.528,0.597,0.778,1.0,0.944,0.972,0.917,0.611,0.694
+NP_006736.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.6,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003033.3,0.934,,0.941,,,0.881,0.889,,0.795,,0.71,0.959,0.954,0.944,0.945,0.851,0.891
+NP_000169.1,1.0,0.75,1.0,,0.667,0.667,0.833,0.167,0.333,0.917,0.5,0.833,0.667,0.667,0.667,0.25,0.667
+NP_003578.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003745.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_068741.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001092744.1,1.0,,1.0,,,1.0,0.982,,0.946,,0.875,1.0,1.0,1.0,0.982,0.884,0.893
+NP_001895.1,0.929,,0.976,,,1.0,0.905,,0.929,,0.583,0.833,0.786,0.905,0.905,0.869,0.905
+NP_443076.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.125,0.95,1.0,0.95,1.0,0.9,0.95,1.0
+NP_714915.3,1.0,0.983,1.0,1.0,1.0,0.997,0.957,0.983,0.95,0.711,0.993,1.0,0.99,0.997,0.997,1.0,0.983
+NP_001070884.2,1.0,,1.0,,0.5,0.5,,0.0,0.5,,0.75,1.0,0.5,,,,
+NP_000185.1,1.0,1.0,1.0,1.0,1.0,0.952,0.762,0.952,1.0,0.405,0.952,1.0,1.0,0.952,0.952,0.952,0.952
+NP_000066.1,1.0,1.0,1.0,,1.0,1.0,0.5,,1.0,1.0,0.75,1.0,1.0,,,,
+NP_057118.1,1.0,1.0,1.0,1.0,1.0,1.0,0.857,0.929,1.0,0.857,1.0,1.0,1.0,1.0,0.929,1.0,1.0
+NP_001313340.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.857,,0.786,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001211.3,0.919,0.821,0.967,,0.927,0.862,,,0.837,0.179,0.559,0.89,0.858,,,0.864,0.856
+NP_001358001.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_001130.1,1.0,0.991,1.0,,0.981,0.977,0.806,0.981,0.898,0.657,0.75,0.991,0.972,0.954,0.926,0.907,0.898
+NP_006354.2,1.0,0.856,1.0,0.844,0.844,0.822,0.911,,0.778,0.822,0.722,0.978,0.978,0.756,0.667,0.933,0.911
+NP_005041.1,1.0,0.889,1.0,1.0,0.917,0.889,0.972,0.944,0.889,0.653,0.889,0.889,0.861,0.972,1.0,0.944,0.972
+NP_000700.1,0.997,0.974,0.989,0.969,0.953,0.953,0.931,0.883,0.786,0.807,0.772,0.967,0.975,0.978,0.956,0.91,0.958
+NP_038202.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000364.1,1.0,0.75,1.0,0.75,1.0,0.5,0.5,0.5,1.0,0.5,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078824.2,1.0,0.462,1.0,0.846,0.846,0.692,0.615,0.231,0.385,0.538,0.962,1.0,0.462,0.462,0.615,0.231,0.231
+NP_079011.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_443077.1,1.0,1.0,1.0,1.0,1.0,0.893,1.0,1.0,0.893,0.964,0.571,1.0,1.0,0.946,0.911,0.857,1.0
+NP_056090.1,1.0,1.0,1.0,,0.9,0.9,1.0,0.9,1.0,0.9,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055472.1,0.5,0.0,1.0,0.0,0.5,1.0,0.0,0.5,1.0,0.0,0.5,1.0,0.0,0.5,0.5,0.5,1.0
+NP_079509.5,1.0,1.0,1.0,,1.0,0.8,1.0,,1.0,1.0,0.6,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005756.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003696.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.333,1.0,0.833,0.833,0.833,1.0,1.0,0.833,0.833
+NP_000010.1,1.0,1.0,1.0,0.988,0.988,1.0,1.0,0.983,0.878,0.703,0.977,0.977,0.953,0.994,1.0,0.977,0.994
+NP_057043.1,1.0,0.5,0.5,,1.0,1.0,,,1.0,0.0,0.75,1.0,1.0,,,1.0,1.0
+NP_003110.1,0.991,,0.986,0.941,,0.916,0.973,,0.764,,0.92,0.977,0.977,0.945,0.936,0.941,0.968
+NP_001080.2,0.962,0.893,1.0,0.842,0.94,0.919,0.876,0.94,0.868,0.718,0.947,1.0,0.97,0.97,0.97,0.981,0.987
+NP_077718.3,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_002993.1,1.0,1.0,1.0,0.99,0.981,1.0,0.99,1.0,0.74,0.942,0.75,1.0,0.942,0.99,0.971,0.962,0.971
+NP_006021.2,1.0,1.0,1.0,,1.0,1.0,,,0.556,1.0,0.889,1.0,1.0,,,1.0,1.0
+NP_004587.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000912.3,1.0,0.955,1.0,0.909,0.909,0.909,0.773,0.818,0.864,0.636,0.909,1.0,0.955,0.773,0.818,0.955,1.0
+NP_060579.3,1.0,0.929,1.0,0.857,0.857,1.0,1.0,1.0,1.0,0.857,1.0,0.857,0.857,1.0,1.0,1.0,1.0
+NP_004705.1,1.0,,1.0,0.667,1.0,1.0,1.0,0.667,0.833,0.167,0.667,0.833,0.333,0.333,0.667,1.0,0.667
+NP_000242.1,0.882,0.866,0.879,0.879,0.867,0.863,0.875,0.786,0.711,0.669,0.675,0.885,0.865,0.894,0.879,0.82,0.843
+NP_003066.2,1.0,1.0,1.0,,1.0,1.0,1.0,0.906,0.958,0.55,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689726.3,0.714,1.0,1.0,1.0,1.0,1.0,0.857,1.0,1.0,1.0,0.714,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115785.1,1.0,1.0,0.967,1.0,1.0,0.75,1.0,0.933,0.733,0.433,0.933,1.0,1.0,1.0,1.0,0.867,0.9
+NP_000812.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.979,1.0,1.0,1.0,1.0,0.958,0.917
+NP_060311.1,1.0,1.0,1.0,,,1.0,,,0.917,1.0,0.75,1.0,1.0,,,,
+NP_898871.1,1.0,1.0,1.0,,1.0,1.0,,,0.942,0.108,1.0,1.0,0.992,,,,
+NP_060814.4,1.0,,1.0,,,0.94,0.976,,0.952,,0.881,1.0,1.0,1.0,1.0,0.94,0.923
+NP_036580.2,1.0,0.667,1.0,1.0,1.0,1.0,0.667,1.0,1.0,0.333,0.333,1.0,0.333,1.0,1.0,1.0,1.0
+NP_002212.3,0.037,0.333,0.222,,0.259,0.519,0.259,,0.37,0.352,0.389,0.0,0.407,0.556,0.407,0.407,0.407
+NP_067641.2,1.0,0.833,1.0,,0.933,0.933,0.833,,0.833,0.4,0.817,0.933,0.8,0.8,0.8,0.867,0.867
+NP_000952.1,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0
+NP_005133.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.857,1.0,0.0,0.857,1.0,1.0,,,,
+NP_001186728.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.0,0.95,1.0,1.0,,,1.0,1.0
+NP_036325.2,1.0,0.925,0.988,0.938,0.912,0.95,0.938,0.938,0.737,0.7,0.75,0.988,0.962,0.9,0.938,0.95,0.975
+NP_001352.2,0.875,1.0,1.0,0.5,1.0,1.0,1.0,1.0,0.625,0.375,0.875,1.0,1.0,1.0,1.0,0.75,1.0
+NP_061923.2,1.0,1.0,1.0,,1.0,0.905,1.0,,0.857,0.429,0.905,1.0,1.0,1.0,1.0,0.929,0.881
+NP_001159762.1,0.716,0.784,0.775,,0.676,0.681,,,0.843,0.765,0.75,0.686,0.735,,,,
+NP_005711.1,1.0,1.0,1.0,,,0.917,1.0,,0.333,1.0,0.667,1.0,1.0,,,,
+NP_997005.1,0.955,0.944,0.949,,0.944,0.811,0.939,,0.692,0.848,0.667,0.965,0.919,0.884,0.864,0.813,0.874
+NP_057278.2,1.0,1.0,1.0,0.5,1.0,1.0,0.5,0.833,0.667,0.333,0.75,1.0,1.0,1.0,1.0,0.833,0.833
+NP_001351952.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.8,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001137282.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_597680.2,1.0,1.0,1.0,,0.979,1.0,0.979,0.938,1.0,0.5,0.958,1.0,0.958,1.0,1.0,0.979,1.0
+NP_690870.3,1.0,1.0,1.0,,1.0,1.0,,,0.8,0.9,0.6,1.0,1.0,,,0.875,0.812
+NP_001138545.1,1.0,,,1.0,1.0,1.0,1.0,,0.0,,,1.0,1.0,0.0,0.0,1.0,1.0
+NP_057288.1,0.929,,,1.0,1.0,1.0,0.929,,,0.429,,,,,,,
+NP_991331.1,0.914,0.857,0.943,0.981,0.981,0.914,,0.867,0.886,0.557,0.8,0.829,0.81,,,0.752,0.762
+NP_001783.2,0.867,0.6,0.967,0.6,0.567,0.633,0.933,0.433,0.767,0.6,0.733,0.633,0.4,0.7,0.667,0.633,0.667
+NP_068712.1,0.984,0.979,0.984,0.979,0.995,0.886,0.962,0.922,0.706,0.616,0.782,0.975,0.953,0.996,0.994,0.974,0.983
+NP_056480.1,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,0.714,0.75,0.75,1.0,1.0,1.0,0.929,1.0,1.0
+NP_055151.3,1.0,0.911,1.0,0.911,0.982,0.946,0.982,0.946,0.607,0.589,0.571,0.982,0.929,0.964,0.911,1.0,0.964
+NP_006051.1,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002644.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.75,1.0,1.0,1.0,1.0,0.5,0.75
+NP_003280.2,0.923,0.545,0.923,,0.182,0.538,,0.7,0.615,0.455,0.615,0.385,0.385,,,0.5,0.583
+NP_001018005.1,0.967,0.9,1.0,,0.833,0.867,,,0.933,0.7,0.5,0.967,0.933,,,,
+NP_001511.2,1.0,,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_000806.2,0.944,0.889,0.889,0.833,0.833,0.778,0.889,0.833,0.944,0.667,0.722,0.889,0.833,0.778,0.889,0.833,0.833
+NP_478059.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,0.75,1.0,1.0,,,1.0,1.0
+NP_005008.2,1.0,1.0,1.0,,1.0,0.755,0.99,,0.653,,0.602,0.99,0.99,,,,
+NP_001185.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.852,1.0,1.0,1.0,1.0,0.963,1.0,1.0
+NP_114148.3,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_835455.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004202.4,1.0,0.974,1.0,0.995,0.974,0.944,0.98,0.974,0.821,0.969,0.875,0.995,0.954,1.0,1.0,0.931,0.957
+NP_079519.1,1.0,,1.0,1.0,,0.867,1.0,,0.867,,0.783,1.0,1.0,1.0,1.0,0.983,1.0
+NP_054858.2,0.968,0.848,0.976,0.892,0.892,0.846,0.864,0.888,0.868,0.87,0.868,0.992,0.92,0.856,0.864,0.916,0.916
+NP_002759.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_001230695.2,0.99,0.934,0.957,,0.97,0.882,0.874,,0.812,0.768,0.572,0.977,0.957,0.861,0.861,0.794,0.889
+NP_002509.2,1.0,0.2,1.0,0.2,1.0,0.2,0.6,0.6,0.0,0.4,0.0,1.0,0.0,0.2,0.2,0.4,0.6
+NP_001484.1,1.0,1.0,0.5,1.0,1.0,1.0,0.5,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112506.2,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0
+NP_002706.1,0.949,0.949,0.872,0.808,1.0,0.718,0.615,0.846,0.359,0.564,0.5,1.0,1.0,1.0,1.0,0.808,0.846
+NP_068810.3,1.0,1.0,1.0,1.0,0.8,0.8,0.2,1.0,0.4,0.0,0.8,1.0,1.0,0.8,0.6,0.4,1.0
+NP_001165806.1,,,,,,,,,,,,,,,,,
+NP_004167.3,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.938,1.0,1.0,1.0,1.0,0.938,0.938
+NP_078951.2,1.0,1.0,1.0,0.923,1.0,1.0,0.962,0.923,0.885,0.654,1.0,1.0,0.962,1.0,0.962,0.923,0.923
+NP_002392.2,1.0,1.0,1.0,,1.0,0.5,1.0,,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002576.1,1.0,,1.0,,0.9,0.985,0.97,,0.97,,0.667,1.0,1.0,,,,
+NP_056109.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.0,0.75,0.875,1.0,1.0,1.0,1.0,0.75,1.0
+NP_002920.1,0.667,0.667,1.0,1.0,1.0,1.0,0.667,1.0,0.667,0.333,1.0,0.667,0.667,1.0,1.0,0.833,0.833
+NP_077744.4,,,,,,,,,,,,,,,,,
+NP_705831.1,1.0,0.0,1.0,0.75,0.5,0.312,0.5,0.75,0.25,0.625,0.5,0.875,0.5,1.0,1.0,0.188,0.125
+NP_001681.2,0.944,0.833,0.944,0.889,0.667,0.611,0.944,0.5,0.889,0.056,0.75,0.944,0.5,0.833,0.889,0.917,0.889
+NP_001219.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_002682.2,1.0,0.978,1.0,,1.0,0.978,0.978,0.956,0.844,0.867,0.967,1.0,1.0,1.0,1.0,0.933,0.956
+NP_001316485.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,,
+NP_814444.1,1.0,0.955,1.0,1.0,1.0,0.977,1.0,0.909,0.932,0.773,0.977,1.0,0.977,1.0,1.0,0.909,1.0
+NP_056385.1,0.885,0.881,0.969,,0.753,0.685,,,0.508,0.463,0.785,0.873,0.808,,,0.679,0.764
+NP_055509.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.819,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002008.2,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000528.1,1.0,0.5,0.9,0.4,0.6,0.8,0.7,0.7,1.0,0.6,0.6,1.0,0.7,0.6,0.7,1.0,0.7
+NP_945350.1,1.0,1.0,1.0,,1.0,1.0,,,0.833,0.167,0.583,1.0,1.0,,,1.0,1.0
+NP_001182399.1,1.0,1.0,1.0,,0.952,1.0,1.0,,0.857,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001850.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005340.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001123476.1,1.0,0.829,1.0,0.75,0.895,0.89,0.737,1.0,0.714,0.6,0.8,0.981,0.895,0.947,0.947,0.956,0.956
+NP_001876.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002307.2,1.0,1.0,1.0,,0.913,0.952,,,0.626,1.0,0.6,1.0,0.983,,,,
+NP_001153839.1,0.0,0.0,1.0,,1.0,1.0,,,0.0,0.5,1.0,1.0,1.0,,,1.0,1.0
+NP_005468.1,1.0,1.0,1.0,1.0,0.995,0.968,0.995,0.918,0.855,0.87,0.995,1.0,1.0,1.0,1.0,0.986,1.0
+NP_000434.1,1.0,,1.0,,,1.0,,,0.692,,0.853,0.987,0.923,,,0.923,0.936
+NP_057953.1,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002016.2,0.875,0.844,0.938,0.5,0.812,0.844,0.625,0.938,0.406,0.578,0.594,1.0,0.812,0.688,0.688,0.781,0.781
+NP_056536.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.72,0.36,0.98,1.0,1.0,1.0,1.0,0.96,1.0
+NP_789794.1,1.0,1.0,1.0,0.857,0.929,1.0,1.0,1.0,1.0,0.429,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006381.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.889,1.0,1.0,0.944,1.0,0.889,0.889,0.889,0.889,0.889
+NP_009225.1,0.956,0.924,0.939,,0.947,0.957,0.538,,0.783,0.719,0.851,0.956,0.934,0.932,0.897,0.687,0.633
+NP_006388.2,0.92,0.92,0.96,0.92,0.92,1.0,0.84,1.0,0.84,0.56,0.74,0.88,0.84,0.96,0.96,0.92,0.96
+NP_001928.2,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000214.1,1.0,,1.0,1.0,,0.5,1.0,,0.5,,0.583,1.0,0.833,1.0,1.0,0.417,0.417
+NP_000154.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.889,0.611,1.0,0.889,1.0,0.889,1.0,1.0,0.917,0.944
+NP_000269.3,1.0,1.0,1.0,,1.0,0.917,,,0.0,1.0,1.0,1.0,1.0,,,0.833,0.833
+NP_001104505.1,1.0,,1.0,,,0.972,0.972,,0.833,,0.792,1.0,1.0,1.0,1.0,0.917,0.917
+NP_000801.1,0.812,0.75,0.875,,0.875,0.812,0.875,0.812,0.812,1.0,0.75,0.812,0.812,0.812,0.812,0.844,0.906
+NP_005350.1,0.971,1.0,0.943,,1.0,0.971,0.981,,0.819,1.0,0.452,0.981,0.981,1.0,1.0,1.0,1.0
+NP_001008537.1,0.878,0.841,0.939,0.598,0.927,0.854,0.817,0.939,0.707,0.616,0.695,0.963,0.939,0.805,0.854,0.866,0.89
+NP_000108.1,1.0,0.909,1.0,0.545,1.0,0.818,0.545,0.909,0.636,0.773,0.136,1.0,1.0,0.636,0.727,1.0,0.909
+NP_689956.2,1.0,0.847,1.0,0.931,0.931,0.972,0.931,0.944,0.903,0.41,0.91,1.0,0.931,0.931,0.931,0.854,0.792
+NP_061155.2,0.978,1.0,1.0,0.609,0.761,0.739,0.804,0.826,0.717,0.63,0.663,0.935,0.674,0.674,0.609,0.609,0.587
+NP_116245.2,1.0,1.0,1.0,0.917,1.0,1.0,0.958,1.0,1.0,0.542,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001123617.1,1.0,0.6,1.0,,0.6,0.6,,,0.4,0.0,0.7,1.0,0.6,,,,
+NP_004278.2,1.0,,1.0,,,1.0,,,1.0,,0.833,1.0,1.0,,,,
+NP_055059.2,1.0,1.0,1.0,1.0,1.0,0.933,0.933,1.0,0.733,0.933,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000043.4,0.982,0.969,0.986,0.963,0.975,0.954,0.95,0.88,0.839,0.73,0.711,0.978,0.96,0.973,0.965,0.876,0.927
+NP_000344.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055527.1,0.909,0.352,1.0,0.568,0.795,0.932,0.886,0.795,1.0,0.159,0.864,0.818,0.432,0.864,0.909,0.886,0.932
+NP_060663.2,1.0,1.0,1.0,,1.0,0.947,,,0.868,0.947,0.868,1.0,1.0,,,0.947,0.987
+NP_078800.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.333,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005700.2,1.0,1.0,1.0,,1.0,1.0,0.875,,1.0,0.375,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065750.1,0.905,0.714,0.952,0.857,0.714,0.81,0.714,0.857,0.524,0.619,0.667,0.857,0.762,0.667,0.667,0.714,0.81
+NP_065770.1,0.919,0.743,1.0,0.946,0.703,0.959,,0.676,0.892,,0.973,1.0,1.0,,,0.973,1.0
+NP_000814.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.969,0.109,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065134.1,1.0,0.8,1.0,0.75,0.95,0.85,0.85,0.75,0.8,,0.8,0.95,0.9,0.9,0.9,0.75,0.675
+NP_001020279.1,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_068838.3,1.0,0.8,1.0,,1.0,1.0,0.6,1.0,1.0,1.0,0.9,1.0,0.8,0.8,1.0,0.9,0.9
+NP_065970.2,1.0,1.0,1.0,0.977,0.966,1.0,1.0,1.0,0.716,0.864,0.864,1.0,1.0,1.0,1.0,0.977,1.0
+NP_000450.3,1.0,1.0,1.0,0.906,0.989,0.983,0.968,0.994,0.879,0.935,0.939,1.0,1.0,0.994,1.0,0.956,1.0
+NP_065712.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005996.2,0.994,0.954,0.998,,0.961,0.939,0.923,,0.939,0.713,0.905,0.988,0.942,0.913,0.92,0.973,0.981
+NP_001186036.1,1.0,,1.0,1.0,0.95,0.952,1.0,,0.933,0.692,0.793,1.0,0.923,1.0,0.833,0.938,0.942
+NP_079029.3,1.0,0.97,1.0,0.97,1.0,0.97,1.0,1.0,0.909,0.485,0.848,1.0,1.0,0.97,0.97,0.985,0.909
+NP_006054.2,1.0,1.0,1.0,0.9,0.9,1.0,1.0,0.9,0.9,0.9,0.6,1.0,1.0,0.9,1.0,0.95,1.0
+NP_002233.2,1.0,0.5,1.0,1.0,0.667,1.0,0.667,0.833,1.0,0.333,0.667,1.0,0.667,0.667,0.667,1.0,1.0
+NP_004076.1,1.0,,1.0,,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000334.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.604,1.0,1.0,1.0,1.0,0.958,1.0,1.0
+NP_001596.2,1.0,0.931,1.0,0.969,0.962,0.815,0.931,0.9,0.638,0.8,0.654,1.0,1.0,0.846,0.854,0.946,0.919
+NP_001271165.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001020466.1,1.0,0.977,1.0,0.977,0.864,1.0,1.0,0.909,0.795,0.955,1.0,0.977,0.932,0.977,1.0,0.966,0.955
+NP_004238.3,0.994,0.931,1.0,1.0,0.956,1.0,0.944,0.961,0.833,0.55,0.625,0.967,0.928,0.994,0.972,0.944,0.939
+NP_055484.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_620409.1,0.778,0.667,0.778,0.222,0.333,0.222,0.333,0.333,0.333,0.667,0.667,0.778,0.667,0.444,0.111,0.444,0.444
+NP_796374.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001001557.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.867,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002240.3,1.0,1.0,1.0,,1.0,1.0,,0.7,1.0,0.5,1.0,1.0,1.0,0.9,0.8,,
+NP_003147.2,1.0,1.0,1.0,,1.0,0.944,1.0,0.889,0.952,0.937,0.611,1.0,0.968,0.968,0.952,0.889,0.921
+NP_000336.1,1.0,0.875,1.0,,1.0,0.8,,,0.8,0.75,0.7,1.0,1.0,,,0.4,0.45
+NP_055543.2,1.0,0.867,1.0,0.933,1.0,0.933,0.8,0.733,0.8,0.467,1.0,1.0,1.0,1.0,1.0,0.667,0.8
+NP_758957.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003009.2,1.0,0.972,1.0,1.0,0.972,0.889,0.944,0.875,0.694,0.667,0.667,0.972,0.944,,,,
+NP_000084.3,0.972,0.913,0.965,0.953,0.929,0.912,0.85,0.604,0.56,0.727,0.802,0.961,0.931,0.952,0.96,0.762,0.853
+NP_060552.4,0.854,0.823,0.958,0.708,0.729,0.854,0.656,0.865,0.75,0.74,0.667,0.792,0.74,0.865,0.865,0.927,0.875
+NP_002878.2,1.0,0.964,0.929,0.75,0.893,0.929,0.786,0.75,0.893,0.571,0.732,0.821,0.714,0.929,0.929,0.714,0.75
+NP_008927.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000485.3,0.967,0.973,0.933,0.722,0.84,0.81,0.5,0.778,0.853,0.903,0.807,0.947,0.96,0.833,0.778,0.864,0.848
+NP_001073883.2,1.0,0.89,1.0,0.854,0.793,0.915,0.841,0.878,0.817,0.78,0.811,1.0,0.829,0.927,0.976,0.768,0.841
+NP_006774.2,1.0,1.0,1.0,,0.958,0.938,1.0,,0.958,0.958,0.958,1.0,0.958,1.0,1.0,0.896,0.875
+NP_003995.2,0.973,0.964,0.981,0.944,0.974,0.863,0.916,0.963,0.783,0.729,0.91,0.97,0.952,0.934,0.907,0.883,0.911
+NP_076872.1,1.0,0.75,0.972,0.639,0.75,0.583,0.639,0.611,0.472,0.444,0.778,1.0,0.861,0.806,0.694,0.556,0.528
+NP_001129668.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055891.1,1.0,0.5,1.0,0.2,0.8,0.8,0.8,1.0,1.0,0.4,0.5,0.4,0.4,1.0,0.8,1.0,1.0
+NP_001251.1,0.833,1.0,1.0,0.667,0.667,0.083,0.667,0.5,0.0,0.667,0.5,1.0,1.0,0.667,1.0,0.25,0.583
+NP_115977.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112210.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.0,1.0,1.0,0.0,0.0,1.0,1.0
+NP_004950.2,1.0,,1.0,,0.848,0.97,,,0.606,,0.985,1.0,0.97,,,,
+NP_003027.1,0.995,0.992,0.993,0.986,0.99,0.945,1.0,0.904,0.813,0.794,0.945,0.99,0.978,1.0,1.0,0.903,0.914
+NP_002326.2,0.996,0.97,0.995,0.968,0.954,0.9,0.952,0.964,0.864,0.729,0.83,0.994,0.929,0.954,0.949,0.926,0.929
+NP_079130.2,1.0,1.0,1.0,0.9,1.0,1.0,0.8,1.0,0.9,,0.7,1.0,0.9,1.0,1.0,1.0,1.0
+NP_077740.1,1.0,0.923,1.0,1.0,1.0,0.923,0.769,1.0,1.0,0.154,0.923,1.0,1.0,1.0,1.0,0.923,0.923
+NP_004603.1,1.0,1.0,1.0,,1.0,1.0,0.643,1.0,0.902,0.789,0.714,1.0,1.0,1.0,1.0,0.985,1.0
+NP_001098101.1,0.977,0.943,1.0,0.92,0.955,0.915,0.955,0.977,0.864,0.932,0.892,0.966,0.966,0.932,0.966,0.727,0.795
+NP_003638.1,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006252.4,1.0,1.0,1.0,0.964,1.0,0.964,1.0,0.964,0.929,1.0,1.0,1.0,1.0,1.0,1.0,0.821,0.875
+NP_000435.3,0.986,0.975,0.994,0.977,0.964,0.941,0.971,0.95,0.683,0.864,0.81,0.988,0.979,0.98,0.979,0.895,0.936
+NP_001094896.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0
+NP_001104595.1,0.967,0.946,0.98,0.964,1.0,0.928,,0.714,0.76,0.679,0.834,0.955,0.935,1.0,1.0,,
+NP_001159725.2,,,,,,,,,,,,,,,,,
+NP_036192.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.417,1.0,0.75,0.75,,,,
+NP_004521.1,1.0,1.0,1.0,0.833,0.833,1.0,0.833,1.0,1.0,0.667,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060021.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001017535.1,1.0,1.0,1.0,,0.781,1.0,1.0,,0.938,0.625,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004269.1,1.0,,1.0,,1.0,1.0,1.0,,1.0,1.0,0.625,1.0,1.0,,,,
+NP_000365.3,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,0.938,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006693.3,0.991,0.973,0.991,,0.92,0.835,,,0.902,0.946,0.866,1.0,0.946,,,0.929,0.946
+NP_000159.3,0.953,0.921,0.977,0.973,0.948,0.938,0.905,0.922,0.75,0.822,0.816,0.972,0.938,0.953,0.978,0.967,0.975
+NP_003796.1,0.6,1.0,0.6,,0.6,1.0,1.0,,1.0,1.0,0.9,1.0,0.8,0.5,0.5,0.5,0.5
+NP_870998.2,1.0,0.667,1.0,0.3,0.738,0.143,0.6,0.45,0.571,0.714,0.119,0.952,0.238,0.15,0.2,0.571,0.619
+NP_002009.1,1.0,0.5,1.0,1.0,0.25,1.0,0.75,1.0,1.0,0.5,0.625,0.25,0.5,1.0,1.0,1.0,1.0
+NP_060352.1,0.982,1.0,0.982,0.975,0.945,1.0,1.0,0.818,0.673,0.873,0.745,1.0,0.927,0.975,1.0,0.912,0.925
+NP_000254.2,0.997,0.97,0.968,0.904,0.97,0.972,0.974,0.943,0.917,0.623,0.937,0.975,0.971,0.978,0.98,0.942,0.956
+NP_000444.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001243328.1,1.0,1.0,1.0,,1.0,0.5,0.5,1.0,0.5,0.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5
+NP_066005.2,1.0,0.5,0.95,,0.65,0.417,0.69,0.154,0.4,0.466,0.483,1.0,0.517,,,,
+NP_056093.3,1.0,0.946,1.0,0.839,0.982,0.946,1.0,0.929,0.696,0.875,0.929,1.0,0.946,1.0,0.982,0.857,0.955
+NP_001308031.1,1.0,1.0,1.0,,1.0,0.727,1.0,,0.909,0.909,0.864,1.0,1.0,0.727,0.636,0.955,1.0
+NP_071505.2,0.667,0.667,0.667,0.333,0.333,0.333,0.333,0.333,0.333,0.333,0.0,0.667,1.0,0.333,0.333,0.333,0.333
+NP_001548.1,1.0,0.889,1.0,1.0,0.6,0.889,0.8,1.0,0.889,0.444,1.0,1.0,0.778,0.8,0.8,1.0,1.0
+NP_004990.3,1.0,0.817,1.0,,0.767,0.9,0.717,,0.45,0.542,0.75,0.95,0.817,,,,
+NP_001124162.1,1.0,1.0,1.0,0.812,,1.0,,,1.0,,0.688,1.0,0.75,,,,
+NP_443105.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,0.917,1.0,1.0,,,1.0,1.0
+NP_000402.3,1.0,,0.995,,1.0,1.0,0.955,,0.914,,0.995,1.0,1.0,0.995,1.0,0.995,0.995
+NP_055692.3,1.0,1.0,1.0,,1.0,0.5,0.5,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066288.2,0.993,0.985,0.954,0.988,0.986,0.873,0.998,0.917,0.829,0.889,0.875,0.991,0.986,0.965,0.975,0.981,0.979
+NP_001137451.1,1.0,1.0,0.5,0.5,1.0,1.0,,1.0,0.5,,0.75,1.0,1.0,0.5,1.0,,
+NP_004091.3,1.0,1.0,1.0,,0.833,1.0,0.5,0.833,1.0,1.0,0.583,1.0,0.667,0.667,0.5,1.0,1.0
+NP_001010892.1,1.0,1.0,1.0,,0.944,1.0,,,0.778,0.778,0.667,1.0,1.0,,,0.944,0.944
+NP_067045.1,1.0,0.867,1.0,0.967,0.967,0.988,0.867,0.933,0.929,0.533,0.857,0.929,0.857,0.833,0.833,0.883,0.833
+NP_079017.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003672.1,1.0,1.0,1.0,,1.0,0.0,,,0.5,,0.5,1.0,0.5,,,0.5,1.0
+NP_113631.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001605.1,0.929,,0.905,,0.952,0.935,0.905,,0.702,0.893,0.869,0.905,0.905,0.893,0.964,0.661,0.548
+NP_064543.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.625,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000076.2,1.0,1.0,1.0,0.975,1.0,0.996,1.0,0.748,1.0,0.983,0.956,1.0,1.0,0.975,0.983,0.945,0.966
+NP_000142.2,0.987,1.0,0.987,1.0,1.0,0.95,1.0,0.857,0.733,0.807,0.45,1.0,0.973,0.99,0.99,0.714,0.838
+NP_996994.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.864,0.045,0.909,1.0,1.0,,,0.955,1.0
+NP_003032.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.333,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_640334.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.833,0.0,0.917,1.0,1.0,1.0,1.0,0.917,1.0
+NP_000467.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079120.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079058.1,1.0,1.0,1.0,,1.0,1.0,,,0.333,1.0,1.0,1.0,1.0,,,,
+NP_659451.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.625,1.0,1.0,1.0,1.0,0.875,1.0
+NP_000333.1,1.0,0.993,0.99,0.983,1.0,0.918,0.957,0.94,0.813,0.863,0.913,1.0,1.0,0.97,0.963,0.893,0.945
+NP_006874.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.778,0.667,1.0,0.611,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002015.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.786,1.0,0.929,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071378.1,1.0,0.871,1.0,0.848,0.97,0.924,0.939,0.894,0.909,0.894,0.864,1.0,0.955,0.879,0.864,0.848,0.894
+NP_005655.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060956.1,1.0,,0.997,,,1.0,,,0.904,,0.625,1.0,1.0,,,,
+NP_000537.3,0.989,0.984,0.983,0.981,0.98,0.944,0.935,0.433,0.824,0.937,0.787,0.991,0.99,0.931,0.954,0.934,0.961
+NP_001139189.2,1.0,0.981,1.0,,0.981,0.963,0.9,0.933,0.898,0.954,0.926,1.0,0.972,1.0,0.983,0.9,0.917
+NP_443197.1,1.0,,1.0,,,1.0,0.5,,1.0,,1.0,1.0,1.0,1.0,0.5,1.0,1.0
+NP_001094385.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_116119.2,1.0,0.857,1.0,0.714,0.857,0.786,0.667,0.905,0.952,0.762,0.524,0.905,0.714,0.619,0.619,0.81,0.738
+NP_002040.1,0.985,0.932,0.977,0.947,0.925,0.97,0.977,0.872,0.759,0.955,0.842,0.985,0.955,0.97,0.962,0.883,0.902
+NP_065098.1,1.0,,1.0,,1.0,1.0,,,0.889,,0.778,1.0,0.833,,,1.0,0.917
+NP_579877.1,0.989,0.931,0.954,,0.989,0.977,0.977,,0.885,0.213,0.92,0.954,0.793,0.989,0.977,0.931,0.954
+NP_001005741.1,0.993,0.988,0.993,,1.0,0.998,0.98,,0.94,0.701,1.0,1.0,0.995,0.99,0.988,0.984,0.99
+NP_573568.1,1.0,1.0,1.0,,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001640.1,0.286,1.0,0.571,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001521.1,1.0,0.714,0.929,,0.714,0.643,0.786,,0.429,0.286,0.643,0.929,0.714,0.786,0.714,0.857,0.857
+NP_036602.1,0.333,0.0,0.333,0.0,0.0,1.0,0.667,0.667,1.0,0.0,0.0,0.0,0.0,0.0,0.333,0.167,0.167
+NP_006556.1,0.972,0.917,1.0,,0.882,0.84,1.0,0.986,0.694,0.736,0.667,0.854,0.847,1.0,1.0,0.91,0.913
+NP_065690.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.333,1.0,1.0,1.0,1.0,1.0,,
+NP_653178.3,1.0,0.093,1.0,0.37,0.704,0.87,0.444,0.556,0.556,0.315,0.778,0.815,0.815,0.333,0.556,0.519,0.481
+NP_006339.4,,,,,,,,,,,,,,,,,
+NP_066550.2,0.966,0.969,0.955,0.978,0.976,0.888,0.974,0.935,0.854,0.802,0.938,0.95,0.938,0.989,0.974,0.764,0.83
+NP_001600.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001273540.1,1.0,0.625,0.875,1.0,0.625,1.0,0.875,,0.25,0.625,0.375,1.0,0.75,1.0,0.875,0.75,0.875
+NP_001029.1,1.0,1.0,1.0,,1.0,0.889,1.0,,1.0,1.0,0.833,1.0,1.0,0.944,0.944,0.75,0.889
+NP_000874.2,1.0,0.889,1.0,0.75,1.0,0.778,,0.833,0.667,0.722,0.722,0.944,0.556,,,,
+NP_057075.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000742.1,1.0,1.0,1.0,1.0,1.0,0.967,0.889,0.833,0.533,0.933,0.6,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001073911.1,1.0,0.933,1.0,1.0,1.0,1.0,1.0,,0.933,0.4,0.967,1.0,0.933,0.933,0.933,0.967,1.0
+NP_542400.2,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,0.917,1.0,1.0,0.833,0.833,1.0,1.0
+NP_071890.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002871.1,0.959,0.996,0.994,,0.907,0.899,0.962,0.864,0.569,0.789,0.569,0.986,0.937,0.99,0.99,0.921,0.964
+NP_000292.1,0.769,0.538,0.462,0.0,0.154,0.462,0.0,0.636,0.0,0.0,0.692,0.846,0.385,0.455,0.455,0.091,0.0
+NP_055826.1,1.0,1.0,1.0,,1.0,0.875,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.969,1.0
+NP_002573.1,1.0,0.75,1.0,0.75,1.0,1.0,0.75,0.5,1.0,0.5,0.875,1.0,0.75,0.75,0.75,0.75,0.5
+NP_922932.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_075462.3,1.0,1.0,1.0,,1.0,0.833,1.0,,0.875,0.875,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004264.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.926,0.63,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001019242.1,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,0.947,1.0
+NP_004804.2,1.0,1.0,1.0,1.0,1.0,0.875,,1.0,1.0,1.0,0.875,1.0,1.0,,,1.0,1.0
+NP_005506.3,1.0,1.0,1.0,1.0,1.0,0.944,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003029.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057240.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.857,0.571,0.571,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112533.1,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,0.5,0.5
+NP_037471.2,1.0,1.0,1.0,0.917,0.958,1.0,0.952,0.792,1.0,0.917,0.708,1.0,0.958,0.917,1.0,,
+NP_114413.1,1.0,0.8,1.0,,1.0,0.2,0.8,0.8,0.6,0.8,0.6,1.0,0.6,0.8,0.6,0.8,0.8
+NP_036356.1,0.971,0.926,1.0,,0.868,0.971,0.838,,0.779,0.059,0.919,0.985,0.956,0.971,0.971,0.919,0.956
+NP_004439.2,1.0,,1.0,,1.0,0.763,,,0.474,0.053,0.395,1.0,0.737,,,,
+NP_835366.1,1.0,,1.0,,,0.173,,,0.0,,0.962,1.0,0.987,0.974,0.974,,
+NP_000373.1,1.0,1.0,1.0,,1.0,0.988,1.0,,0.669,0.8,0.922,1.0,0.981,1.0,1.0,1.0,1.0
+NP_000735.1,0.955,0.97,0.996,0.996,0.933,0.929,0.951,0.922,0.836,0.472,0.884,0.937,0.914,0.974,0.978,0.959,0.97
+NP_001750.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,0.944,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004422.2,1.0,1.0,1.0,1.0,1.0,0.636,1.0,0.864,0.773,0.909,0.886,1.0,1.0,1.0,0.955,0.955,1.0
+NP_068603.4,1.0,1.0,1.0,,1.0,0.929,,,0.786,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_004435.3,0.989,0.961,1.0,,0.967,0.933,,,0.713,0.738,0.914,0.988,0.954,,,0.842,0.92
+NP_002290.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000451.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,0.833,0.833,0.667
+NP_003712.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002283.3,1.0,1.0,1.0,,1.0,0.923,1.0,1.0,0.75,1.0,0.885,1.0,1.0,1.0,1.0,0.846,0.875
+NP_001191197.1,1.0,,1.0,0.667,1.0,0.667,,,1.0,,0.667,1.0,1.0,,,,
+NP_006403.2,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,0.667,1.0,1.0,1.0,0.833,1.0,1.0,0.667,0.667
+NP_742055.1,0.986,1.0,0.986,,0.972,0.896,0.944,,0.597,0.681,0.653,0.972,0.958,0.931,0.917,0.806,0.806
+NP_001305092.1,,,,,,,,,,,,,,,,,
+NP_001672.1,0.985,0.985,1.0,,0.956,0.941,,,0.882,0.669,0.721,1.0,0.956,,,1.0,0.985
+NP_003292.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5,1.0,1.0,1.0,1.0,1.0,0.5,0.5
+NP_000261.2,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,0.958,0.583,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689862.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_619542.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000231.1,1.0,1.0,1.0,0.917,0.917,1.0,0.917,1.0,1.0,1.0,0.833,1.0,1.0,0.833,1.0,1.0,1.0
+NP_899196.1,1.0,0.917,1.0,0.983,1.0,1.0,1.0,0.817,0.817,0.6,0.9,0.967,0.95,1.0,1.0,0.875,0.933
+NP_073624.2,1.0,0.974,1.0,1.0,0.949,0.949,0.974,0.786,1.0,0.436,0.923,1.0,0.974,1.0,1.0,1.0,1.0
+NP_001340274.1,1.0,,1.0,0.989,0.989,0.993,0.952,,0.8,,0.981,0.981,0.941,1.0,0.944,1.0,1.0
+NP_005206.2,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,0.533,1.0,1.0,1.0,1.0,1.0,1.0,0.933,1.0
+NP_061905.2,1.0,1.0,1.0,,1.0,0.971,1.0,1.0,0.962,1.0,0.962,1.0,0.981,0.962,0.962,0.865,0.856
+NP_000068.1,1.0,0.961,1.0,,1.0,1.0,0.984,,0.924,0.508,0.818,1.0,0.947,1.0,0.992,0.977,0.93
+NP_006185.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.9,1.0,0.95,1.0,1.0,1.0,1.0
+NP_057195.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.577,0.962,1.0,1.0,1.0,0.981,0.942,0.952
+NP_057006.1,0.988,0.94,1.0,,1.0,0.958,0.988,,0.905,0.857,0.929,1.0,0.976,0.988,0.976,0.875,0.917
+NP_055577.1,1.0,,1.0,,,0.6,0.2,,0.8,,0.5,1.0,1.0,1.0,1.0,0.3,0.4
+NP_997229.2,1.0,0.895,1.0,0.916,0.958,0.947,0.768,0.763,0.768,0.789,0.937,1.0,0.884,0.874,0.884,0.868,0.9
+NP_115980.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060521.4,1.0,0.96,1.0,1.0,1.0,0.86,0.96,0.44,0.96,0.92,0.98,1.0,0.92,0.96,0.92,0.98,1.0
+NP_996917.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006362.1,1.0,0.833,1.0,0.833,0.833,0.833,0.833,0.833,0.667,1.0,0.75,1.0,0.833,1.0,0.833,0.667,0.5
+NP_004178.2,0.991,0.945,0.997,0.941,0.98,0.943,0.961,0.964,0.872,0.557,0.889,0.972,0.954,0.988,0.986,0.947,0.955
+NP_116222.4,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,,
+NP_003026.2,1.0,0.944,1.0,,1.0,1.0,0.944,,1.0,1.0,1.0,0.889,1.0,1.0,1.0,1.0,1.0
+NP_056321.2,1.0,0.964,1.0,0.957,0.986,0.819,0.949,0.888,0.826,0.906,0.833,1.0,0.971,0.986,0.964,0.797,0.895
+NP_001258750.1,1.0,1.0,1.0,,1.0,1.0,,,0.889,0.722,0.944,1.0,1.0,,,0.889,0.917
+NP_005526.1,1.0,,1.0,,0.976,0.899,0.905,,0.869,0.637,0.631,1.0,0.988,0.75,0.774,0.935,0.929
+NP_002380.3,1.0,0.875,1.0,,1.0,0.875,1.0,,0.875,0.625,1.0,1.0,0.875,0.875,0.875,0.875,0.75
+NP_000267.2,0.989,0.986,0.998,0.995,0.975,0.971,0.959,0.98,0.891,0.696,0.972,0.998,0.991,0.991,0.986,0.982,0.982
+NP_004817.2,1.0,0.939,1.0,0.909,0.977,0.939,0.955,0.788,0.879,0.462,0.833,1.0,0.97,0.864,0.909,0.962,0.955
+NP_001092690.1,0.75,0.25,1.0,0.25,0.5,0.25,0.5,0.0,0.75,0.25,0.5,0.75,0.0,0.75,0.75,0.75,0.75
+NP_001037.1,0.923,0.769,1.0,0.308,0.615,0.885,0.538,0.308,0.538,0.269,0.846,1.0,0.923,0.538,0.615,1.0,1.0
+NP_006507.2,0.983,0.955,0.992,0.933,0.975,0.938,0.961,0.821,0.864,0.823,0.947,0.959,0.945,0.969,0.961,0.927,0.934
+NP_003928.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.625,0.875,1.0,1.0,1.0,1.0,1.0,0.875,1.0
+NP_000060.2,0.948,0.934,0.977,0.98,0.911,0.786,0.872,0.793,0.706,0.859,0.761,0.975,0.928,0.939,0.959,0.703,0.707
+NP_005207.3,1.0,,1.0,,1.0,0.75,0.75,,0.25,,0.75,1.0,1.0,,,,
+NP_000070.1,1.0,1.0,1.0,,1.0,0.821,,,0.857,0.714,0.571,1.0,1.0,,,,
+NP_001845.3,0.966,0.974,0.964,0.973,0.952,0.797,0.862,0.553,0.7,0.958,0.688,0.994,0.975,0.958,0.951,0.845,0.88
+NP_001120680.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.96,1.0,0.96,1.0,1.0,1.0,1.0,0.98,1.0
+NP_955366.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001073918.2,1.0,0.872,1.0,1.0,0.949,0.872,0.795,,0.949,1.0,0.692,1.0,1.0,0.8,0.8,0.795,0.744
+NP_006205.1,1.0,1.0,1.0,1.0,0.958,1.0,1.0,1.0,0.833,0.917,0.875,1.0,1.0,1.0,1.0,0.875,0.875
+NP_079455.3,1.0,0.783,1.0,0.826,0.957,0.826,0.609,0.391,0.87,0.087,0.87,1.0,0.87,0.826,0.826,0.913,0.87
+NP_112190.2,1.0,1.0,1.0,0.895,1.0,0.947,1.0,0.895,0.895,0.961,0.921,1.0,1.0,0.947,0.947,0.947,0.947
+NP_000074.3,1.0,0.993,1.0,0.973,0.98,0.974,0.951,0.951,0.921,0.98,0.986,1.0,0.988,0.976,0.98,0.949,0.964
+NP_001159435.1,0.988,0.959,0.992,,0.967,0.977,0.933,,0.795,0.819,0.923,0.981,0.973,0.956,0.969,0.681,0.777
+NP_619729.1,1.0,0.967,1.0,,0.8,1.0,,,0.967,0.667,1.0,0.967,1.0,,,1.0,1.0
+NP_001308943.1,0.996,0.978,0.993,0.996,0.97,0.956,0.974,0.904,0.944,0.83,0.815,0.989,0.978,0.981,0.978,0.98,0.974
+NP_004827.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.818,0.909,0.091,0.818,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065822.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_077315.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5
+NP_005441.1,1.0,0.833,1.0,0.889,0.833,0.833,0.833,0.722,0.778,0.722,0.806,0.944,0.778,0.778,0.667,0.833,0.806
+NP_542411.2,0.919,0.87,0.939,1.0,0.933,0.751,0.828,0.833,0.604,0.822,0.743,0.902,0.865,,,,
+NP_006199.2,1.0,0.982,1.0,,0.964,1.0,1.0,0.982,0.821,0.929,1.0,1.0,0.964,0.982,0.982,0.946,0.964
+NP_071801.1,1.0,1.0,1.0,,1.0,1.0,,,0.625,0.875,1.0,1.0,1.0,,,0.938,1.0
+NP_004251.4,0.973,,,0.822,0.822,0.798,0.809,,0.516,,,0.926,0.841,0.837,0.74,0.756,0.752
+NP_003218.2,1.0,1.0,1.0,,0.889,0.792,0.889,,0.694,0.819,0.722,1.0,1.0,0.944,0.944,0.944,1.0
+NP_055440.1,1.0,0.938,1.0,0.969,0.875,0.906,0.844,0.859,0.797,0.25,0.867,1.0,0.938,0.922,0.859,0.984,0.984
+NP_000430.3,1.0,1.0,1.0,1.0,1.0,1.0,0.933,1.0,0.8,0.867,1.0,1.0,1.0,0.867,0.867,0.933,0.933
+NP_036337.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002046.1,0.998,,1.0,,,0.853,,,0.707,,0.7,1.0,0.945,,,,
+NP_056080.1,0.935,0.918,0.928,0.847,0.946,0.93,0.85,0.896,0.805,0.747,0.874,0.919,0.914,0.915,0.91,0.873,0.895
+NP_000187.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,0.25,0.812,1.0,1.0,1.0,1.0,1.0,0.938,0.938
+NP_004355.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003046.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,0.812,0.688,1.0,1.0,1.0,1.0,1.0,1.0
+NP_599023.1,1.0,1.0,1.0,,0.667,1.0,0.667,,0.333,0.333,0.5,1.0,1.0,1.0,1.0,,
+NP_001309863.1,1.0,1.0,1.0,,0.833,1.0,1.0,,0.714,0.571,1.0,1.0,0.857,,,,
+NP_001162625.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001295205.1,1.0,,1.0,,1.0,1.0,,,1.0,,0.667,1.0,1.0,,,1.0,1.0
+NP_115569.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005227.1,1.0,0.889,1.0,1.0,0.976,0.786,0.833,0.889,0.762,0.714,0.81,1.0,1.0,0.889,0.833,0.917,1.0
+NP_002725.1,1.0,1.0,0.75,,1.0,0.125,0.75,,0.0,0.5,0.5,0.75,0.75,0.5,0.5,0.125,0.25
+NP_000228.1,1.0,1.0,0.995,0.989,1.0,0.975,0.98,0.967,0.882,0.922,0.975,1.0,1.0,0.989,0.989,0.956,0.956
+NP_001123292.1,1.0,1.0,1.0,1.0,1.0,1.0,0.7,,1.0,0.8,1.0,1.0,1.0,1.0,0.9,1.0,1.0
+NP_001171809.1,0.994,0.988,0.99,,0.98,0.874,,,0.74,0.813,0.841,0.963,0.939,,,,
+NP_066956.1,1.0,0.9,0.9,0.9,1.0,0.9,0.8,0.9,0.9,0.5,0.8,1.0,1.0,0.9,0.9,0.9,0.85
+NP_056374.2,0.964,,0.989,0.962,,0.801,0.974,,0.704,,0.93,0.983,0.963,0.975,0.979,0.89,0.93
+NP_055051.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612396.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005135.2,1.0,1.0,1.0,1.0,1.0,0.953,1.0,0.922,0.703,1.0,0.984,1.0,0.953,1.0,1.0,0.836,0.781
+NP_002131.2,0.917,0.917,0.917,1.0,0.917,1.0,,,0.583,0.333,0.917,0.75,0.75,,,0.818,0.818
+NP_057732.2,1.0,1.0,1.0,,1.0,0.9,,,0.9,1.0,0.9,1.0,1.0,1.0,1.0,0.95,1.0
+NP_000892.2,1.0,1.0,1.0,,1.0,1.0,1.0,,0.771,1.0,0.906,1.0,1.0,1.0,1.0,,
+NP_000372.1,0.914,0.829,0.886,0.743,0.829,0.714,0.686,0.714,0.657,0.657,0.643,0.971,0.829,0.829,0.771,0.714,0.757
+NP_005563.1,1.0,0.314,1.0,,0.962,0.247,,,0.647,0.596,0.417,0.436,0.436,,,0.202,0.337
+NP_055064.1,1.0,,0.993,0.924,0.958,0.826,0.924,,0.868,,,0.979,0.944,0.833,0.847,0.809,0.875
+NP_001121699.1,1.0,0.973,1.0,0.865,0.932,0.919,,0.919,0.892,0.709,0.709,1.0,0.973,,,0.932,0.926
+NP_116027.2,1.0,1.0,0.996,1.0,0.986,0.998,0.996,0.976,0.797,0.997,0.917,1.0,0.998,1.0,0.996,0.992,0.993
+NP_056989.2,1.0,0.8,0.8,0.6,1.0,1.0,0.8,0.8,1.0,0.8,0.9,1.0,1.0,0.6,0.8,1.0,1.0
+NP_055840.2,0.923,,0.953,,,0.883,,,0.847,,0.847,0.957,0.92,,,,
+NP_001501.2,1.0,1.0,1.0,1.0,1.0,0.778,0.889,0.778,0.889,0.611,0.75,1.0,1.0,0.889,0.944,1.0,1.0
+NP_112740.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.75,0.25,1.0,0.875,1.0,1.0,1.0,0.5,1.0,1.0
+NP_000035.2,0.953,0.939,0.943,,0.952,0.914,,,0.724,0.868,0.962,0.967,0.943,,,,
+NP_862823.1,1.0,0.857,1.0,0.857,1.0,0.857,0.857,0.857,0.857,0.857,0.929,1.0,0.857,0.857,0.857,0.857,0.857
+NP_001002295.1,1.0,1.0,1.0,0.939,0.97,0.985,,1.0,0.818,1.0,0.833,1.0,0.879,,,1.0,1.0
+NP_001120864.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.938,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000446.1,1.0,,1.0,,1.0,1.0,0.967,,0.992,,1.0,1.0,1.0,,,,
+NP_001136082.1,1.0,1.0,1.0,,1.0,0.5,0.5,,0.5,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689518.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,0.667,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005667.2,1.0,0.952,1.0,,1.0,0.952,1.0,0.857,0.762,0.81,0.762,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001036167.1,1.0,,1.0,,,1.0,0.917,,0.5,,0.833,1.0,1.0,0.667,0.667,1.0,1.0
+NP_036375.1,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_001323.1,1.0,0.8,1.0,0.8,0.933,0.867,0.8,0.2,0.733,1.0,0.6,1.0,1.0,0.8,1.0,0.8,0.867
+NP_066545.3,1.0,1.0,1.0,1.0,1.0,0.5,0.5,1.0,0.5,0.5,0.75,1.0,1.0,1.0,1.0,0.5,1.0
+NP_000956.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.75,1.0,1.0,1.0,,,,
+NP_001073279.2,0.998,0.998,0.999,,0.998,0.904,,,0.978,0.723,0.889,0.996,0.987,,,,
+NP_003350.1,1.0,1.0,1.0,0.333,0.0,0.0,0.667,0.333,0.0,0.5,0.333,1.0,0.667,0.667,0.333,0.0,0.0
+NP_001103279.2,1.0,0.857,1.0,0.429,0.857,1.0,,0.857,1.0,0.714,1.0,1.0,0.857,,,0.857,0.857
+NP_066552.2,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,0.833,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001006658.1,1.0,0.654,1.0,0.708,0.833,0.705,0.333,0.854,0.654,0.487,0.538,0.962,0.782,0.583,0.667,0.654,0.673
+NP_057485.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_001005463.1,0.5,0.857,0.911,0.625,0.518,0.929,,0.518,0.929,0.839,0.732,0.75,0.75,,,0.982,0.982
+NP_002421.3,1.0,1.0,1.0,0.833,0.833,0.667,0.833,,0.833,0.5,1.0,1.0,0.833,1.0,1.0,0.833,0.75
+NP_001010874.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.75,1.0,1.0,1.0,,,,
+NP_061971.3,0.955,0.833,0.977,0.864,0.826,0.879,0.856,0.826,0.765,0.712,0.864,0.924,0.841,0.848,0.856,0.773,0.814
+NP_077027.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0
+NP_060921.3,1.0,,1.0,,0.808,0.885,,,0.769,,0.885,1.0,0.923,,,,
+NP_003005.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060700.2,1.0,1.0,1.0,0.923,0.974,0.769,1.0,0.923,0.795,0.615,0.846,1.0,1.0,1.0,1.0,0.872,0.846
+NP_068758.3,1.0,,1.0,,,1.0,0.833,,0.5,1.0,1.0,1.0,1.0,,,,
+NP_000121.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.92,0.96,0.971,1.0,1.0,0.926,0.914,1.0,1.0
+NP_059129.3,1.0,,1.0,,,0.941,0.941,,0.529,,0.794,1.0,1.0,1.0,1.0,0.971,1.0
+NP_001156908.2,1.0,1.0,1.0,0.981,1.0,1.0,1.0,1.0,0.827,0.846,0.885,0.981,0.962,1.0,1.0,1.0,1.0
+NP_065109.1,1.0,1.0,0.975,1.0,1.0,1.0,1.0,1.0,0.95,0.875,1.0,1.0,1.0,1.0,1.0,0.925,0.9
+NP_569735.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.917,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004378.1,1.0,1.0,1.0,1.0,1.0,0.991,1.0,0.87,0.926,0.981,0.722,1.0,1.0,1.0,0.981,0.963,0.981
+NP_009186.1,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,0.5,0.5,1.0,1.0
+NP_065390.1,0.667,0.917,0.917,0.25,0.583,0.667,0.833,0.833,0.917,1.0,0.792,0.833,0.583,0.75,0.75,0.583,0.667
+NP_031400.2,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.5,1.0,0.75,0.75,1.0,0.75,0.75,0.75,1.0,1.0
+NP_852107.1,0.971,0.952,1.0,0.981,0.971,0.981,0.952,,0.75,0.8,0.942,0.913,0.865,0.942,0.933,0.904,0.952
+NP_000117.1,1.0,,1.0,1.0,1.0,1.0,1.0,0.833,0.667,0.833,0.667,1.0,1.0,0.833,1.0,0.75,0.667
+NP_004199.1,0.995,0.918,0.989,0.893,0.951,0.967,0.732,0.911,0.934,0.594,0.635,0.989,0.956,0.875,0.875,0.914,0.898
+NP_733764.1,1.0,0.5,1.0,,0.714,0.714,,,0.857,0.143,0.357,1.0,0.429,,,0.929,1.0
+NP_006068.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,0.75,1.0,1.0,,,,
+NP_149045.3,1.0,0.974,1.0,,0.931,0.819,0.974,,0.698,0.882,0.724,1.0,0.974,,,,
+NP_001243439.1,1.0,1.0,1.0,,1.0,1.0,0.5,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001340741.2,,,,,,,,,,,,,,,,,
+NP_005576.3,1.0,0.904,1.0,0.981,0.885,1.0,0.942,0.596,0.692,0.808,0.904,1.0,0.885,0.962,0.942,0.942,0.962
+NP_004046.2,1.0,0.848,1.0,0.909,0.833,0.864,0.773,0.879,0.848,0.583,0.636,1.0,0.879,0.894,0.924,0.841,0.818
+NP_000079.2,0.995,0.978,0.993,0.99,0.984,0.899,,0.697,0.705,0.912,0.837,0.994,0.987,,,,
+NP_055805.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000265.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_619520.1,0.975,0.975,1.0,0.975,0.8,0.85,0.95,0.95,0.85,0.825,1.0,1.0,0.975,0.925,0.925,1.0,1.0
+NP_001167621.1,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,0.875,0.833,0.917,1.0,1.0,1.0,1.0,0.979,0.958
+NP_068657.1,0.978,1.0,1.0,0.778,0.933,0.844,0.844,0.667,0.689,0.711,0.822,0.978,0.933,0.933,0.911,0.822,0.822
+NP_569733.2,1.0,0.867,1.0,,0.867,0.867,0.667,,0.733,0.667,0.8,0.867,0.733,1.0,0.867,0.7,0.733
+NP_001026895.2,1.0,1.0,1.0,,1.0,0.923,,,0.769,0.077,0.885,1.0,1.0,,,,
+NP_001171481.1,1.0,0.786,1.0,1.0,0.929,0.786,0.8,0.5,0.786,0.714,0.661,1.0,0.857,1.0,1.0,0.9,1.0
+NP_001231118.1,1.0,0.796,1.0,0.759,0.759,0.963,,0.796,0.444,0.5,0.657,0.981,0.759,,,,
+NP_001359162.1,1.0,,1.0,,,1.0,,1.0,0.8,,0.4,1.0,1.0,,,,
+NP_849164.2,1.0,1.0,1.0,0.963,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006256.1,1.0,0.929,1.0,0.929,1.0,1.0,1.0,1.0,0.69,0.929,0.857,1.0,0.976,1.0,1.0,0.952,0.952
+NP_036255.2,1.0,0.828,1.0,1.0,0.938,0.875,,1.0,0.844,0.905,0.766,0.969,0.844,,,0.828,0.828
+NP_001006659.1,0.975,0.9,0.975,1.0,0.9,0.85,0.8,1.0,0.85,0.4,0.975,0.925,0.9,0.8,0.8,0.85,0.85
+NP_000780.1,1.0,0.786,1.0,,0.933,0.633,0.633,,0.867,0.929,0.667,1.0,0.833,,,,
+NP_003276.3,1.0,0.929,1.0,1.0,0.929,0.964,0.964,0.857,0.964,0.732,0.893,1.0,0.964,0.893,0.893,1.0,1.0
+NP_680475.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_116277.2,1.0,1.0,1.0,1.0,1.0,0.956,0.879,0.568,0.771,1.0,0.896,1.0,1.0,1.0,1.0,0.887,0.932
+NP_004295.2,0.936,0.764,0.986,0.671,0.807,0.886,0.871,0.921,0.857,0.979,0.614,0.986,0.879,0.886,0.9,0.904,0.896
+NP_002151.2,0.875,0.312,0.839,0.583,0.205,0.321,0.625,0.125,0.571,0.732,0.821,0.804,0.304,0.333,0.542,0.458,0.333
+NP_001156685.1,0.975,0.933,0.987,0.924,0.902,0.865,0.99,0.914,0.559,0.71,0.794,0.931,0.88,0.96,0.99,0.823,0.846
+NP_000389.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.667,0.967,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006756.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_002327.2,1.0,1.0,1.0,1.0,0.941,0.882,1.0,0.941,0.941,0.735,0.676,1.0,0.971,1.0,1.0,0.882,0.941
+NP_000519.2,1.0,0.969,1.0,0.997,1.0,0.933,0.941,0.913,0.873,0.564,0.938,1.0,1.0,0.994,0.991,0.887,0.953
+NP_001025023.1,0.75,0.375,0.625,,0.0,0.125,,,0.0,0.375,0.0,0.5,0.125,,,0.0,0.0
+NP_001001547.1,0.5,1.0,1.0,,1.0,1.0,1.0,0.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008846.2,1.0,0.985,1.0,1.0,1.0,0.992,1.0,1.0,0.917,0.788,0.977,1.0,1.0,1.0,1.0,0.996,1.0
+NP_006487.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,0.5,1.0,0.5,1.0,0.833,1.0,1.0,1.0,1.0
+NP_002060.4,1.0,,1.0,,,1.0,1.0,,1.0,,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066268.1,0.961,0.977,0.968,,0.971,0.901,0.951,,0.627,0.922,0.586,0.924,0.917,0.985,0.985,0.948,0.947
+NP_001025054.1,1.0,1.0,1.0,1.0,0.957,0.913,0.565,0.957,0.826,0.957,0.957,1.0,1.0,0.957,0.957,0.935,1.0
+NP_003671.1,1.0,1.0,1.0,0.975,1.0,0.888,0.925,1.0,0.9,0.625,0.812,1.0,1.0,1.0,1.0,0.988,1.0
+NP_001354752.1,,,,,,,,,,,,,,,,,
+NP_005076.3,1.0,1.0,1.0,1.0,1.0,1.0,0.933,0.733,1.0,1.0,0.967,1.0,1.0,0.933,0.933,1.0,1.0
+NP_001142.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.833,1.0,1.0,0.917,1.0,1.0,1.0
+NP_001278585.2,1.0,1.0,1.0,,1.0,1.0,,1.0,0.938,0.781,0.953,1.0,1.0,,,,
+NP_689509.1,0.99,,0.975,,,0.891,0.915,,0.732,,0.929,0.968,0.961,0.895,0.906,0.883,0.885
+NP_056980.2,1.0,0.926,1.0,1.0,0.889,0.963,1.0,1.0,0.815,0.667,0.815,1.0,0.963,1.0,1.0,0.815,0.852
+NP_071881.1,1.0,1.0,1.0,1.0,1.0,0.8,0.95,1.0,0.85,0.5,0.8,0.95,0.95,1.0,1.0,1.0,1.0
+NP_001001344.1,1.0,0.972,1.0,,0.958,1.0,0.792,,0.931,0.542,0.778,1.0,0.958,0.944,0.944,0.903,0.958
+NP_001001331.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.909,0.864,1.0,1.0,1.0,1.0,1.0,1.0
+NP_998764.1,0.993,1.0,1.0,0.954,0.974,1.0,0.98,0.915,0.98,0.618,0.961,1.0,1.0,0.987,0.987,1.0,1.0
+NP_036211.2,1.0,,1.0,1.0,1.0,1.0,1.0,,0.917,,0.833,1.0,1.0,1.0,1.0,0.917,1.0
+NP_000248.2,0.884,0.863,0.987,0.804,0.79,0.855,0.822,0.868,0.701,0.728,0.811,0.936,0.835,0.915,0.903,0.752,0.768
+NP_055085.1,1.0,,0.862,,0.969,0.954,0.985,,0.969,0.792,0.862,1.0,0.985,0.938,0.969,0.938,0.969
+NP_054722.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.5,0.625,1.0,0.833,0.917,0.917,0.958,1.0
+NP_000188.1,1.0,1.0,1.0,0.944,0.972,0.972,0.972,0.972,1.0,0.833,0.944,1.0,1.0,0.889,0.889,1.0,1.0
+NP_000156.1,1.0,,1.0,1.0,,1.0,1.0,,0.826,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005258.2,1.0,0.981,1.0,0.965,0.913,0.984,0.971,,0.933,0.835,0.992,0.994,0.968,0.971,0.974,0.994,0.984
+NP_068773.2,1.0,1.0,1.0,1.0,1.0,1.0,0.98,1.0,0.96,0.92,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006822.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071445.1,0.983,0.944,0.94,0.913,0.976,0.864,0.884,0.857,0.766,0.778,0.793,0.983,0.937,0.901,0.894,0.716,0.743
+NP_775822.3,1.0,0.857,1.0,0.738,0.905,0.929,0.643,0.833,0.667,0.429,0.595,1.0,0.857,0.833,0.786,0.607,0.702
+NP_006195.3,0.959,0.714,0.98,0.857,0.857,0.939,0.673,0.888,0.837,0.49,0.827,0.939,0.898,0.918,0.898,0.806,0.786
+NP_077015.2,1.0,,1.0,,0.977,0.978,0.767,0.943,0.789,0.611,0.917,0.967,0.889,0.862,0.851,0.862,0.954
+NP_076927.1,1.0,1.0,1.0,,,1.0,1.0,,0.943,,1.0,0.979,0.984,1.0,1.0,0.984,0.984
+NP_031374.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_998818.1,1.0,,,1.0,1.0,0.952,1.0,,,0.81,,,,,,,
+NP_859065.2,1.0,0.857,1.0,,0.929,1.0,0.857,,1.0,0.786,0.857,1.0,0.929,0.929,0.929,1.0,1.0
+NP_036348.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056953.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.583,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_009185.2,1.0,1.0,1.0,1.0,0.95,0.65,0.9,1.0,0.85,0.7,0.9,1.0,0.85,1.0,1.0,1.0,1.0
+NP_000431.2,1.0,0.958,1.0,0.875,0.893,0.946,0.732,0.625,0.679,0.917,0.562,1.0,0.982,0.958,0.958,0.688,0.667
+NP_055824.1,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.5,1.0,0.5,1.0,1.0,1.0,1.0,1.0
+NP_006297.2,0.98,0.944,0.983,0.852,0.964,0.918,0.879,0.905,0.763,0.632,0.633,0.949,0.946,0.972,0.967,0.943,0.967
+NP_079085.2,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,0.9,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000960.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000341.2,0.984,0.962,0.983,0.922,0.979,0.937,0.944,0.937,0.904,0.652,0.946,0.986,0.973,0.971,0.972,0.917,0.932
+NP_644813.1,1.0,0.833,1.0,1.0,1.0,0.917,0.833,0.917,0.833,0.75,0.875,1.0,1.0,1.0,1.0,0.917,0.917
+NP_001035897.1,1.0,,1.0,,,0.917,1.0,,0.792,,0.833,1.0,1.0,1.0,1.0,0.917,0.917
+NP_078853.2,1.0,0.96,0.996,0.924,0.978,0.931,0.915,0.964,0.902,0.52,0.797,0.978,0.942,0.987,0.987,0.964,1.0
+NP_891988.1,1.0,1.0,1.0,1.0,1.0,0.905,,1.0,0.762,0.857,0.857,1.0,1.0,,,0.905,0.905
+NP_002241.1,1.0,,1.0,,,0.9,0.95,,0.95,,0.875,0.95,0.7,0.85,0.85,0.8,0.8
+NP_001186846.1,1.0,1.0,1.0,0.952,1.0,1.0,,1.0,0.905,0.619,1.0,0.857,0.857,,,,
+NP_001161206.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.926,0.907,0.833,1.0,1.0,,,,
+NP_038475.2,0.605,0.186,0.442,0.279,0.395,0.116,0.558,0.419,0.209,0.349,0.186,0.628,0.14,0.14,0.302,0.116,0.326
+NP_000175.1,0.9,,0.9,,,0.7,,,1.0,,0.9,1.0,1.0,,,,
+NP_002538.1,0.952,0.917,1.0,0.857,0.952,0.857,0.964,0.81,0.714,0.548,1.0,0.976,0.917,0.917,0.881,0.833,0.893
+NP_000633.2,0.975,0.896,0.95,,0.917,0.829,0.742,,0.692,0.808,0.792,0.95,0.883,0.842,0.808,0.8,0.812
+NP_110404.1,1.0,1.0,1.0,1.0,0.987,0.974,1.0,1.0,0.962,0.949,0.897,1.0,1.0,0.974,0.987,0.981,0.962
+NP_000879.2,1.0,1.0,1.0,1.0,1.0,0.857,1.0,1.0,0.571,0.571,0.786,1.0,1.0,1.0,1.0,0.357,0.429
+NP_000603.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079428.2,1.0,0.75,1.0,,,0.25,1.0,,1.0,0.5,0.375,0.75,1.0,,,,
+NP_001949.1,0.749,0.66,0.76,0.665,0.689,0.615,0.709,0.673,0.607,0.491,0.564,0.633,0.582,0.545,0.593,0.618,0.607
+NP_066269.1,1.0,1.0,1.0,0.833,1.0,0.833,0.833,1.0,0.5,1.0,0.5,1.0,1.0,1.0,1.0,0.833,0.833
+NP_008822.2,0.875,0.667,0.833,0.708,0.75,0.417,0.625,0.708,0.625,0.375,0.438,0.875,0.667,0.583,0.708,0.521,0.583
+NP_077006.1,1.0,1.0,1.0,,,1.0,1.0,,1.0,1.0,0.941,1.0,1.0,,,,
+NP_008825.4,1.0,0.955,1.0,0.97,0.939,0.955,0.894,0.515,0.97,0.924,0.856,0.985,0.97,0.924,0.909,0.886,0.879
+NP_116045.2,1.0,0.981,1.0,1.0,1.0,0.957,0.981,1.0,0.933,0.865,0.865,1.0,0.942,0.99,0.99,0.928,0.962
+NP_004161.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000113.1,0.833,0.5,0.667,0.667,0.667,0.667,0.833,0.667,0.667,0.667,0.667,0.667,0.5,0.667,0.5,1.0,1.0
+NP_569157.2,0.5,1.0,0.5,0.5,1.0,1.0,,1.0,1.0,0.5,1.0,0.5,1.0,,,1.0,1.0
+NP_009220.2,1.0,1.0,1.0,,0.979,0.927,1.0,,0.917,0.75,0.812,1.0,1.0,0.938,0.979,0.938,0.906
+NP_060844.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.929,1.0,1.0,1.0,1.0,1.0,1.0
+NP_891552.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003237.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_940927.2,1.0,1.0,0.95,1.0,1.0,1.0,0.95,0.95,0.65,0.85,1.0,1.0,1.0,0.95,1.0,0.9,0.9
+NP_036596.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005238.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001357188.2,1.0,1.0,1.0,,1.0,1.0,0.986,,0.904,0.981,0.99,0.997,0.997,,,1.0,1.0
+NP_004510.1,1.0,0.988,0.995,0.995,0.971,0.979,0.983,0.946,0.873,0.24,0.952,1.0,0.983,0.99,0.99,0.95,0.957
+NP_816931.1,1.0,0.875,1.0,,0.875,0.75,0.875,,0.5,0.625,0.812,1.0,0.875,0.875,0.875,0.75,0.75
+NP_073563.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000524.3,1.0,,1.0,1.0,1.0,0.92,0.92,,0.2,,0.7,1.0,0.88,0.947,1.0,0.947,0.947
+NP_005621.2,1.0,1.0,1.0,1.0,1.0,0.964,1.0,0.976,0.833,0.881,0.857,1.0,1.0,1.0,1.0,0.964,0.994
+NP_003792.1,1.0,0.933,1.0,0.978,1.0,0.933,1.0,0.933,0.867,,0.844,1.0,0.933,0.867,0.889,0.844,0.878
+NP_004302.1,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001784.2,1.0,1.0,1.0,1.0,1.0,1.0,0.944,1.0,0.944,0.889,1.0,1.0,1.0,0.944,0.944,1.0,1.0
+NP_000382.3,0.998,0.989,0.998,0.961,0.971,0.972,0.944,0.973,0.885,0.907,0.877,1.0,0.995,0.963,0.975,0.973,0.982
+NP_079491.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,0.812,1.0,1.0,1.0,1.0,1.0,1.0
+NP_665860.2,1.0,1.0,1.0,1.0,1.0,0.9,0.9,1.0,0.7,0.8,1.0,1.0,1.0,0.8,1.0,1.0,1.0
+NP_005324.3,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.667,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001056.1,1.0,0.976,0.972,0.993,0.972,0.965,0.958,0.972,0.815,0.967,0.948,0.997,0.997,0.993,0.997,0.967,0.965
+NP_005421.1,1.0,,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_004963.1,1.0,1.0,1.0,0.8,1.0,1.0,0.8,0.8,0.875,1.0,0.792,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005848.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_073623.1,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.75,1.0,0.5,0.875,1.0,1.0,1.0,1.0,0.5,0.5
+NP_848621.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.5,1.0,1.0,1.0,1.0
+NP_001288153.1,,,,,,,,,,,,,,,,,
+NP_000594.2,0.955,0.455,1.0,0.409,0.727,0.591,0.5,0.273,0.591,0.273,0.864,0.864,0.5,0.545,0.636,0.45,0.5
+NP_919438.1,0.964,,0.964,,,0.75,,,0.536,,0.732,0.929,0.786,,,,
+NP_064450.3,1.0,0.8,1.0,0.933,0.667,0.867,0.933,1.0,0.8,0.467,0.8,1.0,0.733,0.933,0.933,0.8,0.667
+NP_004399.2,0.897,0.847,0.876,0.883,0.826,0.844,0.848,0.819,0.53,0.696,0.593,0.875,0.839,0.901,0.895,0.784,0.821
+NP_004936.2,0.933,0.992,0.93,,0.81,0.746,,,0.81,0.405,0.712,0.964,0.844,,,0.617,0.733
+NP_001136438.1,0.933,0.747,0.92,,0.693,0.747,0.733,,0.52,0.68,0.573,0.907,0.693,0.667,0.627,0.687,0.687
+NP_000400.2,1.0,0.882,1.0,,1.0,0.755,0.933,,0.647,0.549,0.784,1.0,0.922,1.0,1.0,1.0,1.0
+NP_000388.2,0.992,0.969,0.996,0.952,0.974,0.852,0.946,0.95,0.78,0.731,0.767,0.992,0.972,0.978,0.972,0.928,0.959
+NP_057062.1,0.957,0.88,1.0,0.772,0.88,0.826,0.815,,0.848,0.696,0.685,1.0,0.902,0.902,0.891,0.87,0.81
+NP_006334.2,0.995,0.995,1.0,0.97,0.99,0.856,0.985,0.939,0.914,0.798,0.909,0.99,0.98,0.97,0.949,0.939,0.962
+NP_003914.1,1.0,0.896,1.0,0.896,0.875,0.938,0.833,0.792,0.854,0.792,0.917,1.0,0.917,0.938,0.917,0.51,0.562
+NP_002195.1,1.0,1.0,1.0,0.972,1.0,0.833,0.639,,0.833,0.611,0.792,1.0,1.0,0.972,0.944,1.0,1.0
+NP_055619.2,1.0,0.826,1.0,1.0,1.0,0.913,0.957,0.913,0.696,0.565,0.783,1.0,1.0,1.0,1.0,0.87,0.913
+NP_005202.2,0.999,0.994,0.998,0.981,0.993,0.991,0.992,0.966,0.89,0.99,0.955,0.997,0.992,0.996,0.993,0.929,0.955
+NP_061116.5,1.0,,,1.0,1.0,1.0,,,,,,,,,,,
+NP_000220.1,1.0,0.963,1.0,0.889,1.0,0.975,0.827,0.951,0.975,0.753,0.895,1.0,1.0,0.914,0.951,0.951,0.963
+NP_000401.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000037.2,0.964,0.911,0.949,0.885,0.861,0.86,0.842,0.778,0.698,0.541,0.771,0.916,0.905,0.907,0.916,0.877,0.888
+NP_110420.3,1.0,1.0,1.0,,1.0,0.571,,,0.643,0.0,0.679,1.0,0.857,,,0.429,0.5
+NP_001290039.1,1.0,1.0,1.0,1.0,1.0,0.952,0.952,0.686,0.914,0.995,0.948,1.0,1.0,0.981,0.976,0.91,0.943
+NP_000055.2,0.964,1.0,0.929,0.75,0.929,0.768,0.893,0.857,0.821,0.893,0.679,1.0,1.0,0.821,0.857,0.821,0.75
+NP_000518.1,0.969,,0.994,0.952,0.97,0.968,0.971,0.985,0.91,0.875,0.893,0.947,0.949,0.982,0.985,0.966,0.97
+NP_004351.1,0.826,0.688,0.88,0.692,0.724,0.798,0.705,0.651,0.705,0.629,0.65,0.851,0.705,0.75,0.746,0.676,0.658
+NP_005046.2,1.0,0.865,1.0,0.846,0.981,0.625,0.654,0.846,0.615,0.558,0.856,1.0,0.962,0.808,0.885,0.779,0.827
+NP_056280.2,1.0,1.0,1.0,0.938,0.938,0.906,0.969,1.0,0.625,0.844,0.797,1.0,1.0,0.969,0.938,0.938,0.938
+NP_001106212.1,1.0,0.975,1.0,0.5,0.9,0.95,0.905,1.0,0.9,0.862,1.0,1.0,0.925,0.952,0.952,1.0,1.0
+NP_000110.2,1.0,,1.0,,,1.0,,,0.875,,1.0,1.0,1.0,,,1.0,1.0
+NP_000092.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.825,0.825
+NP_056089.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003352.2,1.0,0.997,1.0,,0.909,0.87,,,0.577,0.94,0.923,1.0,0.987,0.925,0.917,0.969,0.948
+NP_009297.2,0.963,0.978,0.954,0.931,0.92,0.958,,0.953,0.788,0.913,0.831,0.988,0.972,,,,
+NP_055177.2,1.0,0.963,1.0,,1.0,1.0,,,1.0,0.704,1.0,1.0,1.0,,,1.0,1.0
+NP_004447.2,0.963,0.976,0.995,0.958,0.92,0.877,0.883,0.969,0.759,0.504,0.727,0.971,0.955,0.917,0.917,0.861,0.869
+NP_859056.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001837.2,0.994,0.989,0.997,0.982,0.963,0.901,0.905,0.878,0.74,0.988,0.921,0.997,0.984,0.969,0.972,0.917,0.959
+NP_000693.1,1.0,0.988,1.0,0.975,0.982,0.962,0.984,0.934,0.873,0.708,0.929,0.998,0.984,0.971,0.964,0.993,0.991
+NP_003233.4,1.0,1.0,1.0,,0.998,0.952,0.968,,0.762,0.803,1.0,0.998,0.988,0.995,1.0,0.973,0.988
+NP_002388.2,0.921,0.94,0.909,,0.834,0.741,0.821,,0.624,0.941,0.596,0.918,0.926,0.894,0.871,0.694,0.778
+NP_060631.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.75,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001028.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.5,0.0,1.0,1.0,1.0,0.75,0.75,0.75,1.0
+NP_660149.2,0.5,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5,1.0,0.75,1.0,1.0,1.0,1.0,,
+NP_000495.1,1.0,1.0,1.0,0.625,0.967,0.967,0.9,0.125,1.0,0.4,0.95,1.0,0.967,0.5,0.5,0.933,1.0
+NP_000393.4,,,,,,,,,,,,,,,,,
+NP_001243506.2,1.0,1.0,1.0,,,1.0,,,1.0,,0.5,1.0,1.0,,,,
+NP_060225.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.7,0.9,1.0,1.0,1.0,1.0,0.9,1.0
+NP_036412.1,1.0,0.905,1.0,,0.714,1.0,,,0.714,0.714,0.857,0.905,0.714,,,0.905,0.857
+NP_000300.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.357,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000824.1,0.945,0.935,0.97,,0.966,0.895,0.934,0.957,0.83,0.782,0.772,0.944,0.926,0.978,0.963,0.867,0.857
+NP_079492.2,1.0,1.0,1.0,0.944,0.989,0.911,0.978,0.683,0.683,0.933,0.75,1.0,0.994,0.967,0.972,0.919,0.961
+NP_057287.2,1.0,1.0,1.0,1.0,1.0,1.0,0.995,0.981,0.947,0.942,0.962,1.0,1.0,1.0,1.0,0.913,0.942
+NP_055830.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_005263.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.125,1.0,0.875,1.0,1.0,1.0,1.0,0.438,0.75
+NP_006536.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,,0.75,1.0,1.0,1.0,1.0,0.75,1.0
+NP_061928.4,0.924,0.5,0.894,0.576,0.424,0.515,0.545,0.621,0.47,0.659,0.515,0.803,0.53,0.364,0.5,0.674,0.682
+NP_000398.1,1.0,0.9,1.0,1.0,0.8,1.0,1.0,0.4,0.9,0.2,1.0,1.0,0.4,1.0,1.0,0.3,0.0
+NP_689966.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.889,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004989.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.583,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005114.1,1.0,0.5,1.0,,0.5,0.5,,,0.5,0.5,0.75,1.0,0.5,,,,
+NP_036595.2,1.0,1.0,1.0,1.0,0.833,0.833,1.0,1.0,0.833,0.5,0.917,1.0,1.0,1.0,1.0,0.917,1.0
+NP_006010.2,1.0,0.956,0.978,1.0,1.0,0.967,1.0,0.867,0.978,0.933,0.9,0.978,0.956,0.889,0.911,0.933,0.956
+NP_938033.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057264.4,1.0,0.978,1.0,0.889,0.944,0.8,0.867,0.867,0.778,0.689,0.7,0.989,0.956,0.922,0.956,0.889,0.889
+NP_000405.1,0.991,1.0,0.968,,0.977,0.875,0.944,,0.741,0.803,0.787,1.0,0.991,0.894,0.921,0.965,0.944
+NP_149353.1,0.2,0.0,0.8,0.6,0.2,1.0,,0.8,0.2,0.8,0.4,0.6,0.6,,,0.8,1.0
+NP_338599.1,0.944,,1.0,1.0,,1.0,,,0.556,,0.75,0.889,0.833,0.944,0.944,,
+NP_443091.1,1.0,0.964,1.0,0.929,1.0,1.0,1.0,0.857,0.679,0.732,0.857,1.0,1.0,0.964,0.929,1.0,1.0
+NP_005609.3,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,0.222,1.0,0.667,1.0,0.833,1.0,1.0,0.722,0.778
+NP_003667.1,1.0,0.875,1.0,0.875,0.958,0.875,0.833,0.958,1.0,0.375,0.917,0.958,0.875,0.875,0.833,0.958,0.958
+NP_001319.1,1.0,1.0,1.0,0.8,1.0,1.0,1.0,0.8,1.0,0.8,0.8,1.0,1.0,1.0,0.6,1.0,1.0
+NP_001255.4,1.0,0.905,1.0,0.905,0.952,0.952,0.952,0.952,0.905,1.0,0.905,1.0,0.952,,,,
+NP_000882.1,1.0,1.0,1.0,,0.976,1.0,1.0,,1.0,1.0,0.976,0.976,0.976,1.0,1.0,1.0,1.0
+NP_653200.2,1.0,,,1.0,1.0,1.0,1.0,,,0.667,,,,,,,
+NP_000307.1,1.0,0.95,1.0,1.0,1.0,0.95,1.0,,0.55,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_694972.3,1.0,0.8,1.0,0.686,0.943,0.729,0.829,0.771,0.714,0.343,0.657,0.914,0.8,0.829,0.829,0.829,0.857
+NP_056201.2,1.0,0.833,1.0,0.972,0.917,0.875,0.75,0.861,0.917,0.806,0.722,1.0,0.833,,,,
+NP_006249.1,1.0,,1.0,,,1.0,,,1.0,,0.75,1.0,1.0,,,,
+NP_001138780.1,1.0,1.0,1.0,0.667,1.0,1.0,0.667,,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065995.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,0.917,,0.944,1.0,1.0,1.0,1.0,0.931,0.972
+NP_004654.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004209.2,1.0,1.0,1.0,0.938,1.0,1.0,1.0,1.0,0.875,1.0,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003374.3,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0,0.75,0.5,0.625,1.0,1.0,1.0,1.0,0.875,1.0
+NP_006197.1,1.0,1.0,1.0,1.0,1.0,0.967,0.933,1.0,0.667,1.0,0.8,1.0,1.0,1.0,1.0,0.967,1.0
+NP_000095.2,1.0,,1.0,0.984,1.0,0.944,0.976,,0.917,,0.944,1.0,0.991,1.0,1.0,,
+NP_004416.2,1.0,1.0,1.0,,1.0,0.875,1.0,,0.875,0.719,0.938,1.0,1.0,1.0,1.0,0.812,0.906
+NP_003916.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.125,0.938,1.0,1.0,,,1.0,1.0
+NP_777596.2,0.988,0.924,1.0,0.924,0.912,0.912,0.819,0.953,0.749,0.778,0.804,1.0,0.942,0.86,0.842,0.865,0.889
+NP_872621.1,1.0,0.75,1.0,1.0,1.0,1.0,,1.0,1.0,0.75,0.75,1.0,1.0,,,0.75,0.75
+NP_001354906.1,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_060546.2,1.0,0.958,1.0,1.0,1.0,0.917,1.0,0.75,0.625,0.667,0.688,1.0,0.917,0.958,0.958,0.958,1.0
+NP_001003694.1,0.917,0.833,1.0,0.944,0.817,0.817,,0.778,0.683,0.667,0.683,1.0,0.917,,,0.8,0.883
+NP_112205.2,1.0,1.0,1.0,0.5,1.0,0.75,0.5,1.0,1.0,0.75,1.0,1.0,1.0,0.5,0.75,1.0,1.0
+NP_000241.1,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,0.833,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000114.3,1.0,0.915,1.0,0.91,0.91,0.97,0.81,0.91,0.85,0.82,0.91,0.96,0.89,0.93,0.98,0.87,0.85
+NP_000143.2,1.0,0.942,0.999,,0.987,0.883,0.926,,0.853,0.744,0.905,0.997,0.951,0.948,0.929,0.916,0.941
+NP_536350.2,1.0,1.0,1.0,1.0,1.0,0.993,1.0,0.986,0.969,0.624,0.833,1.0,0.997,1.0,1.0,0.946,0.99
+NP_079483.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.778,,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036604.2,1.0,,1.0,,1.0,1.0,,,0.667,,0.917,1.0,1.0,,,,
+NP_055775.2,1.0,,1.0,1.0,1.0,0.923,1.0,1.0,1.0,1.0,0.962,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000189.1,1.0,1.0,1.0,0.967,1.0,1.0,1.0,0.733,0.833,0.683,1.0,1.0,1.0,1.0,1.0,0.75,0.833
+NP_001390.1,0.975,0.905,0.988,0.961,0.967,0.906,0.891,0.938,0.643,0.501,0.879,0.962,0.914,0.922,0.953,0.792,0.869
+NP_060525.3,1.0,1.0,1.0,1.0,0.938,1.0,1.0,1.0,0.812,0.469,1.0,1.0,1.0,1.0,1.0,0.938,1.0
+NP_542378.1,1.0,0.964,1.0,1.0,1.0,0.946,,0.964,1.0,1.0,0.643,0.964,0.964,,,,
+NP_000928.1,,,,,,,,,,,,,,,,,
+NP_005190.4,1.0,1.0,1.0,,0.833,1.0,0.667,0.833,0.75,0.667,0.75,1.0,0.833,0.667,0.667,0.708,0.708
+NP_004454.2,1.0,1.0,1.0,0.954,0.981,1.0,0.991,0.926,0.954,0.907,0.963,1.0,1.0,0.991,1.0,0.954,0.981
+NP_001077431.1,0.913,0.888,0.932,,0.875,0.884,0.765,,0.816,0.842,0.714,0.898,0.879,0.706,0.723,0.769,0.731
+NP_055884.2,1.0,,1.0,0.857,0.857,0.286,0.714,0.857,0.714,0.643,0.786,1.0,1.0,0.714,0.714,0.929,1.0
+NP_000266.2,1.0,0.993,1.0,0.944,0.996,0.933,0.976,0.872,0.874,0.666,0.967,0.998,0.979,0.979,0.983,0.915,0.935
+NP_056477.1,1.0,0.976,1.0,0.964,0.952,0.976,0.893,0.952,0.845,0.786,0.81,0.988,1.0,0.976,0.94,0.911,0.929
+NP_116195.2,1.0,1.0,1.0,,1.0,0.857,1.0,,0.857,0.857,0.857,1.0,1.0,1.0,1.0,0.857,1.0
+NP_058648.4,0.952,0.855,1.0,0.724,0.79,0.782,0.793,0.707,0.694,0.516,0.847,0.968,0.935,0.707,0.724,0.742,0.726
+NP_848547.4,1.0,0.923,1.0,0.885,1.0,1.0,0.846,0.846,0.846,0.769,0.808,1.0,0.962,0.923,0.885,0.885,0.923
+NP_002418.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_598408.1,1.0,,1.0,,1.0,1.0,1.0,,1.0,0.667,0.7,0.933,0.933,1.0,1.0,1.0,1.0
+NP_620128.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.7,0.7,1.0,1.0,1.0,1.0,1.0,1.0
+NP_620596.2,0.984,0.94,0.982,0.929,0.949,0.892,0.926,0.946,0.844,0.644,0.693,1.0,0.989,0.766,0.714,0.841,0.841
+NP_001073893.1,0.991,0.948,1.0,0.802,0.845,0.909,,0.793,0.647,0.75,0.879,0.991,0.914,,,0.935,0.966
+NP_009075.1,0.872,,0.872,,0.974,0.897,1.0,,0.462,,0.449,0.872,0.821,0.641,0.513,0.654,0.667
+NP_068556.2,1.0,1.0,1.0,1.0,1.0,1.0,,,0.5,,1.0,1.0,1.0,,,,
+NP_000046.1,0.929,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003212.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001387.2,0.987,,0.987,,,0.95,,,0.756,,0.55,0.977,0.979,,,,
+NP_115940.2,1.0,0.5,1.0,0.75,0.688,0.75,0.5,1.0,0.625,0.75,0.625,1.0,0.75,0.75,0.625,0.5,0.5
+NP_000327.2,0.982,1.0,0.909,0.945,0.945,0.945,0.873,0.891,0.945,0.791,0.9,1.0,1.0,0.855,0.818,0.891,0.818
+NP_000851.2,1.0,1.0,1.0,0.875,0.875,0.875,0.875,1.0,1.0,0.875,0.625,1.0,1.0,1.0,1.0,0.75,0.75
+NP_000221.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003229.1,0.981,0.885,1.0,,0.942,0.538,0.923,,0.538,0.942,0.75,0.981,0.962,0.865,0.846,0.827,0.856
+NP_065682.2,1.0,1.0,1.0,,1.0,1.0,,,0.8,0.467,0.833,1.0,1.0,,,,
+NP_001036214.1,1.0,1.0,1.0,0.889,1.0,0.981,1.0,1.0,0.923,0.865,0.846,1.0,1.0,,,1.0,1.0
+NP_000179.2,1.0,0.875,1.0,,1.0,1.0,0.875,,1.0,0.25,1.0,1.0,0.875,1.0,1.0,1.0,1.0
+NP_066973.2,1.0,0.857,1.0,1.0,1.0,1.0,,1.0,1.0,0.143,1.0,1.0,0.857,,,1.0,1.0
+NP_002294.2,0.944,0.667,1.0,,0.694,0.639,,,0.444,0.556,0.5,0.944,0.611,,,0.583,0.583
+NP_001092272.1,1.0,1.0,1.0,,1.0,0.944,0.8,,0.778,1.0,1.0,1.0,1.0,0.8,1.0,0.917,1.0
+NP_000102.1,1.0,0.965,1.0,0.953,1.0,0.871,1.0,0.965,0.894,0.529,0.788,1.0,0.988,0.988,0.988,0.988,1.0
+NP_000065.1,1.0,1.0,1.0,0.958,1.0,0.996,0.958,0.979,0.9,0.944,0.973,1.0,1.0,1.0,0.992,0.935,0.988
+NP_000119.1,0.986,0.907,0.957,0.85,0.914,0.864,0.736,0.914,0.893,0.693,0.868,1.0,0.943,0.864,0.836,0.839,0.839
+NP_060119.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.867,1.0,1.0,1.0,1.0,1.0,0.933,0.867,1.0
+NP_001298127.1,1.0,1.0,1.0,1.0,0.833,0.833,,,0.667,0.5,0.5,0.833,0.833,,,,
+NP_000799.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001035197.1,1.0,1.0,1.0,0.984,1.0,0.951,1.0,0.951,0.869,1.0,0.877,1.0,1.0,1.0,1.0,0.975,0.992
+NP_009008.2,1.0,1.0,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_612431.2,1.0,0.833,1.0,0.833,1.0,0.667,,0.833,1.0,,0.583,1.0,0.667,,,,
+NP_000883.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.955,0.727,0.955,1.0,1.0,,,,
+NP_006173.2,1.0,1.0,1.0,,,1.0,1.0,,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000244.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.941,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000676.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.667,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000210.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001123482.1,1.0,,1.0,,1.0,1.0,1.0,0.833,0.5,0.25,0.5,1.0,0.833,0.833,0.75,0.667,0.583
+NP_057331.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001366543.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_000306.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036276.1,1.0,0.924,1.0,,0.964,0.917,0.869,0.939,0.81,0.53,0.94,1.0,0.929,0.955,0.939,0.886,0.894
+NP_001193.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056968.1,1.0,0.933,1.0,0.911,0.956,1.0,0.778,0.978,0.8,0.4,0.956,0.956,0.844,1.0,1.0,0.933,0.911
+NP_060345.2,1.0,0.978,1.0,0.933,0.956,0.689,0.933,0.933,0.867,0.856,0.767,1.0,0.978,0.956,0.956,0.867,0.867
+NP_000299.3,1.0,,1.0,,,1.0,1.0,,0.667,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001835.3,0.991,0.991,0.992,0.984,0.978,0.86,0.94,0.754,0.706,0.954,0.779,0.994,0.989,0.963,0.963,0.897,0.933
+NP_004928.2,1.0,0.996,1.0,,0.996,0.974,0.974,,0.97,0.953,0.701,0.983,0.979,0.987,0.991,0.976,0.983
+NP_060017.1,1.0,0.975,1.0,0.975,0.95,1.0,0.875,1.0,1.0,0.95,0.825,0.95,0.95,0.95,0.95,1.0,1.0
+NP_254275.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001003811.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004125.3,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004713.2,0.917,0.833,0.917,,1.0,0.875,,1.0,0.5,0.417,0.875,0.917,0.917,1.0,1.0,1.0,1.0
+NP_001366380.1,0.944,1.0,1.0,,1.0,0.889,,,0.889,0.844,1.0,0.944,0.889,,,,
+NP_002612.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.444,1.0,1.0,1.0,1.0,1.0,1.0,0.778,0.889
+NP_057445.4,1.0,,0.833,0.667,1.0,0.833,0.667,,0.833,0.167,1.0,0.667,1.0,0.667,1.0,0.833,0.833
+NP_005853.2,0.919,0.906,0.944,0.888,0.906,0.938,0.881,0.931,0.906,0.891,0.781,0.969,0.95,0.806,0.856,0.925,0.925
+NP_570718.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004553.2,0.988,0.917,0.97,0.929,0.935,0.828,0.941,0.825,0.675,0.686,0.568,0.959,0.858,0.935,0.899,0.787,0.817
+NP_000030.1,1.0,1.0,1.0,,0.917,1.0,1.0,,1.0,0.833,0.625,1.0,1.0,1.0,1.0,1.0,1.0
+NP_659505.1,1.0,0.988,1.0,0.994,0.994,0.857,,0.901,0.768,0.869,0.926,1.0,0.988,,,0.807,0.824
+NP_835466.2,1.0,,1.0,0.7,0.6,0.7,,,0.75,,0.5,1.0,1.0,,,,
+NP_037466.3,0.971,0.941,0.926,1.0,1.0,0.978,1.0,0.75,0.971,0.765,0.875,0.971,0.985,0.933,0.933,0.949,0.956
+NP_000230.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001120972.1,1.0,1.0,1.0,0.833,0.991,0.939,,0.81,0.781,0.623,0.969,1.0,0.991,,,0.811,0.868
+NP_055086.1,1.0,0.889,1.0,1.0,1.0,1.0,1.0,1.0,0.889,0.444,0.944,1.0,0.889,1.0,1.0,0.889,0.778
+NP_919259.3,1.0,,1.0,,,0.857,1.0,,0.857,,0.929,1.0,1.0,1.0,1.0,0.929,1.0
+NP_065094.3,1.0,1.0,1.0,0.5,0.75,0.75,0.75,0.75,1.0,1.0,0.875,0.75,0.75,0.25,0.5,1.0,0.5
+NP_000186.2,1.0,0.922,1.0,,0.978,0.989,0.889,,0.944,0.794,0.972,1.0,0.978,0.933,0.933,0.722,0.75
+NP_113621.1,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_085135.1,0.914,0.86,0.914,0.548,0.849,0.742,0.559,0.602,0.624,0.914,0.516,0.925,0.785,0.753,0.796,0.742,0.71
+NP_003755.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.25,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_443106.1,0.417,0.75,0.583,0.917,0.5,0.625,0.5,0.5,0.333,0.25,0.5,0.833,0.833,0.833,0.833,0.5,0.667
+NP_002411.3,1.0,0.919,1.0,,0.909,0.89,0.962,,0.839,0.672,0.828,0.978,0.909,0.925,0.914,0.836,0.876
+NP_036442.3,1.0,0.75,1.0,0.75,0.5,0.75,0.75,0.5,1.0,0.75,0.75,1.0,0.75,0.75,1.0,0.75,0.75
+NP_005059.2,0.778,0.111,1.0,0.778,0.444,0.444,0.889,0.0,0.556,0.111,0.778,0.889,0.333,0.444,0.333,0.444,0.278
+NP_001191244.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_060132.3,1.0,0.9,1.0,0.917,0.917,0.808,0.9,0.767,0.6,0.7,0.667,1.0,0.917,0.8,0.767,0.783,0.783
+NP_006183.2,1.0,1.0,1.0,1.0,1.0,0.909,1.0,1.0,0.909,1.0,0.955,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008823.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.609,0.844,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005852.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.75,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_775300.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,0.583,1.0,1.0,,,1.0,1.0
+NP_000520.1,1.0,0.78,1.0,0.78,0.84,0.69,0.76,0.72,0.9,0.44,0.78,0.9,0.82,0.92,0.9,0.72,0.67
+NP_061896.1,1.0,1.0,1.0,0.875,1.0,1.0,0.875,1.0,0.917,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_068775.1,1.0,0.6,1.0,0.933,0.867,0.933,1.0,1.0,0.8,0.6,0.2,0.733,0.733,1.0,1.0,1.0,1.0
+NP_055023.2,1.0,1.0,1.0,1.0,0.981,1.0,0.894,,0.919,0.959,1.0,1.0,1.0,0.994,0.994,0.923,0.949
+NP_000087.2,1.0,1.0,1.0,0.987,0.987,0.987,,1.0,0.893,0.727,0.96,1.0,0.987,,,,
+NP_003150.1,0.989,0.983,0.989,,0.977,0.94,0.98,,0.822,0.335,0.958,0.971,0.957,0.987,0.988,0.905,0.936
+NP_001138940.4,0.971,0.941,1.0,,0.99,0.926,,,0.647,0.593,0.838,0.971,0.922,,,,
+NP_001642.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.0,0.8,1.0,1.0,1.0,1.0,0.9,1.0
+NP_065107.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003383.4,0.792,0.875,0.917,,0.792,0.708,0.833,0.917,0.583,0.833,0.875,0.792,0.792,0.875,0.917,0.917,0.917
+NP_000890.1,1.0,,1.0,0.667,,0.833,0.333,,0.833,,1.0,1.0,0.667,1.0,1.0,0.667,0.667
+NP_542437.2,1.0,1.0,1.0,,1.0,1.0,1.0,,0.875,0.25,1.0,1.0,0.875,1.0,1.0,1.0,1.0
+NP_000104.1,1.0,1.0,0.9,0.8,1.0,0.7,0.7,1.0,1.0,0.85,1.0,1.0,1.0,0.9,0.9,0.9,0.7
+NP_036332.2,1.0,0.7,1.0,0.9,0.9,1.0,0.8,0.6,0.8,0.6,0.8,1.0,0.8,0.9,0.9,0.7,0.8
+NP_001317964.1,0.969,1.0,0.938,,1.0,1.0,,,1.0,1.0,1.0,0.875,0.906,,,1.0,1.0
+NP_001121903.1,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,0.25,1.0,0.5,,,,
+NP_542172.2,1.0,,1.0,1.0,,1.0,1.0,,0.833,,0.833,1.0,1.0,1.0,1.0,0.917,0.917
+NP_003357.2,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0,1.0,0.25,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000391.1,1.0,1.0,1.0,0.98,1.0,0.966,1.0,0.99,0.941,0.985,0.814,1.0,0.99,1.0,1.0,0.99,1.0
+NP_000486.1,0.988,0.99,0.994,0.984,0.984,0.944,0.873,0.699,0.749,0.968,0.908,0.994,0.989,0.978,0.971,0.94,0.939
+NP_004919.1,1.0,0.942,0.981,,1.0,0.99,0.981,,0.981,0.394,0.837,0.962,0.981,1.0,1.0,0.962,0.981
+NP_003782.1,1.0,1.0,1.0,1.0,0.964,1.0,1.0,1.0,0.786,0.5,0.821,1.0,1.0,1.0,1.0,0.964,0.929
+NP_004886.3,1.0,0.935,1.0,,0.948,0.772,,,0.873,0.683,0.813,1.0,0.972,,,0.69,0.696
+NP_003230.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001358995.1,0.98,1.0,1.0,,0.96,1.0,,,0.98,1.0,0.71,1.0,1.0,,,,
+NP_064608.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.967,0.567,0.933,1.0,1.0,1.0,1.0,0.967,0.967
+NP_000226.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.986,0.993,0.76,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000078.3,,,,,,,,,,,,,,,,,
+NP_004735.2,1.0,1.0,1.0,,,1.0,,,1.0,0.0,1.0,1.0,1.0,,,,
+NP_005226.1,1.0,0.75,1.0,0.625,0.667,1.0,0.792,0.792,0.917,0.458,0.792,0.917,0.708,0.708,0.792,0.562,0.5
+NP_115920.1,0.875,0.812,0.938,0.688,0.938,0.531,0.438,0.875,0.25,0.375,0.719,1.0,0.875,0.875,0.875,0.562,0.625
+NP_001076580.1,1.0,1.0,1.0,,1.0,0.5,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000825.2,0.936,,0.921,0.968,0.924,0.864,0.948,0.838,0.777,,0.837,0.928,0.911,0.897,0.937,0.864,0.904
+NP_001070834.1,1.0,0.993,1.0,,1.0,0.882,,,0.816,0.824,0.926,1.0,0.978,,,,
+NP_000525.1,0.906,0.733,0.844,0.4,0.667,0.969,0.656,0.867,0.938,0.667,0.594,0.75,0.75,0.6,0.6,0.867,0.933
+NP_055610.1,0.959,,0.997,0.992,0.915,0.927,0.987,,0.722,,0.788,0.936,0.913,1.0,1.0,0.844,0.896
+NP_000224.2,1.0,0.989,1.0,0.989,0.977,1.0,0.864,1.0,0.818,0.295,0.75,1.0,1.0,1.0,1.0,0.949,0.96
+NP_056346.3,1.0,0.951,1.0,1.0,1.0,0.971,0.853,,0.559,0.696,0.912,1.0,0.971,0.98,0.951,0.917,0.861
+NP_570853.1,0.92,,0.969,,,0.915,,,0.79,,0.739,0.92,0.92,,,,
+NP_689935.2,0.958,0.917,1.0,1.0,0.958,1.0,0.792,0.833,0.75,0.708,0.958,1.0,1.0,1.0,0.958,1.0,1.0
+NP_079408.3,1.0,0.9,1.0,0.969,0.9,0.862,0.869,0.954,0.792,0.677,0.773,0.992,0.892,0.985,0.985,0.923,0.923
+NP_003165.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.571,0.964,1.0,1.0,,,0.857,0.786
+NP_005132.2,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,0.6,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002237.1,1.0,1.0,0.9,0.9,1.0,0.9,0.9,,0.7,0.6,1.0,0.9,0.8,1.0,1.0,0.8,1.0
+NP_001269463.1,1.0,,1.0,,,1.0,0.333,,0.333,,1.0,1.0,1.0,1.0,0.667,0.667,0.667
+NP_001138830.1,0.963,0.944,0.963,0.981,0.833,0.889,1.0,0.944,0.759,0.963,0.731,1.0,0.907,1.0,1.0,0.907,0.926
+NP_079023.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001223.2,1.0,0.92,1.0,0.8,0.96,1.0,0.84,0.92,1.0,0.84,0.76,1.0,0.92,0.92,0.84,1.0,1.0
+NP_612565.1,1.0,1.0,1.0,1.0,1.0,1.0,0.909,1.0,0.545,0.864,0.864,1.0,1.0,1.0,1.0,0.955,0.955
+NP_066963.2,0.958,0.919,1.0,0.929,0.871,0.862,0.938,0.913,0.808,0.779,0.764,0.935,0.887,0.891,0.891,0.814,0.844
+NP_660161.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060399.1,1.0,,1.0,,1.0,1.0,,,0.833,,1.0,1.0,1.0,,,,
+NP_001098548.2,1.0,,1.0,,0.8,0.7,,,0.4,,0.85,1.0,0.9,,,,
+NP_000866.1,1.0,,1.0,0.977,,0.977,1.0,,0.886,,0.847,0.989,0.943,1.0,0.977,0.989,0.989
+NP_055067.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.967,1.0,1.0,1.0,1.0,1.0,1.0
+NP_803187.1,0.991,0.986,0.998,,0.973,0.972,0.998,,0.881,0.923,0.9,0.984,0.966,0.987,0.992,0.947,0.936
+NP_004704.3,1.0,0.333,1.0,0.333,1.0,1.0,1.0,0.667,0.333,0.333,1.0,1.0,1.0,1.0,1.0,1.0,0.667
+NP_001317619.1,0.875,0.875,0.917,,0.833,0.917,0.833,0.0,0.875,1.0,0.688,0.958,0.958,0.5,0.5,0.75,0.583
+NP_060551.2,1.0,1.0,1.0,,1.0,0.833,1.0,,0.917,0.917,1.0,1.0,1.0,1.0,1.0,0.917,0.917
+NP_004954.2,0.95,0.65,0.95,0.55,0.85,0.7,0.75,0.85,0.95,0.5,0.85,0.85,0.75,0.6,0.75,0.825,0.85
+NP_006485.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,0.95,0.975
+NP_001007027.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.429,1.0,1.0,1.0,1.0,1.0,1.0,0.714,0.857
+NP_055197.2,0.8,0.4,1.0,,0.4,0.6,0.4,,0.4,0.8,0.5,1.0,0.6,0.4,0.6,0.4,0.6
+NP_000578.2,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.917,1.0,1.0,0.958,1.0,1.0,1.0,1.0,0.917,1.0
+NP_000366.1,1.0,0.964,1.0,,0.964,0.83,0.75,0.929,0.75,0.571,0.795,1.0,0.982,0.964,0.893,0.786,0.857
+NP_004762.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000032.1,1.0,1.0,1.0,1.0,0.286,1.0,0.857,1.0,1.0,0.143,1.0,1.0,0.857,1.0,1.0,1.0,1.0
+NP_060545.3,1.0,1.0,1.0,0.81,0.952,0.762,0.905,0.857,0.476,0.833,0.714,1.0,1.0,0.857,0.81,0.881,0.857
+NP_002932.1,1.0,0.857,1.0,1.0,0.762,0.952,0.952,0.762,0.762,0.0,0.905,0.905,0.524,1.0,1.0,0.952,0.976
+NP_001356355.1,1.0,1.0,1.0,1.0,1.0,0.965,,1.0,0.903,1.0,0.944,1.0,1.0,,,,
+NP_001125.1,1.0,0.5,1.0,1.0,0.5,1.0,1.0,0.5,0.5,0.0,1.0,1.0,0.5,0.5,0.5,0.5,0.5
+NP_671724.1,1.0,0.625,1.0,0.5,0.25,0.5,1.0,1.0,0.25,1.0,0.5,1.0,0.625,1.0,1.0,0.625,1.0
+NP_000772.2,1.0,0.9,1.0,0.85,0.85,0.9,0.9,0.85,0.9,0.75,0.6,1.0,0.9,0.8,0.85,0.95,0.85
+NP_004973.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_109587.1,0.919,0.904,0.939,0.894,0.864,0.77,0.808,0.838,0.636,0.922,0.773,0.879,0.854,0.869,0.864,0.861,0.831
+NP_000263.2,0.938,1.0,1.0,0.938,0.75,0.938,,0.875,1.0,0.688,1.0,1.0,1.0,,,0.875,0.938
+NP_115763.2,0.978,0.978,1.0,0.756,0.956,0.711,0.889,0.233,0.578,0.933,0.911,1.0,1.0,1.0,0.978,0.689,0.856
+NP_665898.1,0.857,0.143,1.0,,0.0,0.143,,,0.143,0.0,0.714,1.0,0.0,,,0.143,0.143
+NP_009034.2,1.0,,1.0,0.93,,1.0,1.0,,0.965,,0.632,1.0,0.982,0.965,0.982,1.0,1.0
+NP_001041675.1,1.0,0.9,1.0,,1.0,0.8,,,0.8,0.9,0.8,1.0,0.9,,,0.95,1.0
+NP_005923.3,1.0,1.0,1.0,0.952,0.929,0.893,1.0,0.976,0.881,0.81,0.964,0.976,0.929,1.0,1.0,1.0,1.0
+NP_000153.1,1.0,1.0,1.0,,1.0,1.0,0.993,,0.993,0.651,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055661.3,1.0,0.944,1.0,1.0,0.944,0.907,0.944,0.926,0.833,0.889,0.75,1.0,0.963,0.981,0.963,0.824,0.898
+NP_005240.3,0.99,0.987,0.987,0.986,0.975,0.968,0.992,,0.926,0.979,0.925,0.975,0.975,0.987,0.984,0.958,0.99
+NP_001265067.1,0.971,,0.956,,0.956,0.927,0.749,,0.768,,0.787,0.97,0.94,,,,
+NP_003385.2,1.0,1.0,1.0,1.0,0.75,1.0,0.75,0.25,0.5,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000383.2,1.0,,1.0,0.871,,0.853,1.0,,0.895,,0.947,1.0,0.979,0.953,0.953,0.971,0.988
+NP_000239.1,0.956,0.978,0.933,,0.944,1.0,,,0.933,0.911,0.833,0.967,0.989,,,0.911,0.911
+NP_001157814.1,0.988,0.967,0.988,1.0,0.928,0.972,0.966,0.858,0.759,0.809,0.815,0.949,0.913,0.968,0.955,0.899,0.94
+NP_063938.1,1.0,0.95,1.0,0.875,0.9,0.9,0.875,0.875,1.0,0.6,0.725,1.0,0.95,0.938,0.875,0.844,0.875
+NP_077288.2,0.983,0.939,1.0,0.939,0.916,0.943,0.962,0.872,0.896,0.484,0.891,0.986,0.922,0.954,0.957,0.906,0.926
+NP_060336.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.944,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000522.3,0.966,0.913,0.987,0.859,0.904,0.862,0.851,0.923,0.842,0.842,0.822,0.95,0.879,0.892,0.889,0.921,0.918
+NP_079066.5,1.0,0.812,1.0,0.75,0.938,1.0,,0.938,1.0,0.75,0.625,0.875,0.75,,,0.938,0.812
+NP_060111.2,0.967,0.967,1.0,,0.933,0.956,,,0.922,0.667,0.933,1.0,0.933,,,0.806,0.767
+NP_065689.1,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000073.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004728.1,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.8,0.8,0.6,0.8,1.0,1.0,1.0,1.0,0.8,0.9
+NP_009203.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000034.1,1.0,1.0,1.0,,0.988,0.988,0.925,0.975,0.912,0.625,0.938,1.0,0.95,0.988,1.0,0.938,0.95
+NP_000202.3,1.0,1.0,1.0,,1.0,0.93,,,0.789,0.978,0.889,1.0,1.0,,,,
+NP_061169.2,1.0,0.875,1.0,1.0,0.875,0.875,0.875,1.0,0.5,0.875,0.875,1.0,0.875,1.0,1.0,0.188,0.188
+NP_612422.2,1.0,1.0,1.0,0.922,0.951,0.971,0.956,0.989,0.912,0.696,0.814,1.0,0.99,1.0,1.0,0.98,1.0
+NP_001730.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060762.3,1.0,0.778,1.0,1.0,0.889,1.0,0.556,1.0,0.889,0.556,0.778,1.0,0.778,0.889,0.889,0.778,0.889
+NP_065434.1,1.0,0.686,1.0,0.914,0.914,0.914,0.629,0.714,0.771,0.429,0.871,0.914,0.914,0.857,0.943,1.0,1.0
+NP_110440.1,0.985,0.956,1.0,0.912,0.853,0.691,0.721,0.926,0.691,0.941,0.618,1.0,0.985,0.926,0.941,0.765,0.875
+NP_078954.4,1.0,0.96,1.0,0.94,1.0,1.0,0.96,0.94,0.76,0.72,0.82,1.0,0.88,0.94,0.9,0.98,0.98
+NP_001288.3,1.0,0.921,0.996,0.971,0.993,0.971,0.996,0.921,0.91,0.692,0.968,0.982,0.978,0.989,1.0,0.903,0.932
+NP_000038.2,0.947,0.86,0.973,,0.905,0.858,0.746,0.86,0.784,0.697,0.784,0.97,0.905,0.811,0.83,0.824,0.814
+NP_919253.1,0.996,0.974,0.991,0.97,0.991,0.815,0.905,0.832,0.698,0.627,0.884,0.978,0.97,0.97,0.944,0.8,0.847
+NP_115925.2,1.0,1.0,1.0,,1.0,1.0,,,0.5,1.0,0.5,1.0,1.0,,,,
+NP_660200.1,1.0,0.0,1.0,,1.0,0.0,1.0,,1.0,,0.5,1.0,1.0,0.0,0.0,0.0,0.0
+NP_056240.2,1.0,0.952,1.0,1.0,1.0,1.0,0.952,0.905,0.952,0.357,0.952,1.0,0.952,1.0,1.0,0.857,0.905
+NP_001034437.1,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,0.667,1.0,0.583,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001193928.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.688,0.781,0.594,0.844,1.0,1.0,,,,
+NP_000911.2,1.0,0.992,1.0,,0.973,0.999,0.962,,0.865,0.644,0.861,0.997,0.97,0.989,0.995,0.984,0.989
+NP_000197.1,0.99,0.975,0.982,0.934,0.966,0.978,0.934,0.973,0.878,0.661,0.828,0.986,0.982,0.962,0.951,0.938,0.943
+NP_598004.1,1.0,0.984,1.0,0.94,0.989,0.889,1.0,0.978,0.832,0.647,0.935,1.0,0.978,0.967,0.984,0.924,0.965
+NP_001073945.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.2,1.0,0.9,1.0,1.0,1.0,1.0,0.8,0.8
+NP_443099.1,0.849,0.8,0.816,0.781,0.736,0.773,0.814,0.9,0.581,0.662,0.538,0.857,0.754,0.835,0.814,0.693,0.743
+NP_000340.2,1.0,1.0,1.0,0.909,1.0,0.932,1.0,1.0,0.5,0.409,0.455,1.0,1.0,0.955,1.0,0.636,0.682
+NP_001182058.1,1.0,,,,1.0,1.0,1.0,,0.625,,,1.0,1.0,,,,
+NP_000363.1,1.0,0.939,0.985,0.939,0.992,0.86,0.951,0.407,0.686,0.322,0.871,0.939,0.928,0.928,0.917,0.653,0.646
+NP_009060.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.75,0.25,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003403.2,0.833,0.667,0.778,0.611,0.667,0.861,0.833,,0.722,0.722,0.75,0.778,0.667,0.889,0.944,0.583,0.722
+NP_036459.1,1.0,,1.0,0.948,,0.903,0.909,,0.883,,0.779,0.987,0.896,0.844,0.831,0.942,0.929
+NP_005085.2,1.0,0.969,1.0,0.875,0.938,0.906,0.969,0.781,0.906,0.625,0.812,1.0,1.0,0.969,1.0,0.922,0.938
+NP_000184.1,1.0,1.0,1.0,0.969,0.938,1.0,1.0,,0.938,0.812,1.0,1.0,1.0,0.969,1.0,0.969,0.938
+NP_000491.4,0.984,0.995,1.0,0.989,1.0,1.0,,0.937,1.0,0.789,0.921,1.0,0.989,,,1.0,0.995
+NP_000447.2,1.0,0.861,0.972,,0.75,0.833,0.833,,0.778,0.611,0.611,0.944,0.806,0.833,0.833,0.778,0.806
+NP_001461.1,1.0,1.0,1.0,1.0,1.0,0.8,0.2,1.0,1.0,0.2,0.3,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000295.1,1.0,,1.0,,0.987,0.962,0.962,,0.808,,0.641,0.974,0.885,0.987,0.962,0.981,0.962
+NP_001165948.1,,,,,,,,,,,,,,,,,
+NP_005129.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_114091.4,1.0,,1.0,,0.984,0.968,0.946,,0.825,0.476,0.778,0.984,0.905,,,,
+NP_001194.1,1.0,1.0,1.0,,0.938,0.833,1.0,,0.875,1.0,0.771,1.0,1.0,0.938,1.0,0.885,0.948
+NP_004605.4,0.976,1.0,1.0,0.774,0.964,0.964,0.893,,0.786,,0.917,0.964,0.988,0.929,0.917,0.962,0.962
+NP_000045.1,0.998,,0.995,0.973,,0.975,0.982,,0.723,,0.891,0.998,0.982,0.986,0.975,0.951,0.967
+NP_775925.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,,0.833,,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057613.4,1.0,1.0,1.0,0.9,1.0,0.6,,0.8,1.0,0.8,0.7,1.0,1.0,0.8,0.8,,
+NP_058634.4,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,,,1.0,1.0
+NP_001549.2,1.0,0.929,1.0,0.946,0.946,0.929,0.982,0.982,0.964,0.607,0.536,1.0,0.982,0.893,0.929,0.964,0.964
+NP_000475.1,1.0,0.994,1.0,0.994,0.974,0.776,0.981,0.909,0.61,0.955,0.786,1.0,0.955,0.942,0.981,0.844,0.961
+NP_777577.2,0.917,1.0,0.972,,0.917,0.972,1.0,0.972,0.889,1.0,0.764,1.0,0.944,1.0,1.0,0.958,1.0
+NP_775754.2,1.0,1.0,1.0,0.875,0.933,0.912,0.992,0.933,0.958,0.821,0.908,1.0,0.992,0.967,0.992,0.896,0.921
+NP_001243643.1,1.0,,1.0,1.0,1.0,1.0,,0.864,1.0,1.0,0.773,0.955,1.0,,,,
+NP_006170.1,1.0,,1.0,1.0,1.0,1.0,1.0,,0.5,1.0,1.0,1.0,0.5,1.0,1.0,0.5,1.0
+NP_000484.2,0.88,0.759,0.886,,0.755,0.886,0.775,,0.623,0.21,0.611,0.861,0.719,0.898,0.881,0.748,0.734
+NP_689996.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060958.2,1.0,1.0,1.0,0.961,1.0,0.984,,,0.595,0.905,0.861,1.0,1.0,,,,
+NP_001840.3,0.994,0.994,0.998,0.977,0.988,0.985,0.98,0.935,0.791,0.955,0.912,0.993,0.984,0.983,0.987,0.889,0.941
+NP_619649.1,1.0,0.938,1.0,0.938,0.938,0.938,0.875,1.0,0.938,0.812,0.531,1.0,1.0,0.812,0.938,0.938,0.938
+NP_065753.2,1.0,1.0,1.0,0.7,1.0,0.8,0.8,1.0,0.6,,1.0,1.0,1.0,0.6,0.6,0.8,0.8
+NP_005754.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002791.1,1.0,0.5,1.0,1.0,1.0,0.5,1.0,0.5,0.5,1.0,0.5,1.0,0.5,1.0,1.0,0.5,0.5
+NP_000246.2,1.0,0.999,1.0,0.996,0.999,0.984,,1.0,0.99,0.801,0.921,0.999,0.989,,,,
+NP_004888.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.889,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_054799.4,0.995,0.956,0.956,0.951,0.99,0.941,0.848,,0.956,,0.956,0.951,0.951,,,,
+NP_886552.3,1.0,1.0,1.0,0.875,1.0,1.0,0.938,0.812,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,1.0
+NP_000683.3,1.0,0.8,1.0,0.8,0.8,0.6,,0.6,0.6,1.0,0.3,1.0,1.0,,,,
+NP_065685.1,1.0,1.0,1.0,0.667,0.833,0.833,1.0,0.833,0.833,0.667,0.917,1.0,1.0,1.0,0.833,0.917,0.917
+NP_640341.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.625,1.0,1.0,1.0,1.0,0.75,0.75
+NP_001157877.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,
+NP_002520.2,0.979,0.897,0.99,0.88,0.957,0.913,0.966,0.846,0.812,0.688,0.786,0.984,0.925,0.927,0.915,0.898,0.937
+NP_001293061.2,1.0,1.0,1.0,,0.924,0.784,,,0.498,1.0,0.754,1.0,1.0,,,0.854,0.844
+NP_068776.2,1.0,1.0,1.0,0.986,1.0,1.0,1.0,0.931,0.778,0.625,0.972,1.0,1.0,1.0,1.0,0.931,0.944
+NP_004997.4,1.0,1.0,1.0,,1.0,0.917,1.0,,0.75,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002147.2,1.0,1.0,1.0,,0.833,0.667,0.333,0.0,1.0,1.0,0.5,1.0,1.0,0.833,0.833,0.333,0.333
+NP_000933.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_077025.2,1.0,0.952,0.976,0.929,0.952,0.929,0.952,0.905,0.81,0.405,0.833,0.976,0.976,0.929,0.905,0.976,0.952
+NP_002431.2,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_037523.2,0.714,1.0,0.857,1.0,1.0,0.857,0.857,0.857,0.857,0.714,0.786,1.0,1.0,1.0,1.0,0.857,0.857
+NP_775766.2,0.917,0.667,1.0,1.0,0.833,0.833,0.667,0.75,0.833,0.833,0.833,0.917,0.75,0.75,0.833,0.583,0.583
+NP_000909.2,1.0,1.0,1.0,1.0,1.0,0.929,0.929,1.0,0.643,0.929,0.607,1.0,1.0,1.0,1.0,0.929,0.964
+NP_003621.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612404.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003485.1,0.975,0.96,0.977,,0.935,0.898,0.82,,0.681,0.683,0.799,0.983,0.955,0.922,0.899,0.845,0.882
+NP_001473.1,1.0,0.944,1.0,0.944,0.944,0.944,0.778,1.0,0.833,0.722,0.806,1.0,0.944,1.0,1.0,0.944,0.944
+NP_037518.3,0.556,,1.0,0.667,1.0,1.0,1.0,1.0,0.889,0.778,0.778,1.0,1.0,1.0,1.0,0.833,0.778
+NP_071765.2,1.0,0.963,1.0,0.993,0.982,0.969,0.904,0.926,0.71,0.614,0.86,0.996,0.971,0.993,0.993,0.912,0.945
+NP_060215.4,1.0,1.0,1.0,,0.85,0.788,,,0.75,0.95,0.9,1.0,1.0,,,0.862,0.975
+NP_001011551.1,1.0,0.4,1.0,0.4,1.0,1.0,0.4,1.0,0.8,0.5,0.2,1.0,0.8,0.4,0.4,1.0,1.0
+NP_000422.1,1.0,1.0,1.0,0.95,1.0,0.969,1.0,1.0,0.984,0.484,0.953,1.0,1.0,0.95,1.0,1.0,1.0
+NP_000080.2,0.999,0.996,1.0,0.995,0.996,0.993,0.994,0.708,0.777,0.959,0.968,0.999,0.994,0.997,0.996,0.983,0.989
+NP_001009999.1,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_057683.1,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061868.1,1.0,1.0,1.0,,1.0,0.8,0.8,,0.8,0.4,0.9,1.0,1.0,1.0,1.0,0.8,0.8
+NP_002851.2,1.0,0.868,1.0,0.912,0.923,0.801,0.794,0.787,0.75,0.449,0.732,1.0,0.926,0.757,0.794,0.761,0.86
+NP_116281.2,1.0,1.0,1.0,1.0,1.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0
+NP_078961.3,1.0,1.0,1.0,1.0,1.0,0.893,1.0,0.871,0.814,0.314,0.943,1.0,1.0,1.0,0.971,0.9,0.929
+NP_060141.3,1.0,1.0,1.0,1.0,1.0,0.583,0.75,0.583,0.583,1.0,0.25,1.0,1.0,0.917,1.0,0.583,0.667
+NP_004968.2,0.982,0.839,1.0,,0.929,0.92,0.768,,0.857,0.67,0.714,0.964,0.821,,,0.844,0.781
+NP_005994.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001001937.1,1.0,1.0,1.0,1.0,0.8,1.0,1.0,,1.0,0.8,0.7,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003043.3,0.944,0.833,0.889,1.0,0.917,0.778,1.0,1.0,0.556,0.667,0.833,0.833,0.778,1.0,1.0,1.0,1.0
+NP_008871.3,0.882,0.902,0.951,1.0,0.77,0.73,0.912,0.882,0.431,0.853,0.725,0.775,0.745,0.882,0.892,0.775,0.853
+NP_000438.2,1.0,0.9,1.0,1.0,0.85,0.95,0.95,0.7,0.4,0.5,0.75,0.95,0.9,1.0,1.0,0.65,0.8
+NP_004362.2,0.968,0.964,0.988,0.849,0.9,0.897,,0.722,0.806,0.913,0.674,0.953,0.902,,,0.902,0.939
+NP_001123561.1,1.0,0.929,1.0,0.857,0.964,1.0,,1.0,1.0,0.714,0.786,1.0,1.0,,,1.0,0.964
+NP_006151.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055744.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.818,0.909,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001269154.1,1.0,0.962,0.975,1.0,0.975,0.869,0.938,,0.925,0.538,0.788,0.988,0.95,0.988,0.95,0.862,0.938
+NP_001365406.2,,,,,,,,,,,,,,,,,
+NP_004444.2,0.997,1.0,0.971,0.977,0.997,0.979,0.895,0.987,0.847,0.582,0.934,0.992,1.0,0.997,0.961,0.98,0.984
+NP_689656.2,1.0,0.987,1.0,0.987,0.987,0.962,1.0,0.987,0.923,0.667,0.936,1.0,1.0,0.949,0.962,0.962,0.949
+NP_597725.1,1.0,0.667,1.0,,0.667,0.667,1.0,1.0,0.5,0.5,0.583,1.0,1.0,1.0,1.0,0.5,0.667
+NP_055929.1,1.0,0.87,1.0,0.696,0.913,1.0,0.783,1.0,0.826,1.0,0.891,1.0,0.913,0.957,0.913,0.87,0.87
+NP_055199.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.333,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005403.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001174.2,0.885,0.885,0.962,0.962,0.923,0.769,0.923,0.846,0.923,,0.865,0.885,0.885,0.962,0.923,0.904,0.846
+NP_056227.2,1.0,0.667,1.0,0.667,0.667,0.667,0.667,0.667,0.667,1.0,0.333,1.0,1.0,0.667,0.667,0.667,0.667
+NP_203123.1,1.0,0.9,1.0,1.0,1.0,0.5,1.0,0.7,0.4,1.0,0.3,1.0,0.9,1.0,0.9,0.7,0.8
+NP_036240.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.0,1.0,1.0,1.0,1.0,0.5,0.5,0.5
+NP_000384.2,0.981,0.985,0.99,0.992,0.994,0.856,0.866,0.676,0.744,1.0,0.786,1.0,0.999,0.921,0.947,0.881,0.91
+NP_003602.1,0.942,0.864,0.95,0.845,0.876,0.946,0.837,0.895,0.857,0.723,0.922,0.93,0.868,0.888,0.884,0.901,0.901
+NP_075555.1,1.0,,1.0,1.0,,1.0,1.0,,0.75,,0.986,1.0,1.0,0.944,0.889,0.847,0.972
+NP_065875.3,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.7,0.5,1.0,1.0,0.8,,,,
+NP_000545.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.986,0.857,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003651.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000350.1,1.0,0.991,1.0,0.986,0.985,0.955,0.993,0.958,0.795,0.953,0.867,1.0,0.995,0.988,0.972,0.938,0.951
+NP_001001412.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006155.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.714,0.905,0.905,1.0,1.0,1.0,1.0,0.929,0.976
+NP_055513.2,1.0,1.0,1.0,0.944,1.0,0.889,1.0,0.944,0.556,0.889,0.778,1.0,0.944,1.0,1.0,0.667,0.667
+NP_689476.2,1.0,0.714,1.0,,0.857,1.0,,,0.643,0.714,1.0,1.0,0.857,,,,
+NP_001121631.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,0.571,1.0,1.0,,,1.0,1.0
+NP_006843.2,0.958,0.979,1.0,,0.896,0.896,,0.833,0.479,0.458,0.667,0.958,0.958,,,0.896,0.875
+NP_001305781.1,0.978,,1.0,0.972,0.911,0.911,,,0.889,,0.889,1.0,1.0,,,,
+NP_008860.4,1.0,1.0,1.0,0.923,1.0,1.0,0.846,0.923,1.0,0.846,0.808,1.0,1.0,0.923,0.923,0.962,0.923
+NP_001027567.2,1.0,1.0,1.0,,1.0,0.933,,,0.467,0.933,0.967,1.0,1.0,,,0.733,0.733
+NP_008990.2,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5,0.5,1.0,0.75,1.0,1.0,0.5,0.75
+NP_067632.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003623.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001027565.1,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0,1.0,1.0,1.0,0.6
+NP_004482.4,1.0,,1.0,,1.0,1.0,0.667,,1.0,,0.667,1.0,1.0,1.0,1.0,,
+NP_002460.1,1.0,0.0,1.0,0.25,0.0,0.0,0.25,0.5,1.0,0.5,0.25,0.5,0.0,0.25,0.0,0.25,0.25
+NP_003064.2,0.25,,0.5,,,0.625,0.429,,0.75,,0.5,0.625,0.625,0.429,0.571,0.857,0.571
+NP_000711.1,0.96,,0.984,,,0.944,,,0.905,,0.885,1.0,1.0,,,0.967,0.99
+NP_001123990.1,1.0,0.944,1.0,0.972,1.0,0.917,0.917,0.806,0.75,0.694,0.861,0.833,0.75,0.861,0.833,0.917,0.972
+NP_653213.6,0.988,0.929,0.965,0.941,0.965,0.8,,,0.6,0.859,0.741,0.965,0.929,,,0.806,0.841
+NP_003969.2,1.0,,1.0,0.708,0.958,0.75,0.833,0.625,1.0,0.729,0.792,0.958,0.917,0.792,0.833,0.687,0.792
+NP_000465.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.952,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055306.1,0.65,1.0,0.65,,0.6,0.6,0.55,,0.6,1.0,0.35,0.7,0.6,1.0,1.0,0.875,0.875
+NP_116071.2,0.981,0.947,0.994,,0.948,0.929,0.953,,0.713,0.821,0.805,0.969,0.966,0.963,0.963,0.978,0.978
+NP_057664.1,1.0,0.882,1.0,0.974,0.974,0.921,0.868,0.974,0.75,0.329,0.868,1.0,0.974,0.961,0.987,0.875,0.914
+NP_203129.1,1.0,0.875,1.0,1.0,0.875,0.5,0.875,0.875,0.125,1.0,0.75,0.875,0.875,0.875,0.875,0.938,1.0
+NP_002891.1,0.889,,,0.852,0.815,0.815,0.741,,,0.519,,,,,,,
+NP_001262.3,0.982,,0.974,,0.961,0.883,,,0.824,,0.867,0.963,0.961,,,0.937,0.94
+NP_778243.1,1.0,,1.0,1.0,0.95,1.0,,,1.0,,0.6,1.0,1.0,,,,
+NP_001181939.1,1.0,0.955,1.0,,1.0,0.864,,,0.864,0.773,0.909,1.0,0.909,,,,
+NP_055782.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000081.2,1.0,0.997,1.0,0.998,0.997,0.972,0.935,0.776,0.684,0.961,0.858,1.0,0.995,0.998,0.998,0.95,0.96
+NP_071417.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003579.3,0.867,0.833,1.0,0.933,0.833,0.933,0.933,0.8,0.9,0.75,0.8,0.867,0.8,0.8,0.8,0.817,0.833
+NP_758840.1,1.0,0.75,1.0,,1.0,1.0,0.75,,0.75,0.75,1.0,1.0,0.75,1.0,0.75,1.0,1.0
+NP_001129649.1,1.0,1.0,1.0,0.967,1.0,1.0,1.0,1.0,1.0,0.367,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_001012426.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001177666.1,1.0,1.0,0.958,,0.958,1.0,0.944,,0.875,1.0,0.625,1.0,1.0,1.0,1.0,0.556,0.556
+NP_068745.2,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.7,0.7,1.0,0.9,1.0,1.0,1.0,1.0,0.7,0.7
+NP_659434.2,0.933,0.967,1.0,1.0,1.0,0.8,0.867,0.967,0.6,0.6,0.917,0.9,1.0,1.0,0.967,0.933,1.0
+NP_689967.2,1.0,0.667,1.0,0.667,0.667,0.667,0.667,0.667,1.0,0.167,0.833,1.0,0.667,0.667,0.667,1.0,1.0
+NP_001229817.2,1.0,0.95,1.0,,0.892,0.908,1.0,,0.842,0.708,0.712,1.0,0.925,1.0,1.0,1.0,1.0
+NP_000211.1,1.0,0.939,1.0,0.818,0.818,0.818,0.909,0.758,0.788,0.439,0.773,1.0,0.879,0.848,0.818,0.758,0.697
+NP_001138584.1,1.0,,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000316.2,1.0,,1.0,,,1.0,,,1.0,,0.5,1.0,1.0,,,1.0,1.0
+NP_653174.3,1.0,1.0,1.0,1.0,1.0,0.625,0.625,0.875,0.875,0.875,0.625,1.0,0.875,0.875,0.75,0.75,0.75
+NP_775748.2,1.0,,1.0,1.0,1.0,0.9,1.0,,0.4,,0.9,1.0,1.0,1.0,1.0,0.9,1.0
+NP_148982.1,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_055917.1,1.0,0.917,1.0,0.917,0.708,0.823,0.854,0.812,0.667,0.76,0.948,1.0,0.958,1.0,0.958,0.844,0.885
+NP_001128493.1,1.0,1.0,1.0,,,0.833,1.0,,0.667,1.0,0.958,1.0,1.0,,,,
+NP_001617.1,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0,0.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003713.3,0.996,0.981,0.994,1.0,0.976,0.864,0.921,0.844,0.654,0.745,0.7,0.985,0.956,0.974,0.977,0.913,0.972
+NP_067032.3,1.0,0.958,1.0,,0.951,0.931,0.812,,0.785,0.104,,0.972,0.889,0.868,0.924,0.903,0.91
+NP_065114.3,1.0,0.8,1.0,0.8,0.8,0.8,0.9,0.2,0.8,0.0,0.9,1.0,0.8,0.8,0.8,0.85,0.9
+NP_001139267.1,0.833,0.833,0.5,1.0,0.167,0.667,1.0,0.333,0.5,1.0,0.167,0.667,0.833,1.0,1.0,0.25,0.083
+NP_000200.1,0.9,0.9,1.0,0.8,0.6,1.0,0.8,1.0,0.9,0.9,0.9,1.0,0.9,0.9,0.9,0.9,0.85
+NP_060798.2,0.684,0.549,0.816,0.549,0.861,0.712,0.759,0.622,0.553,0.675,0.729,0.774,0.684,0.898,0.921,0.632,0.633
+NP_001261.2,0.7,,0.6,0.4,0.5,0.45,0.6,,0.6,,0.55,0.8,0.7,0.5,0.6,0.4,0.4
+NP_005466.1,0.875,1.0,0.875,0.75,1.0,1.0,0.875,0.625,1.0,0.625,1.0,1.0,1.0,1.0,1.0,0.938,1.0
+NP_006061.2,0.944,0.25,0.889,,0.889,0.667,1.0,,0.833,0.75,0.556,0.833,0.833,0.0,0.0,0.25,0.25
+NP_000530.1,0.995,0.971,0.997,0.95,0.987,0.775,0.921,0.849,0.624,0.635,0.716,0.994,0.946,0.983,0.976,0.757,0.841
+NP_073747.1,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_067063.1,1.0,0.9,1.0,0.2,0.8,0.6,0.5,0.8,0.6,0.2,0.9,1.0,0.9,0.6,0.6,0.6,0.6
+NP_003461.2,1.0,1.0,1.0,0.944,1.0,1.0,0.889,1.0,0.889,0.944,0.667,1.0,1.0,0.944,1.0,1.0,1.0
+NP_068799.2,0.889,0.778,1.0,0.667,0.556,1.0,0.333,0.667,0.667,0.667,0.5,1.0,0.778,0.778,0.889,0.667,0.778
+NP_003482.1,0.905,0.952,0.952,0.905,1.0,0.952,0.952,0.952,0.857,0.929,0.476,1.0,1.0,0.952,0.952,1.0,0.952
+NP_002885.1,1.0,1.0,1.0,,1.0,1.0,0.75,,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004647.1,1.0,0.857,1.0,0.952,0.929,0.976,0.976,0.881,0.905,0.762,0.952,1.0,1.0,0.952,0.952,0.976,0.976
+NP_036601.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001121901.1,1.0,0.4,1.0,,1.0,1.0,,0.8,1.0,0.6,0.8,1.0,0.8,,,0.7,0.8
+NP_000337.1,0.988,0.97,0.974,0.997,0.979,0.93,0.972,0.635,0.759,0.899,0.833,0.99,0.981,0.979,0.979,0.759,0.806
+NP_055147.1,1.0,,1.0,0.0,,0.0,1.0,,0.0,,1.0,1.0,0.0,1.0,0.0,0.0,0.0
+NP_006450.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,,
+NP_001244159.1,0.958,0.929,1.0,,0.946,0.893,,,0.798,0.804,0.604,0.94,0.875,,,0.762,0.78
+NP_001095876.1,0.931,0.793,0.983,,0.741,0.707,0.983,0.69,0.793,0.983,0.724,0.948,0.793,0.862,0.828,0.94,0.828
+NP_001307509.1,0.875,0.875,0.875,1.0,1.0,0.875,0.875,0.75,0.375,,0.875,1.0,1.0,1.0,1.0,0.625,0.75
+NP_061764.2,0.91,0.857,0.859,0.714,0.476,0.808,0.81,0.667,0.846,0.667,0.744,0.859,0.782,0.714,0.667,0.571,0.429
+NP_004026.2,1.0,1.0,1.0,1.0,0.8,1.0,,1.0,0.8,0.8,0.8,1.0,1.0,,,1.0,1.0
+NP_002760.1,0.897,0.735,0.786,0.775,0.897,0.88,0.8,0.75,0.897,0.577,0.624,0.812,0.812,0.7,0.55,0.938,0.85
+NP_996919.1,1.0,1.0,1.0,1.0,0.944,1.0,,1.0,0.889,1.0,0.889,1.0,1.0,,,0.889,0.889
+NP_001073991.2,1.0,0.895,0.994,0.983,0.932,0.932,0.943,0.895,0.79,0.67,0.892,0.985,0.914,0.893,0.914,0.918,0.919
+NP_056655.3,1.0,1.0,1.0,,,1.0,,,0.929,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_056979.1,0.889,0.556,1.0,,1.0,1.0,0.444,,0.889,0.889,0.778,1.0,0.778,0.667,0.778,0.556,0.667
+NP_001005369.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060859.4,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,0.667,0.167,0.75,1.0,0.833,1.0,1.0,0.75,1.0
+NP_002599.1,1.0,0.25,0.875,0.875,0.5,1.0,0.875,1.0,0.75,0.625,0.375,0.875,0.5,0.75,0.875,0.938,1.0
+NP_055238.1,1.0,0.986,0.972,0.917,1.0,0.889,0.903,0.743,0.889,0.944,0.84,0.986,0.958,0.917,0.931,0.84,0.812
+NP_058651.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,0.792,1.0,0.917,1.0,1.0,0.833,0.917
+NP_004616.2,1.0,1.0,1.0,1.0,1.0,0.0,1.0,0.5,0.5,1.0,0.5,1.0,1.0,1.0,1.0,0.75,1.0
+NP_679211.2,1.0,1.0,1.0,,1.0,1.0,,0.923,0.538,0.385,0.231,1.0,0.769,,,0.875,0.75
+NP_001229886.1,1.0,0.963,1.0,,1.0,0.741,0.958,0.875,1.0,0.5,0.722,1.0,0.963,0.958,1.0,1.0,1.0
+NP_659403.4,1.0,1.0,1.0,1.0,1.0,1.0,0.975,,0.9,1.0,1.0,1.0,1.0,1.0,1.0,0.95,0.9
+NP_065071.1,0.733,0.533,0.848,0.733,0.476,0.6,0.657,0.695,0.762,0.619,0.324,0.762,0.59,0.676,0.6,0.686,0.743
+NP_945185.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055761.2,0.995,0.99,0.993,0.99,0.979,0.973,0.978,0.992,0.971,0.929,0.93,0.991,0.967,0.985,0.984,0.972,0.975
+NP_001177203.1,0.908,0.93,0.908,0.915,0.858,0.822,0.896,0.892,0.787,0.864,0.535,0.887,0.876,0.89,0.887,0.772,0.826
+NP_115996.1,1.0,1.0,0.923,1.0,1.0,1.0,0.923,0.923,1.0,1.0,0.885,1.0,1.0,1.0,1.0,0.962,0.923
+NP_009190.2,1.0,0.889,1.0,1.0,1.0,0.889,0.889,1.0,0.778,0.556,0.833,0.889,0.889,1.0,1.0,1.0,1.0
+NP_694944.1,1.0,1.0,1.0,0.8,0.9,0.3,0.4,0.8,0.1,0.6,0.3,1.0,0.8,0.7,0.7,0.65,0.7
+NP_000157.1,1.0,0.998,1.0,,1.0,0.968,1.0,,0.868,0.877,0.929,0.996,0.987,1.0,1.0,0.922,0.98
+NP_598006.1,1.0,0.733,1.0,1.0,0.533,0.733,0.917,0.5,0.8,1.0,0.733,0.933,0.933,0.833,0.833,0.867,0.867
+NP_005413.2,0.995,0.9,1.0,0.873,0.95,0.955,0.9,0.945,0.791,0.759,0.882,0.973,0.941,0.964,0.959,0.866,0.907
+NP_009004.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.7,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055461.1,0.833,0.5,1.0,0.833,0.833,1.0,0.833,1.0,0.833,0.667,0.917,0.833,0.667,0.833,1.0,0.917,1.0
+NP_004743.1,0.969,0.877,0.908,0.692,0.846,0.692,0.677,0.831,0.785,0.6,0.669,0.954,0.831,0.662,0.692,0.8,0.792
+NP_004966.1,0.985,0.98,0.974,0.981,0.985,0.932,0.988,0.968,0.853,0.942,0.933,0.977,0.97,0.981,0.979,0.954,0.959
+NP_004741.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001534.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_005553.2,1.0,1.0,1.0,,0.913,0.826,,,0.565,1.0,0.978,1.0,1.0,,,0.565,0.565
+NP_783328.1,0.987,0.954,0.992,0.966,0.975,0.914,0.979,0.933,0.739,0.483,0.989,0.958,0.924,0.996,0.992,0.95,0.937
+NP_001025178.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.643,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001124295.1,0.969,0.936,1.0,0.971,0.975,0.968,,0.92,0.906,0.707,0.954,0.972,0.947,,,0.942,0.969
+NP_004786.2,1.0,1.0,1.0,1.0,1.0,0.643,0.929,0.929,0.714,0.929,0.893,1.0,1.0,0.929,0.929,0.821,0.893
+NP_060379.2,1.0,0.5,1.0,1.0,,0.786,0.929,,0.5,,0.929,1.0,0.929,1.0,0.929,0.857,0.929
+NP_060277.1,0.885,0.808,1.0,0.923,0.615,0.923,0.846,1.0,0.808,0.808,0.75,0.885,0.731,0.923,0.923,0.962,0.923
+NP_476516.1,0.833,0.778,1.0,0.778,0.778,0.889,0.611,0.833,0.444,0.778,0.556,1.0,0.833,0.667,0.667,0.778,0.778
+NP_000165.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006177.1,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,0.8,0.9,0.5,1.0,1.0,0.9,1.0,1.0,1.0
+NP_000456.2,1.0,1.0,1.0,0.833,0.967,1.0,1.0,0.967,0.767,1.0,0.767,1.0,1.0,0.9,0.8,1.0,1.0
+NP_001070.2,1.0,0.5,1.0,1.0,0.688,0.625,0.562,0.75,0.5,0.25,0.688,0.938,0.5,0.75,0.75,0.625,0.75
+NP_000481.2,1.0,0.875,1.0,0.917,0.958,0.75,0.833,0.917,0.75,0.771,0.833,1.0,1.0,0.917,0.917,0.854,0.917
+NP_073746.2,1.0,0.71,0.774,0.935,1.0,1.0,1.0,,0.839,0.192,0.452,0.903,0.903,1.0,1.0,0.903,0.903
+NP_000247.2,0.998,0.915,1.0,0.929,0.891,0.865,0.79,0.892,0.654,0.735,0.863,0.988,0.921,0.889,0.871,0.845,0.89
+NP_002456.2,0.944,0.958,0.986,,0.958,0.833,,,0.819,0.889,0.667,0.972,0.917,,,0.833,0.833
+NP_001353076.1,,,,,,,,,,,,,,,,,
+NP_000044.2,0.977,0.985,0.983,0.941,0.934,0.958,0.905,0.941,0.749,0.635,0.839,0.926,0.934,0.938,0.934,0.932,0.918
+NP_938011.1,0.857,0.607,0.982,0.786,0.679,0.536,,0.661,0.357,0.848,0.089,0.821,0.661,,,,
+NP_647480.1,1.0,0.75,1.0,0.75,0.75,0.875,0.75,0.375,0.875,0.375,0.625,1.0,0.75,0.75,0.75,0.812,0.875
+NP_115791.3,1.0,0.984,0.997,1.0,0.997,0.987,0.96,0.984,0.929,0.664,0.963,0.992,1.0,1.0,1.0,0.971,0.985
+NP_647479.2,1.0,1.0,0.968,1.0,0.976,0.948,1.0,0.976,0.984,1.0,0.54,1.0,1.0,0.952,0.976,0.96,0.96
+NP_001774.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001709.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060035.2,0.958,,1.0,0.917,0.875,0.958,0.875,0.875,0.875,0.958,0.958,1.0,0.833,1.0,0.958,0.875,0.938
+NP_001123616.1,1.0,1.0,1.0,,1.0,1.0,,,0.8,0.6,0.8,1.0,1.0,,,0.95,1.0
+NP_808592.2,1.0,,0.996,,,0.989,0.9,,0.958,,0.933,0.966,0.958,0.973,0.969,0.943,0.925
+NP_055054.1,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,0.688,1.0,0.688,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006120.1,0.944,0.778,0.778,0.833,0.778,0.444,0.722,0.611,0.333,0.556,0.5,0.944,0.889,0.833,0.833,0.722,0.556
+NP_009139.1,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003457.1,1.0,1.0,1.0,,1.0,0.875,,,0.667,0.917,0.583,0.917,0.792,,,1.0,1.0
+NP_938023.1,1.0,0.964,1.0,,0.911,0.929,0.964,0.875,0.875,0.821,0.902,1.0,0.911,0.929,0.946,0.929,0.929
+NP_055887.3,1.0,0.889,0.972,0.786,1.0,0.847,0.714,0.857,0.889,0.694,0.694,0.972,0.944,0.786,0.786,0.847,0.833
+NP_116575.3,1.0,0.95,1.0,,0.95,0.84,1.0,,0.85,0.53,0.9,0.99,0.94,1.0,1.0,0.9,0.94
+NP_002746.1,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,0.933,0.75,1.0,1.0,1.0,1.0,0.967,0.967
+NP_002435.1,1.0,0.841,1.0,0.955,0.818,0.727,1.0,0.818,0.364,0.523,0.727,0.955,0.909,0.955,0.955,0.932,1.0
+NP_000343.2,0.993,1.0,0.998,0.68,0.976,0.946,0.943,1.0,0.797,0.667,0.788,0.99,0.971,0.925,0.922,0.946,0.941
+NP_004577.1,1.0,0.986,1.0,,0.984,0.995,0.973,0.984,0.989,0.47,0.812,1.0,0.946,0.995,1.0,1.0,1.0
+NP_694571.2,1.0,1.0,1.0,0.875,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612370.3,1.0,1.0,1.0,,0.889,0.963,1.0,,1.0,1.0,0.963,1.0,1.0,1.0,1.0,0.963,0.963
+NP_000107.1,1.0,,1.0,,1.0,1.0,,,1.0,0.833,1.0,1.0,1.0,,,1.0,1.0
+NP_036340.1,1.0,0.9,1.0,,0.85,0.925,0.8,1.0,0.8,0.8,0.75,1.0,1.0,0.9,0.75,1.0,1.0
+NP_000164.5,1.0,0.928,0.993,0.914,0.974,0.947,0.796,0.895,0.928,0.628,0.839,1.0,0.967,0.961,0.928,,
+NP_065873.2,0.972,0.927,0.976,,0.932,0.868,,,0.778,0.391,0.866,0.95,0.91,,,,
+NP_115618.3,1.0,1.0,1.0,1.0,1.0,1.0,0.818,0.909,0.909,0.727,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005701.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_570722.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.4,0.833,1.0,1.0,,,0.833,0.833
+NP_003313.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.966,0.932,1.0,0.993,1.0,1.0,1.0,1.0,0.939,0.973
+NP_004319.1,0.956,0.926,0.971,,1.0,0.985,1.0,,0.985,0.941,0.824,0.882,0.941,1.0,1.0,1.0,1.0
+NP_001482.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.5,1.0,1.0,1.0,,,,
+NP_001020105.1,1.0,0.333,1.0,,0.0,0.0,0.0,,1.0,0.0,1.0,1.0,0.333,0.0,0.0,0.0,0.0
+NP_758454.1,1.0,,1.0,,,1.0,1.0,,1.0,,0.712,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006294.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_068770.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.333,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833
+NP_000115.1,0.997,0.994,0.994,1.0,0.982,0.966,0.985,0.967,0.833,0.711,0.938,0.994,0.976,0.988,0.994,0.951,0.994
+NP_002755.1,1.0,1.0,1.0,1.0,0.941,0.941,1.0,1.0,0.824,1.0,0.882,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001155197.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,,
+NP_054728.2,0.959,0.937,0.973,0.937,0.938,0.929,0.943,0.796,0.867,0.822,0.888,0.977,0.955,0.923,0.948,0.824,0.827
+NP_004175.2,1.0,1.0,1.0,,1.0,1.0,0.5,,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001191354.2,1.0,,1.0,0.655,,0.964,,,0.893,,0.893,0.952,0.905,,,,
+NP_003467.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_037514.2,1.0,1.0,1.0,1.0,1.0,0.85,0.933,0.933,0.767,0.85,0.8,1.0,0.967,0.933,0.967,0.867,0.9
+NP_005017.3,1.0,0.94,0.96,0.829,0.96,0.92,0.914,0.886,0.64,0.3,0.88,1.0,0.9,1.0,0.971,0.7,0.71
+NP_000312.2,0.968,0.879,0.99,0.906,0.827,0.963,0.81,0.823,0.798,0.617,0.922,0.943,0.829,0.905,0.881,0.85,0.867
+NP_001483.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,0.85,1.0,1.0,1.0,1.0,1.0,,
+NP_001761.3,1.0,1.0,1.0,,1.0,1.0,,0.667,0.5,0.25,1.0,1.0,1.0,,,,
+NP_006114.1,0.986,0.974,1.0,0.966,0.959,0.93,0.938,0.806,0.806,0.739,0.839,0.99,0.957,0.982,0.971,0.935,0.945
+NP_055496.2,1.0,1.0,1.0,1.0,1.0,0.857,0.929,1.0,0.857,0.786,0.964,1.0,1.0,1.0,1.0,0.964,1.0
+NP_006209.2,0.996,0.95,1.0,0.875,0.975,0.885,0.7,0.996,0.821,0.548,0.8,0.996,0.967,0.833,0.817,0.95,0.933
+NP_001017995.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.889,0.944,0.611,1.0,1.0,,,,
+NP_000651.3,1.0,1.0,1.0,,0.971,1.0,,0.829,0.914,0.743,0.8,1.0,1.0,,,,
+NP_004320.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.867,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_937983.2,1.0,0.993,0.999,0.909,0.98,0.995,0.895,0.978,0.955,0.691,0.936,1.0,0.984,0.964,0.99,0.972,0.984
+NP_001061.2,1.0,0.857,1.0,0.571,0.714,0.143,0.857,1.0,0.429,0.286,1.0,1.0,0.714,1.0,1.0,0.286,0.643
+NP_058515.1,0.382,0.971,0.794,,0.912,0.853,0.969,0.562,0.706,0.853,1.0,0.765,0.971,0.969,0.938,0.984,0.969
+NP_998885.1,1.0,1.0,0.95,0.6,0.95,0.75,0.85,,0.95,0.8,0.7,0.95,0.95,,,,
+NP_001153605.1,0.948,0.958,0.967,0.971,0.946,0.904,,0.885,0.846,0.314,0.901,0.959,0.94,,,,
+NP_002372.1,1.0,0.984,1.0,0.734,0.969,0.656,0.875,0.68,0.875,0.648,0.766,1.0,1.0,0.922,0.891,0.875,0.836
+NP_001020560.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_937799.1,0.958,0.798,0.747,0.885,0.939,0.693,0.8,0.436,0.972,0.918,0.849,0.93,0.93,0.703,0.752,0.751,0.735
+NP_001209.1,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,0.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005254.2,1.0,0.889,1.0,,1.0,1.0,0.889,0.556,0.667,0.889,0.944,0.889,0.667,1.0,1.0,0.889,0.889
+NP_004179.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.687,0.875,1.0,0.917,1.0,1.0,0.958,1.0
+NP_000534.3,1.0,0.983,1.0,0.988,0.998,0.986,0.889,0.985,0.838,0.703,0.988,0.998,0.98,0.988,0.984,,
+NP_001340.2,1.0,0.667,1.0,0.667,0.667,0.889,0.444,0.889,0.778,0.222,0.833,1.0,0.778,0.889,1.0,0.778,0.778
+NP_006651.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005369.2,1.0,0.943,1.0,0.943,1.0,1.0,0.943,0.829,1.0,0.857,0.886,0.943,0.943,1.0,1.0,1.0,0.971
+NP_001003722.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.812,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057092.2,0.909,0.591,1.0,,0.909,0.545,0.727,0.636,0.636,0.318,0.5,0.864,0.591,0.591,0.682,0.682,0.682
+NP_000062.1,1.0,0.991,1.0,,0.983,0.97,0.957,,0.97,0.422,0.966,0.996,0.987,0.983,0.987,0.95,0.953
+NP_063945.2,1.0,0.981,1.0,0.977,0.972,0.986,0.977,0.954,0.931,0.731,0.965,1.0,0.949,0.991,0.972,0.931,0.968
+NP_057161.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000899.1,1.0,0.875,1.0,1.0,0.75,0.75,0.75,,0.25,0.375,0.875,1.0,0.875,1.0,1.0,0.75,1.0
+NP_002038.2,0.991,0.913,1.0,0.905,0.939,0.898,0.926,0.887,0.706,0.571,0.727,0.991,0.892,0.948,0.952,0.835,0.866
+NP_002287.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001170787.2,0.991,0.991,0.939,0.965,0.956,0.833,0.947,0.825,0.789,0.974,0.807,1.0,1.0,0.93,0.974,0.781,0.833
+NP_003709.3,0.966,0.931,0.971,0.933,0.931,0.905,0.771,0.94,0.781,0.737,0.875,0.972,0.931,0.933,0.932,0.908,0.935
+NP_000286.3,0.781,0.938,0.836,,0.939,0.876,0.893,,0.868,0.815,0.812,0.864,0.947,0.848,0.864,0.906,0.905
+NP_003892.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065842.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.857,0.857,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005364.1,1.0,0.957,0.95,0.936,0.986,0.943,1.0,0.9,0.864,0.554,0.864,1.0,0.993,0.979,0.936,0.886,0.893
+NP_005699.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055660.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.75,1.0,1.0,1.0,1.0,0.875,1.0
+NP_060121.3,0.994,0.97,1.0,,0.98,0.937,0.97,0.928,0.748,0.925,0.914,1.0,0.974,0.981,0.987,0.854,0.908
+NP_054742.2,0.875,0.719,1.0,0.969,0.844,0.906,0.75,0.844,0.688,0.719,0.75,1.0,0.906,0.781,0.75,0.875,0.812
+NP_954986.2,1.0,0.965,0.991,0.974,0.93,0.895,0.904,1.0,0.825,0.789,0.965,1.0,0.982,0.947,0.956,0.974,0.974
+NP_060676.2,1.0,1.0,1.0,0.9,1.0,0.9,0.9,1.0,1.0,0.6,0.6,1.0,0.6,1.0,1.0,1.0,1.0
+NP_000061.1,0.995,0.917,0.992,0.786,0.958,0.94,0.808,1.0,0.864,0.493,0.867,0.991,0.945,0.786,0.857,0.979,0.979
+NP_005678.3,1.0,0.867,1.0,0.8,0.867,0.767,0.933,0.967,0.733,0.688,0.7,1.0,0.867,0.9,0.867,0.833,0.867
+NP_001275929.1,1.0,0.911,1.0,,1.0,0.689,,,0.733,0.711,0.922,1.0,1.0,,,0.9,0.944
+NP_054706.1,1.0,1.0,1.0,0.667,1.0,1.0,0.667,0.667,0.667,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000024.2,0.996,0.97,0.997,0.908,0.984,0.92,0.938,0.95,0.833,0.805,0.948,0.986,0.976,0.978,0.97,0.923,0.951
+NP_653234.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_620148.1,1.0,,1.0,,1.0,0.88,1.0,,0.92,,0.9,1.0,0.96,1.0,1.0,0.92,0.92
+NP_004808.2,1.0,,1.0,,,0.987,0.947,,0.88,,0.833,0.987,0.947,0.96,0.973,0.953,0.953
+NP_000229.1,0.969,0.954,0.999,0.932,0.941,0.872,0.95,0.906,0.779,0.53,0.878,0.937,0.928,0.927,0.935,0.873,0.93
+NP_001041629.1,0.0,1.0,0.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001025482.1,1.0,1.0,1.0,,1.0,1.0,,,0.867,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_001170991.1,0.994,0.995,0.993,,0.989,0.985,,,0.882,0.842,0.668,0.928,0.931,,,,
+NP_054831.2,0.984,0.937,1.0,0.984,0.883,0.885,,0.884,0.802,0.814,0.83,0.96,0.87,,,0.899,0.918
+NP_006236.1,0.929,0.905,0.976,0.798,0.857,0.881,0.952,0.863,0.798,0.94,0.667,0.929,0.929,0.881,0.893,0.935,0.952
+NP_006585.2,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,0.75,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000028.3,1.0,0.75,1.0,,0.938,0.938,0.938,,0.75,0.75,0.719,1.0,0.875,0.938,0.938,0.938,0.875
+NP_000009.1,1.0,0.995,0.995,,0.993,1.0,0.976,,0.906,0.714,1.0,0.995,0.993,0.998,1.0,0.983,0.986
+NP_068369.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.75,0.938,1.0,1.0,,,0.5,0.5
+NP_001177716.1,1.0,0.667,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,0.667,1.0,1.0
+NP_665877.1,1.0,1.0,1.0,,1.0,1.0,,1.0,0.0,1.0,0.5,1.0,1.0,,,,
+NP_061485.1,0.929,1.0,0.714,1.0,0.786,0.929,,1.0,0.357,0.786,0.5,0.929,0.929,,,,
+NP_002863.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000118.2,1.0,1.0,1.0,0.985,0.985,0.994,1.0,0.905,0.864,0.911,0.922,0.994,0.981,0.996,0.994,0.966,0.976
+NP_060476.2,1.0,0.985,1.0,0.977,1.0,0.931,1.0,0.977,0.923,0.885,0.8,1.0,0.969,0.992,0.977,0.931,0.938
+NP_002606.3,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,0.333,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079436.4,0.933,0.733,0.967,,0.833,0.7,0.867,0.667,0.5,0.7,0.5,0.9,0.867,0.733,0.733,,
+NP_005624.2,0.974,0.917,0.985,0.914,0.95,0.862,0.892,0.914,0.735,0.699,0.744,0.966,0.927,0.902,0.897,0.813,0.848
+NP_005242.1,1.0,,1.0,,,1.0,1.0,,1.0,,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001444.2,1.0,1.0,1.0,,1.0,0.925,1.0,,0.944,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_073625.1,0.5,0.5,0.625,0.5,0.5,0.5,0.375,0.625,0.5,0.125,0.625,0.5,0.5,0.5,0.75,0.5,0.5
+NP_001153508.1,0.879,0.844,0.881,0.932,0.815,0.861,,0.862,0.799,0.771,0.665,0.895,0.858,,,,
+NP_115927.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001529.2,1.0,1.0,1.0,,,1.0,1.0,,0.9,1.0,0.725,0.9,0.9,,,,
+NP_003551.2,1.0,0.996,1.0,0.921,0.996,0.992,0.974,0.977,0.762,0.938,0.736,1.0,0.989,1.0,1.0,0.932,0.966
+NP_003871.1,1.0,0.889,1.0,,0.956,0.889,0.889,,0.756,0.8,0.922,0.978,0.889,0.889,0.867,0.778,0.778
+NP_001129507.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001107655.1,0.786,0.571,0.893,,0.714,0.643,0.643,,0.5,0.429,0.643,0.75,0.607,0.643,0.75,0.589,0.714
+NP_001123577.1,0.978,,1.0,0.911,,1.0,,,0.8,,0.611,0.889,0.889,,,0.889,0.933
+NP_002766.1,1.0,0.938,0.985,0.969,0.985,0.977,0.923,0.954,0.938,0.862,0.985,0.985,0.938,0.954,0.954,0.954,0.954
+NP_116023.2,1.0,0.889,1.0,1.0,0.911,0.844,0.905,0.857,0.689,0.956,0.722,0.911,0.867,1.0,1.0,0.867,0.844
+NP_001077088.2,1.0,,1.0,,,0.75,,,0.85,,1.0,1.0,0.95,,,,
+NP_006584.1,1.0,0.986,1.0,1.0,0.986,0.972,1.0,,0.667,0.875,0.812,1.0,0.986,1.0,1.0,0.868,0.944
+NP_004172.2,1.0,1.0,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_003910.1,0.938,1.0,1.0,,1.0,1.0,1.0,,0.875,0.75,0.719,0.938,0.875,,,1.0,0.833
+NP_000160.1,0.997,0.991,0.998,0.999,0.994,0.986,0.915,0.984,0.943,0.443,0.952,1.0,0.99,0.985,0.995,0.984,0.98
+NP_006867.1,0.9,0.85,1.0,0.5,0.6,0.7,0.7,0.5,0.5,0.1,0.55,0.8,0.7,0.5,0.5,0.4,0.5
+NP_001355847.1,,,,,,,,,,,,,,,,,
+NP_003982.1,1.0,,1.0,,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000022.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008870.2,0.96,0.954,0.956,0.91,0.941,0.872,0.735,0.876,0.662,0.897,0.824,0.985,0.971,0.88,0.908,0.646,0.676
+NP_006027.2,1.0,0.667,1.0,0.533,0.867,1.0,0.733,0.933,0.867,0.429,0.4,0.933,0.267,0.867,1.0,0.567,0.733
+NP_597700.1,1.0,1.0,1.0,1.0,1.0,0.889,0.667,1.0,1.0,0.556,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056289.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008919.3,1.0,0.778,1.0,0.667,0.778,0.667,0.444,0.889,0.333,0.222,0.611,1.0,0.667,0.556,0.444,0.667,0.667
+NP_000472.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,0.7,1.0,1.0,1.0,1.0,1.0,0.98,1.0
+NP_002011.2,1.0,1.0,1.0,0.956,1.0,0.972,1.0,0.9,0.906,0.667,0.961,1.0,1.0,1.0,1.0,0.994,0.994
+NP_004367.2,1.0,1.0,1.0,,1.0,1.0,,,0.917,0.5,0.875,1.0,1.0,,,1.0,1.0
+NP_005509.1,1.0,0.909,1.0,1.0,0.636,0.636,1.0,0.818,0.909,0.727,0.818,0.909,0.818,0.909,0.909,0.364,0.455
+NP_653164.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_619646.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.5
+NP_945316.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000483.3,0.997,0.997,0.987,0.981,0.999,0.998,0.961,0.932,0.99,0.591,0.989,1.0,1.0,0.996,0.998,0.997,0.997
+NP_612433.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.889,0.889,1.0,1.0,,,,
+NP_000124.1,0.981,0.937,0.985,0.909,0.936,0.954,0.88,0.859,0.854,0.718,0.899,0.99,0.961,0.943,0.947,0.923,0.927
+NP_000161.2,0.997,0.977,0.999,0.944,0.95,0.912,0.969,0.937,0.867,0.581,0.795,0.999,0.974,0.96,0.956,0.922,0.932
+NP_005641.1,0.942,0.942,0.984,0.984,0.868,0.878,0.984,0.833,0.831,0.688,0.817,0.921,0.841,0.974,0.963,0.884,0.926
+NP_109590.3,0.812,0.735,0.811,0.853,0.786,0.875,0.74,0.796,0.673,0.418,0.524,0.827,0.779,0.851,0.811,0.695,0.709
+NP_002244.1,1.0,,1.0,0.917,,0.917,0.75,,1.0,,0.875,1.0,0.917,1.0,1.0,0.875,0.958
+NP_000396.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_037386.1,1.0,0.545,0.909,1.0,0.818,0.727,0.273,0.591,0.5,0.545,0.955,0.955,0.591,0.864,0.909,0.364,0.409
+NP_000099.2,1.0,1.0,1.0,1.0,1.0,1.0,0.952,0.952,0.81,0.429,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_995322.1,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,0.8,0.6,0.85,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115484.2,0.864,,1.0,0.879,0.455,0.758,0.97,0.788,0.591,0.712,0.803,0.97,0.652,0.727,0.773,0.795,0.864
+NP_005857.1,1.0,0.7,1.0,0.9,1.0,0.5,0.7,0.7,0.6,0.7,0.75,1.0,0.9,0.8,0.8,0.6,0.8
+NP_001307852.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,0.875,0.875,1.0,1.0,0.938,1.0,1.0,1.0
+NP_001839.2,0.974,0.965,0.971,0.95,0.97,0.877,0.949,0.413,0.721,0.943,0.894,0.97,0.954,0.93,0.944,0.865,0.908
+NP_000538.3,1.0,0.987,0.929,0.975,0.968,0.891,0.934,0.727,0.801,0.679,0.724,1.0,0.987,0.909,0.893,0.91,0.897
+NP_000496.2,1.0,0.8,1.0,0.6,0.9,0.7,0.3,0.8,0.6,0.6,0.2,1.0,1.0,0.5,0.8,0.6,0.6
+NP_001973.2,1.0,0.667,1.0,0.833,0.722,0.778,0.667,0.667,0.389,0.444,0.556,0.778,0.778,0.722,0.722,0.667,0.639
+NP_000138.2,1.0,1.0,1.0,1.0,1.0,0.955,0.818,0.909,0.864,0.955,0.955,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002217.3,1.0,1.0,1.0,1.0,1.0,0.833,0.667,1.0,0.667,0.167,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000203.2,1.0,,1.0,0.956,0.931,0.836,0.936,0.882,0.576,0.792,0.774,0.995,0.936,0.916,0.927,0.857,0.851
+NP_443711.2,1.0,0.878,1.0,0.84,0.911,0.878,,0.747,0.767,0.322,0.667,0.989,0.922,,,,
+NP_001354903.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001365421.1,1.0,1.0,1.0,0.85,1.0,1.0,1.0,0.85,0.95,0.675,0.75,1.0,1.0,1.0,1.0,0.675,0.65
+NP_000470.3,1.0,0.812,1.0,0.562,0.812,1.0,0.562,0.938,1.0,0.25,0.75,0.875,0.812,0.75,0.438,1.0,0.938
+NP_003051.1,0.996,0.998,0.982,,0.991,0.987,0.961,0.965,0.959,0.787,0.928,0.99,0.995,0.988,0.987,0.996,0.994
+NP_006000.2,0.993,1.0,0.987,,1.0,0.94,0.966,0.953,0.899,0.869,0.5,0.953,0.953,0.946,0.953,0.963,0.96
+NP_005991.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,0.0,1.0
+NP_006077.2,0.721,0.812,1.0,0.906,0.919,0.779,0.703,0.693,0.577,0.562,0.5,0.847,0.838,0.792,0.75,0.656,0.656
+NP_001060.1,0.667,0.512,0.845,0.56,0.685,0.583,0.554,,0.827,0.366,0.643,0.536,0.53,0.655,0.655,0.548,0.589
+NP_821080.1,0.645,0.847,0.887,0.242,0.629,0.968,0.548,,0.629,0.444,0.734,0.774,0.79,0.532,0.565,0.484,0.5
+NP_821133.1,1.0,0.957,1.0,0.957,1.0,0.761,1.0,0.87,0.957,0.565,1.0,1.0,0.826,1.0,0.957,0.174,0.435
+NP_110400.1,1.0,1.0,1.0,1.0,1.0,0.872,0.974,0.821,0.59,1.0,0.731,1.0,1.0,0.897,0.897,0.923,1.0
+NP_006078.2,0.709,0.827,0.709,0.727,0.691,0.658,0.406,0.694,0.558,0.742,0.506,0.594,0.594,0.406,0.455,0.5,0.509
+NP_060733.4,0.943,0.721,0.971,,0.857,0.943,0.9,0.986,0.543,0.764,0.686,0.986,0.914,0.929,1.0,0.857,0.871
+NP_005583.1,1.0,0.979,1.0,1.0,0.979,0.885,0.896,0.896,0.812,0.375,0.812,1.0,0.979,0.979,0.958,0.958,0.99
+NP_005985.3,1.0,0.75,1.0,1.0,0.5,1.0,0.75,,0.25,0.25,0.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_001030025.1,0.994,0.986,0.99,,0.983,0.97,,,0.951,0.962,0.934,0.975,0.956,,,0.944,0.976
+NP_004010.1,0.971,0.914,1.0,,1.0,1.0,,,0.629,0.857,0.643,1.0,1.0,,,,
+NP_002308.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,0.778,1.0,1.0,1.0,1.0,1.0,1.0,0.778,0.944
+NP_000252.1,1.0,0.99,1.0,0.974,0.98,0.895,0.984,0.901,0.832,0.91,0.916,1.0,0.97,0.98,0.962,0.892,0.938
+NP_006423.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_005661.1,1.0,1.0,1.0,1.0,0.9,1.0,0.95,0.9,0.8,0.725,0.95,1.0,1.0,,,0.65,0.65
+NP_000315.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000827.2,0.949,0.904,0.985,0.944,0.884,0.962,0.965,,0.838,0.662,0.914,0.944,0.843,0.975,0.975,0.942,0.949
+NP_000213.1,1.0,0.98,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.92,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036345.2,1.0,1.0,1.0,1.0,0.818,0.909,0.909,0.727,1.0,0.364,0.909,1.0,1.0,1.0,1.0,1.0,1.0
+NP_478123.1,0.958,,0.958,,0.917,0.917,0.875,,0.375,0.812,0.958,0.958,0.917,,,,
+NP_001859.1,1.0,1.0,1.0,1.0,1.0,1.0,0.909,1.0,0.97,0.182,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_076932.1,1.0,0.833,1.0,,1.0,1.0,0.667,,0.5,0.5,0.75,1.0,1.0,1.0,1.0,0.917,1.0
+NP_056372.1,1.0,1.0,1.0,1.0,1.0,0.958,1.0,1.0,0.833,0.792,0.708,1.0,1.0,1.0,1.0,0.917,0.958
+NP_001388.1,1.0,1.0,1.0,1.0,0.6,1.0,1.0,0.8,0.4,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001070712.1,1.0,1.0,0.5,,1.0,1.0,,,1.0,0.0,0.75,0.5,0.5,,,,
+NP_037383.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_063916.1,1.0,1.0,0.968,,1.0,0.968,,,0.968,0.871,0.935,1.0,0.968,,,,
+NP_115497.4,1.0,1.0,1.0,1.0,1.0,0.909,1.0,1.0,0.909,1.0,0.727,1.0,0.909,,,,
+NP_000257.1,1.0,,1.0,,,1.0,1.0,,0.857,,1.0,1.0,1.0,1.0,1.0,0.964,1.0
+NP_001137453.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.875,0.4,1.0,1.0,1.0,1.0,,,,
+NP_000275.1,0.994,0.97,0.994,,0.939,0.933,0.964,,0.788,0.455,0.782,0.984,0.955,0.95,0.955,0.863,0.896
+NP_001035937.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,0.833,1.0,1.0,,,1.0,1.0
+NP_003886.3,0.96,0.96,1.0,,1.0,1.0,0.96,,1.0,0.4,0.84,0.96,0.96,,,0.941,0.941
+NP_000797.2,0.829,,0.867,,,0.852,0.843,,0.772,,0.629,0.814,0.796,0.779,0.777,0.897,0.893
+NP_001245218.1,1.0,,1.0,1.0,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_004981.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.625,0.75,0.75,1.0,1.0,1.0,1.0,0.938,1.0
+NP_005899.3,1.0,,1.0,,,1.0,,,1.0,,0.889,1.0,0.889,,,,
+NP_065851.1,1.0,1.0,1.0,0.857,1.0,0.857,0.714,0.857,0.429,0.0,0.786,1.0,0.714,0.714,0.857,0.714,0.714
+NP_000177.2,1.0,0.981,1.0,1.0,0.886,0.895,,0.981,0.819,0.748,0.786,1.0,0.99,,,,
+NP_001287771.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,,0.688,1.0,1.0,,,1.0,1.0
+NP_079272.4,0.957,0.935,0.976,,0.962,0.877,0.901,0.953,0.909,0.601,0.793,0.981,0.942,0.901,0.943,0.904,0.918
+NP_002067.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002600.1,0.994,0.984,1.0,0.969,0.986,0.939,0.986,0.924,0.688,0.9,0.96,0.992,0.957,0.984,0.987,0.858,0.898
+NP_000548.2,1.0,1.0,1.0,,0.889,1.0,1.0,,1.0,1.0,0.906,1.0,1.0,,,1.0,1.0
+NP_036311.3,1.0,0.917,1.0,0.917,0.917,0.75,1.0,0.917,0.75,0.583,0.5,1.0,0.917,0.75,0.917,0.958,0.792
+NP_077277.1,0.982,0.957,0.982,0.913,0.957,0.902,0.87,0.826,0.929,0.732,0.929,1.0,0.804,0.87,0.87,1.0,1.0
+NP_055069.1,1.0,1.0,1.0,1.0,1.0,0.8,1.0,0.8,0.9,,0.9,1.0,1.0,1.0,1.0,0.9,1.0
+NP_001269613.2,,,,,,,,,,,,,,,,,
+NP_000708.1,1.0,0.857,0.786,0.571,0.5,0.5,0.571,0.5,0.786,0.857,0.714,0.929,0.857,0.714,0.643,0.571,0.643
+NP_665859.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003631.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.921,0.947,0.974,0.711,1.0,1.0,1.0,1.0,0.908,0.934
+NP_001106962.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.25,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689703.1,1.0,1.0,1.0,0.75,1.0,0.875,1.0,0.5,0.5,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.75
+NP_000360.2,0.967,0.909,0.901,0.959,0.975,0.764,,0.901,0.744,0.736,0.913,0.917,0.917,,,,
+NP_004854.1,0.979,0.938,1.0,0.854,0.917,1.0,0.958,0.979,0.812,0.562,0.708,0.938,0.854,0.917,0.896,0.896,0.896
+NP_000219.2,1.0,1.0,0.993,,1.0,0.897,0.952,,0.697,0.979,0.931,1.0,0.993,0.952,0.91,0.869,0.9
+NP_000293.2,1.0,1.0,1.0,0.857,1.0,0.964,1.0,0.929,0.929,0.679,0.929,1.0,1.0,1.0,1.0,0.929,0.929
+NP_000194.2,1.0,0.913,0.999,0.91,0.906,0.854,0.939,0.861,0.804,0.738,0.828,0.996,0.947,0.935,0.94,0.864,0.895
+NP_919299.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001076007.1,1.0,,1.0,,1.0,1.0,,,1.0,,0.75,1.0,1.0,,,1.0,1.0
+NP_001457.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005219.2,0.962,0.987,0.962,0.775,0.987,0.846,0.875,0.85,0.808,0.769,0.808,0.974,0.923,0.85,0.85,0.929,0.936
+NP_036335.1,1.0,0.946,1.0,1.0,0.982,0.875,0.893,0.946,0.714,0.812,0.812,1.0,1.0,0.982,0.982,0.839,0.857
+NP_001138341.1,1.0,0.875,1.0,0.875,1.0,0.812,1.0,,0.5,0.75,0.75,1.0,0.875,1.0,1.0,0.875,1.0
+NP_149091.1,1.0,,1.0,0.913,0.957,0.391,1.0,,0.348,,0.826,1.0,1.0,1.0,1.0,0.609,0.609
+NP_002445.2,1.0,0.961,1.0,,0.961,1.0,,,1.0,0.784,0.941,1.0,1.0,0.863,0.902,,
+NP_705902.2,0.981,0.962,0.975,0.994,0.975,0.928,1.0,0.862,0.806,0.966,0.381,1.0,0.956,0.975,0.894,0.894,0.875
+NP_004846.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,0.85,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002325.2,1.0,1.0,1.0,1.0,1.0,0.846,0.923,0.885,0.923,1.0,0.885,1.0,0.962,1.0,1.0,0.885,0.808
+NP_001708.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004472.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003570.2,0.931,0.906,0.969,,0.894,0.934,0.875,,0.8,0.572,0.775,0.95,0.906,0.944,0.912,0.912,0.969
+NP_002226.1,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,,
+NP_000131.2,0.971,0.857,1.0,,0.886,0.886,0.686,,0.6,0.529,0.886,0.971,0.914,0.886,0.943,0.871,0.914
+NP_000205.1,0.967,0.956,0.973,0.955,0.94,0.903,0.934,0.916,0.754,0.909,0.846,0.975,0.96,0.937,0.936,0.938,0.945
+NP_036566.1,1.0,0.944,1.0,1.0,0.944,0.889,1.0,1.0,1.0,0.889,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000283.1,1.0,0.99,1.0,0.973,1.0,0.897,0.953,0.934,0.823,0.798,0.87,1.0,0.998,0.998,0.979,0.972,0.99
+NP_009146.2,1.0,0.833,1.0,1.0,0.833,1.0,1.0,1.0,0.833,0.833,0.833,1.0,0.5,1.0,1.0,0.833,0.833
+NP_001304019.1,1.0,0.917,1.0,1.0,1.0,0.896,0.833,,0.5,0.708,0.896,0.792,0.75,0.958,1.0,0.833,0.875
+NP_001037850.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,,1.0,1.0,1.0,1.0,1.0,,
+NP_001273644.1,1.0,,1.0,1.0,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_003887.3,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_877435.3,1.0,1.0,1.0,0.889,1.0,1.0,0.778,0.889,1.0,0.889,0.944,1.0,1.0,1.0,1.0,0.944,1.0
+NP_001229825.1,1.0,0.962,1.0,,1.0,1.0,,,0.712,0.95,0.795,1.0,0.985,,,1.0,1.0
+NP_001025053.1,0.929,0.75,0.918,,0.663,0.848,0.81,,0.397,0.603,0.815,0.946,0.891,0.837,0.94,0.587,0.696
+NP_640335.2,1.0,0.875,1.0,0.906,0.969,0.875,0.844,0.656,0.875,0.625,0.844,1.0,0.906,0.875,0.812,0.781,0.766
+NP_001908.3,0.5,0.0,1.0,,0.0,0.0,0.25,,0.25,0.25,0.0,0.0,0.0,,,,
+NP_060530.3,1.0,0.857,1.0,0.864,0.864,0.818,0.818,,0.864,,0.682,0.955,0.818,0.864,0.864,0.864,0.864
+NP_055120.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078874.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078772.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.411,0.964,0.0,0.857,1.0,1.0,1.0,0.964,0.964,0.964
+NP_004272.2,1.0,1.0,1.0,1.0,1.0,0.994,1.0,0.678,0.967,1.0,1.0,1.0,0.989,0.989,0.989,0.961,1.0
+NP_004077.1,1.0,1.0,1.0,,0.875,0.705,1.0,0.893,0.714,0.714,0.812,0.982,0.964,0.964,0.946,0.741,0.768
+NP_071372.1,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,0.7,0.6,0.7,1.0,0.9,0.9,1.0,0.8,0.8
+NP_852664.1,1.0,0.917,1.0,0.917,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,0.917,1.0,1.0
+NP_490597.1,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,0.812,0.531,0.969,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000036.2,1.0,1.0,1.0,,1.0,0.846,1.0,,0.538,0.308,0.423,1.0,1.0,0.692,1.0,1.0,1.0
+NP_006026.3,0.889,0.818,0.949,0.909,0.889,0.884,0.97,0.97,0.788,0.424,0.818,0.949,0.869,0.899,0.859,0.879,0.833
+NP_789845.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,0.975,0.938,1.0,1.0,1.0,1.0,1.0,1.0
+NP_659428.2,1.0,,1.0,0.857,,0.881,1.0,,0.714,,0.857,0.976,0.857,1.0,1.0,0.81,0.952
+NP_000541.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.8,0.767,0.867,0.917,1.0,1.0,1.0,1.0,0.825,0.8
+NP_002733.2,1.0,,0.955,,0.727,1.0,0.773,,0.864,,0.727,1.0,0.773,0.818,0.727,0.773,0.773
+NP_000147.1,1.0,1.0,1.0,,0.974,0.868,0.886,,0.772,0.781,0.961,1.0,0.965,0.956,0.956,0.763,0.768
+NP_001003800.1,0.991,0.941,0.993,0.895,0.955,0.889,,0.9,0.746,0.948,0.78,0.979,0.931,,,0.869,0.876
+NP_001254538.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,0.5,1.0,1.0,,,,
+NP_000082.2,0.999,0.999,1.0,0.996,0.994,0.98,0.965,0.63,0.835,0.965,0.967,1.0,0.999,0.989,0.994,0.902,0.957
+NP_523353.2,1.0,1.0,1.0,1.0,1.0,0.8,0.6,0.6,0.6,0.6,1.0,1.0,0.8,1.0,1.0,0.0,0.0
+NP_005112.2,0.865,0.721,0.942,0.779,0.75,0.851,0.76,0.731,0.721,0.529,0.716,0.875,0.75,0.827,0.846,0.712,0.774
+NP_002463.2,1.0,1.0,1.0,0.955,0.864,1.0,1.0,0.773,1.0,0.955,0.795,1.0,1.0,0.818,0.818,1.0,1.0
+NP_001222.3,1.0,0.571,1.0,0.429,0.143,0.0,0.0,0.286,0.0,0.286,0.0,0.714,0.286,0.143,0.143,0.0,0.0
+NP_000279.1,1.0,0.833,1.0,,0.944,0.972,,,0.889,0.222,0.778,1.0,0.944,,,,
+NP_057444.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.833,1.0,1.0,1.0,1.0,0.667,0.333
+NP_004773.1,1.0,0.5,1.0,1.0,1.0,1.0,0.75,1.0,0.0,,0.25,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001014839.1,0.875,0.75,0.875,,0.625,0.875,1.0,,0.75,0.0,0.875,0.75,0.75,0.875,0.75,0.875,0.875
+NP_060404.4,0.923,0.891,0.923,0.92,0.891,0.83,0.774,0.863,0.682,0.393,0.862,0.935,0.914,0.884,0.911,0.719,0.805
+NP_071357.2,0.74,0.365,0.698,0.688,0.688,0.448,0.385,0.495,0.271,0.411,0.495,0.625,0.312,0.562,0.417,0.448,0.469
+NP_000190.1,0.996,0.954,0.983,0.926,0.89,0.894,0.711,0.889,0.776,0.695,0.684,0.968,0.878,0.937,0.895,0.893,0.88
+NP_001116229.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_000414.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.935,0.826,0.88,1.0,1.0,0.989,0.978,0.989,0.913,1.0
+NP_000217.2,1.0,1.0,1.0,0.984,0.984,0.921,0.873,0.857,1.0,0.937,0.968,0.968,0.952,0.921,0.921,0.952,0.952
+NP_620170.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000067.1,1.0,1.0,1.0,0.971,1.0,1.0,1.0,0.771,0.857,0.943,0.857,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006748.1,1.0,1.0,1.0,,1.0,0.667,,1.0,0.667,1.0,0.5,1.0,1.0,,,0.833,1.0
+NP_997253.2,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004571.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002569.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.25,1.0,1.0,1.0,,,1.0,1.0
+NP_006454.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.833,0.667,1.0,1.0,1.0,1.0,1.0,1.0,0.917
+NP_001001430.1,0.988,0.977,0.988,,0.977,0.962,,,0.977,0.837,0.936,0.983,0.953,,,,
+NP_000678.1,1.0,1.0,1.0,0.875,1.0,1.0,0.75,0.688,0.812,1.0,1.0,1.0,1.0,1.0,1.0,0.812,0.812
+NP_543136.1,1.0,1.0,1.0,0.9,0.95,0.9,1.0,0.85,0.8,1.0,0.95,1.0,1.0,1.0,1.0,0.85,0.9
+NP_054774.2,1.0,0.8,1.0,,0.6,0.6,,,1.0,0.8,0.6,1.0,0.8,,,1.0,1.0
+NP_055790.1,1.0,1.0,1.0,1.0,1.0,0.976,0.917,,0.952,0.702,1.0,1.0,1.0,1.0,0.988,0.976,0.976
+NP_003733.2,0.999,0.997,0.994,,0.981,0.984,0.956,0.974,0.895,0.699,0.879,0.999,0.996,0.98,0.987,0.963,0.981
+NP_000126.2,0.987,0.967,0.979,0.954,0.988,0.968,0.926,,0.913,0.834,0.912,1.0,0.977,0.962,0.966,0.909,0.917
+NP_001273003.2,,,,,,,,,,,,,,,,,
+NP_003184.1,1.0,,1.0,,,0.939,0.848,,0.879,,0.848,1.0,1.0,0.788,0.848,0.97,1.0
+NP_006260.1,1.0,0.978,1.0,0.913,1.0,0.947,0.957,0.978,0.915,0.804,0.968,0.989,0.963,1.0,0.978,1.0,0.978
+NP_055414.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.429,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079146.2,1.0,0.667,1.0,0.667,0.667,1.0,0.667,0.333,0.667,0.667,1.0,1.0,0.667,0.667,0.667,0.667,0.667
+NP_001185797.1,1.0,0.833,1.0,,0.867,0.778,0.833,,0.722,0.889,0.917,1.0,0.833,0.8,0.733,0.733,0.733
+NP_689807.1,0.972,1.0,1.0,1.0,1.0,0.889,0.972,0.75,0.778,0.917,0.917,1.0,1.0,1.0,1.0,0.958,0.944
+NP_000317.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.917,0.0,0.667,1.0,1.0,1.0,0.833,1.0,1.0
+NP_938012.2,1.0,0.733,1.0,0.733,0.867,0.633,0.8,0.85,0.6,0.767,0.8,1.0,0.9,0.767,0.7,0.55,0.55
+NP_000544.2,1.0,0.906,1.0,1.0,0.938,0.969,0.938,0.938,0.938,0.531,0.953,1.0,0.969,0.938,0.969,0.906,0.891
+NP_848612.2,1.0,,1.0,,,1.0,,,1.0,,0.929,1.0,1.0,,,,
+NP_002065.1,0.94,,0.925,,0.835,0.91,0.745,,0.6,,0.625,0.88,0.885,0.96,0.975,0.822,0.865
+NP_067642.1,1.0,1.0,1.0,0.886,1.0,0.971,0.743,0.8,0.8,0.743,0.643,1.0,1.0,1.0,1.0,0.914,0.914
+NP_005264.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001072982.1,1.0,0.333,1.0,,0.333,0.667,0.333,,0.333,0.0,0.0,1.0,0.333,0.0,0.0,0.667,1.0
+NP_000935.1,0.927,0.836,0.982,0.939,0.873,0.691,0.697,0.606,0.655,0.4,0.6,0.909,0.855,0.848,0.848,0.864,0.879
+NP_003788.2,1.0,1.0,1.0,,1.0,1.0,0.9,1.0,1.0,0.3,0.75,1.0,0.9,1.0,0.7,1.0,1.0
+NP_005174.2,0.983,0.964,1.0,0.977,0.994,0.861,0.985,0.692,0.756,0.589,0.753,1.0,0.989,0.985,0.992,0.942,0.992
+NP_001978.1,1.0,1.0,1.0,0.933,1.0,0.967,1.0,0.867,0.667,0.6,1.0,1.0,1.0,0.8,0.8,1.0,0.933
+NP_000120.2,1.0,0.971,1.0,1.0,1.0,0.8,,0.971,0.886,0.971,0.914,1.0,1.0,,,,
+NP_031401.1,0.588,,0.706,,,0.176,,,0.882,0.941,0.5,0.941,0.765,,,,
+NP_689735.1,1.0,0.0,1.0,1.0,0.0,0.0,1.0,0.0,0.0,,1.0,1.0,1.0,0.0,0.0,0.0,0.0
+NP_000128.1,1.0,0.8,0.982,0.867,0.933,0.795,0.755,0.433,0.809,0.918,0.536,0.973,0.873,0.9,0.867,0.4,0.717
+NP_055856.1,1.0,,1.0,,0.5,0.875,0.875,0.875,0.875,1.0,0.875,1.0,1.0,0.75,0.625,0.875,0.875
+NP_000240.1,0.981,0.892,0.991,0.948,0.897,0.876,0.9,0.889,0.672,0.748,0.659,0.984,0.883,0.941,0.927,0.877,0.905
+NP_001007528.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689632.2,1.0,1.0,1.0,0.987,1.0,0.951,,1.0,0.942,0.854,0.945,1.0,1.0,,,,
+NP_001357587.1,1.0,1.0,0.996,,,0.988,,,0.937,1.0,0.857,0.984,0.945,,,,
+NP_001164672.1,1.0,,1.0,1.0,1.0,1.0,,,1.0,,0.75,1.0,1.0,,,,
+NP_054768.2,0.99,0.948,0.997,0.892,0.924,0.944,0.944,0.868,0.951,0.799,0.786,0.99,0.958,0.92,0.917,0.877,0.892
+NP_964012.2,1.0,0.929,1.0,1.0,1.0,0.929,0.929,0.929,1.0,0.929,0.786,1.0,1.0,0.929,1.0,1.0,1.0
+NP_775771.3,0.944,0.833,1.0,0.944,0.889,0.944,0.889,0.944,0.167,0.0,0.944,0.944,0.778,1.0,0.889,0.5,0.833
+NP_056028.2,1.0,1.0,1.0,0.8,1.0,1.0,0.2,1.0,1.0,0.2,1.0,1.0,1.0,0.6,0.8,1.0,1.0
+NP_000368.1,0.996,0.993,0.996,0.945,0.855,0.994,0.883,0.8,0.981,0.572,0.869,0.998,0.987,0.945,0.945,0.909,0.909
+NP_065166.2,1.0,1.0,1.0,0.9,1.0,1.0,1.0,,0.7,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057272.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.625,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001015880.1,1.0,1.0,1.0,,0.833,1.0,1.0,,1.0,1.0,0.917,1.0,1.0,1.0,1.0,0.917,0.917
+NP_006525.2,0.333,0.667,0.667,,1.0,1.0,,,1.0,0.0,0.667,0.667,1.0,,,1.0,1.0
+NP_036584.1,1.0,,1.0,,1.0,1.0,1.0,,0.6,0.8,0.8,1.0,1.0,,,,
+NP_000181.2,0.994,0.985,0.994,,0.938,0.952,0.926,0.889,0.855,0.716,0.7,1.0,0.988,0.951,0.951,0.858,0.877
+NP_056087.2,0.971,0.722,0.956,,0.75,0.735,,,0.735,0.556,0.721,0.956,0.838,,,0.757,0.765
+NP_001349700.1,1.0,,1.0,,,1.0,1.0,,0.988,,0.5,1.0,1.0,1.0,1.0,0.952,0.988
+NP_000987.2,1.0,1.0,1.0,1.0,1.0,0.25,1.0,0.75,0.75,,0.5,1.0,1.0,0.75,0.75,0.75,0.75
+NP_005211.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001011658.1,1.0,0.5,1.0,,1.0,1.0,1.0,,1.0,0.5,0.75,1.0,1.0,1.0,1.0,0.5,0.5
+NP_899200.1,0.975,0.967,1.0,0.9,0.96,0.961,0.935,0.967,0.828,0.895,0.898,0.997,0.975,0.978,0.975,0.897,0.89
+NP_004259.3,1.0,1.0,1.0,0.889,0.889,0.778,1.0,0.944,1.0,0.667,0.889,1.0,0.944,1.0,1.0,1.0,1.0
+NP_004159.2,1.0,0.952,1.0,0.857,1.0,0.881,1.0,0.81,0.762,0.81,0.75,1.0,0.929,1.0,1.0,1.0,1.0
+NP_006512.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001796.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000058.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065099.3,0.953,0.953,1.0,0.938,0.953,1.0,0.938,0.896,0.891,1.0,0.906,0.969,0.969,0.896,0.875,0.875,0.906
+NP_001008214.1,1.0,1.0,1.0,,0.917,0.917,1.0,,1.0,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005052.1,1.0,1.0,1.0,0.75,1.0,1.0,0.75,1.0,0.75,0.5,1.0,1.0,1.0,1.0,1.0,0.875,1.0
+NP_001180345.1,0.986,,0.986,1.0,,0.898,0.843,,0.88,,0.833,0.981,0.977,0.995,0.995,0.968,0.968
+NP_001073986.1,0.98,0.957,0.995,0.968,0.908,0.933,0.94,0.957,0.723,0.342,0.892,0.971,0.882,0.982,0.986,0.885,0.957
+NP_057122.2,1.0,1.0,1.0,0.429,1.0,0.857,0.429,0.857,0.571,0.143,1.0,1.0,1.0,0.857,1.0,1.0,0.857
+NP_000320.1,1.0,0.737,1.0,0.704,0.911,0.831,0.785,0.785,0.817,0.653,0.793,0.989,0.812,0.78,0.833,0.712,0.723
+NP_078941.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.3,0.988,1.0,1.0,1.0,1.0,1.0,1.0
+NP_891847.1,0.932,0.903,0.983,0.915,0.864,0.906,0.903,0.886,0.801,0.452,0.83,0.972,0.926,0.915,0.92,0.75,0.801
+NP_004964.2,1.0,0.2,1.0,0.933,1.0,0.867,0.867,0.867,0.8,0.267,0.967,0.8,0.333,0.933,1.0,0.967,0.967
+NP_005887.2,1.0,1.0,0.958,,0.958,1.0,0.958,,0.958,0.729,0.917,0.958,1.0,0.917,0.917,0.792,0.792
+NP_001290425.1,1.0,0.835,1.0,0.72,0.885,0.808,0.775,0.74,0.745,0.832,0.63,0.995,0.805,0.76,0.765,0.755,0.755
+NP_003070.3,1.0,1.0,1.0,1.0,1.0,0.778,1.0,1.0,0.0,1.0,0.5,1.0,1.0,1.0,1.0,0.222,0.389
+NP_112420.1,1.0,0.667,1.0,,0.667,1.0,,,0.333,0.0,0.167,1.0,0.333,,,0.333,0.667
+NP_954659.1,1.0,1.0,1.0,,0.8,1.0,1.0,,0.0,1.0,0.7,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001035.1,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060011.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_116250.3,1.0,1.0,1.0,1.0,1.0,0.939,1.0,0.909,0.909,0.97,0.955,1.0,1.0,1.0,1.0,0.939,0.97
+NP_078792.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055800.2,1.0,0.912,1.0,0.846,0.941,1.0,0.923,0.923,0.824,0.559,0.941,1.0,1.0,0.962,0.923,0.846,0.846
+NP_004526.1,1.0,0.667,1.0,,0.833,0.833,,,1.0,0.833,0.75,1.0,1.0,,,,
+NP_038460.4,1.0,0.97,1.0,0.909,0.939,0.955,1.0,0.97,1.0,0.576,0.894,0.97,0.939,1.0,0.909,0.924,0.955
+NP_002653.1,1.0,0.967,1.0,0.963,1.0,0.95,1.0,0.889,0.833,0.55,0.775,1.0,1.0,1.0,0.926,0.889,0.926
+NP_001073890.2,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,0.5,0.75,1.0,0.5,0.75,1.0,0.625,0.5
+NP_055689.1,1.0,0.998,1.0,,1.0,0.986,0.991,,1.0,0.721,0.747,1.0,0.988,1.0,0.998,0.997,1.0
+NP_001166167.1,1.0,1.0,1.0,0.933,0.857,0.857,,0.867,0.952,0.556,0.714,1.0,0.905,,,0.905,0.905
+NP_000329.2,1.0,,1.0,,,1.0,,,0.907,,0.889,1.0,0.963,,,,
+NP_054859.2,1.0,1.0,1.0,1.0,1.0,0.979,0.969,0.924,0.958,0.883,0.958,1.0,1.0,1.0,1.0,0.913,0.962
+NP_000527.2,0.993,0.974,0.997,,0.984,0.964,0.918,0.885,0.842,0.651,0.965,1.0,0.987,0.954,0.938,0.944,0.952
+NP_006850.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_853512.1,1.0,0.833,1.0,0.667,1.0,0.667,0.667,1.0,0.833,0.833,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_006222.2,0.968,0.849,0.946,0.871,0.849,0.57,0.806,0.871,0.645,0.828,0.688,0.957,0.731,0.849,0.796,0.925,0.914
+NP_000274.3,1.0,0.938,1.0,,0.95,0.908,,,0.808,0.896,0.877,0.988,0.938,,,0.875,0.896
+NP_005682.2,0.985,0.97,1.0,1.0,0.833,1.0,,0.985,0.955,0.485,1.0,1.0,1.0,,,1.0,1.0
+NP_005987.3,1.0,1.0,1.0,,1.0,0.931,,,0.417,0.722,0.833,1.0,1.0,,,0.958,0.931
+NP_000282.1,0.979,0.646,0.896,0.917,0.885,0.573,0.854,0.562,0.646,0.938,0.375,0.771,0.604,0.708,0.667,0.594,0.615
+NP_005328.2,1.0,0.679,1.0,0.857,0.786,0.571,0.857,0.679,0.429,0.696,0.518,0.964,0.75,0.571,0.607,0.321,0.357
+NP_000424.2,1.0,1.0,1.0,,1.0,1.0,1.0,,0.9,0.8,1.0,1.0,1.0,0.8,0.6,0.9,0.5
+NP_056265.2,1.0,0.917,1.0,0.917,0.833,0.917,0.917,0.917,0.75,0.5,0.958,0.917,0.833,0.833,0.917,0.667,0.667
+NP_004985.2,1.0,1.0,1.0,0.778,0.778,1.0,1.0,,1.0,0.889,0.778,1.0,1.0,1.0,0.889,1.0,1.0
+NP_000478.3,0.983,0.948,0.989,,0.952,0.924,0.923,,0.769,0.793,0.868,0.981,0.942,,,,
+NP_005973.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,0.95
+NP_058628.3,1.0,0.667,1.0,1.0,0.667,1.0,0.667,0.0,1.0,0.667,1.0,1.0,0.667,0.667,0.667,1.0,1.0
+NP_001108220.1,1.0,0.667,0.917,0.417,0.667,1.0,0.583,,0.667,,0.667,0.833,0.417,0.417,0.417,0.667,0.667
+NP_055844.2,1.0,1.0,1.0,1.0,0.909,1.0,0.867,1.0,0.267,0.133,0.933,1.0,0.867,1.0,1.0,0.818,0.909
+NP_071451.2,0.92,,0.915,,,0.846,0.855,,0.811,,0.843,0.942,0.869,0.825,0.83,0.792,0.792
+NP_002668.1,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,0.786,0.714,0.571,1.0,1.0,1.0,0.929,1.0,1.0
+NP_003581.1,1.0,0.938,1.0,1.0,0.75,0.938,1.0,0.938,0.75,0.375,0.625,1.0,0.938,1.0,1.0,0.938,0.938
+NP_001122092.1,1.0,1.0,1.0,,0.964,0.964,0.821,,0.786,0.5,0.732,1.0,1.0,1.0,1.0,1.0,1.0
+NP_620689.1,1.0,0.998,0.987,0.994,0.996,0.951,0.996,0.741,0.814,0.979,0.973,0.983,0.983,0.976,0.972,0.945,0.955
+NP_004229.1,0.975,0.875,1.0,,0.925,0.875,1.0,0.975,0.475,0.325,0.362,0.975,0.95,0.925,0.95,0.625,0.662
+NP_065825.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612373.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001380429.1,1.0,,1.0,,,1.0,,,1.0,,0.667,1.0,1.0,,,,
+NP_000245.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.958,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002172.2,1.0,1.0,1.0,0.905,0.714,0.952,0.905,0.762,1.0,0.905,1.0,1.0,0.952,0.857,0.714,0.952,0.952
+NP_064612.2,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060269.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.625,1.0,1.0,1.0,1.0,1.0,1.0
+NP_619636.2,1.0,1.0,1.0,,1.0,1.0,0.952,1.0,0.81,0.619,0.952,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003856.1,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_068761.4,0.967,0.567,1.0,0.767,0.567,0.933,0.833,0.5,0.733,,0.467,0.867,0.667,0.633,0.6,0.483,0.617
+NP_036292.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.958,1.0,0.792,1.0,1.0,1.0,1.0,1.0,0.917,0.958
+NP_001878.1,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.778,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001918.3,0.983,0.983,1.0,1.0,0.983,0.966,0.948,0.931,0.983,0.483,0.983,1.0,0.983,1.0,0.983,0.983,0.966
+NP_060789.2,1.0,1.0,1.0,,1.0,1.0,,,0.833,1.0,0.917,1.0,1.0,,,1.0,1.0
+NP_079191.2,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,0.7,0.8,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001274420.1,1.0,1.0,0.893,0.964,1.0,0.929,,0.929,0.929,,0.929,0.964,0.964,,,,
+NP_001171818.1,1.0,,,,1.0,1.0,1.0,,,1.0,,,,,,,
+NP_001334748.1,1.0,0.75,1.0,0.5,0.75,0.75,0.75,0.75,1.0,0.75,0.625,1.0,0.75,0.75,0.75,1.0,0.75
+NP_061939.3,1.0,,,0.844,0.778,0.844,0.822,,0.511,,,1.0,0.978,0.556,0.622,,
+NP_055637.2,1.0,1.0,1.0,0.938,0.938,0.938,1.0,0.875,0.75,0.312,0.531,1.0,0.938,0.875,0.938,0.938,0.969
+NP_001356.1,1.0,,1.0,1.0,,1.0,,,0.875,,1.0,1.0,0.875,,,1.0,1.0
+NP_004612.2,0.95,0.775,1.0,0.65,0.875,0.975,0.65,0.575,0.85,0.462,0.688,1.0,0.875,0.95,0.875,0.762,0.788
+NP_003681.1,1.0,0.5,0.875,0.25,0.875,0.0,0.25,0.375,0.0,0.125,0.125,0.75,0.625,0.25,0.125,0.312,0.375
+NP_114423.1,1.0,0.969,1.0,,0.977,1.0,0.922,,1.0,0.578,0.828,1.0,1.0,0.875,0.859,0.953,0.922
+NP_009330.1,0.711,0.789,0.711,0.632,0.684,0.421,0.974,0.421,0.316,1.0,0.934,0.763,0.763,0.474,0.316,0.408,0.289
+NP_000514.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_938207.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.3,1.0,1.0,1.0,,,1.0,1.0
+NP_009125.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001185885.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_055448.1,1.0,0.95,0.925,0.975,1.0,0.894,0.962,1.0,0.9,0.975,0.9,1.0,1.0,1.0,0.95,0.85,0.844
+NP_005723.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.909,,0.864,1.0,1.0,0.909,1.0,1.0,1.0
+NP_001073270.1,1.0,,1.0,,1.0,1.0,1.0,,0.667,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.778
+NP_000321.1,0.996,0.985,0.999,0.974,0.961,0.982,0.868,0.961,0.709,0.744,0.713,0.996,0.988,0.988,0.991,0.946,0.957
+NP_001336584.1,1.0,0.833,0.833,1.0,0.833,0.333,0.833,0.5,0.5,1.0,0.333,1.0,1.0,1.0,1.0,0.5,0.5
+NP_061336.1,1.0,1.0,1.0,1.0,0.975,0.962,1.0,1.0,0.625,0.35,1.0,1.0,1.0,0.975,1.0,1.0,1.0
+NP_002684.1,0.982,0.966,0.976,0.959,0.978,0.964,0.944,0.932,0.873,0.8,0.916,0.959,0.96,0.978,0.976,0.966,0.968
+NP_658985.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001245150.1,1.0,,1.0,,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005600.1,1.0,0.933,1.0,0.97,0.962,0.762,0.91,0.86,0.781,0.829,0.752,0.99,0.943,0.93,0.95,0.945,0.98
+NP_065793.1,1.0,1.0,1.0,,1.0,1.0,1.0,0.9,1.0,0.1,0.75,1.0,1.0,0.9,1.0,0.95,1.0
+NP_476429.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0
+NP_006112.3,0.995,0.971,1.0,0.938,0.981,0.904,0.909,0.923,0.841,0.873,0.899,1.0,0.976,0.957,0.957,0.938,0.959
+NP_000415.2,0.992,0.978,0.992,0.927,0.986,0.926,0.962,0.931,0.843,0.601,0.907,0.998,0.967,0.931,0.951,0.896,0.953
+NP_000170.1,0.991,0.973,0.997,0.96,0.972,0.959,0.971,,0.921,0.801,0.694,0.982,0.959,0.955,0.966,0.965,0.973
+NP_004620.1,1.0,0.964,1.0,1.0,0.964,1.0,1.0,0.964,1.0,0.643,1.0,1.0,0.893,0.893,0.929,0.946,0.893
+NP_009005.1,1.0,,1.0,,,1.0,0.5,,1.0,1.0,0.75,1.0,1.0,,,,
+NP_001128243.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.667,1.0,0.944,1.0,1.0,1.0,1.0
+NP_002430.3,1.0,0.65,1.0,0.95,0.95,1.0,0.85,0.95,0.9,0.6,0.925,1.0,0.95,0.4,0.5,0.579,0.684
+NP_002959.2,0.853,0.706,1.0,0.676,0.618,0.882,0.853,0.706,0.706,0.794,0.662,0.941,0.765,0.765,0.824,0.471,0.426
+NP_075044.2,0.87,0.667,0.852,,0.583,0.648,,,0.296,0.333,0.5,0.926,0.852,,,,
+NP_612808.1,1.0,0.922,1.0,1.0,0.984,0.953,0.984,0.609,0.703,0.281,0.648,0.859,0.672,1.0,0.984,0.953,0.984
+NP_036214.2,1.0,0.64,1.0,0.88,0.84,0.92,0.84,0.96,0.6,0.34,0.84,1.0,0.88,0.92,0.92,0.96,0.92
+NP_006763.2,0.911,,0.907,,,0.861,0.885,,0.657,,0.823,0.888,0.866,0.868,0.827,0.801,0.822
+NP_001007227.1,0.417,0.333,1.0,0.917,0.833,0.583,0.833,,0.417,0.167,0.5,0.833,0.5,0.583,0.833,0.583,0.583
+NP_003592.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001442.2,1.0,0.958,0.958,0.958,1.0,0.958,0.938,,0.792,0.833,0.865,1.0,1.0,0.958,0.958,0.917,0.917
+NP_058432.1,1.0,1.0,1.0,,1.0,1.0,,,0.75,0.5,0.812,1.0,1.0,,,1.0,1.0
+NP_004443.3,1.0,0.833,1.0,1.0,1.0,1.0,,1.0,1.0,0.333,0.917,1.0,1.0,,,,
+NP_060004.3,1.0,0.938,1.0,,1.0,0.906,1.0,0.4,0.875,0.5,0.531,1.0,0.938,1.0,1.0,0.2,0.4
+NP_002461.2,0.947,,0.961,0.966,0.935,0.795,0.912,0.912,0.534,0.657,0.815,0.969,0.937,0.93,0.915,0.782,0.831
+NP_689505.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000103.2,1.0,0.971,1.0,0.912,1.0,0.784,0.902,0.892,0.608,0.686,0.971,1.0,1.0,0.951,0.961,0.873,0.863
+NP_036320.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001029027.1,1.0,1.0,1.0,,0.929,0.929,,,0.952,0.929,0.988,1.0,0.976,,,0.952,0.929
+NP_006149.2,1.0,,,1.0,0.985,1.0,0.877,,0.754,,,1.0,0.969,0.969,0.969,1.0,1.0
+NP_115647.2,1.0,1.0,1.0,1.0,0.85,0.838,0.867,1.0,0.75,0.667,1.0,1.0,0.975,1.0,1.0,1.0,0.967
+NP_056062.1,1.0,0.875,1.0,,0.8,0.8,0.0,,0.5,0.55,0.7,1.0,1.0,0.875,0.75,,
+NP_005893.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.891,0.935,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055454.1,1.0,,1.0,1.0,,0.905,0.952,,0.905,,0.929,1.0,1.0,1.0,1.0,0.762,0.762
+NP_000041.2,1.0,1.0,1.0,,0.993,0.947,1.0,,0.947,0.839,0.704,1.0,0.98,0.98,0.987,1.0,1.0
+NP_660208.2,1.0,1.0,1.0,0.932,1.0,1.0,0.887,1.0,0.992,0.669,1.0,1.0,0.985,0.947,0.947,1.0,0.992
+NP_201583.2,0.714,0.653,0.939,,0.592,0.602,,,0.408,0.541,0.735,0.796,0.592,,,0.694,0.735
+NP_114432.2,0.938,0.781,0.948,0.885,0.875,0.958,0.841,0.76,0.906,0.208,0.87,0.958,0.917,0.833,0.833,0.915,0.898
+NP_071899.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.9,0.9,0.95,1.0,1.0,1.0,1.0,0.95,1.0
+NP_898898.1,1.0,1.0,1.0,1.0,1.0,0.963,0.944,0.833,0.889,0.778,0.954,1.0,1.0,1.0,1.0,0.981,0.963
+NP_000773.2,1.0,1.0,0.987,0.935,1.0,0.981,0.974,0.844,0.857,0.766,0.844,1.0,1.0,1.0,0.987,0.844,0.987
+NP_036579.2,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000807.2,0.993,,0.99,,,0.853,,,0.93,,0.654,0.957,0.95,,,0.983,0.987
+NP_000137.2,1.0,1.0,1.0,1.0,0.667,1.0,0.667,1.0,1.0,0.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0
+NP_071764.3,1.0,1.0,1.0,,1.0,1.0,1.0,,0.924,1.0,1.0,1.0,1.0,1.0,1.0,0.939,1.0
+NP_006028.2,0.981,0.923,1.0,0.846,1.0,0.942,1.0,0.942,0.5,0.577,0.75,1.0,1.0,0.981,0.981,0.962,1.0
+NP_057616.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.857,0.857,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002543.2,1.0,0.875,1.0,0.875,1.0,1.0,0.875,,0.75,0.625,0.562,1.0,1.0,0.875,1.0,0.875,0.875
+NP_001136256.1,1.0,1.0,1.0,1.0,1.0,0.821,0.821,0.929,0.607,0.929,0.964,1.0,0.964,1.0,1.0,0.75,0.839
+NP_031399.2,1.0,1.0,1.0,0.214,0.786,1.0,0.214,0.643,0.714,0.857,0.464,1.0,1.0,0.714,0.714,1.0,1.0
+NP_003325.2,0.633,0.2,0.867,,0.433,0.65,0.6,,0.5,0.317,0.467,0.633,0.267,0.267,0.333,0.567,0.6
+NP_006746.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.75,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_114032.2,0.896,1.0,0.929,0.777,1.0,0.878,0.865,,0.738,0.896,0.682,0.858,0.82,0.872,0.858,0.702,0.72
+NP_000717.2,1.0,0.0,1.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,0.5,1.0,0.0,0.0,0.0,0.0,0.0
+NP_115834.1,0.867,0.8,1.0,,0.933,1.0,0.733,,0.4,0.567,0.833,1.0,0.933,0.733,0.8,0.567,0.667
+NP_056029.2,1.0,0.929,1.0,1.0,1.0,0.857,1.0,0.875,0.786,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006171.2,0.917,0.625,0.958,,0.875,0.958,,0.917,0.583,0.333,0.625,0.958,0.833,,,0.688,0.75
+NP_004230.2,1.0,0.966,0.966,0.862,0.983,0.991,0.569,0.914,0.672,0.664,0.81,1.0,0.931,0.759,0.828,0.862,0.922
+NP_005582.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.938,1.0,0.969,1.0,1.0,1.0,1.0,0.969,1.0
+NP_004388.2,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_663304.1,0.973,0.936,0.991,,0.955,0.805,,,0.636,0.618,0.65,0.909,0.836,,,0.864,0.886
+NP_000268.1,1.0,0.975,0.997,0.722,0.878,0.924,0.958,0.931,0.78,0.867,0.647,1.0,0.977,0.764,0.849,0.931,0.916
+NP_000308.1,0.269,0.0,0.077,0.115,0.038,0.038,0.0,0.0,0.0,0.231,0.404,0.0,0.0,0.038,0.115,0.115,0.135
+NP_005594.2,0.99,0.885,0.99,,0.969,0.896,1.0,,0.802,0.771,0.875,0.99,0.938,0.896,0.896,0.854,0.922
+NP_066566.3,0.833,1.0,1.0,1.0,0.667,0.833,,0.833,0.833,0.833,0.667,1.0,1.0,,,0.917,1.0
+NP_751951.1,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006286.1,1.0,0.889,1.0,0.778,0.722,0.685,0.889,0.657,0.722,0.444,0.315,1.0,0.963,0.593,0.463,0.704,0.741
+NP_015566.1,0.8,0.763,0.878,,0.755,0.846,0.875,,0.709,0.402,0.769,0.805,0.776,0.914,0.883,0.743,0.835
+NP_001122317.1,0.972,0.79,0.994,0.97,0.927,0.841,0.942,,0.831,0.544,0.675,0.915,0.907,0.875,0.872,0.741,0.768
+NP_003061.3,0.977,0.976,0.988,,0.982,0.95,,,0.748,0.691,0.845,0.978,0.968,,,0.887,0.91
+NP_036361.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0
+NP_002652.2,0.975,,0.975,0.75,0.806,0.819,0.875,,0.825,,0.925,0.888,0.75,0.875,0.917,0.958,1.0
+NP_689482.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001018494.1,1.0,1.0,1.0,1.0,1.0,0.833,,0.667,0.667,0.833,0.5,1.0,1.0,,,,
+NP_001284478.1,1.0,,0.5,1.0,0.5,0.5,0.5,,0.5,1.0,0.75,1.0,1.0,0.5,0.5,0.75,0.5
+NP_001073936.1,1.0,1.0,1.0,0.972,0.986,0.979,1.0,0.889,0.903,0.847,0.875,1.0,1.0,0.972,0.958,0.976,0.965
+NP_037460.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.25,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066918.2,0.889,0.889,1.0,0.889,0.889,0.889,0.889,0.889,0.889,0.889,0.944,0.889,0.889,0.889,0.889,0.667,0.889
+NP_071934.3,0.998,0.999,0.998,,0.988,0.979,,,0.892,0.985,0.971,1.0,0.998,,,0.955,0.987
+NP_689597.1,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,0.25,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_149107.4,1.0,0.926,1.0,0.741,0.926,1.0,0.667,1.0,0.778,0.296,0.889,1.0,0.926,1.0,1.0,0.87,0.852
+NP_057116.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,0.429,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057623.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.333,1.0,1.0,1.0,,,1.0,1.0
+NP_001153844.1,1.0,,1.0,,,1.0,1.0,,1.0,,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004853.2,1.0,,1.0,0.886,0.914,1.0,0.971,0.971,0.771,0.914,0.8,1.0,0.829,1.0,0.943,0.814,0.886
+NP_000412.4,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.982,1.0,0.815,1.0,1.0,1.0,1.0,0.929,0.988
+NP_112483.1,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.75,0.75
+NP_056085.1,1.0,0.667,1.0,,0.667,1.0,,,1.0,0.0,0.833,1.0,0.667,,,1.0,1.0
+NP_079163.2,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060658.3,1.0,0.933,1.0,0.95,1.0,0.867,0.9,0.95,0.978,0.478,0.978,0.956,0.889,0.95,1.0,0.925,0.925
+NP_002927.2,1.0,1.0,1.0,0.667,1.0,0.667,0.667,0.667,0.667,0.5,0.667,0.667,0.667,0.667,1.0,0.833,0.667
+NP_000500.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.646,1.0,1.0,1.0,,,1.0,1.0
+NP_001547.1,1.0,0.722,1.0,0.944,0.833,0.833,0.278,1.0,0.833,0.444,0.556,0.889,0.611,0.722,0.667,0.833,0.833
+NP_001914.3,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.5,0.5,0.0,0.0,0.0,0.0,0.25,0.0
+NP_001148.1,0.895,0.526,0.86,,0.614,0.404,0.333,,0.298,0.096,0.333,0.982,0.667,0.404,0.351,0.579,0.456
+NP_001377.1,1.0,1.0,1.0,,1.0,1.0,0.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_009009.1,0.915,0.729,0.966,0.864,0.695,0.983,,0.977,0.881,0.195,0.797,0.966,0.898,,,,
+NP_056150.1,0.953,0.932,0.964,0.913,0.925,0.918,0.848,0.754,0.778,0.862,0.888,0.956,0.917,0.925,0.929,0.888,0.905
+NP_000171.1,0.996,0.978,0.996,0.925,0.976,0.994,0.988,0.759,0.964,0.85,0.889,1.0,0.976,0.988,0.984,1.0,1.0
+NP_036234.3,0.933,0.929,0.974,0.947,0.88,0.8,0.916,0.407,0.613,0.846,0.613,0.96,0.923,0.963,0.976,0.636,0.692
+NP_001421.2,1.0,0.99,1.0,0.959,1.0,0.964,1.0,0.973,0.806,0.969,0.888,1.0,1.0,0.973,0.973,1.0,1.0
+NP_001785.2,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002519.2,1.0,0.25,1.0,0.0,1.0,1.0,1.0,0.5,1.0,0.25,0.25,0.25,0.25,1.0,1.0,1.0,1.0
+NP_065394.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.55,0.9,1.0,0.95,1.0,1.0,1.0,1.0
+NP_001097.2,1.0,1.0,1.0,1.0,1.0,1.0,0.25,1.0,0.75,0.75,0.875,1.0,1.0,1.0,1.0,0.75,1.0
+NP_000262.2,1.0,0.976,1.0,0.883,0.96,0.956,0.948,0.898,0.848,0.709,0.929,0.997,0.966,0.928,0.943,0.952,0.966
+NP_060167.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.0,1.0,1.0,1.0,,,,
+NP_003840.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.333,0.333,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001307127.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060599.1,1.0,,1.0,1.0,,1.0,1.0,,0.444,,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002462.2,0.981,0.731,0.942,0.721,0.827,0.769,0.808,0.635,0.721,0.144,0.635,0.942,0.74,0.817,0.788,0.692,0.716
+NP_001137326.1,1.0,1.0,1.0,,1.0,1.0,0.0,,0.0,0.0,0.0,1.0,0.0,0.0,1.0,1.0,1.0
+NP_000331.1,1.0,1.0,1.0,1.0,1.0,0.972,1.0,0.889,1.0,0.556,0.833,1.0,1.0,0.944,0.889,0.917,0.944
+NP_001001438.1,1.0,0.957,1.0,,0.986,0.943,0.914,,0.814,0.621,0.879,1.0,0.929,0.943,0.943,0.9,0.943
+NP_659501.1,1.0,1.0,1.0,1.0,1.0,0.75,0.75,1.0,0.0,1.0,0.625,1.0,1.0,1.0,1.0,0.625,1.0
+NP_000012.1,1.0,0.876,1.0,0.75,0.992,0.807,0.797,0.724,0.908,0.77,0.732,0.766,0.763,0.807,0.792,0.769,0.788
+NP_001531.1,0.963,0.926,0.852,0.667,0.87,0.815,0.926,0.889,0.778,0.981,0.963,0.907,0.907,0.852,0.926,0.944,0.972
+NP_005334.1,1.0,1.0,1.0,,0.993,0.917,1.0,,0.967,0.947,0.5,1.0,1.0,1.0,1.0,0.98,0.98
+NP_001004051.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000425.1,0.985,0.941,0.926,0.779,0.897,0.912,0.809,0.838,0.603,0.596,0.757,1.0,0.971,0.824,0.809,0.75,0.794
+NP_878314.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001671.2,1.0,1.0,1.0,0.667,1.0,0.667,1.0,0.667,1.0,1.0,0.667,1.0,1.0,1.0,1.0,0.667,1.0
+NP_612468.1,1.0,1.0,1.0,1.0,0.833,1.0,1.0,0.667,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001510.2,1.0,,1.0,1.0,1.0,1.0,1.0,,1.0,,0.812,1.0,1.0,1.0,1.0,0.969,1.0
+NP_005212.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001431.1,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.333,0.667,0.667,1.0,0.833,1.0,0.833,0.667,0.667
+NP_000280.1,1.0,1.0,1.0,,,1.0,1.0,,0.625,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002176.2,1.0,1.0,1.0,0.967,1.0,1.0,0.88,0.983,0.948,0.74,0.917,1.0,0.969,0.967,1.0,0.98,0.96
+NP_000174.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.933,0.6,0.467,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001242976.1,1.0,,1.0,,1.0,1.0,,,0.917,,0.75,1.0,1.0,,,,
+NP_003826.2,1.0,,1.0,,,1.0,1.0,,0.636,,1.0,1.0,1.0,,,,
+NP_004174.1,1.0,0.981,1.0,,0.986,0.898,0.988,,0.832,0.928,0.926,0.997,0.953,0.981,0.98,0.961,0.964
+NP_002063.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002058.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001963.1,0.998,,0.947,0.956,0.964,0.863,0.958,0.842,0.662,0.916,0.804,0.996,1.0,0.947,0.944,0.784,0.871
+NP_000011.2,1.0,0.897,0.995,0.924,0.893,0.882,0.838,0.94,0.694,0.624,0.732,0.983,0.9,0.976,0.968,0.846,0.89
+NP_001136029.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,0.7,1.0,1.0,1.0,,,1.0,1.0
+NP_056000.1,1.0,1.0,1.0,,1.0,1.0,0.974,1.0,0.974,0.647,0.654,1.0,1.0,0.936,0.923,1.0,1.0
+NP_001877.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_997464.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_758872.1,0.916,0.8,0.912,,0.767,0.849,,,0.8,0.4,0.793,0.875,0.839,,,,
+NP_003404.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.952,0.857,0.81,0.786,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002829.3,1.0,0.909,1.0,0.727,0.682,0.909,,0.818,1.0,0.818,0.364,1.0,0.727,1.0,1.0,,
+NP_065168.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,0.923,0.692,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000738.2,1.0,1.0,1.0,0.833,1.0,0.792,0.917,0.75,0.75,1.0,0.583,1.0,1.0,0.917,0.833,0.75,0.792
+NP_056512.5,1.0,1.0,1.0,1.0,1.0,1.0,,0.943,0.943,0.914,1.0,1.0,1.0,,,,
+NP_001121897.1,0.984,1.0,1.0,1.0,1.0,0.997,1.0,0.333,0.962,0.933,0.917,1.0,1.0,1.0,1.0,0.97,0.987
+NP_000452.2,1.0,1.0,1.0,1.0,0.893,0.917,1.0,1.0,0.883,1.0,1.0,1.0,0.966,0.993,0.959,0.993,0.986
+NP_066279.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.812,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005673.3,0.979,,1.0,0.9,0.917,0.974,0.95,,0.849,0.82,0.927,0.974,0.865,1.0,0.95,0.903,0.875
+NP_000353.1,1.0,1.0,1.0,0.944,1.0,0.833,0.778,0.944,0.333,0.75,0.5,1.0,1.0,0.833,0.833,0.833,0.778
+NP_055981.1,0.994,1.0,0.994,,0.951,0.991,0.87,,0.914,0.753,0.957,1.0,1.0,0.883,0.883,0.901,0.883
+NP_009006.2,0.967,0.921,0.99,0.845,0.944,0.922,1.0,0.927,0.781,0.452,0.606,0.962,0.944,1.0,1.0,0.858,0.895
+NP_000776.1,1.0,1.0,1.0,0.982,0.982,0.947,0.877,0.965,0.93,0.798,0.842,1.0,1.0,1.0,0.947,0.895,0.895
+NP_055400.1,0.985,0.961,0.98,0.985,0.98,0.912,0.902,0.912,0.868,0.676,0.958,0.99,0.966,0.975,0.98,0.951,0.966
+NP_001269333.1,1.0,0.917,1.0,,0.917,0.917,1.0,,0.917,0.625,0.583,1.0,1.0,,,,
+NP_055723.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,0.5,0.125,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001866.2,0.988,0.931,0.985,,0.954,0.924,0.818,,0.759,0.778,0.842,0.989,0.935,0.934,0.914,0.913,0.909
+NP_000040.1,0.985,0.978,0.988,,0.963,0.973,0.768,,0.849,0.705,0.776,0.985,0.95,0.97,0.985,0.815,0.919
+NP_057425.3,1.0,0.947,1.0,1.0,1.0,1.0,1.0,0.895,1.0,0.684,0.947,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001026859.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000657.1,1.0,,1.0,,,1.0,,,1.0,,0.875,1.0,1.0,,,,
+NP_056348.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001018123.1,1.0,0.974,1.0,,1.0,0.921,0.947,1.0,0.763,0.667,1.0,1.0,1.0,1.0,1.0,0.947,0.947
+NP_001137458.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0,1.0,1.0,,
+NP_004628.4,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,0.917,0.958,0.833,0.917,0.833,1.0,1.0,0.917,1.0
+NP_002825.3,0.994,0.97,1.0,,0.945,0.928,,0.964,0.769,0.598,0.78,0.981,0.93,,,0.905,0.949
+NP_001075.1,1.0,1.0,1.0,0.944,1.0,0.944,0.861,1.0,0.75,0.417,0.944,1.0,1.0,1.0,1.0,0.917,1.0
+NP_001195.2,0.986,0.959,0.99,0.97,0.956,0.899,0.944,0.926,0.716,0.812,0.513,0.985,0.95,0.985,0.986,0.902,0.935
+NP_000511.2,0.977,0.889,0.995,,0.944,0.958,0.704,0.759,0.889,0.708,0.736,0.972,0.921,0.819,0.796,0.729,0.931
+NP_001243556.1,1.0,1.0,1.0,,1.0,0.8,,,0.8,0.0,0.9,1.0,1.0,,,0.8,0.8
+NP_002232.2,1.0,,1.0,,,1.0,1.0,,0.375,,0.75,1.0,1.0,0.75,1.0,0.625,0.75
+NP_005548.2,1.0,1.0,1.0,1.0,0.994,0.978,0.875,0.944,0.631,0.909,0.925,1.0,0.969,0.956,0.95,0.872,0.903
+NP_000517.3,1.0,0.962,1.0,0.866,0.948,0.995,0.972,0.993,0.81,0.91,0.909,1.0,0.966,0.986,0.979,0.969,0.962
+NP_660201.1,0.95,0.9,1.0,0.9,0.9,0.7,0.85,0.9,0.8,0.55,0.8,0.85,0.8,0.9,0.85,0.9,0.9
+NP_055129.2,1.0,0.941,1.0,0.971,1.0,1.0,0.882,0.735,0.706,0.529,0.868,1.0,1.0,0.824,0.824,0.779,0.75
+NP_036288.2,0.846,,0.846,,0.818,0.692,,,0.923,,0.846,0.923,0.923,,,,
+NP_060221.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_001278650.1,0.879,0.849,0.92,,0.846,0.768,0.863,0.854,0.591,0.676,0.518,0.94,0.885,,,0.842,0.848
+NP_852122.1,0.992,0.959,0.988,0.922,0.963,0.931,,0.894,0.739,0.959,0.641,0.971,0.935,,,,
+NP_003312.3,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.667,1.0,1.0,1.0,,,,
+NP_006570.1,0.964,0.898,1.0,0.875,0.896,0.932,0.943,0.948,0.807,0.76,0.854,0.984,0.922,0.974,0.958,0.833,0.865
+NP_002056.2,1.0,1.0,1.0,,1.0,1.0,1.0,,0.0,0.0,0.5,1.0,0.0,1.0,1.0,1.0,1.0
+NP_071919.2,0.5,1.0,0.5,0.0,0.5,0.5,0.0,0.5,0.5,1.0,0.5,1.0,1.0,0.0,0.0,0.0,0.0
+NP_000413.1,1.0,,1.0,0.98,,0.832,0.959,,0.684,,0.75,1.0,1.0,0.939,0.939,0.811,0.867
+NP_037377.1,1.0,1.0,1.0,0.933,1.0,1.0,1.0,1.0,0.933,0.867,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006355.2,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.5,1.0,1.0,0.75,1.0,1.0,0.5,0.5,1.0,1.0
+NP_004965.1,1.0,,0.994,0.971,0.991,1.0,0.997,,0.871,,0.682,1.0,1.0,0.997,0.991,0.978,1.0
+NP_937859.1,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,0.5,1.0,1.0,,,,
+NP_000208.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,0.944,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001265273.1,0.868,1.0,0.965,,0.789,0.636,,,0.623,,0.632,0.982,0.974,,,,
+NP_001316843.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071449.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.688,0.906,1.0,1.0,1.0,1.0,0.969,1.0
+NP_000173.2,0.969,1.0,0.985,1.0,1.0,1.0,0.946,1.0,0.908,0.792,0.831,1.0,1.0,0.992,0.977,1.0,1.0
+NP_477520.2,1.0,1.0,1.0,,1.0,1.0,,,0.833,0.6,0.8,1.0,1.0,,,0.9,0.917
+NP_001123575.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.125,1.0,0.625,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001229804.1,1.0,1.0,1.0,1.0,0.929,0.857,,0.923,0.893,1.0,0.929,1.0,1.0,,,,
+NP_071450.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,0.958,1.0,1.0,1.0,1.0,1.0,1.0
+NP_054701.1,0.85,0.6,1.0,0.3,0.7,1.0,,0.7,0.6,0.4,0.4,0.85,0.75,,,,
+NP_001096.1,1.0,1.0,0.975,1.0,1.0,0.988,0.775,1.0,0.9,0.875,0.9,1.0,1.0,1.0,1.0,0.938,0.975
+NP_940905.2,1.0,0.9,1.0,,1.0,0.9,,,0.7,0.5,1.0,1.0,1.0,,,1.0,1.0
+NP_036611.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,0.5,0.964,0.964,1.0,1.0,1.0,1.0
+NP_001119578.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.5,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060661.3,1.0,0.0,0.9,,0.8,0.6,0.1,,0.6,0.3,0.3,0.9,0.4,0.4,0.4,0.5,0.3
+NP_004915.2,1.0,1.0,1.0,0.979,1.0,1.0,1.0,1.0,0.646,0.958,0.75,1.0,1.0,1.0,1.0,0.938,0.958
+NP_490647.1,1.0,1.0,1.0,,,0.972,0.0,,0.667,1.0,0.833,1.0,1.0,,,,
+NP_005404.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055595.2,1.0,1.0,1.0,,0.989,0.932,0.989,,0.636,0.818,0.96,1.0,1.0,0.989,0.989,0.869,0.932
+NP_001269210.1,1.0,1.0,0.985,0.908,0.908,0.931,,,0.954,0.877,0.8,1.0,0.985,,,,
+NP_000894.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,0.625,1.0,1.0,1.0,1.0,1.0,1.0
+NP_954634.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,1.0,0.625,1.0,0.75,,,0.75,1.0
+NP_006558.1,1.0,0.98,1.0,0.993,0.974,0.98,0.98,0.967,0.922,0.784,0.987,1.0,0.987,0.987,0.993,0.98,0.967
+NP_068380.3,1.0,1.0,1.0,,0.667,0.833,1.0,,0.0,0.667,0.5,0.833,1.0,1.0,1.0,1.0,1.0
+NP_113606.2,1.0,1.0,1.0,0.958,0.958,1.0,1.0,0.875,0.75,0.708,0.906,1.0,0.896,1.0,1.0,0.812,0.875
+NP_000509.1,0.682,0.734,0.716,,0.666,0.712,0.768,0.682,0.687,0.67,0.669,0.713,0.709,0.66,0.637,0.787,0.736
+NP_000251.3,0.989,0.975,0.994,0.968,0.962,0.882,0.94,0.844,0.67,0.678,0.797,0.99,0.966,0.97,0.968,0.84,0.887
+NP_060829.2,1.0,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115822.1,1.0,0.952,1.0,1.0,1.0,1.0,0.905,,0.714,0.762,0.786,1.0,0.952,1.0,1.0,0.81,0.786
+NP_008986.2,1.0,1.0,1.0,0.938,0.938,0.929,0.964,1.0,0.786,0.571,0.679,0.991,0.973,0.964,0.973,1.0,0.991
+NP_473368.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.667,1.0,1.0,1.0,1.0,0.333,0.667
+NP_002624.2,1.0,1.0,1.0,,1.0,1.0,0.92,,0.68,0.84,0.62,1.0,1.0,1.0,1.0,0.9,0.96
+NP_006851.1,1.0,1.0,1.0,,1.0,0.5,,,0.5,0.833,1.0,1.0,0.833,,,0.917,1.0
+NP_006757.2,0.912,0.862,0.983,0.848,0.846,0.861,0.884,,0.709,0.861,0.858,0.878,0.782,0.927,0.944,0.87,0.888
+NP_036462.2,0.7,,0.868,,,0.697,0.673,,0.682,,0.652,0.798,0.598,0.652,0.625,0.655,0.675
+NP_872354.1,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001127835.2,0.974,0.965,0.983,0.939,0.955,0.924,,,0.771,0.665,0.958,0.996,0.971,,,,
+NP_003127.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0,0.667,0.667
+NP_775901.3,0.989,,1.0,0.897,,0.81,0.885,,0.839,,0.851,0.966,0.805,0.943,0.874,0.793,0.845
+NP_006504.2,1.0,1.0,1.0,,1.0,1.0,0.5,0.5,1.0,0.0,0.75,1.0,1.0,1.0,0.0,1.0,1.0
+NP_112503.1,1.0,0.8,1.0,0.833,0.967,0.833,0.633,0.833,0.8,0.433,0.9,0.933,0.767,0.933,0.8,0.8,0.833
+NP_000608.1,1.0,1.0,1.0,,1.0,0.833,1.0,,1.0,0.833,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004195.2,1.0,1.0,1.0,,1.0,1.0,,,0.941,1.0,0.765,1.0,1.0,,,0.792,1.0
+NP_005439.2,0.893,0.893,0.607,0.571,0.321,0.357,0.464,0.464,0.357,0.304,0.482,0.929,0.893,0.25,0.286,0.554,0.536
+NP_001309774.1,1.0,1.0,1.0,0.882,0.882,1.0,1.0,0.882,0.824,0.941,0.912,1.0,0.941,0.882,0.941,0.941,0.941
+NP_003038.2,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,0.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036407.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.0
+NP_000508.1;NP_000549.1,0.821,0.831,0.828,0.828,0.894,0.781,0.843,0.847,0.751,0.759,0.722,0.763,0.844,0.88,0.88,,
+NP_000421.1,0.929,0.941,0.946,0.919,0.937,0.838,0.835,0.83,0.478,0.771,0.54,0.909,0.889,0.952,0.952,0.772,0.802
+NP_689831.2,1.0,1.0,1.0,0.873,1.0,1.0,1.0,1.0,0.982,0.855,0.964,1.0,1.0,1.0,0.982,1.0,1.0
+NP_001136.1,1.0,0.667,0.667,1.0,1.0,0.667,1.0,0.5,0.667,0.5,0.75,0.833,0.667,1.0,0.833,0.667,0.667
+NP_001821.2,0.949,0.934,0.978,0.908,0.923,0.906,0.978,0.886,0.908,0.846,0.781,0.974,0.952,0.945,0.941,0.949,0.963
+NP_116056.3,1.0,0.833,1.0,,0.5,0.639,,,0.722,0.167,0.528,1.0,0.833,,,0.722,0.667
+NP_005948.3,1.0,0.985,1.0,,0.952,0.87,0.944,0.919,0.781,0.859,0.87,0.996,0.959,0.944,0.959,0.794,0.88
+NP_000710.5,1.0,1.0,1.0,,0.978,0.942,1.0,,0.813,0.921,0.859,0.993,0.96,,,0.85,0.91
+NP_001952.1,1.0,1.0,1.0,0.667,1.0,1.0,1.0,,0.333,0.667,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000288.1,1.0,1.0,1.0,1.0,0.972,0.965,1.0,0.839,0.706,0.885,0.794,1.0,0.972,1.0,0.965,0.871,0.934
+NP_005903.2,1.0,0.981,0.963,0.944,0.963,0.889,0.944,0.907,0.741,0.759,0.944,1.0,1.0,0.981,0.981,0.787,0.907
+NP_079160.1,1.0,1.0,1.0,,0.917,0.938,0.896,,0.938,0.438,0.833,1.0,1.0,,,0.958,0.958
+NP_000086.2,0.927,0.93,0.974,0.899,0.933,0.857,0.866,,0.771,0.877,0.681,0.95,0.914,0.927,0.922,0.779,0.777
+NP_112235.2,1.0,,1.0,,1.0,1.0,,,0.6,,0.7,1.0,0.4,,,,
+NP_000207.2,0.953,0.918,0.976,0.976,0.906,0.841,0.918,0.847,0.741,0.812,0.965,0.953,0.871,0.976,0.976,0.771,0.841
+NP_597716.1,1.0,1.0,1.0,,1.0,0.9,0.9,0.8,0.7,0.9,0.7,1.0,1.0,1.0,1.0,0.7,1.0
+NP_000089.1,0.946,0.914,0.938,0.875,0.872,0.804,0.857,0.759,0.807,0.878,0.815,0.949,0.929,0.842,0.845,0.787,0.835
+NP_742105.1,0.997,1.0,0.984,1.0,0.987,0.889,1.0,0.922,0.776,0.996,0.906,0.997,0.995,1.0,0.998,0.761,0.788
+NP_000835.1,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,1.0,0.929,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115759.2,1.0,0.722,1.0,0.889,1.0,0.944,0.889,0.556,0.667,0.5,0.667,1.0,1.0,0.889,0.889,0.778,0.667
+NP_003099.1,1.0,1.0,1.0,0.986,1.0,1.0,0.986,1.0,0.784,0.608,1.0,1.0,1.0,1.0,1.0,0.851,0.919
+NP_003098.1,1.0,1.0,1.0,1.0,1.0,0.833,1.0,0.833,0.5,1.0,1.0,1.0,1.0,1.0,1.0,0.667,0.667
+NP_060575.1,1.0,1.0,1.0,0.833,0.833,0.944,0.944,0.944,0.556,0.778,0.611,1.0,1.0,1.0,1.0,0.806,0.889
+NP_000222.2,1.0,0.75,0.833,1.0,0.833,0.917,0.583,0.833,0.917,0.667,0.833,0.75,0.833,0.667,0.75,1.0,1.0
+NP_001003841.1,0.989,0.973,0.945,0.945,0.967,0.978,0.978,0.89,0.978,0.967,0.962,1.0,0.978,0.967,0.956,0.907,0.951
+NP_002393.2,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001129277.1,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,0.909,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115604.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000536.6,0.987,0.96,0.985,,0.945,0.875,,0.879,0.769,0.743,0.842,0.973,0.914,,,,
+NP_000449.1,0.94,,0.855,0.957,0.938,0.859,0.917,,0.787,,0.832,0.865,0.857,0.892,0.689,0.788,0.777
+NP_149075.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,0.98,1.0,1.0,1.0,1.0,,,,
+NP_694955.2,1.0,0.857,1.0,0.952,0.905,0.798,0.81,0.738,0.762,0.476,0.726,1.0,0.905,0.762,0.833,0.679,0.714
+NP_058637.1,1.0,1.0,0.955,,0.909,0.727,,,0.773,0.636,0.727,1.0,1.0,,,,
+NP_000122.1,0.95,0.967,0.933,0.992,1.0,0.992,0.95,1.0,0.95,0.746,0.821,0.925,0.967,0.983,0.958,0.967,0.983
+NP_570971.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_997304.3,1.0,,,1.0,1.0,1.0,1.0,,0.771,,,1.0,1.0,1.0,0.914,,
+NP_065683.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.958,1.0,1.0,1.0,1.0,1.0,1.0
+NP_787110.2,0.938,0.958,0.979,,0.979,0.938,,,0.438,0.917,0.5,0.958,0.938,,,,
+NP_000132.3,1.0,0.982,1.0,,0.991,0.822,0.978,,0.746,0.804,0.812,1.0,0.986,0.978,0.986,1.0,1.0
+NP_000023.2,0.976,,0.994,,,0.891,0.885,,0.788,,0.942,0.988,0.939,0.964,0.976,0.894,0.915
+NP_066970.3,1.0,1.0,1.0,1.0,1.0,1.0,0.929,0.786,0.714,0.786,0.929,1.0,1.0,0.929,0.857,0.714,0.714
+NP_115969.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,,
+NP_005680.1,1.0,0.909,1.0,1.0,0.909,0.818,0.818,0.864,0.909,0.909,0.682,1.0,1.0,1.0,1.0,0.955,0.955
+NP_942559.1,1.0,,1.0,1.0,,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000416.1,0.969,0.969,0.985,0.993,0.961,0.945,0.913,0.88,0.813,0.809,0.948,0.997,0.985,0.983,0.969,0.887,0.93
+NP_060950.3,1.0,0.6,0.8,1.0,0.6,1.0,0.8,0.6,1.0,0.3,0.8,0.8,0.0,1.0,0.8,0.5,0.75
+NP_005391.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079412.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.5,0.75,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005668.2,1.0,0.92,1.0,0.76,0.72,0.82,0.82,0.8,0.66,0.48,0.64,1.0,0.76,0.78,0.78,0.96,0.94
+NP_001357524.1,,,,,,,,,,,,,,,,,
+NP_060106.2,1.0,0.667,0.952,0.667,0.952,0.81,0.524,0.476,0.81,0.238,0.952,1.0,0.714,0.762,0.762,0.619,0.667
+NP_057206.2,1.0,1.0,1.0,1.0,1.0,0.775,,1.0,0.6,0.7,0.8,0.9,0.9,1.0,1.0,,
+NP_001373.2,1.0,1.0,1.0,1.0,1.0,1.0,0.923,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.923,0.962
+NP_060597.4,0.98,0.92,1.0,0.88,0.96,0.92,0.72,0.86,0.82,0.44,0.81,0.92,0.9,0.94,0.96,0.85,0.88
+NP_071415.1,1.0,1.0,1.0,0.885,0.965,1.0,0.947,0.962,0.772,0.673,1.0,1.0,1.0,1.0,1.0,0.962,0.962
+NP_958850.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.889,1.0,0.944,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036196.1,1.0,1.0,1.0,,1.0,1.0,0.917,1.0,0.917,0.958,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000477.1,0.982,0.982,0.964,0.964,0.964,0.929,0.893,0.911,0.893,0.616,0.839,1.0,0.982,0.875,0.875,0.982,0.929
+NP_000093.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.609,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004532.1,1.0,1.0,1.0,0.0,1.0,1.0,0.0,1.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065731.3,1.0,0.9,1.0,0.9,0.9,0.8,1.0,0.9,1.0,0.7,0.75,1.0,0.9,0.7,0.9,0.9,0.9
+NP_060342.2,1.0,0.5,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001303260.1,,,,,,,,,,,,,,,,,
+NP_004165.2,1.0,,1.0,,,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_000387.1,1.0,1.0,1.0,1.0,0.974,1.0,1.0,1.0,0.987,0.895,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001035940.1,1.0,0.958,1.0,,1.0,1.0,0.958,,0.917,1.0,0.938,1.0,0.75,1.0,1.0,1.0,1.0
+NP_000457.1,1.0,0.966,1.0,0.939,0.971,0.901,0.924,1.0,0.763,0.729,0.826,1.0,0.961,0.985,0.99,0.914,0.947
+NP_001380.2,0.889,0.889,1.0,0.667,0.778,0.778,0.667,0.889,0.667,0.222,0.889,0.889,0.667,0.889,0.778,0.889,1.0
+NP_002476.2,0.941,0.765,0.941,0.824,0.779,0.882,0.735,0.794,0.735,0.647,0.809,0.971,0.853,0.647,0.706,0.838,0.853
+NP_078790.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005934.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0
+NP_004456.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001092138.1,1.0,1.0,1.0,1.0,0.917,0.833,,1.0,0.917,0.5,0.667,1.0,0.917,,,,
+NP_001469.1,1.0,1.0,1.0,1.0,0.952,0.857,0.952,1.0,0.952,0.81,0.905,1.0,1.0,1.0,1.0,0.952,0.952
+NP_060222.2,1.0,0.95,1.0,1.0,1.0,0.7,1.0,0.95,0.65,0.75,0.85,1.0,0.95,0.95,0.85,0.7,0.775
+NP_079152.3,1.0,1.0,1.0,0.967,1.0,0.933,0.933,0.95,0.867,0.867,1.0,1.0,1.0,1.0,1.0,0.9,1.0
+NP_002104.2,1.0,0.583,1.0,0.667,0.833,0.75,0.583,0.667,0.25,0.417,0.75,0.583,0.5,0.333,0.333,0.167,0.167
+NP_002896.2,1.0,1.0,1.0,1.0,1.0,1.0,0.907,0.963,0.907,0.667,0.685,1.0,1.0,0.981,0.981,1.0,1.0
+NP_067628.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061850.2,0.889,0.778,1.0,,0.778,0.667,,,0.667,0.556,0.556,0.889,0.889,,,0.833,0.833
+NP_001702.1,1.0,0.25,1.0,0.5,1.0,1.0,0.25,0.75,0.75,0.0,0.5,0.75,0.0,0.5,0.5,0.5,0.5
+NP_001051.1,1.0,0.667,0.833,,1.0,1.0,0.833,,1.0,0.167,0.167,0.833,0.5,0.667,0.667,0.667,0.667
+NP_060307.2,1.0,0.9,1.0,,1.0,0.9,1.0,,0.7,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000303.1,0.989,0.957,0.957,0.872,0.936,0.936,0.851,0.926,0.819,0.84,0.947,0.915,0.979,0.851,0.872,0.968,0.957
+NP_009129.1,1.0,0.933,1.0,0.867,1.0,0.733,1.0,1.0,0.933,0.867,1.0,1.0,1.0,1.0,1.0,0.8,0.733
+NP_598399.2,1.0,1.0,1.0,1.0,1.0,0.958,1.0,1.0,0.972,0.611,0.958,1.0,1.0,0.9,0.9,1.0,1.0
+NP_001189333.2,1.0,0.929,1.0,,0.982,0.806,1.0,,0.867,0.554,0.788,0.976,0.879,,,,
+NP_056375.2,0.989,0.99,0.997,,,0.975,0.983,,0.763,0.607,0.772,0.981,0.967,0.984,0.971,0.942,0.944
+NP_001922.2,1.0,1.0,1.0,1.0,1.0,0.923,1.0,1.0,0.904,0.885,0.808,1.0,1.0,1.0,0.981,0.962,0.962
+NP_068711.1,0.954,0.928,0.948,,0.916,0.815,0.918,,0.774,0.732,0.544,0.892,0.854,0.952,0.919,0.909,0.943
+NP_001073136.1,1.0,1.0,0.988,1.0,1.0,0.964,,,0.798,0.976,0.714,1.0,1.0,,,0.911,0.893
+NP_000083.3,0.995,0.993,1.0,0.989,0.997,0.93,0.885,0.83,0.839,0.935,0.969,0.999,0.995,0.964,0.972,0.922,0.955
+NP_789788.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_075598.2,0.998,0.993,1.0,,0.976,0.89,0.974,,0.659,0.914,0.545,1.0,0.991,0.991,0.979,0.927,0.958
+NP_000135.2,1.0,,1.0,1.0,,1.0,1.0,,1.0,,0.7,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002078.1,1.0,1.0,1.0,0.875,1.0,1.0,0.75,1.0,0.75,0.875,1.0,1.0,1.0,1.0,1.0,0.875,1.0
+NP_000803.2,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,0.708,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001336644.1,,,,,,,,,,,,,,,,,
+NP_068602.2,0.99,0.99,0.99,1.0,0.99,0.855,0.78,0.97,0.81,0.605,0.96,1.0,1.0,0.93,0.95,0.78,0.84
+NP_848549.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002769.1,1.0,1.0,1.0,0.905,0.976,0.905,1.0,0.762,0.714,0.929,0.976,1.0,0.952,0.976,1.0,0.94,0.952
+NP_060319.1,1.0,0.8,1.0,1.0,0.6,1.0,0.8,0.6,0.6,0.8,0.7,1.0,0.8,1.0,1.0,0.8,0.6
+NP_002633.1,1.0,1.0,1.0,,1.0,1.0,1.0,,0.833,0.667,0.75,1.0,1.0,1.0,1.0,0.667,0.667
+NP_006412.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001129392.2,1.0,1.0,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,,,,
+NP_981932.1,1.0,1.0,1.0,0.833,0.938,1.0,,0.917,0.938,0.625,0.812,1.0,1.0,,,,
+NP_060592.2,1.0,0.917,1.0,0.952,0.952,0.952,0.81,0.929,0.905,0.625,0.643,0.964,0.929,0.94,0.94,0.917,0.893
+NP_004599.2,1.0,0.778,0.889,0.556,0.556,0.833,0.556,0.778,0.556,0.444,0.667,0.889,0.667,0.667,0.667,0.556,0.611
+NP_005436.1,0.987,0.98,0.987,0.933,0.953,0.98,0.947,0.947,0.827,0.757,0.657,0.967,0.953,0.973,0.96,0.927,0.917
+NP_003915.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001554.2,1.0,0.389,1.0,0.444,0.722,0.778,0.667,0.778,0.778,0.889,0.25,1.0,0.889,0.833,0.889,0.722,0.889
+NP_003097.1,0.978,1.0,0.911,1.0,0.956,1.0,0.956,1.0,0.867,0.867,0.9,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002043.2,0.964,0.964,1.0,,1.0,0.982,0.873,1.0,0.945,0.982,0.855,1.0,1.0,1.0,1.0,0.964,0.945
+NP_001243169.1,0.75,,0.8,,0.75,0.625,,,0.6,,0.55,0.9,0.8,,,,
+NP_000134.2,0.994,0.994,0.99,0.99,0.996,0.983,0.926,0.997,0.766,0.436,0.829,0.997,0.999,0.992,0.994,0.991,0.992
+NP_001186.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.9,0.85,1.0,1.0,1.0,1.0,0.95,0.9
+NP_005262.1,0.949,0.923,0.885,0.808,0.897,0.885,0.821,0.718,0.756,0.474,0.75,0.885,0.885,0.846,0.897,0.91,0.91
+NP_055940.3,0.986,0.986,1.0,1.0,0.707,0.929,0.917,1.0,0.736,0.833,1.0,0.993,0.964,0.833,0.917,0.958,0.917
+NP_065069.1,0.5,0.5,0.5,0.5,0.5,1.0,0.5,0.5,0.5,,0.75,1.0,1.0,0.5,0.5,0.5,0.5
+NP_000162.2,1.0,1.0,1.0,,0.99,0.96,,,0.817,0.729,0.776,1.0,0.984,,,0.846,0.894
+NP_000479.1,0.957,0.515,0.986,0.971,0.771,0.657,0.647,0.794,0.793,0.647,0.725,0.957,0.721,0.706,0.735,0.662,0.588
+NP_005586.1,0.875,0.75,1.0,,1.0,1.0,0.75,,0.375,0.875,0.5,1.0,1.0,0.5,0.5,1.0,1.0
+NP_002492.2,0.957,0.978,0.978,0.989,0.967,0.957,,0.978,0.565,0.891,0.875,0.967,0.967,,,0.777,0.821
diff --git a/benchmarks/clinical_zero_shot/indels/AUC/Summary_performance_clinical_indels_AUC.csv b/benchmarks/clinical_zero_shot/indels/AUC/Summary_performance_clinical_indels_AUC.csv
new file mode 100644
index 0000000..1751bdc
--- /dev/null
+++ b/benchmarks/clinical_zero_shot/indels/AUC/Summary_performance_clinical_indels_AUC.csv
@@ -0,0 +1,18 @@
+Model_rank,Model_name,Model type,Average_AUC
+1,Provean,Unsupervised,0.927
+2,RITA L,Unsupervised,0.921
+3,RITA XL,Unsupervised,0.915
+4,RITA M,Unsupervised,0.891
+5,Tranception L no retrieval,Unsupervised,0.863
+6,ProGen2 L,Unsupervised,0.847
+7,Tranception M no retrieval,Unsupervised,0.845
+8,ProGen2 M,Unsupervised,0.843
+9,ProGen2 Base,Unsupervised,0.842
+10,Tranception L,Unsupervised,0.84
+11,ProGen2 XL,Unsupervised,0.84
+12,Tranception M,Unsupervised,0.828
+13,RITA S,Unsupervised,0.785
+14,ProGen2 S,Unsupervised,0.763
+15,Hidden Markov Model,Unsupervised,0.675
+16,ProtGPT2,Unsupervised,0.648
+17,UniRep,Unsupervised,0.401
diff --git a/benchmarks/clinical_zero_shot/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv b/benchmarks/clinical_zero_shot/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv
new file mode 100644
index 0000000..a7bc81f
--- /dev/null
+++ b/benchmarks/clinical_zero_shot/substitutions/AUC/Summary_performance_clinical_substitutions_AUC.csv
@@ -0,0 +1,13 @@
+Model_rank,Model_name,Model type,Average_AUC,Bootstrap_standard_error_AUC
+1,TranceptEVE,Unsupervised,0.92,0.0
+2,GEMME,Unsupervised,0.919,0.002
+3,EVE,Unsupervised,0.917,0.002
+4,ESM-1b,Unsupervised,0.892,0.003
+5,PROVEAN,Unsupervised,0.886,0.003
+6,SIFT,Unsupervised,0.878,0.003
+7,SIFT4G,Unsupervised,0.877,0.003
+8,MutationAssessor,Unsupervised,0.877,0.003
+9,MutPred,Unsupervised,0.875,0.005
+10,PrimateAI,Unsupervised,0.855,0.004
+11,LIST-S2,Unsupervised,0.842,0.004
+12,LRT,Unsupervised,0.805,0.004
diff --git a/benchmarks/clinical_zero_shot/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv b/benchmarks/clinical_zero_shot/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv
new file mode 100644
index 0000000..d25e76a
--- /dev/null
+++ b/benchmarks/clinical_zero_shot/substitutions/AUC/clinical_substitutions_AUC_DMS_level.csv
@@ -0,0 +1,2526 @@
+RefSeq ID,SIFT4G,PrimateAI,SIFT,PROVEAN,MutationAssessor,MutPred,LIST-S2,LRT,ESM-1b,EVE,TranceptEVE,GEMME
+NP_001291646.4,1.0,0.942,1.0,1.0,1.0,,0.915,0.979,0.998,0.914,0.974,0.999
+NP_001104262.1,0.707,0.946,0.818,0.908,,1.0,0.805,0.901,0.915,0.932,0.955,0.945
+NP_004357.3,0.888,1.0,0.925,0.888,0.95,1.0,0.9,0.925,0.95,1.0,1.0,1.0
+NP_004270.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.833,1.0,1.0,1.0,1.0
+NP_055134.2,0.107,0.607,0.107,0.429,0.429,0.429,0.286,0.75,0.679,0.464,0.464,0.5
+NP_036342.2,0.667,0.667,0.667,0.0,0.333,1.0,0.667,0.667,1.0,0.667,0.667,0.333
+NP_001907.3,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005912.1,0.783,0.739,0.909,0.961,0.565,0.864,0.8,0.765,0.878,0.913,0.93,0.983
+NP_003095.2,0.833,1.0,0.917,1.0,1.0,,1.0,0.917,1.0,1.0,1.0,1.0
+NP_060767.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.975,1.0,1.0,1.0,1.0
+NP_004993.1,0.5,1.0,0.375,0.75,0.75,1.0,0.875,0.812,0.375,0.625,0.625,0.625
+NP_004542.1,1.0,1.0,1.0,1.0,0.5,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_808807.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,0.0,1.0,0.5
+NP_001075019.1,0.958,1.0,1.0,1.0,0.917,,0.792,1.0,0.75,0.917,0.875,0.958
+NP_004484.1,0.812,1.0,0.788,0.85,0.925,0.9,0.85,0.663,0.9,0.9,0.875,0.825
+NP_079033.4,0.964,0.968,0.933,0.985,0.928,0.977,0.947,0.91,0.971,0.968,0.989,0.98
+NP_000371.1,1.0,0.833,1.0,1.0,1.0,1.0,0.833,0.917,1.0,1.0,1.0,0.833
+NP_055040.2,0.978,0.846,0.951,0.934,0.978,0.877,0.736,0.467,0.874,0.945,0.951,0.912
+NP_056110.2,1.0,0.926,0.889,1.0,0.926,1.0,0.926,0.938,0.593,1.0,1.0,1.0
+NP_149975.1,1.0,1.0,0.75,1.0,1.0,,0.833,1.0,1.0,1.0,1.0,1.0
+NP_004665.2,0.0,0.0,0.0,0.0,0.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0
+NP_036382.2,1.0,1.0,1.0,1.0,1.0,1.0,0.625,0.188,1.0,1.0,1.0,1.0
+NP_004105.1,1.0,,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_066360.1,0.562,0.5,0.438,0.781,0.531,0.714,0.625,0.844,0.625,0.844,0.75,0.688
+NP_001098717.1,0.583,0.5,0.25,0.667,0.5,,,0.875,0.667,0.667,0.583,0.5
+NP_055066.1,0.875,0.825,0.95,0.7,0.925,0.944,0.85,0.75,0.95,0.875,0.85,0.925
+NP_000008.1,0.883,0.85,0.896,0.858,0.917,0.923,0.9,0.904,0.917,0.883,0.908,0.9
+NP_000523.2,0.898,0.884,0.801,0.759,0.85,1.0,0.792,0.738,0.81,0.917,0.94,0.903
+NP_057021.2,0.9,0.933,0.9,0.767,1.0,1.0,0.733,0.8,1.0,1.0,1.0,1.0
+NP_001243423.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_477352.3,0.667,0.667,,,,,0.667,0.833,0.667,0.778,0.667,0.667
+NP_694984.5,0.821,0.738,1.0,0.905,0.929,0.708,1.0,0.714,0.833,0.95,0.857,1.0
+NP_777362.1,,0.556,,,0.929,0.667,,0.875,0.889,0.889,1.0,0.667
+NP_291028.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001106279.3,0.778,,0.88,1.0,,,0.648,0.778,0.728,0.859,0.88,0.902
+NP_004586.2,0.962,1.0,0.827,0.923,0.846,1.0,0.962,1.0,0.962,0.962,0.923,0.962
+NP_005622.1,0.958,0.917,0.958,0.958,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0
+NP_064551.3,0.857,0.923,0.899,0.853,0.932,1.0,0.881,0.808,0.909,0.972,0.958,0.965
+NP_061947.1,,0.987,,,1.0,0.818,,1.0,0.974,1.0,1.0,1.0
+NP_000209.2,0.983,0.94,0.985,0.99,0.953,0.993,0.968,0.973,0.986,0.994,0.997,0.999
+NP_078925.3,0.625,0.625,0.719,0.625,0.688,1.0,0.594,,0.719,0.781,0.75,0.75
+NP_001843.1,0.895,0.842,0.947,1.0,0.947,1.0,0.842,0.974,0.947,1.0,1.0,0.947
+NP_006386.1,1.0,0.0,1.0,1.0,1.0,,0.0,0.5,1.0,1.0,1.0,1.0
+NP_000276.2,0.798,0.827,0.721,0.721,0.962,1.0,0.769,0.5,0.788,0.942,1.0,0.962
+NP_001689.1,1.0,0.583,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_919255.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057303.2,1.0,1.0,1.0,0.938,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_149100.2,0.864,0.987,0.857,0.87,0.961,0.857,,0.896,0.974,0.961,0.948,0.922
+NP_079353.3,0.9,0.75,0.95,0.9,1.0,1.0,1.0,0.75,1.0,0.9,1.0,1.0
+NP_004861.2,0.75,0.667,0.6,0.7,0.5,0.5,0.25,1.0,0.833,0.833,0.667,0.833
+NP_940988.2,0.9,0.9,0.85,0.8,1.0,1.0,0.85,0.9,0.95,0.95,1.0,1.0
+NP_057090.2,,1.0,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_003014.3,0.576,0.849,0.668,0.891,,0.733,,0.815,0.109,0.84,0.723,0.874
+NP_963883.2,0.5,1.0,0.5,0.5,0.5,0.0,0.5,0.5,1.0,0.0,0.5,0.0
+NP_005452.2,1.0,1.0,0.967,1.0,1.0,1.0,1.0,1.0,0.967,1.0,1.0,1.0
+NP_201569.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.962,1.0,0.962,1.0,1.0
+NP_055862.1,0.938,1.0,1.0,1.0,1.0,0.0,0.917,0.906,1.0,0.958,0.771,0.979
+NP_005111.2,0.911,0.909,0.901,0.917,0.921,0.853,0.876,0.829,0.917,0.951,0.979,0.913
+NP_060289.2,0.5,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001004127.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000007.1,0.497,0.371,0.453,0.503,0.541,0.727,0.659,0.609,0.359,0.471,0.494,0.547
+NP_000017.1,0.76,0.64,,,0.988,,0.51,0.7,0.73,0.91,0.88,0.91
+NP_015556.1,0.738,0.952,0.857,0.905,0.852,0.865,0.805,,0.8,0.881,0.914,0.881
+NP_115909.1,1.0,0.929,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061929.2,0.75,0.583,0.833,0.5,1.0,0.5,0.5,0.708,0.583,0.917,0.5,0.667
+NP_005778.1,1.0,0.9,1.0,1.0,1.0,,1.0,0.95,1.0,1.0,1.0,1.0
+NP_079375.3,0.733,0.619,0.805,0.624,0.695,0.875,0.829,0.829,0.79,0.743,0.81,0.79
+NP_550438.1,1.0,1.0,1.0,1.0,1.0,,0.917,0.75,0.917,1.0,1.0,1.0
+NP_004971.2,0.953,1.0,0.953,1.0,1.0,1.0,,0.852,1.0,1.0,1.0,1.0
+NP_036413.1,0.963,0.981,0.889,0.889,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_078846.2,0.643,0.607,0.732,0.786,0.893,1.0,0.679,0.554,0.821,0.857,0.929,0.714
+NP_064632.2,0.965,1.0,0.965,0.983,,1.0,,0.935,0.987,0.983,0.987,1.0
+NP_001275880.1,0.939,0.963,0.904,0.93,1.0,1.0,0.544,0.796,1.0,0.982,1.0,1.0
+NP_001351.2,0.909,0.985,0.922,0.988,1.0,0.959,,0.928,0.979,0.997,1.0,0.97
+NP_066264.4,0.903,0.83,0.88,0.913,,,,0.807,0.947,0.94,0.94,0.92
+NP_115733.2,0.75,1.0,0.75,1.0,0.75,,0.5,0.875,1.0,0.75,0.75,1.0
+NP_005042.1,1.0,0.833,1.0,0.944,0.889,1.0,0.889,0.75,1.0,0.833,0.778,0.778
+NP_000775.1,0.88,0.871,0.88,0.865,0.936,1.0,0.865,0.904,0.936,0.918,0.883,0.965
+NP_060272.3,0.87,1.0,0.87,0.978,,1.0,0.739,0.783,0.935,1.0,0.978,1.0
+NP_000427.1,0.938,0.938,1.0,0.875,0.938,1.0,0.875,0.812,0.875,1.0,1.0,0.938
+NP_063940.1,0.75,0.75,0.5,0.75,0.75,0.5,1.0,0.5,0.5,0.5,0.75,1.0
+NP_002992.1,0.889,1.0,1.0,0.889,1.0,1.0,0.889,1.0,1.0,1.0,1.0,1.0
+NP_001077083.1,0.769,0.892,0.754,1.0,0.923,1.0,0.985,0.923,1.0,0.985,0.985,0.815
+NP_573566.2,0.958,0.726,1.0,1.0,0.958,1.0,0.905,0.842,0.895,1.0,1.0,1.0
+NP_001944.1,0.9,0.519,0.924,0.99,1.0,0.954,,0.968,0.919,1.0,0.99,0.938
+NP_004541.1,1.0,1.0,1.0,0.75,0.95,1.0,0.85,0.775,0.95,0.95,0.95,0.9
+NP_001368914.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_004083.3,0.911,0.489,0.933,0.967,0.867,0.789,0.956,0.989,0.967,0.967,0.978,0.989
+NP_057457.1,0.685,0.88,0.801,0.704,0.707,0.583,0.741,0.63,0.796,0.852,0.75,0.843
+NP_004324.2,,0.775,,,,0.991,0.83,0.838,0.962,0.964,0.957,0.979
+NP_683695.1,1.0,1.0,1.0,0.95,1.0,,0.95,1.0,1.0,1.0,1.0,1.0
+NP_076984.2,0.896,0.875,0.917,1.0,0.958,,0.917,0.917,0.917,1.0,1.0,1.0
+NP_001017420.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000166.2,1.0,0.778,,,0.5,1.0,,0.639,0.583,0.694,0.667,0.583
+NP_004332.2,0.833,0.833,0.75,0.667,0.792,0.833,0.792,0.417,0.875,1.0,0.917,0.917
+NP_001374.4,,,,,,,,,1.0,1.0,1.0,1.0
+NP_001366127.1,0.938,0.929,0.935,0.978,,0.895,,0.86,0.887,0.988,0.967,0.994
+NP_000182.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_115693.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005393.2,0.972,0.722,0.861,0.667,1.0,1.0,0.722,0.5,0.722,0.611,0.778,1.0
+NP_001189472.1,0.833,0.846,0.917,0.917,,,0.923,0.962,1.0,0.923,0.923,1.0
+NP_056082.2,1.0,1.0,0.944,0.833,0.667,1.0,1.0,0.917,0.889,1.0,1.0,1.0
+NP_077310.1,0.824,0.765,0.706,0.882,0.882,1.0,0.882,0.529,0.235,0.824,0.824,0.706
+NP_877437.2,0.964,0.464,0.976,0.988,0.917,1.0,0.988,0.81,1.0,0.988,1.0,1.0
+NP_116145.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056241.3,1.0,0.857,1.0,0.786,1.0,1.0,0.929,0.679,1.0,1.0,1.0,1.0
+NP_004563.2,0.929,0.871,0.918,0.914,0.882,1.0,0.907,0.865,0.943,0.9,0.85,0.696
+NP_005253.3,1.0,0.7,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_940916.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000432.1,0.897,0.782,0.899,0.924,0.922,0.886,,0.763,0.924,0.919,0.929,0.932
+NP_000294.1,0.918,0.888,0.815,0.802,0.905,,0.931,0.651,0.81,0.948,0.957,0.871
+NP_001308971.2,0.99,0.981,1.0,0.952,0.962,0.972,,1.0,1.0,1.0,1.0,0.99
+NP_001035526.1,0.95,1.0,0.9,0.8,1.0,,0.35,0.85,0.95,1.0,1.0,0.8
+NP_004555.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000149.4,0.954,0.798,0.954,0.941,0.971,0.865,0.92,0.788,0.924,0.983,1.0,0.987
+NP_037379.1,0.5,0.0,0.5,0.75,0.75,,0.75,1.0,0.5,0.5,0.5,0.5
+NP_002426.1,0.938,0.594,1.0,1.0,0.844,0.75,0.656,0.703,1.0,0.906,0.844,0.844
+NP_001376.1,1.0,0.55,0.85,0.95,0.75,1.0,0.95,1.0,0.9,1.0,1.0,1.0
+NP_078807.1,1.0,0.917,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_003898.2,0.989,0.78,0.943,0.962,0.943,0.985,,0.85,0.905,1.0,1.0,0.973
+NP_892003.2,0.583,1.0,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001157750.1,0.987,1.0,,,,,1.0,,0.993,0.954,0.974,0.987
+NP_658986.1,0.8,0.667,0.683,0.433,0.7,0.889,0.733,0.7,0.833,0.8,0.767,0.833
+NP_006006.3,0.833,0.702,0.818,0.889,0.871,0.91,0.93,0.742,0.953,0.86,0.947,0.944
+NP_055006.1,0.789,0.911,0.751,0.775,0.78,0.855,0.621,0.758,0.878,0.893,0.896,0.889
+NP_001076585.1,0.947,0.827,0.895,0.895,0.947,,0.625,0.79,0.642,0.926,0.877,1.0
+NP_872282.1,0.75,1.0,0.75,0.917,0.917,,1.0,0.542,0.833,1.0,1.0,1.0
+NP_001001563.2,1.0,0.8,,,1.0,1.0,0.933,1.0,0.8,0.833,0.8,0.867
+NP_689629.2,1.0,0.6,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_061982.3,0.973,0.884,0.989,0.938,0.987,0.879,0.92,0.946,0.897,0.933,0.938,1.0
+NP_000439.2,0.998,0.989,0.959,0.894,0.825,0.848,0.939,0.955,0.811,0.976,0.971,0.992
+NP_000532.2,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_057049.5,0.0,,0.0,0.0,0.0,,0.0,0.5,1.0,0.0,1.0,0.0
+NP_258412.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060297.1,1.0,1.0,1.0,1.0,1.0,,,0.0,1.0,1.0,1.0,1.0
+NP_003841.1,,0.944,,,0.989,1.0,,0.9,0.978,1.0,1.0,1.0
+NP_001294.2,0.7,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_976035.1,0.877,0.785,0.877,0.846,0.846,0.909,,0.869,0.892,0.831,0.923,0.954
+NP_001091738.1,0.982,0.945,0.973,1.0,0.964,0.926,,0.945,1.0,0.991,1.0,1.0
+NP_037451.1,1.0,0.4,1.0,1.0,1.0,,1.0,0.7,1.0,1.0,1.0,1.0
+NP_001243371.1,0.871,0.997,0.688,1.0,0.968,1.0,,0.87,0.997,0.913,1.0,0.917
+NP_004784.2,0.871,0.609,0.961,0.75,0.891,0.942,0.684,0.531,0.703,0.875,0.859,0.758
+NP_065796.2,0.83,0.753,0.822,0.891,0.806,0.742,0.757,0.735,0.895,0.888,0.883,0.903
+NP_003468.2,0.9,0.9,1.0,0.9,0.8,,1.0,0.9,1.0,0.9,0.9,0.95
+NP_001002755.1,1.0,1.0,1.0,1.0,1.0,,0.917,0.917,1.0,1.0,1.0,1.0
+NP_003163.1,0.917,0.917,0.708,0.875,0.917,,,0.958,0.833,0.958,0.958,0.917
+NP_938149.2,0.975,1.0,0.925,1.0,,,0.925,0.875,1.0,1.0,0.95,1.0
+NP_056451.3,0.733,1.0,0.733,0.8,,,1.0,1.0,1.0,0.867,1.0,1.0
+NP_073741.3,1.0,1.0,0.75,1.0,1.0,,0.75,1.0,1.0,1.0,1.0,0.75
+NP_001035963.1,0.913,0.891,0.922,0.978,0.971,0.894,0.815,0.947,0.985,0.967,0.989,0.989
+NP_001077430.1,0.983,0.983,0.957,1.0,0.908,0.9,0.957,0.976,0.991,0.983,0.983,0.991
+NP_000052.1,0.991,0.869,1.0,1.0,1.0,0.985,0.92,0.855,0.997,0.983,0.993,0.995
+NP_001202.5,0.974,0.895,1.0,0.965,0.982,0.636,0.912,0.921,0.825,0.982,0.982,0.982
+NP_612486.2,0.906,0.723,0.688,0.911,0.973,1.0,0.938,0.897,0.705,0.893,0.866,0.964
+NP_000048.1,1.0,0.955,1.0,0.982,1.0,1.0,0.982,0.938,0.991,1.0,0.973,1.0
+NP_001120.3,0.917,0.944,0.944,1.0,0.944,1.0,0.944,0.471,1.0,0.833,1.0,0.944
+NP_001909.4,0.976,0.875,0.998,0.99,,0.886,,0.877,0.981,0.986,0.995,0.986
+NP_001166939.1,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002486.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079005.3,0.692,0.663,,,0.913,0.636,0.7,,0.895,0.942,0.988,0.826
+NP_056303.3,0.8,0.822,0.911,0.467,0.489,0.875,0.222,0.2,0.533,0.556,0.533,0.489
+NP_000503.1,0.788,0.854,0.798,0.855,0.639,0.836,0.849,0.793,0.888,0.829,0.825,0.917
+NP_002991.2,0.975,0.917,0.976,0.99,0.995,0.992,1.0,0.881,0.995,0.985,0.99,0.997
+NP_006259.1,1.0,0.917,0.938,1.0,0.958,1.0,0.823,0.833,1.0,1.0,0.979,0.917
+NP_055975.1,0.8,1.0,1.0,0.8,1.0,1.0,1.0,0.8,1.0,1.0,0.8,1.0
+NP_064716.2,0.955,0.897,0.955,0.994,1.0,1.0,0.946,0.891,1.0,1.0,1.0,1.0
+NP_005205.2,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_001063.2,1.0,,1.0,1.0,,,0.97,,1.0,1.0,1.0,1.0
+NP_002632.1,0.863,0.849,0.867,0.885,0.937,0.923,,0.875,0.857,0.937,0.909,0.905
+NP_036546.2,1.0,0.8,1.0,1.0,0.933,1.0,1.0,0.917,0.833,1.0,0.933,0.967
+NP_005493.2,0.968,0.893,0.978,0.976,0.833,0.879,0.77,0.766,0.948,0.972,0.96,0.937
+NP_057688.3,1.0,1.0,0.962,1.0,1.0,1.0,1.0,0.923,1.0,1.0,1.0,1.0
+NP_054745.1,1.0,0.5,0.95,0.9,0.7,,0.8,0.5,0.8,0.8,0.9,0.9
+NP_000183.2,0.889,0.904,0.853,0.918,0.904,0.88,,0.84,0.897,0.906,0.839,0.911
+NP_001275676.1,0.827,0.596,,,0.865,0.654,0.558,0.779,0.596,0.923,0.885,0.75
+NP_003679.2,,0.678,,,0.771,0.898,0.559,0.494,0.715,0.641,0.663,0.77
+NP_112180.4,1.0,0.75,1.0,1.0,1.0,1.0,0.25,0.625,1.0,1.0,1.0,1.0
+NP_777576.1,1.0,0.955,0.966,1.0,0.955,1.0,0.841,0.955,0.955,1.0,0.909,1.0
+NP_476517.1,1.0,0.833,1.0,1.0,0.833,0.833,1.0,0.917,1.0,1.0,1.0,1.0
+NP_061154.1,1.0,1.0,1.0,1.0,1.0,0.981,0.865,0.5,1.0,1.0,1.0,1.0
+NP_055180.1,0.8,0.9,0.775,0.85,0.7,0.8,0.55,0.875,0.85,0.9,0.9,0.9
+NP_000354.4,0.902,0.795,0.939,0.803,0.898,1.0,0.985,0.818,0.818,0.848,0.917,0.962
+NP_006070.2,1.0,0.0,1.0,1.0,1.0,,,1.0,1.0,0.5,1.0,0.5
+NP_003968.3,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078974.1,1.0,0.75,1.0,1.0,0.5,,,1.0,1.0,1.0,1.0,1.0
+NP_055331.1,1.0,0.905,0.905,1.0,0.921,1.0,,0.921,1.0,1.0,1.0,1.0
+NP_004512.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004513.1,0.969,0.75,0.953,0.844,,1.0,0.875,0.938,1.0,1.0,1.0,1.0
+NP_000498.2,1.0,0.883,0.983,1.0,1.0,0.889,,0.8,1.0,1.0,1.0,0.95
+NP_000378.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079495.1,1.0,0.333,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_005850.1,0.913,0.796,0.742,0.895,0.937,0.922,0.88,0.7,0.934,0.907,0.952,0.97
+NP_079469.2,1.0,0.833,1.0,0.833,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000476.1,0.952,1.0,0.952,0.952,0.952,,0.619,0.905,1.0,0.857,0.905,0.952
+NP_067638.3,0.657,0.834,0.64,0.691,0.602,0.842,,0.709,0.712,0.737,0.779,0.816
+NP_001014437.1,1.0,1.0,1.0,0.833,,,,0.875,1.0,1.0,1.0,0.917
+NP_055915.2,0.848,0.856,0.803,0.904,0.874,0.839,0.841,0.676,0.736,0.904,0.886,0.904
+NP_000381.1,0.664,0.741,0.69,0.759,0.621,1.0,0.828,0.793,0.914,0.793,0.81,0.879
+NP_078805.3,,1.0,,,1.0,1.0,,0.25,1.0,1.0,1.0,1.0
+NP_054733.2,0.911,0.847,0.934,0.842,0.816,0.844,0.9,0.616,0.953,0.995,1.0,0.953
+NP_001867.2,0.885,0.957,0.917,0.968,0.947,0.9,,0.821,0.947,1.0,1.0,0.995
+NP_001129524.1,0.885,0.769,0.769,1.0,,,0.6,1.0,1.0,1.0,1.0,1.0
+NP_004691.2,,0.587,,,0.94,0.952,,0.94,0.873,0.9,0.953,0.953
+NP_001128715.1,,1.0,,,,0.688,0.715,0.933,0.972,0.88,0.493,0.817
+NP_000361.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_079541.1,0.917,1.0,0.75,0.833,,0.667,1.0,1.0,0.833,1.0,0.917,0.833
+NP_001121775.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055112.2,0.958,1.0,1.0,1.0,1.0,,0.917,1.0,0.958,1.0,1.0,1.0
+NP_570824.1,0.884,0.603,0.861,0.787,0.75,0.819,0.694,0.519,0.88,0.861,0.926,0.917
+NP_003989.2,0.992,1.0,0.983,0.983,,,0.983,0.908,0.983,1.0,1.0,0.983
+NP_001020774.1,0.6,1.0,0.8,1.0,0.9,,,1.0,1.0,1.0,1.0,1.0
+NP_071398.5,,,,,,,,,1.0,1.0,1.0,1.0
+NP_000332.2,0.909,1.0,1.0,1.0,1.0,1.0,0.545,1.0,1.0,0.955,0.955,1.0
+NP_808880.1,1.0,,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_005632.1,1.0,1.0,0.75,1.0,1.0,,,0.875,1.0,1.0,1.0,1.0
+NP_008872.1,0.996,0.909,1.0,0.952,0.835,1.0,,0.901,0.988,1.0,0.988,0.996
+NP_073752.6,0.65,1.0,0.9,1.0,0.8,0.667,0.8,0.9,0.95,0.95,0.95,1.0
+NP_001307594.1,1.0,0.333,1.0,1.0,0.667,1.0,1.0,0.5,0.938,1.0,1.0,0.875
+NP_000277.1,0.75,1.0,1.0,1.0,1.0,0.5,,1.0,1.0,1.0,1.0,1.0
+NP_001440.2,0.917,1.0,,,0.909,,,1.0,1.0,1.0,1.0,0.958
+NP_001171582.1,1.0,1.0,1.0,1.0,1.0,0.0,,1.0,1.0,1.0,1.0,1.0
+NP_000178.2,1.0,0.84,0.903,0.959,0.991,0.933,0.835,0.658,0.97,0.991,0.996,0.991
+NP_001119580.2,0.951,0.951,0.946,0.96,0.941,0.906,0.971,0.811,0.93,0.973,0.973,0.941
+NP_001976.1,1.0,0.75,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001244074.1,0.94,0.842,0.821,0.952,,,0.619,0.6,0.857,0.881,0.857,0.786
+NP_647593.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_038472.2,0.778,0.889,0.833,0.556,0.444,0.667,0.889,0.944,0.833,0.722,0.667,0.722
+NP_002258.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057126.2,1.0,1.0,1.0,0.952,1.0,,0.857,0.905,1.0,1.0,0.952,0.952
+NP_060420.2,0.269,0.308,0.0,0.423,0.346,1.0,0.654,,0.269,0.154,0.115,0.462
+NP_005901.2,0.972,0.972,0.868,0.944,,,0.931,0.7,0.986,0.986,0.972,0.972
+NP_689549.3,,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004067.1,0.833,1.0,1.0,1.0,1.0,,0.667,0.667,1.0,1.0,1.0,1.0
+NP_008909.1,1.0,0.833,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_001905.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006281.1,0.94,1.0,,,1.0,1.0,0.6,0.88,1.0,0.96,1.0,0.96
+NP_002100.2,1.0,0.722,0.833,0.667,0.944,1.0,0.833,0.583,1.0,0.972,0.972,0.944
+NP_060496.2,0.87,0.741,0.75,0.926,0.972,0.673,0.806,0.867,0.889,0.981,0.917,0.806
+NP_065979.1,0.865,0.95,0.935,0.97,0.94,0.257,0.83,0.711,0.82,0.92,0.91,0.86
+NP_001094383.2,0.62,0.987,0.607,0.732,0.9,0.031,0.772,0.861,0.747,0.911,0.924,0.924
+NP_064502.9,0.8,0.813,0.86,0.76,0.713,0.733,0.987,0.68,0.813,0.707,0.76,0.84
+NP_001258928.1,1.0,1.0,1.0,1.0,1.0,1.0,0.778,0.75,1.0,1.0,1.0,1.0
+NP_000533.4,0.944,0.944,,,0.778,,,0.889,0.806,0.958,0.944,0.944
+NP_004517.2,0.786,1.0,0.571,0.929,0.857,1.0,0.429,0.857,1.0,1.0,0.929,1.0
+NP_000014.1,0.9,0.857,0.886,0.886,0.957,0.958,0.914,0.643,0.843,0.9,0.914,0.929
+NP_004975.2,0.931,0.858,0.922,0.842,0.822,0.938,0.872,0.696,0.867,0.911,0.909,0.946
+NP_055612.2,0.938,0.75,0.875,1.0,1.0,1.0,0.625,0.938,1.0,0.875,1.0,0.75
+NP_060247.2,0.8,0.8,1.0,0.8,0.8,1.0,1.0,1.0,0.6,1.0,1.0,1.0
+NP_085055.2,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0
+NP_653259.3,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000144.2,0.916,0.808,0.942,0.977,0.888,0.995,0.76,0.932,0.852,0.989,0.978,0.978
+NP_006614.2,,0.556,,,0.778,1.0,,0.556,0.667,0.889,0.667,0.556
+NP_068587.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.875,1.0,1.0,1.0,1.0
+NP_001153705.1,0.875,0.925,0.888,0.975,0.9,0.875,0.838,0.738,1.0,0.962,0.975,1.0
+NP_005016.1,1.0,1.0,0.938,1.0,1.0,1.0,,0.875,0.969,1.0,1.0,1.0
+NP_212134.3,,1.0,,,0.9,1.0,,0.8,1.0,1.0,1.0,1.0
+NP_003192.1,0.667,0.0,0.667,1.0,1.0,1.0,1.0,0.333,0.667,0.667,0.667,1.0
+NP_001278933.1,0.978,0.933,0.978,0.989,,,0.878,0.689,0.944,1.0,1.0,1.0
+NP_653186.2,1.0,1.0,1.0,0.7,1.0,,1.0,0.9,1.0,1.0,1.0,1.0
+NP_001290185.1,0.821,1.0,0.842,0.912,0.817,0.884,0.879,0.949,0.938,0.949,0.974,0.989
+NP_001171634.3,1.0,0.333,,,1.0,,1.0,0.25,1.0,1.0,1.0,1.0
+NP_001171536.2,0.896,0.971,0.944,0.961,0.902,0.946,0.964,0.926,0.931,0.985,0.992,0.992
+NP_477511.1,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057411.1,1.0,1.0,0.667,0.333,0.667,,1.0,1.0,1.0,1.0,1.0,0.667
+NP_000259.1,0.792,1.0,0.667,0.833,0.833,1.0,0.667,0.875,0.833,0.833,0.833,0.667
+NP_057182.1,1.0,,1.0,1.0,1.0,,0.5,0.5,1.0,1.0,1.0,1.0
+NP_056007.1,1.0,0.667,1.0,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0
+NP_006603.2,0.954,0.831,0.946,0.885,0.954,1.0,0.923,,0.892,0.985,1.0,1.0
+NP_004312.2,,0.893,,,0.932,0.961,,0.672,0.902,0.976,0.947,0.944
+NP_004288.1,1.0,1.0,0.0,1.0,1.0,,0.0,0.5,1.0,1.0,1.0,1.0
+NP_001254507.1,0.929,0.857,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001129123.1,0.25,0.625,0.25,0.0,0.0,1.0,,0.0,0.125,0.0,0.0,0.125
+NP_000881.3,1.0,1.0,1.0,1.0,1.0,1.0,,0.875,1.0,1.0,1.0,1.0
+NP_057386.2,,1.0,,,,1.0,0.5,1.0,1.0,1.0,1.0,1.0
+NP_004809.2,0.0,1.0,0.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.5
+NP_000466.2,0.899,0.894,0.817,0.99,1.0,1.0,0.894,0.952,1.0,1.0,0.933,1.0
+NP_004811.1,1.0,0.591,1.0,1.0,1.0,,0.909,1.0,1.0,1.0,1.0,1.0
+NP_000273.2,0.991,0.87,0.896,0.963,1.0,0.9,0.935,0.812,0.944,1.0,1.0,1.0
+NP_060586.2,,0.542,,,,,1.0,0.75,0.708,0.875,0.917,0.917
+NP_127497.1,0.796,0.514,0.865,0.692,,0.833,,0.603,0.829,0.8,0.586,0.757
+NP_001304919.1,1.0,1.0,,1.0,0.0,0.0,,1.0,1.0,1.0,1.0,1.0
+NP_004429.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_945345.2,0.917,0.979,0.917,0.917,0.989,,1.0,0.896,0.917,1.0,1.0,0.979
+NP_071882.1,,1.0,,,0.857,1.0,,0.857,1.0,1.0,1.0,1.0
+NP_000272.1,1.0,1.0,1.0,1.0,0.5,,0.0,0.5,1.0,0.5,1.0,1.0
+NP_077010.1,0.8,0.933,0.8,1.0,0.933,,0.933,0.833,1.0,1.0,1.0,1.0
+NP_219487.3,1.0,0.768,1.0,0.96,0.96,1.0,0.955,0.75,1.0,0.982,1.0,1.0
+NP_149059.1,1.0,1.0,1.0,0.5,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002730.1,1.0,0.986,0.931,0.986,0.944,1.0,0.889,,0.972,0.944,0.958,0.958
+NP_644807.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_689991.1,,0.778,,,0.833,1.0,,0.944,0.944,0.944,0.944,0.917
+NP_851853.1,0.7,0.6,0.85,1.0,0.8,,,0.9,1.0,1.0,0.9,0.9
+NP_001265868.1,0.5,0.877,0.604,0.667,,,0.788,,0.712,0.939,0.879,0.894
+NP_060547.2,0.222,0.556,0.222,0.0,0.222,0.0,0.222,0.444,0.111,0.0,0.0,0.444
+NP_997235.3,0.938,0.938,0.979,0.964,0.969,1.0,0.927,0.875,0.969,0.979,0.969,0.938
+NP_003611.1,0.857,1.0,,,0.571,0.5,0.571,0.214,0.857,0.714,0.714,0.5
+NP_005179.2,0.944,0.913,0.887,0.959,0.918,0.889,0.938,0.767,0.959,0.979,0.99,0.979
+NP_005521.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_005018.2,0.876,0.788,0.873,0.927,0.936,0.694,,0.912,0.933,0.933,0.933,0.936
+NP_006758.2,0.919,0.867,0.929,0.943,0.933,0.95,,0.79,0.924,0.981,0.99,0.905
+NP_115979.3,0.5,1.0,0.75,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_001005731.1,0.934,0.895,,,0.94,0.911,,0.86,0.981,0.89,0.959,0.938
+NP_055771.4,0.955,1.0,0.884,1.0,,1.0,0.964,1.0,0.929,0.964,1.0,0.857
+NP_036318.1,0.938,0.938,1.0,0.896,1.0,1.0,0.958,1.0,0.917,0.979,0.979,1.0
+NP_940978.2,0.971,0.863,0.879,0.745,0.864,0.667,0.866,0.719,0.627,0.869,0.85,0.804
+NP_387510.1,1.0,0.955,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_722541.1,0.905,0.81,0.857,0.952,1.0,,0.762,0.571,1.0,1.0,1.0,1.0
+NP_001374178.1,1.0,1.0,1.0,1.0,,,,1.0,0.5,1.0,1.0,1.0
+NP_004475.1,0.645,0.29,0.879,0.242,0.669,0.897,0.629,0.306,0.677,0.677,0.484,0.097
+NP_001036068.1,0.5,1.0,0.75,0.375,0.625,,1.0,0.5,0.25,1.0,1.0,1.0
+NP_005013.1,1.0,1.0,1.0,1.0,0.0,1.0,0.0,1.0,0.0,1.0,1.0,1.0
+NP_671729.2,,0.333,,,1.0,,,1.0,0.667,1.0,0.667,1.0
+NP_002159.2,0.625,0.75,0.688,0.75,0.875,0.0,0.875,0.625,0.875,1.0,1.0,0.875
+NP_004473.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0
+NP_006787.2,0.939,0.848,0.875,0.78,0.803,0.879,0.697,0.985,0.864,0.902,0.864,0.939
+NP_000026.2,0.881,0.81,0.762,0.762,0.786,1.0,,0.619,0.714,0.857,0.81,0.952
+NP_001243805.1,0.359,0.795,0.462,0.333,0.872,,,0.949,0.564,0.615,0.59,0.462
+NP_071731.1,0.791,0.855,0.736,0.891,0.591,0.8,0.872,0.727,0.8,0.818,0.855,0.877
+NP_000497.1,1.0,1.0,,1.0,0.75,,1.0,0.875,0.625,1.0,1.0,1.0
+NP_004144.2,0.673,0.964,0.764,0.836,0.964,1.0,,0.927,0.909,0.982,0.927,0.982
+NP_037468.1,0.167,1.0,0.333,0.333,0.333,0.333,0.333,0.5,0.333,0.333,0.333,0.667
+NP_001412.1,0.5,1.0,0.5,1.0,1.0,,,1.0,0.0,1.0,1.0,1.0
+NP_066124.1,0.961,0.946,0.983,0.982,0.924,0.975,0.854,0.961,0.922,0.982,0.978,0.962
+NP_001186181.1,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_001289.1,0.982,0.955,0.987,0.995,0.982,,0.965,0.916,0.971,0.968,0.991,0.99
+NP_005620.1,0.969,0.792,0.96,0.927,0.97,0.925,0.843,0.781,0.974,0.972,0.986,0.996
+NP_002231.1,1.0,0.5,,,1.0,,,0.5,1.0,1.0,1.0,1.0
+NP_056999.2,0.574,0.981,0.546,0.824,0.537,0.519,1.0,0.435,0.782,0.917,0.954,0.898
+NP_002854.3,0.902,0.788,0.871,0.795,0.75,1.0,0.742,0.511,0.924,0.909,0.947,0.886
+NP_000539.2,0.866,0.877,0.873,0.912,0.903,0.91,0.658,0.769,0.913,0.875,0.842,0.827
+NP_000924.3,1.0,0.929,0.988,0.982,0.994,1.0,0.988,0.838,1.0,1.0,0.994,1.0
+NP_060124.2,0.773,0.659,,,0.725,0.642,0.819,0.705,0.7,0.702,0.708,0.708
+NP_004449.1,1.0,1.0,1.0,0.667,,,0.167,0.75,1.0,1.0,1.0,1.0
+NP_036425.1,1.0,1.0,0.964,1.0,1.0,1.0,0.857,0.857,1.0,0.929,1.0,1.0
+NP_003874.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004729.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_775831.2,1.0,1.0,0.795,1.0,1.0,1.0,0.885,0.923,0.974,1.0,1.0,0.821
+NP_001153695.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_996991.1,1.0,1.0,1.0,1.0,,,1.0,1.0,0.0,1.0,1.0,1.0
+NP_775960.4,0.811,1.0,0.92,0.923,0.865,0.889,0.644,0.615,0.853,0.865,0.929,0.955
+NP_000278.3,0.862,0.742,0.971,0.842,0.869,0.792,0.701,0.792,0.796,0.837,0.842,0.805
+NP_113663.2,,0.905,,,0.714,0.833,,0.762,0.762,0.952,1.0,0.905
+NP_003990.1,0.833,1.0,0.0,0.917,1.0,1.0,0.833,0.667,1.0,0.917,0.917,0.75
+NP_000349.1,0.611,0.653,0.535,0.486,0.611,0.667,0.5,0.604,0.694,0.75,0.736,0.611
+NP_689606.2,0.92,0.78,0.94,1.0,0.94,1.0,0.94,0.75,1.0,0.95,1.0,1.0
+NP_004637.1,0.902,0.769,0.92,0.941,0.898,0.965,0.714,0.839,0.887,0.902,0.951,0.934
+NP_006138.1,1.0,1.0,1.0,1.0,1.0,1.0,0.714,0.958,1.0,1.0,1.0,0.988
+NP_056651.1,1.0,0.833,1.0,0.893,0.929,1.0,0.762,0.917,1.0,0.952,0.976,1.0
+NP_055742.2,,0.833,,,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_059488.2,0.0,1.0,0.0,0.0,0.0,,0.0,1.0,0.0,0.0,0.0,0.0
+NP_005907.3,1.0,0.75,1.0,1.0,1.0,0.5,1.0,0.5,1.0,1.0,1.0,1.0
+NP_957705.1,0.853,0.835,0.843,0.894,0.919,0.782,0.805,,0.888,0.937,0.912,0.966
+NP_938143.1,1.0,0.625,1.0,0.875,,,1.0,0.375,0.75,1.0,1.0,1.0
+NP_004290.2,,0.859,,,,,,0.853,0.918,0.706,0.824,0.906
+NP_683720.2,0.875,0.75,0.75,0.75,,0.5,,0.75,1.0,1.0,1.0,1.0
+NP_000407.1,1.0,1.0,1.0,1.0,0.9,1.0,0.8,0.3,0.9,1.0,1.0,1.0
+NP_001186136.1,0.9,1.0,0.9,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002071.2,0.852,0.296,0.778,0.926,1.0,,0.963,0.593,0.926,1.0,1.0,1.0
+NP_919224.1,0.867,0.8,0.848,0.773,0.954,0.852,0.875,0.716,0.927,0.967,0.929,0.944
+NP_000313.2,0.873,0.879,0.854,0.916,,0.986,,0.821,0.936,0.896,0.85,0.892
+NP_001073882.3,0.6,0.733,0.6,0.667,0.867,0.667,0.8,0.2,0.533,0.733,0.8,0.8
+NP_000136.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066382.1,,1.0,,,1.0,,,0.333,1.0,1.0,1.0,1.0
+NP_036610.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.833,1.0,1.0,1.0,1.0
+NP_006168.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,0.714,1.0,1.0
+NP_003986.2,0.542,0.944,0.597,0.75,0.778,,0.806,0.694,0.722,0.833,0.778,0.694
+NP_001759.3,0.75,0.5,0.5,1.0,0.5,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055139.4,1.0,1.0,1.0,0.75,1.0,,0.75,0.812,1.0,1.0,1.0,1.0
+NP_000088.3,0.792,0.917,0.771,0.667,1.0,1.0,1.0,0.792,1.0,1.0,1.0,1.0
+NP_006508.2,0.897,0.747,,,0.928,0.955,0.834,0.889,0.939,0.929,0.953,0.974
+NP_000488.3,0.97,0.809,0.98,0.967,0.967,1.0,0.967,0.832,0.97,0.885,0.918,0.987
+NP_000489.3,0.912,0.715,0.862,0.877,0.931,0.972,0.862,0.908,0.923,0.923,0.954,0.954
+NP_001684.2,1.0,1.0,1.0,1.0,0.967,,1.0,0.933,0.933,1.0,1.0,1.0
+NP_000150.1,0.854,0.752,0.809,0.826,0.882,0.947,,0.627,0.734,0.922,0.896,0.924
+NP_004961.1,0.375,1.0,1.0,1.0,1.0,,1.0,0.812,1.0,0.875,1.0,1.0
+NP_004535.1,0.0,0.6,0.0,1.0,0.6,0.5,,0.6,1.0,0.4,0.6,0.4
+NP_116190.3,1.0,0.75,1.0,1.0,1.0,,0.75,0.875,0.75,1.0,0.75,1.0
+NP_001687.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005984.3,0.862,0.803,0.878,0.849,0.888,1.0,0.928,0.871,0.961,0.947,0.934,0.98
+NP_003045.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_004689.1,0.958,0.958,0.958,1.0,1.0,0.944,0.875,0.875,0.958,1.0,1.0,1.0
+NP_714544.1,0.892,0.831,0.846,0.877,,0.85,0.738,0.923,0.862,0.985,0.969,0.954
+NP_060083.1,0.981,0.963,1.0,1.0,1.0,1.0,0.944,0.926,0.37,1.0,0.981,0.963
+NP_005248.2,1.0,0.643,0.929,1.0,1.0,1.0,,0.875,1.0,1.0,1.0,1.0
+NP_001034302.2,0.75,0.833,,,0.667,,,,0.833,1.0,1.0,0.667
+NP_001813.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_057097.2,0.95,0.6,0.95,0.7,1.0,1.0,1.0,0.8,1.0,0.8,0.9,0.9
+NP_001093326.2,0.722,0.833,,,,,0.792,0.944,0.708,1.0,1.0,0.917
+NP_006008.1,0.768,0.643,0.768,0.786,0.554,0.571,0.75,0.5,0.857,0.571,0.786,0.786
+NP_001035835.1,1.0,,1.0,1.0,1.0,,1.0,1.0,0.857,,1.0,1.0
+NP_000198.1,1.0,,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002428.1,0.583,0.75,0.75,1.0,0.917,0.889,,0.583,0.917,0.917,0.917,0.917
+NP_079426.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055529.2,0.806,0.846,0.818,0.873,0.822,0.836,0.817,0.895,0.869,0.863,0.854,0.906
+NP_006740.1,0.938,0.958,0.944,0.944,0.903,1.0,,0.931,0.861,0.958,0.917,0.986
+NP_001683.2,1.0,0.667,0.944,0.889,1.0,,0.889,0.611,0.889,0.889,0.944,0.944
+NP_001352465.1,0.865,0.927,0.893,0.786,0.851,,0.733,0.752,0.901,0.875,0.883,0.914
+NP_004580.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0
+NP_002866.2,1.0,0.938,1.0,0.5,1.0,1.0,0.5,0.312,0.875,1.0,1.0,1.0
+NP_001124497.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_005140.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005645.1,,0.667,,,0.985,0.987,,0.854,1.0,0.969,0.99,0.959
+NP_001121370.1,0.955,0.932,0.977,0.989,,,,0.773,0.966,1.0,1.0,0.989
+NP_932173.1,0.916,0.973,0.949,0.937,,0.961,0.968,0.973,0.965,0.985,0.987,0.964
+NP_002238.2,,0.867,,,,1.0,0.9,0.95,0.933,0.933,0.967,0.967
+NP_861448.2,1.0,1.0,1.0,1.0,,1.0,1.0,0.889,1.0,1.0,1.0,1.0
+NP_000302.1,0.776,0.885,0.776,0.705,0.933,,,0.718,0.538,0.864,0.879,0.924
+NP_036475.3,0.929,1.0,1.0,1.0,1.0,1.0,1.0,0.786,1.0,1.0,1.0,1.0
+NP_683763.2,0.833,0.889,0.75,0.833,0.875,0.8,1.0,0.889,0.778,0.889,0.889,0.944
+NP_689854.2,0.588,0.912,0.971,0.618,0.706,0.941,0.618,0.838,0.029,0.912,0.941,0.971
+NP_056517.1,0.833,1.0,0.75,1.0,1.0,,,0.917,0.833,1.0,0.833,1.0
+NP_001278.1,0.976,0.917,0.963,0.978,0.94,0.93,0.919,0.872,0.963,0.978,0.963,0.987
+NP_722540.1,1.0,1.0,1.0,1.0,,,1.0,1.0,0.75,1.0,1.0,1.0
+NP_005947.3,0.926,1.0,,,,,0.926,0.574,1.0,0.926,0.963,0.926
+NP_001369746.2,0.933,0.4,0.8,0.667,1.0,,1.0,0.867,0.667,0.667,0.733,0.667
+NP_077282.3,1.0,0.899,1.0,0.955,0.929,0.972,0.879,0.894,0.96,0.97,0.97,1.0
+NP_076968.2,0.625,0.062,0.688,0.875,1.0,,0.375,0.75,0.875,0.875,0.812,0.812
+NP_004035.2,0.917,0.75,0.958,1.0,1.0,1.0,1.0,0.792,1.0,1.0,1.0,0.917
+NP_071437.3,0.812,0.562,0.812,0.938,0.5,,0.667,0.75,0.875,0.938,0.875,0.875
+NP_000168.1,0.868,0.701,0.885,0.897,0.902,1.0,0.874,0.828,0.954,0.908,0.92,0.943
+NP_003156.1,0.904,0.636,0.917,0.884,0.896,0.942,0.755,0.543,0.893,0.927,0.913,0.942
+NP_524144.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_996879.1,0.955,1.0,0.818,0.909,1.0,,0.909,0.818,0.727,0.727,0.909,0.909
+NP_000249.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_001002841.1,0.0,1.0,,,0.0,0.0,,0.0,0.5,0.0,0.0,0.0
+NP_001116293.1,0.5,0.0,0.5,1.0,1.0,,0.5,0.5,0.5,1.0,1.0,1.0
+NP_002001.1,1.0,0.5,1.0,0.75,0.75,0.5,0.5,0.75,0.5,1.0,1.0,1.0
+NP_064576.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000206.2,0.852,0.679,0.77,0.837,0.883,0.8,0.934,0.851,0.888,0.872,0.908,0.959
+NP_056170.2,1.0,0.5,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_057390.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0
+NP_003830.1,,0.833,,,,1.0,0.9,0.815,0.933,0.9,0.933,0.833
+NP_001366429.1,0.634,0.976,0.634,0.951,,,0.634,0.756,0.951,0.195,0.78,0.659
+NP_001188356.1,0.667,1.0,,,,,,0.333,1.0,1.0,1.0,1.0
+NP_008880.2,1.0,0.929,1.0,0.982,0.964,1.0,0.964,0.893,0.911,1.0,1.0,1.0
+NP_000968.2,1.0,1.0,1.0,1.0,1.0,,,0.5,1.0,1.0,1.0,1.0
+NP_001616.1,0.982,0.714,0.889,0.833,1.0,1.0,0.929,0.857,0.964,1.0,1.0,1.0
+NP_064569.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_058633.2,0.486,0.457,,,,,0.571,0.514,0.571,0.229,0.229,0.457
+NP_000101.2,1.0,1.0,1.0,1.0,0.75,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078804.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.167,1.0,1.0,1.0,1.0
+NP_078868.1,1.0,0.5,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_060033.3,0.9,0.6,0.55,0.55,0.3,0.5,0.5,0.85,1.0,0.7,0.9,0.6
+NP_003784.2,1.0,0.857,0.929,0.714,1.0,1.0,0.929,0.929,1.0,0.857,0.786,1.0
+NP_004937.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.909,1.0,1.0,1.0,1.0
+NP_001089.1,0.75,0.722,0.444,0.667,0.611,,0.667,0.667,0.5,0.5,0.611,0.611
+NP_001269862.1,1.0,1.0,1.0,1.0,1.0,1.0,0.524,0.952,1.0,1.0,1.0,1.0
+NP_008853.3,0.919,0.973,0.912,0.891,0.903,0.925,0.716,0.693,0.957,0.93,0.927,0.945
+NP_001307679.1,0.786,0.929,0.786,0.857,0.94,1.0,0.81,0.929,1.0,0.881,0.952,1.0
+NP_000127.2,0.875,0.806,0.944,0.917,0.833,1.0,,0.722,0.389,0.889,0.917,0.972
+NP_006484.2,,1.0,,,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0
+NP_001182192.1,,1.0,,,0.583,1.0,,0.792,1.0,1.0,0.933,0.967
+NP_963925.2,0.969,0.854,0.969,0.979,0.854,1.0,0.708,1.0,0.917,1.0,0.979,1.0
+NP_001157281.1,0.923,0.846,0.962,1.0,0.923,0.5,0.885,0.667,0.654,0.923,0.962,0.962
+NP_055908.1,,1.0,,,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_001275582.1,0.929,0.571,,,,,0.714,0.429,0.929,0.857,0.929,0.857
+NP_009029.3,0.833,0.833,,,0.389,,0.833,0.667,0.944,0.222,0.278,0.778
+NP_001035202.1,0.971,0.883,0.993,0.891,,,0.942,0.875,0.899,0.986,0.942,0.928
+NP_060764.3,0.867,0.933,1.0,0.733,0.8,,0.867,0.733,1.0,0.867,0.8,0.733
+NP_066925.1,0.786,1.0,0.762,1.0,1.0,1.0,0.857,0.857,1.0,0.905,1.0,0.905
+NP_002464.1,0.862,0.84,0.908,0.894,0.882,0.938,0.826,0.791,0.883,0.944,0.918,0.911
+NP_000359.1,0.962,0.938,0.943,0.978,0.919,0.947,,0.867,0.956,0.948,0.974,0.939
+NP_064445.2,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_005955.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_733774.1,1.0,0.9,1.0,0.95,1.0,1.0,0.95,0.4,1.0,1.0,1.0,1.0
+NP_004073.2,0.947,0.801,0.95,0.907,0.916,0.933,0.876,0.764,0.901,0.925,0.907,0.969
+NP_066287.2,0.952,0.982,0.912,0.842,0.848,0.791,0.845,0.645,0.952,0.973,0.957,0.983
+NP_443108.1,0.783,0.807,0.781,0.771,0.646,0.84,0.813,0.833,0.887,0.907,0.92,0.9
+NP_000013.2,0.99,0.966,0.99,0.995,0.992,1.0,0.956,0.906,1.0,0.995,0.985,0.961
+NP_001013860.1,0.901,0.789,0.829,0.908,,1.0,0.421,0.895,0.921,0.921,0.974,0.842
+NP_005410.1,0.714,0.857,0.929,1.0,0.857,1.0,0.857,1.0,1.0,0.714,0.857,0.857
+NP_644805.1,0.756,0.828,0.769,0.884,0.83,0.801,,0.691,0.875,0.76,0.831,0.819
+NP_061950.2,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0
+NP_000410.2,0.912,0.856,0.881,0.934,0.942,0.961,0.831,0.806,0.937,0.904,0.904,0.973
+NP_000071.1,0.923,0.747,0.846,0.918,0.956,0.971,,0.912,0.984,0.978,0.989,0.967
+NP_055713.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_036470.1,0.75,0.833,0.833,1.0,1.0,1.0,,0.833,0.75,1.0,1.0,1.0
+NP_001120693.1,0.987,0.888,,,,0.968,0.899,0.87,0.981,0.955,0.979,0.927
+NP_000712.2,0.922,0.951,0.814,0.883,0.921,0.946,,0.82,0.974,0.937,0.957,0.917
+NP_001439.2,0.0,0.25,0.0,0.0,0.0,0.333,0.0,0.75,0.0,0.0,0.0,0.25
+NP_861454.2,0.906,0.75,0.938,0.844,0.929,,0.688,0.531,0.938,0.906,1.0,0.938
+NP_006293.2,,0.786,,,0.786,0.8,,0.643,1.0,0.929,0.929,1.0
+NP_714928.1,0.97,0.667,0.545,0.818,0.97,1.0,0.879,0.727,1.0,0.879,1.0,0.939
+NP_001070984.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_009051.1,1.0,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_001094832.1,0.802,0.429,0.786,0.738,0.663,0.772,0.913,,0.429,0.73,0.698,0.738
+NP_004551.2,0.789,0.669,0.797,0.797,0.817,0.943,0.766,0.586,0.76,0.76,0.806,0.834
+NP_000172.2,0.974,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001032.2,0.933,0.933,0.958,0.933,0.9,,0.983,0.95,0.917,0.942,0.933,0.933
+NP_001287829.1,0.875,0.5,0.625,0.875,,1.0,0.75,0.75,0.25,1.0,0.75,0.875
+NP_064425.2,0.923,1.0,0.923,0.731,0.692,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_004619.3,1.0,0.938,0.844,0.938,1.0,1.0,1.0,0.969,0.75,0.875,1.0,1.0
+NP_000018.2,1.0,0.812,0.979,0.979,1.0,1.0,0.917,0.792,0.979,1.0,1.0,1.0
+NP_001008701.1,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112224.1,1.0,0.857,1.0,0.857,0.857,1.0,0.429,0.429,0.286,0.857,0.857,0.714
+NP_002963.2,0.273,0.273,0.364,0.182,0.429,,0.455,0.727,0.182,0.364,0.273,0.273
+NP_000751.1,0.773,0.727,0.848,0.758,0.667,,,0.712,0.818,0.758,0.788,0.909
+NP_705690.2,1.0,1.0,1.0,1.0,0.833,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_006615.2,0.818,0.719,0.847,0.705,0.896,,,0.46,0.761,0.83,0.864,0.886
+NP_054772.1,0.917,0.667,0.896,1.0,1.0,,1.0,1.0,1.0,0.875,0.958,1.0
+NP_004420.1,1.0,1.0,1.0,1.0,0.978,1.0,0.867,1.0,1.0,1.0,1.0,1.0
+NP_001233.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000834.2,0.887,0.923,0.893,0.952,0.946,1.0,0.875,,0.899,0.964,0.982,0.97
+NP_000325.4,0.898,0.995,0.887,0.914,0.91,1.0,0.894,0.829,0.952,0.961,0.963,0.989
+NP_015564.5,1.0,0.98,,,0.925,0.909,0.682,0.9,1.0,0.927,0.836,0.982
+NP_000739.1,0.825,1.0,0.908,0.767,0.667,0.979,0.75,0.692,0.9,0.85,0.833,0.9
+NP_006350.1,0.944,0.847,0.972,0.986,0.986,0.972,0.931,0.785,1.0,1.0,1.0,0.986
+NP_001230106.1,0.5,0.5,1.0,1.0,,,,0.5,1.0,1.0,1.0,1.0
+NP_001934.2,0.92,0.907,0.927,0.913,0.933,0.891,0.973,0.88,0.873,0.993,0.953,0.953
+NP_061949.3,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0
+NP_002171.2,0.925,0.903,0.888,0.954,0.877,0.979,0.947,0.871,0.913,0.913,0.946,0.952
+NP_078875.4,0.65,1.0,0.98,0.86,1.0,0.375,0.96,0.98,0.42,0.98,0.84,0.96
+NP_000255.2,0.934,0.914,0.928,0.952,0.923,0.953,0.77,0.793,0.943,0.951,0.962,0.958
+NP_000243.1,0.967,0.883,0.974,0.97,0.986,0.973,0.899,0.861,0.955,0.988,0.974,0.959
+NP_061176.4,0.882,0.647,0.824,0.647,0.941,0.0,0.294,0.853,1.0,0.882,0.882,0.882
+NP_758440.1,0.773,,0.909,0.909,1.0,0.75,0.727,0.864,1.0,1.0,1.0,1.0
+NP_005325.2,0.843,0.886,0.854,1.0,0.9,0.743,0.929,0.911,0.986,0.729,0.836,0.829
+NP_055569.1,0.444,0.611,0.458,0.444,0.528,0.533,0.694,0.611,0.611,0.611,0.583,0.75
+NP_000487.1,1.0,0.9,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001745.2,0.946,0.874,0.915,0.954,,,0.993,0.833,1.0,1.0,0.998,1.0
+NP_005351.2,0.956,0.945,0.912,0.912,,0.654,1.0,0.908,0.978,0.89,0.978,0.89
+NP_000039.2,0.888,0.954,0.888,0.944,0.958,0.98,,0.988,0.967,0.967,0.977,0.971
+NP_009057.1,1.0,0.96,1.0,1.0,0.96,,0.96,0.48,1.0,1.0,1.0,1.0
+NP_003394.1,1.0,1.0,1.0,0.929,1.0,1.0,1.0,1.0,1.0,0.857,1.0,1.0
+NP_001264.2,1.0,0.901,0.95,0.95,0.756,0.902,0.878,0.79,0.996,0.947,0.966,0.968
+NP_075066.1,0.75,0.5,0.875,0.75,0.5,,0.75,0.5,1.0,1.0,1.0,1.0
+NP_003089.1,0.857,0.143,0.929,1.0,0.857,0.667,1.0,0.929,0.857,0.714,0.857,0.857
+NP_000021.1,0.917,0.967,0.943,0.897,0.953,1.0,0.918,0.777,0.936,0.938,0.97,0.939
+NP_000362.1,,0.823,,,0.706,1.0,,0.691,0.702,0.568,0.646,0.72
+NP_036450.1,1.0,0.786,0.929,0.929,1.0,1.0,1.0,0.643,1.0,1.0,1.0,1.0
+NP_009193.2,0.722,0.889,,,0.778,,,0.833,0.556,1.0,1.0,0.778
+NP_573400.3,1.0,0.143,0.5,0.667,0.0,1.0,0.286,0.286,0.857,0.143,0.286,0.714
+NP_079535.4,1.0,,1.0,1.0,,,,0.5,1.0,1.0,1.0,0.5
+NP_004764.1,1.0,1.0,,,1.0,,,1.0,1.0,,1.0,1.0
+NP_003891.1,0.908,0.65,0.792,0.883,0.817,0.722,0.9,0.804,0.883,0.967,0.933,0.817
+NP_570901.3,0.794,0.871,0.835,0.729,0.776,1.0,0.922,0.682,0.878,0.9,0.922,0.933
+NP_036205.1,0.5,0.5,1.0,0.0,0.5,,0.5,0.75,0.0,0.5,0.0,0.5
+NP_065737.2,0.857,0.667,0.952,0.857,0.952,1.0,,0.81,0.857,0.857,0.81,1.0
+NP_004951.1,0.819,0.574,0.917,0.954,0.705,0.938,0.713,0.69,0.815,0.861,0.907,0.611
+NP_001092692.1,0.417,0.833,0.5,0.708,0.5,1.0,0.958,0.812,0.75,0.75,0.792,0.625
+NP_001019801.3,0.984,0.857,0.667,0.667,0.952,,1.0,0.556,0.968,0.968,0.968,0.952
+NP_000301.1,,0.788,,,1.0,0.969,,0.985,1.0,1.0,1.0,0.97
+NP_001005271.2,0.921,0.93,0.858,0.909,,,0.953,0.689,0.96,0.948,0.95,0.955
+NP_001158233.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000264.2,0.765,0.878,0.862,0.888,0.878,0.829,0.796,0.939,0.867,0.929,0.867,0.939
+NP_005449.5,0.777,0.88,0.802,0.86,0.742,0.709,0.746,0.578,0.855,0.848,0.845,0.784
+NP_002440.2,1.0,1.0,1.0,1.0,0.25,1.0,0.917,0.917,1.0,1.0,1.0,1.0
+NP_055282.1,0.778,1.0,0.667,0.778,0.778,0.8,1.0,0.889,0.889,0.778,1.0,0.667
+NP_004514.2,0.857,0.831,0.956,0.934,0.787,0.847,0.809,0.846,0.949,0.926,0.882,0.934
+NP_115881.3,1.0,1.0,1.0,1.0,1.0,,,1.0,0.0,1.0,1.0,1.0
+NP_001244941.2,0.926,0.972,0.917,0.722,,,0.852,0.889,0.926,0.833,0.852,0.944
+NP_001165558.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0
+NP_000053.2,0.978,0.951,0.966,0.971,0.989,,0.892,0.931,0.981,0.907,0.946,0.987
+NP_065811.1,0.95,0.986,0.908,0.965,0.943,0.972,,0.975,0.97,0.965,0.968,0.967
+NP_003168.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_689677.1,0.9,1.0,0.8,0.5,0.8,,0.4,0.7,0.7,0.9,0.9,1.0
+NP_060595.3,0.667,1.0,0.667,0.667,1.0,,1.0,1.0,0.333,0.667,0.667,0.667
+NP_004688.2,0.75,1.0,1.0,1.0,0.25,1.0,1.0,0.75,0.5,1.0,0.75,0.75
+NP_000584.3,,,,,,,,,0.944,1.0,1.0,1.0
+NP_858058.1,0.417,1.0,0.75,0.833,0.5,1.0,0.833,0.5,0.833,0.667,0.667,0.917
+NP_652763.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_004162.2,0.781,0.796,0.677,0.765,0.743,0.717,0.663,0.694,0.748,0.769,0.789,0.83
+NP_078789.2,1.0,1.0,0.974,1.0,0.692,,1.0,0.962,1.0,0.897,0.974,0.923
+NP_005546.2,0.818,0.97,0.977,0.833,0.924,1.0,0.97,0.826,0.955,0.97,0.97,0.985
+NP_775109.2,0.714,1.0,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005545.1,0.942,0.949,0.939,0.865,0.994,0.972,0.936,0.891,0.962,1.0,1.0,1.0
+NP_060116.2,0.833,0.917,1.0,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056508.2,0.846,1.0,0.923,1.0,0.923,0.833,0.846,1.0,0.923,1.0,0.923,0.923
+NP_056155.1,0.953,0.797,0.938,0.93,0.934,0.879,,0.797,0.93,0.891,0.875,0.969
+NP_001355738.1,,,,,,,,,1.0,1.0,1.0,1.0
+NP_001034639.1,0.75,1.0,1.0,0.75,0.75,1.0,0.875,0.75,0.75,0.875,0.875,0.75
+NP_001010867.1,0.905,1.0,0.952,0.857,0.905,0.889,0.952,0.905,0.762,1.0,1.0,1.0
+NP_000298.3,0.806,1.0,0.917,0.583,,1.0,,1.0,0.889,1.0,1.0,0.889
+NP_115584.1,1.0,0.833,0.833,1.0,0.833,1.0,1.0,0.833,1.0,0.833,0.833,1.0
+NP_006554.1,0.944,1.0,1.0,1.0,0.111,1.0,1.0,0.889,1.0,1.0,1.0,1.0
+NP_056146.1,0.167,0.5,0.167,0.0,0.167,0.5,0.5,0.333,0.0,0.0,0.0,0.167
+NP_694551.1,0.875,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,0.75
+NP_001091981.1,0.766,0.719,0.786,0.719,0.767,0.85,0.927,0.566,0.833,0.844,0.844,0.885
+NP_000684.2,1.0,0.925,1.0,1.0,1.0,1.0,0.9,0.875,1.0,1.0,1.0,1.0
+NP_056496.1,0.5,1.0,0.625,0.75,1.0,,0.375,0.625,0.75,0.75,0.75,0.625
+NP_006406.1,0.783,0.85,0.85,0.817,0.9,0.889,0.85,0.567,0.933,0.9,0.917,0.95
+NP_510965.1,0.917,1.0,0.917,0.917,1.0,1.0,1.0,1.0,0.944,0.917,0.917,0.972
+NP_005564.1,1.0,1.0,1.0,0.9,1.0,0.8,1.0,0.475,1.0,1.0,1.0,1.0
+NP_001341530.2,,,,,,,,,0.97,0.909,0.985,0.97
+NP_001071.1,0.841,0.635,0.855,0.953,0.949,0.98,0.925,0.819,0.996,0.98,0.982,0.98
+NP_000374.1,0.948,0.754,0.866,0.817,0.829,0.962,0.762,0.904,0.922,0.949,0.965,0.92
+NP_065857.1,1.0,1.0,0.833,1.0,1.0,,1.0,0.833,0.667,1.0,1.0,0.833
+NP_982301.1,0.902,0.833,0.966,0.947,0.883,0.917,0.694,0.591,0.856,0.942,0.97,0.932
+NP_000199.2,0.828,0.656,0.867,0.859,0.891,1.0,0.969,0.429,1.0,0.906,0.953,0.984
+NP_056092.2,0.941,0.712,0.931,0.841,0.902,0.964,0.538,0.76,0.856,0.912,0.838,0.819
+NP_061951.1,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_061966.1,0.795,,0.705,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_000506.2,0.611,0.5,0.667,0.722,0.533,0.5,0.889,0.167,0.556,0.722,0.778,0.611
+NP_003854.1,1.0,1.0,1.0,1.0,,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_741960.1,0.625,1.0,1.0,0.875,0.875,0.75,0.375,0.188,0.812,0.938,0.812,0.75
+NP_005557.1,1.0,0.5,1.0,0.5,1.0,1.0,,0.5,0.0,1.0,0.5,1.0
+NP_057119.3,0.964,0.875,0.964,0.964,0.857,0.714,0.929,0.92,0.732,0.9,0.982,0.964
+NP_542119.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_954712.1,0.667,0.75,0.611,0.75,0.472,0.667,0.417,0.556,0.361,0.556,0.556,0.861
+NP_001050.1,0.667,1.0,0.667,1.0,1.0,1.0,0.667,0.75,1.0,0.833,0.833,0.833
+NP_982272.2,0.712,0.769,,,0.75,0.6,0.75,0.523,0.75,0.692,0.692,0.712
+NP_004437.2,0.944,1.0,0.917,0.806,0.778,1.0,0.833,0.806,0.972,1.0,0.889,0.889
+NP_006736.1,1.0,1.0,1.0,1.0,1.0,,0.8,0.7,1.0,1.0,1.0,1.0
+NP_003033.3,,0.915,,,0.902,0.905,,0.796,0.921,0.88,0.902,0.936
+NP_000169.1,0.5,1.0,0.5,0.667,0.833,,,0.833,0.833,0.5,0.333,0.667
+NP_003578.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_003745.1,1.0,1.0,0.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_068741.1,1.0,0.778,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001092744.1,,0.946,,,0.875,1.0,,0.893,0.321,0.964,1.0,1.0
+NP_001895.1,,0.81,,,0.857,0.905,,0.583,0.952,0.905,0.905,0.857
+NP_443076.2,0.95,0.75,0.7,0.95,0.9,1.0,1.0,0.9,0.95,0.9,1.0,1.0
+NP_714915.3,0.96,0.99,0.97,0.98,1.0,0.887,0.993,0.973,0.937,0.993,0.983,0.983
+NP_001070884.2,0.5,0.5,,,,,0.5,0.75,0.63,0.741,0.667,0.778
+NP_000185.1,0.952,1.0,0.952,0.881,0.952,1.0,0.571,0.381,0.714,0.952,0.952,0.952
+NP_000066.1,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057118.1,0.714,1.0,0.714,0.857,0.929,,,1.0,1.0,1.0,0.929,1.0
+NP_001313340.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001211.3,0.829,0.976,0.935,0.915,0.587,0.917,,,0.854,0.846,0.866,0.996
+NP_001358001.1,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001130.1,0.815,0.917,0.87,0.722,0.894,1.0,,0.907,0.935,0.935,0.907,0.954
+NP_006354.2,0.933,0.622,0.878,0.756,0.956,0.5,,0.567,0.933,0.933,1.0,1.0
+NP_005041.1,1.0,1.0,0.972,0.944,0.938,1.0,0.944,0.917,1.0,0.972,0.972,1.0
+NP_000700.1,0.978,0.822,0.944,0.938,0.958,,0.881,0.799,0.967,0.978,0.983,0.986
+NP_038202.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.875,0.75,1.0,1.0,1.0
+NP_000364.1,0.75,1.0,0.875,1.0,0.75,,1.0,0.875,1.0,0.75,0.75,0.75
+NP_078824.2,0.462,0.615,0.538,0.692,0.962,0.25,0.231,0.692,0.385,0.538,0.462,0.769
+NP_079011.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_443077.1,0.839,0.768,0.893,0.946,0.982,,0.786,0.804,1.0,0.946,0.911,0.982
+NP_056090.1,1.0,0.7,1.0,1.0,,,0.7,0.8,1.0,1.0,0.9,0.9
+NP_055472.1,0.5,1.0,0.5,0.0,1.0,0.0,,0.5,1.0,0.5,0.5,0.5
+NP_079509.5,0.8,0.9,,,1.0,1.0,1.0,0.7,1.0,0.9,1.0,1.0
+NP_005756.2,,0.75,,,0.5,0.5,1.0,0.875,0.5,1.0,1.0,1.0
+NP_003696.2,1.0,1.0,1.0,1.0,1.0,,0.667,0.833,1.0,1.0,1.0,1.0
+NP_000010.1,0.994,0.733,0.951,0.988,1.0,0.974,1.0,0.895,1.0,1.0,1.0,1.0
+NP_057043.1,1.0,1.0,0.5,0.0,0.5,,1.0,1.0,1.0,0.0,0.5,1.0
+NP_003110.1,,0.859,,,0.961,0.92,0.877,0.823,0.991,0.973,0.977,0.964
+NP_001080.2,0.97,0.906,0.974,0.932,0.959,0.844,0.949,0.891,0.932,0.972,0.868,0.962
+NP_077718.3,1.0,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0
+NP_002993.1,0.928,0.981,0.942,0.99,0.98,0.962,0.904,0.861,0.923,0.981,0.99,0.962
+NP_006021.2,0.889,0.667,1.0,1.0,,,0.444,0.286,1.0,1.0,1.0,1.0
+NP_004587.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0
+NP_000912.3,0.955,0.909,0.773,0.909,0.636,0.75,0.909,0.773,0.864,1.0,0.864,0.909
+NP_060579.3,1.0,0.929,1.0,1.0,0.857,,0.857,0.792,0.857,0.929,0.857,1.0
+NP_004705.1,1.0,0.5,,,1.0,,,0.833,0.667,0.833,1.0,1.0
+NP_000242.1,0.841,0.77,0.83,0.843,0.847,0.853,0.757,0.714,0.823,0.888,0.896,0.888
+NP_003066.2,1.0,1.0,0.708,1.0,1.0,,1.0,0.875,1.0,1.0,1.0,1.0
+NP_689726.3,1.0,1.0,1.0,1.0,1.0,,0.857,1.0,1.0,1.0,1.0,0.857
+NP_115785.1,1.0,0.6,0.967,1.0,1.0,1.0,0.833,0.733,0.967,0.933,0.967,1.0
+NP_000812.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060311.1,1.0,1.0,1.0,1.0,,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_898871.1,0.942,0.917,0.983,1.0,0.983,0.897,1.0,1.0,1.0,0.975,0.992,1.0
+NP_060814.4,,0.845,,,0.893,,,0.952,1.0,0.922,1.0,1.0
+NP_036580.2,1.0,1.0,0.0,0.667,0.0,1.0,0.333,1.0,1.0,0.667,0.667,1.0
+NP_002212.3,0.333,0.481,0.407,0.148,0.333,,,0.259,0.556,0.296,0.407,0.407
+NP_067641.2,0.8,0.9,0.8,0.8,0.733,,0.6,0.9,0.9,0.7,0.733,0.8
+NP_000952.1,1.0,1.0,0.5,1.0,1.0,,0.5,1.0,0.5,1.0,0.5,1.0
+NP_005133.2,0.571,0.857,1.0,0.857,,,1.0,1.0,0.429,1.0,0.571,1.0
+NP_001186728.1,1.0,1.0,1.0,0.9,0.95,1.0,1.0,1.0,0.9,1.0,1.0,1.0
+NP_036325.2,0.838,0.888,0.837,0.888,0.9,1.0,0.95,0.812,0.862,0.925,0.938,0.95
+NP_001352.2,1.0,0.625,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_061923.2,0.81,0.952,1.0,1.0,0.952,1.0,,0.881,1.0,1.0,1.0,1.0
+NP_001159762.1,0.779,0.627,0.735,0.882,,,,0.51,0.843,0.823,0.812,0.865
+NP_005711.1,1.0,0.833,1.0,1.0,,1.0,0.167,0.667,1.0,1.0,1.0,1.0
+NP_997005.1,0.884,0.884,0.861,0.919,0.914,0.873,,0.659,0.919,0.899,0.894,0.904
+NP_057278.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,0.667
+NP_001351952.1,1.0,1.0,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0
+NP_001137282.1,1.0,1.0,0.75,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_597680.2,0.99,0.938,1.0,0.979,0.958,1.0,1.0,0.958,1.0,1.0,1.0,1.0
+NP_690870.3,0.8,1.0,0.9,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001138545.1,1.0,1.0,,,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_057288.1,1.0,,,,,,0.929,,0.857,0.929,0.929,0.929
+NP_991331.1,0.867,1.0,0.91,0.905,,,0.657,0.79,0.962,0.981,0.952,0.867
+NP_001783.2,0.7,0.5,0.6,0.867,0.567,0.8,0.8,0.833,0.933,0.625,0.733,0.567
+NP_068712.1,0.927,0.996,0.936,0.939,0.888,,0.838,0.922,0.983,0.969,0.953,0.995
+NP_056480.1,0.536,1.0,0.5,0.857,0.929,1.0,0.643,1.0,0.929,0.857,0.857,0.929
+NP_055151.3,0.911,0.857,0.982,0.911,0.982,,0.821,0.875,0.857,0.946,0.929,0.964
+NP_006051.1,,1.0,,,,,,1.0,1.0,1.0,1.0,1.0
+NP_002644.4,1.0,1.0,1.0,1.0,0.5,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_003280.2,0.909,0.8,0.818,0.727,0.833,0.615,,,0.308,0.692,0.692,0.769
+NP_001018005.1,0.667,,0.967,1.0,0.967,,0.967,,0.967,0.967,1.0,1.0
+NP_001511.2,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000806.2,0.944,0.778,0.917,0.778,0.889,1.0,0.833,0.889,0.778,0.778,0.889,1.0
+NP_478059.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005008.2,1.0,0.99,0.827,0.969,,1.0,0.923,0.571,1.0,0.969,0.949,0.99
+NP_001185.3,0.843,0.704,0.824,1.0,1.0,0.967,1.0,,1.0,1.0,1.0,1.0
+NP_114148.3,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_835455.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004202.4,0.926,1.0,1.0,0.985,0.969,0.972,0.918,0.806,0.98,0.98,0.99,1.0
+NP_079519.1,,0.933,,,0.967,1.0,,0.983,0.967,0.983,1.0,1.0
+NP_054858.2,0.884,0.952,0.856,0.892,0.864,0.913,0.856,0.876,0.916,0.936,0.948,0.844
+NP_002759.2,1.0,0.0,0.5,0.0,1.0,,0.0,,1.0,1.0,0.0,1.0
+NP_001230695.2,0.885,0.938,0.919,0.934,0.778,0.991,0.964,0.832,0.924,0.905,0.868,0.941
+NP_002509.2,0.2,0.4,0.4,0.4,0.4,,0.0,0.8,0.6,0.8,0.6,0.8
+NP_001484.1,0.5,1.0,1.0,0.5,0.0,0.5,1.0,0.75,0.5,1.0,1.0,1.0
+NP_112506.2,1.0,0.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002706.1,0.795,,0.974,0.872,0.731,0.818,0.923,0.667,1.0,0.974,0.872,1.0
+NP_068810.3,0.6,1.0,0.2,0.6,0.0,1.0,1.0,1.0,0.0,1.0,1.0,0.6
+NP_001165806.1,,,,,,,,,1.0,1.0,0.857,1.0
+NP_004167.3,1.0,0.875,1.0,1.0,1.0,1.0,0.75,0.812,0.688,1.0,1.0,1.0
+NP_078951.2,0.885,0.962,1.0,0.885,0.885,1.0,0.923,1.0,0.615,0.962,1.0,0.962
+NP_002392.2,1.0,0.0,1.0,1.0,0.0,,0.0,1.0,1.0,1.0,1.0,1.0
+NP_002576.1,0.712,0.758,,,,,0.909,1.0,1.0,1.0,1.0,1.0
+NP_056109.1,0.75,1.0,0.5,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_002920.1,1.0,1.0,1.0,0.667,,0.0,0.667,,1.0,0.667,1.0,1.0
+NP_077744.4,,,,,,,,,0.833,0.908,0.933,0.933
+NP_705831.1,0.375,0.625,0.75,0.75,0.5,,0.375,0.0,0.875,0.25,0.25,0.375
+NP_001681.2,0.833,0.944,0.611,0.778,0.667,0.889,0.861,0.75,0.722,0.944,0.944,0.944
+NP_001219.2,1.0,1.0,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0
+NP_002682.2,0.844,1.0,0.867,0.933,1.0,0.944,,0.867,0.889,0.978,1.0,0.978
+NP_001316485.1,1.0,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_814444.1,1.0,1.0,0.989,0.909,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056385.1,0.781,0.892,0.702,0.852,0.645,0.434,0.731,0.723,0.892,0.896,0.904,0.792
+NP_055509.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_002008.2,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000528.1,1.0,0.7,1.0,0.8,0.3,0.333,0.6,0.2,0.5,0.9,0.9,0.7
+NP_945350.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.833,0.833,1.0
+NP_001182399.1,1.0,1.0,0.952,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_001850.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005340.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001123476.1,0.867,0.8,0.819,0.752,0.877,0.763,1.0,0.871,0.857,0.905,0.914,0.971
+NP_001876.1,1.0,1.0,1.0,1.0,1.0,0.333,,1.0,1.0,1.0,1.0,1.0
+NP_002307.2,0.874,0.696,1.0,0.983,0.989,,0.904,0.77,0.965,1.0,1.0,1.0
+NP_001153839.1,1.0,0.0,1.0,1.0,,,,0.0,1.0,1.0,1.0,0.5
+NP_005468.1,0.98,0.545,0.868,0.995,0.991,1.0,0.995,,0.995,1.0,1.0,1.0
+NP_000434.1,,0.859,,,0.788,0.962,,0.569,0.897,0.923,0.949,0.91
+NP_057953.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_002016.2,0.359,0.531,0.609,0.594,0.484,0.8,0.844,0.469,0.688,0.375,0.625,0.812
+NP_056536.2,0.92,1.0,0.96,1.0,0.82,0.75,0.64,0.9,0.88,0.92,1.0,0.88
+NP_789794.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006381.2,0.889,1.0,0.778,0.889,1.0,,1.0,0.667,0.889,0.889,1.0,1.0
+NP_009225.1,0.878,0.933,0.915,0.484,0.753,0.94,0.858,0.887,0.929,0.88,0.919,0.783
+NP_006388.2,0.96,0.8,0.96,0.8,0.8,0.867,0.72,0.7,0.72,0.92,0.92,1.0
+NP_001928.2,0.917,1.0,0.917,1.0,1.0,,1.0,0.917,1.0,0.833,1.0,0.833
+NP_000214.1,,1.0,,,1.0,,,0.917,1.0,1.0,1.0,0.917
+NP_000154.1,1.0,0.722,0.917,0.556,0.667,0.5,,0.889,1.0,1.0,0.611,0.833
+NP_000269.3,1.0,0.917,1.0,1.0,,,1.0,1.0,0.917,1.0,1.0,1.0
+NP_001104505.1,,0.917,,,0.972,,,1.0,0.861,1.0,0.972,0.944
+NP_000801.1,0.938,1.0,0.812,0.875,0.812,0.25,,0.844,0.812,1.0,1.0,0.812
+NP_005350.1,0.838,0.819,0.976,0.952,0.976,0.964,,0.89,0.905,0.943,0.952,0.895
+NP_001008537.1,0.902,0.768,0.854,0.841,,0.73,,0.427,0.988,0.793,0.915,0.89
+NP_000108.1,0.636,0.545,0.909,1.0,0.591,0.889,0.455,0.727,0.909,0.909,0.818,0.909
+NP_689956.2,0.938,0.778,0.91,0.917,0.944,0.848,0.639,0.833,0.667,0.931,0.972,0.944
+NP_061155.2,0.848,0.652,0.815,0.804,0.87,0.5,0.761,0.598,0.63,0.717,0.717,0.739
+NP_116245.2,0.917,1.0,1.0,0.875,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0
+NP_001123617.1,0.2,1.0,0.8,1.0,1.0,,1.0,0.2,1.0,1.0,1.0,1.0
+NP_004278.2,,0.667,,,,1.0,0.667,0.833,1.0,1.0,1.0,1.0
+NP_055059.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.933,1.0,1.0,1.0,1.0
+NP_000043.4,0.913,0.917,0.9,0.937,0.952,0.891,0.95,0.833,0.923,0.952,0.964,0.959
+NP_000344.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0
+NP_055527.1,0.909,0.818,0.67,0.886,0.955,0.727,0.864,0.75,0.955,0.932,0.909,0.909
+NP_060663.2,0.921,0.974,0.868,1.0,0.786,1.0,0.947,0.921,0.842,0.974,1.0,0.763
+NP_078800.3,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_005700.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.875,1.0,1.0,1.0
+NP_065750.1,0.643,0.762,0.643,0.667,0.762,0.5,,0.762,0.857,0.619,0.619,0.81
+NP_065770.1,0.959,0.973,0.932,1.0,,0.5,0.946,0.875,1.0,1.0,1.0,1.0
+NP_000814.2,1.0,1.0,0.969,1.0,1.0,1.0,0.938,,1.0,1.0,1.0,1.0
+NP_065134.1,0.8,0.7,0.85,0.9,0.9,,0.65,0.9,0.8,0.8,0.8,0.8
+NP_001020279.1,1.0,1.0,1.0,1.0,,,1.0,1.0,0.5,1.0,1.0,1.0
+NP_068838.3,1.0,1.0,1.0,0.6,0.6,1.0,,1.0,0.6,1.0,1.0,1.0
+NP_065970.2,0.807,0.773,0.966,0.977,1.0,1.0,0.699,0.795,0.955,0.966,0.977,0.932
+NP_000450.3,0.913,0.996,0.891,0.976,0.621,0.967,0.871,0.888,0.981,0.939,1.0,0.983
+NP_065712.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_005996.2,0.891,0.888,0.897,0.902,0.914,0.94,,0.818,0.882,0.953,0.955,0.93
+NP_001186036.1,0.933,0.76,0.967,0.925,0.971,0.72,,0.827,0.913,0.942,0.942,0.952
+NP_079029.3,0.939,0.848,0.97,0.909,1.0,0.5,0.909,0.879,0.97,1.0,1.0,1.0
+NP_006054.2,1.0,0.7,0.95,1.0,0.9,1.0,1.0,0.85,1.0,1.0,0.9,1.0
+NP_002233.2,0.333,0.833,0.667,0.667,0.333,0.25,,0.833,0.667,0.833,0.833,0.833
+NP_004076.1,,1.0,,,,,,0.25,1.0,1.0,1.0,1.0
+NP_000334.1,1.0,0.75,1.0,1.0,1.0,1.0,0.917,0.875,1.0,1.0,1.0,1.0
+NP_001596.2,0.777,0.954,0.769,0.877,0.838,0.933,0.892,0.631,0.954,0.962,0.946,0.946
+NP_001271165.2,1.0,1.0,1.0,1.0,0.8,,,1.0,1.0,1.0,1.0,1.0
+NP_001020466.1,0.977,1.0,1.0,0.977,1.0,0.944,1.0,0.955,0.955,1.0,0.955,0.977
+NP_004238.3,1.0,0.739,0.994,0.95,0.928,0.962,,0.583,0.994,0.967,0.983,1.0
+NP_055484.3,1.0,0.6,0.95,0.9,1.0,,1.0,0.7,0.9,1.0,1.0,1.0
+NP_620409.1,0.333,0.778,0.222,0.222,0.444,,0.333,0.556,0.444,0.333,0.333,0.333
+NP_796374.2,1.0,0.75,1.0,1.0,1.0,1.0,1.0,0.25,1.0,1.0,1.0,1.0
+NP_001001557.1,1.0,0.933,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002240.3,1.0,1.0,0.6,1.0,,,1.0,0.7,1.0,1.0,1.0,1.0
+NP_003147.2,0.952,1.0,0.944,1.0,1.0,1.0,0.968,0.921,1.0,0.984,0.984,1.0
+NP_000336.1,0.8,1.0,0.75,0.75,0.8,1.0,0.5,0.7,0.8,0.7,0.8,0.8
+NP_055543.2,1.0,0.867,1.0,0.933,0.9,0.833,0.267,0.733,0.267,0.667,0.8,1.0
+NP_758957.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_003009.2,1.0,0.75,0.972,0.944,,1.0,0.944,0.917,0.222,0.917,0.972,0.972
+NP_000084.3,0.906,0.887,0.908,0.863,0.943,0.919,0.755,0.791,0.919,0.955,0.957,0.915
+NP_060552.4,0.792,0.781,0.833,0.854,0.667,0.571,0.812,0.75,0.802,0.844,0.698,0.854
+NP_002878.2,0.821,0.821,0.857,0.75,0.714,0.667,0.643,0.839,0.821,0.75,0.821,0.75
+NP_008927.1,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000485.3,0.857,0.887,0.84,0.837,0.88,0.95,1.0,0.807,0.813,0.787,0.867,0.833
+NP_001073883.2,0.805,0.805,0.78,0.939,0.732,0.458,0.902,0.654,0.963,0.841,0.988,0.988
+NP_006774.2,0.958,0.958,0.958,1.0,0.917,1.0,,0.875,1.0,0.917,1.0,0.958
+NP_003995.2,0.854,0.839,0.83,0.852,0.902,0.724,,0.907,0.921,0.962,0.97,0.923
+NP_076872.1,0.611,0.667,0.556,0.667,0.528,,,0.681,0.806,0.583,0.611,0.722
+NP_001129668.1,0.833,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055891.1,0.3,0.8,0.6,0.4,1.0,1.0,0.8,1.0,0.0,1.0,1.0,1.0
+NP_001251.1,0.833,0.667,0.833,1.0,0.833,0.833,1.0,0.333,0.833,0.833,0.833,0.833
+NP_115977.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_112210.1,1.0,0.0,1.0,0.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_004950.2,0.939,0.848,,,,1.0,0.97,0.879,0.939,0.939,1.0,0.955
+NP_003027.1,1.0,1.0,1.0,1.0,0.605,0.874,0.976,0.801,0.916,1.0,0.995,0.986
+NP_002326.2,0.943,0.744,0.918,0.955,0.948,0.922,0.827,0.744,0.918,0.963,0.96,0.949
+NP_079130.2,1.0,0.9,1.0,0.9,0.85,1.0,0.9,0.9,0.8,0.9,0.9,0.9
+NP_077740.1,0.846,1.0,1.0,1.0,0.885,1.0,0.769,0.889,1.0,1.0,1.0,1.0
+NP_004603.1,1.0,0.97,1.0,0.996,0.94,0.996,0.85,0.857,1.0,1.0,1.0,1.0
+NP_001098101.1,0.892,0.943,0.881,0.898,0.898,0.909,0.864,0.812,0.989,0.898,0.898,0.886
+NP_003638.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006252.4,0.964,1.0,1.0,0.964,1.0,1.0,1.0,1.0,0.893,1.0,1.0,1.0
+NP_000435.3,0.888,0.885,0.905,0.952,0.936,0.943,0.66,0.79,0.948,0.948,0.971,0.969
+NP_001094896.1,1.0,0.571,0.857,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001104595.1,0.929,0.611,0.964,0.893,,0.93,0.885,0.882,0.962,0.949,0.955,0.924
+NP_001159725.2,,,,,,,,,1.0,1.0,1.0,1.0
+NP_036192.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004521.1,0.833,1.0,0.833,0.833,0.667,0.5,1.0,1.0,0.833,1.0,1.0,1.0
+NP_060021.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_001017535.1,0.938,0.75,0.875,0.875,1.0,,,1.0,1.0,0.812,0.875,0.938
+NP_004269.1,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000365.3,1.0,0.875,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_006693.3,0.911,0.848,0.929,0.924,,,,0.839,0.893,0.875,0.911,0.929
+NP_000159.3,0.893,0.896,0.968,0.86,0.734,0.91,0.693,0.802,0.953,0.95,0.947,0.909
+NP_003796.1,0.8,0.2,0.6,0.8,0.5,,,0.4,0.8,0.6,0.6,0.4
+NP_870998.2,0.429,0.762,0.429,0.048,0.81,0.2,0.571,0.737,0.952,0.905,0.952,0.857
+NP_002009.1,1.0,0.5,1.0,1.0,0.5,1.0,,0.875,0.25,1.0,0.5,1.0
+NP_060352.1,0.855,0.8,1.0,0.927,0.944,,0.909,0.864,0.673,0.927,0.927,0.909
+NP_000254.2,0.906,0.783,0.886,0.974,0.968,0.952,0.892,0.901,0.925,0.937,0.939,0.996
+NP_000444.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0
+NP_001243328.1,0.0,1.0,1.0,1.0,0.0,0.0,,1.0,0.0,0.5,0.5,1.0
+NP_066005.2,0.483,0.517,0.75,0.25,,,0.625,0.593,0.45,0.65,0.617,0.6
+NP_056093.3,0.705,1.0,0.893,1.0,1.0,0.429,0.786,0.875,1.0,1.0,1.0,0.982
+NP_001308031.1,0.864,0.636,0.818,0.727,0.727,1.0,,0.864,1.0,0.818,0.727,0.909
+NP_071505.2,0.0,0.667,0.667,0.333,0.0,0.0,0.333,0.333,0.333,0.333,0.333,0.333
+NP_001548.1,0.778,0.556,0.778,0.667,0.6,1.0,0.667,0.667,0.889,1.0,1.0,0.889
+NP_004990.3,0.8,0.817,0.75,0.767,,0.8,0.642,0.708,0.7,0.767,0.8,0.833
+NP_001124162.1,,1.0,,,,,0.75,0.719,1.0,0.938,0.875,1.0
+NP_443105.2,1.0,1.0,0.917,1.0,,,0.667,0.917,0.333,1.0,0.833,1.0
+NP_000402.3,1.0,0.973,,,0.891,0.848,0.945,0.923,0.977,1.0,1.0,1.0
+NP_055692.3,0.0,1.0,0.5,0.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_066288.2,0.896,0.799,0.819,0.968,0.938,0.991,0.944,0.881,0.97,0.97,0.991,0.972
+NP_001137451.1,1.0,,0.5,0.0,,0.0,,,0.5,1.0,0.5,0.0
+NP_004091.3,0.5,0.667,0.5,0.5,0.833,0.0,,0.667,0.5,1.0,0.833,0.667
+NP_001010892.1,1.0,0.889,0.889,1.0,1.0,1.0,0.889,0.889,1.0,1.0,1.0,1.0
+NP_067045.1,0.85,0.762,0.833,0.833,,1.0,0.857,0.81,0.929,0.952,0.976,0.905
+NP_079017.1,0.929,1.0,0.929,1.0,0.786,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003672.1,0.0,1.0,0.0,0.5,0.5,1.0,1.0,1.0,0.5,,0.5,0.5
+NP_113631.1,1.0,1.0,1.0,1.0,0.938,1.0,0.625,0.625,1.0,0.75,1.0,1.0
+NP_001605.1,0.917,0.583,,,0.893,0.89,,0.857,0.762,0.893,0.893,0.821
+NP_064543.3,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.625,1.0,1.0,1.0,1.0
+NP_000076.2,0.95,,0.962,0.95,0.983,0.952,0.968,0.92,0.976,0.984,0.984,0.98
+NP_000142.2,0.947,0.953,0.962,1.0,0.6,,0.653,0.837,0.887,0.887,0.987,0.913
+NP_996994.1,1.0,1.0,0.773,1.0,,,0.864,1.0,1.0,1.0,1.0,1.0
+NP_003032.1,1.0,1.0,1.0,1.0,0.333,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_640334.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000467.1,1.0,1.0,1.0,1.0,1.0,,,0.75,1.0,1.0,1.0,1.0
+NP_079120.1,1.0,0.8,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0
+NP_079058.1,1.0,1.0,1.0,1.0,1.0,,0.667,1.0,1.0,1.0,1.0,1.0
+NP_659451.1,0.5,0.5,1.0,1.0,0.5,,,1.0,0.5,1.0,1.0,1.0
+NP_000333.1,0.923,0.99,0.92,0.91,0.97,0.909,0.89,0.946,0.99,0.987,0.993,0.977
+NP_006874.1,0.889,0.889,1.0,1.0,1.0,0.889,0.444,0.667,1.0,1.0,1.0,1.0
+NP_002015.1,1.0,0.964,1.0,1.0,1.0,0.75,1.0,0.536,0.964,1.0,1.0,0.964
+NP_071378.1,0.848,0.97,0.758,0.97,0.856,0.833,1.0,0.788,0.924,0.955,0.939,0.924
+NP_005655.1,0.8,1.0,0.8,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0
+NP_060956.1,,0.981,,,,0.948,0.733,0.866,0.997,1.0,0.994,0.981
+NP_000537.3,0.944,0.922,0.906,0.972,0.862,0.982,0.839,0.918,0.964,0.963,0.97,0.978
+NP_001139189.2,0.806,0.926,0.778,0.907,0.817,1.0,0.907,0.685,0.991,0.954,0.991,0.963
+NP_443197.1,,0.5,,,0.5,1.0,1.0,0.75,0.0,1.0,1.0,1.0
+NP_001094385.1,1.0,1.0,0.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_116119.2,0.786,0.81,0.762,0.714,0.714,0.75,0.81,0.667,0.714,1.0,0.952,0.857
+NP_002040.1,0.936,0.842,0.891,0.932,0.902,0.952,0.97,0.831,0.857,0.895,0.932,0.947
+NP_065098.1,1.0,0.889,,,0.528,1.0,0.722,0.833,0.889,0.944,0.944,1.0
+NP_579877.1,0.943,0.885,0.891,0.954,1.0,0.771,,0.874,1.0,0.954,0.954,0.931
+NP_001005741.1,0.983,0.933,0.993,0.99,0.995,1.0,,0.99,0.871,1.0,1.0,1.0
+NP_573568.1,1.0,0.667,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001640.1,0.143,0.857,0.857,0.857,0.571,,1.0,1.0,1.0,0.714,0.714,1.0
+NP_001521.1,0.643,0.643,0.571,0.571,0.571,,0.643,0.786,0.571,0.786,0.786,0.786
+NP_036602.1,1.0,0.0,1.0,0.333,0.667,,0.0,0.0,0.0,0.667,0.667,0.0
+NP_006556.1,1.0,0.972,0.84,1.0,0.795,0.906,,0.74,0.972,0.986,1.0,0.993
+NP_065690.2,1.0,0.778,1.0,1.0,1.0,1.0,1.0,0.889,1.0,1.0,1.0,1.0
+NP_653178.3,0.593,0.926,0.704,0.611,0.815,,0.481,,0.667,0.778,0.815,0.815
+NP_006339.4,,,,,,,,,1.0,1.0,1.0,1.0
+NP_066550.2,0.958,0.822,0.851,0.962,0.879,0.923,0.967,0.758,0.934,0.951,0.966,0.977
+NP_001600.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.938,1.0,1.0,1.0,1.0
+NP_001273540.1,1.0,0.625,0.75,0.375,1.0,,1.0,0.375,0.0,0.5,0.75,0.75
+NP_001029.1,0.917,1.0,0.917,0.889,0.944,1.0,1.0,0.889,0.889,0.944,0.889,0.944
+NP_000874.2,0.611,0.833,1.0,0.833,,,0.722,0.833,1.0,0.778,0.722,0.944
+NP_057075.1,1.0,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0
+NP_000742.1,0.933,1.0,1.0,0.967,0.944,1.0,0.933,0.933,0.867,1.0,1.0,1.0
+NP_001073911.1,0.5,0.933,0.8,0.933,0.9,0.667,0.867,0.867,1.0,0.933,0.933,1.0
+NP_542400.2,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071890.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_002871.1,0.918,0.906,0.902,0.96,0.845,0.847,0.794,0.683,0.976,0.938,0.938,0.964
+NP_000292.1,0.038,0.846,0.0,0.0,0.0,0.0,0.462,0.769,0.0,0.0,0.0,0.0
+NP_055826.1,0.938,0.75,0.938,1.0,1.0,1.0,,0.75,1.0,0.889,1.0,1.0
+NP_002573.1,0.25,1.0,0.25,0.75,0.75,1.0,1.0,0.875,0.5,1.0,0.75,0.75
+NP_922932.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_075462.3,1.0,0.917,1.0,1.0,1.0,1.0,0.714,0.917,1.0,1.0,0.958,1.0
+NP_004264.2,0.981,1.0,0.981,1.0,1.0,1.0,1.0,1.0,0.981,1.0,1.0,1.0
+NP_001019242.1,0.947,0.526,,,0.895,,,1.0,1.0,1.0,1.0,0.947
+NP_004804.2,1.0,0.375,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005506.3,0.917,0.556,0.917,0.944,1.0,1.0,0.833,0.733,1.0,0.944,0.833,1.0
+NP_003029.2,1.0,0.833,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057240.3,1.0,1.0,1.0,1.0,0.857,,0.857,0.5,0.929,1.0,1.0,1.0
+NP_112533.1,0.5,1.0,1.0,1.0,0.5,,,0.0,1.0,1.0,1.0,1.0
+NP_037471.2,1.0,0.667,0.917,0.833,0.952,1.0,,0.521,0.917,1.0,0.958,1.0
+NP_114413.1,0.4,,0.6,1.0,,1.0,1.0,1.0,1.0,1.0,0.8,1.0
+NP_036356.1,0.978,1.0,0.919,0.956,0.647,0.708,0.985,0.868,0.897,0.926,0.926,0.897
+NP_004439.2,0.737,0.684,,,,0.833,0.632,,0.789,0.895,1.0,0.895
+NP_835366.1,,0.987,,,,,,0.263,1.0,1.0,1.0,1.0
+NP_000373.1,0.988,0.956,1.0,1.0,1.0,1.0,1.0,0.794,0.988,0.993,1.0,1.0
+NP_000735.1,0.948,0.892,0.97,0.951,0.959,0.909,0.922,0.616,0.937,0.996,0.996,0.996
+NP_001750.1,0.944,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,0.889
+NP_004422.2,1.0,1.0,0.977,0.773,1.0,1.0,0.773,0.909,1.0,1.0,1.0,1.0
+NP_068603.4,1.0,1.0,1.0,1.0,1.0,0.5,0.929,0.929,1.0,1.0,1.0,1.0
+NP_004435.3,0.907,0.966,0.9,0.926,,0.94,0.908,0.724,0.98,0.966,0.978,0.972
+NP_002290.2,0.917,1.0,1.0,1.0,0.958,,0.958,1.0,1.0,1.0,1.0,1.0
+NP_000451.1,1.0,1.0,1.0,0.833,0.75,,,0.5,0.5,1.0,1.0,0.833
+NP_003712.1,0.8,0.9,1.0,1.0,0.562,1.0,,0.9,1.0,1.0,1.0,1.0
+NP_002283.3,0.952,1.0,0.981,1.0,0.962,1.0,,0.769,0.962,0.981,1.0,1.0
+NP_001191197.1,1.0,,,,,,1.0,1.0,0.667,1.0,1.0,1.0
+NP_006403.2,0.75,1.0,0.417,0.5,0.333,,0.0,0.833,1.0,0.833,0.833,1.0
+NP_742055.1,0.958,0.889,0.958,0.889,1.0,1.0,,0.438,0.889,0.968,0.958,0.875
+NP_001305092.1,,,,,,,,,1.0,1.0,1.0,1.0
+NP_001672.1,1.0,0.956,1.0,1.0,1.0,0.941,1.0,0.603,0.941,1.0,1.0,1.0
+NP_003292.1,1.0,1.0,1.0,1.0,,1.0,,0.5,1.0,1.0,1.0,1.0
+NP_000261.2,1.0,0.625,1.0,1.0,0.833,1.0,1.0,0.771,1.0,1.0,1.0,1.0
+NP_689862.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_619542.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000231.1,0.917,0.833,0.917,0.667,1.0,1.0,0.667,0.917,0.917,0.833,0.917,0.917
+NP_899196.1,1.0,0.967,0.967,0.983,1.0,1.0,,0.908,1.0,0.917,1.0,1.0
+NP_073624.2,0.949,0.872,0.872,0.949,1.0,,1.0,0.795,0.872,0.949,0.949,0.872
+NP_001340274.1,0.904,0.923,,,0.974,1.0,,0.957,0.993,0.996,0.985,0.993
+NP_005206.2,0.967,0.267,0.967,1.0,1.0,0.833,0.533,0.767,1.0,0.933,1.0,1.0
+NP_061905.2,1.0,0.904,0.981,0.942,1.0,0.429,,,1.0,1.0,1.0,0.981
+NP_000068.1,0.883,0.953,0.793,0.992,,1.0,0.902,,0.97,0.97,0.992,0.97
+NP_006185.1,1.0,1.0,1.0,1.0,0.95,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_057195.2,0.981,1.0,0.981,0.962,1.0,1.0,0.981,0.962,0.962,1.0,0.981,1.0
+NP_057006.1,0.976,0.833,0.97,0.964,0.952,0.917,0.905,0.81,0.988,0.94,0.94,0.976
+NP_055577.1,,0.6,,,0.3,,,0.5,1.0,0.6,0.8,1.0
+NP_997229.2,0.832,0.768,0.853,0.684,0.684,1.0,0.674,0.905,0.674,0.958,0.916,0.942
+NP_115980.1,1.0,1.0,0.5,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060521.4,1.0,0.72,0.96,0.96,0.95,,1.0,0.98,0.96,1.0,1.0,0.96
+NP_996917.1,1.0,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0
+NP_006362.1,0.667,1.0,0.833,0.833,0.833,1.0,0.75,0.75,0.833,0.833,0.833,0.667
+NP_004178.2,0.892,0.967,0.921,0.956,0.953,0.938,0.911,0.936,0.97,0.979,0.941,0.976
+NP_116222.4,0.818,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_003026.2,0.889,1.0,1.0,0.889,1.0,1.0,0.833,0.944,0.833,1.0,1.0,1.0
+NP_056321.2,0.902,0.899,0.746,0.949,0.906,0.975,0.801,0.754,0.42,0.906,0.942,0.935
+NP_001258750.1,0.917,0.667,1.0,0.917,0.917,1.0,0.611,0.833,1.0,0.944,0.889,0.889
+NP_005526.1,0.851,0.702,,,0.845,1.0,,0.702,0.94,0.94,0.929,0.845
+NP_002380.3,1.0,1.0,0.875,1.0,0.875,0.75,1.0,0.333,0.75,1.0,1.0,0.875
+NP_000267.2,0.99,0.9,0.991,0.969,0.991,0.99,0.943,0.933,0.982,0.989,0.993,0.941
+NP_004817.2,0.894,0.667,0.886,0.955,0.985,0.933,0.833,0.803,0.879,0.97,0.97,0.939
+NP_001092690.1,0.75,0.25,0.75,0.5,0.625,0.0,0.75,0.25,1.0,0.75,0.75,0.75
+NP_001037.1,0.692,0.385,0.615,0.538,0.846,0.333,0.769,0.731,0.462,0.231,0.385,0.231
+NP_006507.2,0.945,0.891,0.947,0.948,0.928,0.94,0.914,0.678,0.947,0.947,0.964,0.973
+NP_003928.1,0.625,1.0,0.75,0.75,0.75,,,0.75,1.0,0.625,0.5,1.0
+NP_000060.2,0.881,0.948,0.866,0.792,1.0,0.879,0.824,0.583,0.901,0.899,0.939,0.898
+NP_005207.3,1.0,1.0,,,,,1.0,0.625,0.75,0.75,1.0,0.75
+NP_000070.1,0.929,0.964,0.929,0.893,,,0.821,0.75,1.0,1.0,0.964,1.0
+NP_001845.3,0.848,0.955,0.872,0.855,0.95,0.929,0.815,0.718,0.964,0.969,0.971,0.981
+NP_001120680.1,0.94,1.0,0.48,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_955366.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_001073918.2,0.795,0.872,0.859,0.821,1.0,0.75,0.872,0.641,1.0,0.897,1.0,0.974
+NP_006205.1,0.958,0.833,0.958,0.833,0.958,,1.0,0.812,1.0,1.0,1.0,1.0
+NP_079455.3,0.391,0.913,0.848,0.609,0.783,0.2,0.957,0.976,0.826,0.739,0.783,0.652
+NP_112190.2,0.934,0.842,0.908,0.842,0.947,0.8,1.0,1.0,0.974,1.0,0.947,1.0
+NP_000074.3,0.909,0.954,0.929,0.927,0.891,0.957,0.973,0.925,0.955,0.98,0.978,0.989
+NP_001159435.1,0.952,0.949,0.928,0.931,0.933,0.863,0.89,0.794,0.945,0.972,0.967,0.971
+NP_619729.1,0.833,0.8,0.9,0.967,0.7,,0.933,0.767,0.767,0.9,0.867,0.667
+NP_001308943.1,0.907,0.937,0.965,0.993,0.941,0.952,0.948,0.763,0.898,0.946,0.971,0.916
+NP_004827.4,1.0,0.636,1.0,1.0,1.0,1.0,1.0,0.909,1.0,1.0,1.0,1.0
+NP_065822.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_077315.2,1.0,1.0,,1.0,1.0,,0.0,1.0,1.0,1.0,1.0,1.0
+NP_005441.1,0.833,1.0,0.667,0.833,0.778,,0.889,0.722,0.667,0.833,0.889,0.722
+NP_542411.2,0.931,0.862,0.865,0.784,,0.96,,0.785,0.887,0.899,0.9,0.918
+NP_006199.2,0.929,0.804,0.893,0.929,0.982,1.0,,1.0,0.964,1.0,0.982,0.946
+NP_071801.1,0.938,0.875,0.938,1.0,1.0,1.0,0.714,1.0,1.0,1.0,1.0,1.0
+NP_004251.4,0.775,0.76,,,0.778,,0.574,,0.806,0.814,0.783,0.779
+NP_003218.2,0.917,0.944,0.931,0.917,1.0,1.0,1.0,0.986,0.917,0.889,0.889,0.917
+NP_055440.1,0.828,0.312,0.773,0.922,0.906,0.643,1.0,0.969,0.969,0.938,0.984,0.922
+NP_000430.3,1.0,1.0,1.0,0.933,0.733,1.0,0.867,0.833,0.933,0.933,1.0,1.0
+NP_036337.2,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002046.1,,0.873,,,,0.906,0.868,0.757,0.914,0.832,0.857,0.943
+NP_056080.1,0.892,0.877,0.897,0.881,0.914,0.856,,0.736,0.915,0.912,0.927,0.96
+NP_000187.3,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.812,1.0,1.0
+NP_004355.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_003046.2,0.875,1.0,0.906,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_599023.1,1.0,0.667,1.0,1.0,0.333,,0.0,0.5,0.667,1.0,1.0,1.0
+NP_001309863.1,1.0,0.857,1.0,0.833,0.167,0.0,0.857,0.143,1.0,,1.0,1.0
+NP_001162625.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0
+NP_001295205.1,1.0,1.0,,,,,1.0,0.667,1.0,1.0,1.0,1.0
+NP_115569.2,1.0,0.333,1.0,1.0,1.0,1.0,,1.0,0.667,1.0,1.0,1.0
+NP_005227.1,0.821,1.0,0.889,0.778,0.952,1.0,0.857,0.786,0.952,0.905,0.929,0.929
+NP_002725.1,0.75,0.5,0.5,0.75,0.75,1.0,,0.5,0.5,0.75,0.75,1.0
+NP_000228.1,0.949,0.971,0.951,0.956,0.889,0.935,0.917,0.811,0.946,0.959,0.959,0.988
+NP_001123292.1,0.7,1.0,0.55,0.6,0.7,1.0,0.6,,1.0,0.8,0.8,0.9
+NP_001171809.1,0.925,0.89,0.92,0.981,0.942,0.955,0.847,0.816,0.983,0.914,0.952,0.994
+NP_066956.1,0.9,1.0,0.85,0.7,1.0,1.0,0.9,0.9,0.8,1.0,1.0,1.0
+NP_056374.2,,0.967,,,0.882,0.825,,0.909,0.996,0.955,0.947,0.955
+NP_055051.1,0.9,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612396.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005135.2,0.953,0.984,0.914,1.0,0.953,1.0,0.797,1.0,1.0,0.953,1.0,0.938
+NP_002131.2,0.917,1.0,1.0,0.917,0.75,1.0,,0.833,0.917,0.917,1.0,1.0
+NP_057732.2,1.0,0.9,0.9,1.0,1.0,1.0,,0.95,1.0,1.0,1.0,1.0
+NP_000892.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.938,1.0,1.0,1.0,1.0
+NP_000372.1,0.629,0.657,0.643,0.686,0.743,0.933,0.657,0.643,0.714,0.686,0.771,0.743
+NP_005563.1,0.064,0.712,0.125,0.157,0.593,,0.378,0.788,0.357,0.42,0.656,0.115
+NP_055064.1,0.882,0.866,,,0.917,0.972,0.896,0.677,0.764,0.903,0.917,0.917
+NP_001121699.1,0.824,0.865,0.831,0.932,,,0.865,0.676,0.878,0.946,0.959,0.959
+NP_116027.2,0.99,0.982,0.996,0.98,1.0,0.985,,0.899,0.998,0.988,0.998,0.998
+NP_056989.2,0.8,0.6,0.9,1.0,1.0,0.4,1.0,1.0,0.8,0.8,0.8,1.0
+NP_055840.2,,0.923,,,,0.619,0.89,0.837,0.873,0.923,0.913,0.91
+NP_001501.2,0.889,0.722,0.889,1.0,1.0,0.667,0.889,0.75,0.833,1.0,0.944,0.889
+NP_112740.1,0.25,0.25,0.25,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000035.2,0.913,0.944,0.811,0.887,,,0.765,0.952,0.903,0.866,0.903,0.92
+NP_862823.1,1.0,0.857,1.0,1.0,0.714,1.0,0.857,0.929,0.857,1.0,0.857,0.857
+NP_001002295.1,1.0,0.939,1.0,1.0,,,,0.606,0.97,1.0,1.0,1.0
+NP_001120864.1,1.0,1.0,1.0,1.0,,0.875,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000446.1,0.992,1.0,,,,0.992,1.0,0.733,0.967,1.0,1.0,1.0
+NP_001136082.1,0.5,1.0,1.0,1.0,1.0,,,1.0,0.5,1.0,1.0,1.0
+NP_689518.1,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.917,0.833,1.0,1.0,1.0
+NP_005667.2,1.0,1.0,0.929,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001036167.1,,0.75,,,0.75,0.5,,0.25,0.667,1.0,0.917,0.833
+NP_036375.1,,1.0,,,,,0.0,1.0,1.0,1.0,1.0,1.0
+NP_001323.1,0.267,0.667,0.967,0.8,1.0,1.0,0.867,0.933,0.867,1.0,1.0,1.0
+NP_066545.3,1.0,1.0,1.0,1.0,0.5,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_000956.2,1.0,1.0,1.0,1.0,,,1.0,0.708,0.958,0.917,1.0,1.0
+NP_001073279.2,0.951,0.989,0.968,0.972,,,,0.995,0.987,0.999,0.997,0.964
+NP_003350.1,0.0,0.0,0.0,0.667,0.333,,,0.333,0.667,1.0,0.667,0.333
+NP_001103279.2,1.0,1.0,0.714,0.714,,,1.0,1.0,0.857,1.0,0.857,1.0
+NP_066552.2,1.0,0.667,1.0,1.0,0.833,,0.667,0.833,1.0,1.0,1.0,1.0
+NP_001006658.1,0.5,0.718,0.609,0.538,0.577,0.3,0.603,0.744,0.603,0.577,0.679,0.564
+NP_057485.2,1.0,,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001005463.1,0.661,0.232,0.607,0.804,0.981,0.0,0.893,0.652,0.786,0.982,0.839,0.821
+NP_002421.3,0.917,0.333,0.833,0.667,0.333,1.0,0.667,0.667,0.5,0.833,0.833,0.833
+NP_001010874.2,1.0,1.0,1.0,0.75,,1.0,1.0,0.75,0.5,1.0,1.0,1.0
+NP_061971.3,0.758,0.833,0.769,0.716,0.742,0.861,0.712,0.592,0.864,0.788,0.833,0.826
+NP_077027.1,1.0,1.0,1.0,1.0,,,0.0,1.0,1.0,1.0,1.0,1.0
+NP_060921.3,0.846,0.731,,,0.923,0.5,0.846,0.692,0.654,1.0,0.769,0.731
+NP_003005.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060700.2,0.769,0.897,0.846,0.923,0.782,0.5,0.718,0.885,1.0,0.897,1.0,1.0
+NP_068758.3,0.833,1.0,,,,,0.333,0.833,1.0,0.5,0.833,0.833
+NP_000121.2,1.0,0.994,0.997,1.0,1.0,0.978,1.0,0.938,1.0,1.0,0.983,0.983
+NP_059129.3,,0.941,,,0.912,1.0,,0.912,1.0,0.941,1.0,1.0
+NP_001156908.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.846,1.0,1.0,0.923,1.0
+NP_065109.1,0.888,0.95,0.65,0.975,1.0,1.0,0.975,0.889,1.0,1.0,1.0,1.0
+NP_569735.1,1.0,0.333,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004378.1,0.981,0.944,0.981,0.981,0.926,1.0,0.815,0.907,1.0,0.981,0.981,0.963
+NP_009186.1,1.0,0.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065390.1,0.958,1.0,1.0,0.5,0.417,0.5,0.917,,0.75,0.833,0.75,0.667
+NP_031400.2,0.5,1.0,0.625,1.0,1.0,,1.0,0.5,1.0,0.75,0.5,1.0
+NP_852107.1,0.981,0.981,0.861,0.962,0.894,0.897,,0.894,0.913,0.99,0.971,0.942
+NP_000117.1,0.833,0.833,0.833,0.833,1.0,1.0,0.5,0.833,1.0,0.833,1.0,1.0
+NP_004199.1,0.973,0.912,0.766,0.938,0.934,0.933,0.812,0.893,0.951,0.967,0.962,0.956
+NP_733764.1,0.929,0.857,0.571,0.714,,,0.857,0.857,1.0,1.0,1.0,1.0
+NP_006068.2,1.0,1.0,1.0,1.0,,,0.5,1.0,1.0,1.0,1.0,1.0
+NP_149045.3,0.905,0.914,0.784,0.961,,0.854,0.871,,0.94,0.897,0.94,0.836
+NP_001243439.1,0.5,1.0,1.0,1.0,0.5,,,1.0,0.5,1.0,1.0,1.0
+NP_001340741.2,,,,,,,,,1.0,1.0,1.0,1.0
+NP_005576.3,0.981,0.596,0.894,0.962,0.962,0.812,0.923,0.923,1.0,1.0,0.981,0.942
+NP_004046.2,0.606,0.833,0.674,0.712,0.727,,,0.379,0.894,0.697,0.818,0.955
+NP_000079.2,0.946,0.973,0.953,0.938,,0.984,,0.772,0.978,0.853,0.967,0.976
+NP_055805.1,1.0,0.75,1.0,1.0,1.0,,1.0,0.875,0.75,1.0,0.75,1.0
+NP_000265.1,1.0,1.0,1.0,1.0,1.0,0.909,0.75,1.0,1.0,1.0,1.0,1.0
+NP_619520.1,1.0,0.9,0.862,0.9,0.975,0.917,,0.813,0.975,0.975,0.975,0.95
+NP_001167621.1,0.958,1.0,1.0,1.0,0.896,1.0,,0.917,1.0,1.0,0.958,1.0
+NP_068657.1,0.889,0.778,0.867,0.844,0.744,0.625,0.622,0.4,0.8,0.756,0.556,0.889
+NP_569733.2,0.733,1.0,0.733,0.867,0.733,1.0,,0.6,0.867,0.8,0.867,0.867
+NP_001026895.2,1.0,0.692,1.0,0.923,1.0,,0.615,0.846,1.0,1.0,1.0,1.0
+NP_001171481.1,0.75,0.75,0.714,0.643,0.8,1.0,0.75,0.792,0.75,0.714,0.821,0.857
+NP_001231118.1,0.47,0.722,0.481,0.611,,,0.741,0.761,0.63,0.741,0.778,0.667
+NP_001359162.1,,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_849164.2,1.0,0.926,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_006256.1,1.0,0.905,0.952,0.952,1.0,1.0,0.667,0.786,1.0,1.0,1.0,1.0
+NP_036255.2,0.883,0.844,0.875,0.844,1.0,1.0,0.844,0.75,0.859,0.875,0.875,0.859
+NP_001006659.1,0.875,0.775,0.9,0.9,0.2,0.667,0.9,,0.9,0.9,0.9,0.875
+NP_000780.1,0.433,0.933,0.464,0.429,,0.5,0.6,0.783,0.9,0.767,0.867,0.9
+NP_003276.3,0.929,0.929,0.857,0.929,1.0,1.0,,0.804,1.0,0.964,1.0,0.929
+NP_680475.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0
+NP_116277.2,0.979,0.985,0.982,0.965,0.994,1.0,0.997,,1.0,0.991,1.0,1.0
+NP_004295.2,0.896,0.943,0.879,0.8,0.625,0.767,0.857,0.815,0.736,0.936,0.921,0.929
+NP_002151.2,0.795,0.536,0.366,0.491,0.708,0.4,0.179,0.393,0.125,0.571,0.429,0.518
+NP_001156685.1,0.848,0.746,0.875,0.852,0.879,0.984,0.801,0.795,0.912,0.902,0.926,0.926
+NP_000389.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006756.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.0,1.0,1.0,1.0,1.0
+NP_002327.2,0.956,0.618,0.971,0.941,0.941,1.0,,0.824,0.971,0.971,0.971,1.0
+NP_000519.2,0.969,0.944,0.953,0.978,0.947,0.896,0.829,0.695,0.929,0.972,0.978,0.984
+NP_001025023.1,0.125,1.0,0.25,0.125,0.2,,0.125,0.125,0.0,0.0,0.0,0.125
+NP_001001547.1,1.0,1.0,1.0,1.0,1.0,,,0.75,1.0,1.0,1.0,1.0
+NP_008846.2,1.0,1.0,0.992,1.0,1.0,0.974,0.97,0.909,1.0,1.0,1.0,1.0
+NP_006487.1,1.0,0.833,0.75,0.833,1.0,,0.5,0.5,1.0,1.0,1.0,1.0
+NP_002060.4,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_066268.1,0.944,0.919,0.949,0.927,0.953,0.953,,0.614,0.983,0.947,0.961,0.95
+NP_001025054.1,1.0,,0.957,1.0,,1.0,0.826,1.0,0.826,1.0,0.826,1.0
+NP_003671.1,0.9,0.95,0.95,1.0,0.912,1.0,1.0,0.812,1.0,0.975,0.95,0.95
+NP_001354752.1,,,,,,,,,0.9,1.0,1.0,1.0
+NP_005076.3,1.0,1.0,0.933,0.933,1.0,,1.0,1.0,0.867,1.0,1.0,1.0
+NP_001142.2,1.0,1.0,1.0,0.917,1.0,1.0,0.667,0.792,1.0,1.0,1.0,1.0
+NP_001278585.2,0.984,1.0,1.0,1.0,,1.0,1.0,0.812,1.0,1.0,0.969,1.0
+NP_689509.1,,0.926,,,0.953,0.989,,0.885,0.879,0.981,0.98,0.991
+NP_056980.2,0.87,1.0,0.852,0.926,1.0,,0.333,0.907,0.519,0.926,0.815,1.0
+NP_071881.1,0.9,1.0,0.95,0.95,0.95,1.0,,0.55,1.0,0.95,0.95,1.0
+NP_001001344.1,0.847,0.847,0.986,0.875,0.819,0.833,,0.715,0.764,0.889,0.875,0.944
+NP_001001331.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.909,1.0,1.0,1.0,1.0
+NP_998764.1,0.954,0.967,0.967,0.987,0.928,0.867,0.935,0.915,0.993,0.961,0.987,1.0
+NP_036211.2,1.0,0.833,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000248.2,0.819,0.752,0.812,0.828,0.692,0.786,0.778,,0.865,0.814,0.816,0.873
+NP_055085.1,0.938,0.846,,,0.938,1.0,,0.862,0.938,0.972,0.969,0.908
+NP_054722.2,0.917,1.0,0.833,0.833,1.0,1.0,1.0,0.25,0.583,1.0,1.0,1.0
+NP_000188.1,0.944,0.833,0.972,0.972,1.0,,0.833,0.972,0.944,0.972,0.972,0.944
+NP_000156.1,,1.0,,,0.957,,,1.0,0.826,1.0,1.0,1.0
+NP_005258.2,0.889,0.933,0.955,0.978,0.88,1.0,0.934,0.873,0.936,0.946,0.942,0.974
+NP_068773.2,1.0,1.0,1.0,1.0,0.98,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006822.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_071445.1,0.82,0.838,0.85,0.877,0.86,0.867,0.739,0.841,0.826,0.841,0.903,0.886
+NP_775822.3,0.643,0.738,0.774,0.714,0.714,0.667,0.643,0.583,0.667,0.667,0.881,0.833
+NP_006195.3,0.796,0.653,0.694,0.735,0.776,0.929,0.816,0.653,0.837,0.735,0.755,0.776
+NP_077015.2,0.856,0.956,,,0.655,0.467,,0.867,0.944,0.967,0.878,0.933
+NP_076927.1,,0.953,,,0.943,1.0,0.899,0.862,1.0,0.974,0.99,1.0
+NP_031374.2,1.0,1.0,0.95,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0
+NP_998818.1,1.0,,1.0,1.0,,,0.714,,1.0,1.0,1.0,1.0
+NP_859065.2,0.643,0.786,0.75,0.714,0.857,,,0.714,0.857,0.929,0.857,0.643
+NP_036348.2,1.0,1.0,1.0,1.0,1.0,0.0,0.0,1.0,1.0,1.0,1.0,1.0
+NP_056953.2,0.333,1.0,0.833,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_009185.2,1.0,1.0,0.625,1.0,1.0,1.0,1.0,0.45,0.9,0.95,1.0,1.0
+NP_000431.2,0.929,0.714,0.979,0.875,0.875,1.0,0.804,0.482,0.839,0.768,0.929,0.768
+NP_055824.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006297.2,0.926,0.815,0.926,0.946,0.963,0.768,0.819,0.602,0.965,0.955,0.942,0.956
+NP_079085.2,1.0,1.0,0.95,1.0,0.95,1.0,1.0,1.0,1.0,1.0,1.0,0.95
+NP_000960.2,,0.667,,,1.0,1.0,,0.833,1.0,1.0,1.0,1.0
+NP_000341.2,0.922,0.941,0.938,0.94,0.932,0.899,0.919,0.819,0.909,0.972,0.968,0.97
+NP_644813.1,0.917,0.833,0.917,0.917,0.833,,0.833,0.917,1.0,1.0,1.0,0.917
+NP_001035897.1,,0.917,,,1.0,,1.0,0.729,1.0,1.0,1.0,1.0
+NP_078853.2,0.877,0.875,0.953,0.978,0.982,0.983,0.906,0.96,0.942,0.991,0.969,0.996
+NP_891988.1,1.0,1.0,1.0,1.0,,,0.905,0.714,0.952,0.81,1.0,1.0
+NP_002241.1,,0.75,,,0.812,1.0,,0.65,0.85,0.95,1.0,1.0
+NP_001186846.1,0.905,0.952,0.857,1.0,,,0.905,0.833,1.0,1.0,1.0,0.952
+NP_001161206.1,1.0,1.0,1.0,1.0,,,0.981,0.926,0.981,1.0,1.0,1.0
+NP_038475.2,0.535,0.093,0.442,0.326,,0.364,0.07,,0.721,0.512,0.372,0.535
+NP_000175.1,,,,,,0.8,0.8,0.667,0.5,0.8,0.9,0.8
+NP_002538.1,0.851,0.833,0.845,0.869,0.958,0.75,0.798,0.845,0.881,0.917,0.881,0.845
+NP_000633.2,0.708,0.875,0.712,0.667,0.75,0.762,0.742,0.746,0.8,0.708,0.742,0.792
+NP_110404.1,0.974,1.0,0.929,0.962,1.0,1.0,0.936,0.897,1.0,1.0,1.0,1.0
+NP_000879.2,0.714,0.429,0.929,1.0,1.0,1.0,,0.643,0.714,1.0,1.0,1.0
+NP_000603.1,0.875,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_079428.2,0.75,0.75,1.0,1.0,,0.5,0.75,0.5,0.75,0.5,0.5,1.0
+NP_001949.1,0.596,0.895,0.567,0.753,0.76,0.718,0.498,0.385,0.615,0.664,0.724,0.604
+NP_066269.1,0.833,0.667,1.0,1.0,1.0,1.0,,1.0,1.0,0.833,0.833,1.0
+NP_008822.2,0.729,0.375,0.75,0.625,0.625,0.667,,0.396,0.75,0.792,0.75,0.708
+NP_077006.1,1.0,0.882,0.765,0.824,,,1.0,1.0,1.0,1.0,0.941,1.0
+NP_008825.4,0.871,0.864,0.886,0.894,0.939,,0.909,0.909,0.924,1.0,0.985,0.955
+NP_116045.2,0.923,0.971,0.933,0.99,1.0,1.0,1.0,0.88,0.99,1.0,1.0,1.0
+NP_004161.4,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000113.1,0.667,0.5,0.667,0.333,0.75,1.0,0.5,0.75,0.5,0.5,0.667,0.667
+NP_569157.2,1.0,1.0,1.0,0.5,,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_009220.2,0.979,0.958,0.969,0.938,1.0,1.0,,0.833,1.0,1.0,1.0,0.979
+NP_060844.2,0.857,1.0,0.714,1.0,1.0,1.0,,0.571,1.0,1.0,1.0,1.0
+NP_891552.1,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003237.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_940927.2,0.875,0.7,0.85,0.9,0.85,0.875,0.9,0.929,1.0,0.95,0.95,1.0
+NP_036596.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005238.1,0.5,1.0,0.75,1.0,1.0,1.0,0.75,0.75,1.0,1.0,1.0,1.0
+NP_001357188.2,0.985,0.982,0.986,0.978,,1.0,0.63,0.951,0.979,0.992,0.988,0.99
+NP_004510.1,0.995,0.931,0.984,0.983,0.956,0.932,0.949,0.871,0.971,0.975,0.983,0.941
+NP_816931.1,0.875,0.75,0.875,0.875,1.0,0.857,,0.375,0.875,1.0,1.0,0.875
+NP_073563.1,0.917,1.0,1.0,1.0,1.0,,1.0,1.0,0.917,1.0,1.0,0.833
+NP_000524.3,1.0,0.84,,,0.737,,1.0,0.92,0.8,0.64,0.92,0.72
+NP_005621.2,0.988,0.917,0.982,1.0,1.0,1.0,0.94,0.962,1.0,1.0,1.0,1.0
+NP_003792.1,0.956,0.711,0.922,1.0,1.0,0.6,0.744,0.511,0.933,0.956,1.0,1.0
+NP_004302.1,1.0,1.0,1.0,1.0,1.0,,0.0,0.5,0.0,,1.0,1.0
+NP_001784.2,1.0,0.889,0.944,1.0,0.944,1.0,1.0,0.833,0.944,1.0,0.944,1.0
+NP_000382.3,0.917,0.963,0.951,0.941,0.958,0.932,0.936,0.952,0.924,0.929,0.949,0.966
+NP_079491.2,1.0,0.958,1.0,1.0,1.0,1.0,0.917,1.0,1.0,1.0,1.0,1.0
+NP_665860.2,1.0,1.0,0.75,0.8,0.4,1.0,0.8,1.0,0.9,1.0,0.8,0.8
+NP_005324.3,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001056.1,0.986,0.822,0.986,0.944,0.951,0.991,0.738,0.752,0.965,0.927,0.951,0.906
+NP_005421.1,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004963.1,0.6,0.917,0.6,0.8,0.8,0.667,0.583,0.792,1.0,0.958,0.958,0.958
+NP_005848.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_073623.1,0.75,1.0,1.0,1.0,0.75,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_848621.2,1.0,1.0,1.0,1.0,1.0,1.0,0.5,0.75,1.0,1.0,1.0,1.0
+NP_001288153.1,,,,,,,,,1.0,1.0,1.0,1.0
+NP_000594.2,0.591,0.682,0.591,0.545,0.65,0.5,0.591,0.409,0.727,0.545,0.545,0.5
+NP_919438.1,,0.714,,,,1.0,0.821,0.607,0.571,0.821,0.821,0.786
+NP_064450.3,1.0,1.0,0.767,0.933,0.6,0.0,0.933,0.909,1.0,1.0,0.933,1.0
+NP_004399.2,0.809,0.769,0.83,0.827,0.85,0.865,0.828,0.716,0.86,0.889,0.899,0.884
+NP_004936.2,0.609,0.808,1.0,0.5,0.591,0.667,0.82,0.729,0.844,0.774,0.755,0.793
+NP_001136438.1,0.673,0.587,0.693,0.72,0.613,0.72,0.507,0.812,0.413,0.707,0.707,0.693
+NP_000400.2,0.843,0.706,0.755,0.745,0.867,0.941,,0.431,0.882,1.0,0.98,0.882
+NP_000388.2,0.935,0.901,0.941,0.954,0.949,0.963,0.876,0.803,0.978,0.966,0.966,0.972
+NP_057062.1,0.783,,0.859,0.837,0.761,0.6,0.739,0.804,0.783,0.739,0.826,0.88
+NP_006334.2,0.965,0.96,0.965,0.929,0.949,0.97,,0.773,0.995,0.96,0.975,0.97
+NP_003914.1,0.875,0.667,0.885,0.833,0.896,0.833,0.938,0.781,0.833,0.771,0.833,0.917
+NP_002195.1,0.917,0.889,0.875,0.861,0.944,1.0,0.889,1.0,0.861,0.944,0.889,0.944
+NP_055619.2,0.739,1.0,1.0,0.957,,0.75,0.783,1.0,1.0,0.87,0.826,1.0
+NP_005202.2,0.949,0.975,0.939,0.949,0.86,0.979,0.988,0.922,0.992,0.988,0.992,0.973
+NP_061116.5,,,,,,,1.0,,1.0,1.0,1.0,1.0
+NP_000220.1,1.0,0.938,1.0,0.988,1.0,1.0,0.901,0.914,0.877,0.963,0.951,0.975
+NP_000401.1,1.0,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0
+NP_000037.2,0.872,0.845,0.913,0.971,0.853,0.922,0.741,0.575,0.827,0.887,0.893,0.911
+NP_110420.3,0.643,0.714,0.429,0.857,0.556,1.0,0.5,0.714,0.286,0.643,0.786,0.857
+NP_001290039.1,0.948,0.905,0.99,0.981,1.0,1.0,0.852,0.995,1.0,1.0,1.0,0.995
+NP_000055.2,0.714,0.536,0.75,0.857,0.929,0.75,0.911,0.357,0.75,1.0,0.929,0.893
+NP_000518.1,0.967,0.914,0.977,0.958,0.96,0.858,0.905,0.91,0.945,0.975,0.974,0.974
+NP_004351.1,0.581,0.626,0.674,0.739,0.624,0.605,0.712,0.678,0.587,0.731,0.715,0.771
+NP_005046.2,0.519,0.885,0.702,0.519,0.394,1.0,0.635,0.673,0.904,0.904,0.712,0.865
+NP_056280.2,0.781,0.875,0.781,0.938,0.844,0.75,0.875,0.75,0.938,0.812,0.969,0.938
+NP_001106212.1,0.9,0.925,0.9,0.95,0.905,1.0,0.875,0.952,0.975,0.975,0.975,0.975
+NP_000110.2,,0.875,,,,,,0.75,1.0,1.0,1.0,1.0
+NP_000092.2,0.95,0.6,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056089.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003352.2,0.904,0.875,0.906,0.899,0.965,1.0,0.642,0.873,0.912,0.953,0.99,0.971
+NP_009297.2,0.935,0.932,0.824,0.941,,,0.835,0.808,0.93,0.954,0.95,0.961
+NP_055177.2,0.963,0.889,0.963,1.0,1.0,0.611,0.963,0.722,0.963,1.0,1.0,1.0
+NP_004447.2,0.964,0.952,0.956,0.987,0.864,0.917,0.798,0.834,0.947,0.987,0.984,0.976
+NP_859056.2,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001837.2,0.933,0.925,0.943,0.943,0.988,0.917,0.733,0.866,0.988,0.973,0.971,0.997
+NP_000693.1,0.975,0.916,0.945,0.943,0.973,0.989,0.925,0.848,0.977,0.988,0.995,0.988
+NP_003233.4,0.933,0.993,0.882,0.934,0.86,0.912,0.765,0.985,0.939,0.983,0.98,0.953
+NP_002388.2,0.872,0.903,0.793,0.832,0.846,0.89,,0.659,0.856,0.906,0.909,0.89
+NP_060631.2,1.0,0.875,1.0,1.0,1.0,1.0,0.75,0.75,0.75,1.0,1.0,1.0
+NP_001028.1,0.75,0.5,0.75,1.0,1.0,1.0,,0.75,0.75,1.0,0.75,0.75
+NP_660149.2,0.5,1.0,0.5,1.0,0.5,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000495.1,0.867,1.0,1.0,0.9,1.0,1.0,1.0,0.833,1.0,0.967,0.933,1.0
+NP_000393.4,,,,,,,,,0.929,0.94,0.918,0.893
+NP_001243506.2,1.0,,1.0,1.0,,1.0,,,1.0,0.625,0.875,0.875
+NP_060225.4,1.0,0.6,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036412.1,0.81,1.0,0.833,0.905,0.952,0.667,0.619,0.857,0.714,0.857,0.857,0.81
+NP_000300.1,0.964,1.0,1.0,1.0,0.958,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000824.1,0.883,0.957,0.888,0.954,0.897,0.801,,0.793,0.978,0.938,0.97,0.941
+NP_079492.2,0.875,0.678,0.894,0.994,0.967,0.933,0.856,0.812,0.944,0.956,0.994,0.956
+NP_057287.2,0.971,0.947,0.815,1.0,0.957,0.988,1.0,0.959,1.0,1.0,1.0,1.0
+NP_055830.1,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005263.1,1.0,1.0,0.875,1.0,1.0,1.0,0.625,1.0,1.0,1.0,0.875,1.0
+NP_006536.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_061928.4,0.561,0.591,0.561,0.53,0.447,0.333,0.545,0.697,0.591,0.621,0.576,0.621
+NP_000398.1,0.8,,1.0,1.0,1.0,,0.1,0.9,1.0,1.0,1.0,0.9
+NP_689966.2,0.889,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004989.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_005114.1,0.0,0.5,0.5,0.5,,,1.0,0.5,1.0,1.0,1.0,0.5
+NP_036595.2,0.917,0.833,1.0,1.0,1.0,1.0,0.833,0.833,0.833,1.0,1.0,1.0
+NP_006010.2,0.9,0.956,0.867,0.867,0.962,1.0,0.933,0.893,1.0,0.933,0.933,0.956
+NP_938033.1,1.0,0.0,1.0,1.0,1.0,,,0.5,1.0,1.0,1.0,1.0
+NP_057264.4,0.833,0.9,0.889,0.856,0.867,0.917,0.933,0.7,0.967,0.933,0.933,0.989
+NP_000405.1,0.956,0.81,0.951,0.907,0.912,0.821,,0.752,0.972,0.944,0.991,0.954
+NP_149353.1,0.4,0.2,0.6,0.4,,,0.0,0.4,0.4,0.8,0.8,0.8
+NP_338599.1,,0.944,,,,,0.889,0.667,0.833,0.833,1.0,0.778
+NP_443091.1,0.804,1.0,0.875,0.964,0.964,1.0,0.786,0.786,0.893,1.0,1.0,1.0
+NP_005609.3,0.972,1.0,0.972,0.944,0.944,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_003667.1,0.958,0.792,0.917,0.875,0.917,1.0,,0.917,1.0,0.917,0.917,0.917
+NP_001319.1,1.0,0.4,0.6,1.0,0.8,1.0,1.0,0.8,1.0,1.0,1.0,0.8
+NP_001255.4,0.333,0.476,0.952,0.952,,0.0,1.0,0.842,0.333,1.0,0.524,1.0
+NP_000882.1,1.0,0.69,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_653200.2,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0
+NP_000307.1,1.0,1.0,1.0,1.0,1.0,,,,1.0,1.0,0.95,1.0
+NP_694972.3,0.771,0.714,0.814,0.886,0.829,0.5,0.743,0.6,0.714,0.743,0.714,0.886
+NP_056201.2,0.833,0.944,0.833,0.778,,,0.917,0.722,0.861,0.889,0.833,0.806
+NP_006249.1,,0.5,,,,,1.0,0.0,1.0,1.0,1.0,1.0
+NP_001138780.1,0.667,1.0,1.0,1.0,1.0,,1.0,,0.333,0.667,0.667,1.0
+NP_065995.1,1.0,1.0,1.0,1.0,0.889,1.0,0.972,0.917,1.0,1.0,0.972,1.0
+NP_004654.1,1.0,1.0,1.0,1.0,0.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0
+NP_004209.2,1.0,1.0,1.0,1.0,1.0,1.0,0.625,0.688,1.0,1.0,1.0,1.0
+NP_003374.3,1.0,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,0.875,1.0,1.0
+NP_006197.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0,1.0
+NP_000095.2,0.948,0.852,,,0.981,,,0.991,1.0,0.992,1.0,0.968
+NP_004416.2,0.938,1.0,0.938,0.938,0.875,,0.688,1.0,0.812,1.0,0.938,1.0
+NP_003916.1,1.0,1.0,0.875,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_777596.2,0.848,0.848,0.845,0.865,0.851,0.667,0.865,0.789,0.637,0.865,0.883,0.895
+NP_872621.1,1.0,1.0,1.0,0.75,,,0.5,0.75,1.0,1.0,1.0,0.75
+NP_001354906.1,,,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060546.2,0.979,0.958,1.0,0.917,1.0,0.857,0.917,0.708,1.0,1.0,1.0,1.0
+NP_001003694.1,0.717,0.683,0.583,0.733,1.0,1.0,0.75,0.733,0.667,0.833,0.783,0.8
+NP_112205.2,1.0,0.0,1.0,0.25,0.5,1.0,0.75,1.0,1.0,0.75,1.0,0.75
+NP_000241.1,1.0,1.0,1.0,1.0,0.833,,1.0,0.833,1.0,1.0,0.833,1.0
+NP_000114.3,0.84,,0.865,0.84,0.64,0.9,0.83,0.925,0.74,0.88,0.86,0.85
+NP_000143.2,0.861,0.858,0.912,0.912,0.935,0.989,0.926,0.843,0.95,0.967,0.982,0.978
+NP_536350.2,0.684,0.974,0.875,0.993,0.875,1.0,1.0,0.978,0.995,0.879,1.0,0.984
+NP_079483.3,1.0,0.889,1.0,1.0,1.0,1.0,1.0,0.944,0.778,1.0,1.0,0.889
+NP_036604.2,1.0,0.833,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055775.2,1.0,1.0,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000189.1,0.767,0.933,0.867,0.8,0.933,,0.7,0.917,0.9,0.767,0.767,0.667
+NP_001390.1,0.753,0.829,0.483,0.752,0.818,0.96,0.734,0.872,0.889,0.923,0.952,0.934
+NP_060525.3,0.938,1.0,1.0,1.0,0.688,1.0,1.0,0.938,0.938,1.0,0.938,0.875
+NP_542378.1,1.0,0.286,0.911,1.0,0.8,1.0,0.964,0.812,1.0,1.0,1.0,1.0
+NP_000928.1,,,,,,,,,1.0,0.938,1.0,0.938
+NP_005190.4,0.708,0.75,0.792,0.667,0.667,1.0,0.833,0.583,0.917,0.75,0.667,0.75
+NP_004454.2,0.981,0.935,0.889,0.991,0.981,0.815,0.981,0.972,0.954,1.0,1.0,1.0
+NP_001077431.1,0.858,0.882,0.828,0.891,0.837,,,0.708,0.884,0.919,0.902,0.881
+NP_055884.2,0.429,0.857,0.714,0.857,1.0,0.0,,0.714,0.714,0.857,0.857,0.714
+NP_000266.2,0.938,0.92,0.955,0.954,0.948,0.882,0.919,0.926,0.959,0.96,0.974,0.968
+NP_056477.1,0.952,0.952,0.952,0.893,0.929,0.619,0.81,0.786,0.357,0.94,0.786,0.893
+NP_116195.2,0.857,1.0,0.857,1.0,1.0,,,0.857,1.0,1.0,1.0,1.0
+NP_058648.4,0.823,0.839,0.855,0.823,0.733,0.4,0.855,0.694,0.5,0.887,0.887,0.887
+NP_848547.4,1.0,0.769,0.808,0.769,0.923,1.0,1.0,0.958,0.962,1.0,1.0,0.962
+NP_002418.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_598408.1,1.0,1.0,0.867,1.0,1.0,1.0,,0.708,1.0,1.0,1.0,1.0
+NP_620128.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_620596.2,0.877,0.851,0.906,0.874,0.956,1.0,0.871,,0.887,0.903,0.936,0.91
+NP_001073893.1,0.642,0.888,0.578,0.875,0.923,0.875,0.78,0.795,0.897,0.802,0.819,0.741
+NP_009075.1,1.0,0.846,,,1.0,1.0,,0.885,0.872,0.821,0.872,0.974
+NP_068556.2,1.0,0.9,,,,,1.0,0.889,1.0,1.0,1.0,1.0
+NP_000046.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,0.929,1.0,1.0
+NP_003212.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.25,1.0,1.0,1.0,1.0
+NP_001387.2,,0.932,,,,0.939,0.936,0.657,0.957,0.994,0.99,0.989
+NP_115940.2,0.5,0.625,0.5,0.5,0.5,1.0,0.5,0.5,0.5,0.75,0.625,0.625
+NP_000327.2,0.909,0.964,0.836,0.891,0.873,0.75,0.927,0.982,0.727,0.909,0.836,0.782
+NP_000851.2,0.875,0.75,0.875,0.75,0.667,1.0,0.75,1.0,1.0,1.0,0.875,0.875
+NP_000221.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003229.1,0.904,0.923,0.913,0.904,0.846,0.769,0.769,0.75,0.981,0.942,0.885,0.865
+NP_065682.2,0.967,0.933,0.933,1.0,,,,0.933,1.0,1.0,1.0,1.0
+NP_001036214.1,0.981,0.846,1.0,0.981,1.0,1.0,1.0,0.808,0.885,0.981,0.962,0.981
+NP_000179.2,0.812,1.0,0.75,0.875,1.0,,1.0,1.0,0.938,1.0,1.0,0.875
+NP_066973.2,1.0,1.0,1.0,0.857,,0.0,1.0,0.786,1.0,0.857,1.0,1.0
+NP_002294.2,0.833,0.5,0.444,0.611,0.722,0.667,0.556,0.278,0.667,0.778,0.722,0.833
+NP_001092272.1,0.889,0.333,0.944,1.0,0.8,,0.333,0.444,1.0,1.0,1.0,0.889
+NP_000102.1,0.976,0.882,0.976,0.976,0.953,0.875,1.0,0.859,1.0,1.0,1.0,1.0
+NP_000065.1,0.881,0.979,0.875,0.983,0.992,0.993,0.783,0.938,1.0,0.962,0.992,0.938
+NP_000119.1,0.757,0.743,0.761,0.743,0.811,0.773,0.879,0.829,0.886,0.8,0.829,0.871
+NP_060119.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,0.933,1.0,1.0
+NP_001298127.1,1.0,0.5,1.0,1.0,,,0.6,0.5,1.0,1.0,1.0,1.0
+NP_000799.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001035197.1,1.0,0.951,0.992,0.902,0.984,1.0,1.0,0.975,1.0,0.918,1.0,0.902
+NP_009008.2,0.667,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612431.2,0.833,1.0,,,,,0.833,0.667,1.0,1.0,1.0,1.0
+NP_000883.2,1.0,1.0,1.0,1.0,,,,0.818,1.0,1.0,1.0,1.0
+NP_006173.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000244.2,0.971,0.971,1.0,1.0,0.912,,,0.971,1.0,1.0,1.0,1.0
+NP_000676.1,1.0,1.0,0.667,1.0,1.0,,,0.833,1.0,1.0,1.0,1.0
+NP_000210.2,1.0,1.0,1.0,1.0,1.0,0.833,,1.0,1.0,1.0,1.0,1.0
+NP_001123482.1,1.0,,0.917,0.833,0.667,,,,0.583,0.75,0.75,1.0
+NP_057331.2,1.0,1.0,0.75,1.0,1.0,,1.0,0.9,1.0,0.9,1.0,1.0
+NP_001366543.1,0.5,1.0,0.5,0.5,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000306.1,1.0,0.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_036276.1,0.881,0.94,0.833,0.848,0.841,0.95,0.833,0.833,0.905,0.893,0.929,0.881
+NP_001193.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.917,1.0,1.0,1.0,1.0
+NP_056968.1,0.967,0.756,1.0,0.933,1.0,0.7,1.0,1.0,1.0,1.0,1.0,0.933
+NP_060345.2,0.856,0.844,0.911,0.956,0.933,,,0.767,0.889,0.933,0.956,0.911
+NP_000299.3,,1.0,,,1.0,,,1.0,1.0,,1.0,1.0
+NP_001835.3,0.941,0.984,0.952,0.936,0.977,0.984,0.896,0.785,0.977,0.979,0.976,0.983
+NP_004928.2,0.929,0.991,0.934,0.929,0.974,1.0,0.927,0.962,0.987,0.974,0.983,0.987
+NP_060017.1,0.975,0.925,0.962,0.825,1.0,1.0,0.975,0.9,0.875,0.95,0.95,0.975
+NP_254275.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001003811.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004125.3,,0.5,,,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_004713.2,0.667,1.0,0.75,0.75,0.889,,,1.0,0.667,0.917,1.0,0.917
+NP_001366380.1,0.889,0.667,0.906,1.0,,0.167,,0.778,0.556,0.944,0.889,0.722
+NP_002612.1,0.778,0.889,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_057445.4,1.0,1.0,,,1.0,,0.833,0.833,1.0,1.0,0.833,0.833
+NP_005853.2,0.853,0.844,0.816,0.862,0.856,0.756,0.95,0.803,0.85,0.881,0.831,0.888
+NP_570718.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004553.2,0.888,0.763,0.82,0.876,,0.611,0.849,0.74,0.929,0.888,0.899,0.893
+NP_000030.1,1.0,0.75,1.0,1.0,1.0,0.667,,0.417,1.0,1.0,1.0,1.0
+NP_659505.1,0.997,0.988,0.95,1.0,0.952,0.778,0.887,0.812,0.97,0.994,0.988,0.994
+NP_835466.2,0.8,0.0,,,,1.0,,0.5,0.9,0.9,0.9,0.9
+NP_037466.3,0.941,0.985,0.662,0.912,0.933,0.909,1.0,0.985,0.985,0.912,0.985,0.971
+NP_000230.1,1.0,1.0,1.0,1.0,1.0,1.0,0.8,0.9,1.0,1.0,1.0,1.0
+NP_001120972.1,0.886,0.982,0.939,0.693,,,0.737,0.947,0.833,0.93,0.956,0.877
+NP_055086.1,0.667,0.778,0.889,0.889,0.556,1.0,0.556,0.556,1.0,1.0,1.0,1.0
+NP_919259.3,,1.0,,,1.0,1.0,,0.929,1.0,1.0,1.0,1.0
+NP_065094.3,0.5,0.25,0.0,0.75,0.5,,0.75,0.75,0.5,0.5,0.5,0.5
+NP_000186.2,0.761,0.956,0.889,0.9,0.7,0.861,,0.783,0.8,0.833,0.889,0.922
+NP_113621.1,1.0,1.0,,,0.9,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_085135.1,0.715,0.72,0.769,0.763,0.344,0.528,0.839,,0.774,0.505,0.495,0.892
+NP_003755.2,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0,1.0
+NP_443106.1,0.667,1.0,0.792,0.5,0.5,0.25,0.167,0.667,0.667,0.75,0.833,0.833
+NP_002411.3,0.911,0.86,0.922,0.919,0.957,0.963,0.973,0.78,0.839,0.909,0.909,0.892
+NP_036442.3,1.0,0.75,1.0,0.75,0.75,0.5,0.75,1.0,0.5,0.75,0.75,0.5
+NP_005059.2,0.667,1.0,0.444,0.444,0.667,0.0,,0.556,0.111,0.333,0.333,0.333
+NP_001191244.1,1.0,,1.0,1.0,,1.0,,,1.0,1.0,1.0,1.0
+NP_060132.3,0.733,0.933,0.725,0.933,0.85,0.875,0.817,0.783,0.9,0.867,0.9,0.817
+NP_006183.2,0.909,1.0,0.909,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_008823.1,0.906,1.0,1.0,1.0,1.0,1.0,0.938,1.0,1.0,1.0,1.0,1.0
+NP_005852.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0
+NP_775300.1,1.0,1.0,1.0,0.917,,,,0.417,1.0,1.0,1.0,1.0
+NP_000520.1,0.92,0.64,0.95,0.74,0.96,1.0,0.8,0.72,0.8,0.7,0.8,0.84
+NP_061896.1,0.917,1.0,0.917,0.917,0.875,0.5,,1.0,0.667,0.917,0.917,0.917
+NP_068775.1,1.0,0.867,0.9,0.667,0.867,1.0,1.0,0.5,1.0,1.0,0.933,1.0
+NP_055023.2,0.988,,,0.806,1.0,0.977,0.812,1.0,0.75,0.894,0.662,0.9
+NP_000087.2,0.96,0.827,0.973,1.0,,0.84,,0.86,0.947,1.0,1.0,0.96
+NP_003150.1,0.797,0.973,0.758,0.967,0.927,0.995,,0.813,0.991,0.984,0.985,0.967
+NP_001138940.4,0.971,0.843,0.971,0.98,,0.727,,0.779,0.971,1.0,0.961,0.98
+NP_001642.1,0.9,1.0,0.9,0.8,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065107.1,1.0,,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_003383.4,0.958,0.7,0.958,0.833,0.833,0.583,,0.896,0.833,0.833,0.833,0.833
+NP_000890.1,,0.833,,,0.667,1.0,,1.0,0.333,0.0,1.0,0.833
+NP_542437.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000104.1,0.7,0.6,0.8,0.8,0.8,1.0,0.8,0.9,0.6,0.8,0.8,0.8
+NP_036332.2,0.7,0.9,0.8,0.9,0.8,,0.5,1.0,0.9,0.8,0.8,0.8
+NP_001317964.1,1.0,0.562,1.0,1.0,,,0.781,0.812,0.625,0.844,0.781,0.938
+NP_001121903.1,1.0,1.0,1.0,1.0,0.5,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_542172.2,,0.5,,,0.75,0.875,,1.0,0.667,1.0,1.0,0.833
+NP_003357.2,0.875,1.0,0.875,1.0,0.875,,,0.688,0.875,1.0,1.0,1.0
+NP_000391.1,0.99,0.902,0.99,0.98,0.99,1.0,1.0,0.828,0.99,1.0,1.0,1.0
+NP_000486.1,0.97,0.972,0.963,0.978,0.979,0.972,0.916,0.908,0.987,0.989,0.992,0.991
+NP_004919.1,0.904,,0.923,0.962,,1.0,1.0,0.808,1.0,1.0,1.0,1.0
+NP_003782.1,1.0,0.857,1.0,1.0,0.857,0.0,1.0,0.786,1.0,1.0,1.0,1.0
+NP_004886.3,0.831,0.762,0.847,0.861,,0.933,0.958,0.794,0.825,0.704,0.881,0.905
+NP_003230.1,1.0,0.95,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0
+NP_001358995.1,0.98,1.0,0.98,0.96,,,,0.68,1.0,0.96,0.96,0.96
+NP_064608.2,1.0,0.833,1.0,0.933,,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_000226.2,1.0,0.986,1.0,1.0,1.0,1.0,0.98,0.97,1.0,1.0,1.0,1.0
+NP_000078.3,,,,,,,,,0.914,0.971,0.943,0.971
+NP_004735.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005226.1,1.0,1.0,1.0,0.875,0.708,0.5,1.0,0.667,0.667,0.75,0.833,0.708
+NP_115920.1,0.5,0.875,0.594,0.688,0.625,0.875,0.75,0.812,0.562,0.875,0.688,0.688
+NP_001076580.1,1.0,0.5,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_000825.2,0.914,0.908,,,0.885,0.779,0.966,0.759,0.962,0.944,0.975,0.966
+NP_001070834.1,0.842,0.904,0.827,0.86,,,0.853,0.779,0.838,0.985,0.963,1.0
+NP_000525.1,0.867,0.844,0.8,0.4,0.4,1.0,0.75,0.859,0.75,0.656,0.75,0.75
+NP_055610.1,0.999,0.965,,,0.817,0.704,0.773,0.726,0.993,0.992,0.97,0.977
+NP_000224.2,1.0,0.989,1.0,0.932,0.989,1.0,0.943,0.915,0.966,1.0,1.0,1.0
+NP_056346.3,0.853,0.941,0.936,0.882,0.941,0.923,0.951,0.681,0.902,0.902,0.902,0.824
+NP_570853.1,,0.933,,,,,,0.609,0.924,0.96,0.969,0.938
+NP_689935.2,1.0,0.833,0.958,0.917,0.917,0.25,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079408.3,0.892,0.715,0.938,0.923,0.892,0.667,0.785,0.638,0.9,0.915,0.985,0.985
+NP_003165.2,0.857,1.0,1.0,0.929,,,1.0,0.893,0.857,0.857,1.0,0.929
+NP_005132.2,1.0,0.8,1.0,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002237.1,0.6,0.6,1.0,1.0,0.8,0.9,1.0,0.85,1.0,0.9,0.9,1.0
+NP_001269463.1,,1.0,,,0.0,0.667,,1.0,1.0,1.0,0.667,1.0
+NP_001138830.1,0.898,0.778,0.963,1.0,0.815,0.933,,0.556,1.0,0.907,0.981,0.889
+NP_079023.2,1.0,0.833,0.958,1.0,1.0,1.0,0.917,1.0,0.833,1.0,1.0,1.0
+NP_001223.2,0.92,0.8,0.92,0.88,0.92,1.0,0.92,0.76,0.84,0.92,0.96,0.88
+NP_612565.1,1.0,0.818,1.0,1.0,0.9,1.0,,0.818,1.0,0.909,1.0,1.0
+NP_066963.2,0.926,0.881,0.935,0.847,0.95,0.833,0.879,0.732,0.921,0.923,0.873,0.869
+NP_660161.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060399.1,1.0,0.5,,,0.833,1.0,0.833,1.0,1.0,1.0,1.0,1.0
+NP_001098548.2,0.8,0.9,,,,,0.7,0.222,0.5,0.9,0.7,0.7
+NP_000866.1,,0.955,,,0.909,0.917,,0.83,0.989,1.0,0.977,0.977
+NP_055067.1,1.0,0.933,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_803187.1,0.976,0.96,0.984,0.994,0.961,0.943,,0.821,0.984,0.949,0.988,0.899
+NP_004704.3,0.667,1.0,1.0,1.0,0.333,0.0,1.0,0.333,0.667,1.0,1.0,1.0
+NP_001317619.1,0.875,0.867,0.917,0.917,0.333,0.667,,0.8,0.75,0.75,0.792,0.75
+NP_060551.2,0.833,0.917,0.917,1.0,,,,1.0,0.917,1.0,0.917,1.0
+NP_004954.2,0.55,0.95,0.55,0.55,0.5,0.5,0.7,0.85,0.75,0.75,0.75,0.8
+NP_006485.2,1.0,0.975,0.975,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.975
+NP_001007027.1,1.0,0.286,1.0,1.0,1.0,1.0,,0.667,1.0,0.857,1.0,0.857
+NP_055197.2,0.2,0.4,0.2,0.4,0.0,0.0,,0.6,0.2,0.6,0.4,0.4
+NP_000578.2,0.958,0.75,0.958,1.0,1.0,1.0,0.833,0.875,0.667,1.0,1.0,1.0
+NP_000366.1,0.875,0.893,0.75,0.893,0.893,1.0,,0.946,0.857,0.893,0.911,0.929
+NP_004762.2,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000032.1,1.0,0.857,0.714,1.0,0.857,,1.0,,0.857,1.0,1.0,1.0
+NP_060545.3,0.786,1.0,0.786,1.0,0.762,,0.524,0.786,0.905,0.905,0.905,0.81
+NP_002932.1,0.881,0.952,0.952,1.0,0.81,1.0,1.0,0.714,1.0,1.0,1.0,1.0
+NP_001356355.1,1.0,1.0,0.882,0.986,,1.0,,0.944,1.0,0.986,1.0,0.986
+NP_001125.1,1.0,1.0,1.0,0.5,0.5,1.0,0.5,0.0,1.0,1.0,1.0,0.5
+NP_671724.1,0.625,1.0,0.75,0.625,0.625,0.5,0.5,0.125,1.0,0.625,1.0,0.75
+NP_000772.2,1.0,0.55,0.925,0.85,0.8,1.0,1.0,0.675,0.95,0.8,0.85,1.0
+NP_004973.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_109587.1,0.624,0.854,0.732,0.869,0.727,0.735,0.606,0.808,0.874,0.689,0.712,0.717
+NP_000263.2,0.938,0.75,0.906,1.0,,,1.0,0.844,0.625,1.0,0.812,1.0
+NP_115763.2,0.833,0.667,0.956,0.711,1.0,0.8,0.844,0.822,0.6,0.6,0.533,0.556
+NP_665898.1,0.0,0.143,0.143,0.0,,,0.0,0.143,0.0,0.286,0.143,0.0
+NP_009034.2,,0.965,,,1.0,1.0,,0.772,0.807,1.0,1.0,1.0
+NP_001041675.1,1.0,1.0,1.0,0.9,0.95,1.0,0.6,0.9,0.9,1.0,0.9,0.8
+NP_005923.3,0.976,0.833,0.976,1.0,0.952,1.0,0.857,0.81,1.0,1.0,0.952,1.0
+NP_000153.1,0.983,0.993,0.983,0.973,0.966,0.993,1.0,0.993,1.0,1.0,1.0,1.0
+NP_055661.3,0.944,0.944,0.944,0.963,,,0.796,0.667,1.0,0.944,0.926,0.944
+NP_005240.3,0.96,0.55,0.856,0.97,0.961,0.987,0.95,0.907,0.971,0.996,0.953,0.99
+NP_001265067.1,0.859,0.842,,,,,0.621,0.658,0.701,0.915,0.898,0.931
+NP_003385.2,0.75,0.5,1.0,0.75,0.75,1.0,0.75,0.25,1.0,1.0,1.0,0.75
+NP_000383.2,,1.0,,,0.906,1.0,,0.921,0.958,0.971,0.979,1.0
+NP_000239.1,0.989,0.933,0.967,0.978,,,0.978,0.889,1.0,1.0,1.0,1.0
+NP_001157814.1,0.927,1.0,0.708,0.988,0.914,1.0,0.754,0.882,0.992,0.992,0.98,0.886
+NP_063938.1,0.85,0.65,0.775,0.85,0.9,1.0,0.85,0.9,0.9,0.8,0.85,0.75
+NP_077288.2,0.958,0.878,0.958,0.959,0.89,0.829,0.91,0.762,0.942,0.938,0.957,0.939
+NP_060336.3,1.0,0.667,0.833,1.0,0.944,1.0,1.0,1.0,1.0,0.889,0.889,0.944
+NP_000522.3,0.862,0.819,0.865,0.859,0.862,0.968,0.869,0.824,0.83,0.882,0.892,0.906
+NP_079066.5,0.625,0.688,0.688,0.625,,,1.0,0.533,0.938,0.875,0.938,1.0
+NP_060111.2,0.994,0.811,0.95,0.944,,,0.989,0.214,0.967,0.911,0.989,1.0
+NP_065689.1,1.0,0.688,0.875,0.875,1.0,,1.0,1.0,0.875,1.0,1.0,1.0
+NP_000073.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004728.1,0.8,1.0,0.8,0.8,,1.0,,0.8,1.0,1.0,1.0,1.0
+NP_009203.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.944,1.0,1.0,1.0,1.0
+NP_000034.1,0.975,0.862,0.912,0.925,0.975,1.0,0.85,0.525,1.0,1.0,1.0,0.975
+NP_000202.3,0.905,1.0,0.978,0.956,,1.0,,,1.0,0.83,0.85,0.93
+NP_061169.2,1.0,1.0,1.0,0.875,1.0,0.333,0.5,0.75,1.0,0.75,1.0,0.375
+NP_612422.2,0.877,,0.882,0.814,0.853,0.75,0.922,0.99,0.971,0.941,0.971,0.961
+NP_001730.1,0.938,1.0,0.938,0.75,1.0,,,0.875,0.625,1.0,1.0,1.0
+NP_060762.3,0.778,0.778,0.667,0.778,0.333,1.0,1.0,0.556,1.0,0.889,0.889,0.778
+NP_065434.1,0.886,0.886,0.829,0.8,1.0,1.0,0.8,0.843,0.886,1.0,0.914,1.0
+NP_110440.1,0.721,0.844,0.743,0.868,0.868,0.778,0.897,0.809,0.912,0.941,0.882,0.846
+NP_078954.4,0.86,0.64,0.9,0.96,0.82,0.75,0.9,0.59,0.94,0.96,0.96,0.96
+NP_001288.3,0.959,0.885,0.914,0.944,1.0,1.0,0.996,0.942,0.925,1.0,0.978,0.978
+NP_000038.2,0.784,0.727,0.759,0.792,0.739,,0.833,0.686,0.811,0.871,0.845,0.864
+NP_919253.1,0.944,0.866,0.791,0.957,0.955,0.938,0.69,0.901,0.948,0.944,0.948,0.866
+NP_115925.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_660200.1,0.0,0.0,0.0,0.0,1.0,,1.0,0.0,1.0,0.0,1.0,1.0
+NP_056240.2,0.952,0.905,0.952,0.952,0.952,1.0,0.857,0.667,1.0,1.0,0.952,0.952
+NP_001034437.1,1.0,0.667,0.833,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001193928.1,1.0,0.438,0.938,1.0,1.0,1.0,0.688,0.656,1.0,1.0,1.0,0.938
+NP_000911.2,0.97,0.918,1.0,0.975,0.973,1.0,,0.795,0.997,0.981,0.995,1.0
+NP_000197.1,0.839,0.89,0.901,0.934,0.98,0.977,0.834,0.907,0.966,0.966,0.954,0.974
+NP_598004.1,0.997,0.897,0.935,0.973,1.0,1.0,0.902,0.899,0.918,0.984,0.989,0.989
+NP_001073945.1,0.8,1.0,1.0,1.0,1.0,1.0,0.8,0.6,1.0,1.0,1.0,1.0
+NP_443099.1,0.774,0.727,0.745,0.705,0.766,0.667,0.673,0.538,0.824,0.727,0.786,0.778
+NP_000340.2,1.0,0.727,1.0,0.955,0.909,1.0,0.409,0.705,1.0,1.0,0.818,0.909
+NP_001182058.1,0.938,1.0,,,,,1.0,,1.0,1.0,1.0,1.0
+NP_000363.1,0.909,0.917,0.898,0.958,0.902,1.0,0.86,0.634,0.852,0.962,0.966,0.992
+NP_009060.2,1.0,1.0,1.0,1.0,1.0,1.0,0.0,0.25,1.0,1.0,1.0,1.0
+NP_003403.2,0.833,0.778,0.778,0.778,0.611,0.667,0.333,0.694,0.778,0.833,0.833,0.722
+NP_036459.1,,0.688,,,0.909,0.76,,0.623,0.909,0.935,0.909,0.961
+NP_005085.2,0.969,0.969,0.984,0.969,1.0,1.0,0.906,1.0,1.0,1.0,1.0,1.0
+NP_000184.1,1.0,0.594,1.0,0.969,1.0,1.0,1.0,0.792,0.938,1.0,0.938,1.0
+NP_000491.4,0.984,0.995,0.913,0.984,,1.0,,0.853,0.953,0.963,0.979,0.989
+NP_000447.2,0.722,0.694,0.861,0.5,0.806,0.8,,0.597,0.722,0.75,0.694,0.75
+NP_001461.1,0.8,0.4,1.0,0.4,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000295.1,0.987,0.744,,,0.962,1.0,0.885,0.936,0.987,0.846,0.974,1.0
+NP_001165948.1,,,,,,,,,1.0,0.857,0.918,1.0
+NP_005129.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_114091.4,0.984,0.714,0.968,0.984,,,0.75,0.746,0.968,0.968,1.0,0.968
+NP_001194.1,0.948,0.917,0.979,0.896,0.979,0.95,,0.708,0.854,1.0,1.0,1.0
+NP_004605.4,0.857,0.524,,,0.964,,0.94,0.845,0.952,0.964,0.964,0.893
+NP_000045.1,,0.882,,,0.935,1.0,,0.916,0.991,0.964,0.98,0.986
+NP_775925.1,1.0,1.0,1.0,1.0,1.0,,,1.0,0.333,1.0,1.0,1.0
+NP_057613.4,0.9,0.9,0.9,0.8,,,0.9,0.85,1.0,1.0,1.0,1.0
+NP_058634.4,1.0,1.0,0.938,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001549.2,0.982,0.75,1.0,0.982,0.973,1.0,0.946,0.732,0.893,0.929,0.982,0.946
+NP_000475.1,0.74,0.825,0.799,0.841,0.607,0.878,0.805,0.932,0.883,0.877,0.883,0.818
+NP_777577.2,0.931,0.944,0.958,0.889,1.0,1.0,,0.944,1.0,1.0,1.0,0.917
+NP_775754.2,0.958,0.717,0.958,1.0,1.0,0.9,,0.854,0.983,0.95,0.975,0.983
+NP_001243643.1,0.932,1.0,,,,,0.955,1.0,0.955,1.0,1.0,1.0
+NP_006170.1,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000484.2,0.746,0.687,0.748,0.713,0.715,0.674,0.735,0.642,0.846,0.725,0.892,0.671
+NP_689996.4,1.0,0.75,1.0,1.0,0.875,,1.0,0.812,1.0,1.0,1.0,1.0
+NP_060958.2,0.933,0.857,0.956,0.937,0.976,0.967,,0.889,0.937,0.984,1.0,0.992
+NP_001840.3,0.981,0.982,0.987,0.985,0.984,0.983,0.891,0.911,0.965,0.973,0.993,0.988
+NP_619649.1,1.0,0.875,0.875,0.938,,1.0,0.875,0.938,0.938,1.0,1.0,0.938
+NP_065753.2,0.75,0.8,,,,1.0,0.7,0.375,0.4,1.0,0.9,0.9
+NP_005754.2,1.0,0.625,0.938,0.875,0.75,1.0,,0.625,1.0,1.0,1.0,0.875
+NP_002791.1,0.0,1.0,0.0,1.0,0.0,,,1.0,0.5,0.5,0.5,1.0
+NP_000246.2,0.984,0.971,0.998,0.991,,0.976,0.95,0.842,0.994,0.995,1.0,1.0
+NP_004888.2,1.0,0.444,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_054799.4,0.853,0.926,0.885,0.828,,1.0,0.941,0.914,0.838,0.917,0.922,0.912
+NP_886552.3,0.875,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000683.3,0.9,0.6,0.9,0.8,,,0.8,0.0,0.6,0.8,0.8,0.8
+NP_065685.1,1.0,0.833,1.0,0.833,1.0,0.8,0.833,0.917,1.0,1.0,1.0,1.0
+NP_640341.1,0.75,1.0,1.0,1.0,1.0,1.0,0.5,0.75,1.0,1.0,1.0,1.0
+NP_001157877.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002520.2,0.925,0.814,0.91,0.897,0.856,0.918,0.845,0.875,0.974,0.944,0.948,0.903
+NP_001293061.2,0.934,0.935,0.976,0.969,0.707,1.0,0.704,0.871,0.985,0.921,0.954,0.961
+NP_068776.2,0.972,1.0,0.958,1.0,1.0,1.0,0.986,0.847,1.0,1.0,1.0,1.0
+NP_004997.4,1.0,0.833,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_002147.2,0.5,0.833,0.583,0.333,0.333,,,0.333,0.667,0.333,0.5,0.667
+NP_000933.1,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0
+NP_077025.2,1.0,0.929,0.976,0.881,0.952,0.944,0.81,0.81,0.929,0.905,0.952,0.929
+NP_002431.2,0.75,0.5,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_037523.2,0.571,1.0,1.0,1.0,0.857,,0.714,0.786,1.0,1.0,1.0,1.0
+NP_775766.2,0.833,1.0,0.833,0.917,1.0,1.0,0.583,0.542,0.917,0.875,0.917,0.75
+NP_000909.2,1.0,1.0,1.0,1.0,1.0,1.0,0.929,0.821,0.714,1.0,1.0,1.0
+NP_003621.1,1.0,0.0,1.0,1.0,1.0,,0.0,1.0,1.0,1.0,1.0,1.0
+NP_612404.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003485.1,0.925,0.854,0.926,0.918,0.897,0.896,0.814,0.772,0.862,0.903,0.923,0.894
+NP_001473.1,0.889,0.889,0.889,0.944,0.944,1.0,0.833,0.75,1.0,1.0,0.944,0.944
+NP_037518.3,0.889,1.0,0.833,0.889,1.0,0.667,,0.5,0.889,1.0,1.0,0.889
+NP_071765.2,0.906,0.853,0.857,0.89,0.875,0.987,0.846,0.738,0.897,0.945,0.989,0.949
+NP_060215.4,0.725,0.975,0.55,0.95,0.557,1.0,,,0.875,0.925,0.975,1.0
+NP_001011551.1,1.0,0.6,0.6,0.4,0.4,0.5,,0.2,1.0,0.8,1.0,0.4
+NP_000422.1,1.0,0.969,0.984,1.0,1.0,,1.0,0.984,1.0,1.0,1.0,0.969
+NP_000080.2,0.982,0.995,0.992,0.996,0.995,0.986,0.931,0.96,0.989,0.988,0.994,0.991
+NP_001009999.1,,0.5,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057683.1,1.0,1.0,0.5,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0
+NP_061868.1,0.8,1.0,0.8,0.8,0.8,1.0,1.0,1.0,1.0,0.8,1.0,1.0
+NP_002851.2,0.838,0.875,0.757,0.831,0.743,0.569,0.882,0.669,0.846,0.868,0.86,0.838
+NP_116281.2,0.0,1.0,1.0,0.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078961.3,0.986,0.9,0.9,0.943,0.907,0.909,,0.786,1.0,0.857,0.943,0.957
+NP_060141.3,0.5,0.417,0.5,0.5,0.75,0.333,0.5,0.333,1.0,0.75,1.0,0.833
+NP_004968.2,0.875,0.75,0.812,0.804,,0.812,0.866,0.844,0.821,0.893,0.911,0.929
+NP_005994.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0
+NP_001001937.1,1.0,1.0,1.0,1.0,1.0,,,0.8,1.0,1.0,1.0,1.0
+NP_003043.3,0.667,0.833,0.528,0.722,1.0,0.667,1.0,0.75,1.0,0.944,1.0,1.0
+NP_008871.3,0.912,0.559,0.745,0.892,0.941,0.922,0.255,0.471,0.833,0.902,0.912,0.824
+NP_000438.2,0.75,0.75,0.95,0.9,0.9,1.0,0.6,0.75,1.0,1.0,0.9,1.0
+NP_004362.2,0.973,0.811,0.983,0.912,0.469,0.732,0.83,0.631,0.961,0.988,0.951,0.995
+NP_001123561.1,0.893,0.893,0.929,1.0,,,0.929,1.0,0.929,0.964,1.0,0.929
+NP_006151.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055744.2,0.909,0.818,0.955,1.0,1.0,1.0,,0.909,1.0,0.909,1.0,1.0
+NP_001269154.1,0.919,0.85,0.95,0.812,,,,0.812,0.988,0.975,0.875,0.962
+NP_001365406.2,,,,,,,,,1.0,1.0,1.0,1.0
+NP_004444.2,0.847,0.997,0.794,0.893,0.862,0.995,0.905,0.982,0.924,0.776,0.874,0.929
+NP_689656.2,0.987,0.962,1.0,1.0,0.962,1.0,,0.891,0.962,0.962,0.962,0.987
+NP_597725.1,0.917,0.5,1.0,1.0,0.833,1.0,,0.667,1.0,0.833,0.833,1.0
+NP_055929.1,1.0,0.913,0.957,0.739,0.957,0.333,0.957,0.87,0.826,1.0,0.957,0.957
+NP_055199.1,0.958,0.75,1.0,0.917,1.0,1.0,0.917,0.75,1.0,0.958,0.958,1.0
+NP_005403.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001174.2,0.846,0.731,0.865,0.846,0.865,0.583,0.846,0.827,1.0,0.962,1.0,0.885
+NP_056227.2,1.0,,0.833,0.667,,,0.667,0.667,1.0,1.0,1.0,1.0
+NP_203123.1,1.0,0.4,1.0,1.0,1.0,,,0.45,1.0,1.0,1.0,1.0
+NP_036240.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000384.2,0.931,0.97,0.923,0.917,0.985,0.964,,0.648,0.979,0.928,0.929,0.923
+NP_003602.1,0.86,0.95,0.831,0.876,0.771,0.875,0.899,0.839,0.938,0.86,0.888,0.849
+NP_075555.1,,0.25,,,1.0,1.0,,0.847,0.917,0.972,0.944,1.0
+NP_065875.3,1.0,1.0,0.95,1.0,,,0.8,0.95,1.0,1.0,1.0,1.0
+NP_000545.1,1.0,0.929,0.957,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003651.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.667,0.667,1.0,1.0
+NP_000350.1,0.924,0.896,0.953,0.986,0.965,1.0,,0.882,0.927,0.996,0.996,0.986
+NP_001001412.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006155.2,1.0,1.0,1.0,1.0,1.0,0.75,0.857,0.929,1.0,1.0,1.0,1.0
+NP_055513.2,1.0,0.722,1.0,1.0,0.833,0.778,0.833,0.556,0.889,0.778,0.889,0.889
+NP_689476.2,0.429,1.0,0.607,0.786,,,0.143,0.929,0.929,0.786,0.786,1.0
+NP_001121631.1,1.0,0.857,1.0,1.0,1.0,1.0,,0.786,1.0,1.0,1.0,1.0
+NP_006843.2,1.0,1.0,0.958,0.979,1.0,1.0,0.833,0.948,0.854,0.958,0.958,0.958
+NP_001305781.1,0.956,0.867,,,,,0.978,0.822,0.978,1.0,1.0,1.0
+NP_008860.4,0.923,1.0,0.885,0.923,0.846,1.0,,0.923,0.846,0.923,1.0,0.923
+NP_001027567.2,,0.933,1.0,1.0,,,0.667,0.8,1.0,1.0,1.0,1.0
+NP_008990.2,0.5,0.5,0.75,1.0,1.0,1.0,0.75,0.75,1.0,1.0,1.0,1.0
+NP_067632.2,1.0,0.667,1.0,1.0,0.833,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003623.1,1.0,1.0,1.0,1.0,1.0,0.75,0.75,0.75,1.0,1.0,1.0,1.0
+NP_001027565.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004482.4,1.0,1.0,,,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0
+NP_002460.1,0.0,0.25,0.0,0.25,0.0,,0.5,0.625,0.75,0.25,0.25,0.5
+NP_003064.2,,1.0,,,0.375,,,0.188,0.5,0.875,0.625,0.0
+NP_000711.1,,0.992,,,,,1.0,0.658,0.921,0.952,0.96,0.984
+NP_001123990.1,0.792,0.833,0.806,0.944,0.944,1.0,,0.833,0.972,1.0,0.972,1.0
+NP_653213.6,0.882,0.871,0.824,0.906,,1.0,0.859,0.8,0.965,0.812,0.929,0.953
+NP_003969.2,0.75,0.917,0.708,0.792,0.833,1.0,0.875,0.833,0.875,0.636,0.625,0.792
+NP_000465.1,1.0,0.19,1.0,1.0,0.905,1.0,1.0,0.762,1.0,1.0,1.0,1.0
+NP_055306.1,0.8,0.6,0.938,0.875,1.0,0.833,0.0,0.325,0.8,0.6,0.55,0.5
+NP_116071.2,0.941,0.94,0.911,0.934,0.937,0.925,,0.775,0.941,0.967,0.961,0.936
+NP_057664.1,0.849,0.816,0.77,0.961,1.0,0.833,0.671,0.928,1.0,0.974,1.0,1.0
+NP_203129.1,0.938,0.875,0.875,1.0,1.0,1.0,0.875,0.75,1.0,1.0,1.0,0.875
+NP_002891.1,0.778,,,,,,0.519,,0.593,0.667,0.778,0.593
+NP_001262.3,0.962,0.945,,,0.917,0.949,0.954,0.592,0.968,0.952,0.962,0.968
+NP_778243.1,1.0,1.0,,,,,0.9,0.9,1.0,1.0,1.0,1.0
+NP_001181939.1,1.0,1.0,1.0,0.909,,,0.909,0.818,1.0,1.0,0.955,0.909
+NP_055782.3,1.0,0.938,1.0,0.938,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000081.2,0.977,0.999,0.992,0.994,0.994,0.994,0.895,0.89,0.999,0.988,0.994,0.98
+NP_071417.2,1.0,0.667,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003579.3,0.8,0.867,0.667,0.967,0.867,0.8,0.9,0.7,0.8,0.933,0.933,0.9
+NP_758840.1,0.75,0.75,0.75,0.75,1.0,,,0.875,0.75,1.0,1.0,1.0
+NP_001129649.1,1.0,1.0,1.0,1.0,0.833,0.933,,1.0,1.0,1.0,1.0,1.0
+NP_001012426.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001177666.1,1.0,1.0,1.0,0.792,0.889,0.875,0.958,0.688,0.958,0.958,0.958,1.0
+NP_068745.2,0.8,1.0,0.8,1.0,1.0,1.0,1.0,0.85,1.0,1.0,1.0,1.0
+NP_659434.2,1.0,0.933,0.9,0.833,0.733,0.85,0.9,0.667,0.933,0.967,1.0,0.933
+NP_689967.2,0.667,0.667,0.667,1.0,0.667,0.0,0.667,0.833,0.667,1.0,1.0,0.667
+NP_001229817.2,0.917,0.842,0.967,0.892,1.0,1.0,0.894,0.671,0.917,0.817,0.858,0.858
+NP_000211.1,0.879,0.758,0.818,0.879,0.909,,0.788,0.712,0.909,0.818,0.909,0.788
+NP_001138584.1,1.0,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0
+NP_000316.2,,1.0,,,,,,0.442,1.0,1.0,1.0,1.0
+NP_653174.3,0.938,1.0,0.5,1.0,0.25,1.0,1.0,1.0,1.0,0.875,0.75,1.0
+NP_775748.2,1.0,0.8,,,1.0,1.0,0.2,0.9,1.0,0.8,0.6,0.6
+NP_148982.1,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_055917.1,0.448,0.75,0.667,0.979,0.688,0.75,0.333,0.729,1.0,0.729,0.688,0.938
+NP_001128493.1,1.0,0.9,0.917,1.0,,1.0,0.857,0.4,1.0,1.0,1.0,1.0
+NP_001617.1,1.0,1.0,1.0,1.0,1.0,,0.0,0.5,1.0,1.0,1.0,1.0
+NP_003713.3,0.843,0.86,0.878,0.96,0.945,0.99,0.857,0.739,0.919,0.941,0.967,0.941
+NP_067032.3,,0.965,0.708,0.764,0.861,0.852,,0.889,0.812,0.965,0.889,0.896
+NP_065114.3,0.8,0.8,0.75,0.8,0.9,1.0,0.9,1.0,0.9,0.9,0.8,0.9
+NP_001139267.1,0.833,0.333,1.0,1.0,1.0,,,0.833,1.0,0.333,0.5,0.167
+NP_000200.1,0.9,0.7,0.9,0.8,0.7,0.8,0.9,0.9,0.8,0.8,0.8,0.9
+NP_060798.2,0.385,0.797,0.726,0.742,0.603,0.627,0.78,0.577,0.733,0.737,0.733,0.763
+NP_001261.2,0.65,0.4,,,0.3,0.5,,0.5,0.7,0.7,0.5,0.5
+NP_005466.1,0.938,0.75,0.938,0.75,1.0,,1.0,1.0,1.0,0.875,1.0,1.0
+NP_006061.2,1.0,0.778,0.25,1.0,0.833,0.5,,0.667,1.0,0.889,0.778,0.833
+NP_000530.1,0.849,0.911,0.782,0.924,0.915,0.977,0.737,0.879,0.896,0.932,0.917,0.876
+NP_073747.1,1.0,1.0,0.8,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_067063.1,0.6,0.6,0.7,0.7,0.9,0.5,0.7,0.9,0.1,0.3,0.2,0.5
+NP_003461.2,0.833,1.0,1.0,1.0,1.0,1.0,0.667,0.833,0.778,1.0,1.0,1.0
+NP_068799.2,0.556,0.556,0.556,0.444,0.556,,,1.0,0.333,0.333,0.333,0.333
+NP_003482.1,0.952,0.81,1.0,0.81,1.0,1.0,0.905,0.857,0.952,0.952,1.0,0.952
+NP_002885.1,1.0,1.0,1.0,0.5,,,,1.0,1.0,1.0,1.0,1.0
+NP_004647.1,0.905,0.905,0.952,0.881,0.94,0.857,0.917,0.952,0.952,1.0,1.0,0.929
+NP_036601.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001121901.1,0.0,1.0,0.4,0.0,0.0,,,1.0,0.0,0.8,0.6,0.0
+NP_000337.1,0.957,0.955,0.94,0.897,0.79,0.973,,0.734,0.885,0.983,0.92,0.955
+NP_055147.1,,0.0,,,,0.0,,0.5,1.0,0.0,0.0,0.0
+NP_006450.2,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_001244159.1,0.878,0.893,0.845,0.649,,,,0.595,0.548,0.935,0.976,0.804
+NP_001095876.1,0.94,0.707,0.957,0.845,0.69,0.667,0.69,0.776,0.914,0.672,0.638,0.793
+NP_001307509.1,0.875,0.875,0.5,0.875,1.0,1.0,,0.438,0.75,0.875,0.875,1.0
+NP_061764.2,0.837,0.846,0.619,0.762,0.381,0.75,1.0,0.885,0.692,0.744,0.859,0.846
+NP_004026.2,1.0,1.0,1.0,1.0,0.6,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002760.1,0.821,0.513,0.791,0.735,0.862,1.0,,0.637,0.735,0.767,0.769,0.821
+NP_996919.1,1.0,1.0,0.972,0.944,,,,0.889,1.0,1.0,1.0,1.0
+NP_001073991.2,0.91,0.915,0.861,0.931,0.863,0.871,0.913,0.819,0.89,0.967,0.961,0.884
+NP_056655.3,,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056979.1,0.444,0.778,0.222,0.667,0.889,0.333,,0.778,0.111,0.556,0.556,0.778
+NP_001005369.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_060859.4,0.833,1.0,0.917,1.0,1.0,1.0,0.667,0.667,1.0,1.0,1.0,1.0
+NP_002599.1,0.75,0.625,0.375,0.5,0.5,0.0,0.75,0.625,0.875,0.75,0.75,1.0
+NP_055238.1,1.0,0.792,0.944,0.917,0.889,1.0,0.917,0.861,0.528,0.986,0.986,0.875
+NP_058651.3,1.0,0.75,1.0,1.0,1.0,1.0,0.917,0.5,1.0,1.0,0.833,1.0
+NP_004616.2,0.5,1.0,0.75,0.5,1.0,1.0,0.5,1.0,0.5,1.0,0.5,1.0
+NP_679211.2,0.923,0.846,0.769,1.0,,,1.0,0.769,0.923,0.923,1.0,1.0
+NP_001229886.1,0.907,1.0,0.896,0.958,0.875,1.0,,0.833,0.963,0.963,0.963,0.963
+NP_659403.4,0.925,0.975,0.925,0.975,1.0,,0.85,0.9,1.0,0.95,0.975,1.0
+NP_065071.1,0.738,0.505,0.662,0.638,0.743,0.26,0.771,0.33,0.657,0.848,0.829,0.781
+NP_945185.1,1.0,0.75,1.0,1.0,0.875,1.0,,0.75,1.0,0.625,0.875,0.875
+NP_055761.2,0.936,0.592,0.94,0.955,0.903,0.975,,0.98,0.99,0.975,0.987,0.99
+NP_001177203.1,0.873,0.754,0.759,0.904,0.833,0.796,0.852,0.534,0.831,0.873,0.929,0.907
+NP_115996.1,0.962,0.923,0.962,0.846,0.846,,0.923,0.885,1.0,1.0,1.0,1.0
+NP_009190.2,1.0,0.889,1.0,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0
+NP_694944.1,0.4,0.3,0.45,0.4,0.4,1.0,0.2,0.6,0.7,0.7,0.8,0.8
+NP_000157.1,0.964,0.975,0.946,0.993,0.989,0.892,,0.959,0.971,1.0,1.0,1.0
+NP_598006.1,0.867,0.417,1.0,0.667,0.75,0.8,0.8,1.0,0.867,0.867,0.867,0.933
+NP_005413.2,0.923,0.941,0.914,0.805,0.852,0.903,,0.832,0.955,0.909,0.905,0.852
+NP_009004.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,0.9
+NP_055461.1,0.833,0.833,0.917,0.833,0.833,,0.833,0.833,0.833,0.833,0.833,0.833
+NP_004743.1,0.585,0.738,0.623,0.646,0.662,0.5,0.846,0.823,0.6,0.95,0.692,0.862
+NP_004966.1,0.911,0.975,0.909,0.97,0.933,0.949,,0.932,0.986,0.974,0.98,0.975
+NP_004741.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001534.1,1.0,1.0,1.0,1.0,,0.5,1.0,0.8,1.0,1.0,1.0,1.0
+NP_005553.2,0.978,0.87,0.739,1.0,1.0,1.0,0.783,0.935,1.0,1.0,1.0,0.913
+NP_783328.1,0.88,0.992,0.681,0.95,0.975,1.0,,0.914,0.992,0.975,0.983,0.971
+NP_001025178.1,1.0,1.0,1.0,1.0,1.0,,,0.643,1.0,1.0,1.0,1.0
+NP_001124295.1,0.932,0.983,0.943,0.928,,,0.956,0.955,0.931,0.972,0.95,0.975
+NP_004786.2,1.0,1.0,0.929,0.929,1.0,1.0,0.857,0.75,0.929,0.929,1.0,0.929
+NP_060379.2,,0.786,,,0.857,0.833,,1.0,0.857,0.643,0.857,0.929
+NP_060277.1,1.0,0.769,1.0,0.769,0.885,0.222,0.846,0.731,0.885,0.923,0.885,1.0
+NP_476516.1,0.583,0.944,0.722,0.667,0.778,0.333,0.889,0.556,0.889,0.889,0.833,0.778
+NP_000165.1,1.0,1.0,1.0,1.0,1.0,,0.875,1.0,1.0,1.0,1.0,1.0
+NP_006177.1,1.0,1.0,1.0,0.9,0.8,1.0,,0.5,1.0,1.0,1.0,1.0
+NP_000456.2,0.967,0.767,1.0,1.0,1.0,1.0,0.9,0.8,0.8,0.967,0.9,1.0
+NP_001070.2,0.75,0.562,0.688,0.5,0.375,0.875,0.75,0.375,0.875,0.5,0.5,0.562
+NP_000481.2,0.958,0.542,0.917,0.75,0.917,1.0,0.708,0.771,0.875,0.917,0.917,0.917
+NP_073746.2,0.855,0.839,1.0,0.962,1.0,,1.0,1.0,0.968,0.903,1.0,0.742
+NP_000247.2,0.746,0.87,0.766,0.805,0.868,1.0,0.815,,0.915,0.851,0.893,0.882
+NP_002456.2,0.847,0.803,0.785,0.722,,,0.903,0.486,0.875,0.806,0.889,0.958
+NP_001353076.1,,,,,,,,,0.75,0.8,0.8,0.85
+NP_000044.2,0.91,0.921,0.914,0.877,0.862,0.994,0.914,0.853,0.943,0.901,0.924,0.926
+NP_938011.1,0.955,0.321,0.955,0.839,,,0.411,0.472,0.911,0.464,0.554,0.321
+NP_647480.1,0.938,0.75,1.0,0.875,0.625,1.0,1.0,0.625,1.0,1.0,0.875,0.875
+NP_115791.3,0.983,0.987,0.896,1.0,0.942,0.844,0.966,0.979,0.992,0.971,0.995,0.979
+NP_647479.2,0.984,0.841,0.992,0.952,0.992,1.0,0.968,0.833,0.984,1.0,0.984,1.0
+NP_001774.1,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0
+NP_001709.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060035.2,0.979,0.792,,,0.854,0.667,0.917,0.75,1.0,0.917,0.958,0.958
+NP_001123616.1,1.0,0.6,1.0,1.0,1.0,1.0,0.6,0.5,1.0,0.9,1.0,1.0
+NP_808592.2,,0.968,,,0.921,0.865,,0.917,0.77,0.943,0.923,0.92
+NP_055054.1,1.0,1.0,1.0,1.0,1.0,,0.812,0.438,1.0,1.0,1.0,1.0
+NP_006120.1,0.639,0.944,0.833,0.722,0.722,0.667,0.833,1.0,0.778,0.722,0.722,0.667
+NP_009139.1,1.0,1.0,1.0,1.0,0.667,,1.0,0.833,1.0,1.0,1.0,0.667
+NP_003457.1,1.0,1.0,1.0,1.0,1.0,1.0,0.792,0.917,1.0,1.0,1.0,0.917
+NP_938023.1,0.866,0.857,0.938,0.893,0.893,1.0,0.982,0.911,0.429,0.929,0.929,0.875
+NP_055887.3,0.778,0.889,0.833,0.778,1.0,,0.944,0.583,1.0,0.917,0.917,0.917
+NP_116575.3,0.783,0.96,0.74,0.93,1.0,1.0,0.95,0.908,0.764,0.928,0.952,0.992
+NP_002746.1,0.983,1.0,0.983,1.0,0.717,0.8,0.917,0.983,0.983,1.0,1.0,1.0
+NP_002435.1,0.545,0.591,0.636,0.909,0.955,0.786,0.682,0.727,0.5,0.955,0.909,1.0
+NP_000343.2,0.968,0.822,0.939,0.914,0.98,0.952,0.938,0.71,0.914,0.939,0.935,0.903
+NP_004577.1,0.995,0.788,1.0,0.989,0.804,0.967,,0.826,0.995,0.989,0.995,1.0
+NP_694571.2,1.0,1.0,0.938,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_612370.3,0.963,1.0,1.0,1.0,1.0,,0.889,0.944,1.0,1.0,1.0,1.0
+NP_000107.1,0.895,1.0,,,0.947,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036340.1,0.825,0.85,0.85,0.75,0.85,1.0,,0.95,0.55,0.95,0.95,0.85
+NP_000164.5,0.845,0.928,0.882,0.901,0.849,1.0,0.75,0.754,0.947,0.934,0.908,0.934
+NP_065873.2,0.751,0.928,0.781,0.781,,0.867,0.889,0.86,0.932,0.882,0.925,0.922
+NP_115618.3,0.909,1.0,1.0,1.0,1.0,1.0,0.909,0.818,1.0,1.0,1.0,1.0
+NP_005701.1,1.0,0.667,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0,1.0,1.0
+NP_570722.2,0.833,0.667,1.0,1.0,,1.0,0.667,0.333,1.0,1.0,1.0,1.0
+NP_003313.3,1.0,0.973,0.952,1.0,1.0,1.0,0.993,0.952,1.0,1.0,1.0,1.0
+NP_004319.1,0.985,0.868,0.956,1.0,0.933,1.0,,0.809,1.0,1.0,1.0,0.882
+NP_001482.1,1.0,,,0.5,,,,,1.0,1.0,1.0,1.0
+NP_001020105.1,0.0,0.0,0.667,0.0,0.0,,,0.333,0.0,0.667,0.333,0.333
+NP_758454.1,,0.654,,,1.0,1.0,,0.635,1.0,1.0,1.0,1.0
+NP_006294.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.6,1.0,1.0,1.0
+NP_068770.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000115.1,0.99,0.985,1.0,0.975,0.906,0.993,0.952,,0.988,0.988,1.0,0.994
+NP_002755.1,1.0,0.706,1.0,1.0,1.0,,1.0,0.441,1.0,1.0,1.0,1.0
+NP_001155197.1,1.0,1.0,1.0,1.0,,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_054728.2,0.886,0.937,0.881,0.877,0.869,0.93,0.857,0.844,0.894,0.946,0.918,0.897
+NP_004175.2,0.5,1.0,0.0,0.5,0.5,,,1.0,1.0,1.0,1.0,0.5
+NP_001191354.2,,,,,,,0.679,0.929,0.726,0.869,0.869,0.94
+NP_003467.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.5,1.0,1.0,1.0,1.0
+NP_037514.2,0.933,1.0,0.933,0.933,0.867,,0.767,0.9,0.867,0.933,0.933,0.933
+NP_005017.3,0.58,1.0,0.69,0.84,0.543,,1.0,0.62,0.84,0.72,0.86,0.86
+NP_000312.2,0.78,0.754,0.808,0.804,0.84,0.827,0.67,0.822,0.801,0.832,0.806,0.794
+NP_001483.3,1.0,,1.0,1.0,1.0,1.0,0.95,0.933,0.9,1.0,1.0,1.0
+NP_001761.3,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006114.1,0.966,0.918,0.966,0.924,0.928,0.923,0.864,0.844,0.936,0.943,0.961,0.955
+NP_055496.2,0.929,0.857,1.0,1.0,1.0,1.0,0.857,0.929,0.643,1.0,1.0,1.0
+NP_006209.2,0.708,0.983,0.767,0.767,0.704,0.472,0.867,0.752,0.779,0.805,0.925,0.938
+NP_001017995.1,1.0,0.944,1.0,1.0,,1.0,0.944,0.882,1.0,0.889,1.0,0.722
+NP_000651.3,1.0,0.543,1.0,1.0,,0.857,0.771,0.943,0.829,1.0,1.0,1.0
+NP_004320.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_937983.2,0.89,0.74,0.917,0.952,0.976,0.818,0.974,0.959,0.799,0.988,0.974,0.975
+NP_001061.2,0.571,0.714,0.571,0.571,0.714,,0.857,0.857,0.714,0.857,0.143,0.857
+NP_058515.1,0.912,1.0,0.647,1.0,0.967,,0.941,0.912,1.0,1.0,1.0,1.0
+NP_998885.1,0.9,0.9,0.875,0.85,,0.8,0.8,0.55,0.9,0.85,0.85,0.9
+NP_001153605.1,0.939,0.732,0.842,0.944,0.955,0.778,0.931,0.961,0.935,0.941,0.948,0.971
+NP_002372.1,0.852,0.766,0.828,0.812,0.953,0.929,0.875,0.875,0.75,0.922,0.984,0.984
+NP_001020560.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_937799.1,0.725,0.705,,0.733,0.821,0.973,0.611,0.728,0.368,0.572,0.782,0.705
+NP_001209.1,0.9,1.0,0.9,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005254.2,1.0,0.667,0.778,1.0,0.111,1.0,,0.944,1.0,1.0,1.0,1.0
+NP_004179.3,1.0,0.958,0.875,1.0,0.458,0.5,,1.0,1.0,0.875,1.0,0.583
+NP_000534.3,0.948,0.992,0.945,0.937,,,0.867,0.986,0.945,0.967,0.973,0.977
+NP_001340.2,1.0,0.778,1.0,0.778,0.889,1.0,0.778,0.833,0.889,1.0,0.889,1.0
+NP_006651.2,1.0,1.0,0.938,1.0,1.0,1.0,1.0,0.688,1.0,1.0,1.0,1.0
+NP_005369.2,1.0,0.657,1.0,1.0,1.0,1.0,0.943,0.814,0.943,0.971,0.914,1.0
+NP_001003722.1,1.0,1.0,1.0,1.0,0.875,,0.875,0.875,1.0,1.0,1.0,1.0
+NP_057092.2,0.591,0.682,0.795,0.591,0.773,0.5,0.682,0.636,0.864,0.864,0.818,0.682
+NP_000062.1,0.95,0.888,0.961,0.966,0.953,0.977,,0.802,0.97,0.94,0.961,0.978
+NP_063945.2,0.912,0.718,0.736,0.986,0.981,0.845,0.887,0.947,0.963,0.954,0.991,0.981
+NP_057161.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000899.1,0.812,0.875,1.0,0.875,0.75,0.5,1.0,0.438,0.75,0.875,1.0,1.0
+NP_002038.2,0.874,0.952,0.892,0.944,,0.857,0.831,0.75,0.9,0.918,0.922,0.883
+NP_002287.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001170787.2,0.934,0.763,0.886,0.776,0.86,1.0,0.886,0.798,0.921,0.912,0.947,0.982
+NP_003709.3,0.937,0.811,0.844,0.933,0.881,0.966,0.952,,0.948,0.962,0.957,0.946
+NP_000286.3,0.884,0.714,0.912,0.852,0.886,0.909,,0.551,0.821,0.886,0.935,0.888
+NP_003892.2,1.0,0.917,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_065842.1,1.0,1.0,1.0,1.0,0.833,1.0,0.857,0.429,1.0,1.0,1.0,1.0
+NP_005364.1,0.779,0.871,0.8,0.971,0.925,1.0,0.8,0.769,0.914,0.943,0.921,0.964
+NP_005699.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_055660.1,0.75,0.5,1.0,0.75,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_060121.3,0.94,0.898,0.889,0.98,0.941,0.874,,0.899,0.974,0.935,0.957,0.985
+NP_054742.2,0.781,0.875,0.703,0.719,0.75,0.188,,0.625,0.875,0.844,0.875,0.781
+NP_954986.2,0.882,0.886,0.904,0.904,0.789,0.588,0.794,0.904,0.877,0.921,0.921,0.939
+NP_060676.2,1.0,0.7,1.0,0.8,1.0,1.0,0.5,0.5,1.0,1.0,0.8,1.0
+NP_000061.1,0.84,0.914,0.879,0.823,0.896,0.912,0.836,0.711,0.862,0.919,0.939,0.95
+NP_005678.3,0.633,0.7,0.7,0.8,0.783,1.0,0.733,0.8,0.767,0.9,0.867,0.833
+NP_001275929.1,0.844,0.867,0.889,0.911,,,0.844,0.967,0.956,0.956,0.978,0.933
+NP_054706.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_000024.2,0.91,0.856,0.931,0.944,0.926,0.969,0.907,0.896,0.963,0.953,0.962,0.963
+NP_653234.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.5,0.5,1.0,1.0,1.0
+NP_620148.1,0.82,0.84,,,1.0,1.0,0.88,0.88,1.0,1.0,1.0,0.96
+NP_004808.2,,0.88,,,0.947,0.6,,0.92,1.0,0.96,0.96,0.96
+NP_000229.1,0.858,0.809,0.944,0.903,0.945,0.814,0.95,0.831,0.896,0.944,0.935,0.944
+NP_001041629.1,1.0,1.0,1.0,0.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_001025482.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.867,1.0,1.0,1.0,1.0
+NP_001170991.1,0.769,1.0,0.669,0.929,,,0.867,0.974,0.905,0.788,0.818,0.831
+NP_054831.2,0.853,0.99,0.817,0.852,,,0.866,0.887,0.986,0.923,0.976,0.848
+NP_006236.1,0.929,0.726,0.964,0.833,0.976,0.9,0.905,0.75,0.833,0.869,0.881,0.893
+NP_006585.2,1.0,1.0,1.0,1.0,1.0,0.5,,0.938,0.625,1.0,0.875,1.0
+NP_000028.3,1.0,0.875,0.938,0.938,0.812,0.667,0.812,0.844,1.0,0.938,0.938,0.875
+NP_000009.1,0.947,0.978,0.333,1.0,0.853,0.812,0.976,0.978,0.978,0.884,0.976,0.993
+NP_068369.1,1.0,1.0,1.0,0.625,0.375,,1.0,0.75,0.5,1.0,1.0,0.625
+NP_001177716.1,1.0,0.333,1.0,0.667,1.0,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_665877.1,0.5,1.0,1.0,0.0,,,1.0,0.75,0.5,0.5,0.5,0.0
+NP_061485.1,0.929,0.429,0.857,0.714,,,0.429,0.286,0.929,1.0,1.0,0.571
+NP_002863.1,1.0,1.0,1.0,0.333,1.0,,0.333,1.0,1.0,1.0,1.0,1.0
+NP_000118.2,0.971,0.991,0.974,0.989,0.985,0.996,0.868,0.684,0.957,0.996,0.987,0.996
+NP_060476.2,0.954,0.785,1.0,1.0,0.929,1.0,,0.888,0.992,0.954,0.969,0.985
+NP_002606.3,1.0,1.0,1.0,0.833,0.833,1.0,0.833,0.833,1.0,1.0,1.0,1.0
+NP_079436.4,0.617,0.7,0.667,0.733,0.667,0.8,0.8,0.675,0.767,0.667,0.767,0.733
+NP_005624.2,0.882,0.876,0.822,0.861,0.824,0.938,,0.824,0.92,0.902,0.905,0.933
+NP_005242.1,,0.438,,,1.0,1.0,,0.5,0.875,1.0,1.0,1.0
+NP_001444.2,1.0,0.762,0.992,1.0,1.0,0.956,,1.0,1.0,1.0,1.0,1.0
+NP_073625.1,0.5,0.5,0.5,0.5,0.5,0.0,0.375,0.5,0.5,0.5,0.5,0.5
+NP_001153508.1,0.699,0.815,0.695,0.717,,,0.768,0.697,0.738,0.866,0.82,0.844
+NP_115927.1,1.0,1.0,0.714,1.0,0.857,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001529.2,0.875,0.95,0.812,0.75,,0.667,1.0,0.925,1.0,0.8,0.9,0.9
+NP_003551.2,0.926,1.0,0.974,0.958,0.925,0.913,0.966,0.97,0.974,0.992,1.0,1.0
+NP_003871.1,0.9,0.889,0.822,0.822,0.933,1.0,0.8,0.844,0.889,0.933,0.911,0.956
+NP_001129507.1,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0,1.0
+NP_001107655.1,0.679,0.679,0.679,0.607,0.643,0.583,0.5,0.643,0.714,0.429,0.536,0.536
+NP_001123577.1,,0.711,,,,,0.911,0.875,1.0,0.978,0.933,1.0
+NP_002766.1,0.938,0.815,0.969,0.985,1.0,0.9,0.938,0.985,0.938,0.954,0.985,0.985
+NP_116023.2,0.889,0.889,0.911,0.822,0.889,0.375,0.81,0.933,0.911,0.978,0.933,0.822
+NP_001077088.2,,1.0,,,0.8,,0.85,1.0,1.0,1.0,0.95,0.95
+NP_006584.1,1.0,0.819,1.0,1.0,0.972,0.967,0.708,0.958,1.0,1.0,1.0,0.986
+NP_004172.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003910.1,1.0,0.938,1.0,1.0,,,0.875,0.438,0.812,1.0,1.0,0.875
+NP_000160.1,0.964,0.954,0.958,0.928,0.946,0.972,0.869,0.936,0.938,0.988,0.98,0.98
+NP_006867.1,0.4,0.5,0.5,0.8,,0.5,0.6,0.55,0.6,0.5,0.8,0.6
+NP_001355847.1,,,,,,,,,0.977,0.932,0.992,0.955
+NP_003982.1,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_000022.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_008870.2,0.852,0.939,0.833,0.771,0.845,0.86,0.775,0.753,0.861,0.876,0.874,0.916
+NP_006027.2,0.4,0.6,0.667,0.6,0.133,0.5,,0.933,1.0,0.917,0.6,0.533
+NP_597700.1,1.0,1.0,1.0,1.0,1.0,,0.778,1.0,0.889,1.0,1.0,1.0
+NP_056289.2,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_008919.3,0.667,0.222,0.667,0.333,0.444,,0.222,,0.778,0.667,0.667,0.778
+NP_000472.2,0.96,0.87,0.97,1.0,1.0,1.0,0.96,0.875,0.97,1.0,1.0,1.0
+NP_002011.2,0.989,0.961,1.0,0.956,0.803,0.95,0.978,,0.978,1.0,1.0,1.0
+NP_004367.2,1.0,0.778,1.0,0.917,0.889,1.0,0.833,0.833,1.0,1.0,1.0,1.0
+NP_005509.1,0.727,0.818,0.818,0.909,0.909,,0.909,0.818,0.909,0.727,0.818,0.909
+NP_653164.2,1.0,0.5,1.0,1.0,1.0,,0.75,0.625,1.0,1.0,1.0,1.0
+NP_619646.1,1.0,0.5,1.0,0.5,1.0,,,1.0,0.0,1.0,0.5,1.0
+NP_945316.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000483.3,0.985,0.955,0.997,0.988,0.99,0.979,0.996,0.813,0.992,0.997,0.998,1.0
+NP_612433.1,0.889,1.0,0.889,0.889,,,1.0,0.889,1.0,1.0,1.0,1.0
+NP_000124.1,0.894,0.829,0.902,0.909,0.868,0.984,0.912,0.901,0.912,0.901,0.916,0.935
+NP_000161.2,0.955,0.914,0.963,0.954,0.965,0.924,0.877,0.824,0.954,0.984,0.974,0.972
+NP_005641.1,0.984,0.862,0.873,0.667,0.857,0.892,,0.897,0.952,0.894,0.915,0.884
+NP_109590.3,0.574,0.598,0.772,0.43,0.739,0.902,0.827,0.807,0.735,0.608,0.712,0.879
+NP_002244.1,,0.833,,,0.917,1.0,,0.875,0.917,0.417,0.583,0.833
+NP_000396.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_037386.1,0.341,0.545,0.25,0.364,0.727,,0.818,0.818,0.591,0.727,0.955,0.682
+NP_000099.2,1.0,0.857,1.0,1.0,1.0,,,0.786,1.0,1.0,1.0,1.0
+NP_995322.1,1.0,0.8,1.0,1.0,1.0,,1.0,0.8,1.0,1.0,1.0,1.0
+NP_115484.2,0.742,0.545,,,0.697,0.4,0.742,0.433,0.803,0.955,0.833,0.818
+NP_005857.1,0.7,0.9,0.7,0.6,0.9,,0.6,1.0,0.7,0.9,0.8,0.8
+NP_001307852.1,0.938,1.0,0.875,1.0,0.812,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_001839.2,0.874,0.908,0.895,0.927,0.935,0.943,0.92,0.83,0.9,0.862,0.939,0.941
+NP_000538.3,0.872,0.712,0.923,0.885,0.909,0.944,0.939,0.705,0.84,0.885,0.885,0.891
+NP_000496.2,0.6,0.8,0.75,0.6,0.3,1.0,0.4,0.4,0.4,0.8,0.6,0.8
+NP_001973.2,0.722,0.778,0.667,0.667,0.889,,0.778,0.722,0.722,0.778,0.722,0.722
+NP_000138.2,0.909,1.0,0.886,1.0,1.0,1.0,0.955,0.909,0.977,0.977,1.0,0.955
+NP_002217.3,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000203.2,0.917,0.776,0.934,0.949,0.947,0.99,0.836,0.765,0.904,0.971,0.955,0.951
+NP_443711.2,0.811,0.821,0.861,0.911,,,0.667,0.644,0.9,0.878,0.878,0.789
+NP_001354903.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.0,1.0,1.0,1.0,1.0
+NP_001365421.1,0.65,1.0,1.0,0.95,0.9,,0.85,0.8,0.848,0.939,0.97,1.0
+NP_000470.3,0.312,0.938,0.5,0.25,0.667,0.25,0.938,0.688,0.688,0.75,0.875,0.938
+NP_003051.1,0.948,0.967,0.989,0.993,0.986,0.931,0.983,0.952,0.994,0.997,0.996,1.0
+NP_006000.2,0.842,0.98,0.973,0.899,0.933,0.966,,0.148,0.94,0.886,0.919,0.993
+NP_005991.1,1.0,1.0,,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0
+NP_006077.2,0.617,0.595,,0.854,0.771,0.557,0.613,0.649,0.829,0.64,0.703,0.703
+NP_001060.1,0.53,0.607,,0.688,0.696,0.464,0.33,0.47,0.631,0.762,0.702,0.756
+NP_821080.1,0.661,0.903,,0.516,0.516,0.371,0.532,0.258,0.855,0.581,0.613,0.565
+NP_821133.1,0.913,1.0,,1.0,0.913,,,1.0,0.783,1.0,0.913,0.913
+NP_110400.1,1.0,0.897,0.641,0.897,0.974,1.0,0.641,0.808,0.949,1.0,1.0,0.949
+NP_006078.2,0.721,0.697,,0.242,0.667,0.803,0.564,0.558,0.624,0.715,0.758,0.764
+NP_060733.4,0.871,0.8,0.943,0.929,0.764,0.667,0.757,0.786,0.6,0.871,0.9,0.943
+NP_005583.1,1.0,0.917,1.0,0.854,0.771,0.688,0.917,0.875,1.0,1.0,0.938,0.833
+NP_005985.3,0.5,1.0,0.75,0.5,0.5,1.0,0.75,1.0,1.0,1.0,1.0,1.0
+NP_001030025.1,0.937,0.936,0.921,0.967,0.991,0.905,,0.9,0.937,0.957,0.966,0.952
+NP_004010.1,0.943,0.914,0.986,1.0,,,1.0,0.571,0.886,1.0,1.0,0.943
+NP_002308.2,0.833,0.556,0.778,0.944,1.0,1.0,,1.0,1.0,1.0,1.0,0.944
+NP_000252.1,0.912,0.878,0.89,0.964,0.966,0.994,0.914,0.904,0.936,0.946,0.98,0.97
+NP_006423.1,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005661.1,0.85,0.6,0.95,1.0,0.8,,0.9,0.95,0.8,0.95,0.95,0.95
+NP_000315.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_000827.2,0.965,0.54,0.896,0.944,0.965,0.843,0.955,0.902,0.97,0.929,0.934,0.909
+NP_000213.1,0.98,1.0,1.0,1.0,0.68,1.0,1.0,1.0,0.96,1.0,1.0,1.0
+NP_036345.2,1.0,0.818,1.0,1.0,0.909,1.0,1.0,0.955,1.0,1.0,1.0,1.0
+NP_478123.1,0.792,0.875,,,,1.0,0.958,0.812,0.875,0.792,0.792,0.75
+NP_001859.1,1.0,1.0,0.985,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_076932.1,0.833,1.0,1.0,1.0,1.0,1.0,1.0,0.833,0.833,0.833,0.833,0.833
+NP_056372.1,0.958,1.0,0.875,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001388.1,1.0,1.0,1.0,1.0,1.0,,0.6,0.6,0.4,1.0,1.0,0.6
+NP_001070712.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_037383.1,1.0,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0
+NP_063916.1,0.903,1.0,0.839,0.935,,,0.968,0.935,0.903,0.871,0.935,0.645
+NP_115497.4,1.0,0.818,0.955,1.0,,,,0.875,1.0,1.0,1.0,1.0
+NP_000257.1,,0.929,,,1.0,1.0,,0.929,0.857,1.0,1.0,1.0
+NP_001137453.1,1.0,0.9,1.0,1.0,,,0.5,1.0,1.0,1.0,1.0,1.0
+NP_000275.1,0.91,0.922,0.892,0.959,0.882,0.94,0.765,0.67,0.963,0.967,0.961,0.982
+NP_001035937.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003886.3,0.96,1.0,0.9,0.96,,,1.0,0.72,0.96,0.96,0.96,0.96
+NP_000797.2,,0.835,,,0.789,0.841,,0.724,0.83,0.821,0.784,0.812
+NP_001245218.1,1.0,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004981.2,1.0,1.0,0.938,1.0,1.0,,1.0,0.875,0.875,1.0,1.0,1.0
+NP_005899.3,,0.889,,,,,1.0,0.889,1.0,1.0,1.0,1.0
+NP_065851.1,0.714,0.714,0.857,0.857,0.857,1.0,0.571,0.929,0.857,1.0,0.857,1.0
+NP_000177.2,0.962,0.695,0.914,1.0,,0.972,0.676,,0.857,0.943,0.99,0.819
+NP_001287771.1,0.875,0.857,0.875,1.0,1.0,0.667,1.0,0.429,1.0,1.0,1.0,1.0
+NP_079272.4,0.954,0.875,0.957,0.947,0.892,0.822,0.808,0.825,0.918,0.934,0.923,0.957
+NP_002067.1,1.0,1.0,1.0,1.0,1.0,,1.0,0.833,1.0,1.0,1.0,1.0
+NP_002600.1,0.978,0.922,0.991,0.988,0.657,0.915,0.923,0.825,0.985,0.997,0.993,0.998
+NP_000548.2,1.0,0.938,1.0,0.938,,1.0,,1.0,1.0,0.944,1.0,0.944
+NP_036311.3,0.833,1.0,0.833,0.833,1.0,1.0,0.583,,1.0,0.833,0.833,0.75
+NP_077277.1,0.768,0.339,0.826,0.826,1.0,,1.0,0.911,0.857,0.946,0.893,0.929
+NP_055069.1,1.0,0.8,1.0,1.0,,1.0,1.0,0.9,1.0,1.0,1.0,1.0
+NP_001269613.2,,,,,,,,,1.0,1.0,1.0,1.0
+NP_000708.1,0.786,0.5,0.821,0.571,0.5,1.0,0.714,0.5,0.571,0.643,0.714,0.5
+NP_665859.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.5,1.0,1.0,1.0,1.0
+NP_003631.2,0.947,0.974,0.895,0.974,1.0,1.0,1.0,0.974,1.0,1.0,1.0,1.0
+NP_001106962.1,1.0,1.0,0.5,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_689703.1,1.0,0.75,0.75,0.75,1.0,,,1.0,0.25,1.0,1.0,1.0
+NP_000360.2,0.876,0.884,0.909,0.909,,0.889,,0.876,0.884,0.901,0.95,0.909
+NP_004854.1,1.0,0.417,1.0,0.875,0.812,0.833,1.0,0.667,0.875,0.875,0.917,1.0
+NP_000219.2,0.828,0.91,0.841,0.917,0.9,1.0,,0.821,0.966,0.834,0.903,0.91
+NP_000293.2,0.857,0.929,1.0,0.893,1.0,,1.0,0.946,0.857,1.0,0.964,0.857
+NP_000194.2,0.861,0.666,0.884,0.942,0.883,1.0,0.801,0.845,0.893,0.972,0.961,0.979
+NP_919299.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.929,1.0,1.0,1.0,1.0
+NP_001076007.1,1.0,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001457.1,1.0,1.0,0.963,1.0,1.0,1.0,1.0,0.938,0.889,1.0,0.926,1.0
+NP_005219.2,0.846,0.923,0.833,0.872,0.897,0.774,0.872,0.885,0.744,0.962,0.923,0.936
+NP_036335.1,0.92,0.875,0.982,0.946,0.911,0.875,0.911,0.867,0.982,0.952,1.0,0.946
+NP_001138341.1,0.75,0.875,0.75,0.875,0.875,1.0,0.25,0.75,0.875,0.875,0.875,1.0
+NP_149091.1,0.87,0.522,,,1.0,1.0,0.304,0.674,0.957,1.0,1.0,1.0
+NP_002445.2,0.98,0.98,0.971,0.902,,,1.0,0.725,0.941,0.961,1.0,0.961
+NP_705902.2,0.812,0.756,0.612,0.988,0.925,1.0,0.994,0.962,0.95,0.894,0.938,0.981
+NP_004846.4,1.0,0.7,1.0,1.0,1.0,1.0,0.9,0.85,1.0,1.0,1.0,1.0
+NP_002325.2,0.885,0.962,0.923,0.962,1.0,0.5,1.0,0.673,1.0,1.0,1.0,0.923
+NP_001708.3,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004472.1,1.0,1.0,1.0,0.833,1.0,,0.833,0.583,0.833,1.0,1.0,1.0
+NP_003570.2,0.925,0.894,0.872,0.953,0.9,0.969,0.847,0.838,0.919,0.938,0.9,0.919
+NP_002226.1,1.0,1.0,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_000131.2,0.943,0.743,0.943,0.829,0.886,0.833,0.8,0.886,0.771,0.857,0.857,0.914
+NP_000205.1,0.886,0.915,0.894,0.909,0.939,0.915,0.847,0.732,0.926,0.937,0.95,0.881
+NP_036566.1,1.0,0.667,0.944,1.0,1.0,,0.778,0.5,0.944,0.944,0.944,0.944
+NP_000283.1,0.991,0.922,0.994,0.965,0.968,0.978,0.924,0.781,0.907,0.981,0.938,0.981
+NP_009146.2,0.833,1.0,1.0,1.0,1.0,0.0,0.667,0.75,1.0,0.833,0.833,0.833
+NP_001304019.1,0.667,0.542,0.688,0.958,0.875,0.917,,1.0,1.0,1.0,1.0,0.958
+NP_001037850.1,1.0,1.0,1.0,1.0,0.875,,0.5,0.875,1.0,1.0,1.0,1.0
+NP_001273644.1,1.0,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003887.3,,0.667,,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_877435.3,0.889,1.0,1.0,0.889,0.778,,,0.944,1.0,0.875,1.0,0.889
+NP_001229825.1,0.75,0.97,0.8,0.95,0.857,1.0,0.848,0.825,0.742,0.864,0.955,0.879
+NP_001025053.1,0.696,0.538,0.731,0.902,0.764,0.568,,0.875,0.44,0.875,0.853,0.815
+NP_640335.2,0.672,0.625,0.891,0.812,0.562,0.667,,0.828,0.906,0.792,0.812,0.812
+NP_001908.3,0.5,0.25,0.0,0.5,,,0.25,0.625,0.25,1.0,0.25,0.25
+NP_060530.3,0.636,0.5,,,0.864,,0.591,0.614,0.955,0.818,0.955,0.909
+NP_055120.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078874.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_078772.1,0.893,0.393,0.911,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_004272.2,0.978,0.989,0.933,0.944,0.889,1.0,0.978,1.0,1.0,1.0,1.0,1.0
+NP_004077.1,0.92,0.607,0.991,0.893,0.821,1.0,,0.732,0.661,0.839,0.857,0.946
+NP_071372.1,0.9,0.9,0.9,0.9,0.7,1.0,0.6,0.7,0.9,0.9,1.0,1.0
+NP_852664.1,1.0,1.0,1.0,1.0,0.917,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_490597.1,1.0,1.0,1.0,1.0,0.844,1.0,1.0,0.938,1.0,1.0,1.0,1.0
+NP_000036.2,1.0,0.923,0.923,1.0,1.0,,0.923,0.923,1.0,0.615,0.769,1.0
+NP_006026.3,0.889,0.939,0.818,0.879,0.672,0.75,0.929,0.712,0.99,0.909,0.98,0.798
+NP_789845.1,1.0,0.625,0.975,1.0,1.0,1.0,0.925,1.0,1.0,1.0,1.0,1.0
+NP_659428.2,,0.857,,,0.857,0.8,,0.976,1.0,0.976,0.976,1.0
+NP_000541.1,0.933,0.933,0.933,1.0,1.0,1.0,0.7,0.917,1.0,1.0,1.0,1.0
+NP_002733.2,0.909,0.682,,,0.591,1.0,0.864,0.682,1.0,0.909,0.818,0.773
+NP_000147.1,0.636,0.342,0.754,0.991,0.886,1.0,0.904,0.798,0.86,0.974,1.0,0.904
+NP_001003800.1,0.911,0.89,0.899,0.967,0.948,0.896,0.938,0.799,0.977,0.966,0.954,0.968
+NP_001254538.1,0.5,1.0,1.0,1.0,,,1.0,,0.0,1.0,1.0,1.0
+NP_000082.2,0.992,0.968,0.991,0.987,0.987,0.986,0.917,0.945,0.993,0.992,0.995,0.995
+NP_523353.2,1.0,0.8,0.6,0.8,0.2,1.0,0.2,0.4,0.2,0.2,0.8,0.6
+NP_005112.2,0.712,0.875,0.846,0.74,0.74,0.583,0.885,0.808,0.865,0.731,0.779,0.846
+NP_002463.2,0.864,0.864,0.841,0.455,1.0,1.0,0.864,0.364,0.682,1.0,1.0,0.773
+NP_001222.3,0.143,0.0,0.0,0.0,0.0,0.0,0.429,0.0,0.143,0.0,0.0,0.0
+NP_000279.1,1.0,1.0,1.0,1.0,1.0,1.0,0.889,0.778,0.944,1.0,1.0,0.833
+NP_057444.2,0.333,0.667,0.667,0.667,1.0,,0.667,0.667,1.0,1.0,1.0,1.0
+NP_004773.1,1.0,1.0,1.0,1.0,0.5,,0.25,0.5,0.75,0.75,1.0,1.0
+NP_001014839.1,0.75,0.75,0.875,1.0,0.875,0.875,,0.75,0.875,1.0,1.0,1.0
+NP_060404.4,0.899,0.926,0.878,0.808,0.827,0.94,0.848,0.698,0.792,0.94,0.875,0.89
+NP_071357.2,0.427,0.396,0.276,0.437,0.344,0.528,,0.167,0.656,0.323,0.344,0.438
+NP_000190.1,0.818,0.88,0.868,0.903,0.706,1.0,0.732,0.802,0.866,0.863,0.901,0.912
+NP_001116229.1,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000414.2,0.929,1.0,0.913,0.924,0.962,1.0,0.967,,0.989,1.0,0.989,0.989
+NP_000217.2,0.73,0.683,0.929,0.889,0.984,1.0,1.0,0.8,0.889,0.937,0.889,0.937
+NP_620170.3,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000067.1,0.743,1.0,0.8,1.0,0.686,0.643,,,1.0,1.0,1.0,1.0
+NP_006748.1,0.833,0.889,0.667,1.0,,1.0,0.778,0.833,0.778,1.0,1.0,1.0
+NP_997253.2,1.0,0.25,1.0,1.0,0.625,,1.0,0.875,1.0,1.0,1.0,1.0
+NP_004571.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_002569.1,1.0,1.0,1.0,1.0,,,,0.9,1.0,1.0,1.0,1.0
+NP_006454.1,1.0,1.0,0.917,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001001430.1,0.965,0.814,0.965,0.93,,,,0.916,0.936,0.994,1.0,0.994
+NP_000678.1,0.75,0.875,0.75,1.0,1.0,1.0,0.438,0.938,0.938,0.938,0.938,0.688
+NP_543136.1,0.95,1.0,0.95,0.95,1.0,1.0,1.0,0.8,0.9,1.0,1.0,1.0
+NP_054774.2,0.4,0.8,0.8,0.6,0.7,1.0,1.0,0.6,0.6,0.8,0.8,0.8
+NP_055790.1,1.0,1.0,0.94,0.988,0.69,1.0,1.0,0.982,1.0,1.0,1.0,1.0
+NP_003733.2,0.977,0.915,0.963,0.971,0.888,0.995,,0.832,0.974,0.977,0.987,0.983
+NP_000126.2,0.915,0.814,0.948,0.877,0.909,0.894,0.848,0.912,0.688,0.942,0.94,0.945
+NP_001273003.2,,,,,,,,,0.903,0.875,0.922,0.88
+NP_003184.1,,0.727,,,1.0,1.0,,0.697,1.0,1.0,0.939,1.0
+NP_006260.1,0.967,0.894,0.935,1.0,0.826,0.933,0.958,1.0,0.878,0.942,0.947,0.921
+NP_055414.2,1.0,1.0,0.857,1.0,1.0,1.0,,0.857,1.0,1.0,1.0,1.0
+NP_079146.2,0.333,0.667,0.667,0.0,0.667,,0.333,0.75,1.0,1.0,0.667,0.667
+NP_001185797.1,0.806,0.944,0.75,0.778,0.8,1.0,,0.722,0.778,0.889,0.778,0.778
+NP_689807.1,0.917,0.889,0.972,0.972,0.861,0.917,1.0,0.833,0.972,0.917,0.972,0.917
+NP_000317.1,1.0,1.0,0.75,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0
+NP_938012.2,0.683,0.9,0.5,0.8,0.5,0.5,,0.643,0.867,0.833,0.767,0.867
+NP_000544.2,0.906,0.969,0.875,0.938,0.844,1.0,0.906,0.766,0.875,0.969,0.969,0.938
+NP_848612.2,0.929,1.0,,,,0.667,1.0,0.929,1.0,0.857,1.0,1.0
+NP_002065.1,0.94,0.695,,,0.69,0.903,0.83,0.578,0.895,0.905,0.88,0.825
+NP_067642.1,0.943,1.0,1.0,0.957,0.771,0.867,,0.543,1.0,1.0,1.0,0.457
+NP_005264.2,1.0,0.75,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001072982.1,0.667,0.333,0.667,0.0,1.0,1.0,,0.667,0.667,0.333,0.333,0.333
+NP_000935.1,0.691,0.564,0.836,0.782,0.909,0.955,0.982,0.527,0.745,0.8,0.836,0.891
+NP_003788.2,0.9,1.0,1.0,0.9,0.8,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_005174.2,0.953,0.989,0.917,0.928,0.933,0.988,0.894,0.908,0.978,0.95,1.0,0.983
+NP_001978.1,0.733,1.0,0.733,0.867,0.933,0.6,1.0,0.933,1.0,0.933,1.0,1.0
+NP_000120.2,1.0,0.829,1.0,1.0,,1.0,,0.843,1.0,0.943,1.0,1.0
+NP_031401.1,0.412,0.0,,,,0.625,,0.176,0.941,0.765,0.765,0.471
+NP_689735.1,0.0,0.0,0.0,0.0,0.5,,0.0,1.0,1.0,0.0,0.0,0.0
+NP_000128.1,0.85,0.736,0.9,0.909,0.733,1.0,0.952,0.8,0.827,0.977,0.977,0.92
+NP_055856.1,0.625,0.5,,,0.875,1.0,0.875,0.875,0.875,0.875,0.75,0.75
+NP_000240.1,0.88,0.834,0.877,0.829,0.901,0.933,0.813,0.746,0.891,0.922,0.919,0.927
+NP_001007528.1,0.625,1.0,0.625,1.0,1.0,1.0,1.0,0.75,1.0,0.75,1.0,1.0
+NP_689632.2,0.88,0.903,0.893,0.974,,,0.955,0.95,0.987,0.909,0.981,1.0
+NP_001357587.1,1.0,0.871,1.0,1.0,,,,0.824,0.961,0.98,0.988,0.984
+NP_001164672.1,1.0,1.0,,,,,0.5,1.0,1.0,1.0,1.0,1.0
+NP_054768.2,0.965,0.767,0.96,0.938,0.934,0.901,0.872,0.793,0.889,0.924,0.917,0.934
+NP_964012.2,0.643,1.0,0.714,0.929,0.929,1.0,0.857,0.9,1.0,1.0,1.0,1.0
+NP_775771.3,0.972,0.944,0.5,0.944,0.176,1.0,0.944,0.941,1.0,1.0,1.0,0.778
+NP_056028.2,0.8,0.6,0.8,0.8,0.8,0.0,1.0,0.8,0.8,0.8,0.8,1.0
+NP_000368.1,0.931,0.902,0.928,0.956,0.909,0.979,0.915,0.874,0.972,0.972,0.972,0.991
+NP_065166.2,1.0,0.7,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_057272.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.375,1.0,1.0,1.0,1.0
+NP_001015880.1,0.917,0.833,0.917,0.833,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006525.2,1.0,1.0,1.0,0.333,1.0,,0.333,0.0,0.667,0.667,1.0,0.667
+NP_036584.1,1.0,1.0,,,,,0.8,1.0,1.0,0.8,1.0,1.0
+NP_000181.2,0.926,0.945,0.914,0.821,0.951,,,0.833,0.964,0.964,0.964,0.97
+NP_056087.2,0.778,0.721,0.639,0.611,0.669,0.533,0.389,0.765,0.647,0.735,0.824,0.868
+NP_001349700.1,,1.0,,,0.881,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000987.2,0.75,1.0,0.5,0.75,0.75,1.0,,0.375,0.75,0.5,0.5,0.75
+NP_005211.1,1.0,0.889,1.0,1.0,1.0,1.0,1.0,0.722,1.0,1.0,1.0,1.0
+NP_001011658.1,1.0,0.5,1.0,0.5,0.5,1.0,0.5,0.5,1.0,1.0,1.0,0.5
+NP_899200.1,0.838,0.837,0.915,0.965,0.946,0.801,0.92,0.877,0.957,0.92,0.942,0.947
+NP_004259.3,1.0,0.833,1.0,1.0,1.0,1.0,0.944,0.833,0.778,1.0,1.0,1.0
+NP_004159.2,1.0,0.881,0.786,0.714,0.952,1.0,0.976,0.56,0.952,0.929,0.905,0.976
+NP_006512.2,0.975,1.0,0.975,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001796.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000058.1,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_065099.3,0.852,0.875,0.867,0.828,0.912,0.4,0.875,0.961,0.844,0.922,0.828,0.906
+NP_001008214.1,0.833,1.0,1.0,1.0,1.0,0.667,,1.0,0.917,0.917,0.917,0.917
+NP_005052.1,1.0,0.75,0.75,0.75,1.0,,1.0,0.875,0.75,0.75,0.75,0.75
+NP_001180345.1,,0.968,,,0.815,0.954,,0.991,0.991,0.875,0.926,0.991
+NP_001073986.1,0.884,0.934,0.857,0.923,0.89,0.136,0.646,0.964,0.942,0.913,0.923,0.969
+NP_057122.2,0.714,0.857,0.714,0.857,0.857,,0.714,0.857,0.857,1.0,0.714,0.857
+NP_000320.1,0.723,0.935,0.737,0.796,0.782,,0.935,0.677,0.823,0.828,0.903,0.79
+NP_078941.2,1.0,1.0,1.0,1.0,0.975,0.975,,0.837,1.0,1.0,1.0,1.0
+NP_891847.1,0.693,0.824,0.83,0.92,0.892,0.651,,0.852,0.898,0.96,0.943,0.972
+NP_004964.2,0.867,1.0,1.0,0.733,0.6,0.0,0.867,0.933,0.333,0.933,0.8,0.933
+NP_005887.2,1.0,0.958,1.0,0.917,1.0,1.0,,0.917,1.0,0.875,0.917,0.958
+NP_001290425.1,0.845,0.785,0.77,0.745,0.852,1.0,,0.688,0.738,0.78,0.78,0.825
+NP_003070.3,1.0,1.0,0.667,1.0,1.0,,,0.5,1.0,1.0,1.0,1.0
+NP_112420.1,0.667,0.667,0.667,1.0,1.0,,,,1.0,1.0,1.0,1.0
+NP_954659.1,0.8,0.8,1.0,1.0,0.6,1.0,,0.8,1.0,1.0,1.0,0.8
+NP_001035.1,1.0,1.0,0.95,1.0,1.0,1.0,1.0,0.8,1.0,1.0,1.0,1.0
+NP_060011.1,1.0,1.0,1.0,1.0,,1.0,,0.5,1.0,1.0,1.0,1.0
+NP_116250.3,0.939,0.97,0.97,0.97,1.0,1.0,,0.909,1.0,1.0,1.0,1.0
+NP_078792.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,0.0,1.0,1.0,1.0
+NP_055800.2,0.971,0.971,0.882,0.853,0.962,0.333,0.882,0.853,0.882,0.794,0.971,0.882
+NP_004526.1,0.833,0.667,0.833,0.5,0.5,1.0,0.333,0.167,0.833,0.833,0.833,0.667
+NP_038460.4,0.955,1.0,0.97,0.97,0.833,,,0.919,1.0,1.0,1.0,1.0
+NP_002653.1,1.0,1.0,0.983,1.0,0.933,0.75,0.967,0.833,0.983,0.983,1.0,0.983
+NP_001073890.2,0.75,1.0,0.75,1.0,1.0,1.0,0.75,0.75,0.75,1.0,1.0,1.0
+NP_055689.1,0.973,0.969,0.988,0.995,0.995,0.792,0.978,0.834,0.993,1.0,1.0,0.998
+NP_001166167.1,0.881,0.905,0.881,0.857,,,0.857,0.833,0.81,0.952,0.905,0.952
+NP_000329.2,,0.917,,,0.907,,,0.865,0.963,0.907,0.963,0.907
+NP_054859.2,0.943,0.969,0.997,0.979,1.0,1.0,,0.932,1.0,0.984,0.984,0.995
+NP_000527.2,0.99,0.868,0.906,0.921,0.931,0.986,,0.908,0.822,0.944,0.977,0.993
+NP_006850.2,,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_853512.1,1.0,0.833,1.0,0.5,,1.0,1.0,0.667,1.0,1.0,0.833,1.0
+NP_006222.2,0.892,0.688,0.946,0.763,0.892,0.697,0.656,0.624,0.914,0.903,0.925,0.849
+NP_000274.3,0.862,0.917,0.925,0.954,0.865,0.796,0.894,0.888,0.958,0.962,0.954,0.954
+NP_005682.2,0.97,0.985,1.0,1.0,1.0,1.0,0.909,0.955,1.0,1.0,1.0,0.985
+NP_005987.3,1.0,0.889,0.931,0.972,0.667,1.0,0.944,0.958,1.0,1.0,1.0,1.0
+NP_000282.1,0.833,0.438,0.583,0.521,0.958,1.0,0.688,0.521,0.667,0.889,0.896,0.917
+NP_005328.2,0.714,0.464,0.768,0.75,0.821,,0.75,0.5,0.75,0.607,0.571,0.679
+NP_000424.2,0.65,1.0,0.6,0.65,1.0,1.0,,0.9,0.9,0.9,1.0,1.0
+NP_056265.2,0.75,0.833,0.833,0.833,0.583,1.0,0.833,0.917,0.917,0.417,0.583,0.583
+NP_004985.2,1.0,0.778,0.889,1.0,1.0,,1.0,0.778,1.0,0.444,1.0,0.778
+NP_000478.3,0.9,0.771,0.919,0.963,,,,0.76,0.873,0.826,0.85,0.865
+NP_005973.1,1.0,1.0,0.95,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_058628.3,1.0,0.667,1.0,0.333,0.667,0.0,0.333,1.0,0.667,0.0,0.0,0.0
+NP_001108220.1,0.583,0.667,0.5,0.417,0.583,0.0,,,0.917,0.917,1.0,0.917
+NP_055844.2,0.733,0.867,0.933,1.0,0.091,0.5,,0.733,1.0,0.533,0.667,0.867
+NP_071451.2,,0.801,,,0.746,0.864,,0.746,0.876,0.729,0.827,0.757
+NP_002668.1,1.0,1.0,1.0,0.929,0.786,0.875,0.571,0.786,0.929,1.0,0.929,1.0
+NP_003581.1,0.812,0.688,0.875,0.938,1.0,0.833,0.812,0.531,0.812,1.0,1.0,0.938
+NP_001122092.1,0.821,0.964,0.929,0.929,0.75,0.857,1.0,0.732,1.0,1.0,1.0,1.0
+NP_620689.1,0.931,0.892,0.92,0.963,0.976,0.998,0.983,0.929,0.926,0.946,0.97,0.985
+NP_004229.1,0.925,0.85,0.775,1.0,0.85,0.875,0.925,0.65,0.975,0.925,0.95,0.925
+NP_065825.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_612373.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001380429.1,,1.0,,,,,1.0,,0.333,0.667,0.667,0.667
+NP_000245.2,1.0,1.0,1.0,1.0,0.958,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002172.2,0.881,1.0,0.81,0.429,0.905,1.0,1.0,0.667,0.714,0.857,0.714,0.857
+NP_064612.2,0.875,1.0,1.0,0.875,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060269.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,0.875
+NP_619636.2,0.952,0.905,0.81,1.0,1.0,,,0.952,1.0,1.0,1.0,1.0
+NP_003856.1,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_068761.4,0.75,0.567,0.783,0.767,0.517,0.7,0.533,0.683,0.7,0.7,0.7,0.5
+NP_036292.2,0.917,1.0,0.958,1.0,1.0,1.0,,0.896,1.0,1.0,1.0,1.0
+NP_001878.1,,0.556,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001918.3,0.983,0.914,0.983,0.966,0.948,0.929,0.655,0.741,0.948,0.966,0.966,0.983
+NP_060789.2,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079191.2,0.2,1.0,0.6,1.0,0.8,,,0.4,1.0,1.0,1.0,1.0
+NP_001274420.1,0.929,0.929,,,,,0.964,0.917,0.964,0.964,0.857,0.964
+NP_001171818.1,1.0,,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0
+NP_001334748.1,0.875,0.75,0.75,0.5,1.0,,1.0,0.5,0.75,1.0,0.75,1.0
+NP_061939.3,0.489,0.667,,,,,,,0.778,0.622,0.711,0.211
+NP_055637.2,1.0,0.75,1.0,0.938,1.0,,0.938,0.812,1.0,1.0,0.938,1.0
+NP_001356.1,,0.75,,,,,1.0,,0.875,1.0,1.0,1.0
+NP_004612.2,0.85,0.875,0.85,0.875,0.675,0.55,1.0,0.6,0.95,0.875,0.85,0.825
+NP_003681.1,0.125,0.0,0.5,0.25,0.25,,0.25,0.375,0.25,0.25,0.5,0.25
+NP_114423.1,0.953,0.906,0.953,0.812,0.898,1.0,0.938,0.922,0.797,0.891,0.969,1.0
+NP_009330.1,0.763,0.474,0.553,0.724,0.338,0.263,,0.395,0.868,0.474,0.553,0.763
+NP_000514.2,1.0,0.667,1.0,1.0,1.0,,1.0,0.5,1.0,1.0,1.0,1.0
+NP_938207.2,1.0,1.0,1.0,0.9,,,1.0,0.9,1.0,1.0,1.0,1.0
+NP_009125.1,1.0,1.0,1.0,1.0,1.0,,,0.938,1.0,1.0,1.0,1.0
+NP_001185885.1,1.0,1.0,0.5,0.5,,,1.0,1.0,0.5,1.0,1.0,1.0
+NP_055448.1,0.912,0.975,0.888,0.95,0.938,1.0,0.962,0.85,0.912,0.962,0.988,0.938
+NP_005723.2,1.0,1.0,,,0.455,1.0,0.909,0.909,1.0,1.0,1.0,1.0
+NP_001073270.1,0.889,0.889,,,1.0,,,1.0,1.0,1.0,0.889,1.0
+NP_000321.1,0.877,0.869,0.867,0.91,0.907,0.96,0.708,0.845,0.871,0.96,0.966,0.921
+NP_001336584.1,1.0,0.167,0.667,1.0,0.333,,0.5,0.167,0.727,0.682,0.955,0.818
+NP_061336.1,0.975,1.0,1.0,1.0,0.975,,,0.95,0.925,0.975,1.0,0.975
+NP_002684.1,0.896,0.952,0.899,0.965,0.943,0.984,,0.916,0.964,0.958,0.951,0.962
+NP_658985.2,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0
+NP_001245150.1,1.0,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_005600.1,0.876,0.876,0.876,0.895,0.905,0.889,0.867,0.681,0.829,0.924,0.924,0.905
+NP_065793.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_476429.2,0.9,1.0,1.0,0.9,1.0,,1.0,0.778,0.9,1.0,1.0,1.0
+NP_006112.3,0.897,0.851,0.841,0.837,0.928,0.979,0.933,,0.942,0.909,0.928,0.995
+NP_000415.2,0.946,0.918,0.951,0.916,0.966,0.997,0.917,0.795,0.909,0.989,0.973,0.96
+NP_000170.1,0.976,0.698,0.974,0.96,0.983,0.923,0.924,0.831,0.966,0.966,0.975,0.982
+NP_004620.1,0.821,1.0,0.964,0.964,0.929,0.917,0.786,,0.857,1.0,1.0,1.0
+NP_009005.1,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001128243.1,0.722,1.0,0.944,1.0,1.0,0.75,1.0,0.667,1.0,1.0,1.0,1.0
+NP_002430.3,0.6,0.4,0.85,0.925,0.95,0.556,0.7,0.727,0.0,0.25,0.3,0.75
+NP_002959.2,0.853,0.621,0.897,0.971,0.588,0.412,0.971,0.455,0.5,0.765,0.794,0.735
+NP_075044.2,0.5,0.741,0.167,0.417,0.562,0.611,0.375,0.685,0.759,0.63,0.685,0.63
+NP_612808.1,1.0,0.938,0.984,0.859,0.375,0.846,0.547,0.767,0.953,1.0,1.0,0.969
+NP_036214.2,0.72,0.8,0.9,1.0,0.8,0.929,0.24,0.64,0.4,0.8,0.6,0.54
+NP_006763.2,,0.746,,,0.87,0.856,,0.757,0.91,0.902,0.919,0.892
+NP_001007227.1,0.25,0.75,0.25,0.667,0.167,,,0.5,1.0,0.167,0.667,0.417
+NP_003592.3,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001442.2,0.979,0.812,0.938,0.938,0.938,1.0,0.792,0.688,0.958,0.958,0.979,0.979
+NP_058432.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.938,1.0,1.0,1.0,1.0
+NP_004443.3,1.0,1.0,1.0,1.0,1.0,,,0.75,1.0,1.0,1.0,1.0
+NP_060004.3,0.75,1.0,0.594,0.938,0.812,1.0,,0.562,1.0,0.875,1.0,0.938
+NP_002461.2,0.919,0.932,,,0.827,0.861,0.645,,0.934,0.927,0.944,0.939
+NP_689505.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.75,1.0,1.0,1.0,1.0
+NP_000103.2,0.863,0.882,0.897,0.882,0.882,0.846,0.667,0.667,0.922,0.931,0.892,0.961
+NP_036320.2,1.0,1.0,1.0,1.0,1.0,,1.0,0.75,1.0,1.0,1.0,1.0
+NP_001029027.1,0.845,0.929,0.929,0.893,0.929,,0.857,0.845,0.976,0.976,0.952,0.929
+NP_006149.2,0.769,0.985,,,,,0.785,,0.892,1.0,0.954,0.862
+NP_115647.2,0.575,1.0,0.9,0.9,0.883,1.0,0.867,0.838,0.85,0.975,0.975,0.975
+NP_056062.1,0.35,0.9,0.3,0.8,0.7,,,0.8,0.8,0.7,0.8,0.8
+NP_005893.1,1.0,0.957,1.0,1.0,1.0,,0.848,0.478,1.0,1.0,1.0,1.0
+NP_055454.1,,0.952,,,0.929,0.833,,0.881,1.0,0.952,1.0,1.0
+NP_000041.2,0.914,0.908,0.947,0.98,1.0,,,0.789,0.921,0.967,0.987,0.987
+NP_660208.2,0.891,0.985,0.925,0.88,,1.0,0.94,0.917,0.917,0.97,0.932,0.977
+NP_201583.2,0.633,0.673,0.735,0.735,0.643,0.583,,0.714,0.673,0.837,0.755,0.714
+NP_114432.2,0.875,0.927,0.854,0.823,0.83,0.86,0.969,0.75,0.812,0.865,0.896,0.969
+NP_071899.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_898898.1,0.926,0.889,0.944,0.981,0.963,1.0,0.963,0.944,0.907,1.0,1.0,1.0
+NP_000773.2,0.948,0.844,0.968,0.909,1.0,1.0,0.844,0.87,1.0,0.961,1.0,0.961
+NP_036579.2,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000807.2,,0.936,,,0.926,0.996,0.86,0.906,0.977,0.977,0.99,0.99
+NP_000137.2,0.667,0.667,0.667,1.0,0.667,,0.667,1.0,1.0,0.667,0.667,0.333
+NP_071764.3,1.0,1.0,0.985,0.985,1.0,1.0,,1.0,0.97,1.0,0.985,1.0
+NP_006028.2,0.971,1.0,0.99,0.962,0.981,0.5,0.981,0.981,0.981,1.0,0.962,1.0
+NP_057616.2,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002543.2,0.5,0.875,0.5,1.0,0.875,,0.625,1.0,0.875,0.875,0.875,0.75
+NP_001136256.1,0.732,0.75,0.875,1.0,0.911,,,0.929,0.75,0.964,0.964,0.964
+NP_031399.2,0.0,0.571,0.821,0.071,0.714,0.5,0.143,0.571,0.143,0.5,0.286,0.857
+NP_003325.2,0.633,0.7,0.567,0.867,0.283,0.056,0.7,0.167,0.2,0.567,0.5,0.633
+NP_006746.1,1.0,1.0,1.0,1.0,1.0,,0.5,0.75,1.0,1.0,1.0,1.0
+NP_114032.2,0.729,0.598,0.979,0.698,0.921,0.847,0.874,0.706,0.921,0.932,0.863,0.779
+NP_000717.2,1.0,1.0,0.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_115834.1,0.733,0.8,0.733,0.667,0.933,0.5,,0.733,0.733,0.867,0.733,0.867
+NP_056029.2,0.893,0.929,0.893,0.857,1.0,1.0,,0.857,0.929,0.929,1.0,0.857
+NP_006171.2,0.917,0.667,0.875,0.875,0.667,0.222,0.792,0.562,0.917,0.75,0.917,0.875
+NP_004230.2,0.767,0.793,0.767,0.672,0.69,0.889,0.638,0.517,0.897,0.69,0.621,0.741
+NP_005582.1,1.0,1.0,1.0,1.0,1.0,,0.969,0.938,1.0,1.0,1.0,1.0
+NP_004388.2,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_663304.1,0.982,0.864,0.805,0.955,0.945,0.974,0.755,0.627,0.973,0.964,0.973,0.945
+NP_000268.1,0.98,0.462,0.957,0.943,0.972,0.954,0.927,0.599,0.901,0.911,0.94,0.99
+NP_000308.1,0.192,0.0,0.192,0.0,0.115,0.579,0.077,0.308,0.115,0.115,0.115,0.077
+NP_005594.2,0.938,0.781,0.927,0.979,0.927,0.906,,0.615,0.917,0.917,0.917,0.958
+NP_066566.3,1.0,0.833,0.667,1.0,,,,1.0,1.0,1.0,0.833,1.0
+NP_751951.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006286.1,0.769,0.981,0.704,0.87,0.87,,,0.63,0.981,0.704,0.796,0.926
+NP_015566.1,0.817,0.799,0.764,0.807,0.828,0.829,0.837,0.751,0.78,0.852,0.84,0.89
+NP_001122317.1,0.843,0.91,0.851,0.869,0.802,0.841,,0.525,0.849,0.885,0.901,0.937
+NP_003061.3,0.97,0.945,0.944,0.963,0.952,0.934,,0.893,0.975,0.975,0.983,0.981
+NP_036361.1,0.667,1.0,1.0,1.0,1.0,,,0.833,1.0,1.0,1.0,1.0
+NP_002652.2,0.781,0.725,,,0.75,0.231,0.862,0.825,0.75,0.925,0.888,0.875
+NP_689482.1,1.0,,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_001018494.1,1.0,0.667,0.833,1.0,,,0.833,0.5,0.667,0.667,0.667,0.833
+NP_001284478.1,0.5,0.5,,,1.0,1.0,0.5,0.75,1.0,1.0,0.5,0.5
+NP_001073936.1,0.927,0.868,0.951,0.903,0.993,1.0,0.924,0.806,0.951,0.986,0.993,1.0
+NP_037460.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,,1.0,1.0
+NP_066918.2,0.889,1.0,0.889,0.889,0.889,0.667,,0.944,1.0,0.889,0.889,0.889
+NP_071934.3,0.951,0.911,0.93,0.953,0.959,0.939,0.958,0.379,0.954,0.944,0.981,0.874
+NP_689597.1,1.0,,1.0,1.0,1.0,,0.5,0.375,1.0,1.0,1.0,1.0
+NP_149107.4,0.667,0.852,0.741,0.889,0.667,1.0,0.741,0.87,0.704,1.0,0.963,1.0
+NP_057116.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.929,1.0,1.0,1.0,1.0
+NP_057623.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001153844.1,,0.667,,,1.0,1.0,,0.833,1.0,1.0,0.833,0.833
+NP_004853.2,0.914,0.886,,,0.957,0.944,,0.714,0.971,0.914,0.971,1.0
+NP_000412.4,0.988,0.741,0.824,1.0,1.0,1.0,1.0,0.831,0.988,1.0,0.976,1.0
+NP_112483.1,0.875,1.0,1.0,0.875,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0
+NP_056085.1,1.0,1.0,1.0,1.0,0.333,,1.0,1.0,1.0,0.667,0.667,1.0
+NP_079163.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060658.3,0.933,0.8,0.911,0.867,0.756,1.0,0.822,0.844,0.911,0.911,0.978,0.956
+NP_002927.2,1.0,0.667,1.0,0.667,0.667,1.0,0.667,0.667,1.0,0.667,0.667,0.667
+NP_000500.2,0.917,1.0,1.0,1.0,0.917,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001547.1,0.667,1.0,0.778,0.611,0.812,1.0,,0.833,0.944,0.889,0.778,0.889
+NP_001914.3,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.5,0.0,0.0,0.0,0.0
+NP_001148.1,0.351,0.842,0.386,0.246,0.368,0.556,,0.105,0.123,0.298,0.368,0.386
+NP_001377.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_009009.1,0.881,0.237,0.873,0.864,0.949,0.974,0.602,0.763,0.729,0.356,0.458,0.831
+NP_056150.1,0.894,0.865,0.916,0.915,0.919,0.765,0.865,0.824,0.76,0.918,0.883,0.866
+NP_000171.1,0.982,0.806,0.992,0.945,0.98,0.609,0.941,0.818,0.901,0.984,0.968,0.972
+NP_036234.3,0.834,0.656,0.815,0.966,0.829,0.922,,,0.925,0.782,0.824,0.864
+NP_001421.2,0.99,0.969,0.98,0.918,0.878,0.95,0.99,0.929,0.949,0.99,0.898,0.98
+NP_001785.2,1.0,1.0,,,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_002519.2,0.5,0.5,1.0,1.0,1.0,0.0,0.5,0.0,1.0,1.0,1.0,1.0
+NP_065394.1,0.95,0.75,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0
+NP_001097.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.875,1.0,1.0,1.0,1.0
+NP_000262.2,0.925,0.853,0.927,0.942,0.968,0.942,0.86,0.781,0.955,0.969,0.967,0.954
+NP_060167.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_003840.2,1.0,0.833,1.0,1.0,0.833,0.667,0.833,0.417,0.833,1.0,1.0,1.0
+NP_001307127.1,0.5,1.0,1.0,0.75,1.0,,,0.75,1.0,0.75,0.75,0.75
+NP_060599.1,,1.0,,,0.889,,0.889,0.5,1.0,1.0,1.0,1.0
+NP_002462.2,0.721,0.76,0.779,0.75,0.692,0.841,0.635,,0.788,0.779,0.837,0.817
+NP_001137326.1,0.0,1.0,1.0,1.0,1.0,,0.0,0.0,0.0,1.0,0.0,1.0
+NP_000331.1,0.889,1.0,0.861,1.0,0.944,,1.0,0.917,0.833,0.889,0.833,0.944
+NP_001001438.1,0.893,0.929,0.971,0.914,0.914,,,0.9,0.9,1.0,0.943,1.0
+NP_659501.1,0.625,0.75,0.75,1.0,1.0,,0.75,0.75,1.0,1.0,1.0,1.0
+NP_000012.1,0.779,0.995,0.809,0.843,0.792,1.0,0.779,0.843,0.755,0.753,0.784,0.745
+NP_001531.1,0.685,0.648,0.704,0.944,0.889,0.824,0.704,0.778,0.815,0.759,0.759,0.87
+NP_005334.1,1.0,1.0,0.927,0.993,0.98,1.0,,0.807,1.0,1.0,1.0,0.993
+NP_001004051.1,1.0,1.0,1.0,1.0,0.75,1.0,,1.0,0.75,1.0,1.0,0.75
+NP_000425.1,0.721,0.853,0.89,0.779,0.529,0.933,,0.757,0.794,0.662,0.735,0.765
+NP_878314.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001671.2,1.0,,1.0,1.0,,,1.0,1.0,0.667,1.0,1.0,1.0
+NP_612468.1,1.0,0.667,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001510.2,1.0,0.844,,,1.0,1.0,0.938,0.969,1.0,1.0,1.0,1.0
+NP_005212.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.667,1.0,1.0,1.0,1.0
+NP_001431.1,0.833,0.833,0.5,1.0,1.0,,0.833,0.667,1.0,1.0,1.0,1.0
+NP_000280.1,1.0,0.75,1.0,1.0,1.0,,,0.875,1.0,1.0,1.0,1.0
+NP_002176.2,0.979,0.938,0.938,0.922,0.98,1.0,0.875,0.75,0.771,1.0,1.0,1.0
+NP_000174.1,1.0,1.0,1.0,1.0,1.0,,0.667,0.367,1.0,1.0,1.0,1.0
+NP_001242976.1,1.0,,,,,,0.917,1.0,1.0,1.0,1.0,1.0
+NP_003826.2,,1.0,,,,1.0,0.818,0.955,0.909,1.0,1.0,1.0
+NP_004174.1,0.926,0.963,0.941,0.995,0.932,,0.948,0.785,0.99,0.933,0.97,0.975
+NP_002063.2,1.0,1.0,1.0,1.0,1.0,,0.75,0.5,1.0,1.0,1.0,1.0
+NP_002058.2,1.0,1.0,1.0,1.0,1.0,,0.833,1.0,1.0,1.0,1.0,1.0
+NP_001963.1,0.703,0.829,,,0.96,0.967,,0.652,0.967,0.829,0.909,0.967
+NP_000011.2,0.89,0.859,0.904,0.857,0.846,0.961,0.759,0.805,0.923,0.923,0.919,0.885
+NP_001136029.1,0.9,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_056000.1,0.974,0.972,0.955,0.974,0.987,0.881,1.0,0.808,0.923,1.0,1.0,0.987
+NP_001877.1,0.5,0.75,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_997464.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.833,1.0,1.0,1.0,1.0
+NP_758872.1,0.567,0.857,0.433,0.5,,0.906,0.866,0.697,0.855,0.885,0.874,0.909
+NP_003404.1,1.0,1.0,1.0,1.0,1.0,1.0,0.762,0.714,1.0,1.0,1.0,1.0
+NP_002829.3,1.0,1.0,,,,,0.909,0.818,0.909,1.0,1.0,1.0
+NP_065168.2,1.0,0.923,1.0,1.0,1.0,1.0,1.0,1.0,0.923,1.0,1.0,1.0
+NP_000738.2,0.854,0.917,0.854,0.792,0.875,0.667,,0.396,0.875,1.0,0.917,0.958
+NP_056512.5,0.886,0.743,1.0,1.0,,,,1.0,1.0,1.0,1.0,1.0
+NP_001121897.1,0.968,0.994,0.994,0.99,0.833,0.993,0.962,0.984,0.965,0.965,0.965,0.981
+NP_000452.2,0.793,0.986,0.338,1.0,0.959,1.0,,,1.0,0.938,1.0,0.952
+NP_066279.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_005673.3,0.805,0.896,0.865,0.935,,0.877,1.0,0.839,0.792,0.938,0.901,0.958
+NP_000353.1,0.944,0.778,0.75,0.889,0.667,1.0,0.889,0.389,1.0,0.889,1.0,0.944
+NP_055981.1,0.88,0.747,0.901,0.802,0.87,1.0,,0.907,0.679,0.821,0.975,0.87
+NP_009006.2,0.89,0.876,0.874,0.875,1.0,0.833,1.0,0.618,0.88,0.921,0.908,0.862
+NP_000776.1,0.93,0.368,0.93,0.912,0.965,1.0,0.772,0.632,0.912,0.965,0.965,0.947
+NP_055400.1,0.848,0.966,0.88,0.907,0.907,0.979,0.936,0.934,0.985,0.922,0.956,0.936
+NP_001269333.1,0.917,1.0,0.917,1.0,,,0.75,0.667,0.917,1.0,1.0,1.0
+NP_055723.1,0.812,0.625,0.75,1.0,0.75,1.0,1.0,0.438,1.0,1.0,1.0,1.0
+NP_001866.2,0.896,0.756,0.912,0.863,0.939,0.882,0.83,0.796,0.929,0.961,0.95,0.952
+NP_000040.1,0.896,0.954,0.867,0.926,0.863,1.0,1.0,0.923,0.907,0.911,0.958,0.958
+NP_057425.3,1.0,1.0,1.0,1.0,0.947,0.8,1.0,0.789,1.0,1.0,1.0,1.0
+NP_001026859.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000657.1,,1.0,,,,,0.667,0.875,1.0,0.75,0.75,1.0
+NP_056348.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001018123.1,0.961,1.0,0.934,1.0,0.947,1.0,0.763,0.987,0.974,0.947,0.947,0.974
+NP_001137458.1,0.9,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0
+NP_004628.4,1.0,0.917,1.0,0.917,1.0,1.0,1.0,,1.0,1.0,1.0,1.0
+NP_002825.3,0.922,0.836,0.895,0.96,0.915,0.864,0.852,0.51,0.948,0.97,0.976,0.926
+NP_001075.1,0.833,0.972,0.889,1.0,0.972,1.0,0.944,0.764,0.889,0.944,1.0,0.972
+NP_001195.2,0.919,0.949,0.797,0.99,0.898,0.958,0.873,0.785,0.982,0.969,0.988,0.978
+NP_000511.2,0.748,0.875,0.72,0.711,0.838,,0.782,0.882,0.731,0.844,0.811,0.861
+NP_001243556.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.9,0.8,1.0,1.0,1.0
+NP_002232.2,,1.0,,,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_005548.2,0.897,0.994,0.838,0.812,0.919,1.0,0.756,0.825,0.962,0.95,0.956,0.969
+NP_000517.3,0.99,0.952,0.99,0.933,0.969,0.964,0.941,0.709,0.938,0.986,0.976,0.993
+NP_660201.1,0.95,0.9,0.9,0.8,0.925,,0.8,0.65,0.8,0.85,0.9,0.9
+NP_055129.2,0.853,0.676,0.735,0.824,0.971,0.75,0.941,0.603,0.941,0.912,1.0,1.0
+NP_036288.2,0.615,0.385,,,,0.75,0.692,0.333,0.538,0.538,0.538,0.462
+NP_060221.2,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001278650.1,0.751,0.687,0.821,0.685,,,0.837,0.497,0.956,0.89,0.835,0.879
+NP_852122.1,0.937,0.878,0.902,0.894,0.939,0.971,0.784,0.714,0.906,0.939,0.931,0.943
+NP_003312.3,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_006570.1,0.878,0.891,0.862,0.88,0.917,0.944,0.812,0.82,0.88,0.87,0.906,0.901
+NP_002056.2,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_071919.2,0.0,0.5,0.0,0.5,0.5,0.0,0.0,0.5,0.0,0.0,0.0,0.0
+NP_000413.1,,0.929,,,0.98,1.0,,0.837,0.878,0.949,0.929,0.969
+NP_037377.1,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.667,1.0,1.0,1.0,1.0
+NP_006355.2,1.0,0.5,0.75,0.5,1.0,,0.5,0.75,1.0,0.5,0.5,1.0
+NP_004965.1,0.991,0.894,,,0.959,0.98,,0.783,1.0,1.0,1.0,1.0
+NP_937859.1,1.0,1.0,0.333,1.0,1.0,1.0,0.667,0.833,1.0,1.0,1.0,1.0
+NP_000208.2,0.944,1.0,1.0,0.944,0.944,,0.833,0.944,1.0,1.0,1.0,1.0
+NP_001265273.1,0.969,0.955,,,,,0.509,0.653,0.693,0.746,0.728,0.842
+NP_001316843.1,1.0,0.5,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_071449.1,0.938,0.938,1.0,0.875,1.0,1.0,,0.688,1.0,1.0,1.0,1.0
+NP_000173.2,0.992,0.977,0.981,0.977,1.0,1.0,0.846,0.95,1.0,0.985,1.0,0.954
+NP_477520.2,0.85,0.867,0.817,1.0,1.0,0.8,0.867,0.767,1.0,0.967,1.0,1.0
+NP_001123575.1,1.0,0.875,1.0,1.0,1.0,,0.875,0.875,1.0,0.875,1.0,1.0
+NP_001229804.1,1.0,0.962,0.929,0.821,,,0.964,0.923,1.0,1.0,0.964,1.0
+NP_071450.2,1.0,0.917,1.0,1.0,1.0,1.0,1.0,0.854,0.958,1.0,1.0,1.0
+NP_054701.1,0.35,0.7,0.25,0.45,0.6,1.0,0.9,0.0,0.35,0.55,0.45,0.4
+NP_001096.1,0.95,0.975,0.888,1.0,0.925,0.95,,0.95,0.95,0.975,0.975,0.95
+NP_940905.2,1.0,1.0,1.0,1.0,0.9,,1.0,1.0,0.9,1.0,1.0,1.0
+NP_036611.2,1.0,0.786,1.0,1.0,1.0,1.0,0.929,0.625,1.0,1.0,1.0,1.0
+NP_001119578.1,1.0,0.875,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_060661.3,0.8,0.3,0.8,0.4,0.9,,,0.25,0.5,0.8,0.5,1.0
+NP_004915.2,1.0,0.958,0.906,0.854,1.0,1.0,,0.792,0.917,0.854,0.917,1.0
+NP_490647.1,0.833,0.806,0.917,1.0,,1.0,0.75,0.25,1.0,1.0,1.0,1.0
+NP_005404.1,0.792,0.917,0.875,1.0,1.0,1.0,1.0,0.722,1.0,1.0,1.0,1.0
+NP_055595.2,0.92,0.955,0.955,0.966,1.0,0.833,0.67,0.807,0.875,0.886,0.909,0.977
+NP_001269210.1,0.969,0.962,0.938,0.985,,,0.923,0.923,0.969,0.954,0.969,0.923
+NP_000894.1,0.75,1.0,0.75,1.0,1.0,0.75,1.0,0.875,1.0,0.75,1.0,0.75
+NP_954634.1,1.0,1.0,0.875,0.75,,,1.0,0.875,1.0,1.0,1.0,0.75
+NP_006558.1,0.987,0.915,0.987,0.98,0.961,0.846,,0.928,0.993,0.935,0.961,0.954
+NP_068380.3,1.0,0.5,0.833,1.0,1.0,0.75,1.0,1.0,1.0,1.0,1.0,1.0
+NP_113606.2,1.0,0.958,0.969,1.0,1.0,0.875,0.979,0.875,1.0,0.917,0.958,0.979
+NP_000509.1,0.694,0.654,0.604,0.661,0.756,0.663,,,0.718,0.766,0.766,0.746
+NP_000251.3,0.966,0.892,0.962,0.911,0.95,0.92,0.835,0.66,0.899,0.959,0.948,0.92
+NP_060829.2,0.889,1.0,1.0,1.0,1.0,1.0,0.833,0.778,1.0,1.0,1.0,1.0
+NP_115822.1,0.929,0.952,0.952,0.952,0.857,0.833,,0.714,0.81,1.0,1.0,0.905
+NP_008986.2,0.848,0.946,0.862,1.0,1.0,0.4,0.893,0.714,0.821,0.991,0.982,0.955
+NP_473368.1,1.0,1.0,1.0,0.333,1.0,,0.333,1.0,0.0,1.0,1.0,0.333
+NP_002624.2,1.0,0.7,0.96,0.92,1.0,1.0,0.84,0.8,0.96,1.0,0.96,0.96
+NP_006851.1,1.0,1.0,0.667,1.0,,,0.833,1.0,0.5,1.0,0.5,0.833
+NP_006757.2,0.887,0.898,0.904,0.904,0.609,0.618,0.935,0.696,0.899,0.879,0.933,0.897
+NP_036462.2,,0.855,,,0.261,0.514,,0.705,0.68,0.657,0.639,0.675
+NP_872354.1,1.0,1.0,1.0,1.0,,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_001127835.2,0.891,0.918,0.944,0.862,,0.94,,0.764,0.965,0.992,0.976,0.992
+NP_003127.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_775901.3,,0.805,,,0.839,0.667,0.632,0.931,0.471,0.839,0.862,0.816
+NP_006504.2,1.0,0.0,0.5,0.0,1.0,,0.0,1.0,0.0,1.0,1.0,1.0
+NP_112503.1,0.783,0.867,0.717,0.5,0.933,1.0,,0.833,0.767,0.933,0.9,0.933
+NP_000608.1,1.0,1.0,1.0,1.0,1.0,1.0,,0.833,0.833,1.0,1.0,1.0
+NP_004195.2,1.0,1.0,1.0,1.0,1.0,1.0,,0.846,0.941,1.0,1.0,1.0
+NP_005439.2,0.5,0.393,0.518,0.214,0.518,0.667,0.5,0.679,0.429,0.571,0.464,0.393
+NP_001309774.1,0.941,0.941,0.971,0.882,,1.0,0.647,0.882,1.0,1.0,0.941,0.882
+NP_003038.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_036407.1,1.0,1.0,1.0,1.0,0.0,,,0.0,1.0,1.0,1.0,1.0
+NP_000508.1;NP_000549.1,0.884,0.635,0.686,0.872,,0.646,0.731,0.663,0.863,0.867,0.88,0.855
+NP_000421.1,0.81,0.926,0.844,0.896,0.763,0.78,0.657,0.542,0.882,0.828,0.879,0.875
+NP_689831.2,1.0,0.964,1.0,1.0,0.982,1.0,0.927,0.964,1.0,0.964,0.982,1.0
+NP_001136.1,0.667,1.0,0.667,0.5,0.667,1.0,,1.0,0.833,0.667,0.667,0.667
+NP_001821.2,0.873,0.897,0.956,0.915,0.938,0.898,0.868,0.638,0.901,0.971,0.934,1.0
+NP_116056.3,0.472,0.389,0.278,0.722,0.556,0.8,0.417,0.75,0.667,0.611,0.722,0.611
+NP_005948.3,0.85,0.878,0.911,0.837,0.861,0.989,,0.82,0.944,0.918,0.811,0.885
+NP_000710.5,0.952,0.974,0.89,0.966,1.0,0.714,,0.909,1.0,0.987,0.997,0.988
+NP_001952.1,1.0,0.667,0.333,1.0,1.0,0.0,0.667,0.667,1.0,1.0,1.0,1.0
+NP_000288.1,0.986,0.51,0.948,1.0,1.0,0.977,0.885,0.871,0.923,0.979,0.993,1.0
+NP_005903.2,0.963,0.741,0.954,0.852,0.944,1.0,0.944,0.796,0.981,0.907,0.981,0.981
+NP_079160.1,1.0,0.792,1.0,0.938,,1.0,1.0,0.885,0.979,0.958,0.958,1.0
+NP_000086.2,0.879,0.591,0.868,0.908,0.851,0.735,0.895,0.748,0.903,0.861,0.886,0.952
+NP_112235.2,1.0,0.4,,,,1.0,1.0,0.6,1.0,0.6,1.0,1.0
+NP_000207.2,0.918,0.894,0.947,0.906,0.865,0.857,0.871,0.835,1.0,0.906,0.941,0.929
+NP_597716.1,1.0,1.0,0.95,1.0,0.95,1.0,,1.0,1.0,1.0,1.0,0.9
+NP_000089.1,0.743,0.792,0.751,0.801,0.884,0.926,0.798,0.836,0.854,0.854,0.818,0.866
+NP_742105.1,0.961,0.843,0.883,0.988,0.921,0.985,0.896,0.787,0.986,0.956,0.96,0.976
+NP_000835.1,0.929,1.0,0.857,1.0,1.0,0.833,1.0,0.857,1.0,1.0,1.0,1.0
+NP_115759.2,0.722,1.0,0.833,0.833,0.833,0.0,0.889,0.917,0.722,0.889,0.889,0.889
+NP_003099.1,0.986,0.878,0.973,1.0,0.986,0.932,1.0,1.0,0.973,0.973,0.973,0.986
+NP_003098.1,0.667,0.833,0.583,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_060575.1,1.0,0.889,1.0,0.944,1.0,1.0,0.667,0.611,1.0,1.0,0.944,0.944
+NP_000222.2,0.583,0.75,0.708,0.75,0.833,1.0,0.917,0.667,0.667,0.833,0.917,0.833
+NP_001003841.1,0.945,0.945,0.951,0.923,0.978,1.0,0.923,0.852,0.967,1.0,1.0,1.0
+NP_002393.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001129277.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_115604.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000536.6,0.94,0.918,0.926,0.958,,0.898,0.896,0.825,0.943,0.948,0.929,0.96
+NP_000449.1,0.885,0.954,,,0.826,0.85,,0.73,0.913,0.866,0.855,0.878
+NP_149075.2,0.971,1.0,0.961,1.0,1.0,1.0,1.0,1.0,0.784,0.941,0.98,1.0
+NP_694955.2,0.75,0.881,0.679,0.821,0.69,1.0,,0.679,0.81,0.667,0.81,0.81
+NP_058637.1,0.818,0.818,0.773,0.864,0.909,1.0,0.864,0.545,1.0,0.818,1.0,0.909
+NP_000122.1,0.875,0.858,0.871,0.95,0.917,0.983,0.942,0.942,1.0,0.974,0.965,0.983
+NP_570971.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_997304.3,0.971,0.886,,,1.0,,,,0.886,1.0,1.0,0.886
+NP_065683.2,1.0,1.0,1.0,1.0,1.0,,,0.972,0.944,1.0,1.0,1.0
+NP_787110.2,0.875,0.583,0.875,0.812,,,0.458,0.219,0.854,0.905,0.881,0.929
+NP_000132.3,0.92,0.99,0.807,0.977,0.975,1.0,0.777,0.993,0.998,0.968,0.959,0.959
+NP_000023.2,,0.891,,,0.776,0.758,,0.858,0.921,0.855,0.921,0.903
+NP_066970.3,0.786,1.0,0.857,0.857,1.0,1.0,0.786,1.0,1.0,1.0,1.0,1.0
+NP_115969.2,1.0,0.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_005680.1,0.955,1.0,0.818,1.0,1.0,1.0,1.0,0.818,1.0,0.818,1.0,1.0
+NP_942559.1,,1.0,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_000416.1,0.922,0.899,0.894,0.944,0.918,0.899,0.776,0.864,0.966,0.948,0.951,0.94
+NP_060950.3,0.6,1.0,1.0,0.2,0.8,0.0,1.0,1.0,0.6,0.8,0.6,1.0
+NP_005391.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_079412.1,0.5,,0.75,0.75,0.75,,1.0,,0.5,1.0,1.0,1.0
+NP_005668.2,0.84,0.8,0.89,0.78,0.92,1.0,0.92,0.65,0.74,0.82,0.86,0.78
+NP_001357524.1,,,,,,,,,0.286,0.857,0.857,1.0
+NP_060106.2,0.714,0.952,0.905,0.762,0.286,1.0,0.905,0.588,0.81,0.429,0.381,0.714
+NP_057206.2,0.75,0.75,0.65,0.85,,,,,0.85,1.0,1.0,1.0
+NP_001373.2,0.923,0.923,1.0,0.846,0.923,,,1.0,0.923,0.923,1.0,1.0
+NP_060597.4,0.78,0.711,0.77,0.9,0.86,0.4,0.7,0.91,0.88,0.84,0.84,0.8
+NP_071415.1,1.0,0.991,1.0,0.981,1.0,1.0,0.763,0.798,0.982,0.991,1.0,1.0
+NP_958850.1,1.0,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_036196.1,1.0,1.0,0.917,1.0,1.0,,0.917,0.875,1.0,1.0,0.917,1.0
+NP_000477.1,0.982,0.875,0.946,0.911,0.893,1.0,0.875,0.562,0.911,1.0,0.982,0.893
+NP_000093.1,0.957,0.967,1.0,1.0,1.0,1.0,0.95,1.0,1.0,1.0,1.0,1.0
+NP_004532.1,0.0,1.0,1.0,0.0,,0.0,0.0,0.0,1.0,1.0,1.0,1.0
+NP_065731.3,1.0,0.9,0.9,0.9,,1.0,0.7,0.4,1.0,1.0,0.9,1.0
+NP_060342.2,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001303260.1,,,,,,,,,1.0,1.0,0.75,0.75
+NP_004165.2,,0.864,,,,1.0,1.0,0.955,0.955,1.0,1.0,1.0
+NP_000387.1,0.961,0.961,0.987,1.0,1.0,0.98,0.974,0.974,1.0,1.0,1.0,1.0
+NP_001035940.1,0.917,1.0,1.0,0.875,1.0,1.0,1.0,0.929,0.958,1.0,1.0,0.958
+NP_000457.1,0.937,0.85,0.94,0.978,0.928,0.923,0.845,0.874,0.952,0.942,0.99,0.99
+NP_001380.2,1.0,1.0,0.944,0.778,0.889,0.5,0.667,0.833,0.889,0.778,0.889,0.889
+NP_002476.2,0.574,0.824,0.779,0.853,0.809,0.65,0.765,0.515,0.706,0.824,0.794,0.824
+NP_078790.2,1.0,1.0,0.833,1.0,1.0,,1.0,0.833,1.0,1.0,1.0,1.0
+NP_005934.2,1.0,1.0,1.0,1.0,1.0,1.0,0.778,1.0,0.875,0.938,0.875,0.938
+NP_004456.1,1.0,1.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001092138.1,1.0,0.167,0.917,1.0,,,,0.75,1.0,1.0,1.0,1.0
+NP_001469.1,0.952,0.667,0.952,0.952,1.0,0.75,1.0,1.0,0.905,0.952,1.0,1.0
+NP_060222.2,1.0,0.8,0.925,0.9,0.725,1.0,0.9,1.0,0.85,1.0,0.95,0.95
+NP_079152.3,1.0,1.0,0.967,0.933,,0.5,1.0,0.95,0.867,0.867,0.867,1.0
+NP_002104.2,0.167,,0.375,0.0,0.0,,0.25,,0.167,0.0,0.0,0.083
+NP_002896.2,0.907,0.981,0.889,0.889,0.963,1.0,,0.926,1.0,0.963,1.0,0.944
+NP_067628.1,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_061850.2,0.611,0.889,0.889,0.667,,,0.778,0.444,0.667,0.778,0.778,0.889
+NP_001702.1,0.0,1.0,0.5,0.25,0.0,1.0,1.0,0.5,0.75,0.5,0.5,0.75
+NP_001051.1,0.667,1.0,0.667,0.833,0.5,0.5,0.5,0.167,0.833,0.667,0.833,0.667
+NP_060307.2,0.45,0.8,0.85,0.6,1.0,,0.0,0.2,0.8,0.7,1.0,0.8
+NP_000303.1,0.968,0.819,,0.84,0.957,,0.862,0.83,0.989,0.947,0.968,0.915
+NP_009129.1,1.0,0.533,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_598399.2,0.972,0.955,0.903,0.931,1.0,1.0,0.903,0.944,0.931,1.0,1.0,0.986
+NP_001189333.2,0.911,0.727,0.893,0.821,,,0.818,0.664,0.903,0.842,0.867,0.836
+NP_056375.2,,0.919,0.357,1.0,0.883,,0.95,0.728,0.933,0.989,0.981,0.986
+NP_001922.2,1.0,0.923,1.0,1.0,1.0,1.0,0.808,0.904,1.0,0.981,1.0,1.0
+NP_068711.1,0.875,0.912,0.943,0.905,0.907,0.93,,0.858,0.97,0.921,0.881,0.97
+NP_001073136.1,0.75,1.0,0.774,0.952,,,0.964,0.903,1.0,1.0,1.0,1.0
+NP_000083.3,0.985,0.993,0.947,0.964,0.987,0.948,0.883,,0.983,0.976,0.974,0.984
+NP_789788.1,1.0,0.0,1.0,1.0,1.0,,1.0,1.0,1.0,1.0,1.0,1.0
+NP_075598.2,0.901,0.887,0.918,0.967,0.892,0.976,0.656,0.784,0.978,0.98,0.982,0.984
+NP_000135.2,,0.6,,,1.0,1.0,,1.0,1.0,1.0,1.0,1.0
+NP_002078.1,1.0,1.0,0.875,0.625,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_000803.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.75,1.0,1.0,1.0,1.0
+NP_001336644.1,,,,,,,,,1.0,1.0,1.0,1.0
+NP_068602.2,0.78,,0.765,0.85,0.97,1.0,0.81,0.97,0.87,0.88,0.87,0.83
+NP_848549.3,1.0,1.0,1.0,1.0,1.0,,,1.0,1.0,1.0,1.0,1.0
+NP_002769.1,0.905,0.976,1.0,1.0,1.0,1.0,0.905,0.881,1.0,1.0,0.976,0.929
+NP_060319.1,0.7,0.6,1.0,0.8,0.8,1.0,,0.0,0.8,0.6,0.6,0.2
+NP_002633.1,0.583,1.0,0.667,0.667,1.0,,,0.5,1.0,1.0,1.0,1.0
+NP_006412.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001129392.2,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_981932.1,0.906,0.833,1.0,0.938,1.0,,1.0,0.667,1.0,1.0,1.0,0.875
+NP_060592.2,0.988,0.893,1.0,0.917,0.962,0.9,,0.875,0.929,0.964,0.952,0.988
+NP_004599.2,0.611,0.889,0.722,0.556,0.778,,,0.667,0.556,0.667,0.556,0.556
+NP_005436.1,0.953,0.76,0.973,0.92,0.92,0.867,0.873,0.547,0.96,0.953,0.953,0.973
+NP_003915.2,1.0,0.889,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
+NP_001554.2,0.333,0.389,0.889,0.722,0.722,1.0,0.722,0.333,0.722,0.833,0.833,0.889
+NP_003097.1,0.956,1.0,0.967,1.0,1.0,0.938,0.844,0.711,1.0,0.978,1.0,0.978
+NP_002043.2,0.955,0.891,0.955,0.945,0.982,0.893,,0.945,0.818,0.982,1.0,1.0
+NP_001243169.1,0.9,0.75,,,,,0.4,0.55,0.7,0.7,0.5,0.75
+NP_000134.2,0.985,0.941,0.986,0.961,0.964,0.605,0.888,0.877,0.994,0.988,0.993,0.989
+NP_001186.1,0.9,1.0,0.9,1.0,0.8,1.0,0.7,0.8,1.0,0.9,1.0,1.0
+NP_005262.1,0.795,0.744,0.885,0.923,0.846,1.0,0.59,0.763,0.718,0.846,0.846,0.833
+NP_055940.3,0.671,0.979,0.862,1.0,0.917,0.911,0.8,,0.8,0.964,0.971,0.893
+NP_065069.1,0.5,0.5,0.5,0.5,0.5,0.0,,0.75,1.0,1.0,1.0,1.0
+NP_000162.2,0.981,0.821,0.986,0.978,1.0,1.0,0.926,0.723,0.994,1.0,1.0,1.0
+NP_000479.1,0.718,0.6,0.647,0.529,0.824,0.72,0.693,0.636,0.7,0.829,0.721,0.736
+NP_005586.1,0.688,0.5,1.0,0.75,1.0,,1.0,0.5,0.75,0.75,0.875,0.75
+NP_002492.2,0.962,0.902,0.94,0.962,0.674,0.924,0.783,0.728,0.826,0.957,0.837,0.946
diff --git a/config.json b/config.json
new file mode 100644
index 0000000..e94b999
--- /dev/null
+++ b/config.json
@@ -0,0 +1,142 @@
+{
+ "PG_data_location": "~/.cache/proteingym",
+ "model_list_zero_shot_substitutions_DMS":{
+ "Site_Independent":{"input_score_name":"prediction_independent", "location":"EVmutation", "directionality": 1, "key": "mutant", "model_type":"Alignment-based model"},
+ "EVmutation":{"input_score_name":"prediction_epistatic", "location":"EVmutation", "directionality": 1, "key": "mutant", "model_type":"Alignment-based model"},
+ "DeepSequence_single":{"input_score_name":"evol_indices_seed_1000", "location":"DeepSequence", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "DeepSequence_ensemble":{"input_score_name":"evol_indices_ensemble", "location":"DeepSequence", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "EVE_single":{"input_score_name":"evol_indices_seed_1000", "location":"EVE", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "EVE_ensemble":{"input_score_name":"evol_indices_ensemble", "location":"EVE", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "Unirep":{"input_score_name":"Unirep_score", "location":"UniRep", "directionality": -1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Unirep_evotune":{"input_score_name":"Unirep_score", "location":"UniRep_evotuned", "directionality": -1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "MSA_Transformer_single":{"input_score_name":"esm_msa1b_t12_100M_UR50S_seed1", "location":"MSA_Transformer", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "MSA_Transformer_ensemble":{"input_score_name":"esm_msa1b_t12_100M_UR50S_ensemble", "location":"MSA_Transformer", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "ESM1b":{"input_score_name":"esm1b_t33_650M_UR50S", "location":"ESM1b", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM1v_single":{"input_score_name":"esm1v_t33_650M_UR90S_1", "location":"ESM1v", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM1v_ensemble":{"input_score_name":"Ensemble_ESM1v", "location":"ESM1v", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_8M":{"input_score_name":"esm2_t6_8M_UR50D", "location":"ESM2/8M", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_35M":{"input_score_name":"esm2_t12_35M_UR50D", "location":"ESM2/35M", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_150M":{"input_score_name":"esm2_t30_150M_UR50D", "location":"ESM2/150M", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_650M":{"input_score_name":"esm2_t33_650M_UR50D", "location":"ESM2/650M", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_3B":{"input_score_name":"esm2_t36_3B_UR50D", "location":"ESM2/3B", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ESM2_15B":{"input_score_name":"esm2_t48_15B_UR50D", "location":"ESM2/15B", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Wavenet":{"input_score_name":"Wavenet_prediction", "location":"Wavenet", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "RITA_s":{"input_score_name":"RITA_score", "location":"RITA/small", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_m":{"input_score_name":"RITA_score", "location":"RITA/medium", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_l":{"input_score_name":"RITA_score", "location":"RITA/large", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_xl":{"input_score_name":"RITA_score", "location":"RITA/xlarge", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_small":{"input_score_name":"Progen2_score", "location":"Progen2/small", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_medium":{"input_score_name":"Progen2_score", "location":"Progen2/medium", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_base":{"input_score_name":"Progen2_score", "location":"Progen2/base", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_large":{"input_score_name":"Progen2_score", "location":"Progen2/large", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_xlarge":{"input_score_name":"Progen2_score", "location":"Progen2/xlarge", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "GEMME":{"input_score_name":"GEMME_score", "location":"GEMME", "directionality": 1, "key": "mutant","model_type":"Alignment-based model"},
+ "VESPA":{"input_score_name":"VESPA", "location":"VESPA", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "VESPAl":{"input_score_name":"VESPAl", "location":"VESPA", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ProtGPT2":{"input_score_name":"ProtGPT2_score", "location":"ProtGPT2", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_S_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_S", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_M_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_L_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_S":{"input_score_name":"avg_score", "location":"Tranception/Tranception_S", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Tranception_M":{"input_score_name":"avg_score", "location":"Tranception/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Tranception_L":{"input_score_name":"avg_score", "location":"Tranception/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_S":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_S", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_M":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_M", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_L":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_L", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "CARP_38M": {"input_score_name":"carp_38M_score","location":"CARP/carp_38M","directionality":1,"key":"mutant","model_type":"Protein language model"},
+ "CARP_600K": {"input_score_name":"carp_600k_score","location":"CARP/carp_600k","directionality":1,"key":"mutant","model_type":"Protein language model"},
+ "CARP_640M" : {"input_score_name":"carp_640M_score","location":"CARP/carp_640M","directionality":1,"key":"mutant","model_type":"Protein language model"},
+ "CARP_76M": {"input_score_name":"carp_76M_score","location":"CARP/carp_76M","directionality":1,"key":"mutant","model_type":"Protein language model"},
+ "MIF": {"input_score_name": "mif_score","location":"MIF/mif","directionality":1,"key":"mutant","model_type":"Inverse folding model"},
+ "MIFST":{ "input_score_name": "mifst_score","location":"MIF/mifst","directionality":1,"key":"mutant","model_type":"Hybrid - Structure & PLM"},
+ "ESM-IF1": {"input_score_name": "esmif1_ll","location":"ESM-IF1","directionality":1,"key":"mutant","model_type":"Inverse folding model"},
+ "ProteinMPNN": {"input_score_name": "pmpnn_ll","location":"ProteinMPNN","directionality":1,"key":"mutant","model_type":"Inverse folding model"},
+ "ProtSSN_k10_h512":{"input_score_name":"ProtSSN_k10_h512", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k10_h768":{"input_score_name":"ProtSSN_k10_h768", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k10_h1280":{"input_score_name":"ProtSSN_k10_h1280", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k20_h512":{"input_score_name":"ProtSSN_k20_h512", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k20_h768":{"input_score_name":"ProtSSN_k20_h768", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k20_h1280":{"input_score_name":"ProtSSN_k20_h1280", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k30_h512":{"input_score_name":"ProtSSN_k30_h512", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k30_h768":{"input_score_name":"ProtSSN_k30_h768", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_k30_h1280":{"input_score_name":"ProtSSN_k30_h1280", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "ProtSSN_ensemble":{"input_score_name":"ProtSSN_ensemble", "location":"ProtSSN", "directionality": 1, "key": "mutant","model_type":"Hybrid - Structure & PLM"},
+ "SaProt_650M_AF2":{"input_score_name":"SaProt_score", "location":"SaProt/SaProt_650M_AF2", "directionality": 1, "key": "mutant", "model_type":"Hybrid - Structure & PLM"},
+ "SaProt_35M_AF2":{"input_score_name":"SaProt_score", "location":"SaProt/SaProt_35M_AF2", "directionality": 1, "key": "mutant", "model_type":"Hybrid - Structure & PLM"}
+ },
+ "model_list_zero_shot_indels_DMS":{
+ "Unirep":{"input_score_name":"Unirep_score", "location":"UniRep", "directionality": -1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Wavenet":{"input_score_name":"Wavenet_prediction", "location":"Wavenet", "directionality": -1, "key": "mutated_sequence","model_type":"Alignment-based model"},
+ "RITA_s":{"input_score_name":"RITA_score", "location":"RITA/small", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_m":{"input_score_name":"RITA_score", "location":"RITA/medium", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_l":{"input_score_name":"RITA_score", "location":"RITA/large", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "RITA_xl":{"input_score_name":"RITA_score", "location":"RITA/xlarge", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_small":{"input_score_name":"Progen2_score", "location":"Progen2/small", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_medium":{"input_score_name":"Progen2_score", "location":"Progen2/medium", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_base":{"input_score_name":"Progen2_score", "location":"Progen2/base", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_large":{"input_score_name":"Progen2_score", "location":"Progen2/large", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "Progen2_xlarge":{"input_score_name":"Progen2_score", "location":"Progen2/xlarge", "directionality": 1, "key": "mutant","model_type":"Protein language model"},
+ "ProtGPT2":{"input_score_name":"ProtGPT2_score", "location":"ProtGPT2", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_S_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_S", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_M_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_L_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_S":{"input_score_name":"avg_score", "location":"Tranception/Tranception_S", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Tranception_M":{"input_score_name":"avg_score", "location":"Tranception/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Tranception_L":{"input_score_name":"avg_score", "location":"Tranception/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_S":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_S", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_M":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_M", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "TranceptEVE_L":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_L", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "HMM":{"input_score_name":"logprob", "location":"HMM", "directionality": 1, "key": "mutant","model_type":"Alignment-based model"},
+ "Provean":{"input_score_name":"Provean_score","location":"Provean","directionality":1,"key":"mutant","model_type":"Alignment-based model"}
+ },
+ "model_list_zero_shot_indels_clinical":{
+ "HMM":{"input_score_name":"logprob", "location":"HMM", "directionality": 1, "key": "mutated_sequence","model_type":"Alignment-based model"},
+ "Progen2_small":{"input_score_name":"Progen2_score", "location":"Progen2/small", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Progen2_medium":{"input_score_name":"Progen2_score", "location":"Progen2/medium", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Progen2_base":{"input_score_name":"Progen2_score", "location":"Progen2/base", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Progen2_large":{"input_score_name":"Progen2_score", "location":"Progen2/large", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Progen2_xlarge":{"input_score_name":"Progen2_score", "location":"Progen2/xlarge", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "ProtGPT2":{"input_score_name":"ProtGPT2_score", "location":"ProtGPT2", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Provean":{"input_score_name":"provean_score","location":"Provean","directionality":1,"key":"mutated_sequence","model_type":"Alignment-based model"},
+ "RITA_s":{"input_score_name":"RITA_score", "location":"RITA/small", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "RITA_m":{"input_score_name":"RITA_score", "location":"RITA/medium", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "RITA_l":{"input_score_name":"RITA_score", "location":"RITA/large", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "RITA_xl":{"input_score_name":"RITA_score", "location":"RITA/xlarge", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_M_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_L_no_retrieval":{"input_score_name":"avg_score", "location":"Tranception_no_retrieval/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Protein language model"},
+ "Tranception_M":{"input_score_name":"avg_score", "location":"Tranception/Tranception_M", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Tranception_L":{"input_score_name":"avg_score", "location":"Tranception/Tranception_L", "directionality": 1, "key": "mutated_sequence","model_type":"Hybrid - Alignment & PLM"},
+ "Unirep":{"input_score_name":"Unirep_score", "location":"Unirep", "directionality": -1, "key": "mutated_sequence","model_type":"Protein language model"}
+ },
+ "model_list_zero_shot_substitutions_clinical":{
+ "ClinPred":{"input_score_name":"ClinPred_score", "location":"ClinPred", "directionality": 1, "key": "mutant","model_type":"Clinical Predictor"},
+ "DANN":{"input_score_name":"DANN_score", "location":"DANN", "directionality": 1, "key": "mutant","model_type":"Clinical Predictor"},
+ "MutationTaster":{"input_score_name":"MutationTaster_converted_rankscore","location":"MutationTaster","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "CADD":{"input_score_name":"CADD_phred","location":"CADD","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "FATHMM":{"input_score_name":"FATHMM_score","location":"FATHMM","directionality":-1,"key":"mutant","model_type":"Clinical Predictor"},
+ "LRT":{"input_score_name":"LRT_score","location":"LRT","directionality":-1,"key":"mutant","model_type":"Clinical Predictor"},
+ "LIST-S2":{"input_score_name":"LIST-S2_score","location":"LIST-S2","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "PrimateAI":{"input_score_name":"PrimateAI_score","location":"PrimateAI","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "MutPred":{"input_score_name":"MutPred_score","location":"MutPred","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "MutationAssessor":{"input_score_name":"MutationAssessor_score","location":"MutationAssessor","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "SIFT4G":{"input_score_name":"SIFT4G_score","location":"SIFT4G","directionality":-1,"key":"mutant","model_type":"Clinical Predictor"},
+ "SIFT":{"input_score_name":"SIFT_score","location":"SIFT","directionality":-1,"key":"mutant","model_type":"Clinical Predictor"},
+ "PolyPhen2 (HDIV)":{ "input_score_name":"Polyphen2_HDIV_score","location":"Polyphen2_HDIV","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "MPC":{"input_score_name":"MPC_score","location":"MPC","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "Provean":{"input_score_name":"PROVEAN_score","location":"PROVEAN","directionality":-1,"key":"mutant","model_type":"Clinical Predictor"},
+ "DEOGEN2":{"input_score_name":"DEOGEN2_score","location":"DEOGEN2","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "PolyPhen2 (HVAR)":{"input_score_name":"Polyphen2_HVAR_score","location":"Polyphen2_HVAR","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "gMVP":{"input_score_name":"gMVP_score","location":"gMVP","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "VARITY (R)":{ "input_score_name":"VARITY_R_LOO_score","location":"VARITY_R_LOO","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "VARITY (ER)":{ "input_score_name":"VARITY_ER_LOO_score","location":"VARITY_ER_LOO","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "BayesDel (addAF)":{ "input_score_name":"BayesDel_addAF_score","location":"BayesDel_addAF","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "BayesDel (noAF)":{ "input_score_name":"BayesDel_noAF_score","location":"BayesDel_noAF","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "REVEL":{"input_score_name":"REVEL_score","location":"REVEL","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "VEST4":{"input_score_name":"VEST4_score","location":"VEST4","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "MetaRNN":{"input_score_name":"MetaRNN_score","location":"MetaRNN","directionality":1,"key":"mutant","model_type":"Clinical Predictor"},
+ "TranceptEVE_L":{"input_score_name":"avg_score", "location":"TranceptEVE/TranceptEVE_L", "directionality": 1, "key": "mutant","model_type":"Hybrid - Alignment & PLM"},
+ "GEMME":{"input_score_name":"GEMME_score", "location":"GEMME", "directionality": 1, "key": "mutant","model_type":"Alignment-based model"},
+ "EVE":{"input_score_name":"EVE_evol_index", "location":"EVE", "directionality": -1, "key": "mutant","model_type":"Alignment-based model"},
+ "ESM1b":{"input_score_name":"esm1b_t33_650M_UR50S", "location":"ESM1b", "directionality": 1, "key": "mutant","model_type":"Protein language model"}
+ }
+}
\ No newline at end of file
diff --git a/environments/protein_fitness_prediction_hsu.txt b/environments/protein_fitness_prediction_hsu.txt
new file mode 100644
index 0000000..62619df
--- /dev/null
+++ b/environments/protein_fitness_prediction_hsu.txt
@@ -0,0 +1,236 @@
+absl-py==0.9.0
+aiohttp==3.6.3
+aiohttp-cors==0.7.0
+aioredis==1.3.1
+aliyun-python-sdk-core @ file:///home/conda/feedstock_root/build_artifacts/aliyun-python-sdk-core_1649510078423/work
+aliyun-python-sdk-core-v3 @ file:///home/conda/feedstock_root/build_artifacts/aliyun-python-sdk-core-v3_1650200180106/work
+aliyun-python-sdk-kms @ file:///home/conda/feedstock_root/build_artifacts/aliyun-python-sdk-kms_1685446717428/work
+appdirs==1.4.3
+argcomplete @ file:///home/conda/feedstock_root/build_artifacts/argcomplete_1601194139635/work
+argon2-cffi @ file:///home/conda/feedstock_root/build_artifacts/argon2-cffi_1596629848788/work
+arviz @ file:///home/conda/feedstock_root/build_artifacts/arviz_1600909298598/work
+astunparse==1.6.3
+async-generator==1.10
+async-timeout==3.0.1
+atpublic==1.0
+attrs @ file:///home/conda/feedstock_root/build_artifacts/attrs_1599308529326/work
+awscli==1.18.145
+azure-core @ file:///home/conda/feedstock_root/build_artifacts/azure-core_1644684379874/work
+azure-storage-blob @ file:///home/conda/feedstock_root/build_artifacts/azure-storage-blob_1632181277153/work
+backcall @ file:///home/conda/feedstock_root/build_artifacts/backcall_1592338393461/work
+backports.functools-lru-cache==1.6.1
+bcrypt @ file:///home/conda/feedstock_root/build_artifacts/bcrypt_1597603498735/work
+beautifulsoup4==4.9.3
+billiard @ file:///home/conda/feedstock_root/build_artifacts/billiard_1602289122951/work
+biopython @ file:///home/conda/feedstock_root/build_artifacts/biopython_1599231171740/work
+bleach @ file:///home/conda/feedstock_root/build_artifacts/bleach_1600454382015/work
+blessings==1.7
+blinker==1.4
+bokeh==2.2.2
+boto3 @ file:///home/conda/feedstock_root/build_artifacts/boto3_1602286028824/work
+botocore==1.18.4
+brotlipy==0.7.0
+cachetools @ file:///home/conda/feedstock_root/build_artifacts/cachetools_1593420445823/work
+certifi==2020.12.5
+cffi @ file:///home/conda/feedstock_root/build_artifacts/cffi_1600276415718/work
+cftime @ file:///home/conda/feedstock_root/build_artifacts/cftime_1602504440833/work
+chardet @ file:///home/conda/feedstock_root/build_artifacts/chardet_1602255309768/work
+click==7.1.2
+cloudpickle==1.3.0
+colorama==0.4.3
+colorful==0.5.4
+commonmark==0.9.1
+configobj==5.0.6
+crcmod==1.7
+cryptography @ file:///home/conda/feedstock_root/build_artifacts/cryptography_1600960155371/work
+cycler==0.10.0
+decorator==4.4.2
+defusedxml==0.6.0
+distro @ file:///home/conda/feedstock_root/build_artifacts/distro_1591810140801/work
+docutils==0.15.2
+dpath==2.0.1
+dvc @ file:///home/conda/feedstock_root/build_artifacts/dvc_1601576954619/work
+editdistance==0.5.3
+entrypoints==0.3
+evcouplings==0.1.1
+fair-esm==0.3.1
+fastprogress @ file:///home/conda/feedstock_root/build_artifacts/fastprogress_1597932925331/work
+filelock @ file:///home/conda/feedstock_root/build_artifacts/filelock_1589994591731/work
+flatten-dict @ file:///home/conda/feedstock_root/build_artifacts/flatten-dict_1597439079477/work
+flufl.lock==3.2
+funcy @ file:///home/conda/feedstock_root/build_artifacts/funcy_1601143499053/work
+future==0.18.2
+gast==0.3.3
+gitdb @ file:///home/conda/feedstock_root/build_artifacts/gitdb_1588651542737/work
+gitdb2==4.0.2
+GitPython @ file:///home/conda/feedstock_root/build_artifacts/gitpython_1601547001552/work
+google==3.0.0
+google-api-core @ file:///Users/runner/miniforge3/conda-bld/google-api-core-split_1601998551628/work
+google-api-python-client @ file:///home/conda/feedstock_root/build_artifacts/google-api-python-client_1601441423773/work
+google-auth==1.18.0
+google-auth-httplib2 @ file:///home/conda/feedstock_root/build_artifacts/google-auth-httplib2_1601483325475/work
+google-auth-oauthlib==0.4.1
+google-cloud-core @ file:///home/conda/feedstock_root/build_artifacts/google-cloud-core_1602093568796/work
+google-cloud-storage==1.19.0
+google-crc32c @ file:///home/conda/feedstock_root/build_artifacts/google-crc32c_1597069930917/work
+google-pasta==0.2.0
+google-resumable-media @ file:///home/conda/feedstock_root/build_artifacts/google-resumable-media_1602011069843/work
+googleapis-common-protos @ file:///home/conda/feedstock_root/build_artifacts/googleapis-common-protos-feedstock_1601483873341/work
+gpustat==0.6.0
+grandalf==0.6
+grpcio==1.30.0
+h5py @ file:///home/conda/feedstock_root/build_artifacts/h5py_1602551881630/work
+hiredis==1.1.0
+httplib2 @ file:///home/conda/feedstock_root/build_artifacts/httplib2_1590007784658/work
+idna @ file:///home/conda/feedstock_root/build_artifacts/idna_1593328102638/work
+importlib-metadata @ file:///home/conda/feedstock_root/build_artifacts/importlib-metadata_1602263269022/work
+invoke==1.4.1
+ipykernel @ file:///home/conda/feedstock_root/build_artifacts/ipykernel_1595446871027/work/dist/ipykernel-5.3.4-py3-none-any.whl
+ipython @ file:///home/conda/feedstock_root/build_artifacts/ipython_1598749946943/work
+ipython-genutils==0.2.0
+ipywidgets @ file:///home/conda/feedstock_root/build_artifacts/ipywidgets_1599554010055/work
+isodate==0.6.0
+jedi @ file:///home/conda/feedstock_root/build_artifacts/jedi_1602395235501/work
+Jinja2==2.11.2
+jmespath @ file:///home/conda/feedstock_root/build_artifacts/jmespath_1589369830981/work
+joblib @ file:///home/conda/feedstock_root/build_artifacts/joblib_1601671685479/work
+jsonpath-ng @ file:///home/conda/feedstock_root/build_artifacts/jsonpath-ng_1599497963832/work
+jsonschema==3.2.0
+jupyter-client @ file:///home/conda/feedstock_root/build_artifacts/jupyter_client_1598486169312/work
+jupyter-console @ file:///home/conda/feedstock_root/build_artifacts/jupyter_console_1598728807792/work
+jupyter-core==4.6.3
+jupyterlab-pygments @ file:///home/conda/feedstock_root/build_artifacts/jupyterlab_pygments_1601375948261/work
+Keras-Preprocessing==1.1.2
+kiwisolver @ file:///home/conda/feedstock_root/build_artifacts/kiwisolver_1602517221725/work
+knack==0.6.3
+llvmlite==0.34.0
+Mako @ file:///home/conda/feedstock_root/build_artifacts/mako_1595925083607/work
+Markdown==3.2.2
+MarkupSafe @ file:///home/conda/feedstock_root/build_artifacts/markupsafe_1602267316845/work
+matplotlib @ file:///home/conda/feedstock_root/build_artifacts/matplotlib-base_1600159101071/work
+mistune @ file:///home/conda/feedstock_root/build_artifacts/mistune_1602381812692/work
+mkl-service==2.3.0
+mmtf-python==1.1.2
+msgpack==1.0.0
+msrest @ file:///home/conda/feedstock_root/build_artifacts/msrest_1599590147834/work
+multidict==4.7.6
+nanotime==0.5.2
+nbclient @ file:///home/conda/feedstock_root/build_artifacts/nbclient_1598558657104/work
+nbconvert @ file:///home/conda/feedstock_root/build_artifacts/nbconvert_1601713107734/work
+nbformat @ file:///home/conda/feedstock_root/build_artifacts/nbformat_1594060262917/work
+nest-asyncio @ file:///home/conda/feedstock_root/build_artifacts/nest-asyncio_1601342677072/work
+netCDF4 @ file:///home/conda/feedstock_root/build_artifacts/netcdf4_1602508544050/work
+networkx==2.4
+notebook @ file:///home/conda/feedstock_root/build_artifacts/notebook_1599742225943/work
+numba==0.51.2
+numpy @ file:///home/conda/feedstock_root/build_artifacts/numpy_1591485215893/work
+nvidia-ml-py3==7.352.0
+oauth2client==4.1.3
+oauthlib==3.1.0
+olefile==0.46
+opencensus==0.7.10
+opencensus-context==0.1.1
+opt-einsum==3.2.1
+oss2 @ file:///home/conda/feedstock_root/build_artifacts/oss2_1599191831893/work
+packaging @ file:///home/conda/feedstock_root/build_artifacts/packaging_1589925210001/work
+pandas @ file:///home/conda/feedstock_root/build_artifacts/pandas_1602502751364/work
+pandocfilters==1.4.2
+paramiko @ file:///home/conda/feedstock_root/build_artifacts/paramiko_1598988093657/work
+parso==0.7.0
+pathlib2==2.3.5
+pathspec==0.8.0
+patsy==0.5.1
+pexpect==4.8.0
+pickleshare==0.7.5
+Pillow @ file:///home/conda/feedstock_root/build_artifacts/pillow_1602493094547/work
+ply==3.11
+prometheus-client @ file:///home/conda/feedstock_root/build_artifacts/prometheus_client_1590412252446/work
+prompt-toolkit @ file:///home/conda/feedstock_root/build_artifacts/prompt-toolkit_1602524994744/work
+protobuf==3.12.2
+psutil==5.7.2
+ptyprocess==0.6.0
+py-spy==0.3.3
+pyaml==20.4.0
+pyarrow==1.0.1
+pyasn1==0.4.8
+pyasn1-modules==0.2.8
+pycparser @ file:///home/conda/feedstock_root/build_artifacts/pycparser_1593275161868/work
+pycryptodome @ file:///home/conda/feedstock_root/build_artifacts/pycryptodome_1602326836036/work
+pydot==1.4.1
+PyDrive2 @ file:///home/conda/feedstock_root/build_artifacts/pydrive2_1600764302481/work
+Pygments @ file:///home/conda/feedstock_root/build_artifacts/pygments_1600347314331/work
+pygpu==0.7.6
+pygtrie==2.3.2
+PyJWT==1.7.1
+pymc3 @ file:///home/conda/feedstock_root/build_artifacts/pymc3_1602091536109/work
+PyNaCl @ file:///home/conda/feedstock_root/build_artifacts/pynacl_1601742017220/work
+pynverse==0.1.4.4
+pyOpenSSL==19.1.0
+pyparsing==2.4.7
+PyQt5==5.12.3
+PyQt5-sip==4.19.18
+PyQtChart==5.12
+PyQtWebEngine==5.12.1
+pyrsistent @ file:///home/conda/feedstock_root/build_artifacts/pyrsistent_1602259985647/work
+PySocks @ file:///home/conda/feedstock_root/build_artifacts/pysocks_1602326924965/work
+python-dateutil==2.8.1
+pytz==2020.1
+PyYAML==5.3.1
+pyzmq==19.0.2
+qtconsole @ file:///home/conda/feedstock_root/build_artifacts/qtconsole_1599147533948/work
+QtPy==1.9.0
+redis==3.4.1
+requests @ file:///home/conda/feedstock_root/build_artifacts/requests_1592425495151/work
+requests-oauthlib @ file:///home/conda/feedstock_root/build_artifacts/requests-oauthlib_1595492159598/work
+rich @ file:///home/conda/feedstock_root/build_artifacts/rich_1601737884050/work
+rsa==4.5
+ruamel.yaml @ file:///home/conda/feedstock_root/build_artifacts/ruamel.yaml_1602413189583/work
+ruamel.yaml.clib @ file:///home/conda/feedstock_root/build_artifacts/ruamel.yaml.clib_1602268864115/work
+s3transfer==0.3.3
+scikit-learn @ file:///home/conda/feedstock_root/build_artifacts/scikit-learn_1589920855303/work
+scikit-optimize @ file:///home/conda/feedstock_root/build_artifacts/scikit-optimize_1599212891685/work
+scipy==1.4.1
+SCons==4.0.1
+seaborn @ file:///home/conda/feedstock_root/build_artifacts/seaborn-base_1591878760859/work
+Send2Trash==1.5.0
+setuptools-scm @ file:///home/conda/feedstock_root/build_artifacts/setuptools_scm_1590930292112/work
+shortuuid @ file:///home/conda/feedstock_root/build_artifacts/shortuuid_1602415114408/work
+shtab @ file:///home/conda/feedstock_root/build_artifacts/shtab_1601541025574/work
+simplejson @ file:///home/conda/feedstock_root/build_artifacts/simplejson_1602514248663/work
+six @ file:///home/conda/feedstock_root/build_artifacts/six_1590081179328/work
+sklearn==0.0
+smmap @ file:///home/conda/feedstock_root/build_artifacts/smmap_1588651577140/work
+soupsieve==2.0.1
+statsmodels @ file:///home/conda/feedstock_root/build_artifacts/statsmodels_1598551025620/work
+tabulate==0.8.7
+tensorboard==2.2.2
+tensorboard-plugin-wit==1.7.0
+tensorflow==2.2.0
+tensorflow-addons==0.10.0
+tensorflow-estimator==2.2.0
+tensorflow-probability==0.10.1
+termcolor==1.1.0
+terminado @ file:///home/conda/feedstock_root/build_artifacts/terminado_1600709672075/work
+testpath==0.4.4
+Theano @ file:///home/conda/feedstock_root/build_artifacts/theano_1604228519858/work
+threadpoolctl @ file:///tmp/tmp79xdzxkt/threadpoolctl-2.1.0-py3-none-any.whl
+toml @ file:///home/conda/feedstock_root/build_artifacts/toml_1589469402899/work
+torch==1.4.0
+tornado @ file:///home/conda/feedstock_root/build_artifacts/tornado_1602488893411/work
+tqdm @ file:///home/conda/feedstock_root/build_artifacts/tqdm_1602171507552/work
+traitlets @ file:///home/conda/feedstock_root/build_artifacts/traitlets_1600970644261/work
+typeguard==2.9.1
+typing-extensions==3.7.4.3
+uritemplate==3.0.1
+urllib3==1.25.9
+voluptuous==0.11.7
+wcwidth @ file:///home/conda/feedstock_root/build_artifacts/wcwidth_1600965781394/work
+webencodings==0.5.1
+Werkzeug==1.0.1
+widgetsnbextension @ file:///home/conda/feedstock_root/build_artifacts/widgetsnbextension_1594164347302/work
+wrapt==1.12.1
+xarray @ file:///home/conda/feedstock_root/build_artifacts/xarray_1600638299066/work
+xlrd @ file:///home/conda/feedstock_root/build_artifacts/xlrd_1595712200082/work
+yarl==1.5.1
+zc.lockfile==2.0
+zipp @ file:///home/conda/feedstock_root/build_artifacts/zipp_1601765966131/work
diff --git a/environments/proteingym_env.txt b/environments/proteingym_env.txt
new file mode 100644
index 0000000..f1f365b
--- /dev/null
+++ b/environments/proteingym_env.txt
@@ -0,0 +1,127 @@
+absl-py==1.4.0
+aiohttp==3.8.5
+aiosignal==1.3.1
+appdirs==1.4.4
+async-timeout==4.0.3
+attrs==23.1.0
+billiard==4.2.0
+biopython==1.82
+biotite==0.38.0
+bokeh==3.3.0
+Brotli==1.1.0
+brotlipy==0.7.0
+certifi==2023.7.22
+cffi==1.16.0
+charset-normalizer==3.3.2
+chex==0.1.7
+click==8.1.7
+contourpy==1.2.0
+cryptography==41.0.7
+cycler==0.12.1
+datasets==2.14.4
+dill==0.3.7
+dm-tree==0.1.8
+docker-pycreds==0.4.0
+dpcpp-cpp-rt==2024.0.2
+etils==1.4.1
+filelock==3.12.3
+flax==0.7.2
+fonttools==4.44.0
+frozenlist==1.4.0
+fsspec==2023.6.0
+gitdb==4.0.10
+GitPython==3.1.32
+h5py==3.9.0
+huggingface-hub==0.16.4
+idna==3.6
+importlib-metadata==6.8.0
+importlib-resources==6.0.1
+intel-cmplr-lib-rt==2024.0.2
+intel-cmplr-lic-rt==2024.0.2
+intel-opencl-rt==2024.0.2
+intel-openmp==2024.0.2
+jax==0.4.14
+jaxlib==0.4.14
+Jinja2==3.1.2
+joblib==1.2.0
+kiwisolver==1.4.5
+llvmlite==0.41.1
+markdown-it-py==3.0.0
+MarkupSafe==2.1.3
+matplotlib==3.8.1
+mdurl==0.1.2
+mkl==2024.0.0
+mkl-fft==1.3.6
+mkl-random==1.2.2
+mkl-service==2.4.0
+ml-dtypes==0.2.0
+mmtf-python==1.1.3
+msgpack==1.0.5
+multidict==6.0.4
+multiprocess==0.70.15
+nest-asyncio==1.5.7
+networkx==3.2.1
+numba==0.58.1
+numba-progress==1.1.0
+numpy==1.24.4
+nvidia-cublas-cu11==11.10.3.66
+nvidia-cuda-nvrtc-cu11==11.7.99
+nvidia-cuda-runtime-cu11==11.7.99
+nvidia-cudnn-cu11==8.5.0.96
+opt-einsum==3.3.0
+optax==0.1.7
+orbax-checkpoint==0.3.5
+packaging==23.2
+pandas==1.5.1
+pathtools==0.1.2
+pillow==10.2.0
+platformdirs==4.1.0
+pooch==1.8.0
+protobuf==4.24.2
+psutil==5.9.5
+pyarrow==13.0.0
+pycparser==2.21
+pyfaidx==0.6.4
+Pygments==2.16.1
+pyOpenSSL==23.3.0
+pyparsing==3.1.1
+PySocks==1.7.1
+python-dateutil==2.8.2
+pytz==2022.6
+PyYAML==6.0.1
+regex==2023.8.8
+requests==2.31.0
+rich==13.5.2
+ruamel.yaml==0.18.5
+ruamel.yaml.clib==0.2.8
+safetensors==0.3.3
+scikit-learn==1.3.2
+scipy==1.10.1
+seaborn==0.13.0
+sentencepiece==0.1.99
+sentry-sdk==1.30.0
+sequence-models==1.7.0
+setproctitle==1.3.2
+six==1.16.0
+smmap==5.0.0
+tbb==2021.11.0
+tensorstore==0.1.41
+threadpoolctl==3.1.0
+tokenizers==0.13.3
+toolz==0.12.0
+torch==1.13.1
+torch_geometric==2.4.0
+torchaudio==0.13.1
+torchvision==0.14.1
+tornado==6.3.3
+tqdm==4.66.1
+transformers==4.32.1
+typing_extensions==4.9.0
+tzdata==2023.4
+urllib3==2.1.0
+vespa-effect==1.0.2
+wandb==0.15.9
+xxhash==3.3.0
+xyzservices==2023.10.1
+yarl==1.9.2
+zipp==3.16.2
diff --git a/notebooks/TranceptEVE_example.ipynb b/notebooks/TranceptEVE_example.ipynb
new file mode 100644
index 0000000..36b5759
--- /dev/null
+++ b/notebooks/TranceptEVE_example.ipynb
@@ -0,0 +1,165 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "410842ba-dace-4df2-8fff-7ae4809eac27",
+ "metadata": {},
+ "source": [
+ "This notebook walks through all the steps required to run TranceptEVE on a deep mutational scan (DMS)\n",
+ "To run this notebook you need: \n",
+ "* a csv with all the mutant sequences in the same format as the other proteingym datasets (i.e. they need a mutated_sequence column with all the mutated sequences and a DMS_score column with the experimental values)\n",
+ "* A multiple sequence alignment for the target protein of the DMS\n",
+ "* A copy of the Tranception checkpoint (small, medium or large) that you want to use\n",
+ "* To update the reference file with an additional row describing the DMS (required columns to fill in are \"DMS_id\",\"DMS_filename\",\"MSA_filename\",\"MSA_theta\",\"MSA_start\",\"MSA_end\",\"weight_file_name\", and \"target_seq\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "67c24600-47dc-49f8-9782-cb7cb1db0b30",
+ "metadata": {},
+ "source": [
+ "# Step 0: Alignment Generation\n",
+ "If you don't have an alignment for your target protein, there are several methods for generating one:\n",
+ "* [EVCouplings](evcouplings.org): You can use the online webserver at [evcouplings.org](evcouplings.org) or download the software and run it locally from [https://github.com/debbiemarkslab/EVcouplings](https://github.com/debbiemarkslab/EVcouplings)\n",
+ "* [ColabFold](https://github.com/sokrypton/ColabFold) also includes an MSA generation pipeline.\n",
+ "* [BLAST](https://blast.ncbi.nlm.nih.gov/Blast.cgi), the Basic Local Alignment Search Tool, also has a web server for generating alignments\n",
+ "* [MUSCLE](https://www.drive5.com/muscle5/) is another commonly used library for alignment generation\n",
+ "\n",
+ "All the DMS alignments in the original ProteinGym paper were generated using EVCouplings"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "ff7855b8-1c61-46c8-8a4d-dcc002199a53",
+ "metadata": {},
+ "source": [
+ "# Step 1: Training EVE models on wild type MSA \n",
+ "The script at proteingym/baselines/EVE/train_VAE.py can be used to train EVE models on an alignment. Below is essentially the same code to run train_VAE.py that is present in scripts/scoring_DMS_zero_shot/training_EVE_models.sh"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "8847c7c2-540c-46a1-8946-cb343707a449",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# This is the index of the DMS you want to run in the reference file csv.\n",
+ "DMS_index=0 # change this to whatever row your new DMS is on \n",
+ "# You can train multiple EVE model with different seeds for initialization and then pass them all to TranceptEVE. \n",
+ "# The log prior in TranceptEVE will then be the ensemble of all those models \n",
+ "random_seeds = [0,1,2,3,4]\n",
+ "\n",
+ "model_parameters_location='../proteingym/baselines/EVE/EVE/default_model_params.json'\n",
+ "training_logs_location='../proteingym/baselines/EVE/logs/'\n",
+ "DMS_reference_file_path=\"../reference_files/DMS_substitutions.csv\"\n",
+ "\n",
+ "# replace the below with the locations of the MSAs and assay csvs on your machine \n",
+ "\n",
+ "DMS_MSA_data_folder=\"Folder containing multiple sequence alignments in a2m format\" \n",
+ "# This is where the EVE models will be written out. The filenames are in the format input-msa-name_seed\n",
+ "DMS_EVE_model_folder=\"Folder where EVE models will be saved\"\n",
+ "# if you don't already have weights here for the MSA, the EVE training script will generate them \n",
+ "DMS_MSA_weights_folder=\"Folder where MSA weights will be saved\" \n",
+ "\n",
+ "# Note that these models generally take a few hours to train, so it is likely easier to run the training_EVE_models.sh script mentioned above and \n",
+ "# train several in parallel than to do them sequentially in a notebook here. \n",
+ "for seed in random_seeds:\n",
+ " command = f\"../proteingym/baselines/EVE/train_VAE.py \\\n",
+ " --MSA_data_folder {DMS_MSA_data_folder} \\\n",
+ " --DMS_reference_file_path {DMS_reference_file_path} \\\n",
+ " --protein_index {DMS_index} \\\n",
+ " --MSA_weights_location {DMS_MSA_weights_folder} \\\n",
+ " --VAE_checkpoint_location {DMS_EVE_model_folder} \\\n",
+ " --model_parameters_location {model_parameters_location} \\\n",
+ " --training_logs_location {training_logs_location} \\\n",
+ " --threshold_focus_cols_frac_gaps 1 \\\n",
+ " --seed {seed} \\\n",
+ " --skip_existing \\\n",
+ " --experimental_stream_data\"\n",
+ " !python $command\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "d755a049-3d0e-4449-a993-0c06191e151c",
+ "metadata": {},
+ "source": [
+ "# Step 2: Scoring with TranceptEVE\n",
+ "Now that the EVE models are trained, we can use them in conjunction with the downloaded Tranception checkpoint to run TranceptEVE. The below code is essentially the same as the script at scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_substitutions.sh. The code for scoring indels is essentially the same, with two extra parameters (A flag for setting Tranception to indel mode, \n",
+ "and a path to an installation of Clustal Omega which is an alignment tool used as part of the retrieval process with indels). "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "a90a60ea-0cd9-4574-b30a-17313af499c6",
+ "metadata": {
+ "scrolled": true
+ },
+ "outputs": [],
+ "source": [
+ "# These values need to match those in the prior steps, so that the script finds the correct EVE models \n",
+ "DMS_index=0 \n",
+ "random_seeds = [0,1,2,3,4]\n",
+ "DMS_MSA_data_folder=\"Folder containing MSA files in a2m format\"\n",
+ "model_parameters_location='../proteingym/baselines/EVE/EVE/default_model_params.json'\n",
+ "DMS_EVE_model_folder=\"Folder containing EVE models\"\n",
+ "DMS_MSA_weights_folder=\"Folder containing MSA weights\"\n",
+ "DMS_reference_file_path=\"../reference_files/DMS_substitutions.csv\"\n",
+ "\n",
+ "# These are new for trancepteve \n",
+ "\n",
+ "inference_time_retrieval_type=\"TranceptEVE\"\n",
+ "# This is the number of samples taken from each EVE model to generate the log prior. This is done at the start of the script and then cached\n",
+ "# so that later runs with the same EVE models don't have to recompute it. \n",
+ "EVE_num_samples_log_proba=200000\n",
+ "# For long proteins, \"sliding\" rather than \"optimal\" may be ideal for this parameter. \n",
+ "scoring_window=\"optimal\" \n",
+ "\n",
+ "# These can be changed based on where the Tranception checkpoint and DMS data files are stored and where you want the output scores to write to \n",
+ "DMS_data_folder=\"Folder containing DMS assay csvs\"\n",
+ "checkpoint = \"Tranception model checkpoint path\"\n",
+ "output_scores_folder=\"Path to folder where scores will be saved\" \n",
+ "\n",
+ "command = f\"../proteingym/baselines/trancepteve/score_trancepteve.py \\\n",
+ " --checkpoint {checkpoint} \\\n",
+ " --DMS_reference_file_path {DMS_reference_file_path} \\\n",
+ " --DMS_data_folder {DMS_data_folder} \\\n",
+ " --DMS_index {DMS_index} \\\n",
+ " --output_scores_folder {output_scores_folder} \\\n",
+ " --inference_time_retrieval_type {inference_time_retrieval_type} \\\n",
+ " --MSA_folder {DMS_MSA_data_folder} \\\n",
+ " --MSA_weights_folder {DMS_MSA_weights_folder} \\\n",
+ " --EVE_num_samples_log_proba {EVE_num_samples_log_proba} \\\n",
+ " --EVE_model_parameters_location {model_parameters_location} \\\n",
+ " --EVE_model_folder {DMS_EVE_model_folder} \\\n",
+ " --scoring_window {scoring_window} \\\n",
+ " --EVE_seeds {\" \".join(random_seeds)} \\\n",
+ " --EVE_recalibrate_probas\"\n",
+ "!python $command"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "Python 3 (ipykernel)",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.9.18"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/proteingym/__init__.py b/proteingym/__init__.py
new file mode 100644
index 0000000..9b41f58
--- /dev/null
+++ b/proteingym/__init__.py
@@ -0,0 +1 @@
+from proteingym import utils
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/EVE/VAE_decoder.py b/proteingym/baselines/EVE/EVE/VAE_decoder.py
new file mode 100644
index 0000000..7bfa57a
--- /dev/null
+++ b/proteingym/baselines/EVE/EVE/VAE_decoder.py
@@ -0,0 +1,295 @@
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+class VAE_Bayesian_MLP_decoder(nn.Module):
+ """
+ Bayesian MLP decoder class for the VAE model.
+ """
+ def __init__(self, params):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of the sizes of the hidden layers (all DNNs)
+ - z_dim: (Int) Dimension of latent space
+ - first_hidden_nonlinearity: (Str) Type of non-linear activation applied on the first (set of) hidden layer(s)
+ - last_hidden_nonlinearity: (Str) Type of non-linear activation applied on the very last hidden layer (pre-sparsity)
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ - convolve_output: (Bool) Whether to perform 1d convolution on output (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on output
+ - include_temperature_scaler: (Bool) Whether we apply the global temperature scaler
+ - include_sparsity: (Bool) Whether we use the sparsity inducing scheme on the output from the last hidden layer
+ - num_tiles_sparsity: (Int) Number of tiles to use in the sparsity inducing scheme (the more the tiles, the stronger the sparsity)
+ - bayesian_decoder: (Bool) Whether the decoder is bayesian or not
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.bayesian_decoder = True
+ self.dropout_proba = params['dropout_proba']
+ self.convolve_output = params['convolve_output']
+ self.convolution_depth = params['convolution_output_depth']
+ self.include_temperature_scaler = params['include_temperature_scaler']
+ self.include_sparsity = params['include_sparsity']
+ self.num_tiles_sparsity = params['num_tiles_sparsity']
+
+ self.mu_bias_init = 0.1
+ self.logvar_init = -10.0
+ self.logit_scale_p = 0.001
+
+ self.hidden_layers_mean=nn.ModuleDict()
+ self.hidden_layers_log_var=nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers_mean[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ self.hidden_layers_log_var[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers_mean[str(layer_index)].bias, self.mu_bias_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].weight, self.logvar_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].bias, self.logvar_init)
+ else:
+ self.hidden_layers_mean[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ self.hidden_layers_log_var[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers_mean[str(layer_index)].bias, self.mu_bias_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].weight, self.logvar_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].bias, self.logvar_init)
+
+ if params['first_hidden_nonlinearity'] == 'relu':
+ self.first_hidden_nonlinearity = nn.ReLU()
+ elif params['first_hidden_nonlinearity'] == 'tanh':
+ self.first_hidden_nonlinearity = nn.Tanh()
+ elif params['first_hidden_nonlinearity'] == 'sigmoid':
+ self.first_hidden_nonlinearity = nn.Sigmoid()
+ elif params['first_hidden_nonlinearity'] == 'elu':
+ self.first_hidden_nonlinearity = nn.ELU()
+ elif params['first_hidden_nonlinearity'] == 'linear':
+ self.first_hidden_nonlinearity = nn.Identity()
+
+ if params['last_hidden_nonlinearity'] == 'relu':
+ self.last_hidden_nonlinearity = nn.ReLU()
+ elif params['last_hidden_nonlinearity'] == 'tanh':
+ self.last_hidden_nonlinearity = nn.Tanh()
+ elif params['last_hidden_nonlinearity'] == 'sigmoid':
+ self.last_hidden_nonlinearity = nn.Sigmoid()
+ elif params['last_hidden_nonlinearity'] == 'elu':
+ self.last_hidden_nonlinearity = nn.ELU()
+ elif params['last_hidden_nonlinearity'] == 'linear':
+ self.last_hidden_nonlinearity = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ if self.convolve_output:
+ self.output_convolution_mean = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ self.output_convolution_log_var = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ nn.init.constant_(self.output_convolution_log_var.weight, self.logvar_init)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ if self.include_sparsity:
+ self.sparsity_weight_mean = nn.Parameter(torch.zeros(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+ self.sparsity_weight_log_var = nn.Parameter(torch.ones(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+ nn.init.constant_(self.sparsity_weight_log_var, self.logvar_init)
+
+ self.last_hidden_layer_weight_mean = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ self.last_hidden_layer_weight_log_var = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ nn.init.xavier_normal_(self.last_hidden_layer_weight_mean) #Glorot initialization
+ nn.init.constant_(self.last_hidden_layer_weight_log_var, self.logvar_init)
+
+ self.last_hidden_layer_bias_mean = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ self.last_hidden_layer_bias_log_var = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ nn.init.constant_(self.last_hidden_layer_bias_mean, self.mu_bias_init)
+ nn.init.constant_(self.last_hidden_layer_bias_log_var, self.logvar_init)
+
+ if self.include_temperature_scaler:
+ self.temperature_scaler_mean = nn.Parameter(torch.ones(1))
+ self.temperature_scaler_log_var = nn.Parameter(torch.ones(1) * self.logvar_init)
+
+ def sampler(self, mean, log_var):
+ """
+ Samples a latent vector via reparametrization trick
+ """
+ eps = torch.randn_like(mean).to(self.device)
+ z = torch.exp(0.5*log_var) * eps + mean
+ return z
+
+ def forward(self, z):
+ batch_size = z.shape[0]
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(z)
+ else:
+ x = z
+
+ for layer_index in range(len(self.hidden_layers_sizes)-1):
+ layer_i_weight = self.sampler(self.hidden_layers_mean[str(layer_index)].weight, self.hidden_layers_log_var[str(layer_index)].weight)
+ layer_i_bias = self.sampler(self.hidden_layers_mean[str(layer_index)].bias, self.hidden_layers_log_var[str(layer_index)].bias)
+ x = self.first_hidden_nonlinearity(F.linear(x, weight=layer_i_weight, bias=layer_i_bias))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ last_index = len(self.hidden_layers_sizes)-1
+ last_layer_weight = self.sampler(self.hidden_layers_mean[str(last_index)].weight, self.hidden_layers_log_var[str(last_index)].weight)
+ last_layer_bias = self.sampler(self.hidden_layers_mean[str(last_index)].bias, self.hidden_layers_log_var[str(last_index)].bias)
+ x = self.last_hidden_nonlinearity(F.linear(x, weight=last_layer_weight, bias=last_layer_bias))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ W_out = self.sampler(self.last_hidden_layer_weight_mean, self.last_hidden_layer_weight_log_var)
+ b_out = self.sampler(self.last_hidden_layer_bias_mean, self.last_hidden_layer_bias_log_var)
+
+ if self.convolve_output:
+ output_convolution_weight = self.sampler(self.output_convolution_mean.weight, self.output_convolution_log_var.weight)
+ W_out = torch.mm(W_out.view(self.seq_len * self.hidden_layers_sizes[-1], self.channel_size),
+ output_convolution_weight.view(self.channel_size,self.alphabet_size)) #product of size (H * seq_len, alphabet)
+
+ if self.include_sparsity:
+ sparsity_weights = self.sampler(self.sparsity_weight_mean,self.sparsity_weight_log_var)
+ sparsity_tiled = sparsity_weights.repeat(self.num_tiles_sparsity,1)
+ sparsity_tiled = nn.Sigmoid()(sparsity_tiled).unsqueeze(2)
+
+ W_out = W_out.view(self.hidden_layers_sizes[-1], self.seq_len, self.alphabet_size) * sparsity_tiled
+
+ W_out = W_out.view(self.seq_len * self.alphabet_size, self.hidden_layers_sizes[-1])
+
+ x = F.linear(x, weight=W_out, bias=b_out)
+
+ if self.include_temperature_scaler:
+ temperature_scaler = self.sampler(self.temperature_scaler_mean,self.temperature_scaler_log_var)
+ x = torch.log(1.0+torch.exp(temperature_scaler)) * x
+
+ x = x.view(batch_size, self.seq_len, self.alphabet_size)
+ x_recon_log = F.log_softmax(x, dim=-1) #of shape (batch_size, seq_len, alphabet)
+
+ return x_recon_log
+
+class VAE_Standard_MLP_decoder(nn.Module):
+ """
+ Standard MLP decoder class for the VAE model.
+ """
+ def __init__(self, seq_len, alphabet_size, hidden_layers_sizes, z_dim, first_hidden_nonlinearity, last_hidden_nonlinearity, dropout_proba,
+ convolve_output, convolution_depth, include_temperature_scaler, include_sparsity, num_tiles_sparsity):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of the sizes of the hidden layers (all DNNs)
+ - z_dim: (Int) Dimension of latent space
+ - first_hidden_nonlinearity: (Str) Type of non-linear activation applied on the first (set of) hidden layer(s)
+ - last_hidden_nonlinearity: (Str) Type of non-linear activation applied on the very last hidden layer (pre-sparsity)
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ - convolve_output: (Bool) Whether to perform 1d convolution on output (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on output
+ - include_temperature_scaler: (Bool) Whether we apply the global temperature scaler
+ - include_sparsity: (Bool) Whether we use the sparsity inducing scheme on the output from the last hidden layer
+ - num_tiles_sparsity: (Int) Number of tiles to use in the sparsity inducing scheme (the more the tiles, the stronger the sparsity)
+ - bayesian_decoder: (Bool) Whether the decoder is bayesian or not
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.bayesian_decoder = False
+ self.dropout_proba = params['dropout_proba']
+ self.convolve_output = params['convolve_output']
+ self.convolution_depth = params['convolution_depth']
+ self.include_temperature_scaler = params['include_temperature_scaler']
+ self.include_sparsity = params['include_sparsity']
+ self.num_tiles_sparsity = params['num_tiles_sparsity']
+
+ self.mu_bias_init = 0.1
+
+ self.hidden_layers=nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+ else:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+
+ if params['first_hidden_nonlinearity'] == 'relu':
+ self.first_hidden_nonlinearity = nn.ReLU()
+ elif params['first_hidden_nonlinearity'] == 'tanh':
+ self.first_hidden_nonlinearity = nn.Tanh()
+ elif params['first_hidden_nonlinearity'] == 'sigmoid':
+ self.first_hidden_nonlinearity = nn.Sigmoid()
+ elif params['first_hidden_nonlinearity'] == 'elu':
+ self.first_hidden_nonlinearity = nn.ELU()
+ elif params['first_hidden_nonlinearity'] == 'linear':
+ self.first_hidden_nonlinearity = nn.Identity()
+
+ if params['last_hidden_nonlinearity'] == 'relu':
+ self.last_hidden_nonlinearity = nn.ReLU()
+ elif params['last_hidden_nonlinearity'] == 'tanh':
+ self.last_hidden_nonlinearity = nn.Tanh()
+ elif params['last_hidden_nonlinearity'] == 'sigmoid':
+ self.last_hidden_nonlinearity = nn.Sigmoid()
+ elif params['last_hidden_nonlinearity'] == 'elu':
+ self.last_hidden_nonlinearity = nn.ELU()
+ elif params['last_hidden_nonlinearity'] == 'linear':
+ self.last_hidden_nonlinearity = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ if self.convolve_output:
+ self.output_convolution = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ if self.include_sparsity:
+ self.sparsity_weight = nn.Parameter(torch.randn(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+
+ self.W_out = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ nn.init.xavier_normal_(self.W_out) #Initialize weights with Glorot initialization
+ self.b_out = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ nn.init.constant_(self.b_out, self.mu_bias_init)
+
+ if self.include_temperature_scaler:
+ self.temperature_scaler = nn.Parameter(torch.ones(1))
+
+ def forward(self, z):
+ batch_size = z.shape[0]
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(z)
+ else:
+ x=z
+
+ for layer_index in range(len(self.hidden_layers_sizes)-1):
+ x = self.first_hidden_nonlinearity(self.hidden_layers[str(layer_index)](x))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ x = self.last_hidden_nonlinearity(self.hidden_layers[str(len(self.hidden_layers_sizes)-1)](x)) #of size (batch_size,H)
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ W_out = self.W_out.data
+
+ if self.convolve_output:
+ W_out = torch.mm(W_out.view(self.seq_len * self.hidden_layers_sizes[-1], self.channel_size),
+ self.output_convolution.weight.view(self.channel_size,self.alphabet_size))
+
+ if self.include_sparsity:
+ sparsity_tiled = self.sparsity_weight.repeat(self.num_tiles_sparsity,1) #of size (H,seq_len)
+ sparsity_tiled = nn.Sigmoid()(sparsity_tiled).unsqueeze(2) #of size (H,seq_len,1)
+ W_out = W_out.view(self.hidden_layers_sizes[-1], self.seq_len, self.alphabet_size) * sparsity_tiled
+
+ W_out = W_out.view(self.seq_len * self.alphabet_size, self.hidden_layers_sizes[-1])
+
+ x = F.linear(x, weight=W_out, bias=self.b_out)
+
+ if self.include_temperature_scaler:
+ x = torch.log(1.0+torch.exp(self.temperature_scaler)) * x
+
+ x = x.view(batch_size, self.seq_len, self.alphabet_size)
+ x_recon_log = F.log_softmax(x, dim=-1) #of shape (batch_size, seq_len, alphabet)
+
+ return x_recon_log
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/EVE/VAE_encoder.py b/proteingym/baselines/EVE/EVE/VAE_encoder.py
new file mode 100644
index 0000000..3fd40fe
--- /dev/null
+++ b/proteingym/baselines/EVE/EVE/VAE_encoder.py
@@ -0,0 +1,88 @@
+import torch
+import torch.nn as nn
+
+class VAE_MLP_encoder(nn.Module):
+ """
+ MLP encoder class for the VAE model.
+ """
+ def __init__(self,params):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of sizes of DNN linear layers
+ - z_dim: (Int) Size of latent space
+ - convolve_input: (Bool) Whether to perform 1d convolution on input (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on input
+ - nonlinear_activation: (Str) Type of non-linear activation to apply on each hidden layer
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.convolve_input = params['convolve_input']
+ self.convolution_depth = params['convolution_input_depth']
+ self.dropout_proba = params['dropout_proba']
+
+ self.mu_bias_init = 0.1
+ self.log_var_bias_init = -10.0
+
+ #Convolving input with kernels of size 1 to capture potential similarities across amino acids when encoding sequences
+ if self.convolve_input:
+ self.input_convolution = nn.Conv1d(in_channels=self.alphabet_size,out_channels=self.convolution_depth,kernel_size=1,stride=1,bias=False)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ self.hidden_layers=torch.nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers[str(layer_index)] = nn.Linear((self.channel_size*self.seq_len),self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+ else:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+
+ self.fc_mean = nn.Linear(self.hidden_layers_sizes[-1],self.z_dim)
+ nn.init.constant_(self.fc_mean.bias, self.mu_bias_init)
+ self.fc_log_var = nn.Linear(self.hidden_layers_sizes[-1],self.z_dim)
+ nn.init.constant_(self.fc_log_var.bias, self.log_var_bias_init)
+
+ # set up non-linearity
+ if params['nonlinear_activation'] == 'relu':
+ self.nonlinear_activation = nn.ReLU()
+ elif params['nonlinear_activation'] == 'tanh':
+ self.nonlinear_activation = nn.Tanh()
+ elif params['nonlinear_activation'] == 'sigmoid':
+ self.nonlinear_activation = nn.Sigmoid()
+ elif params['nonlinear_activation'] == 'elu':
+ self.nonlinear_activation = nn.ELU()
+ elif params['nonlinear_activation'] == 'linear':
+ self.nonlinear_activation = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ def forward(self, x):
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ if self.convolve_input:
+ x = x.permute(0,2,1)
+ x = self.input_convolution(x)
+ x = x.view(-1,self.seq_len*self.channel_size)
+ else:
+ x = x.view(-1,self.seq_len*self.channel_size)
+
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ x = self.nonlinear_activation(self.hidden_layers[str(layer_index)](x))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ z_mean = self.fc_mean(x)
+ z_log_var = self.fc_log_var(x)
+
+ return z_mean, z_log_var
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/EVE/VAE_model.py b/proteingym/baselines/EVE/EVE/VAE_model.py
new file mode 100644
index 0000000..d1103bd
--- /dev/null
+++ b/proteingym/baselines/EVE/EVE/VAE_model.py
@@ -0,0 +1,551 @@
+import datetime
+import os
+import sys
+from resource import getrusage, RUSAGE_SELF
+
+import numpy as np
+import pandas as pd
+import time
+from tqdm import tqdm
+from scipy.special import erfinv
+from sklearn.model_selection import train_test_split
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+import torch.optim as optim
+import torch.backends.cudnn as cudnn
+
+from utils.data_utils import one_hot_3D, get_dataloader
+from . import VAE_encoder, VAE_decoder
+
+
+class VAE_model(nn.Module):
+ """
+ Class for the VAE model with estimation of weights distribution parameters via Mean-Field VI.
+ """
+
+ def __init__(self,
+ model_name,
+ data,
+ encoder_parameters,
+ decoder_parameters,
+ random_seed,
+ seq_len=None,
+ alphabet_size=None,
+ Neff=None,
+ ):
+
+ super().__init__()
+
+ self.model_name = model_name
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.dtype = torch.float32
+ self.random_seed = random_seed
+ torch.manual_seed(random_seed)
+
+ self.seq_len = seq_len if seq_len is not None else data.seq_len
+ self.alphabet_size = alphabet_size if alphabet_size is not None else data.alphabet_size
+ self.Neff = Neff if Neff is not None else data.Neff
+
+ self.encoder_parameters = encoder_parameters
+ self.decoder_parameters = decoder_parameters
+
+ encoder_parameters['seq_len'] = self.seq_len
+ encoder_parameters['alphabet_size'] = self.alphabet_size
+ decoder_parameters['seq_len'] = self.seq_len
+ decoder_parameters['alphabet_size'] = self.alphabet_size
+
+ self.encoder = VAE_encoder.VAE_MLP_encoder(params=encoder_parameters)
+ if decoder_parameters['bayesian_decoder']:
+ self.decoder = VAE_decoder.VAE_Bayesian_MLP_decoder(params=decoder_parameters)
+ else:
+ self.decoder = VAE_decoder.VAE_Standard_MLP_decoder(params=decoder_parameters)
+ self.logit_sparsity_p = decoder_parameters['logit_sparsity_p']
+
+ def sample_latent(self, mu, log_var):
+ """
+ Samples a latent vector via reparametrization trick
+ """
+ eps = torch.randn_like(mu).to(self.device)
+ z = torch.exp(0.5 * log_var) * eps + mu
+ return z
+
+ def KLD_diag_gaussians(self, mu, logvar, p_mu, p_logvar):
+ """
+ KL divergence between diagonal gaussian with prior diagonal gaussian.
+ """
+ KLD = 0.5 * (p_logvar - logvar) + 0.5 * (torch.exp(logvar) + torch.pow(mu - p_mu, 2)) / (
+ torch.exp(p_logvar) + 1e-20) - 0.5
+
+ return torch.sum(KLD)
+
+ def annealing_factor(self, annealing_warm_up, training_step):
+ """
+ Annealing schedule of KL to focus on reconstruction error in early stages of training
+ """
+ if training_step < annealing_warm_up:
+ return training_step / annealing_warm_up
+ else:
+ return 1
+
+ def KLD_global_parameters(self):
+ """
+ KL divergence between the variational distributions and the priors (for the decoder weights).
+ """
+ KLD_decoder_params = 0.0
+ zero_tensor = torch.tensor(0.0).to(self.device)
+
+ for layer_index in range(len(self.decoder.hidden_layers_sizes)):
+ for param_type in ['weight', 'bias']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)[
+ 'hidden_layers_mean.' + str(layer_index) + '.' + param_type].flatten(),
+ self.decoder.state_dict(keep_vars=True)[
+ 'hidden_layers_log_var.' + str(layer_index) + '.' + param_type].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ for param_type in ['weight', 'bias']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['last_hidden_layer_' + param_type + '_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['last_hidden_layer_' + param_type + '_log_var'].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ if self.decoder.include_sparsity:
+ self.logit_scale_sigma = 4.0
+ self.logit_scale_mu = 2.0 ** 0.5 * self.logit_scale_sigma * erfinv(2.0 * self.logit_sparsity_p - 1.0)
+
+ sparsity_mu = torch.tensor(self.logit_scale_mu).to(self.device)
+ sparsity_log_var = torch.log(torch.tensor(self.logit_scale_sigma ** 2)).to(self.device)
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['sparsity_weight_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['sparsity_weight_log_var'].flatten(),
+ sparsity_mu,
+ sparsity_log_var
+ )
+
+ if self.decoder.convolve_output:
+ for param_type in ['weight']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['output_convolution_mean.' + param_type].flatten(),
+ self.decoder.state_dict(keep_vars=True)['output_convolution_log_var.' + param_type].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ if self.decoder.include_temperature_scaler:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['temperature_scaler_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['temperature_scaler_log_var'].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+ return KLD_decoder_params
+
+ def loss_function(self, x_recon_log, x, mu, log_var, kl_latent_scale, kl_global_params_scale, annealing_warm_up,
+ training_step, Neff):
+ """
+ Returns mean of negative ELBO, reconstruction loss and KL divergence across batch x.
+ """
+ BCE = F.binary_cross_entropy_with_logits(x_recon_log, x, reduction='sum') / x.shape[0]
+ KLD_latent = (-0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())) / x.shape[0]
+ if self.decoder.bayesian_decoder:
+ KLD_decoder_params_normalized = self.KLD_global_parameters() / Neff
+ else:
+ KLD_decoder_params_normalized = 0.0
+ warm_up_scale = self.annealing_factor(annealing_warm_up, training_step)
+ neg_ELBO = BCE + warm_up_scale * (
+ kl_latent_scale * KLD_latent + kl_global_params_scale * KLD_decoder_params_normalized)
+ return neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized
+
+ def all_likelihood_components(self, x):
+ """
+ Returns tensors of ELBO, reconstruction loss and KL divergence for each point in batch x.
+ """
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ recon_x_log = recon_x_log.view(-1, self.alphabet_size * self.seq_len)
+ x = x.view(-1, self.alphabet_size * self.seq_len)
+
+ BCE_batch_tensor = torch.sum(F.binary_cross_entropy_with_logits(recon_x_log, x, reduction='none'), dim=1)
+ KLD_batch_tensor = (-0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp(), dim=1))
+
+ ELBO_batch_tensor = -(BCE_batch_tensor + KLD_batch_tensor)
+
+ return ELBO_batch_tensor, BCE_batch_tensor, KLD_batch_tensor
+
+ def all_likelihood_components_z(self, x, mu, log_var):
+ """Skip the encoder part and directly sample z"""
+ # Need to run mu, log_var = self.encoder(x) first
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ recon_x_log = recon_x_log.view(-1, self.alphabet_size * self.seq_len)
+ x = x.view(-1, self.alphabet_size * self.seq_len)
+
+ BCE_batch_tensor = torch.sum(F.binary_cross_entropy_with_logits(recon_x_log, x, reduction='none'), dim=1)
+ KLD_batch_tensor = (-0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp(), dim=1))
+
+ ELBO_batch_tensor = -(BCE_batch_tensor + KLD_batch_tensor)
+
+ return ELBO_batch_tensor, BCE_batch_tensor, KLD_batch_tensor
+
+ def train_model(self, data, training_parameters, use_dataloader=False):
+ """
+ Training procedure for the VAE model.
+ If use_validation_set is True then:
+ - we split the alignment data in train/val sets.
+ - we train up to num_training_steps steps but store the version of the model with lowest loss on validation set across training
+ If not, then we train the model for num_training_steps and save the model at the end of training
+ """
+ if torch.cuda.is_available():
+ cudnn.benchmark = True
+ self.train()
+
+ if training_parameters['log_training_info']:
+ filename = training_parameters['training_logs_location'] + os.sep + self.model_name + "_losses.csv"
+ with open(filename, "a") as logs:
+ logs.write("Number of sequences in alignment file:\t" + str(data.num_sequences) + "\n")
+ logs.write("Neff:\t" + str(self.Neff) + "\n")
+ logs.write("Alignment sequence length:\t" + str(data.seq_len) + "\n")
+
+ optimizer = optim.Adam(self.parameters(), lr=training_parameters['learning_rate'],
+ weight_decay=training_parameters['l2_regularization'])
+
+ if training_parameters['use_lr_scheduler']:
+ scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=training_parameters['lr_scheduler_step_size'],
+ gamma=training_parameters['lr_scheduler_gamma'])
+
+ if training_parameters['use_validation_set']:
+ # TODO fix this for use with a dataloader
+ x_train, x_val, weights_train, weights_val = train_test_split(data.one_hot_encoding, data.weights,
+ test_size=training_parameters[
+ 'validation_set_pct'],
+ random_state=self.random_seed)
+ best_val_loss = float('inf')
+ best_model_step_index = 0
+ else:
+ x_train = data.one_hot_encoding
+ weights_train = data.weights
+ best_val_loss = None
+ best_model_step_index = training_parameters['num_training_steps']
+
+ seq_sample_probs = weights_train / np.sum(weights_train)
+ assert len(data.seq_name_to_sequence) == weights_train.shape[0] # One weight per sequence
+
+ # TMP TODO: Keep old behaviour for comparison
+ if use_dataloader:
+ # Stream one-hot encodings
+ dataloader = get_dataloader(msa_data=data, batch_size=training_parameters['batch_size'], num_training_steps=training_parameters['num_training_steps'])
+ else:
+ batch_order = np.arange(x_train.shape[0])
+ assert batch_order.shape == seq_sample_probs.shape, f"batch_order and seq_sample_probs must have the same shape. batch_order.shape={batch_order.shape}, seq_sample_probs.shape={seq_sample_probs.shape}"
+ def get_mock_dataloader():
+ while True:
+ # Sample a batch according to sequence weight
+ batch_index = np.random.choice(batch_order, training_parameters['batch_size'], p=seq_sample_probs).tolist()
+ batch = x_train[batch_index]
+ yield batch
+ dataloader = get_mock_dataloader()
+
+ self.Neff_training = np.sum(weights_train)
+
+ start = time.time()
+ train_loss = 0
+ print("debug starting training here:")
+ for training_step, batch in enumerate(tqdm(dataloader, desc="Training model", total=training_parameters['num_training_steps'], mininterval=2)):#mininterval=10)):
+
+ if training_step >= training_parameters['num_training_steps']:
+ print("debug Breaking at step", training_step)
+ break
+ x = batch.to(self.device, dtype=self.dtype)
+ if training_step == 0:
+ print("Got batch 1")
+
+ optimizer.zero_grad()
+
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized = self.loss_function(
+ recon_x_log, x, mu, log_var,
+ training_parameters['kl_latent_scale'],
+ training_parameters['kl_global_params_scale'],
+ training_parameters['annealing_warm_up'],
+ training_step,
+ self.Neff_training)
+
+ neg_ELBO.backward()
+ optimizer.step()
+
+ if training_parameters['use_lr_scheduler']:
+ scheduler.step()
+
+ if training_step % training_parameters['log_training_freq'] == 0:
+ progress = "|Train : Update {0}. Negative ELBO : {1:.3f}, BCE: {2:.3f}, KLD_latent: {3:.3f}, KLD_decoder_params_norm: {4:.3f}, Time: {5:.2f} |".format(
+ training_step, neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized, time.time() - start)
+ print(progress)
+
+ if training_parameters['log_training_info']:
+ with open(filename, "a+") as logs:
+ logs.write(progress + "\n")
+
+ if training_step % training_parameters['save_model_params_freq'] == 0:
+ self.save(model_checkpoint=training_parameters[
+ 'model_checkpoint_location'] + os.sep + self.model_name + "_step_" + str(
+ training_step),
+ encoder_parameters=self.encoder_parameters,
+ decoder_parameters=self.decoder_parameters,
+ training_parameters=training_parameters)
+
+ if training_parameters['use_validation_set'] and training_step % training_parameters[
+ 'validation_freq'] == 0:
+ x_val = torch.tensor(x_val, dtype=self.dtype).to(self.device)
+ val_neg_ELBO, val_BCE, val_KLD_latent, val_KLD_global_parameters = self.test_model(x_val, weights_val,
+ training_parameters[
+ 'batch_size'])
+
+ progress_val = "\t\t\t|Val : Update {0}. Negative ELBO : {1:.3f}, BCE: {2:.3f}, KLD_latent: {3:.3f}, KLD_decoder_params_norm: {4:.3f}, Time: {5:.2f} |".format(
+ training_step, val_neg_ELBO, val_BCE, val_KLD_latent, val_KLD_global_parameters,
+ time.time() - start)
+ print(progress_val)
+ if training_parameters['log_training_info']:
+ with open(filename, "a+") as logs:
+ logs.write(progress_val + "\n")
+
+ if val_neg_ELBO < best_val_loss:
+ best_val_loss = val_neg_ELBO
+ best_model_step_index = training_step
+ self.save(model_checkpoint=training_parameters[
+ 'model_checkpoint_location'] + os.sep + self.model_name + "_best",
+ encoder_parameters=self.encoder_parameters,
+ decoder_parameters=self.decoder_parameters,
+ training_parameters=training_parameters)
+ self.train()
+ print("TMP: Finished training, last training_step=", training_step)
+
+
+ def test_model(self, x_val, weights_val, batch_size):
+ self.eval()
+
+ with torch.no_grad():
+ val_batch_order = np.arange(x_val.shape[0])
+ val_seq_sample_probs = weights_val / np.sum(weights_val)
+
+ val_batch_index = np.random.choice(val_batch_order, batch_size, p=val_seq_sample_probs).tolist()
+ x = torch.tensor(x_val[val_batch_index], dtype=self.dtype).to(self.device)
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ neg_ELBO, BCE, KLD_latent, KLD_global_parameters = self.loss_function(recon_x_log, x, mu, log_var,
+ kl_latent_scale=1.0,
+ kl_global_params_scale=1.0,
+ annealing_warm_up=0, training_step=1,
+ Neff=self.Neff_training) # set annealing factor to 1
+
+ return neg_ELBO.item(), BCE.item(), KLD_latent.item(), KLD_global_parameters.item()
+
+ def save(self, model_checkpoint, encoder_parameters, decoder_parameters, training_parameters, batch_size=256):
+ # Create intermediate dirs above this
+ os.makedirs(os.path.dirname(model_checkpoint), exist_ok=True)
+ torch.save({
+ 'model_state_dict': self.state_dict(),
+ 'encoder_parameters': encoder_parameters,
+ 'decoder_parameters': decoder_parameters,
+ 'training_parameters': training_parameters,
+ }, model_checkpoint)
+
+ def compute_evol_indices(self, msa_data, list_mutations_location, num_samples, batch_size=256,
+ mutant_column="mutations", num_chunks=1, aggregation_method="full"):
+ list_valid_mutations = []
+ evol_indices = []
+
+ full_data = pd.read_csv(list_mutations_location, header=0)
+ print("Full length: ", len(full_data))
+ size_per_chunk = int(len(full_data) / num_chunks)
+ print("size_per_chunk: " + str(size_per_chunk))
+
+ for chunk in range(num_chunks):
+ print("chunk #: " + str(chunk))
+ data_chunk = full_data[chunk * size_per_chunk:(chunk + 1) * size_per_chunk]
+ list_valid_mutations_chunk, evol_indices_chunk, _, _ = self.compute_evol_indices_chunk(msa_data=msa_data,
+ list_mutations_location=data_chunk,
+ num_samples=num_samples,
+ batch_size=batch_size,
+ mutant_column=mutant_column,
+ aggregation_method=aggregation_method)
+ list_valid_mutations.extend(list(list_valid_mutations_chunk))
+ evol_indices.extend(list(evol_indices_chunk))
+ return list_valid_mutations, evol_indices, '', ''
+
+ def compute_evol_indices_chunk(self, msa_data, list_mutations_location, num_samples, batch_size=256,
+ mutant_column="mutations", aggregation_method="full"):
+ """
+ The column in the list_mutations dataframe that contains the mutant(s) for a given variant should be called "mutations"
+ """
+
+ # Note: wt is added inside this function, so no need to add a row in csv/dataframe input with wt
+
+ # Multiple mutations are to be passed colon-separated
+ list_mutations = list_mutations_location # pd.read_csv(list_mutations_location, header=0)
+
+ # Remove (multiple) mutations that are invalid
+ list_valid_mutations = ['wt']
+ list_valid_mutated_sequences = {}
+ list_valid_mutated_sequences['wt'] = msa_data.focus_seq_trimmed # first sequence in the list is the wild_type
+
+ if aggregation_method not in ["full", "batch", "online"]:
+ raise ValueError("Invalid aggregation method: {}".format(aggregation_method))
+
+ for mutation in list_mutations[mutant_column]:
+ try:
+ individual_substitutions = str(mutation).split(':')
+ except Exception as e:
+ print("Error with mutant {}".format(str(mutation)))
+ print("Specific error: " + str(e))
+ continue
+ mutated_sequence = list(msa_data.focus_seq_trimmed)[:]
+ fully_valid_mutation = True
+ for mut in individual_substitutions:
+ try:
+ wt_aa, pos, mut_aa = mut[0], int(mut[1:-1]), mut[-1]
+ if wt_aa == mut_aa:
+ continue
+ # Log specific invalid mutants
+ if pos not in msa_data.uniprot_focus_col_to_wt_aa_dict:
+ print("pos {} not in uniprot_focus_col_to_wt_aa_dict".format(pos))
+ fully_valid_mutation = False
+ # Given it's in the dict, check if it's a valid mutation
+ elif msa_data.uniprot_focus_col_to_wt_aa_dict[pos] != wt_aa:
+ print("wt_aa {} != uniprot_focus_col_to_wt_aa_dict[{}] {}".format(
+ wt_aa, pos, msa_data.uniprot_focus_col_to_wt_aa_dict[pos]))
+ fully_valid_mutation = False
+ if mut not in msa_data.mutant_to_letter_pos_idx_focus_list:
+ print("mut {} not in mutant_to_letter_pos_idx_focus_list".format(mut))
+ fully_valid_mutation = False
+
+ if fully_valid_mutation:
+ wt_aa, pos, idx_focus = msa_data.mutant_to_letter_pos_idx_focus_list[mut]
+ mutated_sequence[idx_focus] = mut_aa # perform the corresponding AA substitution
+ else:
+ print("Not a valid mutant: " + mutation)
+ break
+
+ except Exception as e:
+ print("Error processing mutation {} in mutant {}".format(str(mut), str(mutation)))
+ print("Specific error: " + str(e))
+ fully_valid_mutation = False
+ break
+
+ if fully_valid_mutation:
+ list_valid_mutations.append(mutation)
+ list_valid_mutated_sequences[mutation] = ''.join(mutated_sequence)
+
+ # One-hot encoding of mutated sequences
+ mutated_sequences_one_hot = one_hot_3D(list_valid_mutations, list_valid_mutated_sequences,
+ alphabet=msa_data.alphabet, seq_length=len(msa_data.focus_cols))
+
+ # TODO for low memory might need to calculate one-hot on the fly, or fix chunked calculation with elbo - elbo_wt
+ mutated_sequences_one_hot = torch.tensor(mutated_sequences_one_hot, dtype=torch.bool)
+ print("One-hot encoding of mutated sequences complete")
+ print(f"{datetime.datetime.now()} Peak memory in GB: {getrusage(RUSAGE_SELF).ru_maxrss / 1024 ** 2:.3f}")
+ # https://stackoverflow.com/questions/54361763/pytorch-why-is-the-memory-occupied-by-the-tensor-variable-so-small/54365012#54365012
+ print(
+ f"tmp debug: storage size of mutated_sequences_one_hot: {sys.getsizeof(mutated_sequences_one_hot.storage()) / 1e9:.4f} GB")
+ dataloader = torch.utils.data.DataLoader(mutated_sequences_one_hot, batch_size=batch_size, shuffle=False,
+ num_workers=4, pin_memory=True)
+
+ if aggregation_method == "full":
+ prediction_matrix = torch.zeros((len(list_valid_mutations), num_samples), dtype=self.dtype)
+ print(
+ f"tmp debug: storage size of mutated_sequences_one_hot: {sys.getsizeof(prediction_matrix.storage()) / 1e9:.4f} GB")
+ with torch.no_grad():
+ for i, batch in enumerate(tqdm(dataloader, 'Looping through mutation batches')):
+ x = batch.type(self.dtype).to(self.device)
+ for j in tqdm(range(num_samples),
+ 'Looping through number of samples for batch #: ' + str(i + 1), mininterval=5):
+ seq_predictions, _, _ = self.all_likelihood_components(x)
+ prediction_matrix[i * batch_size:i * batch_size + len(x), j] = seq_predictions
+ tqdm.write('\n')
+ mean_predictions = prediction_matrix.mean(dim=1, keepdim=False)
+ std_predictions = prediction_matrix.std(dim=1, keepdim=False)
+ delta_elbos = mean_predictions - mean_predictions[0]
+ evol_indices = - delta_elbos.detach().cpu().numpy()
+
+ elif aggregation_method == "batch":
+ # Reduce memory by factor of num_batches (num_valid_mutations / batch_size)
+ # Note: This will mean that higher memory GPU needs higher RAM because we store larger sample batches before aggregating
+ mean_predictions = torch.zeros(len(list_valid_mutations))
+ std_predictions = torch.zeros(len(list_valid_mutations))
+
+ with torch.no_grad():
+ for i, batch in enumerate(tqdm(dataloader, 'Looping through mutation batches')):
+ x = batch.type(self.dtype).to(self.device)
+
+ # Simplest: Aggregate mean and std each batch (instead of online per sample)
+ # Reduce memory by factor of num_batches (num_valid_mutations / batch_size)
+ batch_samples = torch.zeros(size=(len(x), 20_000),
+ dtype=self.dtype) # Store these on CPU to save GPU memory
+ for j in tqdm(range(num_samples),
+ 'Looping through number of samples for batch #: ' + str(i + 1), mininterval=1):
+ seq_predictions, _, _ = self.all_likelihood_components(x)
+ batch_samples[:, j] = seq_predictions.detach().cpu()
+
+ # Aggregate mean and std for this batch, this should be negligibly quick
+ mean_predictions[i * batch_size:i * batch_size + len(x)] = batch_samples.mean(dim=1, keepdim=False)
+ std_predictions[i * batch_size:i * batch_size + len(x)] = batch_samples.std(dim=1, keepdim=False)
+ tqdm.tqdm.write('\n')
+
+ delta_elbos = mean_predictions - mean_predictions[0]
+ evol_indices = - delta_elbos.detach().cpu().numpy()
+ elif aggregation_method == "online":
+ # Extension: Completely online, reduce memory by factor of num_samples (20k) with hopefully small overhead
+ mean_predictions = torch.zeros(len(list_valid_mutations))
+ std_predictions = torch.zeros(len(list_valid_mutations))
+ with torch.no_grad():
+ for i, batch in enumerate(tqdm(dataloader, 'Looping through mutation batches')):
+ x = batch.type(self.dtype).to(self.device)
+ print(
+ f"{datetime.datetime.now()} tmp Peak memory in GB: {getrusage(RUSAGE_SELF).ru_maxrss / 1024 ** 2:.3f}")
+ # Simplest: Aggregate mean and std online per sample
+ online_mean = torch.zeros(len(x), dtype=self.dtype, device=self.device)
+ online_s = torch.zeros(len(x), dtype=self.dtype, device=self.device)
+
+ # Run this once per batch to speed up remaining loop
+ mu, log_var = self.encoder(x)
+
+ for j in tqdm(range(num_samples),
+ 'Looping through number of samples for batch #: ' + str(i + 1), mininterval=5):
+ seq_predictions, _, _ = self.all_likelihood_components_z(x, mu, log_var)
+ # Using Welford's method https://stackoverflow.com/a/15638726/10447904
+ # All still on GPU
+ if j == 0:
+ online_mean = seq_predictions.detach()
+ # online_s stays 0
+ else:
+ delta = seq_predictions - online_mean
+ online_mean = online_mean + delta / (j + 1) # / n in original formula
+ online_s = online_s + delta * (seq_predictions - online_mean)
+
+ variance = online_s / (num_samples - 1) # j will end as n-1
+ std = variance.sqrt()
+ # Fill in mean and std arrays for this batch
+ mean_predictions[i * batch_size:i * batch_size + len(x)] = online_mean.detach().cpu()
+ std_predictions[i * batch_size:i * batch_size + len(x)] = std.detach().cpu()
+ tqdm.tqdm.write('\n')
+
+ delta_elbos = mean_predictions - mean_predictions[0]
+ evol_indices = - delta_elbos.detach().cpu().numpy()
+ else:
+ raise ValueError("Invalid aggregation method. Must be one of 'full', 'batch' or 'online'.")
+
+ return list_valid_mutations, evol_indices, mean_predictions[
+ 0].detach().cpu().numpy(), std_predictions.detach().cpu().numpy()
diff --git a/proteingym/baselines/EVE/EVE/deepseq_model_params.json b/proteingym/baselines/EVE/EVE/deepseq_model_params.json
new file mode 100644
index 0000000..1dddaef
--- /dev/null
+++ b/proteingym/baselines/EVE/EVE/deepseq_model_params.json
@@ -0,0 +1,39 @@
+{ "encoder_parameters": {
+ "hidden_layers_sizes" : [1500,1500],
+ "z_dim" : 30,
+ "convolve_input" : false,
+ "convolution_input_depth" : 40,
+ "nonlinear_activation" : "relu",
+ "dropout_proba" : 0.0
+},
+"decoder_parameters": {
+ "hidden_layers_sizes" : [100,2000],
+ "z_dim" : 30,
+ "bayesian_decoder" : true,
+ "first_hidden_nonlinearity" : "relu",
+ "last_hidden_nonlinearity" : "relu",
+ "dropout_proba" : 0.1,
+ "convolve_output" : true,
+ "convolution_output_depth" : 40,
+ "include_temperature_scaler" : true,
+ "include_sparsity" : true,
+ "num_tiles_sparsity" : 4,
+ "logit_sparsity_p" : 0.001
+},
+"training_parameters": {
+ "num_training_steps" : 300000,
+ "learning_rate" : 1e-3,
+ "batch_size" : 256,
+ "annealing_warm_up" : 0,
+ "kl_latent_scale" : 1.0,
+ "kl_global_params_scale" : 1.0,
+ "l2_regularization" : 0.0,
+ "use_lr_scheduler" : false,
+ "use_validation_set" : false,
+ "validation_set_pct" : 0.2,
+ "validation_freq" : 1000,
+ "log_training_info" : true,
+ "log_training_freq" : 1000,
+ "save_model_params_freq" : 500000
+}
+}
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/EVE/default_model_params.json b/proteingym/baselines/EVE/EVE/default_model_params.json
new file mode 100644
index 0000000..5d3b93c
--- /dev/null
+++ b/proteingym/baselines/EVE/EVE/default_model_params.json
@@ -0,0 +1,41 @@
+{ "encoder_parameters": {
+ "hidden_layers_sizes" : [2000,1000,300],
+ "z_dim" : 50,
+ "convolve_input" : false,
+ "convolution_input_depth" : 40,
+ "nonlinear_activation" : "relu",
+ "dropout_proba" : 0.0
+ },
+ "decoder_parameters": {
+ "hidden_layers_sizes" : [300,1000,2000],
+ "z_dim" : 50,
+ "bayesian_decoder" : true,
+ "first_hidden_nonlinearity" : "relu",
+ "last_hidden_nonlinearity" : "relu",
+ "dropout_proba" : 0.1,
+ "convolve_output" : true,
+ "convolution_output_depth" : 40,
+ "include_temperature_scaler" : true,
+ "include_sparsity" : false,
+ "num_tiles_sparsity" : 0,
+ "logit_sparsity_p" : 0
+ },
+ "training_parameters": {
+ "num_training_steps" : 400000,
+ "learning_rate" : 1e-4,
+ "batch_size" : 256,
+ "annealing_warm_up" : 0,
+ "kl_latent_scale" : 1.0,
+ "kl_global_params_scale" : 1.0,
+ "l2_regularization" : 0.0,
+ "use_lr_scheduler" : false,
+ "use_validation_set" : false,
+ "validation_set_pct" : 0.2,
+ "validation_freq" : 1000,
+ "log_training_info" : true,
+ "log_training_freq" : 1000,
+ "save_model_params_freq" : 500000
+ }
+}
+
+
diff --git a/proteingym/baselines/EVE/calc_weights.py b/proteingym/baselines/EVE/calc_weights.py
new file mode 100644
index 0000000..4e1b429
--- /dev/null
+++ b/proteingym/baselines/EVE/calc_weights.py
@@ -0,0 +1,120 @@
+# Basically train_VAE.py but just calculating the weights
+import argparse
+import os
+import time
+
+import numpy as np
+import pandas as pd
+
+from utils import data_utils
+
+
+def create_argparser():
+ parser = argparse.ArgumentParser(description='VAE')
+
+ # If we don't have a mapping file, just use a single MSA path
+ parser.add_argument("--MSA_filepath", type=str, help="Full path to MSA")
+
+ # If we have a mapping file with one MSA path per line
+ parser.add_argument('--MSA_data_folder', type=str, help='Folder where MSAs are stored', required=True)
+ parser.add_argument('--DMS_reference_file_path', type=str, help='List of proteins and corresponding MSA file name', required=True)
+ parser.add_argument('--DMS_index', type=int, help='Row index of protein in input mapping file', required=True)
+ parser.add_argument('--MSA_weights_location', type=str,
+ help='Location where weights for each sequence in the MSA will be stored', required=True)
+ parser.add_argument('--theta_reweighting', type=float, help='Parameters for MSA sequence re-weighting')
+ parser.add_argument("--num_cpus", type=int, help="Number of CPUs to use", default=1)
+ parser.add_argument("--skip_existing", help="Will quit gracefully if weights file already exists", action="store_true", default=False)
+ parser.add_argument("--overwrite", help="Will overwrite existing weights file", action="store_true", default=False)
+ parser.add_argument("--calc_method", choices=["evcouplings", "eve", "both", "identity"], help="Method to use for calculating weights. Note: Both produce the same results as we modified the evcouplings numba code to mirror the eve calculation", default="evcouplings")
+ parser.add_argument("--threshold_focus_cols_frac_gaps", type=float,
+ help="Maximum fraction of gaps allowed in focus columns - see data_utils.MSA_processing")
+ return parser
+
+
+def main(args):
+ print("Arguments:", args)
+
+ weights_file = None
+
+ if args.MSA_filepath is not None:
+ assert os.path.isfile(args.MSA_filepath), f"MSA filepath {args.MSA_filepath} doesn't exist"
+ msa_location = args.MSA_filepath
+ else:
+ # Use mapping file
+ assert os.path.isfile(args.DMS_reference_file_path), f"DMS reference file {args.DMS_reference_file_path} doesn't seem to exist"
+ mapping_file = pd.read_csv(args.DMS_reference_file_path)
+ protein_name = mapping_file['MSA_filename'][args.DMS_index].split(".a2m")[0]
+ msa_location = args.MSA_data_folder + os.sep + mapping_file['MSA_filename'][args.DMS_index]
+ print("Protein name: " + str(protein_name))
+ # If weights_file is in the df_mapping, use that instead
+ if "weight_file_name" in mapping_file.columns:
+ weights_file = args.MSA_weights_location + os.sep + mapping_file["weight_file_name"][args.DMS_index]
+ print("Using weights filename from mapping file:", weights_file)
+
+ print("MSA file: " + str(msa_location))
+
+ if args.theta_reweighting is not None:
+ theta = args.theta_reweighting
+ print(f"Using custom theta value {theta} instead of loading from mapping file.")
+ else:
+ try:
+ theta = float(mapping_file['MSA_theta'][args.DMS_index])
+ except KeyError as e:
+ # Overriding previous errors is bad, but we're being nice to the user
+ raise KeyError("Couldn't load theta from mapping file. "
+ "NOT using default value of theta=0.2; please specify theta manually. Specific line:",
+ mapping_file[args.DMS_index],
+ "Previous error:", e)
+ assert not np.isnan(theta), "Theta is NaN, please provide a custom theta value"
+
+ print("Theta MSA re-weighting: " + str(theta))
+
+ # Using data_kwargs so that if options aren't set, they'll be set to default values
+ data_kwargs = {}
+ if args.threshold_focus_cols_frac_gaps is not None:
+ print("Using custom threshold_focus_cols_frac_gaps: ", args.threshold_focus_cols_frac_gaps)
+ data_kwargs['threshold_focus_cols_frac_gaps'] = args.threshold_focus_cols_frac_gaps
+
+ if not os.path.isdir(args.MSA_weights_location):
+ os.makedirs(args.MSA_weights_location, exist_ok=True)
+ raise NotADirectoryError(f"{args.MSA_weights_location} is not a directory."
+ f"Could create it automatically, but at the moment raising an error.")
+ else:
+ print(f"MSA weights directory: {args.MSA_weights_location}")
+
+ if weights_file is None:
+ print("Weights filename not found - writing to new file")
+ weights_file = args.MSA_weights_location + os.sep + protein_name + '_theta_' + str(theta) + '.npy'
+
+ print(f"Writing to {weights_file}")
+ # First check that the weights file doesn't exist
+ if os.path.isfile(weights_file) and not args.overwrite:
+ if args.skip_existing:
+ print("Weights file already exists, skipping, since --skip_existing was specified")
+ exit(0)
+ else:
+ raise FileExistsError(f"File {weights_file} already exists. "
+ f"Please delete it if you want to re-calculate it. "
+ f"If you want to skip existing files, use --skip_existing.")
+
+ # The msa_data processing has a side effect of saving a weights file
+ _ = data_utils.MSA_processing(
+ MSA_location=msa_location,
+ theta=theta,
+ use_weights=True,
+ weights_location=weights_file,
+ num_cpus=args.num_cpus,
+ weights_calc_method=args.calc_method,
+ overwrite_weights=args.overwrite,
+ debug_only_weights=True,
+ **data_kwargs,
+ )
+
+
+if __name__ == '__main__':
+ start = time.perf_counter()
+ parser = create_argparser()
+ args = parser.parse_args()
+ main(args)
+ end = time.perf_counter()
+ print(f"calc_weights.py took {end-start:.2f} seconds in total.")
diff --git a/proteingym/baselines/EVE/compute_evol_indices_DMS.py b/proteingym/baselines/EVE/compute_evol_indices_DMS.py
new file mode 100644
index 0000000..d34c575
--- /dev/null
+++ b/proteingym/baselines/EVE/compute_evol_indices_DMS.py
@@ -0,0 +1,138 @@
+import datetime
+import os,sys
+import json
+import argparse
+from resource import getrusage, RUSAGE_SELF
+
+import pandas as pd
+import torch
+
+from EVE import VAE_model
+from utils import data_utils
+
+if __name__=='__main__':
+
+ parser = argparse.ArgumentParser(description='Evol indices')
+ parser.add_argument('--MSA_data_folder', type=str, help='Folder where MSAs are stored')
+ parser.add_argument('--DMS_reference_file_path', type=str, help='List of proteins and corresponding MSA file name')
+ parser.add_argument('--protein_index', type=int, help='Row index of protein in input mapping file')
+ parser.add_argument('--MSA_weights_location', type=str, help='Location where weights for each sequence in the MSA will be stored')
+ parser.add_argument('--theta_reweighting', type=float, help='Parameters for MSA sequence re-weighting')
+ parser.add_argument('--random_seeds',type=int,nargs="+", help='Random seed for VAE model initialization')
+ parser.add_argument('--VAE_checkpoint_location', type=str, help='Location where VAE model checkpoints will be stored')
+ parser.add_argument('--model_parameters_location', type=str, help='Location of VAE model parameters')
+ parser.add_argument('--DMS_data_folder', type=str, help='Location of all mutations to compute the evol indices for')
+ parser.add_argument('--output_scores_folder', type=str, help='Output location of computed evol indices')
+ parser.add_argument('--num_samples_compute_evol_indices', type=int, help='Num of samples to approximate delta elbo when computing evol indices')
+ parser.add_argument('--batch_size', default=256, type=int, help='Batch size when computing evol indices')
+ parser.add_argument("--skip_existing", action="store_true", help="Skip scoring if output file already exists")
+ parser.add_argument("--aggregation_method", choices=["full", "batch", "online"], default="full", help="Method to aggregate evol indices")
+ parser.add_argument("--threshold_focus_cols_frac_gaps", type=float,
+ help="Maximum fraction of gaps allowed in focus columns - see data_utils.MSA_processing")
+ args = parser.parse_args()
+
+ print("Arguments:", args)
+
+ assert os.path.isfile(args.DMS_reference_file_path), 'MSA list file does not exist: {}'.format(args.DMS_reference_file_path)
+ mapping_file = pd.read_csv(args.DMS_reference_file_path)
+ DMS_id = mapping_file['DMS_id'][args.protein_index]
+ protein_name = mapping_file['MSA_filename'][args.protein_index].split(".a2m")[0]
+ DMS_filename = mapping_file['DMS_filename'][args.protein_index]
+ mutant = mapping_file['DMS_filename'][args.protein_index]
+ msa_location = args.MSA_data_folder + os.sep + mapping_file['MSA_filename'][args.protein_index]
+ DMS_mutant_column = "mutant"
+ if not DMS_filename.startswith(DMS_id):
+ print(f"Warning: DMS id does not match DMS filename: {DMS_id} vs {DMS_filename}. Continuing for now.")
+
+ # Check filepaths are valid
+ evol_indices_output_filename = os.path.join(args.output_scores_folder, f'{DMS_id}.csv')
+
+
+ if os.path.isfile(evol_indices_output_filename):
+ print("Output file already exists: " + str(evol_indices_output_filename))
+
+ if args.skip_existing:
+ print("Skipping scoring since args.skip_existing is True")
+ sys.exit(0)
+ else:
+ print("Overwriting existing file: " + str(evol_indices_output_filename))
+ print("To skip scoring for existing files, use --skip_existing")
+ # Check if surrounding directory exists
+ else:
+ print("Output file: " + str(evol_indices_output_filename))
+ assert os.path.isdir(os.path.dirname(evol_indices_output_filename)), \
+ 'Output directory does not exist: {}. Please create directory before running script.\nOutput filename given: {}.\nDebugging curdir={}'\
+ .format(os.path.dirname(evol_indices_output_filename), evol_indices_output_filename, os.getcwd())
+
+ if args.theta_reweighting is not None:
+ theta = args.theta_reweighting
+ else:
+ try:
+ theta = float(mapping_file['MSA_theta'][args.protein_index])
+ except:
+ theta = 0.2
+ print("Theta MSA re-weighting: "+str(theta))
+
+ # Using data_kwargs so that if options aren't set, they'll be set to default values
+ data_kwargs = {}
+ if args.threshold_focus_cols_frac_gaps is not None:
+ print("Using custom threshold_focus_cols_frac_gaps: ", args.threshold_focus_cols_frac_gaps)
+ data_kwargs['threshold_focus_cols_frac_gaps'] = args.threshold_focus_cols_frac_gaps
+
+ data = data_utils.MSA_processing(
+ MSA_location=msa_location,
+ theta=theta,
+ use_weights=False, # Don't need weights for evol indices
+ **data_kwargs,
+ )
+
+
+ args.mutations_location = args.DMS_data_folder + os.sep + DMS_filename
+ for seed in args.random_seeds:
+ model_name = protein_name + f"_seed_{seed}"
+ print("Model name: "+str(model_name))
+
+ model_params = json.load(open(args.model_parameters_location))
+
+ model = VAE_model.VAE_model(
+ model_name=model_name,
+ data=data,
+ encoder_parameters=model_params["encoder_parameters"],
+ decoder_parameters=model_params["decoder_parameters"],
+ random_seed=42
+ )
+ model = model.to(model.device)
+ checkpoint_name = str(args.VAE_checkpoint_location) + os.sep + model_name
+ assert os.path.isfile(checkpoint_name), 'Checkpoint file does not exist: {}'.format(checkpoint_name)
+
+ try:
+ checkpoint = torch.load(checkpoint_name, map_location=model.device) # Added map_location so that this works with CPU too
+ model.load_state_dict(checkpoint['model_state_dict'])
+ print("Initialized VAE with checkpoint '{}' ".format(checkpoint_name))
+ except Exception as e:
+ print("Unable to load VAE model checkpoint {}".format(checkpoint_name))
+ raise e
+
+ list_valid_mutations, evol_indices, _, _ = model.compute_evol_indices(
+ msa_data=data,
+ list_mutations_location=args.mutations_location,
+ mutant_column=DMS_mutant_column,
+ num_samples=args.num_samples_compute_evol_indices,
+ batch_size=args.batch_size,
+ aggregation_method=args.aggregation_method
+ )
+
+ df = {}
+ df['mutant'] = list_valid_mutations
+ df[f'evol_indices_seed_{seed}'] = evol_indices
+ df = pd.DataFrame(df)
+
+ if os.path.exists(evol_indices_output_filename) and seed != args.random_seeds[0]:
+ prev_df = pd.read_csv(evol_indices_output_filename)
+ prev_len = len(prev_df)
+ df = pd.merge(prev_df, df, on='mutant', how='inner')
+ # checking that the mutants match after the first seed (first seed will overwrite original score file)
+ assert len(df) == len(prev_df), "Length of merged dataframe doesn't match previous length, mutants must not match across seeds"
+ df.to_csv(evol_indices_output_filename, index=False)
+ else:
+ df.to_csv(evol_indices_output_filename, index=False)
diff --git a/proteingym/baselines/EVE/train_VAE.py b/proteingym/baselines/EVE/train_VAE.py
new file mode 100644
index 0000000..32a9371
--- /dev/null
+++ b/proteingym/baselines/EVE/train_VAE.py
@@ -0,0 +1,167 @@
+import argparse
+import json
+import time
+import os
+
+
+import pandas as pd
+
+from EVE import VAE_model
+from utils import data_utils
+
+if __name__ == '__main__':
+ parser = argparse.ArgumentParser(description='VAE')
+ parser.add_argument('--MSA_data_folder', type=str,
+ help='Folder where MSAs are stored', required=True)
+ parser.add_argument('--DMS_reference_file_path', type=str,
+ help='List of proteins and corresponding MSA file name', required=True)
+ parser.add_argument('--protein_index', type=int,
+ help='Row index of protein in input mapping file', required=True)
+ parser.add_argument('--MSA_weights_location', type=str,
+ help='Location where weights for each sequence in the MSA will be stored', required=True)
+ parser.add_argument('--theta_reweighting', type=float,
+ help='Parameters for MSA sequence re-weighting')
+ parser.add_argument('--VAE_checkpoint_location', type=str,
+ help='Location where VAE model checkpoints will be stored', required=True)
+ parser.add_argument('--model_parameters_location', type=str,
+ help='Location of VAE model parameters', required=True)
+ parser.add_argument('--training_logs_location', type=str,
+ help='Location of VAE model parameters')
+ parser.add_argument("--seed", type=int, help="Random seed", default=42)
+ parser.add_argument(
+ '--z_dim', type=int, help='Specify a different latent dim than in the params file')
+ parser.add_argument("--threshold_focus_cols_frac_gaps", type=float,
+ help="Maximum fraction of gaps allowed in focus columns - see data_utils.MSA_processing")
+ parser.add_argument('--force_load_weights', action='store_true',
+ help="Force loading of weights from MSA_weights_location (useful if you want to make sure you're using precalculated weights). Will fail if weight file doesn't exist.",
+ default=False)
+ parser.add_argument("--overwrite_weights",
+ help="Will overwrite weights file if it already exists", action="store_true", default=False)
+ parser.add_argument("--skip_existing", help="Will quit gracefully if model checkpoint file already exists",
+ action="store_true", default=False)
+ parser.add_argument("--batch_size", type=int,
+ help="Batch size for training", default=None)
+ parser.add_argument("--experimental_stream_data",
+ help="Load one-hot-encodings on the fly", action="store_true", default=False)
+
+ args = parser.parse_args()
+
+ print("Arguments:", args)
+
+ assert os.path.isfile(
+ args.DMS_reference_file_path), f"MSA file list {args.DMS_reference_file_path} doesn't seem to exist"
+ mapping_file = pd.read_csv(args.DMS_reference_file_path)
+
+ if mapping_file["MSA_filename"].duplicated().any():
+ print(f"Note: Duplicate MSA_filename detected in the mapping file. Deduplicating to only have one EVE model per alignment.")
+ mapping_file = mapping_file.drop_duplicates(subset=["MSA_filename"])
+ protein_name = mapping_file['MSA_filename'][args.protein_index].split(".a2m")[
+ 0]
+ msa_location = args.MSA_data_folder + os.sep + \
+ mapping_file['MSA_filename'][args.protein_index]
+ print("Protein name: " + str(protein_name))
+ print("MSA file: " + str(msa_location))
+
+ if args.theta_reweighting is not None:
+ theta = args.theta_reweighting
+ else:
+ try:
+ theta = float(mapping_file['MSA_theta'][args.protein_index])
+ except:
+ print("Couldn't load theta from mapping file. Using default value of 0.2")
+ theta = 0.2
+
+ model_name = protein_name + f"_seed_{args.seed}"
+ print("Model name: " + str(model_name))
+ model_checkpoint_final_path = args.VAE_checkpoint_location + os.sep + model_name
+ if os.path.isfile(model_checkpoint_final_path):
+ if args.skip_existing:
+ print(
+ "Model checkpoint already exists, skipping, since --skip_existing was specified")
+ exit(0)
+ else:
+ raise FileExistsError(f"Model checkpoint {model_checkpoint_final_path} already exists. \
+ Use --skip_existing to skip without raising an error, or delete the destination file if you want to rerun.")
+
+ # Using data_kwargs so that if options aren't set, they'll be set to default values
+ data_kwargs = {}
+ if args.threshold_focus_cols_frac_gaps is not None:
+ print("Using custom threshold_focus_cols_frac_gaps: ",
+ args.threshold_focus_cols_frac_gaps)
+ data_kwargs['threshold_focus_cols_frac_gaps'] = args.threshold_focus_cols_frac_gaps
+
+ if args.overwrite_weights:
+ print("Overwriting weights file")
+ data_kwargs['overwrite_weights'] = True
+
+ print("Theta MSA re-weighting: " + str(theta))
+
+ # Load weights file if it's in the mapping file
+ if "weight_file_name" in mapping_file.columns:
+ weights_file = args.MSA_weights_location + os.sep + \
+ mapping_file["weight_file_name"][args.protein_index]
+ print("Using weights filename from mapping file")
+ else:
+ print(
+ f"weight_file_name not provided in mapping file. Using default weights filename of {protein_name}_theta_{theta}.npy")
+ weights_file = args.MSA_weights_location + os.sep + \
+ protein_name + '_theta_' + str(theta) + '.npy'
+
+ print(f"Weights location: {weights_file}")
+
+ if args.force_load_weights:
+ print("Flag force_load_weights enabled - Forcing that we use weights from file:", weights_file)
+ if not os.path.isfile(weights_file):
+ raise FileNotFoundError(f"Weights file {weights_file} doesn't exist."
+ f"To recompute weights, remove the flag --force_load_weights.")
+
+ data = data_utils.MSA_processing(
+ MSA_location=msa_location,
+ theta=theta,
+ use_weights=True,
+ weights_location=weights_file,
+ debug_only_weights=args.experimental_stream_data,
+ **data_kwargs,
+ )
+
+ assert os.path.isfile(
+ args.model_parameters_location), args.model_parameters_location
+ model_params = json.load(open(args.model_parameters_location))
+
+ # Overwrite params if necessary
+ if args.z_dim:
+ model_params["encoder_parameters"]["z_dim"] = args.z_dim
+ model_params["decoder_parameters"]["z_dim"] = args.z_dim
+ if args.batch_size is not None:
+ print("Using batch_size from command line: ", args.batch_size)
+ model_params["training_parameters"]["batch_size"] = args.batch_size
+
+ model = VAE_model.VAE_model(
+ model_name=model_name,
+ data=data,
+ encoder_parameters=model_params["encoder_parameters"],
+ decoder_parameters=model_params["decoder_parameters"],
+ random_seed=args.seed
+ )
+ model = model.to(model.device)
+
+ model_params["training_parameters"]['training_logs_location'] = args.training_logs_location
+ model_params["training_parameters"]['model_checkpoint_location'] = args.VAE_checkpoint_location
+
+ print("debug batch size=",
+ model_params["training_parameters"]["batch_size"])
+ print("Starting to train model: " + model_name)
+ start = time.perf_counter()
+ model.train_model(
+ data=data, training_parameters=model_params["training_parameters"], use_dataloader=args.experimental_stream_data)
+ end = time.perf_counter()
+ # Show time in hours,minutes,seconds
+ print(
+ f"Finished in {(end - start)//60//60}hours {(end - start)//60%60} minutes and {(end - start)%60} seconds")
+
+ print("Saving model: " + model_name)
+ model.save(model_checkpoint=model_checkpoint_final_path,
+ encoder_parameters=model_params["encoder_parameters"],
+ decoder_parameters=model_params["decoder_parameters"],
+ training_parameters=model_params["training_parameters"]
+ )
diff --git a/proteingym/baselines/EVE/utils/__init__.py b/proteingym/baselines/EVE/utils/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/proteingym/baselines/EVE/utils/constants.py b/proteingym/baselines/EVE/utils/constants.py
new file mode 100644
index 0000000..b6de0a9
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/constants.py
@@ -0,0 +1,6 @@
+# Copied from EVCouplings
+GAP = "-"
+MATCH_GAP = GAP
+INSERT_GAP = "."
+ALPHABET_PROTEIN_NOGAP = "ACDEFGHIKLMNPQRSTVWY"
+ALPHABET_PROTEIN_GAP = GAP + ALPHABET_PROTEIN_NOGAP
diff --git a/proteingym/baselines/EVE/utils/data_utils.py b/proteingym/baselines/EVE/utils/data_utils.py
new file mode 100644
index 0000000..a889a01
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/data_utils.py
@@ -0,0 +1,471 @@
+import multiprocessing
+import os
+import time
+from collections import defaultdict
+
+import numpy as np
+import pandas as pd
+from tqdm import tqdm
+import torch
+from torch.utils.data import Dataset, DataLoader, WeightedRandomSampler
+
+from utils.weights import map_from_alphabet, map_matrix, compute_sequence_weights, calc_weights_evcouplings
+
+# TODOs: Can get rid of one_hot_encodings when only calculating weights right?
+
+# constants
+GAP = "-"
+MATCH_GAP = GAP
+INSERT_GAP = "."
+
+ALPHABET_PROTEIN_NOGAP = "ACDEFGHIKLMNPQRSTVWY"
+ALPHABET_PROTEIN_GAP = GAP + ALPHABET_PROTEIN_NOGAP
+
+
+class MSA_processing:
+ def __init__(self,
+ MSA_location="",
+ theta=0.2,
+ use_weights=True,
+ weights_location="./data/weights",
+ preprocess_MSA=True,
+ threshold_sequence_frac_gaps=0.5,
+ threshold_focus_cols_frac_gaps=0.3,
+ remove_sequences_with_indeterminate_AA_in_focus_cols=True,
+ num_cpus=1,
+ weights_calc_method="evcouplings",
+ overwrite_weights=False,
+ debug_only_weights=False,
+ ):
+
+ """
+ Parameters:
+ - msa_location: (path) Location of the MSA data. Constraints on input MSA format:
+ - focus_sequence is the first one in the MSA data
+ - first line is structured as follows: ">focus_seq_name/start_pos-end_pos" (e.g., >SPIKE_SARS2/310-550)
+ - corresponding sequence data located on following line(s)
+ - then all other sequences follow with ">name" on first line, corresponding data on subsequent lines
+ - theta: (float) Sequence weighting hyperparameter. Generally: Prokaryotic and eukaryotic families = 0.2; Viruses = 0.01
+ - use_weights: (bool) If False, sets all sequence weights to 1. If True, checks weights_location -- if non empty uses that;
+ otherwise compute weights from scratch and store them at weights_location
+ - weights_location: (path) File to load from/save to the sequence weights
+ - preprocess_MSA: (bool) performs pre-processing of MSA to remove short fragments and positions that are not well covered.
+ - threshold_sequence_frac_gaps: (float, between 0 and 1) Threshold value to define fragments
+ - sequences with a fraction of gap characters above threshold_sequence_frac_gaps are removed
+ - default is set to 0.5 (i.e., fragments with 50% or more gaps are removed)
+ - threshold_focus_cols_frac_gaps: (float, between 0 and 1) Threshold value to define focus columns
+ - positions with a fraction of gap characters above threshold_focus_cols_pct_gaps will be set to lower case (and not included in the focus_cols)
+ - default is set to 0.3 (i.e., focus positions are the ones with 30% of gaps or less, i.e., 70% or more residue occupancy)
+ - remove_sequences_with_indeterminate_AA_in_focus_cols: (bool) Remove all sequences that have indeterminate AA (e.g., B, J, X, Z) at focus positions of the wild type
+ - num_cpus: (int) Number of CPUs to use for parallel weights calculation processing. If set to -1, all available CPUs are used. If set to 1, weights are computed in serial.
+ - weights_calc_method: (str) Method to use for calculating sequence weights. Options: "evcouplings","eve" or "identity". (default "evcouplings")
+ - Note: For now the "evcouplings" method is modified to be equivalent to the "eve" method,
+ but the "evcouplings" method is faster as it uses numba.
+ - overwrite_weights: (bool) If True, calculate weights and overwrite weights file. If False, load weights from weights_location if it exists.
+ TODO these weights options should be more like calc_weights=[True/False], and the weights_location should be a list of locations to load from/save to.
+ """
+ np.random.seed(2021)
+ self.MSA_location = MSA_location
+ self.weights_location = weights_location
+ self.theta = theta
+ self.alphabet = ALPHABET_PROTEIN_NOGAP
+ self.use_weights = use_weights
+ self.overwrite_weights = overwrite_weights
+ self.preprocess_MSA = preprocess_MSA
+ self.threshold_sequence_frac_gaps = threshold_sequence_frac_gaps
+ self.threshold_focus_cols_frac_gaps = threshold_focus_cols_frac_gaps
+ self.remove_sequences_with_indeterminate_AA_in_focus_cols = remove_sequences_with_indeterminate_AA_in_focus_cols
+ self.debug_only_weights = debug_only_weights
+ self.weights_calc_method = weights_calc_method
+
+ # Defined by gen_alignment
+ self.aa_dict = {}
+ self.focus_seq_name = ""
+ self.seq_name_to_sequence = defaultdict(str)
+ self.focus_seq, self.focus_cols, self.focus_seq_trimmed, self.seq_len, self.alphabet_size = [None] * 5
+ self.focus_start_loc, self.focus_stop_loc = None, None
+ self.uniprot_focus_col_to_wt_aa_dict, self.uniprot_focus_col_to_focus_idx = None, None
+ self.one_hot_encoding, self.weights, self.Neff, self.num_sequences = [None] * 4
+
+ # Defined by create_all_singles
+ self.mutant_to_letter_pos_idx_focus_list = None
+ self.all_single_mutations = None
+
+ # Fill in the instance variables
+ self.gen_alignment()
+ self.calc_weights(num_cpus=num_cpus, method=weights_calc_method)
+
+ if not self.debug_only_weights:
+ print("Creating all single mutations")
+ self.create_all_singles()
+
+ def gen_alignment(self):
+ """ Read training alignment and store basics in class instance """
+ self.aa_dict = {}
+ for i, aa in enumerate(self.alphabet):
+ self.aa_dict[aa] = i
+
+ self.seq_name_to_sequence = defaultdict(str)
+ name = ""
+ with open(self.MSA_location, "r") as msa_data:
+ for i, line in enumerate(msa_data):
+ line = line.rstrip()
+ if line.startswith(">"):
+ name = line
+ if i == 0:
+ self.focus_seq_name = name
+ else:
+ self.seq_name_to_sequence[name] += line
+ print("Number of sequences in MSA (before preprocessing):", len(self.seq_name_to_sequence))
+
+ ## MSA pre-processing to remove inadequate columns and sequences
+ if self.preprocess_MSA:
+ # Overwrite self.seq_name_to_sequence
+ self.seq_name_to_sequence = self.preprocess_msa(
+ seq_name_to_sequence=self.seq_name_to_sequence,
+ focus_seq_name=self.focus_seq_name,
+ threshold_sequence_frac_gaps=self.threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=self.threshold_focus_cols_frac_gaps
+ )
+
+ self.focus_seq = self.seq_name_to_sequence[self.focus_seq_name]
+ self.focus_cols = [ix for ix, s in enumerate(self.focus_seq) if s == s.upper() and s != '-']
+ self.focus_seq_trimmed = "".join([self.focus_seq[ix] for ix in self.focus_cols])
+ self.seq_len = len(self.focus_cols)
+ self.alphabet_size = len(self.alphabet)
+
+ # Connect local sequence index with uniprot index (index shift inferred from 1st row of MSA)
+ focus_loc = self.focus_seq_name.split("/")[-1]
+ start, stop = focus_loc.split("-")
+ self.focus_start_loc = int(start)
+ self.focus_stop_loc = int(stop)
+ self.uniprot_focus_col_to_wt_aa_dict \
+ = {idx_col + int(start): self.focus_seq[idx_col] for idx_col in self.focus_cols}
+ self.uniprot_focus_col_to_focus_idx \
+ = {idx_col + int(start): idx_col for idx_col in self.focus_cols}
+
+ # Move all letters to CAPS; keeps focus columns only
+ for seq_name, sequence in self.seq_name_to_sequence.items():
+ sequence = sequence.replace(".", "-")
+ self.seq_name_to_sequence[seq_name] = "".join(
+ [sequence[ix].upper() for ix in self.focus_cols]) # Makes a List[str] instead of str
+
+ # Remove sequences that have indeterminate AA (e.g., B, J, X, Z) in the focus columns
+ if self.remove_sequences_with_indeterminate_AA_in_focus_cols:
+ alphabet_set = set(list(self.alphabet))
+ seq_names_to_remove = []
+ for seq_name, sequence in self.seq_name_to_sequence.items():
+ for letter in sequence:
+ if letter not in alphabet_set and letter != "-":
+ seq_names_to_remove.append(seq_name)
+ continue
+ seq_names_to_remove = list(set(seq_names_to_remove))
+ for seq_name in seq_names_to_remove:
+ del self.seq_name_to_sequence[seq_name]
+
+ print("Number of sequences after preprocessing:", len(self.seq_name_to_sequence))
+
+ if self.debug_only_weights and self.weights_calc_method == "evcouplings":
+ print("Weights-only mode with evcouplings: Skipping one-hot encodings")
+ else:
+ # Encode the sequences
+ print("One-hot encoding sequences")
+ self.one_hot_encoding = one_hot_3D(
+ seq_keys=self.seq_name_to_sequence.keys(), # Note: Dicts are unordered for python < 3.6
+ seq_name_to_sequence=self.seq_name_to_sequence,
+ alphabet=self.alphabet,
+ seq_length=self.seq_len,
+ )
+
+ # Using staticmethod to keep this under the MSAProcessing namespace, but this is apparently not best practice
+ @staticmethod
+ def preprocess_msa(seq_name_to_sequence, focus_seq_name, threshold_sequence_frac_gaps, threshold_focus_cols_frac_gaps):
+ """Remove inadequate columns and sequences from MSA, overwrite self.seq_name_to_sequence."""
+ print("Pre-processing MSA to remove inadequate columns and sequences...")
+ msa_df = pd.DataFrame.from_dict(seq_name_to_sequence, orient='index', columns=['sequence'])
+ # Data clean up
+ msa_df.sequence = msa_df.sequence.apply(lambda x: x.replace(".", "-")).apply(
+ lambda x: ''.join([aa.upper() for aa in x]))
+ # Remove columns that would be gaps in the wild type
+ non_gap_wt_cols = [aa != '-' for aa in msa_df.sequence[focus_seq_name]]
+ msa_df['sequence'] = msa_df['sequence'].apply(
+ lambda x: ''.join([aa for aa, non_gap_ind in zip(x, non_gap_wt_cols) if non_gap_ind]))
+ assert 0.0 <= threshold_sequence_frac_gaps <= 1.0, "Invalid fragment filtering parameter"
+ assert 0.0 <= threshold_focus_cols_frac_gaps <= 1.0, "Invalid focus position filtering parameter"
+ print("Calculating proportion of gaps")
+ msa_array = np.array([list(seq) for seq in msa_df.sequence])
+ gaps_array = np.array(list(map(lambda seq: [aa == '-' for aa in seq], msa_array)))
+ # Identify fragments with too many gaps
+ seq_gaps_frac = gaps_array.mean(axis=1)
+ seq_below_threshold = seq_gaps_frac <= threshold_sequence_frac_gaps
+ print("Proportion of sequences dropped due to fraction of gaps: " + str(
+ round(float(1 - seq_below_threshold.sum() / seq_below_threshold.shape) * 100, 2)) + "%")
+ # Identify focus columns
+ columns_gaps_frac = gaps_array[seq_below_threshold].mean(axis=0)
+ index_cols_below_threshold = columns_gaps_frac <= threshold_focus_cols_frac_gaps
+ print("Proportion of non-focus columns removed: " + str(
+ round(float(1 - index_cols_below_threshold.sum() / index_cols_below_threshold.shape) * 100, 2)) + "%")
+ # Lower case non focus cols and filter fragment sequences
+ def _lower_case_and_filter_fragments(seq):
+ return ''.join([aa.lower() if aa_ix in index_cols_below_threshold else aa for aa_ix, aa in enumerate(seq)])
+ msa_df['sequence'] = msa_df['sequence'].apply(
+ lambda seq: ''.join([aa.upper() if upper_case_ind else aa.lower() for aa, upper_case_ind in
+ zip(seq, index_cols_below_threshold)]))
+ msa_df = msa_df[seq_below_threshold]
+ # Overwrite seq_name_to_sequence with clean version
+ seq_name_to_sequence = defaultdict(str)
+ for seq_idx in range(len(msa_df['sequence'])):
+ seq_name_to_sequence[msa_df.index[seq_idx]] = msa_df.sequence[seq_idx]
+
+ return seq_name_to_sequence
+
+ def calc_weights(self, num_cpus=1, method="evcouplings"):
+ """
+ If num_cpus == 1, weights are computed in serial.
+ If num_cpus == -1, weights are computed in parallel using all available cores.
+ Note: This will use multiprocessing.cpu_count() to get the number of available cores, which on clusters may
+ return all cores, not just the number of cores available to the user.
+ """
+ # Refactored into its own function so that we can call it separately
+ if self.use_weights:
+ if os.path.isfile(self.weights_location) and not self.overwrite_weights:
+ print("Loading sequence weights from disk")
+ self.weights = np.load(file=self.weights_location)
+ else:
+ print("Computing sequence weights")
+ if num_cpus == -1:
+ #multiprocessing.cpu_count()
+ num_cpus = get_num_cpus()
+
+ if method == "evcouplings":
+ alphabet_mapper = map_from_alphabet(ALPHABET_PROTEIN_GAP, default=GAP)
+ arrays = []
+ for seq in self.seq_name_to_sequence.values():
+ arrays.append(np.array(list(seq)))
+ sequences = np.vstack(arrays)
+ sequences_mapped = map_matrix(sequences, alphabet_mapper)
+ print("Starting EVCouplings calculation")
+ start = time.perf_counter()
+ self.weights = calc_weights_evcouplings(sequences_mapped, identity_threshold=1 - self.theta,
+ empty_value=0, num_cpus=num_cpus) # GAP = 0
+ end = time.perf_counter()
+ print(f"EVCouplings weights took {end - start:.2f} seconds")
+ elif method == "eve":
+ list_seq = self.one_hot_encoding.numpy()
+ start = time.perf_counter()
+ self.weights = compute_sequence_weights(list_seq, self.theta, num_cpus=num_cpus)
+ end = time.perf_counter()
+ print(f"EVE weights took {end - start:.2f} seconds")
+ elif method == "identity":
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+ else:
+ raise ValueError(f"Unknown method: {method}. Must be either 'evcouplings', 'eve' or 'identity'.")
+ print("Saving sequence weights to disk")
+ np.save(file=self.weights_location, arr=self.weights)
+ else:
+ # If not using weights, use an isotropic weight matrix
+ print("Not weighting sequence data")
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+
+ self.Neff = np.sum(self.weights)
+ self.num_sequences = self.weights.shape[0]
+
+ print("Neff =", str(self.Neff))
+
+ if self.debug_only_weights and self.weights_calc_method == "evcouplings":
+ print("Num sequences: ", self.num_sequences)
+ else:
+ print("Data Shape =", self.one_hot_encoding.shape)
+
+ return self.weights
+
+ def create_all_singles(self):
+ start_idx = self.focus_start_loc
+ focus_seq_index = 0
+ self.mutant_to_letter_pos_idx_focus_list = {}
+ list_valid_mutations = []
+ # find all possible valid mutations that can be run with this alignment
+ alphabet_set = set(list(self.alphabet))
+ for i, letter in enumerate(self.focus_seq):
+ if letter in alphabet_set and letter != "-":
+ for mut in self.alphabet:
+ pos = start_idx + i
+ if mut != letter:
+ mutant = letter + str(pos) + mut
+ self.mutant_to_letter_pos_idx_focus_list[mutant] = [letter, pos, focus_seq_index]
+ list_valid_mutations.append(mutant)
+ focus_seq_index += 1
+ self.all_single_mutations = list_valid_mutations
+
+ def save_all_singles(self, output_filename):
+ with open(output_filename, "w") as output:
+ output.write('mutations')
+ for mutation in self.all_single_mutations:
+ output.write('\n')
+ output.write(mutation)
+
+
+def generate_mutated_sequences(msa_data, list_mutations):
+ """
+ Copied from VAE_model.compute_evol_indices.
+
+ Generate mutated sequences using a MSAProcessing data object and list of mutations of the form "A42T" where position
+ 42 on the wild type is changed from A to T.
+ Multiple mutations are separated by colons e.g. "A42T:C9A"
+
+ Returns a tuple (list_valid_mutations, valid_mutated_sequences),
+ e.g. (['wt', 'A3T'], {'wt': 'AGAKLI', 'A3T': 'AGTKLI'})
+ """
+ list_valid_mutations = ['wt']
+ valid_mutated_sequences = {}
+ valid_mutated_sequences['wt'] = msa_data.focus_seq_trimmed # first sequence in the list is the wild_type
+
+ # Remove (multiple) mutations that are invalid
+ for mutation in list_mutations:
+ individual_substitutions = mutation.split(':')
+ mutated_sequence = list(msa_data.focus_seq_trimmed)[:]
+ fully_valid_mutation = True
+ for mut in individual_substitutions:
+ wt_aa, pos, mut_aa = mut[0], int(mut[1:-1]), mut[-1]
+ if pos not in msa_data.uniprot_focus_col_to_wt_aa_dict \
+ or msa_data.uniprot_focus_col_to_wt_aa_dict[pos] != wt_aa \
+ or mut not in msa_data.mutant_to_letter_pos_idx_focus_list:
+ print("Not a valid mutant: " + mutation)
+ fully_valid_mutation = False
+ break
+ else:
+ wt_aa, pos, idx_focus = msa_data.mutant_to_letter_pos_idx_focus_list[mut]
+ mutated_sequence[idx_focus] = mut_aa # perform the corresponding AA substitution
+
+ if fully_valid_mutation:
+ list_valid_mutations.append(mutation)
+ valid_mutated_sequences[mutation] = ''.join(mutated_sequence)
+
+ return list_valid_mutations, valid_mutated_sequences
+
+
+# Copied from VAE_model.compute_evol_indices
+# One-hot encoding of sequences
+def one_hot_3D(seq_keys, seq_name_to_sequence, alphabet, seq_length):
+ """
+ Take in a list of sequence names/keys and corresponding sequences, and generate a one-hot array according to an alphabet.
+ """
+ aa_dict = {letter: i for (i, letter) in enumerate(alphabet)}
+
+ one_hot_out = np.zeros((len(seq_keys), seq_length, len(alphabet)))
+ for i, seq_key in enumerate(tqdm(seq_keys, desc="One-hot encoding sequences", mininterval=1)):
+ sequence = seq_name_to_sequence[seq_key]
+ for j, letter in enumerate(sequence):
+ if letter in aa_dict:
+ k = aa_dict[letter]
+ one_hot_out[i, j, k] = 1.0
+ one_hot_out = torch.tensor(one_hot_out)
+ return one_hot_out
+
+
+def gen_one_hot_to_sequence(one_hot_tensor, alphabet):
+ """Reverse of one_hot_3D. Need the msa_data again. Returns a list of sequences."""
+ for seq_tensor in one_hot_tensor: # iterate through outer dimension
+ seq = ""
+ letters_idx = seq_tensor.argmax(-1)
+
+ for idx in letters_idx.tolist(): # Could also do map(di.get, letters_idx)
+ letter = alphabet[idx]
+ seq += letter
+ yield seq
+
+
+def one_hot_to_sequence_list(one_hot_tensor, alphabet):
+ return list(gen_one_hot_to_sequence(one_hot_tensor, alphabet))
+
+def get_one_hot_3D_fn(msa_data):
+ aa_dict = {letter: i for (i, letter) in enumerate(msa_data.alphabet)}
+
+ def fn(batch_seqs):
+ one_hot_out = np.zeros((len(batch_seqs), msa_data.seq_len, len(msa_data.alphabet)))
+ for i, sequence in enumerate(batch_seqs):
+ for j, letter in enumerate(sequence):
+ if letter in aa_dict:
+ k = aa_dict[letter]
+ one_hot_out[i, j, k] = 1.0
+ one_hot_out = torch.tensor(one_hot_out)
+ return one_hot_out
+ return fn
+
+def get_num_cpus():
+ if 'SLURM_CPUS_PER_TASK' in os.environ:
+ num_cpus = int(os.environ['SLURM_CPUS_PER_TASK'])
+ print("SLURM_CPUS_PER_TASK:", os.environ['SLURM_CPUS_PER_TASK'])
+ print("Using all available cores (calculated using SLURM_CPUS_PER_TASK):", num_cpus)
+ else:
+ num_cpus = len(os.sched_getaffinity(0))
+ print("Using all available cores (calculated using len(os.sched_getaffinity(0))):", num_cpus)
+ return num_cpus
+
+class OneHotDataset(Dataset):
+ def __init__(self, seq_keys, seq_name_to_sequence, alphabet, seq_length, total_length=None):
+ self.seq_keys = list(seq_keys)
+ self.seq_name_to_sequence = seq_name_to_sequence
+ self.alphabet = alphabet
+ self.seq_length = seq_length
+ self.aa_dict = {letter: i for (i, letter) in enumerate(alphabet)}
+ if total_length is None:
+ self.total_length = len(self.seq_keys)
+ else:
+ self.total_length = int(total_length)
+
+ def __len__(self):
+ return self.total_length
+
+ def __getitem__(self, idx):
+ seq_key = self.seq_keys[idx]
+ sequence = self.seq_name_to_sequence[seq_key]
+ return sequence
+
+class InfiniteDataLoader(DataLoader):
+ def __init__(self, *args, **kwargs):
+ super().__init__(*args, **kwargs)
+ self.iter_loader = super().__iter__()
+
+ def __iter__(self):
+ return self
+
+ def __next__(self):
+ try:
+ batch = next(self.iter_loader)
+ except StopIteration:
+ # If the inner DataLoader has exhausted the dataset, reset it
+ self.iter_loader = super().__iter__()
+ batch = next(self.iter_loader)
+ return batch
+
+def get_dataloader(msa_data: MSA_processing, batch_size, num_training_steps):
+ print("Going to hackily set the length of the dataset to the number of training steps, not the actual number of sequences.")
+ dataset = OneHotDataset(
+ seq_keys=msa_data.seq_name_to_sequence.keys(),
+ seq_name_to_sequence=msa_data.seq_name_to_sequence,
+ alphabet=msa_data.alphabet,
+ seq_length=msa_data.seq_len) #, total_length=num_training_steps
+ # This can take a ton of memory if the weights or num_training_steps*batch_size are large
+ sampler = WeightedRandomSampler(weights=msa_data.weights, num_samples=num_training_steps*batch_size, replacement=True)
+ num_cpus = 1 # get_num_cpus() # TODO test with only 1 CPU
+
+ one_hot_fn = get_one_hot_3D_fn(msa_data)
+
+ def collate_fn(batch_seqs):
+ # Construct a batch of one-hot-encodings
+ batch_seq_tensor = one_hot_fn(batch_seqs)
+ return batch_seq_tensor
+
+
+ # dataloader = DataLoader(dataset,
+
+ # Other option for avoiding the problem of the dataset running out: Wrap it with an iterable that refreshes it every time
+ dataloader = InfiniteDataLoader(
+ dataset=dataset,
+ batch_size=batch_size,
+ num_workers=num_cpus, # collate_fn is not parallelized, so no speedup with multiple CPUs
+ sampler=sampler,
+ collate_fn=collate_fn,) #pin_memory=True
+
+ return dataloader
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/utils/default_uncertainty_threshold.json b/proteingym/baselines/EVE/utils/default_uncertainty_threshold.json
new file mode 100644
index 0000000..bbc14e9
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/default_uncertainty_threshold.json
@@ -0,0 +1,20 @@
+{
+ "deciles": {
+ "1":0.2197550519787074,
+ "2":0.3073271315004903,
+ "3":0.380894592583295,
+ "4":0.452448591753785,
+ "5":0.5199203860577154,
+ "6":0.5610673789803645,
+ "7":0.6028917246467301,
+ "8":0.6483370062758973,
+ "9":0.678595588420331,
+ "10":0.6931471766074847
+ },
+ "quartiles": {
+ "1":0.3443212858392093,
+ "2":0.5199203860577154,
+ "3":0.6272364669515783,
+ "4":0.6931471766074847
+ }
+}
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/utils/performance_helpers.py b/proteingym/baselines/EVE/utils/performance_helpers.py
new file mode 100644
index 0000000..248dde8
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/performance_helpers.py
@@ -0,0 +1,143 @@
+from sklearn.metrics import roc_auc_score
+import numpy as np
+import pandas as pd
+
+def compute_stats(input_array):
+ return {
+ 'mean':input_array.mean(),
+ 'std':input_array.std(),
+ 'min':input_array.min(),
+ 'max':input_array.max(),
+ 'P1':np.percentile(input_array,1),
+ 'P5':np.percentile(input_array,5),
+ 'P10':np.percentile(input_array,10),
+ 'P25':np.percentile(input_array,25),
+ 'P33':np.percentile(input_array,33),
+ 'P40':np.percentile(input_array,40),
+ 'P45':np.percentile(input_array,45),
+ 'P50':np.median(input_array),
+ 'P55':np.percentile(input_array,55),
+ 'P60':np.percentile(input_array,60),
+ 'P66':np.percentile(input_array,66),
+ 'P75':np.percentile(input_array,75),
+ 'P90':np.percentile(input_array,90),
+ 'P95':np.percentile(input_array,95),
+ 'P99':np.percentile(input_array,99)
+}
+
+def compute_accuracy_with_uncertain(class_pred, labels):
+ temp_df = pd.DataFrame({'class_pred': class_pred.copy(),'labels': labels.copy()})
+ initial_num_obs = len(temp_df['labels'])
+ temp_df=temp_df[temp_df['class_pred'] != 'Uncertain']
+ filtered_num_obs = len(temp_df['labels'])
+ temp_df['class_pred_bin'] = temp_df['class_pred'].map(lambda x: 1 if x == 'Pathogenic' else 0)
+ correct_classification = (temp_df['class_pred_bin'] == temp_df['labels']).astype(int)
+ accuracy = round(correct_classification.mean()*100,1)
+ pct_mutations_kept = round(filtered_num_obs/float(initial_num_obs)*100,1)
+ return accuracy, pct_mutations_kept
+
+def compute_AUC_overall_with_uncertain(scores, class_pred, labels):
+ temp_df = pd.DataFrame({'class_pred': class_pred.copy(),'labels': labels.copy(), 'scores': scores.copy()})
+ temp_df=temp_df[temp_df['class_pred'] != 'Uncertain']
+ AUC = roc_auc_score(y_true=temp_df['labels'], y_score=temp_df['scores'])
+ return round(AUC*100,1)
+
+def compute_avg_protein_level_AUC_with_uncertain(scores, class_pred, labels, protein_ID):
+ temp_df = pd.DataFrame({'class_pred': class_pred.copy(),'labels': labels.copy(), 'scores': scores.copy(), 'protein_ID': protein_ID.copy()})
+ temp_df=temp_df[temp_df['class_pred'] != 'Uncertain']
+ def compute_auc_group(group):
+ protein_scores = group['scores']
+ protein_labels = group['labels']
+ try:
+ result = roc_auc_score(y_true=protein_labels, y_score=protein_scores)
+ except:
+ result = np.nan
+ return result
+ protein_level_AUC = temp_df.groupby('protein_ID').apply(compute_auc_group)
+ avg_AUC = protein_level_AUC.mean(skipna=True)
+ return round(avg_AUC*100,1)
+
+def compute_pathogenic_rate_with_uncertain(class_pred, labels):
+ temp_df = pd.DataFrame({'class_pred': class_pred.copy(),'labels': labels.copy()})
+ temp_df=temp_df[temp_df['class_pred'] != 'Uncertain']
+ rate = len(temp_df[temp_df['class_pred'] == 'Pathogenic']) / float(len(temp_df))
+ return round(rate*100,1)
+
+def compute_uncertainty_deciles(score_dataframe, score_name="EVE_scores", uncertainty_name='uncertainty', suffix=''):
+ uncertainty_deciles_name='uncertainty_deciles'+suffix
+ score_dataframe[uncertainty_deciles_name] = pd.qcut(score_dataframe[uncertainty_name], q=10, labels=range(1,11)).astype(int)
+ uncertainty_cutoffs_deciles={}
+ scores_at_uncertainty_deciles_cuttoffs_UB_lower_part={}
+ scores_at_uncertainty_deciles_cuttoffs_LB_upper_part={}
+ for decile in range(1,11):
+ uncertainty_cutoffs_deciles[str(decile)]= np.max(score_dataframe[uncertainty_name][score_dataframe[uncertainty_deciles_name] == decile])
+ scores_at_uncertainty_deciles_cuttoffs_UB_lower_part[str(decile)]= np.max(score_dataframe[score_name][(score_dataframe[uncertainty_deciles_name] == decile) & (score_dataframe[score_name] < 0.5)])
+ scores_at_uncertainty_deciles_cuttoffs_LB_upper_part[str(decile)]= np.min(score_dataframe[score_name][(score_dataframe[uncertainty_deciles_name] == decile) & (score_dataframe[score_name] > 0.5)])
+ return uncertainty_cutoffs_deciles, scores_at_uncertainty_deciles_cuttoffs_UB_lower_part, scores_at_uncertainty_deciles_cuttoffs_LB_upper_part
+
+def compute_uncertainty_quartiles(score_dataframe, score_name="EVE_scores", uncertainty_name='uncertainty', suffix=''):
+ uncertainty_deciles_name='uncertainty_quartiles'+suffix
+ score_dataframe[uncertainty_deciles_name] = pd.qcut(score_dataframe[uncertainty_name], q=4, labels=range(1,5)).astype(int)
+ uncertainty_cutoffs_quartiles={}
+ scores_at_uncertainty_quartiles_cuttoffs_UB_lower_part={}
+ scores_at_uncertainty_quartiles_cuttoffs_LB_upper_part={}
+ for quartile in range(1,5):
+ uncertainty_cutoffs_quartiles[str(quartile)]= np.max(score_dataframe[uncertainty_name][score_dataframe[uncertainty_deciles_name] == quartile])
+ scores_at_uncertainty_quartiles_cuttoffs_UB_lower_part[str(quartile)]= np.max(score_dataframe[score_name][(score_dataframe[uncertainty_deciles_name] == quartile) & (score_dataframe[score_name] < 0.5)])
+ scores_at_uncertainty_quartiles_cuttoffs_LB_upper_part[str(quartile)]= np.min(score_dataframe[score_name][(score_dataframe[uncertainty_deciles_name] == quartile) & (score_dataframe[score_name] > 0.5)])
+ return uncertainty_cutoffs_quartiles, scores_at_uncertainty_quartiles_cuttoffs_UB_lower_part, scores_at_uncertainty_quartiles_cuttoffs_LB_upper_part
+
+def compute_performance_by_uncertainty_decile(score_dataframe, metric="Accuracy", verbose=False, score_name="EVE_scores", uncertainty_name="uncertainty", label_name='ClinVar_labels', protein_name='protein_name', class_100pct_retained_name='EVE_classes_100_pct_retained', suffix=''):
+ uncertainty_cutoffs_deciles, scores_at_uncertainty_deciles_cuttoffs_UB_lower_part, scores_at_uncertainty_deciles_cuttoffs_LB_upper_part = compute_uncertainty_deciles(score_dataframe, score_name, uncertainty_name, suffix)
+ performance_by_uncertainty_deciles = {}
+ pathogenic_rate_by_uncertainty_deciles = {}
+ for decile in range(1,11):
+ classification_name = 'class_pred_removing_'+str((10-decile)*10)+"_pct_most_uncertain"+suffix
+ score_dataframe[classification_name] = score_dataframe[class_100pct_retained_name]
+ score_dataframe.loc[score_dataframe['uncertainty_deciles'+suffix] > decile, classification_name] = 'Uncertain'
+ if metric=="Accuracy":
+ performance_decile = compute_accuracy_with_uncertain(score_dataframe[classification_name], score_dataframe[label_name])[0]
+ elif metric =="Avg_AUC":
+ performance_decile = compute_avg_protein_level_AUC_with_uncertain(scores=score_dataframe[score_name], class_pred=score_dataframe[classification_name], labels=score_dataframe[label_name], protein_ID=score_dataframe[protein_name])
+ performance_by_uncertainty_deciles[decile] = performance_decile
+ pathogenic_rate_by_uncertainty_deciles[decile] = compute_pathogenic_rate_with_uncertain(class_pred=score_dataframe[classification_name], labels=score_dataframe[label_name])
+ if verbose:
+ print(str(metric)+" when dropping the "+str((10-decile)*10)+"% of cases with highest uncertainty:\t"+str(performance_by_uncertainty_deciles[decile])+"% \t with pathogenic rate of "+str(pathogenic_rate_by_uncertainty_deciles[decile])+"%\n")
+ print("Uncertainty decile #"+str(decile)+" cutoff: "+str(uncertainty_cutoffs_deciles[str(decile)])+"\n")
+ print("Score upper bound for lower part in uncertainty decile: "+str(scores_at_uncertainty_deciles_cuttoffs_UB_lower_part[str(decile)])+"\n")
+ print("Score lower bound for higher part in uncertainty decile: "+str(scores_at_uncertainty_deciles_cuttoffs_LB_upper_part[str(decile)])+"\n")
+ return performance_by_uncertainty_deciles, pathogenic_rate_by_uncertainty_deciles
+
+def compute_performance_by_uncertainty_quartile(score_dataframe, metric="Accuracy", verbose=False, score_name="EVE_scores", uncertainty_name="uncertainty", label_name='ClinVar_labels', protein_name='protein_name', class_100pct_retained_name='EVE_classes_100_pct_retained', suffix=''):
+ uncertainty_cutoffs_quartiles, scores_at_uncertainty_quartiles_cuttoffs_UB_lower_part, scores_at_uncertainty_quartiles_cuttoffs_LB_upper_part = compute_uncertainty_quartiles(score_dataframe, score_name, uncertainty_name, suffix)
+ performance_by_uncertainty_quartiles = {}
+ pathogenic_rate_by_uncertainty_quartiles = {}
+ for quartile in range(1,5):
+ classification_name = 'class_pred_removing_'+str((4-quartile)*25)+"_pct_most_uncertain"+suffix
+ score_dataframe[classification_name] = score_dataframe[class_100pct_retained_name]
+ score_dataframe.loc[score_dataframe['uncertainty_quartiles'+suffix] > quartile, classification_name] = 'Uncertain'
+ if metric=="Accuracy":
+ performance_quartile = compute_accuracy_with_uncertain(score_dataframe[classification_name], score_dataframe[label_name])[0]
+ elif metric =="Avg_AUC":
+ performance_quartile = compute_avg_protein_level_AUC_with_uncertain(scores=score_dataframe[score_name], class_pred=score_dataframe[classification_name], labels=score_dataframe[label_name], protein_ID=score_dataframe[protein_name])
+ performance_by_uncertainty_quartiles[quartile] = performance_quartile
+ pathogenic_rate_by_uncertainty_quartiles[quartile] = compute_pathogenic_rate_with_uncertain(class_pred=score_dataframe[classification_name], labels=score_dataframe[label_name])
+ if verbose:
+ print(str(metric)+" when dropping the "+str((4-quartile)*25)+"% of cases with highest uncertainty:\t"+str(performance_by_uncertainty_quartiles[quartile])+"% \t with pathogenic rate of "+str(pathogenic_rate_by_uncertainty_quartiles[quartile])+"%\n")
+ print("Uncertainty quartile #"+str(quartile)+" cutoff: "+str(uncertainty_cutoffs_quartiles[str(quartile)])+"\n")
+ print("Score upper bound for lower part in uncertainty quartile: "+str(scores_at_uncertainty_quartiles_cuttoffs_UB_lower_part[str(quartile)])+"\n")
+ print("Score lower bound for higher part in uncertainty quartile: "+str(scores_at_uncertainty_quartiles_cuttoffs_LB_upper_part[str(quartile)])+"\n")
+ return performance_by_uncertainty_quartiles, pathogenic_rate_by_uncertainty_quartiles
+
+def predictive_entropy_binary_classifier(class1_scores, eps=1e-8):
+ class1_scores = pd.Series(class1_scores).map(lambda x: x - eps if x==1.0 else x + eps if x==0 else x)
+ class0_scores = 1 - class1_scores
+ return - np.array((np.log(class1_scores) * class1_scores + np.log(class0_scores) * class0_scores))
+
+def compute_weighted_score_two_GMMs(X_pred, main_model, protein_model, cluster_index_main, cluster_index_protein, protein_weight):
+ return protein_model.predict_proba(X_pred)[:,cluster_index_protein] * protein_weight + (main_model.predict_proba(X_pred)[:,cluster_index_main]) * (1 - protein_weight)
+
+def compute_weighted_class_two_GMMs(X_pred, main_model, protein_model, cluster_index_main, cluster_index_protein, protein_weight):
+ """By construct, 1 is always index of pathogenic, 0 always that of benign"""
+ proba_pathogenic = protein_model.predict_proba(X_pred)[:,cluster_index_protein] * protein_weight + (main_model.predict_proba(X_pred)[:,cluster_index_main]) * (1 - protein_weight)
+ return (proba_pathogenic > 0.5).astype(int)
diff --git a/proteingym/baselines/EVE/utils/plot_helpers.py b/proteingym/baselines/EVE/utils/plot_helpers.py
new file mode 100644
index 0000000..a150c16
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/plot_helpers.py
@@ -0,0 +1,52 @@
+import os
+import tqdm
+import numpy as np
+import pandas as pd
+import matplotlib.pyplot as plt
+
+def plot_histograms(all_evol_indices, dict_models, dict_pathogenic_cluster_index, protein_GMM_weight, plot_location, output_eve_scores_filename_suffix, protein_list):
+ x = np.linspace(-10, 20, 2000)
+ logprob = dict_models['main'].score_samples(x.reshape(-1,1))
+ pdf = np.exp(logprob)
+ component_share = dict_models['main'].predict_proba(x.reshape(-1, 1))
+ pdf_pathogenic = component_share[:,dict_pathogenic_cluster_index['main']] * pdf
+ pdf_benign = component_share[:,1 - dict_pathogenic_cluster_index['main']] * pdf
+ plt.plot(x,pdf, '--k', color='black')
+ plt.plot(x,pdf_pathogenic, '--k', color = 'xkcd:red',linewidth=4)
+ plt.plot(x,pdf_benign, '--k', color = 'xkcd:sky blue',linewidth=4)
+ plt.hist(all_evol_indices['evol_indices'], color = 'xkcd:grey', bins = 80, histtype='stepfilled', alpha=0.4, density=True)
+ plt.xlabel("Evolutionary index", fontsize=13)
+ plt.ylabel("% of variants", fontsize=13)
+ plt.xticks(fontsize=10)
+ plt.yticks(fontsize=10)
+ plt.savefig(plot_location+os.sep+'histogram_random_samples_'+str(output_eve_scores_filename_suffix)+"_all.png", dpi=800, bbox_inches='tight')
+ plt.clf()
+ if protein_GMM_weight > 0.0:
+ for protein in tqdm.tqdm(protein_list,"Plot protein histograms"):
+ x = np.linspace(-10, 20, 2000)
+ logprob = dict_models[protein].score_samples(x.reshape(-1,1))
+ pdf = np.exp(logprob)
+ component_share = dict_models[protein].predict_proba(x.reshape(-1, 1))
+ pdf_pathogenic = component_share[:,dict_pathogenic_cluster_index[protein]] * pdf
+ pdf_benign = component_share[:, 1 - dict_pathogenic_cluster_index[protein]] * pdf
+ plt.plot(x,pdf, '--k', color='black')
+ plt.plot(x,pdf_pathogenic, '--k', color = 'xkcd:red',linewidth=4)
+ plt.plot(x,pdf_benign, '--k', color = 'xkcd:sky blue',linewidth=4)
+ plt.hist(all_evol_indices['evol_indices'][all_evol_indices['protein_name']==protein], color = 'xkcd:grey', bins = 80, histtype='stepfilled', alpha=0.4, density=True)
+ plt.xlabel("Evolutionary index", fontsize=13)
+ plt.ylabel("% of variants", fontsize=13)
+ plt.xticks(fontsize=10)
+ plt.yticks(fontsize=10)
+ plt.savefig(plot_location+os.sep+'histogram_random_samples_'+str(output_eve_scores_filename_suffix)+"_"+str(protein)+".png", dpi=800, bbox_inches='tight')
+ plt.clf()
+
+def plot_scores_vs_labels(score_df, plot_location, output_eve_scores_filename_suffix, mutation_name='mutations', score_name="EVE_scores", label_name='labels'):
+ score_df_local = score_df.copy()
+ score_df_local = score_df_local[score_df_local[mutation_name] !='w-1t'] #Remove wild type sequence
+ score_df_local['mutation_position'] = score_df[mutation_name].map(lambda x: int(x[1:-1]))
+ labels = score_df_local[label_name]
+ pathogenic = plt.scatter(x=score_df_local['mutation_position'][labels==1], y=score_df_local[score_name][labels==1], color='xkcd:red')
+ benign = plt.scatter(x=score_df_local['mutation_position'][labels==0], y=score_df_local[score_name][labels==0], color='xkcd:sky blue')
+ plt.legend([pathogenic,benign],['pathogenic','benign'])
+ plt.savefig(plot_location+os.sep+'scores_vs_labels_plots_'+str(output_eve_scores_filename_suffix)+".png", dpi=400, bbox_inches='tight')
+ plt.clf()
\ No newline at end of file
diff --git a/proteingym/baselines/EVE/utils/weights.py b/proteingym/baselines/EVE/utils/weights.py
new file mode 100644
index 0000000..81bf247
--- /dev/null
+++ b/proteingym/baselines/EVE/utils/weights.py
@@ -0,0 +1,505 @@
+import multiprocessing
+import time
+from collections import defaultdict
+
+import numba
+from numba import prange
+from numba_progress import ProgressBar
+
+import numpy as np
+from tqdm import tqdm
+
+
+def compute_weight_eve(seq, list_seq, theta):
+ # seq shape: (L * alphabet_size,)
+ number_non_empty_positions = np.sum(seq) # = np.dot(seq,seq), assuming it is a flattened one-hot matrix
+ if number_non_empty_positions > 0:
+ # Dot product of one-hot vectors x and y = (x == y).sum()
+ matches = np.dot(list_seq, seq)
+ denom = matches / number_non_empty_positions # number_non_empty_positions = np.dot(seq,seq)
+ denom = np.sum(denom > 1 - theta) # Lood: Keeping >, and changing EVCouplings code to >
+ return 1 / denom
+ else:
+ return 0.0 # return 0 weight if sequence is fully empty
+
+
+def _compute_weight_global(i):
+ seq = list_seq_global[i]
+ # seq shape: (L * alphabet_size,)
+ number_non_empty_positions = np.sum(seq) # = np.dot(seq,seq), assuming it is a flattened one-hot matrix
+ if number_non_empty_positions > 0:
+ matches = np.dot(list_seq_global, seq)
+ denom = matches / number_non_empty_positions # number_non_empty_positions = np.dot(seq,seq)
+ denom = np.sum(denom > 1 - theta_global) # Lood: Keeping >, and changing EVCouplings code to >
+ return 1 / denom
+ else:
+ return 0.0 # return 0 weight if sequence is fully empty
+
+
+def _init_worker_calc_eve(list_seq, theta):
+ # Initialize the worker process
+ # Note: Using global is not ideal, but not sure how else
+ # It should be safe since processes have private global variables
+ global list_seq_global
+ global theta_global
+ list_seq_global = list_seq
+ theta_global = theta
+
+
+def compute_sequence_weights(list_seq, theta, num_cpus=1):
+ _N, _seq_len, _alphabet_size = list_seq.shape # = len(self.seq_name_to_sequence.keys()), len(self.focus_cols), len(self.alphabet)
+ list_seq = list_seq.reshape((_N, _seq_len * _alphabet_size))
+ print(f"Using {num_cpus} cpus for EVE weights computation")
+
+ if num_cpus > 1:
+ # Compute weights in parallel
+ with multiprocessing.Pool(processes=num_cpus, initializer=_init_worker_calc_eve, initargs=(list_seq, theta)) as pool:
+ # func = functools.partial(compute_weight, list_seq=list_seq, theta=theta)
+ chunksize = max(min(8, int(_N / num_cpus / 4)), 1)
+ print("chunksize: " + str(chunksize))
+ # imap: Lazy version of map
+ # Parallel progress bars are complicated, so just used a single one
+ weights_map = tqdm(pool.imap(_compute_weight_global, range(_N), chunksize=chunksize),
+ total=_N, desc="Computing weights parallel EVE")
+ weights = np.array(list(weights_map))
+ else:
+ weights_map = map(lambda seq: compute_weight_eve(seq, list_seq=list_seq, theta=theta), list_seq)
+ weights = np.array(list(tqdm(weights_map, total=_N, desc="Computing weights serial EVE")))
+
+ return weights
+
+
+def is_empty_sequence_matrix(matrix, empty_value):
+ assert len(matrix.shape) == 2, f"Matrix must be 2D; shape={matrix.shape}"
+ assert isinstance(empty_value, (int, float)), f"empty_value must be a number; type={type(empty_value)}"
+ # Check for each sequence if all positions are equal to empty_value
+ empty_idx = np.all((matrix == empty_value), axis=1)
+ return empty_idx
+
+
+# See calc_num_cluster_members_nogaps
+@numba.jit(nopython=True) # , fastmath=True, parallel=True)
+def calc_num_clusters_i(matrix, identity_threshold, invalid_value, i: int, L_non_gaps: float):
+ N, L = matrix.shape
+ L_non_gaps = 1.0 * L_non_gaps # Show numba it's a float
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_clusters_i = 1 # Self
+ # compare all pairs of sequences
+ for j in range(N):
+ if i == j:
+ continue
+ pair_matches = 0
+ for k in range(L):
+ # Edit(Lood): Don't count gaps as matches
+ if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+ pair_matches += 1
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps > identity_threshold:
+ num_clusters_i += 1
+
+ return num_clusters_i
+
+
+# Below are util functions copied from EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+# This code looks slow but it's because it's written as a numba kernel
+# Fastmath should be safe here, as we can assume that there are no NaNs in the input etc.
+@numba.jit(nopython=True) # , fastmath=True, parallel=True
+def calc_num_cluster_members_nogaps(matrix, identity_threshold, invalid_value):
+ """
+ From EVCouplings: https://github.com/debbiemarkslab/EVcouplings/blob/develop/evcouplings/align/alignment.py#L1172
+ Calculate number of sequences in alignment
+ within given identity_threshold of each other
+ Parameters
+ ----------
+ matrix : np.array
+ N x L matrix containing N sequences of length L.
+ Matrix must be mapped to range(0, num_symbols) using
+ map_matrix function
+ identity_threshold : float
+ Sequences with at least this pairwise identity will be
+ grouped in the same cluster.
+ Returns
+ -------
+ np.array
+ Vector of length N containing number of cluster
+ members for each sequence (inverse of sequence
+ weight)
+ """
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ for i in range(N - 1):
+ for j in range(i + 1, N):
+ pair_matches = 0
+ for k in range(L):
+ # Edit(Lood): Don't count gaps as matches
+ if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+ pair_matches += 1
+
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors[i] += 1
+ if pair_matches / L_non_gaps[j] > identity_threshold:
+ num_neighbors[j] += 1
+
+ return num_neighbors
+
+
+def calc_weights_evcouplings(matrix_mapped, identity_threshold, empty_value, num_cpus=1):
+ """
+ From EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+ Calculate weights for sequences in alignment by
+ clustering all sequences with sequence identity
+ greater or equal to the given threshold.
+ Parameters
+ ----------
+ identity_threshold : float
+ Sequence identity threshold
+ """
+ empty_idx = is_empty_sequence_matrix(matrix_mapped,
+ empty_value=empty_value) # e.g. sequences with just gaps or lowercase, no valid AAs
+ N = matrix_mapped.shape[0]
+
+ # Original EVCouplings code structure, plus gap handling
+ if num_cpus != 1:
+ # Numba native parallel:
+ # print("Calculating weights using Numba parallel (experimental) since num_cpus > 1. "
+ # "If you want to disable multiprocessing set num_cpus=1.")
+ # print("Default number of threads for Numba:", numba.config.NUMBA_NUM_THREADS)
+ # # num_cpus > numba.config.NUMBA_NUM_THREADS will give an error.
+ # # But we'll leave it so that the user has to be explicit.
+ # numba.set_num_threads(num_cpus)
+ # print("Set number of threads to:", numba.get_num_threads())
+ # num_cluster_members = calc_num_cluster_members_nogaps_parallel(matrix_mapped[~empty_idx], identity_threshold,
+ # invalid_value=empty_value)
+
+
+ update_frequency=1000
+ with ProgressBar(total=N, update_interval=30, miniters=update_frequency) as progress: # can also use tqdm mininterval, maxinterval etc
+ num_cluster_members = calc_num_cluster_members_nogaps_parallel_print(matrix_mapped[~empty_idx], identity_threshold,
+ invalid_value=empty_value, progress_proxy=progress, update_frequency=update_frequency)
+ # print("Num CPUs for EVCouplings code:", num_cpus)
+ # print(
+ # f"Calculating weights using Numba JIT and multiprocessing (experimental) since num_cpus ({num_cpus}) > 1. "
+ # "If you want to disable multiprocessing set num_cpus=1.")
+ # with multiprocessing.Pool(processes=num_cpus, initializer=_init_worker_ev,
+ # initargs=(matrix_mapped[~empty_idx], empty_value, identity_threshold)) as pool:
+ # # Simply: Chunksize is between 1 and 64, preferably N / num_cpus / 16,
+ # # so every CPU gets a 16th of their expected total every time they ask for more work.
+ # # Too small values: Too much overhead sending simple indexes to workers, and them sending back results.
+ # # Too large: May wait a while for the last worker's task to finish.
+ # chunksize = max(1, min(64, int(N / num_cpus / 16)))
+ # print("chunksize: " + str(chunksize))
+
+ # # imap: Lazy version of map
+ # # Parallel progress bars are complicated and pollute logs
+ # cluster_map = tqdm(pool.imap(_worker_func, range(N), chunksize=chunksize), total=N, mininterval=1)
+ # num_cluster_members = np.array(list(cluster_map))
+ else:
+ num_cluster_members = calc_num_cluster_members_nogaps(matrix_mapped[~empty_idx], identity_threshold,
+ invalid_value=empty_value)
+
+ # Empty sequences: weight 0
+ weights = np.zeros((N))
+ weights[~empty_idx] = 1.0 / num_cluster_members
+ return weights
+
+
+# Multiprocessing with numba jit:
+def _init_worker_ev(matrix, empty_value, identity_threshold):
+ global matrix_mapped_global
+ matrix_mapped_global = matrix
+ L = matrix.shape[1]
+ global empty_value_global
+ empty_value_global = empty_value
+ global identity_threshold_global
+ identity_threshold_global = identity_threshold
+ global L_i_global
+ L_i_global = L - np.sum(matrix == empty_value, axis=1)
+ print("Initialising worker")
+ global global_func_num_clusters_i
+ global_func_num_clusters_i = _global_calc_cluster_factory()
+ try:
+ start = time.perf_counter()
+ _ = global_func_num_clusters_i(0) # Timeout, and numba verbosity?
+ end = time.perf_counter()
+ print(f"Initialising worker took: {end - start:.2f}")
+ except Exception as e:
+ print("Worker initialisation failed:", e)
+ raise e
+ print("Function compiled")
+
+
+def _worker_func(i):
+ return global_func_num_clusters_i(i)
+
+
+def _global_calc_cluster_factory():
+ # @numba.jit(nopython=True)
+ def func(i):
+ return calc_num_clusters_i(matrix_mapped_global, identity_threshold_global, empty_value_global, i,
+ L_non_gaps=L_i_global[i])
+ return func
+
+
+# Copied from EVCouplings
+def map_from_alphabet(alphabet, default):
+ """
+ Creates a mapping dictionary from a given alphabet.
+ Parameters
+ ----------
+ alphabet : str
+ Alphabet for remapping. Elements will
+ be remapped according to alphabet starting
+ from 0
+ default : Elements in matrix that are not
+ contained in alphabet will be treated as
+ this character
+ Raises
+ ------
+ ValueError
+ For invalid default character
+ """
+ map_ = {
+ c: i for i, c in enumerate(alphabet)
+ }
+
+ try:
+ default = map_[default]
+ except KeyError:
+ raise ValueError(
+ "Default {} is not in alphabet {}".format(default, alphabet)
+ )
+
+ return defaultdict(lambda: default, map_)
+
+
+# Copied from EVCouplings
+def map_matrix(matrix, map_):
+ """
+ Map elements in a numpy array using alphabet
+ Parameters
+ ----------
+ matrix : np.array
+ Matrix that should be remapped
+ map_ : defaultdict
+ Map that will be applied to matrix elements
+ Returns
+ -------
+ np.array
+ Remapped matrix
+ """
+ return np.vectorize(map_.__getitem__)(matrix)
+
+# The main function
+
+@numba.jit(nopython=True, fastmath=True, parallel=True)
+def calc_num_cluster_members_nogaps_parallel(matrix, identity_threshold, invalid_value):
+ """
+ From EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+ Calculate number of sequences in alignment
+ within given identity_threshold of each other
+ Parameters
+ ----------
+ matrix : np.array
+ N x L matrix containing N sequences of length L.
+ Matrix must be mapped to range(0, num_symbols) using
+ map_matrix function
+ identity_threshold : float
+ Sequences with at least this pairwise identity will be
+ grouped in the same cluster.
+ invalid_value : int
+ Value in matrix that is considered invalid, e.g. gap or lowercase character.
+ Returns
+ -------
+ np.array
+ Vector of length N containing number of cluster
+ members for each sequence (inverse of sequence
+ weight)
+ """
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ # Edit: Rewrote loop without any dependencies between inner and outer loops, so that it can be parallelized
+ for i in prange(N):
+ num_neighbors_i = 1 # num_neighbors_i = 0 # TODO why did I make this 0 again? Probably because I thought I'd have to count i == j
+ for j in range(N):
+ if i == j:
+ continue
+ pair_matches = 0
+ for k in range(L): # This should hopefully be vectorised by numba
+ if matrix[i, k] == matrix[j, k] and matrix[
+ i, k] != invalid_value: # Edit(Lood): Don't count gaps as matches
+ pair_matches += 1
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so this similarity is asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors_i += 1
+
+ num_neighbors[i] = num_neighbors_i
+
+ return num_neighbors
+
+@numba.jit(nopython=True, fastmath=True, parallel=True)
+def calc_num_cluster_members_nogaps_parallel_print(matrix, identity_threshold, invalid_value, progress_proxy=None, update_frequency=1000):
+ """
+ From EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+ Calculate number of sequences in alignment
+ within given identity_threshold of each other
+ Parameters
+ ----------
+ matrix : np.array
+ N x L matrix containing N sequences of length L.
+ Matrix must be mapped to range(0, num_symbols) using
+ map_matrix function
+ identity_threshold : float
+ Sequences with at least this pairwise identity will be
+ grouped in the same cluster.
+ invalid_value : int
+ Value in matrix that is considered invalid, e.g. gap or lowercase character.
+ Returns
+ -------
+ np.array
+ Vector of length N containing number of cluster
+ members for each sequence (inverse of sequence
+ weight)
+ """
+
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ # Edit: Rewrote loop without any dependencies between inner and outer loops, so that it can be parallelized
+ for i in prange(N):
+ num_neighbors_i = 1 # num_neighbors_i = 0 # TODO why did I make this 0 again? Probably because I thought I'd have to count i == j
+ for j in range(N):
+ if i == j:
+ continue
+ pair_matches = 0
+ for k in range(L): # This should hopefully be vectorised by numba
+ if matrix[i, k] == matrix[j, k] and matrix[
+ i, k] != invalid_value: # Edit(Lood): Don't count gaps as matches
+ pair_matches += 1
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so this similarity is asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors_i += 1
+
+ num_neighbors[i] = num_neighbors_i
+ if progress_proxy is not None and i % update_frequency == 0:
+ progress_proxy.update(update_frequency)
+
+ return num_neighbors
+
+########################
+# Failed JIT parallel ideas:
+# N, L = matrix_mapped[~empty_idx].shape
+# L_i = L - np.sum(matrix_mapped[~empty_idx] == empty_value, axis=1)
+
+# Idea 1: Calculate the full pair matrix (i,j) = number of matches between sequence i and sequence j
+# neighbour_matrix = calc_num_pairs(matrix_mapped[~empty_idx], identity_threshold, invalid_value=empty_value)
+# num_cluster_members = (pairs_matrix / L_i[:, None] >= identity_threshold).sum(axis=1)
+
+# Idea 2: Calculate the pair matrix but applying thresholding inside the loop
+# num_cluster_members = neighbour_matrix.sum(axis=1)
+# num_cluster_members = calc_num_cluster_members_nogaps_all_vs_all(
+# matrix_mapped[~empty_idx], identity_threshold, invalid_value=empty_value # matrix_mapped[~empty_idx]
+# )
+
+# Idea 3)
+
+# Inside calling function calc_weights_evcouplings_parallel
+# if num_cpus > 1:
+# Compute weights in parallel
+# print("Num CPUs for EVCouplings code:", num_cpus)
+# with multiprocessing.Pool(processes=num_cpus, initializer=init_worker_ev, initargs=(matrix_mapped[~empty_idx], empty_value, identity_threshold)) as pool:
+# # func = functools.partial(compute_weight, list_seq=list_seq, theta=theta)
+# chunksize = max(min(32, int(N / num_cpus / 4)), 1)
+# print("chunksize: " + str(chunksize))
+# # imap: Lazy version of map
+# # Parallel progress bars are complicated
+# cluster_map = tqdm(pool.imap(_worker_func, range(N), chunksize=chunksize), total=N)
+
+# @numba.jit(nopython=True, fastmath=True, parallel=True)
+# def calc_num_pairs(matrix, identity_threshold, invalid_value):
+# """
+# From EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+# Calculate number of sequences in alignment
+# within given identity_threshold of each other
+# Parameters
+# ----------
+# matrix : np.array
+# N x L matrix containing N sequences of length L.
+# Matrix must be mapped to range(0, num_symbols) using
+# map_matrix function
+# identity_threshold : float
+# Sequences with at least this pairwise identity will be
+# grouped in the same cluster.
+# Returns
+# -------
+# np.array
+# Vector of length N containing number of cluster
+# members for each sequence (inverse of sequence
+# weight)
+# """
+# N, L = matrix.shape
+# # L = 1.0 * L # need to tell numba that L is a float
+#
+# # Empty sequences are filtered out before this function and are ignored
+# # minimal cluster size is 1 (self)
+# # L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+# neighbour_matrix = np.eye(N) # dtype=np.bool
+# # Crucial: We assume none of the sequences are empty
+# # Construct a loop that counts a neighbour if the pairwise identity is above the threshold
+# pairs_j = np.zeros(N, dtype=np.int32)
+# for i in range(N):
+# # Calculate the non-gapped length of sequence i
+# # L_i = np.sum(matrix[i] != invalid_value) # Can either use L_i or L_j to calculate the neighbor matrix, the output will simply be transposed
+# pairs_j[:] = 0
+# for j in range(N):
+# num_pairs = 0
+# for k in range(L):
+# if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+# num_pairs += 1
+# pairs_j[j] = num_pairs # Could also just add this as an array at the end of j loop
+# neighbour_matrix[i] = pairs_j # Could also calc identity threshold here
+#
+# return neighbour_matrix
+
+# @numba.jit(nopython=True, parallel=True)
+# def num_cluster_members_from_pair(matrix, identity_threshold, invalid_value, L_i):
+# N = matrix.shape[0]
+# pairs_matrix = np.zeros((N))
+# for i in prange(N):
+# pairs_matrix[i] = calc_num_clusters_i(matrix, identity_threshold=identity_threshold,
+# invalid_value=invalid_value, i=i,
+# L_non_gaps=L_i[i])
+# return pairs_matrix
+
+# Slower than numpy.prange
+# @numba.jit(nopython=True, parallel=True)
+# def func_all_i(matrix, identity_threshold, invalid_value, L_i):
+# N = matrix.shape[0]
+# num_clusters = np.zeros(N)
+# for i in prange(N):
+# num_clusters[i] = calc_num_clusters_i(matrix, identity_threshold=identity_threshold,
+# invalid_value=invalid_value, i=i, L_non_gaps=L_i[i])
+# return num_clusters
\ No newline at end of file
diff --git a/proteingym/baselines/EVmutation/calculations.py b/proteingym/baselines/EVmutation/calculations.py
new file mode 100644
index 0000000..b8c07a9
--- /dev/null
+++ b/proteingym/baselines/EVmutation/calculations.py
@@ -0,0 +1,299 @@
+"""
+High-level mutation calculation functions for EVmutation
+
+.. todo::
+
+ implement segment handling
+
+Authors:
+ Thomas A. Hopf
+ Anna G. Green (generalization for multiple segments)
+"""
+
+import numpy as np
+import pandas as pd
+from evcouplings.utils.calculations import entropy_map
+
+
+COMPONENT_TO_INDEX = {
+ "full": 0,
+ "couplings": 1,
+ "fields": 2,
+}
+
+
+def extract_mutations(mutation_string, offset=0, sep=":"):
+ """
+ Turns a string containing mutations of the format I100V into a list of tuples with
+ format (100, 'I', 'V') (index, from, to)
+
+ Parameters
+ ----------
+ mutation_string : str
+ Comma-separated list of one or more mutations (e.g. "K50R,I100V")
+ offset : int, default: 0
+ Offset to be added to the index/position of each mutation
+ sep : str, default ","
+ String used to separate multiple mutations
+
+ Returns
+ -------
+ list of tuples
+ List of tuples of the form (index+offset, from, to)
+ """
+ try:
+ if mutation_string.lower() not in ["wild", "wt", ""]:
+ mutations = mutation_string.split(sep)
+ return list(map(
+ lambda x: (int(x[1:-1]) + offset, x[0], x[-1]),
+ mutations
+ ))
+ except AttributeError:
+ return []
+
+
+def predict_mutation_table(model, table, output_column="prediction_epistatic",
+ mutant_column="mutant", hamiltonian="full", offset=0,
+ segment=None, sep=":"):
+ """
+ Predicts all mutants in a dataframe and adds predictions
+ as a new column.
+
+ If mutant_column is None, the dataframe index is used,
+ otherwise the given column.
+
+ Mutations which cannot be calculated (e.g. not covered
+ by alignment, or invalid substitution) using object are
+ set to NaN.
+
+ Parameters
+ ----------
+ model : CouplingsModel
+ CouplingsModel instance used to compute mutation
+ effects
+ table : pandas.DataFrame
+ DataFrame with mutants to which delta of
+ statistical energy will be added
+ mutant_column: str
+ Name of column in table that contains mutants
+ output_column : str
+ Name of column in returned dataframe that will
+ contain computed effects
+ hamiltonian: {"full", "couplings", "fields"},
+ default: "full"
+ Use full Hamiltonian of exponential model (default),
+ or only couplings / fields for statistical energy
+ calculation.
+ segment: str, default: None
+ Specificy a segment identifier to use for the positions in the mutation
+ table. This will only be used if the mutation table doesn't already have
+ a segments column.
+
+ Returns
+ -------
+ pandas.DataFrame
+ Dataframe with added column (mutant_column) that contains computed
+ mutation effects
+ """
+ def _predict_mutant(m):
+ # try:
+ delta_E = model.delta_hamiltonian(m)
+ return delta_E[_component]
+
+ # except ValueError:
+ # return np.nan
+
+ # select Hamiltonian component for prediction
+ if hamiltonian in COMPONENT_TO_INDEX:
+ _component = COMPONENT_TO_INDEX[hamiltonian]
+ else:
+ raise ValueError(
+ "Invalid selection for hamiltonian. "
+ "Valid values are: " + ", ".join(COMPONENT_TO_INDEX)
+ )
+
+ # make sure there is a target sequence for which we
+ # can compute statistical energy difference
+ if not model.has_target_seq:
+ raise ValueError(
+ "CouplingsModel object does not have a target "
+ "sequence (non-focus mode). "
+ "Set target sequence, or rerun inference in focus mode."
+ )
+ pred = table.copy()
+
+ # get column which contains mutations
+ if mutant_column is None:
+ mutations = pred.index
+ else:
+ mutations = pred.loc[:, mutant_column]
+
+ # if there is a segment column, use that to apply
+ # segment information to every mutation
+ if "segment" in pred.columns and pred.loc[:, "segment"].notnull().all():
+ segments = pred.loc[:, "segment"]
+
+ # split each comma-delimited string of mutations into a list
+ mutations_separated = [extract_mutations(x,offset=offset) for x in mutations] #map(extract_mutations, mutations)
+
+ # split each comma-delimited string of segments into a list
+ segments_separated = [x.split(",") for x in segments]
+ mutation_list = []
+
+ # create a list of mutation in the format
+ # [[((segment, pos), aa_from, aa_to), ((segment, pos) aa_from, aa_to)], [((segment, pos) aa_from, aa_to)]]
+ if len([segments_separated]) != len([mutations_separated]):
+ raise(
+ ValueError,
+ "Number of mutations provided does not match number of segments of origin provided."
+ )
+
+ for segment_subset, mutation_subset in zip(segments_separated, mutations_separated):
+ _mutation_list = [
+ ((seg, pos), aa_from, aa_to) for
+ (seg, (pos, aa_from, aa_to)) in zip(
+ segment_subset, mutation_subset
+ )
+ ]
+ mutation_list.append(_mutation_list)
+
+ # else if the segment argument was provided
+ # designate that as the segment for every mutation
+ elif segment is not None:
+ mutations_separated = [extract_mutations(x,offset=offset) for x in mutations] #map(extract_mutations, mutations)
+ mutation_list = []
+ for mutation_subset in mutations_separated:
+ _mutation_list = [
+ ((segment, pos), aa_from, aa_to) for
+ (pos, aa_from, aa_to) in mutation_subset
+ ]
+ mutation_list.append(_mutation_list)
+
+ else:
+ mutation_list = [extract_mutations(x,offset=offset) for x in mutations] #map(extract_mutations, mutations)
+ # predict mutations and add to table
+ pred.loc[:, output_column] = [
+ _predict_mutant(m) for m in mutation_list
+ ]
+
+ return pred
+
+
+def single_mutant_matrix(model, output_column="prediction_epistatic",
+ exclude_self_subs=True):
+ """
+ Create table with all possible single substitutions of
+ target sequence in CouplingsModel object.
+
+ Parameters
+ ----------
+ model : CouplingsModel
+ Model that will be used to predict single mutants
+ output_column : str, default: "prediction_epistatic"
+ Name of column in Dataframe that will contain predictions
+ exclude_self_subs : bool, default: True
+ Exclude self-substitutions (e.g. A100A) from results
+
+ Returns
+ -------
+ pandas.DataFrame
+ DataFrame with predictions for all single mutants
+ """
+ res = []
+ cons = entropy_map(model)
+
+ # iterate all positions and substitutions per position
+ for pos in model.index_list:
+ for subs in model.alphabet:
+ # do not predict gaps
+ if subs in ["-", "."]:
+ continue
+
+ # exclude self-substitutions?
+ if exclude_self_subs and subs == model.seq(pos):
+ continue
+
+ # if position is a tuple, it is in format
+ # (segment_id, position). Else, there is
+ # no segment information
+ if isinstance(pos, tuple):
+ position_str = pos[1]
+ segment = pos[0]
+
+ else:
+ position_str = pos
+ segment = np.nan
+
+ wt = model.seq(pos)
+ mutant = "{}{}{}".format(wt, position_str, subs)
+
+ res.append(
+ {
+ "segment": segment,
+ "mutant": mutant,
+ "pos": position_str,
+ "wt": wt,
+ "subs": subs,
+ "frequency": model.fi(pos, subs),
+ "column_conservation": cons[pos],
+ output_column: model.smm(pos, subs),
+ }
+ )
+
+ pred = pd.DataFrame(res)
+ return pred.loc[
+ :, ["segment", "mutant", "pos", "wt", "subs", "frequency",
+ "column_conservation", output_column]
+ ]
+
+
+def split_mutants(x, mutant_column="mutant"):
+ """
+ Splits mutation strings into individual columns in DataFrame
+ (wild-type symbol(s), position(s), substitution(s), number of mutations).
+ This function is e.g. helpful when computing average
+ effects per position using pandas groupby() operations
+
+ Parameters
+ ----------
+ x : pandas.DataFrame
+ Table with mutants
+ mutant_column : str, default: "mutant"
+ Column which contains mutants, set to None
+ to use index of DataFrame
+
+ Returns
+ -------
+ pandas.DataFrame
+ DataFrame with added columns "num_subs", "pos", "wt"
+ and "subs" that contain the number of mutations,
+ and split mutation strings (if higher-order mutations,
+ symbols/numbers are comma-separated)
+ """
+ def _split(mut_str):
+ try:
+ return sorted(extract_mutations(mut_str))
+ except ValueError:
+ return np.nan
+
+ def _join(index):
+ return [
+ ",".join([str(subs[index]) for subs in mutant])
+ for mutant in spl
+ ]
+
+ # get column which contains mutations
+ if mutant_column is None:
+ mutations = x.index
+ else:
+ mutations = x.loc[:, mutant_column]
+
+ # extract wt/pos/subs where possible
+ spl = mutations.map(_split)
+
+ # then store in individual columns
+ x.loc[:, "num_mutations"] = [len(mutant) for mutant in spl]
+ for i, column in enumerate(["pos", "wt", "subs"]):
+ x.loc[:, column] = _join(i)
+
+ return x
diff --git a/proteingym/baselines/EVmutation/score_mutants.py b/proteingym/baselines/EVmutation/score_mutants.py
new file mode 100644
index 0000000..624ae25
--- /dev/null
+++ b/proteingym/baselines/EVmutation/score_mutants.py
@@ -0,0 +1,62 @@
+import os
+import pandas as pd
+import argparse
+from evcouplings.couplings import CouplingsModel
+from evcouplings.mutate import predict_mutation_table, single_mutant_matrix
+import calculations
+
+def score_multi_aa(DMS_filename,DMS_id, DMS_folder, output_score_folder, couplings_model, offset=0):
+ """
+ Makes EVcouplings predictions (epistatic and independent) for multi-AA mutants. Calculates Spearman between
+ predictions and DMS dataset.
+ """
+ c = CouplingsModel(couplings_model)
+ c0 = c.to_independent_model()
+ data = pd.read_csv(DMS_folder + os.sep + DMS_filename)
+ data = calculations.predict_mutation_table(c, data, "prediction_epistatic", sep=":", offset=offset)
+ data = calculations.predict_mutation_table(c0, data, "prediction_independent", sep=":", offset=offset)
+ data.to_csv(output_score_folder + os.sep + DMS_id + ".csv", index=False)
+
+
+if __name__ == "__main__":
+ parser = argparse.ArgumentParser(description='Score EVcouplings models on DMS data')
+ parser.add_argument('--DMS_reference_file_path', type=str, help="Path to DMS reference file")
+ parser.add_argument('--DMS_data_folder',type=str, help="Path to DMS data folder")
+ parser.add_argument('--model_folder',type=str, help="Path to EVcouplings model folder")
+ parser.add_argument('--output_scores_folder',type=str, help="Path to output score folder")
+ parser.add_argument('--DMS_index',type=int,help="Index of DMS assay to score")
+ args = parser.parse_args()
+ mapping = pd.read_csv(args.DMS_reference_file_path)
+
+ list_DMS = mapping["DMS_id"]
+ DMS_id = list_DMS[args.DMS_index]
+
+ # Hardcoding RASK_HUMAN_Ursu case (has a special alignment id since it differs from the other RASK_HUMAN assays)
+ if DMS_id == "RASK_HUMAN_Ursu_2020":
+ UniProt_id = "RASK_HUMAN_Ursu_2020"
+ else:
+ list_UniProt = mapping["UniProt_ID"]
+
+ UniProt_id = list_UniProt[args.DMS_index]
+ if UniProt_id == "F7YBW7_MESOW":
+ UniProt_id = "F7YBW8_MESOW"
+
+ DMS_filename = mapping["DMS_filename"][mapping["DMS_id"]==DMS_id].values[0]
+
+ sequence = mapping["target_seq"][mapping["DMS_id"]==DMS_id].values[0]
+
+ job_name_prefix = UniProt_id
+
+
+ offset = mapping["MSA_start"][mapping["DMS_id"]==DMS_id].values[0] - 1
+ print("Offset: {}".format(offset))
+ print(f"{args.model_folder}/{job_name_prefix}/{job_name_prefix}/couplings/{job_name_prefix}.model")
+ if os.path.exists(f"{args.model_folder}/{job_name_prefix}/{job_name_prefix}/couplings/{job_name_prefix}.model"):
+ score_multi_aa(DMS_filename=DMS_filename,
+ DMS_id=DMS_id,
+ DMS_folder=args.DMS_data_folder,
+ output_score_folder=args.output_scores_folder,
+ couplings_model=f"{args.model_folder}/{job_name_prefix}/{job_name_prefix}/couplings/{job_name_prefix}.model",
+ offset=-offset)
+ else:
+ print(f"No model file for: {job_name_prefix}")
\ No newline at end of file
diff --git a/proteingym/baselines/HMM/score_hmm.py b/proteingym/baselines/HMM/score_hmm.py
new file mode 100644
index 0000000..3d58dc7
--- /dev/null
+++ b/proteingym/baselines/HMM/score_hmm.py
@@ -0,0 +1,111 @@
+import argparse
+import pandas as pd
+import os
+import numpy as np
+
+"""
+This script scores a folder of variants using the HMM model.
+"""
+if __name__ == "__main__":
+ parser = argparse.ArgumentParser(description='Score a folder of variants using the HMM model')
+ #We may pass in all required information about the dataset via the provided reference files, or specify all relevant fields manually
+ parser.add_argument('--DMS_reference_file', type=str, help='Path to reference file with list of target sequences (and filepaths to their associated variants) to score')
+ parser.add_argument('--DMS_index', type=int, help='Index of sequence and variants to score in reference file')
+ parser.add_argument("--hmmer_path", type=str, help="Path to hmmer installation")
+ parser.add_argument('--DMS_folder', type=str, help='Path to folder that contains the protein variants for each target sequence')
+ parser.add_argument('--output_scores_folder', default='./', type=str, help='Name of folder to write model scores to')
+ parser.add_argument("--intermediate_outputs_folder", type=str, default="./intermediate_outputs", help="Path to folder to write intermediate outputs to")
+ parser.add_argument('--MSA_folder', default='.', type=str, help='Path to MSA for neighborhood scoring')
+
+ #Fields to be passed manually if reference file is not used
+ parser.add_argument('--target_seq', default=None, type=str, help='Full wild type sequence that is mutated in the experiment')
+ parser.add_argument('--DMS_file_name', default=None, type=str, help='Name of experiment file')
+ parser.add_argument('--MSA_filename', default=None, type=str, help='Name of MSA (eg., a2m) file constructed on the wild type sequence')
+ parser.add_argument('--MSA_start', default=None, type=int, help='Sequence position that the MSA starts at (1-indexing)')
+ parser.add_argument('--MSA_end', default=None, type=int, help='Sequence position that the MSA ends at (1-indexing)')
+ parser.add_argument("--mutant_column",default="mutant",type=str)
+ parser.add_argument("--mutated_sequence_column",default="mutated_sequence",type=str)
+ args = parser.parse_args()
+
+ if not os.path.exists(args.output_scores_folder):
+ os.makedirs(args.output_scores_folder)
+ if not os.path.exists(args.intermediate_outputs_folder):
+ os.makedirs(args.intermediate_outputs_folder)
+ if not os.path.exists(args.intermediate_outputs_folder + os.sep + "hmm"):
+ os.mkdir(args.intermediate_outputs_folder + os.sep + "hmm")
+ if not os.path.exists(args.intermediate_outputs_folder + os.sep + "fa"):
+ os.mkdir(args.intermediate_outputs_folder + os.sep + "fa")
+ if not os.path.exists(args.intermediate_outputs_folder + os.sep + "a2m"):
+ os.mkdir(args.intermediate_outputs_folder + os.sep + "a2m")
+ if not os.path.exists(args.intermediate_outputs_folder + os.sep + "raw_scores"):
+ os.mkdir(args.intermediate_outputs_folder + os.sep + "raw_scores")
+ hmm_path = args.intermediate_outputs_folder + os.sep + "hmm"
+ fa_path = args.intermediate_outputs_folder + os.sep + "fa"
+ raw_score_path = args.intermediate_outputs_folder + os.sep + "raw_scores"
+
+ if args.DMS_reference_file:
+ DMS_mapfile = pd.read_csv(args.DMS_reference_file)
+ list_target_sequences = DMS_mapfile["DMS_id"].tolist()
+ DMS_id=list_target_sequences[args.DMS_index]
+ print(f"Computing HMM scores for {DMS_id})")
+ target_seq = DMS_mapfile["target_seq"][DMS_mapfile["DMS_id"]==DMS_id].values[0].upper()
+ DMS_file_name = DMS_mapfile["DMS_filename"][DMS_mapfile["DMS_id"]==DMS_id].values[0]
+ MSA_data_file = DMS_mapfile["MSA_filename"].tolist()[args.DMS_index]
+ MSA_data_file = args.MSA_folder + os.sep + MSA_data_file if type(MSA_data_file) != float else None
+ MSA_start = DMS_mapfile["MSA_start"].tolist()[args.DMS_index]
+ MSA_start = int(MSA_start) - 1 if not np.isnan(MSA_start) else None # MSA_start typically based on 1-indexing
+ MSA_end = DMS_mapfile["MSA_end"].tolist()[args.DMS_index]
+ MSA_end = int(MSA_end) if not np.isnan(MSA_end) else None
+ else:
+ target_seqs=args.target_seq
+ DMS_file_names=args.DMS_file_name
+ DMS_id = DMS_file_names.split(".")[0]
+ MSA_data_file = args.MSA_folder + os.sep + args.MSA_filename if args.MSA_folder is not None else None
+ MSA_start = args.MSA_start - 1 # MSA_start based on 1-indexing
+ MSA_end = args.MSA_end
+ # mutated_sequence_column = "mutant"
+ mutated_sequence_column = "mutated_sequence"
+ # checking all a3m files in alignment and calling reformat if an a2m or hmm file are not already generated
+ print("Checking for alignment files and reformatting a3m files if needed")
+ if MSA_data_file is None:
+ print(f"Alignment required for HMM scoring")
+ exit()
+ basename = os.path.splitext(MSA_data_file)[0].split(os.sep)[-1]
+ # Building HMMs from a2ms if HMMs do not already exist
+ print("Building HMMs if they don't already exist")
+ if not os.path.exists(hmm_path + os.sep + basename + ".hmm"):
+ os.system(f"{args.hmmer_path}" + os.sep + "bin" + os.sep + 'hmmbuild --amino ' + f"{hmm_path + os.sep + basename + '.hmm'} {MSA_data_file}")
+
+ print("Writing out variants to fasta file for hmm scoring")
+ if not os.path.exists(args.DMS_folder + os.sep + DMS_file_name):
+ print(f"Warning: {DMS_file_name} not found in {args.DMS_folder}. Skipping.")
+ variant_df = pd.read_csv(args.DMS_folder + os.sep + DMS_file_name)
+ with open(fa_path + os.sep + basename + ".fa", "w+") as f:
+ f.write(f">WT\n{target_seq}\n")
+ for j, row in variant_df.iterrows():
+ f.write(f">Mutant {j}\n{row[mutated_sequence_column]}\n")
+
+ print("Scoring variants with HMM forward-backward algorithm")
+ hmm_file = hmm_path + os.sep + basename + ".hmm"
+ fa_file = fa_path + os.sep + basename + ".fa"
+ output_file = raw_score_path + os.sep + basename + ".csv"
+ os.system(f"{args.hmmer_path}" + os.sep + "src" + os.sep + f"generic_fwdback_example {hmm_file} {fa_file} > {output_file}")
+
+ # Postprocessing scores to have label columns, wt ratio scores
+ print("Postprocessing scores to match output format")
+ if not os.path.exists(os.path.join(args.DMS_folder,DMS_file_name)):
+ print(f"Warning: {DMS_file_name} not found in {args.DMS_folder}. Skipping.")
+ df = pd.read_csv(os.path.join(raw_score_path, basename + ".csv"))
+ # removing extra whitespace from seq_name column
+ df["seq_name"] = df["seq_name"].apply(lambda x: x.replace(" ",""))
+ wt_logprob = df[df["seq_name"] == "WT"]["logprob"].values[0]
+ df["wt_ratio"] = df["logprob"].astype(float) - float(wt_logprob)
+ # Occasionaly there are characters in sequences outside the alphabet of the hmm model, result in logprobs of -inf for both sequences and nan model scores.
+ # We set those NaNs equal to 0 here, assuming no difference between the mutants as it's outside the hmm alphabet.
+ df["wt_ratio"] = df["wt_ratio"].fillna(0)
+ # Dropping WT score here to match other output formats
+ df = df[df["seq_name"] != "WT"]
+ df = df.rename(columns={"seq":"mutant"})
+ variant_df = pd.read_csv(os.path.join(args.DMS_folder, DMS_file_name))
+ df = df.merge(variant_df[["mutant","DMS_score"]], on="mutant", how="left")
+ df.to_csv(os.path.join(args.output_scores_folder,DMS_id + ".csv"), index=False)
\ No newline at end of file
diff --git a/proteingym/baselines/__init__.py b/proteingym/baselines/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/proteingym/baselines/carp_mif/__init__.py b/proteingym/baselines/carp_mif/__init__.py
new file mode 100644
index 0000000..e4bfb0a
--- /dev/null
+++ b/proteingym/baselines/carp_mif/__init__.py
@@ -0,0 +1 @@
+from . import carp_mif_utils
\ No newline at end of file
diff --git a/proteingym/baselines/carp_mif/carp_mif_utils.py b/proteingym/baselines/carp_mif/carp_mif_utils.py
new file mode 100644
index 0000000..3997336
--- /dev/null
+++ b/proteingym/baselines/carp_mif/carp_mif_utils.py
@@ -0,0 +1,38 @@
+import torch
+from sequence_models.collaters import SimpleCollater, StructureCollater, BGCCollater
+from sequence_models.pretrained import load_carp,load_gnn,MIF
+from sequence_models.constants import PROTEIN_ALPHABET
+
+CARP_URL = 'https://zenodo.org/record/6564798/files/'
+MIF_URL = 'https://zenodo.org/record/6573779/files/'
+BIG_URL = 'https://zenodo.org/record/6857704/files/'
+
+def load_model_and_alphabet(model_name, model_dir=None):
+ if not model_name.endswith(".pt"):
+ if 'big' in model_name:
+ url = BIG_URL + '%s.pt?download=1' %model_name
+ elif 'carp' in model_name:
+ url = CARP_URL + '%s.pt?download=1' %model_name
+ elif 'mif' in model_name:
+ url = MIF_URL + '%s.pt?download=1' %model_name
+ model_data = torch.hub.load_state_dict_from_url(url, progress=False, map_location="cpu", model_dir=model_dir)
+ else:
+ model_data = torch.load(model_name, map_location="cpu")
+ if 'big' in model_data['model']:
+ pfam_to_domain = model_data['pfam_to_domain']
+ tokens = model_data['tokens']
+ collater = BGCCollater(tokens, pfam_to_domain)
+ else:
+ collater = SimpleCollater(PROTEIN_ALPHABET, pad=True)
+ if 'carp' in model_data['model']:
+ model = load_carp(model_data)
+ elif model_data['model'] in ['mif', 'mif-st']:
+ gnn = load_gnn(model_data)
+ cnn = None
+ if model_data['model'] == 'mif-st':
+ url = CARP_URL + '%s.pt?download=1' % 'carp_640M'
+ cnn_data = torch.hub.load_state_dict_from_url(url, progress=False, map_location="cpu")
+ cnn = load_carp(cnn_data)
+ collater = StructureCollater(collater, n_connections=30)
+ model = MIF(gnn, cnn=cnn)
+ return model, collater
\ No newline at end of file
diff --git a/proteingym/baselines/carp_mif/compute_fitness.py b/proteingym/baselines/carp_mif/compute_fitness.py
new file mode 100644
index 0000000..7750605
--- /dev/null
+++ b/proteingym/baselines/carp_mif/compute_fitness.py
@@ -0,0 +1,174 @@
+import os
+import argparse
+import tqdm
+
+from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer
+from scipy.stats import spearmanr
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss
+
+from sequence_models.constants import PROTEIN_ALPHABET, PAD, MASK
+from sequence_models.pdb_utils import parse_PDB, process_coords
+
+from proteingym.baselines.carp_mif.carp_mif_utils import load_model_and_alphabet
+
+def label_row(rows, sequence, token_probs, alphabet, offset_idx=1):
+ rows = rows.split(":")
+ score = 0
+ for row in rows:
+ wt, idx, mt = row[0], int(row[1:-1]) - offset_idx, row[-1]
+
+ assert sequence[idx] == wt, "The listed wildtype does not match the provided sequence"
+
+ wt_encoded, mt_encoded = alphabet.index(wt), alphabet.index(mt)
+
+ score_obj = token_probs[0, idx, mt_encoded] - token_probs[0, idx, wt_encoded]
+ score += score_obj.item()
+ return score / len(rows)
+
+def process_batch_mif(prot,pdb_file,tokenizer,device='cuda:0'):
+ coords, wt, _ = parse_PDB(pdb_file)
+ coords = {
+ 'N': coords[:, 0],
+ 'CA': coords[:, 1],
+ 'C': coords[:, 2]
+ }
+ dist, omega, theta, phi = process_coords(coords)
+ batch = [[prot, torch.tensor(dist, dtype=torch.float),
+ torch.tensor(omega, dtype=torch.float),
+ torch.tensor(theta, dtype=torch.float), torch.tensor(phi, dtype=torch.float)]]
+ input_ids, nodes, edges, connections, edge_mask = tokenizer(batch)
+ input_ids = input_ids.to(device)
+ nodes = nodes.to(device)
+ edges = edges.to(device)
+ connections = connections.to(device)
+ edge_mask = edge_mask.to(device)
+ return input_ids,nodes,edges,connections,edge_mask
+
+def calc_fitness(model, DMS_data, tokenizer, device='cuda:0', model_context_len=1024, mode="masked_marginals", alphabet=PROTEIN_ALPHABET, mutation_col='mutant', target_seq=None, pdb_file=None, model_name=None, offset_idx=1):
+ if mode=="pseudo_likelihood":
+ prots=np.array(DMS_data['mutated_sequence'])
+ loss_fn = CrossEntropyLoss()
+ log_proba_list = []
+ with torch.no_grad():
+ for prot in tqdm.tqdm(prots):
+ loss_fn = CrossEntropyLoss()
+ if 'carp' in model_name:
+ input_ids = tokenizer([[prot]])[0].to(device)
+ logits = model(input_ids, logits=True)['logits']
+ elif 'mif' in model_name:
+ input_ids,nodes,edges,connections,edge_mask = process_batch_mif(prot,pdb_file,tokenizer,device)
+ logits = model(input_ids, nodes, edges, connections, edge_mask, result='logits')
+ log_proba = - loss_fn(target=input_ids.view(-1), input=logits.view(-1,logits.size(-1))).detach().cpu().numpy()
+ log_proba_list += [log_proba]
+ elif mode=="masked_marginals":
+ all_token_probs = []
+ if 'carp' in model_name:
+ input_ids = tokenizer([[target_seq]])[0].to(device)
+ elif 'mif' in model_name:
+ input_ids,nodes,edges,connections,edge_mask = process_batch_mif(target_seq,pdb_file,tokenizer,device)
+ for i in tqdm.tqdm(range(input_ids.size(1))):
+ input_ids_masked = input_ids.clone()
+ input_ids_masked[0, i] = PROTEIN_ALPHABET.index(MASK)
+ with torch.no_grad():
+ if 'carp' in model_name:
+ logits = model(input_ids_masked.cuda(), logits=True)["logits"]
+ elif 'mif' in model_name:
+ logits = model(input_ids, nodes, edges, connections, edge_mask, result='logits')
+ token_probs = torch.log_softmax(logits, dim=-1)
+ all_token_probs.append(token_probs[:, i])
+ token_probs = torch.cat(all_token_probs, dim=0).unsqueeze(0)
+ log_proba_list = DMS_data.apply(
+ lambda row: label_row(
+ row[mutation_col],
+ target_seq,
+ token_probs,
+ PROTEIN_ALPHABET,
+ offset_idx
+ ),
+ axis=1,
+ )
+ return np.array(log_proba_list)
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab="ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ parser.add_argument('--model_name', default=None, type=str, help='Name of or path to [CARP,MIF,Evodiff] model')
+ parser.add_argument('--model_path', default=None, type=str, help='Location where model parameters should be stored')
+ parser.add_argument('--DMS_reference_file_path', default='/home/pn73/Tranception/proteingym/ProteinGym_reference_file_substitutions.csv', type=str, help='Path to reference file')
+ parser.add_argument('--DMS_data_folder', default='/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/Tranception_open_source/DMS_files/ProteinGym_substitutions', type=str, help='Path of DMS folder')
+ parser.add_argument('--structure_data_folder', default='', type=str, help='Path of structure folder')
+ parser.add_argument('--DMS_index', type=int, help='Index of proteins to compute scores for')
+ parser.add_argument('--output_scores_folder', default=None, type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+ parser.add_argument('--fitness_computation_mode', default="masked_marginals", type=str, help='Fitness computtation mode [masked_marginals|pseudo_likelihood]')
+ parser.add_argument('--performance_file', default='CARP_performance.csv', type=str, help='Name of folder to write model scores to')
+ args = parser.parse_args()
+
+ model, tokenizer = load_model_and_alphabet(args.model_name, args.model_path)
+ model.cuda()
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ if not os.path.exists(args.output_scores_folder): os.mkdir(args.output_scores_folder)
+ args.output_scores_folder = args.output_scores_folder + os.sep + args.model_name
+ if not os.path.exists(args.output_scores_folder): os.mkdir(args.output_scores_folder)
+ scoring_filename = args.output_scores_folder+os.sep+DMS_id+'.csv'
+ print("Computing scores for: {} with model: {}".format(DMS_id, args.model_name))
+
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: get_mutated_sequence(target_seq, x)) if not args.indel_mode else DMS_data['mutant']
+
+ if 'mif' in args.model_name:
+ pdb_filenames = mapping_protein_seq_DMS["pdb_file"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].split('|') #if sequence is large (eg., BRCA2_HUMAN) the structure is split in several chunks
+ pdb_ranges = mapping_protein_seq_DMS["pdb_range"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].split('|')
+ model_scores=[]
+ for pdb_index, pdb_filename in enumerate(pdb_filenames):
+ pdb_file = args.structure_data_folder + os.sep + pdb_filename
+ pdb_range = [int(x) for x in pdb_ranges[pdb_index].split("-")]
+ target_seq_split = target_seq[pdb_range[0]-1:pdb_range[1]] #pdb_range is 1-indexed
+ DMS_data["mutated_position"] = DMS_data['mutant'].apply(lambda x: int(x.split(':')[0][1:-1])) #if multiple mutant, will extract position of first mutant
+ filtered_DMS_data = DMS_data[(DMS_data["mutated_position"] >= pdb_range[0]) & (DMS_data["mutated_position"] <= pdb_range[1])]
+ model_scores.append(calc_fitness(model=model, DMS_data=filtered_DMS_data, tokenizer=tokenizer, mode=args.fitness_computation_mode, target_seq=target_seq_split, pdb_file=pdb_file, model_name=args.model_name, offset_idx=pdb_range[0]))
+ model_scores = np.concatenate(model_scores)
+ else:
+ model_scores = calc_fitness(model=model, DMS_data=DMS_data, tokenizer=tokenizer, mode=args.fitness_computation_mode, target_seq=target_seq, pdb_file=None, model_name=args.model_name)
+
+ DMS_data[args.model_name+'_score']=model_scores
+ DMS_data[['mutant',args.model_name+'_score','DMS_score']].to_csv(scoring_filename, index=False)
+ spearman, _ = spearmanr(DMS_data[args.model_name+'_score'], DMS_data['DMS_score'])
+
+ if not os.path.exists(args.performance_file) or os.stat(args.performance_file).st_size==0:
+ with open(args.performance_file,"w") as performance_file:
+ performance_file.write("DMS_id,spearman\n")
+ with open(args.performance_file, "a") as performance_file:
+ performance_file.write(",".join([DMS_id,str(spearman)])+"\n")
+
+if __name__ == '__main__':
+ main()
diff --git a/proteingym/baselines/esm/__init__.py b/proteingym/baselines/esm/__init__.py
new file mode 100644
index 0000000..2aed3a7
--- /dev/null
+++ b/proteingym/baselines/esm/__init__.py
@@ -0,0 +1 @@
+import esm
\ No newline at end of file
diff --git a/proteingym/baselines/esm/compute_fitness.py b/proteingym/baselines/esm/compute_fitness.py
new file mode 100644
index 0000000..bdcd51c
--- /dev/null
+++ b/proteingym/baselines/esm/compute_fitness.py
@@ -0,0 +1,562 @@
+import argparse
+import pathlib
+import os,sys
+import math
+import random
+import numpy as np
+import pandas as pd
+from tqdm import tqdm
+from Bio import SeqIO
+import itertools
+from typing import List, Tuple
+import torch
+
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.esm import esm
+from esm import Alphabet, FastaBatchedDataset, ProteinBertModel, pretrained, MSATransformer
+
+from utils.scoring_utils import get_optimal_window, set_mutant_offset, undo_mutant_offset
+from utils.data_utils import DMS_file_cleanup
+from utils.msa_utils import MSA_processing
+
+def standardization(x):
+ """Assumes input is numpy array or pandas series"""
+ return (x - x.mean()) / x.std()
+
+def sample_msa(filename: str, nseq: int, sampling_strategy: str, random_seed: int, weight_filename=None, processed_msa=None):
+ """Reads the first nseq sequences from an MSA file, automatically removes insertions."""
+ print("Sampling sequences from MSA with strategy: "+str(sampling_strategy))
+ random.seed(random_seed)
+ if sampling_strategy=='first_x_rows':
+ msa = [
+ (record.description, str(record.seq))
+ for record in itertools.islice(SeqIO.parse(filename, "fasta"), nseq)
+ ]
+ elif sampling_strategy=='random':
+ msa = [
+ (record.description, str(record.seq)) for record in SeqIO.parse(filename, "fasta")
+ ]
+ nseq = min(len(msa),nseq)
+ msa = random.sample(msa, nseq)
+ elif sampling_strategy=='sequence-reweighting':
+ # If MSA has already been processed, just use it here
+ if processed_msa is None:
+ if weight_filename is None:
+ print("Need weight filename if using sequence-reweighting sample strategy")
+ MSA = MSA_processing(
+ MSA_location=filename,
+ use_weights=True,
+ weights_location=weight_filename
+ )
+ print("Name of focus_seq: "+str(MSA.focus_seq_name))
+ else:
+ MSA = processed_msa
+
+ # Make sure we always keep the WT in the subsampled MSA
+ msa = [(MSA.focus_seq_name,MSA.raw_seq_name_to_sequence[MSA.focus_seq_name])]
+
+ non_wt_weights = np.array([w for k, w in MSA.seq_name_to_weight.items() if k != MSA.focus_seq_name])
+ non_wt_sequences = [(k, s) for k, s in MSA.seq_name_to_sequence.items() if k != MSA.focus_seq_name]
+ non_wt_weights = non_wt_weights / non_wt_weights.sum() # Renormalize weights
+
+ # Sample the rest of the MSA according to their weights
+ if len(non_wt_sequences) > 0:
+ msa.extend(random.choices(non_wt_sequences, weights=non_wt_weights, k=nseq-1))
+
+ # This is intended to filter out weights, but after hhfiltering our msa is already the correct set
+ # for seq_name in MSA.raw_seq_name_to_sequence.keys():
+ # if seq_name == MSA.focus_seq_name:
+ # msa.append((seq_name,MSA.raw_seq_name_to_sequence[seq_name]))
+ # del MSA.seq_name_to_weight[seq_name]
+ # else:
+ # if seq_name in MSA.seq_name_to_weight:
+ # all_sequences_msa.append((seq_name,MSA.raw_seq_name_to_sequence[seq_name]))
+ # weights.append(MSA.seq_name_to_weight[seq_name])
+ # if len(all_sequences_msa)>0:
+ # weights = np.array(weights) / np.array(list(MSA.seq_name_to_weight.values())).sum()
+ # print("Check sum weights MSA: "+str(weights.sum()))
+ # msa.extend(random.choices(all_sequences_msa, weights=weights, k=nseq-1))
+
+ print("Check sum weights MSA: "+str(non_wt_weights.sum()))
+
+ msa = [(desc, seq.upper()) for desc, seq in msa]
+ print("First 10 elements of sampled MSA: ")
+ print(msa[:10])
+ #[('CCDB_ECOLI/1-101', 'MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTTDMASVPVSVIGEEVADLSHRENDIKNAINLMFWGI'), ('UniRef100_A0A6M6E7X5/22-113', '.........--GKNIPCVILQNNKGNASNTTIIVPIIAESNYIKSSPTYVHIRKYNLDEDSIAVCDQIRVIDKKRITKAAKALSEEKKQIEDGI--.....'), ('UniRef100_A0A6M6E7X5/144-229', '.........--EKAFPALVIGRDNK-EKQTLLIAPLLQAKKR.YLLPTQAKIPNGCLSDGKILSLEQMRVIDKSRIVSQNIKFSDTLLEIE-----.....'), ('UniRef100_A0A6M6E7X5/256-332', '........SEQRGLRPCLIIQNDTGNKLSTTIVLPLTSSVPKVDLVVNVIVKQEEFVGRSSIVLCNQIQTIDSSRIKQV-----------------.....'), ('UniRef100_A0A6M6E7X5/406-489', '.........------PAVCIQNSYGNHYSVLIVAPLISSKRT.RLLPTQVKIDFNDSGELMVAALEQVRVIDKRRVVDVLDELPEEKKEILNAYCVS....'), ('UniRef100_UPI000B4B97B8/22-115', '........SEQKGERPAVVVQNDFGNRASTTLIVPLTSNFKT.-EIPTHVNISSKELGVDSIALCEQVRVISKERIIQVRTILSKEVKEIDDALLISF...'), ('UniRef100_UPI000B4B97B8/135-229', '........NEQKGNRPAIVIQNDVGNKYSTLIVAPLQLKKKK.-RLPTHVEIPGNLISKDSIALLEQVRVVDKERVTGVVQNLTEEFKNIEEALLVSF...'), ('UniRef100_UPI0009C11867/73-161', '.......GSEQNGLRPVVIIQNNLGNKYGTLIVAPITSQDKK.-DLPVHSEIYNNSLEKDSTILLEQVTTIDKNKVKEFVGHLTRNEKKLNIALAR.....'), ('UniRef100_F4A1A6/19-111', '........SEQGGVRPVLVVQNDIGNKYSTVIVAAITSQINK.AKLPTHVEISASDYGKDSVILLEQIRTIDKKRLREKIGYLSAETKKVDEALQISF...'), ('UniRef100_A0A150FSC1/22-114', '........SEQGGVRPVLVIQNDIGNKYSTVIVAAITSQINK.AKLPIHIEIKANGLNKDSVVLLEQIRTIDKKRLREKIGHFDEEKEKVDQAIQISL...')]
+ return msa
+
+
+def process_msa(filename: str, weight_filename: str, filter_msa: bool, path_to_hhfilter: str, hhfilter_min_cov=75, hhfilter_max_seq_id=100, hhfilter_min_seq_id=0) -> List[Tuple[str, str]]:
+ if filter_msa:
+ input_folder = '/'.join(filename.split('/')[:-1])
+ msa_name = filename.split('/')[-1].split('.')[0]
+ if not os.path.isdir(input_folder+os.sep+'preprocessed'):
+ os.mkdir(input_folder+os.sep+'preprocessed')
+ if not os.path.isdir(input_folder+os.sep+'hhfiltered'):
+ os.mkdir(input_folder+os.sep+'hhfiltered')
+ preprocessed_filename = input_folder+os.sep+'preprocessed'+os.sep+msa_name
+ os.system('cat '+filename+' | tr "." "-" >> '+preprocessed_filename+'.a2m')
+ os.system('dd if='+preprocessed_filename+'.a2m of='+preprocessed_filename+'_UC.a2m conv=ucase')
+ output_filename = input_folder+os.sep+'hhfiltered'+os.sep+msa_name+'_hhfiltered_cov_'+str(hhfilter_min_cov)+'_maxid_'+str(hhfilter_max_seq_id)+'_minid_'+str(hhfilter_min_seq_id)+'.a2m'
+ os.system(path_to_hhfilter+os.sep+'bin/hhfilter -cov '+str(hhfilter_min_cov)+' -id '+str(hhfilter_max_seq_id)+' -qid '+str(hhfilter_min_seq_id)+' -i '+preprocessed_filename+'_UC.a2m -o '+output_filename)
+ filename = output_filename
+
+ MSA = MSA_processing(
+ MSA_location=filename,
+ use_weights=True,
+ weights_location=weight_filename
+ )
+ print("Name of focus_seq: "+str(MSA.focus_seq_name))
+ return MSA
+
+
+def create_parser():
+ parser = argparse.ArgumentParser(
+ description="Label a deep mutational scan with predictions from an ensemble of ESM-1v models." # noqa
+ )
+ parser.add_argument(
+ "--model_type",
+ type=str,
+ help="MSA_transformer Vs ESM1v Vs ESM1b",
+ default="MSA_transformer",
+ nargs="+",
+ )
+ parser.add_argument(
+ "--model-location",
+ type=str,
+ help="PyTorch model file OR name of pretrained model to download (see README for models)",
+ nargs="+",
+ )
+ parser.add_argument(
+ "--sequence",
+ type=str,
+ help="Base sequence to which mutations were applied",
+ )
+ parser.add_argument(
+ "--dms-input",
+ type=pathlib.Path,
+ help="CSV file containing the deep mutational scan",
+ )
+ parser.add_argument(
+ "--dms_index",
+ type=int,
+ help="Index of DMS in mapping file",
+ )
+ parser.add_argument(
+ "--dms_mapping",
+ type=str,
+ help="Location of DMS_mapping",
+ )
+ parser.add_argument(
+ "--mutation-col",
+ type=str,
+ default="mutant",
+ help="column in the deep mutational scan labeling the mutation as 'AiB'"
+ )
+ parser.add_argument(
+ "--dms-output",
+ type=pathlib.Path,
+ help="Output file containing the deep mutational scan along with predictions",
+ )
+ parser.add_argument(
+ "--offset-idx",
+ type=int,
+ default=1,
+ help="Offset of the mutation positions in `--mutation-col`"
+ )
+ parser.add_argument(
+ "--scoring-strategy",
+ type=str,
+ default="wt-marginals",
+ choices=["wt-marginals", "pseudo-ppl", "masked-marginals"],
+ help=""
+ )
+ parser.add_argument(
+ "--msa-path",
+ type=pathlib.Path,
+ help="path to MSA (required for MSA Transformer)"
+ )
+ parser.add_argument(
+ "--msa-sampling-strategy",
+ type=str,
+ default='sequence-reweighting',
+ help="Strategy to sample sequences from MSA [sequence-reweighting|random|first_x_rows]"
+ )
+ parser.add_argument(
+ "--msa-samples",
+ type=int,
+ default=400,
+ help="number of sequences to randomly sample from the MSA"
+ )
+ parser.add_argument(
+ "--msa-weights-folder",
+ type=str,
+ default=None,
+ help="Folder with weights to sample MSA sequences in 'sequence-reweighting' scheme"
+ )
+ parser.add_argument(
+ '--seeds',
+ type=int,
+ default=1,
+ help='Random seed used during training',
+ nargs="+"
+ )
+ parser.add_argument(
+ '--filter-msa',
+ action='store_true',
+ help='Whether to use hhfilter to filter input MSA before sampling'
+ )
+ parser.add_argument(
+ '--hhfilter-min-cov',
+ type=int,
+ default=75,
+ help='minimum coverage with query (%)'
+ )
+ parser.add_argument(
+ '--hhfilter-max-seq-id',
+ type=int,
+ default=90,
+ help='maximum pairwise identity (%)'
+ )
+ parser.add_argument(
+ '--hhfilter-min-seq-id',
+ type=int,
+ default=0,
+ help='minimum sequence identity with query (%)'
+ )
+ parser.add_argument(
+ '--path-to-hhfilter',
+ type=str,
+ default='/n/groups/marks/software/hhsuite/hhsuite-3.3.0',
+ help='Path to hhfilter binaries'
+ )
+ parser.add_argument(
+ '--scoring-window',
+ type=str,
+ default='optimal',
+ help='Approach to handle long sequences [optimal|overlapping]'
+ )
+ parser.add_argument(
+ '--overwrite-prior-scores',
+ action='store_true',
+ help='Whether to overwrite prior scores in the dataframe'
+ )
+ #No ref file provided
+ parser.add_argument('--target_seq', default=None, type=str, help='WT sequence mutated in the assay')
+ parser.add_argument('--weight_file_name', default=None, type=str, help='Wild type sequence mutated in the assay (to be provided if not using a reference file)')
+ parser.add_argument('--MSA_start', default=None, type=int, help='Index of first AA covered by the MSA relative to target_seq coordinates (1-indexing)')
+ parser.add_argument('--MSA_end', default=None, type=int, help='Index of last AA covered by the MSA relative to target_seq coordinates (1-indexing)')
+
+ parser.add_argument("--nogpu", action="store_true", help="Do not use GPU even if available")
+ return parser
+
+def label_row(row, sequence, token_probs, alphabet, offset_idx):
+ score=0
+ for mutation in row.split(":"):
+ wt, idx, mt = mutation[0], int(mutation[1:-1]) - offset_idx, mutation[-1]
+ assert sequence[idx] == wt, "The listed wildtype does not match the provided sequence"
+
+ wt_encoded, mt_encoded = alphabet.get_idx(wt), alphabet.get_idx(mt)
+
+ # add 1 for BOS
+ score += (token_probs[0, 1 + idx, mt_encoded] - token_probs[0, 1 + idx, wt_encoded]).item()
+ return score
+
+def get_mutated_sequence(row, wt_sequence, offset_idx):
+ wt, idx, mt = row[0], int(row[1:-1]) - offset_idx, row[-1]
+ assert wt_sequence[idx] == wt, "The listed wildtype does not match the provided sequence"
+ # modify the sequence
+ sequence = wt_sequence[:idx] + mt + wt_sequence[(idx + 1) :]
+ return sequence
+def compute_pppl(sequence, model, alphabet, MSA_data = None, mode = "ESM1v"):
+ # encode the sequence
+ data = [
+ ("protein1", sequence),
+ ]
+ if mode == "MSA_Transformer":
+ data = [data + MSA_data[0]]
+ batch_converter = alphabet.get_batch_converter()
+
+ _, _, batch_tokens = batch_converter(data)
+ # compute probabilities at each position
+ log_probs = []
+ for i in range(1, len(sequence) - 1):
+ batch_tokens_masked = batch_tokens.clone()
+ batch_tokens_masked[0, i] = alphabet.mask_idx
+ with torch.no_grad():
+ token_probs = torch.log_softmax(model(batch_tokens_masked.cuda())["logits"], dim=-1) # This might OOM because the MSA batch is large (400 by default)
+ if mode == "ESM1v":
+ log_probs.append(token_probs[0, i, alphabet.get_idx(sequence[i])].item()) # vocab size
+ elif mode == "MSA_Transformer":
+ log_probs.append(token_probs[0, 0, i, alphabet.get_idx(sequence[i])].item()) # vocab size
+ return sum(log_probs)
+
+
+def main(args):
+ if not os.path.exists(args.dms_output): os.mkdir(args.dms_output)
+ print("Arguments:", args)
+
+ # Load the deep mutational scan
+ mutant_col = args.mutation_col # Default "mutant"
+ if args.dms_index is not None:
+ mapping_protein_seq_DMS = pd.read_csv(args.dms_mapping)
+ DMS_id = mapping_protein_seq_DMS["DMS_id"][args.dms_index]
+ print("Compute scores for DMS: "+str(DMS_id))
+ row = mapping_protein_seq_DMS[mapping_protein_seq_DMS["DMS_id"]==DMS_id]
+ if len(row) == 0:
+ raise ValueError("No mappings found for DMS: "+str(DMS_id))
+ elif len(row) > 1:
+ raise ValueError("Multiple mappings found for DMS: "+str(DMS_id))
+
+ row = row.iloc[0]
+ row = row.replace(np.nan, "") # Makes it more manageable to use in strings
+
+ args.sequence = row["target_seq"].upper()
+ args.dms_input = str(args.dms_input)+os.sep+row["DMS_filename"]
+
+ mutant_col = row["DMS_mutant_column"] if "DMS_mutant_column" in mapping_protein_seq_DMS.columns else mutant_col
+ args.dms_output=str(args.dms_output)+os.sep+DMS_id+'.csv'
+
+ target_seq_start_index = row["start_idx"] if "start_idx" in mapping_protein_seq_DMS.columns and row["start_idx"]!="" else 1
+ target_seq_end_index = target_seq_start_index + len(args.sequence)
+
+ if "MSA_transformer" in args.model_type: # model_type is a list
+ # Check MSA_filename exists (might be NaN / empty)
+ msa_filename = row["MSA_filename"]
+ if msa_filename == "":
+ raise ValueError("No MSA found for DMS: "+str(DMS_id))
+
+ args.msa_path= str(args.msa_path)+os.sep+msa_filename # msa_path is expected to be the path to the directory where MSAs are located.
+
+ msa_start_index = int(row["MSA_start"]) if "MSA_start" in mapping_protein_seq_DMS.columns else 1
+ msa_end_index = int(row["MSA_end"]) if "MSA_end" in mapping_protein_seq_DMS.columns else len(args.sequence)
+
+ MSA_weight_file_name = args.msa_weights_folder + os.sep + row["weight_file_name"] if ("weight_file_name" in mapping_protein_seq_DMS.columns and args.msa_weights_folder is not None) else None
+ if ((target_seq_start_index!=msa_start_index) or (target_seq_end_index!=msa_end_index)):
+ args.sequence = args.sequence[msa_start_index-1:msa_end_index]
+ target_seq_start_index = msa_start_index
+ target_seq_end_index = msa_end_index
+ df = pd.read_csv(args.dms_input)
+ else:
+
+ DMS_id = str(args.dms_input).split(os.sep)[-1].split('.csv')[0]
+ args.dms_output=str(args.dms_output)+os.sep+DMS_id+'.csv'
+ target_seq_start_index = args.offset_idx
+ args.sequence = args.target_seq.upper()
+ if (args.MSA_start is None) or (args.MSA_end is None):
+ if args.msa_path: print("MSA start and end not provided -- Assuming the MSA is covering the full WT sequence")
+ args.MSA_start = 1
+ args.MSA_end = len(args.target_seq)
+ msa_start_index = args.MSA_start
+ msa_end_index = args.MSA_end
+ MSA_weight_file_name = args.msa_weights_folder + os.sep + args.weight_file_name if args.msa_weights_folder is not None else None
+ df = pd.read_csv(args.dms_input)
+
+ if len(df) == 0:
+ raise ValueError("No rows found in the dataframe")
+ print(f"df shape: {df.shape}", flush=True)
+
+ # inference for each model
+ print("Starting model scoring")
+ for model_location in args.model_location:
+ model, alphabet = pretrained.load_model_and_alphabet(model_location)
+ model_location = model_location.split("/")[-1].split(".")[0]
+ model.eval()
+ if torch.cuda.is_available() and not args.nogpu:
+ model = model.cuda()
+ print("Transferred model to GPU")
+ else:
+ print(f"Not using GPU. torch.cuda.is_available(): {torch.cuda.is_available()}, args.nogpu: {args.nogpu}")
+
+ batch_converter = alphabet.get_batch_converter()
+
+ if isinstance(model, MSATransformer):
+ args.offset_idx = msa_start_index
+ # Process MSA once, then sample from it
+ processed_msa = process_msa(filename=args.msa_path, weight_filename=MSA_weight_file_name, filter_msa=args.filter_msa, hhfilter_min_cov=args.hhfilter_min_cov, hhfilter_max_seq_id=args.hhfilter_max_seq_id, hhfilter_min_seq_id=args.hhfilter_min_seq_id, path_to_hhfilter=args.path_to_hhfilter)
+ for seed in args.seeds:
+ if os.path.exists(args.dms_output):
+ prior_score_df = pd.read_csv(args.dms_output)
+ if f"{model_location}_seed{seed}" in prior_score_df.columns and not args.overwrite_prior_scores:
+ print(f"Skipping seed {seed} as it is already in the dataframe")
+ df = prior_score_df
+ continue
+ else:
+ prior_score_df = None
+ data = [sample_msa(sampling_strategy=args.msa_sampling_strategy, filename=args.msa_path, nseq=args.msa_samples, weight_filename=MSA_weight_file_name, processed_msa=processed_msa, random_seed=seed)]
+ assert (args.scoring_strategy in ["masked-marginals","pseudo-ppl"]), "Zero-shot scoring strategy not supported with MSA Transformer"
+
+ batch_labels, batch_strs, batch_tokens = batch_converter(data)
+ print(f"Batch sizes: {batch_tokens.size()}")
+
+ if args.scoring_strategy == "masked-marginals":
+ all_token_probs = []
+ for i in tqdm(range(batch_tokens.size(2)), desc="Scoring masked-marginals"):
+ batch_tokens_masked = batch_tokens.clone()
+ batch_tokens_masked[0, 0, i] = alphabet.mask_idx # mask out first sequence
+ if batch_tokens.size(-1) > 1024:
+ large_batch_tokens_masked=batch_tokens_masked.clone()
+ start, end = get_optimal_window(mutation_position_relative=i, seq_len_wo_special=len(args.sequence)+2, model_window=1024)
+ print("Start index {} - end index {}".format(start,end))
+ batch_tokens_masked = large_batch_tokens_masked[:,:,start:end]
+ else:
+ start=0
+ with torch.no_grad():
+ token_probs = torch.log_softmax(
+ model(batch_tokens_masked.cuda())["logits"], dim=-1
+ )
+ all_token_probs.append(token_probs[:, 0, i-start].detach().cpu()) # vocab size
+ token_probs = torch.cat(all_token_probs, dim=0).unsqueeze(0)
+ df[f"{model_location}_seed{seed}"] = df.apply(
+ lambda row: label_row(
+ row[mutant_col], args.sequence, token_probs.detach().cpu(), alphabet, args.offset_idx
+ ),
+ axis=1,
+ )
+ elif args.scoring_strategy == "pseudo-ppl":
+ tqdm.pandas()
+ if 'mutated_sequence' not in df:
+ df['mutated_sequence'] = df.progress_apply(
+ lambda row: get_mutated_sequence(
+ row[mutant_col], args.sequence, args.offset_idx
+ ),
+ axis=1,
+ )
+ df[f"{model_location}_seed{seed}"] = df.progress_apply(
+ lambda row: compute_pppl(
+ row['mutated_sequence'], model, alphabet, MSA_data = data, mode = "MSA_Transformer"
+ ),
+ axis=1,
+ )
+ if os.path.exists(args.dms_output) and not args.overwrite_prior_scores:
+ prior_score_df = pd.read_csv(args.dms_output)
+ assert f"{model_location}_seed{seed}" not in prior_score_df.columns, f"Column {model_location}_seed{seed} already exists in {args.dms_output}"
+ prior_score_df = prior_score_df.merge(df[[f"{model_location}_seed{seed}","mutant"]],on="mutant")
+ prior_score_df.to_csv(args.dms_output, index=False)
+ df = prior_score_df
+ else:
+ df.to_csv(args.dms_output, index=False)
+ else:
+ args.offset_idx = target_seq_start_index
+ data = [
+ ("protein1", args.sequence),
+ ]
+ batch_labels, batch_strs, batch_tokens = batch_converter(data)
+
+ if args.scoring_strategy == "wt-marginals":
+ with torch.no_grad():
+ if batch_tokens.size(1) > 1024 and args.scoring_window=="overlapping":
+ batch_size, seq_len = batch_tokens.shape #seq_len includes BOS and EOS
+ token_probs = torch.zeros((batch_size,seq_len,len(alphabet))).cuda() # Note: batch_size = 1 (need to keep batch dimension to score with model though)
+ token_weights = torch.zeros((batch_size,seq_len)).cuda()
+ weights = torch.ones(1024).cuda() # 1 for 256≤i<1022-256
+ for i in range(1,257):
+ weights[i] = 1 / (1 + math.exp(-(i-128)/16))
+ for i in range(1022-256,1023):
+ weights[i] = 1 / (1 + math.exp((i-1022+128)/16))
+ start_left_window = 0
+ end_left_window = 1023 #First window is indexed [0-1023]
+ start_right_window = (batch_tokens.size(1) - 1) - 1024 + 1 #Last index is len-1
+ end_right_window = batch_tokens.size(1) - 1
+ while True:
+ # Left window update
+ left_window_probs = torch.log_softmax(model(batch_tokens[:,start_left_window:end_left_window+1].cuda())["logits"], dim=-1)
+ token_probs[:,start_left_window:end_left_window+1] += left_window_probs * weights.view(-1,1)
+ token_weights[:,start_left_window:end_left_window+1] += weights
+ # Right window update
+ right_window_probs = torch.log_softmax(model(batch_tokens[:,start_right_window:end_right_window+1].cuda())["logits"], dim=-1)
+ token_probs[:,start_right_window:end_right_window+1] += right_window_probs * weights.view(-1,1)
+ token_weights[:,start_right_window:end_right_window+1] += weights
+ if end_left_window > start_right_window:
+ #overlap between windows in that last scoring so we break from the loop
+ break
+ start_left_window+=511
+ end_left_window+=511
+ start_right_window-=511
+ end_right_window-=511
+ #If central overlap not wide engouh, we add one more window at the center
+ final_overlap = end_left_window - start_right_window + 1
+ if final_overlap < 511:
+ start_central_window = int(seq_len / 2) - 512
+ end_central_window = start_central_window + 1023
+ central_window_probs = torch.log_softmax(model(batch_tokens[:,start_central_window:end_central_window+1].cuda())["logits"], dim=-1)
+ token_probs[:,start_central_window:end_central_window+1] += central_window_probs * weights.view(-1,1)
+ token_weights[:,start_central_window:end_central_window+1] += weights
+ #Weight normalization
+ token_probs = token_probs / token_weights.view(-1,1) #Add 1 to broadcast
+ else:
+ token_probs = torch.log_softmax(model(batch_tokens.cuda())["logits"], dim=-1)
+ df[model_location] = df.apply(
+ lambda row: label_row(
+ row[mutant_col],
+ args.sequence,
+ token_probs,
+ alphabet,
+ args.offset_idx,
+ ),
+ axis=1,
+ )
+ elif args.scoring_strategy == "masked-marginals":
+ print("Scoring with masked-marginals and model {}".format(model_location))
+ all_token_probs = []
+ for i in tqdm(range(batch_tokens.size(1)), desc="Scoring masked-marginals"):
+ batch_tokens_masked = batch_tokens.clone()
+ batch_tokens_masked[0, i] = alphabet.mask_idx
+ if batch_tokens.size(1) > 1024 and args.scoring_window=="optimal":
+ large_batch_tokens_masked=batch_tokens_masked.clone()
+ start, end = get_optimal_window(mutation_position_relative=i, seq_len_wo_special=len(args.sequence)+2, model_window=1024)
+ batch_tokens_masked = large_batch_tokens_masked[:,start:end]
+ elif batch_tokens.size(1) > 1024 and args.scoring_window=="overlapping":
+ print("Overlapping not yet implemented for masked-marginals")
+ sys.exit(0)
+ else:
+ start=0
+ with torch.no_grad():
+ token_probs = torch.log_softmax(model(batch_tokens_masked.cuda())["logits"], dim=-1)
+ all_token_probs.append(token_probs[:, i-start]) # vocab size
+ token_probs = torch.cat(all_token_probs, dim=0).unsqueeze(0)
+ df[model_location] = df.apply(
+ lambda row: label_row(
+ row[mutant_col],
+ args.sequence,
+ token_probs,
+ alphabet,
+ args.offset_idx,
+ ),
+ axis=1,
+ )
+ elif args.scoring_strategy == "pseudo-ppl":
+ tqdm.pandas()
+ if 'mutated_sequence' not in df:
+ df['mutated_sequence'] = df.progress_apply(
+ lambda row: get_mutated_sequence(
+ row[mutant_col], args.sequence, args.offset_idx
+ ),
+ axis=1,
+ )
+ df[model_location] = df.progress_apply(
+ lambda row: compute_pppl(
+ row['mutated_sequence'], model, alphabet
+ ),
+ axis=1,
+ )
+ # Compute ensemble score Ensemble_ESM1v, standardizes each model score and then averages them
+ # note this assumes that all the input model checkpoints are ESM1v
+ if "ESM1v" in args.model_type:
+ df["Ensemble_ESM1v"] = 0.0
+ for model_location in args.model_location:
+ model_location = model_location.split("/")[-1].split(".")[0]
+ df["Ensemble_ESM1v"] += df[model_location]
+ df["Ensemble_ESM1v"] /= len(args.model_location)
+ elif "MSA_transformer" in args.model_type:
+ df[f"{model_location}_ensemble"] = 0.0
+ for seed in args.seeds:
+ df[f"{model_location}_ensemble"] += df[f"{model_location}_seed{seed}"]
+ df[f"{model_location}_ensemble"] /= len(args.seeds)
+ df.to_csv(args.dms_output,index=False)
+
+
+if __name__ == "__main__":
+ parser = create_parser()
+ args = parser.parse_args()
+ main(args)
diff --git a/proteingym/baselines/esm/compute_fitness_esm_if1.py b/proteingym/baselines/esm/compute_fitness_esm_if1.py
new file mode 100644
index 0000000..39f9193
--- /dev/null
+++ b/proteingym/baselines/esm/compute_fitness_esm_if1.py
@@ -0,0 +1,108 @@
+import argparse
+import numpy as np
+from pathlib import Path
+import torch
+import torch.nn.functional as F
+from tqdm import tqdm
+import time
+import pandas as pd
+import os
+import sys
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.esm import esm
+import esm.inverse_folding
+from esm.inverse_folding.util import CoordBatchConverter
+
+SCORE_NATIVE = False
+
+
+def get_sequence_loss_batch(model, alphabet, coords_list, seq_list):
+ device = next(model.parameters()).device
+ batch_converter = CoordBatchConverter(alphabet)
+ assert len(coords_list) == len(seq_list)
+ batch = [(coords, None, seq) for coords, seq in zip(coords_list, seq_list)]
+ coords, confidence, strs, tokens, padding_mask = batch_converter(batch, device=device)
+ prev_output_tokens = tokens[:, :-1].to(device)
+ target = tokens[:, 1:]
+ target_padding_mask = (target == alphabet.padding_idx)
+ logits, _ = model.forward(coords, padding_mask, confidence, prev_output_tokens)
+ loss = F.cross_entropy(logits, target, reduction='none')
+ losses = loss.cpu().detach().numpy()
+ target_padding_masks = target_padding_mask.cpu().numpy()
+ return losses, target_padding_masks
+
+def score_sequence_batch(model, alphabet, coords_list, seq_list):
+ losses, target_padding_masks = get_sequence_loss_batch(model, alphabet, coords_list, seq_list)
+ print("debug: losses and target_padding_mask shapes: ", losses.shape, target_padding_masks.shape)
+ ll_fullseqs_batch = -np.sum(losses * ~target_padding_masks, axis=1) / np.sum(~target_padding_masks, axis=1)
+ return ll_fullseqs_batch
+
+
+def score_singlechain_backbone_batch(model, alphabet, pdb_file, chain, mutation_file, output_filepath, batch_size=1,nogpu=False):
+ if not nogpu:
+ assert torch.cuda.is_available(), "Expected GPU. If you want to use CPU, you have to specify --nogpu every time."
+ model = model.cuda()
+ print("Transferred model to GPU")
+ else:
+ print(f"Running model on CPU: torch cuda is available={torch.cuda.is_available()} nogpu={nogpu}")
+
+ start_time = time.perf_counter()
+ coords, native_seq = esm.inverse_folding.util.load_coords(pdb_file, chain)
+ print(f"Coords loaded in {time.perf_counter() - start_time} seconds")
+ mut_df = pd.read_csv(mutation_file)
+ seq_list = mut_df["mutated_sequence"].tolist()
+ header_list = mut_df["mutant"].tolist()
+ coords_list = [coords] * len(seq_list)
+
+ print(f"Sequences loaded in {time.perf_counter() - start_time} seconds")
+
+ start_scoring = time.perf_counter()
+
+ with open(output_filepath, 'w') as fout:
+ fout.write('mutant,esmif1_ll\n')
+ for i in tqdm(range(0, len(seq_list), batch_size)):
+ batch = seq_list[i:i+batch_size]
+ coords_batch = coords_list[i:i+batch_size]
+ ll_fullseq = score_sequence_batch(model, alphabet, coords_batch, batch)
+ with open(output_filepath, 'a') as fout:
+ for header, ll in zip(header_list[i:i+batch_size], ll_fullseq):
+ fout.write(header + ',' + str(ll) + '\n')
+
+ print(f"Scoring in {time.perf_counter() - start_scoring} seconds")
+
+ print(f'Results saved to {output_filepath}')
+ print(f"Total time: {time.perf_counter() - start_time}")
+
+def main():
+ parser = argparse.ArgumentParser(description='Score sequences based on a given structure.')
+ parser.add_argument('--DMS_reference_file_path',type=str,help='path to DMS reference file')
+ parser.add_argument('--DMS_data_folder',type=str,help="path to folder containing DMS data")
+ parser.add_argument('--structure_folder',type=str,help='folder containing pdb files for each DMS')
+ parser.add_argument('--DMS_index',type=int,help='index of DMS in DMS reference file')
+ parser.add_argument('--model_location',type=str,help='path to model')
+ parser.add_argument('--batch_size',type=int,default=1)
+ parser.add_argument('--output_scores_folder',type=str,help='path to folder where scores will be saved')
+ parser.add_argument('--chain', type=str,help='chain id for the chain of interest', default='A')
+ parser.set_defaults(multichain_backbone=False)
+ parser.add_argument('--multichain-backbone', action='store_true',help='use the backbones of all chains in the input for conditioning')
+ parser.add_argument('--singlechain-backbone', dest='multichain_backbone',action='store_false',help='use the backbone of only target chain in the input for conditioning')
+ parser.add_argument("--nogpu", action="store_true", help="Do not use GPU even if available")
+
+ args = parser.parse_args()
+ mapping_df = pd.read_csv(args.DMS_reference_file_path)
+ DMS_id = mapping_df.iloc[args.DMS_index]['DMS_id']
+ DMS_filename = mapping_df.iloc[args.DMS_index]['DMS_filename']
+ output_filename = args.output_scores_folder + os.sep + DMS_id + ".csv"
+ if not os.path.exists(args.output_scores_folder):
+ os.makedirs(args.output_scores_folder)
+
+ pdb_file = args.structure_folder + os.sep + mapping_df.iloc[args.DMS_index]['pdb_file']
+ model, alphabet = esm.pretrained.load_model_and_alphabet(args.model_location)
+ model = model.eval()
+ mutation_filename = args.DMS_data_folder + os.sep + DMS_filename
+ score_singlechain_backbone_batch(model, alphabet, pdb_file=pdb_file, batch_size=args.batch_size, chain=args.chain,mutation_file=mutation_filename,output_filepath=output_filename,nogpu=args.nogpu)
+
+
+
+if __name__ == '__main__':
+ main()
diff --git a/proteingym/baselines/esm/esm/__init__.py b/proteingym/baselines/esm/esm/__init__.py
new file mode 100644
index 0000000..907081d
--- /dev/null
+++ b/proteingym/baselines/esm/esm/__init__.py
@@ -0,0 +1,12 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from .version import version as __version__ # noqa
+
+from .data import Alphabet, BatchConverter, FastaBatchedDataset # noqa
+from .model.esm1 import ProteinBertModel # noqa
+from .model.esm2 import ESM2 # noqa
+from .model.msa_transformer import MSATransformer #noqa
+from . import pretrained # noqa
diff --git a/proteingym/baselines/esm/esm/axial_attention.py b/proteingym/baselines/esm/esm/axial_attention.py
new file mode 100644
index 0000000..b508d9d
--- /dev/null
+++ b/proteingym/baselines/esm/esm/axial_attention.py
@@ -0,0 +1,297 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import math
+import torch
+import torch.nn as nn
+
+# Adapted from the Tranception codebase (https://github.com/OATML-Markslab/Tranception/blob/main/tranception/model_pytorch.py#L73) with the following 3 main changes:
+ # Works on the 2D inputs of ProteinNPT
+ # Adapted to MLM (instead of autoregressive transformer)
+ # Handles separately protein embeddings and targets
+
+class SpatialDepthWiseConvolution(nn.Module):
+ def __init__(self, head_dim: int, kernel_size: int = 3, num_targets: int = 1):
+ super().__init__()
+ self.kernel_size = kernel_size
+ self.conv = nn.Conv1d(in_channels=head_dim, out_channels=head_dim, kernel_size=(kernel_size,), padding='same', groups=head_dim)
+ self.num_targets = num_targets
+
+ def forward(self, x: torch.Tensor):
+ # Need to separate the targets from protein embeddings (convolutions only apply to protein embeddings)
+ x , y = x[:,:-self.num_targets], x[:,-self.num_targets:]
+ # Apply conv. Input x is of dim (num_rows, seq_len, batch_size, self.num_heads, self.head_dim)
+ num_rows, seq_len, batch_size, num_heads, head_dim = x.shape
+ x = x.permute(0, 2, 3, 4, 1).contiguous()
+ x = x.view(num_rows * batch_size * num_heads, head_dim, seq_len)
+ x = self.conv(x)
+ x = x.view(num_rows, batch_size, num_heads, head_dim, seq_len)
+ x = x.permute(0, 4, 1, 2, 3)
+
+ # Concatenate back with targets
+ x = torch.cat([x,y], dim=1)
+ return x
+
+class RowSelfAttention(nn.Module):
+ """Compute self-attention over rows of a 2D input."""
+
+ def __init__(
+ self,
+ embed_dim,
+ num_heads,
+ dropout=0.0,
+ max_tokens_per_msa: int = 2 ** 16,
+ tranception_attention: bool = False,
+ num_targets: int = 1,
+ ):
+ super().__init__()
+ self.num_heads = num_heads
+ self.dropout = dropout
+ self.head_dim = embed_dim // num_heads
+ self.scaling = self.head_dim ** -0.5
+ self.max_tokens_per_msa = max_tokens_per_msa
+ self.attn_shape = "hnij"
+
+ self.k_proj = nn.Linear(embed_dim, embed_dim)
+ self.v_proj = nn.Linear(embed_dim, embed_dim)
+ self.q_proj = nn.Linear(embed_dim, embed_dim)
+
+ self.out_proj = nn.Linear(embed_dim, embed_dim)
+ self.dropout_module = nn.Dropout(dropout)
+
+ self.tranception_attention = tranception_attention
+ self.num_targets = num_targets
+ if self.tranception_attention:
+ assert self.num_heads%4==0, "Invalid number of heads. Tranception requires the number of heads to be a multiple of 4."
+ self.num_heads_per_kernel_size = self.num_heads // 4
+ self.query_depthwiseconv = nn.ModuleDict()
+ self.key_depthwiseconv = nn.ModuleDict()
+ self.value_depthwiseconv = nn.ModuleDict()
+ for kernel_idx, kernel in enumerate([3,5,7]):
+ self.query_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel,self.num_targets)
+ self.key_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel,self.num_targets)
+ self.value_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel,self.num_targets)
+
+ def align_scaling(self, q):
+ num_rows = q.size(0)
+ return self.scaling / math.sqrt(num_rows)
+
+ def _batched_forward(
+ self,
+ x,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ max_rows = max(1, self.max_tokens_per_msa // num_cols)
+ attns = 0
+ scaling = self.align_scaling(x)
+ for start in range(0, num_rows, max_rows):
+ attn_weights = self.compute_attention_weights(
+ x[start : start + max_rows],
+ scaling,
+ self_attn_mask=self_attn_mask,
+ self_attn_padding_mask=self_attn_padding_mask[:, start : start + max_rows]
+ if self_attn_padding_mask is not None
+ else None,
+ )
+ attns += attn_weights
+ attn_probs = attns.softmax(-1)
+ attn_probs = self.dropout_module(attn_probs)
+
+ outputs = []
+ for start in range(0, num_rows, max_rows):
+ output = self.compute_attention_update(x[start : start + max_rows], attn_probs)
+ outputs.append(output)
+
+ output = torch.cat(outputs, 0)
+ return output, attn_probs
+
+ def compute_attention_weights(
+ self,
+ x,
+ scaling: float,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ q = self.q_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+ k = self.k_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+ q *= scaling
+ if self_attn_padding_mask is not None:
+ # Zero out any padded aligned positions - this is important since
+ # we take a sum across the alignment axis.
+ q *= 1 - self_attn_padding_mask.permute(1, 2, 0).unsqueeze(3).unsqueeze(4).to(q)
+
+ if self.tranception_attention:
+ # We do not do anything on the first self.num_heads_per_kernel_size heads (kernel =1)
+ query_list=[q[:,:,:,:self.num_heads_per_kernel_size,:]]
+ key_list=[k[:,:,:,:self.num_heads_per_kernel_size,:]]
+ for kernel_idx in range(3):
+ query_list.append(self.query_depthwiseconv[str(kernel_idx)](q[:,:,:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:]))
+ key_list.append(self.key_depthwiseconv[str(kernel_idx)](k[:,:,:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:]))
+ q=torch.cat(query_list, dim=1)
+ k=torch.cat(key_list, dim=1)
+
+ attn_weights = torch.einsum(f"rinhd,rjnhd->{self.attn_shape}", q, k)
+
+ if self_attn_mask is not None:
+ raise NotImplementedError
+ # Mask Size: [B x R x C], Weights Size: [H x B x C x C]
+
+ if self_attn_padding_mask is not None:
+ attn_weights = attn_weights.masked_fill(
+ self_attn_padding_mask[:, 0].unsqueeze(0).unsqueeze(2),
+ -10000,
+ )
+
+ return attn_weights
+
+ def compute_attention_update(
+ self,
+ x,
+ attn_probs,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ v = self.v_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+
+ if self.tranception_attention:
+ value_list=[v[:,:,:,:self.num_heads_per_kernel_size,:]]
+ for kernel_idx in range(3):
+ value_list.append(self.value_depthwiseconv[str(kernel_idx)](v[:,:,:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:]))
+ v=torch.cat(value_list, dim=1)
+
+ context = torch.einsum(f"{self.attn_shape},rjnhd->rinhd", attn_probs, v)
+ context = context.contiguous().view(num_rows, num_cols, batch_size, embed_dim)
+ output = self.out_proj(context)
+ return output
+
+ def forward(
+ self,
+ x,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ if (num_rows * num_cols > self.max_tokens_per_msa) and not torch.is_grad_enabled():
+ return self._batched_forward(x, self_attn_mask, self_attn_padding_mask)
+ else:
+ scaling = self.align_scaling(x)
+ attn_weights = self.compute_attention_weights(
+ x, scaling, self_attn_mask, self_attn_padding_mask
+ )
+ attn_probs = attn_weights.softmax(-1)
+ attn_probs = self.dropout_module(attn_probs)
+ output = self.compute_attention_update(x, attn_probs)
+ return output, attn_probs
+
+
+class ColumnSelfAttention(nn.Module):
+ """Compute self-attention over columns of a 2D input."""
+
+ def __init__(
+ self,
+ embed_dim,
+ num_heads,
+ dropout=0.0,
+ max_tokens_per_msa: int = 2 ** 16,
+ ):
+ super().__init__()
+
+ self.num_heads = num_heads
+ self.dropout = dropout
+ self.head_dim = embed_dim // num_heads
+ self.scaling = self.head_dim ** -0.5
+ self.max_tokens_per_msa = max_tokens_per_msa
+
+ self.k_proj = nn.Linear(embed_dim, embed_dim)
+ self.v_proj = nn.Linear(embed_dim, embed_dim)
+ self.q_proj = nn.Linear(embed_dim, embed_dim)
+
+ self.out_proj = nn.Linear(embed_dim, embed_dim)
+ self.dropout_module = nn.Dropout(dropout)
+
+ def _batched_forward(
+ self,
+ x,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ max_cols = max(1, self.max_tokens_per_msa // num_rows)
+ outputs = []
+ attns = []
+ for start in range(0, num_cols, max_cols):
+ output, attn = self(
+ x[:, start : start + max_cols],
+ self_attn_mask=self_attn_mask,
+ self_attn_padding_mask=self_attn_padding_mask[:, :, start : start + max_cols]
+ if self_attn_padding_mask is not None
+ else None,
+ )
+ outputs.append(output)
+ attns.append(attn)
+ output = torch.cat(outputs, 1)
+ attns = torch.cat(attns, 1)
+ return output, attns
+
+ def compute_attention_update(
+ self,
+ x,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ if num_rows == 1:
+ # if there is only 1 position, this is equivalent and doesn't break with padding
+ attn_probs = torch.ones(
+ self.num_heads,
+ num_cols,
+ batch_size,
+ num_rows,
+ num_rows,
+ device=x.device,
+ dtype=x.dtype,
+ )
+ output = self.out_proj(self.v_proj(x))
+ else:
+ q = self.q_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+ k = self.k_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+ v = self.v_proj(x).view(num_rows, num_cols, batch_size, self.num_heads, self.head_dim)
+ q *= self.scaling
+
+ attn_weights = torch.einsum("icnhd,jcnhd->hcnij", q, k)
+
+ if self_attn_mask is not None:
+ raise NotImplementedError
+ if self_attn_padding_mask is not None:
+ attn_weights = attn_weights.masked_fill(
+ self_attn_padding_mask.permute(2, 0, 1).unsqueeze(0).unsqueeze(3),
+ -10000,
+ )
+
+ attn_probs = attn_weights.softmax(-1)
+ attn_probs = self.dropout_module(attn_probs)
+ context = torch.einsum("hcnij,jcnhd->icnhd", attn_probs, v)
+ context = context.contiguous().view(num_rows, num_cols, batch_size, embed_dim)
+ output = self.out_proj(context)
+ return output, attn_probs
+
+ def forward(
+ self,
+ x,
+ self_attn_mask=None,
+ self_attn_padding_mask=None,
+ ):
+ num_rows, num_cols, batch_size, embed_dim = x.size()
+ # if False and num_rows * num_cols > 2 ** 14 and not torch.is_grad_enabled():
+ if (num_rows * num_cols) > self.max_tokens_per_msa and not torch.is_grad_enabled():
+ return self._batched_forward(
+ x,
+ self_attn_mask,
+ self_attn_padding_mask,
+ )
+ else:
+ return self.compute_attention_update(x, self_attn_mask, self_attn_padding_mask)
diff --git a/proteingym/baselines/esm/esm/constants.py b/proteingym/baselines/esm/esm/constants.py
new file mode 100644
index 0000000..fc9abb1
--- /dev/null
+++ b/proteingym/baselines/esm/esm/constants.py
@@ -0,0 +1,10 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+# fmt: off
+proteinseq_toks = {
+ 'toks': ['L', 'A', 'G', 'V', 'S', 'E', 'R', 'T', 'I', 'D', 'P', 'K', 'Q', 'N', 'F', 'Y', 'M', 'H', 'W', 'C', 'X', 'B', 'U', 'Z', 'O', '.', '-']
+}
+# fmt: on
diff --git a/proteingym/baselines/esm/esm/data.py b/proteingym/baselines/esm/esm/data.py
new file mode 100644
index 0000000..8bff147
--- /dev/null
+++ b/proteingym/baselines/esm/esm/data.py
@@ -0,0 +1,493 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import itertools
+import os
+from typing import Sequence, Tuple, List, Union
+import pickle
+import re
+import shutil
+import torch
+from pathlib import Path
+from .constants import proteinseq_toks
+
+RawMSA = Sequence[Tuple[str, str]]
+
+
+class FastaBatchedDataset(object):
+ def __init__(self, sequence_labels, sequence_strs):
+ self.sequence_labels = list(sequence_labels)
+ self.sequence_strs = list(sequence_strs)
+
+ @classmethod
+ def from_file(cls, fasta_file):
+ sequence_labels, sequence_strs = [], []
+ cur_seq_label = None
+ buf = []
+
+ def _flush_current_seq():
+ nonlocal cur_seq_label, buf
+ if cur_seq_label is None:
+ return
+ sequence_labels.append(cur_seq_label)
+ sequence_strs.append("".join(buf))
+ cur_seq_label = None
+ buf = []
+
+ with open(fasta_file, "r") as infile:
+ for line_idx, line in enumerate(infile):
+ if line.startswith(">"): # label line
+ _flush_current_seq()
+ line = line[1:].strip()
+ if len(line) > 0:
+ cur_seq_label = line
+ else:
+ cur_seq_label = f"seqnum{line_idx:09d}"
+ else: # sequence line
+ buf.append(line.strip())
+
+ _flush_current_seq()
+
+ assert len(set(sequence_labels)) == len(
+ sequence_labels
+ ), "Found duplicate sequence labels"
+
+ return cls(sequence_labels, sequence_strs)
+
+ def __len__(self):
+ return len(self.sequence_labels)
+
+ def __getitem__(self, idx):
+ return self.sequence_labels[idx], self.sequence_strs[idx]
+
+ def get_batch_indices(self, toks_per_batch, extra_toks_per_seq=0):
+ sizes = [(len(s), i) for i, s in enumerate(self.sequence_strs)]
+ sizes.sort()
+ batches = []
+ buf = []
+ max_len = 0
+
+ def _flush_current_buf():
+ nonlocal max_len, buf
+ if len(buf) == 0:
+ return
+ batches.append(buf)
+ buf = []
+ max_len = 0
+
+ for sz, i in sizes:
+ sz += extra_toks_per_seq
+ if max(sz, max_len) * (len(buf) + 1) > toks_per_batch:
+ _flush_current_buf()
+ max_len = max(max_len, sz)
+ buf.append(i)
+
+ _flush_current_buf()
+ return batches
+
+
+class Alphabet(object):
+ def __init__(
+ self,
+ standard_toks: Sequence[str],
+ prepend_toks: Sequence[str] = ("", "", "", ""),
+ append_toks: Sequence[str] = ("", "", ""),
+ prepend_bos: bool = True,
+ append_eos: bool = False,
+ use_msa: bool = False,
+ ):
+ self.standard_toks = list(standard_toks)
+ self.prepend_toks = list(prepend_toks)
+ self.append_toks = list(append_toks)
+ self.prepend_bos = prepend_bos
+ self.append_eos = append_eos
+ self.use_msa = use_msa
+
+ self.all_toks = list(self.prepend_toks)
+ self.all_toks.extend(self.standard_toks)
+ for i in range((8 - (len(self.all_toks) % 8)) % 8):
+ self.all_toks.append(f"")
+ self.all_toks.extend(self.append_toks)
+
+ self.tok_to_idx = {tok: i for i, tok in enumerate(self.all_toks)}
+
+ self.unk_idx = self.tok_to_idx[""]
+ self.padding_idx = self.get_idx("")
+ self.cls_idx = self.get_idx("")
+ self.mask_idx = self.get_idx("")
+ self.eos_idx = self.get_idx("")
+ self.all_special_tokens = ['', '', '', '', '']
+ self.unique_no_split_tokens = self.all_toks
+
+ def __len__(self):
+ return len(self.all_toks)
+
+ def get_idx(self, tok):
+ return self.tok_to_idx.get(tok, self.unk_idx)
+
+ def get_tok(self, ind):
+ return self.all_toks[ind]
+
+ def to_dict(self):
+ return self.tok_to_idx.copy()
+
+ def get_batch_converter(self, truncation_seq_length: int = None):
+ if self.use_msa:
+ return MSABatchConverter(self, truncation_seq_length)
+ else:
+ return BatchConverter(self, truncation_seq_length)
+
+ @classmethod
+ def from_architecture(cls, name: str) -> "Alphabet":
+ if name in ("ESM-1", "protein_bert_base"):
+ standard_toks = proteinseq_toks["toks"]
+ prepend_toks: Tuple[str, ...] = ("", "", "", "")
+ append_toks: Tuple[str, ...] = ("", "", "")
+ prepend_bos = True
+ append_eos = False
+ use_msa = False
+ elif name in ("ESM-1b", "roberta_large"):
+ standard_toks = proteinseq_toks["toks"]
+ prepend_toks = ("", "", "", "")
+ append_toks = ("",)
+ prepend_bos = True
+ append_eos = True
+ use_msa = False
+ elif name in ("MSA Transformer", "msa_transformer"):
+ standard_toks = proteinseq_toks["toks"]
+ prepend_toks = ("", "", "", "")
+ append_toks = ("",)
+ prepend_bos = True
+ append_eos = False
+ use_msa = True
+ elif "invariant_gvp" in name.lower():
+ standard_toks = proteinseq_toks["toks"]
+ prepend_toks = ("", "", "", "")
+ append_toks = ("", "", "")
+ prepend_bos = True
+ append_eos = False
+ use_msa = False
+ else:
+ raise ValueError("Unknown architecture selected")
+ return cls(standard_toks, prepend_toks, append_toks, prepend_bos, append_eos, use_msa)
+
+ def _tokenize(self, text) -> str:
+ return text.split()
+
+ def tokenize(self, text, **kwargs) -> List[str]:
+ """
+ Inspired by https://github.com/huggingface/transformers/blob/master/src/transformers/tokenization_utils.py
+ Converts a string in a sequence of tokens, using the tokenizer.
+
+ Args:
+ text (:obj:`str`):
+ The sequence to be encoded.
+
+ Returns:
+ :obj:`List[str]`: The list of tokens.
+ """
+
+ def split_on_token(tok, text):
+ result = []
+ split_text = text.split(tok)
+ for i, sub_text in enumerate(split_text):
+ # AddedToken can control whitespace stripping around them.
+ # We use them for GPT2 and Roberta to have different behavior depending on the special token
+ # Cf. https://github.com/huggingface/transformers/pull/2778
+ # and https://github.com/huggingface/transformers/issues/3788
+ # We strip left and right by default
+ if i < len(split_text) - 1:
+ sub_text = sub_text.rstrip()
+ if i > 0:
+ sub_text = sub_text.lstrip()
+
+ if i == 0 and not sub_text:
+ result.append(tok)
+ elif i == len(split_text) - 1:
+ if sub_text:
+ result.append(sub_text)
+ else:
+ pass
+ else:
+ if sub_text:
+ result.append(sub_text)
+ result.append(tok)
+ return result
+
+ def split_on_tokens(tok_list, text):
+ if not text.strip():
+ return []
+
+ tokenized_text = []
+ text_list = [text]
+ for tok in tok_list:
+ tokenized_text = []
+ for sub_text in text_list:
+ if sub_text not in self.unique_no_split_tokens:
+ tokenized_text.extend(split_on_token(tok, sub_text))
+ else:
+ tokenized_text.append(sub_text)
+ text_list = tokenized_text
+
+ return list(
+ itertools.chain.from_iterable(
+ (
+ self._tokenize(token)
+ if token not in self.unique_no_split_tokens
+ else [token]
+ for token in tokenized_text
+ )
+ )
+ )
+
+ no_split_token = self.unique_no_split_tokens
+ tokenized_text = split_on_tokens(no_split_token, text)
+ return tokenized_text
+
+ def encode(self, text):
+ return [self.tok_to_idx[tok] for tok in self.tokenize(text)]
+
+
+class BatchConverter(object):
+ """Callable to convert an unprocessed (labels + strings) batch to a
+ processed (labels + tensor) batch.
+ """
+
+ def __init__(self, alphabet, truncation_seq_length: int = None):
+ self.alphabet = alphabet
+ self.truncation_seq_length = truncation_seq_length
+
+ def __call__(self, raw_batch: Sequence[Tuple[str, str]]):
+ # RoBERTa uses an eos token, while ESM-1 does not.
+ batch_size = len(raw_batch)
+ batch_labels, seq_str_list = zip(*raw_batch)
+ seq_encoded_list = [self.alphabet.encode(seq_str) for seq_str in seq_str_list]
+ if self.truncation_seq_length:
+ seq_encoded_list = [seq_str[:self.truncation_seq_length] for seq_str in seq_encoded_list]
+ max_len = max(len(seq_encoded) for seq_encoded in seq_encoded_list)
+ tokens = torch.empty(
+ (
+ batch_size,
+ max_len + int(self.alphabet.prepend_bos) + int(self.alphabet.append_eos),
+ ),
+ dtype=torch.int64,
+ )
+ tokens.fill_(self.alphabet.padding_idx)
+ labels = []
+ strs = []
+
+ for i, (label, seq_str, seq_encoded) in enumerate(
+ zip(batch_labels, seq_str_list, seq_encoded_list)
+ ):
+ labels.append(label)
+ strs.append(seq_str)
+ if self.alphabet.prepend_bos:
+ tokens[i, 0] = self.alphabet.cls_idx
+ seq = torch.tensor(seq_encoded, dtype=torch.int64)
+ tokens[
+ i,
+ int(self.alphabet.prepend_bos) : len(seq_encoded)
+ + int(self.alphabet.prepend_bos),
+ ] = seq
+ if self.alphabet.append_eos:
+ tokens[i, len(seq_encoded) + int(self.alphabet.prepend_bos)] = self.alphabet.eos_idx
+
+ return labels, strs, tokens
+
+
+class MSABatchConverter(BatchConverter):
+ def __call__(self, inputs: Union[Sequence[RawMSA], RawMSA]):
+ if isinstance(inputs[0][0], str):
+ # Input is a single MSA
+ raw_batch: Sequence[RawMSA] = [inputs] # type: ignore
+ else:
+ raw_batch = inputs # type: ignore
+
+ batch_size = len(raw_batch)
+ max_alignments = max(len(msa) for msa in raw_batch)
+ max_seqlen = max(len(msa[0][1]) for msa in raw_batch)
+
+ tokens = torch.empty(
+ (
+ batch_size,
+ max_alignments,
+ max_seqlen + int(self.alphabet.prepend_bos) + int(self.alphabet.append_eos),
+ ),
+ dtype=torch.int64,
+ )
+ tokens.fill_(self.alphabet.padding_idx)
+ labels = []
+ strs = []
+
+ for i, msa in enumerate(raw_batch):
+ msa_seqlens = set(len(seq) for _, seq in msa)
+ if not len(msa_seqlens) == 1:
+ raise RuntimeError(
+ "Received unaligned sequences for input to MSA, all sequence "
+ "lengths must be equal."
+ )
+ msa_labels, msa_strs, msa_tokens = super().__call__(msa)
+ labels.append(msa_labels)
+ strs.append(msa_strs)
+ tokens[i, : msa_tokens.size(0), : msa_tokens.size(1)] = msa_tokens
+
+ return labels, strs, tokens
+
+
+def read_fasta(
+ path,
+ keep_gaps=True,
+ keep_insertions=True,
+ to_upper=False,
+):
+ with open(path, "r") as f:
+ for result in read_alignment_lines(
+ f, keep_gaps=keep_gaps, keep_insertions=keep_insertions, to_upper=to_upper
+ ):
+ yield result
+
+
+def read_alignment_lines(
+ lines,
+ keep_gaps=True,
+ keep_insertions=True,
+ to_upper=False,
+):
+ seq = desc = None
+
+ def parse(s):
+ if not keep_gaps:
+ s = re.sub("-", "", s)
+ if not keep_insertions:
+ s = re.sub("[a-z]", "", s)
+ return s.upper() if to_upper else s
+
+ for line in lines:
+ # Line may be empty if seq % file_line_width == 0
+ if len(line) > 0 and line[0] == ">":
+ if seq is not None:
+ yield desc, parse(seq)
+ desc = line.strip().lstrip(">")
+ seq = ""
+ else:
+ assert isinstance(seq, str)
+ seq += line.strip()
+ assert isinstance(seq, str) and isinstance(desc, str)
+ yield desc, parse(seq)
+
+
+class ESMStructuralSplitDataset(torch.utils.data.Dataset):
+ """
+ Structural Split Dataset as described in section A.10 of the supplement of our paper.
+ https://doi.org/10.1101/622803
+
+ We use the full version of SCOPe 2.07, clustered at 90% sequence identity,
+ generated on January 23, 2020.
+
+ For each SCOPe domain:
+ - We extract the sequence from the corresponding PDB file
+ - We extract the 3D coordinates of the Carbon beta atoms, aligning them
+ to the sequence. We put NaN where Cb atoms are missing.
+ - From the 3D coordinates, we calculate a pairwise distance map, based
+ on L2 distance
+ - We use DSSP to generate secondary structure labels for the corresponding
+ PDB file. This is also aligned to the sequence. We put - where SSP
+ labels are missing.
+
+ For each SCOPe classification level of family/superfamily/fold (in order of difficulty),
+ we have split the data into 5 partitions for cross validation. These are provided
+ in a downloaded splits folder, in the format:
+ splits/{split_level}/{cv_partition}/{train|valid}.txt
+ where train is the partition and valid is the concatentation of the remaining 4.
+
+ For each SCOPe domain, we provide a pkl dump that contains:
+ - seq : The domain sequence, stored as an L-length string
+ - ssp : The secondary structure labels, stored as an L-length string
+ - dist : The distance map, stored as an LxL numpy array
+ - coords : The 3D coordinates, stored as an Lx3 numpy array
+
+ """
+
+ base_folder = "structural-data"
+ file_list = [
+ # url tar filename filename MD5 Hash
+ (
+ "https://dl.fbaipublicfiles.com/fair-esm/structural-data/splits.tar.gz",
+ "splits.tar.gz",
+ "splits",
+ "456fe1c7f22c9d3d8dfe9735da52411d",
+ ),
+ (
+ "https://dl.fbaipublicfiles.com/fair-esm/structural-data/pkl.tar.gz",
+ "pkl.tar.gz",
+ "pkl",
+ "644ea91e56066c750cd50101d390f5db",
+ ),
+ ]
+
+ def __init__(
+ self,
+ split_level,
+ cv_partition,
+ split,
+ root_path=os.path.expanduser("~/.cache/torch/data/esm"),
+ download=False,
+ ):
+ super().__init__()
+ assert split in [
+ "train",
+ "valid",
+ ], "train_valid must be 'train' or 'valid'"
+ self.root_path = root_path
+ self.base_path = os.path.join(self.root_path, self.base_folder)
+
+ # check if root path has what you need or else download it
+ if download:
+ self.download()
+
+ self.split_file = os.path.join(
+ self.base_path, "splits", split_level, cv_partition, f"{split}.txt"
+ )
+ self.pkl_dir = os.path.join(self.base_path, "pkl")
+ self.names = []
+ with open(self.split_file) as f:
+ self.names = f.read().splitlines()
+
+ def __len__(self):
+ return len(self.names)
+
+ def _check_exists(self) -> bool:
+ for (_, _, filename, _) in self.file_list:
+ fpath = os.path.join(self.base_path, filename)
+ if not os.path.exists(fpath) or not os.path.isdir(fpath):
+ return False
+ return True
+
+ def download(self):
+
+ if self._check_exists():
+ print("Files already downloaded and verified")
+ return
+
+ from torchvision.datasets.utils import download_url
+
+ for url, tar_filename, filename, md5_hash in self.file_list:
+ download_path = os.path.join(self.base_path, tar_filename)
+ download_url(url=url, root=self.base_path, filename=tar_filename, md5=md5_hash)
+ shutil.unpack_archive(download_path, self.base_path)
+
+ def __getitem__(self, idx):
+ """
+ Returns a dict with the following entires
+ - seq : Str (domain sequence)
+ - ssp : Str (SSP labels)
+ - dist : np.array (distance map)
+ - coords : np.array (3D coordinates)
+ """
+ name = self.names[idx]
+ pkl_fname = os.path.join(self.pkl_dir, name[1:3], f"{name}.pkl")
+ with open(pkl_fname, "rb") as f:
+ obj = pickle.load(f)
+ return obj
diff --git a/proteingym/baselines/esm/esm/inverse_folding/__init__.py b/proteingym/baselines/esm/esm/inverse_folding/__init__.py
new file mode 100644
index 0000000..2906fc5
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/__init__.py
@@ -0,0 +1,8 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from . import gvp_transformer
+from . import util
+from . import multichain_util
diff --git a/proteingym/baselines/esm/esm/inverse_folding/features.py b/proteingym/baselines/esm/esm/inverse_folding/features.py
new file mode 100644
index 0000000..f67f3fe
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/features.py
@@ -0,0 +1,352 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+#
+# Portions of this file were adapted from the open source code for the following
+# two papers:
+#
+# Ingraham, J., Garg, V., Barzilay, R., & Jaakkola, T. (2019). Generative
+# models for graph-based protein design. Advances in Neural Information
+# Processing Systems, 32.
+#
+# Jing, B., Eismann, S., Suriana, P., Townshend, R. J. L., & Dror, R. (2020).
+# Learning from Protein Structure with Geometric Vector Perceptrons. In
+# International Conference on Learning Representations.
+#
+# MIT License
+#
+# Copyright (c) 2020 Bowen Jing, Stephan Eismann, Patricia Suriana, Raphael Townshend, Ron Dror
+#
+# Permission is hereby granted, free of charge, to any person obtaining a copy
+# of this software and associated documentation files (the "Software"), to deal
+# in the Software without restriction, including without limitation the rights
+# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+# copies of the Software, and to permit persons to whom the Software is
+# furnished to do so, subject to the following conditions:
+#
+# The above copyright notice and this permission notice shall be included in all
+# copies or substantial portions of the Software.
+#
+# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+# SOFTWARE.
+#
+# ================================================================
+# The below license applies to the portions of the code (parts of
+# src/datasets.py and src/models.py) adapted from Ingraham, et al.
+# ================================================================
+#
+# MIT License
+#
+# Copyright (c) 2019 John Ingraham, Vikas Garg, Regina Barzilay, Tommi Jaakkola
+#
+# Permission is hereby granted, free of charge, to any person obtaining a copy
+# of this software and associated documentation files (the "Software"), to deal
+# in the Software without restriction, including without limitation the rights
+# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+# copies of the Software, and to permit persons to whom the Software is
+# furnished to do so, subject to the following conditions:
+#
+# The above copyright notice and this permission notice shall be included in all
+# copies or substantial portions of the Software.
+#
+# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+# SOFTWARE.
+
+import math
+import numpy as np
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+from .gvp_utils import flatten_graph
+from .gvp_modules import GVP, LayerNorm
+from .util import normalize, norm, nan_to_num, rbf
+
+
+class GVPInputFeaturizer(nn.Module):
+
+ @staticmethod
+ def get_node_features(coords, coord_mask, with_coord_mask=True):
+ # scalar features
+ node_scalar_features = GVPInputFeaturizer._dihedrals(coords)
+ if with_coord_mask:
+ node_scalar_features = torch.cat([
+ node_scalar_features,
+ coord_mask.float().unsqueeze(-1)
+ ], dim=-1)
+ # vector features
+ X_ca = coords[:, :, 1]
+ orientations = GVPInputFeaturizer._orientations(X_ca)
+ sidechains = GVPInputFeaturizer._sidechains(coords)
+ node_vector_features = torch.cat([orientations, sidechains.unsqueeze(-2)], dim=-2)
+ return node_scalar_features, node_vector_features
+
+ @staticmethod
+ def _orientations(X):
+ forward = normalize(X[:, 1:] - X[:, :-1])
+ backward = normalize(X[:, :-1] - X[:, 1:])
+ forward = F.pad(forward, [0, 0, 0, 1])
+ backward = F.pad(backward, [0, 0, 1, 0])
+ return torch.cat([forward.unsqueeze(-2), backward.unsqueeze(-2)], -2)
+
+ @staticmethod
+ def _sidechains(X):
+ n, origin, c = X[:, :, 0], X[:, :, 1], X[:, :, 2]
+ c, n = normalize(c - origin), normalize(n - origin)
+ bisector = normalize(c + n)
+ perp = normalize(torch.cross(c, n, dim=-1))
+ vec = -bisector * math.sqrt(1 / 3) - perp * math.sqrt(2 / 3)
+ return vec
+
+ @staticmethod
+ def _dihedrals(X, eps=1e-7):
+ X = torch.flatten(X[:, :, :3], 1, 2)
+ bsz = X.shape[0]
+ dX = X[:, 1:] - X[:, :-1]
+ U = normalize(dX, dim=-1)
+ u_2 = U[:, :-2]
+ u_1 = U[:, 1:-1]
+ u_0 = U[:, 2:]
+
+ # Backbone normals
+ n_2 = normalize(torch.cross(u_2, u_1, dim=-1), dim=-1)
+ n_1 = normalize(torch.cross(u_1, u_0, dim=-1), dim=-1)
+
+ # Angle between normals
+ cosD = torch.sum(n_2 * n_1, -1)
+ cosD = torch.clamp(cosD, -1 + eps, 1 - eps)
+ D = torch.sign(torch.sum(u_2 * n_1, -1)) * torch.acos(cosD)
+
+ # This scheme will remove phi[0], psi[-1], omega[-1]
+ D = F.pad(D, [1, 2])
+ D = torch.reshape(D, [bsz, -1, 3])
+ # Lift angle representations to the circle
+ D_features = torch.cat([torch.cos(D), torch.sin(D)], -1)
+ return D_features
+
+ @staticmethod
+ def _positional_embeddings(edge_index,
+ num_embeddings=None,
+ num_positional_embeddings=16,
+ period_range=[2, 1000]):
+ # From https://github.com/jingraham/neurips19-graph-protein-design
+ num_embeddings = num_embeddings or num_positional_embeddings
+ d = edge_index[0] - edge_index[1]
+
+ frequency = torch.exp(
+ torch.arange(0, num_embeddings, 2, dtype=torch.float32,
+ device=edge_index.device)
+ * -(np.log(10000.0) / num_embeddings)
+ )
+ angles = d.unsqueeze(-1) * frequency
+ E = torch.cat((torch.cos(angles), torch.sin(angles)), -1)
+ return E
+
+ @staticmethod
+ def _dist(X, coord_mask, padding_mask, top_k_neighbors, eps=1e-8):
+ """ Pairwise euclidean distances """
+ bsz, maxlen = X.size(0), X.size(1)
+ coord_mask_2D = torch.unsqueeze(coord_mask,1) * torch.unsqueeze(coord_mask,2)
+ residue_mask = ~padding_mask
+ residue_mask_2D = torch.unsqueeze(residue_mask,1) * torch.unsqueeze(residue_mask,2)
+ dX = torch.unsqueeze(X,1) - torch.unsqueeze(X,2)
+ D = coord_mask_2D * norm(dX, dim=-1)
+
+ # sorting preference: first those with coords, then among the residues that
+ # exist but are masked use distance in sequence as tie breaker, and then the
+ # residues that came from padding are last
+ seqpos = torch.arange(maxlen, device=X.device)
+ Dseq = torch.abs(seqpos.unsqueeze(1) - seqpos.unsqueeze(0)).repeat(bsz, 1, 1)
+ D_adjust = nan_to_num(D) + (~coord_mask_2D) * (1e8 + Dseq*1e6) + (
+ ~residue_mask_2D) * (1e10)
+
+ if top_k_neighbors == -1:
+ D_neighbors = D_adjust
+ E_idx = seqpos.repeat(
+ *D_neighbors.shape[:-1], 1)
+ else:
+ # Identify k nearest neighbors (including self)
+ k = min(top_k_neighbors, X.size(1))
+ D_neighbors, E_idx = torch.topk(D_adjust, k, dim=-1, largest=False)
+
+ coord_mask_neighbors = (D_neighbors < 5e7)
+ residue_mask_neighbors = (D_neighbors < 5e9)
+ return D_neighbors, E_idx, coord_mask_neighbors, residue_mask_neighbors
+
+
+class Normalize(nn.Module):
+ def __init__(self, features, epsilon=1e-6):
+ super(Normalize, self).__init__()
+ self.gain = nn.Parameter(torch.ones(features))
+ self.bias = nn.Parameter(torch.zeros(features))
+ self.epsilon = epsilon
+
+ def forward(self, x, dim=-1):
+ mu = x.mean(dim, keepdim=True)
+ sigma = torch.sqrt(x.var(dim, keepdim=True) + self.epsilon)
+ gain = self.gain
+ bias = self.bias
+ # Reshape
+ if dim != -1:
+ shape = [1] * len(mu.size())
+ shape[dim] = self.gain.size()[0]
+ gain = gain.view(shape)
+ bias = bias.view(shape)
+ return gain * (x - mu) / (sigma + self.epsilon) + bias
+
+
+class DihedralFeatures(nn.Module):
+ def __init__(self, node_embed_dim):
+ """ Embed dihedral angle features. """
+ super(DihedralFeatures, self).__init__()
+ # 3 dihedral angles; sin and cos of each angle
+ node_in = 6
+ # Normalization and embedding
+ self.node_embedding = nn.Linear(node_in, node_embed_dim, bias=True)
+ self.norm_nodes = Normalize(node_embed_dim)
+
+ def forward(self, X):
+ """ Featurize coordinates as an attributed graph """
+ V = self._dihedrals(X)
+ V = self.node_embedding(V)
+ V = self.norm_nodes(V)
+ return V
+
+ @staticmethod
+ def _dihedrals(X, eps=1e-7, return_angles=False):
+ # First 3 coordinates are N, CA, C
+ X = X[:,:,:3,:].reshape(X.shape[0], 3*X.shape[1], 3)
+
+ # Shifted slices of unit vectors
+ dX = X[:,1:,:] - X[:,:-1,:]
+ U = F.normalize(dX, dim=-1)
+ u_2 = U[:,:-2,:]
+ u_1 = U[:,1:-1,:]
+ u_0 = U[:,2:,:]
+ # Backbone normals
+ n_2 = F.normalize(torch.cross(u_2, u_1, dim=-1), dim=-1)
+ n_1 = F.normalize(torch.cross(u_1, u_0, dim=-1), dim=-1)
+
+ # Angle between normals
+ cosD = (n_2 * n_1).sum(-1)
+ cosD = torch.clamp(cosD, -1+eps, 1-eps)
+ D = torch.sign((u_2 * n_1).sum(-1)) * torch.acos(cosD)
+
+ # This scheme will remove phi[0], psi[-1], omega[-1]
+ D = F.pad(D, (1,2), 'constant', 0)
+ D = D.view((D.size(0), int(D.size(1)/3), 3))
+ phi, psi, omega = torch.unbind(D,-1)
+
+ if return_angles:
+ return phi, psi, omega
+
+ # Lift angle representations to the circle
+ D_features = torch.cat((torch.cos(D), torch.sin(D)), 2)
+ return D_features
+
+
+class GVPGraphEmbedding(GVPInputFeaturizer):
+
+ def __init__(self, args):
+ super().__init__()
+ self.top_k_neighbors = args.top_k_neighbors
+ self.num_positional_embeddings = 16
+ self.remove_edges_without_coords = True
+ node_input_dim = (7, 3)
+ edge_input_dim = (34, 1)
+ node_hidden_dim = (args.node_hidden_dim_scalar,
+ args.node_hidden_dim_vector)
+ edge_hidden_dim = (args.edge_hidden_dim_scalar,
+ args.edge_hidden_dim_vector)
+ self.embed_node = nn.Sequential(
+ GVP(node_input_dim, node_hidden_dim, activations=(None, None)),
+ LayerNorm(node_hidden_dim, eps=1e-4)
+ )
+ self.embed_edge = nn.Sequential(
+ GVP(edge_input_dim, edge_hidden_dim, activations=(None, None)),
+ LayerNorm(edge_hidden_dim, eps=1e-4)
+ )
+ self.embed_confidence = nn.Linear(16, args.node_hidden_dim_scalar)
+
+ def forward(self, coords, coord_mask, padding_mask, confidence):
+ with torch.no_grad():
+ node_features = self.get_node_features(coords, coord_mask)
+ edge_features, edge_index = self.get_edge_features(
+ coords, coord_mask, padding_mask)
+ node_embeddings_scalar, node_embeddings_vector = self.embed_node(node_features)
+ edge_embeddings = self.embed_edge(edge_features)
+
+ rbf_rep = rbf(confidence, 0., 1.)
+ node_embeddings = (
+ node_embeddings_scalar + self.embed_confidence(rbf_rep),
+ node_embeddings_vector
+ )
+
+ node_embeddings, edge_embeddings, edge_index = flatten_graph(
+ node_embeddings, edge_embeddings, edge_index)
+ return node_embeddings, edge_embeddings, edge_index
+
+ def get_edge_features(self, coords, coord_mask, padding_mask):
+ X_ca = coords[:, :, 1]
+ # Get distances to the top k neighbors
+ E_dist, E_idx, E_coord_mask, E_residue_mask = GVPInputFeaturizer._dist(
+ X_ca, coord_mask, padding_mask, self.top_k_neighbors)
+ # Flatten the graph to be batch size 1 for torch_geometric package
+ dest = E_idx
+ B, L, k = E_idx.shape[:3]
+ src = torch.arange(L, device=E_idx.device).view([1, L, 1]).expand(B, L, k)
+ # After flattening, [2, B, E]
+ edge_index = torch.stack([src, dest], dim=0).flatten(2, 3)
+ # After flattening, [B, E]
+ E_dist = E_dist.flatten(1, 2)
+ E_coord_mask = E_coord_mask.flatten(1, 2).unsqueeze(-1)
+ E_residue_mask = E_residue_mask.flatten(1, 2)
+ # Calculate relative positional embeddings and distance RBF
+ pos_embeddings = GVPInputFeaturizer._positional_embeddings(
+ edge_index,
+ num_positional_embeddings=self.num_positional_embeddings,
+ )
+ D_rbf = rbf(E_dist, 0., 20.)
+ # Calculate relative orientation
+ X_src = X_ca.unsqueeze(2).expand(-1, -1, k, -1).flatten(1, 2)
+ X_dest = torch.gather(
+ X_ca,
+ 1,
+ edge_index[1, :, :].unsqueeze(-1).expand([B, L*k, 3])
+ )
+ coord_mask_src = coord_mask.unsqueeze(2).expand(-1, -1, k).flatten(1, 2)
+ coord_mask_dest = torch.gather(
+ coord_mask,
+ 1,
+ edge_index[1, :, :].expand([B, L*k])
+ )
+ E_vectors = X_src - X_dest
+ # For the ones without coordinates, substitute in the average vector
+ E_vector_mean = torch.sum(E_vectors * E_coord_mask, dim=1,
+ keepdims=True) / torch.sum(E_coord_mask, dim=1, keepdims=True)
+ E_vectors = E_vectors * E_coord_mask + E_vector_mean * ~(E_coord_mask)
+ # Normalize and remove nans
+ edge_s = torch.cat([D_rbf, pos_embeddings], dim=-1)
+ edge_v = normalize(E_vectors).unsqueeze(-2)
+ edge_s, edge_v = map(nan_to_num, (edge_s, edge_v))
+ # Also add indications of whether the coordinates are present
+ edge_s = torch.cat([
+ edge_s,
+ (~coord_mask_src).float().unsqueeze(-1),
+ (~coord_mask_dest).float().unsqueeze(-1),
+ ], dim=-1)
+ edge_index[:, ~E_residue_mask] = -1
+ if self.remove_edges_without_coords:
+ edge_index[:, ~E_coord_mask.squeeze(-1)] = -1
+ return (edge_s, edge_v), edge_index.transpose(0, 1)
diff --git a/proteingym/baselines/esm/esm/inverse_folding/gvp_encoder.py b/proteingym/baselines/esm/esm/inverse_folding/gvp_encoder.py
new file mode 100644
index 0000000..a565201
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/gvp_encoder.py
@@ -0,0 +1,56 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from argparse import Namespace
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+from .features import GVPGraphEmbedding
+from .gvp_modules import GVPConvLayer, LayerNorm
+from .gvp_utils import unflatten_graph
+
+
+
+class GVPEncoder(nn.Module):
+
+ def __init__(self, args):
+ super().__init__()
+ self.args = args
+ self.embed_graph = GVPGraphEmbedding(args)
+
+ node_hidden_dim = (args.node_hidden_dim_scalar,
+ args.node_hidden_dim_vector)
+ edge_hidden_dim = (args.edge_hidden_dim_scalar,
+ args.edge_hidden_dim_vector)
+
+ conv_activations = (F.relu, torch.sigmoid)
+ self.encoder_layers = nn.ModuleList(
+ GVPConvLayer(
+ node_hidden_dim,
+ edge_hidden_dim,
+ drop_rate=args.dropout,
+ vector_gate=True,
+ attention_heads=0,
+ n_message=3,
+ conv_activations=conv_activations,
+ n_edge_gvps=0,
+ eps=1e-4,
+ layernorm=True,
+ )
+ for i in range(args.num_encoder_layers)
+ )
+
+ def forward(self, coords, coord_mask, padding_mask, confidence):
+ node_embeddings, edge_embeddings, edge_index = self.embed_graph(
+ coords, coord_mask, padding_mask, confidence)
+
+ for i, layer in enumerate(self.encoder_layers):
+ node_embeddings, edge_embeddings = layer(node_embeddings,
+ edge_index, edge_embeddings)
+
+ node_embeddings = unflatten_graph(node_embeddings, coords.shape[0])
+ return node_embeddings
diff --git a/proteingym/baselines/esm/esm/inverse_folding/gvp_modules.py b/proteingym/baselines/esm/esm/inverse_folding/gvp_modules.py
new file mode 100644
index 0000000..c60cbd3
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/gvp_modules.py
@@ -0,0 +1,475 @@
+# Contents of this file are from the open source code for
+#
+# Jing, B., Eismann, S., Suriana, P., Townshend, R. J. L., & Dror, R. (2020).
+# Learning from Protein Structure with Geometric Vector Perceptrons. In
+# International Conference on Learning Representations.
+#
+# MIT License
+#
+# Copyright (c) 2020 Bowen Jing, Stephan Eismann, Patricia Suriana, Raphael Townshend, Ron Dror
+#
+# Permission is hereby granted, free of charge, to any person obtaining a copy
+# of this software and associated documentation files (the "Software"), to deal
+# in the Software without restriction, including without limitation the rights
+# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+# copies of the Software, and to permit persons to whom the Software is
+# furnished to do so, subject to the following conditions:
+#
+# The above copyright notice and this permission notice shall be included in all
+# copies or substantial portions of the Software.
+#
+# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+# SOFTWARE.
+
+import typing as T
+import torch
+from torch import nn
+import torch.nn.functional as F
+from torch_geometric.nn import MessagePassing
+
+def tuple_size(tp):
+ return tuple([0 if a is None else a.size() for a in tp])
+
+def tuple_sum(tp1, tp2):
+ s1, v1 = tp1
+ s2, v2 = tp2
+ if v2 is None and v2 is None:
+ return (s1 + s2, None)
+ return (s1 + s2, v1 + v2)
+
+def tuple_cat(*args, dim=-1):
+ '''
+ Concatenates any number of tuples (s, V) elementwise.
+
+ :param dim: dimension along which to concatenate when viewed
+ as the `dim` index for the scalar-channel tensors.
+ This means that `dim=-1` will be applied as
+ `dim=-2` for the vector-channel tensors.
+ '''
+ dim %= len(args[0][0].shape)
+ s_args, v_args = list(zip(*args))
+ return torch.cat(s_args, dim=dim), torch.cat(v_args, dim=dim)
+
+def tuple_index(x, idx):
+ '''
+ Indexes into a tuple (s, V) along the first dimension.
+
+ :param idx: any object which can be used to index into a `torch.Tensor`
+ '''
+ return x[0][idx], x[1][idx]
+
+def randn(n, dims, device="cpu"):
+ '''
+ Returns random tuples (s, V) drawn elementwise from a normal distribution.
+
+ :param n: number of data points
+ :param dims: tuple of dimensions (n_scalar, n_vector)
+
+ :return: (s, V) with s.shape = (n, n_scalar) and
+ V.shape = (n, n_vector, 3)
+ '''
+ return torch.randn(n, dims[0], device=device), \
+ torch.randn(n, dims[1], 3, device=device)
+
+def _norm_no_nan(x, axis=-1, keepdims=False, eps=1e-8, sqrt=True):
+ '''
+ L2 norm of tensor clamped above a minimum value `eps`.
+
+ :param sqrt: if `False`, returns the square of the L2 norm
+ '''
+ # clamp is slow
+ # out = torch.clamp(torch.sum(torch.square(x), axis, keepdims), min=eps)
+ out = torch.sum(torch.square(x), axis, keepdims) + eps
+ return torch.sqrt(out) if sqrt else out
+
+def _split(x, nv):
+ '''
+ Splits a merged representation of (s, V) back into a tuple.
+ Should be used only with `_merge(s, V)` and only if the tuple
+ representation cannot be used.
+
+ :param x: the `torch.Tensor` returned from `_merge`
+ :param nv: the number of vector channels in the input to `_merge`
+ '''
+ v = torch.reshape(x[..., -3*nv:], x.shape[:-1] + (nv, 3))
+ s = x[..., :-3*nv]
+ return s, v
+
+def _merge(s, v):
+ '''
+ Merges a tuple (s, V) into a single `torch.Tensor`, where the
+ vector channels are flattened and appended to the scalar channels.
+ Should be used only if the tuple representation cannot be used.
+ Use `_split(x, nv)` to reverse.
+ '''
+ v = torch.reshape(v, v.shape[:-2] + (3*v.shape[-2],))
+ return torch.cat([s, v], -1)
+
+class GVP(nn.Module):
+ '''
+ Geometric Vector Perceptron. See manuscript and README.md
+ for more details.
+
+ :param in_dims: tuple (n_scalar, n_vector)
+ :param out_dims: tuple (n_scalar, n_vector)
+ :param h_dim: intermediate number of vector channels, optional
+ :param activations: tuple of functions (scalar_act, vector_act)
+ :param tuple_io: whether to keep accepting tuple inputs and outputs when vi
+ or vo = 0
+ '''
+ def __init__(self, in_dims, out_dims, h_dim=None, vector_gate=False,
+ activations=(F.relu, torch.sigmoid), tuple_io=True,
+ eps=1e-8):
+ super(GVP, self).__init__()
+ self.si, self.vi = in_dims
+ self.so, self.vo = out_dims
+ self.tuple_io = tuple_io
+ if self.vi:
+ self.h_dim = h_dim or max(self.vi, self.vo)
+ self.wh = nn.Linear(self.vi, self.h_dim, bias=False)
+ self.ws = nn.Linear(self.h_dim + self.si, self.so)
+ if self.vo:
+ self.wv = nn.Linear(self.h_dim, self.vo, bias=False)
+ if vector_gate:
+ self.wg = nn.Linear(self.so, self.vo)
+ else:
+ self.ws = nn.Linear(self.si, self.so)
+
+ self.vector_gate = vector_gate
+ self.scalar_act, self.vector_act = activations
+ self.eps = eps
+
+ def forward(self, x):
+ '''
+ :param x: tuple (s, V) of `torch.Tensor`,
+ or (if vectors_in is 0), a single `torch.Tensor`
+ :return: tuple (s, V) of `torch.Tensor`,
+ or (if vectors_out is 0), a single `torch.Tensor`
+ '''
+ if self.vi:
+ s, v = x
+ v = torch.transpose(v, -1, -2)
+ vh = self.wh(v)
+ vn = _norm_no_nan(vh, axis=-2, eps=self.eps)
+ s = self.ws(torch.cat([s, vn], -1))
+ if self.scalar_act:
+ s = self.scalar_act(s)
+ if self.vo:
+ v = self.wv(vh)
+ v = torch.transpose(v, -1, -2)
+ if self.vector_gate:
+ g = self.wg(s).unsqueeze(-1)
+ else:
+ g = _norm_no_nan(v, axis=-1, keepdims=True, eps=self.eps)
+ if self.vector_act:
+ g = self.vector_act(g)
+ v = v * g
+ else:
+ if self.tuple_io:
+ assert x[1] is None
+ x = x[0]
+ s = self.ws(x)
+ if self.scalar_act:
+ s = self.scalar_act(s)
+ if self.vo:
+ v = torch.zeros(list(s.shape)[:-1] + [self.vo, 3],
+ device=s.device)
+
+ if self.vo:
+ return (s, v)
+ elif self.tuple_io:
+ return (s, None)
+ else:
+ return s
+
+
+class _VDropout(nn.Module):
+ '''
+ Vector channel dropout where the elements of each
+ vector channel are dropped together.
+ '''
+ def __init__(self, drop_rate):
+ super(_VDropout, self).__init__()
+ self.drop_rate = drop_rate
+
+ def forward(self, x):
+ '''
+ :param x: `torch.Tensor` corresponding to vector channels
+ '''
+ if x is None:
+ return None
+ device = x.device
+ if not self.training:
+ return x
+ mask = torch.bernoulli(
+ (1 - self.drop_rate) * torch.ones(x.shape[:-1], device=device)
+ ).unsqueeze(-1)
+ x = mask * x / (1 - self.drop_rate)
+ return x
+
+class Dropout(nn.Module):
+ '''
+ Combined dropout for tuples (s, V).
+ Takes tuples (s, V) as input and as output.
+ '''
+ def __init__(self, drop_rate):
+ super(Dropout, self).__init__()
+ self.sdropout = nn.Dropout(drop_rate)
+ self.vdropout = _VDropout(drop_rate)
+
+ def forward(self, x):
+ '''
+ :param x: tuple (s, V) of `torch.Tensor`,
+ or single `torch.Tensor`
+ (will be assumed to be scalar channels)
+ '''
+ if type(x) is torch.Tensor:
+ return self.sdropout(x)
+ s, v = x
+ return self.sdropout(s), self.vdropout(v)
+
+class LayerNorm(nn.Module):
+ '''
+ Combined LayerNorm for tuples (s, V).
+ Takes tuples (s, V) as input and as output.
+ '''
+ def __init__(self, dims, tuple_io=True, eps=1e-8):
+ super(LayerNorm, self).__init__()
+ self.tuple_io = tuple_io
+ self.s, self.v = dims
+ self.scalar_norm = nn.LayerNorm(self.s)
+ self.eps = eps
+
+ def forward(self, x):
+ '''
+ :param x: tuple (s, V) of `torch.Tensor`,
+ or single `torch.Tensor`
+ (will be assumed to be scalar channels)
+ '''
+ if not self.v:
+ if self.tuple_io:
+ return self.scalar_norm(x[0]), None
+ return self.scalar_norm(x)
+ s, v = x
+ vn = _norm_no_nan(v, axis=-1, keepdims=True, sqrt=False, eps=self.eps)
+ nonzero_mask = (vn > 2 * self.eps)
+ vn = torch.sum(vn * nonzero_mask, dim=-2, keepdim=True
+ ) / (self.eps + torch.sum(nonzero_mask, dim=-2, keepdim=True))
+ vn = torch.sqrt(vn + self.eps)
+ v = nonzero_mask * (v / vn)
+ return self.scalar_norm(s), v
+
+class GVPConv(MessagePassing):
+ '''
+ Graph convolution / message passing with Geometric Vector Perceptrons.
+ Takes in a graph with node and edge embeddings,
+ and returns new node embeddings.
+
+ This does NOT do residual updates and pointwise feedforward layers
+ ---see `GVPConvLayer`.
+
+ :param in_dims: input node embedding dimensions (n_scalar, n_vector)
+ :param out_dims: output node embedding dimensions (n_scalar, n_vector)
+ :param edge_dims: input edge embedding dimensions (n_scalar, n_vector)
+ :param n_layers: number of GVPs in the message function
+ :param module_list: preconstructed message function, overrides n_layers
+ :param aggr: should be "add" if some incoming edges are masked, as in
+ a masked autoregressive decoder architecture
+ '''
+ def __init__(self, in_dims, out_dims, edge_dims, n_layers=3,
+ vector_gate=False, module_list=None, aggr="mean", eps=1e-8,
+ activations=(F.relu, torch.sigmoid)):
+ super(GVPConv, self).__init__(aggr=aggr)
+ self.eps = eps
+ self.si, self.vi = in_dims
+ self.so, self.vo = out_dims
+ self.se, self.ve = edge_dims
+
+ module_list = module_list or []
+ if not module_list:
+ if n_layers == 1:
+ module_list.append(
+ GVP((2*self.si + self.se, 2*self.vi + self.ve),
+ (self.so, self.vo), activations=(None, None)))
+ else:
+ module_list.append(
+ GVP((2*self.si + self.se, 2*self.vi + self.ve), out_dims,
+ vector_gate=vector_gate, activations=activations)
+ )
+ for i in range(n_layers - 2):
+ module_list.append(GVP(out_dims, out_dims,
+ vector_gate=vector_gate))
+ module_list.append(GVP(out_dims, out_dims,
+ activations=(None, None)))
+ self.message_func = nn.Sequential(*module_list)
+
+ def forward(self, x, edge_index, edge_attr):
+ '''
+ :param x: tuple (s, V) of `torch.Tensor`
+ :param edge_index: array of shape [2, n_edges]
+ :param edge_attr: tuple (s, V) of `torch.Tensor`
+ '''
+ x_s, x_v = x
+ message = self.propagate(edge_index,
+ s=x_s, v=x_v.reshape(x_v.shape[0], 3*x_v.shape[1]),
+ edge_attr=edge_attr)
+ return _split(message, self.vo)
+
+ def message(self, s_i, v_i, s_j, v_j, edge_attr):
+ v_j = v_j.view(v_j.shape[0], v_j.shape[1]//3, 3)
+ v_i = v_i.view(v_i.shape[0], v_i.shape[1]//3, 3)
+ message = tuple_cat((s_j, v_j), edge_attr, (s_i, v_i))
+ message = self.message_func(message)
+ return _merge(*message)
+
+
+class GVPConvLayer(nn.Module):
+ '''
+ Full graph convolution / message passing layer with
+ Geometric Vector Perceptrons. Residually updates node embeddings with
+ aggregated incoming messages, applies a pointwise feedforward
+ network to node embeddings, and returns updated node embeddings.
+
+ To only compute the aggregated messages, see `GVPConv`.
+
+ :param node_dims: node embedding dimensions (n_scalar, n_vector)
+ :param edge_dims: input edge embedding dimensions (n_scalar, n_vector)
+ :param n_message: number of GVPs to use in message function
+ :param n_feedforward: number of GVPs to use in feedforward function
+ :param drop_rate: drop probability in all dropout layers
+ :param autoregressive: if `True`, this `GVPConvLayer` will be used
+ with a different set of input node embeddings for messages
+ where src >= dst
+ '''
+ def __init__(self, node_dims, edge_dims, vector_gate=False,
+ n_message=3, n_feedforward=2, drop_rate=.1,
+ autoregressive=False, attention_heads=0,
+ conv_activations=(F.relu, torch.sigmoid),
+ n_edge_gvps=0, layernorm=True, eps=1e-8):
+
+ super(GVPConvLayer, self).__init__()
+ if attention_heads == 0:
+ self.conv = GVPConv(
+ node_dims, node_dims, edge_dims, n_layers=n_message,
+ vector_gate=vector_gate,
+ aggr="add" if autoregressive else "mean",
+ activations=conv_activations,
+ eps=eps,
+ )
+ else:
+ raise NotImplementedError
+ if layernorm:
+ self.norm = nn.ModuleList([LayerNorm(node_dims, eps=eps) for _ in range(2)])
+ else:
+ self.norm = nn.ModuleList([nn.Identity() for _ in range(2)])
+ self.dropout = nn.ModuleList([Dropout(drop_rate) for _ in range(2)])
+
+ ff_func = []
+ if n_feedforward == 1:
+ ff_func.append(GVP(node_dims, node_dims, activations=(None, None)))
+ else:
+ hid_dims = 4*node_dims[0], 2*node_dims[1]
+ ff_func.append(GVP(node_dims, hid_dims, vector_gate=vector_gate))
+ for i in range(n_feedforward-2):
+ ff_func.append(GVP(hid_dims, hid_dims, vector_gate=vector_gate))
+ ff_func.append(GVP(hid_dims, node_dims, activations=(None, None)))
+ self.ff_func = nn.Sequential(*ff_func)
+
+ self.edge_message_func = None
+ if n_edge_gvps > 0:
+ si, vi = node_dims
+ se, ve = edge_dims
+ module_list = [
+ GVP((2*si + se, 2*vi + ve), edge_dims, vector_gate=vector_gate)
+ ]
+ for i in range(n_edge_gvps - 2):
+ module_list.append(GVP(edge_dims, edge_dims,
+ vector_gate=vector_gate))
+ if n_edge_gvps > 1:
+ module_list.append(GVP(edge_dims, edge_dims,
+ activations=(None, None)))
+ self.edge_message_func = nn.Sequential(*module_list)
+ if layernorm:
+ self.edge_norm = LayerNorm(edge_dims, eps=eps)
+ else:
+ self.edge_norm = nn.Identity()
+ self.edge_dropout = Dropout(drop_rate)
+
+ def forward(self, x, edge_index, edge_attr,
+ autoregressive_x=None, node_mask=None):
+ '''
+ :param x: tuple (s, V) of `torch.Tensor`
+ :param edge_index: array of shape [2, n_edges]
+ :param edge_attr: tuple (s, V) of `torch.Tensor`
+ :param autoregressive_x: tuple (s, V) of `torch.Tensor`.
+ If not `None`, will be used as srcqq node embeddings
+ for forming messages where src >= dst. The corrent node
+ embeddings `x` will still be the base of the update and the
+ pointwise feedforward.
+ :param node_mask: array of type `bool` to index into the first
+ dim of node embeddings (s, V). If not `None`, only
+ these nodes will be updated.
+ '''
+ if self.edge_message_func:
+ src, dst = edge_index
+ if autoregressive_x is None:
+ x_src = x[0][src], x[1][src]
+ else:
+ mask = (src < dst).unsqueeze(-1)
+ x_src = (
+ torch.where(mask, x[0][src], autoregressive_x[0][src]),
+ torch.where(mask.unsqueeze(-1), x[1][src],
+ autoregressive_x[1][src])
+ )
+ x_dst = x[0][dst], x[1][dst]
+ x_edge = (
+ torch.cat([x_src[0], edge_attr[0], x_dst[0]], dim=-1),
+ torch.cat([x_src[1], edge_attr[1], x_dst[1]], dim=-2)
+ )
+ edge_attr_dh = self.edge_message_func(x_edge)
+ edge_attr = self.edge_norm(tuple_sum(edge_attr,
+ self.edge_dropout(edge_attr_dh)))
+
+ if autoregressive_x is not None:
+ # Guarding this import here to remove the dependency on torch_scatter, since this isn't used
+ # in ESM-IF1
+ from torch_scatter import scatter_add
+ src, dst = edge_index
+ mask = src < dst
+ edge_index_forward = edge_index[:, mask]
+ edge_index_backward = edge_index[:, ~mask]
+ edge_attr_forward = tuple_index(edge_attr, mask)
+ edge_attr_backward = tuple_index(edge_attr, ~mask)
+
+ dh = tuple_sum(
+ self.conv(x, edge_index_forward, edge_attr_forward),
+ self.conv(autoregressive_x, edge_index_backward, edge_attr_backward)
+ )
+
+ count = scatter_add(torch.ones_like(dst), dst,
+ dim_size=dh[0].size(0)).clamp(min=1).unsqueeze(-1)
+
+ dh = dh[0] / count, dh[1] / count.unsqueeze(-1)
+
+ else:
+ dh = self.conv(x, edge_index, edge_attr)
+
+ if node_mask is not None:
+ x_ = x
+ x, dh = tuple_index(x, node_mask), tuple_index(dh, node_mask)
+
+ x = self.norm[0](tuple_sum(x, self.dropout[0](dh)))
+
+ dh = self.ff_func(x)
+ x = self.norm[1](tuple_sum(x, self.dropout[1](dh)))
+
+ if node_mask is not None:
+ x_[0][node_mask], x_[1][node_mask] = x[0], x[1]
+ x = x_
+
+ return x, edge_attr
diff --git a/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer.py b/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer.py
new file mode 100644
index 0000000..fabd22f
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer.py
@@ -0,0 +1,140 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import argparse
+from typing import Any, Dict, List, Optional, Tuple, NamedTuple
+import torch
+from torch import nn
+from torch import Tensor
+import torch.nn.functional as F
+from scipy.spatial import transform
+
+from esm.data import Alphabet
+
+from .features import DihedralFeatures
+from .gvp_encoder import GVPEncoder
+from .gvp_utils import unflatten_graph
+from .gvp_transformer_encoder import GVPTransformerEncoder
+from .transformer_decoder import TransformerDecoder
+from .util import rotate, CoordBatchConverter
+
+
+class GVPTransformerModel(nn.Module):
+ """
+ GVP-Transformer inverse folding model.
+
+ Architecture: Geometric GVP-GNN as initial layers, followed by
+ sequence-to-sequence Transformer encoder and decoder.
+ """
+
+ def __init__(self, args, alphabet):
+ super().__init__()
+ encoder_embed_tokens = self.build_embedding(
+ args, alphabet, args.encoder_embed_dim,
+ )
+ decoder_embed_tokens = self.build_embedding(
+ args, alphabet, args.decoder_embed_dim,
+ )
+ encoder = self.build_encoder(args, alphabet, encoder_embed_tokens)
+ decoder = self.build_decoder(args, alphabet, decoder_embed_tokens)
+ self.args = args
+ self.encoder = encoder
+ self.decoder = decoder
+
+ @classmethod
+ def build_encoder(cls, args, src_dict, embed_tokens):
+ encoder = GVPTransformerEncoder(args, src_dict, embed_tokens)
+ return encoder
+
+ @classmethod
+ def build_decoder(cls, args, tgt_dict, embed_tokens):
+ decoder = TransformerDecoder(
+ args,
+ tgt_dict,
+ embed_tokens,
+ )
+ return decoder
+
+ @classmethod
+ def build_embedding(cls, args, dictionary, embed_dim):
+ num_embeddings = len(dictionary)
+ padding_idx = dictionary.padding_idx
+ emb = nn.Embedding(num_embeddings, embed_dim, padding_idx)
+ nn.init.normal_(emb.weight, mean=0, std=embed_dim ** -0.5)
+ nn.init.constant_(emb.weight[padding_idx], 0)
+ return emb
+
+ def forward(
+ self,
+ coords,
+ padding_mask,
+ confidence,
+ prev_output_tokens,
+ return_all_hiddens: bool = False,
+ features_only: bool = False,
+ ):
+ encoder_out = self.encoder(coords, padding_mask, confidence,
+ return_all_hiddens=return_all_hiddens)
+ logits, extra = self.decoder(
+ prev_output_tokens,
+ encoder_out=encoder_out,
+ features_only=features_only,
+ return_all_hiddens=return_all_hiddens,
+ )
+ return logits, extra
+
+ def sample(self, coords, partial_seq=None, temperature=1.0, confidence=None, device=None):
+ """
+ Samples sequences based on multinomial sampling (no beam search).
+
+ Args:
+ coords: L x 3 x 3 list representing one backbone
+ partial_seq: Optional, partial sequence with mask tokens if part of
+ the sequence is known
+ temperature: sampling temperature, use low temperature for higher
+ sequence recovery and high temperature for higher diversity
+ confidence: optional length L list of confidence scores for coordinates
+ """
+ L = len(coords)
+ # Convert to batch format
+ batch_converter = CoordBatchConverter(self.decoder.dictionary)
+ batch_coords, confidence, _, _, padding_mask = (
+ batch_converter([(coords, confidence, None)], device=device)
+ )
+
+ # Start with prepend token
+ mask_idx = self.decoder.dictionary.get_idx('')
+ sampled_tokens = torch.full((1, 1+L), mask_idx, dtype=int)
+ sampled_tokens[0, 0] = self.decoder.dictionary.get_idx('')
+ if partial_seq is not None:
+ for i, c in enumerate(partial_seq):
+ sampled_tokens[0, i+1] = self.decoder.dictionary.get_idx(c)
+
+ # Save incremental states for faster sampling
+ incremental_state = dict()
+
+ # Run encoder only once
+ encoder_out = self.encoder(batch_coords, padding_mask, confidence)
+
+ # Make sure all tensors are on the same device if a GPU is present
+ if device:
+ sampled_tokens = sampled_tokens.to(device)
+
+ # Decode one token at a time
+ for i in range(1, L+1):
+ logits, _ = self.decoder(
+ sampled_tokens[:, :i],
+ encoder_out,
+ incremental_state=incremental_state,
+ )
+ logits = logits[0].transpose(0, 1)
+ logits /= temperature
+ probs = F.softmax(logits, dim=-1)
+ if sampled_tokens[0, i] == mask_idx:
+ sampled_tokens[:, i] = torch.multinomial(probs, 1).squeeze(-1)
+ sampled_seq = sampled_tokens[0, 1:]
+
+ # Convert back to string via lookup
+ return ''.join([self.decoder.dictionary.get_tok(a) for a in sampled_seq])
diff --git a/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer_encoder.py b/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer_encoder.py
new file mode 100644
index 0000000..b613347
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/gvp_transformer_encoder.py
@@ -0,0 +1,184 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# Contents of this file were adapted from the open source fairseq repository.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import argparse
+import math
+from typing import Dict, List, Optional
+
+import torch
+import torch.nn as nn
+from torch import Tensor
+
+from esm.modules import SinusoidalPositionalEmbedding
+from .features import GVPInputFeaturizer, DihedralFeatures
+from .gvp_encoder import GVPEncoder
+from .transformer_layer import TransformerEncoderLayer
+from .util import nan_to_num, get_rotation_frames, rotate, rbf
+
+
+class GVPTransformerEncoder(nn.Module):
+ """
+ Transformer encoder consisting of *args.encoder.layers* layers. Each layer
+ is a :class:`TransformerEncoderLayer`.
+
+ Args:
+ args (argparse.Namespace): parsed command-line arguments
+ dictionary (~fairseq.data.Dictionary): encoding dictionary
+ embed_tokens (torch.nn.Embedding): input embedding
+ """
+
+ def __init__(self, args, dictionary, embed_tokens):
+ super().__init__()
+ self.args = args
+ self.dictionary = dictionary
+
+ self.dropout_module = nn.Dropout(args.dropout)
+
+ embed_dim = embed_tokens.embedding_dim
+ self.padding_idx = embed_tokens.padding_idx
+
+ self.embed_tokens = embed_tokens
+ self.embed_scale = math.sqrt(embed_dim)
+ self.embed_positions = SinusoidalPositionalEmbedding(
+ embed_dim,
+ self.padding_idx,
+ )
+ self.embed_gvp_input_features = nn.Linear(15, embed_dim)
+ self.embed_confidence = nn.Linear(16, embed_dim)
+ self.embed_dihedrals = DihedralFeatures(embed_dim)
+
+ gvp_args = argparse.Namespace()
+ for k, v in vars(args).items():
+ if k.startswith("gvp_"):
+ setattr(gvp_args, k[4:], v)
+ self.gvp_encoder = GVPEncoder(gvp_args)
+ gvp_out_dim = gvp_args.node_hidden_dim_scalar + (3 *
+ gvp_args.node_hidden_dim_vector)
+ self.embed_gvp_output = nn.Linear(gvp_out_dim, embed_dim)
+
+ self.layers = nn.ModuleList([])
+ self.layers.extend(
+ [self.build_encoder_layer(args) for i in range(args.encoder_layers)]
+ )
+ self.num_layers = len(self.layers)
+ self.layer_norm = nn.LayerNorm(embed_dim)
+
+ def build_encoder_layer(self, args):
+ return TransformerEncoderLayer(args)
+
+ def forward_embedding(self, coords, padding_mask, confidence):
+ """
+ Args:
+ coords: N, CA, C backbone coordinates in shape length x 3 (atoms) x 3
+ padding_mask: boolean Tensor (true for padding) of shape length
+ confidence: confidence scores between 0 and 1 of shape length
+ """
+ components = dict()
+ coord_mask = torch.all(torch.all(torch.isfinite(coords), dim=-1), dim=-1)
+ coords = nan_to_num(coords)
+ mask_tokens = (
+ padding_mask * self.dictionary.padding_idx +
+ ~padding_mask * self.dictionary.get_idx("")
+ )
+ components["tokens"] = self.embed_tokens(mask_tokens) * self.embed_scale
+ components["diherals"] = self.embed_dihedrals(coords)
+
+ # GVP encoder
+ gvp_out_scalars, gvp_out_vectors = self.gvp_encoder(coords,
+ coord_mask, padding_mask, confidence)
+ R = get_rotation_frames(coords)
+ # Rotate to local rotation frame for rotation-invariance
+ gvp_out_features = torch.cat([
+ gvp_out_scalars,
+ rotate(gvp_out_vectors, R.transpose(-2, -1)).flatten(-2, -1),
+ ], dim=-1)
+ components["gvp_out"] = self.embed_gvp_output(gvp_out_features)
+
+ components["confidence"] = self.embed_confidence(
+ rbf(confidence, 0., 1.))
+
+ # In addition to GVP encoder outputs, also directly embed GVP input node
+ # features to the Transformer
+ scalar_features, vector_features = GVPInputFeaturizer.get_node_features(
+ coords, coord_mask, with_coord_mask=False)
+ features = torch.cat([
+ scalar_features,
+ rotate(vector_features, R.transpose(-2, -1)).flatten(-2, -1),
+ ], dim=-1)
+ components["gvp_input_features"] = self.embed_gvp_input_features(features)
+
+ embed = sum(components.values())
+ # for k, v in components.items():
+ # print(k, torch.mean(v, dim=(0,1)), torch.std(v, dim=(0,1)))
+
+ x = embed
+ x = x + self.embed_positions(mask_tokens)
+ x = self.dropout_module(x)
+ return x, components
+
+ def forward(
+ self,
+ coords,
+ encoder_padding_mask,
+ confidence,
+ return_all_hiddens: bool = False,
+ ):
+ """
+ Args:
+ coords (Tensor): backbone coordinates
+ shape batch_size x num_residues x num_atoms (3 for N, CA, C) x 3
+ encoder_padding_mask (ByteTensor): the positions of
+ padding elements of shape `(batch_size x num_residues)`
+ confidence (Tensor): the confidence score of shape (batch_size x
+ num_residues). The value is between 0. and 1. for each residue
+ coordinate, or -1. if no coordinate is given
+ return_all_hiddens (bool, optional): also return all of the
+ intermediate hidden states (default: False).
+
+ Returns:
+ dict:
+ - **encoder_out** (Tensor): the last encoder layer's output of
+ shape `(num_residues, batch_size, embed_dim)`
+ - **encoder_padding_mask** (ByteTensor): the positions of
+ padding elements of shape `(batch_size, num_residues)`
+ - **encoder_embedding** (Tensor): the (scaled) embedding lookup
+ of shape `(batch_size, num_residues, embed_dim)`
+ - **encoder_states** (List[Tensor]): all intermediate
+ hidden states of shape `(num_residues, batch_size, embed_dim)`.
+ Only populated if *return_all_hiddens* is True.
+ """
+ x, encoder_embedding = self.forward_embedding(coords,
+ encoder_padding_mask, confidence)
+ # account for padding while computing the representation
+ x = x * (1 - encoder_padding_mask.unsqueeze(-1).type_as(x))
+
+ # B x T x C -> T x B x C
+ x = x.transpose(0, 1)
+
+ encoder_states = []
+
+ if return_all_hiddens:
+ encoder_states.append(x)
+
+ # encoder layers
+ for layer in self.layers:
+ x = layer(
+ x, encoder_padding_mask=encoder_padding_mask
+ )
+ if return_all_hiddens:
+ assert encoder_states is not None
+ encoder_states.append(x)
+
+ if self.layer_norm is not None:
+ x = self.layer_norm(x)
+
+ return {
+ "encoder_out": [x], # T x B x C
+ "encoder_padding_mask": [encoder_padding_mask], # B x T
+ "encoder_embedding": [encoder_embedding], # dictionary
+ "encoder_states": encoder_states, # List[T x B x C]
+ }
diff --git a/proteingym/baselines/esm/esm/inverse_folding/gvp_utils.py b/proteingym/baselines/esm/esm/inverse_folding/gvp_utils.py
new file mode 100644
index 0000000..fc0617f
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/gvp_utils.py
@@ -0,0 +1,68 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import torch
+
+
+def flatten_graph(node_embeddings, edge_embeddings, edge_index):
+ """
+ Flattens the graph into a batch size one (with disconnected subgraphs for
+ each example) to be compatible with pytorch-geometric package.
+ Args:
+ node_embeddings: node embeddings in tuple form (scalar, vector)
+ - scalar: shape batch size x nodes x node_embed_dim
+ - vector: shape batch size x nodes x node_embed_dim x 3
+ edge_embeddings: edge embeddings of in tuple form (scalar, vector)
+ - scalar: shape batch size x edges x edge_embed_dim
+ - vector: shape batch size x edges x edge_embed_dim x 3
+ edge_index: shape batch_size x 2 (source node and target node) x edges
+ Returns:
+ node_embeddings: node embeddings in tuple form (scalar, vector)
+ - scalar: shape batch total_nodes x node_embed_dim
+ - vector: shape batch total_nodes x node_embed_dim x 3
+ edge_embeddings: edge embeddings of in tuple form (scalar, vector)
+ - scalar: shape batch total_edges x edge_embed_dim
+ - vector: shape batch total_edges x edge_embed_dim x 3
+ edge_index: shape 2 x total_edges
+ """
+ x_s, x_v = node_embeddings
+ e_s, e_v = edge_embeddings
+ batch_size, N = x_s.shape[0], x_s.shape[1]
+ node_embeddings = (torch.flatten(x_s, 0, 1), torch.flatten(x_v, 0, 1))
+ edge_embeddings = (torch.flatten(e_s, 0, 1), torch.flatten(e_v, 0, 1))
+
+ edge_mask = torch.any(edge_index != -1, dim=1)
+ # Re-number the nodes by adding batch_idx * N to each batch
+ edge_index = edge_index + (torch.arange(batch_size, device=edge_index.device) *
+ N).unsqueeze(-1).unsqueeze(-1)
+ edge_index = edge_index.permute(1, 0, 2).flatten(1, 2)
+ edge_mask = edge_mask.flatten()
+ edge_index = edge_index[:, edge_mask]
+ edge_embeddings = (
+ edge_embeddings[0][edge_mask, :],
+ edge_embeddings[1][edge_mask, :]
+ )
+ return node_embeddings, edge_embeddings, edge_index
+
+
+def unflatten_graph(node_embeddings, batch_size):
+ """
+ Unflattens node embeddings.
+ Args:
+ node_embeddings: node embeddings in tuple form (scalar, vector)
+ - scalar: shape batch total_nodes x node_embed_dim
+ - vector: shape batch total_nodes x node_embed_dim x 3
+ batch_size: int
+ Returns:
+ node_embeddings: node embeddings in tuple form (scalar, vector)
+ - scalar: shape batch size x nodes x node_embed_dim
+ - vector: shape batch size x nodes x node_embed_dim x 3
+ """
+ x_s, x_v = node_embeddings
+ x_s = x_s.reshape(batch_size, -1, x_s.shape[1])
+ x_v = x_v.reshape(batch_size, -1, x_v.shape[1], x_v.shape[2])
+ return (x_s, x_v)
+
+
diff --git a/proteingym/baselines/esm/esm/inverse_folding/multichain_util.py b/proteingym/baselines/esm/esm/inverse_folding/multichain_util.py
new file mode 100644
index 0000000..40c4b7b
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/multichain_util.py
@@ -0,0 +1,152 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import biotite.structure
+import numpy as np
+import torch
+from typing import Sequence, Tuple, List
+
+from esm.inverse_folding.util import (
+ load_structure,
+ extract_coords_from_structure,
+ load_coords,
+ get_sequence_loss,
+ get_encoder_output,
+)
+
+
+def extract_coords_from_complex(structure: biotite.structure.AtomArray):
+ """
+ Args:
+ structure: biotite AtomArray
+ Returns:
+ Tuple (coords_list, seq_list)
+ - coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ - seqs: Dictionary mapping chain ids to native sequences of each chain
+ """
+ coords = {}
+ seqs = {}
+ all_chains = biotite.structure.get_chains(structure)
+ for chain_id in all_chains:
+ chain = structure[structure.chain_id == chain_id]
+ coords[chain_id], seqs[chain_id] = extract_coords_from_structure(chain)
+ return coords, seqs
+
+
+def load_complex_coords(fpath, chains):
+ """
+ Args:
+ fpath: filepath to either pdb or cif file
+ chains: the chain ids (the order matters for autoregressive model)
+ Returns:
+ Tuple (coords_list, seq_list)
+ - coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ - seqs: Dictionary mapping chain ids to native sequences of each chain
+ """
+ structure = load_structure(fpath, chains)
+ return extract_coords_from_complex(structure)
+
+
+def _concatenate_coords(coords, target_chain_id, padding_length=10):
+ """
+ Args:
+ coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ target_chain_id: The chain id to sample sequences for
+ padding_length: Length of padding between concatenated chains
+ Returns:
+ Tuple (coords, seq)
+ - coords is an L x 3 x 3 array for N, CA, C coordinates, a
+ concatenation of the chains with padding in between
+ - seq is the extracted sequence, with padding tokens inserted
+ between the concatenated chains
+ """
+ pad_coords = np.full((padding_length, 3, 3), np.nan, dtype=np.float32)
+ # For best performance, put the target chain first in concatenation.
+ coords_list = [coords[target_chain_id]]
+ for chain_id in coords:
+ if chain_id == target_chain_id:
+ continue
+ coords_list.append(pad_coords)
+ coords_list.append(coords[chain_id])
+ coords_concatenated = np.concatenate(coords_list, axis=0)
+ return coords_concatenated
+
+
+def sample_sequence_in_complex(model, coords, target_chain_id, temperature=1.,
+ padding_length=10):
+ """
+ Samples sequence for one chain in a complex.
+ Args:
+ model: An instance of the GVPTransformer model
+ coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ target_chain_id: The chain id to sample sequences for
+ padding_length: padding length in between chains
+ Returns:
+ Sampled sequence for the target chain
+ """
+ target_chain_len = coords[target_chain_id].shape[0]
+ all_coords = _concatenate_coords(coords, target_chain_id)
+ device = next(model.parameters()).device
+
+ # Supply padding tokens for other chains to avoid unused sampling for speed
+ padding_pattern = [''] * all_coords.shape[0]
+ for i in range(target_chain_len):
+ padding_pattern[i] = ''
+ sampled = model.sample(all_coords, partial_seq=padding_pattern,
+ temperature=temperature, device=device)
+ sampled = sampled[:target_chain_len]
+ return sampled
+
+
+def score_sequence_in_complex(model, alphabet, coords, target_chain_id,
+ target_seq, padding_length=10):
+ """
+ Scores sequence for one chain in a complex.
+ Args:
+ model: An instance of the GVPTransformer model
+ alphabet: Alphabet for the model
+ coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ target_chain_id: The chain id to sample sequences for
+ target_seq: Target sequence for the target chain for scoring.
+ padding_length: padding length in between chains
+ Returns:
+ Tuple (ll_fullseq, ll_withcoord)
+ - ll_fullseq: Average log-likelihood over the full target chain
+ - ll_withcoord: Average log-likelihood in target chain excluding those
+ residues without coordinates
+ """
+ all_coords = _concatenate_coords(coords, target_chain_id)
+
+ loss, target_padding_mask = get_sequence_loss(model, alphabet, all_coords,
+ target_seq)
+ ll_fullseq = -np.sum(loss * ~target_padding_mask) / np.sum(
+ ~target_padding_mask)
+
+ # Also calculate average when excluding masked portions
+ coord_mask = np.all(np.isfinite(coords[target_chain_id]), axis=(-1, -2))
+ ll_withcoord = -np.sum(loss * coord_mask) / np.sum(coord_mask)
+ return ll_fullseq, ll_withcoord
+
+
+def get_encoder_output_for_complex(model, alphabet, coords, target_chain_id):
+ """
+ Args:
+ model: An instance of the GVPTransformer model
+ alphabet: Alphabet for the model
+ coords: Dictionary mapping chain ids to L x 3 x 3 array for N, CA, C
+ coordinates representing the backbone of each chain
+ target_chain_id: The chain id to sample sequences for
+ Returns:
+ Dictionary mapping chain id to encoder output for each chain
+ """
+ all_coords = _concatenate_coords(coords, target_chain_id)
+ all_rep = get_encoder_output(model, alphabet, all_coords)
+ target_chain_len = coords[target_chain_id].shape[0]
+ return all_rep[:target_chain_len]
diff --git a/proteingym/baselines/esm/esm/inverse_folding/transformer_decoder.py b/proteingym/baselines/esm/esm/inverse_folding/transformer_decoder.py
new file mode 100644
index 0000000..59ec6f3
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/transformer_decoder.py
@@ -0,0 +1,228 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# Contents of this file were adapted from the open source fairseq repository.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import math
+from typing import Any, Dict, List, Optional
+
+import torch
+import torch.nn as nn
+from torch import Tensor
+
+from esm.modules import SinusoidalPositionalEmbedding
+from .transformer_layer import TransformerDecoderLayer
+
+
+def fill_with_neg_inf(t):
+ """FP16-compatible function that fills a tensor with -inf."""
+ return t.float().fill_(float("-inf")).type_as(t)
+
+
+class TransformerDecoder(nn.Module):
+ """
+ Transformer decoder consisting of *args.decoder.layers* layers. Each layer
+ is a :class:`TransformerDecoderLayer`.
+
+ Args:
+ args (argparse.Namespace): parsed command-line arguments
+ dictionary (~fairseq.data.Dictionary): decoding dictionary
+ embed_tokens (torch.nn.Embedding): output embedding
+ no_encoder_attn (bool, optional): whether to attend to encoder outputs
+ (default: False).
+ """
+
+ def __init__(
+ self,
+ args,
+ dictionary,
+ embed_tokens,
+ ):
+ super().__init__()
+ self.args = args
+ self.dictionary = dictionary
+ self._future_mask = torch.empty(0)
+
+ self.dropout_module = nn.Dropout(args.dropout)
+
+ input_embed_dim = embed_tokens.embedding_dim
+ embed_dim = args.decoder_embed_dim
+ self.embed_dim = embed_dim
+
+ self.padding_idx = embed_tokens.padding_idx
+
+ self.embed_tokens = embed_tokens
+ self.embed_scale = math.sqrt(embed_dim)
+
+ self.project_in_dim = (
+ nn.Linear(input_embed_dim, embed_dim, bias=False)
+ if embed_dim != input_embed_dim
+ else None
+ )
+ self.embed_positions = SinusoidalPositionalEmbedding(
+ embed_dim,
+ self.padding_idx,
+ )
+
+ self.layers = nn.ModuleList([])
+ self.layers.extend(
+ [
+ self.build_decoder_layer(args)
+ for _ in range(args.decoder_layers)
+ ]
+ )
+ self.num_layers = len(self.layers)
+ self.layer_norm = nn.LayerNorm(embed_dim)
+
+ self.build_output_projection(args, dictionary)
+
+ def build_output_projection(self, args, dictionary):
+ self.output_projection = nn.Linear(
+ args.decoder_embed_dim, len(dictionary), bias=False
+ )
+ nn.init.normal_(
+ self.output_projection.weight, mean=0, std=args.decoder_embed_dim ** -0.5
+ )
+
+ def build_decoder_layer(self, args):
+ return TransformerDecoderLayer(args)
+
+ def forward(
+ self,
+ prev_output_tokens,
+ encoder_out: Optional[Dict[str, List[Tensor]]] = None,
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]] = None,
+ features_only: bool = False,
+ return_all_hiddens: bool = False,
+ ):
+ """
+ Args:
+ prev_output_tokens (LongTensor): previous decoder outputs of shape
+ `(batch, tgt_len)`, for teacher forcing
+ encoder_out (optional): output from the encoder, used for
+ encoder-side attention, should be of size T x B x C
+ incremental_state (dict): dictionary used for storing state during
+ :ref:`Incremental decoding`
+ features_only (bool, optional): only return features without
+ applying output layer (default: False).
+
+ Returns:
+ tuple:
+ - the decoder's output of shape `(batch, tgt_len, vocab)`
+ - a dictionary with any model-specific outputs
+ """
+
+ x, extra = self.extract_features(
+ prev_output_tokens,
+ encoder_out=encoder_out,
+ incremental_state=incremental_state,
+ )
+
+ if not features_only:
+ x = self.output_layer(x)
+ x = x.transpose(1, 2) # B x T x C -> B x C x T
+ return x, extra
+
+ def extract_features(
+ self,
+ prev_output_tokens,
+ encoder_out: Optional[Dict[str, List[Tensor]]],
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]] = None,
+ ):
+ """
+ Similar to *forward* but only return features.
+
+ Includes several features from "Jointly Learning to Align and
+ Translate with Transformer Models" (Garg et al., EMNLP 2019).
+
+ Returns:
+ tuple:
+ - the decoder's features of shape `(batch, tgt_len, embed_dim)`
+ - a dictionary with any model-specific outputs
+ """
+ bs, slen = prev_output_tokens.size()
+
+ enc: Optional[Tensor] = None
+ padding_mask: Optional[Tensor] = None
+ if encoder_out is not None and len(encoder_out["encoder_out"]) > 0:
+ enc = encoder_out["encoder_out"][0]
+ assert (
+ enc.size()[1] == bs
+ ), f"Expected enc.shape == (t, {bs}, c) got {enc.shape}"
+ if encoder_out is not None and len(encoder_out["encoder_padding_mask"]) > 0:
+ padding_mask = encoder_out["encoder_padding_mask"][0]
+
+ # embed positions
+ positions = self.embed_positions(
+ prev_output_tokens
+ )
+
+ if incremental_state is not None:
+ prev_output_tokens = prev_output_tokens[:, -1:]
+ positions = positions[:, -1:]
+
+ # embed tokens and positions
+ x = self.embed_scale * self.embed_tokens(prev_output_tokens)
+
+ if self.project_in_dim is not None:
+ x = self.project_in_dim(x)
+
+ x += positions
+
+ x = self.dropout_module(x)
+
+ # B x T x C -> T x B x C
+ x = x.transpose(0, 1)
+
+ self_attn_padding_mask: Optional[Tensor] = None
+ if prev_output_tokens.eq(self.padding_idx).any():
+ self_attn_padding_mask = prev_output_tokens.eq(self.padding_idx)
+
+ # decoder layers
+ attn: Optional[Tensor] = None
+ inner_states: List[Optional[Tensor]] = [x]
+ for idx, layer in enumerate(self.layers):
+ if incremental_state is None:
+ self_attn_mask = self.buffered_future_mask(x)
+ else:
+ self_attn_mask = None
+
+ x, layer_attn, _ = layer(
+ x,
+ enc,
+ padding_mask,
+ incremental_state,
+ self_attn_mask=self_attn_mask,
+ self_attn_padding_mask=self_attn_padding_mask,
+ need_attn=False,
+ need_head_weights=False,
+ )
+ inner_states.append(x)
+
+ if self.layer_norm is not None:
+ x = self.layer_norm(x)
+
+ # T x B x C -> B x C x T
+ x = x.transpose(0, 1)
+
+ return x, {"inner_states": inner_states}
+
+ def output_layer(self, features):
+ """Project features to the vocabulary size."""
+ return self.output_projection(features)
+
+ def buffered_future_mask(self, tensor):
+ dim = tensor.size(0)
+ # self._future_mask.device != tensor.device is not working in TorchScript. This is a workaround.
+ if (
+ self._future_mask.size(0) == 0
+ or (not self._future_mask.device == tensor.device)
+ or self._future_mask.size(0) < dim
+ ):
+ self._future_mask = torch.triu(
+ fill_with_neg_inf(torch.zeros([dim, dim])), 1
+ )
+ self._future_mask = self._future_mask.to(tensor)
+ return self._future_mask[:dim, :dim]
diff --git a/proteingym/baselines/esm/esm/inverse_folding/transformer_layer.py b/proteingym/baselines/esm/esm/inverse_folding/transformer_layer.py
new file mode 100644
index 0000000..87830a3
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/transformer_layer.py
@@ -0,0 +1,304 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# Contents of this file were adapted from the open source fairseq repository.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from typing import Dict, List, Optional
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+from esm.multihead_attention import MultiheadAttention
+from torch import Tensor
+
+
+class TransformerEncoderLayer(nn.Module):
+ """Encoder layer block.
+ `layernorm -> dropout -> add residual`
+
+ Args:
+ args (argparse.Namespace): parsed command-line arguments
+ """
+
+ def __init__(self, args):
+ super().__init__()
+ self.args = args
+ self.embed_dim = args.encoder_embed_dim
+ self.self_attn = self.build_self_attention(self.embed_dim, args)
+ self.self_attn_layer_norm = torch.nn.LayerNorm(self.embed_dim)
+ self.dropout_module = nn.Dropout(args.dropout)
+ self.activation_fn = F.relu
+ self.fc1 = self.build_fc1(
+ self.embed_dim,
+ args.encoder_ffn_embed_dim,
+ )
+ self.fc2 = self.build_fc2(
+ args.encoder_ffn_embed_dim,
+ self.embed_dim,
+ )
+
+ self.final_layer_norm = nn.LayerNorm(self.embed_dim)
+
+ def build_fc1(self, input_dim, output_dim):
+ return nn.Linear(input_dim, output_dim)
+
+ def build_fc2(self, input_dim, output_dim):
+ return nn.Linear(input_dim, output_dim)
+
+ def build_self_attention(self, embed_dim, args):
+ return MultiheadAttention(
+ embed_dim,
+ args.encoder_attention_heads,
+ dropout=args.attention_dropout,
+ self_attention=True,
+ )
+
+ def residual_connection(self, x, residual):
+ return residual + x
+
+ def forward(
+ self,
+ x,
+ encoder_padding_mask: Optional[Tensor],
+ attn_mask: Optional[Tensor] = None,
+ ):
+ """
+ Args:
+ x (Tensor): input to the layer of shape `(seq_len, batch, embed_dim)`
+ encoder_padding_mask (ByteTensor): binary ByteTensor of shape
+ `(batch, seq_len)` where padding elements are indicated by ``1``.
+ attn_mask (ByteTensor): binary tensor of shape `(tgt_len, src_len)`,
+ where `tgt_len` is the length of output and `src_len` is the
+ length of input, though here both are equal to `seq_len`.
+ `attn_mask[tgt_i, src_j] = 1` means that when calculating the
+ embedding for `tgt_i`, we exclude (mask out) `src_j`. This is
+ useful for strided self-attention.
+
+ Returns:
+ encoded output of shape `(seq_len, batch, embed_dim)`
+ """
+ # anything in original attn_mask = 1, becomes -1e8
+ # anything in original attn_mask = 0, becomes 0
+ # Note that we cannot use -inf here, because at some edge cases,
+ # the attention weight (before softmax) for some padded element in query
+ # will become -inf, which results in NaN in model parameters
+ if attn_mask is not None:
+ attn_mask = attn_mask.masked_fill(
+ attn_mask.to(torch.bool), -1e8 if x.dtype == torch.float32 else -1e4
+ )
+
+ residual = x
+ x = self.self_attn_layer_norm(x)
+ x, _ = self.self_attn(
+ query=x,
+ key=x,
+ value=x,
+ key_padding_mask=encoder_padding_mask,
+ need_weights=False,
+ attn_mask=attn_mask,
+ )
+ x = self.dropout_module(x)
+ x = self.residual_connection(x, residual)
+
+ residual = x
+ x = self.final_layer_norm(x)
+ x = self.activation_fn(self.fc1(x))
+ x = self.fc2(x)
+ x = self.dropout_module(x)
+ x = self.residual_connection(x, residual)
+ return x
+
+
+class TransformerDecoderLayer(nn.Module):
+ """Decoder layer block.
+ `layernorm -> dropout -> add residual`
+
+ Args:
+ args (argparse.Namespace): parsed command-line arguments
+ no_encoder_attn (bool, optional): whether to attend to encoder outputs
+ (default: False).
+ """
+
+ def __init__(
+ self, args, no_encoder_attn=False, add_bias_kv=False, add_zero_attn=False
+ ):
+ super().__init__()
+ self.embed_dim = args.decoder_embed_dim
+ self.dropout_module = nn.Dropout(args.dropout)
+
+ self.self_attn = self.build_self_attention(
+ self.embed_dim,
+ args,
+ add_bias_kv=add_bias_kv,
+ add_zero_attn=add_zero_attn,
+ )
+ self.nh = self.self_attn.num_heads
+ self.head_dim = self.self_attn.head_dim
+
+ self.activation_fn = F.relu
+
+ self.self_attn_layer_norm = nn.LayerNorm(self.embed_dim)
+
+ if no_encoder_attn:
+ self.encoder_attn = None
+ self.encoder_attn_layer_norm = None
+ else:
+ self.encoder_attn = self.build_encoder_attention(self.embed_dim, args)
+ self.encoder_attn_layer_norm = nn.LayerNorm(self.embed_dim)
+
+ self.ffn_layernorm = (
+ LayerNorm(args.decoder_ffn_embed_dim)
+ if getattr(args, "scale_fc", False)
+ else None
+ )
+ self.w_resid = (
+ nn.Parameter(
+ torch.ones(
+ self.embed_dim,
+ ),
+ requires_grad=True,
+ )
+ if getattr(args, "scale_resids", False)
+ else None
+ )
+
+ self.fc1 = self.build_fc1(
+ self.embed_dim,
+ args.decoder_ffn_embed_dim,
+ )
+ self.fc2 = self.build_fc2(
+ args.decoder_ffn_embed_dim,
+ self.embed_dim,
+ )
+
+ self.final_layer_norm = nn.LayerNorm(self.embed_dim)
+ self.need_attn = True
+
+ def build_fc1(self, input_dim, output_dim):
+ return nn.Linear(input_dim, output_dim)
+
+ def build_fc2(self, input_dim, output_dim):
+ return nn.Linear(input_dim, output_dim)
+
+ def build_self_attention(
+ self, embed_dim, args, add_bias_kv=False, add_zero_attn=False
+ ):
+ return MultiheadAttention(
+ embed_dim,
+ args.decoder_attention_heads,
+ dropout=args.attention_dropout,
+ add_bias_kv=add_bias_kv,
+ add_zero_attn=add_zero_attn,
+ self_attention=True,
+ )
+
+ def build_encoder_attention(self, embed_dim, args):
+ return MultiheadAttention(
+ embed_dim,
+ args.decoder_attention_heads,
+ kdim=args.encoder_embed_dim,
+ vdim=args.encoder_embed_dim,
+ dropout=args.attention_dropout,
+ encoder_decoder_attention=True,
+ )
+
+ def residual_connection(self, x, residual):
+ return residual + x
+
+ def forward(
+ self,
+ x,
+ encoder_out: Optional[torch.Tensor] = None,
+ encoder_padding_mask: Optional[torch.Tensor] = None,
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]] = None,
+ prev_self_attn_state: Optional[List[torch.Tensor]] = None,
+ prev_attn_state: Optional[List[torch.Tensor]] = None,
+ self_attn_mask: Optional[torch.Tensor] = None,
+ self_attn_padding_mask: Optional[torch.Tensor] = None,
+ need_attn: bool = False,
+ need_head_weights: bool = False,
+ ):
+ """
+ Args:
+ x (Tensor): input to the layer of shape `(seq_len, batch, embed_dim)`
+ encoder_padding_mask (ByteTensor, optional): binary
+ ByteTensor of shape `(batch, src_len)` where padding
+ elements are indicated by ``1``.
+ need_attn (bool, optional): return attention weights
+ need_head_weights (bool, optional): return attention weights
+ for each head (default: return average over heads).
+
+ Returns:
+ encoded output of shape `(seq_len, batch, embed_dim)`
+ """
+ if need_head_weights:
+ need_attn = True
+
+ residual = x
+ x = self.self_attn_layer_norm(x)
+ if prev_self_attn_state is not None:
+ prev_key, prev_value = prev_self_attn_state[:2]
+ saved_state: Dict[str, Optional[Tensor]] = {
+ "prev_key": prev_key,
+ "prev_value": prev_value,
+ }
+ if len(prev_self_attn_state) >= 3:
+ saved_state["prev_key_padding_mask"] = prev_self_attn_state[2]
+ assert incremental_state is not None
+ self.self_attn._set_input_buffer(incremental_state, saved_state)
+ _self_attn_input_buffer = self.self_attn._get_input_buffer(incremental_state)
+ y = x
+
+ x, attn = self.self_attn(
+ query=x,
+ key=y,
+ value=y,
+ key_padding_mask=self_attn_padding_mask,
+ incremental_state=incremental_state,
+ need_weights=False,
+ attn_mask=self_attn_mask,
+ )
+ x = self.dropout_module(x)
+ x = self.residual_connection(x, residual)
+
+ if self.encoder_attn is not None and encoder_out is not None:
+ residual = x
+ x = self.encoder_attn_layer_norm(x)
+ if prev_attn_state is not None:
+ prev_key, prev_value = prev_attn_state[:2]
+ saved_state: Dict[str, Optional[Tensor]] = {
+ "prev_key": prev_key,
+ "prev_value": prev_value,
+ }
+ if len(prev_attn_state) >= 3:
+ saved_state["prev_key_padding_mask"] = prev_attn_state[2]
+ assert incremental_state is not None
+ self.encoder_attn._set_input_buffer(incremental_state, saved_state)
+
+ x, attn = self.encoder_attn(
+ query=x,
+ key=encoder_out,
+ value=encoder_out,
+ key_padding_mask=encoder_padding_mask,
+ incremental_state=incremental_state,
+ static_kv=True,
+ need_weights=need_attn or (not self.training and self.need_attn),
+ need_head_weights=need_head_weights,
+ )
+ x = self.dropout_module(x)
+ x = self.residual_connection(x, residual)
+
+ residual = x
+ x = self.final_layer_norm(x)
+
+ x = self.activation_fn(self.fc1(x))
+ if self.ffn_layernorm is not None:
+ x = self.ffn_layernorm(x)
+ x = self.fc2(x)
+ x = self.dropout_module(x)
+ if self.w_resid is not None:
+ residual = torch.mul(self.w_resid, residual)
+ x = self.residual_connection(x, residual)
+ return x, attn, None
diff --git a/proteingym/baselines/esm/esm/inverse_folding/util.py b/proteingym/baselines/esm/esm/inverse_folding/util.py
new file mode 100644
index 0000000..e38b700
--- /dev/null
+++ b/proteingym/baselines/esm/esm/inverse_folding/util.py
@@ -0,0 +1,323 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import json
+import math
+
+import biotite.structure
+from biotite.structure.io import pdbx, pdb
+from biotite.structure.residues import get_residues
+from biotite.structure import filter_backbone
+from biotite.structure import get_chains
+from biotite.sequence import ProteinSequence
+import numpy as np
+from scipy.spatial import transform
+from scipy.stats import special_ortho_group
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+import torch.utils.data as data
+from typing import Sequence, Tuple, List
+
+from esm.data import BatchConverter
+
+
+def load_structure(fpath, chain=None):
+ """
+ Args:
+ fpath: filepath to either pdb or cif file
+ chain: the chain id or list of chain ids to load
+ Returns:
+ biotite.structure.AtomArray
+ """
+ if fpath.endswith('cif'):
+ with open(fpath) as fin:
+ pdbxf = pdbx.PDBxFile.read(fin)
+ structure = pdbx.get_structure(pdbxf, model=1)
+ elif fpath.endswith('pdb'):
+ with open(fpath) as fin:
+ pdbf = pdb.PDBFile.read(fin)
+ structure = pdb.get_structure(pdbf, model=1)
+ bbmask = filter_backbone(structure)
+ structure = structure[bbmask]
+ all_chains = get_chains(structure)
+ if len(all_chains) == 0:
+ raise ValueError('No chains found in the input file.')
+ if chain is None:
+ chain_ids = all_chains
+ elif isinstance(chain, list):
+ chain_ids = chain
+ else:
+ chain_ids = [chain]
+ for chain in chain_ids:
+ if chain not in all_chains:
+ raise ValueError(f'Chain {chain} not found in input file')
+ chain_filter = [a.chain_id in chain_ids for a in structure]
+ structure = structure[chain_filter]
+ return structure
+
+
+def extract_coords_from_structure(structure: biotite.structure.AtomArray):
+ """
+ Args:
+ structure: An instance of biotite AtomArray
+ Returns:
+ Tuple (coords, seq)
+ - coords is an L x 3 x 3 array for N, CA, C coordinates
+ - seq is the extracted sequence
+ """
+ coords = get_atom_coords_residuewise(["N", "CA", "C"], structure)
+ residue_identities = get_residues(structure)[1]
+ seq = ''.join([ProteinSequence.convert_letter_3to1(r) for r in residue_identities])
+ return coords, seq
+
+
+def load_coords(fpath, chain):
+ """
+ Args:
+ fpath: filepath to either pdb or cif file
+ chain: the chain id
+ Returns:
+ Tuple (coords, seq)
+ - coords is an L x 3 x 3 array for N, CA, C coordinates
+ - seq is the extracted sequence
+ """
+ structure = load_structure(fpath, chain)
+ return extract_coords_from_structure(structure)
+
+
+def get_atom_coords_residuewise(atoms: List[str], struct: biotite.structure.AtomArray):
+ """
+ Example for atoms argument: ["N", "CA", "C"]
+ """
+ def filterfn(s, axis=None):
+ filters = np.stack([s.atom_name == name for name in atoms], axis=1)
+ sum = filters.sum(0)
+ if not np.all(sum <= np.ones(filters.shape[1])):
+ raise RuntimeError("structure has multiple atoms with same name")
+ index = filters.argmax(0)
+ coords = s[index].coord
+ coords[sum == 0] = float("nan")
+ return coords
+
+ return biotite.structure.apply_residue_wise(struct, struct, filterfn)
+
+
+def get_sequence_loss(model, alphabet, coords, seq):
+ device = next(model.parameters()).device
+ batch_converter = CoordBatchConverter(alphabet)
+ batch = [(coords, None, seq)]
+ coords, confidence, strs, tokens, padding_mask = batch_converter(
+ batch, device=device)
+
+ prev_output_tokens = tokens[:, :-1].to(device)
+ target = tokens[:, 1:]
+ target_padding_mask = (target == alphabet.padding_idx)
+ logits, _ = model.forward(coords, padding_mask, confidence, prev_output_tokens)
+ loss = F.cross_entropy(logits, target, reduction='none')
+ loss = loss[0].cpu().detach().numpy()
+ target_padding_mask = target_padding_mask[0].cpu().numpy()
+ return loss, target_padding_mask
+
+
+def score_sequence(model, alphabet, coords, seq):
+ loss, target_padding_mask = get_sequence_loss(model, alphabet, coords, seq)
+ ll_fullseq = -np.sum(loss * ~target_padding_mask) / np.sum(~target_padding_mask)
+ # Also calculate average when excluding masked portions
+ coord_mask = np.all(np.isfinite(coords), axis=(-1, -2))
+ ll_withcoord = -np.sum(loss * coord_mask) / np.sum(coord_mask)
+ return ll_fullseq, ll_withcoord
+
+
+def get_encoder_output(model, alphabet, coords):
+ device = next(model.parameters()).device
+ batch_converter = CoordBatchConverter(alphabet)
+ batch = [(coords, None, None)]
+ coords, confidence, strs, tokens, padding_mask = batch_converter(
+ batch, device=device)
+ encoder_out = model.encoder.forward(coords, padding_mask, confidence,
+ return_all_hiddens=False)
+ # remove beginning and end (bos and eos tokens)
+ return encoder_out['encoder_out'][0][1:-1, 0]
+
+
+def rotate(v, R):
+ """
+ Rotates a vector by a rotation matrix.
+
+ Args:
+ v: 3D vector, tensor of shape (length x batch_size x channels x 3)
+ R: rotation matrix, tensor of shape (length x batch_size x 3 x 3)
+
+ Returns:
+ Rotated version of v by rotation matrix R.
+ """
+ R = R.unsqueeze(-3)
+ v = v.unsqueeze(-1)
+ return torch.sum(v * R, dim=-2)
+
+
+def get_rotation_frames(coords):
+ """
+ Returns a local rotation frame defined by N, CA, C positions.
+
+ Args:
+ coords: coordinates, tensor of shape (batch_size x length x 3 x 3)
+ where the third dimension is in order of N, CA, C
+
+ Returns:
+ Local relative rotation frames in shape (batch_size x length x 3 x 3)
+ """
+ v1 = coords[:, :, 2] - coords[:, :, 1]
+ v2 = coords[:, :, 0] - coords[:, :, 1]
+ e1 = normalize(v1, dim=-1)
+ u2 = v2 - e1 * torch.sum(e1 * v2, dim=-1, keepdim=True)
+ e2 = normalize(u2, dim=-1)
+ e3 = torch.cross(e1, e2, dim=-1)
+ R = torch.stack([e1, e2, e3], dim=-2)
+ return R
+
+
+def nan_to_num(ts, val=0.0):
+ """
+ Replaces nans in tensor with a fixed value.
+ """
+ val = torch.tensor(val, dtype=ts.dtype, device=ts.device)
+ return torch.where(~torch.isfinite(ts), val, ts)
+
+
+def rbf(values, v_min, v_max, n_bins=16):
+ """
+ Returns RBF encodings in a new dimension at the end.
+ """
+ rbf_centers = torch.linspace(v_min, v_max, n_bins, device=values.device)
+ rbf_centers = rbf_centers.view([1] * len(values.shape) + [-1])
+ rbf_std = (v_max - v_min) / n_bins
+ v_expand = torch.unsqueeze(values, -1)
+ z = (values.unsqueeze(-1) - rbf_centers) / rbf_std
+ return torch.exp(-z ** 2)
+
+
+def norm(tensor, dim, eps=1e-8, keepdim=False):
+ """
+ Returns L2 norm along a dimension.
+ """
+ return torch.sqrt(
+ torch.sum(torch.square(tensor), dim=dim, keepdim=keepdim) + eps)
+
+
+def normalize(tensor, dim=-1):
+ """
+ Normalizes a tensor along a dimension after removing nans.
+ """
+ return nan_to_num(
+ torch.div(tensor, norm(tensor, dim=dim, keepdim=True))
+ )
+
+
+class CoordBatchConverter(BatchConverter):
+ def __call__(self, raw_batch: Sequence[Tuple[Sequence, str]], device=None):
+ """
+ Args:
+ raw_batch: List of tuples (coords, confidence, seq)
+ In each tuple,
+ coords: list of floats, shape L x 3 x 3
+ confidence: list of floats, shape L; or scalar float; or None
+ seq: string of length L
+ Returns:
+ coords: Tensor of shape batch_size x L x 3 x 3
+ confidence: Tensor of shape batch_size x L
+ strs: list of strings
+ tokens: LongTensor of shape batch_size x L
+ padding_mask: ByteTensor of shape batch_size x L
+ """
+ self.alphabet.cls_idx = self.alphabet.get_idx("")
+ batch = []
+ for coords, confidence, seq in raw_batch:
+ if confidence is None:
+ confidence = 1.
+ if isinstance(confidence, float) or isinstance(confidence, int):
+ confidence = [float(confidence)] * len(coords)
+ if seq is None:
+ seq = 'X' * len(coords)
+ batch.append(((coords, confidence), seq))
+
+ coords_and_confidence, strs, tokens = super().__call__(batch)
+
+ # pad beginning and end of each protein due to legacy reasons
+ coords = [
+ F.pad(torch.tensor(cd), (0, 0, 0, 0, 1, 1), value=np.inf)
+ for cd, _ in coords_and_confidence
+ ]
+ confidence = [
+ F.pad(torch.tensor(cf), (1, 1), value=-1.)
+ for _, cf in coords_and_confidence
+ ]
+ coords = self.collate_dense_tensors(coords, pad_v=np.nan)
+ confidence = self.collate_dense_tensors(confidence, pad_v=-1.)
+ if device is not None:
+ coords = coords.to(device)
+ confidence = confidence.to(device)
+ tokens = tokens.to(device)
+ padding_mask = torch.isnan(coords[:,:,0,0])
+ coord_mask = torch.isfinite(coords.sum(-2).sum(-1))
+ confidence = confidence * coord_mask + (-1.) * padding_mask
+ return coords, confidence, strs, tokens, padding_mask
+
+ def from_lists(self, coords_list, confidence_list=None, seq_list=None, device=None):
+ """
+ Args:
+ coords_list: list of length batch_size, each item is a list of
+ floats in shape L x 3 x 3 to describe a backbone
+ confidence_list: one of
+ - None, default to highest confidence
+ - list of length batch_size, each item is a scalar
+ - list of length batch_size, each item is a list of floats of
+ length L to describe the confidence scores for the backbone
+ with values between 0. and 1.
+ seq_list: either None or a list of strings
+ Returns:
+ coords: Tensor of shape batch_size x L x 3 x 3
+ confidence: Tensor of shape batch_size x L
+ strs: list of strings
+ tokens: LongTensor of shape batch_size x L
+ padding_mask: ByteTensor of shape batch_size x L
+ """
+ batch_size = len(coords_list)
+ if confidence_list is None:
+ confidence_list = [None] * batch_size
+ if seq_list is None:
+ seq_list = [None] * batch_size
+ raw_batch = zip(coords_list, confidence_list, seq_list)
+ return self.__call__(raw_batch, device)
+
+ @staticmethod
+ def collate_dense_tensors(samples, pad_v):
+ """
+ Takes a list of tensors with the following dimensions:
+ [(d_11, ..., d_1K),
+ (d_21, ..., d_2K),
+ ...,
+ (d_N1, ..., d_NK)]
+ and stack + pads them into a single tensor of:
+ (N, max_i=1,N { d_i1 }, ..., max_i=1,N {diK})
+ """
+ if len(samples) == 0:
+ return torch.Tensor()
+ if len(set(x.dim() for x in samples)) != 1:
+ raise RuntimeError(
+ f"Samples has varying dimensions: {[x.dim() for x in samples]}"
+ )
+ (device,) = tuple(set(x.device for x in samples)) # assumes all on same device
+ max_shape = [max(lst) for lst in zip(*[x.shape for x in samples])]
+ result = torch.empty(
+ len(samples), *max_shape, dtype=samples[0].dtype, device=device
+ )
+ result.fill_(pad_v)
+ for i in range(len(samples)):
+ result_i = result[i]
+ t = samples[i]
+ result_i[tuple(slice(0, k) for k in t.shape)] = t
+ return result
diff --git a/proteingym/baselines/esm/esm/model/esm1.py b/proteingym/baselines/esm/esm/model/esm1.py
new file mode 100644
index 0000000..b199662
--- /dev/null
+++ b/proteingym/baselines/esm/esm/model/esm1.py
@@ -0,0 +1,200 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import math
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+from ..modules import (
+ TransformerLayer,
+ LearnedPositionalEmbedding,
+ SinusoidalPositionalEmbedding,
+ RobertaLMHead,
+ ESM1bLayerNorm,
+ ContactPredictionHead,
+)
+
+
+class ProteinBertModel(nn.Module):
+ @classmethod
+ def add_args(cls, parser):
+ parser.add_argument(
+ "--num_layers", default=36, type=int, metavar="N", help="number of layers"
+ )
+ parser.add_argument(
+ "--embed_dim", default=1280, type=int, metavar="N", help="embedding dimension"
+ )
+ parser.add_argument(
+ "--logit_bias", action="store_true", help="whether to apply bias to logits"
+ )
+ parser.add_argument(
+ "--ffn_embed_dim",
+ default=5120,
+ type=int,
+ metavar="N",
+ help="embedding dimension for FFN",
+ )
+ parser.add_argument(
+ "--attention_heads",
+ default=20,
+ type=int,
+ metavar="N",
+ help="number of attention heads",
+ )
+
+ def __init__(self, args, alphabet):
+ super().__init__()
+ self.args = args
+ self.alphabet_size = len(alphabet)
+ self.padding_idx = alphabet.padding_idx
+ self.mask_idx = alphabet.mask_idx
+ self.cls_idx = alphabet.cls_idx
+ self.eos_idx = alphabet.eos_idx
+ self.prepend_bos = alphabet.prepend_bos
+ self.append_eos = alphabet.append_eos
+ self.emb_layer_norm_before = getattr(self.args, "emb_layer_norm_before", False)
+ if self.args.arch == "roberta_large":
+ self.model_version = "ESM-1b"
+ self._init_submodules_esm1b()
+ else:
+ self.model_version = "ESM-1"
+ self._init_submodules_esm1()
+
+ def _init_submodules_common(self):
+ self.embed_tokens = nn.Embedding(
+ self.alphabet_size, self.args.embed_dim, padding_idx=self.padding_idx
+ )
+ self.layers = nn.ModuleList(
+ [
+ TransformerLayer(
+ self.args.embed_dim,
+ self.args.ffn_embed_dim,
+ self.args.attention_heads,
+ add_bias_kv=(self.model_version != "ESM-1b"),
+ use_esm1b_layer_norm=(self.model_version == "ESM-1b"),
+ )
+ for _ in range(self.args.layers)
+ ]
+ )
+
+ self.contact_head = ContactPredictionHead(
+ self.args.layers * self.args.attention_heads,
+ self.prepend_bos,
+ self.append_eos,
+ eos_idx=self.eos_idx,
+ )
+
+ def _init_submodules_esm1b(self):
+ self._init_submodules_common()
+ self.embed_scale = 1
+ self.embed_positions = LearnedPositionalEmbedding(
+ self.args.max_positions, self.args.embed_dim, self.padding_idx
+ )
+ self.emb_layer_norm_before = (
+ ESM1bLayerNorm(self.args.embed_dim) if self.emb_layer_norm_before else None
+ )
+ self.emb_layer_norm_after = ESM1bLayerNorm(self.args.embed_dim)
+ self.lm_head = RobertaLMHead(
+ embed_dim=self.args.embed_dim,
+ output_dim=self.alphabet_size,
+ weight=self.embed_tokens.weight,
+ )
+
+ def _init_submodules_esm1(self):
+ self._init_submodules_common()
+ self.embed_scale = math.sqrt(self.args.embed_dim)
+ self.embed_positions = SinusoidalPositionalEmbedding(self.args.embed_dim, self.padding_idx)
+ self.embed_out = nn.Parameter(torch.zeros((self.alphabet_size, self.args.embed_dim)))
+ self.embed_out_bias = None
+ if self.args.final_bias:
+ self.embed_out_bias = nn.Parameter(torch.zeros(self.alphabet_size))
+
+ def forward(self, tokens, repr_layers=[], need_head_weights=False, return_contacts=False):
+ if return_contacts:
+ need_head_weights = True
+
+ assert tokens.ndim == 2
+ padding_mask = tokens.eq(self.padding_idx) # B, T
+
+ x = self.embed_scale * self.embed_tokens(tokens)
+
+ if getattr(self.args, "token_dropout", False):
+ x.masked_fill_((tokens == self.mask_idx).unsqueeze(-1), 0.0)
+ # x: B x T x C
+ mask_ratio_train = 0.15 * 0.8
+ src_lengths = (~padding_mask).sum(-1)
+ mask_ratio_observed = (tokens == self.mask_idx).sum(-1).float() / src_lengths
+ x = x * (1 - mask_ratio_train) / (1 - mask_ratio_observed)[:, None, None]
+
+ x = x + self.embed_positions(tokens)
+
+ if self.model_version == "ESM-1b":
+ if self.emb_layer_norm_before:
+ x = self.emb_layer_norm_before(x)
+ if padding_mask is not None:
+ x = x * (1 - padding_mask.unsqueeze(-1).type_as(x))
+
+ repr_layers = set(repr_layers)
+ hidden_representations = {}
+ if 0 in repr_layers:
+ hidden_representations[0] = x
+
+ if need_head_weights:
+ attn_weights = []
+
+ # (B, T, E) => (T, B, E)
+ x = x.transpose(0, 1)
+
+ if not padding_mask.any():
+ padding_mask = None
+
+ for layer_idx, layer in enumerate(self.layers):
+ x, attn = layer(
+ x, self_attn_padding_mask=padding_mask, need_head_weights=need_head_weights
+ )
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x.transpose(0, 1)
+ if need_head_weights:
+ # (H, B, T, T) => (B, H, T, T)
+ attn_weights.append(attn.transpose(1, 0))
+
+ if self.model_version == "ESM-1b":
+ x = self.emb_layer_norm_after(x)
+ x = x.transpose(0, 1) # (T, B, E) => (B, T, E)
+
+ # last hidden representation should have layer norm applied
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x
+ x = self.lm_head(x)
+ else:
+ x = F.linear(x, self.embed_out, bias=self.embed_out_bias)
+ x = x.transpose(0, 1) # (T, B, E) => (B, T, E)
+
+ result = {"logits": x, "representations": hidden_representations}
+ if need_head_weights:
+ # attentions: B x L x H x T x T
+ attentions = torch.stack(attn_weights, 1)
+ if self.model_version == "ESM-1":
+ # ESM-1 models have an additional null-token for attention, which we remove
+ attentions = attentions[..., :-1]
+ if padding_mask is not None:
+ attention_mask = 1 - padding_mask.type_as(attentions)
+ attention_mask = attention_mask.unsqueeze(1) * attention_mask.unsqueeze(2)
+ attentions = attentions * attention_mask[:, None, None, :, :]
+ result["attentions"] = attentions
+ if return_contacts:
+ contacts = self.contact_head(tokens, attentions)
+ result["contacts"] = contacts
+
+ return result
+
+ def predict_contacts(self, tokens):
+ return self(tokens, return_contacts=True)["contacts"]
+
+ @property
+ def num_layers(self):
+ return self.args.layers
diff --git a/proteingym/baselines/esm/esm/model/esm2.py b/proteingym/baselines/esm/esm/model/esm2.py
new file mode 100644
index 0000000..e5613e5
--- /dev/null
+++ b/proteingym/baselines/esm/esm/model/esm2.py
@@ -0,0 +1,146 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from typing import Union
+import torch
+import torch.nn as nn
+import esm
+from ..modules import ContactPredictionHead, ESM1bLayerNorm, RobertaLMHead, TransformerLayer
+
+
+class ESM2(nn.Module):
+ def __init__(
+ self,
+ num_layers: int = 33,
+ embed_dim: int = 1280,
+ attention_heads: int = 20,
+ alphabet: Union[esm.data.Alphabet, str] = "ESM-1b",
+ token_dropout: bool = True,
+ ):
+ super().__init__()
+ self.num_layers = num_layers
+ self.embed_dim = embed_dim
+ self.attention_heads = attention_heads
+ if not (isinstance(alphabet, esm.data.Alphabet) or alphabet.__class__.__name__ == "Alphabet"):
+ alphabet = esm.data.Alphabet.from_architecture(alphabet)
+ self.alphabet = alphabet
+ self.alphabet_size = len(alphabet)
+ self.padding_idx = alphabet.padding_idx
+ self.mask_idx = alphabet.mask_idx
+ self.cls_idx = alphabet.cls_idx
+ self.eos_idx = alphabet.eos_idx
+ self.prepend_bos = alphabet.prepend_bos
+ self.append_eos = alphabet.append_eos
+ self.token_dropout = token_dropout
+
+ self._init_submodules()
+
+ def _init_submodules(self):
+ self.embed_scale = 1
+ self.embed_tokens = nn.Embedding(
+ self.alphabet_size,
+ self.embed_dim,
+ padding_idx=self.padding_idx,
+ )
+
+ self.layers = nn.ModuleList(
+ [
+ TransformerLayer(
+ self.embed_dim,
+ 4 * self.embed_dim,
+ self.attention_heads,
+ add_bias_kv=False,
+ use_esm1b_layer_norm=True,
+ use_rotary_embeddings=True,
+ )
+ for _ in range(self.num_layers)
+ ]
+ )
+
+ self.contact_head = ContactPredictionHead(
+ self.num_layers * self.attention_heads,
+ self.prepend_bos,
+ self.append_eos,
+ eos_idx=self.eos_idx,
+ )
+ self.emb_layer_norm_after = ESM1bLayerNorm(self.embed_dim)
+
+ self.lm_head = RobertaLMHead(
+ embed_dim=self.embed_dim,
+ output_dim=self.alphabet_size,
+ weight=self.embed_tokens.weight,
+ )
+
+ def forward(self, tokens, repr_layers=[], need_head_weights=False, return_contacts=False):
+ if return_contacts:
+ need_head_weights = True
+
+ assert tokens.ndim == 2
+ padding_mask = tokens.eq(self.padding_idx) # B, T
+
+ x = self.embed_scale * self.embed_tokens(tokens)
+
+ if self.token_dropout:
+ x.masked_fill_((tokens == self.mask_idx).unsqueeze(-1), 0.0)
+ # x: B x T x C
+ mask_ratio_train = 0.15 * 0.8
+ src_lengths = (~padding_mask).sum(-1)
+ mask_ratio_observed = (tokens == self.mask_idx).sum(-1).to(x.dtype) / src_lengths
+ x = x * (1 - mask_ratio_train) / (1 - mask_ratio_observed)[:, None, None]
+
+ if padding_mask is not None:
+ x = x * (1 - padding_mask.unsqueeze(-1).type_as(x))
+
+ repr_layers = set(repr_layers)
+ hidden_representations = {}
+ if 0 in repr_layers:
+ hidden_representations[0] = x
+
+ if need_head_weights:
+ attn_weights = []
+
+ # (B, T, E) => (T, B, E)
+ x = x.transpose(0, 1)
+
+ if not padding_mask.any():
+ padding_mask = None
+
+ for layer_idx, layer in enumerate(self.layers):
+ x, attn = layer(
+ x,
+ self_attn_padding_mask=padding_mask,
+ need_head_weights=need_head_weights,
+ )
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x.transpose(0, 1)
+ if need_head_weights:
+ # (H, B, T, T) => (B, H, T, T)
+ attn_weights.append(attn.transpose(1, 0))
+
+ x = self.emb_layer_norm_after(x)
+ x = x.transpose(0, 1) # (T, B, E) => (B, T, E)
+
+ # last hidden representation should have layer norm applied
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x
+ x = self.lm_head(x)
+
+ result = {"logits": x, "representations": hidden_representations}
+ if need_head_weights:
+ # attentions: B x L x H x T x T
+ attentions = torch.stack(attn_weights, 1)
+ if padding_mask is not None:
+ attention_mask = 1 - padding_mask.type_as(attentions)
+ attention_mask = attention_mask.unsqueeze(1) * attention_mask.unsqueeze(2)
+ attentions = attentions * attention_mask[:, None, None, :, :]
+ result["attentions"] = attentions
+ if return_contacts:
+ contacts = self.contact_head(tokens, attentions)
+ result["contacts"] = contacts
+
+ return result
+
+ def predict_contacts(self, tokens):
+ return self(tokens, return_contacts=True)["contacts"]
diff --git a/proteingym/baselines/esm/esm/model/msa_transformer.py b/proteingym/baselines/esm/esm/model/msa_transformer.py
new file mode 100644
index 0000000..ef21cf5
--- /dev/null
+++ b/proteingym/baselines/esm/esm/model/msa_transformer.py
@@ -0,0 +1,238 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import torch
+import torch.nn as nn
+
+from ..modules import (
+ AxialTransformerLayer,
+ LearnedPositionalEmbedding,
+ RobertaLMHead,
+ ESM1bLayerNorm,
+ ContactPredictionHead,
+)
+
+from ..axial_attention import RowSelfAttention, ColumnSelfAttention
+
+
+
+class MSATransformer(nn.Module):
+ @classmethod
+ def add_args(cls, parser):
+ # fmt: off
+ parser.add_argument(
+ "--num_layers",
+ default=12,
+ type=int,
+ metavar="N",
+ help="number of layers"
+ )
+ parser.add_argument(
+ "--embed_dim",
+ default=768,
+ type=int,
+ metavar="N",
+ help="embedding dimension"
+ )
+ parser.add_argument(
+ "--logit_bias",
+ action="store_true",
+ help="whether to apply bias to logits"
+ )
+ parser.add_argument(
+ "--ffn_embed_dim",
+ default=3072,
+ type=int,
+ metavar="N",
+ help="embedding dimension for FFN",
+ )
+ parser.add_argument(
+ "--attention_heads",
+ default=12,
+ type=int,
+ metavar="N",
+ help="number of attention heads",
+ )
+ parser.add_argument(
+ "--dropout",
+ default=0.1,
+ type=float,
+ help="Dropout to apply."
+ )
+ parser.add_argument(
+ "--attention_dropout",
+ default=0.1,
+ type=float,
+ help="Dropout to apply."
+ )
+ parser.add_argument(
+ "--activation_dropout",
+ default=0.1,
+ type=float,
+ help="Dropout to apply."
+ )
+ parser.add_argument(
+ "--max_tokens_per_msa",
+ default=2 ** 14,
+ type=int,
+ help=(
+ "Used during inference to batch attention computations in a single "
+ "forward pass. This allows increased input sizes with less memory."
+ ),
+ )
+ # fmt: on
+
+ def __init__(self, args, alphabet):
+ super().__init__()
+ self.args = args
+ self.alphabet_size = len(alphabet)
+ self.padding_idx = alphabet.padding_idx
+ self.mask_idx = alphabet.mask_idx
+ self.cls_idx = alphabet.cls_idx
+ self.eos_idx = alphabet.eos_idx
+ self.prepend_bos = alphabet.prepend_bos
+ self.append_eos = alphabet.append_eos
+
+ self.embed_tokens = nn.Embedding(
+ self.alphabet_size, self.args.embed_dim, padding_idx=self.padding_idx
+ )
+
+ if getattr(self.args, "embed_positions_msa", False):
+ emb_dim = getattr(self.args, "embed_positions_msa_dim", self.args.embed_dim)
+ self.msa_position_embedding = nn.Parameter(
+ 0.01 * torch.randn(1, 1024, 1, emb_dim),
+ requires_grad=True,
+ )
+ else:
+ self.register_parameter("msa_position_embedding", None)
+
+ self.dropout_module = nn.Dropout(self.args.dropout)
+ self.layers = nn.ModuleList(
+ [
+ AxialTransformerLayer(
+ self.args.embed_dim,
+ self.args.ffn_embed_dim,
+ self.args.attention_heads,
+ self.args.dropout,
+ self.args.attention_dropout,
+ self.args.activation_dropout,
+ getattr(self.args, "max_tokens_per_msa", self.args.max_tokens),
+ )
+ for _ in range(self.args.layers)
+ ]
+ )
+
+ self.contact_head = ContactPredictionHead(
+ self.args.layers * self.args.attention_heads,
+ self.prepend_bos,
+ self.append_eos,
+ eos_idx=self.eos_idx,
+ )
+ self.embed_positions = LearnedPositionalEmbedding(
+ self.args.max_positions,
+ self.args.embed_dim,
+ self.padding_idx,
+ )
+ self.emb_layer_norm_before = ESM1bLayerNorm(self.args.embed_dim)
+ self.emb_layer_norm_after = ESM1bLayerNorm(self.args.embed_dim)
+ self.lm_head = RobertaLMHead(
+ embed_dim=self.args.embed_dim,
+ output_dim=self.alphabet_size,
+ weight=self.embed_tokens.weight,
+ )
+
+ def forward(self, tokens, repr_layers=[], need_head_weights=False, return_contacts=False):
+ if return_contacts:
+ need_head_weights = True
+
+ assert tokens.ndim == 3
+ batch_size, num_alignments, seqlen = tokens.size()
+ padding_mask = tokens.eq(self.padding_idx) # B, R, C
+ if not padding_mask.any():
+ padding_mask = None
+
+ x = self.embed_tokens(tokens)
+ x += self.embed_positions(tokens.view(batch_size * num_alignments, seqlen)).view(x.size())
+ if self.msa_position_embedding is not None:
+ if x.size(1) > 1024:
+ raise RuntimeError(
+ "Using model with MSA position embedding trained on maximum MSA "
+ f"depth of 1024, but received {x.size(1)} alignments."
+ )
+ x += self.msa_position_embedding[:, :num_alignments]
+
+ x = self.emb_layer_norm_before(x)
+
+ x = self.dropout_module(x)
+
+ if padding_mask is not None:
+ x = x * (1 - padding_mask.unsqueeze(-1).type_as(x))
+
+ repr_layers = set(repr_layers)
+ hidden_representations = {}
+ if 0 in repr_layers:
+ hidden_representations[0] = x
+
+ if need_head_weights:
+ row_attn_weights = []
+ col_attn_weights = []
+
+ # B x R x C x D -> R x C x B x D
+ x = x.permute(1, 2, 0, 3)
+
+ for layer_idx, layer in enumerate(self.layers):
+ x = layer(
+ x,
+ self_attn_padding_mask=padding_mask,
+ need_head_weights=need_head_weights,
+ )
+ if need_head_weights:
+ x, col_attn, row_attn = x
+ # H x C x B x R x R -> B x H x C x R x R
+ col_attn_weights.append(col_attn.permute(2, 0, 1, 3, 4))
+ # H x B x C x C -> B x H x C x C
+ row_attn_weights.append(row_attn.permute(1, 0, 2, 3))
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x.permute(2, 0, 1, 3)
+
+ x = self.emb_layer_norm_after(x)
+ x = x.permute(2, 0, 1, 3) # R x C x B x D -> B x R x C x D
+
+ # last hidden representation should have layer norm applied
+ if (layer_idx + 1) in repr_layers:
+ hidden_representations[layer_idx + 1] = x
+ x = self.lm_head(x)
+
+ result = {"logits": x, "representations": hidden_representations}
+ if need_head_weights:
+ # col_attentions: B x L x H x C x R x R
+ col_attentions = torch.stack(col_attn_weights, 1)
+ # row_attentions: B x L x H x C x C
+ row_attentions = torch.stack(row_attn_weights, 1)
+ result["col_attentions"] = col_attentions
+ result["row_attentions"] = row_attentions
+ if return_contacts:
+ contacts = self.contact_head(tokens, row_attentions)
+ result["contacts"] = contacts
+
+ return result
+
+ def predict_contacts(self, tokens):
+ return self(tokens, return_contacts=True)["contacts"]
+
+ @property
+ def num_layers(self):
+ return self.args.layers
+
+ def max_tokens_per_msa_(self, value: int) -> None:
+ """The MSA Transformer automatically batches attention computations when
+ gradients are disabled to allow you to pass in larger MSAs at test time than
+ you can fit in GPU memory. By default this occurs when more than 2^14 tokens
+ are passed in the input MSA. You can set this value to infinity to disable
+ this behavior.
+ """
+ for module in self.modules():
+ if isinstance(module, (RowSelfAttention, ColumnSelfAttention)):
+ module.max_tokens_per_msa = value
diff --git a/proteingym/baselines/esm/esm/modules.py b/proteingym/baselines/esm/esm/modules.py
new file mode 100644
index 0000000..437f764
--- /dev/null
+++ b/proteingym/baselines/esm/esm/modules.py
@@ -0,0 +1,432 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import math
+from typing import Optional
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+from .multihead_attention import MultiheadAttention # noqa
+from .axial_attention import ColumnSelfAttention, RowSelfAttention
+
+
+def gelu(x):
+ """Implementation of the gelu activation function.
+
+ For information: OpenAI GPT's gelu is slightly different
+ (and gives slightly different results):
+ 0.5 * x * (1 + torch.tanh(math.sqrt(2 / math.pi) * (x + 0.044715 * torch.pow(x, 3))))
+ """
+ return x * 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))
+
+
+def symmetrize(x):
+ "Make layer symmetric in final two dimensions, used for contact prediction."
+ return x + x.transpose(-1, -2)
+
+
+def apc(x):
+ "Perform average product correct, used for contact prediction."
+ a1 = x.sum(-1, keepdims=True)
+ a2 = x.sum(-2, keepdims=True)
+ a12 = x.sum((-1, -2), keepdims=True)
+
+ avg = a1 * a2
+ avg.div_(a12) # in-place to reduce memory
+ normalized = x - avg
+ return normalized
+
+
+class ESM1LayerNorm(nn.Module):
+ def __init__(self, hidden_size, eps=1e-12, affine=True):
+ """Construct a layernorm layer in the TF style (eps inside the sqrt)."""
+ super().__init__()
+ self.hidden_size = (hidden_size,) if isinstance(hidden_size, int) else tuple(hidden_size)
+ self.eps = eps
+ self.affine = bool(affine)
+ if self.affine:
+ self.weight = nn.Parameter(torch.ones(hidden_size))
+ self.bias = nn.Parameter(torch.zeros(hidden_size))
+ else:
+ self.weight, self.bias = None, None
+
+ def forward(self, x):
+ dims = tuple(-(i + 1) for i in range(len(self.hidden_size)))
+ means = x.mean(dims, keepdim=True)
+ x_zeromean = x - means
+ variances = x_zeromean.pow(2).mean(dims, keepdim=True)
+ x = x_zeromean / torch.sqrt(variances + self.eps)
+ if self.affine:
+ x = (self.weight * x) + self.bias
+ return x
+
+
+try:
+ from apex.normalization import FusedLayerNorm as _FusedLayerNorm
+
+ class ESM1bLayerNorm(_FusedLayerNorm):
+ @torch.jit.unused
+ def forward(self, x):
+ if not x.is_cuda:
+ return super().forward(x)
+ else:
+ with torch.cuda.device(x.device):
+ return super().forward(x)
+
+except ImportError:
+ from torch.nn import LayerNorm as ESM1bLayerNorm
+
+
+class TransformerLayer(nn.Module):
+ """Transformer layer block."""
+
+ def __init__(
+ self,
+ embed_dim,
+ ffn_embed_dim,
+ attention_heads,
+ add_bias_kv=True,
+ use_esm1b_layer_norm=False,
+ use_rotary_embeddings: bool = False,
+ ):
+ super().__init__()
+ self.embed_dim = embed_dim
+ self.ffn_embed_dim = ffn_embed_dim
+ self.attention_heads = attention_heads
+ self.use_rotary_embeddings = use_rotary_embeddings
+ self._init_submodules(add_bias_kv, use_esm1b_layer_norm)
+
+ def _init_submodules(self, add_bias_kv, use_esm1b_layer_norm):
+ BertLayerNorm = ESM1bLayerNorm if use_esm1b_layer_norm else ESM1LayerNorm
+
+ self.self_attn = MultiheadAttention(
+ self.embed_dim,
+ self.attention_heads,
+ add_bias_kv=add_bias_kv,
+ add_zero_attn=False,
+ use_rotary_embeddings=self.use_rotary_embeddings,
+ )
+ self.self_attn_layer_norm = BertLayerNorm(self.embed_dim)
+
+ self.fc1 = nn.Linear(self.embed_dim, self.ffn_embed_dim)
+ self.fc2 = nn.Linear(self.ffn_embed_dim, self.embed_dim)
+
+ self.final_layer_norm = BertLayerNorm(self.embed_dim)
+
+ def forward(
+ self, x, self_attn_mask=None, self_attn_padding_mask=None, need_head_weights=False
+ ):
+ residual = x
+ x = self.self_attn_layer_norm(x)
+ x, attn = self.self_attn(
+ query=x,
+ key=x,
+ value=x,
+ key_padding_mask=self_attn_padding_mask,
+ need_weights=True,
+ need_head_weights=need_head_weights,
+ attn_mask=self_attn_mask,
+ )
+ x = residual + x
+
+ residual = x
+ x = self.final_layer_norm(x)
+ x = gelu(self.fc1(x))
+ x = self.fc2(x)
+ x = residual + x
+
+ return x, attn
+
+
+class AxialTransformerLayer(nn.Module):
+ """Implements an Axial MSA Transformer block."""
+
+ def __init__(
+ self,
+ embedding_dim: int = 768,
+ ffn_embedding_dim: int = 3072,
+ num_attention_heads: int = 8,
+ dropout: float = 0.1,
+ attention_dropout: float = 0.1,
+ activation_dropout: float = 0.1,
+ max_tokens_per_msa: int = 2**14,
+ deactivate_col_attention: bool = False,
+ tranception_attention: bool = False,
+ num_targets: int = 1,
+ ) -> None:
+ super().__init__()
+
+ # Initialize parameters
+ self.embedding_dim = embedding_dim
+ self.dropout_prob = dropout
+ self.deactivate_col_attention = deactivate_col_attention
+
+ row_self_attention = RowSelfAttention(
+ embedding_dim,
+ num_attention_heads,
+ dropout=dropout,
+ max_tokens_per_msa=max_tokens_per_msa,
+ tranception_attention=tranception_attention,
+ num_targets=num_targets,
+ )
+
+ if not self.deactivate_col_attention:
+ column_self_attention = ColumnSelfAttention(
+ embedding_dim,
+ num_attention_heads,
+ dropout=dropout,
+ max_tokens_per_msa=max_tokens_per_msa,
+ )
+ else:
+ print("No column attention in the underlying axial transformer module")
+
+ feed_forward_layer = FeedForwardNetwork(
+ embedding_dim,
+ ffn_embedding_dim,
+ activation_dropout=activation_dropout,
+ max_tokens_per_msa=max_tokens_per_msa,
+ )
+
+ self.row_self_attention = self.build_residual(row_self_attention)
+ if not self.deactivate_col_attention: self.column_self_attention = self.build_residual(column_self_attention)
+ self.feed_forward_layer = self.build_residual(feed_forward_layer)
+
+ def build_residual(self, layer: nn.Module):
+ return NormalizedResidualBlock(
+ layer,
+ self.embedding_dim,
+ self.dropout_prob,
+ )
+
+ def forward(
+ self,
+ x: torch.Tensor,
+ self_attn_mask: Optional[torch.Tensor] = None,
+ self_attn_padding_mask: Optional[torch.Tensor] = None,
+ need_head_weights: bool = False,
+ ):
+ """
+ LayerNorm is applied either before or after the self-attention/ffn
+ modules similar to the original Transformer implementation.
+ """
+ x, row_attn = self.row_self_attention(
+ x,
+ self_attn_mask=self_attn_mask,
+ self_attn_padding_mask=self_attn_padding_mask,
+ )
+
+ if not self.deactivate_col_attention:
+ x, column_attn = self.column_self_attention(
+ x,
+ self_attn_mask=self_attn_mask,
+ self_attn_padding_mask=self_attn_padding_mask,
+ )
+ else:
+ column_attn = None
+
+ x = self.feed_forward_layer(x)
+ if need_head_weights:
+ return x, column_attn, row_attn
+ else:
+ return x
+
+
+class LearnedPositionalEmbedding(nn.Embedding):
+ """
+ This module learns positional embeddings up to a fixed maximum size.
+ Padding ids are ignored by either offsetting based on padding_idx
+ or by setting padding_idx to None and ensuring that the appropriate
+ position ids are passed to the forward function.
+ """
+
+ def __init__(self, num_embeddings: int, embedding_dim: int, padding_idx: int):
+ if padding_idx is not None:
+ num_embeddings_ = num_embeddings + padding_idx + 1
+ else:
+ num_embeddings_ = num_embeddings
+ super().__init__(num_embeddings_, embedding_dim, padding_idx)
+ self.max_positions = num_embeddings
+
+ def forward(self, input: torch.Tensor):
+ """Input is expected to be of size [bsz x seqlen]."""
+ if input.size(1) > self.max_positions:
+ raise ValueError(
+ f"Sequence length {input.size(1)} above maximum "
+ f" sequence length of {self.max_positions}"
+ )
+ mask = input.ne(self.padding_idx).int()
+ positions = (torch.cumsum(mask, dim=1).type_as(mask) * mask).long() + self.padding_idx
+ return F.embedding(
+ positions,
+ self.weight,
+ self.padding_idx,
+ self.max_norm,
+ self.norm_type,
+ self.scale_grad_by_freq,
+ self.sparse,
+ )
+
+
+class SinusoidalPositionalEmbedding(nn.Module):
+ def __init__(self, embed_dim, padding_idx, learned=False):
+ super().__init__()
+ self.embed_dim = embed_dim
+ self.padding_idx = padding_idx
+ self.register_buffer("_float_tensor", torch.FloatTensor(1))
+ self.weights = None
+
+ def forward(self, x):
+ bsz, seq_len = x.shape
+ max_pos = self.padding_idx + 1 + seq_len
+ if self.weights is None or max_pos > self.weights.size(0):
+ self.weights = self.get_embedding(max_pos)
+ self.weights = self.weights.type_as(self._float_tensor)
+
+ positions = self.make_positions(x)
+ return self.weights.index_select(0, positions.view(-1)).view(bsz, seq_len, -1).detach()
+
+ def make_positions(self, x):
+ mask = x.ne(self.padding_idx)
+ range_buf = torch.arange(x.size(1), device=x.device).expand_as(x) + self.padding_idx + 1
+ positions = range_buf.expand_as(x)
+ return positions * mask.long() + self.padding_idx * (1 - mask.long())
+
+ def get_embedding(self, num_embeddings):
+ half_dim = self.embed_dim // 2
+ emb = math.log(10000) / (half_dim - 1)
+ emb = torch.exp(torch.arange(half_dim, dtype=torch.float) * -emb)
+ emb = torch.arange(num_embeddings, dtype=torch.float).unsqueeze(1) * emb.unsqueeze(0)
+ emb = torch.cat([torch.sin(emb), torch.cos(emb)], dim=1).view(num_embeddings, -1)
+ if self.embed_dim % 2 == 1:
+ # zero pad
+ emb = torch.cat([emb, torch.zeros(num_embeddings, 1)], dim=1)
+ if self.padding_idx is not None:
+ emb[self.padding_idx, :] = 0
+ return emb
+
+
+class RobertaLMHead(nn.Module):
+ """Head for masked language modeling."""
+
+ def __init__(self, embed_dim, output_dim, weight):
+ super().__init__()
+ self.dense = nn.Linear(embed_dim, embed_dim)
+ self.layer_norm = ESM1bLayerNorm(embed_dim)
+ self.weight = weight
+ self.bias = nn.Parameter(torch.zeros(output_dim))
+
+ def forward(self, features):
+ x = self.dense(features)
+ x = gelu(x)
+ x = self.layer_norm(x)
+ # project back to size of vocabulary with bias
+ x = F.linear(x, self.weight) + self.bias
+ return x
+
+
+class ContactPredictionHead(nn.Module):
+ """Performs symmetrization, apc, and computes a logistic regression on the output features"""
+
+ def __init__(
+ self,
+ in_features: int,
+ prepend_bos: bool,
+ append_eos: bool,
+ bias=True,
+ eos_idx: Optional[int] = None,
+ ):
+ super().__init__()
+ self.in_features = in_features
+ self.prepend_bos = prepend_bos
+ self.append_eos = append_eos
+ if append_eos and eos_idx is None:
+ raise ValueError("Using an alphabet with eos token, but no eos token was passed in.")
+ self.eos_idx = eos_idx
+ self.regression = nn.Linear(in_features, 1, bias)
+ self.activation = nn.Sigmoid()
+
+ def forward(self, tokens, attentions):
+ # remove eos token attentions
+ if self.append_eos:
+ eos_mask = tokens.ne(self.eos_idx).to(attentions)
+ eos_mask = eos_mask.unsqueeze(1) * eos_mask.unsqueeze(2)
+ attentions = attentions * eos_mask[:, None, None, :, :]
+ attentions = attentions[..., :-1, :-1]
+ # remove cls token attentions
+ if self.prepend_bos:
+ attentions = attentions[..., 1:, 1:]
+ batch_size, layers, heads, seqlen, _ = attentions.size()
+ attentions = attentions.view(batch_size, layers * heads, seqlen, seqlen)
+
+ # features: B x C x T x T
+ attentions = attentions.to(
+ self.regression.weight.device
+ ) # attentions always float32, may need to convert to float16
+ attentions = apc(symmetrize(attentions))
+ attentions = attentions.permute(0, 2, 3, 1)
+ return self.activation(self.regression(attentions).squeeze(3))
+
+
+class NormalizedResidualBlock(nn.Module):
+ def __init__(
+ self,
+ layer: nn.Module,
+ embedding_dim: int,
+ dropout: float = 0.1,
+ ):
+ super().__init__()
+ self.embedding_dim = embedding_dim
+
+ self.layer = layer
+ self.dropout_module = nn.Dropout(
+ dropout,
+ )
+ self.layer_norm = ESM1bLayerNorm(self.embedding_dim)
+
+ def forward(self, x, *args, **kwargs):
+ residual = x
+ x = self.layer_norm(x)
+ outputs = self.layer(x, *args, **kwargs)
+ if isinstance(outputs, tuple):
+ x, *out = outputs
+ else:
+ x = outputs
+ out = None
+
+ x = self.dropout_module(x)
+ x = residual + x
+
+ if out is not None:
+ return (x,) + tuple(out)
+ else:
+ return x
+
+
+class FeedForwardNetwork(nn.Module):
+ def __init__(
+ self,
+ embedding_dim: int,
+ ffn_embedding_dim: int,
+ activation_dropout: float = 0.1,
+ max_tokens_per_msa: int = 2**14,
+ ):
+ super().__init__()
+ self.embedding_dim = embedding_dim
+ self.ffn_embedding_dim = ffn_embedding_dim
+ self.max_tokens_per_msa = max_tokens_per_msa
+ self.activation_fn = nn.GELU()
+ self.activation_dropout_module = nn.Dropout(
+ activation_dropout,
+ )
+ self.fc1 = nn.Linear(embedding_dim, ffn_embedding_dim)
+ self.fc2 = nn.Linear(ffn_embedding_dim, embedding_dim)
+
+ def forward(self, x):
+ x = self.activation_fn(self.fc1(x))
+ x = self.activation_dropout_module(x)
+ x = self.fc2(x)
+ return x
diff --git a/proteingym/baselines/esm/esm/multihead_attention.py b/proteingym/baselines/esm/esm/multihead_attention.py
new file mode 100644
index 0000000..6092903
--- /dev/null
+++ b/proteingym/baselines/esm/esm/multihead_attention.py
@@ -0,0 +1,508 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import math
+from typing import Dict, Optional, Tuple
+
+import torch
+import torch.nn.functional as F
+from torch import Tensor, nn
+from torch.nn import Parameter
+from .rotary_embedding import RotaryEmbedding
+
+import uuid
+
+
+def utils_softmax(x, dim: int, onnx_trace: bool = False):
+ if onnx_trace:
+ return F.softmax(x.float(), dim=dim)
+ else:
+ return F.softmax(x, dim=dim, dtype=torch.float32)
+
+
+class FairseqIncrementalState(object):
+ def __init__(self, *args, **kwargs):
+ super().__init__(*args, **kwargs)
+ self.init_incremental_state()
+
+ def init_incremental_state(self):
+ self._incremental_state_id = str(uuid.uuid4())
+
+ def _get_full_incremental_state_key(self, key: str) -> str:
+ return "{}.{}".format(self._incremental_state_id, key)
+
+ def get_incremental_state(
+ self,
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]],
+ key: str,
+ ) -> Optional[Dict[str, Optional[Tensor]]]:
+ """Helper for getting incremental state for an nn.Module."""
+ full_key = self._get_full_incremental_state_key(key)
+ if incremental_state is None or full_key not in incremental_state:
+ return None
+ return incremental_state[full_key]
+
+ def set_incremental_state(
+ self,
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]],
+ key: str,
+ value: Dict[str, Optional[Tensor]],
+ ) -> Optional[Dict[str, Dict[str, Optional[Tensor]]]]:
+ """Helper for setting incremental state for an nn.Module."""
+ if incremental_state is not None:
+ full_key = self._get_full_incremental_state_key(key)
+ incremental_state[full_key] = value
+ return incremental_state
+
+
+def with_incremental_state(cls):
+ cls.__bases__ = (FairseqIncrementalState,) + tuple(
+ b for b in cls.__bases__ if b != FairseqIncrementalState
+ )
+ return cls
+
+
+@with_incremental_state
+class MultiheadAttention(nn.Module):
+ """Multi-headed attention.
+
+ See "Attention Is All You Need" for more details.
+ """
+
+ def __init__(
+ self,
+ embed_dim,
+ num_heads,
+ kdim=None,
+ vdim=None,
+ dropout=0.0,
+ bias=True,
+ add_bias_kv: bool = False,
+ add_zero_attn: bool = False,
+ self_attention: bool = False,
+ encoder_decoder_attention: bool = False,
+ use_rotary_embeddings: bool = False,
+ ):
+ super().__init__()
+ self.embed_dim = embed_dim
+ self.kdim = kdim if kdim is not None else embed_dim
+ self.vdim = vdim if vdim is not None else embed_dim
+ self.qkv_same_dim = self.kdim == embed_dim and self.vdim == embed_dim
+
+ self.num_heads = num_heads
+ self.dropout = dropout
+ self.head_dim = embed_dim // num_heads
+ assert (
+ self.head_dim * num_heads == self.embed_dim
+ ), "embed_dim must be divisible by num_heads"
+ self.scaling = self.head_dim**-0.5
+
+ self.self_attention = self_attention
+ self.encoder_decoder_attention = encoder_decoder_attention
+
+ assert not self.self_attention or self.qkv_same_dim, (
+ "Self-attention requires query, key and " "value to be of the same size"
+ )
+
+ self.k_proj = nn.Linear(self.kdim, embed_dim, bias=bias)
+ self.v_proj = nn.Linear(self.vdim, embed_dim, bias=bias)
+ self.q_proj = nn.Linear(embed_dim, embed_dim, bias=bias)
+
+ self.out_proj = nn.Linear(embed_dim, embed_dim, bias=bias)
+
+ if add_bias_kv:
+ self.bias_k = Parameter(torch.Tensor(1, 1, embed_dim))
+ self.bias_v = Parameter(torch.Tensor(1, 1, embed_dim))
+ else:
+ self.bias_k = self.bias_v = None
+
+ self.add_zero_attn = add_zero_attn
+
+ self.reset_parameters()
+
+ self.onnx_trace = False
+ self.rot_emb = None
+ if use_rotary_embeddings:
+ self.rot_emb = RotaryEmbedding(dim=self.head_dim)
+
+ self.enable_torch_version = False
+ if hasattr(F, "multi_head_attention_forward"):
+ self.enable_torch_version = True
+ else:
+ self.enable_torch_version = False
+
+ def prepare_for_onnx_export_(self):
+ self.onnx_trace = True
+
+ def reset_parameters(self):
+ if self.qkv_same_dim:
+ # Empirically observed the convergence to be much better with
+ # the scaled initialization
+ nn.init.xavier_uniform_(self.k_proj.weight, gain=1 / math.sqrt(2))
+ nn.init.xavier_uniform_(self.v_proj.weight, gain=1 / math.sqrt(2))
+ nn.init.xavier_uniform_(self.q_proj.weight, gain=1 / math.sqrt(2))
+ else:
+ nn.init.xavier_uniform_(self.k_proj.weight)
+ nn.init.xavier_uniform_(self.v_proj.weight)
+ nn.init.xavier_uniform_(self.q_proj.weight)
+
+ nn.init.xavier_uniform_(self.out_proj.weight)
+ if self.out_proj.bias is not None:
+ nn.init.constant_(self.out_proj.bias, 0.0)
+ if self.bias_k is not None:
+ nn.init.xavier_normal_(self.bias_k)
+ if self.bias_v is not None:
+ nn.init.xavier_normal_(self.bias_v)
+
+ def forward(
+ self,
+ query,
+ key: Optional[Tensor],
+ value: Optional[Tensor],
+ key_padding_mask: Optional[Tensor] = None,
+ incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]] = None,
+ need_weights: bool = True,
+ static_kv: bool = False,
+ attn_mask: Optional[Tensor] = None,
+ before_softmax: bool = False,
+ need_head_weights: bool = False,
+ ) -> Tuple[Tensor, Optional[Tensor]]:
+ """Input shape: Time x Batch x Channel
+
+ Args:
+ key_padding_mask (ByteTensor, optional): mask to exclude
+ keys that are pads, of shape `(batch, src_len)`, where
+ padding elements are indicated by 1s.
+ need_weights (bool, optional): return the attention weights,
+ averaged over heads (default: False).
+ attn_mask (ByteTensor, optional): typically used to
+ implement causal attention, where the mask prevents the
+ attention from looking forward in time (default: None).
+ before_softmax (bool, optional): return the raw attention
+ weights and values before the attention softmax.
+ need_head_weights (bool, optional): return the attention
+ weights for each head. Implies *need_weights*. Default:
+ return the average attention weights over all heads.
+ """
+ if need_head_weights:
+ need_weights = True
+
+ tgt_len, bsz, embed_dim = query.size()
+ assert embed_dim == self.embed_dim
+ assert list(query.size()) == [tgt_len, bsz, embed_dim]
+
+ if (
+ not self.rot_emb
+ and self.enable_torch_version
+ and not self.onnx_trace
+ and incremental_state is None
+ and not static_kv
+ # A workaround for quantization to work. Otherwise JIT compilation
+ # treats bias in linear module as method.
+ and not torch.jit.is_scripting()
+ and not need_head_weights
+ ):
+ assert key is not None and value is not None
+ return F.multi_head_attention_forward(
+ query,
+ key,
+ value,
+ self.embed_dim,
+ self.num_heads,
+ torch.empty([0]),
+ torch.cat((self.q_proj.bias, self.k_proj.bias, self.v_proj.bias)),
+ self.bias_k,
+ self.bias_v,
+ self.add_zero_attn,
+ self.dropout,
+ self.out_proj.weight,
+ self.out_proj.bias,
+ self.training,
+ key_padding_mask,
+ need_weights,
+ attn_mask,
+ use_separate_proj_weight=True,
+ q_proj_weight=self.q_proj.weight,
+ k_proj_weight=self.k_proj.weight,
+ v_proj_weight=self.v_proj.weight,
+ )
+ if incremental_state is not None:
+ saved_state = self._get_input_buffer(incremental_state)
+ if saved_state is not None and "prev_key" in saved_state:
+ # previous time steps are cached - no need to recompute
+ # key and value if they are static
+ if static_kv:
+ assert self.encoder_decoder_attention and not self.self_attention
+ key = value = None
+ else:
+ saved_state = None
+
+ if self.self_attention:
+ q = self.q_proj(query)
+ k = self.k_proj(query)
+ v = self.v_proj(query)
+ elif self.encoder_decoder_attention:
+ # encoder-decoder attention
+ q = self.q_proj(query)
+ if key is None:
+ assert value is None
+ k = v = None
+ else:
+ k = self.k_proj(key)
+ v = self.v_proj(key)
+
+ else:
+ assert key is not None and value is not None
+ q = self.q_proj(query)
+ k = self.k_proj(key)
+ v = self.v_proj(value)
+ q *= self.scaling
+
+ if self.bias_k is not None:
+ assert self.bias_v is not None
+ k = torch.cat([k, self.bias_k.repeat(1, bsz, 1)])
+ v = torch.cat([v, self.bias_v.repeat(1, bsz, 1)])
+ if attn_mask is not None:
+ attn_mask = torch.cat(
+ [attn_mask, attn_mask.new_zeros(attn_mask.size(0), 1)], dim=1
+ )
+ if key_padding_mask is not None:
+ key_padding_mask = torch.cat(
+ [
+ key_padding_mask,
+ key_padding_mask.new_zeros(key_padding_mask.size(0), 1),
+ ],
+ dim=1,
+ )
+
+ q = q.contiguous().view(tgt_len, bsz * self.num_heads, self.head_dim).transpose(0, 1)
+ if k is not None:
+ k = k.contiguous().view(-1, bsz * self.num_heads, self.head_dim).transpose(0, 1)
+ if v is not None:
+ v = v.contiguous().view(-1, bsz * self.num_heads, self.head_dim).transpose(0, 1)
+
+ if saved_state is not None:
+ # saved states are stored with shape (bsz, num_heads, seq_len, head_dim)
+ if "prev_key" in saved_state:
+ _prev_key = saved_state["prev_key"]
+ assert _prev_key is not None
+ prev_key = _prev_key.view(bsz * self.num_heads, -1, self.head_dim)
+ if static_kv:
+ k = prev_key
+ else:
+ assert k is not None
+ k = torch.cat([prev_key, k], dim=1)
+ if "prev_value" in saved_state:
+ _prev_value = saved_state["prev_value"]
+ assert _prev_value is not None
+ prev_value = _prev_value.view(bsz * self.num_heads, -1, self.head_dim)
+ if static_kv:
+ v = prev_value
+ else:
+ assert v is not None
+ v = torch.cat([prev_value, v], dim=1)
+ prev_key_padding_mask: Optional[Tensor] = None
+ if "prev_key_padding_mask" in saved_state:
+ prev_key_padding_mask = saved_state["prev_key_padding_mask"]
+ assert k is not None and v is not None
+ key_padding_mask = MultiheadAttention._append_prev_key_padding_mask(
+ key_padding_mask=key_padding_mask,
+ prev_key_padding_mask=prev_key_padding_mask,
+ batch_size=bsz,
+ src_len=k.size(1),
+ static_kv=static_kv,
+ )
+
+ saved_state["prev_key"] = k.view(bsz, self.num_heads, -1, self.head_dim)
+ saved_state["prev_value"] = v.view(bsz, self.num_heads, -1, self.head_dim)
+ saved_state["prev_key_padding_mask"] = key_padding_mask
+ # In this branch incremental_state is never None
+ assert incremental_state is not None
+ incremental_state = self._set_input_buffer(incremental_state, saved_state)
+ assert k is not None
+ src_len = k.size(1)
+
+ # This is part of a workaround to get around fork/join parallelism
+ # not supporting Optional types.
+ if key_padding_mask is not None and key_padding_mask.dim() == 0:
+ key_padding_mask = None
+
+ if key_padding_mask is not None:
+ assert key_padding_mask.size(0) == bsz
+ assert key_padding_mask.size(1) == src_len
+
+ if self.add_zero_attn:
+ assert v is not None
+ src_len += 1
+ k = torch.cat([k, k.new_zeros((k.size(0), 1) + k.size()[2:])], dim=1)
+ v = torch.cat([v, v.new_zeros((v.size(0), 1) + v.size()[2:])], dim=1)
+ if attn_mask is not None:
+ attn_mask = torch.cat(
+ [attn_mask, attn_mask.new_zeros(attn_mask.size(0), 1)], dim=1
+ )
+ if key_padding_mask is not None:
+ key_padding_mask = torch.cat(
+ [
+ key_padding_mask,
+ torch.zeros(key_padding_mask.size(0), 1).type_as(key_padding_mask),
+ ],
+ dim=1,
+ )
+
+ if self.rot_emb:
+ q, k = self.rot_emb(q, k)
+
+ attn_weights = torch.bmm(q, k.transpose(1, 2))
+ attn_weights = MultiheadAttention.apply_sparse_mask(attn_weights, tgt_len, src_len, bsz)
+
+ assert list(attn_weights.size()) == [bsz * self.num_heads, tgt_len, src_len]
+
+ if attn_mask is not None:
+ attn_mask = attn_mask.unsqueeze(0)
+ if self.onnx_trace:
+ attn_mask = attn_mask.repeat(attn_weights.size(0), 1, 1)
+ attn_weights += attn_mask
+
+ if key_padding_mask is not None:
+ # don't attend to padding symbols
+ attn_weights = attn_weights.view(bsz, self.num_heads, tgt_len, src_len)
+ attn_weights = attn_weights.masked_fill(
+ key_padding_mask.unsqueeze(1).unsqueeze(2).to(torch.bool), float("-inf")
+ )
+ attn_weights = attn_weights.view(bsz * self.num_heads, tgt_len, src_len)
+
+ if before_softmax:
+ return attn_weights, v
+
+ attn_weights_float = utils_softmax(attn_weights, dim=-1, onnx_trace=self.onnx_trace)
+ attn_weights = attn_weights_float.type_as(attn_weights)
+ attn_probs = F.dropout(
+ attn_weights_float.type_as(attn_weights),
+ p=self.dropout,
+ training=self.training,
+ )
+ assert v is not None
+ attn = torch.bmm(attn_probs, v)
+ assert list(attn.size()) == [bsz * self.num_heads, tgt_len, self.head_dim]
+ if self.onnx_trace and attn.size(1) == 1:
+ # when ONNX tracing a single decoder step (sequence length == 1)
+ # the transpose is a no-op copy before view, thus unnecessary
+ attn = attn.contiguous().view(tgt_len, bsz, embed_dim)
+ else:
+ attn = attn.transpose(0, 1).contiguous().view(tgt_len, bsz, embed_dim)
+ attn = self.out_proj(attn)
+ attn_weights: Optional[Tensor] = None
+ if need_weights:
+ attn_weights = attn_weights_float.view(
+ bsz, self.num_heads, tgt_len, src_len
+ ).type_as(attn).transpose(1, 0)
+ if not need_head_weights:
+ # average attention weights over heads
+ attn_weights = attn_weights.mean(dim=0)
+
+ return attn, attn_weights
+
+ @staticmethod
+ def _append_prev_key_padding_mask(
+ key_padding_mask: Optional[Tensor],
+ prev_key_padding_mask: Optional[Tensor],
+ batch_size: int,
+ src_len: int,
+ static_kv: bool,
+ ) -> Optional[Tensor]:
+ # saved key padding masks have shape (bsz, seq_len)
+ if prev_key_padding_mask is not None and static_kv:
+ new_key_padding_mask = prev_key_padding_mask
+ elif prev_key_padding_mask is not None and key_padding_mask is not None:
+ new_key_padding_mask = torch.cat(
+ [prev_key_padding_mask.float(), key_padding_mask.float()], dim=1
+ )
+ # During incremental decoding, as the padding token enters and
+ # leaves the frame, there will be a time when prev or current
+ # is None
+ elif prev_key_padding_mask is not None:
+ filler = torch.zeros(
+ (batch_size, src_len - prev_key_padding_mask.size(1)),
+ device=prev_key_padding_mask.device,
+ )
+ new_key_padding_mask = torch.cat(
+ [prev_key_padding_mask.float(), filler.float()], dim=1
+ )
+ elif key_padding_mask is not None:
+ filler = torch.zeros(
+ (batch_size, src_len - key_padding_mask.size(1)),
+ device=key_padding_mask.device,
+ )
+ new_key_padding_mask = torch.cat([filler.float(), key_padding_mask.float()], dim=1)
+ else:
+ new_key_padding_mask = prev_key_padding_mask
+ return new_key_padding_mask
+
+ @torch.jit.export
+ def reorder_incremental_state(
+ self, incremental_state: Dict[str, Dict[str, Optional[Tensor]]], new_order: Tensor
+ ):
+ """Reorder buffered internal state (for incremental generation)."""
+ input_buffer = self._get_input_buffer(incremental_state)
+ if input_buffer is not None:
+ for k in input_buffer.keys():
+ input_buffer_k = input_buffer[k]
+ if input_buffer_k is not None:
+ if self.encoder_decoder_attention and input_buffer_k.size(0) == new_order.size(
+ 0
+ ):
+ break
+ input_buffer[k] = input_buffer_k.index_select(0, new_order)
+ incremental_state = self._set_input_buffer(incremental_state, input_buffer)
+ return incremental_state
+
+ def _get_input_buffer(
+ self, incremental_state: Optional[Dict[str, Dict[str, Optional[Tensor]]]]
+ ) -> Dict[str, Optional[Tensor]]:
+ result = self.get_incremental_state(incremental_state, "attn_state")
+ if result is not None:
+ return result
+ else:
+ empty_result: Dict[str, Optional[Tensor]] = {}
+ return empty_result
+
+ def _set_input_buffer(
+ self,
+ incremental_state: Dict[str, Dict[str, Optional[Tensor]]],
+ buffer: Dict[str, Optional[Tensor]],
+ ):
+ return self.set_incremental_state(incremental_state, "attn_state", buffer)
+
+ def apply_sparse_mask(attn_weights, tgt_len: int, src_len: int, bsz: int):
+ return attn_weights
+
+ def upgrade_state_dict_named(self, state_dict, name):
+ prefix = name + "." if name != "" else ""
+ items_to_add = {}
+ keys_to_remove = []
+ for k in state_dict.keys():
+ if k.endswith(prefix + "in_proj_weight"):
+ # in_proj_weight used to be q + k + v with same dimensions
+ dim = int(state_dict[k].shape[0] / 3)
+ items_to_add[prefix + "q_proj.weight"] = state_dict[k][:dim]
+ items_to_add[prefix + "k_proj.weight"] = state_dict[k][dim : 2 * dim]
+ items_to_add[prefix + "v_proj.weight"] = state_dict[k][2 * dim :]
+
+ keys_to_remove.append(k)
+
+ k_bias = prefix + "in_proj_bias"
+ if k_bias in state_dict.keys():
+ dim = int(state_dict[k].shape[0] / 3)
+ items_to_add[prefix + "q_proj.bias"] = state_dict[k_bias][:dim]
+ items_to_add[prefix + "k_proj.bias"] = state_dict[k_bias][dim : 2 * dim]
+ items_to_add[prefix + "v_proj.bias"] = state_dict[k_bias][2 * dim :]
+
+ keys_to_remove.append(prefix + "in_proj_bias")
+
+ for k in keys_to_remove:
+ del state_dict[k]
+
+ for key, value in items_to_add.items():
+ state_dict[key] = value
diff --git a/proteingym/baselines/esm/esm/pretrained.py b/proteingym/baselines/esm/esm/pretrained.py
new file mode 100644
index 0000000..c39c0ad
--- /dev/null
+++ b/proteingym/baselines/esm/esm/pretrained.py
@@ -0,0 +1,417 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import re
+import urllib
+import warnings
+from argparse import Namespace
+from pathlib import Path
+
+import torch
+# sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.esm import esm
+from .model.esm2 import ESM2
+
+
+def _has_regression_weights(model_name):
+ """Return whether we expect / require regression weights;
+ Right now that is all models except ESM-1v and ESM-IF"""
+ return not ("esm1v" in model_name or "esm_if" in model_name)
+
+
+def load_model_and_alphabet(model_name):
+ if model_name.endswith(".pt"): # treat as filepath
+ return load_model_and_alphabet_local(model_name)
+ else:
+ return load_model_and_alphabet_hub(model_name)
+
+
+def load_hub_workaround(url):
+ try:
+ data = torch.hub.load_state_dict_from_url(url, progress=False, map_location="cpu")
+ except RuntimeError:
+ # Pytorch version issue - see https://github.com/pytorch/pytorch/issues/43106
+ fn = Path(url).name
+ data = torch.load(
+ f"{torch.hub.get_dir()}/checkpoints/{fn}",
+ map_location="cpu",
+ )
+ except urllib.error.HTTPError as e:
+ raise Exception(f"Could not load {url}, check if you specified a correct model name?")
+ return data
+
+
+def load_regression_hub(model_name):
+ url = f"https://dl.fbaipublicfiles.com/fair-esm/regression/{model_name}-contact-regression.pt"
+ regression_data = load_hub_workaround(url)
+ return regression_data
+
+
+def _download_model_and_regression_data(model_name):
+ url = f"https://dl.fbaipublicfiles.com/fair-esm/models/{model_name}.pt"
+ model_data = load_hub_workaround(url)
+ if _has_regression_weights(model_name):
+ regression_data = load_regression_hub(model_name)
+ else:
+ regression_data = None
+ return model_data, regression_data
+
+
+def load_model_and_alphabet_hub(model_name):
+ model_data, regression_data = _download_model_and_regression_data(model_name)
+ return load_model_and_alphabet_core(model_name, model_data, regression_data)
+
+
+def load_model_and_alphabet_local(model_location):
+ """Load from local path. The regression weights need to be co-located"""
+ model_location = Path(model_location)
+ model_data = torch.load(str(model_location), map_location="cpu")
+ model_name = model_location.stem
+ #if _has_regression_weights(model_name):
+ # regression_location = str(model_location.with_suffix("")) + "-contact-regression.pt"
+ # regression_data = torch.load(regression_location, map_location="cpu")
+ #else:
+ regression_data = None
+ return load_model_and_alphabet_core(model_name, model_data, regression_data)
+
+
+def has_emb_layer_norm_before(model_state):
+ """Determine whether layer norm needs to be applied before the encoder"""
+ return any(k.startswith("emb_layer_norm_before") for k, param in model_state.items())
+
+
+def _load_model_and_alphabet_core_v1(model_data):
+ from proteingym.baselines.esm import esm # since esm.inverse_folding is imported below, you actually have to re-import esm here
+ alphabet = esm.Alphabet.from_architecture(model_data["args"].arch)
+ if model_data["args"].arch == "roberta_large":
+ # upgrade state dict
+ pra = lambda s: "".join(s.split("encoder_")[1:] if "encoder" in s else s)
+ prs1 = lambda s: "".join(s.split("encoder.")[1:] if "encoder" in s else s)
+ prs2 = lambda s: "".join(
+ s.split("sentence_encoder.")[1:] if "sentence_encoder" in s else s
+ )
+ model_args = {pra(arg[0]): arg[1] for arg in vars(model_data["args"]).items()}
+ model_state = {prs1(prs2(arg[0])): arg[1] for arg in model_data["model"].items()}
+ model_state["embed_tokens.weight"][alphabet.mask_idx].zero_() # For token drop
+ model_args["emb_layer_norm_before"] = has_emb_layer_norm_before(model_state)
+ model_type = esm.ProteinBertModel
+
+ elif model_data["args"].arch == "protein_bert_base":
+
+ # upgrade state dict
+ pra = lambda s: "".join(s.split("decoder_")[1:] if "decoder" in s else s)
+ prs = lambda s: "".join(s.split("decoder.")[1:] if "decoder" in s else s)
+ model_args = {pra(arg[0]): arg[1] for arg in vars(model_data["args"]).items()}
+ model_state = {prs(arg[0]): arg[1] for arg in model_data["model"].items()}
+ model_type = esm.ProteinBertModel
+ elif model_data["args"].arch == "msa_transformer":
+
+ # upgrade state dict
+ pra = lambda s: "".join(s.split("encoder_")[1:] if "encoder" in s else s)
+ prs1 = lambda s: "".join(s.split("encoder.")[1:] if "encoder" in s else s)
+ prs2 = lambda s: "".join(
+ s.split("sentence_encoder.")[1:] if "sentence_encoder" in s else s
+ )
+ prs3 = lambda s: s.replace("row", "column") if "row" in s else s.replace("column", "row")
+ model_args = {pra(arg[0]): arg[1] for arg in vars(model_data["args"]).items()}
+ model_state = {prs1(prs2(prs3(arg[0]))): arg[1] for arg in model_data["model"].items()}
+ if model_args.get("embed_positions_msa", False):
+ emb_dim = model_state["msa_position_embedding"].size(-1)
+ model_args["embed_positions_msa_dim"] = emb_dim # initial release, bug: emb_dim==1
+
+ model_type = esm.MSATransformer
+
+ elif "invariant_gvp" in model_data["args"].arch:
+ import esm.inverse_folding
+
+ model_type = esm.inverse_folding.gvp_transformer.GVPTransformerModel
+ model_args = vars(model_data["args"]) # convert Namespace -> dict
+
+ def update_name(s):
+ # Map the module names in checkpoints trained with internal code to
+ # the updated module names in open source code
+ s = s.replace("W_v", "embed_graph.embed_node")
+ s = s.replace("W_e", "embed_graph.embed_edge")
+ s = s.replace("embed_scores.0", "embed_confidence")
+ s = s.replace("embed_score.", "embed_graph.embed_confidence.")
+ s = s.replace("seq_logits_projection.", "")
+ s = s.replace("embed_ingraham_features", "embed_dihedrals")
+ s = s.replace("embed_gvp_in_local_frame.0", "embed_gvp_output")
+ s = s.replace("embed_features_in_local_frame.0", "embed_gvp_input_features")
+ return s
+
+ model_state = {
+ update_name(sname): svalue
+ for sname, svalue in model_data["model"].items()
+ if "version" not in sname
+ }
+
+ else:
+ raise ValueError("Unknown architecture selected")
+
+ model = model_type(
+ Namespace(**model_args),
+ alphabet,
+ )
+
+ return model, alphabet, model_state
+
+
+def _load_model_and_alphabet_core_v2(model_data):
+ def upgrade_state_dict(state_dict):
+ """Removes prefixes 'model.encoder.sentence_encoder.' and 'model.encoder.'."""
+ prefixes = ["encoder.sentence_encoder.", "encoder."]
+ pattern = re.compile("^" + "|".join(prefixes))
+ state_dict = {pattern.sub("", name): param for name, param in state_dict.items()}
+ return state_dict
+
+ cfg = model_data["cfg"]["model"]
+ state_dict = model_data["model"]
+ state_dict = upgrade_state_dict(state_dict)
+ alphabet = esm.data.Alphabet.from_architecture("ESM-1b")
+ model = ESM2(
+ num_layers=cfg.encoder_layers,
+ embed_dim=cfg.encoder_embed_dim,
+ attention_heads=cfg.encoder_attention_heads,
+ alphabet=alphabet,
+ token_dropout=cfg.token_dropout,
+ )
+ return model, alphabet, state_dict
+
+
+def load_model_and_alphabet_core(model_name, model_data, regression_data=None):
+ if regression_data is not None:
+ model_data["model"].update(regression_data["model"])
+ if model_name.startswith("esm2"):
+ model, alphabet, model_state = _load_model_and_alphabet_core_v2(model_data)
+ else:
+ model, alphabet, model_state = _load_model_and_alphabet_core_v1(model_data)
+
+ expected_keys = set(model.state_dict().keys())
+ found_keys = set(model_state.keys())
+
+ if regression_data is None:
+ expected_missing = {"contact_head.regression.weight", "contact_head.regression.bias"}
+ error_msgs = []
+ missing = (expected_keys - found_keys) - expected_missing
+ if missing:
+ error_msgs.append(f"Missing key(s) in state_dict: {missing}.")
+ unexpected = found_keys - expected_keys
+ if unexpected:
+ error_msgs.append(f"Unexpected key(s) in state_dict: {unexpected}.")
+
+ if error_msgs:
+ raise RuntimeError(
+ "Error(s) in loading state_dict for {}:\n\t{}".format(
+ model.__class__.__name__, "\n\t".join(error_msgs)
+ )
+ )
+ if expected_missing - found_keys:
+ warnings.warn(
+ "Regression weights not found, predicting contacts will not produce correct results."
+ )
+
+ model.load_state_dict(model_state, strict=regression_data is not None)
+
+ return model, alphabet
+
+
+def esm1_t34_670M_UR50S():
+ """34 layer transformer model with 670M params, trained on Uniref50 Sparse.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1_t34_670M_UR50S")
+
+
+def esm1_t34_670M_UR50D():
+ """34 layer transformer model with 670M params, trained on Uniref50 Dense.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1_t34_670M_UR50D")
+
+
+def esm1_t34_670M_UR100():
+ """34 layer transformer model with 670M params, trained on Uniref100.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1_t34_670M_UR100")
+
+
+def esm1_t12_85M_UR50S():
+ """12 layer transformer model with 85M params, trained on Uniref50 Sparse.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1_t12_85M_UR50S")
+
+
+def esm1_t6_43M_UR50S():
+ """6 layer transformer model with 43M params, trained on Uniref50 Sparse.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1_t6_43M_UR50S")
+
+
+def esm1b_t33_650M_UR50S():
+ """33 layer transformer model with 650M params, trained on Uniref50 Sparse.
+ This is our best performing model, which will be described in a future publication.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1b_t33_650M_UR50S")
+
+
+def esm_msa1_t12_100M_UR50S():
+ warnings.warn(
+ "This model had a minor bug in the positional embeddings, "
+ "please use ESM-MSA-1b: esm.pretrained.esm_msa1b_t12_100M_UR50S()",
+ )
+ return load_model_and_alphabet_hub("esm_msa1_t12_100M_UR50S")
+
+
+def esm_msa1b_t12_100M_UR50S():
+ return load_model_and_alphabet_hub("esm_msa1b_t12_100M_UR50S")
+
+
+def esm1v_t33_650M_UR90S():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 1 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_1")
+
+
+def esm1v_t33_650M_UR90S_1():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 1 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_1")
+
+
+def esm1v_t33_650M_UR90S_2():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 2 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_2")
+
+
+def esm1v_t33_650M_UR90S_3():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 3 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_3")
+
+
+def esm1v_t33_650M_UR90S_4():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 4 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_4")
+
+
+def esm1v_t33_650M_UR90S_5():
+ """33 layer transformer model with 650M params, trained on Uniref90.
+ This is model 5 of a 5 model ensemble.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm1v_t33_650M_UR90S_5")
+
+
+def esm_if1_gvp4_t16_142M_UR50():
+ """Inverse folding model with 142M params, with 4 GVP-GNN layers, 8
+ Transformer encoder layers, and 8 Transformer decoder layers, trained on
+ CATH structures and 12 million alphafold2 predicted structures from UniRef50
+ sequences.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm_if1_gvp4_t16_142M_UR50")
+
+
+def esm2_t6_8M_UR50D():
+ """6 layer ESM-2 model with 8M params, trained on UniRef50.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t6_8M_UR50D")
+
+
+def esm2_t12_35M_UR50D():
+ """12 layer ESM-2 model with 35M params, trained on UniRef50.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t12_35M_UR50D")
+
+
+def esm2_t30_150M_UR50D():
+ """30 layer ESM-2 model with 150M params, trained on UniRef50.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t30_150M_UR50D")
+
+
+def esm2_t33_650M_UR50D():
+ """33 layer ESM-2 model with 650M params, trained on UniRef50.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t33_650M_UR50D")
+
+
+def esm2_t36_3B_UR50D():
+ """36 layer ESM-2 model with 3B params, trained on UniRef50.
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t36_3B_UR50D")
+
+
+def esm2_t48_15B_UR50D():
+ """48 layer ESM-2 model with 15B params, trained on UniRef50.
+ If you have OOM while loading this model, please refer to README
+ on how to employ FSDP and ZeRO CPU offloading
+
+ Returns a tuple of (Model, Alphabet).
+ """
+ return load_model_and_alphabet_hub("esm2_t48_15B_UR50D")
+
+
+def esmfold_v0():
+ """
+ ESMFold v0 model with 3B ESM-2, 48 folding blocks.
+ This version was used for the paper (Lin et al, 2022). It was trained
+ on all PDB chains until 2020-05, to ensure temporal holdout with CASP14
+ and the CAMEO validation and test set reported there.
+ """
+ import esm.esmfold.v1.pretrained
+ return esm.esmfold.v1.pretrained.esmfold_v0()
+
+
+def esmfold_v1():
+ """
+ ESMFold v1 model using 3B ESM-2, 48 folding blocks.
+ ESMFold provides fast high accuracy atomic level structure prediction
+ directly from the individual sequence of a protein. ESMFold uses the ESM2
+ protein language model to extract meaningful representations from the
+ protein sequence.
+ """
+ import esm.esmfold.v1.pretrained
+ return esm.esmfold.v1.pretrained.esmfold_v1()
diff --git a/proteingym/baselines/esm/esm/rotary_embedding.py b/proteingym/baselines/esm/esm/rotary_embedding.py
new file mode 100644
index 0000000..e862196
--- /dev/null
+++ b/proteingym/baselines/esm/esm/rotary_embedding.py
@@ -0,0 +1,69 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from typing import Tuple
+
+import torch
+
+
+def rotate_half(x):
+ x1, x2 = x.chunk(2, dim=-1)
+ return torch.cat((-x2, x1), dim=-1)
+
+
+def apply_rotary_pos_emb(x, cos, sin):
+ cos = cos[:, : x.shape[-2], :]
+ sin = sin[:, : x.shape[-2], :]
+
+ return (x * cos) + (rotate_half(x) * sin)
+
+
+class RotaryEmbedding(torch.nn.Module):
+ """
+ The rotary position embeddings from RoFormer_ (Su et. al).
+ A crucial insight from the method is that the query and keys are
+ transformed by rotation matrices which depend on the relative positions.
+ Other implementations are available in the Rotary Transformer repo_ and in
+ GPT-NeoX_, GPT-NeoX was an inspiration
+ .. _RoFormer: https://arxiv.org/abs/2104.09864
+ .. _repo: https://github.com/ZhuiyiTechnology/roformer
+ .. _GPT-NeoX: https://github.com/EleutherAI/gpt-neox
+ .. warning: Please note that this embedding is not registered on purpose, as it is transformative
+ (it does not create the embedding dimension) and will likely be picked up (imported) on a ad-hoc basis
+ """
+
+ def __init__(self, dim: int, *_, **__):
+ super().__init__()
+ # Generate and save the inverse frequency buffer (non trainable)
+ inv_freq = 1.0 / (10000 ** (torch.arange(0, dim, 2).float() / dim))
+ self.register_buffer("inv_freq", inv_freq)
+
+ self._seq_len_cached = None
+ self._cos_cached = None
+ self._sin_cached = None
+
+ def _update_cos_sin_tables(self, x, seq_dimension=1):
+ seq_len = x.shape[seq_dimension]
+
+ # Reset the tables if the sequence length has changed,
+ # or if we're on a new device (possibly due to tracing for instance)
+ if seq_len != self._seq_len_cached or self._cos_cached.device != x.device:
+ self._seq_len_cached = seq_len
+ t = torch.arange(x.shape[seq_dimension], device=x.device).type_as(self.inv_freq)
+ freqs = torch.einsum("i,j->ij", t, self.inv_freq)
+ emb = torch.cat((freqs, freqs), dim=-1).to(x.device)
+
+ self._cos_cached = emb.cos()[None, :, :]
+ self._sin_cached = emb.sin()[None, :, :]
+
+ return self._cos_cached, self._sin_cached
+
+ def forward(self, q: torch.Tensor, k: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]:
+ self._cos_cached, self._sin_cached = self._update_cos_sin_tables(k, seq_dimension=-2)
+
+ return (
+ apply_rotary_pos_emb(q, self._cos_cached, self._sin_cached),
+ apply_rotary_pos_emb(k, self._cos_cached, self._sin_cached),
+ )
diff --git a/proteingym/baselines/esm/esm/version.py b/proteingym/baselines/esm/esm/version.py
new file mode 100644
index 0000000..0b33889
--- /dev/null
+++ b/proteingym/baselines/esm/esm/version.py
@@ -0,0 +1,6 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+version = "2.0.1"
diff --git a/proteingym/baselines/gemme/compute_fitness.py b/proteingym/baselines/gemme/compute_fitness.py
new file mode 100644
index 0000000..41e7e9d
--- /dev/null
+++ b/proteingym/baselines/gemme/compute_fitness.py
@@ -0,0 +1,122 @@
+import pandas as pd
+import os
+import argparse
+import subprocess
+import sys
+
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from utils.scoring_utils import set_mutant_offset, undo_mutant_offset
+"""
+Run GEMME on selected DMS and saves fitness scores
+Note that GEMME has JET2 as a dependency (and psiblast if used for sequence search)
+and requires installations of java, python2, and R
+GEMME also assumes that the first sequence of the alignment is the query sequence
+"""
+if __name__ == "__main__":
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ #We may pass in all required information about the DMS via the provided reference files, or specify all relevant fields manually
+ parser.add_argument('--DMS_reference_file_path', default=None, type=str, help='Path to reference file with list of DMS to score')
+ parser.add_argument('--DMS_index', default=0, type=int, help='Index of DMS assay in reference file')
+ #Fields to be passed manually if reference file is not used
+ parser.add_argument('--DMS_file_name', default=None, type=str, help='Name of DMS assay file')
+ parser.add_argument('--MSA_filename', default=None, type=str, help='Name of MSA (eg., a2m) file constructed on the wild type sequence')
+ parser.add_argument('--MSA_start', default=None, type=int, help='Sequence position that the MSA starts at (1-indexing)')
+ parser.add_argument('--MSA_end', default=None, type=int, help='Sequence position that the MSA ends at (1-indexing)')
+ parser.add_argument('--DMS_data_folder', type=str, help='Path to folder that contains all DMS assay datasets')
+ parser.add_argument('--output_scores_folder', default='./', type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--MSA_folder', default='.', type=str, help='Path to MSA for neighborhood scoring')
+
+ # GEMME parameters
+ parser.add_argument("--temp_folder", default="./gemme_tmp", type=str, help="Path to temporary folder to store intermediate files")
+ parser.add_argument("--GEMME_path", default="/n/groups/marks/software/GEMME/GEMME", type=str, help="Path to GEMME installation")
+ parser.add_argument("--JET_path", default="/n/groups/marks/software/JET2/JET2", type=str, help="Path to JET2 installation")
+ parser.add_argument("--nseqs", type=int, default=20000)
+ args = parser.parse_args()
+
+ if not os.path.isdir(args.temp_folder):
+ os.mkdir(args.temp_folder)
+
+ if args.DMS_reference_file_path:
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Compute scores for DMS: "+str(DMS_id))
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ MSA_data_file = args.MSA_folder + os.sep + mapping_protein_seq_DMS["MSA_filename"][args.DMS_index] if args.MSA_folder is not None else None
+ MSA_start = mapping_protein_seq_DMS["MSA_start"][args.DMS_index]
+ MSA_end = mapping_protein_seq_DMS["MSA_end"][args.DMS_index]
+ else:
+ target_seq=args.target_seq
+ DMS_file_name=args.DMS_file_name
+ DMS_id = DMS_file_name.split(".")[0]
+ MSA_data_file = args.MSA_folder + os.sep + args.MSA_filename if args.MSA_folder is not None else None
+ MSA_start = args.MSA_start
+ MSA_end = args.MSA_end
+ # This is necessary because GEMME splits filenames and things on periods and underscores and dashes and things, so we need to remove those to avoid
+ # any filepath/name issues
+ condensed_DMS_id = DMS_id.replace("_","").replace(".","")
+ if not os.path.isdir(args.temp_folder + os.sep + condensed_DMS_id):
+ os.mkdir(args.temp_folder + os.sep + condensed_DMS_id)
+ full_temp_folder = args.temp_folder + os.sep + condensed_DMS_id
+ # get mutant files in right format for GEMME
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name)
+ # mutant_temp_file = DMS_id.replace(".","-").replace("_","@") + "_mutants.txt"
+ mutant_temp_file = condensed_DMS_id + "_mutants.txt"
+ if "mutant" not in DMS_data.columns:
+ raise ValueError("DMS data file must contain a column named 'mutant' with the mutant sequences to score")
+ env = os.environ.copy()
+ env["GEMME_PATH"] = args.GEMME_path
+ env["JET_PATH"] = args.JET_path
+ with open(full_temp_folder + os.sep + mutant_temp_file, "w+") as mutant_file:
+ for mutant in DMS_data["mutant"]:
+ offset_mutant = set_mutant_offset(mutant, MSA_start)
+ mutant_csv = offset_mutant.replace(":", ",")
+ mutant_file.write(mutant_csv + "\n")
+
+ # Converting alignment to uppercase if necessary, only uppercases lines that are not headers
+ MSA_upper_file = condensed_DMS_id + "_MSA_upper.txt"
+ if MSA_data_file is not None:
+ with open(MSA_data_file, "r") as f:
+ lines = f.readlines()
+ with open(full_temp_folder + os.sep + MSA_upper_file, "w") as f:
+ for i,line in enumerate(lines):
+ if line[0] == ">":
+ # if i == 0:
+ # f.write(">" + condensed_DMS_id + "\n")
+ # else:
+ if not "/" in line:
+ newline = line.split("\t")[0].replace("_","").replace(".","").rstrip() + "/" + str(MSA_start) + "-" + str(MSA_end) + "\n"
+ f.write(newline)
+ else:
+ f.write(line.replace("_","").replace(".",""))
+ else:
+ f.write(line.upper())
+ else:
+ raise ValueError("MSA data file must be provided to run GEMME")
+
+ # run GEMME using subprocess
+ command = f"python2 {args.GEMME_path}/gemme.py {MSA_upper_file} -r input -f {MSA_upper_file} -m {mutant_temp_file} -N {args.nseqs}"
+ print(command)
+ proc_obj = subprocess.run(command, shell=True, env=env, cwd=full_temp_folder, check=True)
+ # parse output files
+ # find file with suffix _evolCombi.txt
+ for file in os.listdir(full_temp_folder):
+ if file.endswith("_evolCombi.txt"):
+ evol_combi_file = file
+ break
+ else:
+ raise ValueError("GEMME output file not found")
+ score_df = pd.read_csv(full_temp_folder + os.sep + evol_combi_file, sep=" ")
+ score_df = score_df.reset_index().rename(columns={"index":"mutant","x":"GEMME_score"})
+ if not type(score_df["mutant"][0]) == str:
+ print("Weird R dataframe conversion error inside GEMME, remapping mutants to dataframe")
+ score_df["mutant"] = DMS_data["mutant"]
+ else:
+ score_df["mutant"] = score_df["mutant"].apply(undo_mutant_offset, MSA_start=MSA_start)
+ score_df["mutant"] = score_df["mutant"].apply(lambda x: x.replace(",",":"))
+ DMS_data_merged = pd.merge(DMS_data, score_df, on="mutant", how="left")
+ DMS_data_merged.to_csv(args.output_scores_folder + os.sep + DMS_id + ".csv", index=False)
+ os.system(f"rm -rf {full_temp_folder}")
\ No newline at end of file
diff --git a/proteingym/baselines/progen2/compute_fitness.py b/proteingym/baselines/progen2/compute_fitness.py
new file mode 100644
index 0000000..0da3de4
--- /dev/null
+++ b/proteingym/baselines/progen2/compute_fitness.py
@@ -0,0 +1,170 @@
+import os
+import argparse
+import tqdm
+import json
+
+from scipy.stats import spearmanr
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss
+
+from tokenizers import Tokenizer
+from models.progen.modeling_progen import ProGenForCausalLM
+
+
+########################################################################
+# model
+
+def create_model(ckpt, fp16):
+ if fp16:
+ return ProGenForCausalLM.from_pretrained(ckpt, revision='float16', torch_dtype=torch.float16, low_cpu_mem_usage=True)
+ else:
+ return ProGenForCausalLM.from_pretrained(ckpt)
+
+
+def create_tokenizer_custom(file):
+ with open(file, 'r') as f:
+ return Tokenizer.from_str(f.read())
+
+########################################################################
+# fitness
+
+def calc_fitness(model, prots, tokenizer, device='cuda:0', model_context_len=1024, fp16=False, reduction='mean'):
+ loss_list = []
+ loss_fn = CrossEntropyLoss()
+ with torch.no_grad():
+ with torch.cuda.amp.autocast(enabled=fp16):
+ for prot in tqdm.tqdm(prots):
+ loss_val = 0
+
+ sequence_chunks=[]
+ if len(prot) < model_context_len:
+ sequence_chunks = [prot]
+ else:
+ len_target_seq = len(prot)
+ num_windows = 1 + int( len_target_seq / model_context_len)
+ start=0
+ for window_index in range(1, num_windows+1):
+ sequence_chunks.append(prot[start:start+model_context_len])
+ start += model_context_len
+
+ for chunk in sequence_chunks:
+ for p in [chunk, chunk[::-1]]:
+ ids = torch.tensor(tokenizer.encode(p).ids).to(device)
+
+ input_ids = ids[:-1]
+ targets = ids[1:]
+
+ logits=model(input_ids).logits
+
+ # remove terminals
+ bos_token, eos_token = 3, 4
+ if targets[-1] in [bos_token, eos_token]:
+ logits = logits[:-1, ...]
+ targets = targets[:-1]
+ assert (targets == bos_token).sum() == 0
+ assert (targets == eos_token).sum() == 0
+
+ # remove unused logits
+ first_token, last_token = 5, 29
+ logits = logits[:, first_token:(last_token+1)]
+ targets = targets - first_token
+
+ assert logits.shape[1] == (last_token - first_token + 1)
+
+ loss = loss_fn(target=targets.view(-1), input=logits.view(-1,logits.size(-1)))
+ loss_val += - loss.item()
+
+ loss_val /= 2.0 #normalizing for mirroring
+
+ if reduction=='mean':
+ loss_val /= len(prot) #average by seq length
+
+ loss_list += [loss_val]
+ return np.array(loss_list)
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab="ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "1"+"".join(mutated_seq)+"2"
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+
+ models_151M = [ 'progen2-small' ]
+ models_754M = [ 'progen2-medium', 'progen2-oas', 'progen2-base' ]
+ models_2B = [ 'progen2-large', 'progen2-BFD90' ]
+ models_6B = [ 'progen2-xlarge' ]
+ models = models_151M + models_754M + models_2B + models_6B
+
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ parser.add_argument('--Progen2_model_name_or_path', default="/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/progen2/progen2-small", type=str, help='Name of or path to Progen2 model')
+ parser.add_argument('--DMS_reference_file_path', default='/home/pn73/Tranception/proteingym/ProteinGym_reference_file_substitutions.csv', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_data_folder', default='/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/Tranception_open_source/DMS_files/ProteinGym_substitutions', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_index', type=int, help='Path of DMS folder')
+ parser.add_argument('--output_scores_folder', default=None, type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+ parser.add_argument('--fp16', action='store_true', help='Whether to score sequences with half precision')
+ parser.add_argument('--test', action='store_true', help='Test mode of fitness computation')
+ args = parser.parse_args()
+
+ model = create_model(ckpt=args.Progen2_model_name_or_path, fp16=args.fp16).cuda()
+ config = json.load(open(args.Progen2_model_name_or_path+os.sep+'config.json',"r"))
+ print("Maximum context length: {}".format(config['n_positions']))
+
+ dir_path = os.path.dirname(os.path.realpath(__file__))
+ tokenizer_path = os.path.join(dir_path, 'tokenizer.json')
+ tokenizer = create_tokenizer_custom(file=tokenizer_path)
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Computing scores for: {} with Progen2: {}".format(DMS_id, args.Progen2_model_name_or_path))
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ if not args.indel_mode and "mutated_sequence" not in DMS_data.columns:
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: get_mutated_sequence(target_seq, x)) # if not args.indel_mode else DMS_data['mutant'].map(lambda x: "1"+x+"2")
+
+ if args.test:
+ x_uniref90bfd30 = '2GFLPFRGADEGLAAREAATLAARGTAARAYREDSWAVPVPRGLLGDLTARVAALGAASPPPADPLAVTLDLHHVTAEVALTTVLDAATLVHGQTRVLSAEDAAEAATAAAAATEAYLERLQDFVLFMSASVRVWRRGNAAGATGPEWDQWYTVADRDALGSAPTHLAVLGRQADALCHFVLDRVAWGTCGTPLWSGDEDLGNVVATFAGYADRLATAPRDLIM1'
+ x_oas = '1EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYAMHWVRQAPWKGLEYVSAISSNGGSTYYANSVKGRFTISRDNSKNTLYLQMGSLRAEDMAVYYCARDESGYSYGWGYYFDYWGQGTLVTVSS2'
+ x_bfd90 = '1TAPRSTRASGSEGSRPPGIPAKGRRCLPSRAGSVTPRFRHARQGTATVAKEQGRKLIASNRKARHDYHIEDTFEAGLVLTGTEVKSLRMGRASLIDGYAVFYGEELWLEGVHIPEYLNGNWTNHTPRRRRKLLLNRSELTKLAHKTSESGHTIVPLALYFKDGRAKVEIAVAKGKKAYDKRHALRERQDQREV2'
+ model_size = args.Progen2_model_name_or_path.split('/')[-1]
+ print("Model: {}".format(model_size))
+ checkpoint_x_ll = {
+ 'progen2-small': (x_uniref90bfd30, -2.4),
+ 'progen2-medium': (x_uniref90bfd30, -1.9),
+ 'progen2-base': (x_uniref90bfd30, -1.9),
+ 'progen2-large': (x_uniref90bfd30, -1.8),
+ 'progen2-xlarge': (x_uniref90bfd30, -1.0),
+ }
+ model_scores = calc_fitness(model=model, prots=np.array([checkpoint_x_ll[model_size][0]]), tokenizer=tokenizer, fp16=args.fp16, reduction='sum')
+ print(model_scores, checkpoint_x_ll[model_size][1], abs(model_scores - checkpoint_x_ll[model_size][1]))
+ assert abs(model_scores - checkpoint_x_ll[model_size][1]) < 0.1
+
+ model_scores = calc_fitness(model=model, prots=np.array(DMS_data['mutated_sequence']), model_context_len=int(config['n_positions']), tokenizer=tokenizer, fp16=args.fp16)
+
+ DMS_data['Progen2_score']=model_scores
+ scoring_filename = args.output_scores_folder+os.sep+DMS_id+'.csv'
+ DMS_data[['mutant','Progen2_score','DMS_score']].to_csv(scoring_filename, index=False)
+
+if __name__ == '__main__':
+ main()
diff --git a/proteingym/baselines/progen2/models/progen/configuration_progen.py b/proteingym/baselines/progen2/models/progen/configuration_progen.py
new file mode 100644
index 0000000..d32f5a6
--- /dev/null
+++ b/proteingym/baselines/progen2/models/progen/configuration_progen.py
@@ -0,0 +1,87 @@
+# coding=utf-8
+# Copyright 2021 The EleutherAI and HuggingFace Teams. All rights reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+# Modified configuration implementation based on https://github.com/huggingface/transformers/blob/main/src/transformers/models/gptj/configuration_gptj.py
+
+from transformers.configuration_utils import PretrainedConfig
+from transformers.utils import logging
+
+logger = logging.get_logger(__name__)
+
+
+class ProGenConfig(PretrainedConfig):
+ model_type = "progen"
+
+ def __init__(
+ self,
+ vocab_size=50400,
+ n_positions=2048,
+ n_ctx=2048,
+ n_embd=4096,
+ n_layer=28,
+ n_head=16,
+ rotary_dim=64,
+ n_inner=None,
+ activation_function="gelu_new",
+ resid_pdrop=0.0,
+ embd_pdrop=0.0,
+ attn_pdrop=0.0,
+ layer_norm_epsilon=1e-5,
+ initializer_range=0.02,
+ scale_attn_weights=True,
+ gradient_checkpointing=False,
+ use_cache=True,
+ bos_token_id=50256,
+ eos_token_id=50256,
+ **kwargs
+ ):
+ super().__init__(bos_token_id=bos_token_id, eos_token_id=eos_token_id, **kwargs)
+
+ self.vocab_size = vocab_size
+ self.n_ctx = n_ctx
+ self.n_positions = n_positions
+ self.n_embd = n_embd
+ self.n_layer = n_layer
+ self.n_head = n_head
+ self.n_inner = n_inner
+ self.rotary_dim = rotary_dim
+ self.activation_function = activation_function
+ self.resid_pdrop = resid_pdrop
+ self.embd_pdrop = embd_pdrop
+ self.attn_pdrop = attn_pdrop
+ self.layer_norm_epsilon = layer_norm_epsilon
+ self.initializer_range = initializer_range
+ self.gradient_checkpointing = gradient_checkpointing
+ self.scale_attn_weights = scale_attn_weights
+ self.use_cache = use_cache
+
+ self.bos_token_id = bos_token_id
+ self.eos_token_id = eos_token_id
+
+ @property
+ def max_position_embeddings(self):
+ return self.n_positions
+
+ @property
+ def hidden_size(self):
+ return self.n_embd
+
+ @property
+ def num_attention_heads(self):
+ return self.n_head
+
+ @property
+ def num_hidden_layers(self):
+ return self.n_layer
\ No newline at end of file
diff --git a/proteingym/baselines/progen2/models/progen/modeling_progen.py b/proteingym/baselines/progen2/models/progen/modeling_progen.py
new file mode 100644
index 0000000..48873c4
--- /dev/null
+++ b/proteingym/baselines/progen2/models/progen/modeling_progen.py
@@ -0,0 +1,687 @@
+# coding=utf-8
+# Copyright 2021 The EleutherAI and HuggingFace Teams. All rights reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+# Modified forward-pass implementation based on https://github.com/huggingface/transformers/blob/main/src/transformers/models/gptj/modeling_gptj.py
+
+from typing import Tuple
+
+import numpy as np
+
+import torch
+import torch.utils.checkpoint
+from torch import nn
+from torch.nn import CrossEntropyLoss
+
+from transformers.activations import ACT2FN
+from transformers.modeling_outputs import BaseModelOutputWithPast, CausalLMOutputWithPast
+from transformers.modeling_utils import PreTrainedModel
+from transformers.utils import logging
+from transformers.utils.model_parallel_utils import assert_device_map, get_device_map
+from .configuration_progen import ProGenConfig
+
+
+logger = logging.get_logger(__name__)
+
+
+def fixed_pos_embedding(x, seq_dim=1, seq_len=None):
+ dim = x.shape[-1]
+ if seq_len is None:
+ seq_len = x.shape[seq_dim]
+ inv_freq = 1.0 / (10000 ** (torch.arange(0, dim, 2) / dim))
+ sinusoid_inp = torch.einsum("i , j -> i j", torch.arange(seq_len), inv_freq).to(x.device).float()
+ return torch.sin(sinusoid_inp), torch.cos(sinusoid_inp)
+
+
+def rotate_every_two(x):
+ x1 = x[:, :, :, ::2]
+ x2 = x[:, :, :, 1::2]
+ x = torch.stack((-x2, x1), axis=-1)
+ return x.flatten(-2) # in einsum notation: rearrange(x, '... d j -> ... (d j)')
+
+
+def apply_rotary_pos_emb(x, sincos, offset=0):
+ sin, cos = map(lambda t: t[None, offset : x.shape[1] + offset, None, :].repeat_interleave(2, 3), sincos)
+ # einsum notation for lambda t: repeat(t[offset:x.shape[1]+offset,:], "n d -> () n () (d j)", j=2)
+ return (x * cos) + (rotate_every_two(x) * sin)
+
+
+class ProGenAttention(nn.Module):
+ def __init__(self, config):
+ super().__init__()
+
+ max_positions = config.max_position_embeddings
+ self.register_buffer(
+ "bias",
+ torch.tril(torch.ones((max_positions, max_positions), dtype=torch.bool)).view(
+ 1, 1, max_positions, max_positions
+ ),
+ )
+ self.register_buffer("masked_bias", torch.tensor(-1e9))
+
+ self.attn_dropout = nn.Dropout(config.attn_pdrop)
+ self.resid_dropout = nn.Dropout(config.resid_pdrop)
+
+ self.embed_dim = config.hidden_size
+ self.num_attention_heads = config.num_attention_heads
+ self.head_dim = self.embed_dim // self.num_attention_heads
+ if self.head_dim * self.num_attention_heads != self.embed_dim:
+ raise ValueError(
+ f"embed_dim must be divisible by num_attention_heads (got `embed_dim`: {self.embed_dim} and `num_attention_heads`: {self.num_attention_heads})."
+ )
+ self.scale_attn = torch.sqrt(torch.tensor(self.head_dim, dtype=torch.float32)).to(torch.get_default_dtype())
+ self.qkv_proj = nn.Linear(self.embed_dim, self.embed_dim * 3, bias=False)
+
+ self.out_proj = nn.Linear(self.embed_dim, self.embed_dim, bias=False)
+ self.rotary_dim = None
+ if config.rotary_dim is not None:
+ self.rotary_dim = config.rotary_dim
+
+ def _split_heads(self, x, n_head, dim_head, mp_num):
+ reshaped = x.reshape(x.shape[:-1] + (n_head//mp_num, dim_head))
+ reshaped = reshaped.reshape(x.shape[:-2] + (-1, ) + reshaped.shape[-1:])
+ return reshaped
+
+ def _merge_heads(self, tensor, num_attention_heads, attn_head_size):
+ """
+ Merges attn_head_size dim and num_attn_heads dim into n_ctx
+ """
+ if len(tensor.shape) == 5:
+ tensor = tensor.permute(0, 1, 3, 2, 4).contiguous()
+ elif len(tensor.shape) == 4:
+ tensor = tensor.permute(0, 2, 1, 3).contiguous()
+ else:
+ raise ValueError(f"Input tensor rank should be one of [4, 5], but is: {len(tensor.shape)}")
+ new_shape = tensor.size()[:-2] + (num_attention_heads * attn_head_size,)
+ return tensor.view(new_shape)
+
+ def _attn(
+ self,
+ query,
+ key,
+ value,
+ attention_mask=None,
+ head_mask=None,
+ ):
+
+ # compute causal mask from causal mask buffer
+ query_length, key_length = query.size(-2), key.size(-2)
+ causal_mask = self.bias[:, :, key_length - query_length : key_length, :key_length]
+
+ # Keep the attention weights computation in fp32 to avoid overflow issues
+ query = query.to(torch.float32)
+ key = key.to(torch.float32)
+
+ attn_weights = torch.matmul(query, key.transpose(-1, -2))
+
+ attn_weights = attn_weights / self.scale_attn
+ attn_weights = torch.where(causal_mask, attn_weights, self.masked_bias.to(attn_weights.dtype))
+
+ if attention_mask is not None:
+ # Apply the attention mask
+ attn_weights = attn_weights + attention_mask
+
+ attn_weights = nn.Softmax(dim=-1)(attn_weights)
+ attn_weights = attn_weights.to(value.dtype)
+ attn_weights = self.attn_dropout(attn_weights)
+
+ # Mask heads if we want to
+ if head_mask is not None:
+ attn_weights = attn_weights * head_mask
+
+ attn_output = torch.matmul(attn_weights, value)
+
+ return attn_output, attn_weights
+
+ def forward(
+ self,
+ hidden_states,
+ attention_mask=None,
+ layer_past=None,
+ head_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ ):
+
+ qkv = self.qkv_proj(hidden_states)
+ # TODO(enijkamp): factor out number of logical TPU-v3/v4 cores or make forward pass agnostic
+ # mp_num = 4
+ mp_num = 8
+ qkv_split = qkv.reshape(qkv.shape[:-1] + (mp_num, -1))
+
+ local_dim = self.head_dim * self.num_attention_heads // mp_num
+ query, value, key = torch.split(qkv_split, local_dim, dim=-1)
+ query = self._split_heads(query, self.num_attention_heads, self.head_dim, mp_num=mp_num)
+ key = self._split_heads(key, self.num_attention_heads, self.head_dim, mp_num=mp_num)
+
+ value = self._split_heads(value, self.num_attention_heads, self.head_dim, mp_num=mp_num)
+ value = value.permute(0, 2, 1, 3)
+
+ seq_len = key.shape[1]
+ offset = 0
+
+ if layer_past is not None:
+ offset = layer_past[0].shape[-2]
+ seq_len += offset
+
+ if self.rotary_dim is not None:
+ k_rot = key[:, :, :, : self.rotary_dim]
+ k_pass = key[:, :, :, self.rotary_dim :]
+
+ q_rot = query[:, :, :, : self.rotary_dim]
+ q_pass = query[:, :, :, self.rotary_dim :]
+
+ sincos = fixed_pos_embedding(k_rot, 1, seq_len=seq_len)
+ k_rot = apply_rotary_pos_emb(k_rot, sincos, offset=offset)
+ q_rot = apply_rotary_pos_emb(q_rot, sincos, offset=offset)
+
+ key = torch.cat([k_rot, k_pass], dim=-1)
+ query = torch.cat([q_rot, q_pass], dim=-1)
+ else:
+ sincos = fixed_pos_embedding(key, 1, seq_len=seq_len)
+ key = apply_rotary_pos_emb(key, sincos, offset=offset)
+ query = apply_rotary_pos_emb(query, sincos, offset=offset)
+
+ key = key.permute(0, 2, 1, 3)
+ query = query.permute(0, 2, 1, 3)
+
+ if layer_past is not None:
+ past_key = layer_past[0]
+ past_value = layer_past[1]
+ key = torch.cat((past_key, key), dim=-2)
+ value = torch.cat((past_value, value), dim=-2)
+
+ if use_cache is True:
+ present = (key, value)
+ else:
+ present = None
+
+ # compute self-attention: V x Softmax(QK^T)
+ attn_output, attn_weights = self._attn(query, key, value, attention_mask, head_mask)
+
+ attn_output = self._merge_heads(attn_output, self.num_attention_heads, self.head_dim)
+
+ attn_output = self.out_proj(attn_output)
+ attn_output = self.resid_dropout(attn_output)
+
+ outputs = (attn_output, present)
+ if output_attentions:
+ outputs += (attn_weights,)
+
+ return outputs # a, present, (attentions)
+
+
+class ProGenMLP(nn.Module):
+ def __init__(self, intermediate_size, config): # in MLP: intermediate_size= 4 * embed_dim
+ super().__init__()
+ embed_dim = config.n_embd
+
+ self.fc_in = nn.Linear(embed_dim, intermediate_size)
+ self.fc_out = nn.Linear(intermediate_size, embed_dim)
+
+ self.act = ACT2FN[config.activation_function]
+ self.dropout = nn.Dropout(config.resid_pdrop)
+
+ def forward(self, hidden_states):
+ hidden_states = self.fc_in(hidden_states)
+ hidden_states = self.act(hidden_states)
+ hidden_states = self.fc_out(hidden_states)
+ hidden_states = self.dropout(hidden_states)
+ return hidden_states
+
+
+class ProGenBlock(nn.Module):
+ def __init__(self, config):
+ super().__init__()
+ inner_dim = config.n_inner if config.n_inner is not None else 4 * config.n_embd
+ self.ln_1 = nn.LayerNorm(config.n_embd, eps=config.layer_norm_epsilon)
+ self.attn = ProGenAttention(config)
+ self.mlp = ProGenMLP(inner_dim, config)
+
+ def forward(
+ self,
+ hidden_states,
+ layer_past=None,
+ attention_mask=None,
+ head_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ ):
+ residual = hidden_states
+ hidden_states = self.ln_1(hidden_states)
+ attn_outputs = self.attn(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ )
+ attn_output = attn_outputs[0] # output_attn: a, present, (attentions)
+ outputs = attn_outputs[1:]
+
+ feed_forward_hidden_states = self.mlp(hidden_states)
+ hidden_states = attn_output + feed_forward_hidden_states + residual
+
+ if use_cache:
+ outputs = (hidden_states,) + outputs
+ else:
+ outputs = (hidden_states,) + outputs[1:]
+
+ return outputs # hidden_states, present, (attentions)
+
+
+class ProGenPreTrainedModel(PreTrainedModel):
+ """
+ An abstract class to handle weights initialization and a simple interface for downloading and loading pretrained
+ models.
+ """
+
+ config_class = ProGenConfig
+ base_model_prefix = "transformer"
+ is_parallelizable = True
+
+ def __init__(self, *inputs, **kwargs):
+ super().__init__(*inputs, **kwargs)
+
+ def _init_weights(self, module):
+ """Initialize the weights."""
+ if isinstance(module, (nn.Linear,)):
+ # Slightly different from Mesh Transformer JAX which uses truncated_normal for initialization
+ # cf https://github.com/pytorch/pytorch/pull/5617
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.bias is not None:
+ module.bias.data.zero_()
+ elif isinstance(module, nn.Embedding):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.padding_idx is not None:
+ module.weight.data[module.padding_idx].zero_()
+ elif isinstance(module, nn.LayerNorm):
+ module.bias.data.zero_()
+ module.weight.data.fill_(1.0)
+
+
+class ProGenModel(ProGenPreTrainedModel):
+ def __init__(self, config):
+ super().__init__(config)
+
+ self.embed_dim = config.n_embd
+ self.vocab_size = config.vocab_size
+ self.wte = nn.Embedding(config.vocab_size, self.embed_dim)
+ self.drop = nn.Dropout(config.embd_pdrop)
+ self.h = nn.ModuleList([ProGenBlock(config) for _ in range(config.n_layer)])
+ self.ln_f = nn.LayerNorm(self.embed_dim, eps=config.layer_norm_epsilon)
+ self.rotary_dim = min(config.rotary_dim, config.n_ctx // config.num_attention_heads)
+ self.init_weights()
+
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+
+
+ def parallelize(self, device_map=None):
+ # Check validity of device_map
+ self.device_map = (
+ get_device_map(len(self.h), range(torch.cuda.device_count())) if device_map is None else device_map
+ )
+ assert_device_map(self.device_map, len(self.h))
+ self.model_parallel = True
+ self.first_device = "cpu" if "cpu" in self.device_map.keys() else "cuda:" + str(min(self.device_map.keys()))
+ self.last_device = "cuda:" + str(max(self.device_map.keys()))
+ self.wte = self.wte.to(self.first_device)
+ # Load onto devices
+ for k, v in self.device_map.items():
+ for block in v:
+ cuda_device = "cuda:" + str(k)
+ self.h[block] = self.h[block].to(cuda_device)
+ # ln_f to last
+ self.ln_f = self.ln_f.to(self.last_device)
+
+
+ def deparallelize(self):
+ self.model_parallel = False
+ self.device_map = None
+ self.first_device = "cpu"
+ self.last_device = "cpu"
+ self.wte = self.wte.to("cpu")
+ for index in range(len(self.h)):
+ self.h[index] = self.h[index].to("cpu")
+ self.ln_f = self.ln_f.to("cpu")
+ torch.cuda.empty_cache()
+
+ def get_input_embeddings(self):
+ return self.wte
+
+ def set_input_embeddings(self, new_embeddings):
+ self.wte = new_embeddings
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ ):
+ output_attentions = output_attentions if output_attentions is not None else self.config.output_attentions
+ output_hidden_states = (
+ output_hidden_states if output_hidden_states is not None else self.config.output_hidden_states
+ )
+ use_cache = use_cache if use_cache is not None else self.config.use_cache
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ if input_ids is not None and inputs_embeds is not None:
+ raise ValueError("You cannot specify both input_ids and inputs_embeds at the same time")
+ elif input_ids is not None:
+ input_shape = input_ids.size()
+ input_ids = input_ids.view(-1, input_shape[-1])
+ batch_size = input_ids.shape[0]
+ elif inputs_embeds is not None:
+ input_shape = inputs_embeds.size()[:-1]
+ batch_size = inputs_embeds.shape[0]
+ else:
+ raise ValueError("You have to specify either input_ids or inputs_embeds")
+
+ device = input_ids.device if input_ids is not None else inputs_embeds.device
+
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids.view(-1, input_shape[-1])
+
+ if position_ids is not None:
+ position_ids = position_ids.view(-1, input_shape[-1])
+
+ if past_key_values is None:
+ past_length = 0
+ past_key_values = tuple([None] * len(self.h))
+ else:
+ past_length = past_key_values[0][0].size(-2)
+
+ if position_ids is None:
+ position_ids = torch.arange(past_length, input_shape[-1] + past_length, dtype=torch.long, device=device)
+ position_ids = position_ids.unsqueeze(0).view(-1, input_shape[-1])
+
+ # Attention mask.
+ if attention_mask is not None:
+ assert batch_size > 0, "batch_size has to be defined and > 0"
+ attention_mask = attention_mask.view(batch_size, -1)
+ # We create a 3D attention mask from a 2D tensor mask.
+ # Sizes are [batch_size, 1, 1, to_seq_length]
+ # So we can broadcast to [batch_size, num_heads, from_seq_length, to_seq_length]
+ # this attention mask is more simple than the triangular masking of causal attention
+ # used in OpenAI GPT, we just need to prepare the broadcast dimension here.
+ attention_mask = attention_mask[:, None, None, :]
+
+ # Since attention_mask is 1.0 for positions we want to attend and 0.0 for
+ # masked positions, this operation will create a tensor which is 0.0 for
+ # positions we want to attend and -10000.0 for masked positions.
+ # Since we are adding it to the raw scores before the softmax, this is
+ # effectively the same as removing these entirely.
+ attention_mask = attention_mask.to(dtype=self.dtype) # fp16 compatibility
+ attention_mask = (1.0 - attention_mask) * -10000.0
+
+ # Prepare head mask if needed
+ # 1.0 in head_mask indicate we keep the head
+ # attention_probs has shape bsz x num_attention_heads x N x N
+ # head_mask has shape n_layer x batch x num_attention_heads x N x N
+ head_mask = self.get_head_mask(head_mask, self.config.n_layer)
+
+ if inputs_embeds is None:
+ inputs_embeds = self.wte(input_ids)
+
+ hidden_states = inputs_embeds
+
+ if token_type_ids is not None:
+ token_type_embeds = self.wte(token_type_ids)
+ hidden_states = hidden_states + token_type_embeds
+
+ hidden_states = self.drop(hidden_states)
+
+ output_shape = input_shape + (hidden_states.size(-1),)
+
+ presents = () if use_cache else None
+ all_self_attentions = () if output_attentions else None
+ all_hidden_states = () if output_hidden_states else None
+ for i, (block, layer_past) in enumerate(zip(self.h, past_key_values)):
+
+ # Model parallel
+ if self.model_parallel:
+ torch.cuda.set_device(hidden_states.device)
+ # Ensure layer_past is on same device as hidden_states (might not be correct)
+ if layer_past is not None:
+ layer_past = tuple(past_state.to(hidden_states.device) for past_state in layer_past)
+ # Ensure that attention_mask is always on the same device as hidden_states
+ if attention_mask is not None:
+ attention_mask = attention_mask.to(hidden_states.device)
+ if isinstance(head_mask, torch.Tensor):
+ head_mask = head_mask.to(hidden_states.device)
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if getattr(self.config, "gradient_checkpointing", False) and self.training:
+
+ if use_cache:
+ logger.warning(
+ "`use_cache=True` is incompatible with `config.gradient_checkpointing=True`. Setting "
+ "`use_cache=False`..."
+ )
+ use_cache = False
+
+ def create_custom_forward(module):
+ def custom_forward(*inputs):
+ # None for past_key_value
+ return module(*inputs, use_cache, output_attentions)
+
+ return custom_forward
+
+ outputs = torch.utils.checkpoint.checkpoint(
+ create_custom_forward(block),
+ hidden_states,
+ None,
+ attention_mask,
+ head_mask[i],
+ )
+ else:
+ outputs = block(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask[i],
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ )
+
+ hidden_states = outputs[0]
+ if use_cache is True:
+ presents = presents + (outputs[1],)
+
+ if output_attentions:
+ all_self_attentions = all_self_attentions + (outputs[2 if use_cache else 1],)
+
+ # Model Parallel: If it's the last layer for that device, put things on the next device
+ if self.model_parallel:
+ for k, v in self.device_map.items():
+ if i == v[-1] and "cuda:" + str(k) != self.last_device:
+ hidden_states = hidden_states.to("cuda:" + str(k + 1))
+
+ hidden_states = self.ln_f(hidden_states)
+
+ hidden_states = hidden_states.view(*output_shape)
+ # Add last hidden state
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if not return_dict:
+ return tuple(v for v in [hidden_states, presents, all_hidden_states, all_self_attentions] if v is not None)
+
+ return BaseModelOutputWithPast(
+ last_hidden_state=hidden_states,
+ past_key_values=presents,
+ hidden_states=all_hidden_states,
+ attentions=all_self_attentions,
+ )
+
+
+class ProGenForCausalLM(ProGenPreTrainedModel):
+ _keys_to_ignore_on_load_missing = [r"h\.\d+\.attn\.masked_bias", r"h\.\d+\.attn\.bias", r"lm_head\.weight"]
+
+ def __init__(self, config):
+ super().__init__(config)
+ self.transformer = ProGenModel(config)
+ self.lm_head = nn.Linear(config.n_embd, config.vocab_size)
+ self.init_weights()
+
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+
+ def parallelize(self, device_map=None):
+ self.device_map = (
+ get_device_map(len(self.transformer.h), range(torch.cuda.device_count()))
+ if device_map is None
+ else device_map
+ )
+ assert_device_map(self.device_map, len(self.transformer.h))
+ self.transformer.parallelize(self.device_map)
+ self.lm_head = self.lm_head.to(self.transformer.first_device)
+ self.model_parallel = True
+
+ def deparallelize(self):
+ self.transformer.deparallelize()
+ self.transformer = self.transformer.to("cpu")
+ self.lm_head = self.lm_head.to("cpu")
+ self.model_parallel = False
+ torch.cuda.empty_cache()
+
+ def get_output_embeddings(self):
+ return None
+
+ def set_output_embeddings(self, new_embeddings):
+ return
+
+ def prepare_inputs_for_generation(self, input_ids, past=None, **kwargs):
+ token_type_ids = kwargs.get("token_type_ids", None)
+ # only last token for inputs_ids if past is defined in kwargs
+ if past:
+ input_ids = input_ids[:, -1].unsqueeze(-1)
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids[:, -1].unsqueeze(-1)
+
+ attention_mask = kwargs.get("attention_mask", None)
+ position_ids = kwargs.get("position_ids", None)
+
+ if attention_mask is not None and position_ids is None:
+ # create position_ids on the fly for batch generation
+ position_ids = attention_mask.long().cumsum(-1) - 1
+ position_ids.masked_fill_(attention_mask == 0, 1)
+ if past:
+ position_ids = position_ids[:, -1].unsqueeze(-1)
+ else:
+ position_ids = None
+ return {
+ "input_ids": input_ids,
+ "past_key_values": past,
+ "use_cache": kwargs.get("use_cache"),
+ "position_ids": position_ids,
+ "attention_mask": attention_mask,
+ "token_type_ids": token_type_ids,
+ }
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ labels=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ ):
+ r"""
+ labels (:obj:`torch.LongTensor` of shape :obj:`(batch_size, sequence_length)`, `optional`):
+ Labels for language modeling. Note that the labels **are shifted** inside the model, i.e. you can set
+ ``labels = input_ids`` Indices are selected in ``[-100, 0, ..., config.vocab_size]`` All labels set to
+ ``-100`` are ignored (masked), the loss is only computed for labels in ``[0, ..., config.vocab_size]``
+ """
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ transformer_outputs = self.transformer(
+ input_ids,
+ past_key_values=past_key_values,
+ attention_mask=attention_mask,
+ token_type_ids=token_type_ids,
+ position_ids=position_ids,
+ head_mask=head_mask,
+ inputs_embeds=inputs_embeds,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ output_hidden_states=output_hidden_states,
+ return_dict=return_dict,
+ )
+ hidden_states = transformer_outputs[0]
+
+ # Set device for model parallelism
+ if self.model_parallel:
+ torch.cuda.set_device(self.transformer.first_device)
+ hidden_states = hidden_states.to(self.lm_head.weight.device)
+
+ # make sure sampling in fp16 works correctly and
+ # compute loss in fp32 to match with mesh-tf version
+ # https://github.com/EleutherAI/gpt-neo/blob/89ce74164da2fb16179106f54e2269b5da8db333/models/gpt2/gpt2.py#L179
+ lm_logits = self.lm_head(hidden_states).to(torch.float32)
+
+ loss = None
+ if labels is not None:
+ # Shift so that tokens < n predict n
+ shift_logits = lm_logits[..., :-1, :].contiguous()
+ shift_labels = labels[..., 1:].contiguous()
+ # Flatten the tokens
+ loss_fct = CrossEntropyLoss()
+ loss = loss_fct(shift_logits.view(-1, shift_logits.size(-1)), shift_labels.view(-1))
+
+ loss = loss.to(hidden_states.dtype)
+
+ if not return_dict:
+ output = (lm_logits,) + transformer_outputs[1:]
+ return ((loss,) + output) if loss is not None else output
+
+ return CausalLMOutputWithPast(
+ loss=loss,
+ logits=lm_logits,
+ past_key_values=transformer_outputs.past_key_values,
+ hidden_states=transformer_outputs.hidden_states,
+ attentions=transformer_outputs.attentions,
+ )
+
+ @staticmethod
+ def _reorder_cache(past: Tuple[Tuple[torch.Tensor]], beam_idx: torch.Tensor) -> Tuple[Tuple[torch.Tensor]]:
+ """
+ This function is used to re-order the :obj:`past_key_values` cache if
+ :meth:`~transformers.PretrainedModel.beam_search` or :meth:`~transformers.PretrainedModel.beam_sample` is
+ called. This is required to match :obj:`past_key_values` with the correct beam_idx at every generation step.
+ """
+ return tuple(
+ tuple(past_state.index_select(0, beam_idx.to(past_state.device)) for past_state in layer_past)
+ for layer_past in past
+ )
\ No newline at end of file
diff --git a/proteingym/baselines/progen2/tokenizer.json b/proteingym/baselines/progen2/tokenizer.json
new file mode 100644
index 0000000..1230f29
--- /dev/null
+++ b/proteingym/baselines/progen2/tokenizer.json
@@ -0,0 +1,91 @@
+{
+ "version": "1.0",
+ "truncation": null,
+ "padding": null,
+ "added_tokens": [
+ {
+ "id": 0,
+ "special": true,
+ "content": "<|pad|>",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false
+ },
+ {
+ "id": 1,
+ "special": true,
+ "content": "<|bos|>",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false
+ },
+ {
+ "id": 2,
+ "special": true,
+ "content": "<|eos|>",
+ "single_word": false,
+ "lstrip": false,
+ "rstrip": false,
+ "normalized": false
+ }
+ ],
+ "normalizer": null,
+ "pre_tokenizer": {
+ "type": "ByteLevel",
+ "add_prefix_space": false,
+ "trim_offsets": true
+ },
+ "post_processor": {
+ "type": "ByteLevel",
+ "add_prefix_space": true,
+ "trim_offsets": true
+ },
+ "decoder": {
+ "type": "ByteLevel",
+ "add_prefix_space": true,
+ "trim_offsets": true
+ },
+ "model": {
+ "type": "BPE",
+ "dropout": null,
+ "unk_token": null,
+ "continuing_subword_prefix": null,
+ "end_of_word_suffix": null,
+ "fuse_unk": false,
+ "vocab": {
+ "<|pad|>": 0,
+ "<|bos|>": 1,
+ "<|eos|>": 2,
+ "1": 3,
+ "2": 4,
+ "A": 5,
+ "B": 6,
+ "C": 7,
+ "D": 8,
+ "E": 9,
+ "F": 10,
+ "G": 11,
+ "H": 12,
+ "I": 13,
+ "K": 14,
+ "L": 15,
+ "M": 16,
+ "N": 17,
+ "O": 18,
+ "P": 19,
+ "Q": 20,
+ "R": 21,
+ "S": 22,
+ "T": 23,
+ "U": 24,
+ "V": 25,
+ "W": 26,
+ "X": 27,
+ "Y": 28,
+ "Z": 29
+ },
+ "merges": []
+ }
+ }
\ No newline at end of file
diff --git a/proteingym/baselines/protein_mpnn/compute_fitness.py b/proteingym/baselines/protein_mpnn/compute_fitness.py
new file mode 100755
index 0000000..4d8c0b5
--- /dev/null
+++ b/proteingym/baselines/protein_mpnn/compute_fitness.py
@@ -0,0 +1,275 @@
+import argparse
+import os.path
+import pandas as pd
+
+
+def main(args):
+
+ import json, time, os, sys, glob
+ import shutil
+ import warnings
+ import numpy as np
+ import torch
+ from torch import optim
+ from torch.utils.data import DataLoader
+ from torch.utils.data.dataset import random_split, Subset
+ import copy
+ import torch.nn as nn
+ import torch.nn.functional as F
+ import random
+ import os.path
+ import subprocess
+
+ from protein_mpnn_utils import loss_nll, loss_smoothed, gather_edges, gather_nodes, gather_nodes_t, cat_neighbors_nodes, _scores, _S_to_seq, tied_featurize, parse_PDB, parse_fasta
+ from protein_mpnn_utils import StructureDataset, StructureDatasetPDB, ProteinMPNN
+
+ if args.seed:
+ seed=args.seed
+ else:
+ seed=int(np.random.randint(0, high=999, size=1, dtype=int)[0])
+
+ torch.manual_seed(seed)
+ random.seed(seed)
+ np.random.seed(seed)
+
+ hidden_dim = 128
+ num_layers = 3
+
+ mapping_df = pd.read_csv(args.DMS_reference_file_path)
+ DMS_id = mapping_df.iloc[args.DMS_index]['DMS_id']
+ DMS_filename = mapping_df.iloc[args.DMS_index]['DMS_filename']
+ output_filename = args.output_scores_folder + os.sep + DMS_id + ".csv"
+ pdb_file = args.structure_folder + os.sep + mapping_df.iloc[args.DMS_index]['pdb_file']
+ if not os.path.exists(args.output_scores_folder):
+ os.makedirs(args.output_scores_folder)
+
+ checkpoint_path = args.checkpoint
+
+ NUM_BATCHES = args.num_seq_per_target//args.batch_size
+ BATCH_COPIES = args.batch_size
+ temperatures = [float(item) for item in args.sampling_temp.split()]
+ omit_AAs_list = args.omit_AAs
+ alphabet = 'ACDEFGHIKLMNPQRSTVWYX'
+ alphabet_dict = dict(zip(alphabet, range(21)))
+ print_all = args.suppress_print == 0
+ omit_AAs_np = np.array([AA in omit_AAs_list for AA in alphabet]).astype(np.float32)
+ device = torch.device("cuda:0" if (torch.cuda.is_available()) else "cpu")
+ if os.path.isfile(args.chain_id_jsonl):
+ with open(args.chain_id_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ for json_str in json_list:
+ chain_id_dict = json.loads(json_str)
+ else:
+ chain_id_dict = None
+ if print_all:
+ print(40*'-')
+ print('chain_id_jsonl is NOT loaded')
+
+ if os.path.isfile(args.fixed_positions_jsonl):
+ with open(args.fixed_positions_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ for json_str in json_list:
+ fixed_positions_dict = json.loads(json_str)
+ else:
+ if print_all:
+ print(40*'-')
+ print('fixed_positions_jsonl is NOT loaded')
+ fixed_positions_dict = None
+
+
+ if os.path.isfile(args.pssm_jsonl):
+ with open(args.pssm_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ pssm_dict = {}
+ for json_str in json_list:
+ pssm_dict.update(json.loads(json_str))
+ else:
+ if print_all:
+ print(40*'-')
+ print('pssm_jsonl is NOT loaded')
+ pssm_dict = None
+
+
+ if os.path.isfile(args.omit_AA_jsonl):
+ with open(args.omit_AA_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ for json_str in json_list:
+ omit_AA_dict = json.loads(json_str)
+ else:
+ if print_all:
+ print(40*'-')
+ print('omit_AA_jsonl is NOT loaded')
+ omit_AA_dict = None
+
+
+ if os.path.isfile(args.bias_AA_jsonl):
+ with open(args.bias_AA_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ for json_str in json_list:
+ bias_AA_dict = json.loads(json_str)
+ else:
+ if print_all:
+ print(40*'-')
+ print('bias_AA_jsonl is NOT loaded')
+ bias_AA_dict = None
+
+
+ if os.path.isfile(args.tied_positions_jsonl):
+ with open(args.tied_positions_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+ for json_str in json_list:
+ tied_positions_dict = json.loads(json_str)
+ else:
+ if print_all:
+ print(40*'-')
+ print('tied_positions_jsonl is NOT loaded')
+ tied_positions_dict = None
+
+
+ if os.path.isfile(args.bias_by_res_jsonl):
+ with open(args.bias_by_res_jsonl, 'r') as json_file:
+ json_list = list(json_file)
+
+ for json_str in json_list:
+ bias_by_res_dict = json.loads(json_str)
+ if print_all:
+ print('bias by residue dictionary is loaded')
+ else:
+ if print_all:
+ print(40*'-')
+ print('bias by residue dictionary is not loaded, or not provided')
+ bias_by_res_dict = None
+
+
+ if print_all:
+ print(40*'-')
+ bias_AAs_np = np.zeros(len(alphabet))
+ if bias_AA_dict:
+ for n, AA in enumerate(alphabet):
+ if AA in list(bias_AA_dict.keys()):
+ bias_AAs_np[n] = bias_AA_dict[AA]
+
+ pdb_dict_list = parse_PDB(pdb_file, ca_only=False)
+ dataset_valid = StructureDatasetPDB(pdb_dict_list, truncate=None, max_length=args.max_length)
+ all_chain_list = [item[-1:] for item in list(pdb_dict_list[0]) if item[:9]=='seq_chain'] #['A','B', 'C',...]
+ if args.pdb_path_chains:
+ designed_chain_list = [str(item) for item in args.pdb_path_chains.split()]
+ else:
+ designed_chain_list = all_chain_list
+ fixed_chain_list = [letter for letter in all_chain_list if letter not in designed_chain_list]
+ chain_id_dict = {}
+ chain_id_dict[pdb_dict_list[0]['name']]= (designed_chain_list, fixed_chain_list)
+
+ checkpoint = torch.load(checkpoint_path, map_location=device)
+ noise_level_print = checkpoint['noise_level']
+ model = ProteinMPNN(ca_only=False, num_letters=21, node_features=hidden_dim, edge_features=hidden_dim, hidden_dim=hidden_dim, num_encoder_layers=num_layers, num_decoder_layers=num_layers, augment_eps=args.backbone_noise, k_neighbors=checkpoint['num_edges'])
+ model.to(device)
+ model.load_state_dict(checkpoint['model_state_dict'])
+ model.eval()
+
+ if print_all:
+ print(40*'-')
+ print('Number of edges:', checkpoint['num_edges'])
+ print(f'Training noise level: {noise_level_print}A')
+
+ # Timing
+ start_time = time.time()
+ total_residues = 0
+ protein_list = []
+ total_step = 0
+ with torch.no_grad():
+ test_sum, test_weights = 0., 0.
+ dms_df = pd.read_csv(args.DMS_data_folder + os.sep + DMS_filename)
+ # For loop should iterate once for our case (not using multiple structures/proteins)
+ if len(dataset_valid) > 1:
+ raise NotImplementedError("This script is meant for a single protein in dataset_valid. Has not yet been extended to multiple proteins")
+ for ix, protein in enumerate(dataset_valid):
+ score_list = []
+ global_score_list = []
+ all_probs_list = []
+ all_log_probs_list = []
+ S_sample_list = []
+ batch_clones = [copy.deepcopy(protein) for i in range(BATCH_COPIES)]
+ X, S, mask, lengths, chain_M, chain_encoding_all, chain_list_list, visible_list_list, masked_list_list, masked_chain_length_list_list, chain_M_pos, omit_AA_mask, residue_idx, dihedral_mask, tied_pos_list_of_lists_list, pssm_coef, pssm_bias, pssm_log_odds_all, bias_by_res_all, tied_beta = tied_featurize(batch_clones, device, chain_id_dict, fixed_positions_dict, omit_AA_dict, tied_positions_dict, pssm_dict, bias_by_res_dict, ca_only=False)
+ pssm_log_odds_mask = (pssm_log_odds_all > args.pssm_threshold).float()
+ name_ = batch_clones[0]['name']
+ loop_c = 0
+ fasta_names, fasta_seqs = dms_df["mutant"].tolist(), dms_df["mutated_sequence"].tolist()
+ loop_c = len(fasta_seqs)
+ global_score_list = []
+ global_seq_list = []
+ for fc in range(loop_c):
+ native_score_list = []
+ global_native_score_list = []
+ input_seq_length = len(fasta_seqs[fc])
+ S_input = torch.tensor([alphabet_dict[AA] for AA in fasta_seqs[fc]], device=device)[None,:].repeat(X.shape[0], 1)
+ S[:,:input_seq_length] = S_input #assumes that S and S_input are alphabetically sorted for masked_chains
+ for j in range(NUM_BATCHES):
+ randn_1 = torch.randn(chain_M.shape, device=X.device)
+ log_probs = model(X, S, mask, chain_M*chain_M_pos, residue_idx, chain_encoding_all, randn_1)
+ mask_for_loss = mask*chain_M*chain_M_pos
+ scores = _scores(S, log_probs, mask_for_loss)
+ native_score = scores.cpu().data.numpy()
+ native_score_list.append(native_score)
+ global_scores = _scores(S, log_probs, mask)
+ global_native_score = global_scores.cpu().data.numpy()
+ global_native_score_list.append(global_native_score)
+ native_score = np.concatenate(native_score_list, 0)
+ global_native_score = np.concatenate(global_native_score_list, 0)
+ ns_mean = native_score.mean()
+ ns_mean_print = np.format_float_positional(np.float32(ns_mean), unique=False, precision=4)
+ ns_std = native_score.std()
+ ns_std_print = np.format_float_positional(np.float32(ns_std), unique=False, precision=4)
+
+ global_ns_mean = global_native_score.mean()
+ global_ns_mean_print = np.format_float_positional(np.float32(global_ns_mean), unique=False, precision=4)
+ global_ns_std = global_native_score.std()
+ global_ns_std_print = np.format_float_positional(np.float32(global_ns_std), unique=False, precision=4)
+
+ ns_sample_size = native_score.shape[0]
+ seq_str = _S_to_seq(S[0,], chain_M[0,])
+ global_score_list.append(-1*global_native_score[0])
+ global_seq_list.append(seq_str)
+ if print_all:
+ if fc == 0:
+ print(f'Score for {name_} from PDB, mean: {ns_mean_print}, std: {ns_std_print}, sample size: {ns_sample_size}, global score, mean: {global_ns_mean_print}, std: {global_ns_std_print}, sample size: {ns_sample_size}')
+ else:
+ print(f'Score for {name_}_{fc} from FASTA, mean: {ns_mean_print}, std: {ns_std_print}, sample size: {ns_sample_size}, global score, mean: {global_ns_mean_print}, std: {global_ns_std_print}, sample size: {ns_sample_size}')
+ # sanity check
+ assert all([global_seq_list[i] == fasta_seqs[i] for i in range(len(global_seq_list))])
+ score_df = pd.DataFrame({'mutant': fasta_names, 'mutated_sequence': fasta_seqs, 'pmpnn_ll': global_score_list})
+ score_df.to_csv(output_filename, index=False)
+
+if __name__ == "__main__":
+ argparser = argparse.ArgumentParser(formatter_class=argparse.ArgumentDefaultsHelpFormatter)
+ argparser.add_argument('--DMS_reference_file_path',type=str,help='path to DMS reference file')
+ argparser.add_argument('--DMS_data_folder',type=str,help="path to folder containing DMS data")
+ argparser.add_argument('--structure_folder',type=str,help='folder containing pdb files for each DMS')
+ argparser.add_argument('--DMS_index',type=int,help='index of DMS in DMS reference file')
+ argparser.add_argument('--checkpoint',type=str,help='path to model')
+ argparser.add_argument("--suppress_print", type=int, default=0, help="0 for False, 1 for True")
+ argparser.add_argument("--seed", type=int, default=0, help="If set to 0 then a random seed will be picked;")
+ argparser.add_argument("--backbone_noise", type=float, default=0.00, help="Standard deviation of Gaussian noise to add to backbone atoms")
+ argparser.add_argument("--num_seq_per_target", type=int, default=1, help="Number of sequences to generate per target")
+ argparser.add_argument("--batch_size", type=int, default=1, help="Batch size; can set higher for titan, quadro GPUs, reduce this if running out of GPU memory")
+ argparser.add_argument("--max_length", type=int, default=200000, help="Max sequence length")
+ argparser.add_argument("--sampling_temp", type=str, default="0.1", help="A string of temperatures, 0.2 0.25 0.5. Sampling temperature for amino acids. Suggested values 0.1, 0.15, 0.2, 0.25, 0.3. Higher values will lead to more diversity.")
+ argparser.add_argument("--output_scores_folder", type=str, help="Path to a folder to output sequences, e.g. /home/out/")
+ argparser.add_argument("--pdb_path", type=str, default='', help="Path to a single PDB to be designed")
+ argparser.add_argument("--pdb_path_chains", type=str, default='', help="Define which chains need to be designed for a single PDB ")
+ argparser.add_argument("--jsonl_path", type=str, help="Path to a folder with parsed pdb into jsonl")
+ argparser.add_argument("--chain_id_jsonl",type=str, default='', help="Path to a dictionary specifying which chains need to be designed and which ones are fixed, if not specied all chains will be designed.")
+ argparser.add_argument("--fixed_positions_jsonl", type=str, default='', help="Path to a dictionary with fixed positions")
+ argparser.add_argument("--omit_AAs", type=list, default='X', help="Specify which amino acids should be omitted in the generated sequence, e.g. 'AC' would omit alanine and cystine.")
+ argparser.add_argument("--bias_AA_jsonl", type=str, default='', help="Path to a dictionary which specifies AA composion bias if neededi, e.g. {A: -1.1, F: 0.7} would make A less likely and F more likely.")
+ argparser.add_argument("--bias_by_res_jsonl", default='', help="Path to dictionary with per position bias.")
+ argparser.add_argument("--omit_AA_jsonl", type=str, default='', help="Path to a dictionary which specifies which amino acids need to be omited from design at specific chain indices")
+ argparser.add_argument("--pssm_jsonl", type=str, default='', help="Path to a dictionary with pssm")
+ argparser.add_argument("--pssm_multi", type=float, default=0.0, help="A value between [0.0, 1.0], 0.0 means do not use pssm, 1.0 ignore MPNN predictions")
+ argparser.add_argument("--pssm_threshold", type=float, default=0.0, help="A value between -inf + inf to restric per position AAs")
+ argparser.add_argument("--pssm_log_odds_flag", type=int, default=0, help="0 for False, 1 for True")
+ argparser.add_argument("--pssm_bias_flag", type=int, default=0, help="0 for False, 1 for True")
+ argparser.add_argument("--tied_positions_jsonl", type=str, default='', help="Path to a dictionary with tied positions")
+
+ args = argparser.parse_args()
+ main(args)
diff --git a/proteingym/baselines/protein_mpnn/protein_mpnn_utils.py b/proteingym/baselines/protein_mpnn/protein_mpnn_utils.py
new file mode 100755
index 0000000..a8fd008
--- /dev/null
+++ b/proteingym/baselines/protein_mpnn/protein_mpnn_utils.py
@@ -0,0 +1,1383 @@
+from __future__ import print_function
+import json, time, os, sys, glob
+import shutil
+import numpy as np
+import torch
+from torch import optim
+from torch.utils.data import DataLoader
+from torch.utils.data.dataset import random_split, Subset
+
+import copy
+import torch.nn as nn
+import torch.nn.functional as F
+import random
+import itertools
+
+#A number of functions/classes are adopted from: https://github.com/jingraham/neurips19-graph-protein-design
+
+def parse_fasta(filename,limit=-1, omit=[]):
+ header = []
+ sequence = []
+ lines = open(filename, "r")
+ for line in lines:
+ line = line.rstrip()
+ if line[0] == ">":
+ if len(header) == limit:
+ break
+ header.append(line[1:])
+ sequence.append([])
+ else:
+ if omit:
+ line = [item for item in line if item not in omit]
+ line = ''.join(line)
+ line = ''.join(line)
+ sequence[-1].append(line)
+ lines.close()
+ sequence = [''.join(seq) for seq in sequence]
+ return np.array(header), np.array(sequence)
+
+def _scores(S, log_probs, mask):
+ """ Negative log probabilities """
+ criterion = torch.nn.NLLLoss(reduction='none')
+ loss = criterion(
+ log_probs.contiguous().view(-1,log_probs.size(-1)),
+ S.contiguous().view(-1)
+ ).view(S.size())
+ scores = torch.sum(loss * mask, dim=-1) / torch.sum(mask, dim=-1)
+ return scores
+
+def _S_to_seq(S, mask):
+ alphabet = 'ACDEFGHIKLMNPQRSTVWYX'
+ seq = ''.join([alphabet[c] for c, m in zip(S.tolist(), mask.tolist()) if m > 0])
+ return seq
+
+def parse_PDB_biounits(x, atoms=['N','CA','C'], chain=None):
+ '''
+ input: x = PDB filename
+ atoms = atoms to extract (optional)
+ output: (length, atoms, coords=(x,y,z)), sequence
+ '''
+
+ alpha_1 = list("ARNDCQEGHILKMFPSTWYV-")
+ states = len(alpha_1)
+ alpha_3 = ['ALA','ARG','ASN','ASP','CYS','GLN','GLU','GLY','HIS','ILE',
+ 'LEU','LYS','MET','PHE','PRO','SER','THR','TRP','TYR','VAL','GAP']
+
+ aa_1_N = {a:n for n,a in enumerate(alpha_1)}
+ aa_3_N = {a:n for n,a in enumerate(alpha_3)}
+ aa_N_1 = {n:a for n,a in enumerate(alpha_1)}
+ aa_1_3 = {a:b for a,b in zip(alpha_1,alpha_3)}
+ aa_3_1 = {b:a for a,b in zip(alpha_1,alpha_3)}
+
+ def AA_to_N(x):
+ # ["ARND"] -> [[0,1,2,3]]
+ x = np.array(x);
+ if x.ndim == 0: x = x[None]
+ return [[aa_1_N.get(a, states-1) for a in y] for y in x]
+
+ def N_to_AA(x):
+ # [[0,1,2,3]] -> ["ARND"]
+ x = np.array(x);
+ if x.ndim == 1: x = x[None]
+ return ["".join([aa_N_1.get(a,"-") for a in y]) for y in x]
+
+ xyz,seq,min_resn,max_resn = {},{},1e6,-1e6
+ for line in open(x,"rb"):
+ line = line.decode("utf-8","ignore").rstrip()
+
+ if line[:6] == "HETATM" and line[17:17+3] == "MSE":
+ line = line.replace("HETATM","ATOM ")
+ line = line.replace("MSE","MET")
+
+ if line[:4] == "ATOM":
+ ch = line[21:22]
+ if ch == chain or chain is None:
+ atom = line[12:12+4].strip()
+ resi = line[17:17+3]
+ resn = line[22:22+5].strip()
+ x,y,z = [float(line[i:(i+8)]) for i in [30,38,46]]
+
+ if resn[-1].isalpha():
+ resa,resn = resn[-1],int(resn[:-1])-1
+ else:
+ resa,resn = "",int(resn)-1
+# resn = int(resn)
+ if resn < min_resn:
+ min_resn = resn
+ if resn > max_resn:
+ max_resn = resn
+ if resn not in xyz:
+ xyz[resn] = {}
+ if resa not in xyz[resn]:
+ xyz[resn][resa] = {}
+ if resn not in seq:
+ seq[resn] = {}
+ if resa not in seq[resn]:
+ seq[resn][resa] = resi
+
+ if atom not in xyz[resn][resa]:
+ xyz[resn][resa][atom] = np.array([x,y,z])
+
+ # convert to numpy arrays, fill in missing values
+ seq_,xyz_ = [],[]
+ try:
+ for resn in range(min_resn,max_resn+1):
+ if resn in seq:
+ for k in sorted(seq[resn]): seq_.append(aa_3_N.get(seq[resn][k],20))
+ else: seq_.append(20)
+ if resn in xyz:
+ for k in sorted(xyz[resn]):
+ for atom in atoms:
+ if atom in xyz[resn][k]: xyz_.append(xyz[resn][k][atom])
+ else: xyz_.append(np.full(3,np.nan))
+ else:
+ for atom in atoms: xyz_.append(np.full(3,np.nan))
+ return np.array(xyz_).reshape(-1,len(atoms),3), N_to_AA(np.array(seq_))
+ except TypeError:
+ return 'no_chain', 'no_chain'
+
+def parse_PDB(path_to_pdb, input_chain_list=None, ca_only=False):
+ c=0
+ pdb_dict_list = []
+ init_alphabet = ['A', 'B', 'C', 'D', 'E', 'F', 'G','H', 'I', 'J','K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T','U', 'V','W','X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g','h', 'i', 'j','k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't','u', 'v','w','x', 'y', 'z']
+ extra_alphabet = [str(item) for item in list(np.arange(300))]
+ chain_alphabet = init_alphabet + extra_alphabet
+
+ if input_chain_list:
+ chain_alphabet = input_chain_list
+
+
+ biounit_names = [path_to_pdb]
+ for biounit in biounit_names:
+ my_dict = {}
+ s = 0
+ concat_seq = ''
+ concat_N = []
+ concat_CA = []
+ concat_C = []
+ concat_O = []
+ concat_mask = []
+ coords_dict = {}
+ for letter in chain_alphabet:
+ if ca_only:
+ sidechain_atoms = ['CA']
+ else:
+ sidechain_atoms = ['N', 'CA', 'C', 'O']
+ xyz, seq = parse_PDB_biounits(biounit, atoms=sidechain_atoms, chain=letter)
+ if type(xyz) != str:
+ concat_seq += seq[0]
+ my_dict['seq_chain_'+letter]=seq[0]
+ coords_dict_chain = {}
+ if ca_only:
+ coords_dict_chain['CA_chain_'+letter]=xyz.tolist()
+ else:
+ coords_dict_chain['N_chain_' + letter] = xyz[:, 0, :].tolist()
+ coords_dict_chain['CA_chain_' + letter] = xyz[:, 1, :].tolist()
+ coords_dict_chain['C_chain_' + letter] = xyz[:, 2, :].tolist()
+ coords_dict_chain['O_chain_' + letter] = xyz[:, 3, :].tolist()
+ my_dict['coords_chain_'+letter]=coords_dict_chain
+ s += 1
+ fi = biounit.rfind("/")
+ my_dict['name']=biounit[(fi+1):-4]
+ my_dict['num_of_chains'] = s
+ my_dict['seq'] = concat_seq
+ if s <= len(chain_alphabet):
+ pdb_dict_list.append(my_dict)
+ c+=1
+ return pdb_dict_list
+
+
+
+def tied_featurize(batch, device, chain_dict, fixed_position_dict=None, omit_AA_dict=None, tied_positions_dict=None, pssm_dict=None, bias_by_res_dict=None, ca_only=False):
+ """ Pack and pad batch into torch tensors """
+ alphabet = 'ACDEFGHIKLMNPQRSTVWYX'
+ B = len(batch)
+ lengths = np.array([len(b['seq']) for b in batch], dtype=np.int32) #sum of chain seq lengths
+ L_max = max([len(b['seq']) for b in batch])
+ if ca_only:
+ X = np.zeros([B, L_max, 1, 3])
+ else:
+ X = np.zeros([B, L_max, 4, 3])
+ residue_idx = -100*np.ones([B, L_max], dtype=np.int32)
+ chain_M = np.zeros([B, L_max], dtype=np.int32) #1.0 for the bits that need to be predicted
+ pssm_coef_all = np.zeros([B, L_max], dtype=np.float32) #1.0 for the bits that need to be predicted
+ pssm_bias_all = np.zeros([B, L_max, 21], dtype=np.float32) #1.0 for the bits that need to be predicted
+ pssm_log_odds_all = 10000.0*np.ones([B, L_max, 21], dtype=np.float32) #1.0 for the bits that need to be predicted
+ chain_M_pos = np.zeros([B, L_max], dtype=np.int32) #1.0 for the bits that need to be predicted
+ bias_by_res_all = np.zeros([B, L_max, 21], dtype=np.float32)
+ chain_encoding_all = np.zeros([B, L_max], dtype=np.int32) #1.0 for the bits that need to be predicted
+ S = np.zeros([B, L_max], dtype=np.int32)
+ omit_AA_mask = np.zeros([B, L_max, len(alphabet)], dtype=np.int32)
+ # Build the batch
+ letter_list_list = []
+ visible_list_list = []
+ masked_list_list = []
+ masked_chain_length_list_list = []
+ tied_pos_list_of_lists_list = []
+ for i, b in enumerate(batch):
+ if chain_dict != None:
+ masked_chains, visible_chains = chain_dict[b['name']] #masked_chains a list of chain letters to predict [A, D, F]
+ else:
+ masked_chains = [item[-1:] for item in list(b) if item[:10]=='seq_chain_']
+ visible_chains = []
+ masked_chains.sort() #sort masked_chains
+ visible_chains.sort() #sort visible_chains
+ all_chains = masked_chains + visible_chains
+ for i, b in enumerate(batch):
+ mask_dict = {}
+ a = 0
+ x_chain_list = []
+ chain_mask_list = []
+ chain_seq_list = []
+ chain_encoding_list = []
+ c = 1
+ letter_list = []
+ global_idx_start_list = [0]
+ visible_list = []
+ masked_list = []
+ masked_chain_length_list = []
+ fixed_position_mask_list = []
+ omit_AA_mask_list = []
+ pssm_coef_list = []
+ pssm_bias_list = []
+ pssm_log_odds_list = []
+ bias_by_res_list = []
+ l0 = 0
+ l1 = 0
+ for step, letter in enumerate(all_chains):
+ if letter in visible_chains:
+ letter_list.append(letter)
+ visible_list.append(letter)
+ chain_seq = b[f'seq_chain_{letter}']
+ chain_seq = ''.join([a if a!='-' else 'X' for a in chain_seq])
+ chain_length = len(chain_seq)
+ global_idx_start_list.append(global_idx_start_list[-1]+chain_length)
+ chain_coords = b[f'coords_chain_{letter}'] #this is a dictionary
+ chain_mask = np.zeros(chain_length) #0.0 for visible chains
+ if ca_only:
+ x_chain = np.array(chain_coords[f'CA_chain_{letter}']) #[chain_lenght,1,3] #CA_diff
+ if len(x_chain.shape) == 2:
+ x_chain = x_chain[:,None,:]
+ else:
+ x_chain = np.stack([chain_coords[c] for c in [f'N_chain_{letter}', f'CA_chain_{letter}', f'C_chain_{letter}', f'O_chain_{letter}']], 1) #[chain_lenght,4,3]
+ x_chain_list.append(x_chain)
+ chain_mask_list.append(chain_mask)
+ chain_seq_list.append(chain_seq)
+ chain_encoding_list.append(c*np.ones(np.array(chain_mask).shape[0]))
+ l1 += chain_length
+ residue_idx[i, l0:l1] = 100*(c-1)+np.arange(l0, l1)
+ l0 += chain_length
+ c+=1
+ fixed_position_mask = np.ones(chain_length)
+ fixed_position_mask_list.append(fixed_position_mask)
+ omit_AA_mask_temp = np.zeros([chain_length, len(alphabet)], np.int32)
+ omit_AA_mask_list.append(omit_AA_mask_temp)
+ pssm_coef = np.zeros(chain_length)
+ pssm_bias = np.zeros([chain_length, 21])
+ pssm_log_odds = 10000.0*np.ones([chain_length, 21])
+ pssm_coef_list.append(pssm_coef)
+ pssm_bias_list.append(pssm_bias)
+ pssm_log_odds_list.append(pssm_log_odds)
+ bias_by_res_list.append(np.zeros([chain_length, 21]))
+ if letter in masked_chains:
+ masked_list.append(letter)
+ letter_list.append(letter)
+ chain_seq = b[f'seq_chain_{letter}']
+ chain_seq = ''.join([a if a!='-' else 'X' for a in chain_seq])
+ chain_length = len(chain_seq)
+ global_idx_start_list.append(global_idx_start_list[-1]+chain_length)
+ masked_chain_length_list.append(chain_length)
+ chain_coords = b[f'coords_chain_{letter}'] #this is a dictionary
+ chain_mask = np.ones(chain_length) #1.0 for masked
+ if ca_only:
+ x_chain = np.array(chain_coords[f'CA_chain_{letter}']) #[chain_lenght,1,3] #CA_diff
+ if len(x_chain.shape) == 2:
+ x_chain = x_chain[:,None,:]
+ else:
+ x_chain = np.stack([chain_coords[c] for c in [f'N_chain_{letter}', f'CA_chain_{letter}', f'C_chain_{letter}', f'O_chain_{letter}']], 1) #[chain_lenght,4,3]
+ x_chain_list.append(x_chain)
+ chain_mask_list.append(chain_mask)
+ chain_seq_list.append(chain_seq)
+ chain_encoding_list.append(c*np.ones(np.array(chain_mask).shape[0]))
+ l1 += chain_length
+ residue_idx[i, l0:l1] = 100*(c-1)+np.arange(l0, l1)
+ l0 += chain_length
+ c+=1
+ fixed_position_mask = np.ones(chain_length)
+ if fixed_position_dict!=None:
+ fixed_pos_list = fixed_position_dict[b['name']][letter]
+ if fixed_pos_list:
+ fixed_position_mask[np.array(fixed_pos_list)-1] = 0.0
+ fixed_position_mask_list.append(fixed_position_mask)
+ omit_AA_mask_temp = np.zeros([chain_length, len(alphabet)], np.int32)
+ if omit_AA_dict!=None:
+ for item in omit_AA_dict[b['name']][letter]:
+ idx_AA = np.array(item[0])-1
+ AA_idx = np.array([np.argwhere(np.array(list(alphabet))== AA)[0][0] for AA in item[1]]).repeat(idx_AA.shape[0])
+ idx_ = np.array([[a, b] for a in idx_AA for b in AA_idx])
+ omit_AA_mask_temp[idx_[:,0], idx_[:,1]] = 1
+ omit_AA_mask_list.append(omit_AA_mask_temp)
+ pssm_coef = np.zeros(chain_length)
+ pssm_bias = np.zeros([chain_length, 21])
+ pssm_log_odds = 10000.0*np.ones([chain_length, 21])
+ if pssm_dict:
+ if pssm_dict[b['name']][letter]:
+ pssm_coef = pssm_dict[b['name']][letter]['pssm_coef']
+ pssm_bias = pssm_dict[b['name']][letter]['pssm_bias']
+ pssm_log_odds = pssm_dict[b['name']][letter]['pssm_log_odds']
+ pssm_coef_list.append(pssm_coef)
+ pssm_bias_list.append(pssm_bias)
+ pssm_log_odds_list.append(pssm_log_odds)
+ if bias_by_res_dict:
+ bias_by_res_list.append(bias_by_res_dict[b['name']][letter])
+ else:
+ bias_by_res_list.append(np.zeros([chain_length, 21]))
+
+
+ letter_list_np = np.array(letter_list)
+ tied_pos_list_of_lists = []
+ tied_beta = np.ones(L_max)
+ if tied_positions_dict!=None:
+ tied_pos_list = tied_positions_dict[b['name']]
+ if tied_pos_list:
+ set_chains_tied = set(list(itertools.chain(*[list(item) for item in tied_pos_list])))
+ for tied_item in tied_pos_list:
+ one_list = []
+ for k, v in tied_item.items():
+ start_idx = global_idx_start_list[np.argwhere(letter_list_np == k)[0][0]]
+ if isinstance(v[0], list):
+ for v_count in range(len(v[0])):
+ one_list.append(start_idx+v[0][v_count]-1)#make 0 to be the first
+ tied_beta[start_idx+v[0][v_count]-1] = v[1][v_count]
+ else:
+ for v_ in v:
+ one_list.append(start_idx+v_-1)#make 0 to be the first
+ tied_pos_list_of_lists.append(one_list)
+ tied_pos_list_of_lists_list.append(tied_pos_list_of_lists)
+
+
+
+ x = np.concatenate(x_chain_list,0) #[L, 4, 3]
+ all_sequence = "".join(chain_seq_list)
+ m = np.concatenate(chain_mask_list,0) #[L,], 1.0 for places that need to be predicted
+ chain_encoding = np.concatenate(chain_encoding_list,0)
+ m_pos = np.concatenate(fixed_position_mask_list,0) #[L,], 1.0 for places that need to be predicted
+
+ pssm_coef_ = np.concatenate(pssm_coef_list,0) #[L,], 1.0 for places that need to be predicted
+ pssm_bias_ = np.concatenate(pssm_bias_list,0) #[L,], 1.0 for places that need to be predicted
+ pssm_log_odds_ = np.concatenate(pssm_log_odds_list,0) #[L,], 1.0 for places that need to be predicted
+
+ bias_by_res_ = np.concatenate(bias_by_res_list, 0) #[L,21], 0.0 for places where AA frequencies don't need to be tweaked
+
+ l = len(all_sequence)
+ x_pad = np.pad(x, [[0,L_max-l], [0,0], [0,0]], 'constant', constant_values=(np.nan, ))
+ X[i,:,:,:] = x_pad
+
+ m_pad = np.pad(m, [[0,L_max-l]], 'constant', constant_values=(0.0, ))
+ m_pos_pad = np.pad(m_pos, [[0,L_max-l]], 'constant', constant_values=(0.0, ))
+ omit_AA_mask_pad = np.pad(np.concatenate(omit_AA_mask_list,0), [[0,L_max-l]], 'constant', constant_values=(0.0, ))
+ chain_M[i,:] = m_pad
+ chain_M_pos[i,:] = m_pos_pad
+ omit_AA_mask[i,] = omit_AA_mask_pad
+
+ chain_encoding_pad = np.pad(chain_encoding, [[0,L_max-l]], 'constant', constant_values=(0.0, ))
+ chain_encoding_all[i,:] = chain_encoding_pad
+
+ pssm_coef_pad = np.pad(pssm_coef_, [[0,L_max-l]], 'constant', constant_values=(0.0, ))
+ pssm_bias_pad = np.pad(pssm_bias_, [[0,L_max-l], [0,0]], 'constant', constant_values=(0.0, ))
+ pssm_log_odds_pad = np.pad(pssm_log_odds_, [[0,L_max-l], [0,0]], 'constant', constant_values=(0.0, ))
+
+ pssm_coef_all[i,:] = pssm_coef_pad
+ pssm_bias_all[i,:] = pssm_bias_pad
+ pssm_log_odds_all[i,:] = pssm_log_odds_pad
+
+ bias_by_res_pad = np.pad(bias_by_res_, [[0,L_max-l], [0,0]], 'constant', constant_values=(0.0, ))
+ bias_by_res_all[i,:] = bias_by_res_pad
+
+ # Convert to labels
+ indices = np.asarray([alphabet.index(a) for a in all_sequence], dtype=np.int32)
+ S[i, :l] = indices
+ letter_list_list.append(letter_list)
+ visible_list_list.append(visible_list)
+ masked_list_list.append(masked_list)
+ masked_chain_length_list_list.append(masked_chain_length_list)
+
+
+ isnan = np.isnan(X)
+ mask = np.isfinite(np.sum(X,(2,3))).astype(np.float32)
+ X[isnan] = 0.
+
+ # Conversion
+ pssm_coef_all = torch.from_numpy(pssm_coef_all).to(dtype=torch.float32, device=device)
+ pssm_bias_all = torch.from_numpy(pssm_bias_all).to(dtype=torch.float32, device=device)
+ pssm_log_odds_all = torch.from_numpy(pssm_log_odds_all).to(dtype=torch.float32, device=device)
+
+ tied_beta = torch.from_numpy(tied_beta).to(dtype=torch.float32, device=device)
+
+ jumps = ((residue_idx[:,1:]-residue_idx[:,:-1])==1).astype(np.float32)
+ bias_by_res_all = torch.from_numpy(bias_by_res_all).to(dtype=torch.float32, device=device)
+ phi_mask = np.pad(jumps, [[0,0],[1,0]])
+ psi_mask = np.pad(jumps, [[0,0],[0,1]])
+ omega_mask = np.pad(jumps, [[0,0],[0,1]])
+ dihedral_mask = np.concatenate([phi_mask[:,:,None], psi_mask[:,:,None], omega_mask[:,:,None]], -1) #[B,L,3]
+ dihedral_mask = torch.from_numpy(dihedral_mask).to(dtype=torch.float32, device=device)
+ residue_idx = torch.from_numpy(residue_idx).to(dtype=torch.long,device=device)
+ S = torch.from_numpy(S).to(dtype=torch.long,device=device)
+ X = torch.from_numpy(X).to(dtype=torch.float32, device=device)
+ mask = torch.from_numpy(mask).to(dtype=torch.float32, device=device)
+ chain_M = torch.from_numpy(chain_M).to(dtype=torch.float32, device=device)
+ chain_M_pos = torch.from_numpy(chain_M_pos).to(dtype=torch.float32, device=device)
+ omit_AA_mask = torch.from_numpy(omit_AA_mask).to(dtype=torch.float32, device=device)
+ chain_encoding_all = torch.from_numpy(chain_encoding_all).to(dtype=torch.long, device=device)
+ if ca_only:
+ X_out = X[:,:,0]
+ else:
+ X_out = X
+ return X_out, S, mask, lengths, chain_M, chain_encoding_all, letter_list_list, visible_list_list, masked_list_list, masked_chain_length_list_list, chain_M_pos, omit_AA_mask, residue_idx, dihedral_mask, tied_pos_list_of_lists_list, pssm_coef_all, pssm_bias_all, pssm_log_odds_all, bias_by_res_all, tied_beta
+
+
+
+def loss_nll(S, log_probs, mask):
+ """ Negative log probabilities """
+ criterion = torch.nn.NLLLoss(reduction='none')
+ loss = criterion(
+ log_probs.contiguous().view(-1, log_probs.size(-1)), S.contiguous().view(-1)
+ ).view(S.size())
+ loss_av = torch.sum(loss * mask) / torch.sum(mask)
+ return loss, loss_av
+
+
+def loss_smoothed(S, log_probs, mask, weight=0.1):
+ """ Negative log probabilities """
+ S_onehot = torch.nn.functional.one_hot(S, 21).float()
+
+ # Label smoothing
+ S_onehot = S_onehot + weight / float(S_onehot.size(-1))
+ S_onehot = S_onehot / S_onehot.sum(-1, keepdim=True)
+
+ loss = -(S_onehot * log_probs).sum(-1)
+ loss_av = torch.sum(loss * mask) / torch.sum(mask)
+ return loss, loss_av
+
+class StructureDataset():
+ def __init__(self, jsonl_file, verbose=True, truncate=None, max_length=100,
+ alphabet='ACDEFGHIKLMNPQRSTVWYX-'):
+ alphabet_set = set([a for a in alphabet])
+ discard_count = {
+ 'bad_chars': 0,
+ 'too_long': 0,
+ 'bad_seq_length': 0
+ }
+
+ with open(jsonl_file) as f:
+ self.data = []
+
+ lines = f.readlines()
+ start = time.time()
+ for i, line in enumerate(lines):
+ entry = json.loads(line)
+ seq = entry['seq']
+ name = entry['name']
+
+ # Convert raw coords to np arrays
+ #for key, val in entry['coords'].items():
+ # entry['coords'][key] = np.asarray(val)
+
+ # Check if in alphabet
+ bad_chars = set([s for s in seq]).difference(alphabet_set)
+ if len(bad_chars) == 0:
+ if len(entry['seq']) <= max_length:
+ if True:
+ self.data.append(entry)
+ else:
+ discard_count['bad_seq_length'] += 1
+ else:
+ discard_count['too_long'] += 1
+ else:
+ if verbose:
+ print(name, bad_chars, entry['seq'])
+ discard_count['bad_chars'] += 1
+
+ # Truncate early
+ if truncate is not None and len(self.data) == truncate:
+ return
+
+ if verbose and (i + 1) % 1000 == 0:
+ elapsed = time.time() - start
+ print('{} entries ({} loaded) in {:.1f} s'.format(len(self.data), i+1, elapsed))
+ if verbose:
+ print('discarded', discard_count)
+ def __len__(self):
+ return len(self.data)
+
+ def __getitem__(self, idx):
+ return self.data[idx]
+
+
+class StructureDatasetPDB():
+ def __init__(self, pdb_dict_list, verbose=True, truncate=None, max_length=100,
+ alphabet='ACDEFGHIKLMNPQRSTVWYX-'):
+ alphabet_set = set([a for a in alphabet])
+ discard_count = {
+ 'bad_chars': 0,
+ 'too_long': 0,
+ 'bad_seq_length': 0
+ }
+
+ self.data = []
+
+ start = time.time()
+ for i, entry in enumerate(pdb_dict_list):
+ seq = entry['seq']
+ name = entry['name']
+
+ bad_chars = set([s for s in seq]).difference(alphabet_set)
+ if len(bad_chars) == 0:
+ if len(entry['seq']) <= max_length:
+ self.data.append(entry)
+ else:
+ discard_count['too_long'] += 1
+ else:
+ discard_count['bad_chars'] += 1
+
+ # Truncate early
+ if truncate is not None and len(self.data) == truncate:
+ return
+
+ if verbose and (i + 1) % 1000 == 0:
+ elapsed = time.time() - start
+
+ #print('Discarded', discard_count)
+ def __len__(self):
+ return len(self.data)
+
+ def __getitem__(self, idx):
+ return self.data[idx]
+
+
+
+class StructureLoader():
+ def __init__(self, dataset, batch_size=100, shuffle=True,
+ collate_fn=lambda x:x, drop_last=False):
+ self.dataset = dataset
+ self.size = len(dataset)
+ self.lengths = [len(dataset[i]['seq']) for i in range(self.size)]
+ self.batch_size = batch_size
+ sorted_ix = np.argsort(self.lengths)
+
+ # Cluster into batches of similar sizes
+ clusters, batch = [], []
+ batch_max = 0
+ for ix in sorted_ix:
+ size = self.lengths[ix]
+ if size * (len(batch) + 1) <= self.batch_size:
+ batch.append(ix)
+ batch_max = size
+ else:
+ clusters.append(batch)
+ batch, batch_max = [], 0
+ if len(batch) > 0:
+ clusters.append(batch)
+ self.clusters = clusters
+
+ def __len__(self):
+ return len(self.clusters)
+
+ def __iter__(self):
+ np.random.shuffle(self.clusters)
+ for b_idx in self.clusters:
+ batch = [self.dataset[i] for i in b_idx]
+ yield batch
+
+
+
+# The following gather functions
+def gather_edges(edges, neighbor_idx):
+ # Features [B,N,N,C] at Neighbor indices [B,N,K] => Neighbor features [B,N,K,C]
+ neighbors = neighbor_idx.unsqueeze(-1).expand(-1, -1, -1, edges.size(-1))
+ edge_features = torch.gather(edges, 2, neighbors)
+ return edge_features
+
+def gather_nodes(nodes, neighbor_idx):
+ # Features [B,N,C] at Neighbor indices [B,N,K] => [B,N,K,C]
+ # Flatten and expand indices per batch [B,N,K] => [B,NK] => [B,NK,C]
+ neighbors_flat = neighbor_idx.view((neighbor_idx.shape[0], -1))
+ neighbors_flat = neighbors_flat.unsqueeze(-1).expand(-1, -1, nodes.size(2))
+ # Gather and re-pack
+ neighbor_features = torch.gather(nodes, 1, neighbors_flat)
+ neighbor_features = neighbor_features.view(list(neighbor_idx.shape)[:3] + [-1])
+ return neighbor_features
+
+def gather_nodes_t(nodes, neighbor_idx):
+ # Features [B,N,C] at Neighbor index [B,K] => Neighbor features[B,K,C]
+ idx_flat = neighbor_idx.unsqueeze(-1).expand(-1, -1, nodes.size(2))
+ neighbor_features = torch.gather(nodes, 1, idx_flat)
+ return neighbor_features
+
+def cat_neighbors_nodes(h_nodes, h_neighbors, E_idx):
+ h_nodes = gather_nodes(h_nodes, E_idx)
+ h_nn = torch.cat([h_neighbors, h_nodes], -1)
+ return h_nn
+
+
+class EncLayer(nn.Module):
+ def __init__(self, num_hidden, num_in, dropout=0.1, num_heads=None, scale=30):
+ super(EncLayer, self).__init__()
+ self.num_hidden = num_hidden
+ self.num_in = num_in
+ self.scale = scale
+ self.dropout1 = nn.Dropout(dropout)
+ self.dropout2 = nn.Dropout(dropout)
+ self.dropout3 = nn.Dropout(dropout)
+ self.norm1 = nn.LayerNorm(num_hidden)
+ self.norm2 = nn.LayerNorm(num_hidden)
+ self.norm3 = nn.LayerNorm(num_hidden)
+
+ self.W1 = nn.Linear(num_hidden + num_in, num_hidden, bias=True)
+ self.W2 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.W3 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.W11 = nn.Linear(num_hidden + num_in, num_hidden, bias=True)
+ self.W12 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.W13 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.act = torch.nn.GELU()
+ self.dense = PositionWiseFeedForward(num_hidden, num_hidden * 4)
+
+ def forward(self, h_V, h_E, E_idx, mask_V=None, mask_attend=None):
+ """ Parallel computation of full transformer layer """
+
+ h_EV = cat_neighbors_nodes(h_V, h_E, E_idx)
+ h_V_expand = h_V.unsqueeze(-2).expand(-1,-1,h_EV.size(-2),-1)
+ h_EV = torch.cat([h_V_expand, h_EV], -1)
+ h_message = self.W3(self.act(self.W2(self.act(self.W1(h_EV)))))
+ if mask_attend is not None:
+ h_message = mask_attend.unsqueeze(-1) * h_message
+ dh = torch.sum(h_message, -2) / self.scale
+ h_V = self.norm1(h_V + self.dropout1(dh))
+
+ dh = self.dense(h_V)
+ h_V = self.norm2(h_V + self.dropout2(dh))
+ if mask_V is not None:
+ mask_V = mask_V.unsqueeze(-1)
+ h_V = mask_V * h_V
+
+ h_EV = cat_neighbors_nodes(h_V, h_E, E_idx)
+ h_V_expand = h_V.unsqueeze(-2).expand(-1,-1,h_EV.size(-2),-1)
+ h_EV = torch.cat([h_V_expand, h_EV], -1)
+ h_message = self.W13(self.act(self.W12(self.act(self.W11(h_EV)))))
+ h_E = self.norm3(h_E + self.dropout3(h_message))
+ return h_V, h_E
+
+
+class DecLayer(nn.Module):
+ def __init__(self, num_hidden, num_in, dropout=0.1, num_heads=None, scale=30):
+ super(DecLayer, self).__init__()
+ self.num_hidden = num_hidden
+ self.num_in = num_in
+ self.scale = scale
+ self.dropout1 = nn.Dropout(dropout)
+ self.dropout2 = nn.Dropout(dropout)
+ self.norm1 = nn.LayerNorm(num_hidden)
+ self.norm2 = nn.LayerNorm(num_hidden)
+
+ self.W1 = nn.Linear(num_hidden + num_in, num_hidden, bias=True)
+ self.W2 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.W3 = nn.Linear(num_hidden, num_hidden, bias=True)
+ self.act = torch.nn.GELU()
+ self.dense = PositionWiseFeedForward(num_hidden, num_hidden * 4)
+
+ def forward(self, h_V, h_E, mask_V=None, mask_attend=None):
+ """ Parallel computation of full transformer layer """
+
+ # Concatenate h_V_i to h_E_ij
+ h_V_expand = h_V.unsqueeze(-2).expand(-1,-1,h_E.size(-2),-1)
+ h_EV = torch.cat([h_V_expand, h_E], -1)
+
+ h_message = self.W3(self.act(self.W2(self.act(self.W1(h_EV)))))
+ if mask_attend is not None:
+ h_message = mask_attend.unsqueeze(-1) * h_message
+ dh = torch.sum(h_message, -2) / self.scale
+
+ h_V = self.norm1(h_V + self.dropout1(dh))
+
+ # Position-wise feedforward
+ dh = self.dense(h_V)
+ h_V = self.norm2(h_V + self.dropout2(dh))
+
+ if mask_V is not None:
+ mask_V = mask_V.unsqueeze(-1)
+ h_V = mask_V * h_V
+ return h_V
+
+
+
+class PositionWiseFeedForward(nn.Module):
+ def __init__(self, num_hidden, num_ff):
+ super(PositionWiseFeedForward, self).__init__()
+ self.W_in = nn.Linear(num_hidden, num_ff, bias=True)
+ self.W_out = nn.Linear(num_ff, num_hidden, bias=True)
+ self.act = torch.nn.GELU()
+ def forward(self, h_V):
+ h = self.act(self.W_in(h_V))
+ h = self.W_out(h)
+ return h
+
+class PositionalEncodings(nn.Module):
+ def __init__(self, num_embeddings, max_relative_feature=32):
+ super(PositionalEncodings, self).__init__()
+ self.num_embeddings = num_embeddings
+ self.max_relative_feature = max_relative_feature
+ self.linear = nn.Linear(2*max_relative_feature+1+1, num_embeddings)
+
+ def forward(self, offset, mask):
+ d = torch.clip(offset + self.max_relative_feature, 0, 2*self.max_relative_feature)*mask + (1-mask)*(2*self.max_relative_feature+1)
+ d_onehot = torch.nn.functional.one_hot(d, 2*self.max_relative_feature+1+1)
+ E = self.linear(d_onehot.float())
+ return E
+
+
+
+class CA_ProteinFeatures(nn.Module):
+ def __init__(self, edge_features, node_features, num_positional_embeddings=16,
+ num_rbf=16, top_k=30, augment_eps=0., num_chain_embeddings=16):
+ """ Extract protein features """
+ super(CA_ProteinFeatures, self).__init__()
+ self.edge_features = edge_features
+ self.node_features = node_features
+ self.top_k = top_k
+ self.augment_eps = augment_eps
+ self.num_rbf = num_rbf
+ self.num_positional_embeddings = num_positional_embeddings
+
+ # Positional encoding
+ self.embeddings = PositionalEncodings(num_positional_embeddings)
+ # Normalization and embedding
+ node_in, edge_in = 3, num_positional_embeddings + num_rbf*9 + 7
+ self.node_embedding = nn.Linear(node_in, node_features, bias=False) #NOT USED
+ self.edge_embedding = nn.Linear(edge_in, edge_features, bias=False)
+ self.norm_nodes = nn.LayerNorm(node_features)
+ self.norm_edges = nn.LayerNorm(edge_features)
+
+
+ def _quaternions(self, R):
+ """ Convert a batch of 3D rotations [R] to quaternions [Q]
+ R [...,3,3]
+ Q [...,4]
+ """
+ # Simple Wikipedia version
+ # en.wikipedia.org/wiki/Rotation_matrix#Quaternion
+ # For other options see math.stackexchange.com/questions/2074316/calculating-rotation-axis-from-rotation-matrix
+ diag = torch.diagonal(R, dim1=-2, dim2=-1)
+ Rxx, Ryy, Rzz = diag.unbind(-1)
+ magnitudes = 0.5 * torch.sqrt(torch.abs(1 + torch.stack([
+ Rxx - Ryy - Rzz,
+ - Rxx + Ryy - Rzz,
+ - Rxx - Ryy + Rzz
+ ], -1)))
+ _R = lambda i,j: R[:,:,:,i,j]
+ signs = torch.sign(torch.stack([
+ _R(2,1) - _R(1,2),
+ _R(0,2) - _R(2,0),
+ _R(1,0) - _R(0,1)
+ ], -1))
+ xyz = signs * magnitudes
+ # The relu enforces a non-negative trace
+ w = torch.sqrt(F.relu(1 + diag.sum(-1, keepdim=True))) / 2.
+ Q = torch.cat((xyz, w), -1)
+ Q = F.normalize(Q, dim=-1)
+ return Q
+
+ def _orientations_coarse(self, X, E_idx, eps=1e-6):
+ dX = X[:,1:,:] - X[:,:-1,:]
+ dX_norm = torch.norm(dX,dim=-1)
+ dX_mask = (3.6 0:
+ Ca = Ca + self.augment_eps * torch.randn_like(Ca)
+
+ D_neighbors, E_idx, mask_neighbors = self._dist(Ca, mask)
+
+ Ca_0 = torch.zeros(Ca.shape, device=Ca.device)
+ Ca_2 = torch.zeros(Ca.shape, device=Ca.device)
+ Ca_0[:,1:,:] = Ca[:,:-1,:]
+ Ca_1 = Ca
+ Ca_2[:,:-1,:] = Ca[:,1:,:]
+
+ V, O_features = self._orientations_coarse(Ca, E_idx)
+
+ RBF_all = []
+ RBF_all.append(self._rbf(D_neighbors)) #Ca_1-Ca_1
+ RBF_all.append(self._get_rbf(Ca_0, Ca_0, E_idx))
+ RBF_all.append(self._get_rbf(Ca_2, Ca_2, E_idx))
+
+ RBF_all.append(self._get_rbf(Ca_0, Ca_1, E_idx))
+ RBF_all.append(self._get_rbf(Ca_0, Ca_2, E_idx))
+
+ RBF_all.append(self._get_rbf(Ca_1, Ca_0, E_idx))
+ RBF_all.append(self._get_rbf(Ca_1, Ca_2, E_idx))
+
+ RBF_all.append(self._get_rbf(Ca_2, Ca_0, E_idx))
+ RBF_all.append(self._get_rbf(Ca_2, Ca_1, E_idx))
+
+
+ RBF_all = torch.cat(tuple(RBF_all), dim=-1)
+
+
+ offset = residue_idx[:,:,None]-residue_idx[:,None,:]
+ offset = gather_edges(offset[:,:,:,None], E_idx)[:,:,:,0] #[B, L, K]
+
+ d_chains = ((chain_labels[:, :, None] - chain_labels[:,None,:])==0).long()
+ E_chains = gather_edges(d_chains[:,:,:,None], E_idx)[:,:,:,0]
+ E_positional = self.embeddings(offset.long(), E_chains)
+ E = torch.cat((E_positional, RBF_all, O_features), -1)
+
+
+ E = self.edge_embedding(E)
+ E = self.norm_edges(E)
+
+ return E, E_idx
+
+
+
+
+class ProteinFeatures(nn.Module):
+ def __init__(self, edge_features, node_features, num_positional_embeddings=16,
+ num_rbf=16, top_k=30, augment_eps=0., num_chain_embeddings=16):
+ """ Extract protein features """
+ super(ProteinFeatures, self).__init__()
+ self.edge_features = edge_features
+ self.node_features = node_features
+ self.top_k = top_k
+ self.augment_eps = augment_eps
+ self.num_rbf = num_rbf
+ self.num_positional_embeddings = num_positional_embeddings
+
+ self.embeddings = PositionalEncodings(num_positional_embeddings)
+ node_in, edge_in = 6, num_positional_embeddings + num_rbf*25
+ self.edge_embedding = nn.Linear(edge_in, edge_features, bias=False)
+ self.norm_edges = nn.LayerNorm(edge_features)
+
+ def _dist(self, X, mask, eps=1E-6):
+ mask_2D = torch.unsqueeze(mask,1) * torch.unsqueeze(mask,2)
+ dX = torch.unsqueeze(X,1) - torch.unsqueeze(X,2)
+ D = mask_2D * torch.sqrt(torch.sum(dX**2, 3) + eps)
+ D_max, _ = torch.max(D, -1, keepdim=True)
+ D_adjust = D + (1. - mask_2D) * D_max
+ sampled_top_k = self.top_k
+ D_neighbors, E_idx = torch.topk(D_adjust, np.minimum(self.top_k, X.shape[1]), dim=-1, largest=False)
+ return D_neighbors, E_idx
+
+ def _rbf(self, D):
+ device = D.device
+ D_min, D_max, D_count = 2., 22., self.num_rbf
+ D_mu = torch.linspace(D_min, D_max, D_count, device=device)
+ D_mu = D_mu.view([1,1,1,-1])
+ D_sigma = (D_max - D_min) / D_count
+ D_expand = torch.unsqueeze(D, -1)
+ RBF = torch.exp(-((D_expand - D_mu) / D_sigma)**2)
+ return RBF
+
+ def _get_rbf(self, A, B, E_idx):
+ D_A_B = torch.sqrt(torch.sum((A[:,:,None,:] - B[:,None,:,:])**2,-1) + 1e-6) #[B, L, L]
+ D_A_B_neighbors = gather_edges(D_A_B[:,:,:,None], E_idx)[:,:,:,0] #[B,L,K]
+ RBF_A_B = self._rbf(D_A_B_neighbors)
+ return RBF_A_B
+
+ def forward(self, X, mask, residue_idx, chain_labels):
+ if self.augment_eps > 0:
+ X = X + self.augment_eps * torch.randn_like(X)
+
+ b = X[:,:,1,:] - X[:,:,0,:]
+ c = X[:,:,2,:] - X[:,:,1,:]
+ a = torch.cross(b, c, dim=-1)
+ Cb = -0.58273431*a + 0.56802827*b - 0.54067466*c + X[:,:,1,:]
+ Ca = X[:,:,1,:]
+ N = X[:,:,0,:]
+ C = X[:,:,2,:]
+ O = X[:,:,3,:]
+
+ D_neighbors, E_idx = self._dist(Ca, mask)
+
+ RBF_all = []
+ RBF_all.append(self._rbf(D_neighbors)) #Ca-Ca
+ RBF_all.append(self._get_rbf(N, N, E_idx)) #N-N
+ RBF_all.append(self._get_rbf(C, C, E_idx)) #C-C
+ RBF_all.append(self._get_rbf(O, O, E_idx)) #O-O
+ RBF_all.append(self._get_rbf(Cb, Cb, E_idx)) #Cb-Cb
+ RBF_all.append(self._get_rbf(Ca, N, E_idx)) #Ca-N
+ RBF_all.append(self._get_rbf(Ca, C, E_idx)) #Ca-C
+ RBF_all.append(self._get_rbf(Ca, O, E_idx)) #Ca-O
+ RBF_all.append(self._get_rbf(Ca, Cb, E_idx)) #Ca-Cb
+ RBF_all.append(self._get_rbf(N, C, E_idx)) #N-C
+ RBF_all.append(self._get_rbf(N, O, E_idx)) #N-O
+ RBF_all.append(self._get_rbf(N, Cb, E_idx)) #N-Cb
+ RBF_all.append(self._get_rbf(Cb, C, E_idx)) #Cb-C
+ RBF_all.append(self._get_rbf(Cb, O, E_idx)) #Cb-O
+ RBF_all.append(self._get_rbf(O, C, E_idx)) #O-C
+ RBF_all.append(self._get_rbf(N, Ca, E_idx)) #N-Ca
+ RBF_all.append(self._get_rbf(C, Ca, E_idx)) #C-Ca
+ RBF_all.append(self._get_rbf(O, Ca, E_idx)) #O-Ca
+ RBF_all.append(self._get_rbf(Cb, Ca, E_idx)) #Cb-Ca
+ RBF_all.append(self._get_rbf(C, N, E_idx)) #C-N
+ RBF_all.append(self._get_rbf(O, N, E_idx)) #O-N
+ RBF_all.append(self._get_rbf(Cb, N, E_idx)) #Cb-N
+ RBF_all.append(self._get_rbf(C, Cb, E_idx)) #C-Cb
+ RBF_all.append(self._get_rbf(O, Cb, E_idx)) #O-Cb
+ RBF_all.append(self._get_rbf(C, O, E_idx)) #C-O
+ RBF_all = torch.cat(tuple(RBF_all), dim=-1)
+
+ offset = residue_idx[:,:,None]-residue_idx[:,None,:]
+ offset = gather_edges(offset[:,:,:,None], E_idx)[:,:,:,0] #[B, L, K]
+
+ d_chains = ((chain_labels[:, :, None] - chain_labels[:,None,:])==0).long() #find self vs non-self interaction
+ E_chains = gather_edges(d_chains[:,:,:,None], E_idx)[:,:,:,0]
+ E_positional = self.embeddings(offset.long(), E_chains)
+ E = torch.cat((E_positional, RBF_all), -1)
+ E = self.edge_embedding(E)
+ E = self.norm_edges(E)
+ return E, E_idx
+
+
+
+class ProteinMPNN(nn.Module):
+ def __init__(self, num_letters, node_features, edge_features,
+ hidden_dim, num_encoder_layers=3, num_decoder_layers=3,
+ vocab=21, k_neighbors=64, augment_eps=0.05, dropout=0.1, ca_only=False):
+ super(ProteinMPNN, self).__init__()
+
+ # Hyperparameters
+ self.node_features = node_features
+ self.edge_features = edge_features
+ self.hidden_dim = hidden_dim
+
+ # Featurization layers
+ if ca_only:
+ self.features = CA_ProteinFeatures(node_features, edge_features, top_k=k_neighbors, augment_eps=augment_eps)
+ self.W_v = nn.Linear(node_features, hidden_dim, bias=True)
+ else:
+ self.features = ProteinFeatures(node_features, edge_features, top_k=k_neighbors, augment_eps=augment_eps)
+
+ self.W_e = nn.Linear(edge_features, hidden_dim, bias=True)
+ self.W_s = nn.Embedding(vocab, hidden_dim)
+
+ # Encoder layers
+ self.encoder_layers = nn.ModuleList([
+ EncLayer(hidden_dim, hidden_dim*2, dropout=dropout)
+ for _ in range(num_encoder_layers)
+ ])
+
+ # Decoder layers
+ self.decoder_layers = nn.ModuleList([
+ DecLayer(hidden_dim, hidden_dim*3, dropout=dropout)
+ for _ in range(num_decoder_layers)
+ ])
+ self.W_out = nn.Linear(hidden_dim, num_letters, bias=True)
+
+ for p in self.parameters():
+ if p.dim() > 1:
+ nn.init.xavier_uniform_(p)
+
+ def forward(self, X, S, mask, chain_M, residue_idx, chain_encoding_all, randn, use_input_decoding_order=False, decoding_order=None):
+ """ Graph-conditioned sequence model """
+ device=X.device
+ # Prepare node and edge embeddings
+ E, E_idx = self.features(X, mask, residue_idx, chain_encoding_all)
+ h_V = torch.zeros((E.shape[0], E.shape[1], E.shape[-1]), device=E.device)
+ h_E = self.W_e(E)
+
+ # Encoder is unmasked self-attention
+ mask_attend = gather_nodes(mask.unsqueeze(-1), E_idx).squeeze(-1)
+ mask_attend = mask.unsqueeze(-1) * mask_attend
+ for layer in self.encoder_layers:
+ h_V, h_E = layer(h_V, h_E, E_idx, mask, mask_attend)
+
+ # Concatenate sequence embeddings for autoregressive decoder
+ h_S = self.W_s(S)
+ h_ES = cat_neighbors_nodes(h_S, h_E, E_idx)
+
+ # Build encoder embeddings
+ h_EX_encoder = cat_neighbors_nodes(torch.zeros_like(h_S), h_E, E_idx)
+ h_EXV_encoder = cat_neighbors_nodes(h_V, h_EX_encoder, E_idx)
+
+
+ chain_M = chain_M*mask #update chain_M to include missing regions
+ if not use_input_decoding_order:
+ decoding_order = torch.argsort((chain_M+0.0001)*(torch.abs(randn))) #[numbers will be smaller for places where chain_M = 0.0 and higher for places where chain_M = 1.0]
+ mask_size = E_idx.shape[1]
+ permutation_matrix_reverse = torch.nn.functional.one_hot(decoding_order, num_classes=mask_size).float()
+ order_mask_backward = torch.einsum('ij, biq, bjp->bqp',(1-torch.triu(torch.ones(mask_size,mask_size, device=device))), permutation_matrix_reverse, permutation_matrix_reverse)
+ mask_attend = torch.gather(order_mask_backward, 2, E_idx).unsqueeze(-1)
+ mask_1D = mask.view([mask.size(0), mask.size(1), 1, 1])
+ mask_bw = mask_1D * mask_attend
+ mask_fw = mask_1D * (1. - mask_attend)
+
+ h_EXV_encoder_fw = mask_fw * h_EXV_encoder
+ for layer in self.decoder_layers:
+ # Masked positions attend to encoder information, unmasked see.
+ h_ESV = cat_neighbors_nodes(h_V, h_ES, E_idx)
+ h_ESV = mask_bw * h_ESV + h_EXV_encoder_fw
+ h_V = layer(h_V, h_ESV, mask)
+
+ logits = self.W_out(h_V)
+ log_probs = F.log_softmax(logits, dim=-1)
+ return log_probs
+
+
+
+ def sample(self, X, randn, S_true, chain_mask, chain_encoding_all, residue_idx, mask=None, temperature=1.0, omit_AAs_np=None, bias_AAs_np=None, chain_M_pos=None, omit_AA_mask=None, pssm_coef=None, pssm_bias=None, pssm_multi=None, pssm_log_odds_flag=None, pssm_log_odds_mask=None, pssm_bias_flag=None, bias_by_res=None):
+ device = X.device
+ # Prepare node and edge embeddings
+ E, E_idx = self.features(X, mask, residue_idx, chain_encoding_all)
+ h_V = torch.zeros((E.shape[0], E.shape[1], E.shape[-1]), device=device)
+ h_E = self.W_e(E)
+
+ # Encoder is unmasked self-attention
+ mask_attend = gather_nodes(mask.unsqueeze(-1), E_idx).squeeze(-1)
+ mask_attend = mask.unsqueeze(-1) * mask_attend
+ for layer in self.encoder_layers:
+ h_V, h_E = layer(h_V, h_E, E_idx, mask, mask_attend)
+
+ # Decoder uses masked self-attention
+ chain_mask = chain_mask*chain_M_pos*mask #update chain_M to include missing regions
+ decoding_order = torch.argsort((chain_mask+0.0001)*(torch.abs(randn))) #[numbers will be smaller for places where chain_M = 0.0 and higher for places where chain_M = 1.0]
+ mask_size = E_idx.shape[1]
+ permutation_matrix_reverse = torch.nn.functional.one_hot(decoding_order, num_classes=mask_size).float()
+ order_mask_backward = torch.einsum('ij, biq, bjp->bqp',(1-torch.triu(torch.ones(mask_size,mask_size, device=device))), permutation_matrix_reverse, permutation_matrix_reverse)
+ mask_attend = torch.gather(order_mask_backward, 2, E_idx).unsqueeze(-1)
+ mask_1D = mask.view([mask.size(0), mask.size(1), 1, 1])
+ mask_bw = mask_1D * mask_attend
+ mask_fw = mask_1D * (1. - mask_attend)
+
+ N_batch, N_nodes = X.size(0), X.size(1)
+ log_probs = torch.zeros((N_batch, N_nodes, 21), device=device)
+ all_probs = torch.zeros((N_batch, N_nodes, 21), device=device, dtype=torch.float32)
+ h_S = torch.zeros_like(h_V, device=device)
+ S = torch.zeros((N_batch, N_nodes), dtype=torch.int64, device=device)
+ h_V_stack = [h_V] + [torch.zeros_like(h_V, device=device) for _ in range(len(self.decoder_layers))]
+ constant = torch.tensor(omit_AAs_np, device=device)
+ constant_bias = torch.tensor(bias_AAs_np, device=device)
+ #chain_mask_combined = chain_mask*chain_M_pos
+ omit_AA_mask_flag = omit_AA_mask != None
+
+
+ h_EX_encoder = cat_neighbors_nodes(torch.zeros_like(h_S), h_E, E_idx)
+ h_EXV_encoder = cat_neighbors_nodes(h_V, h_EX_encoder, E_idx)
+ h_EXV_encoder_fw = mask_fw * h_EXV_encoder
+ for t_ in range(N_nodes):
+ t = decoding_order[:,t_] #[B]
+ chain_mask_gathered = torch.gather(chain_mask, 1, t[:,None]) #[B]
+ mask_gathered = torch.gather(mask, 1, t[:,None]) #[B]
+ bias_by_res_gathered = torch.gather(bias_by_res, 1, t[:,None,None].repeat(1,1,21))[:,0,:] #[B, 21]
+ if (mask_gathered==0).all(): #for padded or missing regions only
+ S_t = torch.gather(S_true, 1, t[:,None])
+ else:
+ # Hidden layers
+ E_idx_t = torch.gather(E_idx, 1, t[:,None,None].repeat(1,1,E_idx.shape[-1]))
+ h_E_t = torch.gather(h_E, 1, t[:,None,None,None].repeat(1,1,h_E.shape[-2], h_E.shape[-1]))
+ h_ES_t = cat_neighbors_nodes(h_S, h_E_t, E_idx_t)
+ h_EXV_encoder_t = torch.gather(h_EXV_encoder_fw, 1, t[:,None,None,None].repeat(1,1,h_EXV_encoder_fw.shape[-2], h_EXV_encoder_fw.shape[-1]))
+ mask_t = torch.gather(mask, 1, t[:,None])
+ for l, layer in enumerate(self.decoder_layers):
+ # Updated relational features for future states
+ h_ESV_decoder_t = cat_neighbors_nodes(h_V_stack[l], h_ES_t, E_idx_t)
+ h_V_t = torch.gather(h_V_stack[l], 1, t[:,None,None].repeat(1,1,h_V_stack[l].shape[-1]))
+ h_ESV_t = torch.gather(mask_bw, 1, t[:,None,None,None].repeat(1,1,mask_bw.shape[-2], mask_bw.shape[-1])) * h_ESV_decoder_t + h_EXV_encoder_t
+ h_V_stack[l+1].scatter_(1, t[:,None,None].repeat(1,1,h_V.shape[-1]), layer(h_V_t, h_ESV_t, mask_V=mask_t))
+ # Sampling step
+ h_V_t = torch.gather(h_V_stack[-1], 1, t[:,None,None].repeat(1,1,h_V_stack[-1].shape[-1]))[:,0]
+ logits = self.W_out(h_V_t) / temperature
+ probs = F.softmax(logits-constant[None,:]*1e8+constant_bias[None,:]/temperature+bias_by_res_gathered/temperature, dim=-1)
+ if pssm_bias_flag:
+ pssm_coef_gathered = torch.gather(pssm_coef, 1, t[:,None])[:,0]
+ pssm_bias_gathered = torch.gather(pssm_bias, 1, t[:,None,None].repeat(1,1,pssm_bias.shape[-1]))[:,0]
+ probs = (1-pssm_multi*pssm_coef_gathered[:,None])*probs + pssm_multi*pssm_coef_gathered[:,None]*pssm_bias_gathered
+ if pssm_log_odds_flag:
+ pssm_log_odds_mask_gathered = torch.gather(pssm_log_odds_mask, 1, t[:,None, None].repeat(1,1,pssm_log_odds_mask.shape[-1]))[:,0] #[B, 21]
+ probs_masked = probs*pssm_log_odds_mask_gathered
+ probs_masked += probs * 0.001
+ probs = probs_masked/torch.sum(probs_masked, dim=-1, keepdim=True) #[B, 21]
+ if omit_AA_mask_flag:
+ omit_AA_mask_gathered = torch.gather(omit_AA_mask, 1, t[:,None, None].repeat(1,1,omit_AA_mask.shape[-1]))[:,0] #[B, 21]
+ probs_masked = probs*(1.0-omit_AA_mask_gathered)
+ probs = probs_masked/torch.sum(probs_masked, dim=-1, keepdim=True) #[B, 21]
+ S_t = torch.multinomial(probs, 1)
+ all_probs.scatter_(1, t[:,None,None].repeat(1,1,21), (chain_mask_gathered[:,:,None,]*probs[:,None,:]).float())
+ S_true_gathered = torch.gather(S_true, 1, t[:,None])
+ S_t = (S_t*chain_mask_gathered+S_true_gathered*(1.0-chain_mask_gathered)).long()
+ temp1 = self.W_s(S_t)
+ h_S.scatter_(1, t[:,None,None].repeat(1,1,temp1.shape[-1]), temp1)
+ S.scatter_(1, t[:,None], S_t)
+ output_dict = {"S": S, "probs": all_probs, "decoding_order": decoding_order}
+ return output_dict
+
+
+ def tied_sample(self, X, randn, S_true, chain_mask, chain_encoding_all, residue_idx, mask=None, temperature=1.0, omit_AAs_np=None, bias_AAs_np=None, chain_M_pos=None, omit_AA_mask=None, pssm_coef=None, pssm_bias=None, pssm_multi=None, pssm_log_odds_flag=None, pssm_log_odds_mask=None, pssm_bias_flag=None, tied_pos=None, tied_beta=None, bias_by_res=None):
+ device = X.device
+ # Prepare node and edge embeddings
+ E, E_idx = self.features(X, mask, residue_idx, chain_encoding_all)
+ h_V = torch.zeros((E.shape[0], E.shape[1], E.shape[-1]), device=device)
+ h_E = self.W_e(E)
+ # Encoder is unmasked self-attention
+ mask_attend = gather_nodes(mask.unsqueeze(-1), E_idx).squeeze(-1)
+ mask_attend = mask.unsqueeze(-1) * mask_attend
+ for layer in self.encoder_layers:
+ h_V, h_E = layer(h_V, h_E, E_idx, mask, mask_attend)
+
+ # Decoder uses masked self-attention
+ chain_mask = chain_mask*chain_M_pos*mask #update chain_M to include missing regions
+ decoding_order = torch.argsort((chain_mask+0.0001)*(torch.abs(randn))) #[numbers will be smaller for places where chain_M = 0.0 and higher for places where chain_M = 1.0]
+
+ new_decoding_order = []
+ for t_dec in list(decoding_order[0,].cpu().data.numpy()):
+ if t_dec not in list(itertools.chain(*new_decoding_order)):
+ list_a = [item for item in tied_pos if t_dec in item]
+ if list_a:
+ new_decoding_order.append(list_a[0])
+ else:
+ new_decoding_order.append([t_dec])
+ decoding_order = torch.tensor(list(itertools.chain(*new_decoding_order)), device=device)[None,].repeat(X.shape[0],1)
+
+ mask_size = E_idx.shape[1]
+ permutation_matrix_reverse = torch.nn.functional.one_hot(decoding_order, num_classes=mask_size).float()
+ order_mask_backward = torch.einsum('ij, biq, bjp->bqp',(1-torch.triu(torch.ones(mask_size,mask_size, device=device))), permutation_matrix_reverse, permutation_matrix_reverse)
+ mask_attend = torch.gather(order_mask_backward, 2, E_idx).unsqueeze(-1)
+ mask_1D = mask.view([mask.size(0), mask.size(1), 1, 1])
+ mask_bw = mask_1D * mask_attend
+ mask_fw = mask_1D * (1. - mask_attend)
+
+ N_batch, N_nodes = X.size(0), X.size(1)
+ log_probs = torch.zeros((N_batch, N_nodes, 21), device=device)
+ all_probs = torch.zeros((N_batch, N_nodes, 21), device=device, dtype=torch.float32)
+ h_S = torch.zeros_like(h_V, device=device)
+ S = torch.zeros((N_batch, N_nodes), dtype=torch.int64, device=device)
+ h_V_stack = [h_V] + [torch.zeros_like(h_V, device=device) for _ in range(len(self.decoder_layers))]
+ constant = torch.tensor(omit_AAs_np, device=device)
+ constant_bias = torch.tensor(bias_AAs_np, device=device)
+ omit_AA_mask_flag = omit_AA_mask != None
+
+ h_EX_encoder = cat_neighbors_nodes(torch.zeros_like(h_S), h_E, E_idx)
+ h_EXV_encoder = cat_neighbors_nodes(h_V, h_EX_encoder, E_idx)
+ h_EXV_encoder_fw = mask_fw * h_EXV_encoder
+ for t_list in new_decoding_order:
+ logits = 0.0
+ logit_list = []
+ done_flag = False
+ for t in t_list:
+ if (mask[:,t]==0).all():
+ S_t = S_true[:,t]
+ for t in t_list:
+ h_S[:,t,:] = self.W_s(S_t)
+ S[:,t] = S_t
+ done_flag = True
+ break
+ else:
+ E_idx_t = E_idx[:,t:t+1,:]
+ h_E_t = h_E[:,t:t+1,:,:]
+ h_ES_t = cat_neighbors_nodes(h_S, h_E_t, E_idx_t)
+ h_EXV_encoder_t = h_EXV_encoder_fw[:,t:t+1,:,:]
+ mask_t = mask[:,t:t+1]
+ for l, layer in enumerate(self.decoder_layers):
+ h_ESV_decoder_t = cat_neighbors_nodes(h_V_stack[l], h_ES_t, E_idx_t)
+ h_V_t = h_V_stack[l][:,t:t+1,:]
+ h_ESV_t = mask_bw[:,t:t+1,:,:] * h_ESV_decoder_t + h_EXV_encoder_t
+ h_V_stack[l+1][:,t,:] = layer(h_V_t, h_ESV_t, mask_V=mask_t).squeeze(1)
+ h_V_t = h_V_stack[-1][:,t,:]
+ logit_list.append((self.W_out(h_V_t) / temperature)/len(t_list))
+ logits += tied_beta[t]*(self.W_out(h_V_t) / temperature)/len(t_list)
+ if done_flag:
+ pass
+ else:
+ bias_by_res_gathered = bias_by_res[:,t,:] #[B, 21]
+ probs = F.softmax(logits-constant[None,:]*1e8+constant_bias[None,:]/temperature+bias_by_res_gathered/temperature, dim=-1)
+ if pssm_bias_flag:
+ pssm_coef_gathered = pssm_coef[:,t]
+ pssm_bias_gathered = pssm_bias[:,t]
+ probs = (1-pssm_multi*pssm_coef_gathered[:,None])*probs + pssm_multi*pssm_coef_gathered[:,None]*pssm_bias_gathered
+ if pssm_log_odds_flag:
+ pssm_log_odds_mask_gathered = pssm_log_odds_mask[:,t]
+ probs_masked = probs*pssm_log_odds_mask_gathered
+ probs_masked += probs * 0.001
+ probs = probs_masked/torch.sum(probs_masked, dim=-1, keepdim=True) #[B, 21]
+ if omit_AA_mask_flag:
+ omit_AA_mask_gathered = omit_AA_mask[:,t]
+ probs_masked = probs*(1.0-omit_AA_mask_gathered)
+ probs = probs_masked/torch.sum(probs_masked, dim=-1, keepdim=True) #[B, 21]
+ S_t_repeat = torch.multinomial(probs, 1).squeeze(-1)
+ S_t_repeat = (chain_mask[:,t]*S_t_repeat + (1-chain_mask[:,t])*S_true[:,t]).long() #hard pick fixed positions
+ for t in t_list:
+ h_S[:,t,:] = self.W_s(S_t_repeat)
+ S[:,t] = S_t_repeat
+ all_probs[:,t,:] = probs.float()
+ output_dict = {"S": S, "probs": all_probs, "decoding_order": decoding_order}
+ return output_dict
+
+
+ def conditional_probs(self, X, S, mask, chain_M, residue_idx, chain_encoding_all, randn, backbone_only=False):
+ """ Graph-conditioned sequence model """
+ device=X.device
+ # Prepare node and edge embeddings
+ E, E_idx = self.features(X, mask, residue_idx, chain_encoding_all)
+ h_V_enc = torch.zeros((E.shape[0], E.shape[1], E.shape[-1]), device=E.device)
+ h_E = self.W_e(E)
+
+ # Encoder is unmasked self-attention
+ mask_attend = gather_nodes(mask.unsqueeze(-1), E_idx).squeeze(-1)
+ mask_attend = mask.unsqueeze(-1) * mask_attend
+ for layer in self.encoder_layers:
+ h_V_enc, h_E = layer(h_V_enc, h_E, E_idx, mask, mask_attend)
+
+ # Concatenate sequence embeddings for autoregressive decoder
+ h_S = self.W_s(S)
+ h_ES = cat_neighbors_nodes(h_S, h_E, E_idx)
+
+ # Build encoder embeddings
+ h_EX_encoder = cat_neighbors_nodes(torch.zeros_like(h_S), h_E, E_idx)
+ h_EXV_encoder = cat_neighbors_nodes(h_V_enc, h_EX_encoder, E_idx)
+
+
+ chain_M = chain_M*mask #update chain_M to include missing regions
+
+ chain_M_np = chain_M.cpu().numpy()
+ idx_to_loop = np.argwhere(chain_M_np[0,:]==1)[:,0]
+ log_conditional_probs = torch.zeros([X.shape[0], chain_M.shape[1], 21], device=device).float()
+
+ for idx in idx_to_loop:
+ h_V = torch.clone(h_V_enc)
+ order_mask = torch.zeros(chain_M.shape[1], device=device).float()
+ if backbone_only:
+ order_mask = torch.ones(chain_M.shape[1], device=device).float()
+ order_mask[idx] = 0.
+ else:
+ order_mask = torch.zeros(chain_M.shape[1], device=device).float()
+ order_mask[idx] = 1.
+ decoding_order = torch.argsort((order_mask[None,]+0.0001)*(torch.abs(randn))) #[numbers will be smaller for places where chain_M = 0.0 and higher for places where chain_M = 1.0]
+ mask_size = E_idx.shape[1]
+ permutation_matrix_reverse = torch.nn.functional.one_hot(decoding_order, num_classes=mask_size).float()
+ order_mask_backward = torch.einsum('ij, biq, bjp->bqp',(1-torch.triu(torch.ones(mask_size,mask_size, device=device))), permutation_matrix_reverse, permutation_matrix_reverse)
+ mask_attend = torch.gather(order_mask_backward, 2, E_idx).unsqueeze(-1)
+ mask_1D = mask.view([mask.size(0), mask.size(1), 1, 1])
+ mask_bw = mask_1D * mask_attend
+ mask_fw = mask_1D * (1. - mask_attend)
+
+ h_EXV_encoder_fw = mask_fw * h_EXV_encoder
+ for layer in self.decoder_layers:
+ # Masked positions attend to encoder information, unmasked see.
+ h_ESV = cat_neighbors_nodes(h_V, h_ES, E_idx)
+ h_ESV = mask_bw * h_ESV + h_EXV_encoder_fw
+ h_V = layer(h_V, h_ESV, mask)
+
+ logits = self.W_out(h_V)
+ log_probs = F.log_softmax(logits, dim=-1)
+ log_conditional_probs[:,idx,:] = log_probs[:,idx,:]
+ return log_conditional_probs
+
+
+ def unconditional_probs(self, X, mask, residue_idx, chain_encoding_all):
+ """ Graph-conditioned sequence model """
+ device=X.device
+ # Prepare node and edge embeddings
+ E, E_idx = self.features(X, mask, residue_idx, chain_encoding_all)
+ h_V = torch.zeros((E.shape[0], E.shape[1], E.shape[-1]), device=E.device)
+ h_E = self.W_e(E)
+
+ # Encoder is unmasked self-attention
+ mask_attend = gather_nodes(mask.unsqueeze(-1), E_idx).squeeze(-1)
+ mask_attend = mask.unsqueeze(-1) * mask_attend
+ for layer in self.encoder_layers:
+ h_V, h_E = layer(h_V, h_E, E_idx, mask, mask_attend)
+
+ # Build encoder embeddings
+ h_EX_encoder = cat_neighbors_nodes(torch.zeros_like(h_V), h_E, E_idx)
+ h_EXV_encoder = cat_neighbors_nodes(h_V, h_EX_encoder, E_idx)
+
+ order_mask_backward = torch.zeros([X.shape[0], X.shape[1], X.shape[1]], device=device)
+ mask_attend = torch.gather(order_mask_backward, 2, E_idx).unsqueeze(-1)
+ mask_1D = mask.view([mask.size(0), mask.size(1), 1, 1])
+ mask_bw = mask_1D * mask_attend
+ mask_fw = mask_1D * (1. - mask_attend)
+
+ h_EXV_encoder_fw = mask_fw * h_EXV_encoder
+ for layer in self.decoder_layers:
+ h_V = layer(h_V, h_EXV_encoder_fw, mask)
+
+ logits = self.W_out(h_V)
+ log_probs = F.log_softmax(logits, dim=-1)
+ return log_probs
+
diff --git a/proteingym/baselines/protgpt2/compute_fitness.py b/proteingym/baselines/protgpt2/compute_fitness.py
new file mode 100644
index 0000000..8b9904d
--- /dev/null
+++ b/proteingym/baselines/protgpt2/compute_fitness.py
@@ -0,0 +1,98 @@
+import os
+import argparse
+import tqdm
+
+from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer
+from scipy.stats import spearmanr
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss
+
+def calc_fitness(model, prots, tokenizer, device='cuda:0', model_context_len=1023, mirror=True):
+ loss_list = []
+ loss_fn = CrossEntropyLoss(reduction='mean') # We compute the CrossEnt per token (to correct for variable length post-mutation)
+ with torch.no_grad():
+ for prot in tqdm.tqdm(prots):
+ loss_val = 0
+
+ sequence_chunks=[]
+ if len(prot) < model_context_len:
+ sequence_chunks = [prot]
+ else:
+ len_target_seq = len(prot)
+ num_windows = 1 + int( len_target_seq / model_context_len)
+ start=0
+ for window_index in range(1, num_windows+1):
+ sequence_chunks.append(prot[start:start+model_context_len])
+ start += model_context_len
+
+ for chunk in sequence_chunks:
+ list_proteins_directions = [chunk] if not mirror else [chunk, chunk[::-1]]
+ for p in list_proteins_directions:
+ ids = torch.tensor([tokenizer.encode(p)]).to(device)
+ input_ids = ids[:, :-1]
+ targets = ids[:, 1:]
+
+ logits=model(input_ids).logits
+ loss = loss_fn(target=targets.view(-1), input=logits.view(-1,logits.size(-1)))
+ loss_val += -loss.item()
+ num_chunks = len(sequence_chunks) if not mirror else 2 * len(sequence_chunks)
+ loss_list += [loss_val / num_chunks] #Average the CrossEnt per token across all chunks
+ return np.array(loss_list)
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab="ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ parser.add_argument('--ProtGPT2_model_name_or_path', default="./", type=str, help='Name of or path to ProtGPT2 model')
+ parser.add_argument('--DMS_reference_file_path', default='./proteingym/ProteinGym_reference_file_substitutions.csv', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_data_folder', default='./DMS_files/ProteinGym_substitutions', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_index', type=int, help='Index of DMS to score')
+ parser.add_argument('--output_scores_folder', default=None, type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+ args = parser.parse_args()
+
+ model = AutoModelForCausalLM.from_pretrained(args.ProtGPT2_model_name_or_path,trust_remote_code=True)
+ model.cuda()
+ tokenizer = AutoTokenizer.from_pretrained("/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/ProtGPT2")
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Computing scores for: {} with ProtGP2: {}".format(DMS_id, args.ProtGPT2_model_name_or_path))
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ if not args.indel_mode and 'mutated_sequence' not in DMS_data.columns:
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: get_mutated_sequence(target_seq, x))
+ # if args.indel_mode:
+ # DMS_data['mutated_sequence'] = DMS_data['mutant']
+ model_scores = calc_fitness(model=model, prots=np.array(DMS_data['mutated_sequence']), tokenizer=tokenizer)
+
+ DMS_data['ProtGPT2_score']=model_scores
+ scoring_filename = args.output_scores_folder+os.sep+DMS_id+'.csv'
+ DMS_data[['mutated_sequence','ProtGPT2_score','DMS_score']].to_csv(scoring_filename, index=False)
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/README.md b/proteingym/baselines/protssn/README.md
new file mode 100644
index 0000000..9983e0f
--- /dev/null
+++ b/proteingym/baselines/protssn/README.md
@@ -0,0 +1,81 @@
+# ProtSSN
+
+## Environment
+
+Please make sure you have installed **[Anaconda3](https://www.anaconda.com/download)** or **[Miniconda3](https://docs.conda.io/projects/miniconda/en/latest/)**.
+The `torch_geometric` package should be updated to 2.3 or higher.
+
+```shell
+conda env create -f protssn_environment.yaml
+conda activate protssn
+```
+
+## Model Checkpoints
+
+There are 9 distinct models that can be used, separately or via ensembling.
+If you only want to only use **one model**, authors recommend using **k20_h512**.
+
+| # Version | # Param | # Link |
+| --------- | ------- | ------------------------------------------------------------ |
+| k10_h512 | 148 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k10_h512.pt |
+| k10_h768 | 160 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k10_h768.pt |
+| k10_h1280 | 184 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k10_h1280.pt |
+| k20_h512 | 148 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k20_h512.pt |
+| k20_h768 | 160 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k20_h768.pt |
+| k20_h1280 | 184 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k20_h1280.pt |
+| k30_h512 | 148 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k30_h512.pt |
+| k30_h768 | 160 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k30_h768.pt |
+| k30_h1280 | 184 | https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k30_h1280.pt |
+
+```shell
+mkdir model
+cd model
+wget https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/model/protssn_k20_h512.pt
+```
+
+`ProtSSN.tar` contains all the model checkpoints. The **training records** and **configs** can be found in `model/history` and `model/config`.
+
+```shell
+wget https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/ProtSSN.model.tar
+tar -xvf ProtSSN.model.tar
+rm ProtSSN.model.tar
+```
+## Data
+
+The pdb and csv files can be downloaded from https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/ProteinGym_substitutions_pdb-csv_checked.zip.
+The files should be arranged as follows:
+
+```
+data/proteingym-benchmark
+|——DATASET
+|——|——Protein1
+|——|——|——Protein1.pdb
+|——|——|——Protein1.tsv
+|——|——Protein2
+|——|——...
+```
+
+- Because the model requires **structure (PDB)** as input, authors recommend using [**Alphafold**](https://github.com/google-deepmind/alphafold) for folding, or **[ESMFold](https://github.com/facebookresearch/esm)**.
+- You may also search the structure for a protein of interest via its Uniprot ID in the **AlphaFold database** (https://alphafold.ebi.ac.uk/).
+
+## Usage
+
+Please refer to the scoring script under `scripts/scoring_DMS_zero_shot/scoring_ProtSSN_substitutions.sh`
+
+## Acknowledgements
+
+For more details about ProtSSN, please refer to the official [ProtSSN GitHub repo](https://github.com/tyang816/ProtSSN).
+
+Please cite the following paper if you use ProtSSN in your work:
+
+```
+@article{tan2023protssn,
+ title={Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability},
+ author={Tan, Yang and Zhou, Bingxin and Zheng, Lirong and Fan, Guisheng and Hong, Liang},
+ journal={bioRxiv},
+ pages={2023--12},
+ year={2023},
+ publisher={Cold Spring Harbor Laboratory}
+}
+```
+
diff --git a/proteingym/baselines/protssn/compute_fitness.py b/proteingym/baselines/protssn/compute_fitness.py
new file mode 100644
index 0000000..8e914c2
--- /dev/null
+++ b/proteingym/baselines/protssn/compute_fitness.py
@@ -0,0 +1,190 @@
+import argparse
+import json
+import warnings
+import torch
+import os
+import sys
+import yaml
+import numpy as np
+import pandas as pd
+from torch import nn
+from torch_geometric.loader import DataLoader
+from numpy import nan
+from typing import *
+from tqdm import tqdm
+from scipy.stats import spearmanr
+from transformers import logging
+from src.models import PLM_model, GNN_model
+from src.data import build_mutant_dataset
+from src.utils.utils import param_num
+
+# set path
+current_dir = os.getcwd()
+sys.path.append(current_dir)
+# ignore warning information
+logging.set_verbosity_error()
+warnings.filterwarnings("ignore")
+
+amino_acids_type = ['A', 'R', 'N', 'D', 'C', 'Q', 'E', 'G', 'H', 'I',
+ 'L', 'K', 'M', 'F', 'P', 'S', 'T', 'W', 'Y', 'V']
+
+def label_row(rows, sequence, token_probs, offset_idx=1):
+ s = []
+ sep = ";"
+ if ":" in rows:
+ sep = ":"
+ for row in rows.split(sep):
+ if row.lower() == "wt":
+ s.append(0)
+ continue
+ try:
+ wt, idx, mt = row[0], int(row[1:-1]) - offset_idx, row[-1]
+ except:
+ print(f"row: {row}, sequence: {sequence}")
+ raise ValueError
+ assert sequence[idx] == wt, f"The {row}, {sequence[idx]}"
+ wt_encoded, mt_encoded = amino_acids_type.index(wt), amino_acids_type.index(mt)
+ score = token_probs[idx, mt_encoded] - token_probs[idx, wt_encoded]
+ score = score.item()
+ s.append(score)
+
+ return sum(s)
+
+
+def predict(args, plm_model, gnn_model, loader):
+ gnn_model.eval()
+ softmax = nn.Softmax()
+ result_dict = {"name": [], "count": [], args.score_name: []}
+
+ with torch.no_grad():
+ for data in loader:
+ protein_name = data.protein_name[0]
+ graph_data = plm_model(data)
+ out, _ = gnn_model(graph_data)
+ seq = "".join([amino_acids_type[i] for i in data.y])
+ out = torch.log(softmax(out[:, :20]) + 1e-9)
+
+ # check the mutant file
+ mutant_file_tsv = os.path.join(args.mutant_dataset_dir, "DATASET", protein_name, f"{protein_name}.tsv")
+ mutant_file_csv = os.path.join(args.mutant_dataset_dir, "DATASET", protein_name, f"{protein_name}.csv")
+ if os.path.exists(mutant_file_tsv):
+ mutant_df = pd.read_table(mutant_file_tsv)
+ elif os.path.exists(mutant_file_csv):
+ mutant_df = pd.read_csv(mutant_file_csv)
+ else:
+ raise ValueError(f"Invalid file: {mutant_file_tsv} or {mutant_file_csv}")
+
+ # check the offset
+ if protein_name == "A0A140D2T1_ZIKV_Sourisseau_2019":
+ offset = 291
+ else:
+ offset = 1
+
+ # label the mutant
+ mutant_df[args.score_name] = mutant_df[args.mutant_pos_col].apply(
+ lambda x: label_row(x, seq, out.cpu().numpy(), offset)
+ )
+ result_file = os.path.join(args.output_scores_folder, protein_name + ".csv")
+ if not os.path.exists(result_file):
+ mutant_df.to_csv(result_file, index=False)
+
+ result = pd.read_csv(result_file)
+ result[args.score_name] = mutant_df[args.score_name]
+ result.to_csv(result_file, index=False)
+
+ # save the spearmanr score
+ result_dict['count'].append(len(result))
+ result_dict['name'].append(protein_name)
+ spearmanr_score = spearmanr(result[args.mutant_score_col], result[args.score_name]).correlation
+ if spearmanr_score is nan:
+ spearmanr_score = 0
+ result_dict[args.score_name].append(spearmanr_score)
+
+ print(f">>> {protein_name}: {spearmanr_score}; mutant_num: {len(result)}")
+
+ if args.score_info is not None:
+ if os.path.exists(args.score_info):
+ total_result = pd.read_csv(args.score_info)
+ total_result[args.score_name] = result_dict[args.score_name]
+ total_result.to_csv(args.score_info, index=False)
+ else:
+ pd.DataFrame(result_dict).to_csv(args.score_info, index=False)
+
+ print(f">>> {args.score_name} average spearmanr: {np.mean(result_dict[args.score_name])}\n")
+
+def ensemble(args):
+ print("----------------- Ensemble -----------------")
+ result_files = os.listdir(args.output_scores_folder)
+ sp_scores = []
+ for file in tqdm(result_files):
+ result_file = os.path.join(args.output_scores_folder, file)
+ result_df = pd.read_csv(result_file)
+ models_pred = [result_df[col].to_list() for col in result_df.columns if col.startswith("ProtSSN")]
+ ensemble_pred = np.mean(models_pred, axis=0)
+ result_df["ProtSSN_ensemble"] = ensemble_pred
+ result_df.to_csv(result_file, index=False)
+ sp_score = spearmanr(result_df[args.mutant_score_col], result_df["ProtSSN_ensemble"]).correlation
+ sp_scores.append(sp_score)
+ print(">>> Ensemble spearmanr: ", np.mean(sp_scores))
+
+def prepare(args, dataset_name, k, h):
+ # for build dataset
+ args.mutant_name = f"{dataset_name}_k{k}"
+ mutant_dataset = build_mutant_dataset(args)
+ protein_names = mutant_dataset.protein_names
+ print(f">>> Protein names: {protein_names}")
+ mutant_loader = DataLoader(mutant_dataset, batch_size=1, shuffle=False)
+ print(f">>> Number of proteins: {len(mutant_dataset)}")
+ gnn_model = GNN_model(args)
+ print(f">>> k{k}_h{h} {param_num(gnn_model)}")
+ gnn_model_path = os.path.join(args.gnn_model_dir, f"protssn_k{k}_h{h}.pt")
+ gnn_model.load_state_dict(torch.load(gnn_model_path))
+ return args, mutant_loader, gnn_model
+
+def create_parser():
+ parser = argparse.ArgumentParser()
+ parser.add_argument("--gnn", type=str, default="egnn", help="gat, gcn, or egnn")
+ parser.add_argument("--gnn_config", type=str, default="src/config/egnn.yaml", help="gnn config")
+ parser.add_argument("--gnn_model_dir", type=str, default="model/", help="test model name")
+ parser.add_argument("--gnn_model_name", type=str, default=None, nargs="+", help="test model name")
+
+ parser.add_argument("--plm", type=str, default="facebook/esm2_t33_650M_UR50D", help="esm param number")
+ parser.add_argument("--use_ensemble", action="store_true", help="use ensemble model")
+
+ # dataset
+ parser.add_argument("--mutant_dataset_dir", type=str, default="data/evaluation", help="mutation dataset")
+ parser.add_argument("--mutant_name", type=str, default=None, help="name of mutation dataset")
+ parser.add_argument("--mutant_pos_col", type=str, default="mutant", help="mutation column name")
+ parser.add_argument("--mutant_score_col", type=str, default="DMS_score", help="the model output score column name")
+
+ parser.add_argument("--score_info", type=str, default=None, help="the model output spearmanr score file")
+ parser.add_argument("--output_scores_folder", type=str, default="result/", help="the result output path")
+ parser.add_argument("--repo_path", type=str, default=None, help="Path to ProteinGym repo")
+
+ args = parser.parse_args()
+ return args
+
+
+if __name__ == "__main__":
+ args = create_parser()
+ args.gnn_config = yaml.load(open(args.gnn_config), Loader=yaml.FullLoader)[args.gnn]
+ if args.repo_path is None: args.repo_path = os.path.dirname(os.path.dirname(os.getcwd()))
+ plm_model = PLM_model(args)
+ args.plm_hidden_size = plm_model.model.config.hidden_size
+ dataset_name = args.mutant_dataset_dir.split("/")[-1]
+ os.makedirs(args.output_scores_folder, exist_ok=True)
+
+ for gnn in args.gnn_model_name:
+ k, h = gnn.split("_")
+ k, h = int(k[1:]), int(h[1:])
+ print(f"--------------- ProtSSN k{k}_h{h} ---------------")
+ assert k in [10, 20, 30], f"Invalid k: {k}"
+ assert h in [512, 768, 1280], f"Invalid h: {h}"
+ args.gnn_config["hidden_channels"] = h
+ args.c_alpha_max_neighbors = k
+ args.score_name = f"ProtSSN_k{k}_h{h}"
+ args, mutant_loader, gnn_model = prepare(args, dataset_name, k, h)
+ predict(args=args, plm_model=plm_model, gnn_model=gnn_model, loader=mutant_loader)
+
+ if args.use_ensemble:
+ ensemble(args)
diff --git a/proteingym/baselines/protssn/norm/cath_k10_mean_attr.pt b/proteingym/baselines/protssn/norm/cath_k10_mean_attr.pt
new file mode 100644
index 0000000..8c3fb69
Binary files /dev/null and b/proteingym/baselines/protssn/norm/cath_k10_mean_attr.pt differ
diff --git a/proteingym/baselines/protssn/norm/cath_k20_mean_attr.pt b/proteingym/baselines/protssn/norm/cath_k20_mean_attr.pt
new file mode 100644
index 0000000..126555d
Binary files /dev/null and b/proteingym/baselines/protssn/norm/cath_k20_mean_attr.pt differ
diff --git a/proteingym/baselines/protssn/norm/cath_k30_mean_attr.pt b/proteingym/baselines/protssn/norm/cath_k30_mean_attr.pt
new file mode 100644
index 0000000..47a7399
Binary files /dev/null and b/proteingym/baselines/protssn/norm/cath_k30_mean_attr.pt differ
diff --git a/proteingym/baselines/protssn/protssn_environment.yaml b/proteingym/baselines/protssn/protssn_environment.yaml
new file mode 100644
index 0000000..0a373cb
--- /dev/null
+++ b/proteingym/baselines/protssn/protssn_environment.yaml
@@ -0,0 +1,298 @@
+name: protssn
+channels:
+ - schrodinger
+ - bioconda
+ - ostrokach
+ - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main
+ - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge
+ - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/
+ - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
+ - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
+ - pytorch
+ - nvidia
+ - pyg
+dependencies:
+ - _libgcc_mutex=0.1=conda_forge
+ - _openmp_mutex=4.5=2_kmp_llvm
+ - appdirs=1.4.4=pyhd3eb1b0_0
+ - blas=1.0=mkl
+ - blessings=1.7=py310h06a4308_1002
+ - brotlipy=0.7.0=py310h7f8727e_1002
+ - bzip2=1.0.8=h7b6447c_0
+ - c-ares=1.19.1=h5eee18b_0
+ - ca-certificates=2023.08.22=h06a4308_0
+ - certifi=2023.11.17=py310h06a4308_0
+ - cffi=1.15.1=py310h74dc2b5_0
+ - charset-normalizer=2.0.4=pyhd3eb1b0_0
+ - cryptography=39.0.1=py310h9ce1e76_0
+ - cuda-cudart=11.7.99=0
+ - cuda-cupti=11.7.101=0
+ - cuda-libraries=11.7.1=0
+ - cuda-nvrtc=11.7.99=0
+ - cuda-nvtx=11.7.91=0
+ - cuda-runtime=11.7.1=0
+ - curl=8.2.1=h37d81fd_0
+ - deepspeed=0.9.2=cpu_py310h11dbdba_0
+ - dssp=2.2.1=1
+ - ffmpeg=4.3=hf484d3e_0
+ - freetype=2.12.1=h4a9f257_0
+ - giflib=5.2.1=h5eee18b_3
+ - gmp=6.2.1=h295c915_3
+ - gnutls=3.6.15=he1e5248_0
+ - gpustat=0.6.0=pyhd3eb1b0_1
+ - hjson-py=3.1.0=pyhd8ed1ab_0
+ - idna=3.4=py310h06a4308_0
+ - intel-openmp=2021.4.0=h06a4308_3561
+ - jinja2=3.1.2=py310h06a4308_0
+ - joblib=1.2.0=py310h06a4308_0
+ - jpeg=9e=h5eee18b_1
+ - krb5=1.20.1=h568e23c_1
+ - lame=3.100=h7b6447c_0
+ - lcms2=2.12=h3be6417_0
+ - ld_impl_linux-64=2.38=h1181459_1
+ - lerc=3.0=h295c915_0
+ - libaio=0.3.113=h5eee18b_0
+ - libblas=3.9.0=12_linux64_mkl
+ - libcblas=3.9.0=12_linux64_mkl
+ - libcublas=11.10.3.66=0
+ - libcufft=10.7.2.124=h4fbf590_0
+ - libcufile=1.6.1.9=0
+ - libcurand=10.3.2.106=0
+ - libcurl=8.2.1=h91b91d3_0
+ - libcusolver=11.4.0.1=0
+ - libcusparse=11.7.4.91=0
+ - libdeflate=1.17=h5eee18b_0
+ - libedit=3.1.20221030=h5eee18b_0
+ - libev=4.33=h7f8727e_1
+ - libffi=3.3=he6710b0_2
+ - libgcc-ng=13.1.0=he5830b7_0
+ - libgfortran-ng=7.5.0=ha8ba4b0_17
+ - libgfortran4=7.5.0=ha8ba4b0_17
+ - libiconv=1.16=h7f8727e_2
+ - libidn2=2.3.4=h5eee18b_0
+ - liblapack=3.9.0=12_linux64_mkl
+ - libnghttp2=1.52.0=ha637b67_1
+ - libnpp=11.7.4.75=0
+ - libnvjpeg=11.8.0.2=0
+ - libpng=1.6.39=h5eee18b_0
+ - libssh2=1.10.0=h37d81fd_2
+ - libstdcxx-ng=13.1.0=hfd8a6a1_0
+ - libtasn1=4.19.0=h5eee18b_0
+ - libtiff=4.5.0=h6a678d5_2
+ - libunistring=0.9.10=h27cfd23_0
+ - libuuid=1.41.5=h5eee18b_0
+ - libwebp=1.2.4=h11a3e52_1
+ - libwebp-base=1.2.4=h5eee18b_1
+ - llvm-openmp=14.0.6=h9e868ea_0
+ - lz4-c=1.9.4=h6a678d5_0
+ - markupsafe=2.1.1=py310h7f8727e_0
+ - mkl=2021.4.0=h06a4308_640
+ - mkl-service=2.4.0=py310h7f8727e_0
+ - mkl_fft=1.3.1=py310hd6ae3a3_0
+ - mkl_random=1.2.2=py310h00e6091_0
+ - ncurses=6.4=h6a678d5_0
+ - nettle=3.7.3=hbbd107a_1
+ - nvidia-ml=7.352.0=pyhd3eb1b0_0
+ - openh264=2.1.1=h4ff587b_0
+ - openssl=1.1.1w=h7f8727e_0
+ - packaging=23.1=py310h06a4308_0
+ - pillow=9.4.0=py310h6a678d5_0
+ - pip=23.0.1=py310h06a4308_0
+ - pooch=1.4.0=pyhd3eb1b0_0
+ - psutil=5.9.0=py310h5eee18b_0
+ - py-cpuinfo=8.0.0=pyhd3eb1b0_1
+ - pycparser=2.21=pyhd3eb1b0_0
+ - pyg=2.3.0=py310_torch_1.13.0_cu117
+ - pyopenssl=23.0.0=py310h06a4308_0
+ - pyparsing=3.0.9=py310h06a4308_0
+ - pysocks=1.7.1=py310h06a4308_0
+ - python=3.10.0=h12debd9_5
+ - python_abi=3.10=2_cp310
+ - pytorch=1.13.1=py3.10_cuda11.7_cudnn8.5.0_0
+ - pytorch-cuda=11.7=h778d358_5
+ - pytorch-mutex=1.0=cuda
+ - readline=8.2=h5eee18b_0
+ - scikit-learn=1.2.2=py310h6a678d5_1
+ - seqkit=2.5.0=h9ee0642_0
+ - setuptools=66.0.0=py310h06a4308_0
+ - six=1.16.0=pyhd3eb1b0_1
+ - sqlite=3.41.2=h5eee18b_0
+ - threadpoolctl=2.2.0=pyh0d69192_0
+ - tk=8.6.12=h1ccaba5_0
+ - tmalign=20170708=h0e1e685_2
+ - torchaudio=0.13.1=py310_cu117
+ - torchvision=0.14.1=py310_cu117
+ - tqdm=4.65.0=py310h2f386ee_0
+ - urllib3=1.26.15=py310h06a4308_0
+ - wheel=0.38.4=py310h06a4308_0
+ - xz=5.4.2=h5eee18b_0
+ - zlib=1.2.13=h5eee18b_0
+ - zstd=1.5.5=hc292b87_0
+ - pip:
+ - accelerate==0.25.0
+ - aiofiles==23.2.1
+ - aiohttp==3.9.1
+ - aiosignal==1.3.1
+ - altair==5.2.0
+ - annotated-types==0.6.0
+ - antlr4-python3-runtime==4.9.3
+ - anyio==3.7.1
+ - argon2-cffi==23.1.0
+ - argon2-cffi-bindings==21.2.0
+ - arrow==1.3.0
+ - asttokens==2.4.1
+ - async-lru==2.0.4
+ - async-timeout==4.0.3
+ - attrs==23.1.0
+ - babel==2.14.0
+ - beartype==0.16.4
+ - beautifulsoup4==4.12.2
+ - bio==1.6.0
+ - biopython==1.81
+ - biothings-client==0.3.1
+ - biotite==0.38.0
+ - bleach==6.1.0
+ - click==8.1.7
+ - colorama==0.4.6
+ - comm==0.2.0
+ - conda-pack==0.7.1
+ - contourpy==1.2.0
+ - cycler==0.12.1
+ - debugpy==1.8.0
+ - decorator==5.1.1
+ - defusedxml==0.7.1
+ - dm-tree==0.1.8
+ - einops==0.7.0
+ - ema-pytorch==0.3.1
+ - exceptiongroup==1.2.0
+ - executing==2.0.1
+ - fair-esm==2.0.0
+ - fastapi==0.105.0
+ - fastjsonschema==2.19.0
+ - ffmpy==0.3.1
+ - filelock==3.13.1
+ - fonttools==4.45.1
+ - fqdn==1.5.1
+ - frozenlist==1.4.0
+ - fsspec==2023.10.0
+ - gemmi==0.6.4
+ - gprofiler-official==1.0.0
+ - gradio==4.11.0
+ - gradio-client==0.7.3
+ - h11==0.14.0
+ - httpcore==1.0.2
+ - httpx==0.25.2
+ - huggingface-hub==0.19.4
+ - icetk==0.0.7
+ - importlib-resources==6.1.1
+ - iniconfig==2.0.0
+ - ipykernel==6.27.1
+ - ipython==8.18.1
+ - isoduration==20.11.0
+ - jedi==0.19.1
+ - json5==0.9.14
+ - jsonpointer==2.4
+ - jsonschema==4.20.0
+ - jsonschema-specifications==2023.11.2
+ - jupyter-client==8.6.0
+ - jupyter-core==5.5.0
+ - jupyter-events==0.9.0
+ - jupyter-lsp==2.2.1
+ - jupyter-server==2.12.1
+ - jupyter-server-terminals==0.5.0
+ - jupyterlab==4.0.9
+ - jupyterlab-pygments==0.3.0
+ - jupyterlab-server==2.25.2
+ - kiwisolver==1.4.5
+ - latex2mathml==3.77.0
+ - lightning==2.1.2
+ - lightning-utilities==0.10.0
+ - markdown==3.5.1
+ - markdown-it-py==3.0.0
+ - matplotlib==3.8.2
+ - matplotlib-inline==0.1.6
+ - mdtex2html==1.2.0
+ - mdurl==0.1.2
+ - mistune==3.0.2
+ - msgpack==1.0.7
+ - multidict==6.0.4
+ - mygene==3.2.2
+ - nbclient==0.9.0
+ - nbconvert==7.12.0
+ - nbformat==5.9.2
+ - nest-asyncio==1.5.8
+ - networkx==3.2.1
+ - notebook-shim==0.2.3
+ - numpy==1.26.2
+ - omegaconf==2.3.0
+ - orjson==3.9.10
+ - overrides==7.4.0
+ - pandas==2.1.3
+ - pandocfilters==1.5.0
+ - parso==0.8.3
+ - peft==0.7.1
+ - pexpect==4.9.0
+ - platformdirs==4.1.0
+ - pluggy==1.3.0
+ - prometheus-client==0.19.0
+ - prompt-toolkit==3.0.43
+ - protobuf==3.18.3
+ - ptyprocess==0.7.0
+ - pure-eval==0.2.2
+ - pydantic==2.5.2
+ - pydantic-core==2.14.5
+ - pydub==0.25.1
+ - pygments==2.17.2
+ - pytest==7.4.3
+ - python-dateutil==2.8.2
+ - python-json-logger==2.0.7
+ - python-multipart==0.0.6
+ - pytorch-lightning==2.1.2
+ - pytz==2023.3.post1
+ - pyyaml==6.0.1
+ - pyzmq==25.1.2
+ - rdkit==2023.9.2
+ - referencing==0.32.0
+ - regex==2023.10.3
+ - requests==2.31.0
+ - rfc3339-validator==0.1.4
+ - rfc3986-validator==0.1.1
+ - rich==13.7.0
+ - rpds-py==0.13.2
+ - safetensors==0.4.0
+ - scipy==1.11.4
+ - seaborn==0.13.0
+ - semantic-version==2.10.0
+ - send2trash==1.8.2
+ - sentencepiece==0.1.99
+ - shellingham==1.5.4
+ - sniffio==1.3.0
+ - soupsieve==2.5
+ - stack-data==0.6.3
+ - starlette==0.27.0
+ - svgwrite==1.4.3
+ - terminado==0.18.0
+ - tinycss2==1.2.1
+ - tokenizers==0.15.0
+ - tomli==2.0.1
+ - tomlkit==0.12.0
+ - toolz==0.12.0
+ - torch-cluster==1.6.3
+ - torch-scatter==2.1.2
+ - torch-sparse==0.6.18
+ - torchmetrics==1.2.1
+ - tornado==6.4
+ - traitlets==5.14.0
+ - transformers==4.35.2
+ - typer==0.9.0
+ - types-python-dateutil==2.8.19.14
+ - typing-extensions==4.9.0
+ - tzdata==2023.3
+ - uri-template==1.3.0
+ - uvicorn==0.24.0.post1
+ - wcwidth==0.2.12
+ - webcolors==1.13
+ - webencodings==0.5.1
+ - websocket-client==1.7.0
+ - websockets==11.0.3
+ - yarl==1.9.4
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/build_sav.py b/proteingym/baselines/protssn/src/build_sav.py
new file mode 100644
index 0000000..82a9095
--- /dev/null
+++ b/proteingym/baselines/protssn/src/build_sav.py
@@ -0,0 +1,40 @@
+import argparse
+import os
+import pandas as pd
+from tqdm import tqdm
+from utils.utils import load_coords
+
+parser = argparse.ArgumentParser(description='make single mutant tsv')
+parser.add_argument("-d", "--dataset", type=str, default=None)
+args = parser.parse_args()
+
+one_letter = {
+ 'VAL':'V', 'ILE':'I', 'LEU':'L', 'GLU':'E', 'GLN':'Q',
+ 'ASP':'D', 'ASN':'N', 'HIS':'H', 'TRP':'W', 'PHE':'F', 'TYR':'Y',
+ 'ARG':'R', 'LYS':'K', 'SER':'S', 'THR':'T', 'MET':'M', 'ALA':'A',
+ 'GLY':'G', 'PRO':'P', 'CYS':'C'
+ }
+AA = list(one_letter.values())
+
+base_dir = os.path.join(args.dataset, "DATASET")
+proteins = os.listdir(base_dir)
+for p in proteins:
+ fasta_file = os.path.join(base_dir, p, f"{p}.fasta")
+ pdb_file = os.path.join(base_dir, p, f"{p}.pdb")
+ if os.path.exists(fasta_file):
+ seq = open(fasta_file, "r").readlines()[1].strip()
+ elif os.path.exists(pdb_file):
+ _, seq = load_coords(pdb_file, "A")
+ else:
+ raise ValueError(f"Invalid file: {fasta_file} or {pdb_file}")
+ data = {"mutant":[], "score":[]}
+ for idx,s in enumerate(seq):
+ for a in AA:
+ if a == s:
+ continue
+ data["mutant"].append(f"{s}{idx+1}{a}")
+ data["score"].append(0)
+
+ print(f"{p} contains { len(data['mutant'])}")
+ out_file = os.path.join(base_dir, p, f"{p}.csv")
+ pd.DataFrame(data).to_csv(out_file, index=False)
diff --git a/proteingym/baselines/protssn/src/config/egnn.yaml b/proteingym/baselines/protssn/src/config/egnn.yaml
new file mode 100644
index 0000000..e2c3de2
--- /dev/null
+++ b/proteingym/baselines/protssn/src/config/egnn.yaml
@@ -0,0 +1,9 @@
+egnn:
+ edge_attr_dim: 93
+ output_dim: 20
+ dropout: 0
+ n_layers: 6
+ residual: False
+ embedding: False
+ embedding_dim: 64
+ mlp_num: 2
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/config/fsdp_config.yaml b/proteingym/baselines/protssn/src/config/fsdp_config.yaml
new file mode 100644
index 0000000..e313ce3
--- /dev/null
+++ b/proteingym/baselines/protssn/src/config/fsdp_config.yaml
@@ -0,0 +1,25 @@
+compute_environment: LOCAL_MACHINE
+deepspeed_config: {}
+distributed_type: FSDP
+downcast_bf16: 'no'
+dynamo_config: {}
+dynamo_backend: 'no'
+fsdp_config:
+ fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
+ fsdp_backward_prefetch_policy: BACKWARD_PRE
+ fsdp_offload_params: false
+ fsdp_sharding_strategy: 1
+ fsdp_state_dict_type: FULL_STATE_DICT
+ fsdp_transformer_layer_cls_to_wrap: EsmLayer
+machine_rank: 0
+main_training_function: main
+megatron_lm_config: {}
+mixed_precision: 'no'
+num_machines: 1
+num_processes: 4
+rdzv_backend: static
+same_network: true
+tpu_env: []
+tpu_use_cluster: false
+tpu_use_sudo: false
+use_cpu: false
diff --git a/proteingym/baselines/protssn/src/config/gat.yaml b/proteingym/baselines/protssn/src/config/gat.yaml
new file mode 100644
index 0000000..f2f026b
--- /dev/null
+++ b/proteingym/baselines/protssn/src/config/gat.yaml
@@ -0,0 +1,5 @@
+gat:
+ hidden_channels: 512
+ edge_attr_dim: 93
+ dropout: 0.5
+ n_layers: 4
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/config/gcn.yaml b/proteingym/baselines/protssn/src/config/gcn.yaml
new file mode 100644
index 0000000..193309c
--- /dev/null
+++ b/proteingym/baselines/protssn/src/config/gcn.yaml
@@ -0,0 +1,5 @@
+gcn:
+ hidden_channels: 512
+ edge_attr_dim: 93
+ dropout: 0.5
+ n_layers: 4
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/data.py b/proteingym/baselines/protssn/src/data.py
new file mode 100644
index 0000000..e4baef5
--- /dev/null
+++ b/proteingym/baselines/protssn/src/data.py
@@ -0,0 +1,64 @@
+import os, sys
+# set path
+current_dir = os.getcwd()
+sys.path.append(current_dir)
+import argparse
+from src.dataset.cath_dataset import CathDataset
+from src.dataset.mutant_dataset import MutantDataset
+from src.utils.dataset_utils import NormalizeProtein
+
+
+def build_cath_dataset(args, split):
+ dataset = CathDataset(
+ root=args.cath_dataset,
+ split=split,
+ divide_num=1,
+ divide_idx=0,
+ c_alpha_max_neighbors=args.c_alpha_max_neighbors,
+ set_length=None,
+ p=args.noise_ratio,
+ normalize_file=args.repo_path+os.sep+f'/proteingym/baselines/protssn/norm/cath_k{args.c_alpha_max_neighbors}_mean_attr.pt',
+ )
+ return dataset
+
+
+def build_mutant_dataset(args):
+ mm_dataset = MutantDataset(
+ root=args.mutant_dataset_dir,
+ name=args.mutant_name,
+ raw_dir=args.mutant_dataset_dir+"/DATASET",
+ c_alpha_max_neighbors=args.c_alpha_max_neighbors,
+ pre_transform=NormalizeProtein(
+ filename=args.repo_path+os.sep+f'/proteingym/baselines/protssn/norm/cath_k{args.c_alpha_max_neighbors}_mean_attr.pt'
+ ),
+ )
+ return mm_dataset
+
+def prepare_train_val_dataset(args):
+ # load protein dataset like CATHs40
+ train_dataset = build_cath_dataset(args, "train")
+ val_dataset = build_cath_dataset(args, "val")
+
+ return train_dataset, val_dataset
+
+
+
+if __name__ == "__main__":
+ parser = argparse.ArgumentParser()
+ parser.add_argument("--c_alpha_max_neighbors", type=int, default=30, help="number of nearest nodes to build graph")
+
+ parser.add_argument("--data_type", type=str, choices=["cath", "mutant"], default="cath", help="type of dataset")
+ # build cath train dataset
+ parser.add_argument("--cath_dataset", type=str, default="data/cath40_k30", help="name of cath dataset")
+ parser.add_argument("--noise_ratio", type=float, default=0.05)
+
+ # build zero-shot mutant prediction dataset
+ parser.add_argument("--mutant_dataset_dir",type=str,default="data/proteingym-benchmark",help="dir of mutation dataset")
+ parser.add_argument("--mutant_name",type=str,default="proteingym_k30",help="name of mutation dataset")
+
+ args = parser.parse_args()
+
+ if args.data_type == "cath":
+ cath_dataset = build_cath_dataset(args, "train")
+ elif args.data_type == "mutant":
+ mutant_dataset = build_mutant_dataset(args)
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/dataset/cath_dataset.py b/proteingym/baselines/protssn/src/dataset/cath_dataset.py
new file mode 100644
index 0000000..298987d
--- /dev/null
+++ b/proteingym/baselines/protssn/src/dataset/cath_dataset.py
@@ -0,0 +1,753 @@
+
+import os
+import torch
+import sys
+import math
+import random
+import warnings
+import torch
+import os
+import sys
+import torch.nn.functional as F
+import scipy.spatial as spa
+import numpy as np
+from tqdm import tqdm
+from scipy.special import softmax
+from Bio.PDB import PDBParser, ShrakeRupley
+from Bio.PDB.PDBExceptions import PDBConstructionWarning
+from rdkit.Chem import GetPeriodicTable
+from typing import Callable, List, Optional
+from torch.utils.data import DataLoader
+from torch_geometric.data import InMemoryDataset, Data
+current_dir = os.getcwd()
+sys.path.append(current_dir)
+from src.utils.dataset_utils import safe_index, one_hot_res, log, dihedral, NormalizeProtein, dataset_argument_, get_stat
+
+cwd = os.getcwd()
+
+sys.path.append(cwd + '/src/dataset_utils')
+warnings.filterwarnings("ignore")
+
+one_letter = {
+ 'VAL':'V', 'ILE':'I', 'LEU':'L', 'GLU':'E', 'GLN':'Q',
+ 'ASP':'D', 'ASN':'N', 'HIS':'H', 'TRP':'W', 'PHE':'F', 'TYR':'Y',
+ 'ARG':'R', 'LYS':'K', 'SER':'S', 'THR':'T', 'MET':'M', 'ALA':'A',
+ 'GLY':'G', 'PRO':'P', 'CYS':'C'
+ }
+
+class CathDataset(InMemoryDataset):
+ r"""
+ Args:
+ root (string): Root directory where the dataset should be saved.
+ name (string): The name of the dataset.
+ raw_dir (string, optional): Root directory where the
+ original dataset stored(default: :obj:`None`)
+
+ num_residue_type (int, optional): The number of amino acid types.
+ (default: obj:'20')
+ micro_radius (int, optional): The radius of micro-environment
+ centered on the mask node. (default: obj:'20')
+ c_alpha_max_neighbors (int, optional): The number of maximum
+ connected nodes. (default: obj:'10')
+ cutoff (int, optional): The maximum connected nodes distance
+ (default: obj:'30')
+ seq_dist_cut (int, optional): one-hot encoding the sequence distance
+ edge attribute
+ (default: obj:)
+ [0.25,0.5,0.75,0.9,0.95,0.98,0.99]
+ [ 2. 3. 13. 63. 127. 247. 347.]
+ num_val (int, optional): The number of validation samples in case of "random" split. (default: 500)
+ num_test (int, optional): The number of test samples in case of "random" split. (default: 1000)
+
+ # use_localdatastet (bool) (bool,optional): If :obj:'True', online dataset
+ # will be downloaded. If not, local pdb files will be used
+ # (default: obj:'True')
+
+ transform (callable, optional): A function/transform that takes in an
+ :obj:`torch_geometric.data.Data` object and returns a transformed
+ version. The data object will be transformed before every access.
+ (default: :obj:`None`)
+ pre_transform (callable, optional): A function/transform that takes in
+ an :obj:`torch_geometric.data.Data` object and returns a
+ transformed version. The data object will be transformed before
+ being saved to disk. (default: :obj:`None`)
+ pre_filter (callable, optional): A function that takes in an
+ :obj:`torch_geometric.data.Data` object and returns a boolean
+ value, indicating whether the data object should be included in the
+ final dataset. (default: :obj:`None`)
+ """
+
+ splits = ['train', 'val', 'test']
+ allowable_features = {
+ 'possible_atomic_num_list': list(range(1, 119)) + ['misc'],
+ 'possible_chirality_list': [
+ 'CHI_UNSPECIFIED',
+ 'CHI_TETRAHEDRAL_CW',
+ 'CHI_TETRAHEDRAL_CCW',
+ 'CHI_OTHER'
+ ],
+ 'possible_degree_list': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 'misc'],
+ 'possible_numring_list': [0, 1, 2, 3, 4, 5, 6, 'misc'],
+ 'possible_implicit_valence_list': [0, 1, 2, 3, 4, 5, 6, 'misc'],
+ 'possible_formal_charge_list': [-5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 'misc'],
+ 'possible_numH_list': [0, 1, 2, 3, 4, 5, 6, 7, 8, 'misc'],
+ 'possible_number_radical_e_list': [0, 1, 2, 3, 4, 'misc'],
+ 'possible_hybridization_list': [
+ 'SP', 'SP2', 'SP3', 'SP3D', 'SP3D2', 'misc'
+ ],
+ 'possible_is_aromatic_list': [False, True],
+ 'possible_is_in_ring3_list': [False, True],
+ 'possible_is_in_ring4_list': [False, True],
+ 'possible_is_in_ring5_list': [False, True],
+ 'possible_is_in_ring6_list': [False, True],
+ 'possible_is_in_ring7_list': [False, True],
+ 'possible_is_in_ring8_list': [False, True],
+ 'possible_amino_acids': ['ALA', 'ARG', 'ASN', 'ASP', 'CYS', 'GLN', 'GLU', 'GLY', 'HIS', 'ILE', 'LEU', 'LYS',
+ 'MET',
+ 'PHE', 'PRO', 'SER', 'THR', 'TRP', 'TYR', 'VAL', 'HIP', 'HIE', 'TPO', 'HID', 'LEV',
+ 'MEU',
+ 'PTR', 'GLV', 'CYT', 'SEP', 'HIZ', 'CYM', 'GLM', 'ASQ', 'TYS', 'CYX', 'GLZ', 'misc'],
+ 'possible_atom_type_2': ['C*', 'CA', 'CB', 'CD', 'CE', 'CG', 'CH', 'CZ', 'N*', 'ND', 'NE', 'NH', 'NZ', 'O*',
+ 'OD',
+ 'OE', 'OG', 'OH', 'OX', 'S*', 'SD', 'SG', 'misc'],
+ 'possible_atom_type_3': ['C', 'CA', 'CB', 'CD', 'CD1', 'CD2', 'CE', 'CE1', 'CE2', 'CE3', 'CG', 'CG1', 'CG2',
+ 'CH2',
+ 'CZ', 'CZ2', 'CZ3', 'N', 'ND1', 'ND2', 'NE', 'NE1', 'NE2', 'NH1', 'NH2', 'NZ', 'O',
+ 'OD1',
+ 'OD2', 'OE1', 'OE2', 'OG', 'OG1', 'OH', 'OXT', 'SD', 'SG', 'misc'],
+ }
+
+ def __init__(self, root: str,
+ split: str = 'train',
+ num_residue_type: int = 20,
+ micro_radius: int = 20,
+ c_alpha_max_neighbors: int = 10,
+ cutoff: int = 30,
+ seq_dist_cut: int = 64,
+ use_micro: bool = False,
+ use_angle: bool = False,
+ use_omega: bool = False,
+ transform: Optional[Callable] = None,
+ pre_transform: Optional[Callable] = None,
+ pre_filter: Optional[Callable] = None,
+ divide_num: int = 1,
+ divide_idx: int = 0,
+ set_length: int = 500,
+ num_val: int = 10,
+ is_normalize: bool = True,
+ normalize_file: str = None,
+ p: float = 0.5,
+ use_sasa: bool =False,
+ use_bfactor: bool = False,
+ use_dihedral: bool = False,
+ use_coordinate: bool = False,
+ use_denoise: bool = False,
+ noise_type: str = 'wild',
+ temperature = 1.0
+ ):
+ self.p=p
+ self.use_sasa=use_sasa
+ self.use_bfactor=use_bfactor
+ self.use_dihedral=use_dihedral
+ self.use_coordinate=use_coordinate
+ self.use_denoise=use_denoise
+ self.noise_type = noise_type
+ self.temperature = temperature
+
+ self.split = split
+ assert self.split in self.splits
+
+ self.num_residue_type = num_residue_type
+ self.micro_radius = micro_radius
+ self.c_alpha_max_neighbors = c_alpha_max_neighbors
+ self.seq_dist_cut = seq_dist_cut
+ self.use_micro = use_micro
+ self.use_angle = use_angle
+ self.use_omega = use_omega
+ self.cutoff = cutoff
+
+ self.num_val = num_val
+ self.divide_num = divide_num
+ self.divide_idx = divide_idx
+ self.set_length = set_length
+
+ self.is_normalize = is_normalize
+ self.normalize_file = normalize_file
+
+ self.wrong_proteins = ['1kp0A01', '2atcA02']
+
+ self.sr = ShrakeRupley(probe_radius=1.4, # in A. Default is 1.40 roughly the radius of a water molecule.
+ n_points=100) # resolution of the surface of each atom. Default is 100. A higher number of points results in more precise measurements, but slows down the calculation.
+ self.periodic_table = GetPeriodicTable()
+ self.biopython_parser = PDBParser()
+
+ super().__init__(root, transform, pre_transform, pre_filter)
+ self.dataset = torch.load(self.processed_paths[self.splits.index(self.split)])
+ # self.data, self.slices = torch.load(
+ # self.processed_paths[self.splits.index(self.split)])
+ # self.nums_amino_cum = self.slices['x']
+
+ @property
+ def raw_file_names(self) -> str:
+ raw_file_names = os.path.join('data', 'cath', "dompdb")
+ if not os.path.exists(raw_file_names):
+ os.mkdir(raw_file_names)
+ return raw_file_names
+
+ @property
+ def raw_dir(self) -> str:
+ if not os.path.exists(self.root):
+ os.mkdir(self.root)
+ raw_dir = os.path.join(self.root, 'raw')
+ if not os.path.exists(raw_dir):
+ os.mkdir(raw_dir)
+ return raw_dir
+
+ @property
+ def saved_graph_dir(self) -> str:
+ dir_root = os.path.join(self.root)
+ if not os.path.exists(dir_root):
+ os.mkdir(dir_root)
+ dir_name = os.path.join(dir_root, 'graph_seq')
+ if not os.path.exists(dir_name):
+ os.mkdir(dir_name)
+ if not self.set_length:
+ self.set_length = len(os.listdir(dir_name))
+ return dir_name
+
+ @property
+ def saved_amino_cum(self) -> str:
+ amino_cum_name = os.path.join(
+ self.root, 'amino_cum.pt')
+ return amino_cum_name
+
+ @property
+ def processed_dir(self) -> str:
+ return os.path.join(self.root, 'processed_seq')
+
+ @property
+ def processed_file_names(self) -> str:
+ return ['train.pt', 'val.pt']
+
+
+ def write_info(self):
+ written_filename = os.path.join(self.root, 'wrong_protein_names.txt')
+ file = open(written_filename, "w+")
+ for protein_name in self.wrong_proteins:
+ file.writelines(protein_name + '\n')
+ file.close()
+
+ def process(self):
+ #generate graph data and save in graph dir
+ self.generate_protein_graph()
+ # self.write_info()
+
+ filenames = os.listdir(self.saved_graph_dir)
+ protein_length = len(filenames)
+ if self.set_length:
+ protein_length = min(protein_length, self.set_length)
+
+ if not self.normalize_file:
+ self.normalize_file = get_stat(self.saved_graph_dir)
+
+ random.shuffle(filenames)
+ train_list = [f for f in filenames if "_" in f or "-" in f]
+ filenames = [f for f in filenames if "_" not in f or "-" not in f]
+ train_list.extend(filenames[:-self.num_val])
+ filenames_list = [train_list, filenames[-self.num_val:]]
+
+ for k in range(2):####split train,val,test
+ data_list = []
+
+ ###move special name to test set
+ special_name_list = ["p53-dimer.pdb.pt"]
+ for special_name in special_name_list:
+ if special_name in filenames_list[0]:
+ filenames_list[0].remove(special_name)
+ filenames_list[1].append(special_name)
+ for i in tqdm(range(len(filenames_list[k]))):
+ file = filenames_list[k][i]
+ try:
+ graph1 = torch.load(os.path.join(self.saved_graph_dir, file))##load processed graph data torch pt file
+ except:
+ print(file)
+ continue
+ del graph1['distances']
+ del graph1['edge_dist']
+ del graph1['mu_r_norm']
+ del graph1['seq']
+ data_list.append(graph1)
+ if self.is_normalize:
+ normalize_transform = NormalizeProtein(filename=self.normalize_file)
+ data_list = [d for d in data_list if normalize_transform(d)]
+ if self.pre_filter is not None:
+ data_list = [d for d in data_list if self.pre_filter(d)]
+ if self.pre_transform is not None:
+ data_list = [self.pre_transform(d) for d in data_list]
+
+ torch.save(data_list, self.processed_paths[k])
+
+ def generate_protein_graph(self):
+ names = os.listdir(self.raw_file_names)
+ print(names)
+ names.sort()
+ n = int(np.ceil(len(names) / self.divide_num))
+ names = names[n * self.divide_idx:min(len(names), n * (self.divide_idx + 1))]
+ for idx, name in enumerate(tqdm(names)):
+ saved_graph_filename = os.path.join(self.saved_graph_dir, name + '.pt')
+ if os.path.exists(saved_graph_filename):
+ continue
+ protein_filename = os.path.join(self.raw_file_names, name)
+ if (name in self.wrong_proteins) or (not protein_filename):
+ continue
+ try:
+ rec, rec_coords, c_alpha_coords, n_coords, c_coords,seq = self.get_receptor_inference(protein_filename)
+ except:
+ continue
+ if rec !=False:
+ if len(seq)>len(c_alpha_coords):
+ del seq[-(len(seq)-len(c_alpha_coords)):]
+ #meet "dna" data will remove the file and rec will be false
+ # print(self.c_alpha_max_neighbors)
+ rec_graph = self.get_calpha_graph(rec, c_alpha_coords, n_coords, c_coords, rec_coords,seq)
+ if not rec_graph:
+ self.wrong_proteins.append(name)
+ continue
+ torch.save(rec_graph, saved_graph_filename)
+
+ def rec_residue_featurizer(self, rec, chain_id, one_hot=True, add_feature=None):
+ count = 0
+ flag_sasa=1
+ try:
+ self.sr.compute(rec, level="R")
+ except:
+ flag_sasa=0
+ for i, chain in enumerate(rec.get_chains()):
+ if i != chain_id:
+ continue
+ num_res = len(list(chain.get_residues()))#len([_ for _ in rec.get_residues()])
+ num_feature = 2
+ if add_feature.any():
+ num_feature += add_feature.shape[1]
+ res_feature = torch.zeros(num_res, self.num_residue_type + num_feature)
+ for i, residue in enumerate(chain.get_residues()):
+ if flag_sasa==0:
+ residue.sasa=0
+ sasa = residue.sasa
+ for atom in residue:
+ if atom.name == 'CA':
+ bfactor = atom.bfactor
+ assert not np.isinf(bfactor)
+ assert not np.isnan(bfactor)
+ assert not np.isinf(sasa)
+ assert not np.isnan(sasa)
+
+ residx = safe_index(
+ self.allowable_features['possible_amino_acids'], residue.get_resname())
+ res_feat_1 = one_hot_res(
+ residx, num_residue_type=self.num_residue_type) if one_hot else [residx]
+ if not res_feat_1:
+ return False
+ res_feat_1.append(sasa)
+ res_feat_1.append(bfactor)
+ if num_feature > 2:
+ res_feat_1.extend(list(add_feature[count, :]))
+ res_feature[count, :] = torch.tensor(res_feat_1, dtype=torch.float32)
+ count += 1
+ # print("numnodes:", num_res, count,len(list(chain.get_residues())))
+ for k in range(self.num_residue_type, self.num_residue_type + 2):
+ mean = res_feature[:, k].mean()
+ std = res_feature[:, k].std()
+ res_feature[:, k] = (res_feature[:, k] -mean) / (std + 0.000000001)
+ return res_feature
+
+ def get_node_features(self, n_coords, c_coords, c_alpha_coords, coord_mask, with_coord_mask=True, use_angle=False,
+ use_omega=False):
+ num_res = n_coords.shape[0]
+ if use_omega:
+ num_angle_type = 3
+ angles = np.zeros((num_res, num_angle_type))
+ for i in range(num_res - 1):
+ # These angles are called φ (phi) which involves the backbone atoms C-N-Cα-C
+ angles[i, 0] = dihedral(
+ c_coords[i], n_coords[i], c_alpha_coords[i], n_coords[i + 1])
+ # psi involves the backbone atoms N-Cα-C-N.
+ angles[i, 1] = dihedral(
+ n_coords[i], c_alpha_coords[i], c_coords[i], n_coords[i + 1])
+ angles[i, 2] = dihedral(
+ c_alpha_coords[i], c_coords[i], n_coords[i + 1], c_alpha_coords[i + 1])
+ else:
+ num_angle_type = 2
+ angles = np.zeros((num_res, num_angle_type))
+ for i in range(num_res - 1):
+ # These angles are called φ (phi) which involves the backbone atoms C-N-Cα-C
+ angles[i, 0] = dihedral(
+ c_coords[i], n_coords[i], c_alpha_coords[i], n_coords[i + 1])
+ # psi involves the backbone atoms N-Cα-C-N.
+ angles[i, 1] = dihedral(
+ n_coords[i], c_alpha_coords[i], c_coords[i], n_coords[i + 1])
+ if use_angle:
+ node_scalar_features = angles
+ else:
+ node_scalar_features = np.zeros((num_res, num_angle_type * 2))
+ for i in range(num_angle_type):
+ node_scalar_features[:, 2 * i] = np.sin(angles[:, i])
+ node_scalar_features[:, 2 * i + 1] = np.cos(angles[:, i])
+
+ if with_coord_mask:
+ node_scalar_features = torch.cat([
+ node_scalar_features,
+ coord_mask.float().unsqueeze(-1)
+ ], dim=-1)
+ node_vector_features = None
+ return node_scalar_features, node_vector_features
+
+ def get_calpha_graph(self, rec, c_alpha_coords, n_coords, c_coords, coords, seq):
+ chain_id = 0
+ scalar_feature, vec_feature = self.get_node_features(n_coords, c_coords, c_alpha_coords, coord_mask=None, with_coord_mask=False, use_angle=self.use_angle, use_omega=self.use_omega)
+ # Extract 3D coordinates and n_i,u_i,v_i
+ # vectors of representative residues ################
+ residue_representatives_loc_list = []
+ n_i_list = []
+ u_i_list = []
+ v_i_list = []
+ for i, chain in enumerate(rec.get_chains()):
+ if i != chain_id:
+ continue
+ for i, residue in enumerate(chain.get_residues()):
+ n_coord = n_coords[i]
+ c_alpha_coord = c_alpha_coords[i]
+ c_coord = c_coords[i]
+ u_i = (n_coord - c_alpha_coord) / \
+ np.linalg.norm(n_coord - c_alpha_coord)
+ t_i = (c_coord - c_alpha_coord) / \
+ np.linalg.norm(c_coord - c_alpha_coord)
+ n_i = np.cross(u_i, t_i) / \
+ np.linalg.norm(np.cross(u_i, t_i)) # main chain
+ v_i = np.cross(n_i, u_i)
+ assert (math.fabs(
+ np.linalg.norm(v_i) - 1.) < 1e-5), "protein utils protein_to_graph_dips, v_i norm larger than 1"
+ n_i_list.append(n_i)
+ u_i_list.append(u_i)
+ v_i_list.append(v_i)
+ residue_representatives_loc_list.append(c_alpha_coord)
+
+ residue_representatives_loc_feat = np.stack(residue_representatives_loc_list, axis=0) # (N_res, 3)
+ n_i_feat = np.stack(n_i_list, axis=0)
+ u_i_feat = np.stack(u_i_list, axis=0)
+ v_i_feat = np.stack(v_i_list, axis=0)
+ num_residues = len(c_alpha_coords)
+ if num_residues <= 1:
+ raise ValueError(f"rec contains only 1 residue!")
+ ################### Build the k-NN graph ##############################
+ assert num_residues == residue_representatives_loc_feat.shape[0]
+ assert residue_representatives_loc_feat.shape[1] == 3
+ distances = spa.distance.cdist(c_alpha_coords, c_alpha_coords)
+
+ src_list = []
+ dst_list = []
+ dist_list = []
+ mean_norm_list = []
+ for i in range(num_residues):
+ dst = list(np.where(distances[i, :] < self.cutoff)[0])
+ dst.remove(i)
+ if self.c_alpha_max_neighbors != None and len(dst) > self.c_alpha_max_neighbors:
+ dst = list(np.argsort(distances[i, :]))[
+ 1: self.c_alpha_max_neighbors + 1]
+ if len(dst) == 0:
+ # choose second because first is i itself
+ dst = list(np.argsort(distances[i, :]))[1:2]
+ log(
+ f'The c_alpha_cutoff {self.cutoff} was too small for one c_alpha such that it had no neighbors. So we connected it to the closest other c_alpha')
+ assert i not in dst
+ src = [i] * len(dst)
+ src_list.extend(src)
+ dst_list.extend(dst)
+ valid_dist = list(distances[i, dst])
+ dist_list.extend(valid_dist)
+ valid_dist_np = distances[i, dst]
+ sigma = np.array([1., 2., 5., 10., 30.]).reshape((-1, 1))
+ weights = softmax(- valid_dist_np.reshape((1, -1))** 2 / sigma, axis=1) # (sigma_num, neigh_num)
+ # print(weights) why weight??
+ assert weights[0].sum() > 1 - 1e-2 and weights[0].sum() < 1.01
+ diff_vecs = residue_representatives_loc_feat[src, :] - residue_representatives_loc_feat[dst, :] # (neigh_num, 3)
+ mean_vec = weights.dot(diff_vecs) # (sigma_num, 3)
+ denominator = weights.dot(np.linalg.norm(diff_vecs, axis=1)) # (sigma_num,)
+ mean_vec_ratio_norm = np.linalg.norm(mean_vec, axis=1) / denominator # (sigma_num,)
+ mean_norm_list.append(mean_vec_ratio_norm)
+ assert len(src_list) == len(dst_list)
+ assert len(dist_list) == len(dst_list)
+ residue_representatives_loc_feat = torch.from_numpy(residue_representatives_loc_feat.astype(np.float32))
+ x = self.rec_residue_featurizer(rec, chain_id, one_hot=True, add_feature=scalar_feature)
+ if isinstance(x, bool) and (not x):
+ return False
+ ######key part to generate graph!!!!!main
+ graph = Data(
+ x=x,## 26 feature 20+sasa+b factor+ two face angle
+ pos=residue_representatives_loc_feat,
+ edge_attr=self.get_edge_features(src_list, dst_list, dist_list, divisor=4), ##edge features
+ edge_index=torch.tensor([src_list, dst_list]),
+ edge_dist=torch.tensor(dist_list),
+ distances=torch.tensor(distances),
+ mu_r_norm=torch.from_numpy(np.array(mean_norm_list).astype(np.float32)),
+ seq = seq) ##about density capture
+ # Loop over all edges of the graph and build the various p_ij, q_ij, k_ij, t_ij pairs
+ edge_feat_ori_list = []
+ for i in range(len(dist_list)):
+ src = src_list[i]
+ dst = dst_list[i]
+ # place n_i, u_i, v_i as lines in a 3x3 basis matrix
+ basis_matrix = np.stack(
+ (n_i_feat[dst, :], u_i_feat[dst, :], v_i_feat[dst, :]), axis=0)
+ p_ij = np.matmul(basis_matrix,residue_representatives_loc_feat[src, :] - residue_representatives_loc_feat[dst, :])
+ q_ij = np.matmul(basis_matrix, n_i_feat[src, :]) # shape (3,)
+ k_ij = np.matmul(basis_matrix, u_i_feat[src, :])
+ t_ij = np.matmul(basis_matrix, v_i_feat[src, :])
+ s_ij = np.concatenate((p_ij, q_ij, k_ij, t_ij), axis=0) # shape (12,)
+ edge_feat_ori_list.append(s_ij)
+
+ edge_feat_ori_feat = np.stack(edge_feat_ori_list, axis=0) # shape (num_edges, 4, 3)
+ edge_feat_ori_feat = torch.from_numpy(edge_feat_ori_feat.astype(np.float32))
+ graph.edge_attr = torch.cat([graph.edge_attr, edge_feat_ori_feat], axis=1) # (num_edges, 17)
+ # graph = self.remove_node(graph, graph.x.shape[0]-1)###remove the last node, can not calculate the two face angle
+ # self.get_calpha_graph_single(graph, 6)
+ return graph
+
+ def remove_node(self, graph, node_idx):
+ new_graph = Data.clone(graph)
+ # delete node
+ new_graph.x = torch.cat(
+ [new_graph.x[:node_idx, :], new_graph.x[node_idx + 1:, :]])
+ new_graph.pos = torch.cat(
+ [new_graph.pos[:node_idx, :], new_graph.pos[node_idx + 1:, :]])
+ new_graph.mu_r_norm = torch.cat(
+ [new_graph.mu_r_norm[:node_idx, :], new_graph.mu_r_norm[node_idx + 1:, :]])
+
+ # delete edge
+ keep_edge = (torch.sum(new_graph.edge_index == node_idx, dim=0) == 0)
+ new_graph.edge_index = new_graph.edge_index[:, keep_edge]
+ new_graph.edge_attr = new_graph.edge_attr[keep_edge, :]
+ return new_graph
+
+ def get_edge_features(self, src_list, dst_list, dist_list, divisor=4):
+ seq_edge = torch.absolute(torch.tensor(
+ src_list) - torch.tensor(dst_list)).reshape(-1, 1)
+ seq_edge = torch.where(seq_edge > self.seq_dist_cut,
+ self.seq_dist_cut, seq_edge)
+ seq_edge = F.one_hot(
+ seq_edge, num_classes=self.seq_dist_cut + 1).reshape((-1, self.seq_dist_cut + 1))
+ contact_sig = torch.where(torch.tensor(
+ dist_list) <= 8, 1, 0).reshape(-1, 1)
+ # avg distance = 7. So divisor = (4/7)*7 = 4
+ dist_fea = self.distance_featurizer(dist_list, divisor=divisor)
+ return torch.concat([seq_edge, dist_fea, contact_sig], dim=-1)
+
+ def get_receptor_inference(self, rec_path):
+ chain_id=0
+ with warnings.catch_warnings():
+ warnings.filterwarnings("ignore", category=PDBConstructionWarning)
+ structure = self.biopython_parser.get_structure('random_id', rec_path)
+ rec = structure[0]##len(structure)=1
+ head = self.biopython_parser.get_header()['head']
+ if head.find('dna') > -1:
+ return False, False, False, False, False,False
+ coords = []
+ c_alpha_coords = []
+ n_coords = []
+ c_coords = []
+ valid_chain_ids = []
+ lengths = []
+ seq = []
+ for i, chain in enumerate(rec):
+ print("chain num",i,chain_id,chain)
+ if i != chain_id:##select chain A:i=0 or B:i=1
+ continue
+ chain_coords = [] # num_residues, num_atoms, 3
+ chain_c_alpha_coords = []
+ chain_n_coords = []
+ chain_c_coords = []
+ count = 0
+ invalid_res_ids = []
+ for res_idx, residue in enumerate(chain):
+ if residue.get_resname() == 'HOH':
+ invalid_res_ids.append(residue.get_id())
+ continue
+ residue_coords = []
+ c_alpha, n, c = None, None, None
+ for atom in residue:
+ if atom.name == 'CA':
+ c_alpha = list(atom.get_vector())
+ seq.append(str(residue).split(" ")[1])
+ if atom.name == 'N':
+ n = list(atom.get_vector())
+ if atom.name == 'C':
+ c = list(atom.get_vector())
+ residue_coords.append(list(atom.get_vector()))
+ # only append residue if it is an amino acid and not some weired molecule that is part of the complex
+ if c_alpha != None and n != None and c != None:
+ chain_c_alpha_coords.append(c_alpha)
+ chain_n_coords.append(n)
+ chain_c_coords.append(c)
+ chain_coords.append(np.array(residue_coords))
+ count += 1
+ else:
+ invalid_res_ids.append(residue.get_id())
+ for res_id in invalid_res_ids:
+ chain.detach_child(res_id)
+ lengths.append(count)
+ coords.append(chain_coords)
+ c_alpha_coords.append(np.array(chain_c_alpha_coords))
+ n_coords.append(np.array(chain_n_coords))
+ c_coords.append(np.array(chain_c_coords))
+ if len(chain_coords) > 0:
+ valid_chain_ids.append(chain.get_id())
+ valid_coords = []
+ valid_c_alpha_coords = []
+ valid_n_coords = []
+ valid_c_coords = []
+ valid_lengths = []
+ invalid_chain_ids = []
+ for i, chain in enumerate(rec):
+ # print("chain:",i,chain, len(valid_coords), len(valid_chain_ids), len(coords), coords[0][0].shape, len(coords[0]))
+ if i != chain_id:
+ continue
+ if chain.get_id() in valid_chain_ids:
+ valid_coords.append(coords[0])
+ valid_c_alpha_coords.append(c_alpha_coords[0])
+ valid_n_coords.append(n_coords[0])
+ valid_c_coords.append(c_coords[0])
+ valid_lengths.append(lengths[0])
+ else:
+ invalid_chain_ids.append(chain.get_id())
+ # list with n_residues arrays: [n_atoms, 3]
+ coords = [item for sublist in valid_coords for item in sublist]
+ if len(valid_c_alpha_coords) == 0:
+ return False, False, False, False, False,False
+ c_alpha_coords = np.concatenate(valid_c_alpha_coords, axis=0) # [n_residues, 3]
+ n_coords = np.concatenate(valid_n_coords, axis=0) # [n_residues, 3]
+ c_coords = np.concatenate(valid_c_coords, axis=0) # [n_residues, 3]
+
+ for invalid_id in invalid_chain_ids:
+ rec.detach_child(invalid_id)
+
+ assert len(c_alpha_coords) == len(n_coords)
+ assert len(c_alpha_coords) == len(c_coords)
+ assert sum(valid_lengths) == len(c_alpha_coords)
+ return rec, coords, c_alpha_coords, n_coords, c_coords,seq
+
+ def len(self):
+ return len(self.dataset)
+
+ def get_statistic_info(self):
+ node_num = torch.zeros(self.length_total)
+ edge_num = torch.zeros(self.length_total)
+ for i in tqdm(range(self.length_total)):
+ graph = self.get(i)
+ node_num[i] = graph.x.shape[0]
+ edge_num[i] = graph.edge_index.shape[1]
+ num_node_min = torch.min(node_num)
+ num_node_max = torch.max(node_num)
+ num_node_avg = torch.mean(node_num)
+ num_edge_min = torch.min(edge_num)
+ num_edge_max = torch.max(edge_num)
+ num_edge_avg = torch.mean(edge_num)
+ print(f'Graph Num: {self.length_total}')
+ print(
+ f'Min Nodes: {num_node_min:.2f} Max Nodes: {num_node_max:.2f}. Avg Nodes: {num_node_avg:.2f}')
+ print(
+ f'Min Edges: {num_edge_min:.2f} Max Edges: {num_edge_max:.2f}. Avg Edges: {num_edge_avg:.2f}')
+
+ def _get_noise(self, token_len: int, prob: List=[]):
+ prob = prob if prob else [0.08, 0.05, 0.04, 0.06, 0.01, 0.04, 0.07, 0.07, 0.02, 0.06, 0.1, 0.06,
+ 0.02, 0.04, 0.04, 0.06, 0.05, 0.01, 0.03, 0.07]
+ multant_pos = ((torch.rand(token_len) <= self.p)).nonzero().flatten()
+ if len(multant_pos) == 0:
+ return None, None
+ multant_trg = torch.multinomial(torch.tensor(prob), len(multant_pos), replacement=True)
+ return multant_pos, multant_trg
+
+
+ def _token_rep_noise(self, data, multant_pos, multant_trg, rep_noise_type='window_3'):
+ num_classes = 20
+ multant_rep = data.token_rep.clone()
+ for mut_pos, mut_trg in zip(multant_pos, multant_trg):
+ mut_trg_ = F.one_hot(mut_trg, num_classes=num_classes)
+ if rep_noise_type == 'mean':
+ trg_rep = data.token_rep[(data.x[:,:20] == mut_trg_).sum(1) == num_classes].mean(0)
+ if torch.isnan(trg_rep).sum() > 0:
+ continue
+ multant_rep[mut_pos] = trg_rep
+ elif "window" in rep_noise_type:
+ window_size = int(rep_noise_type.split("_")[-1])
+ start_pos = mut_pos - math.ceil(window_size/2)
+ end_pos = start_pos + window_size
+ if end_pos > len(data.token_rep):
+ start_pos = mut_pos - window_size
+ trg_rep = data.token_rep[start_pos:].mean(0)
+ elif start_pos < 0:
+ end_pos = window_size
+ trg_rep = data.token_rep[:end_pos].mean(0)
+ else:
+ trg_rep = data.token_rep[start_pos:end_pos].mean(0)
+ multant_rep[mut_pos] = trg_rep
+ return multant_rep
+
+ def get(self, idx):
+ # idx_protein = idx
+ # idx_x0, idx_x1 = self.slices['x'][idx_protein], self.slices['x'][idx_protein + 1]
+ # idx_edge0, idx_edge1 = self.slices['edge_index'][idx_protein], self.slices['edge_index'][idx_protein + 1]
+
+ # data = Data(
+ # x=self.data.x[idx_x0:idx_x1, :],
+ # pos=self.data.pos[idx_x0:idx_x1, :],
+ # edge_index=self.data.edge_index[:, idx_edge0:idx_edge1],
+ # edge_attr=self.data.edge_attr[idx_edge0:idx_edge1, :],
+ # lenth=idx_x1-idx_x0
+ # )
+ data = self.dataset[idx]
+
+ token_len = data.x.shape[0]
+ data.y = data.x[:token_len, :self.num_residue_type].argmax(1)
+ multant_pos, multant_trg = self._get_noise(token_len=token_len)
+ if multant_pos is not None:
+ noisey = data.x[:, :20].argmax(dim=1)
+ noisey[multant_pos] = multant_trg
+ data.x[:,:20] = F.one_hot(noisey, num_classes=20)
+
+ return data
+
+
+ def find_idx(self, idx_protein, amino_idx):
+ idx = (self.distances[idx_protein][:-1, amino_idx]< self.micro_radius).nonzero(as_tuple=True)[0]
+ return idx
+
+ def get_calpha_graph_single(self, graph, idx_protein, amino_idx):
+ choosen_amino_idx = self.find_idx(idx_protein, amino_idx)
+ keep_edge_index = []
+ for edge_idx in range(graph.num_edges):
+ edge = graph.edge_index.t()[edge_idx]
+ if (edge[0] in choosen_amino_idx) and (edge[1] in choosen_amino_idx):
+ keep_edge_index.append(edge_idx)
+ graph1 = Data(x=graph.x[choosen_amino_idx, :],
+ pos=graph.pos[choosen_amino_idx, :],
+ edge_index=graph.edge_index[:, keep_edge_index],
+ edge_attr=graph.edge_attr[keep_edge_index, :],
+ mu_r_norm=graph.mu_r_norm[choosen_amino_idx, :])
+ return graph1
+
+ def __repr__(self) -> str:
+ return f'{self.__class__.__name__}()'
+
+ def distance_featurizer(self, dist_list, divisor) -> torch.Tensor:
+ # you want to use a divisor that is close to 4/7 times the average distance that you want to encode
+ length_scale_list = [1.5 ** x for x in range(15)]
+ center_list = [0. for _ in range(15)]
+ num_edge = len(dist_list)
+ dist_list = np.array(dist_list)
+ transformed_dist = [np.exp(- ((dist_list / divisor) ** 2) / float(length_scale))
+ for length_scale, center in zip(length_scale_list, center_list)]
+ transformed_dist = np.array(transformed_dist).T
+ transformed_dist = transformed_dist.reshape((num_edge, -1))
+ return torch.from_numpy(transformed_dist.astype(np.float32))
+
+
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/dataset/mutant_dataset.py b/proteingym/baselines/protssn/src/dataset/mutant_dataset.py
new file mode 100644
index 0000000..3ffa2b1
--- /dev/null
+++ b/proteingym/baselines/protssn/src/dataset/mutant_dataset.py
@@ -0,0 +1,637 @@
+
+
+import os
+import torch
+import math
+import warnings
+import torch.nn.functional as F
+import pandas as pd
+import numpy as np
+import scipy.spatial as spa
+from torch_geometric.data import Data, Dataset, InMemoryDataset
+from tqdm import tqdm
+from scipy.special import softmax
+from Bio.PDB import PDBParser, ShrakeRupley
+from Bio.PDB.PDBExceptions import PDBConstructionWarning
+from rdkit.Chem import GetPeriodicTable
+from typing import Callable, List, Optional
+from src.utils.dataset_utils import safe_index, one_hot_res, log, dihedral
+warnings.filterwarnings("ignore")
+
+one_letter ={'VAL':'V', 'ILE':'I', 'LEU':'L', 'GLU':'E', 'GLN':'Q',
+ 'ASP':'D', 'ASN':'N', 'HIS':'H', 'TRP':'W', 'PHE':'F', 'TYR':'Y',
+ 'ARG':'R', 'LYS':'K', 'SER':'S', 'THR':'T', 'MET':'M', 'ALA':'A',
+ 'GLY':'G', 'PRO':'P', 'CYS':'C'}
+class MutantDataset(Dataset):
+ r"""
+ Args:
+ root (string): Root directory where the dataset should be saved.
+ name (string): The name of the dataset.
+ raw_dir (string, optional): Root directory where the
+ original dataset stored(default: :obj:`None`)
+
+ num_residue_type (int, optional): The number of amino acid types.
+ (default: obj:'20')
+ micro_radius (int, optional): The radius of micro-environment
+ centered on the mask node. (default: obj:'20')
+ c_alpha_max_neighbors (int, optional): The number of maximum
+ connected nodes. (default: obj:'10')
+ cutoff (int, optional): The maximum connected nodes distance
+ (default: obj:'30')
+ seq_dist_cut (int, optional): one-hot encoding the sequence distance
+ edge attribute
+ (default: obj:)
+ [0.25,0.5,0.75,0.9,0.95,0.98,0.99]
+ [ 2. 3. 13. 63. 127. 247. 347.]
+
+ # use_localdatastet (bool) (bool,optional): If :obj:'True', online dataset
+ # will be downloaded. If not, local pdb files will be used
+ # (default: obj:'True')
+
+ transform (callable, optional): A function/transform that takes in an
+ :obj:`torch_geometric.data.Data` object and returns a transformed
+ version. The data object will be transformed before every access.
+ (default: :obj:`None`)
+ pre_transform (callable, optional): A function/transform that takes in
+ an :obj:`torch_geometric.data.Data` object and returns a
+ transformed version. The data object will be transformed before
+ being saved to disk. (default: :obj:`None`)
+ pre_filter (callable, optional): A function that takes in an
+ :obj:`torch_geometric.data.Data` object and returns a boolean
+ value, indicating whether the data object should be included in the
+ final dataset. (default: :obj:`None`)
+ """
+ allowable_features = {
+ 'possible_atomic_num_list': list(range(1, 119)) + ['misc'],
+ 'possible_chirality_list': [
+ 'CHI_UNSPECIFIED',
+ 'CHI_TETRAHEDRAL_CW',
+ 'CHI_TETRAHEDRAL_CCW',
+ 'CHI_OTHER'
+ ],
+ 'possible_degree_list': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 'misc'],
+ 'possible_numring_list': [0, 1, 2, 3, 4, 5, 6, 'misc'],
+ 'possible_implicit_valence_list': [0, 1, 2, 3, 4, 5, 6, 'misc'],
+ 'possible_formal_charge_list': [-5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 'misc'],
+ 'possible_numH_list': [0, 1, 2, 3, 4, 5, 6, 7, 8, 'misc'],
+ 'possible_number_radical_e_list': [0, 1, 2, 3, 4, 'misc'],
+ 'possible_hybridization_list': [
+ 'SP', 'SP2', 'SP3', 'SP3D', 'SP3D2', 'misc'
+ ],
+ 'possible_is_aromatic_list': [False, True],
+ 'possible_is_in_ring3_list': [False, True],
+ 'possible_is_in_ring4_list': [False, True],
+ 'possible_is_in_ring5_list': [False, True],
+ 'possible_is_in_ring6_list': [False, True],
+ 'possible_is_in_ring7_list': [False, True],
+ 'possible_is_in_ring8_list': [False, True],
+ 'possible_amino_acids': ['ALA', 'ARG', 'ASN', 'ASP', 'CYS', 'GLN', 'GLU', 'GLY', 'HIS', 'ILE', 'LEU', 'LYS', 'MET',
+ 'PHE', 'PRO', 'SER', 'THR', 'TRP', 'TYR', 'VAL', 'HIP', 'HIE', 'TPO', 'HID', 'LEV', 'MEU',
+ 'PTR', 'GLV', 'CYT', 'SEP', 'HIZ', 'CYM', 'GLM', 'ASQ', 'TYS', 'CYX', 'GLZ', 'misc'],
+ 'possible_atom_type_2': ['C*', 'CA', 'CB', 'CD', 'CE', 'CG', 'CH', 'CZ', 'N*', 'ND', 'NE', 'NH', 'NZ', 'O*', 'OD',
+ 'OE', 'OG', 'OH', 'OX', 'S*', 'SD', 'SG', 'misc'],
+ 'possible_atom_type_3': ['C', 'CA', 'CB', 'CD', 'CD1', 'CD2', 'CE', 'CE1', 'CE2', 'CE3', 'CG', 'CG1', 'CG2', 'CH2',
+ 'CZ', 'CZ2', 'CZ3', 'N', 'ND1', 'ND2', 'NE', 'NE1', 'NE2', 'NH1', 'NH2', 'NZ', 'O', 'OD1',
+ 'OD2', 'OE1', 'OE2', 'OG', 'OG1', 'OH', 'OXT', 'SD', 'SG', 'misc'],
+ }
+ amino_acids_type = ['A', 'R', 'N', 'D', 'C', 'Q', 'E', 'G', 'H', 'I',
+ 'L', 'K', 'M', 'F', 'P', 'S', 'T', 'W', 'Y', 'V']
+
+ def __init__(self, root: str, name: str, raw_dir: str,
+ num_residue_type: int = 20,
+ micro_radius: int = 20,
+ c_alpha_max_neighbors: int = 10,
+ cutoff: int = 30,
+ seq_dist_cut: int = 64,
+ use_micro: bool = False,
+ use_angle: bool = False,
+ use_omega: bool = False,
+ transform: Optional[Callable] = None,
+ pre_transform: Optional[Callable] = None,
+ pre_filter: Optional[Callable] = None,
+ divide_num: int = 1,
+ divide_idx: int = 0,
+ replace_graph: bool = False,
+ replace_process: bool = False
+ ):
+ self.divide_num = divide_num
+ self.divide_idx = divide_idx
+ self.replace_graph = replace_graph
+ self.replace_process = replace_process
+
+ self.root = root
+ self.name = name
+ self.raw_root = raw_dir
+ self.num_residue_type = num_residue_type
+ self.micro_radius = micro_radius
+ self.c_alpha_max_neighbors = c_alpha_max_neighbors
+ self.cutoff = cutoff
+ self.seq_dist_cut = seq_dist_cut
+ self.use_micro = use_micro
+ self.use_angle = use_angle
+ self.use_omega = use_omega
+
+ self.protein_names = []
+ self.wrong_protein_names = []
+ self.total_protein_names = []
+ if os.path.exists(self.total_protein_name_file):
+ self.protein_names = open(self.saved_protein_name_file, 'r').read().splitlines()
+ if os.path.exists(self.wrong_protein_name_file):
+ self.wrong_protein_names = open(self.wrong_protein_name_file, 'r').read().splitlines()
+ if os.path.exists(self.total_protein_name_file):
+ self.total_protein_names = open(self.total_protein_name_file, 'r').read().splitlines()
+
+ # in A. Default is 1.40 roughly the radius of a water molecule.
+ # resolution of the surface of each atom. Default is 100. A higher number of points results in more precise measurements, but slows down the calculation.
+ self.sr = ShrakeRupley(probe_radius=1.4, n_points=100)
+ self.biopython_parser = PDBParser()
+
+ self.saved_graph_path = self.mk_saved_graph_path()
+ super().__init__(root, transform, pre_transform, pre_filter)
+ # After processing protein from pdb --> Data
+
+ self.length_total = len(self.protein_names)
+
+ @property
+ def raw_file_names(self) -> str:
+ return self.raw_root
+
+ @property
+ def raw_dir(self) -> str:
+ return self.raw_root
+
+ def mk_saved_graph_path(self) -> str:
+ os.makedirs(os.path.join(self.root, self.name.capitalize()), exist_ok=True)
+ graph_dir = os.path.join(self.root, self.name.capitalize(), 'graph')
+ os.makedirs(graph_dir, exist_ok=True)
+ return graph_dir
+
+ @property
+ def total_protein_name_file(self) -> str:
+ return os.path.join(self.root, self.name.capitalize(), 'total_proteins.txt')
+
+ @property
+ def saved_protein_name_file(self) -> str:
+ return os.path.join(self.root, self.name.capitalize(), 'saved_proteins.txt')
+
+ @property
+ def wrong_protein_name_file(self) -> str:
+ return os.path.join(self.root, self.name.capitalize(), 'wrong_proteins.txt')
+
+ @property
+ def processed_dir(self) -> str:
+ return os.path.join(self.root, self.name.capitalize(), 'processed')
+
+ @property
+ def processed_file_names(self) -> str:
+ return [p+".pt" for p in self.protein_names]
+
+ def download(self):
+ pass
+
+ def process(self):
+ # if self.replace_graph:
+ self.generate_protein_graph_evaluation()
+
+ exist_proteins = []
+ proteins = open(self.saved_protein_name_file, 'r').read().splitlines()
+ for p in proteins:
+ file = p + '.pt'
+ if os.path.exists(os.path.join(self.saved_graph_path, file)):
+ exist_proteins.append(file)
+
+ protein_num = len(exist_proteins)
+ if (not self.replace_process) and (len(os.listdir(self.processed_dir)) >= protein_num):
+ return 0
+
+ process_bar = tqdm(exist_proteins)
+ for protein in process_bar:
+ process_bar.set_description(f"Processing {protein}")
+
+ graph_data = torch.load(os.path.join(self.saved_graph_path, protein))
+ tmpseq = [one_letter[amino] for amino in graph_data.seq]
+ graph_data.seq = "".join(tmpseq)
+
+ if self.pre_filter is not None:
+ graph_data = self.pre_filter(graph_data)
+
+ if self.pre_transform is not None:
+ graph_data = self.pre_transform(graph_data)
+
+ saved_prcessed_name = os.path.join(self.processed_dir, protein)
+ torch.save(graph_data, saved_prcessed_name)
+
+ def generate_protein_graph_evaluation(self):
+ self.total_protein_names = sorted(os.listdir(self.raw_dir))
+ process_bar = tqdm(self.total_protein_names)
+ for name in process_bar:
+ process_bar.set_description(f"Processing {name}")
+ protein_dir = os.path.join(self.raw_dir, name)
+
+ if os.path.exists(os.path.join(self.saved_graph_path, name + '.pt')) or not os.path.isdir(protein_dir):
+ continue
+
+ pdb_suffix = ".pdb"
+ pdb_file = os.path.join(protein_dir, name + pdb_suffix)
+ assert os.path.exists(pdb_file), f"{pdb_file} does not exist"
+
+ rec, rec_coords, c_alpha_coords, n_coords, c_coords,seq = self.get_receptor_inference(
+ pdb_file)
+
+ rec_graph = self.get_calpha_graph(rec, c_alpha_coords, n_coords, c_coords,seq)
+ if not rec_graph:
+ self.wrong_protein_names.append(name)
+ continue
+ torch.save(rec_graph, os.path.join(self.saved_graph_path, name + '.pt'))
+
+ with open(self.total_protein_name_file, 'w') as fp:
+ for item in self.total_protein_names:
+ fp.writelines("%s\n" % item)
+ print(f"Total proteins: {self.total_protein_names}")
+
+ self.protein_names = sorted([name.split(".")[0] for name in os.listdir(self.saved_graph_path)])
+ with open(self.saved_protein_name_file, 'w') as fp:
+ for item in self.protein_names:
+ fp.writelines("%s\n" % item)
+
+ with open(self.wrong_protein_name_file, 'w') as fp:
+ for item in self.wrong_protein_names:
+ fp.writelines("%s\n" % item)
+ print(f"Wrong proteins: {self.wrong_protein_names}")
+
+ def rec_residue_featurizer(self, rec, one_hot=True, add_feature=None):
+ num_res = len([_ for _ in rec.get_residues()])
+ num_feature = 2
+ if add_feature.any():
+ num_feature += add_feature.shape[1]
+ res_feature = torch.zeros(num_res, self.num_residue_type + num_feature)
+ count = 0
+ self.sr.compute(rec, level="R")
+ for residue in rec.get_residues():
+ sasa = residue.sasa
+ for atom in residue:
+ if atom.name == 'CA':
+ bfactor = atom.bfactor
+ assert not np.isinf(bfactor)
+ assert not np.isnan(bfactor)
+ assert not np.isinf(sasa)
+ assert not np.isnan(sasa)
+
+ residx = safe_index(
+ self.allowable_features['possible_amino_acids'], residue.get_resname())
+ res_feat_1 = one_hot_res(
+ residx, num_residue_type=self.num_residue_type) if one_hot else [residx]
+ if not res_feat_1:
+ return False
+ res_feat_1.append(sasa)
+ res_feat_1.append(bfactor)
+ if num_feature > 2:
+ res_feat_1.extend(list(add_feature[count, :]))
+ res_feature[count, :] = torch.tensor(
+ res_feat_1, dtype=torch.float32)
+ count += 1
+
+ for k in range(self.num_residue_type, self.num_residue_type + 2):
+ mean = res_feature[:, k].mean()
+ std = res_feature[:, k].std()
+ res_feature[:, k] = (res_feature[:, k] - mean) / (std + 0.000000001)
+ return res_feature
+
+ def get_node_features(self, n_coords, c_coords, c_alpha_coords, coord_mask, with_coord_mask=True, use_angle=False, use_omega=False):
+ num_res = n_coords.shape[0]
+ if use_omega:
+ num_angle_type = 3
+ angles = np.zeros((num_res, num_angle_type))
+ for i in range(num_res-1):
+ # These angles are called φ (phi) which involves the backbone atoms C-N-Cα-C
+ angles[i, 0] = dihedral(c_coords[i], n_coords[i], c_alpha_coords[i], n_coords[i+1])
+ # psi involves the backbone atoms N-Cα-C-N.
+ angles[i, 1] = dihedral(n_coords[i], c_alpha_coords[i], c_coords[i], n_coords[i+1])
+ angles[i, 2] = dihedral(c_alpha_coords[i], c_coords[i], n_coords[i+1], c_alpha_coords[i+1])
+ else:
+ num_angle_type = 2
+ angles = np.zeros((num_res, num_angle_type))
+ for i in range(num_res-1):
+ # These angles are called φ (phi) which involves the backbone atoms C-N-Cα-C
+ angles[i, 0] = dihedral(c_coords[i], n_coords[i], c_alpha_coords[i], n_coords[i+1])
+ # psi involves the backbone atoms N-Cα-C-N.
+ angles[i, 1] = dihedral(n_coords[i], c_alpha_coords[i], c_coords[i], n_coords[i+1])
+ if use_angle:
+ node_scalar_features = angles
+ else:
+ node_scalar_features = np.zeros((num_res, num_angle_type*2))
+ for i in range(num_angle_type):
+ node_scalar_features[:, 2*i] = np.sin(angles[:, i])
+ node_scalar_features[:, 2*i + 1] = np.cos(angles[:, i])
+
+ if with_coord_mask:
+ node_scalar_features = torch.cat([
+ node_scalar_features,
+ coord_mask.float().unsqueeze(-1)
+ ], dim=-1)
+ node_vector_features = None
+ return node_scalar_features, node_vector_features
+
+ def get_calpha_graph(self, rec, c_alpha_coords, n_coords, c_coords,seq):
+ scalar_feature, vec_feature = self.get_node_features(
+ n_coords, c_coords, c_alpha_coords, coord_mask=None,
+ with_coord_mask=False, use_angle=self.use_angle, use_omega=self.use_omega
+ )
+ # Extract 3D coordinates and n_i,u_i,v_i
+ # vectors of representative residues ################
+ residue_representatives_loc_list = []
+ n_i_list = []
+ u_i_list = []
+ v_i_list = []
+ for i, residue in enumerate(rec.get_residues()):
+ n_coord = n_coords[i]
+ c_alpha_coord = c_alpha_coords[i]
+ c_coord = c_coords[i]
+ u_i = (n_coord - c_alpha_coord) / np.linalg.norm(n_coord - c_alpha_coord)
+ t_i = (c_coord - c_alpha_coord) / np.linalg.norm(c_coord - c_alpha_coord)
+ n_i = np.cross(u_i, t_i) / np.linalg.norm(np.cross(u_i, t_i)) # main chain
+ v_i = np.cross(n_i, u_i)
+ assert (math.fabs(np.linalg.norm(v_i) - 1.) < 1e-5), "protein utils protein_to_graph_dips, v_i norm larger than 1"
+ n_i_list.append(n_i)
+ u_i_list.append(u_i)
+ v_i_list.append(v_i)
+ residue_representatives_loc_list.append(c_alpha_coord)
+
+ # (N_res, 3)
+ residue_representatives_loc_feat = np.stack(residue_representatives_loc_list, axis=0)
+
+ n_i_feat = np.stack(n_i_list, axis=0)
+ u_i_feat = np.stack(u_i_list, axis=0)
+ v_i_feat = np.stack(v_i_list, axis=0)
+ num_residues = len(c_alpha_coords)
+ if num_residues <= 1:
+ raise ValueError(f"rec contains only 1 residue!")
+
+ ################### Build the k-NN graph ##############################
+ assert num_residues == residue_representatives_loc_feat.shape[0]
+ assert residue_representatives_loc_feat.shape[1] == 3
+ distances = spa.distance.cdist(c_alpha_coords, c_alpha_coords)
+
+ src_list = []
+ dst_list = []
+ dist_list = []
+ mean_norm_list = []
+ for i in range(num_residues):
+ dst = list(np.where(distances[i, :] < self.cutoff)[0])
+ dst.remove(i)
+ if self.c_alpha_max_neighbors != None and len(dst) > self.c_alpha_max_neighbors:
+ dst = list(np.argsort(distances[i, :]))[1: self.c_alpha_max_neighbors + 1]
+ if len(dst) == 0:
+ # choose second because first is i itself
+ dst = list(np.argsort(distances[i, :]))[1:2]
+ log(f'The c_alpha_cutoff {self.cutoff} was too small for one c_alpha such that it had no neighbors. So we connected it to the closest other c_alpha')
+ assert i not in dst
+
+ src = [i] * len(dst)
+ src_list.extend(src)
+ dst_list.extend(dst)
+ valid_dist = list(distances[i, dst])
+ dist_list.extend(valid_dist)
+ valid_dist_np = distances[i, dst]
+
+ sigma = np.array([1., 2., 5., 10., 30.]).reshape((-1, 1))
+ # (sigma_num, neigh_num)
+ weights = softmax(-valid_dist_np.reshape((1, -1)) ** 2 / sigma, axis=1)
+ # print(weights)
+ assert weights[0].sum() > 1 - 1e-2 and weights[0].sum() < 1.01
+ # (neigh_num, 3)
+ diff_vecs = residue_representatives_loc_feat[src, :] - residue_representatives_loc_feat[dst, :]
+ # (sigma_num, 3)
+ mean_vec = weights.dot(diff_vecs)
+ # (sigma_num,)
+ denominator = weights.dot(np.linalg.norm(diff_vecs, axis=1))
+ # (sigma_num,)
+ mean_vec_ratio_norm = np.linalg.norm(mean_vec, axis=1) / denominator
+ mean_norm_list.append(mean_vec_ratio_norm)
+
+ assert len(src_list) == len(dst_list)
+ assert len(dist_list) == len(dst_list)
+
+ residue_representatives_loc_feat = torch.from_numpy(residue_representatives_loc_feat.astype(np.float32))
+ x = self.rec_residue_featurizer(rec, one_hot=True, add_feature=scalar_feature)
+
+ if isinstance(x, bool) and (not x):
+ return False
+
+ graph = Data(
+ x=x,
+ pos=residue_representatives_loc_feat,
+ edge_attr=self.get_edge_features(src_list, dst_list, dist_list, divisor=4),
+ edge_index=torch.tensor([src_list, dst_list]),
+ edge_dist=torch.tensor(dist_list),
+ distances=torch.tensor(distances),
+ mu_r_norm=torch.from_numpy(np.array(mean_norm_list).astype(np.float32)),
+ seq=seq
+ )
+
+ # Loop over all edges of the graph and build the various p_ij, q_ij, k_ij, t_ij pairs
+ edge_feat_ori_list = []
+ for i in range(len(dist_list)):
+ src = src_list[i]
+ dst = dst_list[i]
+ # place n_i, u_i, v_i as lines in a 3x3 basis matrix
+ basis_matrix = np.stack((n_i_feat[dst, :], u_i_feat[dst, :], v_i_feat[dst, :]), axis=0)
+ p_ij = np.matmul(
+ basis_matrix,
+ residue_representatives_loc_feat[src, :] - residue_representatives_loc_feat[dst, :]
+ )
+ q_ij = np.matmul(basis_matrix, n_i_feat[src, :]) # shape (3,)
+ k_ij = np.matmul(basis_matrix, u_i_feat[src, :])
+ t_ij = np.matmul(basis_matrix, v_i_feat[src, :])
+ s_ij = np.concatenate((p_ij, q_ij, k_ij, t_ij), axis=0) # shape (12,)
+ edge_feat_ori_list.append(s_ij)
+
+ edge_feat_ori_feat = np.stack(edge_feat_ori_list, axis=0) # shape (num_edges, 4, 3)
+ edge_feat_ori_feat = torch.from_numpy(edge_feat_ori_feat.astype(np.float32))
+
+ graph.edge_attr = torch.cat([graph.edge_attr, edge_feat_ori_feat], axis=1) # (num_edges, 17)
+ #graph = self.remove_node(graph, graph.x.shape[0]-1)
+ # self.get_calpha_graph_single(graph, 6)
+ return graph
+
+ def remove_node(self, graph, node_idx):
+ new_graph = Data.clone(graph)
+ # delete node
+ new_graph.x = torch.cat(
+ [new_graph.x[:node_idx, :], new_graph.x[node_idx+1:, :]])
+ new_graph.pos = torch.cat(
+ [new_graph.pos[:node_idx, :], new_graph.pos[node_idx+1:, :]])
+ new_graph.mu_r_norm = torch.cat(
+ [new_graph.mu_r_norm[:node_idx, :], new_graph.mu_r_norm[node_idx+1:, :]])
+
+ # delete edge
+ keep_edge = (torch.sum(new_graph.edge_index == node_idx, dim=0) == 0)
+ new_graph.edge_index = new_graph.edge_index[:, keep_edge]
+ new_graph.edge_attr = new_graph.edge_attr[keep_edge, :]
+ return new_graph
+
+ def get_edge_features(self, src_list, dst_list, dist_list, divisor=4):
+ seq_edge = torch.absolute(torch.tensor(src_list) - torch.tensor(dst_list)).reshape(-1, 1)
+ seq_edge = torch.where(seq_edge > self.seq_dist_cut, self.seq_dist_cut, seq_edge)
+ seq_edge = F.one_hot(seq_edge, num_classes=self.seq_dist_cut + 1).reshape((-1, self.seq_dist_cut + 1))
+
+ contact_sig = torch.where(torch.tensor(dist_list) <= 8, 1, 0).reshape(-1, 1)
+ # avg distance = 7. So divisor = (4/7)*7 = 4
+ dist_fea = self.distance_featurizer(dist_list, divisor=divisor)
+
+ return torch.concat([seq_edge, dist_fea, contact_sig], dim=-1)
+
+ def get_receptor_inference(self, rec_path):
+ with warnings.catch_warnings():
+ warnings.filterwarnings("ignore", category=PDBConstructionWarning)
+ structure = self.biopython_parser.get_structure('random_id', rec_path)
+ rec = structure[0]
+ coords = []
+ c_alpha_coords = []
+ n_coords = []
+ c_coords = []
+ valid_chain_ids = []
+ lengths = []
+ seq = []
+ for i, chain in enumerate(rec):
+ chain_coords = [] # num_residues, num_atoms, 3
+ chain_c_alpha_coords = []
+ chain_n_coords = []
+ chain_c_coords = []
+ count = 0
+ invalid_res_ids = []
+ for res_idx, residue in enumerate(chain):
+ if residue.get_resname() == 'HOH':
+ invalid_res_ids.append(residue.get_id())
+ continue
+ residue_coords = []
+ c_alpha, n, c = None, None, None
+ for atom in residue:
+ if atom.name == 'CA':
+ c_alpha = list(atom.get_vector())
+ seq.append(str(residue).split(" ")[1])
+ if atom.name == 'N':
+ n = list(atom.get_vector())
+ if atom.name == 'C':
+ c = list(atom.get_vector())
+ residue_coords.append(list(atom.get_vector()))
+ # only append residue if it is an amino acid and not some weired molecule that is part of the complex
+ if c_alpha != None and n != None and c != None:
+ chain_c_alpha_coords.append(c_alpha)
+ chain_n_coords.append(n)
+ chain_c_coords.append(c)
+ chain_coords.append(np.array(residue_coords))
+ count += 1
+ else:
+ invalid_res_ids.append(residue.get_id())
+ for res_id in invalid_res_ids:
+ chain.detach_child(res_id)
+ lengths.append(count)
+ coords.append(chain_coords)
+ c_alpha_coords.append(np.array(chain_c_alpha_coords))
+ n_coords.append(np.array(chain_n_coords))
+ c_coords.append(np.array(chain_c_coords))
+ if len(chain_coords) > 0:
+ valid_chain_ids.append(chain.get_id())
+ valid_coords = []
+ valid_c_alpha_coords = []
+ valid_n_coords = []
+ valid_c_coords = []
+ valid_lengths = []
+ invalid_chain_ids = []
+ for i, chain in enumerate(rec):
+ if chain.get_id() in valid_chain_ids:
+ valid_coords.append(coords[i])
+ valid_c_alpha_coords.append(c_alpha_coords[i])
+ valid_n_coords.append(n_coords[i])
+ valid_c_coords.append(c_coords[i])
+ valid_lengths.append(lengths[i])
+ else:
+ invalid_chain_ids.append(chain.get_id())
+ # list with n_residues arrays: [n_atoms, 3]
+ coords = [item for sublist in valid_coords for item in sublist]
+
+ c_alpha_coords = np.concatenate(valid_c_alpha_coords, axis=0) # [n_residues, 3]
+ n_coords = np.concatenate(valid_n_coords, axis=0) # [n_residues, 3]
+ c_coords = np.concatenate(valid_c_coords, axis=0) # [n_residues, 3]
+
+ for invalid_id in invalid_chain_ids:
+ rec.detach_child(invalid_id)
+
+ assert len(c_alpha_coords) == len(n_coords)
+ assert len(c_alpha_coords) == len(c_coords)
+ assert sum(valid_lengths) == len(c_alpha_coords)
+ return rec, coords, c_alpha_coords, n_coords, c_coords,seq
+
+ def len(self):
+ return len(os.listdir(self.saved_graph_path))
+
+ def get_statistic_info(self):
+ node_num = torch.zeros(self.length_total)
+ edge_num = torch.zeros(self.length_total)
+ for i in tqdm(range(self.length_total)):
+ graph = self.get(i)
+ node_num[i] = graph.x.shape[0]
+ edge_num[i] = graph.edge_index.shape[1]
+ # if i == 1000:
+ # break
+ num_node_min = torch.min(node_num)
+ num_node_max = torch.max(node_num)
+ num_node_avg = torch.mean(node_num)
+ num_edge_min = torch.min(edge_num)
+ num_edge_max = torch.max(edge_num)
+ num_edge_avg = torch.mean(edge_num)
+ print(f'Graph Num: {self.length_total}')
+ print(
+ f'Min Nodes: {num_node_min:.2f} Max Nodes: {num_node_max:.2f}. Avg Nodes: {num_node_avg:.2f}')
+ print(
+ f'Min Edges: {num_edge_min:.2f} Max Edges: {num_edge_max:.2f}. Avg Edges: {num_edge_avg:.2f}')
+
+ def get(self, idx):
+ protein_name = self.protein_names[idx]
+ data = torch.load(os.path.join(self.processed_dir, f'{protein_name}.pt'))
+ notes_number = list((data.x[:, :20].argmax(dim=1)).size())[0]
+ data.y = torch.argmax(data.x[torch.tensor(range(notes_number)), :self.num_residue_type], dim=1)
+ data.protein_name = protein_name
+ return data
+
+ def find_idx(self, idx_protein, amino_idx):
+ idx = (self.distances[idx_protein][:-1, amino_idx]
+ < self.micro_radius).nonzero(as_tuple=True)[0]
+ return idx
+
+ def get_calpha_graph_single(self, graph, idx_protein, amino_idx):
+ choosen_amino_idx = self.find_idx(idx_protein, amino_idx)
+ keep_edge_index = []
+
+ for edge_idx in range(graph.num_edges):
+ edge = graph.edge_index.t()[edge_idx]
+ if (edge[0] in choosen_amino_idx) and (edge[1] in choosen_amino_idx):
+ keep_edge_index.append(edge_idx)
+
+ graph1 = Data(
+ x=graph.x[choosen_amino_idx, :],
+ pos=graph.pos[choosen_amino_idx, :],
+ edge_index=graph.edge_index[:, keep_edge_index],
+ edge_attr=graph.edge_attr[keep_edge_index, :],
+ mu_r_norm=graph.mu_r_norm[choosen_amino_idx, :]
+ )
+ return graph1
+
+ def __repr__(self) -> str:
+ return f'{self.__class__.__name__}{self.name.capitalize()}()'
+
+ def distance_featurizer(self, dist_list, divisor) -> torch.Tensor:
+ # you want to use a divisor that is close to 4/7 times the average distance that you want to encode
+ length_scale_list = [1.5 ** x for x in range(15)]
+ center_list = [0. for _ in range(15)]
+
+ num_edge = len(dist_list)
+ dist_list = np.array(dist_list)
+
+ transformed_dist = [np.exp(- ((dist_list / divisor) ** 2) / float(length_scale))
+ for length_scale, center in zip(length_scale_list, center_list)]
+
+ transformed_dist = np.array(transformed_dist).T
+ transformed_dist = transformed_dist.reshape((num_edge, -1))
+ return torch.from_numpy(transformed_dist.astype(np.float32))
diff --git a/proteingym/baselines/protssn/src/models.py b/proteingym/baselines/protssn/src/models.py
new file mode 100644
index 0000000..a60ee52
--- /dev/null
+++ b/proteingym/baselines/protssn/src/models.py
@@ -0,0 +1,127 @@
+import torch
+import random
+import gc
+import torch.nn as nn
+from torch_geometric.data import Batch, Dataset
+from transformers import AutoTokenizer, EsmModel
+from typing import *
+from src.module.egnn.network import EGNN
+from src.module.gcn.network import GCN
+from src.module.gat.network import GAT
+
+class PLM_model(nn.Module):
+ possible_amino_acids = [
+ 'ALA', 'ARG', 'ASN', 'ASP', 'CYS', 'GLN', 'GLU', 'GLY', 'HIS', 'ILE', 'LEU', 'LYS', 'MET',
+ 'PHE', 'PRO', 'SER', 'THR', 'TRP', 'TYR', 'VAL'
+ ]
+ one_letter = {
+ 'VAL':'V', 'ILE':'I', 'LEU':'L', 'GLU':'E', 'GLN':'Q',
+ 'ASP':'D', 'ASN':'N', 'HIS':'H', 'TRP':'W', 'PHE':'F', 'TYR':'Y',
+ 'ARG':'R', 'LYS':'K', 'SER':'S', 'THR':'T', 'MET':'M', 'ALA':'A',
+ 'GLY':'G', 'PRO':'P', 'CYS':'C'
+ }
+
+ def __init__(self, args):
+ super().__init__()
+ # load global config
+ self.args = args
+
+ # esm on the first cuda
+ self.tokenizer = AutoTokenizer.from_pretrained(self.args.plm)
+ self.model = EsmModel.from_pretrained(self.args.plm).cuda()
+
+
+ def forward(self, batch):
+ with torch.no_grad():
+ if not isinstance(batch, List):
+ batch = [batch]
+ # get the target sequence
+ one_hot_seqs = [list(elem.x[:,:20].argmax(1)) for elem in batch]
+ muted_res_seqs = ["".join([self.one_letter[self.possible_amino_acids[idx]] for idx in seq_idx]) for seq_idx in one_hot_seqs]
+ one_hot_truth_seqs = [elem.y for elem in batch]
+ truth_res_seqs = ["".join([self.one_letter[self.possible_amino_acids[idx]] for idx in seq_idx]) for seq_idx in one_hot_truth_seqs]
+
+ if not hasattr(self.args, "noise_type"):
+ input_seqs = truth_res_seqs
+ elif self.args.noise_type == 'mask':
+ input_seqs = self._mask_input_sequence(truth_res_seqs)
+ elif self.args.noise_type == 'mut':
+ input_seqs = muted_res_seqs
+ else:
+ raise ValueError(f"No implement of {self.args.noise_type}")
+
+ batch_graph = self._nlp_inference(input_seqs, batch)
+ return batch_graph
+
+ @torch.no_grad()
+ def _mask_input_sequence(self, truth_res_seqs):
+ input_seqs = []
+ self.mask_ratio = self.args.noise_ratio
+ for truth_seq in truth_res_seqs:
+ masked_seq = ""
+ for truth_token in truth_seq:
+ pattern = torch.multinomial(torch.tensor([1 - self.args.noise_ratio,
+ self.mask_ratio*0.8,
+ self.mask_ratio*0.1,
+ self.mask_ratio*0.1]),
+ num_samples=1,
+ replacement=True)
+ # 80% of the time, we replace masked input tokens with mask_token ([MASK])
+ if pattern == 1:
+ masked_seq += ''
+ # 10% of the time, we replace masked input tokens with random word
+ elif pattern == 2:
+ masked_seq += random.sample(list(self.one_letter.values()), 1)[0]
+ # The rest of the time (10% of the time) we keep the masked input tokens unchanged
+ else:
+ masked_seq += truth_token
+ input_seqs.append(masked_seq)
+ return input_seqs
+
+
+ @torch.no_grad()
+ def _nlp_inference(self, input_seqs, batch):
+ inputs = self.tokenizer(input_seqs, return_tensors="pt", padding=True).to("cuda:0")
+ batch_lens = (inputs["attention_mask"] == 1).sum(1) - 2
+ outputs = self.model(**inputs)
+ last_hidden_states = outputs.last_hidden_state
+ for idx, (hidden_state, seq_len) in enumerate(zip(last_hidden_states, batch_lens)):
+ batch[idx].esm_rep = hidden_state[1: 1+seq_len]
+ del batch[idx].seq
+
+ # move to the GNN devices
+ batch = [elem.cuda() for elem in batch]
+ batch_graph = Batch.from_data_list(batch)
+ gc.collect()
+ torch.cuda.empty_cache()
+ return batch_graph
+
+
+
+class GNN_model(nn.Module):
+ def __init__(self, args):
+ super().__init__()
+ # load graph network config which usually not change
+ self.gnn_config = args.gnn_config
+ # load global config
+ self.args = args
+
+ # calculate input dim according to the input feature
+ self.out_dim = 20
+ self.input_dim = self.args.plm_hidden_size
+
+ # gnn on the rest cudas
+ if "egnn" == self.args.gnn:
+ self.GNN_model = EGNN(self.gnn_config, self.args, self.input_dim, self.out_dim)
+ elif "gcn" == self.args.gnn:
+ self.GNN_model = GCN(self.gnn_config, self.input_dim, self.out_dim)
+ elif "gat" == self.args.gnn:
+ self.GNN_model = GAT(self.gnn_config,self.input_dim, self.out_dim)
+ else:
+ raise KeyError(f"No implement of {self.opt['gnn']}")
+ self.GNN_model = self.GNN_model.cuda()
+
+ def forward(self, batch_graph):
+ gnn_out = self.GNN_model(batch_graph)
+ return gnn_out
+
diff --git a/proteingym/baselines/protssn/src/module/egnn/__init__.py b/proteingym/baselines/protssn/src/module/egnn/__init__.py
new file mode 100644
index 0000000..af6f786
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/egnn/__init__.py
@@ -0,0 +1,2 @@
+from .egnn_pytorch import EGNN, EGNN_Network
+from .egnn_pytorch_geometric import EGNN_Sparse, EGNN_Sparse_Network
diff --git a/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch.py b/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch.py
new file mode 100644
index 0000000..5376a12
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch.py
@@ -0,0 +1,454 @@
+import torch
+from torch import nn, einsum, broadcast_tensors
+import torch.nn.functional as F
+
+from einops import rearrange, repeat
+from einops.layers.torch import Rearrange
+
+# helper functions
+
+def exists(val):
+ return val is not None
+
+def safe_div(num, den, eps = 1e-8):
+ res = num.div(den.clamp(min = eps))
+ res.masked_fill_(den == 0, 0.)
+ return res
+
+def batched_index_select(values, indices, dim = 1):
+ value_dims = values.shape[(dim + 1):]
+ values_shape, indices_shape = map(lambda t: list(t.shape), (values, indices))
+ indices = indices[(..., *((None,) * len(value_dims)))]
+ indices = indices.expand(*((-1,) * len(indices_shape)), *value_dims)
+ value_expand_len = len(indices_shape) - (dim + 1)
+ values = values[(*((slice(None),) * dim), *((None,) * value_expand_len), ...)]
+
+ value_expand_shape = [-1] * len(values.shape)
+ expand_slice = slice(dim, (dim + value_expand_len))
+ value_expand_shape[expand_slice] = indices.shape[expand_slice]
+ values = values.expand(*value_expand_shape)
+
+ dim += value_expand_len
+ return values.gather(dim, indices)
+
+def fourier_encode_dist(x, num_encodings = 4, include_self = True):
+ x = x.unsqueeze(-1)
+ device, dtype, orig_x = x.device, x.dtype, x
+ scales = 2 ** torch.arange(num_encodings, device = device, dtype = dtype)
+ x = x / scales
+ x = torch.cat([x.sin(), x.cos()], dim=-1)
+ x = torch.cat((x, orig_x), dim = -1) if include_self else x
+ return x
+
+def embedd_token(x, dims, layers):
+ stop_concat = -len(dims)
+ to_embedd = x[:, stop_concat:].long()
+ for i,emb_layer in enumerate(layers):
+ # the portion corresponding to `to_embedd` part gets dropped
+ x = torch.cat([ x[:, :stop_concat],
+ emb_layer( to_embedd[:, i] )
+ ], dim=-1)
+ stop_concat = x.shape[-1]
+ return x
+
+# swish activation fallback
+
+class Swish_(nn.Module):
+ def forward(self, x):
+ return x * x.sigmoid()
+
+SiLU = nn.SiLU if hasattr(nn, 'SiLU') else Swish_
+
+# helper classes
+
+# this follows the same strategy for normalization as done in SE3 Transformers
+# https://github.com/lucidrains/se3-transformer-pytorch/blob/main/se3_transformer_pytorch/se3_transformer_pytorch.py#L95
+
+class CoorsNorm(nn.Module):
+ def __init__(self, eps = 1e-8, scale_init = 1.):
+ super().__init__()
+ self.eps = eps
+ scale = torch.zeros(1).fill_(scale_init)
+ self.scale = nn.Parameter(scale)
+
+ def forward(self, coors):
+ norm = coors.norm(dim = -1, keepdim = True)
+ normed_coors = coors / norm.clamp(min = self.eps)
+ return normed_coors * self.scale
+
+# global linear attention
+
+class Attention(nn.Module):
+ def __init__(self, dim, heads = 8, dim_head = 64):
+ super().__init__()
+ inner_dim = heads * dim_head
+ self.heads = heads
+ self.scale = dim_head ** -0.5
+
+ self.to_q = nn.Linear(dim, inner_dim, bias = False)
+ self.to_kv = nn.Linear(dim, inner_dim * 2, bias = False)
+ self.to_out = nn.Linear(inner_dim, dim)
+
+ def forward(self, x, context, mask = None):
+ h = self.heads
+
+ q = self.to_q(x)
+ kv = self.to_kv(context).chunk(2, dim = -1)
+
+ q, k, v = map(lambda t: rearrange(t, 'b n (h d) -> b h n d', h = h), (q, *kv))
+ dots = einsum('b h i d, b h j d -> b h i j', q, k) * self.scale
+
+ if exists(mask):
+ mask_value = -torch.finfo(dots.dtype).max
+ mask = rearrange(mask, 'b n -> b () () n')
+ dots.masked_fill_(~mask, mask_value)
+
+ attn = dots.softmax(dim = -1)
+ out = einsum('b h i j, b h j d -> b h i d', attn, v)
+
+ out = rearrange(out, 'b h n d -> b n (h d)', h = h)
+ return self.to_out(out)
+
+class GlobalLinearAttention(nn.Module):
+ def __init__(
+ self,
+ *,
+ dim,
+ heads = 8,
+ dim_head = 64
+ ):
+ super().__init__()
+ self.norm_seq = nn.LayerNorm(dim)
+ self.norm_queries = nn.LayerNorm(dim)
+ self.attn1 = Attention(dim, heads, dim_head)
+ self.attn2 = Attention(dim, heads, dim_head)
+
+ self.ff = nn.Sequential(
+ nn.LayerNorm(dim),
+ nn.Linear(dim, dim * 4),
+ nn.GELU(),
+ nn.Linear(dim * 4, dim)
+ )
+
+ def forward(self, x, queries, mask = None):
+ res_x, res_queries = x, queries
+ x, queries = self.norm_seq(x), self.norm_queries(queries)
+
+ induced = self.attn1(queries, x, mask = mask)
+ out = self.attn2(x, induced)
+
+ x = out + res_x
+ queries = induced + res_queries
+
+ x = self.ff(x) + x
+ return x, queries
+
+# classes
+
+class EGNN(nn.Module):
+ def __init__(
+ self,
+ dim,
+ edge_dim = 0,
+ m_dim = 16,
+ fourier_features = 0,
+ num_nearest_neighbors = 0,
+ dropout = 0.0,
+ init_eps = 1e-3,
+ norm_feats = False,
+ norm_coors = False,
+ norm_coors_scale_init = 1e-2,
+ update_feats = True,
+ update_coors = True,
+ only_sparse_neighbors = False,
+ valid_radius = float('inf'),
+ m_pool_method = 'sum',
+ soft_edges = False,
+ coor_weights_clamp_value = None
+ ):
+ super().__init__()
+ assert m_pool_method in {'sum', 'mean'}, 'pool method must be either sum or mean'
+ assert update_feats or update_coors, 'you must update either features, coordinates, or both'
+
+ self.fourier_features = fourier_features
+
+ edge_input_dim = (fourier_features * 2) + (dim * 2) + edge_dim + 1
+ dropout = nn.Dropout(dropout) if dropout > 0 else nn.Identity()
+
+ self.edge_mlp = nn.Sequential(
+ nn.Linear(edge_input_dim, edge_input_dim * 2),
+ dropout,
+ SiLU(),
+ nn.Linear(edge_input_dim * 2, m_dim),
+ SiLU()
+ )
+
+ self.edge_gate = nn.Sequential(
+ nn.Linear(m_dim, 1),
+ nn.Sigmoid()
+ ) if soft_edges else None
+
+ self.node_norm = nn.LayerNorm(dim) if norm_feats else nn.Identity()
+ self.coors_norm = CoorsNorm(scale_init = norm_coors_scale_init) if norm_coors else nn.Identity()
+
+ self.m_pool_method = m_pool_method
+
+ self.node_mlp = nn.Sequential(
+ nn.Linear(dim + m_dim, dim * 2),
+ dropout,
+ SiLU(),
+ nn.Linear(dim * 2, dim),
+ ) if update_feats else None
+
+ self.coors_mlp = nn.Sequential(
+ nn.Linear(m_dim, m_dim * 4),
+ dropout,
+ SiLU(),
+ nn.Linear(m_dim * 4, 1)
+ ) if update_coors else None
+
+ self.num_nearest_neighbors = num_nearest_neighbors
+ self.only_sparse_neighbors = only_sparse_neighbors
+ self.valid_radius = valid_radius
+
+ self.coor_weights_clamp_value = coor_weights_clamp_value
+
+ self.init_eps = init_eps
+ self.apply(self.init_)
+
+ def init_(self, module):
+ if type(module) in {nn.Linear}:
+ # seems to be needed to keep the network from exploding to NaN with greater depths
+ nn.init.normal_(module.weight, std = self.init_eps)
+
+ def forward(self, feats, coors, edges = None, mask = None, adj_mat = None):
+ b, n, d, device, fourier_features, num_nearest, valid_radius, only_sparse_neighbors = *feats.shape, feats.device, self.fourier_features, self.num_nearest_neighbors, self.valid_radius, self.only_sparse_neighbors
+
+ if exists(mask):
+ num_nodes = mask.sum(dim = -1)
+
+ use_nearest = num_nearest > 0 or only_sparse_neighbors
+
+ rel_coors = rearrange(coors, 'b i d -> b i () d') - rearrange(coors, 'b j d -> b () j d')
+ rel_dist = (rel_coors ** 2).sum(dim = -1, keepdim = True)
+
+ i = j = n
+
+ if use_nearest:
+ ranking = rel_dist[..., 0].clone()
+
+ if exists(mask):
+ rank_mask = mask[:, :, None] * mask[:, None, :]
+ ranking.masked_fill_(~rank_mask, 1e5)
+
+ if exists(adj_mat):
+ if len(adj_mat.shape) == 2:
+ adj_mat = repeat(adj_mat.clone(), 'i j -> b i j', b = b)
+
+ if only_sparse_neighbors:
+ num_nearest = int(adj_mat.float().sum(dim = -1).max().item())
+ valid_radius = 0
+
+ self_mask = rearrange(torch.eye(n, device = device, dtype = torch.bool), 'i j -> () i j')
+
+ adj_mat = adj_mat.masked_fill(self_mask, False)
+ ranking.masked_fill_(self_mask, -1.)
+ ranking.masked_fill_(adj_mat, 0.)
+
+ nbhd_ranking, nbhd_indices = ranking.topk(num_nearest, dim = -1, largest = False)
+
+ nbhd_mask = nbhd_ranking <= valid_radius
+
+ rel_coors = batched_index_select(rel_coors, nbhd_indices, dim = 2)
+ rel_dist = batched_index_select(rel_dist, nbhd_indices, dim = 2)
+
+ if exists(edges):
+ edges = batched_index_select(edges, nbhd_indices, dim = 2)
+
+ j = num_nearest
+
+ if fourier_features > 0:
+ rel_dist = fourier_encode_dist(rel_dist, num_encodings = fourier_features)
+ rel_dist = rearrange(rel_dist, 'b i j () d -> b i j d')
+
+ if use_nearest:
+ feats_j = batched_index_select(feats, nbhd_indices, dim = 1)
+ else:
+ feats_j = rearrange(feats, 'b j d -> b () j d')
+
+ feats_i = rearrange(feats, 'b i d -> b i () d')
+ feats_i, feats_j = broadcast_tensors(feats_i, feats_j)
+
+ edge_input = torch.cat((feats_i, feats_j, rel_dist), dim = -1)
+
+ if exists(edges):
+ edge_input = torch.cat((edge_input, edges), dim = -1)
+
+ m_ij = self.edge_mlp(edge_input)
+
+ if exists(self.edge_gate):
+ m_ij = m_ij * self.edge_gate(m_ij)
+
+ if exists(mask):
+ mask_i = rearrange(mask, 'b i -> b i ()')
+
+ if use_nearest:
+ mask_j = batched_index_select(mask, nbhd_indices, dim = 1)
+ mask = (mask_i * mask_j) & nbhd_mask
+ else:
+ mask_j = rearrange(mask, 'b j -> b () j')
+ mask = mask_i * mask_j
+
+ if exists(self.coors_mlp):
+ coor_weights = self.coors_mlp(m_ij)
+ coor_weights = rearrange(coor_weights, 'b i j () -> b i j')
+
+ rel_coors = self.coors_norm(rel_coors)
+
+ if exists(mask):
+ coor_weights.masked_fill_(~mask, 0.)
+
+ if exists(self.coor_weights_clamp_value):
+ clamp_value = self.coor_weights_clamp_value
+ coor_weights.clamp_(min = -clamp_value, max = clamp_value)
+
+ coors_out = einsum('b i j, b i j c -> b i c', coor_weights, rel_coors) + coors
+ else:
+ coors_out = coors
+
+ if exists(self.node_mlp):
+ if exists(mask):
+ m_ij_mask = rearrange(mask, '... -> ... ()')
+ m_ij = m_ij.masked_fill(~m_ij_mask, 0.)
+
+ if self.m_pool_method == 'mean':
+ if exists(mask):
+ # masked mean
+ mask_sum = m_ij_mask.sum(dim = -2)
+ m_i = safe_div(m_ij.sum(dim = -2), mask_sum)
+ else:
+ m_i = m_ij.mean(dim = -2)
+
+ elif self.m_pool_method == 'sum':
+ m_i = m_ij.sum(dim = -2)
+
+ normed_feats = self.node_norm(feats)
+ node_mlp_input = torch.cat((normed_feats, m_i), dim = -1)
+ node_out = self.node_mlp(node_mlp_input) + feats
+ else:
+ node_out = feats
+
+ return node_out, coors_out
+
+class EGNN_Network(nn.Module):
+ def __init__(
+ self,
+ *,
+ depth,
+ dim,
+ num_tokens = None,
+ num_edge_tokens = None,
+ num_positions = None,
+ edge_dim = 0,
+ num_adj_degrees = None,
+ adj_dim = 0,
+ global_linear_attn_every = 0,
+ global_linear_attn_heads = 8,
+ global_linear_attn_dim_head = 64,
+ num_global_tokens = 4,
+ **kwargs
+ ):
+ super().__init__()
+ assert not (exists(num_adj_degrees) and num_adj_degrees < 1), 'make sure adjacent degrees is greater than 1'
+ self.num_positions = num_positions
+
+ self.token_emb = nn.Embedding(num_tokens, dim) if exists(num_tokens) else None
+ self.pos_emb = nn.Embedding(num_positions, dim) if exists(num_positions) else None
+ self.edge_emb = nn.Embedding(num_edge_tokens, edge_dim) if exists(num_edge_tokens) else None
+ self.has_edges = edge_dim > 0
+
+ self.num_adj_degrees = num_adj_degrees
+ self.adj_emb = nn.Embedding(num_adj_degrees + 1, adj_dim) if exists(num_adj_degrees) and adj_dim > 0 else None
+
+ edge_dim = edge_dim if self.has_edges else 0
+ adj_dim = adj_dim if exists(num_adj_degrees) else 0
+
+ has_global_attn = global_linear_attn_every > 0
+ self.global_tokens = None
+ if has_global_attn:
+ self.global_tokens = nn.Parameter(torch.randn(num_global_tokens, dim))
+
+ self.layers = nn.ModuleList([])
+ for ind in range(depth):
+ is_global_layer = has_global_attn and (ind % global_linear_attn_every) == 0
+
+ self.layers.append(nn.ModuleList([
+ GlobalLinearAttention(dim = dim, heads = global_linear_attn_heads, dim_head = global_linear_attn_dim_head) if is_global_layer else None,
+ EGNN(dim = dim, edge_dim = (edge_dim + adj_dim), norm_feats = True, **kwargs),
+ ]))
+
+ def forward(
+ self,
+ feats,
+ coors,
+ adj_mat = None,
+ edges = None,
+ mask = None,
+ return_coor_changes = False
+ ):
+ b, device = feats.shape[0], feats.device
+
+ if exists(self.token_emb):
+ feats = self.token_emb(feats)
+
+ if exists(self.pos_emb):
+ n = feats.shape[1]
+ assert n <= self.num_positions, f'given sequence length {n} must be less than the number of positions {self.num_positions} set at init'
+ pos_emb = self.pos_emb(torch.arange(n, device = device))
+ feats += rearrange(pos_emb, 'n d -> () n d')
+
+ if exists(edges) and exists(self.edge_emb):
+ edges = self.edge_emb(edges)
+
+ # create N-degrees adjacent matrix from 1st degree connections
+ if exists(self.num_adj_degrees):
+ assert exists(adj_mat), 'adjacency matrix must be passed in (keyword argument adj_mat)'
+
+ if len(adj_mat.shape) == 2:
+ adj_mat = repeat(adj_mat.clone(), 'i j -> b i j', b = b)
+
+ adj_indices = adj_mat.clone().long()
+
+ for ind in range(self.num_adj_degrees - 1):
+ degree = ind + 2
+
+ next_degree_adj_mat = (adj_mat.float() @ adj_mat.float()) > 0
+ next_degree_mask = (next_degree_adj_mat.float() - adj_mat.float()).bool()
+ adj_indices.masked_fill_(next_degree_mask, degree)
+ adj_mat = next_degree_adj_mat.clone()
+
+ if exists(self.adj_emb):
+ adj_emb = self.adj_emb(adj_indices)
+ edges = torch.cat((edges, adj_emb), dim = -1) if exists(edges) else adj_emb
+
+ # setup global attention
+
+ global_tokens = None
+ if exists(self.global_tokens):
+ global_tokens = repeat(self.global_tokens, 'n d -> b n d', b = b)
+
+ # go through layers
+
+ coor_changes = [coors]
+
+ for global_attn, egnn in self.layers:
+ if exists(global_attn):
+ feats, global_tokens = global_attn(feats, global_tokens, mask = mask)
+
+ feats, coors = egnn(feats, coors, adj_mat = adj_mat, edges = edges, mask = mask)
+ coor_changes.append(coors)
+
+ if return_coor_changes:
+ return feats, coors, coor_changes
+
+ return feats, coors
diff --git a/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch_geometric.py b/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch_geometric.py
new file mode 100644
index 0000000..ebb0598
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/egnn/egnn_pytorch_geometric.py
@@ -0,0 +1,467 @@
+import torch
+from torch import nn, einsum, broadcast_tensors
+import torch.nn.functional as F
+
+from einops import rearrange, repeat
+from einops.layers.torch import Rearrange
+# types
+
+from typing import Optional, List, Union
+
+# pytorch geometric
+
+try:
+ import torch_geometric
+ from torch_geometric.nn import MessagePassing
+ from torch_geometric.typing import Adj, Size, OptTensor, Tensor
+except:
+ Tensor = OptTensor = Adj = MessagePassing = Size = object
+ PYG_AVAILABLE = False
+
+ # to stop throwing errors from type suggestions
+ Adj = object
+ Size = object
+ OptTensor = object
+ Tensor = object
+
+from .egnn_pytorch import *
+
+# global linear attention
+
+class Attention_Sparse(Attention):
+ def __init__(self, dim, heads = 8, dim_head = 64):
+ """ Wraps the attention class to operate with pytorch-geometric inputs. """
+ super(Attention_Sparse, self).__init__(dim, heads = 8, dim_head = 64)
+
+ def sparse_forward(self, x, context, batch=None, batch_uniques=None, mask=None):
+ assert batch is not None or batch_uniques is not None, "Batch/(uniques) must be passed for block_sparse_attn"
+ if batch_uniques is None:
+ batch_uniques = torch.unique(batch, return_counts=True)
+ # only one example in batch - do dense - faster
+ if batch_uniques[0].shape[0] == 1:
+ x, context = map(lambda t: rearrange(t, 'h d -> () h d'), (x, context))
+ return self.forward(x, context, mask=None).squeeze() # get rid of batch dim
+ # multiple examples in batch - do block-sparse by dense loop
+ else:
+ x_list = []
+ aux_count = 0
+ for bi,n_idxs in zip(*batch_uniques):
+ x_list.append(
+ self.sparse_forward(
+ x[aux_count:aux_count+n_idxs],
+ context[aux_count:aux_count+n_idxs],
+ batch_uniques = (bi.unsqueeze(-1), n_idxs.unsqueeze(-1))
+ )
+ )
+ return torch.cat(x_list, dim=0)
+
+
+class GlobalLinearAttention_Sparse(nn.Module):
+ def __init__(
+ self,
+ *,
+ dim,
+ heads = 8,
+ dim_head = 64
+ ):
+ super().__init__()
+ self.norm_seq = torch_geometric.nn.norm.LayerNorm(dim)
+ self.norm_queries = torch_geometric.nn.norm.LayerNorm(dim)
+ self.attn1 = Attention_Sparse(dim, heads, dim_head)
+ self.attn2 = Attention_Sparse(dim, heads, dim_head)
+
+ # can't concat pyg norms with torch sequentials
+ self.ff_norm = torch_geometric.nn.norm.LayerNorm(dim)
+ self.ff = nn.Sequential(
+ nn.Linear(dim, dim * 4),
+ nn.GELU(),
+ nn.Linear(dim * 4, dim)
+ )
+
+ def forward(self, x, queries, batch=None, batch_uniques=None, mask = None):
+ res_x, res_queries = x, queries
+ x, queries = self.norm_seq(x, batch=batch), self.norm_queries(queries, batch=batch)
+
+ induced = self.attn1.sparse_forward(queries, x, batch=batch, batch_uniques=batch_uniques, mask = mask)
+ out = self.attn2.sparse_forward(x, induced, batch=batch, batch_uniques=batch_uniques)
+
+ x = out + res_x
+ queries = induced + res_queries
+
+ x_norm = self.ff_norm(x, batch=batch)
+ x = self.ff(x_norm) + x_norm
+ return x, queries
+
+
+# define pytorch-geometric equivalents
+
+class EGNN_Sparse(MessagePassing):
+ """ Different from the above since it separates the edge assignment
+ from the computation (this allows for great reduction in time and
+ computations when the graph is locally or sparse connected).
+ * aggr: one of ["add", "mean", "max"]
+ """
+ def __init__(
+ self,
+ feats_dim,
+ pos_dim=3,
+ edge_attr_dim = 0,
+ m_dim = 16,
+ fourier_features = 0,
+ soft_edge = 0,
+ norm_feats = False,
+ norm_coors = False,
+ norm_coors_scale_init = 1e-2,
+ update_feats = True,
+ update_coors = False,
+ dropout = 0.,
+ coor_weights_clamp_value = None,
+ aggr = "add",
+ mlp_num = 2,
+ **kwargs
+ ):
+ assert aggr in {'add', 'sum', 'max', 'mean'}, 'pool method must be a valid option'
+ assert update_feats or update_coors, 'you must update either features, coordinates, or both'
+ kwargs.setdefault('aggr', aggr)
+ super(EGNN_Sparse, self).__init__(**kwargs)
+ # model params
+ self.fourier_features = fourier_features
+ self.feats_dim = feats_dim
+ self.pos_dim = pos_dim
+ self.m_dim = m_dim
+ self.soft_edge = soft_edge
+ self.norm_feats = norm_feats
+ self.norm_coors = norm_coors
+ self.update_coors = update_coors
+ self.update_feats = update_feats
+ self.coor_weights_clamp_value = None
+ self.mlp_num = mlp_num
+ self.edge_input_dim = (fourier_features * 2) + edge_attr_dim + 1 + (feats_dim * 2)
+ self.dropout = nn.Dropout(dropout) if dropout > 0 else nn.Identity()
+
+ # EDGES
+ if self.mlp_num >2:
+ self.edge_mlp = nn.Sequential(
+ nn.Linear(self.edge_input_dim, self.edge_input_dim * 8),
+ self.dropout,
+ SiLU(),
+ nn.Linear(self.edge_input_dim * 8, self.edge_input_dim * 4),
+ self.dropout,
+ SiLU(),
+ nn.Linear(self.edge_input_dim * 4, self.edge_input_dim * 2),
+ self.dropout,
+ SiLU(),
+ nn.Linear(self.edge_input_dim * 2, m_dim),
+ SiLU(),
+ ) if update_feats else None
+ else:
+ self.edge_mlp = nn.Sequential(
+ nn.Linear(self.edge_input_dim, self.edge_input_dim * 2),
+ self.dropout,
+ SiLU(),
+ nn.Linear(self.edge_input_dim * 2, m_dim),
+ SiLU()
+ )
+
+ self.edge_weight = nn.Sequential(nn.Linear(m_dim, 1),
+ nn.Sigmoid()
+ ) if soft_edge else None
+
+ # NODES - can't do identity in node_norm bc pyg expects 2 inputs, but identity expects 1.
+ self.node_norm = torch_geometric.nn.norm.LayerNorm(feats_dim) if norm_feats else None
+ self.coors_norm = CoorsNorm(scale_init = norm_coors_scale_init) if norm_coors else nn.Identity()
+ if self.mlp_num >2:
+ self.node_mlp = nn.Sequential(
+ nn.Linear(feats_dim + m_dim, feats_dim * 8),
+ self.dropout,
+ SiLU(),
+ nn.Linear(feats_dim * 8, feats_dim * 4),
+ self.dropout,
+ SiLU(),
+ nn.Linear(feats_dim * 4, feats_dim * 2),
+ self.dropout,
+ SiLU(),
+ nn.Linear(feats_dim * 2, feats_dim),
+ ) if update_feats else None
+ else:
+ self.node_mlp = nn.Sequential(
+ nn.Linear(feats_dim + m_dim, feats_dim * 2),
+ self.dropout,
+ SiLU(),
+ nn.Linear(feats_dim * 2, feats_dim),
+ ) if update_feats else None
+
+ # COORS
+ self.coors_mlp = nn.Sequential(
+ nn.Linear(m_dim, m_dim * 4),
+ self.dropout,
+ SiLU(),
+ nn.Linear(self.m_dim * 4, 1)
+ ) if update_coors else None
+
+ self.apply(self.init_)
+
+ def init_(self, module):
+ if type(module) in {nn.Linear}:
+ # seems to be needed to keep the network from exploding to NaN with greater depths
+ nn.init.xavier_normal_(module.weight)
+ nn.init.zeros_(module.bias)
+
+ def forward(self, x: Tensor, edge_index: Adj,
+ edge_attr: OptTensor = None, batch: Adj = None,
+ angle_data: List = None, size: Size = None) -> Tensor:
+ """ Inputs:
+ * x: (n_points, d) where d is pos_dims + feat_dims
+ * edge_index: (2, n_edges)
+ * edge_attr: tensor (n_edges, n_feats) excluding basic distance feats.
+ * batch: (n_points,) long tensor. specifies xloud belonging for each point
+ * angle_data: list of tensors (levels, n_edges_i, n_length_path) long tensor.
+ * size: None
+ """
+ coors, feats = x[:, :self.pos_dim], x[:, self.pos_dim:]
+
+ rel_coors = coors[edge_index[0]] - coors[edge_index[1]]
+ rel_dist = (rel_coors ** 2).sum(dim=-1, keepdim=True)
+
+ if self.fourier_features > 0:
+ rel_dist = fourier_encode_dist(rel_dist, num_encodings = self.fourier_features)
+ rel_dist = rearrange(rel_dist, 'n () d -> n d')
+
+ if exists(edge_attr):
+ edge_attr_feats = torch.cat([edge_attr, rel_dist], dim=-1)
+ else:
+ edge_attr_feats = rel_dist
+
+ hidden_out, coors_out = self.propagate(edge_index, x=feats, edge_attr=edge_attr_feats,
+ coors=coors, rel_coors=rel_coors,
+ batch=batch)
+ return torch.cat([coors_out, hidden_out], dim=-1)
+
+
+ def message(self, x_i, x_j, edge_attr) -> Tensor:
+ m_ij = self.edge_mlp( torch.cat([x_i, x_j, edge_attr], dim=-1) )
+ return m_ij
+
+ def propagate(self, edge_index: Adj, size: Size = None, **kwargs):
+ """The initial call to start propagating messages.
+ Args:
+ `edge_index` holds the indices of a general (sparse)
+ assignment matrix of shape :obj:`[N, M]`.
+ size (tuple, optional) if none, the size will be inferred
+ and assumed to be quadratic.
+ **kwargs: Any additional data which is needed to construct and
+ aggregate messages, and to update node embeddings.
+ """
+ size = self._check_input(edge_index, size)
+ coll_dict = self._collect(self._user_args, edge_index, size, kwargs)
+ msg_kwargs = self.inspector.distribute('message', coll_dict)
+ aggr_kwargs = self.inspector.distribute('aggregate', coll_dict)
+ update_kwargs = self.inspector.distribute('update', coll_dict)
+
+ # get messages
+ m_ij = self.message(**msg_kwargs)
+
+ # update coors if specified
+ if self.update_coors:
+ coor_wij = self.coors_mlp(m_ij)
+ # clamp if arg is set
+ if self.coor_weights_clamp_value:
+ coor_weights_clamp_value = self.coor_weights_clamp_value
+ # coor_weights.clamp_(min = -clamp_value, max = clamp_value)
+
+ # normalize if needed
+ kwargs["rel_coors"] = self.coors_norm(kwargs["rel_coors"])
+
+ mhat_i = self.aggregate(coor_wij * kwargs["rel_coors"], **aggr_kwargs)
+ coors_out = kwargs["coors"] + mhat_i
+ else:
+ coors_out = kwargs["coors"]
+
+ # update feats if specified
+ if self.update_feats:
+ # weight the edges if arg is passed
+ if self.soft_edge:
+ m_ij = m_ij * self.edge_weight(m_ij)
+ m_i = self.aggregate(m_ij, **aggr_kwargs)
+
+ hidden_feats = self.node_norm(kwargs["x"], kwargs["batch"]) if self.node_norm else kwargs["x"]
+ hidden_out = self.node_mlp( torch.cat([hidden_feats, m_i], dim = -1) )
+ hidden_out = kwargs["x"] + hidden_out
+ else:
+ hidden_out = kwargs["x"]
+
+ # return tuple
+ return self.update((hidden_out, coors_out), **update_kwargs)
+
+ def __repr__(self):
+ dict_print = {}
+ return "E(n)-GNN Layer for Graphs " + str(self.__dict__)
+
+
+class EGNN_Sparse_Network(nn.Module):
+ r"""Sample GNN model architecture that uses the EGNN-Sparse
+ message passing layer to learn over point clouds.
+ Main MPNN layer introduced in https://arxiv.org/abs/2102.09844v1
+
+ Inputs will be standard GNN: x, edge_index, edge_attr, batch, ...
+
+ Args:
+ * n_layers: int. number of MPNN layers
+ * ... : same interpretation as the base layer.
+ * embedding_nums: list. number of unique keys to embedd. for points
+ 1 entry per embedding needed.
+ * embedding_dims: list. point - number of dimensions of
+ the resulting embedding. 1 entry per embedding needed.
+ * edge_embedding_nums: list. number of unique keys to embedd. for edges.
+ 1 entry per embedding needed.
+ * edge_embedding_dims: list. point - number of dimensions of
+ the resulting embedding. 1 entry per embedding needed.
+ * recalc: int. Recalculate edge feats every `recalc` MPNN layers. 0 for no recalc
+ * verbose: bool. verbosity level.
+ -----
+ Diff with normal layer: one has to do preprocessing before (radius, global token, ...)
+ """
+ def __init__(self, n_layers, feats_dim,
+ pos_dim = 3,
+ edge_attr_dim = 0,
+ m_dim = 16,
+ fourier_features = 0,
+ soft_edge = 0,
+ embedding_nums=[],
+ embedding_dims=[],
+ edge_embedding_nums=[],
+ edge_embedding_dims=[],
+ update_coors=True,
+ update_feats=True,
+ norm_feats=True,
+ norm_coors=False,
+ norm_coors_scale_init = 1e-2,
+ dropout=0.,
+ coor_weights_clamp_value=None,
+ aggr="add",
+ global_linear_attn_every = 0,
+ global_linear_attn_heads = 8,
+ global_linear_attn_dim_head = 64,
+ num_global_tokens = 4,
+ recalc=0 ,):
+ super().__init__()
+
+ self.n_layers = n_layers
+
+ # Embeddings? solve here
+ self.embedding_nums = embedding_nums
+ self.embedding_dims = embedding_dims
+ self.emb_layers = nn.ModuleList()
+ self.edge_embedding_nums = edge_embedding_nums
+ self.edge_embedding_dims = edge_embedding_dims
+ self.edge_emb_layers = nn.ModuleList()
+
+ # instantiate point and edge embedding layers
+
+ for i in range( len(self.embedding_dims) ):
+ self.emb_layers.append(nn.Embedding(num_embeddings = embedding_nums[i],
+ embedding_dim = embedding_dims[i]))
+ feats_dim += embedding_dims[i] - 1
+
+ for i in range( len(self.edge_embedding_dims) ):
+ self.edge_emb_layers.append(nn.Embedding(num_embeddings = edge_embedding_nums[i],
+ embedding_dim = edge_embedding_dims[i]))
+ edge_attr_dim += edge_embedding_dims[i] - 1
+ # rest
+ self.mpnn_layers = nn.ModuleList()
+ self.feats_dim = feats_dim
+ self.pos_dim = pos_dim
+ self.edge_attr_dim = edge_attr_dim
+ self.m_dim = m_dim
+ self.fourier_features = fourier_features
+ self.soft_edge = soft_edge
+ self.norm_feats = norm_feats
+ self.norm_coors = norm_coors
+ self.norm_coors_scale_init = norm_coors_scale_init
+ self.update_feats = update_feats
+ self.update_coors = update_coors
+ self.dropout = dropout
+ self.coor_weights_clamp_value = coor_weights_clamp_value
+ self.recalc = recalc
+
+ self.has_global_attn = global_linear_attn_every > 0
+ self.global_tokens = None
+ self.global_linear_attn_every = global_linear_attn_every
+ if self.has_global_attn:
+ self.global_tokens = nn.Parameter(torch.randn(num_global_tokens, self.feats_dim))
+
+ # instantiate layers
+ for i in range(n_layers):
+ layer = EGNN_Sparse(feats_dim = feats_dim,
+ pos_dim = pos_dim,
+ edge_attr_dim = edge_attr_dim,
+ m_dim = m_dim,
+ fourier_features = fourier_features,
+ soft_edge = soft_edge,
+ norm_feats = norm_feats,
+ norm_coors = norm_coors,
+ norm_coors_scale_init = norm_coors_scale_init,
+ update_feats = update_feats,
+ update_coors = update_coors,
+ dropout = dropout,
+ coor_weights_clamp_value = coor_weights_clamp_value)
+
+ # global attention case
+ is_global_layer = self.has_global_attn and (i % self.global_linear_attn_every) == 0
+ if is_global_layer:
+ attn_layer = GlobalLinearAttention_Sparse(dim = self.feats_dim,
+ heads = global_linear_attn_heads,
+ dim_head = global_linear_attn_dim_head)
+ self.mpnn_layers.append(nn.ModuleList([attn_layer,layer]))
+ # normal case
+ else:
+ self.mpnn_layers.append(layer)
+
+
+ def forward(self, x, edge_index, batch, edge_attr,
+ bsize=None, recalc_edge=None, verbose=0):
+ """ Recalculate edge features every `self.recalc_edge` with the
+ `recalc_edge` function if self.recalc_edge is set.
+
+ * x: (N, pos_dim+feats_dim) will be unpacked into coors, feats.
+ """
+ # NODES - Embedd each dim to its target dimensions:
+ x = embedd_token(x, self.embedding_dims, self.emb_layers)
+
+ # regulates wether to embedd edges each layer
+ edges_need_embedding = False
+ for i,layer in enumerate(self.mpnn_layers):
+
+ # EDGES - Embedd each dim to its target dimensions:
+ if edges_need_embedding:
+ edge_attr = embedd_token(edge_attr, self.edge_embedding_dims, self.edge_emb_layers)
+ edges_need_embedding = False
+
+ # attn tokens
+ self.global_tokens = None
+ if exists(self.global_tokens):
+ unique, amounts = torch.unique(batch, return_counts=True)
+ num_idxs = torch.cat([torch.arange(num_idxs_i,device=self.global_tokens.device) for num_idxs_i in amounts], dim=-1)
+ global_tokens = self.global_tokens[num_idxs]
+
+ # pass layers
+ is_global_layer = self.has_global_attn and (i % self.global_linear_attn_every) == 0
+ if not is_global_layer:
+ x = layer(x, edge_index, edge_attr, batch=batch, size=bsize)
+ else:
+ # only pass feats to the attn layer
+ # unique, amounts = torch.unique(batch, return_counts=True)
+ x_attn = layer[0](x[:, self.pos_dim:], x[:, self.pos_dim:],batch)[0]#global_tokens
+ # merge attn-ed feats and coords
+ x = torch.cat( (x[:, :self.pos_dim], x_attn), dim=-1)
+ x = layer[-1](x, edge_index, edge_attr, batch=batch, size=bsize)
+
+ # recalculate edge info - not needed if last layer
+ if self.recalc and ((i%self.recalc == 0) and not (i == len(self.mpnn_layers)-1)) :
+ edge_index, edge_attr, _ = recalc_edge(x) # returns attr, idx, any_other_info
+ edges_need_embedding = True
+
+ return x
+
+ def __repr__(self):
+ return 'EGNN_Sparse_Network of: {0} layers'.format(len(self.mpnn_layers))
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/module/egnn/network.py b/proteingym/baselines/protssn/src/module/egnn/network.py
new file mode 100644
index 0000000..8aa6e0e
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/egnn/network.py
@@ -0,0 +1,124 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+from argparse import Namespace
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+
+from src.module.egnn import EGNN_Sparse
+from src.module.egnn.utils import get_edge_feature_dims, get_node_feature_dims
+
+class nodeEncoder(torch.nn.Module):
+
+ def __init__(self, emb_dim):
+ super(nodeEncoder, self).__init__()
+
+ self.atom_embedding_list = torch.nn.ModuleList()
+ self.node_feature_dim = get_node_feature_dims()
+ for i, dim in enumerate(self.node_feature_dim):
+ emb = torch.nn.Linear(dim, emb_dim)
+ torch.nn.init.xavier_uniform_(emb.weight.data)
+ self.atom_embedding_list.append(emb)
+
+ def forward(self, x):
+ x_embedding = 0
+ feature_dim_count = 0
+ for i in range(len(self.node_feature_dim)):
+ x_embedding += self.atom_embedding_list[i](
+ x[:, feature_dim_count:feature_dim_count + self.node_feature_dim[i]])
+ feature_dim_count += self.node_feature_dim[i]
+ return x_embedding
+
+
+class edgeEncoder(torch.nn.Module):
+ def __init__(self, emb_dim):
+ super(edgeEncoder, self).__init__()
+ self.atom_embedding_list = torch.nn.ModuleList()
+ self.edge_feature_dims = get_edge_feature_dims()
+ for i, dim in enumerate(self.edge_feature_dims):
+ emb = torch.nn.Linear(dim, emb_dim)
+ torch.nn.init.xavier_uniform_(emb.weight.data)
+ self.atom_embedding_list.append(emb)
+
+ def forward(self, x):
+ x_embedding = 0
+ feature_dim_count = 0
+ for i in range(len(self.edge_feature_dims)):
+ x_embedding += self.atom_embedding_list[i](
+ x[:, feature_dim_count:feature_dim_count + self.edge_feature_dims[i]])
+ feature_dim_count += self.edge_feature_dims[i]
+ return x_embedding
+
+class GNNClassificationHead(nn.Module):
+ """Head for sentence-level classification tasks."""
+
+ def __init__(self, hidden_size, hidden_dropout_prob):
+ super().__init__()
+ self.dense = nn.Linear(hidden_size, hidden_size)
+ self.dropout = nn.Dropout(hidden_dropout_prob)
+ self.out_proj = nn.Linear(hidden_size, 1)
+
+ def forward(self, features, batch):
+ features = features.reshape(max(batch)+1, -1, features.shape[-1])
+ x = torch.mean(features, dim=1) # average pool over the tokens
+ x = self.dropout(x)
+ x = self.dense(x)
+ x = torch.tanh(x)
+ x = self.dropout(x)
+ x = self.out_proj(x)
+ return x
+
+
+class EGNN(nn.Module):
+ def __init__(self, gnn_config, config, input_dim, out_dim):
+ super(EGNN, self).__init__()
+ self.gnn_config = gnn_config
+ self.config = config
+ self.mpnn_layes = nn.ModuleList([
+ EGNN_Sparse(
+ input_dim,
+ m_dim=int(self.gnn_config["hidden_channels"]),
+ edge_attr_dim=int(self.gnn_config["edge_attr_dim"]),
+ dropout=int(self.gnn_config["dropout"]),
+ mlp_num=int(self.gnn_config["mlp_num"]))
+ for _ in range(int(self.gnn_config["n_layers"]))])
+
+ if gnn_config["embedding"]:
+ self.node_embedding = nodeEncoder(input_dim)
+ self.edge_embedding = edgeEncoder(input_dim)
+
+ self.lin = nn.Linear(input_dim, out_dim)
+ self.droplayer = nn.Dropout(int(self.gnn_config["dropout"]))
+
+
+ def forward(self, data):
+ x, pos, edge_index, edge_attr, batch = (
+ data.x, data.pos,
+ data.edge_index,
+ data.edge_attr, data.batch
+ )
+
+ input_x = data.esm_rep
+ input_x = torch.cat([pos, input_x], dim=1)
+
+ if self.gnn_config['embedding']:
+ input_x = self.node_embedding(input_x)
+ edge_attr = self.edge_embedding(edge_attr)
+
+ for i, layer in enumerate(self.mpnn_layes):
+ h = layer(input_x, edge_index, edge_attr, batch)
+ if self.gnn_config['residual']:
+ input_x = input_x + h
+ else:
+ input_x = h
+
+ x = input_x[:, 3:]
+ x = self.droplayer(x)
+ x = self.lin(x)
+
+ return x, input_x[:, 3:]
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/module/egnn/utils.py b/proteingym/baselines/protssn/src/module/egnn/utils.py
new file mode 100644
index 0000000..c31020a
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/egnn/utils.py
@@ -0,0 +1,32 @@
+import torch
+from torch import sin, cos, atan2, acos
+
+def rot_z(gamma):
+ return torch.tensor([
+ [cos(gamma), -sin(gamma), 0],
+ [sin(gamma), cos(gamma), 0],
+ [0, 0, 1]
+ ], dtype=gamma.dtype)
+
+def rot_y(beta):
+ return torch.tensor([
+ [cos(beta), 0, sin(beta)],
+ [0, 1, 0],
+ [-sin(beta), 0, cos(beta)]
+ ], dtype=beta.dtype)
+
+def rot(alpha, beta, gamma):
+ return rot_z(alpha) @ rot_y(beta) @ rot_z(gamma)
+
+def get_node_feature_dims():
+ '''
+ each node has 25 dim feature corrsponding to residual type, sasa, dihedral, mu_r_norm
+ '''
+ return [20, 1,1, 4, 5,640]
+
+
+def get_edge_feature_dims():
+ '''
+ each node has 93 dim feature corrsponding to one hot sequence distance, interatomic distance, local frame orientation
+ '''
+ return [65, 1, 15, 12]
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/module/gat/network.py b/proteingym/baselines/protssn/src/module/gat/network.py
new file mode 100644
index 0000000..86e3c6e
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/gat/network.py
@@ -0,0 +1,59 @@
+import torch
+from torch_geometric.nn import GATConv
+import torch.nn.functional as F
+
+
+class GAT(torch.nn.Module):
+ def __init__(self, config, feat_type, input_dim, out_dim):
+ super(GAT, self).__init__()
+ self.config = config
+ self.feat_type = feat_type
+ self.hidden_dim = config["hidden_channels"]
+ self.input_dim, self.out_dim = input_dim, out_dim
+
+ self.convs = torch.nn.ModuleList()
+ self.convs.append(GATConv(input_dim, self.hidden_dim))
+
+ self.bns = torch.nn.ModuleList()
+ self.bns.append(torch.nn.BatchNorm1d(self.hidden_dim))
+ for _ in range(config["n_layers"] - 2):
+ self.convs.append(
+ GATConv(self.hidden_dim, self.hidden_dim))
+ self.bns.append(torch.nn.BatchNorm1d(self.hidden_dim))
+ self.convs.append(GATConv(self.hidden_dim, self.out_dim))
+
+ self.dropout_prob = config["dropout"]
+
+
+ def reset_parameters(self):
+ for conv in self.convs:
+ conv.reset_parameters()
+ for bn in self.bns:
+ bn.reset_parameters()
+
+ def forward(self, data):
+ x, pos, mu_r_norm, edge_index, edge_attr, batch = (
+ data.x.float(),
+ data.pos.float(),
+ data.mu_r_norm.float(),
+ data.edge_index,
+ data.edge_attr.float(),
+ data.batch
+ )
+
+ input_x = torch.empty([pos.shape[0], 0]).to(x.device)
+ if "manual" in self.feat_type:
+ input_x = torch.cat([input_x, x], dim=1)
+ if "esm" in self.feat_type:
+ esm_rep = data.esm_rep.float()
+ input_x = torch.cat([input_x, esm_rep], dim=1)
+ x = input_x
+ edge_index = edge_index
+
+ for i, conv in enumerate(self.convs[:-1]):
+ x = conv(x, edge_index)
+ x = self.bns[i](x)
+ x = F.relu(x)
+ x = F.dropout(x, p=self.dropout_prob, training=self.training)
+ x = self.convs[-1](x,edge_index)
+ return x
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/module/gcn/network.py b/proteingym/baselines/protssn/src/module/gcn/network.py
new file mode 100644
index 0000000..55d582d
--- /dev/null
+++ b/proteingym/baselines/protssn/src/module/gcn/network.py
@@ -0,0 +1,58 @@
+import torch
+from torch_geometric.nn import GCNConv
+import torch.nn.functional as F
+
+
+class GCN(torch.nn.Module):
+ def __init__(self, config, feat_type, input_dim, out_dim):
+ super(GCN, self).__init__()
+ self.config = config
+ self.feat_type = feat_type
+ self.hidden_dim = config["hidden_channels"]
+ self.input_dim, self.out_dim = input_dim, out_dim
+
+ self.convs = torch.nn.ModuleList()
+ self.convs.append(GCNConv(input_dim, self.hidden_dim))
+
+ self.bns = torch.nn.ModuleList()
+ self.bns.append(torch.nn.BatchNorm1d(self.hidden_dim))
+ for _ in range(config["n_layers"] - 2):
+ self.convs.append(
+ GCNConv(self.hidden_dim, self.hidden_dim))
+ self.bns.append(torch.nn.BatchNorm1d(self.hidden_dim))
+ self.convs.append(GCNConv(self.hidden_dim, self.out_dim))
+
+ self.dropout_prob = config["dropout"]
+
+
+ def reset_parameters(self):
+ for conv in self.convs:
+ conv.reset_parameters()
+ for bn in self.bns:
+ bn.reset_parameters()
+
+ def forward(self, data):
+ x, pos, mu_r_norm, edge_index, edge_attr, batch, = (
+ data.x.float(),
+ data.pos.float(),
+ data.mu_r_norm.float(),
+ data.edge_index,
+ data.edge_attr.float(),
+ data.batch)
+
+ input_x = torch.empty([pos.shape[0], 0]).to(x.device)
+ if "manual" in self.feat_type:
+ input_x = torch.cat([input_x, x], dim=1)
+ if "esm" in self.feat_type:
+ esm_rep = data.esm_rep.float()
+ input_x = torch.cat([input_x, esm_rep], dim=1)
+ x = input_x
+ edge_index = edge_index
+
+ for i, conv in enumerate(self.convs[:-1]):
+ x = conv(x, edge_index)
+ x = self.bns[i](x)
+ x = F.relu(x)
+ x = F.dropout(x, p=self.dropout_prob, training=self.training)
+ x = self.convs[-1](x,edge_index)
+ return x
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/utils/data_utils.py b/proteingym/baselines/protssn/src/utils/data_utils.py
new file mode 100644
index 0000000..74e2434
--- /dev/null
+++ b/proteingym/baselines/protssn/src/utils/data_utils.py
@@ -0,0 +1,33 @@
+import random
+import torch.utils.data as data
+
+
+class BatchSampler(data.Sampler):
+ def __init__(self, dataset, max_len=5000, batch_token_num=3096, shuffle=True):
+ self.node_num = [d.x.shape[0] for d in dataset]
+ self.idx = [i for i in range(len(self.node_num))
+ if self.node_num[i] <= max_len]
+ self.shuffle = shuffle
+ self.batches = []
+ self.max_len = max_len
+ self.batch_token_num = batch_token_num
+
+ def _form_batches(self):
+ if self.shuffle: random.shuffle(self.idx)
+ idx = self.idx
+ while idx:
+ batch = []
+ n_nodes = 0
+ while idx and n_nodes + self.node_num[idx[0]] <= self.batch_token_num:
+ next_idx, idx = idx[0], idx[1:]
+ n_nodes += self.node_num[next_idx]
+ batch.append(next_idx)
+ self.batches.append(batch)
+
+ def __len__(self):
+ if not self.batches: self._form_batches()
+ return len(self.batches)
+
+ def __iter__(self):
+ if not self.batches: self._form_batches()
+ for batch in self.batches: yield batch
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/utils/dataset_utils.py b/proteingym/baselines/protssn/src/utils/dataset_utils.py
new file mode 100644
index 0000000..11b87d0
--- /dev/null
+++ b/proteingym/baselines/protssn/src/utils/dataset_utils.py
@@ -0,0 +1,610 @@
+import torch
+import sys
+import os
+import random
+import csv
+import numpy as np
+import seaborn as sns
+import matplotlib.pyplot as plt
+from datetime import datetime
+from numpy import array, cross, pi, arccos, sqrt
+from tqdm import tqdm
+from time import time
+from torch_geometric.transforms import BaseTransform
+from scipy.stats import spearmanr
+from numpy import nan
+
+def dataset_argument_(root):
+ dataset_arg = {}
+ if root == "cath40_k10_dyn_imem":
+ dataset_arg['root'] = f"data/{root}"
+ dataset_arg['name'] = '40'
+ dataset_arg['set_length'] = None
+ dataset_arg['normal_file'] = None
+ dataset_arg['divide_num'] = 1
+ dataset_arg['divide_idx'] = 0
+ dataset_arg['c_alpha_max_neighbors'] = 10
+ return dataset_arg
+
+def get_stat(graph_root, limited_num=None, num_subgroup=1000, max_limits=100000):
+ # obtain mean and std of graphs in graph_root
+ # graph_root: string, calculate mean and std of all attributes of graphs in graph_root
+ # limited_num: int, optional, just calculated limited number of graphs in graph_root
+ # num_Subgroup: int, group all graphs in graph_root, the number of each subgroup is num_subgroup
+ # max_limits: int, set the initial minimum value as max_limits
+
+ wrong_proteins = []
+ filenames = os.listdir(graph_root)
+ random.shuffle(filenames)
+ # set sample length
+ n = len(filenames)
+ if limited_num:
+ n = min(n, limited_num)
+ count = 0
+ if n < num_subgroup * 10:
+ num_subgroup = 1
+
+ # initialize scalar value
+ num_node_min, num_edge_min = torch.tensor(
+ [max_limits]), torch.tensor([max_limits])
+ num_node_max, num_node_avg, num_edge_max, num_edge_avg = torch.tensor(
+ [0]), torch.tensor([0]), torch.tensor([0]), torch.tensor([0])
+
+ # initialize mean, std
+ graph = torch.load(os.path.join(graph_root, filenames[0]))
+ x, pos, edge_attr = graph.x, graph.pos, graph.edge_attr
+ x_mean = torch.zeros(x.shape[1])
+ x_max = torch.zeros(x.shape[1])
+ x_min = torch.tensor([max_limits for i in range(x.shape[1])])
+ x_std = torch.zeros(x.shape[1])
+ pos_mean = torch.zeros(pos.shape[1])
+ pos_std = torch.zeros(pos.shape[1])
+ edge_attr_mean = torch.zeros(edge_attr.shape[1])
+ edge_attr_std = torch.zeros(edge_attr.shape[1])
+
+ # initialize sub mean, std
+ x_mean_1 = torch.zeros(x.shape[1])
+ x_std_1 = torch.zeros(x.shape[1])
+ pos_mean_1 = torch.zeros(pos.shape[1])
+ pos_std_1 = torch.zeros(pos.shape[1])
+
+ edge_attr_mean_1 = torch.zeros(edge_attr.shape[1])
+ edge_attr_std_1 = torch.zeros(edge_attr.shape[1])
+
+ for i in tqdm(range(n)):
+ file = filenames[i]
+ graph = torch.load(os.path.join(graph_root, file))
+ x, pos, mu_r_norm, edge_attr = graph.x, graph.pos, graph.mu_r_norm, graph.edge_attr
+ if torch.isnan(x).any():
+ wrong_proteins.append(file)
+ continue
+ count += 1
+ node_num = graph.x.shape[0]
+ edge_num = graph.edge_attr.shape[0]
+ num_node_min = min(num_node_min, node_num)
+ num_edge_min = min(num_edge_min, edge_num)
+ num_node_max = max(num_node_max, node_num)
+ num_edge_max = max(num_edge_max, edge_num)
+ num_node_avg += node_num
+ num_edge_avg += edge_num
+
+ x_max = torch.max(x_max, x.max(axis=0).values)
+ x_min = torch.min(x_min, x.min(axis=0).values)
+ x_mean_1 += x.nanmean(axis=0)
+ x_std_1 += x.std(axis=0)
+ pos_mean_1 += pos.mean(axis=0)
+ pos_std_1 += pos.std(axis=0)
+ edge_attr_mean_1 += edge_attr.mean(axis=0)
+ edge_attr_std_1 += edge_attr.std(axis=0)
+
+ if count == num_subgroup:
+ x_mean += x_mean_1.div_(num_subgroup)
+ x_std += x_std_1.div_(num_subgroup)
+ pos_mean += pos_mean_1.div_(num_subgroup)
+ pos_std += pos_std_1.div_(num_subgroup)
+ edge_attr_mean += edge_attr_mean_1.div_(num_subgroup)
+ edge_attr_std += edge_attr_std_1.div_(num_subgroup)
+
+ x_mean_1 = torch.zeros(x.shape[1])
+ x_std_1 = torch.zeros(x.shape[1])
+ pos_mean_1 = torch.zeros(pos.shape[1])
+ pos_std_1 = torch.zeros(pos.shape[1])
+ edge_attr_mean_1 = torch.zeros(edge_attr.shape[1])
+ edge_attr_std_1 = torch.zeros(edge_attr.shape[1])
+ count = 0
+
+ num_node_avg = num_node_avg/n
+ num_edge_avg = num_edge_avg/n
+ n_2 = n // num_subgroup
+ x_mean = x_mean.div_(n_2)
+ x_std = x_std.div_(n_2)
+ pos_mean = pos_mean.div_(n_2)
+ pos_std = pos_std.div_(n_2)
+ edge_attr_mean = edge_attr_mean.div_(n_2)
+ edge_attr_std = edge_attr_std.div_(n_2)
+
+ dic = {'x_max': x_max, 'x_min': x_min, 'x_mean': x_mean, 'x_std': x_std,
+ 'pos_mean': pos_mean, 'pos_std': pos_std,
+ 'edge_attr_mean': edge_attr_mean, 'edge_attr_std': edge_attr_std,
+ 'num_graph': n - len(wrong_proteins),
+ 'num_node_min': num_node_min, 'num_edge_min': num_edge_min,
+ 'num_node_max': num_node_max, 'num_edge_max': num_edge_max,
+ 'num_node_avg': num_node_avg, 'num_edge_avg': num_edge_avg}
+
+ filename = 'mean_attr'
+ saved_filename_pt = os.path.join(
+ '/'.join(graph_root.split('/')[:-1]), filename + '.pt')
+ torch.save(dic, saved_filename_pt)
+ saved_filename = os.path.join(
+ '/'.join(graph_root.split('/')[:-1]), filename + '.csv')
+ w = csv.writer(open(saved_filename, 'w'))
+ for key, val in dic.items():
+ w.writerow([key, val])
+
+ saved_filename = os.path.join(
+ '/'.join(graph_root.split('/')[:-1]), filename + '_proteins.txt')
+ with open(saved_filename, 'w') as f:
+ for i in range(n):
+ f.write(str(filenames[i]) + '\n')
+
+ saved_filename = os.path.join(
+ '/'.join(graph_root.split('/')[:-1]), filename + '_wrong_proteins.txt')
+ with open(saved_filename, 'w') as f:
+ for file in wrong_proteins:
+ f.write(file + '\n')
+
+ return saved_filename_pt
+
+# @functional_transform('normalize_protein')
+
+
+class NormalizeProtein(BaseTransform):
+ r"""Centers and normalizes node positions to the interval :math:`(-1, 1)`
+ (functional name: :obj:`normalize_scale`).
+ """
+
+ def __init__(self, filename, skip_x=20, skip_edge_attr=64, safe_domi=1e-10):
+
+ dic = torch.load(filename)
+ self.skip_x = skip_x
+ self.skip_edge_attr = skip_edge_attr
+ self.safe_domi = safe_domi
+ self.x_mean = dic['x_mean']
+ self.x_std = dic['x_std']
+ self.pos_mean = dic['pos_mean']
+ self.pos_std = torch.mean(dic['pos_std'])
+ self.edge_attr_mean = dic['edge_attr_mean']
+ self.edge_attr_std = dic['edge_attr_std']
+
+ def __call__(self, data):
+ data.x[:, self.skip_x:] = (data.x[:, self.skip_x:] - self.x_mean[self.skip_x:]
+ ).div_(self.x_std[self.skip_x:] + self.safe_domi)
+ data.pos = data.pos - data.pos.mean(dim=-2, keepdim=False)
+ data.pos = data.pos.div_(self.pos_std + self.safe_domi)
+ data.edge_attr[:, self.skip_edge_attr:] = (data.edge_attr[:, self.skip_edge_attr:]
+ - self.edge_attr_mean[self.skip_edge_attr:]).div_(self.edge_attr_std[self.skip_edge_attr:] + self.safe_domi)
+
+ return data
+
+
+class DihedralGeometryError(Exception):
+ pass
+
+
+class AngleGeometryError(Exception):
+ pass
+
+
+ROUND_ERROR = 1e-14
+
+
+class Logger(object):
+ def __init__(self, logpath, syspart=sys.stdout):
+ self.terminal = syspart
+ self.log = open(logpath, "a")
+
+ def write(self, message):
+
+ self.terminal.write(message)
+ self.log.write(message)
+ self.log.flush()
+
+ def flush(self):
+ # this flush method is needed for python 3 compatibility.
+ # this handles the flush command by doing nothing.
+ # you might want to specify some extra behavior here.
+ pass
+
+
+def log(*args):
+ print(f'[{datetime.now()}]', *args)
+
+
+def safe_index(l, e):
+ """
+ Return index of element e in list l. If e is not present, return the last index
+ """
+ try:
+ return l.index(e)
+ except:
+ return len(l) - 1
+
+
+def one_hot_res(type_idx, num_residue_type=20):
+ rec_feat = [0 for _ in range(num_residue_type)]
+ if type_idx < num_residue_type:
+ rec_feat[type_idx] = 1
+ return rec_feat
+ else:
+ # print("Warning: residue type index exceeds "+num_residue_type+" !")
+ return False
+
+
+def nan_to_num(ts, val=0.0):
+ """
+ Replaces nans in tensor with a fixed value.
+ """
+ val = torch.tensor(val, dtype=ts.dtype, device=ts.device)
+ return torch.where(~torch.isfinite(ts), val, ts)
+
+
+def normalize(tensor, dim=-1):
+ """
+ Normalizes a tensor along a dimension after removing nans.
+ """
+ return nan_to_num(
+ torch.div(tensor, norm(tensor, dim=dim, keepdim=True))
+ )
+
+# def norm(tensor, dim, eps=1e-8, keepdim=False):
+# """
+# Returns L2 norm along a dimension.
+# """
+# return torch.sqrt(
+# torch.sum(torch.square(tensor), dim=dim, keepdim=keepdim) + eps)
+
+
+def norm(a):
+ """Returns the norm of a matrix or vector
+ Calculates the Euclidean norm of a vector.
+ Applies the Frobenius norm function to a matrix
+ (a.k.a. Euclidian matrix norm)
+ a = numpy array
+ """
+ return sqrt(sum((a*a).flat))
+
+
+def create_vector(vec):
+ """Returns a vector as a numpy array."""
+ return array([vec[0], vec[1], vec[2]])
+
+
+def create_vectors(vec1, vec2, vec3, vec4):
+ """Returns dihedral angle, takes four
+ Scientific.Geometry.Vector objects
+ (dihedral does not work for them because
+ the Win and Linux libraries are not identical.
+ """
+ return map(create_vector, [vec1, vec2, vec3, vec4])
+
+
+def fix_rounding_error(x):
+ """If x is almost in the range 0-1, fixes it.
+ Specifically, if x is between -ROUND_ERROR and 0, returns 0.
+ If x is between 1 and 1+ROUND_ERROR, returns 1.
+ """
+ if -ROUND_ERROR < x < 0:
+ return 0
+ elif 1 < x < 1+ROUND_ERROR:
+ return 1
+ else:
+ return
+
+
+def angle(v1, v2):
+ """
+ calculates the angle between two vectors.
+ v1 and v2 are numpy.array objects.
+ returns a float containing the angle in radians.
+ """
+ length_product = norm(v1)*norm(v2)
+ if length_product == 0:
+ raise AngleGeometryError(
+ "Cannot calculate angle for vectors with length zero")
+ cosine = scalar(v1, v2)/length_product
+ # angle = arccos(fix_rounding_error(cosine))
+ angle = arccos(cosine)
+
+ return angle
+
+
+def scalar(v1, v2):
+ """
+ calculates the scalar product of two vectors
+ v1 and v2 are numpy.array objects.
+ returns a float for a one-dimensional array.
+ """
+ return sum(v1*v2)
+
+
+def dihedral(vec1, vec2, vec3, vec4):
+ """
+ Returns a float value for the dihedral angle between
+ the four vectors. They define the bond for which the
+ torsion is calculated (~) as:
+ V1 - V2 ~ V3 - V4
+ The vectors vec1 .. vec4 can be array objects, lists or tuples of length
+ three containing floats.
+ For Scientific.geometry.Vector objects the behavior is different
+ on Windows and Linux. Therefore, the latter is not a featured input type
+ even though it may work.
+ If the dihedral angle cant be calculated (because vectors are collinear),
+ the function raises a DihedralGeometryError
+ """
+ # create array instances.
+ v1, v2, v3, v4 = create_vectors(vec1, vec2, vec3, vec4)
+ all_vecs = [v1, v2, v3, v4]
+
+ # rule out that two of the atoms are identical
+ # except the first and last, which may be.
+ for i in range(len(all_vecs)-1):
+ for j in range(i+1, len(all_vecs)):
+ if i > 0 or j < 3: # exclude the (1,4) pair
+ equals = all_vecs[i] == all_vecs[j]
+ if equals.all():
+ raise DihedralGeometryError(
+ "Vectors #%i and #%i may not be identical!" % (i, j))
+
+ # calculate vectors representing bonds
+ v12 = v2-v1
+ v23 = v3-v2
+ v34 = v4-v3
+
+ # calculate vectors perpendicular to the bonds
+ normal1 = cross(v12, v23)
+ normal2 = cross(v23, v34)
+
+ # check for linearity
+ if norm(normal1) == 0 or norm(normal2) == 0:
+ raise DihedralGeometryError(
+ "Vectors are in one line; cannot calculate normals!")
+
+ # normalize them to length 1.0
+ normal1 = normal1/norm(normal1)
+ normal2 = normal2/norm(normal2)
+
+ # calculate torsion and convert to degrees
+ torsion = angle(normal1, normal2) * 180.0/pi
+
+ # take into account the determinant
+ # (the determinant is a scalar value distinguishing
+ # between clockwise and counter-clockwise torsion.
+ if scalar(normal1, v34) >= 0:
+ return torsion
+ else:
+ torsion = 360-torsion
+ if torsion == 360:
+ torsion = 0.0
+ return torsion
+
+
+def seq_dist_distrib(loader):
+ before = time()
+ q = np.array([0.25, 0.5, 0.75, 0.9, 0.95, 0.98, 0.99])
+ seq_dist = torch.Tensor(0)
+ for _, data in tqdm(enumerate(loader)):
+ edge_attr = data.edge_attr
+ seq_dist = torch.cat([seq_dist, edge_attr[:, 0]])
+ print('time elapsed: ', time() - before)
+ print(np.quantile(np.array(seq_dist), q))
+ bins = 200
+ sns.distplot(seq_dist.numpy(), hist=True, kde=False, bins=bins)
+ plt.hist(seq_dist.numpy(), bins=bins, range=(0, 200))
+ plt.title('Histogram of Sequence Distance')
+ plt.xlabel('Sequence Distance')
+ plt.ylabel('Times')
+ plt.savefig('protein/dataset_alpha_Fold/Val_seq_dist' + str(bins) + '.png')
+ return seq_dist
+
+
+def mutat_test4(loader, device, model, dataset):
+ # printed cor arranged along the protein names in dataset
+ model.eval()
+ m = torch.nn.Softmax()
+ correct = 0
+ protein_names = dataset.protein_names
+ n = len(protein_names)
+
+ true_score = [torch.tensor([]).to(device) for _ in range(n)]
+ pred_score = [torch.tensor([]).to(device) for _ in range(n)]
+ with torch.no_grad():
+ # Iterate in batches over the training/test dataset.
+ for data in loader:
+
+ ### calculate in model
+ data = data.to(device)
+ protein_idx = data.protein_idx
+ print(protein_idx)
+ #data = pre_transform(data)
+ x = torch.cat([data.pos, data.mu_r_norm, data.x],
+ dim=1) # data.mu_r_norm,
+ out = model(x, edge_index=data.edge_index,
+ edge_attr=data.edge_attr, batch=data.batch)
+ out = torch.log(m(out[:, :20]))
+ # out = out.cpu()
+
+ # obtain score info
+ score_info = data.score_info[0]
+ num_mutat = len(score_info)
+ true_score[protein_idx] = torch.zeros(num_mutat)
+ pred_score[protein_idx] = torch.zeros(num_mutat)
+ for mutat_idx in range(num_mutat):
+ mutat_info, true_score[protein_idx][mutat_idx] = score_info[mutat_idx]
+ for item in mutat_info:
+ # (log(prob_out) - log(prob_wlid))
+ # print(protein_idx,mutat_idx)
+ if int(item[1]) >= out.shape[0]:
+ continue
+ pred_score[protein_idx][mutat_idx] += (
+ out[int(item[1]), int(item[2])] - out[int(item[1]), int(item[0])]).cpu()
+ spearman_coeef = np.zeros(n)
+ for i in range(n):
+ if len(true_score[i].cpu().numpy()) == 0:
+ continue
+ spearvalue = spearmanr(true_score[i].cpu().numpy(
+ ), pred_score[i].cpu().numpy()).correlation
+ if spearvalue is nan:
+ pass
+ else:
+ spearman_coeef[i] = spearvalue
+ for i in range(len(spearman_coeef)):
+ print(protein_names[i])
+ print(spearman_coeef[i])
+ # Derive ratio of correct predictions.
+ return correct / len(loader.dataset), spearman_coeef.mean()
+
+
+def normalize_prob(coeff):
+ sum_ = coeff.sum(dim = 1)
+ score = coeff/(sum_.view(20,-1))
+ return score
+
+def substitute_label(y,temperature=1.0):
+ original_score = torch.tensor([
+ [ 8., 3., 2., 2., 4., 3., 3., 4., 2., 3., 3., 3., 3., 2.,
+ 3., 5., 4., 1., 2., 4.],
+ [ 3., 9., 4., 2., 1., 5., 4., 2., 4., 1., 2., 6., 3., 1.,
+ 2., 3., 3., 1., 2., 1.],
+ [ 2., 4., 10., 5., 1., 4., 4., 4., 5., 1., 1., 4., 2., 1.,
+ 2., 5., 4., 0., 2., 1.],
+ [ 2., 2., 5., 10., 1., 4., 6., 3., 3., 1., 0., 3., 1., 1.,
+ 3., 4., 3., 0., 1., 1.],
+ [ 4., 1., 1., 1., 13., 1., 0., 1., 1., 3., 3., 1., 3., 2.,
+ 1., 3., 3., 2., 2., 3.],
+ [ 3., 5., 4., 4., 1., 9., 6., 2., 4., 1., 2., 5., 4., 1.,
+ 3., 4., 3., 2., 3., 2.],
+ [ 3., 4., 4., 6., 0., 6., 9., 2., 4., 1., 1., 5., 2., 1.,
+ 3., 4., 3., 1., 2., 2.],
+ [ 4., 2., 4., 3., 1., 2., 2., 10., 2., 0., 0., 2., 1., 1.,
+ 2., 4., 2., 2., 1., 1.],
+ [ 2., 4., 5., 3., 1., 4., 4., 2., 12., 1., 1., 3., 2., 3.,
+ 2., 3., 2., 2., 6., 1.],
+ [ 3., 1., 1., 1., 3., 1., 1., 0., 1., 8., 6., 1., 5., 4.,
+ 1., 2., 3., 1., 3., 7.],
+ [ 3., 2., 1., 0., 3., 2., 1., 0., 1., 6., 8., 2., 6., 4.,
+ 1., 2., 3., 2., 3., 5.],
+ [ 3., 6., 4., 3., 1., 5., 5., 2., 3., 1., 2., 9., 3., 1.,
+ 3., 4., 3., 1., 2., 2.],
+ [ 3., 3., 2., 1., 3., 4., 2., 1., 2., 5., 6., 3., 9., 4.,
+ 2., 3., 3., 3., 3., 5.],
+ [ 2., 1., 1., 1., 2., 1., 1., 1., 3., 4., 4., 1., 4., 10.,
+ 0., 2., 2., 5., 7., 3.],
+ [ 3., 2., 2., 3., 1., 3., 3., 2., 2., 1., 1., 3., 2., 0.,
+ 11., 3., 3., 0., 1., 2.],
+ [ 5., 3., 5., 4., 3., 4., 4., 4., 3., 2., 2., 4., 3., 2.,
+ 3., 8., 5., 1., 2., 2.],
+ [ 4., 3., 4., 3., 3., 3., 3., 2., 2., 3., 3., 3., 3., 2.,
+ 3., 5., 9., 2., 2., 4.],
+ [ 1., 1., 0., 0., 2., 2., 1., 2., 2., 1., 2., 1., 3., 5.,
+ 0., 1., 2., 15., 6., 1.],
+ [ 2., 2., 2., 1., 2., 3., 2., 1., 6., 3., 3., 2., 3., 7.,
+ 1., 2., 2., 6., 11., 3.],
+ [ 4., 1., 1., 1., 3., 2., 2., 1., 1., 7., 5., 2., 5., 3.,
+ 2., 2., 4., 1., 3., 8.]], dtype=torch.float64,device=y.device)
+
+ score = normalize_prob(original_score)
+ tempering_score = score**temperature
+ normalized_score = normalize_prob(tempering_score)
+ out_prob = normalized_score[y]
+
+ return out_prob
+
+def substitution_label_smooth(y): # [1,4,5,6,7]
+ '''
+ mapping tensor convert dataset label ['A', 'R', 'N', 'D', 'C', 'Q', 'E', 'G', 'H', 'I', 'L', 'K', 'M', 'F', 'P', 'S', 'T', 'W', 'Y', 'V']
+ to substitution tabel label ['C', 'S', 'T','A','G','P' ,'D','E', 'Q' ,'N', 'H','R','K','M' ,'I','L','V','W','Y','F']
+ '''
+
+ score = torch.tensor([[8.4015e-01, 5.6609e-03, 2.0825e-03, 2.0825e-03, 1.5388e-02, 5.6609e-03,
+ 5.6609e-03, 1.5388e-02, 2.0825e-03, 5.6609e-03, 5.6609e-03, 5.6609e-03,
+ 5.6609e-03, 2.0825e-03, 5.6609e-03, 4.1829e-02, 1.5388e-02, 7.6612e-04,
+ 2.0825e-03, 1.5388e-02],
+ [2.2443e-03, 9.0541e-01, 6.1006e-03, 8.2563e-04, 3.0373e-04, 1.6583e-02,
+ 6.1006e-03, 8.2563e-04, 6.1006e-03, 3.0373e-04, 8.2563e-04, 4.5078e-02,
+ 2.2443e-03, 3.0373e-04, 8.2563e-04, 2.2443e-03, 2.2443e-03, 3.0373e-04,
+ 8.2563e-04, 3.0373e-04],
+ [3.2347e-04, 2.3901e-03, 9.6424e-01, 6.4970e-03, 1.1900e-04, 2.3901e-03,
+ 2.3901e-03, 2.3901e-03, 6.4970e-03, 1.1900e-04, 1.1900e-04, 2.3901e-03,
+ 3.2347e-04, 1.1900e-04, 3.2347e-04, 6.4970e-03, 2.3901e-03, 4.3776e-05,
+ 3.2347e-04, 1.1900e-04],
+ [3.2378e-04, 3.2378e-04, 6.5034e-03, 9.6518e-01, 1.1911e-04, 2.3925e-03,
+ 1.7678e-02, 8.8013e-04, 8.8013e-04, 1.1911e-04, 4.3819e-05, 8.8013e-04,
+ 1.1911e-04, 1.1911e-04, 8.8013e-04, 2.3925e-03, 8.8013e-04, 4.3819e-05,
+ 1.1911e-04, 1.1911e-04],
+ [1.2335e-04, 6.1412e-06, 6.1412e-06, 6.1412e-06, 9.9950e-01, 6.1412e-06,
+ 2.2592e-06, 6.1412e-06, 6.1412e-06, 4.5377e-05, 4.5377e-05, 6.1412e-06,
+ 4.5377e-05, 1.6693e-05, 6.1412e-06, 4.5377e-05, 4.5377e-05, 1.6693e-05,
+ 1.6693e-05, 4.5377e-05],
+ [2.1845e-03, 1.6142e-02, 5.9382e-03, 5.9382e-03, 2.9565e-04, 8.8131e-01,
+ 4.3878e-02, 8.0365e-04, 5.9382e-03, 2.9565e-04, 8.0365e-04, 1.6142e-02,
+ 5.9382e-03, 2.9565e-04, 2.1845e-03, 5.9382e-03, 2.1845e-03, 8.0365e-04,
+ 2.1845e-03, 8.0365e-04],
+ [2.1417e-03, 5.8217e-03, 5.8217e-03, 4.3017e-02, 1.0663e-04, 4.3017e-02,
+ 8.6401e-01, 7.8788e-04, 5.8217e-03, 2.8984e-04, 2.8984e-04, 1.5825e-02,
+ 7.8788e-04, 2.8984e-04, 2.1417e-03, 5.8217e-03, 2.1417e-03, 2.8984e-04,
+ 7.8788e-04, 7.8788e-04],
+ [2.4500e-03, 3.3157e-04, 2.4500e-03, 9.0130e-04, 1.2198e-04, 3.3157e-04,
+ 3.3157e-04, 9.8840e-01, 3.3157e-04, 4.4873e-05, 4.4873e-05, 3.3157e-04,
+ 1.2198e-04, 1.2198e-04, 3.3157e-04, 2.4500e-03, 3.3157e-04, 3.3157e-04,
+ 1.2198e-04, 1.2198e-04],
+ [4.5164e-05, 3.3372e-04, 9.0714e-04, 1.2277e-04, 1.6615e-05, 3.3372e-04,
+ 3.3372e-04, 4.5164e-05, 9.9480e-01, 1.6615e-05, 1.6615e-05, 1.2277e-04,
+ 4.5164e-05, 1.2277e-04, 4.5164e-05, 1.2277e-04, 4.5164e-05, 4.5164e-05,
+ 2.4659e-03, 1.6615e-05],
+ [4.1869e-03, 5.6664e-04, 5.6664e-04, 5.6664e-04, 4.1869e-03, 5.6664e-04,
+ 5.6664e-04, 2.0845e-04, 5.6664e-04, 6.2139e-01, 8.4096e-02, 5.6664e-04,
+ 3.0937e-02, 1.1381e-02, 5.6664e-04, 1.5403e-03, 4.1869e-03, 5.6664e-04,
+ 4.1869e-03, 2.2860e-01],
+ [4.8740e-03, 1.7930e-03, 6.5962e-04, 2.4266e-04, 4.8740e-03, 1.7930e-03,
+ 6.5962e-04, 2.4266e-04, 6.5962e-04, 9.7896e-02, 7.2336e-01, 1.7930e-03,
+ 9.7896e-02, 1.3249e-02, 6.5962e-04, 1.7930e-03, 4.8740e-03, 1.7930e-03,
+ 4.8740e-03, 3.6014e-02],
+ [2.2137e-03, 4.4462e-02, 6.0173e-03, 2.2137e-03, 2.9959e-04, 1.6357e-02,
+ 1.6357e-02, 8.1436e-04, 2.2137e-03, 2.9959e-04, 8.1436e-04, 8.9305e-01,
+ 2.2137e-03, 2.9959e-04, 2.2137e-03, 6.0173e-03, 2.2137e-03, 2.9959e-04,
+ 8.1436e-04, 8.1436e-04],
+ [2.2052e-03, 2.2052e-03, 8.1125e-04, 2.9844e-04, 2.2052e-03, 5.9944e-03,
+ 8.1125e-04, 2.9844e-04, 8.1125e-04, 1.6294e-02, 4.4293e-02, 2.2052e-03,
+ 8.8965e-01, 5.9944e-03, 8.1125e-04, 2.2052e-03, 2.2052e-03, 2.2052e-03,
+ 2.2052e-03, 1.6294e-02],
+ [3.1409e-04, 1.1555e-04, 1.1555e-04, 1.1555e-04, 3.1409e-04, 1.1555e-04,
+ 1.1555e-04, 1.1555e-04, 8.5379e-04, 2.3209e-03, 2.3209e-03, 1.1555e-04,
+ 2.3209e-03, 9.3630e-01, 4.2508e-05, 3.1409e-04, 3.1409e-04, 6.3087e-03,
+ 4.6616e-02, 8.5379e-04],
+ [3.3436e-04, 1.2300e-04, 1.2300e-04, 3.3436e-04, 4.5250e-05, 3.3436e-04,
+ 3.3436e-04, 1.2300e-04, 1.2300e-04, 4.5250e-05, 4.5250e-05, 3.3436e-04,
+ 1.2300e-04, 1.6647e-05, 9.9671e-01, 3.3436e-04, 3.3436e-04, 1.6647e-05,
+ 4.5250e-05, 1.2300e-04],
+ [3.8657e-02, 5.2316e-03, 3.8657e-02, 1.4221e-02, 5.2316e-03, 1.4221e-02,
+ 1.4221e-02, 1.4221e-02, 5.2316e-03, 1.9246e-03, 1.9246e-03, 1.4221e-02,
+ 5.2316e-03, 1.9246e-03, 5.2316e-03, 7.7644e-01, 3.8657e-02, 7.0802e-04,
+ 1.9246e-03, 1.9246e-03],
+ [6.3097e-03, 2.3212e-03, 6.3097e-03, 2.3212e-03, 2.3212e-03, 2.3212e-03,
+ 2.3212e-03, 8.5392e-04, 8.5392e-04, 2.3212e-03, 2.3212e-03, 2.3212e-03,
+ 2.3212e-03, 8.5392e-04, 2.3212e-03, 1.7151e-02, 9.3644e-01, 8.5392e-04,
+ 8.5392e-04, 6.3097e-03],
+ [8.3137e-07, 8.3137e-07, 3.0584e-07, 3.0584e-07, 2.2599e-06, 2.2599e-06,
+ 8.3137e-07, 2.2599e-06, 2.2599e-06, 8.3137e-07, 2.2599e-06, 8.3137e-07,
+ 6.1430e-06, 4.5391e-05, 3.0584e-07, 8.3137e-07, 2.2599e-06, 9.9980e-01,
+ 1.2339e-04, 8.3137e-07],
+ [1.1928e-04, 1.1928e-04, 1.1928e-04, 4.3882e-05, 1.1928e-04, 3.2425e-04,
+ 1.1928e-04, 4.3882e-05, 6.5127e-03, 3.2425e-04, 3.2425e-04, 1.1928e-04,
+ 3.2425e-04, 1.7703e-02, 4.3882e-05, 1.1928e-04, 1.1928e-04, 6.5127e-03,
+ 9.6656e-01, 3.2425e-04],
+ [1.1877e-02, 5.9130e-04, 5.9130e-04, 5.9130e-04, 4.3692e-03, 1.6073e-03,
+ 1.6073e-03, 5.9130e-04, 5.9130e-04, 2.3855e-01, 3.2284e-02, 1.6073e-03,
+ 3.2284e-02, 4.3692e-03, 1.6073e-03, 1.6073e-03, 1.1877e-02, 5.9130e-04,
+ 4.3692e-03, 6.4844e-01]], dtype=torch.float64, device=y.device)
+ out_prob = score[y]
+
+ return out_prob # []
\ No newline at end of file
diff --git a/proteingym/baselines/protssn/src/utils/utils.py b/proteingym/baselines/protssn/src/utils/utils.py
new file mode 100644
index 0000000..532a8ef
--- /dev/null
+++ b/proteingym/baselines/protssn/src/utils/utils.py
@@ -0,0 +1,199 @@
+# Copyright (c) Facebook, Inc. and its affiliates.
+#
+# This source code is licensed under the MIT license found in the
+# LICENSE file in the root directory of this source tree.
+
+import os
+import torch
+import biotite.structure
+import numpy as np
+import pandas as pd
+from biotite.structure.io import pdbx, pdb
+from biotite.structure.residues import get_residues
+from biotite.structure import filter_backbone
+from biotite.structure import get_chains
+from biotite.sequence import ProteinSequence
+from typing import *
+from Bio import SeqIO
+
+def param_num(model):
+ total = sum([param.numel() for param in model.parameters() if param.requires_grad])
+ num_M = total/1e6
+ if num_M >= 1000:
+ return "Number of parameter: %.2fB" % (num_M/1e3)
+ else:
+ return "Number of parameter: %.2fM" % (num_M)
+
+def create_mutant_file(root):
+ root = "data/evaluation/ProteinGym_substitutions/"
+ proteins = os.listdir(root)
+ for protein in proteins:
+ files = os.listdir(os.path.join(root, protein))
+ files_ = sorted([file for file in files if file.split(".")[1].isdigit()])
+ df = pd.read_table(os.path.join(root,protein, files_[0]))
+ if len(files_) > 1:
+ for i in range(1, len(files_)):
+ file_ = pd.read_table(os.path.join(root,protein, files_[i]))
+ df = pd.concat([df, file_])
+ df.to_csv(os.path.join(root, protein, f"{protein}.tsv"), sep="\t", index=False)
+ else:
+ df.to_csv(os.path.join(root, protein, f"{protein}.tsv"), sep="\t", index=False)
+
+
+def read_fasta(file_path, key):
+ return str(getattr(SeqIO.read(file_path, 'fasta'), key))
+
+def load_structure(fpath, chain=None):
+ """
+ Args:
+ fpath: filepath to either pdb or cif file
+ chain: the chain id or list of chain ids to load
+ Returns:
+ biotite.structure.AtomArray
+ """
+ if fpath.endswith('cif'):
+ with open(fpath) as fin:
+ pdbxf = pdbx.PDBxFile.read(fin)
+ structure = pdbx.get_structure(pdbxf, model=1)
+ # elif fpath.endswith('pdb'):
+ else:
+ with open(fpath) as fin:
+ pdbf = pdb.PDBFile.read(fin)
+ structure = pdb.get_structure(pdbf, model=1)
+ bbmask = filter_backbone(structure)
+ structure = structure[bbmask]
+ all_chains = get_chains(structure)
+ if len(all_chains) == 0:
+ raise ValueError('No chains found in the input file.')
+ if chain is None:
+ chain_ids = all_chains
+ elif isinstance(chain, list):
+ chain_ids = chain
+ else:
+ chain_ids = [chain]
+ for chain in chain_ids:
+ if chain not in all_chains:
+ raise ValueError(f'Chain {chain} not found in input file')
+ chain_filter = [a.chain_id in chain_ids for a in structure]
+ structure = structure[chain_filter]
+ return structure
+
+
+def extract_coords_from_structure(structure: biotite.structure.AtomArray):
+ """
+ Args:
+ structure: An instance of biotite AtomArray
+ Returns:
+ Tuple (coords, seq)
+ - coords is an L x 3 x 3 array for N, CA, C coordinates
+ - seq is the extracted sequence
+ """
+ coords = get_atom_coords_residuewise(["N", "CA", "C"], structure)
+ residue_identities = get_residues(structure)[1]
+ seq = ''.join([ProteinSequence.convert_letter_3to1(r) for r in residue_identities])
+ return coords, seq
+
+
+def load_coords(fpath, chain):
+ """
+ Args:
+ fpath: filepath to either pdb or cif file
+ chain: the chain id
+ Returns:
+ Tuple (coords, seq)
+ - coords is an L x 3 x 3 array for N, CA, C coordinates
+ - seq is the extracted sequence
+ """
+ structure = load_structure(fpath, chain)
+ return extract_coords_from_structure(structure)
+
+
+def get_atom_coords_residuewise(atoms: List[str], struct: biotite.structure.AtomArray):
+ """
+ Example for atoms argument: ["N", "CA", "C"]
+ """
+
+ def filterfn(s, axis=None):
+ filters = np.stack([s.atom_name == name for name in atoms], axis=1)
+ sum = filters.sum(0)
+ if not np.all(sum <= np.ones(filters.shape[1])):
+ raise RuntimeError("structure has multiple atoms with same name")
+ index = filters.argmax(0)
+ coords = s[index].coord
+ coords[sum == 0] = float("nan")
+ return coords
+
+ return biotite.structure.apply_residue_wise(struct, struct, filterfn)
+
+def rotate(v, R):
+ """
+ Rotates a vector by a rotation matrix.
+
+ Args:
+ v: 3D vector, tensor of shape (length x batch_size x channels x 3)
+ R: rotation matrix, tensor of shape (length x batch_size x 3 x 3)
+
+ Returns:
+ Rotated version of v by rotation matrix R.
+ """
+ R = R.unsqueeze(-3)
+ v = v.unsqueeze(-1)
+ return torch.sum(v * R, dim=-2)
+
+
+def get_rotation_frames(coords):
+ """
+ Returns a local rotation frame defined by N, CA, C positions.
+
+ Args:
+ coords: coordinates, tensor of shape (batch_size x length x 3 x 3)
+ where the third dimension is in order of N, CA, C
+
+ Returns:
+ Local relative rotation frames in shape (batch_size x length x 3 x 3)
+ """
+ v1 = coords[:, :, 2] - coords[:, :, 1]
+ v2 = coords[:, :, 0] - coords[:, :, 1]
+ e1 = normalize(v1, dim=-1)
+ u2 = v2 - e1 * torch.sum(e1 * v2, dim=-1, keepdim=True)
+ e2 = normalize(u2, dim=-1)
+ e3 = torch.cross(e1, e2, dim=-1)
+ R = torch.stack([e1, e2, e3], dim=-2)
+ return R
+
+
+def nan_to_num(ts, val=0.0):
+ """
+ Replaces nans in tensor with a fixed value.
+ """
+ val = torch.tensor(val, dtype=ts.dtype, device=ts.device)
+ return torch.where(~torch.isfinite(ts), val, ts)
+
+
+def rbf(values, v_min, v_max, n_bins=16):
+ """
+ Returns RBF encodings in a new dimension at the end.
+ """
+ rbf_centers = torch.linspace(v_min, v_max, n_bins, device=values.device)
+ rbf_centers = rbf_centers.view([1] * len(values.shape) + [-1])
+ rbf_std = (v_max - v_min) / n_bins
+ v_expand = torch.unsqueeze(values, -1)
+ z = (values.unsqueeze(-1) - rbf_centers) / rbf_std
+ return torch.exp(-z ** 2)
+
+
+def norm(tensor, dim, eps=1e-8, keepdim=False):
+ """
+ Returns L2 norm along a dimension.
+ """
+ return torch.sqrt(
+ torch.sum(torch.square(tensor), dim=dim, keepdim=keepdim) + eps)
+
+
+def normalize(tensor, dim=-1):
+ """
+ Normalizes a tensor along a dimension after removing nans.
+ """
+ return nan_to_num(
+ torch.div(tensor, norm(tensor, dim=dim, keepdim=True))
+ )
diff --git a/proteingym/baselines/rita/compute_fitness.py b/proteingym/baselines/rita/compute_fitness.py
new file mode 100644
index 0000000..56c5288
--- /dev/null
+++ b/proteingym/baselines/rita/compute_fitness.py
@@ -0,0 +1,96 @@
+import os
+import argparse
+import tqdm
+
+from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer
+from scipy.stats import spearmanr
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss
+
+def calc_fitness(model, prots, tokenizer, device='cuda:0', model_context_len=1023):
+ loss_list = []
+ loss_fn = CrossEntropyLoss()
+ with torch.no_grad():
+ for prot in tqdm.tqdm(prots):
+ loss_val = 0
+
+ sequence_chunks=[]
+ if len(prot) < model_context_len:
+ sequence_chunks = [prot]
+ else:
+ len_target_seq = len(prot)
+ num_windows = 1 + int( len_target_seq / model_context_len)
+ start=0
+ for window_index in range(1, num_windows+1):
+ sequence_chunks.append(prot[start:start+model_context_len])
+ start += model_context_len
+
+ for chunk in sequence_chunks:
+ for p in [chunk, chunk[::-1]]:
+ ids = torch.tensor([tokenizer.encode(p)]).to(device)
+ input_ids = ids[:, :-1]
+ targets = ids[:, 1:]
+
+ logits=model(input_ids).logits
+ loss = loss_fn(target=targets.view(-1), input=logits.view(-1,logits.size(-1)))
+ loss_val += -loss.item()
+
+ loss_list += [loss_val]
+ return np.array(loss_list)
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab="ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ parser.add_argument('--RITA_model_name_or_path', default="/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/RITA/small", type=str, help='Name of or path to RITA model')
+ parser.add_argument('--DMS_reference_file_path', default='/home/pn73/Tranception/proteingym/ProteinGym_reference_file_substitutions.csv', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_data_folder', default='/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/Tranception_open_source/DMS_files/ProteinGym_substitutions', type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_index', type=int, help='Path of DMS folder')
+ parser.add_argument('--output_scores_folder', default=None, type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+ args = parser.parse_args()
+ model = AutoModelForCausalLM.from_pretrained(args.RITA_model_name_or_path,trust_remote_code=True)
+ model.cuda()
+ tokenizer = AutoTokenizer.from_pretrained("/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/RITA/tokenizer")
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Computing scores for: {} with RITA: {}".format(DMS_id, args.RITA_model_name_or_path))
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ # Daniel R: shouldn't need this now that all DMS files have both "mutated_sequence" and "mutant" columns
+ if not args.indel_mode and "mutated_sequence" not in DMS_data.columns:
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: get_mutated_sequence(target_seq, x)) # if not args.indel_mode else DMS_data['mutant']
+
+ model_scores = calc_fitness(model=model, prots=np.array(DMS_data['mutated_sequence']), tokenizer=tokenizer)
+
+ DMS_data['RITA_score']=model_scores
+ scoring_filename = args.output_scores_folder+os.sep+DMS_id+'.csv'
+ DMS_data[['mutant','RITA_score','DMS_score']].to_csv(scoring_filename, index=False)
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/rita/rita_configuration.py b/proteingym/baselines/rita/rita_configuration.py
new file mode 100644
index 0000000..063043c
--- /dev/null
+++ b/proteingym/baselines/rita/rita_configuration.py
@@ -0,0 +1,31 @@
+from transformers.configuration_utils import PretrainedConfig
+from transformers.utils import logging
+
+logger = logging.get_logger(__name__)
+
+class RITAConfig(PretrainedConfig):
+ model_type = "rita"
+
+ def __init__(
+ self,
+ vocab_size=26,
+ d_model=768,
+ num_layers=12,
+ max_seq_len=1024,
+ num_heads=12,
+ dropout=0.,
+ ff_ratio=4,
+ eos_token_id=2,
+ initializer_range=0.02,
+ **kwargs,
+ ):
+ super().__init__(eos_token_id=eos_token_id, **kwargs)
+ self.vocab_size = vocab_size
+ self.d_model = d_model
+ self.num_heads = num_heads
+ self.d_feedforward = d_model*ff_ratio
+ self.num_layers = num_layers
+ self.max_seq_len=max_seq_len
+ self.dropout = dropout
+ self.eos_token_id=eos_token_id
+ self.initializer_range=0.02
diff --git a/proteingym/baselines/rita/rita_modeling.py b/proteingym/baselines/rita/rita_modeling.py
new file mode 100644
index 0000000..a209135
--- /dev/null
+++ b/proteingym/baselines/rita/rita_modeling.py
@@ -0,0 +1,488 @@
+import math
+import os
+from dataclasses import dataclass
+from typing import Optional, Tuple, Union
+
+import torch
+import torch.utils.checkpoint
+from torch import nn
+from torch.nn import CrossEntropyLoss, BCEWithLogitsLoss, MSELoss
+
+from transformers.modeling_outputs import (
+ BaseModelOutput,
+ CausalLMOutput,
+ SequenceClassifierOutput
+)
+
+from transformers.modeling_utils import PreTrainedModel
+from transformers.utils import logging
+
+from .rita_configuration import RITAConfig
+import torch.nn.functional as F
+logger = logging.get_logger(__name__)
+
+@torch.jit.script
+def RITA_gelu(hidden_states):
+ return hidden_states * 0.5 * (1.0 + torch.tanh(0.79788456 * hidden_states * (1 + 0.044715 * hidden_states * hidden_states)))
+
+class RITAGELU(nn.Module):
+ def __init__(self):
+ super().__init__()
+
+ def forward(self, hidden_states):
+ return RITA_gelu(hidden_states)
+
+def rotate_half(x):
+ x1, x2 = x[..., : x.shape[-1] // 2], x[..., x.shape[-1] // 2 :]
+ return torch.cat((-x2, x1), dim=x1.ndim - 1)
+
+class RotaryEmbedding(nn.Module):
+ def __init__(self, config):
+ super().__init__()
+ assert config.d_model % config.num_heads == 0
+
+ self.d_model = config.d_model
+ self.num_heads = config.num_heads
+ self.max_seq_len = config.max_seq_len
+
+ head_dim = self.d_model // self.num_heads
+ inv_freq = 1.0 / (10000 ** (torch.arange(0, head_dim, 2).float() / head_dim))
+ self.register_buffer('inv_freq', inv_freq)
+ self.seq_len_cached = None
+ self.cos_cached = None
+ self.sin_cached = None
+
+ def forward(self, x: torch.FloatTensor, seq_dim=1) -> torch.FloatTensor:
+ seq_len = x.shape[seq_dim]
+ if seq_len != self.seq_len_cached:
+ self.seq_len_cached = seq_len
+ t = torch.arange(x.shape[seq_dim], device=x.device).type_as(self.inv_freq)
+ freqs = torch.einsum("i,j->ij", t, self.inv_freq)
+ emb = torch.cat((freqs, freqs), dim=-1).to(x.device)
+ self.cos_cached = emb.cos()[None, None, :, :]
+ self.sin_cached = emb.sin()[None, None, :, :]
+ return self.cos_cached, self.sin_cached
+
+ def apply_rotary_pos_emb(self, q, k, cos, sin):
+ return (q * cos) + (rotate_half(q) * sin), (k * cos) + (rotate_half(k) * sin)
+
+
+class SelfAttention(nn.Module):
+ """Implementation of MultiHeadAttention following `Karpathy's MinGPT `_.
+ modified to use rotary embeddings.
+
+ Parameters
+ ----------
+ d_model: int,
+ total dimension of the model.
+ num_heads: int,
+ number of parallel attention heads.
+ num_layers: int,
+ number of layers in the model, used for the Megatron-like init.
+ rotaty_embedding: Optional[Block], default None,
+ a RotaryEmbedding Block to add positionnal information in Queries and Keys
+ dropout: float, default 0.1,
+ amount of dropout on the attention weights.
+ sigma: float, default 0.02,
+ standard deviation used for the init.
+ trainable: bool, default True,
+ if False, the Module parameters will be hidden from the optimizer.
+ """
+
+ def __init__(
+ self,
+ d_model: int,
+ num_heads: int,
+ num_layers: int,
+ rotary_embedding= None,
+ dropout: float = 0.1,
+ sigma=0.02,
+ use_cache: bool = False,
+ bias=True,
+ ):
+ super().__init__()
+ assert d_model % num_heads == 0
+ self.d_model = d_model
+ self.num_heads = num_heads
+ self.head_dim = self.d_model // self.num_heads
+ self.num_layers = num_layers
+ self.dropout = dropout
+ self.sigma = sigma
+ self.bias = bias
+
+ # key, query, value projections for all heads
+ self.key = nn.Linear(d_model, d_model, bias=bias)
+ self.query = nn.Linear(d_model, d_model, bias=bias)
+ self.value = nn.Linear(d_model, d_model, bias=bias)
+ # regularization
+ self.attn_drop = nn.Dropout(dropout)
+ self.resid_drop = nn.Dropout(dropout)
+ # output projection
+ self.proj = nn.Linear(d_model, d_model, bias=bias)
+
+ self.rotary_embedding = rotary_embedding
+ self.layer_id = None # will be set by the Transformer itself
+ self.use_cache = use_cache
+ self.qkv = None
+ self.bias = bias
+
+ def forward(
+ self,
+ x,
+ causal_mask: Optional[torch.BoolTensor] = None,
+ attention_mask: Optional[torch.BoolTensor] = None,
+ ) -> Tuple[torch.FloatTensor, torch.FloatTensor]:
+
+ N, L, D = x.size() # Batch_size, Context_size, d_model
+
+ # calculate query, key, values for all heads in batch and move head forward to be the batch dim
+ k = (
+ self.key(x).view(N, L, self.num_heads, D // self.num_heads).transpose(1, 2)
+ ) # (N, nh, L, hs)
+ q = (
+ self.query(x).view(N, L, self.num_heads, D // self.num_heads).transpose(1, 2)
+ ) # (N, nh, L, hs)
+ v = (
+ self.value(x).view(N, L, self.num_heads, D // self.num_heads).transpose(1, 2)
+ ) # (N, nh, L, hs)
+
+ if self.rotary_embedding is not None:
+ cos, sin = self.rotary_embedding(x)
+ q, k = self.rotary_embedding.apply_rotary_pos_emb(q, k, cos, sin)
+
+ # causal self-attention; Self-attend: (N, nh, L, hs) x (N, nh, hs, L) -> (N, nh, L, L)
+ att = (q @ k.transpose(-2, -1)) * (1.0 / math.sqrt(k.size(-1)))
+
+ if causal_mask is not None:
+ att[:,:,-L:, -L: ].masked_fill_(causal_mask.view(1, 1, L, L), float("-inf"))
+
+ att = (
+ att.transpose(0, 2)
+ .masked_fill(attention_mask.view(1, 1, N, L)==0, float("-inf"))
+ .transpose(0, 2)
+ if attention_mask is not None
+ else att
+ )
+
+ att = F.softmax(att, dim=-1)
+ att = self.attn_drop(att)
+ y = att @ v # (N, nh, L, L) x (N, nh, L, hs) -> (N, nh, L, hs)
+ y = (
+ y.transpose(1, 2).contiguous().view(N, L, D)
+ ) # re-assemble all head outputs side by side
+
+ # output projection
+ y = self.resid_drop(self.proj(y))
+ return y
+
+class DecoderLayer(nn.Module):
+ """Transformer block containing the self-attention module and the feedfoward module."""
+
+ def __init__(
+ self, config
+ ):
+ super().__init__()
+ self.self_attention = SelfAttention(config.d_model, config.num_heads, config.dropout, rotary_embedding=RotaryEmbedding(config))
+ self.attn_norm = nn.LayerNorm(config.d_model)
+ self.attn_dropout = nn.Dropout(config.dropout)
+
+ self.mlp = nn.Sequential(
+ nn.Linear(config.d_model, config.d_feedforward, bias=True),
+ RITAGELU(),
+ nn.Linear(config.d_feedforward, config.d_model, bias=True),
+ )
+ self.mlp_norm = nn.LayerNorm(config.d_model)
+ self.mlp_dropout = nn.Dropout(config.dropout)
+
+ def forward(
+ self,
+ x: torch.FloatTensor,
+ causal_mask: torch.BoolTensor,
+ attention_mask: Optional[torch.BoolTensor] = None,
+ ) -> torch.FloatTensor:
+ y = self.attn_norm(x)
+ y = self.self_attention(y, causal_mask=causal_mask, attention_mask=attention_mask)
+ x = x + self.attn_dropout(y)
+
+ y = self.mlp_norm(x)
+ y = self.mlp(y)
+ x = x + self.mlp_dropout(y)
+ return x
+
+class RITAModel(PreTrainedModel):
+ config_class = RITAConfig
+ base_model_prefix = "transformer"
+ is_parallelizable = False
+
+ def __init__(
+ self,
+ config
+ ):
+ super().__init__(config)
+ self.embedding = nn.Embedding(config.vocab_size, config.d_model)
+ self.layers = nn.ModuleList([DecoderLayer(config) for _ in range(config.num_layers)])
+ self.final_norm = nn.LayerNorm(config.d_model)
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None, # NOT USED
+ attention_mask=None,
+ causal_mask=None,
+ token_type_ids=None, # NOT USED
+ position_ids=None, # NOT USED
+ head_mask=None, # NOT USED
+ inputs_embeds=None,
+ encoder_hidden_states=None, # NOT USED
+ encoder_causal_mask=None, # NOT USED
+ labels=None,
+ use_cache=None, # NOT USED
+ output_attentions=None, # NOT USED
+ output_hidden_states=None, # NOT USED
+ return_dict=None # NOT USED
+ ) -> torch.FloatTensor:
+ if inputs_embeds == None:
+ x = self.embedding(input_ids) # N x L x D
+ else:
+ x = inputs_embeds
+ if causal_mask == None:
+ causal_mask = (torch.triu(torch.ones(input_ids.size(1), input_ids.size(1))) == 0).transpose(0, 1).contiguous().to(input_ids.device)
+ for layer in self.layers:
+ x = layer(x, causal_mask=causal_mask, attention_mask=attention_mask)
+ x = self.final_norm(x) # N x L x D
+
+ return BaseModelOutput(
+ hidden_states=x,
+ )
+
+ #Some common HF functions.
+ def get_input_embeddings(self):
+ return self.embedding
+
+ def set_input_embeddings(self, new_embeddings):
+ self.embedding = new_embeddings
+
+ def _init_weights(self, module):
+ """Initialize the weights."""
+ if isinstance(module, nn.Linear):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.bias is not None:
+ module.bias.data.zero_()
+ elif isinstance(module, nn.Embedding):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.padding_idx is not None:
+ module.weight.data[module.padding_idx].zero_()
+ elif isinstance(module, nn.LayerNorm):
+ module.bias.data.zero_()
+ module.weight.data.fill_(1.0)
+
+
+class RITAModelForCausalLM(PreTrainedModel):
+ config_class = RITAConfig
+ base_model_prefix = "transformer"
+ is_parallelizable = False
+
+ def __init__(
+ self,
+ config
+ ):
+ super().__init__(config)
+ self.transformer = RITAModel(config)
+ self.lm_head = nn.Linear(config.d_model, config.vocab_size, bias=False)
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None, # NOT USED
+ attention_mask=None,
+ causal_mask=None,
+ token_type_ids=None, # NOT USED
+ position_ids=None, # NOT USED
+ head_mask=None, # NOT USED
+ inputs_embeds=None,
+ encoder_hidden_states=None, # NOT USED
+ encoder_causal_mask=None, # NOT USED
+ labels=None,
+ use_cache=None, # NOT USED
+ output_attentions=None, # NOT USED
+ output_hidden_states=None, # NOT USED
+ return_dict=None # NOT USED
+ ) -> torch.FloatTensor:
+
+ transformer_outputs = self.transformer(
+ input_ids,
+ past_key_values=past_key_values,
+ causal_mask=causal_mask,
+ attention_mask = attention_mask,
+ token_type_ids=token_type_ids,
+ position_ids=position_ids,
+ head_mask=head_mask,
+ inputs_embeds=inputs_embeds,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ output_hidden_states=output_hidden_states,
+ return_dict=return_dict,
+ )
+
+ logits = self.lm_head(transformer_outputs.hidden_states)
+ loss = None
+ if labels is not None:
+ # Shift so that tokens < n predict n
+ shift_logits = logits[..., :-1, :].contiguous()
+ shift_labels = labels[..., 1:].contiguous()
+ # Flatten the tokens
+ loss_fct = CrossEntropyLoss()
+ loss = loss_fct(shift_logits.view(-1, shift_logits.size(-1)), shift_labels.view(-1))
+
+ return CausalLMOutput(
+ loss=loss,
+ logits=logits,
+ hidden_states=transformer_outputs.hidden_states,
+ )
+
+ #Some common HF functions.
+ def get_input_embeddings(self):
+ return self.transformer.embedding
+
+ def set_input_embeddings(self, new_embeddings):
+ self.transformer.embedding = new_embeddings
+
+ def get_output_embeddings(self):
+ return self.lm_head
+
+ def set_output_embeddings(self, lm_head):
+ self.lm_head = lm_head
+
+ def _init_weights(self, module):
+ """Initialize the weights."""
+ if isinstance(module, nn.Linear):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.bias is not None:
+ module.bias.data.zero_()
+ elif isinstance(module, nn.Embedding):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.padding_idx is not None:
+ module.weight.data[module.padding_idx].zero_()
+ elif isinstance(module, nn.LayerNorm):
+ module.bias.data.zero_()
+ module.weight.data.fill_(1.0)
+
+
+class RITAModelForSequenceClassification(PreTrainedModel):
+ config_class = RITAConfig
+ base_model_prefix = "transformer"
+ is_parallelizable = False
+
+ def __init__(self, config):
+ super().__init__(config)
+ self.num_labels = config.num_labels
+ self.transformer = RITAModel(config)
+ self.score = nn.Linear(config.d_model, self.num_labels, bias=False)
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ causal_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ labels=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ ):
+ r"""
+ labels (`torch.LongTensor` of shape `(batch_size,)`, *optional*):
+ Labels for computing the sequence classification/regression loss. Indices should be in `[0, ...,
+ config.num_labels - 1]`. If `config.num_labels == 1` a regression loss is computed (Mean-Square loss), If
+ `config.num_labels > 1` a classification loss is computed (Cross-Entropy).
+ """
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ transformer_outputs = self.transformer(
+ input_ids,
+ past_key_values=past_key_values,
+ attention_mask=attention_mask,
+ causal_mask=causal_mask,
+ token_type_ids=token_type_ids,
+ position_ids=position_ids,
+ head_mask=head_mask,
+ inputs_embeds=inputs_embeds,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ output_hidden_states=output_hidden_states,
+ return_dict=return_dict,
+ )
+ hidden_states = transformer_outputs[0]
+ logits = self.score(hidden_states)
+
+ if input_ids is not None:
+ batch_size, sequence_length = input_ids.shape[:2]
+ else:
+ batch_size, sequence_length = inputs_embeds.shape[:2]
+
+ assert (
+ self.config.pad_token_id is not None or batch_size == 1
+ ), "Cannot handle batch sizes > 1 if no padding token is defined."
+ if self.config.pad_token_id is None:
+ sequence_lengths = -1
+ else:
+ if input_ids is not None:
+ sequence_lengths = torch.ne(input_ids, self.config.pad_token_id).sum(-1) - 1
+ else:
+ sequence_lengths = -1
+ logger.warning(
+ f"{self.__class__.__name__} will not detect padding tokens in `inputs_embeds`. Results may be "
+ f"unexpected if using padding tokens in conjunction with `inputs_embeds.`"
+ )
+
+ pooled_logits = logits[torch.arange(batch_size, device=self.device), sequence_lengths]
+
+ loss = None
+ if labels is not None:
+ if self.config.problem_type is None:
+ if self.num_labels == 1:
+ self.config.problem_type = "regression"
+ elif self.num_labels > 1 and (labels.dtype == torch.long or labels.dtype == torch.int):
+ self.config.problem_type = "single_label_classification"
+ else:
+ self.config.problem_type = "multi_label_classification"
+
+ if self.config.problem_type == "regression":
+ loss_fct = MSELoss()
+ if self.num_labels == 1:
+ loss = loss_fct(pooled_logits.squeeze(), labels.squeeze())
+ else:
+ loss = loss_fct(pooled_logits, labels)
+ elif self.config.problem_type == "single_label_classification":
+ loss_fct = CrossEntropyLoss()
+ loss = loss_fct(pooled_logits.view(-1, self.num_labels), labels.view(-1))
+ elif self.config.problem_type == "multi_label_classification":
+ loss_fct = BCEWithLogitsLoss()
+ loss = loss_fct(pooled_logits, labels)
+ if not return_dict:
+ output = (pooled_logits,) + transformer_outputs[1:]
+ return ((loss,) + output) if loss is not None else output
+
+ return SequenceClassifierOutput(
+ loss=loss,
+ logits=pooled_logits,
+ )
+
+ def _init_weights(self, module):
+ """Initialize the weights."""
+ if isinstance(module, nn.Linear):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.bias is not None:
+ module.bias.data.zero_()
+ elif isinstance(module, nn.Embedding):
+ module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
+ if module.padding_idx is not None:
+ module.weight.data[module.padding_idx].zero_()
+ elif isinstance(module, nn.LayerNorm):
+ module.bias.data.zero_()
+ module.weight.data.fill_(1.0)
diff --git a/proteingym/baselines/saprot/compute_fitness.py b/proteingym/baselines/saprot/compute_fitness.py
new file mode 100644
index 0000000..0cde6a8
--- /dev/null
+++ b/proteingym/baselines/saprot/compute_fitness.py
@@ -0,0 +1,166 @@
+import os
+import argparse
+
+from transformers import AutoModel, AutoModelForMaskedLM, AutoTokenizer
+from scipy.stats import spearmanr
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss
+from foldseek_util import get_struc_seq
+from tqdm import tqdm
+
+foldseek_struc_vocab = "pynwrqhgdlvtmfsaeikc#"
+
+
+def predict_mut(model, tokenizer, seq: str, mut_info: str) -> float:
+ """
+ Predict the mutational effect of a given mutation
+ Args:
+ seq: The wild type sequence
+
+ mut_info: The mutation information in the format of "A123B", where A is the original amino acid, 123 is the
+ position and B is the mutated amino acid. If multiple mutations are provided, they should be
+ separated by colon, e.g. "A123B:C124D".
+
+ Returns:
+ The predicted mutational effect
+ """
+ tokens = tokenizer.tokenize(seq)
+ for single in mut_info.split(":"):
+ pos = int(single[1:-1])
+ tokens[pos - 1] = "#" + tokens[pos - 1][-1]
+
+ mask_seq = " ".join(tokens)
+ inputs = tokenizer(mask_seq, return_tensors="pt")
+ inputs = {k: v.to(model.device) for k, v in inputs.items()}
+
+ with torch.no_grad():
+ outputs = model(**inputs)
+ logits = outputs.logits
+ probs = logits.softmax(dim=-1)
+
+ score = 0
+ for single in mut_info.split(":"):
+ ori_aa, pos, mut_aa = single[0], int(single[1:-1]), single[-1]
+ ori_st = tokenizer.get_vocab()[ori_aa + foldseek_struc_vocab[0]]
+ mut_st = tokenizer.get_vocab()[mut_aa + foldseek_struc_vocab[0]]
+
+ ori_prob = probs[0, pos, ori_st: ori_st + len(foldseek_struc_vocab)].sum()
+ mut_prob = probs[0, pos, mut_st: mut_st + len(foldseek_struc_vocab)].sum()
+
+ score += torch.log(mut_prob / ori_prob)
+
+ return score
+
+
+def calc_fitness(foldseek_bin, model, DMS_data, tokenizer, mutation_col='mutant', target_seq=None, pdb_file=None, offset_idx=1):
+ # Get 3Di sequence
+ struc_seq = get_struc_seq(foldseek_bin, pdb_file, ["A"], plddt_mask=True, plddt_threshold=70)["A"][1].lower()
+
+ seq = "".join([a + b for a, b in zip(target_seq, struc_seq)])
+ log_proba_list = []
+ for mut_info in tqdm(DMS_data[mutation_col]):
+ mutations = []
+ for row in mut_info.split(":"):
+ ori, pos, mut = row[0], row[1: -1], row[-1]
+ pos = int(pos) - offset_idx + 1
+ mutations.append(f"{ori}{pos}{mut}")
+
+ all_mut = ":".join(mutations)
+ score = predict_mut(model, tokenizer, seq, all_mut).item()
+ log_proba_list.append(score)
+
+ return np.array(log_proba_list)
+
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab="ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: " + str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA == focus_seq[relative_position]), "Invalid from_AA or mutant position: " + str(
+ mutation) + " from_AA: " + str(from_AA) + " relative pos: " + str(relative_position) + " focus_seq: " + str(
+ focus_seq)
+ assert (to_AA in AA_vocab), "Mutant to_AA is invalid: " + str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with SaProt.
+ """
+ parser = argparse.ArgumentParser(description='SaProt scoring')
+ parser.add_argument('--foldseek_bin',
+ default="",
+ type=str, help='Path to foldseek binary file')
+ parser.add_argument('--SaProt_model_name_or_path',
+ default="",
+ type=str, help='Name of or path to SaProt model')
+ parser.add_argument('--DMS_reference_file_path',
+ default='/sujin/Datasets/ProteinGym/v1.0/DMS_substitutions.csv',
+ type=str, help='Path of DMS folder')
+ parser.add_argument('--DMS_data_folder',
+ default='/sujin/Datasets/ProteinGym/v1.0/DMS_ProteinGym_substitutions',
+ type=str, help='Path of DMS folder')
+ parser.add_argument('--structure_data_folder', default='', type=str, help='Path of structure folder')
+ parser.add_argument('--DMS_index', type=int, help='Path of DMS folder')
+ parser.add_argument('--output_scores_folder', default=None, type=str,
+ help='Name of folder to write model scores to')
+ parser.add_argument('--indel_mode', action='store_true',
+ help='Whether to score sequences with insertions and deletions')
+ args = parser.parse_args()
+ model = AutoModelForMaskedLM.from_pretrained(args.SaProt_model_name_or_path, trust_remote_code=True)
+ model.cuda()
+ tokenizer = AutoTokenizer.from_pretrained(args.SaProt_model_name_or_path)
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id = list_DMS[args.DMS_index]
+
+ scoring_filename = args.output_scores_folder + os.sep + DMS_id + '.csv'
+ if os.path.exists(scoring_filename):
+ print("Scores already computed for: {}".format(DMS_id))
+
+ print("Computing scores for: {} with SaProt: {}".format(DMS_id, args.SaProt_model_name_or_path))
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"] == DMS_id].values[0]
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"] == DMS_id].values[0].upper()
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(
+ lambda x: get_mutated_sequence(target_seq, x)) if not args.indel_mode else DMS_data['mutant']
+
+ pdb_filenames = mapping_protein_seq_DMS["pdb_file"][mapping_protein_seq_DMS["DMS_id"] == DMS_id].values[
+ 0].split('|') # if sequence is large (eg., BRCA2_HUMAN) the structure is split in several chunks
+ pdb_ranges = mapping_protein_seq_DMS["pdb_range"][mapping_protein_seq_DMS["DMS_id"] == DMS_id].values[0].split(
+ '|')
+ model_scores = []
+ for pdb_index, pdb_filename in enumerate(pdb_filenames):
+ pdb_file = args.structure_data_folder + os.sep + pdb_filename
+ pdb_range = [int(x) for x in pdb_ranges[pdb_index].split("-")]
+ target_seq_split = target_seq[pdb_range[0] - 1:pdb_range[1]] # pdb_range is 1-indexed
+ DMS_data["mutated_position"] = DMS_data['mutant'].apply(lambda x: int(x.split(':')[0][1:-1])) #if multiple mutant, will extract position of first mutant
+ filtered_DMS_data = DMS_data[
+ (DMS_data["mutated_position"] >= pdb_range[0]) & (DMS_data["mutated_position"] <= pdb_range[1])]
+ model_scores.append(calc_fitness(foldseek_bin=args.foldseek_bin, model=model, DMS_data=filtered_DMS_data,
+ tokenizer=tokenizer, target_seq=target_seq_split,
+ pdb_file=pdb_file, offset_idx=pdb_range[0]))
+
+ model_scores = np.concatenate(model_scores)
+
+ DMS_data['SaProt_score'] = model_scores
+ scoring_filename = args.output_scores_folder + os.sep + DMS_id + '.csv'
+ DMS_data[['mutant', 'SaProt_score', 'DMS_score']].to_csv(scoring_filename, index=False)
+
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/saprot/foldseek_util.py b/proteingym/baselines/saprot/foldseek_util.py
new file mode 100644
index 0000000..cfbaa6a
--- /dev/null
+++ b/proteingym/baselines/saprot/foldseek_util.py
@@ -0,0 +1,100 @@
+import os
+import numpy as np
+import re
+import sys
+
+sys.path.append(".")
+
+
+# Get structural seqs from pdb file
+def get_struc_seq(foldseek,
+ path,
+ chains: list = None,
+ process_id: int = 0,
+ plddt_mask: bool = False,
+ plddt_threshold: float = 70.) -> dict:
+ """
+
+ Args:
+ foldseek: Binary executable file of foldseek
+ path: Path to pdb file
+ chains: Chains to be extracted from pdb file. If None, all chains will be extracted.
+ process_id: Process ID for temporary files. This is used for parallel processing.
+ plddt_mask: If True, mask regions with plddt < plddt_threshold. plddt scores are from the pdb file.
+ plddt_threshold: Threshold for plddt. If plddt is lower than this value, the structure will be masked.
+
+ Returns:
+ seq_dict: A dict of structural seqs. The keys are chain IDs. The values are tuples of
+ (seq, struc_seq, combined_seq).
+ """
+ assert os.path.exists(foldseek), f"Foldseek not found: {foldseek}"
+ assert os.path.exists(path), f"Pdb file not found: {path}"
+
+ tmp_save_path = f"get_struc_seq_{process_id}.tsv"
+ cmd = f"{foldseek} structureto3didescriptor -v 0 --threads 1 --chain-name-mode 1 {path} {tmp_save_path}"
+ os.system(cmd)
+
+ seq_dict = {}
+ name = os.path.basename(path)
+ with open(tmp_save_path, "r") as r:
+ for i, line in enumerate(r):
+ desc, seq, struc_seq = line.split("\t")[:3]
+
+ # Mask low plddt
+ if plddt_mask:
+ plddts = extract_plddt(path)
+ assert len(plddts) == len(struc_seq), f"Length mismatch: {len(plddts)} != {len(struc_seq)}"
+
+ # Mask regions with plddt < threshold
+ indices = np.where(plddts < plddt_threshold)[0]
+ np_seq = np.array(list(struc_seq))
+ np_seq[indices] = "#"
+ struc_seq = "".join(np_seq)
+
+ name_chain = desc.split(" ")[0]
+ chain = name_chain.replace(name, "").split("_")[-1]
+
+ if chains is None or chain in chains:
+ if chain not in seq_dict:
+ combined_seq = "".join([a + b.lower() for a, b in zip(seq, struc_seq)])
+ seq_dict[chain] = (seq, struc_seq, combined_seq)
+
+ os.remove(tmp_save_path)
+ os.remove(tmp_save_path + ".dbtype")
+ return seq_dict
+
+
+def extract_plddt(pdb_path: str) -> np.ndarray:
+ """
+ Extract plddt scores from pdb file.
+ Args:
+ pdb_path: Path to pdb file.
+
+ Returns:
+ plddts: plddt scores.
+ """
+ with open(pdb_path, "r") as r:
+ plddt_dict = {}
+ for line in r:
+ line = re.sub(' +', ' ', line).strip()
+ splits = line.split(" ")
+
+ if splits[0] == "ATOM":
+ # If position < 1000
+ if len(splits[4]) == 1:
+ pos = int(splits[5])
+
+ # If position >= 1000, the blank will be removed, e.g. "A 999" -> "A1000"
+ # So the length of splits[4] is not 1
+ else:
+ pos = int(splits[4][1:])
+
+ plddt = float(splits[-2])
+
+ if pos not in plddt_dict:
+ plddt_dict[pos] = [plddt]
+ else:
+ plddt_dict[pos].append(plddt)
+
+ plddts = np.array([np.mean(v) for v in plddt_dict.values()])
+ return plddts
\ No newline at end of file
diff --git a/proteingym/baselines/saprot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv b/proteingym/baselines/saprot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
new file mode 100644
index 0000000..199dacc
--- /dev/null
+++ b/proteingym/baselines/saprot/substitutions/Spearman/Summary_performance_DMS_substitutions_Spearman.csv
@@ -0,0 +1,53 @@
+Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Model details,References
+1,TranceptEVE L,Hybrid model,0.456,0.0,0.487,0.376,0.457,0.46,0.5,0.451,0.467,0.492,0.471,0.498,0.473,0.453,0.446,0.28,0.35,0.32,0.382,TranceptEVE Large model (Tranception Large & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+2,TranceptEVE M,Hybrid model,0.455,0.004,0.479,0.386,0.452,0.454,0.502,0.44,0.468,0.488,0.473,0.498,0.466,0.441,0.441,0.281,0.304,0.309,0.375,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop."
+3,GEMME,Alignment-based model,0.455,0.007,0.482,0.383,0.438,0.452,0.519,0.455,0.47,0.497,0.468,0.51,0.473,0.469,0.446,0.274,0.321,0.324,0.414,GEMME model,"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619."
+4,TranceptEVE S,Hybrid model,0.452,0.004,0.475,0.396,0.443,0.449,0.497,0.449,0.46,0.484,0.468,0.49,0.467,0.433,0.435,0.275,0.304,0.304,0.372,TranceptEVE Small model (Tranception Small & retrieved EVE model),"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+5,SaProt_650M_AF2,Protein language model,0.449,0.008,0.45,0.373,0.471,0.362,0.587,0.379,0.446,0.542,0.467,0.518,0.523,0.295,0.45,0.309,0.27,0.271,0.321,,
+6,EVE (ensemble),Alignment-based model,0.439,0.006,0.464,0.386,0.408,0.447,0.491,0.425,0.453,0.481,0.453,0.487,0.468,0.428,0.427,0.273,0.308,0.298,0.355,EVE model (ensemble of 5 independently-trained models),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+7,VESPA,Protein language model,0.436,0.006,0.468,0.366,0.404,0.44,0.5,0.427,0.455,0.484,0.438,0.492,0.49,0.432,0.434,0.183,0.357,0.302,0.328,VESPA model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+8,Tranception L,Hybrid model,0.434,0.004,0.465,0.349,0.45,0.436,0.471,0.432,0.438,0.473,0.453,0.483,0.431,0.432,0.423,0.258,0.352,0.318,0.387,Tranception Large model (700M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+9,MSA Transformer (ensemble),Hybrid model,0.434,0.009,0.469,0.337,0.446,0.421,0.495,0.404,0.45,0.488,0.437,0.505,0.463,0.414,0.426,0.238,0.384,0.366,0.408,MSA Transformer (ensemble of 5 MSA samples),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+10,EVE (single),Alignment-based model,0.433,0.005,0.458,0.372,0.404,0.442,0.487,0.417,0.448,0.477,0.445,0.484,0.464,0.424,0.422,0.271,0.304,0.296,0.359,EVE model (single seed),"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature."
+11,Tranception M,Hybrid model,0.427,0.005,0.448,0.361,0.441,0.422,0.465,0.417,0.432,0.456,0.451,0.476,0.404,0.415,0.41,0.247,0.24,0.27,0.353,Tranception Medium model (300M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+12,ESM-IF1,Inverse folding model,0.422,0.011,0.368,0.389,0.407,0.324,0.624,0.3,0.431,0.544,0.415,0.497,0.507,0.374,0.438,0.345,0.29,0.289,0.358,ESM-IF1 model,"Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv."
+13,MSA Transformer (single),Hybrid model,0.421,0.009,0.457,0.33,0.435,0.409,0.476,0.393,0.435,0.473,0.427,0.491,0.451,0.39,0.411,0.239,0.39,0.368,0.413,MSA Transformer (single MSA sample),"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML."
+14,DeepSequence (ensemble),Alignment-based model,0.419,0.008,0.455,0.363,0.39,0.413,0.476,0.383,0.428,0.473,0.442,0.469,0.46,0.344,0.404,0.264,0.313,0.309,0.378,DeepSequence model (ensemble of 5 independently-trained models),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+15,Tranception S,Hybrid model,0.418,0.006,0.436,0.372,0.42,0.411,0.452,0.428,0.415,0.444,0.438,0.463,0.396,0.405,0.397,0.24,0.249,0.272,0.352,Tranception Small model (85M params) with retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+16,ESM2 (650M),Protein language model,0.414,0.012,0.425,0.337,0.415,0.369,0.523,0.335,0.406,0.515,0.456,0.471,0.476,0.238,0.421,0.247,0.22,0.162,0.218,ESM2 model (650M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+17,ESM-1v (ensemble),Protein language model,0.41,0.011,0.414,0.318,0.431,0.387,0.5,0.326,0.418,0.502,0.458,0.446,0.454,0.289,0.4,0.221,0.186,0.151,0.203,ESM-1v (ensemble of 5 independently-trained models),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+18,DeepSequence (single),Alignment-based model,0.408,0.009,0.447,0.349,0.372,0.397,0.473,0.382,0.415,0.465,0.436,0.465,0.448,0.323,0.391,0.252,0.277,0.293,0.37,DeepSequence model (single seed),"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822."
+19,ESM2 (3B),Protein language model,0.406,0.011,0.417,0.321,0.403,0.379,0.509,0.348,0.415,0.49,0.441,0.461,0.477,0.274,0.414,0.213,0.202,0.166,0.217,ESM2 model (3B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+20,MIF-ST,Inverse folding model,0.402,0.01,0.39,0.321,0.438,0.373,0.486,0.376,0.403,0.456,0.404,0.415,0.463,0.396,0.431,0.265,0.334,0.298,0.298,MIF-ST model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+21,ESM2 (15B),Protein language model,0.401,0.01,0.405,0.317,0.405,0.388,0.488,0.357,0.414,0.473,0.431,0.449,0.459,0.313,0.405,0.204,0.239,0.172,0.234,ESM2 model (15B params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+22,SaProt_35M_AF2,Protein language model,0.399,0.01,0.371,0.349,0.422,0.285,0.57,0.31,0.387,0.504,0.434,0.481,0.432,0.225,0.403,0.343,0.179,0.187,0.229,,
+23,EVmutation,Alignment-based model,0.395,0.006,0.44,0.317,0.378,0.411,0.43,0.403,0.423,0.41,0.409,0.444,0.422,0.388,0.375,0.274,0.324,0.301,0.394,EVmutation model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+24,ESM-1b,Protein language model,0.394,0.01,0.428,0.287,0.406,0.351,0.5,0.35,0.398,0.482,0.434,0.475,0.455,0.241,0.383,0.227,0.187,0.149,0.27,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv."
+25,VESPAl,Protein language model,0.394,0.007,0.429,0.347,0.326,0.404,0.461,0.382,0.412,0.449,0.392,0.461,0.451,0.392,0.385,0.144,0.324,0.276,0.32,VESPAl model,"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647."
+26,Progen2 XL,Protein language model,0.391,0.008,0.402,0.302,0.418,0.387,0.445,0.354,0.405,0.444,0.384,0.442,0.439,0.391,0.385,0.184,0.28,0.219,0.28,Progen2 xlarge model (6.4B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+27,ESM2 (150M),Protein language model,0.387,0.013,0.391,0.326,0.402,0.305,0.51,0.306,0.358,0.497,0.449,0.46,0.407,0.137,0.386,0.243,0.146,0.173,0.231,ESM2 model (150M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+28,MIF,Inverse folding model,0.383,0.011,0.327,0.336,0.43,0.295,0.524,0.349,0.374,0.446,0.397,0.39,0.417,0.352,0.409,0.269,0.257,0.24,0.24,MIF model,"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv."
+29,Progen2 L,Protein language model,0.38,0.008,0.406,0.293,0.427,0.379,0.396,0.348,0.381,0.42,0.41,0.414,0.369,0.333,0.371,0.144,0.232,0.199,0.258,Progen2 large model (2.7B params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+30,ESM-1v (single),Protein language model,0.38,0.013,0.39,0.266,0.406,0.362,0.476,0.286,0.393,0.481,0.432,0.422,0.43,0.256,0.373,0.192,0.187,0.142,0.197,ESM-1v (single seed),"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS."
+31,Progen2 M,Protein language model,0.379,0.009,0.393,0.295,0.433,0.381,0.396,0.318,0.382,0.425,0.411,0.406,0.356,0.342,0.372,0.13,0.158,0.135,0.177,Progen2 medium model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+32,Progen2 Base,Protein language model,0.378,0.009,0.396,0.294,0.437,0.379,0.383,0.342,0.368,0.423,0.421,0.408,0.331,0.328,0.369,0.13,0.145,0.157,0.208,Progen2 base model (760M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+33,Tranception L no retrieval,Protein language model,0.374,0.008,0.401,0.288,0.413,0.389,0.381,0.358,0.371,0.419,0.389,0.376,0.381,0.395,0.363,0.178,0.308,0.257,0.334,Tranception Large model (700M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+34,RITA XL,Protein language model,0.372,0.009,0.366,0.302,0.414,0.381,0.398,0.315,0.382,0.412,0.394,0.384,0.353,0.402,0.356,0.139,0.136,0.154,0.233,RITA xlarge model (1.2B params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+35,CARP (640M),Protein language model,0.369,0.012,0.395,0.273,0.397,0.364,0.414,0.314,0.375,0.428,0.416,0.386,0.39,0.273,0.389,0.213,0.187,0.164,0.162,CARP model (640M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+36,RITA L,Protein language model,0.365,0.009,0.359,0.29,0.42,0.374,0.383,0.316,0.369,0.403,0.394,0.386,0.319,0.391,0.347,0.137,0.135,0.147,0.21,RITA large model (680M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+37,Site-Independent,Alignment-based model,0.359,0.01,0.369,0.344,0.343,0.382,0.358,0.426,0.373,0.32,0.379,0.385,0.316,0.383,0.336,0.235,0.226,0.267,0.35,Site-Independent model,"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135."
+38,RITA M,Protein language model,0.35,0.01,0.352,0.273,0.405,0.371,0.348,0.304,0.349,0.39,0.378,0.349,0.31,0.385,0.336,0.114,0.134,0.168,0.222,RITA medium model (300M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+39,Tranception M no retrieval,Protein language model,0.348,0.009,0.349,0.284,0.406,0.362,0.342,0.293,0.349,0.379,0.379,0.335,0.314,0.349,0.331,0.142,0.107,0.15,0.21,Tranception Medium model (300M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+40,Unirep evotuned,Hybrid model,0.347,0.009,0.355,0.305,0.365,0.346,0.366,0.33,0.344,0.372,0.355,0.363,0.346,0.349,0.319,0.154,0.25,0.226,0.294,Unirep model w/ evotuning,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+41,Progen2 S,Protein language model,0.336,0.011,0.333,0.275,0.384,0.337,0.349,0.283,0.321,0.391,0.384,0.334,0.298,0.285,0.327,0.113,0.12,0.133,0.187,Progen2 small model (150M params)," Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. "
+42,CARP (76M),Protein language model,0.328,0.013,0.342,0.282,0.369,0.269,0.377,0.247,0.301,0.406,0.387,0.355,0.301,0.15,0.334,0.203,0.11,0.134,0.127,CARP model (76M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+43,ESM2 (35M),Protein language model,0.321,0.014,0.314,0.291,0.343,0.218,0.439,0.239,0.271,0.451,0.37,0.394,0.324,0.102,0.305,0.242,0.134,0.164,0.229,ESM2 model (35M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+44,RITA S,Protein language model,0.304,0.011,0.294,0.275,0.336,0.327,0.289,0.276,0.297,0.334,0.33,0.281,0.245,0.358,0.285,0.11,0.096,0.125,0.19,RITA small model (85M params),"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789."
+45,Tranception S no retrieval,Protein language model,0.303,0.012,0.288,0.286,0.349,0.319,0.27,0.258,0.295,0.32,0.317,0.263,0.272,0.315,0.277,0.114,0.1,0.132,0.189,Tranception Small model (85M params) without retrieval,"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML."
+46,CARP (38M),Protein language model,0.279,0.014,0.285,0.268,0.312,0.217,0.315,0.196,0.24,0.357,0.321,0.298,0.252,0.125,0.277,0.169,0.103,0.137,0.143,CARP model (38M params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
+47,ProteinMPNN,Inverse folding model,0.258,0.011,0.197,0.163,0.198,0.165,0.566,0.173,0.28,0.434,0.282,0.395,0.354,0.248,0.292,0.257,0.171,0.186,0.278,ProteinMPNN model,"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378."
+48,ESM2 (8M),Protein language model,0.226,0.015,0.201,0.26,0.266,0.141,0.262,0.194,0.179,0.264,0.239,0.236,0.213,0.078,0.202,0.136,0.099,0.132,0.195,ESM2 model (8M params),"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379."
+49,Wavenet,Alignment-based model,0.217,0.017,0.219,0.186,0.195,0.303,0.182,0.207,0.255,0.207,0.145,0.305,0.293,0.283,0.177,0.059,0.218,0.181,0.258,Wavenet model,"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12."
+50,Unirep,Protein language model,0.19,0.016,0.182,0.202,0.216,0.141,0.21,0.181,0.161,0.209,0.213,0.219,0.165,0.057,0.174,0.071,0.111,0.141,0.191,Unirep model,"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8."
+51,ProtGPT2,Protein language model,0.188,0.011,0.176,0.149,0.193,0.166,0.256,0.175,0.173,0.253,0.242,0.235,0.133,0.141,0.18,0.138,0.041,0.034,0.078,ProtGPT2 model,"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13."
+52,CARP (600K),Protein language model,0.106,0.017,0.112,0.084,0.171,0.059,0.105,0.095,0.087,0.101,0.121,0.082,0.087,0.042,0.107,0.026,0.043,0.083,0.096,CARP model (600K params),"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv."
diff --git a/proteingym/baselines/trancepteve/score_trancepteve.py b/proteingym/baselines/trancepteve/score_trancepteve.py
new file mode 100644
index 0000000..e9db2d6
--- /dev/null
+++ b/proteingym/baselines/trancepteve/score_trancepteve.py
@@ -0,0 +1,209 @@
+import os
+import argparse
+import json
+import pandas as pd
+
+import torch
+
+from transformers import PreTrainedTokenizerFast
+import trancepteve
+from trancepteve import config, model_pytorch
+from trancepteve.utils import dms_utils
+
+dir_path = os.path.dirname(os.path.abspath(__file__))
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with TranceptEVE.
+ """
+ parser = argparse.ArgumentParser(description='TranceptEVE scoring')
+ parser.add_argument('--checkpoint', type=str, help='Path of Tranception model checkpoint')
+ parser.add_argument('--model_framework', default='pytorch', type=str, help='Underlying framework [pytorch|JAX]')
+ parser.add_argument('--batch_size_inference', default=20, type=int, help='Batch size for inference')
+
+ #We may pass in all required information about the DMS via the provided reference files, or specify all relevant fields manually
+ parser.add_argument('--DMS_reference_file_path', default=None, type=str, help='Path to reference file with list of DMS to score')
+ parser.add_argument('--DMS_index', default=0, type=int, help='Index of DMS assay in reference file')
+ #Fields to be passed manually if reference file is not used
+ parser.add_argument('--target_seq', default=None, type=str, help='Full wild type sequence that is mutated in the DMS asssay')
+ parser.add_argument('--DMS_file_name', default=None, type=str, help='Name of DMS assay file')
+ parser.add_argument('--MSA_filename', default=None, type=str, help='Name of MSA (eg., a2m) file constructed on the wild type sequence')
+ parser.add_argument('--MSA_weight_file_name', default=None, type=str, help='Weight of sequences in the MSA (optional)')
+ parser.add_argument('--MSA_start', default=None, type=int, help='Sequence position that the MSA starts at (1-indexing)')
+ parser.add_argument('--MSA_end', default=None, type=int, help='Sequence position that the MSA ends at (1-indexing)')
+ parser.add_argument('--UniprotID', default=None, type=str, help='Uniprot ID of protein (EVE retrieval only)')
+ parser.add_argument('--MSA_threshold_sequence_frac_gaps', default=None, type=float, help='MSA processing: pct fragments threshold')
+ parser.add_argument('--MSA_threshold_focus_cols_frac_gaps', default=None, type=float, help='MSA processing: pct col filled threshold')
+
+ parser.add_argument('--DMS_data_folder', type=str, help='Path to folder that contains all DMS assay datasets')
+ parser.add_argument('--output_scores_folder', default='./', type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--deactivate_scoring_mirror', action='store_true', help='Whether to deactivate sequence scoring from both directions (Left->Right and Right->Left)')
+ parser.add_argument('--indel_mode', action='store_true', help='Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions')
+ parser.add_argument('--scoring_window', default="optimal", type=str, help='Sequence window selection mode (when sequence length longer than model context size)')
+
+ parser.add_argument('--num_workers', default=8, type=int, help='Number of workers for model scoring data loader')
+ parser.add_argument('--inference_time_retrieval_type', default=None, type=str, help='Type of inference time retrieval [None,Tranception,TranceptEVE]')
+ parser.add_argument('--retrieval_weights_manual', action='store_true', help='Whether to manually select the MSA/EVE aggregation weights')
+ parser.add_argument('--retrieval_inference_MSA_weight', default=0.5, type=float, help='Coefficient (alpha) used when aggregating autoregressive transformer and MSA retrieval')
+ parser.add_argument('--retrieval_inference_EVE_weight', default=0.5, type=float, help='Coefficient (beta) used when aggregating autoregressive transformer and EVE retrieval')
+
+ parser.add_argument('--MSA_folder', default='.', type=str, help='Path to MSA for neighborhood scoring')
+ parser.add_argument('--MSA_weights_folder', default=None, type=str, help='Path to MSA weights for neighborhood scoring')
+ parser.add_argument('--clustal_omega_location', default=None, type=str, help='Path to Clustal Omega (only needed with scoring indels with retrieval)')
+
+ parser.add_argument('--EVE_model_folder', type=str, help='Path to folder containing the EVE model(s)')
+ parser.add_argument('--EVE_seeds', nargs='*', help='Seeds of the EVE model(s) to be leveraged')
+ parser.add_argument('--EVE_num_samples_log_proba', default=10, type=int, help='Number of samples to compute the EVE log proba')
+ parser.add_argument('--EVE_model_parameters_location', default=None, type=str, help='Path to EVE model parameters')
+ parser.add_argument('--MSA_recalibrate_probas', action='store_true', help='Whether to normalize EVE & MSA log probas (matching temp. of Transformer)')
+ parser.add_argument('--EVE_recalibrate_probas', action='store_true', help='Whether to normalize EVE & MSA log probas (matching temp. of Transformer)')
+ parser.add_argument('--clinvar_scoring', action='store_true', help='Tweaks when scoring ClinVar input file')
+ args = parser.parse_args()
+ print(args)
+ model_name = args.checkpoint.split("/")[-1]
+
+ tokenizer = PreTrainedTokenizerFast(tokenizer_file=dir_path+os.sep+"trancepteve/utils/tokenizers/Basic_tokenizer",
+ unk_token="[UNK]",
+ sep_token="[SEP]",
+ pad_token="[PAD]",
+ cls_token="[CLS]",
+ mask_token="[MASK]"
+ )
+
+ if args.DMS_reference_file_path:
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Compute scores for DMS: "+str(DMS_id))
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ UniProt_ID = mapping_protein_seq_DMS["UniProt_ID"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0] if "UniProt_ID" in mapping_protein_seq_DMS else "No ID"
+ if args.inference_time_retrieval_type is not None:
+ MSA_data_file = args.MSA_folder + os.sep + mapping_protein_seq_DMS["MSA_filename"][args.DMS_index] if args.MSA_folder is not None else None
+ MSA_weight_file_name = args.MSA_weights_folder + os.sep + mapping_protein_seq_DMS["weight_file_name"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0] if args.MSA_weights_folder else None
+ MSA_start = int(mapping_protein_seq_DMS["MSA_start"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]) - 1 # MSA_start typically based on 1-indexing
+ MSA_end = int(mapping_protein_seq_DMS["MSA_end"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0])
+ MSA_threshold_sequence_frac_gaps = float(mapping_protein_seq_DMS["MSA_threshold_sequence_frac_gaps"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]) if "MSA_threshold_sequence_frac_gaps" in mapping_protein_seq_DMS else 0.5
+ if args.clinvar_scoring:
+ MSA_threshold_focus_cols_frac_gaps = float(mapping_protein_seq_DMS["MSA_threshold_focus_cols_frac_gaps"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]) if "MSA_threshold_focus_cols_frac_gaps" in mapping_protein_seq_DMS else 1.0
+ else:
+ MSA_threshold_focus_cols_frac_gaps = float(mapping_protein_seq_DMS["MSA_threshold_focus_cols_frac_gaps"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]) if "MSA_threshold_focus_cols_frac_gaps" in mapping_protein_seq_DMS else 1.0
+ print("Sequence (fragment) gap threshold: "+str(MSA_threshold_sequence_frac_gaps))
+ print("Focus column gap threshold: "+str(MSA_threshold_focus_cols_frac_gaps))
+ else:
+ target_seq=args.target_seq
+ DMS_file_name=args.DMS_file_name
+ DMS_id = DMS_file_name.split(".")[0]
+ UniprotID = args.UniprotID
+ if args.inference_time_retrieval_type is not None:
+ MSA_data_file = args.MSA_folder + os.sep + args.MSA_filename if args.MSA_folder is not None else None
+ MSA_weight_file_name = args.MSA_weights_folder + os.sep + args.MSA_weight_file_name if args.MSA_weights_folder is not None else None
+ MSA_start = args.MSA_start - 1 # MSA_start based on 1-indexing
+ MSA_end = args.MSA_end
+ MSA_threshold_sequence_frac_gaps=args.MSA_threshold_sequence_frac_gaps
+ MSA_threshold_focus_cols_frac_gaps=args.MSA_threshold_focus_cols_frac_gaps
+
+ config = json.load(open(args.checkpoint+os.sep+'config.json'))
+ config = trancepteve.config.TranceptEVEConfig(**config)
+ config.attention_mode="tranception"
+ config.position_embedding="grouped_alibi"
+ config.tokenizer = tokenizer
+ config.full_target_seq = target_seq
+ config.scoring_window = args.scoring_window
+
+ if args.inference_time_retrieval_type is not None:
+ config.inference_time_retrieval_type = args.inference_time_retrieval_type
+ config.retrieval_aggregation_mode = "aggregate_indel" if args.indel_mode else "aggregate_substitution"
+ config.MSA_filename = MSA_data_file
+ config.MSA_weight_file_name = MSA_weight_file_name
+ config.MSA_start = MSA_start
+ config.MSA_end = MSA_end
+ config.MSA_threshold_sequence_frac_gaps = MSA_threshold_sequence_frac_gaps
+ config.MSA_threshold_focus_cols_frac_gaps = MSA_threshold_focus_cols_frac_gaps
+ config.retrieval_weights_manual = args.retrieval_weights_manual
+ config.retrieval_inference_MSA_weight = args.retrieval_inference_MSA_weight
+ config.retrieval_inference_EVE_weight = args.retrieval_inference_EVE_weight
+
+ if "TranceptEVE" in args.inference_time_retrieval_type:
+ EVE_model_paths = []
+ EVE_seeds = args.EVE_seeds
+ num_seeds = len(EVE_seeds)
+ print("Number of distinct EVE models to be leveraged: {}".format(num_seeds))
+ for seed in EVE_seeds:
+ print(f"{args.EVE_model_folder}/{os.path.basename(MSA_data_file.split('.a2m')[0])}_seed_{seed}")
+ if os.path.exists(f"{args.EVE_model_folder}/{os.path.basename(MSA_data_file.split('.a2m')[0])}_seed_{seed}"):
+ EVE_model_name = f"{os.path.basename(MSA_data_file.split('.a2m')[0])}_seed_{seed}"
+ elif os.path.exists(f"{args.EVE_model_folder}/{UniProt_ID}_seed_{seed}"):
+ EVE_model_name = f"{UniProt_ID}_seed_{seed}"
+ else:
+ print(f"No EVE Model available for {MSA_data_file} with random seed {seed} in {args.EVE_model_folder}. Exiting")
+ exit(1)
+
+ EVE_model_paths.append(args.EVE_model_folder + os.sep + EVE_model_name)
+ config.EVE_model_paths = EVE_model_paths
+ config.EVE_num_samples_log_proba = args.EVE_num_samples_log_proba
+ config.EVE_model_parameters_location = args.EVE_model_parameters_location
+ config.MSA_recalibrate_probas = args.MSA_recalibrate_probas
+ config.EVE_recalibrate_probas = args.EVE_recalibrate_probas
+ else:
+ num_seeds=0
+ if args.indel_mode:
+ config.clustal_omega_location = args.clustal_omega_location
+ else:
+ config.inference_time_retrieval_type = None
+ config.retrieval_aggregation_mode = None
+
+ if args.model_framework=="pytorch":
+ model = trancepteve.model_pytorch.TrancepteveLMHeadModel.from_pretrained(pretrained_model_name_or_path=args.checkpoint,config=config)
+ if torch.cuda.is_available():
+ model.cuda()
+ model.eval()
+
+ if not os.path.isdir(args.output_scores_folder):
+ os.mkdir(args.output_scores_folder)
+ mutation_type = '_indels' if args.indel_mode else '_substitutions'
+ mirror_type = '_no_mirror' if args.deactivate_scoring_mirror else ''
+ normalization = '_norm-EVE' if args.EVE_recalibrate_probas else ''
+ normalization = normalization + '_norm-MSA' if args.MSA_recalibrate_probas else normalization
+ retrieval_weights = '_MSA-' + str(args.retrieval_inference_MSA_weight) +'_EVE-'+ str(args.retrieval_inference_EVE_weight) if args.retrieval_weights_manual else ''
+ retrieval_type = ('_retrieval_' + args.inference_time_retrieval_type + retrieval_weights + '_' + str(num_seeds) + '-EVE-models' + normalization) if args.inference_time_retrieval_type is not None else '_no_retrieval'
+ scoring_filename = args.output_scores_folder
+ if not os.path.isdir(scoring_filename):
+ os.mkdir(scoring_filename)
+ scoring_filename += os.sep + DMS_id + '.csv'
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ with torch.no_grad():
+ all_scores = model.score_mutants(
+ DMS_data=DMS_data,
+ target_seq=target_seq,
+ scoring_mirror=not args.deactivate_scoring_mirror,
+ batch_size_inference=args.batch_size_inference,
+ num_workers=args.num_workers,
+ indel_mode=args.indel_mode
+ )
+ if len(all_scores) > 0 and args.clinvar_scoring: all_scores = pd.merge(all_scores,DMS_data,how="left",on="mutant")
+ all_scores.to_csv(scoring_filename, index=False)
+
+ if args.clinvar_scoring:
+ experiment_name = args.output_scores_folder.split('/')[-1]
+ print(experiment_name)
+
+ with open("ClinVar_scoring_Tranception_20221130",'a+') as filew:
+ if os.stat("ClinVar_scoring_Tranception_20221130").st_size == 0:
+ header = "DMS_id,num_mutants_scored,num_mutants_scored_no_na,processed_MSA_depth,retrieval_inference_MSA_weight,retrieval_inference_EVE_weight\n"
+ filew.write(header)
+ all_scores_no_na = all_scores.dropna()
+ stat = ",".join([str(x) for x in [DMS_id,len(all_scores),len(all_scores_no_na),model.MSA_processed_depth,model.EVE_processed_depth,model.retrieval_inference_MSA_weight,model.retrieval_inference_EVE_weight]])
+ filew.write(stat+"\n")
+ else:
+ with open("TranceptEVE_aggregation_coefficients_log",'a+') as filew:
+ if os.stat("TranceptEVE_aggregation_coefficients_log").st_size == 0:
+ header = "DMS_id,num_mutants_scored,num_mutants_scored_no_na,processed_MSA_depth,retrieval_inference_MSA_weight,retrieval_inference_EVE_weight\n"
+ filew.write(header)
+ all_scores_no_na = all_scores.dropna()
+ stat = ",".join([str(x) for x in [DMS_id,len(all_scores),len(all_scores_no_na),model.MSA_processed_depth,model.EVE_processed_depth,model.retrieval_inference_MSA_weight,model.retrieval_inference_EVE_weight]])
+ filew.write(stat+"\n")
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_decoder.py b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_decoder.py
new file mode 100644
index 0000000..7bfa57a
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_decoder.py
@@ -0,0 +1,295 @@
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+
+class VAE_Bayesian_MLP_decoder(nn.Module):
+ """
+ Bayesian MLP decoder class for the VAE model.
+ """
+ def __init__(self, params):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of the sizes of the hidden layers (all DNNs)
+ - z_dim: (Int) Dimension of latent space
+ - first_hidden_nonlinearity: (Str) Type of non-linear activation applied on the first (set of) hidden layer(s)
+ - last_hidden_nonlinearity: (Str) Type of non-linear activation applied on the very last hidden layer (pre-sparsity)
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ - convolve_output: (Bool) Whether to perform 1d convolution on output (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on output
+ - include_temperature_scaler: (Bool) Whether we apply the global temperature scaler
+ - include_sparsity: (Bool) Whether we use the sparsity inducing scheme on the output from the last hidden layer
+ - num_tiles_sparsity: (Int) Number of tiles to use in the sparsity inducing scheme (the more the tiles, the stronger the sparsity)
+ - bayesian_decoder: (Bool) Whether the decoder is bayesian or not
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.bayesian_decoder = True
+ self.dropout_proba = params['dropout_proba']
+ self.convolve_output = params['convolve_output']
+ self.convolution_depth = params['convolution_output_depth']
+ self.include_temperature_scaler = params['include_temperature_scaler']
+ self.include_sparsity = params['include_sparsity']
+ self.num_tiles_sparsity = params['num_tiles_sparsity']
+
+ self.mu_bias_init = 0.1
+ self.logvar_init = -10.0
+ self.logit_scale_p = 0.001
+
+ self.hidden_layers_mean=nn.ModuleDict()
+ self.hidden_layers_log_var=nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers_mean[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ self.hidden_layers_log_var[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers_mean[str(layer_index)].bias, self.mu_bias_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].weight, self.logvar_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].bias, self.logvar_init)
+ else:
+ self.hidden_layers_mean[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ self.hidden_layers_log_var[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers_mean[str(layer_index)].bias, self.mu_bias_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].weight, self.logvar_init)
+ nn.init.constant_(self.hidden_layers_log_var[str(layer_index)].bias, self.logvar_init)
+
+ if params['first_hidden_nonlinearity'] == 'relu':
+ self.first_hidden_nonlinearity = nn.ReLU()
+ elif params['first_hidden_nonlinearity'] == 'tanh':
+ self.first_hidden_nonlinearity = nn.Tanh()
+ elif params['first_hidden_nonlinearity'] == 'sigmoid':
+ self.first_hidden_nonlinearity = nn.Sigmoid()
+ elif params['first_hidden_nonlinearity'] == 'elu':
+ self.first_hidden_nonlinearity = nn.ELU()
+ elif params['first_hidden_nonlinearity'] == 'linear':
+ self.first_hidden_nonlinearity = nn.Identity()
+
+ if params['last_hidden_nonlinearity'] == 'relu':
+ self.last_hidden_nonlinearity = nn.ReLU()
+ elif params['last_hidden_nonlinearity'] == 'tanh':
+ self.last_hidden_nonlinearity = nn.Tanh()
+ elif params['last_hidden_nonlinearity'] == 'sigmoid':
+ self.last_hidden_nonlinearity = nn.Sigmoid()
+ elif params['last_hidden_nonlinearity'] == 'elu':
+ self.last_hidden_nonlinearity = nn.ELU()
+ elif params['last_hidden_nonlinearity'] == 'linear':
+ self.last_hidden_nonlinearity = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ if self.convolve_output:
+ self.output_convolution_mean = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ self.output_convolution_log_var = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ nn.init.constant_(self.output_convolution_log_var.weight, self.logvar_init)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ if self.include_sparsity:
+ self.sparsity_weight_mean = nn.Parameter(torch.zeros(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+ self.sparsity_weight_log_var = nn.Parameter(torch.ones(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+ nn.init.constant_(self.sparsity_weight_log_var, self.logvar_init)
+
+ self.last_hidden_layer_weight_mean = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ self.last_hidden_layer_weight_log_var = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ nn.init.xavier_normal_(self.last_hidden_layer_weight_mean) #Glorot initialization
+ nn.init.constant_(self.last_hidden_layer_weight_log_var, self.logvar_init)
+
+ self.last_hidden_layer_bias_mean = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ self.last_hidden_layer_bias_log_var = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ nn.init.constant_(self.last_hidden_layer_bias_mean, self.mu_bias_init)
+ nn.init.constant_(self.last_hidden_layer_bias_log_var, self.logvar_init)
+
+ if self.include_temperature_scaler:
+ self.temperature_scaler_mean = nn.Parameter(torch.ones(1))
+ self.temperature_scaler_log_var = nn.Parameter(torch.ones(1) * self.logvar_init)
+
+ def sampler(self, mean, log_var):
+ """
+ Samples a latent vector via reparametrization trick
+ """
+ eps = torch.randn_like(mean).to(self.device)
+ z = torch.exp(0.5*log_var) * eps + mean
+ return z
+
+ def forward(self, z):
+ batch_size = z.shape[0]
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(z)
+ else:
+ x = z
+
+ for layer_index in range(len(self.hidden_layers_sizes)-1):
+ layer_i_weight = self.sampler(self.hidden_layers_mean[str(layer_index)].weight, self.hidden_layers_log_var[str(layer_index)].weight)
+ layer_i_bias = self.sampler(self.hidden_layers_mean[str(layer_index)].bias, self.hidden_layers_log_var[str(layer_index)].bias)
+ x = self.first_hidden_nonlinearity(F.linear(x, weight=layer_i_weight, bias=layer_i_bias))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ last_index = len(self.hidden_layers_sizes)-1
+ last_layer_weight = self.sampler(self.hidden_layers_mean[str(last_index)].weight, self.hidden_layers_log_var[str(last_index)].weight)
+ last_layer_bias = self.sampler(self.hidden_layers_mean[str(last_index)].bias, self.hidden_layers_log_var[str(last_index)].bias)
+ x = self.last_hidden_nonlinearity(F.linear(x, weight=last_layer_weight, bias=last_layer_bias))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ W_out = self.sampler(self.last_hidden_layer_weight_mean, self.last_hidden_layer_weight_log_var)
+ b_out = self.sampler(self.last_hidden_layer_bias_mean, self.last_hidden_layer_bias_log_var)
+
+ if self.convolve_output:
+ output_convolution_weight = self.sampler(self.output_convolution_mean.weight, self.output_convolution_log_var.weight)
+ W_out = torch.mm(W_out.view(self.seq_len * self.hidden_layers_sizes[-1], self.channel_size),
+ output_convolution_weight.view(self.channel_size,self.alphabet_size)) #product of size (H * seq_len, alphabet)
+
+ if self.include_sparsity:
+ sparsity_weights = self.sampler(self.sparsity_weight_mean,self.sparsity_weight_log_var)
+ sparsity_tiled = sparsity_weights.repeat(self.num_tiles_sparsity,1)
+ sparsity_tiled = nn.Sigmoid()(sparsity_tiled).unsqueeze(2)
+
+ W_out = W_out.view(self.hidden_layers_sizes[-1], self.seq_len, self.alphabet_size) * sparsity_tiled
+
+ W_out = W_out.view(self.seq_len * self.alphabet_size, self.hidden_layers_sizes[-1])
+
+ x = F.linear(x, weight=W_out, bias=b_out)
+
+ if self.include_temperature_scaler:
+ temperature_scaler = self.sampler(self.temperature_scaler_mean,self.temperature_scaler_log_var)
+ x = torch.log(1.0+torch.exp(temperature_scaler)) * x
+
+ x = x.view(batch_size, self.seq_len, self.alphabet_size)
+ x_recon_log = F.log_softmax(x, dim=-1) #of shape (batch_size, seq_len, alphabet)
+
+ return x_recon_log
+
+class VAE_Standard_MLP_decoder(nn.Module):
+ """
+ Standard MLP decoder class for the VAE model.
+ """
+ def __init__(self, seq_len, alphabet_size, hidden_layers_sizes, z_dim, first_hidden_nonlinearity, last_hidden_nonlinearity, dropout_proba,
+ convolve_output, convolution_depth, include_temperature_scaler, include_sparsity, num_tiles_sparsity):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of the sizes of the hidden layers (all DNNs)
+ - z_dim: (Int) Dimension of latent space
+ - first_hidden_nonlinearity: (Str) Type of non-linear activation applied on the first (set of) hidden layer(s)
+ - last_hidden_nonlinearity: (Str) Type of non-linear activation applied on the very last hidden layer (pre-sparsity)
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ - convolve_output: (Bool) Whether to perform 1d convolution on output (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on output
+ - include_temperature_scaler: (Bool) Whether we apply the global temperature scaler
+ - include_sparsity: (Bool) Whether we use the sparsity inducing scheme on the output from the last hidden layer
+ - num_tiles_sparsity: (Int) Number of tiles to use in the sparsity inducing scheme (the more the tiles, the stronger the sparsity)
+ - bayesian_decoder: (Bool) Whether the decoder is bayesian or not
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.bayesian_decoder = False
+ self.dropout_proba = params['dropout_proba']
+ self.convolve_output = params['convolve_output']
+ self.convolution_depth = params['convolution_depth']
+ self.include_temperature_scaler = params['include_temperature_scaler']
+ self.include_sparsity = params['include_sparsity']
+ self.num_tiles_sparsity = params['num_tiles_sparsity']
+
+ self.mu_bias_init = 0.1
+
+ self.hidden_layers=nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.z_dim, self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+ else:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+
+ if params['first_hidden_nonlinearity'] == 'relu':
+ self.first_hidden_nonlinearity = nn.ReLU()
+ elif params['first_hidden_nonlinearity'] == 'tanh':
+ self.first_hidden_nonlinearity = nn.Tanh()
+ elif params['first_hidden_nonlinearity'] == 'sigmoid':
+ self.first_hidden_nonlinearity = nn.Sigmoid()
+ elif params['first_hidden_nonlinearity'] == 'elu':
+ self.first_hidden_nonlinearity = nn.ELU()
+ elif params['first_hidden_nonlinearity'] == 'linear':
+ self.first_hidden_nonlinearity = nn.Identity()
+
+ if params['last_hidden_nonlinearity'] == 'relu':
+ self.last_hidden_nonlinearity = nn.ReLU()
+ elif params['last_hidden_nonlinearity'] == 'tanh':
+ self.last_hidden_nonlinearity = nn.Tanh()
+ elif params['last_hidden_nonlinearity'] == 'sigmoid':
+ self.last_hidden_nonlinearity = nn.Sigmoid()
+ elif params['last_hidden_nonlinearity'] == 'elu':
+ self.last_hidden_nonlinearity = nn.ELU()
+ elif params['last_hidden_nonlinearity'] == 'linear':
+ self.last_hidden_nonlinearity = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ if self.convolve_output:
+ self.output_convolution = nn.Conv1d(in_channels=self.convolution_depth,out_channels=self.alphabet_size,kernel_size=1,stride=1,bias=False)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ if self.include_sparsity:
+ self.sparsity_weight = nn.Parameter(torch.randn(int(self.hidden_layers_sizes[-1]/self.num_tiles_sparsity), self.seq_len))
+
+ self.W_out = nn.Parameter(torch.zeros(self.channel_size * self.seq_len,self.hidden_layers_sizes[-1]))
+ nn.init.xavier_normal_(self.W_out) #Initialize weights with Glorot initialization
+ self.b_out = nn.Parameter(torch.zeros(self.alphabet_size * self.seq_len))
+ nn.init.constant_(self.b_out, self.mu_bias_init)
+
+ if self.include_temperature_scaler:
+ self.temperature_scaler = nn.Parameter(torch.ones(1))
+
+ def forward(self, z):
+ batch_size = z.shape[0]
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(z)
+ else:
+ x=z
+
+ for layer_index in range(len(self.hidden_layers_sizes)-1):
+ x = self.first_hidden_nonlinearity(self.hidden_layers[str(layer_index)](x))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ x = self.last_hidden_nonlinearity(self.hidden_layers[str(len(self.hidden_layers_sizes)-1)](x)) #of size (batch_size,H)
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ W_out = self.W_out.data
+
+ if self.convolve_output:
+ W_out = torch.mm(W_out.view(self.seq_len * self.hidden_layers_sizes[-1], self.channel_size),
+ self.output_convolution.weight.view(self.channel_size,self.alphabet_size))
+
+ if self.include_sparsity:
+ sparsity_tiled = self.sparsity_weight.repeat(self.num_tiles_sparsity,1) #of size (H,seq_len)
+ sparsity_tiled = nn.Sigmoid()(sparsity_tiled).unsqueeze(2) #of size (H,seq_len,1)
+ W_out = W_out.view(self.hidden_layers_sizes[-1], self.seq_len, self.alphabet_size) * sparsity_tiled
+
+ W_out = W_out.view(self.seq_len * self.alphabet_size, self.hidden_layers_sizes[-1])
+
+ x = F.linear(x, weight=W_out, bias=self.b_out)
+
+ if self.include_temperature_scaler:
+ x = torch.log(1.0+torch.exp(self.temperature_scaler)) * x
+
+ x = x.view(batch_size, self.seq_len, self.alphabet_size)
+ x_recon_log = F.log_softmax(x, dim=-1) #of shape (batch_size, seq_len, alphabet)
+
+ return x_recon_log
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_encoder.py b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_encoder.py
new file mode 100644
index 0000000..3fd40fe
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_encoder.py
@@ -0,0 +1,88 @@
+import torch
+import torch.nn as nn
+
+class VAE_MLP_encoder(nn.Module):
+ """
+ MLP encoder class for the VAE model.
+ """
+ def __init__(self,params):
+ """
+ Required input parameters:
+ - seq_len: (Int) Sequence length of sequence alignment
+ - alphabet_size: (Int) Alphabet size of sequence alignment (will be driven by the data helper object)
+ - hidden_layers_sizes: (List) List of sizes of DNN linear layers
+ - z_dim: (Int) Size of latent space
+ - convolve_input: (Bool) Whether to perform 1d convolution on input (kernel size 1, stide 1)
+ - convolution_depth: (Int) Size of the 1D-convolution on input
+ - nonlinear_activation: (Str) Type of non-linear activation to apply on each hidden layer
+ - dropout_proba: (Float) Dropout probability applied on all hidden layers. If 0.0 then no dropout applied
+ """
+ super().__init__()
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.seq_len = params['seq_len']
+ self.alphabet_size = params['alphabet_size']
+ self.hidden_layers_sizes = params['hidden_layers_sizes']
+ self.z_dim = params['z_dim']
+ self.convolve_input = params['convolve_input']
+ self.convolution_depth = params['convolution_input_depth']
+ self.dropout_proba = params['dropout_proba']
+
+ self.mu_bias_init = 0.1
+ self.log_var_bias_init = -10.0
+
+ #Convolving input with kernels of size 1 to capture potential similarities across amino acids when encoding sequences
+ if self.convolve_input:
+ self.input_convolution = nn.Conv1d(in_channels=self.alphabet_size,out_channels=self.convolution_depth,kernel_size=1,stride=1,bias=False)
+ self.channel_size = self.convolution_depth
+ else:
+ self.channel_size = self.alphabet_size
+
+ self.hidden_layers=torch.nn.ModuleDict()
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ if layer_index==0:
+ self.hidden_layers[str(layer_index)] = nn.Linear((self.channel_size*self.seq_len),self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+ else:
+ self.hidden_layers[str(layer_index)] = nn.Linear(self.hidden_layers_sizes[layer_index-1],self.hidden_layers_sizes[layer_index])
+ nn.init.constant_(self.hidden_layers[str(layer_index)].bias, self.mu_bias_init)
+
+ self.fc_mean = nn.Linear(self.hidden_layers_sizes[-1],self.z_dim)
+ nn.init.constant_(self.fc_mean.bias, self.mu_bias_init)
+ self.fc_log_var = nn.Linear(self.hidden_layers_sizes[-1],self.z_dim)
+ nn.init.constant_(self.fc_log_var.bias, self.log_var_bias_init)
+
+ # set up non-linearity
+ if params['nonlinear_activation'] == 'relu':
+ self.nonlinear_activation = nn.ReLU()
+ elif params['nonlinear_activation'] == 'tanh':
+ self.nonlinear_activation = nn.Tanh()
+ elif params['nonlinear_activation'] == 'sigmoid':
+ self.nonlinear_activation = nn.Sigmoid()
+ elif params['nonlinear_activation'] == 'elu':
+ self.nonlinear_activation = nn.ELU()
+ elif params['nonlinear_activation'] == 'linear':
+ self.nonlinear_activation = nn.Identity()
+
+ if self.dropout_proba > 0.0:
+ self.dropout_layer = nn.Dropout(p=self.dropout_proba)
+
+ def forward(self, x):
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ if self.convolve_input:
+ x = x.permute(0,2,1)
+ x = self.input_convolution(x)
+ x = x.view(-1,self.seq_len*self.channel_size)
+ else:
+ x = x.view(-1,self.seq_len*self.channel_size)
+
+ for layer_index in range(len(self.hidden_layers_sizes)):
+ x = self.nonlinear_activation(self.hidden_layers[str(layer_index)](x))
+ if self.dropout_proba > 0.0:
+ x = self.dropout_layer(x)
+
+ z_mean = self.fc_mean(x)
+ z_log_var = self.fc_log_var(x)
+
+ return z_mean, z_log_var
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_model.py b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_model.py
new file mode 100644
index 0000000..1410592
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/VAE_model.py
@@ -0,0 +1,347 @@
+import os
+import numpy as np
+import pandas as pd
+import time
+import tqdm
+from scipy.special import erfinv
+from sklearn.model_selection import train_test_split
+
+import torch
+import torch.nn as nn
+import torch.nn.functional as F
+import torch.optim as optim
+import torch.backends.cudnn as cudnn
+
+from . import VAE_encoder, VAE_decoder
+
+class VAE_model(nn.Module):
+ """
+ Class for the VAE model with estimation of weights distribution parameters via Mean-Field VI.
+ """
+ def __init__(self,
+ model_name,
+ data,
+ encoder_parameters,
+ decoder_parameters,
+ random_seed
+ ):
+
+ super().__init__()
+
+ self.model_name = model_name
+ self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+ self.dtype = torch.float32
+ self.random_seed = random_seed
+ torch.manual_seed(random_seed)
+
+ self.seq_len = data.seq_len
+ self.alphabet_size = data.alphabet_size
+ self.Neff = data.Neff
+
+ self.encoder_parameters=encoder_parameters
+ self.decoder_parameters=decoder_parameters
+
+ encoder_parameters['seq_len'] = self.seq_len
+ encoder_parameters['alphabet_size'] = self.alphabet_size
+ decoder_parameters['seq_len'] = self.seq_len
+ decoder_parameters['alphabet_size'] = self.alphabet_size
+
+ self.encoder = VAE_encoder.VAE_MLP_encoder(params=encoder_parameters)
+ if decoder_parameters['bayesian_decoder']:
+ self.decoder = VAE_decoder.VAE_Bayesian_MLP_decoder(params=decoder_parameters)
+ else:
+ self.decoder = VAE_decoder.VAE_Standard_MLP_decoder(params=decoder_parameters)
+ self.logit_sparsity_p = decoder_parameters['logit_sparsity_p']
+
+ def sample_latent(self, mu, log_var):
+ """
+ Samples a latent vector via reparametrization trick
+ """
+ eps = torch.randn_like(mu).to(self.device)
+ z = torch.exp(0.5*log_var) * eps + mu
+ return z
+
+ def KLD_diag_gaussians(self, mu, logvar, p_mu, p_logvar):
+ """
+ KL divergence between diagonal gaussian with prior diagonal gaussian.
+ """
+ KLD = 0.5 * (p_logvar - logvar) + 0.5 * (torch.exp(logvar) + torch.pow(mu-p_mu,2)) / (torch.exp(p_logvar)+1e-20) - 0.5
+
+ return torch.sum(KLD)
+
+ def annealing_factor(self, annealing_warm_up, training_step):
+ """
+ Annealing schedule of KL to focus on reconstruction error in early stages of training
+ """
+ if training_step < annealing_warm_up:
+ return training_step/annealing_warm_up
+ else:
+ return 1
+
+ def KLD_global_parameters(self):
+ """
+ KL divergence between the variational distributions and the priors (for the decoder weights).
+ """
+ KLD_decoder_params = 0.0
+ zero_tensor = torch.tensor(0.0).to(self.device)
+
+ for layer_index in range(len(self.decoder.hidden_layers_sizes)):
+ for param_type in ['weight','bias']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['hidden_layers_mean.'+str(layer_index)+'.'+param_type].flatten(),
+ self.decoder.state_dict(keep_vars=True)['hidden_layers_log_var.'+str(layer_index)+'.'+param_type].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ for param_type in ['weight','bias']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['last_hidden_layer_'+param_type+'_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['last_hidden_layer_'+param_type+'_log_var'].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ if self.decoder.include_sparsity:
+ self.logit_scale_sigma = 4.0
+ self.logit_scale_mu = 2.0**0.5 * self.logit_scale_sigma * erfinv(2.0 * self.logit_sparsity_p - 1.0)
+
+ sparsity_mu = torch.tensor(self.logit_scale_mu).to(self.device)
+ sparsity_log_var = torch.log(torch.tensor(self.logit_scale_sigma**2)).to(self.device)
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['sparsity_weight_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['sparsity_weight_log_var'].flatten(),
+ sparsity_mu,
+ sparsity_log_var
+ )
+
+ if self.decoder.convolve_output:
+ for param_type in ['weight']:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['output_convolution_mean.'+param_type].flatten(),
+ self.decoder.state_dict(keep_vars=True)['output_convolution_log_var.'+param_type].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+
+ if self.decoder.include_temperature_scaler:
+ KLD_decoder_params += self.KLD_diag_gaussians(
+ self.decoder.state_dict(keep_vars=True)['temperature_scaler_mean'].flatten(),
+ self.decoder.state_dict(keep_vars=True)['temperature_scaler_log_var'].flatten(),
+ zero_tensor,
+ zero_tensor
+ )
+ return KLD_decoder_params
+
+ def loss_function(self, x_recon_log, x, mu, log_var, kl_latent_scale, kl_global_params_scale, annealing_warm_up, training_step, Neff):
+ """
+ Returns mean of negative ELBO, reconstruction loss and KL divergence across batch x.
+ """
+ BCE = F.binary_cross_entropy_with_logits(x_recon_log, x, reduction='sum') / x.shape[0]
+ KLD_latent = (-0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())) / x.shape[0]
+ if self.decoder.bayesian_decoder:
+ KLD_decoder_params_normalized = self.KLD_global_parameters() / Neff
+ else:
+ KLD_decoder_params_normalized = 0.0
+ warm_up_scale = self.annealing_factor(annealing_warm_up,training_step)
+ neg_ELBO = BCE + warm_up_scale * (kl_latent_scale * KLD_latent + kl_global_params_scale * KLD_decoder_params_normalized)
+ return neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized
+
+ def all_likelihood_components(self, x):
+ """
+ Returns tensors of ELBO, reconstruction loss and KL divergence for each point in batch x.
+ """
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ recon_x_log = recon_x_log.view(-1,self.alphabet_size*self.seq_len)
+ x = x.view(-1,self.alphabet_size*self.seq_len)
+
+ BCE_batch_tensor = torch.sum(F.binary_cross_entropy_with_logits(recon_x_log, x, reduction='none'),dim=1)
+ KLD_batch_tensor = (-0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp(),dim=1))
+
+ ELBO_batch_tensor = -(BCE_batch_tensor + KLD_batch_tensor)
+
+ return ELBO_batch_tensor, BCE_batch_tensor, KLD_batch_tensor
+
+ def train_model(self, data, training_parameters):
+ """
+ Training procedure for the VAE model.
+ If use_validation_set is True then:
+ - we split the alignment data in train/val sets.
+ - we train up to num_training_steps steps but store the version of the model with lowest loss on validation set across training
+ If not, then we train the model for num_training_steps and save the model at the end of training
+ """
+ if torch.cuda.is_available():
+ cudnn.benchmark = True
+ self.train()
+
+ if training_parameters['log_training_info']:
+ filename = training_parameters['training_logs_location']+os.sep+self.model_name+"_losses.csv"
+ with open(filename, "a") as logs:
+ logs.write("Number of sequences in alignment file:\t"+str(data.num_sequences)+"\n")
+ logs.write("Neff:\t"+str(self.Neff)+"\n")
+ logs.write("Alignment sequence length:\t"+str(data.seq_len)+"\n")
+
+ optimizer = optim.Adam(self.parameters(), lr=training_parameters['learning_rate'], weight_decay = training_parameters['l2_regularization'])
+
+ if training_parameters['use_lr_scheduler']:
+ scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=training_parameters['lr_scheduler_step_size'], gamma=training_parameters['lr_scheduler_gamma'])
+
+ if training_parameters['use_validation_set']:
+ x_train, x_val, weights_train, weights_val = train_test_split(data.one_hot_encoding, data.weights, test_size=training_parameters['validation_set_pct'], random_state=self.random_seed)
+ best_val_loss = float('inf')
+ best_model_step_index=0
+ else:
+ x_train = data.one_hot_encoding
+ weights_train = data.weights
+ best_val_loss = None
+ best_model_step_index = training_parameters['num_training_steps']
+
+ batch_order = np.arange(x_train.shape[0])
+ seq_sample_probs = weights_train / np.sum(weights_train)
+
+ self.Neff_training = np.sum(weights_train)
+ N_training = x_train.shape[0]
+
+ start = time.time()
+ train_loss = 0
+
+ for training_step in tqdm.tqdm(range(1,training_parameters['num_training_steps']+1), desc="Training model"):
+
+ batch_index = np.random.choice(batch_order, training_parameters['batch_size'], p=seq_sample_probs).tolist()
+ x = torch.tensor(x_train[batch_index], dtype=self.dtype).to(self.device)
+ optimizer.zero_grad()
+
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized = self.loss_function(recon_x_log, x, mu, log_var, training_parameters['kl_latent_scale'], training_parameters['kl_global_params_scale'], training_parameters['annealing_warm_up'], training_step, self.Neff_training)
+
+ neg_ELBO.backward()
+ optimizer.step()
+
+ if training_parameters['use_lr_scheduler']:
+ scheduler.step()
+
+ if training_step % training_parameters['log_training_freq'] == 0:
+ progress = "|Train : Update {0}. Negative ELBO : {1:.3f}, BCE: {2:.3f}, KLD_latent: {3:.3f}, KLD_decoder_params_norm: {4:.3f}, Time: {5:.2f} |".format(training_step, neg_ELBO, BCE, KLD_latent, KLD_decoder_params_normalized, time.time() - start)
+ print(progress)
+
+ if training_parameters['log_training_info']:
+ with open(filename, "a") as logs:
+ logs.write(progress+"\n")
+
+ if training_step % training_parameters['save_model_params_freq']==0:
+ self.save(model_checkpoint=training_parameters['model_checkpoint_location']+os.sep+self.model_name+"_step_"+str(training_step),
+ encoder_parameters=self.encoder_parameters,
+ decoder_parameters=self.decoder_parameters,
+ training_parameters=training_parameters)
+
+ if training_parameters['use_validation_set'] and training_step % training_parameters['validation_freq'] == 0:
+ x_val = torch.tensor(x_val, dtype=self.dtype).to(self.device)
+ val_neg_ELBO, val_BCE, val_KLD_latent, val_KLD_global_parameters = self.test_model(x_val, weights_val, training_parameters['batch_size'])
+
+ progress_val = "\t\t\t|Val : Update {0}. Negative ELBO : {1:.3f}, BCE: {2:.3f}, KLD_latent: {3:.3f}, KLD_decoder_params_norm: {4:.3f}, Time: {5:.2f} |".format(training_step, val_neg_ELBO, val_BCE, val_KLD_latent, val_KLD_global_parameters, time.time() - start)
+ print(progress_val)
+ if training_parameters['log_training_info']:
+ with open(filename, "a") as logs:
+ logs.write(progress_val+"\n")
+
+ if val_neg_ELBO < best_val_loss:
+ best_val_loss = val_neg_ELBO
+ best_model_step_index = training_step
+ self.save(model_checkpoint=training_parameters['model_checkpoint_location']+os.sep+self.model_name+"_best",
+ encoder_parameters=self.encoder_parameters,
+ decoder_parameters=self.decoder_parameters,
+ training_parameters=training_parameters)
+ self.train()
+
+ def test_model(self, x_val, weights_val, batch_size):
+ self.eval()
+
+ with torch.no_grad():
+ val_batch_order = np.arange(x_val.shape[0])
+ val_seq_sample_probs = weights_val / np.sum(weights_val)
+
+ val_batch_index = np.random.choice(val_batch_order, batch_size, p=val_seq_sample_probs).tolist()
+ x = torch.tensor(x_val[val_batch_index], dtype=self.dtype).to(self.device)
+ mu, log_var = self.encoder(x)
+ z = self.sample_latent(mu, log_var)
+ recon_x_log = self.decoder(z)
+
+ neg_ELBO, BCE, KLD_latent, KLD_global_parameters = self.loss_function(recon_x_log, x, mu, log_var, kl_latent_scale=1.0, kl_global_params_scale=1.0, annealing_warm_up=0, training_step=1, Neff = self.Neff_training) #set annealing factor to 1
+
+ return neg_ELBO.item(), BCE.item(), KLD_latent.item(), KLD_global_parameters.item()
+
+
+ def save(self, model_checkpoint, encoder_parameters, decoder_parameters, training_parameters, batch_size=256):
+ torch.save({
+ 'model_state_dict':self.state_dict(),
+ 'encoder_parameters':encoder_parameters,
+ 'decoder_parameters':decoder_parameters,
+ 'training_parameters':training_parameters,
+ }, model_checkpoint)
+
+ def compute_evol_indices(self, msa_data, list_mutations_location, num_samples, batch_size=256):
+ """
+ The column in the list_mutations dataframe that contains the mutant(s) for a given variant should be called "mutations"
+ """
+ #Multiple mutations are to be passed colon-separated
+ list_mutations=pd.read_csv(list_mutations_location, header=0)
+ if 'var' in list_mutations.columns:
+ list['mutations']=list['var']
+ elif 'mutant_id' in list_mutations.columns:
+ list['mutations']=list['mutant_id']
+ else:
+ list['mutations']=list['mutant']
+ #Remove (multiple) mutations that are invalid
+ list_valid_mutations = ['wt']
+ list_valid_mutated_sequences = {}
+ list_valid_mutated_sequences['wt'] = msa_data.focus_seq_trimmed # first sequence in the list is the wild_type
+ for mutation in list_mutations['mutations']:
+ individual_substitutions = mutation.split(':')
+ mutated_sequence = list(msa_data.focus_seq_trimmed)[:]
+ fully_valid_mutation = True
+ for mut in individual_substitutions:
+ wt_aa, pos, mut_aa = mut[0], int(mut[1:-1]), mut[-1]
+ if pos not in msa_data.uniprot_focus_col_to_wt_aa_dict or msa_data.uniprot_focus_col_to_wt_aa_dict[pos] != wt_aa or mut not in msa_data.mutant_to_letter_pos_idx_focus_list:
+ print ("Not a valid mutant: "+mutation)
+ fully_valid_mutation = False
+ break
+ else:
+ wt_aa,pos,idx_focus = msa_data.mutant_to_letter_pos_idx_focus_list[mut]
+ mutated_sequence[idx_focus] = mut_aa #perform the corresponding AA substitution
+
+ if fully_valid_mutation:
+ list_valid_mutations.append(mutation)
+ list_valid_mutated_sequences[mutation] = ''.join(mutated_sequence)
+
+ #One-hot encoding of mutated sequences
+ mutated_sequences_one_hot = np.zeros((len(list_valid_mutations),len(msa_data.focus_cols),len(msa_data.alphabet)))
+ for i,mutation in enumerate(list_valid_mutations):
+ sequence = list_valid_mutated_sequences[mutation]
+ for j,letter in enumerate(sequence):
+ if letter in msa_data.aa_dict:
+ k = msa_data.aa_dict[letter]
+ mutated_sequences_one_hot[i,j,k] = 1.0
+
+ mutated_sequences_one_hot = torch.tensor(mutated_sequences_one_hot)
+ dataloader = torch.utils.data.DataLoader(mutated_sequences_one_hot, batch_size=batch_size, shuffle=False, num_workers=4, pin_memory=True)
+ prediction_matrix = torch.zeros((len(list_valid_mutations),num_samples))
+
+ with torch.no_grad():
+ for i, batch in enumerate(tqdm.tqdm(dataloader, 'Looping through mutation batches')):
+ x = batch.type(self.dtype).to(self.device)
+ for j in tqdm.tqdm(range(num_samples), 'Looping through number of samples for batch #: '+str(i+1)):
+ seq_predictions, _, _ = self.all_likelihood_components(x)
+ prediction_matrix[i*batch_size:i*batch_size+len(x),j] = seq_predictions
+ tqdm.tqdm.write('\n')
+ mean_predictions = prediction_matrix.mean(dim=1, keepdim=False)
+ std_predictions = prediction_matrix.std(dim=1, keepdim=False)
+ delta_elbos = mean_predictions - mean_predictions[0]
+ evol_indices = - delta_elbos.detach().cpu().numpy()
+
+ return list_valid_mutations, evol_indices, mean_predictions[0].detach().cpu().numpy(), std_predictions.detach().cpu().numpy()
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/default_model_params.json b/proteingym/baselines/trancepteve/trancepteve/EVE/default_model_params.json
new file mode 100644
index 0000000..5d3b93c
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/default_model_params.json
@@ -0,0 +1,41 @@
+{ "encoder_parameters": {
+ "hidden_layers_sizes" : [2000,1000,300],
+ "z_dim" : 50,
+ "convolve_input" : false,
+ "convolution_input_depth" : 40,
+ "nonlinear_activation" : "relu",
+ "dropout_proba" : 0.0
+ },
+ "decoder_parameters": {
+ "hidden_layers_sizes" : [300,1000,2000],
+ "z_dim" : 50,
+ "bayesian_decoder" : true,
+ "first_hidden_nonlinearity" : "relu",
+ "last_hidden_nonlinearity" : "relu",
+ "dropout_proba" : 0.1,
+ "convolve_output" : true,
+ "convolution_output_depth" : 40,
+ "include_temperature_scaler" : true,
+ "include_sparsity" : false,
+ "num_tiles_sparsity" : 0,
+ "logit_sparsity_p" : 0
+ },
+ "training_parameters": {
+ "num_training_steps" : 400000,
+ "learning_rate" : 1e-4,
+ "batch_size" : 256,
+ "annealing_warm_up" : 0,
+ "kl_latent_scale" : 1.0,
+ "kl_global_params_scale" : 1.0,
+ "l2_regularization" : 0.0,
+ "use_lr_scheduler" : false,
+ "use_validation_set" : false,
+ "validation_set_pct" : 0.2,
+ "validation_freq" : 1000,
+ "log_training_info" : true,
+ "log_training_freq" : 1000,
+ "save_model_params_freq" : 500000
+ }
+}
+
+
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/merge_DeepSequence.py b/proteingym/baselines/trancepteve/trancepteve/EVE/merge_DeepSequence.py
new file mode 100644
index 0000000..92f5b5d
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/merge_DeepSequence.py
@@ -0,0 +1,35 @@
+import pandas as pd
+import os
+
+individual_files = '/n/groups/marks/projects/marks_lab_and_oatml/ProteinGym/model_scores/zero_shot_substitutions/DeepSequence'
+folder_name = '/n/groups/marks/projects/marks_lab_and_oatml/ProteinGym/model_scores/zero_shot_substitutions/DeepSequence/merged'
+
+mapping = "/home/pn73/protein_transformer/utils/mapping_files/ProteinGym_reference_file_substitutions_20220227.csv"
+#mapping='/home/pn73/ProteinGym/ProteinGym_reference_file_substitutions.csv'
+mapping = pd.read_csv(mapping)
+
+for DMS_id in mapping["DMS_id"][:87]:
+ print("DMS id : {}".format(DMS_id))
+ #if DMS_id in exclude:
+ # print("Exclude : {}".format(DMS_id))
+ # continue
+
+ evol_seed = {}
+ for i in [1000,2000,3000,4000,5000]:
+ #evol_seed[i] = pd.read_csv(individual_files+os.sep+"seed_"+str(i)+os.sep+DMS_id+'_20000_samples_Aug7_seed_'+str(i)+'.csv')
+ evol_seed[i] = pd.read_csv(individual_files+os.sep+DMS_id+'_20000_samples_Aug7_seed_'+str(i)+'.csv')
+ evol_seed[i] = evol_seed[i].groupby(['mutant']).mean().reset_index()
+
+ evol_ensemble = evol_seed[1000]
+ for i in [2000,3000,4000,5000]:
+ evol_ensemble = pd.merge(evol_ensemble,evol_seed[i], on='mutant', how='left', suffixes=("_seed_"+str(i-1000),"_seed_"+str(i)))
+ evol_ensemble['evol_indices_seed_5000'] = evol_ensemble['evol_indices']
+
+ evol_ensemble['evol_indices_ensemble']=0
+ for i in [1000,2000,3000,4000,5000]:
+ evol_ensemble['evol_indices_ensemble'] += evol_ensemble["evol_indices_seed_"+str(i)] / 5.0
+
+ final_list = ['mutant']+['evol_indices_seed_'+str(i) for i in [1000,2000,3000,4000,5000]]+['evol_indices_ensemble']
+ evol_ensemble = evol_ensemble[final_list]
+
+ evol_ensemble.to_csv(folder_name+os.sep+DMS_id+'.csv', index=False)
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/EVE/merge_EVE.py b/proteingym/baselines/trancepteve/trancepteve/EVE/merge_EVE.py
new file mode 100644
index 0000000..d8a9e14
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/EVE/merge_EVE.py
@@ -0,0 +1,50 @@
+import pandas as pd
+import os
+
+individual_files = '/n/groups/marks/projects/marks_lab_and_oatml/ProteinGym/model_scores/zero_shot_substitutions/EVE' #'/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/model_scores/EVE_all_mutants/20220807_EVE_models_all_mutants'
+folder_name = '/n/groups/marks/projects/marks_lab_and_oatml/ProteinGym/model_scores/zero_shot_substitutions/EVE/merged'
+
+#mapping = "/home/pn73/protein_transformer/utils/mapping_files/ProteinGym_reference_file_substitutions_20220227.csv"
+mapping='/home/pn73/ProteinGym/ProteinGym_reference_file_substitutions.csv'
+mapping = pd.read_csv(mapping)
+
+#exclude=[
+#'A4D664_9INFA_Soh_CCL141_2019',
+#'CAPSD_AAV2S_Sinai_substitutions_2021',
+#'ENV_HV1B9_DuenasDecamp_2016',
+#'I6TAH8_I68A0_Doud_2015',
+#'NCAP_I34A1_Doud_2015',
+#'NRAM_I33A0_Jiang_standard_2016',
+#'P53_HUMAN_Kotler_2018',
+#'PA_I34A1_Wu_2015',
+#'R1AB_SARS2_Flynn_growth_2022',
+#'SPG1_STRSG_Olson_2014',
+#'SPIKE_SARS2_Starr_bind_2020',
+#'SPIKE_SARS2_Starr_expr_2020'
+#]
+
+for DMS_id in mapping["DMS_id"][87:100]:
+ print("DMS id : {}".format(DMS_id))
+ #if DMS_id in exclude:
+ # print("Exclude : {}".format(DMS_id))
+ # continue
+
+ evol_seed = {}
+ for i in [1000,2000,3000,4000,5000]:
+ #evol_seed[i] = pd.read_csv(individual_files+os.sep+"seed_"+str(i)+os.sep+DMS_id+'_20000_samples_Aug7_seed_'+str(i)+'.csv')
+ evol_seed[i] = pd.read_csv(individual_files+os.sep+DMS_id+'_20000_samples_Aug7_seed_'+str(i)+'.csv')
+ evol_seed[i] = evol_seed[i].groupby(['mutant']).mean().reset_index()
+
+ evol_ensemble = evol_seed[1000]
+ for i in [2000,3000,4000,5000]:
+ evol_ensemble = pd.merge(evol_ensemble,evol_seed[i], on='mutant', how='left', suffixes=("_seed_"+str(i-1000),"_seed_"+str(i)))
+ evol_ensemble['evol_indices_seed_5000'] = evol_ensemble['evol_indices']
+
+ evol_ensemble['evol_indices_ensemble']=0
+ for i in [1000,2000,3000,4000,5000]:
+ evol_ensemble['evol_indices_ensemble'] += evol_ensemble["evol_indices_seed_"+str(i)] / 5.0
+
+ final_list = ['mutant']+['evol_indices_seed_'+str(i) for i in [1000,2000,3000,4000,5000]]+['evol_indices_ensemble']
+ evol_ensemble = evol_ensemble[final_list]
+
+ evol_ensemble.to_csv(folder_name+os.sep+DMS_id+'.csv', index=False)
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/__init__.py b/proteingym/baselines/trancepteve/trancepteve/__init__.py
new file mode 100644
index 0000000..230e531
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/__init__.py
@@ -0,0 +1 @@
+from . import config, utils
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/activations.py b/proteingym/baselines/trancepteve/trancepteve/activations.py
new file mode 100644
index 0000000..25702ef
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/activations.py
@@ -0,0 +1,114 @@
+import math
+
+import torch
+from packaging import version
+from torch import nn
+
+from transformers.utils import logging
+
+
+logger = logging.get_logger(__name__)
+
+
+def _gelu_python(x):
+ """
+ Original Implementation of the GELU activation function in Google BERT repo when initially created. For
+ information: OpenAI GPT's GELU is slightly different (and gives slightly different results): 0.5 * x * (1 +
+ torch.tanh(math.sqrt(2 / math.pi) * (x + 0.044715 * torch.pow(x, 3)))) This is now written in C in nn.functional
+ Also see the Gaussian Error Linear Units paper: https://arxiv.org/abs/1606.08415
+ """
+ return x * 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))
+
+
+def gelu_new(x):
+ """
+ Implementation of the GELU activation function currently in Google BERT repo (identical to OpenAI GPT). Also see
+ the Gaussian Error Linear Units paper: https://arxiv.org/abs/1606.08415
+ """
+ return 0.5 * x * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * torch.pow(x, 3.0))))
+
+
+if version.parse(torch.__version__) < version.parse("1.4"):
+ gelu = _gelu_python
+else:
+ gelu = nn.functional.gelu
+
+
+def gelu_fast(x):
+ return 0.5 * x * (1.0 + torch.tanh(x * 0.7978845608 * (1.0 + 0.044715 * x * x)))
+
+
+def quick_gelu(x):
+ return x * torch.sigmoid(1.702 * x)
+
+
+def _silu_python(x):
+ """
+ See Gaussian Error Linear Units (Hendrycks et al., https://arxiv.org/abs/1606.08415) where the SiLU (Sigmoid Linear
+ Unit) was originally introduced and coined, and see Sigmoid-Weighted Linear Units for Neural Network Function
+ Approximation in Reinforcement Learning (Elfwing et al., https://arxiv.org/abs/1702.03118) and Swish: a Self-Gated
+ Activation Function (Ramachandran et al., https://arxiv.org/abs/1710.05941v1) where the SiLU was experimented with
+ later.
+ """
+ return x * torch.sigmoid(x)
+
+
+if version.parse(torch.__version__) < version.parse("1.7"):
+ silu = _silu_python
+else:
+ silu = nn.functional.silu
+
+
+def _mish_python(x):
+ """
+ See Mish: A Self-Regularized Non-Monotonic Activation Function (Misra., https://arxiv.org/abs/1908.08681). Also
+ visit the official repository for the paper: https://github.com/digantamisra98/Mish
+ """
+ return x * torch.tanh(nn.functional.softplus(x))
+
+
+if version.parse(torch.__version__) < version.parse("1.9"):
+ mish = _mish_python
+else:
+ mish = nn.functional.mish
+
+
+def linear_act(x):
+ return x
+
+def squared_relu(x):
+ """
+ Squared ReLU variant that is fastest with Pytorch.
+ """
+ x = nn.functional.relu(x)
+ return x*x
+
+def squared_relu_xla(x):
+ """
+ Squared ReLU variant that is fastest with JAX.
+ """
+ x = nn.functional.relu(x)
+ return x**2
+
+tranception_ACT2FN = {
+ "relu": nn.functional.relu,
+ "silu": silu,
+ "swish": silu,
+ "gelu": gelu,
+ "tanh": torch.tanh,
+ "gelu_new": gelu_new,
+ "gelu_fast": gelu_fast,
+ "quick_gelu": quick_gelu,
+ "mish": mish,
+ "linear": linear_act,
+ "sigmoid": torch.sigmoid,
+ "squared_relu": squared_relu,
+ "squared_relu_xla": squared_relu_xla,
+}
+
+
+def get_activation(activation_string):
+ if activation_string in tranception_ACT2FN:
+ return tranception_ACT2FN[activation_string]
+ else:
+ raise KeyError(f"function {activation_string} not found in ACT2FN mapping {list(tranception_ACT2FN.keys())}")
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/config.py b/proteingym/baselines/trancepteve/trancepteve/config.py
new file mode 100644
index 0000000..749346c
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/config.py
@@ -0,0 +1,56 @@
+from transformers import GPT2Config
+
+class TranceptEVEConfig(GPT2Config):
+ """
+ Config subclass for Tranception model architecture.
+ """
+ def __init__(
+ self,
+ attention_mode="tranception",
+ position_embedding="grouped_alibi",
+ tokenizer=None,
+ full_target_seq=None,
+ scoring_window=None,
+ inference_time_retrieval_type="TranceptEVE",
+ retrieval_aggregation_mode=None, #[substitutions Vs indels]
+ retrieval_weights_manual=False,
+ retrieval_inference_MSA_weight=0.3,
+ retrieval_inference_EVE_weight=0.7,
+ MSA_filename=None,
+ MSA_weight_file_name=None,
+ MSA_start=None,
+ MSA_end=None,
+ MSA_threshold_sequence_frac_gaps=None,
+ MSA_threshold_focus_cols_frac_gaps=None,
+ clustal_omega_location=None,
+ EVE_model_paths=None,
+ EVE_num_samples_log_proba=None,
+ EVE_model_parameters_location=None,
+ MSA_recalibrate_probas=False,
+ EVE_recalibrate_probas=True,
+ **kwargs
+ ):
+ super().__init__(**kwargs)
+ self.model_type = "tranception"
+ self.attention_mode = attention_mode
+ self.position_embedding = position_embedding
+ self.tokenizer = tokenizer
+ self.full_target_seq = full_target_seq
+ self.scoring_window = scoring_window
+ self.inference_time_retrieval_type = inference_time_retrieval_type
+ self.retrieval_aggregation_mode = retrieval_aggregation_mode
+ self.retrieval_weights_manual = retrieval_weights_manual
+ self.retrieval_inference_MSA_weight = retrieval_inference_MSA_weight
+ self.retrieval_inference_EVE_weight = retrieval_inference_EVE_weight
+ self.MSA_filename = MSA_filename
+ self.MSA_weight_file_name = MSA_weight_file_name
+ self.MSA_start = MSA_start
+ self.MSA_end = MSA_end
+ self.MSA_threshold_sequence_frac_gaps = MSA_threshold_sequence_frac_gaps
+ self.MSA_threshold_focus_cols_frac_gaps = MSA_threshold_focus_cols_frac_gaps
+ self.clustal_omega_location = clustal_omega_location
+ self.EVE_model_paths = EVE_model_paths
+ self.EVE_num_samples_log_proba = EVE_num_samples_log_proba
+ self.EVE_model_parameters_location = EVE_model_parameters_location
+ self.MSA_recalibrate_probas = MSA_recalibrate_probas
+ self.EVE_recalibrate_probas = EVE_recalibrate_probas
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/model_pytorch.py b/proteingym/baselines/trancepteve/trancepteve/model_pytorch.py
new file mode 100644
index 0000000..d23286a
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/model_pytorch.py
@@ -0,0 +1,1232 @@
+from dataclasses import dataclass
+from typing import Optional, Tuple
+import math
+import os,sys
+import numpy as np
+import pandas as pd
+import json
+import tqdm
+import pickle
+import uuid
+import torch
+from torch import nn
+from torch.nn import CrossEntropyLoss, NLLLoss
+from torch.utils.data.sampler import SequentialSampler
+import torch.nn.functional as F
+from transformers import GPT2PreTrainedModel
+
+from transformers.modeling_utils import (
+ Conv1D,
+ PreTrainedModel,
+ SequenceSummary,
+ find_pruneable_heads_and_indices,
+ prune_conv1d_layer,
+)
+from transformers.file_utils import (
+ ModelOutput,
+ add_code_sample_docstrings,
+ add_start_docstrings,
+ add_start_docstrings_to_model_forward,
+ replace_return_docstrings
+)
+from transformers.modeling_outputs import (
+ BaseModelOutputWithPastAndCrossAttentions,
+ CausalLMOutputWithCrossAttentions,
+ SequenceClassifierOutputWithPast,
+ TokenClassifierOutput
+)
+from transformers.utils.model_parallel_utils import assert_device_map, get_device_map
+from transformers import DataCollatorForLanguageModeling, PreTrainedTokenizerFast
+from datasets import Dataset
+
+from trancepteve.activations import tranception_ACT2FN
+from trancepteve.config import TranceptEVEConfig
+from trancepteve.outputs import (
+ TranceptionCausalLMOutputWithCrossAttentions,
+)
+from trancepteve.utils import msa_utils
+from trancepteve.utils import scoring_utils
+from trancepteve.EVE import VAE_model
+
+def nanmean(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs) / (~is_nan).float().sum(*args, **kwargs)
+
+def logistic(x):
+ return 1 / (1 + math.exp(-x))
+
+def normalize(x):
+ return (x - x.mean()) / x.std()
+
+def entropy(x, ignore_tokenizer_characters=True):
+ """
+ Compute entropy over the last dimension of tensor x (assumes it is a log softmax input)
+ """
+ exp_x = torch.exp(x.float())
+ if ignore_tokenizer_characters:
+ entropy = (- exp_x[:,5:]*x[:,5:]).mean(dim=-1)
+ else:
+ entropy = (- exp_x*x).mean(dim=-1)
+ return entropy
+
+def get_slopes(n, mode="standard_alibi", verbose=False):
+ """
+ Function to compute the m constant for each attention head. Code has been adapted from the official ALiBi codebase at:
+ https://github.com/ofirpress/attention_with_linear_biases/blob/master/fairseq/models/transformer.py
+ """
+ def get_slopes_power_of_2(n):
+ start = (2**(-2**-(math.log2(n)-3)))
+ ratio = start
+ return [start*ratio**i for i in range(n)]
+ if mode=="grouped_alibi":
+ n = n // 4
+ if math.log2(n).is_integer():
+ result = get_slopes_power_of_2(n)
+ else:
+ #Workaround when the number of heads is not a power of 2
+ closest_power_of_2 = 2**math.floor(math.log2(n))
+ result = get_slopes_power_of_2(closest_power_of_2) + get_slopes(2*closest_power_of_2)[0::2][:n-closest_power_of_2]
+ if mode=="grouped_alibi":
+ result = result * 4
+ if verbose:
+ print("ALiBi slopes: {}".format(result))
+ return result
+
+class SpatialDepthWiseConvolution(nn.Module):
+ def __init__(self, head_dim: int, kernel_size: int = 3):
+ super().__init__()
+ self.kernel_size = kernel_size
+ self.conv = nn.Conv1d(in_channels=head_dim, out_channels=head_dim, kernel_size=(kernel_size,), padding=(kernel_size - 1,), groups=head_dim)
+
+ def forward(self, x: torch.Tensor):
+ batch_size, heads, seq_len, head_dim = x.shape
+ x = x.permute(0, 1, 3, 2).contiguous()
+ x = x.view(batch_size * heads, head_dim, seq_len)
+ x = self.conv(x)
+ if self.kernel_size>1:
+ x = x[:, :, :-(self.kernel_size - 1)]
+ x = x.view(batch_size, heads, head_dim, seq_len)
+ x = x.permute(0, 1, 3, 2)
+ return x
+
+class TranceptionBlockAttention(nn.Module):
+ def __init__(self, config, is_cross_attention=False, SDWC_kernel_size=None):
+ super().__init__()
+
+ max_positions = config.max_position_embeddings
+ self.register_buffer(
+ "bias",
+ torch.tril(torch.ones((max_positions, max_positions), dtype=torch.uint8)).view(
+ 1, 1, max_positions, max_positions
+ ),
+ )
+ self.register_buffer("masked_bias", torch.tensor(-1e4))
+
+ self.embed_dim = config.hidden_size
+ self.num_heads = config.num_attention_heads
+ self.head_dim = self.embed_dim // self.num_heads
+ self.split_size = self.embed_dim
+ if self.head_dim * self.num_heads != self.embed_dim:
+ raise ValueError(
+ f"`embed_dim` must be divisible by num_heads (got `embed_dim`: {self.embed_dim} and `num_heads`: {self.num_heads})."
+ )
+
+ self.scale_attn_weights = config.scale_attn_weights
+ self.is_cross_attention = is_cross_attention
+
+ if self.is_cross_attention:
+ self.c_attn = Conv1D(2 * self.embed_dim, self.embed_dim)
+ self.q_attn = Conv1D(self.embed_dim, self.embed_dim)
+ else:
+ self.c_attn = Conv1D(3 * self.embed_dim, self.embed_dim)
+ self.c_proj = Conv1D(self.embed_dim, self.embed_dim)
+
+ self.attn_dropout = nn.Dropout(config.attn_pdrop)
+ self.resid_dropout = nn.Dropout(config.resid_pdrop)
+
+ self.pruned_heads = set()
+
+ self.attention_mode=config.attention_mode
+
+ if self.attention_mode=="tranception":
+ assert self.num_heads%4==0, "Invalid number of heads. Tranception requires the number of heads to be a multiple of 4."
+ self.num_heads_per_kernel_size = self.num_heads // 4
+ self.query_depthwiseconv = nn.ModuleDict()
+ self.key_depthwiseconv = nn.ModuleDict()
+ self.value_depthwiseconv = nn.ModuleDict()
+ for kernel_idx, kernel in enumerate([3,5,7]):
+ self.query_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+ self.key_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+ self.value_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+
+ def prune_heads(self, heads):
+ if len(heads) == 0:
+ return
+ heads, index = find_pruneable_heads_and_indices(heads, self.num_heads, self.head_dim, self.pruned_heads)
+ index_attn = torch.cat([index, index + self.split_size, index + (2 * self.split_size)])
+
+ # Prune conv1d layers
+ self.c_attn = prune_conv1d_layer(self.c_attn, index_attn, dim=1)
+ self.c_proj = prune_conv1d_layer(self.c_proj, index, dim=0)
+
+ # Update hyper params
+ self.split_size = (self.split_size // self.num_heads) * (self.num_heads - len(heads))
+ self.num_heads = self.num_heads - len(heads)
+ self.pruned_heads = self.pruned_heads.union(heads)
+
+ def _attn(self, query, key, value, attention_mask=None, head_mask=None, alibi_bias=None):
+ attn_weights = torch.matmul(query, key.transpose(-1, -2))
+
+ if self.scale_attn_weights:
+ attn_weights = attn_weights / (float(value.size(-1)) ** 0.5)
+
+ if not self.is_cross_attention:
+ # if only "normal" attention layer implements causal mask
+ query_length, key_length = query.size(-2), key.size(-2)
+ causal_mask = self.bias[:, :, key_length - query_length : key_length, :key_length].bool()
+ attn_weights = torch.where(causal_mask, attn_weights, self.masked_bias.to(attn_weights.dtype))
+
+ if alibi_bias is not None:
+ attn_weights = attn_weights + alibi_bias[:,:,:attn_weights.size(-1)]
+
+ if attention_mask is not None:
+ # Apply the attention mask
+ attn_weights = attn_weights + attention_mask
+
+ attn_weights = nn.Softmax(dim=-1)(attn_weights)
+ attn_weights = self.attn_dropout(attn_weights)
+
+ # Mask heads if we want to
+ if head_mask is not None:
+ attn_weights = attn_weights * head_mask
+
+ attn_output = torch.matmul(attn_weights, value)
+
+ return attn_output, attn_weights
+
+ def _split_heads(self, tensor, num_heads, attn_head_size):
+ """
+ Splits hidden_size dim into attn_head_size and num_heads
+ """
+ new_shape = tensor.size()[:-1] + (num_heads, attn_head_size)
+ tensor = tensor.view(*new_shape)
+ return tensor.permute(0, 2, 1, 3) # (batch, head, seq_length, head_features)
+
+ def _merge_heads(self, tensor, num_heads, attn_head_size):
+ """
+ Merges attn_head_size dim and num_attn_heads dim into hidden_size
+ """
+ tensor = tensor.permute(0, 2, 1, 3).contiguous()
+ new_shape = tensor.size()[:-2] + (num_heads * attn_head_size,)
+ return tensor.view(new_shape)
+
+ def forward(
+ self,
+ hidden_states,
+ layer_past=None,
+ attention_mask=None,
+ head_mask=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ alibi_bias=None,
+ ):
+ if encoder_hidden_states is not None:
+ if not hasattr(self, "q_attn"):
+ raise ValueError(
+ "If class is used as cross attention, the weights `q_attn` have to be defined. "
+ "Please make sure to instantiate class with `GPT2Attention(..., is_cross_attention=True)`."
+ )
+
+ query = self.q_attn(hidden_states)
+ key, value = self.c_attn(encoder_hidden_states).split(self.split_size, dim=2)
+ attention_mask = encoder_attention_mask
+ else:
+ query, key, value = self.c_attn(hidden_states).split(self.split_size, dim=2)
+
+ query = self._split_heads(query, self.num_heads, self.head_dim)
+ key = self._split_heads(key, self.num_heads, self.head_dim)
+ value = self._split_heads(value, self.num_heads, self.head_dim)
+
+ if layer_past is not None:
+ past_key, past_value = layer_past
+ key = torch.cat((past_key, key), dim=-2)
+ value = torch.cat((past_value, value), dim=-2)
+
+ if use_cache is True:
+ present = (key, value)
+ else:
+ present = None
+
+ if self.attention_mode=="tranception":
+ # We do not do anything on the first self.num_heads_per_kernel_size heads (kernel =1)
+ query_list=[query[:,:self.num_heads_per_kernel_size,:,:]]
+ key_list=[key[:,:self.num_heads_per_kernel_size,:,:]]
+ value_list=[value[:,:self.num_heads_per_kernel_size,:,:]]
+ for kernel_idx in range(3):
+ query_list.append(self.query_depthwiseconv[str(kernel_idx)](query[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ key_list.append(self.key_depthwiseconv[str(kernel_idx)](key[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ value_list.append(self.value_depthwiseconv[str(kernel_idx)](value[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ query=torch.cat(query_list, dim=1)
+ key=torch.cat(key_list, dim=1)
+ value=torch.cat(value_list, dim=1)
+
+ attn_output, attn_weights = self._attn(query, key, value, attention_mask, head_mask, alibi_bias=alibi_bias)
+
+ attn_output = self._merge_heads(attn_output, self.num_heads, self.head_dim)
+ attn_output = self.c_proj(attn_output)
+ attn_output = self.resid_dropout(attn_output)
+
+ outputs = (attn_output, present)
+ if output_attentions:
+ outputs += (attn_weights,)
+
+ return outputs # a, present, (attentions)
+
+class TranceptionBlockMLP(nn.Module):
+ def __init__(self, intermediate_size, config):
+ super().__init__()
+ embed_dim = config.hidden_size
+ self.c_fc = Conv1D(intermediate_size, embed_dim)
+ self.c_proj = Conv1D(embed_dim, intermediate_size)
+ self.act = tranception_ACT2FN[config.activation_function]
+ self.dropout = nn.Dropout(config.resid_pdrop)
+
+ def forward(self, hidden_states):
+ hidden_states = self.c_fc(hidden_states)
+ hidden_states = self.act(hidden_states)
+ hidden_states = self.c_proj(hidden_states)
+ hidden_states = self.dropout(hidden_states)
+ return hidden_states
+
+class TranceptionBlock(nn.Module):
+ def __init__(self, config, SDWC_kernel_size=None):
+ super().__init__()
+ hidden_size = config.hidden_size
+ inner_dim = config.n_inner if config.n_inner is not None else 4 * hidden_size
+
+ self.ln_1 = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+ self.attn = TranceptionBlockAttention(config, SDWC_kernel_size=SDWC_kernel_size)
+ self.ln_2 = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+
+ if config.add_cross_attention:
+ self.crossattention = TranceptionBlockAttention(config, is_cross_attention=True, SDWC_kernel_size=SDWC_kernel_size)
+ self.ln_cross_attn = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+
+ self.mlp = TranceptionBlockMLP(inner_dim, config)
+
+ def forward(
+ self,
+ hidden_states,
+ layer_past=None,
+ attention_mask=None,
+ head_mask=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ alibi_bias=None,
+ ):
+ residual = hidden_states
+ hidden_states = self.ln_1(hidden_states)
+ attn_outputs = self.attn(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ alibi_bias=alibi_bias,
+ )
+ attn_output = attn_outputs[0] # output_attn: a, present, (attentions)
+ outputs = attn_outputs[1:]
+ # residual connection
+ hidden_states = attn_output + residual
+
+ if encoder_hidden_states is not None:
+ # add one self-attention block for cross-attention
+ if not hasattr(self, "crossattention"):
+ raise ValueError(
+ f"If `encoder_hidden_states` are passed, {self} has to be instantiated with "
+ "cross-attention layers by setting `config.add_cross_attention=True`"
+ )
+ residual = hidden_states
+ hidden_states = self.ln_cross_attn(hidden_states)
+ cross_attn_outputs = self.crossattention(
+ hidden_states,
+ attention_mask=attention_mask,
+ head_mask=head_mask,
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ output_attentions=output_attentions,
+ )
+ attn_output = cross_attn_outputs[0]
+ # residual connection
+ hidden_states = residual + attn_output
+ outputs = outputs + cross_attn_outputs[2:] # add cross attentions if we output attention weights
+
+ residual = hidden_states
+ hidden_states = self.ln_2(hidden_states)
+
+ feed_forward_hidden_states = self.mlp(hidden_states)
+
+ # residual connection
+ hidden_states = residual + feed_forward_hidden_states
+
+ if use_cache:
+ outputs = (hidden_states,) + outputs
+ else:
+ outputs = (hidden_states,) + outputs[1:]
+
+ return outputs # hidden_states, present, (attentions, cross_attentions)
+
+class TranceptionModel(GPT2PreTrainedModel):
+ _keys_to_ignore_on_load_missing = ["attn.masked_bias"]
+ def __init__(self, config):
+ super().__init__(config)
+
+ self.embed_dim = config.hidden_size
+ self.wte = nn.Embedding(config.vocab_size, self.embed_dim)
+ self.position_embedding = config.position_embedding if hasattr(config, "position_embedding") else "learned"
+ if self.position_embedding=="learned":
+ self.wpe = nn.Embedding(config.max_position_embeddings, self.embed_dim)
+ self.alibi = None
+ elif self.position_embedding=="grouped_alibi":
+ maxpos = config.n_positions
+ attn_heads = config.n_head
+ self.slopes = torch.Tensor(get_slopes(attn_heads, mode=self.position_embedding))
+ #The softmax operation is invariant to translation, and bias functions used are always linear.
+ alibi = self.slopes.unsqueeze(1).unsqueeze(1) * torch.arange(maxpos).unsqueeze(0).unsqueeze(0).expand(attn_heads, -1, -1)
+ alibi = alibi.view(attn_heads, 1, maxpos)
+ self.register_buffer('alibi',alibi)
+
+ self.drop = nn.Dropout(config.embd_pdrop)
+ self.h = nn.ModuleList([TranceptionBlock(config) for _ in range(config.num_hidden_layers)])
+ self.ln_f = nn.LayerNorm(self.embed_dim, eps=config.layer_norm_epsilon)
+
+ self.init_weights()
+
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+ self.gradient_checkpointing = False
+
+ def parallelize(self, device_map=None, num_cores=None):
+ self.device_map = (
+ get_device_map(len(self.h), range(torch.cuda.device_count())) if device_map is None else device_map
+ )
+ device_prefix="cuda:"
+ assert_device_map(self.device_map, len(self.h))
+ self.model_parallel = True
+ self.first_device = "cpu" if "cpu" in self.device_map.keys() else device_prefix + str(min(self.device_map.keys()))
+ self.last_device = device_prefix + str(max(self.device_map.keys()))
+ self.wte = self.wte.to(self.first_device)
+ if self.position_embedding=="learned":
+ self.wpe = self.wpe.to(self.first_device)
+ for k, v in self.device_map.items():
+ print("k,v :"+str(k)+","+str(v))
+ for block in v:
+ cuda_device = device_prefix + str(k)
+ self.h[block] = self.h[block].to(cuda_device)
+ self.ln_f = self.ln_f.to(self.last_device)
+
+ def deparallelize(self):
+ self.model_parallel = False
+ self.device_map = None
+ self.first_device = "cpu"
+ self.last_device = "cpu"
+ self.wte = self.wte.to("cpu")
+ if self.position_embedding=="learned":
+ self.wpe = self.wpe.to("cpu")
+ for index in range(len(self.h)):
+ self.h[index] = self.h[index].to("cpu")
+ self.ln_f = self.ln_f.to("cpu")
+ torch.cuda.empty_cache()
+
+ def get_input_embeddings(self):
+ return self.wte
+
+ def set_input_embeddings(self, new_embeddings):
+ self.wte = new_embeddings
+
+ def _prune_heads(self, heads_to_prune):
+ """
+ Prunes heads of the model. heads_to_prune: dict of {layer_num: list of heads to prune in this layer}
+ """
+ for layer, heads in heads_to_prune.items():
+ self.h[layer].attn.prune_heads(heads)
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ ):
+ output_attentions = output_attentions if output_attentions is not None else self.config.output_attentions
+ output_hidden_states = (
+ output_hidden_states if output_hidden_states is not None else self.config.output_hidden_states
+ )
+ use_cache = use_cache if use_cache is not None else self.config.use_cache
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ if input_ids is not None and inputs_embeds is not None:
+ raise ValueError("You cannot specify both input_ids and inputs_embeds at the same time")
+ elif input_ids is not None:
+ input_shape = input_ids.size()
+ input_ids = input_ids.view(-1, input_shape[-1])
+ batch_size = input_ids.shape[0]
+ elif inputs_embeds is not None:
+ input_shape = inputs_embeds.size()[:-1]
+ batch_size = inputs_embeds.shape[0]
+ else:
+ raise ValueError("You have to specify either input_ids or inputs_embeds")
+
+ device = input_ids.device if input_ids is not None else inputs_embeds.device
+
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids.view(-1, input_shape[-1])
+ if position_ids is not None:
+ position_ids = position_ids.view(-1, input_shape[-1])
+
+ if past_key_values is None:
+ past_length = 0
+ past_key_values = tuple([None] * len(self.h))
+ else:
+ past_length = past_key_values[0][0].size(-2)
+ if position_ids is None:
+ position_ids = torch.arange(past_length, input_shape[-1] + past_length, dtype=torch.long, device=device)
+ position_ids = position_ids.unsqueeze(0).view(-1, input_shape[-1])
+
+ # GPT2Attention mask.
+ if attention_mask is not None:
+ if batch_size <= 0:
+ raise ValueError("batch_size has to be defined and > 0")
+ attention_mask = attention_mask.view(batch_size, -1)
+ # We create a 3D attention mask from a 2D tensor mask.
+ # Sizes are [batch_size, 1, 1, to_seq_length]
+ # So we can broadcast to [batch_size, num_heads, from_seq_length, to_seq_length]
+ # this attention mask is more simple than the triangular masking of causal attention
+ # used in OpenAI GPT, we just need to prepare the broadcast dimension here.
+ attention_mask = attention_mask[:, None, None, :]
+
+ # Since attention_mask is 1.0 for positions we want to attend and 0.0 for
+ # masked positions, this operation will create a tensor which is 0.0 for
+ # positions we want to attend and -10000.0 for masked positions.
+ # Since we are adding it to the raw scores before the softmax, this is
+ # effectively the same as removing these entirely.
+ attention_mask = attention_mask.to(dtype=self.dtype) # fp16 compatibility
+ attention_mask = (1.0 - attention_mask) * -10000.0
+
+ # If a 2D ou 3D attention mask is provided for the cross-attention
+ # we need to make broadcastable to [batch_size, num_heads, seq_length, seq_length]
+ if self.config.add_cross_attention and encoder_hidden_states is not None:
+ encoder_batch_size, encoder_sequence_length, _ = encoder_hidden_states.size()
+ encoder_hidden_shape = (encoder_batch_size, encoder_sequence_length)
+ if encoder_attention_mask is None:
+ encoder_attention_mask = torch.ones(encoder_hidden_shape, device=device)
+ encoder_attention_mask = self.invert_attention_mask(encoder_attention_mask)
+ else:
+ encoder_attention_mask = None
+
+ # Prepare head mask if needed
+ # 1.0 in head_mask indicate we keep the head
+ # attention_probs has shape bsz x n_heads x N x N
+ # head_mask has shape n_layer x batch x n_heads x N x N
+ head_mask = self.get_head_mask(head_mask, self.config.n_layer)
+
+ if inputs_embeds is None:
+ inputs_embeds = self.wte(input_ids)
+ if self.position_embedding=="learned":
+ position_embeds = self.wpe(position_ids)
+ hidden_states = inputs_embeds + position_embeds
+ else:
+ hidden_states = inputs_embeds
+
+ if token_type_ids is not None:
+ token_type_embeds = self.wte(token_type_ids)
+ hidden_states = hidden_states + token_type_embeds
+
+ hidden_states = self.drop(hidden_states)
+
+ output_shape = input_shape + (hidden_states.size(-1),)
+
+ presents = () if use_cache else None
+ all_self_attentions = () if output_attentions else None
+ all_cross_attentions = () if output_attentions and self.config.add_cross_attention else None
+ all_hidden_states = () if output_hidden_states else None
+
+ for i, (block, layer_past) in enumerate(zip(self.h, past_key_values)):
+ # Model parallel
+ if self.model_parallel:
+ torch.cuda.set_device(hidden_states.device)
+ # Ensure layer_past is on same device as hidden_states (might not be correct)
+ if layer_past is not None:
+ layer_past = tuple(past_state.to(hidden_states.device) for past_state in layer_past)
+ # Ensure that attention_mask is always on the same device as hidden_states
+ if attention_mask is not None:
+ attention_mask = attention_mask.to(hidden_states.device)
+ if isinstance(head_mask, torch.Tensor):
+ head_mask = head_mask.to(hidden_states.device)
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if self.gradient_checkpointing and self.training:
+ if use_cache:
+ print("`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`...")
+ use_cache = False
+
+ def create_custom_forward(module):
+ def custom_forward(*inputs):
+ # None for past_key_value
+ return module(*inputs, use_cache, output_attentions)
+
+ return custom_forward
+
+ outputs = torch.utils.checkpoint.checkpoint(
+ create_custom_forward(block),
+ hidden_states,
+ None,
+ attention_mask,
+ head_mask[i],
+ encoder_hidden_states,
+ encoder_attention_mask,
+ )
+ else:
+ outputs = block(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask[i],
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ alibi_bias=self.alibi if hasattr(self, "alibi") else None
+ )
+
+ hidden_states = outputs[0]
+
+ if use_cache is True:
+ presents = presents + (outputs[1],)
+
+ if output_attentions:
+ all_self_attentions = all_self_attentions + (outputs[2 if use_cache else 1],)
+ if self.config.add_cross_attention:
+ all_cross_attentions = all_cross_attentions + (outputs[3 if use_cache else 2],)
+
+ if self.model_parallel:
+ device_prefix="cuda:"
+ for k, v in self.device_map.items():
+ if i == v[-1] and device_prefix + str(k) != self.last_device:
+ hidden_states = hidden_states.to(device_prefix + str(k + 1))
+
+ hidden_states = self.ln_f(hidden_states)
+
+ hidden_states = hidden_states.view(*output_shape)
+ # Add last hidden state
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if not return_dict:
+ return tuple(
+ v
+ for v in [hidden_states, presents, all_hidden_states, all_self_attentions, all_cross_attentions]
+ if v is not None
+ )
+
+ return BaseModelOutputWithPastAndCrossAttentions(
+ last_hidden_state=hidden_states,
+ past_key_values=presents,
+ hidden_states=all_hidden_states,
+ attentions=all_self_attentions,
+ cross_attentions=all_cross_attentions,
+ )
+
+class TrancepteveLMHeadModel(GPT2PreTrainedModel):
+ _keys_to_ignore_on_load_missing = [r"attn.masked_bias", r"attn.bias", r"lm_head.weight"]
+ def __init__(self, config):
+ super().__init__(config)
+ self.transformer = TranceptionModel(config)
+ self.lm_head = nn.Linear(config.n_embd, config.vocab_size, bias=False)
+ self.config = config
+ self.clustal_hash = str(uuid.uuid4())
+ self.clustal_hash_eve = str(uuid.uuid4())
+ self.init_weights()
+
+ self.default_model_device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+
+ self.inference_time_retrieval_type = config.inference_time_retrieval_type if hasattr(config, "inference_time_retrieval_type") else None
+ self.retrieval_aggregation_mode = config.retrieval_aggregation_mode if hasattr(config, "retrieval_aggregation_mode") else None
+ if self.inference_time_retrieval_type is not None:
+ print("Model leverages both autoregressive and retrieval inference (Type: {})".format(self.inference_time_retrieval_type))
+ self.retrieval_weights_manual = config.retrieval_weights_manual
+ self.MSA_filename = config.MSA_filename if hasattr(config, "MSA_filename") else None
+ self.MSA_folder = '/'.join(self.MSA_filename.split(os.sep)[:-1])
+ self.MSA_name = self.MSA_filename.split(os.sep)[-1]
+ self.MSA_start=config.MSA_start
+ self.MSA_end=config.MSA_end
+ self.full_target_seq = config.full_target_seq if hasattr(config, "full_target_seq") else ''
+ self.full_protein_length = len(self.full_target_seq)
+ self.EVE_model_paths = config.EVE_model_paths
+ self.EVE_recalibrate_probas = config.EVE_recalibrate_probas
+ if self.inference_time_retrieval_type.startswith("Trancept"):
+ self.MSA_log_prior, self.MSA_processed_depth = msa_utils.get_msa_prior(
+ MSA_data_file=self.MSA_filename,
+ MSA_weight_file_name=config.MSA_weight_file_name,
+ retrieval_aggregation_mode=self.retrieval_aggregation_mode,
+ MSA_start=self.MSA_start,
+ MSA_end=self.MSA_end,
+ len_target_seq=self.full_protein_length,
+ vocab=config.tokenizer.get_vocab(),
+ filter_MSA=True,
+ threshold_sequence_frac_gaps=config.MSA_threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=config.MSA_threshold_focus_cols_frac_gaps,
+ verbose=True
+ )
+ self.MSA_log_prior = torch.log(torch.tensor(self.MSA_log_prior).float().to(self.default_model_device))
+ else:
+ self.MSA_processed_depth = 0
+ if self.inference_time_retrieval_type=="TranceptEVE":
+ assert (self.EVE_model_paths is not None) and len(self.EVE_model_paths) >=1 , "Could not find a reference for EVE model"
+ self.EVE_models, self.EVE_MSA, self.EVE_log_prior = self.get_EVE_models_and_log_prior(EVE_model_paths=self.EVE_model_paths,
+ EVE_model_parameters_location=config.EVE_model_parameters_location,
+ full_sequence_len=self.full_protein_length,
+ MSA_start=self.MSA_start,
+ MSA_end=self.MSA_end,
+ EVE_num_samples_log_proba=config.EVE_num_samples_log_proba,
+ threshold_sequence_frac_gaps=config.MSA_threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=config.MSA_threshold_focus_cols_frac_gaps,
+ verbose=True)
+ self.EVE_log_prior = self.EVE_log_prior.to(self.default_model_device)
+ self.EVE_processed_depth = len(self.EVE_MSA.seq_name_to_sequence.keys())
+ else:
+ self.EVE_processed_depth = 0
+
+ if self.retrieval_weights_manual:
+ self.retrieval_inference_MSA_weight = config.retrieval_inference_MSA_weight if hasattr(config, "retrieval_inference_MSA_weight") else 0.5
+ self.retrieval_inference_EVE_weight = config.retrieval_inference_EVE_weight if hasattr(config, "retrieval_inference_EVE_weight") else 0.5
+ elif self.inference_time_retrieval_type=="Tranception":
+ # Using weights from original Tranception paper
+ self.retrieval_inference_MSA_weight = 0.6
+ self.retrieval_inference_EVE_weight = 0.0
+ elif self.inference_time_retrieval_type=="TranceptEVE":
+ if self.retrieval_aggregation_mode=="aggregate_indel":
+ if self.MSA_processed_depth < 10:
+ self.retrieval_inference_MSA_weight = 0.0
+ self.retrieval_inference_EVE_weight = 0.0
+ else:
+ self.retrieval_inference_MSA_weight = 0.5
+ self.retrieval_inference_EVE_weight = 0.1
+ else:
+ if self.MSA_processed_depth < 10:
+ self.retrieval_inference_MSA_weight = 0.0
+ elif self.MSA_processed_depth < 10**2:
+ self.retrieval_inference_MSA_weight = 0.1
+ elif self.MSA_processed_depth < 10**3:
+ self.retrieval_inference_MSA_weight = 0.3
+ elif self.MSA_processed_depth < 10**4:
+ self.retrieval_inference_MSA_weight = 0.4
+ elif self.MSA_processed_depth < 10**5:
+ self.retrieval_inference_MSA_weight = 0.4
+ else:
+ self.retrieval_inference_MSA_weight = 0.5
+
+ if self.EVE_processed_depth < 10:
+ self.retrieval_inference_EVE_weight = 0.0
+ elif self.EVE_processed_depth < 10**2:
+ self.retrieval_inference_EVE_weight = 0.3
+ elif self.EVE_processed_depth < 10**3:
+ self.retrieval_inference_EVE_weight = 0.6
+ elif self.EVE_processed_depth < 10**4:
+ self.retrieval_inference_EVE_weight = 0.7
+ elif self.EVE_processed_depth < 10**5:
+ self.retrieval_inference_EVE_weight = 0.7
+ else:
+ self.retrieval_inference_EVE_weight = 0.8
+ print("Aggregation weights of retrieved MSA & EVE model are based on processed MSA depth: MSA({}) and EVE({})".format(self.retrieval_inference_MSA_weight,self.retrieval_inference_EVE_weight))
+ else:
+ print("Model only uses autoregressive inference")
+
+ def parallelize(self, device_map=None, num_cores=None, num_pipelines=1):
+ self.num_pipelines=num_pipelines
+ self.device_map = (
+ get_device_map(len(self.transformer.h), range(torch.cuda.device_count()))
+ if device_map is None
+ else device_map
+ )
+ assert_device_map(self.device_map, len(self.transformer.h))
+ self.transformer.parallelize(self.device_map, num_cores=num_cores)
+ self.lm_head = self.lm_head.to(self.transformer.first_device)
+ self.model_parallel = True
+
+ def deparallelize(self):
+ self.transformer.deparallelize()
+ self.transformer = self.transformer.to("cpu")
+ self.lm_head = self.lm_head.to("cpu")
+ self.model_parallel = False
+ torch.cuda.empty_cache()
+
+ def get_output_embeddings(self):
+ return self.lm_head
+
+ def set_output_embeddings(self, new_embeddings):
+ self.lm_head = new_embeddings
+
+ def prepare_inputs_for_generation(self, input_ids, past=None, **kwargs):
+ token_type_ids = kwargs.get("token_type_ids", None)
+ # only last token for inputs_ids if past is defined in kwargs
+ if past:
+ input_ids = input_ids[:, -1].unsqueeze(-1)
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids[:, -1].unsqueeze(-1)
+
+ attention_mask = kwargs.get("attention_mask", None)
+ position_ids = kwargs.get("position_ids", None)
+
+ if attention_mask is not None and position_ids is None:
+ # create position_ids on the fly for batch generation
+ position_ids = attention_mask.long().cumsum(-1) - 1
+ position_ids.masked_fill_(attention_mask == 0, 1)
+ if past:
+ position_ids = position_ids[:, -1].unsqueeze(-1)
+ else:
+ position_ids = None
+
+ return {
+ "input_ids": input_ids,
+ "past_key_values": past,
+ "use_cache": kwargs.get("use_cache"),
+ "position_ids": position_ids,
+ "attention_mask": attention_mask,
+ "token_type_ids": token_type_ids,
+ "flip": kwargs.get("flip", None),
+ }
+
+ def get_transformer_log_softmax(self, sequence, batch_size=20, inference_time_retrieval_type="Tranception"):
+ model_context_len = self.config.n_ctx - 2
+ sequence_df = pd.DataFrame({'mutated_sequence':[sequence]})
+ num_windows = 1 + int( len(sequence) / model_context_len)
+ df_list=[]
+ start=0
+ for window_index in range(1, num_windows+1):
+ df_sliced = sequence_df.copy()
+ df_sliced['sliced_mutated_sequence'] = df_sliced['mutated_sequence'].map(lambda x: x[start:start+model_context_len])
+ df_sliced['start_slice'] = [start]
+ df_sliced['end_slice'] = df_sliced['mutated_sequence'].map(lambda x: min(len(x), start+model_context_len))
+ df_list.append(df_sliced)
+ start += model_context_len
+ df_final = pd.concat(df_list,axis=0)
+ df = df_final.drop_duplicates()
+ self.eval()
+ with torch.no_grad():
+ ds = Dataset.from_pandas(df)
+ ds.set_transform(self.encode_batch)
+ data_collator = DataCollatorForLanguageModeling(
+ tokenizer=self.config.tokenizer,
+ mlm=False)
+ sampler = SequentialSampler(ds)
+ ds_loader = torch.utils.data.DataLoader(ds, batch_size=num_windows, sampler=sampler, collate_fn=data_collator, num_workers=5, pin_memory=True, drop_last=False)
+ for encoded_batch in tqdm.tqdm(ds_loader):
+ encoded_batch['start_slice'] = df['start_slice'].values
+ encoded_batch['end_slice'] = df['end_slice'].values
+ for k, v in encoded_batch.items():
+ if isinstance(v, torch.Tensor):
+ encoded_batch[k] = v.to(self.device)
+ shift_labels = encoded_batch['labels'][..., 1:].contiguous()
+ shift_log_probas=self.forward(**encoded_batch,return_dict=True, retrieval_aggregation_mode="aggregate_substitution",inference_time_retrieval_type=inference_time_retrieval_type).fused_shift_log_probas
+ vocab_size = shift_log_probas.shape[-1]
+ if num_windows>1: #Trim last positions
+ shift_log_probas_trimmed = torch.zeros((len(sequence)+1,vocab_size)).to(shift_log_probas.device)
+ shift_labels_trimmed = torch.zeros((len(sequence)+1,)).to(shift_log_probas.device)
+ start_index = 0
+ for window in range(num_windows):
+ if window < num_windows - 1:
+ shift_log_probas_trimmed[start_index:start_index+model_context_len] = shift_log_probas[window,:model_context_len]
+ shift_labels_trimmed[start_index:start_index+model_context_len] = shift_labels[window,:model_context_len]
+ else:
+ remaining_residues = len(sequence) + 1 - start_index
+ shift_log_probas_trimmed[start_index:] = shift_log_probas[window,:remaining_residues]
+ shift_labels_trimmed[start_index:] = shift_labels[window,:remaining_residues]
+ start_index+=model_context_len
+ shift_log_probas = shift_log_probas_trimmed
+ shift_labels = shift_labels_trimmed.long()
+ #remove dummy tokens at the end -- adding 1 for the EOS token
+ shift_log_probas=shift_log_probas.view(-1,vocab_size)[:len(sequence)+1]
+ shift_labels=shift_labels.view(-1)[:len(sequence)+1]
+ assert shift_log_probas.shape[0]==len(shift_labels), "Length of log probas vector does not match length of labels"
+ return shift_log_probas, shift_labels
+
+ def iterative_recalibrations(self, log_proba_to_calibrate, avg_log_proba_target, distance_stop_criterion=0.001, max_steps = 1000):
+ loss = abs(log_proba_to_calibrate.mean() - avg_log_proba_target)
+ step = 0
+ while (loss > distance_stop_criterion):
+ T = log_proba_to_calibrate.mean() / avg_log_proba_target
+ log_proba_to_calibrate = torch.log_softmax(log_proba_to_calibrate / T, dim=-1)
+ loss = abs(log_proba_to_calibrate.mean() - avg_log_proba_target)
+ step += 1
+ if step > max_steps:
+ break
+ return log_proba_to_calibrate
+
+ def recalibrate_MSA_probas(self):
+ log_softmax_wt_LR, shift_labels_LR = self.get_transformer_log_softmax(sequence=self.full_target_seq, inference_time_retrieval_type=None)
+ log_softmax_wt_RL, shift_labels_RL = self.get_transformer_log_softmax(sequence=self.full_target_seq[::-1], inference_time_retrieval_type=None)
+ log_probas_transformer_mean = (log_softmax_wt_LR[self.MSA_start:self.MSA_end,5:].mean() + log_softmax_wt_RL[self.MSA_start:self.MSA_end,5:].mean())/2.0
+ log_probas_MSA_mean = self.MSA_log_prior[self.MSA_start:self.MSA_end,5:].mean()
+ T_optimal = log_probas_MSA_mean / log_probas_transformer_mean
+ print("Optimal temperature for MSA proba recalibration: {}".format(T_optimal))
+ self.MSA_log_prior[self.MSA_start:self.MSA_end,5:] = self.iterative_recalibrations(self.MSA_log_prior[self.MSA_start:self.MSA_end,5:], avg_log_proba_target=log_probas_transformer_mean)
+
+ def recalibrate_EVE_probas(self):
+ log_softmax_wt_LR, shift_labels_LR = self.get_transformer_log_softmax(sequence=self.full_target_seq)
+ log_softmax_wt_RL, shift_labels_RL = self.get_transformer_log_softmax(sequence=self.full_target_seq[::-1])
+ reindexed_focus_cols = [self.MSA_start+position for position in self.EVE_MSA.focus_cols]
+ log_probas_transformer_mean = (log_softmax_wt_LR[reindexed_focus_cols,5:].mean() + log_softmax_wt_RL[reindexed_focus_cols,5:].mean())/2.0
+ log_probas_EVE_mean = self.EVE_log_prior[reindexed_focus_cols,5:].mean()
+ T_optimal = log_probas_EVE_mean / log_probas_transformer_mean
+ print("Optimal temperature for EVE proba recalibration: {}".format(T_optimal))
+ self.EVE_log_prior[reindexed_focus_cols,5:] = self.iterative_recalibrations(self.EVE_log_prior[reindexed_focus_cols,5:], avg_log_proba_target=log_probas_transformer_mean)
+
+ def get_EVE_model(self, EVE_model_path, EVE_model_parameters_location, threshold_sequence_frac_gaps=None, threshold_focus_cols_frac_gaps=None):
+ assert self.MSA_filename is not None, "MSA_filename not specified"
+ assert self.MSA_folder is not None, "MSA_folder not specified"
+ assert os.path.exists(self.config.MSA_weight_file_name), "MSA weights file not found"
+ if threshold_focus_cols_frac_gaps!=1.0: print("threshold_focus_cols_frac_gaps not 1.0. Only well-covered positions are factored in the EVE retrieval aggregation.")
+ MSA = msa_utils.MSA_processing(
+ MSA_location=self.MSA_filename,
+ #theta=theta, #Dont need to specify weights since EVE model should be trained separately beforehand / we are using weights directly as is
+ use_weights=True,
+ threshold_sequence_frac_gaps=threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=threshold_focus_cols_frac_gaps,
+ weights_location=self.config.MSA_weight_file_name
+ )
+ model_params = json.load(open(EVE_model_parameters_location))
+ model = VAE_model.VAE_model(
+ model_name='EVE_model',
+ data=MSA,
+ encoder_parameters=model_params["encoder_parameters"],
+ decoder_parameters=model_params["decoder_parameters"],
+ random_seed=42
+ )
+ model = model.to(model.device)
+ #try:
+ if True:
+ checkpoint = torch.load(EVE_model_path)
+ model.load_state_dict(checkpoint['model_state_dict'])
+ print("Initialized EVE model with checkpoint '{}' ".format(EVE_model_path))
+ #except:
+ else:
+ print("Unable to locate EVE model checkpoint {}".format(EVE_model_path))
+ sys.exit(0)
+ return model, MSA
+
+ def get_EVE_models_and_log_prior(self, EVE_model_paths, EVE_model_parameters_location, full_sequence_len, MSA_start, MSA_end, sequences_to_score=None, EVE_num_samples_log_proba=10, alphabet="ACDEFGHIKLMNPQRSTVWY", verbose=False, threshold_sequence_frac_gaps=None, threshold_focus_cols_frac_gaps=None):
+ """
+ Create ensemble if multiple models passed through EVE_model_paths.
+ """
+ num_EVE_models = len(EVE_model_paths)
+ EVE_ensemble_retrieved_MSA = {}
+ EVE_log_prior = 0
+ for model_index, EVE_model_path in enumerate(EVE_model_paths):
+ EVE_ensemble_retrieved_MSA[model_index], EVE_MSA = self.get_EVE_model(EVE_model_path, EVE_model_parameters_location, threshold_sequence_frac_gaps=threshold_sequence_frac_gaps, threshold_focus_cols_frac_gaps=threshold_focus_cols_frac_gaps)
+ log_prior_list = EVE_model_path.split('/')
+ log_prior_folder = '/'.join(log_prior_list[:-1])+os.sep+'log_prior'
+ if not os.path.exists(log_prior_folder):
+ os.mkdir(log_prior_folder)
+ log_prior_name = '_'.join([log_prior_list[-1],str(EVE_num_samples_log_proba),'log_space'])
+ log_prior_location = log_prior_folder+os.sep+log_prior_name
+ sequences_to_score = [EVE_MSA.focus_seq_trimmed] if sequences_to_score is None else sequences_to_score
+ if not os.path.exists(log_prior_location):
+ print("Computing EVE log prior")
+ EVE_log_prior_single = self.get_EVE_log_prior_single(EVE_model=EVE_ensemble_retrieved_MSA[model_index],
+ sequences_to_score=sequences_to_score,
+ full_sequence_len=full_sequence_len,
+ MSA_start=MSA_start,
+ MSA_end=MSA_end,
+ EVE_MSA=EVE_MSA,
+ EVE_num_samples_log_proba=EVE_num_samples_log_proba,
+ alphabet=alphabet,
+ verbose=verbose)
+ with open(log_prior_location,'wb') as f: pickle.dump(EVE_log_prior_single, f)
+ else:
+ print("Loading EVE log prior from disk")
+ with open(log_prior_location,'rb') as f: EVE_log_prior_single = pickle.load(f)
+ EVE_log_prior += EVE_log_prior_single
+ EVE_log_prior = EVE_log_prior / len(EVE_model_paths)
+ return EVE_ensemble_retrieved_MSA, EVE_MSA, EVE_log_prior
+
+ def get_EVE_log_prior_single(self, EVE_model, sequences_to_score, full_sequence_len, MSA_start, MSA_end, EVE_MSA, EVE_num_samples_log_proba=10, alphabet="ACDEFGHIKLMNPQRSTVWY", verbose=False, average_mode="log_space"):
+ reference_seq = sequences_to_score[0]
+ self.focus_seq_one_hot_encoding = np.zeros((len(sequences_to_score),len(reference_seq),len(alphabet)))
+ aa_dict = {}
+ for i,aa in enumerate(alphabet):
+ aa_dict[aa] = i
+ for i,sequence in enumerate(sequences_to_score):
+ for j,letter in enumerate(sequence):
+ if letter in aa_dict:
+ k = aa_dict[letter]
+ self.focus_seq_one_hot_encoding[i,j,k] = 1.0
+ EVE_model.eval()
+ recon_x_log = 0
+ with torch.no_grad():
+ x = torch.tensor(self.focus_seq_one_hot_encoding, dtype=EVE_model.dtype).to(EVE_model.device)
+ mu, log_var = EVE_model.encoder(x)
+ for _ in tqdm.tqdm(range(EVE_num_samples_log_proba), desc="Sampling EVE log probabilities", mininterval=10):
+ z = EVE_model.sample_latent(mu, log_var)
+ recon_x_log += EVE_model.decoder(z)
+ recon_x_log = recon_x_log / EVE_num_samples_log_proba #Average over iterations
+ recon_x_log = recon_x_log.view(len(sequences_to_score),len(reference_seq),len(alphabet))
+ EVE_log_prior = torch.ones(len(sequences_to_score),full_sequence_len,len(alphabet)+5, dtype=EVE_model.dtype, device=EVE_model.device) * (- np.inf)
+ reindexed_focus_cols = [MSA_start+position for position in EVE_MSA.focus_cols]
+ EVE_log_prior[:,reindexed_focus_cols,5:] = recon_x_log
+ if verbose: print("Target seq len is {}, MSA length is {}, start position is {}, end position is {} and log_prior shape is: {}".format(len(reference_seq),MSA_end-MSA_start,MSA_start,MSA_end,EVE_log_prior.shape))
+ EVE_log_prior = EVE_log_prior.squeeze() #if only 1 sequence to score, drops the first dimension
+ return EVE_log_prior
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ labels=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ flip=None,
+ start_slice=None,
+ end_slice=None,
+ mutated_sequence=None,
+ sliced_mutated_sequence=None,
+ retrieval_aggregation_mode=None,
+ inference_time_retrieval_type=None
+ ):
+ r"""
+ labels (:obj:`torch.LongTensor` of shape :obj:`(batch_size, sequence_length)`, `optional`):
+ Labels for language modeling. Note that the labels **are shifted** inside the model, i.e. you can set
+ ``labels = input_ids`` Indices are selected in ``[-100, 0, ..., config.vocab_size]`` All labels set to
+ ``-100`` are ignored (masked), the loss is only computed for labels in ``[0, ..., config.vocab_size]``
+ """
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+ retrieval_aggregation_mode = retrieval_aggregation_mode if retrieval_aggregation_mode is not None else self.retrieval_aggregation_mode
+ inference_time_retrieval_type = inference_time_retrieval_type if inference_time_retrieval_type is not None else self.inference_time_retrieval_type
+
+ transformer_outputs = self.transformer(
+ input_ids,
+ past_key_values=past_key_values,
+ attention_mask=attention_mask,
+ token_type_ids=token_type_ids,
+ position_ids=position_ids,
+ head_mask=head_mask,
+ inputs_embeds=inputs_embeds,
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ output_hidden_states=output_hidden_states,
+ return_dict=return_dict
+ )
+ hidden_states = transformer_outputs[0]
+
+ # Set device for model parallelism
+ if self.model_parallel:
+ torch.cuda.set_device(self.transformer.first_device)
+ hidden_states = hidden_states.to(self.lm_head.weight.device)
+ self.MSA_log_prior = self.MSA_log_prior.to(self.lm_head.weight.device)
+
+ lm_logits = self.lm_head(hidden_states)
+
+ loss = None
+ if labels is not None:
+ # Shift so that tokens < n predict n
+ shift_logits = lm_logits[..., :-1, :].contiguous()
+ shift_labels = labels[..., 1:].contiguous().to(shift_logits.device)
+
+ if retrieval_aggregation_mode is not None:
+ batch_size = input_ids.size(0)
+ if retrieval_aggregation_mode=="aggregate_indel":
+ assert batch_size==1, "Aggregate indel is only supported for batch size of 1"
+ truncated_sequence_text = mutated_sequence[0][start_slice[0]:end_slice[0]]
+ if len(truncated_sequence_text)!=shift_logits.shape[1]-1: # shift_logits only has one extra token compared to truncated_sequence_text (the BOS token)
+ print("Tokenization error -- seq length: {} and shift_logits length - 1 : {}".format(len(truncated_sequence_text),shift_logits.shape[1]-1))
+ try:
+ MSA_log_prior, MSA_start, MSA_end, keep_column, new_column = msa_utils.update_retrieved_MSA_log_prior_indel(self, self.MSA_log_prior, self.MSA_start, self.MSA_end, mutated_sequence[0], self.clustal_hash)
+ except:
+ print("Issue processing the following sequence: {}".format(mutated_sequence[0]))
+ if inference_time_retrieval_type == "TranceptEVE": EVE_log_prior = msa_utils.update_retrieved_MSA_log_prior_indel(self, self.EVE_log_prior, self.MSA_start, self.MSA_end, mutated_sequence[0], self.clustal_hash_eve)[0]
+ elif retrieval_aggregation_mode=="aggregate_substitution":
+ MSA_log_prior=self.MSA_log_prior
+ MSA_start=self.MSA_start
+ MSA_end=self.MSA_end
+ if inference_time_retrieval_type == "TranceptEVE": EVE_log_prior=self.EVE_log_prior
+
+ shift_log_probas = torch.log_softmax(shift_logits,dim=-1)
+ fused_shift_log_probas = shift_log_probas.clone()
+ if flip is None:
+ flip = torch.zeros(batch_size).to(fused_shift_log_probas.device)
+ flip = flip > 0
+
+ for seq_index in range(batch_size):
+ if MSA_start < end_slice[seq_index] and MSA_end > start_slice[seq_index]: #first check whether the MSA region is even in the sliced interval
+ min_prior_slice = max(start_slice[seq_index], MSA_start)
+ max_prior_slice = min(end_slice[seq_index], MSA_end)
+ else: #If there is no overlap, there is no averaging with the MSA / retrieval
+ continue
+ if max_prior_slice <= min_prior_slice:
+ print("Non overlapping region detected: min_prior_slice {} and max_prior_slice {}".format(min_prior_slice,max_prior_slice))
+ continue
+ slice_MSA_prior = MSA_log_prior[min_prior_slice:max_prior_slice,:].to(fused_shift_log_probas.device)
+ if inference_time_retrieval_type == "TranceptEVE": slice_EVE_prior = EVE_log_prior[min_prior_slice:max_prior_slice,:].to(fused_shift_log_probas.device)
+
+ if flip[seq_index]:
+ slice_MSA_prior = torch.flip(slice_MSA_prior,dims=(0,))
+ if inference_time_retrieval_type == "TranceptEVE": slice_EVE_prior = torch.flip(slice_EVE_prior,dims=(0,))
+ min_logits_slice = max(0,end_slice[seq_index]-MSA_end)
+ max_logits_slice = min_logits_slice + (max_prior_slice-min_prior_slice)
+ else:
+ min_logits_slice = max(0, MSA_start-start_slice[seq_index])
+ max_logits_slice = min_logits_slice + (max_prior_slice-min_prior_slice)
+
+ if inference_time_retrieval_type=="Tranception":
+ fused_shift_log_probas[seq_index,min_logits_slice:max_logits_slice,5:] = (1-self.retrieval_inference_MSA_weight) * shift_log_probas[seq_index,min_logits_slice:max_logits_slice,5:] + self.retrieval_inference_MSA_weight * slice_MSA_prior[...,5:]
+ elif inference_time_retrieval_type=="TranceptEVE":
+ fused_shift_log_probas[seq_index,min_logits_slice:max_logits_slice,5:] = (1-self.retrieval_inference_EVE_weight) * ((1-self.retrieval_inference_MSA_weight) * shift_log_probas[seq_index,min_logits_slice:max_logits_slice,5:] + self.retrieval_inference_MSA_weight * slice_MSA_prior[...,5:]) + self.retrieval_inference_EVE_weight * slice_EVE_prior[...,5:]
+ else:
+ print("inference_time_retrieval_type not recognized")
+ sys.exit(0)
+
+ if inference_time_retrieval_type=="TranceptEVE" and self.config.MSA_threshold_focus_cols_frac_gaps<1.0 and retrieval_aggregation_mode!="aggregate_indel":
+ reindexed_non_focus_cols_in_shift_log_probas_coordinates = [ix for ix in range(fused_shift_log_probas.shape[1]) if fused_shift_log_probas[seq_index,ix,5:].min() == (- np.inf)]
+ reindexed_non_focus_cols_in_full_seq_coordinates = [ix + start_slice[seq_index] for ix in reindexed_non_focus_cols_in_shift_log_probas_coordinates]
+ reindexed_non_focus_cols_in_full_seq_coordinates_in_MSA_overlap = [ix for ix in reindexed_non_focus_cols_in_full_seq_coordinates if ix >= MSA_start and ix < MSA_end]
+
+ reindexed_non_focus_cols_in_slice_MSA_coordinates_in_MSA_overlap = [ix - min_prior_slice for ix in reindexed_non_focus_cols_in_full_seq_coordinates_in_MSA_overlap]
+ reindexed_non_focus_cols_in_shift_log_probas_coordinates_in_MSA_overlap = [ix - start_slice[seq_index] for ix in reindexed_non_focus_cols_in_full_seq_coordinates_in_MSA_overlap]
+ reindexed_non_focus_cols_in_shift_log_probas_coordinates_not_in_MSA_overlap = [ix - start_slice[seq_index] for ix in reindexed_non_focus_cols_in_full_seq_coordinates if ix not in reindexed_non_focus_cols_in_full_seq_coordinates_in_MSA_overlap]
+
+ #If positions to remove are in the MSA range, we leverage the MSA prior
+ fused_shift_log_probas[seq_index,reindexed_non_focus_cols_in_shift_log_probas_coordinates_in_MSA_overlap,5:]=(1-self.retrieval_inference_MSA_weight) * shift_log_probas[seq_index,reindexed_non_focus_cols_in_shift_log_probas_coordinates_in_MSA_overlap,5:] + self.retrieval_inference_MSA_weight * slice_MSA_prior[reindexed_non_focus_cols_in_slice_MSA_coordinates_in_MSA_overlap,5:]
+ #Otherwise we fully rely on the autoregressive transformer predictions
+ fused_shift_log_probas[seq_index,reindexed_non_focus_cols_in_shift_log_probas_coordinates_not_in_MSA_overlap,5:]=(1-self.retrieval_inference_MSA_weight) * shift_log_probas[seq_index,reindexed_non_focus_cols_in_shift_log_probas_coordinates_not_in_MSA_overlap,5:]
+
+ if retrieval_aggregation_mode=="aggregate_indel":
+ try:
+ # If a given residue column is an added zero-column, then we overwrite prior fusion and only predict based on the autoregressive transformer inference mode.
+ inserted_retrieval_positions = [True if slice_MSA_prior[i].sum()==0 else False for i in range(len(slice_MSA_prior))]+[True] #Last True is for the end of sentence token
+ fused_shift_log_probas[:,inserted_retrieval_positions,:]=shift_log_probas[:,inserted_retrieval_positions,:]
+ except:
+ print("Error when adding zero column(s) to account for insertion mutations.")
+ loss_fct = NLLLoss(reduction='none')
+ loss = loss_fct(input=fused_shift_log_probas.view(-1, fused_shift_log_probas.size(-1)), target=shift_labels.view(-1)).view(fused_shift_log_probas.shape[0],fused_shift_log_probas.shape[1])
+ mask = attention_mask[..., 1:].float()
+ mask[mask==0]=float('nan')
+ loss *= mask
+ loss = nanmean(loss, dim=1).mean()
+ else:
+ loss_fct = CrossEntropyLoss()
+ loss = loss_fct(shift_logits.view(-1, shift_logits.size(-1)), shift_labels.view(-1))
+ fused_shift_log_probas = None
+
+ if not return_dict:
+ output = (lm_logits,) + transformer_outputs[1:]
+ return ((loss,) + output) if loss is not None else output
+
+ return TranceptionCausalLMOutputWithCrossAttentions(
+ loss=loss,
+ logits=lm_logits,
+ past_key_values=transformer_outputs.past_key_values,
+ hidden_states=transformer_outputs.hidden_states,
+ attentions=transformer_outputs.attentions,
+ cross_attentions=transformer_outputs.cross_attentions,
+ fused_shift_log_probas=fused_shift_log_probas
+ )
+
+ @staticmethod
+ def _reorder_cache(past: Tuple[Tuple[torch.Tensor]], beam_idx: torch.Tensor) -> Tuple[Tuple[torch.Tensor]]:
+ """
+ This function is used to re-order the :obj:`past_key_values` cache if
+ :meth:`~transformers.PreTrainedModel.beam_search` or :meth:`~transformers.PreTrainedModel.beam_sample` is
+ called. This is required to match :obj:`past_key_values` with the correct beam_idx at every generation step.
+ """
+ return tuple(
+ tuple(past_state.index_select(0, beam_idx.to(past_state.device)) for past_state in layer_past)
+ for layer_past in past
+ )
+
+ def score_mutants(self, DMS_data, target_seq=None, scoring_mirror=True, batch_size_inference=10, num_workers=10, indel_mode=False):
+ """
+ Method to score mutants in an input DMS file.
+ DMS_data: (dataframe) Dataframe containing the list of mutated sequences for scoring.
+ target_seq: (string) Full reference sequence (wild type) that is mutated in the DMS assay. If not None, returned scores are delta log likelihood wrt that sequence.
+ scoring_mirror: (bool) Whether to score mutated sequences from both directions (Left->Right and Right->Left).
+ batch_size_inference: (int) Batch size for scoring.
+ num_workers: (int) Number of workers to be used in the data loader.
+ indel_mode: (bool) Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions.
+ """
+ df = DMS_data.copy()
+ if self.config.MSA_recalibrate_probas: self.recalibrate_MSA_probas()
+ if self.config.EVE_recalibrate_probas: self.recalibrate_EVE_probas()
+ if ('mutated_sequence' not in df) and (not indel_mode): df['mutated_sequence'] = df['mutant'].apply(lambda x: scoring_utils.get_mutated_sequence(target_seq, x))
+ # if indel_mode:
+ # df["mutated_sequence"] = df["mutant"]
+ assert ('mutated_sequence' in df), "DMS file to score does not have mutated_sequence column"
+ if 'mutant' not in df: df['mutant'] = df['mutated_sequence'] #if mutant not in DMS file we default to mutated_sequence
+ df = df[['mutated_sequence','mutant']]
+ if target_seq is not None:
+ df_left_to_right_slices = scoring_utils.get_sequence_slices(df, target_seq=target_seq, model_context_len = self.config.n_ctx - 2, indel_mode=indel_mode, scoring_window=self.config.scoring_window)
+ else:
+ df_left_to_right_slices = scoring_utils.get_sequence_slices(df, target_seq=list(df['mutated_sequence'])[0], model_context_len = self.config.n_ctx - 2, indel_mode=indel_mode, scoring_window='sliding')
+ print("Scoring sequences from left to right")
+ scores_L_to_R = scoring_utils.get_tranception_scores_mutated_sequences(model=self, mutated_sequence_df=df_left_to_right_slices, batch_size_inference=batch_size_inference, score_var_name='avg_score_L_to_R', target_seq=target_seq, num_workers=num_workers, indel_mode=indel_mode)
+ if scoring_mirror:
+ print("Scoring sequences from right to left")
+ df_right_to_left_slices = df_left_to_right_slices.copy()
+ df_right_to_left_slices['sliced_mutated_sequence'] = df_right_to_left_slices['sliced_mutated_sequence'].apply(lambda x: x[::-1])
+ scores_R_to_L = scoring_utils.get_tranception_scores_mutated_sequences(model=self, mutated_sequence_df=df_right_to_left_slices, batch_size_inference=batch_size_inference, score_var_name='avg_score_R_to_L', target_seq=target_seq, num_workers=num_workers, reverse=True, indel_mode=indel_mode)
+ all_scores = pd.merge(scores_L_to_R, scores_R_to_L, on='mutated_sequence', how='left', suffixes=('','_R_to_L'))
+ all_scores['avg_score'] = (all_scores['avg_score_L_to_R'] + all_scores['avg_score_R_to_L']) / 2.0
+ else:
+ all_scores = scores_L_to_R
+ all_scores['avg_score'] = all_scores['avg_score_L_to_R']
+ #By design "get_tranception_scores_mutated_sequences" drops the WT from the output. We add it back if that was one of the sequences to score in the DMS (score=0 by definition)
+ if target_seq in df.mutated_sequence.values:
+ if scoring_mirror:
+ wt_row = pd.DataFrame([[target_seq,0,0,0]], columns=['mutated_sequence','avg_score_L_to_R','avg_score_R_to_L','avg_score'])
+ else:
+ wt_row = pd.DataFrame([[target_seq,0,0]], columns=['mutated_sequence','avg_score_L_to_R','avg_score'])
+ all_scores = pd.concat([all_scores,wt_row], ignore_index=True)
+ if len(all_scores) > 0 and indel_mode==False : all_scores = pd.merge(all_scores,df,how="left",on='mutated_sequence') #Add back mutation triplet to scoring file (not needed for indels)
+ return all_scores
+
+ def encode_batch(self, protein_sequence, sequence_name="sliced_mutated_sequence"):
+ """
+ Method to process an input AA sequence batch (protein_sequence) and return a tokenized sequence (via the tokenizer associated to the model).
+ """
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='X', char_replacements='ACDEFGHIKLMNPQRSTVWY')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='B', char_replacements='DN')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='J', char_replacements='IL')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='Z', char_replacements='EQ')
+ return self.config.tokenizer(list(protein_sequence[sequence_name]), add_special_tokens=True, truncation=True, padding=True, max_length=self.config.n_ctx)
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/outputs.py b/proteingym/baselines/trancepteve/trancepteve/outputs.py
new file mode 100644
index 0000000..889b7bb
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/outputs.py
@@ -0,0 +1,47 @@
+from dataclasses import dataclass
+from typing import Optional, Tuple
+
+import torch
+
+from transformers.file_utils import ModelOutput
+
+@dataclass
+class TranceptionCausalLMOutputWithCrossAttentions(ModelOutput):
+ """
+ Class for Tranception causal language model (or autoregressive) outputs.
+ Args:
+ loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `labels` is provided):
+ Language modeling loss (for next-token prediction).
+ logits (`torch.FloatTensor` of shape `(batch_size, sequence_length, config.vocab_size)`):
+ Prediction scores of the language modeling head (scores for each vocabulary token before SoftMax).
+ hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
+ Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each layer) of
+ shape `(batch_size, sequence_length, hidden_size)`.
+ Hidden-states of the model at the output of each layer plus the initial embedding outputs.
+ attentions (`tuple(torch.FloatTensor)`, *optional*, returned when `output_attentions=True` is passed or when `config.output_attentions=True`):
+ Tuple of `torch.FloatTensor` (one for each layer) of shape `(batch_size, num_heads, sequence_length,
+ sequence_length)`.
+ Attentions weights after the attention softmax, used to compute the weighted average in the self-attention
+ heads.
+ cross_attentions (`tuple(torch.FloatTensor)`, *optional*, returned when `output_attentions=True` is passed or when `config.output_attentions=True`):
+ Tuple of `torch.FloatTensor` (one for each layer) of shape `(batch_size, num_heads, sequence_length,
+ sequence_length)`.
+ Cross attentions weights after the attention softmax, used to compute the weighted average in the
+ cross-attention heads.
+ past_key_values (`tuple(tuple(torch.FloatTensor))`, *optional*, returned when `use_cache=True` is passed or when `config.use_cache=True`):
+ Tuple of `torch.FloatTensor` tuples of length `config.n_layers`, with each tuple containing the cached key,
+ value states of the self-attention and the cross-attention layers if model is used in encoder-decoder
+ setting. Only relevant if `config.is_decoder = True`.
+ Contains pre-computed hidden-states (key and values in the attention blocks) that can be used (see
+ `past_key_values` input) to speed up sequential decoding.
+ fused_shift_log_probas (`torch.FloatTensor` of shape (batch_size, sequence_length, config.vocab_size), *optional*, returned when config.retrieval_aggregation_mode is not None.
+ log_probas for each residue position after aggregating autoregressive logits and retrieval logits.
+ """
+
+ loss: Optional[torch.FloatTensor] = None
+ logits: torch.FloatTensor = None
+ past_key_values: Optional[Tuple[Tuple[torch.FloatTensor]]] = None
+ hidden_states: Optional[Tuple[torch.FloatTensor]] = None
+ attentions: Optional[Tuple[torch.FloatTensor]] = None
+ cross_attentions: Optional[Tuple[torch.FloatTensor]] = None
+ fused_shift_log_probas: Optional[torch.FloatTensor] = None
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/__init__.py b/proteingym/baselines/trancepteve/trancepteve/utils/__init__.py
new file mode 100644
index 0000000..90ad887
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/__init__.py
@@ -0,0 +1 @@
+from . import scoring_utils, msa_utils
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/dms_utils.py b/proteingym/baselines/trancepteve/trancepteve/utils/dms_utils.py
new file mode 100644
index 0000000..fd2ddee
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/dms_utils.py
@@ -0,0 +1,30 @@
+import pandas as pd
+import numpy as np
+from trancepteve.utils import scoring_utils
+
+def DMS_file_cleanup(DMS_filename, target_seq, start_idx=1, end_idx=None, DMS_mutant_column='mutant', DMS_phenotype_name='score', DMS_directionality=1, AA_vocab = "ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Function to process the raw substitution DMS assay data (eg., removing invalid mutants, aggregate silent mutations).
+ """
+ DMS_data = pd.read_csv(DMS_filename, low_memory=False)
+ end_idx = start_idx + len(target_seq) - 1 if end_idx is None else end_idx
+ DMS_data['mutant'] = DMS_data[DMS_mutant_column]
+
+ DMS_data=DMS_data[DMS_data['mutant'].notnull()].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([len(y)>=3 for y in x.split(":")]))].copy() #Mutant triplets should have at least 3 or more characters
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([(y[0] in AA_vocab) and (y[1:-1].isnumeric()) and (y[-1] in AA_vocab) for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([int(y[1:-1])-start_idx >=0 and int(y[1:-1]) <= end_idx for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([y[0]==target_seq[int(y[1:-1])-start_idx] for y in x.split(":")]))].copy()
+
+ DMS_data[DMS_phenotype_name]=pd.to_numeric(DMS_data[DMS_phenotype_name],errors='coerce')
+ DMS_data=DMS_data[np.isfinite(DMS_data[DMS_phenotype_name])]
+ DMS_data.dropna(subset = [DMS_phenotype_name], inplace=True)
+ DMS_data['DMS_score'] = DMS_data[DMS_phenotype_name] * DMS_directionality
+ DMS_data=DMS_data[['mutant','DMS_score']]
+ DMS_data=DMS_data.groupby('mutant').mean().reset_index()
+
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: scoring_utils.get_mutated_sequence(target_seq, x))
+ DMS_data=DMS_data[['mutant','mutated_sequence','DMS_score']]
+
+ return DMS_data
+
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json b/proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json
new file mode 100644
index 0000000..4b8e910
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json
@@ -0,0 +1,41 @@
+{ "encoder_parameters": {
+ "hidden_layers_sizes" : [2000,1000,300],
+ "z_dim" : 50,
+ "convolve_input" : false,
+ "convolution_input_depth" : 40,
+ "nonlinear_activation" : "relu",
+ "dropout_proba" : 0.0
+},
+"decoder_parameters": {
+ "hidden_layers_sizes" : [300,1000,2000],
+ "z_dim" : 50,
+ "bayesian_decoder" : true,
+ "first_hidden_nonlinearity" : "relu",
+ "last_hidden_nonlinearity" : "relu",
+ "dropout_proba" : 0.1,
+ "convolve_output" : true,
+ "convolution_output_depth" : 40,
+ "include_temperature_scaler" : true,
+ "include_sparsity" : false,
+ "num_tiles_sparsity" : 0,
+ "logit_sparsity_p" : 0
+},
+"training_parameters": {
+ "num_training_steps" : 400000,
+ "learning_rate" : 1e-4,
+ "batch_size" : 256,
+ "annealing_warm_up" : 0,
+ "kl_latent_scale" : 1.0,
+ "kl_global_params_scale" : 1.0,
+ "l2_regularization" : 0.0,
+ "use_lr_scheduler" : false,
+ "use_validation_set" : false,
+ "validation_set_pct" : 0.2,
+ "validation_freq" : 1000,
+ "log_training_info" : true,
+ "log_training_freq" : 1000,
+ "save_model_params_freq" : 500000
+}
+}
+
+
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/msa_utils.py b/proteingym/baselines/trancepteve/trancepteve/utils/msa_utils.py
new file mode 100644
index 0000000..2c4d3f3
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/msa_utils.py
@@ -0,0 +1,397 @@
+import numpy as np
+import pandas as pd
+from collections import defaultdict
+import random
+import os
+import torch
+from Bio.Align.Applications import ClustalOmegaCommandline
+
+def filter_msa(msa_data, num_sequences_kept=3):
+ """
+ Helper function to filter an input MSA msa_data (obtained via process_msa_data) and keep only num_sequences_kept aligned sequences.
+ If the MSA already has fewer sequences than num_sequences_kept, we keep the MSA as is.
+ If filtering, we always keep the first sequence of the MSA (ie. the wild type) by default.
+ Sampling is done without replacement.
+ """
+ if len(list(msa_data.keys())) <= num_sequences_kept:
+ return msa_data
+ filtered_msa = {}
+ wt_name = next(iter(msa_data))
+ filtered_msa[wt_name] = msa_data[wt_name]
+ del msa_data[wt_name]
+ sequence_names = list(msa_data.keys())
+ sequence_names_sampled = random.sample(sequence_names,k=num_sequences_kept-1)
+ for seq in sequence_names_sampled:
+ filtered_msa[seq] = msa_data[seq]
+ return filtered_msa
+
+def process_msa_data(MSA_data_file):
+ """
+ Helper function that takes as input a path to a MSA file (expects a2m format) and returns a dict mapping sequence ID to the corresponding AA sequence.
+ """
+ msa_data = defaultdict(str)
+ sequence_name = ""
+ with open(MSA_data_file, "r") as msa_file:
+ for i, line in enumerate(msa_file):
+ line = line.rstrip()
+ if line.startswith(">"):
+ sequence_name = line
+ else:
+ msa_data[sequence_name] += line.upper()
+ return msa_data
+
+def get_one_hot_sequences_dict(msa_data,MSA_start,MSA_end,vocab):
+ vocab_size = len(vocab.keys())
+ num_sequences_msa = len(msa_data.keys())
+ one_hots = np.zeros((num_sequences_msa,MSA_end-MSA_start,vocab_size))
+ for i,seq_name in enumerate(msa_data.keys()):
+ sequence = msa_data[seq_name]
+ for j,letter in enumerate(sequence):
+ if letter in vocab:
+ k = vocab[letter]
+ one_hots[i,j,k] = 1.0
+ return one_hots
+
+def one_hot(sequence_string,vocab):
+ one_hots = np.zeros((len(sequence_string),len(vocab.keys())))
+ for j,letter in enumerate(sequence_string):
+ if letter in vocab:
+ k = vocab[letter]
+ one_hots[j,k] = 1.0
+ return one_hots.flatten()
+
+def get_msa_prior(MSA_data_file, MSA_weight_file_name, MSA_start, MSA_end, len_target_seq, vocab, retrieval_aggregation_mode="aggregate_substitution", filter_MSA=True, verbose=False, threshold_sequence_frac_gaps=None, threshold_focus_cols_frac_gaps=None):
+ """
+ Function to enable retrieval inference mode, via computation of (weighted) pseudocounts of AAs at each position of the retrieved MSA.
+ MSA_data_file: (string) path to MSA file (expects a2m format).
+ MSA_weight_file_name: (string) path to sequence weights in MSA.
+ MSA_start: (int) Sequence position that the MSA starts at (1-indexing).
+ MSA_end: (int) Sequence position that the MSA ends at (1-indexing).
+ len_target_seq: (int) Full length of sequence to be scored.
+ vocab: (dict) Vocabulary of the tokenizer.
+ retrieval_aggregation_mode: (string) Mode for retrieval inference (aggregate_substitution Vs aggregate_indel). If None, places a uniform prior over each token.
+ filter_MSA: (bool) Whether to filter out sequences with very low hamming similarity (< 0.2) to the reference sequence in the MSA (first sequence).
+ verbose: (bool) Whether to print to the console processing details along the way.
+ """
+ msa_data = process_msa_data(MSA_data_file)
+ vocab_size = len(vocab.keys())
+ if verbose: print("Target seq len is {}, MSA length is {}, start position is {}, end position is {} and vocab size is {}".format(len_target_seq,MSA_end-MSA_start,MSA_start,MSA_end,vocab_size))
+
+ if filter_MSA:
+ if verbose: print("Num sequences in MSA pre filtering: {}".format(len(msa_data.keys())))
+ list_sequence_names = list(msa_data.keys())
+ focus_sequence_name = list(msa_data.keys())[0]
+ ref_sequence_hot = one_hot(msa_data[focus_sequence_name],vocab)
+ for sequence_name in list_sequence_names:
+ seq_hot = one_hot(msa_data[sequence_name],vocab)
+ hamming_similarity_seq_ref = np.dot(ref_sequence_hot,seq_hot) / np.dot(ref_sequence_hot,ref_sequence_hot)
+ if hamming_similarity_seq_ref < 0.2:
+ del msa_data[sequence_name]
+ if verbose: print("Num sequences in MSA post filtering: {}".format(len(msa_data.keys())))
+
+ if MSA_weight_file_name is not None:
+ if verbose: print("Using weights in {} for sequences in MSA.".format(MSA_weight_file_name))
+ #assert os.path.exists(MSA_weight_file_name), "Weights file not located on disk."
+ #if threshold_focus_cols_frac_gaps!=1.0: print("threshold_focus_cols_frac_gaps not 1.0. Only well-covered positions are factored in the MSA retrieval aggregation.")
+ MSA_EVE = MSA_processing(
+ MSA_location=MSA_data_file,
+ use_weights=True,
+ threshold_sequence_frac_gaps=threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=1.0, #threshold_focus_cols_frac_gaps, # By default we always keep all columns in the rettrieved MSA
+ weights_location=MSA_weight_file_name
+ )
+ #We scan through all sequences to see if we have a weight for them as per EVE pre-processing. We drop them otherwise.
+ dropped_sequences=0
+ list_sequence_names = list(msa_data.keys())
+ MSA_weight=[]
+ for sequence_name in list_sequence_names:
+ if sequence_name not in MSA_EVE.seq_name_to_sequence:
+ dropped_sequences +=1
+ del msa_data[sequence_name]
+ else:
+ MSA_weight.append(MSA_EVE.seq_name_to_weight[sequence_name])
+ if verbose: print("Dropped {} sequences from MSA due to absent sequence weights".format(dropped_sequences))
+ else:
+ MSA_weight = [1] * len(list(msa_data.keys()))
+
+ processed_MSA_depth = len(list(msa_data.keys()))
+
+ if retrieval_aggregation_mode=="aggregate_substitution" or retrieval_aggregation_mode=="aggregate_indel":
+ one_hots = get_one_hot_sequences_dict(msa_data,MSA_start,MSA_end,vocab)
+ MSA_weight = np.expand_dims(np.array(MSA_weight),axis=(1,2))
+ base_rate = 1e-5
+ base_rates = np.ones_like(one_hots) * base_rate
+ weighted_one_hots = (one_hots + base_rates) * MSA_weight
+ MSA_weight_norm_counts = weighted_one_hots.sum(axis=-1).sum(axis=0)
+ MSA_weight_norm_counts = np.tile(MSA_weight_norm_counts.reshape(-1,1), (1,vocab_size))
+ one_hots_avg = weighted_one_hots.sum(axis=0) / MSA_weight_norm_counts
+ msa_prior = np.zeros((len_target_seq,vocab_size))
+ msa_prior[MSA_start:MSA_end,:]=one_hots_avg
+ else:
+ msa_prior = np.ones((len_target_seq,vocab_size)) / vocab_size
+
+ if verbose:
+ for idx, position in enumerate(msa_prior):
+ if len(position)!=25:
+ print("Size error")
+ if not round(position.sum(),2)==1.0:
+ print("Position at index {} does not add up to 1: {}".format(idx, position.sum()))
+ return msa_prior, processed_MSA_depth
+
+
+def update_retrieved_MSA_log_prior_indel(model, MSA_log_prior, MSA_start, MSA_end, mutated_sequence, clustal_hash):
+ """
+ Function to process MSA when scoring indels.
+ To identify positions to add / remove in the retrieved MSA, we append and align the sequence to be scored to the original MSA for that protein family with Clustal Omega.
+ If the original MSA is relatively deep (over 100k sequences), we sample (by default) 100k rows at random from that MSA to speed computations.
+ MSA sampling is performed only once (for the first sequence to be scored). Subsequent scoring use the same MSA sample.
+ """
+ random_string = str(int(random.random()*10**10)) #Avoids conflicts if running several iterations concurrently and shared memory
+ model_id = '_'.join([str(x) for x in [model.inference_time_retrieval_type, random_string]])
+ if not os.path.isdir(model.MSA_folder + os.sep + "Sampled"):
+ os.mkdir(model.MSA_folder + os.sep + "Sampled")
+ sampled_MSA_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Sampled_" + clustal_hash + "_" + model.MSA_filename.split(os.sep)[-1]
+
+ if not os.path.exists(sampled_MSA_location):
+ msa_data = process_msa_data(model.MSA_filename)
+ msa_data_sampled = filter_msa(msa_data, num_sequences_kept=100000) #If MSA has less than 100k sequences, the sample is identical to original MSA
+ with open(sampled_MSA_location, 'w') as sampled_write_location:
+ for index, key in enumerate(msa_data_sampled):
+ key_name = ">REFERENCE_SEQUENCE" if index==0 else key
+ msa_data_sampled[key] = msa_data_sampled[key].upper()
+ msa_data_sampled[key] = msa_data_sampled[key].replace(".","-")
+ sampled_write_location.write(key_name+"\n"+"\n".join([msa_data_sampled[key][i:i+80] for i in range(0, len(msa_data_sampled[key]), 80)])+"\n")
+
+ seq_to_align_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Seq_to_align_" + clustal_hash + "_" + model_id + '_' + model.MSA_filename.split(os.sep)[-1]
+ sequence_text_split = [mutated_sequence[i:i+80] for i in range(0, len(mutated_sequence), 80)]
+ sequence_text_split_split_join = "\n".join([">SEQ_TO_SCORE"]+sequence_text_split)
+ os.system("echo '"+sequence_text_split_split_join+"' > "+seq_to_align_location)
+
+ expanded_MSA_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Expanded_" + clustal_hash + "_" + model_id + '_' + model.MSA_filename.split(os.sep)[-1]
+ clustalw_cline = ClustalOmegaCommandline(cmd=model.config.clustal_omega_location,
+ profile1=sampled_MSA_location,
+ profile2=seq_to_align_location,
+ outfile=expanded_MSA_location,
+ force=True)
+ stdout, stderr = clustalw_cline()
+ msa_data = process_msa_data(expanded_MSA_location)
+ aligned_seqA, aligned_seqB = msa_data[">SEQ_TO_SCORE"], msa_data[">REFERENCE_SEQUENCE"]
+ try:
+ keep_column=[] # Whether to keep the column (True) or not (False, ie. deletions)
+ new_column=[] # Whether to add one column (ie. insertions)
+ for column_index_pairwise_alignment in range(len(aligned_seqA)):
+ if aligned_seqA[column_index_pairwise_alignment]=="-" and aligned_seqB[column_index_pairwise_alignment]=="-":
+ continue
+ elif aligned_seqA[column_index_pairwise_alignment]=="-":
+ keep_column.append(False)
+ new_column.append(False)
+ elif aligned_seqB[column_index_pairwise_alignment]=="-":
+ MSA_log_prior=torch.cat((MSA_log_prior[:column_index_pairwise_alignment], torch.zeros(MSA_log_prior.shape[1]).view(1,-1).cuda(), MSA_log_prior[column_index_pairwise_alignment:]),dim=0)
+ keep_column.append(True) #keep the zero column we just added
+ new_column.append(True)
+ else:
+ keep_column.append(True)
+ new_column.append(False)
+ MSA_log_prior = MSA_log_prior[keep_column]
+ MSA_end = MSA_start + len(MSA_log_prior)
+ except:
+ print("Error when processing the following alignment: {}".format(expanded_MSA_location))
+ #cleanups
+ os.system("rm -f "+seq_to_align_location)
+ os.system("rm -f "+expanded_MSA_location)
+ return MSA_log_prior, MSA_start, MSA_end, keep_column, new_column
+
+def update_weight_vector_indel(weight, keep_column, new_column):
+ new_weight = []
+ num_insertions = 0
+ for position_index in range(len(keep_column)):
+ if new_column[position_index]:
+ # if the position is newly added (insertions) we add a zero weight --> Predictions at inserted positions rely fully on the autoregressive transformer
+ new_weight.append(0)
+ num_insertions+=1
+ elif keep_column[position_index]:
+ # otherwise, if the position is kept (not a deletion) we append the corresponding value from the weight tensor
+ weight_relative_index = position_index - num_insertions
+ new_weight.append(weight[weight_relative_index].item())
+ return torch.tensor(new_weight).view(-1,1).cuda()
+
+class MSA_processing:
+ def __init__(self,
+ MSA_location="",
+ theta=0.2,
+ use_weights=True,
+ weights_location=None,
+ preprocess_MSA=True,
+ threshold_sequence_frac_gaps=0.5,
+ threshold_focus_cols_frac_gaps=0.3,
+ remove_sequences_with_indeterminate_AA_in_focus_cols=True
+ ):
+
+ """
+ This MSA_processing class is directly borrowed from the EVE codebase: https://github.com/OATML-Markslab/EVE
+
+ Parameters:
+ - msa_location: (path) Location of the MSA data. Constraints on input MSA format:
+ - focus_sequence is the first one in the MSA data
+ - first line is structured as follows: ">focus_seq_name/start_pos-end_pos" (e.g., >SPIKE_SARS2/310-550)
+ - corespondding sequence data located on following line(s)
+ - then all other sequences follow with ">name" on first line, corresponding data on subsequent lines
+ - theta: (float) Sequence weighting hyperparameter. Generally: Prokaryotic and eukaryotic families = 0.2; Viruses = 0.01
+ - use_weights: (bool) If False, sets all sequence weights to 1. If True, checks weights_location -- if non empty uses that;
+ otherwise compute weights from scratch and store them at weights_location
+ - weights_location: (path) Location to load from/save to the sequence weights
+ - preprocess_MSA: (bool) performs pre-processing of MSA to remove short fragments and positions that are not well covered.
+ - threshold_sequence_frac_gaps: (float, between 0 and 1) Threshold value to define fragments
+ - sequences with a fraction of gap characters above threshold_sequence_frac_gaps are removed
+ - default is set to 0.5 (i.e., fragments with 50% or more gaps are removed)
+ - threshold_focus_cols_frac_gaps: (float, between 0 and 1) Threshold value to define focus columns
+ - positions with a fraction of gap characters above threshold_focus_cols_pct_gaps will be set to lower case (and not included in the focus_cols)
+ - default is set to 0.3 (i.e., focus positions are the ones with 30% of gaps or less, i.e., 70% or more residue occupancy)
+ - remove_sequences_with_indeterminate_AA_in_focus_cols: (bool) Remove all sequences that have indeterminate AA (e.g., B, J, X, Z) at focus positions of the wild type
+ """
+ np.random.seed(2021)
+ self.MSA_location = MSA_location
+ self.weights_location = weights_location
+ self.theta = theta
+ self.alphabet = "ACDEFGHIKLMNPQRSTVWY"
+ self.use_weights = use_weights
+ self.preprocess_MSA = preprocess_MSA
+ self.threshold_sequence_frac_gaps = threshold_sequence_frac_gaps
+ self.threshold_focus_cols_frac_gaps = threshold_focus_cols_frac_gaps
+ self.remove_sequences_with_indeterminate_AA_in_focus_cols = remove_sequences_with_indeterminate_AA_in_focus_cols
+
+ self.gen_alignment()
+
+ def gen_alignment(self, verbose=False):
+ """ Read training alignment and store basics in class instance """
+ self.aa_dict = {}
+ for i,aa in enumerate(self.alphabet):
+ self.aa_dict[aa] = i
+
+ self.seq_name_to_sequence = defaultdict(str)
+ name = ""
+ with open(self.MSA_location, "r") as msa_data:
+ for i, line in enumerate(msa_data):
+ line = line.rstrip()
+ if line.startswith(">"):
+ name = line
+ if i==0:
+ self.focus_seq_name = name
+ else:
+ self.seq_name_to_sequence[name] += line
+
+
+ ## MSA pre-processing to remove inadequate columns and sequences
+ if self.preprocess_MSA:
+ msa_df = pd.DataFrame.from_dict(self.seq_name_to_sequence, orient='index', columns=['sequence'])
+ # Data clean up
+ msa_df.sequence = msa_df.sequence.apply(lambda x: x.replace(".","-")).apply(lambda x: ''.join([aa.upper() for aa in x]))
+ # Remove columns that would be gaps in the wild type
+ non_gap_wt_cols = [aa!='-' for aa in msa_df.sequence[self.focus_seq_name]]
+ msa_df['sequence'] = msa_df['sequence'].apply(lambda x: ''.join([aa for aa,non_gap_ind in zip(x, non_gap_wt_cols) if non_gap_ind]))
+ assert 0.0 <= self.threshold_sequence_frac_gaps <= 1.0,"Invalid fragment filtering parameter"
+ assert 0.0 <= self.threshold_focus_cols_frac_gaps <= 1.0,"Invalid focus position filtering parameter"
+ msa_array = np.array([list(seq) for seq in msa_df.sequence])
+ gaps_array = np.array(list(map(lambda seq: [aa=='-' for aa in seq], msa_array)))
+ # Identify fragments with too many gaps
+ seq_gaps_frac = gaps_array.mean(axis=1)
+ seq_below_threshold = seq_gaps_frac <= self.threshold_sequence_frac_gaps
+ if verbose: print("Proportion of sequences dropped due to fraction of gaps: "+str(round(float(1 - seq_below_threshold.sum()/seq_below_threshold.shape)*100,2))+"%")
+ # Identify focus columns
+ columns_gaps_frac = gaps_array[seq_below_threshold].mean(axis=0)
+ index_cols_below_threshold = columns_gaps_frac <= self.threshold_focus_cols_frac_gaps
+ if verbose: print("Proportion of non-focus columns removed: "+str(round(float(1 - index_cols_below_threshold.sum()/index_cols_below_threshold.shape)*100,2))+"%")
+ # Lower case non focus cols and filter fragment sequences
+ msa_df['sequence'] = msa_df['sequence'].apply(lambda x: ''.join([aa.upper() if upper_case_ind else aa.lower() for aa, upper_case_ind in zip(x, index_cols_below_threshold)]))
+ msa_df = msa_df[seq_below_threshold]
+ # Overwrite seq_name_to_sequence with clean version
+ self.seq_name_to_sequence = defaultdict(str)
+ for seq_idx in range(len(msa_df['sequence'])):
+ self.seq_name_to_sequence[msa_df.index[seq_idx]] = msa_df.sequence[seq_idx]
+
+ self.focus_seq = self.seq_name_to_sequence[self.focus_seq_name]
+ self.focus_cols = [ix for ix, s in enumerate(self.focus_seq) if s == s.upper() and s!='-']
+ self.non_focus_cols = [ix for ix in range(len(self.focus_seq)) if ix not in self.focus_cols]
+ self.focus_seq_trimmed = [self.focus_seq[ix] for ix in self.focus_cols]
+ self.seq_len = len(self.focus_cols)
+ self.alphabet_size = len(self.alphabet)
+
+ # Connect local sequence index with uniprot index (index shift inferred from 1st row of MSA)
+ try:
+ focus_loc = self.focus_seq_name.split("/")[-1]
+ start,stop = focus_loc.split("-")
+ self.focus_start_loc = int(start)
+ self.focus_stop_loc = int(stop)
+ except:
+ start,stop = 1,len(self.focus_seq)
+ self.focus_start_loc = int(start)
+ self.focus_stop_loc = int(stop)
+ self.uniprot_focus_col_to_wt_aa_dict \
+ = {idx_col+int(start):self.focus_seq[idx_col] for idx_col in self.focus_cols}
+ self.uniprot_focus_col_to_focus_idx \
+ = {idx_col+int(start):idx_col for idx_col in self.focus_cols}
+
+ # Move all letters to CAPS; keeps focus columns only
+ self.raw_seq_name_to_sequence = self.seq_name_to_sequence.copy()
+ for seq_name,sequence in self.seq_name_to_sequence.items():
+ sequence = sequence.replace(".","-")
+ self.seq_name_to_sequence[seq_name] = [sequence[ix].upper() for ix in self.focus_cols]
+
+ # Remove sequences that have indeterminate AA (e.g., B, J, X, Z) in the focus columns
+ if self.remove_sequences_with_indeterminate_AA_in_focus_cols:
+ alphabet_set = set(list(self.alphabet))
+ seq_names_to_remove = []
+ for seq_name,sequence in self.seq_name_to_sequence.items():
+ for letter in sequence:
+ if letter not in alphabet_set and letter != "-":
+ seq_names_to_remove.append(seq_name)
+ continue
+ seq_names_to_remove = list(set(seq_names_to_remove))
+ for seq_name in seq_names_to_remove:
+ del self.seq_name_to_sequence[seq_name]
+
+ # Encode the sequences
+ self.one_hot_encoding = np.zeros((len(self.seq_name_to_sequence.keys()),len(self.focus_cols),len(self.alphabet)))
+ if verbose: print("One-hot encoded sequences shape:" + str(self.one_hot_encoding.shape))
+ for i,seq_name in enumerate(self.seq_name_to_sequence.keys()):
+ sequence = self.seq_name_to_sequence[seq_name]
+ for j,letter in enumerate(sequence):
+ if letter in self.aa_dict:
+ k = self.aa_dict[letter]
+ self.one_hot_encoding[i,j,k] = 1.0
+
+ if self.use_weights:
+ if (self.weights_location is not None) and (not os.path.isfile(self.weights_location)):
+ print("Provided weights location is invalid")
+ sys.exit(0)
+ try:
+ self.weights = np.load(file=self.weights_location)
+ if verbose: print("Loaded sequence weights from disk")
+ except:
+ if verbose: print ("Computing sequence weights")
+ list_seq = self.one_hot_encoding
+ list_seq = list_seq.reshape((list_seq.shape[0], list_seq.shape[1] * list_seq.shape[2]))
+ def compute_weight(seq):
+ number_non_empty_positions = np.dot(seq,seq)
+ if number_non_empty_positions>0:
+ denom = np.dot(list_seq,seq) / np.dot(seq,seq)
+ denom = np.sum(denom > 1 - self.theta)
+ return 1/denom
+ else:
+ return 0.0 #return 0 weight if sequence is fully empty
+ self.weights = np.array(list(map(compute_weight,list_seq)))
+ np.save(file=self.weights_location, arr=self.weights)
+ else:
+ # If not using weights, use an isotropic weight matrix
+ if verbose: print("Not weighting sequence data")
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+
+ self.Neff = np.sum(self.weights)
+ self.num_sequences = self.one_hot_encoding.shape[0]
+ self.seq_name_to_weight={}
+ for i,seq_name in enumerate(self.seq_name_to_sequence.keys()):
+ self.seq_name_to_weight[seq_name]=self.weights[i]
+
+ if verbose:
+ print ("Neff =",str(self.Neff))
+ print ("Data Shape =",self.one_hot_encoding.shape)
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/scoring_utils.py b/proteingym/baselines/trancepteve/trancepteve/utils/scoring_utils.py
new file mode 100644
index 0000000..6d084da
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/scoring_utils.py
@@ -0,0 +1,224 @@
+import os
+import tqdm
+import re
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss, NLLLoss
+from torch.utils.data.sampler import Sampler, SequentialSampler
+import torch.nn.functional as F
+
+from transformers import DataCollatorForLanguageModeling, PreTrainedTokenizerFast
+from datasets import Dataset,logging
+logging.set_verbosity_error()
+
+AA_vocab = "ACDEFGHIKLMNPQRSTVWY"
+
+def entropy(x, ignore_tokenizer_characters=True):
+ """
+ Compute entropy over the last dimension of tensor x (assumes it is a log softmax input)
+ """
+ exp_x = torch.exp(x.float())
+ if ignore_tokenizer_characters:
+ entropy = (- exp_x[...,5:]*x[...,5:]).mean(dim=-1)
+ else:
+ entropy = (- exp_x*x).mean(dim=-1)
+ return entropy
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab=AA_vocab):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def nanmean(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs) / (~is_nan).float().sum(*args, **kwargs)
+
+def nansum(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs)
+
+def get_optimal_window(mutation_position_relative, seq_len_wo_special, model_window):
+ """
+ Helper function that selects an optimal sequence window that fits the maximum model context size.
+ If the sequence length is less than the maximum context size, the full sequence is returned.
+ """
+ half_model_window = model_window // 2
+ if seq_len_wo_special <= model_window:
+ return [0,seq_len_wo_special]
+ elif mutation_position_relative < half_model_window:
+ return [0,model_window]
+ elif mutation_position_relative >= seq_len_wo_special - half_model_window:
+ return [seq_len_wo_special - model_window, seq_len_wo_special]
+ else:
+ return [max(0,mutation_position_relative-half_model_window), min(seq_len_wo_special,mutation_position_relative+half_model_window)]
+
+def sequence_replace_single(sequence, char_to_replace, char_replacements):
+ char_replacements = list(char_replacements)
+ positions = [m.start() for m in re.finditer(char_to_replace, sequence)]
+ replacements = np.random.choice(a=char_replacements, size=len(positions), replace=True)
+ sequence=list(sequence)
+ for idx, position in enumerate(positions):
+ sequence[position]=replacements[idx]
+ return ''.join(sequence)
+
+def sequence_replace(sequences, char_to_replace, char_replacements):
+ """
+ Helper function that replaces all Amino Acids passsed in via char_to_replace (as a string of AAs) with Amino Acids sampled from char_replacements (also a string of eligible AAs).
+ """
+ return [sequence_replace_single(sequence, char_to_replace, char_replacements) for sequence in sequences]
+
+def get_tranception_scores_mutated_sequences(model, mutated_sequence_df, batch_size_inference, score_var_name, target_seq, num_workers=10, reverse=False, indel_mode=False, scoring_window=None):
+ """
+ Helper function that takes as input a set of mutated sequences (in a pandas dataframe) and returns scores for each mutation.
+ If target_seq is not None, returns the delta log likelihood wrt that target sequence -- otherwise returns the log likelihood of the protein sequences.
+ """
+ scoring_window = scoring_window if scoring_window is not None else model.config.scoring_window
+ model.eval() #Enforce eval mode just in case
+ scores = {}
+ scores['mutated_sequence']=[]
+ scores['sliced_mutated_sequence']=[]
+ scores['window_start']=[]
+ scores['window_end']=[]
+ scores['score']=[]
+ with torch.no_grad():
+ ds = Dataset.from_pandas(mutated_sequence_df)
+ ds.set_transform(model.encode_batch)
+ data_collator = DataCollatorForLanguageModeling(
+ tokenizer=model.config.tokenizer,
+ mlm=False)
+ sampler = SequentialSampler(ds)
+ ds_loader = torch.utils.data.DataLoader(ds, batch_size=batch_size_inference, sampler=sampler, collate_fn=data_collator, num_workers=num_workers, pin_memory=True, drop_last=False)
+ mutant_index=0
+ for encoded_batch in tqdm.tqdm(ds_loader):
+ full_batch_length = len(encoded_batch['input_ids'])
+ mutated_sequence = np.array(mutated_sequence_df['mutated_sequence'][mutant_index:mutant_index+full_batch_length])
+ scores['mutated_sequence'] += list(mutated_sequence)
+ sliced_mutated_sequence = np.array(mutated_sequence_df['sliced_mutated_sequence'][mutant_index:mutant_index+full_batch_length])
+ scores['sliced_mutated_sequence'] += list(sliced_mutated_sequence)
+ window_start = np.array(mutated_sequence_df['window_start'][mutant_index:mutant_index+full_batch_length])
+ scores['window_start'] += list(window_start)
+ window_end = np.array(mutated_sequence_df['window_end'][mutant_index:mutant_index+full_batch_length])
+ scores['window_end'] += list(window_end)
+ for k, v in encoded_batch.items():
+ if isinstance(v, torch.Tensor):
+ encoded_batch[k] = v.to(model.device)
+ shift_labels = encoded_batch['labels'][..., 1:].contiguous()
+ if (hasattr(model.config,"retrieval_aggregation_mode")) and (model.config.retrieval_aggregation_mode is not None):
+ if reverse:
+ encoded_batch['flip']=torch.tensor([1]*full_batch_length)
+ encoded_batch['start_slice']=window_start
+ encoded_batch['end_slice']=window_end
+ encoded_batch['mutated_sequence'] = mutated_sequence #only mutated_sequence is flipped if the scoring_mirror branch of score_mutants. No need to flip mutated_sequence for MSA re-aligning
+ encoded_batch['sliced_mutated_sequence'] = sliced_mutated_sequence
+ fused_shift_log_probas=model(**encoded_batch,return_dict=True).fused_shift_log_probas
+ loss_fct = NLLLoss(reduction='none')
+ loss = - loss_fct(input=fused_shift_log_probas.view(-1, fused_shift_log_probas.size(-1)), target=shift_labels.view(-1)).view(fused_shift_log_probas.shape[0],fused_shift_log_probas.shape[1])
+ else:
+ lm_logits=model(**encoded_batch,return_dict=True).logits
+ shift_logits = lm_logits[..., :-1, :].contiguous()
+ loss_fct = NLLLoss(reduction='none')
+ shift_log_probas = torch.log_softmax(shift_logits, dim=-1)
+ loss = - loss_fct(input=shift_log_probas.view(-1, shift_log_probas.size(-1)), target=shift_labels.view(-1)).view(shift_log_probas.shape[0],shift_log_probas.shape[1])
+ mask = encoded_batch['attention_mask'][..., 1:].float()
+ mask[mask==0]=float('nan')
+ loss *= mask
+ loss = nansum(loss, dim=1)
+ scores_batch = list(loss.cpu().numpy())
+ full_batch_length = len(encoded_batch['input_ids'])
+ scores['score'] += scores_batch
+ mutant_index+=full_batch_length
+ scores = pd.DataFrame(scores)
+ if scoring_window=="sliding":
+ scores = scores[['mutated_sequence','score']].groupby('mutated_sequence').sum().reset_index() #We need to aggregate scores when using sliding mode
+ #Normalization by sequence length
+ scores['score'] = scores['score'] / scores['mutated_sequence'].map(lambda x: len(x))
+ if target_seq is not None:
+ scores_mutated_seq = scores[scores.mutated_sequence != target_seq]
+ scores_wt = scores[scores.mutated_sequence == target_seq]
+ merge_delta = 'mutated_sequence' if scoring_window=="sliding" else 'window_start'
+ if scoring_window=="optimal":
+ delta_scores = pd.merge(scores_mutated_seq,scores_wt,how='left',on=[merge_delta],suffixes=('','_wt'))
+ delta_scores[score_var_name] = delta_scores['score'] - delta_scores['score_wt']
+ elif scoring_window=="sliding":
+ delta_scores = scores_mutated_seq.copy()
+ delta_scores[score_var_name] = delta_scores['score'] - list(scores_wt['score'])[0] # In sliding mode there is a single reference window for the WT
+ elif scoring_window=="raw_score":
+ delta_scores = scores_mutated_seq
+ delta_scores[score_var_name] = delta_scores['score']
+ return delta_scores[['mutated_sequence',score_var_name]]
+ else:
+ scores[score_var_name] = scores['score']
+ return scores[['mutated_sequence',score_var_name]]
+
+def get_sequence_slices(df, target_seq, model_context_len, start_idx=1, scoring_window="optimal", indel_mode=False):
+ """
+ Helper function that takes as input a (pandas) dataframe df that contains a list of mutant triplets (substitutions) or full mutated sequences (indels) for scoring.
+ It returns a processed DMS in which sequences have been sliced to satisfy the maximum context window of the model.
+ df: (dataframe) Input dataframe to be processed
+ target_seq: (string) Full reference sequence (wild type) that is mutated in the DMS assay.
+ model_context_len: (int) Maximum context size for the model.
+ start_idx: (int) Integer to move to 0-indexing of positions (mutation triplet are typically based on 1-indexing).
+ scoring_window: (string) Method to slice sequences longer than maximum context size:
+ - optimal selects a single window as large as possible via the get_optimal_window function (this is the default)
+ - sliding splits the full sequence in contiguous (non-overlapping) chunks that are of size equal to the max context (except the last chunk which may be shorter)
+ indel_mode: (bool) Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions.
+ Note: when scoring indels for sequences that would be longer than the model max context length, it is preferable to use the "sliding" scoring_window. Use "optimal" otherwise.
+ """
+ len_target_seq = len(target_seq)
+ num_mutants = len(df['mutated_sequence'])
+ df=df.reset_index(drop=True)
+ if scoring_window=="optimal" or scoring_window=="raw_score":
+ df['mutation_barycenter'] = df['mutant'].apply(lambda x: int(np.array([int(mutation[1:-1]) - start_idx for mutation in x.split(':')]).mean())) if not indel_mode else df['mutated_sequence'].apply(lambda x: len(x)//2)
+ df['scoring_optimal_window'] = df['mutation_barycenter'].apply(lambda x: get_optimal_window(x, len_target_seq, model_context_len)) if not indel_mode else df['mutated_sequence'].apply(lambda x: (0,len(x)))
+ df['sliced_mutated_sequence'] = [df['mutated_sequence'][index][df['scoring_optimal_window'][index][0]:df['scoring_optimal_window'][index][1]] for index in range(num_mutants)]
+ df['window_start'] = df['scoring_optimal_window'].map(lambda x: x[0])
+ df['window_end'] = df['scoring_optimal_window'].map(lambda x: x[1])
+ del df['scoring_optimal_window'], df['mutation_barycenter']
+ if 'mutant' in df: del df['mutant']
+ df_wt=df.copy()
+ df_wt['mutated_sequence'] = [target_seq] * num_mutants
+ if indel_mode: # For indels, we set the wild type reference to be always the same (full length) sequence. We assume here that the length is lower than model context size (otherwise "Sliding" mode should be used)
+ df_wt['window_end'] = df_wt['mutated_sequence'].map(lambda x:len(x))
+ df_wt['sliced_mutated_sequence'] = [target_seq[df_wt['window_start'][index]:df_wt['window_end'][index]] for index in range(num_mutants)]
+ df = pd.concat([df,df_wt], axis=0)
+ df = df.drop_duplicates()
+ elif scoring_window=="sliding":
+ num_windows = 1 + int( len_target_seq / model_context_len)
+ df_list=[]
+ start=0
+ for window_index in range(1, num_windows+1):
+ df_sliced = df.copy()
+ df_sliced['sliced_mutated_sequence'] = df_sliced['mutated_sequence'].map(lambda x: x[start:start+model_context_len])
+ df_sliced['window_start'] = [start] * num_mutants
+ df_sliced['window_end'] = df_sliced['mutated_sequence'].map(lambda x: min(len(x), start+model_context_len))
+ df_sliced_wt = df_sliced.copy()
+ df_sliced_wt['mutated_sequence'] = [target_seq] * num_mutants
+ df_sliced_wt['sliced_mutated_sequence'] = df_sliced_wt['mutated_sequence'].map(lambda x: x[start:start+model_context_len])
+ df_sliced_wt['window_end'] = df_sliced_wt['mutated_sequence'].map(lambda x: min(len(x), start+model_context_len)) #Need to adjust end index if WT and sequence are not same full length
+ df_list.append(df_sliced)
+ df_list.append(df_sliced_wt)
+ start += model_context_len
+ df_final = pd.concat(df_list,axis=0)
+ if 'mutant' in df_final: del df_final['mutant']
+ df = df_final.drop_duplicates()
+ return df.reset_index(drop=True)
\ No newline at end of file
diff --git a/proteingym/baselines/trancepteve/trancepteve/utils/tokenizers/Basic_tokenizer b/proteingym/baselines/trancepteve/trancepteve/utils/tokenizers/Basic_tokenizer
new file mode 100644
index 0000000..b6af745
--- /dev/null
+++ b/proteingym/baselines/trancepteve/trancepteve/utils/tokenizers/Basic_tokenizer
@@ -0,0 +1 @@
+{"version":"1.0","truncation":null,"padding":null,"added_tokens":[{"id":0,"special":true,"content":"[UNK]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":1,"special":true,"content":"[CLS]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":2,"special":true,"content":"[SEP]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":3,"special":true,"content":"[PAD]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":4,"special":true,"content":"[MASK]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false}],"normalizer":null,"pre_tokenizer":{"type":"Whitespace"},"post_processor":{"type":"TemplateProcessing","single":[{"SpecialToken":{"id":"[CLS]","type_id":0}},{"Sequence":{"id":"A","type_id":0}},{"SpecialToken":{"id":"[SEP]","type_id":0}}],"pair":[{"SpecialToken":{"id":"[CLS]","type_id":0}},{"Sequence":{"id":"A","type_id":0}},{"SpecialToken":{"id":"[SEP]","type_id":0}},{"Sequence":{"id":"B","type_id":1}},{"SpecialToken":{"id":"[SEP]","type_id":1}}],"special_tokens":{"[CLS]":{"id":"[CLS]","ids":[1],"tokens":["[CLS]"]},"[SEP]":{"id":"[SEP]","ids":[2],"tokens":["[SEP]"]}}},"decoder":null,"model":{"type":"BPE","dropout":null,"unk_token":"[UNK]","continuing_subword_prefix":null,"end_of_word_suffix":null,"fuse_unk":false,"vocab":{"[UNK]":0,"[CLS]":1,"[SEP]":2,"[PAD]":3,"[MASK]":4,"A":5,"C":6,"D":7,"E":8,"F":9,"G":10,"H":11,"I":12,"K":13,"L":14,"M":15,"N":16,"P":17,"Q":18,"R":19,"S":20,"T":21,"V":22,"W":23,"Y":24},"merges":[]}}
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/__init__.py b/proteingym/baselines/tranception/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/proteingym/baselines/tranception/score_tranception_proteingym.py b/proteingym/baselines/tranception/score_tranception_proteingym.py
new file mode 100644
index 0000000..6a7a8e5
--- /dev/null
+++ b/proteingym/baselines/tranception/score_tranception_proteingym.py
@@ -0,0 +1,125 @@
+import os
+import argparse
+import json
+import pandas as pd
+
+import torch
+
+from transformers import PreTrainedTokenizerFast
+import tranception
+from tranception import config, model_pytorch
+
+dir_path = os.path.dirname(os.path.abspath(__file__))
+
+def main():
+ """
+ Main script to score sets of mutated protein sequences (substitutions or indels) with Tranception.
+ """
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+ parser.add_argument('--checkpoint', type=str, help='Path of Tranception model checkpoint')
+ parser.add_argument('--model_framework', default='pytorch', type=str, help='Underlying framework [pytorch|JAX]')
+ parser.add_argument('--batch_size_inference', default=20, type=int, help='Batch size for inference')
+
+ #We may pass in all required information about the DMS via the provided reference files, or specify all relevant fields manually
+ parser.add_argument('--DMS_reference_file_path', default=None, type=str, help='Path to reference file with list of DMS to score')
+ parser.add_argument('--DMS_index', default=0, type=int, help='Index of DMS assay in reference file')
+ #Fields to be passed manually if reference file is not used
+ parser.add_argument('--target_seq', default=None, type=str, help='Full wild type sequence that is mutated in the DMS asssay')
+ parser.add_argument('--DMS_file_name', default=None, type=str, help='Name of DMS assay file')
+ parser.add_argument('--MSA_filename', default=None, type=str, help='Name of MSA (eg., a2m) file constructed on the wild type sequence')
+ parser.add_argument('--MSA_weight_file_name', default=None, type=str, help='Weight of sequences in the MSA (optional)')
+ parser.add_argument('--MSA_start', default=None, type=int, help='Sequence position that the MSA starts at (1-indexing)')
+ parser.add_argument('--MSA_end', default=None, type=int, help='Sequence position that the MSA ends at (1-indexing)')
+
+ parser.add_argument('--DMS_data_folder', type=str, help='Path to folder that contains all DMS assay datasets')
+ parser.add_argument('--output_scores_folder', default='./', type=str, help='Name of folder to write model scores to')
+ parser.add_argument('--deactivate_scoring_mirror', action='store_true', help='Whether to deactivate sequence scoring from both directions (Left->Right and Right->Left)')
+ parser.add_argument('--indel_mode', action='store_true', help='Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions')
+ parser.add_argument('--scoring_window', default="optimal", type=str, help='Sequence window selection mode (when sequence length longer than model context size)')
+ parser.add_argument('--num_workers', default=10, type=int, help='Number of workers for model scoring data loader')
+ parser.add_argument('--inference_time_retrieval', action='store_true', help='Whether to perform inference-time retrieval')
+ parser.add_argument('--retrieval_inference_weight', default=0.6, type=float, help='Coefficient (alpha) used when aggregating autoregressive transformer and retrieval')
+ parser.add_argument('--MSA_folder', default='.', type=str, help='Path to MSA for neighborhood scoring')
+ parser.add_argument('--MSA_weights_folder', default=None, type=str, help='Path to MSA weights for neighborhood scoring')
+ parser.add_argument('--clustal_omega_location', default=None, type=str, help='Path to Clustal Omega (only needed with scoring indels with retrieval)')
+ args = parser.parse_args()
+
+ model_name = args.checkpoint.split("/")[-1]
+
+ tokenizer = PreTrainedTokenizerFast(tokenizer_file=dir_path+os.sep+"tranception/utils/tokenizers/Basic_tokenizer",
+ unk_token="[UNK]",
+ sep_token="[SEP]",
+ pad_token="[PAD]",
+ cls_token="[CLS]",
+ mask_token="[MASK]"
+ )
+
+ if args.DMS_reference_file_path:
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Compute scores for DMS: "+str(DMS_id))
+ target_seq = mapping_protein_seq_DMS["target_seq"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0].upper()
+ DMS_file_name = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ if args.inference_time_retrieval:
+ MSA_data_file = args.MSA_folder + os.sep + mapping_protein_seq_DMS["MSA_filename"][args.DMS_index] if args.MSA_folder is not None else None
+ MSA_weight_file_name = args.MSA_weights_folder + os.sep + mapping_protein_seq_DMS["weight_file_name"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0] if args.MSA_weights_folder else None
+ MSA_start = int(mapping_protein_seq_DMS["MSA_start"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]) - 1 # MSA_start typically based on 1-indexing
+ MSA_end = int(mapping_protein_seq_DMS["MSA_end"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0])
+ else:
+ target_seq=args.target_seq
+ DMS_file_name=args.DMS_file_name
+ DMS_id = DMS_file_name.split(".")[0]
+ if args.inference_time_retrieval:
+ MSA_data_file = args.MSA_folder + os.sep + args.MSA_filename if args.MSA_folder is not None else None
+ MSA_weight_file_name = args.MSA_weights_folder + os.sep + args.MSA_weight_file_name if args.MSA_weights_folder is not None else None
+ MSA_start = args.MSA_start - 1 # MSA_start based on 1-indexing
+ MSA_end = args.MSA_end
+
+ config = json.load(open(args.checkpoint+os.sep+'config.json'))
+ config = tranception.config.TranceptionConfig(**config)
+ config.attention_mode="tranception"
+ config.position_embedding="grouped_alibi"
+ config.tokenizer = tokenizer
+ config.scoring_window = args.scoring_window
+
+ if args.inference_time_retrieval:
+ config.retrieval_aggregation_mode = "aggregate_indel" if args.indel_mode else "aggregate_substitution"
+ config.MSA_filename=MSA_data_file
+ config.full_protein_length=len(target_seq)
+ config.MSA_weight_file_name=MSA_weight_file_name
+ config.retrieval_inference_weight=args.retrieval_inference_weight
+ config.MSA_start = MSA_start
+ config.MSA_end = MSA_end
+ if args.indel_mode:
+ config.clustal_omega_location = args.clustal_omega_location
+ else:
+ config.retrieval_aggregation_mode = None
+
+ if args.model_framework=="pytorch":
+ model = tranception.model_pytorch.TranceptionLMHeadModel.from_pretrained(pretrained_model_name_or_path=args.checkpoint,config=config)
+ if torch.cuda.is_available():
+ model.cuda()
+ model.eval()
+
+ if not os.path.isdir(args.output_scores_folder):
+ os.mkdir(args.output_scores_folder)
+ retrieval_type = '_retrieval_' + str(args.retrieval_inference_weight) if args.inference_time_retrieval else '_no_retrieval'
+ mutation_type = '_indels' if args.indel_mode else '_substitutions'
+ mirror_type = '_no_mirror' if args.deactivate_scoring_mirror else ''
+ scoring_filename = args.output_scores_folder + os.sep + DMS_id + ".csv"
+
+
+ DMS_data = pd.read_csv(args.DMS_data_folder + os.sep + DMS_file_name, low_memory=False)
+ all_scores = model.score_mutants(
+ DMS_data=DMS_data,
+ target_seq=target_seq,
+ scoring_mirror=not args.deactivate_scoring_mirror,
+ batch_size_inference=args.batch_size_inference,
+ num_workers=args.num_workers,
+ indel_mode=args.indel_mode
+ )
+ all_scores.to_csv(scoring_filename, index=False)
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/__init__.py b/proteingym/baselines/tranception/tranception/__init__.py
new file mode 100644
index 0000000..d782e9b
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/__init__.py
@@ -0,0 +1 @@
+from . import config
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/activations.py b/proteingym/baselines/tranception/tranception/activations.py
new file mode 100644
index 0000000..25702ef
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/activations.py
@@ -0,0 +1,114 @@
+import math
+
+import torch
+from packaging import version
+from torch import nn
+
+from transformers.utils import logging
+
+
+logger = logging.get_logger(__name__)
+
+
+def _gelu_python(x):
+ """
+ Original Implementation of the GELU activation function in Google BERT repo when initially created. For
+ information: OpenAI GPT's GELU is slightly different (and gives slightly different results): 0.5 * x * (1 +
+ torch.tanh(math.sqrt(2 / math.pi) * (x + 0.044715 * torch.pow(x, 3)))) This is now written in C in nn.functional
+ Also see the Gaussian Error Linear Units paper: https://arxiv.org/abs/1606.08415
+ """
+ return x * 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))
+
+
+def gelu_new(x):
+ """
+ Implementation of the GELU activation function currently in Google BERT repo (identical to OpenAI GPT). Also see
+ the Gaussian Error Linear Units paper: https://arxiv.org/abs/1606.08415
+ """
+ return 0.5 * x * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * torch.pow(x, 3.0))))
+
+
+if version.parse(torch.__version__) < version.parse("1.4"):
+ gelu = _gelu_python
+else:
+ gelu = nn.functional.gelu
+
+
+def gelu_fast(x):
+ return 0.5 * x * (1.0 + torch.tanh(x * 0.7978845608 * (1.0 + 0.044715 * x * x)))
+
+
+def quick_gelu(x):
+ return x * torch.sigmoid(1.702 * x)
+
+
+def _silu_python(x):
+ """
+ See Gaussian Error Linear Units (Hendrycks et al., https://arxiv.org/abs/1606.08415) where the SiLU (Sigmoid Linear
+ Unit) was originally introduced and coined, and see Sigmoid-Weighted Linear Units for Neural Network Function
+ Approximation in Reinforcement Learning (Elfwing et al., https://arxiv.org/abs/1702.03118) and Swish: a Self-Gated
+ Activation Function (Ramachandran et al., https://arxiv.org/abs/1710.05941v1) where the SiLU was experimented with
+ later.
+ """
+ return x * torch.sigmoid(x)
+
+
+if version.parse(torch.__version__) < version.parse("1.7"):
+ silu = _silu_python
+else:
+ silu = nn.functional.silu
+
+
+def _mish_python(x):
+ """
+ See Mish: A Self-Regularized Non-Monotonic Activation Function (Misra., https://arxiv.org/abs/1908.08681). Also
+ visit the official repository for the paper: https://github.com/digantamisra98/Mish
+ """
+ return x * torch.tanh(nn.functional.softplus(x))
+
+
+if version.parse(torch.__version__) < version.parse("1.9"):
+ mish = _mish_python
+else:
+ mish = nn.functional.mish
+
+
+def linear_act(x):
+ return x
+
+def squared_relu(x):
+ """
+ Squared ReLU variant that is fastest with Pytorch.
+ """
+ x = nn.functional.relu(x)
+ return x*x
+
+def squared_relu_xla(x):
+ """
+ Squared ReLU variant that is fastest with JAX.
+ """
+ x = nn.functional.relu(x)
+ return x**2
+
+tranception_ACT2FN = {
+ "relu": nn.functional.relu,
+ "silu": silu,
+ "swish": silu,
+ "gelu": gelu,
+ "tanh": torch.tanh,
+ "gelu_new": gelu_new,
+ "gelu_fast": gelu_fast,
+ "quick_gelu": quick_gelu,
+ "mish": mish,
+ "linear": linear_act,
+ "sigmoid": torch.sigmoid,
+ "squared_relu": squared_relu,
+ "squared_relu_xla": squared_relu_xla,
+}
+
+
+def get_activation(activation_string):
+ if activation_string in tranception_ACT2FN:
+ return tranception_ACT2FN[activation_string]
+ else:
+ raise KeyError(f"function {activation_string} not found in ACT2FN mapping {list(tranception_ACT2FN.keys())}")
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/config.py b/proteingym/baselines/tranception/tranception/config.py
new file mode 100644
index 0000000..1b35cb0
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/config.py
@@ -0,0 +1,36 @@
+from transformers import GPT2Config
+
+class TranceptionConfig(GPT2Config):
+ """
+ Config subclass for Tranception model architecture.
+ """
+ def __init__(
+ self,
+ attention_mode="tranception",
+ position_embedding="grouped_alibi",
+ tokenizer=None,
+ retrieval_aggregation_mode=None,
+ retrieval_inference_weight=0.6,
+ MSA_filename=None,
+ MSA_weight_file_name=None,
+ MSA_start=None,
+ MSA_end=None,
+ full_protein_length=None,
+ clustal_omega_location=None,
+ scoring_window=None,
+ **kwargs
+ ):
+ super().__init__(**kwargs)
+ self.model_type="tranception"
+ self.attention_mode=attention_mode
+ self.position_embedding=position_embedding
+ self.tokenizer = tokenizer
+ self.retrieval_aggregation_mode = retrieval_aggregation_mode
+ self.retrieval_inference_weight = retrieval_inference_weight
+ self.MSA_filename = MSA_filename
+ self.MSA_weight_file_name = MSA_weight_file_name
+ self.MSA_start=MSA_start
+ self.MSA_end=MSA_end
+ self.full_protein_length = full_protein_length
+ self.clustal_omega_location = clustal_omega_location
+ self.scoring_window=scoring_window
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/model_pytorch.py b/proteingym/baselines/tranception/tranception/model_pytorch.py
new file mode 100644
index 0000000..e9f8ff2
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/model_pytorch.py
@@ -0,0 +1,939 @@
+from dataclasses import dataclass
+from typing import Optional, Tuple
+import math
+import os
+import pandas as pd
+import uuid
+import torch
+from torch import nn
+from torch.nn import CrossEntropyLoss, NLLLoss
+import torch.nn.functional as F
+from transformers import GPT2PreTrainedModel
+
+from transformers.modeling_utils import (
+ Conv1D,
+ PreTrainedModel,
+ SequenceSummary,
+ find_pruneable_heads_and_indices,
+ prune_conv1d_layer,
+)
+from transformers.file_utils import (
+ ModelOutput,
+ add_code_sample_docstrings,
+ add_start_docstrings,
+ add_start_docstrings_to_model_forward,
+ replace_return_docstrings
+)
+from transformers.modeling_outputs import (
+ BaseModelOutputWithPastAndCrossAttentions,
+ CausalLMOutputWithCrossAttentions,
+ SequenceClassifierOutputWithPast,
+ TokenClassifierOutput
+)
+from transformers.utils.model_parallel_utils import assert_device_map, get_device_map
+
+from tranception.activations import tranception_ACT2FN
+from tranception.config import TranceptionConfig
+from tranception.outputs import (
+ TranceptionCausalLMOutputWithCrossAttentions,
+)
+from tranception.utils import msa_utils
+from tranception.utils import scoring_utils
+
+def nanmean(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs) / (~is_nan).float().sum(*args, **kwargs)
+
+def get_slopes(n, mode="standard_alibi", verbose=False):
+ """
+ Function to compute the m constant for each attention head. Code has been adapted from the official ALiBi codebase at:
+ https://github.com/ofirpress/attention_with_linear_biases/blob/master/fairseq/models/transformer.py
+ """
+ def get_slopes_power_of_2(n):
+ start = (2**(-2**-(math.log2(n)-3)))
+ ratio = start
+ return [start*ratio**i for i in range(n)]
+ if mode=="grouped_alibi":
+ n = n // 4
+ if math.log2(n).is_integer():
+ result = get_slopes_power_of_2(n)
+ else:
+ #Workaround when the number of heads is not a power of 2
+ closest_power_of_2 = 2**math.floor(math.log2(n))
+ result = get_slopes_power_of_2(closest_power_of_2) + get_slopes(2*closest_power_of_2)[0::2][:n-closest_power_of_2]
+ if mode=="grouped_alibi":
+ result = result * 4
+ if verbose:
+ print("ALiBi slopes: {}".format(result))
+ return result
+
+class SpatialDepthWiseConvolution(nn.Module):
+ def __init__(self, head_dim: int, kernel_size: int = 3):
+ super().__init__()
+ self.kernel_size = kernel_size
+ self.conv = nn.Conv1d(in_channels=head_dim, out_channels=head_dim, kernel_size=(kernel_size,), padding=(kernel_size - 1,), groups=head_dim)
+
+ def forward(self, x: torch.Tensor):
+ batch_size, heads, seq_len, head_dim = x.shape
+ x = x.permute(0, 1, 3, 2).contiguous()
+ x = x.view(batch_size * heads, head_dim, seq_len)
+ x = self.conv(x)
+ if self.kernel_size>1:
+ x = x[:, :, :-(self.kernel_size - 1)]
+ x = x.view(batch_size, heads, head_dim, seq_len)
+ x = x.permute(0, 1, 3, 2)
+ return x
+
+class TranceptionBlockAttention(nn.Module):
+ def __init__(self, config, is_cross_attention=False, SDWC_kernel_size=None):
+ super().__init__()
+
+ max_positions = config.max_position_embeddings
+ self.register_buffer(
+ "bias",
+ torch.tril(torch.ones((max_positions, max_positions), dtype=torch.uint8)).view(
+ 1, 1, max_positions, max_positions
+ ),
+ )
+ self.register_buffer("masked_bias", torch.tensor(-1e4))
+
+ self.embed_dim = config.hidden_size
+ self.num_heads = config.num_attention_heads
+ self.head_dim = self.embed_dim // self.num_heads
+ self.split_size = self.embed_dim
+ if self.head_dim * self.num_heads != self.embed_dim:
+ raise ValueError(
+ f"`embed_dim` must be divisible by num_heads (got `embed_dim`: {self.embed_dim} and `num_heads`: {self.num_heads})."
+ )
+
+ self.scale_attn_weights = config.scale_attn_weights
+ self.is_cross_attention = is_cross_attention
+
+ if self.is_cross_attention:
+ self.c_attn = Conv1D(2 * self.embed_dim, self.embed_dim)
+ self.q_attn = Conv1D(self.embed_dim, self.embed_dim)
+ else:
+ self.c_attn = Conv1D(3 * self.embed_dim, self.embed_dim)
+ self.c_proj = Conv1D(self.embed_dim, self.embed_dim)
+
+ self.attn_dropout = nn.Dropout(config.attn_pdrop)
+ self.resid_dropout = nn.Dropout(config.resid_pdrop)
+
+ self.pruned_heads = set()
+
+ self.attention_mode=config.attention_mode
+
+ if self.attention_mode=="tranception":
+ assert self.num_heads%4==0, "Invalid number of heads. Tranception requires the number of heads to be a multiple of 4."
+ self.num_heads_per_kernel_size = self.num_heads // 4
+ self.query_depthwiseconv = nn.ModuleDict()
+ self.key_depthwiseconv = nn.ModuleDict()
+ self.value_depthwiseconv = nn.ModuleDict()
+ for kernel_idx, kernel in enumerate([3,5,7]):
+ self.query_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+ self.key_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+ self.value_depthwiseconv[str(kernel_idx)] = SpatialDepthWiseConvolution(self.head_dim,kernel)
+
+ def prune_heads(self, heads):
+ if len(heads) == 0:
+ return
+ heads, index = find_pruneable_heads_and_indices(heads, self.num_heads, self.head_dim, self.pruned_heads)
+ index_attn = torch.cat([index, index + self.split_size, index + (2 * self.split_size)])
+
+ # Prune conv1d layers
+ self.c_attn = prune_conv1d_layer(self.c_attn, index_attn, dim=1)
+ self.c_proj = prune_conv1d_layer(self.c_proj, index, dim=0)
+
+ # Update hyper params
+ self.split_size = (self.split_size // self.num_heads) * (self.num_heads - len(heads))
+ self.num_heads = self.num_heads - len(heads)
+ self.pruned_heads = self.pruned_heads.union(heads)
+
+ def _attn(self, query, key, value, attention_mask=None, head_mask=None, alibi_bias=None):
+ attn_weights = torch.matmul(query, key.transpose(-1, -2))
+
+ if self.scale_attn_weights:
+ attn_weights = attn_weights / (float(value.size(-1)) ** 0.5)
+
+ if not self.is_cross_attention:
+ # if only "normal" attention layer implements causal mask
+ query_length, key_length = query.size(-2), key.size(-2)
+ causal_mask = self.bias[:, :, key_length - query_length : key_length, :key_length].bool()
+ attn_weights = torch.where(causal_mask, attn_weights, self.masked_bias.to(attn_weights.dtype))
+
+ if alibi_bias is not None:
+ attn_weights = attn_weights + alibi_bias[:,:,:attn_weights.size(-1)]
+
+ if attention_mask is not None:
+ # Apply the attention mask
+ attn_weights = attn_weights + attention_mask
+
+ attn_weights = nn.Softmax(dim=-1)(attn_weights)
+ attn_weights = self.attn_dropout(attn_weights)
+
+ # Mask heads if we want to
+ if head_mask is not None:
+ attn_weights = attn_weights * head_mask
+
+ attn_output = torch.matmul(attn_weights, value)
+
+ return attn_output, attn_weights
+
+ def _split_heads(self, tensor, num_heads, attn_head_size):
+ """
+ Splits hidden_size dim into attn_head_size and num_heads
+ """
+ new_shape = tensor.size()[:-1] + (num_heads, attn_head_size)
+ tensor = tensor.view(*new_shape)
+ return tensor.permute(0, 2, 1, 3) # (batch, head, seq_length, head_features)
+
+ def _merge_heads(self, tensor, num_heads, attn_head_size):
+ """
+ Merges attn_head_size dim and num_attn_heads dim into hidden_size
+ """
+ tensor = tensor.permute(0, 2, 1, 3).contiguous()
+ new_shape = tensor.size()[:-2] + (num_heads * attn_head_size,)
+ return tensor.view(new_shape)
+
+ def forward(
+ self,
+ hidden_states,
+ layer_past=None,
+ attention_mask=None,
+ head_mask=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ alibi_bias=None,
+ ):
+ if encoder_hidden_states is not None:
+ if not hasattr(self, "q_attn"):
+ raise ValueError(
+ "If class is used as cross attention, the weights `q_attn` have to be defined. "
+ "Please make sure to instantiate class with `GPT2Attention(..., is_cross_attention=True)`."
+ )
+
+ query = self.q_attn(hidden_states)
+ key, value = self.c_attn(encoder_hidden_states).split(self.split_size, dim=2)
+ attention_mask = encoder_attention_mask
+ else:
+ query, key, value = self.c_attn(hidden_states).split(self.split_size, dim=2)
+
+ query = self._split_heads(query, self.num_heads, self.head_dim)
+ key = self._split_heads(key, self.num_heads, self.head_dim)
+ value = self._split_heads(value, self.num_heads, self.head_dim)
+
+ if layer_past is not None:
+ past_key, past_value = layer_past
+ key = torch.cat((past_key, key), dim=-2)
+ value = torch.cat((past_value, value), dim=-2)
+
+ if use_cache is True:
+ present = (key, value)
+ else:
+ present = None
+
+ if self.attention_mode=="tranception":
+ # We do not do anything on the first self.num_heads_per_kernel_size heads (kernel =1)
+ query_list=[query[:,:self.num_heads_per_kernel_size,:,:]]
+ key_list=[key[:,:self.num_heads_per_kernel_size,:,:]]
+ value_list=[value[:,:self.num_heads_per_kernel_size,:,:]]
+ for kernel_idx in range(3):
+ query_list.append(self.query_depthwiseconv[str(kernel_idx)](query[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ key_list.append(self.key_depthwiseconv[str(kernel_idx)](key[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ value_list.append(self.value_depthwiseconv[str(kernel_idx)](value[:,(kernel_idx+1)*self.num_heads_per_kernel_size:(kernel_idx+2)*self.num_heads_per_kernel_size,:,:]))
+ query=torch.cat(query_list, dim=1)
+ key=torch.cat(key_list, dim=1)
+ value=torch.cat(value_list, dim=1)
+
+ attn_output, attn_weights = self._attn(query, key, value, attention_mask, head_mask, alibi_bias=alibi_bias)
+
+ attn_output = self._merge_heads(attn_output, self.num_heads, self.head_dim)
+ attn_output = self.c_proj(attn_output)
+ attn_output = self.resid_dropout(attn_output)
+
+ outputs = (attn_output, present)
+ if output_attentions:
+ outputs += (attn_weights,)
+
+ return outputs # a, present, (attentions)
+
+class TranceptionBlockMLP(nn.Module):
+ def __init__(self, intermediate_size, config):
+ super().__init__()
+ embed_dim = config.hidden_size
+ self.c_fc = Conv1D(intermediate_size, embed_dim)
+ self.c_proj = Conv1D(embed_dim, intermediate_size)
+ self.act = tranception_ACT2FN[config.activation_function]
+ self.dropout = nn.Dropout(config.resid_pdrop)
+
+ def forward(self, hidden_states):
+ hidden_states = self.c_fc(hidden_states)
+ hidden_states = self.act(hidden_states)
+ hidden_states = self.c_proj(hidden_states)
+ hidden_states = self.dropout(hidden_states)
+ return hidden_states
+
+class TranceptionBlock(nn.Module):
+ def __init__(self, config, SDWC_kernel_size=None):
+ super().__init__()
+ hidden_size = config.hidden_size
+ inner_dim = config.n_inner if config.n_inner is not None else 4 * hidden_size
+
+ self.ln_1 = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+ self.attn = TranceptionBlockAttention(config, SDWC_kernel_size=SDWC_kernel_size)
+ self.ln_2 = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+
+ if config.add_cross_attention:
+ self.crossattention = TranceptionBlockAttention(config, is_cross_attention=True, SDWC_kernel_size=SDWC_kernel_size)
+ self.ln_cross_attn = nn.LayerNorm(hidden_size, eps=config.layer_norm_epsilon)
+
+ self.mlp = TranceptionBlockMLP(inner_dim, config)
+
+ def forward(
+ self,
+ hidden_states,
+ layer_past=None,
+ attention_mask=None,
+ head_mask=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=False,
+ output_attentions=False,
+ alibi_bias=None,
+ ):
+ residual = hidden_states
+ hidden_states = self.ln_1(hidden_states)
+ attn_outputs = self.attn(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ alibi_bias=alibi_bias,
+ )
+ attn_output = attn_outputs[0] # output_attn: a, present, (attentions)
+ outputs = attn_outputs[1:]
+ # residual connection
+ hidden_states = attn_output + residual
+
+ if encoder_hidden_states is not None:
+ # add one self-attention block for cross-attention
+ if not hasattr(self, "crossattention"):
+ raise ValueError(
+ f"If `encoder_hidden_states` are passed, {self} has to be instantiated with "
+ "cross-attention layers by setting `config.add_cross_attention=True`"
+ )
+ residual = hidden_states
+ hidden_states = self.ln_cross_attn(hidden_states)
+ cross_attn_outputs = self.crossattention(
+ hidden_states,
+ attention_mask=attention_mask,
+ head_mask=head_mask,
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ output_attentions=output_attentions,
+ )
+ attn_output = cross_attn_outputs[0]
+ # residual connection
+ hidden_states = residual + attn_output
+ outputs = outputs + cross_attn_outputs[2:] # add cross attentions if we output attention weights
+
+ residual = hidden_states
+ hidden_states = self.ln_2(hidden_states)
+
+ feed_forward_hidden_states = self.mlp(hidden_states)
+
+ # residual connection
+ hidden_states = residual + feed_forward_hidden_states
+
+ if use_cache:
+ outputs = (hidden_states,) + outputs
+ else:
+ outputs = (hidden_states,) + outputs[1:]
+
+ return outputs # hidden_states, present, (attentions, cross_attentions)
+
+class TranceptionModel(GPT2PreTrainedModel):
+ _keys_to_ignore_on_load_missing = ["attn.masked_bias"]
+ def __init__(self, config):
+ super().__init__(config)
+
+ self.embed_dim = config.hidden_size
+ self.wte = nn.Embedding(config.vocab_size, self.embed_dim)
+ self.position_embedding = config.position_embedding if hasattr(config, "position_embedding") else "learned"
+ if self.position_embedding=="learned":
+ self.wpe = nn.Embedding(config.max_position_embeddings, self.embed_dim)
+ self.alibi = None
+ elif self.position_embedding=="grouped_alibi":
+ maxpos = config.n_positions
+ attn_heads = config.n_head
+ self.slopes = torch.Tensor(get_slopes(attn_heads, mode=self.position_embedding))
+ #The softmax operation is invariant to translation, and bias functions used are always linear.
+ alibi = self.slopes.unsqueeze(1).unsqueeze(1) * torch.arange(maxpos).unsqueeze(0).unsqueeze(0).expand(attn_heads, -1, -1)
+ alibi = alibi.view(attn_heads, 1, maxpos)
+ self.register_buffer('alibi',alibi)
+
+ self.drop = nn.Dropout(config.embd_pdrop)
+ self.h = nn.ModuleList([TranceptionBlock(config) for _ in range(config.num_hidden_layers)])
+ self.ln_f = nn.LayerNorm(self.embed_dim, eps=config.layer_norm_epsilon)
+
+ self.init_weights()
+
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+ self.gradient_checkpointing = False
+
+ def parallelize(self, device_map=None, num_cores=None):
+ self.device_map = (
+ get_device_map(len(self.h), range(torch.cuda.device_count())) if device_map is None else device_map
+ )
+ device_prefix="cuda:"
+ assert_device_map(self.device_map, len(self.h))
+ self.model_parallel = True
+ self.first_device = "cpu" if "cpu" in self.device_map.keys() else device_prefix + str(min(self.device_map.keys()))
+ self.last_device = device_prefix + str(max(self.device_map.keys()))
+ self.wte = self.wte.to(self.first_device)
+ if self.position_embedding=="learned":
+ self.wpe = self.wpe.to(self.first_device)
+ for k, v in self.device_map.items():
+ print("k,v :"+str(k)+","+str(v))
+ for block in v:
+ cuda_device = device_prefix + str(k)
+ self.h[block] = self.h[block].to(cuda_device)
+ self.ln_f = self.ln_f.to(self.last_device)
+
+ def deparallelize(self):
+ self.model_parallel = False
+ self.device_map = None
+ self.first_device = "cpu"
+ self.last_device = "cpu"
+ self.wte = self.wte.to("cpu")
+ if self.position_embedding=="learned":
+ self.wpe = self.wpe.to("cpu")
+ for index in range(len(self.h)):
+ self.h[index] = self.h[index].to("cpu")
+ self.ln_f = self.ln_f.to("cpu")
+ torch.cuda.empty_cache()
+
+ def get_input_embeddings(self):
+ return self.wte
+
+ def set_input_embeddings(self, new_embeddings):
+ self.wte = new_embeddings
+
+ def _prune_heads(self, heads_to_prune):
+ """
+ Prunes heads of the model. heads_to_prune: dict of {layer_num: list of heads to prune in this layer}
+ """
+ for layer, heads in heads_to_prune.items():
+ self.h[layer].attn.prune_heads(heads)
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ ):
+ output_attentions = output_attentions if output_attentions is not None else self.config.output_attentions
+ output_hidden_states = (
+ output_hidden_states if output_hidden_states is not None else self.config.output_hidden_states
+ )
+ use_cache = use_cache if use_cache is not None else self.config.use_cache
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ if input_ids is not None and inputs_embeds is not None:
+ raise ValueError("You cannot specify both input_ids and inputs_embeds at the same time")
+ elif input_ids is not None:
+ input_shape = input_ids.size()
+ input_ids = input_ids.view(-1, input_shape[-1])
+ batch_size = input_ids.shape[0]
+ elif inputs_embeds is not None:
+ input_shape = inputs_embeds.size()[:-1]
+ batch_size = inputs_embeds.shape[0]
+ else:
+ raise ValueError("You have to specify either input_ids or inputs_embeds")
+
+ device = input_ids.device if input_ids is not None else inputs_embeds.device
+
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids.view(-1, input_shape[-1])
+ if position_ids is not None:
+ position_ids = position_ids.view(-1, input_shape[-1])
+
+ if past_key_values is None:
+ past_length = 0
+ past_key_values = tuple([None] * len(self.h))
+ else:
+ past_length = past_key_values[0][0].size(-2)
+ if position_ids is None:
+ position_ids = torch.arange(past_length, input_shape[-1] + past_length, dtype=torch.long, device=device)
+ position_ids = position_ids.unsqueeze(0).view(-1, input_shape[-1])
+
+ # GPT2Attention mask.
+ if attention_mask is not None:
+ if batch_size <= 0:
+ raise ValueError("batch_size has to be defined and > 0")
+ attention_mask = attention_mask.view(batch_size, -1)
+ # We create a 3D attention mask from a 2D tensor mask.
+ # Sizes are [batch_size, 1, 1, to_seq_length]
+ # So we can broadcast to [batch_size, num_heads, from_seq_length, to_seq_length]
+ # this attention mask is more simple than the triangular masking of causal attention
+ # used in OpenAI GPT, we just need to prepare the broadcast dimension here.
+ attention_mask = attention_mask[:, None, None, :]
+
+ # Since attention_mask is 1.0 for positions we want to attend and 0.0 for
+ # masked positions, this operation will create a tensor which is 0.0 for
+ # positions we want to attend and -10000.0 for masked positions.
+ # Since we are adding it to the raw scores before the softmax, this is
+ # effectively the same as removing these entirely.
+ attention_mask = attention_mask.to(dtype=self.dtype) # fp16 compatibility
+ attention_mask = (1.0 - attention_mask) * -10000.0
+
+ # If a 2D ou 3D attention mask is provided for the cross-attention
+ # we need to make broadcastable to [batch_size, num_heads, seq_length, seq_length]
+ if self.config.add_cross_attention and encoder_hidden_states is not None:
+ encoder_batch_size, encoder_sequence_length, _ = encoder_hidden_states.size()
+ encoder_hidden_shape = (encoder_batch_size, encoder_sequence_length)
+ if encoder_attention_mask is None:
+ encoder_attention_mask = torch.ones(encoder_hidden_shape, device=device)
+ encoder_attention_mask = self.invert_attention_mask(encoder_attention_mask)
+ else:
+ encoder_attention_mask = None
+
+ # Prepare head mask if needed
+ # 1.0 in head_mask indicate we keep the head
+ # attention_probs has shape bsz x n_heads x N x N
+ # head_mask has shape n_layer x batch x n_heads x N x N
+ head_mask = self.get_head_mask(head_mask, self.config.n_layer)
+
+ if inputs_embeds is None:
+ inputs_embeds = self.wte(input_ids)
+ if self.position_embedding=="learned":
+ position_embeds = self.wpe(position_ids)
+ hidden_states = inputs_embeds + position_embeds
+ else:
+ hidden_states = inputs_embeds
+
+ if token_type_ids is not None:
+ token_type_embeds = self.wte(token_type_ids)
+ hidden_states = hidden_states + token_type_embeds
+
+ hidden_states = self.drop(hidden_states)
+
+ output_shape = input_shape + (hidden_states.size(-1),)
+
+ presents = () if use_cache else None
+ all_self_attentions = () if output_attentions else None
+ all_cross_attentions = () if output_attentions and self.config.add_cross_attention else None
+ all_hidden_states = () if output_hidden_states else None
+
+ for i, (block, layer_past) in enumerate(zip(self.h, past_key_values)):
+ # Model parallel
+ if self.model_parallel:
+ torch.cuda.set_device(hidden_states.device)
+ # Ensure layer_past is on same device as hidden_states (might not be correct)
+ if layer_past is not None:
+ layer_past = tuple(past_state.to(hidden_states.device) for past_state in layer_past)
+ # Ensure that attention_mask is always on the same device as hidden_states
+ if attention_mask is not None:
+ attention_mask = attention_mask.to(hidden_states.device)
+ if isinstance(head_mask, torch.Tensor):
+ head_mask = head_mask.to(hidden_states.device)
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if self.gradient_checkpointing and self.training:
+ if use_cache:
+ print("`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`...")
+ use_cache = False
+
+ def create_custom_forward(module):
+ def custom_forward(*inputs):
+ # None for past_key_value
+ return module(*inputs, use_cache, output_attentions)
+
+ return custom_forward
+
+ outputs = torch.utils.checkpoint.checkpoint(
+ create_custom_forward(block),
+ hidden_states,
+ None,
+ attention_mask,
+ head_mask[i],
+ encoder_hidden_states,
+ encoder_attention_mask,
+ )
+ else:
+ outputs = block(
+ hidden_states,
+ layer_past=layer_past,
+ attention_mask=attention_mask,
+ head_mask=head_mask[i],
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ alibi_bias=self.alibi if hasattr(self, "alibi") else None
+ )
+
+ hidden_states = outputs[0]
+
+ if use_cache is True:
+ presents = presents + (outputs[1],)
+
+ if output_attentions:
+ all_self_attentions = all_self_attentions + (outputs[2 if use_cache else 1],)
+ if self.config.add_cross_attention:
+ all_cross_attentions = all_cross_attentions + (outputs[3 if use_cache else 2],)
+
+ if self.model_parallel:
+ device_prefix="cuda:"
+ for k, v in self.device_map.items():
+ if i == v[-1] and device_prefix + str(k) != self.last_device:
+ hidden_states = hidden_states.to(device_prefix + str(k + 1))
+
+ hidden_states = self.ln_f(hidden_states)
+
+ hidden_states = hidden_states.view(*output_shape)
+ # Add last hidden state
+ if output_hidden_states:
+ all_hidden_states = all_hidden_states + (hidden_states,)
+
+ if not return_dict:
+ return tuple(
+ v
+ for v in [hidden_states, presents, all_hidden_states, all_self_attentions, all_cross_attentions]
+ if v is not None
+ )
+
+ return BaseModelOutputWithPastAndCrossAttentions(
+ last_hidden_state=hidden_states,
+ past_key_values=presents,
+ hidden_states=all_hidden_states,
+ attentions=all_self_attentions,
+ cross_attentions=all_cross_attentions,
+ )
+
+class TranceptionLMHeadModel(GPT2PreTrainedModel):
+ _keys_to_ignore_on_load_missing = [r"attn.masked_bias", r"attn.bias", r"lm_head.weight"]
+ def __init__(self, config):
+ super().__init__(config)
+ self.transformer = TranceptionModel(config)
+ self.lm_head = nn.Linear(config.n_embd, config.vocab_size, bias=False)
+ self.config = config
+
+ self.init_weights()
+
+ self.default_model_device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
+ # Model parallel
+ self.model_parallel = False
+ self.device_map = None
+ self.clustal_hash = str(uuid.uuid4())
+ self.retrieval_aggregation_mode = config.retrieval_aggregation_mode if hasattr(config, "retrieval_aggregation_mode") else None
+ if self.retrieval_aggregation_mode is not None:
+ print("Model leverages both autoregressive and retrieval inference")
+ self.MSA_filename = config.MSA_filename if hasattr(config, "MSA_filename") else False
+ self.MSA_folder = '/'.join(self.MSA_filename.split(os.sep)[:-1])
+ self.MSA_name = self.MSA_filename.split(os.sep)[-1]
+ self.retrieval_inference_weight_LR = config.retrieval_inference_weight if hasattr(config, "retrieval_inference_weight") else 0.6
+ self.retrieval_inference_weight_RL = config.retrieval_inference_weight if hasattr(config, "retrieval_inference_weight") else 0.6
+ self.MSA_start=config.MSA_start
+ self.MSA_end=config.MSA_end
+ self.full_protein_length = config.full_protein_length if hasattr(config, "full_protein_length") else -1
+
+ self.MSA_log_prior = torch.log(torch.tensor(
+ msa_utils.get_msa_prior(
+ MSA_data_file=self.MSA_filename,
+ MSA_weight_file_name=config.MSA_weight_file_name,
+ retrieval_aggregation_mode=self.retrieval_aggregation_mode,
+ MSA_start=self.MSA_start,
+ MSA_end=self.MSA_end,
+ len_target_seq=self.full_protein_length,
+ vocab=config.tokenizer.get_vocab(),
+ verbose=False
+ )
+ ).float().to(self.default_model_device))
+ else:
+ print("Model only uses autoregressive inference")
+
+ def parallelize(self, device_map=None, num_cores=None, num_pipelines=1):
+ self.num_pipelines=num_pipelines
+ self.device_map = (
+ get_device_map(len(self.transformer.h), range(torch.cuda.device_count()))
+ if device_map is None
+ else device_map
+ )
+ assert_device_map(self.device_map, len(self.transformer.h))
+ self.transformer.parallelize(self.device_map, num_cores=num_cores)
+ self.lm_head = self.lm_head.to(self.transformer.first_device)
+ self.model_parallel = True
+
+ def deparallelize(self):
+ self.transformer.deparallelize()
+ self.transformer = self.transformer.to("cpu")
+ self.lm_head = self.lm_head.to("cpu")
+ self.model_parallel = False
+ torch.cuda.empty_cache()
+
+ def get_output_embeddings(self):
+ return self.lm_head
+
+ def set_output_embeddings(self, new_embeddings):
+ self.lm_head = new_embeddings
+
+ def prepare_inputs_for_generation(self, input_ids, past=None, **kwargs):
+ token_type_ids = kwargs.get("token_type_ids", None)
+ # only last token for inputs_ids if past is defined in kwargs
+ if past:
+ input_ids = input_ids[:, -1].unsqueeze(-1)
+ if token_type_ids is not None:
+ token_type_ids = token_type_ids[:, -1].unsqueeze(-1)
+
+ attention_mask = kwargs.get("attention_mask", None)
+ position_ids = kwargs.get("position_ids", None)
+
+ if attention_mask is not None and position_ids is None:
+ # create position_ids on the fly for batch generation
+ position_ids = attention_mask.long().cumsum(-1) - 1
+ position_ids.masked_fill_(attention_mask == 0, 1)
+ if past:
+ position_ids = position_ids[:, -1].unsqueeze(-1)
+ else:
+ position_ids = None
+
+ return {
+ "input_ids": input_ids,
+ "past_key_values": past,
+ "use_cache": kwargs.get("use_cache"),
+ "position_ids": position_ids,
+ "attention_mask": attention_mask,
+ "token_type_ids": token_type_ids,
+ "flip": kwargs.get("flip", None),
+ }
+
+ def forward(
+ self,
+ input_ids=None,
+ past_key_values=None,
+ attention_mask=None,
+ token_type_ids=None,
+ position_ids=None,
+ head_mask=None,
+ inputs_embeds=None,
+ encoder_hidden_states=None,
+ encoder_attention_mask=None,
+ labels=None,
+ use_cache=None,
+ output_attentions=None,
+ output_hidden_states=None,
+ return_dict=None,
+ flip=None,
+ start_slice=None,
+ end_slice=None,
+ mutated_sequence=None,
+ ):
+ r"""
+ labels (:obj:`torch.LongTensor` of shape :obj:`(batch_size, sequence_length)`, `optional`):
+ Labels for language modeling. Note that the labels **are shifted** inside the model, i.e. you can set
+ ``labels = input_ids`` Indices are selected in ``[-100, 0, ..., config.vocab_size]`` All labels set to
+ ``-100`` are ignored (masked), the loss is only computed for labels in ``[0, ..., config.vocab_size]``
+ """
+ return_dict = return_dict if return_dict is not None else self.config.use_return_dict
+
+ transformer_outputs = self.transformer(
+ input_ids,
+ past_key_values=past_key_values,
+ attention_mask=attention_mask,
+ token_type_ids=token_type_ids,
+ position_ids=position_ids,
+ head_mask=head_mask,
+ inputs_embeds=inputs_embeds,
+ encoder_hidden_states=encoder_hidden_states,
+ encoder_attention_mask=encoder_attention_mask,
+ use_cache=use_cache,
+ output_attentions=output_attentions,
+ output_hidden_states=output_hidden_states,
+ return_dict=return_dict
+ )
+ hidden_states = transformer_outputs[0]
+
+ # Set device for model parallelism
+ if self.model_parallel:
+ torch.cuda.set_device(self.transformer.first_device)
+ hidden_states = hidden_states.to(self.lm_head.weight.device)
+ self.MSA_log_prior = self.MSA_log_prior.to(self.lm_head.weight.device)
+
+ lm_logits = self.lm_head(hidden_states)
+
+ loss = None
+ fused_shift_log_probas = None
+ if labels is not None:
+ # Shift so that tokens < n predict n
+ shift_logits = lm_logits[..., :-1, :].contiguous()
+ shift_labels = labels[..., 1:].contiguous()
+
+ if self.retrieval_aggregation_mode is not None:
+ batch_size = input_ids.size(0)
+ if self.retrieval_aggregation_mode=="aggregate_indel":
+ assert batch_size==1, "Aggregate indel is only supported for batch size of 1"
+ truncated_sequence_text = mutated_sequence[0][start_slice[0]:end_slice[0]]
+ if len(truncated_sequence_text)!=shift_logits.shape[1]-1: # shift_logits only has one extra token compared to truncated_sequence_text (the BOS token)
+ print("Tokenization error -- seq length: {} and shift_logits length - 1 : {}".format(len(mutated_sequence),shift_logits.shape[1]-1))
+ MSA_log_prior, MSA_start, MSA_end = msa_utils.update_retrieved_MSA_log_prior_indel(self, self.MSA_log_prior, self.MSA_start, self.MSA_end, mutated_sequence[0], self.clustal_hash)
+
+ elif self.retrieval_aggregation_mode=="aggregate_substitution":
+ MSA_log_prior=self.MSA_log_prior
+ MSA_start=self.MSA_start
+ MSA_end=self.MSA_end
+
+ shift_log_probas = torch.log_softmax(shift_logits,dim=-1)
+ fused_shift_log_probas = shift_log_probas.clone()
+ if flip is None:
+ flip = torch.zeros(batch_size).to(fused_shift_log_probas.device)
+ flip = flip > 0
+
+ for seq_index in range(batch_size):
+ min_prior_slice = max(start_slice[seq_index], MSA_start)
+ max_prior_slice = min(end_slice[seq_index], MSA_end)
+
+ # This will cascade into an error
+ if max_prior_slice <= min_prior_slice:
+ print("Non overlapping region detected: min_prior_slice {} and max_prior_slice {}".format(min_prior_slice,max_prior_slice))
+ continue
+
+ slice_prior = MSA_log_prior[min_prior_slice:max_prior_slice,:].to(fused_shift_log_probas.device)
+ if flip[seq_index]:
+ slice_prior = torch.flip(slice_prior,dims=(0,))
+ min_logits_slice = max(0,end_slice[seq_index]-MSA_end)
+ max_logits_slice = min_logits_slice + (max_prior_slice-min_prior_slice)
+ fused_shift_log_probas[seq_index,min_logits_slice:max_logits_slice,:] = (1-self.retrieval_inference_weight_RL)*shift_log_probas[seq_index,min_logits_slice:max_logits_slice,:] + self.retrieval_inference_weight_RL*slice_prior
+ else:
+ min_logits_slice = max(0, MSA_start-start_slice[seq_index])
+ max_logits_slice = min_logits_slice + (max_prior_slice-min_prior_slice)
+ fused_shift_log_probas[seq_index,min_logits_slice:max_logits_slice,:] = (1-self.retrieval_inference_weight_LR)*shift_log_probas[seq_index,min_logits_slice:max_logits_slice,:] + self.retrieval_inference_weight_LR*slice_prior
+
+ if self.retrieval_aggregation_mode=="aggregate_indel":
+ try:
+ # If a given residue colume is an added zero-column, then we overwrite prior fusion and only predict based on the autoregressive transformer inference mode.
+ inserted_retrieval_positions = [True if slice_prior[i].sum()==0 else False for i in range(len(slice_prior))]+[True] #Last True is for the end of sentence token
+ fused_shift_log_probas[:,inserted_retrieval_positions,:]=shift_log_probas[:,inserted_retrieval_positions,:]
+ except Exception as e:
+ print("Error when adding zero column(s) to account for insertion mutations.")
+ raise e
+
+ loss_fct = NLLLoss(reduction='none')
+ loss = loss_fct(input=fused_shift_log_probas.view(-1, fused_shift_log_probas.size(-1)), target=shift_labels.view(-1)).view(fused_shift_log_probas.shape[0],fused_shift_log_probas.shape[1])
+ mask = attention_mask[..., 1:].float()
+ mask[mask==0]=float('nan')
+ loss *= mask
+ loss = nanmean(loss, dim=1).mean()
+ else:
+ loss_fct = CrossEntropyLoss()
+ loss = loss_fct(shift_logits.view(-1, shift_logits.size(-1)), shift_labels.view(-1))
+
+ if not return_dict:
+ output = (lm_logits,) + transformer_outputs[1:]
+ return ((loss,) + output) if loss is not None else output
+
+ return TranceptionCausalLMOutputWithCrossAttentions(
+ loss=loss,
+ logits=lm_logits,
+ past_key_values=transformer_outputs.past_key_values,
+ hidden_states=transformer_outputs.hidden_states,
+ attentions=transformer_outputs.attentions,
+ cross_attentions=transformer_outputs.cross_attentions,
+ fused_shift_log_probas=fused_shift_log_probas
+ )
+
+
+ @staticmethod
+ def _reorder_cache(past: Tuple[Tuple[torch.Tensor]], beam_idx: torch.Tensor) -> Tuple[Tuple[torch.Tensor]]:
+ """
+ This function is used to re-order the :obj:`past_key_values` cache if
+ :meth:`~transformers.PreTrainedModel.beam_search` or :meth:`~transformers.PreTrainedModel.beam_sample` is
+ called. This is required to match :obj:`past_key_values` with the correct beam_idx at every generation step.
+ """
+ return tuple(
+ tuple(past_state.index_select(0, beam_idx.to(past_state.device)) for past_state in layer_past)
+ for layer_past in past
+ )
+
+ def score_mutants(self, DMS_data, target_seq=None, scoring_mirror=True, batch_size_inference=10, num_workers=10, indel_mode=False):
+ """
+ Method to score mutants in an input DMS file.
+ DMS_data: (dataframe) Dataframe containing the list of mutated sequences for scoring.
+ target_seq: (string) Full reference sequence (wild type) that is mutated in the DMS assay. If not None, returned scores are delta log likelihood wrt that sequence.
+ scoring_mirror: (bool) Whether to score mutated sequences from both directions (Left->Right and Right->Left).
+ batch_size_inference: (int) Batch size for scoring.
+ num_workers: (int) Number of workers to be used in the data loader.
+ indel_mode: (bool) Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions.
+ """
+ df = DMS_data.copy()
+ if ('mutated_sequence' not in df) and (not indel_mode): df['mutated_sequence'] = df['mutant'].apply(lambda x: scoring_utils.get_mutated_sequence(target_seq, x))
+
+ # Daniel R: Temporary since indel files all have a 'mutant' column containing the mutated sequence. Should probably
+ # make all of those mutated sequence columns and take this out entirely
+ # if indel_mode:
+ # df["mutated_sequence"] = df["mutant"]
+
+ assert ('mutated_sequence' in df), "DMS file to score does not have mutated_sequence column"
+ if 'mutant' not in df: df['mutant'] = df['mutated_sequence'] #if mutant not in DMS file we default to mutated_sequence
+ #if 'DMS_score' in df: del df['DMS_score']
+ #if 'DMS_score_bin' in df: del df['DMS_score_bin']
+ df = df[['mutated_sequence','mutant']]
+ if target_seq is not None:
+ df_left_to_right_slices = scoring_utils.get_sequence_slices(df, target_seq=target_seq, model_context_len = self.config.n_ctx - 2, indel_mode=indel_mode, scoring_window=self.config.scoring_window)
+ else:
+ df_left_to_right_slices = scoring_utils.get_sequence_slices(df, target_seq=list(df['mutated_sequence'])[0], model_context_len = self.config.n_ctx - 2, indel_mode=indel_mode, scoring_window='sliding')
+ print("Scoring sequences from left to right")
+ scores_L_to_R = scoring_utils.get_tranception_scores_mutated_sequences(model=self, mutated_sequence_df=df_left_to_right_slices, batch_size_inference=batch_size_inference, score_var_name='avg_score_L_to_R', target_seq=target_seq, num_workers=num_workers, indel_mode=indel_mode)
+ if scoring_mirror:
+ print("Scoring sequences from right to left")
+ df_right_to_left_slices = df_left_to_right_slices.copy()
+ df_right_to_left_slices['sliced_mutated_sequence'] = df_right_to_left_slices['sliced_mutated_sequence'].apply(lambda x: x[::-1])
+ scores_R_to_L = scoring_utils.get_tranception_scores_mutated_sequences(model=self, mutated_sequence_df=df_right_to_left_slices, batch_size_inference=batch_size_inference, score_var_name='avg_score_R_to_L', target_seq=target_seq, num_workers=num_workers, reverse=True, indel_mode=indel_mode)
+ all_scores = pd.merge(scores_L_to_R, scores_R_to_L, on='mutated_sequence', how='left', suffixes=('','_R_to_L'))
+ all_scores['avg_score'] = (all_scores['avg_score_L_to_R'] + all_scores['avg_score_R_to_L']) / 2.0
+ else:
+ all_scores = scores_L_to_R
+ all_scores['avg_score'] = all_scores['avg_score_L_to_R']
+ #By design "get_tranception_scores_mutated_sequences" drops the WT from the output. We add it back if that was one of the sequences to score in the DMS (score=0 by definition)
+ if indel_mode:
+ mutant_column = "mutant"
+ else:
+ mutant_column = "mutated_sequence"
+ if target_seq in DMS_data[mutant_column].values:
+ if scoring_mirror:
+ wt_row = pd.DataFrame([[target_seq,0,0,0]], columns=[mutant_column,'avg_score_L_to_R','avg_score_R_to_L','avg_score'])
+ else:
+ wt_row = pd.DataFrame([[target_seq,0,0]], columns=[mutant_column,'avg_score_L_to_R','avg_score'])
+ all_scores = pd.concat([all_scores,wt_row], ignore_index=True)
+ return all_scores
+
+ def encode_batch(self, protein_sequence, sequence_name="sliced_mutated_sequence"):
+ """
+ Method to process an input AA sequence batch (protein_sequence) and return a tokenized sequence (via the tokenizer associated to the model).
+ """
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='X', char_replacements='ACDEFGHIKLMNPQRSTVWY')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='B', char_replacements='DN')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='J', char_replacements='IL')
+ protein_sequence[sequence_name] = scoring_utils.sequence_replace(sequences=protein_sequence[sequence_name], char_to_replace='Z', char_replacements='EQ')
+ return self.config.tokenizer(list(protein_sequence[sequence_name]), add_special_tokens=True, truncation=True, padding=True, max_length=self.config.n_ctx)
+
diff --git a/proteingym/baselines/tranception/tranception/outputs.py b/proteingym/baselines/tranception/tranception/outputs.py
new file mode 100644
index 0000000..8b119fc
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/outputs.py
@@ -0,0 +1,48 @@
+from dataclasses import dataclass
+from typing import Optional, Tuple
+
+import torch
+
+from transformers.file_utils import ModelOutput
+
+@dataclass
+class TranceptionCausalLMOutputWithCrossAttentions(ModelOutput):
+ """
+ Class for Tranception causal language model (or autoregressive) outputs.
+ Args:
+ loss (`torch.FloatTensor` of shape `(1,)`, *optional*, returned when `labels` is provided):
+ Language modeling loss (for next-token prediction).
+ logits (`torch.FloatTensor` of shape `(batch_size, sequence_length, config.vocab_size)`):
+ Prediction scores of the language modeling head (scores for each vocabulary token before SoftMax).
+ hidden_states (`tuple(torch.FloatTensor)`, *optional*, returned when `output_hidden_states=True` is passed or when `config.output_hidden_states=True`):
+ Tuple of `torch.FloatTensor` (one for the output of the embeddings + one for the output of each layer) of
+ shape `(batch_size, sequence_length, hidden_size)`.
+ Hidden-states of the model at the output of each layer plus the initial embedding outputs.
+ attentions (`tuple(torch.FloatTensor)`, *optional*, returned when `output_attentions=True` is passed or when `config.output_attentions=True`):
+ Tuple of `torch.FloatTensor` (one for each layer) of shape `(batch_size, num_heads, sequence_length,
+ sequence_length)`.
+ Attentions weights after the attention softmax, used to compute the weighted average in the self-attention
+ heads.
+ cross_attentions (`tuple(torch.FloatTensor)`, *optional*, returned when `output_attentions=True` is passed or when `config.output_attentions=True`):
+ Tuple of `torch.FloatTensor` (one for each layer) of shape `(batch_size, num_heads, sequence_length,
+ sequence_length)`.
+ Cross attentions weights after the attention softmax, used to compute the weighted average in the
+ cross-attention heads.
+ past_key_values (`tuple(tuple(torch.FloatTensor))`, *optional*, returned when `use_cache=True` is passed or when `config.use_cache=True`):
+ Tuple of `torch.FloatTensor` tuples of length `config.n_layers`, with each tuple containing the cached key,
+ value states of the self-attention and the cross-attention layers if model is used in encoder-decoder
+ setting. Only relevant if `config.is_decoder = True`.
+ Contains pre-computed hidden-states (key and values in the attention blocks) that can be used (see
+ `past_key_values` input) to speed up sequential decoding.
+ fused_shift_log_probas (`torch.FloatTensor` of shape (batch_size, sequence_length, config.vocab_size), *optional*, returned when config.retrieval_aggregation_mode is not None.
+ log_probas for each residue position after aggregating autoregressive logits and retrieval logits.
+
+ """
+
+ loss: Optional[torch.FloatTensor] = None
+ logits: torch.FloatTensor = None
+ past_key_values: Optional[Tuple[Tuple[torch.FloatTensor]]] = None
+ hidden_states: Optional[Tuple[torch.FloatTensor]] = None
+ attentions: Optional[Tuple[torch.FloatTensor]] = None
+ cross_attentions: Optional[Tuple[torch.FloatTensor]] = None
+ fused_shift_log_probas: Optional[torch.FloatTensor] = None
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/utils/__init__.py b/proteingym/baselines/tranception/tranception/utils/__init__.py
new file mode 100644
index 0000000..90ad887
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/utils/__init__.py
@@ -0,0 +1 @@
+from . import scoring_utils, msa_utils
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/utils/dms_utils.py b/proteingym/baselines/tranception/tranception/utils/dms_utils.py
new file mode 100644
index 0000000..3c46e1c
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/utils/dms_utils.py
@@ -0,0 +1,31 @@
+import pandas as pd
+import numpy as np
+from tranception.utils import scoring_utils
+
+def DMS_file_cleanup(DMS_filename, target_seq, start_idx=1, end_idx=None, DMS_mutant_column='mutant', DMS_phenotype_name='score', DMS_directionality=1, AA_vocab = "ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Function to process the raw substitution DMS assay data (eg., removing invalid mutants, aggregate silent mutations).
+ """
+ DMS_data = pd.read_csv(DMS_filename, low_memory=False)
+ end_idx = start_idx + len(target_seq) - 1 if end_idx is None else end_idx
+ DMS_data['mutant'] = DMS_data[DMS_mutant_column]
+
+ DMS_data=DMS_data[DMS_data['mutant'].notnull()].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([len(y)>=3 for y in x.split(":")]))].copy() #Mutant triplets should have at least 3 or more characters
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([(y[0] in AA_vocab) and (y[1:-1].isnumeric()) and (y[-1] in AA_vocab) for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([int(y[1:-1])-start_idx >=0 and int(y[1:-1]) <= end_idx for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([y[0]==target_seq[int(y[1:-1])-start_idx] for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([y[0]!=y[-1] for y in x.split(":")]))].copy()
+
+ DMS_data[DMS_phenotype_name]=pd.to_numeric(DMS_data[DMS_phenotype_name],errors='coerce')
+ DMS_data=DMS_data[np.isfinite(DMS_data[DMS_phenotype_name])]
+ DMS_data.dropna(subset = [DMS_phenotype_name], inplace=True)
+ DMS_data['DMS_score'] = DMS_data[DMS_phenotype_name] * DMS_directionality
+ DMS_data=DMS_data[['mutant','DMS_score']]
+ DMS_data=DMS_data.groupby('mutant').mean().reset_index()
+
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: scoring_utils.get_mutated_sequence(target_seq, x))
+ DMS_data=DMS_data[['mutant','mutated_sequence','DMS_score']]
+
+ return DMS_data
+
diff --git a/proteingym/baselines/tranception/tranception/utils/msa_utils.py b/proteingym/baselines/tranception/tranception/utils/msa_utils.py
new file mode 100644
index 0000000..e457082
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/utils/msa_utils.py
@@ -0,0 +1,369 @@
+import numpy as np
+import pandas as pd
+from collections import defaultdict
+import random
+import os
+import torch
+from Bio.Align.Applications import ClustalOmegaCommandline
+
+def filter_msa(msa_data, num_sequences_kept=3):
+ """
+ Helper function to filter an input MSA msa_data (obtained via process_msa_data) and keep only num_sequences_kept aligned sequences.
+ If the MSA already has fewer sequences than num_sequences_kept, we keep the MSA as is.
+ If filtering, we always keep the first sequence of the MSA (ie. the wild type) by default.
+ Sampling is done without replacement.
+ """
+ if len(list(msa_data.keys())) <= num_sequences_kept:
+ return msa_data
+ filtered_msa = {}
+ wt_name = next(iter(msa_data))
+ filtered_msa[wt_name] = msa_data[wt_name]
+ del msa_data[wt_name]
+ sequence_names = list(msa_data.keys())
+ sequence_names_sampled = random.sample(sequence_names,k=num_sequences_kept-1)
+ for seq in sequence_names_sampled:
+ filtered_msa[seq] = msa_data[seq]
+ return filtered_msa
+
+def process_msa_data(MSA_data_file):
+ """
+ Helper function that takes as input a path to a MSA file (expects a2m format) and returns a dict mapping sequence ID to the corresponding AA sequence.
+ """
+ msa_data = defaultdict(str)
+ sequence_name = ""
+ with open(MSA_data_file, "r") as msa_file:
+ for i, line in enumerate(msa_file):
+ line = line.rstrip()
+ if line.startswith(">"):
+ sequence_name = line
+ else:
+ msa_data[sequence_name] += line.upper()
+ return msa_data
+
+def get_one_hot_sequences_dict(msa_data,MSA_start,MSA_end,vocab):
+ vocab_size = len(vocab.keys())
+ num_sequences_msa = len(msa_data.keys())
+ one_hots = np.zeros((num_sequences_msa,MSA_end-MSA_start,vocab_size))
+ for i,seq_name in enumerate(msa_data.keys()):
+ sequence = msa_data[seq_name]
+ for j,letter in enumerate(sequence):
+ if letter in vocab:
+ k = vocab[letter]
+ one_hots[i,j,k] = 1.0
+ return one_hots
+
+def one_hot(sequence_string,vocab):
+ one_hots = np.zeros((len(sequence_string),len(vocab.keys())))
+ for j,letter in enumerate(sequence_string):
+ if letter in vocab:
+ k = vocab[letter]
+ one_hots[j,k] = 1.0
+ return one_hots.flatten()
+
+def get_msa_prior(MSA_data_file, MSA_weight_file_name, MSA_start, MSA_end, len_target_seq, vocab, retrieval_aggregation_mode="aggregate_substitution", filter_MSA=True, verbose=False):
+ """
+ Function to enable retrieval inference mode, via computation of (weighted) pseudocounts of AAs at each position of the retrieved MSA.
+ MSA_data_file: (string) path to MSA file (expects a2m format).
+ MSA_weight_file_name: (string) path to sequence weights in MSA.
+ MSA_start: (int) Sequence position that the MSA starts at (1-indexing).
+ MSA_end: (int) Sequence position that the MSA ends at (1-indexing).
+ len_target_seq: (int) Full length of sequence to be scored.
+ vocab: (dict) Vocabulary of the tokenizer.
+ retrieval_aggregation_mode: (string) Mode for retrieval inference (aggregate_substitution Vs aggregate_indel). If None, places a uniform prior over each token.
+ filter_MSA: (bool) Whether to filter out sequences with very low hamming similarity (< 0.2) to the reference sequence in the MSA (first sequence).
+ verbose: (bool) Whether to print to the console processing details along the way.
+ """
+ msa_data = process_msa_data(MSA_data_file)
+ vocab_size = len(vocab.keys())
+ if verbose: print("Target seq len is {}, MSA length is {}, start position is {}, end position is {} and vocab size is {}".format(len_target_seq,MSA_end-MSA_start,MSA_start,MSA_end,vocab_size))
+
+ if filter_MSA:
+ if verbose: print("Num sequences in MSA pre filtering: {}".format(len(msa_data.keys())))
+ list_sequence_names = list(msa_data.keys())
+ focus_sequence_name = list(msa_data.keys())[0]
+ ref_sequence_hot = one_hot(msa_data[focus_sequence_name],vocab)
+ for sequence_name in list_sequence_names:
+ seq_hot = one_hot(msa_data[sequence_name],vocab)
+ hamming_similarity_seq_ref = np.dot(ref_sequence_hot,seq_hot) / np.dot(ref_sequence_hot,ref_sequence_hot)
+ if hamming_similarity_seq_ref < 0.2:
+ del msa_data[sequence_name]
+ if verbose: print("Num sequences in MSA post filtering: {}".format(len(msa_data.keys())))
+
+ if MSA_weight_file_name is not None:
+ if verbose: print("Using weights in {} for sequences in MSA.".format(MSA_weight_file_name))
+ assert os.path.exists(MSA_weight_file_name), "Weights file not located on disk."
+ MSA_EVE = MSA_processing(
+ MSA_location=MSA_data_file,
+ use_weights=True,
+ weights_location=MSA_weight_file_name
+ )
+ #We scan through all sequences to see if we have a weight for them as per EVE pre-processing. We drop them otherwise.
+ dropped_sequences=0
+ list_sequence_names = list(msa_data.keys())
+ MSA_weight=[]
+ for sequence_name in list_sequence_names:
+ if sequence_name not in MSA_EVE.seq_name_to_sequence:
+ dropped_sequences +=1
+ del msa_data[sequence_name]
+ else:
+ MSA_weight.append(MSA_EVE.seq_name_to_weight[sequence_name])
+ if verbose: print("Dropped {} sequences from MSA due to absent sequence weights".format(dropped_sequences))
+ else:
+ MSA_weight = [1] * len(list(msa_data.keys()))
+
+ if retrieval_aggregation_mode=="aggregate_substitution" or retrieval_aggregation_mode=="aggregate_indel":
+ one_hots = get_one_hot_sequences_dict(msa_data,MSA_start,MSA_end,vocab)
+ MSA_weight = np.expand_dims(np.array(MSA_weight),axis=(1,2))
+ base_rate = 1e-5
+ base_rates = np.ones_like(one_hots) * base_rate
+ weighted_one_hots = (one_hots + base_rates) * MSA_weight
+ MSA_weight_norm_counts = weighted_one_hots.sum(axis=-1).sum(axis=0)
+ MSA_weight_norm_counts = np.tile(MSA_weight_norm_counts.reshape(-1,1), (1,vocab_size))
+ one_hots_avg = weighted_one_hots.sum(axis=0) / MSA_weight_norm_counts
+ msa_prior = np.zeros((len_target_seq,vocab_size))
+ print(MSA_start)
+ print(MSA_end)
+ print(one_hots_avg.shape)
+ msa_prior[MSA_start:MSA_end,:]=one_hots_avg
+ else:
+ msa_prior = np.ones((len_target_seq,vocab_size)) / vocab_size
+
+ if verbose:
+ for idx, position in enumerate(msa_prior):
+ if len(position)!=25:
+ print("Size error")
+ if not round(position.sum(),2)==1.0:
+ print("Position at index {} does not add up to 1: {}".format(idx, position.sum()))
+
+ return msa_prior
+
+
+def update_retrieved_MSA_log_prior_indel(model, MSA_log_prior, MSA_start, MSA_end, mutated_sequence, clustal_hash):
+ """
+ Function to process MSA when scoring indels.
+ To identify positions to add / remove in the retrieved MSA, we append and align the sequence to be scored to the original MSA for that protein family with Clustal Omega.
+ If the original MSA is relatively deep (over 100k sequences), we sample (by default) 100k rows at random from that MSA to speed computations.
+ MSA sampling is performed only once (for the first sequence to be scored). Subsequent scoring use the same MSA sample.
+ """
+ if not os.path.isdir(model.MSA_folder + os.sep + "Sampled"):
+ os.mkdir(model.MSA_folder + os.sep + "Sampled")
+ sampled_MSA_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Sampled_" + clustal_hash + "_" + model.MSA_filename.split(os.sep)[-1]
+
+ if not os.path.exists(sampled_MSA_location):
+ msa_data = process_msa_data(model.MSA_filename)
+ msa_data_sampled = filter_msa(msa_data, num_sequences_kept=100000) #If MSA has less than 100k sequences, the sample is identical to original MSA
+ with open(sampled_MSA_location, 'w') as sampled_write_location:
+ for index, key in enumerate(msa_data_sampled):
+ key_name = ">REFERENCE_SEQUENCE" if index==0 else key
+ msa_data_sampled[key] = msa_data_sampled[key].upper()
+ msa_data_sampled[key] = msa_data_sampled[key].replace(".","-")
+ sampled_write_location.write(key_name+"\n"+"\n".join([msa_data_sampled[key][i:i+80] for i in range(0, len(msa_data_sampled[key]), 80)])+"\n")
+
+ seq_to_align_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Seq_to_align_" + clustal_hash + "_" + model.MSA_filename.split(os.sep)[-1]
+ sequence_text_split = [mutated_sequence[i:i+80] for i in range(0, len(mutated_sequence), 80)]
+ sequence_text_split_split_join = "\n".join([">SEQ_TO_SCORE"]+sequence_text_split)
+ os.system("echo '"+sequence_text_split_split_join+"' > "+seq_to_align_location)
+
+ expanded_MSA_location = model.MSA_folder + os.sep + "Sampled" + os.sep + "Expanded_" + clustal_hash + "_" + model.MSA_filename.split(os.sep)[-1]
+ clustalw_cline = ClustalOmegaCommandline(cmd=model.config.clustal_omega_location,
+ profile1=sampled_MSA_location,
+ profile2=seq_to_align_location,
+ outfile=expanded_MSA_location,
+ force=True)
+ stdout, stderr = clustalw_cline()
+ msa_data = process_msa_data(expanded_MSA_location)
+ aligned_seqA, aligned_seqB = msa_data[">SEQ_TO_SCORE"], msa_data[">REFERENCE_SEQUENCE"]
+ try:
+ keep_column=[]
+ for column_index_pairwise_alignment in range(len(aligned_seqA)):
+ if aligned_seqA[column_index_pairwise_alignment]=="-" and aligned_seqB[column_index_pairwise_alignment]=="-":
+ continue # Skips if both are gaps
+ elif aligned_seqA[column_index_pairwise_alignment]=="-":
+ keep_column.append(False) # Skips if the query SEQ_TO_SCORE is a gap
+ elif aligned_seqB[column_index_pairwise_alignment]=="-":
+ MSA_log_prior=torch.cat((MSA_log_prior[:column_index_pairwise_alignment], torch.zeros(MSA_log_prior.shape[1]).view(1,-1).cuda(), MSA_log_prior[column_index_pairwise_alignment:]),dim=0)
+ keep_column.append(True) #keep the zero column we just added
+ else:
+ keep_column.append(True)
+ MSA_log_prior = MSA_log_prior[keep_column]
+ MSA_end = MSA_start + len(MSA_log_prior)
+ except:
+ print("Error when processing the following alignment: {}".format(expanded_MSA_location))
+ return MSA_log_prior, MSA_start, MSA_end
+
+class MSA_processing:
+ def __init__(self,
+ MSA_location="",
+ theta=0.2,
+ use_weights=True,
+ weights_location="./data/weights",
+ preprocess_MSA=True,
+ threshold_sequence_frac_gaps=0.5,
+ threshold_focus_cols_frac_gaps=1.0,
+ remove_sequences_with_indeterminate_AA_in_focus_cols=True
+ ):
+
+ """
+ This MSA_processing class is directly borrowed from the EVE codebase: https://github.com/OATML-Markslab/EVE
+
+ Parameters:
+ - msa_location: (path) Location of the MSA data. Constraints on input MSA format:
+ - focus_sequence is the first one in the MSA data
+ - first line is structured as follows: ">focus_seq_name/start_pos-end_pos" (e.g., >SPIKE_SARS2/310-550)
+ - corespondding sequence data located on following line(s)
+ - then all other sequences follow with ">name" on first line, corresponding data on subsequent lines
+ - theta: (float) Sequence weighting hyperparameter. Generally: Prokaryotic and eukaryotic families = 0.2; Viruses = 0.01
+ - use_weights: (bool) If False, sets all sequence weights to 1. If True, checks weights_location -- if non empty uses that;
+ otherwise compute weights from scratch and store them at weights_location
+ - weights_location: (path) Location to load from/save to the sequence weights
+ - preprocess_MSA: (bool) performs pre-processing of MSA to remove short fragments and positions that are not well covered.
+ - threshold_sequence_frac_gaps: (float, between 0 and 1) Threshold value to define fragments
+ - sequences with a fraction of gap characters above threshold_sequence_frac_gaps are removed
+ - default is set to 0.5 (i.e., fragments with 50% or more gaps are removed)
+ - threshold_focus_cols_frac_gaps: (float, between 0 and 1) Threshold value to define focus columns
+ - positions with a fraction of gap characters above threshold_focus_cols_pct_gaps will be set to lower case (and not included in the focus_cols)
+ - default is set to 0.3 (i.e., focus positions are the ones with 30% of gaps or less, i.e., 70% or more residue occupancy)
+ - remove_sequences_with_indeterminate_AA_in_focus_cols: (bool) Remove all sequences that have indeterminate AA (e.g., B, J, X, Z) at focus positions of the wild type
+ """
+ np.random.seed(2021)
+ self.MSA_location = MSA_location
+ self.weights_location = weights_location
+ self.theta = theta
+ self.alphabet = "ACDEFGHIKLMNPQRSTVWY"
+ self.use_weights = use_weights
+ self.preprocess_MSA = preprocess_MSA
+ self.threshold_sequence_frac_gaps = threshold_sequence_frac_gaps
+ self.threshold_focus_cols_frac_gaps = threshold_focus_cols_frac_gaps
+ self.remove_sequences_with_indeterminate_AA_in_focus_cols = remove_sequences_with_indeterminate_AA_in_focus_cols
+
+ self.gen_alignment()
+
+ def gen_alignment(self, verbose=False):
+ """ Read training alignment and store basics in class instance """
+ self.aa_dict = {}
+ for i,aa in enumerate(self.alphabet):
+ self.aa_dict[aa] = i
+
+ self.seq_name_to_sequence = defaultdict(str)
+ name = ""
+ with open(self.MSA_location, "r") as msa_data:
+ for i, line in enumerate(msa_data):
+ line = line.rstrip()
+ if line.startswith(">"):
+ name = line
+ if i==0:
+ self.focus_seq_name = name
+ else:
+ self.seq_name_to_sequence[name] += line
+
+
+ ## MSA pre-processing to remove inadequate columns and sequences
+ if self.preprocess_MSA:
+ msa_df = pd.DataFrame.from_dict(self.seq_name_to_sequence, orient='index', columns=['sequence'])
+ # Data clean up
+ msa_df.sequence = msa_df.sequence.apply(lambda x: x.replace(".","-")).apply(lambda x: ''.join([aa.upper() for aa in x]))
+ # Remove columns that would be gaps in the wild type
+ non_gap_wt_cols = [aa!='-' for aa in msa_df.sequence[self.focus_seq_name]]
+ msa_df['sequence'] = msa_df['sequence'].apply(lambda x: ''.join([aa for aa,non_gap_ind in zip(x, non_gap_wt_cols) if non_gap_ind]))
+ assert 0.0 <= self.threshold_sequence_frac_gaps <= 1.0,"Invalid fragment filtering parameter"
+ assert 0.0 <= self.threshold_focus_cols_frac_gaps <= 1.0,"Invalid focus position filtering parameter"
+ msa_array = np.array([list(seq) for seq in msa_df.sequence])
+ gaps_array = np.array(list(map(lambda seq: [aa=='-' for aa in seq], msa_array)))
+ # Identify fragments with too many gaps
+ seq_gaps_frac = gaps_array.mean(axis=1)
+ seq_below_threshold = seq_gaps_frac <= self.threshold_sequence_frac_gaps
+ if verbose: print("Proportion of sequences dropped due to fraction of gaps: "+str(round(float(1 - seq_below_threshold.sum()/seq_below_threshold.shape)*100,2))+"%")
+ # Identify focus columns
+ columns_gaps_frac = gaps_array[seq_below_threshold].mean(axis=0)
+ index_cols_below_threshold = columns_gaps_frac <= self.threshold_focus_cols_frac_gaps
+ if verbose: print("Proportion of non-focus columns removed: "+str(round(float(1 - index_cols_below_threshold.sum()/index_cols_below_threshold.shape)*100,2))+"%")
+ # Lower case non focus cols and filter fragment sequences
+ msa_df['sequence'] = msa_df['sequence'].apply(lambda x: ''.join([aa.upper() if upper_case_ind else aa.lower() for aa, upper_case_ind in zip(x, index_cols_below_threshold)]))
+ msa_df = msa_df[seq_below_threshold]
+ # Overwrite seq_name_to_sequence with clean version
+ self.seq_name_to_sequence = defaultdict(str)
+ for seq_idx in range(len(msa_df['sequence'])):
+ self.seq_name_to_sequence[msa_df.index[seq_idx]] = msa_df.sequence[seq_idx]
+
+ self.focus_seq = self.seq_name_to_sequence[self.focus_seq_name]
+ self.focus_cols = [ix for ix, s in enumerate(self.focus_seq) if s == s.upper() and s!='-']
+ self.focus_seq_trimmed = [self.focus_seq[ix] for ix in self.focus_cols]
+ self.seq_len = len(self.focus_cols)
+ self.alphabet_size = len(self.alphabet)
+
+ # Connect local sequence index with uniprot index (index shift inferred from 1st row of MSA)
+ try:
+ focus_loc = self.focus_seq_name.split("/")[-1]
+ start,stop = focus_loc.split("-")
+ self.focus_start_loc = int(start)
+ self.focus_stop_loc = int(stop)
+ except:
+ start,stop = 1,len(self.focus_seq)
+ self.focus_start_loc = int(start)
+ self.focus_stop_loc = int(stop)
+ self.uniprot_focus_col_to_wt_aa_dict \
+ = {idx_col+int(start):self.focus_seq[idx_col] for idx_col in self.focus_cols}
+ self.uniprot_focus_col_to_focus_idx \
+ = {idx_col+int(start):idx_col for idx_col in self.focus_cols}
+
+ # Move all letters to CAPS; keeps focus columns only
+ self.raw_seq_name_to_sequence = self.seq_name_to_sequence.copy()
+ for seq_name,sequence in self.seq_name_to_sequence.items():
+ sequence = sequence.replace(".","-")
+ self.seq_name_to_sequence[seq_name] = [sequence[ix].upper() for ix in self.focus_cols]
+
+ # Remove sequences that have indeterminate AA (e.g., B, J, X, Z) in the focus columns
+ if self.remove_sequences_with_indeterminate_AA_in_focus_cols:
+ alphabet_set = set(list(self.alphabet))
+ seq_names_to_remove = []
+ for seq_name,sequence in self.seq_name_to_sequence.items():
+ for letter in sequence:
+ if letter not in alphabet_set and letter != "-":
+ seq_names_to_remove.append(seq_name)
+ continue
+ seq_names_to_remove = list(set(seq_names_to_remove))
+ for seq_name in seq_names_to_remove:
+ del self.seq_name_to_sequence[seq_name]
+
+ # Encode the sequences
+ self.one_hot_encoding = np.zeros((len(self.seq_name_to_sequence.keys()),len(self.focus_cols),len(self.alphabet)))
+ if verbose: print("One-hot encoded sequences shape:" + str(self.one_hot_encoding.shape))
+ for i,seq_name in enumerate(self.seq_name_to_sequence.keys()):
+ sequence = self.seq_name_to_sequence[seq_name]
+ for j,letter in enumerate(sequence):
+ if letter in self.aa_dict:
+ k = self.aa_dict[letter]
+ self.one_hot_encoding[i,j,k] = 1.0
+
+ if self.use_weights:
+ try:
+ self.weights = np.load(file=self.weights_location)
+ if verbose: print("Loaded sequence weights from disk")
+ except:
+ if verbose: print ("Computing sequence weights")
+ list_seq = self.one_hot_encoding
+ list_seq = list_seq.reshape((list_seq.shape[0], list_seq.shape[1] * list_seq.shape[2]))
+ def compute_weight(seq):
+ number_non_empty_positions = np.dot(seq,seq)
+ if number_non_empty_positions>0:
+ denom = np.dot(list_seq,seq) / np.dot(seq,seq)
+ denom = np.sum(denom > 1 - self.theta)
+ return 1/denom
+ else:
+ return 0.0 #return 0 weight if sequence is fully empty
+ self.weights = np.array(list(map(compute_weight,list_seq)))
+ np.save(file=self.weights_location, arr=self.weights)
+ else:
+ # If not using weights, use an isotropic weight matrix
+ if verbose: print("Not weighting sequence data")
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+
+ self.Neff = np.sum(self.weights)
+ self.num_sequences = self.one_hot_encoding.shape[0]
+ self.seq_name_to_weight={}
+ for i,seq_name in enumerate(self.seq_name_to_sequence.keys()):
+ self.seq_name_to_weight[seq_name]=self.weights[i]
+
+ if verbose:
+ print ("Neff =",str(self.Neff))
+ print ("Data Shape =",self.one_hot_encoding.shape)
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/utils/scoring_utils.py b/proteingym/baselines/tranception/tranception/utils/scoring_utils.py
new file mode 100644
index 0000000..36cba5c
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/utils/scoring_utils.py
@@ -0,0 +1,203 @@
+import os
+import tqdm
+import re
+import numpy as np
+import pandas as pd
+
+import torch
+from torch.nn import CrossEntropyLoss, NLLLoss
+from torch.utils.data.sampler import Sampler, SequentialSampler
+
+from transformers import DataCollatorForLanguageModeling, PreTrainedTokenizerFast
+from datasets import Dataset
+
+AA_vocab = "ACDEFGHIKLMNPQRSTVWY"
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab=AA_vocab):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def nanmean(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs) / (~is_nan).float().sum(*args, **kwargs)
+
+def nansum(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs)
+
+def get_optimal_window(mutation_position_relative, seq_len_wo_special, model_window):
+ """
+ Helper function that selects an optimal sequence window that fits the maximum model context size.
+ If the sequence length is less than the maximum context size, the full sequence is returned.
+ """
+ half_model_window = model_window // 2
+ if seq_len_wo_special <= model_window:
+ return [0,seq_len_wo_special]
+ elif mutation_position_relative < half_model_window:
+ return [0,model_window]
+ elif mutation_position_relative >= seq_len_wo_special - half_model_window:
+ return [seq_len_wo_special - model_window, seq_len_wo_special]
+ else:
+ return [max(0,mutation_position_relative-half_model_window), min(seq_len_wo_special,mutation_position_relative+half_model_window)]
+
+def sequence_replace_single(sequence, char_to_replace, char_replacements):
+ char_replacements = list(char_replacements)
+ positions = [m.start() for m in re.finditer(char_to_replace, sequence)]
+ replacements = np.random.choice(a=char_replacements, size=len(positions), replace=True)
+ sequence=list(sequence)
+ for idx, position in enumerate(positions):
+ sequence[position]=replacements[idx]
+ return ''.join(sequence)
+
+def sequence_replace(sequences, char_to_replace, char_replacements):
+ """
+ Helper function that replaces all Amino Acids passsed in via char_to_replace (as a string of AAs) with Amino Acids sampled from char_replacements (also a string of eligible AAs).
+ """
+ return [sequence_replace_single(sequence, char_to_replace, char_replacements) for sequence in sequences]
+
+def get_tranception_scores_mutated_sequences(model, mutated_sequence_df, batch_size_inference, score_var_name, target_seq, num_workers=10, reverse=False, indel_mode=False):
+ """
+ Helper function that takes as input a set of mutated sequences (in a pandas dataframe) and returns scores for each mutation.
+ If target_seq is not None, returns the delta log likelihood wrt that target sequence -- otherwise returns the log likelihood of the protein sequences.
+ """
+ scores = {}
+ scores['mutated_sequence']=[]
+ scores['sliced_mutated_sequence']=[]
+ scores['window_start']=[]
+ scores['window_end']=[]
+ scores['score']=[]
+ with torch.no_grad():
+ ds = Dataset.from_pandas(mutated_sequence_df)
+ ds.set_transform(model.encode_batch)
+ data_collator = DataCollatorForLanguageModeling(
+ tokenizer=model.config.tokenizer,
+ mlm=False)
+ sampler = SequentialSampler(ds)
+ ds_loader = torch.utils.data.DataLoader(ds, batch_size=batch_size_inference, sampler=sampler, collate_fn=data_collator, num_workers=num_workers, pin_memory=True, drop_last=False)
+ mutant_index=0
+ for encoded_batch in tqdm.tqdm(ds_loader, desc="Scoring batches"):
+ full_batch_length = len(encoded_batch['input_ids'])
+ mutated_sequence = np.array(mutated_sequence_df['mutated_sequence'][mutant_index:mutant_index+full_batch_length])
+ scores['mutated_sequence'] += list(mutated_sequence)
+ sliced_mutated_sequence = np.array(mutated_sequence_df['sliced_mutated_sequence'][mutant_index:mutant_index+full_batch_length])
+ scores['sliced_mutated_sequence'] += list(sliced_mutated_sequence)
+ window_start = np.array(mutated_sequence_df['window_start'][mutant_index:mutant_index+full_batch_length])
+ scores['window_start'] += list(window_start)
+ window_end = np.array(mutated_sequence_df['window_end'][mutant_index:mutant_index+full_batch_length])
+ scores['window_end'] += list(window_end)
+ for k, v in encoded_batch.items():
+ if isinstance(v, torch.Tensor):
+ encoded_batch[k] = v.to(model.device)
+ shift_labels = encoded_batch['labels'][..., 1:].contiguous()
+ if (hasattr(model.config,"retrieval_aggregation_mode")) and (model.config.retrieval_aggregation_mode is not None):
+ if reverse:
+ encoded_batch['flip']=torch.tensor([1]*full_batch_length)
+ encoded_batch['start_slice']=window_start
+ encoded_batch['end_slice']=window_end
+ encoded_batch['mutated_sequence'] = mutated_sequence #only mutated_sequence is flipped if the scoring_mirror branch of score_mutants. No need to flip mutated_sequence for MSA re-aligning
+ fused_shift_log_probas=model(**encoded_batch,return_dict=True).fused_shift_log_probas
+ loss_fct = NLLLoss(reduction='none')
+ loss = - loss_fct(input=fused_shift_log_probas.view(-1, fused_shift_log_probas.size(-1)), target=shift_labels.view(-1)).view(fused_shift_log_probas.shape[0],fused_shift_log_probas.shape[1])
+ else:
+ lm_logits=model(**encoded_batch,return_dict=True).logits
+ shift_logits = lm_logits[..., :-1, :].contiguous()
+ loss_fct = CrossEntropyLoss(reduction='none')
+ loss = - loss_fct(input=shift_logits.view(-1, shift_logits.size(-1)), target=shift_labels.view(-1)).view(shift_logits.shape[0],shift_logits.shape[1])
+ mask = encoded_batch['attention_mask'][..., 1:].float()
+ mask[mask==0]=float('nan')
+ loss *= mask
+ loss = nansum(loss, dim=1)
+ scores_batch = list(loss.cpu().numpy())
+ full_batch_length = len(encoded_batch['input_ids'])
+ scores['score'] += scores_batch
+ mutant_index+=full_batch_length
+ scores = pd.DataFrame(scores)
+ if model.config.scoring_window=="sliding":
+ scores = scores[['mutated_sequence','score']].groupby('mutated_sequence').sum().reset_index() #We need to aggregate scores when using sliding mode
+ scores['score'] = scores['score'] / scores['mutated_sequence'].map(lambda x: len(x))
+ if target_seq is not None:
+ scores_mutated_seq = scores[scores.mutated_sequence != target_seq]
+ scores_wt = scores[scores.mutated_sequence == target_seq]
+ merge_delta = 'mutated_sequence' if model.config.scoring_window=="sliding" else 'window_start'
+ if model.config.scoring_window=="optimal":
+ delta_scores = pd.merge(scores_mutated_seq,scores_wt,how='left',on=[merge_delta],suffixes=('','_wt'))
+ delta_scores[score_var_name] = delta_scores['score'] - delta_scores['score_wt']
+ elif model.config.scoring_window=="sliding":
+ delta_scores = scores_mutated_seq.copy()
+ delta_scores[score_var_name] = delta_scores['score'] - list(scores_wt['score'])[0] # In sliding mode there is a single reference window for the WT
+ return delta_scores[['mutated_sequence',score_var_name]]
+ else:
+ scores[score_var_name] = scores['score']
+ return scores[['mutated_sequence',score_var_name]]
+
+def get_sequence_slices(df, target_seq, model_context_len, start_idx=1, scoring_window="optimal", indel_mode=False):
+ """
+ Helper function that takes as input a (pandas) dataframe df that contains a list of mutant triplets (substitutions) or full mutated sequences (indels) for scoring.
+ It returns a processed DMS in which sequences have been sliced to satisfy the maximum context window of the model.
+ df: (dataframe) Input dataframe to be processed
+ target_seq: (string) Full reference sequence (wild type) that is mutated in the DMS assay.
+ model_context_len: (int) Maximum context size for the model.
+ start_idx: (int) Integer to move to 0-indexing of positions (mutation triplet are typically based on 1-indexing).
+ scoring_window: (string) Method to slice sequences longer than maximum context size:
+ - optimal selects a single window as large as possible via the get_optimal_window function (this is the default)
+ - sliding splits the full sequence in contiguous (non-overlapping) chunks that are of size equal to the max context (except the last chunk which may be shorter)
+ indel_mode: (bool) Flag to be used when scoring insertions and deletions. Otherwise assumes substitutions.
+ Note: when scoring indels for sequences that would be longer than the model max context length, it is preferable to use the "sliding" scoring_window. Use "optimal" otherwise.
+ """
+ len_target_seq = len(target_seq)
+ num_mutants = len(df['mutated_sequence'])
+ df=df.reset_index(drop=True)
+ if scoring_window=="optimal":
+ df['mutation_barycenter'] = df['mutant'].apply(lambda x: int(np.array([int(mutation[1:-1]) - start_idx for mutation in x.split(':')]).mean())) if not indel_mode else df['mutated_sequence'].apply(lambda x: len(x)//2)
+ df['scoring_optimal_window'] = df['mutation_barycenter'].apply(lambda x: get_optimal_window(x, len_target_seq, model_context_len)) if not indel_mode else df['mutated_sequence'].apply(lambda x: (0,len(x)))
+ df['sliced_mutated_sequence'] = [df['mutated_sequence'][index][df['scoring_optimal_window'][index][0]:df['scoring_optimal_window'][index][1]] for index in range(num_mutants)]
+ df['window_start'] = df['scoring_optimal_window'].map(lambda x: x[0])
+ df['window_end'] = df['scoring_optimal_window'].map(lambda x: x[1])
+ del df['scoring_optimal_window'], df['mutation_barycenter']
+ if 'mutant' in df: del df['mutant']
+ df_wt=df.copy()
+ df_wt['mutated_sequence'] = [target_seq] * num_mutants
+ if indel_mode: # For indels, we set the wild type reference to be always the same (full length) sequence. We assume here that the length is lower than model context size (otherwise "Sliding" mode should be used)
+ df_wt['window_end'] = df_wt['mutated_sequence'].map(lambda x:len(x))
+ df_wt['sliced_mutated_sequence'] = [target_seq[df_wt['window_start'][index]:df_wt['window_end'][index]] for index in range(num_mutants)]
+ df = pd.concat([df,df_wt], axis=0)
+ df = df.drop_duplicates()
+ elif scoring_window=="sliding":
+ num_windows = 1 + int( len_target_seq / model_context_len)
+ df_list=[]
+ start=0
+ for window_index in range(1, num_windows+1):
+ df_sliced = df.copy()
+ df_sliced['sliced_mutated_sequence'] = df_sliced['mutated_sequence'].map(lambda x: x[start:start+model_context_len])
+ df_sliced['window_start'] = [start] * num_mutants
+ df_sliced['window_end'] = df_sliced['mutated_sequence'].map(lambda x: min(len(x), start+model_context_len))
+ df_sliced_wt = df_sliced.copy()
+ df_sliced_wt['mutated_sequence'] = [target_seq] * num_mutants
+ df_sliced_wt['sliced_mutated_sequence'] = df_sliced_wt['mutated_sequence'].map(lambda x: x[start:start+model_context_len])
+ df_sliced_wt['window_end'] = df_sliced_wt['mutated_sequence'].map(lambda x: min(len(x), start+model_context_len)) #Need to adjust end index if WT and sequence are not same full length
+ df_list.append(df_sliced)
+ df_list.append(df_sliced_wt)
+ start += model_context_len
+ df_final = pd.concat(df_list,axis=0)
+ if 'mutant' in df_final: del df_final['mutant']
+ df = df_final.drop_duplicates()
+ return df.reset_index(drop=True)
\ No newline at end of file
diff --git a/proteingym/baselines/tranception/tranception/utils/tokenizers/Basic_tokenizer b/proteingym/baselines/tranception/tranception/utils/tokenizers/Basic_tokenizer
new file mode 100644
index 0000000..b6af745
--- /dev/null
+++ b/proteingym/baselines/tranception/tranception/utils/tokenizers/Basic_tokenizer
@@ -0,0 +1 @@
+{"version":"1.0","truncation":null,"padding":null,"added_tokens":[{"id":0,"special":true,"content":"[UNK]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":1,"special":true,"content":"[CLS]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":2,"special":true,"content":"[SEP]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":3,"special":true,"content":"[PAD]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false},{"id":4,"special":true,"content":"[MASK]","single_word":false,"lstrip":false,"rstrip":false,"normalized":false}],"normalizer":null,"pre_tokenizer":{"type":"Whitespace"},"post_processor":{"type":"TemplateProcessing","single":[{"SpecialToken":{"id":"[CLS]","type_id":0}},{"Sequence":{"id":"A","type_id":0}},{"SpecialToken":{"id":"[SEP]","type_id":0}}],"pair":[{"SpecialToken":{"id":"[CLS]","type_id":0}},{"Sequence":{"id":"A","type_id":0}},{"SpecialToken":{"id":"[SEP]","type_id":0}},{"Sequence":{"id":"B","type_id":1}},{"SpecialToken":{"id":"[SEP]","type_id":1}}],"special_tokens":{"[CLS]":{"id":"[CLS]","ids":[1],"tokens":["[CLS]"]},"[SEP]":{"id":"[SEP]","ids":[2],"tokens":["[SEP]"]}}},"decoder":null,"model":{"type":"BPE","dropout":null,"unk_token":"[UNK]","continuing_subword_prefix":null,"end_of_word_suffix":null,"fuse_unk":false,"vocab":{"[UNK]":0,"[CLS]":1,"[SEP]":2,"[PAD]":3,"[MASK]":4,"A":5,"C":6,"D":7,"E":8,"F":9,"G":10,"H":11,"I":12,"K":13,"L":14,"M":15,"N":16,"P":17,"Q":18,"R":19,"S":20,"T":21,"V":22,"W":23,"Y":24},"merges":[]}}
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/unirep.py b/proteingym/baselines/unirep/unirep.py
new file mode 100644
index 0000000..4f9b67d
--- /dev/null
+++ b/proteingym/baselines/unirep/unirep.py
@@ -0,0 +1,591 @@
+"""
+The trained 1900-dimensional mLSTM babbler.
+Source: https://github.com/churchlab/UniRep/blob/master/unirep.py
+"""
+
+import os
+import tensorflow.compat.v1 as tf
+tf.disable_v2_behavior()
+import tensorflow_addons as tfa
+import tensorflow_probability as tfp
+
+import numpy as np
+import pandas as pd
+import sys
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.unirep.utils import aa_seq_to_int, int_to_aa
+from baselines.unirep.utils.unirep_utils import bucketbatchpad
+
+# Helpers
+def tf_get_shape(tensor):
+ static_shape = tensor.shape.as_list()
+ dynamic_shape = tf.unstack(tf.shape(tensor))
+ dims = [s[1] if s[0] is None else s[0]
+ for s in zip(static_shape, dynamic_shape)]
+ return dims
+
+def sample_with_temp(logits, t):
+ """
+ Takes temperature between 0 and 1 -> zero most conservative, 1 most liberal. Samples.
+ """
+ t_adjusted = logits / t # broadcast temperature normalization
+ softed = tf.nn.softmax(t_adjusted)
+
+ # Make a categorical distribution from the softmax and sample
+ return tfp.distributions.Categorical(probs=softed).sample()
+
+def initialize_uninitialized(sess):
+ """
+ from https://stackoverflow.com/questions/35164529/in-tensorflow-is-there-any-way-to-just-initialize-uninitialised-variables
+ """
+ global_vars = tf.global_variables()
+ is_not_initialized = sess.run([tf.is_variable_initialized(var) for var in global_vars])
+ not_initialized_vars = [v for (v, f) in zip(global_vars, is_not_initialized) if not f]
+ if len(not_initialized_vars):
+ sess.run(tf.variables_initializer(not_initialized_vars))
+
+
+# Setup to initialize from the correctly named model files.
+class mLSTMCell1900(tf.nn.rnn_cell.RNNCell):
+
+ def __init__(self,
+ num_units,
+ model_path="./",
+ wn=True,
+ scope='mlstm',
+ var_device='cpu:0',
+ ):
+ # Really not sure if I should reuse here
+ super(mLSTMCell1900, self).__init__()
+ self._num_units = num_units
+ self._model_path = model_path
+ self._wn = wn
+ self._scope = scope
+ self._var_device = var_device
+
+ @property
+ def state_size(self):
+ # The state is a tuple of c and h
+ return (self._num_units, self._num_units)
+
+ @property
+ def output_size(self):
+ # The output is h
+ return (self._num_units)
+
+ def zero_state(self, batch_size, dtype):
+ c = tf.zeros([batch_size, self._num_units], dtype=dtype)
+ h = tf.zeros([batch_size, self._num_units], dtype=dtype)
+ return (c, h)
+
+ def call(self, inputs, state):
+ # Inputs will be a [batch_size, input_dim] tensor.
+ # Eg, input_dim for a 10-D embedding is 10
+ nin = inputs.get_shape()[1].value
+
+ # Unpack the state tuple
+ c_prev, h_prev = state
+ with tf.variable_scope(self._scope):
+ wx_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_wx:0.npy"))
+ wh_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_wh:0.npy"))
+ wmx_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_wmx:0.npy"))
+ wmh_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_wmh:0.npy"))
+ b_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_b:0.npy"))
+ gx_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_gx:0.npy"))
+ gh_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_gh:0.npy"))
+ gmx_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_gmx:0.npy"))
+ gmh_init = np.load(os.path.join(self._model_path, "rnn_mlstm_mlstm_gmh:0.npy"))
+ wx = tf.get_variable(
+ "wx", initializer=wx_init)
+ wh = tf.get_variable(
+ "wh", initializer=wh_init)
+ wmx = tf.get_variable(
+ "wmx", initializer=wmx_init)
+ wmh = tf.get_variable(
+ "wmh", initializer=wmh_init)
+ b = tf.get_variable(
+ "b", initializer=b_init)
+ if self._wn:
+ gx = tf.get_variable(
+ "gx", initializer=gx_init)
+ gh = tf.get_variable(
+ "gh", initializer=gh_init)
+ gmx = tf.get_variable(
+ "gmx", initializer=gmx_init)
+ gmh = tf.get_variable(
+ "gmh", initializer=gmh_init)
+
+ if self._wn:
+ wx = tf.nn.l2_normalize(wx, axis=0) * gx
+ wh = tf.nn.l2_normalize(wh, axis=0) * gh
+ wmx = tf.nn.l2_normalize(wmx, axis=0) * gmx
+ wmh = tf.nn.l2_normalize(wmh, axis=0) * gmh
+ m = tf.matmul(inputs, wmx) * tf.matmul(h_prev, wmh)
+ z = tf.matmul(inputs, wx) + tf.matmul(m, wh) + b
+ i, f, o, u = tf.split(z, 4, 1)
+ i = tf.nn.sigmoid(i)
+ f = tf.nn.sigmoid(f)
+ o = tf.nn.sigmoid(o)
+ u = tf.tanh(u)
+ c = f * c_prev + i * u
+ h = o * tf.tanh(c)
+ return h, (c, h)
+
+class mLSTMCell(tf.nn.rnn_cell.RNNCell):
+
+ def __init__(self,
+ num_units,
+ wx_init=tf.orthogonal_initializer(),
+ wh_init=tf.orthogonal_initializer(),
+ wmx_init=tf.orthogonal_initializer(),
+ wmh_init=tf.orthogonal_initializer(),
+ b_init=tf.orthogonal_initializer(),
+ gx_init=tf.ones_initializer(),
+ gh_init=tf.ones_initializer(),
+ gmx_init=tf.ones_initializer(),
+ gmh_init=tf.ones_initializer(),
+ wn=True,
+ scope='mlstm',
+ var_device='cpu:0',
+ ):
+ # Really not sure if I should reuse here
+ super(mLSTMCell, self).__init__()
+ self._num_units = num_units
+ self._wn = wn
+ self._scope = scope
+ self._var_device = var_device
+ self._wx_init = wx_init
+ self._wh_init = wh_init
+ self._wmx_init = wmx_init
+ self._wmh_init = wmh_init
+ self._b_init = b_init
+ self._gx_init = gx_init
+ self._gh_init = gh_init
+ self._gmx_init = gmx_init
+ self._gmh_init = gmh_init
+
+ @property
+ def state_size(self):
+ # The state is a tuple of c and h
+ return (self._num_units, self._num_units)
+
+ @property
+ def output_size(self):
+ # The output is h
+ return (self._num_units)
+
+ def zero_state(self, batch_size, dtype):
+ c = tf.zeros([batch_size, self._num_units], dtype=dtype)
+ h = tf.zeros([batch_size, self._num_units], dtype=dtype)
+ return (c, h)
+
+ def call(self, inputs, state):
+ # Inputs will be a [batch_size, input_dim] tensor.
+ # Eg, input_dim for a 10-D embedding is 10
+ nin = inputs.get_shape()[1].value
+
+ # Unpack the state tuple
+ c_prev, h_prev = state
+ with tf.variable_scope(self._scope):
+ wx = tf.get_variable(
+ "wx", initializer=self._wx_init)
+ wh = tf.get_variable(
+ "wh", initializer=self._wh_init)
+ wmx = tf.get_variable(
+ "wmx", initializer=self._wmx_init)
+ wmh = tf.get_variable(
+ "wmh", initializer=self._wmh_init)
+ b = tf.get_variable(
+ "b", initializer=self._b_init)
+ if self._wn:
+ gx = tf.get_variable(
+ "gx", initializer=self._gx_init)
+ gh = tf.get_variable(
+ "gh", initializer=self._gh_init)
+ gmx = tf.get_variable(
+ "gmx", initializer=self._gmx_init)
+ gmh = tf.get_variable(
+ "gmh", initializer=self._gmh_init)
+
+ if self._wn:
+ wx = tf.nn.l2_normalize(wx, dim=0) * gx
+ wh = tf.nn.l2_normalize(wh, dim=0) * gh
+ wmx = tf.nn.l2_normalize(wmx, dim=0) * gmx
+ wmh = tf.nn.l2_normalize(wmh, dim=0) * gmh
+ m = tf.matmul(inputs, wmx) * tf.matmul(h_prev, wmh)
+ z = tf.matmul(inputs, wx) + tf.matmul(m, wh) + b
+ i, f, o, u = tf.split(z, 4, 1)
+ i = tf.nn.sigmoid(i)
+ f = tf.nn.sigmoid(f)
+ o = tf.nn.sigmoid(o)
+ u = tf.tanh(u)
+ c = f * c_prev + i * u
+ h = o * tf.tanh(c)
+ return h, (c, h)
+
+class mLSTMCellStackNPY(tf.nn.rnn_cell.RNNCell):
+
+ def __init__(self,
+ num_units=256,
+ num_layers=4,
+ dropout=None,
+ res_connect=False,
+ wn=True,
+ scope='mlstm_stack',
+ var_device='cpu:0',
+ model_path="./"
+ ):
+ # Really not sure if I should reuse here
+ super(mLSTMCellStackNPY, self).__init__()
+ self._model_path=model_path
+ self._num_units = num_units
+ self._num_layers = num_layers
+ self._dropout = dropout
+ self._res_connect = res_connect
+ self._wn = wn
+ self._scope = scope
+ self._var_device = var_device
+ bs = "rnn_mlstm_stack_mlstm_stack" # base scope see weight file names
+ join = lambda x: os.path.join(self._model_path, x)
+ layers = [mLSTMCell(
+ num_units=self._num_units,
+ wn=self._wn,
+ scope=self._scope + str(i),
+ var_device=self._var_device,
+ wx_init=np.load(join(bs + "{0}_mlstm_stack{1}_wx:0.npy".format(i,i))),
+ wh_init=np.load(join(bs + "{0}_mlstm_stack{1}_wh:0.npy".format(i,i))),
+ wmx_init=np.load(join(bs + "{0}_mlstm_stack{1}_wmx:0.npy".format(i,i))),
+ wmh_init=np.load(join(bs + "{0}_mlstm_stack{1}_wmh:0.npy".format(i,i))),
+ b_init=np.load(join(bs + "{0}_mlstm_stack{1}_b:0.npy".format(i,i))),
+ gx_init=np.load(join(bs + "{0}_mlstm_stack{1}_gx:0.npy".format(i,i))),
+ gh_init=np.load(join(bs + "{0}_mlstm_stack{1}_gh:0.npy".format(i,i))),
+ gmx_init=np.load(join(bs + "{0}_mlstm_stack{1}_gmx:0.npy".format(i,i))),
+ gmh_init=np.load(join(bs + "{0}_mlstm_stack{1}_gmh:0.npy".format(i,i)))
+ ) for i in range(self._num_layers)]
+ if self._dropout:
+ layers = [
+ tf.nn.rnn_cell.DropoutWrapper(
+ layer, output_keep_prob=1-self._dropout) for layer in layers[:-1]] + layers[-1:]
+ self._layers = layers
+
+ @property
+ def state_size(self):
+ # The state is a tuple of c and h
+ return (
+ tuple(self._num_units for _ in range(self._num_layers)),
+ tuple(self._num_units for _ in range(self._num_layers))
+ )
+
+ @property
+ def output_size(self):
+ # The output is h
+ return (self._num_units)
+
+ def zero_state(self, batch_size, dtype):
+ c_stack = tuple(tf.zeros([batch_size, self._num_units], dtype=dtype) for _ in range(self._num_layers))
+ h_stack = tuple(tf.zeros([batch_size, self._num_units], dtype=dtype) for _ in range(self._num_layers))
+ return (c_stack, h_stack)
+
+ def call(self, inputs, state):
+ # Inputs will be a [batch_size, input_dim] tensor.
+ # Eg, input_dim for a 10-D embedding is 10
+
+ # Unpack the state tuple
+ c_prev, h_prev = state
+
+ new_outputs = []
+ new_cs = []
+ new_hs = []
+ for i, layer in enumerate(self._layers):
+ if i == 0:
+ h, (c,h_state) = layer(inputs, (c_prev[i],h_prev[i]))
+ else:
+ h, (c,h_state) = layer(new_outputs[-1], (c_prev[i],h_prev[i]))
+ new_outputs.append(h)
+ new_cs.append(c)
+ new_hs.append(h_state)
+
+ if self._res_connect:
+ # Make sure number of layers does not affect the scale of the output
+ scale_factor = tf.constant(1 / float(self._num_layers))
+ final_output = tf.scalar_mul(scale_factor,tf.add_n(new_outputs))
+ else:
+ final_output = new_outputs[-1]
+
+ return final_output, (tuple(new_cs), tuple(new_hs))
+
+
+class babbler1900():
+
+ def __init__(self,
+ model_path="./pbab_weights",
+ batch_size=256
+ ):
+ self._rnn_size = 1900
+ self._vocab_size = 26
+ self._embed_dim = 10
+ self._wn = True
+ self._shuffle_buffer = 10000
+ self._model_path = model_path
+ self._batch_size = batch_size
+ self._batch_size_placeholder = tf.placeholder(tf.int32, shape=[], name="batch_size")
+ self._minibatch_x_placeholder = tf.placeholder(
+ tf.int32, shape=[None, None], name="minibatch_x")
+ self._initial_state_placeholder = (
+ tf.placeholder(tf.float32, shape=[None, self._rnn_size]),
+ tf.placeholder(tf.float32, shape=[None, self._rnn_size])
+ )
+ self._minibatch_y_placeholder = tf.placeholder(
+ tf.int32, shape=[None, None], name="minibatch_y")
+ # Batch size dimensional placeholder which gives the
+ # Lengths of the input sequence batch. Used to index into
+ # The final_hidden output and select the stop codon -1
+ # final hidden for the graph operation.
+ self._seq_length_placeholder = tf.placeholder(
+ tf.int32, shape=[None], name="seq_len")
+ self._temp_placeholder = tf.placeholder(tf.float32, shape=[], name="temp")
+ rnn = mLSTMCell1900(self._rnn_size, model_path=model_path, wn=self._wn)
+ zero_state = rnn.zero_state(self._batch_size, tf.float32)
+ single_zero = rnn.zero_state(1, tf.float32)
+ mask = tf.sign(self._minibatch_y_placeholder) # 1 for nonpad, zero for pad
+ inverse_mask = 1 - mask # 0 for nonpad, 1 for pad
+
+ total_padded = tf.reduce_sum(inverse_mask)
+
+ pad_adjusted_targets = (self._minibatch_y_placeholder - 1) + inverse_mask
+
+ embed_matrix = tf.get_variable(
+ "embed_matrix", dtype=tf.float32,
+ initializer=np.load(os.path.join(self._model_path, "embed_matrix:0.npy"))
+ )
+ embed_cell = tf.nn.embedding_lookup(embed_matrix, self._minibatch_x_placeholder)
+ self._output, self._final_state = tf.nn.dynamic_rnn(
+ rnn,
+ embed_cell,
+ initial_state=self._initial_state_placeholder,
+ swap_memory=True,
+ parallel_iterations=1
+ )
+
+ # If we are training a model on top of the rep model, we need to access
+ # the final_hidden rep from output. Recall we are padding these sequences
+ # to max length, so the -1 position will not necessarily be the right rep.
+ # to get the right rep, I will use the provided sequence length to index.
+ # Subtract one for the last place
+ indices = self._seq_length_placeholder - 1
+ self._top_final_hidden = tf.gather_nd(self._output,
+ tf.stack([tf.range(tf_get_shape(self._output)[0],
+ dtype=tf.int32), indices], axis=1))
+ fmask = tf.cast(mask, tf.float32)[:, :, None]
+ self._avg_hidden = tf.reduce_sum(fmask * self._output,
+ axis=1) / tf.reduce_sum(fmask, axis=1)
+ # LEFTOFF self._output is a batch size, seq_len, num_hidden.
+ # I want to average along num_hidden, but I'll have to figure out how to mask out
+ # the dimensions along sequence_length which are longer than the given sequence.
+ flat = tf.reshape(self._output, [-1, self._rnn_size])
+ if os.path.exists(os.path.join(self._model_path, "fully_connected_weights:0.npy")):
+ weights_name="fully_connected_weights"
+ bias_name="fully_connected_biases"
+ else:
+ weights_name="dense_kernel"
+ bias_name="dense_bias"
+ weights_init = tf.constant_initializer(
+ np.load(os.path.join(self._model_path, f"{weights_name}:0.npy")))
+ bias_init = tf.constant_initializer(
+ np.load(os.path.join(self._model_path, f"{bias_name}:0.npy")))
+ self.dense_layer = tf.keras.layers.Dense(self._vocab_size-1,
+ activation=None, kernel_initializer=weights_init,
+ bias_initializer=bias_init)
+ logits_flat = self.dense_layer(flat)
+ seqlen = tf_get_shape(self._minibatch_x_placeholder)[1]
+ self._logits = tf.reshape(
+ logits_flat, [batch_size, seqlen, self._vocab_size-1])
+ self.batch_losses = tfa.seq2seq.sequence_loss(
+ self._logits,
+ tf.cast(pad_adjusted_targets, tf.int32),
+ tf.cast(mask, tf.float32),
+ average_across_batch=False
+ )
+ self._loss = tf.reduce_mean(self.batch_losses)
+ self._sample = sample_with_temp(self._logits, self._temp_placeholder)
+ with tf.Session() as sess:
+ self._zero_state = sess.run(zero_state)
+ self._single_zero = sess.run(single_zero)
+
+
+ def get_rep(self,seq):
+ """
+ Input a valid amino acid sequence,
+ outputs a tuple of average hidden, final hidden, final cell representation arrays.
+ Unfortunately, this method accepts one sequence at a time and is as such quite
+ slow.
+ """
+ with tf.Session() as sess:
+ initialize_uninitialized(sess)
+ # Strip any whitespace and convert to integers with the correct coding
+ int_seq = aa_seq_to_int(seq.strip())[:-1]
+ # Final state is a cell_state, hidden_state tuple. Output is
+ # all hidden states
+ final_state_, hs = sess.run(
+ [self._final_state, self._output], feed_dict={
+ self._batch_size_placeholder: 1,
+ self._minibatch_x_placeholder: [int_seq],
+ self._initial_state_placeholder: self._zero_state}
+ )
+
+ final_cell, final_hidden = final_state_
+ # Drop the batch dimension so it is just seq len by
+ # representation size
+ final_cell = final_cell[0]
+ final_hidden = final_hidden[0]
+ hs = hs[0]
+ avg_hidden = np.mean(hs, axis=0)
+ return avg_hidden, final_hidden, final_cell
+
+ def get_babble(self, seed, length=250, temp=1):
+ """
+ Return a babble at temperature temp (on (0,1] with 1 being the noisiest)
+ starting with seed and continuing to length length.
+ Unfortunately, this method accepts one sequence at a time and is as such quite
+ slow.
+ """
+ with tf.Session() as sess:
+ initialize_uninitialized(sess)
+ int_seed = aa_seq_to_int(seed.strip())[:-1]
+
+ # No need for padding because this is a single element
+ seed_samples, final_state_ = sess.run(
+ [self._sample, self._final_state],
+ feed_dict={
+ self._minibatch_x_placeholder: [int_seed],
+ self._initial_state_placeholder: self._zero_state,
+ self._batch_size_placeholder: 1,
+ self._temp_placeholder: temp
+ }
+ )
+ # Just the actual character prediction
+ pred_int = seed_samples[0, -1] + 1
+ seed = seed + int_to_aa[pred_int]
+
+ for i in range(length - len(seed)):
+ pred_int, final_state_ = sess.run(
+ [self._sample, self._final_state],
+ feed_dict={
+ self._minibatch_x_placeholder: [[pred_int]],
+ self._initial_state_placeholder: final_state_,
+ self._batch_size_placeholder: 1,
+ self._temp_placeholder: temp
+ }
+ )
+ pred_int = pred_int[0, 0] + 1
+ seed = seed + int_to_aa[pred_int]
+ return seed
+
+ def get_rep_ops(self):
+ """
+ Return tensorflow operations for the final_hidden state and placeholder.
+ """
+ return self._top_final_hidden, self._avg_hidden, self._minibatch_x_placeholder, self._batch_size_placeholder, self._seq_length_placeholder, self._initial_state_placeholder
+
+ def get_babbler_ops(self):
+ """
+ Return tensorflow operations for
+ the logits, masked loss, minibatch_x placeholder, minibatch y placeholder, batch_size placeholder, initial_state placeholder
+ Use if you plan on using babbler1900 as an initialization for another babbler,
+ eg for fine tuning the babbler to babble a differenct distribution.
+ """
+ return self._logits, self._loss, self._minibatch_x_placeholder, self._minibatch_y_placeholder, self._batch_size_placeholder, self._initial_state_placeholder
+
+ def dump_weights(self,sess,dir_name="./1900_weights"):
+ """
+ Saves the weights of the model in dir_name in the format required
+ for loading in this module. Must be called within a tf.Session
+ For which the weights are already initialized.
+ """
+ vs = tf.trainable_variables()
+ for v in vs:
+ name = v.name
+ value = sess.run(v)
+ np.save(os.path.join(dir_name,name.replace('/', '_') + ".npy"), np.array(value))
+
+
+
+ def format_seq(self,seq,stop=False):
+ """
+ Takes an amino acid sequence, returns a list of integers in the codex of the babbler.
+ Here, the default is to strip the stop symbol (stop=False) which would have
+ otherwise been added to the end of the sequence. If you are trying to generate
+ a rep, do not include the stop. It is probably best to ignore the stop if you are
+ co-tuning the babbler and a top model as well.
+ """
+ if stop:
+ int_seq = aa_seq_to_int(seq.strip())
+ else:
+ int_seq = aa_seq_to_int(seq.strip())[:-1]
+ return int_seq
+
+
+ def bucket_batch_pad(self,filepath, upper=2000, lower=50, interval=10):
+ """
+ Read sequences from a filepath, batch them into buckets of similar lengths, and
+ pad out to the longest sequence.
+ Upper, lower and interval define how the buckets are created.
+ Any sequence shorter than lower will be grouped together, as with any greater
+ than upper. Interval defines the "walls" of all the other buckets.
+ WARNING: Define large intervals for small datasets because the default behavior
+ is to repeat the same sequence to fill a batch. If there is only one sequence
+ within a bucket, it will be repeated batch_size -1 times to fill the batch.
+ """
+ self._bucket_upper = upper
+ self._bucket_lower = lower
+ self._bucket_interval = interval
+ self._bucket = [self._bucket_lower + (i * self._bucket_interval) for i in range(int(self._bucket_upper / self._bucket_interval))]
+ self._bucket_batch = bucketbatchpad(
+ batch_size=self._batch_size,
+ pad_shape=([None]),
+ window_size=self._batch_size,
+ bounds=self._bucket,
+ path_to_data=filepath,
+ shuffle_buffer=self._shuffle_buffer,
+ repeat=None
+ ).make_one_shot_iterator().get_next()
+ return self._bucket_batch
+
+ def split_to_tuple(self, seq_batch):
+ """
+ NOTICE THAT BY DEFAULT THIS STRIPS THE LAST CHARACTER.
+ IF USING IN COMBINATION WITH format_seq then set stop=True there.
+ Return a list of batch, target tuples.
+ The input (array-like) should
+ look like
+ 1. . . . . . . . sequence_length
+ .
+ .
+ .
+ batch_size
+ """
+ q = None
+ num_steps = seq_batch.shape[1]
+ # Minibatches should start at zero index and go to -1
+ # Don't even try to get what is happenning here its a brainfuck and
+ # probably inefficient
+ xypairs = [
+ (seq_batch[:, :-1][:, idx:idx + num_steps], seq_batch[:, idx + 1:idx + num_steps + 1]) for idx in np.arange(len(seq_batch[0]))[0:-1:num_steps]
+ ]
+ if q:
+ for e in xypairs:
+ q.put(e)
+ else:
+ return xypairs[0]
+
+ def is_valid_seq(self, seq, max_len=4000):
+ """
+ True if seq is valid for the babbler, False otherwise.
+ """
+ l = len(seq)
+ ## important: added X here, not in original unirep code
+ valid_aas = "MRHKDESTNQCUGPAVIFYWLOX"
+ if (l <= max_len) and set(seq) <= set(valid_aas):
+ return True
+ else:
+ return False
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/unirep_evotune.py b/proteingym/baselines/unirep/unirep_evotune.py
new file mode 100644
index 0000000..5812f1c
--- /dev/null
+++ b/proteingym/baselines/unirep/unirep_evotune.py
@@ -0,0 +1,156 @@
+'''
+Unsupervised fine-tuning of UniRep on evolutionary data, "evo-tuning"
+Source: https://github.com/chloechsu/combining-evolutionary-and-assay-labelled-data/blob/main/src/unirep_evotune.py
+'''
+
+import argparse
+import os
+import sys
+import pathlib
+import random
+
+import numpy as np
+import pandas as pd
+from sklearn.model_selection import train_test_split
+import tensorflow.compat.v1 as tf
+tf.disable_v2_behavior()
+# append to path the parent directory two levels up
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.unirep.unirep import babbler1900 as babbler
+from baselines.unirep import utils
+
+
+def main():
+ parser = argparse.ArgumentParser()
+ parser.add_argument('--seqs_fasta_path', type=pathlib.Path)
+ parser.add_argument('--save_weights_dir', type=pathlib.Path)
+ parser.add_argument('--mapping_path', type=str)
+ parser.add_argument('--DMS_index', type=int)
+ parser.add_argument('--initial_weights_dir', type=pathlib.Path, default="weights/unirep/global")
+ parser.add_argument('--batch_size', type=int, default=128)
+ parser.add_argument('--max_seq_len', type=int)
+ parser.add_argument('--num_steps', type=int, default=10000)
+ parser.add_argument('--learning_rate', type=float, default=0.00001)
+ args = parser.parse_args()
+
+ # Set seeds
+ tf.set_random_seed(0)
+ np.random.seed(0)
+
+ print("Num GPUs Available: ",len(tf.config.experimental.list_physical_devices('GPU')))
+
+ mapping = pd.read_csv(args.mapping_path)
+ list_DMS = mapping["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ print("Fine tuning Unirep params for: {}".format(DMS_id))
+ DMS_file_name = mapping["DMS_filename"][mapping["DMS_id"]==DMS_id].values[0]
+ MSA_file_name = mapping["MSA_filename"][mapping["DMS_id"]==DMS_id].values[0]
+ if args.max_seq_len is None:
+ args.max_seq_len = mapping["seq_len"][mapping["DMS_id"]==DMS_id].values[0]
+ print("Max seq len: {}".format(args.max_seq_len))
+
+ #Adjust number of training steps to match teh 65 epochs in the paper
+ MSA_num_seqs = mapping["MSA_num_seqs"][mapping["DMS_id"]==DMS_id].values[0]
+ args.num_steps = min(args.num_steps, int(65*MSA_num_seqs/args.batch_size))
+ print("Training for {} steps".format(args.num_steps))
+
+ # args.save_weights_dir = pathlib.Path(str(args.save_weights_dir) + os.sep + DMS_file_name.split('.csv')[0])
+ # Now saving under MSA filename, as several assays share alignments
+ args.save_weights_dir = pathlib.Path(str(args.save_weights_dir) + os.sep + MSA_file_name.split(".a2m")[0])
+
+
+ # Load pre-trained models
+ # Sync relevant weight files
+ # !aws s3 sync --no-sign-request --quiet s3://unirep-public/1900_weights/ 1900_weights/
+ # Import the mLSTM babbler model
+ # Where model weights are stored.
+ b = babbler(batch_size=args.batch_size, model_path=args.initial_weights_dir)
+
+ # Load seqs from fasta.
+ seqs_all = utils.read_fasta(str(args.seqs_fasta_path)+os.sep+MSA_file_name)
+ seqs = dict()
+ seqs['train'], seqs['val'] = train_test_split(seqs_all, test_size=0.2)
+
+ #print(seqs['train'])
+ bucket_ops = {
+ 'train': None,
+ 'val': None,
+ }
+ for mode in ['train', 'val']:
+ prefix = (str(args.seqs_fasta_path)+os.sep+MSA_file_name).replace('.a2m', '')
+ formatted_seqs_path = prefix + f'_{mode}_formatted.txt'
+ with open(formatted_seqs_path, "w") as destination:
+ for i,seq in enumerate(seqs[mode]):
+ seq = seq.upper().replace('-', 'X')
+ seq = seq.replace('.', 'X')
+ if b.is_valid_seq(seq, max_len=10 * args.max_seq_len):
+ if len(seq) > args.max_seq_len:
+ sample_start = random.randint(0,len(seq)-args.max_seq_len)
+ seq = seq[sample_start:sample_start+args.max_seq_len]
+ formatted = ",".join(map(str,b.format_seq(seq)))
+ destination.write(formatted)
+ destination.write('\n')
+ bucket_ops[mode] = b.bucket_batch_pad(formatted_seqs_path,
+ lower=100, upper=args.max_seq_len, interval=50)
+
+ logits, seqloss, x_ph, y_ph, batch_size_ph, initial_state_ph = (
+ b.get_babbler_ops())
+ optimizer = tf.train.AdamOptimizer(args.learning_rate)
+ tuning_op = optimizer.minimize(seqloss)
+
+ args.save_weights_dir.mkdir(parents=True, exist_ok=True)
+
+ train_loss = np.zeros(args.num_steps)
+ val_loss = np.zeros(args.num_steps)
+ with tf.Session() as sess:
+ sess.run(tf.global_variables_initializer())
+ sess.graph.finalize()
+ for i in range(args.num_steps):
+ print(f"Step {i}")
+ batch_train = sess.run(bucket_ops['train'])
+ train_loss[i], __, = sess.run([seqloss, tuning_op],
+ feed_dict={
+ x_ph: batch_train[:, :-1],
+ y_ph: batch_train[:, 1:],
+ batch_size_ph: args.batch_size,
+ initial_state_ph:b._zero_state
+ },
+ )
+ batch_val = sess.run(bucket_ops['val'])
+ val_loss[i] = sess.run(seqloss,
+ feed_dict={
+ x_ph: batch_val[:, :-1],
+ y_ph: batch_val[:, 1:],
+ batch_size_ph: args.batch_size,
+ initial_state_ph:b._zero_state
+ },
+ )
+ print("Step {0}: {1} (train), {2} (val)".format(
+ i, train_loss[i], val_loss[i]))
+ # Save periodically
+ if i % 1000 == 0 and i > 0:
+ suffix = f'_{int(i / 1000)}k'
+ savedir = os.path.join(args.save_weights_dir, suffix)
+ pathlib.Path(savedir).mkdir(exist_ok=True)
+ # Save weights
+ b.dump_weights(sess, dir_name=savedir)
+ # Save loss trajectories
+ np.savetxt(
+ os.path.join(args.save_weights_dir, 'loss_trajectory_train.npy'),
+ train_loss)
+ np.savetxt(
+ os.path.join(args.save_weights_dir, 'loss_trajectory_val.npy'),
+ val_loss)
+ # Save final weights
+ b.dump_weights(sess, dir_name=args.save_weights_dir)
+ # Save loss trajectories
+ np.savetxt(
+ os.path.join(args.save_weights_dir, 'loss_trajectory_train.npy'),
+ train_loss)
+ np.savetxt(
+ os.path.join(args.save_weights_dir, 'loss_trajectory_val.npy'),
+ val_loss)
+
+
+if __name__ == "__main__":
+ main()
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/unirep_inference.py b/proteingym/baselines/unirep/unirep_inference.py
new file mode 100644
index 0000000..37371d7
--- /dev/null
+++ b/proteingym/baselines/unirep/unirep_inference.py
@@ -0,0 +1,116 @@
+'''
+Infers log-likelihoods from UniRep models.
+Source: https://github.com/chloechsu/combining-evolutionary-and-assay-labelled-data/blob/main/src/unirep_inference.py
+'''
+
+import argparse
+import os
+import sys
+import pathlib
+
+import numpy as np
+import pandas as pd
+import tensorflow as tf
+sys.path.append(os.path.dirname(os.path.dirname(os.path.dirname(os.path.abspath(__file__)))))
+from baselines.unirep.unirep import babbler1900
+from baselines.unirep.utils import load_and_filter_seqs, save, format_batch_seqs, nonpad_len
+
+def run_inference(seqs, model_weight_path, output_dir, output_filename=None,
+ batch_size=64, save_hidden=False):
+ if len(seqs) < batch_size:
+ batch_size = len(seqs)
+ babbler_class = babbler1900
+ # Load model weights
+ b = babbler_class(batch_size=batch_size, model_path=model_weight_path)
+ # Load ops
+ final_hidden_op, avg_hidden_op, x_ph, batch_size_ph, seq_len_ph, init_state_ph = b.get_rep_ops()
+ logits_op, loss_op, x_ph, y_ph, batch_size_ph, init_state_ph = b.get_babbler_ops()
+ batch_loss_op = b.batch_losses
+
+ final_hidden_vals = []
+ avg_hidden_vals = []
+ loss_vals = []
+ with tf.compat.v1.Session() as sess:
+ sess.run(tf.compat.v1.global_variables_initializer())
+ n_batches = int(len(seqs) / batch_size)
+ leftover = len(seqs) % batch_size
+ n_batches += int(bool(leftover))
+ for i in range(n_batches):
+ print('----Running inference for batch # %d------' % i)
+ if i == n_batches - 1:
+ batch_seqs = seqs[-batch_size:]
+ else:
+ batch_seqs = seqs[i*batch_size:(i+1)*batch_size]
+ batch_seqs = [seq.replace('-', 'X') for seq in batch_seqs]
+ batch = format_batch_seqs(batch_seqs)
+ length = nonpad_len(batch)
+ # Run final hidden op
+ avg_hidden_, loss_ = sess.run(
+ [avg_hidden_op, batch_loss_op],
+ feed_dict={
+ # Important! Shift input and expected target by 1.
+ x_ph: batch[:, :-1],
+ y_ph: batch[:, 1:],
+ batch_size_ph: batch.shape[0],
+ seq_len_ph: length,
+ init_state_ph:b._zero_state
+ })
+ if i == n_batches - 1:
+ loss_vals.append(loss_[-leftover:])
+ if save_hidden:
+ avg_hidden_vals.append(avg_hidden_[-leftover:])
+ else:
+ loss_vals.append(loss_)
+ if save_hidden:
+ avg_hidden_vals.append(avg_hidden_)
+
+ loss_vals = np.concatenate(loss_vals, axis=0)
+ if output_filename is not None:
+ output_df=pd.DataFrame({'mutated_sequence':seqs,
+ 'Unirep_score':loss_vals})
+ output_df.to_csv(output_dir+os.sep+output_filename+'.csv',index=False)
+ #else:
+ # loss_filename = os.path.join(output_dir, f'loss.npy')
+ # save(loss_filename, loss_vals)
+
+ #if save_hidden:
+ # avg_hidden_vals = np.concatenate(avg_hidden_vals, axis=0)
+ # avg_hidden_filename = os.path.join(output_dir, f'avg_hidden.npy')
+ # save(avg_hidden_filename, avg_hidden_vals)
+
+ print('Ran inference on %d sequences. Saved results to %s.' %
+ (len(seqs), output_dir))
+
+
+if __name__ == '__main__':
+ parser = argparse.ArgumentParser()
+ parser.add_argument('--model_path', type=str)
+ parser.add_argument('--data_path', type=str)
+ parser.add_argument('--mapping_path', type=str)
+ parser.add_argument('--output_dir', type=str)
+ parser.add_argument('--DMS_index', type=int)
+ parser.add_argument('--batch_size', type=int, default=64)
+ parser.add_argument('--evotune', action='store_true')
+ parser.add_argument('--save_hidden', dest='save_hidden', action='store_true')
+ args = parser.parse_args()
+
+ pathlib.Path(args.output_dir).mkdir(parents=True, exist_ok=True)
+
+ mapping = pd.read_csv(args.mapping_path)
+ list_DMS = mapping["DMS_id"]
+ DMS_id=list_DMS[args.DMS_index]
+ DMS_file_name = mapping["DMS_filename"][mapping["DMS_id"]==DMS_id].values[0]
+ MSA_file_name = mapping["MSA_filename"][mapping["DMS_id"]==DMS_id].values[0]
+ if args.evotune:
+ args.model_path = args.model_path + os.sep + MSA_file_name.split('.a2m')[0]
+ print("Computing scores for: {} with Unirep {}".format(DMS_id, args.model_path))
+
+
+ seqs = load_and_filter_seqs(args.data_path + os.sep + DMS_file_name)
+ #np.savetxt(os.path.join(args.output_dir, 'seqs.npy'), seqs, '%s')
+
+ run_inference(seqs, args.model_path,
+ args.output_dir,
+ output_filename=DMS_id,
+ batch_size=args.batch_size,
+ save_hidden=args.save_hidden)
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/__init__.py b/proteingym/baselines/unirep/utils/__init__.py
new file mode 100644
index 0000000..d60f95e
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/__init__.py
@@ -0,0 +1,8 @@
+from utils.data_utils import *
+from utils.experiment_utils import *
+from utils.io_utils import *
+from utils.metric_utils import *
+from utils.plot_utils import *
+#from utils.notebook_utils import *
+#from utils.esm_utils import *
+#from utils.georgiev_utils import *
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/data_utils.py b/proteingym/baselines/unirep/utils/data_utils.py
new file mode 100644
index 0000000..85406c0
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/data_utils.py
@@ -0,0 +1,238 @@
+"""
+Utilities for data processing.
+"""
+import numpy as np
+import os
+from sklearn.model_selection import train_test_split
+
+
+"""
+File formatting note.
+Data should be preprocessed as a sequence of comma-seperated ints with
+sequences /n seperated
+"""
+
+# Lookup tables
+aa_to_int = {
+ 'M':1,
+ 'R':2,
+ 'H':3,
+ 'K':4,
+ 'D':5,
+ 'E':6,
+ 'S':7,
+ 'T':8,
+ 'N':9,
+ 'Q':10, 'C':11,
+ 'U':12,
+ 'G':13,
+ 'P':14,
+ 'A':15,
+ 'V':16,
+ 'I':17,
+ 'F':18,
+ 'Y':19,
+ 'W':20,
+ 'L':21,
+ 'O':22, #Pyrrolysine
+ 'X':23, # Unknown
+ 'Z':23, # Glutamic acid or GLutamine
+ 'B':23, # Asparagine or aspartic acid
+ 'J':23, # Leucine or isoleucine
+ 'start':24,
+ 'stop':25,
+ '-':26,
+}
+
+int_to_aa = {value:key for key, value in aa_to_int.items()}
+
+def get_aa_to_int():
+ """
+ Get the lookup table (for easy import)
+ """
+ return aa_to_int
+
+def get_int_to_aa():
+ """
+ Get the lookup table (for easy import)
+ """
+ return int_to_aa
+
+# Helper functions
+
+def aa_seq_to_int(s):
+ """
+ Return the int sequence as a list for a given string of amino acids
+ """
+ return [24] + [aa_to_int[a] for a in s] + [25]
+
+def int_seq_to_aa(s):
+ """
+ Return the int sequence as a list for a given string of amino acids
+ """
+ return "".join([int_to_aa[i] for i in s])
+
+
+def nonpad_len(batch):
+ nonzero = batch > 0
+ lengths = np.sum(nonzero, axis=1)
+ return lengths
+
+
+def format_seq(seq,stop=False):
+ """
+ Takes an amino acid sequence, returns a list of integers in the codex of the babbler.
+ Here, the default is to strip the stop symbol (stop=False) which would have
+ otherwise been added to the end of the sequence. If you are trying to generate
+ a rep, do not include the stop. It is probably best to ignore the stop if you are
+ co-tuning the babbler and a top model as well.
+ """
+ if stop:
+ int_seq = aa_seq_to_int(seq.strip())
+ else:
+ int_seq = aa_seq_to_int(seq.strip())[:-1]
+ return int_seq
+
+
+def format_batch_seqs(seqs):
+ maxlen = -1
+ for s in seqs:
+ if len(s) > maxlen:
+ maxlen = len(s)
+ formatted = []
+ for seq in seqs:
+ pad_len = maxlen - len(seq)
+ padded = np.pad(format_seq(seq), (0, pad_len), 'constant', constant_values=0)
+ formatted.append(padded)
+ return np.stack(formatted)
+
+
+def is_valid_seq(seq, max_len=4000):
+ """
+ True if seq is valid for the babbler, False otherwise.
+ """
+ l = len(seq)
+ valid_aas = "MRHKDESTNQCUGPAVIFYWLO"
+ if (l <= max_len) and set(seq) <= set(valid_aas):
+ return True
+ else:
+ return False
+
+
+def seqs_to_onehot(seqs):
+ seqs = format_batch_seqs(seqs)
+ X = np.zeros((seqs.shape[0], seqs.shape[1]*24), dtype=int)
+ for i in range(seqs.shape[1]):
+ for j in range(24):
+ X[:, i*24+j] = (seqs[:, i] == j)
+ return X
+
+
+def seqs_to_binary_onehot(seqs, wt):
+ seqs = np.array([list(s) for s in seqs])
+ X = np.zeros((seqs.shape[0], seqs.shape[1]), dtype=int)
+ for i in range(seqs.shape[1]):
+ X[:, i] = (seqs[:, i] != wt[i])
+ return X
+
+
+def dict2str(d):
+ return ';'.join([f'{k}={v}' for k, v in d.items()])
+
+
+def seq2mutation(seq, model, return_str=False, ignore_gaps=False,
+ sep=":", offset=1):
+ mutations = []
+ for pf, pm in model.index_map.items():
+ if seq[pf-offset] != model.target_seq[pm]:
+ if ignore_gaps and (
+ seq[pf-offset] == '-' or seq[pf-offset] not in model.alphabet):
+ continue
+ mutations.append((pf, model.target_seq[pm], seq[pf-offset]))
+ if return_str:
+ return sep.join([m[1] + str(m[0]) + m[2] for m in mutations])
+ return mutations
+
+
+def seq2mutation_fromwt(seq, wt, ignore_gaps=False, sep=':', offset=1,
+ focus_only=True):
+ mutations = []
+ for i in range(offset, offset+len(seq)):
+ if ignore_gaps and ( seq[i-offset] == '-'):
+ continue
+ if wt[i-offset].islower() and focus_only:
+ continue
+ if seq[i-offset].upper() != wt[i-offset].upper():
+ mutations.append((i, wt[i-offset].upper(), seq[i-offset].upper()))
+ return mutations
+
+
+def seqs2subs(seqs, wt, ignore_gaps=False):
+ pos = []
+ subs = []
+ for s in seqs:
+ p = []
+ su = []
+ for j in range(len(wt)):
+ if s[j] != wt[j]:
+ if ignore_gaps and (s[j] == '-' or s[j] == 'X'):
+ continue
+ p.append(j)
+ su.append(s[j])
+ pos.append(np.array(p))
+ subs.append(np.array(su))
+ return pos, subs
+
+
+def seq2effect(seqs, model, offset=1, ignore_gaps=False):
+ effects = np.zeros(len(seqs))
+ for i in range(len(seqs)):
+ mutations = seq2mutation(seqs[i], model,
+ ignore_gaps=ignore_gaps, offset=offset)
+ dE, _, _ = model.delta_hamiltonian(mutations)
+ effects[i] = dE
+ return effects
+
+
+def mutant2seq(mut, wt, offset):
+ if mut.upper() == 'WT':
+ return wt
+ chars = list(wt)
+ mut = mut.replace(':', ',')
+ mut = mut.replace(';', ',')
+ for m in mut.split(','):
+ idx = int(m[1:-1])-offset
+ assert wt[idx] == m[0]
+ chars[idx] = m[-1]
+ return ''.join(chars)
+
+def get_blosum_scores(seqs, wt, matrix):
+ scores = np.zeros(len(seqs))
+ wt_score = 0
+ for j in range(len(wt)):
+ wt_score += matrix[wt[j], wt[j]]
+ for i, s in enumerate(seqs):
+ for j in range(len(wt)):
+ if s[j] not in matrix.alphabet:
+ print(f'unexpected AA {s[j]} (seq {i}, pos {j})')
+ scores[i] += matrix[wt[j], s[j]]
+ return scores - wt_score
+
+
+def get_wt_seq(mutation_descriptions):
+ wt_len = 0
+ for m in mutation_descriptions:
+ if m == 'WT':
+ continue
+ if int(m[1:-1]) > wt_len:
+ wt_len = int(m[1:-1])
+ wt = ['?' for _ in range(wt_len)]
+ for m in mutation_descriptions:
+ if m == 'WT':
+ continue
+ idx, wt_char = int(m[1:-1])-1, m[0] # 1-index to 0-index
+ if wt[idx] == '?':
+ wt[idx] = wt_char
+ else:
+ assert wt[idx] == wt_char
+ return ''.join(wt), wt_len
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/experiment_utils.py b/proteingym/baselines/unirep/utils/experiment_utils.py
new file mode 100644
index 0000000..c5498e4
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/experiment_utils.py
@@ -0,0 +1,132 @@
+import numpy as np
+import pandas as pd
+from sklearn.linear_model import Ridge
+from sklearn import svm
+
+from utils.metric_utils import get_spearman_fractions, wt_improvement_metric, topk_median
+
+
+def test_regression_multiseeds(X, y, n_train, n_seeds, y_wt, y_cutoff=None,
+ mutation_counts=None, mutation_count_cutoff=None):
+ spm = np.zeros((n_seeds, len(SPEARMAN_FRACTIONS)))
+ r2 = np.zeros(n_seeds)
+ wt_imprv = np.zeros(n_seeds)
+ topk_med = np.zeros(n_seeds)
+ best_alpha = np.zeros(n_seeds)
+ for i in range(n_seeds):
+ spm[i], r2[i], wt_imprv[i], topk_med[i], best_alpha[i] = test_regression(
+ X, y, n_train, y_wt, y_cutoff, i, mutation_counts,
+ mutation_count_cutoff)
+ df = pd.DataFrame({
+ 'R2 score': r2,
+ 'Improvement over WT': wt_imprv,
+ 'Top K median': topk_med,
+ 'N train': n_train,
+ 'Best alpha': best_alpha,
+ })
+ for i, f in enumerate(SPEARMAN_FRACTIONS):
+ df[f'Spearman correlation at {f:.1f}'] = spm[:, i]
+ df['Spearman correlation'] = spm[:, -1]
+ return df
+
+
+def test_regression(X, y, n_train, y_wt, y_cutoff=None, seed=0,
+ mutation_counts=None, mutation_count_cutoff=None):
+ if y_cutoff is not None:
+ is_valid = (y >= y_cutoff)
+ X, y = X[is_valid], y[is_valid]
+ if mutation_counts is not None:
+ mutation_counts = mutation_counts[is_valid]
+ X_tr, X_eval, X_test, y_tr, y_eval, y_test = train_eval_test_split(
+ X, y, seed, n_train, mutation_counts, mutation_count_cutoff)
+ # Model selection.
+ best_alpha = None
+ best_spm = -999.9
+ for alpha in [0.001, 0.01, 0.1, 0.2, 0.3, 0.5, 1.0, 2.0]:
+ model = Ridge(alpha=alpha)
+ model.fit(X_tr, y_tr)
+ y_pred = model.predict(X_eval)
+ spm = spearmanr(y_pred, y_eval).correlation
+ if spm > best_spm:
+ best_alpha = alpha
+ best_spm = spm
+ model = Ridge(alpha=best_alpha)
+ model.fit(X_tr, y_tr)
+ y_pred = model.predict(X_test)
+ spm = get_spearman_fractions(y_pred, y_test)
+ r2 = model.score(X_test, y_test)
+ wt_imprv = wt_improvement_metric(y_pred, y_test, y_wt)
+ topk_med = topk_median(y_pred, y_test)
+ return spm, r2, wt_imprv, topk_med, best_alpha
+
+
+def test_classification_multiseeds(X, y, n_train, n_seeds, y_cutoff=None,
+ mutation_counts=None, mutation_count_cutoff=None):
+ acc = np.zeros(n_seeds)
+ best_C = np.zeros(n_seeds)
+ for i in range(n_seeds):
+ acc[i], best_C[i] = test_classification(
+ X, y, n_train, y_cutoff, i, mutation_counts,
+ mutation_count_cutoff)
+ return acc, best_C
+
+
+def test_classification(X, y, n_train, y_cutoff, seed=0,
+ mutation_counts=None, mutation_count_cutoff=None):
+ y = (y > y_cutoff).astype(int)
+ X_tr, X_eval, X_test, y_tr, y_eval, y_test = train_eval_test_split(
+ X, y, seed, n_train, mutation_counts, mutation_count_cutoff)
+ while len(np.unique(y_tr)) < 2:
+ X_tr, X_eval, X_test, y_tr, y_eval, y_test = train_eval_test_split(
+ X, y, seed+np.random.randint(10000), n_train, mutation_counts,
+ mutation_count_cutoff)
+ best_C = None
+ best_acc = -999.9
+ for C in [0.01, 0.1, 0.5, 1.0, 2.0]:
+ model = svm.LinearSVC(C=C)
+ model.fit(X_tr, y_tr)
+ acc = model.score(X_eval, y_eval)
+ if acc > best_acc:
+ best_C = C
+ best_acc = acc
+ model = svm.LinearSVC(C=best_C)
+ model.fit(X_tr, y_tr)
+ return model.score(X_test, y_test), best_C
+
+
+def run_regression(feature_reps, y, y_wt, y_cutoff, n_seeds,
+ mutation_counts=None, mutation_count_cutoff=None):
+ results = pd.DataFrame()
+ for feature_rep, X in feature_reps.items():
+ print('Staring runs for', feature_rep)
+ for n_train in [8, 24, 96, 192, 480, 960, 9600]:
+ if n_train >= 0.8 * X.shape[0]:
+ continue
+ print('n_train:', n_train)
+ df = test_regression_multiseeds(X, y, n_train, n_seeds, y_wt,
+ y_cutoff, mutation_counts, mutation_count_cutoff)
+ df['Feature rep'] = feature_rep
+ results = pd.concat([results, df], axis=0)
+ return results
+
+
+def run_classification(feature_reps, y, y_cutoff, n_seeds,
+ mutation_counts=None, mutation_count_cutoff=None):
+ results = pd.DataFrame()
+ for feature_rep, X in feature_reps.items():
+ print('Staring runs for', feature_rep)
+ for n_train in [8, 24, 96, 192, 480, 960]:
+ if n_train >= 0.8 * X.shape[0]:
+ continue
+ print('n_train:', n_train)
+ acc, best_C = test_classification_multiseeds(
+ X, y, n_train, n_seeds, y_cutoff, mutation_counts,
+ mutation_count_cutoff)
+ df = pd.DataFrame({
+ 'Accuracy': acc,
+ 'Best reg coeff': best_C,
+ 'N train': n_train,
+ 'Feature rep': feature_rep,
+ })
+ results = pd.concat([results, df], axis=0)
+ return results
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/io_utils.py b/proteingym/baselines/unirep/utils/io_utils.py
new file mode 100644
index 0000000..56354ee
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/io_utils.py
@@ -0,0 +1,193 @@
+import fileinput
+import glob
+import os
+
+from Bio import SeqIO
+import filelock
+import numpy as np
+import pandas as pd
+
+from utils.data_utils import is_valid_seq, seqs_to_onehot
+
+
+def merge_dfs(in_rgx, out_path, index_cols, groupby_cols, ignore_cols):
+ """
+ Merge multiple pandas DataFrames into one and provides a summary file.
+ Args:
+ - in_rgx: regex for input filepath
+ - out_path: output path
+ - index_cols: index column names for DataFrame
+ - groupby_cols: groupby column names in the summary step
+ - ignore_cols: columns to be ignored in the summary step
+ """
+ lock = filelock.FileLock(out_path + '.lock')
+ with lock:
+ frames = []
+ for f in glob.glob(in_rgx):
+ try:
+ frames.append(pd.read_csv(f))
+ os.remove(f)
+ except pd.errors.EmptyDataError:
+ continue
+ df = pd.concat(frames, axis=0, sort=True).sort_values(index_cols)
+ df.set_index(index_cols).to_csv(out_path, float_format='%.4f')
+
+ #df = df.drop(columns=ignore_cols)
+ #means = df.groupby(groupby_cols).mean()
+ #stds = df.groupby(groupby_cols).std()
+ #summary = pd.merge(means, stds, on=groupby_cols, suffixes=('_mean', '_std'))
+ #summary = summary.sort_index(axis=1)
+ #save_path = out_path.replace(".csv", "_summary.csv")
+ #summary.to_csv(save_path, float_format='%.4f')
+ #return summary
+
+
+def parse_var(s):
+ """
+ Parse a key, value pair, separated by '='
+ That's the reverse of ShellArgs.
+ On the command line (argparse) a declaration will typically look like:
+ foo=hello
+ or
+ foo="hello world"
+ """
+ items = s.split('=')
+ key = items[0].strip() # we remove blanks around keys, as is logical
+ if len(items) > 1:
+ # rejoin the rest:
+ value = '='.join(items[1:])
+ return (key, value)
+
+
+def parse_vars(items):
+ """
+ Parse a series of key-value pairs and return a dictionary
+ """
+ d = {}
+
+ if items:
+ for item in items:
+ key, value = parse_var(item)
+ try:
+ d[key] = float(value)
+ except:
+ d[key] = value
+ return d
+
+
+def load_data_split(dataset_name, split_id, seed=0, ignore_gaps=False):
+ data_path = os.path.join('data', dataset_name, 'data.csv')
+ # Sample shuffles the DataFrame.
+ data_pre_split = pd.read_csv(data_path).sample(frac=1.0, random_state=seed)
+ if not ignore_gaps:
+ is_valid = data_pre_split['seq'].apply(is_valid_seq)
+ data_pre_split = data_pre_split[is_valid]
+ if split_id == -1:
+ return data_pre_split
+ return np.array_split(data_pre_split, 3)[split_id]
+
+
+def get_wt_log_fitness(dataset_name):
+ data_path = os.path.join('data', dataset_name, 'data.csv')
+ data = pd.read_csv(data_path)
+ try:
+ return data[data.n_mut == 0].log_fitness.mean()
+ except:
+ return data.log_fitness.mean()
+
+
+def get_log_fitness_cutoff(dataset_name):
+ data_path = os.path.join('data', dataset_name, 'log_fitness_cutoff.npy')
+ return np.loadtxt(data_path).item()
+
+
+def count_rows(filename_glob_pattern):
+ cnt = 0
+ for f in sorted(glob.glob(filename_glob_pattern)):
+ with open(f) as fp:
+ for line in fp:
+ cnt += 1
+ return cnt
+
+
+def load_rows_by_numbers(filename_glob_pattern, line_numbers):
+ lns_sorted = sorted(line_numbers)
+ lns_idx = np.argsort(line_numbers)
+ n_rows = len(line_numbers)
+ current_ln = 0 # current (accumulated) line number in opened file
+ j = 0 # index in lns
+ rows = None
+ for f in sorted(glob.glob(filename_glob_pattern)):
+ with open(f) as fp:
+ for line in fp:
+ while j < n_rows and lns_sorted[j] == current_ln:
+ thisrow = np.array([float(x) for x in line.split(' ')])
+ if rows is None:
+ rows = np.full((n_rows, len(thisrow)), np.nan)
+ rows[lns_idx[j], :] = thisrow
+ j += 1
+ current_ln += 1
+ assert j == n_rows, (f"Expected {n_rows} rows, found {j}. "
+ f"Scanned {current_ln} lines from {filename_glob_pattern}.")
+ return rows
+
+
+def load(filename_glob_pattern):
+ files = sorted(glob.glob(filename_glob_pattern))
+ if len(files) == 0:
+ print("No files found for", filename_glob_pattern)
+ return np.loadtxt(fileinput.input(files))
+
+
+def save(filename_pattern, data, entries_per_file=2000):
+ n_files = int(data.shape[0] / entries_per_file)
+ if data.shape[0] % entries_per_file > 0:
+ n_files += 1
+ for i in range(n_files):
+ filename = filename_pattern + f'-{i:03d}-of-{n_files:03d}'
+ l_idx = i * entries_per_file
+ r_idx = min(l_idx + entries_per_file, data.shape[0])
+ np.savetxt(filename, data[l_idx:r_idx])
+
+
+def load_and_filter_seqs(data_filename, mode="ProteinGym"):
+ """
+ seqs_filename: file to write out filtered sequences
+ """
+ df = pd.read_csv(data_filename, low_memory=False)
+ if mode=="ProteinGym":
+ if "mutated_sequence" in df.columns.values:
+ all_sequences = np.unique(df.mutated_sequence.values)
+ # assumes "mutant" column contains full mutated sequence instead
+ else:
+ all_sequences = np.unique(df.mutant.values)
+ elif 'Sequence' in df.columns.values:
+ all_sequences = np.unique(df.Sequence.values)
+ else:
+ all_sequences = np.unique(df.seq.values)
+ seqs = []
+ stop_codon_cnt = 0
+ for seq in all_sequences:
+ seq = seq.strip('*')
+ if is_valid_seq(seq):
+ seqs.append(seq)
+ else:
+ if '*' in seq:
+ stop_codon_cnt += 1
+ else:
+ print('Invalid seq', seq)
+ print('Formatted %d sequences. Discarded %d with stop codon.' % (len(seqs), stop_codon_cnt))
+ return seqs
+
+
+def read_fasta(filename, return_ids=False):
+ records = SeqIO.parse(filename, 'fasta')
+ seqs = list()
+ ids = list()
+ for record in records:
+ seqs.append(str(record.seq))
+ ids.append(str(record.id))
+ if return_ids:
+ return seqs, ids
+ else:
+ return seqs
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/metric_utils.py b/proteingym/baselines/unirep/utils/metric_utils.py
new file mode 100644
index 0000000..9b714a1
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/metric_utils.py
@@ -0,0 +1,57 @@
+import numpy as np
+from scipy.stats import spearmanr
+from sklearn.metrics import r2_score, roc_auc_score, ndcg_score
+
+SPEARMAN_FRACTIONS = np.linspace(0.1, 1.0, 10)
+
+
+def spearman(y_pred, y_true):
+ if np.var(y_pred) < 1e-6 or np.var(y_true) < 1e-6:
+ return 0.0
+ return spearmanr(y_pred, y_true).correlation
+
+
+def spearman_scoring_fn(sklearn_estimator, X, y):
+ return spearman(sklearn_estimator.predict(X), y)
+
+
+def ndcg(y_pred, y_true):
+ y_true_normalized = (y_true - y_true.mean()) / y_true.std()
+ return ndcg_score(y_true_normalized.reshape(1, -1), y_pred.reshape(1, -1))
+
+
+def topk_mean(y_pred, y_true, topk=96):
+ return np.mean(y_true[np.argsort(y_pred)[-topk:]])
+
+
+def r2(y_pred, y_true):
+ return r2_score(y_true, y_pred)
+
+
+def hit_rate(y_pred, y_true, y_ref=0.0, topk=96):
+ n_above = np.sum(y_true[np.argsort(y_pred)[-topk:]] > y_ref)
+ return float(n_above) / float(topk)
+
+
+def aucroc(y_pred, y_true, y_cutoff):
+ y_true_bin = (y_true >= y_cutoff)
+ return roc_auc_score(y_true_bin, y_pred, average='micro')
+
+
+def get_spearman_fractions(y_pred, y_true):
+ results = np.zeros(len(SPEARMAN_FRACTIONS))
+ for i, f in enumerate(SPEARMAN_FRACTIONS):
+ k = int(f * len(y_true))
+ idx = np.argsort(y_true)[-k:]
+ results[i] = spearmanr(y_pred[idx], y_true[idx]).correlation
+ return results
+
+
+def wt_improvement_metric(y_pred, y_true, y_wt, topk=96):
+ hr = hit_rate(y_pred, y_true, y_wt, topk)
+ baseline = float(np.sum(y_true > y_wt)) / len(y_true)
+ return hr / baseline
+
+
+def topk_median(y_pred, y_true, topk=96):
+ return np.median(y_true[np.argsort(y_pred)[-topk:]])
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/notebook_utils.py b/proteingym/baselines/unirep/utils/notebook_utils.py
new file mode 100644
index 0000000..a2c3c0c
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/notebook_utils.py
@@ -0,0 +1,140 @@
+import os
+from Bio import SeqIO
+import numpy as np
+import pandas as pd
+import matplotlib.pyplot as plt
+import matplotlib
+from matplotlib.ticker import MultipleLocator, FixedLocator
+import seaborn as sns
+
+from utils.io_utils import load, read_fasta
+from utils.data_utils import seq2effect
+
+# EVMutation imports
+from couplings_model import CouplingsModel
+
+
+def add_unirep_model(df, model_names, path, name):
+ # The inference for UniRep has sorted sequences.
+ df = df.sort_values('seq')
+ seqlen = len(df.seq.values[0])
+ df[name] = -seqlen * load(path)
+ model_names.append(name)
+ return df, model_names
+
+
+def add_ev_model(df, model_names, path, name, dataset, include_indep=False):
+ wt = read_fasta(os.path.join('../data', dataset, 'wt.fasta'))[0]
+
+ couplings_model = CouplingsModel(path)
+ df[f'{name}'] = seq2effect(df.seq.values, couplings_model)
+ model_names.append(name)
+
+ if include_indep:
+ indep_model = couplings_model.to_independent_model()
+ df[f'{name}_indep'] = seq2effect(df.seq.values, wt, indep_model)
+ model_names.append(f'{name}_indep')
+ return df, model_names
+
+
+def add_hmm_model(df, model_names, path, name, dataset):
+ df = df.sort_values('seq')
+ records = SeqIO.parse(os.path.join('../data', dataset, 'seqs.fasta'),
+ 'fasta')
+ ids = []
+ seqs = []
+ for rec in records:
+ seqs.append(str(rec.seq))
+ ids.append(str(rec.id))
+ id2seq = pd.Series(index=ids, data=seqs, name='seq')
+ hmm_ll = pd.read_csv(path)[['target', 'score_full']]
+ hmm_ll = hmm_ll.join(id2seq, on='target', how='left')
+ hmm_ll = hmm_ll.drop_duplicates(subset='seq')
+ df[name] = hmm_ll.sort_values('seq')['score_full'].values
+ model_names.append(name)
+ return df, model_names
+
+
+metric_display_name = {
+ 'ndcg': 'NDCG',
+ 'topk_mean': 'Top 96 mean',
+ 'spearman': 'Spearman correlation',
+}
+
+
+def retrieve_metric(df, metric_name, n_mut=None, predictor=None):
+ tmp = df
+ if predictor is not None:
+ if isinstance(predictor, str):
+ predictor = [predictor]
+ tmp = tmp.loc[tmp.predictor.apply(lambda x: x in predictor)]
+ if n_mut is not None:
+ metric_name = f'{metric_name}_{n_mut}mut'
+ tmp = tmp[['predictor', 'n_train', metric_name]]
+ return tmp.rename(columns={metric_name:'val'})
+
+
+def metric_lineplot(df, predictors, metric, predictor_names, dataset_name,
+ max_n_mut, savename='figure', legend=None, mutcounts=None, **kwargs):
+ fig, axes = plt.subplots(1, max_n_mut+1,
+ figsize=((max_n_mut+1)*3, 4),
+ sharex=True, sharey=True)
+ ax = axes[0]
+ nmut_to_title = {
+ 1: 'Single mutants',
+ 2: 'Double mutants',
+ 3: 'Triple mutants',
+ 4: 'Quadruple mutants',
+ }
+ nmut_to_title.update({i: f'{i}th-order Mutants' for i in range(5, 11)})
+ tmp = retrieve_metric(df, metric, n_mut=None, predictor=predictors)
+ sns.lineplot(data=tmp, x='n_train', y='val',
+ hue='predictor', style='predictor', ax=ax,
+ hue_order=predictors, style_order=predictors, **kwargs)
+ #ax.hlines(df[df.predictor == 'mutation'].mean().spearman, 48, 240, color='dimgrey')
+ ax.set_title(f'mutants of all orders')
+ ax.set_ylabel(metric_display_name[metric])
+ ax.set_xlabel('Training data size')
+ for n_mut in range(1, max_n_mut+1):
+ ax = axes[n_mut]
+ tmp = retrieve_metric(df, metric, n_mut=n_mut, predictor=predictors)
+ sns.lineplot(data=tmp, x='n_train', y='val',
+ hue='predictor', style='predictor', ax=ax,
+ hue_order=predictors, style_order=predictors, **kwargs)
+ ax.set_title(nmut_to_title[n_mut])
+ ax.set_ylabel(metric_display_name[metric])
+ ax.set_xlabel('Training data size')
+ if mutcounts is not None:
+ for i in range(max_n_mut+1):
+ axes[i].annotate(f'Data size: {int(mutcounts[i])}',
+ xy=(0.29, 0.03), xycoords='axes fraction',
+ fontsize=9)
+
+ if legend is not None:
+ handles, labels = legend['handles'], legend['labels']
+ lgd = fig.legend(handles, labels, bbox_to_anchor=legend['loc'],
+ loc='upper left', ncol=1, fontsize=11, frameon=False)
+
+ ax.xaxis.set_minor_locator(MultipleLocator(24))
+ ax.xaxis.set_major_locator(MultipleLocator(48))
+ ax.xaxis.set_major_formatter('{x:.0f}')
+
+ for ax in axes:
+ ax.get_legend().remove()
+
+ pad = 8
+ ax = axes[0]
+ annot = ax.annotate(dataset_name, xy=(0, 0.5), xytext=(-ax.yaxis.labelpad - pad, 0),
+ xycoords=ax.yaxis.label, textcoords='offset points',
+ size='large', ha='right', va='center', rotation=90, fontsize=14)
+ ax.annotate('Test on:', xy=(-0.05, 0.492), xycoords=ax.title, textcoords='offset points',
+ size='large', ha='right', va='center', rotation=0, fontsize=13)
+
+ plt.subplots_adjust(top=0.80, wspace=0.1)
+ if legend is not None:
+ plt.savefig('../figs/' + savename + '.png', format='png', dpi=600,
+ bbox_extra_artists=(annot,lgd,), bbox_inches='tight', pad_inches=0)
+ else:
+ plt.savefig('../figs/' + savename + '.png', format='png', dpi=600,
+ bbox_inches='tight', pad_inches=0)
+ plt.show()
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/plot_utils.py b/proteingym/baselines/unirep/utils/plot_utils.py
new file mode 100644
index 0000000..908026a
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/plot_utils.py
@@ -0,0 +1,53 @@
+import pandas as pd
+import numpy as np
+import seaborn as sns
+import matplotlib.pyplot as plt
+from functools import partial
+from utils.metric_utils import spearman, topk_mean, hit_rate, aucroc
+
+
+def get_stratified_metrics(df, model_name, max_n_mut, metric_fn):
+ strat_metrics = np.zeros(max_n_mut+1)
+ for i in range(0, max_n_mut+1):
+ # 0th position is overall aggregated metric
+ tmp = df
+ if i > 0:
+ tmp = df[df.n_mut == i]
+ strat_metrics[i] = metric_fn(tmp[model_name], tmp.log_fitness)
+ return strat_metrics
+
+
+def plot_stratified_metrics(ax, df, models, max_n_mut, metric_fn, vmin, vmax):
+ strat_matrix = np.zeros((len(models), 1+max_n_mut))
+ xticklabels=['All'] + list(range(1, max_n_mut+1))
+ for i, m in enumerate(models):
+ strat_matrix[i] = get_stratified_metrics(df, m, max_n_mut, metric_fn)
+ sns.heatmap(strat_matrix, yticklabels=models, xticklabels=xticklabels,
+ vmin=vmin, vmax=vmax, ax=ax, cmap='viridis')
+ ax.set_xlabel('# Mutations')
+ ax.vlines([1], *ax.get_ylim(), colors='black')
+
+
+def plot_auc_and_corr(df, models, functional_threshold, wt_log_fitness,
+ max_n_mut=5, vmin=[None, None, None], vmax=[None, None, None], topk=96):
+ fig, axes = plt.subplots(1, 3, figsize=(14, 4), sharex=True, sharey=True)
+
+ ax = axes[0]
+ fn = partial(aucroc, y_cutoff=functional_threshold)
+ plot_stratified_metrics(ax, df, models, max_n_mut, fn, vmin[0], vmax[0])
+ ax.set_title(f'Functional vs Non-Functional AUC-ROC')
+
+ ax = axes[1]
+ fn = partial(aucroc, y_cutoff=wt_log_fitness)
+ plot_stratified_metrics(ax, df[df.log_fitness >= functional_threshold],
+ models, max_n_mut, fn, vmin[1], vmax[1])
+ ax.set_title(f'Functional, =WT AUC-ROC')
+
+ ax = axes[2]
+ plot_stratified_metrics(ax, df[df.log_fitness >= functional_threshold],
+ models, max_n_mut, spearman, vmin[2], vmax[2])
+ ax.set_title('Rank Correlation (Functional)')
+
+ fig.suptitle('Model performance, stratified by # mutations')
+ plt.subplots_adjust(wspace=0.1, top=0.85)
+ plt.show()
\ No newline at end of file
diff --git a/proteingym/baselines/unirep/utils/unirep_utils.py b/proteingym/baselines/unirep/utils/unirep_utils.py
new file mode 100644
index 0000000..b54096b
--- /dev/null
+++ b/proteingym/baselines/unirep/utils/unirep_utils.py
@@ -0,0 +1,143 @@
+import os
+import tensorflow.compat.v1 as tf
+
+def tf_str_len(s):
+ """
+ Returns length of tf.string s
+ """
+ return tf.size(tf.string_split([s],""))
+
+def tf_rank1_tensor_len(t):
+ """
+ Returns the length of a rank 1 tensor t as rank 0 int32
+ """
+ l = tf.reduce_sum(tf.sign(tf.abs(t)), 0)
+ return tf.cast(l, tf.int32)
+
+
+def tf_seq_to_tensor(s):
+ """
+ Input a tf.string of comma seperated integers.
+ Returns Rank 1 tensor the length of the input sequence of type int32
+ """
+ return tf.string_to_number(
+ tf.sparse_tensor_to_dense(tf.string_split([s],","), default_value='0'), out_type=tf.int32
+ )[0]
+
+def smart_length(length, bucket_bounds=tf.constant([128, 256])):
+ """
+ Hash the given length into the windows given by bucket bounds.
+ """
+ # num_buckets = tf_len(bucket_bounds) + tf.constant(1)
+ # Subtract length so that smaller bins are negative, then take sign
+ # Eg: len is 129, sign = [-1,1]
+ signed = tf.sign(bucket_bounds - length)
+
+ # Now make 1 everywhere that length is greater than bound, else 0
+ greater = tf.sign(tf.abs(signed - tf.constant(1)))
+
+ # Now simply sum to count the number of bounds smaller than length
+ key = tf.cast(tf.reduce_sum(greater), tf.int64)
+
+ # This will be between 0 and len(bucket_bounds)
+ return key
+
+def pad_batch(ds, batch_size, padding=None, padded_shapes=([None])):
+ """
+ Helper for bucket batch pad- pads with zeros
+ """
+ return ds.padded_batch(batch_size,
+ padded_shapes=padded_shapes,
+ padding_values=padding
+ )
+
+def aas_to_int_seq(aa_seq):
+ int_seq = ""
+ for aa in aa_seq:
+ int_seq += str(aa_to_int[aa]) + ","
+ return str(aa_to_int['start']) + "," + int_seq + str(aa_to_int['stop'])
+
+# Preprocessing in python
+def fasta_to_input_format(source, destination):
+ # I don't know exactly how to do this in tf, so resorting to python.
+ # Should go line by line so everything is not loaded into memory
+
+ sourcefile = os.path.join(source)
+ destination = os.path.join(destiation)
+ with open(sourcefile, 'r') as f:
+ with open(destination, 'w') as dest:
+ seq = ""
+ for line in f:
+ if line[0] == '>' and not seq == "":
+ dest.write(aas_to_int_seq(seq) + '\n')
+ seq = ""
+ elif not line[0] == '>':
+ seq += line.replace("\n","")
+
+# Real data pipelines
+
+def bucketbatchpad(
+ batch_size=256,
+ path_to_data=os.path.join("./data/SwissProt/sprot_ints.fasta"), # Preprocessed- see note
+ compressed="", # See tf.contrib.data.TextLineDataset init args
+ bounds=[128,256], # Default buckets of < 128, 128><256, >256
+ # Unclear exactly what this does, should proly equal batchsize
+ window_size=256, # NOT a tensor
+ padding=None, # Use default padding of zero, otherwise see Dataset docs
+ shuffle_buffer=None, # None or the size of the buffer to shuffle with
+ pad_shape=([None]),
+ repeat=1,
+ filt=None
+):
+ """
+ Streams data from path_to_data that is correctly preprocessed.
+ Divides into buckets given by bounds and pads to full length.
+ Returns a dataset which will return a padded batch of batchsize
+ with iteration.
+ """
+ batch_size=tf.constant(batch_size, tf.int64)
+ bounds=tf.constant(bounds)
+ window_size=tf.constant(window_size, tf.int64)
+
+ path_to_data = os.path.join(path_to_data)
+ # Parse strings to tensors
+ dataset = tf.data.TextLineDataset(path_to_data).map(tf_seq_to_tensor)
+ if filt is not None:
+ dataset = dataset.filter(filt)
+
+ if shuffle_buffer:
+ # Stream elements uniformly randomly from a buffer
+ dataset = dataset.shuffle(buffer_size=shuffle_buffer)
+ # Apply a repeat. Because this is after the shuffle, all elements of the dataset should be seen before repeat.
+ # See https://stackoverflow.com/questions/44132307/tf-contrib-data-dataset-repeat-with-shuffle-notice-epoch-end-mixed-epochs
+ dataset = dataset.repeat(count=repeat)
+ # Apply grouping to bucket and pad
+ group_fn = tf.data.experimental.group_by_window(
+ key_func=lambda seq: smart_length(tf_rank1_tensor_len(seq), bucket_bounds=bounds), # choose a bucket
+ reduce_func=lambda key, ds: pad_batch(ds, batch_size, padding=padding, padded_shapes=pad_shape), # apply reduce funtion to pad
+ window_size=window_size)
+ grouped_dataset = dataset.apply(group_fn)
+ return grouped_dataset
+
+def shufflebatch(
+ batch_size=256,
+ shuffle_buffer=None,
+ repeat=1,
+ path_to_data="./data/SwissProt/sprot_ints.fasta"
+):
+ """
+ Draws from an (optionally shuffled) dataset, repeats dataset repeat times,
+ and serves batches of the specified size.
+ """
+
+ path_to_data = os.path.join(path_to_data)
+ # Parse strings to tensors
+ dataset = tf.contrib.data.TextLineDataset(path_to_data).map(tf_seq_to_tensor)
+ if shuffle_buffer:
+ # Stream elements uniformly randomly from a buffer
+ dataset = dataset.shuffle(buffer_size=shuffle_buffer)
+ # Apply a repeat. Because this is after the shuffle, all elements of the dataset should be seen before repeat.
+ # See https://stackoverflow.com/questions/44132307/tf-contrib-data-dataset-repeat-with-shuffle-notice-epoch-end-mixed-epochs
+ dataset = dataset.repeat(count=repeat)
+ dataset = dataset.batch(batch_size)
+ return dataset
\ No newline at end of file
diff --git a/proteingym/baselines/vespa/compute_fitness.py b/proteingym/baselines/vespa/compute_fitness.py
new file mode 100644
index 0000000..b364c39
--- /dev/null
+++ b/proteingym/baselines/vespa/compute_fitness.py
@@ -0,0 +1,115 @@
+import pandas as pd
+import os
+import argparse
+import subprocess
+import json
+import numpy as np
+import tqdm
+
+if __name__ == "__main__":
+ parser = argparse.ArgumentParser()
+ parser = argparse.ArgumentParser(description='Tranception scoring')
+
+ parser.add_argument("--cache_location", type=str, help="Location of T5 weight cache", default=None)
+ parser.add_argument("--skip_VESPA_computation",type=str,help="Skip running VESPA if it's been prerun and we just want to compute fitness.")
+ parser.add_argument("--wt_fasta_file",type=str,default="./WT_sequences.fasta",help="Location of fasta file containing all wild type sequences to compute VESPA scores for.")
+ parser.add_argument("--vespa_tmp_dir",type=str,default="./vespa_tmp",help="Location of temporary files for VESPA")
+ parser.add_argument('--DMS_reference_file_path', default=None, type=str, help='Path to reference file with list of DMS to score')
+ parser.add_argument("--MSA_data_folder", type=str, help="Folder where MSAs are stored", default=None)
+ parser.add_argument('--DMS_data_folder', type=str, help='Path to folder that contains all DMS assay datasets')
+ parser.add_argument('--DMS_index', default=0, type=int, help='Index of DMS assay in reference file')
+ parser.add_argument("--DMS_index_range_start",type=int, help="Start of range of DMS assays", default=None)
+ parser.add_argument("--DMS_index_range_end",type=int, help="Index of DMS assay in reference file", default=None)
+
+ args = parser.parse_args()
+ if not os.path.exists(args.vespa_tmp_dir):
+ os.makedirs(args.vespa_tmp_dir)
+
+ if args.DMS_reference_file_path is not None:
+ if args.DMS_index_range_start is not None and args.DMS_index_range_end is not None:
+ DMS_reference_file = pd.read_csv(args.DMS_reference_file_path)
+ DMS_filenames = DMS_reference_file['DMS_filename'][args.DMS_index_range_start:args.DMS_index_range_end+1].tolist()
+ MSA_filenames = DMS_reference_file['MSA_filename'][args.DMS_index_range_start:args.DMS_index_range_end+1].unique().tolist()
+ target_seqs = DMS_reference_file['target_seq'][args.DMS_index_range_start:args.DMS_index_range_end+1].tolist()
+ else:
+ DMS_reference_file = pd.read_csv(args.DMS_reference_file_path)
+ DMS_filenames = [DMS_reference_file['DMS_filename'][args.DMS_index]]
+ MSA_filenames = [DMS_reference_file['MSA_filename'][args.DMS_index]]
+ target_seqs = [DMS_reference_file['target_seq'][args.DMS_index]]
+ else:
+ DMS_filenames = [args.DMS_file_name]
+ MSA_filenames = [args.MSA_file_name]
+ target_seqs = [args.target_seq]
+ if not args.skip_VESPA_computation:
+ # Build fasta file of wild types sequences based on first sequence of each alignment in MSA_filenames
+ all_WT_sequences_fasta = args.vespa_tmp_dir + os.sep + args.wt_fasta_file
+ index_map = {"indices":[],"MSA_filename":[]}
+ for i,filename in enumerate(MSA_filenames):
+ index_map["indices"].append(i)
+ index_map["MSA_filename"].append(filename)
+ f = os.path.join(args.MSA_data_folder, filename)
+ target_seq=""
+ with open(f, 'r') as msa_data:
+ for i, line in enumerate(msa_data):
+ line = line.rstrip()
+ if line.startswith(">") and i==0:
+ with open(all_WT_sequences_fasta,'a+') as seq_wt_file:
+ seq_wt_file.write(line+"\n")
+ elif line.startswith(">"):
+ break
+ else:
+ with open(all_WT_sequences_fasta,'a+') as seq_wt_file:
+ seq_wt_file.write(line+"\n")
+ subprocess.run(["vespa",args.wt_fasta_file,"--prott5_weights_cache", args.cache_location, "--vespa"], cwd=args.vespa_tmp_dir, check=True)
+ else:
+ index_map = {"indices":[],"MSA_filename":[]}
+ for i,filename in enumerate(MSA_filenames):
+ index_map["indices"].append(i)
+ index_map["MSA_filename"].append(filename)
+ print(f"Skipping VESPA computation, scoring DMS assays {DMS_filenames} with precomputed VESPA scores. Note that this assumes that the WT sequence file contains the wild type sequences in the same order as the DMS reference file.")
+ # Score each DMS assay
+ assert os.path.exists(f"{args.vespa_tmp_dir}/vespa_run_directory/output/map.json")
+ map_dict = json.load(open(f"{args.vespa_tmp_dir}/vespa_run_directory/output/map.json","r"))
+ index_df = pd.DataFrame(index_map)
+ index_df["VESPA_scoring_file_name"] = index_df["indices"].apply(lambda x: map_dict[str(x)])
+ index_df = index_df.rename(columns={"indices":"VESPA_scoring_file_index"})
+ DMS_reference_file_subset = DMS_reference_file.iloc[args.DMS_index_range_start:args.DMS_index_range_end+1]
+ DMS_reference_file_subset = DMS_reference_file_subset.merge(index_df, how="left",on="MSA_filename")
+ list_DMS = DMS_reference_file_subset["DMS_id"]
+ VESPA_scores_folder = f"{args.vespa_tmp_dir}/vespa_run_directory/output"
+ for DMS_id in list_DMS:
+ print(DMS_id)
+ DMS_filename = DMS_reference_file_subset["DMS_filename"][DMS_reference_file_subset["DMS_id"]==DMS_id].values[0]
+ VESPA_scores_filename = int(DMS_reference_file_subset["VESPA_scoring_file_index"][DMS_reference_file_subset["DMS_id"]==DMS_id].values[0])
+ print(VESPA_scores_filename)
+ VESPA_scores_filename = str(VESPA_scores_filename) + '.csv'
+ DMS_file = pd.read_csv(args.DMS_data_folder+os.sep+DMS_filename) #mutant,mutated_sequence,DMS_score,DMS_score_bin
+ VESPA_scores = pd.read_csv(VESPA_scores_folder+os.sep+VESPA_scores_filename, sep=";")
+ MSA_start = DMS_reference_file_subset["MSA_start"][DMS_reference_file_subset["DMS_id"]==DMS_id].values[0]
+ VESPA_scores['mutant'] = VESPA_scores['Mutant'].apply(lambda x: x[0] + str(int(int(x[1:-1])+MSA_start)) + x[-1])
+ VESPA_scores['VESPA'] = np.log(1 - VESPA_scores['VESPA']) #log proba of being functional (raw score is proba that mutation has an "effect")
+ VESPA_scores['VESPAl'] = np.log(1 - VESPA_scores['VESPAl'])
+ mapping_mutant_VESPA={}
+ mapping_mutant_VESPAl={}
+ for mutant in tqdm.tqdm(DMS_file['mutant']):
+ VESPA_score_singles_sum = 0
+ VESPAl_score_singles_sum = 0
+ #Proba of multiple mutants to be benign is the product that each mutant is benign
+ num_synonomous = 0
+ for single in mutant.split(":"):
+ # skipping synomymous mutations
+ if single[0] == single[-1]:
+ num_synonomous += 1
+ continue
+ VESPA_score_singles_sum += VESPA_scores['VESPA'][VESPA_scores['mutant']==single].values[0]
+ VESPAl_score_singles_sum += VESPA_scores['VESPAl'][VESPA_scores['mutant']==single].values[0]
+ assert VESPA_score_singles_sum!=0 or num_synonomous == len(mutant.split(":")), "Missing VESPA scores"
+ mapping_mutant_VESPA[mutant] = VESPA_score_singles_sum
+ mapping_mutant_VESPAl[mutant] = VESPAl_score_singles_sum
+ mapping_mutant_VESPA = pd.DataFrame.from_dict(mapping_mutant_VESPA, orient='index').reset_index()
+ mapping_mutant_VESPA.columns = ['mutant','VESPA']
+ mapping_mutant_VESPAl = pd.DataFrame.from_dict(mapping_mutant_VESPAl, orient='index').reset_index()
+ mapping_mutant_VESPAl.columns = ['mutant','VESPAl']
+ final_scores_VESPA = pd.merge(DMS_file,mapping_mutant_VESPA, how='left',on='mutant')
+ final_scores_VESPA = pd.merge(final_scores_VESPA,mapping_mutant_VESPAl, how='left',on='mutant')
+ print(final_scores_VESPA)
\ No newline at end of file
diff --git a/proteingym/constants.json b/proteingym/constants.json
new file mode 100644
index 0000000..75bfbba
--- /dev/null
+++ b/proteingym/constants.json
@@ -0,0 +1,262 @@
+{
+ "model_details":{
+ "Tranception_L_no_retrieval":"Tranception Large model (700M params) without retrieval",
+ "Tranception_M_no_retrieval":"Tranception Medium model (300M params) without retrieval",
+ "Tranception_S_no_retrieval":"Tranception Small model (85M params) without retrieval",
+ "Tranception_S":"Tranception Small model (85M params) with retrieval",
+ "Tranception_M":"Tranception Medium model (300M params) with retrieval",
+ "Tranception_L":"Tranception Large model (700M params) with retrieval",
+ "EVE_single":"EVE model (single seed)",
+ "EVE_ensemble":"EVE model (ensemble of 5 independently-trained models)",
+ "MSA_Transformer_single":"MSA Transformer (single MSA sample)",
+ "MSA_Transformer_ensemble":"MSA Transformer (ensemble of 5 MSA samples)",
+ "ESM1v_single":"ESM-1v (single seed)",
+ "ESM1v_ensemble":"ESM-1v (ensemble of 5 independently-trained models)",
+ "Wavenet":"Wavenet model",
+ "DeepSequence_single":"DeepSequence model (single seed)",
+ "DeepSequence_ensemble":"DeepSequence model (ensemble of 5 independently-trained models)",
+ "Site_Independent":"Site-Independent model",
+ "EVmutation":"EVmutation model",
+ "RITA_s":"RITA small model (85M params)",
+ "RITA_m":"RITA medium model (300M params)",
+ "RITA_l":"RITA large model (680M params)",
+ "RITA_xl":"RITA xlarge model (1.2B params)",
+ "RITA_ensemble":"Ensemble of the 4 RITA models",
+ "Progen2_small":"Progen2 small model (150M params)",
+ "Progen2_medium":"Progen2 medium model (760M params)",
+ "Progen2_base":"Progen2 base model (760M params)",
+ "Progen2_large":"Progen2 large model (2.7B params)",
+ "Progen2_xlarge":"Progen2 xlarge model (6.4B params)",
+ "Progen2_ensemble":"Ensemble of the 5 Progen2 models",
+ "Unirep":"Unirep model",
+ "Unirep_evotune":"Unirep model w/ evotuning",
+ "GEMME":"GEMME model",
+ "VESPA":"VESPA model",
+ "VESPAl":"VESPAl model",
+ "ProtGPT2":"ProtGPT2 model",
+ "ESM1b":"ESM-1b (w/ Brandes et al. extensions)",
+ "TranceptEVE_S":"TranceptEVE Small model (Tranception Small & retrieved EVE model)",
+ "TranceptEVE_M":"TranceptEVE Medium model (Tranception Medium & retrieved EVE model)",
+ "TranceptEVE_L":"TranceptEVE Large model (Tranception Large & retrieved EVE model)",
+ "HMM": "Profile Hidden Markov model",
+ "Provean": "Provean model",
+ "ESM-IF1": "ESM-IF1 model",
+ "ESM2_650M": "ESM2 model (650M params)",
+ "ESM2_3B": "ESM2 model (3B params)",
+ "ESM2_15B": "ESM2 model (15B params)",
+ "ESM2_150M": "ESM2 model (150M params)",
+ "ESM2_35M":"ESM2 model (35M params)",
+ "ESM2_8M":"ESM2 model (8M params)",
+ "MIFST":"MIF-ST model",
+ "MIF":"MIF model",
+ "CARP_640M": "CARP model (640M params)",
+ "CARP_76M": "CARP model (76M params)",
+ "CARP_38M": "CARP model (38M params)",
+ "CARP_600K": "CARP model (600K params)",
+ "ProteinMPNN": "ProteinMPNN model",
+ "ProtSSN_k10_h512": "ProtSSN (k=10, h=512)",
+ "ProtSSN_k10_h768": "ProtSSN (k=10, h=768)",
+ "ProtSSN_k10_h1280": "ProtSSN (k=10, h=1280)",
+ "ProtSSN_k20_h512": "ProtSSN (k=20, h=512)",
+ "ProtSSN_k20_h768": "ProtSSN (k=20, h=768)",
+ "ProtSSN_k20_h1280": "ProtSSN (k=20, h=1280)",
+ "ProtSSN_k30_h512": "ProtSSN (k=30, h=512)",
+ "ProtSSN_k30_h768": "ProtSSN (k=30, h=768)",
+ "ProtSSN_k30_h1280": "ProtSSN (k=30, h=1280)",
+ "ProtSSN_ensemble": "ProtSSN (ensemble of 9 models)",
+ "SaProt_650M_AF2": "SaProt (650M)",
+ "SaProt_35M_AF2": "SaProt (35M)"
+ },
+ "model_references":{
+ "Tranception_L_no_retrieval":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "Tranception_M_no_retrieval":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "Tranception_S_no_retrieval":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "Tranception_S":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "Tranception_M":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "Tranception_L":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "EVE_single":"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.",
+ "EVE_ensemble":"Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.",
+ "MSA_Transformer_single":"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.",
+ "MSA_Transformer_ensemble":"Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.",
+ "ESM1v_single":"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.",
+ "ESM1v_ensemble":"Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.",
+ "Wavenet":"Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.",
+ "DeepSequence_single":"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.",
+ "DeepSequence_ensemble":"Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.",
+ "Site_Independent":"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.",
+ "EVmutation":"Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.",
+ "RITA_s":"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.",
+ "RITA_m":"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.",
+ "RITA_l":"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.",
+ "RITA_xl":"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.",
+ "RITA_ensemble":"Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.",
+ "Progen2_small":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Progen2_medium":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Progen2_base":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Progen2_large":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Progen2_xlarge":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Progen2_ensemble":" Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. ",
+ "Unirep":"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.",
+ "Unirep_evotune":"Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.",
+ "GEMME":"Laine, É., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.",
+ "ProtGPT2":"Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.",
+ "VESPA":"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.",
+ "VESPAl":"Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.",
+ "ESM1b":"[1] Original model: Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118. [2] Extensions: Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.",
+ "TranceptEVE_S":"Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.",
+ "TranceptEVE_M":"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.",
+ "TranceptEVE_L":"Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.",
+ "HMM":"HMMER: biosequence analysis using profile hidden Markov models",
+ "Provean": "Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012). Predicting the functional effect of amino acid substitutions and indels. PloS one.",
+ "ESM-IF1": "Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.",
+ "ESM2_650M":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "ESM2_3B":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "ESM2_15B":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "ESM2_150M":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "ESM2_35M":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "ESM2_8M":"Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.",
+ "MIFST":"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.",
+ "MIF":"Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.",
+ "CARP_640M":"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.",
+ "CARP_76M":"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.",
+ "CARP_38M":"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.",
+ "CARP_600K":"Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.",
+ "ProteinMPNN":"J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.",
+ "ProtSSN_k10_h512": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k10_h768": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k10_h1280": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k20_h512": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k20_h768": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k20_h1280": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k30_h512": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k30_h768": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_k30_h1280": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "ProtSSN_ensemble": "Yang Tan, Bingxin Zhou, Lirong Zheng, Guisheng Fan, Liang Hong. (2023). Semantical and Topological Protein Encoding Toward Enhanced Bioactivity and Thermostability. bioRxiv.",
+ "SaProt_650M_AF2": "Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv.",
+ "SaProt_35M_AF2": "Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, Fajie Yuan. (2024). SaProt: Protein Language Modeling with Structure-aware Vocabulary. bioRxiv."
+ },
+ "clean_names":{
+ "Tranception_L_no_retrieval":"Tranception L no retrieval",
+ "Tranception_M_no_retrieval": "Tranception M no retrieval",
+ "Tranception_S_no_retrieval": "Tranception S no retrieval",
+ "Tranception_S":"Tranception S",
+ "Tranception_M":"Tranception M",
+ "Tranception_L":"Tranception L",
+ "EVE_single":"EVE (single)",
+ "EVE_ensemble":"EVE (ensemble)",
+ "MSA_Transformer_single":"MSA Transformer (single)",
+ "MSA_Transformer_ensemble":"MSA Transformer (ensemble)",
+ "ESM1v_single":"ESM-1v (single)",
+ "ESM1v_ensemble":"ESM-1v (ensemble)",
+ "Wavenet":"Wavenet",
+ "DeepSequence_single":"DeepSequence (single)",
+ "DeepSequence_ensemble":"DeepSequence (ensemble)",
+ "Site_Independent":"Site-Independent",
+ "EVmutation":"EVmutation",
+ "RITA_s":"RITA S",
+ "RITA_m":"RITA M",
+ "RITA_l":"RITA L",
+ "RITA_xl":"RITA XL",
+ "RITA_ensemble":"RITA (ensemble)",
+ "Progen2_small":"Progen2 S",
+ "Progen2_medium":"Progen2 M",
+ "Progen2_base":"Progen2 Base",
+ "Progen2_large":"Progen2 L",
+ "Progen2_xlarge":"Progen2 XL",
+ "Progen2_ensemble":"Progen2 (ensemble)",
+ "Unirep":"Unirep",
+ "Unirep_evotune":"Unirep evotuned",
+ "GEMME":"GEMME",
+ "ProtGPT2":"ProtGPT2",
+ "VESPA":"VESPA",
+ "VESPAl":"VESPAl",
+ "ESM1b": "ESM-1b",
+ "TranceptEVE_S": "TranceptEVE S",
+ "TranceptEVE_M":"TranceptEVE M",
+ "TranceptEVE_L":"TranceptEVE L",
+ "HMM":"Hidden Markov Model",
+ "Provean": "Provean",
+ "ESM-IF1": "ESM-IF1",
+ "ESM2_650M": "ESM2 (650M)",
+ "ESM2_3B": "ESM2 (3B)",
+ "ESM2_15B": "ESM2 (15B)",
+ "ESM2_150M": "ESM2 (150M)",
+ "ESM2_35M": "ESM2 (35M)",
+ "ESM2_8M": "ESM2 (8M)",
+ "MIFST":"MIF-ST",
+ "MIF":"MIF",
+ "CARP_640M":"CARP (640M)",
+ "CARP_76M":"CARP (76M)",
+ "CARP_38M":"CARP (38M)",
+ "CARP_600K":"CARP (600K)",
+ "ProteinMPNN":"ProteinMPNN",
+ "DMS_id":"DMS ID",
+ "number_mutants":"Number of Mutants",
+ "UniProt_ID":"UniProt ID",
+ "Neff_L_category": "Neff/L Category",
+ "Model_rank":"Rank",
+ "Model_name":"Model name",
+ "ProtSSN_k10_h512": "ProtSSN (k=10 h=512)",
+ "ProtSSN_k10_h768": "ProtSSN (k=10 h=768)",
+ "ProtSSN_k10_h1280": "ProtSSN (k=10 h=1280)",
+ "ProtSSN_k20_h512": "ProtSSN (k=20 h=512)",
+ "ProtSSN_k20_h768": "ProtSSN (k=20 h=768)",
+ "ProtSSN_k20_h1280": "ProtSSN (k=20 h=1280)",
+ "ProtSSN_k30_h512": "ProtSSN (k=30 h=512)",
+ "ProtSSN_k30_h768": "ProtSSN (k=30 h=768)",
+ "ProtSSN_k30_h1280": "ProtSSN (k=30 h=1280)",
+ "ProtSSN_ensemble": "ProtSSN (ensemble)",
+ "SaProt_650M_AF2": "SaProt (650M)",
+ "SaProt_35M_AF2": "SaProt (35M)"
+ },
+ "supervised_model_details": {
+ "ProteinNPT":"ProteinNPT Model",
+ "MSA Transformer Embeddings": "MSA Transformer Embeddings",
+ "ESM-1v Embeddings": "ESM-1v Embeddings",
+ "Tranception Embeddings": "Tranception Embeddings",
+ "TranceptEVE + One-Hot Encodings": "TranceptEVE + One-Hot Encodings",
+ "ESM-1v + One-Hot Encodings": "ESM-1v + One-Hot Encodings",
+ "Tranception + One-Hot Encodings": "Tranception + One-Hot Encodings",
+ "DeepSequence + One-Hot Encodings": "DeepSequence + One-Hot Encodings",
+ "MSA_Transformer + One-Hot Encodings": "MSA Transformer + One-Hot Encodings",
+ "One-Hot Encodings": "One-Hot Encodings"
+ },
+ "supervised_model_references":{
+ "ProteinNPT": "Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "MSA Transformer Embeddings":"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "ESM-1v Embeddings":"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "Tranception Embeddings":"[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "TranceptEVE + One-Hot Encodings": "[1] Original model: Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. & (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "ESM-1v + One-Hot Encodings":"[1] Original model: Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "Tranception + One-Hot Encodings": "[1] Original model: Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "DeepSequence + One-Hot Encodings":"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5",
+ "MSA_Transformer + One-Hot Encodings":"[1] Original model: Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML. [2] Extension: Notin, P., Weitzman, R., Marks, D. S., & Gal, Y. (2023). ProteinNPT: Improving protein property prediction and design with non-parametric transformers. Thirty-Seventh Conference on Neural Information Processing Systems",
+ "One-Hot Encodings":"Hsu, C., Nisonoff, H., Fannjiang, C. et al. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol 40, 1114–1122 (2022). https://doi.org/10.1038/s41587-021-01146-5"
+ },
+ "supervised_clean_names":{
+ "ProteinNPT":"ProteinNPT",
+ "Embeddings - Augmented - MSA Transformer":"MSA Transformer Embeddings",
+ "Embeddings - MSA Transformer":"MSA Transformer Embeddings",
+ "Embeddings - Augmented - EMS1v":"ESM-1v Embeddings",
+ "Embeddings - Augmented - Tranception":"Tranception Embeddings",
+ "Embeddings - Tranception":"Tranception Embeddings",
+ "OHE - Augmented - TransceptEVE":"TranceptEVE + One-Hot Encodings",
+ "OHE - Augmented - Tranception":"Tranception + One-Hot Encodings",
+ "OHE - Augmented - MSA Transformer":"MSA_Transformer + One-Hot Encodings",
+ "OHE - Augmented - DeepSequence":"DeepSequence + One-Hot Encodings",
+ "OHE - Augmented - ESM1v":"ESM-1v + One-Hot Encodings",
+ "OHE - Not augmented":"One-Hot Encodings"
+ },
+ "supervised_model_types":{
+ "ProteinNPT":"Embedding",
+ "MSA Transformer Embeddings":"Embedding",
+ "Tranception Embeddings":"Embedding",
+ "ESM-1v Embeddings":"Embedding",
+ "TranceptEVE + One-Hot Encodings":"One-hot Encoding",
+ "Tranception + One-Hot Encodings":"One-hot Encoding",
+ "MSA_Transformer + One-Hot Encodings":"One-hot Encoding",
+ "DeepSequence + One-Hot Encodings":"One-hot Encoding",
+ "ESM-1v + One-Hot Encodings":"One-hot Encoding",
+ "One-Hot Encodings":"One-hot Encoding"
+ }
+}
\ No newline at end of file
diff --git a/proteingym/merge.py b/proteingym/merge.py
new file mode 100644
index 0000000..2881ad5
--- /dev/null
+++ b/proteingym/merge.py
@@ -0,0 +1,119 @@
+import pandas as pd
+import numpy as np
+import os
+import json
+import argparse
+from tqdm import tqdm
+
+
+def standardization(x):
+ """Assumes input is numpy array or pandas series"""
+ return (x - x.mean()) / x.std()
+
+
+proteingym_folder_path = os.path.dirname(os.path.realpath(__file__))
+
+
+def main():
+ parser = argparse.ArgumentParser(description='ProteinGym score merging')
+ parser.add_argument('--DMS_assays_location', type=str, default='~/.cache/ProteinGym/ProteinGym/DMS_assays/substitutions',
+ help='Path to folder containing all model scores')
+ parser.add_argument('--model_scores_location', type=str, default='~/.cache/ProteinGym/model_scores/zero_shot_substitutions',
+ help='Path to folder containing all model scores')
+ parser.add_argument('--merged_scores_dir', type=str, default="merged_scores",
+ help='Name of folder where all merged scores should be stored (in model_scores_location)')
+ parser.add_argument('--mutation_type', default='substitutions',
+ type=str, help='Type of mutations (substitutions | indels)')
+ parser.add_argument('--dataset', default='DMS', type=str,
+ help='Dataset to merge (DMS | clinical)')
+ parser.add_argument('--DMS_reference_file', default=f'{os.path.dirname(proteingym_folder_path)}/reference_files/DMS_substitutions.csv',
+ type=str, help='Path to reference file containing DMS assay information')
+ parser.add_argument('--config_file', default=f'{os.path.dirname(proteingym_folder_path)}/config.json',
+ type=str, help='Path to config file containing model information')
+ args = parser.parse_args()
+
+ reference_file = pd.read_csv(args.DMS_reference_file)
+ list_DMS = reference_file["DMS_id"]
+
+ with open(args.config_file) as f:
+ config = json.load(f)
+ if args.dataset == "DMS":
+ reference_field = "model_list_zero_shot_substitutions_DMS" if args.mutation_type == "substitutions" else "model_list_zero_shot_indels_DMS"
+ elif args.dataset == "clinical":
+ reference_field = "model_list_zero_shot_substitutions_clinical" if args.mutation_type == "substitutions" else "model_list_zero_shot_indels_clinical"
+
+ list_models = config[reference_field].keys()
+ pbar = tqdm(enumerate(list_DMS), total=len(list_DMS))
+ for DMS_index, DMS_id in pbar:
+ target_seq = reference_file["target_seq"][reference_file["DMS_id"]
+ == DMS_id].values[0]
+ pbar.set_description("Processing DMS assay {}".format(DMS_id))
+ DMS_filename = reference_file["DMS_filename"][reference_file["DMS_id"]
+ == DMS_id].values[0]
+ full_DMS_path = os.path.join(args.DMS_assays_location, DMS_filename)
+ if os.path.exists(full_DMS_path):
+ DMS_file = pd.read_csv(os.path.join(
+ args.DMS_assays_location, DMS_filename))
+ else:
+ print("Could not find DMS file {}. Skipping.".format(full_DMS_path))
+ continue
+
+ if "mutated_sequence" not in DMS_file:
+ DMS_file["mutated_sequence"] = DMS_file["mutant"]
+
+ score_files = {}
+ all_model_scores = DMS_file
+ orig_DMS_length = len(all_model_scores)
+ for model in list_models:
+ mutant_merge_key = config[reference_field][model]["key"]
+ input_score_name = config[reference_field][model]["input_score_name"]
+ # Mutant merge key depends on the model for subs
+ DMS_mutant_column = mutant_merge_key if args.mutation_type == "substitutions" else "mutated_sequence"
+
+ score_files[model] = pd.read_csv(os.path.join(
+ args.model_scores_location, config[reference_field][model]["location"], DMS_id+".csv"))
+ if "sequence" in score_files[model]:
+ score_files[model]["mutated_sequence"] = score_files[model]["sequence"]
+ score_files[model][model] = config[reference_field][model]["directionality"] * \
+ score_files[model][input_score_name]
+ score_files[model] = score_files[model][[mutant_merge_key, model]]
+ score_files[model].drop_duplicates(inplace=True)
+ score_files[model] = score_files[model].groupby(
+ mutant_merge_key).mean().reset_index()
+ # check that score_files[model][mutant_merge_key] and all_model_scores[DMS_mutant_column] are the same
+ if set(score_files[model][mutant_merge_key]) & set(all_model_scores[DMS_mutant_column]) == set():
+ print("Warning: No overlap on mutants for {} with model {}. Skipping".format(DMS_id, model))
+ continue
+ elif set(score_files[model][mutant_merge_key]) < set(all_model_scores[DMS_mutant_column]):
+ # print difference between two key sets
+ print("WARNING: {} and {} do not have the same mutants. Skipping." \
+ .format(model, DMS_id))
+ continue
+
+ score_files[model] = score_files[model].rename(columns={mutant_merge_key: DMS_mutant_column})
+ all_model_scores = pd.merge(all_model_scores, score_files[model], on=DMS_mutant_column, how='left')
+
+ if len(all_model_scores) != orig_DMS_length:
+ print("WARNING: Merge on {} for {} changed length. mutant_merge_keys are likely different between them.".format(
+ model, DMS_id))
+ print("Length DMS: {}".format(orig_DMS_length))
+ print("Length {}: {}".format(model, len(score_files[model])))
+ print("Length all_model_score unique keys: {}".format(
+ len(all_model_scores[mutant_merge_key].unique())))
+ print("Length DMS unique keys: {}".format(
+ len(score_files[model][mutant_merge_key].unique())))
+ print("Length merged file: {}".format(len(all_model_scores)))
+ continue
+ num_mutants_expected = reference_file[reference_file["DMS_id"] == DMS_id]["DMS_total_number_mutants"].values[0]
+ if len(all_model_scores) != num_mutants_expected:
+ print(f"Warning: Insufficient mutants for {DMS_id}: {len(all_model_scores)}, expected {num_mutants_expected}. Original DMS file length: {orig_DMS_length}")
+ if not os.path.isdir(os.path.join(args.model_scores_location, args.merged_scores_dir)):
+ os.mkdir(os.path.join(
+ args.model_scores_location, args.merged_scores_dir))
+ all_model_scores.to_csv(os.path.join(
+ args.model_scores_location, args.merged_scores_dir, f"{DMS_id}.csv"), index=False)
+ print("Length merged file: {}".format(len(all_model_scores)))
+
+
+if __name__ == '__main__':
+ main()
diff --git a/proteingym/performance_DMS_benchmarks.py b/proteingym/performance_DMS_benchmarks.py
new file mode 100644
index 0000000..8c9ed71
--- /dev/null
+++ b/proteingym/performance_DMS_benchmarks.py
@@ -0,0 +1,406 @@
+import pandas as pd
+import numpy as np
+import os
+import argparse
+from scipy.stats import spearmanr
+from sklearn.metrics import roc_auc_score, matthews_corrcoef
+import warnings
+import json
+warnings.simplefilter(action='ignore', category=FutureWarning)
+
+def minmax(x):
+ return ( (x - np.min(x)) / (np.max(x) - np.min(x)) )
+
+def calc_ndcg(y_true, y_score, **kwargs):
+ '''
+ Inputs:
+ y_true: an array of the true scores where higher score is better
+ y_score: an array of the predicted scores where higher score is better
+ Options:
+ quantile: If True, uses the top k quantile of the distribution
+ top: under the quantile setting this is the top quantile to
+ keep in the gains calc. This is a PERCENTAGE (i.e input 10 for top 10%)
+ Notes:
+ Currently we're calculating NDCG on the continuous value of the DMS
+ I tried it on the binary value as well and the metrics seemed mostly
+ the same.
+ '''
+ if 'quantile' not in kwargs:
+ kwargs['quantile'] = True
+ if 'top' not in kwargs:
+ kwargs['top'] = 10
+ if kwargs['quantile']:
+ k = np.floor(y_true.shape[0]*(kwargs['top']/100)).astype(int)
+ else:
+ k = kwargs['top']
+ if isinstance(y_true, pd.Series):
+ y_true = y_true.values
+ if isinstance(y_score, pd.Series):
+ y_score = y_score.values
+ gains = minmax(y_true)
+ ranks = np.argsort(np.argsort(-y_score)) + 1
+
+ if k == 'all':
+ k = len(ranks)
+ #sub to top k
+ ranks_k = ranks[ranks <= k]
+ gains_k = gains[ranks <= k]
+ #all terms with a gain of 0 go to 0
+ ranks_fil = ranks_k[gains_k != 0]
+ gains_fil = gains_k[gains_k != 0]
+
+ #if none of the ranks made it return 0
+ if len(ranks_fil) == 0:
+ return (0)
+
+ #discounted cumulative gains
+ dcg = np.sum([g/np.log2(r+1) for r,g in zip(ranks_fil, gains_fil)])
+
+ #ideal dcg - calculated based on the top k actual gains
+ ideal_ranks = np.argsort(np.argsort(-gains)) + 1
+ ideal_ranks_k = ideal_ranks[ideal_ranks <= k]
+ ideal_gains_k = gains[ideal_ranks <= k]
+ ideal_ranks_fil = ideal_ranks_k[ideal_gains_k != 0]
+ ideal_gains_fil = ideal_gains_k[ideal_gains_k != 0]
+ idcg = np.sum([g/np.log2(r+1) for r,g in zip(ideal_ranks_fil, ideal_gains_fil)])
+
+ #normalize
+ ndcg = dcg/idcg
+
+ return (ndcg)
+def calc_toprecall(true_scores, model_scores, top_true=10, top_model=10):
+ top_true = (true_scores >= np.percentile(true_scores, 100-top_true))
+ top_model = (model_scores >= np.percentile(model_scores, 100-top_model))
+
+ TP = (top_true) & (top_model)
+ recall = TP.sum() / (top_true.sum()) if top_true.sum() > 0 else 0
+
+ return (recall)
+
+def standardization(x):
+ """Assumes input is numpy array or pandas series"""
+ return (x - x.mean()) / x.std()
+
+def compute_bootstrap_standard_error(df, number_assay_reshuffle=10000):
+ """
+ Computes the non-parametric bootstrap standard error for the mean estimate of a given performance metric (eg., Spearman, AUC) across DMS assays (ie., the sample standard deviation of the mean across bootstrap samples)
+ """
+ model_names = df.columns
+ mean_performance_across_samples = []
+ for sample in range(number_assay_reshuffle):
+ mean_performance_across_samples.append(df.sample(frac=1.0, replace=True).mean(axis=0)) #Resample a dataset of the same size (with replacement) then take the sample mean
+ mean_performance_across_samples=pd.DataFrame(data=mean_performance_across_samples,columns=model_names)
+ return mean_performance_across_samples.std(ddof=1)
+
+def compute_bootstrap_standard_error_functional_categories(df, number_assay_reshuffle=10000):
+ """
+ Computes the non-parametric bootstrap standard error for the mean estimate of a given performance metric (eg., Spearman, AUC) across DMS assays (ie., the sample standard deviation of the mean across bootstrap samples)
+ """
+ model_names = df.columns
+ mean_performance_across_samples = {}
+ for category, group in df.groupby("Selection Type"):
+ mean_performance_across_samples[category] = []
+ for sample in range(number_assay_reshuffle):
+ mean_performance_across_samples[category].append(group.sample(frac=1.0, replace=True).mean(axis=0)) #Resample a dataset of the same size (with replacement) then take the sample mean
+ mean_performance_across_samples[category]=pd.DataFrame(data=mean_performance_across_samples[category])
+ categories = list(mean_performance_across_samples.keys())
+ combined_averages = mean_performance_across_samples[categories[0]].copy()
+ for category in categories[1:]:
+ combined_averages += mean_performance_across_samples[category]
+ combined_averages /= len(categories)
+ return combined_averages.std(ddof=1)
+
+
+proteingym_folder_path = os.path.dirname(os.path.realpath(__file__))
+
+def main():
+ parser = argparse.ArgumentParser(description='ProteinGym performance analysis')
+ parser.add_argument('--input_scoring_files_folder', type=str, help='Name of folder where all input scores are present (expects one scoring file per DMS)')
+ parser.add_argument('--output_performance_file_folder', default='./outputs/tranception_performance', type=str, help='Name of folder where to save performance analysis files')
+ parser.add_argument('--DMS_reference_file_path', type=str, help='Reference file with list of DMSs to consider')
+ parser.add_argument('--DMS_data_folder', type=str, help='Path to folder that contains all DMS datasets')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+ parser.add_argument('--performance_by_depth', action='store_true', help='Whether to compute performance by mutation depth')
+ parser.add_argument('--config_file', default=f'{os.path.dirname(proteingym_folder_path)}/config.json', type=str, help='Path to config file containing model information')
+ args = parser.parse_args()
+
+ mapping_protein_seq_DMS = pd.read_csv(args.DMS_reference_file_path)
+ mapping_protein_seq_DMS["MSA_Neff_L_category"] = mapping_protein_seq_DMS["MSA_Neff_L_category"].apply(lambda x: x[0].upper() + x[1:] if type(x) == str else x)
+ num_DMS=len(mapping_protein_seq_DMS)
+ print("There are {} DMSs in mapping file".format(num_DMS))
+
+ with open(args.config_file) as f:
+ config = json.load(f)
+ with open(f"{os.path.dirname(os.path.realpath(__file__))}/constants.json") as f:
+ constants = json.load(f)
+ uniprot_function_lookup = mapping_protein_seq_DMS[["UniProt_ID","coarse_selection_type"]]
+ uniprot_function_lookup.columns = ["UniProt_ID", "Selection Type"]
+ uniprot_Neff_lookup = mapping_protein_seq_DMS[['UniProt_ID','MSA_Neff_L_category']].drop_duplicates()
+ uniprot_Neff_lookup.columns=['UniProt_ID','MSA_Neff_L_category']
+ uniprot_taxon_lookup = mapping_protein_seq_DMS[['UniProt_ID','taxon']].drop_duplicates()
+ uniprot_taxon_lookup.columns=['UniProt_ID','Taxon']
+ if args.indel_mode:
+ args.performance_by_depth = False
+
+ score_variables = list(config["model_list_zero_shot_substitutions_DMS"].keys()) if not args.indel_mode else list(config["model_list_zero_shot_indels_DMS"].keys())
+ if not os.path.isdir(args.output_performance_file_folder):
+ os.mkdir(args.output_performance_file_folder)
+ for metric in ['Spearman','AUC','MCC',"NDCG","Top_recall"]:
+ if not os.path.isdir(args.output_performance_file_folder+os.sep+metric):
+ os.mkdir(args.output_performance_file_folder+os.sep+metric)
+
+ model_types={}
+ for model in score_variables:
+ model_types[model]=config["model_list_zero_shot_substitutions_DMS"][model]["model_type"] if not args.indel_mode else config["model_list_zero_shot_indels_DMS"][model]["model_type"]
+ model_types=pd.DataFrame.from_dict(model_types,columns=['Model type'],orient='index')
+ model_details=pd.DataFrame.from_dict(constants["model_details"],columns=['Model details'],orient='index')
+ model_references=pd.DataFrame.from_dict(constants["model_references"],columns=['References'],orient='index')
+ clean_names = constants["clean_names"]
+ performance_all_DMS={}
+ output_filename={}
+ for metric in ['Spearman','AUC','MCC', "NDCG", "Top_recall"]:
+ performance_all_DMS[metric]={}
+ mutation_type = "substitutions" if not args.indel_mode else "indels"
+ output_filename[metric]="DMS_" + mutation_type + "_" + metric
+ for i, score in enumerate(score_variables):
+ performance_all_DMS[metric][score]=i
+ if not args.indel_mode and args.performance_by_depth:
+ for depth in ['1','2','3','4','5+']:
+ performance_all_DMS[metric][score+'_'+depth] = i
+ performance_all_DMS[metric]['number_mutants']=-1
+ performance_all_DMS[metric]["Selection Type"] = -1
+ performance_all_DMS[metric]["UniProt_ID"] = -1
+ performance_all_DMS[metric]['MSA_Neff_L_category']=-1
+ performance_all_DMS[metric]['Taxon']=-1
+ performance_all_DMS[metric]=pd.DataFrame.from_dict(performance_all_DMS[metric],orient='index').reset_index()
+ performance_all_DMS[metric].columns=['score','score_index']
+
+ list_DMS = mapping_protein_seq_DMS["DMS_id"]
+ i = 0
+ for DMS_id in list_DMS:
+ try:
+ print(DMS_id)
+ UniProt_ID = mapping_protein_seq_DMS["UniProt_ID"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ DMS_filename = mapping_protein_seq_DMS["DMS_filename"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ selection_type = mapping_protein_seq_DMS["coarse_selection_type"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ MSA_Neff_L_category = mapping_protein_seq_DMS["MSA_Neff_L_category"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+ Taxon = mapping_protein_seq_DMS["taxon"][mapping_protein_seq_DMS["DMS_id"]==DMS_id].values[0]
+
+ DMS_file = pd.read_csv(args.DMS_data_folder+os.sep+DMS_filename)
+ print("Length DMS: {}".format(len(DMS_file)))
+ merged_scores = pd.read_csv(args.input_scoring_files_folder + os.sep + DMS_id + ".csv") #We assume no missing value (all models were enforced to score all mutants)
+ if 'mutant' not in merged_scores: merged_scores['mutant'] = merged_scores['mutated_sequence'] #if mutant not in DMS file we default to mutated_sequence (eg., for indels)
+ except:
+ print(f"Scoring file for {DMS_id} missing")
+ continue
+
+ if not args.indel_mode and args.performance_by_depth:
+ merged_scores['mutation_depth']=merged_scores['mutant'].apply(lambda x: len(x.split(":")))
+ merged_scores['mutation_depth_grouped']=merged_scores['mutation_depth'].apply(lambda x: '5+' if x >=5 else str(x))
+ performance_DMS = {}
+ for metric in ['Spearman','AUC','MCC','NDCG','Top_recall']:
+ performance_DMS[metric]={}
+ for score in score_variables:
+ if score not in merged_scores:
+ print("Model scores for {} not in merged scores for DMS {}".format(score,DMS_id))
+ performance_DMS["Spearman"][score] = np.nan
+ performance_DMS["AUC"][score] = np.nan
+ performance_DMS["MCC"][score] = np.nan
+ performance_DMS["NDCG"][score] = np.nan
+ performance_DMS["Top_recall"][score] = np.nan
+ continue
+ performance_DMS['Spearman'][score] = spearmanr(merged_scores['DMS_score'], merged_scores[score])[0]
+ performance_DMS["NDCG"][score] = calc_ndcg(merged_scores['DMS_score'], merged_scores[score])
+ performance_DMS["Top_recall"][score] = calc_toprecall(merged_scores['DMS_score'], merged_scores[score])
+ try:
+ performance_DMS['AUC'][score] = roc_auc_score(y_true=merged_scores['DMS_score_bin'], y_score=merged_scores[score])
+ except:
+ print("AUC issue with: {} for model: {}".format(DMS_id,score))
+ performance_DMS['AUC'][score] = np.nan
+ try:
+ median_cutoff=merged_scores[score].median()
+ merged_scores[score+"_bin"]=merged_scores[score].map(lambda x: 1 if x >= median_cutoff else 0)
+ performance_DMS['MCC'][score] = matthews_corrcoef(y_true=merged_scores['DMS_score_bin'], y_pred=merged_scores[score+"_bin"])
+ except:
+ print("MCC issue with: {} for model: {}".format(DMS_id,score))
+ performance_DMS['MCC'][score] = np.nan
+
+ if not args.indel_mode and args.performance_by_depth:
+ for score in score_variables:
+ if score not in merged_scores:
+ print("Model scores for {} not in merged scores for DMS {}".format(score,DMS_id))
+ for depth in ['1','2','3','4','5+']:
+ performance_DMS["Spearman"][score+'_'+depth] = np.nan
+ performance_DMS["AUC"][score+'_'+depth] = np.nan
+ performance_DMS["MCC"][score+'_'+depth] = np.nan
+ performance_DMS["NDCG"][score+'_'+depth] = np.nan
+ performance_DMS["Top_recall"][score+'_'+depth] = np.nan
+ continue
+ for depth in ['1','2','3','4','5+']:
+ merged_scores_depth = merged_scores[merged_scores.mutation_depth_grouped==depth]
+ if len(merged_scores_depth) > 0:
+ performance_DMS['Spearman'][score+'_'+depth] = spearmanr(merged_scores_depth['DMS_score'], merged_scores_depth[score])[0]
+ performance_DMS["NDCG"][score+'_'+depth] = calc_ndcg(merged_scores_depth['DMS_score'], merged_scores_depth[score])
+ performance_DMS["Top_recall"][score+'_'+depth] = calc_toprecall(merged_scores_depth['DMS_score'], merged_scores_depth[score])
+ try:
+ performance_DMS['AUC'][score+'_'+depth] = roc_auc_score(y_true=merged_scores_depth['DMS_score_bin'], y_score=merged_scores_depth[score])
+ except:
+ performance_DMS['AUC'][score+'_'+depth] = np.nan
+ try:
+ performance_DMS['MCC'][score+'_'+depth] = matthews_corrcoef(y_true=merged_scores_depth['DMS_score_bin'], y_pred=merged_scores_depth[score+"_bin"])
+ except:
+ performance_DMS['MCC'][score+'_'+depth] = np.nan
+ else:
+ performance_DMS['Spearman'][score+'_'+depth] = np.nan
+ performance_DMS['AUC'][score+'_'+depth] = np.nan
+ performance_DMS['MCC'][score+'_'+depth] = np.nan
+ performance_DMS["NDCG"][score+'_'+depth] = np.nan
+ performance_DMS["Top_recall"][score+'_'+depth] = np.nan
+ print("Number of mutants: {}".format(len(merged_scores['DMS_score'].values)))
+ for metric in ['Spearman','AUC','MCC','NDCG','Top_recall']:
+ performance_DMS[metric]['number_mutants']=len(merged_scores['DMS_score'].values)
+ performance_DMS[metric]['UniProt_ID'] = UniProt_ID
+ performance_DMS[metric]["Selection Type"] = selection_type
+ performance_DMS[metric]['MSA_Neff_L_category'] = MSA_Neff_L_category
+ performance_DMS[metric]['Taxon'] = Taxon
+ performance_DMS[metric] = pd.DataFrame.from_dict(performance_DMS[metric],orient='index').reset_index()
+ performance_DMS[metric].columns=['score',DMS_id]
+ performance_all_DMS[metric]=pd.merge(performance_all_DMS[metric],performance_DMS[metric],on='score',how='left')
+ for metric in ['Spearman','AUC','MCC','NDCG','Top_recall']:
+ performance_all_DMS[metric]=performance_all_DMS[metric].set_index('score')
+ del performance_all_DMS[metric]['score_index']
+ performance_all_DMS[metric]=performance_all_DMS[metric].transpose()
+ for var in performance_all_DMS[metric]:
+ if var not in ['UniProt_ID','MSA_Neff_L_category','Taxon',"Selection Type"]:
+ performance_all_DMS[metric][var]=performance_all_DMS[metric][var].astype(float).round(3)
+ if var in ['number_mutants']:
+ performance_all_DMS[metric][var]=performance_all_DMS[metric][var].astype(int)
+ if not args.indel_mode and args.performance_by_depth:
+ all_columns = performance_all_DMS[metric].columns
+ performance_all_DMS_html=performance_all_DMS[metric].copy()
+ performance_all_DMS_html.columns=performance_all_DMS_html.columns.map(lambda x: clean_names[x] if x in clean_names else x)
+ all_not_depth_columns = all_columns[[all_columns[x].split("_")[-1] not in ['1','2','3','4','5+'] for x in range(len(all_columns))]]
+ all_not_depth_columns_clean = all_not_depth_columns.map(lambda x: clean_names[x] if x in clean_names else x)
+ performance_all_DMS_html[all_not_depth_columns_clean].to_html(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_DMS_level.html')
+ DMS_perf_to_save = performance_all_DMS[metric].copy()[all_not_depth_columns]
+ DMS_perf_to_save.columns = DMS_perf_to_save.columns.map(lambda x: clean_names[x] if x in clean_names else x)
+ DMS_perf_to_save.to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_DMS_level.csv', index_label="DMS ID")
+ else:
+ performance_all_DMS_html=performance_all_DMS[metric].copy()
+ performance_all_DMS_html.columns = performance_all_DMS_html.columns.map(lambda x: clean_names[x] if x in clean_names else x)
+ performance_all_DMS_html.to_html(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_DMS_level.html')
+ DMS_perf_to_save = performance_all_DMS[metric].copy()
+ DMS_perf_to_save.columns = DMS_perf_to_save.columns.map(lambda x: clean_names[x] if x in clean_names else x)
+ DMS_perf_to_save.to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_DMS_level.csv', index_label="DMS ID")
+
+ if not args.indel_mode:
+ uniprot_metric_performance = performance_all_DMS[metric].groupby(['UniProt_ID']).mean(numeric_only=True)
+ uniprot_function_metric_performance = performance_all_DMS[metric].groupby(['UniProt_ID',"Selection Type"]).mean(numeric_only=True)
+ uniprot_metric_performance = uniprot_metric_performance.reset_index()
+ uniprot_metric_performance = pd.merge(uniprot_metric_performance,uniprot_Neff_lookup,on='UniProt_ID', how='left')
+ uniprot_metric_performance = pd.merge(uniprot_metric_performance,uniprot_taxon_lookup,on='UniProt_ID', how='left')
+ uniprot_metric_performance = pd.merge(uniprot_metric_performance,uniprot_function_lookup,on="UniProt_ID",how="left")
+ del uniprot_metric_performance['number_mutants']
+ del uniprot_function_metric_performance["number_mutants"]
+ uniprot_level_average = uniprot_metric_performance.mean(numeric_only=True)
+ uniprot_function_level_average = uniprot_function_metric_performance.groupby("Selection Type").mean(numeric_only=True)
+ # bootstrap_standard_error = pd.DataFrame(compute_bootstrap_standard_error_functional_categories(uniprot_function_metric_performance.subtract(uniprot_function_metric_performance['TranceptEVE_L'],axis=0)),columns=["Bootstrap_standard_error_"+metric])
+ uniprot_function_level_average = uniprot_function_level_average.reset_index()
+ final_average = uniprot_function_level_average.mean(numeric_only=True)
+ if args.performance_by_depth:
+ cols = [column for column in all_not_depth_columns if column not in ["number_mutants","Taxon","MSA_Neff_L_category","Selection Type","UniProt_ID"]]
+ top_model = final_average.loc[cols].idxmax()
+ else:
+ top_model = final_average.idxmax()
+ bootstrap_standard_error = pd.DataFrame(compute_bootstrap_standard_error_functional_categories(uniprot_function_metric_performance.subtract(uniprot_function_metric_performance[top_model],axis=0)),columns=["Bootstrap_standard_error_"+metric])
+ uniprot_metric_performance.loc['Average'] = uniprot_level_average
+ uniprot_function_level_average.loc['Average'] = final_average
+ uniprot_metric_performance=uniprot_metric_performance.round(3)
+ uniprot_function_level_average=uniprot_function_level_average.round(3)
+ if args.performance_by_depth:
+ uniprot_metric_performance[[column for column in all_not_depth_columns if column != "number_mutants"]].to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_Uniprot_level.csv', index=False)
+ performance_by_depth = {}
+ all_not_depth_columns = [x for x in all_not_depth_columns if x not in ['number_mutants',"UniProt_ID","MSA_Neff_L_category","Taxon"]]
+ for depth in ['1','2','3','4','5+']:
+ depth_columns = all_columns[[all_columns[x].split("_")[-1]==depth for x in range(len(all_columns))]]
+ performance_by_depth[depth] = uniprot_function_metric_performance[depth_columns].mean(numeric_only=True).reset_index()
+ performance_by_depth[depth]['model_name'] = performance_by_depth[depth]['score'].map(lambda x: '_'.join(x.split('_')[:-1]))
+ performance_by_depth[depth]=performance_by_depth[depth][['model_name',0]]
+ performance_by_depth[depth].columns = ['model_name','Depth_'+depth]
+ performance_by_depth[depth].set_index('model_name', inplace=True)
+ uniprot_function_level_average = uniprot_function_level_average[all_not_depth_columns]
+ else:
+ uniprot_metric_performance.to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + '_Uniprot_level.csv', index=False)
+ uniprot_function_level_average.to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + output_filename[metric] + "_Uniprot_Selection_Type_level.csv",index=False)
+ if args.performance_by_depth:
+ performance_by_MSA_depth = performance_all_DMS[metric].groupby(["UniProt_ID","MSA_Neff_L_category"]).mean(numeric_only=True).groupby(["MSA_Neff_L_category"]).mean(numeric_only=True)[[col for col in all_not_depth_columns if col != "Selection Type"]].transpose()
+ else:
+ performance_by_MSA_depth = performance_all_DMS[metric].groupby(["UniProt_ID","MSA_Neff_L_category"]).mean(numeric_only=True).groupby(["MSA_Neff_L_category"]).mean(numeric_only=True).transpose()
+ performance_by_MSA_depth = performance_by_MSA_depth[['Low','Medium','High']]
+ performance_by_MSA_depth.columns = ['Low_MSA_depth','Medium_MSA_depth','High_MSA_depth']
+ if args.performance_by_depth:
+ performance_by_taxon = performance_all_DMS[metric].groupby(["UniProt_ID","Taxon"]).mean(numeric_only=True).groupby(["Taxon"]).mean(numeric_only=True)[[col for col in all_not_depth_columns if col != "Selection Type"]].transpose()
+ else:
+ performance_by_taxon = performance_all_DMS[metric].groupby(["UniProt_ID","Taxon"]).mean(numeric_only=True).groupby(["Taxon"]).mean(numeric_only=True).transpose()
+ performance_by_taxon = performance_by_taxon[['Human','Eukaryote','Prokaryote','Virus']]
+ performance_by_taxon.columns = ['Taxa_Human','Taxa_Other_Eukaryote','Taxa_Prokaryote','Taxa_Virus']
+ performance_by_function = uniprot_function_level_average.drop(labels="Average",axis=0).set_index("Selection Type").transpose()
+ performance_by_function.columns = ["Function_"+x for x in performance_by_function.columns]
+
+ summary_performance = pd.merge(pd.DataFrame(final_average,columns=['Average_'+metric]), performance_by_MSA_depth,left_index=True, right_index=True,how='inner')
+ summary_performance = pd.merge(summary_performance, performance_by_taxon,left_index=True, right_index=True,how='inner')
+ summary_performance = pd.merge(summary_performance, performance_by_function,left_index=True, right_index=True, how='inner')
+ if args.performance_by_depth:
+ for depth in ['1','2','3','4','5+']:
+ summary_performance = pd.merge(summary_performance, performance_by_depth[depth],left_index=True, right_index=True,how='inner')
+ final_column_order = ['Model_name','Model type','Average_'+metric,'Bootstrap_standard_error_'+metric,'Function_Activity','Function_Binding','Function_Expression','Function_OrganismalFitness','Function_Stability','Low_MSA_depth','Medium_MSA_depth','High_MSA_depth','Taxa_Human','Taxa_Other_Eukaryote','Taxa_Prokaryote','Taxa_Virus','Depth_1','Depth_2','Depth_3','Depth_4','Depth_5+','Model details','References']
+
+ else:
+ performance_all_DMS[metric].loc["Average"] = performance_all_DMS[metric].mean(numeric_only=True)
+ uniprot_metric_performance = performance_all_DMS[metric].groupby(['UniProt_ID']).mean(numeric_only=True)
+ uniprot_function_metric_performance = performance_all_DMS[metric].groupby(['UniProt_ID',"Selection Type"]).mean(numeric_only=True)
+ uniprot_metric_performance = pd.merge(uniprot_metric_performance,uniprot_function_lookup,on="UniProt_ID",how="left")
+ del uniprot_metric_performance['number_mutants']
+ uniprot_level_average = uniprot_metric_performance.mean(numeric_only=True)
+ del uniprot_function_metric_performance["number_mutants"]
+ uniprot_function_level_average = uniprot_function_metric_performance.groupby("Selection Type").mean(numeric_only=True)
+ # bootstrap_standard_error = pd.DataFrame(compute_bootstrap_standard_error_functional_categories(uniprot_function_metric_performance.subtract(uniprot_function_metric_performance['TranceptEVE_M'],axis=0)),columns=["Bootstrap_standard_error_"+metric])
+ uniprot_function_level_average = uniprot_function_level_average.reset_index()
+ final_average = uniprot_function_level_average.mean(numeric_only=True)
+ top_model = final_average.idxmax()
+ bootstrap_standard_error = pd.DataFrame(compute_bootstrap_standard_error_functional_categories(uniprot_function_metric_performance.subtract(uniprot_function_metric_performance[top_model],axis=0)),columns=["Bootstrap_standard_error_"+metric])
+ uniprot_metric_performance.loc['Average'] = uniprot_level_average
+ uniprot_function_level_average.loc['Average'] = final_average
+ uniprot_metric_performance=uniprot_metric_performance.round(3)
+ uniprot_function_level_average=uniprot_function_level_average.round(3)
+
+ performance_by_MSA_depth = performance_all_DMS[metric].groupby(["UniProt_ID","MSA_Neff_L_category"]).mean(numeric_only=True).groupby(["MSA_Neff_L_category"]).mean(numeric_only=True).transpose()
+ performance_by_MSA_depth = performance_by_MSA_depth[['Low','Medium','High']]
+ performance_by_MSA_depth.columns = ['Low_MSA_depth','Medium_MSA_depth','High_MSA_depth']
+ performance_by_taxon = performance_all_DMS[metric].groupby(["UniProt_ID","Taxon"]).mean(numeric_only=True).groupby(["Taxon"]).mean(numeric_only=True).transpose()
+ performance_by_taxon = performance_by_taxon[['Human','Eukaryote','Prokaryote','Virus']]
+ performance_by_taxon.columns = ['Taxa_Human','Taxa_Other_Eukaryote','Taxa_Prokaryote','Taxa_Virus']
+ performance_by_function = uniprot_function_level_average.drop(labels="Average",axis=0).set_index("Selection Type").transpose()
+ performance_by_function.columns = ["Function_"+x for x in performance_by_function.columns]
+
+ summary_performance = pd.merge(pd.DataFrame(final_average,columns=['Average_'+metric]), performance_by_MSA_depth,left_index=True,right_index=True,how='inner')
+ summary_performance = pd.merge(summary_performance, performance_by_taxon,left_index=True, right_index=True,how='inner')
+ summary_performance = pd.merge(summary_performance, performance_by_function,left_index=True, right_index=True, how='inner')
+ final_column_order = ['Model_name','Model type','Average_'+metric,'Bootstrap_standard_error_'+metric,'Function_Activity','Function_Binding','Function_Expression','Function_OrganismalFitness','Function_Stability','Low_MSA_depth','Medium_MSA_depth','High_MSA_depth','Taxa_Human','Taxa_Other_Eukaryote','Taxa_Prokaryote','Taxa_Virus','Model details','References']
+ summary_performance.sort_values(by='Average_'+metric,ascending=False,inplace=True)
+ summary_performance.index.name = 'Model_name'
+ summary_performance.reset_index(inplace=True)
+ summary_performance.index = range(1,len(summary_performance)+1)
+ summary_performance.index.name = 'Model_rank'
+ summary_performance = pd.merge(summary_performance, bootstrap_standard_error, left_on='Model_name', right_index=True, how='left')
+ summary_performance = pd.merge(summary_performance, model_types, left_on='Model_name', right_index=True, how='left')
+ summary_performance = pd.merge(summary_performance, model_details, left_on='Model_name', right_index=True, how='left')
+ summary_performance = pd.merge(summary_performance, model_references, left_on='Model_name', right_index=True, how='left')
+ summary_performance=summary_performance.round(3)
+ summary_performance['Model_name']=summary_performance['Model_name'].map(lambda x: clean_names[x] if x in clean_names else x)
+ summary_performance=summary_performance.reindex(columns=final_column_order)
+ summary_performance.to_csv(args.output_performance_file_folder + os.sep + metric + os.sep + 'Summary_performance_'+output_filename[metric]+'.csv')
+ summary_performance.to_html(args.output_performance_file_folder + os.sep + metric + os.sep + 'Summary_performance_'+output_filename[metric]+'.html')
+
+if __name__ == '__main__':
+ main()
\ No newline at end of file
diff --git a/proteingym/performance_DMS_supervised_benchmarks.py b/proteingym/performance_DMS_supervised_benchmarks.py
new file mode 100644
index 0000000..c4621eb
--- /dev/null
+++ b/proteingym/performance_DMS_supervised_benchmarks.py
@@ -0,0 +1,171 @@
+import pandas as pd
+import os
+import argparse
+from tqdm import tqdm
+import json
+
+"""
+This is the script used to compute statistics for the supervised scoring models.
+It uses the score files output from the runs done for the ProteinNPT paper, and the
+code to run those supervised models is available in the ProteinNPT repo
+"""
+
+
+def compute_bootstrap_standard_error_functional_categories(df, number_assay_reshuffle=10000, top_model="ProteinNPT"):
+ """
+ Computes the non-parametric bootstrap standard error for the mean estimate of a given performance metric (eg., Spearman, AUC) across DMS assays (ie., the sample standard deviation of the mean across bootstrap samples)
+ """
+ model_errors = {}
+ for model_name, group in tqdm(df.groupby("model_name")):
+ group_centered = group.subtract(df.loc[top_model],axis=0)
+ mean_performance_across_samples = {}
+ for category, group2 in group_centered.groupby("coarse_selection_type"):
+ mean_performance_across_samples[category] = []
+ for sample in range(number_assay_reshuffle):
+ mean_performance_across_samples[category].append(group2.sample(frac=1.0, replace=True).mean(axis=0)) #Resample a dataset of the same size (with replacement) then take the sample mean
+ mean_performance_across_samples[category]=pd.DataFrame(data=mean_performance_across_samples[category])
+ categories = list(mean_performance_across_samples.keys())
+ combined_averages = mean_performance_across_samples[categories[0]].copy()
+ for category in categories[1:]:
+ combined_averages += mean_performance_across_samples[category]
+ combined_averages /= len(categories)
+ model_errors[model_name] = combined_averages.std(ddof=1)
+ return pd.DataFrame(model_errors).transpose()
+
+if __name__ == "__main__":
+ parser = argparse.ArgumentParser(description='ProteinGym supervised stats script')
+ parser.add_argument('--input_scoring_file', type=str, help='Name of the file where all input scores are present (expects one scoring file per DMS)')
+ parser.add_argument('--output_performance_file_folder', default='./outputs/tranception_performance', type=str, help='Name of folder where to save performance analysis files')
+ parser.add_argument('--DMS_reference_file_path', type=str, help='Reference file with list of DMSs to consider')
+ parser.add_argument('--indel_mode', action='store_true', help='Whether to score sequences with insertions and deletions')
+
+ args = parser.parse_args()
+ metrics = ["Spearman","MSE"]
+ score_column = {"Spearman":"Spearman_fitness","MSE":"loss_fitness"}
+ with open(f"{os.path.dirname(os.path.realpath(__file__))}/constants.json") as f:
+ constants = json.load(f)
+ if not os.path.exists(args.output_performance_file_folder):
+ os.makedirs(args.output_performance_file_folder)
+
+ ref_df = pd.read_csv(args.DMS_reference_file_path)
+ ref_df["MSA_Neff_L_category"] = ref_df["MSA_Neff_L_category"].apply(lambda x: x[0].upper() + x[1:] if type(x) == str else x)
+ score_df = pd.read_csv(args.input_scoring_file)
+ old_ids = ref_df["Old_DMS_ID"].unique()
+ score_df["assay_id"] = score_df["assay_id"].apply(lambda x: ref_df["DMS_id"][ref_df["Old_DMS_ID"]==x].values[0] if x in old_ids else x)
+ score_df = score_df.merge(ref_df[["DMS_id","MSA_Neff_L_category","coarse_selection_type","taxon"]],left_on="assay_id",right_on="DMS_id",how="left")
+ score_df = score_df[["assay_id","model_name", "DMS_id","UniProt_id","MSA_Neff_L_category","coarse_selection_type","taxon","fold_variable_name","Spearman_fitness","loss_fitness"]]
+ if args.indel_mode:
+ cv_schemes = ["fold_random_5"]
+ else:
+ cv_schemes = ["fold_random_5","fold_modulo_5","fold_contiguous_5"]
+ for metric in metrics:
+ if not os.path.exists(os.path.join(args.output_performance_file_folder,f"{metric}")):
+ os.makedirs(os.path.join(args.output_performance_file_folder,f"{metric}"))
+ output_folder = os.path.join(args.output_performance_file_folder,f"{metric}")
+ all_DMS_perf = None
+ all_DMS_cv_schemes_perf = {cv_scheme:None for cv_scheme in cv_schemes}
+ for DMS_id in tqdm(ref_df["DMS_id"].unique()):
+ performance_all_DMS = {}
+ performance_all_DMS_cv_scheme = {cv_scheme:{} for cv_scheme in cv_schemes}
+ score_subset = score_df[score_df['DMS_id']==DMS_id]
+ models = score_subset["model_name"].unique()
+ for model in models:
+ performance_all_DMS[model] = 0.0
+ for cv_scheme in cv_schemes:
+ cv_subset = score_subset[score_subset["fold_variable_name"]==cv_scheme]
+ for model in models:
+ performance_all_DMS[model] += cv_subset[score_column[metric]][cv_subset["model_name"]==model].values[0]/len(cv_schemes)
+ performance_all_DMS_cv_scheme[cv_scheme][model] = cv_subset[score_column[metric]][cv_subset["model_name"]==model].values[0]
+ performance_all_DMS = pd.DataFrame.from_dict(performance_all_DMS,orient="index").reset_index(names="model_names")
+ performance_all_DMS.columns = ["model_names",DMS_id]
+ performance_all_DMS_cv_scheme = {cv_scheme:pd.DataFrame.from_dict(performance_all_DMS_cv_scheme[cv_scheme],orient="index").reset_index(names="model_names") for cv_scheme in cv_schemes}
+ for cv_scheme in cv_schemes:
+ performance_all_DMS_cv_scheme[cv_scheme].columns = ["model_names",DMS_id]
+ if all_DMS_perf is None:
+ all_DMS_perf = performance_all_DMS
+ all_DMS_cv_schemes_perf = {cv_scheme:performance_all_DMS_cv_scheme[cv_scheme] for cv_scheme in cv_schemes}
+ else:
+ all_DMS_perf = all_DMS_perf.merge(performance_all_DMS,on="model_names",how="inner")
+ all_DMS_cv_schemes_perf = {cv_scheme:all_DMS_cv_schemes_perf[cv_scheme].merge(performance_all_DMS_cv_scheme[cv_scheme],on="model_names",how="inner") for cv_scheme in cv_schemes}
+ all_DMS_perf = all_DMS_perf.set_index("model_names").transpose().reset_index(names="DMS_id")
+ all_DMS_perf.columns = [constants["supervised_clean_names"][x] if x in constants["supervised_clean_names"] else x for x in all_DMS_perf.columns]
+ if args.indel_mode:
+ all_DMS_perf.round(3).to_csv(os.path.join(output_folder,f"DMS_indels_{metric}_DMS_level.csv"),index=False)
+ else:
+ all_DMS_perf.round(3).to_csv(os.path.join(output_folder,f"DMS_substitutions_{metric}_DMS_level.csv"),index=False)
+ for cv_scheme in cv_schemes:
+ all_DMS_cv_schemes_perf[cv_scheme] = all_DMS_cv_schemes_perf[cv_scheme].set_index("model_names").transpose().reset_index(names="DMS_id")
+ all_DMS_cv_schemes_perf[cv_scheme].columns = [constants["supervised_clean_names"][x] if x in constants["supervised_clean_names"] else x for x in all_DMS_cv_schemes_perf[cv_scheme].columns]
+ if args.indel_mode:
+ all_DMS_cv_schemes_perf[cv_scheme].round(3).to_csv(os.path.join(output_folder,f"DMS_indels_{metric}_DMS_level_{cv_scheme}.csv"),index=False)
+ else:
+ all_DMS_cv_schemes_perf[cv_scheme].round(3).to_csv(os.path.join(output_folder,f"DMS_substitutions_{metric}_DMS_level_{cv_scheme}.csv"),index=False)
+
+ def pivot_model_df(df, value_column, score_column):
+ df = df[["model_name",value_column,score_column]]
+ df = df.pivot(index="model_name",columns=value_column,values=score_column)
+ return df
+
+ # computing function groupings within CV schemes, then averaging them
+ all_summary_performance = None
+ for cv_scheme in cv_schemes:
+ cv_subset = score_df[score_df["fold_variable_name"] == cv_scheme]
+ if len(cv_subset) == 0:
+ raise ValueError("No scores found for cross-validation scheme {}".format(cv_scheme))
+ cv_uniprot_function = cv_subset.groupby(["model_name","UniProt_id","coarse_selection_type"]).mean()
+ if args.indel_mode:
+ top_model = "Embeddings - Augmented - EMS1v"
+ else:
+ top_model = "ProteinNPT"
+ bootstrap_standard_error = compute_bootstrap_standard_error_functional_categories(cv_uniprot_function,top_model=top_model,number_assay_reshuffle=10000)
+ bootstrap_standard_error = bootstrap_standard_error[score_column[metric]].reset_index()
+ bootstrap_standard_error.columns = ["model_name",f"Bootstrap_standard_error_{metric}"]
+ cv_function_average = cv_uniprot_function.groupby(["model_name","coarse_selection_type"]).mean()
+ cv_final_average = cv_function_average.groupby("model_name").mean()
+ performance_by_MSA_depth = cv_subset.groupby(["model_name","UniProt_id","MSA_Neff_L_category"]).mean().groupby(["model_name","MSA_Neff_L_category"]).mean()
+ performance_by_taxon = cv_subset.groupby(["model_name","UniProt_id","taxon"]).mean().groupby(["model_name","taxon"]).mean()
+ performance_by_MSA_depth = pivot_model_df(performance_by_MSA_depth.reset_index(),"MSA_Neff_L_category",score_column[metric])
+ performance_by_MSA_depth.columns = ['Low_MSA_depth','Medium_MSA_depth','High_MSA_depth']
+ performance_by_taxon = pivot_model_df(performance_by_taxon.reset_index(),"taxon",score_column[metric])
+ performance_by_taxon.columns = ['Taxa_Human','Taxa_Other_Eukaryote','Taxa_Prokaryote','Taxa_Virus']
+ cv_function_average = pivot_model_df(cv_function_average.reset_index(),"coarse_selection_type",score_column[metric])
+ cv_function_average.columns = ["Function_"+x for x in cv_function_average.columns]
+ cv_final_average = cv_final_average.reset_index()[["model_name",score_column[metric]]].copy()
+ cv_final_average.columns = ["model_name",f"Average_{metric}"]
+ summary_performance = pd.merge(cv_final_average,performance_by_MSA_depth,on="model_name",how="inner")
+ summary_performance = pd.merge(summary_performance,performance_by_taxon,on="model_name",how="inner")
+ summary_performance = pd.merge(summary_performance,cv_function_average,on="model_name",how="inner")
+ summary_performance = pd.merge(summary_performance,bootstrap_standard_error,on="model_name",how="inner")
+ if all_summary_performance is None:
+ all_summary_performance = summary_performance.set_index("model_name")/len(cv_schemes)
+ all_summary_performance[f"Average_{metric}_{cv_scheme}"] = all_summary_performance[f"Average_{metric}"]*len(cv_schemes)
+ else:
+ ignore_columns = [f"Average_{metric}_{cv_approach}" for cv_approach in cv_schemes]
+ all_summary_performance[[column for column in all_summary_performance.columns if column not in ignore_columns]] += summary_performance.set_index("model_name")/len(cv_schemes)
+ all_summary_performance[f"Average_{metric}_{cv_scheme}"] = summary_performance[f"Average_{metric}"].values
+ all_summary_performance = all_summary_performance.reset_index(names="Model_name")
+ if metric == "MSE":
+ ascending = True
+ else:
+ ascending = False
+ all_summary_performance.sort_values(by=f"Average_{metric}",ascending=ascending,inplace=True)
+ all_summary_performance.index = range(1,len(all_summary_performance)+1)
+ all_summary_performance.index.name = 'Model_rank'
+ all_summary_performance = all_summary_performance.round(3)
+ all_summary_performance["Model_name"] = all_summary_performance["Model_name"].apply(lambda x: constants["supervised_clean_names"][x] if x in constants["supervised_clean_names"] else x)
+ all_summary_performance["References"] = all_summary_performance["Model_name"].apply(lambda x: constants["supervised_model_references"][x] if x in constants["supervised_model_references"] else "")
+ all_summary_performance["Model details"] = all_summary_performance["Model_name"].apply(lambda x: constants["supervised_model_details"][x] if x in constants["supervised_model_details"] else "")
+ all_summary_performance["Model type"] = all_summary_performance["Model_name"].apply(lambda x: constants["supervised_model_types"][x] if x in constants["supervised_model_types"] else "")
+ if args.indel_mode:
+ all_summary_performance["Function_Binding"] = "N/A"
+ column_order = ["Model_name","Model type",f"Average_{metric}",f"Bootstrap_standard_error_{metric}",f"Average_{metric}_fold_random_5","Function_Activity","Function_Binding","Function_Expression","Function_OrganismalFitness","Function_Stability","Low_MSA_depth","Medium_MSA_depth","High_MSA_depth","Taxa_Human","Taxa_Other_Eukaryote","Taxa_Prokaryote","Taxa_Virus","References","Model details"]
+ else:
+ column_order = ["Model_name","Model type",f"Average_{metric}",f"Bootstrap_standard_error_{metric}",f"Average_{metric}_fold_random_5",f"Average_{metric}_fold_modulo_5",f"Average_{metric}_fold_contiguous_5","Function_Activity","Function_Binding","Function_Expression","Function_OrganismalFitness","Function_Stability","Low_MSA_depth","Medium_MSA_depth","High_MSA_depth","Taxa_Human","Taxa_Other_Eukaryote","Taxa_Prokaryote","Taxa_Virus","References","Model details"]
+ all_summary_performance = all_summary_performance[column_order]
+ if args.indel_mode:
+ all_summary_performance.to_csv(os.path.join(output_folder,f"Summary_performance_DMS_indels_{metric}.csv"))
+ else:
+ all_summary_performance.to_csv(os.path.join(output_folder,f"Summary_performance_DMS_substitutions_{metric}.csv"))
+
+
+
\ No newline at end of file
diff --git a/proteingym/utils/__init__.py b/proteingym/utils/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/proteingym/utils/data_utils.py b/proteingym/utils/data_utils.py
new file mode 100644
index 0000000..fdfd0c0
--- /dev/null
+++ b/proteingym/utils/data_utils.py
@@ -0,0 +1,30 @@
+import pandas as pd
+import numpy as np
+from utils.scoring_utils import get_mutated_sequence
+
+def DMS_file_cleanup(DMS_filename, target_seq, start_idx=1, end_idx=None, DMS_mutant_column='mutant', DMS_phenotype_name='score', DMS_directionality=1, AA_vocab = "ACDEFGHIKLMNPQRSTVWY"):
+ """
+ Borrowed from the Tranception codebase: https://github.com/OATML-Markslab/Tranception/blob/main/tranception/utils/dms_utils.py
+ Function to process the raw substitution DMS assay data (eg., removing invalid mutants, aggregate silent mutations).
+ """
+ DMS_data = pd.read_csv(DMS_filename, low_memory=False)
+ end_idx = start_idx + len(target_seq) - 1 if end_idx is None else end_idx
+ DMS_data['mutant'] = DMS_data[DMS_mutant_column]
+
+ DMS_data=DMS_data[DMS_data['mutant'].notnull()].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([len(y)>=3 for y in x.split(":")]))].copy() #Mutant triplets should have at least 3 or more characters
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([(y[0] in AA_vocab) and (y[1:-1].isnumeric()) and (y[-1] in AA_vocab) for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([int(y[1:-1])-start_idx >=0 and int(y[1:-1]) <= end_idx for y in x.split(":")]))].copy()
+ DMS_data=DMS_data[DMS_data['mutant'].apply(lambda x: all([y[0]==target_seq[int(y[1:-1])-start_idx] for y in x.split(":")]))].copy()
+
+ DMS_data[DMS_phenotype_name]=pd.to_numeric(DMS_data[DMS_phenotype_name],errors='coerce')
+ DMS_data=DMS_data[np.isfinite(DMS_data[DMS_phenotype_name])]
+ DMS_data.dropna(subset = [DMS_phenotype_name], inplace=True)
+ DMS_data['DMS_score'] = DMS_data[DMS_phenotype_name] * DMS_directionality
+ DMS_data=DMS_data[['mutant','DMS_score']]
+ DMS_data=DMS_data.groupby('mutant').mean().reset_index()
+
+ DMS_data['mutated_sequence'] = DMS_data['mutant'].apply(lambda x: get_mutated_sequence(target_seq, x))
+ DMS_data=DMS_data[['mutant','mutated_sequence','DMS_score']]
+
+ return DMS_data
diff --git a/proteingym/utils/msa_utils.py b/proteingym/utils/msa_utils.py
new file mode 100644
index 0000000..4116f2b
--- /dev/null
+++ b/proteingym/utils/msa_utils.py
@@ -0,0 +1,282 @@
+import os
+import random
+from collections import defaultdict
+import numpy as np
+import pandas as pd
+import tempfile
+from tqdm import tqdm
+from numba import njit, prange
+from Bio import SeqIO
+from Bio.SeqRecord import SeqRecord
+from Bio.Seq import Seq
+import subprocess
+
+from utils.weights import map_from_alphabet, map_matrix, calc_weights_fast
+
+# constants
+GAP = "-"
+MATCH_GAP = GAP
+INSERT_GAP = "."
+
+ALPHABET_PROTEIN_NOGAP = "ACDEFGHIKLMNPQRSTVWY"
+ALPHABET_PROTEIN_GAP = GAP + ALPHABET_PROTEIN_NOGAP
+
+class MSA_processing:
+ def __init__(self,
+ MSA_location="",
+ theta=0.2,
+ use_weights=True,
+ weights_location="./data/weights",
+ preprocess_MSA=True,
+ threshold_sequence_frac_gaps=0.5,
+ threshold_focus_cols_frac_gaps=1.0,
+ remove_sequences_with_indeterminate_AA_in_focus_cols=True,
+ weights_calc_method="eve",
+ num_cpus=1,
+ skip_one_hot_encodings=False,
+ ):
+
+ """
+ This class was borrowed from our EVE codebase: https://github.com/OATML-Markslab/EVE
+ Parameters:
+ - msa_location: (path) Location of the MSA data. Constraints on input MSA format:
+ - focus_sequence is the first one in the MSA data
+ - first line is structured as follows: ">focus_seq_name/start_pos-end_pos" (e.g., >SPIKE_SARS2/310-550)
+ - corespondding sequence data located on following line(s)
+ - then all other sequences follow with ">name" on first line, corresponding data on subsequent lines
+ - theta: (float) Sequence weighting hyperparameter. Generally: Prokaryotic and eukaryotic families = 0.2; Viruses = 0.01
+ - use_weights: (bool) If False, sets all sequence weights to 1. If True, checks weights_location -- if non empty uses that;
+ otherwise compute weights from scratch and store them at weights_location
+ - weights_location: (path) Location to load from/save to the sequence weights
+ - preprocess_MSA: (bool) performs pre-processing of MSA to remove short fragments and positions that are not well covered.
+ - threshold_sequence_frac_gaps: (float, between 0 and 1) Threshold value to define fragments
+ - sequences with a fraction of gap characters above threshold_sequence_frac_gaps are removed
+ - default is set to 0.5 (i.e., fragments with 50% or more gaps are removed)
+ - threshold_focus_cols_frac_gaps: (float, between 0 and 1) Threshold value to define focus columns
+ - positions with a fraction of gap characters above threshold_focus_cols_pct_gaps will be set to lower case (and not included in the focus_cols)
+ - default is set to 0.3 (i.e., focus positions are the ones with 30% of gaps or less, i.e., 70% or more residue occupancy)
+ - remove_sequences_with_indeterminate_AA_in_focus_cols: (bool) Remove all sequences that have indeterminate AA (e.g., B, J, X, Z) at focus positions of the wild type
+ - weights_calc_method: (str) Method to use for calculating sequence weights. Options: "eve" or "identity". (default "eve")
+ - num_cpus: (int) Number of CPUs to use for parallel weights calculation processing. If set to -1, all available CPUs are used. If set to 1, weights are computed in serial.
+ - skip_one_hot_encodings: (bool) If True, only use this class to calculate weights. Skip the one-hot encodings (which can be very memory/compute intensive)
+ and don't calculate all singles.
+ """
+ np.random.seed(2021)
+ self.MSA_location = MSA_location
+ self.weights_location = weights_location
+ self.theta = theta
+ self.alphabet = ALPHABET_PROTEIN_NOGAP
+ self.use_weights = use_weights
+ self.preprocess_MSA = preprocess_MSA
+ self.threshold_sequence_frac_gaps = threshold_sequence_frac_gaps
+ self.threshold_focus_cols_frac_gaps = threshold_focus_cols_frac_gaps
+ self.remove_sequences_with_indeterminate_AA_in_focus_cols = remove_sequences_with_indeterminate_AA_in_focus_cols
+ self.weights_calc_method = weights_calc_method
+ self.skip_one_hot_encodings = skip_one_hot_encodings
+
+ # Defined by gen_alignment
+ self.aa_dict = {}
+ self.focus_seq_name = ""
+ self.seq_name_to_sequence = defaultdict(str)
+ self.focus_seq, self.focus_cols, self.focus_seq_trimmed, self.seq_len, self.alphabet_size = [None] * 5
+ self.focus_start_loc, self.focus_stop_loc = None, None
+ self.uniprot_focus_col_to_wt_aa_dict, self.uniprot_focus_col_to_focus_idx = None, None
+ self.one_hot_encoding, self.weights, self.Neff, self.num_sequences = [None] * 4
+
+ # Fill in the instance variables
+ self.gen_alignment()
+
+ # Note: One-hot encodings might take up huge amounts of memory, and this could be skipped in many use cases
+ if not self.skip_one_hot_encodings:
+ #print("One-hot encoding sequences")
+ self.one_hot_encoding = one_hot_3D(
+ seq_keys=self.seq_name_to_sequence.keys(), # Note: Dicts are unordered for python < 3.6
+ seq_name_to_sequence=self.seq_name_to_sequence,
+ alphabet=self.alphabet,
+ seq_length=self.seq_len,
+ )
+ print ("Data Shape =", self.one_hot_encoding.shape)
+
+ self.calc_weights(num_cpus=num_cpus, method=weights_calc_method)
+
+ def gen_alignment(self):
+ """ Read training alignment and store basics in class instance """
+ self.aa_dict = {}
+ for i, aa in enumerate(self.alphabet):
+ self.aa_dict[aa] = i
+
+ self.seq_name_to_sequence = defaultdict(str)
+ name = ""
+ with open(self.MSA_location, "r") as msa_data:
+ for i, line in enumerate(msa_data):
+ line = line.rstrip()
+ if line.startswith(">"):
+ name = line
+ if i == 0:
+ self.focus_seq_name = name
+ else:
+ self.seq_name_to_sequence[name] += line
+ print("Number of sequences in MSA (before preprocessing):", len(self.seq_name_to_sequence))
+
+ ## MSA pre-processing to remove inadequate columns and sequences
+ if self.preprocess_MSA:
+ # Overwrite self.seq_name_to_sequence
+ self.seq_name_to_sequence = self.preprocess_msa(
+ seq_name_to_sequence=self.seq_name_to_sequence,
+ focus_seq_name=self.focus_seq_name,
+ threshold_sequence_frac_gaps=self.threshold_sequence_frac_gaps,
+ threshold_focus_cols_frac_gaps=self.threshold_focus_cols_frac_gaps
+ )
+
+ self.focus_seq = self.seq_name_to_sequence[self.focus_seq_name]
+ self.focus_cols = [ix for ix, s in enumerate(self.focus_seq) if s == s.upper() and s != '-']
+ self.focus_seq_trimmed = "".join([self.focus_seq[ix] for ix in self.focus_cols])
+ self.seq_len = len(self.focus_cols)
+ self.alphabet_size = len(self.alphabet)
+
+ # Move all letters to CAPS; keeps focus columns only
+ self.raw_seq_name_to_sequence = self.seq_name_to_sequence.copy()
+ for seq_name, sequence in self.seq_name_to_sequence.items():
+ sequence = sequence.replace(".", "-")
+ self.seq_name_to_sequence[seq_name] = "".join(
+ [sequence[ix].upper() for ix in self.focus_cols]) # Makes a List[str] instead of str
+
+ # Remove sequences that have indeterminate AA (e.g., B, J, X, Z) in the focus columns
+ if self.remove_sequences_with_indeterminate_AA_in_focus_cols:
+ num_sequences_removed_due_to_indeterminate_AAs = 0
+ num_sequences_before_indeterminate_AA_drop = len(self.seq_name_to_sequence)
+ alphabet_set = set(list(self.alphabet))
+ seq_names_to_remove = []
+ for seq_name, sequence in self.seq_name_to_sequence.items():
+ for letter in sequence:
+ if letter not in alphabet_set and letter != "-":
+ seq_names_to_remove.append(seq_name)
+ continue
+ seq_names_to_remove = list(set(seq_names_to_remove))
+ for seq_name in seq_names_to_remove:
+ num_sequences_removed_due_to_indeterminate_AAs+=1
+ del self.seq_name_to_sequence[seq_name]
+ print("Proportion of sequences dropped due to indeterminate AAs: {}%".format(round(float(num_sequences_removed_due_to_indeterminate_AAs/num_sequences_before_indeterminate_AA_drop*100),2)))
+
+ print("Number of sequences after preprocessing:", len(self.seq_name_to_sequence))
+ self.num_sequences = len(self.seq_name_to_sequence.keys())
+
+ # Using staticmethod to keep this under the MSAProcessing namespace, but this is apparently not best practice
+ @staticmethod
+ def preprocess_msa(seq_name_to_sequence, focus_seq_name, threshold_sequence_frac_gaps, threshold_focus_cols_frac_gaps):
+ """Remove inadequate columns and sequences from MSA, overwrite self.seq_name_to_sequence."""
+ msa_df = pd.DataFrame.from_dict(seq_name_to_sequence, orient='index', columns=['sequence'])
+ # Data clean up
+ msa_df.sequence = msa_df.sequence.apply(lambda x: x.replace(".", "-")).apply(
+ lambda x: ''.join([aa.upper() for aa in x]))
+ # Remove columns that would be gaps in the wild type
+ non_gap_wt_cols = [aa != '-' for aa in msa_df.sequence[focus_seq_name]]
+ msa_df['sequence'] = msa_df['sequence'].apply(
+ lambda x: ''.join([aa for aa, non_gap_ind in zip(x, non_gap_wt_cols) if non_gap_ind]))
+ assert 0.0 <= threshold_sequence_frac_gaps <= 1.0, "Invalid fragment filtering parameter"
+ assert 0.0 <= threshold_focus_cols_frac_gaps <= 1.0, "Invalid focus position filtering parameter"
+ print("Calculating proportion of gaps")
+ msa_array = np.array([list(seq) for seq in msa_df.sequence])
+ gaps_array = np.array(list(map(lambda seq: [aa == '-' for aa in seq], msa_array)))
+ # Identify fragments with too many gaps
+ seq_gaps_frac = gaps_array.mean(axis=1)
+ seq_below_threshold = seq_gaps_frac <= threshold_sequence_frac_gaps
+ print("Proportion of sequences dropped due to fraction of gaps: " + str(
+ round(float(1 - seq_below_threshold.sum() / seq_below_threshold.shape) * 100, 2)) + "%")
+ # Identify focus columns
+ columns_gaps_frac = gaps_array[seq_below_threshold].mean(axis=0)
+ index_cols_below_threshold = columns_gaps_frac <= threshold_focus_cols_frac_gaps
+ print("Proportion of non-focus columns removed: " + str(
+ round(float(1 - index_cols_below_threshold.sum() / index_cols_below_threshold.shape) * 100, 2)) + "%")
+ # Lower case non focus cols and filter fragment sequences
+ def _lower_case_and_filter_fragments(seq):
+ return ''.join([aa.lower() if aa_ix in index_cols_below_threshold else aa for aa_ix, aa in enumerate(seq)])
+ msa_df['sequence'] = msa_df['sequence'].apply(
+ lambda seq: ''.join([aa.upper() if upper_case_ind else aa.lower() for aa, upper_case_ind in
+ zip(seq, index_cols_below_threshold)]))
+ msa_df = msa_df[seq_below_threshold]
+ # Overwrite seq_name_to_sequence with clean version
+ seq_name_to_sequence = defaultdict(str)
+ # Create a dictionary from msa_df.index to msa_df.sequence
+ seq_name_to_sequence = dict(zip(msa_df.index, msa_df.sequence))
+ # for seq_idx in range(len(msa_df['sequence'])):
+ # seq_name_to_sequence[msa_df.index[seq_idx]] = msa_df.sequence[seq_idx]
+
+ return seq_name_to_sequence
+
+ def calc_weights(self, num_cpus=1, method="eve"):
+ """
+ From the EVE repo, but modified to skip printing out progress bar / time taken
+ (because for ProteinNPT embeddings, weights will usually be computed on the fly for small subsamples of MSA).
+
+ If num_cpus == 1, weights are computed in serial.
+ If num_cpus == -1, weights are computed in parallel using all available cores.
+ Note: This will use multiprocessing.cpu_count() to get the number of available cores, which on clusters may
+ return all cores, not just the number of cores available to the user.
+ """
+ # Refactored into its own function so that we can call it separately
+ if self.use_weights:
+ if os.path.isfile(self.weights_location):
+ print("Loading sequence weights from disk: {}".format(self.weights_location))
+ self.weights = np.load(file=self.weights_location)
+ else:
+ print("Computing sequence weights")
+ if num_cpus == -1:
+ num_cpus = get_num_cpus()
+
+ if method == "eve":
+ alphabet_mapper = map_from_alphabet(ALPHABET_PROTEIN_GAP, default=GAP)
+ arrays = []
+ for seq in self.seq_name_to_sequence.values():
+ arrays.append(np.array(list(seq)))
+ sequences = np.vstack(arrays)
+ sequences_mapped = map_matrix(sequences, alphabet_mapper)
+ self.weights = calc_weights_fast(sequences_mapped, identity_threshold=1 - self.theta,
+ empty_value=0, num_cpus=num_cpus) # GAP = 0
+ elif method == "identity":
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+ else:
+ raise ValueError(f"Unknown method: {method}. Must be either 'eve' or 'identity'.")
+ print("Saving sequence weights to disk")
+ np.save(file=self.weights_location, arr=self.weights)
+ else:
+ # If not using weights, use an isotropic weight matrix
+ print("Not weighting sequence data")
+ self.weights = np.ones(self.one_hot_encoding.shape[0])
+
+ self.Neff = np.sum(self.weights)
+ print("Neff =", str(self.Neff))
+ print("Number of sequences: ", self.num_sequences)
+ assert self.weights.shape[0] == self.num_sequences, f"Expected {self.num_sequences} sequences, loaded weights have {self.weights.shape[0]}"
+ self.seq_name_to_weight={} # For later, if we want to remove certain sequences and associated weights
+ for i,seq_name in enumerate(self.seq_name_to_sequence.keys()):
+ self.seq_name_to_weight[seq_name]=self.weights[i]
+
+ return self.weights
+
+# One-hot encoding of sequences
+def one_hot_3D(seq_keys, seq_name_to_sequence, alphabet, seq_length):
+ """
+ Take in a list of sequence names/keys and corresponding sequences, and generate a one-hot array according to an alphabet.
+ """
+ aa_dict = {letter: i for (i, letter) in enumerate(alphabet)}
+
+ one_hot_out = np.zeros((len(seq_keys), seq_length, len(alphabet)))
+ for i, seq_key in enumerate(seq_keys):
+ sequence = seq_name_to_sequence[seq_key]
+ for j, letter in enumerate(sequence):
+ if letter in aa_dict:
+ k = aa_dict[letter]
+ one_hot_out[i, j, k] = 1.0
+ # one_hot_out = torch.tensor(one_hot_out)
+ return one_hot_out
+
+def get_num_cpus():
+ if 'SLURM_CPUS_PER_TASK' in os.environ:
+ num_cpus = int(os.environ['SLURM_CPUS_PER_TASK'])
+ print("SLURM_CPUS_PER_TASK:", os.environ['SLURM_CPUS_PER_TASK'])
+ print("Using all available cores (calculated using SLURM_CPUS_PER_TASK):", num_cpus)
+ else:
+ num_cpus = len(os.sched_getaffinity(0))
+ print("Using all available cores (calculated using len(os.sched_getaffinity(0))):", num_cpus)
+ return num_cpus
\ No newline at end of file
diff --git a/proteingym/utils/scoring_utils.py b/proteingym/utils/scoring_utils.py
new file mode 100644
index 0000000..3286b05
--- /dev/null
+++ b/proteingym/utils/scoring_utils.py
@@ -0,0 +1,76 @@
+import numpy as np
+import pandas as pd
+import torch
+
+AA_vocab = "ACDEFGHIKLMNPQRSTVWY"
+unusual_AA ="OU" #Pyrrolysine O and selenocysteine U
+indeterminate_AA = "BJXZ" #B = Asparagine or Aspartic acid; J = leucine or isoleucine; X = Any/Unknown ; Z = Glutamine or glutamic acid
+
+def standardize(x, epsilon = 1e-10):
+ return (x - x.mean()) / (x.std() + epsilon)
+
+def nanmean(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs) / (~is_nan).float().sum(*args, **kwargs)
+
+def nansum(v, *args, inplace=False, **kwargs):
+ if not inplace:
+ v = v.clone()
+ is_nan = torch.isnan(v)
+ v[is_nan] = 0
+ return v.sum(*args, **kwargs)
+
+def get_mutated_sequence(focus_seq, mutant, start_idx=1, AA_vocab=AA_vocab):
+ """
+ Helper function that mutates an input sequence (focus_seq) via an input mutation triplet (substitutions only).
+ Mutation triplet are typically based on 1-indexing: start_idx is used for switching to 0-indexing.
+ """
+ mutated_seq = list(focus_seq)
+ for mutation in mutant.split(":"):
+ try:
+ from_AA, position, to_AA = mutation[0], int(mutation[1:-1]), mutation[-1]
+ except:
+ print("Issue with mutant: "+str(mutation))
+ relative_position = position - start_idx
+ assert (from_AA==focus_seq[relative_position]), "Invalid from_AA or mutant position: "+str(mutation)+" from_AA: "+str(from_AA) + " relative pos: "+str(relative_position) + " focus_seq: "+str(focus_seq)
+ assert (to_AA in AA_vocab) , "Mutant to_AA is invalid: "+str(mutation)
+ mutated_seq[relative_position] = to_AA
+ return "".join(mutated_seq)
+
+def get_optimal_window(mutation_position_relative, seq_len_wo_special, model_window):
+ half_model_window = model_window // 2
+ if seq_len_wo_special <= model_window:
+ return [0,seq_len_wo_special]
+ elif mutation_position_relative < half_model_window:
+ return [0,model_window]
+ elif mutation_position_relative >= seq_len_wo_special - half_model_window:
+ return [seq_len_wo_special - model_window, seq_len_wo_special]
+ else:
+ return [max(0,mutation_position_relative-half_model_window), min(seq_len_wo_special,mutation_position_relative+half_model_window)]
+
+def set_mutant_offset(mutant, MSA_start, mutant_delim=":"):
+ """
+ Adjusts the offset of a mutant sequence to match the MSA start and end positions
+ """
+ indiv_mutants = mutant.split(mutant_delim)
+ new_mutants = []
+ for indiv_mutant in indiv_mutants:
+ wt, pos, sub = indiv_mutant[0], int(indiv_mutant[1:-1]), indiv_mutant[-1]
+ shift_pos = pos - MSA_start + 1
+ new_mutants.append(wt + str(int(shift_pos)) + sub)
+ return mutant_delim.join(new_mutants)
+
+def undo_mutant_offset(mutant, MSA_start, mutant_delim=","):
+ """
+ Undoes the offset adjustment of a mutant sequence to match the MSA start and end positions
+ """
+ indiv_mutants = mutant.split(mutant_delim)
+ new_mutants = []
+ for indiv_mutant in indiv_mutants:
+ wt, pos, sub = indiv_mutant[0], int(indiv_mutant[1:-1]), indiv_mutant[-1]
+ shift_pos = pos + MSA_start - 1
+ new_mutants.append(wt + str(int(shift_pos)) + sub)
+ return mutant_delim.join(new_mutants)
\ No newline at end of file
diff --git a/proteingym/utils/weights.py b/proteingym/utils/weights.py
new file mode 100644
index 0000000..f0b1d14
--- /dev/null
+++ b/proteingym/utils/weights.py
@@ -0,0 +1,257 @@
+# This class is copied from the EVE codebase https://github.com/OATML-Markslab/EVE, removing the progress bar (since we're calculating weights on the fly so don't need one)
+import multiprocessing
+import time
+from collections import defaultdict
+
+import numba
+from numba import prange
+# from numba_progress import ProgressBar
+
+import numpy as np
+from tqdm import tqdm
+
+def calc_weights_fast(matrix_mapped, identity_threshold, empty_value, num_cpus=1):
+ """
+ Modified from EVCouplings: https://github.com/debbiemarkslab/EVcouplings
+
+ Note: Numba by default uses `multiprocessing.cpu_count()` threads.
+ On a cluster where a process might only have access to a subset of CPUs, this may be less than the number of CPUs available.
+ The caller should ideally use len(os.sched_getaffinity(0)) to get the number of CPUs available to the process.
+
+ Calculate weights for sequences in alignment by
+ clustering all sequences with sequence identity
+ greater or equal to the given threshold.
+ Parameters
+ ----------
+ identity_threshold : float
+ Sequence identity threshold
+ """
+ empty_idx = is_empty_sequence_matrix(matrix_mapped, empty_value=empty_value) # e.g. sequences with just gaps or lowercase, no valid AAs
+ N = matrix_mapped.shape[0]
+
+ # Original EVCouplings code structure, plus gap handling
+ if num_cpus != 1:
+ # print("Calculating weights using Numba parallel (experimental) since num_cpus > 1. If you want to disable multiprocessing set num_cpus=1.")
+ # print("Default number of threads for Numba:", numba.config.NUMBA_NUM_THREADS)
+
+ # num_cpus > numba.config.NUMBA_NUM_THREADS will give an error.
+ # But we'll leave it so that the user has to be explicit.
+ numba.set_num_threads(num_cpus)
+ print("Set number of threads to:", numba.get_num_threads()) # Sometimes Numba uses all the CPUs anyway
+
+ num_cluster_members = calc_num_cluster_members_nogaps_parallel(matrix_mapped[~empty_idx], identity_threshold,
+ invalid_value=empty_value)
+
+ else:
+ # Use the serial version
+ num_cluster_members = calc_num_cluster_members_nogaps(matrix_mapped[~empty_idx], identity_threshold,
+ invalid_value=empty_value)
+
+ # Empty sequences: weight 0
+ weights = np.zeros((N))
+ weights[~empty_idx] = 1.0 / num_cluster_members
+ return weights
+
+# Below are util functions copied from EVCouplings
+def is_empty_sequence_matrix(matrix, empty_value):
+ assert len(matrix.shape) == 2, f"Matrix must be 2D; shape={matrix.shape}"
+ assert isinstance(empty_value, (int, float)), f"empty_value must be a number; type={type(empty_value)}"
+ # Check for each sequence if all positions are equal to empty_value
+ empty_idx = np.all((matrix == empty_value), axis=1)
+ return empty_idx
+
+
+def map_from_alphabet(alphabet, default):
+ """
+ Creates a mapping dictionary from a given alphabet.
+ Parameters
+ ----------
+ alphabet : str
+ Alphabet for remapping. Elements will
+ be remapped according to alphabet starting
+ from 0
+ default : Elements in matrix that are not
+ contained in alphabet will be treated as
+ this character
+ Raises
+ ------
+ ValueError
+ For invalid default character
+ """
+ map_ = {
+ c: i for i, c in enumerate(alphabet)
+ }
+
+ try:
+ default = map_[default]
+ except KeyError:
+ raise ValueError(
+ "Default {} is not in alphabet {}".format(default, alphabet)
+ )
+
+ return defaultdict(lambda: default, map_)
+
+
+
+def map_matrix(matrix, map_):
+ """
+ Map elements in a numpy array using alphabet
+ Parameters
+ ----------
+ matrix : np.array
+ Matrix that should be remapped
+ map_ : defaultdict
+ Map that will be applied to matrix elements
+ Returns
+ -------
+ np.array
+ Remapped matrix
+ """
+ return np.vectorize(map_.__getitem__)(matrix)
+
+
+# Fastmath should be safe here, as we can assume that there are no NaNs in the input etc.
+@numba.jit(nopython=True, fastmath=True) #parallel=True
+def calc_num_cluster_members_nogaps(matrix, identity_threshold, invalid_value):
+ """
+ From EVCouplings: https://github.com/debbiemarkslab/EVcouplings/blob/develop/evcouplings/align/alignment.py#L1172.
+ Modified to use non-gapped length and not counting gaps as sequence similarity matches.
+
+ Calculate number of sequences in alignment
+ within given identity_threshold of each other
+ Parameters
+ ----------
+ matrix : np.array
+ N x L matrix containing N sequences of length L.
+ Matrix must be mapped to range(0, num_symbols) using
+ map_matrix function
+ identity_threshold : float
+ Sequences with at least this pairwise identity will be
+ grouped in the same cluster.
+ Returns
+ -------
+ np.array
+ Vector of length N containing number of cluster
+ members for each sequence (inverse of sequence
+ weight)
+ """
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ for i in range(N - 1):
+ for j in range(i + 1, N):
+ pair_matches = 0
+ for k in range(L):
+ # Edit(Lood): Don't count gaps as matches
+ if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+ pair_matches += 1
+
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors[i] += 1
+ if pair_matches / L_non_gaps[j] > identity_threshold:
+ num_neighbors[j] += 1
+
+ return num_neighbors
+
+
+@numba.jit(nopython=True, fastmath=True, parallel=True)
+def calc_num_cluster_members_nogaps_parallel(matrix, identity_threshold, invalid_value):
+ """
+ Parallel implementation of calc_num_cluster_members_nogaps above.
+
+ Calculate number of sequences in alignment
+ within given identity_threshold of each other
+ Parameters
+ ----------
+ matrix : np.array
+ N x L matrix containing N sequences of length L.
+ Matrix must be mapped to range(0, num_symbols) using
+ map_matrix function
+ identity_threshold : float
+ Sequences with at least this pairwise identity will be
+ grouped in the same cluster.
+ invalid_value : int
+ Value in matrix that is considered invalid, e.g. gap or lowercase character.
+ Returns
+ -------
+ np.array
+ Vector of length N containing number of cluster
+ members for each sequence (inverse of sequence
+ weight)
+ """
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ # Edit: Rewrote loop without any dependencies between inner and outer loops, so that it can be parallelized
+ for i in prange(N):
+ num_neighbors_i = 1
+ for j in range(N):
+ if i == j:
+ continue
+ pair_matches = 0
+ for k in range(L): # This should hopefully be vectorised by numba
+ # Edit(Lood): Don't count gaps as matches
+ if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+ pair_matches += 1
+
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so this similarity is asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors_i += 1
+
+ num_neighbors[i] = num_neighbors_i
+
+ return num_neighbors
+
+@numba.jit(nopython=True, fastmath=True, parallel=True)
+def calc_num_cluster_members_nogaps_parallel_print(matrix, identity_threshold, invalid_value, progress_proxy=None, update_frequency=1000):
+ """
+ Modified calc_num_cluster_members_nogaps_parallel to add tqdm progress bar - useful for multi-hour weights calc.
+
+ progress_proxy : numba_progress.ProgressBar
+ A handle on the progress bar to update
+ update_frequency : int
+ Similar to miniters in tqdm, how many iterations between updating the progress bar (which then will only print every `update_interval` seconds)
+ """
+
+ N, L = matrix.shape
+ L = 1.0 * L
+
+ # Empty sequences are filtered out before this function and are ignored
+ # minimal cluster size is 1 (self)
+ num_neighbors = np.ones((N))
+ L_non_gaps = L - np.sum(matrix == invalid_value, axis=1) # Edit: From EVE, use the non-gapped length
+ # compare all pairs of sequences
+ # Edit: Rewrote loop without any dependencies between inner and outer loops, so that it can be parallelized
+ for i in prange(N):
+ num_neighbors_i = 1
+ for j in range(N):
+ if i == j:
+ continue
+ pair_matches = 0
+ for k in range(L): # This should hopefully be vectorised by numba
+ # Edit(Lood): Don't count gaps as matches
+ if matrix[i, k] == matrix[j, k] and matrix[i, k] != invalid_value:
+ pair_matches += 1
+ # Edit(Lood): Calculate identity as fraction of non-gapped positions (so this similarity is asymmetric)
+ # Note: Changed >= to > to match EVE / DeepSequence code
+ if pair_matches / L_non_gaps[i] > identity_threshold:
+ num_neighbors_i += 1
+
+ num_neighbors[i] = num_neighbors_i
+ if progress_proxy is not None and i % update_frequency == 0:
+ progress_proxy.update(update_frequency)
+
+ return num_neighbors
diff --git a/reference_files/DMS_indels.csv b/reference_files/DMS_indels.csv
new file mode 100644
index 0000000..d174144
--- /dev/null
+++ b/reference_files/DMS_indels.csv
@@ -0,0 +1,67 @@
+DMS_id,DMS_filename,UniProt_ID,taxon,source_organism,target_seq,seq_len,DMS_total_number_mutants,DMS_binarization_cutoff,DMS_binarization_method,first_author,title,year,jo,molecule_name,selection_assay,selection_type,MSA_filename,MSA_start,MSA_end,MSA_len,MSA_bitscore,MSA_theta,MSA_num_seqs,MSA_perc_cov,MSA_num_cov,MSA_N_eff,MSA_Neff_L,MSA_Neff_L_category,MSA_num_significant,MSA_num_significant_L,raw_DMS_filename,raw_DMS_phenotype_name,raw_DMS_directionality,raw_DMS_mutant_column,weight_file_name,ProteinGym_version,coarse_selection_type
+A4_HUMAN_Seuma_2022_indels,A4_HUMAN_Seuma_2022_indels.csv,A4_HUMAN,Human,Homo sapiens,MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMHMNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN,770,2346,-2.319333116,median,Seuma,"An atlas of amyloid aggregation: the impact of substitutions, insertions, deletions and truncations on amyloid beta fibril nucleation",2022,10.1038/s41467-022-34742-3,APP,aggregation,survival assessment assay,A4_HUMAN_2023-08-07_b01.a2m,1,770,770,0.1,0.2,5272,0.987,760,99.3,0.1306578947,Low,0,0,,nscore_c,1,mutated_sequence,A4_HUMAN_theta0.2_2023-08-07_b01.npy,1,Stability
+AMFR_HUMAN_Tsuboyama_2023_4G3O_indels,AMFR_HUMAN_Tsuboyama_2023_4G3O_indels.csv,AMFR_HUMAN,Human,Homo sapiens,YFQGQLNAMAHQIQEMFPQVPYHLVLQDLQLTRSVEITTDNILEGRI,47,117,-1.504736022,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,E3 ubiquitin-protein ligase AMFR,Stability,cDNA display proteolysis,AMFR_HUMAN_2023-08-07_b04.a2m,1,47,47,0.4,0.2,17787,0.872,41,1166.9,28.46097561,Medium,12,0.2926829268,,ddG_ML_float,1,aa_seq,AMFR_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+ARGR_ECOLI_Tsuboyama_2023_1AOY_indels,ARGR_ECOLI_Tsuboyama_2023_1AOY_indels.csv,ARGR_ECOLI,Prokaryote,Escherichia coli,QEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGV,69,181,-0.4541373765,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Arginine repressor,Stability,cDNA display proteolysis,ARGR_ECOLI_2023-08-07_b04.a2m,1,69,69,0.4,0.2,21443,0.913,63,3719.2,59.03492063,Medium,29,0.4603174603,,ddG_ML_float,1,aa_seq,ARGR_ECOLI_theta0.2_2023-08-07_b04.npy,1,Stability
+B1LPA6_ECOSM_Russ_2020_indels,B1LPA6_ECOSM_Russ_2020_indels.csv,B1LPA6_ECOSM,Prokaryote,Escherichia coli,MTSENPLLALREKISALDEKLLALLAERRELAVEVGKAKLLSHRPVRDIDRERDLLERLITLGKAHHLDAHYITRLFQLIIEDSVLTQQALLQQHLNKINPHSARIAFLGPKGSYSHLAARQYAARHFEQFIESGCAKFADIFNQVETGQADYAVVPIENTSSGAINDVYDLLQHTSLSIVGEMTLTIDHCLLVSGTTDLSTINTVYSHPQPFQQCSKFLNRYPHWKIEYTESTSAAMEKVAQAKSPHVAALGSEAGGTLYGLQVLERIEANQRQNFTRFVVLARKAINVSDQVPAKTTLLMATGQQAGALVEALLVLRNHSLIMTRLESRPIHGNPWEEMFYLDIQANLESAEMQKALKELGEITRSMKVLGCYPSENVVPVDPT,386,3074,0.4,manual,Russ,An evolution-based model for designing chorismate mutase enzymes,2020,10.1126/science.aba3304,chorismate mutase,enzyme activity,enzyme activity,B1LPA6_ECOSM_full_04-30-2022_b05.a2m,1,386,386,0.5,0.2,33872,0.699,270,6160,22.81481481,Medium,341,1.262962963,B1LPA6_ECOSM_Russ_2020.csv,activity,1,mutant,B1LPA6_ECOSM_theta_0.2.npy,0.1,Activity
+BBC1_YEAST_Tsuboyama_2023_1TG0_indels,BBC1_YEAST_Tsuboyama_2023_1TG0_indels.csv,BBC1_YEAST,Eukaryote,Saccharomyces cerevisiae,EVPFKVVAQFPYKSDYEDDLNFEKDQEIIVTSVEDAEWYFGEYQDSNGDVIEGIFPKSFVAVQG,64,134,-1.271998543,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Myosin tail region-interacting protein MTI1,Stability,cDNA display proteolysis,BBC1_YEAST_2023-08-07_b05.a2m,1,64,64,0.5,0.2,604824,0.844,54,17529.2,324.6148148,High,55,1.018518519,,ddG_ML_float,1,aa_seq,BBC1_YEAST_theta0.2_2023-08-07_b05.npy,1,Stability
+BCHB_CHLTE_Tsuboyama_2023_2KRU_indels,BCHB_CHLTE_Tsuboyama_2023_2KRU_indels.csv,BCHB_CHLTE,Prokaryote,Chlorobaculum tepidum,ELSWTAEAEKMLGKVPFFVRKKVRKNTDNYAREIGEPVVTADVFRKAKEHLG,52,82,-0.9540616602,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Light-independent protochlorophyllide reductase subunit B,Stability,cDNA display proteolysis,BCHB_CHLTE_2023-08-07_b04.a2m,1,52,52,0.4,0.2,12079,0.923,48,2630.8,54.80833333,Medium,18,0.375,,ddG_ML_float,1,aa_seq,BCHB_CHLTE_theta0.2_2023-08-07_b04.npy,1,Stability
+BLAT_ECOLX_Gonzalez_2019_indels,BLAT_ECOLX_Gonzalez_2019_indels.csv,BLAT_ECOLX,Prokaryote,Escherichia coli,MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW,286,4751,0.015686274,median,Gonzalez,Fitness Effects of Single Amino Acid Insertions and Deletions in TEM-1 β-Lactamase,2019,10.1016/j.jmb.2019.04.030,Beta-lactamase TEM,"antibiotic resistance, MIC",Amp resistance,BLAT_ECOLX_full_11-26-2021_b02.a2m,1,286,286,0.2,0.2,209644,0.752,215,47605,221.4186047,High,446,2.074418605,BLAT_ECOLX_Gonzalez_indels_2019.csv,DMS_score,1,sequence,BLAT_ECOLX_theta_0.2.npy,0.1,OrganismalFitness
+CAPSD_AAV2S_Sinai_2021_designed_indels,CAPSD_AAV2S_Sinai_2021_designed_indels.csv,CAPSD_AAV2S,Virus,Adeno-associated virus 2 (isolate Srivastava/1982) (AAV-2),MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFNGLDKGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSFGGNLGRAVFQAKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAGQQPARKRLNFGQTGDADSVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQSGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFTVDTNGVYSEPRPIGTRYLTRNL,735,225998,-2.18477642,median,Sinai,Generative AAV capsid diversification by latent interpolation,2021,10.1101/2021.04.16.440236,AAV,viability for AAV capsid production,,CAPSD_AAV2S_uniprot_t099_msc70_mcc70_b0.8.a2m,1,735,735,0.8,0.01,604,0.782,575,213.8,0.371826087,Low,1943,3.379130435,CAPSD_AAV2S_Sinai_indels_2021.csv,label,1,mutated_sequence,CAPSD_AAV2S_theta_0.01.npy,0.1,OrganismalFitness
+CAPSD_AAV2S_Sinai_2021_library_indels,CAPSD_AAV2S_Sinai_2021_library_indels.csv,CAPSD_AAV2S,Virus,Adeno-associated virus 2 (isolate Srivastava/1982) (AAV-2),MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFNGLDKGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSFGGNLGRAVFQAKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAGQQPARKRLNFGQTGDADSVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQSGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFTVDTNGVYSEPRPIGTRYLTRNL,735,24908,-2.18477642,median,Sinai,Generative AAV capsid diversification by latent interpolation,2021,10.1101/2021.04.16.440236,AAV,viability for AAV capsid production,,CAPSD_AAV2S_uniprot_t099_msc70_mcc70_b0.8.a2m,1,735,735,0.8,0.01,604,0.782,575,213.8,0.371826087,Low,1943,3.379130435,CAPSD_AAV2S_Sinai_indels_2021.csv,label,1,mutated_sequence,CAPSD_AAV2S_theta_0.01.npy,0.1,OrganismalFitness
+CATR_CHLRE_Tsuboyama_2023_2AMI_indels,CATR_CHLRE_Tsuboyama_2023_2AMI_indels.csv,CATR_CHLRE,Eukaryote,Chlamydomonas reinhardtii,GLTEEQKQEIREAFDLFDTDGSGTIDAKELKVAMRALGFEPKKEEIKKMISEIDKDGSGTIDFEEFLTMMTA,72,197,-0.5681612987,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Caltractin,Stability,cDNA display proteolysis,CATR_CHLRE_2023-08-07_b03.a2m,1,72,72,0.3,0.2,551057,0.903,65,75596.9,1163.029231,High,57,0.8769230769,,ddG_ML_float,1,aa_seq,CATR_CHLRE_theta0.2_2023-08-07_b03.npy,1,Stability
+CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels,CBPA2_HUMAN_Tsuboyama_2023_1O6X_indels.csv,CBPA2_HUMAN,Human,Homo sapiens,VGDQVLEIVPSNEEQIKNLLQLEAQEHLQLDFWKSPTTPGETAHVRVPFVNVQAVKVFLESQGIAYSIMIED,72,205,-1.221174658,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Carboxypeptidase A2,Stability,cDNA display proteolysis,CBPA2_HUMAN_2023-08-07_b03.a2m,1,72,72,0.3,0.2,12711,0.986,71,3086.5,43.47183099,Medium,34,0.4788732394,,ddG_ML_float,1,aa_seq,CBPA2_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+CBX4_HUMAN_Tsuboyama_2023_2K28_indels,CBX4_HUMAN_Tsuboyama_2023_2K28_indels.csv,CBX4_HUMAN,Human,Homo sapiens,AVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNRE,50,129,-1.635037732,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,E3 SUMO-protein ligase CBX4,Stability,cDNA display proteolysis,CBX4_HUMAN_2023-08-07_b03.a2m,1,50,50,0.3,0.2,108263,0.96,48,13404.4,279.2583333,High,23,0.4791666667,,ddG_ML_float,1,aa_seq,CBX4_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+CSN4_MOUSE_Tsuboyama_2023_1UFM_indels,CSN4_MOUSE_Tsuboyama_2023_1UFM_indels.csv,CSN4_MOUSE,Eukaryote,Mus musculus,SSGGSSILDRAVIEHNLLSASKLYNNITFEELGALLEIPAAKAEKIASQMITEGRMNGFIDQIDGIVHFETR,72,195,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,COP9 signalosome complex subunit 4,Stability,cDNA display proteolysis,CSN4_MOUSE_2023-08-07_b03.a2m,1,72,72,0.3,0.2,39217,0.889,64,3492.9,54.5765625,Medium,9,0.140625,,ddG_ML_float,1,aa_seq,CSN4_MOUSE_theta0.2_2023-08-07_b03.npy,1,Stability
+CUE1_YEAST_Tsuboyama_2023_2MYX_indels,CUE1_YEAST_Tsuboyama_2023_2MYX_indels.csv,CUE1_YEAST,Eukaryote,Saccharomyces cerevisiae,GGHPVTTQMVETVQNLAPNLHPEQIRYSLENTGSVEETVERYLRGDEFSFPP,52,140,-1.319713733,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Coupling of ubiquitin conjugation to ER degradation protein 1,Stability,cDNA display proteolysis,CUE1_YEAST_2023-08-07_b08.a2m,1,52,52,0.8,0.2,3213,0.923,48,387.1,8.064583333,Medium,10,0.2083333333,,ddG_ML_float,1,aa_seq,CUE1_YEAST_theta0.2_2023-08-07_b08.npy,1,Stability
+DN7A_SACS2_Tsuboyama_2023_1JIC_indels,DN7A_SACS2_Tsuboyama_2023_1JIC_indels.csv,DN7A_SACS2,Prokaryote,Saccharolobus solfataricus,TVKFKYKGEEKQVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQ,55,136,-0.472754253,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,DNA-binding protein 7a,Stability,cDNA display proteolysis,DN7A_SACS2_2023-08-07_b02.a2m,1,55,55,0.2,0.2,42895,0.764,42,1248.1,29.71666667,Medium,13,0.3095238095,,ddG_ML_float,1,aa_seq,DN7A_SACS2_theta0.2_2023-08-07_b02.npy,1,Stability
+DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels,DNJA1_HUMAN_Tsuboyama_2023_2LO1_indels.csv,DNJA1_HUMAN,Human,Homo sapiens,TTYYDVLGVKPNATQEELKKAYRKLALKYHPDKNPNEGEKFKQISQAYEVLSDAKKRELYDKGGE,65,174,-2.239788161,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,DnaJ homolog subfamily A member 1,Stability,cDNA display proteolysis,DNJA1_HUMAN_2023-08-07_b07.a2m,1,65,65,0.7,0.2,280284,0.969,63,35361.9,561.3,High,52,0.8253968254,,ddG_ML_float,1,aa_seq,DNJA1_HUMAN_theta0.2_2023-08-07_b07.npy,1,Stability
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels,DOCK1_MOUSE_Tsuboyama_2023_2M0Y_indels.csv,DOCK1_MOUSE,Eukaryote,Mus musculus,WVPTKREEKYGVAFYNYDARGADELSLQIGDTVHILETYEGWYRGYTLRKKSKKGIFPASYIHLKE,66,154,-1.104437518,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Dedicator of cytokinesis protein 1,Stability,cDNA display proteolysis,DOCK1_MOUSE_2023-08-07_b03.a2m,1,66,66,0.3,0.2,705447,0.848,56,22172.3,395.9339286,High,55,0.9821428571,,ddG_ML_float,1,aa_seq,DOCK1_MOUSE_theta0.2_2023-08-07_b03.npy,1,Stability
+EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels,EPHB2_HUMAN_Tsuboyama_2023_1F0M_indels.csv,EPHB2_HUMAN,Human,Homo sapiens,SFNTVDEWLEAIKMGQYKESFANAGFTSFDVVSQMMMEDILRVGVTLAGHQKKILNSIQVMRAQMN,66,185,-1.932053964,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Ephrin type-B receptor 2,Stability,cDNA display proteolysis,EPHB2_HUMAN_2023-08-07_b04.a2m,1,66,66,0.4,0.2,212234,0.894,59,8426.3,142.8186441,High,29,0.4915254237,,ddG_ML_float,1,aa_seq,EPHB2_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+FECA_ECOLI_Tsuboyama_2023_2D1U_indels,FECA_ECOLI_Tsuboyama_2023_2D1U_indels.csv,FECA_ECOLI,Eukaryote,Escherichia coli,QVNIAPGSLDKALNQYAAHSGFTLSVDASLTRGKQSNGLHGDYDVESGLQQLLDGSGLQVKPLGNNSWTLEP,72,193,-0.813576222,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Fe(3+) dicitrate transport protein FecA,Stability,cDNA display proteolysis,FECA_ECOLI_2023-08-07_b06.a2m,1,72,72,0.6,0.2,74248,0.986,71,9949.9,140.1394366,High,63,0.8873239437,,ddG_ML_float,1,aa_seq,FECA_ECOLI_theta0.2_2023-08-07_b06.npy,1,Stability
+HCP_LAMBD_Tsuboyama_2023_2L6Q_indels,HCP_LAMBD_Tsuboyama_2023_2L6Q_indels.csv,HCP_LAMBD,Virus,Escherichia phage lambda (Bacteriophage lambda),VRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGM,55,148,-0.350614016,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Head completion protein,Stability,cDNA display proteolysis,HCP_LAMBD_2023-08-07_b05.a2m,1,55,55,0.5,0.2,2128,0.945,52,606.5,11.66346154,Medium,15,0.2884615385,,ddG_ML_float,1,aa_seq,HCP_LAMBD_theta0.2_2023-08-07_b05.npy,1,Stability
+HECD1_HUMAN_Tsuboyama_2023_3DKM_indels,HECD1_HUMAN_Tsuboyama_2023_3DKM_indels.csv,HECD1_HUMAN,Human,Homo sapiens,NLYFQGLKYMVPGARVTRGLDWKWRDQDGSPQGEGTVTGELHNGWIDVTWDAGGSNSYRMGAEGKFDLKLAP,72,154,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,E3 ubiquitin-protein ligase HECTD1,Stability,cDNA display proteolysis,HECD1_HUMAN_2023-08-07_b03.a2m,1,72,72,0.3,0.2,18660,0.903,65,1192.3,18.34307692,Medium,24,0.3692307692,,ddG_ML_float,1,aa_seq,HECD1_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+HIS7_YEAST_Pokusaeva_2019_indels,HIS7_YEAST_Pokusaeva_2019_indels.csv,HIS7_YEAST,Eukaryote,Saccharomyces cerevisiae,MTEQKALVKRITNETKIQIAISLKGGPLAIEHSIFPEKEAEAVAEQATQSQVINVHTGIGFLDHMIHALAKHSGWSLIVECIGDLHIDDHHTTEDCGIALGQAFKEALGAVRGVKRFGSGFAPLDEALSRAVVDLSNRPYAVVELGLQREKVGDLSCEMIPHFLESFAEASRITLHVDCLRGKNDHHRSESAFKALAVAIREATSPNGTNDVPSTKGVLM,220,6102,0.25,manual,Pokusaeva,An experimental assay of the interactions of amino acids from orthologous sequences shaping a complex fitness landscape,2019,10.1371/journal.pgen.1008079,IGP dehydratase (HIS3),Growth,Growth,HIS7_YEAST_full_11-26-2021_b09.a2m,1,220,220,0.9,0.2,40154,0.873,192,5191.3,27.03802083,Medium,318,1.65625,HIS7_YEAST_Pokusaeva_indels_2019.csv,DMS_score,1,sequence,HIS7_YEAST_theta_0.2.npy,0.1,OrganismalFitness
+ILF3_HUMAN_Tsuboyama_2023_2L33_indels,ILF3_HUMAN_Tsuboyama_2023_2L33_indels.csv,ILF3_HUMAN,Human,Homo sapiens,MLTKHGKNPVMELNEKRRGLKYELISETGGSHDKRFVMEVEVDGQKFQGAGSNKKVAKAYAALAALEKLFP,71,193,-0.4,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Interleukin enhancer-binding factor 3,Stability,cDNA display proteolysis,ILF3_HUMAN_2023-08-07_b03.a2m,1,71,71,0.3,0.2,145438,0.915,65,21228,326.5846154,High,57,0.8769230769,,ddG_ML_float,1,aa_seq,ILF3_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+KCNJ2_MOUSE_Macdonald_2022_indels,KCNJ2_MOUSE_Macdonald_2022_indels.csv,KCNJ2_MOUSE,Eukaryote,Mus Musculus,MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDTSKVSKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEEEDSENGVPESTSTDSPPGIDLHNQASVPLEPRPLRRESEI,428,10501,-2,manual,Macdonald,"DIMPLE: deep insertion, deletion, and missense mutation libraries for exploring protein variation in evolution, disease, and biology",2022,10.1186/s13059-023-02880-6,Kir2.1,surface trafficking,FACS,KCNJ2_MOUSE_b01.a2m,1,428,428,0.1,0.2,20953,0.743,318,2154.5,6.77515723,Medium,160,0.50314465,KCNJ2_MOUSE_Macdonald_2022_indels_noflag.csv,score,1,mutated_sequence_no_flag,KCNJ2_MOUSE_b01_theta_0.2.npy,1,Expression
+MAFG_MOUSE_Tsuboyama_2023_1K1V_indels,MAFG_MOUSE_Tsuboyama_2023_1K1V_indels.csv,MAFG_MOUSE,Eukaryote,Mus musculus,LTDEELVTMSVRELNQHLRGLSKEEIIQLKQRRRTLKNRGY,41,115,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription factor MafG,Stability,cDNA display proteolysis,MAFG_MOUSE_2023-08-07_b07.a2m,1,41,41,0.7,0.2,6178,1,41,156.7,3.82195122,Medium,4,0.09756097561,,ddG_ML_float,1,aa_seq,MAFG_MOUSE_theta0.2_2023-08-07_b07.npy,1,Stability
+MBD11_ARATH_Tsuboyama_2023_6ACV_indels,MBD11_ARATH_Tsuboyama_2023_6ACV_indels.csv,MBD11_ARATH,Eukaryote,Arabidopsis thaliana,VSVELPAPSSWKKLFYPNKVGSVKKTEVVFVAPTGEEISNRKQLEQYLKSHPGNPAIAEFDWTTSG,66,131,-1.578921171,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Methyl-CpG-binding domain-containing protein 11,Stability,cDNA display proteolysis,MBD11_ARATH_2023-08-07_b03.a2m,1,66,66,0.3,0.2,26035,0.909,60,1510.5,25.175,Medium,11,0.1833333333,,ddG_ML_float,1,aa_seq,MBD11_ARATH_theta0.2_2023-08-07_b03.npy,1,Stability
+MYO3_YEAST_Tsuboyama_2023_2BTT_indels,MYO3_YEAST_Tsuboyama_2023_2BTT_indels.csv,MYO3_YEAST,Eukaryote,Saccharomyces cerevisiae,KDPKFEAAYDFPGSGSSSELPLKKGDIVFISRDEPSGWSLAKLLDGSKEGWVPTAYMTPYK,61,80,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Myosin-3,Stability,cDNA display proteolysis,MYO3_YEAST_2023-08-07_b07.a2m,1,61,61,0.7,0.2,442941,0.885,54,12893.2,238.762963,High,51,0.9444444444,,ddG_ML_float,1,aa_seq,MYO3_YEAST_theta0.2_2023-08-07_b07.npy,1,Stability
+NKX31_HUMAN_Tsuboyama_2023_2L9R_indels,NKX31_HUMAN_Tsuboyama_2023_2L9R_indels.csv,NKX31_HUMAN,Human,Homo sapiens,HSHMSHTQVIELERKFSHQKYLSAPERAHLAKNLKLTETQVKIWFQNRRYKTKRKQLSSEL,61,178,-0.3,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Homeobox protein Nkx-3.1,Stability,cDNA display proteolysis,NKX31_HUMAN_2023-08-07_b04.a2m,1,61,61,0.4,0.2,319273,0.902,55,8440.8,153.4690909,High,27,0.4909090909,,ddG_ML_float,1,aa_seq,NKX31_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+NUSA_ECOLI_Tsuboyama_2023_1WCL_indels,NUSA_ECOLI_Tsuboyama_2023_1WCL_indels.csv,NUSA_ECOLI,Prokaryote,Escherichia coli,EAHAAIDTFTKYLDIDEDFATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAKNALATIAQ,69,191,-1.318069467,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription termination/antitermination protein NusA,Stability,cDNA display proteolysis,NUSA_ECOLI_2023-08-07_b03.a2m,1,69,69,0.3,0.2,205612,0.812,56,39002.5,696.4732143,High,38,0.6785714286,,ddG_ML_float,1,aa_seq,NUSA_ECOLI_theta0.2_2023-08-07_b03.npy,1,Stability
+NUSG_MYCTU_Tsuboyama_2023_2MI6_indels,NUSG_MYCTU_Tsuboyama_2023_2MI6_indels.csv,NUSG_MYCTU,Prokaryote,Mycobacterium tuberculosis,DYEVGESVTVMDGPFATLPATISEVNAEQQKLKVLVSIFGRETPVELTFGQVSKI,55,157,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription termination/antitermination protein NusG,Stability,cDNA display proteolysis,NUSG_MYCTU_2023-08-07_b03.a2m,1,55,55,0.3,0.2,102004,0.964,53,16625.7,313.6924528,High,41,0.7735849057,,ddG_ML_float,1,aa_seq,NUSG_MYCTU_theta0.2_2023-08-07_b03.npy,1,Stability
+OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels,OBSCN_HUMAN_Tsuboyama_2023_1V1C_indels.csv,OBSCN_HUMAN,Human,Homo sapiens,FDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRWLVRTKPTKSSPSRQGWVSPAYLDRRL,65,169,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Obscurin,Stability,cDNA display proteolysis,OBSCN_HUMAN_2023-08-07_b02.a2m,1,65,65,0.2,0.2,718751,0.815,53,23710.7,447.3716981,High,54,1.018867925,,ddG_ML_float,1,aa_seq,OBSCN_HUMAN_theta0.2_2023-08-07_b02.npy,1,Stability
+ODP2_GEOSE_Tsuboyama_2023_1W4G_indels,ODP2_GEOSE_Tsuboyama_2023_1W4G_indels.csv,ODP2_GEOSE,Prokaryote,Geobacillus stearothermophilus,NRRVIAMPSVRKWAREKGVDIRLVQGTGKNGRVLKEDIDAFLAG,44,47,-0.4168227551,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex,Stability,cDNA display proteolysis,ODP2_GEOSE_2023-08-07_b07.a2m,1,44,44,0.7,0.2,163835,0.909,40,14834.6,370.865,High,21,0.525,,ddG_ML_float,1,aa_seq,ODP2_GEOSE_theta0.2_2023-08-07_b07.npy,1,Stability
+OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels,OTU7A_HUMAN_Tsuboyama_2023_2L2D_indels.csv,OTU7A_HUMAN,Human,Homo sapiens,TLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAALSDYEQ,42,84,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,OTU domain-containing protein 7A,Stability,cDNA display proteolysis,OTU7A_HUMAN_2023-08-07_b02.a2m,1,42,42,0.2,0.2,1359071,0.881,37,514715.2,13911.22162,High,28,0.7567567568,,ddG_ML_float,1,aa_seq,OTU7A_HUMAN_theta0.2_2023-08-07_b02.npy,1,Stability
+P53_HUMAN_Kotler_2018_indels,P53_HUMAN_Kotler_2018_indels.csv,P53_HUMAN,Human,Homo sapiens,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,341,0.206718584,median,Kotler,A Systematic p53 Mutation Library Links Differential Functional Impact to Cancer Mutation Pattern and Evolutionary Conservation,2018,10.1016/j.molcel.2018.06.012,p53,growth,Growth,P53_HUMAN_full_11-26-2021_b09.a2m,1,393,393,0.9,0.2,4129,0.863,339,148,0.4365781711,Low,15,0.04424778761,P53_HUMAN_Kotler_deletions_2018.csv,RFS_H1299,-1,sequence,P53_HUMAN_Kotler_theta_0.2.npy,0.1,OrganismalFitness
+PIN1_HUMAN_Tsuboyama_2023_1I6C_indels,PIN1_HUMAN_Tsuboyama_2023_1I6C_indels.csv,PIN1_HUMAN,Human,Homo sapiens,KLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSG,39,106,-0.6844420472,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Peptidyl-prolyl cis-trans isomerase NIMA-interacting 1,Stability,cDNA display proteolysis,PIN1_HUMAN_2023-08-07_b02.a2m,1,39,39,0.2,0.2,248269,0.821,32,10833.2,338.5375,High,13,0.40625,,ddG_ML_float,1,aa_seq,PIN1_HUMAN_theta0.2_2023-08-07_b02.npy,1,Stability
+PITX2_HUMAN_Tsuboyama_2023_2L7M_indels,PITX2_HUMAN_Tsuboyama_2023_2L7M_indels.csv,PITX2_HUMAN,Human,Homo sapiens,THFTSQQLQELEATFQRNHYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWR,52,117,-1.201366007,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Pituitary homeobox 2,Stability,cDNA display proteolysis,PITX2_HUMAN_2023-08-07_b04.a2m,1,52,52,0.4,0.2,344174,1,52,9819.6,188.8384615,High,25,0.4807692308,,ddG_ML_float,1,aa_seq,PITX2_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+PKN1_HUMAN_Tsuboyama_2023_1URF_indels,PKN1_HUMAN_Tsuboyama_2023_1URF_indels.csv,PKN1_HUMAN,Human,Homo sapiens,GIPATNLSRVAGLEKQLAIELKVKQGAENMIQTYSNGSTKDRKLLLTAQQMLQDSKTKIDIIRMQLRRALQ,71,187,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Serine/threonine-protein kinase N1,Stability,cDNA display proteolysis,PKN1_HUMAN_2023-08-07_b01.a2m,1,71,71,0.1,0.2,187829,0.845,60,53755.8,895.93,High,13,0.2166666667,,ddG_ML_float,1,aa_seq,PKN1_HUMAN_theta0.2_2023-08-07_b01.npy,1,Stability
+POLG_PESV_Tsuboyama_2023_2MXD_indels,POLG_PESV_Tsuboyama_2023_2MXD_indels.csv,POLG_PESV,Virus,Porcine enteric sapovirus (isolate Swine/United States/Cowden/1980),ALRDDEYDEWQDIIRDWRKEMTVQQFLDLKERALSGASDPDSQRYNAWLELRA,53,149,-1.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Genome polyprotein,Stability,cDNA display proteolysis,POLG_PESV_2023-08-07_b03.a2m,1,53,53,0.3,0.2,20190,0.887,47,3718.4,79.11489362,Medium,12,0.2553191489,,ddG_ML_float,1,aa_seq,POLG_PESV_theta0.2_2023-08-07_b03.npy,1,Stability
+PR40A_HUMAN_Tsuboyama_2023_1UZC_indels,PR40A_HUMAN_Tsuboyama_2023_1UZC_indels.csv,PR40A_HUMAN,Human,Homo sapiens,TYTWNTKEEAKQAFKELLKEKRVPSNASWEQAMKMIINDPRYSALAKLSEKKQAFNAYKVQTE,63,168,-1.362579422,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Pre-mRNA-processing factor 40 homolog A,Stability,cDNA display proteolysis,PR40A_HUMAN_2023-08-07_b03.a2m,1,63,63,0.3,0.2,63560,0.857,54,3663.8,67.84814815,Medium,16,0.2962962963,,ddG_ML_float,1,aa_seq,PR40A_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+PSAE_PICP2_Tsuboyama_2023_1PSE_indels,PSAE_PICP2_Tsuboyama_2023_1PSE_indels.csv,PSAE_PICP2,Prokaryote,Synechococcus sp,AIERGSKVKILRKESYWYGDVGTVASIDKSGIIYPVIVRFNKVNYNGFSGSAGGLNTNNFAEHELEVV,68,175,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Photosystem I reaction center subunit IV,Stability,cDNA display proteolysis,PSAE_PICP2_2023-08-07_b09.a2m,1,68,68,0.9,0.2,1785,0.868,59,130.7,2.215254237,Medium,9,0.1525423729,,ddG_ML_float,1,aa_seq,PSAE_PICP2_theta0.2_2023-08-07_b09.npy,1,Stability
+PTEN_HUMAN_Mighell_2018_indels,PTEN_HUMAN_Mighell_2018_indels.csv,PTEN_HUMAN,Human,Homo sapiens,MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV,403,314,-2.020820613,median,Mighell,A Saturation Mutagenesis Approach to Understanding PTEN Lipid Phosphatase Activity and Genotype-Phenotype Relationships,2018,10.1016/j.ajhg.2018.03.018,PTEN,"growth (surrogate for enzymatic activity/hydrolysis of lipid phosphates to restore PIP2, which affects proliferation rate)",lipid phosphatase activity,PTEN_HUMAN_full_11-26-2021_b01.a2m,1,403,403,0.1,0.2,19058,0.752,303,1425.3,4.703960396,Medium,52,0.1716171617,PTEN_HUMAN_Mighell_deletions_2018.csv,DMS_score,1,sequence,PTEN_HUMAN_theta_0.2.npy,0.1,Activity
+Q8EG35_SHEON_Campbell_2022_indels,Q8EG35_SHEON_Campbell_2022_indels.csv,Q8EG35_SHEON,Prokaryote,Shewanella oneidensis,MKNCLKMKNLLPALTITMAMSAVMALVVTPNAYASKWDEKMTPEQVEATLDKKFAEGNYSPKGADSCLMCHKKSEKVMDLFKGVHGAIDSSKSPMAGLQCEACHGPLGQHNKGGNEPMITFGKQSTLSADKQNSVCMSCHQDDKRMSWNGGHHDNADVACASCHQVHVAKDPVLSKNTEMEVCTSCHTKQKADMNKRSSHPLKWAQMTCSDCHNPHGSMTDSDLNKPSVNDTCYSCHAEKRGPKLWEHAPVTENCVTCHNPHGSVNDGMLKTRAPQLCQQCHASDGHASNAYLGNTGLGSNVGDNAFTGGRSCLNCHSQVHGSNHPSGKLLQR,333,331,0.003035782214,median,Campbell,Determinants of Multiheme Cytochrome Extracellular Electron Transfer Uncovered by Systematic Peptide Insertion,2022,10.1021/acs.biochem.2c00148,MtrA,extracellular electron transfer,survival assessment assay,Q8EG35_SHEON_b03.a2m,1,333,333,0.3,0.2,2866,0.778,259,1289.1,4.97722008,Medium,89,0.34362934,Q8EG35_SHEON_Campbell_2023.csv,selected_avg,1,mutated_sequence,Q8EG35_SHEON_b03_theta_0.2.npy,1,OrganismalFitness
+RAD_ANTMA_Tsuboyama_2023_2CJJ_indels,RAD_ANTMA_Tsuboyama_2023_2CJJ_indels.csv,RAD_ANTMA,Eukaryote,Antirrhinum majus,PWSAKENKAFERALAVYDKDTPDRWANVARAVEGRTPEEVKKHYEILVEDIKYI,54,97,-0.3943851731,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription factor RADIALIS,Stability,cDNA display proteolysis,RAD_ANTMA_2023-08-07_b01.a2m,1,54,54,0.1,0.2,423275,0.833,45,38133.9,847.42,High,27,0.6,,ddG_ML_float,1,aa_seq,RAD_ANTMA_theta0.2_2023-08-07_b01.npy,1,Stability
+RCD1_ARATH_Tsuboyama_2023_5OAO_indels,RCD1_ARATH_Tsuboyama_2023_5OAO_indels.csv,RCD1_ARATH,Eukaryote,Arabidopsis thaliana,PTLFAAISHKVAENDMLLINADYQQLRDKKMTRAEFVRKLRVIVGDDLLRSTITTLQ,57,124,-0.3828831078,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Inactive poly [ADP-ribose] polymerase RCD1,Stability,cDNA display proteolysis,RCD1_ARATH_2023-08-07_b02.a2m,1,57,57,0.2,0.2,6525,0.93,53,1578.5,29.78301887,Medium,2,0.03773584906,,ddG_ML_float,1,aa_seq,RCD1_ARATH_theta0.2_2023-08-07_b02.npy,1,Stability
+RD23A_HUMAN_Tsuboyama_2023_1IFY_indels,RD23A_HUMAN_Tsuboyama_2023_1IFY_indels.csv,RD23A_HUMAN,Human,Homo sapiens,SEYETMLTEIMSMGYERERVVAALRASYNNPHRAVEYLLTGIPG,44,120,-0.7285205281,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,UV excision repair protein RAD23 homolog A,Stability,cDNA display proteolysis,RD23A_HUMAN_2023-08-07_b04.a2m,1,44,44,0.4,0.2,100991,0.864,38,7912.9,208.2342105,High,21,0.5526315789,,ddG_ML_float,1,aa_seq,RD23A_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+RPC1_BP434_Tsuboyama_2023_1R69_indels,RPC1_BP434_Tsuboyama_2023_1R69_indels.csv,RPC1_BP434,Virus,Enterobacteria phage 434 (Bacteriophage 434),SISSRVKSKRIQLGLNQAELAQKVGTTQQSIEQLENGKTKRPRFLPELASALGVSVDWLLN,61,164,-1.349855239,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Repressor protein CI,Stability,cDNA display proteolysis,RPC1_BP434_2023-08-07_b05.a2m,1,61,61,0.5,0.2,820224,0.951,58,192520.2,3319.313793,High,73,1.25862069,,ddG_ML_float,1,aa_seq,RPC1_BP434_theta0.2_2023-08-07_b05.npy,1,Stability
+RS15_GEOSE_Tsuboyama_2023_1A32_indels,RS15_GEOSE_Tsuboyama_2023_1A32_indels.csv,RS15_GEOSE,Prokaryote,Geobacillus stearothermophilus,SPEVQIAILTEQINNLNEHLRVHKKDHHSRRGLLKMVGKRRRLLAYLRNKDVARYREIVEKLG,63,176,-0.1292928041,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Small ribosomal subunit protein uS15,Stability,cDNA display proteolysis,RS15_GEOSE_2023-08-07_b06.a2m,1,63,63,0.6,0.2,44428,1,63,4519.5,71.73809524,Medium,35,0.5555555556,,ddG_ML_float,1,aa_seq,RS15_GEOSE_theta0.2_2023-08-07_b06.npy,1,Stability
+S22A1_HUMAN_Yee_2023_abundance_indels,S22A1_HUMAN_Yee_2023_abundance_indels.csv,S22A1_HUMAN,Human,Homo sapiens,PTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFIALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT,553,430,-0.75,manual,Yee,The full spectrum of OCT1 (SLC22A1) mutations bridges transporter biophysics to drug pharmacogenomics,2023,10.1101/2023.06.06.543963,Oct1,abundance,FACS,S22A1_HUMAN_2023-08-07_b02.a2m,1,553,553,0.2,0.2,198790,0.807,446,32557.5,72.99887892,Medium,485,1.087443946,543963_file04.xlsx,GFP_score,1,mutated_sequence,S22A1_HUMAN_theta0.2_2023-08-07_b02.npy,1,Expression
+S22A1_HUMAN_Yee_2023_activity_indels,S22A1_HUMAN_Yee_2023_activity_indels.csv,S22A1_HUMAN,Human,Homo sapiens,PTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFIALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT,553,490,1,manual,Yee,The full spectrum of OCT1 (SLC22A1) mutations bridges transporter biophysics to drug pharmacogenomics,2023,10.1101/2023.06.06.543963,Oct1,uptake of cytotoxic substrate,Growth,S22A1_HUMAN_2023-08-07_b02.a2m,1,553,553,0.2,0.2,198790,0.807,446,32557.5,72.99887892,Medium,485,1.087443946,543963_file04.xlsx,SM73_1_score,-1,mutated_sequence,S22A1_HUMAN_theta0.2_2023-08-07_b02.npy,1,Activity
+SAV1_MOUSE_Tsuboyama_2023_2YSB_indels,SAV1_MOUSE_Tsuboyama_2023_2YSB_indels.csv,SAV1_MOUSE,Eukaryote,Mus musculus,GEDLPLPPGWSVDWTMRGRKYYIDHNTNTTHWSHPLESGPSSG,43,86,-0.6280556038,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Protein salvador homolog 1,Stability,cDNA display proteolysis,SAV1_MOUSE_2023-08-07_b06.a2m,1,43,43,0.6,0.2,177542,0.791,34,4627.6,136.1058824,High,14,0.4117647059,,ddG_ML_float,1,aa_seq,SAV1_MOUSE_theta0.2_2023-08-07_b06.npy,1,Stability
+SDA_BACSU_Tsuboyama_2023_1PV0_indels,SDA_BACSU_Tsuboyama_2023_1PV0_indels.csv,SDA_BACSU,Prokaryote,Bacillus subtilis,MRKLSDELLIESYFKATEMNLNRDFIELIENEIKRRSLGHIISV,44,127,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Sporulation inhibitor sda,Stability,cDNA display proteolysis,SDA_BACSU_2023-08-07_b05.a2m,1,44,44,0.5,0.2,1953,0.886,39,876.8,22.48205128,Medium,4,0.1025641026,,ddG_ML_float,1,aa_seq,SDA_BACSU_theta0.2_2023-08-07_b05.npy,1,Stability
+SOX30_HUMAN_Tsuboyama_2023_7JJK_indels,SOX30_HUMAN_Tsuboyama_2023_7JJK_indels.csv,SOX30_HUMAN,Human,Homo sapiens,RPMNAFMVWARIHRPALAKANPAANNAEISVQLGLEWNKLSEEQKKPYYDEAQKIKE,57,109,-0.3216404755,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription factor SOX-30,Stability,cDNA display proteolysis,SOX30_HUMAN_2023-08-07_b03.a2m,1,57,57,0.3,0.2,158104,0.982,56,14909.6,266.2428571,High,36,0.6428571429,,ddG_ML_float,1,aa_seq,SOX30_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+SPG2_STRSG_Tsuboyama_2023_5UBS_indels,SPG2_STRSG_Tsuboyama_2023_5UBS_indels.csv,SPG2_STRSG,Prokaryote,Streptococcus sp. group G,MTFKLIINGKTLKGETTTEAVDAATAEKVFKQYFNDNGIDGEWTYDDATKTFTITE,56,148,-1.000627629,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Immunoglobulin G-binding protein G,Stability,cDNA display proteolysis,SPG2_STRSG_2023-08-07_b03.a2m,1,56,56,0.3,0.2,39899,0.75,42,2567.6,61.13333333,Medium,6,0.1428571429,,ddG_ML_float,1,aa_seq,SPG2_STRSG_theta0.2_2023-08-07_b03.npy,1,Stability
+SPTN1_CHICK_Tsuboyama_2023_1TUD_indels,SPTN1_CHICK_Tsuboyama_2023_1TUD_indels.csv,SPTN1_CHICK,Eukaryote,Gallus gallus,RQGFVPAAYVKKLDSGTGKELVLALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVN,60,129,-2.360476078,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,"Spectrin alpha chain, non-erythrocytic 1",Stability,cDNA display proteolysis,SPTN1_CHICK_2023-08-07_b03.a2m,1,60,60,0.3,0.2,420793,0.933,56,15051.5,268.7767857,High,47,0.8392857143,,ddG_ML_float,1,aa_seq,SPTN1_CHICK_theta0.2_2023-08-07_b03.npy,1,Stability
+SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels,SQSTM_MOUSE_Tsuboyama_2023_2RRU_indels.csv,SQSTM_MOUSE,Eukaryote,Mus musculus,RLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQ,40,111,-0.8554856463,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Sequestosome-1,Stability,cDNA display proteolysis,SQSTM_MOUSE_2023-08-07_b05.a2m,1,40,40,0.5,0.2,34660,0.925,37,3244.5,87.68918919,Medium,13,0.3513513514,,ddG_ML_float,1,aa_seq,SQSTM_MOUSE_theta0.2_2023-08-07_b05.npy,1,Stability
+SR43C_ARATH_Tsuboyama_2023_2N88_indels,SR43C_ARATH_Tsuboyama_2023_2N88_indels.csv,SR43C_ARATH,Eukaryote,Arabidopsis thaliana,AVAESVIGKRVGDDGKTIEYLVKWTDMSDATWEPQDNVDSTLVLLYQQ,48,135,-1.591761235,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,"Signal recognition particle 43 kDa protein, chloroplastic",Stability,cDNA display proteolysis,SR43C_ARATH_2023-08-07_b02.a2m,1,48,48,0.2,0.2,101118,0.917,44,12180.6,276.8318182,High,26,0.5909090909,,ddG_ML_float,1,aa_seq,SR43C_ARATH_theta0.2_2023-08-07_b02.npy,1,Stability
+SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels,SRBS1_HUMAN_Tsuboyama_2023_2O2W_indels.csv,SRBS1_HUMAN,Human,Homo sapiens,GIDPFTGEAIAKFNFNGDTQVEMSFRKGERITLLRQVDENWYEGRIPGTSRQGIFPITYVDVIKRPL,67,154,-1.169019411,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Sorbin and SH3 domain-containing protein 1,Stability,cDNA display proteolysis,SRBS1_HUMAN_2023-08-07_b03.a2m,1,67,67,0.3,0.2,708655,0.836,56,22689,405.1607143,High,60,1.071428571,,ddG_ML_float,1,aa_seq,SRBS1_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels,TCRG1_MOUSE_Tsuboyama_2023_1E0L_indels.csv,TCRG1_MOUSE,Eukaryote,Mus musculus,GATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQELK,37,99,-1.2,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Transcription elongation regulator 1,Stability,cDNA display proteolysis,TCRG1_MOUSE_2023-08-07_b08.a2m,1,37,37,0.8,0.2,43363,0.865,32,2819.7,88.115625,Medium,14,0.4375,,ddG_ML_float,1,aa_seq,TCRG1_MOUSE_theta0.2_2023-08-07_b08.npy,1,Stability
+THO1_YEAST_Tsuboyama_2023_2WQG_indels,THO1_YEAST_Tsuboyama_2023_2WQG_indels.csv,THO1_YEAST,Eukaryote,Saccharomyces cerevisiae,SADYSSLTVVQLKDLLTKRNLSVGGLKNEWVQRLIKDDEES,41,82,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Protein THO1,Stability,cDNA display proteolysis,THO1_YEAST_2023-08-07_b05.a2m,1,41,41,0.5,0.2,54877,0.805,33,8516.7,258.0818182,High,15,0.4545454545,,ddG_ML_float,1,aa_seq,THO1_YEAST_theta0.2_2023-08-07_b05.npy,1,Stability
+TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels,TNKS2_HUMAN_Tsuboyama_2023_5JRT_indels.csv,TNKS2_HUMAN,Human,Homo sapiens,FSITQFVRNLGLEHLMDIFEREQITLRVLVEMGHKELKEIGINAYGHREKLIKGVERLI,59,171,-0.9451205822,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Poly [ADP-ribose] polymerase tankyrase-2,Stability,cDNA display proteolysis,TNKS2_HUMAN_2023-08-07_b03.a2m,1,59,59,0.3,0.2,270654,0.949,56,11206,200.1071429,High,26,0.4642857143,,ddG_ML_float,1,aa_seq,TNKS2_HUMAN_theta0.2_2023-08-07_b03.npy,1,Stability
+UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels,UBE4B_HUMAN_Tsuboyama_2023_3L1X_indels.csv,UBE4B_HUMAN,Human,Homo sapiens,DAPDEFRDPLMDTLMTDPVRLPSGTIMDRSIILRHLLNSPTDPFNRQTLTESMLEPVPELKEQIQAWMR,69,147,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Ubiquitin conjugation factor E4 B,Stability,cDNA display proteolysis,UBE4B_HUMAN_2023-08-07_b04.a2m,1,69,69,0.4,0.2,310943,0.928,64,34185.4,534.146875,High,52,0.8125,,ddG_ML_float,1,aa_seq,UBE4B_HUMAN_theta0.2_2023-08-07_b04.npy,1,Stability
+UBR5_HUMAN_Tsuboyama_2023_1I2T_indels,UBR5_HUMAN_Tsuboyama_2023_1I2T_indels.csv,UBR5_HUMAN,Human,Homo sapiens,HRQALGERLYPRVQAMQPAFASKITGMLLELSPAQLLLLLASEDSLRARVDEAMELII,58,156,-0.4460165437,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,E3 ubiquitin-protein ligase UBR5,Stability,cDNA display proteolysis,UBR5_HUMAN_2023-08-07_b05.a2m,1,58,58,0.5,0.2,17888,0.966,56,1031.7,18.42321429,Medium,14,0.25,,ddG_ML_float,1,aa_seq,UBR5_HUMAN_theta0.2_2023-08-07_b05.npy,1,Stability
+VG08_BPP22_Tsuboyama_2023_2GP8_indels,VG08_BPP22_Tsuboyama_2023_2GP8_indels.csv,VG08_BPP22,Virus,Salmonella phage P22 (Bacteriophage P22),ITGDVSAANKDAIRKQMDAAASKGDVETYRKLKAKLKGIR,40,101,-0.2013306011,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Scaffolding protein,Stability,cDNA display proteolysis,VG08_BPP22_2023-08-07_b05.a2m,1,40,40,0.5,0.2,102464,0.875,35,12963.6,370.3885714,High,13,0.3714285714,,ddG_ML_float,1,aa_seq,VG08_BPP22_theta0.2_2023-08-07_b05.npy,1,Stability
+VILI_CHICK_Tsuboyama_2023_1YU5_indels,VILI_CHICK_Tsuboyama_2023_1YU5_indels.csv,VILI_CHICK,Eukaryote,Gallus gallus,KLETFPLDVLVNTAAEDLPRGVDPSRKENHLSDEDFKAVFGMTRSAFANLPLWKQQNLKKEKGLF,65,156,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Villin-1,Stability,cDNA display proteolysis,VILI_CHICK_2023-08-07_b01.a2m,1,65,65,0.1,0.2,254210,0.769,50,46507.8,930.156,High,19,0.38,,ddG_ML_float,1,aa_seq,VILI_CHICK_theta0.2_2023-08-07_b01.npy,1,Stability
+VRPI_BPT7_Tsuboyama_2023_2WNM_indels,VRPI_BPT7_Tsuboyama_2023_2WNM_indels.csv,VRPI_BPT7,Virus,Escherichia phage T7 (Bacteriophage T7),SLSVDNKKFWATVESSEHSFEVPIYAETLDEALELAEWQYVPAGFEVTRVRPCVAP,56,154,-1.1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,Bacterial RNA polymerase inhibitor,Stability,cDNA display proteolysis,VRPI_BPT7_2023-08-07_b02.a2m,1,56,56,0.2,0.2,6266,0.875,49,1555.8,31.75102041,Medium,3,0.0612244898,,ddG_ML_float,1,aa_seq,VRPI_BPT7_theta0.2_2023-08-07_b02.npy,1,Stability
+YNZC_BACSU_Tsuboyama_2023_2JVD_indels,YNZC_BACSU_Tsuboyama_2023_2JVD_indels.csv,YNZC_BACSU,Prokaryote,Bacillus subtilis,MISNAKIARINELAAKAKAGVITEEEKAEQQKLRQEYLK,39,104,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-69,UPF0291 protein YnzC,Stability,cDNA display proteolysis,YNZC_BACSU_2023-08-07_b07.a2m,1,39,39,0.7,0.2,7116,0.974,38,1588.3,41.79736842,Medium,13,0.3421052632,,ddG_ML_float,1,aa_seq,YNZC_BACSU_theta0.2_2023-08-07_b07.npy,1,Stability
\ No newline at end of file
diff --git a/reference_files/DMS_substitutions.csv b/reference_files/DMS_substitutions.csv
new file mode 100644
index 0000000..cc89480
--- /dev/null
+++ b/reference_files/DMS_substitutions.csv
@@ -0,0 +1,218 @@
+DMS_id,DMS_filename,UniProt_ID,taxon,source_organism,target_seq,seq_len,includes_multiple_mutants,DMS_total_number_mutants,DMS_number_single_mutants,DMS_number_multiple_mutants,DMS_binarization_cutoff,DMS_binarization_method,first_author,title,year,jo,region_mutated,molecule_name,selection_assay,selection_type,MSA_filename,MSA_start,MSA_end,MSA_len,MSA_bitscore,MSA_theta,MSA_num_seqs,MSA_perc_cov,MSA_num_cov,MSA_N_eff,MSA_Neff_L,MSA_Neff_L_category,MSA_num_significant,MSA_num_significant_L,raw_DMS_filename,raw_DMS_phenotype_name,raw_DMS_directionality,raw_DMS_mutant_column,weight_file_name,pdb_file,pdb_range,ProteinGym_version,raw_mut_offset,coarse_selection_type
+A0A140D2T1_ZIKV_Sourisseau_2019,A0A140D2T1_ZIKV_Sourisseau_2019.csv,A0A140D2T1_ZIKV,Virus,Zika virus (ZIKV),MKNPKKKSGGFRIVNMLKRGVARVNPLGGLKRLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKERKRRGADTSIGIIGLLLTTAMAAEITRRGSAYYMYLDRSDAGKAISFATTLGVNKCHVQIMDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVTLPSHSTRKLQTRSQTWLESREYTKHLIKVENWIFRNPGFALVAVAIAWLLGSSTSQKVIYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGYETDENRAKVEVTPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAEMDGAKGKLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQYAGTDGPCKIPVQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVSADVGCSVDFSKKETRCGTGVFIYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEEGICGISSVSRMENIMWKSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGKSYFVRAAKTNNSFVVDGDTLKECPLEHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLECDPAVIGTAVKGREAAHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGVEESDLIIPKSLAGPLSHHNTREGYRTQVKGPWHSEELEIRFEECPGTKVYVEETCGTRGPSLRSTTASGRVIEEWCCRECTMPPLSFRAKDGCWYGMEIRPRKEPESNLVRSMVTAGSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIMSTSMAVLVVMILGGFSMSDLAKLVILMGATFAEMNTGGDVAHLALVAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQTAISALEGDLMVLINGFALAWLAIRAMAVPRTDNIALPILAALTPLARGTLLVAWRAGLATCGGIMLLSLKGKGSVKKNLPFVMALGLTAVRVVDPINVVGLLLLTRSGKRSWPPSEVLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKDAEVTGNSPRLDVALDESGDFSLVEEDGPPMREIILKVVLMAICGMNPIAIPFAAGAWYVYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMWHVTKGAALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGLSEVQLLAVPPGERARNIQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAITQGKREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKKRLRTVILAPTRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLYIMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEVPERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKTKNQEWDFVITTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQRRGRIGRNPNKPGDEYMYGGGCAETDEGHAHWLEARMLLDNIYLQDGLIASLYRPEADKVAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIMEDSVPAEVWTKYGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAALGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSPQDNQMAIIIMVAVGLLGLITANELGWLERTKNDIAHLMGRREEGATMGFSMDIDLRPASAWAIYAALTTLITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDLGVPLLMMGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIVVTDIDTMTIDPQVEKKMGQVLLIAVAISSAVLLRTAWGWGEAGALITAATSTLWEGSPNKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQMSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGKVVDLGCGRGGWSYYAATIRKVQEVRGYTKGGPGHEEPMLVQSYGWNIVRLKSGVDVFHMAAEPCDTLLCDIGESSSSPEVEETRTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMMETMERLQRRHGGGLVRVPLSRNSTHEMYWVSGAKSNIIKSVSTTSQLLLGRMDGPRRPVKYEEDVNLGSGTRAVASCAEAPNMKIIGRRIERIRNEHAETWFLDENHPYRTWAYHGSYEAPTQGSASSLVNGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPDPQEGTRQVMNIVSSWLWKELGKRKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAVEAVNDPRFWALVDREREHHLRGECHSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGARFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYILEEMNRAPGGKMYADDTAGWDTRISKFDLENEALITNQMEEGHRTLALAVIKYTYQNKVVKVLRPAEGGKTVMDIISRQDQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRKPEKVTRWLQSNGWDRLKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWSNWEEVPFCSHHFNKLYLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRRDLRLMANAICSAVPVDWVPTGRTTWSIHGKGEWMTTEDMLMVWNRVWIEENDHMEDKTPVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKDTVNMVRRIIGDEEKYMDYLSTQVRYLGEEGSTPGVL,3423,FALSE,9576,9576,0,0.04324892146,median,Sourisseau,Deep Mutational Scanning Comprehensively Maps How Zika Envelope Protein Mutations Affect Viral Growth and Antibody Escape,2019,10.1128/JVI.01291-19,291-794,Zika virus env,Viral replication,Growth,A0A140D2T1_ZIKV_theta0.99_281-804_11-26-2021_b02.a2m,281,804,524,0.2,0.01,16501,0.948,497,1357.9,2.732193159,medium,329,0.661971831,A0A140D2T1_ZIKV_Sourisseau_growth_2019.csv,effect,1,mutant,A0A140D2T1_ZIKV_theta_0.01.npy,A0A140D2T1_ZIKV.pdb,291-794,0.1,,OrganismalFitness
+A0A192B1T2_9HIV1_Haddox_2018,A0A192B1T2_9HIV1_Haddox_2018.csv,A0A192B1T2_9HIV1,Virus,Human immunodeficiency virus 1,MRVKGIQMNSQHLLRWGIMILGMIMICSVAGNLWVTVYYGVPVWKDAETTLFCASDAKAYDAEVHNIWATHACVPTDPNPQEINLENVTEEFNMWKNNMVEQMHTDIISLWDQGLKPCVKLTPLCVTLDCHNVTYNITSDMKEEITNCSYNVTTVIRDKKQKVSSLFYKLDVVQIGGNNRTNSQYRLINCNTSAITQACPKVTFEPIPIHYCAPAGFAILKCKDEKFNGTGLCKNVSTVQCTHGIKPVVSTQLLLNGSLAEGEVRIRSENITNNAKNIIVQLASPVTINCIRPNNNTRKSVHLGPGQAFYATDGIIGEIRQAHCNVSKKEWNSTLQKVANQLRPYFKNNTIIKFANSSGGDLEITTHSFNCGGEFFYCNTSGLFNSTWEFNSTWNNSNSTENITLQCRIKQIINMWQRAGQAIYAPPIPGVIRCKSNITGLILTRDGGSNKNTSETFRPGGGDMRDNWRSELYKYKVVKIEPIGVAPTRAKRRVVEREKRAVGIGAVFIGFLGAAGSTMGAASVTLTVQARQLLSGIVQQQSNLLRAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLICTTNVPWNSSWSNKSQDEIWGNMTWLQWDKEVSNYTQIIYTLIEESQNQQEKNEQDLLALDKWASLWNWFNISQWLWYIKIFIIIVGGLIGLRIVFAVLSVINRVRQGYSPLSFQTRTPNPGELDRPGRIEEEGGEQDRGRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILIATRTVELLGHSSLKGLRLGWESLKYLGNLLVYWGRELKISAINLCDTIAIAVAGWTDRVIELGQRLCRAILHIPRRIRQGFERALL,852,FALSE,12577,12577,0,-2.2,manual,Haddox,Mapping mutational effects along the evolutionary landscape of HIV envelope,2018,10.7554/eLife.34420,30-691,HIV env (BF520),Viral replication,Growth,A0A192B1T2_9HIV1_theta0.99_full_11-26-2021_b09.a2m,1,852,852,0.9,0.01,74854,0.986,840,36319.9,43.23797619,medium,2382,2.835714286,A0A192B1T2_9HIV1_Haddox_2018.csv,fitness,1,mutant,A0A192B1T2_9HIV1_theta_0.01.npy,A0A192B1T2_9HIV1.pdb,1-852,0.1,,OrganismalFitness
+A0A1I9GEU1_NEIME_Kennouche_2019,A0A1I9GEU1_NEIME_Kennouche_2019.csv,A0A1I9GEU1_NEIME,Prokaryote,Neisseria meningitidis,FTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGEWPGDNSSAGVATSADIKGKYVQSVTVANGVITAQMASSNVNNEIKSKKLSLWAKRQNGSVKWFCGQPVTRTTATATDVAAANGKTDDKINTKHLPSTCRDDSSAS,161,FALSE,922,922,0,0.141,median,Kennouche,Deep mutational scanning of the Neisseria meningitidis major pilin reveals the importance of pilus tip-mediated adhesion,2019,10.15252/embj.2019102145,1-161,pilin (PilE),"piliation (20D9 anti-pilus monoclonal Ab), aggregation, adhesion (human umbilical vein endothelial cells (HUVECs))",,A0A1I9GEU1_NEIME_full_11-26-2021_b08.a2m,1,161,161,0.8,0.2,5553,0.857,138,2183.6,15.82318841,medium,72,0.5217391304,A0A1I9GEU1_NEIME_Kennouche_2019.csv,piliation_log2_ratio,1,mutants,A0A1I9GEU1_NEIME_theta_0.2.npy,A0A1I9GEU1_NEIME.pdb,1-161,0.1,,Activity
+A0A247D711_LISMN_Stadelmann_2021,A0A247D711_LISMN_Stadelmann_2021.csv,A0A247D711_LISMN,Prokaryote,Listeria monocytogenes,MNINDLIREIKNKDYTVKLSGTDSNSITQLIIRVNNDGNEYVISESENESIVEKFISAFKNGWNQEYEDEEEFYNDMQTITLKSELN,87,FALSE,1653,1653,0,-0.0155627327,median,Stadelmann,A deep mutational scanning platform to characterize the fitness landscape of anti-CRISPR proteins,2021,10.1101/2021.08.21.457204,1-87,Anti-CRISPR protein AcrIIA4,activity against SpyCas9 inducing an RFP reporter,Flow cytometry,A0A247D711_LISMN_full_b0.3.a2m,1,87,87,0.2,0.2,1316890,1,87,188739.9,2169.424138,High,209,2.402298851,A0A247D711_LISMN_Stadelmann_2021.csv,mean_prediction,1,mutant,A0A247D711_LISMN_b03_theta_0.2.npy,A0A247D711_LISMN.pdb,1-87,1,,Activity
+A0A2Z5U3Z0_9INFA_Doud_2016,A0A2Z5U3Z0_9INFA_Doud_2016.csv,A0A2Z5U3Z0_9INFA,Virus,Influenza A virus (A/WSN/1933(H1N1)),MKAKLLVLLYAFVATDADTICIGYHANNSTDTVDTILEKNVAVTHSVNLLEDSHNGKLCKLKGIAPLQLGKCNITGWLLGNPECDSLLPARSWSYIVETPNSENGACYPGDLIDYEELREQLSSVSSLERFEIFPKESSWPNHTFNGVTVSCSHRGKSSFYRNLLWLTKKGDSYPKLTNSYVNNKGKEVLVLWGVHHPSSSDEQQSLYSNGNAYVSVASSNYNRRFTPEIAARPKVRDQHGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGFESGIITSNASMHECNTKCQTPQGAINSNLPFQNIHPVTIGECPKYVRSTKLRMVTGLRNIPSIQYRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTAVGKEFNNLEKRMENLNKKVDDGFLDIWTYNAELLVLLENERTLDFHDLNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNECMESVRNGTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVLLVSLGAISFWMCSNGSLQCRICI,565,FALSE,10715,10715,0,-2.239942981,median,Doud,Accurate Measurement of the Effects of All Amino-Acid Mutations on Influenza Hemagglutinin,2016,10.3390/v8060155,2-565,Influenza hemagglutinin,viral replication,Growth,A0A2Z5U3Z0_9INFA_theta0.99_full_11-26-2021_b09.a2m,1,565,565,0.9,0.01,57581,0.968,547,9809.4,17.93308958,medium,925,1.691042048,A0A2Z5U3Z0_9INFA_Doud_2016.csv,transformed_pref,1,mutant,A0A2Z5U3Z0_9INFA_theta_0.01.npy,A0A2Z5U3Z0_9INFA.pdb,1-565,0.1,,OrganismalFitness
+A0A2Z5U3Z0_9INFA_Wu_2014,A0A2Z5U3Z0_9INFA_Wu_2014.csv,A0A2Z5U3Z0_9INFA,Virus,Influenza A virus (A/WSN/1933(H1N1)),MKAKLLVLLYAFVATDADTICIGYHANNSTDTVDTILEKNVAVTHSVNLLEDSHNGKLCKLKGIAPLQLGKCNITGWLLGNPECDSLLPARSWSYIVETPNSENGACYPGDLIDYEELREQLSSVSSLERFEIFPKESSWPNHTFNGVTVSCSHRGKSSFYRNLLWLTKKGDSYPKLTNSYVNNKGKEVLVLWGVHHPSSSDEQQSLYSNGNAYVSVASSNYNRRFTPEIAARPKVRDQHGRMNYYWTLLEPGDTIIFEATGNLIAPWYAFALSRGFESGIITSNASMHECNTKCQTPQGAINSNLPFQNIHPVTIGECPKYVRSTKLRMVTGLRNIPSIQYRGLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMNTQFTAVGKEFNNLEKRMENLNKKVDDGFLDIWTYNAELLVLLENERTLDFHDLNVKNLYEKVKSQLKNNAKEIGNGCFEFYHKCDNECMESVRNGTYDYPKYSEESKLNREKIDGVKLESMGVYQILAIYSTVASSLVLLVSLGAISFWMCSNGSLQCRICI,565,FALSE,2350,2350,0,0.0947955855,median,Wu,High-throughput profiling of influenza A virus hemagglutinin gene at single-nucleotide resolution,2014,10.1038/srep04942,6-560,Influenza hemagglutinin,Viral replication,Growth,A0A2Z5U3Z0_9INFA_theta0.99_full_11-26-2021_b09.a2m,1,565,565,0.9,0.01,57581,0.968,547,9809.4,17.93308958,medium,925,1.691042048,A0A2Z5U3Z0_9INFA_Wu_2014.csv,RF Index,1,mutant,A0A2Z5U3Z0_9INFA_theta_0.01.npy,A0A2Z5U3Z0_9INFA.pdb,1-565,0.1,,OrganismalFitness
+A4_HUMAN_Seuma_2022,A4_HUMAN_Seuma_2022.csv,A4_HUMAN,Human,Homo sapiens,MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMHMNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN,770,TRUE,14811,796,14015,-2,manual,Seuma,"An atlas of amyloid aggregation: the impact of substitutions, insertions, deletions and truncations on amyloid beta fibril nucleation",2022,10.1038/s41467-022-34742-3,672-713,APP,aggregation,survival assessment assay,A4_HUMAN_2023-08-07_b01.a2m,1,770,770,0.1,0.2,5272,0.987,760,99.3,0.1306578947,Low,0,0,MS_BL_BB_indels_processed_data.tsv,nscore,1,mutant,A4_HUMAN_theta0.2_2023-08-07_b01.npy,A4_HUMAN.pdb,1-770,1,,Stability
+A4D664_9INFA_Soh_2019,A4D664_9INFA_Soh_2019.csv,A4D664_9INFA,Virus,Influenza A virus (A/green-winged teal/Ohio/175/1986(H2N1)),MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPALRMKWMMAMKYPITADKRIMEMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGPTTSTVHYPKVYKTYFEKVERLKHGTFGPVHFRNQVKIRRRVDINPGHADLSAKEAQDVIMEVVFPNEVGARILTSESQLTITREKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAGGTSSVYIEVLHLTQGTCWEQMYTPGGEVRNDDVDQSLIIAARNIVRRATVSADPLASLLEMCHSTQIGGIRMVDILRQNPTEEQAVDICKAAMGLRISSSFSFGGFTFKRTSGSSVKREEEVLTGNLQTLKIRVHEGYEEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAIIVALVFSQEDCMIKAVRGDLNFVNRANQRLNPMHQLLRHFQKDAKVLFQNWGIEPIDNVMGMIGILPDMTPSTEMSLRGIRVSKMGVDEYSSTERVVVSIDRFLRVRDQRGNVLLSPEEVSETQGTEKLTITYSSSMMWEINGPESVLVNTYQWIIRNWETVKIQWSQDPTMLYNKMEFEPFQSLVPKAARGQYSGFVRTLFQQMRDVLGTFDTVQIIKLLPFAAAPPEQSRMQFSSLTVNVRGSGMRILVRGNSPVFNYNKATKRLTVLGKDAGALTEDPDEGTAGVESAVLRGFLILGKEDKRYGPALSINELSNLAKGEKANVLIGQGDVVLVMKRKRDSSILTDSQTATKRIRMAIN,759,FALSE,14421,14421,0,0.2170105627,median,Soh,Comprehensive mapping of adaptation of the avian influenza polymerase protein PB2 to humans,2019,10.7554/eLife.45079,1-759,Influenza polymerase basic protein 2,Viral replication (avian cells: CCL141 (duck)),Growth,A4D664_9INFA_theta0.99_full_11-26-2021_b09.a2m,1,759,759,0.9,0.01,26683,1,759,1730.2,2.279578393,medium,3736,4.92226614,A4D664_9INFA_Soh_2019.csv,effectCCL141,1,mutant,A4D664_9INFA_theta_0.01.npy,A4D664_9INFA.pdb,1-759,0.1,,OrganismalFitness
+A4GRB6_PSEAI_Chen_2020,A4GRB6_PSEAI_Chen_2020.csv,A4GRB6_PSEAI,Prokaryote,Pseudomonas aeruginosa,MFKLLSKLLVYLTASIMAIASPLAFSVDSSGEYPTVSEIPVGEVRLYQIADGVWSHIATQSFDGAVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEVEGNEIPTHSLEGLSSSGDAVRFGPVELFYPGAAHSTDNLIVYVPSASVLYGGCAIYELSRTSAGNVADADLAEWPTSIERIQQHYPEAQFVIPGHGLPGGLDLLKHTTNVVKAHTNRSVVE,266,FALSE,5004,5004,0,-2.1,manual,Chen,"Comprehensive exploration of the translocation, stability and substrate recognition requirements in VIM-2 lactamase",2020,10.7554/eLife.56707,1-266,Beta-lactamase VIM-2,"drug resistance (128/16/2.0 ug/mL ampicillin, 4.0/0.5 ug/mL cefotaxime, 0.031 ug/mL meropenem @ 25C, 37C)",Antibiotics resistance,A4GRB6_PSEAI_full_11-26-2021_b03.a2m,1,266,266,0.3,0.2,108496,0.726,193,31234.2,161.8352332,high,317,1.642487047,A4GRB6_PSEAI_Chen_2020.csv,0.031ug_mL_MEM_37C,1,mutant,A4GRB6_PSEAI_theta_0.2.npy,A4GRB6_PSEAI.pdb,1-266,0.1,,OrganismalFitness
+AACC1_PSEAI_Dandage_2018,AACC1_PSEAI_Dandage_2018.csv,AACC1_PSEAI,Prokaryote,Pseudomonas aeruginosa,MLRSSNDVTQQGSRPKTKLGGSSMGIIRTCRLGPDQVKSMRAALDLFGREFGDVATYSQHQPDSDYLGNLLRSKTFIALAAFDQEAVVGALAAYVLPKFEQPRSEIYIYDLAVSGEHRRQGIATALINLLKHEANALGAYVIYVQADYGDDPAVALYTKLGIREEVMHFDIDPSTAT,177,FALSE,1801,1801,0,0.7172234411,median,Dandage,Differential strengths of molecular determinants guide environment specific mutational fates,2018,10.1371/journal.pgen.1007419,12-172,GMR (aacC1),"Antibiotic resistance under: heat/cold resistance (32C, 37C (ref), 42C), chemical stability (chemical chaperones TMAO, glycerol), antibiotic resistance (gentamicin), or combo",Antibiotics resistance,AACC1_PSEAI_full_04-29-2022_b03.a2m,1,177,177,0.3,0.2,539868,0.746,132,170256.3,1289.820455,high,235,1.78030303,AACC1_PSEAI_Dandage_2018.csv,30C,1,Mutation,AACC1_PSEAI_theta_0.2.npy,AACC1_PSEAI.pdb,1-177,0.1,,OrganismalFitness
+ACE2_HUMAN_Chan_2020,ACE2_HUMAN_Chan_2020.csv,ACE2_HUMAN,Human,Homo sapiens,MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSGENPYASIDISKGENNPGFQNTDDVQTSF,805,FALSE,2223,2223,0,-0.266564268,median,Chan,Engineering human ACE2 to optimize binding to the spike protein of SARS coronavirus 2,2020,10.1126/science.abc0870,19-518,ACE2,Binding affinity,Flow Cytometry Assay,ACE2_HUMAN_2023-10-12_b05.a2m,1,805,805,0.5,0.2,11106,0.743,598,1506.7,2.519565217,Medium,349,0.5836120401,,score,1,mutant,ACE2_HUMAN_theta0.2_2023-10-12_b05.npy,ACE2_HUMAN.pdb,1-805,1,,Binding
+ADRB2_HUMAN_Jones_2020,ADRB2_HUMAN_Jones_2020.csv,ADRB2_HUMAN,Human,Homo sapiens,MGQPGNGSAFLLAPNGSHAPDHDVTQERDEVWVVGMGIVMSLIVLAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFGNFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIVSGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASSIVSFYVPLVIMVFVYSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLGIIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRIAFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGTEDFVGHQGTVPSDNIDSQGRNCSTNDSLL,413,FALSE,7800,7800,0,1.859961867,median,Jones,Structural and Functional Characterization of G Protein-Coupled Receptors with Deep Mutational Scanning,2020,10.7554/eLife.54895,2-413,ADRB2,"transcription (luciferase reporter, isoproterenol (beta2AR agonist)-induced)",Receptor activity,ADRB2_HUMAN_full_11-26-2021_b03.a2m,1,413,413,0.3,0.2,204722,0.712,294,25459.6,86.59727891,medium,234,0.7959183673,ADRB2_HUMAN_Jones_2020.csv,0.625,1,mutant_id,ADRB2_HUMAN_theta_0.2.npy,ADRB2_HUMAN.pdb,1-413,0.1,,Activity
+AICDA_HUMAN_Gajula_2014_3cycles,AICDA_HUMAN_Gajula_2014_3cycles.csv,AICDA_HUMAN,Human,Homo sapiens,MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHERTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL,198,FALSE,209,209,0,1,manual,Gajula,High-throughput mutagenesis reveals functional determinants for DNA targeting by activation-induced deaminase,2014,10.1093/nar/gku689,113-123,AID,Enzymatic activity,bulk RNA-sequencing,AICDA_HUMAN_2023-08-07_b01.a2m,1,198,198,0.1,0.2,18148,0.879,174,3340,19.1954023,Medium,101,0.5804597701,urn_mavedb_00000106-c-1_scores.csv,DMS_score,1,mutant,AICDA_HUMAN_theta0.2_2023-08-07_b01.npy,AICDA_HUMAN.pdb,1-198,1,,Activity
+AMFR_HUMAN_Tsuboyama_2023_4G3O,AMFR_HUMAN_Tsuboyama_2023_4G3O.csv,AMFR_HUMAN,Human,Homo sapiens,YFQGQLNAMAHQIQEMFPQVPYHLVLQDLQLTRSVEITTDNILEGRI,47,TRUE,2972,820,2152,-1.504736022,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-47,E3 ubiquitin-protein ligase AMFR,Stability,cDNA display proteolysis,AMFR_HUMAN_2023-08-07_b04.a2m,1,47,47,0.4,0.2,17787,0.872,41,1166.9,28.46097561,Medium,12,0.2926829268,Tsuboyama2023_Dataset2_Dataset3,ddG_ML_float,1,mut_type,AMFR_HUMAN_theta0.2_2023-08-07_b04.npy,AMFR_HUMAN.pdb,1-47,1,,Stability
+AMIE_PSEAE_Wrenbeck_2017,AMIE_PSEAE_Wrenbeck_2017.csv,AMIE_PSEAE,Prokaryote,Pseudomonas aeruginosa,MRHGDISSSNDTVGVAVVNYKMPRLHTAAEVLDNARKIAEMIVGMKQGLPGMDLVVFPEYSLQGIMYDPAEMMETAVAIPGEETEIFSRACRKANVWGVFSLTGERHEEHPRKAPYNTLVLIDNNGEIVQKYRKIIPWCPIEGWYPGGQTYVSEGPKGMKISLIICDDGNYPEIWRDCAMKGAELIVRCQGYMYPAKDQQVMMAKAMAWANNCYVAVANAAGFDGVYSYFGHSAIIGFDGRTLGECGEEEMGIQYAQLSLSQIRDARANDQSQNHLFKILHRGYSGLQASGDGDRGLAECPFEFYRTWVTDAEKARENVERLTRSTTGVAQCPVGRLPYEGLEKEA,346,FALSE,6227,6227,0,-0.2222,median,Wrenbeck,Single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encoded,2017,10.1038/ncomms15695,1-341,Aliphatic amidase,Enzyme function,Growth,AMIE_PSEAE_full_11-26-2021_b02.a2m,1,346,346,0.2,0.2,140703,0.725,251,29959.3,119.359761,high,557,2.219123506,AMIE_PSEAE_Wrenbeck_2017.csv,isobutyramide_normalized_fitness,1,mutant,AMIE_PSEAE_theta_0.2.npy,AMIE_PSEAE.pdb,1-346,0.1,,Activity
+ANCSZ_Hobbs_2022,ANCSZ_Hobbs_2022.csv,ANCSZ,Eukaryote,Reconstructed ancestor,MADSANHLPYFYGSITREEAEDYLKQGGMSDGLFLLRQSLNSLGGYVLSVVYDRQCHHYTIERQLNGTYAIAGGKPHSGPAELCEYHSQDSDGLVCLLKKPCNRPPGVQPKVGPFEDLKDQLIREYVRQTWNLEGEALEQAIISQRPQLEKLIATTAHEKMPWFHGKISREESERRLLSGAQPNGKFLIRERDENGSYALSLLYEKKVYHYRIDRDKSGKLSIPDGKKFDTLWQLVEHYSHKPDGLLCVLTEPCPNPDSPAGALGAPAPPLPGSHPKLETAGGIISRIKSYSFPKPGFKKKPPSERPKSALNVNGYVPRPKPLGAEGGSRRAMPMDTNVYESPYSDPEELKDKKLYLKREQLMLEEGELGSGNFGTVKKGVYKMRKKEIPVAVKVLKSENDPAVKDELMKEAEFMHQLDNPYIVRMIGICEAESLMLVMELAPLGPLNKFLQKHKDQITVENIVELMHQVSMGMKYLEEKNFVHRDLAARNVLLVNQHYAKISDFGLSKALGADDNYYKAKTAGKWPLKWYAPECINFHKFSSKSDVWSFGVTMWEAFSYGQKPYKGMKGQEVLPFIENGERMECPAECPEEMYELMKDCWTYKADDRPGFVAVELRLRDYYYDISK,627,FALSE,4670,4670,0,-0.0574121626,median,Hobbs,Saturation mutagenesis of a predicted ancestral Syk-family kinase,2022,10.1002/pro.4411,352-627,ancestral spleen tyrosine kinase,successful phosphorylation of bait peptide,enzymatic activity,ANCSZ_b0.4.a2m,1,627,627,0.4,0.2,7424,1,627,1036.7,1.653429027,Medium,109,0.1738437002,ANCSZ_Hobbs_2022.csv,DMS_value,1,mutant,ANCSZ_theta_0.2.npy,ANCSZ.pdb,1-627,1,,Activity
+ARGR_ECOLI_Tsuboyama_2023_1AOY,ARGR_ECOLI_Tsuboyama_2023_1AOY.csv,ARGR_ECOLI,Prokaryote,Escherichia coli,QEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGV,69,FALSE,1287,1287,0,-0.4541373765,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-69,Arginine repressor,Stability,cDNA display proteolysis,ARGR_ECOLI_2023-08-07_b04.a2m,1,69,69,0.4,0.2,21443,0.913,63,3719.2,59.03492063,Medium,29,0.4603174603,Tsuboyama2023_Dataset2_Dataset4,ddG_ML_float,1,mut_type,ARGR_ECOLI_theta0.2_2023-08-07_b04.npy,ARGR_ECOLI.pdb,1-69,1,,Stability
+B2L11_HUMAN_Dutta_2010_binding-Mcl-1,B2L11_HUMAN_Dutta_2010_binding-Mcl-1.csv,B2L11_HUMAN,Human,Homo sapiens,MAKQPSDVSSECDREGRQLQPAERPPQLRPGAPTSLQTEPQGNPEGNHGGEGDSCPHGSPQGPLAPPASPGPFATRSPLFIFMRRSSLLSRSSSGYFSFDTDRSPAPMSCDKSTQTPSPPCQAFNHYLSAMASMRQAEPADMRPEIWIAQELRRIGDEFNAYYARRVFLNNYQAAEDHPRMVILRLLRYIVRLVWRMH,198,FALSE,170,170,0,16002529.37,median,Dutta,Determinants of BH3 Binding Specificity for Mcl-1 versus Bcl-xL,2010,10.1016/j.jmb.2010.03.058,148-159,BCL2L11,Binding to Mcl-1 (FACS; yeast-displayed and antibody stained for binding partner),FACS,B2L11_HUMAN_2023-08-07_b04.a2m,1,198,198,0.4,0.2,660,0.995,197,88.5,0.4492385787,Low,2,0.01015228426,,score,1,mut_proteingym,B2L11_HUMAN_theta0.2_2023-08-07_b04.npy,B2L11_HUMAN.pdb,1-198,1,147,Binding
+BBC1_YEAST_Tsuboyama_2023_1TG0,BBC1_YEAST_Tsuboyama_2023_1TG0.csv,BBC1_YEAST,Eukaryote,Saccharomyces cerevisiae,EVPFKVVAQFPYKSDYEDDLNFEKDQEIIVTSVEDAEWYFGEYQDSNGDVIEGIFPKSFVAVQG,64,TRUE,2069,1084,985,-1.271998543,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-64,Myosin tail region-interacting protein MTI1,Stability,cDNA display proteolysis,BBC1_YEAST_2023-08-07_b05.a2m,1,64,64,0.5,0.2,604824,0.844,54,17529.2,324.6148148,High,55,1.018518519,Tsuboyama2023_Dataset2_Dataset5,ddG_ML_float,1,mut_type,BBC1_YEAST_theta0.2_2023-08-07_b05.npy,BBC1_YEAST.pdb,1-64,1,,Stability
+BCHB_CHLTE_Tsuboyama_2023_2KRU,BCHB_CHLTE_Tsuboyama_2023_2KRU.csv,BCHB_CHLTE,Prokaryote,Chlorobaculum tepidum,ELSWTAEAEKMLGKVPFFVRKKVRKNTDNYAREIGEPVVTADVFRKAKEHLG,52,TRUE,1572,890,682,-0.9540616602,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-52,Light-independent protochlorophyllide reductase subunit B,Stability,cDNA display proteolysis,BCHB_CHLTE_2023-08-07_b04.a2m,1,52,52,0.4,0.2,12079,0.923,48,2630.8,54.80833333,Medium,18,0.375,Tsuboyama2023_Dataset2_Dataset6,ddG_ML_float,1,mut_type,BCHB_CHLTE_theta0.2_2023-08-07_b04.npy,BCHB_CHLTE.pdb,1-52,1,,Stability
+BLAT_ECOLX_Deng_2012,BLAT_ECOLX_Deng_2012.csv,BLAT_ECOLX,Prokaryote,Escherichia coli,MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW,286,FALSE,4996,4996,0,-2.913548,median,Deng,Deep Sequencing of Systematic Combinatorial Libraries Reveals β-Lactamase Sequence Constraints at High Resolution,2012,10.1016/j.jmb.2012.09.014,24-286,Beta-lactamase TEM,"antibiotic resistance, MIC",Amp resistance,BLAT_ECOLX_full_11-26-2021_b02.a2m,1,286,286,0.2,0.2,209644,0.752,215,47605,221.4186047,high,446,2.074418605,BLAT_ECOLX_Deng_2012.csv,ddG_stat,-1,mutant,BLAT_ECOLX_theta_0.2.npy,BLAT_ECOLX.pdb,1-286,0.1,,OrganismalFitness
+BLAT_ECOLX_Firnberg_2014,BLAT_ECOLX_Firnberg_2014.csv,BLAT_ECOLX,Prokaryote,Escherichia coli,MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW,286,FALSE,4783,4783,0,0.4257,median,Firnberg,"A Comprehensive, High-Resolution Map of a Gene's Fitness Landscape",2014,10.1093/molbev/msu081,24-286,Beta-lactamase TEM,Growth (0.25-1024 ug/mL ampicillin) doubling,Growth,BLAT_ECOLX_full_11-26-2021_b02.a2m,1,286,286,0.2,0.2,209644,0.752,215,47605,221.4186047,high,446,2.074418605,BLAT_ECOLX_Firnberg_2014.csv,linear,1,mutant,BLAT_ECOLX_theta_0.2.npy,BLAT_ECOLX.pdb,1-286,0.1,,OrganismalFitness
+BLAT_ECOLX_Jacquier_2013,BLAT_ECOLX_Jacquier_2013.csv,BLAT_ECOLX,Prokaryote,Escherichia coli,MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW,286,FALSE,989,989,0,-0.666666667,median,Jacquier,Capturing the mutational landscape of the beta-lactamase TEM-1,2013,10.1073/pnas.1215206110,24-286,Beta-lactamase TEM,MIC,Amoxicillin resistance,BLAT_ECOLX_full_11-26-2021_b02.a2m,1,286,286,0.2,0.2,209644,0.752,215,47605,221.4186047,high,446,2.074418605,BLAT_ECOLX_Jacquier_2013.csv,MIC_score,1,mutant,BLAT_ECOLX_theta_0.2.npy,BLAT_ECOLX.pdb,1-286,0.1,,OrganismalFitness
+BLAT_ECOLX_Stiffler_2015,BLAT_ECOLX_Stiffler_2015.csv,BLAT_ECOLX,Prokaryote,Escherichia coli,MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW,286,FALSE,4996,4996,0,-1.159498916,median,Stiffler,Evolvability as a Function of Purifying Selection in TEM-1 β-lactamase,2015,10.1016/j.cell.2015.01.035,24-286,Beta-lactamase TEM,Growth (10-2500 ug/mL ampicillin),Growth,BLAT_ECOLX_full_11-26-2021_b02.a2m,1,286,286,0.2,0.2,209644,0.752,215,47605,221.4186047,high,446,2.074418605,BLAT_ECOLX_Stiffler_2015.csv,2500,1,mutant,BLAT_ECOLX_theta_0.2.npy,BLAT_ECOLX.pdb,1-286,0.1,,OrganismalFitness
+BRCA1_HUMAN_Findlay_2018,BRCA1_HUMAN_Findlay_2018.csv,BRCA1_HUMAN,Human,Homo sapiens,MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQCPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYANSYNFAKKENNSPEHLKDEVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLGTVRTLRTKQRIQPQKTSVYIELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISLDSAKKAACEFSETDVTNTEHHQPSNNDLNTTEKRAAERHPEKYQGSSVSNLHVEPCGTNTHASSLQHENSSLLLTKDRMNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTPSTEKKVDLNADPLCERKEWNKQKLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDELLGSDDSHDGESESNAKVADVLDVLNEVDEYSGSSEKIDLLASDPHEALICKSERVHSKSVESNIEDKIFGKTYRKKASLPNLSHVTENLIIGAFVTEPQIIQERPLTNKLKRKRRPTSGLHPEDFIKKADLAVQKTPEMINQGTNQTEQNGQVMNITNSGHENKTKGDSIQNEKNPNPIESLEKESAFKTKAEPISSSISNMELELNIHNSKAPKKNRLRRKSSTRHIHALELVVSRNLSPPNCTELQIDSCSSSEEIKKKKYNQMPVRHSRNLQLMEGKEPATGAKKSNKPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTSELKEFVNPSLPREEKEEKLETVKVSNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTDYGTQESISLLEVSTLGKAKTEPNKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHEVNHSRETSIEMEESELDAQYLQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQSPKVTFECEQKEENQGKNESNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSSQFRGNETGLITPNKHGLLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGNENIPSTVSTISRNNIRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNRGPKLNAMLRLGVLQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQPMGSSHASQVCSETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHTHLAQGYRRGAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNTEENLLSLKNSLNDCSNQVILAKASQEHHLSEETKCSASLFSSQCSELEDLTANTNTQDPFLIGSSKQMRHQSESQGVGLSDKELVSDDEERGTGLEENNQEEQSMDSNLGEAASGCESETSVSEDCSGLSSQSDILTTQQRDTMQHNLIKLQQEMAELEAVLEQHGSQPSNSYPSIISDSSALEDLRNPEQSTSEKAVLTSQKSSEYPISQNPEGLSADKFEVSADSSTSKNKEPGVERSSPSKCPSLDDRWYMHSCSGSLQNRNYPSQEELIKVVDVEEQQLEESGPHDLTETSYLPRQDLEGTPYLESGISLFSDDPESDPSEDRAPESARVGNIPSSTSALKVPQLKVAESAQSPAAAHTTDTAGYNAMEESVSREKPELTASTERVNKRMSMVVSGLTPEEFMLVYKFARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYFWVTQSIKERKMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMVQLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSVALYQCQELDTYLIPQIPHSHY,1863,FALSE,1837,1837,0,-1,manual,Findlay,Accurate classification of BRCA1 variants with saturation genome editing,2018,10.1038/s41586-018-0461-z,1-1855,BRCA1,Growth,Growth,BRCA1_HUMAN_full_11-26-2021_b02.a2m,1,1863,1863,0.2,0.2,1008,0.769,1432,108.4,0.07569832402,low,0,0,BRCA1_HUMAN_Findlay_2018.csv,function_score,1,mutant,BRCA1_HUMAN_theta_0.2.npy,BRCA1_HUMAN.pdb,1-1863,0.1,,OrganismalFitness
+BRCA2_HUMAN_Erwood_2022_HEK293T,BRCA2_HUMAN_Erwood_2022_HEK293T.csv,BRCA2_HUMAN,Human,Homo sapiens,MPIGSKERPTFFEIFKTRCNKADLGPISLNWFEELSSEAPPYNSEPAEESEHKNNNYEPNLFKTPQRKPSYNQLASTPIIFKEQGLTLPLYQSPVKELDKFKLDLGRNVPNSRHKSLRTVKTKMDQADDVSCPLLNSCLSESPVVLQCTHVTPQRDKSVVCGSLFHTPKFVKGRQTPKHISESLGAEVDPDMSWSSSLATPPTLSSTVLIVRNEEASETVFPHDTTANVKSYFSNHDESLKKNDRFIASVTDSENTNQREAASHGFGKTSGNSFKVNSCKDHIGKSMPNVLEDEVYETVVDTSEEDSFSLCFSKCRTKNLQKVRTSKTRKKIFHEANADECEKSKNQVKEKYSFVSEVEPNDTDPLDSNVANQKPFESGSDKISKEVVPSLACEWSQLTLSGLNGAQMEKIPLLHISSCDQNISEKDLLDTENKRKKDFLTSENSLPRISSLPKSEKPLNEETVVNKRDEEQHLESHTDCILAVKQAISGTSPVASSFQGIKKSIFRIRESPKETFNASFSGHMTDPNFKKETEASESGLEIHTVCSQKEDSLCPNLIDNGSWPATTTQNSVALKNAGLISTLKKKTNKFIYAIHDETSYKGKKIPKDQKSELINCSAQFEANAFEAPLTFANADSGLLHSSVKRSCSQNDSEEPTLSLTSSFGTILRKCSRNETCSNNTVISQDLDYKEAKCNKEKLQLFITPEADSLSCLQEGQCENDPKSKKVSDIKEEVLAAACHPVQHSKVEYSDTDFQSQKSLLYDHENASTLILTPTSKDVLSNLVMISRGKESYKMSDKLKGNNYESDVELTKNIPMEKNQDVCALNENYKNVELLPPEKYMRVASPSRKVQFNQNTNLRVIQKNQEETTSISKITVNPDSEELFSDNENNFVFQVANERNNLALGNTKELHETDLTCVNEPIFKNSTMVLYGDTGDKQATQVSIKKDLVYVLAEENKNSVKQHIKMTLGQDLKSDISLNIDKIPEKNNDYMNKWAGLLGPISNHSFGGSFRTASNKEIKLSEHNIKKSKMFFKDIEEQYPTSLACVEIVNTLALDNQKKLSKPQSINTVSAHLQSSVVVSDCKNSHITPQMLFSKQDFNSNHNLTPSQKAEITELSTILEESGSQFEFTQFRKPSYILQKSTFEVPENQMTILKTTSEECRDADLHVIMNAPSIGQVDSSKQFEGTVEIKRKFAGLLKNDCNKSASGYLTDENEVGFRGFYSAHGTKLNVSTEALQKAVKLFSDIENISEETSAEVHPISLSSSKCHDSVVSMFKIENHNDKTVSEKNNKCQLILQNNIEMTTGTFVEEITENYKRNTENEDNKYTAASRNSHNLEFDGSDSSKNDTVCIHKDETDLLFTDQHNICLKLSGQFMKEGNTQIKEDLSDLTFLEVAKAQEACHGNTSNKEQLTATKTEQNIKDFETSDTFFQTASGKNISVAKESFNKIVNFFDQKPEELHNFSLNSELHSDIRKNKMDILSYEETDIVKHKILKESVPVGTGNQLVTFQGQPERDEKIKEPTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQGTSEITSFSHQWAKTLKYREACKDLELACETIEITAAPKCKEMQNSLNNDKNLVSIETVVPPKLLSDNLCRQTENLKTSKSIFLKVKVHENVEKETAKSPATCYTNQSPYSVIENSALAFYTSCSRKTSVSQTSLLEAKKWLREGIFDGQPERINTADYVGNYLYENNSNSTIAENDKNHLSEKQDTYLSNSSMSNSYSYHSDEVYNDSGYLSKNKLDSGIEPVLKNVEDQKNTSFSKVISNVKDANAYPQTVNEDICVEELVTSSSPCKNKNAAIKLSISNSNNFEVGPPAFRIASGKIVCVSHETIKKVKDIFTDSFSKVIKENNENKSKICQTKIMAGCYEALDDSEDILHNSLDNDECSTHSHKVFADIQSEEILQHNQNMSGLEKVSKISPCDVSLETSDICKCSIGKLHKSVSSANTCGIFSTASGKSVQVSDASLQNARQVFSEIEDSTKQVFSKVLFKSNEHSDQLTREENTAIRTPEHLISQKGFSYNVVNSSAFSGFSTASGKQVSILESSLHKVKGVLEEFDLIRTEHSLHYSPTSRQNVSKILPRVDKRNPEHCVNSEMEKTCSKEFKLSNNLNVEGGSSENNHSIKVSPYLSQFQQDKQQLVLGTKVSLVENIHVLGKEQASPKNVKMEIGKTETFSDVPVKTNIEVCSTYSKDSENYFETEAVEIAKAFMEDDELTDSKLPSHATHSLFTCPENEEMVLSNSRIGKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITCVPFRTTKERQEIQNPNFTAPGQEFLSKSHLYEHLTLEKSSSNLAVSGHPFYQVSATRNEKMRHLITTGRPTKVFVPPFKTKSHFHRVEQCVRNINLEENRQKQNIDGHGSDDSKNKINDNEIHQFNKNNSNQAVAVTFTKCEEEPLDLITSLQNARDIQDMRIKKKQRQRVFPQPGSLYLAKTSTLPRISLKAAVGGQVPSACSHKQLYTYGVSKHCIKINSKNAESFQFHTEDYFGKESLWTGKGIQLADGGWLIPSNDGKAGKEEFYRALCDTPGVDPKLISRIWVYNHYRWIIWKLAAMECAFPKEFANRCLSPERVLLQLKYRYDTEIDRSRRSAIKKIMERDDTAAKTLVLCVSDIISLSANISETSSNKTSSADTQKVAIIELTDGWYAVKAQLDPPLLAVLKNGRLTVGQKIILHGAELVGSPDACTPLEAPESLMLKISANSTRPARWYTKLGFFPDPRPFPLPLSSLFSDGGNVGCVDVIIQRAYPIQWMEKTSSGLYIFRNEREEEKEAAKYVEAQQKRLEALFTKIQEEFEEHEENTTKPYLPSRALTRQQVRALQDGAELYEAVKNAADPAYLEGYFSEEQLRALNNHRQMLNDKKQAQIQLEIRKAMESAEQKEQGLSRDVTTVWKLRIVSYSKKEKDSVILSIWRPSSDLYSLLTEGKRYRIYHLATSKSKSKSERANIQLAATKKTQYQQLPVSDEILFQIYQPREPLHFSKFLDPDFQPSCSEVDLIGFVVSVVKKTGLAPFVYLSDECYNLLAIKFWIDLNEDIIKPHMLIAASNLQWRPESKSGLLTLFAGDFSVFSASPKEGHFQETFNKMKNTVENIDILCNEAENKLMHILHANDPKWSTPTKDCTSGPYTAQIIPGTGNKLLMSSPNCEIYYQSPLSLCMAKRKSVSTPVSAQMTSKSCKGEKEIDDQKNCKKRRALDFLSRLPLPPPVSPICTFVSPAAQKAFQPPRSCGTKYETPIKKKELNSPQMTPFKKFNEISLLESNSIADEELALINTQALLSGSTGEKQFISVSESTRTAPTSSEDYLRLKRRCTTSLIKEQESSQASTEECEKNKQDTITTKKYI,3418,FALSE,265,265,0,0.8,manual,Erwood,Saturation variant interpretation using CRISPR prime editing,2022,10.1038/s41587-021-01201-1,388-2654,BRCA2,Fitness,Growth,BRCA2_HUMAN_2023-10-12_b01.a2m,1,3418,3418,0.1,0.2,933,,,,,,,,41587_2021_1201_MOESM3_ESM.xlsx,Function Score,1,Protein Annotation,BRCA2_HUMAN_theta0.2_2023-10-12_b01.npy,BRCA2_HUMAN_1-1000.pdb|BRCA2_HUMAN_1001-2085.pdb|BRCA2_HUMAN_2086-2832.pdb,1-1000|1001-2085|2086-2832,1,,OrganismalFitness
+C6KNH7_9INFA_Lee_2018,C6KNH7_9INFA_Lee_2018.csv,C6KNH7_9INFA,Virus,Influenza A virus (A/Perth/16/2009(H3N2)),MKTIIALSYILCLVFAQKLPGNDNSTATLCLGHHAVPNGTIVKTITNDQIEVTNATELVQSSSTGEICDSPHQILDGKNCTLIDALLGDPQCDDFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSKNSFFSRLNWLTHLNFKYPALNVTMPNNEQFDKLYIWGVLHPGTDKDQIFLYAQASGRITVSTKRSQQIVSPNIGSRPRVRNIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCNSECITPNGSIPNDKPFQNVNRITYGACPRYVKQNTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVELKSGYKDWILWISFAISCFLLCVALLGFIMWACQKGNIRCNICI,566,FALSE,10754,10754,0,-1.720276237,median,Lee,Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants,2018,10.1073/pnas.1806133115,1-566,Influenza hemagglutinin,Viral replication,Growth,C6KNH7_9INFA_theta0.99_full_11-26-2021_b09.a2m,1,566,566,0.9,0.01,57453,0.977,553,10569.8,19.11356239,medium,964,1.743218807,C6KNH7_9INFA_Lee_2018.csv,log_fitness_by_syn_mut_fitness,1,mutant,C6KNH7_9INFA_theta_0.01.npy,C6KNH7_9INFA.pdb,1-566,0.1,,OrganismalFitness
+CALM1_HUMAN_Weile_2017,CALM1_HUMAN_Weile_2017.csv,CALM1_HUMAN,Human,Homo sapiens,MADQLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTIDFPEFLTMMARKMKDTDSEEEIREAFRVFDKDGNGYISAAELRHVMTNLGEKLTDEEVDEMIREADIDGDGQVNYEEFVQMMTAK,149,FALSE,1813,1813,0,0.872790117,median,Weile,A framework for exhaustively mapping functional missense variants,2017,10.15252/msb.20177908,2-149,CALM1,Yeast growth,complementation,CALM1_HUMAN_full_11-26-2021_b03.a2m,1,149,149,0.3,0.2,177633,0.893,133,28985.1,217.9330827,high,96,0.7218045113,CALM1_HUMAN_Weile_2017.csv,screenscore,1,mutant,CALM1_HUMAN_theta_0.2.npy,CALM1_HUMAN.pdb,1-149,0.1,,OrganismalFitness
+CAPSD_AAV2S_Sinai_2021,CAPSD_AAV2S_Sinai_2021.csv,CAPSD_AAV2S,Virus,Adeno-associated virus 2 (isolate Srivastava/1982) (AAV-2),MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGLVLPGYKYLGPFNGLDKGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSFGGNLGRAVFQAKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAGQQPARKRLNFGQTGDADSVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQSGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFTVDTNGVYSEPRPIGTRYLTRNL,735,TRUE,42328,532,41796,-1.2,manual,Sinai,Generative AAV capsid diversification by latent interpolation,2021,10.1101/2021.04.16.440236,561-588,AAV,viability for AAV capsid production,,CAPSD_AAV2S_uniprot_t099_msc70_mcc70_b0.8.a2m,1,735,735,0.8,0.01,604,0.782,575,213.8,0.371826087,low,1943,3.379130435,CAPSD_AAV2S_Sinai_substitutions_2021.csv,viral_selection,1,mutant,CAPSD_AAV2S_theta_0.01.npy,CAPSD_AAV2S.pdb,1-735,0.1,,OrganismalFitness
+CAR11_HUMAN_Meitlis_2020_gof,CAR11_HUMAN_Meitlis_2020_gof.csv,CAR11_HUMAN,Human,Homo sapiens,MPGGGPEMDDYMETLKDEEDALWENVECNRHMLSRYINPAKLTPYLRQCKVIDEQDEDEVLNAPMLPSKINRAGRLLDILHTKGQRGYVVFLESLEFYYPELYKLVTGKEPTRRFSTIVVEEGHEGLTHFLMNEVIKLQQQMKAKDLQRCELLARLRQLEDEKKQMTLTRVELLTFQERYYKMKEERDSYNDELVKVKDDNYNLAMRYAQLSEEKNMAVMRSRDLQLEIDQLKHRLNKMEEECKLERNQSLKLKNDIENRPKKEQVLELERENEMLKTKNQELQSIIQAGKRSLPDSDKAILDILEHDRKEALEDRQELVNRIYNLQEEARQAEELRDKYLEEKEDLELKCSTLGKDCEMYKHRMNTVMLQLEEVERERDQAFHSRDEAQTQYSQCLIEKDKYRKQIRELEEKNDEMRIEMVRREACIVNLESKLRRLSKDSNNLDQSLPRNLPVTIISQDFGDASPRTNGQEADDSSTSEESPEDSKYFLPYHPPQRRMNLKGIQLQRAKSPISLKRTSDFQAKGHEEEGTDASPSSCGSLPITNSFTKMQPPRSRSSIMSITAEPPGNDSIVRRYKEDAPHRSTVEEDNDSGGFDALDLDDDSHERYSFGPSSIHSSSSSHQSEGLDAYDLEQVNLMFRKFSLERPFRPSVTSVGHVRGPGPSVQHTTLNGDSLTSQLTLLGGNARGSFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWTIQRCSGPVTLHYKVNHEGYRKLVKDMEDGLITSGDSFYIRLNLNISSQLDACTMSLKCDDVVHVRDTMYQDRHEWLCARVDPFTDHDLDMGTIPSYSRAQQLLLVKLQRLMHRGSREEVDGTHHTLRALRNTLQPEEALSTSDPRVSPRLSRASFLFGQLLQFVSRSENKYKRMNSNERVRIISGSPLGSLARSSLDATKLLTEKQEELDPESELGKNLSLIPYSLVRAFYCERRRPVLFTPTVLAKTLVQRLLNSGGAMEFTICKSDIVTRDEFLRRQKTETIIYSREKNPNAFECIAPANIEAVAAKNKHCLLEAGIGCTRDLIKSNIYPIVLFIRVCEKNIKRFRKLLPRPETEEEFLRVCRLKEKELEALPCLYATVEPDMWGSVEELLRVVKDKIGEEQRKTIWVDEDQL,1154,FALSE,2374,2374,0,0.14475,manual,Meitlis,Multiplexed Functional Assessment of Genetic Variants in CARD11,2020,10.1016/j.ajhg.2020.10.015.,4-146,CARD11,Signaling (in presence of ibrutinib),survival assessment assay,CAR11_HUMAN_2023-10-12_b02.a2m,1,1154,1154,0.2,0.2,1352,0.998,1152,53.7,0.04661458333,Low,0,0,mmc2.xlsx,log2_score,1,mutant,CAR11_HUMAN_theta0.2_2023-10-12_b02.npy,CAR11_HUMAN.pdb,1-1154,1,,OrganismalFitness
+CAR11_HUMAN_Meitlis_2020_lof,CAR11_HUMAN_Meitlis_2020_lof.csv,CAR11_HUMAN,Human,Homo sapiens,MPGGGPEMDDYMETLKDEEDALWENVECNRHMLSRYINPAKLTPYLRQCKVIDEQDEDEVLNAPMLPSKINRAGRLLDILHTKGQRGYVVFLESLEFYYPELYKLVTGKEPTRRFSTIVVEEGHEGLTHFLMNEVIKLQQQMKAKDLQRCELLARLRQLEDEKKQMTLTRVELLTFQERYYKMKEERDSYNDELVKVKDDNYNLAMRYAQLSEEKNMAVMRSRDLQLEIDQLKHRLNKMEEECKLERNQSLKLKNDIENRPKKEQVLELERENEMLKTKNQELQSIIQAGKRSLPDSDKAILDILEHDRKEALEDRQELVNRIYNLQEEARQAEELRDKYLEEKEDLELKCSTLGKDCEMYKHRMNTVMLQLEEVERERDQAFHSRDEAQTQYSQCLIEKDKYRKQIRELEEKNDEMRIEMVRREACIVNLESKLRRLSKDSNNLDQSLPRNLPVTIISQDFGDASPRTNGQEADDSSTSEESPEDSKYFLPYHPPQRRMNLKGIQLQRAKSPISLKRTSDFQAKGHEEEGTDASPSSCGSLPITNSFTKMQPPRSRSSIMSITAEPPGNDSIVRRYKEDAPHRSTVEEDNDSGGFDALDLDDDSHERYSFGPSSIHSSSSSHQSEGLDAYDLEQVNLMFRKFSLERPFRPSVTSVGHVRGPGPSVQHTTLNGDSLTSQLTLLGGNARGSFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWTIQRCSGPVTLHYKVNHEGYRKLVKDMEDGLITSGDSFYIRLNLNISSQLDACTMSLKCDDVVHVRDTMYQDRHEWLCARVDPFTDHDLDMGTIPSYSRAQQLLLVKLQRLMHRGSREEVDGTHHTLRALRNTLQPEEALSTSDPRVSPRLSRASFLFGQLLQFVSRSENKYKRMNSNERVRIISGSPLGSLARSSLDATKLLTEKQEELDPESELGKNLSLIPYSLVRAFYCERRRPVLFTPTVLAKTLVQRLLNSGGAMEFTICKSDIVTRDEFLRRQKTETIIYSREKNPNAFECIAPANIEAVAAKNKHCLLEAGIGCTRDLIKSNIYPIVLFIRVCEKNIKRFRKLLPRPETEEEFLRVCRLKEKELEALPCLYATVEPDMWGSVEELLRVVKDKIGEEQRKTIWVDEDQL,1154,FALSE,2395,2395,0,-0.4635,manual,Meitlis,Multiplexed Functional Assessment of Genetic Variants in CARD11,2020,10.1016/j.ajhg.2020.10.015.,4-146,CARD11,Signaling,survival assessment assay,CAR11_HUMAN_2023-10-12_b02.a2m,1,1154,1154,0.2,0.2,1352,0.998,1152,53.7,0.04661458333,Low,0,0,mmc3.xlsx,log2_score,1,mutant,CAR11_HUMAN_theta0.2_2023-10-12_b02.npy,CAR11_HUMAN.pdb,1-1154,1,,OrganismalFitness
+CAS9_STRP1_Spencer_2017_positive,CAS9_STRP1_Spencer_2017_positive.csv,CAS9_STRP1,Prokaryote,Streptococcus pyogenes,MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD,1368,FALSE,8117,8117,0,-0.2654328586,median,Spencer,Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains,2017,10.1038/s41598-017-17081-y,1-1368,Streptococcus pyogenes Cas9,count of mutation where survival depends on expression of Cas9 and correct cleavage,Flow cytometry,CAS9_STRP1_2023-08-07_b01.a2m,1,1368,1368,0.1,0.2,5349,0.992,1357,1532.3,1.12918,Medium,241,0.17759764,SPCAS9_Spencer_positive_2022.csv,Log2 Fold Change after Positive Selection,1,mutant,CAS9_STRP1_theta0.2_2023-08-07_b01.npy,CAS9_STRP1.pdb,1-1368,1,,Activity
+CASP3_HUMAN_Roychowdhury_2020,CASP3_HUMAN_Roychowdhury_2020.csv,CASP3_HUMAN,Human,Homo sapiens,MSGISLDNSYKMDYPEMGLCIIINNKNFHKSTGMTSRSGTDVDAANLRETFRNLKYEVRNKNDLTREEIVELMRDVSKEDHSKRSSFVCVLLSHGEEGIIFGTNGPVDLKKITNFFRGDRCRSLTGKPKLFIIQACRGTELDCGIETDSGVDDDMACHKIPVEADFLYAYSTAPGYYSWRNSKDGSWFIQSLCAMLKQYADKLEFMHILTRVNRKVATEFESFSFDATFHAKKQIPCIVSMLTKELYFYHLEHHHHHH,258,FALSE,1567,1567,0,0.03725973017,median,Roychowdhury,Microfluidic deep mutational scanning of the human executioner caspases reveals differences in structure and regulation,2022,10.1038/s41420-021-00799-0,2-258,CASP3,Fluorescence measurement,,CASP3_HUMAN_2023-08-07_b01.a2m,1,258,258,0.1,0.2,86012,0.884,228,28096.2,123.2289474,High,307,1.346491228,CASP3_HUMAN_Roychowdhury_2020.csv,coef,1,mutant,CASP3_HUMAN_theta0.2_2023-08-07_b01.npy,CASP3_HUMAN.pdb,1-258,1,,Activity
+CASP7_HUMAN_Roychowdhury_2020,CASP7_HUMAN_Roychowdhury_2020.csv,CASP7_HUMAN,Human,Homo sapiens,MAKPDRSSFVPSLFSKKKKNVTMRSIKTTRDRVPTYQYNMNFEKLGKCIIINNKNFDKVTGMGVRNGTDKDAEALFKCFRSLGFDVIVYNDCSCAKMQDLLKKASEEDHTNAACFACILLSHGEENVIYGKDGVTPIKDLTAHFRGDRCKTLLEKPKLFFIQACRGTELDDGIQADSGPINDTDANPRYKIPVEADFLFAYSTVPGYYSWRSPGRGSWFVQALCSILEEHGKDLEIMQILTRVNDRVARHFESQSDDPHFHEKKQIPCVVSMLTKELYFSQ,281,FALSE,1680,1680,0,-0.3340768074,median,Roychowdhury,Microfluidic deep mutational scanning of the human executioner caspases reveals differences in structure and regulation,2022,10.1038/s41420-021-00799-0,2-281,CASP7,Fluorescence measurement,,CASP7_HUMAN_2023-08-07_b01.a2m,1,281,281,0.1,0.2,71075,0.854,240,21588.4,89.95166667,Medium,298,1.241666667,CASP7_HUMAN_Roychowdhury_2020.csv,coef,1,mutant,CASP7_HUMAN_theta0.2_2023-08-07_b01.npy,CASP7_HUMAN.pdb,1-281,1,,Activity
+CATR_CHLRE_Tsuboyama_2023_2AMI,CATR_CHLRE_Tsuboyama_2023_2AMI.csv,CATR_CHLRE,Eukaryote,Chlamydomonas reinhardtii,GLTEEQKQEIREAFDLFDTDGSGTIDAKELKVAMRALGFEPKKEEIKKMISEIDKDGSGTIDFEEFLTMMTA,72,TRUE,1903,1340,563,-0.5681612987,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-72,Caltractin,Stability,cDNA display proteolysis,CATR_CHLRE_2023-08-07_b03.a2m,1,72,72,0.3,0.2,551057,0.903,65,75596.9,1163.029231,High,57,0.8769230769,Tsuboyama2023_Dataset2_Dataset7,ddG_ML_float,1,mut_type,CATR_CHLRE_theta0.2_2023-08-07_b03.npy,CATR_CHLRE.pdb,1-72,1,,Stability
+CBPA2_HUMAN_Tsuboyama_2023_1O6X,CBPA2_HUMAN_Tsuboyama_2023_1O6X.csv,CBPA2_HUMAN,Human,Homo sapiens,VGDQVLEIVPSNEEQIKNLLQLEAQEHLQLDFWKSPTTPGETAHVRVPFVNVQAVKVFLESQGIAYSIMIED,72,TRUE,2068,1357,711,-1.221174658,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-72,Carboxypeptidase A2,Stability,cDNA display proteolysis,CBPA2_HUMAN_2023-08-07_b03.a2m,1,72,72,0.3,0.2,12711,0.986,71,3086.5,43.47183099,Medium,34,0.4788732394,Tsuboyama2023_Dataset2_Dataset8,ddG_ML_float,1,mut_type,CBPA2_HUMAN_theta0.2_2023-08-07_b03.npy,CBPA2_HUMAN.pdb,1-72,1,,Stability
+CBS_HUMAN_Sun_2020,CBS_HUMAN_Sun_2020.csv,CBS_HUMAN,Human,Homo sapiens,MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWIRPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLKCELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAVRGYRCIIVMPEKMSSEKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSHILDQYRNASNPLAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCRIIGVDPEGSILAEPEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTFARMLIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWMLQKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTITCGHTIEILREKGFDQAPVVDEAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQIRLTDTLGRLSHILEMDHFALVVHEQIQYHSTGKSSQRQMVFGVVTAIDLLNFVAAQERDQK,551,FALSE,7217,7217,0,0.3753910128,median,Sun,A proactive genotype-to-patient-phenotype map for cystathionine beta-synthase,2020,10.1186/s13073-020-0711-1,2-551,cystathionine beta-synthase,Yeast Growth,Growth,CBS_HUMAN_2023-10-12_b08.a2m,1,551,551,0.8,0.2,19563,0.833,459,1886,4.108932462,Medium,289,0.6296296296,,score,1,mutant,CBS_HUMAN_theta0.2_2023-10-12_b08.npy,CBS_HUMAN.pdb,1-551,1,,OrganismalFitness
+CBX4_HUMAN_Tsuboyama_2023_2K28,CBX4_HUMAN_Tsuboyama_2023_2K28.csv,CBX4_HUMAN,Human,Homo sapiens,AVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNRE,50,TRUE,2282,917,1365,-1.635037732,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-50,E3 SUMO-protein ligase CBX4,Stability,cDNA display proteolysis,CBX4_HUMAN_2023-08-07_b03.a2m,1,50,50,0.3,0.2,108263,0.96,48,13404.4,279.2583333,High,23,0.4791666667,Tsuboyama2023_Dataset2_Dataset9,ddG_ML_float,1,mut_type,CBX4_HUMAN_theta0.2_2023-08-07_b03.npy,CBX4_HUMAN.pdb,1-50,1,,Stability
+CCDB_ECOLI_Adkar_2012,CCDB_ECOLI_Adkar_2012.csv,CCDB_ECOLI,Prokaryote,Escherichia coli,MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTTDMASVPVSVIGEEVADLSHRENDIKNAINLMFWGI,101,FALSE,1176,1176,0,-19,median,Adkar,Protein model discrimination using mutational sensitivity derived from deep sequencing,2012,10.1016/j.str.2011.11.021,2-101,Toxin CcdB,Protein toxicity (negative effect on cell growth),toxin activity,CCDB_ECOLI_full_11-26-2021_b02.a2m,1,101,101,0.2,0.2,43564,0.851,86,16821.5,195.5988372,high,61,0.7093023256,CCDB_ECOLI_Adkar_2012.csv,score,-1,mutant,CCDB_ECOLI_theta_0.2.npy,CCDB_ECOLI.pdb,1-101,0.1,,Activity
+CCDB_ECOLI_Tripathi_2016,CCDB_ECOLI_Tripathi_2016.csv,CCDB_ECOLI,Prokaryote,Escherichia coli,MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTTDMASVPVSVIGEEVADLSHRENDIKNAINLMFWGI,101,FALSE,1663,1663,0,-3.5,manual,Tripathi,"Molecular Determinants of Mutant Phenotypes, Inferred from Saturation Mutagenesis Data",2016,10.1093/molbev/msw182,2-101,Toxin CcdB,growth (surrogate for toxicity/activity of CCDB),Growth,CCDB_ECOLI_full_11-26-2021_b02.a2m,1,101,101,0.2,0.2,43564,0.851,86,16821.5,195.5988372,high,61,0.7093023256,CCDB_ECOLI_Tripathi_2016.csv,score,-1,mutant,CCDB_ECOLI_theta_0.2.npy,CCDB_ECOLI.pdb,1-101,0.1,,OrganismalFitness
+CCR5_HUMAN_Gill_2023,CCR5_HUMAN_Gill_2023.csv,CCR5_HUMAN,Human,Homo sapiens,MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKRLKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFIILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQVTETLGMTHCCINPIIYAFVGEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVGL,352,FALSE,6137,6137,0,-0.06,median,Gill,Multiple mechanisms of self-association of chemokine receptors CXCR4 and CCR5 demonstrated by deep mutagenesis,2023,10.1101/2023.03.25.534231,2-344,CCR5,"binding affinity, surface expression",FACS,CCR5_HUMAN_2023-08-07_b03.a2m,1,352,352,0.3,0.2,632074,0.83,292,63056,215.9452055,High,309,1.058219178,,avg_score,1,mutant,CCR5_HUMAN_theta0.2_2023-08-07_b03.npy,CCR5_HUMAN.pdb,1-352,1,,Binding
+CD19_HUMAN_Klesmith_2019_FMC_singles,CD19_HUMAN_Klesmith_2019_FMC_singles.csv,CD19_HUMAN,Human,Homo sapiens,MPPPRLLFFLLFLTPMEVRPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR,556,FALSE,3761,3761,0,0,manual,Klesmith,Retargeting CD19 Chimeric Antigen Receptor T Cells via Engineered CD19-Fusion Proteins,2019,10.1021/acs.molpharmaceut.9b00418,20-291,CD19,Binding affinity,FACS,CD19_HUMAN_2023-10-12_b01.a2m,1,556,556,0.1,0.2,1183,1,556,275.2,0.4949640288,Low,11,0.01978417266,single-site/Clinical_FMC_T1_Fitness.tsv,Fitness,1,mutant_offset,CD19_HUMAN_theta0.2_2023-10-12_b01.npy,CD19_HUMAN.pdb,1-556,1,,Binding
+CP2C9_HUMAN_Amorosi_2021_abundance,CP2C9_HUMAN_Amorosi_2021_abundance.csv,CP2C9_HUMAN,Human,Homo sapiens,MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQICNNFSPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHNQPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPV,490,FALSE,6370,6370,0,0.7723244345,median,Amorosi,Massively parallel characterization of CYP2C9 variant enzyme activity and abundance,2021,10.1016/j.ajhg.2021.07.001,2-490,Cytochrome P450 2C9,protein abundance,protein abundance,CP2C9_HUMAN_full_11-26-2021_b04.a2m,1,490,490,0.4,0.2,264279,0.886,434,81212.1,187.1246544,high,1092,2.516129032,CP2C9_HUMAN_Amorosi_2021.csv,abundance_score,1,variant,CP2C9_HUMAN_theta_0.2.npy,CP2C9_HUMAN.pdb,1-490,0.1,,Expression
+CP2C9_HUMAN_Amorosi_2021_activity,CP2C9_HUMAN_Amorosi_2021_activity.csv,CP2C9_HUMAN,Human,Homo sapiens,MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQICNNFSPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHNQPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPV,490,FALSE,6142,6142,0,0.5476104185,median,Amorosi,Massively parallel characterization of CYP2C9 variant enzyme activity and abundance,2021,10.1016/j.ajhg.2021.07.001,1-490,Cytochrome P450 2C9,"activity, binding (to fluorescent CYP probe)","Activity, binding",CP2C9_HUMAN_full_11-26-2021_b04.a2m,1,490,490,0.4,0.2,264279,0.886,434,81212.1,187.1246544,high,1092,2.516129032,CP2C9_HUMAN_Amorosi_2021.csv,activity_score,1,variant,CP2C9_HUMAN_theta_0.2.npy,CP2C9_HUMAN.pdb,1-490,0.1,,Binding
+CSN4_MOUSE_Tsuboyama_2023_1UFM,CSN4_MOUSE_Tsuboyama_2023_1UFM.csv,CSN4_MOUSE,Eukaryote,Mus musculus,SSGGSSILDRAVIEHNLLSASKLYNNITFEELGALLEIPAAKAEKIASQMITEGRMNGFIDQIDGIVHFETR,72,TRUE,3295,1353,1942,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-72,COP9 signalosome complex subunit 4,Stability,cDNA display proteolysis,CSN4_MOUSE_2023-08-07_b03.a2m,1,72,72,0.3,0.2,39217,0.889,64,3492.9,54.5765625,Medium,9,0.140625,Tsuboyama2023_Dataset2_Dataset10,ddG_ML_float,1,mut_type,CSN4_MOUSE_theta0.2_2023-08-07_b03.npy,CSN4_MOUSE.pdb,1-72,1,,Stability
+CUE1_YEAST_Tsuboyama_2023_2MYX,CUE1_YEAST_Tsuboyama_2023_2MYX.csv,CUE1_YEAST,Eukaryote,Saccharomyces cerevisiae,GGHPVTTQMVETVQNLAPNLHPEQIRYSLENTGSVEETVERYLRGDEFSFPP,52,TRUE,1580,955,625,-1.319713733,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-52,Coupling of ubiquitin conjugation to ER degradation protein 1,Stability,cDNA display proteolysis,CUE1_YEAST_2023-08-07_b08.a2m,1,52,52,0.8,0.2,3213,0.923,48,387.1,8.064583333,Medium,10,0.2083333333,Tsuboyama2023_Dataset2_Dataset11,ddG_ML_float,1,mut_type,CUE1_YEAST_theta0.2_2023-08-07_b08.npy,CUE1_YEAST.pdb,1-52,1,,Stability
+D7PM05_CLYGR_Somermeyer_2022,D7PM05_CLYGR_Somermeyer_2022.csv,D7PM05_CLYGR,Eukaryote,Clytia gregaria,MTALTEGAKLFEKEIPYITELEGDVEGMKFIIKGEGTGDATTGTIKAKYICTTGDLPVPWATILSSLSYGVFCFAKYPRHIADFFKSTQPDGYSQDRIISFDNDGQYDVKAKVTYENGTLYNRVTVKGTGFKSNGNILGMRVLYHSPPHAVYILPDRKNGGMKIEYNKAFDVMGGGHQMARHAQFNKPLGAWEEDYPLYHHLTVWTSFGKDPDDDETDHLTIVEVIKAVDLETYR,235,TRUE,24515,1169,23346,12500,manual,Somermeyer,Heterogeneity of the GFP fitness landscape and data-driven protein design,2022,10.7554/eLife.75842,2-235,Green fluorescent protein cgreGFP,Fluorescence,FACS,D7PM05_CLYGR_full_b0.2.a2m,1,235,235,0.2,0.2,694,1,235,137.6,0.5855319149,Low,4,0.0170212766,D7PM05_CLYGR_Somermeyer_2022.csv,replicates_mean_brightness,1,mutant,D7PM05_CLYGR_theta_0.2.npy,D7PM05_CLYGR.pdb,1-235,1,,Activity
+DLG4_HUMAN_Faure_2021,DLG4_HUMAN_Faure_2021.csv,DLG4_HUMAN,Human,Homo sapiens,MDCLCIVTTKKYRYQDEDTPPLEHSPAHLPNQANSPPVIVNTDTLEAPGYELQVNGTEGEMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVNDSILFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAEKVMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILAVNSVGLEDVMHEDAVAALKNTYDVVYLKVAKPSNAYLSDSYAPPDITTSYSQHLDNEISHSSYLGTDYPTAMTPTSPRRYSPVAKDLLGEEDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDQILSVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEEYSRFEAKIHDLREQLMNSSLGSGTASLRSNPKRGFYIRALFDYDKTKDCGFLSQALSFRFGDVLHVIDASDEEWWQARRVHSDSETDDIGFIPSKRRVERREWSRLKAKDWGSSSGSQGREDSVLSYETVTQMEVHYARPIIILGPTKDRANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDYHFVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILDVSANAVRRLQAAHLHPIAIFIRPRSLENVLEINKRITEEQARKAFDRATKLEQEFTECFSAIVEGDSFEEIYHKVKRVIEDLSGPYIWVPARERL,724,TRUE,6976,1280,5696,-0.5602585328,median,Faure,Mapping the energetic and allosteric landscapes of protein binding domains,2022,10.1038/s41586-022-04586-4,311-394,PSD95-PDZ3,Yeast growth,Growth,DLG4_HUMAN_full_11-26-2021_b02.a2m,1,724,724,0.2,0.2,25338,0.825,597,354.3,0.5934673367,low,7,0.01172529313,DLG4_HUMAN_Faure_2021.csv,fitness,1,mutant,DLG4_HUMAN_theta_0.2.npy,DLG4_HUMAN.pdb,1-724,0.1,,OrganismalFitness
+DLG4_RAT_McLaughlin_2012,DLG4_RAT_McLaughlin_2012.csv,DLG4_RAT,Eukaryote,Rattus norvegicus,MDCLCIVTTKKYRYQDEDTPPLEHSPAHLPNQANSPPVIVNTDTLEAPGYELQVNGTEGEMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVNDSILFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAEKVMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILAVNSVGLEDVMHEDAVAALKNTYDVVYLKVAKPSNAYLSDSYAPPDITTSYSQHLDNEISHSSYLGTDYPTAMTPTSPRRYSPVAKDLLGEEDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDQILSVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEEYSRFEAKIHDLREQLMNSSLGSGTASLRSNPKRGFYIRALFDYDKTKDCGFLSQALSFRFGDVLHVIDAGDEEWWQARRVHSDSETDDIGFIPSKRRVERREWSRLKAKDWGSSSGSQGREDSVLSYETVTQMEVHYARPIIILGPTKDRANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDYHFVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILDVSANAVRRLQAAHLHPIAIFIRPRSLENVLEINKRITEEQARKAFDRATKLEQEFTECFSAIVEGDSFEEIYHKVKRVIEDLSGPYIWVPARERL,724,FALSE,1576,1576,0,-0.25,manual,McLaughlin,The spatial architecture of protein function and adaptation,2012,10.1038/nature11500,311-393,"Dlg4, (PSD95_PDZ3)",peptide binding - natural ligand,Binding,DLG4_RAT_full_11-26-2021_b03.a2m,1,724,724,0.3,0.2,24705,0.841,609,283.9,0.4661740558,low,6,0.009852216749,DLG4_RAT_McLaughlin_2012.csv,CRIPT,1,mutant,DLG4_RAT_theta_0.2.npy,DLG4_RAT.pdb,1-724,0.1,,Binding
+DN7A_SACS2_Tsuboyama_2023_1JIC,DN7A_SACS2_Tsuboyama_2023_1JIC.csv,DN7A_SACS2,Prokaryote,Saccharolobus solfataricus,TVKFKYKGEEKQVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQ,55,FALSE,1008,1008,0,-0.472754253,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,DNA-binding protein 7a,Stability,cDNA display proteolysis,DN7A_SACS2_2023-08-07_b02.a2m,1,55,55,0.2,0.2,42895,0.764,42,1248.1,29.71666667,Medium,13,0.3095238095,Tsuboyama2023_Dataset2_Dataset12,ddG_ML_float,1,mut_type,DN7A_SACS2_theta0.2_2023-08-07_b02.npy,DN7A_SACS2.pdb,1-55,1,,Stability
+DNJA1_HUMAN_Tsuboyama_2023_2LO1,DNJA1_HUMAN_Tsuboyama_2023_2LO1.csv,DNJA1_HUMAN,Human,Homo sapiens,TTYYDVLGVKPNATQEELKKAYRKLALKYHPDKNPNEGEKFKQISQAYEVLSDAKKRELYDKGGE,65,TRUE,2264,1216,1048,-2.239788161,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-65,DnaJ homolog subfamily A member 1,Stability,cDNA display proteolysis,DNJA1_HUMAN_2023-08-07_b07.a2m,1,65,65,0.7,0.2,280284,0.969,63,35361.9,561.3,High,52,0.8253968254,Tsuboyama2023_Dataset2_Dataset13,ddG_ML_float,1,mut_type,DNJA1_HUMAN_theta0.2_2023-08-07_b07.npy,DNJA1_HUMAN.pdb,1-65,1,,Stability
+DOCK1_MOUSE_Tsuboyama_2023_2M0Y,DOCK1_MOUSE_Tsuboyama_2023_2M0Y.csv,DOCK1_MOUSE,Eukaryote,Mus musculus,WVPTKREEKYGVAFYNYDARGADELSLQIGDTVHILETYEGWYRGYTLRKKSKKGIFPASYIHLKE,66,TRUE,2915,1213,1702,-1.104437518,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-66,Dedicator of cytokinesis protein 1,Stability,cDNA display proteolysis,DOCK1_MOUSE_2023-08-07_b03.a2m,1,66,66,0.3,0.2,705447,0.848,56,22172.3,395.9339286,High,55,0.9821428571,Tsuboyama2023_Dataset2_Dataset14,ddG_ML_float,1,mut_type,DOCK1_MOUSE_theta0.2_2023-08-07_b03.npy,DOCK1_MOUSE.pdb,1-66,1,,Stability
+DYR_ECOLI_Nguyen_2023,DYR_ECOLI_Nguyen_2023.csv,DYR_ECOLI,Prokaryote,Escherichia coli,MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR,159,FALSE,2916,2916,0,0.8,manual,Nguyen,The Genetic Landscape of a Metabolic Interaction,2023,10.1101/2023.05.28.542639,2-159,DHFR,cell growth in ∆DHFR bacteria,Growth,DYR_ECOLI_2023-08-07_b01.a2m,1,159,159,0.1,0.2,188828,0.969,154,47685.7,309.6474026,High,337,2.188311688,542639_file03.xlsx,Avg Growth - WT TYMS,1,Mutation,DYR_ECOLI_theta0.2_2023-08-07_b01.npy,DYR_ECOLI.pdb,1-159,1,,OrganismalFitness
+DYR_ECOLI_Thompson_2019,DYR_ECOLI_Thompson_2019.csv,DYR_ECOLI,Prokaryote,Escherichia coli,MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR,159,FALSE,2363,2363,0,-0.5,manual,Thompson,Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme,2019,10.7554/eLife.53476,2-159,DHFR reductase,"growth (turbidostat; -Lon for natural absence of Lon protease in E. coli, +Lon for exogenous protease)",Growth,DYR_ECOLI_full_11-26-2021_b08.a2m,1,159,159,0.8,0.2,41921,0.981,156,12203.2,78.22564103,medium,265,1.698717949,DYR_ECOLI_Thompson_plusLon_2019.csv,PlusLon_selection_coefficient,1,mutant,DYR_ECOLI_theta_0.2.npy,DYR_ECOLI.pdb,1-159,0.1,,OrganismalFitness
+ENV_HV1B9_DuenasDecamp_2016,ENV_HV1B9_DuenasDecamp_2016.csv,ENV_HV1B9,Virus,Human immunodeficiency virus type 1 group M subtype B (strain 89.6) (HIV-1),MRVKEIRKNWQHLRGGILLLGMLMICSAAKEKTWVTIYYGVPVWREATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTENFNMWKNNMVDQMHEDIISLWDESLKPCVKLTPLCVTLNCTNLNITKNTTNPTSSSWGMMEKGEIKNCSFYITTSIRNKVKKEYALFNRLDVVPIENTNNTKYRLISCNTSVITQACPKVSFQPIPIHYCVPAGFAMLKCNNKTFNGSGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEDIVIRSENFTDNAKTIIVQLNESVVINCTRPNNNTRRRLSIGPGRAFYARRNIIGDIRQAHCNISRAKWNNTLQQIVIKLREKFRNKTIAFNQPSGGDPEIVRHSFNCGGEFFYCNTAQLFNSTWNVTGGTNGTEGNDIITLQCRIKQIINMWQKVGKAMYAPPITGQIRCSSNITGLLLTRDGGNSTETETEIFRPGGGDMRDNWRSELYKYKVVRIEPIGVAPTRAKRRTVQREKRAVGIGAVFLGFLGAAGSTMGAASVTLTVQARLLLSGIVQQQNNLLRAIEAQQHMLQLTVWGIKQLQARVLALERYLRDQQLMGIWGCSGKLICTTSVPWNVSWSNKSVDDIWNNMTWMEWEREIDNYTDYIYDLLEKSQTQQEKNEKELLELDKWASLWNWFDITNWLWYIRLFIMIVGGLIGLRIVFAVLSIVNRVRQGYSPLSFQTLLPASRGPDRPEGTEEEGGERDRDRSGPLVNGFLALFWVDLRNLCLFLYHLLRNLLLIVTRIVELLGRRGWEALKYWWNLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIKIVQRACRAIRNIPTRIRQGLERALL,853,FALSE,375,375,0,-0.8,manual,Duenas-Decamp,Saturation Mutagenesis of the HIV-1 Envelope CD4 Binding Loop Reveals Residues Controlling Distinct Trimer Conformations,2016,10.1371/journal.ppat.1005988,361-380,HIV env,Viral replication,Growth,ENV_HV1B9_S364P-M373R_b0.3.a2m,1,853,853,0.3,0.01,87271,0.989,844,11807.8,13.99028436,medium,947,1.122037915,ENV_HV1B9_DuenasDecamp_2016.csv,Fitness_Effect,1,mutant,ENV_HV1B9_theta_0.01.npy,ENV_HV1B9.pdb,1-853,0.1,,OrganismalFitness
+ENV_HV1BR_Haddox_2016,ENV_HV1BR_Haddox_2016.csv,ENV_HV1BR,Virus,Human immunodeficiency virus type 1 group M subtype B (isolate BRU/LAI) (HIV-1),MRVKEKYQHLWRWGWKWGTMLLGILMICSATEKLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLGNATNTNSSNTNSSSGEMMMEKGEIKNCSFNISTSIRGKVQKEYAFFYKLDIIPIDNDTTSYTLTSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSANFTDNAKTIIVQLNQSVEINCTRPNNNTRKSIRIQRGPGRAFVTIGKIGNMRQAHCNISRAKWNATLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQFINMWQEVGKAMYAPPISGQIRCSSNITGLLLTRDGGNNNNGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTKAKRRVVQREKRAVGIGALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSGKLICTTAVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFIMIVGGLVGLRIVFAVLSIVNRVRQGYSPLSFQTHLPTPRGPDRPEGIEEEGGERDRDRSIRLVNGSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQGACRAIRHIPRRIRQGLERILL,861,FALSE,12863,12863,0,0.0191127558,median,Haddox,Experimental Estimation of the Effects of All Amino-Acid Mutations to HIV’s Envelope Protein on Viral Replication in Cell Culture,2016,10.1371/journal.ppat.1006114,31-707,HIV env,Viral replication,Growth,ENV_HV1BR_theta0.99_full_11-26-2021_b09.a2m,1,861,861,0.9,0.01,74844,0.98,844,36809.8,43.61350711,medium,2359,2.795023697,ENV_HV1BR_Haddox_2016.csv,score,1,mutant,ENV_HV1BR_theta_0.01.npy,ENV_HV1BR.pdb,1-861,0.1,,OrganismalFitness
+ENVZ_ECOLI_Ghose_2023,ENVZ_ECOLI_Ghose_2023.csv,ENVZ_ECOLI,Prokaryote,Escherichia coli,LADDRTLLMAGVSHDLRTPLTRIRLATEMMSEQDGYLAESINKDIEECNAIIEQFIDYLR,60,FALSE,1121,1121,0,3.5,manual,Ghose,Marginal specificity in protein interactions constrains evolution of a paralogous family,2023,10.1073/pnas.2221163120,1-60,EnvZ kinase,fluorescent reporter,FACS,ENVZ_ECOLI_2023-08-07_b02.a2m,1,60,60,0.2,0.2,1879223,0.933,56,254652.1,4547.358929,High,55,0.9821428571,ENVZ_ECOLI_Ghose_2023.csv,mean_on,1,mutant,ENVZ_ECOLI_theta0.2_2023-08-07_b02.npy,ENVZ_ECOLI.pdb,1-60,1,,Activity
+EPHB2_HUMAN_Tsuboyama_2023_1F0M,EPHB2_HUMAN_Tsuboyama_2023_1F0M.csv,EPHB2_HUMAN,Human,Homo sapiens,SFNTVDEWLEAIKMGQYKESFANAGFTSFDVVSQMMMEDILRVGVTLAGHQKKILNSIQVMRAQMN,66,TRUE,1960,1239,721,-1.932053964,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-66,Ephrin type-B receptor 2,Stability,cDNA display proteolysis,EPHB2_HUMAN_2023-08-07_b04.a2m,1,66,66,0.4,0.2,212234,0.894,59,8426.3,142.8186441,High,29,0.4915254237,Tsuboyama2023_Dataset2_Dataset15,ddG_ML_float,1,mut_type,EPHB2_HUMAN_theta0.2_2023-08-07_b04.npy,EPHB2_HUMAN.pdb,1-66,1,,Stability
+ERBB2_HUMAN_Elazar_2016,ERBB2_HUMAN_Elazar_2016.csv,ERBB2_HUMAN,Human,Homo sapiens,MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIVRGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQLCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRTVCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNTDTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKCSKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPLQPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGISWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEGLACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPECQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQPCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKIRKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWIPDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVTQLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSPNHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWELMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFRELVSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGFFCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDGDLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVRPQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAAPQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV,1255,FALSE,326,326,0,0.0678339381,median,Elazar,Mutational scanning reveals the determinants of protein insertion and association energetics in the plasma membrane,2016,10.7554/eLife.12125,651-674,ErbB2 membrane domain,Membrane-protein insertion,TOXCAT-Beta-lactamase (TbL) screen,ERBB2_HUMAN_2023-10-12_b02.a2m,1,1255,1255,0.2,0.2,8311,0.981,1231,447.9,0.363850528,Low,187,0.1519090171,urn_mavedb_00000051-b-1_scores.csv,score,-1,mutant,ERBB2_HUMAN_theta0.2_2023-10-12_b02.npy,ERBB2_HUMAN.pdb,1-1255,1,650,Expression
+ESTA_BACSU_Nutschel_2020,ESTA_BACSU_Nutschel_2020.csv,ESTA_BACSU,Prokaryote,Bacillus subtilis,MKFVKRRIIALVTILMLSVTSLFALQPSAKAAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNNGPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGGANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGVGHIGLLYSSQVNSLIKEGLNGGGQNTN,212,FALSE,2172,2172,0,46.34,median,Nutschel,Systematically Scrutinizing the Impact of Substitution Sites on Thermostability and Detergent Tolerance for Bacillus subtilis Lipase A,2020,10.1021/acs.jcim.9b00954,32-205,estA,thermostability,thermostability,ESTA_BACSU_full_11-26-2021_b03.a2m,1,212,212,0.3,0.2,234310,0.774,164,64492.5,393.2469512,high,292,1.780487805,ESTA_BACSU_Nutschel_2020.csv,T50,1,Variants of BsLipA,ESTA_BACSU_theta_0.2.npy,ESTA_BACSU.pdb,1-212,0.1,,Stability
+F7YBW8_MESOW_Ding_2023,F7YBW8_MESOW_Ding_2023.csv,F7YBW8_MESOW,Prokaryote,Mesorhizobium opportunistum,MANVEKMSVAVTPQQAAVMREAVEAGEYATASEIVREAVRDWLAKRELRHDDIRRLRQLWDEGKASGRPEPVDFDALRKEARQKLTEVPPNGR,93,TRUE,7922,80,7842,0.3,manual,Ding,Protein design using structure-based residue preferences,2023,10.1101/2022.10.31.514613,48-82,Antitoxin ParD3,growth enrichment,,F7YBW8_MESOW_full_01-07-2022_b02.a2m,1,93,93,0.2,0.2,38613,0.774,72,16262.4,225.8666667,high,31,0.4305555556,df_at_10pos.csv,DMS_score,1,mutant,F7YBW8_MESOW_theta_0.2.npy,F7YBW8_MESOW.pdb,1-93,1,,OrganismalFitness
+F7YBW8_MESOW_Aakre_2015,F7YBW8_MESOW_Aakre_2015.csv,F7YBW8_MESOW,Prokaryote,Mesorhizobium opportunistum,MANVEKMSVAVTPQQAAVMREAVEAGEYATASEIVREAVRDWLAKRELRHDDIRRLRQLWDEGKASGRPEPVDFDALRKEARQKLTEVPPNGR,93,TRUE,9192,37,9155,-0.001724,median,Aakre,Evolving New Protein-Protein Interaction Specificity through Promiscuous Intermediates,2015,10.1016/j.cell.2015.09.055,59-64,Antitoxin ParD3,fitness,Growth (antitoxin neutralization of ParE3),F7YBW8_MESOW_full_01-07-2022_b02.a2m,1,93,93,0.2,0.2,38613,0.774,72,16262.4,225.8666667,high,31,0.4305555556,F7YBW8_MESOW_Aakre_2015.csv,fitness,1,mutant,F7YBW8_MESOW_theta_0.2.npy,F7YBW8_MESOW.pdb,1-93,0.1,,OrganismalFitness
+FECA_ECOLI_Tsuboyama_2023_2D1U,FECA_ECOLI_Tsuboyama_2023_2D1U.csv,FECA_ECOLI,Prokaryote,Escherichia coli,QVNIAPGSLDKALNQYAAHSGFTLSVDASLTRGKQSNGLHGDYDVESGLQQLLDGSGLQVKPLGNNSWTLEP,72,TRUE,1886,1219,667,-0.813576222,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-72,Fe(3+) dicitrate transport protein FecA,Stability,cDNA display proteolysis,FECA_ECOLI_2023-08-07_b06.a2m,1,72,72,0.6,0.2,74248,0.986,71,9949.9,140.1394366,High,63,0.8873239437,Tsuboyama2023_Dataset2_Dataset16,ddG_ML_float,1,mut_type,FECA_ECOLI_theta0.2_2023-08-07_b06.npy,FECA_ECOLI.pdb,1-72,1,,Stability
+FKBP3_HUMAN_Tsuboyama_2023_2KFV,FKBP3_HUMAN_Tsuboyama_2023_2KFV.csv,FKBP3_HUMAN,Human,Homo sapiens,VPQRAWTVEQLRSEQLPKKDIIKFLQEHGSDSFLAEHKLLGNIKNVAKTANKDHLVTAYNHLFETKRFK,69,FALSE,1237,1237,0,-0.1631252002,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-69,Peptidyl-prolyl cis-trans isomerase FKBP3,Stability,cDNA display proteolysis,FKBP3_HUMAN_2023-08-07_b03.a2m,1,69,69,0.3,0.2,3216,0.957,66,132,2,Medium,7,0.1060606061,Tsuboyama2023_Dataset2_Dataset17,ddG_ML_float,1,mut_type,FKBP3_HUMAN_theta0.2_2023-08-07_b03.npy,FKBP3_HUMAN.pdb,1-69,1,,Stability
+GAL4_YEAST_Kitzman_2015,GAL4_YEAST_Kitzman_2015.csv,GAL4_YEAST,Eukaryote,Saccharomyces cerevisiae,MKLLSSIEQACDICRLKKLKCSKEKPKCAKCLKNNWECRYSPKTKRSPLTRAHLTEVESRLERLEQLFLLIFPREDLDMILKMDSLQDIKALLTGLFVQDNVNKDAVTDRLASVETDMPLTLRQHRISATSSSEESSNKGQRQLTVSIDSAAHHDNSTIPLDFMPRDALHGFDWSEEDDMSDGLPFLKTDPNNNGFFGDGSLLCILRSIGFKPENYTNSNVNRLPTMITDRYTLASRSTTSRLLQSYLNNFHPYCPIVHSPTLMMLYNNQIEIASKDQWQILFNCILAIGAWCIEGESTDIDVFYYQNAKSHLTSKVFESGSIILVTALHLLSRYTQWRQKTNTSYNFHSFSIRMAISLGLNRDLPSSFSDSSILEQRRRIWWSVYSWEIQLSLLYGRSIQLSQNTISFPSSVDDVQRTTTGPTIYHGIIETARLLQVFTKIYELDKTVTAEKSPICAKKCLMICNEIEEVSRQAPKFLQMDISTTALTNLLKEHPWLSFTRFELKWKQLSLIIYVLRDFFTNFTQKKSQLEQDQNDHQSYEVKRCSIMLSDAAQRTVMSVSSYMDNHNVTPYFAWNCSYYLFNAVLVPIKTLLSNSKSNAENNETAQLLQQINTVLMLLKKLATFKIQTCEKYIQVLEEVCAPFLLSQCAIPLPHISYNNSNGSAIKNIVGSATIAQYPTLPEENVNNISVKYVSPGSVGPSPVPLKSGASFSDLVKLLSNRPPSRNSPVTIPRSTPSHRSVTPFLGQQQQLQSLVPLTPSALFGGANFNQSGNIADSSLSFTFTNSSNGPNLITTQTNSQALSQPIASSNVHDNFMNNEITASKIDDGNNSKPLSPGWTDQTAYNAFGITTGMFNTTTMDDVYNYLFDDEDTPPNPKKE,881,FALSE,1195,1195,0,-8,manual,Kitzman,Massively parallel single-amino-acid mutagenesis,2015,10.1038/nmeth.3223,2-65,GAL4,"Growth (no selection, 24h)",Growth,GAL4_YEAST_full_11-26-2021_b02.a2m,1,881,881,0.2,0.2,16159,0.707,623,7942.3,12.74847512,medium,163,0.2616372392,GAL4_YEAST_Kitzman_2015.csv,SEL_C_64h,1,mutant,GAL4_YEAST_theta_0.2.npy,GAL4_YEAST.pdb,1-881,0.1,,OrganismalFitness
+GCN4_YEAST_Staller_2018,GCN4_YEAST_Staller_2018.csv,GCN4_YEAST,Eukaryote,Saccharomyces cerevisiae,MSEYQPSLFALNPMGFSPLDGSKSTNENVSASTSTAKPMVGQLIFDKFIKTEEDPIIKQDTPSNLDFDFALPQTATAPDAKTVLPIPELDDAVVESFFSSSTDSTPMFEYENLEDNSKEWTSLFDNDIPVTTDDVSLADKAIESTEEVSLVPSNLEVSTTSFLPTPVLEDAKLTQTRKVKKPNSVVKKSHHVGKDDESRLDHLGVVAYNRKQRSIPLSPIVPESSDPAALKRARNTEAARRSRARKLQRMKQLEDKVEELLSKNYHLENEVARLKKLVGER,281,TRUE,2638,33,2605,1.293757864,median,Staller,A High-Throughput Mutational Scan of an Intrinsically Disordered Acidic Transcriptional Activation Domain,2018,10.1016/j.cels.2018.01.015,101-144,Gcn4,Binding,FACS,GCN4_YEAST_full_24-02-2022_b03.a2m,1,281,281,0.3,0.2,350,0.719,202,177.9,0.8806930693,low,1,0.00495049505,GCN4_YEAST_Staller_2018.csv,Induction,1,mutant,GCN4_YEAST_theta_0.2.npy,GCN4_YEAST.pdb,1-281,0.1,,Binding
+GDIA_HUMAN_Silverstein_2021,GDIA_HUMAN_Silverstein_2021.csv,GDIA_HUMAN,Human,Homo sapiens,MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESSSITPLEELYKRFQLLEGPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVVEGSFVYKGGKIYKVPSTETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEGVDPQTTSMRDVYRKFDLGQDVIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLARYGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVARCKQLICDPSYIPDRVRKAGQVIRIICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISYAHNVAAQGKYIAIASTTVETTDPEKEVEPALELLEPIDQKFVAISDLYEPIDDGCESQVFCSCSYDATTHFETTCNDIKDIYKRMAGTAFDFENMKRKQNDVFGEAEQ,447,FALSE,1154,1154,0,0.8425936955,median,Silverstein,A systematic genotype-phenotype map for missense variants in the human intellectual disability-associated gene GDI1,2021,10.1101/2021.10.06.463360,2-447,GDI1,Yeast Growth,Growth,GDIA_HUMAN_2023-10-12_b05.a2m,1,447,447,0.5,0.2,5196,0.996,445,398.1,0.8946067416,Low,86,0.193258427,media-1.xlsx,fitness,1,mutant,GDIA_HUMAN_theta0.2_2023-10-12_b05.npy,GDIA_HUMAN.pdb,1-447,1,,OrganismalFitness
+GFP_AEQVI_Sarkisyan_2016,GFP_AEQVI_Sarkisyan_2016.csv,GFP_AEQVI,Eukaryote,Aequorea victoria,MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK,238,TRUE,51714,1084,50630,2.5,manual,Sarkisyan,Local fitness landscape of the green fluorescent protein,2016,10.1038/nature17995,3-237,GFP,Fluorescence,FACS,GFP_AEQVI_full_04-29-2022_b08.a2m,1,238,238,0.8,0.2,396,0.975,232,14.9,0.06422413793,low,0,0,GFP_AEQVI_Sarkisyan_2016.csv,mean_medianBrightness_per_aaseq,1,mutant,GFP_AEQVI_theta_0.2.npy,GFP_AEQVI.pdb,1-238,0.1,,Activity
+GLPA_HUMAN_Elazar_2016,GLPA_HUMAN_Elazar_2016.csv,GLPA_HUMAN,Human,Homo sapiens,MYGKIIFVLLLSEIVSISASSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTDVPLSSVEIENPETSDQ,150,FALSE,245,245,0,0.9321105779,median,Elazar,Mutational scanning reveals the determinants of protein insertion and association energetics in the plasma membrane,2016,10.7554/eLife.12125,94-108,Glycophorin A membrane domain,Membrane-protein insertion,TOXCAT-Beta-lactamase (TbL) screen,GLPA_HUMAN_2023-10-12_b03.a2m,1,150,150,0.3,0.2,273,1,150,81,0.54,Low,1,0.006666666667,urn_mavedb_00000051-c-1_scores.csv,score,-1,mutant,GLPA_HUMAN_theta0.2_2023-10-12_b03.npy,GLPA_HUMAN.pdb,1-150,1,93,Expression
+GRB2_HUMAN_Faure_2021,GRB2_HUMAN_Faure_2021.csv,GRB2_HUMAN,Human,Homo sapiens,MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPKNYIEMKPHPWFFGKIPRAKAEEMLSKQRHDGAFLIRESESAPGDFSLSVKFGNDVQHFKVLRDGAGKYFLWVVKFNSLNELVDYHRSTSVSRNQQIFLRDIEQVPQQPTYVQALFDFDPQEDGELGFRRGDFIHVMDNSDPNWWKGACHGQTGMFPRNYVTPVNRNV,217,TRUE,63366,1034,62332,-0.7,manual,Faure,Mapping the energetic and allosteric landscapes of protein binding domains,2022,10.1038/s41586-022-04586-4,159-214,GRB2-SH3,Yeast growth,Growth,GRB2_HUMAN_full_11-26-2021_b05.a2m,1,217,217,0.5,0.2,33228,0.816,177,1485.9,8.394915254,medium,42,0.2372881356,GRB2_HUMAN_Faure_2021.csv,fitness,1,mutant,GRB2_HUMAN_theta_0.2.npy,GRB2_HUMAN.pdb,1-217,0.1,,OrganismalFitness
+HCP_LAMBD_Tsuboyama_2023_2L6Q,HCP_LAMBD_Tsuboyama_2023_2L6Q.csv,HCP_LAMBD,Virus,Escherichia phage lambda (Bacteriophage lambda),VRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGM,55,FALSE,1040,1040,0,-0.350614016,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,Head completion protein,Stability,cDNA display proteolysis,HCP_LAMBD_2023-08-07_b05.a2m,1,55,55,0.5,0.01,2128,0.945,52,606.5,11.66346154,Medium,15,0.2884615385,Tsuboyama2023_Dataset2_Dataset18,ddG_ML_float,1,mut_type,HCP_LAMBD_theta0.01_2023-08-07_b05.npy,HCP_LAMBD.pdb,1-55,1,,Stability
+HECD1_HUMAN_Tsuboyama_2023_3DKM,HECD1_HUMAN_Tsuboyama_2023_3DKM.csv,HECD1_HUMAN,Human,Homo sapiens,NLYFQGLKYMVPGARVTRGLDWKWRDQDGSPQGEGTVTGELHNGWIDVTWDAGGSNSYRMGAEGKFDLKLAP,72,TRUE,5586,1244,4342,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-72,E3 ubiquitin-protein ligase HECTD1,Stability,cDNA display proteolysis,HECD1_HUMAN_2023-08-07_b03.a2m,1,72,72,0.3,0.2,18660,0.903,65,1192.3,18.34307692,Medium,24,0.3692307692,Tsuboyama2023_Dataset2_Dataset19,ddG_ML_float,1,mut_type,HECD1_HUMAN_theta0.2_2023-08-07_b03.npy,HECD1_HUMAN.pdb,1-72,1,,Stability
+HEM3_HUMAN_Loggerenberg_2023,HEM3_HUMAN_Loggerenberg_2023.csv,HEM3_HUMAN,Human,Homo sapiens,MSGNGNAAATAEENSPKMRVIRVGTRKSQLARIQTDSVVATLKASYPGLQFEIIAMSTTGDKILDTALSKIGEKSLFTKELEHALEKNEVDLVVHSLKDLPTVLPPGFTIGAICKRENPHDAVVFHPKFVGKTLETLPEKSVVGTSSLRRAAQLQRKFPHLEFRSIRGNLNTRLRKLDEQQEFSAIILATAGLQRMGWHNRVGQILHPEECMYAVGQGALGVEVRAKDQDILDLVGVLHDPETLLRCIAERAFLRHLEGGCSVPVAVHTAMKDGQLYLTGGVWSLDGSDSIQETMQATIHVPAQHEDGPEDDPQLVGITARNIPRGPQLAAQNLGISLANLLLSKGAKNILDVARQLNDAH,361,FALSE,5689,5689,0,0.6142990455,median,van Loggerenberg,Systematically testing human HMBS missense variants to reveal mechanism and pathogenic variation,2023,10.1101/2023.02.06.527353,19-360,hydroxymethylbilane synthase,activity,Yeast complementation,HEM3_HUMAN_2023-08-07_b02.a2m,1,361,361,0.2,0.2,59544,0.85,307,11510.2,37.49250814,Medium,500,1.628664495,,score,1,mutant,HEM3_HUMAN_theta0.2_2023-08-07_b02.npy,HEM3_HUMAN.pdb,1-361,1,,Activity
+HIS7_YEAST_Pokusaeva_2019,HIS7_YEAST_Pokusaeva_2019.csv,HIS7_YEAST,Eukaryote,Saccharomyces cerevisiae,MTEQKALVKRITNETKIQIAISLKGGPLAIEHSIFPEKEAEAVAEQATQSQVINVHTGIGFLDHMIHALAKHSGWSLIVECIGDLHIDDHHTTEDCGIALGQAFKEALGAVRGVKRFGSGFAPLDEALSRAVVDLSNRPYAVVELGLQREKVGDLSCEMIPHFLESFAEASRITLHVDCLRGKNDHHRSESAFKALAVAIREATSPNGTNDVPSTKGVLM,220,TRUE,496137,168,495969,0.3,manual,Pokusaeva,An experimental assay of the interactions of amino acids from orthologous sequences shaping a complex fitness landscape,2019,10.1371/journal.pgen.1008079,6-211,IGP dehydratase (HIS3),Growth,Growth,HIS7_YEAST_full_11-26-2021_b09.a2m,1,220,220,0.9,0.2,40154,0.873,192,5191.3,27.03802083,medium,318,1.65625,HIS7_YEAST_Pokusaeva_2019.csv,selection,1,mutant,HIS7_YEAST_theta_0.2.npy,HIS7_YEAST.pdb,1-220,0.1,,OrganismalFitness
+HMDH_HUMAN_Jiang_2019,HMDH_HUMAN_Jiang_2019.csv,HMDH_HUMAN,Human,Homo sapiens,MLSRLFRMHGLFVASHPWEVIVGTVTLTICMMSMNMFTGNNKICGWNYECPKFEEDVLSSDIIILTITRCIAILYIYFQFQNLRQLGSKYILGIAGLFTIFSSFVFSTVVIHFLDKELTGLNEALPFFLLLIDLSRASTLAKFALSSNSQDEVRENIARGMAILGPTFTLDALVECLVIGVGTMSGVRQLEIMCCFGCMSVLANYFVFMTFFPACVSLVLELSRESREGRPIWQLSHFARVLEEEENKPNPVTQRVKMIMSLGLVLVHAHSRWIADPSPQNSTADTSKVSLGLDENVSKRIEPSVSLWQFYLSKMISMDIEQVITLSLALLLAVKYIFFEQTETESTLSLKNPITSPVVTQKKVPDNCCRREPMLVRNNQKCDSVEEETGINRERKVEVIKPLVAETDTPNRATFVVGNSSLLDTSSVLVTQEPEIELPREPRPNEECLQILGNAEKGAKFLSDAEIIQLVNAKHIPAYKLETLMETHERGVSIRRQLLSKKLSEPSSLQYLPYRDYNYSLVMGACCENVIGYMPIPVGVAGPLCLDEKEFQVPMATTEGCLVASTNRGCRAIGLGGGASSRVLADGMTRGPVVRLPRACDSAEVKAWLETSEGFAVIKEAFDSTSRFARLQKLHTSIAGRNLYIRFQSRSGDAMGMNMISKGTEKALSKLHEYFPEMQILAVSGNYCTDKKPAAINWIEGRGKSVVCEAVIPAKVVREVLKTTTEAMIEVNINKNLVGSAMAGSIGGYNAHAANIVTAIYIACGQDAAQNVGSSNCITLMEASGPTNEDLYISCTMPSIEIGTVGGGTNLLPQQACLQMLGVQGACKDNPGENARQLARIVCGTVMAGELSLMAALAAGHLVKSHMIHNRSKINLQDLQGACTKKTA,888,FALSE,16853,16853,0,0.48275,median,Jiang,Exhaustive mapping of missense variation in coronary heart disease-related genes,2019,https://hdl.handle.net/1807/98076,2-888,3-hydroxy-3-methylglutaryl-coenzyme A reductase,Fitness,Resistance to statin inhibition,HMDH_HUMAN_2023-10-12_b05.a2m,1,888,888,0.5,0.2,3153,0.995,884,554.6,0.6273755656,Low,778,0.8800904977,urn_mavedb_00000035-a-1_scores.csv,score,1,mutant,HMDH_HUMAN_theta0.2_2023-10-12_b05.npy,HMDH_HUMAN.pdb,1-888,1,,OrganismalFitness
+HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2,HSP82_YEAST_Cote-Hammarlof_2020_growth-H2O2.csv,HSP82_YEAST,Eukaryote,Saccharomyces cerevisiae,MASETFEFQAEITQLMSLIINTVYSNKEIFLRELISNASDALDKIRYKSLSDPKQLETEPDLFIRITPKPEQKVLEIRDSGIGMTKAELINNLGTIAKSGTKAFMEALSAGADVSMIGQFGVGFYSLFLVADRVQVISKSNDDEQYIWESNAGGSFTVTLDEVNERIGRGTILRLFLKDDQLEYLEEKRIKEVIKRHSEFVAYPIQLVVTKEVEKEVPIPEEEKKDEEKKDEEKKDEDDKKPKLEEVDEEEEKKPKTKKVKEEVQEIEELNKTKPLWTRNPSDITQEEYNAFYKSISNDWEDPLYVKHFSVEGQLEFRAILFIPKRAPFDLFESKKKKNNIKLYVRRVFITDEAEDLIPEWLSFVKGVVDSEDLPLNLSREMLQQNKIMKVIRKNIVKKLIEAFNEIAEDSEQFEKFYSAFSKNIKLGVHEDTQNRAALAKLLRYNSTKSVDELTSLTDYVTRMPEHQKNIYYITGESLKAVEKSPFLDALKAKNFEVLFLTDPIDEYAFTQLKEFEGKTLVDITKDFELEETDEEKAEREKEIKEYEPLTKALKEILGDQVEKVVVSYKLLDAPAAIRTGQFGWSANMERIMKAQALRDSSMSSYMSSKKTFEISPKSPIIKELKKRVDEGGAQDKTVKDLTKLLYETALLTSGFSLDEPTSFASRINRLISLGLNIDEDEETETAPEASTAAPVEEVPADTEMEEVD,709,FALSE,2252,2252,0,-0.0020874765,median,Cote-Hammarlof,The Adaptive Potential of the Middle Domain of Yeast Hsp90,2020,10.1093/molbev/msaa211,291-409,HSP82,Growth (H2O2 stress),,HSP82_YEAST_2023-08-07_b01.a2m,1,709,709,0.1,0.2,48695,0.917,650,4395.2,6.761846154,Medium,531,0.8169230769,,score,1,mut_proteingym,HSP82_YEAST_theta0.2_2023-08-07_b01.npy,HSP82_YEAST.pdb,1-709,1,290,OrganismalFitness
+HSP82_YEAST_Flynn_2019,HSP82_YEAST_Flynn_2019.csv,HSP82_YEAST,Eukaryote,Saccharomyces cerevisiae,MASETFEFQAEITQLMSLIINTVYSNKEIFLRELISNASDALDKIRYKSLSDPKQLETEPDLFIRITPKPEQKVLEIRDSGIGMTKAELINNLGTIAKSGTKAFMEALSAGADVSMIGQFGVGFYSLFLVADRVQVISKSNDDEQYIWESNAGGSFTVTLDEVNERIGRGTILRLFLKDDQLEYLEEKRIKEVIKRHSEFVAYPIQLVVTKEVEKEVPIPEEEKKDEEKKDEEKKDEDDKKPKLEEVDEEEEKKPKTKKVKEEVQEIEELNKTKPLWTRNPSDITQEEYNAFYKSISNDWEDPLYVKHFSVEGQLEFRAILFIPKRAPFDLFESKKKKNNIKLYVRRVFITDEAEDLIPEWLSFVKGVVDSEDLPLNLSREMLQQNKIMKVIRKNIVKKLIEAFNEIAEDSEQFEKFYSAFSKNIKLGVHEDTQNRAALAKLLRYNSTKSVDELTSLTDYVTRMPEHQKNIYYITGESLKAVEKSPFLDALKAKNFEVLFLTDPIDEYAFTQLKEFEGKTLVDITKDFELEETDEEKAEREKEIKEYEPLTKALKEILGDQVEKVVVSYKLLDAPAAIRTGQFGWSANMERIMKAQALRDSSMSSYMSSKKTFEISPKSPIIKELKKRVDEGGAQDKTVKDLTKLLYETALLTSGFSLDEPTSFASRINRLISLGLNIDEDEETETAPEASTAAPVEEVPADTEMEEVD,709,FALSE,13294,13294,0,-0.3,manual,Flynn,Comprehensive fitness maps of Hsp90 show widespread environmental dependence,2019,10.7554/eLife.53810,2-709,HSP82,"growth, nitrogen depletion (0.0125% ammonium sulfate), hyperosmotic shock (0.8 M NaCl), alcohol stress (7.5% ethanol), sulfhydryl-oxidation (0.85 mM diamide), temperature shock (37C)",,HSP82_YEAST_full_11-26-2021_b01.a2m,1,709,709,0.1,0.2,38923,0.862,611,3684.8,6.030769231,medium,433,0.7086743044,HSP82_YEAST_Flynn_2019.csv,s (37°C),1,mutant,HSP82_YEAST_theta_0.2.npy,HSP82_YEAST.pdb,1-709,1,,OrganismalFitness
+HSP82_YEAST_Mishra_2016,HSP82_YEAST_Mishra_2016.csv,HSP82_YEAST,Eukaryote,Saccharomyces cerevisiae,MASETFEFQAEITQLMSLIINTVYSNKEIFLRELISNASDALDKIRYKSLSDPKQLETEPDLFIRITPKPEQKVLEIRDSGIGMTKAELINNLGTIAKSGTKAFMEALSAGADVSMIGQFGVGFYSLFLVADRVQVISKSNDDEQYIWESNAGGSFTVTLDEVNERIGRGTILRLFLKDDQLEYLEEKRIKEVIKRHSEFVAYPIQLVVTKEVEKEVPIPEEEKKDEEKKDEEKKDEDDKKPKLEEVDEEEEKKPKTKKVKEEVQEIEELNKTKPLWTRNPSDITQEEYNAFYKSISNDWEDPLYVKHFSVEGQLEFRAILFIPKRAPFDLFESKKKKNNIKLYVRRVFITDEAEDLIPEWLSFVKGVVDSEDLPLNLSREMLQQNKIMKVIRKNIVKKLIEAFNEIAEDSEQFEKFYSAFSKNIKLGVHEDTQNRAALAKLLRYNSTKSVDELTSLTDYVTRMPEHQKNIYYITGESLKAVEKSPFLDALKAKNFEVLFLTDPIDEYAFTQLKEFEGKTLVDITKDFELEETDEEKAEREKEIKEYEPLTKALKEILGDQVEKVVVSYKLLDAPAAIRTGQFGWSANMERIMKAQALRDSSMSSYMSSKKTFEISPKSPIIKELKKRVDEGGAQDKTVKDLTKLLYETALLTSGFSLDEPTSFASRINRLISLGLNIDEDEETETAPEASTAAPVEEVPADTEMEEVD,709,FALSE,4323,4323,0,-0.4,manual,Mishra,Systematic Mutant Analyses Elucidate General and Client-Specific Aspects of Hsp90 Function,2016,10.1016/j.celrep.2016.03.046,2-231,HSP82,Growth,Growth,HSP82_YEAST_full_11-26-2021_b01.a2m,1,709,709,0.1,0.2,38923,0.862,611,3684.8,6.030769231,medium,433,0.7086743044,HSP82_YEAST_Mishra_2016.csv,selection_coefficient,1,mutant,HSP82_YEAST_theta_0.2.npy,HSP82_YEAST.pdb,1-709,0.1,,OrganismalFitness
+HXK4_HUMAN_Gersing_2022_activity,HXK4_HUMAN_Gersing_2022_activity.csv,HXK4_HUMAN,Human,Homo sapiens,MLDDRARMEAAKKEKVEQILAEFQLQEEDLKKVMRRMQKEMDRGLRLETHEEASVKMLPTYVRSTPEGSEVGDFLSLDLGGTNFRVMLVKVGEGEEGQWSVKTKHQMYSIPEDAMTGTAEMLFDYISECISDFLDKHQMKHKKLPLGFTFSFPVRHEDIDKGILLNWTKGFKASGAEGNNVVGLLRDAIKRRGDFEMDVVAMVNDTVATMISCYYEDHQCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCVNTEWGAFGDSGELDEFLLEYDRLVDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDENLLFHGEASEQLRTRGAFETRFVSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACESVSTRAAHMCSAGLAGVINRMRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPSCEITFIESEEGSGRGAALVSAVACKKACMLGQ,465,FALSE,8570,8570,0,0.5631652235,median,Gersing,A comprehensive map of human glucokinase variant activity,2022,10.1101/2022.05.04.490571,2-465,glucokinase regulatory protein,functional complementation to reduced growth on glucose medium,enzymatic activity,HXK4_HUMAN_b0.1.a2m,1,465,465,0.1,0.2,23354,1,465,2336.1,5.023870968,Medium,181,0.3892473118,HXK4_HUMAN_Gersing_2022.csv,score,1,mutant,HXK4_HUMAN_theta_0.2.npy,HXK4_HUMAN.pdb,1-465,1,,OrganismalFitness
+HXK4_HUMAN_Gersing_2023_abundance,HXK4_HUMAN_Gersing_2023_abundance.csv,HXK4_HUMAN,Human,Homo sapiens,MLDDRARMEAAKKEKVEQILAEFQLQEEDLKKVMRRMQKEMDRGLRLETHEEASVKMLPTYVRSTPEGSEVGDFLSLDLGGTNFRVMLVKVGEGEEGQWSVKTKHQMYSIPEDAMTGTAEMLFDYISECISDFLDKHQMKHKKLPLGFTFSFPVRHEDIDKGILLNWTKGFKASGAEGNNVVGLLRDAIKRRGDFEMDVVAMVNDTVATMISCYYEDHQCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCVNTEWGAFGDSGELDEFLLEYDRLVDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDENLLFHGEASEQLRTRGAFETRFVSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACESVSTRAAHMCSAGLAGVINRMRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPSCEITFIESEEGSGRGAALVSAVACKKACMLGQ,465,FALSE,8396,8396,0,0.6,manual,Gersing,Characterizing glucokinase variant mechanisms using a multiplexed abundance assay,2023,10.1101/2023.05.24.542036,2-465,GCK,abundance,Growth,HXK4_HUMAN_2023-08-07_b01.a2m,1,465,465,0.1,0.2,24177,0.966,449,2626.4,5.849443207,Medium,170,0.3786191537,HXK4_HUMAN_Gersing_2022.csv,score,1,mutant,HXK4_HUMAN_theta0.2_2023-08-07_b01.npy,HXK4_HUMAN.pdb,1-465,1,,Expression
+I6TAH8_I68A0_Doud_2015,I6TAH8_I68A0_Doud_2015.csv,I6TAH8_I68A0,Virus,Influenza A virus (strain A/Aichi/2/1968 H3N2),MASQGTKRSYEQMETDGERQNATEIRASVGKMIDGIGRFYIQMCTELKLSDYEGRLIQNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYKRVDRKWMRELVLYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDTTYQRTRALVRTGMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIKRGINDRNFWRGENGRKTRSAYERMCNILKGKFQTAAQRAMMDQVRESRNPGNAEIEDLIFLARSALILRGSVAHKSCLPACVYGPAVASGYDFEKEGYSLVGIDPFKLLQNSQVYSLIRPNENPAHKSQLVWMACNSAAFEDLRVLSFIRGTKVSPRGKLSTRGVQIASNENMDAMESSTLELRSRYWAIRTRSGGNTNQQRASAGQISVQPAFSVQRNLPFDKPTIMAAFTGNTEGRTSDMRAEIIRMMEGAKPEEMSFQGRGVFELSDERAANPIVPSFDMSNEGSYFFGDNAEEYDN,498,FALSE,9462,9462,0,-2.329469119,median,Doud,Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs,2015,10.1093/molbev/msv167,1-498,Influenza nucleoprotein,,Growth,I6TAH8_I68A0_theta0.99_full_11-26-2021_b09.a2m,1,498,498,0.9,0.01,15390,1,498,1493.3,2.998594378,medium,2118,4.253012048,I6TAH8_I68A0_Doud_2015.csv,log_fitness_by_syn_mut_fitness,1,mutant,I6TAH8_I68A0_theta_0.01.npy,I6TAH8_I68A0.pdb,1-498,0.1,,OrganismalFitness
+IF1_ECOLI_Kelsic_2016,IF1_ECOLI_Kelsic_2016.csv,IF1_ECOLI,Prokaryote,Escherichia coli,MAKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYDLSKGRIVFRSR,72,FALSE,1367,1367,0,0.8,manual,Kelsic,RNA Structural Determinants of Optimal Codons Revealed by MAGE-Seq,2016,10.1016/j.cels.2016.11.004,1-72,infA,Growth,Growth,IF1_ECOLI_full_11-26-2021_b02.a2m,1,72,72,0.2,0.2,361806,0.806,58,38189,658.4310345,high,46,0.7931034483,IF1_ECOLI_Kelsic_2016.csv,fitness_rich,1,mutant,IF1_ECOLI_theta_0.2.npy,IF1_ECOLI.pdb,1-72,0.1,,OrganismalFitness
+ILF3_HUMAN_Tsuboyama_2023_2L33,ILF3_HUMAN_Tsuboyama_2023_2L33.csv,ILF3_HUMAN,Human,Homo sapiens,MLTKHGKNPVMELNEKRRGLKYELISETGGSHDKRFVMEVEVDGQKFQGAGSNKKVAKAYAALAALEKLFP,71,FALSE,1329,1329,0,-0.4,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-71,Interleukin enhancer-binding factor 3,Stability,cDNA display proteolysis,ILF3_HUMAN_2023-08-07_b03.a2m,1,71,71,0.3,0.2,145438,0.915,65,21228,326.5846154,High,57,0.8769230769,Tsuboyama2023_Dataset2_Dataset20,ddG_ML_float,1,mut_type,ILF3_HUMAN_theta0.2_2023-08-07_b03.npy,ILF3_HUMAN.pdb,1-71,1,,Stability
+ISDH_STAAW_Tsuboyama_2023_2LHR,ISDH_STAAW_Tsuboyama_2023_2LHR.csv,ISDH_STAAW,Prokaryote,Staphylococcus aureus,YNLQKLLAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKS,55,TRUE,1944,940,1004,-0.7942702247,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,Iron-regulated surface determinant protein H,Stability,cDNA display proteolysis,ISDH_STAAW_2023-08-07_b01.a2m,1,55,55,0.1,0.2,115488,0.818,45,38123.1,847.18,High,6,0.1333333333,Tsuboyama2023_Dataset2_Dataset21,ddG_ML_float,1,mut_type,ISDH_STAAW_theta0.2_2023-08-07_b01.npy,ISDH_STAAW.pdb,1-55,1,,Stability
+KCNE1_HUMAN_Muhammad_2023_expression,KCNE1_HUMAN_Muhammad_2023_expression.csv,KCNE1_HUMAN,Human,Homo sapiens,MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSGDGKLEALYVLMVLGFFGFFTLGIMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNTHLPETKPSP,129,FALSE,2339,2339,0,0.75,manual,Muhammad,"High-throughput functional mapping of variants in an arrhythmia gene, KCNE1, reveals novel biology",2023,10.1101/2023.04.28.538612,1-128,KCNE1,cell surface expression,FACS,KCNE1_HUMAN_2023-08-07_b02.a2m,1,129,129,0.2,0.2,2118,0.969,125,213.7,1.7096,Medium,5,0.04,KCNE1_HUMAN_Muhammad_2023.csv,TrafScore,1,mutant,KCNE1_HUMAN_theta0.2_2023-08-07_b02.npy,KCNE1_HUMAN.pdb,1-129,1,,Expression
+KCNE1_HUMAN_Muhammad_2023_function,KCNE1_HUMAN_Muhammad_2023_function.csv,KCNE1_HUMAN,Human,Homo sapiens,MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSGDGKLEALYVLMVLGFFGFFTLGIMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNTHLPETKPSP,129,FALSE,2315,2315,0,0.9043514345,median,Muhammad,"High-throughput functional mapping of variants in an arrhythmia gene, KCNE1, reveals novel biology",2023,10.1101/2023.04.28.538612,1-128,KCNE1,potassium channel function,Growth,KCNE1_HUMAN_2023-08-07_b02.a2m,1,129,129,0.2,0.2,2118,0.969,125,213.7,1.7096,Medium,5,0.04,KCNE1_HUMAN_Muhammad_2023.csv,funcScore,1,mutant,KCNE1_HUMAN_theta0.2_2023-08-07_b02.npy,KCNE1_HUMAN.pdb,1-129,1,,Activity
+KCNH2_HUMAN_Kozek_2020,KCNH2_HUMAN_Kozek_2020.csv,KCNH2_HUMAN,Human,Homo sapiens,MPVRRGHVAPQNTFLDTIIRKFEGQSRKFIIANARVENCAVIYCNDGFCELCGYSRAEVMQRPCTCDFLHGPRTQRRAAAQIAQALLGAEERKVEIAFYRKDGSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEKDMVGSPAHDTNHRGPPTSWLAPGRAKTFRLKLPALLALTARESSVRSGGAGGAGAPGAVVVDVDLTPAAPSSESLALDEVTAMDNHVAGLGPAEERRALVGPGSPPRSAPGQLPSPRAHSLNPDASGSSCSLARTRSRESCASVRRASSADDIEAMRAGVLPPPPRHASTGAMHPLRSGLLNSTSDSDLVRYRTISKIPQITLNFVDLKGDPFLASPTSDREIIAPKIKERTHNVTEKVTQVLSLGADVLPEYKLQAPRIHRWTILHYSPFKAVWDWLILLLVIYTAVFTPYSAAFLLKETEEGPPATECGYACQPLAVVDLIVDIMFIVDILINFRTTYVNANEEVVSHPGRIAVHYFKGWFLIDMVAAIPFDLLIFGSGSEELIGLLKTARLLRLVRVARKLDRYSEYGAAVLFLLMCTFALIAHWLACIWYAIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKYVTALYFTFSSLTSVGFGNVSPNTNSEKIFSICVMLIGSLMYASIFGNVSAIIQRLYSGTARYHTQMLRVREFIRFHQIPNPLRQRLEEYFQHAWSYTNGIDMNAVLKGFPECLQADICLHLNRSLLQHCKPFRGATKGCLRALAMKFKTTHAPPGDTLVHAGDLLTALYFISRGSIEILRGDVVVAILGKNDIFGEPLNLYARPGKSNGDVRALTYCDLHKIHRDDLLEVLDMYPEFSDHFWSSLEITFNLRDTNMIPGSPGSTELEGGFSRQRKRKLSFRRRTDKDTEQPGEVSALGPGRAGAGPSSRGRPGGPWGESPSSGPSSPESSEDEGPGRSSSPLRLVPFSSPRPPGEPPGGEPLMEDCEKSSDTCNPLSGAFSGVSNIFSFWGDSRGRQYQELPRCPAPTPSLLNIPLSSPGRRPRGDVESRLDALQRQLNRLETRLSADMATVLQLLQRQMTLVPPAYSAVTTPGPGPTSTSPLLPVSPLPTLTLDSLSQVSQFMACEELPPGAPELPQEGPTRRLSLPGQLGALTSQPLHRHGSDPGS,1159,FALSE,200,200,0,58.87492867,median,Kozek,High-throughput discovery of trafficking-deficient variants in the cardiac potassium channel KCNH2: Deep mutational scan of KCNH2 trafficking,2020,10.1016/j.hrthm.2020.05.041,545-555,KCNH2,Voltage,Voltage,KCNH2_HUMAN_535-565_11-26-2021_b05.a2m,535,565,31,0.5,0.2,13907,1,31,186.6,6.019354839,medium,1,0.03225806452,KCNH2_HUMAN_Kozek_2020.csv,score.ave,1,var,KCNH2_HUMAN_theta_0.2.npy,KCNH2_HUMAN.pdb,1-1159,0.1,,Activity
+KCNJ2_MOUSE_Coyote-Maestas_2022_function,KCNJ2_MOUSE_Coyote-Maestas_2022_function.csv,KCNJ2_MOUSE,Eukaryote,Mus musculus,MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDTSKVSKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEEEDSENGVPESTSTDSPPGIDLHNQASVPLEPRPLRRESEI,428,FALSE,6963,6963,0,0.039,median,Coyote-Maestas,"Determinants of trafficking, conduction, and disease within a K+ channel revealed through multiparametric deep mutational scanning",2022,10.7554/eLife.76903,2-392,Kir2.1,Ion conduction,FACS,KCNJ2_MOUSE_b01.a2m,1,428,428,0.1,0.2,20953,0.86,370,986.7,2.666756757,Medium,94,0.2540540541,,function_score,1,mutant_noflag,KCNJ2_MOUSE_b01_theta_0.2.npy,KCNJ2_MOUSE.pdb,1-428,1,,Activity
+KCNJ2_MOUSE_Coyote-Maestas_2022_surface,KCNJ2_MOUSE_Coyote-Maestas_2022_surface.csv,KCNJ2_MOUSE,Eukaryote,Mus musculus,MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDTSKVSKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEEEDSENGVPESTSTDSPPGIDLHNQASVPLEPRPLRRESEI,428,FALSE,6917,6917,0,-0.157352583,median,Coyote-Maestas,"Determinants of trafficking, conduction, and disease within a K+ channel revealed through multiparametric deep mutational scanning",2022,10.7554/eLife.76903,2-392,Kir2.1,Surface trafficking,FACS,KCNJ2_MOUSE_b01.a2m,1,428,428,0.1,0.2,20953,0.86,370,986.7,2.666756757,Medium,94,0.2540540541,,surface_score,1,mutant_noflag,KCNJ2_MOUSE_b01_theta_0.2.npy,KCNJ2_MOUSE.pdb,1-428,1,,Expression
+KKA2_KLEPN_Melnikov_2014,KKA2_KLEPN_Melnikov_2014.csv,KKA2_KLEPN,Prokaryote,Klebsiella pneumoniae,MIEQDGLHAGSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDVVTEAGRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERARTRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLVVTHGDACLPNIMVENGRFSGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFYRLLDEFF,264,FALSE,4960,4960,0,0.5,manual,Melnikov,Comprehensive mutational scanning of a kinasein vivoreveals substrate-dependent fitness landscapes,2014,10.1093/nar/gku511,1-264,"APH(3’)II, neo","Growth (225 ug/mL kanamycin) 1:1, 1:2, 1:4, 1:8 dilutions",Growth,KKA2_KLEPN_full_11-26-2021_b02.a2m,1,264,264,0.2,0.2,234760,0.795,210,76876.7,366.0795238,high,377,1.795238095,KKA2_KLEPN_Melnikov_2014.csv,Kan18_avg,1,mutant,KKA2_KLEPN_theta_0.2.npy,KKA2_KLEPN.pdb,1-264,0.1,,OrganismalFitness
+LGK_LIPST_Klesmith_2015,LGK_LIPST_Klesmith_2015.csv,LGK_LIPST,Eukaryote,Lipomyces starkeyi,MPIATSTGDNVLDFTVLGLNSGTSMDGIDCALCHFYQKTPDAPMEFELLEYGEVPLAQPIKQRVMRMILEDTTSPSELSEVNVILGEHFADAVRQFAAERNVDLSTIDAIASHGQTIWLLSMPEEGQVKSALTMAEGAIIAARTGITSITDFRISDQAAGRQGAPLIAFFDALLLHHPTKLRACQNIGGIANVCFIPPDVDGRRTDEYYDFDTGPGNVFIDAVVRHFTNGEQEYDKDGAMGKRGKVDQELVDDFLKMPYFQLDPPKTTGREVFRDTLAHDLIRRAEAKGLSPDDIVATTTRITAQAIVDHYRRYAPSQEIDEIFMCGGGAYNPNIVEFIQQSYPNTKIMMLDEAGVPAGAKEAITFAWQGMECLVGRSIPVPTRVETRQHYVLGKVSPGLNYRSVMKKGMAFGGDAQQLPWVSEMIVKKKGKVITNNWA,439,FALSE,7890,7890,0,-0.6245,median,Klesmith,Comprehensive Sequence-Flux Mapping of a Levoglucosan Utilization Pathway in E. coli,2015,10.1021/acssynbio.5b00131,1-439,LGK (levoglucosan kinase),Growth,Growth,LGK_LIPST_full_11-26-2021_b03.a2m,1,439,439,0.3,0.2,31069,0.813,357,7971,22.32773109,medium,588,1.647058824,B3VI55_LIPST_Klesmith_2015.csv,SelectionTwo,1,mutant,LGK_LIPST_theta_0.2.npy,LGK_LIPST.pdb,1-439,0.1,,Activity
+LYAM1_HUMAN_Elazar_2016,LYAM1_HUMAN_Elazar_2016.csv,LYAM1_HUMAN,Human,Homo sapiens,MIFPWKCQSTQRDLWNIFKLWGWTMLCCDFLAHHGTDCWTYHYSEKPMNWQRARRFCRDNYTDLVAIQNKAEIEYLEKTLPFSRSYYWIGIRKIGGIWTWVGTNKSLTEEAENWGDGEPNNKKNKEDCVEIYIKRNKDAGKWNDDACHKLKAALCYTASCQPWSCSGHGECVEIINNYTCNCDVGYYGPQCQFVIQCEPLEAPELGTMDCTHPLGNFSFSSQCAFSCSEGTNLTGIEETTCGPFGNWSSPEPTCQVIQCEPLSAPDLGIMNCSHPLASFSFTSACTFICSEGTELIGKKKTICESSGIWSNPSPICQKLDKSFSMIKEGDYNPLFIPVAVMVTAFSGLAFIIWLARRLKKGKKSKRSMNDPY,372,FALSE,359,359,0,1.306138768,median,Elazar,Mutational scanning reveals the determinants of protein insertion and association energetics in the plasma membrane,2016,10.7554/eLife.12125,333-355,L-selectin,Membrane-protein insertion,TOXCAT-Beta-lactamase (TbL) screen,LYAM1_HUMAN_2023-10-12_b04.a2m,1,372,372,0.4,0.2,3974,0.825,307,412.3,1.342996743,Medium,41,0.1335504886,urn_mavedb_00000051-a-1_scores.csv,score,-1,mutant,LYAM1_HUMAN_theta0.2_2023-10-12_b04.npy,LYAM1_HUMAN.pdb,1-372,1,332,Expression
+MAFG_MOUSE_Tsuboyama_2023_1K1V,MAFG_MOUSE_Tsuboyama_2023_1K1V.csv,MAFG_MOUSE,Eukaryote,Mus musculus,LTDEELVTMSVRELNQHLRGLSKEEIIQLKQRRRTLKNRGY,41,TRUE,1429,762,667,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-41,Transcription factor MafG,Stability,cDNA display proteolysis,MAFG_MOUSE_2023-08-07_b07.a2m,1,41,41,0.7,0.2,6178,1,41,156.7,3.82195122,Medium,4,0.09756097561,Tsuboyama2023_Dataset2_Dataset22,ddG_ML_float,1,mut_type,MAFG_MOUSE_theta0.2_2023-08-07_b07.npy,MAFG_MOUSE.pdb,1-41,1,,Stability
+MBD11_ARATH_Tsuboyama_2023_6ACV,MBD11_ARATH_Tsuboyama_2023_6ACV.csv,MBD11_ARATH,Eukaryote,Arabidopsis thaliana,VSVELPAPSSWKKLFYPNKVGSVKKTEVVFVAPTGEEISNRKQLEQYLKSHPGNPAIAEFDWTTSG,66,TRUE,2116,1155,961,-1.578921171,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-66,Methyl-CpG-binding domain-containing protein 11,Stability,cDNA display proteolysis,MBD11_ARATH_2023-08-07_b03.a2m,1,66,66,0.3,0.2,26035,0.909,60,1510.5,25.175,Medium,11,0.1833333333,Tsuboyama2023_Dataset2_Dataset23,ddG_ML_float,1,mut_type,MBD11_ARATH_theta0.2_2023-08-07_b03.npy,MBD11_ARATH.pdb,1-66,1,,Stability
+MET_HUMAN_Estevam_2023,MET_HUMAN_Estevam_2023.csv,MET_HUMAN,Human,Homo sapiens,NPELVQAVQHVVIGPSSLIVHFNEVIGRGHFGCVYHGTLLDNDGKKIHCAVKSLNRITDIGEVSQFLTEGIIMKDFSHPNVLSLLGICLRSEGSPLVVLPYMKHGDLRNFIRNETHNPTVKDLIGFGLQVAKGMKYLASKKFVHRDLAARNCMLDEKFTVKVADFGLARDMYDKEYYSVHNKTGAKLPVKWMALESLQTQKFTTKSDVWSFGVLLWELMTRGAPPYPDVNTFDITVYLLQGRRLLQPEYCPDPLYEVMLKCWHPKAEMRPSFSELVSRISAIFSTFI,287,FALSE,5393,5393,0,-2,manual,Estevam,Conserved regulatory motifs in the juxtamembrane domain and kinase N-lobe revealed through deep mutational scanning of the MET receptor tyrosine kinase domain,2023,10.1101/2023.08.03.551866,1-287,MET RTK,Human cell line with growth linked to kinase activity in the absense of IL-3,Growth,MET_HUMAN_2023-08-07_b09.a2m,1,287,287,0.9,0.2,185885,0.951,273,5338.5,19.5549451,Medium,200,0.73,ex14_scores.csv,IL3_withdrawal_score,1,mutant,MET_HUMAN_theta0.2_2023-08-07_b09.npy,MET_HUMAN.pdb,1-287,1,,Activity
+MK01_HUMAN_Brenan_2016,MK01_HUMAN_Brenan_2016.csv,MK01_HUMAN,Human,Homo sapiens,MAAAAAAGAGPEMVRGQVFDVGPRYTNLSYIGEGAYGMVCSAYDNVNKVRVAIKKISPFEHQTYCQRTLREIKILLRFRHENIIGINDIIRAPTIEQMKDVYIVQDLMETDLYKLLKTQHLSNDHICYFLYQILRGLKYIHSANVLHRDLKPSNLLLNTTCDLKICDFGLARVADPDHDHTGFLTEYVATRWYRAPEIMLNSKGYTKSIDIWSVGCILAEMLSNRPIFPGKHYLDQLNHILGILGSPSQEDLNCIINLKARNYLLSLPHKNKVPWNRLFPNADSKALDLLDKMLTFNPHKRIEVEQALAHPYLEQYYDPSDEPIAEAPFKFDMELDDLPKEKLKELIFEETARFQPGYRS,360,FALSE,6809,6809,0,-8.040790936,median,Brenan,Phenotypic Characterization of a Comprehensive Set of MAPK1 /ERK2 Missense Mutants,2016,10.1016/j.celrep.2016.09.061,2-360,MAPK1,Growth,inhibitor resistance,MK01_HUMAN_full_11-26-2021_b06.a2m,1,360,360,0.6,0.2,124248,0.806,290,8815.9,30.39965517,medium,287,0.9896551724,MK01_HUMAN_Brenan_2016.csv,DOX_Average,-1,mutant,MK01_HUMAN_theta_0.2.npy,MK01_HUMAN.pdb,1-360,0.1,,OrganismalFitness
+MLAC_ECOLI_MacRae_2023,MLAC_ECOLI_MacRae_2023.csv,MLAC_ECOLI,Prokaryote,Escherichia coli,MFKRLMMVALLVIAPLSAATAADQTNPYKLMDEAAQKTFDRLKNEQPQIRANPDYLRTIVDQELLPYVQVKYAGALVLGQYYKSATPAQREAYFAAFREYLKQAYGQALAMYHGQTYQIAPEQPLGDKTIVPIRVTIIDPNGRPPVRLDFQWRKNSQTGNWQAYDMIAEGVSMITTKQNEWGTLLRTKGIDGLTAQLKSISQQKITLEEKK,211,FALSE,4007,4007,0,-0.1041157905,median,MacRae,Protein-protein interactions in the Mla lipid transport system probed by computational structure prediction and deep mutational scanning,2023,10.1016/j.jbc.2023.104744,1-211,MlaC lipid transporter,cell growth in ∆mlaC and selective medium,Growth,MLAC_ECOLI_2023-08-07_b02.a2m,1,211,211,0.2,0.2,22874,0.934,197,7904.3,40.12335025,Medium,126,0.6395939086,MLAC_ECOLI_MacRae_2023.csv,score,1,mutant,MLAC_ECOLI_theta0.2_2023-08-07_b02.npy,MLAC_ECOLI.pdb,1-211,1,,OrganismalFitness
+MSH2_HUMAN_Jia_2020,MSH2_HUMAN_Jia_2020.csv,MSH2_HUMAN,Human,Homo sapiens,MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYTAHGEDALLAAREVFKTQGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKNRAGNKASKENDWYLAYKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGLCEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQRGGILITERKKADFSTKDIYQDLNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELLSDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPLMDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQALEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLKGVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHVIECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT,934,FALSE,16749,16749,0,1,manual,Jia,Massively parallel functional testing of MSH2 missense variants conferring Lynch Syndrome risk,2020,10.1016/j.ajhg.2020.12.003,1-934,MSH2,"drug resistance (surrogate for protein activity, 6-thioguanine (6-TG))",,MSH2_HUMAN_full_11-26-2021_b05.a2m,1,934,934,0.5,0.2,61226,0.901,842,10716.4,12.72731591,medium,1035,1.229216152,MSH2_HUMAN_Jia_2020.csv,LOF score,-1,Variant,MSH2_HUMAN_theta_0.2.npy,MSH2_HUMAN.pdb,1-934,0.1,,OrganismalFitness
+MTH3_HAEAE_RockahShmuel_2015,MTH3_HAEAE_RockahShmuel_2015.csv,MTH3_HAEAE,Prokaryote,Haemophilus aegyptius,MNLISLFSGAGGLDLGFQKAGFRIIAANEYDKSIWKTYESNHSAKLIKGDISKISSDEFPKCDGIIGGPPCQSWSEGGSLRGIDDPRGKLFYEYIRILKQKKPKFFLAENVKGMLAQRHNKAVQEFIQEFDNAGYDVHIILLNANDYGVAQDRKRVFYIGFRKELNINYLPPIPHLIKPTLKDVIWDLKDNPIPALDKNKTNGNKCIYPNHEYFIGSYSTIFMSRNRVRQWNEPAFTVQASGRQCQLHPQAPVMLKVSKNLNKFVEGKEHLYRRLTVRECARVQGFPDDFIFHYESLNDGYKMIGNAVPVNLAYEIAKTIKSALEIRKGN,330,FALSE,1777,1777,0,0.01,manual,Rockah-Shmuel,Systematic Mapping of Protein Mutational Space by Prolonged Drift Reveals the Deleterious Effects of Seemingly Neutral Mutations,2015,10.1371/journal.pcbi.1004421,2-330,DNA methylase HaeIII,Growth,Activity,MTH3_HAEAE_full_11-26-2021_b02.a2m,1,330,330,0.2,0.2,82734,0.891,294,26962.4,91.70884354,medium,582,1.979591837,MTH3_HAEAE_Rockah-Shmuel_2015.csv,Wrel_G17_filtered,1,mutant,MTH3_HAEAE_theta_0.2.npy,MTH3_HAEAE.pdb,1-330,0.1,,OrganismalFitness
+MTHR_HUMAN_Weile_2021,MTHR_HUMAN_Weile_2021.csv,MTHR_HUMAN,Human,Homo sapiens,MVNEARGNSSLNPCLEGSASSGSESSKDSSRCSTPGLDPERHERLREKMRRRLESGDKWFSLEFFPPRTAEGAVNLISRFDRMAAGGPLYIDVTWHPAGDPGSDKETSSMMIASTAVNYCGLETILHMTCCRQRLEEITGHLHKAKQLGLKNIMALRGDPIGDQWEEEEGGFNYAVDLVKHIRSEFGDYFDICVAGYPKGHPEAGSFEADLKHLKEKVSAGADFIITQLFFEADTFFRFVKACTDMGITCPIVPGIFPIQGYHSLRQLVKLSKLEVPQEIKDVIEPIKDNDAAIRNYGIELAVSLCQELLASGLVPGLHFYTLNREMATTEVLKRLGMWTEDPRRPLPWALSAHPKRREEDVRPIFWASRPKSYIYRTQEWDEFPNGRWGNSSSPAFGELKDYYLFYLKSKSPKEELLKMWGEELTSEESVFEVFVLYLSGEPNRNGHKVTCLPWNDEPLAAETSLLKEELLRVNRQGILTINSQPNINGKPSSDPIVGWGPSGGYVFQKAYLEFFTSRETAEALLQVLKKYELRVNYHLVNVKGENITNAPELQPNAVTWGIFPGREIIQPTVVDPVSFMFWKDEAFALWIERWGKLYEEESPSRTIIQYIHDNYFLVNLVDNDFPLDNCLWQVVEDTLELLNRPTQNARETEAP,656,FALSE,12464,12464,0,0.746,median,Weile,Shifting landscapes of human MTHFR missense-variant effects,2021,10.1016/j.ajhg.2021.05.009,1-656,MTHFR reductase,Growth,,MTHR_HUMAN_2023-08-07_b02.a2m,1,656,656,0.2,0.2,4783,0.96,630,614.5,0.9753968254,Low,65,0.1031746032,urn_mavedb_00000049-a-6_scores.csv,score,1,mutant,MTHR_HUMAN_theta0.2_2023-08-07_b02.npy,MTHR_HUMAN.pdb,1-656,1,,OrganismalFitness
+MYO3_YEAST_Tsuboyama_2023_2BTT,MYO3_YEAST_Tsuboyama_2023_2BTT.csv,MYO3_YEAST,Eukaryote,Saccharomyces cerevisiae,KDPKFEAAYDFPGSGSSSELPLKKGDIVFISRDEPSGWSLAKLLDGSKEGWVPTAYMTPYK,61,TRUE,3297,947,2350,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-61,Myosin-3,Stability,cDNA display proteolysis,MYO3_YEAST_2023-08-07_b07.a2m,1,61,61,0.7,0.2,442941,0.885,54,12893.2,238.762963,High,51,0.9444444444,Tsuboyama2023_Dataset2_Dataset24,ddG_ML_float,1,mut_type,MYO3_YEAST_theta0.2_2023-08-07_b07.npy,MYO3_YEAST.pdb,1-61,1,,Stability
+NCAP_I34A1_Doud_2015,NCAP_I34A1_Doud_2015.csv,NCAP_I34A1,Virus,Influenza A virus (strain A/Puerto Rico/8/1934 H1N1),MASQGTKRSYEQMETDGERQNATEIRASVGKMIGGIGRFYIQMCTELKLSDYEGRLIQNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVNGKWMRELILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRTGMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELVRMIKRGINDRNFWRGENGRKTRIAYERMCNILKGKFQTAAQKAMMDQVRESRNPGNAEFEDLTFLARSALILRGSVAHKSCLPACVYGPAVASGYDFEREGYSLVGIDPFRLLQNSQVYSLIRPNENPAHKSQLVWMACHSAAFEDLRVLSFIKGTKVLPRGKLSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQRASAGQISIQPTFSVQRNLPFDRTTIMAAFNGNTEGRTSDMRTEIIRMMESARPEDVSFQGRGVFELSDEKAASPIVPSFDMSNEGSYFFGDNAEEYDN,498,FALSE,9462,9462,0,-2.872717233,median,Doud,Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs,2015,10.1093/molbev/msv167,1-498,Influenza nucleoprotein,,Growth,NCAP_I34A1_theta0.99_full_11-26-2021_b09.a2m,1,498,498,0.9,0.01,15390,1,498,1493.2,2.998393574,medium,2116,4.248995984,NCAP_I34A1_Doud_2015.csv,log_fitness_by_syn_mut_fitness,1,mutant,NCAP_I34A1_theta_0.01.npy,NCAP_I34A1.pdb,1-498,0.1,,OrganismalFitness
+NKX31_HUMAN_Tsuboyama_2023_2L9R,NKX31_HUMAN_Tsuboyama_2023_2L9R.csv,NKX31_HUMAN,Human,Homo sapiens,HSHMSHTQVIELERKFSHQKYLSAPERAHLAKNLKLTETQVKIWFQNRRYKTKRKQLSSEL,61,TRUE,2482,1149,1333,-0.3,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-61,Homeobox protein Nkx-3.1,Stability,cDNA display proteolysis,NKX31_HUMAN_2023-08-07_b04.a2m,1,61,61,0.4,0.2,319273,0.902,55,8440.8,153.4690909,High,27,0.4909090909,Tsuboyama2023_Dataset2_Dataset25,ddG_ML_float,1,mut_type,NKX31_HUMAN_theta0.2_2023-08-07_b04.npy,NKX31_HUMAN.pdb,1-61,1,,Stability
+NPC1_HUMAN_Erwood_2022_HEK293T,NPC1_HUMAN_Erwood_2022_HEK293T.csv,NPC1_HUMAN,Human,Homo sapiens,MTARGLALGLLLLLLCPAQVFSQSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGNVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFYNLLNLFCELTCSPRQSQFLNVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDADACNATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVCGPKPQPPPPPAPWTILGLDAMYVIMWITYMAFLLVFFGAFFAVWCYRKRYFVSEYTPIDSNIAFSVNASDKGEASCCDPVSAAFEGCLRRLFTRWGSFCVRNPGCVIFFSLVFITACSSGLVFVRVTTNPVDLWSAPSSQARLEKEYFDQHFGPFFRTEQLIIRAPLTDKHIYQPYPSGADVPFGPPLDIQILHQVLDLQIAIENITASYDNETVTLQDICLAPLSPYNTNCTILSVLNYFQNSHSVLDHKKGDDFFVYADYHTHFLYCVRAPASLNDTSLLHDPCLGTFGGPVFPWLVLGGYDDQNYNNATALVITFPVNNYYNDTEKLQRAQAWEKEFINFVKNYKNPNLTISFTAERSIEDELNRESDSDVFTVVISYAIMFLYISLALGHMKSCRRLLVDSKVSLGIAGILIVLSSVACSLGVFSYIGLPLTLIVIEVIPFLVLAVGVDNIFILVQAYQRDERLQGETLDQQLGRVLGEVAPSMFLSSFSETVAFFLGALSVMPAVHTFSLFAGLAVFIDFLLQITCFVSLLGLDIKRQEKNRLDIFCCVRGAEDGTSVQASESCLFRFFKNSYSPLLLKDWMRPIVIAIFVGVLSFSIAVLNKVDIGLDQSLSMPDDSYMVDYFKSISQYLHAGPPVYFVLEEGHDYTSSKGQNMVCGGMGCNNDSLVQQIFNAAQLDNYTRIGFAPSSWIDDYFDWVKPQSSCCRVDNITDQFCNASVVDPACVRCRPLTPEGKQRPQGGDFMRFLPMFLSDNPNPKCGKGGHAAYSSAVNILLGHGTRVGATYFMTYHTVLQTSADFIDALKKARLIASNVTETMGINGSAYRVFPYSVFYVFYEQYLTIIDDTIFNLGVSLGAIFLVTMVLLGCELWSAVIMCATIAMVLVNMFGVMWLWGISLNAVSLVNLVMSCGISVEFCSHITRAFTVSMKGSRVERAEEALAHMGSSVFSGITLTKFGGIVVLAFAKSQIFQIFYFRMYLAMVLLGATHGLIFLPVLLSYIGPSVNKAKSCATEERYKGTERERLLNF,1278,FALSE,637,637,0,0.8,manual,Erwood,Saturation variant interpretation using CRISPR prime editing,2022,10.1038/s41587-021-01201-1,347-1190,NPC intracellular cholesterol transporter,Fluorescence measurement,Flow Cytometry Assay,NPC1_HUMAN_2023-10-12_b07.a2m,1,1278,1278,0.7,0.2,6333,0.987,1261,918.9,0.7287073751,Low,137,0.1086439334,41587_2021_1201_MOESM3_ESM.xlsx,Function Score,1,Protein Annotation,NPC1_HUMAN_theta0.2_2023-10-12_b07.npy,NPC1_HUMAN.pdb,1-1278,1,,Activity
+NPC1_HUMAN_Erwood_2022_RPE1,NPC1_HUMAN_Erwood_2022_RPE1.csv,NPC1_HUMAN,Human,Homo sapiens,MTARGLALGLLLLLLCPAQVFSQSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGNVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFYNLLNLFCELTCSPRQSQFLNVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDADACNATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVCGPKPQPPPPPAPWTILGLDAMYVIMWITYMAFLLVFFGAFFAVWCYRKRYFVSEYTPIDSNIAFSVNASDKGEASCCDPVSAAFEGCLRRLFTRWGSFCVRNPGCVIFFSLVFITACSSGLVFVRVTTNPVDLWSAPSSQARLEKEYFDQHFGPFFRTEQLIIRAPLTDKHIYQPYPSGADVPFGPPLDIQILHQVLDLQIAIENITASYDNETVTLQDICLAPLSPYNTNCTILSVLNYFQNSHSVLDHKKGDDFFVYADYHTHFLYCVRAPASLNDTSLLHDPCLGTFGGPVFPWLVLGGYDDQNYNNATALVITFPVNNYYNDTEKLQRAQAWEKEFINFVKNYKNPNLTISFTAERSIEDELNRESDSDVFTVVISYAIMFLYISLALGHMKSCRRLLVDSKVSLGIAGILIVLSSVACSLGVFSYIGLPLTLIVIEVIPFLVLAVGVDNIFILVQAYQRDERLQGETLDQQLGRVLGEVAPSMFLSSFSETVAFFLGALSVMPAVHTFSLFAGLAVFIDFLLQITCFVSLLGLDIKRQEKNRLDIFCCVRGAEDGTSVQASESCLFRFFKNSYSPLLLKDWMRPIVIAIFVGVLSFSIAVLNKVDIGLDQSLSMPDDSYMVDYFKSISQYLHAGPPVYFVLEEGHDYTSSKGQNMVCGGMGCNNDSLVQQIFNAAQLDNYTRIGFAPSSWIDDYFDWVKPQSSCCRVDNITDQFCNASVVDPACVRCRPLTPEGKQRPQGGDFMRFLPMFLSDNPNPKCGKGGHAAYSSAVNILLGHGTRVGATYFMTYHTVLQTSADFIDALKKARLIASNVTETMGINGSAYRVFPYSVFYVFYEQYLTIIDDTIFNLGVSLGAIFLVTMVLLGCELWSAVIMCATIAMVLVNMFGVMWLWGISLNAVSLVNLVMSCGISVEFCSHITRAFTVSMKGSRVERAEEALAHMGSSVFSGITLTKFGGIVVLAFAKSQIFQIFYFRMYLAMVLLGATHGLIFLPVLLSYIGPSVNKAKSCATEERYKGTERERLLNF,1278,FALSE,63,63,0,0.8,manual,Erwood,Saturation variant interpretation using CRISPR prime editing,2022,10.1038/s41587-021-01201-1,420-920,NPC intracellular cholesterol transporter,Fluorescence measurement,Flow Cytometry Assay,NPC1_HUMAN_2023-10-12_b07.a2m,1,1278,1278,0.7,0.2,6333,0.987,1261,918.9,0.7287073751,Low,137,0.1086439334,41587_2021_1201_MOESM3_ESM.xlsx,Function Score,1,Protein Annotation,NPC1_HUMAN_theta0.2_2023-10-12_b07.npy,NPC1_HUMAN.pdb,1-1278,1,,Activity
+NRAM_I33A0_Jiang_2016,NRAM_I33A0_Jiang_2016.csv,NRAM_I33A0,Virus,Influenza A virus (strain A/Wilson-Smith/1933 H1N1),MNPNQKIITIGSICMVVGIISLILQIGNIISIWISHSIQTGNQNHTGICNQGIITYNVVAGQDSTSVILTGNSSLCPIRGWAIHSKDNGIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLNDKHSNGTVKDRSPYRALMSCPVGEAPSPYNSRFESVAWSASACHDGMGWLTIGISGPDNGAVAVLKYNGIITETIKSWRKKILRTQESECTCVNGSCFTIMTDGPSNGLASYKIFKIEKGKVTKSIELNAPNSHYEECSCYPDTGKVMCVCRDNWHGSNRPWVSFDQNLDYQIGYICSGVFGDNPRPKDGPGSCGPVSADGANGVKGFSYRYGNGVWIGRTKSDSSRHGFEMIWDPNGWTETDSRFSVRQDVVAMTDRSGYSGSFVQHPELTGLDCMRPCFWVELIRGRPEEETIWTSGSIISFCGVNSDTVDWSWPDGAELPFTIDK,453,FALSE,298,298,0,-0.7772013612,median,Jiang,A Balance between Inhibitor Binding and Substrate Processing Confers Influenza Drug Resistance,2016,10.1016/j.jmb.2015.11.027,67-285,Influenza neuraminidase,,Growth,NRAM_I33A0_full_11-26-2021_b01.a2m,1,453,453,0.1,0.01,47174,0.976,442,33.1,0.07488687783,low,0,0,NRAM_I33A0_Jiang_2016.csv,Standard Conditions,1,mutant,NRAM_I33A0_theta_0.01.npy,NRAM_I33A0.pdb,1-453,0.1,,OrganismalFitness
+NUD15_HUMAN_Suiter_2020,NUD15_HUMAN_Suiter_2020.csv,NUD15_HUMAN,Human,Homo sapiens,MTASAQPRGRRPGVGVGVVVTSCKHPRCVLLGKRKGSVGAGSFQLPGGHLEFGETWEECAQRETWEEAALHLKNVHFASVVNSFIEKENYHYVTILMKGEVDVTHDSEPKNVEPEKNESWEWVPWEELPPLDQLFWGLRCLKEQGYDPFKEDLNHLVGYKGNHL,164,FALSE,2844,2844,0,0.25,manual,Suiter,Massively parallel variant characterization identifies NUDT15 alleles associated with thiopurine toxicity,2020,10.1073/pnas.1915680117,2-164,NUDT15,,"VAMP-seq, drug sensitivity",NUD15_HUMAN_full_11-26-2021_b04.a2m,1,164,164,0.4,0.2,153922,0.72,118,43847.8,371.5915254,high,151,1.279661017,NUD15_HUMAN_Suiter_2020.csv,Final NUDT15 activity Score,1,mutant,NUD15_HUMAN_theta_0.2.npy,NUD15_HUMAN.pdb,1-164,0.1,,Expression
+NUSA_ECOLI_Tsuboyama_2023_1WCL,NUSA_ECOLI_Tsuboyama_2023_1WCL.csv,NUSA_ECOLI,Prokaryote,Escherichia coli,EAHAAIDTFTKYLDIDEDFATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAKNALATIAQ,69,TRUE,2028,1306,722,-1.318069467,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-69,Transcription termination/antitermination protein NusA,Stability,cDNA display proteolysis,NUSA_ECOLI_2023-08-07_b03.a2m,1,69,69,0.3,0.2,205612,0.812,56,39002.5,696.4732143,High,38,0.6785714286,Tsuboyama2023_Dataset2_Dataset26,ddG_ML_float,1,mut_type,NUSA_ECOLI_theta0.2_2023-08-07_b03.npy,NUSA_ECOLI.pdb,1-69,1,,Stability
+NUSG_MYCTU_Tsuboyama_2023_2MI6,NUSG_MYCTU_Tsuboyama_2023_2MI6.csv,NUSG_MYCTU,Prokaryote,Mycobacterium tuberculosis,DYEVGESVTVMDGPFATLPATISEVNAEQQKLKVLVSIFGRETPVELTFGQVSKI,55,TRUE,1380,1019,361,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,Transcription termination/antitermination protein NusG,Stability,cDNA display proteolysis,NUSG_MYCTU_2023-08-07_b03.a2m,1,55,55,0.3,0.2,102004,0.964,53,16625.7,313.6924528,High,41,0.7735849057,Tsuboyama2023_Dataset2_Dataset27,ddG_ML_float,1,mut_type,NUSG_MYCTU_theta0.2_2023-08-07_b03.npy,NUSG_MYCTU.pdb,1-55,1,,Stability
+OBSCN_HUMAN_Tsuboyama_2023_1V1C,OBSCN_HUMAN_Tsuboyama_2023_1V1C.csv,OBSCN_HUMAN,Human,Homo sapiens,FDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRWLVRTKPTKSSPSRQGWVSPAYLDRRL,65,TRUE,3197,1213,1984,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-65,Obscurin,Stability,cDNA display proteolysis,OBSCN_HUMAN_2023-08-07_b02.a2m,1,65,65,0.2,0.2,718751,0.815,53,23710.7,447.3716981,High,54,1.018867925,Tsuboyama2023_Dataset2_Dataset28,ddG_ML_float,1,mut_type,OBSCN_HUMAN_theta0.2_2023-08-07_b02.npy,OBSCN_HUMAN.pdb,1-65,1,,Stability
+ODP2_GEOSE_Tsuboyama_2023_1W4G,ODP2_GEOSE_Tsuboyama_2023_1W4G.csv,ODP2_GEOSE,Prokaryote,Geobacillus stearothermophilus,NRRVIAMPSVRKWAREKGVDIRLVQGTGKNGRVLKEDIDAFLAG,44,TRUE,1134,669,465,-0.4168227551,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-44,Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex,Stability,cDNA display proteolysis,ODP2_GEOSE_2023-08-07_b07.a2m,1,44,44,0.7,0.2,163835,0.909,40,14834.6,370.865,High,21,0.525,Tsuboyama2023_Dataset2_Dataset29,ddG_ML_float,1,mut_type,ODP2_GEOSE_theta0.2_2023-08-07_b07.npy,ODP2_GEOSE.pdb,1-44,1,,Stability
+OPSD_HUMAN_Wan_2019,OPSD_HUMAN_Wan_2019.csv,OPSD_HUMAN,Human,Homo sapiens,MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA,348,FALSE,165,165,0,0.5795144905,median,Wan,Characterizing variants of unknown significance in rhodopsin: A functional genomics approach,2019,10.1002/humu.23762,4-347,Rhodopsin,Expression,Flow Cytometry Assay,OPSD_HUMAN_2023-10-12_b04.a2m,1,348,348,0.4,0.2,342311,0.876,305,36900.5,120.9852459,High,247,0.8098360656,urn_mavedb_00000099-a-1_scores.csv,score,1,mutant,OPSD_HUMAN_theta0.2_2023-10-12_b04.npy,OPSD_HUMAN.pdb,1-348,1,,Expression
+OTC_HUMAN_Lo_2023,OTC_HUMAN_Lo_2023.csv,OTC_HUMAN,Human,Homo sapiens,MLFNLRILLNNAAFRNGHNFMVRNFRCGQPLQNKVQLKGRDLLTLKNFTGEEIKYMLWLSADLKFRIKQKGEYLPLLQGKSLGMIFEKRSTRTRLSTETGFALLGGHPCFLTTQDIHLGVNESLTDTARVLSSMADAVLARVYKQSDLDTLAKEASIPIINGLSDLYHPIQILADYLTLQEHYSSLKGLTLSWIGDGNNILHSIMMSAAKFGMHLQAATPKGYEPDASVTKLAEQYAKENGTKLLLTNDPLEAAHGGNVLITDTWISMGQEEEKKKRLQAFQGYQVTMKTAKVAASDWTFLHCLPRKPEEVDDEVFYSPRSLVFPEAENRKWTIMAVMVSLLTDYSPQLQKPKF,354,FALSE,1570,1570,0,0.417,median,Lo,"The functional impact of 1,570 individual amino acid substitutions in human OTC",2023,10.1016/j.ajhg.2023.03.019,33-354,OTC,Enzymatic activity,,OTC_HUMAN_2023-08-07_b02.a2m,1,354,354,0.2,0.2,135607,0.87,308,18646.2,60.53961039,Medium,641,2.081168831,urn_mavedb_00000112-a-1_scores.csv,DMS_score,1,mutant,OTC_HUMAN_theta0.2_2023-08-07_b02.npy,OTC_HUMAN.pdb,1-354,1,,Activity
+OTU7A_HUMAN_Tsuboyama_2023_2L2D,OTU7A_HUMAN_Tsuboyama_2023_2L2D.csv,OTU7A_HUMAN,Human,Homo sapiens,TLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAALSDYEQ,42,FALSE,635,635,0,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-42,OTU domain-containing protein 7A,Stability,cDNA display proteolysis,OTU7A_HUMAN_2023-08-07_b02.a2m,1,42,42,0.2,0.2,1359071,0.881,37,514715.2,13911.22162,High,28,0.7567567568,Tsuboyama2023_Dataset2_Dataset30,ddG_ML_float,1,mut_type,OTU7A_HUMAN_theta0.2_2023-08-07_b02.npy,OTU7A_HUMAN.pdb,1-42,1,,Stability
+OXDA_RHOTO_Vanella_2023_activity,OXDA_RHOTO_Vanella_2023_activity.csv,OXDA_RHOTO,Eukaryote,Rhodotorula gracilis,HSQKRVVVLGSGVIGLSSALILARKGYSVHILARDLPEDVSSQTFASPWAGANWTPFMTLTDGPRQAKWEESTFKKWVELVPTGHAMWLKGTRRFAQNEDGLLGHWYKDITPNYRPLPSSECPPGAIGVTYDTLSVHAPKYCQYLARELQKLGATFERRTVTSLEQAFDGADLVVNATGLGAKSIAGIDDQAAEPIRGQTVLVKSPCKRCTMDSSDPASPAYIIPRPGGEVICGGTYGVGDWDLSVNPETVQRILKHCLRLDPTISSDGTIEGIEVLRHNVGLRPARRGGPRVEAERIVLPLDRTKSPLSLGRGSARAAKEKEVTLVHAYGFSSAGYQQSWGAAEDVAQLVDEAFQRYHGAARE,364,FALSE,6396,6396,0,-0.2,manual,Vanella,Understanding Activity-Stability Tradeoffs in Biocatalysts by Enzyme Proximity Sequencing,2023,10.1101/2023.02.24.529916,1-364,D-amino acid oxidase (DAOx),fluorescent label of enzyme product,FACS,OXDA_RHOTO_2023-08-07_b02.a2m,1,364,364,0.2,0.2,520184,0.876,319,98000.4,307.2112853,High,892,2.796238245,Figure_2.xlsx,activity fitness,1,mutant,OXDA_RHOTO_theta0.2_2023-08-07_b02.npy,OXDA_RHOTO.pdb,1-364,1,,Activity
+OXDA_RHOTO_Vanella_2023_expression,OXDA_RHOTO_Vanella_2023_expression.csv,OXDA_RHOTO,Eukaryote,Rhodotorula gracilis,HSQKRVVVLGSGVIGLSSALILARKGYSVHILARDLPEDVSSQTFASPWAGANWTPFMTLTDGPRQAKWEESTFKKWVELVPTGHAMWLKGTRRFAQNEDGLLGHWYKDITPNYRPLPSSECPPGAIGVTYDTLSVHAPKYCQYLARELQKLGATFERRTVTSLEQAFDGADLVVNATGLGAKSIAGIDDQAAEPIRGQTVLVKSPCKRCTMDSSDPASPAYIIPRPGGEVICGGTYGVGDWDLSVNPETVQRILKHCLRLDPTISSDGTIEGIEVLRHNVGLRPARRGGPRVEAERIVLPLDRTKSPLSLGRGSARAAKEKEVTLVHAYGFSSAGYQQSWGAAEDVAQLVDEAFQRYHGAARE,364,FALSE,6769,6769,0,-0.2,manual,Vanella,Understanding Activity-Stability Tradeoffs in Biocatalysts by Enzyme Proximity Sequencing,2023,10.1101/2023.02.24.529916,1-364,D-amino acid oxidase (DAOx),cell surface expression,FACS,OXDA_RHOTO_2023-08-07_b02.a2m,1,364,364,0.2,0.2,520184,0.876,319,98000.4,307.2112853,High,892,2.796238245,Figure_2.xlsx,expression fitness,1,mutant,OXDA_RHOTO_theta0.2_2023-08-07_b02.npy,OXDA_RHOTO.pdb,1-364,1,,Expression
+P53_HUMAN_Giacomelli_2018_Null_Etoposide,P53_HUMAN_Giacomelli_2018_Null_Etoposide.csv,P53_HUMAN,Human,Homo sapiens,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPRVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,FALSE,7467,7467,0,-0.5,manual,Giacomelli,Mutational processes shape the landscape of TP53 mutations in human cancer,2018,10.1038/s41588-018-0204-y,1-393,p53,"drug resistance (nutlin-3, etoposide)",Growth,P53_HUMAN_full_04-29-2022_b09.a2m,1,393,393,0.9,0.2,5069,0.858,337,153.2,0.4545994065,low,7,0.02077151335,P53_HUMAN_Giacomelli_2018.csv,A549_p53NULL_Etoposide_Z-score,1,Allele,P53_HUMAN_theta_0.2.npy,P53_HUMAN.pdb,1-393,0.1,,OrganismalFitness
+P53_HUMAN_Giacomelli_2018_Null_Nutlin,P53_HUMAN_Giacomelli_2018_Null_Nutlin.csv,P53_HUMAN,Human,Homo sapiens,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPRVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,FALSE,7467,7467,0,0.04438920187,median,Giacomelli,Mutational processes shape the landscape of TP53 mutations in human cancer,2018,10.1038/s41588-018-0204-y,1-393,p53,"drug resistance (nutlin-3, etoposide)",Growth,P53_HUMAN_full_04-29-2022_b09.a2m,1,393,393,0.9,0.2,5069,0.858,337,153.2,0.4545994065,low,7,0.02077151335,P53_HUMAN_Giacomelli_2018.csv,A549_p53NULL_Nutlin-3_Z-score,-1,Allele,P53_HUMAN_theta_0.2.npy,P53_HUMAN.pdb,1-393,0.1,,OrganismalFitness
+P53_HUMAN_Giacomelli_2018_WT_Nutlin,P53_HUMAN_Giacomelli_2018_WT_Nutlin.csv,P53_HUMAN,Human,Homo sapiens,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPRVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,FALSE,7467,7467,0,-1,manual,Giacomelli,Mutational processes shape the landscape of TP53 mutations in human cancer,2018,10.1038/s41588-018-0204-y,1-393,p53,"drug resistance (nutlin-3, etoposide)",Growth,P53_HUMAN_full_04-29-2022_b09.a2m,1,393,393,0.9,0.2,5069,0.858,337,153.2,0.4545994065,low,7,0.02077151335,P53_HUMAN_Giacomelli_2018.csv,A549_p53WT_Nutlin-3_Z-score,-1,Allele,P53_HUMAN_theta_0.2.npy,P53_HUMAN.pdb,1-393,0.1,,OrganismalFitness
+P53_HUMAN_Kotler_2018,P53_HUMAN_Kotler_2018.csv,P53_HUMAN,Human,Homo sapiens,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,FALSE,1048,1048,0,1,manual,Kotler,A Systematic p53 Mutation Library Links Differential Functional Impact to Cancer Mutation Pattern and Evolutionary Conservation,2018,10.1016/j.molcel.2018.06.012,102-292,p53,growth,Growth,P53_HUMAN_full_11-26-2021_b09.a2m,1,393,393,0.9,0.2,4129,0.863,339,148,0.4365781711,low,15,0.04424778761,P53_HUMAN_Kotler_2018.csv,RFS_H1299,-1,mutant,P53_HUMAN_Kotler_theta_0.2.npy,P53_HUMAN.pdb,1-393,0.1,,OrganismalFitness
+P84126_THETH_Chan_2017,P84126_THETH_Chan_2017.csv,P84126_THETH,Prokaryote,Thermus thermophilus,MRPDLSRVPGVLGEIARKRASEVAPYPLPEPPSVPSFKEALLRPGLSVIAEVKRQSPSEGLIREVDPVEAALAYARGGARAVSVLTEPHRFGGSLLDLKRVREAVDLPLLRKDFVVDPFMLEEARAFGASAALLIVALLGELTGAYLEEARRLGLEALVEVHTERELEIALEAGAEVLGINNRDLATLHINLETAPRLGRLARKRGFGGVLVAESGYSRKEELKALEGLFDAVLIGTSLMRAPDLEAALRELVG,254,FALSE,1519,1519,0,-0.5,manual,Chan,Correlation of fitness landscapes from three orthologous TIM barrels originates from sequence and structure constraints,2017,10.1038/ncomms14614,44-238,TIM Barrell (T. thermophilus),fitness,Growth,P84126_THETH_full_11-26-2021_b04.a2m,1,254,254,0.4,0.2,53441,0.941,239,10704.6,44.78912134,medium,390,1.631799163,P84126_THETH_Chan_2017.csv,fitness,1,mutant,P84126_THETH_theta_0.2.npy,P84126_THETH.pdb,1-254,0.1,,OrganismalFitness
+PA_I34A1_Wu_2015,PA_I34A1_Wu_2015.csv,PA_I34A1,Virus,Influenza A virus (strain A/Puerto Rico/8/1934 H1N1),MEDFVRQCFNPMIVELAEKAMKEYGEDLKIETNKFAAICTHLEVCFMYSDFHFIDEQGESIVVELGDPNALLKHRFEIIEGRDRTIAWTVVNSICNTTGAEKPKFLPDLYDYKKNRFIEIGVTRREVHIYYLEKANKIKSEKTHIHIFSFTGEEMATKADYTLDEESRARIKTRLFTIRQEMASRGLWDSFRQSERGEETIEERFEITGTMRKLADQSLPPNFSSLEKFRAYVDGFEPNGYIEGKLSQMSKEVNARIEPFLKSTPRPLRLPDGPPCSQRSKFLLMDALKLSIEDPSHEGEGIPLYDAIKCMRTFFGWKEPNVVKPHEKGINPNYLLSWKQVLAELQDIENEEKIPRTKNMKKTSQLKWALGENMAPEKVDFDDCKDVGDLKQYDSDEPELRSLASWIQNEFNKACELTDSSWIELDEIGEDAAPIEHIASMRRNYFTAEVSHCRATEYIMKGVYINTALLNASCAAMDDFQLIPMISKCRTKEGRRKTNLYGFIIKGRSHLRNDTDVVNFVSMEFSLTDPRLEPHKWEKYCVLEVGDMLLRSAIGHVSRPMFLYVRTNGTSKIKMKWGMEMRRCLLQSLQQIESMIEAESSVKEKDMTKEFFENKSETWPVGESPKGVEEGSIGKVCRTLLAKSVFNSLYASPQLEGFSAESRKLLLIVQALRDNLEPGTFDLGGLYEAIEECLINDPWVLLNASWFNSFLTHALR,716,FALSE,1820,1820,0,0.290683953,median,Wu,Functional Constraint Profiling of a Viral Protein Reveals Discordance of Evolutionary Conservation and Functionality,2015,10.1371/journal.pgen.1005310,8-716,Influenza polymerase acidic protein,Viral replication,Growth,PA_I34A1_full_theta0.99_04-29-2022_b09.a2m,1,716,716,0.9,0.01,26750,1,716,1608,2.245810056,medium,3706,5.175977654,PA_I34A1_Wu_2015.csv,RF_index,1,mutant,PA_I34A1_theta_0.01.npy,PA_I34A1.pdb,1-716,0.1,,OrganismalFitness
+PABP_YEAST_Melamed_2013,PABP_YEAST_Melamed_2013.csv,PABP_YEAST,Eukaryote,Saccharomyces cerevisiae,MADITDKTAEQLENLNIQDDQKQAATGSESQSVENSSASLYVGDLEPSVSEAHLYDIFSPIGSVSSIRVCRDAITKTSLGYAYVNFNDHEAGRKAIEQLNYTPIKGRLCRIMWSQRDPSLRKKGSGNIFIKNLHPDIDNKALYDTFSVFGDILSSKIATDENGKSKGFGFVHFEEEGAAKEAIDALNGMLLNGQEIYVAPHLSRKERDSQLEETKAHYTNLYVKNINSETTDEQFQELFAKFGPIVSASLEKDADGKLKGFGFVNYEKHEDAVKAVEALNDSELNGEKLYVGRAQKKNERMHVLKKQYEAYRLEKMAKYQGVNLFVKNLDDSVDDEKLEEEFAPYGTITSAKVMRTENGKSKGFGFVCFSTPEEATKAITEKNQQIVAGKPLYVAIAQRKDVRRSQLAQQIQARNQMRYQQATAAAAAAAAGMPGQFMPPMFYGVMPPRGVPFNGPNPQQMNPMGGMPKNGMPPQFRNGPVYGVPPQGGFPRNANDNNQFYQQKQRQALGEQLYKKVSAKTSNEEAAGKITGMILDLPPQEVFPLLESDELFEQHYKEASAAYESFKKEQEQQTEQA,577,TRUE,37708,1187,36521,0.3,manual,Melamed,Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein,2013,10.1261/rna.040709.113,126-200,PAB1,"Growth (essential function), RNA binding",Growth,PABP_YEAST_full_11-26-2021_b07.a2m,1,577,577,0.7,0.2,7866,0.919,530,855.1,1.613396226,medium,83,0.1566037736,PABP_YEAST_Melamed_2013.csv,linear,1,mutant,PABP_YEAST_theta_0.2.npy,PABP_YEAST.pdb,1-577,0.1,,OrganismalFitness
+PAI1_HUMAN_Huttinger_2021,PAI1_HUMAN_Huttinger_2021.csv,PAI1_HUMAN,Human,Homo sapiens,MQMSPALTCLVLGLALVFGEGSAVHHPPSYVAHLASDFGVRVFQQVAQASKDRNVVFSPYGVASVLAMLQLTTGGETQQQIQAAMGFKIDDKGMAPALRHLYKELMGPWNKDEISTTDAIFVQRDLKLVQGFMPHFFRLFRSTVKQVDFSEVERARFIINDWVKTHTKGMISNLLGKGAVDQLTRLVLVNALYFNGQWKTPFPDSSTHRRLFHKSDGSTVSVPMMAQTNKFNYTEFTTPDGHYYDILELPYHGDTLSMFIAAPYEKEVPLSALTNILSAQLISHWKGNMTRLPRLLVLPKFSLETEVDLRKPLENLGMTDMFRQFQADFTSLSDQEPLHVAQALQKVKIEVNESGTVASSSTAVIVSARMAPEEIIMDRPFLFVVRHNPTGTVLFMGQVMEP,402,FALSE,5345,5345,0,0.029313547,median,Huttinger,Deep mutational scanning of the plasminogen activator inhibitor-1 functional landscape,2021,10.1038/s41598-021-97871-7,24-402,"PAI-1, SERPINE1",PAI-1 inhibition of uPA,phage fitness,PAI1_HUMAN_2023-10-12_b05.a2m,1,402,402,0.5,0.2,52528,,,,,,,,PAI1_HUMAN_Huttinger_2021,log2FoldChange,1,mutation,PAI1_HUMAN_theta0.2_2023-10-12_b05.npy,PAI1_HUMAN.pdb,1-402,1,,Activity
+PHOT_CHLRE_Chen_2023,PHOT_CHLRE_Chen_2023.csv,PHOT_CHLRE,Eukaryote,Chlamydomonas reinhardtii,AGLRHTFVVADATLPDCPLVYASEGFYAMTGYGPDEVLGHNARFLQGEGTDPKEVQKIRDAIKKGEACSVRLLNYRKDGTPFWNLLTVTPIKTPDGRVSKFVGVQVDVTSKTEGKALA,118,TRUE,167529,2122,165407,0.6317018878,median,Chen,Deep Mutational Scanning of an Oxygen-Independent Fluorescent Protein CreiLOV for Comprehensive Profiling of Mutational and Epistatic Effects,2023,10.1021/acssynbio.2c00662,1-118,Phototropin,Fluorescence,FACS,PHOT_CHLRE_2023-08-07_b02.a2m,1,118,118,0.2,0.2,1627150,0.873,103,610128.5,5923.57767,High,232,2.252427184,sb2c00662_si_001.xlsx,mean,1,mutant,PHOT_CHLRE_theta0.2_2023-08-07_b02.npy,PHOT_CHLRE.pdb,1-118,1,,Activity
+PIN1_HUMAN_Tsuboyama_2023_1I6C,PIN1_HUMAN_Tsuboyama_2023_1I6C.csv,PIN1_HUMAN,Human,Homo sapiens,KLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSG,39,TRUE,802,686,116,-0.6844420472,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-39,Peptidyl-prolyl cis-trans isomerase NIMA-interacting 1,Stability,cDNA display proteolysis,PIN1_HUMAN_2023-08-07_b02.a2m,1,39,39,0.2,0.2,248269,0.821,32,10833.2,338.5375,High,13,0.40625,Tsuboyama2023_Dataset2_Dataset31,ddG_ML_float,1,mut_type,PIN1_HUMAN_theta0.2_2023-08-07_b02.npy,PIN1_HUMAN.pdb,1-39,1,,Stability
+PITX2_HUMAN_Tsuboyama_2023_2L7M,PITX2_HUMAN_Tsuboyama_2023_2L7M.csv,PITX2_HUMAN,Human,Homo sapiens,THFTSQQLQELEATFQRNHYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWR,52,TRUE,1824,938,886,-1.201366007,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-52,Pituitary homeobox 2,Stability,cDNA display proteolysis,PITX2_HUMAN_2023-08-07_b04.a2m,1,52,52,0.4,0.2,344174,1,52,9819.6,188.8384615,High,25,0.4807692308,Tsuboyama2023_Dataset2_Dataset32,ddG_ML_float,1,mut_type,PITX2_HUMAN_theta0.2_2023-08-07_b04.npy,PITX2_HUMAN.pdb,1-52,1,,Stability
+PKN1_HUMAN_Tsuboyama_2023_1URF,PKN1_HUMAN_Tsuboyama_2023_1URF.csv,PKN1_HUMAN,Human,Homo sapiens,GIPATNLSRVAGLEKQLAIELKVKQGAENMIQTYSNGSTKDRKLLLTAQQMLQDSKTKIDIIRMQLRRALQ,71,FALSE,1301,1301,0,-0.5,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-71,Serine/threonine-protein kinase N1,Stability,cDNA display proteolysis,PKN1_HUMAN_2023-08-07_b01.a2m,1,71,71,0.1,0.2,187829,0.845,60,53755.8,895.93,High,13,0.2166666667,Tsuboyama2023_Dataset2_Dataset33,ddG_ML_float,1,mut_type,PKN1_HUMAN_theta0.2_2023-08-07_b01.npy,PKN1_HUMAN.pdb,1-71,1,,Stability
+POLG_CXB3N_Mattenberger_2021,POLG_CXB3N_Mattenberger_2021.csv,POLG_CXB3N,Virus,Coxsackievirus B3 (strain Nancy),MGAQVSTQKTGAHETRLNASGNSIIHYTNINYYKDAASNSANRQDFTQDPGKFTEPVKDIMIKSLPALNSPTVEECGYSDRARSITLGNSTITTQECANVVVGYGVWPDYLKDSEATAEDQPTQPDVATCRFYTLDSVQWQKTSPGWWWKLPDALSNLGLFGQNMQYHYLGRTGYTVHVQCNASKFHQGCLLVVCVPEAEMGCATLDNTPSSAELLGGDSAKEFADKPVASGSNKLVQRVVYNAGMGVGVGNLTIFPHQWINLRTNNSATIVMPYTNSVPMDNMFRHNNVTLMVIPFVPLDYCPGSTTYVPITVTIAPMCAEYNGLRLAGHQGLPTMNTPGSCQFLTSDDFQSPSAMPQYDVTPEMRIPGEVKNLMEIAEVDSVVPVQNVGEKVNSMEAYQIPVRSNEGSGTQVFGFPLQPGYSSVFSRTLLGEILNYYTHWSGSIKLTFMFCGSAMATGKFLLAYSPPGAGAPTKRVDAMLGTHVIWDVGLQSSCVLCIPWISQTHYRFVASDEYTAGGFITCWYQTNIVVPADAQSSCYIMCFVSACNDFSVRLLKDTPFISQQNFFQGPVEDAITAAIGRVADTVGTGPTNSEAIPALTAAETGHTSQVVPGDTMQTRHVKNYHSRSESTIENFLCRSACVYFTEYKNSGAKRYAEWVLTPRQAAQLRRKLEFFTYVRFDLELTFVITSTQQPSTTQNQDAQILTHQIMYVPPGGPVPDKVDSYVWQTSTNPSVFWTEGNAPPRMSIPFLSIGNAYSNFYDGWSEFSRNGVYGINTLNNMGTLYARHVNAGSTGPIKSTIRIYFKPKHVKAWIPRPPRLCQYEKAKNVNFQPSGVTTTRQSITTMTNTGAFGQQSGAVYVGNYRVVNRHLATSADWQNCVWESYNRDLLVSTTTAHGCDIIARCQCTTGVYFCASKNKHYPISFEGPGLVEVQESEYYPRRYQSHVLLAAGFSEPGDCGGILRCEHGVIGIVTMGGEGVVGFADIRDLLWLEDDAMEQGVKDYVEQLGNAFGSGFTNQICEQVNLLKESLVGQDSILEKSLKALVKIISALVIVVRNHDDLITVTATLALIGCTSSPWRWLKQKVSQYYGIPMAERQNNSWLKKFTEMTNACKGMEWIAVKIQKFIEWLKVKILPEVREKHEFLNRLKQLPLLESQIATIEQSAPSQSDQEQLFSNVQYFAHYCRKYAPLYAAEAKRVFSLEKKMSNYIQFKSKCRIEPVCLLLHGSPGAGKSVATNLIGRSLAEKLNSSVYSLPPDPDHFDGYKQQAVVIMDDLCQNPDGKDVSLFCQMVSSVDFVPPMAALEEKGILFTSPFVLASTNAGSINAPTVSDSRALARRFHFDMNIEVISMYSQNGKINMPMSVKTCDDECCPVNFKKCCPLVCGKAIQFIDRRTQVRYSLDMLVTEMFREYNHRHSVGTTLEALFQGPPVYREIKISVAPETPPPPAIADLLKSVDSEAVREYCKEKGWLVPEINSTLQIEKHVSRAFICLQALTTFVSVAGIIYIIYKLFAGFQGAYTGVPNQKPRVPTLRQAKVQGPAFEFAVAMMKRNSSTVKTEYGEFTMLGIYDRWAVLPRHAKPGPTILMNDQEVGVLDAKELVDKDGTNLELTLLKLNRNEKFRDIRGFLAKEEVEVNEAVLAINTSKFPNMYIPVGQVTEYGFLNLGGTPTKRMLMYNFPTRAGQCGGVLMSTGKVLGIHVGGNGHQGFSAALLKHYFNDEQGEIEFIESSKDAGFPVINTPSKTKLEPSVFHQVFEGNKEPAVLRSGDPRLKANFEEAIFSKYIGNVNTHVDEYMLEAVDHYAGQLATLDISTEPMKLEDAVYGTEGLEALDLTTSAGYPYVALGIKKRDILSKKTKDLTKLKECMDKYGLNLPMVTYVKDELRSIEKVAKGKSRLIEASSLNDSVAMRQTFGNLYKTFHLNPGVVTGSAVGCDPDLFWSKIPVMLDGHLIAFDYSGYDASLSPVWFACLKMLLEKLGYTHKETNYIDYLCNSHHLYRDKHYFVRGGMPSGCSGTSIFNSMINNIIIRTLMLKVYKGIDLDQFRMIAYGDDVIASYPWPIDASLLAEAGKGYGLIMTPADKGECFNEVTWTNATFLKRYFRADEQYPFLVHPVMPMKDIHESIRWTKDPKNTQDHVRSLCLLAWHNGEHEYEEFIRKIRSVPVGRCLTLPAFSTLRRKWLDSF,2185,FALSE,15711,15711,0,-2.76355725,median,Mattenberger,Globally defining the effects of mutations in a picornavirus capsid,2021,10.7554/eLife.64256,1-851,Picornavirus capsid,Viral replication,Growth,POLG_CXB3N_1-861_theta0.99_04-29-2022_b07.a2m,1,861,861,0.7,0.01,7909,0.959,826,1515.2,1.834382567,medium,94,0.1138014528,POLG_CXB3N_Mattenberger_2021.csv,log_fitness_by_syn_mut_fitness,1,mutant,POLG_CXB3N_theta_0.01.npy,POLG_CXB3N.pdb,1-2185,0.1,,OrganismalFitness
+POLG_DEN26_Suphatrakul_2023,POLG_DEN26_Suphatrakul_2023.csv,POLG_DEN26,Virus,Dengue virus type 2 (strain Thailand/16681/1984) (DENV-2),GTGNIGETLGEKWKSRLNALGKSEFQIYKKSGIQEVDRTLAKEGIKRGETDHHAVSRGSAKLRWFVERNMVTPEGKVVDLGCGRGGWSYYCGGLKNVREVKGLTKGGPGHEEPIPMSTYGWNLVRLQSGVDVFFIPPEKCDTLLCDIGESSPNPTVEAGRTLRVLNLVENWLNNNTQFCIKVLNPYMPSVIEKMEALQRKYGGALVRNPLSRNSTHEMYWVSNASGNIVSSVNMISRMLINRFTMRYKKATYEPDVDLGSGTRNIGIESEIPNLDIIGKRIEKIKQEHETSWHYDQDHPYKTWAYHGSYETKQTGSASSMVNGVVRLLTKPWDVVPMVTQMAMTDTTPFGQQRVFKEKVDTRTQEPKEGTKKLMKITAEWLWKELGKKKTPRMCTREEFTRKVRSNAALGAIFTDENKWKSAREAVEDSRFWELVDKERNLHLEGKCETCVYNMMGKREKKLGEFGKAKGSRAIWYMWLGARFLEFEALGFLNEDHWFSRENSLSGVEGEGLHKLGYILRDVSKKEGGAMYADDTAGWDTRITLEDLKNEEMVTNHMEGEHKKLAEAIFKLTYQNKVVRVQRPTPRGTVMDIISRRDQRGSGQVGTYGLNTFTNMEAQLIRQMEGEGVFKSIQHLTITEEIAVQNWLARVGRERLSRMAISGDDCVVKPLDDRFASALTALNDMGKIRKDIQQWEPSRGWNDWTQVPFCSHHFHELIMKDGRVLVVPCRNQDELIGRARISQGAGWSLRETACLGKSYAQMWSLMYFHRRDLRLAANAICSAVPSHWVPTSRTTWSIHAKHEWMTTEDMLTVWNRVWIQENPWMEDKTPVESWEEIPYLGKREDQWCGSLIGLTSRATWAKNIQAAINQVRSLIGNEEYTDYMPSMKRFRREEEEAGVLW,900,FALSE,16897,16897,0,-5.373371442,median,Suphatrakul,Functional analysis of flavivirus replicase by deep mutational scanning of dengue NS5,2023,10.1101/2023.03.07.531617,1-900,Flavivirus NS5,Viral replication,Growth,POLG_DEN26_2023-08-07_b01.a2m,1,900,900,0.1,0.01,10676,1,900,114.5,0.1272222222,Low,0,0,POLG_DEN26_Suphatrakul_2023.csv,score,1,mutant,POLG_DEN26_theta0.01_2023-08-07_b01.npy,POLG_DEN26.pdb,1-900,1,,OrganismalFitness
+POLG_HCVJF_Qi_2014,POLG_HCVJF_Qi_2014.csv,POLG_HCVJF,Virus,Hepatitis C virus genotype 2a (isolate JFH-1) (HCV),MSTNPKPQRKTKRNTNRRPEDVKFPGGGQIVGGVYLLPRRGPRLGVRTTRKTSERSQPRGRRQPIPKDRRSTGKAWGKPGRPWPLYGNEGLGWAGWLLSPRGSRPSWGPTDPRHRSRNVGKVIDTLTCGFADLMGYIPVVGAPLSGAARAVAHGVRVLEDGVNYATGNLPGFPFSIFLLALLSCITVPVSAAQVKNTSSSYMVTNDCSNDSITWQLEAAVLHVPGCVPCERVGNTSRCWVPVSPNMAVRQPGALTQGLRTHIDMVVMSATFCSALYVGDLCGGVMLAAQVFIVSPQYHWFVQECNCSIYPGTITGHRMAWDMMMNWSPTATMILAYVMRVPEVIIDIVSGAHWGVMFGLAYFSMQGAWAKVIVILLLAAGVDAGTTTVGGAVARSTNVIAGVFSHGPQQNIQLINTNGSWHINRTALNCNDSLNTGFLAALFYTNRFNSSGCPGRLSACRNIEAFRIGWGTLQYEDNVTNPEDMRPYCWHYPPKPCGVVPARSVCGPVYCFTPSPVVVGTTDRRGVPTYTWGENETDVFLLNSTRPPQGSWFGCTWMNSTGFTKTCGAPPCRTRADFNASTDLLCPTDCFRKHPDATYIKCGSGPWLTPKCLVHYPYRLWHYPCTVNFTIFKIRMYVGGVEHRLTAACNFTRGDRCDLEDRDRSQLSPLLHSTTEWAILPCTYSDLPALSTGLLHLHQNIVDVQYMYGLSPAITKYVVRWEWVVLLFLLLADARVCACLWMLILLGQAEAALEKLVVLHAASAANCHGLLYFAIFFVAAWHIRGRVVPLTTYCLTGLWPFCLLLMALPRQAYAYDAPVHGQIGVGLLILITLFTLTPGYKTLLGQCLWWLCYLLTLGEAMIQEWVPPMQVRGGRDGIAWAVTIFCPGVVFDITKWLLALLGPAYLLRAALTHVPYFVRAHALIRVCALVKQLAGGRYVQVALLALGRWTGTYIYDHLTPMSDWAASGLRDLAVAVEPIIFSPMEKKVIVWGAETAACGDILHGLPVSARLGQEILLGPADGYTSKGWKLLAPITAYAQQTRGLLGAIVVSMTGRDRTEQAGEVQILSTVSQSFLGTTISGVLWTVYHGAGNKTLAGLRGPVTQMYSSAEGDLVGWPSPPGTKSLEPCKCGAVDLYLVTRNADVIPARRRGDKRGALLSPRPISTLKGSSGGPVLCPRGHVVGLFRAAVCSRGVAKSIDFIPVETLDVVTRSPTFSDNSTPPAVPQTYQVGYLHAPTGSGKSTKVPVAYAAQGYKVLVLNPSVAATLGFGAYLSKAHGINPNIRTGVRTVMTGEAITYSTYGKFLADGGCASGAYDIIICDECHAVDATSILGIGTVLDQAETAGVRLTVLATATPPGSVTTPHPDIEEVGLGREGEIPFYGRAIPLSCIKGGRHLIFCHSKKKCDELAAALRGMGLNAVAYYRGLDVSIIPAQGDVVVVATDALMTGYTGDFDSVIDCNVAVTQAVDFSLDPTFTITTQTVPQDAVSRSQRRGRTGRGRQGTYRYVSTGERASGMFDSVVLCECYDAGAAWYDLTPAETTVRLRAYFNTPGLPVCQDHLEFWEAVFTGLTHIDAHFLSQTKQAGENFAYLVAYQATVCARAKAPPPSWDAMWKCLARLKPTLAGPTPLLYRLGPITNEVTLTHPGTKYIATCMQADLEVMTSTWVLAGGVLAAVAAYCLATGCVSIIGRLHVNQRVVVAPDKEVLYEAFDEMEECASRAALIEEGQRIAEMLKSKIQGLLQQASKQAQDIQPAMQASWPKVEQFWARHMWNFISGIQYLAGLSTLPGNPAVASMMAFSAALTSPLSTSTTILLNIMGGWLASQIAPPAGATGFVVSGLVGAAVGSIGLGKVLVDILAGYGAGISGALVAFKIMSGEKPSMEDVINLLPGILSPGALVVGVICAAILRRHVGPGEGAVQWMNRLIAFASRGNHVAPTHYVTESDASQRVTQLLGSLTITSLLRRLHNWITEDCPIPCSGSWLRDVWDWVCTILTDFKNWLTSKLFPKLPGLPFISCQKGYKGVWAGTGIMTTRCPCGANISGNVRLGSMRITGPKTCMNTWQGTFPINCYTEGQCAPKPPTNYKTAIWRVAASEYAEVTQHGSYSYVTGLTTDNLKIPCQLPSPEFFSWVDGVQIHRFAPTPKPFFRDEVSFCVGLNSYAVGSQLPCEPEPDADVLRSMLTDPPHITAETAARRLARGSPPSEASSSVSQLSAPSLRATCTTHSNTYDVDMVDANLLMEGGVAQTEPESRVPVLDFLEPMAEEESDLEPSIPSECMLPRSGFPRALPAWARPDYNPPLVESWRRPDYQPPTVAGCALPPPKKAPTPPPRRRRTVGLSESTISEALQQLAIKTFGQPPSSGDAGSSTGAGAAESGGPTSPGEPAPSETGSASSMPPLEGEPGDPDLESDQVELQPPPQGGGVAPGSGSGSWSTCSEEDDTTVCCSMSYSWTGALITPCSPEEEKLPINPLSNSLLRYHNKVYCTTSKSASQRAKKVTFDRTQVLDAHYDSVLKDIKLAASKVSARLLTLEEACQLTPPHSARSKYGFGAKEVRSLSGRAVNHIKSVWKDLLEDPQTPIPTTIMAKNEVFCVDPAKGGKKPARLIVYPDLGVRVCEKMALYDITQKLPQAVMGASYGFQYSPAQRVEYLLKAWAEKKDPMGFSYDTRCFDSTVTERDIRTEESIYQACSLPEEARTAIHSLTERLYVGGPMFNSKGQTCGYRRCRASGVLTTSMGNTITCYVKALAACKAAGIVAPTMLVCGDDLVVISESQGTEEDERNLRAFTEAMTRYSAPPGDPPRPEYDLELITSCSSNVSVALGPRGRRRYYLTRDPTTPLARAAWETVRHSPINSWLGNIIQYAPTIWVRMVLMTHFFSILMVQDTLDQNLNFEMYGSVYSVNPLDLPAIIERLHGLDAFSMHTYSHHELTRVASALRKLGAPPLRVWKSRARAVRASLISRGGKAAVCGRYLFNWAVKTKLKLTPLPEARLLDLSSWFTVGAGGGDIFHSVSRARPRSLLFGLLLLFVGVGLFLLPAR,3033,FALSE,1630,1630,0,-0.95,manual,Qi,A Quantitative High-Resolution Genetic Profile Rapidly Identifies Sequence Determinants of Hepatitis C Viral Fitness and Drug Sensitivity,2014,10.1371/journal.ppat.1004064,1994-2079,NS5A,Viral replication,Growth,POLG_HCVJF_theta0.99_1984-2089_11-26-2021_b08.a2m,1984,2089,106,0.8,0.01,16556,1,106,4421.2,41.70943396,medium,93,0.8773584906,POLG_HCVJF_Qi_2014.csv,fitness,1,mutant,POLG_HCVJF_theta_0.01.npy,POLG_HCVJF.pdb,1981-2224,0.1,,OrganismalFitness
+POLG_PESV_Tsuboyama_2023_2MXD,POLG_PESV_Tsuboyama_2023_2MXD.csv,POLG_PESV,Virus,Porcine enteric sapovirus (isolate Swine/United States/Cowden/1980),ALRDDEYDEWQDIIRDWRKEMTVQQFLDLKERALSGASDPDSQRYNAWLELRA,53,TRUE,5130,995,4135,-1.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-53,Genome polyprotein,Stability,cDNA display proteolysis,POLG_PESV_2023-08-07_b03.a2m,1,53,53,0.3,0.01,20190,0.887,47,3718.4,79.11489362,Medium,12,0.2553191489,Tsuboyama2023_Dataset2_Dataset34,ddG_ML_float,1,mut_type,POLG_PESV_theta0.01_2023-08-07_b03.npy,POLG_PESV.pdb,1-53,1,,Stability
+PPARG_HUMAN_Majithia_2016,PPARG_HUMAN_Majithia_2016.csv,PPARG_HUMAN,Human,Homo sapiens,MGETLGDSPIDPESDSFTDTLSANISQEMTMVDTEMPFWPTNFGISSVDLSVMEDHSHSFDIKPFTTVDFSSISTPHYEDIPFTRTDPVVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEPSNSLMAIECRVCGDKASGFHYGVHACEGCKGFFRRTIRLKLIYDRCDLNCRIHKKSRNKCQYCRFQKCLAVGMSHNAIRFGRMPQAEKEKLLAEISSDIDQLNPESADLRALAKHLYDSYIKSFPLTKAKARAILTGKTTDKSPFVIYDMNSLMMGEDKIKFKHITPLQEQSKEVAIRIFQGCQFRSVEAVQEITEYAKSIPGFVNLDLNDQVTLLKYGVHEIIYTMLASLMNKDGVLISEGQGFMTREFLKSLRKPFGDFMEPKFEFAVKFNALELDDSDLAIFIAVIILSGDRPGLLNVKPIEDIQDNLLQALELQLKLNHPESSQLFAKLLQKMTDLRQIVTEHVQLLQVIKKTETDMSLHPLLQEIYKDLY,505,FALSE,9576,9576,0,-2.5,manual,Majithia,Prospective functional classification of all possible missense variants in PPARG,2016,10.1038/ng.3700,2-505,PPARG,Expression of CD36,FACS,PPARG_HUMAN_2023-10-12_b04.a2m,1,505,505,0.4,0.2,39993,0.8,404,3092.1,7.653712871,Medium,86,0.2128712871,https://miter.broadinstitute.org/mitergrade/?query=p.Y505A&prevalence=1.0e-5,Experimental function score,1,mutant,PPARG_HUMAN_theta0.2_2023-10-12_b04.npy,PPARG_HUMAN.pdb,1-505,1,,Activity
+PPM1D_HUMAN_Miller_2022,PPM1D_HUMAN_Miller_2022.csv,PPM1D_HUMAN,Human,Homo sapiens,MAGLYSLGVSVFSDQGGRKYMEDVTQIVVEPEPTAEEKPSPRRSLSQPLPPRPSPAALPGGEVSGKGPAVAAREARDPLPDAGASPAPSRCCRRRSSVAFFAVCDGHGGREAAQFAREHLWGFIKKQKGFTSSEPAKVCAAIRKGFLACHLAMWKKLAEWPKTMTGLPSTSGTTASVVIIRGMKMYVAHVGDSGVVLGIQDDPKDDFVRAVEVTQDHKPELPKERERIEGLGGSVMNKSGVNRVVWKRPRLTHNGPVRRSTVIDQIPFLAVARALGDLWSYDFFSGEFVVSPEPDTSVHTLDPQKHKYIILGSDGLWNMIPPQDAISMCQDQEEKKYLMGEHGQSCAKMLVNRALGRWRQRMLRADNTSAIVICISPEVDNQGNFTNEDELYLNLTDSPSYNSQETCVMTPSPCSTPPVKSLEEDPWPRVNSKDHIPALVRSNAFSENFLEVSAEIARENVQGVVIPSKDPEPLEENCAKALTLRIHDSLNNSLPIGLVPTNSTNTVMDQKNLKMSTPGQMKAQEIERTPPTNFKRTLEESNSGPLMKKHRRNGLSRSSGAQPASLPTTSQRKNSVKLTMRRRLRGQKKIGNPLLHQHRKTVCVC,605,FALSE,7889,7889,0,0.01275,median,Miller,Allosteric inhibition of PPM1D serine/threonine phosphatase via an altered conformational state,2022,10.1038/s41467-022-30463-9,2-421,Protein phosphatase 1D,Fitness with GFP reporter,quantification and selection of GFP-positive cells by flow cytometry after DNA damage induced by daunorubicin,PPM1D_HUMAN_2023-10-12_b01.a2m,1,605,605,0.1,0.2,1844,0.993,601,346.3,0.5762063228,Low,27,0.04492512479,PPM1D_HUMAN_Miller_2022_raw.xlsx,fitness,1,mutant,PPM1D_HUMAN_theta0.2_2023-10-12_b01.npy,PPM1D_HUMAN.pdb,1-605,1,,OrganismalFitness
+PR40A_HUMAN_Tsuboyama_2023_1UZC,PR40A_HUMAN_Tsuboyama_2023_1UZC.csv,PR40A_HUMAN,Human,Homo sapiens,TYTWNTKEEAKQAFKELLKEKRVPSNASWEQAMKMIINDPRYSALAKLSEKKQAFNAYKVQTE,63,TRUE,2033,1163,870,-1.362579422,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-63,Pre-mRNA-processing factor 40 homolog A,Stability,cDNA display proteolysis,PR40A_HUMAN_2023-08-07_b03.a2m,1,63,63,0.3,0.2,63560,0.857,54,3663.8,67.84814815,Medium,16,0.2962962963,Tsuboyama2023_Dataset2_Dataset35,ddG_ML_float,1,mut_type,PR40A_HUMAN_theta0.2_2023-08-07_b03.npy,PR40A_HUMAN.pdb,1-63,1,,Stability
+PRKN_HUMAN_Clausen_2023,PRKN_HUMAN_Clausen_2023.csv,PRKN_HUMAN,Human,Homo sapiens,MIVFVRFNSSHGFPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELRNDWTVQNCDLDQQSIVHIVQRPWRKGQEMNATGGDDPRNAAGGCEREPQSLTRVDLSSSVLPGDSVGLAVILHTDSRKDSPPAGSPAGRSIYNSFYVYCKGPCQRVQPGKLRVQCSTCRQATLTLTQGPSCWDDVLIPNRMSGECQSPHCPGTSAEFFFKCGAHPTSDKETSVALHLIATNSRNITCITCTDVRSPVLVFQCNSRHVICLDCFHLYCVTRLNDRQFVHDPQLGYSLPCVAGCPNSLIKELHHFRILGEEQYNRYQQYGAEECVLQMGGVLCPRPGCGAGLLPEPDQRKVTCEGGNGLGCGFAFCRECKEAYHEGECSAVFEASGTTTQAYRVDERAAEQARWEAASKETIKKTTKPCPRCHVPVEKNGGCMHMKCPQPQCRLEWCWNCGCEWNRVCMGDHWFDV,465,FALSE,8756,8756,0,0.75,manual,Clausen,A mutational atlas for Parkin proteostasis,2023,10.1101/2023.06.08.544160,1-465,Parkin,protein stability,FACS,PRKN_HUMAN_2023-08-07_b05.a2m,1,465,465,0.5,0.2,1457,0.998,464,195.2,0.4206896552,Low,21,0.04525862069,urn_mavedb_00000114-a-1_scores.csv,score,1,mutant,PRKN_HUMAN_theta0.2_2023-08-07_b05.npy,PRKN_HUMAN.pdb,1-465,1,,Expression
+PSAE_PICP2_Tsuboyama_2023_1PSE,PSAE_PICP2_Tsuboyama_2023_1PSE.csv,PSAE_PICP2,Prokaryote,Picosynechococcus,AIERGSKVKILRKESYWYGDVGTVASIDKSGIIYPVIVRFNKVNYNGFSGSAGGLNTNNFAEHELEVV,68,TRUE,1579,1219,360,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-68,Photosystem I reaction center subunit IV,Stability,cDNA display proteolysis,PSAE_PICP2_2023-08-07_b09.a2m,1,68,68,0.9,0.2,1785,0.868,59,130.7,2.215254237,Medium,9,0.1525423729,Tsuboyama2023_Dataset2_Dataset36,ddG_ML_float,1,mut_type,PSAE_PICP2_theta0.2_2023-08-07_b09.npy,PSAE_PICP2.pdb,1-68,1,,Stability
+PTEN_HUMAN_Matreyek_2021,PTEN_HUMAN_Matreyek_2021.csv,PTEN_HUMAN,Human,Homo sapiens,MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV,403,FALSE,5083,5083,0,0.7708605475,median,Matreyek,Integrating thousands of PTEN variant activity and abundance measurements reveals variant subgroups and new dominant negatives in cancers,2021,10.1186/s13073-021-00984-x,1-403,PTEN,Protein abundance (FACS sorting for abundance of GFP-fused target),Protein stability,PTEN_HUMAN_full_11-26-2021_b01.a2m,1,403,403,0.1,0.2,19058,0.752,303,1425.3,4.703960396,medium,52,0.1716171617,PTEN_HUMAN_Matreyek_2021.csv,score_total,1,variant,PTEN_HUMAN_theta_0.2.npy,PTEN_HUMAN.pdb,1-403,0.1,,Expression
+PTEN_HUMAN_Mighell_2018,PTEN_HUMAN_Mighell_2018.csv,PTEN_HUMAN,Human,Homo sapiens,MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV,403,FALSE,7260,7260,0,-1.5,manual,Mighell,A Saturation Mutagenesis Approach to Understanding PTEN Lipid Phosphatase Activity and Genotype-Phenotype Relationships,2018,10.1016/j.ajhg.2018.03.018,1-403,PTEN,"growth (surrogate for enzymatic activity/hydrolysis of lipid phosphates to restore PIP2, which affects proliferation rate)",lipid phosphatase activity,PTEN_HUMAN_full_11-26-2021_b01.a2m,1,403,403,0.1,0.2,19058,0.752,303,1425.3,4.703960396,medium,52,0.1716171617,PTEN_HUMAN_Mighell_2018.csv,Fitness_score,1,mutant,PTEN_HUMAN_theta_0.2.npy,PTEN_HUMAN.pdb,1-403,0.1,,Activity
+Q2N0S5_9HIV1_Haddox_2018,Q2N0S5_9HIV1_Haddox_2018.csv,Q2N0S5_9HIV1,Virus,Human immunodeficiency virus 1,MRVMGIQRNCQHLFRWGTMILGMIIICSAAENLWVTVYYGVPVWKDAETTLFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLENVTEEFNMWKNNMVEQMHTDIISLWDQSLKPCVKLTPLCVTLQCTNVTNNITDDMRGELKNCSFNMTTELRDKKQKVYSLFYRLDVVQINENQGNRSNNSNKEYRLINCNTSAITQACPKVSFEPIPIHYCAPAGFAILKCKDKKFNGTGPCPSVSTVQCTHGIKPVVSTQLLLNGSLAEEEVMIRSENITNNAKNILVQFNTPVQINCTRPNNNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVSKATWNETLGKVVKQLRKHFGNNTIIRFANSSGGDLEVTTHSFNCGGEFFYCNTSGLFNSTWISNTSVQGSNSTGSNDSITLPCRIKQIINMWQRIGQAMYAPPIQGVIRCVSNITGLILTRDGGSTNSTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRAKRRVVGREKRAVGIGAVFLGFLGAAGSTMGAASMTLTVQARNLLSGIVQQQSNLLRAIEAQQHLLKLTVWGIKQLQARVLAVERYLRDQQLLGIWGCSGKLICTTNVPWNSSWSNRNLSEIWDNMTWLQWDKEISNYTQIIYGLLEESQNQQEKNEQDLLALDKWASLWNWFDISNWLWYIKIFIMIVGGLIGLRIVFAVLSVIHRVRQGYSPLSFQTHTPNPRGLDRPERIEEEDGEQDRGRSTRLVSGFLALAWDDLRSLCLFCYHRLRDFILIAARIVELLGHSSLKGLRLGWEGLKYLWNLLAYWGRELKISAINLFDTIAIAVAEWTDRVIEIGQRLCRAFLHIPRRIRQGLERALL,860,FALSE,12729,12729,0,-2,manual,Haddox,Mapping mutational effects along the evolutionary landscape of HIV envelope,2018,10.7554/eLife.34420,30-699,HIV env (BG505),Viral replication,Growth,Q2N0S5_9HIV1_full_theta0.99_04-29-2022_b09.a2m,1,860,860,0.9,0.01,75014,0.976,839,36369.7,43.3488677,medium,2462,2.934445769,Q2N0S5_9HIV1_Haddox_2018.csv,fitness,1,mutant,Q2N0S5_9HIV1_theta_0.01.npy,Q2N0S5_9HIV1.pdb,1-860,0.1,,OrganismalFitness
+Q53Z42_HUMAN_McShan_2019_binding-TAPBPR,Q53Z42_HUMAN_McShan_2019_binding-TAPBPR.csv,Q53Z42_HUMAN,Human,Homo sapiens,MAVMAPRTLVLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYWDGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDGKDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQRTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEPSSQPTIPIVGIIAGLVLFGAVITGAVVAAVMWRRKSSDRKGGSYSQAASSDSAQGSDVSLTACKV,365,FALSE,3344,3344,0,0.19,median,McShan,Molecular determinants of chaperone interactions on MHC-I for folding and antigen repertoire selection,2019,10.1073/pnas.1915562116,26-205,HLA-A,binding affinity (TAPBPR),,Q53Z42_HUMAN_2023-08-07_b01.a2m,1,365,365,0.1,0.2,41636,0.986,360,4986.2,13.85055556,Medium,210,0.5833333333,,score,1,mut_proteingym,Q53Z42_HUMAN_theta0.2_2023-08-07_b01.npy,Q53Z42_HUMAN.pdb,1-365,1,25,Binding
+Q53Z42_HUMAN_McShan_2019_expression,Q53Z42_HUMAN_McShan_2019_expression.csv,Q53Z42_HUMAN,Human,Homo sapiens,MAVMAPRTLVLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYWDGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDGKDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQRTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEPSSQPTIPIVGIIAGLVLFGAVITGAVVAAVMWRRKSSDRKGGSYSQAASSDSAQGSDVSLTACKV,365,FALSE,3344,3344,0,-0.73,median,McShan,Molecular determinants of chaperone interactions on MHC-I for folding and antigen repertoire selection,2019,10.1073/pnas.1915562116,26-205,HLA-A,surface expression,,Q53Z42_HUMAN_2023-08-07_b01.a2m,1,365,365,0.1,0.2,41636,0.986,360,4986.2,13.85055556,Medium,210,0.5833333333,,score,1,mut_proteingym,Q53Z42_HUMAN_theta0.2_2023-08-07_b01.npy,Q53Z42_HUMAN.pdb,1-365,1,25,Expression
+Q59976_STRSQ_Romero_2015,Q59976_STRSQ_Romero_2015.csv,Q59976_STRSQ,Prokaryote,Streptomyces sp,MVPAAQQTAMAPDAALTFPEGFLWGSATASYQIEGAAAEDGRTPSIWDTYARTPGRVRNGDTGDVATDHYHRWREDVALMAELGLGAYRFSLAWPRIQPTGRGPALQKGLDFYRRLADELLAKGIQPVATLYHWDLPQELENAGGWPERATAERFAEYAAIAADALGDRVKTWTTLNEPWCSAFLGYGSGVHAPGRTDPVAALRAAHHLNLGHGLAVQALRDRLPADAQCSVTLNIHHVRPLTDSDADADAVRRIDALANRVFTGPMLQGAYPEDLVKDTAGLTDWSFVRDGDLRLAHQKLDFLGVNYYSPTLVSEADGSGTHNSDGHGRSAHSPWPGADRVAFHQPPGETTAMGWAVDPSGLYELLRRLSSDFPALPLVITENGAAFHDYADPEGNVNDPERIAYVRDHLAAVHRAIKDGSDVRGYFLWSLLDNFEWAHGYSKRFGAVYVDYPTGTRIPKASARWYAEVARTGVLPTAGDPNSSSVDKLAAALEHHHHHH,501,FALSE,2999,2999,0,-1,manual,Romero,Dissecting enzyme function with microfluidic-based deep mutational scanning,2015,10.1073/pnas.1422285112,2-501,β-glucosidase,Enzyme function,Activity,Q59976_STRSQ_full_11-26-2021_b03.a2m,1,501,501,0.3,0.2,105913,0.882,442,13981.2,31.63167421,medium,850,1.923076923,Q59976_STRSQ_Romero_2015.csv,enrichment,1,mutant,Q59976_STRSQ_theta_0.2.npy,Q59976_STRSQ.pdb,1-501,0.1,,Activity
+Q6WV12_9MAXI_Somermeyer_2022,Q6WV12_9MAXI_Somermeyer_2022.csv,Q6WV12_9MAXI,Eukaryote,Pontellina plumata,MPAMKIECRITGTLNGVEFELVGGGEGTPEQGRMTNKMKSTKGALTFSPYLLSHVMGYGFYHFGTYPSGYENPFLHAINNGGYTNTRIEKYEDGGVLHVSFSYRYEAGRVIGDFKVVGTGFPEDSVIFTDKIIRSNATVEHLHPMGDNVLVGSFARTFSLRDGGYYSFVVDSHMHFKSAIHPSILQNGGPMFAFRRVEELHSNTELGIVEYQHAFKTPIAFA,222,TRUE,31401,1141,30260,15721.24977,median,Somermeyer,Heterogeneity of the GFP fitness landscape and data-driven protein design,2022,10.7554/eLife.75842,2-222,Green fluorescent protein ppluGFP2,Fluorescence,FACS,Q6WV12_9MAXI_full_b0.6.a2m,1,222,222,0.6,0.2,506,1,222,95.9,0.431981982,Low,4,0.01801801802,Q6WV12_9MAXI_Somermeyer_2022.csv,replicates_mean_brightness,1,mutant,Q6WV12_9MAXI_theta_0.2.npy,Q6WV12_9MAXI.pdb,1-222,1,,Activity
+Q837P4_ENTFA_Meier_2023,Q837P4_ENTFA_Meier_2023.csv,Q837P4_ENTFA,Prokaryote,Enterococcus faecalis,MTDLIKASKFFYHYLKRYKVSFLFIFLAIFAATYLQVKAPQFVGEAIQELAKYAVNVMQGKDDKSAFVSVIWKLLIFYVLTSAASFIYSILFTQVVGKSTNRMRIGLFNKLEKLTIRFFDSHQDGEILSRFTSDLDNIQNSLNQALLQVLTNIALLVGVLIMMFRQNVELAWATIASTPIAILIAVFVISKARKYVDLQQDEVGKLNGYMDEKISGQRVIITNGLQEETIDGFLEQNEKVRAATYKGQVYSGLLFPMMQGMSLVNTAIVIFFGGWLAINGSVDRAAALGLVVMFVQYSQQYYQPLMQISSGYSMIQLAVTGARRLNEMFDEPDEIRPENGEKLEEINKAVALNHVVFGYNPETPVLKDVSIHVDKGEMVALVGPTGSGKTTIMNLMNRFYDVNEGAVTFDGVDIREMDLDSLRSHVGIVLQESVLFSGTIRENIAFGKPEATDEEIVQAAKQANIHEFIVNLEQGYDTEITEENNLFSTGQKQLVSIARTIITNPELLILDEATSNVDTVTEAKIQKAMDEAIKGRTSFVIAHRLKTILNADRIIVLRDGEVIEEGNHHELVEQDGFYAELYKNQFVFE,589,FALSE,697,697,0,-0.6270963227,median,Meier,Deep mutational scan of a drug efflux pump reveals its structure–function landscape,2023,10.1038/s41589-022-01205-1,32-543,EfrD ABC transporter,Drug efflux,Growth,Q837P4_ENTFA_2023-08-07_b09.a2m,1,589,589,0.9,0.2,343933,0.975,574,54079.8,94.2151568,Medium,1123,1.95644599,41589_2022_1205_MOESM4_ESM.xlsx,avg_score,1,mutant,Q837P4_ENTFA_theta0.2_2023-08-07_b09.npy,Q837P4_ENTFA.pdb,1-589,1,,Activity
+Q837P5_ENTFA_Meier_2023,Q837P5_ENTFA_Meier_2023.csv,Q837P5_ENTFA,Prokaryote,Enterococcus faecalis,MDLIIQHAKKYKGSVVIALLAVIVMVVSALWQPKLLQQVLEAIMNDDSDKMKNLGIQLIAIAGLGLVAGVINTIFSAKVAQGVSADIREATFRKIQTFSFGNIEKFSAGNLVVRLTNDVTQIQNVIMIALQTLFRIPFLFIGSFILAMLTLPQLWWVIVALVIAVILISMLSFSQMGKHFMIIQNLIDKINGIAKENLLGIRVVKSFVQEKNQLSRFTKVSEELTTHNLIVGSLFAVMIPAFMLVANLAVVGSIFFVSNLVKDDPTLIGGVASFMNYLMQIMMAIIIGGMMMMMTSRAAVSIKRIKEVMETEPDVTYKKVPEQELIGSVEFDHVSFRYPGDEEDTLKDISFSIQPGEMIGIVGATGAGKSTLAQLIPRLFDPTEGKIEVGGVDLREVNEHSLRKTVSFVLQKAILFSGTIAQNLRHGKRDASEADMERASGIAQAKEFIEKLAEGYDAPVEERSNNFSGGQKQRLSITRGVIGEPKILILDDSTSALDARSERLVREALDKELKETTTIVIAQKISSVVHADRILVLDNGRLVGEGTHEELAATNPVYQEIYETQKGKEEA,571,FALSE,747,747,0,-0.85731232,median,Meier,Deep mutational scan of a drug efflux pump reveals its structure–function landscape,2023,10.1038/s41589-022-01205-1,25-523,EfrC ABC transporter,Drug efflux,Growth,Q837P5_ENTFA_2023-08-07_b09.a2m,1,571,571,0.9,0.2,346355,0.993,567,54910.5,96.8439153,Medium,1135,2.00176367,,avg_score,1,mutant,Q837P5_ENTFA_theta0.2_2023-08-07_b09.npy,Q837P5_ENTFA.pdb,1-571,1,,Activity
+Q8WTC7_9CNID_Somermeyer_2022,Q8WTC7_9CNID_Somermeyer_2022.csv,Q8WTC7_9CNID,Eukaryote,Aequorea macrodactyla,MSKGEELFTGIVPVLIELDGDVHGHKFSVRGEGEGDADYGKLEIKFICTTGKLPVPWPTLVTTLSYGILCFARYPEHMKMNDFFKSAMPEGYIQERTIFFQDDGKYKTRGEVKFEGDTLVNRIELKGMDFKEDGNILGHKLEYNFNSHNVYIMPDKANNGLKVNFKIRHNIEGGGVQLADHYQTNVPLGDGPVLIPINHYLSCQTAISKDRNETRDHMVFLEFFSACGHTHGMDELYK,238,TRUE,33510,1201,32309,5000,manual,Somermeyer,Heterogeneity of the GFP fitness landscape and data-driven protein design,2022,10.7554/eLife.75842,2-238,Green fluorescent protein amacGFP,Fluorescence,FACS,Q8WTC7_9CNID_full_b0.5.a2m,1,238,238,0.5,0.2,655,1,238,118.5,0.4978991597,Low,5,0.02100840336,Q8WTC8_9CNID_Somermeyer_2022.csv,replicates_mean_brightness,1,mutant,Q8WTC7_9CNID_theta_0.2.npy,Q8WTC7_9CNID.pdb,1-238,1,,Activity
+R1AB_SARS2_Flynn_2022,R1AB_SARS2_Flynn_2022.csv,R1AB_SARS2,Virus,Severe acute respiratory syndrome coronavirus 2 (2019-nCoV) (SARS-CoV-2),SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ,306,FALSE,5725,5725,0,0.5,manual,Flynn,Comprehensive fitness landscape of SARS-CoV-2 Mpro reveals insights into viral resistance mechanisms,2022,10.7554/eLife.77433,1-306,SARS-CoV-2 Mpro,"FRET, Growth",,R1AB_SARS2_02-19-2022_b07.a2m,1,306,306,0.7,0.01,182169,1,306,326.3,1.066339869,medium,79,0.2581699346,R1AB_SARS2_Flynn_2022.csv,average_growth,1,mutant,R1AB_SARS2_theta_0.01.npy,R1AB_SARS2.pdb,1-306,0.1,,OrganismalFitness
+RAD_ANTMA_Tsuboyama_2023_2CJJ,RAD_ANTMA_Tsuboyama_2023_2CJJ.csv,RAD_ANTMA,Eukaryote,Antirrhinum majus,PWSAKENKAFERALAVYDKDTPDRWANVARAVEGRTPEEVKKHYEILVEDIKYI,54,TRUE,912,774,138,-0.3943851731,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-54,Transcription factor RADIALIS,Stability,cDNA display proteolysis,RAD_ANTMA_2023-08-07_b01.a2m,1,54,54,0.1,0.2,423275,0.833,45,38133.9,847.42,High,27,0.6,Tsuboyama2023_Dataset2_Dataset37,ddG_ML_float,1,mut_type,RAD_ANTMA_theta0.2_2023-08-07_b01.npy,RAD_ANTMA.pdb,1-54,1,,Stability
+RAF1_HUMAN_Zinkus-Boltz_2019,RAF1_HUMAN_Zinkus-Boltz_2019.csv,RAF1_HUMAN,Human,Homo sapiens,MEHIQGAWKTISNGFGFKDAVFDGSSCISPTIVQQFGYQRRASDDGKLTDPSKTSNTIRVFLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAASLIGEELQVDFLDHVPLTTHNFARKTFLKLAFCDICQKFLLNGFRCQTCGYKFHEHCSTKVPTMCVDWSNIRQLLLFPNSTIGDSGVPALPSLTMRRMRESVSRMPVSSQHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVDSRMIEDAIRSHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQEKNKIRPRGQRDSSYYWEIEASEVMLSTRIGSGSFGTVYKGKWHGDVAVKILKVVDPTPEQFQAFRNEVAVLRKTRHVNILLFMGYMTKDNLAIVTQWCEGSSLYKHLHVQETKFQMFQLIDIARQTAQGMDYLHAKNIIHRDMKSNNIFLHEGLTVKIGDFGLATVKSRWSGSQQVEQPTGSVLWMAPEVIRMQDNNPFSFQSDVYSYGIVLYELMTGELPYSHINNRDQIIFMVGRGYASPDLSKLYKNCPKAMKRLVADCVKKVKEERPLFPQILSSIELLQHSLPKINRSASEPSLHRAAHTEDINACTLTTSPRLPVF,648,FALSE,297,297,0,-0.0671,median,Zinkus-Boltz,A Phage-Assisted Continuous Selection Approach for Deep Mutational Scanning of Protein–Protein Interactions,2019,10.1021/acschembio.9b00669,52-90,RAF oncogene,Viral Replication,binding assays,RAF1_HUMAN_2023-10-12_b05.a2m,1,648,648,0.5,0.2,9685,0.972,630,350.5,0.5563492063,Low,30,0.04761904762,urn_mavedb_00000061-a-1_scores.csv,score,1,mutant,RAF1_HUMAN_theta0.2_2023-10-12_b05.npy,RAF1_HUMAN.pdb,1-648,1,,OrganismalFitness
+RASH_HUMAN_Bandaru_2017,RASH_HUMAN_Bandaru_2017.csv,RASH_HUMAN,Human,Homo sapiens,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKCDLAARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQHKLRKLNPPDESGPGCMSCKCVLS,189,FALSE,3134,3134,0,-0.25,manual,Bandaru,Deconstruction of the Ras switching cycle through saturation mutagenesis,2017,10.7554/eLife.27810,2-166,HRAS,C-Raf binding and GEF,activity,RASH_HUMAN_full_11-26-2021_b03.a2m,1,189,189,0.3,0.2,204751,0.862,163,23971.6,147.0650307,high,205,1.257668712,RASH_HUMAN_Bandaru_2017.csv,unregulated,1,mutant,RASH_HUMAN_theta_0.2.npy,RASH_HUMAN.pdb,1-189,0.1,,Activity
+RASK_HUMAN_Weng_2022_abundance,RASK_HUMAN_Weng_2022_abundance.csv,RASK_HUMAN,Human,Homo sapiens,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKRVKDSEDVPMVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQGVDDAFYTLVREIRKHKEKMSKDGKKKKKKSKTKCVIM,188,TRUE,26012,3066,22946,-0.504113408,median,Weng,The energetic and allosteric landscape for KRAS inhibition,2022,10.1101/2022.12.06.519122,2-188,KRAS,Yeast growth,,RASK_HUMAN_2023-08-07_b03.a2m,1,188,188,0.3,0.2,260539,0.888,167,27850.5,166.7694611,High,211,1.263473054,kras_fitness.xlsx,fitness,1,mutant,RASK_HUMAN_theta0.2_2023-08-07_b03.npy,RASK_HUMAN.pdb,1-188,1,,Expression
+RASK_HUMAN_Weng_2022_binding-DARPin_K55,RASK_HUMAN_Weng_2022_binding-DARPin_K55.csv,RASK_HUMAN,Human,Homo sapiens,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKRVKDSEDVPMVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQGVDDAFYTLVREIRKHKEKMSKDGKKKKKKSKTKCVIM,188,TRUE,24873,3084,21789,-0.4605455262,median,Weng,The energetic and allosteric landscape for KRAS inhibition,2022,10.1101/2022.12.06.519127,2-188,KRAS,Yeast growth,,RASK_HUMAN_2023-08-07_b03.a2m,1,188,188,0.3,0.2,260539,0.888,167,27850.5,166.7694611,High,211,1.263473054,kras_fitness.xlsx,fitness,1,mutant,RASK_HUMAN_theta0.2_2023-08-07_b03.npy,RASK_HUMAN.pdb,1-188,1,,Binding
+RBP1_HUMAN_Tsuboyama_2023_2KWH,RBP1_HUMAN_Tsuboyama_2023_2KWH.csv,RBP1_HUMAN,Human,Homo sapiens,ETQAGIKEEIRRQEFLLNSLHRDLQGGIKDLSKEERLWEVQRILTALKRKLR,52,TRUE,1332,975,357,-0.2693189895,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-52,RalA-binding protein 1,Stability,cDNA display proteolysis,RBP1_HUMAN_2023-08-07_b01.a2m,1,52,52,0.1,0.2,135922,0.827,43,50510,1174.651163,High,6,0.1395348837,Tsuboyama2023_Dataset2_Dataset38,ddG_ML_float,1,mut_type,RBP1_HUMAN_theta0.2_2023-08-07_b01.npy,RBP1_HUMAN.pdb,1-52,1,,Stability
+RCD1_ARATH_Tsuboyama_2023_5OAO,RCD1_ARATH_Tsuboyama_2023_5OAO.csv,RCD1_ARATH,Eukaryote,Arabidopsis thaliana,PTLFAAISHKVAENDMLLINADYQQLRDKKMTRAEFVRKLRVIVGDDLLRSTITTLQ,57,TRUE,1261,988,273,-0.3828831078,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-57,Inactive poly [ADP-ribose] polymerase RCD1,Stability,cDNA display proteolysis,RCD1_ARATH_2023-08-07_b02.a2m,1,57,57,0.2,0.2,6525,0.93,53,1578.5,29.78301887,Medium,2,0.03773584906,Tsuboyama2023_Dataset2_Dataset39,ddG_ML_float,1,mut_type,RCD1_ARATH_theta0.2_2023-08-07_b02.npy,RCD1_ARATH.pdb,1-57,1,,Stability
+RCRO_LAMBD_Tsuboyama_2023_1ORC,RCRO_LAMBD_Tsuboyama_2023_1ORC.csv,RCRO_LAMBD,Virus,Escherichia phage lambda (Bacteriophage lambda),QRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSVYAEEVKDGEVKPFPS,63,TRUE,2278,1195,1083,-1.255848942,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-63,Regulatory protein cro,Stability,cDNA display proteolysis,RCRO_LAMBD_2023-08-07_b03.a2m,1,63,63,0.3,0.2,392895,0.762,48,51658.6,1076.220833,High,32,0.6666666667,Tsuboyama2023_Dataset2_Dataset40,ddG_ML_float,1,mut_type,RCRO_LAMBD_theta0.2_2023-08-07_b03.npy,RCRO_LAMBD.pdb,1-63,1,,Stability
+RD23A_HUMAN_Tsuboyama_2023_1IFY,RD23A_HUMAN_Tsuboyama_2023_1IFY.csv,RD23A_HUMAN,Human,Homo sapiens,SEYETMLTEIMSMGYERERVVAALRASYNNPHRAVEYLLTGIPG,44,TRUE,1019,798,221,-0.7285205281,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-44,UV excision repair protein RAD23 homolog A,Stability,cDNA display proteolysis,RD23A_HUMAN_2023-08-07_b04.a2m,1,44,44,0.4,0.2,100991,0.864,38,7912.9,208.2342105,High,21,0.5526315789,Tsuboyama2023_Dataset2_Dataset41,ddG_ML_float,1,mut_type,RD23A_HUMAN_theta0.2_2023-08-07_b04.npy,RD23A_HUMAN.pdb,1-44,1,,Stability
+RDRP_I33A0_Li_2023,RDRP_I33A0_Li_2023.csv,RDRP_I33A0,Virus,Influenza A virus (strain A/Wilson-Smith/1933 H1N1),MDVNPTLLFLKVPAQNAISTTFPYTGDPPYSHGTGTGYTMDTVNRTHQYSERGRWTTNTETGAPQLNPIDGPLPEDNEPSGYAQTDCVLEAMAFLEESHPGIFETSCLETMEVVQQTRVDKLTQGRQTYDWTLNRNQPAATALANTIEVFRSNGLTANESGRLIDFLKDVMESMNKEEMEITTHFQRKRRVRDNMTKKMVTQRTIGKRKQRLNKRSYLIRALTLNTMTKDAERGKLKRRAIATPGMQIRGFVYFVETLARSICEKLEQSGLPVGGNEKKAKLANVVRKMMTNSQDTEISFTITGDNTKWNENQNPRMFLAMITYITRNQPEWFRNVLSIAPIMFSNKMARLGKGYMFESKSMKIRTQIPAEMLASIDLKYFNDSTRKKIEKIRPLLIDGTASLSPGMMMGMFNMLSTVLGVSILNLGQKRHTKTTYWWDGLQSSDDFALIVNAPNHEGIQAGVNRFYRTCKLLGINMSKKKSYINRTGTFEFTSFFYRYGFVANFSMELPSFGVSGINESADMSIGVTVIKNNMINNDLGPATAQMALQLFIKDYRYTYRCHRGDTQIQTRRSFEIKKLWEQTHSKAGLLVSDGGPNLYNIRNLHIPEVCLKWELMDEDYQGRLCNPLNPFVNHKDIESVNNAVIMPAHGPAKNMEYDAVATTHSWIPKRNRSILNTSQRGILEDEQMYQKCCNLFEKFFPSSSYRRPVGISSMVEAMVSRARIDARIDFESGRIKKEEFTEIMKICSTIEELRRQK,757,FALSE,12003,12003,0,-1,manual,Li,Deep mutational scanning reveals the functional constraints and evolutionary potential of the influenza A virus PB1 protein,2023,10.1101/2023.08.27.554986,1-757,Influenza RNA polymerase PB1,Viral Replication,Growth,RDRP_I33A0_2023-08-07_b01.a2m,1,757,757,0.1,0.01,26589,1,757,102.8,0.1357992074,Low,0,0,554986_file16.csv,fitness,1,mutant,RDRP_I33A0_theta0.01_2023-08-07_b01.npy,RDRP_I33A0.pdb,1-757,1,,OrganismalFitness
+REV_HV1H2_Fernandes_2016,REV_HV1H2_Fernandes_2016.csv,REV_HV1H2,Virus,Human immunodeficiency virus type 1 group M subtype B (isolate HXB2) (HIV-1),MAGRSGDSDEELIRTVRLIKLLYQSNPPPNPEGTRQARRNRRRRWRERQRQIHSISERILSTYLGRSAEPVPLQLPPLERLTLDCNEDCGTSGTQGVGSPQILVESPTVLESGTKE,116,FALSE,2147,2147,0,-0.06744744968,median,Fernandes,Functional Segregation of Overlapping Genes in HIV,2016,10.1016/j.cell.2016.11.031,1-116,HIV rev,Viral replication,Growth,REV_HV1H2_full_theta0.99_04-29-2022_b09.a2m,1,116,116,0.9,0.01,15839,0.948,110,9951.8,90.47090909,medium,54,0.4909090909,REV_HV1H2_Fernandes_2016.csv,sel_coeff_mean,1,mutant,REV_HV1H2_theta_0.01.npy,REV_HV1H2.pdb,1-116,0.1,,OrganismalFitness
+RFAH_ECOLI_Tsuboyama_2023_2LCL,RFAH_ECOLI_Tsuboyama_2023_2LCL.csv,RFAH_ECOLI,Prokaryote,Escherichia coli,ATPYPGDKVIITEGAFEGFQAIFTEPDGEARSMLLLNLINKEIKHSVKNTEFRKL,55,TRUE,1326,969,357,-0.4014057355,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,Transcription antitermination protein RfaH,Stability,cDNA display proteolysis,RFAH_ECOLI_2023-08-07_b04.a2m,1,55,55,0.4,0.2,86049,0.927,51,11748.4,230.3607843,High,35,0.6862745098,Tsuboyama2023_Dataset2_Dataset42,ddG_ML_float,1,mut_type,RFAH_ECOLI_theta0.2_2023-08-07_b04.npy,RFAH_ECOLI.pdb,1-55,1,,Stability
+RL20_AQUAE_Tsuboyama_2023_1GYZ,RL20_AQUAE_Tsuboyama_2023_1GYZ.csv,RL20_AQUAE,Prokaryote,Aquifex aeolicus,WIARINAAVRAYGLNYSTFINGLKKAGIELDRKILADMAVRDPQAFEQVVNKVKEALQV,59,TRUE,1461,1121,340,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-59,Large ribosomal subunit protein bL20,Stability,cDNA display proteolysis,RL20_AQUAE_2023-08-07_b01.a2m,1,59,59,0.1,0.2,397758,0.814,48,104766.4,2182.633333,High,34,0.7083333333,Tsuboyama2023_Dataset2_Dataset43,ddG_ML_float,1,mut_type,RL20_AQUAE_theta0.2_2023-08-07_b01.npy,RL20_AQUAE.pdb,1-59,1,,Stability
+RL40A_YEAST_Mavor_2016,RL40A_YEAST_Mavor_2016.csv,RL40A_YEAST,Eukaryote,Saccharomyces cerevisiae,MQIFVKTLTGKTITLEVESSDTIDNVKSKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGIIEPSLKALASKYNCDKSVCRKCYARLPPRATNCRKRKCGHTNQLRPKKKLK,128,FALSE,1253,1253,0,-0.2,manual,Mavor,Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting,2016,10.7554/eLife.15802,2-76,Ubiquitin,Growth,Growth,RL40A_YEAST_full_11-26-2021_b01.a2m,1,128,128,0.1,0.2,16228,0.695,89,3974.4,44.65617978,medium,12,0.1348314607,RL401_YEAST_Mavor_2016.csv,DMSO,1,mutant,RL40A_YEAST_theta_0.2.npy,RL40A_YEAST.pdb,1-128,0.1,,OrganismalFitness
+RL40A_YEAST_Roscoe_2013,RL40A_YEAST_Roscoe_2013.csv,RL40A_YEAST,Eukaryote,Saccharomyces cerevisiae,MQIFVKTLTGKTITLEVESSDTIDNVKSKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGIIEPSLKALASKYNCDKSVCRKCYARLPPRATNCRKRKCGHTNQLRPKKKLK,128,FALSE,1195,1195,0,-0.2,manual,Roscoe,Analyses of the Effects of All Ubiquitin Point Mutants on Yeast Growth Rate,2013,10.1016/j.jmb.2013.01.032,2-76,Ubiquitin,Growth (essential function),Growth,RL40A_YEAST_full_11-26-2021_b01.a2m,1,128,128,0.1,0.2,16228,0.695,89,3974.4,44.65617978,medium,12,0.1348314607,RL401_YEAST_Roscoe_2013.csv,Selection Coefficient,1,mutant,RL40A_YEAST_theta_0.2.npy,RL40A_YEAST.pdb,1-128,0.1,,OrganismalFitness
+RL40A_YEAST_Roscoe_2014,RL40A_YEAST_Roscoe_2014.csv,RL40A_YEAST,Eukaryote,Saccharomyces cerevisiae,MQIFVKTLTGKTITLEVESSDTIDNVKSKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGIIEPSLKALASKYNCDKSVCRKCYARLPPRATNCRKRKCGHTNQLRPKKKLK,128,FALSE,1380,1380,0,0.5,manual,Roscoe,"Systematic Exploration of Ubiquitin Sequence, E1 Activation Efficiency, and Experimental Fitness in Yeast",2014,10.1016/j.jmb.2014.05.019,2-76,Ubiquitin,E1 reactivity,Binding,RL40A_YEAST_full_11-26-2021_b01.a2m,1,128,128,0.1,0.2,16228,0.695,89,3974.4,44.65617978,medium,12,0.1348314607,RL401_YEAST_Roscoe_2014.csv,rel_react,1,mutant,RL40A_YEAST_theta_0.2.npy,RL40A_YEAST.pdb,1-128,0.1,,Activity
+RNC_ECOLI_Weeks_2023,RNC_ECOLI_Weeks_2023.csv,RNC_ECOLI,Prokaryote,Escherichia coli,MNPIVINRLQRKLGYTFNHQELLQQALTHRSASSKHNERLEFLGDSILSYVIANALYHRFPRVDEGDMSRMRATLVRGNTLAELAREFELGECLRLGPGELKSGGFRRESILADTVEALIGGVFLDSDIQTVEKLILNWYQTRLDEISPGDKQKDPKTRLQEYLQGRHLPLPTYLVVQVRGEAHDQEFTIHCQVSGLSEPVVGTGSSRRKAEQAAAEQALKKLELE,226,FALSE,4277,4277,0,-0.054826707,median,Weeks,Fitness and Functional Landscapes of the E. coli RNase III Gene rnc,2023,10.1093/molbev/msad047,1-226,RNase III,Fluorescence,FACS,RNC_ECOLI_2023-08-07_b06.a2m,1,226,226,0.6,0.2,66507,0.969,219,16221.4,74.07031963,Medium,275,1.255707763,RNC_ECOLI_Weeks_2023.csv,Functional Score Weighted Mean,1,mutant,RNC_ECOLI_theta0.2_2023-08-07_b06.npy,RNC_ECOLI.pdb,1-226,1,,Activity
+RPC1_BP434_Tsuboyama_2023_1R69,RPC1_BP434_Tsuboyama_2023_1R69.csv,RPC1_BP434,Virus,Enterobacteria phage 434 (Bacteriophage 434),SISSRVKSKRIQLGLNQAELAQKVGTTQQSIEQLENGKTKRPRFLPELASALGVSVDWLLN,61,TRUE,1459,1124,335,-1.349855239,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-61,Repressor protein CI,Stability,cDNA display proteolysis,RPC1_BP434_2023-08-07_b05.a2m,1,61,61,0.5,0.01,820224,0.951,58,192520.2,3319.313793,High,73,1.25862069,Tsuboyama2023_Dataset2_Dataset44,ddG_ML_float,1,mut_type,RPC1_BP434_theta0.01_2023-08-07_b05.npy,RPC1_BP434.pdb,1-61,1,,Stability
+RPC1_LAMBD_Li_2019_high-expression,RPC1_LAMBD_Li_2019_high-expression.csv,RPC1_LAMBD,Virus,Escherichia phage lambda (Bacteriophage lambda),MSTKKKPLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALLAKILKVSVEEFSPSIAREIYEMYEAVSMQPSLRSEYEYPVFSHVQAGMFSPELRTFTKGDAERWVSTTKKASDSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG,237,FALSE,351,351,0,7,manual,Li,Changes in gene expression predictably shift and switch genetic interactions,2019,10.1038/s41467-019-11735-3,19-77,CI,Repressor activity (FACS sorting for expression of GFP reporter),FACS,RPC1_LAMBD_2023-08-07_b03.a2m,1,237,237,0.3,0.2,100755,0.886,210,28172.8,134.1561905,High,219,1.042857143,,H_GFP_mean_scaled,-1,mut_proteingym,RPC1_LAMBD_theta0.2_2023-08-07_b03.npy,RPC1_LAMBD.pdb,1-237,1,18,Activity
+RPC1_LAMBD_Li_2019_low-expression,RPC1_LAMBD_Li_2019_low-expression.csv,RPC1_LAMBD,Virus,Escherichia phage lambda (Bacteriophage lambda),MSTKKKPLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALLAKILKVSVEEFSPSIAREIYEMYEAVSMQPSLRSEYEYPVFSHVQAGMFSPELRTFTKGDAERWVSTTKKASDSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG,237,FALSE,351,351,0,8.481244509,median,Li,Changes in gene expression predictably shift and switch genetic interactions,2019,10.1038/s41467-019-11735-3,19-77,CI,Repressor activity (FACS sorting for expression of GFP reporter),FACS,RPC1_LAMBD_2023-08-07_b03.a2m,1,237,237,0.3,0.2,100755,0.886,210,28172.8,134.1561905,High,219,1.042857143,,L_GFP_mean_scaled,-1,mut_proteingym,RPC1_LAMBD_theta0.2_2023-08-07_b03.npy,RPC1_LAMBD.pdb,1-237,1,18,Activity
+RS15_GEOSE_Tsuboyama_2023_1A32,RS15_GEOSE_Tsuboyama_2023_1A32.csv,RS15_GEOSE,Prokaryote,Geobacillus stearothermophilus,SPEVQIAILTEQINNLNEHLRVHKKDHHSRRGLLKMVGKRRRLLAYLRNKDVARYREIVEKLG,63,FALSE,1195,1195,0,-0.1292928041,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-63,Small ribosomal subunit protein uS15,Stability,cDNA display proteolysis,RS15_GEOSE_2023-08-07_b06.a2m,1,63,63,0.6,0.2,44428,1,63,4519.5,71.73809524,Medium,35,0.5555555556,Tsuboyama2023_Dataset2_Dataset45,ddG_ML_float,1,mut_type,RS15_GEOSE_theta0.2_2023-08-07_b06.npy,RS15_GEOSE.pdb,1-63,1,,Stability
+S22A1_HUMAN_Yee_2023_abundance,S22A1_HUMAN_Yee_2023_abundance.csv,S22A1_HUMAN,Human,Homo sapiens,PTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFIALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT,553,FALSE,9803,9803,0,-1,manual,Yee,The full spectrum of OCT1 (SLC22A1) mutations bridges transporter biophysics to drug pharmacogenomics,2023,10.1101/2023.06.06.543963,1-549,Oct1,abundance,FACS,S22A1_HUMAN_2023-08-07_b02.a2m,1,553,553,0.2,0.2,198790,0.807,446,32557.5,72.99887892,Medium,485,1.087443946,543963_file04.xlsx,GFP_score,1,mutant,S22A1_HUMAN_theta0.2_2023-08-07_b02.npy,S22A1_HUMAN.pdb,1-553,1,,Expression
+S22A1_HUMAN_Yee_2023_activity,S22A1_HUMAN_Yee_2023_activity.csv,S22A1_HUMAN,Human,Homo sapiens,PTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFIALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT,553,FALSE,10094,10094,0,1,manual,Yee,The full spectrum of OCT1 (SLC22A1) mutations bridges transporter biophysics to drug pharmacogenomics,2023,10.1101/2023.06.06.543963,1-549,Oct1,uptake of cytotoxic substrate,Growth,S22A1_HUMAN_2023-08-07_b02.a2m,1,553,553,0.2,0.2,198790,0.807,446,32557.5,72.99887892,Medium,485,1.087443946,543963_file04.xlsx,SM73_1_score,-1,mutant,S22A1_HUMAN_theta0.2_2023-08-07_b02.npy,S22A1_HUMAN.pdb,1-553,1,,Activity
+SAV1_MOUSE_Tsuboyama_2023_2YSB,SAV1_MOUSE_Tsuboyama_2023_2YSB.csv,SAV1_MOUSE,Eukaryote,Mus musculus,GEDLPLPPGWSVDWTMRGRKYYIDHNTNTTHWSHPLESGPSSG,43,TRUE,965,679,286,-0.6280556038,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-43,Protein salvador homolog 1,Stability,cDNA display proteolysis,SAV1_MOUSE_2023-08-07_b06.a2m,1,43,43,0.6,0.2,177542,0.791,34,4627.6,136.1058824,High,14,0.4117647059,Tsuboyama2023_Dataset2_Dataset46,ddG_ML_float,1,mut_type,SAV1_MOUSE_theta0.2_2023-08-07_b06.npy,SAV1_MOUSE.pdb,1-43,1,,Stability
+SBI_STAAM_Tsuboyama_2023_2JVG,SBI_STAAM_Tsuboyama_2023_2JVG.csv,SBI_STAAM,Prokaryote,Staphylococcus aureus,VRHDERVKSANDAISKLNEKDSIENRRLAQREVNKAPMDVKEHLQKQLDALVAQKD,56,FALSE,1025,1025,0,-0.5166138978,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-56,Immunoglobulin-binding protein Sbi,Stability,cDNA display proteolysis,SBI_STAAM_2023-08-07_b04.a2m,1,56,56,0.4,0.2,14476,0.875,49,1539.4,31.41632653,Medium,21,0.4285714286,Tsuboyama2023_Dataset2_Dataset47,ddG_ML_float,1,mut_type,SBI_STAAM_theta0.2_2023-08-07_b04.npy,SBI_STAAM.pdb,1-56,1,,Stability
+SC6A4_HUMAN_Young_2021,SC6A4_HUMAN_Young_2021.csv,SC6A4_HUMAN,Human,Homo sapiens,METTPLNSQKQLSACEDGEDCQENGVLQKVVPTPGDKVESGQISNGYSAVPSPGAGDDTRHSIPATTTTLVAELHQGERETWGKKVDFLLSVIGYAVDLGNVWRFPYICYQNGGGAFLLPYTIMAIFGGIPLFYMELALGQYHRNGCISIWRKICPIFKGIGYAICIIAFYIASYYNTIMAWALYYLISSFTDQLPWTSCKNSWNTGNCTNYFSEDNITWTLHSTSPAEEFYTRHVLQIHRSKGLQDLGGISWQLALCIMLIFTVIYFSIWKGVKTSGKVVWVTATFPYIILSVLLVRGATLPGAWRGVLFYLKPNWQKLLETGVWIDAAAQIFFSLGPGFGVLLAFASYNKFNNNCYQDALVTSVVNCMTSFVSGFVIFTVLGYMAEMRNEDVSEVAKDAGPSLLFITYAEAIANMPASTFFAIIFFLMLITLGLDSTFAGLEGVITAVLDEFPHVWAKRRERFVLAVVITCFFGSLVTLTFGGAYVVKLLEEYATGPAVLTVALIEAVAVSWFYGITQFCRDVKEMLGFSPGWFWRICWVAISPLFLLFIICSFLMSPPQLRLFQYNYPYWSIILGYCIGTSSFICIPTYIAYRLIITPGTFKERIIKSITPETPTEIPCGDIRLNAV,630,FALSE,11576,11576,0,-0.1560688323,median,Young,Deep Mutagenesis of a Transporter for Uptake of a Non-Native Substrate Identifies Conformationally Dynamic Regions,2021,10.1101/2021.04.19.440442,2-630,Sodium-dependent serotonin transporter,Fluorescence,Fluorescence,SC6A4_HUMAN_full_11-26-2021_b02.a2m,1,630,630,0.2,0.2,40971,0.805,507,5278.9,10.41203156,medium,278,0.5483234714,SC6A4_HUMAN_Young_2021.csv,avg_MYC,1,mutant,SC6A4_HUMAN_theta_0.2.npy,SC6A4_HUMAN.pdb,1-630,0.1,,Activity
+SCIN_STAAR_Tsuboyama_2023_2QFF,SCIN_STAAR_Tsuboyama_2023_2QFF.csv,SCIN_STAAR,Prokaryote,Staphylococcus aureus,QNEKLANELKSLLDELNVNELATGSLNTYYKRTIKISGQKAMYALKSKDFKKMSEAKYQLQKIYNEIDEA,70,FALSE,1212,1212,0,-0.4037152866,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-70,Staphylococcal complement inhibitor,Stability,cDNA display proteolysis,SCIN_STAAR_2023-08-07_b02.a2m,1,70,70,0.2,0.2,38043,0.9,63,11146.3,176.9253968,High,4,0.06349206349,Tsuboyama2023_Dataset2_Dataset48,ddG_ML_float,1,mut_type,SCIN_STAAR_theta0.2_2023-08-07_b02.npy,SCIN_STAAR.pdb,1-70,1,,Stability
+SCN5A_HUMAN_Glazer_2019,SCN5A_HUMAN_Glazer_2019.csv,SCN5A_HUMAN,Human,Homo sapiens,MANFLLPRGTSSFRRFTRESLAAIEKRMAEKQARGSTTLQESREGLPEEEAPRPQLDLQASKKLPDLYGNPPQELIGEPLEDLDPFYSTQKTFIVLNKGKTIFRFSATNALYVLSPFHPIRRAAVKILVHSLFNMLIMCTILTNCVFMAQHDPPPWTKYVEYTFTAIYTFESLVKILARGFCLHAFTFLRDPWNWLDFSVIIMAYTTEFVDLGNVSALRTFRVLRALKTISVISGLKTIVGALIQSVKKLADVMVLTVFCLSVFALIGLQLFMGNLRHKCVRNFTALNGTNGSVEADGLVWESLDLYLSDPENYLLKNGTSDVLLCGNSSDAGTCPEGYRCLKAGENPDHGYTSFDSFAWAFLALFRLMTQDCWERLYQQTLRSAGKIYMIFFMLVIFLGSFYLVNLILAVVAMAYEEQNQATIAETEEKEKRFQEAMEMLKKEHEALTIRGVDTVSRSSLEMSPLAPVNSHERRSKRRKRMSSGTEECGEDRLPKSDSEDGPRAMNHLSLTRGLSRTSMKPRSSRGSIFTFRRRDLGSEADFADDENSTAGESESHHTSLLVPWPLRRTSAQGQPSPGTSAPGHALHGKKNSTVDCNGVVSLLGAGDPEATSPGSHLLRPVMLEHPPDTTTPSEEPGGPQMLTSQAPCVDGFEEPGARQRALSAVSVLTSALEELEESRHKCPPCWNRLAQRYLIWECCPLWMSIKQGVKLVVMDPFTDLTITMCIVLNTLFMALEHYNMTSEFEEMLQVGNLVFTGIFTAEMTFKIIALDPYYYFQQGWNIFDSIIVILSLMELGLSRMSNLSVLRSFRLLRVFKLAKSWPTLNTLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKNYSELRDSDSGLLPRWHMMDFFHAFLIIFRILCGEWIETMWDCMEVSGQSLCLLVFLLVMVIGNLVVLNLFLALLLSSFSADNLTAPDEDREMNNLQLALARIQRGLRFVKRTTWDFCCGLLRQRPQKPAALAAQGQLPSCIATPYSPPPPETEKVPPTRKETRFEEGEQPGQGTPGDPEPVCVPIAVAESDTDDQEEDEENSLGTEEESSKQQESQPVSGGPEAPPDSRTWSQVSATASSEAEASASQADWRQQWKAEPQAPGCGETPEDSCSEGSTADMTNTAELLEQIPDLGQDVKDPEDCFTEGCVRRCPCCAVDTTQAPGKVWWRLRKTCYHIVEHSWFETFIIFMILLSSGALAFEDIYLEERKTIKVLLEYADKMFTYVFVLEMLLKWVAYGFKKYFTNAWCWLDFLIVDVSLVSLVANTLGFAEMGPIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFGRCINQTEGDLPLNYTIVNNKSQCESLNLTGELYWTKVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRGYEEQPQWEYNLYMYIYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIFDIVTKQAFDVTIMFLICLNMVTMMVETDDQSPEKINILAKINLLFVAIFTGECIVKLAALRHYYFTNSWNIFDFVVVILSIVGTVLSDIIQKYFFSPTLFRVIRLARIGRILRLIRGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMANFAYVKWEAGIDDMFNFQTFANSMLCLFQITTSAGWDGLLSPILNTGPPYCDPTLPNSNGSRGDCGSPAVGILFFTTYIIISFLIVVNMYIAIILENFSVATEESTEPLSEDDFDMFYEIWEKFDPEATQFIEYSVLSDFADALSEPLRIAKPNQISLINMDLPMVSGDRIHCMDILFAFTKRVLGESGEMDALKIQMEEKFMAANPSKISYEPITTTLRRKHEEVSAMVIQRAFRRHLLQRSLKHASFLFRQQAGSGLSEEDAPEREGLIAYVMSENFSRPLGPPSSSSISSTSFPPSYDSVTRATSDNLQVRGSDYSHSEDLADFPPSPDRDRESIV,2016,FALSE,224,224,0,-88.35,median,Glazer,Deep Mutational Scan of an SCN5A Voltage Sensor,2019,10.1161/CIRCGEN.119.002786,1621-1632,SCN5A,"drug resistance (triple-drug assay: veratridine + brevetoxin + ouabain; surrogate for sodium channel dysfunction, select against function)",,SCN5A_HUMAN_1611-1642_11-26-2021_b03.a2m,1611,1642,32,0.3,0.2,49973,0.812,26,743.1,28.58076923,medium,2,0.07692307692,SCN5A_HUMAN_Glazer_2019.csv,dms,-1,mutation,SCN5A_HUMAN_theta_0.2.npy,SCN5A_HUMAN.pdb,1-2016,0.1,,OrganismalFitness
+SDA_BACSU_Tsuboyama_2023_1PV0,SDA_BACSU_Tsuboyama_2023_1PV0.csv,SDA_BACSU,Prokaryote,Bacillus subtilis,MRKLSDELLIESYFKATEMNLNRDFIELIENEIKRRSLGHIISV,44,TRUE,2770,834,1936,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-44,Sporulation inhibitor sda,Stability,cDNA display proteolysis,SDA_BACSU_2023-08-07_b05.a2m,1,44,44,0.5,0.2,1953,0.886,39,876.8,22.48205128,Medium,4,0.1025641026,Tsuboyama2023_Dataset2_Dataset49,ddG_ML_float,1,mut_type,SDA_BACSU_theta0.2_2023-08-07_b05.npy,SDA_BACSU.pdb,1-44,1,,Stability
+SERC_HUMAN_Xie_2023,SERC_HUMAN_Xie_2023.csv,SERC_HUMAN,Human,Homo sapiens,MDAPRQVVNFGPGPAKLPHSVLLEIQKELLDYKGVGISVLEMSHRSSDFAKIINNTENLVRELLAVPDNYKVIFLQGGGCGQFSAVPLNLIGLKAGRCADYVVTGAWSAKAAEEAKKFGTINIVHPKLGSYTKIPDPSTWNLNPDASYVYYCANETVHGVEFDFIPDVKGAVLVCDMSSNFLSKPVDVSKFGVIFAGAQKNVGSAGVTVVIVRDDLLGFALRECPSVLEYKVQAGNSSLYNTPPCFSIYVMGLVLEWIKNNGGAAAMEKLSSIKSQTIYEIIDNSQGFYVCPVEPQNRSKMNIPFRIGNAKGDDALEKRFLDKALELNMLSLKGHRSVGGIRASLYNAVTIEDVQKLAAFMKKFLEMHQL,370,FALSE,1914,1914,0,0.9360658319,median,Xie,Predicting the functional effect of compound heterozygous genotypes from large scale variant effect maps,2023,10.1101/2023.01.11.523651,2-370,PSAT1,Yeast growth,,SERC_HUMAN_2023-08-07_b02.a2m,1,370,370,0.2,0.2,232438,0.949,351,42521.5,121.1438746,High,899,2.561253561,urn_mavedb_00000107-b-1_scores-2.csv,score,1,mutant,SERC_HUMAN_theta0.2_2023-08-07_b02.npy,SERC_HUMAN.pdb,1-370,1,,OrganismalFitness
+SHOC2_HUMAN_Kwon_2022,SHOC2_HUMAN_Kwon_2022.csv,SHOC2_HUMAN,Human,Homo sapiens,MSSSLGKEKDSKEKDPKVPSAKEREKEAKASGGFGKESKEKEPKTKGKDAKDGKKDSSAAQPGVAFSVDNTIKRPNPAPGTRKKSSNAEVIKELNKCREENSMRLDLSKRSIHILPSSIKELTQLTELYLYSNKLQSLPAEVGCLVNLMTLALSENSLTSLPDSLDNLKKLRMLDLRHNKLREIPSVVYRLDSLTTLYLRFNRITTVEKDIKNLSKLSMLSIRENKIKQLPAEIGELCNLITLDVAHNQLEHLPKEIGNCTQITNLDLQHNELLDLPDTIGNLSSLSRLGLRYNRLSAIPRSLAKCSALEELNLENNNISTLPESLLSSLVKLNSLTLARNCFQLYPVGGPSQFSTIYSLNMEHNRINKIPFGIFSRAKVLSKLNMKDNQLTSLPLDFGTWTSMVELNLATNQLTKIPEDVSGLVSLEVLILSNNLLKKLPHGLGNLRKLRELDLEENKLESLPNEIAYLKDLQKLVLTNNQLTTLPRGIGHLTNLTHLGLGENLLTHLPEEIGTLENLEELYLNDNPNLHSLPFELALCSKLSIMSIENCPLSHLPPQIVAGGPSFIIQFLKMQGPYRAMV,582,FALSE,10972,10972,0,-0.34,median,Kwon,Structure–function analysis of the SHOC2–MRAS–PP1C holophosphatase complex,2022,10.1038/s41586-022-04928-2,2-582,Leucine-rich repeat protein SHOC-2,Drug resistance,Survival (dosed with trametinib),SHOC2_HUMAN_2023-10-12_b04.a2m,1,582,582,0.4,0.2,22163,0.777,452,8806.8,19.4840708,Medium,379,0.8384955752,2022.3.16.Extended Data Table 4.csv,LFC_scaled,1,variant.by.aa,SHOC2_HUMAN_theta0.2_2023-10-12_b04.npy,SHOC2_HUMAN.pdb,1-582,1,,OrganismalFitness
+SOX30_HUMAN_Tsuboyama_2023_7JJK,SOX30_HUMAN_Tsuboyama_2023_7JJK.csv,SOX30_HUMAN,Human,Homo sapiens,RPMNAFMVWARIHRPALAKANPAANNAEISVQLGLEWNKLSEEQKKPYYDEAQKIKE,57,FALSE,1010,1010,0,-0.3216404755,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-57,Transcription factor SOX-30,Stability,cDNA display proteolysis,SOX30_HUMAN_2023-08-07_b03.a2m,1,57,57,0.3,0.2,158104,0.982,56,14909.6,266.2428571,High,36,0.6428571429,Tsuboyama2023_Dataset2_Dataset50,ddG_ML_float,1,mut_type,SOX30_HUMAN_theta0.2_2023-08-07_b03.npy,SOX30_HUMAN.pdb,1-57,1,,Stability
+SPA_STAAU_Tsuboyama_2023_1LP1,SPA_STAAU_Tsuboyama_2023_1LP1.csv,SPA_STAAU,Prokaryote,Staphylococcus aureus,KFNKELSVAGREIVTLPNLNDPQKKAFIFSLWDDPSQSANLLAEAKKLNDAQAPK,55,TRUE,2105,1035,1070,-0.9794586971,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-55,Immunoglobulin G-binding protein A,Stability,cDNA display proteolysis,SPA_STAAU_2023-08-07_b04.a2m,1,55,55,0.4,0.2,184804,0.927,51,2042.1,40.04117647,Medium,25,0.4901960784,Tsuboyama2023_Dataset2_Dataset51,ddG_ML_float,1,mut_type,SPA_STAAU_theta0.2_2023-08-07_b04.npy,SPA_STAAU.pdb,1-55,1,,Stability
+SPG1_STRSG_Olson_2014,SPG1_STRSG_Olson_2014.csv,SPG1_STRSG,Prokaryote,Streptococcus sp. group G,MEKEKKVKYFLRKSAFGLASVSAAFLVGSTVFAVDSPIEDTPIIRNGGELTNLLGNSETTLALRNEESATADLTAAAVADTVAAAAAENAGAAAWEAAAAADALAKAKADALKEFNKYGVSDYYKNLINNAKTVEGIKDLQAQVVESAKKARISEATDGLSDFLKSQTPAEDTVKSIELAEAKVLANRELDKYGVSDYHKNLINNAKTVEGVKELIDEILAALPKTDQYKLILNGKTLKGETTTEAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTEKPEVIDASELTPAVTTYKLVINGKTLKGETTTKAVDAETAEKAFKQYANDNGVDGVWTYDDATKTFTVTEMVTEVPGDAPTEPEKPEASIPLVPLTPATPIAKDDAKKDDTKKEDAKKPEAKKDDAKKAETLPTTGEGSNPFFTAAALAVMAGAGALAVASKRKED,448,TRUE,536962,1045,535917,-4,manual,Olson,A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain,2014,10.1016/j.cub.2014.09.072,228-282,GB1,Binding (IgG),Binding,SPG1_STRSG_full_11-26-2021_b07.a2m,1,448,448,0.7,0.2,44,0.913,409,3.3,0.008068459658,low,0,0,SPG1_STRSG_Olson_2014.csv,lnW,1,mutant,SPG1_STRSG_theta_0.2.npy,SPG1_STRSG.pdb,1-448,0.1,,Binding
+SPG1_STRSG_Wu_2016,SPG1_STRSG_Wu_2016.csv,SPG1_STRSG,Prokaryote,Streptococcus sp. group G,MEKEKKVKYFLRKSAFGLASVSAAFLVGSTVFAVDSPIEDTPIIRNGGELTNLLGNSETTLALRNEESATADLTAAAVADTVAAAAAENAGAAAWEAAAAADALAKAKADALKEFNKYGVSDYYKNLINNAKTVEGIKDLQAQVVESAKKARISEATDGLSDFLKSQTPAEDTVKSIELAEAKVLANRELDKYGVSDYHKNLINNAKTVEGVKELIDEILAALPKTDQYKLILNGKTLKGETTTEAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTEKPEVIDASELTPAVTTYKLVINGKTLKGETTTKAVDAETAEKAFKQYANDNGVDGVWTYDDATKTFTVTEMVTEVPGDAPTEPEKPEASIPLVPLTPATPIAKDDAKKDDTKKEDAKKPEAKKDDAKKAETLPTTGEGSNPFFTAAALAVMAGAGALAVASKRKED,448,TRUE,149360,76,149284,0.1224388752,median,Wu,Adaptation in protein fitness landscapes is facilitated by indirect paths,2016,10.7554/eLife.16965,265-280,GB1,Binding (IgG),binding,SPG1_STRSG_full_b0.1.a2m,1,448,448,0.1,0.2,3109,1,448,600.4,1.340178571,Medium,97,0.2165178571,SPG1_STRSG_Wu_2016.csv,Fitness,1,Variants,SPG1_STRSG_b01_theta_0.2.npy,SPG1_STRSG.pdb,1-448,1,,Binding
+SPG2_STRSG_Tsuboyama_2023_5UBS,SPG2_STRSG_Tsuboyama_2023_5UBS.csv,SPG2_STRSG,Prokaryote,Streptococcus sp. group G,MTFKLIINGKTLKGETTTEAVDAATAEKVFKQYFNDNGIDGEWTYDDATKTFTITE,56,TRUE,1451,1029,422,-1.000627629,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-56,Immunoglobulin G-binding protein G,Stability,cDNA display proteolysis,SPG2_STRSG_2023-08-07_b03.a2m,1,56,56,0.3,0.2,39899,0.75,42,2567.6,61.13333333,Medium,6,0.1428571429,Tsuboyama2023_Dataset2_Dataset52,ddG_ML_float,1,mut_type,SPG2_STRSG_theta0.2_2023-08-07_b03.npy,SPG2_STRSG.pdb,1-56,1,,Stability
+SPIKE_SARS2_Starr_2020_binding,SPIKE_SARS2_Starr_2020_binding.csv,SPIKE_SARS2,Virus,Severe acute respiratory syndrome coronavirus 2 (2019-nCoV) (SARS-CoV-2),MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT,1273,FALSE,3802,3802,0,-0.5,manual,Starr,Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding,2020,10.1016/j.cell.2020.08.012,331-531,Spike RBD,ACE2 binding,Binding,SPIKE_SARS2_theta0.99_full_11-26-2021_b01.a2m,1,1273,1273,0.1,0.01,36931,0.998,1271,1405.2,1.105586153,medium,2059,1.619984264,SPIKE_SARS2_Starr_2020.csv,bind_avg,1,mutation,SPIKE_SARS2_theta_0.01.npy,SPIKE_SARS2.pdb,1-1273,0.1,,Binding
+SPIKE_SARS2_Starr_2020_expression,SPIKE_SARS2_Starr_2020_expression.csv,SPIKE_SARS2,Virus,Severe acute respiratory syndrome coronavirus 2 (2019-nCoV) (SARS-CoV-2),MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT,1273,FALSE,3798,3798,0,-1,manual,Starr,Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding,2020,10.1016/j.cell.2020.08.012,331-531,Spike RBD,ACE2 binding,Binding,SPIKE_SARS2_theta0.99_full_11-26-2021_b01.a2m,1,1273,1273,0.1,0.01,36931,0.998,1271,1405.2,1.105586153,medium,2059,1.619984264,SPIKE_SARS2_Starr_2020.csv,expr_avg,1,mutation,SPIKE_SARS2_theta_0.01.npy,SPIKE_SARS2.pdb,1-1273,0.1,,Expression
+SPTN1_CHICK_Tsuboyama_2023_1TUD,SPTN1_CHICK_Tsuboyama_2023_1TUD.csv,SPTN1_CHICK,Eukaryote,Gallus gallus,RQGFVPAAYVKKLDSGTGKELVLALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVN,60,TRUE,3201,1051,2150,-2.360476078,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-60,"Spectrin alpha chain, non-erythrocytic 1",Stability,cDNA display proteolysis,SPTN1_CHICK_2023-08-07_b03.a2m,1,60,60,0.3,0.2,420793,0.933,56,15051.5,268.7767857,High,47,0.8392857143,Tsuboyama2023_Dataset2_Dataset53,ddG_ML_float,1,mut_type,SPTN1_CHICK_theta0.2_2023-08-07_b03.npy,SPTN1_CHICK.pdb,1-60,1,,Stability
+SQSTM_MOUSE_Tsuboyama_2023_2RRU,SQSTM_MOUSE_Tsuboyama_2023_2RRU.csv,SQSTM_MOUSE,Eukaryote,Mus musculus,RLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQ,40,FALSE,707,707,0,-0.8554856463,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-40,Sequestosome-1,Stability,cDNA display proteolysis,SQSTM_MOUSE_2023-08-07_b05.a2m,1,40,40,0.5,0.2,34660,0.925,37,3244.5,87.68918919,Medium,13,0.3513513514,Tsuboyama2023_Dataset2_Dataset54,ddG_ML_float,1,mut_type,SQSTM_MOUSE_theta0.2_2023-08-07_b05.npy,SQSTM_MOUSE.pdb,1-40,1,,Stability
+SR43C_ARATH_Tsuboyama_2023_2N88,SR43C_ARATH_Tsuboyama_2023_2N88.csv,SR43C_ARATH,Eukaryote,Arabidopsis thaliana,AVAESVIGKRVGDDGKTIEYLVKWTDMSDATWEPQDNVDSTLVLLYQQ,48,TRUE,1583,889,694,-1.591761235,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-48,"Signal recognition particle 43 kDa protein, chloroplastic",Stability,cDNA display proteolysis,SR43C_ARATH_2023-08-07_b02.a2m,1,48,48,0.2,0.2,101118,0.917,44,12180.6,276.8318182,High,26,0.5909090909,Tsuboyama2023_Dataset2_Dataset55,ddG_ML_float,1,mut_type,SR43C_ARATH_theta0.2_2023-08-07_b02.npy,SR43C_ARATH.pdb,1-48,1,,Stability
+SRBS1_HUMAN_Tsuboyama_2023_2O2W,SRBS1_HUMAN_Tsuboyama_2023_2O2W.csv,SRBS1_HUMAN,Human,Homo sapiens,GIDPFTGEAIAKFNFNGDTQVEMSFRKGERITLLRQVDENWYEGRIPGTSRQGIFPITYVDVIKRPL,67,TRUE,1556,1211,345,-1.169019411,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-67,Sorbin and SH3 domain-containing protein 1,Stability,cDNA display proteolysis,SRBS1_HUMAN_2023-08-07_b03.a2m,1,67,67,0.3,0.2,708655,0.836,56,22689,405.1607143,High,60,1.071428571,Tsuboyama2023_Dataset2_Dataset56,ddG_ML_float,1,mut_type,SRBS1_HUMAN_theta0.2_2023-08-07_b03.npy,SRBS1_HUMAN.pdb,1-67,1,,Stability
+SRC_HUMAN_Ahler_2019,SRC_HUMAN_Ahler_2019.csv,SRC_HUMAN,Human,Homo sapiens,MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQTPSKPASADGHRGPSAAFAPAAAEPKLFGGFNSSDTVTSPQRAGPLAGGVTTFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLSTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESERLLLNAENPRGTFLVRESETTKGAYCLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFNSLQQLVAYYSKHADGLCHRLTTVCPTSKPQTQGLAKDAWEIPRESLRLEVKLGQGCFGEVWMGTWNGTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGETGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANILVGENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILLTELTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFTSTEPQYQPGENL,536,FALSE,3372,3372,0,-1,manual,Ahler,"A Combined Approach Reveals a Regulatory Mechanism Coupling Src's Kinase Activity, Localization, and Phosphotransferase-Independent Functions",2019,10.1016/j.molcel.2019.02.003,270-519,SRC,growth (surrogate for phosphorylation activity),Growth,SRC_HUMAN_full_11-26-2021_b06.a2m,1,536,536,0.6,0.2,26974,0.808,433,1405.1,3.245034642,medium,86,0.1986143187,SRC_HUMAN_Ahler_CD_2019.csv,Activity_Score,1,mutant_uniprot_1,SRC_HUMAN_theta_0.2.npy,SRC_HUMAN.pdb,1-536,0.1,,Activity
+SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM,SRC_HUMAN_Chakraborty_2023_binding-DAS_25uM.csv,SRC_HUMAN,Human,Homo sapiens,MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQTPSKPASADGHRGPSAAFAPAAAEPKLFGGFNSSDTVTSPQRAGPLAGGVTTFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLSTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESERLLLNAENPRGTFLVRESETTKGAYCLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFNSLQQLVAYYSKHADGLCHRLTTVCPTSKPQTQGLAKDAWEIPRESLRLEVKLGQGCFGEVWMGTWNGTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGETGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANILVGENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILLTELTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFTSTEPQYQPGENL,536,FALSE,3637,3637,0,-0.086077486,median,Chakraborty,Profiling of the drug resistance of thousands of Src tyrosine kinase mutants uncovers a regulatory network that couples autoinhibition to catalytic domain dynamics,2022,10.1101/2021.12.05.471322,270-519,SRC,Fluorescence measurement,,SRC_HUMAN_2023-08-07_b06.a2m,1,536,536,0.6,0.2,37675,0.869,466,1789,3.839055794,Medium,117,0.2510729614,GSE190495_Src_DAS_25_Score.csv,DMS_score,1,mutant,SRC_HUMAN_theta0.2_2023-08-07_b06.npy,SRC_HUMAN.pdb,1-536,1,,Activity
+SRC_HUMAN_Nguyen_2022,SRC_HUMAN_Nguyen_2022.csv,SRC_HUMAN,Human,Homo sapiens,MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQTPSKPASADGHRGPSAAFAPAAAEPKLFGGFNSSDTVTSPQRAGPLAGGVTTFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLSTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESERLLLNAENPRGTFLVRESETTKGAYCLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFNSLQQLVAYYSKHADGLCHRLTTVCPTSKPQTQGLAKDAWEIPRESLRLEVKLGQGCFGEVWMGTWNGTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGETGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANILVGENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILLTELTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFTSTEPQYQPGENL,536,FALSE,3366,3366,0,0.535786927,median,Nguyen,Molecular Determinants of Hsp90 Dependence of Src Kinase Revealed by Deep Mutational Scanning,2022,10.1002/pro.4656,270-519,SRC,growth enrichment,,SRC_HUMAN_2023-08-07_b06.a2m,1,536,536,0.6,0.2,37675,0.869,466,1789,3.839055794,Medium,117,0.2510729614,diffsel_calib.csv,diffsel,-1,mutant,SRC_HUMAN_theta0.2_2023-08-07_b06.npy,SRC_HUMAN.pdb,1-536,1,,OrganismalFitness
+SUMO1_HUMAN_Weile_2017,SUMO1_HUMAN_Weile_2017.csv,SUMO1_HUMAN,Human,Homo sapiens,MSDQEAKPSTEDLGDKKEGEYIKLKVIGQDSSEIHFKVKMTTHLKKLKESYCQRQGVPMNSLRFLFEGQRIADNHTPKELGMEEEDVIEVYQEQTGGHSTV,101,FALSE,1700,1700,0,0.3,manual,Weile,A framework for exhaustively mapping functional missense variants,2017,10.15252/msb.20177908,2-97,Small ubiquitin-related modifier 1,Yeast growth,complementation,SUMO1_HUMAN_full_11-26-2021_b02.a2m,1,101,101,0.2,0.2,85570,0.703,71,13120.2,184.7915493,high,67,0.9436619718,SUMO1_HUMAN_Weile_2017.csv,screenscore,1,mutant,SUMO1_HUMAN_theta_0.2.npy,SUMO1_HUMAN.pdb,1-101,0.1,,OrganismalFitness
+SYUA_HUMAN_Newberry_2020,SYUA_HUMAN_Newberry_2020.csv,SYUA_HUMAN,Human,Homo sapiens,MDVYMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLFVGSKTKEGVVHGVATVAEKTKEQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGYVKKDQLGKNEEGAPQEGILEDMPVDPDNEAFEMPSEEGFQDFEPEA,140,FALSE,2497,2497,0,-0.1,manual,Newberry,Robust Sequence Determinants of α-Synuclein Toxicity in Yeast Implicate Membrane Binding,2020,10.1021/acschembio.0c00339,1-140,alpha-synuclein,Growth,Growth,SYUA_HUMAN_full_04-29-2022_b01.a2m,1,140,140,0.1,0.2,15711,0.707,99,6509.6,65.75353535,medium,62,0.6262626263,SYUA_HUMAN_Newberry_2020.csv,Fitness Score,-1,mutant,SYUA_HUMAN_theta_0.2.npy,SYUA_HUMAN.pdb,1-140,0.1,,OrganismalFitness
+TADBP_HUMAN_Bolognesi_2019,TADBP_HUMAN_Bolognesi_2019.csv,TADBP_HUMAN,Human,Homo sapiens,MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRNPVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMSQRHMIDGRWCDCKLPNSKQSQDEPLRSRKVFVGRCTEDMTEDELREFFSQYGDVMDVFIPKPFRAFAFVTFADDQIAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAGSGSGFNGGFGSSMDSKSSGWGM,414,FALSE,1196,1196,0,0.003661517102,median,Bolognesi,The mutational landscape of a prion-like domain,2019,10.1038/s41467-019-12101-z,290-373,TARDBP,growth (surrogate for toxicity),Growth,TADBP_HUMAN_full_11-26-2021_b09.a2m,1,414,414,0.9,0.2,1211,0.911,377,147.3,0.3907161804,low,8,0.02122015915,TADBP_HUMAN_Bolognesi_2019.csv,toxicity,1,mutant_uniprot_1,TADBP_HUMAN_theta_0.2.npy,TADBP_HUMAN.pdb,1-414,0.1,,OrganismalFitness
+TAT_HV1BR_Fernandes_2016,TAT_HV1BR_Fernandes_2016.csv,TAT_HV1BR,Virus,Human immunodeficiency virus type 1 group M subtype B (isolate BRU/LAI) (HIV-1),MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFMTKALGISYGRKKRRQRRRAHQNSQTHQASLSKQPTSQSRGDPTGPKE,86,FALSE,1577,1577,0,-0.2,manual,Fernandes,Functional Segregation of Overlapping Genes in HIV,2016,10.1016/j.cell.2016.11.031,1-86,HIV tat,Viral replication,Growth,TAT_HV1BR_full_theta0.99_04-29-2022_b09.a2m,1,86,86,0.9,0.01,12155,0.988,85,9925,116.7647059,high,49,0.5764705882,TAT_HV1BR_Fernandes_2016.csv,sel_coeff_mean,1,mutant,TAT_HV1BR_theta_0.01.npy,TAT_HV1BR.pdb,1-86,0.1,,OrganismalFitness
+TCRG1_MOUSE_Tsuboyama_2023_1E0L,TCRG1_MOUSE_Tsuboyama_2023_1E0L.csv,TCRG1_MOUSE,Eukaryote,Mus musculus,GATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQELK,37,TRUE,1058,621,437,-1.2,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-37,Transcription elongation regulator 1,Stability,cDNA display proteolysis,TCRG1_MOUSE_2023-08-07_b08.a2m,1,37,37,0.8,0.2,43363,0.865,32,2819.7,88.115625,Medium,14,0.4375,Tsuboyama2023_Dataset2_Dataset57,ddG_ML_float,1,mut_type,TCRG1_MOUSE_theta0.2_2023-08-07_b08.npy,TCRG1_MOUSE.pdb,1-37,1,,Stability
+THO1_YEAST_Tsuboyama_2023_2WQG,THO1_YEAST_Tsuboyama_2023_2WQG.csv,THO1_YEAST,Eukaryote,Saccharomyces cerevisiae,SADYSSLTVVQLKDLLTKRNLSVGGLKNEWVQRLIKDDEES,41,TRUE,1279,656,623,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-41,Protein THO1,Stability,cDNA display proteolysis,THO1_YEAST_2023-08-07_b05.a2m,1,41,41,0.5,0.2,54877,0.805,33,8516.7,258.0818182,High,15,0.4545454545,Tsuboyama2023_Dataset2_Dataset58,ddG_ML_float,1,mut_type,THO1_YEAST_theta0.2_2023-08-07_b05.npy,THO1_YEAST.pdb,1-41,1,,Stability
+TNKS2_HUMAN_Tsuboyama_2023_5JRT,TNKS2_HUMAN_Tsuboyama_2023_5JRT.csv,TNKS2_HUMAN,Human,Homo sapiens,FSITQFVRNLGLEHLMDIFEREQITLRVLVEMGHKELKEIGINAYGHREKLIKGVERLI,59,TRUE,1479,1118,361,-0.9451205822,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-59,Poly [ADP-ribose] polymerase tankyrase-2,Stability,cDNA display proteolysis,TNKS2_HUMAN_2023-08-07_b03.a2m,1,59,59,0.3,0.2,270654,0.949,56,11206,200.1071429,High,26,0.4642857143,Tsuboyama2023_Dataset2_Dataset59,ddG_ML_float,1,mut_type,TNKS2_HUMAN_theta0.2_2023-08-07_b03.npy,TNKS2_HUMAN.pdb,1-59,1,,Stability
+TPK1_HUMAN_Weile_2017,TPK1_HUMAN_Weile_2017.csv,TPK1_HUMAN,Human,Homo sapiens,MEHAFTPLEPLLSTGNLKYCLVILNQPLDNYFRHLWNKALLRACADGGANRLYDITEGERESFLPEFINGDFDSIRPEVREYYATKGCELISTPDQDHTDFTKCLKMLQKKIEEKDLKVDVIVTLGGLAGRFDQIMASVNTLFQATHITPFPIIIIQEESLIYLLQPGKHRLHVDTGMEGDWCGLIPVGQPCMQVTTTGLKWNLTNDVLAFGTLVSTSNTYDGSGVVTVETDHPLLWTMAIKS,243,FALSE,3181,3181,0,0.5,manual,Weile,A framework for exhaustively mapping functional missense variants,2017,10.15252/msb.20177908,2-243,Thiamin pyrophosphokinase 1,Yeast growth,complementation,TPK1_HUMAN_full_11-26-2021_b02.a2m,1,243,243,0.2,0.2,21515,0.823,200,7122.6,35.613,medium,234,1.17,TPK1_HUMAN_Weile_2017.csv,screenscore,1,mutant,TPK1_HUMAN_theta_0.2.npy,TPK1_HUMAN.pdb,1-243,0.1,,OrganismalFitness
+TPMT_HUMAN_Matreyek_2018,TPMT_HUMAN_Matreyek_2018.csv,TPMT_HUMAN,Human,Homo sapiens,MDGTRTSLDIEEYSDTEVQKNQVLTLEEWQDKWVNGKTAFHQEQGHQLLKKHLDTFLKGKSGLRVFFPLCGKAVEMKWFADRGHSVVGVEISELGIQEFFTEQNLSYSEEPITEIPGTKVFKSSSGNISLYCCSIFDLPRTNIGKFDMIWDRGALVAINPGDRKCYADTMFSLLGKKFQYLLCVLSYDPTKHPGPPFYVPHAEIERLFGKICNIRCLEKVDAFEERHKSWGIDCLFEKLYLLTEK,245,FALSE,3648,3648,0,0.5,manual,Matreyek,Multiplex Assessment of Protein Variant Abundance by Massively Parallel Sequencing,2018,10.1038/s41588-018-0122-z,1-245,Thiopurine S-methyltransferase,Protein abundance (FACS sorting for abundance of GFP-fused target),Protein stability,TPMT_HUMAN_full_11-26-2021_b03.a2m,1,245,245,0.3,0.2,19526,0.731,179,6296.8,35.17765363,medium,109,0.6089385475,TPMT_HUMAN_Matreyek_2018.csv,score,1,mutant,TPMT_HUMAN_theta_0.2.npy,TPMT_HUMAN.pdb,1-245,0.1,,Expression
+TPOR_HUMAN_Bridgford_2020,TPOR_HUMAN_Bridgford_2020.csv,TPOR_HUMAN,Human,Homo sapiens,MPSWALFMVTSCLLLAPQNLAQVSSQDVSLLASDSEPLKCFSRTFEDLTCFWDEEEAAPSGTYQLLYAYPREKPRACPLSSQSMPHFGTRYVCQFPDQEEVRLFFPLHLWVKNVFLNQTRTQRVLFVDSVGLPAPPSIIKAMGGSQPGELQISWEEPAPEISDFLRYELRYGPRDPKNSTGPTVIQLIATETCCPALQRPHSASALDQSPCAQPTMPWQDGPKQTSPSREASALTAEGGSCLISGLQPGNSYWLQLRSEPDGISLGGSWGSWSLPVTVDLPGDAVALGLQCFTLDLKNVTCQWQQQDHASSQGFFYHSRARCCPRDRYPIWENCEEEEKTNPGLQTPQFSRCHFKSRNDSIIHILVEVTTAPGTVHSYLGSPFWIHQAVRLPTPNLHWREISSGHLELEWQHPSSWAAQETCYQLRYTGEGHQDWKVLEPPLGARGGTLELRPRSRYRLQLRARLNGPTYQGPWSSWSDPTRVETATETAWISLVTALHLVLGLNAVLGLLLLRWQFPAHYRRLRHALWPSLPDLHRVLGQYLRDTAALSPPKATVSDTCEEVEPSLLEILPKSSERTPLPLCSSQAQMDYRRLQPSCLGTMPLSVCPPMAESGSCCTTHIANHSYLPLSYWQQP,635,FALSE,562,562,0,-0.1,manual,Bridgford,Novel drivers and modifiers of MPL-dependent oncogenic transformation identified by deep mutational scanning,2020,10.1182/blood.2019002561,487-517,MPL,growth/survival (surrogate for TpoR/MPL enhanced constitutive activation),Growth,TPOR_HUMAN_full_11-26-2021_b01.a2m,1,635,635,0.1,0.2,937,0.825,524,128.4,0.2450381679,low,0,0,TPOR_HUMAN_Bridgford_S505N_2020.csv,score,1,mutant_uniprot_1,TPOR_HUMAN_theta_0.2.npy,TPOR_HUMAN.pdb,1-635,0.1,,OrganismalFitness
+TRPC_SACS2_Chan_2017,TRPC_SACS2_Chan_2017.csv,TRPC_SACS2,Prokaryote,Thermus thermophilus,MPRYLKGWLKDVVQLSLRRPSFRASRQRPIISLNERILEFNKRNITAIIAEYKRKSPSGLDVERDPIEYSKFMERYAVGLSILTEEKYFNGSYETLRKIASSVSIPILMKDFIVKESQIDDAYNLGADTVLLIVKILTERELESLLEYARSYGMEPLIEINDENDLDIALRIGARFIGINSRDLETLEINKENQRKLISMIPSNVVKVAESGISERNEIEELRKLGVNAFLIGSSLMRNPEKIKEFIL,248,FALSE,1519,1519,0,-0.5,manual,Chan,Correlation of fitness landscapes from three orthologous TIM barrels originates from sequence and structure constraints,2017,10.1038/ncomms14614,44-235,TIM Barrell (S. solfataricus),fitness,Growth,TRPC_SACS2_full_11-26-2021_b07.a2m,1,248,248,0.7,0.2,52935,0.944,234,10651.1,45.51752137,medium,364,1.555555556,TRPC_SACS2_Chan_2017.csv,fitness,1,mutant,TRPC_SACS2_theta_0.2.npy,TRPC_SACS2.pdb,1-248,0.1,,OrganismalFitness
+TRPC_THEMA_Chan_2017,TRPC_THEMA_Chan_2017.csv,TRPC_THEMA,Prokaryote,Thermus thermophilus,MRRLWEIVEAKKKDILEIDGENLIVQRRNHRFLEVLSGKERVKIIAEFKKASPSAGDINADASLEDFIRMYDELADAISILTEKHYFKGDPAFVRAARNLTSRPILAKDFYIDTVQVKLASSVGADAILIIARILTAEQIKEIYEAAEELGMDSLVEVHSREDLEKVFSVIRPKIIGINTRDLDTFEIKKNVLWELLPLVPDDTVVVAESGIKDPRELKDLRGKVNAVLVGTSIMKAENPRRFLEEMRAWSE,252,FALSE,1519,1519,0,-0.5,manual,Chan,Correlation of fitness landscapes from three orthologous TIM barrels originates from sequence and structure constraints,2017,10.1038/ncomms14614,40-233,TIM Barrell (T. maritima),fitness,Growth,TRPC_THEMA_full_11-26-2021_b07.a2m,1,252,252,0.7,0.2,52988,0.948,239,10582.5,44.27824268,medium,380,1.589958159,TRPC_THEMA_Chan_2017.csv,fitness,1,mutant,TRPC_THEMA_theta_0.2.npy,TRPC_THEMA.pdb,1-252,0.1,,OrganismalFitness
+UBC9_HUMAN_Weile_2017,UBC9_HUMAN_Weile_2017.csv,UBC9_HUMAN,Human,Homo sapiens,MSGIALSRLAQERKAWRKDHPFGFVAVPTKNPDGTMNLMNWECAIPGKKGTPWEGGLFKLRMLFKDDYPSSPPKCKFEPPLFHPNVYPSGTVCLSILEEDKDWRPAITIKQILLGIQELLNEPNIQDPAQAEAYTIYCQNRVEYEKRVRAQAKKFAPSY,159,FALSE,2563,2563,0,0.384407289,median,Weile,A framework for exhaustively mapping functional missense variants,2017,10.15252/msb.20177908,1-158,SUMO-conjugating enzyme UBC9,Yeast growth,complementation,UBC9_HUMAN_full_11-26-2021_b03.a2m,1,159,159,0.3,0.2,69788,0.849,135,8394,62.17777778,medium,89,0.6592592593,UBC9_HUMAN_Weile_2017.csv,screenscore,1,mutant,UBC9_HUMAN_theta_0.2.npy,UBC9_HUMAN.pdb,1-159,0.1,,OrganismalFitness
+UBE4B_HUMAN_Tsuboyama_2023_3L1X,UBE4B_HUMAN_Tsuboyama_2023_3L1X.csv,UBE4B_HUMAN,Human,Homo sapiens,DAPDEFRDPLMDTLMTDPVRLPSGTIMDRSIILRHLLNSPTDPFNRQTLTESMLEPVPELKEQIQAWMR,69,TRUE,3622,1118,2504,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-69,Ubiquitin conjugation factor E4 B,Stability,cDNA display proteolysis,UBE4B_HUMAN_2023-08-07_b04.a2m,1,69,69,0.4,0.2,310943,0.928,64,34185.4,534.146875,High,52,0.8125,Tsuboyama2023_Dataset2_Dataset60,ddG_ML_float,1,mut_type,UBE4B_HUMAN_theta0.2_2023-08-07_b04.npy,UBE4B_HUMAN.pdb,1-69,1,,Stability
+UBE4B_MOUSE_Starita_2013,UBE4B_MOUSE_Starita_2013.csv,UBE4B_MOUSE,Eukaryote,Mus musculus,MEELSADEIRRRRLARLAGGQTSQPTTPLTSPQRENPPGPPIAASAPGPSQSLGLNVHNMTPATSPIGAAGVAHRSQSSEGVSSLSSSPSNSLETQSQSLSRSQSMDIDGVSCEKSMSQVDVDSGIENMEVDENDRREKRSLSDKEPSSGPEVSEEQALQLVCKIFRVSWKDRDRDVIFLSSLSAQFKQNPKEVFSDFKDLIGQILMEVLMMSTQTRDENPFASLTATSQPIATAARSPDRNLMLNTGSSSGTSPMFCNMGSFSTSSLSSLGASGGASNWDSYSDHFTIETCKETDMLNYLIECFDRVGIEEKKAPKMCSQPAVSQLLSNIRSQCISHTALVLQGSLTQPRSLQQPSFLVPYMLCRNLPYGFIQELVRTTHQDEEVFKQIFIPILQGLALAAKECSLESDYFKYPLMALGELCETKFGKTHPMCNLVASLPLWLPKSLSPGSGRELQRLSYLGAFFSFSVFAEDDAKVVEKYFSGPAITLENTRVVSQSLQHYLELGRQELFKILHSILLNGETREAALSYMAALVNANMKKAQMQADDRLVSTDGFMLNLLWVLQQLSTKIKLETVDPTYIFHPRCRITLPNDETRINATMEDVNERLTELYGDQPPFSEPKFPTECFFLTLHAHHLSILPSCRRYIRRLRAIRELNRTVEDLKNNESQWKDSPLATRHREMLKRCKTQLKKLVRCKACADAGLLDESFLRRCLNFYGLLIQLMLRILDPAYPDVTLPLNSEVPKVFAALPEFYVEDVAEFLFFIVQYSPQVLYEPCTQDIVMFLVVMLCNQNYIRNPYLVAKLVEVMFMTNPSVQPRTQKFFEMIENHPLSTKLLVPSLMKFYTDVEHTGATSEFYDKFTIRYHISTIFKSLWQNIAHHGTFMEEFNSGKQFVRYINMLINDTTFLLDESLESLKRIHEVQEEMKNKEQWDQLPRDQQQARQSQLAQDERVSRSYLALATETVDMFHLLTKQVQKPFLRPELGPRLAAMLNFNLQQLCGPKCRDLKVENPEKYGFEPKKLLDQLTDIYLQLDCARFAKAIADDQRSYSKELFEEVISKMRKAGIKSTIAIEKFKLLAEKVEEIVAKNARAEIDYSDAPDEFRDPLMDTLMTDPVRLPSGTVMDRSIILRHLLNSPTDPFNRQMLTESMLEPVPELKEQIQAWMREKQSSDH,1173,FALSE,899,899,0,-1.8,manual,Starita,Activity-enhancing mutations in an E3 ubiquitin ligase identified by high-throughput mutagenesis,2013,10.1073/pnas.1303309110,1072-1173,Ube4b,Ligase activity (phage display),Auto-ubiquitination,UBE4B_MOUSE_full_11-26-2021_b05.a2m,1,1173,1173,0.5,0.2,4743,0.765,897,679.4,0.7574136009,low,49,0.05462653289,UBE4B_MOUSE_Starita_2013.csv,log2_ratio,1,mutant,UBE4B_MOUSE_theta_0.2.npy,UBE4B_MOUSE.pdb,1-1173,0.1,,Activity
+UBR5_HUMAN_Tsuboyama_2023_1I2T,UBR5_HUMAN_Tsuboyama_2023_1I2T.csv,UBR5_HUMAN,Human,Homo sapiens,HRQALGERLYPRVQAMQPAFASKITGMLLELSPAQLLLLLASEDSLRARVDEAMELII,58,TRUE,1453,1094,359,-0.4460165437,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-58,E3 ubiquitin-protein ligase UBR5,Stability,cDNA display proteolysis,UBR5_HUMAN_2023-08-07_b05.a2m,1,58,58,0.5,0.2,17888,0.966,56,1031.7,18.42321429,Medium,14,0.25,Tsuboyama2023_Dataset2_Dataset61,ddG_ML_float,1,mut_type,UBR5_HUMAN_theta0.2_2023-08-07_b05.npy,UBR5_HUMAN.pdb,1-58,1,,Stability
+VG08_BPP22_Tsuboyama_2023_2GP8,VG08_BPP22_Tsuboyama_2023_2GP8.csv,VG08_BPP22,Virus,Salmonella phage P22 (Bacteriophage P22),ITGDVSAANKDAIRKQMDAAASKGDVETYRKLKAKLKGIR,40,FALSE,723,723,0,-0.2013306011,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-40,Scaffolding protein,Stability,cDNA display proteolysis,VG08_BPP22_2023-08-07_b05.a2m,1,40,40,0.5,0.01,102464,0.875,35,12963.6,370.3885714,High,13,0.3714285714,Tsuboyama2023_Dataset2_Dataset62,ddG_ML_float,1,mut_type,VG08_BPP22_theta0.01_2023-08-07_b05.npy,VG08_BPP22.pdb,1-40,1,,Stability
+VILI_CHICK_Tsuboyama_2023_1YU5,VILI_CHICK_Tsuboyama_2023_1YU5.csv,VILI_CHICK,Eukaryote,Gallus gallus,KLETFPLDVLVNTAAEDLPRGVDPSRKENHLSDEDFKAVFGMTRSAFANLPLWKQQNLKKEKGLF,65,TRUE,2568,1202,1366,-0.7,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-65,Villin-1,Stability,cDNA display proteolysis,VILI_CHICK_2023-08-07_b01.a2m,1,65,65,0.1,0.2,254210,0.769,50,46507.8,930.156,High,19,0.38,Tsuboyama2023_Dataset2_Dataset63,ddG_ML_float,1,mut_type,VILI_CHICK_theta0.2_2023-08-07_b01.npy,VILI_CHICK.pdb,1-65,1,,Stability
+VKOR1_HUMAN_Chiasson_2020_abundance,VKOR1_HUMAN_Chiasson_2020_abundance.csv,VKOR1_HUMAN,Human,Homo sapiens,MGSTWGSPGWVRLALCLTGLVLSLYALHVKAARARDRDYRALCDVGTAISCSRVFSSRWGRGFGLVEHVLGQDSILNQSNSIFGCIFYTLQLLLGCLRTRWASVLMLLSSLVSLAGSVYLAWILFFVLYDFCIVCITTYAINVSLMWLSFRKVQEPQGKAKRH,163,FALSE,2695,2695,0,0.7480893367,median,Chiasson,"Multiplexed measurement of variant abundance and activity reveals VKOR topology, active site and human variant impact",2020,10.7554/eLife.58026,2-163,VKORC1,protein abundance (eGFP fusion reporter),Fluorescence,VKOR1_HUMAN_full_11-26-2021_b03.a2m,1,163,163,0.3,0.2,14510,0.779,127,4655,36.65354331,medium,97,0.7637795276,VKOR1_HUMAN_Chiasson_2020.csv,abundance_score,1,variant,VKOR1_HUMAN_theta_0.2.npy,VKOR1_HUMAN.pdb,1-163,0.1,,Expression
+VKOR1_HUMAN_Chiasson_2020_activity,VKOR1_HUMAN_Chiasson_2020_activity.csv,VKOR1_HUMAN,Human,Homo sapiens,MGSTWGSPGWVRLALCLTGLVLSLYALHVKAARARDRDYRALCDVGTAISCSRVFSSRWGRGFGLVEHVLGQDSILNQSNSIFGCIFYTLQLLLGCLRTRWASVLMLLSSLVSLAGSVYLAWILFFVLYDFCIVCITTYAINVSLMWLSFRKVQEPQGKAKRH,163,FALSE,697,697,0,0.7,manual,Chiasson,"Multiplexed measurement of variant abundance and activity reveals VKOR topology, active site and human variant impact",2020,10.7554/eLife.58026,3-163,VKORC1,carboxylation activity (carboxylation reporter on cell surface),enzymatic activity,VKOR1_HUMAN_full_11-26-2021_b03.a2m,1,163,163,0.3,0.2,14510,0.779,127,4655,36.65354331,medium,97,0.7637795276,VKOR1_HUMAN_Chiasson_2020.csv,activity_score,1,variant,VKOR1_HUMAN_theta_0.2.npy,VKOR1_HUMAN.pdb,1-163,0.1,,Activity
+VRPI_BPT7_Tsuboyama_2023_2WNM,VRPI_BPT7_Tsuboyama_2023_2WNM.csv,VRPI_BPT7,Virus,Escherichia phage T7 (Bacteriophage T7),SLSVDNKKFWATVESSEHSFEVPIYAETLDEALELAEWQYVPAGFEVTRVRPCVAP,56,FALSE,1047,1047,0,-1.1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-56,Bacterial RNA polymerase inhibitor,Stability,cDNA display proteolysis,VRPI_BPT7_2023-08-07_b02.a2m,1,56,56,0.2,0.01,6266,0.875,49,1555.8,31.75102041,Medium,3,0.0612244898,Tsuboyama2023_Dataset2_Dataset64,ddG_ML_float,1,mut_type,VRPI_BPT7_theta0.01_2023-08-07_b02.npy,VRPI_BPT7.pdb,1-56,1,,Stability
+YAIA_ECOLI_Tsuboyama_2023_2KVT,YAIA_ECOLI_Tsuboyama_2023_2KVT.csv,YAIA_ECOLI,Prokaryote,Escherichia coli,PREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYED,52,TRUE,1890,928,962,-1.953132017,median,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-52,Uncharacterized protein YaiA,Stability,cDNA display proteolysis,YAIA_ECOLI_2023-08-07_b03.a2m,1,52,52,0.3,0.2,5877,0.788,41,737.2,17.9804878,Medium,5,0.1219512195,Tsuboyama2023_Dataset2_Dataset65,ddG_ML_float,1,mut_type,YAIA_ECOLI_theta0.2_2023-08-07_b03.npy,YAIA_ECOLI.pdb,1-52,1,,Stability
+YAP1_HUMAN_Araya_2012,YAP1_HUMAN_Araya_2012.csv,YAP1_HUMAN,Human,Homo sapiens,MDPGQQPPPQPAPQGQGQPPSQPPQGQGPPSGPGQPAPAATQAAPQAPPAGHQIVHVRGDSETDLEALFNAVMNPKTANVPQTVPMRLRKLPDSFFKPPEPKSHSRQASTDAGTAGALTPQHVRAHSSPASLQLGAVSPGTLTPTGVVSGPAATPTAQHLRQSSFEIPDDVPLPAGWEMAKTSSGQRYFLNHIDQTTTWQDPRKAMLSQMNVTAPTSPPVQQNMMNSASGPLPDGWEQAMTQDGEIYYINHKNKTTSWLDPRLDPRFAMNQRISQSAPVKQPPPLAPQSPQGGVMGGSNSNQQQQMRLQQLQMEKERLRLKQQELLRQAMRNINPSTANSPKCQELALRSQLPTLEQDGGTQNPVSSPGMSQELRTMTTNSSDPFLNSGTYHSRDESTDSGLSMSSYSVPRTPDDFLNSVDEMDTGDTINQSTLPSQQNRFPDYLEAIPGTNVDLGTLEGDGMNIEGEELMPSLQEALSSDILNDMESVLAATKLDKESFLTWL,504,TRUE,10075,362,9713,0.6236402571,median,Araya,"A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function",2012,10.1073/pnas.1209751109,170-203,YAP1,peptide binding,Binding,YAP1_HUMAN_full_11-26-2021_b02.a2m,1,504,504,0.2,0.2,1604,0.859,433,132.6,0.3062355658,low,1,0.002309468822,YAP1_HUMAN_Araya_2012.csv,W,1,mutant,YAP1_HUMAN_theta_0.2.npy,YAP1_HUMAN.pdb,1-504,0.1,,Binding
+YNZC_BACSU_Tsuboyama_2023_2JVD,YNZC_BACSU_Tsuboyama_2023_2JVD.csv,YNZC_BACSU,Prokaryote,Bacillus subtilis,MISNAKIARINELAAKAKAGVITEEEKAEQQKLRQEYLK,39,TRUE,2300,714,1586,-1,manual,Tsuboyama,Mega-scale experimental analysis of protein folding stability in biology and design,2023,10.1038/s41586-023-06328-6,1-39,UPF0291 protein YnzC,Stability,cDNA display proteolysis,YNZC_BACSU_2023-08-07_b07.a2m,1,39,39,0.7,0.2,7116,0.974,38,1588.3,41.79736842,Medium,13,0.3421052632,Tsuboyama2023_Dataset2_Dataset66,ddG_ML_float,1,mut_type,YNZC_BACSU_theta0.2_2023-08-07_b07.npy,YNZC_BACSU.pdb,1-39,1,,Stability
\ No newline at end of file
diff --git a/reference_files/clinical_indels.csv b/reference_files/clinical_indels.csv
new file mode 100644
index 0000000..3aba056
--- /dev/null
+++ b/reference_files/clinical_indels.csv
@@ -0,0 +1,1556 @@
+DMS_id,DMS_filename,target_seq,mutated_sequence_column,MSA_start,MSA_end,MSA_filename,weight_file_name,dataset
+NP_000007.1,NP_000007.1.csv,MAAGFGRCCRVLRSISRFHWRSQHTKANRQREPGLGFSFEFTEQQKEFQATARKFAREEIIPVAAEYDKTGEYPVPLIRRAWELGLMNTHIPENCGGLGLGTFDACLISEELAYGCTGVQTAIEGNSLGQMPIIIAGNDQQKKKYLGRMTEEPLMCAYCVTEPGAGSDVAGIKTKAEKKGDEYIINGQKMWITNGGKANWYFLLARSDPDPKAPANKAFTGFIVEADTPGIQIGRKELNMGQRCSDTRGIVFEDVKVPKENVLIGDGAGFKVAMGAFDKTRPVVAAGAVGLAQRALDEATKYALERKTFGKLLVEHQAISFMLAEMAMKVELARMSYQRAAWEVDSGRRNTYYASIAKAFAGDIANQLATDAVQILGGNGFNTEYPVEKLMRDAKIYQIYEGTSQIQRLIVAREHIDKYKN,mutated_sequence,1.0,421.0,NP_000007.1.a2m,NP_000007.1.npy,ClinVar
+NP_000008.1,NP_000008.1.csv,MAAALLARASGPARRALCPRAWRQLHTIYQSVELPETHQMLLQTCRDFAEKELFPIAAQVDKEHLFPAAQVKKMGGLGLLAMDVPEELGGAGLDYLAYAIAMEEISRGCASTGVIMSVNNSLYLGPILKFGSKEQKQAWVTPFTSGDKIGCFALSEPGNGSDAGAASTTARAEGDSWVLNGTKAWITNAWEASAAVVFASTDRALQNKGISAFLVPMPTPGLTLGKKEDKLGIRGSSTANLIFEDCRIPKDSILGEPGMGFKIAMQTLDMGRIGIASQALGIAQTALDCAVNYAENRMAFGAPLTKLQVIQFKLADMALALESARLLTWRAAMLKDNKKPFIKEAAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERHYRDARITEIYEGTSEIQRLVIAGHLLRSYRS,mutated_sequence,1.0,412.0,NP_000008.1.a2m,NP_000008.1.npy,ClinVar
+NP_000009.1,NP_000009.1.csv,MQAARMAASLGRQLLRLGGGSSRLTALLGQPRPGPARRPYAGGAAQLALDKSDSHPSDALTRKKPAKAESKSFAVGMFKGQLTTDQVFPYPSVLNEEQTQFLKELVEPVSRFFEEVNDPAKNDALEMVEETTWQGLKELGAFGLQVPSELGGVGLCNTQYARLVEIVGMHDLGVGITLGAHQSIGFKGILLFGTKAQKEKYLPKLASGETVAAFCLTEPSSGSDAASIRTSAVPSPCGKYYTLNGSKLWISNGGLADIFTVFAKTPVTDPATGAVKEKITAFVVERGFGGITHGPPEKKMGIKASNTAEVFFDGVRVPSENVLGEVGSGFKVAMHILNNGRFGMAAALAGTMRGIIAKAVDHATNRTQFGEKIHNFGLIQEKLARMVMLQYVTESMAYMVSANMDQGATDFQIEAAISKIFGSEAAWKVTDECIQIMGGMGFMKEPGVERVLRDLRIFRIFEGTNDILRLFVALQGCMDKGKELSGLGSALKNPFGNAGLLLGEAGKQLRRRAGLGSGLSLSGLVHPELSRSGELAVRALEQFATVVEAKLIKHKKGIVNEQFLLQRLADGAIDLYAMVVVLSRASRSLSEGHPTAQHEKMLCDTWCIEAAARIREGMAALQSDPWQQELYRNFKSISKALVERGGVVTSNPLGF,mutated_sequence,1.0,655.0,NP_000009.1.a2m,NP_000009.1.npy,ClinVar
+NP_000010.1,NP_000010.1.csv,MAVLAALLRSGARSRSPLLRRLVQEIRYVERSYVSKPTLKEVVIVSATRTPIGSFLGSLSLLPATKLGSIAIQGAIEKAGIPKEEVKEAYMGNVLQGGEGQAPTRQAVLGAGLPISTPCTTINKVCASGMKAIMMASQSLMCGHQDVMVAGGMESMSNVPYVMNRGSTPYGGVKLEDLIVKDGLTDVYNKIHMGSCAENTAKKLNIARNEQDAYAINSYTRSKAAWEAGKFGNEVIPVTVTVKGQPDVVVKEDEEYKRVDFSKVPKLKTVFQKENGTVTAANASTLNDGAAALVLMTADAAKRLNVTPLARIVAFADAAVEPIDFPIAPVYAASMVLKDVGLKKEDIAMWEVNEAFSLVVLANIKMLEIDPQKVNINGGAVSLGHPIGMSGARIVGHLTHALKQGEYGLASICNGGGGASAMLIQKL,mutated_sequence,1.0,427.0,NP_000010.1.a2m,NP_000010.1.npy,ClinVar
+NP_000011.2,NP_000011.2.csv,MTLGSPRKGLLMLLMALVTQGDPVKPSRGPLVTCTCESPHCKGPTCRGAWCTVVLVREEGRHPQEHRGCGNLHRELCRGRPTEFVNHYCCDSHLCNHNVSLVLEATQPPSEQPGTDGQLALILGPVLALLALVALGVLGLWHVRRRQEKQRGLHSELGESSLILKASEQGDSMLGDLLDSDCTTGSGSGLPFLVQRTVARQVALVECVGKGRYGEVWRGLWHGESVAVKIFSSRDEQSWFRETEIYNTVLLRHDNILGFIASDMTSRNSSTQLWLITHYHEHGSLYDFLQRQTLEPHLALRLAVSAACGLAHLHVEIFGTQGKPAIAHRDFKSRNVLVKSNLQCCIADLGLAVMHSQGSDYLDIGNNPRVGTKRYMAPEVLDEQIRTDCFESYKWTDIWAFGLVLWEIARRTIVNGIVEDYRPPFYDVVPNDPSFEDMKKVVCVDQQTPTIPNRLAADPVLSGLAQMMRECWYPNPSARLTALRIKKTLQKISNSPEKPKVIQ,mutated_sequence,1.0,503.0,NP_000011.2.a2m,NP_000011.2.npy,ClinVar
+NP_000012.1,NP_000012.1.csv,MTELPAPLSYFQNAQMSEDNHLSNTVRSQNDNRERQEHNDRRSLGHPEPLSNGRPQGNSRQVVEQDEEEDEELTLKYGAKHVIMLFVPVTLCMVVVVATIKSVSFYTRKDGQLIYTPFTEDTETVGQRALHSILNAAIMISVIVVMTILLVVLYKYRCYKVIHAWLIISSLLLLFFFSFIYLGEVFKTYNVAVDYITVALLIWNFGVVGMISIHWKGPLRLQQAYLIMISALMALVFIKYLPEWTAWLILAVISVYDLVAVLCPKGPLRMLVETAQERNETLFPALIYSSTMVWLVNMAEGDPEAQRRVSKNSKYNAESTERESQDTVAENDDGGFSEEWEAQRDSHLGPHRSTPESRAAVQELSSSILAGEDPEERGVKLGLGDFIFYSVLVGKASATASGDWNTTIACFVAILIGLCLTLLLLAIFKKALPALPISITFGLVFYFATDYLVQPFMDQLAFHQFYI,mutated_sequence,1.0,467.0,NP_000012.1.a2m,NP_000012.1.npy,ClinVar
+NP_000017.1,NP_000017.1.csv,MAAGGDHGSPDSYRSPLASRYASPEMCFVFSDRYKFRTWRQLWLWLAEAEQTLGLPITDEQIQEMKSNLENIDFKMAAEEEKRLRHDVMAHVHTFGHCCPKAAGIIHLGATSCYVGDNTDLIILRNALDLLLPKLARVISRLADFAKERASLPTLGFTHFQPAQLTTVGKRCCLWIQDLCMDLQNLKRVRDDLRFRGVKGTTGTQASFLQLFEGDDHKVEQLDKMVTEKAGFKRAFIITGQTYTRKVDIEVLSVLASLGASVHKICTDIRLLANLKEMEEPFEKQQIGSSAMPYKRNPMRSERCCSLARHLMTLVMDPLQTASVQWFERTLDDSANRRICLAEAFLTADTILNTLQNISEGLVVYPKVIERRIRQELPFMATENIIMAMVKAGGSRQDCHEKIRVLSQQAASVVKQEGGDNDLIERIQVDAYFSPIHSQLDHLLDPSSFTGRASQQVQRFLEEEVYPLLKPYESVMKVKAELCL,mutated_sequence,1.0,484.0,NP_000017.1.a2m,NP_000017.1.npy,ClinVar
+NP_000018.2,NP_000018.2.csv,MARKSNLPVLLVPFLLCQALVRCSSPLPLVVNTWPFKNATEAAWRALASGGSALDAVESGCAMCEREQCDGSVGFGGSPDELGETTLDAMIMDGTTMDVGAVGDLRRIKNAIGVARKVLEHTTHTLLVGESATTFAQSMGFINEDLSTTASQALHSDWLARNCQPNYWRNVIPDPSKYCGPYKPPGILKQDIPIHKETEDDRGHDTIGMVVIHKTGHIAAGTSTNGIKFKIHGRVGDSPIPGAGAYADDTAGAAAATGNGDILMRFLPSYQAVEYMRRGEDPTIACQKVISRIQKHFPEFFGAVICANVTGSYGAACNKLSTFTQFSFMVYNSEKNQPTEEKVDCI,mutated_sequence,1.0,346.0,NP_000018.2.a2m,NP_000018.2.npy,ClinVar
+NP_000021.1,NP_000021.1.csv,MASHKLLVTPPKALLKPLSIPNQLLLGPGPSNLPPRIMAAGGLQMIGSMSKDMYQIMDEIKEGIQYVFQTRNPLTLVISGSGHCALEAALVNVLEPGDSFLVGANGIWGQRAVDIGERIGARVHPMTKDPGGHYTLQEVEEGLAQHKPVLLFLTHGESSTGVLQPLDGFGELCHRYKCLLLVDSVASLGGTPLYMDRQGIDILYSGSQKALNAPPGTSLISFSDKAKKKMYSRKTKPFSFYLDIKWLANFWGCDDQPRMYHHTIPVISLYSLRESLALIAEQGLENSWRQHREAAAYLHGRLQALGLQLFVKDPALRLPTVTTVAVPAGYDWRDIVSYVIDHFDIEIMGGLGPSTGKVLRIGLLGCNATRENVDRVTEALRAALQHCPKKKL,mutated_sequence,1.0,392.0,NP_000021.1.a2m,NP_000021.1.npy,ClinVar
+NP_000024.2,NP_000024.2.csv,MPVLSRPRPWRGNTLKRTAVLLALAAYGAHKVYPLVRQCLAPARGLQAPAGEPTQEASGVAAAKAGMNRVFLQRLLWLLRLLFPRVLCRETGLLALHSAALVSRTFLSVYVARLDGRLARCIVRKDPRAFGWQLLQWLLIALPATFVNSAIRYLEGQLALSFRSRLVAHAYRLYFSQQTYYRVSNMDGRLRNPDQSLTEDVVAFAASVAHLYSNLTKPLLDVAVTSYTLLRAARSRGAGTAWPSAIAGLVVFLTANVLRAFSPKFGELVAEEARRKGELRYMHSRVVANSEEIAFYGGHEVELALLQRSYQDLASQINLILLERLWYVMLEQFLMKYVWSASGLLMVAVPIITATGYSESDAEAVKKAALEKKEEELVSERTEAFTIARNLLTAAADAIERIMSSYKEVTELAGYTARVHEMFQVFEDVQRCHFKRPRELEDAQAGSGTIGRSGVRVEGPLKIRGQVVDVEQGIICENIPIVTPSGEVVVASLNIRVEEGMHLLITGPNGCGKSSLFRILGGLWPTYGGVLYKPPPQRMFYIPQRPYMSVGSLRDQVIYPDSVEDMQRKGYSEQDLEAILDVVHLHHILQREGGWEAMCDWKDVLSGGEKQRIGMARMFYHRPKYALLDECTSAVSIDVEGKIFQAAKDAGIALLSITHRPSLWKYHTHLLQFDGEGGWKFEKLDSAARLSLTEEKQRLEQQLAGIPKMQRRLQELCQILGEAVAPAHVPAPSPQGPGGLQGAST,mutated_sequence,1.0,745.0,NP_000024.2.a2m,NP_000024.2.npy,ClinVar
+NP_000026.2,NP_000026.2.csv,MAHRFPALTQEQKKELSEIAQSIVANGKGILAADESVGTMGNRLQRIKVENTEENRRQFREILFSVDSSINQSIGGVILFHETLYQKDSQGKLFRNILKEKGIVVGIKLDQGGAPLAGTNKETTIQGLDGLSERCAQYKKDGVDFGKWRAVLRIADQCPSSLAIQENANALARYASICQQNGLVPIVEPEVIPDGDHDLEHCQYVTEKVLAAVYKALNDHHVYLEGTLLKPNMVTAGHACTKKYTPEQVAMATVTALHRTVPAAVPGICFLSGGMSEEDATLNLNAINLCPLPKPWKLSFSYGRALQASALAAWGGKAANKEATQEAFMKRAMANCQAAKGQYVHTGSSGAASTQSLFTACYTY,mutated_sequence,1.0,364.0,NP_000026.2.a2m,NP_000026.2.npy,ClinVar
+NP_000032.1,NP_000032.1.csv,MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRWELALGRFWDYLRWVQTLSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTPVAEETRARLSKELQAAQARLGADMEDVCGRLVQYRGEVQAMLGQSTEELRVRLASHLRKLRKRLLRDADDLQKRLAVYQAGAREGAERGLSAIRERLGPLVEQGRVRAATVGSLAGQPLQERAQAWGERLRARMEEMGSRTRDRLDEVKEQVAEVRAKLEEQAQQIRLQAEAFQARLKSWFEPLVEDMQRQWAGLVEKVQAAVGTSAAPVPSDNH,mutated_sequence,1.0,317.0,NP_000032.1.a2m,NP_000032.1.npy,ClinVar
+NP_000034.1,NP_000034.1.csv,MLGIWTLLPLVLTSVARLSSKSVNAQVTDINSKGLELRKTVTTVETQNLEGLHHDGQFCHKPCPPGERKARDCTVNGDEPDCVPCQEGKEYTDKAHFSSKCRRCRLCDEGHGLEVEINCTRTQNTKCRCKPNFFCNSTVCEHCDPCTKCEHGIIKECTLTSNTKCKEEGSRSNLGWLCLLLLPIPLIVWVKRKEVQKTCRKHRKENQGSHESPTLNPETVAINLSDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQLLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSLV,mutated_sequence,1.0,335.0,NP_000034.1.a2m,NP_000034.1.npy,ClinVar
+NP_000035.2,NP_000035.2.csv,MEVQLGLGRVYPRPPSKTYRGAFQNLFQSVREVIQNPGPRHPEAASAAPPGASLLLLQQQQQQQQQQQQQQQQQQQQQQQETSPRQQQQQQGEDGSPQAHRRGPTGYLVLDEEQQPSQPQSALECHPERGCVPEPGAAVAASKGLPQQLPAPPDEDDSAAPSTLSLLGPTFPGLSSCSADLKDILSEASTMQLLQQQQQEAVSEGSSSGRAREASGAPTSSKDNYLGGTSTISDNAKELCKAVSVSMGLGVEALEHLSPGEQLRGDCMYAPLLGVPPAVRPTPCAPLAECKGSLLDDSAGKSTEDTAEYSPFKGGYTKGLEGESLGCSGSAAAGSSGTLELPSTLSLYKSGALDEAAAYQSRDYYNFPLALAGPPPPPPPPHPHARIKLENPLDYGSAWAAAAAQCRYGDLASLHGAGAAGPGSGSPSAAASSSWHTLFTAEEGQLYGPCGGGGGGGGGGGGGGGGGGGGGGGEAGAVAPYGYTRPPQGLAGQESDFTAPDVWYPGGMVSRVPYPSPTCVKSEMGPWMDSYSGPYGDMRLETARDHVLPIDYYFPPQKTCLICGDEASGCHYGALTCGSCKVFFKRAAEGKQKYLCASRNDCTIDKFRRKNCPSCRLRKCYEAGMTLGARKLKKLGNLKLQEEGEASSTTSPTEETTQKLTVSHIEGYECQPIFLNVLEAIEPGVVCAGHDNNQPDSFAALLSSLNELGERQLVHVVKWAKALPGFRNLHVDDQMAVIQYSWMGLMVFAMGWRSFTNVNSRMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRFYQLTKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSGKVKPIYFHTQ,mutated_sequence,1.0,920.0,NP_000035.2.a2m,NP_000035.2.npy,ClinVar
+NP_000037.2,NP_000037.2.csv,MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRPPHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNVSEIPSSDPPTKTLWLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCDPKATGVWGPWM,mutated_sequence,1.0,533.0,NP_000037.2.a2m,NP_000037.2.npy,ClinVar
+NP_000038.2,NP_000038.2.csv,MLHLHHSCLCFRSWLPAMLAVLLSLAPSASSDISASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFDHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVLALVALTLVAGKLTHLIPVSWMPVIWSALSAVLLLASSYFVGALIVHADCFLMRNHTITEQPMCFQRTTPLILQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGWEGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRVIDGQDLLPLLLGTAQHSDHEFLMHYCERFLHAARWHQRDRGTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKVVHHDPPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPVPLQLDRLGNIWRPWLQPCCGPFPLCWCLREDDPQ,mutated_sequence,1.0,589.0,NP_000038.2.a2m,NP_000038.2.npy,ClinVar
+NP_000042.3,NP_000042.3.csv,MSLVLNDLLICCRQLEHDRATERKKEVEKFKRLIRDPETIKHLDRHSDSKQGKYLNWDAVFRFLQKYIQKETECLRIAKPNVSASTQASRQKKMQEISSLVKYFIKCANRRAPRLKCQELLNYIMDTVKDSSNGAIYGADCSNILLKDILSVRKYWCEISQQQWLELFSVYFRLYLKPSQDVHRVLVARIIHAVTKGCCSQTDGLNSKFLDFFSKAIQCARQEKSSSGLNHILAALTIFLKTLAVNFRIRVCELGDEILPTLLYIWTQHRLNDSLKEVIIELFQLQIYIHHPKGAKTQEKGAYESTKWRSILYNLYDLLVNEISHIGSRGKYSSGFRNIAVKENLIELMADICHQVFNEDTRSLEISQSYTTTQRESSDYSVPCKRKKIELGWEVIKDHLQKSQNDFDLVPWLQIATQLISKYPASLPNCELSPLLMILSQLLPQQRHGERTPYVLRCLTEVALCQDKRSNLESSQKSDLLKLWNKIWCITFRGISSEQIQAENFGLLGAIIQGSLVEVDREFWKLFTGSACRPSCPAVCCLTLALTTSIVPGTVKMGIEQNMCEVNRSFSLKESIMKWLLFYQLEGDLENSTEVPPILHSNFPHLVLEKILVSLTMKNCKAAMNFFQSVPECEHHQKDKEELSFSEVEELFLQTTFDKMDFLTIVRECGIEKHQSSIGFSVHQNLKESLDRCLLGLSEQLLNNYSSEITNSETLVRCSRLLVGVLGCYCYMGVIAEEEAYKSELFQKAKSLMQCAGESITLFKNKTNEEFRIGSLRNMMQLCTRCLSNCTKKSPNKIASGFFLRLLTSKLMNDIADICKSLASFIKKPFDRGEVESMEDDTNGNLMEVEDQSSMNLFNDYPDSSVSDANEPGESQSTIGAINPLAEEYLSKQDLLFLDMLKFLCLCVTTAQTNTVSFRAADIRRKLLMLIDSSTLEPTKSLHLHMYLMLLKELPGEEYPLPMEDVLELLKPLSNVCSLYRRDQDVCKTILNHVLHVVKNLGQSNMDSENTRDAQGQFLTVIGAFWHLTKERKYIFSVRMALVNCLKTLLEADPYSKWAILNVMGKDFPVNEVFTQFLADNHHQVRMLAAESINRLFQDTKGDSSRLLKALPLKLQQTAFENAYLKAQEGMREMSHSAENPETLDEIYNRKSVLLTLIAVVLSCSPICEKQALFALCKSVKENGLEPHLVKKVLEKVSETFGYRRLEDFMASHLDYLVLEWLNLQDTEYNLSSFPFILLNYTNIEDFYRSCYKVLIPHLVIRSHFDEVKSIANQIQEDWKSLLTDCFPKILVNILPYFAYEGTRDSGMAQQRETATKVYDMLKSENLLGKQIDHLFISNLPEIVVELLMTLHEPANSSASQSTDLCDFSGDLDPAPNPPHFPSHVIKATFAYISNCHKTKLKSILEILSKSPDSYQKILLAICEQAAETNNVYKKHRILKIYHLFVSLLLKDIKSGLGGAWAFVLRDVIYTLIHYINQRPSCIMDVSLRSFSLCCDLLSQVCQTAVTYCKDALENHLHVIVGTLIPLVYEQVEVQKQVLDLLKYLVIDNKDNENLYITIKLLDPFPDHVVFKDLRITQQKIKYSRGPFSLLEEINHFLSVSVYDALPLTRLEGLKDLRRQLELHKDQMVDIMRASQDNPQDGIMVKLVVNLLQLSKMAINHTGEKEVLEAVGSCLGEVGPIDFSTIAIQHSKDASYTKALKLFEDKELQWTFIMLTYLNNTLVEDCVKVRSAAVTCLKNILATKTGHSFWEIYKMTTDPMLAYLQPFRTSRKKFLEVPRFDKENPFEGLDDINLWIPLSENHDIWIKTLTCAFLDSGGTKCEILQLLKPMCEVKTDFCQTVLPYLIHDILLQDTNESWRNLLSTHVQGFFTSCLRHFSQTSRSTTPANLDSESEHFFRCCLDKKSQRTMLAVVDYMRRQKRPSSGTIFNDAFWLDLNYLEVAKVAQSCAAHFTALLYAEIYADKKSMDDQEKRSLAFEEGSQSTTISSLSEKSKEETGISLQDLLLEIYRSIGEPDSLYGCGGGKMLQPITRLRTYEHEAMWGKALVTYDLETAIPSSTRQAGIIQALQNLGLCHILSVYLKGLDYENKDWCPELEELHYQAAWRNMQWDHCTSVSKEVEGTSYHESLYNALQSLRDREFSTFYESLKYARVKEVEEMCKRSLESVYSLYPTLSRLQAIGELESIGELFSRSVTHRQLSEVYIKWQKHSQLLKDSDFSFQEPIMALRTVILEILMEKEMDNSQRECIKDILTKHLVELSILARTFKNTQLPERAIFQIKQYNSVSCGVSEWQLEEAQVFWAKKEQSLALSILKQMIKKLDASCAANNPSLKLTYTECLRVCGNWLAETCLENPAVIMQTYLEKAVEVAGNYDGESSDELRNGKMKAFLSLARFSDTQYQRIENYMKSSEFENKQALLKRAKEEVGLLREHKIQTNRYTVKVQRELELDELALRALKEDRKRFLCKAVENYINCLLSGEEHDMWVFRLCSLWLENSGVSEVNGMMKRDGMKIPTYKFLPLMYQLAARMGTKMMGGLGFHEVLNNLISRISMDHPHHTLFIILALANANRDEFLTKPEVARRSRITKNVPKQSSQLDEDRTEAANRIICTIRSRRPQMVRSVEALCDAYIILANLDATQWKTQRKGINIPADQPITKLKNLEDVVVPTMEIKVDHTGEYGNLVTIQSFKAEFRLAGGVNLPKIIDCVGSDGKERRQLVKGRDDLRQDAVMQQVFQMCNTLLQRNTETRKRKLTICTYKVVPLSQRSGVLEWCTGTVPIGEFLVNNEDGAHKRYRPNDFSAFQCQKKMMEVQKKSFEEKYEVFMDVCQNFQPVFRYFCMEKFLDPAIWFEKRLAYTRSVATSSIVGYILGLGDRHVQNILINEQSAELVHIDLGVAFEQGKILPTPETVPFRLTRDIVDGMGITGVEGVFRRCCEKTMEVMRNSQETLLTIVEVLLYDPLFDWTMNPLKALYLQQRPEDETELHPTLNADDQECKRNLSDIDQSFNKVAERVLMRLQEKLKGVEEGTVLSVGGQVNLLIQQAIDPKNLSRLFPGWKAWV,mutated_sequence,1.0,3056.0,NP_000042.3.a2m,NP_000042.3.npy,ClinVar
+NP_000043.4,NP_000043.4.csv,MDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVHHIKVSLEEKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLTLPWDHIQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKSGACEDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQPHLISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTNDSTATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKAIEAVSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCVQSIEGVISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSDTNEPLVVIAQPSSEMPLLTSTNEFYTKGMTPVQDKEEGKNSSKCYIQVTGMTCASCVANIERNLRREEGIYSILVALMAGKAEVRYNPAVIQPPMIAEFIRELGFGATVIENADEGDGVLELVVRGMTCASCVHKIESSLTKHRGILYCSVALATNKAHIKYDPEIIGPRDIIHTIESLGFEASLVKKDRSASHLDHKREIRQWRRSFLVSLFFCIPVMGLMIYMMVMDHHFATLHHNQNMSKEEMINLHSSMFLERQILPGLSVMNLLSFLLCVPVQFFGGWYFYIQAYKALKHKTANMDVLIVLATTIAFAYSLIILLVAMYERAKVNPITFFDTPPMLFVFIALGRWLEHIAKGKTSEALAKLISLQATEATIVTLDSDNILLSEEQVDVELVQRGDIIKVVPGGKFPVDGRVIEGHSMVDESLITGEAMPVAKKPGSTVIAGSINQNGSLLICATHVGADTTLSQIVKLVEEAQTSKAPIQQFADKLSGYFVPFIVFVSIATLLVWIVIGFLNFEIVETYFPGYNRSISRTETIIRFAFQASITVLCIACPCSLGLATPTAVMVGTGVGAQNGILIKGGEPLEMAHKVKVVVFDKTGTITHGTPVVNQVKVLTESNRISHHKILAIVGTAESNSEHPLGTAITKYCKQELDTETLGTCIDFQVVPGCGISCKVTNIEGLLHKNNWNIEDNNIKNASLVQIDASNEQSSTSSSMIIDAQISNALNAQQYKVLIGNREWMIRNGLVINNDVNDFMTEHERKGRTAVLVAVDDELCGLIAIADTVKPEAELAIHILKSMGLEVVLMTGDNSKTARSIASQVGITKVFAEVLPSHKVAKVKQLQEEGKRVAMVGDGINDSPALAMANVGIAIGTGTDVAIEAADVVLIRNDLLDVVASIDLSRKTVKRIRINFVFALIYNLVGIPIAAGVFMPIGLVLQPWMGSAAMAASSVSVVLSSLFLKLYRKPTYESYELPARSQIGQKSPSEISVHVGIDDTSRNSPKLGLLDRIVNYSRASINSLLSDKRSLNSVVTSEPDKHSLLVGDFREDDDTAL,mutated_sequence,1.0,1500.0,NP_000043.4.a2m,NP_000043.4.npy,ClinVar
+NP_000044.2,NP_000044.2.csv,MPEQERQITAREGASRKILSKLSLPTRAWEPAMKKSFAFDNVGYEGGLDGLGPSSQVATSTVRILGMTCQSCVKSIEDRISNLKGIISMKVSLEQGSATVKYVPSVVCLQQVCHQIGDMGFEASIAEGKAASWPSRSLPAQEAVVKLRVEGMTCQSCVSSIEGKVRKLQGVVRVKVSLSNQEAVITYQPYLIQPEDLRDHVNDMGFEAAIKSKVAPLSLGPIDIERLQSTNPKRPLSSANQNFNNSETLGHQGSHVVTLQLRIDGMHCKSCVLNIEENIGQLLGVQSIQVSLENKTAQVKYDPSCTSPVALQRAIEALPPGNFKVSLPDGAEGSGTDHRSSSSHSPGSPPRNQVQGTCSTTLIAIAGMTCASCVHSIEGMISQLEGVQQISVSLAEGTATVLYNPSVISPEELRAAIEDMGFEASVVSESCSTNPLGNHSAGNSMVQTTDGTPTSVQEVAPHTGRLPANHAPDILAKSPQSTRAVAPQKCFLQIKGMTCASCVSNIERNLQKEAGVLSVLVALMAGKAEIKYDPEVIQPLEIAQFIQDLGFEAAVMEDYAGSDGNIELTITGMTCASCVHNIESKLTRTNGITYASVALATSKALVKFDPEIIGPRDIIKIIEEIGFHASLAQRNPNAHHLDHKMEIKQWKKSFLCSLVFGIPVMALMIYMLIPSNEPHQSMVLDHNIIPGLSILNLIFFILCTFVQLLGGWYFYVQAYKSLRHRSANMDVLIVLATSIAYVYSLVILVVAVAEKAERSPVTFFDTPPMLFVFIALGRWLEHLAKSKTSEALAKLMSLQATEATVVTLGEDNLIIREEQVPMELVQRGDIVKVVPGGKFPVDGKVLEGNTMADESLITGEAMPVTKKPGSTVIAGSINAHGSVLIKATHVGNDTTLAQIVKLVEEAQMSKAPIQQLADRFSGYFVPFIIIMSTLTLVVWIVIGFIDFGVVQRYFPNPNKHISQTEVIIRFAFQTSITVLCIACPCSLGLATPTAVMVGTGVAAQNGILIKGGKPLEMAHKIKTVMFDKTGTITHGVPRVMRVLLLGDVATLPLRKVLAVVGTAEASSEHPLGVAVTKYCKEELGTETLGYCTDFQAVPGCGIGCKVSNVEGILAHSERPLSAPASHLNEAGSLPAEKDAVPQTFSVLIGNREWLRRNGLTISSDVSDAMTDHEMKGQTAILVAIDGVLCGMIAIADAVKQEAALAVHTLQSMGVDVVLITGDNRKTARAIATQVGINKVFAEVLPSHKVAKVQELQNKGKKVAMVGDGVNDSPALAQADMGVAIGTGTDVAIEAADVVLIRNDLLDVVASIHLSKRTVRRIRINLVLALIYNLVGIPIAAGVFMPIGIVLQPWMGSAAMAASSVSVVLSSLQLKCYKKPDLERYEAQAHGHMKPLTASQVSVHIGMDDRWRDSPRATPWDQVSYVSQVSLSSLTSDKPSRHSAAADDDGDKWSLLLNGRDEEQYI,mutated_sequence,1.0,1465.0,NP_000044.2.a2m,NP_000044.2.npy,ClinVar
+NP_000045.1,NP_000045.1.csv,MLMASTTSAVPGHPSLPSLPSNSSQERPLDTRDPLLARAELALLSIVFVAVALSNGLVLAALARRGRRGHWAPIHVFIGHLCLADLAVALFQVLPQLAWKATDRFRGPDALCRAVKYLQMVGMYASSYMILAMTLDRHRAICRPMLAYRHGSGAHWNRPVLVAWAFSLLLSLPQLFIFAQRNVEGGSGVTDCWACFAEPWGRRTYVTWIALMVFVAPTLGIAACQVLIFREIHASLVPGPSERPGGRRRGRRTGSPGEGAHVSAAVAKTVRMTLVIVVVYVLCWAPFFLVQLWAAWDPEAPLEGAPFVLLMLLASLNSCTNPWIYASFSSSVSSELRSLLCCARGRTPPSLGPQDESCTTASSSLAKDTSS,mutated_sequence,1.0,371.0,NP_000045.1.a2m,NP_000045.1.npy,ClinVar
+NP_000048.1,NP_000048.1.csv,MAAVPQNNLQEQLERHSARTLNNKLSLSKPKFSGFTFKKKTSSDNNVSVTNVSVAKTPVLRNKDVNVTEDFSFSEPLPNTTNQQRVKDFFKNAPAGQETQRGGSKSLLPDFLQTPKEVVCTTQNTPTVKKSRDTALKKLEFSSSPDSLSTINDWDDMDDFDTSETSKSFVTPPQSHFVRVSTAQKSKKGKRNFFKAQLYTTNTVKTDLPPPSSESEQIDLTEEQKDDSEWLSSDVICIDDGPIAEVHINEDAQESDSLKTHLEDERDNSEKKKNLEEAELHSTEKVPCIEFDDDDYDTDFVPPSPEEIISASSSSSKCLSTLKDLDTSDRKEDVLSTSKDLLSKPEKMSMQELNPETSTDCDARQISLQQQLIHVMEHICKLIDTIPDDKLKLLDCGNELLQQRNIRRKLLTEVDFNKSDASLLGSLWRYRPDSLDGPMEGDSCPTGNSMKELNFSHLPSNSVSPGDCLLTTTLGKTGFSATRKNLFERPLFNTHLQKSFVSSNWAETPRLGKKNESSYFPGNVLTSTAVKDQNKHTASINDLERETQPSYDIDNFDIDDFDDDDDWEDIMHNLAASKSSTAAYQPIKEGRPIKSVSERLSSAKTDCLPVSSTAQNINFSESIQNYTDKSAQNLASRNLKHERFQSLSFPHTKEMMKIFHKKFGLHNFRTNQLEAINAALLGEDCFILMPTGGGKSLCYQLPACVSPGVTVVISPLRSLIVDQVQKLTSLDIPATYLTGDKTDSEATNIYLQLSKKDPIIKLLYVTPEKICASNRLISTLENLYERKLLARFVIDEAHCVSQWGHDFRQDYKRMNMLRQKFPSVPVMALTATANPRVQKDILTQLKILRPQVFSMSFNRHNLKYYVLPKKPKKVAFDCLEWIRKHHPYDSGIIYCLSRRECDTMADTLQRDGLAALAYHAGLSDSARDEVQQKWINQDGCQVICATIAFGMGIDKPDVRFVIHASLPKSVEGYYQESGRAGRDGEISHCLLFYTYHDVTRLKRLIMMEKDGNHHTRETHFNNLYSMVHYCENITECRRIQLLAYFGENGFNPDFCKKHPDVSCDNCCKTKDYKTRDVTDDVKSIVRFVQEHSSSQGMRNIKHVGPSGRFTMNMLVDIFLGSKSAKIQSGIFGKGSAYSRHNAERLFKKLILDKILDEDLYINANDQAIAYVMLGNKAQTVLNGNLKVDFMETENSSSVKKQKALVAKVSQREEMVKKCLGELTEVCKSLGKVFGVHYFNIFNTVTLKKLAESLSSDPEVLLQIDGVTEDKLEKYGAEVISVLQKYSEWTSPAEDSSPGISLSSSRGPGRSAAEELDEEIPVSSHYFASKTRNERKRKKMPASQRSKRRKTASSGSKAKGGSATCRKISSKTKSSSIIGSSSASHTSQATSGANSKLGIMAPPKPINRPFLKPSYAFS,mutated_sequence,1.0,1417.0,NP_000048.1.a2m,NP_000048.1.npy,ClinVar
+NP_000050.3,NP_000050.3.csv,MPIGSKERPTFFEIFKTRCNKADLGPISLNWFEELSSEAPPYNSEPAEESEHKNNNYEPNLFKTPQRKPSYNQLASTPIIFKEQGLTLPLYQSPVKELDKFKLDLGRNVPNSRHKSLRTVKTKMDQADDVSCPLLNSCLSESPVVLQCTHVTPQRDKSVVCGSLFHTPKFVKGRQTPKHISESLGAEVDPDMSWSSSLATPPTLSSTVLIVRNEEASETVFPHDTTANVKSYFSNHDESLKKNDRFIASVTDSENTNQREAASHGFGKTSGNSFKVNSCKDHIGKSMPNVLEDEVYETVVDTSEEDSFSLCFSKCRTKNLQKVRTSKTRKKIFHEANADECEKSKNQVKEKYSFVSEVEPNDTDPLDSNVANQKPFESGSDKISKEVVPSLACEWSQLTLSGLNGAQMEKIPLLHISSCDQNISEKDLLDTENKRKKDFLTSENSLPRISSLPKSEKPLNEETVVNKRDEEQHLESHTDCILAVKQAISGTSPVASSFQGIKKSIFRIRESPKETFNASFSGHMTDPNFKKETEASESGLEIHTVCSQKEDSLCPNLIDNGSWPATTTQNSVALKNAGLISTLKKKTNKFIYAIHDETSYKGKKIPKDQKSELINCSAQFEANAFEAPLTFANADSGLLHSSVKRSCSQNDSEEPTLSLTSSFGTILRKCSRNETCSNNTVISQDLDYKEAKCNKEKLQLFITPEADSLSCLQEGQCENDPKSKKVSDIKEEVLAAACHPVQHSKVEYSDTDFQSQKSLLYDHENASTLILTPTSKDVLSNLVMISRGKESYKMSDKLKGNNYESDVELTKNIPMEKNQDVCALNENYKNVELLPPEKYMRVASPSRKVQFNQNTNLRVIQKNQEETTSISKITVNPDSEELFSDNENNFVFQVANERNNLALGNTKELHETDLTCVNEPIFKNSTMVLYGDTGDKQATQVSIKKDLVYVLAEENKNSVKQHIKMTLGQDLKSDISLNIDKIPEKNNDYMNKWAGLLGPISNHSFGGSFRTASNKEIKLSEHNIKKSKMFFKDIEEQYPTSLACVEIVNTLALDNQKKLSKPQSINTVSAHLQSSVVVSDCKNSHITPQMLFSKQDFNSNHNLTPSQKAEITELSTILEESGSQFEFTQFRKPSYILQKSTFEVPENQMTILKTTSEECRDADLHVIMNAPSIGQVDSSKQFEGTVEIKRKFAGLLKNDCNKSASGYLTDENEVGFRGFYSAHGTKLNVSTEALQKAVKLFSDIENISEETSAEVHPISLSSSKCHDSVVSMFKIENHNDKTVSEKNNKCQLILQNNIEMTTGTFVEEITENYKRNTENEDNKYTAASRNSHNLEFDGSDSSKNDTVCIHKDETDLLFTDQHNICLKLSGQFMKEGNTQIKEDLSDLTFLEVAKAQEACHGNTSNKEQLTATKTEQNIKDFETSDTFFQTASGKNISVAKESFNKIVNFFDQKPEELHNFSLNSELHSDIRKNKMDILSYEETDIVKHKILKESVPVGTGNQLVTFQGQPERDEKIKEPTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQGTSEITSFSHQWAKTLKYREACKDLELACETIEITAAPKCKEMQNSLNNDKNLVSIETVVPPKLLSDNLCRQTENLKTSKSIFLKVKVHENVEKETAKSPATCYTNQSPYSVIENSALAFYTSCSRKTSVSQTSLLEAKKWLREGIFDGQPERINTADYVGNYLYENNSNSTIAENDKNHLSEKQDTYLSNSSMSNSYSYHSDEVYNDSGYLSKNKLDSGIEPVLKNVEDQKNTSFSKVISNVKDANAYPQTVNEDICVEELVTSSSPCKNKNAAIKLSISNSNNFEVGPPAFRIASGKIVCVSHETIKKVKDIFTDSFSKVIKENNENKSKICQTKIMAGCYEALDDSEDILHNSLDNDECSTHSHKVFADIQSEEILQHNQNMSGLEKVSKISPCDVSLETSDICKCSIGKLHKSVSSANTCGIFSTASGKSVQVSDASLQNARQVFSEIEDSTKQVFSKVLFKSNEHSDQLTREENTAIRTPEHLISQKGFSYNVVNSSAFSGFSTASGKQVSILESSLHKVKGVLEEFDLIRTEHSLHYSPTSRQNVSKILPRVDKRNPEHCVNSEMEKTCSKEFKLSNNLNVEGGSSENNHSIKVSPYLSQFQQDKQQLVLGTKVSLVENIHVLGKEQASPKNVKMEIGKTETFSDVPVKTNIEVCSTYSKDSENYFETEAVEIAKAFMEDDELTDSKLPSHATHSLFTCPENEEMVLSNSRIGKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITCVPFRTTKERQEIQNPNFTAPGQEFLSKSHLYEHLTLEKSSSNLAVSGHPFYQVSATRNEKMRHLITTGRPTKVFVPPFKTKSHFHRVEQCVRNINLEENRQKQNIDGHGSDDSKNKINDNEIHQFNKNNSNQAVAVTFTKCEEEPLDLITSLQNARDIQDMRIKKKQRQRVFPQPGSLYLAKTSTLPRISLKAAVGGQVPSACSHKQLYTYGVSKHCIKINSKNAESFQFHTEDYFGKESLWTGKGIQLADGGWLIPSNDGKAGKEEFYRALCDTPGVDPKLISRIWVYNHYRWIIWKLAAMECAFPKEFANRCLSPERVLLQLKYRYDTEIDRSRRSAIKKIMERDDTAAKTLVLCVSDIISLSANISETSSNKTSSADTQKVAIIELTDGWYAVKAQLDPPLLAVLKNGRLTVGQKIILHGAELVGSPDACTPLEAPESLMLKISANSTRPARWYTKLGFFPDPRPFPLPLSSLFSDGGNVGCVDVIIQRAYPIQWMEKTSSGLYIFRNEREEEKEAAKYVEAQQKRLEALFTKIQEEFEEHEENTTKPYLPSRALTRQQVRALQDGAELYEAVKNAADPAYLEGYFSEEQLRALNNHRQMLNDKKQAQIQLEIRKAMESAEQKEQGLSRDVTTVWKLRIVSYSKKEKDSVILSIWRPSSDLYSLLTEGKRYRIYHLATSKSKSKSERANIQLAATKKTQYQQLPVSDEILFQIYQPREPLHFSKFLDPDFQPSCSEVDLIGFVVSVVKKTGLAPFVYLSDECYNLLAIKFWIDLNEDIIKPHMLIAASNLQWRPESKSGLLTLFAGDFSVFSASPKEGHFQETFNKMKNTVENIDILCNEAENKLMHILHANDPKWSTPTKDCTSGPYTAQIIPGTGNKLLMSSPNCEIYYQSPLSLCMAKRKSVSTPVSAQMTSKSCKGEKEIDDQKNCKKRRALDFLSRLPLPPPVSPICTFVSPAAQKAFQPPRSCGTKYETPIKKKELNSPQMTPFKKFNEISLLESNSIADEELALINTQALLSGSTGEKQFISVSESTRTAPTSSEDYLRLKRRCTTSLIKEQESSQASTEECEKNKQDTITTKKYI,mutated_sequence,1.0,3418.0,NP_000050.3.a2m,NP_000050.3.npy,ClinVar
+NP_000052.1,NP_000052.1.csv,MAAVILESIFLKRSQQKKKTSPLNFKKRLFLLTVHKLSYYEYDFERGRRGSKKGSIDVEKITCVETVVPEKNPPPERQIPRRGEESSEMEQISIIERFPYPFQVVYDEGPLYVFSPTEELRKRWIHQLKNVIRYNSDLVQKYHPCFWIDGQYLCCSQTAKNAMGCQILENRNGSLKPGSSHRKTKKPLPPTPEEDQILKKPLPPEPAAAPVSTSELKKVVALYDYMPMNANDLQLRKGDEYFILEESNLPWWRARDKNGQEGYIPSNYVTEAEDSIEMYEWYSKHMTRSQAEQLLKQEGKEGGFIVRDSSKAGKYTVSVFAKSTGDPQGVIRHYVVCSTPQSQYYLAEKHLFSTIPELINYHQHNSAGLISRLKYPVSQQNKNAPSTAGLGYGSWEIDPKDLTFLKELGTGQFGVVKYGKWRGQYDVAIKMIKEGSMSEDEFIEEAKVMMNLSHEKLVQLYGVCTKQRPIFIITEYMANGCLLNYLREMRHRFQTQQLLEMCKDVCEAMEYLESKQFLHRDLAARNCLVNDQGVVKVSDFGLSRYVLDDEYTSSVGSKFPVRWSPPEVLMYSKFSSKSDIWAFGVLMWEIYSLGKMPYERFTNSETAEHIAQGLRLYRPHLASEKVYTIMYSCWHEKADERPTFKILLSNILDVMDEES,mutated_sequence,1.0,659.0,NP_000052.1.a2m,NP_000052.1.npy,ClinVar
+NP_000053.2,NP_000053.2.csv,MASRLTLLTLLLLLLAGDRASSNPNATSSSSQDPESLQDRGEGKVATTVISKMLFVEPILEVSSLPTTNSTTNSATKITANTTDEPTTQPTTEPTTQPTIQPTQPTTQLPTDSPTQPTTGSFCPGPVTLCSDLESHSTEAVLGDALVDFSLKLYHAFSAMKKVETNMAFSPFSIASLLTQVLLGAGENTKTNLESILSYPKDFTCVHQALKGFTTKGVTSVSQIFHSPDLAIRDTFVNASRTLYSSSPRVLSNNSDANLELINTWVAKNTNNKISRLLDSLPSDTRLVLLNAIYLSAKWKTTFDPKKTRMEPFHFKNSVIKVPMMNSKKYPVAHFIDQTLKAKVGQLQLSHNLSLVILVPQNLKHRLEDMEQALSPSVFKAIMEKLEMSKFQPTLLTLPRIKVTTSQDMLSIMEKLEFFDFSYDLNLCGLTEDPDLQVSAMQHQTVLELTETGVEAAAASAISVARTLLVFEVQQPFLFVLWDQQHKFPVFMGRVYDPRA,mutated_sequence,1.0,500.0,NP_000053.2.a2m,NP_000053.2.npy,ClinVar
+NP_000061.1,NP_000061.1.csv,MPTVISASVAPRTAAEPRSPGPVPHPAQSKATEAGGGNPSGIYSAIISRNFPIIGVKEKTFEQLHKKCLEKKVLYVDPEFPPDETSLFYSQKFPIQFVWKRPPEICENPRFIIDGANRTDICQGELGDCWFLAAIACLTLNQHLLFRVIPHDQSFIENYAGIFHFQFWRYGEWVDVVIDDCLPTYNNQLVFTKSNHRNEFWSALLEKAYAKLHGSYEALKGGNTTEAMEDFTGGVAEFFEIRDAPSDMYKIMKKAIERGSLMGCSIDDGTNMTYGTSPSGLNMGELIARMVRNMDNSLLQDSDLDPRGSDERPTRTIIPVQYETRMACGLVRGHAYSVTGLDEVPFKGEKVKLVRLRNPWGQVEWNGSWSDRWKDWSFVDKDEKARLQHQVTEDGEFWMSYEDFIYHFTKLEICNLTADALQSDKLQTWTVSVNEGRWVRGCSAGGCRNFPDTFWTNPQYRLKLLEEDDDPDDSEVICSFLVALMQKNRRKDRKLGASLFTIGFAIYEVPKEMHGNKQHLQKDFFLYNASKARSKTYINMREVSQRFRLPPSEYVIVPSTYEPHQEGEFILRVFSEKRNLSEEVENTISVDRPVKKKKTKPIIFVSDRANSNKELGVDQESEEGKGKTSPDKQKQSPQPQPGSSDQESEEQQQFRNIFKQIAGDDMEICADELKKVLNTVVNKHKDLKTHGFTLESCRSMIALMDTDGSGKLNLQEFHHLWNKIKAWQKIFKHYDTDQSGTINSYEMRNAVNDAGFHLNNQLYDIITMRYADKHMNIDFDSFICCFVRLEGMFRAFHAFDKDGDGIIKLNVLEWLQLTMYA,mutated_sequence,1.0,821.0,NP_000061.1.a2m,NP_000061.1.npy,ClinVar
+NP_000068.1,NP_000068.1.csv,MEPAAGSSMEPSADWLATAAARGRVEEVRALLEAGALPNAPNSYGRRPIQVMMMGSARVAELLLLHGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRAGARLDVRDAWGRLPVDLAEELGHRDVARYLRAAAGGTRGSNHARIDAAEGPSDIPD,mutated_sequence,1.0,156.0,NP_000068.1.a2m,NP_000068.1.npy,ClinVar
+NP_000071.1,NP_000071.1.csv,MARAPLGVLLLLGLLGRGVGKNEELRLYHHLFNNYDPGSRPVREPEDTVTISLKVTLTNLISLNEKEETLTTSVWIGIDWQDYRLNYSKDDFGGIETLRVPSELVWLPEIVLENNIDGQFGVAYDANVLVYEGGSVTWLPPAIYRSVCAVEVTYFPFDWQNCSLIFRSQTYNAEEVEFTFAVDNDGKTINKIDIDTEAYTENGEWAIDFCPGVIRRHHGGATDGPGETDVIYSLIIRRKPLFYVINIIVPCVLISGLVLLAYFLPAQAGGQKCTVSINVLLAQTVFLFLIAQKIPETSLSVPLLGRFLIFVMVVATLIVMNCVIVLNVSQRTPTTHAMSPRLRHVLLELLPRLLGSPPPPEAPRAASPPRRASSVGLLLRAEELILKKPRSELVFEGQRHRQGTWTAAFCQSLGAAAPEVRCCVDAVNFVAESTRDQEATGEEVSDWVRMGNALDNICFWAALVLFSVGSSLIFLGAYFNRVPDLPYAPCIQP,mutated_sequence,1.0,493.0,NP_000071.1.a2m,NP_000071.1.npy,ClinVar
+NP_000073.1,NP_000073.1.csv,MLGFLSARQTGLEDPLRLRRAESTRRVLGLELNKDRDVERIHGGGINTLDIEPVEGRYMLSGGSDGVIVLYDLENSSRQSYYTCKAVCSIGRDHPDVHRYSVETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQTADVFNFEETVYSHHMSPVSTKHCLVAVGTRGPKVQLCDLKSGSCSHILQGHRQEILAVSWSPRYDYILATASADSRVKLWDVRRASGCLITLDQHNGKKSQAVESANTAHNGKVNGLCFTSDGLHLLTVGTDNRMRLWNSSNGENTLVNYGKVCNNSKKGLKFTVSCGCSSEFVFVPYGSTIAVYTVYSGEQITMLKGHYKTVDCCVFQSNFQELYSGSRDCNILAWVPSLYEPVPDDDETTTKSQLNPAFEDAWSSSDEEG,mutated_sequence,1.0,396.0,NP_000073.1.a2m,NP_000073.1.npy,ClinVar
+NP_000074.3,NP_000074.3.csv,MEQSRSQQRGGEQSWWGSDPQYQYMPFEHCTSYGLPSENGGLQHRLRKDAGPRHNVHPTQIYGHHKEQFSDREQDIGMPKKTGSSSTVDSKDEDHYSKCQDCIHRLGQVVRRKLGEDGIFLVLLGLLMALVSWSMDYVSAKSLQAYKWSYAQMQPSLPLQFLVWVTFPLVLILFSALFCHLISPQAVGSGIPEMKTILRGVVLKEYLTMKAFVAKVVALTAGLGSGIPVGKEGPFVHIASICAAVLSKFMSVFCGVYEQPYYYSDILTVGCAVGVGCCFGTPLGGVLFSIEVTSTYFAVRNYWRGFFAATFSAFVFRVLAVWNKDAVTITALFRTNFRMDFPFDLKELPAFAAIGICCGLLGAVFVYLHRQVMLGVRKHKALSQFLAKHRLLYPGIVTFVIASFTFPPGMGQFMAGELMPREAISTLFDNNTWVKHAGDPESLGQSAVWIHPRVNVVIIIFLFFVMKFWMSIVATTMPIPCGGFMPVFVLGAAFGRLVGEIMAMLFPDGILFDDIIYKILPGGYAVIGAAALTGAVSHTVSTAVICFELTGQIAHILPMMVAVILANMVAQSLQPSLYDSIIQVKKLPYLPDLGWNQLSKYTIFVEDIMVRDVKFVSASYTYGELRTLLQTTTVKTLPLVDSKDSMILLGSVERSELQALLQRHLCPERRLRAAQEMARKLSELPYDGKARLAGEGLPGAPPGRPESFAFVDEDEDEDLSGKSELPPSLALHPSTTAPLSPEEPNGPLPGHKQQPEAPEPAGQRPSIFQSLLHCLLGRARPTKKKTTQDSTDLVDNMSPEEIEAWEQEQLSQPVCFDSCCIDQSPFQLVEQTTLHKTHTLFSLLGLHLAYVTSMGKLRGVLALEELQKAIEGHTKSGVQLRPPLASFRNTTSTRKSTGAPPSSAENWNLPEDRPGATGTGDVIAASPETPVPSPSPEPPLSLAPGKVEGELEELELVESPGLEEELADILQGPSLRSTDEEDEDELIL,mutated_sequence,1.0,988.0,NP_000074.3.a2m,NP_000074.3.npy,ClinVar
+NP_000079.2,NP_000079.2.csv,MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDKRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL,mutated_sequence,1.0,1464.0,NP_000079.2.a2m,NP_000079.2.npy,ClinVar
+NP_000080.2,NP_000080.2.csv,MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK,mutated_sequence,1.0,1366.0,NP_000080.2.a2m,NP_000080.2.npy,ClinVar
+NP_000081.2,NP_000081.2.csv,MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDVWKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVHLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL,mutated_sequence,1.0,1466.0,NP_000081.2.a2m,NP_000081.2.npy,ClinVar
+NP_000082.2,NP_000082.2.csv,MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSPGQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLVGVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPGPQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGPPGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPGKDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPGPPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSPGCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCPYIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEKGETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGDPGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGPPGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSLGCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPGTPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDPGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAIGPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLKGFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNMGMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMGKKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPGLPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFPGLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPGSHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGEKGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGPPGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPATWTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVHSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH,mutated_sequence,1.0,1670.0,NP_000082.2.a2m,NP_000082.2.npy,ClinVar
+NP_000083.3,NP_000083.3.csv,MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPEKGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGLDGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILGAVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKGQMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGEKGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVTPPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPGEAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGRGPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGLPGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGPPGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKGQKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGPPGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPGLPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPGPPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPGDEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEKGQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGLRGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVCVKYS,mutated_sequence,1.0,1690.0,NP_000083.3.a2m,NP_000083.3.npy,ClinVar
+NP_000084.3,NP_000084.3.csv,MDVHTRWKARSALRPGAPLLPPLLLLLLWAPPPSRAAQPADLLKVLDFHNLPDGITKTTGFCATRRSSKGPDVAYRVTKDAQLSAPTKQLYPASAFPEDFSILTTVKAKKGSQAFLVSIYNEQGIQQIGLELGRSPVFLYEDHTGKPGPEDYPLFRGINLSDGKWHRIALSVHKKNVTLILDCKKKTTKFLDRSDHPMIDINGIIVFGTRILDEEVFEGDIQQLLFVSDHRAAYDYCEHYSPDCDTAVPDTPQSQDPNPDEYYTEGDGEGETYYYEYPYYEDPEDLGKEPTPSKKPVEAAKETTEVPEELTPTPTEAAPMPETSEGAGKEEDVGIGDYDYVPSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPAPPPGEGADDLEGEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRGEKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPGADGLPGPPGTMLMLPFRFGGGGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGLTGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQTGPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGEPGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGITGPSGPIGPPGPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPARTCKDLQLCHPDFPDGEYWVDPNQGCSRDSFKVYCNFTAGGSTCVFPDKKSEGARITSWPKENPGSWFSEFKRGKLLSYVDAEGNPVGVVQMTFLRLLSASAHQNVTYHCYQSVAWQDAATGSYDKALRFLGSNDEEMSYDNNPYIRALVDGCATKKGYQKTVLEIDTPKVEQVPIVDIMFNDFGEASQKFGFEVGPACFMG,mutated_sequence,1.0,1838.0,NP_000084.3.a2m,NP_000084.3.npy,ClinVar
+NP_000085.1,NP_000085.1.csv,MTLRLLVAALCAGILAEAPRVRAQHRERVTCTRLYAADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTTAGGVPVTRPPDDSTSAPRDLVLSEPSSQSLRVQWTAASGPVTGYKVQYTPLTGLGQPLPSERQEVNVPAGETSVRLRGLRPLTEYQVTVIALYANSIGEAVSGTARTTALEGPELTIQNTTAHSLLVAWRSVPGATGYRVTWRVLSGGPTQQQELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTDASVEQTLRPVILGPTSILLSWNLVPEARGYRLEWRRETGLEPPQKVVLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVPGATQYRIIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPREGSASVLTVRREPETPLAVPGLRVVVSDATRVRVAWGPVPGASGFRISWSTGSGPESSQTLPPDSTATDITGLQPGTTYQVAVSVLRGREEGPAAVIVARTDPLGPVRTVHVTQASSSSVTITWTRVPGATGYRVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGVDGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGVTGATAYRLAWGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDREGTPVSIVVTTPPEAPPALGTLHVVQRGEHSLRLRWEPVPRAQGFLLHWQPEGGQEQSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAGEGPSAEVTARTESPRVPSIELRVVDTSIDSVTLAWTPVSRASSYILSWRPLRGPGQEVPGSPQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGVRGPEASVTQTPVCPRGLADVVFLPHATQDNAHRAEATRRVLERLVLALGPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDAPGRRQHVPGVMVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLAPGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPRPEPCPVYCPKGQKGEPGEMGLRGQVGPPGDPGLPGRTGAPGPQGPPGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQVIGGEGPGLPGRKGDPGPSGPPGPRGPLGDPGPRGPPGLPGTAMKGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGPPGKKGEKGDSEDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRPGAKGPEGPPGPTGRQGEKGEPGRPGDPAVVGPAVAGPKGEKGDVGPAGPRGATGVQGERGPPGLVLPGDPGPKGDPGDRGPIGLTGRAGPPGDSGPPGEKGDPGRPGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGSPGSSGPKGDRGEPGPPGPPGRLVDTGPGAREKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGKPGLNGKNGEPGDPGEDGRKGEKGDSGASGREGRDGPKGERGAPGILGPQGPPGLPGPVGPPGQGFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEPGSVPNVDRLLETAGIKASALREIVETWDESSGSFLPVPERRRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPGLALGERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRDGPPGLPGTPGPPGPPGPKVSVDEPGPGLSGEQGPPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGMAGPEGKPGLQGPRGPPGPVGGHGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRGLTGPTGAVGLPGPPGPSGLVGPQGSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPGGLAGDLVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKGFKGDPGVGVPGSPGPPGPPGVKGDLGLPGLPGAPGVVGFPGQTGPRGEMGQPGPSGERGLAGPPGREGIPGPLGPPGPPGSVGPPGASGLKGDKGDPGVGLPGPRGERGEPGIRGEDGRPGQEGPRGLTGPPGSRGERGEKGDVGSAGLKGDKGDSAVILGPPGPRGAKGDMGERGPRGLDGDKGPRGDNGDPGDKGSKGEPGDKGSAGLPGLRGLLGPQGQPGAAGIPGDPGSPGKDGVPGIRGEKGDVGFMGPRGLKGERGVKGACGLDGEKGDKGEAGPPGRPGLAGHKGEMGEPGVPGQSGAPGKEGLIGPKGDRGFDGQPGPKGDQGEKGERGTPGIGGFPGPSGNDGSAGPPGPPGSVGPRGPEGLQGQKGERGPPGERVVGAPGVPGAPGERGEQGRPGPAGPRGEKGEAALTEDDIRGFVRQEMSQHCACQGQFIASGSRPLPSYAADTAGSQLHAVPVLRVSHAEEEERVPPEDDEYSEYSEYSVEEYQDPEAPWDSDDPCSLPLDEGSCTAYTLRWYHRAVTGSTEACHPFVYGGCGGNANRFGTREACERRCPPRVVQSQGTGTAQD,mutated_sequence,1.0,2944.0,NP_000085.1.a2m,NP_000085.1.npy,ClinVar
+NP_000086.2,NP_000086.2.csv,MVPDTACVLLLTLAALGASGQGQSPLGSDLGPQMLRELQETNAALQDVRELLRQQVREITFLKNTVMECDACGMQQSVRTGLPSVRPLLHCAPGFCFPGVACIQTESGARCGPCPAGFTGNGSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSGPTHQGVGLAFAKANKQVCTDINECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQASGCQRRAQRFCPDGSPSECHEHADCVLERDGSRSCVCAVGWAGNGILCGRDTDLDGFPDEKLRCPERQCRKDNCVTVPNSGQEDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNPDQRNTDEDKWGDACDNCRSQKNDDQKDTDQDGRGDACDDDIDGDRIRNQADNCPRVPNSDQKDSDGDGIGDACDNCPQKSNPDQADVDHDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQEDSDHDGQGDACDDDDDNDGVPDSRDNCRLVPNPGQEDADRDGVGDVCQDDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLDPEGDAQIDPNWVVLNQGREIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIFGYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTESQVRLLWKDPRNVGWKDKKSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGRLGVFCFSQENIIWANLRYRCNDTIPEDYETHQLRQA,mutated_sequence,1.0,757.0,NP_000086.2.a2m,NP_000086.2.npy,ClinVar
+NP_000089.1,NP_000089.1.csv,MVPRLLLRAWPRGPAVGPGAPSRPLSAGSGPGQYLQRSIVPTMHYQDSLPRLPIPKLEDTIRRYLSAQKPLLNDGQFRKTEQFCKSFENGIGKELHEQLVALDKQNKHTSYISGPWFDMYLSARDSVVLNFNPFMAFNPDPKSEYNDQLTRATNMTVSAIRFLKTLRAGLLEPEVFHLNPAKSDTITFKRLIRFVPSSLSWYGAYLVNAYPLDMSQYFRLFNSTRLPKPSRDELFTDDKARHLLVLRKGNFYIFDVLDQDGNIVSPSEIQAHLKYILSDSSPAPEFPLAYLTSENRDIWAELRQKLMSSGNEESLRKVDSAVFCLCLDDFPIKDLVHLSHNMLHGDGTNRWFDKSFNLIIAKDGSTAVHFEHSWGDGVAVLRFFNEVFKDSTQTPAVTPQSQPATTDSTVTVQKLNFELTDALKTGITAAKEKFDATMKTLTIDCVQFQRGGKEFLKKQKLSPDAVAQLAFQMAFLRQYGQTVATYESCSTAAFKHGRTETIRPASVYTKRCSEAFVREPSRHSAGELQQMMVECSKYHGQLTKEAAMGQGFDRHLFALRHLAAAKGIILPELYLDPAYGQINHNVLSTSTLSSPAVNLGGFAPVVSDGFGVGYAVHDNWIGCNVSSYPGRNAREFLQCVEKALEDMFDALEGKSIKS,mutated_sequence,1.0,658.0,NP_000089.1.a2m,NP_000089.1.npy,ClinVar
+NP_000093.1,NP_000093.1.csv,MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLPRHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATLDIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATHNGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLVPWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNGNAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQNVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEVIINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST,mutated_sequence,1.0,508.0,NP_000093.1.a2m,NP_000093.1.npy,ClinVar
+NP_000095.2,NP_000095.2.csv,MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPAFASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQPRSRQVLEGHVLSEARELVALLVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDMMDAFILSAEKKAAGDSHGGGARLDLENVPATITDIFGASQDTLSTALQWLLLLFTRYPDVQTRVQAELDQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTVVFVNQWSVNHDPLKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRANPNEPAKMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ,mutated_sequence,1.0,543.0,NP_000095.2.a2m,NP_000095.2.npy,ClinVar
+NP_000102.1,NP_000102.1.csv,MIEPFGNQYIVARPVYSTNAFEENHKKTGRHHKTFLDHLKVCCSCSPQKAKRIVLSLFPIASWLPAYRLKEWLLSDIVSGISTGIVAVLQGLAFALLVDIPPVYGLYASFFPAIIYLFFGTSRHISVGPFPILSMMVGLAVSGAVSKAVPDRNATTLGLPNNSNNSSLLDDERVRVAAAASVTVLSGIIQLAFGILRIGFVVIYLSESLISGFTTAAAVHVLVSQLKFIFQLTVPSHTDPVSIFKVLYSVFSQIEKTNIADLVTALIVLLVVSIVKEINQRFKDKLPVPIPIEFIMTVIAAGVSYGCDFKNRFKVAVVGDMNPGFQPPITPDVETFQNTVGDCFGIAMVAFAVAFSVASVYSLKYDYPLDGNQELIALGLGNIVCGVFRGFAGSTALSRSAVQESTGGKTQIAGLIGAIIVLIVVLAIGFLLAPLQKSVLAALALGNLKGMLMQFAEIGRLWRKDKYDCLIWIMTFIFTIVLGLGLGLAASVAFQLLTIVFRTQFPKCSTLANIGRTNIYKNKKDYYDMYEPEGVKIFRCPSPIYFANIGFFRRKLIDAVGFSPLRILRKRNKALRKIRKLQKQGLLQVTPKGFICTVDTIKDSDEELDNNQIEVLDQPINTTDLPFHIDWNDDLPLNIEVPKISLHSLILDFSAVSFLDVSSVRGLKSILQEFIRIKVDVYIVGTDDDFIEKLNRYEFFDGEVKSSIFFLTIHDAVLHILMKKDYSTSKFNPSQEKDGKIDFTINTNGGLRNRVYEVPVETKF,mutated_sequence,1.0,764.0,NP_000102.1.a2m,NP_000102.1.npy,ClinVar
+NP_000103.2,NP_000103.2.csv,MSSESKEQHNVSPRDSAEGNDSYPSGIHLELQRESSTDFKQFETNDQCRPYHRILIERQEKSDTNFKEFVIKKLQKNCQCSPAKAKNMILGFLPVLQWLPKYDLKKNILGDVMSGLIVGILLVPQSIAYSLLAGQEPVYGLYTSFFASIIYFLLGTSRHISVGIFGVLCLMIGETVDRELQKAGYDNAHSAPSLGMVSNGSTLLNHTSDRICDKSCYAIMVGSTVTFIAGVYQVAMGFFQVGFVSVYLSDALLSGFVTGASFTILTSQAKYLLGLNLPRTNGVGSLITTWIHVFRNIHKTNLCDLITSLLCLLVLLPTKELNEHFKSKLKAPIPIELVVVVAATLASHFGKLHENYNSSIAGHIPTGFMPPKVPEWNLIPSVAVDAIAISIIGFAITVSLSEMFAKKHGYTVKANQEMYAIGFCNIIPSFFHCFTTSAALAKTLVKESTGCHTQLSGVVTALVLLLVLLVIAPLFYSLQKSVLGVITIVNLRGALRKFRDLPKMWSISRMDTVIWFVTMLSSALLSTEIGLLVGVCFSIFCVILRTQKPKSSLLGLVEESEVFESVSAYKNLQIKPGIKIFRFVAPLYYINKECFKSALYKQTVNPILIKVAWKKAAKRKIKEKVVTLGGIQDEMSVQLSHDPLELHTIVIDCSAIQFLDTAGIHTLKEVRRDYEAIGIQVLLAQCNPTVRDSLTNGEYCKKEEENLLFYSVYEAMAFAEVSKNQKGVCVPNGLSLSSD,mutated_sequence,1.0,739.0,NP_000103.2.a2m,NP_000103.2.npy,ClinVar
+NP_000104.1,NP_000104.1.csv,MKLGRAVLGLLLLAPSVVQAVEPISLGLALAGVLTGYIYPRLYCLFAECCGQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSLHGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQLQLWIRGNVSACARSIFIFDEMDKMHAGLIDAIKPFLDYYDLVDGVSYQKAMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNSGFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRVAEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD,mutated_sequence,1.0,332.0,NP_000104.1.a2m,NP_000104.1.npy,ClinVar
+NP_000117.1,NP_000117.1.csv,MFRAAAPGQLRRAASLLRFQSTLVIAEHANDSLAPITLNTITAATRLGGEVSCLVAGTKCDKVAQDLCKVAGIAKVLVAQHDVYKGLLPEELTPLILATQKQFNYTHICAGASAFGKNLLPRVAAKLEVAPISDIIAIKSPDTFVRTIYAGNALCTVKCDEKVKVFSVRGTSFDAAATSGGSASSEKASSTSPVEISEWLDQKLTKSDRPELTGAKVVVSGGRGLKSGENFKLLYDLADQLHAAVGASRAAVDAGFVPNDMQVGQTGKIVAPELYIAVGISGAIQHLAGMKDSKTIVAINKDPEAPIFQVADYGIVADLFKVVPEMTEILKKK,mutated_sequence,1.0,333.0,NP_000117.1.a2m,NP_000117.1.npy,ClinVar
+NP_000118.2,NP_000118.2.csv,MQAKKRYFILLSAGSCLALLFYFGGLQFRASRSHSRREEHSGRNGLHHPSPDHFWPRFPDALRPFVPWDQLENEDSSVHISPRQKRDANSSIYKGKKCRMESCFDFTLCKKNGFKVYVYPQQKGEKIAESYQNILAAIEGSRFYTSDPSQACLFVLSLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEIIQDRIFKHISRNSLIWNKHPGGLFVLPQYSSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQIIVLWNCDKPLPAKHRWPATAVPVVVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDNSKERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYSHYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWADPDHFAQRQSCMNTFASWFGYMPLIHSQMRLDPVLFKDQVSILRKKYRDIERL,mutated_sequence,1.0,746.0,NP_000118.2.a2m,NP_000118.2.npy,ClinVar
+NP_000119.1,NP_000119.1.csv,MIFLYQVVHFILFTSVSGECVTQLLKDTCFEGGDITTVFTPSAKYCQVVCTYHPRCLLFTFTAESPSEDPTRWFTCVLKDSVTETLPRVNRTAAISGYSFKQCSHQISACNKDIYVDLDMKGINYNSSVAKSAQECQERCTDDVHCHFFTYATRQFPSLEHRNICLLKHTQTGTPTRITKLDKVVSGFSLKSCALSNLACIRDIFPNTVFADSNIDSVMAPDAFVCGRICTHHPGCLFFTFFSQEWPKESQRNLCLLKTSESGLPSTRIKKSKALSGFSLQSCRHSIPVFCHSSFYHDTDFLGEELDIVAAKSHEACQKLCTNAVRCQFFTYTPAQASCNEGKGKCYLKLSSNGSPTKILHGRGGISGYTLRLCKMDNECTTKIKPRIVGGTASVRGEWPWQVTLHTTSPTQRHLCGGSIIGNQWILTAAHCFYGVESPKILRVYSGILNQSEIKEDTSFFGVQEIIIHDQYKMAESGYDIALLKLETTVNYTDSQRPICLPSKGDRNVIYTDCWVTGWGYRKLRDKIQNTLQKAKIPLVTNEECQKRYRGHKITHKMICAGYREGGKDACKGDSGGPLSCKHNEVWHLVGITSWGEGCAQRERPGVYTNVVEYVDWILEKTQAV,mutated_sequence,1.0,625.0,NP_000119.1.a2m,NP_000119.1.npy,ClinVar
+NP_000123.1,NP_000123.1.csv,MQIELSTCFFLCLLRFCFSATRRYYLGAVELSWDYMQSDLGELPVDARFPPRVPKSFPFNTSVVYKKTLFVEFTDHLFNIAKPRPPWMGLLGPTIQAEVYDTVVITLKNMASHPVSLHAVGVSYWKASEGAEYDDQTSQREKEDDKVFPGGSHTYVWQVLKENGPMASDPLCLTYSYLSHVDLVKDLNSGLIGALLVCREGSLAKEKTQTLHKFILLFAVFDEGKSWHSETKNSLMQDRDAASARAWPKMHTVNGYVNRSLPGLIGCHRKSVYWHVIGMGTTPEVHSIFLEGHTFLVRNHRQASLEISPITFLTAQTLLMDLGQFLLFCHISSHQHDGMEAYVKVDSCPEEPQLRMKNNEEAEDYDDDLTDSEMDVVRFDDDNSPSFIQIRSVAKKHPKTWVHYIAAEEEDWDYAPLVLAPDDRSYKSQYLNNGPQRIGRKYKKVRFMAYTDETFKTREAIQHESGILGPLLYGEVGDTLLIIFKNQASRPYNIYPHGITDVRPLYSRRLPKGVKHLKDFPILPGEIFKYKWTVTVEDGPTKSDPRCLTRYYSSFVNMERDLASGLIGPLLICYKESVDQRGNQIMSDKRNVILFSVFDENRSWYLTENIQRFLPNPAGVQLEDPEFQASNIMHSINGYVFDSLQLSVCLHEVAYWYILSIGAQTDFLSVFFSGYTFKHKMVYEDTLTLFPFSGETVFMSMENPGLWILGCHNSDFRNRGMTALLKVSSCDKNTGDYYEDSYEDISAYLLSKNNAIEPRSFSQNSRHPSTRQKQFNATTIPENDIEKTDPWFAHRTPMPKIQNVSSSDLLMLLRQSPTPHGLSLSDLQEAKYETFSDDPSPGAIDSNNSLSEMTHFRPQLHHSGDMVFTPESGLQLRLNEKLGTTAATELKKLDFKVSSTSNNLISTIPSDNLAAGTDNTSSLGPPSMPVHYDSQLDTTLFGKKSSPLTESGGPLSLSEENNDSKLLESGLMNSQESSWGKNVSSTESGRLFKGKRAHGPALLTKDNALFKVSISLLKTNKTSNNSATNRKTHIDGPSLLIENSPSVWQNILESDTEFKKVTPLIHDRMLMDKNATALRLNHMSNKTTSSKNMEMVQQKKEGPIPPDAQNPDMSFFKMLFLPESARWIQRTHGKNSLNSGQGPSPKQLVSLGPEKSVEGQNFLSEKNKVVVGKGEFTKDVGLKEMVFPSSRNLFLTNLDNLHENNTHNQEKKIQEEIEKKETLIQENVVLPQIHTVTGTKNFMKNLFLLSTRQNVEGSYDGAYAPVLQDFRSLNDSTNRTKKHTAHFSKKGEEENLEGLGNQTKQIVEKYACTTRISPNTSQQNFVTQRSKRALKQFRLPLEETELEKRIIVDDTSTQWSKNMKHLTPSTLTQIDYNEKEKGAITQSPLSDCLTRSHSIPQANRSPLPIAKVSSFPSIRPIYLTRVLFQDNSSHLPAASYRKKDSGVQESSHFLQGAKKNNLSLAILTLEMTGDQREVGSLGTSATNSVTYKKVENTVLPKPDLPKTSGKVELLPKVHIYQKDLFPTETSNGSPGHLDLVEGSLLQGTEGAIKWNEANRPGKVPFLRVATESSAKTPSKLLDPLAWDNHYGTQIPKEEWKSQEKSPEKTAFKKKDTILSLNACESNHAIAAINEGQNKPEIEVTWAKQGRTERLCSQNPPVLKRHQREITRTTLQSDQEEIDYDDTISVEMKKEDFDIYDEDENQSPRSFQKKTRHYFIAAVERLWDYGMSSSPHVLRNRAQSGSVPQFKKVVFQEFTDGSFTQPLYRGELNEHLGLLGPYIRAEVEDNIMVTFRNQASRPYSFYSSLISYEEDQRQGAEPRKNFVKPNETKTYFWKVQHHMAPTKDEFDCKAWAYFSDVDLEKDVHSGLIGPLLVCHTNTLNPAHGRQVTVQEFALFFTIFDETKSWYFTENMERNCRAPCNIQMEDPTFKENYRFHAINGYIMDTLPGLVMAQDQRIRWYLLSMGSNENIHSIHFSGHVFTVRKKEEYKMALYNLYPGVFETVEMLPSKAGIWRVECLIGEHLHAGMSTLFLVYSNKCQTPLGMASGHIRDFQITASGQYGQWAPKLARLHYSGSINAWSTKEPFSWIKVDLLAPMIIHGIKTQGARQKFSSLYISQFIIMYSLDGKKWQTYRGNSTGTLMVFFGNVDSSGIKHNIFNPPIIARYIRLHPTHYSIRSTLRMELMGCDLNSCSMPLGMESKAISDAQITASSYFTNMFATWSPSKARLHLQGRSNAWRPQVNNPKEWLQVDFQKTMKVTGVTTQGVKSLLTSMYVKEFLISSSQDGHQWTLFFQNGKVKVFQGNQDSFTPVVNSLDPPLLTRYLRIHPQSWVHQIALRMEVLGCEAQDLY,mutated_sequence,1.0,2351.0,NP_000123.1.a2m,NP_000123.1.npy,ClinVar
+NP_000126.2,NP_000126.2.csv,MSDSWVPNSASGQDPGGRRRAWAELLAGRVKREKYNPERAQKLKESAVRLLRSHQDLNALLLEVEGPLCKKLSLSKVIDCDSSEAYANHSSSFIGSALQDQASRLGVPVGILSAGMVASSVGQICTAPAETSHPVLLTVEQRKKLSSLLEFAQYLLAHSMFSRLSFCQELWKIQSSLLLEAVWHLHVQGIVSLQELLESHPDMHAVGSWLFRNLCCLCEQMEASCQHADVARAMLSDFVQMFVLRGFQKNSDLRRTVEPEKMPQVTVDVLQRMLIFALDALAAGVQEESSTHKIVRCWFGVFSGHTLGSVISTDPLKRFFSHTLTQILTHSPVLKASDAVQMQREWSFARTHPLLTSLYRRLFVMLSAEELVGHLQEVLETQEVHWQRVLSFVSALVVCFPEAQQLLEDWVARLMAQAFESCQLDSMVTAFLVVRQAALEGPSAFLSYADWFKASFGSTRGYHGCSKKALVFLFTFLSELVPFESPRYLQVHILHPPLVPGKYRSLLTDYISLAKTRLADLKVSIENMGLYEDLSSAGDITEPHSQALQDVEKAIMVFEHTGNIPVTVMEASIFRRPYYVSHFLPALLTPRVLPKVPDSRVAFIESLKRADKIPPSLYSTYCQACSAAEEKPEDAALGVRAEPNSAEEPLGQLTAALGELRASMTDPSQRDVISAQVAVISERLRAVLGHNEDDSSVEISKIQLSINTPRLEPREHMAVDLLLTSFCQNLMAASSVAPPERQGPWAALFVRTMCGRVLPAVLTRLCQLLRHQGPSLSAPHVLGLAALAVHLGESRSALPEVDVGPPAPGAGLPVPALFDSLLTCRTRDSLFFCLKFCTAAISYSLCKFSSQSRDTLCSCLSPGLIKKFQFLMFRLFSEARQPLSEEDVASLSWRPLHLPSADWQRAALSLWTHRTFREVLKEEDVHLTYQDWLHLELEIQPEADALSDTERQDFHQWAIHEHFLPESSASGGCDGDLQAACTILVNALMDFHQSSRSYDHSENSDLVFGGRTGNEDIISRLQEMVADLELQQDLIVPLGHTPSQEHFLFEIFRRRLQALTSGWSVAASLQRQRELLMYKRILLRLPSSVLCGSSFQAEQPITARCEQFFHLVNSEMRNFCSHGGALTQDITAHFFRGLLNACLRSRDPSLMVDFILAKCQTKCPLILTSALVWWPSLEPVLLCRWRRHCQSPLPRELQKLQEGRQFASDFLSPEAASPAPNPDWLSAAALHFAIQQVREENIRKQLKKLDCEREELLVFLFFFSLMGLLSSHLTSNSTTDLPKAFHVCAAILECLEKRKISWLALFQLTESDLRLGRLLLRVAPDQHTRLLPFAFYSLLSYFHEDAAIREEAFLHVAVDMYLKLVQLFVAGDTSTVSPPAGRSLELKGQGNPVELITKARLFLLQLIPRCPKKSFSHVAELLADRGDCDPEVSAALQSRQQAAPDADLSQEPHLF,mutated_sequence,1.0,1455.0,NP_000126.2.a2m,NP_000126.2.npy,ClinVar
+NP_000129.3,NP_000129.3.csv,MRRGRLLEIALGFTVLLASYTSHGADANLEAGNVKETRASRAKRRGGGGHDALKGPNVCGSRYNAYCCPGWKTLPGGNQCIVPICRHSCGDGFCSRPNMCTCPSGQIAPSCGSRSIQHCNIRCMNGGSCSDDHCLCQKGYIGTHCGQPVCESGCLNGGRCVAPNRCACTYGFTGPQCERDYRTGPCFTVISNQMCQGQLSGIVCTKTLCCATVGRAWGHPCEMCPAQPHPCRRGFIPNIRTGACQDVDECQAIPGLCQGGNCINTVGSFECKCPAGHKLNEVSQKCEDIDECSTIPGICEGGECTNTVSSYFCKCPPGFYTSPDGTRCIDVRPGYCYTALTNGRCSNQLPQSITKMQCCCDAGRCWSPGVTVAPEMCPIRATEDFNKLCSVPMVIPGRPEYPPPPLGPIPPVLPVPPGFPPGPQIPVPRPPVEYLYPSREPPRVLPVNVTDYCQLVRYLCQNGRCIPTPGSYRCECNKGFQLDLRGECIDVDECEKNPCAGGECINNQGSYTCQCRAGYQSTLTRTECRDIDECLQNGRICNNGRCINTDGSFHCVCNAGFHVTRDGKNCEDMDECSIRNMCLNGMCINEDGSFKCICKPGFQLASDGRYCKDINECETPGICMNGRCVNTDGSYRCECFPGLAVGLDGRVCVDTHMRSTCYGGYKRGQCIKPLFGAVTKSECCCASTEYAFGEPCQPCPAQNSAEYQALCSSGPGMTSAGSDINECALDPDICPNGICENLRGTYKCICNSGYEVDSTGKNCVDINECVLNSLLCDNGQCRNTPGSFVCTCPKGFIYKPDLKTCEDIDECESSPCINGVCKNSPGSFICECSSESTLDPTKTICIETIKGTCWQTVIDGRCEININGATLKSQCCSSLGAAWGSPCTLCQVDPICGKGYSRIKGTQCEDIDECEVFPGVCKNGLCVNTRGSFKCQCPSGMTLDATGRICLDIRLETCFLRYEDEECTLPIAGRHRMDACCCSVGAAWGTEECEECPMRNTPEYEELCPRGPGFATKEITNGKPFFKDINECKMIPSLCTHGKCRNTIGSFKCRCDSGFALDSEERNCTDIDECRISPDLCGRGQCVNTPGDFECKCDEGYESGFMMMKNCMDIDECQRDPLLCRGGVCHNTEGSYRCECPPGHQLSPNISACIDINECELSAHLCPNGRCVNLIGKYQCACNPGYHSTPDRLFCVDIDECSIMNGGCETFCTNSEGSYECSCQPGFALMPDQRSCTDIDECEDNPNICDGGQCTNIPGEYRCLCYDGFMASEDMKTCVDVNECDLNPNICLSGTCENTKGSFICHCDMGYSGKKGKTGCTDINECEIGAHNCGKHAVCTNTAGSFKCSCSPGWIGDGIKCTDLDECSNGTHMCSQHADCKNTMGSYRCLCKEGYTGDGFTCTDLDECSENLNLCGNGQCLNAPGGYRCECDMGFVPSADGKACEDIDECSLPNICVFGTCHNLPGLFRCECEIGYELDRSGGNCTDVNECLDPTTCISGNCVNTPGSYICDCPPDFELNPTRVGCVDTRSGNCYLDIRPRGDNGDTACSNEIGVGVSKASCCCSLGKAWGTPCEMCPAVNTSEYKILCPGGEGFRPNPITVILEDIDECQELPGLCQGGKCINTFGSFQCRCPTGYYLNEDTRVCDDVNECETPGICGPGTCYNTVGNYTCICPPDYMQVNGGNNCMDMRRSLCYRNYYADNQTCDGELLFNMTKKMCCCSYNIGRAWNKPCEQCPIPSTDEFATLCGSQRPGFVIDIYTGLPVDIDECREIPGVCENGVCINMVGSFRCECPVGFFYNDKLLVCEDIDECQNGPVCQRNAECINTAGSYRCDCKPGYRFTSTGQCNDRNECQEIPNICSHGQCIDTVGSFYCLCHTGFKTNDDQTMCLDINECERDACGNGTCRNTIGSFNCRCNHGFILSHNNDCIDVDECASGNGNLCRNGQCINTVGSFQCQCNEGYEVAPDGRTCVDINECLLEPRKCAPGTCQNLDGSYRCICPPGYSLQNEKCEDIDECVEEPEICALGTCSNTEGSFKCLCPEGFSLSSSGRRCQDLRMSYCYAKFEGGKCSSPKSRNHSKQECCCALKGEGWGDPCELCPTEPDEAFRQICPYGSGIIVGPDDSAVDMDECKEPDVCKHGQCINTDGSYRCECPFGYILAGNECVDTDECSVGNPCGNGTCKNVIGGFECTCEEGFEPGPMMTCEDINECAQNPLLCAFRCVNTYGSYECKCPVGYVLREDRRMCKDEDECEEGKHDCTEKQMECKNLIGTYMCICGPGYQRRPDGEGCVDENECQTKPGICENGRCLNTRGSYTCECNDGFTASPNQDECLDNREGYCFTEVLQNMCQIGSSNRNPVTKSECCCDGGRGWGPHCEICPFQGTVAFKKLCPHGRGFMTNGADIDECKVIHDVCRNGECVNDRGSYHCICKTGYTPDITGTSCVDLNECNQAPKPCNFICKNTEGSYQCSCPKGYILQEDGRSCKDLDECATKQHNCQFLCVNTIGGFTCKCPPGFTQHHTSCIDNNECTSDINLCGSKGICQNTPGSFTCECQRGFSLDQTGSSCEDVDECEGNHRCQHGCQNIIGGYRCSCPQGYLQHYQWNQCVDENECLSAHICGGASCHNTLGSYKCMCPAGFQYEQFSGGCQDINECGSAQAPCSYGCSNTEGGYLCGCPPGYFRIGQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECKINGYPKRGRKRRSTNETDASNIEDQSETEANVSLASWDVEKTAIFAFNISHVSNKVRILELLPALTTLTNHNRYLIESGNEDGFFKINQKEGISYLHFTKKKPVAGTYSLQISSTPLYKKKELNQLEDKYDKDYLSGELGDNLKMKIQVLLH,mutated_sequence,1.0,2871.0,NP_000129.3.a2m,NP_000129.3.npy,ClinVar
+NP_000132.3,NP_000132.3.csv,MVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEEPPTKYQISQPEVYVAAPGESLEVRCLLKDAAVISWTKDGVHLGPNNRTVLIGEYLQIKGATPRDSGLYACTASRTVDSETWYFMVNVTDAISSGDDEDDTDGAEDFVSENSNNKRAPYWTNTEKMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVVGGDVEFVCKVYSDAQPHIQWIKHVEKNGSKYGPDGLPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAGNSIGISFHSAWLTVLPAPGREKEITASPDYLEIAIYCIGVFLIACMVVTVILCRMKNTTKKPDFSSQPAVHKLTKRIPLRRQVTVSAESSSSMNSNTPLVRITTRLSSTADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIDKDKPKEAVTVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIHRDLAARNVLVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVLMWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRILTLTTNEEYLDLSQPLEQYSPSYPDTRSSCSSGDDSVFSPDPMPYEPCLPQYPHINGSVKT,mutated_sequence,1.0,821.0,NP_000132.3.a2m,NP_000132.3.npy,ClinVar
+NP_000134.2,NP_000134.2.csv,MYRALRLLARSRPLVRAPAAALASAPGLGGAAVPSFWPPNAARMASQNSFRIEYDTFGELKVPNDKYYGAQTVRSTMNFKIGGVTERMPTPVIKAFGILKRAAAEVNQDYGLDPKIANAIMKAADEVAEGKLNDHFPLVVWQTGSGTQTNMNVNEVISNRAIEMLGGELGSKIPVHPNDHVNKSQSSNDTFPTAMHIAAAIEVHEVLLPGLQKLHDALDAKSKEFAQIIKIGRTHTQDAVPLTLGQEFSGYVQQVKYAMTRIKAAMPRIYELAAGGTAVGTGLNTRIGFAEKVAAKVAALTGLPFVTAPNKFEALAAHDALVELSGAMNTTACSLMKIANDIRFLGSGPRSGLGELILPENEPGSSIMPGKVNPTQCEAMTMVAAQVMGNHVAVTVGGSNGHFELNVFKPMMIKNVLHSARLLGDASVSFTENCVVGIQANTERINKLMNESLMLVTALNPHIGYDKAAKIAKTAHKNGSTLKETAIELGYLTAEQFDEWVKPKDMLGPK,mutated_sequence,1.0,510.0,NP_000134.2.a2m,NP_000134.2.npy,ClinVar
+NP_000135.2,NP_000135.2.csv,MWTLGRRAVAGLLASPSPAQAQTLTRVPRPAELAPLCGRRGLRTDIDATCTPRRASSNQRGLNQIWNVKKQSVYLMNLRKSGTLGHPGSLDETTYERLAEETLDSLAEFFEDLADKPYTFEDYDVSFGSGVLTVKLGGDLGTYVINKQTPNKQIWLSSPSSGPKRYDWTGKNWVYSHDGVSLHELLAAELTKALKTKLDLSSLAYSGKDA,mutated_sequence,1.0,210.0,NP_000135.2.a2m,NP_000135.2.npy,ClinVar
+NP_000142.2,NP_000142.2.csv,MEEGMNVLHDFGIQSTHYLQVNYQDSQDWFILVSVIADLRNAFYVLFPIWFHLQEAVGIKLLWVAVIGDWLNLVFKWILFGQRPYWWVLDTDYYSNTSVPLIKQFPVTCETGPGSPSGHAMGTAGVYYVMVTSTLSIFQGKIKPTYRFRCLNVILWLGFWAVQLNVCLSRIYLAAHFPHQVVAGVLSGIAVAETFSHIHSIYNASLKKYFLITFFLFSFAIGFYLLLKGLGVDLLWTLEKAQRWCEQPEWVHIDTTPFASLLKNLGTLFGLGLALNSSMYRESCKGKLSKWLPFRLSSIVASLVLLHVFDSLKPPSQVELVFYVLSFCKSAVVPLASVSVIPYCLAQVLGQPHKKSL,mutated_sequence,1.0,357.0,NP_000142.2.a2m,NP_000142.2.npy,ClinVar
+NP_000143.2,NP_000143.2.csv,MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSGSSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQCEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFFPKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPHVHSRAPSPLYSVEFSEEPFGVIVRRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRITLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTGGILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMTRAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSGPAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHDQVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLSTHYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQLASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQEPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTVDHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPVEALGSLPPPPAAPREPAIHSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGELFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQQVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC,mutated_sequence,1.0,952.0,NP_000143.2.a2m,NP_000143.2.npy,ClinVar
+NP_000144.2,NP_000144.2.csv,MAEWLLSASWQRRAKAMTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGAVSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFTSGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR,mutated_sequence,1.0,685.0,NP_000144.2.a2m,NP_000144.2.npy,ClinVar
+NP_000146.2,NP_000146.2.csv,MSRSGTDPQQRQQASEADAAAATFRANDHQHIRYNPLQDEWVLVSAHRMKRPWQGQVEPQLLKTVPRHDPLNPLCPGAIRANGEVNPQYDSTFLFDNDFPALQPDAPSPGPSDHPLFQAKSARGVCKVMCFHPWSDVTLPLMSVPEIRAVVDAWASVTEELGAQYPWVQIFENKGAMMGCSNPHPHCQVWASSFLPDIAQREERSQQAYKSQHGEPLLMEYSRQELLRKERLVLTSEHWLVLVPFWATWPYQTLLLPRRHVRRLPELTPAERDDLASIMKKLLTKYDNLFETSFPYSMGWHGAPTGSEAGANWNHWQLHAHYYPPLLRSATVRKFMVGYEMLAQAQRDLTPEQAAERLRALPEVHYHLGQKDRETATIA,mutated_sequence,1.0,379.0,NP_000146.2.a2m,NP_000146.2.npy,ClinVar
+NP_000148.2,NP_000148.2.csv,MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFLETISPGYSIHTYLWRRQ,mutated_sequence,1.0,536.0,NP_000148.2.a2m,NP_000148.2.npy,ClinVar
+NP_000152.1,NP_000152.1.csv,MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRSEEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDAIFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQVQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGVQKMNSKTVTSTMLGVFREDPKTREEFLTLIRS,mutated_sequence,1.0,250.0,NP_000152.1.a2m,NP_000152.1.npy,ClinVar
+NP_000153.1,NP_000153.1.csv,MLDDRARMEAAKKEKVEQILAEFQLQEEDLKKVMRRMQKEMDRGLRLETHEEASVKMLPTYVRSTPEGSEVGDFLSLDLGGTNFRVMLVKVGEGEEGQWSVKTKHQMYSIPEDAMTGTAEMLFDYISECISDFLDKHQMKHKKLPLGFTFSFPVRHEDIDKGILLNWTKGFKASGAEGNNVVGLLRDAIKRRGDFEMDVVAMVNDTVATMISCYYEDHQCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCVNTEWGAFGDSGELDEFLLEYDRLVDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDENLLFHGEASEQLRTRGAFETRFVSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACESVSTRAAHMCSAGLAGVINRMRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPSCEITFIESEEGSGRGAALVSAVACKKACMLGQ,mutated_sequence,1.0,465.0,NP_000153.1.a2m,NP_000153.1.npy,ClinVar
+NP_000156.1,NP_000156.1.csv,MGDWSALGKLLDKVQAYSTAGGKVWLSVLFIFRILLLGTAVESAWGDEQSAFRCNTQQPGCENVCYDKSFPISHVRFWVLQIIFVSVPTLLYLAHVFYVMRKEEKLNKKEEELKVAQTDGVNVDMHLKQIEIKKFKYGIEEHGKVKMRGGLLRTYIISILFKSIFEVAFLLIQWYIYGFSLSAVYTCKRDPCPHQVDCFLSRPTEKTIFIIFMLVVSLVSLALNIIELFYVFFKGVKDRVKGKSDPYHATSGALSPAKDCGSQKYAYFNGCSSPTAPLSPMSPPGYKLVTGDRNNSSCRNYNKQASEQNWANYSAEQNRMGQAGSTISNSHAQPFDFPDDNQNSKKLAAGHELQPLAIVDQRPSSRASSRASSRPRPDDLEI,mutated_sequence,1.0,382.0,NP_000156.1.a2m,NP_000156.1.npy,ClinVar
+NP_000157.1,NP_000157.1.csv,MNWTGLYTLLSGVNRHSTAIGRVWLSVIFIFRIMVLVVAAESVWGDEKSSFICNTLQPGCNSVCYDQFFPISHVRLWSLQLILVSTPALLVAMHVAHQQHIEKKMLRLEGHGDPLHLEEVKRHKVHISGTLWWTYVISVVFRLLFEAVFMYVFYLLYPGYAMVRLVKCDVYPCPNTVDCFVSRPTEKTVFTVFMLAASGICIILNVAEVVYLIIRACARRAQRRSNPPSRKGSGFGHRLSPEYKQNEINKLLSEQDGSLKDILRRSPGTGAGLAEKSDRCSAC,mutated_sequence,1.0,283.0,NP_000157.1.a2m,NP_000157.1.npy,ClinVar
+NP_000160.1,NP_000160.1.csv,MQLRNPELHLGCALALRFLALVSWDIPGARALDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCIDDCWMAPQRDSEGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYKHMSLALNRTGRSIVYSCEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQEIGGPRSYTIAVASLGKGVACNPACFITQLLPVKRKLGFYEWTSRLRSHINPTGTVLLQLENTMQMSLKDLL,mutated_sequence,1.0,429.0,NP_000160.1.a2m,NP_000160.1.npy,ClinVar
+NP_000161.2,NP_000161.2.csv,MQSCARAWGLRLGRGVGGGRRLAGGSGPCWAPRSRDSSSGGGDSAAAGASRLLERLLPRHDDFARRHIGPGDKDQREMLQTLGLASIDELIEKTVPANIRLKRPLKMEDPVCENEILATLHAISSKNQIWRSYIGMGYYNCSVPQTILRNLLENSGWITQYTPYQPEVSQGRLESLLNYQTMVCDITGLDMANASLLDEGTAAAEALQLCYRHNKRRKFLVDPRCHPQTIAVVQTRAKYTGVLTELKLPCEMDFSGKDVSGVLFQYPDTEGKVEDFTELVERAHQSGSLACCATDLLALCILRPPGEFGVDIALGSSQRFGVPLGYGGPHAAFFAVRESLVRMMPGRMVGVTRDATGKEVYRLALQTREQHIRRDKATSNICTAQALLANMAAMFAIYHGSHGLEHIARRVHNATLILSEGLKRAGHQLQHDLFFDTLKIQCGCSVKEVLGRAAQRQINFRLFEDGTLGISLDETVNEKDLDDLLWIFGCESSAELVAESMGEECRGIPGSVFKRTSPFLTHQVFNSYHSETNIVRYMKKLENKDISLVHSMIPLGSCTMKLNSSSELAPITWKEFANIHPFVPLDQAQGYQQLFRELEKDLCELTGYDQVCFQPNSGAQGEYAGLATIRAYLNQKGEGHRTVCLIPKSAHGTNPASAHMAGMKIQPVEVDKYGNIDAVHLKAMVDKHKENLAAIMITYPSTNGVFEENISDVCDLIHQHGGQVYLDGANMNAQVGICRPGDFGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKKHLAPFLPNHPVISLKRNEDACPVGTVSAAPWGSSSILPISWAYIKMMGGKGLKQATETAILNANYMAKRLETHYRILFRGARGYVGHEFILDTRPFKKSANIEAVDVAKRLQDYGFHAPTMSWPVAGTLMVEPTESEDKAELDRFCDAMISIRQEIADIEEGRIDPRVNPLKMSPHSLTCVTSSHWDRPYSREVAAFPLPFVKPENKFWPTIARIDDIYGDQHLVCTCPPMEVYESPFSEQKRASS,mutated_sequence,1.0,1020.0,NP_000161.2.a2m,NP_000161.2.npy,ClinVar
+NP_000169.1,NP_000169.1.csv,MATNWGSLLQDKQQLEELARQAVDRALAEGVLLRTSQEPTSSEVVSYAPFTLFPSLVPSALLEQAYAVQMDFNLLVDAVSQNAAFLEQTLSSTIKQDDFTARLFDIHKQVLKEGIAQTVFLGLNRSDYMFQRSADGSPALKQIEINTISASFGGLASRTPAVHRHVLSVLSKTKEAGKILSNNPSKGLALGIAKAWELYGSPNALVLLIAQEKERNIFDQRAIENELLARNIHVIRRTFEDISEKGSLDQDRRLFVDGQEIAVVYFRDGYMPRQYSLQNWEARLLLERSHAAKCPDIATQLAGTKKVQQELSRPGMLEMLLPGQPEAVARLRATFAGLYSLDVGEEGDQAIAEALAAPSRFVLKPQREGGGNNLYGEEMVQALKQLKDSEERASYILMEKIEPEPFENCLLRPGSPARVVQCISELGIFGVYVRQEKTLVMNKHVGHLLRTKAIEHADGGVAAGVAVLDNPYPV,mutated_sequence,1.0,474.0,NP_000169.1.a2m,NP_000169.1.npy,ClinVar
+NP_000170.1,NP_000170.1.csv,MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSPGGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDKSEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSDEISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETKNTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFDASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRVVRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLGKFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPGSQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALSALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTLNNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLMVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKIIDFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDTAFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGIGRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPFLELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL,mutated_sequence,1.0,1360.0,NP_000170.1.a2m,NP_000170.1.npy,ClinVar
+NP_000177.2,NP_000177.2.csv,MRLLAKIICLMLWAICVAEDCNELPPRRNTEILTGSWSDQTYPEGTQAIYKCRPGYRSLGNVIMVCRKGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGNVFEYGVKAVYTCNEGYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSSAMEPDREYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPDVINGSPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCEEKSCDNPYIPNGDYSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTSTGWIPAPRCTLKPCDYPDIKHGGLYHENMRRPYFPVAVGKYYSYYCDEHFETPSGSYWDHIHCTQDGWSPAVPCLRKCYFPYLENGYNQNHGRKFVQGKSIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVKTCSKSSIDIENGFISESQYTYALKEKAKYQCKLGYVTADGETSGSITCGKDGWSAQPTCIKSCDIPVFMNARTKNDFTWFKLNDTLDYECHDGYESNTGSTTGSIVCGYNGWSDLPICYERECELPKIDVHLVPDRKKDQYKVGEVLKFSCKPGFTIVGPNSVQCYHFGLSPDLPICKEQVQSCGPPPELLNGNVKEKTKEEYGHSEVVEYYCNPRFLMKGPNKIQCVDGEWTTLPVCIVEESTCGDIPELEHGWAQLSSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKCKSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVNCSMAQIQLCPPPPQIPNSHNMTTTLNYRDGEKVSVLCQENYLIQEGEEITCKDGRWQSIPLCVEKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYTCEGGFRISEENETTCYMGKWSSPPQCEGLPCKSPPEISHGVVAHMSDSYQYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCIKTDCLSLPSFENAIPMGEKKDVYKAGEQVTYTCATYYKMDGASNVTCINSRWTGRPTCRDTSCVNPPTVQNAYIVSRQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNWTEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLEGNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYSRTGESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR,mutated_sequence,1.0,1231.0,NP_000177.2.a2m,NP_000177.2.npy,ClinVar
+NP_000184.1,NP_000184.1.csv,MLLLARCLLLVLVSSLLVCSGLACGPGRGFGKRRHPKKLTPLAYKQFIPNVAEKTLGASGRYEGKISRNSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDKLNALAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRSKYGMLARLAVEAGFDWVYYESKAHIHCSVKAENSVAAKSGGCFPGSATVHLEQGGTKLVKDLSPGDRVLAADDQGRLLYSDFLTFLDRDDGAKKVFYVIETREPRERLLLTAAHLLFVAPHNDSATGEPEASSGSGPPSGGALGPRALFASRVRPGQRVYVVAERDGDRRLLPAAVHSVTLSEEAAGAYAPLTAQGTILINRVLASCYAVIEEHSWAHRAFAPFRLAHALLAALAPARTDRGGDSGGGDRGGGGGRVALTAPGAADAPGAGATAGIHWYSQLLYQIGTWLLDSEALHPLGMAVKSS,mutated_sequence,1.0,462.0,NP_000184.1.a2m,NP_000184.1.npy,ClinVar
+NP_000187.3,NP_000187.3.csv,MERWPWPSGGAWLLVAARALLQLLRSDLRLGRPLLAALALLAALDWLCQRLLPPPAALAVLAAAGWIALSRLARPQRLPVATRAVLITGCDSGFGKETAKKLDSMGFTVLATVLELNSPGAIELRTCCSPRLRLLQMDLTKPGDISRVLEFTKAHTTSTGLWGLVNNAGHNEVVADAELSPVATFRSCMEVNFFGALELTKGLLPLLRSSRGRIVTVGSPAGDMPYPCLGAYGTSKAAVALLMDTFSCELLPWGVKVSIIQPGCFKTESVRNVGQWEKRKQLLLANLPQELLQAYGKDYIEHLHGQFLHSLRLAMSDLTPVVDAITDALLAARPRRRYYPGQGLGLMYFIHYYLPEGLRRRFLQAFFISHCLPRALQPGQPGTTPPQDAAQDPNLSPGPSPAVAR,mutated_sequence,1.0,405.0,NP_000187.3.a2m,NP_000187.3.npy,ClinVar
+NP_000188.1,NP_000188.1.csv,MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGIGKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKEKLAGLEIGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLILKHMESRQKGLILNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLNTNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYVAYLKLNTKVR,mutated_sequence,1.0,310.0,NP_000188.1.a2m,NP_000188.1.npy,ClinVar
+NP_000193.1,NP_000193.1.csv,MPPPRTGRGLLWLGLVLSSVCVALGSETQANSTTDALNVLLIIVDDLRPSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNFSTIPQYFKENGYVTMSVGKVFHPGISSNHTDDSPYSWSFPPYHPSSEKYENTKTCRGPDGELHANLLCPVDVLDVPEGTLPDKQSTEQAIQLLEKMKTSASPFFLAVGYHKPHIPFRYPKEFQKLYPLENITLAPDPEVPDGLPPVAYNPWMDIRQREDVQALNISVPYGPIPVDFQRKIRQSYFASVSYLDTQVGRLLSALDDLQLANSTIIAFTSDHGWALGEHGEWAKYSNFDVATHVPLIFYVPGRTASLPEAGEKLFPYLDPFDSASQLMEPGRQSMDLVELVSLFPTLAGLAGLQVPPRCPVPSFHVELCREGKNLLKHFRFRDLEEDPYLPGNPRELIAYSQYPRPSDIPQWNSDKPSLKDIKIMGYSIRTIDYRYTVWVGFNPDEFLANFSDIHAGELYFVDSDPLQDHNMYNDSQGGDLFQLLMP,mutated_sequence,1.0,550.0,NP_000193.1.a2m,NP_000193.1.npy,ClinVar
+NP_000194.2,NP_000194.2.csv,MRPLRPRAALLALLASLLAAPPVAPAEAPHLVHVDAARALWPLRRFWRSTGFCPPLPHSQADQYVLSWDQQLNLAYVGAVPHRGIKQVRTHWLLELVTTRGSTGRGLSYNFTHLDGYLDLLRENQLLPGFELMGSASGHFTDFEDKQQVFEWKDLVSSLARRYIGRYGLAHVSKWNFETWNEPDHHDFDNVSMTMQGFLNYYDACSEGLRAASPALRLGGPGDSFHTPPRSPLSWGLLRHCHDGTNFFTGEAGVRLDYISLHRKGARSSISILEQEKVVAQQIRQLFPKFADTPIYNDEADPLVGWSLPQPWRADVTYAAMVVKVIAQHQNLLLANTTSAFPYALLSNDNAFLSYHPHPFAQRTLTARFQVNNTRPPHVQLLRKPVLTAMGLLALLDEEQLWAEVSQAGTVLDSNHTVGVLASAHRPQGPADAWRAAVLIYASDDTRAHPNRSVAVTLRLRGVPPGPGLVYVTRYLDNGLCSPDGEWRRLGRPVFPTAEQFRRMRAAEDPVAAAPRPLPAGGRLTLRPALRLPSLLLVHVCARPEKPPGQVTRLRALPLTQGQLVLVWSDEHVGSKCLWTYEIQFSQDGKAYTPVSRKPSTFNLFVFSPDTGAVSGSYRVRALDYWARPGPFSDPVPYLEVPVPRGPPSPGNP,mutated_sequence,1.0,653.0,NP_000194.2.a2m,NP_000194.2.npy,ClinVar
+NP_000197.1,NP_000197.1.csv,MLKPSLPFTSLLFLQLPLLGVGLNTTILTPNGNEDTTADFFLTTMPTDSLSVSTLPLPEVQCFVFNVEYMNCTWNSSSEPQPTNLTLHYWYKNSDNDKVQKCSHYLFSEEITSGCQLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENLTLHKLSESQLELNWNNRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKRYTFRVRSRFNPLCGSAQHWSEWSHPIHWGSNTSKENPFLFALEAVVISVGSMGLIISLLCVYFWLERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSEIPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET,mutated_sequence,1.0,369.0,NP_000197.1.a2m,NP_000197.1.npy,ClinVar
+NP_000199.2,NP_000199.2.csv,MATGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLKGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRKTSSGTGAEDPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPKMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPS,mutated_sequence,1.0,1382.0,NP_000199.2.a2m,NP_000199.2.npy,ClinVar
+NP_000203.2,NP_000203.2.csv,MRARPRPRPLWATVLALGALAGVGVGGPNICTTRGVSSCQQCLAVSPMCAWCSDEALPLGSPRCDLKENLLKDNCAPESIEFPVSEARVLEDRPLSDKGSGDSSQVTQVSPQRIALRLRPDDSKNFSIQVRQVEDYPVDIYYLMDLSYSMKDDLWSIQNLGTKLATQMRKLTSNLRIGFGAFVDKPVSPYMYISPPEALENPCYDMKTTCLPMFGYKHVLTLTDQVTRFNEEVKKQSVSRNRDAPEGGFDAIMQATVCDEKIGWRNDASHLLVFTTDAKTHIALDGRLAGIVQPNDGQCHVGSDNHYSASTTMDYPSLGLMTEKLSQKNINLIFAVTENVVNLYQNYSELIPGTTVGVLSMDSSNVLQLIVDAYGKIRSKVELEVRDLPEELSLSFNATCLNNEVIPGLKSCMGLKIGDTVSFSIEAKVRGCPQEKEKSFTIKPVGFKDSLIVQVTFDCDCACQAQAEPNSHRCNNGNGTFECGVCRCGPGWLGSQCECSEEDYRPSQQDECSPREGQPVCSQRGECLCGQCVCHSSDFGKITGKYCECDDFSCVRYKGEMCSGHGQCSCGDCLCDSDWTGYYCNCTTRTDTCMSSNGLLCSGRGKCECGSCVCIQPGSYGDTCEKCPTCPDACTFKKECVECKKFDRGALHDENTCNRYCRDEIESVKELKDTGKDAVNCTYKNEDDCVVRFQYYEDSSGKSILYVVEEPECPKGPDILVVLLSVMGAILLIGLAALLIWKLLITIHDRKEFAKFEEERARAKWDTANNPLYKEATSTFTNITYRGT,mutated_sequence,1.0,788.0,NP_000203.2.a2m,NP_000203.2.npy,ClinVar
+NP_000206.2,NP_000206.2.csv,MAPPSEETPLIPQRSCSLLSTEAGALHVLLPARGPGPPQRLSFSFGDHLAEDLCVQAAKASGILPVYHSLFALATEDLSCWFPPSHIFSVEDASTQVLLYRIRFYFPNWFGLEKCHRFGLRKDLASAILDLPVLEHLFAQHRSDLVSGRLPVGLSLKEQGECLSLAVLDLARMAREQAQRPGELLKTVSYKACLPPSLRDLIQGLSFVTRRRIRRTVRRALRRVAACQADRHSLMAKYIMDLERLDPAGAAETFHVGLPGALGGHDGLGLLRVAGDGGIAWTQGEQEVLQPFCDFPEIVDISIKQAPRVGPAGEHRLVTVTRTDNQILEAEFPGLPEALSFVALVDGYFRLTTDSQHFFCKEVAPPRLLEEVAEQCHGPITLDFAINKLKTGGSRPGSYVLRRSPQDFDSFLLTVCVQNPLGPDYKGCLIRRSPTGTFLLVGLSRPHSSLRELLATCWDGGLHVDGVAVTLTSCCIPRPKEKSNLIVVQRGHSPPTSSLVQPQSQYQLSQMTFHKIPADSLEWHENLGHGSFTKIYRGCRHEVVDGEARKTEVLLKVMDAKHKNCMESFLEAASLMSQVSYRHLVLLHGVCMAGDSTMVQEFVHLGAIDMYLRKRGHLVPASWKLQVVKQLAYALNYLEDKGLPHGNVSARKVLLAREGADGSPPFIKLSDPGVSPAVLSLEMLTDRIPWVAPECLREAQTLSLEADKWGFGATVWEVFSGVTMPISALDPAKKLQFYEDRQQLPAPKWTELALLIQQCMAYEPVQRPSFRAVIRDLNSLISSDYELLSDPTPGALAPRDGLWNGAQLYACQDPTIFEERHLKYISQLGKGNFGSVELCRYDPLGDNTGALVAVKQLQHSGPDQQRDFQREIQILKALHSDFIVKYRGVSYGPGRQSLRLVMEYLPSGCLRDFLQRHRARLDASRLLLYSSQICKGMEYLGSRRCVHRDLAARNILVESEAHVKIADFGLAKLLPLDKDYYVVREPGQSPIFWYAPESLSDNIFSRQSDVWSFGVVLYELFTYCDKSCSPSAEFLRMMGCERDVPALCRLLELLEEGQRLPAPPACPAEVHELMKLCWAPSPQDRPSFSALGPQLDMLWSGSRGCETHAFTAHPEGKHHSLSFS,mutated_sequence,1.0,1124.0,NP_000206.2.a2m,NP_000206.2.npy,ClinVar
+NP_000209.2,NP_000209.2.csv,MAAASSPPRAERKRWGWGRLPGARRGSAGLAKKCPFSLELAEGGPAGGALYAPIAPGAPGPAPPASPAAPAAPPVASDLGPRPPVSLDPRVSIYSTRRPVLARTHVQGRVYNFLERPTGWKCFVYHFAVFLIVLVCLIFSVLSTIEQYAALATGTLFWMEIVLVVFFGTEYVVRLWSAGCRSKYVGLWGRLRFARKPISIIDLIVVVASMVVLCVGSKGQVFATSAIRGIRFLQILRMLHVDRQGGTWRLLGSVVFIHRQELITTLYIGFLGLIFSSYFVYLAEKDAVNESGRVEFGSYADALWWGVVTVTTIGYGDKVPQTWVGKTIASCFSVFAISFFALPAGILGSGFALKVQQKQRQKHFNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRKAPRSHTLLSPSPKPKKSVVVKKKKFKLDKDNGVTPGEKMLTVPHITCDPPEERRLDHFSVDGYDSSVRKSPTLLEVSMPHFMRTNSFAEDLDLEGETLLTPITHISQLREHHRATIKVIRRMQYFVAKKKFQQARKPYDVRDVIEQYSQGHLNLMVRIKELQRRLDQSIGKPSLFISVSEKSKDRGSNTIGARLNRVEDKVTQLDQRLALITDMLHQLLSLHGGSTPGSGGPPREGGAHITQPCGSGGSVDPELFLPSNTLPTYEQLTVPRRGPDEGS,mutated_sequence,1.0,676.0,NP_000209.2.a2m,NP_000209.2.npy,ClinVar
+NP_000213.1,NP_000213.1.csv,MRGARGAWDFLCVLLLLLRVQTGSSQPSVSPGEPSPPSIHPGKSDLIVRVGDEIRLLCTDPGFVKWTFEILDETNENKQNEWITEKAEATNTGKYTCTNKHGLSNSIYVFVRDPAKLFLVDRSLYGKEDNDTLVRCPLTDPEVTNYSLKGCQGKPLPKDLRFIPDPKAGIMIKSVKRAYHRLCLHCSVDQEGKSVLSEKFILKVRPAFKAVPVVSVSKASYLLREGEEFTVTCTIKDVSSSVYSTWKRENSQTKLQEKYNSWHHGDFNYERQATLTISSARVNDSGVFMCYANNTFGSANVTTTLEVVDKGFINIFPMINTTVFVNDGENVDLIVEYEAFPKPEHQQWIYMNRTFTDKWEDYPKSENESNIRYVSELHLTRLKGTEGGTYTFLVSNSDVNAAIAFNVYVNTKPEILTYDRLVNGMLQCVAAGFPEPTIDWYFCPGTEQRCSASVLPVDVQTLNSSGPPFGKLVVQSSIDSSAFKHNGTVECKAYNDVGKTSAYFNFAFKGNNKEQIHPHTLFTPLLIGFVIVAGMMCIIVMILTYKYLQKPMYEVQWKVVEEINGNNYVYIDPTQLPYDHKWEFPRNRLSFGKTLGAGAFGKVVEATAYGLIKSDAAMTVAVKMLKPSAHLTEREALMSELKVLSYLGNHMNIVNLLGACTIGGPTLVITEYCCYGDLLNFLRRKRDSFICSKQEDHAEAALYKNLLHSKESSCSDSTNEYMDMKPGVSYVVPTKADKRRSVRIGSYIERDVTPAIMEDDELALDLEDLLSFSYQVAKGMAFLASKNCIHRDLAARNILLTHGRITKICDFGLARDIKNDSNYVVKGNARLPVKWMAPESIFNCVYTFESDVWSYGIFLWELFSLGSSPYPGMPVDSKFYKMIKEGFRMLSPEHAPAEMYDIMKTCWDADPLKRPTFKQIVQLIEKQISESTNHIYSNLANCSPNRQKPVVDHSVRINSVGSTASSSQPLLVHDDV,mutated_sequence,1.0,976.0,NP_000213.1.a2m,NP_000213.1.npy,ClinVar
+NP_000220.1,NP_000220.1.csv,MGPPGSPWQWVTLLLGLLLPPAAPFWLLNVLFPPHTTPKAELSNHTRPVILVPGCLGNQLEAKLDKPDVVNWMCYRKTEDFFTIWLDLNMFLPLGVDCWIDNTRVVYNRSSGLVSNAPGVQIRVPGFGKTYSVEYLDSSKLAGYLHTLVQNLVNNGYVRDETVRAAPYDWRLEPGQQEEYYRKLAGLVEEMHAAYGKPVFLIGHSLGCLHLLYFLLRQPQAWKDRFIDGFISLGAPWGGSIKPMLVLASGDNQGIPIMSSIKLKEEQRITTTSPWMFPSRMAWPEDHVFISTPSFNYTGRDFQRFFADLHFEEGWYMWLQSRDLLAGLPAPGVEVYCLYGVGLPTPRTYIYDHGFPYTDPVGVLYEDGDDTVATRSTELCGLWQGRQPQPVHLLPLHGIQHLNMVFSNLTLEHINAILLGAYRQGPPASPTASPEPPPPE,mutated_sequence,1.0,440.0,NP_000220.1.a2m,NP_000220.1.npy,ClinVar
+NP_000223.1,NP_000223.1.csv,MAAAAAAAAEQQSSNGPVKKSMREKAVERRSVNKEHNSNFKAGYIPIDEDRLHKTGLRGRKGNLAICVIILLFILAVINLIITLVIWAVIRIGPNGCDSMEFHESGLLRFKQVSDMGVIHPLYKSTVGGRRNENLVITGNNQPIVFQQGTTKLSVENNKTSITSDIGMQFFDPRTQNILFSTDYETHEFHLPSGVKSLNVQKASTERITSNATSDLNIKVDGRAIVRGNEGVFIMGKTIEFHMGGNMELKAENSIILNGSVMVSTTRLPSSSSGDQLGSGDWVRYKLCMCADGTLFKVQVTSQNMGCQISDNPCGNTH,mutated_sequence,1.0,318.0,NP_000223.1.a2m,NP_000223.1.npy,ClinVar
+NP_000224.2,NP_000224.2.csv,MKQRFSALQLLKLLLLLQPPLPRALREALCPEPCNCVPDGALRCPGPTAGLTRLSLAYLPVKVIPSQAFRGLNEVIKIEISQIDSLERIEANAFDNLLNLSEILIQNTKNLRYIEPGAFINLPRLKYLSICNTGIRKFPDVTKVFSSESNFILEICDNLHITTIPGNAFQGMNNESVTLKLYGNGFEEVQSHAFNGTTLTSLELKENVHLEKMHNGAFRGATGPKTLDISSTKLQALPSYGLESIQRLIATSSYSLKKLPSRETFVNLLEATLTYPSHCCAFRNLPTKEQNFSHSISENFSKQCESTVRKVNNKTLYSSMLAESELSGWDYEYGFCLPKTPRCAPEPDAFNPCEDIMGYDFLRVLIWLINILAIMGNMTVLFVLLTSRYKLTVPRFLMCNLSFADFCMGLYLLLIASVDSQTKGQYYNHAIDWQTGSGCSTAGFFTVFASELSVYTLTVITLERWHTITYAIHLDQKLRLRHAILIMLGGWLFSSLIAMLPLVGVSNYMKVSICFPMDVETTLSQVYILTILILNVVAFFIICACYIKIYFAVRNPELMATNKDTKIAKKMAILIFTDFTCMAPISFFAISAAFKVPLITVTNSKVLLVLFYPINSCANPFLYAIFTKTFQRDFFLLLSKFGCCKRRAELYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQGTALLDKTRYTEC,mutated_sequence,1.0,699.0,NP_000224.2.a2m,NP_000224.2.npy,ClinVar
+NP_000226.2,NP_000226.2.csv,MKMRFLGLVVCLVLWTLHSEGSGGKLTAVDPETNMNVSEIISYWGFPSEEYLVETEDGYILCLNRIPHGRKNHSDKGPKPVVFLQHGLLADSSNWVTNLANSSLGFILADAGFDVWMGNSRGNTWSRKHKTLSVSQDEFWAFSYDEMAKYDLPASINFILNKTGQEQVYYVGHSQGTTIGFIAFSQIPELAKRIKMFFALGPVASVAFCTSPMAKLGRLPDHLIKDLFGDKEFLPQSAFLKWLGTHVCTHVILKELCGNLCFLLCGFNERNLNMSRVDVYTTHSPAGTSVQNMLHWSQAVKFQKFQAFDWGSSAKNYFHYNQSYPPTYNVKDMLVPTAVWSGGHDWLADVYDVNILLTQITNLVFHESIPEWEHLDFIWGLDAPWRLYNKIINLMRKYQ,mutated_sequence,1.0,399.0,NP_000226.2.a2m,NP_000226.2.npy,ClinVar
+NP_000229.1,NP_000229.1.csv,MPVRRGHVAPQNTFLDTIIRKFEGQSRKFIIANARVENCAVIYCNDGFCELCGYSRAEVMQRPCTCDFLHGPRTQRRAAAQIAQALLGAEERKVEIAFYRKDGSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEKDMVGSPAHDTNHRGPPTSWLAPGRAKTFRLKLPALLALTARESSVRSGGAGGAGAPGAVVVDVDLTPAAPSSESLALDEVTAMDNHVAGLGPAEERRALVGPGSPPRSAPGQLPSPRAHSLNPDASGSSCSLARTRSRESCASVRRASSADDIEAMRAGVLPPPPRHASTGAMHPLRSGLLNSTSDSDLVRYRTISKIPQITLNFVDLKGDPFLASPTSDREIIAPKIKERTHNVTEKVTQVLSLGADVLPEYKLQAPRIHRWTILHYSPFKAVWDWLILLLVIYTAVFTPYSAAFLLKETEEGPPATECGYACQPLAVVDLIVDIMFIVDILINFRTTYVNANEEVVSHPGRIAVHYFKGWFLIDMVAAIPFDLLIFGSGSEELIGLLKTARLLRLVRVARKLDRYSEYGAAVLFLLMCTFALIAHWLACIWYAIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKYVTALYFTFSSLTSVGFGNVSPNTNSEKIFSICVMLIGSLMYASIFGNVSAIIQRLYSGTARYHTQMLRVREFIRFHQIPNPLRQRLEEYFQHAWSYTNGIDMNAVLKGFPECLQADICLHLNRSLLQHCKPFRGATKGCLRALAMKFKTTHAPPGDTLVHAGDLLTALYFISRGSIEILRGDVVVAILGKNDIFGEPLNLYARPGKSNGDVRALTYCDLHKIHRDDLLEVLDMYPEFSDHFWSSLEITFNLRDTNMIPGSPGSTELEGGFSRQRKRKLSFRRRTDKDTEQPGEVSALGPGRAGAGPSSRGRPGGPWGESPSSGPSSPESSEDEGPGRSSSPLRLVPFSSPRPPGEPPGGEPLMEDCEKSSDTCNPLSGAFSGVSNIFSFWGDSRGRQYQELPRCPAPTPSLLNIPLSSPGRRPRGDVESRLDALQRQLNRLETRLSADMATVLQLLQRQMTLVPPAYSAVTTPGPGPTSTSPLLPVSPLPTLTLDSLSQVSQFMACEELPPGAPELPQEGPTRRLSLPGQLGALTSQPLHRHGSDPGS,mutated_sequence,1.0,1159.0,NP_000229.1.a2m,NP_000229.1.npy,ClinVar
+NP_000237.2,NP_000237.2.csv,MRCLAPRPAGSYLSEPQGSSQCATMELGPLEGGYLELLNSDADPLCLYHFYDQMDLAGEEEIELYSEPDTDTINCDQFSRLLCDMEGDEETREAYANIAELDQYVFQDSQLEGLSKDIFKHIGPDEVIGESMEMPAEVGQKSQKRPFPEELPADLKHWKPAEPPTVVTGSLLVGPVSDCSTLPCLPLPALFNQEPASGQMRLEKTDQIPMPFSSSSLSCLNLPEGPIQFVPTISTLPHGLWQISEAGTGVSSIFIYHGEVPQASQVPPPSGFTVHGLPTSPDRPGSTSPFAPSATDLPSMPEPALTSRANMTEHKTSPTQCPAAGEVSNKLPKWPEPVEQFYRSLQDTYGAEPAGPDGILVEVDLVQARLERSSSKSLERELATPDWAERQLAQGGLAEVLLAAKEHRRPRETRVIAVLGKAGQGKSYWAGAVSRAWACGRLPQYDFVFSVPCHCLNRPGDAYGLQDLLFSLGPQPLVAADEVFSHILKRPDRVLLILDGFEELEAQDGFLHSTCGPAPAEPCSLRGLLAGLFQKKLLRGCTLLLTARPRGRLVQSLSKADALFELSGFSMEQAQAYVMRYFESSGMTEHQDRALTLLRDRPLLLSHSHSPTLCRAVCQLSEALLELGEDAKLPSTLTGLYVGLLGRAALDSPPGALAELAKLAWELGRRHQSTLQEDQFPSADVRTWAMAKGLVQHPPRAAESELAFPSFLLQCFLGALWLALSGEIKDKELPQYLALTPRKKRPYDNWLEGVPRFLAGLIFQPPARCLGALLGPSAAASVDRKQKVLARYLKRLQPGTLRARQLLELLHCAHEAEEAGIWQHVVQELPGRLSFLGTRLTPPDAHVLGKALEAAGQDFSLDLRSTGICPSGLGSLVGLSCVTRFRAALSDTVALWESLQQHGETKLLQAAEEKFTIEPFKAKSLKDVEDLGKLVQTQRTRSSSEDTAGELPAVRDLKKLEFALGPVSGPQAFPKLVRILTAFSSLQHLDLDALSENKIGDEGVSQLSATFPQLKSLETLNLSQNNITDLGAYKLAEALPSLAASLLRLSLYNNCICDVGAESLARVLPDMVSLRVMDVQYNKFTAAGAQQLAASLRRCPHVETLAMWTPTIPFSVQEHLQQQDSRISLR,mutated_sequence,1.0,1130.0,NP_000237.2.a2m,NP_000237.2.npy,ClinVar
+NP_000240.1,NP_000240.1.csv,MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIATRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNAVSRELIEIGCEDKTLAFKMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPLSKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGDTTKGTSEMSEKRGPTSSNPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLTSVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELFYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLADYFSLEIDEEGNLIGLPLLIDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRKQYISEESTLSGQQSEVPGSIPNSWKWTVEHIVYKALRSHILPPKHFTEDGNILQLANLPDLYKVFERC,mutated_sequence,1.0,756.0,NP_000240.1.a2m,NP_000240.1.npy,ClinVar
+NP_000242.1,NP_000242.1.csv,MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYTAHGEDALLAAREVFKTQGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKNRAGNKASKENDWYLAYKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGLCEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQRGGILITERKKADFSTKDIYQDLNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELLSDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPLMDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQALEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLKGVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHVIECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT,mutated_sequence,1.0,934.0,NP_000242.1.a2m,NP_000242.1.npy,ClinVar
+NP_000243.1,NP_000243.1.csv,MASASTSKYNSHSLENESIKRTSRDGVNRDLTEAVPRLPGETLITDKEVIYICPFNGPIKGRVYITNYRLYLRSLETDSSLILDVPLGVISRIEKMGGATSRGENSYGLDITCKDMRNLRFALKQEGHSRRDMFEILTRYAFPLAHSLPLFAFLNEEKFNVDGWTVYNPVEEYRRQGLPNHHWRITFINKCYELCDTYPALLVVPYRASDDDLRRVATFRSRNRIPVLSWIHPENKTVIVRCSQPLVGMSGKRNKDDEKYLDVIRETNKQISKLTIYDARPSVNAVANKATGGGYESDDAYHNAELFFLDIHNIHVMRESLKKVKDIVYPNVEESHWLSSLESTHWLEHIKLVLTGAIQVADKVSSGKSSVLVHCSDGWDRTAQLTSLAMLMLDSFYRSIEGFEILVQKEWISFGHKFASRIGHGDKNHTDADRSPIFLQFIDCVWQMSKQFPTAFEFNEQFLIIILDHLYSCRFGTFLFNCESARERQKVTERTVSLWSLINSNKEKFKNPFYTKEINRVLYPVASMRHLELWVNYYIRWNPRIKQQQPNPVEQRYMELLALRDEYIKRLEELQLANSAKLSDPPTSPSSPSQMMPHVQTHF,mutated_sequence,1.0,603.0,NP_000243.1.a2m,NP_000243.1.npy,ClinVar
+NP_000246.2,NP_000246.2.csv,MLRAKNQLFLLSPHYLRQVKESSGSRLIQQRLLHQQQPLHPEWAALAKKQLKGKNPEDLIWHTPEGISIKPLYSKRDTMDLPEELPGVKPFTRGPYPTMYTFRPWTIRQYAGFSTVEESNKFYKDNIKAGQQGLSVAFDLATHRGYDSDNPRVRGDVGMAGVAIDTVEDTKILFDGIPLEKMSVSMTMNGAVIPVLANFIVTGEEQGVPKEKLTGTIQNDILKEFMVRNTYIFPPEPSMKIIADIFEYTAKHMPKFNSISISGYHMQEAGADAILELAYTLADGLEYSRTGLQAGLTIDEFAPRLSFFWGIGMNFYMEIAKMRAGRRLWAHLIEKMFQPKNSKSLLLRAHCQTSGWSLTEQDPYNNIVRTAIEAMAAVFGGTQSLHTNSFDEALGLPTVKSARIARNTQIIIQEESGIPKVADPWGGSYMMECLTNDVYDAALKLINEIEEMGGMAKAVAEGIPKLRIEECAARRQARIDSGSEVIVGVNKYQLEKEDAVEVLAIDNTSVRNRQIEKLKKIKSSRDQALAERCLAALTECAASGDGNILALAVDASRARCTVGEITDALKKVFGEHKANDRMVSGAYRQEFGESKEITSAIKRVHKFMEREGRRPRLLVAKMGQDGHDRGAKVIATGFADLGFDVDIGPLFQTPREVAQQAVDADVHAVGISTLAAGHKTLVPELIKELNSLGRPDILVMCGGVIPPQDYEFLFEVGVSNVFGPGTRIPKAAVQVLDDIEKCLEKKQQSV,mutated_sequence,1.0,750.0,NP_000246.2.a2m,NP_000246.2.npy,ClinVar
+NP_000247.2,NP_000247.2.csv,MPEPGKKPVSAFSKKPRSVEVAAGSPAVFEAETERAGVKVRWQRGGSDISASNKYGLATEGTRHTLTVREVGPADQGSYAVIAGSSKVKFDLKVIEAEKAEPMLAPAPAPAEATGAPGEAPAPAAELGESAPSPKGSSSAALNGPTPGAPDDPIGLFVMRPQDGEVTVGGSITFSARVAGASLLKPPVVKWFKGKWVDLSSKVGQHLQLHDSYDRASKVYLFELHITDAQPAFTGSYRCEVSTKDKFDCSNFNLTVHEAMGTGDLDLLSAFRRTSLAGGGRRISDSHEDTGILDFSSLLKKRDSFRTPRDSKLEAPAEEDVWEILRQAPPSEYERIAFQYGVTDLRGMLKRLKGMRRDEKKSTAFQKKLEPAYQVSKGHKIRLTVELADHDAEVKWLKNGQEIQMSGSKYIFESIGAKRTLTISQCSLADDAAYQCVVGGEKCSTELFVKEPPVLITRPLEDQLVMVGQRVEFECEVSEEGAQVKWLKDGVELTREETFKYRFKKDGQRHHLIINEAMLEDAGHYALCTSGGQALAELIVQEKKLEVYQSIADLMVGAKDQAVFKCEVSDENVRGVWLKNGKELVPDSRIKVSHIGRVHKLTIDDVTPADEADYSFVPEGFACNLSAKLHFMEVKIDFVPRQEPPKIHLDCPGRIPDTIVVVAGNKLRLDVPISGDPAPTVIWQKAITQGNKAPARPAPDAPEDTGDSDEWVFDKKLLCETEGRVRVETTKDRSIFTVEGAEKEDEGVYTVTVKNPVGEDQVNLTVKVIDVPDAPAAPKISNVGEDSCTVQWEPPAYDGGQPILGYILERKKKKSYRWMRLNFDLIQELSHEARRMIEGVVYEMRVYAVNAIGMSRPSPASQPFMPIGPPSEPTHLAVEDVSDTTVSLKWRPPERVGAGGLDGYSVEYCPEGCSEWVAALQGLTEHTSILVKDLPTGARLLFRVRAHNMAGPGAPVTTTEPVTVQEILQRPRLQLPRHLRQTIQKKVGEPVNLLIPFQGKPRPQVTWTKEGQPLAGEEVSIRNSPTDTILFIRAARRVHSGTYQVTVRIENMEDKATLVLQVVDKPSPPQDLRVTDAWGLNVALEWKPPQDVGNTELWGYTVQKADKKTMEWFTVLEHYRRTHCVVPELIIGNGYYFRVFSQNMVGFSDRAATTKEPVFIPRPGITYEPPNYKALDFSEAPSFTQPLVNRSVIAGYTAMLCCAVRGSPKPKISWFKNGLDLGEDARFRMFSKQGVLTLEIRKPCPFDGGIYVCRATNLQGEARCECRLEVRVPQ,mutated_sequence,1.0,1274.0,NP_000247.2.a2m,NP_000247.2.npy,ClinVar
+NP_000248.2,NP_000248.2.csv,MGDSEMAVFGAAAPYLRKSEKERLEAQTRPFDLKKDVFVPDDKQEFVKAKIVSREGGKVTAETEYGKTVTVKEDQVMQQNPPKFDKIEDMAMLTFLHEPAVLYNLKDRYGSWMIYTYSGLFCVTVNPYKWLPVYTPEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGESGAGKTVNTKRVIQYFAVIAAIGDRSKKDQSPGKGTLEDQIIQANPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKLASADIETYLLEKSRVIFQLKAERDYHIFYQILSNKKPELLDMLLITNNPYDYAFISQGETTVASIDDAEELMATDNAFDVLGFTSEEKNSMYKLTGAIMHFGNMKFKLKQREEQAEPDGTEEADKSAYLMGLNSADLLKGLCHPRVKVGNEYVTKGQNVQQVIYATGALAKAVYERMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFNSFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLQACIDLIEKPMGIMSILEEECMFPKATDMTFKAKLFDNHLGKSANFQKPRNIKGKPEAHFSLIHYAGIVDYNIIGWLQKNKDPLNETVVGLYQKSSLKLLSTLFANYAGADAPIEKGKGKAKKGSSFQTVSALHRENLNKLMTNLRSTHPHFVRCIIPNETKSPGVMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRILNPAAIPEGQFIDSRKGAEKLLSSLDIDHNQYKFGHTKVFFKAGLLGLLEEMRDERLSRIITRIQAQSRGVLARMEYKKLLERRDSLLVIQWNIRAFMGVKNWPWMKLYFKIKPLLKSAEREKEMASMKEEFTRLKEALEKSEARRKELEEKMVSLLQEKNDLQLQVQAEQDNLADAEERCDQLIKNKIQLEAKVKEMNERLEDEEEMNAELTAKKRKLEDECSELKRDIDDLELTLAKVEKEKHATENKVKNLTEEMAGLDEIIAKLTKEKKALQEAHQQALDDLQAEEDKVNTLTKAKVKLEQQVDDLEGSLEQEKKVRMDLERAKRKLEGDLKLTQESIMDLENDKQQLDERLKKKDFELNALNARIEDEQALGSQLQKKLKELQARIEELEEELEAERTARAKVEKLRSDLSRELEEISERLEEAGGATSVQIEMNKKREAEFQKMRRDLEEATLQHEATAAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSEFKLELDDVTSNMEQIIKAKANLEKMCRTLEDQMNEHRSKAEETQRSVNDLTSQRAKLQTENGELSRQLDEKEALISQLTRGKLTYTQQLEDLKRQLEEEVKAKNALAHALQSARHDCDLLREQYEEETEAKAELQRVLSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQEAEEAVEAVNAKCSSLEKTKHRLQNEIEDLMVDVERSNAAAAALDKKQRNFDKILAEWKQKYEESQSELESSQKEARSLSTELFKLKNAYEESLEHLETFKRENKNLQEEISDLTEQLGSSGKTIHELEKVRKQLEAEKMELQSALEEAEASLEHEEGKILRAQLEFNQIKAEIERKLAEKDEEMEQAKRNHLRVVDSLQTSLDAETRSRNEALRVKKKMEGDLNEMEIQLSHANRMAAEAQKQVKSLQSLLKDTQIQLDDAVRANDDLKENIAIVERRNNLLQAELEELRAVVEQTERSRKLAEQELIETSERVQLLHSQNTSLINQKKKMDADLSQLQTEVEEAVQECRNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNMEQTIKDLQHRLDEAEQIALKGGKKQLQKLEARVRELENELEAEQKRNAESVKGMRKSERRIKELTYQTEEDRKNLLRLQDLVDKLQLKVKAYKRQAEEAEEQANTNLSKFRKVQHELDEAEERADIAESQVNKLRAKSRDIGTKGLNEE,mutated_sequence,1.0,1935.0,NP_000248.2.a2m,NP_000248.2.npy,ClinVar
+NP_000251.3,NP_000251.3.csv,MVILQQGDHVWMDLRLGQEFDVPIGAVVKLCDSGQVQVVDDEDNEHWISPQNATHIKPMHPTSVHGVEDMIRLGDLNEAGILRNLLIRYRDHLIYTYTGSILVAVNPYQLLSIYSPEHIRQYTNKKIGEMPPHIFAIADNCYFNMKRNSRDQCCIISGESGAGKTESTKLILQFLAAISGQHSWIEQQVLEATPILEAFGNAKTIRNDNSSRFGKYIDIHFNKRGAIEGAKIEQYLLEKSRVCRQALDERNYHVFYCMLEGMSEDQKKKLGLGQASDYNYLAMGNCITCEGRVDSQEYANIRSAMKVLMFTDTENWEISKLLAAILHLGNLQYEARTFENLDACEVLFSPSLATAASLLEVNPPDLMSCLTSRTLITRGETVSTPLSREQALDVRDAFVKGIYGRLFVWIVDKINAAIYKPPSQDVKNSRRSIGLLDIFGFENFAVNSFEQLCINFANEHLQQFFVRHVFKLEQEEYDLESIDWLHIEFTDNQDALDMIANKPMNIISLIDEESKFPKGTDTTMLHKLNSQHKLNANYIPPKNNHETQFGINHFAGIVYYETQGFLEKNRDTLHGDIIQLVHSSRNKFIKQIFQADVAMGAETRKRSPTLSSQFKRSLELLMRTLGACQPFFVRCIKPNEFKKPMLFDRHLCVRQLRYSGMMETIRIRRAGYPIRYSFVEFVERYRVLLPGVKPAYKQGDLRGTCQRMAEAVLGTHDDWQIGKTKIFLKDHHDMLLEVERDKAITDRVILLQKVIRGFKDRSNFLKLKNAATLIQRHWRGHNCRKNYGLMRLGFLRLQALHRSRKLHQQYRLARQRIIQFQARCRAYLVRKAFRHRLWAVLTVQAYARGMIARRLHQRLRAEYLWRLEAEKMRLAEEEKLRKEMSAKKAKEEAERKHQERLAQLAREDAERELKEKEAARRKKELLEQMERARHEPVNHSDMVDKMFGFLGTSGGLPGQEGQAPSGFEDLERGRREMVEEDLDAALPLPDEDEEDLSEYKFAKFAATYFQGTTTHSYTRRPLKQPLLYHDDEGDQLAALAVWITILRFMGDLPEPKYHTAMSDGSEKIPVMTKIYETLGKKTYKRELQALQGEGEAQLPEGQKKSSVRHKLVHLTLKKKSKLTEEVTKRLHDGESTVQGNSMLEDRPTSNLEKLHFIIGNGILRPALRDEIYCQISKQLTHNPSKSSYARGWILVSLCVGCFAPSEKFVKYLRNFIHGGPPGYAPYCEERLRRTFVNGTRTQPPSWLELQATKSKKPIMLPVTFMDGTTKTLLTDSATTAKELCNALADKISLKDRFGFSLYIALFDKVSSLGSGSDHVMDAISQCEQYAKEQGAQERNAPWRLFFRKEVFTPWHSPSEDNVATNLIYQQVVRGVKFGEYRCEKEDDLAELASQQYFVDYGSEMILERLLNLVPTYIPDREITPLKTLEKWAQLAIAAHKKGIYAQRRTDAQKVKEDVVSYARFKWPLLFSRFYEAYKFSGPSLPKNDVIVAVNWTGVYFVDEQEQVLLELSFPEIMAVSSSRECRVWLSLGCSDLGCAAPHSGWAGLTPAGPCSPCWSCRGAKTTAPSFTLATIKGDEYTFTSSNAEDIRDLVVTFLEGLRKRSKYVVALQDNPNPAGEESGFLSFAKGDLIILDHDTGEQVMNSGWANGINERTKQRGDFPTDSVYVMPTVTMPPREIVALVTMTPDQRQDVVRLLQLRTAEPEVRAKPYTLEEFSYDYFRPPPKHTLSRVMVSKARGKDRLWSHTREPLKQALLKKLLGSEELSQEACLAFIAVLKYMGDYPSKRTRSVNELTDQIFEGPLKAEPLKDEAYVQILKQLTDNHIRYSEERGWELLWLCTGLFPPSNILLPHVQRFLQSRKHCPLAIDCLQRLQKALRNGSRKYPPHLVEVEAIQHKTTQIFHKVYFPDDTDEAFEVESSTKAKDFCQNIATRLLLKSSEGFSLFVKIADKVLSVPENDFFFDFVRHLTDWIKKARPIKDGIVPSLTYQVFFMKKLWTTTVPGKDPMADSIFHYYQELPKYLRGYHKCTREEVLQLGALIYRVKFEEDKSYFPSIPKLLRELVPQDLIRQVSPDDWKRSIVAYFNKHAGKSKEEAKLAFLKLIFKWPTFGSAFFEVKQTTEPNFPEILLIAINKYGVSLIDPKTKDILTTHPFTKISNWSSGNTYFHITIGNLVRGSKLLCETSLGYKMDDLLTSYISQMLTAMSKQRGSRSGK,mutated_sequence,1.0,2215.0,NP_000251.3.a2m,NP_000251.3.npy,ClinVar
+NP_000252.1,NP_000252.1.csv,MRFFCARCCSFGPEMPAVQLLLLACLVWDVGARTAQLRKANDQSGRCQYTFSVASPNESSCPEQSQAMSVIHNLQRDSSTQRLDLEATKARLSSLESLLHQLTLDQAARPQETQEGLQRELGTLRRERDQLETQTRELETAYSNLLRDKSVLEEEKKRLRQENENLARRLESSSQEVARLRRGQCPQTRDTARAVPPGSREVSTWNLDTLAFQELKSELTEVPASRILKESPSGYLRSGEGDTGCGELVWVGEPLTLRTAETITGKYGVWMRDPKPTYPYTQETTWRIDTVGTDVRQVFEYDLISQFMQGYPSKVHILPRPLESTGAVVYSGSLYFQGAESRTVIRYELNTETVKAEKEIPGAGYHGQFPYSWGGYTDIDLAVDEAGLWVIYSTDEAKGAIVLSKLNPENLELEQTWETNIRKQSVANAFIICGTLYTVSSYTSADATVNFAYDTGTGISKTLTIPFKNRYKYSSMIDYNPLEKKLFAWDNLNMVTYDIKLSKM,mutated_sequence,1.0,504.0,NP_000252.1.a2m,NP_000252.1.npy,ClinVar
+NP_000255.2,NP_000255.2.csv,MASAGNAAEPQDRGGGGSGCIGAPGRPAGGGRRRRTGGLRRAAAPDRDYLHRPSYCDAAFALEQISKGKATGRKAPLWLRAKFQRLLFKLGCYIQKNCGKFLVVGLLIFGAFAVGLKAANLETNVEELWVEVGGRVSRELNYTRQKIGEEAMFNPQLMIQTPKEEGANVLTTEALLQHLDSALQASRVHVYMYNRQWKLEHLCYKSGELITETGYMDQIIEYLYPCLIITPLDCFWEGAKLQSGTAYLLGKPPLRWTNFDPLEFLEELKKINYQVDSWEEMLNKAEVGHGYMDRPCLNPADPDCPATAPNKNSTKPLDMALVLNGGCHGLSRKYMHWQEELIVGGTVKNSTGKLVSAHALQTMFQLMTPKQMYEHFKGYEYVSHINWNEDKAAAILEAWQRTYVEVVHQSVAQNSTQKVLSFTTTTLDDILKSFSDVSVIRVASGYLLMLAYACLTMLRWDCSKSQGAVGLAGVLLVALSVAAGLGLCSLIGISFNAATTQVLPFLALGVGVDDVFLLAHAFSETGQNKRIPFEDRTGECLKRTGASVALTSISNVTAFFMAALIPIPALRAFSLQAAVVVVFNFAMVLLIFPAILSMDLYRREDRRLDIFCCFTSPCVSRVIQVEPQAYTDTHDNTRYSPPPPYSSHSFAHETQITMQSTVQLRTEYDPHTHVYYTTAEPRSEISVQPVTVTQDTLSCQSPESTSSTRDLLSQFSDSSLHCLEPPCTKWTLSSFAEKHYAPFLLKPKAKVVVIFLFLGLLGVSLYGTTRVRDGLDLTDIVPRETREYDFIAAQFKYFSFYNMYIVTQKADYPNIQHLLYDLHRSFSNVKYVMLEENKQLPKMWLHYFRDWLQGLQDAFDSDWETGKIMPNNYKNGSDDGVLAYKLLVQTGSRDKPIDISQLTKQRLVDADGIINPSAFYIYLTAWVSNDPVAYAASQANIRPHRPEWVHDKADYMPETRLRIPAAEPIEYAQFPFYLNGLRDTSDFVEAIEKVRTICSNYTSLGLSSYPNGYPFLFWEQYIGLRHWLLLFISVVLACTFLVCAVFLLNPWTAGIIVMVLALMTVELFGMMGLIGIKLSAVPVVILIASVGIGVEFTVHVALAFLTAIGDKNRRAVLALEHMFAPVLDGAVSTLLGVLMLAGSEFDFIVRYFFAVLAILTILGVLNGLVLLPVLLSFFGPYPEVSPANGLNRLPTPSPEPPPSVVRFAMPPGHTHSGSDSSDSEYSSQTTVSGLSEELRHYEAQQGAGGPAHQVIVEATENPVFAHSTVVHPESRHHPPSNPRQQPHLDSGSLPPGRQGQQPRRDPPREGLWPPPYRPRRDAFEISTEGHSGPSNRARWGPRGARSHNPRNPASTAMGSSVPGYCQPITTVTASASVTVAVHPPPVPGPGRNPRGGLCPGYPETDHGLFEDPHVPFHVRCERRDSKVEVIELQDVECEERPRGSSSN,mutated_sequence,1.0,1447.0,NP_000255.2.a2m,NP_000255.2.npy,ClinVar
+NP_000257.1,NP_000257.1.csv,MRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS,mutated_sequence,1.0,133.0,NP_000257.1.a2m,NP_000257.1.npy,ClinVar
+NP_000259.1,NP_000259.1.csv,MAGAIASRMSFSSLKRKQPKTFTVRIVTMDAEMEFNCEMKWKGKDLFDLVCRTLGLRETWFFGLQYTIKDTVAWLKMDKKVLDHDVSKEEPVTFHFLAKFYPENAEEELVQEITQHLFFLQVKKQILDEKIYCPPEASVLLASYAVQAKYGDYDPSVHKRGFLAQEELLPKRVINLYQMTPEMWEERITAWYAEHRGRARDEAEMEYLKIAQDLEMYGVNYFAIRNKKGTELLLGVDALGLHIYDPENRLTPKISFPWNEIRNISYSDKEFTIKPLDKKIDVFKFNSSKLRVNKLILQLCIGNHDLFMRRRKADSLEVQQMKAQAREEKARKQMERQRLAREKQMREEAERTRDELERRLLQMKEEATMANEALMRSEETADLLAEKAQITEEEAKLLAQKAAEAEQEMQRIKATAIRTEEEKRLMEQKVLEAEVLALKMAEESERRAKEADQLKQDLQEAREAERRAKQKLLEIATKPTYPPMNPIPAPLPPDIPSFNLIGDSLSFDFKDTDMKRLSMEIEKEKVEYMEKSKHLQEQLNELKTEIEALKLKERETALDILHNENSDRGGSSKHNTIKKLTLQSAKSRVAFFEEL,mutated_sequence,1.0,595.0,NP_000259.1.a2m,NP_000259.1.npy,ClinVar
+NP_000262.2,NP_000262.2.csv,MTARGLALGLLLLLLCPAQVFSQSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGNVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFYNLLNLFCELTCSPRQSQFLNVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDADACNATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVCGPKPQPPPPPAPWTILGLDAMYVIMWITYMAFLLVFFGAFFAVWCYRKRYFVSEYTPIDSNIAFSVNASDKGEASCCDPVSAAFEGCLRRLFTRWGSFCVRNPGCVIFFSLVFITACSSGLVFVRVTTNPVDLWSAPSSQARLEKEYFDQHFGPFFRTEQLIIRAPLTDKHIYQPYPSGADVPFGPPLDIQILHQVLDLQIAIENITASYDNETVTLQDICLAPLSPYNTNCTILSVLNYFQNSHSVLDHKKGDDFFVYADYHTHFLYCVRAPASLNDTSLLHDPCLGTFGGPVFPWLVLGGYDDQNYNNATALVITFPVNNYYNDTEKLQRAQAWEKEFINFVKNYKNPNLTISFTAERSIEDELNRESDSDVFTVVISYAIMFLYISLALGHMKSCRRLLVDSKVSLGIAGILIVLSSVACSLGVFSYIGLPLTLIVIEVIPFLVLAVGVDNIFILVQAYQRDERLQGETLDQQLGRVLGEVAPSMFLSSFSETVAFFLGALSVMPAVHTFSLFAGLAVFIDFLLQITCFVSLLGLDIKRQEKNRLDIFCCVRGAEDGTSVQASESCLFRFFKNSYSPLLLKDWMRPIVIAIFVGVLSFSIAVLNKVDIGLDQSLSMPDDSYMVDYFKSISQYLHAGPPVYFVLEEGHDYTSSKGQNMVCGGMGCNNDSLVQQIFNAAQLDNYTRIGFAPSSWIDDYFDWVKPQSSCCRVDNITDQFCNASVVDPACVRCRPLTPEGKQRPQGGDFMRFLPMFLSDNPNPKCGKGGHAAYSSAVNILLGHGTRVGATYFMTYHTVLQTSADFIDALKKARLIASNVTETMGINGSAYRVFPYSVFYVFYEQYLTIIDDTIFNLGVSLGAIFLVTMVLLGCELWSAVIMCATIAMVLVNMFGVMWLWGISLNAVSLVNLVMSCGISVEFCSHITRAFTVSMKGSRVERAEEALAHMGSSVFSGITLTKFGGIVVLAFAKSQIFQIFYFRMYLAMVLLGATHGLIFLPVLLSYIGPSVNKAKSCATEERYKGTERERLLNF,mutated_sequence,1.0,1278.0,NP_000262.2.a2m,NP_000262.2.npy,ClinVar
+NP_000265.1,NP_000265.1.csv,MFSKLAHLQRFAVLSRGVHSSVASATSVATKKTVQGPPTSDDIFEREYKYGAHNYHPLPVALERGKGIYLWDVEGRKYFDFLSSYSAVNQGHCHPKIVNALKSQVDKLTLTSRAFYNNVLGEYEEYITKLFNYHKVLPMNTGVEAGETACKLARKWGYTVKGIQKYKAKIVFAAGNFWGRTLSAISSSTDPTSYDGFGPFMPGFDIIPYNDLPALERALQDPNVAAFMVEPIQGEAGVVVPDPGYLMGVRELCTRHQVLFIADEIQTGLARTGRWLAVDYENVRPDIVLLGKALSGGLYPVSAVLCDDDIMLTIKPGEHGSTYGGNPLGCRVAIAALEVLEEENLAENADKLGIILRNELMKLPSDVVTAVRGKGLLNAIVIKETKDWDAWKVCLRLRDNGLLAKPTHGDIIRFAPPLVIKEDELRESIEIINKTILSF,mutated_sequence,1.0,439.0,NP_000265.1.a2m,NP_000265.1.npy,ClinVar
+NP_000266.2,NP_000266.2.csv,MHLEGRDGRRYPGAPAVELLQTSVPSGLAELVAGKRRLPRGAGGADPSHSCPRGAAGQSSWAPAGQEFASFLTKGRSHSSLPQMSSSRSKDSCFTENTPLLRNSLQEKGSRCIPVYHPEFITAEESWEDSSADWERRYLLSREVSGLSASASSEKGDLLDSPHIRLRLSKLRRCVQWLKVMGLFAFVVLCSILFSLYPDQGKLWQLLALSPLENYSVNLSSHVDSTLLQVDLAGALVASGPSRPGREEHIVVELTQADALGSRWRRPQQVTHNWTVYLNPRRSEHSVMSRTFEVLTRETVSISIRASLQQTQAVPLLMAHQYLRGSVETQVTIATAILAGVYALIIFEIVHRTLAAMLGSLAALAALAVIGDRPSLTHVVEWIDFETLALLFGMMILVAIFSETGFFDYCAVKAYRLSRGRVWAMIIMLCLIAAVLSAFLDNVTTMLLFTPVTIRLCEVLNLDPRQVLIAEVIFTNIGGAATAIGDPPNVIIVSNQELRKMGLDFAGFTAHMFIGICLVLLVCFPLLRLLYWNRKLYNKEPSEIVELKHEIHVWRLTAQRISPASREETAVRRLLLGKVLALEHLLARRLHTFHRQISQEDKNWETNIQELQKKHRISDGILLAKCLTVLGFVIFMFFLNSFVPGIHLDLGWIAILGAIWLLILADIHDFEIILHRVEWATLLFFAALFVLMEALAHLHLIEYVGEQTALLIKMVPEEQRLIAAIVLVVWVSALASSLIDNIPFTATMIPVLLNLSHDPEVGLPAPPLMYALAFGACLGGNGTLIGASANVVCAGIAEQHGYGFSFMEFFRLGFPMMVVSCTVGMCYLLVAHVVVGWN,mutated_sequence,1.0,838.0,NP_000266.2.a2m,NP_000266.2.npy,ClinVar
+NP_000267.2,NP_000267.2.csv,MEPPLPVGAQPLATVEGMEMKGPLREPCALTLAQRNGQYELIIQLHEKEQHVQDIIPINSHFRCVQEAEETLLIDIASNSGCKIRVQGDWIRERRFEIPDEEHCLKFLSAVLAAQKAQSQLLVPEQKDSSSWYQKLDTKDKPSVFSGLLGFEDNFSSMNLDKKINSQNQPTGIHREPPPPPFSVNKMLPREKEASNKEQPKVTNTMRKLFVPNTQSGQREGLIKHILAKREKEYVNIQTFRFFVGTWNVNGQSPDSGLEPWLNCDPNPPDIYCIGFQELDLSTEAFFYFESVKEQEWSMAVERGLHSKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHNTTFCIVNSHLAAHVEDFERRNQDYKDICARMSFVVPNQTLPQLNIMKHEVVIWLGDLNYRLCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRWDSSGKCRVPAWCDRILWRGTNVNQLNYRSHMELKTSDHKPVSALFHIGVKVVDERRYRKVFEDSVRIMDRMENDFLPSLELSRREFVFENVKFRQLQKEKFQISNNGQVPCHFSFIPKLNDSQYCKPWLRAEPFEGYLEPNETVDISLDVYVSKDSVTILNSGEDKIEDILVLHLDRGKDYFLTISGNYLPSCFGTSLEALCRMKRPIREVPVTKLIDLEEDSFLEKEKSLLQMVPLDEGASERPLQVPKEIWLLVDHLFKYACHQEDLFQTPGMQEELQQIIDCLDTSIPETIPGSNHSVAEALLIFLEALPEPVICYELYQRCLDSAYDPRICRQVISQLPRCHRNVFRYLMAFLRELLKFSEYNSVNANMIATLFTSLLLRPPPNLMARQTPSDRQRAIQFLLGFLLGSEED,mutated_sequence,1.0,901.0,NP_000267.2.a2m,NP_000267.2.npy,ClinVar
+NP_000268.1,NP_000268.1.csv,MSTAVLENPGLGRKLSDFGQETSYIEDNCNQNGAISLIFSLKEEVGALAKVLRLFEENDVNLTHIESRPSRLKKDEYEFFTHLDKRSLPALTNIIKILRHDIGATVHELSRDKKKDTVPWFPRTIQELDRFANQILSYGAELDADHPGFKDPVYRARRKQFADIAYNYRHGQPIPRVEYMEEEKKTWGTVFKTLKSLYKTHACYEYNHIFPLLEKYCGFHEDNIPQLEDVSQFLQTCTGFRLRPVAGLLSSRDFLGGLAFRVFHCTQYIRHGSKPMYTPEPDICHELLGHVPLFSDRSFAQFSQEIGLASLGAPDEYIEKLATIYWFTVEFGLCKQGDSIKAYGAGLLSSFGELQYCLSEKPKLLPLELEKTAIQNYTVTEFQPLYYVAESFNDAKEKVRNFAATIPRPFSVRYDPYTQRIEVLDNTQQLKILADSINSEIGILCSALQKIK,mutated_sequence,1.0,452.0,NP_000268.1.a2m,NP_000268.1.npy,ClinVar
+NP_000269.3,NP_000269.3.csv,MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEYKRQNPTMFAWEIRDRLLAEGICDNDTVPSVSSINRIIRTKVQQPFHPTPDGAGTGVTAPGHTIVPSTASPPVSSASNDPVGSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQSGVDSLRKHLRADTFTQQQLEALDRVFERPSYPDVFQASEHIKSEQGNEYSLPALTPGLDEVKSSLSASTNPELGSNVSGTQTYPVVTGRDMASTTLPGYPPHVPPTGQGSYPTSTLAGMVPGSEFSGNPYSHPQYTAYNEAWRFSNPALLSSPYYYSAAPRGSAPAAAAAAYDRH,mutated_sequence,1.0,394.0,NP_000269.3.a2m,NP_000269.3.npy,ClinVar
+NP_000273.2,NP_000273.2.csv,MAGFWVGTAPLVAAGRRGRWPPQQLMLSAALRTLKHVLYYSRQCLMVSRNLGSVGYDPNEKTFDKILVANRGEIACRVIRTCKKMGIKTVAIHSDVDASSVHVKMADEAVCVGPAPTSKSYLNMDAIMEAIKKTRAQAVHPGYGFLSENKEFARCLAAEDVVFIGPDTHAIQAMGDKIESKLLAKKAEVNTIPGFDGVVKDAEEAVRIAREIGYPVMIKASAGGGGKGMRIAWDDEETRDGFRLSSQEAASSFGDDRLLIEKFIDNPRHIEIQVLGDKHGNALWLNERECSIQRRNQKVVEEAPSIFLDAETRRAMGEQAVALARAVKYSSAGTVEFLVDSKKNFYFLEMNTRLQVEHPVTECITGLDLVQEMIRVAKGYPLRHKQADIRINGWAVECRVYAEDPYKSFGLPSIGRLSQYQEPLHLPGVRVDSGIQPGSDISIYYDPMISKLITYGSDRTEALKRMADALDNYVIRGVTHNIALLREVIINSRFVKGDISTKFLSDVYPDGFKGHMLTKSEKNQLLAIASSLFVAFQLRAQHFQENSRMPVIKPDIANWELSVKLHDKVHTVVASNNGSVFSVEVDGSKLNVTSTWNLASPLLSVSVDGTQRTVQCLSREAGGNMSIQFLGTVYKVNILTRLAAELNKFMLEKVTEDTSSVLRSPMPGVVVAVSVKPGDAVAEGQEICVIEAMKMQNSMTAGKTGTVKSVHCQAGDTVGEGDLLVELE,mutated_sequence,1.0,728.0,NP_000273.2.a2m,NP_000273.2.npy,ClinVar
+NP_000275.1,NP_000275.1.csv,MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS,mutated_sequence,1.0,390.0,NP_000275.1.a2m,NP_000275.1.npy,ClinVar
+NP_000276.2,NP_000276.2.csv,MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK,mutated_sequence,1.0,493.0,NP_000276.2.a2m,NP_000276.2.npy,ClinVar
+NP_000277.1,NP_000277.1.csv,MAEHGAHFTAASVADDQPSIFEVVAQDSLMTAVRPALQHVVKVLAESNPTHYGFLWRWFDEIFTLLDLLLQQHYLSRTSASFSENFYGLKRIVMGDTHKSQRLASAGLPKQQLWKSIMFLVLLPYLKVKLEKLVSSLREEDEYSIHPPSSRWKRFYRAFLAAYPFVNMAWEGWFLVQQLRYILGKAQHHSPLLRLAGVQLGRLTVQDIQALEHKPAKASMMQQPARSVSEKINSALKKAVGGVALSLSTGLSVGVFFLQFLDWWYSSENQETIKSLTALPTPPPPVHLDYNSDSPLLPKMKTVCPLCRKTRVNDTVLATSGYVFCYRCVFHYVRSHQACPITGYPTEVQHLIKLYSPEN,mutated_sequence,1.0,359.0,NP_000277.1.a2m,NP_000277.1.npy,ClinVar
+NP_000282.1,NP_000282.1.csv,MSLSNKLTLDKLDVKGKRVVMRVDFNVPMKNNQITNNQRIKAAVPSIKFCLDNGAKSVVLMSHLGRPDGVPMPDKYSLEPVAVELKSLLGKDVLFLKDCVGPEVEKACANPAAGSVILLENLRFHVEEEGKGKDASGNKVKAEPAKIEAFRASLSKLGDVYVNDAFGTAHRAHSSMVGVNLPQKAGGFLMKKELNYFAKALESPERPFLAILGGAKVADKIQLINNMLDKVNEMIIGGGMAFTFLKVLNNMEIGTSLFDEEGAKIVKDLMSKAEKNGVKITLPVDFVTADKFDENAKTGQATVASGIPAGWMGLDCGPESSKKYAEAVTRAKQIVWNGPVGVFEWEAFARGTKALMDEVVKATSRGCITIIGGGDTATCCAKWNTEDKVSHVSTGGGASLELLEGKVLPGVDALSNI,mutated_sequence,1.0,417.0,NP_000282.1.a2m,NP_000282.1.npy,ClinVar
+NP_000283.1,NP_000283.1.csv,MRSRSNSGVRLDGYARLVQQTILCYQNPVTGLLSASHEQKDAWVRDNIYSILAVWGLGMAYRKNADRDEDKAKAYELEQNVVKLMRGLLQCMMRQVAKVEKFKHTQSTKDSLHAKYNTATCGTVVGDDQWGHLQVDATSLFLLFLAQMTASGLRIIFTLDEVAFIQNLVFYIEAAYKVADYGMWERGDKTNQGIPELNASSVGMAKAALEAIDELDLFGAHGGRKSVIHVLPDEVEHCQSILFSMLPRASTSKEIDAGLLSIISFPAFAVEDVNLVNVTKNEIISKLQGRYGCCRFLRDGYKTPREDPNRLHYDPAELKLFENIECEWPVFWTYFIIDGVFSGDAVQVQEYREALEGILIRGKNGIRLVPELYAVPPNKVDEEYKNPHTVDRVPMGKVPHLWGQSLYILSSLLAEGFLAAGEIDPLNRRFSTSVKPDVVVQVTVLAENNHIKDLLRKHGVNVQSIADIHPIQVQPGRILSHIYAKLGRNKNMNLSGRPYRHIGVLGTSKLYVIRNQIFTFTPQFTDQHHFYLALDNEMIVEMLRIELAYLCTCWRMTGRPTLTFPISRTMLTNDGSDIHSAVLSTIRKLEDGYFGGARVKLGNLSEFLTTSFYTYLTFLDPDCDEKLFDNASEGTFSPDSDSDLVGYLEDTCNQESQDELDHYINHLLQSTSLRSYLPPLCKNTEDRHVFSAIHSTRDILSVMAKAKGLEVPFVPMTLPTKVLSAHRKSLNLVDSPQPLLEKVPESDFQWPRDDHGDVDCEKLVEQLKDCSNLQDQADILYILYVIKGPSWDTNLSGQHGVTVQNLLGELYGKAGLNQEWGLIRYISGLLRKKVEVLAEACTDLLSHQKQLTVGLPPEPREKIISAPLPPEELTKLIYEASGQDISIAVLTQEIVVYLAMYVRAQPSLFVEMLRLRIGLIIQVMATELARSLNCSGEEASESLMNLSPFDMKNLLHHILSGKEFGVERSVRPIHSSTSSPTISIHEVGHTGVTKTERSGINRLRSEMKQMTRRFSADEQFFSVGQAASSSAHSSKSARSSTPSSPTGTSSSDSGGHHIGWGERQGQWLRRRRLDGAINRVPVGFYQRVWKILQKCHGLSIDGYVLPSSTTREMTPHEIKFAVHVESVLNRVPQPEYRQLLVEAIMVLTLLSDTEMTSIGGIIHVDQIVQMASQLFLQDQVSIGAMDTLEKDQATGICHFFYDSAPSGAYGTMTYLTRAVASYLQELLPNSGCQMQ,mutated_sequence,1.0,1235.0,NP_000283.1.a2m,NP_000283.1.npy,ClinVar
+NP_000286.3,NP_000286.3.csv,MPSSVSWGILLLAGLCCLVPVSLAEDPQGDAAQKTDTSHHDQDHPTFNKITPNLAEFAFSLYRQLAHQSNSTNIFFSPVSIATAFAMLSLGTKADTHDEILEGLNFNLTEIPEAQIHEGFQELLRTLNQPDSQLQLTTGNGLFLSEGLKLVDKFLEDVKKLYHSEAFTVNFGDTEEAKKQINDYVEKGTQGKIVDLVKELDRDTVFALVNYIFFKGKWERPFEVKDTEEEDFHVDQVTTVKVPMMKRLGMFNIQHCKKLSSWVLLMKYLGNATAIFFLPDEGKLQHLENELTHDIITKFLENEDRRSASLHLPKLSITGTYDLKSVLGQLGITKVFSNGADLSGVTEEAPLKLSKAVHKAVLTIDEKGTEAAGAMFLEAIPMSIPPEVKFNKPFVFLMIEQNTKSPLFMGKVVNPTQK,mutated_sequence,1.0,418.0,NP_000286.3.a2m,NP_000286.3.npy,ClinVar
+NP_000288.1,NP_000288.1.csv,MVNSSRVQPQQPGDAKRPPAPRAPDPGRLMAGCAAVGASLAAPGGLCEQRGLEIEMQRIRQAAARDPPAGAAASPSPPLSSCSRQAWSRDNPGFEAEEEEEEVEGEEGGMVVEMDVEWRPGSRRSAASSAVSSVGARSRGLGGYHGAGHPSGRRRRREDQGPPCPSPVGGGDPLHRHLPLEGQPPRVAWAERLVRGLRGLWGTRLMEESSTNREKYLKSVLRELVTYLLFLIVLCILTYGMMSSNVYYYTRMMSQLFLDTPVSKTEKTNFKTLSSMEDFWKFTEGSLLDGLYWKMQPSNQTEADNRSFIFYENLLLGVPRIRQLRVRNGSCSIPQDLRDEIKECYDVYSVSSEDRAPFGPRNGTAWIYTSEKDLNGSSHWGIIATYSGAGYYLDLSRTREETAAQVASLKKNVWLDRGTRATFIDFSVYNANINLFCVVRLLVEFPATGGVIPSWQFQPLKLIRYVTTFDFFLAACEIIFCFFIFYYVVEEILEIRIHKLHYFRSFWNCLDVVIVVLSVVAIGINIYRTSNVEVLLQFLEDQNTFPNFEHLAYWQIQFNNIAAVTVFFVWIKLFKFINFNRTMSQLSTTMSRCAKDLFGFAIMFFIIFLAYAQLAYLVFGTQVDDFSTFQECIFTQFRIILGDINFAEIEEANRVLGPIYFTTFVFFMFFILLNMFLAIINDTYSEVKSDLAQQKAEMELSDLIRKGYHKALVKLKLKKNTVDDISESLRQGGGKLNFDELRQDLKGKGHTDAEIEAIFTKYDQDGDQELTEHEHQQMRDDLEKEREDLDLDHSSLPRPMSSRSFPRSLDDSEEDDDEDSGHSSRRRGSISSGVSYEEFQVLVRRVDRMEHSIGSIVSKIDAVIVKLEIMERAKLKRREVLGRLLDGVAEDERLGRDSEIHREQMERLVREELERWESDDAASQISHGLGTPVGLNGQPRPRSSRPSSSQSTEGMEGAGGNGSSNVHV,mutated_sequence,1.0,968.0,NP_000288.1.a2m,NP_000288.1.npy,ClinVar
+NP_000289.1,NP_000289.1.csv,MSIQENISSLQLRSWVSKSQRDLAKSILIGAPGGPAGYLRRASVAQLTQELGTAFFQQQQLPAAMADTFLEHLCLLDIDSEPVAARSTSIIATIGPASRSVERLKEMIKAGMNIARLNFSHGSHEYHAESIANVREAVESFAGSPLSYRPVAIALDTKGPEIRTGILQGGPESEVELVKGSQVLVTVDPAFRTRGNANTVWVDYPNIVRVVPVGGRIYIDDGLISLVVQKIGPEGLVTQVENGGVLGSRKGVNLPGAQVDLPGLSEQDVRDLRFGVEHGVDIVFASFVRKASDVAAVRAALGPEGHGIKIISKIENHEGVKRFDEILEVSDGIMVARGDLGIEIPAEKVFLAQKMMIGRCNLAGKPVVCATQMLESMITKPRPTRAETSDVANAVLDGADCIMLSGETAKGNFPVEAVKMQHAIAREAEAAVYHRQLFEELRRAAPLSRDPTEVTAIGAVEAAFKCCAAAIIVLTTTGRSAQLLSRYRPRAAVIAVTRSAQAARQVHLCRGVFPLLYREPPEAIWADDVDRRVQFGIESGKLRGFLRVGDLVIVVTGWRPGSGYTNIMRVLSIS,mutated_sequence,1.0,574.0,NP_000289.1.a2m,NP_000289.1.npy,ClinVar
+NP_000292.1,NP_000292.1.csv,MEHKEVVLLLLLFLKSGQGEPLDDYVNTQGASLFSVTKKQLGAGSIEECAAKCEEDEEFTCRAFQYHSKEQQCVIMAENRKSSIIIRMRDVVLFEKKVYLSECKTGNGKNYRGTMSKTKNGITCQKWSSTSPHRPRFSPATHPSEGLEENYCRNPDNDPQGPWCYTTDPEKRYDYCDILECEEECMHCSGENYDGKISKTMSGLECQAWDSQSPHAHGYIPSKFPNKNLKKNYCRNPDRELRPWCFTTDPNKRWELCDIPRCTTPPPSSGPTYQCLKGTGENYRGNVAVTVSGHTCQHWSAQTPHTHNRTPENFPCKNLDENYCRNPDGKRAPWCHTTNSQVRWEYCKIPSCDSSPVSTEQLAPTAPPELTPVVQDCYHGDGQSYRGTSSTTTTGKKCQSWSSMTPHRHQKTPENYPNAGLTMNYCRNPDADKGPWCFTTDPSVRWEYCNLKKCSGTEASVVAPPPVVLLPDVETPSEEDCMFGNGKGYRGKRATTVTGTPCQDWAAQEPHRHSIFTPETNPRAGLEKNYCRNPDGDVGGPWCYTTNPRKLYDYCDVPQCAAPSFDCGKPQVEPKKCPGRVVGGCVAHPHSWPWQVSLRTRFGMHFCGGTLISPEWVLTAAHCLEKSPRPSSYKVILGAHQEVNLEPHVQEIEVSRLFLEPTRKDIALLKLSSPAVITDKVIPACLPSPNYVVADRTECFITGWGETQGTFGAGLLKEAQLPVIENKVCNRYEFLNGRVQSTELCAGHLAGGTDSCQGDSGGPLVCFEKDKYILQGVTSWGLGCARPNKPGVYVRVSRFVTWIEGVMRNN,mutated_sequence,1.0,810.0,NP_000292.1.a2m,NP_000292.1.npy,ClinVar
+NP_000293.2,NP_000293.2.csv,MRPLLLLALLGWLLLAEAKGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKALEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIFQNLDGALDEVVLKFEMGHVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRRVGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVETPCPDVYWFPIFTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGYYTRAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP,mutated_sequence,1.0,727.0,NP_000293.2.a2m,NP_000293.2.npy,ClinVar
+NP_000301.1,NP_000301.1.csv,MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEKKIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAHIIPFLG,mutated_sequence,1.0,306.0,NP_000301.1.a2m,NP_000301.1.npy,ClinVar
+NP_000305.3,NP_000305.3.csv,MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV,mutated_sequence,1.0,403.0,NP_000305.3.a2m,NP_000305.3.npy,ClinVar
+NP_000308.1,NP_000308.1.csv,MSTEGGGRRCQAQVSRRISFSASHRLYSKFLSDEENLKLFGKCNNPNGHGHNYKVVVTVHGEIDPATGMVMNLADLKKYMEEAIMQPLDHKNLDMDVPYFADVVSTTENVAVYIWDNLQKVLPVGVLYKVKVYETDNNIVVYKGE,mutated_sequence,1.0,145.0,NP_000308.1.a2m,NP_000308.1.npy,ClinVar
+NP_000311.2,NP_000311.2.csv,MAAAAAAGEARRVLVYGGRGALGSRCVQAFRARNWWVASVDVVENEEASASIIVKMTDSFTEQADQVTAEVGKLLGEEKVDAILCVAGGWAGGNAKSKSLFKNCDLMWKQSIWTSTISSHLATKHLKEGGLLTLAGAKAALDGTPGMIGYGMAKGAVHQLCQSLAGKNSGMPPGAAAIAVLPVTLDTPMNRKSMPEADFSSWTPLEFLVETFHDWITGKNRPSSGSLIQVVTTEGRTELTPAYF,mutated_sequence,1.0,244.0,NP_000311.2.a2m,NP_000311.2.npy,ClinVar
+NP_000312.2,NP_000312.2.csv,MPPKTPRKTAATAAAAAAEPPAPPPPPPPEEDPEQDSGPEDLPLVRLEFEETEEPDFTALCQKLKIPDHVRERAWLTWEKVSSVDGVLGGYIQKKKELWGICIFIAAVDLDEMSFTFTELQKNIEISVHKFFNLLKEIDTSTKVDNAMSRLLKKYDVLFALFSKLERTCELIYLTQPSSSISTEINSALVLKVSWITFLLAKGEVLQMEDDLVISFQLMLCVLDYFIKLSPPMLLKEPYKTAVIPINGSPRTPRRGQNRSARIAKQLENDTRIIEVLCKEHECNIDEVKNVYFKNFIPFMNSLGLVTSNGLPEVENLSKRYEEIYLKNKDLDARLFLDHDKTLQTDSIDSFETQRTPRKSNLDEEVNVIPPHTPVRTVMNTIQQLMMILNSASDQPSENLISYFNNCTVNPKESILKRVKDIGYIFKEKFAKAVGQGCVEIGSQRYKLGVRLYYRVMESMLKSEEERLSIQNFSKLLNDNIFHMSLLACALEVVMATYSRSTSQNLDSGTDLSFPWILNVLNLKAFDFYKVIESFIKAEGNLTREMIKHLERCEHRIMESLAWLSDSPLFDLIKQSKDREGPTDHLESACPLNLPLQNNHTAADMYLSPVRSPKKKGSTTRVNSTANAETQATSAFQTQKPLKSTSLSLFYKKVYRLAYLRLNTLCERLLSEHPELEHIIWTLFQHTLQNEYELMRDRHLDQIMMCSMYGICKVKNIDLKFKIIVTAYKDLPHAVQETFKRVLIKEEEYDSIIVFYNSVFMQRLKTNILQYASTRPPTLSPIPHIPRSPYKFPSSPLRIPGGNIYISPLKSPYKISEGLPTPTKMTPRSRILVSIGESFGTSEKFQKINQMVCNSDRVLKRSAEGSNPPKPLKKLRFDIEGSDEADGSKHLPGESKFQQKLAEMTSTRTRMQKQKMNDSMDTSNKEEK,mutated_sequence,1.0,928.0,NP_000312.2.a2m,NP_000312.2.npy,ClinVar
+NP_000313.2,NP_000313.2.csv,MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHFVPNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVALCCFLLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFEIQWISNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSYDHQTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNPEESESESQGWLLERSVPETWKAFLESVKKLGKGNQVEAEGADAGQAPEAG,mutated_sequence,1.0,346.0,NP_000313.2.a2m,NP_000313.2.npy,ClinVar
+NP_000320.1,NP_000320.1.csv,MSIQVEHPAGGYKKLFETVEELSSPLTAHVTGRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRRFIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSRFFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQVDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQADKEDPISKSEIVVQFPCSDRFKPSYVHSFGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMGVWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKGFEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDKADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQAFEFPQINYQKYCGKPYTYAYGLGLNHFVPDRLCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDDGVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS,mutated_sequence,1.0,533.0,NP_000320.1.a2m,NP_000320.1.npy,ClinVar
+NP_000325.4,NP_000325.4.csv,MARPSLCTLVPLGPECLRPFTRESLAAIEQRAVEEEARLQRNKQMEIEEPERKPRSDLEAGKNLPMIYGDPPPEVIGIPLEDLDPYYSNKKTFIVLNKGKAIFRFSATPALYLLSPFSVVRRGAIKVLIHALFSMFIMITILTNCVFMTMSDPPPWSKNVEYTFTGIYTFESLIKILARGFCVDDFTFLRDPWNWLDFSVIMMAYLTEFVDLGNISALRTFRVLRALKTITVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALVGLQLFMGNLRQKCVRWPPPFNDTNTTWYSNDTWYGNDTWYGNEMWYGNDSWYANDTWNSHASWATNDTFDWDAYISDEGNFYFLEGSNDALLCGNSSDAGHCPEGYECIKTGRNPNYGYTSYDTFSWAFLALFRLMTQDYWENLFQLTLRAAGKTYMIFFVVIIFLGSFYLINLILAVVAMAYAEQNEATLAEDKEKEEEFQQMLEKFKKHQEELEKAKAAQALEGGEADGDPAHGKDCNGSLDTSQGEKGAPRQSSSGDSGISDAMEELEEAHQKCPPWWYKCAHKVLIWNCCAPWLKFKNIIHLIVMDPFVDLGITICIVLNTLFMAMEHYPMTEHFDNVLTVGNLVFTGIFTAEMVLKLIAMDPYEYFQQGWNIFDSIIVTLSLVELGLANVQGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKIALDCNLPRWHMHDFFHSFLIVFRILCGEWIETMWDCMEVAGQAMCLTVFLMVMVIGNLVVLNLFLALLLSSFSADSLAASDEDGEMNNLQIAIGRIKLGIGFAKAFLLGLLHGKILSPKDIMLSLGEADGAGEAGEAGETAPEDEKKEPPEEDLKKDNHILNHMGLADGPPSSLELDHLNFINNPYLTIQVPIASEESDLEMPTEEETDTFSEPEDSKKPPQPLYDGNSSVCSTADYKPPEEDPEEQAEENPEGEQPEECFTEACVQRWPCLYVDISQGRGKKWWTLRRACFKIVEHNWFETFIVFMILLSSGALAFEDIYIEQRRVIRTILEYADKVFTYIFIMEMLLKWVAYGFKVYFTNAWCWLDFLIVDVSIISLVANWLGYSELGPIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYYCINTTTSERFDISEVNNKSECESLMHTGQVRWLNVKVNYDNVGLGYLSLLQVATFKGWMDIMYAAVDSREKEEQPQYEVNLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGKDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPQNKIQGMVYDLVTKQAFDITIMILICLNMVTMMVETDNQSQLKVDILYNINMIFIIIFTGECVLKMLALRQYYFTVGWNIFDFVVVILSIVGLALSDLIQKYFVSPTLFRVIRLARIGRVLRLIRGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMSNFAYVKKESGIDDMFNFETFGNSIICLFEITTSAGWDGLLNPILNSGPPDCDPNLENPGTSVKGDCGNPSIGICFFCSYIIISFLIVVNMYIAIILENFNVATEESSEPLGEDDFEMFYETWEKFDPDATQFIAYSRLSDFVDTLQEPLRIAKPNKIKLITLDLPMVPGDKIHCLDILFALTKEVLGDSGEMDALKQTMEEKFMAANPSKVSYEPITTTLKRKHEEVCAIKIQRAYRRHLLQRSMKQASYMYRHSHDGSGDDAPEKEGLLANTMSKMYGHENGNSSSPSPEEKGEAGDAGPTMGLMPISPSDTAWPPAPPPGQTVRPGVKESLV,mutated_sequence,1.0,1836.0,NP_000325.4.a2m,NP_000325.4.npy,ClinVar
+NP_000326.2,NP_000326.2.csv,MANFLLPRGTSSFRRFTRESLAAIEKRMAEKQARGSTTLQESREGLPEEEAPRPQLDLQASKKLPDLYGNPPQELIGEPLEDLDPFYSTQKTFIVLNKGKTIFRFSATNALYVLSPFHPIRRAAVKILVHSLFNMLIMCTILTNCVFMAQHDPPPWTKYVEYTFTAIYTFESLVKILARGFCLHAFTFLRDPWNWLDFSVIIMAYTTEFVDLGNVSALRTFRVLRALKTISVISGLKTIVGALIQSVKKLADVMVLTVFCLSVFALIGLQLFMGNLRHKCVRNFTALNGTNGSVEADGLVWESLDLYLSDPENYLLKNGTSDVLLCGNSSDAGTCPEGYRCLKAGENPDHGYTSFDSFAWAFLALFRLMTQDCWERLYQQTLRSAGKIYMIFFMLVIFLGSFYLVNLILAVVAMAYEEQNQATIAETEEKEKRFQEAMEMLKKEHEALTIRGVDTVSRSSLEMSPLAPVNSHERRSKRRKRMSSGTEECGEDRLPKSDSEDGPRAMNHLSLTRGLSRTSMKPRSSRGSIFTFRRRDLGSEADFADDENSTAGESESHHTSLLVPWPLRRTSAQGQPSPGTSAPGHALHGKKNSTVDCNGVVSLLGAGDPEATSPGSHLLRPVMLEHPPDTTTPSEEPGGPQMLTSQAPCVDGFEEPGARQRALSAVSVLTSALEELEESRHKCPPCWNRLAQRYLIWECCPLWMSIKQGVKLVVMDPFTDLTITMCIVLNTLFMALEHYNMTSEFEEMLQVGNLVFTGIFTAEMTFKIIALDPYYYFQQGWNIFDSIIVILSLMELGLSRMSNLSVLRSFRLLRVFKLAKSWPTLNTLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKNYSELRDSDSGLLPRWHMMDFFHAFLIIFRILCGEWIETMWDCMEVSGQSLCLLVFLLVMVIGNLVVLNLFLALLLSSFSADNLTAPDEDREMNNLQLALARIQRGLRFVKRTTWDFCCGLLRQRPQKPAALAAQGQLPSCIATPYSPPPPETEKVPPTRKETRFEEGEQPGQGTPGDPEPVCVPIAVAESDTDDQEEDEENSLGTEEESSKQESQPVSGGPEAPPDSRTWSQVSATASSEAEASASQADWRQQWKAEPQAPGCGETPEDSCSEGSTADMTNTAELLEQIPDLGQDVKDPEDCFTEGCVRRCPCCAVDTTQAPGKVWWRLRKTCYHIVEHSWFETFIIFMILLSSGALAFEDIYLEERKTIKVLLEYADKMFTYVFVLEMLLKWVAYGFKKYFTNAWCWLDFLIVDVSLVSLVANTLGFAEMGPIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFGRCINQTEGDLPLNYTIVNNKSQCESLNLTGELYWTKVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRGYEEQPQWEYNLYMYIYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIFDIVTKQAFDVTIMFLICLNMVTMMVETDDQSPEKINILAKINLLFVAIFTGECIVKLAALRHYYFTNSWNIFDFVVVILSIVGTVLSDIIQKYFFSPTLFRVIRLARIGRILRLIRGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMANFAYVKWEAGIDDMFNFQTFANSMLCLFQITTSAGWDGLLSPILNTGPPYCDPTLPNSNGSRGDCGSPAVGILFFTTYIIISFLIVVNMYIAIILENFSVATEESTEPLSEDDFDMFYEIWEKFDPEATQFIEYSVLSDFADALSEPLRIAKPNQISLINMDLPMVSGDRIHCMDILFAFTKRVLGESGEMDALKIQMEEKFMAANPSKISYEPITTTLRRKHEEVSAMVIQRAFRRHLLQRSLKHASFLFRQQAGSGLSEEDAPEREGLIAYVMSENFSRPLGPPSSSSISSTSFPPSYDSVTRATSDNLQVRGSDYSHSEDLADFPPSPDRDRESIV,mutated_sequence,1.0,2015.0,NP_000326.2.a2m,NP_000326.2.npy,ClinVar
+NP_000333.1,NP_000333.1.csv,MEELQDDYEDMMEENLEQEEYEDPDIPESQMEEPAAHDTEATATDYHTTSHPGTHKVYVELQELVMDEKNQELRWMEAARWVQLEENLGENGAWGRPHLSHLTFWSLLELRRVFTKGTVLLDLQETSLAGVANQLLDRFIFEDQIRPQDREELLRALLLKHSHAGELEALGGVKPAVLTRSGDPSQPLLPQHSSLETQLFCEQGDGGTEGHSPSGILEKIPPDSEATLVLVGRADFLEQPVLGFVRLQEAAELEAVELPVPIRFLFVLLGPEAPHIDYTQLGRAAATLMSERVFRIDAYMAQSRGELLHSLEGFLDCSLVLPPTDAPSEQALLSLVPVQRELLRRRYQSSPAKPDSSFYKGLDLNGGPDDPLQQTGQLFGGLVRDIRRRYPYYLSDITDAFSPQVLAAVIFIYFAALSPAITFGGLLGEKTRNQMGVSELLISTAVQGILFALLGAQPLLVVGFSGPLLVFEEAFFSFCETNGLEYIVGRVWIGFWLILLVVLVVAFEGSFLVRFISRYTQEIFSFLISLIFIYETFSKLIKIFQDHPLQKTYNYNVLMVPKPQGPLPNTALLSLVLMAGTFFFAMMLRKFKNSSYFPGKLRRVIGDFGVPISILIMVLVDFFIQDTYTQKLSVPDGFKVSNSSARGWVIHPLGLRSEFPIWMMFASALPALLVFILIFLESQITTLIVSKPERKMVKGSGFHLDLLLVVGMGGVAALFGMPWLSATTVRSVTHANALTVMGKASTPGAAAQIQEVKEQRISGLLVAVLVGLSILMEPILSRIPLAVLFGIFLYMGVTSLSGIQLFDRILLLFKPPKYHPDVPYVKRVKTWRMHLFTGIQIICLAVLWVVKSTPASLALPFVLILTVPLRRVLLPLIFRNVELQCLDADDAKATFDEEEGRDEYDEVAMPV,mutated_sequence,1.0,911.0,NP_000333.1.a2m,NP_000333.1.npy,ClinVar
+NP_000339.2,NP_000339.2.csv,MQVQCQQSPVLAGSATLVALGALALYVAKPSGYGKHTESLKPAATRLPARAAWFLQELPSFAVPAGILARQPLSLFGPPGTVLLGLFCLHYFHRTFVYSLLNRGRPYPAILILRGTAFCTGNGVLQGYYLIYCAEYPDGWYTDIRFSLGVFLFILGMGINIHSDYILRQLRKPGEISYRIPQGGLFTYVSGANFLGEIIEWIGYALATWSLPALAFAFFSLCFLGLRAFHHHRFYLKMFEDYPKSRKALIPFIF,mutated_sequence,1.0,254.0,NP_000339.2.a2m,NP_000339.2.npy,ClinVar
+NP_000341.2,NP_000341.2.csv,MGFVRQIQLLLWKNWTLRKRQKIRFVVELVWPLSLFLVLIWLRNANPLYSHHECHFPNKAMPSAGMLPWLQGIFCNVNNPCFQSPTPGESPGIVSNYNNSILARVYRDFQELLMNAPESQHLGRIWTELHILSQFMDTLRTHPERIAGRGIRIRDILKDEETLTLFLIKNIGLSDSVVYLLINSQVRPEQFAHGVPDLALKDIACSEALLERFIIFSQRRGAKTVRYALCSLSQGTLQWIEDTLYANVDFFKLFRVLPTLLDSRSQGINLRSWGGILSDMSPRIQEFIHRPSMQDLLWVTRPLMQNGGPETFTKLMGILSDLLCGYPEGGGSRVLSFNWYEDNNYKAFLGIDSTRKDPIYSYDRRTTSFCNALIQSLESNPLTKIAWRAAKPLLMGKILYTPDSPAARRILKNANSTFEELEHVRKLVKAWEEVGPQIWYFFDNSTQMNMIRDTLGNPTVKDFLNRQLGEEGITAEAILNFLYKGPRESQADDMANFDWRDIFNITDRTLRLVNQYLECLVLDKFESYNDETQLTQRALSLLEENMFWAGVVFPDMYPWTSSLPPHVKYKIRMDIDVVEKTNKIKDRYWDSGPRADPVEDFRYIWGGFAYLQDMVEQGITRSQVQAEAPVGIYLQQMPYPCFVDDSFMIILNRCFPIFMVLAWIYSVSMTVKSIVLEKELRLKETLKNQGVSNAVIWCTWFLDSFSIMSMSIFLLTIFIMHGRILHYSDPFILFLFLLAFSTATIMLCFLLSTFFSKASLAAACSGVIYFTLYLPHILCFAWQDRMTAELKKAVSLLSPVAFGFGTEYLVRFEEQGLGLQWSNIGNSPTEGDEFSFLLSMQMMLLDAAVYGLLAWYLDQVFPGDYGTPLPWYFLLQESYWLGGEGCSTREERALEKTEPLTEETEDPEHPEGIHDSFFEREHPGWVPGVCVKNLVKIFEPCGRPAVDRLNITFYENQITAFLGHNGAGKTTTLSILTGLLPPTSGTVLVGGRDIETSLDAVRQSLGMCPQHNILFHHLTVAEHMLFYAQLKGKSQEEAQLEMEAMLEDTGLHHKRNEEAQDLSGGMQRKLSVAIAFVGDAKVVILDEPTSGVDPYSRRSIWDLLLKYRSGRTIIMSTHHMDEADLLGDRIAIIAQGRLYCSGTPLFLKNCFGTGLYLTLVRKMKNIQSQRKGSEGTCSCSSKGFSTTCPAHVDDLTPEQVLDGDVNELMDVVLHHVPEAKLVECIGQELIFLLPNKNFKHRAYASLFRELEETLADLGLSSFGISDTPLEEIFLKVTEDSDSGPLFAGGAQQKRENVNPRHPCLGPREKAGQTPQDSNVCSPGAPAAHPEGQPPPEPECPGPQLNTGTQLVLQHVQALLVKRFQHTIRSHKDFLAQIVLPATFVFLALMLSIVIPPFGEYPALTLHPWIYGQQYTFFSMDEPGSEQFTVLADVLLNKPGFGNRCLKEGWLPEYPCGNSTPWKTPSVSPNITQLFQKQKWTQVNPSPSCRCSTREKLTMLPECPEGAGGLPPPQRTQRSTEILQDLTDRNISDFLVKTYPALIRSSLKSKFWVNEQRYGGISIGGKLPVVPITGEALVGFLSDLGRIMNVSGGPITREASKEIPDFLKHLETEDNIKVWFNNKGWHALVSFLNVAHNAILRASLPKDRSPEEYGITVISQPLNLTKEQLSEITVLTTSVDAVVAICVIFSMSFVPASFVLYLIQERVNKSKHLQFISGVSPTTYWVTNFLWDIMNYSVSAGLVVGIFIGFQKKAYTSPENLPALVALLLLYGWAVIPMMYPASFLFDVPSTAYVALSCANLFIGINSSAITFILELFENNRTLLRFNAVLRKLLIVFPHFCLGRGLIDLALSQAVTDVYARFGEEHSANPFHWDLIGKNLFAMVVEGVVYFLLTLLVQRHFFLSQWIAEPTKEPIVDEDDDVAEERQRIITGGNKTDILRLHELTKIYPGTSSPAVDRLCVGVRPGECFGLLGVNGAGKTTTFKMLTGDTTVTSGDATVAGKSILTNISEVHQNMGYCPQFDAIDELLTGREHLYLYARLRGVPAEEIEKVANWSIKSLGLTVYADCLAGTYSGGNKRKLSTAIALIGCPPLVLLDEPTTGMDPQARRMLWNVIVSIIREGRAVVLTSHSMEECEALCTRLAIMVKGAFRCMGTIQHLKSKFGDGYIVTMKIKSPKDDLLPDLNPVEQFFQGNFPGSVQRERHYNMLQFQVSSSSLARIFQLLLSHKDSLLIEEYSVTQTTLDQVFVNFAKQQTESHDLPLHPRAAGASRQAQD,mutated_sequence,1.0,2273.0,NP_000341.2.a2m,NP_000341.2.npy,ClinVar
+NP_000349.1,NP_000349.1.csv,MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQHGPNVCAVQKVIGTNRKYFTNCKQWYQRKICGKSTVISYECCPGYEKVPGEKGCPAALPLSNLYETLGVVGSTTTQLYTDRTEKLRPEMEGPGSFTIFAPSNEAWASLPAEVLDSLVSNVNIELLNALRYHMVGRRVLTDELKHGMTLTSMYQNSNIQIHHYPNGIVTVNCARLLKADHHATNGVVHLIDKVISTITNNIQQIIEIEDTFETLRAAVAASGLNTMLEGNGQYTLLAPTNEAFEKIPSETLNRILGDPEALRDLLNNHILKSAMCAEAIVAGLSVETLEGTTLEVGCSGDMLTINGKAIISNKDILATNGVIHYIDELLIPDSAKTLFELAAESDVSTAIDLFRQAGLGNHLSGSERLTLLAPLNSVFKDGTPPIDAHTRNLLRNHIIKDQLASKYLYHGQTLETLGGKKLRVFVYRNSLCIENSCIAAHDKRGRYGTLFTMDRVLTPPMGTVMDVLKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRERSRLLGDAKELANILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVAEPDIMATNGVVHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRLAPVYQKLLERMKH,mutated_sequence,1.0,683.0,NP_000349.1.a2m,NP_000349.1.npy,ClinVar
+NP_000350.1,NP_000350.1.csv,MMDGPRSDVGRWGGNPLQPPTTPSPEPEPEPDGRSRRGGGRSFWARCCGCCSCRNAADDDWGPEPSDSRGRGSSSGTRRPGSRGSDSRRPVSRGSGVNAAGDGTIREGMLVVNGVDLLSSRSDQNRREHHTDEYEYDELIVRRGQPFHMLLLLSRTYESSDRITLELLIGNNPEVGKGTHVIIPVGKGGSGGWKAQVVKASGQNLNLRVHTSPNAIIGKFQFTVRTQSDAGEFQLPFDPRNEIYILFNPWCPEDIVYVDHEDWRQEYVLNESGRIYYGTEAQIGERTWNYGQFDHGVLDACLYILDRRGMPYGGRGDPVNVSRVISAMVNSLDDNGVLIGNWSGDYSRGTNPSAWVGSVEILLSYLRTGYSVPYGQCWVFAGVTTTVLRCLGLATRTVTNFNSAHDTDTSLTMDIYFDENMKPLEHLNHDSVWNFHVWNDCWMKRPDLPSGFDGWQVVDATPQETSSGIFCCGPCSVESIKNGLVYMKYDTPFIFAEVNSDKVYWQRQDDGSFKIVYVEEKAIGTLIVTKAISSNMREDITYLYKHPEGSDAERKAVETAAAHGSKPNVYANRGSAEDVAMQVEAQDAVMGQDLMVSVMLINHSSSRRTVKLHLYLSVTFYTGVSGTIFKETKKEVELAPGASDRVTMPVAYKEYRPHLVDQGAMLLNVSGHVKESGQVLAKQHTFRLRTPDLSLTLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSGLQRPKILNVGDIGGNETVTLRQSFVPVRPGPRQLIASLDSPQLSQVHGVIQVDVAPAPGDGGFFSDAGGDSHLGETIPMASRGGA,mutated_sequence,1.0,817.0,NP_000350.1.a2m,NP_000350.1.npy,ClinVar
+NP_000351.2,NP_000351.2.csv,MPTPDATTPQAKGFRRAVSELDAKQAEAIMSPRFIGRRQSLIEDARKEREAAVAAAAAAVPSEPGDPLEAVAFEEKEGKAVLNLLFSPRATKPSALSRAVKVFETFEAKIHHLETRPAQRPRAGGPHLEYFVRLEVRRGDLAALLSGVRQVSEDVRSPAGPKVPWFPRKVSELDKCHHLVTKFDPDLDLDHPGFSDQVYRQRRKLIAEIAFQYRHGDPIPRVEYTAEEIATWKEVYTTLKGLYATHACGEHLEAFALLERFSGYREDNIPQLEDVSRFLKERTGFQLRPVAGLLSARDFLASLAFRVFQCTQYIRHASSPMHSPEPDCCHELLGHVPMLADRTFAQFSQDIGLASLGASDEEIEKLSTLYWFTVEFGLCKQNGEVKAYGAGLLSSYGELLHCLSEEPEIRAFDPEAAAVQPYQDQTYQSVYFVSESFSDAKDKLRSYASRIQRPFSVKFDPYTLAIDVLDSPQAVRRSLEGVQDELDTLAHALSAIG,mutated_sequence,1.0,497.0,NP_000351.2.a2m,NP_000351.2.npy,ClinVar
+NP_000354.4,NP_000354.4.csv,MADGSSDAAREPRPAPAPIRRRSSNYRAYATEPHAKKKSKISASRKLQLKTLLLQIAKQELEREAEERRGEKGRALSTRCQPLELAGLGFAELQDLCRQLHARVDKVDEERYDIEAKVTKNITEIADLTQKIFDLRGKFKRPTLRRVRISADAMMQALLGARAKESLDLRAHLKQVKKEDTEKENREVGDWRKNIDALSGMEGRKKKFES,mutated_sequence,1.0,210.0,NP_000354.4.a2m,NP_000354.4.npy,ClinVar
+NP_000359.1,NP_000359.1.csv,MAQQANVGELLAMLDSPMLGVRDDVTAVFKENLNSDRGPMLVNTLVDYYLETSSQPALHILTTLQEPHDKHLLDRINEYVGKAATRLSILSLLGHVIRLQPSWKHKLSQAPLLPSLLKCLKMDTDVVVLTTGVLVLITMLPMIPQSGKQHLLDFFDIFGRLSSWCLKKPGHVAEVYLVHLHASVYALFHRLYGMYPCNFVSFLRSHYSMKENLETFEEVVKPMMEHVRIHPELVTGSKDHELDPRRWKRLETHDVVIECAKISLDPTEASYEDGYSVSHQISARFPHRSADVTTSPYADTQNSYGCATSTPYSTSRLMLLNMPGQLPQTLSSPSTRLITEPPQATLWSPSMVCGMTTPPTSPGNVPPDLSHPYSKVFGTTAGGKGTPLGTPATSPPPAPLCHSDDYVHISLPQATVTPPRKEERMDSARPCLHRQHHLLNDRGSEEPPGSKGSVTLSDLPGFLGDLASEEDSIEKDKEEAAISRELSEITTAEAEPVVPRGGFDSPFYRDSLPGSQRKTHSAASSSQGASVNPEPLHSSLDKLGPDTPKQAFTPIDLPCGSADESPAGDRECQTSLETSIFTPSPCKIPPPTRVGFGSGQPPPYDHLFEVALPKTAHHFVIRKTEELLKKAKGNTEEDGVPSTSPMEVLDRLIQQGADAHSKELNKLPLPSKSVDWTHFGGSPPSDEIRTLRDQLLLLHNQLLYERFKRQQHALRNRRLLRKVIKAAALEEHNAAMKDQLKLQEKDIQMWKVSLQKEQARYNQLQEQRDTMVTKLHSQIRQLQHDREEFYNQSQELQTKLEDCRNMIAELRIELKKANNKVCHTELLLSQVSQKLSNSESVQQQMEFLNRQLLVLGEVNELYLEQLQNKHSDTTKEVEMMKAAYRKELEKNRSHVLQQTQRLDTSQKRILELESHLAKKDHLLLEQKKYLEDVKLQARGQLQAAESRYEAQKRITQVFELEILDLYGRLEKDGLLKKLEEEKAEAAEAAEERLDCCNDGCSDSMVGHNEEASGHNGETKTPRPSSARGSSGSRGGGGSSSSSSELSTPEKPPHQRAGPFSSRWETTMGEASASIPTTVGSLPSSKSFLGMKARELFRNKSESQCDEDGMTSSLSESLKTELGKDLGVEAKIPLNLDGPHPSPPTPDSVGQLHIMDYNETHHEHS,mutated_sequence,1.0,1164.0,NP_000359.1.a2m,NP_000359.1.npy,ClinVar
+NP_000362.1,NP_000362.1.csv,MASHRLLLLCLAGLVFVSEAGPTGTGESKCPLMVKVLDAVRGSPAINVAVHVFRKAADDTWEPFASGKTSESGELHGLTTEEEFVEGIYKVEIDTKSYWKALGISPFHEHAEVVFTANDSGPRRYTIAALLSPYSYSTTAVVTNPKE,mutated_sequence,1.0,147.0,NP_000362.1.a2m,NP_000362.1.npy,ClinVar
+NP_000363.1,NP_000363.1.csv,MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRSPCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGNCKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMKNGSTPMFNDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRWEQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVCSRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDKAANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAFVDSIFEQWLRRHRPLQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYSYLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLPEEKQPLLMEKEDYHSLYQSHL,mutated_sequence,1.0,529.0,NP_000363.1.a2m,NP_000363.1.npy,ClinVar
+NP_000371.1,NP_000371.1.csv,MAAADGALPEAAALEQPAELPASVRASIERKRQRALMLRQARLAARPYSATAAAATGGMANVKAAPKIIDTGGGFILEEEEEEEQKIGKVVHQPGPVMEFDYVICEECGKEFMDSYLMNHFDLPTCDNCRDADDKHKLITKTEAKQEYLLKDCDLEKREPPLKFIVKKNPHHSQWGDMKLYLKLQIVKRSLEVWGSQEALEEAKEVRQENREKMKQKKFDKKVKELRRAVRSSVWKRETIVHQHEYGPEENLEDDMYRKTCTMCGHELTYEKM,mutated_sequence,1.0,273.0,NP_000371.1.a2m,NP_000371.1.npy,ClinVar
+NP_000372.1,NP_000372.1.csv,METLESELTCPICLELFEDPLLLPCAHSLCFNCAHRILVSHCATNESVESITAFQCPTCRHVITLSQRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSETRRERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIEPIPDSHIRGLMCLEHEDEKVNMYCVTDDQLICALCKLVGRHRDHQVAALSERYDKLKQNLESNLTNLIKRNTELETLLAKLIQTCQHVEVNASRQEAKLTEECDLLIEIIQQRRQIIGTKIKEGKVMRLRKLAQQIANCKQCIERSASLISQAEHSLKENDHARFLQTAKNITERVSMATASSQVLIPEINLNDTFDTFALDFSREKKLLECLDYLTAPNPPTIREELCTASYDTITVHWTSDDEFSVVSYELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTVHGLQSGTKYIFMVKAINQAGSRSSEPGKLKTNSQPFKLDPKSAHRKLKVSHDNLTVERDESSSKKSHTPERFTSQGSYGVAGNVFIDSGRHYWEVVISGSTWYAIGLAYKSAPKHEWIGKNSASWALCRCNNNWVVRHNSKEIPIEPAPHLRRVGILLDYDNGSIAFYDALNSIHLYTFDVAFAQPVCPTFTVWNKCLTIITGLPIPDHLDCTEQLP,mutated_sequence,1.0,667.0,NP_000372.1.a2m,NP_000372.1.npy,ClinVar
+NP_000373.1,NP_000373.1.csv,MELEVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVYSQEVITVLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVLTIQPLIGAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQRFDHIFYTGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNCGQTCIAPDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEGQKIAFGGETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREKPLALYVFSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVGSSGMGAYHGKHSFDTFSHQRPCLLKSLKREGANKLRYPPNSQSKVDWGKFFLLKRFNKEKLGLLLLTFLGIVAAVLVKAEYY,mutated_sequence,1.0,485.0,NP_000373.1.a2m,NP_000373.1.npy,ClinVar
+NP_000375.3,NP_000375.3.csv,MDPPRPALLALLALPALLLLLLAGARAEEEMLENVSLVCPKDATRFKHLRKYTYNYEAESSSGVPGTADSRSATRINCKVELEVPQLCSFILKTSQCTLKEVYGFNPEGKALLKKTKNSEEFAAAMSRYELKLAIPEGKQVFLYPEKDEPTYILNIKRGIISALLVPPETEEAKQVLFLDTVYGNCSTHFTVKTRKGNVATEISTERDLGQCDRFKPIRTGISPLALIKGMTRPLSTLISSSQSCQYTLDAKRKHVAEAICKEQHLFLPFSYKNKYGMVAQVTQTLKLEDTPKINSRFFGEGTKKMGLAFESTKSTSPPKQAEAVLKTLQELKKLTISEQNIQRANLFNKLVTELRGLSDEAVTSLLPQLIEVSSPITLQALVQCGQPQCSTHILQWLKRVHANPLLIDVVTYLVALIPEPSAQQLREIFNMARDQRSRATLYALSHAVNNYHKTNPTGTQELLDIANYLMEQIQDDCTGDEDYTYLILRVIGNMGQTMEQLTPELKSSILKCVQSTKPSLMIQKAAIQALRKMEPKDKDQEVLLQTFLDDASPGDKRLAAYLMLMRSPSQADINKIVQILPWEQNEQVKNFVASHIANILNSEELDIQDLKKLVKEALKESQLPTVMDFRKFSRNYQLYKSVSLPSLDPASAKIEGNLIFDPNNYLPKESMLKTTLTAFGFASADLIEIGLEGKGFEPTLEALFGKQGFFPDSVNKALYWVNGQVPDGVSKVLVDHFGYTKDDKHEQDMVNGIMLSVEKLIKDLKSKEVPEARAYLRILGEELGFASLHDLQLLGKLLLMGARTLQGIPQMIGEVIRKGSKNDFFLHYIFMENAFELPTGAGLQLQISSSGVIAPGAKAGVKLEVANMQAELVAKPSVSVEFVTNMGIIIPDFARSGVQMNTNFFHESGLEAHVALKAGKLKFIIPSPKRPVKLLSGGNTLHLVSTTKTEVIPPLIENRQSWSVCKQVFPGLNYCTSGAYSNASSTDSASYYPLTGDTRLELELRPTGEIEQYSVSATYELQREDRALVDTLKFVTQAEGAKQTEATMTFKYNRQSMTLSSEVQIPDFDVDLGTILRVNDESTEGKTSYRLTLDIQNKKITEVALMGHLSCDTKEERKIKGVISIPRLQAEARSEILAHWSPAKLLLQMDSSATAYGSTVSKRVAWHYDEEKIEFEWNTGTNVDTKKMTSNFPVDLSDYPKSLHMYANRLLDHRVPQTDMTFRHVGSKLIVAMSSWLQKASGSLPYTQTLQDHLNSLKEFNLQNMGLPDFHIPENLFLKSDGRVKYTLNKNSLKIEIPLPFGGKSSRDLKMLETVRTPALHFKSVGFHLPSREFQVPTFTIPKLYQLQVPLLGVLDLSTNVYSNLYNWSASYSGGNTSTDHFSLRARYHMKADSVVDLLSYNVQGSGETTYDHKNTFTLSCDGSLRHKFLDSNIKFSHVEKLGNNPVSKGLLIFDASSSWGPQMSASVHLDSKKKQHLFVKEVKIDGQFRVSSFYAKGTYGLSCQRDPNTGRLNGESNLRFNSSYLQGTNQITGRYEDGTLSLTSTSDLQSGIIKNTASLKYENYELTLKSDTNGKYKNFATSNKMDMTFSKQNALLRSEYQADYESLRFFSLLSGSLNSHGLELNADILGTDKINSGAHKATLRIGQDGISTSATTNLKCSLLVLENELNAELGLSGASMKLTTNGRFREHNAKFSLDGKAALTELSLGSAYQAMILGVDSKNIFNFKVSQEGLKLSNDMMGSYAEMKFDHTNSLNIAGLSLDFSSKLDNIYSSDKFYKQTVNLQLQPYSLVTTLNSDLKYNALDLTNNGKLRLEPLKLHVAGNLKGAYQNNEIKHIYAISSAALSASYKADTVAKVQGVEFSHRLNTDIAGLASAIDMSTNYNSDSLHFSNVFRSVMAPFTMTIDAHTNGNGKLALWGEHTGQLYSKFLLKAEPLAFTFSHDYKGSTSHHLVSRKSISAALEHKVSALLTPAEQTGTWKLKTQFNNNEYSQDLDAYNTKDKIGVELTGRTLADLTLLDSPIKVPLLLSEPINIIDALEMRDAVEKPQEFTIVAFVKYDKNQDVHSINLPFFETLQEYFERNRQTIIVVLENVQRNLKHINIDQFVRKYRAALGKLPQQANDYLNSFNWERQVSHAKEKLTALTKKYRITENDIQIALDDAKINFNEKLSQLQTYMIQFDQYIKDSYDLHDLKIAIANIIDEIIEKLKSLDEHYHIRVNLVKTIHDLHLFIENIDFNKSGSSTASWIQNVDTKYQIRIQIQEKLQQLKRHIQNIDIQHLAGKLKQHIEAIDVRVLLDQLGTTISFERINDILEHVKHFVINLIGDFEVAEKINAFRAKVHELIERYEVDQQIQVLMDKLVELAHQYKLKETIQKLSNVLQQVKIKDYFEKLVGFIDDAVKKLNELSFKTFIEDVNKFLDMLIKKLKSFDYHQFVDETNDKIREVTQRLNGEIQALELPQKAEALKLFLEETKATVAVYLESLQDTKITLIINWLQEALSSASLAHMKAKFRETLEDTRDRMYQMDIQQELQRYLSLVGQVYSTLVTYISDWWTLAAKNLTDFAEQYSIQDWAKRMKALVEQGFTVPEIKTILGTMPAFEVSLQALQKATFQTPDFIVPLTDLRIPSVQINFKDLKNIKIPSRFSTPEFTILNTFHIPSFTIDFVEMKVKIIRTIDQMLNSELQWPVPDIYLRDLKVEDIPLARITLPDFRLPEIAIPEFIIPTLNLNDFQVPDLHIPEFQLPHISHTIEVPTFGKLYSILKIQSPLFTLDANADIGNGTTSANEAGIAASITAKGESKLEVLNFDFQANAQLSNPKINPLALKESVKFSSKYLRTEHGSEMLFFGNAIEGKSNTVASLHTEKNTLELSNGVIVKINNQLTLDSNTKYFHKLNIPKLDFSSQADLRNEIKTLLKAGHIAWTSSGKGSWKWACPRFSDEGTHESQISFTIEGPLTSFGLSNKINSKHLRVNQNLVYESGSLNFSKLEIQSQVDSQHVGHSVLTAKGMALFGEGKAEFTGRHDAHLNGKVIGTLKNSLFFSAQPFEITASTNNEGNLKVRFPLRLTGKIDFLNNYALFLSPSAQQASWQVSARFNQYKYNQNFSAGNNENIMEAHVGINGEANLDFLNIPLTIPEMRLPYTIITTPPLKDFSLWEKTGLKEFLKTTKQSFDLSVKAQYKKNKHRHSITNPLAVLCEFISQSIKSFDRHFEKNRNNALDFVTKSYNETKIKFDKYKAEKSHDELPRTFQIPGYTVPVVNVEVSPFTIEMSAFGYVFPKAVSMPSFSILGSDVRVPSYTLILPSLELPVLHVPRNLKLSLPDFKELCTISHIFIPAMGNITYDFSFKSSVITLNTNAELFNQSDIVAHLLSSSSSVIDALQYKLEGTTRLTRKRGLKLATALSLSNKFVEGSHNSTVSLTTKNMEVSVATTTKAQIPILRMNFKQELNGNTKSKPTVSSSMEFKYDFNSSMLYSTAKGAVDHKLSLESLTSYFSIESSTKGDVKGSVLSREYSGTIASEANTYLNSKSTRSSVKLQGTSKIDDIWNLEVKENFAGEATLQRIYSLWEHSTKNHLQLEGLFFTNGEHTSKATLELSPWQMSALVQVHASQPSSFHDFPDLGQEVALNANTKNQKIRWKNEVRIHSGSFQSQVELSNDQEKAHLDIAGSLEGHLRFLKNIILPVYDKSLWDFLKLDVTTSIGRRQHLRVSTAFVYTKNPNGYSFSIPVKVLADKFIIPGLKLNDLNSVLVMPTFHVPFTDLQVPSCKLDFREIQIYKKLRTSSFALNLPTLPEVKFPEVDVLTKYSQPEDSLIPFFEITVPESQLTVSQFTLPKSVSDGIAALDLNAVANKIADFELPTIIVPEQTIEIPSIKFSVPAGIVIPSFQALTARFEVDSPVYNATWSASLKNKADYVETVLDSTCSSTVQFLEYELNVLGTHKIEDGTLASKTKGTFAHRDFSAEYEEDGKYEGLQEWEGKAHLNIKSPAFTDLHLRYQKDKKGISTSAASPAVGTVGMDMDEDDDFSKWNFYYSPQSSPDKKLTIFKTELRVRESDEETQIKVNWEEEAASGLLTSLKDNVPKATGVLYDYVNKYHWEHTGLTLREVSSKLRRNLQNNAEWVYQGAIRQIDDIDVRFQKAASGTTGTYQEWKDKAQNLYQELLTQEGQASFQGLKDNVFDGLVRVTQEFHMKVKHLIDSLIDFLNFPRFQFPGKPGIYTREELCTMFIREVGTVLSQVYSKVHNGSEILFSYFQDLVITLPFELRKHKLIDVISMYRELLKDLSKEAQEVFKAIQSLKTTEVLRNLQDLLQFIFQLIEDNIKQLKEMKFTYLINYIQDEINTIFSDYIPYVFKLLKENLCLNLHKFNEFIQNELQEASQELQQIHQYIMALREEYFDPSIVGWTVKYYELEEKIVSLIKNLLVALKDFHSEYIVSASNFTSQLSSQVEQFLHRNIQEYLSILTDPDGKGKEKIAELSATAQEIIKSQAIATKKIISDYHQQFRYKLQDFSDQLSDYYEKFIAESKRLIDLSIQNYHTFLIYITELLKKLQSTTVMNPYMKLAPGELTIIL,mutated_sequence,1.0,4563.0,NP_000375.3.a2m,NP_000375.3.npy,ClinVar
+NP_000381.1,NP_000381.1.csv,MADTLPSEFDVIVIGTGLPESIIAAACSRSGRRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENSDIVSDSPVWQDQILENEEAIALSRKDKTIQHVEVFCYASQDLHEDVEEAGALQKNHALVTSANSTEAADSAFLPTEDESLSTMSCEMLTEQTPSSDPENALEVNGAEVTGEKENHCDDKTCVPSTSAEDMSENVPIAEDTTEQPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGRVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCMEYEKYPDEYKGYEEITFYEYLKTQKLTPNLQYIVMHSIAMTSETASSTIDGLKATKNFLHCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAIIDQFGQRIISEHFLVEDSYFPENMCSRVQYRQISRAVLITDRSVLKTDSDQQISILTVPAEEPGTFAVRVIELCSSTMTCMKGTYLVHLTCTSSKTAREDLESVVQKLFVPYTEMEIENEQVEKPRILWALYFNMRDSSDISRSCYNDLPSNVYVCSGPDCGLGNDNAVKQAETLFQEICPNEDFCPPPPNPEDIILDGDSLQPEASESSAIPEANSETFKESTNLGNLEESSE,mutated_sequence,1.0,653.0,NP_000381.1.a2m,NP_000381.1.npy,ClinVar
+NP_000382.3,NP_000382.3.csv,MGLQACLLGLFALILSGKCSYSPEPDQRRTLPPGWVSLGRADPEEELSLTFALRQQNVERLSELVQAVSDPSSPQYGKYLTLENVADLVRPSPLTLHTVQKWLLAAGAQKCHSVITQDFLTCWLSIRQAELLLPGAEFHHYVGGPTETHVVRSPHPYQLPQALAPHVDFVGGLHRFPPTSSLRQRPEPQVTGTVGLHLGVTPSVIRKRYNLTSQDVGSGTSNNSQACAQFLEQYFHDSDLAQFMRLFGGNFAHQASVARVVGQQGRGRAGIEASLDVQYLMSAGANISTWVYSSPGRHEGQEPFLQWLMLLSNESALPHVHTVSYGDDEDSLSSAYIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGRHQFRPTFPASSPYVTTVGGTSFQEPFLITNEIVDYISGGGFSNVFPRPSYQEEAVTKFLSSSPHLPPSSYFNASGRAYPDVAALSDGYWVVSNRVPIPWVSGTSASTPVFGGILSLINEHRILSGRPPLGFLNPRLYQQHGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTLLNP,mutated_sequence,1.0,563.0,NP_000382.3.a2m,NP_000382.3.npy,ClinVar
+NP_000388.2,NP_000388.2.csv,MGNWAVNEGLSIFVILVWLGLNVFLFVWYYRVYDIPPKFFYTRKLLGSALALARAPAACLNFNCMLILLPVCRNLLSFLRGSSACCSTRVRRQLDRNLTFHKMVAWMIALHSAIHTIAHLFNVEWCVNARVNNSDPYSVALSELGDRQNESYLNFARKRIKNPEGGLYLAVTLLAGITGVVITLCLILIITSSTKTIRRSYFEVFWYTHHLFVIFFIGLAIHGAERIVRGQTAESLAVHNITVCEQKISEWGKIKECPIPQFAGNPPMTWKWIVGPMFLYLCERLVRFWRSQQKVVITKVVTHPFKTIELQMKKKGFKMEVGQYIFVKCPKVSKLEWHPFTLTSAPEEDFFSIHIRIVGDWTEGLFNACGCDKQEFQDAWKLPKIAVDGPFGTASEDVFSYEVVMLVGAGIGVTPFASILKSVWYKYCNNATNLKLKKIYFYWLCRDTHAFEWFADLLQLLESQMQERNNAGFLSYNIYLTGWDESQANHFAVHHDEEKDVITGLKQKTLYGRPNWDNEFKTIASQHPNTRIGVFLCGPEALAETLSKQSISNSESGPRGVHFIFNKENF,mutated_sequence,1.0,570.0,NP_000388.2.a2m,NP_000388.2.npy,ClinVar
+NP_000389.1,NP_000389.1.csv,MGAQLSTLGHMVLFPVWFLYSLLMKLFQRSTPAITLESPDIKYPLRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARIDGNLVVRPYTPISSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIRPDKKSNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQTEKDILLRPELEELRNKHSARFKLWYTLDRAPEAWDYGQGFVNEEMIRDHLPPPEEEPLVLMCGPPPMIQYACLPNLDHVGHPTERCFVF,mutated_sequence,1.0,301.0,NP_000389.1.a2m,NP_000389.1.npy,ClinVar
+NP_000391.1,NP_000391.1.csv,MKLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPSGTGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKLPFLGLALSSRKNLCIHPEVTPLRFGKDVDGKCHSLTASYVRAQYQHDTSLPHCRFYEEFDAHGREVPLPAGIYNLDDLKALGRRQGWCPYFLARYSILHANVVVYSYHYLLDPKIADLVSKELARKAVVVFDEAHNIDNVCIDSMSVNLTRRTLDRCQGNLETLQKTVLRIKETDEQRLRDEYRRLVEGLREASAARETDAHLANPVLPDEVLQEAVPGSIRTAEHFLGFLRRLLEYVKWRLRVQHVVQESPPAFLSGLAQRVCIQRKPLRFCAERLRSLLHTLEITDLADFSPLTLLANFATLVSTYAKGFTIIIEPFDDRTPTIANPILHFSCMDASLAIKPVFERFQSVIITSGTLSPLDIYPKILDFHPVTMATFTMTLARVCLCPMIIGRGNDQVAISSKFETREDIAVIRNYGNLLLEMSAVVPDGIVAFFTSYQYMESTVASWYEQGILENIQRNKLLFIETQDGAETSVALEKYQEACENGRGAILLSVARGKVSEGIDFVHHYGRAVIMFGVPYVYTQSRILKARLEYLRDQFQIRENDFLTFDAMRHAAQCVGRAIRGKTDYGLMVFADKRFARGDKRGKLPRWIQEHLTDANLNLTVDEGVQVAKYFLRQMAQPFHREDQLGLSLLSLEQLESEETLKRIEQIAQQL,mutated_sequence,1.0,760.0,NP_000391.1.a2m,NP_000391.1.npy,ClinVar
+NP_000395.3,NP_000395.3.csv,MPGFLVRILPLLLVLLLLGPTRGLRNATQRMFEIDYSRDSFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDILCPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGIPQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTDWTIFPLDTEDAVCSHLGGWGHRDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLPQDTFIQFPGWTKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWAPCSSDDPELCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV,mutated_sequence,1.0,677.0,NP_000395.3.a2m,NP_000395.3.npy,ClinVar
+NP_000396.2,NP_000396.2.csv,MQSLMQAPLLIALGLLLAAPAQAHLKKPSQLSSFSWDNCDEGKDPAVIRSLTLEPDPIIVPGNVTLSVMGSTSVPLSSPLKVDLVLEKEVAGLWIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFVVPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI,mutated_sequence,1.0,193.0,NP_000396.2.a2m,NP_000396.2.npy,ClinVar
+NP_000401.1,NP_000401.1.csv,MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE,mutated_sequence,1.0,348.0,NP_000401.1.a2m,NP_000401.1.npy,ClinVar
+NP_000410.2,NP_000410.2.csv,MARALCPLQALWLLEWVLLLLGPCAAPPAWALNLDPVQLTFYAGPNGSQFGFSLDFHKDSHGRVAIVVGAPRTLGPSQEETGGVFLCPWRAEGGQCPSLLFDLRDETRNVGSQTLQTFKARQGLGASVVSWSDVIVACAPWQHWNVLEKTEEAEKTPVGSCFLAQPESGRRAEYSPCRGNTLSRIYVENDFSWDKRYCEAGFSSVVTQAGELVLGAPGGYYFLGLLAQAPVADIFSSYRPGILLWHVSSQSLSFDSSNPEYFDGYWGYSVAVGEFDGDLNTTEYVVGAPTWSWTLGAVEILDSYYQRLHRLRGEQMASYFGHSVAVTDVNGDGRHDLLVGAPLYMESRADRKLAEVGRVYLFLQPRGPHALGAPSLLLTGTQLYGRFGSAIAPLGDLDRDGYNDIAVAAPYGGPSGRGQVLVFLGQSEGLRSRPSQVLDSPFPTGSAFGFSLRGAVDIDDNGYPDLIVGAYGANQVAVYRAQPVVKASVQLLVQDSLNPAVKSCVLPQTKTPVSCFNIQMCVGATGHNIPQKLSLNAELQLDRQKPRQGRRVLLLGSQQAGTTLNLDLGGKHSPICHTTMAFLRDEADFRDKLSPIVLSLNVSLPPTEAGMAPAVVLHGDTHVQEQTRIVLDCGEDDVCVPQLQLTASVTGSPLLVGADNVLELQMDAANEGEGAYEAELAVHLPQGAHYMRALSNVEGFERLICNQKKENETRVVLCELGNPMKKNAQIGIAMLVSVGNLEEAGESVSFQLQIRSKNSQNPNSKIVLLDVPVRAEAQVELRGNSFPASLVVAAEEGEREQNSLDSWGPKVEHTYELHNNGPGTVNGLHLSIHLPGQSQPSDLLYILDIQPQGGLQCFPQPPVNPLKVDWGLPIPSPSPIHPAHHKRDRRQIFLPEPEQPSRLQDPVLVSCDSAPCTVVQCDLQEMARGQRAMVTVLAFLWLPSLYQRPLDQFVLQSHAWFNVSSLPYAVPPLSLPRGEAQVWTQLLRALEERAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNRPPLEEDDEEGE,mutated_sequence,1.0,1039.0,NP_000410.2.a2m,NP_000410.2.npy,ClinVar
+NP_000413.1,NP_000413.1.csv,MTTSIRQFTSSSSIKGSSGLGGGSSRTSCRLSGGLGAGSCRLGSAGGLGSTLGGSSYSSCYSFGSGGGYGSSFGGVDGLLAGGEKATMQNLNDRLASYLDKVRALEEANTELEVKIRDWYQRQAPGPARDYSQYYRTIEELQNKILTATVDNANILLQIDNARLAADDFRTKFETEQALRLSVEADINGLRRVLDELTLARADLEMQIENLKEELAYLKKNHEEEMNALRGQVGGEINVEMDAAPGVDLSRILNEMRDQYEKMAEKNRKDAEDWFFSKTEELNREVATNSELVQSGKSEISELRRTMQALEIELQSQLSMKASLEGNLAETENRYCVQLSQIQGLIGSVEEQLAQLRCEMEQQNQEYKILLDVKTRLEQEIATYRRLLEGEDAHLTQYKKEPVTTRQVRTIVEEVQDGKVISSREQVHQTTR,mutated_sequence,1.0,432.0,NP_000413.1.a2m,NP_000413.1.npy,ClinVar
+NP_000417.3,NP_000417.3.csv,MPGAAGVLLLLLLSGGLGGVQAQRPQQQRQSQAHQQRGLFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPNQRHPITNAIDGKNTWWQSPSIKNGIEYHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDTECLTLYNIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARYIRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDPATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDENVARRNLSLNIRGKYIGGGVCINCTQNTAGINCETCTDGFFRPKGVSPNYPRPCQPCHCDPIGSLNEVCVKDEKHARRGLAPGSCHCKTGFGGVSCDRCARGYTGYPDCKACNCSGLGSKNEDPCFGPCICKENVEGGDCSRCKSGFFNLQEDNWKGCDECFCSGVSNRCQSSYWTYGKIQDMSGWYLTDLPGRIRVAPQQDDLDSPQQISISNAEARQALPHSYYWSAPAPYLGNKLPAVGGQLTFTISYDLEEEEEDTERVLQLMIILEGNDLSISTAQDEVYLHPSEEHTNVLLLKEESFTIHGTHFPVRRKEFMTVLANLKRVLLQITYSFGMDAIFRLSSVNLESAVSYPTDGSIAAAVEVCQCPPGYTGSSCESCWPRHRRVNGTIFGGICEPCQCFGHAESCDDVTGECLNCKDHTGGPYCDKCLPGFYGEPTKGTSEDCQPCACPLNIPSNNFSPTCHLDRSLGLICDGCPVGYTGPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTTGRYCELCADGYFGDAVDAKNCQPCRCNAGGSFSEVCHSQTGQCECRANVQGQRCDKCKAGTFGLQSARGCVPCNCNSFGSKSFDCEESGQCWCQPGVTGKKCDRCAHGYFNFQEGGCTACECSHLGNNCDPKTGRCICPPNTIGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNVNTGQCNCHPKFSGAKCTECSRGHWNYPRCNLCDCFLPGTDATTCDSETKKCSCSDQTGQCTCKVNVEGIHCDRCRPGKFGLDAKNPLGCSSCYCFGTTTQCSEAKGLIRTWVTLKAEQTILPLVDEALQHTTTKGIVFQHPEIVAHMDLMREDLHLEPFYWKLPEQFEGKKLMAYGGKLKYAIYFEAREETGFSTYNPQVIIRGGTPTHARIIVRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRVHRTVTREDFLDILYDIHYILIKATYGNFMRQSRISEISMEVAEQGRGTTMTPPADLIEKCDCPLGYSGLSCEACLPGFYRLRSQPGGRTPGPTLGTCVPCQCNGHSSLCDPETSICQNCQHHTAGDFCERCALGYYGIVKGLPNDCQQCACPLISSSNNFSPSCVAEGLDDYRCTACPRGYEGQYCERCAPGYTGSPGNPGGSCQECECDPYGSLPVPCDPVTGFCTCRPGATGRKCDGCKHWHAREGWECVFCGDECTGLLLGDLARLEQMVMSINLTGPLPAPYKMLYGLENMTQELKHLLSPQRAPERLIQLAEGNLNTLVTEMNELLTRATKVTADGEQTGQDAERTNTRAKSLGEFIKELARDAEAVNEKAIKLNETLGTRDEAFERNLEGLQKEIDQMIKELRRKNLETQKEIAEDELVAAEALLKKVKKLFGESRGENEEMEKDLREKLADYKNKVDDAWDLLREATDKIREANRLFAVNQKNMTALEKKKEAVESGKRQIENTLKEGNDILDEANRLADEINSIIDYVEDIQTKLPPMSEELNDKIDDLSQEIKDRKLAEKVSQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFKAYSNIKDYIDEAEKVAKEAKDLAHEATKLATGPRGLLKEDAKGCLQKSFRILNEAKKLANDVKENEDHLNGLKTRIENADARNGDLLRTLNDTLGKLSAIPNDTAAKLQAVKDKARQANDTAKDVLAQITELHQNLDGLKKNYNKLADSVAKTNAVVKDPSKNKIIADADATVKNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYKPEIKKGSYNNIVVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIVASRTGRNGTISVRALDGPKASIVPSTHHSTSPPGYTILDVDANAMLFVGGLTGKLKKADAVRVITFTGCMGETYFDNKPIGLWNFREKEGDCKGCTVSPQVEDSEGTIQFDGEGYALVSRPIRWYPNISTVMFKFRTFSSSALLMYLATRDLRDFMSVELTDGHIKVSYDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDTNQEENIATSSSGNNFGLDLKADDKIYFGGLPTLRNLSMKARPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVTKGCSLENVYTVSFPKPGFVELSPVPIDVGTEINLSFSTKNESGIILLGSGGTPAPPRRKRRQTGQAYYAILLNRGRLEVHLSTGARTMRKIVIRPEPNLFHDGREHSVHVERTRGIFTVQVDENRRYMQNLTVEQPIEVKKLFVGGAPPEFQPSPLRNIPPFEGCIWNLVINSVPMDFARPVSFKNADIGRCAHQKLREDEDGAAPAEIVIQPEPVPTPAFPTPTPVLTHGPCAAESEPALLIGSKQFGLSRNSHIAIAFDDTKVKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRNGLPYFSYDLGSGDTHTMIPTKINDGQWHKIKIMRSKQEGILYVDGASNRTISPKKADILDVVGMLYVGGLPINYTTRRIGPVTYSIDGCVRNLHMAEAPADLEQPTSSFHVGTCFANAQRGTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTTTTGVLLGISSQKMDGMGIEMIDEKLMFHVDNGAGRFTAVYDAGVPGHLCDGQWHKVTANKIKHRIELTVDGNQVEAQSPNPASTSADTNDPVFVGGFPDDLKQFGLTTSIPFRGCIRSLKLTKGTGKPLEVNFAKALELRGVQPVSCPAN,mutated_sequence,1.0,3122.0,NP_000417.3.a2m,NP_000417.3.npy,ClinVar
+NP_000423.2,NP_000423.2.csv,MAPKKAKKRAGGANSNVFSMFEQTQIQEFKEAFTIMDQNRDGFIDKNDLRDTFAALGRVNVKNEEIDEMIKEAPGPINFTVFLTMFGEKLKGADPEETILNAFKVFDPEGKGVLKADYVREMLTTQAERFSKEEVDQMFAAFPPDVTGNLDYKNLVHIITHGEEKD,mutated_sequence,1.0,166.0,NP_000423.2.a2m,NP_000423.2.npy,ClinVar
+NP_000424.2,NP_000424.2.csv,MSLVEAISLWNEGVLAADKKDWKGALDAFSAVQDPHSRICFNIGCMYTILKNMTEAEKAFTRSINRDKHLAVAYFQRGMLYYQTEKYDLAIKDLKEALIQLRGNQLIDYKILGLQFKLFACEVLYNIAFMYAKKEEWKKAEEQLALATSMKSEPRHSKIDKAMECVWKQKLYEPVVIPVGKLFRPNERQVAQLAKKDYLGKATVVASVVDQDSFSGFAPLQPQAAEPPPRPKTPEIFRALEGEAHRVLFGFVPETKEELQVMPGNIVFVLKKGNDNWATVMFNGQKGLVPCNYLEPVELRIHPQQQPQEESSPQSDIPAPPSSKAPGRPQLSPGQKQKEEPKEVKLSVPMPYTLKVHYKYTVVMKTQPGLPYSQVRDMVSKKLELRLEHTKLSYRPRDSNELVPLSEDSMKDAWGQVKNYCLTLWCENTVGDQGFPDEPKESEKADANNQTTEPQLKKGSQVEALFSYEATQPEDLEFQEGDIILVLSKVNEEWLEGECKGKVGIFPKVFVEDCATTDLESTRREV,mutated_sequence,1.0,526.0,NP_000424.2.a2m,NP_000424.2.npy,ClinVar
+NP_000430.3,NP_000430.3.csv,MERRAWSLQCTAFVLFCAWCALNSAKAKRQFVNEWAAEIPGGPEAASAIAEELGYDLLGQIGSLENHYLFKHKNHPRRSRRSAFHITKRLSDDDRVIWAEQQYEKERSKRSALRDSALNLFNDPMWNQQWYLQDTRMTAALPKLDLHVIPVWQKGITGKGVVITVLDDGLEWNHTDIYANYDPEASYDFNDNDHDPFPRYDPTNENKHGTRCAGEIAMQANNHKCGVGVAYNSKVGGIRMLDGIVTDAIEASSIGFNPGHVDIYSASWGPNDDGKTVEGPGRLAQKAFEYGVKQGRQGKGSIFVWASGNGGRQGDNCDCDGYTDSIYTISISSASQQGLSPWYAEKCSSTLATSYSSGDYTDQRITSADLHNDCTETHTGTSASAPLAAGIFALALEANPNLTWRDMQHLVVWTSEYDPLANNPGWKKNGAGLMVNSRFGFGLLNAKALVDLADPRTWRSVPEKKECVVKDNDFEPRALKANGEVIIEIPTRACEGQENAIKSLEHVQFEATIEYSRRGDLHVTLTSAAGTSTVLLAERERDTSPNGFKNWDFMSVHTWGENPIGTWTLRITDMSGRIQNEGRIVNWKLILHGTSSQPEHMKQPRVYTSYNTVQNDRRGVEKMVDPGEEQPTQENPKENTLVSKSPSSSSVGGRRDELEEGAPSQAMLRLLQSAFSKNSPPKQSPKKSPSAKLNIPYENFYEALEKLNKPSQLKDSEDSLYNDYVDVFYNTKPYKHRDDRLLQALVDILNEEN,mutated_sequence,1.0,753.0,NP_000430.3.a2m,NP_000430.3.npy,ClinVar
+NP_000432.1,NP_000432.1.csv,MAAPGGRSEPPQLPEYSCSYMVSRPVYSELAFQQQHERRLQERKTLRESLAKCCSCSRKRAFGVLKTLVPILEWLPKYRVKEWLLSDVISGVSTGLVATLQGMAYALLAAVPVGYGLYSAFFPILTYFIFGTSRHISVGPFPVVSLMVGSVVLSMAPDEHFLVSSSNGTVLNTTMIDTAARDTARVLIASALTLLVGIIQLIFGGLQIGFIVRYLADPLVGGFTTAAAFQVLVSQLKIVLNVSTKNYNGVLSIIYTLVEIFQNIGDTNLADFTAGLLTIVVCMAVKELNDRFRHKIPVPIPIEVIVTIIATAISYGANLEKNYNAGIVKSIPRGFLPPELPPVSLFSEMLAASFSIAVVAYAIAVSVGKVYATKYDYTIDGNQEFIAFGISNIFSGFFSCFVATTALSRTAVQESTGGKTQVAGIISAAIVMIAILALGKLLEPLQKSVLAAVVIANLKGMFMQLCDIPRLWRQNKIDAVIWVFTCIVSIILGLDLGLLAGLIFGLLTVVLRVQFPSWNGLGSIPSTDIYKSTKNYKNIEEPQGVKILRFSSPIFYGNVDGFKKCIKSTVGFDAIRVYNKRLKALRKIQKLIKSGQLRATKNGIISDAVSTNNAFEPDEDIEDLEELDIPTKEIEIQVDWNSELPVKVNVPKVPIHSLVLDCGAISFLDVVGVRSLRVIVKEFQRIDVNVYFASLQDYVIEKLEQCGFFDDNIRKDTFFLTVHDAILYLQNQVKSQEGQGSILETITLIQDCKDTLELIETELTEEELDVQDEAMRTLAS,mutated_sequence,1.0,780.0,NP_000432.1.a2m,NP_000432.1.npy,ClinVar
+NP_000435.3,NP_000435.3.csv,MEAETGSSVETGKKANRGTRIALVVFVGGTLVLGTILFLVSQGLLSLQAKQEYCLKPECIEAAAAILSKVNLSVDPCDNFFRFACDGWISNNPIPEDMPSYGVYPWLRHNVDLKLKELLEKSISRRRDTEAIQKAKILYSSCMNEKAIEKADAKPLLHILRHSPFRWPVLESNIGPEGVWSERKFSLLQTLATFRGQYSNSVFIRLYVSPDDKASNEHILKLDQATLSLAVREDYLDNSTEAKSYRDALYKFMVDTAVLLGANSSRAEHDMKSVLRLEIKIAEIMIPHENRTSEAMYNKMNISELSAMIPQFDWLGYIKKVIDTRLYPHLKDISPSENVVVRVPQYFKDLFRILGSERKKTIANYLVWRMVYSRIPNLSRRFQYRWLEFSRVIQGTTTLLPQWDKCVNFIESALPYVVGKMFVDVYFQEDKKEMMEELVEGVRWAFIDMLEKENEWMDAGTKRKAKEKARAVLAKVGYPEFIMNDTHVNEDLKAIKFSEADYFGNVLQTRKYLAQSDFFWLRKAVPKTEWFTNPTTVNAFYSASTNQIRFPAGELQKPFFWGTEYPRSLSYGAIGVIVGHEFTHGFDNNGRKYDKNGNLDPWWSTESEEKFKEKTKCMINQYSNYYWKKAGLNVKGKRTLGENIADNGGLREAFRAYRKWINDRRQGLEEPLLPGITFTNNQLFFLSYAHVRCNSYRPEAAREQVQIGAHSPPQFRVNGAISNFEEFQKAFNCPPNSTMNRGMDSCRLW,mutated_sequence,1.0,749.0,NP_000435.3.a2m,NP_000435.3.npy,ClinVar
+NP_000442.1,NP_000442.1.csv,MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDITEGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQLEGVAHAHPHLHPHLAAHAPYLMFPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSIADLRLKARKHAEALGL,mutated_sequence,1.0,292.0,NP_000442.1.a2m,NP_000442.1.npy,ClinVar
+NP_000446.1,NP_000446.1.csv,MEVVDPQQLGMFTEGELMSVGMDTFIHRIDSTEVIYQPRRKRAKLIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAVKILKKKKLRRIPNGEANVKKEIQLLRRLRHKNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFCQLIDGLEYLHSQGIVHKDIKPGNLLLTTGGTLKISDLGVAEALHPFAADDTCRTSQGSPAFQPPEIANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGKGSYAIPGDCGPPLSDLLKGMLEYEPAKRFSIRQIRQHSWFRKKHPPAEAPVPIPPSPDTKDRWRSMTVVPYLEDLHGADEDEDLFDIEDDIIYTQDFTVPGQVPEEEASHNGQRRGLPKAVCMNGTEAAQLSTKSRAEGRAPNPARKACSASSKIRRLSACKQQ,mutated_sequence,1.0,433.0,NP_000446.1.a2m,NP_000446.1.npy,ClinVar
+NP_000449.1,NP_000449.1.csv,MVSKLTSLQQELLSALLSSGVTKEVLVQALEELLPSPNFGVKLETLPLSPGSGAEPDTKPVFHTLTNGHAKGRLSGDEGSEDGDDYDTPPILKELQALNTEEAAEQRAEVDRMLSEDPWRAAKMIKGYMQQHNIPQREVVDVTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREILRQFNQTVQSSGNMTDKSSQDQLLFLFPEFSQQSHGPGQSDDACSEPTNKKMRRNRFKWGPASQQILYQAYDRQKNPSKEEREALVEECNRAECLQRGVSPSKAHGLGSNLVTEVRVYNWFANRRKEEAFRQKLAMDAYSSNQTHSLNPLLSHGSPHHQPSSSPPNKLSGVRYSQQGNNEITSSSTISHHGNSAMVTSQSVLQQVSPASLDPGHNLLSPDGKMISVSGGGLPPVSTLTNIHSLSHHNPQQSQNLIMTPLSGVMAIAQSLNTSQAQSVPVINSVAGSLAALQPVQFSQQLHSPHQQPLMQQSPGSHMAQQPFMAAVTQLQNSHMYAHKQEPPQYSHTSRFPSAMVVTDTSSISTLTNMSSSKQCPLQAW,mutated_sequence,1.0,557.0,NP_000449.1.a2m,NP_000449.1.npy,ClinVar
+NP_000454.1,NP_000454.1.csv,MAVESQGGRPLVLGLLLCVLGPVVSHAGKILLIPVDGSHWLSMLGAIQQLQQRGHEIVVLAPDASLYIRDGAFYTLKTYPVPFQREDVKESFVSLGHNVFENDSFLQRVIKTYKKIKKDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPCSLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDVVYSPYATLASEFLQREVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,mutated_sequence,1.0,533.0,NP_000454.1.a2m,NP_000454.1.npy,ClinVar
+NP_000457.1,NP_000457.1.csv,MWGSDRLAGAGGGGAAVTVAFTNARDCFLHLPRRLVAQLHLLQNQAIEVVWSHQPAFLSWVEGRHFSDQGENVAEINRQVGQKLGLSNGGQVFLKPCSHVVSCQQVEVEPLSADDWEILELHAVSLEQHLLDQIRIVFPKAIFPVWVDQQTYIFIQIVALIPAASYGRLETDTKLLIQPKTRRAKENTFSKADAEYKKLHSYGRDQKGMMKELQTKQLQSNTVGITESNENESEIPVDSSSVASLWTMIGSIFSFQSEKKQETSWGLTEINAFKNMQSKVVPLDNIFRVCKSQPPSIYNASATSVFHKHCAIHVFPWDQEYFDVEPSFTVTYGKLVKLLSPKQQQSKTKQNVLSPEKEKQMSEPLDQKKIRSDHNEEDEKACVLQVVWNGLEELNNAIKYTKNVEVLHLGKVWIPDDLRKRLNIEMHAVVRITPVEVTPKIPRSLKLQPRENLPKDISEEDIKTVFYSWLQQSTTTMLPLVISEEEFIKLETKDGLKEFSLSIVHSWEKEKDKNIFLLSPNLLQKTTIQVLLDPMVKEENSEEIDFILPFLKLSSLGGVNSLGVSSLEHITHSLLGRPLSRQLMSLVAGLRNGALLLTGGKGSGKSTLAKAICKEAFDKLDAHVERVDCKALRGKRLENIQKTLEVAFSEAVWMQPSVVLLDDLDLIAGLPAVPEHEHSPDAVQSQRLAHALNDMIKEFISMGSLVALIATSQSQQSLHPLLVSAQGVHIFQCVQHIQPPNQEQRCEILCNVIKNKLDCDINKFTDLDLQHVAKETGGFVARDFTVLVDRAIHSRLSRQSISTREKLVLTTLDFQKALRGFLPASLRSVNLHKPRDLGWDKIGGLHEVRQILMDTIQLPAKYPELFANLPIRQRTGILLYGPPGTGKTLLAGVIARESRMNFISVKGPELLSKYIGASEQAVRDIFIRAQAAKPCILFFDEFESIAPRRGHDNTGVTDRVVNQLLTQLDGVEGLQGVYVLAATSRPDLIDPALLRPGRLDKCVYCPPPDQVSRLEILNVLSDSLPLADDVDLQHVASVTDSFTGADLKALLYNAQLEALHGMLLSSGLQDGSSSSDSDLSLSSMVFLNHSSGSDDSAGDGECGLDQSLVSLEMSEILPDESKFNMYRLYFGSSYESELGNGTSSDLSSQCLSAPSSMTQDLPGVPGKDQLFSQPPVLRTASQEGCQELTQEQRDQLRADISIIKGRYRSQSGEDESMNQPGPIKTRLAISQSHLMTALGHTRPSISEDDWKNFAELYESFQNPKRRKNQSGTMFRPGQKVTLA,mutated_sequence,1.0,1283.0,NP_000457.1.a2m,NP_000457.1.npy,ClinVar
+NP_000465.1,NP_000465.1.csv,MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH,mutated_sequence,1.0,202.0,NP_000465.1.a2m,NP_000465.1.npy,ClinVar
+NP_000467.1,NP_000467.1.csv,MEEKLKKTKIIFVVGGPGSGKGTQCEKIVQKYGYTHLSTGDLLRSEVSSGSARGKKLSEIMEKGQLVPLETVLDMLRDAMVAKVNTSKGFLIDGYPREVQQGEEFERRIGQPTLLLYVDAGPETMTQRLLKRGETSGRVDDNEETIKKRLETYYKATEPVIAFYEKRGIVRKVNAEGSVDSVFSQVCTHLDALK,mutated_sequence,1.0,194.0,NP_000467.1.a2m,NP_000467.1.npy,ClinVar
+NP_000468.1,NP_000468.1.csv,MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLFGDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLECADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKEQLKAVMDDFAAFVEKCCKADDKETCFAEEGKKLVAASQAALGL,mutated_sequence,1.0,609.0,NP_000468.1.a2m,NP_000468.1.npy,ClinVar
+NP_000469.3,NP_000469.3.csv,MISPFLVLAIGTCLTNSLVPEKEKDPKYWRDQAQETLKYALELQKLNTNVAKNVIMFLGDGMGVSTVTAARILKGQLHHNPGEETRLEMDKFPFVALSKTYNTNAQVPDSAGTATAYLCGVKANEGTVGVSAATERSRCNTTQGNEVTSILRWAKDAGKSVGIVTTTRVNHATPSAAYAHSADRDWYSDNEMPPEALSQGCKDIAYQLMHNIRDIDVIMGGGRKYMYPKNKTDVEYESDEKARGTRLDGLDLVDTWKSFKPRYKHSHFIWNRTELLTLDPHNVDYLLGLFEPGDMQYELNRNNVTDPSLSEMVVVAIQILRKNPKGFFLLVEGGRIDHGHHEGKAKQALHEAVEMDRAIGQAGSLTSSEDTLTVVTADHSHVFTFGGYTPRGNSIFGLAPMLSDTDKKPFTAILYGNGPGYKVVGGERENVSMVDYAHNNYQAQSAVPLRHETHGGEDVAVFSKGPMAHLLHGVHEQNYVPHVMAYAACIGANLGHCAPASSAGSLAAGPLLLALALYPLSVLF,mutated_sequence,1.0,524.0,NP_000469.3.a2m,NP_000469.3.npy,ClinVar
+NP_000476.1,NP_000476.1.csv,MADSELQLVEQRIRSFPDFPTPGVVFRDISPVLKDPASFRAAIGLLARHLKATHGGRIDYIAGLDSRGFLFGPSLAQELGLGCVLIRKRGKLPGPTLWASYSLEYGKAELEIQKDALEPGQRVVVVDDLLATGGTMNAACELLGRLQAEVLECVSLVELTSLKGREKLAPVPFFSLLQYE,mutated_sequence,1.0,180.0,NP_000476.1.a2m,NP_000476.1.npy,ClinVar
+NP_000478.3,NP_000478.3.csv,MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA,mutated_sequence,1.0,509.0,NP_000478.3.a2m,NP_000478.3.npy,ClinVar
+NP_000479.1,NP_000479.1.csv,MYSNVIGTVTSGKRKVYLLSLLLIGFWDCVTCHGSPVDICTAKPRDIPMNPMCIYRSPEKKATEDEGSEQKIPEATNRRVWELSKANSRFATTFYQHLADSKNDNDNIFLSPLSISTAFAMTKLGACNDTLQQLMEVFKFDTISEKTSDQIHFFFAKLNCRLYRKANKSSKLVSANRLFGDKSLTFNETYQDISELVYGAKLQPLDFKENAEQSRAAINKWVSNKTEGRITDVIPSEAINELTVLVLVNTIYFKGLWKSKFSPENTRKELFYKADGESCSASMMYQEGKFRYRRVAEGTQVLELPFKGDDITMVLILPKPEKSLAKVEKELTPEVLQEWLDELEEMMLVVHMPRFRIEDGFSLKEQLQDMGLVDLFSPEKSKLPGIVAEGRDDLYVSDAFHKAFLEVNEEGSEAAASTAVVIAGRSLNPNRVTFKANRPFLVFIREVPLNTIIFMGRVANPCVK,mutated_sequence,1.0,464.0,NP_000479.1.a2m,NP_000479.1.npy,ClinVar
+NP_000480.3,NP_000480.3.csv,MTAEPMSESKLNTLVQKLHDFLAHSSEESEETSSPPRLAMNQNTDKISGSGSNSDMMENSKEEGTSSSEKSKSSGSSRSKRKPSIVTKYVESDDEKPLDDETVNEDASNENSENDITMQSLPKGTVIVQPEPVLNEDKDDFKGPEFRSRSKMKTENLKKRGEDGLHGIVSCTACGQQVNHFQKDSIYRHPSLQVLICKNCFKYYMSDDISRDSDGMDEQCRWCAEGGNLICCDFCHNAFCKKCILRNLGRKELSTIMDENNQWYCYICHPEPLLDLVTACNSVFENLEQLLQQNKKKIKVDSEKSNKVYEHTSRFSPKKTSSNCNGEEKKLDDSCSGSVTYSYSALIVPKEMIKKAKKLIETTANMNSSYVKFLKQATDNSEISSATKLRQLKAFKSVLADIKKAHLALEEDLNSEFRAMDAVNKEKNTKEHKVIDAKFETKARKGEKPCALEKKDISKSEAKLSRKQVDSEHMHQNVPTEEQRTNKSTGGEHKKSDRKEEPQYEPANTSEDLDMDIVSVPSSVPEDIFENLETAMEVQSSVDHQGDGSSGTEQEVESSSVKLNISSKDNRGGIKSKTTAKVTKELYVKLTPVSLSNSPIKGADCQEVPQDKDGYKSCGLNPKLEKCGLGQENSDNEHLVENEVSLLLEESDLRRSPRVKTTPLRRPTETNPVTSNSDEECNETVKEKQKLSVPVRKKDKRNSSDSAIDNPKPNKLPKSKQSETVDQNSDSDEMLAILKEVSRMSHSSSSDTDINEIHTNHKTLYDLKTQAGKDDKGKRKRKSSTSGSDFDTKKGKSAKSSIISKKKRQTQSESSNYDSELEKEIKSMSKIGAARTTKKRIPNTKDFDSSEDEKHSKKGMDNQGHKNLKTSQEGSSDDAERKQERETFSSAEGTVDKDTTIMELRDRLPKKQQASASTDGVDKLSGKEESFTSLEVRKVAETKEKSKHLKTKTCKKVQDGLSDIAEKFLKKDQSDETSEDDKKQSKKGTEEKKKPSDFKKKVIKMEQQYESSSDGTEKLPEREEICHFPKGIKQIKNGTTDGEKKSKKIRDKTSKKKDELSDYAEKSTGKGDSCDSSEDKKSKNGAYGREKKRCKLLGKSSRKRQDCSSSDTEKYSMKEDGCNSSDKRLKRIELRERRNLSSKRNTKEIQSGSSSSDAEESSEDNKKKKQRTSSKKKAVIVKEKKRNSLRTSTKRKQADITSSSSSDIEDDDQNSIGEGSSDEQKIKPVTENLVLSSHTGFCQSSGDEALSKSVPVTVDDDDDDNDPENRIAKKMLLEEIKANLSSDEDGSSDDEPEEGKKRTGKQNEENPGDEEAKNQVNSESDSDSEESKKPRYRHRLLRHKLTVSDGESGEEKKTKPKEHKEVKGRNRRKVSSEDSEDSDFQESGVSEEVSESEDEQRPRTRSAKKAELEENQRSYKQKKKRRRIKVQEDSSSENKSNSEEEEEEKEEEEEEEEEEEEEEEDENDDSKSPGKGRKKIRKILKDDKLRTETQNALKEEEERRKRIAEREREREKLREVIEIEDASPTKCPITTKLVLDEDEETKEPLVQVHRNMVIKLKPHQVDGVQFMWDCCCESVKKTKKSPGSGCILAHCMGLGKTLQVVSFLHTVLLCDKLDFSTALVVCPLNTALNWMNEFEKWQEGLKDDEKLEVSELATVKRPQERSYMLQRWQEDGGVMIIGYEMYRNLAQGRNVKSRKLKEIFNKALVDPGPDFVVCDEGHILKNEASAVSKAMNSIRSRRRIILTGTPLQNNLIEYHCMVNFIKENLLGSIKEFRNRFINPIQNGQCADSTMVDVRVMKKRAHILYEMLAGCVQRKDYTALTKFLPPKHEYVLAVRMTSIQCKLYQYYLDHLTGVGNNSEGGRGKAGAKLFQDFQMLSRIWTHPWCLQLDYISKENKGYFDEDSMDEFIASDSDETSMSLSSDDYTKKKKKGKKGKKDSSSSGSGSDNDVEVIKVWNSRSRGGGEGNVDETGNNPSVSLKLEESKATSSSNPSSPAPDWYKDFVTDADAEVLEHSGKMVLLFEILRMAEEIGDKVLVFSQSLISLDLIEDFLELASREKTEDKDKPLIYKGEGKWLRNIDYYRLDGSTTAQSRKKWAEEFNDETNVRGRLFIISTKAGSLGINLVAANRVIIFDASWNPSYDIQSIFRVYRFGQTKPVYVYRFLAQGTMEDKIYDRQVTKQSLSFRVVDQQQVERHFTMNELTELYTFEPDLLDDPNSEKKKKRDTPMLPKDTILAELLQIHKEHIVGYHEHDSLLDHKEEEELTEEERKAAWAEYEAEKKGLTMRFNIPTGTNLPPVSFNSQTPYIPFNLGALSAMSNQQLEDLINQGREKVVEATNSVTAVRIQPLEDIISAVWKENMNLSEAQVQALALSRQASQELDVKRREAIYNDVLTKQQMLISCVQRILMNRRLQQQYNQQQQQQMTYQQATLGHLMMPKPPNLIMNPSNYQQIDMRGMYQPVAGGMQPPPLQRAPPPMRSKNPGPSQGKSM,mutated_sequence,1.0,2492.0,NP_000480.3.a2m,NP_000480.3.npy,ClinVar
+NP_000481.2,NP_000481.2.csv,MPDTMLPACFLGLLAFSSACYFQNCPRGGKRAMSDLELRQCLPCGPGGKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKACGSGGRCAAFGVCCNDESCVTEPECREGFHRRARASDRSNATQLDGPAGALLLRLVQLAGAPEPFEPAQPDAY,mutated_sequence,1.0,164.0,NP_000481.2.a2m,NP_000481.2.npy,ClinVar
+NP_000483.3,NP_000483.3.csv,MQRSPLEKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLEREWDRELASKKNPKLINALRRCFFWRFMFYGIFLYLGEVTKAVQPLLLGRIIASYDPDNKEERSIAIYLGIGLCLLFIVRTLLLHPAIFGLHHIGMQMRIAMFSLIYKKTLKLSSRVLDKISIGQLVSLLSNNLNKFDEGLALAHFVWIAPLQVALLMGLIWELLQASAFCGLGFLIVLALFQAGLGRMMMKYRDQRAGKISERLVITSEMIENIQSVKAYCWEEAMEKMIENLRQTELKLTRKAAYVRYFNSSAFFFSGFFVVFLSVLPYALIKGIILRKIFTTISFCIVLRMAVTRQFPWAVQTWYDSLGAINKIQDFLQKQEYKTLEYNLTTTEVVMENVTAFWEEGFGELFEKAKQNNNNRKTSNGDDSLFFSNFSLLGTPVLKDINFKIERGQLLAVAGSTGAGKTSLLMVIMGELEPSEGKIKHSGRISFCSQFSWIMPGTIKENIIFGVSYDEYRYRSVIKACQLEEDISKFAEKDNIVLGEGGITLSGGQRARISLARAVYKDADLYLLDSPFGYLDVLTEKEIFESCVCKLMANKTRILVTSKMEHLKKADKILILHEGSSYFYGTFSELQNLQPDFSSKLMGCDSFDQFSAERRNSILTETLHRFSLEGDAPVSWTETKKQSFKQTGEFGEKRKNSILNPINSIRKFSIVQKTPLQMNGIEEDSDEPLERRLSLVPDSEQGEAILPRISVISTGPTLQARRRQSVLNLMTHSVNQGQNIHRKTTASTRKVSLAPQANLTELDIYSRRLSQETGLEISEEINEEDLKECFFDDMESIPAVTTWNTYLRYITVHKSLIFVLIWCLVIFLAEVAASLVVLWLLGNTPLQDKGNSTHSRNNSYAVIITSTSSYYVFYIYVGVADTLLAMGFFRGLPLVHTLITVSKILHHKMLHSVLQAPMSTLNTLKAGGILNRFSKDIAILDDLLPLTIFDFIQLLLIVIGAIAVVAVLQPYIFVATVPVIVAFIMLRAYFLQTSQQLKQLESEGRSPIFTHLVTSLKGLWTLRAFGRQPYFETLFHKALNLHTANWFLYLSTLRWFQMRIEMIFVIFFIAVTFISILTTGEGEGRVGIILTLAMNIMSTLQWAVNSSIDVDSLMRSVSRVFKFIDMPTEGKPTKSTKPYKNGQLSKVMIIENSHVKKDDIWPSGGQMTVKDLTAKYTEGGNAILENISFSISPGQRVGLLGRTGSGKSTLLSAFLRLLNTEGEIQIDGVSWDSITLQQWRKAFGVIPQKVFIFSGTFRKNLDPYEQWSDQEIWKVADEVGLRSVIEQFPGKLDFVLVDGGCVLSHGHKQLMCLARSVLSKAKILLLDEPSAHLDPVTYQIIRRTLKQAFADCTVILCEHRIEAMLECQQFLVIEENKVRQYDSIQKLLNERSLFRQAISPSDRVKLFPHRNSSKCKSKPQIAALKEETEEEVQDTRL,mutated_sequence,1.0,1480.0,NP_000483.3.a2m,NP_000483.3.npy,ClinVar
+NP_000484.2,NP_000484.2.csv,MLPQIPFLLLVSLNLVHGVFYAERYQMPTGIKGPLPNTKTQFFIPYTIKSKGIAVRGEQGTPGPPGPAGPRGHPGPSGPPGKPGYGSPGLQGEPGLPGPPGPSAVGKPGVPGLPGKPGERGPYGPKGDVGPAGLPGPRGPPGPPGIPGPAGISVPGKPGQQGPTGAPGPRGFPGEKGAPGVPGMNGQKGEMGYGAPGRPGERGLPGPQGPTGPSGPPGVGKRGENGVPGQPGIKGDRGFPGEMGPIGPPGPQGPPGERGPEGIGKPGAAGAPGQPGIPGTKGLPGAPGIAGPPGPPGFGKPGLPGLKGERGPAGLPGGPGAKGEQGPAGLPGKPGLTGPPGNMGPQGPKGIPGSHGLPGPKGETGPAGPAGYPGAKGERGSPGSDGKPGYPGKPGLDGPKGNPGLPGPKGDPGVGGPPGLPGPVGPAGAKGMPGHNGEAGPRGAPGIPGTRGPIGPPGIPGFPGSKGDPGSPGPPGPAGIATKGLNGPTGPPGPPGPRGHSGEPGLPGPPGPPGPPGQAVMPEGFIKAGQRPSLSGTPLVSANQGVTGMPVSAFTVILSKAYPAIGTPIPFDKILYNRQQHYDPRTGIFTCQIPGIYYFSYHVHVKGTHVWVGLYKNGTPVMYTYDEYTKGYLDQASGSAIIDLTENDQVWLQLPNAESNGLYSSEYVHSSFSGFLVAPM,mutated_sequence,1.0,680.0,NP_000484.2.a2m,NP_000484.2.npy,ClinVar
+NP_000485.3,NP_000485.3.csv,MDVTKKNKRDGTEVTERIVTETVTTRLTSLPPKGGTSNGYAKTASLGGGSRLEKQSLTHGSSGYINSTGSTRGHASTSSYRRAHSPASTLPNSPGSTFERKTHVTRHAYEGSSSGNSSPEYPRKEFASSSTRGRSQTRESEIRVRLQSASPSTRWTELDDVKRLLKGSRSASVSPTRNSSNTLPIPKKGTVETKIVTASSQSVSGTYDATILDANLPSHVWSSTLPAGSSMGTYHNNMTTQSSSLLNTNAYSAGSVFGVPNNMASCSPTLHPGLSTSSSVFGMQNNLAPSLTTLSHGTTTTSTAYGVKKNMPQSPAAVNTGVSTSAACTTSVQSDDLLHKDCKFLILEKDNTPAKKEMELLIMTKDSGKVFTASPASIAATSFSEDTLKKEKQAAYNADSGLKAEANGDLKTVSTKGKTTTADIHSYGSSGGGGSGGGGGVGGAGGGPWGPAPAWCPCGSCCSWWKWLLGLLLTWLLLLGLLFGLIALAEEVRKLKARVDELERIRRSILPYGDSMDRIEKDRLQGMAPAAGADLDKIGLHSDSQEELWMFVRKKLMMEQENGNLRGSPGPKGDMGSPGPKGDRGFPGTPGIPGPLGHPGPQGPKGQKGSVGDPGMEGPMGQRGREGPMGPRGEAGPPGSGEKGERGAAGEPGPHGPPGVPGSVGPKGSSGSPGPQGPPGPVGLQGLRGEVGLPGVKGDKGPMGPPGPKGDQGEKGPRGLTGEPGMRGLPGAVGEPGAKGAMGPAGPDGHQGPRGEQGLTGMPGIRGPPGPSGDPGKPGLTGPQGPQGLPGTPGRPGIKGEPGAPGKIVTSEGSSMLTVPGPPGPPGAMGPPGPPGAPGPAGPAGLPGHQEVLNLQGPPGPPGPRGPPGPSIPGPPGPRGPPGEGLPGPPGPPGSFLSNSETFLSGPPGPPGPPGPKGDQGPPGPRGHQGEQGLPGFSTSGSSSFGLNLQGPPGPPGPQGPKGDKGDPGVPGALGIPSGPSEGGSSSTMYVSGPPGPPGPPGPPGSISSSGQEIQQYISEYMQSDSIRSYLSGVQGPPGPPGPPGPVTTITGETFDYSELASHVVSYLRTSGYGVSLFSSSISSEDILAVLQRDDVRQYLRQYLMGPRGPPGPPGASGDGSLLSLDYAELSSRILSYMSSSGISIGLPGPPGPPGLPGTSYEELLSLLRGSEFRGIVGPPGPPGPPGIPGNVWSSISVEDLSSYLHTAGLSFIPGPPGPPGPPGPRGPPGVSGALATYAAENSDSFRSELISYLTSPDVRSFIVGPPGPPGPQGPPGDSRLLSTDASHSRGSSSSSHSSSVRRGSSYSSSMSTGGGGAGSLGAGGAFGEAAGDRGPYGTDIGPGGGYGAAAEGGMYAGNGGLLGADFAGDLDYNELAVRVSESMQRQGLLQGMAYTVQGPPGQPGPQGPPGISKVFSAYSNVTADLMDFFQTYGAIQGPPGQKGEMGTPGPKGDRGPAGPPGHPGPPGPRGHKGEKGDKGDQVYAGRRRRRSIAVKP,mutated_sequence,1.0,1497.0,NP_000485.3.a2m,NP_000485.3.npy,ClinVar
+NP_000489.3,NP_000489.3.csv,MALRAKAEVCVAAPWLSLQRARALGTRAARAPRTVLPFEAMPQHPGNRWLRLLQIWREQGYEHLHLEMHQTFQELGPIFRYNLGGPRMVCVMLPEDVEKLQQVDSLHPCRMILEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPKAVQRFLPMVDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLNFLHALEVMFKSTVQLMFMPRSLSRWISPKVWKEHFEAWDCIFQYGDNCIQKIYQELAFNRPQHYTGIVAELLLKAELSLEAIKANSMELTAGSVDTTAFPLLMTLFELARNPDVQQILRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVVSSDLVLQNYHIPAGTLVQVFLYSLGRNAALFPRPERYNPQRWLDIRGSGRNFHHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHFLVETLTQEDIKMVYSFILRPGTSPLLTFRAIN,mutated_sequence,1.0,503.0,NP_000489.3.a2m,NP_000489.3.npy,ClinVar
+NP_000497.1,NP_000497.1.csv,MAHVRGLQLPGCLALAALCSLVHSQHVFLAPQQARSLLQRVRRANTFLEEVRKGNLERECVEETCSYEEAFEALESSTATDVFWAKYTACETARTPRDKLAACLEGNCAEGLGTNYRGHVNITRSGIECQLWRSRYPHKPEINSTTHPGADLQENFCRNPDSSTTGPWCYTTDPTVRRQECSIPVCGQDQVTVAMTPRSEGSSVNLSPPLEQCVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGDEEGVWCYVAGKPGDFGYCDLNYCEEAVEEETGDGLDEDSDRAIEGRTATSEYQTFFNPRTFGSGEADCGLRPLFEKKSLEDKTERELLESYIDGRIVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTENDLLVRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLKKPVAFSDYIHPVCLPDRETAASLLQAGYKGRVTGWGNLKETWTANVGKGQPSVLQVVNLPIVERPVCKDSTRIRITDNMFCAGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFYTHVFRLKKWIQKVIDQFGE,mutated_sequence,1.0,622.0,NP_000497.1.a2m,NP_000497.1.npy,ClinVar
+NP_000498.2,NP_000498.2.csv,MADQAPFDTDVNTLTRFVMEEGRKARGTGELTQLLNSLCTAVKAISSAVRKAGIAHLYGIAGSTNVTGDQVKKLDVLSNDLVMNMLKSSFATCVLVSEEDKHAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSVGTIFGIYRKKSTDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMDCGVNCFMLDPAIGEFILVDKDVKIKKKGKIYSLNEGYARDFDPAVTEYIQRKKFPPDNSAPYGARYVGSMVADVHRTLVYGGIFLYPANKKSPNGKLRLLYECNPMAYVMEKAGGMATTGKEAVLDVIPTDIHQRAPVILGSPDDVLEFLKVYEKHSAQ,mutated_sequence,1.0,338.0,NP_000498.2.a2m,NP_000498.2.npy,ClinVar
+NP_000499.1,NP_000499.1.csv,MFSMRIVCLVLSVVGTAWTADSGEGDFLAEGGGVRGPRVVERHQSACKDSDWPFCSDEDWNYKCPSGCRMKGLIDEVNQDFTNRINKLKNSLFEYQKNNKDSHSLTTNIMEILRGDFSSANNRDNTYNRVSEDLRSRIEVLKRKVIEKVQHIQLLQKNVRAQLVDMKRLEVDIDIKIRSCRGSCSRALAREVDLKDYEDQQKQLEQVIAKDLLPSRDRQHLPLIKMKPVPDLVPGNFKSQLQKVPPEWKALTDMPQMRMELERPGGNEITRGGSTSYGTGSETESPRNPSSAGSWNSGSSGPGSTGNRNPGSSGTGGTATWKPGSSGPGSTGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSERGSAGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNPDWGTFEEVSGNVSPGTRREYHTEKLVTSKGDKELRTGKEKVTSGSTTTTRRSCSKTVTKTVIGPDGHKEVTKEVVTSEDGSDCPEAMDLGTLSGIGTLDGFRHRHPDEAAFFDTASTGKTFPGFFSPMLGEFVSETESRGSESGIFTNTKESSSHHPGIAEFPSRGKSSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHEGTHSTKRGHAKSRPVRDCDDVLQTHPSGTQSGIFNIKLPGSSKIFSVYCDQETSLGGWLLIQQRMDGSLNFNRTWQDYKRGFGSLNDEGEGEFWLGNDYLHLLTQRGSVLRVELEDWAGNEAYAEYHFRVGSEAEGYALQVSSYEGTAGDALIEGSVEEGAEYTSHNNMQFSTFDRDADQWEENCAEVYGGGWWYNNCQAANLNGIYYPGGSYDPRNNSPYEIENGVVWVSFRGADYSLRAVRMKIRPLVTQ,mutated_sequence,1.0,866.0,NP_000499.1.a2m,NP_000499.1.npy,ClinVar
+NP_000503.1,NP_000503.1.csv,MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH,mutated_sequence,1.0,522.0,NP_000503.1.a2m,NP_000503.1.npy,ClinVar
+NP_000507.1,NP_000507.1.csv,MGCLGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGAGESGKSTIVKQMRILHVNGFNGEGGEEDPQAARSNSDGEKATKVQDIKNNLKEAIETIVAAMSNLVPPVELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDCAQYFLDKIDVIKQADYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYELL,mutated_sequence,1.0,394.0,NP_000507.1.a2m,NP_000507.1.npy,ClinVar
+NP_000508.1,NP_000508.1.csv,MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR,mutated_sequence,1.0,142.0,NP_000508.1.a2m,NP_000508.1.npy,ClinVar
+NP_000509.1,NP_000509.1.csv,MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH,mutated_sequence,1.0,147.0,NP_000509.1.a2m,NP_000509.1.npy,ClinVar
+NP_000511.2,NP_000511.2.csv,MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSVLDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT,mutated_sequence,1.0,529.0,NP_000511.2.a2m,NP_000511.2.npy,ClinVar
+NP_000512.2,NP_000512.2.csv,MELCGLGLPRPPMLLALLLATLLAAMLALLTQVALVVQVAEAARAPSVSAKPGPALWPLPLLVKMTPNLLHLAPENFYISHSPNSTAGPSCTLLEEAFRRYHGYIFGFYKWHHEPAEFQAKTQVQQLLVSITLQSECDAFPNISSDESYTLLVKEPVAVLKANRVWGALRGLETFSQLVYQDSYGTFTINESTIIDSPRFSHRGILIDTSRHYLPVKIILKTLDAMAFNKFNVLHWHIVDDQSFPYQSITFPELSNKGSYSLSHVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQNKLDSFGPINPTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQKGFGTDFKKLESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSAYPEELSRVTASGFPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGEYVDATNLTPRLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYCNHENM,mutated_sequence,1.0,556.0,NP_000512.2.a2m,NP_000512.2.npy,ClinVar
+NP_000517.3,NP_000517.3.csv,MTTCSRQFTSSSSMKGSCGIGGGIGGGSSRISSVLAGGSCRAPSTYGGGLSVSSSRFSSGGACGLGGGYGGGFSSSSSSFGSGFGGGYGGGLGAGLGGGFGGGFAGGDGLLVGSEKVTMQNLNDRLASYLDKVRALEEANADLEVKIRDWYQRQRPAEIKDYSPYFKTIEDLRNKILTATVDNANVLLQIDNARLAADDFRTKYETELNLRMSVEADINGLRRVLDELTLARADLEMQIESLKEELAYLKKNHEEEMNALRGQVGGDVNVEMDAAPGVDLSRILNEMRDQYEKMAEKNRKDAEEWFFTKTEELNREVATNSELVQSGKSEISELRRTMQNLEIELQSQLSMKASLENSLEETKGRYCMQLAQIQEMIGSVEEQLAQLRCEMEQQNQEYKILLDVKTRLEQEIATYRRLLEGEDAHLSSSQFSSGSQSSRDVTSSSRQIRTKVMDVHDGKVVSTHEQVLRTKN,mutated_sequence,1.0,472.0,NP_000517.3.a2m,NP_000517.3.npy,ClinVar
+NP_000518.1,NP_000518.1.csv,MGPWGWKLRWTVALLLAAAGTAVGDRCERNEFQCQDGKCISYKWVCDGSAECQDGSDESQETCLSVTCKSGDFSCGGRVNRCIPQFWRCDGQVDCDNGSDEQGCPPKTCSQDEFRCHDGKCISRQFVCDSDRDCLDGSDEASCPVLTCGPASFQCNSSTCIPQLWACDNDPDCEDGSDEWPQRCRGLYVFQGDSSPCSAFEFHCLSGECIHSSWRCDGGPDCKDKSDEENCAVATCRPDEFQCSDGNCIHGSRQCDREYDCKDMSDEVGCVNVTLCEGPNKFKCHSGECITLDKVCNMARDCRDWSDEPIKECGTNECLDNNGGCSHVCNDLKIGYECLCPDGFQLVAQRRCEDIDECQDPDTCSQLCVNLEGGYKCQCEEGFQLDPHTKACKAVGSIAYLFFTNRHEVRKMTLDRSEYTSLIPNLRNVVALDTEVASNRIYWSDLSQRMICSTQLDRAHGVSSYDTVISRDIQAPDGLAVDWIHSNIYWTDSVLGTVSVADTKGVKRKTLFRENGSKPRAIVVDPVHGFMYWTDWGTPAKIKKGGLNGVDIYSLVTENIQWPNGITLDLLSGRLYWVDSKLHSISSIDVNGGNRKTILEDEKRLAHPFSLAVFEDKVFWTDIINEAIFSANRLTGSDVNLLAENLLSPEDMVLFHNLTQPRGVNWCERTTLSNGGCQYLCLPAPQINPHSPKFTCACPDGMLLARDMRSCLTEAEAAVATQETSTVRLKVSSTAVRTQHTTTRPVPDTSRLPGATPGLTTVEIVTMSHQALGDVAGRGNEKKPSSVRALSIVLPIVLLVFLCLGVFLLWKNWRLKNINSINFDNPVYQKTTEDEVHICHNQDGYSYPSRQMVSLEDDVA,mutated_sequence,1.0,860.0,NP_000518.1.a2m,NP_000518.1.npy,ClinVar
+NP_000521.2,NP_000521.2.csv,MAPGAPSSSPSPILAVLLFSSLVLSPAQAIVVYTDREVHGAVGSRVTLHCSFWSSEWVSDDISFTWRYQPEGGRDAISIFHYAKGQPYIDEVGTFKERIQWVGDPRWKDGSIVIHNLDYSDNGTFTCDVKNPPDIVGKTSQVTLYVFEKVPTRYGVVLGAVIGGVLGVVLLLLLLFYVVRYCWLRRQAALQRRLSAMEKGKLHKPGKDASKRGRQTPVLYAMLDHSRSTKAVSEKKAKGLGESRKDKK,mutated_sequence,1.0,248.0,NP_000521.2.a2m,NP_000521.2.npy,ClinVar
+NP_000522.3,NP_000522.3.csv,MLFNLRILLNNAAFRNGHNFMVRNFRCGQPLQNKVQLKGRDLLTLKNFTGEEIKYMLWLSADLKFRIKQKGEYLPLLQGKSLGMIFEKRSTRTRLSTETGFALLGGHPCFLTTQDIHLGVNESLTDTARVLSSMADAVLARVYKQSDLDTLAKEASIPIINGLSDLYHPIQILADYLTLQEHYSSLKGLTLSWIGDGNNILHSIMMSAAKFGMHLQAATPKGYEPDASVTKLAEQYAKENGTKLLLTNDPLEAAHGGNVLITDTWISMGQEEEKKKRLQAFQGYQVTMKTAKVAASDWTFLHCLPRKPEEVDDEVFYSPRSLVFPEAENRKWTIMAVMVSLLTDYSPQLQKPKF,mutated_sequence,1.0,354.0,NP_000522.3.a2m,NP_000522.3.npy,ClinVar
+NP_000523.2,NP_000523.2.csv,MAAALRVAAVGARLSVLASGLRAAVRSLCSQATSVNERIENKRRTALLGGGQRRIDAQHKRGKLTARERISLLLDPGSFVESDMFVEHRCADFGMAADKNKFPGDSVVTGRGRINGRLVYVFSQDFTVFGGSLSGAHAQKICKIMDQAITVGAPVIGLNDSGGARIQEGVESLAGYADIFLRNVTASGVIPQISLIMGPCAGGAVYSPALTDFTFMVKDTSYLFITGPDVVKSVTNEDVTQEELGGAKTHTTMSGVAHRAFENDVDALCNLRDFFNYLPLSSQDPAPVRECHDPSDRLVPELDTIVPLESTKAYNMVDIIHSVVDEREFFEIMPNYAKNIIVGFARMNGRTVGIVGNQPKVASGCLDINSSVKGARFVRFCDAFNIPLITFVDVPGFLPGTAQEYGGIIRHGAKLLYAFAEATVPKVTVITRKAYGGAYDVMSSKHLCGDTNYAWPTAEIAVMGAKGAVEIIFKGHENVEAAQAEYIEKFANPFPAAVRGFVDDIIQPSSTRARICCDLDVLASKKVQRPWRKHANIPL,mutated_sequence,1.0,539.0,NP_000523.2.a2m,NP_000523.2.npy,ClinVar
+NP_000524.3,NP_000524.3.csv,MGLLECCARCLVGAPFASLVATGLCFFGVALFCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGFYTTGAVRQIFGDYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLERVCHCLGKWLGHPDKFVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQSIAFPSKTSASIGSLCADARMYGVLPWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFVGAAATLVSLLTFMIAATYNFAVLKLMGRGTKF,mutated_sequence,1.0,277.0,NP_000524.3.a2m,NP_000524.3.npy,ClinVar
+NP_000526.2,NP_000526.2.csv,MERAESSSTEPAKAIKPIDRKSVHQICSGQVVLSLSTAVKELVENSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLKHHTSKIQEFADLTQVETFGFRGEALSSLCALSDVTISTCHASAKVGTRLMFDHNGKIIQKTPYPRPRGTTVSVQQLFSTLPVRHKEFQRNIKKEYAKMVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTGGSPSIKENIGSVFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQCTHGVGRSSTDRQFFFINRRPCDPAKVCRLVNEVYHMYNRHQYPFVVLNISVDSECVDINVTPDKRQILLQEEKLLLAVLKTSLIGMFDSDVNKLNVSQQPLLDVEGNLIKMHAADLEKPMVEKQDQSPSLRTGEEKKDVSISRLREAFSLRHTTENKPHSPKTPEPRRSPLGQKRGMLSSSTSGAISDKGVLRPQKEAVSSSHGPSDPTDRAEVEKDSGHGSTSVDSEGFSIPDTGSHCSSEYAASSPGDRGSQEHVDSQEKAPKTDDSFSDVDCHSNQEDTGCKFRVLPQPTNLATPNTKRFKKEEILSSSDICQKLVNTQDMSASQVDVAVKINKKVVPLDFSMSSLAKRIKQLHHEAQQSEGEQNYRKFRAKICPGENQAAEDELRKEISKTMFAEMEIIGQFNLGFIITKLNEDIFIVDQHATDEKYNFEMLQQHTVLQGQRLIAPQTLNLTAVNEAVLIENLEIFRKNGFDFVIDENAPVTERAKLISLPTSKNWTFGPQDVDELIFMLSDSPGVMCRPSRVKQMFASRACRKSVMIGTALNTSEMKKLITHMGEMDHPWNCPHGRPTMRHIANLGVISQN,mutated_sequence,1.0,862.0,NP_000526.2.a2m,NP_000526.2.npy,ClinVar
+NP_000527.2,NP_000527.2.csv,MSLQMVTVSNNIALIQPGFSLMNFDGQVFFFGQKGWPKRSCPTGVFHLDVKHNHVKLKPTIFSKDSCYLPPLRYPATCTFKGSLESEKHQYIIHGGKTPNNEVSDKIYVMSIVCKNNKKVTFRCTEKDLVGDVPEARYGHSINVVYSRGKSMGVLFGGRSYMPSTHRTTEKWNSVADCLPCVFLVDFEFGCATSYILPELQDGLSFHVSIAKNDTIYILGGHSLANNIRPANLYRIRVDLPLGSPAVNCTVLPGGISVSSAILTQTNNDEFVIVGGYQLENQKRMICNIISLEDNKIEIREMETPDWTPDIKHSKIWFGSNMGNGTVFLGIPGDNKQVVSEGFYFYMLKCAEDDTNEEQTTFTNSQTSTEDPGDSTPFEDSEEFCFSAEANSFDGDDEFDTYNEDDEEDESETGYWITCCPTCDVDINTWVPFYSTELNKPAMIYCSHGDGHWVHAQCMDLAERTLIHLSAGSNKYYCNEHVEIARALHTPQRVLPLKKPPMKSLRKKGSGKILTPAKKSFLRRLFD,mutated_sequence,1.0,527.0,NP_000527.2.a2m,NP_000527.2.npy,ClinVar
+NP_000528.1,NP_000528.1.csv,MDGWRRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKRMPSIRESLKERGVDMARLGPEWSQPMKRLTLGNTTSSVILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVTQMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALAR,mutated_sequence,1.0,406.0,NP_000528.1.a2m,NP_000528.1.npy,ClinVar
+NP_000530.1,NP_000530.1.csv,MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA,mutated_sequence,1.0,348.0,NP_000530.1.a2m,NP_000530.1.npy,ClinVar
+NP_000531.2,NP_000531.2.csv,MGDAEGEDEVQFLRTDDEVVLQCSATVLKEQLKLCLAAEGFGNRLCFLEPTSNAQNVPPDLAICCFVLEQSLSVRALQEMLANTVEAGVESSQGGGHRTLLYGHAILLRHAHSRMYLSCLTTSRSMTDKLAFDVGLQEDATGEACWWTMHPASKQRSEGEKVRVGDDIILVSVSSERYLHLSTASGELQVDASFMQTLWNMNPICSRCEEGFVTGGHVLRLFHGHMDECLTISPADSDDQRRLVYYEGGAVCTHARSLWRLEPLRISWSGSHLRWGQPLRVRHVTTGQYLALTEDQGLVVVDASKAHTKATSFCFRISKEKLDVAPKRDVEGMGPPEIKYGESLCFVQHVASGLWLTYAAPDPKALRLGVLKKKAMLHQEGHMDDALSLTRCQQEESQAARMIHSTNGLYNQFIKSLDSFSGKPRGSGPPAGTALPIEGVILSLQDLIIYFEPPSEDLQHEEKQSKLRSLRNRQSLFQEEGMLSMVLNCIDRLNVYTTAAHFAEFAGEEAAESWKEIVNLLYELLASLIRGNRSNCALFSTNLDWLVSKLDRLEASSGILEVLYCVLIESPEVLNIIQENHIKSIISLLDKHGRNHKVLDVLCSLCVCNGVAVRSNQDLITENLLPGRELLLQTNLINYVTSIRPNIFVGRAEGTTQYSKWYFEVMVDEVTPFLTAQATHLRVGWALTEGYTPYPGAGEGWGGNGVGDDLYSYGFDGLHLWTGHVARPVTSPGQHLLAPEDVISCCLDLSVPSISFRINGCPVQGVFESFNLDGLFFPVVSFSAGVKVRFLLGGRHGEFKFLPPPGYAPCHEAVLPRERLHLEPIKEYRREGPRGPHLVGPSRCLSHTDFVPCPVDTVQIVLPPHLERIREKLAENIHELWALTRIEQGWTYGPVRDDNKRLHPCLVDFHSLPEPERNYNLQMSGETLKTLLALGCHVGMADEKAEDNLKKTKLPKTYMMSNGYKPAPLDLSHVRLTPAQTTLVDRLAENGHNVWARDRVGQGWSYSAVQDIPARRNPRLVPYRLLDEATKRSNRDSLCQAVRTLLGYGYNIEPPDQEPSQVENQSRCDRVRIFRAEKSYTVQSGRWYFEFEAVTTGEMRVGWARPELRPDVELGADELAYVFNGHRGQRWHLGSEPFGRPWQPGDVVGCMIDLTENTIIFTLNGEVLMSDSGSETAFREIEIGDGFLPVCSLGPGQVGHLNLGQDVSSLRFFAICGLQEGFEPFAINMQRPVTTWFSKGLPQFEPVPLEHPHYEVSRVDGTVDTPPCLRLTHRTWGSQNSLVEMLFLRLSLPVQFHQHFRCTAGATPLAPPGLQPPAEDEARAAEPDPDYENLRRSAGGWSEAENGKEGTAKEGAPGGTPQAGGEAQPARAENEKDATTEKNKKRGFLFKAKKVAMMTQPPATPTLPRLPHDVVPADNRDDPEIILNTTTYYYSVRVFAGQEPSCVWAGWVTPDYHQHDMSFDLSKVRVVTVTMGDEQGNVHSSLKCSNCYMVWGGDFVSPGQQGRISHTDLVIGCLVDLATGLMTFTANGKESNTFFQVEPNTKLFPAVFVLPTHQNVIQFELGKQKNIMPLSAAMFQSERKNPAPQCPPRLEMQMLMPVSWSRMPNHFLQVETRRAGERLGWAVQCQEPLTMMALHIPEENRCMDILELSERLDLQRFHSHTLRLYRAVCALGNNRVAHALCSHVDQAQLLHALEDAHLPGPLRAGYYDLLISIHLESACRSRRSMLSEYIVPLTPETRAITLFPPGRSTENGHPRHGLPGVGVTTSLRPPHHFSPPCFVAALPAAGAAEAPARLSPAIPLEALRDKALRMLGEAVRDGGQHARDPVGGSVEFQFVPVLKLVSTLLVMGIFGDEDVKQILKMIEPEVFTEEEEEEDEEEEGEEEDEEEKEEDEEETAQEKEDEEKEEEEAAEGEKEEGLEEGLLQMKLPESVKLQMCHLLEYFCDQELQHRVESLAAFAERYVDKLQANQRSRYGLLIKAFSMTAAETARRTREFRSPPQEQINMLLQFKDGTDEEDCPLPEEIRQDLLDFHQDLLAHCGIQLDGEEEEPEEETTLGSRLMSLLEKVRLVKKKEEKPEEERSAEESKPRSLQELVSHMVVRWAQEDFVQSPELVRAMFSLLHRQYDGLGELLRALPRAYTISPSSVEDTMSLLECLGQIRSLLIVQMGPQEENLMIQSIGNIMNNKVFYQHPNLMRALGMHETVMEVMVNVLGGGESKEIRFPKMVTSCCRFLCYFCRISRQNQRSMFDHLSYLLENSGIGLGMQGSTPLDVAAASVIDNNELALALQEQDLEKVVSYLAGCGLQSCPMLVAKGYPDIGWNPCGGERYLDFLRFAVFVNGESVEENANVVVRLLIRKPECFGPALRGEGGSGLLAAIEEAIRISEDPARDGPGIRRDRRREHFGEEPPEENRVHLGHAIMSFYAALIDLLGRCAPEMHLIQAGKGEALRIRAILRSLVPLEDLVGIISLPLQIPTLGKDGALVQPKMSASFVPDHKASMVLFLDRVYGIENQDFLLHVLDVGFLPDMRAAASLDTATFSTTEMALALNRYLCLAVLPLITKCAPLFAGTEHRAIMVDSMLHTVYRLSRGRSLTKAQRDVIEDCLMSLCRYIRPSMLQHLLRRLVFDVPILNEFAKMPLKLLTNHYERCWKYYCLPTGWANFGVTSEEELHLTRKLFWGIFDSLAHKKYDPELYRMAMPCLCAIAGALPPDYVDASYSSKAEKKATVDAEGNFDPRPVETLNVIIPEKLDSFINKFAEYTHEKWAFDKIQNNWSYGENIDEELKTHPMLRPYKTFSEKDKEIYRWPIKESLKAMIAWEWTIEKAREGEEEKTEKKKTRKISQSAQTYDPREGYNPQPPDLSAVTLSRELQAMAEQLAENYHNTWGRKKKQELEAKGGGTHPLLVPYDTLTAKEKARDREKAQELLKFLQMNGYAVTRGLKDMELDSSSIEKRFAFGFLQQLLRWMDISQEFIAHLEAVVSSGRVEKSPHEQEIKFFAKILLPLINQYFTNHCLYFLSTPAKVLGSGGHASNKEKEMITSLFCKLAALVRHRVSLFGTDAPAVVNCLHILARSLDARTVMKSGPEIVKAGLRSFFESASEDIEKMVENLRLGKVSQARTQVKGVGQNLTYTTVALLPVLTTLFQHIAQHQFGDDVILDDVQVSCYRTLCSIYSLGTTKNTYVEKLRPALGECLARLAAAMPVAFLEPQLNEYNACSVYTTKSPRERAILGLPNSVEEMCPDIPVLERLMADIGGLAESGARYTEMPHVIEITLPMLCSYLPRWWERGPEAPPSALPAGAPPPCTAVTSDHLNSLLGNILRIIVNNLGIDEASWMKRLAVFAQPIVSRARPELLQSHFIPTIGRLRKRAGKVVSEEEQLRLEAKAEAQEGELLVRDEFSVLCRDLYALYPLLIRYVDNNRAQWLTEPNPSAEELFRMVGEIFIYWSKSHNFKREEQNFVVQNEINNMSFLTADNKSKMAKAGDIQSGGSDQERTKKKRRGDRYSVQTSLIVATLKKMLPIGLNMCAPTDQDLITLAKTRYALKDTDEEVREFLHNNLHLQGKVEGSPSLRWQMALYRGVPGREEDADDPEKIVRRVQEVSAVLYYLDQTEHPYKSKKAVWHKLLSKQRRRAVVACFRMTPLYNLPTHRACNMFLESYKAAWILTEDHSFEDRMIDDLSKAGEQEEEEEEVEEKKPDPLHQLVLHFSRTALTEKSKLDEDYLYMAYADIMAKSCHLEEGGENGEAEEEVEVSFEEKQMEKQRLLYQQARLHTRGAAEMVLQMISACKGETGAMVSSTLKLGISILNGGNAEVQQKMLDYLKDKKEVGFFQSIQALMQTCSVLDLNAFERQNKAEGLGMVNEDGTVINRQNGEKVMADDEFTQDLFRFLQLLCEGHNNDFQNYLRTQTGNTTTINIIICTVDYLLRLQESISDFYWYYSGKDVIEEQGKRNFSKAMSVAKQVFNSLTEYIQGPCTGNQQSLAHSRLWDAVVGFLHVFAHMMMKLAQDSSQIELLKELLDLQKDMVVMLLSLLEGNVVNGMIARQMVDMLVESSSNVEMILKFFDMFLKLKDIVGSEAFQDYVTDPRGLISKKDFQKAMDSQKQFSGPEIQFLLSCSEADENEMINCEEFANRFQEPARDIGFNVAVLLTNLSEHVPHDPRLHNFLELAESILEYFRPYLGRIEIMGASRRIERIYFEISETNRAQWEMPQVKESKRQFIFDVVNEGGEAEKMELFVSFCEDTIFEMQIAAQISEPEGEPETDEDEGAGAAEAGAEGAEEGAAGLEGTAATAAAGATARVVAAAGRALRGLSYRSLRRRVRRLRRLTAREAATAVAALLWAAVTRAGAAGAGAAAGALGLLWGSLFGGGLVEGAKKVTVTELLAGMPDPTSDEVHGEQPAGPGGDADGEGASEGAGDAAEGAGDEEEAVHEAGPGGADGAVAVTDGGPFRPEGAGGLGDMGDTTPAEPPTPEGSPILKRKLGVDGVEEELPPEPEPEPEPELEPEKADAENGEKEEVPEPTPEPPKKQAPPSPPPKKEEAGGEFWGELEVQRVKFLNYLSRNFYTLRFLALFLAFAINFILLFYKVSDSPPGEDDMEGSAAGDVSGAGSGGSSGWGLGAGEEAEGDEDENMVYYFLEESTGYMEPALRCLSLLHTLVAFLCIIGYNCLKVPLVIFKREKELARKLEFDGLYITEQPEDDDVKGQWDRLVLNTPSFPSNYWDKFVKRKVLDKHGDIYGRERIAELLGMDLATLEITAHNERKPNPPPGLLTWLMSIDVKYQIWKFGVIFTDNSFLYLGWYMVMSLLGHYNNFFFAAHLLDIAMGVKTLRTILSSVTHNGKQLVMTVGLLAVVVYLYTVVAFNFFRKFYNKSEDEDEPDMKCDDMMTCYLFHMYVGVRAGGGIGDEIEDPAGDEYELYRVVFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGSDYFDTTPHGFETHTLEEHNLANYMFFLMYLINKDETEHTGQESYVWKMYQERCWDFFPAGDCFRKQYEDQLS,mutated_sequence,1.0,5038.0,NP_000531.2.a2m,NP_000531.2.npy,ClinVar
+NP_000534.3,NP_000534.3.csv,MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLVLALALALALALALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPICKGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFEDDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGSGLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWTGDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHESTPVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTHVDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGSSHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAWHNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQLSARADSPALCRHLMPDGSLPEAQSLWPRPLFC,mutated_sequence,1.0,631.0,NP_000534.3.a2m,NP_000534.3.npy,ClinVar
+NP_000536.6,NP_000536.6.csv,MVSKLSQLQTELLAALLESGLSKEALIQALGEPGPYLLAGEGPLDKGESCGGGRGELAELPNGLGETRGSEDETDDDGEDFTPPILKELENLSPEEAAHQKAVVETLLQEDPWRVAKMVKSYLQQHNIPQREVVDTTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHAGQGGLIEEPTGDELPTKKGRRNRFKWGPASQQILFQAYERQKNPSKEERETLVEECNRAECIQRGVSPSQAQGLGSNLVTEVRVYNWFANRRKEEAFRHKLAMDTYSGPPPGPGPGPALPAHSSPGLPPPALSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVSPTGLEPSHSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMASLPGVMTIGPGEPASLGPTFTNTGASTLVIGLASTQAQSVPVINSMGSSLTTLQPVQFSQPLHPSYQQPLMPPVQSHVTQSPFMATMAQLQSPHALYSHKPEVAQYTHTGLLPQTMLITDTTNLSALASLTPTKQVFTSDTEASSESGLHTPASQATTLHVPSQDPASIQHLQPAHRLSASPTVSSSSLVLYQSSDSSNGQSHLLPSNHSVIETFISTQMASSSQ,mutated_sequence,1.0,631.0,NP_000536.6.a2m,NP_000536.6.npy,ClinVar
+NP_000537.3,NP_000537.3.csv,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,mutated_sequence,1.0,393.0,NP_000537.3.a2m,NP_000537.3.npy,ClinVar
+NP_000539.2,NP_000539.2.csv,MAKPTSKDSGLKEKFKILLGLGTPRPNPRSAEGKQTEFIITAEILRELSMECGLNNRIRMIGQICEVAKTKKFEEHAVEALWKAVADLLQPERPLEARHAVLALLKAIVQGQGERLGVLRALFFKVIKDYPSNEDLHERLEVFKALTDNGRHITYLEEELADFVLQWMDVGLSSEFLLVLVNLVKFNSCYLDEYIARMVQMICLLCVRTASSVDIEVSLQVLDAVVCYNCLPAESLPLFIVTLCRTINVKELCEPCWKLMRNLLGTHLGHSAIYNMCHLMEDRAYMEDAPLLRGAVFFVGMALWGAHRLYSLRNSPTSVLPSFYQAMACPNEVVSYEIVLSITRLIKKYRKELQVVAWDILLNIIERLLQQLQTLDSPELRTIVHDLLTTVEELCDQNEFHGSQERYFELVERCADQRPESSLLNLISYRAQSIHPAKDGWIQNLQALMERFFRSESRGAVRIKVLDVLSFVLLINRQFYEEELINSVVISQLSHIPEDKDHQVRKLATQLLVDLAEGCHTHHFNSLLDIIEKVMARSLSPPPELEERDVAAYSASLEDVKTAVLGLLVILQTKLYTLPASHATRVYEMLVSHIQLHYKHSYTLPIASSIRLQAFDFLLLLRADSLHRLGLPNKDGVVRFSPYCVCDYMEPERGSEKKTSGPLSPPTGPPGPAPAGPAVRLGSVPYSLLFRVLLQCLKQESDWKVLKLVLGRLPESLRYKVLIFTSPCSVDQLCSALCSMLSGPKTLERLRGAPEGFSRTDLHLAVVPVLTALISYHNYLDKTKQREMVYCLEQGLIHRCASQCVVALSICSVEMPDIIIKALPVLVVKLTHISATASMAVPLLEFLSTLARLPHLYRNFAAEQYASVFAISLPYTNPSKFNQYIVCLAHHVIAMWFIRCRLPFRKDFVPFITKGLRSNVLLSFDDTPEKDSFRARSTSLNERPKSLRIARPPKQGLNNSPPVKEFKESSAAEAFRCRSISVSEHVVRSRIQTSLTSASLGSADENSVAQADDSLKNLHLELTETCLDMMARYVFSNFTAVPKRSPVGEFLLAGGRTKTWLVGNKLVTVTTSVGTGTRSLLGLDSGELQSGPESSSSPGVHVRQTKEAPAKLESQAGQQVSRGARDRVRSMSGGHGLRVGALDVPASQFLGSATSPGPRTAPAAKPEKASAGTRVPVQEKTNLAAYVPLLTQGWAEILVRRPTGNTSWLMSLENPLSPFSSDINNMPLQELSNALMAAERFKEHRDTALYKSLSVPAASTAKPPPLPRSNTVASFSSLYQSSCQGQLHRSVSWADSAVVMEEGSPGEVPVLVEPPGLEDVEAALGMDRRTDAYSRSSSVSSQEEKSLHAEELVGRGIPIERVVSSEGGRPSVDLSFQPSQPLSKSSSSPELQTLQDILGDPGDKADVGRLSPEVKARSQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPRSPSGLRPRGYTISDSAPSRRGKRVERDALKSRATASNAEKVPGINPSFVFLQLYHSPFFGDESNKPILLPNESQSFERSVQLLDQIPSYDTHKIAVLYVGEGQSNSELAILSNEHGSYRYTEFLTGLGRLIELKDCQPDKVYLGGLDVCGEDGQFTYCWHDDIMQAVFHIATLMPTKDVDKHRCDKKRHLGNDFVSIVYNDSGEDFKLGTIKGQFNFVHVIVTPLDYECNLVSLQCRKDMEGLVDTSVAKIVSDRNLPFVARQMALHANMASQVHHSRSNPTDIYPSKWIARLRHIKRLRQRICEEAAYSNPSLPLVHPPSHSKAPAQTPAEPTPGYEVGQRKRLISSVEDFTEFV,mutated_sequence,1.0,1807.0,NP_000539.2.a2m,NP_000539.2.npy,ClinVar
+NP_000542.1,NP_000542.1.csv,MPRRAENWDEAEVGAEEAGVEEYGPEEDGGEESGAEESGPEESGPEELGAEEEMEAGRPRPVLRSVNSREPSQVIFCNRSPRVVLPVWLNFDGEPQPYPTLPPGTGRRIHSYRGHLWLFRDAGTHDGLLVNQTELFVPSLNVDGQPIFANITLPVYTLKERCLQVVRSLVKPENYRRLDIVRSLYEDLEDHPNVQKDLERLTQERIAHQRMGD,mutated_sequence,1.0,213.0,NP_000542.1.a2m,NP_000542.1.npy,ClinVar
+NP_000543.3,NP_000543.3.csv,MIPARFAGVLLALALILPGTLCAEGTRGRSSTARCSLFGSDFVNTFDGSMYSFAGYCSYLLAGGCQKRSFSIIGDFQNGKRVSLSVYLGEFFDIHLFVNGTVTQGDQRVSMPYASKGLYLETEAGYYKLSGEAYGFVARIDGSGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFMTQEGTLTSDPYDFANSWALSSGEQWCERASPPSSSCNISSGEMQKGLWEQCQLLKSTSVFARCHPLVDPEPFVALCEKTLCECAGGLECACPALLEYARTCAQEGMVLYGWTDHSACSPVCPAGMEYRQCVSPCARTCQSLHINEMCQERCVDGCSCPEGQLLDEGLCVESTECPCVHSGKRYPPGTSLSRDCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCQDHSFSIVIETVQCADDRDAVCTRSVTVRLPGLHNSLVKLKHGAGVAMDGQDVQLPLLKGDLRIQHTVTASVRLSYGEDLQMDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFLTPSGLAEPRVEDFGNAWKLHGDCQDLQKQHSDPCALNPRMTRFSEEACAVLTSPTFEACHRAVSPLPYLRNCRYDVCSCSDGRECLCGALASYAAACAGRGVRVAWREPGRCELNCPKGQVYLQCGTPCNLTCRSLSYPDEECNEACLEGCFCPPGLYMDERGDCVPKAQCPCYYDGEIFQPEDIFSDHHTMCYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADNLRAEGLECTKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGETVKIGCNTCVCQDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYLFPGECQYVLVQDYCGSNPGTFRILVGNKGCSHPSVKCKKRVTILVEGGEIELFDGEVNVKRPMKDETHFEVVESGRYIILLLGKALSVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVDFGNSWKVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNKLVDPEPYLDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPQSCEERNLRENGYECEWRYNSCAPACQVTCQHPEPLACPVQCVEGCHAHCPPGKILDELLQTCVDPEDCPVCEVAGRRFASGKKVTLNPSDPEHCQICHCDVVNLTCEACQEPGGLVVPPTDAPVSPTTLYVEDISEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMMERLRISQKWVRVAVVEYHDGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFSKIDRPEASRITLLLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIRLIEKQAPENKAFVLSSVDELEQQRDEIVSYLCDLAPEAPPPTLPPDMAQVTVGPGLLGVSTLGPKRNSMVLDVAFVLEGSDKIGEADFNRSKEFMEEVIQRMDVGQDSIHVTVLQYSYMVTVEYPFSEAQSKGDILQRVREIRYQGGNRTNTGLALRYLSDHSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNAPILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSSFPASYFDEMKSFAKAFISKANIGPRLTQVSVLQYGSITTIDVPWNVVPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTSEMHGARPGASKAVVILVTDVSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGPAGDSNVVKLQRIEDLPTMVTLGNSFLHKLCSGFVRICMDEDGNEKRPGDVWTLPDQCHTVTCQPDGQTLLKSHRVNCDRGLRPSCPNSQSPVKVEETCGCRWTCPCVCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNKEQDLEVILHNGACSPGARQGCMKSIEVKHSALSVELHSDMEVTVNGRLVSVPYVGGNMEVNVYGAIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDENGANDFMLRDGTVTTDWKTLVQEWTVQRPGQTCQPILEEQCLVPDSSHCQVLLLPLFAECHKVLAPATFYAICQQDSCHQEQVCEVIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNHCEHGCPRHCDGNVSSCGDHPSEGCFCPPDKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQICTCLSGRKVNCTTQPCPTAKAPTCGLCEVARLRQNADQCCPEYECVCDPVSCDLPPVPHCERGLQPTLTNPGECRPNFTCACRKEECKRVSPPSCPPHRLPTLRKTQCCDEYECACNCVNSTVSCPLGYLASTATNDCGCTTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAVMGLRVAQCSQKPCEDSCRSGFTYVLHEGECCGRCLPSACEVVTGSPRGDSQSSWKSVGSQWASPENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCERMEACMLNGTVIGPGKTVMIDVCTTCRCMVQVGVISGFKLECRKTTCNPCPLGYKEENNTGECCGRCLPTACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHKCLAEGGKIMKIPGTCCDTCEEPECNDITARLQYVKVGSCKSEVEVDIHYCQGKCASKAMYSIDINDVQDQCSCCSPTRTEPMQVALHCTNGSVVYHEVLNAMECKCSPRKCSK,mutated_sequence,1.0,2813.0,NP_000543.3.a2m,NP_000543.3.npy,ClinVar
+NP_000608.1,NP_000608.1.csv,MVLGPEQKMSDDSVSGDHGESASLGNINPAYSNPSLSQSPGDSEEYFATYFNEKISIPEEEYSCFSFRKLWAFTGPGFLMSIAYLDPGNIESDLQSGAVAGFKLLWILLLATLVGLLLQRLAARLGVVTGLHLAEVCHRQYPKVPRVILWLMVELAIIGSDMQEVIGSAIAINLLSVGRIPLWGGVLITIADTFVFLFLDKYGLRKLEAFFGFLITIMALTFGYEYVTVKPSQSQVLKGMFVPSCSGCRTPQIEQAVGIVGAVIMPHNMYLHSALVKSRQVNRNNKQEVREANKYFFIESCIALFVSFIINVFVVSVFAEAFFGKTNEQVVEVCTNTSSPHAGLFPKDNSTLAVDIYKGGVVLGCYFGPAALYIWAVGILAAGQSSTMTGTYSGQFVMEGFLNLKWSRFARVVLTRSIAIIPTLLVAVFQDVEHLTGMNDFLNVLQSLQLPFALIPILTFTSLRPVMSDFANGLGWRIAGGILVLIICSINMYFVVVYVRDLGHVALYVVAAVVSVAYLGFVFYLGWQCLIALGMSFLDCGHTVSISKGLLTEEATRGYVK,mutated_sequence,1.0,561.0,NP_000608.1.a2m,NP_000608.1.npy,ClinVar
+NP_000633.2,NP_000633.2.csv,MGHSKQIRILLLNEMEKLEKTLFRLEQGYELQFRLGPTLQGKAVTVYTNYPFPGETFNREKFRSLDWENPTEREDDSDKYCKLNLQQSGSFQYYFLQGNEKSGGGYIVVDPILRVGADNHVLPLDCVTLQTFLAKCLGPFDEWESRLRVAKESGYNMIHFTPLQTLGLSRSCYSLANQLELNPDFSRPNRKYTWNDVGQLVEKLKKEWNVICITDVVYNHTAANSKWIQEHPECAYNLVNSPHLKPAWVLDRALWRFSCDVAEGKYKEKGIPALIENDHHMNSIRKIIWEDIFPKLKLWEFFQVDVNKAVEQFRRLLTQENRRVTKSDPNQHLTIIQDPEYRRFGCTVDMNIALTTFIPHDKGPAAIEECCNWFHKRMEELNSEKHRLINYHQEQAVNCLLGNVFYERLAGHGPKLGPVTRKHPLVTRYFTFPFEEIDFSMEESMIHLPNKACFLMAHNGWVMGDDPLRNFAEPGSEVYLRRELICWGDSVKLRYGNKPEDCPYLWAHMKKYTEITATYFQGVRLDNCHSTPLHVAEYMLDAARNLQPNLYVVAELFTGSEDLDNVFVTRLGISSLIREAMSAYNSHEEGRLVYRYGGEPVGSFVQPCLRPLMPAIAHALFMDITHDNECPIVHRSAYDALPSTTIVSMACCASGSTRGYDELVPHQISVVSEERFYTKWNPEALPSNTGEVNFQSGIIAARCAISKLHQELGAKGFIQVYVDQVDEDIVAVTRHSPSIHQSVVAVSRTAFRNPKTSFYSKEVPQMCIPGKIEEVVLEARTIERNTKPYRKDENSINGTPDITVEIREHIQLNESKIVKQAGVATKGPNEYIQEIEFENLSPGSVIIFRVSLDPHAQVAVGILRNHLTQFSPHFKSGSLAVDNADPILKIPFASLASRLTLAELNQILYRCESEEKEDGGGCYDIPNWSALKYAGLQGLMSVLAEIRPKNDLGHPFCNNLRSGDWMIDYVSNRLISRSGTIAEVGKWLQAMFFYLKQIPRYLIPCYFDAILIGAYTTLLDTAWKQMSSFVQNGSTFVKHLSLGSVQLCGVGKFPSLPILSPALMDVPYRLNEITKEKEQCCVSLAAGLPHFSSGIFRCWGRDTFIALRGILLITGRYVEARNIILAFAGTLRHGLIPNLLGEGIYARYNCRDAVWWWLQCIQDYCKMVPNGLDILKCPVSRMYPTDDSAPLPAGTLDQPLFEVIQEAMQKHMQGIQFRERNAGPQIDRNMKDEGFNITAGVDEETGFVYGGNRFNCGTWMDKMGESDRARNRGIPATPRDGSAVEIVGLSKSAVRWLLELSKKNIFPYHEVTVKRHGKAIKVSYDEWNRKIQDNFEKLFHVSEDPSDLNEKHPNLVHKRGIYKDSYGASSPWCDYQLRPNFTIAMVVAPELFTTEKAWKALEIAEKKLLGPLGMKTLDPDDMVYCGIYDNALDNDNYNLAKGFNYHQGPEWLWPIGYFLRAKLYFSRLMGPETTAKTIVLVKNVLSRHYVHLERSPWKGLPELTNENAQYCPFSCETQAWSIATILETLYDL,mutated_sequence,1.0,1532.0,NP_000633.2.a2m,NP_000633.2.npy,ClinVar
+NP_000711.1,NP_000711.1.csv,MMMMMMMKKMQHQRQQQADHANEANYARGTRLPLSGEGPTSQPNSSKQTVLSWQAAIDAARQAKAAQTMSTSAPPPVGSLSQRKRQQYAKSKKQGNSSNSRPARALFCLSLNNPIRRACISIVEWKPFDIFILLAIFANCVALAIYIPFPEDDSNSTNHNLEKVEYAFLIIFTVETFLKIIAYGLLLHPNAYVRNGWNLLDFVIVIVGLFSVILEQLTKETEGGNHSSGKSGGFDVKALRAFRVLRPLRLVSGVPSLQVVLNSIIKAMVPLLHIALLVLFVIIIYAIIGLELFIGKMHKTCFFADSDIVAEEDPAPCAFSGNGRQCTANGTECRSGWVGPNGGITNFDNFAFAMLTVFQCITMEGWTDVLYWVNDAIGWEWPWVYFVSLIILGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKLREKQQLEEDLKGYLDWITQAEDIDPENEEEGGEEGKRNTSMPTSETESVNTENVSGEGENRGCCGSLWCWWRRRGAAKAGPSGCRRWGQAISKSKLSRRWRRWNRFNRRRCRAAVKSVTFYWLVIVLVFLNTLTISSEHYNQPDWLTQIQDIANKVLLALFTCEMLVKMYSLGLQAYFVSLFNRFDCFVVCGGITETILVELEIMSPLGISVFRCVRLLRIFKVTRHWTSLSNLVASLLNSMKSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDETQTKRSTFDNFPQALLTVFQILTGEDWNAVMYDGIMAYGGPSSSGMIVCIYFIILFICGNYILLNVFLAIAVDNLADAESLNTAQKEEAEEKERKKIARKESLENKKNNKPEVNQIANSDNKVTIDDYREEDEDKDPYPPCDVPVGEEEEEEEEDEPEVPAGPRPRRISELNMKEKIAPIPEGSAFFILSKTNPIRVGCHKLINHHIFTNLILVFIMLSSAALAAEDPIRSHSFRNTILGYFDYAFTAIFTVEILLKMTTFGAFLHKGAFCRNYFNLLDMLVVGVSLVSFGIQSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYRCTDEAKSNPEECRGLFILYKDGDVDSPVVRERIWQNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDSNGENIGPIYNHRVEISIFFIIYIIIVAFFMMNIFVGFVIVTFQEQGEKEYKNCELDKNQRQCVEYALKARPLRRYIPKNPYQYKFWYVVNSSPFEYMMFVLIMLNTLCLAMQHYEQSKMFNDAMDILNMVFTGVFTVEMVLKVIAFKPKGYFSDAWNTFDSLIVIGSIIDVALSEADPTESENVPVPTATPGNSEESNRISITFFRLFRVMRLVKLLSRGEGIRTLLWTFIKSFQALPYVALLIAMLFFIYAVIGMQMFGKVAMRDNNQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLACLPGKLCDPESDYNPGEEYTCGSNFAIVYFISFYMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWSEYDPEAKGRIKHLDVVTLLRRIQPPLGFGKLCPHRVACKRLVAMNMPLNSDGTVMFNATLFALVRTALKIKTEGNLEQANEELRAVIKKIWKKTSMKLLDQVVPPAGDDEVTVGKFYATFLIQDYFRKFKKRKEQGLVGKYPAKNTTIALQAGLRTLHDIGPEIRRAISCDLQDDEPEETKREEEDDVFKRNGALLGNHVNHVNSDRRDSLQQTNTTHRPLHVQRPSIPPASDTEKPLFPPAGNSVCHNHHNHNSIGKQVPTSTNANLNNANMSKAAHGKRPSIGNLEHVSENGHHSSHKHDREPQRRSSVKRTRYYETYIRSDSGDEQLPTICREDPEIHGYFRDPHCLGEQEYFSSEECYEDDSSPTWSRQNYGYYSRYPGRNIDSERPRGYHHPQGFLEDDDSPVCYDSRRSPRRRLLPPTPASHRRSSFNFECLRRQSSQEEVPSSPIFPHRTALPLHLMQQQIMAVAGLDSSKAQKYSPSHSTRSWATPPATPPYRDWTPCYTPLIQVEQSEALDQVNGSLPSLHRSSWYTDEPDISYRTFTPASLTVPSSFRNKNSDKQRSADSLVEAVLISEGLGRYARDPKFVSATKHEIADACDLTIDEMESAASTLLNGNVRPRANGDVGPLSHRQDYELQDFGPGYSDEEPDPGRDEEDLADEMICITTL,mutated_sequence,1.0,2181.0,NP_000711.1.a2m,NP_000711.1.npy,ClinVar
+NP_000735.1,NP_000735.1.csv,MELGGPGAPRLLPPLLLLLGTGLLRASSHVETRAHAEERLLKKLFSGYNKWSRPVANISDVVLVRFGLSIAQLIDVDEKNQMMTTNVWVKQEWHDYKLRWDPADYENVTSIRIPSELIWRPDIVLYNNADGDFAVTHLTKAHLFHDGRVQWTPPAIYKSSCSIDVTFFPFDQQNCTMKFGSWTYDKAKIDLVNMHSRVDQLDFWESGEWVIVDAVGTYNTRKYECCAEIYPDITYAFVIRRLPLFYTINLIIPCLLISCLTVLVFYLPSECGEKITLCISVLLSLTVFLLLITEIIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHHRSPRTHTMPTWVRRVFLDIVPRLLLMKRPSVVKDNCRRLIESMHKMASAPRFWPEPEGEPPATSGTQSLHPPSPSFCVPLDVPAEPGPSCKSPSDQLPPQQPLEAEKASPHPSPGPCRPPHGTQAPGLAKARSLSVQHMSSPGEAVEGGVRCRSRSIQYCVPRDDAAPEADGQAAGALASRNTHSAELPPPDQPSPCKCTCKKEPSSVSPSATVKTRSTKAPPPHLPLSPALTRAVEGVQYIADHLKAEDTDFSVKEDWKYVAMVIDRIFLWMFIIVCLLGTVGLFLPPWLAGMI,mutated_sequence,1.0,627.0,NP_000735.1.a2m,NP_000735.1.npy,ClinVar
+NP_000738.2,NP_000738.2.csv,MTPGALLMLLGALGAPLAPGVRGSEAEGRLREKLFSGYDSSVRPAREVGDRVRVSVGLILAQLISLNEKDEEMSTKVYLDLEWTDYRLSWDPAEHDGIDSLRITAESVWLPDVVLLNNNDGNFDVALDISVVVSSDGSVRWQPPGIYRSSCSIQVTYFPFDWQNCTMVFSSYSYDSSEVSLQTGLGPDGQGHQEIHIHEGTFIENGQWEIIHKPSRLIQPPGDPRGGREGQRQEVIFYLIIRRKPLFYLVNVIAPCILITLLAIFVFYLPPDAGEKMGLSIFALLTLTVFLLLLADKVPETSLSVPIIIKYLMFTMVLVTFSVILSVVVLNLHHRSPHTHQMPLWVRQIFIHKLPLYLRLKRPKPERDLMPEPPHCSSPGSGWGRGTDEYFIRKPPSDFLFPKPNRFQPELSAPDLRRFIDGPNRAVALLPELREVVSSISYIARQLQEQEDHDALKEDWQFVAMVVDRLFLWTFIIFTSVGTLVIFLDATYHLPPPDPFP,mutated_sequence,1.0,501.0,NP_000738.2.a2m,NP_000738.2.npy,ClinVar
+NP_000772.2,NP_000772.2.csv,MLAKGLPPRSVLVKGCQTFLSAPREGLGRLRVPTGEGAGISTRSPRPFNEIPSPGDNGWLNLYHFWRETGTHKVHLHHVQNFQKYGPIYREKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKKDRVALNQEVMAPEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFESITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSKADIYTQNFYWELRQKGSVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDTTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHPISVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQQ,mutated_sequence,1.0,521.0,NP_000772.2.a2m,NP_000772.2.npy,ClinVar
+NP_000780.1,NP_000780.1.csv,MGAASGRRGPGLLLPLPLLLLLPPQPALALDPGLQPGNFSADEAGAQLFAQSYNSSAEQVLFQSVAASWAHDTNITAENARRQEEAALLSQEFAEAWGQKAKELYEPIWQNFTDPQLRRIIGAVRTLGSANLPLAKRQQYNALLSNMSRIYSTAKVCLPNKTATCWSLDPDLTNILASSRSYAMLLFAWEGWHNAAGIPLKPLYEDFTALSNEAYKQDGFTDTGAYWRSWYNSPTFEDDLEHLYQQLEPLYLNLHAFVRRALHRRYGDRYINLRGPIPAHLLGDMWAQSWENIYDMVVPFPDKPNLDVTSTMLQQGWNATHMFRVAEEFFTSLELSPMPPEFWEGSMLEKPADGREVVCHASAWDFYNRKDFRIKQCTRVTMDQLSTVHHEMGHIQYYLQYKDLPVSLRRGANPGFHEAIGDVLALSVSTPEHLHKIGLLDRVTNDTESDINYLLKMALEKIAFLPFGYLVDQWRWGVFSGRTPPSRYNFDWWYLRTKYQGICPPVTRNETHFDAGAKFHVPNVTPYIRYFVSFVLQFQFHEALCKEAGYEGPLHQCDIYRSTKAGAKLRKVLQAGSSRPWQEVLKDMVGLDALDAQPLLKYFQPVTQWLQEQNQQNGEVLGWPEYQWHPPLPDNYPEGIDLVTDEAEASKFVEEYDRTSQVVWNEYAEANWNYNTNITTETSKILLQKNMQIANHTLKYGTQARKFDVNQLQNTTIKRIIKKVQDLERAALPAQELEEYNKILLDMETTYSVATVCHPNGSCLQLEPDLTNVMATSRKYEDLLWAWEGWRDKAGRAILQFYPKYVELINQAARLNGYVDAGDSWRSMYETPSLEQDLERLFQELQPLYLNLHAYVRRALHRHYGAQHINLEGPIPAHLLGNMWAQTWSNIYDLVVPFPSAPSMDTTEAMLKQGWTPRRMFKEADDFFTSLGLLPVPPEFWNKSMLEKPTDGREVVCHASAWDFYNGKDFRIKQCTTVNLEDLVVAHHEMGHIQYFMQYKDLPVALREGANPGFHEAIGDVLALSVSTPKHLHSLNLLSSEGGSDEHDINFLMKMALDKIAFIPFSYLVDQWRWRVFDGSITKENYNQEWWSLRLKYQGLCPPVPRTQGDFDPGAKFHIPSSVPYIRYFVSFIIQFQFHEALCQAAGHTGPLHKCDIYQSKEAGQRLATAMKLGFSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLRTENELHGEKLGWPQYNWTPNSARSEGPLPDSGRVSFLGLDLDAQQARVGQWLLLFLGIALLVATLGLSQRLFSIRHRSLHRHSHGPQFGSEVELRHS,mutated_sequence,1.0,1306.0,NP_000780.1.a2m,NP_000780.1.npy,ClinVar
+NP_000808.2,NP_000808.2.csv,MASSTPSSSATSSNAGADPNTTNLRPTTYDTWCGVAHGCTRKLGLKICGFLQRTNSLEEKSRLVSAFKERQSSKNLLSCENSDRDARFRRTETDFSNLFARDLLPAKNGEEQTVQFLLEVVDILLNYVRKTFDRSTKVLDFHHPHQLLEGMEGFNLELSDHPESLEQILVDCRDTLKYGVRTGHPRFFNQLSTGLDIIGLAGEWLTSTANTNMFTYEIAPVFVLMEQITLKKMREIVGWSSKDGDGIFSPGGAISNMYSIMAARYKYFPEVKTKGMAAVPKLVLFTSEQSHYSIKKAGAALGFGTDNVILIKCNERGKIIPADFEAKILEAKQKGYVPFYVNATAGTTVYGAFDPIQEIADICEKYNLWLHVDAAWGGGLLMSRKHRHKLNGIERANSVTWNPHKMMGVLLQCSAILVKEKGILQGCNQMCAGYLFQPDKQYDVSYDTGDKAIQCGRHVDIFKFWLMWKAKGTVGFENQINKCLELAEYLYAKIKNREEFEMVFNGEPEHTNVCFWYIPQSLRGVPDSPQRREKLHKVAPKIKALMMESGTTMVGYQPQGDKANFFRMVISNPAATQSDIDFLIEEIERLGQDL,mutated_sequence,1.0,594.0,NP_000808.2.a2m,NP_000808.2.npy,ClinVar
+NP_000866.1,NP_000866.1.csv,MKSGSGGGSPTSLWGLLFLSAALSLWPTSGEICGPGIDIRNDYQQLKRLENCTVIEGYLHILLISKAEDYRSYRFPKLTVITEYLLLFRVAGLESLGDLFPNLTVIRGWKLFYNYALVIFEMTNLKDIGLYNLRNITRGAIRIEKNADLCYLSTVDWSLILDAVSNNYIVGNKPPKECGDLCPGTMEEKPMCEKTTINNEYNYRCWTTNRCQKMCPSTCGKRACTENNECCHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFCANILSAESSDSEGFVIHDGECMQECPSGFIRNGSQSMYCIPCEGPCPKVCEEEKKTKTIDSVTSAQMLQGCTIFKGNLLINIRRGNNIASELENFMGLIEVVTGYVKIRHSHALVSLSFLKNLRLILGEEQLEGNYSFYVLDNQNLQQLWDWDHRNLTIKAGKMYFAFNPKLCVSEIYRMEEVTGTKGRQSKGDINTRNNGERASCESDVLHFTSTTTSKNRIIITWHRYRPPDYRDLISFTVYYKEAPFKNVTEYDGQDACGSNSWNMVDVDLPPNKDVEPGILLHGLKPWTQYAVYVKAVTLTMVENDHIRGAKSEILYIRTNASVPSIPLDVLSASNSSSQLIVKWNPPSLPNGNLSYYIVRWQRQPQDGYLYRHNYCSKDKIPIRKYADGTIDIEEVTENPKTEVCGGEKGPCCACPKTEAEKQAEKEEAEYRKVFENFLHNSIFVPRPERKRRDVMQVANTTMSSRSRNTTAADTYNITDPEELETEYPFFESRVDNKERTVISNLRPFTLYRIDIHSCNHEAEKLGCSASNFVFARTMPAEGADDIPGPVTWEPRPENSIFLKWPEPENPNGLILMYEIKYGSQVEDQRECVSRQEYRKYGGAKLNRLNPGNYTARIQATSLSGNGSWTDPVFFYVQAKTGYENFIHLIIALPVAVLLIVGGLVIMLYVFHRKRNNSRLGNGVLYASVNPEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKGVVKDEPETRVAIKTVNEAASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIMELMTRGDLKSYLRSLRPEMENNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARNCMVAEDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGVVLWEIATLAEQPYQGLSNEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFLEIISSIKEEMEPGFREVSFYYSEENKLPEPEELDLEPENMESVPLDPSASSSSLPLPDRHSGHKAENGPGPGVLVLRASFDERQPYAHMNGGRKNERALPLPQSSTC,mutated_sequence,1.0,1367.0,NP_000866.1.a2m,NP_000866.1.npy,ClinVar
+NP_000882.1,NP_000882.1.csv,MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDASKEGKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEDDSENGVPESTSTDTPPDIDLHNQASVPLEPRPLRRESEI,mutated_sequence,1.0,427.0,NP_000882.1.a2m,NP_000882.1.npy,ClinVar
+NP_000885.1,NP_000885.1.csv,MEMLQGLLLLLLLSMGGAWASREPLRPWCHPINAILAVEKEGCPVCITVNTTICAGYCPTMMRVLQAVLPPLPQVVCTYRDVRFESIRLPGCPRGVDPVVSFPVALSCRCGPCRRSTSDCGGPKDHPLTCDHPQLSGLLFL,mutated_sequence,1.0,141.0,NP_000885.1.a2m,NP_000885.1.npy,ClinVar
+NP_000890.1,NP_000890.1.csv,MKKTQTWILTCIYLQLLLFNPLVKTEGICRNRVTNNVKDVTKLVANLPKDYMITLKYVPGMDVLPSHCWISEMVVQLSDSLTDLLDKFSNISEGLSNYSIIDKLVNIVDDLVECVKENSSKDLKKSFKSPEPRLFTPEEFFRIFNRSIDAFKDFVVASETSDCVVSSTLSPEKDSRVSVTKPFMLPPVAASSLRNDSSSSNRKAKNPPGDSSLHWAAMALPALFSLIIGFAFGALYWKKRQPSLTRAVENIQINEEDNEISMLQEKEREFQEV,mutated_sequence,1.0,273.0,NP_000890.1.a2m,NP_000890.1.npy,ClinVar
+NP_000912.3,NP_000912.3.csv,MAVPGDAARVRDKPVHSGVSQAPTAGRDCHHRADPASPRDSGCRGCWGDLVLQPLRSSRKLSSALCAGSLSFLLALLVRLVRGEVGCDLEQCKEAAAAEEEEAAPGAEGGVFPGPRGGAPGGGARLSPWLQPSALLFSLLCAFFWMGLYLLRAGVRLPLAVALLAACCGGEALVQIGLGVGEDHLLSLPAAGVVLSCLAAATWLVLRLRLGVLMIALTSAVRTVSLISLERFKVAWRPYLAYLAGVLGILLARYVEQILPQSAEAAPREHLGSQLIAGTKEDIPVFKRRRRSSSVVSAEMSGCSSKSHRRTSLPCIPREQLMGHSEWDHKRGPRGSQSSGTSITVDIAVMGEAHGLITDLLADPSLPPNVCTSLRAVSNLLSTQLTFQAIHKPRVNPVTSLSENYTCSDSEESSEKDKLAIPKRLRRSLPPGLLRRVSSTWTTTTSATGLPTLEPAPVRRDRSTSIKLQEAPSSSPDSWNNPVMMTLTKSRSFTSSYAISAANHVKAKKQSRPGALAKISPLSSPCSSPLQGTPASSLVSKISAVQFPESADTTAKQSLGSHRALTYTQSAPDLSPQILTPPVICSSCGRPYSQGNPADEPLERSGVATRTPSRTDDTAQVTSDYETNNNSDSSDIVQNEDETECLREPLRKASACSTYAPETMMFLDKPILAPEPLVMDNLDSIMEQLNTWNFPIFDLVENIGRKCGRILSQVSYRLFEDMGLFEAFKIPIREFMNYFHALEIGYRDIPYHNRIHATDVLHAVWYLTTQPIPGLSTVINDHGSTSDSDSDSGFTHGHMGYVFSKTYNVTDDKYGCLSGNIPALELMALYVAAAMHDYDHPGRTNAFLVATSAPQAVLYNDRSVLENHHAAAAWNLFMSRPEYNFLINLDHVEFKHFRFLVIEAILATDLKKHFDFVAKFNGKVNDDVGIDWTNENDRLLVCQMCIKLADINGPAKCKELHLQWTDGIVNEFYEQGDEEASLGLPISPFMDRSAPQLANLQESFISHIVGPLCNSYDSAGLMPGKWVEDSDESGDTDDPEEEEEEAPAPNEEETCENNESPKKKTFKRRKIYCQITQHLLQNHKMWKKVIEEEQRLAGIENQSLDQTPQSHSSEQIQAIKEEEEEKGKPRGEEIPTQKPDQ,mutated_sequence,1.0,1141.0,NP_000912.3.a2m,NP_000912.3.npy,ClinVar
+NP_000925.2,NP_000925.2.csv,MALLWGLLVLSWSCLQGPCSVFSPVSAMEPLGRQLTSGPNQEQVSPLTLLKLGNQEPGGQTALKSPPGVCSRDPTPEQTHRLARAMMAFTADLFSLVAQTSTCPNLILSPLSVALALSHLALGAQNHTLQRLQQVLHAGSGPCLPHLLSRLCQDLGPGAFRLAARMYLQKGFPIKEDFLEQSEQLFGAKPVSLTGKQEDDLANINQWVKEATEGKIQEFLSGLPEDTVLLLLNAIHFQGFWRNKFDPSLTQRDSFHLDEQFTVPVEMMQARTYPLRWFLLEQPEIQVAHFPFKNNMSFVVLVPTHFEWNVSQVLANLSWDTLHPPLVWERPTKVRLPKLYLKHQMDLVATLSQLGLQELFQAPDLRGISEQSLVVSGVQHQSTLELSEVGVEAAAATSIAMSRMSLSSFSVNRPFLFFIFEDTTGLPLFVGSVRNPNPSAPRELKEQQDSPGNKDFLQSLKGFPRGDKLFGPDLKLVPPMEEDYPQFGSPK,mutated_sequence,1.0,491.0,NP_000925.2.a2m,NP_000925.2.npy,ClinVar
+NP_000928.1,NP_000928.1.csv,MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDPRQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVDSNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKGHGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEPRYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAAHVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVDFSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYIIRDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWSTFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHINCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEMGHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKAHNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKINISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHAMGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGESVEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMREDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKLVIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAIAHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLTVFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMPDFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKIIITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALERELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLMEAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYTPTSPSYSPSSPEYTPTSPKYSPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTSPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN,mutated_sequence,1.0,1970.0,NP_000928.1.a2m,NP_000928.1.npy,ClinVar
+NP_000966.2,NP_000966.2.csv,MAQDQGEKENPMRELRIRKLCLNICVGESGDRLTRAAKVLEQLTGQTPVFSKARYTVRSFGIRRNEKIAVHCTVRGAKAEEILEKGLKVREYELRKNNFSDTGNFGFGIQEHIDLGIKYDPSIGIYGLDFYVVLGRPGFSIADKKRRTGCIGAKHRISKEEAMRWFQQKYDGIILPGK,mutated_sequence,1.0,178.0,NP_000966.2.a2m,NP_000966.2.npy,ClinVar
+NP_001001344.1,NP_001001344.1.csv,MGDMANSSIEFHPKPQQQRDVPQAGGFGCTLAELRTLMELRGAEALQKIEEAYGDVSGLCRRLKTSPTEGLADNTNDLEKRRQIYGQNFIPPKQPKTFLQLVWEALQDVTLIILEVAAIVSLGLSFYAPPGEESEACGNVSGGAEDEGEAEAGWIEGAAILLSVICVVLVTAFNDWSKEKQFRGLQSRIEQEQKFTVIRNGQLLQVPVAALVVGDIAQVKYGDLLPADGVLIQANDLKIDESSLTGESDHVRKSADKDPMLLSGTHVMEGSGRMVVTAVGVNSQTGIIFTLLGAGGEEEEKKDKKGKQQDGAMESSQTKAKKQDGAVAMEMQPLKSAEGGEMEEREKKKANAPKKEKSVLQGKLTKLAVQIGKAGLVMSAITVIILVLYFVIETFVVEGRTWLAECTPVYVQYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATAICSDKTGTLTTNRMTVVQSYLGDTHYKEIPAPSALTPKILDLLVHAISINSAYTTKILPPEKEGALPRQVGNKTECALLGFVLDLKRDFQPVREQIPEDKLYKVYTFNSVRKSMSTVIRMPDGGFRLFSKGASEILLKKCTNILNSNGELRGFRPRDRDDMVRKIIEPMACDGLRTICIAYRDFSAGQEPDWDNENEVVGDLTCIAVVGIEDPVRPEVPEAIRKCQRAGITVRMVTGDNINTARAIAAKCGIIQPGEDFLCLEGKEFNRRIRNEKGEIEQERLDKVWPKLRVLARSSPTDKHTLVKGIIDSTTGEQRQVVAVTGDGTNDGPALKKADVGFAMGIAGTDVAKEASDIILTDDNFTSIVKAVMWGRNVYDSISKFLQFQLTVNVVAVIVAFTGACITQDSPLKAVQMLWVNLIMDTFASLALATEPPTESLLLRKPYGRDKPLISRTMMKNILGHAVYQLAIIFTLLFVGELFFDIDSGRNAPLHSPPSEHYTIIFNTFVMMQLFNEINARKIHGERNVFDGIFSNPIFCTIVLGTFGIQIVIVQFGGKPFSCSPLSTEQWLWCLFVGVGELVWGQVIATIPTSQLKCLKEAGHGPGKDEMTDEELAEGEEEIDHAERELRRGQILWFRGLNRIQTQIRVVKAFRSSLYEGLEKPESKTSIHNFMATPEFLINDYTHNIPLIDDTDVDENEERLRAPPPPSPNQNNNAIDSGIYLTTHVTKSATSSVFSSSPGSPLHSVETSL,mutated_sequence,1.0,1220.0,NP_001001344.1.a2m,NP_001001344.1.npy,ClinVar
+NP_001003694.1,NP_001003694.1.csv,MGVDFDVKTFCHNLRATKPPYECPVETCRKVYKSYSGIEYHLYHYDHDNPPPPQQTPLRKHKKKGRQSRPANKQSPSPSEVSQSPGREVMSYAQAQRMVEVDLHGRVHRISIFDNLDVVSEDEEAPEEAPENGSNKENTETPAATPKSGKHKNKEKRKDSNHHHHHNVSASTTPKLPEVVYRELEQDTPDAPPRPTSYYRYIEKSAEELDEEVEYDMDEEDYIWLDIMNERRKTEGVSPIPQEIFEYLMDRLEKESYFESHNKGDPNALVDEDAVCCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQWLCRRCLQSPSRAVDCALCPNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRGSGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFSVRKTAYCDIHTPPGSARRLPALSHSEGEEDEDEEEDEGKGWSSEKVKKAKAKSRIKMKKARKILAEKRAAAPVVSVPCIPPHRLSKITNRLTIQRKSQFMQRLHSYWTLKRQSRNGVPLLRRLQTHLQSQRNCDQVGRDSEDKNWALKEQLKSWQRLRHDLERARLLVELIRKREKLKRETIKVQQIAMEMQLTPFLILLRKTLEQLQEKDTGNIFSEPVPLSEVTELDEVPDYLDHIKKPMDFFTMKQNLEAYRYLNFDDFEEDFNLIVSNCLKYNAKDTIFYRAAVRLREQGGAVLRQARRQAEKMGIDFETGMHIPHSLAGDEATHHTEDAAEEERLVLLENQKHLPVEEQLKLLLERLDEVNASKQSVGRSRRAKMIKKEMTALRRKLAHQRETGRDGPERHGPSSRGSLTPHPAACDKDGQTDSAAEESSSQETSKGLGPNMSSTPAHEVGRRTSVLFSKKNPKTAGPPKRPGRPPKNRESQMTPSHGGSPVGPPQLPIMSSLRQRKRGRSPRPSSSSDSDSDKSTEDPPMDLPANGFSGGNQPVKKSFLVYRNDCSLPRSSSDSESSSSSSSSAASDRTSTTPSKQGRGKPSFSRGTFPEDSSEDTSGTENEAYSVGTGRGVGHSMVRKSLGRGAGWLSEDEDSPLDALDLVWAKCRGYPSYPALIIDPKMPREGMFHHGVPIPVPPLEVLKLGEQMTQEAREHLYLVLFFDNKRTWQWLPRTKLVPLGVNQDLDKEKMLEGRKSNIRKSVQIAYHRALQHRSKVQGEQSSETSDSD,mutated_sequence,1.0,1220.0,NP_001003694.1.a2m,NP_001003694.1.npy,ClinVar
+NP_001003800.1,NP_001003800.1.csv,MSAPSEEEEYARLVMEAQPEWLRAEVKRLSHELAETTREKIQAAEYGLAVLEEKHQLKLQFEELEVDYEAIRSEMEQLKEAFGQAHTNHKKVAADGESREESLIQESASKEQYYVRKVLELQTELKQLRNVLTNTQSENERLASVAQELKEINQNVEIQRGRLRDDIKEYKFREARLLQDYSELEEENISLQKQVSVLRQNQVEFEGLKHEIKRLEEETEYLNSQLEDAIRLKEISERQLEEALETLKTEREQKNSLRKELSHYMSINDSFYTSHLHVSLDGLKFSDDAAEPNNDAEALVNGFEHGGLAKLPLDNKTSTPKKEGLAPPSPSLVSDLLSELNISEIQKLKQQLMQMEREKAGLLATLQDTQKQLEHTRGSLSEQQEKVTRLTENLSALRRLQASKERQTALDNEKDRDSHEDGDYYEVDINGPEILACKYHVAVAEAGELREQLKALRSTHEAREAQHAEEKGRYEAEGQALTEKVSLLEKASRQDRELLARLEKELKKVSDVAGETQGSLSVAQDELVTFSEELANLYHHVCMCNNETPNRVMLDYYREGQGGAGRTSPGGRTSPEARGRRSPILLPKGLLAPEAGRADGGTGDSSPSPGSSLPSPLSDPRREPMNIYNLIAIIRDQIKHLQAAVDRTTELSRQRIASQELGPAVDKDKEALMEEILKLKSLLSTKREQITTLRTVLKANKQTAEVALANLKSKYENEKAMVTETMMKLRNELKALKEDAATFSSLRAMFATRCDEYITQLDEMQRQLAAAEDEKKTLNSLLRMAIQQKLALTQRLELLELDHEQTRRGRAKAAPKTKPATPSVSHTCACASDRAEGTGLANQVFCSEKHSIYCD,mutated_sequence,1.0,855.0,NP_001003800.1.a2m,NP_001003800.1.npy,ClinVar
+NP_001003891.1,NP_001003891.1.csv,MDVSGQETDWRSTAFRQKLVSQIEDAMRKAGVAHSKSSKDMESHVFLKAKTRDEYLSLVARLIIHFRDIHNKKSQASVSDPMNALQSLTGGPAAGAAGIGMPPRGPGQSLGGMGSLGAMGQPMSLSGQPPPGTSGMAPHSMAVVSTATPQTQLQLQQVALQQQQQQQQFQQQQQAALQQQQQQQQQQQFQAQQSAMQQQFQAVVQQQQQLQQQQQQQQHLIKLHHQNQQQIQQQQQQLQRIAQLQLQQQQQQQQQQQQQQQQALQAQPPIQQPPMQQPQPPPSQALPQQLQQMHHTQHHQPPPQPQQPPVAQNQPSQLPPQSQTQPLVSQAQALPGQMLYTQPPLKFVRAPMVVQQPPVQPQVQQQQTAVQTAQAAQMVAPGVQMITEALAQGGMHIRARFPPTTAVSAIPSSSIPLGRQPMAQVSQSSLPMLSSPSPGQQVQTPQSMPPPPQPSPQPGQPSSQPNSNVSSGPAPSPSSFLPSPSPQPSQSPVTARTPQNFSVPSPGPLNTPVNPSSVMSPAGSSQAEEQQYLDKLKQLSKYIEPLRRMINKIDKNEDRKKDLSKMKSLLDILTDPSKRCPLKTLQKCEIALEKLKNDMAVPTPPPPPVPPTKQQYLCQPLLDAVLANIRSPVFNHSLYRTFVPAMTAIHGPPITAPVVCTRKRRLEDDERQSIPSVLQGEVARLDPKFLVNLDPSHCSNNGTVHLICKLDDKDLPSVPPLELSVPADYPAQSPLWIDRQWQYDANPFLQSVHRCMTSRLLQLPDKHSVTALLNTWAQSVHQACLSAA,mutated_sequence,1.0,788.0,NP_001003891.1.a2m,NP_001003891.1.npy,ClinVar
+NP_001004311.2,NP_001004311.2.csv,MDPAPGVLDPRAAPPALLGTPQAEVLEDVLREQFGPLPQLAAVCRLKRLPSGGYSSTENLQLVLERRRVANAKERERIKNLNRGFARLKALVPFLPQSRKPSKVDILKGATEYIQVLSDLLEGAKDSKKQDPDEQSYSNNSSESHTSSARQLSRNITQHISCAFGLKNEEEGPWADGGSGEPAHACRHSVMSTTEIISPTRSLDRFPEVELLSHRLPQV,mutated_sequence,1.0,219.0,NP_001004311.2.a2m,NP_001004311.2.npy,ClinVar
+NP_001005273.1,NP_001005273.1.csv,MKAADTVILWARSKNDQLRISFPPGLCWGDRMPDKDDIRLLPSALGVKKRKRGPKKQKENKPGKPRKRKKRDSEEEFGSERDEYREKSESGGSEYGTGPGRKRRRKHREKKEKKTKRRKKGEGDGGQKQVEQKSSATLLLTWGLEDVEHVFSEEDYHTLTNYKAFSQFMRPLIAKKNPKIPMSKMMTILGAKWREFSANNPFKGSAAAVAAAAAAAAAAVAEQVSAAVSSATPIAPSGPPALPPPPAADIQPPPIRRAKTKEGKGPGHKRRSKSPRVPDGRKKLRGKKMAPLKIKLGLLGGKRKKGGSYVFQSDEGPEPEAEESDLDSGSVHSASGRPDGPVRTKKLKRGRPGRKKKKVLGCPAVAGEEEVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAVPAPQQADGNPDVPPPRPLQGRSEREFFVKWVGLSYWHCSWAKELQLEIFHLVMYRNYQRKNDMDEPPPLDYGSGEDDGKSDKRKVKDPHYAEMEEKYYRFGIKPEWMTVHRIINHSVDKKGNYHYLVKWRDLPYDQSTWEEDEMNIPEYEEHKQSYWRHRELIMGEDPAQPRKYKKKKKELQGDGPPSSPTNDPTVKYETQPRFITATGGTLHMYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGHTKGPFLVSAPLSTIINWEREFQMWAPKFYVVTYTGDKDSRAIIRENEFSFEDNAIKGGKKAFKMKREAQVKFHVLLTSYELITIDQAALGSIRWACLVVDEAHRLKNNQSKFFRVLNGYKIDHKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRRLKADVFKNMPAKTELIVRVELSPMQKKYYKYILTRNFEALNSRGGGNQVSLLNIMMDLKKCCNHPYLFPVAAMESPKLPSGAYEGGALIKSSGKLMLLQKMLRKLKEQGHRVLIFSQMTKMLDLLEDFLDYEGYKYERIDGGITGALRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIFDSDWNPHNDIQAFSRAHRIGQANKVMIYRFVTRASVEERITQVAKRKMMLTHLVVRPGLGSKAGSMSKQELDDILKFGTEELFKDENEGENKEEDSSVIHYDNEAIARLLDRNQDATEDTDVQNMNEYLSSFKVAQYVVREEDKIEEIEREIIKQEENVDPDYWEKLLRHHYEQQQEDLARNLGKGKRVRKQVNYNDAAQEDQDNQSEYSVGSEEEDEDFDERPEGRRQSKRQLRNEKDKPLPPLLARVGGNIEVLGFNTRQRKAFLNAVMRWGMPPQDAFTTQWLVRDLRGKTEKEFKAYVSLFMRHLCEPGADGSETFADGVPREGLSRQQVLTRIGVMSLVKKKVQEFEHINGRWSMPELMPDPSADSKRSSRASSPTKTSPTTPEASATNSPCTSKPATPAPSEKGEGIRTPLEKEEAENQEEKPEKNSRIGEKMETEADAPSPAPSLGERLEPRKIPLEDEVPGVPGEMEPEPGYRGDREKSATESTPGERGEEKPLDGQEHRERPEGETGDLGKREDVKGDRELRPGPRDEPRSNGRREEKTEKPRFMFNIADGGFTELHTLWQNEERAAISSGKLNEIWHRRHDYWLLAGIVLHGYARWQDIQNDAQFAIINEPFKTEANKGNFLEMKNKFLARRFKLLEQALVIEEQLRRAAYLNLSQEPAHPAMALHARFAEAECLAESHQHLSKESLAGNKPANAVLHKVLNQLEELLSDMKADVTRLPATLSRIPPIAARLQMSERSILSRLASKGTEPHPTPAYPPGPYATPPGYGAAFSAAPVGALAAAGANYSQMPAGSFITAATNGPPVLVKKEKEMVGALVSDGLDRKEPRAGEVICIDD,mutated_sequence,1.0,2000.0,NP_001005273.1.a2m,NP_001005273.1.npy,ClinVar
+NP_001009899.3,NP_001009899.3.csv,MPEMTENETPTKKQHRKKNRETHNAVERHRKKKINAGINRIGELIPCSPALKQSKNMILDQAFKYITELKRQNDELLLNGGNNEQAEEIKKLRKQLEEIQKENGRYIELLKANDICLYDDPTIHWKGNLKNSKVSVVIPSDQVQKKIIVYSNGNQPGGNSQGTAVQGITFNVSHNLQKQTANVVPVQRTCNLVTPVSISGVYPSENKPWHQTTVPALATNQPVPLCLPAAISAQSILELPTSESESNVLGATSGSLIAVSIESEPHQHHSLHTCLNDQNSSENKNGQENPKVLKKMTPCVTNIPHSSSATATKVHHGNKSCLSIQDFRGDFQNTFVVSVTTTVCSQPPRTAGDSSPMSISKSADLTSTATVVASSAPGVGKATIPISTLSGNPLDNGWTLSCSLPSSSVSTSDLKNINSLTRISSAGNTQTTWTTLQLAGNTIQPLSQTPSSAVTPVLNESGTSPTTSNHSRYVATDINLNNSFPADGQPVEQVVVTLPSCPSLPMQPLIAQPQVKSQPPKNILPLNSAMQVIQMAQPVGSAVNSAPTNQNVIILQPPSTTPCPTVMRAEVSNQTVGQQIVIIQAANQNPLPLLPAPPPGSVRLPINGANTVIGSNNSVQNVPTPQTFGGKHLVHILPRPSSLSASNSTQTFSVTMSNQQPQTISLNGQLFALQPVMSSSGTTNQTPMQIIQPTTSEDPNTNVALNTFGALASLNQSISQMAGQSCVQLSISQPANSQTAANSQTTTANCVSLTTTAAPPVTTDSSATLASTYNLVSTSSMNTVACLPNMKSKRLNKKPGGRKHLAANKSACPLNSVRDVSKLDCPNTEGSAEPPCNDGLLESFPAVLPSVSVSQANSVSVSASHSLGVLSSESLIPESVSKSKSAEKSSPPSQESVTSEHFAMAAAKSKDSTPNLQQETSQDKPPSSLALSDAAKPCASANVLIPSPSDPHILVSQVPGLSSTTSTTSTDCVSEVEIIAEPCRVEQDSSDTMQTTGLLKGQGLTTLLSDLAKKKNPQKSSLSDQMDHPDFSSENPKIVDSSVNLHPKQELLLMNNDDRDPPQHHSCLPDQEVINGSLINGRQADSPMSTSSGSSRSFSVASMLPETTREDVTSNATTNTCDSCTFVEQTDIVALAARAIFDQENLEKGRVGLQADIREVASKPSEASLLEGDPPFKSQIPKESGTGQAEATPNEFNSQGSIEATMERPLEKPSCSLGIKTSNASLQDSTSQPPSITSLSVNNLIHQSSISHPLASCAGLSPTSEQTTVPATVNLTVSSSSYGSQPPGPSLMTEYSQEQLNTMTSTIPNSQIQEPLLKPSHESRKDSAKRAVQDDLLLSSAKRQKHCQPAPLRLESMSLMSRTPDTISDQTQMMVSQIPPNSSNSVVPVSNPAHGDGLTRLFPPSNNFVTPALRQTEVQCGSQPSVAEQQQTQASQHLQALQQHVPAQGVSHLHSNHLYIKQQQQQQQQQQQQQQQQQAGQLRERHHLYQMQHHVPHAESSVHSQPHNVHQQRTLQQEVQMQKKRNLVQGTQTSQLSLQPKHHGTDQSRSKTGQPHPHHQQMQQQMQQHFGSSQTEKSCENPSTSRNHHNHPQNHLNQDIMHQQQDVGSRQQGSGVSSEHVSGHNPMQRLLTSRGLEQQMVSQPSIVTRSSDMTCTPHRPERNRVSSYSAEALIGKTSSNSEQRMGISIQGSRVSDQLEMRSYLDVPRNKSLAIHNMQGRVDHTVASDIRLSDCQTFKPSGASQQPQSNFEVQSSRNNEIGNPVSSLRSMQSQAFRISQNTGPPPIDRQKRLSYPPVQSIPTGNGIPSRDSENTCHQSFMQSLLAPHLSDQVIGSQRSLSEHQRNTQCGPSSAIEYNCPPTHENVHIRRESESQNRESCDMSLGAINTRNSTLNIPFSSSSSSGDIQGRNTSPNVSVQKSNPMRITESHATKGHMNPPVTTNMHGVARPALPHPSVSHGNGDQGPAVRQANSSVPQRSRHPLQDSSGSKIRQPERNRSGNQRQSTVFDPSLPHLPLSTGGSMILGRQQPATEKRGSIVRFMPDSPQVPNDNSGPDQHTLSQNFGFSFIPEGGMNPPINANASFIPQVTQPSATRTPALIPVDPQNTLPSFYPPYSPAHPTLSNDISIPYFPNQMFSNPSTEKVNSGSLNNRFGSILSPPRPVGFAQPSFPLLPDMPPMHMTNSHLSNFNMTSLFPEIATALPDGSAMSPLLTIANSSASDSSKQSSNRPAHNISHILGHDCSSAV,mutated_sequence,1.0,2245.0,NP_001009899.3.a2m,NP_001009899.3.npy,ClinVar
+NP_001009944.3,NP_001009944.3.csv,MPPAAPARLALALGLGLWLGALAGGPGRGCGPCEPPCLCGPAPGAACRVNCSGRGLRTLGPALRIPADATALDVSHNLLRALDVGLLANLSALAELDISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGCGEEYVACLPDNSSGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPPPPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGPAASHRYVLPGRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPSDTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLDVWIGFSTVQGVEVGPAPQGEAFSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGLSAPHEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRLLSTAGTPENGSEPESRSPDNRTQLAPACMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYALWREFLFSVPAGPPAQYSVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPHLPAQLEGTWACPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSCSFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVATFVPGCPWETNDTLFSVVALPWLSEGEHVVDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSDMVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPNATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLGLELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGLQVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQDAVQYVALQSGPCFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPGVDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLGAVHALTTKVHFECTGWHDAEDAGAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFRPHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATGLTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQIAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLASSDVRAPQPSELGAESPSRMVASQAYNLTSALMRILMRSRVLNEEPLTLAGEEIVAQGKRSDPRSLLCYGGAPGPGCHFSIPEAFSGALANLSDVVQLIFLVDSNPFPFGYISNYTVSTKVASMAFQTQAGAQIPIERLASERAITVKVPNNSDWAARGHRSSANSANSVVVQPQASVGAVVTLDSSNPAAGLHLQLNYTLLDGHYLSEEPEPYLAVYLHSEPRPNEHNCSASRRIRPESLQGADHRPYTFFISPGSRDPAGSYHLNLSSHFRWSALQVSVGLYTSLCQYFSEEDMVWRTEGLLPLEETSPRQAVCLTRHLTAFGASLFVPPSHVRFVFPEPTADVNYIVMLTCAVCLVTYMVMAAILHKLDQLDASRGRAIPFCGQRGRFKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQHVIVRDLQTARSAFFLVNDWLSVETEANGGLVEKEVLAASDAALLRFRRLLVAELQRGFFDKHIWLSIWDRPPRSRFTRIQRATCCVLLICLFLGANAVWYGAVGDSAYSTGHVSRLSPLSVDTVAVGLVSSVVVYPVYLAILFLFRMSRSKVAGSPSPTPAGQQVLDIDSCLDSSVLDSSFLTFSGLHAEQAFVGQMKSDLFLDDSKSLVCWPSGEGTLSWPDLLSDPSIVGSNLRQLARGQAGHGLGPEEDGFSLASPYSPAKSFSASDEDLIQQVLAEGVSSPAPTQDTHMETDLLSSLSSTPGEKTETLALQRLGELGPPSPGLNWEQPQAARLSRTGLVEGLRKRLLPAWCASLAHGLSLLLVAVAVAVSGWVGASFPPGVSVAWLLSSSASFLASFLGWEPLKVLLEALYFSLVAKRLHPDEDDTLVESPAVTPVSARVPRVRPPHGFALFLAKEEARKVKRLHGMLRSLLVYMLFLLVTLLASYGDASCHGHAYRLQSAIKQELHSRAFLAITRSEELWPWMAHVLLPYVHGNQSSPELGPPRLRQVRLQEALYPDPPGPRVHTCSAAGGFSTSDYDVGWESPHNGSGTWAYSAPDLLGAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFLELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALRRLSAGLSLPLLTSVCLLLFAVHFAVAEARTWHREGRWRVLRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLAILLVSSCVDSLWSVAQALLVLCPGTGLSTLCPAESWHLSPLLCVGLWALRLWGALRLGAVILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVKEFRHKVRFEGMEPLPSRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFEALLTQFDRLNQATEDVYQLEQQLHSLQGRRSSRAPAGSSRGPSPGLRPALPSRLARASRGVDLATGPSRTPLRAKNKVHPSST,mutated_sequence,1.0,4303.0,NP_001009944.3.a2m,NP_001009944.3.npy,ClinVar
+NP_001012632.1,NP_001012632.1.csv,MSGVRPPIMNGPLHPRPLVALLDGRDCTVEMPILKDVATVAFCDAQSTQEIHEKVLNEAVGALMYHTITLTREDLEKFKALRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASVEETADSTLCHILNLYRRATWLHQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAVALRAKAFGFNVLFYDPYLSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLINDFTVKQMRQGAFLVNTARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFSQGPLKDAPNLICTPHAAWYSEQASIEMREEAAREIRRAITGRIPDSLKNCVNKDHLTAATHWASMDPAVVHPELNGAAYRYPPGVVGVAPTGIPAAVEGIVPSAMSLSHGLPPVAHPPHAPSPGQTVKPEADRDHASDQL,mutated_sequence,1.0,429.0,NP_001012632.1.a2m,NP_001012632.1.npy,ClinVar
+NP_001013.1,NP_001013.1.csv,MPGVTVKDVNQQEFVRALAAFLKKSGKLKVPEWVDTVKLAKHKELAPYDENWFYTRAASTARHLYLRGGAGVGSMTKIYGGRQRNGVMPSHFSRGSKSVARRVLQALEGLKMVEKDQDGGRKLTPQGQRDLDRIAGQVAAANKKH,mutated_sequence,1.0,145.0,NP_001013.1.a2m,NP_001013.1.npy,ClinVar
+NP_001013861.1,NP_001013861.1.csv,MIPPQEASARRREIEDKLKQEEETLSFIRDSLEKSDQLTKNMVSILSSFESRLMKLENSIIPVHKQTENLQRLQENVEKTLSCLDHVISYYHVASDTEKIIREGPTGRLEEYLGSMAKIQKAVEYFQDNSPDSPELNKVKLLFERGKEALESEFRSLMTRHSKVVSPVLILDLISGDDDLEAQEDVTLEHLPESVLQDVIRISRWLVEYGRNQDFMNVYYQIRSSQLDRSIKGLKEHFHKSSSSSGVPYSPAIPNKRKDTPTKKPVKRPGTIRKAQNLLKQYSQHGLDGKKGGSNLIPLEGRDDMLDVETDAYIHCVSAFVKLAQSEYQLLADIIPEHHQKKTFDSLIQDALDGLMLEGENIVSAARKAIVRHDFSTVLTVFPILRHLKQTKPEFDQVLQGTAASTKNKLPGLITSMETIGAKALEDFADNIKNDPDKEYNMPKDGTVHELTSNAILFLQQLLDFQETAGAMLASQETSSSATSYSSEFSKRLLSTYICKVLGNLQLNLLSKSKVYEDPALSAIFLHNNYNYILKSLEKSELIQLVAVTQKTAERSYREHIEQQIQTYQRSWLKVTDYIAEKNLPVFQPGVKLRDKERQIIKERFKGFNDGLEELCKIQKAWAIPDTEQRDRIRQAQKTIVKETYGAFLQKFGSVPFTKNPEKYIKYGVEQVGDMIDRLFDTSA,mutated_sequence,1.0,684.0,NP_001013861.1.a2m,NP_001013861.1.npy,ClinVar
+NP_001015877.1,NP_001015877.1.csv,MSSSVEQKKGPTRQRKCGFCKSNRDKECGQLLISENQKVAAHHKCMLFSSALVSSHSDNESLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREKPSQGIYMVYCRKHKKTAHNSEADLEESFNEHELEPSSPKSKKKSRKGRPRKTNFKGLSEDTRSTSSHGTDEMESSSYRDRSPHRSSPSDTRPKCGFCHVGEEENEARGKLHIFNAKKAAAHYKCMLFSSGTVQLTTTSRAEFGDFDIKTVLQEIKRGKRMKCTLCSQPGATIGCEIKACVKTYHYHCGVQDKAKYIENMSRGIYKLYCKNHSGNDERDEEDEERESKSRGKVEIDQQQLTQQQLNGN,mutated_sequence,1.0,365.0,NP_001015877.1.a2m,NP_001015877.1.npy,ClinVar
+NP_001015880.1,NP_001015880.1.csv,MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDSDYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAEAETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDGMALPDGVINMSIPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPHIKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVHNGHALLMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKSTIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGFMAPKAWKVLTDYYRSLEKN,mutated_sequence,1.0,619.0,NP_001015880.1.a2m,NP_001015880.1.npy,ClinVar
+NP_001018005.1,NP_001018005.1.csv,MDAIKKKMQMLKLDKENALDRAEQAEADKKAAEDRSKQLEDELVSLQKKLKGTEDELDKYSEALKDAQEKLELAEKKATDAEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIESRAQKDEEKMEIQEIQLKEAKHIAEDADRKYEEVARKLVIIESDLERAEERAELSEGKCAELEEELKTVTNNLKSLEAQAEKYSQKEDRYEEEIKVLSDKLKEAETRAEFAERSVTKLEKSIDDLEDELYAQKLKYKAISEELDHALNDMTSI,mutated_sequence,1.0,284.0,NP_001018005.1.a2m,NP_001018005.1.npy,ClinVar
+NP_001018125.1,NP_001018125.1.csv,MVSKRRLSKSEDKESLTEDASKTRKQPLSKKTKKSHIANEVEENDSIFVKLLKISGIILKTGESQNQLAVDQIAFQKKLFQTLRRHPSYPKIIEEFVSGLESYIEDEDSFRNCLLSCERLQDEEASMGASYSKSLIKLLLGIDILQPAIIKTLFEKLPEYFFENKNSDEINIPRLIVSQLKWLDRVVDGKDLTTKIMQLISIAPENLQHDIITSLPEILGDSQHADVGKELSDLLIENTSLTVPILDVLSSLRLDPNFLLKVRQLVMDKLSSIRLEDLPVIIKFILHSVTAMDTLEVISELREKLDLQHCVLPSRLQASQVKLKSKGRASSSGNQESSGQSCIILLFDVIKSAIRYEKTISEAWIKAIENTASVSEHKVFDLVMLFIIYSTNTQTKKYIDRVLRNKIRSGCIQEQLLQSTFSVHYLVLKDMCSSILSLAQSLLHSLDQSIISFGSLLYKYAFKFFDTYCQQEVVGALVTHICSGNEAEVDTALDVLLELVVLNPSAMMMNAVFVKGILDYLDNISPQQIRKLFYVLSTLAFSKQNEASSHIQDDMHLVIRKQLSSTVFKYKLIGIIGAVTMAGIMAADRSESPSLTQERANLSDEQCTQVTSLLQLVHSCSEQSPQASALYYDEFANLIQHEKLDPKALEWVGHTICNDFQDAFVVDSCVVPEGDFPFPVKALYGLEEYDTQDGIAINLLPLLFSQDFAKDGGPVTSQESGQKLVSPLCLAPYFRLLRLCVERQHNGNLEEIDGLLDCPIFLTDLEPGEKLESMSAKERSFMCSLIFLTLNWFREIVNAFCQETSPEMKGKVLTRLKHIVELQIILEKYLAVTPDYVPPLGNFDVETLDITPHTVTAISAKIRKKGKIERKQKTDGSKTSSSDTLSEEKNSECDPTPSHRGQLNKEFTGKEEKTSLLLHNSHAFFRELDIEVFSILHCGLVTKFILDTEMHTEATEVVQLGPPELLFLLEDLSQKLESMLTPPIARRVPFLKNKGSRNIGFSHLQQRSAQEIVHCVFQLLTPMCNHLENIHNYFQCLAAENHGVVDGPGVKVQEYHIMSSCYQRLLQIFHGLFAWSGFSQPENQNLLYSALHVLSSRLKQGEHSQPLEELLSQSVHYLQNFHQSIPSFQCALYLIRLLMVILEKSTASAQNKEKIASLARQFLCRVWPSGDKEKSNISNDQLHALLCIYLEHTESILKAIEEIAGVGVPELINSPKDASSSTFPTLTRHTFVVFFRVMMAELEKTVKKIEPGTAADSQQIHEEKLLYWNMAVRDFSILINLIKVFDSHPVLHVCLKYGRLFVEAFLKQCMPLLDFSFRKHREDVLSLLETFQLDTRLLHHLCGHSKIHQDTRLTQHVPLLKKTLELLVCRVKAMLTLNNCREAFWLGNLKNRDLQGEEIKSQNSQESTADESEDDMSSQASKSKATEDGEEDEVSAGEKEQDSDESYDDSD,mutated_sequence,1.0,1451.0,NP_001018125.1.a2m,NP_001018125.1.npy,ClinVar
+NP_001019801.3,NP_001019801.3.csv,MASNSLFSTVTPCQQNFFWDPSTSRRFSPPSSSLQPGKMSDVSPVVAAQQQQQQQQQQQQQQQQQQQQQQQEAAAAAAAAAAAAAAAAAVPRLRPPHDNRTMVEIIADHPAELVRTDSPNFLCSVLPSHWRCNKTLPVAFKVVALGEVPDGTVVTVMAGNDENYSAELRNASAVMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGPREPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPSLNSAPSPFNPQGQSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLPAITDVPRRISDDDTATSDFCLWPSTLSKKSQAGASELGPFSDPRQFPSISSLTESRFSNPRMHYPATFTYTPPVTSGMSLGMSATTHYHTYLPPPYPGSSQSQSGPFQTSSTPYLYYGTSSGSYQFPMVPGGDRSPSRMLPPCTTTSNGSTLLNPNLPNQNDGVDADGSHSSSPTVLNSSGRMDESVWRPY,mutated_sequence,1.0,521.0,NP_001019801.3.a2m,NP_001019801.3.npy,ClinVar
+NP_001025067.1,NP_001025067.1.csv,MTQQPLRGVTSLRFNQDQSCFCCAMETGVRIYNVEPLMEKGHLDHEQVGSMGLVEMLHRSNLLALVGGGSSPKFSEISVLIWDDAREGKDSKEKLVLEFTFTKPVLSVRMRHDKIVIVLKNRIYVYSFPDNPRKLFEFDTRDNPKGLCDLCPSLEKQLLVFPGHKCGSLQLVDLASTKPGTSSAPFTINAHQSDIACVSLNQPGTVVASASQKGTLIRLFDTQSKEKLVELRRGTDPATLYCINFSHDSSFLCASSDKGTVHIFALKDTRLNRRSALARVGKVGPMIGQYVDSQWSLASFTVPAESACICAFGRNTSKNVNSVIAICVDGTFHKYVFTPDGNCNREAFDVYLDICDDDDF,mutated_sequence,1.0,360.0,NP_001025067.1.a2m,NP_001025067.1.npy,ClinVar
+NP_001026.2,NP_001026.2.csv,MADGGEGEDEIQFLRTDDEVVLQCTATIHKEQQKLCLAAEGFGNRLCFLESTSNSKNVPPDLSICTFVLEQSLSVRALQEMLANTVEKSEGQVDVEKWKFMMKTAQGGGHRTLLYGHAILLRHSYSGMYLCCLSTSRSSTDKLAFDVGLQEDTTGEACWWTIHPASKQRSEGEKVRVGDDLILVSVSSERYLHLSYGNGSLHVDAAFQQTLWSVAPISSGSEAAQGYLIGGDVLRLLHGHMDECLTVPSGEHGEEQRRTVHYEGGAVSVHARSLWRLETLRVAWSGSHIRWGQPFRLRHVTTGKYLSLMEDKNLLLMDKEKADVKSTAFTFRSSKEKLDVGVRKEVDGMGTSEIKYGDSVCYIQHVDTGLWLTYQSVDVKSVRMGSIQRKAIMHHEGHMDDGISLSRSQHEESRTARVIRSTVFLFNRFIRGLDALSKKAKASTVDLPIESVSLSLQDLIGYFHPPDEHLEHEDKQNRLRALKNRQNLFQEEGMINLVLECIDRLHVYSSAAHFADVAGREAGESWKSILNSLYELLAALIRGNRKNCAQFSGSLDWLISRLERLEASSGILEVLHCVLVESPEALNIIKEGHIKSIISLLDKHGRNHKVLDVLCSLCVCHGVAVRSNQHLICDNLLPGRDLLLQTRLVNHVSSMRPNIFLGVSEGSAQYKKWYYELMVDHTEPFVTAEATHLRVGWASTEGYSPYPGGGEEWGGNGVGDDLFSYGFDGLHLWSGCIARTVSSPNQHLLRTDDVISCCLDLSAPSISFRINGQPVQGMFENFNIDGLFFPVVSFSAGIKVRFLLGGRHGEFKFLPPPGYAPCYEAVLPKEKLKVEHSREYKQERTYTRDLLGPTVSLTQAAFTPIPVDTSQIVLPPHLERIREKLAENIHELWVMNKIELGWQYGPVRDDNKRQHPCLVEFSKLPEQERNYNLQMSLETLKTLLALGCHVGISDEHAEDKVKKMKLPKNYQLTSGYKPAPMDLSFIKLTPSQEAMVDKLAENAHNVWARDRIRQGWTYGIQQDVKNRRNPRLVPYTLLDDRTKKSNKDSLREAVRTLLGYGYNLEAPDQDHAARAEVCSGTGERFRIFRAEKTYAVKAGRWYFEFETVTAGDMRVGWSRPGCQPDQELGSDERAFAFDGFKAQRWHQGNEHYGRSWQAGDVVGCMVDMNEHTMMFTLNGEILLDDSGSELAFKDFDVGDGFIPVCSLGVAQVGRMNFGKDVSTLKYFTICGLQEGYEPFAVNTNRDITMWLSKRLPQFLQVPSNHEHIEVTRIDGTIDSSPCLKVTQKSFGSQNSNTDIMFYRLSMPIECAEVFSKTVAGGLPGAGLFGPKNDLEDYDADSDFEVLMKTAHGHLVPDRVDKDKEATKPEFNNHKDYAQEKPSRLKQRFLLRRTKPDYSTSHSARLTEDVLADDRDDYDFLMQTSTYYYSVRIFPGQEPANVWVGWITSDFHQYDTGFDLDRVRTVTVTLGDEKGKVHESIKRSNCYMVCAGESMSPGQGRNNNGLEIGCVVDAASGLLTFIANGKELSTYYQVEPSTKLFPAVFAQATSPNVFQFELGRIKNVMPLSAGLFKSEHKNPVPQCPPRLHVQFLSHVLWSRMPNQFLKVDVSRISERQGWLVQCLDPLQFMSLHIPEENRSVDILELTEQEELLKFHYHTLRLYSAVCALGNHRVAHALCSHVDEPQLLYAIENKYMPGLLRAGYYDLLIDIHLSSYATARLMMNNEYIVPMTEETKSITLFPDENKKHGLPGIGLSTSLRPRMQFSSPSFVSISNECYQYSPEFPLDILKSKTIQMLTEAVKEGSLHARDPVGGTTEFLFVPLIKLFYTLLIMGIFHNEDLKHILQLIEPSVFKEAATPEEESDTLEKELSVDDAKLQGAGEEEAKGGKRPKEGLLQMKLPEPVKLQMCLLLQYLCDCQVRHRIEAIVAFSDDFVAKLQDNQRFRYNEVMQALNMSAALTARKTKEFRSPPQEQINMLLNFKDDKSECPCPEEIRDQLLDFHEDLMTHCGIELDEDGSLDGNSDLTIRGRLLSLVEKVTYLKKKQAEKPVESDSKKSSTLQQLISETMVRWAQESVIEDPELVRAMFVLLHRQYDGIGGLVRALPKTYTINGVSVEDTINLLASLGQIRSLLSVRMGKEEEKLMIRGLGDIMNNKVFYQHPNLMRALGMHETVMEVMVNVLGGGESKEITFPKMVANCCRFLCYFCRISRQNQKAMFDHLSYLLENSSVGLASPAMRGSTPLDVAAASVMDNNELALALREPDLEKVVRYLAGCGLQSCQMLVSKGYPDIGWNPVEGERYLDFLRFAVFCNGESVEENANVVVRLLIRRPECFGPALRGEGGNGLLAAMEEAIKIAEDPSRDGPSPNSGSSKTLDTEEEEDDTIHMGNAIMTFYSALIDLLGRCAPEMHLIHAGKGEAIRIRSILRSLIPLGDLVGVISIAFQMPTIAKDGNVVEPDMSAGFCPDHKAAMVLFLDRVYGIEVQDFLLHLLEVGFLPDLRAAASLDTAALSATDMALALNRYLCTAVLPLLTRCAPLFAGTEHHASLIDSLLHTVYRLSKGCSLTKAQRDSIEVCLLSICGQLRPSMMQHLLRRLVFDVPLLNEHAKMPLKLLTNHYERCWKYYCLPGGWGNFGAASEEELHLSRKLFWGIFDALSQKKYEQELFKLALPCLSAVAGALPPDYMESNYVSMMEKQSSMDSEGNFNPQPVDTSNITIPEKLEYFINKYAEHSHDKWSMDKLANGWIYGEIYSDSSKVQPLMKPYKLLSEKEKEIYRWPIKESLKTMLAWGWRIERTREGDSMALYNRTRRISQTSQVSVDAAHGYSPRAIDMSNVTLSRDLHAMAEMMAENYHNIWAKKKKMELESKGGGNHPLLVPYDTLTAKEKAKDREKAQDILKFLQINGYAVSRGFKDLELDTPSIEKRFAYSFLQQLIRYVDEAHQYILEFDGGSRGKGEHFPYEQEIKFFAKVVLPLIDQYFKNHRLYFLSAASRPLCSGGHASNKEKEMVTSLFCKLGVLVRHRISLFGNDATSIVNCLHILGQTLDARTVMKTGLESVKSALRAFLDNAAEDLEKTMENLKQGQFTHTRNQPKGVTQIINYTTVALLPMLSSLFEHIGQHQFGEDLILEDVQVSCYRILTSLYALGTSKSIYVERQRSALGECLAAFAGAFPVAFLETHLDKHNIYSIYNTKSSRERAALSLPTNVEDVCPNIPSLEKLMEEIVELAESGIRYTQMPHVMEVILPMLCSYMSRWWEHGPENNPERAEMCCTALNSEHMNTLLGNILKIIYNNLGIDEGAWMKRLAVFSQPIINKVKPQLLKTHFLPLMEKLKKKAATVVSEEDHLKAEARGDMSEAELLILDEFTTLARDLYAFYPLLIRFVDYNRAKWLKEPNPEAEELFRMVAEVFIYWSKSHNFKREEQNFVVQNEINNMSFLITDTKSKMSKAAVSDQERKKMKRKGDRYSMQTSLIVAALKRLLPIGLNICAPGDQELIALAKNRFSLKDTEDEVRDIIRSNIHLQGKLEDPAIRWQMALYKDLPNRTDDTSDPEKTVERVLDIANVLFHLEQKSKRVGRRHYCLVEHPQRSKKAVWHKLLSKQRKRAVVACFRMAPLYNLPRHRAVNLFLQGYEKSWIETEEHYFEDKLIEDLAKPGAEPPEEDEGTKRVDPLHQLILLFSRTALTEKCKLEEDFLYMAYADIMAKSCHDEEDDDGEEEVKSFEEKEMEKQKLLYQQARLHDRGAAEMVLQTISASKGETGPMVAATLKLGIAILNGGNSTVQQKMLDYLKEKKDVGFFQSLAGLMQSCSVLDLNAFERQNKAEGLGMVTEEGSGEKVLQDDEFTCDLFRFLQLLCEGHNSDFQNYLRTQTGNNTTVNIIISTVDYLLRVQESISDFYWYYSGKDVIDEQGQRNFSKAIQVAKQVFNTLTEYIQGPCTGNQQSLAHSRLWDAVVGFLHVFAHMQMKLSQDSSQIELLKELMDLQKDMVVMLLSMLEGNVVNGTIGKQMVDMLVESSNNVEMILKFFDMFLKLKDLTSSDTFKEYDPDGKGVISKRDFHKAMESHKHYTQSETEFLLSCAETDENETLDYEEFVKRFHEPAKDIGFNVAVLLTNLSEHMPNDTRLQTFLELAESVLNYFQPFLGRIEIMGSAKRIERVYFEISESSRTQWEKPQVKESKRQFIFDVVNEGGEKEKMELFVNFCEDTIFEMQLAAQISESDLNERSANKEESEKERPEEQGPRMAFFSILTVRSALFALRYNILTLMRMLSLKSLKKQMKKVKKMTVKDMVTAFFSSYWSIFMTLLHFVASVFRGFFRIICSLLLGGSLVEGAKKIKVAELLANMPDPTQDEVRGDGEEGERKPLEAALPSEDLTDLKELTEESDLLSDIFGLDLKREGGQYKLIPHNPNAGLSDLMSNPVPMPEVQEKFQEQKAKEEEKEEKEETKSEPEKAEGEDGEKEEKAKEDKGKQKLRQLHTHRYGEPEVPESAFWKKIIAYQQKLLNYFARNFYNMRMLALFVAFAINFILLFYKVSTSSVVEGKELPTRSSSENAKVTSLDSSSHRIIAVHYVLEESSGYMEPTLRILAILHTVISFFCIIGYYCLKVPLVIFKREKEVARKLEFDGLYITEQPSEDDIKGQWDRLVINTQSFPNNYWDKFVKRKVMDKYGEFYGRDRISELLGMDKAALDFSDAREKKKPKKDSSLSAVLNSIDVKYQMWKLGVVFTDNSFLYLAWYMTMSVLGHYNNFFFAAHLLDIAMGFKTLRTILSSVTHNGKQLVLTVGLLAVVVYLYTVVAFNFFRKFYNKSEDGDTPDMKCDDMLTCYMFHMYVGVRAGGGIGDEIEDPAGDEYEIYRIIFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGNDYFDTVPHGFETHTLQEHNLANYLFFLMYLINKDETEHTGQESYVWKMYQERCWEFFPAGDCFRKQYEDQLN,mutated_sequence,1.0,4967.0,NP_001026.2.a2m,NP_001026.2.npy,ClinVar
+NP_001026895.2,NP_001026895.2.csv,MFVPRSLKIKRNANDDGKSCVAKIIKPDPEDLQLDKSRDVPVDAVATEAATIDRHISESCPFPSPGGQLAEVHSVSPEQGAKDSHPSEEPVKSFSKTQRWAEPGEPICVVCGRYGEYICDKTDEDVCSLECKAKHLLQVKEKEEKSKLSNPQKADSEPESPLNASYVYKEHPFILNLQEDQIENLKQQLGILVQGQEVTRPIIDFEHCSLPEVLNHNLKKSGYEVPTPIQMQMIPVGLLGRDILASADTGSGKTAAFLLPVIMRALFESKTPSALILTPTRELAIQIERQAKELMSGLPRMKTVLLVGGLPLPPQLYRLQQHVKVIIATPGRLLDIIKQSSVELCGVKIVVVDEADTMLKMGFQQQVLDILENIPNDCQTILVSATIPTSIEQLASQLLHNPVRIITGEKNLPCANVRQIILWVEDPAKKKKLFEILNDKKLFKPPVLVFVDCKLGADLLSEAVQKITGLKSISIHSEKSQIERKNILKGLLEGDYEVVVSTGVLGRGLDLISVRLVVNFDMPSSMDEYVHQIGRVGRLGQNGTAITFINNNSKRLFWDIAKRVKPTGSILPPQLLNSPYLHDQKRKEQQKDKQTQNDLVTGANLMDIIRKHDKSNSQK,mutated_sequence,1.0,619.0,NP_001026895.2.a2m,NP_001026895.2.npy,ClinVar
+NP_001027392.1,NP_001027392.1.csv,MAPIGLKAVVGEKIMHDVIKKVKKKGEWKVLVVDQLSMRMLSSCCKMTDIMTEGITIVEDINKRREPLPSLEAVYLITPSEKSVHSLISDFKDPPTAKYRAAHVFFTDSCPDALFNELVKSRAAKVIKTLTEINIAFLPYESQVYSLDSADSFQSFYSPHKAQMKNPILERLAEQIATLCATLKEYPAVRYRGEYKDNALLAQLIQDKLDAYKADDPTMGEGPDKARSQLLILDRGFDPSSPVLHELTFQAMSYDLLPIENDVYKYETSGIGEARVKEVLLDEDDDLWIALRHKHIAEVSQEVTRSLKDFSSSKRMNTGEKTTMRDLSQMLKKMPQYQKELSKYSTHLHLAEDCMKHYQGTVDKLCRVEQDLAMGTDAEGEKIKDPMRAIVPILLDANVSTYDKIRIILLYIFLKNGITEENLNKLIQHAQIPPEDSEIITNMAHLGVPIVTDSTLRRRSKPERKERISEQTYQLSRWTPIIKDIMEDTIEDKLDTKHYPYISTRSSASFSTTAVSARYGHWHKNKAPGEYRSGPRLIIFILGGVSLNEMRCAYEVTQANGKWEVLIGSTHILTPQKLLDTLKKLNKTDEEISS,mutated_sequence,1.0,594.0,NP_001027392.1.a2m,NP_001027392.1.npy,ClinVar
+NP_001027554.1,NP_001027554.1.csv,MPLPVALQTRLAKRGILKHLEPEPEEEIIAEDYDDDPVDYEATRLEGLPPSWYKVFDPSCGLPYYWNADTDLVSWLSPHDPNSVVTKSAKKLRSSNADAEEKLDRSHDKSDRGHDKSDRSHEKLDRGHDKSDRGHDKSDRDRERGYDKVDRERERDRERDRDRGYDKADREEGKERRHHRREELAPYPKSKKAVSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD,mutated_sequence,1.0,265.0,NP_001027554.1.a2m,NP_001027554.1.npy,ClinVar
+NP_001034680.2,NP_001034680.2.csv,MTATTRGSPVGGNDNQGQAPDGQSQPPLQQNQTSSPDSSNENSPATPPDEQGQGDAPPQLEDEEPAFPHTDLAKLDDMINRPRWVVPVLPKGELEVLLEAAIDLSKKGLDVKSEACQRFFRDGLTISFTKILTDEAVSGWKFEIHRCIINNTHRLVELCVAKLSQDWFPLLELLAMALNPHCKFHIYNGTRPCESVSSSVQLPEDELFARSPDPRSPKGWLVDLLNKFGTLNGFQILHDRFINGSALNVQIIAALIKPFGQCYEFLTLHTVKKYFLPIIEMVPQFLENLTDEELKKEAKNEAKNDALSMIIKSLKNLASRVPGQEETVKNLEIFRLKMILRLLQISSFNGKMNALNEVNKVISSVSYYTHRHGNPEEEEWLTAERMAEWIQQNNILSIVLRDSLHQPQYVEKLEKILRFVIKEKALTLQDLDNIWAAQAGKHEAIVKNVHDLLAKLAWDFSPEQLDHLFDCFKASWTNASKKQREKLLELIRRLAEDDKDGVMAHKVLNLLWNLAHSDDVPVDIMDLALSAHIKILDYSCSQDRDTQKIQWIDRFIEELRTNDKWVIPALKQIREICSLFGEAPQNLSQTQRSPHVFYRHDLINQLQHNHALVTLVAENLATYMESMRLYARDHEDYDPQTVRLGSRYSHVQEVQERLNFLRFLLKDGQLWLCAPQAKQIWKCLAENAVYLCDREACFKWYSKLMGDEPDLDPDINKDFFESNVLQLDPSLLTENGMKCFERFFKAVNCREGKLVAKRRAYMMDDLELIGLDYLWRVVIQSNDDIASRAIDLLKEIYTNLGPRLQVNQVVIHEDFIQSCFDRLKASYDTLCVLDGDKDSVNCARQEAVRMVRVLTVLREYINECDSDYHEERTILPMSRAFRGKHLSFVVRFPNQGRQVDDLEVWSHTNDTIGSVRRCILNRIKANVAHTKIELFVGGELIDPADDRKLIGQLNLKDKSLITAKLTQISSNMPSSPDSSSDSSTGSPGNHGNHYSDGPNPEVESCLPGVIMSLHPRYISFLWQVADLGSSLNMPPLRDGARVLMKLMPPDSTTIEKLRAICLDHAKLGESSLSPSLDSLFFGPSASQVLYLTEVVYALLMPAGAPLADDSSDFQFHFLKSGGLPLVLSMLTRNNFLPNADMETRRGAYLNALKIAKLLLTAIGYGHVRAVAEACQPGVEGVNPMTQINQVTHDQAVVLQSALQSIPNPSSECMLRNVSVRLAQQISDEASRYMPDICVIRAIQKIIWASGCGSLQLVFSPNEEITKIYEKTNAGNEPDLEDEQVCCEALEVMTLCFALIPTALDALSKEKAWQTFIIDLLLHCHSKTVRQVAQEQFFLMCTRCCMGHRPLLFFITLLFTVLGSTARERAKHSGDYFTLLRHLLNYAYNSNINVPNAEVLLNNEIDWLKRIRDDVKRTGETGIEETILEGHLGVTKELLAFQTSEKKFHIGCEKGGANLIKELIDDFIFPASNVYLQYMRNGELPAEQAIPVCGSPPTINAGFELLVALAVGCVRNLKQIVDSLTEMYYIGTAITTCEALTEWEYLPPVGPRPPKGFVGLKNAGATCYMNSVIQQLYMIPSIRNGILAIEGTGSDVDDDMSGDEKQDNESNVDPRDDVFGYPQQFEDKPALSKTEDRKEYNIGVLRHLQVIFGHLAASRLQYYVPRGFWKQFRLWGEPVNLREQHDALEFFNSLVDSLDEALKALGHPAMLSKVLGGSFADQKICQGCPHRYECEESFTTLNVDIRNHQNLLDSLEQYVKGDLLEGANAYHCEKCNKKVDTVKRLLIKKLPPVLAIQLKRFDYDWERECAIKFNDYFEFPRELDMEPYTVAGVAKLEGDNVNPESQLIQQSEQSESETAGSTKYRLVGVLVHSGQASGGHYYSYIIQRNGGDGERNRWYKFDDGDVTECKMDDDEEMKNQCFGGEYMGEVFDHMMKRMSYRRQKRWWNAYILFYERMDTIDQDDELIRYISELAITTRPHQIIMPSAIERSVRKQNVQFMHNRMQYSMEYFQFMKKLLTCNGVYLNPPPGQDHLLPEAEEITMISIQLAARFLFTTGFHTKKVVRGSASDWYDALCILLRHSKNVRFWFAHNVLFNVSNRFSEYLLECPSAEVRGAFAKLIVFIAHFSLQDGPCPSPFASPGPSSQAYDNLSLSDHLLRAVLNLLRREVSEHGRHLQQYFNLFVMYANLGVAEKTQLLKLSVPATFMLVSLDEGPGPPIKYQYAELGKLYSVVSQLIRCCNVSSRMQSSINGNPPLPNPFGDPNLSQPIMPIQQNVADILFVRTSYVKKIIEDCSNSEETVKLLRFCCWENPQFSSTVLSELLWQVAYSYTYELRPYLDLLLQILLIEDSWQTHRIHNALKGIPDDRDGLFDTIQRSKNHYQKRAYQCIKCMVALFSNCPVAYQILQGNGDLKRKWTWAVEWLGDELERRPYTGNPQYTYNNWSPPVQSNETSNGYFLERSHSARMTLAKACELCPEEEPDDQDAPDEHESPPPEDAPLYPHSPGSQYQQNNHVHGQPYTGPAAHHMNNPQRTGQRAQENYEGSEEVSPPQTKDQ,mutated_sequence,1.0,2554.0,NP_001034680.2.a2m,NP_001034680.2.npy,ClinVar
+NP_001035232.1,NP_001035232.1.csv,MAQSVLVPPGPDSFRFFTRESLAAIEQRIAEEKAKRPKQERKDEDDENGPKPNSDLEAGKSLPFIYGDIPPEMVSVPLEDLDPYYINKKTFIVLNKGKAISRFSATPALYILTPFNPIRKLAIKILVHSLFNMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPDNSSFEINITSFFNNSLDGNGTTFNRTVSIFNWDEYIEDKSHFYFLEGQNDALLCGNSSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAAAAAASAESRDFSGAGGIGVFSESSSVASKLSSKSEKELKNRRKKKKQKEQSGEEEKNDRVRKSESEDSIRRKGFRFSLEGSRLTYEKRFSSPHQSLLSIRGSLFSPRRNSRASLFSFRGRAKDIGSENDFADDEHSTFEDNDSRRDSLFVPHRHGERRHSNVSQASRASRVLPILPMNGKMHSAVDCNGVVSLVGGPSTLTSAGQLLPEGTTTETEIRKRRSSSYHVSMDLLEDPTSRQRAMSIASILTNTMEELEESRQKCPPCWYKFANMCLIWDCCKPWLKVKHLVNLVVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLSVGNLVFTGIFTAEMFLKIIAMDPYYYFQEGWNIFDGFIVSLSLMELGLANVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKISNDCELPRWHMHDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLTVFMMVMVIGNLVVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDFVKRKIREFIQKAFVRKQKALDEIKPLEDLNNKKDSCISNHTTIEIGKDLNYLKDGNGTTSGIGSSVEKYVVDESDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESDMEESKEKLNATSSSEGSTVDIGAPAEGEQPEVEPEESLEPEACFTEDCVRKFKCCQISIEEGKGKLWWNLRKTCYKIVEHNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQVYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCINYTTGEMFDVSVVNNYSECKALIESNQTARWKNVKVNFDNVGLGYLSLLQVATFKGWMDIMYAAVDSRNVELQPKYEDNLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPANKFQGMVFDFVTKQVFDISIMILICLNMVTMMVETDDQSQEMTNILYWINLVFIVLFTGECVLKLISLRYYYFTIGWNIFDFVVVILSIVGMFLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSGPPDCDPDKDHPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFAKLSDFADALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYEPITTTLKRKQEEVSAIIIQRAYRRYLLKQKVKKVSSIYKKDKGKECDGTPIKEDTLIDKLNENSTPEKTDMTPSTTSPPSYDSVTKPEKEKFEKDKSEKEDKGKDIRESKK,mutated_sequence,1.0,2005.0,NP_001035232.1.a2m,NP_001035232.1.npy,ClinVar
+NP_001035526.1,NP_001035526.1.csv,MAAPILRSFSWGRWSGTLNLSVLLPLGLRKAHSGAQGLLAAQKARGLFKDFFPETGTKIELPELFDRGTASFPQTIYCGFDPTADSLHVGHLLALLGLFHLQRAGHNVIALVGGATARLGDPSGRTKEREALETERVRANARALRLGLEALAANHQQLFTDGRSWGSFTVLDNSAWYQKQHLVDFLAAVGGHFRMGTLLSRQSVQLRLKSPEGMSLAEFFYQVLQAYDFYYLFQRYGCRVQLGGSDQLGNIMSGYEFINKLTGEDVFGITVPLITSTTGAKLGKSAGNAVWLNRDKTSPFELYQFFVRQPDDSVERYLKLFTFLPLPEIDHIMQLHVKEPERRGPQKRLAAEVTKLVHGREGLDSAKRCTQALYHSSIDALEVMSDQELKELFKEAPFSEFFLDPGTSVLDTCRKANAIPDGPRGYRMITEGGVSINHQQVTNPESVLIVGQHILKNGLSLLKIGKRNFYIIKWLQL,mutated_sequence,1.0,477.0,NP_001035526.1.a2m,NP_001035526.1.npy,ClinVar
+NP_001035806.1,NP_001035806.1.csv,MLKFRTVHGGLRLLGIRRTSTAPAASPNVRRLEYKPIKKVMVANRGEIAIRVFRACTELGIRTVAIYSEQDTGQMHRQKADEAYLIGRGLAPVQAYLHIPDIIKVAKENNVDAVHPGYGFLSERADFAQACQDAGVRFIGPSPEVVRKMGDKVEARAIAIAAGVPVVPGTDAPITSLHEAHEFSNTYGFPIIFKAAYGGGGRGMRVVHSYEELEENYTRAYSEALAAFGNGALFVEKFIEKPRHIEVQILGDQYGNILHLYERDCSIQRRHQKVVEIAPAAHLDPQLRTRLTSDSVKLAKQVGYENAGTVEFLVDRHGKHYFIEVNSRLQVEHTVTEEITDVDLVHAQIHVAEGRSLPDLGLRQENIRINGCAIQCRVTTEDPARSFQPDTGRIEVFRSGEGMGIRLDNASAFQGAVISPHYDSLLVKVIAHGKDHPTAATKMSRALAEFRVRGVKTNIAFLQNVLNNQQFLAGTVDTQFIDENPELFQLRPAQNRAQKLLHYLGHVMVNGPTTPIPVKASPSPTDPVVPAVPIGPPPAGFRDILLREGPEGFARAVRNHPGLLLMDTTFRDAHQSLLATRVRTHDLKKIAPYVAHNFSKLFSMENWGGATFDVAMRFLYECPWRRLQELRELIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEVAKENGMDVFRVFDSLNYLPNMLLGMEAAGSAGGVVEAAISYTGDVADPSRTKYSLQYYMGLAEELVRAGTHILCIKDMAGLLKPTACTMLVSSLRDRFPDLPLHIHTHDTSGAGVAAMLACAQAGADVVDVAADSMSGMTSQPSMGALVACTRGTPLDTEVPMERVFDYSEYWEGARGLYAAFDCTATMKSGNSDVYENEIPGGQYTNLHFQAHSMGLGSKFKEVKKAYVEANQMLGDLIKVTPSSKIVGDLAQFMVQNGLSRAEAEAQAEELSFPRSVVEFLQGYIGVPHGGFPEPFRSKVLKDLPRVEGRPGASLPPLDLQALEKELVDRHGEEVTPEDVLSAAMYPDVFAHFKDFTATFGPLDSLNTRLFLQGPKIAEEFEVELERGKTLHIKALAVSDLNRAGQRQVFFELNGQLRSILVKDTQAMKEMHFHPKALKDVKGQIGAPMPGKVIDIKVVAGAKVAKGQPLCVLSAMKMETVVTSPMEGTVRKVHVTKDMTLEGDDLILEIE,mutated_sequence,1.0,1178.0,NP_001035806.1.a2m,NP_001035806.1.npy,ClinVar
+NP_001035937.1,NP_001035937.1.csv,MRKRTEPVALEHERCAAAGSSSSGSAAAALDADCRLKQNLRLTGPAAAEPRCAADAGMKRALGRRKGVWLRLRKILFCVLGLYIAIPFLIKLCPGIQAKLIFLNFVRVPYFIDLKKPQDQGLNHTCNYYLQPEEDVTIGVWHTVPAVWWKNAQGKDQMWYEDALASSHPIILYLHGNAGTRGGDHRVELYKVLSSLGYHVVTFDYRGWGDSVGTPSERGMTYDALHVFDWIKARSGDNPVYIWGHSLGTGVATNLVRRLCERETPPDALILESPFTNIREEAKSHPFSVIYRYFPGFDWFFLDPITSSGIKFANDENVKHISCPLLILHAEDDPVVPFQLGRKLYSIAAPARSFRDFKVQFVPFHSDLGYRHKYIYKSPELPRILREFLGKSEPEHQH,mutated_sequence,1.0,398.0,NP_001035937.1.a2m,NP_001035937.1.npy,ClinVar
+NP_001035957.1,NP_001035957.1.csv,MAAHRPVEWVQAVVSRFDEQLPIKTGQQNTHTKVSTEHNKECLINISKYKFSLVISGLTTILKNVNNMRIFGEAAEKNLYLSQLIILDTLEKCLAGQPKDTMRLDETMLVKQLLPEICHFLHTCREGNQHAAELRNSASGVLFSLSCNNFNAVFSRISTRLQELTVCSEDNVDVHDIELLQYINVDCAKLKRLLKETAFKFKALKKVAQLAVINSLEKAFWNWVENYPDEFTKLYQIPQTDMAECAEKLFDLVDGFAESTKRKAAVWPLQIILLILCPEIIQDISKDVVDENNMNKKLFLDSLRKALAGHGGSRQLTESAAIACVKLCKASTYINWEDNSVIFLLVQSMVVDLKNLLFNPSKPFSRGSQPADVDLMIDCLVSCFRISPHNNQHFKICLAQNSPSTFHYVLVNSLHRIITNSALDWWPKIDAVYCHSVELRNMFGETLHKAVQGCGAHPAIRMAPSLTFKEKVTSLKFKEKPTDLETRSYKYLLLSMVKLIHADPKLLLCNPRKQGPETQGSTAELITGLVQLVPQSHMPEIAQEAMEALLVLHQLDSIDLWNPDAPVETFWEISSQMLFYICKKLTSHQMLSSTEILKWLREILICRNKFLLKNKQADRSSCHFLLFYGVGCDIPSSGNTSQMSMDHEELLRTPGASLRKGKGNSSMDSAAGCSGTPPICRQAQTKLEVALYMFLWNPDTEAVLVAMSCFRHLCEEADIRCGVDEVSVHNLLPNYNTFMEFASVSNMMSTGRAALQKRVMALLRRIEHPTAGNTEAWEDTHAKWEQATKLILNYPKAKMEDGQAAESLHKTIVKRRMSHVSGGGSIDLSDTDSLQEWINMTGFLCALGGVCLQQRSNSGLATYSPPMGPVSERKGSMISVMSSEGNADTPVSKFMDRLLSLMVCNHEKVGLQIRTNVKDLVGLELSPALYPMLFNKLKNTISKFFDSQGQVLLTDTNTQFVEQTIAIMKNLLDNHTEGSSEHLGQASIETMMLNLVRYVRVLGNMVHAIQIKTKLCQLVEVMMARRDDLSFCQEMKFRNKMVEYLTDWVMGTSNQAADDDVKCLTRDLDQASMEAVVSLLAGLPLQPEEGDGVELMEAKSQLFLKYFTLFMNLLNDCSEVEDESAQTGGRKRGMSRRLASLRHCTVLAMSNLLNANVDSGLMHSIGLGYHKDLQTRATFMEVLTKILQQGTEFDTLAETVLADRFERLVELVTMMGDQGELPIAMALANVVPCSQWDELARVLVTLFDSRHLLYQLLWNMFSKEVELADSMQTLFRGNSLASKIMTFCFKVYGATYLQKLLDPLLRIVITSSDWQHVSFEVDPTRLEPSESLEENQRNLLQMTEKFFHAIISSSSEFPPQLRSVCHCLYQATCHSLLNKATVKEKKENKKSVVSQRFPQNSIGAVGSAMFLRFINPAIVSPYEAGILDKKPPPRIERGLKLMSKILQSIANHVLFTKEEHMRPFNDFVKSNFDAARRFFLDIASDCPTSDAVNHSLSFISDGNVLALHRLLWNNQEKIGQYLSSNRDHKAVGRRPFDKMATLLAYLGPPEHKPVADTHWSSLNLTSSKFEEFMTRHQVHEKEEFKALKTLSIFYQAGTSKAGNPIFYYVARRFKTGQINGDLLIYHVLLTLKPYYAKPYEIVVDLTHTGPSNRFKTDFLSKWFVVFPGFAYDNVSAVYIYNCNSWVREYTKYHERLLTGLKGSKRLVFIDCPGKLAEHIEHEQQKLPAATLALEEDLKVFHNALKLAHKDTKVSIKVGSTAVQVTSAERTKVLGQSVFLNDIYYASEIEEICLVDENQFTLTIANQGTPLTFMHQECEAIVQSIIHIRTRWELSQPDSIPQHTKIRPKDVPGTLLNIALLNLGSSDPSLRSAAYNLLCALTCTFNLKIEGQLLETSGLCIPANNTLFIVSISKTLAANEPHLTLEFLEECISGFSKSSIELKHLCLEYMTPWLSNLVRFCKHNDDAKRQRVTAILDKLITMTINEKQMYPSIQAKIWGSLGQITDLLDVVLDSFIKTSATGGLGSIKAEVMADTAVALASGNVKLVSSKVIGRMCKIIDKTCLSPTPTLEQHLMWDDIAILARYMLMLSFNNSLDVAAHLPYLFHVVTFLVATGPLSLRASTHGLVINIIHSLCTCSQLHFSEETKQVLRLSLTEFSLPKFYLLFGISKVKSAAVIAFRSSYRDRSFSPGSYERETFALTSLETVTEALLEIMEACMRDIPTCKWLDQWTELAQRFAFQYNPSLQPRALVVFGCISKRVSHGQIKQIIRILSKALESCLKGPDTYNSQVLIEATVIALTKLQPLLNKDSPLHKALFWVAVAVLQLDEVNLYSAGTALLEQNLHTLDSLRIFNDKSPEEVFMAIRNPLEWHCKQMDHFVGLNFNSNFNFALVGHLLKGYRHPSPAIVARTVRILHTLLTLVNKHRNCDKFEVNTQSVAYLAALLTVSEEVRSRCSLKHRKSLLLTDISMENVPMDTYPIHHGDPSYRTLKETQPWSSPKGSEGYLAATYPTVGQTSPRARKSMSLDMGQPSQANTKKLLGTRKSFDHLISDTKAPKRQEMESGITTPPKMRRVAETDYEMETQRISSSQQHPHLRKVSVSESNVLLDEEVLTDPKIQALLLTVLATLVKYTTDEFDQRILYEYLAEASVVFPKVFPVVHNLLDSKINTLLSLCQDPNLLNPIHGIVQSVVYHEESPPQYQTSYLQSFGFNGLWRFAGPFSKQTQIPDYAELIVKFLDALIDTYLPGIDEETSEESLLTPTSPYPPALQSQLSITANLNLSNSMTSLATSQHSPGIDKENVELSPTTGHCNSGRTRHGSASQVQKQRSAGSFKRNSIKKIV,mutated_sequence,1.0,2839.0,NP_001035957.1.a2m,NP_001035957.1.npy,ClinVar
+NP_001036167.1,NP_001036167.1.csv,MFAAATKSFVKQVGDGGRLVPVPSLSEADKYQPLSLVVKKKRCFLFPRYKFTSTPFTLKDILLGDREISAGISSYQLLNYEDESDVSLYGRRGNHIVNDVGINVAGSDSIAVKASFGIVTKHEVEVSTLLKEITTRKINFDHSLIRQSRSSRKAVLCVVMESIRTTRQCSLSVHAGIRGEAMRFHFMDEQNPKGRDKAIVFPAHTTIAFSVFELFIYLDGAFDLCVTSVSKGGFEREETATFALLYRLRNILFERNRRVMDVISRSQLYLDDLFSDYYDKPLSMTDISLKEGTHIRVNLLNHNIPKGPCILCGMGNFKRETVYGCFQCSVDGQKYVRLHAVPCFDIWHKRMK,mutated_sequence,1.0,352.0,NP_001036167.1.a2m,NP_001036167.1.npy,ClinVar
+NP_001041639.1,NP_001041639.1.csv,MRKPRAAVGSGHRKQAASQEGRQKHAKNNSQAKPSACDGLARQPEEVVLQASVSSYHLFRDVAEVTAFRGSLLSWYDQEKRDLPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYTGWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFPSVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQTPVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQPGTCMGSKRSQVSSPCSRKKPRMGQQVLDNFFRSHISTDAHSLNSAAQ,mutated_sequence,1.0,521.0,NP_001041639.1.a2m,NP_001041639.1.npy,ClinVar
+NP_001070833.1,NP_001070833.1.csv,MWGFLKRPVVVTADINLSLVALTGMGLLSRLWRLTYPRAVVFDEVYYGQYISFYMKQIFFLDDSGPPFGHMVLALGGYLGGFDGNFLWNRIGAEYSSNVPVWSLRLLPALAGALSVPMAYQIVLELHFSHCAAMGAALLMLIENALITQSRLMLLESVLIFFNLLAVLSYLKFFNCQKHSPFSLSWWFWLTLTGVACSCAVGIKYMGVFTYVLVLGVAAVHAWHLLGDQTLSNVCVFCHLLARAVALLVIPVVLYLLFFYVHLILVFRSGPHDQIMSSAFQASLEGGLARITQGQPLEVAFGSQVTLRNVFGKPVPCWLHSHQDTYPMIYENGRGSSHQQQVTCYPFKDVNNWWIVKDPRRHQLVVSSPPRPVRHGDMVQLVHGMTTRSLNTHDVAAPLSPHSQEVSCYIDYNISMPAQNLWRLEIVNRGSDTDVWKTILSEVRFVHVNTSAVLKLSGAHLPDWGYRQLEIVGEKLSRGYHGSTVWNVEEHRYGASQEQRERERELHSPAQVDVSRNLSFMARFSELQWRMLALRSDDSEHKYSSSPLEWVTLDTNIAYWLHPRTSAQIHLLGNIVIWVSGSLALAIYALLSLWYLLRRRRNVHDLPQDAWLRWVLAGALCAGGWAVNYLPFFLMEKTLFLYHYLPALTFQILLLPVVLQHISDHLCRSQLQRSIFSALVVAWYSSACHVSNTLRPLTYGDKSLSPHELKALRWKDSWDILIRKH,mutated_sequence,1.0,725.0,NP_001070833.1.a2m,NP_001070833.1.npy,ClinVar
+NP_001071.1,NP_001071.1.csv,MATCIWLRSCGARRLGSTFPGCRLRPRAGGLVPASGPAPGPAQLRCYAGRLAGLSAALLRTDSFVGGRWLPAAATFPVQDPASGAALGMVADCGVREARAAVRAAYEAFCRWREVSAKERSSLLRKWYNLMIQNKDDLARIITAESGKPLKEAHGEILYSAFFLEWFSEEARRVYGDIIHTPAKDRRALVLKQPIGVAAVITPWNFPSAMITRKVGAALAAGCTVVVKPAEDTPFSALALAELASQAGIPSGVYNVIPCSRKNAKEVGEAICTDPLVSKISFTGSTTTGKILLHHAANSVKRVSMELGGLAPFIVFDSANVDQAVAGAMASKFRNTGQTCVCSNQFLVQRGIHDAFVKAFAEAMKKNLRVGNGFEEGTTQGPLINEKAVEKVEKQVNDAVSKGATVVTGGKRHQLGKNFFEPTLLCNVTQDMLCTHEETFGPLAPVIKFDTEEEAIAIANAADVGLAGYFYSQDPAQIWRVAEQLEVGMVGVNEGLISSVECPFGGVKQSGLGREGSKYGIDEYLELKYVCYGGL,mutated_sequence,1.0,535.0,NP_001071.1.a2m,NP_001071.1.npy,ClinVar
+NP_001073911.1,NP_001073911.1.csv,MEGQTPGSRGLPEKPHPATAAATLSSMGAVFILMKSALGAGLLNFPWAFSKAGGVVPAFLVELVSLVFLISGLVILGYAAAVSGQATYQGVVRGLCGPAIGKLCEACFLLNLLMISVAFLRVIGDQLEKLCDSLLSGTPPAPQPWYADQRFTLPLLSVLVILPLSAPREIAFQKYTSILGTLAACYLALVITVQYYLWPQGLVRESHPSLSPASWTSVFSVFPTICFGFQCHEAAVSIYCSMRKRSLSHWALVSVLSLLACCLIYSLTGVYGFLTFGTEVSADVLMSYPGNDMVIIVARVLFAVSIVTVYPIVLFLGRSVMQDFWRRSCLGGWGPSALADPSGLWVRMPLTILWVTVTLAMALFMPDLSEIVSIIGGISSFFIFIFPGLCLICAMGVEPIGPRVKCCLEVWGVVSVLVGTFIFGQSTAAAVWEMF,mutated_sequence,1.0,435.0,NP_001073911.1.a2m,NP_001073911.1.npy,ClinVar
+NP_001073936.1,NP_001073936.1.csv,MSVGELYSQCTRVWIPDPDEVWRSAELTKDYKEGDKSLQLRLEDETILEYPIDVQRNQLPFLRNPDILVGENDLTALSYLHEPAVLHNLKVRFLESNHIYTYCGIVLVAINPYEQLPIYGQDVIYTYSGQNMGDMDPHIFAVAEEAYKQMARDEKNQSIIVSGESGAGKTVSAKYAMRYFATVGGSASETNIEEKVLASSPIMEAIGNAKTTRNDNSSRFGKYIQIGFDKRYHIIGANMRTYLLEKSRVVFQADDERNYHIFYQLCAAAGLPEFKELALTSAEDFFYTSQGGDTSIEGVDDAEDFEKTRQAFTLLGVKESHQMSIFKIIASILHLGSVAIQAERDGDSCSISPQDVYLSNFCRLLGVEHSQMEHWLCHRKLVTTSETYVKTMSLQQVINARNALAKHIYAQLFGWIVEHINKALHTSLKQHSFIGVLDIYGFETFEVNSFEQFCINYANEKLQQQFNSHVFKLEQEEYMKEQIPWTLIDFYDNQPCIDLIEAKLGILDLLDEECKVPKGTDQNWAQKLYDRHSSSQHFQKPRMSNTAFIIVHFADKVEYLSDGFLEKNRDTVYEEQINILKASKFPLVADLFHDDKDPVPATTPGKGSSSKISVRSARPPMKVSNKEHKKTVGHQFRTSLHLLMETLNATTPHYVRCIKPNDEKLPFHFDPKRAVQQLRACGVLETIRISAAGYPSRWAYHDFFNRYRVLVKKRELANTDKKAICRSVLENLIKDPDKFQFGRTKIFFRAGQVAYLEKLRADKFRTATIMIQKTVRGWLQKVKYHRLKGATLTLQRYCRGHLARRLAEHLRRIRAAVVLQKHYRMQRARQAYQRVRRAAVVIQAFTRAMFVRRTYRQVLMEHKATTIQKHVRGWMARRHFQRLRDAAIVIQCAFRMLKARRELKALRIEARSAEHLKRLNVGMENKVVQLQRKIDEQNKEFKTLSEQLSVTTSTYTMEVERLKKELVHYQQSPGEDTSLRLQEEVESLRTELQRAHSERKILEDAHSREKDELRKRVADLEQENALLKDEKEQLNNQILCQSKDEFAQNSVKENLMKKELEEERSRYQNLVKEYSQLEQRYDNLRDEMTIIKQTPGHRRNPSNQSSLESDSNYPSISTSEIGDTEDALQQVEEIGLEKAAMDMTVFLKLQKRVRELEQERKKLQVQLEKREQQDSKKVQAEPPQTDIDLDPNADLAYNSLKRQELESENKKLKNDLNELRKAVADQATQNNSSHGSPDSYSLLLNQLKLAHEELEVRKEEVLILRTQIVSADQRRLAGRNAEPNINARSSWPNSEKHVDQEDAIEAYHGVCQTNSKTEDWGYLNEDGELGLAYQGLKQVARLLEAQLQAQSLEHEEEVEHLKAQLEALKEEMDKQQQTFCQTLLLSPEAQVEFGVQQEISRLTNENLDLKELVEKLEKNERKLKKQLKIYMKKAQDLEAAQALAQSERKRHELNRQVTVQRKEKDFQGMLEYHKEDEALLIRNLVTDLKPQMLSGTVPCLPAYILYMCIRHADYTNDDLKVHSLLTSTINGIKKVLKKHNDDFEMTSFWLSNTCRLLHCLKQYSGDEGFMTQNTAKQNEHCLKNFDLTEYRQVLSDLSIQIYQQLIKIAEGVLQPMIVSAMLENESIQGLSGVKPTGYRKRSSSMADGDNSYCLEAIIRQMNAFHTVMCDQGLDPEIILQVFKQLFYMINAVTLNNLLLRKDVCSWSTGMQLRYNISQLEEWLRGRNLHQSGAVQTMEPLIQAAQLLQLKKKTQEDAEAICSLCTSLSTQQIVKILNLYTPLNEFEERVTVAFIRTIQAQLQERNDPQQLLLDAKHMFPVLFPFNPSSLTMDSIHIPACLNLEFLNEV,mutated_sequence,1.0,1848.0,NP_001073936.1.a2m,NP_001073936.1.npy,ClinVar
+NP_001076585.1,NP_001076585.1.csv,MAARLLLLGILLLLLPLPVPAPCHTAARSECKRSHKFVPGAWLAGEGVDVTSLRRSGSFPVDTQRFLRPDGTCTLCENALQEGTLQRLPLALTNWRAQGSGCQRHVTRAKVSSTEAVARDAARSIRNDWKVGLDVTPKPTSNVHVSVAGSHSQAANFAAQKTHQDQYSFSTDTVECRFYSFHVVHTPPLHPDFKRALGDLPHHFNASTQPAYLRLISNYGTHFIRAVELGGRISALTALRTCELALEGLTDNEVEDCLTVEAQVNIGIHGSISAEAKACEEKKKKHKMTASFHQTYRERHSEVVGGHHTSINDLLFGIQAGPEQYSAWVNSLPGSPGLVDYTLEPLHVLLDSQDPRREALRRALSQYLTDRARWRDCSRPCPPGRQKSPRDPCQCVCHGSAVTTQDCCPRQRGLAQLEVTFIQAWGLWGDWFTATDAYVKLFFGGQELRTSTVWDNNNPIWSVRLDFGDVLLATGGPLRLQVWDQDSGRDDDLLGTCDQAPKSGSHEVRCNLNHGHLKFRYHARCLPHLGGGTCLDYVPQMLLGEPPGNRSGAVW,mutated_sequence,1.0,555.0,NP_001076585.1.a2m,NP_001076585.1.npy,ClinVar
+NP_001077083.1,NP_001077083.1.csv,MAALLRRLLQRERPSAASGRPVGRREANLGTDAGVAVRVRFAPSPTGFLHLGGLRTALYNYIFAKKYQGSFILRLEDTDQTRVVPGAAENIEDMLEWAGIPPDESPRRGGPAGPYQQSQRLELYAQATEALLKTGAAYPCFCSPQRLELLKKEALRNHQTPRYDNRCRNMSQEQVAQKLAKDPKPAIRFRLEQVVPAFQDLVYGWNRHEVASVEGDPVIMKSDGFPTYHLACVVDDHHMGISHVLRGSEWLVSTAKHLLLYQALGWQPPHFAHLPLLLNRDGSKLSKRQGDVFLEHFAADGFLPDSLLDIITNCGSGFAENQMGRTLPELITQFNLTQVTCHSALLDLEKLPEFNRLHLQRLVSNESQRRQLVGKLQVLVEEAFGCQLQNRDVLNPVYVERILLLRQGHICRLQDLVSPVYSYLWTRPAVGRAQLDAISEKVDVIAKRVLGLLERSSMSLTQDMLNGELKKLSEGLEGTKYSNVMKLLRMALSGQQQGPPVAEMMLALGPKEVRERIQKVVSS,mutated_sequence,1.0,523.0,NP_001077083.1.a2m,NP_001077083.1.npy,ClinVar
+NP_001077431.1,NP_001077431.1.csv,MHHQQRMAALGTDKELSDLLDFSAMFSPPVSSGKNGPTSLASGHFTGSNVEDRSSSGSWGNGGHPSPSRNYGDGTPYDHMTSRDLGSHDNLSPPFVNSRIQSKTERGSYSSYGRESNLQGCHQQSLLGGDMDMGNPGTLSPTKPGSQYYQYSSNNPRRRPLHSSAMEVQTKKVRKVPPGLPSSVYAPSASTADYNRDSPGYPSSKPATSTFPSSFFMQDGHHSSDPWSSSSGMNQPGYAGMLGNSSHIPQSSSYCSLHPHERLSYPSHSSADINSSLPPMSTFHRSGTNHYSTSSCTPPANGTDSIMANRGSGAAGSSQTGDALGKALASIYSPDHTNNSFSSNPSTPVGSPPSLSAGTAVWSRNGGQASSSPNYEGPLHSLQSRIEDRLERLDDAIHVLRNHAVGPSTAMPGGHGDMHGIIGPSHNGAMGGLGSGYGTGLLSANRHSLMVGTHREDGVALRGSHSLLPNQVPVPQLPVQSATSPDLNPPQDPYRGMPPGLQGQSVSSGSSEIKSDDEGDENLQDTKSSEDKKLDDDKKDIKSITRSRSSNNDDEDLTPEQKAEREKERRMANNARERLRVRDINEAFKELGRMVQLHLKSDKPQTKLLILHQAVAVILSLEQQVRERNLNPKAACLKRREEEKVSSEPPPLSLAGPHPGMGDASNHMGQM,mutated_sequence,1.0,671.0,NP_001077431.1.a2m,NP_001077431.1.npy,ClinVar
+NP_001091.1,NP_001091.1.csv,MCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVHRKCF,mutated_sequence,1.0,377.0,NP_001091.1.a2m,NP_001091.1.npy,ClinVar
+NP_001092.1,NP_001092.1.csv,MDDDIAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVMDSGDGVTHTVPIYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMESCGIHETTFNSIMKCDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF,mutated_sequence,1.0,375.0,NP_001092.1.a2m,NP_001092.1.npy,ClinVar
+NP_001093327.1,NP_001093327.1.csv,MNRHLWKSQLCEMVQPSGGPAADQDVLGEESPLGKPAMLHLPSEQGAPETLQRCLEENQELRDAIRQSNQILRERCEELLHFQASQREEKEFLMCKFQEARKLVERLGLEKLDLKRQKEQALREVEHLKRCQQQMAEDKASVKAQVTSLLGELQESQSRLEAATKECQALEGRARAASEQARQLESEREALQQQHSVQVDQLRMQGQSVEAALRMERQAASEEKRKLAQLQVAYHQLFQEYDNHIKSSVVGSERKRGMQLEDLKQQLQQAEEALVAKQEVIDKLKEEAEQHKIVMETVPVLKAQADIYKADFQAERQAREKLAEKKELLQEQLEQLQREYSKLKASCQESARIEDMRKRHVEVSQAPLPPAPAYLSSPLALPSQRRSPPEEPPDFCCPKCQYQAPDMDTLQIHVMECIE,mutated_sequence,1.0,419.0,NP_001093327.1.a2m,NP_001093327.1.npy,ClinVar
+NP_001094896.1,NP_001094896.1.csv,MEAGPPGSARPAEPGPCLSGQRGADHTASASLQSVAGTEPGRHPQAVAAVLPAGGCGERMGVPTPKQFCPILERPLISYTLQALERVCWIKDIVVAVTGENMEVMKSIIQKYQHKRISLVEAGVTRHRSIFNGLKALAEDQINSKLSKPEVVIIHDAVRPFVEEGVLLKVVTAAKEHGAAGAIRPLVSTVVSPSADGCLDYSLERARHRASEMPQAFLFDVIYEAYQQCSDYDLEFGTECLQLALKYCCTKAKLVEGSPDLWKVTYKRDLYAAESIIKERISQEICVVMDTEEDNKHVGHLLEEVLKSELNHVKVTSEALGHAGRHLQQIILDQCYNFVCVNVTTSDFQETQKLLSMLEESSLCILYPVVVVSVHFLDFKLVPPSQKMENLMQIREFAKEVKERNILLYGLLISYPQDDQKLQESLRQGAIIIASLIKERNSGLIGQLLIA,mutated_sequence,1.0,451.0,NP_001094896.1.a2m,NP_001094896.1.npy,ClinVar
+NP_001096034.1,NP_001096034.1.csv,MEDLLDLDEELRYSLATSRAKMGRRAQQESAQAENHLNGKNSSLTLTGETSSAKLPRCRQGGWAGDSVKASKFRRKASEEIEDFRLRPQSLNGSDYGGDIPIIPDLEEVQEEDFVLQVAAPPSIQIKRVMTYRDLDNDLMKYSAIQTLDGEIDLKLLTKVLAPEHEVREDDVGWDWDHLFTEVSSEVLTEWDPLQTEKEDPAGQARHT,mutated_sequence,1.0,208.0,NP_001096034.1.a2m,NP_001096034.1.npy,ClinVar
+NP_001098101.1,NP_001098101.1.csv,MEAEGSSAPARAGSGEGSDSAGGATLKAPKHLWRHEQHHQYPLRQPQFRLLHPHHHLPPPPPPSPQPQPQCPLQPPPPPPLPPPPPPPGAARGRYASSGATGRVRHRGYSDTERYLYCRAMDRTSYAVETGHRPGLKKSRMSWPSSFQGLRRFDVDNGTSAGRSPLDPMTSPGSGLILQANFVHSQRRESFLYRSDSDYDLSPKSMSRNSSIASDIHGDDLIVTPFAQVLASLRTVRNNFAALTNLQDRAPSKRSPMCNQPSINKATITEEAYQKLASETLEELDWCLDQLETLQTRHSVSEMASNKFKRMLNRELTHLSEMSRSGNQVSEFISNTFLDKQHEVEIPSPTQKEKEKKKRPMSQISGVKKLMHSSSLTNSSIPRFGVKTEQEDVLAKELEDVNKWGLHVFRIAELSGNRPLTVIMHTIFQERDLLKTFKIPVDTLITYLMTLEDHYHADVAYHNNIHAADVVQSTHVLLSTPALEAVFTDLEILAAIFASAIHDVDHPGVSNQFLINTNSELALMYNDSSVLENHHLAVGFKLLQEENCDIFQNLTKKQRQSLRKMVIDIVLATDMSKHMNLLADLKTMVETKKVTSSGVLLLDNYSDRIQVLQNMVHCADLSNPTKPLQLYRQWTDRIMEEFFRQGDRERERGMEISPMCDKHNASVEKSQVGFIDYIVHPLWETWADLVHPDAQDILDTLEDNREWYQSTIPQSPSPAPDDPEEGRQGQTEKFQFELTLEEDGESDTEKDSGSQVEEDTSCSDSKTLCTQDSESTEIPLDEQVEEEAVGEEEESQPEACVIDDRSPDT,mutated_sequence,1.0,809.0,NP_001098101.1.a2m,NP_001098101.1.npy,ClinVar
+NP_001103348.1,NP_001103348.1.csv,MALSSRARAFSVEALVGRPSKRKLQDPIQAEQPELREKKGGEEEEERRSSAAGKSEPLEKQPKTEPSTSASSGCGSDSGYGNSSESLEEKDIQMELQGSELWKRFHDIGTEMIITKAGRRMFPSVRVKVKGLDPGKQYHVAIDVVPVDSKRYRYVYHSSQWMVAGNTDHLCIIPRFYVHPDSPCSGETWMRQIISFDRMKLTNNEMDDKGHIILQSMHKYKPRVHVIEQGSSVDLSQIQSLPTEGVKTFSFKETEFTTVTAYQNQQITKLKIERNPFAKGFRDTGRNRGVLDGLLETYPWRPSFTLDFKTFGADTQSGSSGSSPVTSSGGAPSPLNSLLSPLCFSPMFHLPTSSLGMPCPEAYLPNVNLPLCYKICPTNFWQQQPLVLPAPERLASSNSSQSLAPLMMEVPMLSSLGVTNSKSGSSEDSSDQYLQAPNSTNQMLYGLQSPGNIFLPNSITPEALSCSFHPSYDFYRYNFSMPSRLISGSNHLKVNDDSQVSFGEGKCNHVHWYPAINHYL,mutated_sequence,1.0,520.0,NP_001103348.1.a2m,NP_001103348.1.npy,ClinVar
+NP_001103824.1,NP_001103824.1.csv,MELSYRLFICLLLWGSTELCYPQPLWLLQGGASHPETSVQPVLVECQEATLMVMVSKDLFGTGKLIRAADLTLGPEACEPLVSMDTEDVVRFEVGLHECGNSMQVTDDALVYSTFLLHDPRPVGNLSIVRTNRAEIPIECRYPRQGNVSSQAILPTWLPFRTTVFSEEKLTFSLRLMEENWNAEKRSPTFHLGDAAHLQAEIHTGSHVPLRLFVDHCVATPTPDQNASPYHTIVDFHGCLVDGLTDASSAFKVPRPGPDTLQFTVDVFHFANDSRNMIYITCHLKVTLAEQDPDELNKACSFSKPSNSWFPVEGSADICQCCNKGDCGTPSHSRRQPHVMSQWSRSASRNRRHVTEEADVTVGPLIFLDRRGDHEVEQWALPSDTSVVLLGVGLAVVVSLTLTAVILVLTRRCRTASHPVSASE,mutated_sequence,1.0,424.0,NP_001103824.1.a2m,NP_001103824.1.npy,ClinVar
+NP_001104026.1,NP_001104026.1.csv,MSSSHSRAGQSAAGAAPGGGVDTRDAEMPATEKDLAEDAPWKKIQQNTFTRWCNEHLKCVSKRIANLQTDLSDGLRLIALLEVLSQKKMHRKHNQRPTFRQMQLENVSVALEFLDRESIKLVSIDSKAIVDGNLKLILGLIWTLILHYSISMPMWDEEEDEEAKKQTPKQRLLGWIQNKLPQLPITNFSRDWQSGRALGALVDSCAPGLCPDWDSWDASKPVTNAREAMQQADDWLGIPQVITPEEIVDPNVDEHSVMTYLSQFPKAKLKPGAPLRPKLNPKKARAYGPGIEPTGNMVKKRAEFTVETRSAGQGEVLVYVEDPAGHQEEAKVTANNDKNRTFSVWYVPEVTGTHKVTVLFAGQHIAKSPFEVYVDKSQGDASKVTAQGPGLEPSGNIANKTTYFEIFTAGAGTGEVEVVIQDPMGQKGTVEPQLEARGDSTYRCSYQPTMEGVHTVHVTFAGVPIPRSPYTVTVGQACNPSACRAVGRGLQPKGVRVKETADFKVYTKGAGSGELKVTVKGPKGEERVKQKDLGDGVYGFEYYPMVPGTYIVTITWGGQNIGRSPFEVKVGTECGNQKVRAWGPGLEGGVVGKSADFVVEAIGDDVGTLGFSVEGPSQAKIECDDKGDGSCDVRYWPQEAGEYAVHVLCNSEDIRLSPFMADIRDAPQDFHPDRVKARGPGLEKTGVAVNKPAEFTVDAKHGGKAPLRVQVQDNEGCPVEALVKDNGNGTYSCSYVPRKPVKHTAMVSWGGVSIPNSPFRVNVGAGSHPNKVKVYGPGVAKTGLKAHEPTYFTVDCAEAGQGDVSIGIKCAPGVVGPAEADIDFDIIRNDNDTFTVKYTPRGAGSYTIMVLFADQATPTSPIRVKVEPSHDASKVKAEGPGLSRTGVELGKPTHFTVNAKAAGKGKLDVQFSGLTKGDAVRDVDIIDHHDNTYTVKYTPVQQGPVGVNVTYGGDPIPKSPFSVAVSPSLDLSKIKVSGLGEKVDVGKDQEFTVKSKGAGGQGKVASKIVGPSGAAVPCKVEPGLGADNSVVRFLPREEGPYEVEVTYDGVPVPGSPFPLEAVAPTKPSKVKAFGPGLQGGSAGSPARFTIDTKGAGTGGLGLTVEGPCEAQLECLDNGDGTCSVSYVPTEPGDYNINILFADTHIPGSPFKAHVVPCFDASKVKCSGPGLERATAGEVGQFQVDCSSAGSAELTIEICSEAGLPAEVYIQDHGDGTHTITYIPLCPGAYTVTIKYGGQPVPNFPSKLQVEPAVDTSGVQCYGPGIEGQGVFREATTEFSVDARALTQTGGPHVKARVANPSGNLTETYVQDRGDGMYKVEYTPYEEGLHSVDVTYDGSPVPSSPFQVPVTEGCDPSRVRVHGPGIQSGTTNKPNKFTVETRGAGTGGLGLAVEGPSEAKMSCMDNKDGSCSVEYIPYEAGTYSLNVTYGGHQVPGSPFKVPVHDVTDASKVKCSGPGLSPGMVRANLPQSFQVDTSKAGVAPLQVKVQGPKGLVEPVDVVDNADGTQTVNYVPSREGPYSISVLYGDEEVPRSPFKVKVLPTHDASKVKASGPGLNTTGVPASLPVEFTIDAKDAGEGLLAVQITDPEGKPKKTHIQDNHDGTYTVAYVPDVTGRYTILIKYGGDEIPFSPYRVRAVPTGDASKCTVTVSIGGHGLGAGIGPTIQIGEETVITVDTKAAGKGKVTCTVCTPDGSEVDVDVVENEDGTFDIFYTAPQPGKYVICVRFGGEHVPNSPFQVTALAGDQPSVQPPLRSQQLAPQYTYAQGGQQTWAPERPLVGVNGLDVTSLRPFDLVIPFTIKKGEITGEVRMPSGKVAQPTITDNKDGTVTVRYAPSEAGLHEMDIRYDNMHIPGSPLQFYVDYVNCGHVTAYGPGLTHGVVNKPATFTVNTKDAGEGGLSLAIEGPSKAEISCTDNQDGTCSVSYLPVLPGDYSILVKYNEQHVPGSPFTARVTGDDSMRMSHLKVGSAADIPINISETDLSLLTATVVPPSGREEPCLLKRLRNGHVGISFVPKETGEHLVHVKKNGQHVASSPIPVVISQSEIGDASRVRVSGQGLHEGHTFEPAEFIIDTRDAGYGGLSLSIEGPSKVDINTEDLEDGTCRVTYCPTEPGNYIINIKFADQHVPGSPFSVKVTGEGRVKESITRRRRAPSVANVGSHCDLSLKIPEISIQDMTAQVTSPSGKTHEAEIVEGENHTYCIRFVPAEMGTHTVSVKYKGQHVPGSPFQFTVGPLGEGGAHKVRAGGPGLERAEAGVPAEFSIWTREAGAGGLAIAVEGPSKAEISFEDRKDGSCGVAYVVQEPGDYEVSVKFNEEHIPDSPFVVPVASPSGDARRLTVSSLQESGLKVNQPASFAVSLNGAKGAIDAKVHSPSGALEECYVTEIDQDKYAVRFIPRENGVYLIDVKFNGTHIPGSPFKIRVGEPGHGGDPGLVSAYGAGLEGGVTGNPAEFVVNTSNAGAGALSVTIDGPSKVKMDCQECPEGYRVTYTPMAPGSYLISIKYGGPYHIGGSPFKAKVTGPRLVSNHSLHETSSVFVDSLTKATCAPQHGAPGPGPADASKVVAKGLGLSKAYVGQKSSFTVDCSKAGNNMLLVGVHGPRTPCEEILVKHVGSRLYSVSYLLKDKGEYTLVVKWGDEHIPGSPYRVVVP,mutated_sequence,1.0,2647.0,NP_001104026.1.a2m,NP_001104026.1.npy,ClinVar
+NP_001104262.1,NP_001104262.1.csv,MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHEPVQPSAHHSAEPAEAGKAETSEGSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPGSVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVSIEVKEVVKPLLVSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSSASSPPKKEHHHHHHHSESPKAPVPLLPPLPPPPPEPESSEDPTSPPEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVATAATAAEKYKHRGEGERKDIVSSSMPRPNREEPVDSRTPVTERVS,mutated_sequence,1.0,498.0,NP_001104262.1.a2m,NP_001104262.1.npy,ClinVar
+NP_001108225.1,NP_001108225.1.csv,MDRGTLPLAVALLLASCSLSPTSLAETVHCDLQPVGPERGEVTYTTSQVSKGCVAQAPNAILEVHVLFLEFPTGPSQLELTLQASKQNGTWPREVLLVLSVNSSVFLHLQALGIPLHLAYNSSLVTFQEPPGVNTTELPSFPKTQILEWAAERGPITSAAELNDPQSILLRLGQAQGSLSFCMLEASQDMGRTLEWRPRTPALVRGCHLEGVAGHKEAHILRVLPGHSAGPRTVTVKVELSCAPGDLDAVLILQGPPYVSWLIDANHNMQIWTTGEYSFKIFPEKNIRGFKLPDTPQGLLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDPSCEAEDRGDKFVLRSAYSSCGMQVSASMISNEAVVNILSSSSPQRKKVHCLNMDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQGRAAKGNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHRTVFMRLNIISPDLSGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTRSPSKREPVVAVAAPASSESSSTNHSIGSTQSTPCSTSSMA,mutated_sequence,1.0,658.0,NP_001108225.1.a2m,NP_001108225.1.npy,ClinVar
+NP_001116102.1,NP_001116102.1.csv,MERLVARGTFPVLVRTSACRSLFGPVDHEELSRELQARLAELNAEDQNRWDYDFQQDMPLRGPGRLQWTEVDSDSVPAFYRETVQVGRCRLLLAPRPVAVAVAVSPPLEPAAESLDGLEEAPEQLPSVPVPAPASTPPPVPVLAPAPAPAPAPVAAPVAAPVAVAVLAPAPAPAPAPAPAPAPVAAPAPAPAPAPAPAPAPAPAPDAAPQESAEQGANQGQRGQEPLADQLHSGISGRPAAGTAAASANGAAIKKLSGPLISDFFAKRKRSAPEKSSGDVPAPCPSPSAAPGVGSVEQTPRKRLR,mutated_sequence,1.0,305.0,NP_001116102.1.a2m,NP_001116102.1.npy,ClinVar
+NP_001116427.1,NP_001116427.1.csv,MSTEKVDQKEEAGEKEVCGDQIKGPDKEEEPPAAASHGQGWRPGGRAARNARPEPGARHPALPAMVNDPPVPALLWAQEVGQVLAGRARRLLLQFGVLFCTILLLLWVSVFLYGSFYYSYMPTVSHLSPVHFYYRTDCDSSTTSLCSFPVANVSLTKGGRDRVLMYGQPYRVTLELELPESPVNQDLGMFLVTISCYTRGGRIISTSSRSVMLHYRSDLLQMLDTLVFSSLLLFGFAEQKQLLEVELYADYRENSYVPTTGAIIEIHSKRIQLYGAYLRIHAHFTGLRYLLYNFPMTCAFIGVASNFTFLSVIVLFSYMQWVWGGIWPRHRFSLQVNIRKRDNSRKEVQRRISAHQPGPEGQEESTPQSDVTEDGESPEDPSGTEGQLSEEEKPDQQPLSGEEELEPEASDGSGSWEDAALLTEANLPAPAPASASAPVLETLGSSEPAGGALRQRPTCSSS,mutated_sequence,1.0,462.0,NP_001116427.1.a2m,NP_001116427.1.npy,ClinVar
+NP_001116857.1,NP_001116857.1.csv,MLSATPLYGNVHSWMNSERVRMCGASEDRKILVNDGDASKARLELREENPLNHNVVDASTAHRIDGLAALSMDRTGLIREGLRVPGNIVYSSLCGLGSEKGREAATSTLGGLGFSSERNPEMQFKPNTPETVEASAVSGKPPNGFSAIYKTPPGIQKSAVATAEALGLDRPASDKQSPLNINGASYLRLPWVNPYMEGATPAIYPFLDSPNKYSLNMYKALLPQQSYSLAQPLYSPVCTNGERFLYLPPPHYVGPHIPSSLASPMRLSTPSASPAIPPLVHCADKSLPWKMGVSPGNPVDSHAYPHIQNSKQPRVPSAKAVTSGLPGDTALLLPPSPRPSPRVHLPTQPAADTYSEFHKHYARISTSPSVALSKPYMTVSSEFPAARLSNGKYPKAPEGGEGAQPVPGHARKTAVQDRKDGSSPPLLEKQTVTKDVTDKPLDLSSKVVDVDASKADHMKKMAPTVLVHSRAGSGLVLSGSEIPKETLSPPGNGCAIYRSEIISTAPSSWVVPGPSPNEENNGKSMSLKNKALDWAIPQQRSSSCPRMGGTDAVITNVSGSVSSAGRPASASPAPNANADGTKTSRSSVETTPSVIQHVGQPPATPAKHSSSTSSKGAKASNPEPSFKANENGLPPSSIFLSPNEAFRSPPIPYPRSYLPYPAPEGIAVSPLSLHGKGPVYPHPVLLPNGSLFPGHLAPKPGLPYGLPTGRPEFVTYQDALGLGMVHPMLIPHTPIEITKEEKPERRSRSHERARYEDPTLRNRFSEILETSSTKLHPDVPTDKNLKPNPNWNQGKTVVKSDKLVYVDLLREEPDAKTDTNVSKPSFAAESVGQSAEPPKPSVEPALQQHRDFIALREELGRISDFHETYTFKQPVFTVSKDSVLAGTNKENLGLPVSTPFLEPPLGSDGPAVTFGKTQEDPKPFCVGSAPPSVDVTPTYTKDGADEAESNDGKVLKPKPSKLAKRIANSAGYVGDRFKCVTTELYADSSQLSREQRALQMEGLQEDSILCLPAAYCERAMMRFSELEMKEREGGHPATKDSEMCKFSPADWERLKGNQDKKPKSVTLEEAIAEQNESERCEYSVGNKHRDPFEAPEDKDLPVEKYFVERQPVSEPPADQVASDMPHSPTLRVDRKRKVSGDSSHTETTAEEVPEDPLLKAKRRRVSKDDWPEREMTNSSSNHLEDPHYSELTNLKVCIELTGLHPKKQRHLLHLRERWEQQVSAADGKPGRQSRKEVTQATQPEAIPQGTNITEEKPGRKRAEAKGNRSWSEESLKPSDNEQGLPVFSGSPPMKSLSSTSAGGKKQAQPSCAPASRPPAKQQKIKENQKTDVLCADEEEDCQAASLLQKYTDNSEKPSGKRLCKTKHLIPQESRRGLPLTGEYYVENADGKVTVRRFRKRPEPSSDYDLSPAKQEPKPFDRLQQLLPASQSTQLPCSSSPQETTQSRPMPPEARRLIVNKNAGETLLQRAARLGYEEVVLYCLENKICDVNHRDNAGYCALHEACARGWLNIVRHLLEYGADVNCSAQDGTRPLHDAVENDHLEIVRLLLSYGADPTLATYSGRTIMKMTHSELMEKFLTDYLNDLQGRNDDDASGTWDFYGSSVCEPDDESGYDVLANPPGPEDQDDDDDAYSDVFEFEFSETPLLPCYNIQVSVAQGPRNWLLLSDVLKKLKMSSRIFRCNFPNVEIVTIAEAEFYRQVSASLLFSCSKDLEAFNPESKELLDLVEFTNEIQTLLGSSVEWLHPSDLASDNYW,mutated_sequence,1.0,1755.0,NP_001116857.1.a2m,NP_001116857.1.npy,ClinVar
+NP_001119580.2,NP_001119580.2.csv,MAELPTTETPGDATLCSGRFTISTLLSSDEPSPPAAYDSSHPSHLTHSSTFCMRTFGYNTIDVVPTYEHYANSTQPGEPRKVRPTLADLHSFLKQEGRHLHALAFDSRPSHEMTDGLVEGEAGTSSEKNPEEPVRFGWVKGVMIRCMLNIWGVILYLRLPWITAQAGIVLTWIIILLSVTVTSITGLSISAISTNGKVKSGGTYFLISRSLGPELGGSIGLIFAFANAVGVAMHTVGFAETVRDLLQEYGAPIVDPINDIRIIAVVSVTVLLAISLAGMEWESKAQVLFFLVIMVSFANYLVGTLIPPSEDKASKGFFSYRADIFVQNLVPDWRGPDGTFFGMFSIFFPSATGILAGANISGDLKDPAIAIPKGTLMAIFWTTISYLAISATIGSCVVRDASGVLNDTVTPGWGACEGLACSYGWNFTECTQQHSCHYGLINYYQTMSMVSGFAPLITAGIFGATLSSALACLVSAAKVFQCLCEDQLYPLIGFFGKGYGKNKEPVRGYLLAYAIAVAFIIIAELNTIAPIISNFFLCSYALINFSCFHASITNSPGWRPSFQYYNKWAALFGAIISVVIMFLLTWWAALIAIGVVLFLLLYVIYKKPEVNWGSSVQAGSYNLALSYSVGLNEVEDHIKNYRPQCLVLTGPPNFRPALVDFVGTFTRNLSLMICGHVLIGPHKQRMPELQLIANGHTKWLNKRKIKAFYSDVIAEDLRRGVQILMQAAGLGRMKPNILVVGFKKNWQSAHPATVEDYIGILHDAFDFNYGVCVMRMREGLNVSKMMQAHINPVFDPAEDGKEASARVDPKALVKEEQATTIFQSEQGKKTIDIYWLFDDGGLTLLIPYLLGRKRRWSKCKIRVFVGGQINRMDQERKAIISLLSKFRLGFHEVHILPDINQNPRAEHTKRFEDMIAPFRLNDGFKDEATVNEMRRDCPWKISDEEITKNRVKSLRQVRLNEIVLDYSRDAALIVITLPIGRKGKCPSSLYMAWLETLSQDLRPPVILIRGNQENVLTFYCQ,mutated_sequence,1.0,1021.0,NP_001119580.2.a2m,NP_001119580.2.npy,ClinVar
+NP_001120694.1,NP_001120694.1.csv,MARFGDEMPARYGGGGSGAAAGVVVGSGGGRGAGGSRQGGQPGAQRMYKQSMAQRARTMALYNPIPVRQNCLTVNRSLFLFSEDNVVRKYAKKITEWPPFEYMILATIIANCIVLALEQHLPDDDKTPMSERLDDTEPYFIGIFCFEAGIKIIALGFAFHKGSYLRNGWNVMDFVVVLTGILATVGTEFDLRTLRAVRVLRPLKLVSGIPSLQVVLKSIMKAMIPLLQIGLLLFFAILIFAIIGLEFYMGKFHTTCFEEGTDDIQGESPAPCGTEEPARTCPNGTKCQPYWEGPNNGITQFDNILFAVLTVFQCITMEGWTDLLYNSNDASGNTWNWLYFIPLIIIGSFFMLNLVLGVLSGEFAKERERVENRRAFLKLRRQQQIERELNGYMEWISKAEEVILAEDETDGEQRHPFDALRRTTIKKSKTDLLNPEEAEDQLADIASVGSPFARASIKSAKLENSTFFHKKERRMRFYIRRMVKTQAFYWTVLSLVALNTLCVAIVHYNQPEWLSDFLYYAEFIFLGLFMSEMFIKMYGLGTRPYFHSSFNCFDCGVIIGSIFEVIWAVIKPGTSFGISVLRALRLLRIFKVTKYWASLRNLVVSLLNSMKSIISLLFLLFLFIVVFALLGMQLFGGQFNFDEGTPPTNFDTFPAAIMTVFQILTGEDWNEVMYDGIKSQGGVQGGMVFSIYFIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEQEEEEAANQKLALQKAKEVAEVSPLSAANMSIAVKEQQKNQKPAKSVWEQRTSEMRKQNLLASREALYNEMDPDERWKAAYTRHLRPDMKTHLDRPLVVDPQENRNNNTNKSRAAEPTVDQRLGQQRAEDFLRKQARYHDRARDPSGSAGLDARRPWAGSQEAELSREGPYGRESDHHAREGSLEQPGFWEGEAERGKAGDPHRRHVHRQGGSRESRSGSPRTGADGEHRRHRAHRRPGEEGPEDKAERRARHREGSRPARGGEGEGEGPDGGERRRRHRHGAPATYEGDARREDKERRHRRRKENQGSGVPVSGPNLSTTRPIQQDLGRQDPPLAEDIDNMKNNKLATAESAAPHGSLGHAGLPQSPAKMGNSTDPGPMLAIPAMATNPQNAASRRTPNNPGNPSNPGPPKTPENSLIVTNPSGTQTNSAKTARKPDHTTVDIPPACPPPLNHTVVQVNKNANPDPLPKKEEEKKEEEEDDRGEDGPKPMPPYSSMFILSTTNPLRRLCHYILNLRYFEMCILMVIAMSSIALAAEDPVQPNAPRNNVLRYFDYVFTGVFTFEMVIKMIDLGLVLHQGAYFRDLWNILDFIVVSGALVAFAFTGNSKGKDINTIKSLRVLRVLRPLKTIKRLPKLKAVFDCVVNSLKNVFNILIVYMLFMFIFAVVAVQLFKGKFFHCTDESKEFEKDCRGKYLLYEKNEVKARDREWKKYEFHYDNVLWALLTLFTVSTGEGWPQVLKHSVDATFENQGPSPGYRMEMSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKMMEEYSLEKNERACIDFAISAKPLTRHMPQNKQSFQYRMWQFVVSPPFEYTIMAMIALNTIVLMMKFYGASVAYENALRVFNIVFTSLFSLECVLKVMAFGILNYFRDAWNIFDFVTVLGSITDILVTEFGNNFINLSFLRLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVFGNIGIDVEDEDSDEDEFQITEHNNFRTFFQALMLLFRSATGEAWHNIMLSCLSGKPCDKNSGILTRECGNEFAYFYFVSFIFLCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHLDEYVRVWAEYDPAAWGRMPYLDMYQMLRHMSPPLGLGKKCPARVAYKRLLRMDLPVADDNTVHFNSTLMALIRTALDIKIAKGGADKQQMDAELRKEMMAIWPNLSQKTLDLLVTPHKSTDLTVGKIYAAMMIMEYYRQSKAKKLQAMREEQDRTPLMFQRMEPPSPTQEGGPGQNALPSTQLDPGGALMAHESGLKESPSWVTQRAQEMFQKTGTWSPEQGPPTDMPNSQPNSQSVEMREMGRDGYSDSEHYLPMEGQGRAASMPRLPAENQRRRGRPRGNNLSTISDTSPMKRSASVLGPKARRLDDYSLERVPPEENQRHHQRRRDRSHRASERSLGRYTDVDTGLGTDLSMTTQSGDLPSKERDQERGRPKDRKHRQHHHHHHHHHHPPPPDKDRYAQERPDHGRARARDQRWSRSPSEGREHMAHRQGSSSVSGSPAPSTSGTSTPRRGRRQLPQTPSTPRPHVSYSPVIRKAGGSGPPQQQQQQQQQQQQQAVARPGRAATSGPRRYPGPTAEPLAGDRPPTGGHSSGRSPRMERRVPGPARSESPRACRHGGARWPASGPHVSEGPPGPRHHGYYRGSDYDEADGPGSGGGEEAMAGAYDAPPPVRHASSGATGRSPRTPRASGPACASPSRHGRRLPNGYYPAHGLARPRGPGSRKGLHEPYSESDDDWC,mutated_sequence,1.0,2506.0,NP_001120694.1.a2m,NP_001120694.1.npy,ClinVar
+NP_001120864.1,NP_001120864.1.csv,MEVVDETEALQRFFEGHDINGALEPSNIDTSILEEYISKEDASDLCFPDISAPASSASYSHGQPAMPGSSGVHHLSPPGGGPSPGRHGPLPPPGYGTPLNCNNNNGMGAAPKPFPGGTGPPIKAEPKAPYAPGTLPDSPPDSGSEAYSPQQVNEPHLLRTITPETLCHVGVPSRLEHPPPPPAHLPGPPPPPPPPPHYPVLQRDLYMKAEPPIPHYAAMGQGLVPTDLHHTQQSQMLHQLLQQHGAELPTHPSKKRKHSESPPSTLNAQMLNGMIKQEPGTVTALPLHPTRAPSPPWPPQGPLSPGPGSLPLSIARVQTPPWHPPGAPSPGLLQDSDSLSGSYLDPNYQSIKWQPHQQNKWATLYDANYKELPMLTYRVDADKGFNFSVGDDAFVCQKKNHFQVTVYIGMLGEPKYVKTPEGLKPLDCFYLKLHGVKLEALNQSINIEQSQSDRSKRPFNPVTVNLPPEQVTKVTVGRLHFSETTANNMRKKGKPNPDQRYFMLVVALQAHAQNQNYTLAAQISERIIVRASNPGQFESDSDVLWQRAQVPDTVFHHGRVGINTDRPDEALVVHGNVKVMGSLMHPSDLRAKEHVQEVDTTEQLKRISRMRLVHYRYKPEFAASAGIEATAPETGVIAQEVKEILPEAVKDTGDMVFANGKTIENFLVVNKERIFMENVGAVKELCKLTDNLETRIDELERWSHKLAKLRRLDSLKSTGSSGAFSHAGSQFSRAGSVPHKKRPPKVASKSSSVVPDQACISQRFLQGTIIALVVVMAFSVVSMSTLYVLSLRTEEDLVDTDGSFAVSTSCLLALLRPQPPGGSEALCPWSSQSFGTTQLRQSPLTTGLPGIQPSLLLVTTSLTSSAPGSAVRTLDMCSSHPCPVICCSSPTTNPTTGPSLGPSFNPGHVLSPSPSPSTNRSGPSQMALLPVTNIRAKSWGLSVNGIGHSKHHKSLEPLASPAVPFPGGQGKAKNSPSLGFHGRARRGALQSSVGPAEPTWAQGQSASLLAEPVPSLTSIQVLENSMSITSQYCAPGDACRPGNFTYHIPVSSGTPLHLSLTLQMNSSSPVSVVLCSLRSKEEPCEEGSLPQSLHTHQDTQGTSHRWPITILSFREFTYHFRVALLGQANCSSEALAQPATDYHFHFYRLCD,mutated_sequence,1.0,1151.0,NP_001120864.1.a2m,NP_001120864.1.npy,ClinVar
+NP_001121636.1,NP_001121636.1.csv,MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRHGPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQYAHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTLLANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPPPAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHFVPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRHVVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGKPGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQHLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVTQAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSRHRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESRKLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK,mutated_sequence,1.0,815.0,NP_001121636.1.a2m,NP_001121636.1.npy,ClinVar
+NP_001121697.2,NP_001121697.2.csv,MPGCPCPGCGMAGPRLLFLTALALELLERAGGSQPALRSRGTATACRLDNKESESWGALLSGERLDTWICSLLGSLMVGLSGVFPLLVIPLEMGTMLRSEAGAWRLKQLLSFALGGLLGNVFLHLLPEAWAYTCSASPGGEGQSLQQQQQLGLWVIAGILTFLALEKMFLDSKEEGTSQAPNKDPTAAAAALNGGHCLAQPAAEPGLGAVVRSIKVSGYLNLLANTIDNFTHGLAVAASFLVSKKIGLLTTMAILLHEIPHEVGDFAILLRAGFDRWSAAKLQLSTALGGLLGAGFAICTQSPKGVVGCSPAAEETAAWVLPFTSGGFLYIALVNVLPDLLEEEDPWRSLQQLLLLCAGIVVMVLFSLFVD,mutated_sequence,1.0,371.0,NP_001121697.2.a2m,NP_001121697.2.npy,ClinVar
+NP_001123910.1,NP_001123910.1.csv,MDPSGVKVLETAEDIQERRQQVLDRYHRFKELSTLRRQKLEDSYRFQFFQRDAEELEKWIQEKLQIASDENYKDPTNLQGKLQKHQAFEAEVQANSGAIVKLDETGNLMISEGHFASETIRTRLMELHRQWELLLEKMREKGIKLLQAQKLVQYLRECEDVMDWINDKEAIVTSEELGQDLEHVEVLQKKFEEFQTDMAAHEERVNEVNQFAAKLIQEQHPEEELIKTKQDEVNAAWQRLKGLALQRQGKLFGAAEVQRFNRDVDETISWIKEKEQLMASDDFGRDLASVQALLRKHEGLERDLAALEDKVKALCAEADRLQQSHPLSATQIQVKREELITNWEQIRTLAAERHARLNDSYRLQRFLADFRDLTSWVTEMKALINADELASDVAGAEALLDRHQEHKGEIDAHEDSFKSADESGQALLAAGHYASDEVREKLTVLSEERAALLELWELRRQQYEQCMDLQLFYRDTEQVDNWMSKQEAFLLNEDLGDSLDSVEALLKKHEDFEKSLSAQEEKITALDEFATKLIQNNHYAMEDVATRRDALLSRRNALHERAMRRRAQLADSFHLQQFFRDSDELKSWVNEKMKTATDEAYKDPSNLQGKVQKHQAFEAELSANQSRIDALEKAGQKLIDVNHYAKDEVAARMNEVISLWKKLLEATELKGIKLREANQQQQFNRNVEDIELWLYEVEGHLASDDYGKDLTNVQNLQKKHALLEADVAAHQDRIDGITIQARQFQDAGHFDAENIKKKQEALVARYEALKEPMVARKQKLADSLRLQQLFRDVEDEETWIREKEPIAASTNRGKDLIGVQNLLKKHQALQAEIAGHEPRIKAVTQKGNAMVEEGHFAAEDVKAKLHELNQKWEALKAKASQRRQDLEDSLQAQQYFADANEAESWMREKEPIVGSTDYGKDEDSAEALLKKHEALMSDLSAYGSSIQALREQAQSCRQQVAPTDDETGKELVLALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVNDRQGFVPAAYVKKLDPAQSASRENLLEEQGSIALRQEQIDNQTRITKEAGSVSLRMKQVEELYHSLLELGEKRKGMLEKSCKKFMLFREANELQQWINEKEAALTSEEVGADLEQVEVLQKKFDDFQKDLKANESRLKDINKVAEDLESEGLMAEEVQAVQQQEVYGMMPRDETDSKTASPWKSARLMVHTVATFNSIKELNERWRSLQQLAEERSQLLGSAHEVQRFHRDADETKEWIEEKNQALNTDNYGHDLASVQALQRKHEGFERDLAALGDKVNSLGETAERLIQSHPESAEDLQEKCTELNQAWSSLGKRADQRKAKLGDSHDLQRFLSDFRDLMSWINGIRGLVSSDELAKDVTGAEALLERHQEHRTEIDARAGTFQAFEQFGQQLLAHGHYASPEIKQKLDILDQERADLEKAWVQRRMMLDQCLELQLFHRDCEQAENWMAAREAFLNTEDKGDSLDSVEALIKKHEDFDKAINVQEEKIAALQAFADQLIAAGHYAKGDISSRRNEVLDRWRRLKAQMIEKRSKLGESQTLQQFSRDVDEIEAWISEKLQTASDESYKDPTNIQLSKLLSKHQKHQAFEAELHANADRIRGVIDMGNSLIERGACAGSEDAVKARLAALADQWQFLVQKSAEKSQKLKEANKQQNFNTGIKDFDFWLSEVEALLASEDYGKDLASVNNLLKKHQLLEADISAHEDRLKDLNSQADSLMTSSAFDTSQVKDKRDTINGRFQKIKSMAASRRAKLNESHRLHQFFRDMDDEESWIKEKKLLVGSEDYGRDLTGVQNLRKKHKRLEAELAAHEPAIQGVLDTGKKLSDDNTIGKEEIQQRLAQFVEHWKELKQLAAARGQRLEESLEYQQFVANVEEEEAWINEKMTLVASEDYGDTLAAIQGLLKKHEAFETDFTVHKDRVNDVCTNGQDLIKKNNHHEENISSKMKGLNGKVSDLEKAAAQRKAKLDENSAFLQFNWKADVVESWIGEKENSLKTDDYGRDLSSVQTLLTKQETFDAGLQAFQQEGIANITALKDQLLAAKHVQSKAIEARHASLMKRWSQLLANSAARKKKLLEAQSHFRKVEDLFLTFAKKASAFNSWFENAEEDLTDPVRCNSLEEIKALREAHDAFRSSLSSAQADFNQLAELDRQIKSFRVASNPYTWFTMEALEETWRNLQKIIKERELELQKEQRRQEENDKLRQEFAQHANAFHQWIQETRTYLLDGSCMVEESGTLESQLEATKRKHQEIRAMRSQLKKIEDLGAAMEEALILDNKYTEHSTVGLAQQWDQLDQLGMRMQHNLEQQIQARNTTGVTEEALKEFSMMFKHFDKDKSGRLNHQEFKSCLRSLGYDLPMVEEGEPDPEFEAILDTVDPNRDGHVSLQEYMAFMISRETENVKSSEEIESAFRALSSEGKPYVTKEELYQNLTREQADYCVSHMKPYVDGKGRELPTAFDYVEFTRSLFVN,mutated_sequence,1.0,2477.0,NP_001123910.1.a2m,NP_001123910.1.npy,ClinVar
+NP_001127879.1,NP_001127879.1.csv,MGRVGYWTLLVLPALLVWRGPAPSAAAEKGPPALNIAVMLGHSHDVTERELRTLWGPEQAAGLPLDVNVVALLMNRTDPKSLITHVCDLMSGARIHGLVFGDDTDQEAVAQMLDFISSHTFVPILGIHGGASMIMADKDPTSTFFQFGASIQQQATVMLKIMQDYDWHVFSLVTTIFPGYREFISFVKTTVDNSFVGWDMQNVITLDTSFEDAKTQVQLKKIHSSVILLYCSKDEAVLILSEARSLGLTGYDFFWIVPSLVSGNTELIPKEFPSGLISVSYDDWDYSLEARVRDGIGILTTAASSMLEKFSYIPEAKASCYGQMERPEVPMHTLHPFMVNVTWDGKDLSFTEEGYQVHPRLVVIVLNKDREWEKVGKWENHTLSLRHAVWPRYKSFSDCEPDDNHLSIVTLEEAPFVIVEDIDPLTETCVRNTVPCRKFVKINNSTNEGMNVKKCCKGFCIDILKKLSRTVKFTYDLYLVTNGKHGKKVNNVWNGMIGEVVYQRAVMAVGSLTINEERSEVVDFSVPFVETGISVMVSRSNGTVSPSAFLEPFSASVWVMMFVMLLIVSAIAVFVFEYFSPVGYNRNLAKGKAPHGPSFTIGKAIWLLWGLVFNNSVPVQNPKGTTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEFVDQVTGLSDKKFQRPHDYSPPFRFGTVPNGSTERNIRNNYPYMHQYMTKFNQKGVEDALVSLKTGKLDAFIYDAAVLNYKAGRDEGCKLVTIGSGYIFATTGYGIALQKGSPWKRQIDLALLQFVGDGEMEELETLWLTGICHNEKNEVMSSQLDIDNMAGVFYMLAAAMALSLITFIWEHLFYWKLRFCFTGVCSDRPGLLFSISRGIYSCIHGVHIEEKKKSPDFNLTGSQSNMLKLLRSAKNISSMSNMNSSRMDSPKRAADFIQRGSLIMDMVSDKGNLMYSDNRSFQGKESIFGDNMNELQTFVANRQKDNLNNYVFQGQHPLTLNESNPNTVEVAVSTESKANSRPRQLWKKSVDSIRQDSLSQNPVSQRDEATAENRTHSLKSPRYLPEEMAHSDISETSNRATCHREPDNSKNHKTKDNFKRSVASKYPKDCSEVERTYLKTKSSSPRDKIYTIDGEKEPGFHLDPPQFVENVTLPENVDFPDPYQDPSENFRKGDSTLPMNRNPLHNEEGLSNNDQYKLYSKHFTLKDKGSPHSETSERYRQNSTHCRSCLSNMPTYSGHFTMRSPFKCDACLRMGNLYDIDEDQMLQETGNPATGEQVYQQDWAQNNALQLQKNKLRISRQHSYDNIVDKPRELDLSRPSRSISLKDRERLLEGNFYGSLFSVPSSKLSGKKSSLFPQGLEDSKRSKSLLPDHTSDNPFLHSHRDDQRLVIGRCPSDPYKHSLPSQAVNDSYLRSSLRSTASYCSRDSRGHNDVYISEHVMPYAANKNNMYSTPRVLNSCSNRRVYKKMPSIESDV,mutated_sequence,1.0,1464.0,NP_001127879.1.a2m,NP_001127879.1.npy,ClinVar
+NP_001128145.1,NP_001128145.1.csv,MYSPLCLTQDEFHPFIEALLPHVRAFAYTWFNLQARKRKYFKKHEKRMSKEEERAVKDELLSEKPEVKQKWASRLLAKLRKDIRPEYREDFVLTVTGKKPPCCVLSNPDQKGKMRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLVKSPQCSNPGLCVQPHHIGVSVKELDLYLAYFVHAADSSQSESPSQPSDADIKDQPENGHLGFQDSFVTSGVFSVTELVRVSQTPIAAGTGPNFSLSDLESSSYYSMSPGAMRRSLPSTSSTSSTKRLKSVEDEMDSPGEEPFYTGQGRSPGSGSQSSGWHEVEPGMPSPTTLKKSEKSGFSSPSPSQTSSLGTAFTQHHRPVITGPRASPHATPSTLHFPTSPIIQQPGPYFSHPAIRYHPQETLKEFVQLVCPDAGQQAGQVGFLNPNGSSQGKVHNPFLPTPMLPPPPPPPMARPVPLPVPDTKPPTTSTEGGAASPTSPTYSTPSTSPANRFVSVGPRDPSFVNIPQQTQSWYLG,mutated_sequence,1.0,509.0,NP_001128145.1.a2m,NP_001128145.1.npy,ClinVar
+NP_001130.1,NP_001130.1.csv,MATYKVRVATGTDLLSGTRDSISLTIVGTQGESHKQLLNHFGRDFATGAVGQYTVQCPQDLGELIIIRLHKERYAFFPKDPWYCNYVQICAPNGRIYHFPAYQWMDGYETLALREATGKTTADDSLPVLLEHRKEEIRAKQDFYHWRVFLPGLPSYVHIPSYRPPVRRHRNPNRPEWNGYIPGFPILINFKATKFLNLNLRYSFLKTASFFVRLGPMALAFKVRGLLDCKHSWKRLKDIRKIFPGKKSVVSEYVAEHWAEDTFFGYQYLNGVNPGLIRRCTRIPDKFPVTDDMVAPFLGEGTCLQAELEKGNIYLADYRIMEGIPTVELSGRKQHHCAPLCLLHFGPEGKMMPIAIQLSQTPGPDCPIFLPSDSEWDWLLAKTWVRYAEFYSHEAIAHLLETHLIAEAFCLALLRNLPMCHPLYKLLIPHTRYTVQINSIGRAVLLNEGGLSAKGMSLGVEGFAGVMVRALSELTYDSLYLPNDFVERGVQDLPGYYYRDDSLAVWNALEKYVTEIITYYYPSDAAVEGDPELQSWVQEIFKECLLGRESSGFPRCLRTVPELIRYVTIVIYTCSAKHAAVNTGQMEFTAWMPNFPASMRNPPIQTKGLTTLETFMDTLPDVKTTCITLLVLWTLSREPDDRRPLGHFPDIHFVEEAPRRSIEAFRQRLNQISHDIRQRNKCLPIPYYYLDPVLIENSISI,mutated_sequence,1.0,701.0,NP_001130.1.a2m,NP_001130.1.npy,ClinVar
+NP_001133.1,NP_001133.1.csv,MGTWILFACLLGAAFAMPLPPHPGHPGYINFSYEVLTPLKWYQSIRPPYPSYGYEPMGGWLHHQIIPVLSQQHPPTHTLQPHHHIPVVPAQQPVIPQQPMMPVPGQHSMTPIQHHQPNLPPPAQQPYQPQPVQPQPHQPMQPQPPVHPMQPLPPQPPLPPMFPMQPLPPMLPDLTLEAWPSTDKTKREEVD,mutated_sequence,1.0,191.0,NP_001133.1.a2m,NP_001133.1.npy,ClinVar
+NP_001136256.1,NP_001136256.1.csv,MSSSCSGLSRVLVAVATALVSASSPCPQAWGPPGVQYGQPGRSVKLCCPGVTAGDPVSWFRDGEPKLLQGPDSGLGHELVLAQADSTDEGTYICQTLDGALGGTVTLQLGYPPARPVVSCQAADYENFSCTWSPSQISGLPTRYLTSYRKKTVLGADSQRRSPSTGPWPCPQDPLGAARCVVHGAEFWSQYRINVTEVNPLGASTRLLDVSLQSILRPDPPQGLRVESVPGYPRRLRASWTYPASWPCQPHFLLKFRLQYRPAQHPAWSTVEPAGLEEVITDAVAGLPHAVRVSARDFLDAGTWSTWSPEAWGTPSTGTIPKEIPAWGQLHTQPEVEPQVDSPAPPRPSLQPHPRLLDHRDSVEQVAVLASLGILSFLGLVAGALALGLWLRLRRGGKDGSPKPGFLASVIPVDRRPGAPNL,mutated_sequence,1.0,422.0,NP_001136256.1.a2m,NP_001136256.1.npy,ClinVar
+NP_001136272.1,NP_001136272.1.csv,MTDKSIVILSLMVFHSSFINGKTCRRQLVEEWHPQPSSYVVNWTLTENICLDFYRDCWFLGVNTKIDTSGNQAVPQICPLQIQLGDILVISSEPSLQFPEINLMNVSETSFVGCVQNTTTEDQLLFGCRLKGMHTVNSKWLSVGTHYFITVMASGPSPCPLGLRLNVTVKQQFCQESLSSEFCSGHGKCLSEAWSKTYSCHCQPPFSGKYCQELDACSFKPCKNNGSCINKRENWDEQAYECVCHPPFTGKNCSEIIGQCQPHVCFHGNCSNITSNSFICECDEQFSGPFCEVSAKPCVSLLFWKRGICPNSSSAYTYECPKGSSSQNGETDVSEFSLVPCQNGTDCIKISNDVMCICSPIFTDLLCKSIQTSCESFPLRNNATCKKCEKDYPCSCISGFTEKNCEKAIDHCKLLSINCLNEEWCFNIIGRFKYVCIPGCTKNPCWFLKNVYLIHQHLCYCGVTFHGICQDKGPAQFEYVWQLGFAGSEGEKCQGVIDAYFFLAANCTEDATYVNDPEDNNSSCWFPHEGTKEICANGCSCLSEEDSQEYRYLCFLRWAGNMYLENTTDDQENECQHEAVCKDEINRPRCSCSLSYIGRLCVVNVDYCLGNHSISVHGLCLALSHNCNCSGLQRYERNICEIDTEDCKSASCKNGTTSTHLRGYFFRKCVPGFKGTQCEIDIDECASHPCKNGATCIDQPGNYFCQCVPPFKVVDGFSCLCNPGYVGIRCEQDIDDCILNACEHNSTCKDLHLSYQCVCLSDWEGNFCEQESNECKMNPCKNNSTCTDLYKSYRCECTSGWTGQNCSEEINECDSDPCMNGGLCHESTIPGQFVCLCPPLYTGQFCHQRYNLCDLLHNPCRNNSTCLALVDANQHCICREEFEGKNCEIDVKDCLFLSCQDYGDCEDMVNNFRCICRPGFSGSLCEIEINECSSEPCKNNGTCVDLTNRFFCNCEPEYHGPFCELDVNKCKISPCLDEENCVYRTDGYNCLCAPGYTGINCEINLDECLSEPCLHDGVCIDGINHYTCDCKSGFFGTHCETNANDCLSNPCLHGRCTELINEYPCSCDADGTSTQCKIKINDCTSIPCMNEGFCQKSAHGFTCICPRGYTGAYCEKSIDNCAEPELNSVICLNGGICVDGPGHTFDCRCLPGFSGQFCEININECSSSPCLHGADCEDHINGYVCKCQPGWSGHHCENELECIPNSCVHELCMENEPGSTCLCTPGFMTCSIGLLCGDEIRRITCLTPIFQRTDPISTQTYTIPPSETLVSSFPSIKATRIPAIMDTYPVDQGPKQTGIVKHDILPTTGLATLRISTPLESYLLQELIVTRELSAKHSLLSSADVSSSRFLNFGIRDPAQIVQDKTSVSHMPIRTSAATLGFFFPDRRARTPFIMSSLMSDFIFPTQSLLFENCQTVALSATPTTSVIRSIPGADIELNRQSLLSRGFLLIAASISATPVVSRGAQEDIEEYSADSLISRREHWRLLSPSMSPIFPAKVIISKQVTILNSSALHRFSTKAFNPSEYQAITEASSNQRLTNIKSQAADSLRELSQTCATCSMTEIKSSREFSDQVLHSKQSHFYETFWMNSAILASWYALMGAQTITSGHSFSSATEITPSVAFTEVPSLFPSKKSAKRTILSSSLEESITLSSNLDVNLCLDKTCLSIVPSQTISSDLMNSDLTSKMTTDELSVSENILKLLKIRQYGITMGPTEVLNQESLLDMEKSKGSHTLFKLHPSDSSLDFELNLQIYPDVTLKTYSEITHANDFKNNLPPLTGSVPDFSEVTTNVAFYTVSATPALSIQTSSSMSVIRPDWPYFTDYMTSLKKEVKTSSEWSKWELQPSVQYQEFPTASRHLPFTRSLTLSSLESILAPQRLMISDFSCVRYYGDSYLEFQNVALNPQNNISLEFQTFSSYGLLLYVKQDSNLVDGFFIQLFIENGTLKYHFYCPGEAKFKSINTTVRVDNGQKYTLLIRQELDPCNAELTILGRNTQICESINHVLGKPLPKSGSVFIGGFPDLHGKIQMPVPVKNFTGCIEVIEINNWRSFIPSKAVKNYHINNCRSQGFMLSPTASFVDASDVTQGVDTMWTSVSPSVAAPSVCQQDVCHNGGTCHAIFLSSGIVSFQCDCPLHFTGRFCEKDAGLFFPSFNGNSYLELPFLKFVLEKEHNRTVTIYLTIKTNSLNGTILYSNGNNCGKQFLHLFLVEGRPSVKYGCGNSQNILTVSANYSINTNAFTPITIRYTTPVGSPGVVCMIEMTADGKPPVQKKDTEISHASQAYFESMFLGHIPANVQIHKKAGPVYGFRGCILDLQVNNKEFFIIDEARHGKNIENCHVPWCAHHLCRNNGTCISDNENLFCECPRLYSGKLCQFASCENNPCGNGATCVPKSGTDIVCLCPYGRSGPLCTDAINITQPRFSGTDAFGYTSFLAYSRISDISFHYEFHLKFQLANNHSALQNNLIFFTGQKGHGLNGDDFLAVGLLNGSVVYSYNLGSGIASIRSEPLNLSLGVHTVHLGKFFQEGWLKVDDHKNKSIIAPGRLVGLNVFSQFYVGGYSEYTPDLLPNGADFKNGFQGCIFTLQVRTEKDGHFRGLGNPEGHPNAGRSVGQCHASPCSLMKCGNGGTCIESGTSVYCNCTTGWKGSFCTETVSTCDPEHDPPHHCSRGATCISLPHGYTCFCPLGTTGIYCEQALSISDPSFRSNELSWMSFASFHVRKKTHIQLQFQPLAADGILFYAAQHLKAQSGDFLCISLVNSSVQLRYNLGDRTIILETLQKVTINGSTWHIIKAGRVGAEGYLDLDGINVTEKASTKMSSLDTNTDFYIGGVSSLNLVNPMAIENEPVGFQGCIRQVIINNQELQLTEFGAKGGSNVGDCDGTACGYNTCRNGGECTVNGTTFSCRCLPDWAGNTCNQSVSCLNNLCLHQSLCIPDQSFSYSCLCTLGWVGRYCENKTSFSTAKFMGNSYIKYIDPNYRMRNLQFTTISLNFSTTKTEGLIVWMGIAQNEENDFLAIGLHNQTLKIAVNLGERISVPMSYNNGTFCCNKWHHVVVIQNQTLIKAYINNSLILSEDIDPHKNFVALNYDGICYLGGFEYGRKVNIVTQEIFKTNFVGKIKDVVFFQEPKNIELIKLEGYNVYDGDEQNEVT,mutated_sequence,1.0,3144.0,NP_001136272.1.a2m,NP_001136272.1.npy,ClinVar
+NP_001138830.1,NP_001138830.1.csv,MKRRLDDQESPVYAAQQRRIPGSTEAFPHQHRVLAPAPPVYEAVSETMQSATGIQYSVTPSYQVSAMPQSSGSHGPAIAAVHSSHHHPTAVQPHGGQVVQSHAHPAPPVAPVQGQQQFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIMGFNTFLPPGYKIEVQTNDMVNVTTPGQVHQIPTHGIQPQPQPPPQHPSQPSAQSAPAPAQPAPQPPPAKVSKPSQLQAHTPASQQTPPLPPYASPRSPPVQPHTPVTISLGTAPSLQNNQPVEFNHAINYVNKIKNRFQGQPDIYKAFLEILHTYQKEQRNAKEAGGNYTPALTEQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSVRNDHGGTVKKPQLNNKPQRPSQNGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESLFFDKVRKALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHLETYPKERATEGIAMEIDYASCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSWSEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAIQKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIYADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVPVGPHLSLAYEDKQILEDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPDLLFAQRGDLSDVEEEEEEEMDVDEATGAVKKHNGVGGSPPKSKLLFSNTAAQKLRGMDEVYNLFYVNNNWYIFMRLHQILCLRLLRICSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLKEPMDVDVEDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTQNSRSLLESTYQRKAEQLMSDENCFKLMFIQSQGQVQLTIELLDTEEENSDDPVEAERWSDYVERYMNSDTTSPELREHLAQKPVFLPRNLRRIRKCQRGREQQEKEGKEGNSKKTMENVDSLDKLECRFKLNSYKMVYVIKSEDYMYRRTALLRAHQSHERVSKRLHQRFQAWVDKWTKEHVPREMAAETSKWLMGEGLEGLVPCTTTCDTETLHFVSINKYRVKYGTVFKAP,mutated_sequence,1.0,1273.0,NP_001138830.1.a2m,NP_001138830.1.npy,ClinVar
+NP_001153171.1,NP_001153171.1.csv,MASHRHSGPSSYKVGTMAEKFDCHYCRDPLQGKKYVQKDGHHCCLKCFDKFCANTCVECRKPIGADSKEVHYKNRFWHDTCFRCAKCLHPLANETFVAKDNKILCNKCTTREDSPKCKGCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTGSFFPKGEDFYCVTCHETKFAKHCVKCNKAITSGGITYQDQPWHADCFVCVTCSKKLAGQRFTAVEDQYYCVDCYKNFVAKKCAGCKNPITGFGKGSSVVAYEGQSWHDYCFHCKKCSVNLANKRFVFHQEQVYCPDCAKKL,mutated_sequence,1.0,296.0,NP_001153171.1.a2m,NP_001153171.1.npy,ClinVar
+NP_001157749.1,NP_001157749.1.csv,MAAQGYGYYRTVIFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDDLGFITSSQSAAYAISKFVSGVLSDQMSARWLFSSGLLLVGLVNIFFAWSSTVPVFAALWFLNGLAQGLGWPPCGKVLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLCLLLIHNEPADVGLRNLDPMPSEGKKGSLKEESTLQELLLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGLVGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVLGAVFGFSSYGPIALFGVIANESAPPNLCGTSHAIVGLMANVGGFLAGLPFSTIAKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE,mutated_sequence,1.0,429.0,NP_001157749.1.a2m,NP_001157749.1.npy,ClinVar
+NP_001159435.1,NP_001159435.1.csv,MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDDDENGPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRFSATSALYILTPFNPLRKIAIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKIIARGFCLEDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCIQWPPTNASLEEHSIEKNITVNYNGTLINETVFEFDWKSYIQDSRYHYFLEGFLDALLCGNSSDAGQCPEGYMCVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILAVVAMAYEEQNQATLEEAEQKEAEFQQMIEQLKKQQEAAQQAATATASEHSREPSAAGRLSDSSSEASKLSSKSAKERRNRRKKRKQKEQSGGEEKDEDEFQKSESEDSIRRKGFRFSIEGNRLTYEKRYSSPHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDVGSENDFADDEHSTFEDNESRRDSLFVPRRHGERRNSNLSQTSRSSRMLAVFPANGKMHSTVDCNGVVSLVGGPSVPTSPVGQLLPEVIIDKPATDDNGTTTETEMRKRRSSSFHVSMDFLEDPSQRQRAMSIASILTNTVEELEESRQKCPPCWYKFSNIFLIWDCSPYWLKVKHVVNLVVMDPFVDLAITICIVLNTLFMAMEHYPMTDHFNNVLTVGNLVFTGIFTAEMFLKIIAMDPYYYFQEGWNIFDGFIVTLSLVELGLANVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKDCVCKIASDCQLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLTVFMMVMVIGNLVVLNLFLALLLSSFSADNLAATDDDNEMNNLQIAVDRMHKGVAYVKRKIYEFIQQSFIRKQKILDEIKPLDDLNNKKDSCMSNHTAEIGKDLDYLKDVNGTTSGIGTGSSVEKYIIDESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLEESKEKLNESSSSSEGSTVDIGAPVEEQPVVEPEETLEPEACFTEGCVQRFKCCQINVEEGRGKQWWNLRRTCFRIVEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGYQTYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCINTTTGDRFDIEDVNNHTDCLKLIERNETARWKNVKVNFDNVGFGYLSLLQVATFKGWMDIMYAAVDSRNVELQPKYEESLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPGNKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQSEYVTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVVVILSIVGMFLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPPDCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTLKRKQEEVSAVIIQRAYRRHLLKRTVKQASFTYNKNKIKGGANLLIKEDMIIDRINENSITEKTDLTMSTAACPPSYDRVTKPIVEKHEQEGKDEKAKGK,mutated_sequence,1.0,2009.0,NP_001159435.1.a2m,NP_001159435.1.npy,ClinVar
+NP_001162.5,NP_001162.5.csv,MAAPAEPCAGQGVWNQTEPEPAATSLLSLCFLRTAGVWVPPMYLWVLGPIYLLFIHHHGRGYLRMSPLFKAKMVLGFALIVLCTSSVAVALWKIQQGTPEAPEFLIHPTVWLTTMSFAVFLIHTERKKGVQSSGVLFGYWLLCFVLPATNAAQQASGAGFQSDPVRHLSTYLCLSLVVAQFVLSCLADQPPFFPEDPQQSNPCPETGAAFPSKATFWWVSGLVWRGYRRPLRPKDLWSLGRENSSEELVSRLEKEWMRNRSAARRHNKAIAFKRKGGSGMKAPETEPFLRQEGSQWRPLLKAIWQVFHSTFLLGTLSLIISDVFRFTVPKLLSLFLEFIGDPKPPAWKGYLLAVLMFLSACLQTLFEQQNMYRLKVLQMRLRSAITGLVYRKVLALSSGSRKASAVGDVVNLVSVDVQRLTESVLYLNGLWLPLVWIVVCFVYLWQLLGPSALTAIAVFLSLLPLNFFISKKRNHHQEEQMRQKDSRARLTSSILRNSKTIKFHGWEGAFLDRVLGIRGQELGALRTSGLLFSVSLVSFQVSTFLVALVVFAVHTLVAENAMNAEKAFVTLTVLNILNKAQAFLPFSIHSLVQARVSFDRLVTFLCLEEVDPGVVDSSSSGSAAGKDCITIHSATFAWSQESPPCLHRINLTVPQGCLLAVVGPVGAGKSSLLSALLGELSKVEGFVSIEGAVAYVPQEAWVQNTSVVENVCFGQELDPPWLERVLEACALQPDVDSFPEGIHTSIGEQGMNLSGGQKQRLSLARAVYRKAAVYLLDDPLAALDAHVGQHVFNQVIGPGGLLQGTTRILVTHALHILPQADWIIVLANGAIAEMGSYQELLQRKGALMCLLDQARQPGDRGEGETEPGTSTKDPRGTSAGRRPELRRERSIKSVPEKDRTTSEAQTEVPLDDPDRAGWPAGKDSIQYGRVKATVHLAYLRAVGTPLCLYALFLFLCQQVASFCRGYWLSLWADDPAVGGQQTQAALRGGIFGLLGCLQAIGLFASMAAVLLGGARASRLLFQRLLWDVVRSPISFFERTPIGHLLNRFSKETDTVDVDIPDKLRSLLMYAFGLLEVSLVVAVATPLATVAILPLFLLYAGFQSLYVVSSCQLRRLESASYSSVCSHMAETFQGSTVVRAFRTQAPFVAQNNARVDESQRISFPRLVADRWLAANVELLGNGLVFAAATCAVLSKAHLSAGLVGFSVSAALQVTQTLQWVVRNWTDLENSIVSVERMQDYAWTPKEAPWRLPTCAAQPPWPQGGQIEFRDFGLRYRPELPLAVQGVSFKIHAGEKVGIVGRTGAGKSSLASGLLRLQEAAEGGIWIDGVPIAHVGLHTLRSRISIIPQDPILFPGSLRMNLDLLQEHSDEAIWAALETVQLKALVASLPGQLQYKCADRGEDLSVGQKQLLCLARALLRKTQILILDEATAAVDPGTELQMQAMLGSWFAQCTVLLIAHRLRSVMDCARVLVMDKGQVAESGSPAQLLAQKGLFYRLAQESGLV,mutated_sequence,1.0,1503.0,NP_001162.5.a2m,NP_001162.5.npy,ClinVar
+NP_001167560.1,NP_001167560.1.csv,MAAATRRVFHLQPCENSPTMSQNGYFEDSSYYKCDTDDTFEAREEILGDEAFDTANSSIVSGESIRFFVNVNLEMQATNTENEATSGGCVLLHTSRKYLKLKNFKEEIRAHRDLDGFLAQASIVLNETATSLDNVLRTMLRRFARDPDNNEPNCNLDLLMAMLFTDAGAPMRGKVHLLSDTIQGVTATVTGVRYQQSWLCIICTMKALQKRHVCISRLVRPQNWGENSCEVRFVILVLAPPKMKSTKTAMEVARTFATMFSDIAFRQKLLETRTEEEFKEALVHQRQLLTMVSHGPVAPRTKERSTVSLPAHRHPEPPKCKDFVPFGKGIREDIARRFPLYPLDFTDGIIGKNKAVGKYITTTLFLYFACLLPTIAFGSLNDENTDGAIDVQKTIAGQSIGGLLYALFSGQPLVILLTTAPLALYIQVIRVICDDYDLDFNSFYAWTGLWNSFFLALYAFFNLSLVMSLFKRSTEEIIALFISITFVLDAVKGTVKIFWKYYYGHYLDDYHTKRTSSLVSLSGLGASLNASLHTALNASFLASPTELPSATHSGQATAVLSLLIMLGTLWLGYTLYQFKKSPYLHPCVREILSDCALPIAVLAFSLISSHGFREIEMSKFRYNPSESPFAMAQIQSLSLRAVSGAMGLGFLLSMLFFIEQNLVAALVNAPENRLVKGTAYHWDLLLLAIINTGLSLFGLPWIHAAYPHSPLHVRALALVEERVENGHIYDTIVNVKETRLTSLGASVLVGLSLLLLPVPLQWIPKPVLYGLFLYIALTSLDGNQLVQRVALLLKEQTAYPPTHYIRRVPQRKIHYFTGLQVLQLLLLCAFGMSSLPYMKMIFPLIMIAMIPIRYILLPRIIEAKYLDVMDAEHRP,mutated_sequence,1.0,875.0,NP_001167560.1.a2m,NP_001167560.1.npy,ClinVar
+NP_001167618.1,NP_001167618.1.csv,MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGDHMNPYGNDSIFHDIDSDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS,mutated_sequence,1.0,402.0,NP_001167618.1.a2m,NP_001167618.1.npy,ClinVar
+NP_001171809.1,NP_001171809.1.csv,MESLLLPVLLLLAILWTQAAALINLKYSVEEEQRAGTVIANVAKDAREAGFALDPRQASAFRVVSNSAPHLVDINPSSGLLVTKQKIDRDLLCRQSPKCIISLEVMSSSMEICVIKVEIKDLNDNAPSFPAAQIELEISEAASPGTRIPLDSAYDPDSGSFGVQTYELTPNELFGLEIKTRGDGSRFAELVVEKSLDRETQSHYSFRITALDGGDPPRLGTVGLSIKVTDSNDNNPVFSESTYAVSVPENSPPNTPVIRLNASDPDEGTNGQVVYSFYGYVNDRTRELFQIDPHSGLVTVTGALDYEEGHVYELDVQAKDLGPNSIPAHCKVTVSVLDTNDNPPVINLLSVNSELVEVSESAPPGYVIALVRVSDRDSGLNGRVQCRLLGNVPFRLQEYESFSTILVDGRLDREQHDQYNLTIQARDGGVPMLQSAKSFTVLITDENDNHPHFSKPYYQVIVQENNTPGAYLLSVSARDPDLGLNGSVSYQIVPSQVRDMPVFTYVSINPNSGDIYALRSFNHEQTKAFEFKVLAKDGGLPSLQSNATVRVIILDVNDNTPVITAPPLINGTAEVYIPRNSGIGYLVTVVKAEDYDEGENGRVTYDMTEGDRGFFEIDQVNGEVRTTRTFGESSKSSYELIVVAHDHGKTSLSASALVLIYLSPALDAQESMGSVNLSLIFIIALGSIAGILFVTMIFVAIKCKRDNKEIRTYNCSNCLTITCLLGCFIKGQNSKCLHCISVSPISEEQDKKTEEKVSLRGKRIAEYSYGHQKKSSKKKKISKNDIRLVPRDVEETDKMNVVSCSSLTSSLNYFDYHQQTLPLGCRRSESTFLNVENQNTRNTSANHIYHHSFNSQGPQQPDLIINGVPLPETENYSFDSNYVNSRAHLIKSSSTFKDLEGNSLKDSGHEESDQTDSEHDVQRSLYCDTAVNDVLNTSVTSMGSQMPDHDQNEGFHCREECRILGHSDRCWMPRNPMPIRSKSPEHVRNIIALSIEATAADVEAYDDCGPTKRTFATFGKDVSDHPAEERPTLKGKRTVDVTICSPKVNSVIREAGNGCEAISPVTSPLHLKSSLPTKPSVSYTIALAPPARDLEQYVNNVNNGPTRPSEAEPRGADSEKVMHEVSPILKEGRNKESPGVKRLKDIVL,mutated_sequence,1.0,1148.0,NP_001171809.1.a2m,NP_001171809.1.npy,ClinVar
+NP_001174.2,NP_001174.2.csv,MMAAMATARVRMGPRCAQALWRMPWLPVFLSLAAAAAAAAAEQQVPLVLWSSDRDLWAPAADTHEGHITSDLQLSTYLDPALELGPRNVLLFLQDKLSIEDFTAYGGVFGNKQDSAFSNLENALDLAPSSLVLPAVDWYAVSTLTTYLQEKLGASPLHVDLATLRELKLNASLPALLLIRLPYTASSGLMAPREVLTGNDEVIGQVLSTLKSEDVPYTAALTAVRPSRVARDVAVVAGGLGRQLLQKQPVSPVIHPPVSYNDTAPRILFWAQNFSVAYKDQWEDLTPLTFGVQELNLTGSFWNDSFARLSLTYERLFGTTVTFKFILANRLYPVSARHWFTMERLEVHSNGSVAYFNASQVTGPSIYSFHCEYVSSLSKKGSLLVARTQPSPWQMMLQDFQIQAFNVMGEQFSYASDCASFFSPGIWMGLLTSLFMLFIFTYGLHMILSLKTMDRFDDHKGPTISLTQIV,mutated_sequence,1.0,470.0,NP_001174.2.a2m,NP_001174.2.npy,ClinVar
+NP_001177203.1,NP_001177203.1.csv,MNSVRAANRRPRRVSRPRPVQQQQQQPPQQPPPQPPQQQPPQQQPPPPPQQQQQQQPPPPPPPPPPLPQERNNVGERDDDVPADMVAEESGPGAQNSPYQLRRKTLLPKRTACPTKNSMEGASTSTTENFGHRAKRARVSGKSQDLSAAPAEQYLQEKLPDEVVLKIFSYLLEQDLCRAACVCKRFSELANDPILWKRLYMEVFEYTRPMMHPEPGKFYQINPEEYEHPNPWKESFQQLYKGAHVKPGFAEHFYSNPARYKGRENMLYYDTIEDALGGVQEAHFDGLIFVHSGIYTDEWIYIESPITMIGAAPGKVADKVIIENTRDSTFVFMEGSEDAYVGYMTIRFNPDDKSAQHHNAHHCLEITVNCSPIIDHCIIRSTCTVGSAVCVSGQGACPTIKHCNISDCENVGLYITDHAQGIYEDNEISNNALAGIWVKNHGNPIIRRNHIHHGRDVGVFTFDHGMGYFESCNIHRNRIAGFEVKAYANPTVVRCEIHHGQTGGIYVHEKGRGQFIENKIYANNFAGVWITSNSDPTIRGNSIFNGNQGGVYIFGDGRGLIEGNDIYGNALAGIQIRTNSCPIVRHNKIHDGQHGGIYVHEKGQGVIEENEVYSNTLAGVWVTTGSTPVLRRNRIHSGKQVGVYFYDNGHGVLEDNDIYNHMYSGVQIRTGSNPKIRRNKIWGGQNGGILVYNSGLGCIEDNEIFDNAMAGVWIKTDSNPTLRRNKIHDGRDGGICIFNGGRGLLEENDIFRNAQAGVLISTNSHPILRKNRIFDGFAAGIEITNHATATLEGNQIFNNRFGGLFLASGVNVTMKDNKIMNNQDAIEKAVSRGQCLYKISSYTSYPMHDFYRCHTCNTTDRNAICVNCIKKCHQGHDVEFIRHDRFFCDCGAGTLSNPCTLAGEPTHDTDTLYDSAPPIESNTLQHN,mutated_sequence,1.0,927.0,NP_001177203.1.a2m,NP_001177203.1.npy,ClinVar
+NP_001182482.1,NP_001182482.1.csv,MELDFGHFDERDKTSRNMRGSRMNGLPSPTHSAHCSFYRTRTLQALSNEKKAKKVRFYRNGDRYFKGIVYAVSSDRFRSFDALLADLTRSLSDNINLPQGVRYIYTIDGSRKIGSMDELEEGESYVCSSDNFFKKVEYTKNVNPNWSVNVKTSANMKAPQSLASSNSAQARENKDFVRPKLVTIIRSGVKPRKAVRVLLNKKTAHSFEQVLTDITEAIKLETGVVKKLYTLDGKQVTCLHDFFGDDDVFIACGPEKFRYAQDDFSLDENECRVMKGNPSATAGPKASPTPQKTSAKSPGPMRRSKSPADSGNDQDANGTSSSQLSTPKSKQSPISTPTSPGSLRKHKVDLYLPLSLDDSDSLGDSM,mutated_sequence,1.0,366.0,NP_001182482.1.a2m,NP_001182482.1.npy,ClinVar
+NP_001184033.1,NP_001184033.1.csv,MAHSCRWRFPARPGTTGGGGGGGRRGLGGAPRQRVPALLLPPGPPVGGGGPGAPPSPPAVAAAAAAAGSSGAGVPGGAAAASAASSSSASSSSSSSSSASSGPALLRVGPGFDAALQVSAAIGTNLRRFRAVFGESGGGGGSGEDEQFLGFGSDEEVRVRSPTRSPSVKTSPRKPRGRPRSGSDRNSAILSDPSVFSPLNKSETKSGDKIKKKDSKSIEKKRGRPPTFPGVKIKITHGKDISELPKGNKEDSLKKIKRTPSATFQQATKIKKLRAGKLSPLKSKFKTGKLQIGRKGVQIVRRRGRPPSTERIKTPSGLLINSELEKPQKVRKDKEGTPPLTKEDKTVVRQSPRRIKPVRIIPSSKRTDATIAKQLLQRAKKGAQKKIEKEAAQLQGRKVKTQVKNIRQFIMPVVSAISSRIIKTPRRFIEDEDYDPPIKIARLESTPNSRFSAPSCGSSEKSSAASQHSSQMSSDSSRSSSPSVDTSTDSQASEEIQVLPEERSDTPEVHPPLPISQSPENESNDRRSRRYSVSERSFGSRTTKKLSTLQSAPQQQTSSSPPPPLLTPPPPLQPASSISDHTPWLMPPTIPLASPFLPASTAPMQGKRKSILREPTFRWTSLKHSRSEPQYFSSAKYAKEGLIRKPIFDNFRPPPLTPEDVGFASGFSASGTAASARLFSPLHSGTRFDMHKRSPLLRAPRFTPSEAHSRIFESVTLPSNRTSAGTSSSGVSNRKRKRKVFSPIRSEPRSPSHSMRTRSGRLSSSELSPLTPPSSVSSSLSISVSPLATSALNPTFTFPSHSLTQSGESAEKNQRPRKQTSAPAEPFSSSSPTPLFPWFTPGSQTERGRNKDKAPEELSKDRDADKSVEKDKSRERDREREKENKRESRKEKRKKGSEIQSSSALYPVGRVSKEKVVGEDVATSSSAKKATGRKKSSSHDSGTDITSVTLGDTTAVKTKILIKKGRGNLEKTNLDLGPTAPSLEKEKTLCLSTPSSSTVKHSTSSIGSMLAQADKLPMTDKRVASLLKKAKAQLCKIEKSKSLKQTDQPKAQGQESDSSETSVRGPRIKHVCRRAAVALGRKRAVFPDDMPTLSALPWEEREKILSSMGNDDKSSIAGSEDAEPLAPPIKPIKPVTRNKAPQEPPVKKGRRSRRCGQCPGCQVPEDCGVCTNCLDKPKFGGRNIKKQCCKMRKCQNLQWMPSKAYLQKQAKAVKKKEKKSKTSEKKDSKESSVVKNVVDSSQKPTPSAREDPAPKKSSSEPPPRKPVEEKSEEGNVSAPGPESKQATTPASRKSSKQVSQPALVIPPQPPTTGPPRKEVPKTTPSEPKKKQPPPPESGPEQSKQKKVAPRPSIPVKQKPKEKEKPPPVNKQENAGTLNILSTLSNGNSSKQKIPADGVHRIRVDFKEDCEAENVWEMGGLGILTSVPITPRVVCFLCASSGHVEFVYCQVCCEPFHKFCLEENERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTEDEMYEILSNLPESVAYTCVNCTERHPAEWRLALEKELQISLKQVLTALLNSRTTSHLLRYRQAAKPPDLNPETEESIPSRSSPEGPDPPVLTEVSKQDDQQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAINSDGGQPEIKKANSMVKSFFIRQMERVFPWFSVKKSRFWEPNKVSSNSGMLPNAVLPPSLDHNYAQWQEREENSHTEQPPLMKKIIPAPKPKGPGEPDSPTPLHPPTPPILSTDRSREDSPELNPPPGIEDNRQCALCLTYGDDSANDAGRLLYIGQNEWTHVNCALWSAEVFEDDDGSLKNVHMAVIRGKQLRCEFCQKPGATVGCCLTSCTSNYHFMCSRAKNCVFLDDKKVYCQRHRDLIKGEVVPENGFEVFRRVFVDFEGISLRRKFLNGLEPENIHMMIGSMTIDCLGILNDLSDCEDKLFPIGYQCSRVYWSTTDARKRCVYTCKIVECRPPVVEPDINSTVEHDENRTIAHSPTSFTESSSKESQNTAEIISPPSPDRPPHSQTSGSCYYHVISKVPRIRTPSYSPTQRSPGCRPLPSAGSPTPTTHEIVTVGDPLLSSGLRSIGSRRHSTSSLSPQRSKLRIMSPMRTGNTYSRNNVSSVSTTGTATDLESSAKVVDHVLGPLNSSTSLGQNTSTSSNLQRTVVTVGNKNSHLDGSSSSEMKQSSASDLVSKSSSLKGEKTKVLSSKSSEGSAHNVAYPGIPKLAPQVHNTTSRELNVSKIGSFAEPSSVSFSSKEALSFPHLHLRGQRNDRDQHTDSTQSANSSPDEDTEVKTLKLSGMSNRSSIINEHMGSSSRDRRQKGKKSCKETFKEKHSSKSFLEPGQVTTGEEGNLKPEFMDEVLTPEYMGQRPCNNVSSDKIGDKGLSMPGVPKAPPMQVEGSAKELQAPRKRTVKVTLTPLKMENESQSKNALKESSPASPLQIESTSPTEPISASENPGDGPVAQPSPNNTSCQDSQSNNYQNLPVQDRNLMLPDGPKPQEDGSFKRRYPRRSARARSNMFFGLTPLYGVRSYGEEDIPFYSSSTGKKRGKRSAEGQVDGADDLSTSDEDDLYYYNFTRTVISSGGEERLASHNLFREEEQCDLPKISQLDGVDDGTESDTSVTATTRKSSQIPKRNGKENGTENLKIDRPEDAGEKEHVTKSSVGHKNEPKMDNCHSVSRVKTQGQDSLEAQLSSLESSRRVHTSTPSDKNLLDTYNTELLKSDSDNNNSDDCGNILPSDIMDFVLKNTPSMQALGESPESSSSELLNLGEGLGLDSNREKDMGLFEVFSQQLPTTEPVDSSVSSSISAEEQFELPLELPSDLSVLTTRSPTVPSQNPSRLAVISDSGEKRVTITEKSVASSESDPALLSPGVDPTPEGHMTPDHFIQGHMDADHISSPPCGSVEQGHGNNQDLTRNSSTPGLQVPVSPTVPIQNQKYVPNSTDSPGPSQISNAAVQTTPPHLKPATEKLIVVNQNMQPLYVLQTLPNGVTQKIQLTSSVSSTPSVMETNTSVLGPMGGGLTLTTGLNPSLPTSQSLFPSASKGLLPMSHHQHLHSFPAATQSSFPPNISNPPSGLLIGVQPPPDPQLLVSESSQRTDLSTTVATPSSGLKKRPISRLQTRKNKKLAPSSTPSNIAPSDVVSNMTLINFTPSQLPNHPSLLDLGSLNTSSHRTVPNIIKRSKSSIMYFEPAPLLPQSVGGTAATAAGTSTISQDTSHLTSGSVSGLASSSSVLNVVSMQTTTTPTSSASVPGHVTLTNPRLLGTPDIGSISNLLIKASQQSLGIQDQPVALPPSSGMFPQLGTSQTPSTAAITAASSICVLPSTQTTGITAASPSGEADEHYQLQHVNQLLASKTGIHSSQRDLDSASGPQVSNFTQTVDAPNSMGLEQNKALSSAVQASPTSPGGSPSSPSSGQRSASPSVPGPTKPKPKTKRFQLPLDKGNGKKHKVSHLRTSSSEAHIPDQETTSLTSGTGTPGAEAEQQDTASVEQSSQKECGQPAGQVAVLPEVQVTQNPANEQESAEPKTVEEEESNFSSPLMLWLQQEQKRKESITEKKPKKGLVFEISSDDGFQICAESIEDAWKSLTDKVQEARSNARLKQLSFAGVNGLRMLGILHDAVVFLIEQLSGAKHCRNYKFRFHKPEEANEPPLNPHGSARAEVHLRKSAFDMFNFLASKHRQPPEYNPNDEEEEEVQLKSARRATSMDLPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN,mutated_sequence,1.0,3972.0,NP_001184033.1.a2m,NP_001184033.1.npy,ClinVar
+NP_001193908.1,NP_001193908.1.csv,MGSKMNLIEHSHLPTTDEFSFSENLFGVLTEQVAGPLGQNLEVEPYSQYSNVQFPQVQPQISSSSYYSNLGFYPQQPEEWYSPGIYELRRMPAETLYQGETEVAEMPVTKKPRMGASAGRIKGDELCVVCGDRASGYHYNALTCEGCKGFFRRSITKNAVYKCKNGGNCVMDMYMRRKCQECRLRKCKEMGMLAECMYTGLLTEIQCKSKRLRKNVKQHADQTVNEDSEGRDLRQVTSTTKSCREKTELTPDQQTLLHFIMDSYNKQRMPQEITNKILKEEFSAEENFLILTEMATNHVQVLVEFTKKLPGFQTLDHEDQIALLKGSAVEAMFLRSAEIFNKKLPSGHSDLLEERIRNSGISDEYITPMFSFYKSIGELKMTQEEYALLTAIVILSPDRQYIKDREAVEKLQEPLLDVLQKLCKIHQPENPQHFACLLGRLTELRTFNHHHAEMLMSWRVNDHKFTPLLCEIWDVQ,mutated_sequence,1.0,476.0,NP_001193908.1.a2m,NP_001193908.1.npy,ClinVar
+NP_001193927.1,NP_001193927.1.csv,MRPRSGGRPGATGRRRRRLRRRPRGLRCSRLPPPPPLPLLLGLLLAAAGPGAARAKETAFVEVVLFESSPSGDYTTYTTGLTGRFSRAGATLSAEGEIVQMHPLGLCNNNDEEDLYEYGWVGVVKLEQPELDPKPCLTVLGKAKRAVQRGATAVIFDVSENPEAIDQLNQGSEDPLKRPVVYVKGADAIKLMNIVNKQKVARARIQHRPPRQPTEYFDMGIFLAFFVVVSLVCLILLVKIKLKQRRSQNSMNRLAVQALEKMETRKFNSKSKGRREGSCGALDTLSSSSTSDCAICLEKYIDGEELRVIPCTHRFHRKCVDPWLLQHHTCPHCRHNIIEQKGNPSAVCVETSNLSRGRQQRVTLPVHYPGRVHRTNAIPAYPTRTSMDSHGNPVTLLTMDRHGEQSLYSPQTPAYIRSYPPLHLDHSLAAHRCGLEHRAYSPAHPFRRPKLSGRSFSKAACFSQYETMYQHYYFQGLSYPEQEGQSPPSLAPRGPARAFPPSGSGSLLFPTVVHVAPPSHLESGSTSSFSCYHGHRSVCSGYLADCPGSDSSSSSSSGQCHCSSSDSVVDCTEVSNQGVYGSCSTFRSSLSSDYDPFIYRSRSPCRASEAGGSGSSGRGPALCFEGSPPPEELPAVHSHGAGRGEPWPGPASPSGDQVSTCSLEMNYSSNSSLEHRGPNSSTSEVGLEASPGAAPDLRRTWKGGHELPSCACCCEPQPSPAGPSAGAAGSSTLFLGPHLYEGSGPAGGEPQSGSSQGLYGLHPDHLPRTDGVKYEGLPCCFYEEKQVARGGGGGSGCYTEDYSVSVQYTLTEEPPPGCYPGARDLSQRIPIIPEDVDCDLGLPSDCQGTHSLGSWGGTRGPDTPRPHRGLGATREEERALCCQARALLRPGCPPEEAGAVRANFPSALQDTQESSTTATEAAGPRSHSADSSSPGA,mutated_sequence,1.0,936.0,NP_001193927.1.a2m,NP_001193927.1.npy,ClinVar
+NP_001194.1,NP_001194.1.csv,MLLRSAGKLNVGTKKEDGESTAPTPRPKVLRCKCHHHCPEDSVNNICSTDGYCFTMIEEDDSGLPVVTSGCLGLEGSDFQCRDTPIPHQRRSIECCTERNECNKDLHPTLPPLKNRDFVDGPIHHRALLISVTVCSLLLVLIILFCYFRYKRQETRPRYSIGLEQDETYIPPGESLRDLIEQSQSSGSGSGLPLLVQRTIAKQIQMVKQIGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETEIYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDYLKSTTLDAKSMLKLAYSSVSGLCHLHTEIFSTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVKFISDTNEVDIPPNTRVGTKRYMPPEVLDESLNRNHFQSYIMADMYSFGLILWEVARRCVSGGIVEEYQLPYHDLVPSDPSYEDMREIVCIKKLRPSFPNRWSSDECLRQMGKLMTECWAHNPASRLTALRVKKTLAKMSESQDIKL,mutated_sequence,1.0,502.0,NP_001194.1.a2m,NP_001194.1.npy,ClinVar
+NP_001195.2,NP_001195.2.csv,MTSSLQRPWRVPWLPWTILLVSTAAASQNQERLCAFKDPYQQDLGIGESRISHENGTILCSKGSTCYGLWEKSKGDINLVKQGCWSHIGDPQECHYEECVVTTTPPSIQNGTYRFCCCSTDLCNVNFTENFPPPDTTPLSPPHSFNRDETIIIALASVSVLAVLIVALCFGYRMLTGDRKQGLHSMNMMEAAASEPSLDLDNLKLLELIGRGRYGAVYKGSLDERPVAVKVFSFANRQNFINEKNIYRVPLMEHDNIARFIVGDERVTADGRMEYLLVMEYYPNGSLCKYLSLHTSDWVSSCRLAHSVTRGLAYLHTELPRGDHYKPAISHRDLNSRNVLVKNDGTCVISDFGLSMRLTGNRLVRPGEEDNAAISEVGTIRYMAPEVLEGAVNLRDCESALKQVDMYALGLIYWEIFMRCTDLFPGESVPEYQMAFQTEVGNHPTFEDMQVLVSREKQRPKFPEAWKENSLAVRSLKETIEDCWDQDAEARLTAQCAEERMAELMMIWERNKSVSPTVNPMSTAMQNERNLSHNRRVPKIGPYPDYSSSSYIEDSIHHTDSIVKNISSEHSMSSTPLTIGEKNRNSINYERQQAQARIPSPETSVTSLSTNTTTTNTTGLTPSTGMTTISEMPYPDETNLHTTNVAQSIGPTPVCLQLTEEDLETNKLDPKEVDKNLKESSDENLMEHSLKQFSGPDPLSSTSSSLLYPLIKLAVEATGQQDFTQTANGQACLIPDVLPTQIYPLPKQQNLPKRPTSLPLNTKNSTKEPRLKFGSKHKSNLKQVETGVAKMNTINAAEPHVVTVTMNGVAGRNHSVNSHAATTQYANGTVLSGQTTNIVTHRAQEMLQNQFIGEDTRLNINSSPDEHEPLLRREQQAGHDEGVLDRLVDRRERPLEGGRTNSNNNNSNPCSEQDVLAQGVPSTAADPGPSKPRRAQRPNSLDLSATNVLDGSSIQIGESTQDGKSGSGEKIKKRVKTPYSLKRWRPSTWVISTESLDCEVNNNGSNRAVHSKSSTAVYLAEGGTATTMVSKDIGMNCL,mutated_sequence,1.0,1038.0,NP_001195.2.a2m,NP_001195.2.npy,ClinVar
+NP_001203.1,NP_001203.1.csv,MLPLLRCVPRVLGSSVAGLRAAAPASPFRQLLQPAPRLCTRPFGLLSVRAGSERRPGLLRPRGPCACGCGCGSLHTDGDKAFVDFLSDEIKEERKIQKHKTLPKMSGGWELELNGTEAKLVRKVAGEKITVTFNINNSIPPTFDGEEEPSQGQKVEEQEPELTSTPNFVVEVIKNDDGKKALVLDCHYPEDEVGQEDEAESDIFSIREVSFQSTGESEWKDTNYTLNTDSLDWALYDHLMDFLADRGVDNTFADELVELSTALEHQEYITFLEDLKSFVKSQ,mutated_sequence,1.0,282.0,NP_001203.1.a2m,NP_001203.1.npy,ClinVar
+NP_001229825.1,NP_001229825.1.csv,MRTTKVYKLVIHKKGFGGSDDELVVNPKVFPHIKLGDIVEIAHPNDEYSPLLLQVKSLKEDLQKETISVDQTVTQVFRLRPYQDVYVNVVDPKDVTLDLVELTFKDQYIGRGDMWRLKKSLVSTCAYITQKVEFAGIRAQAGELWVKNEKVMCGYISEDTRVVFRSTSAMVYIFIQMSCEMWDFDIYGDLYFEKAVNGFLADLFTKWKEKNCSHEVTVVLFSRTFYDAKSVDEFPEINRASIRQDHKGRFYEDFYKVVVQNERREEWTSLLVTIKKLFIQYPVLVRLEQAEGFPQGDNSTSAQGNYLEAINLSFNVFDKHYINRNFDRTGQMSVVITPGVGVFEVDRLLMILTKQRMIDNGIGVDLVCMGEQPLHAVPLFKLHNRSAPRDSRLGDDYNIPHWINHSFYTSKSQLFCNSFTPRIKLAGKKPASEKAKNGRDTSLGSPKESENALPIQVDYDAYDAQVFRLPGPSRAQCLTTCRSVRERESHSRKSASSCDVSSSPSLPSRTLPTEEVRSQASDDSSLGKSANILMIPHPHLHQYEVSSSLGYTSTRDVLENMMEPPQRDSSAPGRFHVGSAESMLHVRPGGYTPQRALINPFAPSRMPMKLTSNRRRWMHTFPVGPSGEAIQIHHQTRQNMAELQGSGQRDPTHSSAELLELAYHEAAGRHSNSRQPGDGMSFLNFSGTEELSVGLLSNSGAGMNPRTQNKDSLEDSVSTSPDPILTLSAPPVVPGFCCTVGVDWKSLTTPACLPLTTDYFPDRQGLQNDYTEGCYDLLPEADIDRRDEDGVQMTAQQVFEEFICQRLMQGYQIIVQPKTQKPNPAVPPPLSSSPLYSRGLVSRNRPEEEDQYWLSMGRTFHKVTLKDKMITVTRYLPKYPYESAQIHYTYSLCPSHSDSEFVSCWVEFSHERLEEYKWNYLDQYICSAGSEDFSLIESLKFWRTRFLLLPACVTATKRITEGEAHCDIYGDRPRADEDEWQLLDGFVRFVEGLNRIRRRHRSDRMMRKGTAMKGLQMTGPISTHSLESTAPPVGKKGTSALSALLEMEASQKCLGEQQAAVHGGKSSAQSAESSSVAMTPTYMDSPRKDGAFFMEFVRSPRTASSAFYPQVSVDQTATPMLDGTSLGICTGQSMDRGNSQTFGNSQNIGEQGYSSTNSSDSSSQQLVASSLTSSSTLTEILEAMKHPSTGVQLLSEQKGLSPYCFISAEVVHWLVNHVEGIQTQAMAIDIMQKMLEEQLITHASGEAWRTFIYGFYFYKIVTDKEPDRVAMQQPATTWHTAGVDDFASFQRKWFEVAFVAEELVHSEIPAFLLPWLPSRPASYASRHSSFSRSFGGRSQAAALLAATVPEQRTVTLDVDVNNRTDRLEWCSCYYHGNFSLNAAFEIKLHWMAVTAAVLFEMVQGWHRKATSCGFLLVPVLEGPFALPSYLYGDPLRAQLFIPLNISCLLKEGSEHLFDSFEPETYWDRMHLFQEAIAHRFGFVQDKYSASAFNFPAENKPQYIHVTGTVFLQLPYSKRKFSGQQRRRRNSTSSTNQNMFCEERVGYNWAYNTMLTKTWRSSATGDEKFADRLLKDFTDFCINRDNRLVTFWTSCLEKMHASAP,mutated_sequence,1.0,1603.0,NP_001229825.1.a2m,NP_001229825.1.npy,ClinVar
+NP_001230937.1,NP_001230937.1.csv,MAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPKETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLCEDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATNMNETSSRSHAVFNIIFTQKRHDAETNITTEKVSKISLVDLAGSERADSTGAKGTRLKEGANINKSLTTLGKVISALAEMDSGPNKNKKKKKTDFIPYRDSVLTWLLRENLGGNSRTAMVAALSPADINYDETLSTLRYADRAKQIRCNAVINEDPNNKLIRELKDEVTRLRDLLYAQGLGDITDTNTVPGGPKLTNALVGMSPSSSLSALSSRAASVSSLHERILFAPGSEEAIERLKETEKIIAELNETWEEKLRRTEAIRMEREALLAEMGVAMREDGGTLGVFSPKKTPHLVNLNEDPLMSECLLYYIKDGITRVGREDGERRQDIVLSGHFIKEEHCVFRSDSRGGSEAVVTLEPCEGADTYVNGKKVTEPSILRSGNRIIMGKSHVFRFNHPEQARQERERTPCAETPAEPVDWAFAQRELLEKQGIDMKQEMEQRLQELEDQYRREREEATYLLEQQRLDYESKLEALQKQMDSRYYPEVNEEEEEPEDEVQWTERECELALWAFRKWKWYQFTSLRDLLWGNAIFLKEANAISVELKKKVQFQFVLLTDTLYSPLPPDLLPPEAAKDRETRPFPRTIVAVEVQDQKNGATHYWTLEKLRQRLDLMREMYDRAAEVPSSVIEDCDNVVTGGDPFYDRFPWFRLVGSSAISGCNSYPLLNTCMSERMAALTPSPTFSSPDSDATEPAEEQSVGEEEEEEEEEEDEEEEDLEDDVFPEHALCDGRDPFYDRPPLFSLVGRAFVYLSNLLYPVPLVHRVAIVSEKGEVKGFLRVAVQAISADEEAPDYGSGVRQSGTAKISFDDQHFEKFQSESCPVVGMSRSGTSQEELRIVEGQGQGADVGPSADEVNNNTCSAVPPEGLLLDSSEKAALDGPLDAALDHLRLGNTFTFRVTVLQASSISAEYADIFCQFNFIHRHDEAFSTEPLKNTGRGPPLGFYHVQNIAVEVTKSFIEYIKSQPIVFEVFGHYQQHPFPPLCKDVLSPLRPSRRHFPRVMPLSKPVPATKLSTLTRPCPGPCHCKYDLLVYFEICELEANGDYIPAVVDHRGGMPCMGTFLLHQGIQRRITVTLLHETGSHIRWKEVRELVVGRIRNTPETDESLIDPNILSLNILSSGYIHPAQDDRTFYQFEAAWDSSMHNSLLLNRVTPYREKIYMTLSAYIEMENCTQPAVVTKDFCMVFYSRDAKLPASRSIRNLFGSGSLRASESNRVTGVYELSLCHVADAGSPGMQRRRRRVLDTSVAYVRGEENLAGWRPRSDSLILDHQWELEKLSLLQEVEKTRHYLLLREKLETAQRPVPEALSPAFSEDSESHGSSSASSPLSAEGRPSPLEAPNERQRELAVKCLRLLTHTFNREYTHSHVCVSASESKLSEMSVTLLRDPSMSPLGVATLTPSSTCPSLVEGRYGATDLRTPQPCSRPASPEPELLPEADSKKLPSPARATETDKEPQRLLVPDIQEIRVSPIVSKKGYLHFLEPHTSGWARRFVVVRRPYAYMYNSDKDTVERFVLNLATAQVEYSEDQQAMLKTPNTFAVCTEHRGILLQAASDKDMHDWLYAFNPLLAGTIRSKLSRRRSAQMRV,mutated_sequence,1.0,1791.0,NP_001230937.1.a2m,NP_001230937.1.npy,ClinVar
+NP_001241.1,NP_001241.1.csv,MVRLPLQCVLWGCLLTAVHPEPPTACREKQYLINSQCCSLCQPGQKLVSDCTEFTETECLPCGESEFLDTWNRETHCHQHKYCDPNLGLRVQQKGTSETDTICTCEEGWHCTSEACESCVLHRSCSPGFGVKQIATGVSDTICEPCPVGFFSNVSSAFEKCHPWTSCETKDLVVQQAGTNKTDVVCGPQDRLRALVVIPIIFGILFAILLVLVFIKKVAKKPTNKAPHPKQEPQEINFPDDLPGSNTAAPVQETLHGCQPVTQEDGKESRISVQERQ,mutated_sequence,1.0,277.0,NP_001241.1.a2m,NP_001241.1.npy,ClinVar
+NP_001243.1,NP_001243.1.csv,MPEEGSGCSVRRRPYGCVLRAALVPLVAGLVICLVVCIQRFAQAQQQLPLESLGWDVAELQLNHTGPQQDPRLYWQGGPALGRSFLHGPELDKGQLRIHRDGIYMVHIQVTLAICSSTTASRHHPTTLAVGICSPASRSISLLRLSFHQGCTIASQRLTPLARGDTLCTNLTGTLLPSRNTDETFFGVQWVRP,mutated_sequence,1.0,193.0,NP_001243.1.a2m,NP_001243.1.npy,ClinVar
+NP_001243000.2,NP_001243000.2.csv,MECPSCQHVSKEETPKFCSQCGERLPPAAPIADSENNNSTMASASEGEMECGQELKEEGGPCLFPGSDSWQENPEEPCSKASWTVQESKKKKRKKKKKGNKSASSELASLPLSPASPCHLTLLSNPWPQDTALPHSQAQQSGPTGQPSQPPGTATTPLEGDGLSAPTEVGDSPLQAQALGEAGVATGSEAQSSPQFQDHTEGEDQDASIPSGGRGLSQEGTGPPTSAGEGHSRTEDAAQELLLPESKGGSSEPGTELQTTEQQAGASASMAVDAVAEPANAVKGAGKEMKEKTQRMKQPPATTPPFKTHCQEAETKTKDEMAAAEEKVGKNEQGEPEDLKKPEGKNRSAAAVKNEKEQKNQEADVQEVKASTLSPGGGVTVFFHAIISLHFPFNPDLHKVFIRGGEEFGESKWDSNICELHYTRDLGHDRVLVEGIVCISKKHLDKYIPYKYVIYNGESFEYEFIYKHQQKKGEYVNRCLFIKSSLLGSGDWHQYYDIVYMKPHGRLQKVMNHITDGPRKDLVKGKQIAAALMLDSTFSILQTWDTINLNSFFTQFEQFCFVLQQPMIYEGQAQLWTDLQYREKEVKRYLWQHLKKHVVPLPDGKSTDFLPVDCPVRSKLKTGLIVLFVVEKIELLLEGSLDWLCHLLTSDASSPDEFHRDLSHILGIPQSWRLYLVNLCQRCMDTRTYTWLGALPVLHCCMELAPRHKDAWRQPEDTWAALEGLSFSPFREQMLDTSSLLQFMREKQHLLSIDEPLFRSWFSLLPLSHLVMYMENFIEHLGRFPAHILDCLSGIYYRLPGLEQVLNTQDVQDVQNVQNILEMLLRLLDTYRDKIPEEALSPSYLTVCLKLHEAICSSTKLLKFYELPALSAEIVCRMIRLLSLVDSAGQRDETGNNSVQTVFQGTLAATKRWLREVFTKNMLTSSGASFTYVKEIEVWRRLVEIQFPAEHGWKESLLGDMEWRLTKEEPLSQITAYCNSCWDTKGLEDSVAKTFEKCIIEAVSSACQSQTSILQGFSYSDLRKFGIVLSAVITKSWPRTADNFDDILKHLLTLADVKHVFRLCGTDEKILANVTEDAKRLIAVADSVLTKVVGDLLSGTILVGQLELIIKHKNQFLDIWQLREKSLSPQDEQCAVEEALDWRREELLLLKKEKRCVDSLLKMCGNVKHLIQVDFGVLAVRHSQDLSSKRLNDTVTVRLSTSSNSQRATHYHLSSQVQEMAGKIDLLRDSHIFQLFWREAAEPLSEPKEDQEAAELLSEPEEESERHILELEEVYDYLYQPSYRKFIKLHQDLKSGEVTLAEIDVIFKDFVNKYTDLDSELKIMCTVDHQDQRDWIKDRVEQIKEYHHLHQAVHAAKVILQVKESLGLNGDFSVLNTLLNFTDNFDDFRRETLDQINQELIQAKKLLQDISEARCKGLQALSLRKEFICWVREALGGINELKVFVDLASISAGENDIDVDRVACFHDAVQGYASLLFKLDPSVDFSAFMKHLKKLWKALDKDQYLPRKLCDSARNLEWLKTVNESHGSVERSSLTLATAINQRGIYVIQAPKGGQKISPDTVLHLILPESPGSHEESREYSLEEVKELLNKLMLMSGKKDRNNTEVERFSEVFCSVQRLSQAFIDLHSAGNMLFRTWIAMAYCSPKQGVSLQMDFGLDLVTELKEGGDVTELLAALCRQMEHFLDSWKRFVTQKRMEHFYLNFYTAEQLVYLSTELRKQPPSDAALTMLSFIKSNCTLRDVLRASVGCGSEAARYRMRRVMEELPLMLLSEFSLVDKLRIIMEQSMRCLPAFLPDCLDLETLGHCLAHLAGMGGSPVERCLPRGLQVGQPNLVVCGHSEVLPAALAVYMQTPSQPLPTYDEVLLCTPATTFEEVALLLRRCLTLGSLGHKVYSLLFADQLSYEVARQAEELFHNLCTQQHREDYQLVMVCDGDWEHCYLPSAFSQHKVFVTPQAPLEAIQAYLAGHYRVPKQTLSAAAVFNDRLCVGIVASERAGVGKSLYVKRLHDKMKMQLNVKNVPLKTIRLIDPQVDESRVLGALLPFLDAQYQKVPVLFHLDVTSSVQTGIWVFLFKLLILQYLMDINGKMWLRNPCHLYIVEILERRTSVPSRSSSALRTRVPQFSFLDIFPKVTCRPPKEVIDMELSALRSDTEPGMDLWEFCSETFQRPYQYLRRFNQNQDLDTFQYQEGSVEGTPEECLQHFLFHCGVINPSWSELRNFARFLNYQLRDCEASLFCNPSFIGDTLRGFKKFVVTFMIFMARDFATPSLHTSDQSPGKHMVTMDGVREEDLAPFSLRKRWESEPHPYVFFNDDHTTMTFIGFHLQPNINGSVDAISHLTGKVIKRDVMTRDLYQGLLLQRVPFNVDFDKLPRHKKLERLCLTLGIPQATDPDKTYELTTDNMLKILAIEMRFRCGIPVIIMGETGCGKTRLIKFLSDLRRGGTNADTIKLVKVHGGTTADMIYSRVREAENVAFANKDQHQLDTILFFDEANTTEAISCIKEVLCDHMVDGQPLAEDSGLHIIAACNPYRKHSEEMICRLESAGLGYRVSMEETADRLGSIPLRQLVYRVHALPPSLIPLVWDFGQLSDVAEKLYIQQIVQRLVESISLDENGTRVITEVLCASQGFMRKTEDECSFVSLRDVERCVKVFRWFHEHSAMLLAQLNAFLSKSSVSKNHTERDPVLWSLMLAIGVCYHASLEKKDSYRKAIARFFPKPYDDSRLLLDEITRAQDLFLDGVPLRKTIAKNLALKENVFMMVVCIELKIPLFLVGKPGSSKSLAKTIVADAMQGPAAYSDLFRSLKQVHLVSFQCSPHSTPQGIISTFRQCARFQQGKDLQQYVSVVVLDEVGLAEDSPKMPLKTLHPLLEDGCIEDDPAPHKKVGFVGISNWALDPAKMNRGIFVSRGSPNETELIESAKGICSSDILVQDRVQGYFASFAKAYETVCKRQDKEFFGLRDYYSLIKMVFAAAKASNRKPSPQDIAQAVLRNFSGKDDIQALDIFLANLPEAKCSEEVSPMQLIKQNIFGPSQKVPGGEQEDAESRYLLVLTKNYVALQILQQTFFEGDQQPEIIFGSGFPKDQEYTQLCRNINRVKICMETGKMVLLLNLQNLYESLYDALNQYYVHLGGQKYVDLGLGTHRVKCRVHPNFRLIVIEEKDVVYKHFPIPLINRLEKHYLDINTVLEKWQKSIVEELCAWVEKFINVKAHHFQKRHKYSPSDVFIGYHSDACASVVLQVIERQGPRALTEELHQKVSEEAKSILLNCATPDAVVRLSAYSLGGFAAEWLSQEYFHRQRHNSFADFLQAHLHTADLERHAIFTEITTFSRLLTSHDCEILESEVTGRAPKPTLLWLQQFDTEYSFLKEVRNCLTNTAKCKILIFQTDFEDGIRSAQLIASAKYSVINEINKIRENEDRIFVYFITKLSRVGRGTAYVGFHGGLWQSVHIDDLRRSTLMVSDVTRLQHVTISQLFAPGDLPELGLEHRAEDGHEEAMETEASTSGEVAEVAEEAMETESSEKVGKETSELGGSDVSILDTTRLLRSCVQSAVGMLRDQNESCTRNMRRVVLLLGLLNEDDACHASFLRVSKMRLSVFLKKQEESQFHPLEWLAREACNQDALQEAGTFRHTLWKRVQGAVTPLLASMISFIDRDGNLELLTRPDTPPWARDLWMFIFSDTMLLNIPLVMNNERHKGEMAYIVVQNHMNLSENASNNVPFSWKIKDYLEELWVQAQYITDAEGLPKKFVDIFQQTPLGRFLAQLHGEPQQELLQCYLKDFILLTMRVSTEEELKFLQMALWSCTRKLKAASEAPEEEVSLPWVHLAYQRFRSRLQNFSRILTIYPQVLHSLMEARWNHELAGCEMTLDAFAAMACTEMLTRNTLKPSPQAWLQLVKNLSMPLELICSDEHMQGSGSLAQAVIREVRAQWSRIFSTALFVEHVLLGTESRVPELQGLVTEHVFLLDKCLRENSDVKTHGPFEAVMRTLCECKETASKTLSRFGIQPCSICLGDAKDPVCLPCDHVHCLRCLRAWFASEQMICPYCLTALPDEFSPAVSQAHREAIEKHARFRQMCNSFFVDLVSTICFKDNAPPEKEVIESLLSLLFVQKGRLRDAAQRHCEHTKSLSPFNDVVDKTPVIRSVILKLLLKYSFHDVKDYIQEYLTLLKKKAFITEDKTELYMLFINCLEDSILEKTSAYSRNDELNHLEEEGRFLKAYSPASRGREPANEASVEYLQEVARIRLCLDRAADFLSEPEGGPEMAKEKQCYLQQVKQFCIRVENDWHRVYLVRKLSSQRGMEFVQGLSKPGRPHQWVFPKDVVKQQGLRQDHPGQMDRYLVYGDEYKALRDAVAKAVLECKPLGIKTALKACKTPQSQQSAYFLLTLFREVAILYRSHNASLHPTPEQCEAVSKFIGECKILSPPDISRFATSLVDNSVPLLRAGPSDSNLDGTVTEMAIHAAAVLLCGQNELLEPLKNLAFSPATMAHAFLPTMPEDLLAQARRWKGLERVHWYTCPNGHPCSVGECGRPMEQSICIDCHAPIGGIDHKPRDGFHLVKDKADRTQTGHVLGNPQRRDVVTCDRGLPPVVFLLIRLLTHLALLLGASQSSQALINIIKPPVRDPKGFLQQHILKDLEQLAKMLGHSADETIGVVHLVLRRLLQEQHQLSSRRLLNFDTELSTKEMRNNWEKEIAAVISPELEHLDKTLPTMNNLISQDKRISSNPVAKIIYGDPVTFLPHLPRKSVVHCSKIWSCRKRITVEYLQHIVEQKNGKERVPILWHFLQKEAELRLVKFLPEILALQRDLVKQFQNVQQVEYSSIRGFLSKHSSDGLRQLLHNRITVFLSTWNKLRRSLETNGEINLPKDYCSTDLDLDTEFEILLPRRRGLGLCATALVSYLIRLHNEIVYAVEKLSKENNSYSVDAAEVTELHVISYEVERDLTPLILSNCQYQVEEGRETVQEFDLEKIQRQIVSRFLQGKPRLSLKGIPTLVYRHDWNYEHLFMDIKNKMAQDSLPSSVISAISGQLQSYSDACEVLSVVEVTLGFLSTAGGDPNMQLNVYTQDILQMGDQTIHVLKALNRCQLKHTIALWQFLSAHKSEQLLRLHKEPFGEISSRYKADLSPENAKLLSTFLNQTGLDAFLLELHEMIILKLKNPQTQTEERFRPQWSLRDTLVSYMQTKESEILPEMASQFPEEILLASCVSVWKTAAVLKWNREMR,mutated_sequence,1.0,5207.0,NP_001243000.2.a2m,NP_001243000.2.npy,ClinVar
+NP_001243718.1,NP_001243718.1.csv,MSESEGGKDTTPEPSPANGAGPGPEWGLCPGPPAVEGESSGASGLGTPKRRNQHSKHKTVAVASAQRSPRALFCLTLANPLRRSCISIVEWKPFDILILLTIFANCVALGVYIPFPEDDSNTANHNLEQVEYVFLVIFTVETVLKIVAYGLVLHPSAYIRNGWNLLDFIIVVVGLFSVLLEQGPGRPGDAPHTGGKPGGFDVKALRAFRVLRPLRLVSGVPSLHIVLNSIMKALVPLLHIALLVLFVIIIYAIIGLELFLGRMHKTCYFLGSDMEAEEDPSPCASSGSGRACTLNQTECRGRWPGPNGGITNFDNFFFAMLTVFQCVTMEGWTDVLYWMQDAMGYELPWVYFVSLVIFGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKQREKQQMEEDLRGYLDWITQAEELDMEDPSADDNLGPQLAELTNRRRGRLRWFSHSTRSTHSTSSHASLPASDTGSMTETQGDEDEEEGALASCTRCLNKIMKTRVCRRLRRANRVLRARCRRAVKSNACYWAVLLLVFLNTLTIASEHHGQPVWLTQIQEYANKVLLCLFTVEMLLKLYGLGPSAYVSSFFNRFDCFVVCGGILETTLVEVGAMQPLGISVLRCVRLLRIFKVTRHWASLSNLVASLLNSMKSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDQTHTKRSTFDTFPQALLTVFQILTGEDWNVVMYDGIMAYGGPFFPGMLVCIYFIILFICGNYILLNVFLAIAVDNLASGDAGTAKDKGGEKSNEKDLPQENEGLVPGVEKEEEEGARREGADMEEEEEEEEEEEEEEEEEGAGGVELLQEVVPKEKVVPIPEGSAFFCLSQTNPLRKGCHTLIHHHVFTNLILVFIILSSVSLAAEDPIRAHSFRNHILGYFDYAFTSIFTVEILLKMTVFGAFLHRGSFCRSWFNMLDLLVVSVSLISFGIHSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYTCTDEAKHTPQECKGSFLVYPDGDVSRPLVRERLWVNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDAYAEDHGPIYNYRVEISVFFIVYIIIIAFFMMNIFVGFVIITFRAQGEQEYQNCELDKNQRQCVEYALKAQPLRRYIPKNPHQYRVWATVNSAAFEYLMFLLILLNTVALAMQHYEQTAPFNYAMDILNMVFTGLFTIEMVLKIIAFKPKHYFTDAWNTFDALIVVGSIVDIAVTEVNNGGHLGESSEDSSRISITFFRLFRVMRLVKLLSKGEGIRTLLWTFIKSFQALPYVALLIAMIFFIYAVIGMQMFGKVALQDGTQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLASLPGNRCDPESDFGPGEEFTCGSNFAIAYFISFFMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWSEYDPGAKGRIKHLDVVALLRRIQPPLGFGKLCPHRVACKRLVAMNMPLNSDGTVTFNATLFALVRTSLKIKTEGNLEQANQELRIVIKKIWKRMKQKLLDEVIPPPDEEEVTVGKFYATFLIQDYFRKFRRRKEKGLLGNDAAPSTSSALQAGLRSLQDLGPEMRQALTCDTEEEEEEGQEGVEEEDEKDLETNKATMVSQPSARRGSGISVSLPVGDRLPDSLSFGPSDDDRGTPTSSQPSVPQAGSNTHRRGSGALIFTIPEEGNSQPKGTKGQNKQDEDEEVPDRLSYLDEQAGTPPCSVLLPPHRAQRYMDGHLVPRRRLLPPTPAGRKPSFTIQCLQRQGSCEDLPIPGTYHRGRNSGPNRAQGSWATPPQRGRLLYAPLLLVEEGAAGEGYLGRSSGPLRTFTCLHVPGTHSDPSHGKRGSADSLVEAVLISEGLGLFARDPRFVALAKQEIADACRLTLDEMDNAASDLLAQGTSSLYSDEESILSRFDEEDLGDEMACVHAL,mutated_sequence,1.0,1966.0,NP_001243718.1.a2m,NP_001243718.1.npy,ClinVar
+NP_001244025.1,NP_001244025.1.csv,MKFTLGLGSRAWRVSWEGAAAAAAGPGAGGSALRCRAQRVSSPRLGRRGSRLSGALPLCLSRGGGGAQALPDCAGPSPGHPGHPGARQLAGPLAMEQTYGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPNVVKHIRDYKQGDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGSLAQPGPYEASKQPPSQPTLPYNHIYQYPYPSPVSPTGAKMGSHPGVPGTAGHVSIPRSWPSAHSVSNILGIRTFMEQTGALAGSEGTAYSPKMEDWAGVNRTAFPATPAVNGLEKPALEADIKYTQSASTLSAVGGFLPACAYPASNQHGVYSAPGGGYLAPGPPWPPAQGPPLAPPGAGVAVHGGELAAAMTFKHPSREVADRKPPSSGSKAPDALSSLHGLPIPASTS,mutated_sequence,1.0,457.0,NP_001244025.1.a2m,NP_001244025.1.npy,ClinVar
+NP_001245321.1,NP_001245321.1.csv,MLGSLVLRRKALAPRLLLRLLRSPTLRGHGGASGRNVTTGSLGEPQWLRVATGGRPGTSPALFSGRGAATGGRQGGRFDTKCLAAATWGRLPGPEETLPGQDSWNGVPSRAGLGMCALAAALVVHCYSKSPSNKDAALLEAARANNMQEVSRLLSEGADVNAKHRLGWTALMVAAINRNNSVVQVLLAAGADPNLGDDFSSVYKTAKEQGIHSLEVLITREDDFNNRLNNRASFKGCTALHYAVLADDYRTVKELLDGGANPLQRNEMGHTPLDYAREGEVMKLLRTSEAKYQEKQRKREAEERRRFPLEQRLKEHIIGQESAIATVGAAIRRKENGWYDEEHPLVFLFLGSSGIGKTELAKQTAKYMHKDAKKGFIRLDMSEFQERHEVAKFIGSPPGYVGHEEGGQLTKKLKQCPNAVVLFDEVDKAHPDVLTIMLQLFDEGRLTDGKGKTIDCKDAIFIMTSNVASDEIAQHALQLRQEALEMSRNRIAENLGDVQISDKITISKNFKENVIRPILKAHFRRDEFLGRINEIVYFLPFCHSELIQLVNKELNFWAKRAKQRHNITLLWDREVADVLVDGYNVHYGARSIKHEVERRVVNQLAAAYEQDLLPGGCTLRITVEDSDKQLLKSPELPSPQAEKRLPKLRLEIIDKDSKTRRLDIRAPLHPEKVCNTI,mutated_sequence,1.0,677.0,NP_001245321.1.a2m,NP_001245321.1.npy,ClinVar
+NP_001258822.1,NP_001258822.1.csv,MEEGSSSPVSPVDSLGTSEEELERQPKRFGRKRRYSKKSSEDGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNKMTSCSYVAHERLSYAFSVWRMEGAWSMSASH,mutated_sequence,1.0,160.0,NP_001258822.1.a2m,NP_001258822.1.npy,ClinVar
+NP_001258867.1,NP_001258867.1.csv,MALGKVLAMALVLALAVLGSLSPGARAGDCKGQRQVLREAPGFVTDGAGNYSVNGNCEWLIEAPSPQHRILLDFLFLDTECTYDYLFVYDGDSPRGPLLASLSGSTRPPPIEASSGKMLLHLFSDANYNLLGFNASFRFSLCPGGCQSHGQCQPPGVCACEPGWGGPDCGLQECSAYCGSHGTCASPLGPCRCEPGFLGRACDLHLWENQGAGWWHNVSARDPAFSARIGAAGAFLSPPGLLAVFGGQDLNNALGDLVLYNFSANTWESWDLSPAPAARHSHVAVAWAGSLVLMGGELADGSLTNDVWAFSPLGRGHWELLAPPASSSSGPPGLAGHAAALVDDVWLYVSGGRTPHDLFSSGLFRFRLDSTSGGYWEQVIPAGGRPPAATGHSMVFHAPSRALLVHGGHRPSTARFSVRVNSTELFHVDRHVWTTLKGRDGLQGPRERAFHTASVLGNYMVVYGGNVHTHYQEEKCYEDGIFFYHLGCHQWVSGAELAPPGTPEGRAAPPSGRYSHVAAVLGGSVLLVAGGYSGRPRGDLMAYKVPPFVFQAPAPDYHLDYCSMYTDHSVCSRDPECSWCQGACQAAPPPGTPLGACPAASCLGLGRLLGDCQACLAFSSPTAPPRGPGTLGWCVHNESCLPRPEQARCRGEQISGTVGWWGPAPVFVTSLEACVTQSFLPGLHLLTFQQPPNTSQPDKVSIVRSTTITLTPSAETDVSLVYRGFIYPMLPGGPGGPGAEDVAVWTRAQRLHVLARMARGPDTENMEEVGRWVAHQEKETRRLQRPGSARLFPLPGRDHKYAVEIQGQLNGSAGPGHSELTLLWDRTGVPGGSEISFFFLEPYRSSSCTSYSSCLGCLADQGCGWCLTSATCHLRQGGAHCGDDGAGGSLLVLVPTLCPLCEEHRDCHACTQDPFCEWHQSTSRKGDAACSRRGRGRGALKSPEECPPLCSQRLTCEDCLANSSQCAWCQSTHTCFLFAAYLARYPHGGCRGWDDSVHSEPRCRSCDGFLTCHECLQSHECGWCGNEDNPTLGRCLQGDFSGPLGGGNCSLWVGEGLGLPVALPARWAYARCPDVDECRLGLARCHPRATCLNTPLSYECHCQRGYQGDGISHCNRTCLEDCGHGVCSGPPDFTCVCDLGWTSDLPPPTPAPGPPAPRCSRDCGCSFHSHCRKRGPGFCDECQDWTWGEHCERCRPGSFGNATGSRGCRPCQCNGHGDPRRGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRAGGSCFRECGGRALLTNVSSVALGSRRVGGLLPPGGGAARAGPGLSYCVWVVSATEELQPCAPGTLCPPLTLTFSPDSSTPCTLSYVLAFDGFPRFLDTGVVQSDRSLIAAFCGQRRDRPLTVQALSGLLVLHWEANGSSSWGFNASVGSARCGSGGPGSCPVPQECVPQDGAAGAGLCRCPQGWAGPHCRMALCPENCNAHTGAGTCNQSLGVCICAEGFGGPDCATKLDGGQLVWETLMDSRLSADTASRFLHRLGHTMVDGPDATLWMFGGLGLPQGLLGNLYRYSVSERRWTQMLAGAEDGGPGPSPRSFHAAAYVPAGRGAMYLLGGLTAGGVTRDFWVLNLTTLQWRQEKAPQTVELPAVAGHTLTARRGLSLLLVGGYSPENGFNQQLLEYQLATGTWVSGAQSGTPPTGLYGHSAVYHEATDSLYVFGGFRFHVELAAPSPELYSLHCPDRTWSLLAPSQGAKRDRMRNVRGSSRGLGQVPGEQPGSWGFREVRKKMALWAALAGTGGFLEEISPHLKEPRPRLFHASALLGDTMVVLGGRSDPDEFSSDVLLYQVNCNAWLLPDLTRSASVGPPMEESVAHAVAAVGSRLYISGGFGGVALGRLLALTLPPDPCRLLSSPEACNQSGACTWCHGACLSGDQAHRLGCGGSPCSPMPRSPEECRRLRTCSECLARHPRTLQPGDGEASTPRCKWCTNCPEGACIGRNGSCTSENDCRINQREVFWAGNCSEAACGAADCEQCTREGKCMWTRQFKRTGETRRILSVQPTYDWTCFSHSLLNVSPMPVESSPPLPCPTPCHLLPNCTSCLDSKGADGGWQHCVWSSSLQQCLSPSYLPLRCMAGGCGRLLRGPESCSLGCAQATQCALCLRRPHCGWCAWGGQDGGGRCMEGGLSGPRDGLTCGRPGASWAFLSCPPEDECANGHHDCNETQNCHDQPHGYECSCKTGYTMDNMTGLCRPVCAQGCVNGSCVEPDHCRCHFGFVGRNCSTECRCNRHSECAGVGARDHCLLCRNHTKGSHCEQCLPLFVGSAVGGGTCRPCHAFCRGNSHICISRKELQMSKGEPKKYSLDPEEIENWVTEGPSEDEAVCVNCQNNSYGEKCESCLQGYFLLDGKCTKCQCNGHADTCNEQDGTGCPCQNNTETGTCQGSSPSDRRDCYKYQCAKCRESFHGSPLGGQQCYRLISVEQECCLDPTSQTNCFHEPKRRALGPGRTVLFGVQPKFTNVDIRLTLDVTFGAVDLYVSTSYDTFVVRVAPDTGVHTVHIQPPPAPPPPPPPADGGPRGAGDPGGAGASSGPGAPAEPRVREVWPRGLITYVTVTEPSAVLVVRGVRDRLVITYPHEHHALKSSRFYLLLLGVGDPSGPGANGSADSQGLLFFRQDQAHIDLFVFFSVFFSCFFLFLSLCVLLWKAKQALDQRQEQRRHLQEMTKMASRPFAKVTVCFPPDPTAPASAWKPAGLPPPAFRRSEPFLAPLLLTGAGGPWGPMGGGCCPPAIPATTAGLRAGPITLEPTEDGMAGVATLLLQLPGGPHAPNGACLGSALVTLRHRLHEYCGGGGGAGGSGHGTGAGRKGLLSQDNLTSMSL,mutated_sequence,1.0,2845.0,NP_001258867.1.a2m,NP_001258867.1.npy,ClinVar
+NP_001262.3,NP_001262.3.csv,MMRNKDKSQEEDSSLHSNASSHSASEEASGSDSGSQSESEQGSDPGSGHGSESNSSSESSESQSESESESAGSKSQPVLPEAKEKPASKKERIADVKKMWEEYPDVYGVRRSNRSRQEPSRFNIKEEASSGSESGSPKRRGQRQLKKQEKWKQEPSEDEQEQGTSAESEPEQKKVKARRPVPRRTVPKPRVKKQPKTQRGKRKKQDSSDEDDDDDEAPKRQTRRRAAKNVSYKEDDDFETDSDDLIEMTGEGVDEQQDNSETIEKVLDSRLGKKGATGASTTVYAIEANGDPSGDFDTEKDEGEIQYLIKWKGWSYIHSTWESEESLQQQKVKGLKKLENFKKKEDEIKQWLGKVSPEDVEYFNCQQELASELNKQYQIVERVIAVKTSKSTLGQTDFPAHSRKPAPSNEPEYLCKWMGLPYSECSWEDEALIGKKFQNCIDSFHSRNNSKTIPTRECKALKQRPRFVALKKQPAYLGGENLELRDYQLEGLNWLAHSWCKNNSVILADEMGLGKTIQTISFLSYLFHQHQLYGPFLIVVPLSTLTSWQREFEIWAPEINVVVYIGDLMSRNTIREYEWIHSQTKRLKFNALITTYEILLKDKTVLGSINWAFLGVDEAHRLKNDDSLLYKTLIDFKSNHRLLITGTPLQNSLKELWSLLHFIMPEKFEFWEDFEEDHGKGRENGYQSLHKVLEPFLLRRVKKDVEKSLPAKVEQILRVEMSALQKQYYKWILTRNYKALAKGTRGSTSGFLNIVMELKKCCNHCYLIKPPEENERENGQEILLSLIRSSGKLILLDKLLTRLRERGNRVLIFSQMVRMLDILAEYLTIKHYPFQRLDGSIKGEIRKQALDHFNADGSEDFCFLLSTRAGGLGINLASADTVVIFDSDWNPQNDLQAQARAHRIGQKKQVNIYRLVTKGTVEEEIIERAKKKMVLDHLVIQRMDTTGRTILENNSGRSNSNPFNKEELTAILKFGAEDLFKELEGEESEPQEMDIDEILRLAETRENEVSTSATDELLSQFKVANFATMEDEEELEERPHKDWDEIIPEEQRKKVEEEERQKELEEIYMLPRIRSSTKKAQTNDSDSDTESKRQAQRSSASESETEDSDDDKKPKRRGRPRSVRKDLVEGFTDAEIRRFIKAYKKFGLPLERLECIARDAELVDKSVADLKRLGELIHNSCVSAMQEYEEQLKENASEGKGPGKRRGPTIKISGVQVNVKSIIQHEEEFEMLHKSIPVDPEEKKKYCLTCRVKAAHFDVEWGVEDDSRLLLGIYEHGYGNWELIKTDPELKLTDKILPVETDKKPQGKQLQTRADYLLKLLRKGLEKKGAVTGGEEAKLKKRKPRVKKENKVPRLKEEHGIELSSPRHSDNPSEEGEVKDDGLEKSPMKKKQKKKENKENKEKQMSSRKDKEGDKERKKSKDKKEKPKSGDAKSSSKSKRSQGPVHITAGSEPVPIGEDEDDDLDQETFSICKERMRPVKKALKQLDKPDKGLNVQEQLEHTRNCLLKIGDRIAECLKAYSDQEHIKLWRRNLWIFVSKFTEFDARKLHKLYKMAHKKRSQEEEEQKKKDDVTGGKKPFRPEASGSSRDSLISQSHTSHNLHPQKPHLPASHGPQMHGHPRDNYNHPNKRHFSNADRGDWQRERKFNYGGGNNNPPWGSDRHHQYEQHWYKDHHYGDRRHMDAHRSGSYRPNNMSRKRPYDQYSSDRDHRGHRDYYDRHHHDSKRRRSDEFRPQNYHQQDFRRMSDHRPAMGYHGQGPSDHYRSFHTDKLGEYKQPLPPLHPAVSDPRSPPSQKSPHDSKSPLDHRSPLERSLEQKNNPDYNWNVRKT,mutated_sequence,1.0,1828.0,NP_001262.3.a2m,NP_001262.3.npy,ClinVar
+NP_001263274.1,NP_001263274.1.csv,MSDIEEVVEEYEEEEQEEAAVEEEEDWREDEDEQEEAAEEDAEAEAETEETRAEEDEEEEEAKEAEDGPMEESKPKPRSFMPNLVPPKIPDGERVDFDDIHRKRMEKDLNELQALIEAHFENRKKEEEELVSLKDRIERRRAERAEQQRIRNEREKERQNRLAEERARREEEENRRKAEDEARKKKALSNMMHFGGYIQKQAQTERKSGKRQTEREKKKKILAERRKVLAIDHLNEDQLREKAKELWQSIYNLEAEKFDLQEKFKQQKYEINVLRNRINDNQKVSKTRGKAKVTGRWK,mutated_sequence,1.0,298.0,NP_001263274.1.a2m,NP_001263274.1.npy,ClinVar
+NP_001264044.1,NP_001264044.1.csv,MAAQVAAREARDFREAPTLRLTSGAGLEAVGAVELEEEEENEEEAAARRARSFAQDARVRFLGGRLAMMLGFTEEKWSQYLESEDNRQVLGEFLESTSPACLVFSFAASGRLAASQEIPRDANHKLVFISKKITESIGVNDFSQVVLFGELPALSLGHVSAFLDEILVPVLSNKNNHKSWSCFTSQDMEYHIEVMKKKMYIFRGKMSRRTLLPIPTVAGKMDLDQNCSENKPPSNERIILHAIESVVIEWSHQIQEIIERDSVQRLLNGLHLSPQAELDFWMMRRENLSCIYDQLQAPVVLKMVKILTTKQSSYFPTLKDIFLAVENALLEAQDVELYLRPLRRHIQCLQETEFPQTRILIAPLFHTICLIWSHSKFYNTPARVIVLLQEFCNLFINQATAYLSPEDLLRGEIEESLEKVQVAVNILKTFKNSFFNYRKKLASYFMGRKLRPWDFQSHLVFCRFDKFLDRLIKIEDIFATTLEFEKLERLEFGGTKGAILNGQVHEMSEELMELCKLFKQSTYDPSDCTNMEFESDYVAFKSKTLEFDRRLGTIICEAFFNCNGLEAAFKLLTIFGNFLEKPVVMEIFSLHYSTLVHMFNTELDVCKQLYNEHMKQIECGHVVLNKNMPFTSGNMKWAQQVLQRLQMFWSNFASLRYLFLGNPDHALVYQKYVEMTTLLDQFESRIYNEWKSNVDEICEFNLNQPLVKFSAINGLLCVNFDPKLVAVLREVKYLLMLKKQDIPDSALAIFKKRNTILKYIGNLDLLVQGYNKLKQTLLEVEYPLIEDELRAIDEQLTAATTWLTWQDDCWGYIERVRAATSELEHRVERTQKNVKVIQQTMRGWARCVLPPRREHRREAAFTLEDKGDLFTKKYKLIQGDGCKIHNLVEENRKLFKANPSLDTWKIYVEFIDDIVVEGFFQAIMHDLDFFLKNTEKQLKPAPFFQAQMILLPPEIVFKPSLDREAGDGFYDLVEEMLCNSFRMSAQMNRIATHLEIKNYQNDMDNMLGLAEVRQEIMNRVVNVINKVLDFRNTLETHTYLWVDDRAEFMKHFLLYGHAVSSDEMDAHANEEIPEQPPTLEQFKEQIDIYEALYVQMSKFEDFRVFDSWFKVDMKPFKVSLLTIIKKWSWMFQEHLLRFVIDSLNELQEFIKETDSGLQRELNEGDHDGLVDIMVHLLAVRSRQRATDELFEPLKETITLLESYGQKMPEQVYIQLEELPERWETTKKIAATVRHEVSPLHNAEVTLIRKKCILFDAKQAEFRERFRHYAPLGFNAENPYTALDKANEELEALEEEMLQMQESTRLFEVALPEYKQMKQCRKEIKLLKGLWDVIIYVRRSIDNWTKTQWRQIHVEQMDVELRRFAKEIWSLNKEVRVWDAYTGLEGTVKDMTASLRAITELQSPALRDRHWHQLMKAIGVKFLINEATTLADLLALRLHRVEDDVRRIVDKAVKELGTEKVITEISQTWATMKFSYEVHYRTGIPLLKSDEQLFETLEHNQVQLQTLLQSKYVEYFIEQVLSWQNKLNIADLVIFTWMEVQRTWSHLESIFVCSEDIRIQLVKDARRFDGVDAEFKELMFKTAKVENVLEATCRPNLYEKLKDLQSRLSLCEKALAEYLETKRIAFPRFYFVSSADLLDILSKGAQPKQVTCHLAKLFDSIADLQFEDNQDVSAHRAVGMYSKEKEYVPFQAECECVGHVETWLLQLEQTMQETVRHSITEAIVAYEEKPRELWIFDFPAQVALTSSQIWWTTDVGIAFSRLEEGYETALKDFHKKQISQLNTLITLLLGELPPGDRQKIMTICTIDVHARDVVAKLISQKVVSPQAFTWLSQLRHRWEDTQKHCFVNICDAQFQYFYEYLGNSPRLVITPLTDRCYITLTQSLHLTMSGAPAGPAGTGKTETTKDLGRALGMMVYVFNCSEQMDYKSIGNIYKGLVQTGAWGCFDEFNRISVEVLSVVAVQVKMIHDAIRNRKKRFVFLGEAITLKPSVGIFITMNPGYAGRTELPENLKALFRPCAMVAPDIELICEILLVAEGFVDARALARKFITLYTLCKELLSKQDHYDWGLRAIKSVLVVAGSLKRGDKNRPEDQVLMRALRDFNMPKIVTDDIPVFLGLVGDLFPALDVPRRRKLHFEQMVRQSTLELRLQPEESFILKVVQLEELLAVRHSVFVVGNAGTGKSKILRTLNRTYVNMKQKPVWNDLNPKAVTTDELFGFIHHATREWKDGKIVYSYFIGLFSSILREQANLKHDGPKWIVLDGDIDPMWIESLNTVMDDNKVLTLASNERIALTPFMRLLFEIHHLRSATPATVSRAGILYVNPQDLGWNPYVASWIDRRRHQSEKANLTILFDKYVPACLDKLRTSFKTITSIPESSLVQTLCVLLECLLTPENVPSDSPKEVYEVYFVFACIWAFGGTLLQDQISDYQADFSRWWQKEMKAVKFPSQGTIFDYYVDHKTKKLLPWADKIAQFTMDPDVPLQTVLVHTTETARLRYFMELLLEKGKPLMLVGNAGVGKTVFVGDTLASLSEDYIVSRVPFNYYTTSTALQKILEKPLEKKAGHNYGPGGNKKLIYFIDDMNMPEVDLYGTVQPHTLIRQHIDYGHWYDRQKVMLKEIHNCQYVACMNPMVGSFTINPRLQRHFTVFAFNFPSLDALNTIYGQIFSFHFQQQAFAPSILRSGPTLIQATIAFHQTMMCNFLPTAIKFHYIFNLRDLSNVFQGILFASPECLKGPLDLIHLWLHESARVYGDKLIDKKDCDLFQRRMLETAYKYFEGIDSHMLLQQPLIYCHFADRGKDPHYMPVKDWEVLKTILTETLDNYNELNAAMHLVLFEDAMQHVCRISRILRTPQGCALLVGVGGSGKQSLSRLAAYLRGLEVFQITLTEGYGIQELRVDLANLYIRTGAKNMPTVFLLTDAQVLDESFLVLINDLLASGEIPDLFSDEDVDKIISGIHNEVHALGMVDSRENCWKFFMARVRLQLKIILCFSPVGRTLRVRARKFPAIVNCTAIDWFHAWPQEALVSVSRRFIEETKGIEPVHKDSISLFMAHVHTTVNEMSTRYYQNERRHNYTTPKSFLEQISLFKNLLKKKQNEVSEKKERLVNGIQKLKTTASQVGDLKARLASQEAELQLRNHDAEALITKIGLQTEKVSREKTIADAEERKVTAIQTEVFQKQRECEADLLKAEPALVAATAALNTLNRVNLSELKAFPNPPIAVTNVTAAVMVLLAPRGRVPKDRSWKAAKVFMGKVDDFLQALINYDKEHIPENCLKVVNEHYLKDPEFNPNLIRTKSFAAAGLCAWVINIIKFYEVYCDVEPKRQALAQANLELAAATEKLEAIRKKLVDLDRNLSRLTASFEKATAEKVRCQEEVNQTNKTIKLANRLVKELEAKKIRWGQSIKSFEAQEKTLCGDVLLTAAFVSYVGPFTRQYRQELVHCKWVPFLQQKVSIPLTEGLDLISMLTDDATIAAWNNEGLPSDRMSTENAAILTHCERWPLVIDPQQQGIKWIKNKYGMDLKVTHLGQKGFLNAIETALAFGDVILIENLEETIDPVLDPLLGRNTIKKGKYIRIGDKECEFNKNFRLILHTKLANPHYKPELQAQTTLLNFTVTEDGLEAQLLAEVVSIERPDLEKLKLVLTKHQNDFKIELKYLEDDLLLRLSAAEGSFLDDTKLVERLEATKTTVAEIEHKVIEAKENERKINEARECYRPVAARASLLYFVINDLQKINPLYQFSLKAFNVLFHRAIEQADKVEDMQGRISILMESITHAVFLYTSQALFEKDKLTFLSQMAFQILLRKKEIDPLELDFLLRFTVEHTHLSPVDFLTSQSWSAIKAIAVMEEFRGIDRDVEGSAKQWRKWVESECPEKEKLPQEWKKKSLIQKLILLRAMRPDRMTYALRNFVEEKLGAKYVERTRLDLVKAFEESSPATPIFFILSPGVDALKDLEILGKRLGFTIDSGKFHNVSLGQGQETVAEVALEKASKGGHWVILQNVHLVAKWLGTLEKLLERFSQGSHRDYRVFMSAESAPTPDEHIIPQGLLENSIKITNEPPTGMLANLHAALYNFDQDTLEICSKEQEFKSILFSLCYFHACVAGRLRFGPQGWSRSYPFNPGDLTICASVLYNYLEANSKVPWEDLRYLFGEIMYGGHITDDWDRKLCRVYLEEFMNPSLTEDELMLAPGFAAPPYLDYAGYHQYIEEMLPPESPALYGLHPNAEIEFLTVTSNTLFRTLLEMQPRNALSGDELGQSTEEKVKNVLDDILEKLPEEFNMAEIMQKNSNRSPYVLVCFQECERMNILIREIRISLEQLDLSLKGELALSPAVEAQQFALSYDTVPDTWSKLAYPSTYGLAQWFNDLLLRCRELDTWTQDLTLPAVVWLSGFFNPQSFLTAIMQTMARKNEWPLDKTRLTADVTKKTKEDYGHPPREGAYLHGLFMEGARWDTQAGTIVEARLKELACPMPVIFAKATPVDRQETKQTYECPVYRTKLRGPSYIWTFRLKSEEKTAKWVLAGVALLLEA,mutated_sequence,1.0,4516.0,NP_001264044.1.a2m,NP_001264044.1.npy,ClinVar
+NP_001265441.1,NP_001265441.1.csv,MSAAPAYSEDKGGSAGPGEPEYGHDPASGGIFSSDYKRHDDLKEMLDTNKDSLKLEAMKRIVAMIARGKNASDLFPAVVKNVACKNIEVKKLVYVYLVRYAEEQQDLALLSISTFQRGLKDPNQLIRASALRVLSSIRVPIIVPIMMLAIKEAASDMSPYVRKTAAHAIPKLYSLDSDQKDQLIEVIEKLLADKTTLVAGSVVMAFEEVCPERIDLIHKNYRKLCNLLIDVEEWGQVVIISMLTRYARTQFLSPTQNESLLEENAEKAFYGSEEDEAKGAGSEETAAAAAPSRKPYVMDPDHRLLLRNTKPLLQSRSAAVVMAVAQLYFHLAPKAEVGVIAKALVRLLRSHSEVQYVVLQNVATMSIKRRGMFEPYLKSFYIRSTDPTQIKILKLEVLTNLANETNIPTVLREFQTYIRSMDKDFVAATIQAIGRCATNIGRVRDTCLNGLVQLLSNRDELVVAESVVVIKKLLQMQPAQHGEIIKHLAKLTDNIQVPMARASILWLIGEYCEHVPRIAPDVLRKMAKSFTAEEDIVKLQVINLAAKLYLTNSKQTKLLTQYVLSLAKYDQNYDIRDRARFTRQLIVPSEQGGALSRHAKKLFLAPKPAPVLESSFKDRDHFQLGSLSHLLNAKATGYQELPDWPEEAPDPSVRNVEEEDLSLIETHVGLLGEYTEVPEWTKCSNREKRKEKEKPFYSDSEGESGPTESADSDPESESESDSKSSSESGSGESSSESDNEDQDEDEEKGRGSESEQSEEDGKRKTKKKVPERKGEASSSDEGSDSSSSSSESEMTSESEEEQLEPASWSRKTPPSSKSAPATKEISLLDLEDFTPPSVQPVSPPAIVSTSLAADLEGLTLTDSTLVPSLLSPVSGVGRQELLHRVAGEGLAVDYTFSRQPFSGDPHMVSVHIHFSNSSDTPIKGLHVGTPKLPAGISIQEFPEIESLAPGESATAVMGINFCDSTQAANFQLCTQTRQFYVSIQPPVGELMAPVFMSENEFKKEQGKLMGMNEITEKLMLPDTCRSDHIVVQKVTATANLGRVPCGTSDEYRFAGRTLTGGSLVLLTLDARPAGAAQLTVNSEKMVIGTMLVKDVIQALTQ,mutated_sequence,1.0,1101.0,NP_001265441.1.a2m,NP_001265441.1.npy,ClinVar
+NP_001265645.1,NP_001265645.1.csv,MSPVFPMLTVLTMFYYICLRRRARTATRGEMMNTHRAIESNSQTSPLNAEVVQYAKEVVDFSSHYGSENSMSYTMWNLAGVPNVFPSSGDFTQTAVFRTYGTWWDQCPSASLPFKRTPPNFQSQDYVELTFEQQVYPTAVHVLETYHPGAVIRILACSANPYSPNPPAEVRWEILWSERPTKVNASQARQFKPCIKQINFPTNLIRLEVNSSLLEYYTELDAVVLHGVKDKPVLSLKTSLIDMNDIEDDAYAEKDGCGMDSLNKKFSSAVLGEGPNNGYFDKLPYELIQLILNHLTLPDLCRLAQTCKLLSQHCCDPLQYIHLNLQPYWAKLDDTSLEFLQSRCTLVQWLNLSWTGNRGFISVAGFSRFLKVCGSELVRLELSCSHFLNETCLEVISEMCPNLQALNLSSCDKLPPQAFNHIAKLCSLKRLVLYRTKVEQTALLSILNFCSELQHLSLGSCVMIEDYDVIASMIGAKCKKLRTLDLWRCKNITENGIAELASGCPLLEELDLGWCPTLQSSTGCFTRLAHQLPNLQKLFLTANRSVCDTDIDELACNCTRLQQLDILGTRMVSPASLRKLLESCKDLSLLDVSFCSQIDNRAVLELNASFPKVFIKKSFTQ,mutated_sequence,1.0,621.0,NP_001265645.1.a2m,NP_001265645.1.npy,ClinVar
+NP_001268669.1,NP_001268669.1.csv,MATLACRVQFLDDTDPFNSTNFPEPSRPPLFTFREDLALGTQLAGVHRLLQAPHKLDDCTLQLSHNGAYLDLEATLAEQRDELEGFQDDAGRGKKHSIILRTQLSVRVHACIEKLYNSSGRDLRRALFSLKQIFQDDKDLVHEFVVAEGLTCLIKVGAEADQNYQNYILRALGQIMLYVDGMNGVINRNETIQWLYTLIGSKFRLVVKTALKLLLVFVEYSESNAPLLIQAVTAVDTKRGVKPWSNIMEILEEKDGVDTELLVYAMTLVNKTLSGLPDQDTFYDVVDCLEELGIAAVSQRHLNKKGTDLDLVEQLNIYEVALRHEDGDETTEPPPSGCRDRRRASVCSSGGGEHRGLDRRRSRRHSVQSIKSTLSAPTSPCSQSAPSFKPNQVRDLREKEEEEEEEQPITEPSSEEEREDDASCQGKDSKVGAASGQSPTGRDAAPKSSALPAVSNASSQGKPLLVGTAGGTTWHSGSSGSEATPSALLSPPASAARPSSATPGSLKVSPTIDKLPYVPHSPFHLFSYDFEDSSLSTKEKEAESQKENSSSDSFSLSTYSASEPYHFRSFSSNRYSNFGNNSYHSSRPSSGSSVPTTPTSSVSPPQEARLERSSPSGLLTSSFRQHQESLAAERERRRQEREERLQRIEREERNKFSRDYLDKREEQRQAREERYKYLEQLAAEEHEKELRSRSVSRGRADLSLDLTSPAAPACLAPLSHSPSSSDSQEALTVSASSPGTPHHPQASAGDPEPESEAEPEAEAGAGQVADEAGQDIASAHEGAETEVEQALEQEPEERASLSEKERQNEGVNERDNCSASSVSSSSSTLEREEKEDKLSRDRTTGLWPAGVQDAGVNGQCGDILTNKRFMLDMLYAHNRKSPDDEEKGDGEAGRTQQEAEAVASLATRISTLQANSQTQDESVRRVDVGCLDNRGSVKAFAEKFNSGDLGRGSISPDAEPNDKVPETAPVQPKTESDYIWDQLMANPRELRIQDMDFTDLGEEDDIDVLDVDLGHREAPGPPPPPPPTFLGLPPPPPPPLLDSIPPPPVPGNLLVPPPPVFNAPQGLGWSQVPRGQPTFTKKKKTIRLFWNEVRPFDWPCKNNRRCREFLWSKLEPIKVDTSRLEHLFESKSKELSVSKKTAADGKRQEIIVLDSKRSNAINIGLTVLPPPRTIKIAILNFDEYALNKEGIEKILTMIPTDEEKQKIQEAQLANPEIPLGSAEQFLLTLSSISELSARLHLWAFKMDYETTEKEVAEPLLDLKEGIDQLENNKTLGFILSTLLAIGNFLNGTNAKAFELSYLEKVPEVKDTVHKQSLLHHVCTMVVENFPDSSDLYSEIGAITRSAKVDFDQLQDNLCQMERRCKASWDHLKAIAKHEMKPVLKQRMSEFLKDCAERIIILKIVHRRIINRFHSFLLFMGHPPYAIREVNINKFCRIISEFALEYRTTRERVLQQKQKRANHRERNKTRGKMITDTDEEEEVESGKFSGSSPAPPSQPQGLSYAEDAAEHENMKAVLKTSSPSVEDATPALGVRTRSRASRGSTSSWTMGTDDSPNVTDDAADEIMDRIVKSATQVPSQRVVPRERKRSRANRKSLRRTLKSGLTPEEARALGLVGTSELQL,mutated_sequence,1.0,1622.0,NP_001268669.1.a2m,NP_001268669.1.npy,ClinVar
+NP_001269154.1,NP_001269154.1.csv,MLVDGPSERPALCFLLLAVAMSFFGSALSIDETRAHLLLKEKMMRLGGRLVLNTKEELANERLMTLKIAEMKEAMRTLIFPPSMHFFQAKHLIERSQVFNILRMMPKGAALHLHDIGIVTMDWLVRNVTYRPHCHICFTPRGIMQFRFAHPTPRPSEKCSKWILLEDYRKRVQNVTEFDDSLLRNFTLVTQHPEVIYTNQNVVWSKFETIFFTISGLIHYAPVFRDYVFRSMQEFYEDNVLYMEIRARLLPVYELSGEHHDEEWSVKTYQEVAQKFVETHPEFIGIKIIYSDHRSKDVAVIAESIRMAMGLRIKFPTVVAGFDLVGHEDTGHSLHDYKEALMIPAKDGVKLPYFFHAGETDWQGTSIDRNILDALMLNTTRIGHGFALSKHPAVRTYSWKKDIPIEVCPISNQVLKLVSDLRNHPVATLMATGHPMVISSDDPAMFGAKGLSYDFYEVFMGIGGMKADLRTLKQLAMNSIKYSTLLESEKNTFMEIWKKRWDKFIADVATK,mutated_sequence,1.0,511.0,NP_001269154.1.a2m,NP_001269154.1.npy,ClinVar
+NP_001269646.1,NP_001269646.1.csv,MSSPLQRAVGDTKRALSASSSSSASLPFDDRDSNHTSEGNGDSLLADEDTDFEDSLNRNVKKRAAKRPPKTTPVAKHPKKGSRVVHRHSRKQSEPPANDLFNAVKAAKSDMQSLVDEWLDSYKQDQDAGFLELVNFFIQSCGCKGIVTPEMFKKMSNSEIIQHLTEQFNEDSGDYPLIAPGPSWKKFQGSFCEFVRTLVCQCQYSLLYDGFPMDDLISLLTGLSDSQVRAFRHTSTLAAMKLMTSLVKVALQLSVHQDNNQRQYEAERNKGPGQRAPERLESLLEKRKELQEHQEEIEGMMNALFRGVFVHRYRDVLPEIRAICIEEIGCWMQSYSTSFLTDSYLKYIGWTLHDKHREVRLKCVKALKGLYGNRDLTTRLELFTSRFKDRMVSMVMDREYDVAVEAVRLLILILKNMEGVLTDADCESVYPVVYASHRGLASAAGEFLYWKLFYPECEIRMMGGREQRQSPGAQRTFFQLLLSFFVESELHDHAAYLVDSLWDCAGARLKDWEGLTSLLLEKDQNLGDVQESTLIEILVSSARQASEGHPPVGRVTGRKGLTSKERKTQADDRVKLTEHLIPLLPQLLAKFSADAEKVTPLLQLLSCFDLHIYCTGRLEKHLELFLQQLQEVVVKHAEPAVLEAGAHALYLLCNPEFTFFSRADFARSQLVDLLTDRFQQELEELLQSSFLDEDEVYNLAATLKRLSAFYNTHDLTRWELYEPCCQLLQKAVDTGEVPHQVILPALTLVYFSILWTLTHISKSDASQKQLSSLRDRMVAFCELCQSCLSDVDTEIQEQAFVLLSDLLLIFSPQMIVGGRDFLRPLVFFPEATLQSELASFLMDHVFIQPGDLGSGDSQEDHLQIERLHQRRRLLAGFCKLLLYGVLEMDAASDVFKHYNKFYNDYGDIIKETLTRARQIDRSHCSRILLLSLKQLYTELLQEHGPQGLNELPAFIEMRDLARRFALSFGPQQLQNRDLVVMLHKEGIQFSLSELPPAGSSNQPPNLAFLELLSEFSPRLFHQDKQLLLSYLEKCLQHVSQAPGHPWGPVTTYCHSLSPVENTAETSPQVLPSSKRRRVEGPAKPNREDVSSSQEESLQLNSIPPTPTLTSTAVKSRQPLWGLKEMEEEDGSELDFAQGSQPVAGTERSRFLGPQYFQTPHNPSGPGLGNQLMRLSLMEEDEEEELEIQDESNEERQDTDMQASSYSSTSERGLDLLDSTELDIEDF,mutated_sequence,1.0,1226.0,NP_001269646.1.a2m,NP_001269646.1.npy,ClinVar
+NP_001269938.1,NP_001269938.1.csv,MPKIVLNGVTVDFPFQPYKCQQEYMTKVLECLQQKVNGILESPTGTGKTLCLLCTTLAWREHLRDGISARKIAERAQGELFPDRALSSWGNAAAAAGDPIACYTDIPKIIYASRTHSQLTQVINELRNTSYRPKVCVLGSREQLCIHPEVKKQESNHLQIHLCRKKVASRSCHFYNNVEEKSLEQELASPILDIEDLVKSGSKHRVCPYYLSRNLKQQADIIFMPYNYLLDAKSRRAHNIDLKGTVVIFDEAHNVEKMCEESASFDLTPHDLASGLDVIDQVLEEQTKAAQQGEPHPEFSADSPSPGLNMELEDIAKLKMILLRLEGAIDAVELPGDDSGVTKPGSYIFELFAEAQITFQTKGCILDSLDQIIQHLAGRAGVFTNTAGLQKLADIIQIVFSVDPSEGSPGSPAGLGALQSYKVHIHPDAGHRRTAQRSDAWSTTAARKRGKVLSYWCFSPGHSMHELVRQGVRSLILTSGTLAPVSSFALEMQIPFPVCLENPHIIDKHQIWVGVVPRGPDGAQLSSAFDRRFSEECLSSLGKALGNIARVVPYGLLIFFPSYPVMEKSLEFWRARDLARKMEALKPLFVEPRSKGSFSETISAYYARVAAPGSTGATFLAVCRGKASEGLDFSDTNGRGVIVTGLPYPPRMDPRVVLKMQFLDEMKGQGGAGGQFLSGQEWYRQQASRAVNQAIGRVIRHRQDYGAVFLCDHRFAFADARAQLPSWVRPHVRVYDNFGHVIRDVAQFFRVAERTMPAPAPRATAPSVRGEDAVSEAKSPGPFFSTRKAKSLDLHVPSLKQRSSGSPAAGDPESSLCVEYEQEPVPARQRPRGLLAALEHSEQRAGSPGEEQAHSCSTLSLLSEKRPAEEPRGGRKKIRLVSHPEEPVAGAQTDRAKLFMVAVKQELSQANFATFTQALQDYKGSDDFAALAACLGPLFAEDPKKHNLLQGFYQFVRPHHKQQFEEVCIQLTGRGCGYRPEHSIPRRQRAQPVLDPTGRTAPDPKLTVSTAAAQQLDPQEHLNQGRPHLSPRPPPTGDPGSQPQWGSGVPRAGKQGQHAVSAYLADARRALGSAGCSQLLAALTAYKQDDDLDKVLAVLAALTTAKPEDFPLLHRFSMFVRPHHKQRFSQTCTDLTGRPYPGMEPPGPQEERLAVPPVLTHRAPQPGPSRSEKTGKTQSKISSFLRQRPAGTVGAGGEDAGPSQSSGPPHGPAASEWGEPHGRDIAGQQATGAPGGPLSAGCVCQGCGAEDVVPFQCPACDFQRCQACWQRHLQASRMCPACHTASRKQSVMQVFWPEPQ,mutated_sequence,1.0,1300.0,NP_001269938.1.a2m,NP_001269938.1.npy,ClinVar
+NP_001278344.1,NP_001278344.1.csv,MKSCGVSLATAAAAAAAFGDEEKKMAAGKASGESEEASPSLTAEEREALGGLDSRLFGFVRFHEDGARTKALLGKAVRCYESLILKAEGKVESDFFCQLGHFNLLLEDYPKALSAYQRYYSLQSDYWKNAAFLYGLGLVYFHYNAFQWAIKAFQEVLYVDPSFCRAKEIHLRLGLMFKVNTDYESSLKHFQLALVDCNPCTLSNAEIQFHIAHLYETQRKYHSAKEAYEQLLQTENLSAQVKATVLQQLGWMHHTVDLLGDKATKESYAIQYLQKSLEADPNSGQSWYFLGRCYSSIGKVQDAFISYRQSIDKSEASADTWCSIGVLYQQQNQPMDALQAYICAVQLDHGHAAAWMDLGTLYESCNQPQDAIKCYLNATRSKSCSNTSALAARIKYLQAQLCNLPQGSLQNKTKLLPSIEEAWSLPIPAELTSRQGAMNTAQQACKPHHPNTEPVLGLSQTPISQQSLPLHMIPSSQVDDLSSPAKRKRTSSPTKNTSDNWSGGHAVSHPPVQQQAHSWCLTPQKLQHLEQLRANRNNLNPAQKLMLEQLESQFVLMQQHQMRPTGVAQVRSTGIPNGPTADSSLPTNSVSGQQPQLALTRVPSVSQPGVRPACPGQPLANGPFSAGHVPCSTSRTLGSTDTILIGNNHITGSGSNGNVPYLQRNALTLPHNRTNLTSSAEEPWKNQLSNSTQGLHKGQSSHSAGPNGERPLSSTGPSQHLQAAGSGIQNQNGHPTLPSNSVTQGAALNHLSSHTATSGGQQGITLTKESKPSGNILTVPETSRHTGETPNSTASVEGLPNHVHQMTADAVCSPSHGDSKSPGLLSSDNPQLSALLMGKANNNVGTGTCDKVNNIHPAVHTKTDNSVASSPSSAISTATPSPKSTEQTTTNSVTSLNSPHSGLHTINGEGMEESQSPMKTDLLLVNHKPSPQIIPSMSVSIYPSSAEVLKACRNLGKNGLSNSSILLDKCPPPRPPSSPYPPLPKDKLNPPTPSIYLENKRDAFFPPLHQFCTNPNNPVTVIRGLAGALKLDLGLFSTKTLVEANNEHMVEVRTQLLQPADENWDPTGTKKIWHCESNRSHTTIAKYAQYQASSFQESLREENEKRSHHKDHSDSESTSSDNSGRRRKGPFKTIKFGTNIDLSDDKKWKLQLHELTKLPAFVRVVSAGNLLSHVGHTILGMNTVQLYMKVPGSRTPGHQENNNFCSVNINIGPGDCEWFVVPEGYWGVLNDFCEKNNLNFLMGSWWPNLEDLYEANVPVYRFIQRPGDLVWINAGTVHWVQAIGWCNNIAWNVGPLTACQYKLAVERYEWNKLQSVKSIVPMVHLSWNMARNIKVSDPKLFEMIKYCLLRTLKQCQTLREALIAAGKEIIWHGRTKEEPAHYCSICEVEVFDLLFVTNESNSRKTYIVHCQDCARKTSGNLENFVVLEQYKMEDLMQVYDQFTLAPPLPSASS,mutated_sequence,1.0,1453.0,NP_001278344.1.a2m,NP_001278344.1.npy,ClinVar
+NP_001289.1,NP_001289.1.csv,MAKINTQYSHPSRTHLKVKTSDRDLNRAENGLSRAHSSSEETSSVLQPGIAMETRGLADSGQGSFTGQGIARLSRLIFLLRRWAARHVHHQDQGPDSFPDRFRGAELKEVSSQESNAQANVGSQEPADRGRSAWPLAKCNTNTSNNTEEEKKTKKKDAIVVDPSSNLYYRWLTAIALPVFYNWYLLICRACFDELQSEYLMLWLVLDYSADVLYVLDVLVRARTGFLEQGLMVSDTNRLWQHYKTTTQFKLDVLSLVPTDLAYLKVGTNYPEVRFNRLLKFSRLFEFFDRTETRTNYPNMFRIGNLVLYILIIIHWNACIYFAISKFIGFGTDSWVYPNISIPEHGRLSRKYIYSLYWSTLTLTTIGETPPPVKDEEYLFVVVDFLVGVLIFATIVGNVGSMISNMNASRAEFQAKIDSIKQYMQFRKVTKDLETRVIRWFDYLWANKKTVDEKEVLKSLPDKLKAEIAINVHLDTLKKVRIFQDCEAGLLVELVLKLRPTVFSPGDYICKKGDIGKEMYIINEGKLAVVADDGVTQFVVLSDGSYFGEISILNIKGSKSGNRRTANIRSIGYSDLFCLSKDDLMEALTEYPEAKKALEEKGRQILMKDNLIDEELARAGADPKDLEEKVEQLGSSLDTLQTRFARLLAEYNATQMKMKQRLSQLESQVKGGGDKPLADGEVPGDATKTEDKQQ,mutated_sequence,1.0,694.0,NP_001289.1.a2m,NP_001289.1.npy,ClinVar
+NP_001299838.1,NP_001299838.1.csv,MSCKKQRSRKHSVNEKCNMKIEHYFSPVSKEQQNNCSTSLMRMESRGDPRATTNTQAQRFHSPKKNPEDQTMPQNRTIYVTLKVNHRRNQDMKLKLTHSENSSLYMALNTLQAVRKEIETHQGQEMLVRGTEGIKEYINLGMPLSCFPEGGQVVITFSQSKSKQKEDNHIFGRQDKASTECVKFYIHAIGIGKCKRRIVKCGKLHKKGRKLCVYAFKGETIKDALCKDGRFLSFLENDDWKLIENNDTILESTQPVDELEGRYFQVEVEKRMVPSAAASQNPESEKRNTCVLREQIVAQYPSLKRESEKIIENFKKKMKVKNGETLFELHRTTFGKVTKNSSSIKVVKLLVRLSDSVGYLFWDSATTGYATCFVFKGLFILTCRHVIDSIVGDGIEPSKWATIIGQCVRVTFGYEELKDKETNYFFVEPWFEIHNEELDYAVLKLKENGQQVPMELYNGITPVPLSGLIHIIGHPYGEKKQIDACAVIPQGQRAKKCQERVQSKKAESPEYVHMYTQRSFQKIVHNPDVITYDTEFFFGASGSPVFDSKGSLVAMHAAGFAYTYQNETRSIIEFGSTMESILLDIKQRHKPWYEEVFVNQQDVEMMSDEDL,mutated_sequence,1.0,611.0,NP_001299838.1.a2m,NP_001299838.1.npy,ClinVar
+NP_001308049.1,NP_001308049.1.csv,MLQDKGLSESEEAFRAPGPALGEASAANAPEPALAAPGLSGAALGSPPGPGADVVAAAAAEQTIENIKVGLHEKELWKKFHEAGTEMIITKAGRRMFPSYKVKVTGMNPKTKYILLIDIVPADDHRYKFCDNKWMVAGKAEPAMPGRLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQPRLHIVKADENNAFGSKNTAFCTHVFPETSFISVTSYQNHKITQLKIENNPFAKGFRGSDDSDLRVARLQSKEYPVISKSIMRQRLISPQLSATPDVGPLLGTHQALQHYQHENGAHSQLAEPQDLPLSTFPTQRDSSLFYHCLKRRADGTRHLDLPCKRSYLEAPSSVGEDHYFRSPPPYDQQMLSPSYCSEVTPREACMYSGSGPEIAGVSGVDDLPPPPLSCNMWTSVSPYTSYSVQTMETVPYQPFPTHFTATTMMPRLPTLSAQSSQPPGNAHFSVYNQLSQSQVRERGPSASFPRERGLPQGCERKPPSPHLNAANEFLYSQTFSLSRESSLQYHSGMGTVENWTDG,mutated_sequence,1.0,546.0,NP_001308049.1.a2m,NP_001308049.1.npy,ClinVar
+NP_001310218.1,NP_001310218.1.csv,MKIPNIGNVMNKFEILGVVGEGAYGVVLKCRHKETHEIVAIKKFKDSEENEEVKETTLRELKMLRTLKQENIVELKEAFRRRGKLYLVFEYVEKNMLELLEEMPNGVPPEKVKSYIYQLIKAIHWCHKNDIVHRDIKPENLLISHNDVLKLCDFGFARNLSEGNNANYTEYVATRWYRSPELLLGAPYGKSVDMWSVGCILGELSDGQPLFPGESEIDQLFTIQKVLGPLPSEQMKLFYSNPRFHGLRFPAVNHPQSLERRYLGILNSVLLDLMKNLLKLDPADRYLTEQCLNHPTFQTQRLLDRSPSRSAKRKPYHVESSTLSNRNQAGKSTALQSHHRSNSKDIQNLSVGLPRADEGLPANESFLNGNLAGASLSPLHTKTYQASSQPGSTSKDLTNNNIPHLLSPKEAKSKTEFDFNIDPKPSEGPGTKYLKSNSRSQQNRHSFMESSQSKAGTLQPNEKQSRHSYIDTIPQSSRSPSYRTKAKSHGALSDSKSVSNLSEARAQIAEPSTSRYFPSSCLDLNSPTSPTPTRHSDTRTLLSPSGRNNRNEGTLDSRRTTTRHSKTMEELKLPEHMDSSHSHSLSAPHESFSYGLGYTSPFSSQQRPHRHSMYVTRDKVRAKGLDGSLSIGQGMAARANSLQLLSPQPGEQLPPEMTVARSSVKETSREGTSSFHTRQKSEGGVYHDPHSDDGTAPKENRHLYNDPVPRRVGSFYRVPSPRPDNSFHENNVSTRVSSLPSESSSGTNHSKRQPAFDPWKSPENISHSEQLKEKEKQGFFRSMKKKKKKSQTVPNSDSPDLLTLQKSIHSASTPSSRPKEWRPEKISDLQTQSQPLKSLRKLLHLSSASNHPASSDPRFQPLTAQQTKNSFSEIRIHPLSQASGGSSNIRQEPAPKGRPALQLPGQMDPGWHVSSVTRSATEGPSYSEQLGAKSGPNGHPYNRTNRSRMPNLNDLKETAL,mutated_sequence,1.0,960.0,NP_001310218.1.a2m,NP_001310218.1.npy,ClinVar
+NP_001317217.1,NP_001317217.1.csv,MAVRKKDGGPNVKYYEAADTVTQFDNVRLWLGKNYKKYIQAEPPTNKSLSSLVVQLLQFQEEVFGKHVSNAPLTKLPIKCFLDFKAGGSLCHILAAAYKFKSDQGWRRYDFQNPSRMDRNVEMFMTIEKSLVQNNCLSRPNIFLCPEIEPKLLGKLKDIIKRHQGTVTEDKNNASHVVYPVPGNLEEEEWVRPVMKRDKQVLLHWGYYPDSYDTWIPASEIEASVEDAPTPEKPRKVHAKWILDTDTFNEWMNEEDYEVNDDKNPVSRRKKISAKTLTDEVNSPDSDRRDKKGGNYKKRKRSPSPSPTPEAKKKNAKKGPSTPYTKSKRGHREEEQEDLTKDMDEPSPVPNVEEVTLPKTVNTKKDSESAPVKGGTMTDLDEQEDESMETTGKDEDENSTGNKGEQTKNPDLHEDNVTEQTHHIIIPSYAAWFDYNSVHAIERRALPEFFNGKNKSKTPEIYLAYRNFMIDTYRLNPQEYLTSTACRRNLAGDVCAIMRVHAFLEQWGLINYQVDAESRPTPMGPPPTSHFHVLADTPSGLVPLQPKTPQGRQVDADTKAGRKGKELDDLVPETAKGKPELQTSASQQMLNFPDKGKEKPTDMQNFGLRTDMYTKKNVPSKSKAAASATREWTEQETLLLLEALEMYKDDWNKVSEHVGSRTQDECILHFLRLPIEDPYLEDSEASLGPLAYQPIPFSQSGNPVMSTVAFLASVVDPRVASAAAKSALEEFSKMKEEVPTALVEAHVRKVEEAAKVTGKADPAFGLESSGIAGTTSDEPERIEESGNDEARVEGQATDEKKEPKEPREGGGAIEEEAKEKTSEAPKKDEEKGKEGDSEKESEKSDGDPIVDPEKEKEPKEGQEEVLKEVVESEGERKTKVERDIGEGNLSTAAAAALAAAAVKAKHLAAVEERKIKSLVALLVETQMKKLEIKLRHFEELETIMDREREALEYQRQQLLADRQAFHMEQLKYAEMRARQQHFQQMHQQQQQPPPALPPGSQPIPPTGAAGPPAVHGLAVAPASVVPAPAGSGAPPGSLGPSEQIGQAGSTAGPQQQQPAGAPQPGAVPPGVPPPGPHGPSPFPNQQTPPSMMPGAVPGSGHPGVAGNAPLGLPFGMPPPPPPPAPSIIPFGSLADSISINLPAPPNLHGHHHHLPFAPGTLPPPNLPVSMANPLHPNLPATTTMPSSLPLGPGLGSAAAQSPAIVAAVQGNLLPSASPLPDPGTPLPPDPTAPSPGTVTPVPPPQ,mutated_sequence,1.0,1245.0,NP_001317217.1.a2m,NP_001317217.1.npy,ClinVar
+NP_001317629.1,NP_001317629.1.csv,MAKSGGCGAGAGVGGGNGALTWVTLFDQNNAAKKEESETANKNDSSKKLSVERVYQKKTQLEHILLRPDTYIGSVEPLTQFMWVYDEDVGMNCREVTFVPGLYKIFDEILVNAADNKQRDKNMTCIKVSIDPESNIISIWNNGKGIPVVEHKVEKVYVPALIFGQLLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTKFTVETACKEYKHSFKQTWMNNMMKTSEAKIKHFDGEDYTCITFQPDLSKFKMEKLDKDIVALMTRRAYDLAGSCRGVKVMFNGKKLPVNGFRSYVDLYVKDKLDETGVALKVIHELANERWDVCLTLSEKGFQQISFVNSIATTKGGRHVDYVVDQVVGKLIEVVKKKNKAGVSVKPFQVKNHIWVFINCLIENPTFDSQTKENMTLQPKSFGSKCQLSEKFFKAASNCGIVESILNWVKFKAQTQLNKKCSSVKYSKIKGIPKLDDANDAGGKHSLECTLILTEGDSAKSLAVSGLGVIGRDRYGVFPLRGKILNVREASHKQIMENAEINNIIKIVGLQYKKSYDDAESLKTLRYGKIMIMTDQDQDGSHIKGLLINFIHHNWPSLLKHGFLEEFITPIVKASKNKQELSFYSIPEFDEWKKHIENQKAWKIKYYKGLGTSTAKEAKEYFADMERHRILFRYAGPEDDAAITLAFSKKKIDDRKEWLTNFMEDRRQRRLHGLPEQFLYGTATKHLTYNDFINKELILFSNSDNERSIPSLVDGFKPGQRKVLFTCFKRNDKREVKVAQLAGSVAEMSAYHHGEQALMMTIVNLAQNFVGSNNINLLQPIGQFGTRLHGGKDAASPRYIFTMLSTLARLLFPAVDDNLLKFLYDDNQRVEPEWYIPIIPMVLINGAEGIGTGWACKLPNYDAREIVNNVRRMLDGLDPHPMLPNYKNFKGTIQELGQNQYAVSGEIFVVDRNTVEITELPVRTWTQVYKEQVLEPMLNGTDKTPALISDYKEYHTDTTVKFVVKMTEEKLAQAEAAGLHKVFKLQTTLTCNSMVLFDHMGCLKKYETVQDILKEFFDLRLSYYGLRKEWLVGMLGAESTKLNNQARFILEKIQGKITIENRSKKDLIQMLVQRGYESDPVKAWKEAQEKAAEEDETQNQHDDSSSDSGTPSGPDFNYILNMSLWSLTKEKVEELIKQRDAKGREVNDLKRKSPSDLWKEDLAAFVEELDKVESQEREDVLAGMSGKAIKGKVGKPKVKKLQLEETMPSPYGRRIIPEITAMKADASKKLLKKKKGDLDTAAVKVEFDEEFSGAPVEGAGEEALTPSVPINKGPKPKREKKEPGTRVRKTPTSSGKPSAKKVKKRNPWSDDESKSESDLEETEPVVIPRDSLLRRAAAERPKYTFDFSEEEDDDADDDDDDNNDLEELKVKASPITNDGEDEFVPSDGLDKDEYTFSPGKSKATPEKSLHDKKSQDFGNLFSFPSYSQKSEDDSAKFDSNEEDSASVFSPSFGLKQTDKVPSKTVAAKKGKPSSDTVPKPKRAPKQKKVVEAVNSDSDSEFGIPKKTTTPKGKGRGAKKRKASGSENEGDYNPGRKTSKTTSKKPKKTSFDQDSDVDIFPSDFPTEPPSLPRTGRARKEVKYFAESDEEEDDVDFAMFN,mutated_sequence,1.0,1626.0,NP_001317629.1.a2m,NP_001317629.1.npy,ClinVar
+NP_001334650.1,NP_001334650.1.csv,MHTGGETSACKPSSVRLAPSFSFHAAGLQMAGQMPHSHQYSDRRQPNISDQQVSALSYSDQIQQPLTNQRRMPQTFRDPATAPLRKLSVDLIKTYKHINEVYYAKKKRRHQQGQGDDSSHKKERKVYNDGYDDDNYDYIVKNGEKWMDRYEIDSLIGKGSFGQVVKAYDRVEQEWVAIKIIKNKKAFLNQAQIEVRLLELMNKHDTEMKYYIVHLKRHFMFRNHLCLVFEMLSYNLYDLLRNTNFRGVSLNLTRKFAQQMCTALLFLATPELSIIHCDLKPENILLCNPKRSAIKIVDFGSSCQLGQRIYQYIQSRFYRSPEVLLGMPYDLAIDMWSLGCILVEMHTGEPLFSGANEVDQMNKIVEVLGIPPAHILDQAPKARKFFEKLPDGTWNLKKTKDGKREYKPPGTRKLHNILGVETGGPGGRRAGESGHTVADYLKFKDLILRMLDYDPKTRIQPYYALQHSFFKKTADEGTNTSNSVSTSPAMEQSQSSGTTSSTSSSSGGSSGTSNSGRARSDPTHQHRHSGGHFTAAVQAMDCETHSPQVRQQFPAPLGWSGTEAPTQVTVETHPVQETTFHVAPQQNALHHHHGNSSHHHHHHHHHHHHHGQQALGNRTRPRVYNSPTNSSSTQDSMEVGHSHHSMTSLSSSTTSSSTSSSSTGNQGNQAYQNRPVAANTLDFGQNGAMDVNLTVYSNPRQETGIAGHPTYQFSANTGPAHYMTEGHLTMRQGADREESPMTGVCVQQSPVASS,mutated_sequence,1.0,754.0,NP_001334650.1.a2m,NP_001334650.1.npy,ClinVar
+NP_001336813.1,NP_001336813.1.csv,MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAVLCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHWNPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTRLLSRDI,mutated_sequence,1.0,266.0,NP_001336813.1.a2m,NP_001336813.1.npy,ClinVar
+NP_001340274.1,NP_001340274.1.csv,MENSHPPHHHHQQPPPQPGPSGERRNHHWRSYKLMIDPALKKGHHKLYRYDGQHFSLAMSSNRPVEIVEDPRVVGIWTKNKELELSVPKFKIDEFYVGPVPPKQVTFAKLNDNIRENFLRDMCKKYGEVEEVEILYNPKTKKHLGIAKVVFATVRGAKDAVQHLHSTSVMGNIIHVELDTKGETRMRFYELLVTGRYTPQTLPVGELDAVSPIVNETLQLSDALKRLKDGGLSAGCGSGSSSVTPNSGGTPFSQDTAYSSCRLDTPNSYGQGTPLTPRLGTPFSQDSSYSSRQPTPSYLFSQDPAVTFKARRHESKFTDAYNRRHEHHYVHNSPAVTAVAGATAAFRGSSDLPFGAVGGTGGSSGPPFKAQPQDSATFAHTPPPAQATPAPGFKSAFSPYQTPVAHFPPPPEEPTATAAFGARDSGEFRRAPAPPPLPPAEPLAKEKPGTPPGPPPPDTNSMELGGRPTFGWSPEPCDSPGTPTLESSPAGPEKPHDSLDSRIEMLLKEQRTKLLFLREPDSDTELQMEGSPISSSSSQLSPLAPFGTNSQPGFRGPTPPSSRPSSTGLEDISPTPLPDSDEDEELDLGLGPRPPPEPGPPDPAGLLSQTAEVALDLVGDRTPTSEKMDEGQQSSGEDMEISDDEMPSAPITSADCPKPMVVTPGAAAVAAPSVLAPTLPLPPPPGFPPLPPPPPPPPPQPGFPMPPPLPPPPPPPPPAHPAVTVPPPPLPAPPGVPPPPILPPLPPFPPGLFPVMQVDMSHVLGGQWGGMPMSFQMQTQVLSRLMTGQGACPYPPFMAAAAAAASAGLQFVNLPPYRGPFSLSNSGPGRGQHWPPLPKFDPSVPPPGYMPRQEDPHKATVDGVLLVVLKELKAIMKRDLNRKMVEVVAFRAFDEWWDKKERMAKASLTPVKSGEHKDEDRPKPKDRIASCLLESWGKGEGLGYEGLGLGIGLRGAIRLPSFKVKRKEPPDTTSSGDQKRLRPSTSVDEEDEESERERDRDMADTPCELAKRDPKGVGVRRRPARPLELDSGGEEDEKESLSASSSSSASSSSGSSTTSPSSSASDKEEEQESTEEEEEAEEEEEEEVPRSQLSSSSTSSTSDKDDDDDDSDDRDESENDDEDTALSEASEKDEGDSDEEETVSIVTSKAEATSSSESSESSEFESSSESSPSSSEDEEEVVAREEEEEEEEEEMVAEESMASAGPEDFEQDGEEAALAPGAPAVDSLGMEEEVDIETEAVAPEERPSMLDEPPLPVGVEEPADSREPPEEPGLSQEGAMLLSPEPPAKEVEARPPLSPERAPEHDLEVEPEPPMMLPLPLQPPLPPPRPPRPPSPPPEPETTDASHPSVPPEPLAEDHPPHTPGLCGSLAKSQSTETVPATPGGEPPLSGGSSGLSLSSPQVPGSPFSYPAPSPSLSSGGLPRTPGRDFSFTPTFSEPSGPLLLPVCPLPTGRRDERSGPLASPVLLETGLPLPLPLPLPLPLALPAVLRAQARAPTPLPPLLPAPLASCPPPMKRKPGRPRRSPPSMLSLDGPLVRPPAGAALGRELLLLPGQPQTPVFPSTHDPRTVTLDFRNAGIPAPPPPLPPQPPPPPPPPPVEPTKLPFKELDNQWPSEAIPPGPRGRDEVTEEYMELAKSRGPWRRPPKKRHEDLVPPAGSPELSPPQPLFRPRSEFEEMTILYDIWNGGIDEEDIRFLCVTYERLLQQDNGMDWLNDTLWVYHPSTSLSSAKKKKRDDGIREHVTGCARSEGFYTIDKKDKLRYLNSSRASTDEPPADTQGMSIPAQPHASTRAGSERRSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN,mutated_sequence,1.0,1966.0,NP_001340274.1.a2m,NP_001340274.1.npy,ClinVar
+NP_001341533.1,NP_001341533.1.csv,MQSESGIVPDFEVGEEFHEEPKTYYELKSQPLKSSSSAEHPGASKPPISSSSMTSRILLRQQLMREQMQEQERREQQQKLQAAQFMQQRVPVSQTPAINVSVPTTLPSATQVPMEVLKVQTHLENPTKYHIQQAQRQQVKQYLSTTLANKHANQVLSLPCPNQPGDHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLTISNSCPANLPNIKRELTACIFPTESEARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQEPVLENCSQDLLQHHADLTCTTTLDLTDGTITFNNNLGTGTEANQAYSVPTKMGSKLEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC,mutated_sequence,1.0,526.0,NP_001341533.1.a2m,NP_001341533.1.npy,ClinVar
+NP_001341569.1,NP_001341569.1.csv,MLLLLLLLLLLPPLVLRVAASRCLHDETQKSVSLLRPPFSQLPSKSRSSSLTLPSSRDPQPLRIQSCYLGDHISDGAWDPEGEGMRGGSRALAAVREATQRIQAVLAVQGPLLLSRDPAQYCHAVWGDPDSPNYHRCSLLNPGYKGESCLGAKIPDTHLRGYALWPEQGPPQLVQPDGPGVQNTDFLLYVRVAHTSKCHQETVSLCCPGWSTAAQSQLTAALTSWAQRRGFVMLPRLCLKLLGSSNLPTLASQSIRITGPSVIAYAACCQLDSEDRPLAGTIVYCAQHLTSPSLSHSDIVMATLHELLHALGFSGQLFKKWRDCPSGFSVRENCSTRQLVTRQDEWGQLLLTTPAVSLSLAKHLGVSGASLGVPLEEEEGLLSSHWEARLLQGSLMTATFDGAQRTRLDPITLAAFKDSGWYQVNHSAAEELLWGQGSGPEFGLVTTCGTGSSDFFCTGSGLGCHYLHLDKGSCSSDPMLEGCRMYKPLANGSECWKKENGFPAGVDNPHGEIYHPQSRCFFANLTSQLLPGDKPRHPSLTPHLKEAELMGRCYLHQCTGRGAYKVQVEGSPWVPCLPGKVIQIPGYYGLLFCPRGRLCQTNEDINAVTSPPVSLSTPDPLFQLSLELAGPPGHSLGKEQQEGLAEAVLEALASKGGTGRCYFHGPSITTSLVFTVHMWKSPGCQGPSVATLHKALTLTLQKKPLEVYHGGANFTTQPSKLLVTSDHNPSMTHLRLSMGLCLMLLILVGVMGTTAYQKRATLPVRPSASYHSPELHSTRVPVRGIREV,mutated_sequence,1.0,788.0,NP_001341569.1.a2m,NP_001341569.1.npy,ClinVar
+NP_001341641.1,NP_001341641.1.csv,MTPNSMTENGLTAWDKPKHCPDREHDWKLVGMSEACLHRKSHSERRSTLKNEQSSPHLIQTTWTSSIFHLDHDDVNDQSVSSAQTFQTEEKKCKGYIPSYLDKDELCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTRNQCQECRFKKCIYVGMATDLVLDDSKRLAKRKLIEENREKRRREELQKSIGHKPEPTDEEWELIKTVTEAHVATNAQGSHWKQKRKFLPEDIGQAPIVNAPEGGKVDLEAFSHFTKIITPAITRVVDFAKKLPMFCELPCEDQIILLKGCCMEIMSLRAAVRYDPESETLTLNGEMAVTRGQLKNGGLGVVSDAIFDLGMSLSSFNLDDTEVALLQAVLLMSSDRPGLACVERIEKYQDSFLLAFEHYINYRKHHVTHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTELFPPLFLEVFED,mutated_sequence,1.0,461.0,NP_001341641.1.a2m,NP_001341641.1.npy,ClinVar
+NP_001346945.1,NP_001346945.1.csv,MAEQVALSRTQVCGILREELFQGDAFHQSDTHIFIIMGASGDLAKKKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFFARNSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTVYEAVTKNIHESCMSQIGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQNLMVLRFANRIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCLVAMEKPASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPRGSTTATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNELVIRVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFCGSQMHFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGTYKWVNPHKL,mutated_sequence,1.0,515.0,NP_001346945.1.a2m,NP_001346945.1.npy,ClinVar
+NP_001347.3,NP_001347.3.csv,MSHVAVENALGLDQQFAGLDLNSSDNQSGGSTASKGRYIPPHLRNREATKGFYDKDSSGWSSSKDKDAYSSFGSRSDSRGKSSFFSDRGSGSRGRFDDRGRSDYDGIGSRGDRSGFGKFERGGNSRWCDKSDEDDWSKPLPPSERLEQELFSGGNTGINFEKYDDIPVEATGNNCPPHIESFSDVEMGEIIMGNIELTRYTRPTPVQKHAIPIIKEKRDLMACAQTGSGKTAAFLLPILSQIYSDGPGEALRAMKENGRYGRRKQYPISLVLAPTRELAVQIYEEARKFSYRSRVRPCVVYGGADIGQQIRDLERGCHLLVATPGRLVDMMERGKIGLDFCKYLVLDEADRMLDMGFEPQIRRIVEQDTMPPKGVRHTMMFSATFPKEIQMLARDFLDEYIFLAVGRVGSTSENITQKVVWVEESDKRSFLLDLLNATGKDSLTLVFVETKKGADSLEDFLYHEGYACTSIHGDRSQRDREEALHQFRSGKSPILVATAVAARGLDISNVKHVINFDLPSDIEEYVHRIGRTGRVGNLGLATSFFNERNINITKDLLDLLVEAKQEVPSWLENMAYEHHYKGSSRGRSKSSRFSGGFGARDYRQSSGASSSSFSSSRASSSRSGGGGHGSSRGFGGGGYGGFYNSDGYGGNYNSQGVDWWGN,mutated_sequence,1.0,662.0,NP_001347.3.a2m,NP_001347.3.npy,ClinVar
+NP_001350047.1,NP_001350047.1.csv,MAAPTPARPVLTHLLVALFGMGSWAAVNGIWVELPVVVKELPEGWSLPSYVSVLVALGNLGLLVVTLWRRLAPGKDEQVPIRVVQVLGMVGTALLASLWHHVAPVAGQLHSVAFLALAFVLALACCASNVTFLPFLSHLPPRFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPINGTPGPPLDFLERFPASTFFWALTALLVASAAAFQGLLLLLPPPPSVPTGELGSGLQVGAPGAEEEVEESSPLQEPPSQAAGTTPGPDPKAYQLLSARSACLLGLLAATNALTNGVLPAVQSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLGGLSLLGVFCGGYLMALAVLSPCPPLVGTSAGVVLVVLSWVLCLGVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSLLGAVAMFPPTSIYHVFHSRKDCADPCDS,mutated_sequence,1.0,445.0,NP_001350047.1.a2m,NP_001350047.1.npy,ClinVar
+NP_001351.2,NP_001351.2.csv,MAAKSQPNIPKAKSLDGVTNDRTASQGQWGRAWEVDWFSLASVIFLLLFAPFIVYYFIMACDQYSCALTGPVVDIVTGHARLSDIWAKTPPITRKAAQLYTLWVTFQVLLYTSLPDFCHKFLPGYVGGIQEGAVTPAGVVNKYQINGLQAWLLTHLLWFANAHLLSWFSPTIIFDNWIPLLWCANILGYAVSTFAMVKGYFFPTSARDCKFTGNFFYNYMMGIEFNPRIGKWFDFKLFFNGRPGIVAWTLINLSFAAKQRELHSHVTNAMVLVNVLQAIYVIDFFWNETWYLKTIDICHDHFGWYLGWGDCVWLPYLYTLQGLYLVYHPVQLSTPHAVGVLLLGLVGYYIFRVANHQKDLFRRTDGRCLIWGRKPKVIECSYTSADGQRHHSKLLVSGFWGVARHFNYVGDLMGSLAYCLACGGGHLLPYFYIIYMAILLTHRCLRDEHRCASKYGRDWERYTAAVPYRLLPGIF,mutated_sequence,1.0,475.0,NP_001351.2.a2m,NP_001351.2.npy,ClinVar
+NP_001352017.1,NP_001352017.1.csv,MHPPETTTKMASVRFMVTPTKIDDIPGLSDTSPDLSSRSSSRVRFSSRESVPETSRSEPMSEMSGATTSLATVALDPPSDRTSHPQDVIEDLSQNSITGEHSQLLDDGHKKARNAYLNNSNYEEGDEYFDKNLALFEEEMDTRPKVSSLLNRMANYTNLTQGAKEHEEAENITEGKKKPTKTPQMGTFMGVYLPCLQNIFGVILFLRLTWVVGTAGVLQAFAIVLICCCCTMLTAISMSAIATNGVVPAGGSYFMISRALGPEFGGAVGLCFYLGTTFAAAMYILGAIEIFLVYIVPRAAIFHSDDALKESAAMLNNMRVYGTAFLVLMVLVVFIGVRYVNKFASLFLACVIVSILAIYAGAIKSSFAPPHFPVCMLGNRTLSSRHIDVCSKTKEINNMTVPSKLWGFFCNSSQFFNATCDEYFVHNNVTSIQGIPGLASGIITENLWSNYLPKGEIIEKPSAKSSDVLGSLNHEYVLVDITTSFTLLVGIFFPSVTGIMAGSNRSGDLKDAQKSIPIGTILAILTTSFVYLSNVVLFGACIEGVVLRDKFGDAVKGNLVVGTLSWPSPWVIVIGSFFSTCGAGLQSLTGAPRLLQAIAKDNIIPFLRVFGHSKANGEPTWALLLTAAIAELGILIASLDLVAPILSMFFLMCYLFVNLACALQTLLRTPNWRPRFRYYHWALSFMGMSICLALMFISSWYYAIVAMVIAGMIYKYIEYQGAEKEWGDGIRGLSLSAARFALLRLEEGPPHTKNWRPQLLVLLKLDEDLHVKHPRLLTFASQLKAGKGLTIVGSVIVGNFLENYGEALAAEQTIKHLMEAEKVKGFCQLVVAAKLREGISHLIQSCGLGGMKHNTVVMGWPNGWRQSEDARAWKTFIGTVRVTTAAHLALLVAKNISFFPSNVEQFSEGNIDVWWIVHDGGMLMLLPFLLKQHKVWRKCSIRIFTVAQLEDNSIQMKKDLATFLYHLRIEAEVEVVEMHDSDISAYTYERTLMMEQRSQMLRHMRLSKTERDREAQLVKDRNSMLRLTSIGSDEDEETETYQEKVHMTWTKDKYMASRGQKAKSMEGFQDLLNMRPDQSNVRRMHTAVKLNEVIVNKSHEAKLVLLNMPGPPRNPEGDENYMEFLEVLTEGLERVLLVRGGGSEVITIYS,mutated_sequence,1.0,1150.0,NP_001352017.1.a2m,NP_001352017.1.npy,ClinVar
+NP_001352465.1,NP_001352465.1.csv,MAMLPPPGPQSFVHFTKQSLALIEQRIAERKSKEPKEEKKDDDEEAPKPSSDLEAGKQLPFIYGDIPPGMVSEPLEDLDPYYADKKTFIVLNKGKTIFRFNATPALYMLSPFSPLRRISIKILVHSLFSMLIMCTILTNCIFMTMNNPPDWTKNVEYTFTGIYTFESLVKILARGFCVGEFTFLRDPWNWLDFVVIVFAYLTEFVNLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLKHKCFRNSLENNETLESIMNTLESEEDFRKYFYYLEGSKDALLCGFSTDSGQCPEGYTCVKIGRNPDYGYTSFDTFSWAFLALFRLMTQDYWENLYQQTLRAAGKTYMIFFVVVIFLGSFYLINLILAVVAMAYEEQNQANIEEAKQKELEFQQMLDRLKKEQEEAEAIAAAAAEYTSIRRSRIMGLSESSSETSKLSSKSAKERRNRRKKKNQKKLSSGEEKGDAEKLSKSESEDSIRRKSFHLGVEGHRRAHEKRLSTPNQSPLSIRGSLFSARRSSRTSLFSFKGRGRDIGSETEFADDEHSIFGDNESRRGSLFVPHRPQERRSSNISQASRSPPMLPVNGKMHSAVDCNGVVSLVDGRSALMLPNGQLLPEVIIDKATSDDSGTTNQIHKKRRCSSYLLSEDMLNDPNLRQRAMSRASILTNTVEELEESRQKCPPWWYRFAHKFLIWNCSPYWIKFKKCIYFIVMDPFVDLAITICIVLNTLFMAMEHHPMTEEFKNVLAIGNLVFTGIFAAEMVLKLIAMDPYEYFQVGWNIFDSLIVTLSLVELFLADVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLIVYMMVMVIGNLVVLNLFLALLLSSFSSDNLTAIEEDPDANNLQIAVTRIKKGINYVKQTLREFILKAFSKKPKISREIRQAEDLNTKKENYISNHTLAEMSKGHNFLKEKDKISGFGSSVDKHLMEDSDGQSFIHNPSLTVTVPIAPGESDLENMNAEELSSDSDSEYSKVRLNRSSSSECSTVDNPLPGEGEEAEAEPMNSDEPEACFTDGCVWRFSCCQVNIESGKGKIWWNIRKTCYKIVEHSWFESFIVLMILLSSGALAFEDIYIERKKTIKIILEYADKIFTYIFILEMLLKWIAYGYKTYFTNAWCWLDFLIVDVSLVTLVANTLGYSDLGPIKSLRTLRALRPLRALSRFEGMRVVVNALIGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYECINTTDGSRFPASQVPNRSECFALMNVSQNVRWKNLKVNFDNVGLGYLSLLQVATFKGWTIIMYAAVDSVNVDKQPKYEYSLYMYIYFVVFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPGNKIQGCIFDLVTNQAFDISIMVLICLNMVTMMVEKEGQSQHMTEVLYWINVVFIILFTGECVLKLISLRHYYFTVGWNIFDFVVVIISIVGMFLADLIETYFVSPTLFRVIRLARIGRILRLVKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEDGINDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPPDCDPKKVHPGSSVEGDCGNPSVGIFYFVSYIIISFLVVVNMYIAVILENFSVATEESTEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDSLRSQMEERFMSANPSKVSYEPITTTLKRKQEDVSATVIQRAYRRYRLRQNVKNISSIYIKDGDRDDDLLNKKDMAFDNVNENSSPEKTDATSSTTSPPSYDSVTKPDKEKYEQDRTEKEDKGKDSKESKK,mutated_sequence,1.0,1988.0,NP_001352465.1.a2m,NP_001352465.1.npy,ClinVar
+NP_001352928.1,NP_001352928.1.csv,MASERPEPEVEEAGQVFLLMKKDYRISRNVRLAWFLSHLHQTVQATPQEMLLQSEQELEVLSVLPPGWQPDEPVVPRPFLLVPSTRVTFLAWQYRFVIELDLSPSTGIVDDSTGEILFDEVFHALSRCLGGLLRPFRVPGSCIDFQPEIYVTIQAYSSIIGLQSHQVLVQGCLLDPSQREVFLQQIYEQLCLFEDKVATMLQQQYDPQSQAEDQSPDSGDLLGRKVGVSMVTADLGLVSMIRQGILALQLLPSNSSAGIIVITDGVTSVPDVAVCETLLNQLRSGTVACSFVQVGGVYSYDCSFGHVPNVELMKFIAMATFGSYLSTCPEPEPGNLGLTVYHRAFLLYSFLRSGEALNPEYYCGSQHRLFNEHLVSASSNPALALRRKKHTEKEVPADLVSTVSVRLREGYSVREVTLAKGGSQLEVKLVLLWKHNMRIEYVAMAPWPLEPEGPRVTRVEVTMEGGYDILHDVSCALRQPIRSLYRTHVIRRFWNTLQSINQTDQMLAHLQSFSSVPEHFTLPDSTKSGVPLFYIPPGSTTPVLSLQPSGSDSSHAQFAAYWKPVLSMDANSWQRWLHMHRLVLILEHDTPIPKHLHTPGSNGRYSTIQCRISHSSLTSLLRDWSSFVLVEGYSYVKLLSSAPDQPPNSFYMVRIISKAPCMVLRLGFPIGTPAPARHKIVSGLREEILRLRFPHRVQSKEPTPKVKRKGLGGAGGGSSPSKSPPVLGPQQALSDRPCLVVLHKPLDKLLIRYEKLPLDYRAPFLLTLEPPGPLPLVSGRSASSSLASLSRYLYHQRWLWSVPSGLAPALPLSAIAQLLSILTEVRLSEGFHFACSGEGIINMVLELPIQNEPPGQAAAEEKHTCVVQYILFPPHSTSTKDSFSTDDDNDVEVEALEGDSELNLVTEVWVEPQYGRVGPGPGIWKHLQDLTYSEIPQALHPRDAACIGSMLSFEYLIQLCQSKEWGPLPPEPRVSDGLDQGGDTCVHEIPFHFDLMGLLPQCQQLQMFFLLLAREPEGVPFAEGSCPANDMVLCLLHSCLGQELSDREIPLTPVDQAAFLSEVLRRTCHVPGAEGPLLGVHGIPKEQAVGSTQATGDSAFTSLSVGLPETLKPLISAQPPQWRCYARLVNPQHVFLTFLPATFSDVQRLAACGLEGPPQEETKPKFGDWSGAPSLKDLGGTGIKATKSHVPVLSVTLASDNAQNQGELSPPFRRDLQAYAGRQASQTESADGPRTRCPVYIYSCSLEALREQMVGMQPPQAPRDLIFRTQFLDHPSPSSAWMEPRYKEAANHCALLQEHAQRCYVRGLFRSLQQAQSVTSQDLLTAVDACEELLQEIDITPFLLALCGHTWGLPHAPPSPGPLSPGPFSSSMEEGAEPRERAILASESSIETEDLSEPEFQSTRVPGIPDPGPEISLTDVCQLRGEAHGALHSVIQEKFLEISRLHFRTVPSNPHYFFYCPPSSRREDEGPRDTVDRKISDLEFSEAELMGEEGDTSACCVVTESDPELEVEYRESRESDLGPAGLDSASLSDVDTVNPDEDSFSILGGDSPTGPESFLHDLPPLFLHLTCSVRLRGQHSSVPVCSLPTCLGQVLSSLEGPPVGGRVPLRDLSVTLDVFMLTLPLEVELPTASDPQHHRSTSESSASFPRSPGQPSSLRSDDGLGPPLPPPEEERHPGLSNLATPHRLAIETTMNEIRWLLEDEMVGALRRGGIPQSPALHRAAAHIHSSPGRSTCLRQTLPLSFVFGPERSLTQFKEEFRRLHLPGHVLLEDPDSGFFFVAAGQQPGGSHGEPSSAAWAWHSHEDRAEGIEGETLTASPQAPGSPEDSEGVPLISLPRVPQGGSQPGPSRGLSLMSSQGSVDSDHLGYDGGSSGSDSEGPNDTLGEKAPFTLRTPPGPAPPQPSLSGLPGPCLPDFWLIVRVLQDRVEVYAHARSLIREDGGPGTECRHLQQLLVRRVGEICREVNQRLLLQDLHDSHVCNSLLVAESEEDLWRSETPFHSRQRAPLPSDDYAADESCAPRGYLAATMQFVPGHFSCDVVWGTVIRVHSRLKMGPSMGVSRAIQALRSVLNAFSVVNRKNMFVYQERATKAVYYLRLLETSCSDRPWKGDALPPSLALSRSQEPIYSEEASGPRSPLDMVSSRSSDAARPVGQVDRHIQLLVHGVGQAGPEITDELVRVLCRRLDEATLDVITVMLVRNCKLTPADVEFIQPPGSLPSEVLHLALPTSCRPWLPALAWYLRQNLLIFLHSPKYTDSNSRNHFQHPLPPQGGLPDLDIYLYNKPGGQGTGGKGVACITLAFVDEGGAPLSLALWPPSSPGPPDPLREEEFEQLTQVIRCPVVVDSSSAQNGAPRLRLDVWEKGNISIVQLEEKLRGAARQALADAIIELQLLPASLCTEDTPTGSLRNGSLETKSSAGRASTFPPAPVPGEPVTPPSKAGRRSFWDMLSKTECGDLGSPKTTDDIVLDRPEDTRGRRRHKTESVRTPGGAERAPGSDSGAQRQKRRTTQLEEGEVGTLHPVFARVAQRWMEFMVQIGCASVSRSSAHMVSRFLLPSILSEFTALVTSMAGDTSVRIFEQHLGSEPEIFGPCSPGQLGPSPRPAAERHLLLLGRNFLQWRRPTQQAAKAMQRFEPGGDGSSGRNAPRQRLLLLEVVDKKLQLLTYNWAPDLGAALGRALVRLVQWQNARAHLIFCLLSQKLGLFHHYGQLDFPVRDEKEPNPFLLPTMEVETLIRSASPPLSREQGRLSGSSRGGGPLPLDTFPFDEALRDITAARPSSVLGPVPRPPDPVTYHGQQFLEIKMAERRELERQMKMENLFVTWQQRSTPATMPISAGELETLKQSSRLVHYCATAMLFDPAAWLHGPPETSGPPDGQRRHRPESGSGSREAPTSCESLDVSPPGAREEPWLKELSLAFLQQYVQYLQSIGFVLVPLRPPSPARSTSRPRAMAILGTEGRGSFSCPKTKTDGSPKSTSSPVTTYHLQRALPGGIILMELAFQGCYFCVKQFALECSRIPMGQAVNSQLSMLFTEECDKVRDLMHVHSFSYDFHLRLVHQHVLGAHLVLRHGYHLTTFLRHFLAHHPDGPHFGRNHIYQGTLELPTPLIAAHQLYNYVADHASSYHMKPLRMARPGGPEHNEYALVSAWHSSGSYLDSEGLRHQDDFDVSLLVCHCAAPFEEQGEAERHVLRLQFFVVLTSQRELFPRLTADMRRFRKPPRLPPEPEAPGSSAGSPGEASGLILAPGPAPLFPPLAAEVGMARARLAQLVRLAGGHCRRDTLWKRLFLLEPPGPDRLRLGGRLALAELEELLEAVHAKSIGDIDPQLDCFLSMTVSWYQSLIKVLLSRFPQSCRHFQSPDLGTQYLVVLNQKFTDCFVLVFLDSHLGKTSLTVVFREPFPVQPQDSESPPAQLVSTYHHLESVINTACFTLWTRLL,mutated_sequence,1.0,3432.0,NP_001352928.1.a2m,NP_001352928.1.npy,ClinVar
+NP_001353314.1,NP_001353314.1.csv,MGELCRRDSALTALDEETLWEMMESHRHRIVRCICPSRLTPYLRQAKVLCQLDEEEVLHSPRLTNSAMRAGHLLDLLKTRGKNGAIAFLESLKFHNPDVYTLVTGLQPDVDFSNFSGLMETSKLTECLAGAIGSLQEELNQEKGQKEVLLRRCQQLQEHLGLAETRAEGLHQLEADHSRMKREVSAHFHEVLRLKDEMLSLSLHYSNALQEKELAASRCRSLQEELYLLKQELQRANMVSSCELELQEQSLRTASDQESGDEELNRLKEENEKLRSLTFSLAEKDILEQSLDEARGSRQELVERIHSLRERAVAAERQREQYWEEKEQTLLQFQKSKMACQLYREKVNALQAQVCELQKERDQAYSARDSAQREISQSLVEKDSLRRQVFELTDQVCELRTQLRQLQAEPPGVLKQEARTREPCPREKQRLVRMHAICPRDDSDCSLVSSTESQLLSDLSATSSRELVDSFRSSSPAPPSQQSLYKRVAEDFGEEPWSFSSCLEIPEGDPGALPGAKAGDPHLDYELLDTADLPQLESSLQPVSPGRLDVSESGVLMRRRPARRILSQVTMLAFQGDALLEQISVIGGNLTGIFIHRVTPGSAADQMALRPGTQIVMVDYEASEPLFKAVLEDTTLEEAVGLLRRVDGFCCLSVKVNTDGYKRLLQDLEAKVATSGDSFYIRVNLAMEGRAKGELQVHCNEVLHVTDTMFQGCGCWHAHRVNSYTMKDTAAHGTIPNYSRAQQQLIALIQDMTQQCTVTRKPSSGGPQKLVRIVSMDKAKASPLRLSFDRGQLDPSRMEGSSTCFWAESCLTLVPYTLVRPHRPARPRPVLLVPRAVGKILSEKLCLLQGFKKCLAEYLSQEEYEAWSQRGDIIQEGEVSGGRCWVTRHAVESLMEKNTHALLDVQLDSVCTLHRMDIFPIVIHVSVNEKMAKKLKKGLQRLGTSEEQLLEAARQEEGDLDRAPCLYSSLAPDGWSDLDGLLSCVRQAIADEQKKVVWTEQSPR,mutated_sequence,1.0,1004.0,NP_001353314.1.a2m,NP_001353314.1.npy,ClinVar
+NP_001354.1,NP_001354.1.csv,MADAEVIILPKKHKKKKERKSLPEEDVAEIQHAEEFLIKPESKVAKLDTSQWPLLLKNFDKLNVRTTHYTPLACGSNPLKREIGDYIRTGFINLDKPSNPSSHEVVAWIRRILRVEKTGHSGTLDPKVTGCLIVCIERATRLVKSQQSAGKEYVGIVRLHNAIEGGTQLSRALETLTGALFQRPPLIAAVKRQLRVRTIYESKMIEYDPERRLGIFWVSCEAGTYIRTLCVHLGLLLGVGGQMQELRRVRSGVMSEKDHMVTMHDVLDAQWLYDNHKDESYLRRVVYPLEKLLTSHKRLVMKDSAVNAICYGAKIMLPGVLRYEDGIEVNQEIVVITTKGEAICMAIALMTTAVISTCDHGIVAKIKRVIMERDTYPRKWGLGPKASQKKLMIKQGLLDKHGKPTDSTPATWKQEYVDYSESAKKEVVAEVVKAPQVVAEAAKTAKRKRESESESDETPPAAPQLIKKEKKKSKKDKKAKAGLESGAEPGDGDSDTTKKKKKKKKAKEVELVSE,mutated_sequence,1.0,514.0,NP_001354.1.a2m,NP_001354.1.npy,ClinVar
+NP_001354650.1,NP_001354650.1.csv,MADDDVLFEDVYELCEVIGKGPFSVVRRCINRETGQQFAVKIVDVAKFTSSPGLSTEDLKREASICHMLKHPHIVELLETYSSDGMLYMVFEFMDGADLCFEIVKRADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPHCVLLASKENSAPVKLGGFGVAIQLGESGLVAGGRVGTPHFMAPEVVKREPYGKPVDVWGCGVILFILLSGCLPFYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRMLMLDPAERITVYEALNHPWLKERDRYAYKIHLPETVEQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSEDPTSSGLLAAERAVSQVLDSLEEIHALTDCSEKDLDFLHSVFQDQHLHTLLDLYDKINTKSSPQIRNPPSDAVQRAKEVLEEISCYPENNDAKELKRILTQPHFMALLQTHDVVAHEVYSDEALRVTPPPTSPYLNGDSPESANGDMDMENVTRVRLVQFQKNTDEPMGITLKMNELNHCIVARIMHGGMIHRQGTLHVGDEIREINGISVANQTVEQLQKMLREMRGSITFKIVPSYRTQSSSCERDSPSTSRQSPANGHSSTNNSVSDLPSTTQPKGRQIYVRAQFEYDPAKDDLIPCKEAGIRFRVGDIIQIISKDDHNWWQGKLENSKNGTAGLIPSPELQEWRVACIAMEKTKQEQQASCTWFGKKKKQYKDKYLAKHNAVFDQLDLVTYEEVVKLPAFKRKTLVLLGAHGVGRRHIKNTLITKHPDRFAYPIPHTTRPPKKDEENGKNYYFVSHDQMMQDISNNEYLEYGSHEDAMYGTKLETIRKIHEQGLIAILDVEPQALKVLRTAEFAPFVVFIAAPTITPGLNEDESLQRLQKESDILQRTYAHYFDLTIINNEIDETIRHLEEAVELVCTAPQWVPVSWVY,mutated_sequence,1.0,926.0,NP_001354650.1.a2m,NP_001354650.1.npy,ClinVar
+NP_001355823.1,NP_001355823.1.csv,MQNSHSGVNQLGGVFVNGRPLPDSTRQKIVELAHSGARPCDISRILQTHADAKVQVLDNQNVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATPEVVSKIAQYKRECPSIFAWEIRDRLLSEGVCTNDNIPSVSSINRVLRNLASEKQQMGADGMYDKLRMLNGQTGSWGTRPGWYPGTSVPGQPTQDGCQQQEGGGENTNSISSNGEDSDEAQMRLQLKRKLQRNRTSFTQEQIEALEKEFERTHYPDVFARERLAAKIDLPEARIQVWFSNRRAKWRREEKLRNQRRQASNTPSHIPISSSFSTSVYQPIPQPTTPVSSFTSGSMLGRTDTALTNTYSALPPMPSFTMANNLPMQPPVPSQTSSYSCMLPTSPSVNGRSYDTYTPPHMQTHMNSQPMGTSGTTSTGLISPGVSVPVQVPGSEPDMSQYWPRLQ,mutated_sequence,1.0,436.0,NP_001355823.1.a2m,NP_001355823.1.npy,ClinVar
+NP_001357029.1,NP_001357029.1.csv,MARLTKRRQADTKAIQHLWAAIEIIRNQKQIANIDRITKYMSRVHGMHPKETTRQLSLAVKDGLIVETLTVGCKGSKAGIEQEGYWLPGDEIDWETENHDWYCFECHLPGEVLICDLCFRVYHSKCLSDEFRLRDSSSPWQCPVCRSIKKKNTNKQEMGTYLRFIVSRMKERAIDLNKKGKDNKHPMYRRLVHSAVDVPTIQEKVNEGKYRSYEEFKADAQLLLHNTVIFYGADSEQADIARMLYKDTCHELDELQLCKNCFYLSNARPDNWFCYPCIPNHELVWAKMKGFGFWPAKVMQKEDNQVDVRFFGHHHQRAWIPSENIQDITVNIHRLHVKRSMGWKKACDELELHQRFLREGRFWKSKNEDRGEEEAESSISSTSNEQLKVTQEPRAKKGRRNQSVEPKKEEPEPETEAVSSSQEIPTMPQPIEKVSVSTQTKKLSASSPRMLHRSTQTTNDGVCQSMCHDKYTKIFNDFKDRMKSDHKRETERVVREALEKLRSEMEEEKRQAVNKAVANMQGEMDRKCKQVKEKCKEEFVEEIKKLATQHKQLISQTKKKQWCYNCEEEAMYHCCWNTSYCSIKCQQEHWHAEHKRTCRRKR,mutated_sequence,1.0,602.0,NP_001357029.1.a2m,NP_001357029.1.npy,ClinVar
+NP_001357188.2,NP_001357188.2.csv,MGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSREAEAAEAEEPWGEEAREGRRRGPRRESKPEEPPPPKKPALDKGLGTGQGAVSGPPRKPPGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL,mutated_sequence,1.0,610.0,NP_001357188.2.a2m,NP_001357188.2.npy,ClinVar
+NP_001357524.1,NP_001357524.1.csv,MVVLRAGKKTFLPPLCRAFACRGCQLAPERGAERRDTAPSGVSRFCPPRKSCHDWIGPPDKYSNLRPVHFYIPENESPLEQKLRKLRQETQEWNQQFWANQNLTFSKEKEEFIHSRLKTKGLGLRTESGQKATLNAEEMADFYKEFLSKNFQKHMYYNRDWYKRNFAITFFMGKVALERIWNKLKQKQKKRSN,mutated_sequence,1.0,193.0,NP_001357524.1.a2m,NP_001357524.1.npy,ClinVar
+NP_001357587.1,NP_001357587.1.csv,MSGARSKLALFLCGCYVVALGAHTGEESVADHHEAEYYVAAVYEHPSILSLNPLALISRQEALELMNQNLDIYEQQVMTAAQKDVQIIVFPEDGIHGFNFTRTSIYPFLDFMPSPQVVRWNPCLEPHRFNDTEVLQRLSCMAIRGDMFLVANLGTKEPCHSSDPRCPKDGRYQFNTNVVFSNNGTLVDRYRKHNLYFEAAFDVPLKVDLITFDTPFAGRFGIFTCFDILFFDPAIRVLRDYKVKHVVYPTAWMNQLPLLAAIEIQKAFAVAFGINVLAANVHHPVLGMTGSGIHTPLESFWYHDMENPKSHLIIAQVAKNPVGLIGAENATGETDPSHSKFLKILSGDPYCEKDAQEVHCDEATKWNVNAPPTFHSEMMYDNFTLVPVWGKEGYLHVCSNGLCCYLLYERPTLSKELYALGVFDGLHTVHGTYYIQVCALVRCGGLGFDTCGQEITEATGIFEFHLWGNFSTSYIFPLFLTSGMTLEVPDQLGWENDHYFLRKSRLSSGLVTAALYGRLYERD,mutated_sequence,1.0,523.0,NP_001357587.1.a2m,NP_001357587.1.npy,ClinVar
+NP_001360.1,NP_001360.1.csv,MFRIGRRQLWKHSVTRVLTQRLKGEKEAKRALLDARHNYLFAIVASCLDLNKTEVEDAILEGNQIERIDQLFAVGGLRHLMFYYQDVEEAETGQLGSLGGVNLVSGKIKKPKVFVTEGNDVALTGVCVFFIRTDPSKAITPDNIHQEVSFNMLDAADGGLLNSVRRLLSDIFIPALRATSHGWGELEGLQDAANIRQEFLSSLEGFVNVLSGAQESLKEKVNLRKCDILELKTLKEPTDYLTLANNPETLGKIEDCMKVWIKQTEQVLAENNQLLKEADDVGPRAELEHWKKRLSKFNYLLEQLKSPDVKAVLAVLAAAKSKLLKTWREMDIRITDATNEAKDNVKYLYTLEKCCDPLYSSDPLSMMDAIPTLINAIKMIYSISHYYNTSEKITSLFVKVTNQIISACKAYITNNGTASIWNQPQDVVEEKILSAIKLKQEYQLCFHKTKQKLKQNPNAKQFDFSEMYIFGKFETFHRRLAKIIDIFTTLKTYSVLQDSTIEGLEDMATKYQGIVATIKKKEYNFLDQRKMDFDQDYEEFCKQTNDLHNELRKFMDVTFAKIQNTNQALRMLKKFERLNIPNLGIDDKYQLILENYGADIDMISKLYTKQKYDPPLARNQPPIAGKILWARQLFHRIQQPMQLFQQHPAVLSTAEAKPIIRSYNRMAKVLLEFEVLFHRAWLRQIEEIHVGLEASLLVKAPGTGELFVNFDPQILILFRETECMAQMGLEVSPLATSLFQKRDRYKRNFSNMKMMLAEYQRVKSKIPAAIEQLIVPHLAKVDEALQPGLAALTWTSLNIEAYLENTFAKIKDLELLLDRVNDLIEFRIDAILEEMSSTPLCQLPQEEPLTCEEFLQMTKDLCVNGAQILHFKSSLVEEAVNELVNMLLDVEVLSEEESEKISNENSVNYKNESSAKREEGNFDTLTSSINARANALLLTTVTRKKKETEMLGEEARELLSHFNHQNMDALLKVTRNTLEAIRKRIHSSHTINFRDSNSASNMKQNSLPIFRASVTLAIPNIVMAPALEDVQQTLNKAVECIISVPKGVRQWSSELLSKKKIQERKMAALQSNEDSDSDVEMGENELQDTLEIASVNLPIPVQTKNYYKNVSENKEIVKLVSVLSTIINSTKKEVITSMDCFKRYNHIWQKGKEEAIKTFITQSPLLSEFESQILYFQNLEQEINAEPEYVCVGSIALYTADLKFALTAETKAWMVVIGRHCNKKYRSEMENIFMLIEEFNKKLNRPIKDLDDIRIAMAALKEIREEQISIDFQVGPIEESYALLNRYGLLIAREEIDKVDTLHYAWEKLLARAGEVQNKLVSLQPSFKKELISAVEVFLQDCHQFYLDYDLNGPMASGLKPQEASDRLIMFQNQFDNIYRKYITYTGGEELFGLPATQYPQLLEIKKQLNLLQKIYTLYNSVIETVNSYYDILWSEVNIEKINNELLEFQNRCRKLPRALKDWQAFLDLKKIIDDFSECCPLLEYMASKAMMERHWERITTLTGHSLDVGNESFKLRNIMEAPLLKYKEEIEDICISAVKERDIEQKLKQVINEWDNKTFTFGSFKTRGELLLRGDSTSEIIANMEDSLMLLGSLLSNRYNMPFKAQIQKWVQYLSNSTDIIESWMTVQNLWIYLEAVFVGGDIAKQLPKEAKRFSNIDKSWVKIMTRAHEVPSVVQCCVGDETLGQLLPHLLDQLEICQKSLTGYLEKKRLCFPRFFFVSDPALLEILGQASDSHTIQAHLLNVFDNIKSVKFHEKIYDRILSISSQEGETIELDKPVMAEGNVEVWLNSLLEESQSSLHLVIRQAAANIQETGFQLTEFLSSFPAQVGLLGIQMIWTRDSEEALRNAKFDKKIMQKTNQAFLELLNTLIDVTTRDLSSTERVKYETLITIHVHQRDIFDDLCHMHIKSPMDFEWLKQCRFYFNEDSDKMMIHITDVAFIYQNEFLGCTDRLVITPLTDRCYITLAQALGMSMGGAPAGPAGTGKTETTKDMGRCLGKYVVVFNCSDQMDFRGLGRIFKGLAQSGSWGCFDEFNRIDLPVLSVAAQQISIILTCKKEHKKSFIFTDGDNVTMNPEFGLFLTMNPGYAGRQELPENLKINFRSVAMMVPDRQIIIRVKLASCGFIDNVVLARKFFTLYKLCEEQLSKQVHYDFGLRNILSVLRTLGAAKRANPMDTESTIVMRVLRDMNLSKLIDEDEPLFLSLIEDLFPNILLDKAGYPELEAAISRQVEEAGLINHPPWKLKVIQLFETQRVRHGMMTLGPSGAGKTTCIHTLMRAMTDCGKPHREMRMNPKAITAPQMFGRLDVATNDWTDGIFSTLWRKTLRAKKGEHIWIILDGPVDAIWIENLNSVLDDNKTLTLANGDRIPMAPNCKIIFEPHNIDNASPATVSRNGMVFMSSSILDWSPILEGFLKKRSPQEAEILRQLYTESFPDLYRFCIQNLEYKMEVLEAFVITQSINMLQGLIPLKEQGGEVSQAHLGRLFVFALLWSAGAALELDGRRRLELWLRSRPTGTLELPPPAGPGDTAFDYYVAPDGTWTHWNTRTQEYLYPSDTTPEYGSILVPNVDNVRTDFLIQTIAKQGKAVLLIGEQGTAKTVIIKGFMSKYDPECHMIKSLNFSSATTPLMFQRTIESYVDKRMGTTYGPPAGKKMTVFIDDVNMPIINEWGDQVTNEIVRQLMEQNGFYNLEKPGEFTSIVDIQFLAAMIHPGGGRNDIPQRLKRQFSIFNCTLPSEASVDKIFGVIGVGHYCTQRGFSEEVRDSVTKLVPLTRRLWQMTKIKMLPTPAKFHYVFNLRDLSRVWQGMLNTTSEVIKEPNDLLKLWKHECKRVIADRFTVSSDVTWFDKALVSLVEEEFGEEKKLLVDCGIDTYFVDFLRDAPEAAGETSEEADAETPKIYEPIESFSHLKERLNMFLQLYNESIRGAGMDMVFFADAMVHLVKISRVIRTPQGNALLVGVGGSGKQSLTRLASFIAGYVSFQITLTRSYNTSNLMEDLKVLYRTAGQQGKGITFIFTDNEIKDESFLEYMNNVLSSGEVSNLFARDEIDEINSDLASVMKKEFPRCLPTNENLHDYFMSRVRQNLHIVLCFSPVGEKFRNRALKFPALISGCTIDWFSRWPKDALVAVSEHFLTSYDIDCSLEIKKEVVQCMGSFQDGVAEKCVDYFQRFRRSTHVTPKSYLSFIQGYKFIYGEKHVEVRTLANRMNTGLEKLKEASESVAALSKELEAKEKELQVANDKADMVLKEVTMKAQAAEKVKAEVQKVKDRAQAIVDSISKDKAIAEEKLEAAKPALEEAEAALQTIRPSDIATVRTLGRPPHLIMRIMDCVLLLFQRKVSAVKIDLEKSCTMPSWQESLKLMTAGNFLQNLQQFPKDTINEEVIEFLSPYFEMPDYNIETAKRVCGNVAGLCSWTKAMASFFSINKEVLPLKANLVVQENRHLLAMQDLQKAQAELDDKQAELDVVQAEYEQAMTEKQTLLEDAERCRHKMQTASTLISGLAGEKERWTEQSQEFAAQTKRLVGDVLLATAFLSYSGPFNQEFRDLLLNDWRKEMKARKIPFGKNLNLSEMLIDAPTISEWNLQGLPNDDLSIQNGIIVTKASRYPLLIDPQTQGKIWIKNKESRNELQITSLNHKYFRNHLEDSLSLGRPLLIEDVGEELDPALDNVLERNFIKTGSTFKVKVGDKEVDVLDGFRLYITTKLPNPAYTPEISARTSIIDFTVTMKGLEDQLLGRVILTEKQELEKERTHLMEDVTANKRRMKELEDNLLYRLTSTQGSLVEDESLIVVLSNTKRTAEEVTQKLEISAETEVQINSAREEYRPVATRGSILYFLITEMRLVNEMYQTSLRQFLGLFDLSLARSVKSPITSKRIANIIEHMTYEVYKYAARGLYEEHKFLFTLLLTLKIDIQRNRVKHEEFLTLIKGGASLDLKACPPKPSKWILDITWLNLVELSKLRQFSDVLDQISRNEKMWKIWFDKENPEEEPLPNAYDKSLDCFRRLLLIRSWCPDRTIAQARKYIVDSMGEKYAEGVILDLEKTWEESDPRTPLICLLSMGSDPTDSIIALGKRLKIETRYVSMGQGQEVHARKLLQQTMANGGWALLQNCHLGLDFMDELMDIIIETELVHDAFRLWMTTEAHKQFPITLLQMSIKFANDPPQGLRAGLKRTYSGVSQDLLDVSSGSQWKPMLYAVAFLHSTVQERRKFGALGWNIPYEFNQADFNATVQFIQNHLDDMDVKKGVSWTTIRYMIGEIQYGGRVTDDYDKRLLNTFAKVWFSENMFGPDFSFYQGYNIPKCSTVDNYLQYIQSLPAYDSPEVFGLHPNADITYQSKLAKDVLDTILGIQPKDTSGGGDETREAVVARLADDMLEKLPPDYVPFEVKERLQKMGPFQPMNIFLRQEIDRMQRVLSLVRSTLTELKLAIDGTIIMSENLRDALDCMFDARIPAWWKKASWISSTLGFWFTELIERNSQFTSWVFNGRPHCFWMTGFFNPQGFLTAMRQEITRANKGWALDNMVLCNEVTKWMKDDISAPPTEGVYVYGLYLEGAGWDKRNMKLIESKPKVLFELMPVIRIYAENNTLRDPRFYSCPIYKKPVRTDLNYIAAVDLRTAQTPEHWVLRGVALLCDVK,mutated_sequence,1.0,4624.0,NP_001360.1.a2m,NP_001360.1.npy,ClinVar
+NP_001361282.1,NP_001361282.1.csv,METSASATASEKQEAKSGILEAAGFPDPGKKASPLVVAAAAAAAVAAQGVPQHLLPPFHAPLPIDMRHQEGRYHYEPHSVHGVHGPPALSGSPVISDISLIRLSPHPAGPGESPFNAPHPYVNPHMEHYLRSVHSSPTLSMISAARGLSPADVAQEHLKERGLFGLPAPGTTPSDYYHQMTLVAGHPAPYGDLLMQSGGAASAPHLHDYLNPVDVSRFSSPRVTPRLSRKRALSISPLSDASLDLQRMIRTSPNSLVAYINNSRSSSAASGSYGHLSAGALSPAFTFPHPINPVAYQQILSQQRGLGSAFGHTPPLIQPSPTFLAQQPMALTSINATPTQLSSSSNCLSDTNQNKQSSESAVSSTVNPVAIHKRSKVKTEPEGLRPASPLALTQEQLADLKEDLDRDDCKQEAEVVIYETNCHWEDCTKEYDTQEQLVHHINNEHIHGEKKEFVCRWQACTREQKPFKAQYMLVVHMRRHTGEKPHKCTFEGCSKAYSRLENLKTHLRSHTGEKPYVCEHEGCNKAFSNASDRAKHQNRTHSNEKPYICKIPGCTKRYTDPSSLRKHVKTVHGPDAHVTKKQRNDVHLRTPLLKENGDSEAGTEPGGPESTEASSTSQAVEDCLHVRAIKTESSGLCQSSPGAQSSCSSEPSPLGSAPNNDSGVEMPGTGPGSLGDLTALDDTPPGADTSALAAPSAGGLQLRKHMTTMHRFEQLKKEKLKSLKDSCSWAGPTPHTRNTKLPPLPGSGSILENFSGSGGGGPAGLLPNPRLSELSASEVTMLSQLQERRDSSTSTVSSAYTVSRRSSGISPYFSSRRSSEASPLGAGRPHNASSADSYDPISTDASRRSSEASQCSGGSGLLNLTPAQQYSLRAKYAAATGGPPPTPLPGLERMSLRTRLALLDAPERTLPAGCPRPLGPRRGSDGPTYGHGHAGAAPAFPHEAPGGGARRASDPVRRPDALSLPRVQRFHSTHNVNPGPLPPCADRRGLRLQSHPSTDGGLARGAYSPRPPSISENVAMEAVAAGVDGAGPEADLGLPEDDLVLPDDVVQYIKAHASGALDEGTGQVYPTESTGFSDNPRLPSPGLHGQRRMVAADSNVGPSAPMLGGCQLGFGAPSSLNKNNMPVQWNEVSSGTVDALASQVKPPPFPQGNLAVVQQKPAFGQYPGYSPQGLQASPGGLDSTQPHLQPRSGAPSQGIPRVNYMQQLRQPVAGSQCPGMTTTMSPHACYGQVHPQLSPSTISGALNQFPQSCSNMPAKPGHLGHPQQTEVAPDPTTMGNRHRELGVPDSALAGVPPPHPVQSYPQQSHHLAASMSQEGYHQVPSLLPARQPGFMEPQTGPMGVATAGFGLVQPRPPLEPSPTGRHRGVRAVQQQLAYARATGHAMAAMPSSQETAEAVPKGAMGNMGSVPPQPPPQDAGGAPDHSMLYYYGQIHMYEQDGGLENLGSCQVMRSQPPQPQACQDSIQPQPLPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTSLAEESKFLNMMT,mutated_sequence,1.0,1569.0,NP_001361282.1.a2m,NP_001361282.1.npy,ClinVar
+NP_001361314.1,NP_001361314.1.csv,MSTERDSETTFDEDSQPNDEVVPYSDDETEDELDDQGSAVEPEQNRVNREAEENREPFRKECTWQVKANDRKYHEQPHFMNTKFLCIKESKYANNAIKTYKYNAFTFIPMNLFEQFKRAANLYFLALLILQAVPQISTLAWYTTLVPLLVVLGVTAIKDLVDDVARHKMDKEINNRTCEVIKDGRFKVAKWKEIQVGDVIRLKKNDFVPADILLLSSSEPNSLCYVETAELDGETNLKFKMSLEITDQYLQREDTLATFDGFIECEEPNNRLDKFTGTLFWRNTSFPLDADKILLRGCVIRNTDFCHGLVIFAGADTKIMKNSGKTRFKRTKIDYLMNYMVYTIFVVLILLSAGLAIGHAYWEAQVGNSSWYLYDGEDDTPSYRGFLIFWGYIIVLNTMVPISLYVSVEVIRLGQSHFINWDLQMYYAEKDTPAKARTTTLNEQLGQIHYIFSDKTGTLTQNIMTFKKCCINGQIYGDHRDASQHNHNKIEQVDFSWNTYADGKLAFYDHYLIEQIQSGKEPEVRQFFFLLAVCHTVMVDRTDGQLNYQAASPDEGALVNAARNFGFAFLARTQNTITISELGTERTYNVLAILDFNSDRKRMSIIVRTPEGNIKLYCKGADTVIYERLHRMNPTKQETQDALDIFANETLRTLCLCYKEIEEKEFTEWNKKFMAASVASTNRDEALDKVYEEIEKDLILLGATAIEDKLQDGVPETISKLAKADIKIWVLTGDKKETAENIGFACELLTEDTTICYGEDINSLLHARMENQRNRGGVYAKFAPPVQESFFPPGGNRALIITGSWLNEILLEKKTKRNKILKLKFPRTEEERRMRTQSKRRLEAKKEQRQKNFVDLACECSAVICCRVTPKQKAMVVDLVKRYKKAITLAIGDGANDVNMIKTAHIGVGISGQEGMQAVMSSDYSFAQFRYLQRLLLVHGRWSYIRMCKFLRYFFYKNFAFTLVHFWYSFFNGYSAQTAYEDWFITLYNVLYTSLPVLLMGLLDQDVSDKLSLRFPGLYIVGQRDLLFNYKRFFVSLLHGVLTSMILFFIPLGAYLQTVGQDGEAPSDYQSFAVTIASALVITVNFQIGLDTSYWTFVNAFSIFGSIALYFGIMFDFHSAGIHVLFPSAFQFTGTASNALRQPYIWLTIILAVAVCLLPVVAIRFLSMTIWPSESDKIQKHRKRLKAEEQWQRRQQVFRRGVSTRRSAYAFSHQRGYADLISSGRSIRKKRSPLDAIVADGTAEYRRTGDS,mutated_sequence,1.0,1251.0,NP_001361314.1.a2m,NP_001361314.1.npy,ClinVar
+NP_001361433.1,NP_001361433.1.csv,MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLLALLVLASAGVLLWYFLGYKAEVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTRLGTYYNSSSVYSFGEGPLTCFFWFILQIPEHRRLMLSPEVVQALLVEELLSTVNSSAAVPYRAEYEVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLMLKLRLEWTLAECRDRLAMYDVAGPLEKRLITSVYGCSRQEPVVEVLASGAIMAVVWKKGLHSYYDPFVLSVQPVVFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGLALWFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPVVATAGITINFTSQISLTGPGVRVHYGLYNQSDPCPGEFLCSVNGLCVPACDGVKDCPNGLDERNCVCRATFQCKEDSTCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCDCGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVLWTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPVVRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCSEVYRYQVTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQVVT,mutated_sequence,1.0,802.0,NP_001361433.1.a2m,NP_001361433.1.npy,ClinVar
+NP_001361552.1,NP_001361552.1.csv,MEEQVFKGDPDTPHSISFSGSGFLSFYQAGAVDALRDLAPRMLETAHRFAGTSAGAVIAALAICGIEMDEYLRVLNVGVAEVKKSFLGPLSPSCKMVQMMRQFLYRVLPEDSYKVTTGKLHVSLTRLTDGENVVVSEFTSKEELIEALYCSCFVPVYCGLIPPTYRGVRYIDGGFTGMQPCAFWTDAITISTFSGQQDICPRDCPAIFHDFRMFNCSFQFSLENIARMTHALFPPDLVILHDYYYRGYEDAVLYLRRLNAVYLNSSSKRVIFPRVEVYCQIELALGNECPERSQPSLRARQASLEGATQPHKEWVPKGDGRGSHGPPVSQPVQTLEFTCESPVSAPVSPLEQPPAQPLASSTPLSLSGMPPVSFPAVHKPPSSTPGSSLPTPPPGLSPLSPQQQVQPSGSPARSLHSQAPTSPRPSLGPSTVGAPQTLPRSSLSAFPAQPPVEELGQEQPQAVALLVSSKPKSAVPLVHVKETVSKPYVTESPAEDSNWVNKVFKKNKQKTSGTRKGFPRHSGSKKPSSKVQSAPCPLNFPLLSTSETVWVTYRPHPSRIQECCPEVWNSLG,mutated_sequence,1.0,572.0,NP_001361552.1.a2m,NP_001361552.1.npy,ClinVar
+NP_001361757.1,NP_001361757.1.csv,MAARAAAAAAAAAARARARAGSGERRAPPGPRPAPGARDLEAGARGAAAAAAAPGPMLGGGGDGGGGLNSVHHHPLLPRHELNMAHNAGAAAAAGTHSAKSGGSEAALKEGGSAAALSSSSSSSAAAAAASSSSSSGPGSAMETGLLPNHKLKTVGEAPAAPPHQQHHHHHHAHHHHHHAHHLHHHHALQQQLNQFQQQQQQQQQQQQQQQQQQHPISNNNSLGGAGGGAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAGGRYEHPGLGALGTQQPPVAVPGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAPGSMDPLQNSHEGYPNSQCNHYPGYSRPGAGGGGGGGGGGGGGSGGGGGGGGAGAGGAGAGAVAAAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLTSPSPMMRSYGGSYPEYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQYAAASPAWAAAQQRSHPAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGSNPHSQPQQSSPYPGGSYGPPGPQRYPIGIQGRTPGAMAGMQYPQQQMPPQYGQQGVSGYCQQGQQPYYSQQPQPPHLPPQAQYLPSQSQQRYQPQQDMSQEGYGTRSQPPLAPGKPNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISSFQQSNSSGTYGPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQPCGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFPGMNQSGLMASSSPYSQPMNNSSSLMNTQAPPYSMAPAMVNSSAASVGLADMMSPGESKLPLPLKADGKEEGTPQPESKSKDSYSSQGISQPPTPGNLPVPSPMSPSSASISSFHGDESDSISSPGWPKTPSSPKSSSSTTTGEKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFECKIERGEEPPPEVFSTGDTKKQPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPGDLKPPTPASTPHGQMTPMQGGRSSTISVHDPFSDVSDSSFPKRNSMTPNAPYQQGMSMPDVMGRMPYEPNKDPFGGMRKVPGSSEPFMTQGQMPNSSMQDMYNQSPSGAMSNLGMGQRQQFPYGASYDRRHEPYGQQYPGQGPPSGQPPYGGHQPGLYPQQPNYKRHMDGMYGPPAKRHEGDMYNMQYSSQQQEMYNQYGGSYSGPDRRPIQGQYPYPYSRERMQGPGQIQTHGIPPQMMGGPLQSSSSEGPQQNMWAARNDMPYPYQNRQGPGGPTQAPPYPGMNRTDDMMVPDQRINHESQWPSHVSQRQPYMSSSASMQPITRPPQPSYQTPPSLPNHISRAPSPASFQRSLENRMSPSKSPFLPSMKMQKVMPTVPTSQVTGPPPQPPPIRREITFPPGSVEASQPVLKQRRKITSKDIVTPEAWRVMMSLKSGLLAESTWALDTINILLYDDSTVATFNLSQLSGFLELLVEYFRKCLIDIFGILMEYEVGDPSQKALDHNAARKDDSQSLADDSGKEEEDAECIDDDEEDEEDEEEDSEKTESDEKSSIALTAPDAAADPKEKPKQASKFDKLPIKIVKKNNLFVVDRSDKLGRVQEFNSGLLHWQLGGGDTTEHIQTHFESKMEIPPRRRPPPPLSSAGRKKEQEGKGDSEEQQEKSIIATIDDVLSARPGALPEDANPGPQTESSKFPFGIQQAKSHRNIKLLEDEPRSRDETPLCTIAHWQDSLAKRCICVSNIVRSLSFVPGNDAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEVLRDNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLSIQDNNVDLILATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDALAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNLMHMQPPPLEPPSVDMMCRAAKALLAMARVDENRSEFLLHEGRLLDISISAVLNSLVASVICDVLFQIGQL,mutated_sequence,1.0,2372.0,NP_001361757.1.a2m,NP_001361757.1.npy,ClinVar
+NP_001362309.1,NP_001362309.1.csv,MFGIQENIPRGGTTMKEEPLGSGMNPVRSWMHTAGVVDANTAAQSGVGLARAHFEKQPPSNLRKSNFFHFVLALYDRQGQPVEIERTAFVDFVEKEKEPNNEKTNNGIHYKLQLLYSNGVRTEQDLYVRLIDSMTKQAIVYEGQDKNPEMCRVLLTHEIMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVVSTTVNVDGHVLAVSDNMFVHNNSKHGRRARRLDPSEGTAPSYLENATPCIKAISPSEGWTTGGATVIIIGDNFFDGLQVVFGTMLVWSELITPHAIRVQTPPRHIPGVVEVTLSYKSKQFCKGAPGRFVYTALNEPTIDYGFQRLQKVIPRHPGDPERLPKEVLLKRAADLVEALYGMPHNNQEIILKRAADIAEALYSVPRNHNQIPTLGNNPAHTGMMGVNSFSSQLAVNVSETSQANDQVGYSRNTSSVSPRGYVPSSTPQQSNYNTVSTSMNGYGSGAMASLGVPGSPGFLNGSSANSPYGIVPSSPTMAASSVTLPSNCSSTHGIFSFSPANVISAVKQKSAFAPVVRPQASPPPSCTSANGNGLQGSLLGAEDVAAEKTNWPFCEVGGIFHFDELMLKKGTGKLCLGW,mutated_sequence,1.0,629.0,NP_001362309.1.a2m,NP_001362309.1.npy,ClinVar
+NP_001364194.1,NP_001364194.1.csv,MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPGSETSDAKSTPTAEAEEAGIGDTPSLEDEAAGHVTQEELRVPGRQRKAPERPLANEISAHVQPGPCGEASGVSGPCLGEKEPEAPVPLTASLPQHRPVCPAPPPTGGPQEPSLEWGQKGGDWAEKGPAFPKPATTAYLHTEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEGGRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPAQDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLEFTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPAAAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSSDPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL,mutated_sequence,1.0,833.0,NP_001364194.1.a2m,NP_001364194.1.npy,ClinVar
+NP_001365112.1,NP_001365112.1.csv,MASEVVCGLIFRLLLPICLAVACAFRYNGLSFVYLIYLLLIPLFSEPTKTTMQGHTGRLLKSLCFISLSFLLLHIIFHITLVSLEAQHRIAPGYNCSTWEKTFRQIGFESLKGADAGNGIRVFVPDIGMFIASLTIWLLCRNIVQKPVTDEAAQSNPEFENEELAEGEKIDSEEALIYEEDFNGGDGVEGELEESTKLKMFRRLASVASKLKEFIGNMITTAGKVVVTILLGSSGMMLPSLTSSVYFFVFLGLCTWWSWCRTFDPLLFSCLCVLLAIFTAGHLIGLYLYQFQFFQEAVPPNDYYARLFGIKSVIQTDCSSTWKIIVNPDLSWYHHANPILLLVMYYTLATLIRIWLQEPLVQDEGTKEEDKALACSPIQITAGRRRSLWYATHYPTDERKLLSMTQDDYKPSDGLLVTVNGNPVDYHTIHPSLPMENGPGKADLYSTPQYRWEPSDESSEKREEEEEEKEEFEEERSREEKRSIKVHAMVSVFQFIMKQSYICALIAMMAWSITYHSWLTFVLLIWSCTLWMIRNRRKYAMISSPFMVVYGNLLLILQYIWSFELPEIKKVPGFLEKKEPGELASKILFTITFWLLLRQHLTEQKALQEKEALLSEVKIGSQENEEKDEELQDIQVEGEPKEEEEEEAKEEKQERKKVEQEEAEEEDEQDIMKVLGNLVVAMFIKYWIYVCGGMFFFVSFEGKIVMYKIIYMVLFLFCVALYQVHYEWWRKILKYFWMSVVIYTMLVLIFIYTYQFENFPGLWQNMTGLKKEKLEDLGLKQFTVAELFTRIFIPTSFLLVCILHLHYFHDRFLELTDLKSIPSKEDNTIYSHAKVNGRVYLIINSIKKKLPIHQNELAHPEGSLPDLTMMHLTASLEKPEVRKLAEPGEEKLEGYSEKAQKGDLGKDSEESEEDGEEEEESEEEEETSDLRNKWHLVIDRLTVLFLKFLEYFHKLQVFMWWILELHIIKIVSSYIIWVSVKEVSLFNYVFLISWAFALPYAKLRRLASSVCTVWTCVIIVCKMLYQLQTIKPENFSVNCSLPNENQTNIPFNELNKSLLYSAPIDPTEWVGLRKSSPLLVYLRNNLLMLAILAFEVTIYRHQEYYRGRNNLTAPVSRTIFHDITRLHLDDGLINCAKYFINYFFYKFGLETCFLMSVNVIGQRMDFYAMIHACWLIAVLYRRRRKAIAEIWPKYCCFLACIITFQYFICIGIPPAPCRDYPWRFKGASFNDNIIKWLYFPDFIVRPNPVFLVYDFMLLLCASLQRQIFEDENKAAVRIMAGDNVEICMNLDAASFSQHNPVPDFIHCRSYLDMSKVIIFSYLFWFVLTIIFITGTTRISIFCMGYLVACFYFLLFGGDLLLKPIKSILRYWDWLIAYNVFVITMKNILSIGACGYIGTLVHNSCWLIQAFSLACTVKGYQMPAANSPCTLPSGEAGIIWDSICFAFLLLQRRVFMSYYFLHVVADIKASQILASRGAELFQATIVKAVKARIEEEKKSMDQLKRQMDRIKARQQKYKKGKERMLSLTQEPGEGQDMQKLSEEDDEREADKQKAKGKKKQWWRPWVDHASMVRSGDYYLFETDSEEEEEEELKKEDEEPPRRSAFQRAIGKFASAILALPKSVIKLPKTILQYLIRAAKFVYQAWITDPKTALRQRHKEKKRSAREERKRRRKGSKEGPVEWEDREDEPIKKKSDGPDNIIKRIFNILKFTWVLFLATVDSFTTWLNSISREHIDISTVLRIERCMLTREIKKGNVPTRESIHMYYQNHIMNLSRESGLDTIDEHPGAASGAQTAHRMDSLDSHDSISSCYTEATMLFSRQSTLDDLDGQEIPKTSERARPRLRKMLSMDMSSSSADSGSLASSEPTQCTMLYSRQGTTETIEEVEAEQEEEAGSTAPEPREAKEYEATGYDVGAMGAEEASLTPEEELTQFSTLDGDVEAPPSYSKAVSFEHLSFGSQDDSAGKNRMAVSPDDSRTDKLGSSILPPLTHELTASELLLKKMFHDDELEESEKFYVGQPRFLLLFYAMYNTLVARSEMVCYFVIILNHMVSASMITLLLPILIFLWAMLSVPRPSRRFWMMAIVYTEVAIVVKYFFQFGFFPWNKNVEVNKDKPYHPPNIIGVEKKEGYVLYDLIQLLALFFHRSILKCHGLWDEDDMTESGMAREESDDELSLGHGRRDSSDSLKSINLAASVESVHVTFPEQQTAVRRKRSGSSSEPSQRSSFSSNRSQRGSTSTRNSSQKGSSVLSIKQKGKRELYMEKLQEHLIKAKAFTIKKTLEIYVPIKQFFYNLIHPEYSAVTDVYVLMFLADTVDFIIIVFGFWAFGKHSAAADITSSLSEDQVPGPFLVMVLIQFGTMVVDRALYLRKTVLGKVIFQVILVFGIHFWMFFILPGVTERKFSQNLVAQLWYFVKCVYFGLSAYQIRCGYPTRVLGNFLTKSYNYVNLFLFQGFRLVPFLTELRAVMDWVWTDTTLSLSSWICVEDIYAHIFILKCWRESEKRYPQPRGQKKKKVVKYGMGGMIIVLLICIVWFPLLFMSLIKSVAGVINQPLDVSVTITLGGYQPIFTMSAQQSQLKVMDQQSFNKFIQAFSRDTGAMQFLENYEKEDITVAELEGNSNSLWTISPPSKQKMIHELLDPNSSFSVVFSWSIQRNLSLGAKSEIATDKLSFPLKNITRKNIAKMIAGNSTESSKTPVTIEKIYPYYVKAPSDSNSKPIKQLLSENNFMDITIILSRDNTTKYNSEWWVLNLTGNRIYNPNSQALELVVFNDKVSPPSLGFLAGYGIMGLYASVVLVIGKFVREFFSGISHSIMFEELPNVDRILKLCTDIFLVRETGELELEEDLYAKLIFLYRSPETMIKWTREKTN,mutated_sequence,1.0,2865.0,NP_001365112.1.a2m,NP_001365112.1.npy,ClinVar
+NP_001365381.1,NP_001365381.1.csv,MSDKMSSFLHIGDICSLYAEGSTNGFISTLGLVDDRCVVQPETGDLNNPPKKFRDCLFKLCPMNRYSAQKQFWKAAKPGANSTTDAVLLNKLHHAADLEKKQNETENRKLLGTVIQYGNVIQLLHLKSNKYLTVNKRLPALLEKNAMRVTLDEAGNEGSWFYIQPFYKLRSIGDSVVIGDKVVLNPVNAGQPLHASSHQLVDNPGCNEVNSVNCNTSWKIVLFMKWSDNKDDILKGGDVVRLFHAEQEKFLTCDEHRKKQHVFLRTTGRQSATSATSSKALWEVEVVQHDPCRGGAGYWNSLFRFKHLATGHYLAAEVDPDFEEECLEFQPSVDPDQDASRSRLRNAQEKMVYSLVSVPEGNDISSIFELDPTTLRGGDSLVPRNSYVRLRHLCTNTWVHSTNIPIDKEEEKPVMLKIGTSPVKEDKEAFAIVPVSPAEVRDLDFANDASKVLGSIAGKLEKGTITQNERRSVTKLLEDLVYFVTGGTNSGQDVLEVVFSKPNRERQKLMREQNILKQIFKLLQAPFTDCGDGPMLRLEELGDQRHAPFRHICRLCYRVLRHSQQDYRKNQEYIAKQFGFMQKQIGYDVLAEDTITALLHNNRKLLEKHITAAEIDTFVSLVRKNREPRFLDYLSDLCVSMNKSIPVTQELICKAVLNPTNADILIETKLVLSRFEFEGVSSTGENALEAGEDEEEVWLFWRDSNKEIRSKSVRELAQDAKEGQKEDRDVLSYYRYQLNLFARMCLDRQYLAINEISGQLDVDLILRCMSDENLPYDLRASFCRLMLHMHVDRDPQEQVTPVKYARLWSEIPSEIAIDDYDSSGASKDEIKERFAQTMEFVEEYLRDVVCQRFPFSDKEKNKLTFEVVNLARNLIYFGFYNFSDLLRLTKILLAILDCVHVTTIFPISKMAKGEENKGNNDVEKLKSSNVMRSIHGVGELMTQVVLRGGGFLPMTPMAAAPEGNVKQAEPEKEDIMVMDTKLKIIEILQFILNVRLDYRISCLLCIFKREFDESNSQTSETSSGNSSQEGPSNVPGALDFEHIEEQAEGIFGGSEENTPLDLDDHGGRTFLRVLLHLTMHDYPPLVSGALQLLFRHFSQRQEVLQAFKQVQLLVTSQDVDNYKQIKQDLDQLRSIVEKSELWVYKGQGPDETMDGASGENEHKKTEEGNNKPQKHESTSSYNYRVVKEILIRLSKLCVQESASVRKSRKQQQRLLRNMGAHAVVLELLQIPYEKAEDTKMQEIMRLAHEFLQNFCAGNQQNQALLHKHINLFLNPGILEAVTMQHIFMNNFQLCSEINERVVQHFVHCIETHGRNVQYIKFLQTIVKAEGKFIKKCQDMVMAELVNSGEDVLVFYNDRASFQTLIQMMRSERDRMDENSPLMYHIHLVELLAVCTEGKNVYTEIKCNSLLPLDDIVRVVTHEDCIPEVKIAYINFLNHCYVDTEVEMKEIYTSNHMWKLFENFLVDICRACNNTSDRKHADSILEKYVTEIVMSIVTTFFSSPFSDQSTTLQTRQPVFVQLLQGVFRVYHCNWLMPSQKASVESCIRVLSDVAKSRAIAIPVDLDSQVNNLFLKSHSIVQKTAMNWRLSARNAARRDSVLAASRDYRNIIERLQDIVSALEDRLRPLVQAELSVLVDVLHRPELLFPENTDARRKCESGGFICKLIKHTKQLLEENEEKLCIKVLQTLREMMTKDRGYGEKLISIDELDNAELPPAPDSENATEELEPSPPLRQLEDHKRGEALRQVLVNRYYGNVRPSGRRESLTSFGNGPLSAGGPGKPGGGGGGSGSSSMSRGEMSLAEVQCHLDKEGASNLVIDLIMNASSDRVFHESILLAIALLEGGNTTIQHSFFCRLTEDKKSEKFFKVFYDRMKVAQQEIKATVTVNTSDLGNKKKDDEVDRDAPSRKKAKEPTTQITEEVRDQLLEASAATRKAFTTFRREADPDDHYQPGEGTQATADKAKDDLEMSAVITIMQPILRFLQLLCENHNRDLQNFLRCQNNKTNYNLVCETLQFLDCICGSTTGGLGLLGLYINEKNVALINQTLESLTEYCQGPCHENQNCIATHESNGIDIITALILNDINPLGKKRMDLVLELKNNASKLLLAIMESRHDSENAERILYNMRPKELVEVIKKAYMQGEVEFEDGENGEDGAASPRNVGHNIYILAHQLARHNKELQSMLKPGGQVDGDEALEFYAKHTAQIEIVRLDRTMEQIVFPVPSICEFLTKESKLRIYYTTERDEQGSKINDFFLRSEDLFNEMNWQKKLRAQPVLYWCARNMSFWSSISFNLAVLMNLLVAFFYPFKGVRGGTLEPHWSGLLWTAMLISLAIVIALPKPHGIRALIASTILRLIFSVGLQPTLFLLGAFNVCNKIIFLMSFVGNCGTFTRGYRAMVLDVEFLYHLLYLVICAMGLFVHEFFYSLLLFDLVYREETLLNVIKSVTRNGRSIILTAVLALILVYLFSIVGYLFFKDDFILEVDRLPNETAVPETGESLASEFLFSDVCRVESGENCSSPAPREELVPAEETEQDKEHTCETLLMCIVTVLSHGLRSGGGVGDVLRKPSKEEPLFAARVIYDLLFFFMVIIIVLNLIFGVIIDTFADLRSEKQKKEEILKTTCFICGLERDKFDNKTVTFEEHIKEEHNMWHYLCFIVLVKVKDSTEYTGPESYVAEMIKERNLDWFPRMRAMSLVSSDSEGEQNELRNLQEKLESTMKLVTNLSGQLSELKDQMTEQRKQKQRIGLLGHPPHMNVNPQQPA,mutated_sequence,1.0,2758.0,NP_001365381.1.a2m,NP_001365381.1.npy,ClinVar
+NP_001365383.1,NP_001365383.1.csv,MEPEDLPWPGELEEEEEEEEEEEEEEEEAAAAAAANVDDVVVVEEVEEEAGRELDSDSHYGPQHLESIDDEEDEEAKAWLQAHPGRILPPLSPPQHRYSEGERTSLEKIVPLTCHVWQQIVYQGNSRTQISDTNVVCLETTAQRGSGDDQKTESWHCLPQEMDSSQTLDTSQTRFNVRTEDTEVTDFPSLEEGILTQSENQVKEPNRDLFCSPLLVIQDSFASPDLPLLTCLTQDQEFAPDSLFHQSELSFAPLRGIPDKSEDTEWSSRPSEVSEALFQATAEVASDLASSRFSVSQHPLIGSTAVGSQCPFLPSEQGNNEETISSVDELKIPKDCDRYDDLCSYMSWKTRKDTQWPENNLADKDQVSVATSFDITDENIATKRSDHFDAARSYGQYWTQEDSSKQAETYLTKGLQGKVESDVITLDGLNENAVVCSERVAELQRKPTRESEYHSSDLRMLRMSPDTVPKAPKHLKAGDTSKGGIAKVTQSNLKSGITTTPVDSDIGSHLSLSLEDLSQLAVSSPLETTTGQHTDTLNQKTLADTHLTEETLKVTAIPEPADQKTATPTVLSSSHSHRGKPSIFYQQGLPDSHLTEEALKVSAAPGLADQTTGMSTLTSTSYSHREKPGTFYQQELPESNLTEEPLEVSAAPGPVEQKTGIPTVSSTSHSHVEDLLFFYRQTLPDGHLTDQALKVSAVSGPADQKTGTATVLSTPHSHREKPGIFYQQEFADSHQTEETLTKVSATPGPADQKTEIPAVQSSSYSQREKPSILYPQDLADSHLPEEGLKVSAVAGPADQKTGLPTVPSSAYSHREKLLVFYQQALLDSHLPEEALKVSAVSGPADGKTGTPAVTSTSSASSSLGEKPSAFYQQTLPNSHLTEEALKVSIVPGPGDQKTGIPSAPSSFYSHREKPIIFSQQTLPDFLFPEEALKVSAVSVLAAQKTGTPTVSSNSHSHSEKSSVFYQQELPDSDLPRESLKMSAIPGLTDQKTVPTPTVPSGSFSHREKPSIFYQQEWPDSYATEKALKVSTGPGPADQKTEIPAVQSSSYPQREKPSVLYPQVLSDSHLPEESLKVSAFPGPADQMTDTPAVPSTFYSQREKPGIFYQQTLPESHLPKEALKISVAPGLADQKTGTPTVTSTSYSQHREKPSIFHQQALPGTHIPEEAQKVSAVTGPGNQKTWIPRVLSTFYSQREKPGIFYQQTLPGSHIPEEAQKVSPVLGPADQKTGTPTPTSASYSHTEKPGIFYQQVLPDNHPTEEALKISVASEPVDQTTGTPAVTSTSYSQYREKPSIFYQQSLPSSHLTEEAKNVSAVPGPADQKTVIPILPSTFYSHTEKPGVFYQQVLPHSHPTEEALKISVASEPVDQTTGTPTVTSTSYSQHTEKPSIFYQQSLPGSHLTEEAKNVSAVPGPGDRKTGIPTLPSTFYSHTEKPGSFYQQVLPHSHLPEEALEVSVAPGPVDQTIGTPTVTSPSSSFGEKPIVIYKQAFPEGHLPEESLKVSVAPGPVGQTTGAPTITSPSYSQHRAKSGSFYQLALLGSQIPEEALRVSSAPGPADQTTGIPTITSTSYSFGEKPIVNYKQAFPDGHLPEEALKVSIVSGPTEKKTDIPAGPLGSSALGEKPITFYRQALLDSPLNKEVVKVSAAPGPADQKTETLPVHSTSYSNRGKPVIFYQQTLSDSHLPEEALKVPPVPGPDAQKTETPSVSSSLYSYREKPIVFYQQALPDSELTQEALKVSAVPQPADQKTGLSTVTSSFYSHTEKPNISYQQELPDSHLTEEALKVSNVPGPADQKTGVSTVTSTSYSHREKPIVSYQRELPHFTEAGLKILRVPGPADQKTGINILPSNSYPQREHSVISYEQELPDLTEVTLKAIGVPGPADQKTGIQIASSSSYSNREKASIFHQQELPDVTEEALNVFVVPGQGDRKTEIPTVPLSYYSRREKPSVISQQELPDSHLTEEALKVSPVSIPAEQKTGIPIGLSSSYSHSHKEKLKISTVHIPDDQKTEFPAATLSSYSQIEKPKISTVIGPNDQKTPSQTAFHSSYSQTVKPNILFQQQLPDRDQSKGILKISAVPELTDVNTGKPVSLSSSYFHREKSNIFSPQELPGSHVTEDVLKVSTIPGPAGQKTVLPTALPSSFSHREKPDIFYQKDLPDRHLTEDALKISSALGQADQITGLQTVPSGTYSHGENHKLVSEHVQRLIDNLNSSDSSVSSNNVLLNSQADDRVVINKPESAGFRDVGSEEIQDAENSAKTLKEIRTLLMEAENMALKRCNFPAPLARFRDISDISFIQSKKVVCFKEPSSTGVSNGDLLHRQPFTEESPSSRCIQKDIGTQTNLKCRRGIENWEFISSTTVRSPLQEAESKVSMALEETLRQYQAAKSVMRSEPEGCSGTIGNKIIIPMMTVIKSDSSSDASDGNGSCSWDSNLPESLESVSDVLLNFFPYVSPKTSITDSREEEGVSESEDGGGSSVDSLAAHVKNLLQCESSLNHAKEILRNAEEEESRVRAHAWNMKFNLAHDCGYSISELNEDDRRKVEEIKAELFGHGRTTDLSKGLQSPRGMGCKPEAVCSHIIIESHEKGCFRTLTSEHPQLDRHPCAFRSAGPSEMTRGRQNPSSCRAKHVNLSASLDQNNSHFKVWNSLQLKSHSPFQNFIPDEFKISKGLRMPFDEKMDPWLSELVEPAFVPPKEVDFHSSSQMPSPEPMKKFTTSITFSSHRHSKCISNSSVVKVGVTEGSQCTGASVGVFNSHFTEEQNPPRDLKQKTSSPSSFKMHSNSQDKEVTILAEGRRQSQKLPVDFERSFQEEKPLERSDFTGSHSEPSTRANCSNFKEIQISDNHTLISMGRPSSTLGVNRSSSRLGVKEKNVTITPDLPSCIFLEQRELFEQSKAPRADDHVRKHHSPSPQHQDYVAPDLPSCIFLEQRELFEQCKAPYVDHQMRENHSPLPQGQDSIASDLPSPISLEQCQSKAPGVDDQMNKHHFPLPQGQDCVVEKNNQHKPKSHISNINVEAKFNTVVSQSAPNHCTLAASASTPPSNRKALSCVHITLCPKTSSKLDSGTLDERFHSLDAASKARMNSEFNFDLHTVSSRSLEPTSKLLTSKPVAQDQESLGFLGPKSSLDFQVVQPSLPDSNTITQDLKTIPSQNSQIVTSRQIQVNISDFEGHSNPEGTPVFADRLPEKMKTPLSAFSEKLSSDAVTQITTESPEKTLFSSEIFINAEDRGHEIIEPGNQKLRKAPVKFASSSSVQQVTFSRGTDGQPLLLPYKPSGSTKMYYVPQLRQIPPSPDSKSDTTVESSHSGSNDAIAPDFPAQVLGTRDDDLSATVNIKHKEGIYSKRVVTKASLPVGEKPLQNENADASVQVLITGDENLSDKKQQEIHSTRAVTEAAQAKEKESLQKDTADSSAAAAAEHSAQVGDPEMKNLPDTKAITQKEEIHRKKTVPEEAWPNNKESLQINIEESECHSEFENTTRSVFRSAKFYIHHPVHLPSDQDICHESLGKSVFMRHSWKDFFQHHPDKHREHMCLPLPYQNMDKTKTDYTRIKSLSINVNLGNKEVMDTTKSQVRDYPKHNGQISDPQRDQKVTPEQTTQHTVSLNELWNKYRERQRQQRQPELGDRKELSLVDRLDRLAKILQNPITHSLQVSESTHDDSRGERSVKEWSGRQQQRNKLQKKKRFKSLEKSHKNTGELKKSKVLSHHRAGRSNQIKIEQIKFDKYILSKQPGFNYISNTSSDCRPSEESELLTDTTTNILSGTTSTVESDILTQTDREVALHERSSSVSTIDTARLIQAFGHERVCLSPRRIKLYSSITNQQRRYLEKRSKHSKKVLNTGHPLVTSEHTRRRHIQVANHVISSDSISSSASSFLSSNSTFCNKQNVHMLNKGIQAGNLEIVNGAKKHTRDVGITFPTPSSSEAKLEENSDVTSWSEEKREEKMLFTGYPEDRKLKKNKKNSHEGVSWFVPVENVESRSKKENVPNTCGPGISWFEPITKTRPWREPLREQNCQGQHLDGRGYLAGPGREAGRDLLRPFVRATLQESLQFHRPDFISRSGERIKRLKLIVQERKLQSMLQTERDALFNIDRERQGHQNRMCPLPKRVFLAIQKNKPISKKEMIQRSKRIYEQLPEVQKKREEEKRKSEYKSYRLRAQLYKKRVTNQLLGRKVPWD,mutated_sequence,1.0,4168.0,NP_001365383.1.a2m,NP_001365383.1.npy,ClinVar
+NP_001365544.1,NP_001365544.1.csv,MNPREEKVKIITEEFIENDEDADMGRQNKNSKVRRQPRKKQPPTAVPKEMVSEKSHLGNPQEPVQEEPKTRLLSMTVRRGPRSLPPIPSTSRTGFAEFSMRGRMREKLQAARSKAESALLQEIPTPRPRRLRSPSKKELETEFGTEPGKEVERTQQEVDSQSYSRVKFHDSARKIKPKPQVPPGFPSAEEAYNFFTFNFDPEPEGSEEKPKARHRAGTNQEEEEGEEEEPPAQGGGKEMDEEELLNGDDAEDFLLGLDHVADDFVAVRPADYESIHDRLQMEREMLFIPSRQTVPTYKKLPENVQPRFLEDEGLYTGVRPEVARTNQNIMENRLLMQDPERRWFGDDGRILALPNPIKPFPSRPPVLTQEQSIKAELETLYKKAVKYVHSSQHVIRSGDPPGNFQLDIDISGLIFTHHPCFSREHVLAAKLAQLYDQYLARHQRNKAKFLTDKLQALRNAVQTGLDPEKPHQSLDTIQKTINEYKSEIRQTRKFRDAEQEKDRTLLKTIIKVWKEMKSLREFQRFTNTPLKLVLRKEKADQKADEEAYEAEIQAEISELLEEHTEEYAQKMEEYRTSLQQWKAWRKVQRAKKKKRKQAAEEHPGDEIAEPYPEEDLVKPSPPEPTDRAVIEQEVRERAAQSRRRPWEPTLVPELSLAGSVTPNDQCPRAEVSRREDVKKRSVYLKVLFNNKEVSRTVSRPLGADFRVHFGQIFNLQIVNWPESLTLQVYETVGHSSPTLLAEVFLPIPETTVVTGRAPTEEVEFSSNQHVTLDHEGVGSGVPFSFEADGSNQLTLMTSGKVSHSVAWAIGENGIPLIPPLSQQNIGFRSALKKADAISSIGTSGLTDMKKLAKWAAESKLDPNDPNNAPLMQLISVATSGESYVPDFFRLEQLQQEFNFVSDQELNRSKRFRLLHLRSQEVPEFRNYKQVPVYDREIMEKVFQDYEKRLRDRNVIETKEHIDTHRAIVAKYLQQVRESVINRFLIAKQYFLLADMIVEEEVPNISILGLSLFKLAEQKRPLRPRRKGRKKVTAQNLSDGDIKLLVNIVRAYDIPVRKPAVSKFQQPSRSSRMFSEKHAASPSTYSPTHNADYPLGQVLVRPFVEVSFQRTVCHTTTAEGPNPSWNEELELPFRAPNGDYSTASLQSVKDVVFINIFDEVLHDVLEDDRERGSGIHTRIERHWLGCVKMPFSTIYFQARIDGTFKIDIPPVLLGYSKERNMILERGFDSVRSLSEGSYITLFITIEPQLVPGESIREKFESQEDEKLLQATEKFQAECALKFPNRQCLTTVIDISGKTVFITRYLKPLNPPQELLNVYPNNLQATAELVARYVSLIPFLPDTVSFGGICDLWSTSDQFLDLLAGDEEEHAVLLCNYFLSLGKKAWLLMGNAIPEGPTAYVLTWEQGRYLIWNPCSGHFYGQFDTFCPLKNVGCLIGPDNIWFNIQRYESPLRINFDVTRPKLWKSFFSRSLPYPGLSSVQPEELIYQRSDKAAAAELQDRIEKILKEKIMDWRPRHLTRWNRYCTSTLRHFLPLLEKSQGEDVEDDHRAELLKQLGDYRFSGFPLHMPYSEVKPLIDAVYSTGVHNIDVPNVEFALAVYIHPYPKNVLSVWIYVASLIRNR,mutated_sequence,1.0,1620.0,NP_001365544.1.a2m,NP_001365544.1.npy,ClinVar
+NP_001365898.1,NP_001365898.1.csv,MAAGVAAWLPFARAAAIGWMPVANCPMPLAPADKNKRQDELIVLNVSGRRFQTWRTTLERYPDTLLGSTEKEFFFNEDTKEYFFDRDPEVFRCVLNFYRTGKLHYPRYECISAYDDELAFYGILPEIIGDCCYEEYKDRKRENAERLMDDNDSENNQESMPSLSFRQTMWRAFENPHTSTLALVFYYVTGFFIAVSVITNVVETVPCGTVPGSKELPCGERYSVAFFCLDTACVMIFTVEYLLRLFAAPSRYRFIRSVMSIIDVVAIMPYYIGLVMTNNEDVSGAFVTLRVFRVFRIFKFSRHSQGLRILGYTLKSCASELGFLLFSLTMAIIIFATVMFYAEKGSSASKFTSIPASFWYTIVTMTTLGYGDMVPKTIAGKIFGSICSLSGVLVIALPVPVIVSNFSRIYHQNQRADKRRAQKKARLARIRVAKTGSSNAYLHSKRNGLLNEALELTGTPEEEHMGKTTSLIESQHHHLLHCLEKTTGLSYLVDDPLLSVRTSTIKNHEFIDEQMFEQNCMESSMQNYPSTRSPSLSSHPGLTTTCCSRRSKKTTHLPNSNLPATRLRSMQELSTIHIQGSEQPSLTTSRSSLNLKADDGLRPNCKTSQITTAIISIPTPPALTPEGESRPPPASPGPNTNIPSIASNVVKVSAL,mutated_sequence,1.0,655.0,NP_001365898.1.a2m,NP_001365898.1.npy,ClinVar
+NP_001365958.1,NP_001365958.1.csv,MSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDETEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESGYGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQKYFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGIDFKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKKKSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFSSVGTHRFVQKPYSRSSSMSSIDLVSASDDVHRFSSQVEEMVQNHMTYSLQDVGGDANWQLVVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHVVETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPLNNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKREYPKFLKRFTSYVQEKTAGKPILF,mutated_sequence,1.0,624.0,NP_001365958.1.a2m,NP_001365958.1.npy,ClinVar
+NP_001366039.1,NP_001366039.1.csv,MDEEIVSEKQAEESHRQDSANLLIFILLLTLTILTIWLFKHRRARFLHETGLAMIYGLLVGLVLRYGIHVPSDVNNVTLSCEVQSSPTTLLVNVSGKFYEYMLKGEISSHELNNVQDNEMLRKVTFDPEVFFNILLPPIIFYAGYSLKRRHFFRNLGSILAYAFLGTAISCFVIGSIMYGCVTLMKVTGQLAGDFYFTDCLLFGAIVSATDPVTVLAIFHELQVDVELYALLFGESVLNDAVAIVLSSSIVAYQPAGDNSHTFDVTAMFKSIGIFLGIFSGSFAMGAATGVVTALVTKFTKLREFQLLETGLFFLMSWSTFLLAEAWGFTGVVAVLFCGITQAHYTYNNLSTESQHRTKQLFELLNFLAENFIFSYMGLTLFTFQNHVFNPTFVVGAFVAIFLGRAANIYPLSLLLNLGRRSKIGSNFQHMMMFAGLRGAMAFALAIRDTATYARQMMFSTTLLIVFFTVWVFGGGTTAMLSCLHIRYCARLWGYRDELRHGLYPQGTHSLFGETDRVGVDSDQEHLGVPENERRTTKAESAWLFRMWYNFDHNYLKPLLTHSGPPLTTTLPACCGPIARCLTSPQAYENQEQLKDDDSDLILNDGDISLTYGDSTVNTEPATSSAPRRFMGNSSEDALDRELAFGDHELVIRGTRLVLPMDDSEPPLNLLDNTRHGPA,mutated_sequence,1.0,679.0,NP_001366039.1.a2m,NP_001366039.1.npy,ClinVar
+NP_001366220.1,NP_001366220.1.csv,MSAESGPGTRLRNLPVMGDGLETSQMSTTQAQAQPQPANAASTNPPPPETSNPNKPKRQTNQLQYLLRVVLKTLWKHQFAWPFQQPVDAVKLNLPDYYKIIKTPMDMGTIKKRLENNYYWNAQECIQDFNTMFTNCYIYNKPGDDIVLMAEALEKLFLQKINELPTEETEIMIVQAKGRGRGRKETGTAKPGVSTVPNTTQASTPPQTQTPQPNPPPVQATPHPFPAVTPDLIVQTPVMTVVPPQPLQTPPPVPPQPQPPPAPAPQPVQSHPPIIAATPQPVKTKKGVKRKADTTTPTTIDPIHEPPSLPPEPKTTKLGQRRESSRPVKPPKKDVPDSQQHPAPEKSSKVSEQLKCCSGILKEMFAKKHAAYAWPFYKPVDVEALGLHDYCDIIKHPMDMSTIKSKLEAREYRDAQEFGADVRLMFSNCYKYNPPDHEVVAMARKLQDVFEMRFAKMPDEPEEPVVAVSSPAVPPPTKVVAPPSSSDSSSDSSSDSDSSTDDSEEERAQRLAELQEQLKAVHEQLAALSQPQQNKPKKKEKDKKEKKKEKHKRKEEVEENKKSKAKEPPPKKTKKNNSSNSNVSKKEPAPMKSKPPPTYESEEEDKCKPMSYEEKRQLSLDINKLPGEKLGRVVHIIQSREPSLKNSNPDEIEIDFETLKPSTLRELERYVTSCLRKKRKPQAEKVDVIAGSSKMKGFSSSESESSSESSSSDSEDSETEMAPKSKKKGHPGREQKKHHHHHHQQMQQAPAPVPQQPPPPPQQPPPPPPPQQQQQPPPPPPPPSMPQQAAPAMKSSPPPFIATQVPVLEPQLPGSVFDPIGHFTQPILHLPQPELPPHLPQPPEHSTPPHLNQHAVVSPPALHNALPQQPSRPSNRAAALPPKPARPPAVSPALTQTPLLPQPPMAQPPQVLLEDEEPPAPPLTSMQMQLYLQQLQKVQPPTPLLPSVKVQSQPPPPLPPPPHPSVQQQLQQQPPPPPPPQPQPPPQQQHQPPPRPVHLQPMQFSTHIQQPPPPQGQQPPHPPPGQQPPPPQPAKPQQVIQHHHSPRHHKSDPYSTGHLREAPSPLMIHSPQMSQFQSLTHQSPPQQNVQPKKQELRAASVVQPQPLVVVKEEKIHSPIIRSEPFSPSLRPEPPKHPESIKAPVHLPQRPEMKPVDVGRPVIRPPEQNAPPPGAPDKDKQKQEPKTPVAPKKDLKIKNMGSWASLVQKHPTTPSSTAKSSSDSFEQFRRAAREKEEREKALKAQAEHAEKEKERLRQERMRSREDEDALEQARRAHEEARRRQEQQQQQRQEQQQQQQQQAAAVAAAATPQAQSSQPQSMLDQQRELARKREQERRRREAMAATIDMNFQSDLLSIFEENLF,mutated_sequence,1.0,1362.0,NP_001366220.1.a2m,NP_001366220.1.npy,ClinVar
+NP_001366332.1,NP_001366332.1.csv,MASLGEETLASASSSSDSDTGGASPPPRKKPRASAAEGVGEPGASAGRAGLSPPSSSSSSSSSSSSSVVVVVGLPPAAAPPAAAAVPHRSSGHSLVSGSIMQANGAGGGGGGGGGGGGGGGGGGGQGQTPELACLSAQNGESSPSSSSSAGDLAHANGLLPSAPSAASNNSNSLNVNNGVPGGAAAASSATVAAASATTAASSSLATPELGSSLKKKKRLSQSDEDVIRLIGQHLNGLGLNQTVDLLMQESGCRLEHPSATKFRNHVMEGDWDKAENDLNELKPLVHSPHAIVVRGALEISQTLLGIIVRMKFLLLQQKYLEYLEDGKVLEALQVLRCELTPLKYNTERIHVLSGYLMCSHAEDLRAKAEWEGKGTASRSKLLDKLQTYLPPSVMLPPRRLQTLLRQAVELQRDRCLYHNTKLDNNLDSVSLLIDHVCSRRQFPCYTQQILTEHCNEVWFCKFSNDGTKLATGSKDTTVIIWQVDPDTHLLKLLKTLEGHAYGVSYIAWSPDDNYLVACGPDDCSELWLWNVQTGELRTKMSQSHEDSLTSVAWNPDGKRFVTGGQRGQFYQCDLDGNLLDSWEGVRVQCLWCLSDGKTVLASDTHQRIRGYNFEDLTDRNIVQEDHPIMSFTISKNGRLALLNVATQGVHLWDLQDRVLVRKYQGVTQGFYTIHSCFGGHNEDFIASGSEDHKVYIWHKRSELPIAELTGHTRTVNCVSWNPQIPSMMASASDDGTVRIWGPAPFIDHQNIEEECSSMDS,mutated_sequence,1.0,761.0,NP_001366332.1.a2m,NP_001366332.1.npy,ClinVar
+NP_001366539.1,NP_001366539.1.csv,MKVTGIFLLSALALLSLSGNTGADSLGREAKCYNELNGCTKIYDPVCGTDGNTYPNECVLCFENRKRQTSILIQKSGPC,mutated_sequence,1.0,79.0,NP_001366539.1.a2m,NP_001366539.1.npy,ClinVar
+NP_001367.2,NP_001367.2.csv,MSEPGGGGGEDGSAGLEVSAVQNVADVSVLQKHLRKLVPLLLEDGGEAPAALEAALEEKSALEQMRKFLSDPQVHTVLVERSTLKEDVGDEGEEEKEFISYNINIDIHYGVKSNSLAFIKRTPVIDADKPVSSQLRVLTLSEDSPYETLHSFISNAVAPFFKSYIRESGKADRDGDKMAPSVEKKIAELEMGLLHLQQNIEIPEISLPIHPMITNVAKQCYERGEKPKVTDFGDKVEDPTFLNQLQSGVNRWIREIQKVTKLDRDPASGTALQEISFWLNLERALYRIQEKRESPEVLLTLDILKHGKRFHATVSFDTDTGLKQALETVNDYNPLMKDFPLNDLLSATELDKIRQALVAIFTHLRKIRNTKYPIQRALRLVEAISRDLSSQLLKVLGTRKLMHVAYEEFEKVMVACFEVFQTWDDEYEKLQVLLRDIVKRKREENLKMVWRINPAHRKLQARLDQMRKFRRQHEQLRAVIVRVLRPQVTAVAQQNQGEVPEPQDMKVAEVLFDAADANAIEEVNLAYENVKEVDGLDVSKEGTEAWEAAMKRYDERIDRVETRITARLRDQLGTAKNANEMFRIFSRFNALFVRPHIRGAIREYQTQLIQRVKDDIESLHDKFKVQYPQSQACKMSHVRDLPPVSGSIIWAKQIDRQLTAYMKRVEDVLGKGWENHVEGQKLKQDGDSFRMKLNTQEIFDDWARKVQQRNLGVSGRIFTIESTRVRGRTGNVLKLKVNFLPEIITLSKEVRNLKWLGFRVPLAIVNKAHQANQLYPFAISLIESVRTYERTCEKVEERNTISLLVAGLKKEVQALIAEGIALVWESYKLDPYVQRLAETVFNFQEKVDDLLIIEEKIDLEVRSLETCMYDHKTFSEILNRVQKAVDDLNLHSYSNLPIWVNKLDMEIERILGVRLQAGLRAWTQVLLGQAEDKAEVDMDTDAPQVSHKPGGEPKIKNVVHELRITNQVIYLNPPIEECRYKLYQEMFAWKMVVLSLPRIQSQRYQVGVHYELTEEEKFYRNALTRMPDGPVALEESYSAVMGIVSEVEQYVKVWLQYQCLWDMQAENIYNRLGEDLNKWQALLVQIRKARGTFDNAETKKEFGPVVIDYGKVQSKVNLKYDSWHKEVLSKFGQMLGSNMTEFHSQISKSRQELEQHSVDTASTSDAVTFITYVQSLKRKIKQFEKQVELYRNGQRLLEKQRFQFPPSWLYIDNIEGEWGAFNDIMRRKDSAIQQQVANLQMKIVQEDRAVESRTTDLLTDWEKTKPVTGNLRPEEALQALTIYEGKFGRLKDDREKCAKAKEALELTDTGLLSGSEERVQVALEELQDLKGVWSELSKVWEQIDQMKEQPWVSVQPRKLRQNLDALLNQLKSFPARLRQYASYEFVQRLLKGYMKINMLVIELKSEALKDRHWKQLMKRLHVNWVVSELTLGQIWDVDLQKNEAIVKDVLLVAQGEMALEEFLKQIREVWNTYELDLVNYQNKCRLIRGWDDLFNKVKEHINSVSAMKLSPYYKVFEEDALSWEDKLNRIMALFDVWIDVQRRWVYLEGIFTGSADIKHLLPVETQRFQSISTEFLALMKKVSKSPLVMDVLNIQGVQRSLERLADLLGKIQKALGEYLERERSSFPRFYFVGDEDLLEIIGNSKNVAKLQKHFKKMFAGVSSIILNEDNSVVLGISSREGEEVMFKTPVSITEHPKINEWLTLVEKEMRVTLAKLLAESVTEVEIFGKATSIDPNTYITWIDKYQAQLVVLSAQIAWSENVETALSSMGGGGDAAPLHSVLSNVEVTLNVLADSVLMEQPPLRRRKLEHLITELVHQRDVTRSLIKSKIDNAKSFEWLSQMRFYFDPKQTDVLQQLSIQMANAKFNYGFEYLGVQDKLVQTPLTDRCYLTMTQALEARLGGSPFGPAGTGKTESVKALGHQLGRFVLVFNCDETFDFQAMGRIFVGLCQVGAWGCFDEFNRLEERMLSAVSQQVQCIQEALREHSNPNYDKTSAPITCELLNKQVKVSPDMAIFITMNPGYAGRSNLPDNLKKLFRSLAMTKPDRQLIAQVMLYSQGFRTAEVLANKIVPFFKLCDEQLSSQSHYDFGLRALKSVLVSAGNVKRERIQKIKREKEERGEAVDEGEIAENLPEQEILIQSVCETMVPKLVAEDIPLLFSLLSDVFPGVQYHRGEMTALREELKKVCQEMYLTYGDGEEVGGMWVEKVLQLYQITQINHGLMMVGPSGSGKSMAWRVLLKALERLEGVEGVAHIIDPKAISKDHLYGTLDPNTREWTDGLFTHVLRKIIDSVRGELQKRQWIVFDGDVDPEWVENLNSVLDDNKLLTLPNGERLSLPPNVRIMFEVQDLKYATLATVSRCGMVWFSEDVLSTDMIFNNFLARLRSIPLDEGEDEAQRRRKGKEDEGEEAASPMLQIQRDAATIMQPYFTSNGLVTKALEHAFQLEHIMDLTRLRCLGSLFSMLHQACRNVAQYNANHPDFPMQIEQLERYIQRYLVYAILWSLSGDSRLKMRAELGEYIRRITTVPLPTAPNIPIIDYEVSISGEWSPWQAKVPQIEVETHKVAAPDVVVPTLDTVRHEALLYTWLAEHKPLVLCGPPGSGKTMTLFSALRALPDMEVVGLNFSSATTPELLLKTFDHYCEYRRTPNGVVLAPVQLGKWLVLFCDEINLPDMDKYGTQRVISFIRQMVEHGGFYRTSDQTWVKLERIQFVGACNPPTDPGRKPLSHRFLRHVPVVYVDYPGPASLTQIYGTFNRAMLRLIPSLRTYAEPLTAAMVEFYTMSQERFTQDTQPHYIYSPREMTRWVRGIFEALRPLETLPVEGLIRIWAHEALRLFQDRLVEDEERRWTDENIDTVALKHFPNIDREKAMSRPILYSNWLSKDYIPVDQEELRDYVKARLKVFYEEELDVPLVLFNEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAWMNGLSVYQIKVHRKYTGEDFDEDLRTVLRRSGCKNEKIAFIMDESNVLDSGFLERMNTLLANGEVPGLFEGDEYATLMTQCKEGAQKEGLMLDSHEELYKWFTSQVIRNLHVVFTMNPSSEGLKDRAATSPALFNRCVLNWFGDWSTEALYQVGKEFTSKMDLEKPNYIVPDYMPVVYDKLPQPPSHREAIVNSCVFVHQTLHQANARLAKRGGRTMAITPRHYLDFINHYANLFHEKRSELEEQQMHLNVGLRKIKETVDQVEELRRDLRIKSQELEVKNAAANDKLKKMVKDQQEAEKKKVMSQEIQEQLHKQQEVIADKQMSVKEDLDKVEPAVIEAQNAVKSIKKQHLVEVRSMANPPAAVKLALESICLLLGESTTDWKQIRSIIMRENFIPTIVNFSAEEISDAIREKMKKNYMSNPSYNYEIVNRASLACGPMVKWAIAQLNYADMLKRVEPLRNELQKLEDDAKDNQQKANEVEQMIRDLEASIARYKEEYAVLISEAQAIKADLAAVEAKVNRSTALLKSLSAERERWEKTSETFKNQMSTIAGDCLLSAAFIAYAGYFDQQMRQNLFTTWSHHLQQANIQFRTDIARTEYLSNADERLRWQASSLPADDLCTENAIMLKRFNRYPLIIDPSGQATEFIMNEYKDRKITRTSFLDDAFRKNLESALRFGNPLLVQDVESYDPVLNPVLNREVRRTGGRVLITLGDQDIDLSPSFVIFLSTRDPTVEFPPDLCSRVTFVNFTVTRSSLQSQCLNEVLKAERPDVDEKRSDLLKLQGEFQLRLRQLEKSLLQALNEVKGRILDDDTIITTLENLKREAAEVTRKVEETDIVMQEVETVSQQYLPLSTACSSIYFTMESLKQIHFLYQYSLQFFLDIYHNVLYENPNLKGVTDHTQRLSIITKDLFQVAFNRVARGMLHQDHITFAMLLARIKLKGTVGEPTYDAEFQHFLRGNEIVLSAGSTPRIQGLTVEQAEAVVRLSCLPAFKDLIAKVQADEQFGIWLDSSSPEQTVPYLWSEETPATPIGQAIHRLLLIQAFRPDRLLAMAHMFVSTNLGESFMSIMEQPLDLTHIVGTEVKPNTPVLMCSVPGYDASGHVEDLAAEQNTQITSIAIGSAEGFNQADKAINTAVKSGRWVMLKNVHLAPGWLMQLEKKLHSLQPHACFRLFLTMEINPKVPVNLLRAGRIFVFEPPPGVKANMLRTFSSIPVSRICKSPNERARLYFLLAWFHAIIQERLRYAPLGWSKKYEFGESDLRSACDTVDTWLDDTAKGRQNISPDKIPWSALKTLMAQSIYGGRVDNEFDQRLLNTFLERLFTTRSFDSEFKLACKVDGHKDIQMPDGIRREEFVQWVELLPDTQTPSWLGLPNNAERVLLTTQGVDMISKMLKMQMLEDEDDLAYAETEKKTRTDSTSDGRPAWMRTLHTTASNWLHLIPQTLSHLKRTVENIKDPLFRFFEREVKMGAKLLQDVRQDLADVVQVCEGKKKQTNYLRTLINELVKGILPRSWSHYTVPAGMTVIQWVSDFSERIKQLQNISLAAASGGAKELKNIHVCLGGLFVPEAYITATRQYVAQANSWSLEELCLEVNVTTSQGATLDACSFGVTGLKLQGATCNNNKLSLSNAISTALPLTQLRWVKQTNTEKKASVVTLPVYLNFTRADLIFTVDFEIATKEDPRSFYERGVAVLCTE,mutated_sequence,1.0,4646.0,NP_001367.2.a2m,NP_001367.2.npy,ClinVar
+NP_001368.2,NP_001368.2.csv,MANGTADVRKLFIFTTTQNYFGLMSELWDQPLLCNCLEINNFLDDGNQMLLRVQRSDAGISFSNTIEFGDTKDKVLVFFKLRPEVITDENLHDNILVSSMLESPISSLYQAVRQVFAPMLLKDQEWSRNFDPKLQNLLSELEAGLGIVLRRSDTNLTKLKFKEDDTRGILTPSDEFQFWIEQAHRGNKQISKERANYFKELFETIAREFYNLDSLSLLEVVDLVETTQDVVDDVWRQTEHDHYPESRMLHLLDIIGGSFGRFVQKKLGTLNLWEDPYYLVKESLKAGISICEQWVIVCNHLTGQVWQRYVPHPWKNEKYFPETLDKLGKRLEEVLAIRTIHEKFLYFLPASEEKIICLTRVFEPFTGLNPVQYNPYTEPLWKAAVSQYEKIIAPAEQKIAGKLKNYISEIQDSPQQLLQAFLKYKELVKRPTISKELMLERETLLARLVDSIKDFRLDFENRCRGIPGDASGPLSGKNLSEVVNSIVWVRQLELKVDDTIKIAEALLSDLPGFRCFHQSAKDLLDQLKLYEQEQFDDWSRDIQSGLSDSRSGLCIEASSRIMELDSNDGLLKVHYSDRLVILLREVRQLSALGFVIPAKIQQVANIAQKFCKQAIILKQVAHFYNSIDQQMIQSQRPMMLQSALAFEQIIKNSKAGSGGKSQITWDNPKELEGYIQKLQNAAERLATENRKLRKWHTTFCEKVVVLMNIDLLRQQQRWKDGLQELRTGLATVEAQGFQASDMHAWKQHWNHQLYKALEHQYQMGLEALNENLPEINIDLTYKQGRLQFRPPFEEIRAKYYREMKRFIGIPNQFKGVGEAGDESIFSIMIDRNASGFLTIFSKAEDLFRRLSAVLHQHKEWIVIGQVDMEALVEKHLFTVHDWEKNFKALKIKGKEVERLPSAVKVDCLNINCNPVKTVIDDLIQKLFDLLVLSLKKSIQAHLHEIDTFVTEAMEVLTIMPQSVEEIGDANLQYSKLQERKPEILPLFQEAEDKNRLLRTVAGGGLETISNLKAKWDKFELMMESHQLMIKDQIEVMKGNVKSRLQIYYQELEKFKARWDQLKPGDDVIETGQHNTLDKSAKLIKEKKIEFDDLEVTRKKLVDDCHHFRLEEPNFSLASSISKDIESCAQIWAFYEEFQQGFQEMANEDWITFRTKTYLFEEFLMNWHDRLRKVEEHSVMTVKLQSEVDKYKIVIPILKYVRGEHLSPDHWLDLFRLLGLPRGTSLEKLLFGDLLRVADTIVAKAADLKDLNSRAQGEVTIREALRELDLWGVGAVFTLIDYEDSQSRTMKLIKDWKDIVNQVGDNRCLLQSLKDSPYYKGFEDKVSIWERKLAELDEYLQNLNHIQRKWVYLEPIFGRGALPKEQTRFNRVDEDFRSIMTDIKKDNRVTTLTTHAGIRNSLLTILDQLQRCQKSLNEFLEEKRSAFPRFYFIGDDDLLEILGQSTNPSVIQSHLKKLFAGINSVCFDEKSKHITAMKSLEGEVVPFKNKVPLSNNVETWLNDLALEMKKTLEQLLKECVTTGRSSQGAVDPSLFPSQILCLAEQIKFTEDVENAIKDHSLHQIETQLVNKLEQYTNIDTSSEDPGNTESGILELKLKALILDIIHNIDVVKQLNQIQVHTTEDWAWKKQLRFYMKSDHTCCVQMVDSEFQYTYEYQGNASKLVYTPLTDKCYLTLTQAMKMGLGGNPYGPAGTGKTESVKALGGLLGRQVLVFNCDEGIDVKSMGRIFVGLVKCGAWGCFDEFNRLEESVLSAVSMQIQTIQDALKNHRTVCELLGKEVEVNSNSGIFITMNPAGKGYGGRQKLPDNLKQLFRPVAMSHPDNELIAEVILYSEGFKDAKVLSRKLVAIFNLSRELLTPQQHYDWGLRALKTVLRGSGNLLRQLNKSGTTQNANESHIVVQALRLNTMSKFTFTDCTRFDALIKDVFPGIELKEVEYDELSAALKQVFEEANYEIIPNQIKKALELYEQLCQRMGVVIVGPSGAGKSTLWRMLRAALCKTGKVVKQYTMNPKAMPRYQLLGHIDMDTREWSDGVLTNSARQVVREPQDVSSWIICDGDIDPEWIESLNSVLDDNRLLTMPSGERIQFGPNVNFVFETHDLSCASPATISRMGMIFLSDEETDLNSLIKSWLRNQPAEYRNNLENWIGDYFEKALQWVLKQNDYVVETSLVGTVMNGLSHLHGCRDHDEFIINLIRGLGGNLNMKSRLEFTKEVFHWARESPPDFHKPMDTYYDSTRGRLATYVLKKPEDLTADDFSNGLTLPVIQTPDMQRGLDYFKPWLSSDTKQPFILVGPEGCGKGMLLRYAFSQLRSTQIATVHCSAQTTSRHLLQKLSQTCMVISTNTGRVYRPKDCERLVLYLKDINLPKLDKWGTSTLVAFLQQVLTYQGFYDENLEWVGLENIQIVASMSAGGRLGRHKLTTRFTSIVRLCSIDYPEREQLQTIYGAYLEPVLHKNLKNHSIWGSSSKIYLLAGSMVQVYEQVRAKFTVDDYSHYFFTPCILTQWVLGLFRYDLEGGSSNHPLDYVLEIVAYEARRLFRDKIVGAKELHLFDIILTSVFQGDWGSDILDNMSDSFYVTWGARHNSGARAAPGQPLPPHGKPLGKLNSTDLKDVIKKGLIHYGRDNQNLDILLFHEVLEYMSRIDRVLSFPGGSLLLAGRSGVGRRTITSLVSHMHGAVLFSPKISRGYELKQFKNDLKHVLQLAGIEAQQVVLLLEDYQFVHPTFLEMINSLLSSGEVPGLYTLEELEPLLLPLKDQASQDGFFGPVFNYFTYRIQQNLHIVLIMDSANSNFMINCESNPALHKKCQVLWMEGWSNSSMKKIPEMLFSETGGGEKYNDKKRKEEKKKNSVDPDFLKSFLLIHESCKAYGATPSRYMTFLHVYSAISSSKKKELLKRQSHLQAGVSKLNEAKALVDELNRKAGEQSVLLKTKQDEADAALQMITVSMQDASEQKTELERLKHRIAEEVVKIEERKNKIDDELKEVQPLVNEAKLAVGNIKPESLSEIRSLRMPPDVIRDILEGVLRLMGIFDTSWVSMKSFLAKRGVREDIATFDARNISKEIRESVEELLFKNKGSFDPKNAKRASTAAAPLAAWVKANIQYSHVLERIHPLETEQAGLESNLKKTEDRKRKLEELLNSVGQKVSELKEKFQSRTSEAAKLEAEVSKAQETIKAAEVLINQLDREHKRWNAQVVEITEELATLPKRAQLAAAFITYLSAAPESLRKTCLEEWTKSAGLEKFDLRRFLCTESEQLIWKSEGLPSDDLSIENALVILQSRVCPFLIDPSSQATEWLKTHLKDSRLEVINQQDSNFITALELAVRFGKTLIIQEMDGVEPVLYPLLRRDLVAQGPRYVVQIGDKIIDYNEEFRLFLSTRNPNPFIPPDAASIVTEVNFTTTRSGLRGQLLALTIQHEKPDLEEQKTKLLQQEEDKKIQLAKLEESLLETLATSQGNILENKDLIESLNQTKASSALIQESLKESYKLQISLDQERDAYLPLAESASKMYFIISDLSKINNMYRFSLAAFLRLFQRALQNKQDSENTEQRIQSLISSLQHMVYEYICRCLFKADQLMFALHFVRGMHPELFQENEWDTFTGVVVGDMLRKADSQQKIRDQLPSWIDQERSWAVATLKIALPSLYQTLCFEDAALWRTYYNNSMCEQEFPSILAKKVSLFQQILVVQALRPDRLQSAMALFACKTLGLKEVSPLPLNLKRLYKETLEIEPILIIISPGADPSQELQELANAERSGECYHQVAMGQGQADLAIQMLKECARNGDWLCLKNLHLVVSWLPVLEKELNTLQPKDTFRLWLTAEVHPNFTPILLQSSLKITYESPPGLKKNLMRTYESWTPEQISKKDNTHRAHALFSLAWFHAACQERRNYIPQGWTKFYEFSLSDLRAGYNIIDRLFDGAKDVQWEFVHGLLENAIYGGRIDNYFDLRVLQSYLKQFFNSSVIDVFNQRNKKSIFPYSVSLPQSCSILDYRAVIEKIPEDDKPSFFGLPANIARSSQRMISSQVISQLRILGRSITAGSKFDREIWSNELSPVLNLWKKLNQNSNLIHQKVPPPNDRQGSPILSFIILEQFNAIRLVQSVHQSLAALSKVIRGTTLLSSEVQKLASALLNQKCPLAWQSKWEGPEDPLQYLRGLVARALAIQNWVDKAEKQALLSETLDLSELFHPDTFLNALRQETARAVGRSVDSLKFVASWKGRLQEAKLQIKISGLLLEGCSFDGNQLSENQLDSPSVSSVLPCFMGWIPQDACGPYSPDECISLPVYTSAERDRVVTNIDVPCGGNQDQWIQCGAALFLKNQ,mutated_sequence,1.0,4307.0,NP_001368.2.a2m,NP_001368.2.npy,ClinVar
+NP_001371839.1,NP_001371839.1.csv,MGNVMEGKSVEELSSTECHQWYKKFMTECPSGQLTLYEFRQFFGLKNLSPSASQYVEQMFETFDFNKDGYIDFMEYVAALSLVLKGKVEQKLRWYFKLYDVDGNGCIDRDELLTIIQAIRAINPCSDTTMTAEEFTDTVFSKIDVNGDGELSLEEFIEGVQKDQMLLDTLTRSLDLTRIVRRLQNGEQDEEGADEAAEAAG,mutated_sequence,1.0,201.0,NP_001371839.1.a2m,NP_001371839.1.npy,ClinVar
+NP_001373322.1,NP_001373322.1.csv,MGGLLGRQRLLLRMGGGRLGAPMERHGRASATSVSSAGEQAAGDPEGRRQEPLRRRASSASVPAVGASAEGTRRDRLGSYSGPTSVSRQRVESLRKKRPLFPWFGLDIGGTLVKLVYFEPKDITAEEEEEEVESLKSIRKYLTSNVAYGSTGIRDVHLELKDLTLCGRKGNLHFIRFPTHDMPAFIQMGRDKNFSSLHTVFCATGGGAYKFEQDFLTIGDLQLCKLDELDCLIKGILYIDSVGFNGRSQCYYFENPADSEKCQKLPFDLKNPYPLLLVNIGSGVSILAVYSKDNYKRVTGTSLGGGTFFGLCCLLTGCTTFEEALEMASRGDSTKVDKLVRDIYGGDYERFGLPGWAVASSFGNMMSKEKREAVSKEDLARATLITITNNIGSIARMCALNENINQVVFVGNFLRINTIAMRLLAYALDYWSKGQLKALFSEHEGYFGAVGALLELLKIP,mutated_sequence,1.0,460.0,NP_001373322.1.a2m,NP_001373322.1.npy,ClinVar
+NP_001405.1,NP_001405.1.csv,MDDKELIEYFKSQMKEDPDMASAVAAIRTLLEFLKRDKGETIQGLRANLTSAIETLCGVDSSVAVSSGGELFLRFISLASLEYSDYSKCKKIMIERGELFLRRISLSRNKIADLCHTFIKDGATILTHAYSRVVLRVLEAAVAAKKRFSVYVTESQPDLSGKKMAKALCHLNVPVTVVLDAAVGYIMEKADLVIVGAEGVVENGGIINKIGTNQMAVCAKAQNKPFYVVAESFKFVRLFPLNQQDVPDKFKYKADTLKVAQTGQDLKEEHPWVDYTAPSLITLLFTDLGVLTPSAVSDELIKLYL,mutated_sequence,1.0,305.0,NP_001405.1.a2m,NP_001405.1.npy,ClinVar
+NP_001420.2,NP_001420.2.csv,MAENVVEPGPPSAKRPKLSSPALSASASDGTDFGSLFDLEHDLPDELINSTELGLTNGGDINQLQTSLGMVQDAASKHKQLSELLRSGSSPNLNMGVGGPGQVMASQAQQSSPGLGLINSMVKSPMTQAGLTSPNMGMGTSGPNQGPTQSTGMMNSPVNQPAMGMNTGMNAGMNPGMLAAGNGQGIMPNQVMNGSIGAGRGRQNMQYPNPGMGSAGNLLTEPLQQGSPQMGGQTGLRGPQPLKMGMMNNPNPYGSPYTQNPGQQIGASGLGLQIQTKTVLSNNLSPFAMDKKAVPGGGMPNMGQQPAPQVQQPGLVTPVAQGMGSGAHTADPEKRKLIQQQLVLLLHAHKCQRREQANGEVRQCNLPHCRTMKNVLNHMTHCQSGKSCQVAHCASSRQIISHWKNCTRHDCPVCLPLKNAGDKRNQQPILTGAPVGLGNPSSLGVGQQSAPNLSTVSQIDPSSIERAYAALGLPYQVNQMPTQPQVQAKNQQNQQPGQSPQGMRPMSNMSASPMGVNGGVGVQTPSLLSDSMLHSAINSQNPMMSENASVPSLGPMPTAAQPSTTGIRKQWHEDITQDLRNHLVHKLVQAIFPTPDPAALKDRRMENLVAYARKVEGDMYESANNRAEYYHLLAEKIYKIQKELEEKRRTRLQKQNMLPNAAGMVPVSMNPGPNMGQPQPGMTSNGPLPDPSMIRGSVPNQMMPRITPQSGLNQFGQMSMAQPPIVPRQTPPLQHHGQLAQPGALNPPMGYGPRMQQPSNQGQFLPQTQFPSQGMNVTNIPLAPSSGQAPVSQAQMSSSSCPVNSPIMPPGSQGSHIHCPQLPQPALHQNSPSPVPSRTPTPHHTPPSIGAQQPPATTIPAPVPTPPAMPPGPQSQALHPPPRQTPTPPTTQLPQQVQPSLPAAPSADQPQQQPRSQQSTAASVPTPTAPLLPPQPATPLSQPAVSIEGQVSNPPSTSSTEVNSQAIAEKQPSQEVKMEAKMEVDQPEPADTQPEDISESKVEDCKMESTETEERSTELKTEIKEEEDQPSTSATQSSPAPGQSKKKIFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKSPMDLSTIKRKLDTGQYQEPWQYVDDIWLMFNNAWLYNRKTSRVYKYCSKLSEVFEQEIDPVMQSLGYCCGRKLEFSPQTLCCYGKQLCTIPRDATYYSYQNRYHFCEKCFNEIQGESVSLGDDPSQPQTTINKEQFSKRKNDTLDPELFVECTECGRKMHQICVLHHEIIWPAGFVCDGCLKKSARTRKENKFSAKRLPSTRLGTFLENRVNDFLRRQNHPESGEVTVRVVHASDKTVEVKPGMKARFVDSGEMAESFPYRTKALFAFEEIDGVDLCFFGMHVQEYGSDCPPPNQRRVYISYLDSVHFFRPKCLRTAVYHEILIGYLEYVKKLGYTTGHIWACPPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLTSAKELPYFEGDFWPNVLEESIKELEQEEEERKREENTSNESTDVTKGDSKNAKKKNNKKTSKNKSSLSRGNKKKPGMPNVSNDLSQKLYATMEKHKEVFFVIRLIAGPAANSLPPIVDPDPLIPCDLMDGRDAFLTLARDKHLEFSSLRRAQWSTMCMLVELHTQSQDRFVYTCNECKHHVETRWHCTVCEDYDLCITCYNTKNHDHKMEKLGLGLDDESNNQQAAATQSPGDSRRLSIQRCIQSLVHACQCRNANCSLPSCQKMKRVVQHTKGCKRKTNGGCPICKQLIALCCYHAKHCQENKCPVPFCLNIKQKLRQQQLQHRLQQAQMLRRRMASMQRTGVVGQQQGLPSPTPATPTTPTGQQPTTPQTPQPTSQPQPTPPNSMPPYLPRTQAAGPVSQGKAAGQVTPPTPPQTAQPPLPGPPPAAVEMAMQIQRAAETQRQMAHVQIFQRPIQHQMPPMTPMAPMGMNPPPMTRGPSGHLEPGMGPTGMQQQPPWSQGGLPQPQQLQSGMPRPAMMSVAQHGQPLNMAPQPGLGQVGISPLKPGTVSQQALQNLLRTLRSPSSPLQQQQVLSILHANPQLLAAFIKQRAAKYANSNPQPIPGQPGMPQGQPGLQPPTMPGQQGVHSNPAMQNMNPMQAGVQRAGLPQQQPQQQLQPPMGGMSPQAQQMNMNHNTMPSQFRDILRRQQMMQQQQQQGAGPGIGPGMANHNQFQQPQGVGYPPQQQQRMQHHMQQMQQGNMGQIGQLPQALGAEAGASLQAYQQRLLQQQMGSPVQPNPMSPQQHMLPNQAQSPHLQGQQIPNSLSNQVRSPQPVPSPRPQSQPPHSSPSPRMQPQPSPHHVSPQTSSPHPGLVAAQANPMEQGHFASPDQNSMLSQLASNPGMANLHGASATDLGLSTDNSDLNSNLSQSTLDIH,mutated_sequence,1.0,2414.0,NP_001420.2.a2m,NP_001420.2.npy,ClinVar
+NP_001444.2,NP_001444.2.csv,MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLYRTSGAFVYDCSKF,mutated_sequence,1.0,553.0,NP_001444.2.a2m,NP_001444.2.npy,ClinVar
+NP_001483.3,NP_001483.3.csv,MPPPQQGPCGHHLLLLLALLLPSLPLTRAPVPPGPAAALLQALGLRDEPQGAPRLRPVPPVMWRLFRRRDPQETRSGSRRTSPGVTLQPCHVEELGVAGNIVRHIPDRGAPTRASEPASAAGHCPEWTVVFDLSAVEPAERPSRARLELRFAAAAAAAPEGGWELSVAQAGQGAGADPGPVLLRQLVPALGPPVRAELLGAAWARNASWPRSLRLALALRPRAPAACARLAEASLLLVTLDPRLCHPLARPRRDAEPVLGGGPGGACRARRLYVSFREVGWHRWVIAPRGFLANYCQGQCALPVALSGSGGPPALNHAVLRALMHAAAPGAADLPCCVPARLSPISVLFFDNSDNVVLRQYEDMVVDECGCR,mutated_sequence,1.0,372.0,NP_001483.3.a2m,NP_001483.3.npy,ClinVar
+NP_001646.2,NP_001646.2.csv,MVLLAAAVCTKAGKAIVSRQFVEMTRTRIEGLLAAFPKLMNTGKQHTFVETESVRYVYQPMEKLYMVLITTKNSNILEDLETLRLFSRVIPEYCRALEENEISEHCFDLIFAFDEIVALGYRENVNLAQIRTFTEMDSHEEKVFRAVRETQEREAKAEMRRKAKELQQARRDAERQGKKAPGFGGFGSSAVSGGSTAAMITETIIETDKPKVAPAPARPSGPSKALKLGAKGKEVDNFVDKLKSEGETIMSSSMGKRTSEATKMHAPPINMESVHMKIEEKITLTCGRDGGLQNMELHGMIMLRISDDKYGRIRLHVENEDKKGVQLQTHPNVDKKLFTAESLIGLKNPEKSFPVNSDVGVLKWRLQTTEESFIPLTINCWPSESGNGCDVNIEYELQEDNLELNDVVITIPLPSGVGAPVIGEIDGEYRHDSRRNTLEWCLPVIDAKNKSGSLEFSIAGQPNDFFPVQVSFVSKKNYCNIQVTKVTQVDGNSPVRFSTETTFLVDKYEIL,mutated_sequence,1.0,511.0,NP_001646.2.a2m,NP_001646.2.npy,ClinVar
+NP_001664.3,NP_001664.3.csv,MCGIWALFGSDDCLSVQCLSAMKIAHRGPDAFRFENVNGYTNCCFGFHRLAVVDPLFGMQPIRVKKYPYLWLCYNGEIYNHKKMQQHFEFEYQTKVDGEIILHLYDKGGIEQTICMLDGVFAFVLLDTANKKVFLGRDTYGVRPLFKAMTEDGFLAVCSEAKGLVTLKHSATPFLKVEPFLPGHYEVLDLKPNGKVASVEMVKYHHCRDVPLHALYDNVEKLFPGFEIETVKNNLRILFNNAVKKRLMTDRRIGCLLSGGLDSSLVAATLLKQLKEAQVQYPLQTFAIGMEDSPDLLAARKVADHIGSEHYEVLFNSEEGIQALDEVIFSLETYDITTVRASVGMYLISKYIRKNTDSVVIFSGEGSDELTQGYIYFHKAPSPEKAEEESERLLRELYLFDVLRADRTTAAHGLELRVPFLDHRFSSYYLSLPPEMRIPKNGIEKHLLRETFEDSNLIPKEILWRPKEAFSDGITSVKNSWFKILQEYVEHQVDDAMMANAAQKFPFNTPKTKEGYYYRQVFERHYPGRADWLSHYWMPKWINATDPSARTLTHYKSAVKA,mutated_sequence,1.0,561.0,NP_001664.3.a2m,NP_001664.3.npy,ClinVar
+NP_001684.2,NP_001684.2.csv,MALRAMRGIVNGAAPELPVPTGGPAVGAREQALAVSRNYLSQPRLTYKTVSGVNGPLVILDHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGTSGIDAKKTSCEFTGDILRTPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINPQCRIYPEEMIQTGISAIDGMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKDVVDYSEENFAIVFAAMGVNMETARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDDITHPIPDLTGYITEGQIYVDRQLHNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHADVSNQLYACYAIGKDVQAMKAVVGEEALTSDDLLYLEFLQKFERNFIAQGPYENRTVFETLDIGWQLLRIFPKEMLKRIPQSTLSEFYPRDSAKH,mutated_sequence,1.0,511.0,NP_001684.2.a2m,NP_001684.2.npy,ClinVar
+NP_001724.4,NP_001724.4.csv,MWLLYLLVPALFCRAGGSIPIPQKLFGEVTSPLFPKPYPNNFETTTVITVPTGYRVKLVFQQFDLEPSEGCFYDYVKISADKKSLGRFCGQLGSPLGNPPGKKEFMSQGNKMLLTFHTDFSNEENGTIMFYKGFLAYYQAVDLDECASRSKSGEEDPQPQCQHLCHNYVGGYFCSCRPGYELQEDRHSCQAECSSELYTEASGYISSLEYPRSYPPDLRCNYSIRVERGLTLHLKFLEPFDIDDHQQVHCPYDQLQIYANGKNIGEFCGKQRPPDLDTSSNAVDLLFFTDESGDSRGWKLRYTTEIIKCPQPKTLDEFTIIQNLQPQYQFRDYFIATCKQGYQLIEGNQVLHSFTAVCQDDGTWHRAMPRCKIKDCGQPRNLPNGDFRYTTTMGVNTYKARIQYYCHEPYYKMQTRAGSRESEQGVYTCTAQGIWKNEQKGEKIPRCLPVCGKPVNPVEQRQRIIGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKEHEAQSNASLDVFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENSVTLGPNLLPICLPDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWLRGKNRMDVFSQNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSRGYGFYTKVLNYVDWIKKEMEEED,mutated_sequence,1.0,705.0,NP_001724.4.a2m,NP_001724.4.npy,ClinVar
+NP_001725.1,NP_001725.1.csv,MWCIVLFSLLAWVYAEPTMYGEILSPNYPQAYPSEVEKSWDIEVPEGYGIHLYFTHLDIELSENCAYDSVQIISGDTEEGRLCGQRSSNNPHSPIVEEFQVPYNKLQVIFKSDFSNEERFTGFAAYYVATDINECTDFVDVPCSHFCNNFIGGYFCSCPPEYFLHDDMKNCGVNCSGDVFTALIGEIASPNYPKPYPENSRCEYQIRLEKGFQVVVTLRREDFDVEAADSAGNCLDSLVFVAGDRQFGPYCGHGFPGPLNIETKSNALDIIFQTDLTGQKKGWKLRYHGDPMPCPKEDTPNSVWEPAKAKYVFRDVVQITCLDGFEVVEGRVGATSFYSTCQSNGKWSNSKLKCQPVDCGIPESIENGKVEDPESTLFGSVIRYTCEEPYYYMENGGGGEYHCAGNGSWVNEVLGPELPKCVPVCGVPREPFEEKQRIIGGSDADIKNFPWQVFFDNPWAGGALINEYWVLTAAHVVEGNREPTMYVGSTSVQTSRLAKSKMLTPEHVFIHPGWKLLEVPEGRTNFDNDIALVRLKDPVKMGPTVSPICLPGTSSDYNLMDGDLGLISGWGRTEKRDRAVRLKAARLPVAPLRKCKEVKVEKPTADAEAYVFTPNMICAGGEKGMDSCKGDSGGAFAVQDPNDKTKFYAAGLVSWGPQCGTYGLYTRVKNYVDWIMKTMQENSTPRED,mutated_sequence,1.0,688.0,NP_001725.1.a2m,NP_001725.1.npy,ClinVar
+NP_001796.2,NP_001796.2.csv,MSHGTYYECEPRGGQQPLEFSGGRAGPGELGDMCEHEASIDLSAYIESGEEQLLSDLFAVKPAPEARGLKGPGTPAFPHYLPPDPRPFAYPPHTFGPDRKALGPGIYSSPGSYDPRAVAVKEEPRGPEGSRAASRGSYNPLQYQVAHCGQTAMHLPPTLAAPGQPLRVLKAPLATAAPPCSPLLKAPSPAGPLHKGKKAVNKDSLEYRLRRERNNIAVRKSRDKAKRRILETQQKVLEYMAENERLRSRVEQLTQELDTLRNLFRQIPEAANLIKGVGGCS,mutated_sequence,1.0,281.0,NP_001796.2.a2m,NP_001796.2.npy,ClinVar
+NP_001805.4,NP_001805.4.csv,MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNIAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL,mutated_sequence,1.0,463.0,NP_001805.4.a2m,NP_001805.4.npy,ClinVar
+NP_001835.3,NP_001835.3.csv,MIRLGAPQTLVLLTLLVAAVLRCQGQDVQEAGSCVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVKDCLSPEIPFGECCPICPTDLATASGQPGPKGQKGEPGDIKDIVGPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAGPPGNPGPPGPPGPPGPGIDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL,mutated_sequence,1.0,1487.0,NP_001835.3.a2m,NP_001835.3.npy,ClinVar
+NP_001839.2,NP_001839.2.csv,MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVKGEMGYPGLPGCKGSPGFDGIQGPPGPKGDPGAFGLKGEKGEPGADGEAGRPGSSGPSGDEGQPGEPGPPGEKGEAGDEGNPGPDGAPGERGGPGERGPRGTPGTRGPRGDPGEAGPQGDQGREGPVGVPGDPGEAGPIGPKGYRGDEGPPGSEGARGAPGPAGPPGDPGLMGERGEDGPAGNGTEGFPGFPGYPGNRGAPGINGTKGYPGLKGDEGEAGDPGDDNNDIAPRGVKGAKGYRGPEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSESIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTPLNVLCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLEDAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG,mutated_sequence,1.0,1028.0,NP_001839.2.a2m,NP_001839.2.npy,ClinVar
+NP_001840.3,NP_001840.3.csv,MLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERAREEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKVSCLEIPGPSGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEAGSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDPGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPGEKGEPGPRGPEGGRGDFGLKGEPGRKGEKGEPADPGPPGEPGPRGPRGVPGPEGEPGPPGDPGLTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHEKHESENLYSIACDKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTELSVAQCTQRPVDIVFLLDGSERLGEQNFHKARRFVEQVARRLTLARRDDDPLNARVALLQFGGPGEQQVAFPLSHNLTAIHEALETTQYLNSFSHVGAGVVHAINAIVRSPRGGARRHAELSFVFLTDGVTGNDSLHESAHSMRKQNVVPTVLALGSDVDMDVLTTLSLGDRAAVFHEKDYDSLAQPGFFDRFIRWIC,mutated_sequence,1.0,1019.0,NP_001840.3.a2m,NP_001840.3.npy,ClinVar
+NP_001845.3,NP_001845.3.csv,MEPWSSRWKTKRWLWDFTVTTLALTFLFQAREVRGAAPVDVLKALDFHNSPEGISKTTGFCTNRKNSKGSDTAYRVSKQAQLSAPTKQLFPGGTFPEDFSILFTVKPKKGIQSFLLSIYNEHGIQQIGVEVGRSPVFLFEDHTGKPAPEDYPLFRTVNIADGKWHRVAISVEKKTVTMIVDCKKKTTKPLDRSERAIVDTNGITVFGTRILDEEVFEGDIQQFLITGDPKAAYDYCEHYSPDCDSSAPKAAQAQEPQIDEYAPEDIIEYDYEYGEAEYKEAESVTEGPTVTEETIAQTEANIVDDFQEYNYGTMESYQTEAPRHVSGTNEPNPVEEIFTEEYLTGEDYDSQRKNSEDTLYENKEIDGRDSDLLVDGDLGEYDFYEYKEYEDKPTSPPNEEFGPGVPAETDITETSINGHGAYGEKGQKGEPAVVEPGMLVEGPPGPAGPAGIMGPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPTISAQEAQAQAILQQARIALRGPPGPMGLTGRPGPVGGPGSSGAKGESGDPGPQGPRGVQGPPGPTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPPGPPGDDGMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMGPQGEPGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGALGPPGPQGPIGYPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGPRGEDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTGFPGFPGANGEKGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPGGVVGPQGPTGETGPIGERGHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGERGLPGAQGAPGLKGGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLPGPAGPAGSPGEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAPGIAGGDGEPGPRGQQGMFGQKGDEGARGFPGPPGPIGLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEAGVGGPKGERGEKGEAGPPGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPPGEPGPAGQDGVGGDKGEDGDPGQPGPPGPSGEAGPPGPPGKRGPPGAAGAEGRQGEKGAKGEAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLPGLKGDPGSKGEKGHPGLIGLIGPPGEQGEKGDRGLPGTQGSPGAKGDGGIPGPAGPLGPPGPPGLPGPQGPKGNKGSTGPAGQKGDSGLPGPPGSPGPPGEVIQPLPILSSKKTRRHTEGMQADADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTNPARTCKDLQLSHPDFPDGEYWIDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRISSWPKEKPGSWFSEFKRGKLLSYLDVEGNSINMVQMTFLKLLTASARQNFTYHCHQSAAWYDVSSGSYDKALRFLGSNDEEMSYDNNPFIKTLYDGCASRKGYEKTVIEINTPKIDQVPIVDVMINDFGDQNQKFGFEVGPVCFLG,mutated_sequence,1.0,1806.0,NP_001845.3.a2m,NP_001845.3.npy,ClinVar
+NP_001866.2,NP_001866.2.csv,MTRILTAFKVVRTLKTGFGFTNVTAHQKWKFSRPGIRLLSVKAQTAHIVLEDGTKMKGYSFGHPSSVAGEVVFNTGLGGYPEAITDPAYKGQILTMANPIIGNGGAPDTTALDELGLSKYLESNGIKVSGLLVLDYSKDYNHWLATKSLGQWLQEEKVPAIYGVDTRMLTKIIRDKGTMLGKIEFEGQPVDFVDPNKQNLIAEVSTKDVKVYGKGNPTKVVAVDCGIKNNVIRLLVKRGAEVHLVPWNHDFTKMEYDGILIAGGPGNPALAEPLIQNVRKILESDRKEPLFGISTGNLITGLAAGAKTYKMSMANRGQNQPVLNITNKQAFITAQNHGYALDNTLPAGWKPLFVNVNDQTNEGIMHESKPFFAVQFHPEVTPGPIDTEYLFDSFFSLIKKGKATTITSVLPKPALVASRVEVSKVLILGSGGLSIGQAGEFDYSGSQAVKAMKEENVKTVLMNPNIASVQTNEVGLKQADTVYFLPITPQFVTEVIKAEQPDGLILGMGGQTALNCGVELFKRGVLKEYGVKVLGTSVESIMATEDRQLFSDKLNEINEKIAPSFAVESIEDALKAADTIGYPVMIRSAYALGGLGSGICPNRETLMDLSTKAFAMTNQILVEKSVTGWKEIEYEVVRDADDNCVTVCNMENVDAMGVHTGDSVVVAPAQTLSNAEFQMLRRTSINVVRHLGIVGECNIQFALHPTSMEYCIIEVNARLSRSSALASKATGYPLAFIAAKIALGIPLPEIKNVVSGKTSACFEPSLDYMVTKIPRWDLDRFHGTSSRIGSSMKSVGEVMAIGRTFEESFQKALRMCHPSIEGFTPRLPMNKEWPSNLDLRKELSEPSSTRIYAIAKAIDDNMSLDEIEKLTYIDKWFLYKMRDILNMEKTLKGLNSESMTEETLKRAKEIGFSDKQISKCLGLTEAQTRELRLKKNIHPWVKQIDTLAAEYPSVTNYLYVTYNGQEHDVNFDDHGMMVLGCGPYHIGSSVEFDWCAVSSIRTLRQLGKKTVVVNCNPETVSTDFDECDKLYFEELSLERILDIYHQEACGGCIISVGGQIPNNLAVPLYKNGVKIMGTSPLQIDRAEDRSIFSAVLDELKVAQAPWKAVNTLNEALEFAKSVDYPCLLRPSYVLSGSAMNVVFSEDEMKKFLEEATRVSQEHPVVLTKFVEGAREVEMDAVGKDGRVISHAISEHVEDAGVHSGDATLMLPTQTISQGAIEKVKDATRKIAKAFAISGPFNVQFLVKGNDVLVIECNLRASRSFPFVSKTLGVDFIDVATKVMIGENVDEKHLPTLDHPIIPADYVAIKAPMFSWPRLRDADPILRCEMASTGEVACFGEGIHTAFLKAMLSTGFKIPQKGILIGIQQSFRPRFLGVAEQLHNEGFKLFATEATSDWLNANNVPATPVAWPSQEGQNPSLSSIRKLIRDGSIDLVINLPNNNTKFVHDNYVIRRTAVDSGIPLLTNFQVTKLFAEAVQKSRKVDSKSLFHYRQYSAGKAA,mutated_sequence,1.0,1500.0,NP_001866.2.a2m,NP_001866.2.npy,ClinVar
+NP_001867.2,NP_001867.2.csv,MAEAHQAVAFQFTVTPDGIDLRLSHEALRQIYLSGLHSWKKKFIRFKNGIITGVYPASPSSWLIVVVGVMTTMYAKIDPSLGIIAKINRTLETANCMSSQTKNVVSGVLFGTGLWVALIVTMRYSLKVLLSYHGWMFTEHGKMSRATKIWMGMVKIFSGRKPMLYSFQTSLPRLPVPAVKDTVNRYLQSVRPLMKEEDFKRMTALAQDFAVGLGPRLQWYLKLKSWWATNYVSDWWEEYIYLRGRGPLMVNSNYYAMDLLYILPTHIQAARAGNAIHAILLYRRKLDREEIKPIRLLGSTIPLCSAQWERMFNTSRIPGEETDTIQHMRDSKHIVVYHRGRYFKVWLYHDGRLLKPREMEQQMQRILDNTSEPQPGEARLAALTAGDRVPWARCRQAYFGRGKNKQSLDAVEKAAFFVTLDETEEGYRSEDPDTSMDSYAKSLLHGRCYDRWFDKSFTFVVFKNGKMGLNAEHSWADAPIVAHLWEYVMSIDSLQLGYAEDGHCKGDINPNIPYPTRLQWDIPGECQEVIETSLNTANLLANDVDFHSFPFVAFGKGIIKKCRTSPDAFVQLALQLAHYKDMGKFCLTYEASMTRLFREGRTETVRSCTTESCDFVRAMVDPAQTVEQRLKLFKLASEKHQHMYRLAMTGSGIDRHLFCLYVVSKYLAVESPFLKEVLSEPWRLSTSQTPQQQVELFDLENNPEYVSSGGGFGPVADDGYGVSYILVGENLINFHISSKFSCPETDSHRFGRHLKEAMTDIITLFGLSSNSKK,mutated_sequence,1.0,773.0,NP_001867.2.a2m,NP_001867.2.npy,ClinVar
+NP_001877.1,NP_001877.1.csv,MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ,mutated_sequence,1.0,196.0,NP_001877.1.a2m,NP_001877.1.npy,ClinVar
+NP_001895.1,NP_001895.1.csv,MATQADLMELDMAMEPDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSGKGNPEEEDVDTSQVLYEWEQGFSQSFTQEQVADIDGQYAMTRAQRVRAAMFPETLDEGMQIPSTQFDAAHPTNVQRLAEPSQMLKHAVVNLINYQDDAELATRAIPELTKLLNDEDQVVVNKAAVMVHQLSKKEASRHAIMRSPQMVSAIVRTMQNTNDVETARCTAGTLHNLSHHREGLLAIFKSGGIPALVKMLGSPVDSVLFYAITTLHNLLLHQEGAKMAVRLAGGLQKMVALLNKTNVKFLAITTDCLQILAYGNQESKLIILASGGPQALVNIMRTYTYEKLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALGLHLTDPSQRLVQNCLWTLRNLSDAATKQEGMEGLLGTLVQLLGSDDINVVTCAAGILSNLTCNNYKNKMMVCQVGGIEALVRTVLRAGDREDITEPAICALRHLTSRHQEAEMAQNAVRLHYGLPVVVKLLHPPSHWPLIKATVGLIRNLALCPANHAPLREQGAIPRLVQLLVRAHQDTQRRTSMGGTQQQFVEGVRMEEIVEGCTGALHILARDVHNRIVIRGLNTIPLFVQLLYSPIENIQRVAAGVLCELAQDKEAAEAIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSEDKPQDYKKRLSVELTSSLFRTEPMAWNETADLGLDIGAQGEPLGYRQDDPSYRSFHSGGYGQDALGMDPMMEHEMGGHHPGADYPVDGLPDLGHAQDLMDGLPPGDSNQLAWFDTDL,mutated_sequence,1.0,781.0,NP_001895.1.a2m,NP_001895.1.npy,ClinVar
+NP_001900.1,NP_001900.1.csv,MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAARL,mutated_sequence,1.0,412.0,NP_001900.1.a2m,NP_001900.1.npy,ClinVar
+NP_001918.3,NP_001918.3.csv,MSQAYSSSQRVSSYRRTFGGAPGFPLGSPLSSPVFPRAGFGSKGSSSSVTSRVYQVSRTSGGAGGLGSLRASRLGTTRTPSSYGAGELLDFSLADAVNQEFLTTRTNEKVELQELNDRFANYIEKVRFLEQQNAALAAEVNRLKGREPTRVAELYEEELRELRRQVEVLTNQRARVDVERDNLLDDLQRLKAKLQEEIQLKEEAENNLAAFRADVDAATLARIDLERRIESLNEEIAFLKKVHEEEIRELQAQLQEQQVQVEMDMSKPDLTAALRDIRAQYETIAAKNISEAEEWYKSKVSDLTQAANKNNDALRQAKQEMMEYRHQIQSYTCEIDALKGTNDSLMRQMRELEDRFASEASGYQDNIARLEEEIRHLKDEMARHLREYQDLLNVKMALDVEIATYRKLLEGEESRINLPIQTYSALNFRETSPEQRGSEVHTKKTVMIKTIETRDGEVVSEATQQQHEVL,mutated_sequence,1.0,470.0,NP_001918.3.a2m,NP_001918.3.npy,ClinVar
+NP_001922.2,NP_001922.2.csv,MWRVCARRAQNVAPWAGLEARWTALQEVPGTPRVTSRSGPAPARRNSVTTGYGGVRALCGWTPSSGATPRNRLLLQLLGSPGRRYYSLPPHQKVPLPSLSPTMQAGTIARWEKKEGDKINEGDLIAEVETDKATVGFESLEECYMAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYTLDSSAAPTPQAAPAPTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEKKVGEKLSEGDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADISAFADYRPTEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCPATPAGPKGRVFVSPLAKKLAVEKGIDLTQVKGTGPDGRITKKDIDSFVPSKVAPAPAAVVPPTGPGMAPVPTGVFTDIPISNIRRVIAQRLMQSKQTIPHYYLSIDVNMGEVLLVRKELNKILEGRSKISVNDFIIKASALACLKVPEANSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIANDVVSLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPADNEKGFDVASMMSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL,mutated_sequence,1.0,647.0,NP_001922.2.a2m,NP_001922.2.npy,ClinVar
+NP_001944.1,NP_001944.1.csv,MAALMTPGTGAPPAPGDFSGEGSQGLPDPSPEPKQLPELIRMKRDGGRLSEADIRGFVAAVVNGSAQGAQIGAMLMAIRLRGMDLEETSVLTQALAQSGQQLEWPEAWRQQLVDKHSTGGVGDKVSLVLAPALAACGCKVPMISGRGLGHTGGTLDKLESIPGFNVIQSPEQMQVLLDQAGCCIVGQSEQLVPADGILYAARDVTATVDSLPLITASILSKKLVEGLSALVVDVKFGGAAVFPNQEQARELAKTLVGVGASLGLRVAAALTAMDKPLGRCVGHALEVEEALLCMDGAGPPDLRDLVTTLGGALLWLSGHAGTQAQGAARVAAALDDGSALGRFERMLAAQGVDPGLARALCSGSPAERRQLLPRAREQEELLAPADGTVELVRALPLALVLHELGAGRSRAGEPLRLGVGAELLVDVGQRLRRGTPWLRVHRDGPALSGPQSRALQEALVLSDRAPFAAPSPFAELVLPPQQ,mutated_sequence,1.0,482.0,NP_001944.1.a2m,NP_001944.1.npy,ClinVar
+NP_001963.1,NP_001963.1.csv,MTLGRRLACLFLACVLPALLLGGTALASEIVGGRRARPHAWPFMVSLQLRGGHFCGATLIAPNFVMSAAHCVANVNVRAVRVVLGAHNLSRREPTRQVFAVQRIFENGYDPVNLLNDIVILQLNGSATINANVQVAQLPAQGRRLGNGVQCLAMGWGLLGRNRGIASVLQELNVTVVTSLCRRSNVCTLVRGRQAGVCFGDSGSPLVCNGLIHGIASFVRGGCASGLYPDAFAPVAQFVNWIDSIIQRSEDNPCPHPRDPDPASRTH,mutated_sequence,1.0,267.0,NP_001963.1.a2m,NP_001963.1.npy,ClinVar
+NP_001978.1,NP_001978.1.csv,MSETPAQCSIKQERISYTPPESPVPSYASSTPLHVPVPRALRMEEDSIRLPAHLRLQPIYWSRDDVAQWLKWAENEFSLRPIDSNTFEMNGKALLLLTKEDFRYRSPHSGDVLYELLQHILKQRKPRILFSPFFHPGNSIHTQPEVILHQNHEEDNCVQRTPRPSVDNVHHNPPTIELLHRSRSPITTNHRPSPDPEQRPLRSPLDNMIRRLSPAERAQGPRPHQENNHQESYPLSVSPMENNHCPASSESHPKPSSPRQESTRVIQLMPSPIMHPLILNPRHSVDFKQSRLSEDGLHREGKPINLSHREDLAYMNHIMVSVSPPEEHAMPIGRIADCRLLWDYVYQLLSDSRYENFIRWEDKESKIFRIVDPNGLARLWGNHKNRTNMTYEKMSRALRHYYKLNIIRKEPGQRLLFRFMKTPDEIMSGRTDRLEHLESQELDEQIYQEDEC,mutated_sequence,1.0,452.0,NP_001978.1.a2m,NP_001978.1.npy,ClinVar
+NP_001990.2,NP_001990.2.csv,MGRRRRLCLQLYFLWLGCVVLWAQGTAGQPQPPPPKPPRPQPPPQQVRSATAGSEGGFLAPEYREEGAAVASRVRRRGQQDVLRGPNVCGSRFHSYCCPGWKTLPGGNQCIVPICRNSCGDGFCSRPNMCTCSSGQISSTCGSKSIQQCSVRCMNGGTCADDHCQCQKGYIGTYCGQPVCENGCQNGGRCIGPNRCACVYGFTGPQCERDYRTGPCFTQVNNQMCQGQLTGIVCTKTLCCATIGRAWGHPCEMCPAQPQPCRRGFIPNIRTGACQDVDECQAIPGICQGGNCINTVGSFECRCPAGHKQSETTQKCEDIDECSIIPGICETGECSNTVGSYFCVCPRGYVTSTDGSRCIDQRTGMCFSGLVNGRCAQELPGRMTKMQCCCEPGRCWGIGTIPEACPVRGSEEYRRLCMDGLPMGGIPGSAGSRPGGTGGNGFAPSGNGNGYGPGGTGFIPIPGGNGFSPGVGGAGVGAGGQGPIITGLTILNQTIDICKHHANLCLNGRCIPTVSSYRCECNMGYKQDANGDCIDVDECTSNPCTNGDCVNTPGSYYCKCHAGFQRTPTKQACIDIDECIQNGVLCKNGRCVNTDGSFQCICNAGFELTTDGKNCVDHDECTTTNMCLNGMCINEDGSFKCICKPGFVLAPNGRYCTDVDECQTPGICMNGHCINSEGSFRCDCPPGLAVGMDGRVCVDTHMRSTCYGGIKKGVCVRPFPGAVTKSECCCANPDYGFGEPCQPCPAKNSAEFHGLCSSGVGITVDGRDINECALDPDICANGICENLRGSYRCNCNSGYEPDASGRNCIDIDECLVNRLLCDNGLCRNTPGSYSCTCPPGYVFRTETETCEDINECESNPCVNGACRNNLGSFNCECSPGSKLSSTGLICIDSLKGTCWLNIQDSRCEVNINGATLKSECCATLGAAWGSPCERCELDTACPRGLARIKGVTCEDVNECEVFPGVCPNGRCVNSKGSFHCECPEGLTLDGTGRVCLDIRMEQCYLKWDEDECIHPVPGKFRMDACCCAVGAAWGTECEECPKPGTKEYETLCPRGAGFANRGDVLTGRPFYKDINECKAFPGMCTYGKCRNTIGSFKCRCNSGFALDMEERNCTDIDECRISPDLCGSGICVNTPGSFECECFEGYESGFMMMKNCMDIDECERNPLLCRGGTCVNTEGSFQCDCPLGHELSPSREDCVDINECSLSDNLCRNGKCVNMIGTYQCSCNPGYQATPDRQGCTDIDECMIMNGGCDTQCTNSEGSYECSCSEGYALMPDGRSCADIDECENNPDICDGGQCTNIPGEYRCLCYDGFMASMDMKTCIDVNECDLNSNICMFGECENTKGSFICHCQLGYSVKKGTTGCTDVDECEIGAHNCDMHASCLNIPGSFKCSCREGWIGNGIKCIDLDECSNGTHQCSINAQCVNTPGSYRCACSEGFTGDGFTCSDVDECAENINLCENGQCLNVPGAYRCECEMGFTPASDSRSCQDIDECSFQNICVFGTCNNLPGMFHCICDDGYELDRTGGNCTDIDECADPINCVNGLCVNTPGRYECNCPPDFQLNPTGVGCVDNRVGNCYLKFGPRGDGSLSCNTEIGVGVSRSSCCCSLGKAWGNPCETCPPVNSTEYYTLCPGGEGFRPNPITIILEDIDECQELPGLCQGGNCINTFGSFQCECPQGYYLSEDTRICEDIDECFAHPGVCGPGTCYNTLGNYTCICPPEYMQVNGGHNCMDMRKSFCYRSYNGTTCENELPFNVTKRMCCCTYNVGKAWNKPCEPCPTPGTADFKTICGNIPGFTFDIHTGKAVDIDECKEIPGICANGVCINQIGSFRCECPTGFSYNDLLLVCEDIDECSNGDNLCQRNADCINSPGSYRCECAAGFKLSPNGACVDRNECLEIPNVCSHGLCVDLQGSYQCICHNGFKASQDQTMCMDVDECERHPCGNGTCKNTVGSYNCLCYPGFELTHNNDCLDIDECSSFFGQVCRNGRCFNEIGSFKCLCNEGYELTPDGKNCIDTNECVALPGSCSPGTCQNLEGSFRCICPPGYEVKSENCIDINECDEDPNICLFGSCTNTPGGFQCLCPPGFVLSDNGRRCFDTRQSFCFTNFENGKCSVPKAFNTTKAKCCCSKMPGEGWGDPCELCPKDDEVAFQDLCPYGHGTVPSLHDTREDVNECLESPGICSNGQCINTDGSFRCECPMGYNLDYTGVRCVDTDECSIGNPCGNGTCTNVIGSFECNCNEGFEPGPMMNCEDINECAQNPLLCAFRCMNTFGSYECTCPIGYALREDQKMCKDLDECAEGLHDCESRGMMCKNLIGTFMCICPPGMARRPDGEGCVDENECRTKPGICENGRCVNIIGSYRCECNEGFQSSSSGTECLDNRQGLCFAEVLQTICQMASSSRNLVTKSECCCDGGRGWGHQCELCPLPGTAQYKKICPHGPGYTTDGRDIDECKVMPNLCTNGQCINTMGSFRCFCKVGYTTDISGTSCIDLDECSQSPKPCNYICKNTEGSYQCSCPRGYVLQEDGKTCKDLDECQTKQHNCQFLCVNTLGGFTCKCPPGFTQHHTACIDNNECGSQPSLCGAKGICQNTPGSFSCECQRGFSLDATGLNCEDVDECDGNHRCQHGCQNILGGYRCGCPQGYIQHYQWNQCVDENECSNPNACGSASCYNTLGSYKCACPSGFSFDQFSSACHDVNECSSSKNPCNYGCSNTEGGYLCGCPPGYYRVGQGHCVSGMGFNKGQYLSLDTEVDEENALSPEACYECKINGYSKKDSRQKRSIHEPDPTAVEQISLESVDMDSPVNMKFNLSHLGSKEHILELRPAIQPLNNHIRYVISQGNDDSVFRIHQRNGLSYLHTAKKKLMPGTYTLEITSIPLYKKKELKKLEESNEDDYLLGELGEALRMRLQIQLY,mutated_sequence,1.0,2912.0,NP_001990.2.a2m,NP_001990.2.npy,ClinVar
+NP_002026.1,NP_002026.1.csv,MLLLAAAFLVAFVLLLYMVSPLISPKPLALPGAHVVVTGGSSGIGKCIAIECYKQGAFITLVARNEDKLLQAKKEIEMHSINDKQVVLCISVDVSQDYNQVENVIKQAQEKLGPVDMLVNCAGMAVSGKFEDLEVSTFERLMSINYLGSVYPSRAVITTMKERRVGRIVFVSSQAGQLGLFGFTAYSASKFAIRGLAEALQMEVKPYNVYITVAYPPDTDTPGFAEENRTKPLETRLISETTSVCKPEQVAKQIVKDAIQGNFNSSLGSDGYMLSALTCGMAPVTSITEGLQQVVTMGLFRTIALFYLGSFDSIVRRCMMQREKSENADKTA,mutated_sequence,1.0,332.0,NP_002026.1.a2m,NP_002026.1.npy,ClinVar
+NP_002058.2,NP_002058.2.csv,MTLESMMACCLSDEVKESKRINAEIEKQLRRDKRDARRELKLLLLGTGESGKSTFIKQMRIIHGAGYSEEDKRGFTKLVYQNIFTAMQAMIRAMETLKILYKYEQNKANALLIREVDVEKVTTFEHQYVSAIKTLWEDPGIQECYDRRREYQLSDSAKYYLTDVDRIATLGYLPTQQDVLRVRVPTTGIIEYPFDLENIIFRMVDVGGQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKDLLEDKILYSHLVDYFPEFDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCATDTENIRFVFAAVKDTILQLNLKEYNLV,mutated_sequence,1.0,359.0,NP_002058.2.a2m,NP_002058.2.npy,ClinVar
+NP_002060.4,NP_002060.4.csv,MGCTLSAEDKAAVERSKMIDRNLREDGEKAAREVKLLLLGAGESGKSTIVKQMKIIHEAGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTTGIVETHFTFKDLHFKMFDVGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHFTCATDTKNVQFVFDAVTDVIIKNNLKDCGLF,mutated_sequence,1.0,354.0,NP_002060.4.a2m,NP_002060.4.npy,ClinVar
+NP_002065.1,NP_002065.1.csv,MSELDQLRQEAEQLKNQIRDARKACADATLSQITNNIDPVGRIQMRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSYTTNKVHAIPLRSSWVMTCAYAPSGNYVACGGLDNICSIYNLKTREGNVRVSRELAGHTGYLSCCRFLDDNQIVTSSGDTTCALWDIETGQQTTTFTGHTGDVMSLSLAPDTRLFVSGACDASAKLWDVREGMCRQTFTGHESDINAICFFPNGNAFATGSDDATCRLFDLRADQELMTYSHDNIICGITSVSFSKSGRLLLAGYDDFNCNVWDALKADRAGVLAGHDNRVSCLGVTDDGMAVATGSWDSFLKIWN,mutated_sequence,1.0,340.0,NP_002065.1.a2m,NP_002065.1.npy,ClinVar
+NP_002066.1,NP_002066.1.csv,MGEMEQLRQEAEQLKKQIADARKACADVTLAELVSGLEVVGRVQMRTRRTLRGHLAKIYAMHWATDSKLLVSASQDGKLIVWDSYTTNKVHAIPLRSSWVMTCAYAPSGNFVACGGLDNMCSIYNLKSREGNVKVSRELSAHTGYLSCCRFLDDNNIVTSSGDTTCALWDIETGQQKTVFVGHTGDCMSLAVSPDFNLFISGACDASAKLWDVREGTCRQTFTGHESDINAICFFPNGEAICTGSDDASCRLFDLRADQELICFSHESIICGITSVAFSLSGRLLFAGYDDFNCNVWDSMKSERVGILSGHDNRVSCLGVTADGMAVATGSWDSFLKIWN,mutated_sequence,1.0,340.0,NP_002066.1.a2m,NP_002066.1.npy,ClinVar
+NP_002070.1,NP_002070.1.csv,MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKVNLGVGAYRTDDCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSPALKEKRVGGVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFKDIRSYRYWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMKHRFLFPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVVGKEPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTMRSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLTTKNLDYVATSIHEAVTKIQ,mutated_sequence,1.0,413.0,NP_002070.1.a2m,NP_002070.1.npy,ClinVar
+NP_002071.2,NP_002071.2.csv,MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKMNLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEVLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGYLAHAIHQVTK,mutated_sequence,1.0,430.0,NP_002071.2.a2m,NP_002071.2.npy,ClinVar
+NP_002100.2,NP_002100.2.csv,MAERAALEELVKLQGERVRGLKQQKASAELIEEEVAKLLKLKAQLGPDESKQKFVLKTPKGTRDYSPRQMAVREKVFDVIIRCFKRHGAEVIDTPVFELKETLMGKYGEDSKLIYDLKDQGGELLSLRYDLTVPFARYLAMNKLTNIKRYHIAKVYRRDNPAMTRGRYREFYQCDFDIAGNFDPMIPDAECLKIMCEILSSLQIGDFLVKVNDRRILDGMFAICGVSDSKFRTICSSVDKLDKVSWEEVKNEMVGEKGLAPEVADRIGDYVQQHGGVSLVEQLLQDPKLSQNKQALEGLGDLKLLFEYLTLFGIDDKISFDLSLARGLDYYTGVIYEAVLLQTPAQAGEEPLGVGSVAAGGRYDGLVGMFDPKGRKVPCVGLSIGVERIFSIVEQRLEALEEKIRTTETQVLVASAQKKLLEERLKLVSELWDAGIKAELLYKKNPKLLNQLQYCEEAGIPLVAIIGEQELKDGVIKLRSVTSREEVDVRREDLVEEIKRRTGQPLCIC,mutated_sequence,1.0,509.0,NP_002100.2.a2m,NP_002100.2.npy,ClinVar
+NP_002172.2,NP_002172.2.csv,MSPARLRPRLHFCLVLLLLLVVPAAWGCGPGRVVGSRRRPPRKLVPLAYKQFSPNVPEKTLGASGRYEGKIARSSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDRLNSLAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRNKYGLLARLAVEAGFDWVYYESKAHVHCSVKSEHSAAAKTGGCFPAGAQVRLESGARVALSAVRPGDRVLAMGEDGSPTFSDVLIFLDREPHRLRAFQVIETQDPPRRLALTPAHLLFTADNHTEPAARFRATFASHVQPGQYVLVAGVPGLQPARVAAVSTHVALGAYAPLTKHGTLVVEDVVASCFAAVADHHLAQLAFWPLRLFHSLAWGSWTPGEGVHWYPQLLYRLGRLLLEEGSFHPLGMSGAGS,mutated_sequence,1.0,411.0,NP_002172.2.a2m,NP_002172.2.npy,ClinVar
+NP_002221.1,NP_002221.1.csv,MEVMNLMEQPIKVTEWQQTYTYDSGIHSGANTCVPSVSSKGIMEEDEACGRQYTLKKTTTYTQGVPPSQGDLEYQMSTTARAKRVREAMCPGVSGEDSSLLLATQVEGQATNLQRLAEPSQLLKSAIVHLINYQDDAELATRALPELTKLLNDEDPVVVTKAAMIVNQLSKKEASRRALMGSPQLVAAVVRTMQNTSDLDTARCTTSILHNLSHHREGLLAIFKSGGIPALVRMLSSPVESVLFYAITTLHNLLLYQEGAKMAVRLADGLQKMVPLLNKNNPKFLAITTDCLQLLAYGNQESKLIILANGGPQALVQIMRNYSYEKLLWTTSRVLKVLSVCPSNKPAIVEAGGMQALGKHLTSNSPRLVQNCLWTLRNLSDVATKQEGLESVLKILVNQLSVDDVNVLTCATGTLSNLTCNNSKNKTLVTQNSGVEALIHAILRAGDKDDITEPAVCALRHLTSRHPEAEMAQNSVRLNYGIPAIVKLLNQPNQWPLVKATIGLIRNLALCPANHAPLQEAAVIPRLVQLLVKAHQDAQRHVAAGTQQPYTDGVRMEEIVEGCTGALHILARDPMNRMEIFRLNTIPLFVQLLYSSVENIQRVAAGVLCELAQDKEAADAIDAEGASAPLMELLHSRNEGTATYAAAVLFRISEDKNPDYRKRVSVELTNSLFKHDPAAWEAAQSMIPINEPYGDDMDATYRPMYSSDVPLDPLEMHMDMDGDYPIDTYSDGLRPPYPTADHMLA,mutated_sequence,1.0,745.0,NP_002221.1.a2m,NP_002221.1.npy,ClinVar
+NP_002231.1,NP_002231.1.csv,MAKLTESMTNVLEGDSMDQDVESPVAIHQPKLPKQARDDLPRHISRDRTKRKIQRYVRKDGKCNVHHGNVRETYRYLTDIFTTLVDLKWRFNLLIFVMVYTVTWLFFGMIWWLIAYIRGDMDHIEDPSWTPCVTNLNGFVSAFLFSIETETTIGYGYRVITDKCPEGIILLLIQSVLGSIVNAFMVGCMFVKISQPKKRAETLVFSTHAVISMRDGKLCLMFRVGDLRNSHIVEASIRAKLIKSKQTSEGEFIPLNQTDINVGYYTGDDRLFLVSPLIISHEINQQSPFWEISKAQLPKEELEIVVILEGMVEATGMTCQARSSYITSEILWGYRFTPVLTLEDGFYEVDYNSFHETYETSTPSLSAKELAELASRAELPLSWSVSSKLNQHAELETEEEEKNLEEQTERNGDVANLENESKV,mutated_sequence,1.0,423.0,NP_002231.1.a2m,NP_002231.1.npy,ClinVar
+NP_002263.3,NP_002263.3.csv,MIARQQCVRGGPRGFSCGSAIVGGGKRGAFSSVSMSGGAGRCSSGGFGSRSLYNLRGNKSISMSVAGSRQGACFGGAGGFGTGGFGGGFGGSFSGKGGPGFPVCPAGGIQEVTINQSLLTPLHVEIDPEIQKVRTEEREQIKLLNNKFASFIDKVQFLEQQNKVLETKWNLLQQQTTTTSSKNLEPLFETYLSVLRKQLDTLGNDKGRLQSELKTMQDSVEDFKTKYEEEINKRTAAENDFVVLKKDVDAAYLNKVELEAKVDSLNDEINFLKVLYDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVRAQYEEIAQRSKAEAEALYQTKVQQLQISVDQHGDNLKNTKSEIAELNRMIQRLRAEIENIKKQCQTLQVSVADAEQRGENALKDAHSKRVELEAALQQAKEELARMLREYQELMSVKLALDIEIATYRKLLEGEEYRMSGECQSAVSISVVSGSTSTGGISGGLGSGSGFGLSSGFGSGSGSGFGFGGSVSGSSSSKIISTTTLNKRR,mutated_sequence,1.0,520.0,NP_002263.3.a2m,NP_002263.3.npy,ClinVar
+NP_002372.1,NP_002372.1.csv,MPRPAPARRLPGLLLLLWPLLLLPSAAPDPVARPGFRRLETRGPGGSPGRRPSPAAPDGAPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSRIIDTLDIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVEAGAREPSSNIPKVAIIVTDGRPQDQVNEVAARAQASGIELYAVGVDRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQETFCALDPCVLGTHQCQHVCISDGEGKHHCECSQGYTLNADKKTCSALDRCALNTHGCEHICVNDRSGSYHCECYEGYTLNEDRKTCSAQDKCALGTHGCQHICVNDRTGSHHCECYEGYTLNADKKTCSVRDKCALGSHGCQHICVSDGAASYHCDCYPGYTLNEDKKTCSATEEARRLVSTEDACGCEATLAFQDKVSSYLQRLNTKLDDILEKLKINEYGQIHR,mutated_sequence,1.0,486.0,NP_002372.1.a2m,NP_002372.1.npy,ClinVar
+NP_002428.1,NP_002428.1.csv,MALWRAYQRALAAHPWKVQVLTAGSLMGLGDIISQQLVERRGLQEHQRGRTLTMVSLGCGFVGPVVGGWYKVLDRFIPGTTKVDALKKMLLDQGGFAPCFLGCFLPLVGALNGLSAQDNWAKLQRDYPDALITNYYLWPAVQLANFYLVPLHYRLAVVQCVAVIWNSYLSWKAHRL,mutated_sequence,1.0,176.0,NP_002428.1.a2m,NP_002428.1.npy,ClinVar
+NP_002440.2,NP_002440.2.csv,MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSLPFSVEALMSDKKPPKEASPLPAESASAGATLRPLLLSGHGAREAHSPGPLVKPFETASVKSENSEDGAAWMQEPGRYSPPPRHMSPTTCTLRKHKTNRKPRTPFTTSQLLALERKFRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKPMLPSSFSLPFPISSPLQAASIYGASYPFHRPVLPIPPVGLYATPVGYGMYHLS,mutated_sequence,1.0,267.0,NP_002440.2.a2m,NP_002440.2.npy,ClinVar
+NP_002445.2,NP_002445.2.csv,MRRFLLLYATQQGQAKAIAEEICEQAVVHGFSADLHCISESDKYDLKTETAPLVVVVSTTGTGDPPDTARKFVKEIQNQTLPVDFFAHLRYGLLGLGDSEYTYFCNGGKIIDKRLQELGARHFYDTGHADDCVGLELVVEPWIAGLWPALRKHFRSSRGQEEISGALPVASPASSRTDLVKSELLHIESQVELLRFDDSGRKDSEVLKQNAVNSNQSNVVIEDFESSLTRSVPPLSQASLNIPGLPPEYLQVHLQESLGQEESQVSVTSADPVFQVPISKAVQLTTNDAIKTTLLVELDISNTDFSYQPGDAFSVICPNSDSEVQSLLQRLQLEDKREHCVLLKIKADTKKKGATLPQHIPAGCSLQFIFTWCLEIRAIPKKAFLRALVDYTSDSAEKRRLQELCSKQGAADYSRFVRDACACLLDLLLAFPSCQPPLSLLLEHLPKLQPRPYSCASSSLFHPGKLHFVFNIVEFLSTATTEVLRKGVCTGWLALLVASVLQPNIHASHEDSGKALAPKISISPRTTNSFHLPDDPSIPIIMVGPGTGIAPFIGFLQHREKLQEQHPDGNFGAMWLFFGCRHKDRDYLFRKELRHFLKHGILTHLKVSFSRDAPVGEEEAPAKYVQDNIQLHGQQVARILLQENGHIYVCGDAKNMAKDVHDALVQIISKEVGVEKLEAMKTLATLKEEKRYLQDIWS,mutated_sequence,1.0,698.0,NP_002445.2.a2m,NP_002445.2.npy,ClinVar
+NP_002459.3,NP_002459.3.csv,MAAGGPGAGSAAPVSSTSSLPLAALNMRVRRRLSLFLNVRTQVAADWTALAEEMDFEYLEIRQLETQADPTGRLLDAWQGRPGASVGRLLELLTKLGRDDVLLELGPSIEEDCQKYILKQQQEEAEKPLQVAAVDSSVPRTAELAGITTLDDPLGHMPERFDAFICYCPSDIQFVQEMIRQLEQTNYRLKLCVSDRDVLPGTCVWSIASELIEKRCRRMVVVVSDDYLQSKECDFQTKFALSLSPGAHQKRLIPIKYKAMKKEFPSILRFITVCDYTNPCTKSWFWTRLAKALSLP,mutated_sequence,1.0,296.0,NP_002459.3.a2m,NP_002459.3.npy,ClinVar
+NP_002461.2,NP_002461.2.csv,MSSDTEMEVFGIAAPFLRKSEKERIEAQNQPFDAKTYCFVVDSKEEYAKGKIKSSQDGKVTVETEDNRTLVVKPEDVYAMNPPKFDRIEDMAMLTHLNEPAVLYNLKDRYTSWMIYTYSGLFCVTVNPYKWLPVYNPEVVEGYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAATGDLAKKKDSKMKGTLEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVTFQLKAERSYHIFYQILSNKKPELIELLLITTNPYDYPFISQGEILVASIDDAEELLATDSAIDILGFTPEEKSGLYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKTAYLMGLNSSDLLKALCFPRVKVGNEYVTKGQTVDQVHHAVNALSKSVYEKLFLWMVTRINQQLDTKLPRQHFIGVLDIAGFEIFEYNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLAACIELIEKPMGIFSILEEECMFPKATDTSFKNKLYDQHLGKSNNFQKPKVVKGRAEAHFSLIHYAGTVDYSVSGWLEKNKDPLNETVVGLYQKSSNRLLAHLYATFATADADSGKKKVAKKKGSSFQTVSALFRENLNKLMSNLRTTHPHFVRCIIPNETKTPGAMEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFKQRYRVLNASAIPEGQFIDSKKACEKLLASIDIDHTQYKFGHTKVFFKAGLLGTLEEMRDDRLAKLITRTQAVCRGFLMRVEFQKMVQRRESIFCIQYNIRSFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMKEEFQKTKDELAKSEAKRKELEEKLVTLVQEKNDLQLQVQAESENLLDAEERCDQLIKAKFQLEAKIKEVTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNLTEELSGLDETIAKLTREKKALQEAHQQALDDLQAEEDKVNSLNKTKSKLEQQVEDLESSLEQEKKLRVDLERNKRKLEGDLKLAQESILDLENDKQQLDERLKKKDFEYCQLQSKVEDEQTLGLQFQKKIKELQARIEELEEEIEAERATRAKTEKQRSDYARELEELSERLEEAGGVTSTQIELNKKREAEFLKLRRDLEEATLQHEAMVAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSEFKLEIDDLSSSMESVSKSKANLEKICRTLEDQLSEARGKNEEIQRSLSELTTQKSRLQTEAGELSRQLEEKESIVSQLSRSKQAFTQQTEELKRQLEEENKAKNALAHALQSSRHDCDLLREQYEEEQEGKAELQRALSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQDSEEQVEAVNAKCASLEKTKQRLQGEVEDLMVDVERANSLAAALDKKQRNFDKVLAEWKTKCEESQAELEASLKESRSLSTELFKLKNAYEEALDQLETVKRENKNLEQEIADLTEQIAENGKTIHELEKSRKQIELEKADIQLALEEAEAALEHEEAKILRIQLELTQVKSEIDRKIAEKDEEIEQLKRNYQRTVETMQSALDAEVRSRNEAIRLKKKMEGDLNEIEIQLSHANRQAAETLKHLRSVQGQLKDTQLHLDDALRGQEDLKEQLAIVERRANLLQAEVEELRATLEQTERARKLAEQELLDSNERVQLLHTQNTSLIHTKKKLETDLMQLQSEVEDASRDARNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNLEQTVKDLQHRLDEAEQLALKGGKKQIQKLETRIRELEFELEGEQKKNTESVKGLRKYERRVKELTYQSEEDRKNVLRLQDLVDKLQVKVKSYKRQAEEADEQANAHLTKFRKAQHELEEAEERADIAESQVNKLRAKTRDFTSSRMVVHESEE,mutated_sequence,1.0,1940.0,NP_002461.2.a2m,NP_002461.2.npy,ClinVar
+NP_002464.1,NP_002464.1.csv,MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGKTENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKEERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNTDQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGILDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDKSFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSETALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQRYEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMKVLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETADAMNREVSSLKNKLRRGDLPFVVPRRMARKGAGDGSDEEVDGKADGAEAKPAE,mutated_sequence,1.0,1960.0,NP_002464.1.a2m,NP_002464.1.npy,ClinVar
+NP_002476.2,NP_002476.2.csv,MWKLLPAAGPAGGEPYRLLTGVEYVVGRKNCAILIENDQSISRNHAVLTANFSVTNLSQTDEIPVLTLKDNSKYGTFVNEEKMQNGFSRTLKSGDGITFGVFGSKFRIEYEPLVACSSCLDVSGKTALNQAILQLGGFTVNNWTEECTHLVMVSVKVTIKTICALICGRPIVKPEYFTEFLKAVESKKQPPQIESFYPPLDEPSIGSKNVDLSGRQERKQIFKGKTFIFLNAKQHKKLSSAVVFGGGEARLITEENEEEHNFFLAPGTCVVDTGITNSQTLIPDCQKKWIQSIMDMLQRQGLRPIPEAEIGLAVIFMTTKNYCDPQGHPSTGLKTTTPGPSLSQGVSVDEKLMPSAPVNTTTYVADTESEQADTWDLSERPKEIKVSKMEQKFRMLSQDAPTVKESCKTSSNNNSMVSNTLAKMRIPNYQLSPTKLPSINKSKDRASQQQQTNSIRNYFQPSTKKRERDEENQEMSSCKSARIETSCSLLEQTQPATPSLWKNKEQHLSENEPVDTNSDNNLFTDTDLKSIVKNSASKSHAAEKLRSNKKREMDDVAIEDEVLEQLFKDTKPELEIDVKVQKQEEDVNVRKRPRMDIETNDTFSDEAVPESSKISQENEIGKKRELKEDSLWSAKEISNNDKLQDDSEMLPKKLLLTEFRSLVIKNSTSRNPSGINDDYGQLKNFKKFKKVTYPGAGKLPHIIGGSDLIAHHARKNTELEEWLRQEMEVQNQHAKEESLADDLFRYNPYLKRRR,mutated_sequence,1.0,754.0,NP_002476.2.a2m,NP_002476.2.npy,ClinVar
+NP_002523.2,NP_002523.2.csv,MAAAEGPVGDGELWQTWLPNHVVFLRLREGLKNQSPTEAEKPASSSLPSSPPPQLLTRNVVFGLGGELFLWDGEDSSFLVVRLRGPSGGGEEPALSQYQRLLCINPPLFEIYQVLLSPTQHHVALIGIKGLMVLELPKRWGKNSEFEGGKSTVNCSTTPVAERFFTSSTSLTLKHAAWYPSEILDPHVVLLTSDNVIRIYSLREPQTPTNVIILSEAEEESLVLNKGRAYTASLGETAVAFDFGPLAAVPKTLFGQNGKDEVVAYPLYILYENGETFLTYISLLHSPGNIGKLLGPLPMHPAAEDNYGYDACAVLCLPCVPNILVIATESGMLYHCVVLEGEEEDDHTSEKSWDSRIDLIPSLYVFECVELELALKLASGEDDPFDSDFSCPVKLHRDPKCPSRYHCTHEAGVHSVGLTWIHKLHKFLGSDEEDKDSLQELSTEQKCFVEHILCTKPLPCRQPAPIRGFWIVPDILGPTMICITSTYECLIWPLLSTVHPASPPLLCTREDVEVAESPLRVLAETPDSFEKHIRSILQRSVANPAFLKASEKDIAPPPEECLQLLSRATQVFREQYILKQDLAKEEIQRRVKLLCDQKKKQLEDLSYCREERKSLREMAERLADKYEEAKEKQEDIMNRMKKLLHSFHSELPVLSDSERDMKKELQLIPDQLRHLGNAIKQVTMKKDYQQQKMEKVLSLPKPTIILSAYQRKCIQSILKEEGEHIREMVKQINDIRNHVNF,mutated_sequence,1.0,741.0,NP_002523.2.a2m,NP_002523.2.npy,ClinVar
+NP_002537.3,NP_002537.3.csv,MNNLLCCALVFLDISIKWTTQETFPPKYLHYDEETSHQLLCDKCPPGTYLKQHCTAKWKTVCAPCPDHYYTDSWHTSDECLYCSPVCKELQYVKQECNRTHNRVCECKEGRYLEIEFCLKHRSCPPGFGVVQAGTPERNTVCKRCPDGFFSNETSSKAPCRKHTNCSVFGLLLTQKGNATHDNICSGNSESTQKCGIDVTLCEEAFFRFAVPTKFTPNWLSVLVDNLPGTKVNAESVERIKRQHSSQEQTFQLLKLWKHQNKDQDIVKKIIQDIDLCENSVQRHIGHANLTFEQLRSLMESLPGKKVGAEDIEKTIKACKPSDQILKLLSLWRIKNGDQDTLKGLMHALKHSKTYHFPKTVTQSLKKTIRFLHSFTMYKLYQKLFLEMIGNQVQSVKISCL,mutated_sequence,1.0,401.0,NP_002537.3.a2m,NP_002537.3.npy,ClinVar
+NP_002600.1,NP_002600.1.csv,MRLPGAMPALALKGELLLLSLLLLLEPQISQGLVVTPPGPELVLNVSSTFVLTCSGSAPVVWERMSQEPPQEMAKAQDGTFSSVLTLTNLTGLDTGEYFCTHNDSRGLETDERKRLYIFVPDPTVGFLPNDAEELFIFLTEITEITIPCRVTDPQLVVTLHEKKGDVALPVPYDHQRGFSGIFEDRSYICKTTIGDREVDSDAYYVYRLQVSSINVSVNAVQTVVRQGENITLMCIVIGNEVVNFEWTYPRKESGRLVEPVTDFLLDMPYHIRSILHIPSAELEDSGTYTCNVTESVNDHQDEKAINITVVESGYVRLLGEVGTLQFAELHRSRTLQVVFEAYPPPTVLWFKDNRTLGDSSAGEIALSTRNVSETRYVSELTLVRVKVAEAGHYTMRAFHEDAEVQLSFQLQINVPVRVLELSESHPDSGEQTVRCRGRGMPQPNIIWSACRDLKRCPRELPPTLLGNSSEEESQLETNVTYWEEEQEFEVVSTLRLQHVDRPLSVRCTLRNAVGQDTQEVIVVPHSLPFKVVVISAILALVVLTIISLIILIMLWQKKPRYEIRWKVIESVSSDGHEYIYVDPMQLPYDSTWELPRDQLVLGRTLGSGAFGQVVEATAHGLSHSQATMKVAVKMLKSTARSSEKQALMSELKIMSHLGPHLNVVNLLGACTKGGPIYIITEYCRYGDLVDYLHRNKHTFLQHHSDKRRPPSAELYSNALPVGLPLPSHVSLTGESDGGYMDMSKDESVDYVPMLDMKGDVKYADIESSNYMAPYDNYVPSAPERTCRATLINESPVLSYMDLVGFSYQVANGMEFLASKNCVHRDLAARNVLICEGKLVKICDFGLARDIMRDSNYISKGSTFLPLKWMAPESIFNSLYTTLSDVWSFGILLWEIFTLGGTPYPELPMNEQFYNAIKRGYRMAQPAHASDEIYEIMQKCWEEKFEIRPPFSQLVLLLERLLGEGYKKKYQQVDEEFLRSDHPAILRSQARLPGFHGLRSPLDTSSVLYTAVQPNEGDNDYIIPLPDPKPEVADEGPLEGSPSLASSTLNEVNTSSTISCDSPLEPQDEPEPEPQLELQVEPEPELEQLPDSGCPAPRAEAEDSFL,mutated_sequence,1.0,1106.0,NP_002600.1.a2m,NP_002600.1.npy,ClinVar
+NP_002606.3,NP_002606.3.csv,MQALVLLLCIGALLGHSSCQNPASPPEEGSPDPDSTGALVEEEDPFFKVPVNKLAAAVSNFGYDLYRVRSSTSPTTNVLLSPLSVATALSALSLGAEQRTESIIHRALYYDLISSPDIHGTYKELLDTVTAPQKNLKSASRIVFEKKLRIKSSFVAPLEKSYGTRPRVLTGNPRLDLQEINNWVQAQMKGKLARSTKEIPDEISILLLGVAHFKGQWVTKFDSRKTSLEDFYLDEERTVRVPMMSDPKAVLRYGLDSDLSCKIAQLPLTGSMSIIFFLPLKVTQNLTLIEESLTSEFIHDIDRELKTVQAVLTVPKLKLSYEGEVTKSLQEMKLQSLFDSPDFSKITGKPIKLTQVEHRAGFEWNEDGAGTTPSPGLQPAHLTFPLDYHLNQPFIFVLRDTDTGALLFIGKILDPRGP,mutated_sequence,1.0,418.0,NP_002606.3.a2m,NP_002606.3.npy,ClinVar
+NP_002626.1,NP_002626.1.csv,MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPRNLAAAAVEEYSCEFGSAKYYALCGFGGVLSCGLTHTAVVPLDLVKCRMQVDPQKYKGIFNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGEENTYLWRTSLYLAASASAEFFADIALAPMEAAKVRIQTQPGYANTLRDAAPKMYKEEGLKAFYKGVAPLWMRQIPYTMMKFACFERTVEALYKFVVPKPRSECSKPEQLVVTFVAGYIAGVFCAIVSHPADSVVSVLNKEKGSSASLVLKRLGFKGVWKGLFARIIMIGTLTALQWFIYDSVKVYFRLPRPPPPEMPESLKKKLGLTQ,mutated_sequence,1.0,361.0,NP_002626.1.a2m,NP_002626.1.npy,ClinVar
+NP_002632.1,NP_002632.1.csv,MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNEISETR,mutated_sequence,1.0,484.0,NP_002632.1.a2m,NP_002632.1.npy,ClinVar
+NP_002668.1,NP_002668.1.csv,MSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDIITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKETTIKRKLVNGKMVAECKMKGVVCTRIYEKV,mutated_sequence,1.0,132.0,NP_002668.1.a2m,NP_002668.1.npy,ClinVar
+NP_002682.2,NP_002682.2.csv,MDGKRRPGPGPGVPPKRARGGLWDDDDAPRPSQFEEDLALMEEMEAEHRLQEQEEEELQSVLEGVADGQVPPSAIDPRWLRPTPPALDPQTEPLIFQQLEIDHYVGPAQPVPGGPPPSRGSVPVLRAFGVTDEGFSVCCHIHGFAPYFYTPAPPGFGPEHMGDLQRELNLAISRDSRGGRELTGPAVLAVELCSRESMFGYHGHGPSPFLRITVALPRLVAPARRLLEQGIRVAGLGTPSFAPYEANVDFEIRFMVDTDIVGCNWLELPAGKYALRLKEKATQCQLEADVLWSDVVSHPPEGPWQRIAPLRVLSFDIECAGRKGIFPEPERDPVIQICSLGLRWGEPEPFLRLALTLRPCAPILGAKVQSYEKEEDLLQAWSTFIRIMDPDVITGYNIQNFDLPYLISRAQTLKVQTFPFLGRVAGLCSNIRDSSFQSKQTGRRDTKVVSMVGRVQMDMLQVLLREYKLRSYTLNAVSFHFLGEQKEDVQHSIITDLQNGNDQTRRRLAVYCLKDAYLPLRLLERLMVLVNAVEMARVTGVPLSYLLSRGQQVKVVSQLLRQAMHEGLLMPVVKSEGGEDYTGATVIEPLKGYYDVPIATLDFSSLYPSIMMAHNLCYTTLLRPGTAQKLGLTEDQFIRTPTGDEFVKTSVRKGLLPQILENLLSARKRAKAELAKETDPLRRQVLDGRQLALKVSANSVYGFTGAQVGKLPCLEISQSVTGFGRQMIEKTKQLVESKYTVENGYSTSAKVVYGDTDSVMCRFGVSSVAEAMALGREAADWVSGHFPSPIRLEFEKVYFPYLLISKKRYAGLLFSSRPDAHDRMDCKGLEAVRRDNCPLVANLVTASLRRLLIDRDPEGAVAHAQDVISDLLCNRIDISQLVITKELTRAASDYAGKQAHVELAERMRKRDPGSAPSLGDRVPYVIISAAKGVAAYMKSEDPLFVLEHSLPIDTQYYLEQQLAKPLLRIFEPILGEGRAEAVLLRGDHTRCKTVLTGKVGGLLAFAKRRNCCIGCRTVLSHQGAVCEFCQPRESELYQKEVSHLNALEERFSRLWTQCQRCQGSLHEDVICTSRDCPIFYMRKKVRKDLEDQEQLLRRFGPPGPEAW,mutated_sequence,1.0,1107.0,NP_002682.2.a2m,NP_002682.2.npy,ClinVar
+NP_002684.1,NP_002684.1.csv,MSRLLWRKVAGATVGPGPVPAPGRWVSSSVPASDPSDGQRRRQQQQQQQQQQQQQPQQPQVLSSEGGQLRHNPLDIQMLSRGLHEQIFGQGGEMPGEAAVRRSVEHLQKHGLWGQPAVPLPDVELRLPPLYGDNLDQHFRLLAQKQSLPYLEAANLLLQAQLPPKPPAWAWAEGWTRYGPEGEAVPVAIPEERALVFDVEVCLAEGTCPTLAVAISPSAWYSWCSQRLVEERYSWTSQLSPADLIPLEVPTGASSPTQRDWQEQLVVGHNVSFDRAHIREQYLIQGSRMRFLDTMSMHMAISGLSSFQRSLWIAAKQGKHKVQPPTKQGQKSQRKARRGPAISSWDWLDISSVNSLAEVHRLYVGGPPLEKEPRELFVKGTMKDIRENFQDLMQYCAQDVWATHEVFQQQLPLFLERCPHPVTLAGMLEMGVSYLPVNQNWERYLAEAQGTYEELQREMKKSLMDLANDACQLLSGERYKEDPWLWDLEWDLQEFKQKKAKKVKKEPATASKLPIEGAGAPGDPMDQEDLGPCSEEEEFQQDVMARACLQKLKGTTELLPKRPQHLPGHPGWYRKLCPRLDDPAWTPGPSLLSLQMRVTPKLMALTWDGFPLHYSERHGWGYLVPGRRDNLAKLPTGTTLESAGVVCPYRAIESLYRKHCLEQGKQQLMPQEAGLAEEFLLTDNSAIWQTVEELDYLEVEAEAKMENLRAAVPGQPLALTARGGPKDTQPSYHHGNGPYNDVDIPGCWFFKLPHKDGNSCNVGSPFAKDFLPKMEDGTLQAGPGGASGPRALEINKMISFWRNAHKRISSQMVVWLPRSALPRAVIRHPDYDEEGLYGAILPQVVTAGTITRRAVEPTWLTASNARPDRVGSELKAMVQAPPGYTLVGADVDSQELWIAAVLGDAHFAGMHGCTAFGWMTLQGRKSRGTDLHSKTATTVGISREHAKIFNYGRIYGAGQPFAERLLMQFNHRLTQQEAAEKAQQMYAATKGLRWYRLSDEGEWLVRELNLPVDRTEGGWISLQDLRKVQRETARKSQWKKWEVVAERAWKGGTESEMFNKLESIATSDIPRTPVLGCCISRALEPSAVQEEFMTSRVNWVVQSSAVDYLHLMLVAMKWLFEEFAIDGRFCISIHDEVRYLVREEDRYRAALALQITNLLTRCMFAYKLGLNDLPQSVAFFSAVDIDRCLRKEVTMDCKTPSNPTGMERRYGIPQGEALDIYQIIELTKGSLEKRSQPGP,mutated_sequence,1.0,1239.0,NP_002684.1.a2m,NP_002684.1.npy,ClinVar
+NP_002706.1,NP_002706.1.csv,MDEKVFTKELDQWIEQLNECKQLSESQVKSLCEKAKEILTKESNVQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHESRQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVDGQIFCLHGGLSPSIDTLDHIRALDRLQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGYTFGQDISETFNHANGLTLVSRAHQLVMEGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDTLKYSFLQFDPAPRRGEPHVTRRTPDYFL,mutated_sequence,1.0,309.0,NP_002706.1.a2m,NP_002706.1.npy,ClinVar
+NP_002721.1,NP_002721.1.csv,MGNAAAAKKGSEQESVKEFLAKAKEDFLKKWESPAQNTAHLDQFERIKTLGTGSFGRVMLVKHKETGNHYAMKILDKQKVVKLKQIEHTLNEKRILQAVNFPFLVKLEFSFKDNSNLYMVMEYVPGGEMFSHLRRIGRFSEPHARFYAAQIVLTFEYLHSLDLIYRDLKPENLLIDQQGYIQVTDFGFAKRVKGRTWTLCGTPEYLAPEIILSKGYNKAVDWWALGVLIYEMAAGYPPFFADQPIQIYEKIVSGKVRFPSHFSSDLKDLLRNLLQVDLTKRFGNLKNGVNDIKNHKWFATTDWIAIYQRKVEAPFIPKFKGPGDTSNFDDYEEEEIRVSINEKCGKEFSEF,mutated_sequence,1.0,351.0,NP_002721.1.a2m,NP_002721.1.npy,ClinVar
+NP_002730.1,NP_002730.1.csv,MAGLGPGVGDSEGGPRPLFCRKGALRQKVVHEVKSHKFTARFFKQPTFCSHCTDFIWGIGKQGLQCQVCSFVVHRRCHEFVTFECPGAGKGPQTDDPRNKHKFRLHSYSSPTFCDHCGSLLYGLVHQGMKCSCCEMNVHRRCVRSVPSLCGVDHTERRGRLQLEIRAPTADEIHVTVGEARNLIPMDPNGLSDPYVKLKLIPDPRNLTKQKTRTVKATLNPVWNETFVFNLKPGDVERRLSVEVWDWDRTSRNDFMGAMSFGVSELLKAPVDGWYKLLNQEEGEYYNVPVADADNCSLLQKFEACNYPLELYERVRMGPSSSPIPSPSPSPTDPKRCFFGASPGRLHISDFSFLMVLGKGSFGKVMLAERRGSDELYAIKILKKDVIVQDDDVDCTLVEKRVLALGGRGPGGRPHFLTQLHSTFQTPDRLYFVMEYVTGGDLMYHIQQLGKFKEPHAAFYAAEIAIGLFFLHNQGIIYRDLKLDNVMLDAEGHIKITDFGMCKENVFPGTTTRTFCGTPDYIAPEIIAYQPYGKSVDWWSFGVLLYEMLAGQPPFDGEDEEELFQAIMEQTVTYPKSLSREAVAICKGFLTKHPGKRLGSGPDGEPTIRAHGFFRWIDWERLERLEIPPPFRPRPCGRSGENFDKFFTRAAPALTPPDRLVLASIDQADFQGFTYVNPDFVHPDARSPTSPVPVPVM,mutated_sequence,1.0,697.0,NP_002730.1.a2m,NP_002730.1.npy,ClinVar
+NP_002736.3,NP_002736.3.csv,MAAAAAAGAGPEMVRGQVFDVGPRYTNLSYIGEGAYGMVCSAYDNVNKVRVAIKKISPFEHQTYCQRTLREIKILLRFRHENIIGINDIIRAPTIEQMKDVYIVQDLMETDLYKLLKTQHLSNDHICYFLYQILRGLKYIHSANVLHRDLKPSNLLLNTTCDLKICDFGLARVADPDHDHTGFLTEYVATRWYRAPEIMLNSKGYTKSIDIWSVGCILAEMLSNRPIFPGKHYLDQLNHILGILGSPSQEDLNCIINLKARNYLLSLPHKNKVPWNRLFPNADSKALDLLDKMLTFNPHKRIEVEQALAHPYLEQYYDPSDEPIAEAPFKFDMELDDLPKEKLKELIFEETARFQPGYRS,mutated_sequence,1.0,360.0,NP_002736.3.a2m,NP_002736.3.npy,ClinVar
+NP_002746.1,NP_002746.1.csv,MPKKKPTPIQLNPAPDGSAVNGTSSAETNLEALQKKLEELELDEQQRKRLEAFLTQKQKVGELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKKAGRIPEQILGKVSIAVIKGLTYLREKHKIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFVGTRSYMSPERLQGTHYSVQSDIWSMGLSLVEMAVGRYPIPPPDAKELELMFGCQVEGDAAETPPRPRTPGRPLSSYGMDSRPPMAIFELLDYIVNEPPPKLPSGVFSLEFQDFVNKCLIKNPAERADLKQLMVHAFIKRSDAEEVDFAGWLCSTIGLNQPSTPTHAAGV,mutated_sequence,1.0,393.0,NP_002746.1.a2m,NP_002746.1.npy,ClinVar
+NP_002769.1,NP_002769.1.csv,MYALFLLASLLGAALAGPVLGLKECTRGSAVWCQNVKTASDCGAVKHCLQTVWNKPTVKSLPCDICKDVVTAAGDMLKDNATEEEILVYLEKTCDWLPKPNMSASCKEIVDSYLPVILDIIKGEMSRPGEVCSALNLCESLQKHLAELNHQKQLESNKIPELDMTEVVAPFMANIPLLLYPQDGPRSKPQPKDNGDVCQDCIQMVTDIQTAVRTNSTFVQALVEHVKEECDRLGPGMADICKNYISQYSEIAIQMMMHMQPKEICALVGFCDEVKEMPMQTLVPAKVASKNVIPALELVEPIKKHEVPAKSDVYCEVCEFLVKEVTKLIDNNKTEKEILDAFDKMCSKLPKSLSEECQEVVDTYGSSILSILLEEVSPELVCSMLHLCSGTRLPALTVHVTQPKDGGFCEVCKKLVGYLDRNLEKNSTKQEILAALEKGCSFLPDPYQKQCDQFVAEYEPVLIEILVEVMDPSFVCLKIGACPSAHKPLLGTEKCIWGPSYWCQNTETAAQCNAVEHCKRHVWN,mutated_sequence,1.0,524.0,NP_002769.1.a2m,NP_002769.1.npy,ClinVar
+NP_002779.1,NP_002779.1.csv,MSSIGTGYDLSASTFSPDGRVFQVEYAMKAVENSSTAIGIRCKDGVVFGVEKLVLSKLYEEGSNKRLFNVDRHVGMAVAGLLADARSLADIAREEASNFRSNFGYNIPLKHLADRVAMYVHAYTLYSAVRPFGCSFMLGSYSVNDGAQLYMIDPSGVSYGYWGCAIGKARQAAKTEIEKLQMKEMTCRDIVKEVAKIIYIVHDEVKDKAFELELSWVGELTNGRHEIVPKDIREEAEKYAKESLKEEDESDDDNM,mutated_sequence,1.0,255.0,NP_002779.1.a2m,NP_002779.1.npy,ClinVar
+NP_002787.2,NP_002787.2.csv,MEAFLGSRSGLWAGGPAPGQFYRIPSTPDSFMDPASALYRGPITRTQNPMVTGTSVLGVKFEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDYADFQYLKQVLGQMVIDEELLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGYADGESFLGYVDMLGVAYEAPSLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCMRVLYYRDARSYNRFQIATVTEKGVEIEGPLSTETNWDIAHMISGFE,mutated_sequence,1.0,264.0,NP_002787.2.a2m,NP_002787.2.npy,ClinVar
+NP_002825.3,NP_002825.3.csv,MTSRRWFHPNITGVEAENLLLTRGVDGSFLARPSKSNPGDFTLSVRRNGAVTHIKIQNTGDYYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKYPLNCADPTSERWFHGHLSGKEAEKLLTEKGKHGSFLVRESQSHPGDFVLSVRTGDDKGESNDGKSKVTHVMIRCQELKYDVGGGERFDSLTDLVEHYKKNPMVETLGTVLQLKQPLNTTRINAAEIESRVRELSKLAETTDKVKQGFWEEFETLQQQECKLLYSRKEGQRQENKNKNRYKNILPFDHTRVVLHDGDPNEPVSDYINANIIMPEFETKCNNSKPKKSYIATQGCLQNTVNDFWRMVFQENSRVIVMTTKEVERGKSKCVKYWPDEYALKEYGVMRVRNVKESAAHDYTLRELKLSKVGQGNTERTVWQYHFRTWPDHGVPSDPGGVLDFLEEVHHKQESIMDAGPVVVHCSAGIGRTGTFIVIDILIDIIREKGVDCDIDVPKTIQMVRSQRSGMVQTEAQYRFIYMAVQHYIETLQRRIEEEQKSKRKGHEYTNIKYSLADQTSGDQSPLPPCTPTPPCAEMREDSARVYENVGLMQQQKSFR,mutated_sequence,1.0,593.0,NP_002825.3.a2m,NP_002825.3.npy,ClinVar
+NP_002829.3,NP_002829.3.csv,MTMYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTGLTTAKMPSVPLSSDPLPTHTTAFSPASTFERENDFSETTTSLSPDNTSTQVSPDSLDNASAFNTTGVSSVQTPHLPTHADSQTPSAGTDTQTFSGSAANAKLNPTPGSNAISDVPGERSTASTFPTDPVSPLTTTLSLAHHSSAALPARTSNTTITANTSDAYLNASETTTLSPSGSAVISTTTIATTPSKPTCDEKYANITVDYLYNKETKLFTAKLNVNENVECGNNTCTNNEVHNLTECKNASVSISHNSCTAPDKTLILDVPPGVEKFQLHDCTQVEKADTTICLKWKNIETFTCDTQNITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNNHKFTNASKIIKTDFGSPGEPQIIFCRSEAAHQGVITWNPPQRSFHNFTLCYIKETEKDCLNLDKNLIKYDLQNLKPYTKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTSDNSMHVKCRPPRDRNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKAYFHNGDYPGEPFILHHSTSYNSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLDEQQELVERDDEKQLMNVEPIHADILLETYKRKIADEGRLFLAEFQSIPRVFSKFPIKEARKPFNQNKNRYVDILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPRDETVDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHKRCPDYIIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFFSGPIVVHCSAGVGRTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYILIHQALVEYNQFGETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQHIGNQEENKSKNRNSNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYINASFIMSYWKPEVMIAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWGEGKQTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPKELISMIQVVKQKLPQKNSSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEEVVDIFQVVKALRKARPGMVSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNEVDKVKQDANCVNPLGAPEKLPEAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS,mutated_sequence,1.0,1306.0,NP_002829.3.a2m,NP_002829.3.npy,ClinVar
+NP_002878.2,NP_002878.2.csv,MDVLVSECSARLLQQEEEIKSLTAEIDRLKNCGCLGASPNLEQLQEENLKLKYRLNILRKSLQAERNKPTKNMINIISRLQEVFGHAIKAAYPDLENPPLLVTPSQQAKFGDYQCNSAMGISQMLKTKEQKVNPREIAENITKHLPDNECIEKVEIAGPGFINVHLRKDFVSEQLTSLLVNGVQLPALGENKKVIVDFSSPNIAKEMHVGHLRSTIIGESISRLFEFAGYDVLRLNHVGDWGTQFGMLIAHLQDKFPDYLTVSPPIGDLQVFYKESKKRFDTEEEFKKRAYQCVVLLQGKNPDITKAWKLICDVSRQELNKIYDALDVSLIERGESFYQDRMNDIVKEFEDRGFVQVDDGRKIVFVPGCSIPLTIVKSDGGYTYDTSDLAAIKQRLFEEKADMIIYVVDNGQSVHFQTIFAAAQMIGWYDPKVTRVFHAGFGVVLGEDKKKFKTRSGETVRLMDLLGEGLKRSMDKLKEKERDKVLTAEELNAAQTSVAYGCIKYADLSHNRLNDYIFSFDKMLDDRGNTAAYLLYAFTRIRSIARLANIDEEMLQKAARETKILLDHEKEWKLGRCILRFPEILQKILDDLFLHTLCDYIYELATAFTEFYDSCYCVEKDRQTGKILKVNMWRMLLCEAVAAVMAKGFDILGIKPVQRM,mutated_sequence,1.0,660.0,NP_002878.2.a2m,NP_002878.2.npy,ClinVar
+NP_002896.2,NP_002896.2.csv,MWLPLLLGALLWAVLWLLRDRQSLPASNAFVFITGCDSGFGRLLALQLDQRGFRVLASCLTPSGAEDLQRVASSRLHTTLLDITDPQSVQQAAKWVEMHVKEAGLFGLVNNAGVAGIIGPTPWLTRDDFQRVLNVNTMGPIGVTLALLPLLQQARGRVINITSVLGRLAANGGGYCVSKFGLEAFSDSLRRDVAHFGIRVSIVEPGFFRTPVTNLESLEKTLQACWARLPPATQAHYGGAFLTKYLKMQQRIMNLICDPDLTKVSRCLEHALTARHPRTRYSPGWDAKLLWLPASYLPASLVDAVLTWVLPKPAQAVY,mutated_sequence,1.0,318.0,NP_002896.2.a2m,NP_002896.2.npy,ClinVar
+NP_002991.2,NP_002991.2.csv,MAAVVALSLRRRLPATTLGGACLQASRGAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQTYEVDLNKCGPMVLDALIKIKNEVDSTLTFRRSCREGICGSCAMNINGGNTLACTRRIDTNLNKVSKIYPLPHMYVIKDLVPDLSNFYAQYKSIEPYLKKKDESQEGKQQYLQSIEEREKLDGLYECILCACCSTSCPSYWWNGDKYLGPAVLMQAYRWMIDSRDDFTEERLAKLQDPFSLYRCHTIMNCTRTCPKGLNPGKAIAEIKKMMATYKEKKASV,mutated_sequence,1.0,280.0,NP_002991.2.a2m,NP_002991.2.npy,ClinVar
+NP_002993.1,NP_002993.1.csv,MAVLWRLSAVCGALGGRALLLRTPVVRPAHISAFLQDRPIPEWCGVQHIHLSPSHHSGSKAASLHWTSERVVSVLLLGLLPAAYLNPCSAMDYSLAAALTLHGHWGLGQVVTDYVHGDALQKAAKAGLLALSALTFAGLCYFNYHDVGICKAVAMLWKL,mutated_sequence,1.0,159.0,NP_002993.1.a2m,NP_002993.1.npy,ClinVar
+NP_003033.3,NP_003033.3.csv,MATNGSKVADGQISTEVSEAPVANDKPKTLVVKVQKKAADLPDRDTWKGRFDFLMSCVGYAIGLGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLFLLECSLGQYTSIGGLGVWKLAPMFKGVGLAAAVLSFWLNIYYIVIISWAIYYLYNSFTTTLPWKQCDNPWNTDRCFSNYSMVNTTNMTSAVVEFWERNMHQMTDGLDKPGQIRWPLAITLAIAWILVYFCIWKGVGWTGKVVYFSATYPYIMLIILFFRGVTLPGAKEGILFYITPNFRKLSDSEVWLDAATQIFFSYGLGLGSLIALGSYNSFHNNVYRDSIIVCCINSCTSMFAGFVIFSIVGFMAHVTKRSIADVAASGPGLAFLAYPEAVTQLPISPLWAILFFSMLLMLGIDSQFCTVEGFITALVDEYPRLLRNRRELFIAAVCIISYLIGLSNITQGGIYVFKLFDYYSASGMSLLFLVFFECVSISWFYGVNRFYDNIQEMVGSRPCIWWKLCWSFFTPIIVAGVFIFSAVQMTPLTMGNYVFPKWGQGVGWLMALSSMVLIPGYMAYMFLTLKGSLKQRIQVMVQPSEDIVRPENGPEQPQAGSSTSKEAYI,mutated_sequence,1.0,599.0,NP_003033.3.a2m,NP_003033.3.npy,ClinVar
+NP_003051.1,NP_003051.1.csv,MRDYDEVTAFLGEWGPFQRLIFFLLSASIIPNGFTGLSSVFLIATPEHRCRVPDAANLSSAWRNHTVPLRLRDGREVPHSCRRYRLATIANFSALGLEPGRDVDLGQLEQESCLDGWEFSQDVYLSTIVTEWNLVCEDDWKAPLTISLFFVGVLLGSFISGQLSDRFGRKNVLFVTMGMQTGFSFLQIFSKNFEMFVVLFVLVGMGQISNYVAAFVLGTEILGKSVRIIFSTLGVCIFYAFGYMVLPLFAYFIRDWRMLLVALTMPGVLCVALWWFIPESPRWLISQGRFEEAEVIIRKAAKANGIVVPSTIFDPSELQDLSSKKQQSHNILDLLRTWNIRMVTIMSIMLWMTISVGYFGLSLDTPNLHGDIFVNCFLSAMVEVPAYVLAWLLLQYLPRRYSMATALFLGGSVLLFMQLVPPDLYYLATVLVMVGKFGVTAAFSMVYVYTAELYPTVVRNMGVGVSSTASRLGSILSPYFVYLGAYDRFLPYILMGSLTILTAILTLFLPESFGTPLPDTIDQMLRVKGMKHRKTPSHTRMLKDGQERPTILKSTAF,mutated_sequence,1.0,557.0,NP_003051.1.a2m,NP_003051.1.npy,ClinVar
+NP_003063.2,NP_003063.2.csv,MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQGPGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGYPSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQLRAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQKLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIGPDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEESGSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAARRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDESRHCSTGSGSASFAHTAPPPAGVNPDLEEPPLKEEDEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKAIEEGTLEEIEEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVIKYKDSSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESRSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED,mutated_sequence,1.0,1647.0,NP_003063.2.a2m,NP_003063.2.npy,ClinVar
+NP_003064.2,NP_003064.2.csv,MMMMALSKTFGQKPVKFQLEDDGEFYMIGSEVGNYLRMFRGSLYKRYPSLWRRLATVEERKKIVASSHGKKTKPNTKDHGYTTLATSVTLLKASEVEEILDGNDEKYKAVSISTEPPTYLREQKAKRNSQWVPTLPNSSHHLDAVPCSTTINRNRMGRDKKRTFPLCFDDHDPAVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDQRVIIKLNIHVGNISLVDQFEWDMSEKENSPEKFALKLCSELGLGGEFVTTIAYSIRGQLSWHQKTYAFSENPLPTVEIAIRNTGDADQWCPLLETLTDAEMEKKIRDQDRNTRRMRRLANTAPAW,mutated_sequence,1.0,385.0,NP_003064.2.a2m,NP_003064.2.npy,ClinVar
+NP_003065.3,NP_003065.3.csv,MAAAAGGGGPGTAVGATGSGIAAAAAGLAVYRRKDGGPATKFWESPETVSQLDSVRVWLGKHYKKYVHADAPTNKTLAGLVVQLLQFQEDAFGKHVTNPAFTKLPAKCFMDFKAGGALCHILGAAYKYKNEQGWRRFDLQNPSRMDRNVEMFMNIEKTLVQNNCLTRPNIYLIPDIDLKLANKLKDIIKRHQGTFTDEKSKASHHIYPYSSSQDDEEWLRPVMRKEKQVLVHWGFYPDSYDTWVHSNDVDAEIEDPPIPEKPWKVHVKWILDTDIFNEWMNEEDYEVDENRKPVSFRQRISTKNEEPVRSPERRDRKASANARKRKHSPSPPPPTPTESRKKSGKKGQASLYGKRRSQKEEDEQEDLTKDMEDPTPVPNIEEVVLPKNVNLKKDSENTPVKGGTVADLDEQDEETVTAGGKEDEDPAKGDQSRSVDLGEDNVTEQTNHIIIPSYASWFDYNCIHVIERRALPEFFNGKNKSKTPEIYLAYRNFMIDTYRLNPQEYLTSTACRRNLTGDVCAVMRVHAFLEQWGLVNYQVDPESRPMAMGPPPTPHFNVLADTPSGLVPLHLRSPQVPAAQQMLNFPEKNKEKPVDLQNFGLRTDIYSKKTLAKSKGASAGREWTEQETLLLLEALEMYKDDWNKVSEHVGSRTQDECILHFLRLPIEDPYLENSDASLGPLAYQPVPFSQSGNPVMSTVAFLASVVDPRVASAAAKAALEEFSRVREEVPLELVEAHVKKVQEAARASGKVDPTYGLESSCIAGTGPDEPEKLEGAEEEKMEADPDGQQPEKAENKVENETDEGDKAQDGENEKNSEKEQDSEVSEDTKSEEKETEENKELTDTCKERESDTGKKKVEHEISEGNVATAAAAALASAATKAKHLAAVEERKIKSLVALLVETQMKKLEIKLRHFEELETIMDREKEALEQQRQQLLTERQNFHMEQLKYAELRARQQMEQQQHGQNPQQAHQHSGGPGLAPLGAAGHPGMMPHQQPPPYPLMHHQMPPPHPPQPGQIPGPGSMMPGQHMPGRMIPTVAANIHPSGSGPTPPGMPPMPGNILGPRVPLTAPNGMYPPPPQQQPPPPPPADGVPPPPAPGPPASAAP,mutated_sequence,1.0,1105.0,NP_003065.3.a2m,NP_003065.3.npy,ClinVar
+NP_003110.1,NP_003110.1.csv,MAVLLLLLRALRRGPGPGPRPLWGPGPAWSPGFPARPGRGRPYMASRPPGDLAEAGGRALQSLQLRLLTPTFEGINGLLLKQHLVQNPVRLWQLLGGTFYFNTSRLKQKNKEKDKSKGKAPEEDEEERRRRERDDQMYRERLRTLLVIAVVMSLLNALSTSGGSISWNDFVHEMLAKGEVQRVQVVPESDVVEVYLHPGAVVFGRPRLALMYRMQVANIDKFEEKLRAAEDELNIEAKDRIPVSYKRTGFFGNALYSVGMTAVGLAILWYVFRLAGMTGREGGFSAFNQLKMARFTIVDGKMGKGVSFKDVAGMHEAKLEVREFVDYLKSPERFLQLGAKVPKGALLLGPPGCGKTLLAKAVATEAQVPFLAMAGPEFVEVIGGLGAARVRSLFKEARARAPCIVYIDEIDAVGKKRSTTMSGFSNTEEEQTLNQLLVEMDGMGTTDHVIVLASTNRADILDGALMRPGRLDRHVFIDLPTLQERREIFEQHLKSLKLTQSSTFYSQRLAELTPGFSGADIANICNEAALHAAREGHTSVHTLNFEYAVERVLAGTAKKSKILSKEEQKVVAFHESGHALVGWMLEHTEAVMKVSITPRTNAALGFAQMLPRDQHLFTKEQLFERMCMALGGRASEALSFNEVTSGAQDDLRKVTRIAYSMVKQFGMAPGIGPISFPEAQEGLMGIGRRPFSQGLQQMMDHEARLLVAKAYRHTEKVLQDNLDKLQALANALLEKEVINYEDIEALIGPPPHGPKKMIAPQRWIDAQREKQDLGEEETEETQQPPLGGEEPTWPK,mutated_sequence,1.0,795.0,NP_003110.1.a2m,NP_003110.1.npy,ClinVar
+NP_003117.2,NP_003117.2.csv,MEQFPKETVVESSGPKVLETAEEIQERRQEVLTRYQSFKERVAERGQKLEDSYHLQVFKRDADDLGKWIMEKVNILTDKSYEDPTNIQGKYQKHQSLEAEVQTKSRLMSELEKTREERFTMGHSAHEETKAHIEELRHLWDLLLELTLEKGDQLLRALKFQQYVQECADILEWIGDKEAIATSVELGEDWERTEVLHKKFEDFQVELVAKEGRVVEVNQYANECAEENHPDLPLIQSKQNEVNAAWERLRGLALQRQKALSNAANLQRFKRDVTEAIQWIKEKEPVLTSEDYGKDLVASEGLFHSHKGLERNLAVMSDKVKELCAKAEKLTLSHPSDAPQIQEMKEDLVSSWEHIRALATSRYEKLQATYWYHRFSSDFDELSGWMNEKTAAINADELPTDVAGGEVLLDRHQQHKHEIDSYDDRFQSADETGQDLVNANHEASDEVREKMEILDNNWTALLELWDERHRQYEQCLDFHLFYRDSEQVDSWMSRQEAFLENEDLGNSLGSAEALLQKHEDFEEAFTAQEEKIITVDKTATKLIGDDHYDSENIKAIRDGLLARRDALREKAATRRRLLKESLLLQKLYEDSDDLKNWINKKKKLADDEDYKDIQNLKSRVQKQQVFEKELAVNKTQLENIQKTGQEMIEGGHYASDNVTTRLSEVASLWEELLEATKQKGTQLHEANQQLQFENNAEDLQRWLEDVEWQVTSEDYGKGLAEVQNRLRKHGLLESAVAARQDQVDILTDLAAYFEEIGHPDSKDIRARQESLVCRFEALKEPLATRKKKLLDLLHLQLICRDTEDEEAWIQETEPSATSTYLGKDLIASKKLLNRHRVILENIASHEPRIQEITERGNKMVEEGHFAAEDVASRVKSLNQNMESLRARAARRQNDLEANVQFQQYLADLHEAETWIREKEPIVDNTNYGADEEAAGALLKKHEAFLLDLNSFGDSMKALRNQANACQQQQAAPVEGVAGEQRVMALYDFQARSPREVTMKKGDVLTLLSSINKDWWKVEAADHQGIVPAVYVRRLAHDEFPMLPQRRREEPGNITQRQEQIENQYRSLLDRAEERRRRLLQRYNEFLLAYEAGDMLEWIQEKKAENTGVELDDVWELQKKFDEFQKDLNTNEPRLRDINKVADDLLFEGLLTPEGAQIRQELNSRWGSLQRLADEQRQLLGSAHAVEVFHREADDTKEQIEKKCQALSAADPGSDLFSVQALQRRHEGFERDLVPLGDKVTILGETAERLSESHPDATEDLQRQKMELNEAWEDLQGRTKDRKESLNEAQKFYLFLSKARDLQNWISSIGGMVSSQELAEDLTGIEILLERHQEHRADMEAEAPTFQALEDFSAELIDSGHHASPEIEKKLQAVKLERDDLEKAWEKRKKILDQCLELQMFQGNCDQVESWMVARENSLRSDDKSSLDSLEALMKKRDDLDKAITAQEGKITDLEHFAESLIADEHYAKEEIATRLQRVLDRWKALKAQLIDERTKLGDYANLKQFYRDLEELEEWISEMLPTACDESYKDATNIQRKYLKHQTFAHEVDGRSEQVHGVINLGNSLIECSACDGNEEAMKEQLEQLKEHWDHLLERTNDKGKKLNEASRQQRFNTSIRDFEFWLSEAETLLAMKDQARDLASAGNLLKKHQLLEREMLAREDALKDLNTLAEDLLSSGTFNVDQIVKKKDNVNKRFLNVQELAAAHHEKLKEAYALFQFFQDLDDEESWIEEKLIRVSSQDYGRDLQGVQNLLKKHKRLEGELVAHEPAIQNVLDMAEKLKDKAAVGQEEIQLRLAQFVEHWEKLKELAKARGLKLEESLEYLQFMQNAEEEEAWINEKNALAVRGDCGDTLAATQSLLMKHEALENDFAVHETRVQNVCAQGEDILNKVLQEESQNKEISSKIEALNEKTPSLAKAIAAWKLQLEDDYAFQEFNWKADVVEAWIADKETSLKTNGNGADLGDFLTLLAKQDTLDASLQSFQQERLPEITDLKDKLISAQHNQSKAIEERYAALLKRWEQLLEASAVHRQKLLEKQLPLQKAEDLFVEFAHKASALNNWCEKMEENLSEPVHCVSLNEIRQLQKDHEDFLASLARAQADFKCLLELDQQIKALGVPSSPYTWLTVEVLERTWKHLSDIIEEREQELQKEEARQVKNFEMCQEFEQNASTFLQWILETRAYFLDGSLLKETGTLESQLEANKRKQKEIQAMKRQLTKIVDLGDNLEDALILDIKYSTIGLAQQWDQLYQLGLRMQHNLEQQIQAKDIKGVSEETLKEFSTIYKHFDENLTGRLTHKEFRSCLRGLNYYLPMVEEDEHEPKFEKFLDAVDPGRKGYVSLEDYTAFLIDKESENIKSSDEIENAFQALAEGKSYITKEDMKQALTPEQVSFCATHMQQYMDPRGRSHLSGYDYVGFTNSYFGN,mutated_sequence,1.0,2419.0,NP_003117.2.a2m,NP_003117.2.npy,ClinVar
+NP_003127.1,NP_003127.1.csv,MVLADLGRKITSALRSLSNATIINEEVLNAMLKEVCTALLEADVNIKLVKQLRENVKSAIDLEEMASGLNKRKMIQHAVFKELVKLVDPGVKAWTPTKGKQNVIMFVGLQGSGKTTTCSKLAYYYQRKGWKTCLICADTFRAGAFDQLKQNATKARIPFYGSYTEMDPVIIASEGVEKFKNENFEIIIVDTSGRHKQEDSLFEEMLQVANAIQPDNIVYVMDASIGQACEAQAKAFKDKVDVASVIVTKLDGHAKGGGALSAVAATKSPIIFIGTGEHIDDFEPFKTQPFISKLLGMGDIEGLIDKVNELKLDDNEALIEKLKHGQFTLRDMYEQFQNIMKMGPFSQILGMIPGFGTDFMSKGNEQESMARLKKLMTIMDSMNDQELDSTDGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKMGGIKGLFKGGDMSKNVSQSQMAKLNQQMAKMMDPRVLHHMGGMAGLQSMMRQFQQGAAGNMKGMMGFNNM,mutated_sequence,1.0,504.0,NP_003127.1.a2m,NP_003127.1.npy,ClinVar
+NP_003163.1,NP_003163.1.csv,MAAVAALQLGLRAAGLGRAPASAAWRSVLRVSPRPGVAWRPSRCGSSAAEASATKAEDDSFLQWVLLLIPVTAFGLGTWQVQRRKWKLNLIAELESRVLAEPVPLPADPMELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLISSSTQSGAYVVTPFHCTDLGVTILVNRGFVPRKKVNPETRQKGQIEGEVDLIGMVRLTETRQPFVPENNPERNHWHYRDLEAMARITGAEPIFIDANFQSTVPGGPIGGQTRVTLRNEHLQYIVTWYGLSAATSYLWFKKFLRGTPGV,mutated_sequence,1.0,300.0,NP_003163.1.a2m,NP_003163.1.npy,ClinVar
+NP_003273.1,NP_003273.1.csv,MGDEEKRNRAITARRQHLKSVMLQIAATELEKEESRREAEKQNYLAEHCPPLHIPGSMSEVQELCKQLHAKIDAAEEEKYDMEVRVQKTSKELEDMNQKLFDLRGKFKRPPLRRVRMSADAMLKALLGSKHKVCMDLRANLKQVKKEDTEKERDLRDVGDWRKNIEEKSGMEGRKKMFESES,mutated_sequence,1.0,182.0,NP_003273.1.a2m,NP_003273.1.npy,ClinVar
+NP_003280.2,NP_003280.2.csv,MDAIKKKMQMLKLDKENAIDRAEQAEADKKQAEDRCKQLEEEQQALQKKLKGTEDEVEKYSESVKEAQEKLEQAEKKATDAEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRAMKDEEKMELQEMQLKEAKHIAEDSDRKYEEVARKLVILEGELERSEERAEVAESKCGDLEEELKIVTNNLKSLEAQADKYSTKEDKYEEEIKLLEEKLKEAETRAEFAERSVAKLEKTIDDLEDEVYAQKMKYKAISEELDNALNDITSL,mutated_sequence,1.0,284.0,NP_003280.2.a2m,NP_003280.2.npy,ClinVar
+NP_003292.1,NP_003292.1.csv,MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENWFLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKESDHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS,mutated_sequence,1.0,398.0,NP_003292.1.a2m,NP_003292.1.npy,ClinVar
+NP_003313.3,NP_003313.3.csv,MPLRDETLREVWASDSGHEEESLSPEAPRRPKQRPAPAQRLRKKRTEAPESPCPTGSKPRKPGAGRTGRPREEPSPDPAQARAPQTVYARFLRDPEAKKRDPRETFLVARAPDAEDEEEEEEEDEEDEEEEAEEKKEKILLPPKKPLREKSSADLKERRAKAQGPRGDLGSPDPPPKPLRVRNKEAPAGEGTKMRKTKKKGSGEADKDPSGSPASARKSPAAMFLVGEGSPDKKALKKKGTPKGARKEEEEEEEAATVIKKSNQKGKAKGKGKKKAKEERAPSPPVEVDEPREFVLRPAPQGRTVRCRLTRDKKGMDRGMYPSYFLHLDTEKKVFLLAGRKRKRSKTANYLISIDPTNLSRGGENFIGKLRSNLLGNRFTVFDNGQNPQRGYSTNVASLRQELAAVIYETNVLGFRGPRRMTVIIPGMSAENERVPIRPRNASDGLLVRWQNKTLESLIELHNKPPVWNDDSGSYTLNFQGRVTQASVKNFQIVHADDPDYIVLQFGRVAEDAFTLDYRYPLCALQAFAIALSSFDGKLACE,mutated_sequence,1.0,542.0,NP_003313.3.a2m,NP_003313.3.npy,ClinVar
+NP_003327.2,NP_003327.2.csv,MSTPARRRLMRDFKRLQEDPPAGVSGAPSENNIMVWNAVIFGPEGTPFEDGTFKLTIEFTEEYPNKPPTVRFVSKMFHPNVYADGSICLDILQNRWSPTYDVSSILTSIQSLLDEPNPNSPANSQAAQLYQENKREYEKRVSAIVEQSWRDC,mutated_sequence,1.0,152.0,NP_003327.2.a2m,NP_003327.2.npy,ClinVar
+NP_003352.2,NP_003352.2.csv,MGQPSLTWMLMVVVASWFITTAATDTSEARWCSECHSNATCTEDEAVTTCTCQEGFTGDGLTCVDLDECAIPGAHNCSANSSCVNTPGSFSCVCPEGFRLSPGLGCTDVDECAEPGLSHCHALATCVNVVGSYLCVCPAGYRGDGWHCECSPGSCGPGLDCVPEGDALVCADPCQAHRTLDEYWRSTEYGEGYACDTDLRGWYRFVGQGGARMAETCVPVLRCNTAAPMWLNGTHPSSDEGIVSRKACAHWSGHCCLWDASVQVKACAGGYYVYNLTAPPECHLAYCTDPSSVEGTCEECSIDEDCKSNNGRWHCQCKQDFNITDISLLEHRLECGANDMKVSLGKCQLKSLGFDKVFMYLSDSRCSGFNDRDNRDWVSVVTPARDGPCGTVLTRNETHATYSNTLYLADEIIIRDLNIKINFACSYPLDMKVSLKTALQPMVSALNIRVGGTGMFTVRMALFQTPSYTQPYQGSSVTLSTEAFLYVGTMLDGGDLSRFALLMTNCYATPSSNATDPLKYFIIQDRCPHTRDSTIQVVENGESSQGRFSVQMFRFAGNYDLVYLHCEVYLCDTMNEKCKPTCSGTRFRSGSVIDQSRVLNLGPITRKGVQATVSRAFSSLGLLKVWLPLLLSATLTLTFQ,mutated_sequence,1.0,640.0,NP_003352.2.a2m,NP_003352.2.npy,ClinVar
+NP_003385.2,NP_003385.2.csv,MLEEPRPRPPPSGLAGLLFLALCSRALSNEILGLKLPGEPPLTANTVCLTLSGLSKRQLGLCLRNPDVTASALQGLHIAVHECQHQLRDQRWNCSALEGGGRLPHHSAILKRGFRESAFSFSMLAAGVMHAVATACSLGKLVSCGCGWKGSGEQDRLRAKLLQLQALSRGKSFPHSLPSPGPGSSPSPGPQDTWEWGGCNHDMDFGEKFSRDFLDSREAPRDIQARMRIHNNRVGRQVVTENLKRKCKCHGTSGSCQFKTCWRAAPEFRAVGAALRERLGRAIFIDTHNRNSGAFQPRLRPRRLSGELVYFEKSPDFCERDPTMGSPGTRGRACNKTSRLLDGCGSLCCGRGHNVLRQTRVERCHCRFHWCCYVLCDECKVTEWVNVCK,mutated_sequence,1.0,389.0,NP_003385.2.a2m,NP_003385.2.npy,ClinVar
+NP_003403.2,NP_003403.2.csv,MLLDAGPQYPAIGVTTFGASRHHSAGDVAERDVGLGINPFADGMGAFKLNPSSHELASAGQTAFTSQAPGYAAAAALGHHHHPGHVGSYSSAAFNSTRDFLFRNRGFGDAAAAASAQHSLFAASAGGFGGPHGHTDAAGHLLFPGLHEQAAGHASPNVVNGQMRLGFSGDMYPRPEQYGQVTSPRSEHYAAPQLHGYGPMNVNMAAHHGAGAFFRYMRQPIKQELICKWIEPEQLANPKKSCNKTFSTMHELVTHVTVEHVGGPEQSNHICFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFKCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSSQGSQPSPAASSGYESSTPPTIVSPSTDNPTTSSLSPSSSAVHHTAGHSALSSNFNEWYV,mutated_sequence,1.0,447.0,NP_003403.2.a2m,NP_003403.2.npy,ClinVar
+NP_003404.1,NP_003404.1.csv,MTMLLDGGPQFPGLGVGSFGAPRHHEMPNREPAGMGLNPFGDSTHAAAAAAAAAAFKLSPAAAHDLSSGQSSAFTPQGSGYANALGHHHHHHHHHHHTSQVPSYGGAASAAFNSTREFLFRQRSSGLSEAASGGGQHGLFAGSASSLHAPAGIPEPPSYLLFPGLHEQGAGHPSPTGHVDNNQVHLGLRGELFGRADPYRPVASPRTDPYAAGAQFPNYSPMNMNMGVNVAAHHGPGAFFRYMRQPIKQELSCKWIDEAQLSRPKKSCDRTFSTMHELVTHVTMEHVGGPEQNNHVCYWEECPREGKSFKAKYKLVNHIRVHTGEKPFPCPFPGCGKIFARSENLKIHKRTHTGEKPFKCEFEGCDRRFANSSDRKKHMHVHTSDKPYICKVCDKSYTHPSSLRKHMKVHESQGSDSSPAASSGYESSTPPAIASANSKDTTKTPSAVQTSTSHNPGLPPNFNEWYV,mutated_sequence,1.0,467.0,NP_003404.1.a2m,NP_003404.1.npy,ClinVar
+NP_003459.2,NP_003459.2.csv,MARPDPSAPPSLLLLLLAQLVGRAAAASKAPVCQEITVPMCRGIGYNLTHMPNQFNHDTQDEAGLEVHQFWPLVEIQCSPDLRFFLCSMYTPICLPDYHKPLPPCRSVCERAKAGCSPLMRQYGFAWPERMSCDRLPVLGRDAEVLCMDYNRSEATTAPPRPFPAKPTLPGPPGAPASGGECPAGGPFVCKCREPFVPILKESHPLYNKVRTGQVPNCAVPCYQPSFSADERTFATFWIGLWSVLCFISTSTTVATFLIDMERFRYPERPIIFLSACYLCVSLGFLVRLVVGHASVACSREHNHIHYETTGPALCTIVFLLVYFFGMASSIWWVILSLTWFLAAGMKWGNEAIAGYAQYFHLAAWLIPSVKSITALALSSVDGDPVAGICYVGNQNLNSLRGFVLGPLVLYLLVGTLFLLAGFVSLFRIRSVIKQGGTKTDKLEKLMIRIGIFTLLYTVPASIVVACYLYEQHYRESWEAALTCACPGHDTGQPRAKPEYWVLMLKYFMCLVVGITSGVWIWSGKTVESWRRFTSRCCCRPRRGHKSGGAMAAGDYPEASAALTGRTGPPGPAATYHKQVSLSHV,mutated_sequence,1.0,585.0,NP_003459.2.a2m,NP_003459.2.npy,ClinVar
+NP_003473.3,NP_003473.3.csv,MDSQKLAGEDKDSEPAADGPAASEDPSATESDLPNPHVGEVSVLSSGSPRLQETPQDCSGGPVRRCALCNCGEPSLHGQRELRRFELPFDWPRCPVVSPGGSPGPNEAVLPSEDLSQIGFPEGLTPAHLGEPGGSCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGASIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAYLEEARCAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQTIRSVAEQHTPVCSRFSPPEPGDTPTDEPDALYVACQGQPKGGHVTSMQPKEPGPLQCEAKPLGKAGVQLEPQLEAPLNEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPEELPASPLPEALHLSRPLEESPLSPPPEESPLSPPPESSPFSPLEESPLSPPEESPPSPALETPLSPPPEASPLSPPFEESPLSPPPEELPTSPPPEASRLSPPPEESPMSPPPEESPMSPPPEASRLFPPFEESPLSPPPEESPLSPPPEASRLSPPPEDSPMSPPPEESPMSPPPEVSRLSPLPVVSRLSPPPEESPLSPPPEESPTSPPPEASRLSPPPEDSPTSPPPEDSPASPPPEDSLMSLPLEESPLLPLPEEPQLCPRSEGPHLSPRPEEPHLSPRPEEPHLSPQAEEPHLSPQPEEPCLCAVPEEPHLSPQAEGPHLSPQPEELHLSPQTEEPHLSPVPEEPCLSPQPEESHLSPQSEEPCLSPRPEESHLSPELEKPPLSPRPEKPPEEPGQCPAPEELPLFPPPGEPSLSPLLGEPALSEPGEPPLSPLPEELPLSPSGEPSLSPQLMPPDPLPPPLSPIITAAAPPALSPLGELEYPFGAKGDSDPESPLAAPILETPISPPPEANCTDPEPVPPMILPPSPGSPVGPASPILMEPLPPQCSPLLQHSLVPQNSPPSQCSPPALPLSVPSPLSPIGKVVGVSDEAELHEMETEKVSEPECPALEPSATSPLPSPMGDLSCPAPSPAPALDDFSGLGEDTAPLDGIDAPGSQPEPGQTPGSLASELKGSPVLLDPEELAPVTPMEVYPECKQTAGQGSPCEEQEEPRAPVAPTPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDVSPARDEGSLRLCTDSLPETDDSLLCDAGTAISGGKAEGEKGRRRSSPARSRIKQGRSSSFPGRRRPRGGAHGGRGRGRARLKSTASSIETLVVADIDSSPSKEEEEEDDDTMQNTVVLFSNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFHCEWQNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQAADEGFDCVSCQPYVVKPVAPVAPPELVPMKVKEPEPQYFRFEGVWLTETGMALLRNLTMSPLHKRRQRRGRLGLPGEAGLEGSEPSDALGPDDKKDGDLDTDELLKGEGGVEHMECEIKLEGPVSPDVEPGKEETEESKKRKRKPYRPGIGGFMVRQRKSHTRTKKGPAAQAEVLSGDGQPDEVIPADLPAEGAVEQSLAEGDEKKKQQRRGRKKSKLEDMFPAYLQEAFFGKELLDLSRKALFAVGVGRPSFGLGTPKAKGDGGSERKELPTSQKGDDGPDIADEESRGLEGKADTPGPEDGGVKASPVPSDPEKPGTPGEGMLSSDLDRISTEELPKMESKDLQQLFKDVLGSEREQHLGCGTPGLEGSRTPLQRPFLQGGLPLGNLPSSSPMDSYPGLCQSPFLDSRERGGFFSPEPGEPDSPWTGSGGTTPSTPTTPTTEGEGDGLSYNQRSLQRWEKDEELGQLSTISPVLYANINFPNLKQDYPDWSSRCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINKQTKVGDIARKTDRPALHLRIPPQPGALGSPPPAAAPTIFIGSPTTPAGLSTSADGFLKPPAGSVPGPDSPGELFLKLPPQVPAQVPSQDPFGLAPAYPLEPRFPTAPPTYPPYPSPTGAPAQPPMLGASSRPGAGQPGEFHTTPPGTPRHQPSTPDPFLKPRCPSLDNLAVPESPGVGGGKASEPLLSPPPFGESRKALEVKKEELGASSPSYGPPNLGFVDSPSSGTHLGGLELKTPDVFKAPLTPRASQVEPQSPGLGLRPQEPPPAQALAPSPPSHPDIFRPGSYTDPYAQPPLTPRPQPPPPESCCALPPRSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDPYSRPPSRPQSRDPFAPLHKPPRPQPPEVAFKAGSLAHTSLGAGGFPAALPAGPAGELHAKVPSGQPPNFVRSPGTGAFVGTPSPMRFTFPQAVGEPSLKPPVPQPGLPPPHGINSHFGPGPTLGKPQSTNYTVATGNFHPSGSPLGPSSGSTGESYGLSPLRPPSVLPPPAPDGSLPYLSHGASQRSGITSPVEKREDPGTGMGSSLATAELPGTQDPGMSGLSQTELEKQRQRQRLRELLIRQQIQRNTLRQEKETAAAAAGAVGPPGSWGAEPSSPAFEQLSRGQTPFAGTQDKSSLVGLPPSKLSGPILGPGSFPSDDRLSRPPPPATPSSMDVNSRQLVGGSQAFYQRAPYPGSLPLQQQQQQLWQQQQATAATSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPGPGEPVPGPAGPAQFIELRHNVQKGLGPGGTPFPGQGPPQRPRFYPVSEDPHRLAPEGLRGLAVSGLPPQKPSAPPAPELNNSLHPTPHTKGPTLPTGLELVNRPPSSTELGRPNPLALEAGKLPCEDPELDDDFDAHKALEDDEELAHLGLGVDVAKGDDELGTLENLETNDPHLDDLLNGDEFDLLAYTDPELDTGDKKDIFNEHLRLVESANEKAEREALLRGVEPGPLGPEERPPPAADASEPRLASVLPEVKPKVEEGGRHPSPCQFTIATPKVEPAPAANSLGLGLKPGQSMMGSRDTRMGTGPFSSSGHTAEKASFGATGGPPAHLLTPSPLSGPGGSSLLEKFELESGALTLPGGPAASGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSAQLQPAQQQQQQQQQHSLLSAPGPAQAMSLPHEGSSPSLAGSQQQLSLGLAGARQPGLPQPLMPTQPPAHALQQRLAPSMAMVSNQGHMLSGQHGGQAGLVPQQSSQPVLSQKPMGTMPPSMCMKPQQLAMQQQLANSFFPDTDLDKFAAEDIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPSSDLQNHVAAGSGQERSAGDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKVLEEQIGVHRKSRKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHTNLMAEYRNKQQQQQQQQQQQQQQHSAVLALSPSQSPRLLTKLPGQLLPGHGLQPPQGPPGGQAGGLRLTPGGMALPGQPGGPFLNTALAQQQQQQHSGGAGSLAGPSGGFFPGNLALRSLGPDSRLLQERQLQLQQQRMQLAQKLQQQQQQQQQQQHLLGQVAIQQQQQQGPGVQTNQALGPKPQGLMPPSSHQGLLVQQLSPQPPQGPQGMLGPAQVAVLQQQHPGALGPQGPHRQVLMTQSRVLSSPQLAQQGQGLMGHRLVTAQQQQQQQQHQQQGSMAGLSHLQQSLMSHSGQPKLSAQPMGSLQQLQQQQQLQQQQQLQQQQQQQLQQQQQLQQQQLQQQQQQQQLQQQQQQQLQQQQQQLQQQQQQQQQQFQQQQQQQQMGLLNQSRTLLSPQQQQQQQVALGPGMPAKPLQHFSSPGALGPTLLLTGKEQNTVDPAVSSEATEGPSTHQGGPLAIGTTPESMATEPGEVKPSLSGDSQLLLVQPQPQPQPSSLQLQPPLRLPGQQQQQVSLLHTAGGGSHGQLGSGSSSEASSVPHLLAQPSVSLGDQPGSMTQNLLGPQQPMLERPMQNNTGPQPPKPGPVLQSGQGLPGVGIMPTVGQLRAQLQGVLAKNPQLRHLSPQQQQQLQALLMQRQLQQSQAVRQTPPYQEPGTQTSPLQGLLGCQPQLGGFPGPQTGPLQELGAGPRPQGPPRLPAPPGALSTGPVLGPVHPTPPPSSPQEPKRPSQLPSPSSQLPTEAQLPPTHPGTPKPQGPTLEPPPGRVSPAAAQLADTLFSKGLGPWDPPDNLAETQKPEQSSLVPGHLDQVNGQVVPEASQLSIKQEPREEPCALGAQSVKREANGEPIGAPGTSNHLLLAGPRSEAGHLLLQKLLRAKNVQLSTGRGSEGLRAEINGHIDSKLAGLEQKLQGTPSNKEDAAARKPLTPKPKRVQKASDRLVSSRKKLRKEDGVRASEALLKQLKQELSLLPLTEPAITANFSLFAPFGSGCPVNGQSQLRGAFGSGALPTGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPKDAASARDSERALRDTSEVKSLDLLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILGEEAPRFPHLGSGRWEQEDRALSPVIPLIPRASIPVFPDTKPYGALGLEVPGKLPVTTWEKGKGSEVSVMLTVSAAAAKNLNGVMVAVAELLSMKIPNSYEVLFPESPARAGTEPKKGEAEGPGGKEKGLEGKSPDTGPDWLKQFDAVLPGYTLKSQLDILSLLKQESPAPEPPTQHSYTYNVSNLDVRQLSAPPPEEPSPPPSPLAPSPASPPTEPLVELPTEPLAEPPVPSPLPLASSPESARPKPRARPPEEGEDSRPPRLKKWKGVRWKRLRLLLTIQKGSGRQEDEREVAEFMEQLGTALRPDKVPRDMRRCCFCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNVEVALHRGLLTKCSLCQRTGATSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMHKIKGPCEQELSSFAVFRRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEATRIYWSLRTNNRRCCYRCSIGENNGRPEFVIKVIEQGLEDLVFTDASPQAVWNRIIEPVAAMRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCQNYLFRYGRHPLMELPLMINPTGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPYSKQFVHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRKWMN,mutated_sequence,1.0,5537.0,NP_003473.3.a2m,NP_003473.3.npy,ClinVar
+NP_003551.2,NP_003551.2.csv,MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLVNPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSWSVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMDVTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLLLCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMARMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSKDNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHGVPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTHDHLLCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGMYFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFRNYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNPTLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKTVFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVNALWETEVYIYEHREEFQKLIQLLLSP,mutated_sequence,1.0,806.0,NP_003551.2.a2m,NP_003551.2.npy,ClinVar
+NP_003562.1,NP_003562.1.csv,MSERRVVVDLPTSASSSMPLQRRRASFRGPRSSSSLESPPASRTNAMSGLVRAPGVYVGTAPSGCIGGLGARVTRRALGISSVFLQGLRSSGLATVPAPGLERDHGAVEDLGGCLVEYMAKVHALEQVSQELETQLRMHLESKATRSGNWGALRASWASSCQQVGEAVLENARLMLQTETIQAGADDFKERYENEQPFRKAAEEEINSLYKVIDEANLTKMDLESQIESLKEELGSLSRNYEEDVKLLHKQLAGCELEQMDAPIGTGLDDILETIRIQWERDVEKNRVEAGALLQAKQQAEVAHMSQTQEEKLAAALRVELHNTSCQVQSLQAETESLRALKRGLENTLHDAKHWHDMELQNLGAVVGRLEAELREIRAEAEQQQQERAHLLARKCQLQKDVASYHALLDREESG,mutated_sequence,1.0,415.0,NP_003562.1.a2m,NP_003562.1.npy,ClinVar
+NP_003592.3,NP_003592.3.csv,MSSAAEPPPPPPPESAPSKPAASIASGGSNSSNKGGPEGVAAQAVASAASAGPADAEMEEIFDDASPGKQKEIQEPDPTYEEKMQTDRANRFEYLLKQTELFAHFIQPAAQKTPTSPLKMKPGRPRIKKDEKQNLLSVGDYRHRRTEQEEDEELLTESSKATNVCTRFEDSPSYVKWGKLRDYQVRGLNWLISLYENGINGILADEMGLGKTLQTISLLGYMKHYRNIPGPHMVLVPKSTLHNWMSEFKRWVPTLRSVCLIGDKEQRAAFVRDVLLPGEWDVCVTSYEMLIKEKSVFKKFNWRYLVIDEAHRIKNEKSKLSEIVREFKTTNRLLLTGTPLQNNLHELWSLLNFLLPDVFNSADDFDSWFDTNNCLGDQKLVERLHMVLRPFLLRRIKADVEKSLPPKKEVKIYVGLSKMQREWYTRILMKDIDILNSAGKMDKMRLLNILMQLRKCCNHPYLFDGAEPGPPYTTDMHLVTNSGKMVVLDKLLPKLKEQGSRVLIFSQMTRVLDILEDYCMWRNYEYCRLDGQTPHDERQDSINAYNEPNSTKFVFMLSTRAGGLGINLATADVVILYDSDWNPQVDLQAMDRAHRIGQTKTVRVFRFITDNTVEERIVERAEMKLRLDSIVIQQGRLVDQNLNKIGKDEMLQMIRHGATHVFASKESEITDEDIDGILERGAKKTAEMNEKLSKMGESSLRNFTMDTESSVYNFEGEDYREKQKIAFTEWIEPPKRERKANYAVDAYFREALRVSEPKAPKAPRPPKQPNVQDFQFFPPRLFELLEKEILFYRKTIGYKVPRNPELPNAAQAQKEEQLKIDEAESLNDEELEEKEKLLTQGFTNWNKRDFNQFIKANEKWGRDDIENIAREVEGKTPEEVIEYSAVFWERCNELQDIEKIMAQIERGEARIQRRISIKKALDTKIGRYKAPFHQLRISYGTNKGKNYTEEEDRFLICMLHKLGFDKENVYDELRQCIRNSPQFRFDWFLKSRTAMELQRRCNTLITLIERENMELEEKEKAEKKKRGPKPSTQKRKMDGAPDGRGRKKKLKL,mutated_sequence,1.0,1052.0,NP_003592.3.a2m,NP_003592.3.npy,ClinVar
+NP_003602.1,NP_003602.1.csv,MMAQSNMFTVADVLSQDELRKKLYQTFKDRGILDTLKTQLRNQLIHELMHPVLSGELQPRSISVEGSSLLIGASNSLVADHLQRCGYEYSLSVFFPESGLAKEKVFTMQDLLQLIKINPTSSLYKSLVSGSDKENQKGFLMHFLKELAEYHQAKESCNMETQTSSTFNRDSLAEKLQLIDDQFADAYPQRIKFESLEIKLNEYKREIEEQLRAEMCQKLKFFKDTEIAKIKMEAKKKYEKELTMFQNDFEKACQAKSEALVLREKSTLERIHKHQEIETKEIYAQRQLLLKDMDLLRGREAELKQRVEAFELNQKLQEEKHKSITEALRRQEQNIKSFEETYDRKLKNELLKYQLELKDDYIIRTNRLIEDERKNKEKAVHLQEELIAINSKKEELNQSVNRVKELELELESVKAQSLAITKQNHMLNEKVKEMSDYSLLKEEKLELLAQNKLLKQQLEESRNENLRLLNRLAQPAPELAVFQKELRKAEKAIVVEHEEFESCRQALHKQLQDEIEHSAQLKAQILGYKASVKSLTTQVADLKLQLKQTQTALENEVYCNPKQSVIDRSVNGLINGNVVPCNGEISGDFLNNPFKQENVLARMVASRITNYPTAWVEGSSPDSDLEFVANTKARVKELQQEAERLEKAFRSYHRRVIKNSAKSPLAAKSPPSLHLLEAFKNITSSSPERHIFGEDRVVSEQPQVGTLEERNDVVEALTGSAASRLRGGTSSRRLSSTPLPKAKRSLESEMYLEGLGRSHIASPSPCPDRMPLPSPTESRHSLSIPPVSSPPEQKVGLYRRQTELQDKSEFSDVDKLAFKDNEEFESSFESAGNMPRQLEMGGLSPAGDMSHVDAAAAAVPLSYQHPSVDQKQIEEQKEEEKIREQQVKERRQREERRQSNLQEVLERERRELEKLYQERKMIEESLKIKIKKELEMENELEMSNQEIKDKSAHSENPLEKYMKIIQQEQDQESADKSSKKMVQEGSLVDTLQSSDKVESLTGFSHEELDDSW,mutated_sequence,1.0,1012.0,NP_003602.1.a2m,NP_003602.1.npy,ClinVar
+NP_003713.3,NP_003713.3.csv,MNFETSRCATLQYCPDPYIQRFVETPAHFSWKESYYRSTMSQSTQTNEFLSPEVFQHIWDFLEQPICSVQPIDLNFVDEPSEDGATNKIEISMDCIRMQDSDLSDPMWPQYTNLGLLNSMDQQIQNGSSSTSPYNTDHAQNSVTAPSPYAQPSSTFDALSPSPAIPSNTDYPGPHSFDVSFQQSSTAKSATWTYSTELKKLYCQIAKTCPIQIKVMTPPPQGAVIRAMPVYKKAEHVTEVVKRCPNHELSREFNEGQIAPPSHLIRVEGNSHAQYVEDPITGRQSVLVPYEPPQVGTEFTTVLYNFMCNSSCVGGMNRRPILIIVTLETRDGQVLGRRCFEARICACPGRDRKADEDSIRKQQVSDSTKNGDGTKRPFRQNTHGIQMTSIKKRRSPDDELLYLPVRGRETYEMLLKIKESLELMQYLPQHTIETYRQQQQQQHQHLLQKQTSIQSPSSYGNSSPPLNKMNSMNKLPSVSQLINPQQRNALTPTTIPDGMGANIPMMGTHMPMAGDMNGLSPTQALPPPLSMPSTSHCTPPPPYPTDCSIVSFLARLGCSSCLDYFTTQGLTTIYQIEHYSMDDLASLKIPEQFRHAIWKGILDHRQLHEFSSPSHLLRTPSSASTVSVGSSETRGERVIDAVRFTLRQTISFPPRDEWNDFNFDMDARRNKQQRIKEEGE,mutated_sequence,1.0,680.0,NP_003713.3.a2m,NP_003713.3.npy,ClinVar
+NP_003733.2,NP_003733.2.csv,MSDSVILRSIKKFGEENDGFESDKSYNNDKKSRLQDEKKGDGVRVGFFQLFRFSSSTDIWLMFVGSLCAFLHGIAQPGVLLIFGTMTDVFIDYDVELQELQIPGKACVNNTIVWTNSSLNQNMTNGTRCGLLNIESEMIKFASYYAGIAVAVLITGYIQICFWVIAAARQIQKMRKFYFRRIMRMEIGWFDCNSVGELNTRFSDDINKINDAIADQMALFIQRMTSTICGFLLGFFRGWKLTLVIISVSPLIGIGAATIGLSVSKFTDYELKAYAKAGVVADEVISSMRTVAAFGGEKREVERYEKNLVFAQRWGIRKGIVMGFFTGFVWCLIFLCYALAFWYGSTLVLDEGEYTPGTLVQIFLSVIVGALNLGNASPCLEAFATGRAAATSIFETIDRKPIIDCMSEDGYKLDRIKGEIEFHNVTFHYPSRPEVKILNDLNMVIKPGEMTALVGPSGAGKSTALQLIQRFYDPCEGMVTVDGHDIRSLNIQWLRDQIGIVEQEPVLFSTTIAENIRYGREDATMEDIVQAAKEANAYNFIMDLPQQFDTLVGEGGGQMSGGQKQRVAIARALIRNPKILLLDMATSALDNESEAMVQEVLSKIQHGHTIISVAHRLSTVRAADTIIGFEHGTAVERGTHEELLERKGVYFTLVTLQSQGNQALNEEDIKDATEDDMLARTFSRGSYQDSLRASIRQRSKSQLSYLVHEPPLAVVDHKSTYEEDRKDKDIPVQEEVEPAPVRRILKFSAPEWPYMLVGSVGAAVNGTVTPLYAFLFSQILGTFSIPDKEEQRSQINGVCLLFVAMGCVSLFTQFLQGYAFAKSGELLTKRLRKFGFRAMLGQDIAWFDDLRNSPGALTTRLATDASQVQGAAGSQIGMIVNSFTNVTVAMIIAFSFSWKLSLVILCFFPFLALSGATQTRMLTGFASRDKQALEMVGQITNEALSNIRTVAGIGKERRFIEALETELEKPFKTAIQKANIYGFCFAFAQCIMFIANSASYRYGGYLISNEGLHFSYVFRVISAVVLSATALGRAFSYTPSYAKAKISAARFFQLLDRQPPISVYNTAGEKWDNFQGKIDFVDCKFTYPSRPDSQVLNGLSVSISPGQTLAFVGSSGCGKSTSIQLLERFYDPDQGKVMIDGHDSKKVNVQFLRSNIGIVSQEPVLFACSIMDNIKYGDNTKEIPMERVIAAAKQAQLHDFVMSLPEKYETNVGSQGSQLSRGEKQRIAIARAIVRDPKILLLDEATSALDTESEKTVQVALDKAREGRTCIVIAHRLSTIQNADIIAVMAQGVVIEKGTHEELMAQKGAYYKLVTTGSPIS,mutated_sequence,1.0,1321.0,NP_003733.2.a2m,NP_003733.2.npy,ClinVar
+NP_003784.2,NP_003784.2.csv,MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRAVLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD,mutated_sequence,1.0,484.0,NP_003784.2.a2m,NP_003784.2.npy,ClinVar
+NP_003892.2,NP_003892.2.csv,MPSTDLLMLKAFEPYLEILEVYSTKAKNYVNGHCTKYEPWQLIAWSVVWTLLIVWGYEFVFQPESLWSRFKKKCFKLTRKMPIIGRKIQDKLNKTKDDISKNMSFLKVDKEYVKALPSQGLSSSAVLEKLKEYSSMDAFWQEGRASGTVYSGEEKLTELLVKAYGDFAWSNPLHPDIFPGLRKIEAEIVRIACSLFNGGPDSCGCVTSGGTESILMACKAYRDLAFEKGIKTPEIVAPQSAHAAFNKAASYFGMKIVRVPLTKMMEVDVRAMRRAISRNTAMLVCSTPQFPHGVIDPVPEVAKLAVKYKIPLHVDACLGGFLIVFMEKAGYPLEHPFDFRVKGVTSISADTHKYGYAPKGSSLVLYSDKKYRNYQFFVDTDWQGGIYASPTIAGSRPGGISAACWAALMHFGENGYVEATKQIIKTARFLKSELENIKGIFVFGNPQLSVIALGSRDFDIYRLSNLMTAKGWNLNQLQFPPSIHFCITLLHARKRVAIQFLKDIRESVTQIMKNPKAKTTGMGAIYGMAQTTVDRNMVAELSSVFLDSLYSTDTVTQGSQMNGSPKPH,mutated_sequence,1.0,568.0,NP_003892.2.a2m,NP_003892.2.npy,ClinVar
+NP_003910.1,NP_003910.1.csv,MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVHVLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTAENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVLGDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVENPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGEYKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVHHSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNLPHQTQIPQQQTTGKWYP,mutated_sequence,1.0,437.0,NP_003910.1.a2m,NP_003910.1.npy,ClinVar
+NP_003912.1,NP_003912.1.csv,MEPTAPSLTEEDLTEVKKDALENLRVYLCEKIIAERHFDHLRAKKILSREDTEEISCRTSSRKRAGKLLDYLQENPKGLDTLVESIRREKTQNFLIQKITDEVLKLRNIKLEHLKGLKCSSCEPFPDGATNNLSRSNSDESNFSEKLRASTVMYHPEGESSTTPFFSTNSSLNLPVLEVGRTENTIFSSTTLPRPGDPGAPPLPPDLQLEEEGTCANSSEMFLPLRSRTVSRQ,mutated_sequence,1.0,233.0,NP_003912.1.a2m,NP_003912.1.npy,ClinVar
+NP_003928.1,NP_003928.1.csv,MEPSSLELPADTVQRIAAELKCHPTDERVALHLDEEDKLRHFRECFYIPKIQDLPPVDLSLVNKDENAIYFLGNSLGLQPKMVKTYLEEELDKWAKIAAYGHEVGKRPWITGDESIVGLMKDIVGANEKEIALMNALTVNLHLLMLSFFKPTPKRYKILLEAKAFPSDHYAIESQLQLHGLNIEESMRMIKPREGEETLRIEDILEVIEKEGDSIAVILFSGVHFYTGQHFNIPAITKAGQAKGCYVGFDLAHAVGNVELYLHDWGVDFACWCSYKYLNAGAGGIAGAFIHEKHAHTIKPALVGWFGHELSTRFKMDNKLQLIPGVCGFRISNPPILLVCSLHASLEIFKQATMKALRKKSVLLTGYLEYLIKHNYGKDKAATKKPVVNIITPSHVEERGCQLTITFSVPNKDVFQELEKRGVVCDKRNPNGIRVAPVPLYNSFHDVYKFTNLLTSILDSAETKN,mutated_sequence,1.0,465.0,NP_003928.1.a2m,NP_003928.1.npy,ClinVar
+NP_003968.3,NP_003968.3.csv,MADIIARLREDGIQKRVIQEGRGELPDFQDGTKATFHYRTLHSDDEGTVLDDSRARGKPMELIIGKKFKLPVWETIVCTMREGEIAQFLCDIKHVVLYPLVAKSLRNIAVGKDPLEGQRHCCGVAQMREHSSLGHADLDALQQNPQPLIFHMEMLKVESPGTYQQDPWAMTDEEKAKAVPLIHQEGNRLYREGHVKEAAAKYYDAIACLKNLQMKEQPGSPEWIQLDQQITPLLLNYCQCKLVVEEYYEVLDHCSSILNKYDDNVKAYFKRGKAHAAVWNAQEAQADFAKVLELDPALAPVVSRELRALEARIRQKDEEDKARFRGIFSH,mutated_sequence,1.0,330.0,NP_003968.3.a2m,NP_003968.3.npy,ClinVar
+NP_003973.3,NP_003973.3.csv,MVDSTEYEVASQPEVETSPLGDGASPGPEQVKLKKEISLLNGVCLIVGNMIGSGIFVSPKGVLIYSASFGLSLVIWAVGGLFSVFGALCYAELGTTIKKSGASYAYILEAFGGFLAFIRLWTSLLIIEPTSQAIIAITFANYMVQPLFPSCFAPYAASRLLAAACICLLTFINCAYVKWGTLVQDIFTYAKVLALIAVIVAGIVRLGQGASTHFENSFEGSSFAVGDIALALYSALFSYSGWDTLNYVTEEIKNPERNLPLSIGISMPIVTIIYILTNVAYYTVLDMRDILASDAVAVTFADQIFGIFNWIIPLSVALSCFGGLNASIVAASRLFFVGSREGHLPDAICMIHVERFTPVPSLLFNGIMALIYLCVEDIFQLINYYSFSYWFFVGLSIVGQLYLRWKEPDRPRPLKLSVFFPIVFCLCTIFLVAVPLYSDTINSLIGIAIALSGLPFYFLIIRVPEHKRPLYLRRIVGSATRYLQVLCMSVAAEMDLEDGGEMPKQRDPKSN,mutated_sequence,1.0,511.0,NP_003973.3.a2m,NP_003973.3.npy,ClinVar
+NP_003986.2,NP_003986.2.csv,MALPSLLLLVAALAGGVRPPGARNLTLAVVLPEHNLSYAWAWPRVGPAVALAVEALGRALPVDLRFVSSELEGACSEYLAPLSAVDLKLYHDPDLLLGPGCVYPAASVARFASHWRLPLLTAGAVASGFSAKNDHYRTLVRTGPSAPKLGEFVVTLHGHFNWTARAALLYLDARTDDRPHYFTIEGVFEALQGSNLSVQHQVYAREPGGPEQATHFIRANGRIVYICGPLEMLHEILLQAQRENLTNGDYVFFYLDVFGESLRAGPTRATGRPWQDNRTREQAQALREAFQTVLVITYREPPNPEYQEFQNRLLIRAREDFGVELGPSLMNLIAGCFYDGILLYAEVLNETIQEGGTREDGLRIVEKMQGRRYHGVTGLVVMDKNNDRETDFVLWAMGDLDSGDFQPAAHYSGAEKQIWWTGRPIPWVKGAPPSDNPPCAFDLDDPSCDKTPLSTLAIVALGTGITFIMFGVSSFLIFRKLMLEKELASMLWRIRWEELQFGNSERYHKGAGSRLTLSLRGSSYGSLMTAHGKYQIFANTGHFKGNVVAIKHVNKKRIELTRQVLFELKHMRDVQFNHLTRFIGACIDPPNICIVTEYCPRGSLQDILENDSINLDWMFRYSLINDLVKGMAFLHNSIISSHGSLKSSNCVVDSRFVLKITDYGLASFRSTAEPDDSHALYAKKLWTAPELLSGNPLPTTGMQKADVYSFGIILQEIALRSGPFYLEGLDLSPKEIVQKVRNGQRPYFRPSIDRTQLNEELVLLMERCWAQDPAERPDFGQIKGFIRRFNKEGGTSILDNLLLRMEQYANNLEKLVEERTQAYLEEKRKAEALLYQILPHSVAEQLKRGETVQAEAFDSVTIYFSDIVGFTALSAESTPMQVVTLLNDLYTCFDAIIDNFDVYKVETIGDAYMVVSGLPGRNGQRHAPEIARMALALLDAVSSFRIRHRPHDQLRLRIGVHTGPVCAGVVGLKMPRYCLFGDTVNTASRMESNGQALKIHVSSTTKDALDELGCFQLELRGDVEMKGKGKMRTYWLLGERKGPPGLL,mutated_sequence,1.0,1047.0,NP_003986.2.a2m,NP_003986.2.npy,ClinVar
+NP_003995.2,NP_003995.2.csv,MDWGTLQTILGGVNKHSTSIGKIWLTVLFIFRIMILVVAAKEVWGDEQADFVCNTLQPGCKNVCYDHYFPISHIRLWALQLIFVSTPALLVAMHVAYRRHEKKRKFIKGEIKSEFKDIEEIKTQKVRIEGSLWWTYTSSIFFRVIFEAAFMYVFYVMYDGFSMQRLVKCNAWPCPNTVDCFVSRPTEKTVFTVFMIAVSGICILLNVTELCYLLIRYCSGKSKKPV,mutated_sequence,1.0,226.0,NP_003995.2.a2m,NP_003995.2.npy,ClinVar
+NP_003997.2,NP_003997.2.csv,MLWWEEVEDCYEREDVQKKTFTKWVNAQFSKFGKQHIENLFSDLQDGRRLLDLLEGLTGQKLPKEKGSTRVHALNNVNKALRVLQNNNVDLVNIGSTDIVDGNHKLTLGLIWNIILHWQVKNVMKNIMAGLQQTNSEKILLSWVRQSTRNYPQVNVINFTTSWSDGLALNALIHSHRPDLFDWNSVVCQQSATQRLEHAFNIARYQLGIEKLLDPEDVDTTYPDKKSILMYITSLFQVLPQQVSIEAIQEVEMLPRPPKVTKEEHFQLHHQMHYSQQITVSLAQGYERTSSPKPRFKSYAYTQAAYVTTSDPTRSPFPSQHLEAPEDKSFGSSLMESEVNLDRYQTALEEVLSWLLSAEDTLQAQGEISNDVEVVKDQFHTHEGYMMDLTAHQGRVGNILQLGSKLIGTGKLSEDEETEVQEQMNLLNSRWECLRVASMEKQSNLHRVLMDLQNQKLKELNDWLTKTEERTRKMEEEPLGPDLEDLKRQVQQHKVLQEDLEQEQVRVNSLTHMVVVVDESSGDHATAALEEQLKVLGDRWANICRWTEDRWVLLQDILLKWQRLTEEQCLFSAWLSEKEDAVNKIHTTGFKDQNEMLSSLQKLAVLKADLEKKKQSMGKLYSLKQDLLSTLKNKSVTQKTEAWLDNFARCWDNLVQKLEKSTAQISQAVTTTQPSLTQTTVMETVTTVTTREQILVKHAQEELPPPPPQKKRQITVDSEIRKRLDVDITELHSWITRSEAVLQSPEFAIFRKEGNFSDLKEKVNAIEREKAEKFRKLQDASRSAQALVEQMVNEGVNADSIKQASEQLNSRWIEFCQLLSERLNWLEYQNNIIAFYNQLQQLEQMTTTAENWLKIQPTTPSEPTAIKSQLKICKDEVNRLSDLQPQIERLKIQSIALKEKGQGPMFLDADFVAFTNHFKQVFSDVQAREKELQTIFDTLPPMRYQETMSAIRTWVQQSETKLSIPQLSVTDYEIMEQRLGELQALQSSLQEQQSGLYYLSTTVKEMSKKAPSEISRKYQSEFEEIEGRWKKLSSQLVEHCQKLEEQMNKLRKIQNHIQTLKKWMAEVDVFLKEEWPALGDSEILKKQLKQCRLLVSDIQTIQPSLNSVNEGGQKIKNEAEPEFASRLETELKELNTQWDHMCQQVYARKEALKGGLEKTVSLQKDLSEMHEWMTQAEEEYLERDFEYKTPDELQKAVEEMKRAKEEAQQKEAKVKLLTESVNSVIAQAPPVAQEALKKELETLTTNYQWLCTRLNGKCKTLEEVWACWHELLSYLEKANKWLNEVEFKLKTTENIPGGAEEISEVLDSLENLMRHSEDNPNQIRILAQTLTDGGVMDELINEELETFNSRWRELHEEAVRRQKLLEQSIQSAQETEKSLHLIQESLTFIDKQLAAYIADKVDAAQMPQEAQKIQSDLTSHEISLEEMKKHNQGKEAAQRVLSQIDVAQKKLQDVSMKFRLFQKPANFEQRLQESKMILDEVKMHLPALETKSVEQEVVQSQLNHCVNLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKELDERVTALKLHYNELGAKVTERKQQLEKCLKLSRKMRKEMNVLTEWLAATDMELTKRSAVEGMPSNLDSEVAWGKATQKEIEKQKVHLKSITEVGEALKTVLGKKETLVEDKLSLLNSNWIAVTSRAEEWLNLLLEYQKHMETFDQNVDHITKWIIQADTLLDESEKKKPQQKEDVLKRLKAELNDIRPKVDSTRDQAANLMANRGDHCRKLVEPQISELNHRFAAISHRIKTGKASIPLKELEQFNSDIQKLLEPLEAEIQQGVNLKEEDFNKDMNEDNEGTVKELLQRGDNLQQRITDERKREEIKIKQQLLQTKHNALKDLRSQRRKKALEISHQWYQYKRQADDLLKCLDDIEKKLASLPEPRDERKIKEIDRELQKKKEELNAVRRQAEGLSEDGAAMAVEPTQIQLSKRWREIESKFAQFRRLNFAQIHTVREETMMVMTEDMPLEISYVPSTYLTEITHVSQALLEVEQLLNAPDLCAKDFEDLFKQEESLKNIKDSLQQSSGRIDIIHSKKTAALQSATPVERVKLQEALSQLDFQWEKVNKMYKDRQGRFDRSVEKWRRFHYDIKIFNQWLTEAEQFLRKTQIPENWEHAKYKWYLKELQDGIGQRQTVVRTLNATGEEIIQQSSKTDASILQEKLGSLNLRWQEVCKQLSDRKKRLEEQKNILSEFQRDLNEFVLWLEEADNIASIPLEPGKEQQLKEKLEQVKLLVEELPLRQGILKQLNETGGPVLVSAPISPEEQDKLENKLKQTNLQWIKVSRALPEKQGEIEAQIKDLGQLEKKLEDLEEQLNHLLLWLSPIRNQLEIYNQPNQEGPFDVKETEIAVQAKQPDVEEILSKGQHLYKEKPATQPVKRKLEDLSSEWKAVNRLLQELRAKQPDLAPGLTTIGASPTQTVTLVTQPVVTKETAISKLEMPSSLMLEVPALADFNRAWTELTDWLSLLDQVIKSQRVMVGDLEDINEMIIKQKATMQDLEQRRPQLEELITAAQNLKNKTSNQEARTIITDRIERIQNQWDEVQEHLQNRRQQLNEMLKDSTQWLEAKEEAEQVLGQARAKLESWKEGPYTVDAIQKKITETKQLAKDLRQWQTNVDVANDLALKLLRDYSADDTRKVHMITENINASWRSIHKRVSEREAALEETHRLLQQFPLDLEKFLAWLTEAETTANVLQDATRKERLLEDSKGVKELMKQWQDLQGEIEAHTDVYHNLDENSQKILRSLEGSDDAVLLQRRLDNMNFKWSELRKKSLNIRSHLEASSDQWKRLHLSLQELLVWLQLKDDELSRQAPIGGDFPAVQKQNDVHRAFKRELKTKEPVIMSTLETVRIFLTEQPLEGLEKLYQEPRELPPEERAQNVTRLLRKQAEEVNTEWEKLNLHSADWQRKIDETLERLRELQEATDELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVKALRGEIAPLKENVSHVNDLARQLTTLGIQLSPYNLSTLEDLNTRWKLLQVAVEDRVRQLHEAHRDFGPASQHFLSTSVQGPWERAISPNKVPYYINHETQTTCWDHPKMTELYQSLADLNNVRFSAYRTAMKLRRLQKALCLDLLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDTGRTGRIRVLSFKTGIISLCKAHLEDKYRYLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMRLEPQSMVWLPVLHRVAAAETAKHQAKCNICKECPIIGFRYRSLKHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTPTTSGEDVRDFAKVLKNKFRTKRYFAKHPRMGYLPVQTVLEGDNMETPVTLINFWPVDSAPASSPQLSHDDTHSRIEHYASRLAEMENSNGSYLNDSISPNESIDDEHLLIQHYCQSLNQDSPLSQPRSPAQILISLESEERGELERILADLEEENRNLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILEDHNKQLESQLHRLRQLLEQPQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRVVGSQTSDSMGEEDLLSPPQDTSTGLEEVMEQLNNSFPSSRGRNTPGKPMREDTM,mutated_sequence,1.0,3685.0,NP_003997.2.a2m,NP_003997.2.npy,ClinVar
+NP_004077.1,NP_004077.1.csv,MSAAWIPALGLGVCLLLLPGPAGSEGAAPIAITCFTRGLDIRKEKADVLCPGGCPLEEFSVYGNIVYASVSSICGAAVHRGVISNSGGPVRVYSLPGRENYSSVDANGIQSQMLSRWSASFTVTKGKSSTQEATGQAVSTAHPPTGKRLKKTPEKKTGNKDCKADIAFLIDGSFNIGQRRFNLQKNFVGKVALMLGIGTEGPHVGLVQASEHPKIEFYLKNFTSAKDVLFAIKEVGFRGGNSNTGKALKHTAQKFFTVDAGVRKGIPKVVVVFIDGWPSDDIEEAGIVAREFGVNVFIVSVAKPIPEELGMVQDVTFVDKAVCRNNGFFSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRESPNKNFLVIVTDGQSYDDVQGPAAAAHDAGITIFSVGVAWAPLDDLKDMASKPKESHAFFTREFTGLEPIVSDVIRGICRDFLESQQ,mutated_sequence,1.0,550.0,NP_004077.1.a2m,NP_004077.1.npy,ClinVar
+NP_004110.2,NP_004110.2.csv,MPALARDGGQLPLLVVFSAMIFGTITNQDLPVIKCVLINHKNNDSSVGKSSSYPMVSESPEDLGCALRPQSSGTVYEAAAVEVDVSASITLQVLVDAPGNISCLWVFKHSSLNCQPHFDLQNRGVVSMVILKMTETQAGEYLLFIQSEATNYTILFTVSIRNTLLYTLRRPYFRKMENQDALVCISESVPEPIVEWVLCDSQGESCKEESPAVVKKEEKVLHELFGTDIRCCARNELGRECTRLFTIDLNQTPQTTLPQLFLKVGEPLWIRCKAVHVNHGFGLTWELENKALEEGNYFEMSTYSTNRTMIRILFAFVSSVARNDTGYYTCSSSKHPSQSALVTIVEKGFINATNSSEDYEIDQYEEFCFSVRFKAYPQIRCTWTFSRKSFPCEQKGLDNGYSISKFCNHKHQPGEYIFHAENDDAQFTKMFTLNIRRKPQVLAEASASQASCFSDGYPLPSWTWKKCSDKSPNCTEEITEGVWNRKANRKVFGQWVSSSTLNMSEAIKGFLVKCCAYNSLGTSCETILLNSPGPFPFIQDNISFYATIGVCLLFIVVLTLLICHKYKKQFRYESQLQMVQVTGSSDNEYFYVDFREYEYDLKWEFPRENLEFGKVLGSGAFGKVMNATAYGISKTGVSIQVAVKMLKEKADSSEREALMSELKMMTQLGSHENIVNLLGACTLSGPIYLIFEYCCYGDLLNYLRSKREKFHRTWTEIFKEHNFSFYPTFQSHPNSSMPGSREVQIHPDSDQISGLHGNSFHSEDEIEYENQKRLEEEEDLNVLTFEDLLCFAYQVAKGMEFLEFKSCVHRDLAARNVLVTHGKVVKICDFGLARDIMSDSNYVVRGNARLPVKWMAPESLFEGIYTIKSDVWSYGILLWEIFSLGVNPYPGIPVDANFYKLIQNGFKMDQPFYATEEIYIIMQSCWAFDSRKRPSFPNLTSFLGCQLADAEEAMYQNVDGRVSECPHTYQNRRPFSREMDLGLLSPQAQVEDS,mutated_sequence,1.0,993.0,NP_004110.2.a2m,NP_004110.2.npy,ClinVar
+NP_004125.3,NP_004125.3.csv,MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAVVGIDLGTTNSCVAVMEGKQAKVLENAEGARTTPSVVAFTADGERLVGMPAKRQAVTNPNNTFYATKRLIGRRYDDPEVQKDIKNVPFKIVRASNGDAWVEAHGKLYSPSQIGAFVLMKMKETAENYLGHTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLRVINEPTAAALAYGLDKSEDKVIAVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLTKDNMALQRVREAAEKAKCELSSSVQTDINLPYLTMDSSGPKHLNMKLTRAQFEGIVTDLIRRTIAPCQKAMQDAEVSKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIGAAIQGGVLAGDVTDVLLLDVTPLSLGIETLGGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGEREMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDIDANGIVHVSAKDKGTGREQQIVIQSSGGLSKDDIENMVKNAEKYAEEDRRKKERVEAVNMAEGIIHDTETKMEEFKDQLPADECNKLKEEISKMRELLARKDSETGENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQKEEKQ,mutated_sequence,1.0,679.0,NP_004125.3.a2m,NP_004125.3.npy,ClinVar
+NP_004161.4,NP_004161.4.csv,MGKPARKGCEWKRFLKNNWVLLSTVAAVVLGITTGVLVREHSNLSTLEKFYFAFPGEILMRMLKLIILPLIISSMITGVAALDSNVSGKIGLRAVVYYFCTTLIAVILGIVLVVSIKPGVTQKVGEIARTGSTPEVSTVDAMLDLIRNMFPENLVQACFQQYKTKREEVKPPSDPEMNMTEESFTAVMTTAISKNKTKEYKIVGMYSDGINVLGLIVFCLVFGLVIGKMGEKGQILVDFFNALSDATMKIVQIIMCYMPLGILFLIAGKIIEVEDWEIFRKLGLYMATVLTGLAIHSIVILPLIYFIVVRKNPFRFAMGMAQALLTALMISSSSATLPVTFRCAEENNQVDKRITRFVLPVGATINMDGTALYEAVAAVFIAQLNDLDLGIGQIITISITATSASIGAAGVPQAGLVTMVIVLSAVGLPAEDVTLIIAVDWLLDRFRTMVNVLGDAFGTGIVEKLSKKELEQMDVSSEVNIVNPFALESTILDNEDSDTKKSYVNGGFAVDKSDTISFTQTSQF,mutated_sequence,1.0,524.0,NP_004161.4.a2m,NP_004161.4.npy,ClinVar
+NP_004165.2,NP_004165.2.csv,MWGLGARGPDRGLLLALALGGLARAGGVEVEPGGAHGESGGFQVVTFEWAHVQDPYVIALWILVASLAKIGFHLSHKVTSVVPESALLIVLGLVLGGIVWAADHIASFTLTPTVFFFYLLPPIVLDAGYFMPNRLFFGNLGTILLYAVVGTVWNAATTGLSLYGVFLSGLMGDLQIGLLDFLLFGSLMAAVDPVAVLAVFEEVHVNEVLFIIVFGESLLNDAVTVVLYNVFESFVALGGDNVTGVDCVKGIVSFFVVSLGGTLVGVVFAFLLSLVTRFTKHVRIIEPGFVFIISYLSYLTSEMLSLSAILAITFCGICCQKYVKANISEQSATTVRYTMKMLASSAETIIFMFLGISAVNPFIWTWNTAFVLLTLVFISVYRAIGVVLQTWLLNRYRMVQLEPIDQVVLSYGGLRGAVAFALVVLLDGDKVKEKNLFVSTTIIVVFFTVIFQGLTIKPLVQWLKVKRSEHREPRLNEKLHGRAFDHILSAIEDISGQIGHNYLRDKWSHFDRKFLSRVLMRRSAQKSRDRILNVFHELNLKDAISYVAEGERRGSLAFIRSPSTDNVVNVDFTPRSSTVEASVSYLLRENVSAVCLDMQSLEQRRRSIRDAEDMVTHHTLQQYLYKPRQEYKHLYSRHELTPTEDEKQDREIFHRTMRKRLESFKSTKLGLNQNKKAAKLYKRERAQKRRNSSIPNGKLPMESPAQNFTIKEKDLELSDTEEPPNYDEEMSGGIEFLASVTKDTASDSPAGIDNPVFSPDEALDRSLLARLPPWLSPGETVVPSQRARTQIPYSPGTFCRLMPFRLSSKSVDSFLQADGPEERPPAALPESTHM,mutated_sequence,1.0,834.0,NP_004165.2.a2m,NP_004165.2.npy,ClinVar
+NP_004167.3,NP_004167.3.csv,MDEPPFSEAALEQALGEPCDLDAALLTDIEDMLQLINNQDSDFPGLFDPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVPVLLQPHFIKADSLLLTAMKTDGATVKAAGLSPLVSGTTVQTGPLPTLVSGGTILATVPLVVDAEKLPINRLAAGSKAPASAQSRGEKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENLSLRTAVHKSKSLKDLVSACGSGGNTDVLMEGVKTEVEDTLTPPPSDAGSPFQSSPLSLGSRGSGSGGSGSDSEPDSPVFEDSKAKPEQRPSLHSRGMLDRSRLALCTLVFLCLSCNPLASLLGARGLPSPSDTTSVYHSPGRNVLGTESRDGPGWAQWLLPPVVWLLNGLLVLVSLVLLFVYGEPVTRPHSGPAVYFWRHRKQADLDLARGDFAQAAQQLWLALRALGRPLPTSHLDLACSLLWNLIRHLLQRLWVGRWLAGRAGGLQQDCALRVDASASARDAALVYHKLHQLHTMGKHTGGHLTATNLALSALNLAECAGDAVSVATLAEIYVAAALRVKTSLPRALHFLTRFFLSSARQACLAQSGSVPPAMQWLCHPVGHRFFVDGDWSVLSTPWESLYSLAGNPVDPLAQVTQLFREHLLERALNCVTQPNPSPGSADGDKEFSDALGYLQLLNSCSDAAGAPAYSFSISSSMATTTGVDPVAKWWASLTAVVIHWLRRDEEAAERLCPLVEHLPRVLQESERPLPRAALHSFKAARALLGCAKAESGPASLTICEKASGYLQDSLATTPASSSIDKAVQLFLCDLLLVVRTSLWRQQQPPAPAPAAQGTSSRPQASALELRGFQRDLSSLRRLAQSFRPAMRRVFLHEATARLMAGASPTRTHQLLDRSLRRRAGPGGKGGAVAELEPRPTRREHAEALLLASCYLPPGFLSAPGQRVGMLAEAARTLEKLGDRRLLHDCQQMLMRLGGGTTVTSS,mutated_sequence,1.0,1147.0,NP_004167.3.a2m,NP_004167.3.npy,ClinVar
+NP_004174.1,NP_004174.1.csv,MTITYTSQVANARLGSFSRLLLCWRGSIYKLLYGEFLIFLLCYYIIRFIYRLALTEEQQLMFEKLTLYCDSYIQLIPISFVLGFYVTLVVTRWWNQYENLPWPDRLMSLVSGFVEGKDEQGRLLRRTLIRYANLGNVLILRSVSTAVYKRFPSAQHLVQAGFMTPAEHKQLEKLSLPHNMFWVPWVWFANLSMKAWLGGRIRDPILLQSLLNEMNTLRTQCGHLYAYDWISIPLVYTQVVTVAVYSFFLTCLVGRQFLNPAKAYPGHELDLVVPVFTFLQFFFYVGWLKVAEQLINPFGEDDDDFETNWIVDRNLQVSLLAVDEMHQDLPRMEPDMYWNKPEPQPPYTAASAQFRRASFMGSTFNISLNKEEMEFQPNQEDEEDAHAGIIGRFLGLQSHDHHPPRANSRTKLLWPKRESLLHEGLPKNHKAAKQNVRGQEDNKAWKLKAVDAFKSAPLYQRPGYYSAPQTPLSPTPMFFPLEPSAPSKLHSVTGIDTKDKSLKTVSSGAKKSFELLSESDGALMEHPEVSQVRRKTVEFNLTDMPEIPENHLKEPLEQSPTNIHTTLKDHMDPYWALENRDEAHS,mutated_sequence,1.0,585.0,NP_004174.1.a2m,NP_004174.1.npy,ClinVar
+NP_004195.2,NP_004195.2.csv,MVLKAFFPTCCVSTDSGLLVGRWVPEQSSAVVLAVLHFPFIPIQVKQLLAQVRQASQVGVAVLGTWCHCRQEPEESLGRFLESLGAVFPHEPWLRLCRERGGTFWSCEATHRQAPTAPGAPGEDQVMLIFYDQRQVLLSQLHLPTVLPDRQAGATTASTGGLAAVFDTVARSEVLFRSDRFDEGPVRLSHWQSEGVEASILAELARRASGPICLLLASLLSLVSAVSACRVFKLWPLSFLGSKLSTCEQLRHRLEHLTLIFSTRKAENPAQLMRKANTVASVLLDVALGLMLLSWLHGRSRIGHLADALVPVADHVAEELQHLLQWLMGAPAGLKMNRALDQVLGRFFLYHIHLWISYIHLMSPFVEHILWHVGLSACLGLTVALSLLSDIIALLTFHIYCFYVYGARLYCLKIHGLSSLWRLFRGKKWNVLRQRVDSCSYDLDQLFIGTLLFTILLFLLPTTALYYLVFTLLRLLVVAVQGLIHLLVDLINSLPLYSLGLRLCRPYRLAAGVKFRVLRHEAGRPLRLLMQINPLPYSRVVHTYRLPSCGCHPKHSWGALCRKLFLGELIYPWRQRGDKQD,mutated_sequence,1.0,581.0,NP_004195.2.a2m,NP_004195.2.npy,ClinVar
+NP_004199.1,NP_004199.1.csv,MFRCGGLAAGALKQKLVPLVRTVCVRSPRQRNRLPGNLFQRWHVPLELQMTRQMASSGASGGKIDNSVLVLIVGLSTVGAGAYAYKTMKEDEKRYNERISGLGLTPEQKQKKAALSASEGEEVPQDKAPSHVPFLLIGGGTAAFAAARSIRARDPGARVLIVSEDPELPYMRPPLSKELWFSDDPNVTKTLRFKQWNGKERSIYFQPPSFYVSAQDLPHIENGGVAVLTGKKVVQLDVRDNMVKLNDGSQITYEKCLIATGGTPRSLSAIDRAGAEVKSRTTLFRKIGDFRSLEKISREVKSITIIGGGFLGSELACALGRKARALGTEVIQLFPEKGNMGKILPEYLSNWTMEKVRREGVKVMPNAIVQSVGVSSGKLLIKLKDGRKVETDHIVAAVGLEPNVELAKTGGLEIDSDFGGFRVNAELQARSNIWVAGDAACFYDIKLGRRRVEHHDHAVVSGRLAGENMTGAAKPYWHQSMFWSDLGPDVGYEAIGLVDSSLPTVGVFAKATAQDNPKSATEQSGTGIRSESETESEASEITIPPSTPAVPQAPVQGEDYGKGVIFYLRDKVVVGIVLWNIFNRMPIARKIIKDGEQHEDLNEVAKLFNIHED,mutated_sequence,1.0,613.0,NP_004199.1.a2m,NP_004199.1.npy,ClinVar
+NP_004205.2,NP_004205.2.csv,MTSELDIFVGNTTLIDEDVYRLWLDGYSVTDAVALRVRSGILEQTGATAAVLQSDTMDHYRTFHMLERLLHAPPKLLHQLIFQIPPSRQALLIERYYAFDEAFVREVLGKKLSKGTKKDLDDISTKTGITLKSCRRQFDNFKRVFKVVEEMRGSLVDNIQQHFLLSDRLARDYAAIVFFANNRFETGKKKLQYLSFGDFAFCAELMIQNWTLGAVDSQMDDMDMDLDKEFLQDLKELKVLVADKDLLDLHKSLVCTALRGKLGVFSEMEANFKNLSRGLVNVAAKLTHNKDVRDLFVDLVEKFVEPCRSDHWPLSDVRFFLNQYSASVHSLDGFRHQALWDRYMGTLRGCLLRLYHD,mutated_sequence,1.0,357.0,NP_004205.2.a2m,NP_004205.2.npy,ClinVar
+NP_004251.4,NP_004251.4.csv,MERLRDVRERLQAWERAFRRQRGRRPSQDDVEAAPEETRALYREYRTLKRTTGQAGGGLRSSESLPAAAEEAPEPRCWGPHLNRAATKSPQSTPGRSRQGSVPDYGQRLKANLKGTLQAGPALGRRPWPLGRASSKASTPKPPGTGPVPSFAEKVSDEPPQLPEPQPRPGRLQHLQASLSQRLGSLDPGWLQRCHSEVPDFLGAPKACRPDLGSEESQLLIPGESAVLGPGAGSQGPEASAFQEVSIRVGSPQPSSSGGEKRRWNEEPWESPAQVQQESSQAGPPSEGAGAVAVEEDPPGEPVQAQPPQPCSSPSNPRYHGLSPSSQARAGKAEGTAPLHIFPRLARHDRGNYVRLNMKQKHYVRGRALRSRLLRKQAWKQKWRKKGECFGGGGATVTTKESCFLNEQFDHWAAQCPRPASEEDTDAVGPEPLVPSPQPVPEVPSLDPTVLPLYSLGPSGQLAETPAEVFQALEQLGHQAFRPGQERAVMRILSGISTLLVLPTGAGKSLCYQLPALLYSRRSPCLTLVVSPLLSLMDDQVSGLPPCLKAACIHSGMTRKQRESVLQKIRAAQVHVLMLTPEALVGAGGLPPAAQLPPVAFACIDEAHCLSQWSHNFRPCYLRVCKVLRERMGVHCFLGLTATATRRTASDVAQHLAVAEEPDLHGPAPVPTNLHLSVSMDRDTDQALLTLLQGKRFQNLDSIIIYCNRREDTERIAALLRTCLHAAWVPGSGGRAPKTTAEAYHAGMCSRERRRVQRAFMQGQLRVVVATVAFGMGLDRPDVRAVLHLGLPPSFESYVQAVGRAGRDGQPAHCHLFLQPQGEDLRELRRHVHADSTDFLAVKRLVQRVFPACTCTCTRPPSEQEGAVGGERPVPKYPPQEAEQLSHQAAPGPRRVCMGHERALPIQLTVQALDMPEEAIETLLCYLELHPHHWLELLATTYTHCRLNCPGGPAQLQALAHRCPPLAVCLAQQLPEDPGQGSSSVEFDMVKLVDSMGWELASVRRALCQLQWDHEPRTGVRRGTGVLVEFSELAFHLRSPGDLTAEEKDQICDFLYGRVQARERQALARLRRTFQAFHSVAFPSCGPCLEQQDEERSTRLKDLLGRYFEEEEGQEPGGMEDAQGPEPGQARLQDWEDQVRCDIRQFLSLRPEEKFSSRAVARIFHGIGSPCYPAQVYGQDRRFWRKYLHLSFHALVGLATEELLQVAR,mutated_sequence,1.0,1208.0,NP_004251.4.a2m,NP_004251.4.npy,ClinVar
+NP_004278.2,NP_004278.2.csv,MDPLFQQTHKQVHEIQSCMGRLETADKQSVHIVENEIQASIDQIFSRLERLEILSSKEPPNKRQNARLRVDQLKYDVQHLQTALRNFQHRRHAREQQERQREELLSRTFTTNDSDTTIPMDESLQFNSSLQKVHNGMDDLILDGHNILDGLRTQRLTLKGTQKKILDIANMLGLSNTVMRLIEKRAFQDKYFMIGGMLLTCVVMFLVVQYLT,mutated_sequence,1.0,212.0,NP_004278.2.a2m,NP_004278.2.npy,ClinVar
+NP_004300.1,NP_004300.1.csv,MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSRFTDDDKTDHLSWEWNLTIKKDWKD,mutated_sequence,1.0,204.0,NP_004300.1.a2m,NP_004300.1.npy,ClinVar
+NP_004320.2,NP_004320.2.csv,MPQLYIYIRLLGAYLFIISRVQGQNLDSMLHGTGMKSDSDQKKSENGVTLAPEDTLPFLKCYCSGHCPDDAINNTCITNGHCFAIIEEDDQGETTLASGCMKYEGSDFQCKDSPKAQLRRTIECCRTNLCNQYLQPTLPPVVIGPFFDGSIRWLVLLISMAVCIIAMIIFSSCFCYKHYCKSISSRRRYNRDLEQDEAFIPVGESLKDLIDQSQSSGSGSGLPLLVQRTIAKQIQMVRQVGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETEIYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDFLKCATLDTRALLKLAYSAACGLCHLHTEIYGTQGKPAIAHRDLKSKNILIKKNGSCCIADLGLAVKFNSDTNEVDVPLNTRVGTKRYMAPEVLDESLNKNHFQPYIMADIYSFGLIIWEMARRCITGGIVEEYQLPYYNMVPSDPSYEDMREVVCVKRLRPIVSNRWNSDECLRAVLKLMSECWAHNPASRLTALRIKKTLAKMVESQDVKI,mutated_sequence,1.0,532.0,NP_004320.2.a2m,NP_004320.2.npy,ClinVar
+NP_004324.2,NP_004324.2.csv,MAALSGGGGGGAEPGQALFNGDMEPEAGAGAGAAASSAADPAIPEEVWNIKQMIKLTQEHIEALLDKFGGEHNPPSIYLEAYEEYTSKLDALQQREQQLLESLGNGTDFSVSSSASMDTVTSSSSSSLSVLPSSLSVFQNPTDVARSNPKSPQKPIVRVFLPNKQRTVVPARCGVTVRDSLKKALMMRGLIPECCAVYRIQDGEKKPIGWDTDISWLTGEELHVEVLENVPLTTHNFVRKTFFTLAFCDFCRKLLFQGFRCQTCGYKFHQRCSTEVPLMCVNYDQLDLLFVSKFFEHHPIPQEEASLAETALTSGSSPSAPASDSIGPQILTSPSPSKSIPIPQPFRPADEDHRNQFGQRDRSSSAPNVHINTIEPVNIDDLIRDQGFRGDGGSTTGLSATPPASLPGSLTNVKALQKSPGPQRERKSSSSSEDRNRMKTLGRRDSSDDWEIPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGVLRKTRHVNILLFMGYSTKPQLAIVTQWCEGSSLYHHLHIIETKFEMIKLIDIARQTAQGMDYLHAKSIIHRDLKSNNIFLHEDLTVKIGDFGLATVKSRWSGSHQFEQLSGSILWMAPEVIRMQDKNPYSFQSDVYAFGIVLYELMTGQLPYSNINNRDQIIFMVGRGYLSPDLSKVRSNCPKAMKRLMAECLKKKRDERPLFPQILASIELLARSLPKIHRSASEPSLNRAGFQTEDFSLYACASPKTPIQAGGYGAFPVH,mutated_sequence,1.0,766.0,NP_004324.2.a2m,NP_004324.2.npy,ClinVar
+NP_004351.1,NP_004351.1.csv,MGPWSRSLSALLLLLQVSSWLCQEPEPCHPGFDAESYTFTVPRRHLERGRVLGRVNFEDCTGRQRTAYFSLDTRFKVGTDGVITVKRPLRFHNPQIHFLVYAWDSTYRKFSTKVTLNTVGHHHRPPPHQASVSGIQAELLTFPNSSPGLRRQKRDWVIPPISCPENEKGPFPKNLVQIKSNKDKEGKVFYSITGQGADTPPVGVFIIERETGWLKVTEPLDRERIATYTLFSHAVSSNGNAVEDPMEILITVTDQNDNKPEFTQEVFKGSVMEGALPGTSVMEVTATDADDDVNTYNAAIAYTILSQDPELPDKNMFTINRNTGVISVVTTGLDRESFPTYTLVVQAADLQGEGLSTTATAVITVTDTNDNPPIFNPTTYKGQVPENEANVVITTLKVTDADAPNTPAWEAVYTILNDDGGQFVVTTNPVNNDGILKTAKGLDFEAKQQYILHVAVTNVVPFEVSLTTSTATVTVDVLDVNEAPIFVPPEKRVEVSEDFGVGQEITSYTAQEPDTFMEQKITYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNGSPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTSPFTAELTHGASANWTIQYNDPTQESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALLILILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQDFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDENLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD,mutated_sequence,1.0,882.0,NP_004351.1.a2m,NP_004351.1.npy,ClinVar
+NP_004355.2,NP_004355.2.csv,MESADFYEAEPRPPMSSHLQSPPHAPSSAAFGFPRGAGPAQPPAPPAAPEPLGGICEHETSIDISAYIDPAAFNDEFLADLFQHSRQQEKAKAAVGPTGGGGGGDFDYPGAPAGPGGAVMPGGAHGPPPGYGCAAAGYLDGRLEPLYERVGAPALRPLVIKQEPREEDEAKQLALAGLFPYQPPPPPPPSHPHPHPPPAHLAAPHLQFQIAHCGQTTMHLQPGHPTPPPTPVPSPHPAPALGAAGLPGPGSALKGLGAAHPDLRASGGSGAGKAKKSVDKNSNEYRVRRERNNIAVRKSRDKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRELDTLRGIFRQLPESSLVKAMGNCA,mutated_sequence,1.0,358.0,NP_004355.2.a2m,NP_004355.2.npy,ClinVar
+NP_004357.3,NP_004357.3.csv,MAAAAAEEGMEPRALQYEQTLMYGRYTQDLGAFAKEEAARIRLGGPEPWKGPPSSRAAPELLEYGRSRCARCRVCSVRCHKFLVSRVGEDWIFLVLLGLLMALVSWVMDYAIAACLQAQQWMSRGLNTSILLQYLAWVTYPVVLITFSAGFTQILAPQAVGSGIPEMKTILRGVVLKEYLTLKTFIAKVIGLTCALGSGMPLGKEGPFVHIASMCAALLSKFLSLFGGIYENESRNTEMLAAACAVGVGCCFAAPIGGVLFSIEVTSTFFAVRNYWRGFFAATFSAFIFRVLAVWNRDEETITALFKTRFRLDFPFDLQELPAFAVIGIASGFGGALFVYLNRKIVQVMRKQKTINRFLMRKRLLFPALVTLLISTLTFPPGFGQFMAGQLSQKETLVTLFDNRTWVRQGLVEELEPPSTSQAWNPPRANVFLTLVIFILMKFWMSALATTIPVPCGAFMPVFVIGAAFGRLVGESMAAWFPDGIHTDSSTYRIVPGGYAVVGAAALAGAVTHTVSTAVIVFELTGQIAHILPVMIAVILANAVAQSLQPSLYDSIIRIKKLPYLPELGWGRHQQYRVRVEDIMVRDVPHVALSCTFRDLRLALHRTKGRMLALVESPESMILLGSIERSQVVALLGAQLSPARRRQHMQERRATQTSPLSDQEGPPTPEASVCFQVNTEDSAFPAARGETHKPLKPALKRGPSVTRNLGESPTGSAESAGIALRSLFCGSPPPEAASEKLESCEKRKLKRVRISLASDADLEGEMSPEEILEWEEQQLDEPVNFSDCKIDPAPFQLVERTSLHKTHTIFSLLGVDHAYVTSIGRLIGIVTLKELRKAIEGSVTAQGVKVRPPLASFRDSATSSSDTETTEVHALWGPHSRHGLPREGSPSDSDDKCQ,mutated_sequence,1.0,898.0,NP_004357.3.a2m,NP_004357.3.npy,ClinVar
+NP_004360.2,NP_004360.2.csv,MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREFLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDIVFLVDGSSALGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFNTHPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEISQPAQELKRSSIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHSNKRDIIFLLDGSANVGKTNFPYVRDFVMNLVNSLDIGNDNIRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIREHVPQLLLLLTAGQSEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFLFDGSANLVGQFPVVRDFLYKIIDELNVKPEGTRIAVAQYSDDVKVESRFDEHQSKPEILNLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPASNLKQSGVVPFIFQAKNADPAELEQIVLSPAFILAAESLPKIGDLHPQIVNLLKSVHNGAPAPVSGEKDVVFLLDGSEGVRSGFPLLKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTADRSGDDVRNPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDPAVELKQFGVAPFTIARNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAADIVFLIDSSEGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILGFDGSRDQNVFVAQKGFESKVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEAFDFDEYQPEMLEKFRNMRSQHPYVLTEDTLKVYLNKFRQSSPDSVKVVIHFTDGADGDLADLHRASENLRQEGVRALILVGLERVVNLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRGDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFPGEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPRGEKGERGDVGIRGDPGNPGQDSQERGPKGETGDLGPMGVPGRDGVPGGPGETGKNGGFGRRGPPGAKGNKGGPGQPGFEGEQGTRGAQGPAGPAGPPGLIGEQGISGPRGSGGAAGAPGERGRTGPLGRKGEPGEPGPKGGIGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIRGRRGNSGPPGIVGQKGDPGYPGPAGPKGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFPTELAFALDTSEGVNQDTFGRMRDVVLSIVNDLTIAESNCPRGARVAVVTYNNEVTTEIRFADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNTPTRASPQLREAVLKLSDAGITPLFLTRQEDRQLINALQINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVRQLDMSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGYFFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTTKPVTTTTKPVTIINQPSVKPAAAKPAPAKPVAAKPVATKMATVRPPVAVKPATAAKPVAAKPAAVRPPAAAAAKPVATKPEVPRPQAAKPAATKPATTKPMVKMSREVQVFEITENSAKLHWERAEPPGPYFYDLTVTSAHDQSLVLKQNLTVTDRVIGGLLAGQTYHVAVVCYLRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPVLAKPGVISVMGT,mutated_sequence,1.0,3177.0,NP_004360.2.a2m,NP_004360.2.npy,ClinVar
+NP_004371.2,NP_004371.2.csv,MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDELIPNGGELGLLNSGNLVPDAASKHKQLSELLRGGSGSSINPGIGNVSASSPVQQGLGGQAQGQPNSANMASLSAMGKSPLSQGDSSAPSLPKQAASTSGPTPAASQALNPQAQKQVGLATSSPATSQTGPGICMNANFNQTHPGLLNSNSGHSLINQASQGQAQVMNGSLGAAGRGRGAGMPYPTPAMQGASSSVLAETLTQVSPQMTGHAGLNTAQAGGMAKMGITGNTSPFGQPFSQAGGQPMGATGVNPQLASKQSMVNSLPTFPTDIKNTSVTNVPNMSQMQTSVGIVPTQAIATGPTADPEKRKLIQQQLVLLLHAHKCQRREQANGEVRACSLPHCRTMKNVLNHMTHCQAGKACQVAHCASSRQIISHWKNCTRHDCPVCLPLKNASDKRNQQTILGSPASGIQNTIGSVGTGQQNATSLSNPNPIDPSSMQRAYAALGLPYMNQPQTQLQPQVPGQQPAQPQTHQQMRTLNPLGNNPMNIPAGGITTDQQPPNLISESALPTSLGATNPLMNDGSNSGNIGTLSTIPTAAPPSSTGVRKGWHEHVTQDLRSHLVHKLVQAIFPTPDPAALKDRRMENLVAYAKKVEGDMYESANSRDEYYHLLAEKIYKIQKELEEKRRSRLHKQGILGNQPALPAPGAQPPVIPQAQPVRPPNGPLSLPVNRMQVSQGMNSFNPMSLGNVQLPQAPMGPRAASPMNHSVQMNSMGSVPGMAISPSRMPQPPNMMGAHTNNMMAQAPAQSQFLPQNQFPSSSGAMSVGMGQPPAQTGVSQGQVPGAALPNPLNMLGPQASQLPCPPVTQSPLHPTPPPASTAAGMPSLQHTTPPGMTPPQPAAPTQPSTPVSSSGQTPTPTPGSVPSATQTQSTPTVQAAAQAQVTPQPQTPVQPPSVATPQSSQQQPTPVHAQPPGTPLSQAAASIDNRVPTPSSVASAETNSQQPGPDVPVLEMKTETQAEDTEPDPGESKGEPRSEMMEEDLQGASQVKEETDIAEQKSEPMEVDEKKPEVKVEVKEEEESSSNGTASQSTSPSQPRKKIFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKNPMDLSTIKRKLDTGQYQEPWQYVDDVWLMFNNAWLYNRKTSRVYKFCSKLAEVFEQEIDPVMQSLGYCCGRKYEFSPQTLCCYGKQLCTIPRDAAYYSYQNRYHFCEKCFTEIQGENVTLGDDPSQPQTTISKDQFEKKKNDTLDPEPFVDCKECGRKMHQICVLHYDIIWPSGFVCDNCLKKTGRPRKENKFSAKRLQTTRLGNHLEDRVNKFLRRQNHPEAGEVFVRVVASSDKTVEVKPGMKSRFVDSGEMSESFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSDCPPPNTRRVYISYLDSIHFFRPRCLRTAVYHEILIGYLEYVKKLGYVTGHIWACPPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAFAERIIHDYKDIFKQATEDRLTSAKELPYFEGDFWPNVLEESIKELEQEEEERKKEESTAASETTEGSQGDSKNAKKKNNKKTNKNKSSISRANKKKPSMPNVSNDLSQKLYATMEKHKEVFFVIHLHAGPVINTLPPIVDPDPLLSCDLMDGRDAFLTLARDKHWEFSSLRRSKWSTLCMLVELHTQGQDRFVYTCNECKHHVETRWHCTVCEDYDLCINCYNTKSHAHKMVKWGLGLDDEGSSQGEPQSKSPQESRRLSIQRCIQSLVHACQCRNANCSLPSCQKMKRVVQHTKGCKRKTNGGCPVCKQLIALCCYHAKHCQENKCPVPFCLNIKHKLRQQQIQHRLQQAQLMRRRMATMNTRNVPQQSLPSPTSAPPGTPTQQPSTPQTPQPPAQPQPSPVSMSPAGFPSVARTQPPTTVSTGKPTSQVPAPPPPAQPPPAAVEAARQIEREAQQQQHLYRVNINNSMPPGRTGMGTPGSQMAPVSLNVPRPNQVSGPVMPSMPPGQWQQAPLPQQQPMPGLPRPVISMQAQAAVAGPRMPSVQPPRSISPSALQDLLRTLKSPSSPQQQQQVLNILKSNPQLMAAFIKQRTAKYVANQPGMQPQPGLQSQPGMQPQPGMHQQPSLQNLNAMQAGVPRPGVPPQQQAMGGLNPQGQALNIMNPGHNPNMASMNPQYREMLRRQLLQQQQQQQQQQQQQQQQQQGSAGMAGGMAGHGQFQQPQGPGGYPPAMQQQQRMQQHLPLQGSSMGQMAAQMGQLGQMGQPGLGADSTPNIQQALQQRILQQQQMKQQIGSPGQPNPMSPQQHMLSGQPQASHLPGQQIATSLSNQVRSPAPVQSPRPQSQPPHSSPSPRIQPQPSPHHVSPQTGSPHPGLAVTMASSIDQGHLGNPEQSAMLPQLNTPSRSALSSELSLVGDTTGDTLEKFVEGL,mutated_sequence,1.0,2442.0,NP_004371.2.a2m,NP_004371.2.npy,ClinVar
+NP_004399.2,NP_004399.2.csv,MGNRGMEDLIPLVNRLQDAFSAIGQNADLDLPQIAVVGGQSAGKSSVLENFVGRDFLPRGSGIVTRRPLVLQLVNATTEYAEFLHCKGKKFTDFEEVRLEIEAETDRVTGTNKGISPVPINLRVYSPHVLNLTLVDLPGMTKVPVGDQPPDIEFQIRDMLMQFVTKENCLILAVSPANSDLANSDALKVAKEVDPQGQRTIGVITKLDLMDEGTDARDVLENKLLPLRRGYIGVVNRSQKDIDGKKDITAALAAERKFFLSHPSYRHLADRMGTPYLQKVLNQQLTNHIRDTLPGLRNKLQSQLLSIEKEVEEYKNFRPDDPARKTKALLQMVQQFAVDFEKRIEGSGDQIDTYELSGGARINRIFHERFPFELVKMEFDEKELRREISYAIKNIHGIRTGLFTPDMAFETIVKKQVKKIREPCLKCVDMVISELISTVRQCTKKLQQYPRLREEMERIVTTHIREREGRTKEQVMLLIDIELAYMNTNHEDFIGFANAQQRSNQMNKKKTSGNQDEILVIRKGWLTINNIGIMKGGSKEYWFVLTAENLSWYKDDEEKEKKYMLSVDNLKLRDVEKGFMSSKHIFALFNTEQRNVYKDYRQLELACETQEEVDSWKASFLRAGVYPERVGDKEKASETEENGSDSFMHSMDPQLERQVETIRNLVDSYMAIVNKTVRDLMPKTIMHLMINNTKEFIFSELLANLYSCGDQNTLMEESAEQAQRRDEMLRMYHALKEALSIIGDINTTTVSTPMPPPVDDSWLQVQSVPAGRRSPTSSPTPQRRAPAVPPARPGSRGPAPGPPPAGSALGGAPPVPSRPGASPDPFGPPPQVPSRPNRAPPGVPSRSGQASPSRPESPRPPFDL,mutated_sequence,1.0,864.0,NP_004399.2.a2m,NP_004399.2.npy,ClinVar
+NP_004406.2,NP_004406.2.csv,MSCNGGSHPRINTLGRMIRAESGPDLRYEVTSGGGGTSRMYYSRRGVITDQNSDGYCQTGTMSRHQNQNTIQELLQNCSDCLMRAELIVQPELKYGDGIQLTRSRELDECFAQANDQMEILDSLIREMRQMGQPCDAYQKRLLQLQEQMRALYKAISVPRVRRASSKGGGGYTCQSGSGWDEFTKHVTSECLGWMRQQRAEMDMVAWGVDLASVEQHINSHRGIHNSIGDYRWQLDKIKADLREKSAIYQLEEEYENLLKASFERMDHLRQLQNIIQATSREIMWINDCEEEELLYDWSDKNTNIAQKQEAFSIRMSQLEVKEKELNKLKQESDQLVLNQHPASDKIEAYMDTLQTQWSWILQITKCIDVHLKENAAYFQFFEEAQSTEAYLKGLQDSIRKKYPCDKNMPLQHLLEQIKELEKEREKILEYKRQVQNLVNKSKKIVQLKPRNPDYRSNKPIILRALCDYKQDQKIVHKGDECILKDNNERSKWYVTGPGGVDMLVPSVGLIIPPPNPLAVDLSCKIEQYYEAILALWNQLYINMKSLVSWHYCMIDIEKIRAMTIAKLKTMRQEDYMKTIADLELHYQEFIRNSQGSEMFGDDDKRKIQSQFTDAQKHYQTLVIQLPGYPQHQTVTTTEITHHGTCQDVNHNKVIETNRENDKQETWMLMELQKIRRQIEHCEGRMTLKNLPLADQGSSHHITVKINELKSVQNDSQAIAEVLNQLKDMLANFRGSEKYCYLQNEVFGLFQKLENINGVTDGYLNSLCTVRALLQAILQTEDMLKVYEARLTEEETVCLDLDKVEAYRCGLKKIKNDLNLKKSLLATMKTELQKAQQIHSQTSQQYPLYDLDLGKFGEKVTQLTDRWQRIDKQIDFRLWDLEKQIKQLRNYRDNYQAFCKWLYDAKRRQDSLESMKFGDSNTVMRFLNEQKNLHSEISGKRDKSEEVQKIAELCANSIKDYELQLASYTSGLETLLNIPIKRTMIQSPSGVILQEAADVHARYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKVRNNYDEEIISLKNQFETEINITKTTIHQLTMQKEEDTSGYRAQIDNLTRENRSLSEEIKRLKNTLTQTTENLRRVEEDIQQQKATGSEVSQRKQQLEVELRQVTQMRTEESVRYKQSLDDAAKTIQDKNKEIERLKQLIDKETNDRKCLEDENARLQRVQYDLQKANSSATETINKLKVQEQELTRLRIDYERVSQERTVKDQDITRFQNSLKELQLQKQKVEEELNRLKRTASEDSCKRKKLEEELEGMRRSLKEQAIKITNLTQQLEQASIVKKRSEDDLRQQRDVLDGHLREKQRTQEELRRLSSEVEALRRQLLQEQESVKQAHLRNEHFQKAIEDKSRSLNESKIEIERLQSLTENLTKEHLMLEEELRNLRLEYDDLRRGRSEADSDKNATILELRSQLQISNNRTLELQGLINDLQRERENLRQEIEKFQKQALEASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQRPYGSHRETQTECEWTVDTSKLVFDGLRKKVTAMQLYECQLIDKTTLDKLLKGKKSVEEVASEIQPFLRGAGSIAGASASPKEKYSLVEAKRKKLISPESTVMLLEAQAATGGIIDPHRNEKLTVDSAIARDLIDFDDRQQIYAAEKAITGFDDPFSGKTVSVSEAIKKNLIDRETGMRLLEAQIASGGVVDPVNSVFLPKDVALARGLIDRDLYRSLNDPRDSQKNFVDPVTKKKVSYVQLKERCRIEPHTGLLLLSVQKRSMSFQGIRQPVTVTELVDSGILRPSTVNELESGQISYDEVGERIKDFLQGSSCIAGIYNETTKQKLGIYEAMKIGLVRPGTALELLEAQAATGFIVDPVSNLRLPVEEAYKRGLVGIEFKEKLLSAERAVTGYNDPETGNIISLFQAMNKELIEKGHGIRLLEAQIATGGIIDPKESHRLPVDIAYKRGYFNEELSEILSDPSDDTKGFFDPNTEENLTYLQLKERCIKDEETGLCLLPLKEKKKQVQTSQKNTLRKRRVVIVDPETNKEMSVQEAYKKGLIDYETFKELCEQECEWEEITITGSDGSTRVVLVDRKTGSQYDIQDAIDKGLVDRKFFDQYRSGSLSLTQFADMISLKNGVGTSSSMGSGVSDDVFSSSRHESVSKISTISSVRNLTIRSSSFSDTLEESSPIAAIFDTENLEKISITEGIERGIVDSITGQRLLEAQACTGGIIHPTTGQKLSLQDAVSQGVIDQDMATRLKPAQKAFIGFEGVKGKKKMSAAEAVKEKWLPYEAGQRFLEFQYLTGGLVDPEVHGRISTEEAIRKGFIDGRAAQRLQDTSSYAKILTCPKTKLKISYKDAINRSMVEDITGLRLLEAASVSSKGLPSPYNMSSAPGSRSGSRSGSRSGSRSGSRSGSRRGSFDATGNSSYSYSYSFSSSSIGH,mutated_sequence,1.0,2871.0,NP_004406.2.a2m,NP_004406.2.npy,ClinVar
+NP_004420.1,NP_004420.1.csv,MARPGQRWLGKWLVAMVVWALCRLATPLAKNLEPVSWSSLNPKFLSGKGLVIYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQAAACSTVLDPNVLVTCNRPEQEIRFTIKFQEFSPNYMGLEFKKHHDYYITSTSNGSLEGLENREGGVCRTRTMKIIMKVGQDPNAVTPEQLTTSRPSKEADNTVKMATQAPGSRGSLGDSDGKHETVNQEEKSGPGASGGSSGDPDGFFNSKVALFAAVGAGCVIFLLIIIFLTVLLLKLRKRHRKHTQQRAAALSLSTLASPKGGSGTAGTEPSDIIIPLRTTENNYCPHYEKVSGDYGHPVYIVQEMPPQSPANIYYKV,mutated_sequence,1.0,346.0,NP_004420.1.a2m,NP_004420.1.npy,ClinVar
+NP_004439.2,NP_004439.2.csv,MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIVRGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQLCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRTVCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNTDTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKCSKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPLQPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGISWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEGLACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPECQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQPCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKIRKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWIPDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVTQLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSPNHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWELMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFRELVSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGFFCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDGDLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVRPQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAAPQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV,mutated_sequence,1.0,1255.0,NP_004439.2.a2m,NP_004439.2.npy,ClinVar
+NP_004444.2,NP_004444.2.csv,MLVPLAKLSCLAYQCFHALKIKKNYLPLCATRWSSTSTVPRITTHYTIYPRDKDKRWEGVNMERFAEEADVVIVGAGPAGLSAAVRLKQLAVAHEKDIRVCLVEKAAQIGAHTLSGACLDPGAFKELFPDWKEKGAPLNTPVTEDRFGILTEKYRIPVPILPGLPMNNHGNYIVRLGHLVSWMGEQAEALGVEVYPGYAAAEVLFHDDGSVKGIATNDVGIQKDGAPKATFERGLELHAKVTIFAEGCHGHLAKQLYKKFDLRANCEPQTYGIGLKELWVIDEKNWKPGRVDHTVGWPLDRHTYGGSFLYHLNEGEPLVALGLVVGLDYQNPYLSPFREFQRWKHHPSIRPTLEGGKRIAYGARALNEGGFQSIPKLTFPGGLLIGCSPGFMNVPKIKGTHTAMKSGILAAESIFNQLTSENLQSKTIGLHVTEYEDNLKNSWVWKELYSVRNIRPSCHGVLGVYGGMIYTGIFYWILRGMEPWTLKHKGSDFERLKPAKDCTPIEYPKPDGQISFDLLSSVALSGTNHEHDQPAHLTLRDDSIPVNRNLSIYDGPEQRFCPAGVYEFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQNINWVVPEGGGGPAYNGM,mutated_sequence,1.0,617.0,NP_004444.2.a2m,NP_004444.2.npy,ClinVar
+NP_004447.2,NP_004447.2.csv,MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSVSSLRGTRECSVTSDLDFPTQVIPLKTLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLHSFHTLFCRRCFKYDCFLHRKCNYSFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIGTKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRYSQADALKYVGIEREMEIP,mutated_sequence,1.0,751.0,NP_004447.2.a2m,NP_004447.2.npy,ClinVar
+NP_004454.2,NP_004454.2.csv,MHGHRAPGGAGPSEPEHPATNPPGAAPPACADSDPGASEPGLLARRGSGSALGGPLDPQFVGPSDTSLGAAPGHRVLPCGPSPQHHRALRFSYHLEGSQPRPGLHQGNRILVKSLSLDPGQSLEPHPEGPQRLRSDPGPPTETPSQRPSPLKRAPGPKPQVPPKPSYLQMPRMPPPLEPIPPPPSRPLPADPRVAKGLAPRAEASPSSAAVSSLIEKFEREPVIVASDRPVPGPSPGPPEPVMLPQPTSQPPVPQLPEGEASRCLFLLAPGPRDGEKVPNRDSGIDSISSPSNSEETCFVSDDGPPSHSLCPGPPALASVPVALADPHRPGSQEVDSDLEEEDDEEEEEEKDREIPVPLMERQESVELTVQQKVFHIANELLQTEKAYVSRLHLLDQVFCARLLEEARNRSSFPADVVHGIFSNICSIYCFHQQFLLPELEKRMEEWDRYPRIGDILQKLAPFLKMYGEYVKNFDRAVELVNTWTERSTQFKVIIHEVQKEEACGNLTLQHHMLEPVQRIPRYELLLKDYLLKLPHGSPDSKDAQKSLELIATAAEHSNAAIRKMERMHKLLKVYELLGGEEDIVSPTKELIKEGHILKLSAKNGTTQDRYLILFNDRLLYCVPRLRLLGQKFSVRARIDVDGMELKESSNLNLPRTFLVSGKQRSLELQARTEEEKKDWVQAINSTLLKHEQTLETFKLLNSTNREDEDTPPNSPNVDLGKRAPTPIREKEVTMCMRCQEPFNSITKRRHHCKACGHVVCGKCSEFRARLVYDNNRSNRVCTDCYVALHGVPGSSPACSQHTPQRRRSILEKQASVAAENSVICSFLHYMEKGGKGWHKAWFVVPENEPLVLYIYGAPQDVKAQRSLPLIGFEVGPPEAGERPDRRHVFKITQSHLSWYFSPETEELQRRWMAVLGRAGRGDTFCPGPTLSEDREMEEAPVAALGATAEPPESPQTRDKT,mutated_sequence,1.0,961.0,NP_004454.2.a2m,NP_004454.2.npy,ClinVar
+NP_004513.1,NP_004513.1.csv,MADPAECSIKVMCRFRPLNEAEILRGDKFIPKFKGDETVVIGQGKPYVFDRVLPPNTTQEQVYNACAKQIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDPQLMGIIPRIAHDIFDHIYSMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLAVHEDKNRVPYVKGCTERFVSSPEEVMDVIDEGKANRHVAVTNMNEHSSRSHSIFLINIKQENVETEKKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGTKTHVPYRDSKMTRILQDSLGGNCRTTIVICCSPSVFNEAETKSTLMFGQRAKTIKNTVSVNLELTAEEWKKKYEKEKEKNKTLKNVIQHLEMELNRWRNGEAVPEDEQISAKDQKNLEPCDNTPIIDNIAPVVAGISTEEKEKYDEEISSLYRQLDDKDDEINQQSQLAEKLKQQMLDQDELLASTRRDYEKIQEELTRLQIENEAAKDEVKEVLQALEELAVNYDQKSQEVEDKTRANEQLTDELAQKTTTLTTTQRELSQLQELSNHQKKRATEILNLLLKDLGEIGGIIGTNDVKTLADVNGVIEEEFTMARLYISKMKSEVKSLVNRSKQLESAQMDSNRKMNASERELAACQLLISQHEAKIKSLTDYMQNMEQKRRQLEESQDSLSEELAKLRAQEKMHEVSFQDKEKEHLTRLQDAEEMKKALEQQMESHREAHQKQLSRLRDEIEEKQKIIDEIRDLNQKLQLEQEKLSSDYNKLKIEDQEREMKLEKLLLLNDKREQAREDLKGLEETVSRELQTLHNLRKLFVQDLTTRVKKSVELDNDDGGGSAAQKQKISFLENNLEQLTKVHKQLVRDNADLRCELPKLEKRLRATAERVKALESALKEAKENAMRDRKRYQQEVDRIKEAVRAKNMARRAHSAQIAKPIRPGHYPASSPTAVHAIRGGGGSSSNSTHYQK,mutated_sequence,1.0,957.0,NP_004513.1.a2m,NP_004513.1.npy,ClinVar
+NP_004514.2,NP_004514.2.csv,MASQPNSSAKKKEEKGKNIQVVVRCRPFNLAERKASAHSIVECDPVRKEVSVRTGGLADKSSRKTYTFDMVFGASTKQIDVYRSVVCPILDEVIMGYNCTIFAYGQTGTGKTFTMEGERSPNEEYTWEEDPLAGIIPRTLHQIFEKLTDNGTEFSVKVSLLEIYNEELFDLLNPSSDVSERLQMFDDPRNKRGVIIKGLEEITVHNKDEVYQILEKGAAKRTTAATLMNAYSSRSHSVFSVTIHMKETTIDGEELVKIGKLNLVDLAGSENIGRSGAVDKRAREAGNINQSLLTLGRVITALVERTPHVPYRESKLTRILQDSLGGRTRTSIIATISPASLNLEETLSTLEYAHRAKNILNKPEVNQKLTKKALIKEYTEEIERLKRDLAAAREKNGVYISEENFRVMSGKLTVQEEQIVELIEKIGAVEEELNRVTELFMDNKNELDQCKSDLQNKTQELETTQKHLQETKLQLVKEEYITSALESTEEKLHDAASKLLNTVEETTKDVSGLHSKLDRKKAVDQHNAEAQDIFGKNLNSLFNNMEELIKDGSSKQKAMLEVHKTLFGNLLSSSVSALDTITTVALGSLTSIPENVSTHVSQIFNMILKEQSLAAESKTVLQELINVLKTDLLSSLEMILSPTVVSILKINSQLKHIFKTSLTVADKIEDQKKELDGFLSILCNNLHELQENTICSLVESQKQCGNLTEDLKTIKQTHSQELCKLMNLWTERFCALEEKCENIQKPLSSVQENIQQKSKDIVNKMTFHSQKFCADSDGFSQELRNFNQEGTKLVEESVKHSDKLNGNLEKISQETEQRCESLNTRTVYFSEQWVSSLNEREQELHNLLEVVSQCCEASSSDITEKSDGRKAAHEKQHNIFLDQMTIDEDKLIAQNLELNETIKIGLTKLNCFLEQDLKLDIPTGTTPQRKSYLYPSTLVRTEPREHLLDQLKRKQPELLMMLNCSENNKEETIPDVDVEEAVLGQYTEEPLSQEPSVDAGVDCSSIGGVPFFQHKKSHGKDKENRGINTLERSKVEETTEHLVTKSRLPLRAQINL,mutated_sequence,1.0,1056.0,NP_004514.2.a2m,NP_004514.2.npy,ClinVar
+NP_004521.1,NP_004521.1.csv,MEALMARGALTGPLRALCLLGCLLSHAAAAPSPIIKFPGDVAPKTDKELAVQYLNTFYGCPKESCNLFVLKDTLKKMQKFFGLPQTGDLDQNTIETMRKPRCGNPDVANYNFFPRKPKWDKNQITYRIIGYTPDLDPETVDDAFARAFQVWSDVTPLRFSRIHDGEADIMINFGRWEHGDGYPFDGKDGLLAHAFAPGTGVGGDSHFDDDELWTLGEGQVVRVKYGNADGEYCKFPFLFNGKEYNSCTDTGRSDGFLWCSTTYNFEKDGKYGFCPHEALFTMGGNAEGQPCKFPFRFQGTSYDSCTTEGRTDGYRWCGTTEDYDRDKKYGFCPETAMSTVGGNSEGAPCVFPFTFLGNKYESCTSAGRSDGKMWCATTANYDDDRKWGFCPDQGYSLFLVAAHEFGHAMGLEHSQDPGALMAPIYTYTKNFRLSQDDIKGIQELYGASPDIDLGTGPTPTLGPVTPEICKQDIVFDGIAQIRGEIFFFKDRFIWRTVTPRDKPMGPLLVATFWPELPEKIDAVYEAPQEEKAVFFAGNEYWIYSASTLERGYPKPLTSLGLPPDVQRVDAAFNWSKNKKTYIFAGDKFWRYNEVKKKMDPGFPKLIADAWNAIPDNLDAVVDLQGGGHSYFFKGAYYLKLENQSLKSVKFGSIKSDWLGC,mutated_sequence,1.0,660.0,NP_004521.1.a2m,NP_004521.1.npy,ClinVar
+NP_004535.1,NP_004535.1.csv,MALRLLKLAATSASARVVAAGAQRVRGIHSSVQCKLRYGMWHFLLGDKASKRLTERSRVITVDGNICTGKGKLAKEIAEKLGFKHFPEAGIHYPDSTTGDGKPLATDYNGNCSLEKFYDDPRSNDGNSYRLQSWLYSSRLLQYSDALEHLLTTGQGVVLERSIFSDFVFLEAMYNQGFIRKQCVDHYNEVKSVTICDYLPPHLVIYIDVPVPEVQRRIQKKGDPHEMKITSAYLQDIENAYKKTFLPEMSEKCEVLQYSAREAQDSKKVVEDIEYLKFDKGPWLKQDNRTLYHLRLLVQDKFEVLNYTSIPIFLPEVTIGAHQTDRVLHQFRELPGRKYSPGYNTEVGDKWIWLK,mutated_sequence,1.0,355.0,NP_004535.1.a2m,NP_004535.1.npy,ClinVar
+NP_004553.2,NP_004553.2.csv,MIVFVRFNSSHGFPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELRNDWTVQNCDLDQQSIVHIVQRPWRKGQEMNATGGDDPRNAAGGCEREPQSLTRVDLSSSVLPGDSVGLAVILHTDSRKDSPPAGSPAGRSIYNSFYVYCKGPCQRVQPGKLRVQCSTCRQATLTLTQGPSCWDDVLIPNRMSGECQSPHCPGTSAEFFFKCGAHPTSDKETSVALHLIATNSRNITCITCTDVRSPVLVFQCNSRHVICLDCFHLYCVTRLNDRQFVHDPQLGYSLPCVAGCPNSLIKELHHFRILGEEQYNRYQQYGAEECVLQMGGVLCPRPGCGAGLLPEPDQRKVTCEGGNGLGCGFAFCRECKEAYHEGECSAVFEASGTTTQAYRVDERAAEQARWEAASKETIKKTTKPCPRCHVPVEKNGGCMHMKCPQPQCRLEWCWNCGCEWNRVCMGDHWFDV,mutated_sequence,1.0,465.0,NP_004553.2.a2m,NP_004553.2.npy,ClinVar
+NP_004577.1,NP_004577.1.csv,MPLAQLADPWQKMAVESPSDSAENGQQIMDEPMGEEEINPQTEEVSIKEIAITHHVKEGHEKADPSQFELLKVLGQGSFGKVFLVKKISGSDARQLYAMKVLKKATLKVRDRVRTKMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFTEEDVKFYLAELALALDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKESIDHEKKAYSFCGTVEYMAPEVVNRRGHTQSADWWSFGVLMFEMLTGTLPFQGKDRKETMTMILKAKLGMPQFLSPEAQSLLRMLFKRNPANRLGAGPDGVEEIKRHSFFSTIDWNKLYRREIHPPFKPATGRPEDTFYFDPEFTAKTPKDSPGIPPSANAHQLFRGFSFVAITSDDESQAMQTVGVHSIVQQLHRNSIQFTDGYEVKEDIGVGSYSVCKRCIHKATNMEFAVKIIDKSKRDPTEEIEILLRYGQHPNIITLKDVYDDGKYVYVVTELMKGGELLDKILRQKFFSEREASAVLFTITKTVEYLHAQGVVHRDLKPSNILYVDESGNPESIRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKRQGYDAACDIWSLGVLLYTMLTGYTPFANGPDDTPEEILARIGSGKFSLSGGYWNSVSDTAKDLVSKMLHVDPHQRLTAALVLRHPWIVHWDQLPQYQLNRQDAPHLVKGAMAATYSALNRNQSPVLEPVGRSTLAQRRGIKKITSTAL,mutated_sequence,1.0,740.0,NP_004577.1.a2m,NP_004577.1.npy,ClinVar
+NP_004585.1,NP_004585.1.csv,MLRAALSLLALPLAGAAEEPTQKPESPGEPPPGLELFRWQWHEVEAPYLVALWILVASLAKIVFHLSRKVTSLVPESCLLILLGLVLGGIVLAVAKKAEYQLEPGTFFLFLLPPIVLDSGYFMPSRLFFDNLGAILTYAVVGTLWNAFTTGAALWGLQQAGLVAPRVQAGLLDFLLFGSLISAVDPVAVLAVFEEVHVNETLFIIVFGESLLNDAVTVVLYKVCNSFVEMGSANVQATDYLKGVASLFVVSLGGAAVGLVFAFLLALTTRFTKRVRIIEPLLVFLLAYAAYLTAEMASLSAILAVTMCGLGCKKYVEANISHKSRTTVKYTMKTLASCAETVIFMLLGISAVDSSKWAWDSGLVLGTLIFILFFRALGVVLQTWVLNQFRLVPLDKIDQVVMSYGGLRGAVAFALVILLDRTKVPAKDYFVATTIVVVFFTVIVQGLTIKPLVKWLKVKRSEHHKPTLNQELHEHTFDHILAAVEDVVGHHGYHYWRDRWEQFDKKYLSQLLMRRSAYRIRDQIWDVYYRLNIRDAISFVDQGGHVLSSTGLTLPSMPSRNSVAETSVTNLLRESGSGACLDLQVIDTVRSGRDREDAVMHHLLCGGLYKPRRRYKASCSRHFISEDAQERQDKEVFQQNMKRRLESFKSTKHNICFTKSKPRPRKTGRRKKDGVANAEATNGKHRGLGFQDTAAVILTVESEEEEEESDSSETEKEDDEGIIFVARATSEVLQEGKVSGSLEVCPSPRIIPPSPTCAEKELPWKSGQGDLAVYVSSETTKIVPVDMQTGWNQSISSLESLASPPCNQAPILTCLPPHPRGTEEPQVPLHLPSDPRSSFAFPPSLAKAGRSRSESSADLPQQQELQPLMGHKDHTHLSPGTATSHWCIQFNRGSRL,mutated_sequence,1.0,896.0,NP_004585.1.a2m,NP_004585.1.npy,ClinVar
+NP_004594.1,NP_004594.1.csv,MKDRTQELRTAKDSDDDDDVAVTVDRDRFMDEFFEQVEEIRGFIDKIAENVEEVKRKHSAILASPNPDEKTKEELEELMSDIKKTANKVRSKLKSIEQSIEQEEGLNRSSADLRIRKTQHSTLSRKFVEVMSEYNATQSDYRERCKGRIQRQLEITGRTTTSEELEDMLESGNPAIFASGIIMDSSISKQALSEIETRHSEIIKLENSIRELHDMFMDMAMLVESQGEMIDRIEYNVEHAVDYVERAVSDTKKAVKYQSKARRKKIMIIICCVILGIVIASTVGGIFA,mutated_sequence,1.0,288.0,NP_004594.1.a2m,NP_004594.1.npy,ClinVar
+NP_004599.2,NP_004599.2.csv,MYHPRELYPSLGAGYRLGPAQPGADSSFPPALAEGYRYPELDTPKLDCFLSGMEAAPRTLAAHPPLPLLPPAMGTEPAPSAPEALHSLPGVSLSLENRELWKEFSSVGTEMIITKAGRRMFPACRVSVTGLDPEARYLFLLDVIPVDGARYRWQGRRWEPSGKAEPRLPDRVYIHPDSPATGAHWMRQPVSFHRVKLTNSTLDPHGHLILHSMHKYQPRIHLVRAAQLCSQHWGGMASFRFPETTFISVTAYQNPQITQLKIAANPFAKGFRENGRNCKRERDARVKRKLRGPEPAATEAYGSGDTPGGPCDSTLGGDIRESDPEQAPAPGEATAAPAPLCGGPSAEAYLLHPAAFHGAPSHLPTRSPSFPEAPDSGRSAPYSAAFLELPHGSGGSGYPAAPPAVPFAPHFLQGGPFPLPYTAPGGYLDVGSKPMY,mutated_sequence,1.0,436.0,NP_004599.2.a2m,NP_004599.2.npy,ClinVar
+NP_004603.1,NP_004603.1.csv,MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSSKTGSVTTTYCCNQDHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTTLKDLIYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKLRPNIPNRWQSCEALRVMAKIMRECWYANGAARLTALRIKKTLSQLSQQEGIKM,mutated_sequence,1.0,503.0,NP_004603.1.a2m,NP_004603.1.npy,ClinVar
+NP_004605.4,NP_004605.4.csv,MLLWPLRGWAARALRCFGPGSRGSPASGPGPRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP,mutated_sequence,1.0,265.0,NP_004605.4.a2m,NP_004605.4.npy,ClinVar
+NP_004620.1,NP_004620.1.csv,MSRQTTSVGSSCLDLWREKNDRLVRQAKVAQNSGLTLRRQQLAQDALEGLRGLLHSLQGLPAAVPVLPLELTVTCNFIILRASLAQGFTEDQAQDIQRSLERVLETQEQQGPRLEQGLRELWDSVLRASCLLPELLSALHRLVGLQAALWLSADRLGDLALLLETLNGSQSGASKDLLLLLKTWSPPAEELDAPLTLQDAQGLKDVLLTAFAYRQGLQELITGNPDKALSSLHEAASGLCPRPVLVQVYTALGSCHRKMGNPQRALLYLVAALKEGSAWGPPLLEASRLYQQLGDTTAELESLELLVEALNVPCSSKAPQFLIEVELLLPPPDLASPLHCGTQSQTKHILASRCLQTGRAGDAAEHYLDLLALLLDSSEPRFSPPPSPPGPCMPEVFLEAAVALIQAGRAQDALTLCEELLSRTSSLLPKMSRLWEDARKGTKELPYCPLWVSATHLLQGQAWVQLGAQKVAISEFSRCLELLFRATPEEKEQGAAFNCEQGCKSDAALQQLRAAALISRGLEWVASGQDTKALQDFLLSVQMCPGNRDTYFHLLQTLKRLDRRDEATALWWRLEAQTKGSHEDALWSLPLYLESYLSWIRPSDRDAFLEEFRTSLPKSCDL,mutated_sequence,1.0,622.0,NP_004620.1.a2m,NP_004620.1.npy,ClinVar
+NP_004634.1,NP_004634.1.csv,MAAAAAAAAAAGAAGGRGSGPGRRRHLVPGAGGEAGEGAPGGAGDYGNGLESEELEPEELLLEPEPEPEPEEEPPRPRAPPGAPGPGPGSGAPGSQEEEEEPGLVEGDPGDGAIEDPELEAIKARVREMEEEAEKLKELQNEVEKQMNMSPPPGNAGPVIMSIEEKMEADARSIYVGNVDYGATAEELEAHFHGCGSVNRVTILCDKFSGHPKGFAYIEFSDKESVRTSLALDESLFRGRQIKVIPKRTNRPGISTTDRGFPRARYRARTTNYNSSRSRFYSGFNSRPRGRVYRGRARATSWYSPY,mutated_sequence,1.0,306.0,NP_004634.1.a2m,NP_004634.1.npy,ClinVar
+NP_004637.1,NP_004637.1.csv,MALGTTLRASLLLLGLLTEGLAQLAIPASVPRGFWALPENLTVVEGASVELRCGVSTPGSAVQWAKDGLLLGPDPRIPGFPRYRLEGDPARGEFHLHIEACDLSDDAEYECQVGRSEMGPELVSPRVILSILVPPKLLLLTPEAGTMVTWVAGQEYVVNCVSGDAKPAPDITILLSGQTISDISANVNEGSQQKLFTVEATARVTPRSSDNRQLLVCEASSPALEAPIKASFTVNVLFPPGPPVIEWPGLDEGHVRAGQSLELPCVARGGNPLATLQWLKNGQPVSTAWGTEHTQAVARSVLVMTVRPEDHGAQLSCEAHNSVSAGTQEHGITLQVTFPPSAIIILGSASQTENKNVTLSCVSKSSRPRVLLRWWLGWRQLLPMEETVMDGLHGGHISMSNLTFLARREDNGLTLTCEAFSEAFTKETFKKSLILNVKYPAQKLWIEGPPEGQKLRAGTRVRLVCLAIGGNPEPSLMWYKDSRTVTESRLPQESRRVHLGSVEKSGSTFSRELVLVTGPSDNQAKFTCKAGQLSASTQLAVQFPPTNVTILANASALRPGDALNLTCVSVSSNPPVNLSWDKEGERLEGVAAPPRRAPFKGSAAARSVLLQVSSRDHGQRVTCRAHSAELRETVSSFYRLNVLYRPEFLGEQVLVVTAVEQGEALLPVSVSANPAPEAFNWTFRGYRLSPAGGPRHRILSSGALHLWNVTRADDGLYQLHCQNSEGTAEARLRLDVHYAPTIRALQDPTEVNVGGSVDIVCTVDANPILPGMFNWERLGEDEEDQSLDDMEKISRGPTGRLRIHHAKLAQAGAYQCIVDNGVAPPARRLLRLVVRFAPQVEHPTPLTKVAAAGDSTSSATLHCRARGVPNIVFTWTKNGVPLDLQDPRYTEHTYHQGGVHSSLLTIANVSAAQDYALFTCTATNALGSDQTNIQLVSISRPDPPSGLKVVSLTPHSVGLEWKPGFDGGLPQRFCIRYEALGTPGFHYVDVVPPQATTFTLTGLQPSTRYRVWLLASNALGDSGLADKGTQLPITTPGLHQPSGEPEDQLPTEPPSGPSGLPLLPVLFALGGLLLLSNASCVGGVLWQRRLRRLAEGISEKTEAGSEEDRVRNEYEESQWTGERDTQSSTVSTTEAEPYYRSLRDFSPQLPPTQEEVSYSRGFTGEDEDMAFPGHLYDEVERTYPPSGAWGPLYDEVQMGPWDLHWPEDTYQDPRGIYDQVAGDLDTLEPDSLPFELRGHLV,mutated_sequence,1.0,1241.0,NP_004637.1.a2m,NP_004637.1.npy,ClinVar
+NP_004691.2,NP_004691.2.csv,MAEAPPRRLGLGPPPGDAPRAELVALTAVQSEQGEAGGGGSPRRLGLLGSPLPPGAPLPGPGSGSGSACGQRSSAAHKRYRRLQNWVYNVLERPRGWAFVYHVFIFLLVFSCLVLSVLSTIQEHQELANECLLILEFVMIVVFGLEYIVRVWSAGCCCRYRGWQGRFRFARKPFCVIDFIVFVASVAVIAAGTQGNIFATSALRSMRFLQILRMVRMDRRGGTWKLLGSVVYAHSKELITAWYIGFLVLIFASFLVYLAEKDANSDFSSYADSLWWGTITLTTIGYGDKTPHTWLGRVLAAGFALLGISFFALPAGILGSGFALKVQEQHRQKHFEKRRMPAANLIQAAWRLYSTDMSRAYLTATWYYYDSILPSFRELALLFEHVQRARNGGLRPLEVRRAPVPDGAPSRYPPVATCHRPGSTSFCPGESSRMGIKDRIRMGSSQRRTGPSKQHLAPPTMPTSPSSEQVGEATSPTKVQKSWSFNDRTRFRASLRLKPRTSAEDAPSEEVAEEKSYQCELTVDDIMPAVKTVIRSIRILKFLVAKRKFKETLRPYDVKDVIEQYSAGHLDMLGRIKSLQTRVDQIVGRGPGDRKAREKGDKGPSDAEVVDEISMMGRVVKVEKQVQSIEHKLDLLLGFYSRCLRSGTSASLGAVQVPLFDPDITSDYHSPVDHEDISVSAQTLSISRSVSTNMD,mutated_sequence,1.0,695.0,NP_004691.2.a2m,NP_004691.2.npy,ClinVar
+NP_004718.1,NP_004718.1.csv,MGKLIRMGPQERWLLRTKRLHWSRLLFLLGMLIIGSTYQHLRRPRGLSSLWAAVSSHQPIKLASRDLSSEEMMMMSSSPSKPSSEMGGKMLVPQASVGSDEATLSMTVENIPSMPKRTAKMIPTTTKNNYSPTAAGTERRKEDTPTSSRTLTYYTSTSSRQIVKKYTPTPRGEMKSYSPTQVREKVKYTPSPRGRRVGTYVPSTFMTMETSHAITPRTTVKDSDITATYKILETNSLKRIMEETTPTTLKGMFDSTPTFLTHEVEANVLTSPRSVMEKNNLFPPRRVESNSSAHPWGLVGKSNPKTPQGTVLLHTPATSEGQVTISTMTGSSPAETKAFTAAWSLRNPSPRTSVSAIKTAPAIVWRLAKKPSTAPSTSTTPTVRAKLTMQVHHCVVVKPTPAMLTTPSPSLTTALLPEELSPSPSVLPPSLPDLHPKGEYPPDLFSVEERRQGWVVLHVFGMMYVFVALAIVCDEYFVPALGVITDKLQISEDVAGATFMAAGGSAPELFTSLIGVFISHSNVGIGTIVGSAVFNILFVIGTCSLFSREILNLTWWPLFRDVSFYILDLIMLILFFLDSLIAWWESLLLLLAYAFYVFTMKWNKHIEVWVKEQLSRRPVAKVMALEDLSKPGDGAIAVDELQDNKKLKLPSLLTRGSSSTSLHNSTIRSTIYQLMLHSLDPLREVRLAKEKEEESLNQGARAQPQAKAESKPEEEEPAKLPAVTVTPAPVPDIKGDQKENPGGQEDVAEAESTGEMPGEEGETAGEGETEEKSGGETQPEGEGETETQGKGEECEDENEAEGKGDNEGEDEGEIHAEDGEMKGNEGETESQELSAENHGEAKNDEKGVEDGGGSDGGDSEEEEEEEEEQEEEEEEEEQEEEEEEEEEEEEKGNEEPLSLDWPETRQKQAIYLFLLPIVFPLWLTVPDVRRQESRKFFVFTFLGSIMWIAMFSYLMVWWAHQVGETIGISEEIMGLTILAAGTSIPDLITSVIVARKGLGDMAVSSSVGSNIFDITVGLPVPWLLFSLINGLQPVPVSSNGLFCAIVLLFLMLLFVISSIASCKWRMNKILGFTMFLLYFVFLIISVMLEDRIISCPVSV,mutated_sequence,1.0,1099.0,NP_004718.1.a2m,NP_004718.1.npy,ClinVar
+NP_004735.2,NP_004735.2.csv,MKNPMLEVVSLLLEKLLLISNFTLFSSGAAGEDKGRNSFYETSSFHRGDVLEVPRTHLTHYGIYLGDNRVAHMMPDILLALTDDMGRTQKVVSNKRLILGVIVKVASIRVDTVEDFAYGANILVNHLDESLQKKALLNEEVARRAEKLLGFTPYSLLWNNCEHFVTYCRYGTPISPQSDKFCETVKIIIRDQRSVLASAVLGLASIVCTGLVSYTTLPAIFIPFFLWMAG,mutated_sequence,1.0,230.0,NP_004735.2.a2m,NP_004735.2.npy,ClinVar
+NP_004804.2,NP_004804.2.csv,MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLLNDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALVQLAKAVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQNTPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQRSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARILFLLQLLADHVPGVGLVTRPLMDYLPTWQKIYFYSWG,mutated_sequence,1.0,336.0,NP_004804.2.a2m,NP_004804.2.npy,ClinVar
+NP_004809.2,NP_004809.2.csv,MAGELADKKDRDASPSKEERKRSRTPDRERDRDRDRKSSPSKDRKRHRSRDRRRGGSRSRSRSRSKSAERERRHKERERDKERDRNKKDRDRDKDGHRRDKDRKRSSLSPGRGKDFKSRKDRDSKKDEEDEHGDKKPKAQPLSLEELLAKKKAEEEAEAKPKFLSKAEREAEALKRRQQEVEERQRMLEEERKKRKQFQDLGRKMLEDPQERERRERRERMERETNGNEDEEGRQKIREEKDKSKELHAIKERYLGGIKKRRRTRHLNDRKFVFEWDASEDTSIDYNPLYKERHQVQLLGRGFIAGIDLKQQKREQSRFYGDLMEKRRTLEEKEQEEARLRKLRKKEAKQRWDDRHWSQKKLDEMTDRDWRIFREDYSITTKGGKIPNPIRSWKDSSLPPHILEVIDKCGYKEPTPIQRQAIPIGLQNRDIIGVAETGSGKTAAFLIPLLVWITTLPKIDRIEESDQGPYAIILAPTRELAQQIEEETIKFGKPLGIRTVAVIGGISREDQGFRLRMGCEIVIATPGRLIDVLENRYLVLSRCTYVVLDEADRMIDMGFEPDVQKILEHMPVSNQKPDTDEAEDPEKMLANFESGKHKYRQTVMFTATMPPAVERLARSYLRRPAVVYIGSAGKPHERVEQKVFLMSESEKRKKLLAILEQGFDPPIIIFVNQKKGCDVLAKSLEKMGYNACTLHGGKGQEQREFALSNLKAGAKDILVATDVAGRGIDIQDVSMVVNYDMAKNIEDYIHRIGRTGRAGKSGVAITFLTKEDSAVFYELKQAILESPVSSCPPELANHPDAQHKPGTILTKKRREETIFA,mutated_sequence,1.0,820.0,NP_004809.2.a2m,NP_004809.2.npy,ClinVar
+NP_004813.2,NP_004813.2.csv,MMRAVWEALAALAAVACLVGAVRGGPGLSMFAGQAAQPDPCSDENGHPRRCIPDFVNAAFGKDVRVSSTCGRPPARYCVVSERGEERLRSCHLCNASDPKKAHPPAFLTDLNNPHNLTCWQSENYLQFPHNVTLTLSLGKKFEVTYVSLQFCSPRPESMAIYKSMDYGRTWVPFQFYSTQCRKMYNRPHRAPITKQNEQEAVCTDSHTDMRPLSGGLIAFSTLDGRPSAHDFDNSPVLQDWVTATDIRVAFSRLHTFGDENEDDSELARDSYFYAVSDLQVGGRCKCNGHAARCVRDRDDSLVCDCRHNTAGPECDRCKPFHYDRPWQRATAREANECVACNCNLHARRCRFNMELYKLSGRKSGGVCLNCRHNTAGRHCHYCKEGYYRDMGKPITHRKACKACDCHPVGAAGKTCNQTTGQCPCKDGVTGITCNRCAKGYQQSRSPIAPCIKIPVAPPTTAASSVEEPEDCDSYCKASKGKLKINMKKYCKKDYAVQIHILKADKAGDWWKFTVNIISVYKQGTSRIRRGDQSLWIRSRDIACKCPKIKPLKKYLLLGNAEDSPDQSGIVADKSSLVIQWRDTWARRLRKFQQREKKGKCKKA,mutated_sequence,1.0,604.0,NP_004813.2.a2m,NP_004813.2.npy,ClinVar
+NP_004850.1,NP_004850.1.csv,MAQILPIRFQEHLQLQNLGINPANIGFSTLTMESDKFICIREKVGEQAQVVIIDMNDPSNPIRRPISADSAIMNPASKVIALKAGKTLQIFNIEMKSKMKAHTMTDDVTFWKWISLNTVALVTDNAVYHWSMEGESQPVKMFDRHSSLAGCQIINYRTDAKQKWLLLTGISAQQNRVVGAMQLYSVDRKVSQPIEGHAASFAQFKMEGNAEESTLFCFAVRGQAGGKLHIIEVGTPPTGNQPFPKKAVDVFFPPEAQNDFPVAMQISEKHDVVFLITKYGYIHLYDLETGTCIYMNRISGETIFVTAPHEATAGIIGVNRKGQVLSVCVEEENIIPYITNVLQNPDLALRMAVRNNLAGAEELFARKFNALFAQGNYSEAAKVAANAPKGILRTPDTIRRFQSVPAQPGQTSPLLQYFGILLDQGQLNKYESLELCRPVLQQGRKQLLEKWLKEDKLECSEELGDLVKSVDPTLALSVYLRANVPNKVIQCFAETGQVQKIVLYAKKVGYTPDWIFLLRNVMRISPDQGQQFAQMLVQDEEPLADITQIVDVFMEYNLIQQCTAFLLDALKNNRPSEGPLQTRLLEMNLMHAPQVADAILGNQMFTHYDRAHIAQLCEKAGLLQRALEHFTDLYDIKRAVVHTHLLNPEWLVNYFGSLSVEDSLECLRAMLSANIRQNLQICVQVASKYHEQLSTQSLIELFESFKSFEGLFYFLGSIVNFSQDPDVHFKYIQAACKTGQIKEVERICRESNCYDPERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQKYIEIYVQKVNPSRLPVVIGGLLDVDCSEDVIKNLILVVRGQFSTDELVAEVEKRNRLKLLLPWLEARIHEGCEEPATHNALAKIYIDSNNNPERFLRENPYYDSRVVGKYCEKRDPHLACVAYERGQCDLELINVCNENSLFKSLSRYLVRRKDPELWGSVLLESNPYRRPLIDQVVQTALSETQDPEEVSVTVKAFMTADLPNELIELLEKIVLDNSVFSEHRNLQNLLILTAIKADRTRVMEYINRLDNYDAPDIANIAISNELFEEAFAIFRKFDVNTSAVQVLIEHIGNLDRAYEFAERCNEPAVWSQLAKAQLQKGMVKEAIDSYIKADDPSSYMEVVQAANTSGNWEELVKYLQMARKKARESYVETELIFALAKTNRLAELEEFINGPNNAHIQQVGDRCYDEKMYDAAKLLYNNVSNFGRLASTLVHLGEYQAAVDGARKANSTRTWKEVCFACVDGKEFRLAQMCGLHIVVHADELEELINYYQDRGYFEELITMLEAALGLERAHMGMFTELAILYSKFKPQKMREHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAIITMMNHPTDAWKEGQFKDIITKVANVELYYRAIQFYLEFKPLLLNDLLMVLSPRLDHTRAVNYFSKVKQLPLVKPYLRSVQNHNNKSVNESLNNLFITEEDYQALRTSIDAYDNFDNISLAQRLEKHELIEFRRIAAYLFKGNNRWKQSVELCKKDSLYKDAMQYASESKDTELAEELLQWFLQEEKRECFGACLFTCYDLLRPDVVLETAWRHNIMDFAMPYFIQVMKEYLTKVDKLDASESLRKEEEQATETQPIVYGQPQLMLTAGPSVAVPPQAPFGYGYTAPPYGQPQPGFGYSM,mutated_sequence,1.0,1675.0,NP_004850.1.a2m,NP_004850.1.npy,ClinVar
+NP_004874.1,NP_004874.1.csv,MRQVCCSALPPPPLEKGRCSSYSDSSSSSSERSSSSSSSSSESGSSSRSSSNNSSISRPAAPPEPRPQQQPQPRSPAARRAAARSRAAAAGGMRRDPAPGFSMLLFGVSLACYSPSLKSVQDQAYKAPVVVEGKVQGLVPAGGSSSNSTREPPASGRVALVKVLDKWPLRSGGLQREQVISVGSCVPLERNQRYIFFLEPTEQPLVFKTAFAPLDTNGKNLKKEVGKILCTDCATRPKLKKMKSQTGQVGEKQSLKCEAAAGNPQPSYRWFKDGKELNRSRDIRIKYGNGRKNSRLQFNKVKVEDAGEYVCEAENILGKDTVRGRLYVNSVSTTLSSWSGHARKCNETAKSYCVNGGVCYYIEGINQLSCKCPNGFFGQRCLEKLPLRLYMPDPKQKAEELYQKRVLTITGICVALLVVGIVCVVAYCKTKKQRKQMHNHLRQNMCPAHQNRSLANGPSHPRLDPEEIQMADYISKNVPATDHVIRRETETTFSGSHSCSPSHHCSTATPTSSHRHESHTWSLERSESLTSDSQSGIMLSSVGTSKCNSPACVEARARRAAAYNLEERRRATAPPYHDSVDSLRDSPHSERYVSALTTPARLSPVDFHYSLATQVPTFEITSPNSAHAVSLPPAAPISYRLAEQQPLLRHPAPPGPGPGPGPGPGPGADMQRSYDSYYYPAAGPGPRRGTCALGGSLGSLPASPFRIPEDDEYETTQECAPPPPPRPRARGASRRTSAGPRRWRRSRLNGLAAQRARAARDSLSLSSGSGGGSASASDDDADDADGALAAESTPFLGLRGAHDALRSDSPPLCPAADSRTYYSLDSHSTRASSRHSRGPPPRAKQDSAPL,mutated_sequence,1.0,850.0,NP_004874.1.a2m,NP_004874.1.npy,ClinVar
+NP_004915.2,NP_004915.2.csv,MVDYHAANQSYQYGPSSAGNGAGGGGSMGDYMAQEDDWDRDLLLDPAWEKQQRKTFTAWCNSHLRKAGTQIENIDEDFRDGLKLMLLLEVISGERLPKPERGKMRVHKINNVNKALDFIASKGVKLVSIGAEEIVDGNAKMTLGMIWTIILRFAIQDISVEETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAFEVAEKYLDIPKMLDAEDIVNTARPDEKAIMTYVSSFYHAFSGAQKAETAANRICKVLAVNQENEHLMEDYEKLASDLLEWIRRTIPWLEDRVPQKTIQEMQQKLEDFRDYRRVHKPPKVQEKCQLEINFNTLQTKLRLSNRPAFMPSEGKMVSDINNGWQHLEQAEKGYEEWLLNEIRRLERLDHLAEKFRQKASIHEAWTDGKEAMLKHRDYETATLSDIKALIRKHEAFESDLAAHQDRVEQIAAIAQELNELDYYDSHNVNTRCQKICDQWDALGSLTHSRREALEKTEKQLEAIDQLHLEYAKRAAPFNNWMESAMEDLQDMFIVHTIEEIEGLISAHDQFKSTLPDADREREAILAIHKEAQRIAESNHIKLSGSNPYTTVTPQIINSKWEKVQQLVPKRDHALLEEQSKQQSNEHLRRQFASQANVVGPWIQTKMEEIGRISIEMNGTLEDQLSHLKQYERSIVDYKPNLDLLEQQHQLIQEALIFDNKHTNYTMEHIRVGWEQLLTTIARTINEVENQILTRDAKGISQEQMQEFRASFNHFDKDHGGALGPEEFKACLISLGYDVENDRQGEAEFNRIMSLVDPNHSGLVTFQAFIDFMSRETTDTDTADQVIASFKVLAGDKNFITAEELRRELPPDQAEYCIARMAPYQGPDAVPGALDYKSFSTALYGESDL,mutated_sequence,1.0,911.0,NP_004915.2.a2m,NP_004915.2.npy,ClinVar
+NP_004928.2,NP_004928.2.csv,MIRNWLTIFILFPLKLVEKCESSVSLTVPPVVKLENGSSTNVSLTLRPPLNATLVITFEITFRSKNITILELPDEVVVPPGVTNSSFQVTSQNVGQLTVYLHGNHSNQTGPRIRFLVIRSSAISIINQVIGWIYFVAWSISFYPQVIMNWRRKSVIGLSFDFVALNLTGFVAYSVFNIGLLWVPYIKEQFLLKYPNGVNPVNSNDVFFSLHAVVLTLIIIVQCCLYERGGQRVSWPAIGFLVLAWLFAFVTMIVAAVGVTTWLQFLFCFSYIKLAVTLVKYFPQAYMNFYYKSTEGWSIGNVLLDFTGGSFSLLQMFLQSYNNDQWTLIFGDPTKFGLGVFSIVFDVVFFIQHFCLYRKRPGYDQLN,mutated_sequence,1.0,367.0,NP_004928.2.a2m,NP_004928.2.npy,ClinVar
+NP_004950.2,NP_004950.2.csv,MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKAQIRANGFKLETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGLAAGPPAGPLGDFGAPALPMAVPGAHGPLAGYLYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPFSGGPNVPELILQLLQLEPDEDQVRARILGCLQEPTKSRPDQPAAFGLLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLALQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQT,mutated_sequence,1.0,461.0,NP_004950.2.a2m,NP_004950.2.npy,ClinVar
+NP_004961.1,NP_004961.1.csv,MALRKGGLALALLLLSWVALGPRSLEGADPGTPGEAEGPACPAACVCSYDDDADELSVFCSSRNLTRLPDGVPGGTQALWLDGNNLSSVPPAAFQNLSSLGFLNLQGGQLGSLEPQALLGLENLCHLHLERNQLRSLALGTFAHTPALASLGLSNNRLSRLEDGLFEGLGSLWDLNLGWNSLAVLPDAAFRGLGSLRELVLAGNRLAYLQPALFSGLAELRELDLSRNALRAIKANVFVQLPRLQKLYLDRNLIAAVAPGAFLGLKALRWLDLSHNRVAGLLEDTFPGLLGLRVLRLSHNAIASLRPRTFKDLHFLEELQLGHNRIRQLAERSFEGLGQLEVLTLDHNQLQEVKAGAFLGLTNVAVMNLSGNCLRNLPEQVFRGLGKLHSLHLEGSCLGRIRPHTFTGLSGLRRLFLKDNGLVGIEEQSLWGLAELLELDLTSNQLTHLPHRLFQGLGKLEYLLLSRNRLAELPADALGPLQRAFWLDVSHNRLEALPNSLLAPLGRLRYLSLRNNSLRTFTPQPPGLERLWLEGNPWDCGCPLKALRDFALQNPSAVPRFVQAICEGDDCQPPAYTYNNITCASPPEVVGLDLRDLSEAHFAPC,mutated_sequence,1.0,605.0,NP_004961.1.a2m,NP_004961.1.npy,ClinVar
+NP_004965.1,NP_004965.1.csv,MTVATGDPADEAAALPGHPQDTYDPEADHECCERVVINISGLRFETQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDIFSEEIRFYELGEEAMEMFREDEGYIKEEERPLPENEFQRQVWLLFEYPESSGPARIIAIVSVMVILISIVSFCLETLPIFRDENEDMHGSGVTFHTYSNSTIGYQQSTSFTDPFFIVETLCIIWFSFEFLVRFFACPSKAGFFTNIMNIIDIVAIIPYFITLGTELAEKPEDAQQGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEADERESQFPSIPDAFWWAVVSMTTVGYGDMVPTTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQYLQVTSCPKIPSSPDLKKSRSASTISKSDYMEIQEGVNNSNEDFREENLKTANCTLANTNYVNITKMLTDV,mutated_sequence,1.0,499.0,NP_004965.1.a2m,NP_004965.1.npy,ClinVar
+NP_004975.2,NP_004975.2.csv,MAETNNECSIKVLCRFRPLNQAEILRGDKFIPIFQGDDSVVIGGKPYVFDRVFPPNTTQEQVYHACAMQIVKDVLAGYNGTIFAYGQTSSGKTHTMEGKLHDPQLMGIIPRIARDIFNHIYSMDENLEFHIKVSYFEIYLDKIRDLLDVTKTNLSVHEDKNRVPFVKGCTERFVSSPEEILDVIDEGKSNRHVAVTNMNEHSSRSHSIFLINIKQENMETEQKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGTKSYVPYRDSKMTRILQDSLGGNCRTTMFICCSPSSYNDAETKSTLMFGQRAKTIKNTASVNLELTAEQWKKKYEKEKEKTKAQKETIAKLEAELSRWRNGENVPETERLAGEEAALGAELCEETPVNDNSSIVVRIAPEERQKYEEEIRRLYKQLDDKDDEINQQSQLIEKLKQQMLDQEELLVSTRGDNEKVQRELSHLQSENDAAKDEVKEVLQALEELAVNYDQKSQEVEEKSQQNQLLVDELSQKVATMLSLESELQRLQEVSGHQRKRIAEVLNGLMKDLSEFSVIVGNGEIKLPVEISGAIEEEFTVARLYISKIKSEVKSVVKRCRQLENLQVECHRKMEVTGRELSSCQLLISQHEAKIRSLTEYMQSVELKKRHLEESYDSLSDELAKLQAQETVHEVALKDKEPDTQDADEVKKALELQMESHREAHHRQLARLRDEINEKQKTIDELKDLNQKLQLELEKLQADYEKLKSEEHEKSTKLQELTFLYERHEQSKQDLKGLEETVARELQTLHNLRKLFVQDVTTRVKKSAEMEPEDSGGIHSQKQKISFLENNLEQLTKVHKQLVRDNADLRCELPKLEKRLRATAERVKALEGALKEAKEGAMKDKRRYQQEVDRIKEAVRYKSSGKRGHSAQIAKPVRPGHYPASSPTNPYGTRSPECISYTNSLFQNYQNLYLQATPSSTSDMYFANSCTSSGATSSGGPLASYQKANMDNGNATDINDNRSDLPCGYEAEDQAKLFPLHQETAAS,mutated_sequence,1.0,1032.0,NP_004975.2.a2m,NP_004975.2.npy,ClinVar
+NP_004976.2,NP_004976.2.csv,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKRVKDSEDVPMVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQGVDDAFYTLVREIRKHKEKMSKDGKKKKKKSKTKCVIM,mutated_sequence,1.0,188.0,NP_004976.2.a2m,NP_004976.2.npy,ClinVar
+NP_004981.2,NP_004981.2.csv,MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCVVPFLTRPKVPVLQLDSGNYLFSTSAICRYFFLLSGWEQDDLTNQWLEWEATELQPALSAALYYLVVQGKKGEDVLGSVRRALTHIDHSLSRQNCPFLAGETESLADIVLWGALYPLLQDPAYLPEELSALHSWFQTLSTQEPCQRAAETVLKQQGVLALRPYLQKQPQPSPAEGRAVTNEPEEEELATLSEEEIAMAVTAWEKGLESLPPLRPQQNPVLPVAGERNVLITSALPYVNNVPHLGNIIGCVLSADVFARYSRLRQWNTLYLCGTDEYGTATETKALEEGLTPQEICDKYHIIHADIYRWFNISFDIFGRTTTPQQTKITQDIFQQLLKRGFVLQDTVEQLRCEHCARFLADRFVEGVCPFCGYEEARGDQCDKCGKLINAVELKKPQCKVCRSCPVVQSSQHLFLDLPKLEKRLEEWLGRTLPGSDWTPNAQFITRSWLRDGLKPRCITRDLKWGTPVPLEGFEDKVFYVWFDATIGYLSITANYTDQWERWWKNPEQVDLYQFMAKDNVPFHSLVFPCSALGAEDNYTLVSHLIATEYLNYEDGKFSKSRGVGVFGDMAQDTGIPADIWRFYLLYIRPEGQDSAFSWTDLLLKNNSELLNNLGNFINRAGMFVSKFFGGYVPEMVLTPDDQRLLAHVTLELQHYHQLLEKVRIRDALRSILTISRHGNQYIQVNEPWKRIKGSEADRQRAGTVTGLAVNIAALLSVMLQPYMPTVSATIQAQLQLPPPACSILLTNFLCTLPAGHQIGTVSPLFQKLENDQIESLRQRFGGGQAKTSPKPAVVETVTTAKPQQIQALMDEVTKQGNIVRELKAQKADKNEVAAEVAKLLDLKKQLAVAEGKPPEAPKGKKKK,mutated_sequence,1.0,900.0,NP_004981.2.a2m,NP_004981.2.npy,ClinVar
+NP_004997.4,NP_004997.4.csv,MLRIPVRKALVGLSKSPKGCVRTTATAASNLIEVFVDGQSVMVEPGTTVLQACEKVGMQIPRFCYHERLSVAGNCRMCLVEIEKAPKVVAACAMPVMKGWNILTNSEKSKKAREGVMEFLLANHPLDCPICDQGGECDLQDQSMMFGNDRSRFLEGKRAVEDKNIGPLVKTIMTRCIQCTRCIRFASEIAGVDDLGTTGRGNDMQVGTYIEKMFMSELSGNIIDICPVGALTSKPYAFTARPWETRKTESIDVMDAVGSNIVVSTRTGEVMRILPRMHEDINEEWISDKTRFAYDGLKRQRLTEPMVRNEKGLLTYTSWEDALSRVAGMLQSFQGKDVAAIAGGLVDAEALVALKDLLNRVDSDTLCTEEVFPTAGAGTDLRSNYLLNTTIAGVEEADVVLLVGTNPRFEAPLFNARIRKSWLHNDLKVALIGSPVDLTYTYDHLGDSPKILQDIASGSHPFSQVLKEAKKPMVVLGSSALQRNDGAAILAAVSSIAQKIRMTSGVTGDWKVMNILHRIASQVAALDLGYKPGVEAIRKNPPKVLFLLGADGGCITRQDLPKDCFIIYQGHHGDVGAPIADVILPGAAYTEKSATYVNTEGRAQQTKVAVTPPGLAREDWKIIRALSEIAGMTLPYDTLDQVRNRLEEVSPNLVRYDDIEGANYFQQANELSKLVNQQLLADPLVPPQLTIKDFYMTDSISRASQTMAKCVKAVTEGAQAVEEPSIC,mutated_sequence,1.0,727.0,NP_004997.4.a2m,NP_004997.4.npy,ClinVar
+NP_005043.1,NP_005043.1.csv,MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASFENVRAKWYPEVRHHCPHTPILLVGTKLDLRDDKDTIERLRDKKLAPITYPQGLAMAREIGSVKYLECSALTQRGLKTVFDEAIRAVLCPPPVKKPGKKCTVF,mutated_sequence,1.0,192.0,NP_005043.1.a2m,NP_005043.1.npy,ClinVar
+NP_005046.2,NP_005046.2.csv,MGQDQTKQQIEKGLQLYQSNQTEKALQVWTKVLEKSSDLMGRFRVLGCLVTAHSEMGRYKEMLKFAVVQIDTARELEDADFLLESYLNLARSNEKLCEFHKTISYCKTCLGLPGTRAGAQLGGQVSLSMGNAFLGLSVFQKALESFEKALRYAHNNDDAMLECRVCCSLGSFYAQVKDYEKALFFPCKAAELVNNYGKGWSLKYRAMSQYHMAVAYRLLGRLGSAMECCEESMKIALQHGDRPLQALCLLCFADIHRSRGDLETAFPRYDSAMSIMTEIGNRLGQVQALLGVAKCWVARKALDKALDAIERAQDLAEEVGNKLSQLKLHCLSESIYRSKGLQRELRAHVVRFHECVEETELYCGLCGESIGEKNSRLQALPCSHIFHLRCLQNNGTRSCPNCRRSSMKPGFV,mutated_sequence,1.0,412.0,NP_005046.2.a2m,NP_005046.2.npy,ClinVar
+NP_005145.3,NP_005145.3.csv,MPAVASVPKELYLSSSLKDLNKKTEVKPEKISTKSYVHSALKIFKTAEECRLDRDEERAYVLYMKYVTVYNLIKKRPDFKQQQDYFHSILGPGNIKKAVEEAERLSESLKLRYEEAEVRKKLEEKDRQEEAQRLQQKRQETGREDGGTLAKGSLENVLDSKDKTQKSNGEKNEKCETKEKGAITAKELYTMMTDKNISLIIMDARRMQDYQDSCILHSLSVPEEAISPGVTASWIEAHLPDDSKDTWKKRGNVEYVVLLDWFSSAKDLQIGTTLRSLKDALFKWESKTVLRNEPLVLEGGYENWLLCYPQYTTNAKVTPPPRRQNEEVSISLDFTYPSLEESIPSKPAAQTPPASIEVDENIELISGQNERMGPLNISTPVEPVAASKSDVSPIIQPVPSIKNVPQIDRTKKPAVKLPEEHRIKSESTNHEQQSPQSGKVIPDRSTKPVVFSPTLMLTDEEKARIHAETALLMEKNKQEKELRERQQEEQKEKLRKEEQEQKAKKKQEAEENEITEKQQKAKEEMEKKESEQAKKEDKETSAKRGKEITGVKRQSKSEHETSDAKKSVEDRGKRCPTPEIQKKSTGDVPHTSVTGDSGSGKPFKIKGQPESGILRTGTFREDTDDTERNKAQREPLTRARSEEMGRIVPGLPSGWAKFLDPITGTFRYYHSPTNTVHMYPPEMAPSSAPPSTPPTHKAKPQIPAERDREPSKLKRSYSSPDITQAIQEEEKRKPTVTPTVNRENKPTCYPKAEISRLSASQIRNLNPVFGGSGPALTGLRNLGNTCYMNSILQCLCNAPHLADYFNRNCYQDDINRSNLLGHKGEVAEEFGIIMKALWTGQYRYISPKDFKITIGKINDQFAGYSQQDSQELLLFLMDGLHEDLNKADNRKRYKEENNDHLDDFKAAEHAWQKHKQLNESIIVALFQGQFKSTVQCLTCHKKSRTFEAFMYLSLPLASTSKCTLQDCLRLFSKEEKLTDNNRFYCSHCRARRDSLKKIEIWKLPPVLLVHLKRFSYDGRWKQKLQTSVDFPLENLDLSQYVIGPKNNLKKYNLFSVSNHYGGLDGGHYTAYCKNAARQRWFKFDDHEVSDISVSSVKSSAAYILFYTSLGPRVTDVAT,mutated_sequence,1.0,1118.0,NP_005145.3.a2m,NP_005145.3.npy,ClinVar
+NP_005148.2,NP_005148.2.csv,MLEICLKLVGCKSKKGLSSSSSCYLEEALQRPVASDFEPQGLSEAARWNSKENLLAGPSENDPNLFVALYDFVASGDNTLSITKGEKLRVLGYNHNGEWCEAQTKNGQGWVPSNYITPVNSLEKHSWYHGPVSRNAAEYLLSSGINGSFLVRESESSPGQRSISLRYEGRVYHYRINTASDGKLYVSSESRFNTLAELVHHHSTVADGLITTLHYPAPKRNKPTVYGVSPNYDKWEMERTDITMKHKLGGGQYGEVYEGVWKKYSLTVAVKTLKEDTMEVEEFLKEAAVMKEIKHPNLVQLLGVCTREPPFYIITEFMTYGNLLDYLRECNRQEVNAVVLLYMATQISSAMEYLEKKNFIHRDLAARNCLVGENHLVKVADFGLSRLMTGDTYTAHAGAKFPIKWTAPESLAYNKFSIKSDVWAFGVLLWEIATYGMSPYPGIDLSQVYELLEKDYRMERPEGCPEKVYELMRACWQWNPSDRPSFAEIHQAFETMFQESSISDEVEKELGKQGVRGAVSTLLQAPELPTKTRTSRRAAEHRDTTDVPEMPHSKGQGESDPLDHEPAVSPLLPRKERGPPEGGLNEDERLLPKDKKTNLFSALIKKKKKTAPTPPKRSSSFREMDGQPERRGAGEEEGRDISNGALAFTPLDTADPAKSPKPSNGAGVPNGALRESGGSGFRSPHLWKKSSTLTSSRLATGEEEGGGSSSKRFLRSCSASCVPHGAKDTEWRSVTLPRDLQSTGRQFDSSTFGGHKSEKPALPRKRAGENRSDQVTRGTVTPPPRLVKKNEEAADEVFKDIMESSPGSSPPNLTPKPLRRQVTVAPASGLPHKEEAGKGSALGTPAAAEPVTPTSKAGSGAPGGTSKGPAEESRVRRHKHSSESPGRDKGKLSRLKPAPPPPPAASAGKAGGKPSQSPSQEAAGEAVLGAKTKATSLVDAVNSDAAKPSQPGEGLKKPVLPATPKPQSAKPSGTPISPAPVPSTLPSASSALAGDQPSSTAFIPLISTRVSLRKTRQPPERIASGAITKGVVLDSTEALCLAISRNSEQMASHSAVLEAGKNLYTFCVSYVDSIQQMRNKFAFREAINKLENNLRELQICPATAGSGPAATQDFSKLLSSVKEISDIVQR,mutated_sequence,1.0,1130.0,NP_005148.2.a2m,NP_005148.2.npy,ClinVar
+NP_005150.1,NP_005150.1.csv,MCDDEETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDEAGPSIVHRKCF,mutated_sequence,1.0,377.0,NP_005150.1.a2m,NP_005150.1.npy,ClinVar
+NP_005177.2,NP_005177.2.csv,MSEEIITPVYCTGVSAQVQKQRARELGLGRHENAIKYLGQDYEQLRVRCLQSGTLFRDEAFPPVPQSLGYKDLGPNSSKTYGIKWKRPTELLSNPQFIVDGATRTDICQGALGDCWLLAAIASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQLWQFGEWVDVVVDDLLPIKDGKLVFVHSAEGNEFWSALLEKAYAKVNGSYEALSGGSTSEGFEDFTGGVTEWYELRKAPSDLYQIILKALERGSLLGCSIDISSVLDMEAITFKKLVKGHAYSVTGAKQVNYRGQVVSLIRMRNPWGEVEWTGAWSDSSSEWNNVDPYERDQLRVKMEDGEFWMSFRDFMREFTRLEICNLTPDALKSRTIRKWNTTLYEGTWRRGSTAGGCRNYPATFWVNPQFKIRLDETDDPDDYGDRESGCSFVLALMQKHRRRERRFGRDMETIGFAVYEVPPELVGQPAVHLKRDFFLANASRARSEQFINLREVSTRFRLPPGEYVVVPSTFEPNKEGDFVLRFFSEKSAGTVELDDQIQANLPDEQVLSEEEIDENFKALFRQLAGEDMEISVKELRTILNRIISKHKDLRTKGFSLESCRSMVNLMDRDGNGKLGLVEFNILWNRIRNYLSIFRKFDLDKSGSMSAYEMRMAIESAGFKLNKKLYELIITRYSEPDLAVDFDNFVCCLVRLETMFRFFKTLDTDLDGVVTFDLFKWLQLTMFA,mutated_sequence,1.0,714.0,NP_005177.2.a2m,NP_005177.2.npy,ClinVar
+NP_005179.2,NP_005179.2.csv,MAGNVKKSSGAGGGSGSGGSGSGGLIGLMKDAFQPHHHHHHHLSPHPPGTVDKKMVEKCWKLMDKVVRLCQNPKLALKNSPPYILDLLPDTYQHLRTILSRYEGKMETLGENEYFRVFMENLMKKTKQTISLFKEGKERMYEENSQPRRNLTKLSLIFSHMLAELKGIFPSGLFQGDTFRITKADAAEFWRKAFGEKTIVPWKSFRQALHEVHPISSGLEAMALKSTIDLTCNDYISVFEFDIFTRLFQPWSSLLRNWNSLAVTHPGYMAFLTYDEVKARLQKFIHKPGSYIFRLSCTRLGQWAIGYVTADGNILQTIPHNKPLFQALIDGFREGFYLFPDGRNQNPDLTGLCEPTPQDHIKVTQEQYELYCEMGSTFQLCKICAENDKDVKIEPCGHLMCTSCLTSWQESEGQGCPFCRCEIKGTEPIVVDPFDPRGSGSLLRQGAEGAPSPNYDDDDDERADDTLFMMKELAGAKVERPPSPFSMAPQASLPPVPPRLDLLPQRVCVPSSASALGTASKAASGSLHKDKPLPVPPTLRDLPPPPPPDRPYSVGAESRPQRRPLPCTPGDCPSRDKLPPVPSSRLGDSWLPRPIPKVPVSAPSSSDPWTGRELTNRHSLPFSLPSQMEPRPDVPRLGSTFSLDTSMSMNSSPLVGPECDHPKIKPSSSANAIYSLAARPLPVPKLPPGEQCEGEEDTEYMTPSSRPLRPLDTSQSSRACDCDQQIDSCTYEAMYNIQSQAPSITESSTFGEGNLAAAHANTGPEESENEDDGYDVPKPPVPAVLARRTLSDISNASSSFGWLSLDGDPTTNVTEGSQVPERPPKPFPRRINSERKAGSCQQGSGPAASAATASPQLSSEIENLMSQGYSYQDIQKALVIAQNNIEMAKNILREFVSISSPAHVAT,mutated_sequence,1.0,906.0,NP_005179.2.a2m,NP_005179.2.npy,ClinVar
+NP_005190.4,NP_005190.4.csv,MHGGQGPLLLLLLLAVCLGAQGRNQEERLLADLMQNYDPNLRPAERDSDVVNVSLKLTLTNLISLNEREEALTTNVWIEMQWCDYRLRWDPRDYEGLWVLRVPSTMVWRPDIVLENNVDGVFEVALYCNVLVSPDGCIYWLPPAIFRSACSISVTYFPFDWQNCSLIFQSQTYSTNEIDLQLSQEDGQTIEWIFIDPEAFTENGEWAIQHRPAKMLLDPAAPAQEAGHQKVVFYLLIQRKPLFYVINIIAPCVLISSVAILIHFLPAKAGGQKCTVAINVLLAQTVFLFLVAKKVPETSQAVPLISKYLTFLLVVTILIVVNAVVVLNVSLRSPHTHSMARGVRKVFLRLLPQLLRMHVRPLAPAAVQDTQSRLQNGSSGWSITTGEEVALCLPRSELLFQQWQRQGLVAAALEKLEKGPELGLSQFCGSLKQAAPAIQACVEACNLIACARHQQSHFDNGNEEWFLVGRVLDRVCFLAMLSLFICGTAGIFLMAHYNRVPALPFPGDPRPYLPSPD,mutated_sequence,1.0,517.0,NP_005190.4.a2m,NP_005190.4.npy,ClinVar
+NP_005199.2,NP_005199.2.csv,METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ,mutated_sequence,1.0,215.0,NP_005199.2.a2m,NP_005199.2.npy,ClinVar
+NP_005219.2,NP_005219.2.csv,MRPSGTAGAALLALLAALCPASRALEEKKVCQGTSNKLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYDANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICAQQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGPDNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDYVREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVIQGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSSPSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSSDPTGALTEDSIDDTFLPVPEYINQSVPKRPAGSVQNPVYHNQPLNPAPSRDPHYQDPHSTAVGNPEYLNTVQPTCVNSTFDSPAHWAQKGSHQISLDNPDYQQDFFPKEAKPNGIFKGSTAENAEYLRVAPQSSEFIGA,mutated_sequence,1.0,1210.0,NP_005219.2.a2m,NP_005219.2.npy,ClinVar
+NP_005240.3,NP_005240.3.csv,MLDMGDRKEVKMIPKSSFSINSLVPEAVQNDNHHASHGHHNSHHPQHHHHHHHHHHHPPPPAPQPPPPPQQQQPPPPPPPAPQPPQTRGAPAADDDKGPQQLLLPPPPPPPPAAALDGAKADGLGGKGEPGGGPGELAPVGPDEKEKGAGAGGEEKKGAGEGGKDGEGGKEGEKKNGKYEKPPFSYNALIMMAIRQSPEKRLTLNGIYEFIMKNFPYYRENKQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFIGGTTGKLRRRSTTSRAKLAFKRGARLTSTGLTFMDRAGSLYWPMSPFLSLHHPRASSTLSYNGTTSAYPSHPMPYSSVLTQNSLGNNHSFSTANGLSVDRLVNGEIPYATHHLTAAALAASVPCGLSVPCSGTYSLNPCSVNLLAGQTSYFFPHVPHPSMTSQSSTSMSARAASSSTSPQAPSTLPCESLRPSLPSFTTGLSGGLSDYFTHQNQGSSSNPLIH,mutated_sequence,1.0,489.0,NP_005240.3.a2m,NP_005240.3.npy,ClinVar
+NP_005324.3,NP_005324.3.csv,MGLSPSAPAVAVQASNASASPPSGCPMHEGKMKGCPVNTEPSGPTCEKKTYSVPAHQERAYEYVECPIRGTAAENKENLDPSNLMPPPNQTPAPDQPFALSTVREESSIPRADSEKKWVYPSEQMFWNAMLKKGWKWKDEDISQKDMYNIIRIHNQNNEQAWKEILKWEALHAAECPCGPSLIRFGGKAKEYSPRARIRSWMGYELPFDRHDWIINRCGTEVRYVIDYYDGGEVNKDYQFTILDVRPALDSLSAVWDRMKVAWWRWTS,mutated_sequence,1.0,268.0,NP_005324.3.a2m,NP_005324.3.npy,ClinVar
+NP_005334.1,NP_005334.1.csv,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKCDLAARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQHKLRKLNPPDESGPGCMSCKCVLS,mutated_sequence,1.0,189.0,NP_005334.1.a2m,NP_005334.1.npy,ClinVar
+NP_005339.3,NP_005339.3.csv,MPEETQTQDQPMEEEEVETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNSSDALDKIRYESLTDPSKLDSGKELHINLIPNKQDRTLTIVDTGIGMTKADLINNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKVTVITKHNDDEQYAWESSAGGSFTVRTDTGEPMGRGTKVILHLKEDQTEYLEERRIKEIVKKHSQFIGYPITLFVEKERDKEVSDDEAEEKEDKEEEKEKEEKESEDKPEIEDVGSDEEEEKKDGDKKKKKKIKEKYIDQEELNKTKPIWTRNPDDITNEEYGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFVPRRAPFDLFENRKKKNNIKLYVRRVFIMDNCEELIPEYLNFIRGVVDSEDLPLNISREMLQQSKILKVIRKNLVKKCLELFTELAEDKENYKKFYEQFSKNIKLGIHEDSQNRKKLSELLRYYTSASGDEMVSLKDYCTRMKENQKHIYYITGETKDQVANSAFVERLRKHGLEVIYMIEPIDEYCVQQLKEFEGKTLVSVTKEGLELPEDEEEKKKQEEKKTKFENLCKIMKDILEKKVEKVVVSNRLVTSPCCIVTSTYGWTANMERIMKAQALRDNSTMGYMAAKKHLEINPDHSIIETLRQKAEADKNDKSVKDLVILLYETALLSSGFSLEDPQTHANRIYRMIKLGLGIDEDDPTADDTSAAVTEEMPPLEGDDDTSRMEEVD,mutated_sequence,1.0,732.0,NP_005339.3.a2m,NP_005339.3.npy,ClinVar
+NP_005350.1,NP_005350.1.csv,MDNMSITNTPTSNDACLSIVHSLMCHRQGGESETFAKRAIESLVKKLKEKKDELDSLITAITTNGAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARLWRWPDLHKNELKHVKYCQYAFDLKCDSVCVNPYHYERVVSPGIDLSGLTLQSNAPSSMMVKDEYVHDFEGQPSLSTEGHSIQTIQHPPSNRASTETYSTPALLAPSESNATSTANFPNIPVASTSQPASILGGSHSEGLLQIASGPQPGQQQNGFTGQPATYHHNSTTTWTGSRTAPYTPNLPHHQNGHLQHHPPMPPHPGHYWPVHNELAFQPPISNHPAPEYWCSIAYFEMDVQVGETFKVPSSCPIVTVDGYVDPSGGDRFCLGQLSNVHRTEAIERARLHIGKGVQLECKGEGDVWVRCLSDHAVFVQSYYLDREAGRAPGDAVHKIYPSAYIKVFDLRQCHRQMQQQAATAQAAAAAQAAAVAGNIPGPGSVGGIAPAISLSAAAGIGVDDLRRLCILRMSFVKGWGPDYPRQSIKETPCWIEIHLHRALQLLDEVLHTMPIADPQPLD,mutated_sequence,1.0,552.0,NP_005350.1.a2m,NP_005350.1.npy,ClinVar
+NP_005369.2,NP_005369.2.csv,MPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENELWGSPAEEDAFGLGGLGGLTPNPVILQDCMWSGFSAREKLERAVSEKLQHGRGPPTAGSTAQSPGAGAASPAGRGHGGAAGAGRAGAALPAELAHPAAECVDPAVVFPFPVNKREPAPVPAAPASAPAAGPAVASGAGIAAPAGAPGVAPPRPGGRQTSGGDHKALSTSGEDTLSDSDDEDDEEEDEEEEIDVVTVEKRRSSSNTKAVTTFTITVRPKNAALGPGRAQSSELILKRCLPIHQQHNYAAPSPYVESEDAPPQKKIKSEASPRPLKSVIPPKAKSLSPRNSDSEDSERRRNHNILERQRRNDLRSSFLTLRDHVPELVKNEKAAKVVILKKATEYVHSLQAEEHQLLLEKEKLQARQQQLLKKIEHARTC,mutated_sequence,1.0,464.0,NP_005369.2.a2m,NP_005369.2.npy,ClinVar
+NP_005436.1,NP_005436.1.csv,MYIKQVIIQGFRSYRDQTIVDPFSSKHNVIVGRNGSGKSNFFYAIQFVLSDEFSHLRPEQRLALLHEGTGPRVISAFVEIIFDNSDNRLPIDKEEVSLRRVIGAKKDQYFLDKKMVTKNDVMNLLESAGFSRSNPYYIVKQGKINQMATAPDSQRLKLLREVAGTRVYDERKEESISLMKETEGKREKINELLKYIEERLHTLEEEKEELAQYQKWDKMRRALEYTIYNQELNETRAKLDELSAKRETSGEKSRQLRDAQQDARDKMEDIERQVRELKTKISAMKEEKEQLSAERQEQIKQRTKLELKAKDLQDELAGNSEQRKRLLKERQKLLEKIEEKQKELAETEPKFNSVKEKEERGIARLAQATQERTDLYAKQGRGSQFTSKEERDKWIKKELKSLDQAINDKKRQIAAIHKDLEDTEANKEKNLEQYNKLDQDLNEVKARVEELDRKYYEVKNKKDELQSERNYLWREENAEQQALAAKREDLEKKQQLLRAATGKAILNGIDSINKVLDHFRRKGINQHVQNGYHGIVMNNFECEPAFYTCVEVTAGNRLFYHIVDSDEVSTKILMEFNKMNLPGEVTFLPLNKLDVRDTAYPETNDAIPMISKLRYNPRFDKAFKHVFGKTLICRSMEVSTQLARAFTMDCITLEGDQVSHRGALTGGYYDTRKSRLELQKDVRKAEEELGELEAKLNENLRRNIERINNEIDQLMNQMQQIETQQRKFKASRDSILSEMKMLKEKRQQSEKTFMPKQRSLQSLEASLHAMESTRESLKAELGTDLLSQLSLEDQKRVDALNDEIRQLQQENRQLLNERIKLEGIITRVETYLNENLRKRLDQVEQELNELRETEGGTVLTATTSELEAINKRVKDTMARSEDLDNSIDKTEAGIKELQKSMERWKNMEKEHMDAINHDTKELEKMTNRQGMLLKKKEECMKKIRELGSLPQEAFEKYQTLSLKQLFRKLEQCNTELKKYSHVNKKALDQFVNFSEQKEKLIKRQEELDRGYKSIMELMNVLELRKYEAIQLTFKQVSKNFSEVFQKLVPGGKATLVMKKGDVEGSQSQDEGEGSGESERGSGSQSSVPSVDQFTGVGIRVSFTGKQGEMREMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDQALDAQHRKAVSDMIMELAVHAQFITTTFRPELLESADKFYGVKFRNKVSHIDVITAEMAKDFVEDDTTHG,mutated_sequence,1.0,1217.0,NP_005436.1.a2m,NP_005436.1.npy,ClinVar
+NP_005467.1,NP_005467.1.csv,MEKNGNNRKLRVCVATCNRADYSKLAPIMFGIKTEPEFFELDVVVLGSHLIDDYGNTYRMIEQDDFDINTRLHTIVRGEDEAAMVESVGLALVKLPDVLNRLKPDIMIVHGDRFDALALATSAALMNIRILHIEGGEVSGTIDDSIRHAITKLAHYHVCCTRSAEQHLISMCEDHDRILLAGCPSYDKLLSAKNKDYMSIIRMWLGDDVKSKDYIVALQHPVTTDIKHSIKMFELTLDALISFNKRTLVLFPNIDAGSKEMVRVMRKKGIEHHPNFRAVKHVPFDQFIQLVAHAGCMIGNSSCGVREVGAFGTPVINLGTRQIGRETGENVLHVRDADTQDKILQALHLQFGKQYPCSKIYGDGNAVPRILKFLKSIDLQEPLQKKFCFPPVKENISQDIDHILETLSALAVDLGGTNLRVAIVSMKGEIVKKYTQFNPKTYEERINLILQMCVEAAAEAVKLNCRILGVGISTGGRVNPREGIVLHSTKLIQEWNSVDLRTPLSDTLHLPVWVDNDGNCAALAERKFGQGKGLENFVTLITGTGIGGGIIHQHELIHGSSFCAAELGHLVVSLDGPDCSCGSHGCIEAYASGMALQREAKKLHDEDLLLVEGMSVPKDEAVGALHLIQAAKLGNAKAQSILRTAGTALGLGVVNILHTMNPSLVILSGVLASHYIHIVKDVIRQQALSSVQDVDVVVSDLVDPALLGAASMVLDYTTRRIY,mutated_sequence,1.0,722.0,NP_005467.1.a2m,NP_005467.1.npy,ClinVar
+NP_005493.2,NP_005493.2.csv,MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRLSYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGVVGNFNKSIVARLFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELSPKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGSVYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDVVEQAIIRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLVVILKLGNLLPYSDPSVVFVFLSVFAVVTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFASLLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALPPSQEVNDAIKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTKNNVKVWFNNKGWHAISSFLNVINNAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFVVFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYVVPATLVIIIFICFQQKSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVATFVLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEGVVFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPVVFLDEPTTGMDPKARRFLWNCALSVVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIAGSNPDLKPVQDFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV,mutated_sequence,1.0,2261.0,NP_005493.2.a2m,NP_005493.2.npy,ClinVar
+NP_005525.2,NP_005525.2.csv,MRPTLLWSLLLLLGVFAAAAAAPPDPLSQLPAPQHPKIRLYNAEQVLSWEPVALSNSTRPVVYQVQFKYTDSKWFTADIMSIGVNCTQITATECDFTAASPSAGFPMDFNVTLRLRAELGALHSAWVTMPWFQHYRNVTVGPPENIEVTPGEGSLIIRFSSPFDIADTSTAFFCYYVHYWEKGGIQQVKGPFRSNSISLDNLKPSRVYCLQVQAQLLWNKSNIFRVGHLSNISCYETMADASTELQQVILISVGTFSLLSVLAGACFFLVLKYRGLIKYWFHTPPSIPLQIEEYLKDPTQPILEALDKDSSPKDDVWDSVSIISFPEKEQEDVLQTL,mutated_sequence,1.0,337.0,NP_005525.2.a2m,NP_005525.2.npy,ClinVar
+NP_005526.1,NP_005526.1.csv,MEPLVTWVVPLLFLFLLSRQGAACRTSECCFQDPPYPDADSGSASGPRDLRCYRISSDRYECSWQYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQAGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSVKYEPPLGDIKVSKLAGQLRMEWETPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQGSSWSKWSSPVCVPPENPPQPQVRFSVEQLGQDGRRRLTLKEQPTQLELPEGCQGLAPGTEVTYRLQLHMLSCPCKAKATRTLHLGKMPYLSGAAYNVAVISSNQFGPGLNQTWHIPADTHTEPVALNISVGTNGTTMYWPARAQSMTYCIEWQPVGQDGGLATCSLTAPQDPDPAGMATYSWSRESGAMGQEKCYYITIFASAHPEKLTLWSTVLSTYHFGGNASAAGTPHHVSVKNHSLDSVSVDWAPSLLSTCPGVLKEYVVRCRDEDSKQVSEHPVQPTETQVTLSGLRAGVAYTVQVRADTAWLRGVWSQPQRFSIEVQVSDWLIFFASLGSFLSILLVGVLGYLGLNRAARHLCPPLPTPCASSAIEFPGGKETWQWINPVDFQEEASLQEALVVEMSWDKGERTEPLEKTELPEGAPELALDTELSLEDGDRCKAKM,mutated_sequence,1.0,662.0,NP_005526.1.a2m,NP_005526.1.npy,ClinVar
+NP_005535.1,NP_005535.1.csv,MASPPESDGFSDVRKVGYLRKPKSMHKRFFVLRAASEAGGPARLEYYENEKKWRHKSSAPKRSIPLESCFNINKRADSKNKHLVALYTRDEHFAIAADSEAEQDSWYQALLQLHNRAKGHHDGAAALGAGGGGGSCSGSSGLGEAGEDLSYGDVPPGPAFKEVWQVILKPKGLGQTKNLIGIYRLCLTSKTISFVKLNSEAAAVVLQLMNIRRCGHSENFFFIEVGRSAVTGPGEFWMQVDDSVVAQNMHETILEAMRAMSDEFRPRSKSQSSSNCSNPISVPLRRHHLNNPPPSQVGLTRRSRTESITATSPASMVGGKPGSFRVRASSDGEGTMSRPASVDGSPVSPSTNRTHAHRHRGSARLHPPLNHSRSIPMPASRCSPSATSPVSLSSSSTSGHGSTSDCLFPRRSSASVSGSPSDGGFISSDEYGSSPCDFRSSFRSVTPDSLGHTPPARGEEELSNYICMGGKGPSTLTAPNGHYILSRGGNGHRCTPGTGLGTSPALAGDEAASAADLDNRFRKRTHSAGTSPTITHQKTPSQSSVASIEEYTEMMPAYPPGGGSGGRLPGHRHSAFVPTRSYPEEGLEMHPLERRGGHHRPDSSTLHTDDGYMPMSPGVAPVPSGRKGSGDYMPMSPKSVSAPQQIINPIRRHPQRVDPNGYMMMSPSGGCSPDIGGGPSSSSSSSNAVPSGTSYGKLWTNGVGGHHSHVLPHPKPPVESSGGKLLPCTGDYMNMSPVGDSNTSSPSDCYYGPEDPQHKPVLSYYSLPRSFKHTQRPGEPEEGARHQHLRLSTSSGRLLYAATADDSSSSTSSDSLGGGYCGARLEPSLPHPHHQVLQPHLPRKVDTAAQTNSRLARPTRLSLGDPKASTLPRAREQQQQQQPLLHPPEPKSPGEYVNIEFGSDQSGYLSGPVAFHSSPSVRCPSQLQPAPREEETGTEEYMKMDLGPGRRAAWQESTGVEMGRLGPAPPGAASICRPTRAVPSSRGDYMTMQMSCPRQSYVDTSPAAPVSYADMRTGIAAEEVSLPRATMAAASSSSAASASPTGPQGAAELAAHSSLLGGPQGPGGMSAFTRVNLSPNRNQSAKVIRADPQGCRRRHSSETFSSTPSATRVGNTVPFGAGAAVGGGGGSSSSSEDVKRHSSASFENVWLRPGELGGAPKEPAKLCGAAGGLENGLNYIDLDLVKDFKQCPQECTPEPQPPPPPPPHQPLGSGESSSTRRSSEDLSAYASISFQKQPEDRQ,mutated_sequence,1.0,1242.0,NP_005535.1.a2m,NP_005535.1.npy,ClinVar
+NP_005545.1,NP_005545.1.csv,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSVSVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPTIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLETKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDSIVGERGRLDSELRGMQDLVEDFKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCANLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNISVVQSTVSSGYGGASGVGSGLGLGGGSSYSYGSGLGVGGGFSSSSGRAIGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,mutated_sequence,1.0,564.0,NP_005545.1.a2m,NP_005545.1.npy,ClinVar
+NP_005546.2,NP_005546.2.csv,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDNIVGERGRLDSELRNMQDLVEDLKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQITAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCANLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNISVVQSTVSSGYGGASGVGSGLGLGGGSSYSYGSGLGVGGGFSSSSGRATGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,mutated_sequence,1.0,564.0,NP_005546.2.a2m,NP_005546.2.npy,ClinVar
+NP_005548.2,NP_005548.2.csv,MTTCSRQFTSSSSMKGSCGIGGGIGGGSSRISSVLAGGSCRAPSTYGGGLSVSSRFSSGGACGLGGGYGGGFSSSSSFGSGFGGGYGGGLGAGFGGGLGAGFGGGFAGGDGLLVGSEKVTMQNLNDRLASYLDKVRALEEANADLEVKIRDWYQRQRPSEIKDYSPYFKTIEDLRNKIIAATIENAQPILQIDNARLAADDFRTKYEHELALRQTVEADVNGLRRVLDELTLARTDLEMQIEGLKEELAYLRKNHEEEMLALRGQTGGDVNVEMDAAPGVDLSRILNEMRDQYEQMAEKNRRDAETWFLSKTEELNKEVASNSELVQSSRSEVTELRRVLQGLEIELQSQLSMKASLENSLEETKGRYCMQLSQIQGLIGSVEEQLAQLRCEMEQQSQEYQILLDVKTRLEQEIATYRRLLEGEDAHLSSQQASGQSYSSREVFTSSSSSSSRQTRPILKEQSSSSFSQGQSS,mutated_sequence,1.0,473.0,NP_005548.2.a2m,NP_005548.2.npy,ClinVar
+NP_005576.3,NP_005576.3.csv,MFRSKRSGLVRRLWRSRVVPDREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGRSEVRPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLPESDCETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTYSLLKRLKERSLDTLLEAVESRGGVPGGCVLVPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVELKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSPRDEYKPLDLSDSTLSYTETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDLPQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPGGRALVVRKVPPGYSIKVFDFERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQFITSCPCWLEILLNNPR,mutated_sequence,1.0,496.0,NP_005576.3.a2m,NP_005576.3.npy,ClinVar
+NP_005600.1,NP_005600.1.csv,MSRPLSDQEKRKQISVRGLAGVENVTELKKNFNRHLHFTLVKDRNVATPRDYYFALAHTVRDHLVGRWIRTQQHYYEKDPKRIYYLSLEFYMGRTLQNTMVNLALENACDEATYQLGLDMEELEEIEEDAGLGNGGLGRLAACFLDSMATLGLAAYGYGIRYEFGIFNQKISGGWQMEEADDWLRYGNPWEKARPEFTLPVHFYGHVEHTSQGAKWVDTQVVLAMPYDTPVPGYRNNVVNTMRLWSAKAPNDFNLKDFNVGGYIQAVLDRNLAENISRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKSSKFGCRDPVRTNFDAFPDKVAIQLNDTHPSLAIPELMRILVDLERMDWDKAWDVTVRTCAYTNHTVLPEALERWPVHLLETLLPRHLQIIYEINQRFLNRVAAAFPGDVDRLRRMSLVEEGAVKRINMAHLCIAGSHAVNGVARIHSEILKKTIFKDFYELEPHKFQNKTNGITPRRWLVLCNPGLAEVIAERIGEDFISDLDQLRKLLSFVDDEAFIRDVAKVKQENKLKFAAYLEREYKVHINPNSLFDIQVKRIHEYKRQLLNCLHVITLYNRIKREPNKFFVPRTVMIGGKAAPGYHMAKMIIRLVTAIGDVVNHDPAVGDRLRVIFLENYRVSLAEKVIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTMDGANVEMAEEAGEENFFIFGMRVEDVDKLDQRGYNAQEYYDRIPELRQVIEQLSSGFFSPKQPDLFKDIVNMLMHHDRFKVFADYEDYIKCQEKVSALYKNPREWTRMVIRNIATSGKFSSDRTIAQYAREIWGVEPSRQRLPAPDEAI,mutated_sequence,1.0,842.0,NP_005600.1.a2m,NP_005600.1.npy,ClinVar
+NP_005620.1,NP_005620.1.csv,MAKKSAENGIYSVSGDEKKGPLIAPGPDGAPAKGDGPVGLGTPGGRLAVPPRETWTRQMDFIMSCVGFAVGLGNVWRFPYLCYKNGGGVFLIPYVLIALVGGIPIFFLEISLGQFMKAGSINVWNICPLFKGLGYASMVIVFYCNTYYIMVLAWGFYYLVKSFTTTLPWATCGHTWNTPDCVEIFRHEDCANASLANLTCDQLADRRSPVIEFWENKVLRLSGGLEVPGALNWEVTLCLLACWVLVYFCVWKGVKSTGKIVYFTATFPYVVLVVLLVRGVLLPGALDGIIYYLKPDWSKLGSPQVWIDAGTQIFFSYAIGLGALTALGSYNRFNNNCYKDAIILALINSGTSFFAGFVVFSILGFMAAEQGVHISKVAESGPGLAFIAYPRAVTLMPVAPLWAALFFFMLLLLGLDSQFVGVEGFITGLLDLLPASYYFRFQREISVALCCALCFVIDLSMVTDGGMYVFQLFDYYSASGTTLLWQAFWECVVVAWVYGADRFMDDIACMIGYRPCPWMKWCWSFFTPLVCMGIFIFNVVYYEPLVYNNTYVYPWWGEAMGWAFALSSMLCVPLHLLGCLLRAKGTMAERWQHLTQPIWGLHHLEYRAQDADVRGLTTLTPVSESSKVVVVESVM,mutated_sequence,1.0,635.0,NP_005620.1.a2m,NP_005620.1.npy,ClinVar
+NP_005624.2,NP_005624.2.csv,MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLCQAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGYKIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDVEDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSANDVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYARDILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQLEEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRLPGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAALISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPIIKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADRIAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGTVRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLHPIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKILEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSKRRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRNPKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAPNSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISLTKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISDRTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS,mutated_sequence,1.0,1333.0,NP_005624.2.a2m,NP_005624.2.npy,ClinVar
+NP_005630.1,NP_005630.1.csv,MVSESHHEALAAPPVTTVATVLPSNATEPASPGEGKEDAFSKLKEKFMNELHKIPLPPWALIAIAIVAVLLVLTCCFCICKKCLFKKKNKKKGKEKGGKNAINMKDVKDLGKTMKDQALKDDDAETGLTDGEEKEEPKEEEKLGKLQYSLDYDFQNNQLLVGIIQAAELPALDMGGTSDPYVKVFLLPDKKKKFETKVHRKTLNPVFNEQFTFKVPYSELGGKTLVMAVYDFDRFSKHDIIGEFKVPMNTVDFGHVTEEWRDLQSAEKEEQEKLGDICFSLRYVPTAGKLTVVILEAKNLKKMDVGGLSDPYVKIHLMQNGKRLKKKKTTIKKNTLNPYYNESFSFEVPFEQIQKVQVVVTVLDYDKIGKNDAIGKVFVGYNSTGAELRHWSDMLANPRRPIAQWHTLQVEEEVDAMLAVKK,mutated_sequence,1.0,422.0,NP_005630.1.a2m,NP_005630.1.npy,ClinVar
+NP_005645.1,NP_005645.1.csv,MAMVVSSWRDPQDDVAGGNPGGPNPAAQAARGGGGGAGEQQQQAGSGAPHTPQTPGQPGAPATPGTAGDKGQGPPGSGQSQQHIECVVCGDKSSGKHYGQFTCEGCKSFFKRSVRRNLTYTCRANRNCPIDQHHRNQCQYCRLKKCLKVGMRREAVQRGRMPPTQPNPGQYALTNGDPLNGHCYLSGYISLLLRAEPYPTSRYGSQCMQPNNIMGIENICELAARLLFSAVEWARNIPFFPDLQITDQVSLLRLTWSELFVLNAAQCSMPLHVAPLLAAAGLHASPMSADRVVAFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTSDACGLSDAAHIESLQEKSQCALEEYVRSQYPNQPSRFGKLLLRLPSLRTVSSSVIEQLFFVRLVGKTPIETLIRDMLLSGSSFNWPYMSIQCS,mutated_sequence,1.0,423.0,NP_005645.1.a2m,NP_005645.1.npy,ClinVar
+NP_005651.1,NP_005651.1.csv,MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVDPLFALGAGLVIGAVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDLITEPFLPKLLTKVKGS,mutated_sequence,1.0,396.0,NP_005651.1.a2m,NP_005651.1.npy,ClinVar
+NP_005786.1,NP_005786.1.csv,MEKKCTLYFLVLLPFFMILVTAELEESPEDSIQLGVTRNKIMTAQYECYQKIMQDPIQQAEGVYCNRTWDGWLCWNDVAAGTESMQLCPDYFQDFDPSEKVTKICDQDGNWFRHPASNRTWTNYTQCNVNTHEKVKTALNLFYLTIIGHGLSIASLLISLGIFFYFKSLSCQRITLHKNLFFSFVCNSVVTIIHLTAVANNQALVATNPVSCKVSQFIHLYLMGCNYFWMLCEGIYLHTLIVVAVFAEKQHLMWYYFLGWGFPLIPACIHAIARSLYYNDNCWISSDTHLLYIIHGPICAALLVNLFFLLNIVRVLITKLKVTHQAESNLYMKAVRATLILVPLLGIEFVLIPWRPEGKIAEEVYDYIMHILMHFQGLLVSTIFCFFNGEVQAILRRNWNQYKIQFGNSFSNSEALRSASYTVSTISDGPGYSHDCPSEHLNGKSIHDIENVLLKPENLYN,mutated_sequence,1.0,461.0,NP_005786.1.a2m,NP_005786.1.npy,ClinVar
+NP_005850.1,NP_005850.1.csv,MADRDSGSEQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGGGAPGGLQHETQELASKRVDIQNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAVEFRDYLGDFIEHYAQLGPSQPPDLAQAQDEPRRALKSEFLVRENRKYYMDLKENQRGRFLRIRQTVNRGPGLGSTQGQTIALPAQGLIEFRDALAKLIDDYGVEEEPAELPEGTSLTVDNKRFFFDVGSNKYGVFMRVSEVKPTYRNSITVPYKVWAKFGHTFCKYSEEMKKIQEKQREKRAACEQLHQQQQQQQEETAAATLLLQGEEEGEED,mutated_sequence,1.0,322.0,NP_005850.1.a2m,NP_005850.1.npy,ClinVar
+NP_005852.2,NP_005852.2.csv,MKGKEEKEGGARLGAGGGSPEKSPSAQELKEQGNRLFVGRKYPEAAACYGRAITRNPLVAVYYTNRALCYLKMQQHEQALADCRRALELDGQSVKAHFFLGQCQLEMESYDEAIANLQRAYSLAKEQRLNFGDDIPSALRIAKKKRWNSIEERRIHQESELHSYLSRLIAAERERELEECQRNHEGDEDDSHVRAQQACIEAKHDKYMADMDELFSQVDEKRKKRDIPDYLCGKISFELMREPCITPSGITYDRKDIEEHLQRVGHFDPVTRSPLTQEQLIPNLAMKEVIDAFISENGWVEDY,mutated_sequence,1.0,303.0,NP_005852.2.a2m,NP_005852.2.npy,ClinVar
+NP_005893.1,NP_005893.1.csv,MSSILPFTPPIVKRLLGWKKGEQNGQEEKWCEKAVKSLVKKLKKTGQLDELEKAITTQNVNTKCITIPRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHELRAMELCEFAFNMKKDEVCVNPYHYQRVETPVLPPVLVPRHTEIPAEFPPLDDYSHSIPENTNFPAGIEPQSNIPETPPPGYLSEDGETSDHQMNHSMDAGSPNLSPNPMSPAHNNLDLQPVTYCEPAFWCSISYYELNQRVGETFHASQPSMTVDGFTDPSNSERFCLGLLSNVNRNAAVELTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKIPPGCNLKIFNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKGWGAEYRRQTVTSTPCWIELHLNGPLQWLDKVLTQMGSPSIRCSSVS,mutated_sequence,1.0,425.0,NP_005893.1.a2m,NP_005893.1.npy,ClinVar
+NP_005948.3,NP_005948.3.csv,MVNEARGNSSLNPCLEGSASSGSESSKDSSRCSTPGLDPERHERLREKMRRRLESGDKWFSLEFFPPRTAEGAVNLISRFDRMAAGGPLYIDVTWHPAGDPGSDKETSSMMIASTAVNYCGLETILHMTCCRQRLEEITGHLHKAKQLGLKNIMALRGDPIGDQWEEEEGGFNYAVDLVKHIRSEFGDYFDICVAGYPKGHPEAGSFEADLKHLKEKVSAGADFIITQLFFEADTFFRFVKACTDMGITCPIVPGIFPIQGYHSLRQLVKLSKLEVPQEIKDVIEPIKDNDAAIRNYGIELAVSLCQELLASGLVPGLHFYTLNREMATTEVLKRLGMWTEDPRRPLPWALSAHPKRREEDVRPIFWASRPKSYIYRTQEWDEFPNGRWGNSSSPAFGELKDYYLFYLKSKSPKEELLKMWGEELTSEESVFEVFVLYLSGEPNRNGHKVTCLPWNDEPLAAETSLLKEELLRVNRQGILTINSQPNINGKPSSDPIVGWGPSGGYVFQKAYLEFFTSRETAEALLQVLKKYELRVNYHLVNVKGENITNAPELQPNAVTWGIFPGREIIQPTVVDPVSFMFWKDEAFALWIERWGKLYEEESPSRTIIQYIHDNYFLVNLVDNDFPLDNCLWQVVEDTLELLNRPTQNARETEAP,mutated_sequence,1.0,656.0,NP_005948.3.a2m,NP_005948.3.npy,ClinVar
+NP_005951.1,NP_005951.1.csv,MQLLGLLGLLWMLKASPWATGTLSTATSISQVPFPRAEAASAVLSNSPHSRDLAGWPLGVPQLASPAPGHRENAPMTLTTSPHDTLISETLLNSPVSSNTSTTPTSKFAFKVETTPPTVLVYSATTECVYPTSFIITISHPTSICVTTTQVAFTSSYTSTPVTQKPVTTVTSTYSMTTTEKGTSAMTSSPSTTTARETPIVTVTPSSVSATDTTFHTTISSTTRTTERTPLPTGSIHTTTSPTPVFTTLKTAVTSTSPITSSITSTNTVTSMTTTASQPTATNTLSSPTRTILSSTPVLSTETITSGITNTTPLSTLVTTLPTTISRSTPTSETTYTTSPTSTVTDSTTKIAYSTSMTGTLSTETSLPPTSSSLPTTETATTPMTNLVTTTTEISSHSTPSFSSSTIYSTVSTSTTAISSLPPTSGTMVTSTTMTPSSLSTDIPFTTPTTITHHSVGSTGFLTTATDLTSTFTVSSSSAMSTSVIPSSPSIQNTETSSLVSMTSATTPNVRPTFVSTLSTPTSSLLTTFPATYSFSSSMSASSAGTTHTESISSPPASTSTLHTTAESTLAPTTTTSFTTSTTMEPPSTTAATTGTGQTTFTSSTATFPETTTPTPTTDMSTESLTTAMTSPPITSSVTSTNTVTSMTTTTSPPTTTNSFTSLTSMPLSSTPVPSTEVVTSGTINTIPPSILVTTLPTPNASSMTTSETTYPNSPTGPGTNSTTEITYPTTMTETSSTATSLPPTSPLVSTAKTAKTPTTNLVTTTTKTTSHSTTSFTSSTVYSTASTYTTAITSVPTTLGTMVTSTSMISSTVSTGIPTSQPTTITPSSVGISGSLPMMTDLTSVYTVSNMSARPTTVIPSSPTVQNTEISISVSMTSATTPSGGPTFTSTENTPTRSLLTSFPMTHSFSSSMSESSAGTTHTESISSPRGTTSTLHTTVESTPSPTTTTSFTTSTMMEPPSSTVSTTGRGQTTFPSSTATFPETTTLTPTTDISTVSLTTAMTSPPPVSSSITPTNTMTSMRTTTYWPTATNTLSPLTSSILSSTPVPSTEMITSHTTNTTPLSTLVTTLLTTITRSTPTSETTYPTSPTSIVSDSTTEITYSTSITGTLSTATTLPPTSSSLPTTETATMTPTTTLITTTPNTTSLSTPSFTSSTIYSTVSTSTTAISSASPTSGTMVTSTTMTPSSLSTDTPSTTPTTITYPSVGSTGFLTTATDLTSTFTVSSSSAMSTSVIPSSPSIQNTETSSLVSMTSATTPSLRPTITSTDSTLTSSLLTTFPSTYSFSSSMSASSAGTTHTETISSLPASTNTIHTTAESALAPTTTTSFTTSPTMEPPSTTVATTGTGQTTFPSSTATFLETTTLTPTTDFSTESLTTAMTSTPPITSSITPTDTMTSMRTTTSWPTATNTLSPLTSSILSSTPVPSTEVTTSHTTNTNPVSTLVTTLPITITRSTLTSETAYPSSPTSTVTESTTEITYPTTMTETSSTATSLPPTSSLVSTAETAKTPTTNLVTTTTKTTSHSTTSFTSSTIYSTASTPTTAITSVPTTLGTMVTSTSMIPSTVSTGIPTSQPTTITPSSVGISGSLPMMTDLTSVYTVSSMSARPTSVIPSSPTVQNTETSIFVSMMSATTPSGGPTFTSTENTPTRSLLTSFPVTHSFSSSMSASSVGTTHTQSISSPPAITSTLHTTAESTPSPTTTMSFTTFTKMETPSSTVATTGTGQTTFTSSTATSPKTTTLTPTSDISTGSFKTAVSSTPPITSSITSTYTVTSMTTTTPLGPTATNTLPSFTSSVSSSTPVPSTEAITSGTTNTTPLSTLVTTFSNSDTSSTPTSETTYPTSLTSALTDSTTRTTYSTNMTGTLSTVTSLRPTSSSLLTTVTATVPTTNLVTTTTKITSHSTPSFTSSIATTETPSHSTPRFTSSITTTETPSHSTPRFTSSITNTKTTSHSSPSFTSSITTTETTSHNTPSLTSSITTTKTTSHSTPSYTSLITTTTTTSHSTPSFTSSITTTETTSHNTPSLTSSITTTETTSHSTPSFTSSITTETTSHSTPSFTSLITITEITSHSTLSYTTSITTTETPSHSTLSFTSSITTTETTSHSTPSFTSSITTSEMPSHSTPSFTSSITTTENATHSTPNFTSSITTTETTSHSTPSFTSLITTTETTSHRWGTTETTSYSTPSFTSSNTITETTSHSTPSYITSITTTETPSSSTPSFSSSITTTETTSHSTPGFTSSITTTETTSHSTPSFTSSITTTETTSHDTPSFTSSITTSETPSHSTPSSTSLITTTKTTSHSTPSFTSSITTTETTSHSAHSFTSSITTTETTSHNTRSFTSSITTTETNSHSTTSFTSSITTTETTSHSTPSFSSSITTTETPLHSTPGLTSWVTTTKTTSHITPGLTSSITTTETTSHSTPGFTSSITTTETTSESTPSLSSSTIYSTVSTSTTAITSHFTTSETAVTPTPVTPSSLSTDIPTTSLRTLTPSSVGTSTSLTTTTDFPSIPTDISTLPTRTHIISSSPSIQSTETSSLVGTTSPTMSTVRMTLRITENTPISSFSTSIVVIPETPTQTPPVLTSATGTQTSPAPTTVTFGSTDSSTSTLHTLTPSTALSTIVSTSQVPIPSTHSSTLQTTPSTPSLQTSLTSTSEFTTESFTRGSTSTNAILTSFSTIIWSSTPTIIMSSSPSSASITPVFSTTIHSVPSSPYIFSTENVGSASITGFPSLSSSATTSTSSTSSSLTTALTEITPFSYISLPSTTPCPGTITITIVPASPTDPCVEMDPSTEATSPPTTPLTVFPFTTEMVTCPTSISIQTTLTTYMDTSSMMPESESSISPNASSSTGTGTVPTNTVFTSTRLPTSETWLSNSSVIPLPLPGVSTIPLTMKPSSSLPTILRTSSKSTHPSPPTTRTSETPVATTQTPTTLTSRRTTRITSQMTTQSTLTTTAGTCDNGGTWEQGQCACLPGFSGDRCQLQTRCQNGGQWDGLKCQCPSTFYGSSCEFAVEQVDLDVVETEVGMEVSVDQQFSPDLNDNTSQAYRDFNKTFWNQMQKIFADMQGFTFKGVEILSLRNGSIVVDYLVLLEMPFSPQLESEYEQVKTTLKEGLQNASQDVNSCQDSQTLCFKPDSIKVNNNSKTELTPAAICRRAAPTGYEEFYFPLVEATRLRCVTKCTSGVDNAIDCHQGQCVLETSGPTCRCYSTDTHWFSGPRCEVAVHWRALVGGLTAGAALLVLLLLALGVRAVRSGWWGGQRRGRSWDQDRKWFETWDEEVVGTFSNWGFEDDGTDKDTNFYVALENVDTTMKVHIKRPEMTSSSV,mutated_sequence,1.0,3323.0,NP_005951.1.a2m,NP_005951.1.npy,ClinVar
+NP_005952.2,NP_005952.2.csv,MVQRWLLLSCCGALLSAGLANTSYTSPGLQRLKDSPQTAPDKGQCSTWGAGHFSTFDHHVYDFSGTCNYIFAATCKDAFPTFSVQLRRGPDGSISRIIVELGASVVTVSEAIISVKDIGVISLPYTSNGLQITPFGQSVRLVAKQLELELEVVWGPDSHLMVLVERKYMGQMCGLCGNFDGKVTNEFVSEEGKFLEPHKFAALQKLDDPGEICTFQDIPSTHVRQAQHARICTQLLTLVAPECSVSKEPFVLSCQADVAAAPQPGPQNSSCATLSEYSRQCSMVGQPVRRWRSPGLCSVGQCPANQVYQECGSACVKTCSNPQHSCSSSCTFGCFCPEGTVLNDLSNNHTCVPVTQCPCVLHGAMYAPGEVTIAACQTCRCTLGRWVCTERPCPGHCSLEGGSFVTTFDARPYRFHGTCTYILLQSPQLPEDGALMAVYDKSGVSHSETSLVAVVYLSRQDKIVISQDEVVTNNGEAKWLPYKTRNITVFRQTSTHLQMATSFGLELVVQLRPIFQAYVTVGPQFRGQTRGLCGNFNGDTTDDFTTSMGIAEGTASLFVDSWRAGNCPAALERETDPCSMSQLNKVCAETHCSMLLRTGTVFERCHATVNPAPFYKRCVYQACNYEETFPHICAALGDYVHACSLRGVLLWGWRSSVDNCTIPCTGNTTFSYNSQACERTCLSLSDRATECHHSAVPVDGCNCPDGTYLNQKGECVRKAQCPCILEGYKFILAEQSTVINGITCHCINGRLSCPQRPQMFLASCQAPKTFKSCSQSSENKFGAACAPTCQMLATGVACVPTKCEPGCVCAEGLYENADGQCVPPEECPCEFSGVSYPGGAELHTDCRTCSCSRGRWACQQGTHCPSTCTLYGEGHVITFDGQRFVFDGNCEYILATDVCGVNDSQPTFKILTENVICGNSGVTCSRAIKIFLGGLSVVLADRNYTVTGEEPHVQLGVTPGALSLVVDISIPGRYNLTLIWNRHMTILIRIARASQDPLCGLCGNFNGNMKDDFETRSRYVASSELELVNSWKESPLCGDVSFVTDPCSLNAFRRSWAERKCSVINSQTFATCHSKVYHLPYYEACVRDACGCDSGGDCECLCDAVAAYAQACLDKGVCVDWRTPAFCPIYCGFYNTHTQDGHGEYQYTQEANCTWHYQPCLCPSQPQSVPGSNIEGCYNCSQDEYFDHEEGVCVPCMPPTTPQPPTTPQLPTTGSRPTQVWPMTGTSTTIGLLSSTGPSPSSNHTPASPTQTPLLPATLTSSKPTASSGEPPRPTTAVTPQATSGLPPTATLRSTATKPTVTQATTRATASTASPATTSTAQSTTRTTMTLPTPATSGTSPTLPKSTNQELPGTTATQTTGPRPTPASTTGPTTPQPGQPTRPTATETTQTRTTTEYTTPQTPHTTHSPPTAGSPVPSTGPVTATSFHATTTYPTPSHPETTLPTHVPPFSTSLVTPSTHTVITPTHAQMATSASNHSAPTGTIPPPTTLKATGSTHTAPPITPTTSGTSQAHSSFSTNKTPTSLHSHTSSTHHPEVTPTSTTTITPNPTSTRTRTPVAHTNSATSSRPPPPFTTHSPPTGSSPFSSTGPMTATSFKTTTTYPTPSHPQTTLPTHVPPFSTSLVTPSTHTVITPTHAQMATSASIHSMPTGTIPPPTTLKATGSTHTAPTMTLTTSGTSQALSSLNTAKTSTSLHSHTSSTHHAEATSTSTTNITPNPTSTGTPPMTVTTSGTSQSRSSFSTAKTSTSLHSHTSSTHHPEVTSTSTTSITPNHTSTGTRTPVAHTTSATSSRLPTPFTTHSPPTGTTPISSTGPVTATSFQTTTTYPTPSHPHTTLPTHVPSFSTSLVTPSTHTVIIPTHTQMATSASIHSMPTGTIPPPTTIKATGSTHTAPPMTPTTSGTSQSPSSFSTAKTSTSLPYHTSSTHHPEVTPTSTTNITPKHTSTGTRTPVAHTTSASSSRLPTPFTTHSPPTGSSPFSSTGPMTATSFQTTTTYPTPSHPQTTLPTHVPPFSTSLVTPSTHTVIITTHTQMATSASIHSTPTGTVPPPTTLKATGSTHTAPPMTVTTSGTSQTHSSFSTATASSSFISSSSWLPQNSSSRPPSSPITTQLPHLSSATTPVSTTNQLSSSFSPSPSAPSTVSSYVPSSHSSPQTSSPSVGTSSSFVSAPVHSTTLSSGSHSSLSTHPTTASVSASPLFPSSPAASTTIRATLPHTISSPFTLSALLPISTVTVSPTPSSHLASSTIAFPSTPRTTASTHTAPAFSSQSTTSRSTSLTTRVPTSGFVSLTSGVTGIPTSPVTNLTTRHPGPTLSPTTRFLTSSLTAHGSTPASAPVSSLGTPTPTSPGVCSVREQQEEITFKGCMANVTVTRCEGACISAASFNIITQQVDARCSCCRPLHSYEQQLELPCPDPSTPGRRLVLTLQVFSHCVCSSVACGD,mutated_sequence,1.0,2439.0,NP_005952.2.a2m,NP_005952.2.npy,ClinVar
+NP_005973.1,NP_005973.1.csv,MSMLPSFGFTQEQVACVCEVLQQGGNLERLGRFLWSLPACDHLHKNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHPKLQQLWLKAHYVEAEKLRGRPLGAVGKYRVRRKFPLPRTIWDGEETSYCFKEKSRGVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDRAAEAKERENTENNNSSSNKQNQLSPLEGGKPLMSSSEEEFSPPQSPDQNSVLLLQGNMGHARSSNYSLPGLTASQPSHGLQTHQHQLQDSLLGPLTSSLVDLGS,mutated_sequence,1.0,284.0,NP_005973.1.a2m,NP_005973.1.npy,ClinVar
+NP_005975.1,NP_005975.1.csv,MPAPRAPRALAAAAPASGKAKLTHPGKAILAGGLAGGIEICITFPTEYVKTQLQLDERSHPPRYRGIGDCVRQTVRSHGVLGLYRGLSSLLYGSIPKAAVRFGMFEFLSNHMRDAQGRLDSTRGLLCGLGAGVAEAVVVVCPMETIKVKFIHDQTSPNPKYRGFFHGVREIVREQGLKGTYQGLTATVLKQGSNQAIRFFVMTSLRNWYRGDNPNKPMNPLITGVFGAIAGAASVFGNTPLDVIKTRMQGLEAHKYRNTWDCGLQILKKEGLKAFYKGTVPRLGRVCLDVAIVFVIYDEVVKLLNKVWKTD,mutated_sequence,1.0,311.0,NP_005975.1.a2m,NP_005975.1.npy,ClinVar
+NP_005996.2,NP_005996.2.csv,MDSNTAPLGPSCPQPPPAPQPQARSRLNATASLEQERSERPRAPGPQAGPGPGVRDAAAPAEPQAQHTRSRERADGTGPTKGDMEIPFEEVLERAKAGDPKAQTEVGKHYLQLAGDTDEELNSCTAVDWLVLAAKQGRREAVKLLRRCLADRRGITSENEREVRQLSSETDLERAVRKAALVMYWKLNPKKKKQVAVAELLENVGQVNEHDGGAQPGPVPKSLQKQRRMLERLVSSESKNYIALDDFVEITKKYAKGVIPSSLFLQDDEDDDELAGKSPEDLPLRLKVVKYPLHAIMEIKEYLIDMASRAGMHWLSTIIPTHHINALIFFFIVSNLTIDFFAFFIPLVIFYLSFISMVICTLKVFQDSKAWENFRTLTDLLLRFEPNLDVEQAEVNFGWNHLEPYAHFLLSVFFVIFSFPIASKDCIPCSELAVITGFFTVTSYLSLSTHAEPYTRRALATEVTAGLLSLLPSMPLNWPYLKVLGQTFITVPVGHLVVLNVSVPCLLYVYLLYLFFRMAQLRNFKGTYCYLVPYLVCFMWCELSVVILLESTGLGLLRASIGYFLFLFALPILVAGLALVGVLQFARWFTSLELTKIAVTVAVCSVPLLLRWWTKASFSVVGMVKSLTRSSMVKLILVWLTAIVLFCWFYVYRSEGMKVYNSTLTWQQYGALCGPRAWKETNMARTQILCSHLEGHRVTWTGRFKYVRVTDIDNSAESAINMLPFFIGDWMRCLYGEAYPACSPGNTSTAEEELCRLKLLAKHPCHIKKFDRYKFEITVGMPFSSGADGSRSREEDDVTKDIVLRASSEFKSVLLSLRQGSLIEFSTILEGRLGSKWPVFELKAISCLNCMAQLSPTRRHVKIEHDWRSTVHGAVKFAFDFFFFPFLSAA,mutated_sequence,1.0,890.0,NP_005996.2.a2m,NP_005996.2.npy,ClinVar
+NP_006000.2,NP_006000.2.csv,MRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTIGGGDDSFNTFFSETGAGKHVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIIDLVLDRIRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAEKAYHEQLSVAEITNACFEPANQMVKCDPRHGKYMACCLLYRGDVVPKDVNAAIATIKTKRTIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVDSVEGEGEEEGEEY,mutated_sequence,1.0,451.0,NP_006000.2.a2m,NP_006000.2.npy,ClinVar
+NP_006006.3,NP_006006.3.csv,MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPGLAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPGSGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLTSPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHAPMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQQQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQSTTQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPPQELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQIHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQMHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVAMHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAGMAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGSMQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQKRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPYSRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYSPSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPPAKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATATAATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQGRNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAPVPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPVPASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGLLAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPGQRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASENSEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFESKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSSTLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKRCVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQGVSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKNPVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQNPPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLFLIGQS,mutated_sequence,1.0,2285.0,NP_006006.3.a2m,NP_006006.3.npy,ClinVar
+NP_006010.2,NP_006010.2.csv,MGSMFRSEEVALVQLFLPTAAAYTCVSRLGELGLVEFRDLNASVSAFQRRFVVDVRRCEELEKTFTFLQEEVRRAGLVLPPPKGRLPAPPPRDLLRIQEETERLAQELRDVRGNQQALRAQLHQLQLHAAVLRQGHEPQLAAAHTDGASERTPLLQAPGGPHQDLRVNFVAGAVEPHKAPALERLLWRACRGFLIASFRELEQPLEHPVTGEPATWMTFLISYWGEQIGQKIRKITDCFHCHVFPFLQQEEARLGALQQLQQQSQELQEVLGETERFLSQVLGRVLQLLPPGQVQVHKMKAVYLALNQCSVSTTHKCLIAEAWCSVRDLPALQEALRDSSMEEGVSAVAHRIPCRDMPPTLIRTNRFTASFQGIVDAYGVGRYQEVNPAPYTIITFPFLFAVMFGDVGHGLLMFLFALAMVLAENRPAVKAAQNEIWQTFFRGRYLLLLMGLFSIYTGFIYNECFSRATSIFPSGWSVAAMANQSGWSDAFLAQHTMLTLDPNVTGVFLGPYPFGIDPIWSLAANHLSFLNSFKMKMSVILGVVHMAFGVVLGVFNHVHFGQRHRLLLETLPELTFLLGLFGYLVFLVIYKWLCVWAARAASAPSILIHFINMFLFSHSPSNRLLYPRQEVVQATLVVLALAMVPILLLGTPLHLLHRHRRRLRRRPADRQEENKAGLLDLPDASVNGWSSDEEKAGGLDDEEEAELVPSEVLMHQAIHTIEFCLGCVSNTASYLRLWALSLAHAQLSEVLWAMVMRIGLGLGREVGVAAVVLVPIFAAFAVMTVAILLVMEGLSAFLHALRLHWVEFQNKFYSGTGYKLSPFTFAATDD,mutated_sequence,1.0,830.0,NP_006010.2.a2m,NP_006010.2.npy,ClinVar
+NP_006022.3,NP_006022.3.csv,MEVEQEQRRRKVEAGRTKLAHFRQRKTKGDSSHSEKKTAKRKGSAVDASVQEESPVTKEDSALCGGGDICKSTSCDDTPDGAGGAFAAQPEDCDGEKREDLEQLQQKQVNDHPPEQCGMFTVSDHPPEQHGMFTVGDHPPEQRGMFTVSDHPPEQHGMFTVSDHPPEQRGMFTISDHQPEQRGMFTVSDHTPEQRGIFTISDHPAEQRGMFTKECEQECELAITDLESGREDEAGLHQSQAVHGLELEALRLSLSNMHTAQLELTQANLQKEKETALTELREMLNSRRAQELALLQSRQQHELELLREQHAREKEEVVLRCGQEAAELKEKLQSEMEKNAQIVKTLKEDWESEKDLCLENLRKELSAKHQSEMEDLQNQFQKELAEQRAELEKIFQDKNQAERALRNLESHHQAAIEKLREDLQSEHGRCLEDLEFKFKESEKEKQLELENLQASYEDLKAQSQEEIRRLWSQLDSARTSRQELSELHEQLLARTSRVEDLEQLKQREKTQHESELEQLRIYFEKKLRDAEKTYQEDLTLLQQRLQGAREDALLDSVEVGLSCVGLEEKPEKGRKDHVDELEPERHKESLPRFQAELEESHRHQLEALESPLCIQHEGHVSDRCCVETSALGHEWRLEPSEGHSQELPWVHLQGVQDGDLEADTERAARVLGLETEHKVQLSLLQTELKEEIELLKIENRNLYGKLQHETRLKDDLEKVKHNLIEDHQKELNNAKQKTELMKQEFQRKETDWKVMKEELQREAEEKLTLMLLELREKAESEKQTIINKFELREAEMRQLQDQQAAQILDLERSLTEQQGRLQQLEQDLTSDDALHCSQCGREPPTAQDGELAALHVKEDCALQLMLARSRFLEERKEITEKFSAEQDAFLQEAQEQHARELQLLQERHQQQLLSVTAELEARHQAALGELTASLESKQGALLAARVAELQTKHAADLGALETRHLSSLDSLESCYLSEFQTIREEHRQALELLRADFEEQLWKKDSLHQTILTQELEKLKRKHEGELQSVRDHLRTEVSTELAGTVAHELQGVHQGEFGSEKKTALHEKEETLRLQSAQAQPFHQEEKESLSLQLQKKNHQVQQLKDQVLSLSHEIEECRSELEVLQQRRERENREGANLLSMLKADVNLSHSERGALQDALRRLLGLFGETLRAAVTLRSRIGERVGLCLDDAGAGLALSTAPALEETWSDVALPELDRTLSECAEMSSVAEISSHMRESFLMSPESVRECEQPIRRVFQSLSLAVDGLMEMALDSSRQLEEARQIHSRFEKEFSFKNEETAQVVRKHQELLECLKEESAAKAELALELHKTQGTLEGFKVETADLKEVLAGKEDSEHRLVLELESLRRQLQQAAQEQAALREECTRLWSRGEATATDAEAREAALRKEVEDLTKEQSETRKQAEKDRSALLSQMKILESELEEQLSQHRGCAKQAEAVTALEQQVASLDKHLRNQRQFMDEQAAEREHEREEFQQEIQRLEGQLRQAAKPQPWGPRDSQQAPLDGEVELLQQKLREKLDEFNELAIQKESADRQVLMQEEEIKRLEEMNINIRKKVAQLQEEVEKQKNIVKGLEQDKEVLKKQQMSSLLLASTLQSTLDAGRCPEPPSGSPPEGPEIQLEVTQRALLRRESEVLDLKEQLEKMKGDLESKNEEILHLNLKLDMQNSQTAVSLRELEEENTSLKVIYTRSSEIEELKATIENLQENQKRLQKEKAEEIEQLHEVIEKLQHELSLMGPVVHEVSDSQAGSLQSELLCSQAGGPRGQALQGELEAALEAKEALSRLLADQERRHSQALEALQQRLQGAEEAAELQLAELERNVALREAEVEDMASRIQEFEAALKAKEATIAERNLEIDALNQRKAAHSAELEAVLLALARIRRALEQQPLAAGAAPPELQWLRAQCARLSRQLQVLHQRFLRCQVELDRRQARRATAHTRVPGAHPQPRMDGGAKAQVTGDVEASHDAALEPVVPDPQGDLQPVLVTLKDAPLCKQEGVMSVLTVCQRQLQSELLLVKNEMRLSLEDGGKGKEKVLEDCQLPKVDLVAQVKQLQEKLNRLLYSMTFQNVDAADTKSLWPMASAHLLESSWSDDSCDGEEPDISPHIDTCDANTATGGVTDVIKNQAIDACDANTTPGGVTDVIKNWDSLIPDEMPDSPIQEKSECQDMSLSSPTSVLGGSRHQSHTAEAGPRKSPVGMLDLSSWSSPEVLRKDWTLEPWPSLPVTPHSGALSLCSADTSLGDRADTSLPQTQGPGLLCSPGVSAAALALQWAESPPADDHHVQRTAVEKDVEDFITTSFDSQETLSSPPPGLEGKADRSEKSDGSGFGARLSPGSGGPEAQTAGPVTPASISGRFQPLPEAMKEKEVRPKHVKALLQMVRDESHQILALSEGLAPPSGEPHPPRKEDEIQDISLHGGKTQEVPTACPDWRGDLLQVVQEAFEKEQEMQGVELQPRLSGSDLGGHSSLLERLEKIIREQGDLQEKSLEHLRLPDRSSLLSEIQALRAQLRMTHLQNQEKLQHLRTALTSAEARGSQQEHQLRRQVELLAYKVEQEKCIAGDLQKTLSEEQEKANSVQKLLAAEQTVVRDLKSDLCESRQKSEQLSRSLCEVQQEVLQLRSMLSSKENELKAALQELESEQGKGRALQSQLEEEQLRHLQRESQSAKALEELRASLETQRAQSSRLCVALKHEQTAKDNLQKELRIEHSRCEALLAQERSQLSELQKDLAAEKSRTLELSEALRHERLLTEQLSQRTQEACVHQDTQAHHALLQKLKEEKSRVVDLQAMLEKVQQQALHSQQQLEAEAQKHCEALRREKEVSATLKSTVEALHTQKRELRCSLEREREKPAWLQAELEQSHPRLKEQEGRKAARRSAEARQSPAAAEQWRKWQRDKEKLRELELQRQRDLHKIKQLQQTVRDLESKDEVPGSRLHLGSARRAAGSDADHLREQQRELEAMRQRLLSAARLLTSFTSQAVDRTVNDWTSSNEKAVMSLLHTLEELKSDLSRPTSSQKKMAAELQFQFVDVLLKDNVSLTKALSTVTQEKLELSRAVSKLEKLLKHHLQKGCSPSRSERSAWKPDETAPQSSLRRPDPGRLPPAASEEAHTSNVKMEKLYLHYLRAESFRKALIYQKKYLLLLIGGFQDSEQETLSMIAHLGVFPSKAERKITSRPFTRFRTAVRVVIAILRLRFLVKKWQEVDRKGALAQGKAPRPGPRARQPQSPPRTRESPPTRDVPSGHTRDPARGRRLAAAASPHSGGRATPSPNSRLERSLTASQDPEHSLTEYIHHLEVIQQRLGGVLPDSTSKKSCHPMIKQ,mutated_sequence,1.0,3336.0,NP_006022.3.a2m,NP_006022.3.npy,ClinVar
+NP_006054.2,NP_006054.2.csv,MDSQRELAEELRLYQSTLLQDGLKDLLDEKKFIDCTLKAGDKSLPCHRLILSACSPYFREYFLSEIDEAKKKEVVLDNVDPAILDLIIKYLYSASIDLNDGNVQDIFALASRFQIPSVFTVCVSYLQKRLAPGNCLAILRLGLLLDCPRLAISAREFVSDRFVQICKEEDFMQLSPQELISVISNDSLNVEKEEAVFEAVMKWVRTDKENRVKNLSEVFDCIRFRLMTEKYFKDHVEKDDIIKSNPDLQKKIKVLKDAFAGKLPEPSKNAAKTGAGEVNGDVGDEDLLPGYLNDIPRHGMFVKDLILLVNDTAAVAYDPTENECYLTALAEQIPRNHSSIVTQQNQIYVVGGLYVDEENKDQPLQSYFFQLDSIASEWVGLPPLPSARCLFGLGEVDDKIYVVAGKDLQTEASLDSVLCYDPVAAKWNEVKKLPIKVYGHNVISHKGMIYCLGGKTDDKKCTNRVFIFNPKKGDWKDLAPMKIPRSMFGVAVHKGKIVIAGGVTEDGLSASVEAFDLTTNKWDVMTEFPQERSSISLVSLAGSLYAIGGFAMIQLESKEFAPTEVNDIWKYEDDKKEWAGMLKEIRYASGASCLATRLNLFKLSKL,mutated_sequence,1.0,606.0,NP_006054.2.a2m,NP_006054.2.npy,ClinVar
+NP_006197.1,NP_006197.1.csv,MGTSHPAFLVLGCLLTGLSLILCQLSLPSILPNENEKVVQLNSSFSLRCFGESEVSWQYPMSEEESSDVEIRNEENNSGLFVTVLEVSSASAAHTGLYTCYYNHTQTEENELEGRHIYIYVPDPDVAFVPLGMTDYLVIVEDDDSAIIPCRTTDPETPVTLHNSEGVVPASYDSRQGFNGTFTVGPYICEATVKGKKFQTIPFNVYALKATSELDLEMEALKTVYKSGETIVVTCAVFNNEVVDLQWTYPGEVKGKGITMLEEIKVPSIKLVYTLTVPEATVKDSGDYECAARQATREVKEMKKVTISVHEKGFIEIKPTFSQLEAVNLHEVKHFVVEVRAYPPPRISWLKNNLTLIENLTEITTDVEKIQEIRYRSKLKLIRAKEEDSGHYTIVAQNEDAVKSYTFELLTQVPSSILDLVDDHHGSTGGQTVRCTAEGTPLPDIEWMICKDIKKCNNETSWTILANNVSNIITEIHSRDRSTVEGRVTFAKVEETIAVRCLAKNLLGAENRELKLVAPTLRSELTVAAAVLVLLVIVIISLIVLVVIWKQKPRYEIRWRVIESISPDGHEYIYVDPMQLPYDSRWEFPRDGLVLGRVLGSGAFGKVVEGTAYGLSRSQPVMKVAVKMLKPTARSSEKQALMSELKIMTHLGPHLNIVNLLGACTKSGPIYIITEYCFYGDLVNYLHKNRDSFLSHHPEKPKKELDIFGLNPADESTRSYVILSFENNGDYMDMKQADTTQYVPMLERKEVSKYSDIQRSLYDRPASYKKKSMLDSEVKNLLSDDNSEGLTLLDLLSFTYQVARGMEFLASKNCVHRDLAARNVLLAQGKIVKICDFGLARDIMHDSNYVSKGSTFLPVKWMAPESIFDNLYTTLSDVWSYGILLWEIFSLGGTPYPGMMVDSTFYNKIKSGYRMAKPDHATSEVYEIMVKCWNSEPEKRPSFYHLSEIVENLLPGQYKKSYEKIHLDFLKSDHPAVARMRVDSDNAYIGVTYKNEEDKLKDWEGGLDEQRLSADSGYIIPLPDIDPVPEEEDLGKRNRHSSQTSEESAIETGSSSSTFIKREDETIEDIDMMDDIGIDSSDLVEDSFL,mutated_sequence,1.0,1089.0,NP_006197.1.a2m,NP_006197.1.npy,ClinVar
+NP_006205.1,NP_006205.1.csv,MEQLRAAARLQIVLGHLGRPSAGAVVAHPTSGTISSASFHPQQFQYTLDNNVLTLEQRKFYEENGFLVIKNLVPDADIQRFRNEFEKICRKEVKPLGLTVMRDVTISKSEYAPSEKMITKVQDFQEDKELFRYCTLPEILKYVECFTGPNIMAMHTMLINKPPDSGKKTSRHPLHQDLHYFPFRPSDLIVCAWTAMEHISRNNGCLVVLPGTHKGSLKPHDYPKWEGGVNKMFHGIQDYEENKARVHLVMEKGDTVFFHPLLIHGSGQNKTQGFRKAISCHFASADCHYIDVKGTSQENIEKEVVGIAHKFFGAENSVNLKDIWMFRARLVKGERTNL,mutated_sequence,1.0,338.0,NP_006205.1.a2m,NP_006205.1.npy,ClinVar
+NP_006209.2,NP_006209.2.csv,MPPRPSSGELWGIHLMPPRILVECLLPNGMIVTLECLREATLITIKHELFKEARKYPLHQLLQDESSYIFVSVTQEAEREEFFDETRRLCDLRLFQPFLKVIEPVGNREEKILNREIGFAIGMPVCEFDMVKDPEVQDFRRNILNVCKEAVDLRDLNSPHSRAMYVYPPNVESSPELPKHIYNKLDKGQIIVVIWVIVSPNNDKQKYTLKINHDCVPEQVIAEAIRKKTRSMLLSSEQLKLCVLEYQGKYILKVCGCDEYFLEKYPLSQYKYIRSCIMLGRMPNLMLMAKESLYSQLPMDCFTMPSYSRRISTATPYMNGETSTKSLWVINSALRIKILCATYVNVNIRDIDKIYVRTGIYHGGEPLCDNVNTQRVPCSNPRWNEWLNYDIYIPDLPRAARLCLSICSVKGRKGAKEEHCPLAWGNINLFDYTDTLVSGKMALNLWPVPHGLEDLLNPIGVTGSNPNKETPCLELEFDWFSSVVKFPDMSVIEEHANWSVSREAGFSYSHAGLSNRLARDNELRENDKEQLKAISTRDPLSEITEQEKDFLWSHRHYCVTIPEILPKLLLSVKWNSRDEVAQMYCLVKDWPPIKPEQAMELLDCNYPDPMVRGFAVRCLEKYLTDDKLSQYLIQLVQVLKYEQYLDNLLVRFLLKKALTNQRIGHFFFWHLKSEMHNKTVSQRFGLLLESYCRACGMYLKHLNRQVEAMEKLINLTDILKQEKKDETQKVQMKFLVEQMRRPDFMDALQGFLSPLNPAHQLGNLRLEECRIMSSAKRPLWLNWENPDIMSELLFQNNEIIFKNGDDLRQDMLTLQIIRIMENIWQNQGLDLRMLPYGCLSIGDCVGLIEVVRNSHTIMQIQCKGGLKGALQFNSHTLHQWLKDKNKGEIYDAAIDLFTRSCAGYCVATFILGIGDRHNSNIMVKDDGQLFHIDFGHFLDHKKKKFGYKRERVPFVLTQDFLIVISKGAQECTKTREFERFQEMCYKAYLAIRQHANLFINLFSMMLGSGMPELQSFDDIAYIRKTLALDKTEQEALEYFMKQMNDAHHGGWTTKMDWIFHTIKQHALN,mutated_sequence,1.0,1068.0,NP_006209.2.a2m,NP_006209.2.npy,ClinVar
+NP_006227.1,NP_006227.1.csv,MATAASNPYLPGNSLLAAGSIVHSDAAGAGGGGGGGGGGGGGGAGGGGGGMQPGSAAVTSGAYRGDPSSVKMVQSDFMQGAMAASNGGHMLSHAHQWVTALPHAAAAAAAAAAAAVEASSPWSGSAVGMAGSPQQPPQPPPPPPQGPDVKGGAGRDDLHAGTALHHRGPPHLGPPPPPPHQGHPGGWGAAAAAAAAAAAAAAAAHLPSMAGGQQPPPQSLLYSQPGGFTVNGMLSAPPGPGGGGGGAGGGAQSLVHPGLVRGDTPELAEHHHHHHHHAHPHPPHPHHAQGPPHHGGGGGGAGPGLNSHDPHSDEDTPTSDDLEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKPLLNKWLEEADSSTGSPTSIDKIAAQGRKRKKRTSIEVSVKGALESHFLKCPKPSAQEITNLADSLQLEKEVVRVWFCNRRQKEKRMTPPGIQQQTPDDVYSQVGTVSADTPPPHHGLQTSVQ,mutated_sequence,1.0,500.0,NP_006227.1.a2m,NP_006227.1.npy,ClinVar
+NP_006236.1,NP_006236.1.csv,MPYKLKKEKEPPKVAKCTAKPSSSGKDGGGENTEEAQPQPQPQPQPQAQSQPPSSNKRPSNSTPPPTQLSKIKYSGGPQIVKKERRQSSSRFNLSKNRELQKLPALKDSPTQEREELFIQKLRQCCVLFDFVSDPLSDLKFKEVKRAGLNEMVEYITHSRDVVTEAIYPEAVTMFSVNLFRTLPPSSNPTGAEFDPEEDEPTLEAAWPHLQLVYEFFLRFLESPDFQPNIAKKYIDQKFVLALLDLFDSEDPRERDFLKTILHRIYGKFLGLRAYIRRQINHIFYRFIYETEHHNGIAELLEILGSIINGFALPLKEEHKMFLIRVLLPLHKVKSLSVYHPQLAYCVVQFLEKESSLTEPVIVGLLKFWPKTHSPKEVMFLNELEEILDVIEPSEFSKVMEPLFRQLAKCVSSPHFQVAERALYYWNNEYIMSLISDNAARVLPIMFPALYRNSKSHWNKTIHGLIYNALKLFMEMNQKLFDDCTQQYKAEKQKGRFRMKEREEMWQKIEELARLNPQYPMFRAPPPLPPVYSMETETPTAEDIQLLKRTVETEAVQMLKDIKKEKVLLRRKSELPQDVYTIKALEAHKRAEEFLTASQEAL,mutated_sequence,1.0,602.0,NP_006236.1.a2m,NP_006236.1.npy,ClinVar
+NP_006256.1,NP_006256.1.csv,MFYAHFVLSKRGPLAKIWLAAHWDKKLTKAHVFECNLESSVESIISPKVKMALRTSGHLLLGVVRIYHRKAKYLLADCNEAFIKIKMAFRPGVVDLPEENREAAYNAITLPEEFHDFDQPLPDLDDIDVAQQFSLNQSRVEEITMREEVGNISILQENDFGDFGMDDREIMREGSAFEDDDMLVSTTTSNLLLESEQSTSNLNEKINHLEYEDQYKDDNFGEGNDGGILDDKLISNNDGGIFDDPPALSEAGVMLPEQPAHDDMDEDDNVSMGGPDSPDSVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQQHQQRDVIDEPIIEEPSRLQESVMEASRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQQVEQMEIPPVELPPEEPPNICQLIPELELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQEERRWNKRTQQMLHGLQRALAKTGAESISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELTQEEPYSDIIATPGPRFHII,mutated_sequence,1.0,631.0,NP_006256.1.a2m,NP_006256.1.npy,ClinVar
+NP_006261.1,NP_006261.1.csv,MSSGAASGTGRGRPRGGGPGPGDPPPSETHKLVVVGGGGVGKSALTIQFIQSYFVSDYDPTIEDSYTKICSVDGIPARLDILDTAGQEEFGAMREQYMRAGHGFLLVFAINDRQSFNEVGKLFTQILRVKDRDDFPVVLVGNKADLESQRQVPRSEASAFGASHHVAYFEASAKLRLNVDEAFEQLVRAVRKYQEQELPPSPPSAPRKKGGGCPCVLL,mutated_sequence,1.0,218.0,NP_006261.1.a2m,NP_006261.1.npy,ClinVar
+NP_006297.2,NP_006297.2.csv,MGFLKLIEIENFKSYKGRQIIGPFQRFTAIIGPNGSGKSNLMDAISFVLGEKTSNLRVKTLRDLIHGAPVGKPAANRAFVSMVYSEEGAEDRTFARVIVGGSSEYKINNKVVQLHEYSEELEKLGILIKARNFLVFQGAVESIAMKNPKERTALFEEISRSGELAQEYDKRKKEMVKAEEDTQFNYHRKKNIAAERKEAKQEKEEADRYQRLKDEVVRAQVQLQLFKLYHNEVEIEKLNKELASKNKEIEKDKKRMDKVEDELKEKKKELGKMMREQQQIEKEIKEKDSELNQKRPQYIKAKENTSHKIKKLEAAKKSLQNAQKHYKKRKGDMDELEKEMLSVEKARQEFEERMEEESQSQGRDLTLEENQVKKYHRLKEEASKRAATLAQELEKFNRDQKADQDRLDLEERKKVETEAKIKQKLREIEENQKRIEKLEEYITTSKQSLEEQKKLEGELTEEVEMAKRRIDEINKELNQVMEQLGDARIDRQESSRQQRKAEIMESIKRLYPGSVYGRLIDLCQPTQKKYQIAVTKVLGKNMDAIIVDSEKTGRDCIQYIKEQRGEPETFLPLDYLEVKPTDEKLRELKGAKLVIDVIRYEPPHIKKALQYACGNALVCDNVEDARRIAFGGHQRHKTVALDGTLFQKSGVISGGASDLKAKARRWDEKAVDKLKEKKERLTEELKEQMKAKRKEAELRQVQSQAHGLQMRLKYSQSDLEQTKTRHLALNLQEKSKLESELANFGPRINDIKRIIQSREREMKDLKEKMNQVEDEVFEEFCREIGVRNIREFEEEKVKRQNEIAKKRLEFENQKTRLGIQLDFEKNQLKEDQDKVHMWEQTVKKDENEIEKLKKEEQRHMKIIDETMAQLQDLKNQHLAKKSEVNDKNHEMEEIRKKLGGANKEMTHLQKEVTAIETKLEQKRSDRHNLLQACKMQDIKLPLSKGTMDDISQEEGSSQGEDSVSGSQRISSIYAREALIEIDYGDLCEDLKDAQAEEEIKQEMNTLQQKLNEQQSVLQRIAAPNMKAMEKLESVRDKFQETSDEFEAARKRAKKAKQAFEQIKKERFDRFNACFESVATNIDEIYKALSRNSSAQAFLGPENPEEPYLDGINYNCVAPGKRFRPMDNLSGGEKTVAALALLFAIHSYKPAPFFVLDEIDAALDNTNIGKVANYIKEQSTCNFQAIVISLKEEFYTKAESLIGVYPEQGDCVISKVLTFDLTKYPDANPNPNEQ,mutated_sequence,1.0,1233.0,NP_006297.2.a2m,NP_006297.2.npy,ClinVar
+NP_006336.3,NP_006336.3.csv,MLPGLAAAAAHRCSWSSLCRLRLRCRAAACNPSDRQEWQNLVTFGSFSNMVPCSHPYIGTLSQVKLYSTNVQKEGQGSQTLRVEKVPSFETAEGIGTELKAPLKQEPLQVRVKAVLKKREYGSKYTQNNFITGVRAINEFCLKSSDLEQLRKIRRRSPHEDTESFTVYLRSDVEAKSLEVWGSPEALAREKKLRKEAEIEYRERLFRNQKILREYRDFLGNTKPRSRTASVFFKGPGKVVMVAICINGLNCFFKFLAWIYTGSASMFSEAIHSLSDTCNQGLLALGISKSVQTPDPSHPYGFSNMRYISSLISGVGIFMMGAGLSWYHGVMGLLHPQPIESLLWAYCILAGSLVSEGATLLVAVNELRRNARAKGMSFYKYVMESRDPSTNVILLEDTAAVLGVIIAATCMGLTSITGNPLYDSLGSLGVGTLLGMVSAFLIYTNTEALLGRSIQPEQVQRLTELLENDPSVRAIHDVKATDLGLGKVRFKAEVDFDGRVVTRSYLEKQDFDQMLQEIQEVKTPEELETFMLKHGENIIDTLGAEVDRLEKELKKRNPEVRHVDLEIL,mutated_sequence,1.0,568.0,NP_006336.3.a2m,NP_006336.3.npy,ClinVar
+NP_006339.4,NP_006339.4.csv,MEGGGGSVAVAGLGARGSGAAAATVRELLQDGCYSDFLNEDFDVKTYTSQSIHQAVIAEQLAKLAQGISQLDRELHLQVVARHEDLLAQATGIESLEGVLQMMQTRIGALQGAVDRIKAKIVEPYNKIVARTAQLARLQVACDLLRRIIRILNLSKRLQGQLQGGSREITKAAQSLNELDYLSQGIDLSGIEVIENDLLFIARARLEVENQAKRLLEQGLETQNPTQVGTALQVFYNLGTLKDTITSVVDGYCATLEENINSALDIKVLTQPSQSAVRGGPGRSTMPTPGNTAALRASFWTNMEKLMDHIYAVCGQVQHLQKVLAKKRDPVSHICFIEEIVKDGQPEIFYTFWNSVTQALSSQFHMATNSSMFLKQAFEGEYPKLLRLYNDLWKRLQQYSQHIQGNFNASGTTDLYVDLQHMEDDAQDIFIPKKPDYDPEKALKDSLQPYEAAYLSKSLSRLFDPINLVFPPGGRNPPSSDELDGIIKTIASELNVAAVDTNLTLAVSKNVAKTIQLYSVKSEQLLSTQGDASQVIGPLTEGQRRNVAVVNSLYKLHQSVTKVVSSQSSFPLAAEQTIISALKAIHALMENAVQPLLTSVGDAIEAIIITMHQEDFSGSLSSSGKPDVPCSLYMKELQGFIARVMSDYFKHFECLDFVFDNTEAIAQRAVELFIRHASLIRPLGEGGKMRLAADFAQMELAVGPFCRRVSDLGKSYRMLRSFRPLLFQASEHVASSPALGDVIPFSIIIQFLFTRAPAELKSPFQRAEWSHTRFSQWLDDHPSEKDRLLLIRGALEAYVQSVRSREGKEFAPVYPIMVQLLQKAMSALQ,mutated_sequence,1.0,829.0,NP_006339.4.a2m,NP_006339.4.npy,ClinVar
+NP_006403.2,NP_006403.2.csv,MELWPCLAAALLLLLLLVQLSRAAEFYAKVALYCALCFTVSAVASLVCLLRHGGRTVENMSIIGWFVRSFKYFYGLRFEVRDPRRLQEARPCVIVSNHQSILDMMGLMEVLPERCVQIAKRELLFLGPVGLIMYLGGVFFINRQRSSTAMTVMADLGERMVRENLKVWIYPEGTRNDNGDLLPFKKGAFYLAVQAQVPIVPVVYSSFSSFYNTKKKFFTSGTVTVQVLEAIPTSGLTAADVPALVDTCHRAMRTTFLHISKTPQENGATAGSGVQPAQ,mutated_sequence,1.0,278.0,NP_006403.2.a2m,NP_006403.2.npy,ClinVar
+NP_006406.1,NP_006406.1.csv,MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERSDLTVKEKEELIEEWQPEPLVPPVPKDHPALNYNIVSGPPSHKTVVNGKECINFASFNFLGLLDNPRVKAAALASLKKYGVGTCGPRGFYGTFDVHLDLEDRLAKFMKTEEAIIYSYGFATIASAIPAYSKRGDIVFVDRAACFAIQKGLQASRSDIKLFKHNDMADLERLLKEQEIEDQKNPRKARVTRRFIVVEGLYMNTGTICPLPELVKLKYKYKARIFLEESLSFGVLGEHGRGVTEHYGINIDDIDLISANMENALASIGGFCCGRSFVIDHQRLSGQGYCFSASLPPLLAAAAIEALNIMEENPGIFAVLKEKCGQIHKALQGISGLKVVGESLSPAFHLQLEESTGSREQDVRLLQEIVDQCMNRSIALTQARYLEKEEKCLPPPSIRVVVTVEQTEEELERAASTIKEVAQAVLL,mutated_sequence,1.0,473.0,NP_006406.1.a2m,NP_006406.1.npy,ClinVar
+NP_006436.3,NP_006436.3.csv,MAGVFPYRGPGNPVPGPLAPLPDYMSEEKLQEKARKWQQLQAKRYAEKRKFGFVDAQKEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIEPVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSRKYVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFNDINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKTEDPDLPAFYFDPLINPISHRHSVKSQEPLPDDDEEFELPEFVEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKEAYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLFCRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNNKKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVTTVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSYEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLADSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLIQQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKEQSQLTATQTRTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETGYTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVHLPGQLPQHEYLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMADNPSWDGEKTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNNPKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA,mutated_sequence,1.0,2335.0,NP_006436.3.a2m,NP_006436.3.npy,ClinVar
+NP_006493.1,NP_006493.1.csv,MATGQDRVVALVDMDCFFVQVEQRQNPHLRNKPCAVVQYKSWKGGGIIAVSYEARAFGVTRSMWADDAKKLCPDLLLAQVRESRGKANLTKYREASVEVMEIMSRFAVIERASIDEAYVDLTSAVQERLQKLQGQPISADLLPSTYIEGLPQGPTTAEETVQKEGMRKQGLFQWLDSLQIDNLTSPDLQLTVGAVIVEEMRAAIERETGFQCSAGISHNKVLAKLACGLNKPNRQTLVSHGSVPQLFSQMPIRKIRSLGGKLGASVIEILGIEYMGELTQFTESQLQSHFGEKNGSWLYAMCRGIEHDPVKPRQLPKTIGCSKNFPGKTALATREQVQWWLLQLAQELEERLTKDRNDNDRVATQLVVSIRVQGDKRLSSLRRCCALTRYDAHKMSHDAFTVIKNCNTSGIQTEWSPPLTMLFLCATKFSASAPSSSTDITSFLSSDPSSLPKVPVTSSEAKTQGSGPAVTATKKATTSLESFFQKAAERQKVKEASLSSLTAPTQAPMSNSPSKPSLPFQTSQSTGTEPFFKQKSLLLKQKQLNNSSVSSPQQNPWSNCKALPNSLPTEYPGCVPVCEGVSKLEESSKATPAEMDLAHNSQSMHASSASKSVLEVTQKATPNPSLLAAEDQVPCEKCGSLVPVWDMPEHMDYHFALELQKSFLQPHSSNPQVVSAVSHQGKRNPKSPLACTNKRPRPEGMQTLESFFKPLTH,mutated_sequence,1.0,713.0,NP_006493.1.a2m,NP_006493.1.npy,ClinVar
+NP_006505.4,NP_006505.4.csv,MEFPIGSLETNNFRRFTPESLVEIEKQIAAKQGTKKAREKHREQKDQEEKPRPQLDLKACNQLPKFYGELPAELIGEPLEDLDPFYSTHRTFMVLNKGRTISRFSATRALWLFSPFNLIRRTAIKVSVHSWFSLFITVTILVNCVCMTRTDLPEKIEYVFTVIYTFEALIKILARGFCLNEFTYLRDPWNWLDFSVITLAYVGTAIDLRGISGLRTFRVLRALKTVSVIPGLKVIVGALIHSVKKLADVTILTIFCLSVFALVGLQLFKGNLKNKCVKNDMAVNETTNYSSHRKPDIYINKRGTSDPLLCGNGSDSGHCPDGYICLKTSDNPDFNYTSFDSFAWAFLSLFRLMTQDSWERLYQQTLRTSGKIYMIFFVLVIFLGSFYLVNLILAVVTMAYEEQNQATTDEIEAKEKKFQEALEMLRKEQEVLAALGIDTTSLHSHNGSPLTSKNASERRHRIKPRVSEGSTEDNKSPRSDPYNQRRMSFLGLASGKRRASHGSVFHFRSPGRDISLPEGVTDDGVFPGDHESHRGSLLLGGGAGQQGPLPRSPLPQPSNPDSRHGEDEHQPPPTSELAPGAVDVSAFDAGQKKTFLSAEYLDEPFRAQRAMSVVSIITSVLEELEESEQKCPPCLTSLSQKYLIWDCCPMWVKLKTILFGLVTDPFAELTITLCIVVNTIFMAMEHHGMSPTFEAMLQIGNIVFTIFFTAEMVFKIIAFDPYYYFQKKWNIFDCIIVTVSLLELGVAKKGSLSVLRSFRLLRVFKLAKSWPTLNTLIKIIGNSVGALGNLTIILAIIVFVFALVGKQLLGENYRNNRKNISAPHEDWPRWHMHDFFHSFLIVFRILCGEWIENMWACMEVGQKSICLILFLTVMVLGNLVVLNLFIALLLNSFSADNLTAPEDDGEVNNLQVALARIQVFGHRTKQALCSFFSRSCPFPQPKAEPELVVKLPLSSSKAENHIAANTARGSSGGLQAPRGPRDEHSDFIANPTVWVSVPIAEGESDLDDLEDDGGEDAQSFQQEVIPKGQQEQLQQVERCGDHLTPRSPGTGTSSEDLAPSLGETWKDESVPQVPAEGVDDTSSSEGSTVDCLDPEEILRKIPELADDLEEPDDCFTEGCIRHCPCCKLDTTKSPWDVGWQVRKTCYRIVEHSWFESFIIFMILLSSGSLAFEDYYLDQKPTVKALLEYTDRVFTFIFVFEMLLKWVAYGFKKYFTNAWCWLDFLIVNISLISLTAKILEYSEVAPIKALRTLRALRPLRALSRFEGMRVVVDALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFWRCINYTDGEFSLVPLSIVNNKSDCKIQNSTGSFFWVNVKVNFDNVAMGYLALLQVATFKGWMDIMYAAVDSREVNMQPKWEDNVYMYLYFVIFIIFGGFFTLNLFVGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKFQGFVFDIVTRQAFDITIMVLICLNMITMMVETDDQSEEKTKILGKINQFFVAVFTGECVMKMFALRQYYFTNGWNVFDFIVVVLSIASLIFSAILKSLQSYFSPTLFRVIRLARIGRILRLIRAAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMSSFPHVRWEAGIDDMFNFQTFANSMLCLFQITTSAGWDGLLSPILNTGPPYCDPNLPNSNGTRGDCGSPAVGIIFFTTYIIISFLIMVNMYIAVILENFNVATEESTEPLSEDDFDMFYETWEKFDPEATQFITFSALSDFADTLSGPLRIPKPNRNILIQMDLPLVPGDKIHCLDILFAFTKNVLGESGELDSLKANMEEKFMATNLSKSSYEPIATTLRWKQEDISATVIQKAYRSYVLHRSMALSNTPCVPRAEEEAASLPDEGFVAFTANENCVLPDKSETASATSFPPSYESVTRGLSDRVNMRTSSSIQNEDEATSMELIAPGP,mutated_sequence,1.0,1956.0,NP_006505.4.a2m,NP_006505.4.npy,ClinVar
+NP_006507.2,NP_006507.2.csv,MEPSSKKLTGRLMLAVGGAVLGSLQFGYNTGVINAPQKVIEEFYNQTWVHRYGESILPTTLTTLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRNSMLMMNLLAFVSAVLMGFSKLGKSFEMLILGRFIIGVYCGLTTGFVPMYVGEVSPTALRGALGTLHQLGIVVGILIAQVFGLDSIMGNKDLWPLLLSIIFIPALLQCIVLPFCPESPRFLLINRNEENRAKSVLKKLRGTADVTHDLQEMKEESRQMMREKKVTILELFRSPAYRQPILIAVVLQLSQQLSGINAVFYYSTSIFEKAGVQQPVYATIGSGIVNTAFTVVSLFVVERAGRRTLHLIGLAGMAGCAILMTIALALLEQLPWMSYLSIVAIFGFVAFFEVGPGPIPWFIVAELFSQGPRPAAIAVAGFSNWTSNFIVGMCFQYVEQLCGPYVFIIFTVLLVLFFIFTYFKVPETKGRTFDEIASGFRQGGASQSDKTPEELFHPLGADSQV,mutated_sequence,1.0,492.0,NP_006507.2.a2m,NP_006507.2.npy,ClinVar
+NP_006508.2,NP_006508.2.csv,MALQSQASEEAKGPWQEADQEQQEPVGSPEPESEPEPEPEPEPVPVPPPEPQPEPQPLPDPAPLPELEFESERVHEPEPTPTVETRGTARGFQPPEGGFGWVVVFAATWCNGSIFGIHNSVGILYSMLLEEEKEKNRQVEFQAAWVGALAMGMIFFCSPIVSIFTDRLGCRITATAGAAVAFIGLHTSSFTSSLSLRYFTYGILFGCGCSFAFQPSLVILGHYFQRRLGLANGVVSAGSSIFSMSFPFLIRMLGDKIKLAQTFQVLSTFMFVLMLLSLTYRPLLPSSQDTPSKRGVRTLHQRFLAQLRKYFNMRVFRQRTYRIWAFGIAAAALGYFVPYVHLMKYVEEEFSEIKETWVLLVCIGATSGLGRLVSGHISDSIPGLKKIYLQVLSFLLLGLMSMMIPLCRDFGGLIVVCLFLGLCDGFFITIMAPIAFELVGPMQASQAIGYLLGMMALPMIAGPPIAGLLRNCFGDYHVAFYFAGVPPIIGAVILFFVPLMHQRMFKKEQRDSSKDKMLAPDPDPNGELLPGSPNPEEPI,mutated_sequence,1.0,539.0,NP_006508.2.a2m,NP_006508.2.npy,ClinVar
+NP_006556.1,NP_006556.1.csv,MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQTDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVNMEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPLPEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEGKDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCPRRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKSDLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNEDEEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIENIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR,mutated_sequence,1.0,727.0,NP_006556.1.a2m,NP_006556.1.npy,ClinVar
+NP_006570.1,NP_006570.1.csv,MTTNAGPLHPYWPQHLRLDNFVPNDRPTWHILAGLFSVTGVLVVTTWLLSGRAAVVPLGTWRRLSLCWFAVCGFIHLVIEGWFVLYYEDLLGDQAFLSQLWKEYAKGDSRYILGDNFTVCMETITACLWGPLSLWVVIAFLRQHPLRFILQLVVSVGQIYGDVLYFLTEHRDGFQHGELGHPLYFWFYFVFMNALWLVLPGVLVLDAVKHLTHAQSTLDAKATKAKSKKN,mutated_sequence,1.0,230.0,NP_006570.1.a2m,NP_006570.1.npy,ClinVar
+NP_006584.1,NP_006584.1.csv,MQLEHCLSPSIMLSKKFLNVSSSYPHSGGSELVLHDHPIISTTDNLERSSPLKKITRGMTNQSDTDNFPDSKDSPGDVQRSKLSPVLDGVSELRHSFDGSAADRYLLSQSSQPQSAATAPSAMFPYPGQHGPAHPAFSIGSPSRYMAHHPVITNGAYNSLLSNSSPQGYPTAGYPYPQQYGHSYQGAPFYQFSSTQPGLVPGKAQVYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNISGLDPTAHYNIFVDVILADPNHWRFQGGKWVPCGKADTNVQGNRVYMHPDSPNTGAHWMRQEISFGKLKLTNNKGASNNNGQMVVLQSLHKYQPRLHVVEVNEDGTEDTSQPGRVQTFTFPETQFIAVTAYQNTDITQLKIDHNPFAKGFRDNYDTIYTGCDMDRLTPSPNDSPRSQIVPGARYAMAGSFLQDQFVSNYAKARFHPGAGAGPGPGTDRSVPHTNGLLSPQQAEDPGAPSPQRWFVTPANNRLDFAASAYDTATDFAGNAATLLSYAAAGVKALPLQAAGCTGRPLGYYADPSGWGARSPPQYCGTKSGSVLPCWPNSAAAAARMAGANPYLGEEAEGLAAERSPLPPGAAEDAKPKDLSDSSWIETPSSIKSIDSSDSGIYEQAKRRRISPADTPVSESSSPLKSEVLAQRDCEKNCAKDISGYYGFYSHS,mutated_sequence,1.0,682.0,NP_006584.1.a2m,NP_006584.1.npy,ClinVar
+NP_006746.1,NP_006746.1.csv,MSSSPVKRQRMESALDQLKQFTTVVADTGDFHAIDEYKPQDATTNPSLILAAAQMPAYQELVEEAIAYGRKLGGSQEDQIKNAIDKLFVLFGAEILKKIPGRVSTEVDARLSFDKDAMVARARRLIELYKEAGISKDRILIKLSSTWEGIQAGKELEEQHGIHCNMTLLFSFAQAVACAEAGVTLISPFVGRILDWHVANTDKKSYEPLEDPGVKSVTKIYNYYKKFSYKTIVMGASFRNTGEIKALAGCDFLTISPKLLGELLQDNAKLVPVLSAKAAQASDLEKIHLDEKSFRWLHNEDQMAVEKLSDGIRKFAADAVKLERMLTERMFNAENGK,mutated_sequence,1.0,337.0,NP_006746.1.a2m,NP_006746.1.npy,ClinVar
+NP_006763.2,NP_006763.2.csv,MSRSRASIHRGSIPAMSYAPFRDVRGPSMHRTQYVHSPYDRPGWNPRFCIISGNQLLMLDEDEIHPLLIRDRRSESSRNKLLRRTVSVPVEGRPHGEHEYHLGRSRRKSVPGGKQYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKSQPKLDRTSSFRQILPRFRSADHDRARLMQSFKESHSHESLLSPSSAAEALELNLDEDSIIKPVHSSILGQEFCFEVTTSSGTKCFACRSAAERDKWIENLQRAVKPNKDNSRRVDNVLKLWIIEARELPPKKRYYCELCLDDMLYARTTSKPRSASGDTVFWGEHFEFNNLPAVRALRLHLYRDSDKKRKKDKAGYVGLVTVPVATLAGRHFTEQWYPVTLPTGSGGSGGMGSGGGGGSGGGSGGKGKGGCPAVRLKARYQTMSILPMELYKEFAEYVTNHYRMLCAVLEPALNVKGKEEVASALVHILQSTGKAKDFLSDMAMSEVDRFMEREHLIFRENTLATKAIEEYMRLIGQKYLKDAIGEFIRALYESEENCEVDPIKCTASSLAEHQANLRMCCELALCKVVNSHCVFPRELKEVFASWRLRCAERGREDIADRLISASLFLRFLCPAIMSPSLFGLMQEYPDEQTSRTLTLIAKVIQNLANFSKFTSKEDFLGFMNEFLELEWGSMQQFLYEISNLDTLTNSSSFEGYIDLGRELSTLHALLWEVLPQLSKEALLKLGPLPRLLNDISTALRNPNIQRQPSRQSERPRPQPVVLRGPSAEMQGYMMRDLNSSIDLQSFMARGLNSSMDMARLPSPTKEKPPPPPPGGGKDLFYVSRPPLARSSPAYCTSSSDITEPEQKMLSVNKSVSMLDLQGDGPGGRLNSSSVSNLAAVGDLLHSSQASLTAALGLRPAPAGRLSQGSGSSITAAGMRLSQMGVTTDGVPAQQLRIPLSFQNPLFHMAADGPGPPGGHGGGGGHGPPSSHHHHHHHHHHRGGEPPGDTFAPFHGYSKSEDLSSGVPKPPAASILHSHSYSDEFGPSGTDFTRRQLSLQDNLQHMLSPPQITIGPQRPAPSGPGGGSGGGSGGGGGGQPPPLQRGKSQQLTVSAAQKPRPSSGNLLQSPEPSYGPARPRQQSLSKEGSIGGSGGSGGGGGGGLKPSITKQHSQTPSTLNPTMPASERTVAWVSNMPHLSADIESAHIEREEYKLKEYSKSMDESRLDRVKEYEEEIHSLKERLHMSNRKLEEYERRLLSQEEQTSKILMQYQARLEQSEKRLRQQQAEKDSQIKSIIGRLMLVEEELRRDHPAMAEPLPEPKKRLLDAQERQLPPLGPTNPRVTLAPPWNGLAPPAPPPPPRLQITENGEFRNTADH,mutated_sequence,1.0,1343.0,NP_006763.2.a2m,NP_006763.2.npy,ClinVar
+NP_006867.1,NP_006867.1.csv,MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVKAQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGLREMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC,mutated_sequence,1.0,415.0,NP_006867.1.a2m,NP_006867.1.npy,ClinVar
+NP_008816.3,NP_008816.3.csv,MEGCDSPVVSGKDNGCGIPQHQQWTELNSTHLPDKPSSMEQSTGESHGPLDSLRAPFNERLAESTASAGPPSEPASKEVTCNECSASFASLQTYMEHHCPSARPPPPLREESASDTGEEGDEESDVENLAGEIVYQPDGSAYIVESLSQLTQGGGACGSGSGSGPLPSLFLNSLPGAGGKQGDPSCAAPVYPQIINTFHIASSFGKWFEGPDQAFPNTSALAGLSPVLHSFRVFDVRHKSNKDYLNSDGSAKSSCVSKDVPNNVDLSKFDGFVLYGKRKPILMCFLCKLSFGYVRSFVTHAVHDHRMTLSEDERKILSNKNISAIIQGIGKDKEPLVSFLEPKNKNFQHPLVSTANLIGPGHSFYGKFSGIRMEGEEALPAGSAAGPEQPQAGLLTPSTLLNLGGLTSSVLKTPITSVPLGPLASSPTKSSEGKDSGAAEGEKQEVGDGDCFSEKVEPAEEEAEEEEEEEEAEEEEEEEEEEEEEEEDEGCKGLFPSELDEELEDRPHEEPGAAAGSSSKKDLALSNQSISNSPLMPNVLQTLSRGTASTSSNSASSFVVFDGANRRNRLSFNSEGVRANVAEGGRRLDFADESANKDNATAPEPNESTEGDDGGFVPHHQHAGSLCELGVGECPSGSGVECPKCDTVLGSSRSLGGHMTMMHSRNSCKTLKCPKCNWHYKYQQTLEAHMKEKHPEPGGSCVYCKSGQPHPRLARGESYTCGYKPFRCEVCNYSTTTKGNLSIHMQSDKHLNNMQNLQNGGGEQVFSHTAGAAAAAVAAAAAAANISSSCGAPSPTKPKTKPTWRCEVCDYETNVARNLRIHMTSEKHMHNMMLLQQNMTQIQHNRHLGLGSLPSPAEAELYQYYLAQNMNLPNLKMDSAASDAQFMMSGFQLDPAGPMAAMTPALVGGEIPLDMRLGGGQLVSEELMNLGESFIQTNDPSLKLFQCAVCNKFTTDNLDMLGLHMNVERSLSEDEWKAVMGDSYQCKLCRYNTQLKANFQLHCKTDKHVQKYQLVAHIKEGGKANEWRLKCVAIGNPVHLKCNACDYYTNSLEKLRLHTVNSRHEASLKLYKHLQQHESGVEGESCYYHCVLCNYSTKAKLNLIQHVRSMKHQRSESLRKLQRLQKGLPEEDEDLGQIFTIRRCPSTDPEEAIEDVEGPSETAADPEELAKDQEGGASSSQAEKELTDSPATSKRISFPGSSESPLSSKRPKTAEEIKPEQMYQCPYCKYSNADVNRLRVHAMTQHSVQPMLRCPLCQDMLNNKIHLQLHLTHLHSVAPDCVEKLIMTVTTPEMVMPSSMFLPAAVPDRDGNSNLEEAGKQPETSEDLGKNILPSASTEQSGDLKPSPADPGSVREDSGFICWKKGCNQVFKTSAALQTHFNEVHAKRPQLPVSDRHVYKYRCNQCSLAFKTIEKLQLHSQYHVIRAATMCCLCQRSFRTFQALKKHLETSHLELSEADIQQLYGGLLANGDLLAMGDPTLAEDHTIIVEEDKEEESDLEDKQSPTGSDSGSVQEDSGSEPKRALPFRKGPNFTMEKFLDPSRPYKCTVCKESFTQKNILLVHYNSVSHLHKLKRALQESATGQPEPTSSPDNKPFKCNTCNVAYSQSSTLEIHMRSVLHQTKARAAKLEAASGSSNGTGNSSSISLSSSTPSPVSTSGSNTFTTSNPSSAGIAPSSNLLSQVPTESVGMPPLGNPIGANIASPSEPKEANRKKLADMIASRQQQQQQQQQQQQQQQQQQQAQTLAQAQAQVQAHLQQELQQQAALIQSQLFNPTLLPHFPMTTETLLQLQQQQHLLFPFYIPSAEFQLNPEVSLPVTSGALTLTGTGPGLLEDLKAQVQVPQQSHQQILPQQQQNQLSIAQSHSALLQPSQHPEKKNKLVIKEKEKESQRERDSAEGGEGNTGPKETLPDALKAKEKKELAPGGGSEPSMLPPRIASDARGNATKALLENFGFELVIQYNENKQKVQKKNGKTDQGENLEKLECDSCGKLFSNILILKSHQEHVHQNYFPFKQLERFAKQYRDHYDKLYPLRPQTPEPPPPPPPPPPPPLPAAPPQPASTPAIPASAPPITSPTIAPAQPSVPLTQLSMPMELPIFSPLMMQTMPLQTLPAQLPPQLGPVEPLPADLAQLYQHQLNPTLLQQQNKRPRTRITDDQLRVLRQYFDINNSPSEEQIKEMADKSGLPQKVIKHWFRNTLFKERQRNKDSPYNFSNPPITSLEELKIDSRPPSPEPPKQEYWGSKRSSRTRFTDYQLRVLQDFFDANAYPKDDEFEQLSNLLNLPTRVIVVWFQNARQKARKNYENQGEGKDGERRELTNDRYIRTSNLNYQCKKCSLVFQRIFDLIKHQKKLCYKDEDEEGQDDSQNEDSMDAMEILTPTSSSCSTPMPSQAYSAPAPSANNTASSAFLQLTAEAEELATFNSKTEAGDEKPKLAEAPSAQPNQTQEKQGQPKPELQQQEQPEQKTNTPQQKLPQLVSLPSLPQPPPQAPPPQCPLPQSSPSPSQLSHLPLKPLHTSTPQQLANLPPQLIPYQCDQCKLAFPSFEHWQEHQQLHFLSAQNQFIHPQFLDRSLDMPFMLFDPSNPLLASQLLSGAIPQIPASSATSPSTPTSTMNTLKRKLEEKASASPGENDSGTGGEEPQRDKRLRTTITPEQLEILYQKYLLDSNPTRKMLDHIAHEVGLKKRVVQVWFQNTRARERKGQFRAVGPAQAHRRCPFCRALFKAKTALEAHIRSRHWHEAKRAGYNLTLSAMLLDCDGGLQMKGDIFDGTSFSHLPPSSSDGQGVPLSPVSKTMELSPRTLLSPSSIKVEGIEDFESPSMSSVNLNFDQTKLDNDDCSSVNTAITDTTTGDEGNADNDSATGIATETKSSSAPNEGLTKAAMMAMSEYEDRLSSGLVSPAPSFYSKEYDNEGTVDYSETSSLADPCSPSPGASGSAGKSGDSGDRPGQKRFRTQMTNLQLKVLKSCFNDYRTPTMLECEVLGNDIGLPKRVVQVWFQNARAKEKKSKLSMAKHFGINQTSYEGPKTECTLCGIKYSARLSVRDHIFSQQHISKVKDTIGSQLDKEKEYFDPATVRQLMAQQELDRIKKANEVLGLAAQQQGMFDNTPLQALNLPTAYPALQGIPPVLLPGLNSPSLPGFTPSNTALTSPKPNLMGLPSTTVPSPGLPTSGLPNKPSSASLSSPTPAQATMAMGPQQPPQQQQQQQQPQVQQPPPPPAAQPPPTPQLPLQQQQQRKDKDSEKVKEKEKAHKGKGEPLPVPKKEKGEAPTATAATISAPLPTMEYAVDPAQLQALQAALTSDPTALLTSQFLPYFVPGFSPYYAPQIPGALQSGYLQPMYGMEGLFPYSPALSQALMGLSPGSLLQQYQQYQQSLQEAIQQQQQRQLQQQQQQKVQQQQPKASQTPVPPGAPSPDKDPAKESPKPEEQKNTPREVSPLLPKLPEEPEAESKSADSLYDPFIVPKVQYKLVCRKCQAGFSDEEAARSHLKSLCFFGQSVVNLQEMVLHVPTGGGGGGSGGGGGGGGGGGGGGSYHCLACESALCGEEALSQHLESALHKHRTITRAARNAKEHPSLLPHSACFPDPSTASTSQSAAHSNDSPPPPSAAAPSSASPHASRKSWPQVVSRASAAKPPSFPPLSSSSTVTSSSCSTSGVQPSMPTDDYSEESDTDLSQKSDGPASPVEGPKDPSCPKDSGLTSVGTDTFRL,mutated_sequence,1.0,3703.0,NP_008816.3.a2m,NP_008816.3.npy,ClinVar
+NP_008818.3,NP_008818.3.csv,MSTTLLSAFYDVDFLCKTEKSLANLNLNNMLDKKAVGTPVAAAPSSGFAPGFLRRHSASNLHALAHPAPSPGSCSPKFPGAANGSSCGSAAAGGPTSYGTLKEPSGGGGTALLNKENKFRDRSFSENGDRSQHLLHLQQQQKGGGGSQINSTRYKTELCRPFEESGTCKYGEKCQFAHGFHELRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFIHNADERRPAPSGGASGDLRAFGTRDALHLGFPREPRPKLHHSLSFSGFPSGHHQPPGGLESPLLLDSPTSRTPPPPSCSSASSCSSSASSCSSASAASTPSGAPTCCASAAAAAAAALLYGTGGAEDLLAPGAPCAACSSASCANNAFAFGPELSSLITPLAIQTHNFAAVAAAAYYRSQQQQQQQGLAPPAQPPAPPSATLPAGAAAPPSPPFSFQLPRRLSDSPVFDAPPSPPDSLSDRDSYLSGSLSSGSLSGSESPSLDPGRRLPIFSRLSISDD,mutated_sequence,1.0,494.0,NP_008818.3.a2m,NP_008818.3.npy,ClinVar
+NP_008846.2,NP_008846.2.csv,MGCFFSKRRKADKESRPENEEERPKQYSWDQREKVDPKDYMFSGLKDETVGRLPGTVAGQQFLIQDCENCNIYIFDHSATVTIDDCTNCIIFLGPVKGSVFFRNCRDCKCTLACQQFRVRDCRKLEVFLCCATQPIIESSSNIKFGCFQWYYPELAFQFKDAGLSIFNNTWSNIHDFTPVSGELNWSLLPEDAVVQDYVPIPTTEELKAVRVSTEANRSIVPISRGQRQKSSDESCLVVLFAGDYTIANARKLIDEMVGKGFFLVQTKEVSMKAEDAQRVFREKAPDFLPLLNKGPVIALEFNGDGAVEVCQLIVNEIFNGTKMFVSESKETASGDVDSFYNFADIQMGI,mutated_sequence,1.0,350.0,NP_008846.2.a2m,NP_008846.2.npy,ClinVar
+NP_008853.3,NP_008853.3.csv,MAQALLVPPGPESFRLFTRESLAAIEKRAAEEKAKKPKKEQDNDDENKPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVMNKGKAIFRFSATSALYILTPLNPVRKIAIKILVHSLFSMLIMCTILTNCVFMTLSNPPDWTKNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFSVIVMAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPSDSAFETNTTSYFNGTMDSNGTFVNVTMSTFNWKDYIGDDSHFYVLDGQKDPLLCGNGSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDYWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLVNLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAVAAASAASRDFSGIGGLGELLESSSEASKLSSKSAKEWRNRRKKRRQREHLEGNNKGERDSFPKSESEDSVKRSSFLFSMDGNRLTSDKKFCSPHQSLLSIRGSLFSPRRNSKTSIFSFRGRAKDVGSENDFADDEHSTFEDSESRRDSLFVPHRHGERRNSNVSQASMSSRMVPGLPANGKMHSTVDCNGVVSLVGGPSALTSPTGQLPPEGTTTETEVRKRRLSSYQISMEMLEDSSGRQRAVSIASILTNTMEELEESRQKCPPCWYRFANVFLIWDCCDAWLKVKHLVNLIVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLTVGNLVFTGIFTAEMVLKIIAMDPYYYFQEGWNIFDGIIVSLSLMELGLSNVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLIVFMLVMVIGNLVVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDYVKNKMRECFQKAFFRKPKVIEIHEGNKIDSCMSNNTGIEISKELNYLRDGNGTTSGVGTGSSVEKYVIDENDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESELEESKEKLNATSSSEGSTVDVVLPREGEQAETEPEEDLKPEACFTEGCIKKFPFCQVSTEEGKGKIWWNLRKTCYSIVEHNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQTYFTNAWCWLDFLIVDVSLVSLVANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCVNMTTGNMFDISDVNNLSDCQALGKQARWKNVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRDVKLQPVYEENLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPANKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQGKYMTLVLSRINLVFIVLFTGEFVLKLVSLRHYYFTIGWNIFDFVVVILSIVGMFLAEMIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEAGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSAPPDCDPDTIHPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEDRFMASNPSKVSYEPITTTLKRKQEEVSAAIIQRNFRCYLLKQRLKNISSNYNKEAIKGRIDLPIKQDMIIDKLNGNSTPEKTDGSSSTTSPPSYDSVTKPDKEKFEKDKPEKESKGKEVRENQK,mutated_sequence,1.0,2000.0,NP_008853.3.a2m,NP_008853.3.npy,ClinVar
+NP_008872.1,NP_008872.1.csv,MAEEQDLSEVELSPVGSEEPRCLSPGSAPSLGPDGGGGGSGLRASPGPGELGKVKKEQQDGEADDDKFPVCIREAVSQVLSGYDWTLVPMPVRVNGASKSKPHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESDKRPFIEEAERLRMQHKKDHPDYKYQPRRRKNGKAAQGEAECPGGEAEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSGQSHGPPTPPTTPKTELQSGKADPKRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMETFDVAELDQYLPPNGHPGHVSSYSAAGYGLGSALAVASGHSAWISKPPGVALPTVSPPGVDAKAQVKTETAGPQGPPHYTDQPSTSQIAYTSLSLPHYGSAFPSISRPQFDYSDHQPSGPYYGHSGQASGLYSAFSYMGPSQRPLYTAISDPSPSGPQSHSPTHWEQPVYTTLSRP,mutated_sequence,1.0,466.0,NP_008872.1.a2m,NP_008872.1.npy,ClinVar
+NP_008877.2,NP_008877.2.csv,MSSTLSPTDFDSLEIQGQYSDINNRWDLPDSDWDNDSSSARLFERSRIKALADEREAVQKKTFTKWVNSHLARVTCRVGDLYSDLRDGRNLLRLLEVLSGEILPKPTKGRMRIHCLENVDKALQFLKEQKVHLENMGSHDIVDGNHRLTLGLVWTIILRFQIQDISVETEDNKEKKSAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAFNAIVHKHRPDLLDFESLKKCNAHYNLQNAFNLAEKELGLTKLLDPEDVNVDQPDEKSIITYVATYYHYFSKMKALAVEGKRIGKVLDHAMEAERLVEKYESLASELLQWIEQTIVTLNDRQLANSLSGVQNQLQSFNSYRTVEKPPKFTEKGNLEVLLFTIQSKLRANNQKVYTPREGRLISDINKAWERLEKAEHERELALRTELIRQEKLEQLAARFDRKAAMRETWLSENQRLVSQDNFGLELAAVEAAVRKHEAIETDIVAYSGRVQAVDAVAAELAAERYHDIKRIAARQHNVARLWDFLRQMVAARRERLLLNLELQKVFQDLLYLMDWMEEMKGRLQSQDLGRHLAGVEDLLQLHELVEADIAVQAERVRAVSASALRFCNPGKEYRPCDPQLVSERVAKLEQSYEALCELAAARRARLEESRRLWRFLWEVGEAEAWVREQQHLLASADTGRDLTGALRLLNKHTALRGEMSGRLGPLKLTLEQGQQLVAEGHPGASQASARAAELQAQWERLEALAEERAQRLAQAASLYQFQADANDMEAWLVDALRLVSSPELGHDEFSTQALARQHRALEEEIRSHRPTLDALREQAAALPPTLSRTPEVQSRVPTLERHYEELQARAGERARALEAALALYTMLSEAGACGLWVEEKEQWLNGLALPERLEDLEVVQQRFETLEPEMNTLAAQITAVNDIAEQLLKANPPGKDRIVNTQEQLNHRWQQFRRLADGKKAALTSALSIQNYHLECTETQAWMREKTKVIESTQGLGNDLAGVLALQRKLAGTERDLEAIAARVGELTREANALAAGHPAQAVAINARLREVQTGWEDLRATMRRREESLGEARRLQDFLRSLDDFQAWLGRTQTAVASEEGPATLPEAEALLAQHAALRGEVERAQSEYSRLRALGEEVTRDQADPQCLFLRQRLEALGTGWEELGRMWESRQGRLAQAHGFQGFLRDARQAEGVLSSQEYVLSHTEMPGTLQAADAAIKKLEDFMSTMDANGERIHGLLEAGRQLVSEGNIHADKIREKADSIERRHKKNQDAAQQFLGRLRDNREQQHFLQDCHELKLWIDEKMLTAQDVSYDEARNLHTKWQKHQAFMAELAANKDWLDKVDKEGRELTLEKPELKALVSEKLRDLHRRWDELETTTQAKARSLFDANRAELFAQSCCALESWLESLQAQLHSDDYGKDLTSVNILLKKQQMLEWEMAVREKEVEAIQAQAKALAQEDQGAGEVERTSRAVEEKFRALCQPMRERCRRLQASREQHQFHRDVEDEILWVTERLPMASSMEHGKDLPSVQLLMKKNQTLQKEIQGHEPRIADLRERQRALGAAAAGPELAELQEMWKRLGHELELRGKRLEDALRAQQFYRDAAEAEAWMGEQELHMMGQEKAKDELSAQAEVKKHQVLEQALADYAQTIHQLAASSQDMIDHEHPESTRISIRQAQVDKLYAGLKELAGERRERLQEHLRLCQLRRELDDLEQWIQEREVVAASHELGQDYEHVTMLRDKFREFSRDTSTIGQERVDSANALANGLIAGGHAARATVAEWKDSLNEAWADLLELLDTRGQVLAAAYELQRFLHGARQALARVQHKQQQLPDGTGRDLNAAEALQRRHCAYEHDIQALSPQVQQVQDDGHRLQKAYAGDKAEEIGRHMQAVAEAWAQLQGSSAARRQLLLDTTDKFRFFKAVRELMLWMDEVNLQMDAQERPRDVSSADLVIKNQQGIKAEIEARADRFSSCIDMGKELLARSHYAAEEISEKLSQLQARRQETAEKWQEKMDWLQLVLEVLVFGRDAGMAEAWLCSQEPLVRSAELGCTVDEVESLIKRHEAFQKSAVAWEERFCALEKLTALEEREKERKRKREEEERRKQPPAPEPTASVPPGDLVGGQTASDTTWDGTQPRPPPSTQAPSVNGVCTDGEPSQPLLGQQRLEHSSFPEGPGPGSGDEANGPRGERQTRTRGPAPSAMPQSRSTESAHAATLPPRGPEPSAQEQMEGMLCRKQEMEAFGKKAANRSWQNVYCVLRRGSLGFYKDAKAASAGVPYHGEVPVSLARAQGSVAFDYRKRKHVFKLGLQDGKEYLFQAKDEAEMSSWLRVVNAAIATASSASGEPEEPVVPSTTRGMTRAMTMPPVSPVGAEGPVVLRSKDGREREREKRFSFFKKNK,mutated_sequence,1.0,2390.0,NP_008877.2.a2m,NP_008877.2.npy,ClinVar
+NP_008880.2,NP_008880.2.csv,MAPSGLKAVVGEKILSGVIRSVKKDGEWKVLIMDHPSMRILSSCCKMSDILAEGITIVEDINKRREPIPSLEAIYLLSPTEKSVQALIKDFQGTPTFTYKAAHIFFTDTCPEPLFSELGRSRLAKVVKTLKEIHLAFLPYEAQVFSLDAPHSTYNLYCPFRAEERTRQLEVLAQQIATLCATLQEYPAIRYRKGPEDTAQLAHAVLAKLNAFKADTPSLGEGPEKTRSQLLIMDRAADPVSPLLHELTFQAMAYDLLDIEQDTYRYETTGLSEAREKAVLLDEDDDLWVELRHMHIADVSKKVTELLRTFCESKRLTTDKANIKDLSQILKKMPQYQKELNKYSTHLHLADDCMKHFKGSVEKLCSVEQDLAMGSDAEGEKIKDSMKLIVPVLLDAAVPAYDKIRVLLLYILLRNGVSEENLAKLIQHANVQAHSSLIRNLEQLGGTVTNPGGSGTSSRLEPRERMEPTYQLSRWTPVIKDVMEDAVEDRLDRNLWPFVSDPAPTASSQAAVSARFGHWHKNKAGIEARAGPRLIVYVMGGVAMSEMRAAYEVTRATEGKWEVLIGSSHILTPTRFLDDLKALDKKLEDIALP,mutated_sequence,1.0,593.0,NP_008880.2.a2m,NP_008880.2.npy,ClinVar
+NP_009060.2,NP_009060.2.csv,MLLDAGPQFPAIGVGSFARHHHHSAAAAAAAAAEMQDRELSLAAAQNGFVDSAAAHMGAFKLNPGAHELSPGQSSAFTSQGPGAYPGSAAAAAAAAALGPHAAHVGSYSGPPFNSTRDFLFRSRGFGDSAPGGGQHGLFGPGAGGLHHAHSDAQGHLLFPGLPEQHGPHGSQNVLNGQMRLGLPGEVFGRSEQYRQVASPRTDPYSAAQLHNQYGPMNMNMGMNMAAAAAHHHHHHHHHPGAFFRYMRQQCIKQELICKWIDPEQLSNPKKSCNKTFSTMHELVTHVSVEHVGGPEQSNHVCFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFQCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSPQGSESSPAASSGYESSTPPGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSAVHRGGGSGSGGAGGGSGGGSGSGGGGGGAGGGGGGSSGGGSGTAGGHSGLSSNFNEWYV,mutated_sequence,1.0,532.0,NP_009060.2.a2m,NP_009060.2.npy,ClinVar
+NP_009167.1,NP_009167.1.csv,MGSRASTLLRDEELEEIKKETGFSHSQITRLYSRFTSLDKGENGTLSREDFQRIPELAINPLGDRIINAFFPEGEDQVNFRGFMRTLAHFRPIEDNEKSKDVNGPEPLNSRSNKLHFAFRLYDLDKDEKISRDELLQVLRMMVGVNISDEQLGSIADRTIQEADQDGDSAISFTEFVKVLEKVDVEQKMSIRFLH,mutated_sequence,1.0,195.0,NP_009167.1.a2m,NP_009167.1.npy,ClinVar
+NP_009185.2,NP_009185.2.csv,MGEVEAPGRLWLESPPGGAPPIFLPSDGQALVLGRGPLTQVTDRKCSRTQVELVADPETRTVAVKQLGVNPSTTGTQELKPGLEGSLGVGDTLYLVNGLHPLTLRWEETRTPESQPDTPPGTPLVSQDEKRDAELPKKRMRKSNPGWENLEKLLVFTAAGVKPQGKVAGFDLDGTLITTRSGKVFPTGPSDWRILYPEIPRKLRELEAEGYKLVIFTNQMSIGRGKLPAEEFKAKVEAVVEKLGVPFQVLVATHAGLYRKPVTGMWDHLQEQANDGTPISIGDSIFVGDAAGRPANWAPGRKKKDFSCADRLFALNLGLPFATPEEFFLKWPAAGFELPAFDPRTVSRSGPLCLPESRALLSASPEVVVAVGFPGAGKSTFLKKHLVSAGYVHVNRDTLGSWQRCVTTCETALKQGKRVAIDNTNPDAASRARYVQCARAAGVPCRCFLFTATLEQARHNNRFREMTDSSHIPVSDMVMYGYRKQFEAPTLAEGFSAILEIPFRLWVEPRLGRLYCQFSEG,mutated_sequence,1.0,521.0,NP_009185.2.a2m,NP_009185.2.npy,ClinVar
+NP_009193.2,NP_009193.2.csv,MASKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLEDAKKEGPYDVVVLPGGNLGAQNLSESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSKVTTHPLAKDKMMNGGHYTYSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQVKAPLVLKD,mutated_sequence,1.0,189.0,NP_009193.2.a2m,NP_009193.2.npy,ClinVar
+NP_009225.1,NP_009225.1.csv,MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQCPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYANSYNFAKKENNSPEHLKDEVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLGTVRTLRTKQRIQPQKTSVYIELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISLDSAKKAACEFSETDVTNTEHHQPSNNDLNTTEKRAAERHPEKYQGSSVSNLHVEPCGTNTHASSLQHENSSLLLTKDRMNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTPSTEKKVDLNADPLCERKEWNKQKLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDELLGSDDSHDGESESNAKVADVLDVLNEVDEYSGSSEKIDLLASDPHEALICKSERVHSKSVESNIEDKIFGKTYRKKASLPNLSHVTENLIIGAFVTEPQIIQERPLTNKLKRKRRPTSGLHPEDFIKKADLAVQKTPEMINQGTNQTEQNGQVMNITNSGHENKTKGDSIQNEKNPNPIESLEKESAFKTKAEPISSSISNMELELNIHNSKAPKKNRLRRKSSTRHIHALELVVSRNLSPPNCTELQIDSCSSSEEIKKKKYNQMPVRHSRNLQLMEGKEPATGAKKSNKPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTSELKEFVNPSLPREEKEEKLETVKVSNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTDYGTQESISLLEVSTLGKAKTEPNKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHEVNHSRETSIEMEESELDAQYLQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQSPKVTFECEQKEENQGKNESNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSSQFRGNETGLITPNKHGLLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGNENIPSTVSTISRNNIRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNRGPKLNAMLRLGVLQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQPMGSSHASQVCSETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHTHLAQGYRRGAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNTEENLLSLKNSLNDCSNQVILAKASQEHHLSEETKCSASLFSSQCSELEDLTANTNTQDPFLIGSSKQMRHQSESQGVGLSDKELVSDDEERGTGLEENNQEEQSMDSNLGEAASGCESETSVSEDCSGLSSQSDILTTQQRDTMQHNLIKLQQEMAELEAVLEQHGSQPSNSYPSIISDSSALEDLRNPEQSTSEKAVLTSQKSSEYPISQNPEGLSADKFEVSADSSTSKNKEPGVERSSPSKCPSLDDRWYMHSCSGSLQNRNYPSQEELIKVVDVEEQQLEESGPHDLTETSYLPRQDLEGTPYLESGISLFSDDPESDPSEDRAPESARVGNIPSSTSALKVPQLKVAESAQSPAAAHTTDTAGYNAMEESVSREKPELTASTERVNKRMSMVVSGLTPEEFMLVYKFARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYFWVTQSIKERKMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMVQLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSVALYQCQELDTYLIPQIPHSHY,mutated_sequence,1.0,1863.0,NP_009225.1.a2m,NP_009225.1.npy,ClinVar
+NP_015564.5,NP_015564.5.csv,MARQKKMGQSVLRAVFFLVLGLLGHSHGGFPNTISIGGLFMRNTVQEHSAFRFAVQLYNTNQNTTEKPFHLNYHVDHLDSSNSFSVTNAFCSQFSRGVYAIFGFYDQMSMNTLTSFCGALHTSFVTPSFPTDADVQFVIQMRPALKGAILSLLGHYKWEKFVYLYDTERGFSILQAIMEAAVQNNWQVTARSVGNIKDVQEFRRIIEEMDRRQEKRYLIDCEVERINTILEQVVILGKHSRGYHYMLANLGFTDILLERVMHGGANITGFQIVNNENPMVQQFIQRWVRLDEREFPEAKNAPLKYTSALTHDAILVIAEAFRYLRRQRVDVSRRGSAGDCLANPAVPWSQGIDIERALKMVQVQGMTGNIQFDTYGRRTNYTIDVYEMKVSGSRKAGYWNEYERFVPFSDQQISNDSASSENRTIVVTTILESPYVMYKKNHEQLEGNERYEGYCVDLAYEIAKHVRIKYKLSIVGDGKYGARDPETKIWNGMVGELVYGRADIAVAPLTITLVREEVIDFSKPFMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWHLEDNNEEPRDPQSPPDPPNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTLDSGSTKEFFRRSKIAVYEKMWSYMKSAEPSVFTKTTADGVARVRKSKGKFAFLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGVATPKGSALRTPVNLAVLKLSEQGILDKLKNKWWYDKGECGAKDSGSKDKTSALSLSNVAGVFYILVGGLGLAMMVALIEFCYKSRAESKRMKLTKNTQNFKPAPATNTQNYATYREGYNVYGTESVKI,mutated_sequence,1.0,894.0,NP_015564.5.a2m,NP_015564.5.npy,ClinVar
+NP_015566.1,NP_015566.1.csv,MSTMRLLTLALLFSCSVARAACDPKIVNIGAVLSTRKHEQMFREAVNQANKRHGSWKIQLNATSVTHKPNAIQMALSVCEDLISSQVYAILVSHPPTPNDHFTPTPVSYTAGFYRIPVLGLTTRMSIYSDKSIHLSFLRTVPPYSHQSSVWFEMMRVYSWNHIILLVSDDHEGRAAQKRLETLLEERESKAEKVLQFDPGTKNVTALLMEAKELEARVIILSASEDDAATVYRAAAMLNMTGSGYVWLVGEREISGNALRYAPDGILGLQLINGKNESAHISDAVGVVAQAVHELLEKENITDPPRGCVGNTNIWKTGPLFKRVLMSSKYADGVTGRVEFNEDGDRKFANYSIMNLQNRKLVQVGIYNGTHVIPNDRKIIWPGGETEKPRGYQMSTRLKIVTIHQEPFVYVKPTLSDGTCKEEFTVNGDPVKKVICTGPNDTSPGSPRHTVPQCCYGFCIDLLIKLARTMNFTYEVHLVADGKFGTQERVNNSNKKEWNGMMGELLSGQADMIVAPLTINNERAQYIEFSKPFKYQGLTILVKKEIPRSTLDSFMQPFQSTLWLLVGLSVHVVAVMLYLLDRFSPFGRFKVNSEEEEEDALTLSSAMWFSWGVLLNSGIGEGAPRSFSARILGMVWAGFAMIIVASYTANLAAFLVLDRPEERITGINDPRLRNPSDKFIYATVKQSSVDIYFRRQVELSTMYRHMEKHNYESAAEAIQAVRDNKLHAFIWDSAVLEFEASQKCDLVTTGELFFRSGFGIGMRKDSPWKQNVSLSILKSHENGFMEDLDKTWVRYQECDSRSNAPATLTFENMAGVFMLVAGGIVAGIFLIFIEIAYKRHKDARRKQMQLAFAAVNVWRKNLQDRKSGRAEPDPKKKATFRAITSTLASSFKRRRSSKDTSTGGGRGALQNQKDTVLPRRAIEREEGQLQLCSRHRES,mutated_sequence,1.0,938.0,NP_015566.1.a2m,NP_015566.1.npy,ClinVar
+NP_036192.2,NP_036192.2.csv,MEALIPVINKLQDVFNTVGADIIQLPQIVVVGTQSSGKSSVLESLVGRDLLPRGTGIVTRRPLILQLVHVSQEDKRKTTGEENGVEAEEWGKFLHTKNKLYTDFDEIRQEIENETERISGNNKGVSPEPIHLKIFSPNVVNLTLVDLPGMTKVPVGDQPKDIELQIRELILRFISNPNSIILAVTAANTDMATSEALKISREVDPDGRRTLAVITKLDLMDAGTDAMDVLMGRVIPVKLGIIGVVNRSQLDINNKKSVTDSIRDEYAFLQKKYPSLANRNGTKYLARTLNRLLMHHIRDCLPELKTRINVLAAQYQSLLNSYGEPVDDKSATLLQLITKFATEYCNTIEGTAKYIETSELCGGARICYIFHETFGRTLESVDPLGGLNTIDILTAIRNATGPRPALFVPEVSFELLVKRQIKRLEEPSLRCVELVHEEMQRIIQHCSNYSTQELLRFPKLHDAIVEVVTCLLRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNNNIEEQRRNRLARELPSAVSRDKSSKVPSALAPASQEPSPAASAEADGKLIQDSRRETKNVASGGGGVGDGVQEPTTGNWRGMLKTSKAEELLAEEKSKPIPIMPASPQKGHAVNLLDVPVPVARKLSAREQRDCEVIERLIKSYFLIVRKNIQDSVPKAVMHFLVNHVKDTLQSELVGQLYKSSLLDDLLTESEDMAQRRKEAADMLKALQGASQIIAEIRETHLW,mutated_sequence,1.0,736.0,NP_036192.2.a2m,NP_036192.2.npy,ClinVar
+NP_036286.2,NP_036286.2.csv,MYSGAGPALAPPAPPPPIQGYAFKPPPRPDFGTSGRTIKLQANFFEMDIPKIDIYHYELDIKPEKCPRRVNREIVEHMVQHFKTQIFGDRKPVFDGRKNLYTAMPLPIGRDKVELEVTLPGEGKDRIFKVSIKWVSCVSLQALHDALSGRLPSVPFETIQALDVVMRHLPSMRYTPVGRSFFTASEGCSNPLGGGREVWFGFHQSVRPSLWKMMLNIDVSATAFYKAQPVIEFVCEVLDFKSIEEQQKPLTDSQRVKFTKEIKGLKVEITHCGQMKRKYRVCNVTRRPASHQTFPLQQESGQTVECTVAQYFKDRHKLVLRYPHLPCLQVGQEQKHTYLPLEVCNIVAGQRCIKKLTDNQTSTMIRATARSAPDRQEEISKLMRSASFNTDPYVREFGIMVKDEMTDVTGRVLQPPSILYGGRNKAIATPVQGVWDMRNKQFHTGIEIKVWAIACFAPQRQCTEVHLKSFTEQLRKISRDAGMPIQGQPCFCKYAQGADSVEPMFRHLKNTYAGLQLVVVILPGKTPVYAEVKRVGDTVLGMATQCVQMKNVQRTTPQTLSNLCLKINVKLGGVNNILLPQGRPPVFQQPVIFLGADVTHPPAGDGKKPSIAAVVGSMDAHPNRYCATVRVQQHRQEIIQDLAAMVRELLIQFYKSTRFKPTRIIFYRDGVSEGQFQQVLHHELLAIREACIKLEKDYQPGITFIVVQKRHHTRLFCTDKNERVGKSGNIPAGTTVDTKITHPTEFDFYLCSHAGIQGTSRPSHYHVLWDDNRFSSDELQILTYQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHLVDKEHDSAEGSHTSGQSNGRDHQALAKAVQVHQDTLRTMYFA,mutated_sequence,1.0,859.0,NP_036286.2.a2m,NP_036286.2.npy,ClinVar
+NP_036325.2,NP_036325.2.csv,MAWRGAGPSVPGAPGGVGLSLGLLLQLLLLLGPARGFGDEEERRCDPIRISMCQNLGYNVTKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQLQFFLCSVYVPMCTEKINIPIGPCGGMCLSVKRRCEPVLKEFGFAWPESLNCSKFPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGEECHSVGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTAFTVLTFLIDSSRFSYPERPIIFLSMCYNIYSIAYIVRLTVGRERISCDFEEAAEPVLIQEGLKNTGCAIIFLLMYFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAAWAIPAVKTIVILIMRLVDADELTGLCYVGNQNLDALTGFVVAPLFTYLVIGTLFIAAGLVALFKIRSNLQKDGTKTDKLERLMVKIGVFSVLYTVPATCVIACYFYEISNWALFRYSADDSNMAVEMLKIFMSLLVGITSGMWIWSAKTLHTWQKCSNRLVNSGKVKREKRGNGWVKPGKGSETVV,mutated_sequence,1.0,537.0,NP_036325.2.a2m,NP_036325.2.npy,ClinVar
+NP_036345.2,NP_036345.2.csv,MRGFGPGLTARRLLPLRLPPRPPGPRLASGQAAGALERAMDELLRRAVPPTPAYELREKTPAPAEGQCADFVSFYGGLAETAQRAELLGRLARGFGVDHGQVAEQSAGVLHLRQQQREAAVLLQAEDRLRYALVPRYRGLFHHISKLDGGVRFLVQLRADLLEAQALKLVEGPDVREMNGVLKGMLSEWFSSGFLNLERVTWHSPCEVLQKISEAEAVHPVKNWMDMKRRVGPYRRCYFFSHCSTPGEPLVVLHVALTGDISSNIQAIVKEHPPSETEEKNKITAAIFYSISLTQQGLQGVELGTFLIKRVVKELQREFPHLGVFSSLSPIPGFTKWLLGLLNSQTKEHGRNELFTDSECKEISEITGGPINETLKLLLSSSEWVQSEKLVRALQTPLMRLCAWYLYGEKHRGYALNPVANFHLQNGAVLWRINWMADVSLRGITGSCGLMANYRYFLEETGPNSTSYLGSKIIKASEQVLSLVAQFQKNSKL,mutated_sequence,1.0,493.0,NP_036345.2.a2m,NP_036345.2.npy,ClinVar
+NP_036382.2,NP_036382.2.csv,MAAAGWRDGSGQEKYRLVVVGGGGVGKSALTIQFIQSYFVTDYDPTIEDSYTKQCVIDDRAARLDILDTAGQEEFGAMREQYMRTGEGFLLVFSVTDRGSFEEIYKFQRQILRVKDRDEFPMILIGNKADLDHQRQVTQEEGQQLARQLKVTYMEASAKIRMNVDQAFHELVRVIRKFQEQECPPSPEPTRKEKDKKGCHCVIF,mutated_sequence,1.0,204.0,NP_036382.2.a2m,NP_036382.2.npy,ClinVar
+NP_036470.1,NP_036470.1.csv,MAREDSVKCLRCLLYALNLLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHPVMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGVWTYEQELMVPVQWSDMVTLKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQEDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQMMSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL,mutated_sequence,1.0,305.0,NP_036470.1.a2m,NP_036470.1.npy,ClinVar
+NP_036546.2,NP_036546.2.csv,MACSIVQFCYFQDLQAARDFLFPHLREEILSGALRRDPSKSTDWEDDGWGAWEENEPQEPEEEGNTCKTQKTSWLQDCVLSLSPTNDLMVIAREQKAVFLVPKWKYSDKGKEEMQFAVGWSGSLNVEEGECVTSALCIPLASQKRSSTGRPDWTCIVVGFTSGYVRFYTENGVLLLAQLLNEDPVLQLKCRTYEIPRHPGVTEQNEELSILYPAAIVTIDGFSLFQSLRACRNQVAKAAASGNENIQPPPLAYKKWGLQDIDTIIDHASVGIMTLSPFDQMKTASNIGGFNAAIKNSPPAMSQYITVGSNPFTGFFYALEGSTQPLLSHVALAVASKLTSALFNAASGWLGWKSKHEEEAVQKQKPKVEPATPLAVRFGLPDSRRHGESICLSPCNTLAAVTDDFGRVILLDVARGIAIRMWKGYRDAQIGWIQTVEDLHERVPEKADFSPFGNSQGPSRVAQFLVIYAPRRGILEVWSTQQGPRVGAFNVGKHCRLLYPGYKIMGLNNVTSQSWQPQTYQICLVDPVSGSVKTVNVPFHLALSDKKSERAKDMHLVKKLAALLKTKSPNLDLVETEIKELILDIKYPATKKQALESILASERLPFSCLRNITQTLMDTLKSQELESVDEGLLQFCANKLKLLQLYESVSQLNSLDFHLDTPFSDNDLALLLRLDEKELLKLQALLEKYKQENTRTNVRFSDDKDGVLPVKTFLEYLEYEKDVLNIKKISEEEYVALGSFFFWKCLHGESSTEDMCHTLESAGLSPQLLLSLLLSVWLSKEKDILDKPQSICCLHTMLSLLSKMKVAIDETWDSQSVSPWWQQMRTACIQSENNGAALLSAHVGHSVAAQISNNMTEKKFSQTVLGADSEALTDSWEALSLDTEYWKLLLKQLEDCLILQTLLHSKGNTQTSKVSSLQAEPLPRLSVKKLLEGGKGGIADSVAKWIFKQDFSPEVLKLANEERDAENPDEPKEGVNRSFLEVSEMEMDLGAIPDLLHLAYEQFPCSLELDVLHAHCCWEYVVQWNKDPEEARFFVRSIEHLKQIFNAHVQNGIALMMWNTFLVKRFSAATYLMDKVGKSPKDRLCRRDVGMSDTAMTSFLGSCLDLLQILMEADVSRDEIQVPVLDTEDAWLSVEGPISIVELALEQKHIHYPLVEHHSILCSILYAVMRFSLKTVKPLSLFDSKGKNAFFKDLTSIQLLPSGEMDPNFISVRQQFLLKVVSAAVQAQHSATKVKDPTEEATPTPFGKDQDWPALAVDLAHHLQVSEDVVRRHYVGELYNYGVDHLGEEAILQVHDKEVLASQLLVLTGQRLAHALLHTQTKEGMELLARLPPTLCTWLKAMDPQDLQNTEVPIATTAKLVNKVIELLPEKHGQYGLALHLIEAVEAISLPSL,mutated_sequence,1.0,1393.0,NP_036546.2.a2m,NP_036546.2.npy,ClinVar
+NP_036601.2,NP_036601.2.csv,MNKKKKPFLGMPAPLGYVPGLGRGATGFTTRSDIGPARDANDPVDDRHAPPGKRTVGDQMKKNQAADDDDEDLNDTNYDEFNGYAGSLFSSGPYEKDDEEADAIYAALDKRMDERRKERREQREKEEIEKYRMERPKIQQQFSDLKRKLAEVTEEEWLSIPEVGDARNKRQRNPRYEKLTPVPDSFFAKHLQTGENHTSVDPRQTQFGGLNTPYPGGLNTPYPGGMTPGLMTPGTGELDMRKIGQARNTLMDMRLSQVSDSVSGQTVVDPKGYLTDLNSMIPTHGGDINDIKKARLLLKSVRETNPHHPPAWIASARLEEVTGKLQVARNLIMKGTEMCPKSEDVWLEAARLQPGDTAKAVVAQAVRHLPQSVRIYIRAAELETDIRAKKRVLRKALEHVPNSVRLWKAAVELEEPEDARIMLSRAVECCPTSVELWLALARLETYENARKVLNKARENIPTDRHIWITAAKLEEANGNTQMVEKIIDRAITSLRANGVEINREQWIQDAEECDRAGSVATCQAVMRAVIGIGIEEEDRKHTWMEDADSCVAHNALECARAIYAYALQVFPSKKSVWLRAAYFEKNHGTRESLEALLQRAVAHCPKAEVLWLMGAKSKWLAGDVPAARSILALAFQANPNSEEIWLAAVKLESENDEYERARRLLAKARSSAPTARVFMKSVKLEWVQDNIRAAQDLCEEALRHYEDFPKLWMMKGQIEEQKEMMEKAREAYNQGLKKCPHSTPLWLLLSRLEEKIGQLTRARAILEKSRLKNPKNPGLWLESVRLEYRAGLKNIANTLMAKALQECPNSGILWSEAIFLEARPQRRTKSVDALKKCEHDPHVLLAVAKLFWSQRKITKAREWFHRTVKIDSDLGDAWAFFYKFELQHGTEEQQEEVRKRCESAEPRHGELWCAVSKDIANWQKKIGDILRLVAGRIKNTF,mutated_sequence,1.0,941.0,NP_036601.2.a2m,NP_036601.2.npy,ClinVar
+NP_037379.1,NP_037379.1.csv,MAAPRAGRGAGWSLRAWRALGGIRWGRRPRLTPDLRALLTSGTSDPRARVTYGTPSLWARLSVGVTEPRACLTSGTPGPRAQLTAVTPDTRTREASENSGTRSRAWLAVALGAGGAVLLLLWGGGRGPPAVLAAVPSPPPASPRSQYNFIADVVEKTAPAVVYIEILDRHPFLGREVPISNGSGFVVAADGLIVTNAHVVADRRRVRVRLLSGDTYEAVVTAVDPVADIATLRIQTKEPLPTLPLGRSADVRQGEFVVAMGSPFALQNTITSGIVSSAQRPARDLGLPQTNVEYIQTDAAIDFGNSGGPLVNLDGEVIGVNTMKVTAGISFAIPSDRLREFLHRGEKKNSSSGISGSQRRYIGVMMLTLSPSILAELQLREPSFPDVQHGVLIHKVILGSPAHRAGLRPGDVILAIGEQMVQNAEDVYEAVRTQSQLAVQIRRGRETLTLYVTPEVTE,mutated_sequence,1.0,458.0,NP_037379.1.a2m,NP_037379.1.npy,ClinVar
+NP_037386.1,NP_037386.1.csv,MQSTSNHLWLLSDILGQGATANVFRGRHKKTGDLFAIKVFNNISFLRPVDVQMREFEVLKKLNHKNIVKLFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDVVGGMNHLRENGIVHRDIKPGNIMRVIGEDGQSVYKLTDFGAARELEDDEQFVSLYGTEEYLHPDMYERAVLRKDHQKKYGATVDLWSIGVTFYHAATGSLPFRPFEGPRRNKEVMYKIITGKPSGAISGVQKAENGPIDWSGDMPVSCSLSRGLQVLLTPVLANILEADQEKCWGFDQFFAETSDILHRMVIHVFSLQQMTAHKIYIHSYNTATIFHELVYKQTKIISSNQELIYEGRRLVLEPGRLAQHFPKTTEENPIFVVSREPLNTIGLIYEKISLPKVHPRYDLDGDASMAKAITGVVCYACRIASTLLLYQELMRKGIRWLIELIKDDYNETVHKKTEVVITLDFCIRNIEKTVKVYEKLMKINLEAAELGEISDIHTKLLRLSSSQGTIETSLQDIDSRLSPGGSLADAWAHQEGTHPKDRNVEKLQVLLNCMTEIYYQFKKDKAERRLAYNEEQIHKFDKQKLYYHATKAMTHFTDECVKKYEAFLNKSEEWIRKMLHLRKQLLSLTNQCFDIEEEVSKYQEYTNELQETLPQKMFTASSGIKHTMTPIYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL,mutated_sequence,1.0,729.0,NP_037386.1.a2m,NP_037386.1.npy,ClinVar
+NP_037397.2,NP_037397.2.csv,MAAAAAAGPSPGSGPGDSPEGPEGEAPERRRKAHGMLKLYYGLSEGEAAGRPAGPDPLDPTDLNGAHFDPEVYLDKLRRECPLAQLMDSETDMVRQIRALDSDMQTLVYENYNKFISATDTIRKMKNDFRKMEDEMDRLATNMAVITDFSARISATLQDRHERITKLAGVHALLRKLQFLFELPSRLTKCVELGAYGQAVRYQGRAQAVLQQYQHLPSFRAIQDDCQVITARLAQQLRQRFREGGSGAPEQAECVELLLALGEPAEELCEEFLAHARGRLEKELRNLEAELGPSPPAPDVLEFTDHGGSGFVGGLCQVAAAYQELFAAQGPAGAEKLAAFARQLGSRYFALVERRLAQEQGGGDNSLLVRALDRFHRRLRAPGALLAAAGLADAATEIVERVARERLGHHLQGLRAAFLGCLTDVRQALAAPRVAGKEGPGLAELLANVASSILSHIKASLAAVHLFTAKEVSFSNKPYFRGEFCSQGVREGLIVGFVHSMCQTAQSFCDSPGEKGGATPPALLLLLSRLCLDYETATISYILTLTDEQFLVQDQFPVTPVSTLCAEARETARRLLTHYVKVQGLVISQMLRKSVETRDWLSTLEPRNVRAVMKRVVEDTTAIDVQVGLLYEEGVRKAQSSDSSKRTFSVYSSSRQQGRYAPSYTPSAPMDTNLLSNIQKLFSERIDVFSPVEFNKVSVLTGIIKISLKTLLECVRLRTFGRFGLQQVQVDCHFLQLYLWRFVADEELVHLLLDEVVASAALRCPDPVPMEPSVVEVICERG,mutated_sequence,1.0,782.0,NP_037397.2.a2m,NP_037397.2.npy,ClinVar
+NP_037398.2,NP_037398.2.csv,MSAETPITLNIDPQDLQVQTFTVEKLLEPLIIQVTTLVNCPQNPSSRKKGRSKRASVLLASVEEATWNLLDKGEKIAQEATVLKDELTASLEEVRKESEALKVSAERFTDDPCFLPKREAVVQAARALLAAVTRLLILADMIDVMCLLQHVSAFQRTFESLKNVANKSDLQKTYQKLGKELENLDYLAFKRQQDLKSPNQRDEIAGARASLKENSPLLHSICSACLEHSDVASLKASKDTVCEEIQNALNVISNASQGIQNMTTPPEPQAATLGSALDELENLIVLNPLTVTEEEIRPSLEKRLEAIISGAALLADSSCTRDLHRERIIAECNAIRQALQDLLSEYMNNAGKKERSNTLNIALDNMCKKTRDLRRQLRKAIIDHVSDSFLDTTVPLLVLIEAAKNGREKEIKEYAAIFHEHTSRLVEVANLACSMSTNEDGIKIVKIAANHLETLCPQIINAALALAARPKSQAVKNTMEMYKRTWENHIHVLTEAVDDITSIDDFLAVSESHILEDVNKCIIALRDQDADNLDRAAGAIRGRAARVAHIVTGEMDSYEPGAYTEGVMRNVNFLTSTVIPEFVTQVNVALEALSKSSLNVLDDNQFVDISKKIYDTIHDIRCSVMMIRTPEELEDVSDLEEEHEVRSHTSIQTEGKTDRAKMTQLPEAEKEKIAEQVADFKKVKSKLDAEIEIWDDTSNDIIVLAKNMCMIMMEMTDFTRGKGPLKHTTDVIYAAKMISESGSRMDVLARQIANQCPDPSCKQDLLAYLEQIKFYSHQLKICSQVKAEIQNLGGELIMSALDSVTSLIQAAKNLMNAVVQTVKMSYIASTKIIRIQSPAGPRHPVVMWRMKAPAKKPLIKREKPEETCAAVRRGSAKKKIHPLQVMSEFRGRQIY,mutated_sequence,1.0,895.0,NP_037398.2.a2m,NP_037398.2.npy,ClinVar
+NP_037407.4,NP_037407.4.csv,MPKGGCPKAPQQEELPLSSDMVEKQTGKKDKDKVSLTKTPKLERGDGGKEVRERASKRKLPFTAGANGEQKDSDTEKQGPERKRIKKEPVTRKAGLLFGMGLSGIRAGYPLSERQQVALLMQMTAEESANSPVDTTPKHPSQSTVCQKGTPNSASKTKDKVNKRNERGETRLHRAAIRGDARRIKELISEGADVNVKDFAGWTALHEACNRGYYDVAKQLLAAGAEVNTKGLDDDTPLHDAANNGHYKVVKLLLRYGGNPQQSNRKGETPLKVANSPTMVNLLLGKGTYTSSEESSTESSEEEDAPSFAPSSSVDGNNTDSEFEKGLKHKAKNPEPQKATAPVKDEYEFDEDDEQDRVPPVDDKHLLKKDYRKETKSNSFISIPKMEVKSYTKNNTIAPKKASHRILSDTSDEEDASVTVGTGEKLRLSAHTILPGSKTREPSNAKQQKEKNKVKKKRKKETKGREVRFGKRSDKFCSSESESESSESGEDDRDSLGSSGCLKGSPLVLKDPSLFSSLSASSTSSHGSSAAQKQNPSHTDQHTKHWRTDNWKTISSPAWSEVSSLSDSTRTRLTSESDYSSEGSSVESLKPVRKRQEHRKRASLSEKKSPFLSSAEGAVPKLDKEGKVVKKHKTKHKHKNKEKGQCSISQELKLKSFTYEYEDSKQKSDKAILLENDLSTENKLKVLKHDRDHFKKEEKLSKMKLEEKEWLFKDEKSLKRIKDTNKDISRSFREEKDRSNKAEKERSLKEKSPKEEKLRLYKEERKKKSKDRPSKLEKKNDLKEDKISKEKEKIFKEDKEKLKKEKVYREDSAFDEYCNKNQFLENEDTKFSLSDDQRDRWFSDLSDSSFDFKGEDSWDSPVTDYRDMKSDSVAKLILETVKEDSKERRRDSRAREKRDYREPFFRKKDRDYLDKNSEKRKEQTEKHKSVPGYLSEKDKKRRESAEAGRDRKDALESCKERRDGRAKPEEAHREELKECGCESGFKDKSDGDFGKGLEPWERHHPAREKEKKDGPDKERKEKTKPERYKEKSSDKDKSEKSILEKCQKDKEFDKCFKEKKDTKEKHKDTHGKDKERKASLDQGKEKKEKAFPGIISEDFSEKKDDKKGKEKSWYIADIFTDESEDDRDSCMGSGFKMGEASDLPRTDGLQEKEEGREAYASDRHRKSSDKQHPERQKDKEPRDRRKDRGAADAGRDKKEKVFEKHKEKKDKESTEKYKDRKDRASVDSTQDKKNKQKLPEKAEKKHAAEDKAKSKHKEKSDKEHSKERKSSRSADAEKSLLEKLEEEALHEYREDSNDKISEVSSDSFTDRGQEPGLTAFLEVSFTEPPGDDKPRESACLPEKLKEKERHRHSSSSSKKSHDRERAKKEKAEKKEKGEDYKEGGSRKDSGQYEKDFLEADAYGVSYNMKADIEDELDKTIELFSTEKKDKNDSEREPSKKIEKELKPYGSSAINILKEKKKREKHREKWRDEKERHRDRHADGLLRHHRDELLRHHRDEQKPATRDKDSPPRVLKDKSRDEGPRLGDAKLKEKFKDGAEKEKGDPVKMSNGNDKVAPSKDPGKKDARPREKLLGDGDLMMTSFERMLSQKDLEIEERHKRHKERMKQMEKLRHRSGDPKLKEKAKPADDGRKKGLDIPAKKPPGLDPPFKDKKLKESTPIPPAAENKLHPASGADSKDWLAGPHMKEVLPASPRPDQSRPTGVPTPTSVLSCPSYEEVMHTPRTPSCSADDYADLVFDCADSQHSTPVPTAPTSACSPSFFDRFSVASSGLSENASQAPARPLSTNLYRSVSVDIRRTPEEEFSVGDKLFRQQSVPAASSYDSPMPPSMEDRAPLPPVPAEKFACLSPGYYSPDYGLPSPKVDALHCPPAAVVTVTPSPEGVFSSLQAKPSPSPRAELLVPSLEGALPPDLDTSEDQQATAAIIPPEPSYLEPLDEGPFSAVITEEPVEWAHPSEQALASSLIGGTSENPVSWPVGSDLLLKSPQRFPESPKRFCPADPLHSAAPGPFSASEAPYPAPPASPAPYALPVAEPGLEDVKDGVDAVPAAISTSEAAPYAPPSGLESFFSNCKSLPEAPLDVAPEPACVAAVAQVEALGPLENSFLDGSRGLSHLGQVEPVPWADAFAGPEDDLDLGPFSLPELPLQTKDAADGEAEPVEESLAPPEEMPPGAPGVINGGDVSTVVAEEPPALPPDQASTRLPAELEPEPSGEPKLDVALEAAVEAETVPEERARGDPDSSVEPAPVPPEQRPLGSGDQGAEAEGPPAASLCAPDGPAPNTVAQAQAADGAGPEDDTEASRAAAPAEGPPGGIQPEAAEPKPTAEAPKAPRVEEIPQRMTRNRAQMLANQSKQGPPPSEKECAPTPAPVTRAKARGSEDDDAQAQHPRKRRFQRSTQQLQQQLNTSTQQTREVIQQTLAAIVDAIKLDAIEPYHSDRANPYFEYLQIRKKIEEKRKILCCITPQAPQCYAEYVTYTGSYLLDGKPLSKLHIPVIAPPPSLAEPLKELFRQQEAVRGKLRLQHSIEREKLIVSCEQEILRVHCRAARTIANQAVPFSACTMLLDSEVYNMPLESQGDENKSVRDRFNARQFISWLQDVDDKYDRMKTCLLMRQQHEAAALNAVQRMEWQLKVQELDPAGHKSLCVNEVPSFYVPMVDVNDDFVLLPA,mutated_sequence,1.0,2663.0,NP_037407.4.a2m,NP_037407.4.npy,ClinVar
+NP_037471.2,NP_037471.2.csv,MEKWYLMTVVVLIGLTVRWTVSLNSYSGAGKPPMFGDYEAQRHWQEITFNLPVKQWYFNSSDNNLQYWGLDYPPLTAYHSLLCAYVAKFINPDWIALHTSRGYESQAHKLFMRTTVLIADLLIYIPAVVLYCCCLKEISTKKKIANALCILLYPGLILIDYGHFQYNSVSLGFALWGVLGISCDCDLLGSLAFCLAINYKQMELYHALPFFCFLLGKCFKKGLKGKGFVLLVKLACIVVASFVLCWLPFFTEREQTLQVLRRLFPVDRGLFEDKVANIWCSFNVFLKIKDILPRHIQLIMSFCSTFLSLLPACIKLILQPSSKGFKFTLVSCALSFFLFSFQVHEKSILLVSLPVCLVLSEIPFMSTWFLLVSTFSMLPLLLKDELLMPSVVTTMAFFIACVTSFSIFEKTSEEELQLKSFSISVRKYLPCFTFLSRIIQYLFLISVITMVLLTLMTVTLDPPQKLPDLFSVLVCFVSCLNFLFFLVYFNIIIMWDSKSGRNQKKIS,mutated_sequence,1.0,507.0,NP_037471.2.a2m,NP_037471.2.npy,ClinVar
+NP_054728.2,NP_054728.2.csv,MPNPRPGKPSAPSLALGPSPGASPSWRAAPKASDLLGARGPGGTFQGRDLRGGAHASSSSLNPMPPSQLQLPTLPLVMVAPSGARLGPLPHLQALLQDRPHFMHQLSTVDAHARTPVLQVHPLESPAMISLTPPTTATGVFSLKARPGLPPGINVASLEWVSREPALLCTFPNPSAPRKDSTLSAVPQSSYPLLANGVCKWPGCEKVFEEPEDFLKHCQADHLLDEKGRAQCLLQREMVQSLEQQLVLEKEKLSAMQAHLAGKMALTKASSVASSDKGSCCIVAAGSQGPVVPAWSGPREAPDSLFAVRRHLWGSHGNSTFPEFLHNMDYFKFHNMRPPFTYATLIRWAILEAPEKQRTLNEIYHWFTRMFAFFRNHPATWKNAIRHNLSLHKCFVRVESEKGAVWTVDELEFRKKRSQRPSRCSNPTPGP,mutated_sequence,1.0,431.0,NP_054728.2.a2m,NP_054728.2.npy,ClinVar
+NP_054859.2,NP_054859.2.csv,MSLPLTEEQRKKIEENRQKALARRAEKLLAEQHQRTSSGTSIAGNPFQAKQGPSQNFPRESCKPVSHGVIFKQQNLSSSSNADQRPHDSHSFQAKGIWKKPEEMPTACPGHSPRSQMALTGISPPLAQSPPEVPKQQLLSYELGQGHAQASPEIRFTPFANPTHKPLAKPKSSQETPAHSSGQPPRDAKLEAKTAKASPSGQNISYIHSSSESVTPRTEGRLQQKSGSSVQKGVNSQKGKCVRNGDRFQVLIGYNAELIAVFKTLPSKNYDPDTKTWNFSMNDYSALMKAAQSLPTVNLQPLEWAYGSSESPSTSSEGQAGLPSAPSLSFVKGRCMLISRAYFEADISYSQDLIALFKQMDSRRYDVKTRKWSFLLEEHSKLIAKVRCLPQVQLDPLPTTLTLAFASQLKKTSLSLTPDVPEADLSEVDPKLVSNLMPFQRAGVNFAIAKGGRLLLADDMGLGKTIQAICIAAFYRKEWPLLVVVPSSVRFTWEQAFLRWLPSLSPDCINVVVTGKDRLTAGLINIVSFDLLSKLEKQLKTPFKVVIIDESHFLKNSRTARCRAAMPVLKVAKRVILLSGTPAMSRPAELYTQIIAVKPTFFPQFHAFGLRYCDAKRMPWGWDYSGSSNLGELKLLLEEAVMLRRLKSDVLSQLPAKQRKIVVIAPGRINARTRAALDAAAKEMTTKDKTKQQQKDALILFFNRTAEAKIPSVIEYILDLLESGREKFLVFAHHKVVLDAITQELERKHVQHIRIDGSTSSAEREDLCQQFQLSERHAVAVLSITAANMGLTFSSADLVVFAELFWNPGVLIQAEDRVHRIGQTSSVGIHYLVAKGTADDYLWPLIQEKIKVLAEAGLSETNFSEMTESTDYLYKDPKQQKIYDLFQKSFEKEGSDMELLEAAESFDPGSASGTSGSSSQNMGDTLDESSLTASPQKKRRFEFFDNWDSFTSPL,mutated_sequence,1.0,954.0,NP_054859.2.a2m,NP_054859.2.npy,ClinVar
+NP_054860.1,NP_054860.1.csv,MQAAPRAGCGAALLLWIVSSCLCRAWTAPSTSQKCDEPLVSGLPHVAFSSSSSISGSYSPGYAKINKRGGAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYSSSDWVTQYRMLYSDTGRNWKPYHQDGNIWAFPGNINSDGVVRHELQHPIIARYVRIVPLDWNGEGRIGLRIEVYGCSYWADVINFDGHVVLPYRFRNKKMKTLKDVIALNFKTSESEGVILHGEGQQGDYITLELKKAKLVLSLNLGSNQLGPIYGHTSVMTGSLLDDHHWHSVVIERQGRSINLTLDRSMQHFRTNGEFDYLDLDYEITFGGIPFSGKPSSSSRKNFKGCMESINYNGVNITDLARRKKLEPSNVGNLSFSCVEPYTVPVFFNATSYLEVPGRLNQDLFSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQTKMSQIDISSGSGLNDGQWHEVRFLAKENFAILTIDGDEASAVRTNSPLQVKTGEKYFFGGFLNQMNNSSHSVLQPSFQGCMQLIQVDDQLVNLYEVAQRKPGSFANVSIDMCAIIDRCVPNHCEHGGKCSQTWDSFKCTCDETGYSGATCHNSIYEPSCEAYKHLGQTSNYYWIDPDGSGPLGPLKVYCNMTEDKVWTIVSHDLQMQTPVVGYNPEKYSVTQLVYSASMDQISAITDSAEYCEQYVSYFCKMSRLLNTPDGSPYTWWVGKANEKHYYWGGSGPGIQKCACGIERNCTDPKYYCNCDADYKQWRKDAGFLSYKDHLPVSQVVVGDTDRQGSEAKLSVGPLRCQGDRNYWNAASFPNPSSYLHFSTFQGETSADISFYFKTLTPWGVFLENMGKEDFIKLELKSATEVSFSFDVGNGPVEIVVRSPTPLNDDQWHRVTAERNVKQASLQVDRLPQQIRKAPTEGHTRLELYSQLFVGGAGGQQGFLGCIRSLRMNGVTLDLEERAKVTSGFISGCSGHCTSYGTNCENGGKCLERYHGYSCDCSNTAYDGTFCNKDVGAFFEEGMWLRYNFQAPATNARDSSSRVDNAPDQQNSHPDLAQEEIRFSFSTTKAPCILLYISSFTTDFLAVLVKPTGSLQIRYNLGGTREPYNIDVDHRNMANGQPHSVNITRHEKTIFLKLDHYPSVSYHLPSSSDTLFNSPKSLFLGKVIETGKIDQEIHKYNTPGFTGCLSRVQFNQIAPLKAALRQTNASAHVHIQGELVESNCGASPLTLSPMSSATDPWHLDHLDSASADFPYNPGQGQAIRNGVNRNSAIIGGVIAVVIFTILCTLVFLIRYMFRHKGTYHTNEAKGAESAESADAAIMNNDPNFTETIDESKKEWLI,mutated_sequence,1.0,1331.0,NP_054860.1.a2m,NP_054860.1.npy,ClinVar
+NP_055023.2,NP_055023.2.csv,MKIITYFCIWAVAWAIPVPQSKPLERHVEKSMNLHLLARSNVSVQDELNASGTIKESGVLVHEGDRGRQENTQDGHKGEGNGSKWAEVGGKSFSTYSTLANEEGNIEGWNGDTGKAETYGHDGIHGKEENITANGIQGQVSIIDNAGATNRSNTNGNTDKNTQNGDVGDAGHNEDVAVVQEDGPQVAGSNNSTDNEDEIIENSCRNEGNTSEITPQINSKRNGTKEAEVTPGTGEDAGLDNSDGSPSGNGADEDEDEGSGDDEDEEAGNGKDSSNNSKGQEGQDHGKEDDHDSSIGQNSDSKEYYDPEGKEDPHNEVDGDKTSKSEENSAGIPEDNGSQRIEDTQKLNHRESKRVENRITKESETHAVGKSQDKGIEIKGPSSGNRNITKEVGKGNEGKEDKGQHGMILGKGNVKTQGEVVNIEGPGQKSEPGNKVGHSNTGSDSNSDGYDSYDFDDKSMQGDDPNSSDESNGNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSKGAEDDDSDSTSDTNNSDSNGNGNNGNDDNDKSDSGKGKSDSSDSDSSDSSNSSDSSDSSDSDSSDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSDSSNSSNSSDSSDSSDSSDSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSDSSDGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSSDSSDSSNSSDSDSSDSSNSSDSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDSSDSSDSSDSTSDSNDESDSQSKSGNGNNNGSDSDSDSEGSDSNHSTSDD,mutated_sequence,1.0,1301.0,NP_055023.2.a2m,NP_055023.2.npy,ClinVar
+NP_055047.2,NP_055047.2.csv,MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEVVDIMRVNVDKVLERDQKLSELDDRADALQAGASQFETSAAKLKRKYWWKNLKMMIILGVICAIILIIIIVYFST,mutated_sequence,1.0,116.0,NP_055047.2.a2m,NP_055047.2.npy,ClinVar
+NP_055064.1,NP_055064.1.csv,METRPTALMSSTVAAAAPAAGAASRKESPGRWGLGEDPTGVSPSLQCRVCGDSSSGKHYGIYACNGCSGFFKRSVRRRLIYRCQVGAGMCPVDKAHRNQCQACRLKKCLQAGMNQDAVQNERQPRSTAQVHLDSMESNTESRPESLVAPPAPAGRSPRGPTPMSAARALGHHFMASLITAETCAKLEPEDADENIDVTSNDPEFPSSPYSSSSPCGLDSIHETSARLLFMAVKWAKNLPVFSSLPFRDQVILLEEAWSELFLLGAIQWSLPLDSCPLLAPPEASAAGGAQGRLTLASMETRVLQETISRFRALAVDPTEFACMKALVLFKPETRGLKDPEHVEALQDQSQVMLSQHSKAHHPSQPVRFGKLLLLLPSLRFITAERIELLFFRKTIGNTPMEKLLCDMFKN,mutated_sequence,1.0,410.0,NP_055064.1.a2m,NP_055064.1.npy,ClinVar
+NP_055067.1,NP_055067.1.csv,MKSNPAIQAAIDLTAGAAGGTACVLTGQPFDTMKVKMQTFPDLYRGLTDCCLKTYSQVGFRGFYKGTSPALIANIAENSVLFMCYGFCQQVVRKVAGLDKQAKLSDLQNAAAGSFASAFAALVLCPTELVKCRLQTMYEMETSGKIAKSQNTVWSVIKSILRKDGPLGFYHGLSSTLLREVPGYFFFFGGYELSRSFFASGRSKDELGPVPLMLSGGVGGICLWLAVYPVDCIKSRIQVLSMSGKQAGFIRTFINVVKNEGITALYSGLKPTMIRAFPANGALFLAYEYSRKLMMNQLEAY,mutated_sequence,1.0,301.0,NP_055067.1.a2m,NP_055067.1.npy,ClinVar
+NP_055085.1,NP_055085.1.csv,MGDTGLRKRREDEKSIQSQEPKTTSLQKELGLISGISIIVGTIIGSGIFVSPKSVLSNTEAVGPCLIIWAACGVLATLGALCFAELGTMITKSGGEYPYLMEAYGPIPAYLFSWASLIVIKPTSFAIICLSFSEYVCAPFYVGCKPPQIVVKCLAAAAILFISTVNSLSVRLGSYVQNIFTAAKLVIVAIIIISGLVLLAQGNTKNFDNSFEGAQLSVGAISLAFYNGLWAYDGWNQLNYITEELRNPYRNLPLAIIIGIPLVTACYILMNVSYFTVMTATELLQSQAVAVTFGDRVLYPASWIVPLFVAFSTIGAANGTCFTAGRLIYVAGREGHMLKVLSYISVRRLTPAPAIIFYGIIATIYIIPGDINSLVNYFSFAAWLFYGLTILGLIVMRFTRKELERPIKVPVVIPVLMTLISVFLVLAPIISKPTWEYLYCVLFILSGLLFYFLFVHYKFGWAQKISKPITMHLQMLMEVVPPEEDPE,mutated_sequence,1.0,487.0,NP_055085.1.a2m,NP_055085.1.npy,ClinVar
+NP_055086.1,NP_055086.1.csv,MKAPIPHLILLYATFTQSLKVVTKRGSADGCTDWSIDIKKYQVLVGEPVRIKCALFYGYIRTNYSLAQSAGLSLMWYKSSGPGDFEEPIAFDGSRMSKEEDSIWFRPTLLQDSGLYACVIRNSTYCMKVSISLTVGENDTGLCYNSKMKYFEKAELSKSKEISCRDIEDFLLPTREPEILWYKECRTKTWRPSIVFKRDTLLIREVREDDIGNYTCELKYGGFVVRRTTELTVTAPLTDKPPKLLYPMESKLTIQETQLGDSANLTCRAFFGYSGDVSPLIYWMKGEKFIEDLDENRVWESDIRILKEHLGEQEVSISLIVDSVEEGDLGNYSCYVENGNGRRHASVLLHKRELMYTVELAGGLGAILLLLVCLVTIYKCYKIEIMLFYRNHFGAEELDGDNKDYDAYLSYTKVDPDQWNQETGEEERFALEILPDMLEKHYGYKLFIPDRDLIPTGTYIEDVARCVDQSKRLIIVMTPNYVVRRGWSIFELETRLRNMLVTGEIKVILIECSELRGIMNYQEVEALKHTIKLLTVIKWHGPKCNKLNSKFWKRLQYEMPFKRIEPITHEQALDVSEQGPFGELQTVSAISMAAATSTALATAHPDLRSTFHNTYHSQMRQKHYYRSYEYDVPPTGTLPLTSIGNQHTYCNIPMTLINGQRPQTKSSREQNPDEAHTNSAILPLLPRETSISSVIW,mutated_sequence,1.0,696.0,NP_055086.1.a2m,NP_055086.1.npy,ClinVar
+NP_055105.2,NP_055105.2.csv,MLEGDLVSKMLRAVLQSHKNGVALPRLQGEYRSLTGDWIPFKQLGFPTLEAYLRSVPAVVRIETSRSGEITCYAMACTETARIAQLVARQRSSKRKTGRQVNCQMRVKKTMPFFLEGKPKATLRQPGFASNFSVGKKPNPAPLRDKGNSVGVKPDAEMSPYMLHTTLGNEAFKDIPVQRHVTMSTNNRFSPKASLQPPLQMHLSRTSTKEMSDNLNQTVEKPNVKPPASYTYKMDEVQNRIKEILNKHNNGIWISKLPHFYKELYKEDLNQGILQQFEHWPHICTVEKPCSGGQDLLLYPAKRKQLLRSELDTEKVPLSPLPGPKQTPPLKGCPTVMAGDFKEKVADLLVKYTSGLWASALPKAFEEMYKVKFPEDALKNLASLSDVCSIDYISGNPQKAILYAKLPLPTDKIQKDAGQAHGDNDIKAMVEQEYLQVEESIAESANTFMEDITVPPLMIPTEASPSVLVVELSNTNEVVIRYVGKDYSAAQELMEDEMKEYYSKNPKITPVQAVNVGQLLAVNAEEDAWLRAQVISTEENKIKVCYVDYGFSENVEKSKAYKLNPKFCSLSFQATKCKLAGLEVLSDDPDLVKVVESLTCGKIFAVEILDKADIPLVVLYDTSGEDDININATCLKAICDKSLEVHLQVDAMYTNVKVTNICSDGTLYCQVPCKGLNKLSDLLRKIEDYFHCKHMTSECFVSLPFCGKICLFHCKGKWLRVEITNVHSSRALDVQFLDSGTVTSVKVSELREIPPRFLQEMIAIPPQAIKCCLADLPQSIGMWTPDAVLWLRDSVLNCSDCSIKVTKVDETRGIAHVYLFTPKNFPDPHRSINRQITNADLWKHQKDVFLSAISSGADSPNSKNGNMPMSGNTGENFRKNLTDVIKKSMVDHTSAFSTEELPPPVHLSKPGEHMDVYVPVACHPGYFVIQPWQEIHKLEVLMEEMILYYSVSEERHIAVEKDQVYAAKVENKWHRVLLKGILTNGLVSVYELDYGKHELVNIRKVQPLVDMFRKLPFQAVTAQLAGVKCNQWSEEASMVFRNHVEKKPLVALVQTVIENANPWDRKVVVYLVDTSLPDTDTWIHDFMSEYLIELSKVN,mutated_sequence,1.0,1098.0,NP_055105.2.a2m,NP_055105.2.npy,ClinVar
+NP_055149.3,NP_055149.3.csv,MARPPRQHPGVWASLLLLLLTGPAACAASPADDGAGPGGRGPRGRARGDTGADEAVPRHDSSYGTFAGEFYDLRYLSEEGYPFPTAPPVDPFAKIKVDDCGKTKGCFRYGKPGCNAETCDYFLSYRMIGADVEFELSADTDGWVAVGFSSDKKMGGDDVMACVHDDNGRVRIQHFYNVGQWAKEIQRNPARDEEGVFENNRVTCRFKRPVNVPRDETIVDLHLSWYYLFAWGPAIQGSITRHDIDSPPASERVVSIYKYEDIFMPSAAYQTFSSPFCLLLIVALTFYLLMGTP,mutated_sequence,1.0,293.0,NP_055149.3.a2m,NP_055149.3.npy,ClinVar
+NP_055177.2,NP_055177.2.csv,MGQREMWRLMSRFNAFKRTNTILHHLRMSKHTDAAEEVLLEKKGCTGVITLNRPKFLNALTLNMIRQIYPQLKKWEQDPETFLIIIKGAGGKAFCAGGDIRVISEAEKAKQKIAPVFFREEYMLNNAVGSCQKPYVALIHGITMGGGVGLSVHGQFRVATEKCLFAMPETAIGLFPDVGGGYFLPRLQGKLGYFLALTGFRLKGRDVYRAGIATHFVDSEKLAMLEEDLLALKSPSKENIASVLENYHTESKIDRDKSFILEEHMDKINSCFSANTVEEIIENLQQDGSSFALEQLKVINKMSPTSLKITLRQLMEGSSKTLQEVLTMEYRLSQACMRGHDFHEGVRAVLIDKDQSPKWKPADLKEVTEEDLNNHFKSLGSSDLKF,mutated_sequence,1.0,386.0,NP_055177.2.a2m,NP_055177.2.npy,ClinVar
+NP_055178.3,NP_055178.3.csv,METKENRWVPVTVLPGCVGCRTVAALASWTVRDVKERIFAETGFPVSEQRLWRGGRELSDWIKIGDLTSKNCHLFVNLQSKGLKGGGRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAVFTPEDWHGIQEIARSRKKDDPLKVGRFGIGFNSVYHITDVPCIFSGDQIGMLDPHQTLFGPHESGQCWNLKDDSKEISELSDQFAPFVGIFGSTKETFINGNFPGTFFRFPLRLQPSQLSSNLYNKQKVLELFESFRADADTVLLFLKSVQDVSLYVREADGTEKLVFRVTSSESKALKHERPNSIKILGTAISNYCKKTPSNNITCVTYHVNIVLEEESTKDAQKTSWLVCNSVGGRGISSKLDSLADELKFVPIIGIAMPLSSRDDEAKGATSDFSGKAFCFLPLPPGEESSTGLPVHISGFFGLTDNRRSIKWRELDQWRDPAALWNEFLVMNVVPKAYATLILDSIKRLEMEKSSDFPLSVDVIYKLWPEASKVKVHWQPVLEPLFSELLQNAVIYSISCDWVRLEQVYFSELDENLEYTKTVLNYLQSSGKQIAKVPGNVDAAVQLTAASGTTPVRKVTPAWVRQVLRKCAHLGCAEEKLHLLEFVLSDQAYSELLGLELLPLQNGNFVPFSSSVSDQDVIYITSAEYPRSLFPSLEGRFILDNLKPHLVAALKEAAQTRGRPCTQLQLLNPERFARLIKEVMNTFWPGRELIVQWYPFDENRNHPSVSWLKMVWKNLYIHFSEDLTLFDEMPLIPRTILEEGQTCVELIRLRIPSLVILDDESEAQLPEFLADIVQKLGGFVLKKLDASIQHPLIKKYIHSPLPSAVLQIMEKMPLQKLCNQITSLLPTHKDALRKFLASLTDSSEKEKRIIQELAIFKRINHSSDQGISSYTKLKGCKVLHHTAKLPADLRLSISVIDSSDEATIRLANMLKIEQLKTTSCLKLVLKDIENAFYSHEEVTQLMLWVLENLSSLKNENPNVLEWLTPLKFIQISQEQMVSAGELFDPDIEVLKDLFCNEEGTYFPPSVFTSPDILHSLRQIGLKNEASLKEKDVVQVAKKIEALQVGACPDQDVLLKKAKTLLLVLNKNHTLLQSSEGKMTLKKIKWVPACKERPPNYPGSLVWKGDLCNLCAPPDMCDVGHAILIGSSLPLVESIHVNLEKALGIFTKPSLSAVLKHFKIVVDWYSSKTFSDEDYYQFQHILLEIYGFMHDHLNEGKDSFRALKFPWVWTGKKFCPLAQAVIKPIHDLDLQPYLHNVPKTMAKFHQLFKVCGSIEELTSDHISMVIQKIYLKSDQDLSEQESKQNLHLMLNIIRWLYSNQIPASPNTPVPIHHSKNPSKLIMKPIHECCYCDIKVDDLNDLLEDSVEPIILVHEDIPMKTAEWLKVPCLSTRLINPENMGFEQSGQREPLTVRIKNILEEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMAACHGPALWSFNNSQFSDSDFVNITRLGESLKRGEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNINHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPFIDVFGCQLPLTVEAPYSYNGTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKIEETNPSLAQDTVIIKKKSCSSKALNTPVLSVLKEAAKLMKTCSSSNKKLPSDEPKSSCILQITVEEFHHVFRRIADLQSPLFRGPDDDPAALFEMAKSGQSKKPSDELSQKTVECTTWLLCTCMDTGEALKFSLSESGRRLGLVPCGAVGVQLSEIQDQKWTVKPHIGEVFCYLPLRIKTGLPVHINGCFAVTSNRKEIWKTDTKGRWNTTFMRHVIVKAYLQVLSVLRDLATSGELMDYTYYAVWPDPDLVHDDFSVICQGFYEDIAHGKGKELTKVFSDGSTWVSMKNVRFLDDSILKRRDVGSAAFKIFLKYLKKTGSKNLCAVELPSSVKLGFEEAGCKQILLENTFSEKQFFSEVFFPNIQEIEAELRDPLMIFVLNEKVDEFSGVLRVTPCIPCSLEGHPLVLPSRLIHPEGRVAKLFDIKDGRFPYGSTQDYLNPIILIKLVQLGMAKDDILWDDMLERAVSVAEINKSDHVAACLRSSILLSLIDEKLKIRDPRAKDFAAKYQTIRFLPFLTKPAGFSLDWKGNSFKPETMFAATDLYTAEHQDIVCLLQPILNENSHSFRGCGSVSLAVKEFLGLLKKPTVDLVINQLKEVAKSVDDGITLYQENITNACYKYLHEALMQNEITKMSIIDKLKPFSFILVENAYVDSEKVSFHLNFEAAPYLYQLPNKYKNNFRELFETVGVRQSCTVEDFALVLESIDQERGTKQITEENFQLCRRIISEGIWSLIREKKQEFCEKNYGKILLPDTNLMLLPAKSLCYNDCPWIKVKDTTVKYCHADIPREVAVKLGAVPKRHKALERYASNVCFTTLGTEFGQKEKLTSRIKSILNAYPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNNQPFTEDDVRGIQNLGKGTKEGNPYKTGQYGIGFNSVYHITDCPSFISGNDILCIFDPHARYAPGATSISPGRMFRDLDADFRTQFSDVLDLYLGTHFKLDNCTMFRFPLRNAEMAKVSEISSVPASDRMVQNLLDKLRSDGAELLMFLNHMEKISICEIDKSTGALNVLYSVKGKITDGDRLKRKQFHASVIDSVTKKRQLKDIPVQQITYTMDTEDSEGNLTTWLICNRSGFSSMEKVSKSVISAHKNQDITLFPRGGVAACITHNYKKPHRAFCFLPLSLETGLPFHVNGHFALDSARRNLWRDDNGVGVRSDWNNSLMTALIAPAYVELLIQLKKRYFPGSDPTLSVLQNTPIHVVKDTLKKFLSFFPVNRLDLQPDLYCLVKALYNCIHEDMKRLLPVVRAPNIDGSDLHSAVIITWINMSTSNKTRPFFDNLLQDELQHLKNADYNITTRKTVAENVYRLKHLLLEIGFNLVYNCDETANLYHCLIDADIPVSYVTPADIRSFLMTFSSPDTNCHIGKLPCRLQQTNLKLFHSLKLLVDYCFKDAEENEIEVEGLPLLITLDSVLQTFDAKRPKFLTTYHELIPSRKDLFMNTLYLKYSNILLNCKVAKVFDISSFADLLSSVLPREYKTKSCTKWKDNFASESWLKNAWHFISESVSVKEDQEETKPTFDIVVDTLKDWALLPGTKFTVSANQLVVPEGDVLLPLSLMHIAVFPNAQSDKVFHALMKAGCIQLALNKICSKDSAFVPLLSCHTANIESPTSILKALHYMVQTSTFRAEKLVENDFEALLMYFNCNLNHLMSQDDIKILKSLPCYKSISGRYVSIGKFGTCYVLTKSIPSAEVEKWTQSSSSAFLEEKIHLKELYEVIGCVPVDDLEVYLKHLLPKIENLSYDAKLEHLIYLKNRLSSAEELSEIKEQLFEKLESLLIIHDANSRLKQAKHFYDRTVRVFEVMLPEKLFIPNDFFKKLEQLIKPKNHVTFMTSWVEFLRNIGLKYILSQQQLLQFAKEISVRANTENWSKETLQNTVDILLHHIFQERMDLLSGNFLKELSLIPFLCPERAPAEFIRFHPQYQEVNGTLPLIKFNGAQVNPKFKQCDVLQLLWTSCPILPEKATPLSIKEQEGSDLGPQEQLEQVLNMLNVNLDPPLDKVINNCRNICNITTLDEEMVKTRAKVLRSIYEFLSAEKREFRFQLRGVAFVMVEDGWKLLKPEEVVINLEYESDFKPYLYKLPLELGTFHQLFKHLGTEDIISTKQYVEVLSRIFKNSEGKQLDPNEMRTVKRVVSGLFRSLQNDSVKVRSDLENVRDLALYLPSQDGRLVKSSILVFDDAPHYKSRIQGNIGVQMLVDLSQCYLGKDHGFHTKLIMLFPQKLRPRLLSSILEEQLDEETPKVCQFGALCSLQGRLQLLLSSEQFITGLIRIMKHENDNAFLANEEKAIRLCKALREGLKVSCFEKLQTTLRVKGFNPIPHSRSETFAFLKRFGNAVILLYIQHSDSKDINFLLALAMTLKSATDNLISDTSYLIAMLGCNDIYRIGEKLDSLGVKYDSSEPSKLELPMPGTPIPAEIHYTLLMDPMNVFYPGEYVGYLVDAEGGDIYGSYQPTYTYAIIVQEVEREDADNSSFLGKIYQIDIGYSEYKIVSSLDLYKFSRPEESSQSRDSAPSTPTSPTEFLTPGLRSIPPLFSGRESHKTSSKHQSPKKLKVNSLPEILKEVTSVVEQAWKLPESERKKIIRRLYLKWHPDKNPENHDIANEVFKHLQNEINRLEKQAFLDQNADRASRRTFSTSASRFQSDKYSFQRFYTSWNQEATSHKSERQQQNKEKCPPSAGQTYSQRFFVPPTFKSVGNPVEARRWLRQARANFSAARNDLHKNANEWVCFKCYLSTKLALIAADYAVRGKSDKDVKPTALAQKIEEYSQQLEGLTNDVHTLEAYGVDSLKTRYPDLLPFPQIPNDRFTSEVAMRVMECTACIIIKLENFMQQKV,mutated_sequence,1.0,4579.0,NP_055178.3.a2m,NP_055178.3.npy,ClinVar
+NP_055331.1,NP_055331.1.csv,MADKRKLQGEIDRCLKKVSEGVEQFEDIWQKLHNAANANQKEKYEADLKKEIKKLQRLRDQIKTWVASNEIKDKRQLIDNRKLIETQMERFKVVERETKTKAYSKEGLGLAQKVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKDKQDRIEGLKRHIEKHRYHVRMLETILRMLDNDSILVDAIRKIKDDVEYYVDSSQDPDFEENEFLYDDLDLEDIPQALVATSPPSHSHMEDEIFNQSSSTPTSTTSSSPIPPSPANCTTENSEDDKKRGRSTDSEVSQSPAKNGSKPVHSNQHPQSPAVPPTYPSGPPPAASALSTTPGNNGVPAPAAPPSALGPKASPAPSHNSGTPAPYAQAVAPPAPSGPSTTQPRPPSVQPSGGGGGGSGGGGSSSSSNSSAGGGAGKQNGATSYSSVVADSPAEVALSSSGGNNASSQALGPPSGPHNPPPSTSKEPSAAAPTGAGGVAPGSGNNSGGPSLLVPLPVNPPSSPTPSFSDAKAAGALLNGPPQFSTAPEIKAPEPLSSLKSMAERAAISSGIEDPVPTLHLTERDIILSSTSAPPASAQPPLQLSEVNIPLSLGVCPLGPVPLTKEQLYQQAMEEAAWHHMPHPSDSERIRQYLPRNPCPTPPYHHQMPPPHSDTVEFYQRLSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKTITDEFEQGTYIYFDYEKWGQRKKEGFTFEYRYLEDRDLQ,mutated_sequence,1.0,753.0,NP_055331.1.a2m,NP_055331.1.npy,ClinVar
+NP_055400.1,NP_055400.1.csv,MTRAGDHNRQRGCCGSLADYLTSAKFLLYLGHSLSTWGDRMWHFAVSVFLVELYGNSLLLTAVYGLVVAGSVLVLGAIIGDWVDKNARLKVAQTSLVVQNVSVILCGIILMMVFLHKHELLTMYHGWVLTSCYILIITIANIANLASTATAITIQRDWIVVVAGEDRSKLANMNATIRRIDQLTNILAPMAVGQIMTFGSPVIGCGFISGWNLVSMCVEYVLLWKVYQKTPALAVKAGLKEEETELKQLNLHKDTEPKPLEGTHLMGVKDSNIHELEHEQEPTCASQMAEPFRTFRDGWVSYYNQPVFLAGMGLAFLYMTVLGFDCITTGYAYTQGLSGSILSILMGASAITGIMGTVAFTWLRRKCGLVRTGLISGLAQLSCLILCVISVFMPGSPLDLSVSPFEDIRSRFIQGESITPTKIPEITTEIYMSNGSNSANIVPETSPESVPIISVSLLFAGVIAARIGLWSFDLTVTQLLQENVIESERGIINGVQNSMNYLLDLLHFIMVILAPNPEAFGLLVLISVSFVAMGHIMYFRFAQNTLGNKLFACGPDAKEVRKENQANTSVV,mutated_sequence,1.0,571.0,NP_055400.1.a2m,NP_055400.1.npy,ClinVar
+NP_055440.1,NP_055440.1.csv,MERRARSSSRESRGRGGRTPHKENKRAKAERSGGGRGRQEAGPEPSGSGRAGTPGEPRAPAATVVDVDEVRGSGEEGTEVVALLESERPEEGTKSSGLGACEWLLVLISLLFIIMTFPFSIWFCVKVVQEYERVIIFRLGHLLPGRAKGPGLFFFLPCLDTYHKVDLRLQTLEIPFHEIVTKDMFIMEIDAICYYRMENASLLLSSLAHVSKAVQFLVQTTMKRLLAHRSLTEILLERKSIAQDAKVALDSVTCIWGIKVERIEIKDVRLPAGLQHSLAVEAEAQRQAKVRMIAAEAEKAASESLRMAAEILSGTPAAVQLRYLHTLQSLSTEKPSTVVLPLPFDLLNCLSSPSNRTQGSLPFPSPSKPVEPLNPKKKDSPML,mutated_sequence,1.0,383.0,NP_055440.1.a2m,NP_055440.1.npy,ClinVar
+NP_055542.1,NP_055542.1.csv,MAAAAGGGSCPGPGSARGRFPGRPRGAGGGGGRGGRGNGAERVRVALRRGGGATGPGGAEPGEDTALLRLLGLRRGLRRLRRLWAGPRVQRGRGRGRGRGWGPSRGCVPEEESSDGESDEEEFQGFHSDEDVAPSSLRSALRSQRGRAPRGRGRKHKTTPLPPPRLADVAPTPPKTPARKRGEEGTERMVQALTELLRRAQAPQAPRSRACEPSTPRRSRGRPPGRPAGPCRRKQQAVVVAEAAVTIPKPEPPPPVVPVKHQTGSWKCKEGPGPGPGTPRRGGQSSRGGRGGRGRGRGGGLPFVIKFVSRAKKVKMGQLSLGLESGQGQGQHEESWQDVPQRRVGSGQGGSPCWKKQEQKLDDEEEEKKEEEEKDKEGEEKEERAVAEEMMPAAEKEEAKLPPPPLTPPAPSPPPPLPPPSTSPPPPLCPPPPPPVSPPPLPSPPPPPAQEEQEESPPPVVPATCSRKRGRPPLTPSQRAEREAARAGPEGTSPPTPTPSTATGGPPEDSPTVAPKSTTFLKNIRQFIMPVVSARSSRVIKTPRRFMDEDPPKPPKVEVSPVLRPPITTSPPVPQEPAPVPSPPRAPTPPSTPVPLPEKRRSILREPTFRWTSLTRELPPPPPAPPPPPAPSPPPAPATSSRRPLLLRAPQFTPSEAHLKIYESVLTPPPLGAPEAPEPEPPPADDSPAEPEPRAVGRTNHLSLPRFAPVVTTPVKAEVSPHGAPALSNGPQTQAQLLQPLQALQTQLLPQALPPPQPQLQPPPSPQQMPPLEKARIAGVGSLPLSGVEEKMFSLLKRAKVQLFKIDQQQQQKVAASMPLSPGGQMEEVAGAVKQISDRGPVRSEDESVEAKRERPSGPESPVQGPRIKHVCRHAAVALGQARAMVPEDVPRLSALPLRDRQDLATEDTSSASETESVPSRSRRGKVEAAGPGGESEPTGSGGTLAHTPRRSLPSHHGKKMRMARCGHCRGCLRVQDCGSCVNCLDKPKFGGPNTKKQCCVYRKCDKIEARKMERLAKKGRTIVKTLLPWDSDESPEASPGPPGPRRGAGAGGPREEVVAHPGPEEQDSLLQRKSARRCVKQRPSYDIFEDSDDSEPGGPPAPRRRTPRENELPLPEPEEQSRPRKPTLQPVLQLKARRRLDKDALAPGPFASFPNGWTGKQKSPDGVHRVRVDFKEDCDLENVWLMGGLSVLTSVPGGPPMVCLLCASKGLHELVFCQVCCDPFHPFCLEEAERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRHAYHPACLGPSYPTRATRKRRHWICSACVRCKSCGATPGKNWDVEWSGDYSLCPRCTQLYEKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAGAAQPRWREALSGALQGGLRQVLQGLLSSKVVGPLLLCTQCGPDGKQLHPGPCGLQAVSQRFEDGHYKSVHSFMEDMVGILMRHSEEGETPDRRAGGQMKGLLLKLLESAFGWFDAHDPKYWRRSTRLPNGVLPNAVLPPSLDHVYAQWRQQEPETPESGQPPGDPSAAFQGKDPAAFSHLEDPRQCALCLKYGDADSKEAGRLLYIGQNEWTHVNCAIWSAEVFEENDGSLKNVHAAVARGRQMRCELCLKPGATVGCCLSSCLSNFHFMCARASYCIFQDDKKVFCQKHTDLLDGKEIVNPDGFDVLRRVYVDFEGINFKRKFLTGLEPDAINVLIGSIRIDSLGTLSDLSDCEGRLFPIGYQCSRLYWSTVDARRRCWYRCRILEYRPWGPREEPAHLEAAEENQTIVHSPAPSSEPPGGEDPPLDTDVLVPGAPERHSPIQNLDPPLRPDSGSAPPPAPRSFSGARIKVPNYSPSRRPLGGVSFGPLPSPGSPSSLTHHIPTVGDPDFPAPPRRSRRPSPLAPRPPPSRWASPPLKTSPQLRVPPPTSVVTALTPTSGELAPPGPAPSPPPPEDLGPDFEDMEVVSGLSAADLDFAASLLGTEPFQEEIVAAGAMGSSHGGPGDSSEEESSPTSRYIHFPVTVVSAPGLAPSATPGAPRIEQLDGVDDGTDSEAEAVQQPRGQGTPPSGPGVVRAGVLGAAGDRARPPEDLPSEIVDFVLKNLGGPGDGGAGPREESLPPAPPLANGSQPSQGLTASPADPTRTFAWLPGAPGVRVLSLGPAPEPPKPATSKIILVNKLGQVFVKMAGEGEPVPPPVKQPPLPPTISPTAPTSWTLPPGPLLGVLPVVGVVRPAPPPPPPPLTLVLSSGPASPPRQAIRVKRVSTFSGRSPPAPPPYKAPRLDEDGEASEDTPQVPGLGSGGFSRVRMKTPTVRGVLDLDRPGEPAGEESPGPLQERSPLLPLPEDGPPQVPDGPPDLLLESQWHHYSGEASSSEEEPPSPDDKENQAPKRTGPHLRFEISSEDGFSVEAESLEGAWRTLIEKVQEARGHARLRHLSFSGMSGARLLGIHHDAVIFLAEQLPGAQRCQHYKFRYHQQGEGQEEPPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPEGATCDEEEDEVQLRSTRRATSLELPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN,mutated_sequence,1.0,2715.0,NP_055542.1.a2m,NP_055542.1.npy,ClinVar
+NP_055595.2,NP_055595.2.csv,MVGELRYREFRVPLGPGLHAYPDELIRQRVGHDGHPEYQIRWLILRRGDEGDGGSGQVDCKAEHILLWMSKDEIYANCHKMLGEDGQVIGPSQESAGEVGALDKSVLEEMETDVKSLIQRALRQLEECVGTIPPAPLLHTVHVLSAYASIEPLTGVFKDPRVLDLLMHMLSSPDYQIRWSAGRMIQALSSHDAGTRTQILLSLSQQEAIEKHLDFDSRCALLALFAQATLSEHPMSFEGIQLPQVPGRVLFSLVKRYLHVTSLLDQLNDSAAEPGAQNTSAPEELSGERGQLELEFSMAMGTLISELVQAMRWDQASDRPRSSARSPGSIFQPQLADVSPGLPAAQAQPSFRRSRRFRPRSEFASGNTYALYVRDTLQPGMRVRMLDDYEEISAGDEGEFRQSNNGVPPVQVFWESTGRTYWVHWHMLEILGFEEDIEDMVEADEYQGAVASRVLGRALPAWRWRPMTELYAVPYVLPEDEDTEECEHLTLAEWWELLFFIKKLDGPDHQEVLQILQENLDGEILDDEILAELAVPIELAQDLLLTLPQRLNDSALRDLINCHVYKKYGPEALAGNQAYPSLLEAQEDVLLLDAQAQAKDSEDAAKVEAKEPPSQSPNTPLQRLVEGYGPAGKILLDLEQALSSEGTQENKVKPLLLQLQRQPQPFLALMQSLDTPETNRTLHLTVLRILKQLVDFPEALLLPWHEAVDACMACLRSPNTDREVLQELIFFLHRLTSVSRDYAVVLNQLGARDAISKALEKHLGKLELAQELRDMVFKCEKHAHLYRKLITNILGGCIQMVLGQIEDHRRTHQPINIPFFDVFLRYLCQGSSVEVKEDKCWEKVEVSSNPHRASKLTDHNPKTYWESNGSAGSHYITLHMRRGILIRQLTLLVASEDSSYMPARVVVCGGDSTSSLHTELNSVNVMPSASRVILLENLTRFWPIIQIRIKRCQQGGIDTRIRGLEILGPKPTFWPVFREQLCRHTRLFYMVRAQAWSQDMAEDRRSLLHLSSRLNGALRQEQNFADRFLPDDEAAQALGKTCWEALVSPVVQNITSPDEDGISPLGWLLDQYLECQEAVFNPQSRGPAFFSRVRRLTHLLVHVEPCEAPPPVVATPRPKGRNRSHDWSSLATRGLPSSIMRNLTRCWRAVVEKQVNNFLTSSWRDDDFVPRYCEHFNILQNSSSELFGPRAAFLLALQNGCAGALLKLPFLKAAHVSEQFARHIDQQIQGSRIGGAQEMERLAQLQQCLQAVLIFSGLEIATTFEHYYQHYMADRLLGVVSSWLEGAVLEQIGPCFPNRLPQQMLQSLSTSKELQRQFHVYQLQQLDQELLKLEDTEKKIQVGLGASGKEHKSEKEEEAGAAAVVDVAEGEEEEEENEDLYYEGAMPEVSVLVLSRHSWPVASICHTLNPRTCLPSYLRGTLNRYSNFYNKSQSHPALERGSQRRLQWTWLGWAELQFGNQTLHVSTVQMWLLLYLNDLKAVSVESLLAFSGLSADMLNQAIGPLTSSRGPLDLHEQKDIPGGVLKIRDGSKEPRSRWDIVRLIPPQTYLQAEGEDGQNLEKRRNLLNCLIVRILKAHGDEGLHIDQLVCLVLEAWQKGPCPPRGLVSSLGKGSACSSTDVLSCILHLLGKGTLRRHDDRPQVLSYAVPVTVMEPHTESLNPGSSGPNPPLTFHTLQIRSRGVPYASCTATQSFSTFR,mutated_sequence,1.0,1698.0,NP_055595.2.a2m,NP_055595.2.npy,ClinVar
+NP_055637.2,NP_055637.2.csv,MSQQGYVATPPYSQPQPGIGLSPPHYGHYGDPSHTASPTGMMKPAGPLGATATRGMLPPGPPPPGPHQFGQNGAHATGHPPQRFPGPPPVNNVASSHAPYQPSAQSSYPGPISTSSVTQLGSQLSAMQINSYGSGMAPPSQGPPGPLSATSLQTPPRPPQPSILQPGSQVLPPPPTTLNGPGASPLPLPMYRPDGLSGPPPPNAQYQPPPLPGQTLGAGYPPQQANSGPQMAGAQLSYPGGFPGGPAQMAGPPQPQKKLDPDSIPSPIQVIENDRASRGGQVYATNTRGQIPPLVTTDCMIQDQGNASPRFIRCTTYCFPCTSDMAKQAQIPLAAVIKPFATIPSNESPLYLVNHGESGPVRCNRCKAYMCPFMQFIEGGRRYQCGFCNCVNDVPPFYFQHLDHIGRRLDHYEKPELSLGSYEYVATLDYCRKSKPPNPPAFIFMIDVSYSNIKNGLVKLICEELKTMLEKIPKEEQEETSAIRVGFITYNKVLHFFNVKSNLAQPQMMVVTDVGEVFVPLLDGFLVNYQESQSVIHNLLDQIPDMFADSNENETVFAPVIQAGMEALKAADCPGKLFIFHSSLPTAEAPGKLKNRDDKKLVNTDKEKILFQPQTNVYDSLAKDCVAHGCSVTLFLFPSQYVDVASLGLVPQLTGGTLYKYNNFQMHLDRQQFLNDLRNDIEKKIGFDAIMRVRTSTGFRATDFFGGILMNNTTDVEMAAIDCDKAVTVEFKHDDKLSEDSGALIQCAVLYTTISGQRRLRIHNLGLNCSSQLADLYKSCETDALINFFAKSAFKAVLHQPLKVIREILVNQTAHMLACYRKNCASPSAASQLILPDSMKVLPVYMNCLLKNCVLLSRPEISTDERAYQRQLVMTMGVADSQLFFYPQLLPIHTLDVKSTMLPAAVRCSESRLSEEGIFLLANGLHMFLWLGVSSPPELIQGIFNVPSFAHINTDMTLLPEVGNPYSQQLRMIMGIIQQKRPYSMKLTIVKQREQPEMVFRQFLVEDKGLYGGSSYVDFLCCVHKEICQLLN,mutated_sequence,1.0,1032.0,NP_055637.2.a2m,NP_055637.2.npy,ClinVar
+NP_055690.1,NP_055690.1.csv,MSLHSTHNRNNSGDILDIPSSQNSSSLNALTHSSRLKLHLKSDMSECENDDPLLRSAGKVRDINRTYVISASRKTADMPLTPNPVGRLALQRRTTRNKESSLLVSELEDTTEKTAETRLTLQRRAKTDSAEKWKTAEIDSVKMTLNVGGETENNGVSKESRTNVRIVNNAKNSFVASSVPLDEDPQVIEMMADKKYKETFSAPSRANENVALKYSSNRPPIASLSQTEVVRSGHLTTKPTQSKLDIKVLGTGNLYHRSIGKEIAKTSNKFGSLEKRTPTKCTTEHKLTTKCSLPQLKSPAPSILKNRMSNLQVKQRPKSSFLANKQERSAENTILPEEETVVQNTSAGKDPLKVENSQVTVAVRVRPFTKREKIEKASQVVFMSGKEITVEHPDTKQVYNFIYDVSFWSFDECHPHYASQTTVYEKLAAPLLERAFEGFNTCLFAYGQTGSGKSYTMMGFSEEPGIIPRFCEDLFSQVARKQTQEVSYHIEMSFFEVYNEKIHDLLVCKDENGQRKQPLRVREHPVYGPYVEALSMNIVSSYADIQSWLELGNKQRATAATGMNDKSSRSHSVFTLVMTQTKTEFVEGEEHDHRITSRINLIDLAGSERCSTAHTNGDRLKEGVSINKSLLTLGKVISALSEQANQRSVFIPYRESVLTWLLKESLGGNSKTAMIATISPAASNIEETLSTLRYANQARLIVNIAKVNEDMNAKLIRELKAEIAKLKAAQRNSRNIDPERYRLCRQEITSLRMKLHQQERDMAEMQRVWKEKFEQAEKRKLQETKELQKAGIMFQMDNHLPNLVNLNEDPQLSEMLLYMIKEGTTTVGKYKPNSSHDIQLSGVLIADDHCTIKNFGGTVSIIPVGEAKTYVNGKHILEITVLRHGDRVILGGDHYFRFNHPVEVQKGKRPSGRDTPISEGPKDFEFAKNELLMAQRSQLEAEIKEAQLKAKEEMMQGIQIAKEMAQQELSSQKAAYESKIKALEAELREESQRKKMQEINNQKANHKIEELEKAKQHLEQEIYVNKKRLEMETLATKQALEDHSIRHARILEALETEKQKIAKEVQILQQNRNNRDKTFTVQTTWSSMKLSMMIQEANAISSKLKTYYVFGRHDISDKSSSDTSIRVRNLKLGISTFWSLEKFESKLAAMKELYESNGSNRGEDAFCDPEDEWEPDITDAPVSSLSRRRSRSLMKNRRISGCLHDIQVHPIKNLHSSHSSGLMDKSSTIYSNSAESFLPGICKELIGSSLDFFGQSYDEERTIADSLINSFLKIYNGLFAISKAHEEQDEESQDNLFSSDRAIQSLTIQTACAFEQLVVLMKHWLSDLLPCTNIARLEDELRQEVKKLGGYLQLFLQGCCLDISSMIKEAQKNAIQIVQQAVKYVGQLAVLKGSKLHFLENGNNKAASVQEEFMDAVCDGVGLGMKILLDSGLEKAKELQHELFRQCTKNEVTKEMKTNAMGLIRSLENIFAESKIKSFRRQVQEENFEYQDFKRMVNRAPEFLKLKHCLEKAIEIIISALKGCHSDINLLQTCVESIRNLASDFYSDFSVPSTSVGSYESRVTHIVHQELESLAKSLLFCFESEESPDLLKPWETYNQNTKEEHQQSKSSGIDGSKNKGVPKRVYELHGSSPAVSSEECTPSRIQWV,mutated_sequence,1.0,1648.0,NP_055690.1.a2m,NP_055690.1.npy,ClinVar
+NP_055761.2,NP_055761.2.csv,MNSPGGRGKKKGSGGASNPVPPRPPPPCLAPAPPAAGPAPPPESPHKRNLYYFSYPLFVGFALLRLVAFHLGLLFVWLCQRFSRALMAAKRSSGAAPAPASASAPAPVPGGEAERVRVFHKQAFEYISIALRIDEDEKAGQKEQAVEWYKKGIEELEKGIAVIVTGQGEQCERARRLQAKMMTNLVMAKDRLQLLEKMQPVLPFSKSQTDVYNDSTNLACRNGHLQSESGAVPKRKDPLTHTSNSLPRSKTVMKTGSAGLSGHHRAPSYSGLSMVSGVKQGSGPAPTTHKGTPKTNRTNKPSTPTTATRKKKDLKNFRNVDSNLANLIMNEIVDNGTAVKFDDIAGQDLAKQALQEIVILPSLRPELFTGLRAPARGLLLFGPPGNGKTMLAKAVAAESNATFFNISAASLTSKYVGEGEKLVRALFAVARELQPSIIFIDEVDSLLCERREGEHDASRRLKTEFLIEFDGVQSAGDDRVLVMGATNRPQELDEAVLRRFIKRVYVSLPNEETRLLLLKNLLCKQGSPLTQKELAQLARMTDGYSGSDLTALAKDAALGPIRELKPEQVKNMSASEMRNIRLSDFTESLKKIKRSVSPQTLEAYIRWNKDFGDTTV,mutated_sequence,1.0,616.0,NP_055761.2.a2m,NP_055761.2.npy,ClinVar
+NP_055790.1,NP_055790.1.csv,MSDSLWTALSNFSMPSFPGGSMFRRTKSCRTSNRKSLILTSTSPTLPRPHSPLPGHLGSSPLDSPRNFSPNTPAHFSFASSRRADGRRWSLASLPSSGYGTNTPSSTVSSSCSSQERLHQLPYQPTVDELHFLSKHFGSTESITDEDGGRRSPAVRPRSRSLSPGRSPSSYDNEIVMMNHVYKERFPKATAQMEEKLRDFTRAYEPDSVLPLADGVLSFIHHQIIELARDCLTKSRDGLITTVYFYELQENLEKLLQDAYERSESLEVAFVTQLVKKLLIIISRPARLLECLEFNPEEFYHLLEAAEGHAKEGHLVKTDIPRYIIRQLGLTRDPFPDVVHLEEQDSGGSNTPEQDDLSEGRSSKAKKPPGENDFDTIKLISNGAYGAVYLVRHRDTRQRFAMKKINKQNLILRNQIQQAFVERDILTFAENPFVVGMFCSFETRRHLCMVMEYVEGGDCATLLKNIGALPVEMARMYFAETVLALEYLHNYGIVHRDLKPDNLLITSMGHIKLTDFGLSKMGLMSLTTNLYEGHIEKDAREFLDKQVCGTPEYIAPEVILRQGYGKPVDWWAMGIILYEFLVGCVPFFGDTPEELFGQVISDDILWPEGDEALPTEAQLLISSLLQTNPLVRLGAGGAFEVKQHSFFRDLDWTGLLRQKAEFIPHLESEDDTSYFDTRSDRYHHVNSYDEDDTTEEEPVEIRQFSSCSPRFSKVYSSMEQLSQHEPKTPVAAAGSSKREPSTKGPEEKVAGKREGLGGLTLREKTWRGGSPEIKRFSASEASFLEGEASPPLGARRRFSALLEPSRFSAPQEDEDEARLRRPPRPSSDPAGSLDARAPKEETQGEGTSSAGDSEATDRPRPGDLCPPSKDGDASGPRATNDLVLRRARHQQMSGDVAVEKRPSRTGGKVIKSASATALSVMIPAVDPHGSSPLASPMSPRSLSSNPSSRDSSPSRDYSPAVSGLRSPITIQRSGKKYGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLCAGDLITHVNGEPVHGMVHPEVVELILKSGNKVAVTTTPFENTSIRIGPARRSSYKAKMARRNKRPSAKEGQESKKRSSLFRKITKQSNLLHTSRSLSSLNRSLSSSDSLPGSPTHGLPARSPTHSYRSTPDSAYLGASSQSSSPASSTPNSPASSASHHIRPSTLHGLSPKLHRQYRSARCKSAGNIPLSPLAHTPSPTQASPPPLPGHTVGSSHTTQSFPAKLHSSPPVVRPRPKSAEPPRSPLLKRVQSAEKLGASLSADKKGALRKHSLEVGHPDFRKDFHGELALHSLAESDGETPPVEGLGAPRQVAVRRLGRQESPLSLGADPLLPEGASRPPVSSKEKESPGGAEACTPPRATTPGGRTLERDVGCTRHQSVQTEDGTGGMARAVAKAALSPVQEHETGRRSSSGEAGTPLVPIVVEPARPGAKAVVPQPLGADSKGLQEPAPLAPSVPEAPRGRERWVLEVVEERTTLSGPRSKPASPKLSPEPQTPSLAPAKCSAPSSAVTPVPPASLLGSGTKPQVGLTSRCPAEAVPPAGLTKKGVSSPAPPGP,mutated_sequence,1.0,1570.0,NP_055790.1.a2m,NP_055790.1.npy,ClinVar
+NP_055861.3,NP_055861.3.csv,MSTCCWCTPGGASTIDFLKRYASNTPSGEFQTADEDLCYCLECVAEYHKARDELPFLHEVLWELETLRLINHFEKSMKAEIGDDDELYIVDNNGEMPLFDITGQDFENKLRVPLLEILKYPYLLLHERVNELCVEALCRMEQANCSFQVFDKHPGIYLFLVHPNEMVRRWAILTARNLGKVDRDDYYDLQEVLLCLFKVIELGLLESPDIYTSSVLEKGKLILLPSHMYDTTNYKSYWLGICMLLTILEEQAMDSLLLGSDKQNDFMQSILHTMEREADDDSVDPFWPALHCFMVILDRLGSKVWGQLMDPIVAFQTIINNASYNREIRHIRNSSVRTKLEPESYLDDMVTCSQIVYNYNPEKTKKDSGWRTAICPDYCPNMYEEMETLASVLQSDIGQDMRVHNSTFLWFIPFVQSLMDLKDLGVAYIAQVVNHLYSEVKEVLNQTDAVCDKVTEFFLLILVSVIELHRNKKCLHLLWVSSQQWVEAVVKCAKLPTTAFTRSSEKSSGNCSKGTAMISSLSLHSMPSNSVQLAYVQLIRSLLKEGYQLGQQSLCKRFWDKLNLFLRGNLSLGWQLTSQETHELQSCLKQIIRNIKFKAPPCNTFVDLTSACKISPASYNKEESEQMGKTSRKDMHCLEASSPTFSKEPMKVQDSVLIKADNTIEGDNNEQNYIKDVKLEDHLLAGSCLKQSSKNIFTERAEDQIKISTRKQKSVKEISSYTPKDCTSRNGPERGCDRGIIVSTRLLTDSSTDALEKVSTSNEDFSLKDDALAKTSKRKTKVQKDEICAKLSHVIKKQHRKSTLVDNTINLDENLTVSNIESFYSRKDTGVQKGDGFIHNLSLDPSGVLDDKNGEQKSQNNVLPKEKQLKNEELVIFSFHENNCKIQEFHVDGKELIPFTEMTNASEKKSSPFKDLMTVPESRDEEMSNSTSVIYSNLTREQAPDISPKSDTLTDSQIDRDLHKLSLLAQASVITFPSDSPQNSSQLQRKVKEDKRCFTANQNNVGDTSRGQVIIISDSDDDDDERILSLEKLTKQDKICLEREHPEQHVSTVNSKEEKNPVKEEKTETLFQFEESDSQCFEFESSSEVFSVWQDHPDDNNSVQDGEKKCLAPIANTTNGQGCTDYVSEVVKKGAEGIEEHTRPRSISVEEFCEIEVKKPKRKRSEKPMAEDPVRPSSSVRNEGQSDTNKRDLVGNDFKSIDRRTSTPNSRIQRATTVSQKKSSKLCTCTEPIRKVPVSKTPKKTHSDAKKGQNRSSNYLSCRTTPAIVPPKKFRQCPEPTSTAEKLGLKKGPRKAYELSQRSLDYVAQLRDHGKTVGVVDTRKKTKLISPQNLSVRNNKKLLTSQELQMQRQIRPKSQKNRRRLSDCESTDVKRAGSHTAQNSDIFVPESDRSDYNCTGGTEVLANSNRKQLIKCMPSEPETIKAKHGSPATDDACPLNQCDSVVLNGTVPTNEVIVSTSEDPLGGGDPTARHIEMAALKEGEPDSSSDAEEDNLFLTQNDPEDMDLCSQMENDNYKLIELIHGKDTVEVEEDSVSRPQLESLSGTKCKYKDCLETTKNQGEYCPKHSEVKAADEDVFRKPGLPPPASKPLRPTTKIFSSKSTSRIAGLSKSLETSSALSPSLKNKSKGIQSILKVPQPVPLIAQKPVGEMKNSCNVLHPQSPNNSNRQGCKVPFGESKYFPSSSPVNILLSSQSVSDTFVKEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNYGDYFNVFFPLMVLNTFETVAQEWLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELAKQLYPKENDLVFLAPERINEEKKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLSIQTQENFPANLNELVNCIVISSLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRALCRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTNRQTEAIRCSSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKDKRKDVSFRNIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDAFQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERPPVHDQLQDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVVHQDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEGEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL,mutated_sequence,1.0,2677.0,NP_055861.3.a2m,NP_055861.3.npy,ClinVar
+NP_055907.3,NP_055907.3.csv,MSRRAPGSRLSSGGGGGGTKYPRSWNDWQPRTDSASADPDNLKYSSSRDRGGSSSYGLQPSNSAVVSRQRHDDTRVHADIQNDEKGGYSVNGGSGENTYGRKSLGQELRVNNVTSPEFTSVQHGSRALATKDMRKSQERSMSYSDESRLSNLLRRITREDDRDRRLATVKQLKEFIQQPENKLVLVKQLDNILAAVHDVLNESSKLLQELRQEGACCLGLLCASLSYEAEKIFKWIFSKFSSSAKDEVKLLYLCATYKALETVGEKKAFSSVMQLVMTSLQSILENVDTPELLCKCVKCILLVARCYPHIFSTNFRDTVDILVGWHIDHTQKPSLTQQVSGWLQSLEPFWVADLAFSTTLLGQFLEDMEAYAEDLSHVASGESVDEDVPPPSVSLPKLAALLRVFSTVVRSIGERFSPIRGPPITEAYVTDVLYRVMRCVTAANQVFFSEAVLTAANECVGVLLGSLDPSMTIHCDMVITYGLDQLENCQTCGTDYIISVLNLLTLIVEQINTKLPSSFVEKLFIPSSKLLFLRYHKEKEVVAVAHAVYQAVLSLKNIPVLETAYKLILGEMTCALNNLLHSLQLPEACSEIKHEAFKNHVFNVDNAKFVVIFDLSALTTIGNAKNSLIGMWALSPTVFALLSKNLMIVHSDLAVHFPAIQYAVLYTLYSHCTRHDHFISSSLSSSSPSLFDGAVISTVTTATKKHFSIILNLLGILLKKDNLNQDTRKLLMTWALEAAVLMKKSETYAPLFSLPSFHKFCKGLLANTLVEDVNICLQACSSLHALSSSLPDDLLQRCVDVCRVQLVHSGTRIRQAFGKLLKSIPLDVVLSNNNHTEIQEISLALRSHMSKAPSNTFHPQDFSDVISFILYGNSHRTGKDNWLERLFYSCQRLDKRDQSTIPRNLLKTDAVLWQWAIWEAAQFTVLSKLRTPLGRAQDTFQTIEGIIRSLAAHTLNPDQDVSQWTTADNDEGHGNNQLRLVLLLQYLENLEKLMYNAYEGCANALTSPPKVIRTFFYTNRQTCQDWLTRIRLSIMRVGLLAGQPAVTVRHGFDLLTEMKTTSLSQGNELEVTIMMVVEALCELHCPEAIQGIAVWSSSIVGKNLLWINSVAQQAEGRFEKASVEYQEHLCAMTGVDCCISSFDKSVLTLANAGRNSASPKHSLNGESRKTVLSKPTDSSPEVINYLGNKACECYISIADWAAVQEWQNAIHDLKKSTSSTSLNLKADFNYIKSLSSFESGKFVECTEQLELLPGENINLLAGGSKEKIDMKKLLPNMLSPDPRELQKSIEVQLLRSSVCLATALNPIEQDQKWQSITENVVKYLKQTSRIAIGPLRLSTLTVSQSLPVLSTLQLYCSSALENTVSNRLSTEDCLIPLFSEALRSCKQHDVRPWMQALRYTMYQNQLLEKIKEQTVPIRSHLMELGLTAAKFARKRGNVSLATRLLAQCSEVQLGKTTTAQDLVQHFKKLSTQGQVDEKWGPELDIEKTKLLYTAGQSTHAMEMLSSCAISFCKSVKAEYAVAKSILTLAKWIQAEWKEISGQLKQVYRAQHQQNFTGLSTLSKNILTLIELPSVNTMEEEYPRIESESTVHIGVGEPDFILGQLYHLSSVQAPEVAKSWAALASWAYRWGRKVVDNASQGEGVRLLPREKSEVQNLLPDTITEEEKERIYGILGQAVCRPAGIQDEDITLQITESEDNEEDDMVDVIWRQLISSCPWLSELDESATEGVIKVWRKVVDRIFSLYKLSCSAYFTFLKLNAGQIPLDEDDPRLHLSHRVEQSTDDMIVMATLRLLRLLVKHAGELRQYLEHGLETTPTAPWRGIIPQLFSRLNHPEVYVRQSICNLLCRVAQDSPHLILYPAIVGTISLSSESQASGNKFSTAIPTLLGNIQGEELLVSECEGGSPPASQDSNKDEPKSGLNEDQAMMQDCYSKIVDKLSSANPTMVLQVQMLVAELRRVTVLWDELWLGVLLQQHMYVLRRIQQLEDEVKRVQNNNTLRKEEKIAIMREKHTALMKPIVFALEHVRSITAAPAETPHEKWFQDNYGDAIENALEKLKTPLNPAKPGSSWIPFKEIMLSLQQRAQKRASYILRLEEISPWLAAMTNTEIALPGEVSARDTVTIHSVGGTITILPTKTKPKKLLFLGSDGKSYPYLFKGLEDLHLDERIMQFLSIVNTMFATINRQETPRFHARHYSVTPLGTRSGLIQWVDGATPLFGLYKRWQQREAALQAQKAQDSYQTPQNPGIVPRPSELYYSKIGPALKTVGLSLDVSRRDWPLHVMKAVLEELMEATPPNLLAKELWSSCTTPDEWWRVTQSYARSTAVMSMVGYIIGLGDRHLDNVLIDMTTGEVVHIDYNVCFEKGKSLRVPEKVPFRMTQNIETALGVTGVEGVFRLSCEQVLHIMRRGRETLLTLLEAFVYDPLVDWTAGGEAGFAGAVYGGGGQQAESKQSKREMEREITRSLFSSRVAEIKVNWFKNRDEMLVVLPKLDGSLDEYLSLQEQLTDVEKLQGKLLEEIEFLEGAEGVDHPSHTLQHRYSEHTQLQTQQRAVQEAIQVKLNEFEQWITHYQAAFNNLEATQLASLLQEISTQMDLGPPSYVPATAFLQNAGQAHLISQCEQLEGEVGALLQQRRSVLRGCLEQLHHYATVALQYPKAIFQKHRIEQWKTWMEELICNTTVERCQELYRKYEMQYAPQPPPTVCQFITATEMTLQRYAADINSRLIRQVERLKQEAVTVPVCEDQLKEIERCIKVFLHENGEEGSLSLASVIISALCTLTRRNLMMEGAASSAGEQLVDLTSRDGAWFLEELCSMSGNVTCLVQLLKQCHLVPQDLDIPNPMEASETVHLANGVYTSLQELNSNFRQIIFPEALRCLMKGEYTLESMLHELDGLIEQTTDGVPLQTLVESLQAYLRNAAMGLEEETHAHYIDVARLLHAQYGELIQPRNGSVDETPKMSAGQMLLVAFDGMFAQVETAFSLLVEKLNKMEIPIAWRKIDIIREARSTQVNFFDDDNHRQVLEEIFFLKRLQTIKEFFRLCGTFSKTLSGSSSLEDQNTVNGPVQIVNVKTLFRNSCFSEDQMAKPIKAFTADFVRQLLIGLPNQALGLTLCSFISALGVDIIAQVEAKDFGAESKVSVDDLCKKAVEHNIQIGKFSQLVMNRATVLASSYDTAWKKHDLVRRLETSISSCKTSLQRVQLHIAMFQWQHEDLLINRPQAMSVTPPPRSAILTSMKKKLHTLSQIETSIATVQEKLAALESSIEQRLKWAGGANPALAPVLQDFEATIAERRNLVLKESQRASQVTFLCSNIIHFESLRTRTAEALNLDAALFELIKRCQQMCSFASQFNSSVSELELRLLQRVDTGLEHPIGSSEWLLSAHKQLTQDMSTQRAIQTEKEQQIETVCETIQNLVDNIKTVLTGHNRQLGDVKHLLKAMAKDEEAALADGEDVPYENSVRQFLGEYKSWQDNIQTVLFTLVQAMGQVRSQEHVEMLQEITPTLKELKTQSQSIYNNLVSFASPLVTDATNECSSPTSSATYQPSFAAAVRSNTGQKTQPDVMSQNARKLIQKNLATSADTPPSTVPGTGKSVACSPKKAVRDPKTGKAVQERNSYAVSVWKRVKAKLEGRDVDPNRRMSVAEQVDYVIKEATNLDNLAQLYEGWTAWV,mutated_sequence,1.0,3661.0,NP_055907.3.a2m,NP_055907.3.npy,ClinVar
+NP_056110.2,NP_056110.2.csv,MAAADGGGPGGASVGTEEDGGGVGHRTVYLFDRREKESELGDRPLQVGERSDYAGFRACVCQTLGISPEEKFVITTTSRKEITCDNFDETVKDGVTLYLLQSVNQLLLTATKERIDFLPHYDTLVKSGMYEYYASEGQNPLPFALAELIDNSLSATSRNIGVRRIQIKLLFDETQGKPAVAVIDNGRGMTSKQLNNWAVYRLSKFTRQGDFESDHSGYVRPVPVPRSLNSDISYFGVGGKQAVFFVGQSARMISKPADSQDVHELVLSKEDFEKKEKNKEAIYSGYIRNRKPSDSVHITNDDERFLHHLIIEEKEKDSFTAVVITGVQPEHIQYLKNYFHLWTRQLAHIYHYYIHGPKGNEIRTSKEVEPFNNIDIEISMFEKGKVPKIVNLREIQDDMQTLYVNTAADSFEFKAHVEGDGVVEGIIRYHPFLYDRETYPDDPCFPSKLKDEDDEDDCFILEKAARGKRPIFECFWNGRLIPYTSVEDFDWCTPPKKRGLAPIECYNRISGALFTNDKFQVSTNKLTFMDLELKLKDKNTLFTRILNGQEQRMKIDREFALWLKDCHEKYDKQIKFTLFKGVITRPDLPSKKQGPWATYAAIEWDGKIYKAGQLVKTIKTLPLFYGSIVRFFLYGDHDGEVYATGGEVQIAMEPQALYDEVRTVPIAKLDRTVAEKAVKKYVEDEMARLPDRLSVTWPEGDELLPNEVRPAGTPIGALRIEILNKKGEAMQKLPGTSHGGSKKLLVELKVILHSSSGNKEIISHISQHGGKWPYWFKKMENIQKLGNYTLKLQVVLNESNADTYAGRPLPSKAIKFSVKEGKPEKFSFGLLDLPFRVGVPFNIPLEFQDEFGHTSQLVTDIQPVLEASGLSLHYEEITKGPNCVIRGVTAKGPVNSCQGKNYNLKVTLPGLKEDSQILKIRLLPGHPRRLKVKPDSEILVIENGTAFPFQVEVLDESDNITAQPKLIVHCKFSGAPNLPVYVVDCSSSGTSILTGSAIQVQNIKKDQTLKARIEIPSCKDVAPVEKTIKLLPSSHVARLQIFSVEGQKAIQIKHQDEVNWIAGDIMHNLIFQMYDEGEREINITSALAEKIKVNWTPEINKEHLLQGLLPDVQVPTSVKDMRYCQVSFQDDHVSLESAFTVRPLPDEPKHLKCEMKGGKTVQMGQELQGEVVIIITDQYGNQIQAFSPSSLSSLSIAGVGLDSSNLKTTFQENTQSISVRGIKFIPGPPGNKDLCFTWREFSDFIRVQLISGPPAKLLLIDWPELKESIPVINGRDLQNPIIVQLCDQWDNPAPVQHVKISLTKASNLKLMPSNQQHKTDEKGRANLGVFSVFAPRGEHTLQVKAIYNKSIIEGPIIKLMILPDPEKPVRLNVKYDKDASFLAGGLFTDFMISVISEDDSIIKNINPARISMKMWKLSTSGNRPPANAETFSCNKIKDNDKEDGCFYFRDKVIPNKVGTYCIQFGFMMDKTNILNSEQVIVEVLPNQPVKLVPKIKPPTPAVSNVRSVASRTLVRDLHLSITDDYDNHTGIDLVGTIIATIKGSNEEDTDTPLFIGKVRTLEFPFVNGSAEIMSLVLAESSPGRDSTEYFIVFEPRLPLLSRTLEPYILPFMFYNDVKKQQQMAALTKEKDQLSQSIVMYKSLFEASQQLLNEMKCQVEEARLKEAQLRNELKIHNIDIPTTQQVPHIEALLKRKLSEQEELKKKPRRSCTLPNYTKGSGDVLGKIAHLAQIEDDRAAMVISWHLASDMDCVVTLTTDAARRIYDETQGRQQVLPLDSIYKKTLPDWKRSLPHFRNGKLYFKPIGDPVFARDLLTFPDNVEHCETVFGMLLGDTIILDNLDAANHYRKEVVKITHCPTLLTRDGDRIRSNGKFGGLQNKAPPMDKLRGMVFGAPVPKQCLILGEQIDLLQQYRSAVCKLDSVNKDLNSQLEYLRTPDMRKKKQELDEHEKNLKLIEEKLGMTPIRKCNDSLRHSPKVETTDCPVPPKRMRREATRQNRIITKTDV,mutated_sequence,1.0,2005.0,NP_056110.2.a2m,NP_056110.2.npy,ClinVar
+NP_056145.5,NP_056145.5.csv,MKKASRSVGSVPKVSAISKTQTAEKIKPENSSSASTGGKLVKPGTAASLSKTKSSDDLLAGMAGGVTVTNGVKGKKSTCPSAAPSASAPAMTTVENKSKISTGTASSTKRSTSTGNKESSSTRERLRERTRLNQSKKLPSAGQGANDMALAKRSRSRTATECDVRMSKSKSDNQISDRAALEAKVKDLLTLAKTKDVEILHLRNELRDMRAQLGINEDHSEGDEKSEKETIMAHQPTDVESTLLQLQEQNTAIREELNQLKNENRMLKDRLNALGFSLEQRLDNSEKLFGYQSLSPEITPGNQSDGGGTLTSSVEGSAPGSVEDLLSQDENTLMDHQHSNSMDNLDSECSEVYQPLTSSDDALDAPSSSESEGIPSIERSRKGSSGNASEVSVACLTERIHQMEENQHSTSEELQATLQELADLQQITQELNSENERLGEEKVILMESLCQQSDKLEHFSRQIEYFRSLLDEHHISYVIDEDVKSGRYMELEQRYMDLAENARFEREQLLGVQQHLSNTLKMAEQDNKEAQEMIGALKERSHHMERIIESEQKGKAALAATLEEYKATVASDQIEMNRLKAQLENEKQKVAELYSIHNSGDKSDIQDLLESVRLDKEKAETLASSLQEDLAHTRNDANRLQDAIAKVEDEYRAFQEEAKKQIEDLNMTLEKLRSDLDEKETERSDMKETIFELEDEVEQHRAVKLHDNLIISDLENTVKKLQDQKHDMEREIKTLHRRLREESAEWRQFQADLQTAVVIANDIKSEAQEEIGDLKRRLHEAQEKNEKLTKELEEIKSRKQEEERGRVYNYMNAVERDLAALRQGMGLSRRSSTSSEPTPTVKTLIKSFDSASQVPNPAAAAIPRTPLSPSPMKTPPAAAVSPMQRHSISGPISTSKPLTALSDKRPNYGEIPVQEHLLRTSSASRPASLPRVPAMESAKTLSVSRRSSEEVKRDISAQEGASPASLMAMGTTSPQLSLSSSPTASVTPTTRSRIREERKDPLSALAREYGGSKRNALLKWCQKKTEGYQNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFMLAFQAAESVGIKSTLDINEMVRTERPDWQNVMLYVTAIYKYFET,mutated_sequence,1.0,1117.0,NP_056145.5.a2m,NP_056145.5.npy,ClinVar
+NP_056161.2,NP_056161.2.csv,MNHPFGKEEAASQKQLFGFFCECLRRGEWELAQACVPQLQEGQGDIPKRVEDILQALVVCPNLLRCGQDINPQRVAWVWLLVLEKWLAREKKLLPVVFRRKLEFLLLSEDLQGDIPENILEELYETLTQGAVGHVPDGNPRRESWTPRLSSEAVSVLWDLLRQSPQPAQALLELLLEEDDGTGLCHWPLQNALVDLIRKALRALQGPDSVPPGVVDAIYGALRTLRCPAEPLGVELHLLCEELLEACRTEGSPLREERLLSCLLHKASRGLLSLYGHTYAEKVTEKPPRATASGKVSPDHLDPERAMLALFSNPNPAEAWKVAYFYCLSNNKHFLEQILVTALTLLKEEDFPNLGCLLDREFRPLSCLLVLLGWTHCQSLESAKRLLQTLHRTQGPGCDELLRDACDGLWAHLEVLEWCIQQSSNPIPKRDLLYHLHGGDSHSVLYTLHHLTNLPALREEDVLKLLQKVPAKDPQQEPDAVDAPVPEHLSQCQNLTLYQGFCAMKYAIYALCVNSHQHSQCQDCKDSLSEDLASATEPANDSLSSPGAANLFSTYLARCQQYLCSIPDSLCLELLENIFSLLLITSADLHPEPHLPEDYAEDDDIEGKSPSGLRSPSESPQHIAHPERKSERGSLGVPKTLAYTMPSHVKAEPKDSYPGPHRHSFLDLKHFTSGISGFLADEFAIGAFLRLLQEQLDEISSRSPPEKPKQESQSCSGSRDGLQSRLHRLSKVVSEAQWRHKVVTSNHRSEEQPSRRYQPATRHPSLRRGRRTRRSQADGRDRGSNPSLESTSSELSTSTSEGSLSAMSGRNELHSRLHPHPQSSLIPMMFSPPESLLASCILRGNFAEAHQVLFTFNLKSSPSSGELMFMERYQEVIQELAQVEHKIENQNSDAGSSTIRRTGSGRSTLQAIGSAAAAGMVFYSISDVTDKLLNTSGDPIPMLQEDFWISTALVEPTAPLREVLEDLSPPAMAAFDLACSQCQLWKTCKQLLETAERRLNSSLERRGRRIDHVLLNADGIRGFPVVLQQISKSLNYLLMSASQTKSESVEEKGGGPPRCSITELLQMCWPSLSEDCVASHTTLSQQLDQVLQSLREALELPEPRTPPLSSLVEQAAQKAPEAEAHPVQIQTQLLQKNLGKQTPSGSRQMDYLGTFFSYCSTLAAVLLQSLSSEPDHVEVKVGNPFVLLQQSSSQLVSHLLFERQVPPERLAALLAQENLSLSVPQVIVSCCCEPLALCSSRQSQQTSSLLTRLGTLAQLHASHCLDDLPLSTPSSPRTTENPTLERKPYSSPRDSSLPALTSSALAFLKSRSKLLATVACLGASPRLKVSKPSLSWKELRGRREVPLAAEQVARECERLLEQFPLFEAFLLAAWEPLRGSLQQGQSLAVNLCGWASLSTVLLGLHSPIALDVLSEAFEESLVARDWSRALQLTEVYGRDVDDLSSIKDAVLSCAVACDKEGWQYLFPVKDASLRSRLALQFVDRWPLESCLEILAYCISDTAVQEGLKCELQRKLAELQVYQKILGLQSPPVWCDWQTLRSCCVEDPSTVMNMILEAQEYELCEEWGCLYPIPREHLISLHQKHLLHLLERRDHDKALQLLRRIPDPTMCLEVTEQSLDQHTSLATSHFLANYLTTHFYGQLTAVRHREIQALYVGSKILLTLPEQHRASYSHLSSNPLFMLEQLLMNMKVDWATVAVQTLQQLLVGQEIGFTMDEVDSLLSRYAEKALDFPYPQREKRSDSVIHLQEIVHQAADPETLPRSPSAEFSPAAPPGISSIHSPSLRERSFPPTQPSQEFVPPATPPARHQWVPDETESICMVCCREHFTMFNRRHHCRRCGRLVCSSCSTKKMVVEGCRENPARVCDQCYSYCNKDVPEEPSEKPEALDSSKNESPPYSFVVRVPKADEVEWILDLKEEENELVRSEFYYEQAPSASLCIAILNLHRDSIACGHQLIEHCCRLSKGLTNPEVDAGLLTDIMKQLLFSAKMMFVKAGQSQDLALCDSYISKVDVLNILVAAAYRHVPSLDQILQPAAVTRLRNQLLEAEYYQLGVEVSTKTGLDTTGAWHAWGMACLKAGNLTAAREKFSRCLKPPFDLNQLNHGSRLVQDVVEYLESTVRPFVSLQDDDYFATLRELEATLRTQSLSLAVIPEGKIMNNTYYQECLFYLHNYSTNLAIISFYVRHSCLREALLHLLNKESPPEVFIEGIFQPSYKSGKLHTLENLLESIDPTLESWGKYLIAACQHLQKKNYYHILYELQQFMKDQVRAAMTCIRFFSHKAKSYTELGEKLSWLLKAKDHLKIYLQETSRSSGRKKTTFFRKKMTAADVSRHMNTLQLQMEVTRFLHRCESAGTSQITTLPLPTLFGNNHMKMDVACKVMLGGKNVEDGFGIAFRVLQDFQLDAAMTYCRAARQLVEKEKYSEIQQLLKCVSESGMAAKSDGDTILLNCLEAFKRIPPQELEGLIQAIHNDDNKVRAYLICCKLRSAYLIAVKQEHSRATALVQQVQQAAKSSGDAVVQDICAQWLLTSHPRGAHGPGSRK,mutated_sequence,1.0,2539.0,NP_056161.2.a2m,NP_056161.2.npy,ClinVar
+NP_056183.2,NP_056183.2.csv,MAIAQLATEYVFSDFLLKEPTEPKFKGLRLELAVDKMVTCIAVGLPLLLISLAFAQEISIGTQISCFSPSSFSWRQAAFVDSYCWAAVQQKNSLQSESGNLPLWLHKFFPYILLLFAILLYLPPLFWRFAAAPHICSDLKFIMEELDKVYNRAIKAAKSARDLDMRDGACSVPGVTENLGQSLWEVSESHFKYPIVEQYLKTKKNSNNLIIKYISCRLLTLIIILLACIYLGYYFSLSSLSDEFVCSIKSGILRNDSTVPDQFQCKLIAVGIFQLLSVINLVVYVLLAPVVVYTLFVPFRQKTDVLKVYEILPTFDVLHFKSEGYNDLSLYNLFLEENISEVKSYKCLKVLENIKSSGQGIDPMLLLTNLGMIKMDVVDGKTPMSAEMREEQGNQTAELQGMNIDSETKANNGEKNARQRLLDSSC,mutated_sequence,1.0,426.0,NP_056183.2.a2m,NP_056183.2.npy,ClinVar
+NP_056280.2,NP_056280.2.csv,MGQEPRTLPPSPNWYCARCSDAVPGGLFGFAARTSVFLVRVGPGAGESPGTPPFRVIGELVGHTERVSGFTFSHHPGQYNLCATSSDDGTVKIWDVETKTVVTEHALHQHTISTLHWSPRVKDLIVSGDEKGVVFCYWFNRNDSQHLFIEPRTIFCLTCSPHHEDLVAIGYKDGIVVIIDISKKGEVIHRLRGHDDEIHSIAWCPLPGEDCLSINQEETSEEAEITNGNAVAQAPVTKGCYLATGSKDQTIRIWSCSRGRGVMILKLPFLKRRGGGIDPTVKERLWLTLHWPSNQPTQLVSSCFGGELLQWDLTQSWRRKYTLFSASSEGQNHSRIVFNLCPLQTEDDKQLLLSTSMDRDVKCWDIATLECSWTLPSLGGFAYSLAFSSVDIGSLAIGVGDGMIRVWNTLSIKNNYDVKNFWQGVKSKVTALCWHPTKEGCLAFGTDDGKVGLYDTYSNKPPQISSTYHKKTVYTLAWGPPVPPMSLGGEGDRPSLALYSCGGEGIVLQHNPWKLSGEAFDINKLIRDTNSIKYKLPVHTEISWKADGKIMALGNEDGSIEIFQIPNLKLICTIQQHHKLVNTISWHHEHGSQPELSYLMASGSNNAVIYVHNLKTVIESSPESPVTITEPYRTLSGHTAKITSVAWSPHHDGRLVSASYDGTAQVWDALREEPLCNFRGHRGRLLCVAWSPLDPDCIYSGADDFCVHKWLTSMQDHSRPPQGKKSIELEKKRLSQPKAKPKKKKKPTLRTPVKLESIDGNEEESMKENSGPVENGVSDQEGEEQAREPELPCGLAPAVSREPVICTPVSSGFEKSKVTINNKVILLKKEPPKEKPETLIKKRKARSLLPLSTSLDHRSKEELHQDCLVLATAKHSRELNEDVSADVEERFHLGLFTDRATLYRMIDIEGKGHLENGHPELFHQLMLWKGDLKGVLQTAAERGELTDNLVAMAPAAGYHVWLWAVEAFAKQLCFQDQYVKAASHLLSIHKVYEAVELLKSNHFYREAIAIAKARLRPEDPVLKDLYLSWGTVLERDGHYAVAAKCYLGATCAYDAAKVLAKKGDAASLRTAAELAAIVGEDELSASLALRCAQELLLANNWVGAQEALQLHESLQGQRLVFCLLELLSRHLEEKQLSEGKSSSSYHTWNTGTEGPFVERVTAVWKSIFSLDTPEQYQEAFQKLQNIKYPSATNNTPAKQLLLHICHDLTLAVLSQQMASWDEAVQALLRAVVRSYDSGSFTIMQEVYSAFLPDGCDHLRDKLGDHQSPATPAFKSLEAFFLYGRLYEFWWSLSRPCPNSSVWVRAGHRTLSVEPSQQLDTASTEETDPETSQPEPNRPSELDLRLTEEGERMLSTFKELFSEKHASLQNSQRTVAEVQETLAEMIRQHQKSQLCKSTANGPDKNEPEVEAEQPLCSSQSQCKEEKNEPLSLPELTKRLTEANQRMAKFPESIKAWPFPDVLECCLVLLLIRSHFPGCLAQEMQQQAQELLQKYGNTKTYRRHCQTFCM,mutated_sequence,1.0,1508.0,NP_056280.2.a2m,NP_056280.2.npy,ClinVar
+NP_056321.2,NP_056321.2.csv,MEPKVAELKQKIEDTLCPFGFEVYPFQVAWYNELLPPAFHLPLPGPTLAFLVLSTPAMFDRALKPFLQSCHLRMLTDPVDQCVAYHLGRVRESLPELQIEIIADYEVHPNRRPKILAQTAAHVAGAAYYYQRQDVEADPWGNQRISGVCIHPRFGGWFAIRGVVLLPGIEVPDLPPRKPHDCVPTRADRIALLEGFNFHWRDWTYRDAVTPQERYSEEQKAYFSTPPAQRLALLGLAQPSEKPSSPSPDLPFTTPAPKKPGNPSRARSWLSPRVSPPASPGP,mutated_sequence,1.0,282.0,NP_056321.2.a2m,NP_056321.2.npy,ClinVar
+NP_056414.1,NP_056414.1.csv,MDLGAITKYSALHAKPNGLILQYGTAGFRTKAEHLDHVMFRMGLLAVLRSKQTKSTIGVMVTASHNPEEDNGVKLVDPLGEMLAPSWEEHATCLANAEEQDMQRVLIDISEKEAVNLQQDAFVVIGRDTRPSSEKLSQSVIDGVTVLGGQFHDYGLLTTPQLHYMVYCRNTGGRYGKATIEGYYQKLSKAFVELTKQASCSGDEYRSLKVDCANGIGALKLREMEHYFSQGLSVQLFNDGSKGKLNHLCGADFVKSHQKPPQGMEIKSNERCCSFDGDADRIVYYYHDADGHFHLIDGDKIATLISSFLKELLVEIGESLNIGVVQTAYANGSSTRYLEEVMKVPVYCTKTGVKHLHHKAQEFDIGVYFEANGHGTALFSTAVEMKIKQSAEQLEDKKRKAAKMLENIIDLFNQAAGDAISDMLVIEAILALKGLTVQQWDALYTDLPNRQLKVQVADRRVISTTDAERQAVTPPGLQEAINDLVKKYKLSRAFVRPSGTEDVVRVYAEADSQESADHLAHEVSLAVFQLAGGIGERPQPGF,mutated_sequence,1.0,542.0,NP_056414.1.a2m,NP_056414.1.npy,ClinVar
+NP_056444.3,NP_056444.3.csv,MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMFAEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKRFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLSEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYFSSMAEFLKVKGEKSGLMST,mutated_sequence,1.0,499.0,NP_056444.3.a2m,NP_056444.3.npy,ClinVar
+NP_056477.1,NP_056477.1.csv,MHLKHLRTLLSPQDGAAKVTCMAWSQNNAKFAVCTVDRVVLLYDEHGERRDKFSTKPADMKYGRKSYMVKGMAFSPDSTKIAIGQTDNIIYVYKIGEDWGDKKVICNKFIQTSAVTCLQWPAEYIIVFGLAEGKVRLANTKTNKSSTIYGTESYVVSLTTNCSGKGILSGHADGTIVRYFFDDEGSGESQGKLVNHPCPPYALAWATNSIVAAGCDRKIVAYGKEGHMLQTFDYSRDPQEREFTTAVSSPGGQSVVLGSYDRLRVFNWIPRRSIWEEAKPKEITNLYTITALAWKRDGSRLCVGTLCGGVEQFDCCLRRSIYKNKFELTYVGPSQVIVKNLSSGTRVVLKSHYGYEVEEVKILGKERYLVAHTSETLLLGDLNTNRLSEIAWQGSGGNEKYFFENENVCMIFNAGELTLVEYGNNDTLGSVRTEFMNPHLISVRINERCQRGTEDNKKLAYLIDIKTIAIVDLIGGYNIGTVSHESRVDWLELNETGHKLLFRDRKLRLHLYDIESCSKTMILNFCSYMQWVPGSDVLVAQNRNSLCVWYNIEAPERVTMFTIRGDVIGLERGGGKTEVMVMEGVTTVAYTLDEGLIEFGTAIDDGNYIRATAFLETLEMTPETEAMWKTLSKLALEARQLHIAERCFSALGQVAKARFLHETNEIADQVSREYGGEGTDFYQVRARLAMLEKNYKLAEMIFLEQNAVEEAMGMYQELHRWDECIAVAEAKGHPALEKLRRSYYQWLMDTQQEERAGELQESQGDGLAAISLYLKAGLPAKAARLVLTREELLANTELVEHITAALIKGELYERAGDLFEKIHNPQKALECYRKGNAFMKAVELARLAFPVEVVKLEEAWGDHLVQQKQLDAAINHYIEARCSIKAIEAALGARQWKKAIYILDLQDRNTASKYYPLVAQHYASLQEYEIAEELYTKGDRTKDAIDMYTQAGRWEQAHKLAMKCMRPEDVSVLYITQAQEMEKQGKYREAERLYVTVQEPDLAITMYKKHKLYDDMIRLVGKHHPDLLSDTHLHLGKELEAEGRLQEAEYHYLEAQEWKATVNMYRASGLWEEAYRVARTQGGANAHKHVAYLWAKSLGGEAAVRLLNKLGLLEAAVDHAADNCSFEFAFELSRLALKHKTPEVHLKYAMFLEDEGKFEEAEAEFIRAGKPKEAVLMFVHNQDWEAAQRVAEAHDPDSVAEVLVGQARGALEEKDFQKAEGLLLRAQRPGLALNYYKEAGLWSDALRICKDYVPSQLEALQEEYEREATKKGARGVEGFVEQARHWEQAGEYSRAVDCYLKVRDSGNSGLAEKCWMKAAELSIKFLPPQRNMEVVLAVGPQLIGIGKHSAAAELYLNLDLVKEAIDAFIEGEEWNKAKRVAKELDPRYEDYVDQHYKEFLKNQGKVDSLVGVDVIAALDLYVEQGQWDKCIETATKQNYKILHKYVALYATHLIREGSSAQALALYVQHGAPANPQNFNIYKRIFTDMVSSPGTNCAEAYHSWADLRDVLFNLCENLVKSSEANSPAHEEFKTMLLIAHYYATRSAAQSVKQLETVAARLSVSLLRHTQLLPVDKAFYEAGIAAKAVGWDNMAFIFLNRFLDLTDAIEEGTLDGLDHSDFQDTDIPFEVPLPAKQHVPEAEREEVRDWVLTVSMDQRLEQVLPRDERGAYEASLVAASTGVRALPCLITGYPILRNKIEFKRPGKAANKDNWNKFLMAIKTSHSPVCQDVLKFISQWCGGLPSTSFSFQ,mutated_sequence,1.0,1749.0,NP_056477.1.a2m,NP_056477.1.npy,ClinVar
+NP_056496.1,NP_056496.1.csv,MATASPSVFLLMVNGQVESAQFPEYDDLYCKYCFVYGQDWAPTAGLEEGISQITSKSQDVRQALVWNFPIDVTFKSTNPYGWPQIVLSVYGPDVFGNDVVRGYGAVHVPFSPGRHKRTIPMFVPESTSKLQKFTSWFMGRRPEYTDPKVVAQGEGREVTRVRSQGFVTLLFNVVTKDMRKLGYDTGPSDTQGVLGPSPPQSFPQ,mutated_sequence,1.0,204.0,NP_056496.1.a2m,NP_056496.1.npy,ClinVar
+NP_056528.2,NP_056528.2.csv,MGDPERPEAAGLDQDERSSSDTNESEIKSNEEPLLRKSSRRFVIFPIQYPDIWKMYKQAQASFWTAEEVDLSKDLPHWNKLKADEKYFISHILAFFAASDGIVNENLVERFSQEVQVPEARCFYGFQILIENVHSEMYSLLIDTYIRDPKKREFLFNAIETMPYVKKKADWALRWIADRKSTFGERVVAFAAVEGVFFSGSFAAIFWLKKRGLMPGLTFSNELISRDEGLHCDFACLMFQYLVNKPSEERVREIIVDAVKIEQEFLTEALPVGLIGMNCILMKQYIEFVADRLLVELGFSKVFQAENPFDFMENISLEGKTNFFEKRVSEYQRFAVMAETTDNVFTLDADF,mutated_sequence,1.0,351.0,NP_056528.2.a2m,NP_056528.2.npy,ClinVar
+NP_056993.2,NP_056993.2.csv,MAAPESGPALSPGTAEGEEETILYDLLVNTEWPPETEVQPRGNQKHGASFIITKAIRDRLLFLRQYIWYSPAPFLLPDGLVRLVNKQINWHLVLASNGKLLAAVQDQCVEIRSAKDDFTSIIGKCQVPKDPKPQWRRVAWSYDCTLLAYAESTGTVRVFDLMGSELFVISPASSFIGDLSYAIAGLIFLEYKASAQWSAELLVINYRGELRSYLVSVGTNQSYQESHCFSFSSHYPHGINTAIYHPGHRLLLVGGCETAEVGMSKASSCGLSAWRVLSGSPYYKQVTNGGDGVTAVPKTLGLLRMLSVKFYSRQGQEQDGIFKMSLSPDGMLLAAIHFSGKLSIWAIPSLKQQGEWGQNEQPGYDDLNPDWRLSTEKRKKIKDKESFYPLIDVNWWADSAVTLARCSGALTVSSVKTLKNLLGKSCEWFEPSPQVTATHDGGFLSLECEIKLAPKRSRLETRAGEEDEGEEDSDSDYEISAKARYFGYIKQGLYLVTEMERFAPPRKRPRTITKNYRLVSLRSTTPEELYQRKIESEEYEEALSLAHTYGLDTDLVYQRQWRKSAVNVASIQNYLSKIKKRSWVLHECLERVPENVDAAKELLQYGLKGTDLEALLAIGKGADDGRFTLPGEIDIDSISYEELSPPDEEPAKNKKEKELKKRQELLKLVNFSKLTLEQKELCRCRRKLLTYLDRLATYEEILGVPHASEQRYDAEFFKKFRNQNIVLSARTYAQESNVQALEILFTYHGSDLLPHRLAILSNFPETTSPHEYSVLLPEACFNGDSLMIIPWHEHKHRAKDWCEELACRMVVEPNLQDESEFLYAAQPELLRFRMTQLTVEKVMDWYQTRAEEIEHYARQVDCALSLIRLGMERNIPGLLVLCDNLVTLETLVYEARCDVTLTLKELQQMKDIEKLRLLMNSCSEDKYVTSAYQWMVPFLHRCEKQSPGVANELLKEYLVTLAKGDLKFPLKIFQHSKPDLQQKIIPDQDQLMAIALECIYTCERNDQLCLCYDLLECLPERGYGDKTEATTKLHDMVDQLEQILSVSELLEKHGLEKPISFVKNTQSSSEEARKLMVRLTRHTGRKQPPVSESHWRTLLQDMLTMQQNVYTCLDSDACYEIFTESLLCSSRLENIHLAGQMMHCSACSENPPAGIAHKGKPHYRVSYEKSIDLVLAASREYFNSSTNLTDSCMDLARCCLQLITDRPPAIQEELDLIQAVGCLEEFGVKILPLQVRLCPDRISLIKECISQSPTCYKQSTKLLGLAELLRVAGENPEERRGQVLILLVEQALRFHDYKAASMHCQELMATGYPKSWDVCSQLGQSEGYQDLATRQELMAFALTHCPPSSIELLLAASSSLQTEILYQRVNFQIHHEGGENISASPLTSKAVQEDEVGVPGSNSADLLRWTTATTMKVLSNTTTTTKAVLQAVSDGQWWKKSLTYLRPLQGQKCGGAYQIGTTANEDLEKQGCHPFYESVISNPFVAESEGTYDTYQHVPVESFAEVLLRTGKLAEAKNKGEVFPTTEVLLQLASEALPNDMTLALAYLLALPQVLDANRCFEKQSPSALSLQLAAYYYSLQIYARLAPCFRDKCHPLYRADPKELIKMVTRHVTRHEHEAWPEDLISLTKQLHCYNERLLDFTQAQILQGLRKGVDVQRFTADDQYKRETILGLAETLEESVYSIAISLAQRYSVSRWEVFMTHLEFLFTDSGLSTLEIENRAQDLHLFETLKTDPEAFHQHMVKYIYPTIGGFDHERLQYYFTLLENCGCADLGNCAIKPETHIRLLKKFKVVASGLNYKKLTDENMSPLEALEPVLSSQNILSISKLVPKIPEKDGQMLSPSSLYTIWLQKLFWTGDPHLIKQVPGSSPEWLHAYDVCMKYFDRLHPGDLITVVDAVTFSPKAVTKLSVEARKEMTRKAIKTVKHFIEKPRKRNSEDEAQEAKDSKVTYADTLNHLEKSLAHLETLSHSFILSLKNSEQETLQKYSHLYDLSRSEKEKLHDEAVAICLDGQPLAMIQQLLEVAVGPLDISPKDIVQSAIMKIISALSGGSADLGGPRDPLKVLEGVVAAVHASVDKGEELVSPEDLLEWLRPFCADDAWPVRPRIHVLQILGQSFHLTEEDSKLLVFFRTEAILKASWPQRQVDIADIENEENRYCLFMELLESSHHEAEFQHLVLLLQAWPPMKSEYVITNNPWVRLATVMLTRCTMENKEGLGNEVLKMCRSLYNTKQMLPAEGVKELCLLLLNQSLLLPSLKLLLESRDEHLHEMALEQITAVTTVNDSNCDQELLSLLLDAKLLVKCVSTPFYPRIVDHLLASLQQGRWDAEELGRHLREAGHEAEAGSLLLAVRGTHQAFRTFSTALRAAQHWV,mutated_sequence,1.0,2371.0,NP_056993.2.a2m,NP_056993.2.npy,ClinVar
+NP_056999.2,NP_056999.2.csv,MAKNRRDRNSWGGFSEKTYEWSSEEEEPVKKAGPVQVLIVKDDHSFELDETALNRILLSEAVRDKEVVAVSVAGAFRKGKSFLMDFMLRYMYNQESVDWVGDYNEPLTGFSWRGGSERETTGIQIWSEIFLINKPDGKKVAVLLMDTQGTFDSQSTLRDSATVFALSTMISSIQVYNLSQNVQEDDLQHLQLFTEYGRLAMEETFLKPFQSLIFLVRDWSFPYEFSYGADGGAKFLEKRLKVSGNQHEELQNVRKHIHSCFTNISCFLLPHPGLKVATNPNFDGKLKEIDDEFIKNLKILIPWLLSPESLDIKEINGNKITCRGLVEYFKAYIKIYQGEELPHPKSMLQATAEANNLAAVATAKDTYNKKMEEICGGDKPFLAPNDLQTKHLQLKEESVKLFRGVKKMGGEEFSRRYLQQLESEIDELYIQYIKHNDSKNIFHAARTPATLFVVIFITYVIAGVTGFIGLDIIASLCNMIMGLTLITLCTWAYIRYSGEYRELGAVIDQVAAALWDQGSTNEALYKLYSAAATHRHLYHQAFPTPKSESTEQSEKKKM,mutated_sequence,1.0,558.0,NP_056999.2.a2m,NP_056999.2.npy,ClinVar
+NP_057006.1,NP_057006.1.csv,MEPAVSEPMRDQVARTHLTEDTPKVNADIEKVNQNQAKRCTVIGGSGFLGQHMVEQLLARGYAVNVFDIQQGFDNPQVRFFLGDLCSRQDLYPALKGVNTVFHCASPPPSSNNKELFYRVNYIGTKNVIETCKEAGVQKLILTSSASVIFEGVDIKNGTEDLPYAMKPIDYYTETKILQERAVLGANDPEKNFLTTAIRPHGIFGPRDPQLVPILIEAARNGKMKFVIGNGKNLVDFTFVENVVHGHILAAEQLSRDSTLGGKAFHITNDEPIPFWTFLSRILTGLNYEAPKYHIPYWVAYYLALLLSLLVMVISPVIQLQPTFTPMRVALAGTFHYYSCERAKKAMGYQPLVTMDDAMERTVQSFRHLRRVK,mutated_sequence,1.0,373.0,NP_057006.1.a2m,NP_057006.1.npy,ClinVar
+NP_057088.2,NP_057088.2.csv,MEKELRSTILFNAYKKEIFTTNNGYKSMQKKLRSNWKIQSLKDEITSEKLNGVKLWITAGPREKFTAAEFEILKKYLDTGGDVFVMLGEGGESRFDTNINFLLEEYGIMVNNDAVVRNVYHKYFHPKEALVSSGVLNREISRAAGKAVPGIIDEESSGNNAQALTFVYPFGATLSVMKPAVAVLSTGSVCFPLNRPILAFYHSKNQGGKLAVLGSCHMFSDQYLDKEENSKIMDVVFQWLTTGDIHLNQIDAEDPEISDYMMLPYTATLSKRNRECLQESDEIPRDFTTLFDLSIFQLDTTSFHSVIEAHEQLNVKHEPLQLIQPQFETPLPTLQPAVFPPSFRELPPPPLELFDLDETFSSEKARLAQITNKCTEEDLEFYVRKCGDILGVTSKLPKDQQDAKHILEHVFFQVVEFKKLNQEHDIDTSETAFQNNF,mutated_sequence,1.0,437.0,NP_057088.2.a2m,NP_057088.2.npy,ClinVar
+NP_057119.3,NP_057119.3.csv,MATLLRPVLRRLCGLPGLQRPAAEMPLRARSDGAGPLYSHHLPTSPLQKGLLAAGSAAMALYNPYRHDMVAVLGETTGHRTLKVLRDQMRRDPEGAQILQERPRISTSTLDLGKLQSLPEGSLGREYLRFLDVNRVSPDTRAPTRFVDDEELAYVIQRYREVHDMLHTLLGMPTNILGEIVVKWFEAVQTGLPMCILGAFFGPIRLGAQSLQVLVSELIPWAVQNGRRAPCVLNLYYERRWEQSLRALREELGITAPPMHVQGLA,mutated_sequence,1.0,265.0,NP_057119.3.a2m,NP_057119.3.npy,ClinVar
+NP_057122.2,NP_057122.2.csv,MSIFTPTNQIRLTNVAVVRMKRAGKRFEIACYKNKVVGWRSGVEKDLDEVLQTHSVFVNVSKGQVAKKEDLISAFGTDDQTEICKQILTKGEVQVSDKERHTQLEQMFRDIATIVADKCVNPETKRPYTVILIERAMKDIHYSVKTNKSTKQQALEVIKQLKEKMKIERAHMRLRFILPVNEGKKLKEKLKPLIKVIESEDYGQQLEIVCLIDPGCFREIDELIKKETKGKGSLEVLNLKDVEEGDEKFE,mutated_sequence,1.0,250.0,NP_057122.2.a2m,NP_057122.2.npy,ClinVar
+NP_057206.2,NP_057206.2.csv,MVVSTFTDMDTFPNNFPPGGDSGLTGSQSEFQKMLIDERLRCEHHKANYQTLKAEHTRLQNEHVKLQNELKHLFNEKQTQQEKLQLLLEELRGELVEKTKDLEEMKLQILTPQKLELLRAQIQQELETPMRERFRNLDEEVEKYRAVYNKLRYEHTFLKSEFEHQKEEYARILDEGKIKYESEIARLEEDKEELRNQLLNVDLTKDSKRVEQLAREKVYLCQKLKGLEAEVAELKAEKENSEAQVENAQRIQVRQLAEMQATVRSLEAEKQSANLRAERLEKELQSSSEQNTFLINKLHKAEREINTLSSKVKELKHSNKLEITDIKLETARAKSELERERNKIQSELDGLQSDNEILKAAVEHHKVLLVEKDRELIRKVQAAKEEGYQKLVVLQDEKLELENRLADLEKMKVEHDVWRQSEKDQYEEKLRASQMAEEITRKELQSVRLKLQQQIVTIENAEKEKNENSDLKQQISSLQIQVTSLAQSENDLLNSNQMLKEMVERLKQECRNFRSQAEKAQLEAEKTLEEKQIQWLEEKHKLHERITDREEKYNQAKEKLQRAAIAQKKRKSLHENKLKRLQEKVEVLEAKKEELETENQVLNRQNVPFEDYTRLQKRLKDIQRRHNEFRSLILVPNMPPTASINPVSFQSSAMVPSMELPFPPHMQEEQHQRELSLLRKRLEELETTQRKQLEELGSSGE,mutated_sequence,1.0,701.0,NP_057206.2.a2m,NP_057206.2.npy,ClinVar
+NP_057264.4,NP_057264.4.csv,MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVLLSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEAPLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGLCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKERQQAPGGDPDNSVRGKGMDCATLTCMVQLAQILVGGGLGFLVNTAGTVVVVVITASAVALIGCCFVALFVRYVD,mutated_sequence,1.0,530.0,NP_057264.4.a2m,NP_057264.4.npy,ClinVar
+NP_057287.2,NP_057287.2.csv,MGSAVMDTKKKKDVSSPGGSGGKKNASQKRRSLRVHIPDLSSFAMPLLDGDLEGSGKHSSRKVDSPFGPGSPSKGFFSRGPQPRPSSPMSAPVRPKTSPGSPKTVFPFSYQESPPRSPRRMSFSGIFRSSSKESSPNSNPATSPGGIRFFSRSRKTSGLSSSPSTPTQVTKQHTFPLESYKHEPERLENRIYASSSPPDTGQRFCPSSFQSPTRPPLASPTHYAPSKAAALAAALGPAEAGMLEKLEFEDEAVEDSESGVYMRFMRSHKCYDIVPTSSKLVVFDTTLQVKKAFFALVANGVRAAPLWESKKQSFVGMLTITDFINILHRYYKSPMVQIYELEEHKIETWRELYLQETFKPLVNISPDASLFDAVYSLIKNKIHRLPVIDPISGNALYILTHKRILKFLQLFMSDMPKPAFMKQNLDELGIGTYHNIAFIHPDTPIIKALNIFVERRISALPVVDESGKVVDIYSKFDVINLAAEKTYNNLDITVTQALQHRSQYFEGVVKCNKLEILETIVDRIVRAEVHRLVVVNEADSIVGIISLSDILQALILTPAGAKQKETETE,mutated_sequence,1.0,569.0,NP_057287.2.a2m,NP_057287.2.npy,ClinVar
+NP_057323.3,NP_057323.3.csv,MAKEEDEEKKAKKGKKGKKAPEPEKPKRSLKGTSRLFMGFRDRTPKISKKGQFRSASAFFWGLHTGPQKTKRKRKARTVLKSTSKLMTQMRMGKKKRAMKGKKPSFMVIRFPGRRGYGRLRPRARSLSKASTAINWLTKKFLLKKAEESGSEQATVDAWLQRSSSRMGSRKLPFPSGAEILRPGGRLRRFPRSRSIYASGEPLGFLPFEDEAPFHHSGSRKSLYGLEGFQDLGEYYDYHRDGDDYYDRQSLHRYEEQEPYLAGLGPYSPAWPPYGDHYYGYPPEDPYDYYHPDYYGGPFDPGYTYGYGYDDYEPPYAPPSGYSSPYSYHDGYEGEAHPYGYYLDPYAPYDAPYPPYDLPYHTPYDVPYFDPYGVHYTVPYAEGVYGGGDEAIYPPEVPYFYPEESASAFVYPWVPPPIPSPHNPYAHAMDDIAELEEPEDAGVERQGTSFRLPSAAFFEQQGMDKPARSKLSLIRKFRLFPRPQVKLFGKEKLEVPLPPSLDIPLPLGDADEEEDEEELPPVSAVPYGHPFWGFLTPRQRNLQRALSAFGAHRGLGFGPEFGRPVPRPATSLARFLKKTLSEKKPIARLRGSQKARAGGPAVREAAYKRFGYKLAGMDPEKPGTPIVLRRAQPRARSSNDARRPPAPQPAPRTLSHWSALLSPPVPPRPPSSGPPPAPPLSPALSGLPRPASPYGSLRRHPPPWAAPAHVPPAPQASWWAFVEPPAVSPEVPPDLLAFPGPRPSFRGSRRRGAAFGFPGASPRASRRRAWSPLASPQPSLRSSPGLGYCSPLAPPSPQLSLRTGPFQPPFLPPARRPRSLQESPAPRRAAGRLGPPGSPLPGSPRPPSPPLGLCHSPRRSSLNLPSRLPHTWRRLSEPPTRAVKPQVRLPFHRPPRAGAWRAPLEHRESPREPEDSETPWTVPPLAPSWDVDMPPTQRPPSPWPGGAGSRRGFSRPPPVPENPFLQLLGPVPSPTLQPEDPAADMTRVFLGRHHEPGPGQLTKSAGPTPEKPEEEATLGDPQLPAETKPPTPAPPKDVTPPKDITPPKDVLPEQKTLRPSLSYPLAACDQTRATWPPWHRWGTLPQAAAPLAPIRAPEPLPKGGERRQAAPGRFAVVMPRVQKLSSFQRVGPATLKPQVQPIQDPKPRACSLRWSCLWLRADAYGPWPRVHTHPQSCHLGPGAACLSLRGSWEEVGPPSWRNKMHSIRNLPSMRFREQHGEDGVEDMTQLEDLQETTVLSNLKIRFERNLIYTYIGSILVSVNPYQMFGIYGPEQVQQYNGRALGENPPHLFAVANLAFAKMLDAKQNQCIIISGESGSGKTEATKLILRYLAAMNQKREVMQQIKILEATPLLESFGNAKTVRNDNSSRFGKFVEIFLEGGVISGAITSQYLLEKSRIVFQAKNERNYHIFYELLAGLPAQLRQAFSLQEAETYYYLNQGGNCEIAGKSDADDFRRLLAAMEVLGFSSEDQDSIFRILASILHLGNVYFEKYETDAQEVASVVSAREIQAVAELLQISPEGLQKAITFKVTETMREKIFTPLTVESAVDARDAIAKVLYALLFSWLITRVNALVSPRQDTLSIAILDIYGFEDLSFNSFEQLCINYANENLQYLFNKIVFQEEQEEYIREQIDWQEITFADNQPCINLISLKPYGILRILDDQCCFPQATDHTFLQKCHYHHGANPLYSKPKMPLPEFTIKHYAGKVTYQVHKFLDKNHDQVRQDVLDLFVRSRTRVVAHLFSSHAPQAAPQRLGKSSSVTRLYKAHTVAAKFQQSLLDLVEKMERCNPLFMRCLKPNHKKEPGLFEPDVVMAQLRYSGVLETVRIRKEGFPVRLPFQGFIDRYCCLVALKHDLPANGDMCVSVLSRLCKVMPNMYRVGVSKLFLKEHLYQLLESMREHVLNLAALTLQRCLRGFFIKRRFRSLRHKIILLQSRARGYLARQRYQQMRRSLVKFRSLVHAYVSRRRYLKLRAEWRCQVEGALLWEQEELSKREVVAVGHLEVPAELAGLLQAVAGLGLAQVPQVAPVRTPRLQAEPRVTLPLDINNYPMAKFVQCHFKEPAFGMLTVPLRTPLTQLPAEHHAEAVSIFKLILRFMGDPHLHGARENIFGNYIVQKGLAVPELRDEILAQLANQVWHNHNAHNAERGWLLLAACLSGFAPSPCFNKYLLKFVSDYGRNGFQAVCQHRLMQAMGRAQQQGSGAARTLPPTQLEWTATYEKASMALDVGCFNGDQFSCPVHSWSTGEEVAGDILRHRGLADGWRGWTVAMKNGVQWAELAGHDYVLDLVSDLELLRDFPRQKSYFIVGTEGPAASRGGPKVVFGNSWDSDEDMSTRPQPQEHMPKVLDSDGYSSHNQDGTNGETEAQRGTATHQESDSLGEPAVPHKGLDCYLDSLFDPVLSYGDADLEKPTAIAYRMKGGGQPGGGSSSGTEDTPRRPPEPKPIPGLDASTLALQQAFIHKQAVLLAREMTLQATALQQQPLSAALRSLPAEKPPAPEAQPTSVGTGPPAKPVLLRATPKPLAPAPLAKAPRLPIKPVAAPVLAQDQASPETTSPSPELVRYSTLNSEHFPQPTQQIKNIVRQYQQPFRGGRPEALRKDGGKVFMKRPDPHEEALMILKGQMTHLAAAPGTQVSREAVALVKPVTSAPRPSMAPTSALPSRSLEPPEELTQTRLHRLINPNFYGYQDAPWKIFLRKEVFYPKDSYSHPVQLDLLFRQILHDTLSEACLRISEDERLRMKALFAQNQLDTQKPLVTESVKRAVVSTARDTWEVYFSRIFPATGSVGTGVQLLAVSHVGIKLLRMVKGGQEAGGQLRVLRAYSFADILFVTMPSQNMLEFNLASEKVILFSARAHQVKTLVDDFILELKKDSDYVVAVRNFLPEDPALLAFHKGDIIHLQPLEPPRVGYSAGCVVRRKVVYLEELRRRGPDFGWRFGTIHGRVGRFPSELVQPAAAPDFLQLPTEPGRGRAAAVAAAVASAAAAQEVGRRREGPPVRARSADHGEDALALPPYTMLEFAQKYFRDPQRRPQDGLRLKSKEPRESRTLEDMLCFTKTPLQESLIELSDSSLSKMATDMFLAVMRFMGDAPLKGQSDLDVLCNLLKLCGDHEVMRDECYCQVVKQITDNTSSKQDSCQRGWRLLYIVTAYHSCSEVLHPHLTRFLQDVSRTPGLPFQGIAKACEQNLQKTLRFGGRLELPSSIELRAMLAGRSSKRQLFLLPGGLERHLKIKTCTVALDVVEEICAEMALTRPEAFNEYVIFVVTNRGQHVCPLSRRAYILDVASEMEQVDGGYMLWFRRVLWDQPLKFENELYVTMHYNQVLPDYLKGLFSSVPASRPSEQLLQQVSKLASLQHRAKDHFYLPSVREVQEYIPAQLYRTTAGSTWLNLVSQHRQQTQALSPHQARAQFLGLLSALPMFGSSFFFIQSCSNIAVPAPCILAINHNGLNFLSTETHELMVKFPLKEIQSTRTQRPTANSSYPYVEIALGDVAAQRTLQLQLEQGLELCRVVAVHVENLLSAHEKRLTLPPSEITLL,mutated_sequence,1.0,3530.0,NP_057323.3.a2m,NP_057323.3.npy,ClinVar
+NP_057353.1,NP_057353.1.csv,MPQLSGGGGGGGGDPELCATDEMIPFKDEGDPQKEKIFAEISHPEEEGDLADIKSSLVNESEIIPASNGHEVARQAQTSQEPYHDKAREHPDDGKHPDGGLYNKGPSYSSYSGYIMMPNMNNDPYMSNGSLSPPIPRTSNKVPVVQPSHAVHPLTPLITYSDEHFSPGSHPSHIPSDVNSKQGMSRHPPAPDIPTFYPLSPGGVGQITPPLGWQGQPVYPITGGFRQPYPSSLSVDTSMSRFSHHMIPGPPGPHTTGIPHPAIVTPQVKQEHPHTDSDLMHVKPQHEQRKEQEPKRPHIKKPLNAFMLYMKEMRANVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKKKRKREKLQESASGTGPRMTAAYI,mutated_sequence,1.0,399.0,NP_057353.1.a2m,NP_057353.1.npy,ClinVar
+NP_057425.3,NP_057425.3.csv,MTSEEMTASVLIPVTQRKVVSAQSAADESSEKVSDINISKAHTVRRSGETSHTISQLNKLKEEPSGSNLPKILSIAREKIVSDENSNEKCWEKIMPDSAKNLNINCNNILRNHQHGLPQRQFYEMYNSVAEEDLCLETGIPSPLERKVFPGIQLELDRPSMGISPLGNQSVIIETGRAHPDSRRAVFHFHYEVDRRMSDTFCTLSENLILDDCGNCVPLPGGEEKQKKNYVAYTCKLMELAKNCDNKNEQLQCDHCDTLNDKYFCFEGSCEKVDMVYSGDSFCRKDFTDSQAAKTFLSHFEDFPDNCDDVEEDAFKSKKERSTLLVRRFCKNDREVKKSVYTGTRAIVRTLPSGHIGLTAWSYIDQKRNGPLLPCGRVMEPPSTVEIRQDGSQRLSEAQWYPIYNAVRREETENTVGSLLHFLTKLPASETAHGRISVGPCLKQCVRDTVCEYRATLQRTSISQYITGSLLEATTSLGARSGLLSTFGGSTGRMMLKERQPGPSVANSNALPSSSAGISKELIDLQPLIQFPEEVASILMEQEQTIYRRVLPVDYLCFLTRDLGTPECQSSLPCLKASISASILTTQNGEHNALEDLVMRFNEVSSWVTWLILTAGSMEEKREVFSYLVHVAKCCWNMGNYNAVMEFLAGLRSRKVLKMWQFMDQSDIETMRSLKDAMAQHESSCEYRKVVTRALHIPGCKVVPFCGVFLKELCEVLDGASGLMKLCPRYNSQEETLEFVADYSGQDNFLQRVGQNGLKNSEKESTVNSIFQVIRSCNRSLETDEEDSPSEGNSSRKSSLKDKSRWQFIIGDLLDSDNDIFEQSKEYDSHGSEDSQKAFDHGTELIPWYVLSIQADVHQFLLQGATVIHYDQDTHLSARCFLQLQPDNSTLTWVKPTTASPASSKAKLGVLNNTAEPGKFPLLGNAGLSSLTEGVLDLFAVKAVYMGHPGIDIHTVCVQNKLGSMFLSETGVTLLYGLQTTDNRLLHFVAPKHTAKMLFSGLLELTRAVRKMRKFPDQRQQWLRKQYVSLYQEDGRYEGPTLAHAVELFGGRRWSARNPSPGTSAKNAEKPNMQRNNTLGISTTKKKKKILMRGESGEVTDDEMATRKAKMHKECRSRSGSDPQDINEQEESEVNAIANPPNPLPSRRAHSLTTAGSPNLAAGTSSPIRPVSSPVLSSSNKSPSSAWSSSSWHGRIKGGMKGFQSFMVSDSNMSFVEFVELFKSFSVRSRKDLKDLFDVYAVPCNRSGSESAPLYTNLTIDENTSDLQPDLDLLTRNVSDLGLFIKSKQQLSDNQRQISDAIAAASIVTNGTGIESTSLGIFGVGILQLNDFLVNCQGEHCTYDEILSIIQKFEPSISMCHQGLMSFEGFARFLMDKENFASKNDESQENIKELQLPLSYYYIESSHNTYLTGHQLKGESSVELYSQVLLQGCRSVELDCWDGDDGMPIIYHGHTLTTKIPFKEVVEAIDRSAFINSDLPIIISIENHCSLPQQRKMAEIFKTVFGEKLVTKFLFETDFSDDPMLPSPDQLRKKVLLKNKKLKAHQTPVDILKQKAHQLASMQVQAYNGGNANPRPANNEEEEDEEDEYDYDYESLSDDNILEDRPENKSCNDKLQFEYNEEIPKRIKKADNSACNKGKVYDMELGEEFYLDQNKKESRQIAPELSDLVIYCQAVKFPGLSTLNASGSSRGKERKSRKSIFGNNPGRMSPGETASFNKTSGKSSCEGIRQTWEESSSPLNPTTSLSAIIRTPKCYHISSLNENAAKRLCRRYSQKLTQHTACQLLRTYPAATRIDSSNPNPLMFWLHGIQLVALNYQTDDLPLHLNAAMFEANGGCGYVLKPPVLWDKNCPMYQKFSPLERDLDSMDPAVYSLTIVSGQNVCPSNSMGSPCIEVDVLGMPLDSCHFRTKPIHRNTLNPMWNEQFLFHVHFEDLVFLRFAVVENNSSAVTAQRIIPLKALKRGYRHLQLRNLHNEVLEISSLFINSRRMEENSSGNTMSASSMFNTEERKCLQTHRVTVHGVPGPEPFTVFTINGGTKAKQLLQQILTNEQDIKPVTTDYFLMEEKYFISKEKNECRKQPFQRAIGPEEEIMQILSSWFPEEGYMGRIVLKTQQENLEEKNIVQDDKEVILSSEEESFFVQVHDVSPEQPRTVIKAPRVSTAQDVIQQTLCKAKYSYSILSNPNPSDYVLLEEVVKDTTNKKTTTPKSSQRVLLDQECVFQAQSKWKGAGKFILKLKEQVQASREDKKKGISFASELKKLTKSTKQPRGLTSPSQLLTSESIQTKEEKPVGGLSSSDTMDYRQ,mutated_sequence,1.0,2302.0,NP_057425.3.a2m,NP_057425.3.npy,ClinVar
+NP_057490.2,NP_057490.2.csv,MADEATRRVVSEIPVLKTNAGPRDRELWVQRLKEEYQSLIRYVENNKNADNDWFRLESNKEGTRWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALGLGPWLAVEIPDLIQKGVIQHKEKCNQ,mutated_sequence,1.0,167.0,NP_057490.2.a2m,NP_057490.2.npy,ClinVar
+NP_057501.2,NP_057501.2.csv,MSGSLGRAAAALLRWGRGAGGGGLWGPGVRAAGSGAGGGGSAEQLDALVKKDKVVVFLKGTPEQPQCGFSNAVVQILRLHGVRDYAAYNVLDDPELRQGIKDYSNWPTIPQVYLNGEFVGGCDILLQMHQNGDLVEELKKLGIHSALLDEKKDQDSK,mutated_sequence,1.0,157.0,NP_057501.2.a2m,NP_057501.2.npy,ClinVar
+NP_057688.3,NP_057688.3.csv,MADAAASPVGKRLLLLFADTAASASASAPAAAAASGDPGPALRTRAWRAGTVRAMSGAVPQDLAIFVEFDGCNWKQHSWVKVHAEEVIVLLLEGSLVWAPREDPVLLQGIRVSIAQWPALTFTPLVDKLGLGSVVPVEYLLDRELRFLSDANGLHLFQMGTDSQNQILLEHAALRETVNALISDQKLQEIFSRGPYSVQGHRVKIYQPEGEEGWLYGVVSHQDSITRLMEVSVTESGEIKSVDPRLIHVMLMDNSAPQSEGGTLKAVKSSKGKKKRESIEGKDGRRRKSASDSGCDPASKKLKGDRGEVDSNGSDGGEASRGPWKGGNASGEPGLDQRAKQPPSTFVPQINRNIRFATYTKENGRTLVVQDEPVGGDTPASFTPYSTATGQTPLAPEVGGAENKEAGKTLEQVGQGIVASAAVVTTASSTPNTVRISDTGLAAGTVPEKQKGSRSQASGENSRNSILASSGFGAPLPSSSQPLTFGSGRSQSNGVLATENKPLGFSFGCSSAQEAQKDTDLSKNLFFQCMSQTLPTSNYFTTVSESLADDSSSRDSFKQSLESLSSGLCKGRSVLGTDTKPGSKAGSSVDRKVPAESMPTLTPAFPRSLLNARTPENHENLFLQPPKLSREEPSNPFLAFVEKVEHSPFSSFASQASGSSSSATTVTSKVAPSWPESHSSADSASLAKKKPLFITTDSSKLVSGVLGSALTSGGPSLSAMGNGRSSSPTSSLTQPIEMPTLSSSPTEERPTVGPGQQDNPLLKTFSNVFGRHSGGFLSSPADFSQENKAPFEAVKRFSLDERSLACRQDSDSSTNSDLSDLSDSEEQLQAKTGLKGIPEHLMGKLGPNGERSAELLLGKSKGKQAPKGRPRTAPLKVGQSVLKDVSKVKKLKQSGEPFLQDGSCINVAPHLHKCRECRLERYRKFKEQEQDDSTVACRFFHFRRLIFTRKGVLRVEGFLSPQQSDPDAMNLWIPSSSLAEGIDLETSKYILANVGDQFCQLVMSEKEAMMMVEPHQKVAWKRAVRGVREMCDVCETTLFNIHWVCRKCGFGVCLDCYRLRKSRPRSETEEMGDEEVFSWLKCAKGQSHEPENLMPTQIIPGTALYNIGDMVHAARGKWGIKANCPCISRQNKSVLRPAVTNGMSQLPSINPSASSGNETTFSGGGGPAPVTTPEPDHVPKADSTDIRSEEPLKTDSSASNSNSELKAIRPPCPDTAPPSSALHWLADLATQKAKEETKEAGSLRSVLNKESHSPFGLDSFNSTAKVSPLTPKLFNSLLLGPTASNNKTEGSSLRDLLHSGPGKLPQTPLDTGIPFPPVFSTSSAGVKSKASLPNFLDHIIASVVENKKTSDASKRACNLTDTQKEVKEMVMGLNVLDPHTSHSWLCDGRLLCLHDPSNKNNWKIFRECWKQGQPVLVSGVHKKLKSELWKPEAFSQEFGDQDVDLVNCRNCAIISDVKVRDFWDGFEIICKRLRSEDGQPMVLKLKDWPPGEDFRDMMPTRFEDLMENLPLPEYTKRDGRLNLASRLPSYFVRPDLGPKMYNAYGLITAEDRRVGTTNLHLDVSDAVNVMVYVGIPIGEGAHDEEVLKTIDEGDADEVTKQRIHDGKEKPGALWHIYAAKDAEKIRELLRKVGEEQGQENPPDHDPIHDQSWYLDQTLRKRLYEEYGVQGWAIVQFLGDAVFIPAGAPHQVHNLYSCIKVAEDFVSPEHVKHCFRLTQEFRHLSNTHTNHEDKLQVKNIIYHAVKDAVGTLKAHESKLARS,mutated_sequence,1.0,1761.0,NP_057688.3.a2m,NP_057688.3.npy,ClinVar
+NP_057737.2,NP_057737.2.csv,MSSLGASFVQIKFDDLQFFENCGGGSFGSVYRAKWISQDKEVAVKKLLKIEKEAEILSVLSHRNIIQFYGVILEPPNYGIVTEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHYLHMEAPVKVIHRDLKSRNVVIAADGVLKICDFGASRFHNHTTHMSLVGTFPWMAPEVIQSLPVSETCDTYSYGVVLWEMLTREVPFKGLEGLQVAWLVVEKNERLTIPSSCPRSFAELLHQCWEADAKKRPSFKQIISILESMSNDTSLPDKCNSFLHNKAEWRCEIEATLERLKKLERDLSFKEQELKERERRLKMWEQKLTEQSNTPLLPSFEIGAWTEDDVYCWVQQLVRKGDSSAEMSVYASLFKENNITGKRLLLLEEEDLKDMGIVSKGHIIHFKSAIEKLTHDYINLFHFPPLIKDSGGEPEENEEKIVNLELVFGFHLKPGTGPQDCKWKMYMEMDGDEIAITYIKDVTFNTNLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRTPKSTKHVHSIQWSRTKPQDEVKAVQLAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQIASNTSLQRSQSNPILGSPFFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGLTKNFSSLHLNSRDSGFSSGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQSALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSKVSEGGWTKVEYRKKPHRPSPAKTNKERARGDHRGWRNF,mutated_sequence,1.0,800.0,NP_057737.2.a2m,NP_057737.2.npy,ClinVar
+NP_059830.1,NP_059830.1.csv,MPYEIKKVFASLPQVERGVSKIIGGDPKGNNFLYTNGKCVILRNIDNPALADIYTEHAHQVVVAKYAPSGFYIASGDVSGKLRIWDTTQKEHLLKYEYQPFAGKIKDIAWTEDSKRIAVVGEGREKFGAVFLWDSGSSVGEITGHNKVINSVDIKQSRPYRLATGSDDNCAAFFEGPPFKFKFTIGDHSRFVNCVRFSPDGNRFATASADGQIYIYDGKTGEKVCALGGSKAHDGGIYAISWSPDSTHLLSASGDKTSKIWDVSVNSVVSTFPMGSTVLDQQLGCLWQKDHLLSVSLSGYINYLDRNNPSKPLHVIKGHSKSIQCLTVHKNGGKSYIYSGSHDGHINYWDSETGENDSFAGKGHTNQVSRMTVDESGQLISCSMDDTVRYTSLMLRDYSGQGVVKLDVQPKCVAVGPGGYAVVVCIGQIVLLKDQRKCFSIDNPGYEPEVVAVHPGGDTVAIGGVDGNVRLYSILGTTLKDEGKLLEAKGPVTDVAYSHDGAFLAVCDASKVVTVFSVADGYSENNVFYGHHAKIVCLAWSPDNEHFASGGMDMMVYVWTLSDPETRVKIQDAHRLHHVSSLAWLDEHTLVTTSHDASVKEWTITY,mutated_sequence,1.0,606.0,NP_059830.1.a2m,NP_059830.1.npy,ClinVar
+NP_060083.1,NP_060083.1.csv,MALSVPGYSPGFRKPPEVVRLRRKRARSRGAAASPPRELTEPAARRAALVAGLPLRPFPAAGGRGGGSGGGPAAARRNPFARLDNRPRVAAEPPDGPAREQPEAPVPFLDSNQENDLLWEEKFPERTTVTELPQTSHVSFSEPDIPSSKSTELPVDWSIKTRLLFTSSQPFTWADHLKAQEEAQGLVQHCRATEVTLPKSIQDPKLSSELRCTFQQSLIYWLHPALSWLPLFPRIGADRKMAGKTSPWSNDATLQHVLMSDWSVSFTSLYNLLKTKLCPYFYVCTYQFTVLFRAAGLAGSDLITALISPTTRGLREAMRNEGIEFSLPLIKESGHKKETASGTSLGYGEEQAISDEDEEESFSWLEEMGVQDKIKKPDILSIKLRKEKHEVQMDHRPESVVLVKGINTFTLLNFLINSKSLVATSGPQAGLPPTLLSPVAFRGATMQMLKARSVNVKTQALSGYRDQFSLEITGPIMPHSLHSLTMLLKSSQSGSFSAVLYPHEPTAVFNICLQMDKVLDMEVVHKELTNCGLHPNTLEQLSQIPLLGKSSLRNVVLRDYIYNWRS,mutated_sequence,1.0,566.0,NP_060083.1.a2m,NP_060083.1.npy,ClinVar
+NP_060087.3,NP_060087.3.csv,MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQDPNPCLSTPCKNAGTCHVVDRRGVADYACSCALGFSGPLCLTPLDNACLTNPCRNGGTCDLLTLTEYKCRCPPGWSGKSCQQADPCASNPCANGGQCLPFEASYICHCPPSFHGPTCRQDVNECGQKPGLCRHGGTCHNEVGSYRCVCRATHTGPNCERPYVPCSPSPCQNGGTCRPTGDVTHECACLPGFTGQNCEENIDDCPGNNCKNGGACVDGVNTYNCRCPPEWTGQYCTEDVDECQLMPNACQNGGTCHNTHGGYNCVCVNGWTGEDCSENIDDCASAACFHGATCHDRVASFYCECPHGRTGLLCHLNDACISNPCNEGSNCDTNPVNGKAICTCPSGYTGPACSQDVDECSLGANPCEHAGKCINTLGSFECQCLQGYTGPRCEIDVNECVSNPCQNDATCLDQIGEFQCICMPGYEGVHCEVNTDECASSPCLHNGRCLDKINEFQCECPTGFTGHLCQYDVDECASTPCKNGAKCLDGPNTYTCVCTEGYTGTHCEVDIDECDPDPCHYGSCKDGVATFTCLCRPGYTGHHCETNINECSSQPCRHGGTCQDRDNAYLCFCLKGTTGPNCEINLDDCASSPCDSGTCLDKIDGYECACEPGYTGSMCNINIDECAGNPCHNGGTCEDGINGFTCRCPEGYHDPTCLSEVNECNSNPCVHGACRDSLNGYKCDCDPGWSGTNCDINNNECESNPCVNGGTCKDMTSGYVCTCREGFSGPNCQTNINECASNPCLNQGTCIDDVAGYKCNCLLPYTGATCEVVLAPCAPSPCRNGGECRQSEDYESFSCVCPTGWQGQTCEVDINECVLSPCRHGASCQNTHGGYRCHCQAGYSGRNCETDIDDCRPNPCHNGGSCTDGINTAFCDCLPGFRGTFCEEDINECASDPCRNGANCTDCVDSYTCTCPAGFSGIHCENNTPDCTESSCFNGGTCVDGINSFTCLCPPGFTGSYCQHDVNECDSQPCLHGGTCQDGCGSYRCTCPQGYTGPNCQNLVHWCDSSPCKNGGKCWQTHTQYRCECPSGWTGLYCDVPSVSCEVAAQRQGVDVARLCQHGGLCVDAGNTHHCRCQAGYTGSYCEDLVDECSPSPCQNGATCTDYLGGYSCKCVAGYHGVNCSEEIDECLSHPCQNGGTCLDLPNTYKCSCPRGTQGVHCEINVDDCNPPVDPVSRSPKCFNNGTCVDQVGGYSCTCPPGFVGERCEGDVNECLSNPCDARGTQNCVQRVNDFHCECRAGHTGRRCESVINGCKGKPCKNGGTCAVASNTARGFICKCPAGFEGATCENDARTCGSLRCLNGGTCISGPRSPTCLCLGPFTGPECQFPASSPCLGGNPCYNQGTCEPTSESPFYRCLCPAKFNGLLCHILDYSFGGGAGRDIPPPLIEEACELPECQEDAGNKVCSLQCNNHACGWDGGDCSLNFNDPWKNCTQSLQCWKYFSDGHCDSQCNSAGCLFDGFDCQRAEGQCNPLYDQYCKDHFSDGHCDQGCNSAECEWDGLDCAEHVPERLAAGTLVVVVLMPPEQLRNSSFHFLRELSRVLHTNVVFKRDAHGQQMIFPYYGREEELRKHPIKRAAEGWAAPDALLGQVKASLLPGGSEGGRRRRELDPMDVRGSIVYLEIDNRQCVQASSQCFQSATDVAAFLGALASLGSLNIPYKIEAVQSETVEPPPPAQLHFMYVAAAAFVLLFFVGCGVLLSRKRRRQHGQLWFPEGFKVSEASKKKRREPLGEDSVGLKPLKNASDGALMDDNQNEWGDEDLETKKFRFEEPVVLPDLDDQTDHRQWTQQHLDAADLRMSAMAPTPPQGEVDADCMDVNVRGPDGFTPLMIASCSGGGLETGNSEEEEDAPAVISDFIYQGASLHNQTDRTGETALHLAARYSRSDAAKRLLEASADANIQDNMGRTPLHAAVSADAQGVFQILIRNRATDLDARMHDGTTPLILAARLAVEGMLEDLINSHADVNAVDDLGKSALHWAAAVNNVDAAVVLLKNGANKDMQNNREETPLFLAAREGSYETAKVLLDHFANRDITDHMDRLPRDIAQERMHHDIVRLLDEYNLVRSPQLHGAPLGGTPTLSPPLCSPNGYLGSLKPGVQGKKVRKPSSKGLACGSKEAKDLKARRKKSQDGKGCLLDSSGMLSPVDSLESPHGYLSDVASPPLLPSPFQQSPSVPLNHLPGMPDTHLGIGHLNVAAKPEMAALGGGGRLAFETGPPRLSHLPVASGTSTVLGSSSGGALNFTVGGSTSLNGQCEWLSRLQSGMVPNQYNPLRGSVAPGPLSTQAPSLQHGMVGPLHSSLAASALSQMMSYQGLPSTRLATQPHLVQTQQVQPQNLQMQQQNLQPANIQQQQSLQPPPPPPQPHLGVSSAASGHLGRSFLSGEPSQADVQPLGPSSLAVHTILPQESPALPTSLPSSLVPPVTAAQFLTPPSQHSYSSPVDNTPSHQLQVPEHPFLTPSPESPDQWSSSSPHSNVSDWSEGVSSPPTSMQSQIARIPEAFK,mutated_sequence,1.0,2555.0,NP_060087.3.a2m,NP_060087.3.npy,ClinVar
+NP_060119.3,NP_060119.3.csv,MIGCGACEPKVKMAGGQAAAALPTWKMAARRSLSARGRGILQAAAGRLLPLLLLSCCCGAGGCAAVGENEETVIIGLRLEDTNDVSFMEGGALRVSERTRVKLRVYGQNINNETWSRIAFTEHERRRHSPGERGLGGPAPPEPDSGPQRCGIRTSDIIILPHIILNRRTSGIIEIEIKPLRKMEKSKSYYLCTSLSTPALGAGGSGSTGGAVGGKGGSGVAGLPPPPWAETTWIYHDGEDTKMIVGEEKKFLLPFWLQVIFISLLLCLSGMFSGLNLGLMALDPMELRIVQNCGTEKEKNYAKRIEPVRRQGNYLLCSLLLGNVLVNTTLTILLDDIAGSGLVAVVVSTIGIVIFGEIVPQAICSRHGLAVGANTIFLTKFFMMMTFPASYPVSKLLDCVLGQEIGTVYNREKLLEMLRVTDPYNDLVKEELNIIQGALELRTKTVEDVMTPLRDCFMITGEAILDFNTMSEIMESGYTRIPVFEGERSNIVDLLFVKDLAFVDPDDCTPLKTITKFYNHPLHFVFNDTKLDAMLEEFKKGKSHLAIVQRVNNEGEGDPFYEVLGIVTLEDVIEEIIKSEILDETDLYTDNRTKKKVAHRERKQDFSAFKQTDSEMKVKISPQLLLAMHRFLATEVEAFSPSQMSEKILLRLLKHPNVIQELKYDEKNKKAPEYYLYQRNKPVDYFVLILQGKVEVEAGKEGMKFEASAFSYYGVMALTASPVPLSLSRTFVVSRTELLAAGSPGENKSPPRPCGLNHSDSLSRSDRIDAVTPTLGSSNNQLNSSLLQVYIPDYSVRALSDLQFVKISRQQYQNALMASRMDKTPQSSDSENTKIELTLTELHDGLPDETANLLNEQNCVTHSKANHSLHNEGAI,mutated_sequence,1.0,875.0,NP_060119.3.a2m,NP_060119.3.npy,ClinVar
+NP_060137.2,NP_060137.2.csv,MQKIKSLMTRQGLKSPQESLSDLGAIESLRVPGKEEFRELREQPSDPQAEQELINSIEQVYFSVDSFDIVKYELEKLPPVLNLQELEAYRDKLKQQQAAVSKKVADLILEKQPAYVKELERVTSLQTGLQLAAVICTNGRRHLNIAKEGFTQASLGLLANQRKRQLLIGLLKSLRTIKTLQRTDVRLSEMLEEEDYPGAIQLCLECQKAASTFKHYSCISELNSKLQDTLEQIEEQLDVALSKICKNFDINHYTKVQQAYRLLGKTQTAMDQLHMHFTQAIHNTVFQVVLGYVELCAGNTDTKFQKLQYKDLCTHVTPDSYIPCLADLCKALWEVMLSYYRTMEWHEKHDNEDTASASEGSNMIGTEETNFDRGYIKKKLEHGLTRIWQDVQLKVKTYLLGTDLSIFKYDDFIFVLDIISRLMQVGEEFCGSKSEVLQESIRKQSVNYFKNYHRTRLDELRMFLENETWELCPVKSNFSILQLHEFKFMEQSRSPSVSPSKQPVSTSSKTVTLFEQYCSGGNPFEIQANHKDEETEDVLASNGYESDEQEKSAYQEYDSDSDVPEELKRDYVDEQTGDGPVKSVSRETLKSRKKSDYSLNKVNAPILTNTTLNVIRLVGKYMQMMNILKPIAFDVIHFMSQLFDYYLYAIYTFFGRNDSLESTGLGLSSSRLRTTLNRIQESLIDLEVSADPTATLTAAEERKEKVPSPHLSHLVVLTSGDTLYGLAERVVATESLVFLAEQFEFLQPHLDAVMPAVKKPFLQQFYSQTVSTASELRKPIYWIVAGKALDYEQMLLLMANVKWDVKEIMSQHNIYVDALLKEFEQFNRRLNEVSKRVRIPLPVSNILWEHCIRLANRTIVEGYANVKKCSNEGRALMQLDFQQFLMKLEKLTDIRPIPDKEFVETYIKAYYLTENDMERWIKEHREYSTKQLTNLVNVCLGSHINKKARQKLLAAIDDIDRPKR,mutated_sequence,1.0,964.0,NP_060137.2.a2m,NP_060137.2.npy,ClinVar
+NP_060141.3,NP_060141.3.csv,MLSSTDFTFASWELVVRVDHPNEEQQKDVTLRVSGDLHVGGVMLKLVEQINISQDWSDFALWWEQKHCWLLKTHWTLDKYGVQADAKLLFTPQHKMLRLRLPNLKMVRLRVSFSAVVFKAVSDICKILNIRRSEELSLLKPSGDYFKKKKKKDKNNKEPIIEDILNLESSPTASGSSVSPGLYSKTMTPIYDPINGTPASSTMTWFSDSPLTEQNCSILAFSQPPQSPEALADMYQPRSLVDKAKLNAGWLDSSRSLMEQGIQEDEQLLLRFKYYSFFDLNPKYDAVRINQLYEQARWAILLEEIDCTEEEMLIFAALQYHISKLSLSAETQDFAGESEVDEIEAALSNLEVTLEGGKADSLLEDITDIPKLADNLKLFRPKKLLPKAFKQYWFIFKDTSIAYFKNKELEQGEPLEKLNLRGCEVVPDVNVAGRKFGIKLLIPVADGMNEMYLRCDHENQYAQWMAACMLASKGKTMADSSYQPEVLNILSFLRMKNRNSASQVASSLENMDMNPECFVSPRCAKRHKSKQLAARILEAHQNVAQMPLVEAKLRFIQAWQSLPEFGLTYYLVRFKGSKKDDILGVSYNRLIKIDAATGIPVTTWRFTNIKQWNVNWETRQVVIEFDQNVFTAFTCLSADCKIVHEYIGGYIFLSTRSKDQNETLDEDLFHKLTGGQD,mutated_sequence,1.0,677.0,NP_060141.3.a2m,NP_060141.3.npy,ClinVar
+NP_060206.2,NP_060206.2.csv,MAAPAQQTTQPGGGKRKGKAQYVLAKRARRCDAGGPRQLEPGLQGILITCNMNERKCVEEAYSLLNEYGDDMYGPEKFTDKDQQPSGSEGEDDDAEAALKKEVGDIKASTEMRLRRFQSVESGANNVVFIRTLGIEPEKLVHHILQDMYKTKKKKTRVILRMLPISGTCKAFLEDMKKYAETFLEPWFKAPNKGTFQIVYKSRNNSHVNREEVIRELAGIVCTLNSENKVDLTNPQYTVVVEIIKAVCCLSVVKDYMLFRKYNLQEVVKSPKDPSQLNSKQGNGKEAKLESADKSDQNNTAEGKNNQQVPENTEELGQTKPTSNPQVVNEGGAKPELASQATEGSKSNENDFS,mutated_sequence,1.0,353.0,NP_060206.2.a2m,NP_060206.2.npy,ClinVar
+NP_060227.2,NP_060227.2.csv,MMDSENKPENDEDEKINKEAQDLTKLSSHNEDGGPVSDVIASFPENSMGKRGFSESSNSDSVVIGEDRNKHASKRRKLDEAEPLKSGKQGICRLETSESSVTEGGIALDETGKETFLSDCTVGGTCLPNALSPSCNFSTIDVVSLKTDTEKTSAQEMVSLDLERESPFPPKEISVSCTIGNVDTVLKCSICGHLFSSCSDLEKHAESHMQQPKEHTCCHCSHKAESSSALHMHIKQAHGPQKVFSCDLCGFQCSEENLLNAHYLGKTHLRRQNLAARGGFVQILTKQPFPKKSRTMATKNVHSKPRTSKSIAKNSDSKGLRNVGSTFKDFRGSISKQSGSSSELLVEMMPSRNTLSQEVEIVEEHVTSLGLAQNPENQSRKLDTLVTSEGLLEKLESTKNTLQAAHGNSVTSRPRPERNILVLGNSFRRRSSTFTLKGQAKKRFNLLGIKRGTSETQRMYMKHLRTQMKTHDAESVLKHLEACSSVQRVCVTTSETQEAEQGQGSARPPDSGLHSLTVKPASGSQTLCACTDCGQVATNRTDLEIHVKRCHAREMKFYCRTCDFSSMSRRDLDEHLHSNQHQQTASVLSCQCCSFISLDEINLRDHMKEKHNMHFLCTPCNLFFLSEKDVEEHKATEKHINSLVQPKTLQSSNSDLVLQTLPLSTLESENAKESMDDSGKASQEEPLKSRVSHGNEVRHSSKPQFQCKKCFYKTRSSTVLTRHIKLRHGQDYHFLCKACNLYSLSKEGMEKHIKRSKHLENAKKNNIGLSFEECIERVCIGANDKKEEFDVSGNGRIEGHIGVQLQEHSYLEKGMLASEELSQSGGSTKDDELASTTTPKRGRPKGNISRTCSHCGLLASSITNLTVHIRRKHSHQYSYLCKVCKYYTVTKGDMERHCATKKHKGRVEIEASGKHSSDIIVGPEGGSLEAGKKNAGSAVTMSDEHANKPAESPTSVLEKPDRGNSIEAEVENVFHSLDGEVNSHLLDKKEQISSEPEDFAQPGDVYSQRDVTGTGENKCLHCEFSAHSSASLELHVKRKHTKEFEFYCMACDYYAVTRREMTRHAATEKHKMKRQSYLNSANVEAGSADMSKNIIMPEEEHQQNSEEFQIISGQPSDTLKSRNAADCSILNENTNLDMSKVLCAADSVEVETEEESNFNEDHSFCETFQQAPVKDKVRKPEEMMSLTMSSNYGSPSRFQNENSGSSALNCETAKKNHEISNDAGELRVHCEGEGGNAGDGGGVVPHRHLCPVTLDGERSAESPVLVVTRITREQGNLESGGQNRVARGHGLEDLKGVQEDPVLGNKEILMNSQHETEFILEEDGPASDSTVESSDVYETIISIDDKGQAMYSFGRFDSSIIRIKNPEDGELIDQSEEGLIATGVRISELPLKDCAQGVKKKKSEGSSIGESTRIRCDDCGFLADGLSGLNVHIAMKHPTKEKHFHCLLCGKSFYTESNLHQHLASAGHMRNEQASVEELPEGGATFKCVKCTEPFDSEQNLFLHIKGQHEELLREVNKYIVEDTEQINREREENQGNVCKYCGKMCRSSNSMAFLAHIRTHTGSKPFKCKICHFATAQLGDARNHVKRHLGMREYKCHVCGVAFVMKKHLNTHLLGKHGVGTPKERKFTCHLCDRSFTEKWALNNHMKLHTGEKPFKCTWPTCHYSFLTASAMKDHYRTHTGEKSFLCDLCGFAGGTRHALTKHRRQHTGEKPFKCDECNFASTTQSHLTRHKRVHTGEKPYRCPWCDYRSNCAENIRKHILHTGKHEGVKMYNCPKCDYGTNVPVEFRNHLKEQHPDIENPDLAYLHAGIVSKSYECRLKGQGATFVETDSPFTAAALAEEPLVKEKPLRSSRRPAPPPEQVQQVIIFQGYDGEFALDPSVEETAAATLQTLAMAGQVARVVHITEDGQVIATSQSGAHVGSVVPGPILPEQLADGATQVVVVGGSMEGHGMDESLSPGGAVIQQVTKQEILNLSEAGVAPPEASSALDALLCAVTELGEVEGRAGLEEQGRPGAKDVLIQLPGQEVSHVAADPEAPEIQMFPQAQESPAAVEVLTQVVHPSAAMASQERAQVAFKKMVQGVLQFAVCDTAAAGQLVKDGVTQVVVSEEGAVHMVAGEGAQIIMQEAQGEHMDLVESDGEISQIIVTEELVQAMVQESSGGFSEGTTHYILTELPPGVQDEPGLYSHTVLETADSQELLQAGATLGTEAGAPSRAEQLASVVIYTQEGSSAAAAIQSQRESSELQEA,mutated_sequence,1.0,2248.0,NP_060227.2.a2m,NP_060227.2.npy,ClinVar
+NP_060247.2,NP_060247.2.csv,MAETVWSTDTGEAVYRSRDPVRNLRLRVHLQRITSSNFLHYQPAAELGKDLIDLATFRPQPTASGHRPEEDEEEEIVIGWQEKLFSQFEVDLYQNETACQSPLDYQYRQEILKLENSGGKKNRRIFTYTDSDRYTNLEEHCQRMTTAASEVPSFLVERMANVRRRRQDRRGMEGGILKSRIVTWEPSEEFVRNNHVINTPLQTMHIMADLGPYKKLGYKKYEHVLCTLKVDSNGVITVKPDFTGLKGPYRIETEGEKQELWKYTIDNVSPHAQPEEEERERRVFKDLYGRHKEYLSSLVGTDFEMTVPGALRLFVNGEVVSAQGYEYDNLYVHFFVELPTAHWSSPAFQQLSGVTQTCTTKSLAMDKVAHFSYPFTFEAFFLHEDESSDALPEWPVLYCEVLSLDFWQRYRVEGYGAVVLPATPGSHTLTVSTWRPVELGTVAELRRFFIGGSLELEDLSYVRIPGSFKGERLSRFGLRTETTGTVTFRLHCLQQSRAFMESSSLQKRMRSVLDRLEGFSQQSSIHNVLEAFRRARRRMQEARESLPQDLVSPSGTLVS,mutated_sequence,1.0,559.0,NP_060247.2.a2m,NP_060247.2.npy,ClinVar
+NP_060295.1,NP_060295.1.csv,MAAAAMAAAAGGGAGAARSLSRFRGCLAGALLGDCVGSFYEAHDTVDLTSVLRHVQSLEPDPGTPGSERTEALYYTDDTAMARALVQSLLAKEAFDEVDMAHRFAQEYKKDPDRGYGAGVVTVFKKLLNPKCRDVFEPARAQFNGKGSYGNGGAMRVAGISLAYSSVQDVQKFARLSAQLTHASSLGYNGAILQALAVHLALQGESSSEHFLKQLLGHMEDLEGDAQSVLDARELGMEERPYSSRLKKIGELLDQASVTREEVVSELGNGIAAFESVPTAIYCFLRCMEPDPEIPSAFNSLQRTLIYSISLGGDTDTIATMAGAIAGAYYGMDQVPESWQQSCEGYEETDILAQSLHRVFQKS,mutated_sequence,1.0,363.0,NP_060295.1.a2m,NP_060295.1.npy,ClinVar
+NP_060334.2,NP_060334.2.csv,MSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCPDPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKSLSVPVLNMLLNELLCISKVPPGTKHVDMDLATLPPTTAMAVLLYNRWAIRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNKDLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAVYENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVHICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNEALDEICFKVCACNTVRDILEGRTISVQFNQLFLRPNKEKIDFLLEVCSRSVNLEKASESLKGNMAAFLKNVCLGLEDLQYVFMISSHELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVDPWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCSTVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISRVRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATCKELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRSELLSFIKKLREPLVLTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSINPNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQVIKRMIKCCSLLNCHTQVAILCQFLREIDYKTAFKSLQEQNSHDAMDSYYDYIWDVTILEYLTYLHHKRGETDKRQIAIKAIGQTELNASNPEEVLQLAAQRRKKKFLQAMAKLYF,mutated_sequence,1.0,995.0,NP_060334.2.a2m,NP_060334.2.npy,ClinVar
+NP_060352.1,NP_060352.1.csv,MEATRRRQHLGATGGPGAQLGASFLQARHGSVSADEAARTAPFHLDLWFYFTLQNWVLDFGRPIAMLVFPLEWFPLNKPSVGDYFHMAYNVITPFLLLKLIERSPRTLPRSITYVSIIIFIMGASIHLVGDSVNHRLLFSGYQHHLSVRENPIIKNLKPETLIDSFELLYYYDEYLGHCMWYIPFFLILFMYFSGCFTASKAESLIPGPALLLVAPSGLYYWYLVTEGQIFILFIFTFFAMLALVLHQKRKRLFLDSNGLFLFSSFALTLLLVALWVAWLWNDPVLRKKYPGVIYVPEPWAFYTLHVSSRH,mutated_sequence,1.0,311.0,NP_060352.1.a2m,NP_060352.1.npy,ClinVar
+NP_060387.2,NP_060387.2.csv,MDWKEVLRRRLATPNTCPNKKKSEQELKDEEMDLFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPAEDEVLLQKLREESRAVFLQRKSRELLDNEELQNLWFLLDKHQTPPMIGEEAMINYENFLKVGEKAGAKCKQFFTAKVFAKLLHTDSYGRISIMQFFNYVMRKVWLHQTRIGLSLYDVAGQGYLRESDLENYILELIPTLPQLDGLEKSFYSFYVCTAVRKFFFFLDPLRTGKIKIQDILACSFLDDLLELRDEELSKESQETNWFSAPSALRVYGQYLNLDKDHNGMLSKEELSRYGTATMTNVFLDRVFQECLTYDGEMDYKTYLDFVLALENRKEPAALQYIFKLLDIENKGYLNVFSLNYFFRAIQELMKIHGQDPVSFQDVKDEIFDMVKPKDPLKISLQDLINSNQGDTVTTILIDLNGFWTYENREALVANDSENSADLDDT,mutated_sequence,1.0,453.0,NP_060387.2.a2m,NP_060387.2.npy,ClinVar
+NP_060416.1,NP_060416.1.csv,MRLFLWNAVLTLFVTSLIGALIPEPEVKIEVLQKPFICHRKTKGGDLMLVHYEGYLEKDGSLFHSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPALGYGKEGKGKIPPESTLIFNIDLLEIRNGPRSHESFQEMDLNDDWKLSKDEVKAYLKKEFEKHGAVVNESHHDALVEDIFDKEDEDKDGFISAREFTYKHDEL,mutated_sequence,1.0,211.0,NP_060416.1.a2m,NP_060416.1.npy,ClinVar
+NP_060420.2,NP_060420.2.csv,MAEPGGAAGRSHPEDGSASEGEKEGNNESHMVSPPEKDDGQKGEEAVGSTEHPEEVTTQAEAAIEEGEVETEGEAAVEGEEEAVSYGDAESEEEYYYTETSSPEGQISAADTTYPYFSPPQELPGEEAYDSVSGEAGLQGFQQEATGPPESRERRVTSPEPSHGVLGPSEQMGQVTSGPAVGRLTGSTEEPQGQVLPMGVQHRFRLSHGSDIESSDLEEFVSQEPVIPPGVPDAHPREGDLPVFQDQIQQPSTEEGAMAERVESEGSDEEAEDEGSQLVVLDPDHPLMVRFQAALKNYLNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCAAANEERKKLAALQTEMENLALHLFYMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQLELRRKIRDVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSEIVALQTRLKHLQAVKEGRYVFLFRSKQSLVLERQRLDKRLALIATILDRVRDEYPQFQEALHKVSQMIANKLESPGPS,mutated_sequence,1.0,1142.0,NP_060420.2.a2m,NP_060420.2.npy,ClinVar
+NP_060476.2,NP_060476.2.csv,MQALRHVVCALSGGVDSAVAALLLRRRGYQVTGVFMKNWDSLDEHGVCTADKDCEDAYRVCQILDIPFHQVSYVKEYWNDVFSDFLNEYEKGRTPNPDIVCNKHIKFSCFFHYAVDNLGADAIATGHYARTSLEDEEVFEQKHVKKPEGLFRNRFEVRNAVKLLQAADSFKDQTFFLSQVSQDALRRTIFPLGGLTKEFVKKIAAENRLHHVLQKKESMGMCFIGKRNFEHFLLQYLQPRPGHFISIEDNKVLGTHKGWFLYTLGQRANIGGLREPWYVVEKDSVKGDVFVAPRTDHPALYRDLLRTSRVHWIAEEPPAALVRDKMMECHFRFRHQMALVPCVLTLNQDGTVWVTAVQAVRALATGQFAVFYKGDECLGSGKILRLGPSAYTLQKGQRRAGMATESPSDSPEDGPGLSPLL,mutated_sequence,1.0,421.0,NP_060476.2.a2m,NP_060476.2.npy,ClinVar
+NP_060521.4,NP_060521.4.csv,MEPGKRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAERKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEETVEIEKEETDLENARADAYTASCEDDFEDYEDDFEVCDGDDDESSNEPESREKLEELPLAQKKEIQEIQRAINAENERIGELSLKLFQKRGRTEFEKEPRTDTNSSPSRASVCGIFVDFASASHRQKSRTQALKQKMRSTKLLRLIDLDFSFTFSLLDLPPVNEYDMYIRNFGKKNTKQAYVQCNEDNVERDIQTEEIETREVWTQHPGESTVVSGGSEQRDTSDAVVMPKIDTPRLCSFLRAACQVMAVLLEEDRLAAEPSWNLRAQDRALYFSDSSSQLNTSLPFLQNRKVSSLHTSRVQRQMVVSVHDLPEKSFVPLLDSKYVLCVWDIWQPSGPQKVLICESQVTCCCLSPLKAFLLFAGTAHGSVVVWDLREDSRLHYSVTLSDGFWTFRTATFSTDGILTSVNHRSPLQAVEPISTSVHKKQSFVLSPFSTQEEMSGLSFHIASLDESGVLNVWVVVELPKADIAGSISDLGLMPGGRVKLVHSALIQLGDSLSHKGNEFWGTTQTLNVKFLPSDPNHFIIGTDMGLISHGTRQDLRVAPKLFKPQQHGIRPVKVNVIDFSPFGEPIFLAGCSDGSIRLHQLSSAFPLLQWDSSTDSHAVTGLQWSPTRPAVFLVQDDTSNIYIWDLLQSDLGPVAKQQVSPNRLVAMAAVGEPEKAGGSFLALVLARASGSIDIQHLKRRWAAPEVDECNRLRLLLQEALWPEGKLHK,mutated_sequence,1.0,1066.0,NP_060521.4.a2m,NP_060521.4.npy,ClinVar
+NP_060533.2,NP_060533.2.csv,MPAERPAGSGGSEAPAMVEQLDTAVITPAMLEEEEQLEAAGLERERKMLEKARMSWDRESTEIRYRRLQHLLEKSNIYSKFLLTKMEQQQLEEQKKKEKLERKKESLKVKKGKNSIDASEEKPVMRKKRGREDESYNISEVMSKEEILSVAKKNKKENEDENSSSTNLCVEDLQKNKDSNSIIKDRLSETVRQNTKFFFDPVRKCNGQPVPFQQPKHFTGGVMRWYQVEGMEWLRMLWENGINGILADEMGLGKTVQCIATIALMIQRGVPGPFLVCGPLSTLPNWMAEFKRFTPDIPTMLYHGTQEERQKLVRNIYKRKGTLQIHPVVITSFEIAMRDRNALQHCYWKYLIVDEGHRIKNMKCRLIRELKRFNADNKLLLTGTPLQNNLSELWSLLNFLLPDVFDDLKSFESWFDITSLSETAEDIIAKEREQNVLHMLHQILTPFLLRRLKSDVALEVPPKREVVVYAPLSKKQEIFYTAIVNRTIANMFGSSEKETIELSPTGRPKRRTRKSINYSKIDDFPNELEKLISQIQPEVDRERAVVEVNIPVESEVNLKLQNIMMLLRKCCNHPYLIEYPIDPVTQEFKIDEELVTNSGKFLILDRMLPELKKRGHKVLLFSQMTSMLDILMDYCHLRDFNFSRLDGSMSYSEREKNMHSFNTDPEVFIFLVSTRAGGLGINLTAADTVIIYDSDWNPQSDLQAQDRCHRIGQTKPVVVYRLVTANTIDQKIVERAAAKRKLEKLIIHKNHFKGGQSGLNLSKNFLDPKELMELLKSRDYEREIKGSREKVISDKDLELLLDRSDLIDQMNASGPIKEKMGIFKILENSEDSSPECLF,mutated_sequence,1.0,838.0,NP_060533.2.a2m,NP_060533.2.npy,ClinVar
+NP_060552.4,NP_060552.4.csv,MDVLAEEFGNLTPEQLAAPIPTVEEKWRLLPAFLKVKGLVKQHIDSFNYFINVEIKKIMKANEKVTSDADPMWYLKYLNIYVGLPDVEESFNVTRPVSPHECRLRDMTYSAPITVDIEYTRGSQRIIRNALPIGRMPIMLRSSNCVLTGKTPAEFAKLNECPLDPGGYFIVKGVEKVILIQEQLSKNRIIVEADRKGAVGASVTSSTHEKKSRTNMAVKQGRFYLRHNTLSEDIPIVIIFKAMGVESDQEIVQMIGTEEHVMAAFGPSLEECQKAQIFTQMQALKYIGNKVRRQRMWGGGPKKTKIEEARELLASTILTHVPVKEFNFRAKCIYTAVMVRRVILAQGDNKVDDRDYYGNKRLELAGQLLSLLFEDLFKKFNSEMKKIADQVIPKQRAAQFDVVKHMRQDQITNGMVNAISTGNWSLKRFKMDRQGVTQVLSRLSYISALGMMTRISSQFEKTRKVSGPRSLQPSQWGMLCPSDTPEGEACGLVKNLALMTHITTDMEDGPIVKLASNLGVEDVNLLCGEELSYPNVFLVFLNGNILGVIRDHKKLVNTFRLMRRAGYINEFVSISTNLTDRCVYISSDGGRLCRPYIIVKKQKPAVTNKHMEELAQGYRNFEDFLHESLVEYLDVNEENDCNIALYEHTINKDTTHLEIEPFTLLGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGTIGYNQRNRIDTLMYLLAYPQKPMVKTKTIELIEFEKLPAGQNATVAVMSYSGYDIEDALVLNKASLDRGFGRCLVYKNAKCTLKRYTNQTFDKVMGPMLDAATRKPIWRHEILDADGICSPGEKVENKQVLVNKSMPTVTQIPLEGSNVPQQPQYKDVPITYKGATDSYIEKVMISSNAEDAFLIKMLLRQTRRPEIGDKFSSRHGQKGVCGLIVPQEDMPFCDSGICPDIIMNPHGFPSRMTVGKLIELLAGKAGVLDGRFHYGTAFGGSKVKDVCEDLVRHGYNYLGKDYVTSGITGEPLEAYIYFGPVYYQKLKHMVLDKMHARARGPRAVLTRQPTEGRSRDGGLRLGEMERDCLIGYGASMLLLERLMISSDAFEVDVCGQCGLLGYSGWCHYCKSSCHVSSLRIPYACKLLFQELQSMNIIPRLKLSKYNE,mutated_sequence,1.0,1133.0,NP_060552.4.a2m,NP_060552.4.npy,ClinVar
+NP_060575.1,NP_060575.1.csv,MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTPDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA,mutated_sequence,1.0,213.0,NP_060575.1.a2m,NP_060575.1.npy,ClinVar
+NP_060597.4,NP_060597.4.csv,MWALCSLLRSAAGRTMSQGRTISQAPARRERPRKDPLRHLRTREKRGPSGCSGGPNTVYLQVVAAGSRDSGAALYVFSEFNRYLFNCGEGVQRLMQEHKLKVARLDNIFLTRMHWSNVGGLSGMILTLKETGLPKCVLSGPPQLEKYLEAIKIFSGPLKGIELAVRPHSAPEYEDETMTVYQIPIHSEQRRGKHQPWQSPERPLSRLSPERSSDSESNENEPHLPHGVSQRRGVRDSSLVVAFICKLHLKRGNFLVLKAKEMGLPVGTAAIAPIIAAVKDGKSITHEGREILAEELCTPPDPGAAFVVVECPDESFIQPICENATFQRYQGKADAPVALVVHMAPASVLVDSRYQQWMERFGPDTQHLVLNENCASVHNLRSHKIQTQLNLIHPDIFPLLTSFRCKKEGPTLSVPMVQGECLLKYQLRPRREWQRDAIITCNPEEFIVEALQLPNFQQSVQEYRRSAQDGPAPAEKRSQYPEIIFLGTGSAIPMKIRNVSATLVNISPDTSLLLDCGEGTFGQLCRHYGDQVDRVLGTLAAVFVSHLHADHHTGLPSILLQRERALASLGKPLHPLLVVAPNQLKAWLQQYHNQCQEVLHHISMIPAKCLQEGAEISSPAVERLISSLLRTCDLEEFQTCLVRHCKHAFGCALVHTSGWKVVYSGDTMPCEALVRMGKDATLLIHEATLEDGLEEEAVEKTHSTTSQAISVGMRMNAEFIMLNHFSQRYAKVPLFSPNFSEKVGVAFDHMKVCFGDFPTMPKLIPPLKALFAGDIEEMEERREKRELRQVRAALLSRELAGGLEDGEPQQKRAHTEEPQAKKVRAQ,mutated_sequence,1.0,826.0,NP_060597.4.a2m,NP_060597.4.npy,ClinVar
+NP_060767.2,NP_060767.2.csv,MAAAALGSSSGSASPAVAELCQNTPETFLEASKLLLTYADNILRNPNDEKYRSIRIGNTAFSTRLLPVRGAVECLFEMGFEEGETHLIFPKKASVEQLQKIRDLIAIERSSRLDGSNKSHKVKSSQQPAASTQLPTTPSSNPSGLNQHTRNRQGQSSDPPSASTVAADSAILEVLQSNIQHVLVYENPALQEKALACIPVQELKRKSQEKLSRARKLDKGINISDEDFLLLELLHWFKEEFFHWVNNVLCSKCGGQTRSRDRSLLPSDDELKWGAKEVEDHYCDACQFSNRFPRYNNPEKLLETRCGRCGEWANCFTLCCRAVGFEARYVWDYTDHVWTEVYSPSQQRWLHCDACEDVCDKPLLYEIGWGKKLSYVIAFSKDEVVDVTWRYSCKHEEVIARRTKVKEALLRDTINGLNKQRQLFLSENRRKELLQRIIVELVEFISPKTPKPGELGGRISGSVAWRVARGEMGLQRKETLFIPCENEKISKQLHLCYNIVKDRYVRVSNNNQTISGWENGVWKMESIFRKVETDWHMVYLARKEGSSFAYISWKFECGSVGLKVDSISIRTSSQTFQTGTVEWKLRSDTAQVELTGDNSLHSYADFSGATEVILEAELSRGDGDVAWQHTQLFRQSLNDHEENCLEIIIKFSDL,mutated_sequence,1.0,654.0,NP_060767.2.a2m,NP_060767.2.npy,ClinVar
+NP_060876.5,NP_060876.5.csv,MKGARWRRVPWVSLSCLCLCLLPHVVPGTTEDTLITGSKTAAPVTSTGSTTATLEGQSTAASSRTSNQDISASSQNHQTKSTETTSKAQTDTLTQMMTSTLFSSPSVHNVMETAPPDEMTTSFPSSVTNTLMMTSKTITMTTSTDSTLGNTEETSTAGTESSTPVTSAVSITAGQEGQSRTTSWRTSIQDTSASSQNHWTRSTQTTRESQTSTLTHRTTSTPSFSPSVHNVTGTVSQKTSPSGETATSSLCSVTNTSMMTSEKITVTTSTGSTLGNPGETSSVPVTGSLMPVTSAALVTFDPEGQSPATFSRTSTQDTTAFSKNHQTQSVETTRVSQINTLNTLTPVTTSTVLSSPSGFNPSGTVSQETFPSGETTTSSPSSVSNTFLVTSKVFRMPTSRDSTLGNTEETSLSVSGTISAITSKVSTIWWSDTLSTALSPSSLPPKISTAFHTQQSEGAETTGRPHERSSFSPGVSQEIFTLHETTTWPSSFSSKGHTTWSQTELPSTSTGAATRLVTGNPSTGTAGTIPRVPSKVSAIGEPGEPTTYSSHSTTLPKTTGAGAQTQWTQETGTTGEALLSSPSYSVTQMIKTATSPSSSPMLDRHTSQQITTAPSTNHSTIHSTSTSPQESPAVSQRGHTQAPQTTQESQTTRSVSPMTDTKTVTTPGSSFTASGHSPSEIVPQDAPTISAATTFAPAPTGDGHTTQAPTTALQAAPSSHDATLGPSGGTSLSKTGALTLANSVVSTPGGPEGQWTSASASTSPDTAAAMTHTHQAESTEASGQTQTSEPASSGSRTTSAGTATPSSSGASGTTPSGSEGISTSGETTRFSSNPSRDSHTTQSTTELLSASASHGAIPVSTGMASSIVPGTFHPTLSEASTAGRPTGQSSPTSPSASPQETAAISRMAQTQRTRTSRGSDTISLASQATDTFSTVPPTPPSITSTGLTSPQTETHTLSPSGSGKTFTTALISNATPLPVTYASSASTGHTTPLHVTDASSVSTGHATPLPVTSPSSVSTGHTTPLPVTDTSSESTGHVTPLPVTSFSSASTGDSTPLPVTDTSSASTGHVTPLPVTSLSSASTGDTTPLPVTDTSSASTGHATSLPVTDTSSVSTGHTTPLPVTDTSSASTGHATSLPVTDTSSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTGHVTPLHVTSPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTGHVTPLHVTSPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLPVTSPSSASTGHTTPLPVTDTSSASKGDTTPLPVTSPSSASTGHTTPLPVTDTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASTGHATPLPVTSTSSASTGHATPLPVTGLSSATTDDTTRLPVTDVSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHASPLLVTDASSASTGQATPLPVTDTSSVSTAHATPLPVTGLSSASTDDTTRLPVTDVSSASTGQAIPLPVTSPSSASTGDTTPLPVTDASSASTGDTTSLPVTIPSSASSGHTTSLPVTDASSVSTGHATSLLVTDASSVSTGDTTPLPVTDTNSASTGDTTPLHVTDASSVSTGHATSLPVTSLSSASTGDTTPLPVTSPSSASSGHTTPLPVTDASSVPTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATPLPVTDTSSVSTGQATPLPVTSLSSASTGDTTPLPVTDTSSASTGQDTPLPVTSLSSVSTGDTTPLPVTNPSSASTGHATPLLVTDASSISTGHATSLLVTDASSVSTGHATALHDTDASSLSTGDTTPLPVTSPSSTSTGDTTPLPVTETSSVSTGHATSLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPVTDTSSASTGQATPLPVTSPSSASTGHAIPLLVTDTSSASTGQATPLPVTSLSSASTGDTTPLPVTDASSVSTGHATSLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLHVTDASSASTGHATPLPVTSLSSASTGDTTPLPVTSPSSASTGHATPLHVTDASSVSTGDTTPLPVTSSSSASSGHTTPLPVTDASSASTGDTTPLPVTDTSSASTGHATHLPVTGLSSASTGDTTRLPVTNVSSASTGHATPLPVTSTSSASTGDTTPLPGTDTSSVSTGHTTPLLVTDASSVSTGDTTRLPVTSPSSASTGHTTPLPVTDTPSASTGDTTPLPVTNASSLSTRHATSLHVTSPSSASTGHATSLPVTDTSAASTGHATPLPVTSTSSASTGDTTPLPVTDTYSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTGDTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVTETSSVSTGDTTPLPVTDTSSASTGHATPLPVTNTSSVSTGHATPLHVTSPSSASTGHTTPLPVTDASSVSTGHATSLPVTDASSVFTGHATSLPVTIPSSASSGHTTPLPVTDASSVSTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATPLPLTSLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPDTDTSSASTGHATLLPVTDTSSASIGHATSLPVTDTSSISTGHATPLHVTSPSSASTGHATPLPVTDTSSASTGHANPLHVTSPSSASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHTTPLPVTDTSSASTGQATALPVTSTSSASTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTGDTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVTETSSVSTGHATPLLVTDASSASTGHATPLHVTSPSSASTGDTTPVPVTDTSSVSTGHATPLPVTGLSSASTGDTTRLPVTDISSASTGQATPLPVTNTSSVSTGDTMPLPVTSPSSASTGHATPLPVTSTSSASTGHATPVPVTSTSSASTGHTTPLPVTDTSSASTGDTTPLPVTSPSSASTGHTTPLHVTIPSSASTGDTSTLPVTGASSASTGHATPLPVTDTSSVSTGHATPLPVTSLSSVSTGDTTPLPVTDASSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATPLPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATPLPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVTIPSSSSSGHTTPLPVTSTSSVSTGHVTPLHVTSPSSASTGHVTPLPVTSTSSASTGHATPLLVTDASSVSTGHATPLPVTDASSASTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTDASSASTGHATPLPVTIPSSVSTGDTMPLPVTSPSSASTGHATPLPVTGLSSASTGDTTPLPVTDTSSASTRHATPLPVTDTSSASTDDTTRLPVTDVSSASTGHATPLPVTSTSSASTGDTTPLPVTDTSSVSTGHATSLPVTSRSSASTGHATPLPVTDTSSVSTGHATPLPVTSTSSVSTGHATPLPVTSPSSASTGHATPVPVTSTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASRGDTSTLPVTDASSASTGHATPLPLTSLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTIPSSASSGHTTSLPVTDASSVSTGHGTPLPVTSTSSASTGDTTPLPVTDTSSASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTGHATPLAVSSATSASTVSSDSPLKMETPGMTTPSLKTDGGRRTATSPPPTTSQTIISTIPSTAMHTRSTAAPIPILPERGVSLFPYGAGAGDLEFVRRTVDFTSPLFKPATGFPLGSSLRDSLYFTDNGQIIFPESDYQIFSYPNPLPTGFTGRDPVALVAPFWDDADFSTGRGTTFYQEYETFYGEHSLLVQQAESWIRKMTNNGGYKARWALKVTWVNAHAYPAQWTLGSNTYQAILSTDGSRSYALFLYQSGGMQWDVAQRSGNPVLMGFSSGDGYFENSPLMSQPVWERYRPDRFLNSNSGLQGLQFYRLHREERPNYRLECLQWLKSQPRWPSWGWNQVSCPCSWQQGRRDLRFQPVSIGRWGLGSRQLCSFTSWRGGVCCSYGPWGEFREGWHVQRPWQLAQELEPQSWCCRWNDKPYLCALYQQRRPHVGCATYRPPQPAWMFGDPHITTLDGVSYTFNGLGDFLLVGAQDGNSSFLLQGRTAQTGSAQATNFIAFAAQYRSSSLGPVTVQWLLEPHDAIRVLLDNQTVTFQPDHEDGGGQETFNATGVLLSRNGSEVSASFDGWATVSVIALSNILHASASLPPEYQNRTEGLLGVWNNNPEDDFRMPNGSTIPPGSPEEMLFHFGMTWQINGTGLLGKRNDQLPSNFTPVFYSQLQKNSSWAEHLISNCDGDSSCIYDTLALRNASIGLHTREVSKNYEQANATLNQYPPSINGGRVIEAYKGQTTLIQYTSNAEDANFTLRDSCTDLELFENGTLLWTPKSLEPFTLEILARSAKIGLASALQPRTVVCHCNAESQCLYNQTSRVGNSSLEVAGCKCDGGTFGRYCEGSEDACEEPCFPSVHCVPGKGCEACPPNLTGDGRHCAALGSSFLCQNQSCPVNYCYNQGHCYISQTLGCQPMCTCPPAFTDSRCFLAGNNFSPTVNLELPLRVIQLLLSEEENASMAEVNASVAYRLGTLDMRAFLRNSQVERIDSAAPASGSPIQHWMVISEFQYRPRGPVIDFLNNQLLAAVVEAFLYHVPRRSEEPRNDVVFQPISGEDVRDVTALNVSTLKAYFRCDGYKGYDLVYSPQSGFTCVSPCSRGYCDHGGQCQHLPSGPRCSCVSFSIYTAWGEHCEHLSMKLDAFFGIFFGALGGLLLLGVGTFVVLRFWGCSGARFSYFLNSAEALP,mutated_sequence,1.0,5412.0,NP_060876.5.a2m,NP_060876.5.npy,ClinVar
+NP_060914.2,NP_060914.2.csv,MPAPTQLFFPLIRNCELSRIYGTACYCHHKHLCCSSSYIPQSRLRYTPHPAYATFCRPKENWWQYTQGRRYASTPQKFYLTPPQVNSILKANEYSFKVPEFDGKNVSSILGFDSNQLPANAPIEDRRSAATCLQTRGMLLGVFDGHAGCACSQAVSERLFYYIAVSLLPHETLLEIENAVESGRALLPILQWHKHPNDYFSKEASKLYFNSLRTYWQELIDLNTGESTDIDVKEALINAFKRLDNDISLEAQVGDPNSFLNYLVLRVAFSGATACVAHVDGVDLHVANTGDSRAMLGVQEEDGSWSAVTLSNDHNAQNERELERLKLEHPKSEAKSVVKQDRLLGLLMPFRAFGDVKFKWSIDLQKRVIESGPDQLNDNEYTKFIPPNYHTPPYLTAEPEVTYHRLRPQDKFLVLATDGLWETMHRQDVVRIVGEYLTGMHHQQPIAVGGYKVTLGQMHGLLTERRTKMSSVFEDQNAATHLIRHAVGNNEFGTVDHERLSKMLSLPEELARMYRDDITIIVVQFNSHVVGAYQNQE,mutated_sequence,1.0,537.0,NP_060914.2.a2m,NP_060914.2.npy,ClinVar
+NP_060921.3,NP_060921.3.csv,MFLMPTSSELNSGQNFLTQWMTNPSRAGVILNRGFPILEADKEKRAAVDISTSFPIKGTHFSDSFSFINEEDSLLEEQKLESNNPYKPQSDKSETHTAFPCIKKGPQVAACHSAPGHQEENKNDFIPDLASEFKEGAYKDPLFKKLEQLKEVQQKKQEQLKRQQLEQLQRLMEEQEKLLTMVSGQCTLPGLSLLPDDQSQKHRSPGNTTTGERATCCFPSYVYPDPTQEETYPSNILSHEQSNFCRTAHGDFVLTSKRASPNLFSEAQYQEAPVEKNNLKEENRNHPTGESILCWEKVTEQIQEANDKNLQKHDDSSEVANIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKESKLVTNQSTSEDQPLFKMDRQQLQRKTALKNKELCADNPILKKDSKARTKSGSVTLSQKPKMLKCSNRKSLSPSGLKIQTGKKCDGQFRDQIKFENKVTSNNKENVTECPKPCDTGCTGWNKTQGKDRLPLSTGPASRLAAKSPIRETMKESESSLDVSLQKKLETWEREKEKENLELDEFLFLEQAADEISFSSNSSFVLKILERDQQICKGHRMSSTPVKAVPQKTNPADPISHCNRSEDLDHTAREKESECEVAPKQLHSLSSADELREQPCKIRKAVQKSTSENQTEWNARDDEGVPNSDSSTDSEEQLDVTIKPSTEDRERGISSREDSPQVCDDKGPFKDTRTQEDKRRDVDLDLSDKDYSSDESIMESIKHKVSEPSRSSSLSLSKMDFDDERTWTDLEENLCNHDVVLGNESTYGTPQTCYPNNEIGILDKTIKRKIAPVKRGEDLSKSRRSRSPPTSELMMKFFPSLKPKPKSDSHLGNELKLNISQDQPPGDNARSQVLREKIIELETEIEKFKAENASLAKLRIERESALEKLRKEIADFEQQKAKELARIEEFKKEEMRKLQKERKVFEKYTTAARTFPDKKEREEIQTLKQQIADLREDLKRKETKWSSTHSRLRSQIQMLVRENTDLREEIKVMERFRLDAWKRAEAIESSLEVEKKDKLANTSVRFQNSQISSGTQVEKYKKNYLPMQGNPPRRSKSAPPRDLGNLDKGQAASPREPLEPLNFPDPEYKEEEEDQDIQGEISHPDGKVEKVYKNGCRVILFPNGTRKEVSADGKTITVTFFNGDVKQVMPDQRVIYYYAAAQTTHTTYPEGLEVLHFSSGQIEKHYPDGRKEITFPDQTVKNLFPDGQEESIFPDGTIVRVQRDGNKLIEFNNGQRELHTAQFKRREYPDGTVKTVYANGHQETKYRSGRIRVKDKEGNVLMDTEL,mutated_sequence,1.0,1338.0,NP_060921.3.a2m,NP_060921.3.npy,ClinVar
+NP_060956.1,NP_060956.1.csv,MEEPEEPADSGQSLVPVYIYSPEYVSMCDSLAKIPKRASMVHSLIEAYALHKQMRIVKPKVASMEEMATFHTDAYLQHLQKVSQEGDDDHPDSIEYGLGYDCPATEGIFDYAAAIGGATITAAQCLIDGMCKVAINWSGGWHHAKKDEASGFCYLNDAVLGILRLRRKFERILYVDLDLHHGDGVEDAFSFTSKVMTVSLHKFSPGFFPGTGDVSDVGLGKGRYYSVNVPIQDGIQDEKYYQICESVLKEVYQAFNPKAVVLQLGADTIAGDPMCSFNMTPVGIGKCLKYILQWQLATLILGGGGYNLANTARCWTYLTGVILGKTLSSEIPDHEFFTAYGPDYVLEITPSCRPDRNEPHRIQQILNYIKGNLKHVV,mutated_sequence,1.0,377.0,NP_060956.1.a2m,NP_060956.1.npy,ClinVar
+NP_061154.1,NP_061154.1.csv,MADEQEIMCKLESIKEIRNKTLQMEKIKARLKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKLLESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPPIPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKSRSRNPKKPKRKQDE,mutated_sequence,1.0,224.0,NP_061154.1.a2m,NP_061154.1.npy,ClinVar
+NP_061183.2,NP_061183.2.csv,MGRYSGKTCRLLFMLVLTVAFFVAELVSGYLGNSIALLSDSFNMLSDLISLCVGLSAGYIARRPTRGFSATYGYARAEVVGALSNAVFLTALCFTIFVEAVLRLARPERIDDPELVLIVGVLGLLVNVVGLLIFQDCAAWFACCLRGRSRRLQQRQQLAEGCVPGAFGGPQGAEDPRRAADPTAPGSDSAVTLRGTSVERKREKGATVFANVAGDSFNTQNEPEDMMKKEKKSEALNIRGVLLHVMGDALGSVVVVITAIIFYVLPLKSEDPCNWQCYIDPSLTVLMVIIILSSAFPLIKETAAILLQMVPKGVNMEELMSKLSAVPGISSVHEVHIWELVSGKIIATLHIKYPKDRGYQDASTKIREIFHHAGIHNVTIQFENVDLKEPLEQKDLLLLCNSPCISKGCAKQLCCPPGALPLAHVNGCAEHNGGPSLDTYGSDGLSRRDAREVAIEVSLDSCLSDHGQSLNKTQEDQCYVNRTHF,mutated_sequence,1.0,485.0,NP_061183.2.a2m,NP_061183.2.npy,ClinVar
+NP_061193.2,NP_061193.2.csv,MNCEREQLRGNQEAAAAPDTMAQPYASAQFAPPQNGIPAEYTAPHPHPAPEYTGQTTVPEHTLNLYPPAQTHSEQSPADTSAQTVSGTATQTDDAAPTDGQPQTQPSENTENKSQPKRLHVSNIPFRFRDPDLRQMFGQFGKILDVEIIFNERGSKGFGFVTFENSADADRAREKLHGTVVEGRKIEVNNATARVMTNKKTVNPYTNGWKLNPVVGAVYSPEFYAGTVLLCQANQEGSSMYSAPSSLVYTSAMPGFPYPAATAAAAYRGAHLRGRGRTVYNTFRAAAPPPPIPAYGGVVYQDGFYGADIYGGYAAYRYAQPTPATAAAYSDSYGRVYAADPYHHALAPAPTYGVGAMNAFAPLTDAKTRSHADDVGLVLSSLQASIYRGGYNRFAPY,mutated_sequence,1.0,397.0,NP_061193.2.a2m,NP_061193.2.npy,ClinVar
+NP_061496.2,NP_061496.2.csv,MDEEEDGAGAEESGQPRSFMRLNDLSGAGGRPGPGSAEKDPGSADSEAEGLPYPALAPVVFFYLSQDSRPRSWCLRTVCNPWFERISMLVILLNCVTLGMFRPCEDIACDSQRCRILQAFDDFIFAFFAVEMVVKMVALGIFGKKCYLGDTWNRLDFFIVIAGMLEYSLDLQNVSFSAVRTVRVLRPLRAINRVPSMRILVTLLLDTLPMLGNVLLLCFFVFFIFGIVGVQLWAGLLRNRCFLPENFSLPLSVDLERYYQTENEDESPFICSQPRENGMRSCRSVPTLRGDGGGGPPCGLDYEAYNSSSNTTCVNWNQYYTNCSAGEHNPFKGAINFDNIGYAWIAIFQVITLEGWVDIMYFVMDAHSFYNFIYFILLIIVGSFFMINLCLVVIATQFSETKQRESQLMREQRVRFLSNASTLASFSEPGSCYEELLKYLVYILRKAARRLAQVSRAAGVRVGLLSSPAPLGGQETQPSSSCSRSHRRLSVHHLVHHHHHHHHHYHLGNGTLRAPRASPEIQDRDANGSRRLMLPPPSTPALSGAPPGGAESVHSFYHADCHLEPVRCQAPPPRSPSEASGRTVGSGKVYPTVHTSPPPETLKEKALVEVAASSGPPTLTSLNIPPGPYSSMHKLLETQSTGACQSSCKISSPCLKADSGACGPDSCPYCARAGAGEVELADREMPDSDSEAVYEFTQDAQHSDLRDPHSRRQRSLGPDAEPSSVLAFWRLICDTFRKIVDSKYFGRGIMIAILVNTLSMGIEYHEQPEELTNALEISNIVFTSLFALEMLLKLLVYGPFGYIKNPYNIFDGVIVVISVWEIVGQQGGGLSVLRTFRLMRVLKLVRFLPALQRQLVVLMKTMDNVATFCMLLMLFIFIFSILGMHLFGCKFASERDGDTLPDRKNFDSLLWAIVTVFQILTQEDWNKVLYNGMASTSSWAALYFIALMTFGNYVLFNLLVAILVEGFQAEEISKREDASGQLSCIQLPVDSQGGDANKSESEPDFFSPSLDGDGDRKKCLALVSLGEHPELRKSLLPPLIIHTAATPMSLPKSTSTGLGEALGPASRRTSSSGSAEPGAAHEMKSPPSARSSPHSPWSAASSWTSRRSSRNSLGRAPSLKRRSPSGERRSLLSGEGQESQDEEESSEEERASPAGSDHRHRGSLEREAKSSFDLPDTLQVPGLHRTASGRGSASEHQDCNGKSASGRLARALRPDDPPLDGDDADDEGNLSKGERVRAWIRARLPACCLERDSWSAYIFPPQSRFRLLCHRIITHKMFDHVVLVIIFLNCITIAMERPKIDPHSAERIFLTLSNYIFTAVFLAEMTVKVVALGWCFGEQAYLRSSWNVLDGLLVLISVIDILVSMVSDSGTKILGMLRVLRLLRTLRPLRVISRAQGLKLVVETLMSSLKPIGNIVVICCAFFIIFGILGVQLFKGKFFVCQGEDTRNITNKSDCAEASYRWVRHKYNFDNLGQALMSLFVLASKDGWVDIMYDGLDAVGVDQQPIMNHNPWMLLYFISFLLIVAFFVLNMFVGVVVENFHKCRQHQEEEEARRREEKRLRRLEKKRRNLMLDDVIASGSSASAASEAQCKPYYSDYSRFRLLVHHLCTSHYLDLFITGVIGLNVVTMAMEHYQQPQILDEALKICNYIFTVIFVLESVFKLVAFGFRRFFQDRWNQLDLAIVLLSIMGITLEEIEVNASLPINPTIIRIMRVLRIARVLKLLKMAVGMRALLDTVMQALPQVGNLGLLFMLLFFIFAALGVELFGDLECDETHPCEGLGRHATFRNFGMAFLTLFRVSTGDNWNGIMKDTLRDCDQESTCYNTVISPIYFVSFVLTAQFVLVNVVIAVLMKHLEESNKEAKEEAELEAELELEMKTLSPQPHSPLGSPFLWPGVEGPDSPDSPKPGALHPAAHARSASHFSLEHPTDRQLFDTISLLIQGSLEWELKLMDELAGPGGQPSAFPSAPSLGGSDPQIPLAEMEALSLTSEIVSEPSCSLALTDDSLPDDMHTLLLSALESNMQPHPTELPGPDLLTVRKSGVSRTHSLPNDSYMCRHGSTAEGPLGHRGWGLPKAQSGSVLSVHSQPADTSYILQLPKDAPHLLQPHSAPTWGTIPKLPPPGRSPLAQRPLRRQAAIRTDSLDVQGLGSREDLLAEVSGPSPPLARAYSFWGQSSTQAQQHSRSHSKISKHMTPPAPCPGPEPNWGKGPPETRSSLELDTELSWISGDLLPPGGQEEPPSPRDLKKCYSVEAQSCQRRPTSWLDEQRRHSIAVSCLDSGSQPHLGTDPSNLGGQPLGGPGSRPKKKLSPPSITIDPPESQGPRTPPSPGICLRRRAPSSDSKDPLASGPPDSMAASPSPKKDVLSLSGLSSDPADLDP,mutated_sequence,1.0,2377.0,NP_061496.2.a2m,NP_061496.2.npy,ClinVar
+NP_061764.2,NP_061764.2.csv,MNPASDGGTSESIFDLDYASWGIRSTLMVAGFVFYLGVFVVCHQLSSSLNATYRSLVAREKVFWDLAATRAVFGVQSTAAGLWALLGDPVLHADKARGQQNWCWFHITTATGFFCFENVAVHLSNLIFRTFDLFLVIHHLFAFLGFLGCLVNLQAGHYLAMTTLLLEMSTPFTCVSWMLLKAGWSESLFWKLNQWLMIHMFHCRMVLTYHMWWVCFWHWDGLVSSLYLPHLTLFLVGLALLTLIINPYWTHKKTQQLLNPVDWNFAQPEAKSRPEGNGQLLRKKRP,mutated_sequence,1.0,286.0,NP_061764.2.a2m,NP_061764.2.npy,ClinVar
+NP_061820.1,NP_061820.1.csv,MGDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGYSYTAANKNKGIIWGEDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKATNE,mutated_sequence,1.0,105.0,NP_061820.1.a2m,NP_061820.1.npy,ClinVar
+NP_061971.3,NP_061971.3.csv,MFKSLTKVNKVKPIGENNENEQSSRRNEEGSHPSNQSQQTTAQEENKGEEKSLKTKSTPVTSEEPHTNIQDKLSKKNSSGDLTTNPDPQNAAEPTGTVPEQKEMDPGKEGPNSPQNKPPAAPVINEYADAQLHNLVKRMRQRTALYKKKLVEGDLSSPEASPQTAKPTAVPPVKESDDKPTEHYYRLLWFKVKKMPLTEYLKRIKLPNSIDSYTDRLYLLWLLLVTLAYNWNCCFIPLRLVFPYQTADNIHYWLIADIICDIIYLYDMLFIQPRLQFVRGGDIIVDSNELRKHYRTSTKFQLDVASIIPFDICYLFFGFNPMFRANRMLKYTSFFEFNHHLESIMDKAYIYRVIRTTGYLLFILHINACVYYWASNYEGIGTTRWVYDGEGNEYLRCYYWAVRTLITIGGLPEPQTLFEIVFQLLNFFSGVFVFSSLIGQMRDVIGAATANQNYFRACMDDTIAYMNNYSIPKLVQKRVRTWYEYTWDSQRMLDESDLLKTLPTTVQLALAIDVNFSIISKVDLFKGCDTQMIYDMLLRLKSVLYLPGDFVCKKGEIGKEMYIIKHGEVQVLGGPDGTKVLVTLKAGSVFGEISLLAAGGGNRRTANVVAHGFANLLTLDKKTLQEILVHYPDSERILMKKARVLLKQKAKTAEATPPRKDLALLFPPKEETPKLFKTLLGGTGKASLARLLKLKREQAAQKKENSEGGEEEGKENEDKQKENEDKQKENEDKGKENEDKDKGREPEEKPLDRPECTASPIAVEEEPHSVRRTVLPRGTSRQSLIISMAPSAEGGEEVLTIEVKEKAKQ,mutated_sequence,1.0,809.0,NP_061971.3.a2m,NP_061971.3.npy,ClinVar
+NP_063945.2,NP_063945.2.csv,MPSKAENLRPSEPAPQPPEGRTLQGQLPGAPPAQRAGSPPDAPGSESPALACSTPATPSGEDPPARAAPIAPRPPARPRLERALSLDDKGWRRRRFRGSQEDLEARNGTSPSRGSVQSEGPGAPAHSCSPPCLSTSLQEIPKSRGVLSSERGSPSSGGNPLSGVASSSPNLPHRDAAVAGSSPRLPSLLPPRPPPALSLDIASDSLRTANKVDSDLADYKLRAQPLLVRAHSSLGPGRPRSPLACDDCSLRSAKSSFSLLAPIRSKDVRSRSYLEGSLLASGALLGADELARYFPDRNVALFVATWNMQGQKELPPSLDEFLLPAEADYAQDLYVIGVQEGCSDRREWETRLQETLGPHYVLLSSAAHGVLYMSLFIRRDLIWFCSEVECSTVTTRIVSQIKTKGALGISFTFFGTSFLFITSHFTSGDGKVAERLLDYTRTVQALVLPRNVPDTNPYRSSAADVTTRFDEVFWFGDFNFRLSGGRTVVDALLCQGLVVDVPALLQHDQLIREMRKGSIFKGFQEPDIHFLPSYKFDIGKDTYDSTSKQRTPSYTDRVLYRSRHKGDICPVSYSSCPGIKTSDHRPVYGLFRVKVRPGRDNIPLAAGKFDRELYLLGIKRRISKEIQRQQALQSQNSSTICSVS,mutated_sequence,1.0,644.0,NP_063945.2.a2m,NP_063945.2.npy,ClinVar
+NP_065175.4,NP_065175.4.csv,MPHLPLASFRPPFWGLRHSRGLPRFHSVSTQSEPHGSPISRRNREAKQKRLREKQATLEAEIAGESKSPAESIKAWRPKELVLYEIPTKPGEKKDVSGPLPPAYSPRYVEAAWYPWWVREGFFKPEYQARLPQATGETFSMCIPPPNVTGSLHIGHALTVAIQDALVRWHRMRGDQVLWVPGSDHAGIATQAVVEKQLWKERGVRRHELSREAFLREVWQWKEAKGGEICEQLRALGASLDWDRECFTMDVGSSVAVTEAFVRLYKAGLLYRNHQLVNWSCALRSAISDIEVENRPLPGHTQLRLPGCPTPVSFGLLFSVAFPVDGEPDAEVVVGTTRPETLPGDVAVAVHPDDSRYTHLHGRQLRHPLMGQPLPLITDYAVQPHVGTGAVKVTPAHSPADAEMGARHGLSPLNVIAEDGTMTSLCGDWLQGLHRFVAREKIMSVLSEWGLFRGLQNHPMVLPICSRSGDVIEYLLKNQWFVRCQEMGARAAKAVESGALELSPSFHQKNWQHWFSHIGDWCVSRQLWWGHQIPAYLVVEDHAQGEEDCWVVGRSEAEAREVAAELTGRPGAELTLERDPDVLDTWFSSALFPFSALGWPQETPDLARFYPLSLLETGSDLLLFWVGRMVMLGTQLTGQLPFSKVLLHPMVRDRQGRKMSKSLGNVLDPRDIISGVEMQVLQEKLRSGNLDPAELAIVAAAQKKDFPHGIPECGTDALRFTLCSHGVQAGDLHLSVSEVQSCRHFCNKIWNALRFILNALGEKFVPQPAEELSPSSPMDAWILSRLALAAQECERGFLTRELSLVTHALHHFWLHNLCDVYLEAVKPVLWHSPRPLGPPQVLFSCADLGLRLLAPLMPFLAEELWQRLPPRPGCPPAPSISVAPYPSACSLEHWRQPELERRFSRVQEVVQVLRALRATYQLTKARPRVLLQSSEPGDQGLFEAFLEPLGTLGYCGAVGLLPPGAAAPSGWAQAPLSDTAQVYMELQGLVDPQIQLPLLAARRYKLQKQLDSLTARTPSEGEAGTQRQQKLSSLQLELSKLDKAASHLRQLMDEPPAPGSPEL,mutated_sequence,1.0,1063.0,NP_065175.4.a2m,NP_065175.4.npy,ClinVar
+NP_065184.2,NP_065184.2.csv,MGRARPGQRGPPSPGPAAQPPAPPRRRARSLALLGALLAAAAAAAVRVCARHAEAQAAARQELALKTLGTDGLFLFSSLDTDGDMYISPEEFKPIAEKLTGSCSVTQTGVQWCSHSSLQPQLPWLNUSSCLSLLRSTPAASCEEEELPPDPSEETLTIEARFQPLLPETMTKSKDGFLGVSRLALSGLRNWTAAASPSAVFATRHFQPFLPPPGQELGEPWWIIPSELSMFTGYLSNNRFYPPPPKGKEVIIHRLLSMFHPRPFVKTRFAPQGAVACLTAISDFYYTVMFRIHAEFQLSEPPDFPFWFSPAQFTGHIILSKDATHVRDFRLFVPNHRSLNVDMEWLYGASESSNMEVDIGYIPQMELEATGPSVPSVILDEDGSMIDSHLPSGEPLQFVFEEIKWQQELSWEEAARRLEVAMYPFKKVSYLPFTEAFDRAKAENKLVHSILLWGALDDQSCUGSGRTLRETVLESSPILTLLNESFISTWSLVKELEELQNNQENSSHQKLAGLHLEKYSFPVEMMICLPNGTVVHHINANYFLDITSVKPEEIESNLFSFSSTFEDPSTATYMQFLKEGLRRGLPLLQP,mutated_sequence,1.0,590.0,NP_065184.2.a2m,NP_065184.2.npy,ClinVar
+NP_065394.1,NP_065394.1.csv,MTAPAGPRGSETERLLTPNPGYGTQAGPSPAPPTPPEEEDLRRRLKYFFMSPCDKFRAKGRKPCKLMLQVVKILVVTVQLILFGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSFRLLFDVVVILTCSLSFLLCARSLLRGFLLQNEFVGFMWRQRGRVISLWERLEFVNGWYILLVTSDVLTISGTIMKIGIEAKNLASYDVCSILLGTSTLLVWVGVIRYLTFFHNYNILIATLRVALPSVMRFCCCVAVIYLGYCFCGWIVLGPYHVKFRSLSMVSECLFSLINGDDMFVTFAAMQAQQGRSSLVWLFSQLYLYSFISLFIYMVLSLFIALITGAYDTIKHPGGAGAEESELQAYIAQCQDSPTSGKFRRGSGSACSLLCCCGRDPSEEHSLLVN,mutated_sequence,1.0,580.0,NP_065394.1.a2m,NP_065394.1.npy,ClinVar
+NP_065712.1,NP_065712.1.csv,MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHERTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL,mutated_sequence,1.0,198.0,NP_065712.1.a2m,NP_065712.1.npy,ClinVar
+NP_065789.1,NP_065789.1.csv,MSVLISQSVINYVEEENIPALKALLEKCKDVDERNECGQTPLMIAAEQGNLEIVKELIKNGANCNLEDLDNWTALISASKEGHVHIVEELLKCGVNLEHRDMGGWTALMWACYKGRTDVVELLLSHGANPSVTGLYSVYPIIWAAGRGHADIVHLLLQNGAKVNCSDKYGTTPLVWAARKGHLECVKHLLAMGADVDQEGANSMTALIVAVKGGYTQSVKEILKRNPNVNLTDKDGNTALMIASKEGHTEIVQDLLDAGTYVNIPDRSGDTVLIGAVRGGHVEIVRALLQKYADIDIRGQDNKTALYWAVEKGNATMVRDILQCNPDTEICTKDGETPLIKATKMRNIEVVELLLDKGAKVSAVDKKGDTPLHIAIRGRSRKLAELLLRNPKDGRLLYRPNKAGETPYNIDCSHQKSILTQIFGARHLSPTETDGDMLGYDLYSSALADILSEPTMQPPICVGLYAQWGSGKSFLLKKLEDEMKTFAGQQIEPLFQFSWLIVFLTLLLCGGLGLLFAFTVHPNLGIAVSLSFLALLYIFFIVIYFGGRREGESWNWAWVLSTRLARHIGYLELLLKLMFVNPPELPEQTTKALPVRFLFTDYNRLSSVGGETSLAEMIATLSDACEREFGFLATRLFRVFKTEDTQGKKKWKKTCCLPSFVIFLFIIGCIISGITLLAIFRVDPKHLTVNAVLISIASVVGLAFVLNCRTWWQVLDSLLNSQRKRLHNAASKLHKLKSEGFMKVLKCEVELMARMAKTIDSFTQNQTRLVVIIDGLDACEQDKVLQMLDTVRVLFSKGPFIAIFASDPHIIIKAINQNLNSVLRDSNINGHDYMRNIVHLPVFLNSRGLSNARKFLVTSATNGDVPCSDTTGIQEDADRRVSQNSLGEMTKLGSKTALNRRDTYRRRQMQRTITRQMSFDLTKLLVTEDWFSDISPQTMRRLLNIVSVTGRLLRANQISFNWDRLASWINLTEQWPYRTSWLILYLEETEGIPDQMTLKTIYERISKNIPTTKDVEPLLEIDGDIRNFEVFLSSRTPVLVARDVKVFLPCTVNLDPKLREIIADVRAAREQISIGGLAYPPLPLHEGPPRAPSGYSQPPSVCSSTSFNGPFAGGVVSPQPHSSYYSGMTGPQHPFYNRPFFAPYLYTPRYYPGGSQHLISRPSVKTSLPRDQNNGLEVIKEDAAEGLSSPTDSSRGSGPAPGPVVLLNSLNVDAVCEKLKQIEGLDQSMLPQYCTTIKKANINGRVLAQCNIDELKKEMNMNFGDWHLFRSTVLEMRNAESHVVPEDPRFLSESSSGPAPHGEPARRASHNELPHTELSSQTPYTLNFSFEELNTLGLDEGAPRHSNLSWQSQTRRTPSLSSLNSQDSSIEISKLTDKVQAEYRDAYREYIAQMSQLEGGPGSTTISGRSSPHSTYYMGQSSSGGSIHSNLEQEKGKDSEPKPDDGRKSFLMKRGDVIDYSSSGVSTNDASPLDPITEEDEKSDQSGSKLLPGKKSSERSSLFQTDLKLKGSGLRYQKLPSDEDESGTEESDNTPLLKDDKDRKAEGKVERVPKSPEHSAEPIRTFIKAKEYLSDALLDKKDSSDSGVRSSESSPNHSLHNEVADDSQLEKANLIELEDDSHSGKRGIPHSLSGLQDPIIARMSICSEDKKSPSECSLIASSPEENWPACQKAYNLNRTPSTVTLNNNSAPANRANQNFDEMEGIRETSQVILRPSSSPNPTTIQNENLKSMTHKRSQRSSYTRLSKDPPELHAAASSESTGFGEERESIL,mutated_sequence,1.0,1771.0,NP_065789.1.a2m,NP_065789.1.npy,ClinVar
+NP_065796.2,NP_065796.2.csv,MAASVAAAARRLRRAIRRSPAWRGLSHRPLSSEPPAAKASAVRAAFLNFFRDRHGHRLVPSASVRPRGDPSLLFVNAGMNQFKPIFLGTVDPRSEMAGFRRVANSQKCVRAGGHHNDLEDVGRDLSHHTFFEMLGNWAFGGEYFKEEACNMAWELLTQVYGIPEERLWISYFDGDPKAGLDPDLETRDIWLSLGVPASRVLSFGPQENFWEMGDTGPCGPCTEIHYDLAGGVGAPQLVELWNLVFMQHNREADGSLQPLPQRHVDTGMGLERLVAVLQGKHSTYDTDLFSPLLNAIQQGCRAPPYLGRVGVADEGRTDTAYRVVADHIRTLSVCISDGIFPGMSGPPLVLRRILRRAVRFSMEILKAPPGFLGSLVPVVVETLGDAYPELQRNSAQIANLVSEDEAAFLASLERGRRIIDRTLRTLGPSDMFPAEVAWSLSLCGDLGLPLDMVELMLEEKGVQLDSAGLERLAQEEAQHRARQAEPVQKQGLWLDVHALGELQRQGVPPTDDSPKYNYSLRPSGSYEFGTCEAQVLQLYTEDGTAVASVGKGQRCGLLLDRTNFYAEQGGQASDRGYLVRAGQEDVLFPVARAQVCGGFILHEAVAPECLRLGDQVQLHVDEAWRLGCMAKHTATHLLNWALRQTLGPGTEQQGSHLNPEQLRLDVTTQTPLTPEQLRAVENTVQEAVGQDEAVYMEEVPLALTAQVPGLRSLDEVYPDPVRVVSVGVPVAHALDPASQAALQTSVELCCGTHLLRTGAVGDLVIIGDRQLSKGTTRLLAVTGEQAQQARELGQSLAQEVKAATERLSLGSRDVAEALRLSKDIGRLIEAVETAVMPQWQRRELLATVKMLQRRANTAIRKLQMGQAAKKTQELLERHSKGPLIVDTVSAESLSVLVKVVRQLCEQAPSTSVLLLSPQPMGKVLCACQVAQGAMPTFTAEAWALAVCSHMGGKAWGSRVVAQGTGSTTDLEAALSIAQTYALSQL,mutated_sequence,1.0,985.0,NP_065796.2.a2m,NP_065796.2.npy,ClinVar
+NP_065822.2,NP_065822.2.csv,MERAMEQLNRLTRSLRRARTVELPEDNETAVYTLMPMVMADQHRSVSELLSNSKFDVNYAFGRVKRSLLHIAANCGSVECLVLLLKKGANPNYQDISGCTPLHLAARNGQKKCMSKLLEYSADVNICNNEGLTAIHWLAVNGRTELLHDLVQHVSDVDVEDAMGQTALHVACQNGHKTTVQCLLDSGADINRPNVSGATPLYFACSHGQRDTAQILLLRGAKYLPDKNGVTPLDLCVQGGYGETCEVLIQYHPRLFQTIIQMTQNEDLRENMLRQVLEHLSQQSESQYLKILTSLAEVATTNGHKLLSLSSNYDAQMKSLLRIVRMFCHVFRIGPSSPSNGIDMGYNGNKTPRSQVFKPLELLWHSLDEWLVLIATELMKNKRDSTEITSILLKQKGQDQDAASIPPFEPPGPGSYENLSTGTRESKPDALAGRQEASADCQDVISMTANRLSAVIQAFYMCCSCQMPPGMTSPRFIEFVCKHDEVLKCFVNRNPKIIFDHFHFLLECPELMSRFMHIIKAQPFKDRCEWFYEHLHSGQPDSDMVHRPVNENDILLVHRDSIFRSSCEVVSKANCAKLKQGIAVRFHGEEGMGQGVVREWFDILSNEIVNPDYALFTQSADGTTFQPNSNSYVNPDHLNYFRFAGQILGLALNHRQLVNIYFTRSFYKHILGIPVNYQDVASIDPEYAKNLQWILDNDISDLGLELTFSVETDVFGAMEEVPLKPGGGSILVTQNNKAEYVQLVTELRMTRAIQPQINAFLQGFHMFIPPSLIQLFDEYELELLLSGMPEIDVSDWIKNTEYTSGYEREDPVIQWFWEVVEDITQEERVLLLQFVTGSSRVPHGGFANIMGGSGLQNFTIAAVPYTPNLLPTSSTCINMLKLPEYPSKEILKDRLLVALHCGSYGYTMA,mutated_sequence,1.0,909.0,NP_065822.2.a2m,NP_065822.2.npy,ClinVar
+NP_065851.1,NP_065851.1.csv,MRLKISLLKEPKHQELVSCVGWTTAEELYSCSDDHQIVKWNLLTSETTQIVKLPDDIYPIDFHWFPKSLGVKKQTQAESFVLTSSDGKFHLISKLGRVEKSVEAHCGAVLAGRWNYEGTALVTVGEDGQIKIWSKTGMLRSTLAQQGTPVYSVAWGPDSEKVLYTAGKQLIIKPLQPNAKVLQWKAHDGIILKVDWNSVNDLILSAGEDCKYKVWDSYGRPLYNSQPHEHPITSVAWAPDGELFAVGSFHTLRLCDKTGWSYALEKPNTGSIFNIAWSIDGTQIAGACGNGHVVFAHVVEQHWEWKNFQVTLTKRRAMQVRNVLNDAVDLLEFRDRVIKASLNYAHLVVSTSLQCYVFSTKNWNTPIIFDLKEGTVSLILQAERHFLLVDGSSIYLYSYEGRFISSPKFPGMRTDILNAQTVSLSNDTIAIRDKADEKIIFLFEASTGKPLGDGKFLSHKNEILEIALDQKGLTNDRKIAFIDKNRDLCITSVKRFGKEEQIIKLGTMVHTLAWNDTCNILCGLQDTRFIVWYYPNTVYVDRDILPKTLYERDASEFSKNPHIVSFVGNQVTIRRADGSLVHISITPYPAILHEYVSSSKWEDAVRLCRFVKEQTMWACLAAMAVANRDMTTAEIAYAAIGEIDKVQYINSIKNLPSKESKMAHILLFSGNIQEAEIVLLQAGLVYQAIQININLYNWERALELAVKYKTHVDTVLAYRQKFLETFGKQETNKRYLHYAEGLQIDWEKIKAKIEMEITKEREQSSSSQSSKSIGLKP,mutated_sequence,1.0,777.0,NP_065851.1.a2m,NP_065851.1.npy,ClinVar
+NP_066124.1,NP_066124.1.csv,MAKATSGAAGLRLLLLLLLPLLGKVALGLYFSRDAYWEKLYVDQAAGTPLLYVHALRDAPEEVPSFRLGQHLYGTYRTRLHENNWICIQEDTGLLYLNRSLDHSSWEKLSVRNRGFPLLTVYLKVFLSPTSLREGECQWPGCARVYFSFFNTSFPACSSLKPRELCFPETRPSFRIRENRPPGTFHQFRLLPVQFLCPNISVAYRLLEGEGLPFRCAPDSLEVSTRWALDREQREKYELVAVCTVHAGAREEVVMVPFPVTVYDEDDSAPTFPAGVDTASAVVEFKRKEDTVVATLRVFDADVVPASGELVRRYTSTLLPGDTWAQQTFRVEHWPNETSVQANGSFVRATVHDYRLVLNRNLSISENRTMQLAVLVNDSDFQGPGAGVLLLHFNVSVLPVSLHLPSTYSLSVSRRARRFAQIGKVCVENCQAFSGINVQYKLHSSGANCSTLGVVTSAEDTSGILFVNDTKALRRPKCAELHYMVVATDQQTSRQAQAQLLVTVEGSYVAEEAGCPLSCAVSKRRLECEECGGLGSPTGRCEWRQGDGKGITRNFSTCSPSTKTCPDGHCDVVETQDINICPQDCLRGSIVGGHEPGEPRGIKAGYGTCNCFPEEEKCFCEPEDIQDPLCDELCRTVIAAAVLFSFIVSVLLSAFCIHCYHKFAHKPPISSAEMTFRRPAQAFPVSYSSSGARRPSLDSMENQVSVDAFKILEDPKWEFPRKNLVLGKTLGEGEFGKVVKATAFHLKGRAGYTTVAVKMLKENASPSELRDLLSEFNVLKQVNHPHVIKLYGACSQDGPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHPDERALTMGDLISFAWQISQGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYEEDSYVKRSQGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLFNLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKRRDYLDLAASTPSDSLIYDDGLSEEETPLVDCNNAPLPRALPSTWIENKLYGMSDPNWPGESPVPLTRADGTNTGFPRYPNDSVYANWMLSPSAAKLMDTFDS,mutated_sequence,1.0,1114.0,NP_066124.1.a2m,NP_066124.1.npy,ClinVar
+NP_066192.1,NP_066192.1.csv,MASTGLELLGMTLAVLGWLGTLVSCALPLWKVTAFIGNSIVVAQVVWEGLWMSCVVQSTGQMQCKVYDSLLALPQDLQAARALCVIALLLALLGLLVAITGAQCTTCVEDEGAKARIVLTAGVILLLAGILVLIPVCWTAHAIIQDFYNPLVAEALKRELGASLYLGWAAAALLMLGGGLLCCTCPPPQVERPRGPRLGYSIPSRSGASGLDKRDYV,mutated_sequence,1.0,217.0,NP_066192.1.a2m,NP_066192.1.npy,ClinVar
+NP_066268.1,NP_066268.1.csv,MGCTLSAEERAALERSKAIEKNLKEDGISAAKDVKLLLLGAGESGKSTIVKQMKIIHEDGFSGEDVKQYKPVVYSNTIQSLAAIVRAMDTLGIEYGDKERKADAKMVCDVVSRMEDTEPFSAELLSAMMRLWGDSGIQECFNRSREYQLNDSAKYYLDSLDRIGAADYQPTEQDILRTRVKTTGIVETHFTFKNLHFRLFDVGGQRSERKKWIHCFEDVTAIIFCVALSGYDQVLHEDETTNRMHESLMLFDSICNNKFFIDTSIILFLNKKDLFGEKIKKSPLTICFPEYTGPNTYEDAAAYIQAQFESKNRSPNKEIYCHMTCATDTNNIQVVFDAVTDIIIANNLRGCGLY,mutated_sequence,1.0,354.0,NP_066268.1.a2m,NP_066268.1.npy,ClinVar
+NP_066285.1,NP_066285.1.csv,MAMVVSTWRDPQDEVPGSQGSQASQAPPVPGPPPGAPHTPQTPGQGGPASTPAQTAAGGQGGPGGPGSDKQQQQQHIECVVCGDKSSGKHYGQFTCEGCKSFFKRSVRRNLSYTCRANRNCPIDQHHRNQCQYCRLKKCLKVGMRREAVQRGRMPPTQPTHGQFALTNGDPLNCHSYLSGYISLLLRAEPYPTSRFGSQCMQPNNIMGIENICELAARMLFSAVEWARNIPFFPDLQITDQVALLRLTWSELFVLNAAQCSMPLHVAPLLAAAGLHASPMSADRVVAFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTSDACGLSDVAHVESLQEKSQCALEEYVRSQYPNQPTRFGKLLLRLPSLRTVSSSVIEQLFFVRLVGKTPIETLIRDMLLSGSSFNWPYMAIQ,mutated_sequence,1.0,414.0,NP_066285.1.a2m,NP_066285.1.npy,ClinVar
+NP_066288.2,NP_066288.2.csv,MEDSDSAAKQLGLAEAAAVAAAAAVAAAAAAAAGGEAEEPVLSRDEDSEEDADSEAERETPRVTAVAVMAAEPGHMDMGAEALPGPDEAAAAAAFAEVTTVTVANVGAAADNVFTTSVANAASISGHVLSGRTALQIGDSLNTEKATLIVVHTDGSIVETTGLKGPAAPLTPGPQSPPTPLAPGQEKGGTKYNWDPSVYDSELPVRCRNISGTLYKNRLGSGGRGRCIKQGENWYSPTEFEAMAGRASSKDWKRSIRYAGRPLQCLIQDGILNPHAASCTCAACCDDMTLSGPVRLFVPYKRRKKENELPTTPVKKDSPKNITLLPATAATTFTVTPSGQITTSGALTFDRASTVEATAVISESPAQGDVFAGATVQEASVQPPCRASHPEPHYPGYQDSCQIAPFPEAALPTSHPKIVLTSLPALAVPPPTPTKAAPPALVNGLELSEPRSWLYLEEMVNSLLNTAQQLKTLFEQAKHASTYREAATNQAKIHADAERKEQSCVNCGREAMSECTGCHKVNYCSTFCQRKDWKDHQHICGQSAAVTVQADEVHVAESVMEKVTV,mutated_sequence,1.0,565.0,NP_066288.2.a2m,NP_066288.2.npy,ClinVar
+NP_067075.1,NP_067075.1.csv,MDEDVLTTLKILIIGESGVGKSSLLLRFTDDTFDPELAATIGVDFKVKTISVDGNKAKLAIWDTAGQERFRTLTPSYYRGAQGVILVYDVTRRDTFVKLDNWLNELETYCTRNDIVNMLVGNKIDKENREVDRNEGLKFARKHSMLFIEASAKTCDGVQCAFEELVEKIIQTPGLWESENQNKGVKLSHREEGQGGGACGGYCSVL,mutated_sequence,1.0,206.0,NP_067075.1.a2m,NP_067075.1.npy,ClinVar
+NP_067627.3,NP_067627.3.csv,METPLQFQRGFFPEQPPPPPRSSHLHCQQQQQSQDKPCPPFAPLPHPHHHPHLAHQQPASGGSSPCLRCNSCASSGAPAAGAGDNLSLLLRTSSPGGAFRTRTSSPLSGSSCCCCCCSSRRGSQLNVSELTPSSHASALRQQYAQQSAQQSASASQYHQCHSLQPAASPTGSLGSLGSGPPLSHHHHHPHPAHHQHHQPQARRESNPFTEIAMSSCRYNGGVMRPLSNLSASRRNLHEMDSEAQPLQPPASVGGGGGASSPSAAAAAAAAVSSSAPEIVVSKPEHNNSNNLALYGTGGGGSTGGGGGGGGSGHGSSSGTKSSKKKNQNIGYKLGHRRALFEKRKRLSDYALIFGMFGIVVMVIETELSWGAYDKASLYSLALKCLISLSTIILLGLIIVYHAREIQLFMVDNGADDWRIAMTYERIFFICLEILVCAIHPIPGNYTFTWTARLAFSYAPSTTTADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFVMKTLMTICPGTVLLVFSISLWIIAAWTVRACERYHDQQDVTSNFLGAMWLISITFLSIGYGDMVPNTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVHNFMMDTQLTKRVKNAAANVLRETWLIYKNTKLVKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLNDQANTLVDLAKTQNIMYDMISDLNERSEDFEKRIVTLETKLETLIGSIHALPGLISQTIRQQQRDFIEAQMESYDKHVTYNAERSRSSSRRRRSSSTAPPTSSESS,mutated_sequence,1.0,791.0,NP_067627.3.a2m,NP_067627.3.npy,ClinVar
+NP_067638.3,NP_067638.3.csv,MADSSEGPRAGPGEVAELPGDESGTPGGEAFPLSSLANLFEGEDGSLSPSPADASRPAGPGDGRPNLRMKFQGAFRKGVPNPIDLLESTLYESSVVPGPKKAPMDSLFDYGTYRHHSSDNKRWRKKIIEKQPQSPKAPAPQPPPILKVFNRPILFDIVSRGSTADLDGLLPFLLTHKKRLTDEEFREPSTGKTCLPKALLNLSNGRNDTIPVLLDIAERTGNMREFINSPFRDIYYRGQTALHIAIERRCKHYVELLVAQGADVHAQARGRFFQPKDEGGYFYFGELPLSLAACTNQPHIVNYLTENPHKKADMRRQDSRGNTVLHALVAIADNTRENTKFVTKMYDLLLLKCARLFPDSNLEAVLNNDGLSPLMMAAKTGKIGIFQHIIRREVTDEDTRHLSRKFKDWAYGPVYSSLYDLSSLDTCGEEASVLEILVYNSKIENRHEMLAVEPINELLRDKWRKFGAVSFYINVVSYLCAMVIFTLTAYYQPLEGTPPYPYRTTVDYLRLAGEVITLFTGVLFFFTNIKDLFMKKCPGVNSLFIDGSFQLLYFIYSVLVIVSAALYLAGIEAYLAVMVFALVLGWMNALYFTRGLKLTGTYSIMIQKILFKDLFRFLLVYLLFMIGYASALVSLLNPCANMKVCNEDQTNCTVPTYPSCRDSETFSTFLLDLFKLTIGMGDLEMLSSTKYPVVFIILLVTYIILTFVLLLNMLIALMGETVGQVSKESKHIWKLQWATTILDIERSFPVFLRKAFRSGEMVTVGKSSDGTPDRRWCFRVDEVNWSHWNQNLGIINEDPGKNETYQYYGFSHTVGRLRRDRWSSVVPRVVELNKNSNPDEVVVPLDSMGNPRCDGHQQGYPRKWRTDDAPL,mutated_sequence,1.0,871.0,NP_067638.3.a2m,NP_067638.3.npy,ClinVar
+NP_067641.2,NP_067641.2.csv,MAVYRLCVTTGPYLRAGTLDNISVTLVGTCGESPKQRLDRMGRDFAPGSVQKYKVRCTAELGELLLLRVHKERYAFFRKDSWYCSRICVTEPDGSVSHFPCYQWIEGYCTVELRPGTARTICQDSLPLLLDHRTRELRARQECYRWKIYAPGFPCMVDVNSFQEMESDKKFALTKTTTCVDQGDSSGNRYLPGFPMKIDIPSLMYMEPNVRYSATKTISLLFNAIPASLGMKLRGLLDRKGSWKKLDDMQNIFWCHKTFTTKYVTEHWCEDHFFGYQYLNGVNPVMLHCISSLPSKLPVTNDMVAPLLGQDTCLQTELERGNIFLADYWILAEAPTHCLNGRQQYVAAPLCLLWLSPQGALVPLAIQLSQTPGPDSPIFLPTDSEWDWLLAKTWVRNSEFLVHENNTHFLCTHLLCEAFAMATLRQLPLCHPIYKLLLPHTRYTLQVNTIARATLLNPEGLVDQVTSIGRQGLIYLMSTGLAHFTYTNFCLPDSLRARGVLAIPNYHYRDDGLKIWAAIESFVSEIVGYYYPSDASVQQDSELQAWTGEIFAQAFLGRESSGFPSRLCTPGEMVKFLTAIIFNCSAQHAAVNSGQHDFGAWMPNAPSSMRQPPPQTKGTTTLKTYLDTLPEVNISCNNLLLFWLVSQEPKDQRPLGTYPDEHFTEEAPRRSIAAFQSRLAQISRDIQERNQGLALPYTYLDPPLIENSVSI,mutated_sequence,1.0,711.0,NP_067641.2.a2m,NP_067641.2.npy,ClinVar
+NP_068741.1,NP_068741.1.csv,MATPDAGLPGAEGVEPAPWAQLEAPARLLLQALQAGPEGARRGLGVLRALGSRGWEPFDWGRLLEALCREEPVVQGPDGRLELKPLLLRLPRICQRNLMSLLMAVRPSLPESGLLSVLQIAQQDLAPDPDAWLRALGELLRRDLGVGTSMEGASPLSERCQRQLQSLCRGLGLGGRRLKSPQAPDPEEEENRDSQQPGKRRKDSEEEAASPEGKRVPKRLRCWEEEEDHEKERPEHKSLESLADGGSASPIKDQPVMAVKTGEDGSNLDDAKGLAESLELPKAIQDQLPRLQQLLKTLEEGLEGLEDAPPVELQLLHECSPSQMDLLCAQLQLPQLSDLGLLRLCTWLLALSPDLSLSNATVLTRSLFLGRILSLTSSASRLLTTALTSFCAKYTYPVCSALLDPVLQAPGTGPAQTELLCCLVKMESLEPDAQVLMLGQILELPWKEETFLVLQSLLERQVEMTPEKFSVLMEKLCKKGLAATTSMAYAKLMLTVMTKYQANITETQRLGLAMALEPNTTFLRKSLKAALKHLGP,mutated_sequence,1.0,536.0,NP_068741.1.a2m,NP_068741.1.npy,ClinVar
+NP_068758.3,NP_068758.3.csv,MFPAGPPSHSLLRLPLLQLLLLVVQAVGRGLGRASPAGGPLEDVVIERYHIPRACPREVQMGDFVRYHYNGTFEDGKKFDSSYDRNTLVAIVVGVGRLITGMDRGLMGMCVNERRRLIVPPHLGYGSIGLAGLIPPDATLYFDVVLLDVWNKEDTVQVSTLLRPPHCPRMVQDGDFVRYHYNGTLLDGTSFDTSYSKGGTYDTYVGSGWLIKGMDQGLLGMCPGERRKIIIPPFLAYGEKGYGTVIPPQASLVFHVLLIDVHNPKDAVQLETLELPPGCVRRAGAGDFMRYHYNGSLMDGTLFDSSYSRNHTYNTYIGQGYIIPGMDQGLQGACMGERRRITIPPHLAYGENGTGDKIPGSAVLIFNVHVIDFHNPADVVEIRTLSRPSETCNETTKLGDFVRYHYNCSLLDGTQLFTSHDYGAPQEATLGANKVIEGLDTGLQGMCVGERRQLIVPPHLAHGESGARGVPGSAVLLFEVELVSREDGLPTGYLFVWHKDPPANLFEDMDLNKDGEVPPEEFSTFIKAQVSEGKGRLMPGQDPEKTIGDMFQNQDRNQDGKITVDELKLKSDEDEERVHEEL,mutated_sequence,1.0,582.0,NP_068758.3.a2m,NP_068758.3.npy,ClinVar
+NP_071372.1,NP_071372.1.csv,MSADSSPLVGSTPTGYGTLTIGTSIDPLSSSVSSVRLSGYCGSPWRVIGYHVVVWMMAGIPLLLFRWKPLWGVRLRLRPCNLAHAETLVIEIRDKEDSSWQLFTVQVQTEAIGEGSLEPSPQSQAEDGRSQAAVGAVPEGAWKDTAQLHKSEEAVSVGQKRVLRYYLFQGQRYIWIETQQAFYQVSLLDHGRSCDDVHRSRHGLSLQDQMVRKAIYGPNVISIPVKSYPQLLVDEALNPYYGFQAFSIALWLADHYYWYALCIFLISSISICLSLYKTRKQSQTLRDMVKLSMRVCVCRPGGEEEWVDSSELVPGDCLVLPQEGGLMPCDAALVAGECMVNESSLTGESIPVLKTALPEGLGPYCAETHRRHTLFCGTLILQARAYVGPHVLAVVTRTGFCTAKGGLVSSILHPRPINFKFYKHSMKFVAALSVLALLGTIYSIFILYRNRVPLNEIVIRALDLVTVVVPPALPAAMTVCTLYAQSRLRRQGIFCIHPLRINLGGKLQLVCFDKTGTLTEDGLDVMGVVPLKGQAFLPLVPEPRRLPVGPLLRALATCHALSRLQDTPVGDPMDLKMVESTGWVLEEEPAADSAFGTQVLAVMRPPLWEPQLQAMEEPPVPVSVLHRFPFSSALQRMSVVVAWPGATQPEAYVKGSPELVAGLCNPETVPTDFAQMLQSYTAAGYRVVALASKPLPTVPSLEAAQQLTRDTVEGDLSLLGLLVMRNLLKPQTTPVIQALRRTRIRAVMVTGDNLQTAVTVARGCGMVAPQEHLIIVHATHPERGQPASLEFLPMESPTAVNGVKDPDQAASYTVEPDPRSRHLALSGPTFGIIVKHFPKLLPKVLVQGTVFARMAPEQKTELVCELQKLQYCVGMCGDGANDCGALKAADVGISLSQAEASVVSPFTSSMASIECVPMVIREGRCSLDTSFSVFKYMALYSLTQFISVLILYTINTNLGDLQFLAIDLVITTTVAVLMSRTGPALVLGRVRPPGALLSVPVLSSLLLQMVLVTGVQLGGYFLTLAQPWFVPLNRTVAAPDNLPNYENTVVFSLSSFQYLILAAAVSKGAPFRRPLYTNVPFLVALALLSSVLVGLVLVPGLLQGPLALRNITDTGFKLLLLGLVTLNFVGAFMLESVLDQCLPACLRRLRPKRASKKRFKQLERELAEQPWPPLPAGPLR,mutated_sequence,1.0,1180.0,NP_071372.1.a2m,NP_071372.1.npy,ClinVar
+NP_071384.1,NP_071384.1.csv,MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVIDAAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVVKFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQNAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK,mutated_sequence,1.0,222.0,NP_071384.1.a2m,NP_071384.1.npy,ClinVar
+NP_071407.4,NP_071407.4.csv,MGRHVATSCHVAWLLVLISGCWGQVNRLPFFTNHFFDTYLLISEDTPVGSSVTQLLAQDMDNDPLVFGVSGEEASRFFAVEPDTGVVWLRQPLDRETKSEFTVEFSVSDHQGVITRKVNIQVGDVNDNAPTFHNQPYSVRIPENTPVGTPIFIVNATDPDLGAGGSVLYSFQPPSQFFAIDSARGIVTVIRELDYETTQAYQLTVNATDQDKTRPLSTLANLAIIITDVQDMDPIFINLPYSTNIYEHSPPGTTVRIITAIDQDKGRPRGIGYTIVSGNTNSIFALDYISGVLTLNGLLDRENPLYSHGFILTVKGTELNDDRTPSDATVTTTFNILVIDINDNAPEFNSSEYSVAITELAQVGFALPLFIQVVDKDENLGLNSMFEVYLVGNNSHHFIISPTSVQGKADIRIRVAIPLDYETVDRYDFDLFANESVPDHVGYAKVKITLINENDNRPIFSQPLYNISLYENVTVGTSVLTVLATDNDAGTFGEVSYFFSDDPDRFSLDKDTGLIMLIARLDYELIQRFTLTIIARDGGGEETTGRVRINVLDVNDNVPTFQKDAYVGALRENEPSVTQLVRLRATDEDSPPNNQITYSIVSASAFGSYFDISLYEGYGVISVSRPLDYEQISNGLIYLTVMAMDAGNPPLNSTVPVTIEVFDENDNPPTFSKPAYFVSVVENIMAGATVLFLNATDLDRSREYGQESIIYSLEGSTQFRINARSGEITTTSLLDRETKSEYILIVRAVDGGVGHNQKTGIATVNITLLDINDNHPTWKDAPYYINLVEMTPPDSDVTTVVAVDPDLGENGTLVYSIQPPNKFYSLNSTTGKIRTTHAMLDRENPDPHEAELMRKIVVSVTDCGRPPLKATSSATVFVNLLDLNDNDPTFQNLPFVAEVLEGIPAGVSIYQVVAIDLDEGLNGLVSYRMPVGMPRMDFLINSSSGVVVTTTELDRERIAEYQLRVVASDAGTPTKSSTSTLTIHVLDVNDETPTFFPAVYNVSVSEDVPREFRVVWLNCTDNDVGLNAELSYFITGGNVDGKFSVGYRDAVVRTVVGLDRETTAAYMLILEAIDNGPVGKRHTGTATVFVTVLDVNDNRPIFLQSSYEASVPEDIPEGHSILQLKATDADEGEFGRVWYRILHGNHGNNFRIHVSNGLLMRGPRPLDRERNSSHVLIVEAYNHDLGPMRSSVRVIVYVEDINDEAPVFTQQQYSRLGLRETAGIGTSVIVVQATDRDSGDGGLVNYRILSGAEGKFEIDESTGLIITVNYLDYETKTSYMMNVSATDQAPPFNQGFCSVYITLLNELDEAVQFSNASYEAAILENLALGTEIVRVQAYSIDNLNQITYRFNAYTSTQAKALFKIDAITGVITVQGLVDREKGDFYTLTVVADDGGPKVDSTVKVYITVLDENDNSPRFDFTSDSAVSIPEDCPVGQRVATVKAWDPDAGSNGQVVFSLASGNIAGAFEIVTTNDSIGEVFVARPLDREELDHYILQVVASDRGTPPRKKDHILQVTILDINDNPPVIESPFGYNVSVNENVGGGTAVVQVRATDRDIGINSVLSYYITEGNKDMAFRMDRISGEIATRPAPPDRERQSFYHLVATVEDEGTPTLSATTHVYVTIVDENDNAPMFQQPHYEVLLDEGPDTLNTSLITIQALDLDEGPNGTVTYAIVAGNIVNTFRIDRHMGVITAAKELDYEISHGRYTLIVTATDQCPILSHRLTSTTTVLVNVNDINDNVPTFPRDYEGPFEVTEGQPGPRVWTFLAHDRDSGPNGQVEYSIMDGDPLGEFVISPVEGVLRVRKDVELDRETIAFYNLTICARDRGMPPLSSTMLVGIRVLDINDNDPVLLNLPMNITISENSPVSSFVAHVLASDADSGCNARLTFNITAGNRERAFFINATTGIVTVNRPLDRERIPEYKLTISVKDNPENPRIARRDYDLLLIFLSDENDNHPLFTKSTYQAEVMENSPAGTPLTVLNGPILALDADQDIYAVVTYQLLGAQSGLFDINSSTGVVTVRSGVIIDREAFSPPILELLLLAEDIGLLNSTAHLLITILDDNDNRPTFSPATLTVHLLENCPPGFSVLQVTATDEDSGLNGELVYRIEAGAQDRFLIHLVTGVIRVGNATIDREEQESYRLTVVATDRGTVPLSGTAIVTILIDDINDSRPEFLNPIQTVSVLESAEPGTVIANITAIDHDLNPKLEYHIVGIVAKDDTDRLVPNQEDAFAVNINTGSVMVKSPMNRELVATYEVTLSVIDNASDLPERSVSVPNAKLTVNVLDVNDNTPQFKPFGITYYMERILEGATPGTTLIAVAAVDPDKGLNGLVTYTLLDLVPPGYVQLEDSSAGKVIANRTVDYEEVHWLNFTVRASDNGSPPRAAEIPVYLEIVDINDNNPIFDQPSYQEAVFEDVPVGTIILTVTATDADSGNFALIEYSLGDGESKFAINPTTGDIYVLSSLDREKKDHYILTALAKDNPGDVASNRRENSVQVVIQVLDVNDCRPQFSKPQFSTSVYENEPAGTSVITMMATDQDEGPNGELTYSLEGPGVEAFHVDMDSGLVTTQRPLQSYEKFSLTVVATDGGEPPLWGTTMLLVEVIDVNDNRPVFVRPPNGTILHIREEIPLRSNVYEVYATDKDEGLNGAVRYSFLKTAGNRDWEFFIIDPISGLIQTAQRLDRESQAVYSLILVASDLGQPVPYETMQPLQVALEDIDDNEPLFVRPPKGSPQYQLLTVPEHSPRGTLVGNVTGAVDADEGPNAIVYYFIAAGNEEKNFHLQPDGCLLVLRDLDREREAIFSFIVKASSNRSWTPPRGPSPTLDLVADLTLQEVRVVLEDINDQPPRFTKAEYTAGVATDAKVGSELIQVLALDADIGNNSLVFYSILAIHYFRALANDSEDVGQVFTMGSMDGILRTFDLFMAYSPGYFVVDIVARDLAGHNDTAIIGIYILRDDQRVKIVINEIPDRVRGFEEEFIHLLSNITGAIVNTDNVQFHVDKKGRVNFAQTELLIHVVNRDTNRILDVDRVIQMIDENKEQLRNLFRNYNVLDVQPAISVRLPDDMSALQMAIIVLAILLFLAAMLFVLMNWYYRTVHKRKLKAIVAGSAGNRGFIDIMDMPNTNKYSFDGANPVWLDPFCRNLELAAQAEHEDDLPENLSEIADLWNSPTRTHGTFGREPAAVKPDDDRYLRAAIQEYDNIAKLGQIIREGPIKGSLLKVVLEDYLRLKKLFAQRMVQKASSCHSSISELIQTELDEEPGDHSPGQGSLRFRHKPPVELKGPDGIHVVHGSTGTLLATDLNSLPEEDQKGLGRSLETLTAAEATAFERNARTESAKSTPLHKLRDVIMETPLEITEL,mutated_sequence,1.0,3354.0,NP_071407.4.a2m,NP_071407.4.npy,ClinVar
+NP_071415.1,NP_071415.1.csv,MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERVEHIKLGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQFAGYQLYDNEEVPGGGIITGIGRVSGVECMIIANDATVKGGAYYPVTVKKQLRAQEIAMQNRLPCIYLVDSGGAYLPRQADVFPDRDHFGRTFYNQAIMSSKNIAQIAVVMGSCTAGGAYVPAMADENIIVRKQGTIFLAGPPLVKAATGEEVSAEDLGGADLHCRKSGVSDHWALDDHHALHLTRKVVRNLNYQKKLDVTIEPSEEPLFPADELYGIVGANLKRSFDVREVIARIVDGSRFTEFKAFYGDTLVTGFARIFGYPVGIVGNNGVLFSESAKKGTHFVQLCCQRNIPLLFLQNITGFMVGREYEAEGIAKDGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQAANVLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTRLVLGLSFSAALNAPIEKTDFGIFRM,mutated_sequence,1.0,563.0,NP_071415.1.a2m,NP_071415.1.npy,ClinVar
+NP_071900.2,NP_071900.2.csv,MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNAYGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGGPTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQPVTEDESIEEIFEETQTNATCNYETKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQTETQKNKQRNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTLGNMLELPGTSSSSTSQELPFCQPKKKSTPLKYEVGDLIWAKFKRRPWWPCRICSDPLINTHSKMKVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFEGRHQFEELPVLRRRGKQKEKGYRHKVPQKILSKWEASVGLAEQYDVPKGSKNRKCIPGSIKLDSEEDMPFEDCTNDPESEHDLLLNGCLKSLAFDSEHSADEKEKPCAKSRARKSSDNPKRTSVKKGHIQFEAHKDERRGKIPENLGLNFISGDISDTQASNELSRIANSLTGSNTAPGSFLFSSCGKNTAKKEFETSNGDSLLGLPEGALISKCSREKNKPQRSLVCGSKVKLCYIGAGDEEKRSDSISICTTSDDGSSDLDPIEHSSESDNSVLEIPDAFDRTENMLSMQKNEKIKYSRFAATNTRVKAKQKPLISNSHTDHLMGCTKSAEPGTETSQVNLSDLKASTLVHKPQSDFTNDALSPKFNLSSSISSENSLIKGGAANQALLHSKSKQPKFRSIKCKHKENPVMAEPPVINEECSLKCCSSDTKGSPLASISKSGKVDGLKLLNNMHEKTRDSSDIETAVVKHVLSELKELSYRSLGEDVSDSGTSKPSKPLLFSSASSQNHIPIEPDYKFSTLLMMLKDMHDSKTKEQRLMTAQNLVSYRSPGRGDCSTNSPVGVSKVLVSGGSTHNSEKKGDGTQNSANPSPSGGDSALSGELSASLPGLLSDKRDLPASGKSRSDCVTRRNCGRSKPSSKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDAVLQGDRERGGSLRGGAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGKISEKGLSFENGKGPELDSVMNSENDELNGVNQVVPKKRWQRLNQRRTKPRKRMNRFKEKENSECAFRVLLPSDPVQEGRDEFPEHRTPSASILEEPLTEQNHADCLDSAGPRLNVCDKSSASIGDMEKEPGIPSLTPQAELPEPAVRSEKKRLRKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSSRCEEESLLARGRSSAQNKQVDENSLISTKEEPPVLEREAPFLEGPLAQSELGGGHAELPQLTLSVPVAPEVSPRPALESEELLVKTPGNYESKRQRKPTKKLLESNDLDPGFMPKKGDLGLSKKCYEAGHLENGITESCATSYSKDFGGGTTKIFDKPRKRKRQRHAAAKMQCKKVKNDDSSKEIPGSEGELMPHRTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSKILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPEGNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFGSNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQEDRKNDKKPPPYKHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPIATEEKSKKFKKKQQGKRRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQCDICGKEAASFCEMCPSSFCKQHREGMLFISKLDGRLSCTEHDPCGPNPLEPGEIREYVPPPVPLPPGPSTHLAEQSTGMAAQAPKMSDKPPADTNQMLSLSKKALAGTCQRPLLPERPLERTDSRPQPLDKVRDLAGSGTKSQSLVSSQRPLDRPPAVAGPRPQLSDKPSPVTSPSSSPSVRSQPLERPLGTADPRLDKSIGAASPRPQSLEKTSVPTGLRLPPPDRLLITSSPKPQTSDRPTDKPHASLSQRLPPPEKVLSAVVQTLVAKEKALRPVDQNTQSKNRAALVMDLIDLTPRQKERAASPHQVTPQADEKMPVLESSSWPASKGLGHMPRAVEKGCVSDPLQTSGKAAAPSEDPWQAVKSLTQARLLSQPPAKAFLYEPTTQASGRASAGAEQTPGPLSQSPGLVKQAKQMVGGQQLPALAAKSGQSFRSLGKAPASLPTEEKKLVTTEQSPWALGKASSRAGLWPIVAGQTLAQSCWSAGSTQTLAQTCWSLGRGQDPKPEQNTLPALNQAPSSHKCAESEQK,mutated_sequence,1.0,2696.0,NP_071900.2.a2m,NP_071900.2.npy,ClinVar
+NP_071908.2,NP_071908.2.csv,MSGFLEELLGEKLVTGGGEEVDVHSLGARGISLLGLYFGCSLSAPCAQLSASLAAFYGRLRGDAAAGPGPGAGAGAAAEPEPRRRLEIVFVSSDQDQRQWQDFVRDMPWLALPYKEKHRKLKLWNKYRISNIPSLIFLDATTGKVVCRNGLLVIRDDPEGLEFPWGPKPFREVIAGPLLRNNGQSLESSSLEGSHVGVYFSAHWCPPCRSLTRVLVESYRKIKEAGQNFEIIFVSADRSEESFKQYFSEMPWLAVPYTDEARRSRLNRLYGIQGIPTLIMLDPQGEVITRQGRVEVLNDEDCREFPWHPKPVLELSDSNAAQLNEGPCLVLFVDSEDDGESEAAKQLIQPIAEKIIAKYKAKEEEAPLLFFVAGEDDMTDSLRDYTNLPEAAPLLTILDMSARAKYVMDVEEITPAIVEAFVNDFLAEKLKPEPI,mutated_sequence,1.0,435.0,NP_071908.2.a2m,NP_071908.2.npy,ClinVar
+NP_071934.3,NP_071934.3.csv,MSVKEGAQRKWAALKEKLGPQDSDPTEANLESADPELCIRLLQMPSVVNYSGLRKRLEGSDGGWMVQFLEQSGLDLLLEALARLSGRGVARISDALLQLTCVSCVRAVMNSRQGIEYILSNQGYVRQLSQALDTSNVMVKKQVFELLAALCIYSPEGHVLTLDALDHYKTVCSQQYRFSIVMNELSGSDNVPYVVTLLSVINAVILGPEDLRARTQLRNEFIGLQLLDVLARLRDLEDADLLIQLEAFEEAKAEDEEELLRVSGGVDMSSHQEVFASLFHKVSCSPVSAQLLSVLQGLLHLEPTLRSSQLLWEALESLVNRAVLLASDAQECTLEEVVERLLSVKGRPRPSPLVKAHKSVQANLDQSQRGSSPQNTTTPKPSVEGQQPAAAAACEPVDHAQSESILKVSQPRALEQQASTPPPPPPPPLLPGSSAEPPPPPPPPPLPSVGAKALPTAPPPPPLPGLGAMAPPAPPLPPPLPGSCEFLPPPPPPLPGLGCPPPPPPLLPGMGWGPPPPPPPLLPCTCSPPVAGGMEEVIVAQVDHGLGSAWVPSHRRVNPPTLRMKKLNWQKLPSNVAREHNSMWASLSSPDAEAVEPDFSSIERLFSFPAAKPKEPTMVAPRARKEPKEITFLDAKKSLNLNIFLKQFKCSNEEVAAMIRAGDTTKFDVEVLKQLLKLLPEKHEIENLRAFTEERAKLASADHFYLLLLAIPCYQLRIECMLLCEGAAAVLDMVRPKAQLVLAACESLLTSRQLPIFCQLILRIGNFLNYGSHTGDADGFKISTLLKLTETKSQQNRVTLLHHVLEEAEKSHPDLLQLPRDLEQPSQAAGINLEIIRSEASSNLKKLLETERKVSASVAEVQEQYTERLQASISAFRALDELFEAIEQKQRELADYLCEDAQQLSLEDTFSTMKAFRDLFLRALKENKDRKEQAAKAERRKQQLAEEEARRPRGEDGKPVRKGPGKQEEVCVIDALLADIRKGFQLRKTARGRGDTDGGSKAASMDPPRATEPVATSNPAGDPVGSTRCPASEPGLDATTASESRGWDLVDAVTPGPQPTLEQLEEGGPRPLERRSSWYVDASDVLTTEDPQCPQPLEGAWPVTLGDAQALKPLKFSSNQPPAAGSSRQDAKDPTSLLGVLQAEADSTSEGLEDAVHSRGARPPAAGPGGDEDEDEEDTAPESALDTSLDKSFSEDAVTDSSGSGTLPRARGRASKGTGKRRKKRPSRSQEEVPPDSDDNKTKKLCVIQ,mutated_sequence,1.0,1249.0,NP_071934.3.a2m,NP_071934.3.npy,ClinVar
+NP_072046.2,NP_072046.2.csv,MPAMPSSGPGDTSSSAAEREEDRKDGEEQEEPRGKEERQEPSTTARKVGRPGRKRKHPPVESGDTPKDPAVISKSPSMAQDSGASELLPNGDLEKRSEPQPEEGSPAGGQKGGAPAEGEGAAETLPEASRAVENGCCTPKEGRGAPAEAGKEQKETNIESMKMEGSRGRLRGGLGWESSLRQRPMPRLTFQAGDPYYISKRKRDEWLARWKREAEKKAKVIAGMNAVEENQGPGESQKVEEASPPAVQQPTDPASPTVATTPEPVGSDAGDKNATKAGDDEPEYEDGRGFGIGELVWGKLRGFSWWPGRIVSWWMTGRSRAAEGTRWVMWFGDGKFSVVCVEKLMPLSSFCSAFHQATYNKQPMYRKAIYEVLQVASSRAGKLFPVCHDSDESDTAKAVEVQNKPMIEWALGGFQPSGPKGLEPPEEEKNPYKEVYTDMWVEPEAAAYAPPPPAKKPRKSTAEKPKVKEIIDERTRERLVYEVRQKCRNIEDICISCGSLNVTLEHPLFVGGMCQNCKNCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNNCCRCFCVECVDLLVGPGAAQAAIKEDPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNHDQEFDPPKVYPPVPAEKRKPIRVLSLFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVGMVRHQGKIMYVGDVRSVTQKHIQEWGPFDLVIGGSPCNDLSIVNPARKGLYEGTGRLFFEFYRLLHDARPKEGDDRPFFWLFENVVAMGVSDKRDISRFLESNPVMIDAKEVSAAHRARYFWGNLPGMNRPLASTVNDKLELQECLEHGRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKEDILWCTEMERVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIRHLFAPLKEYFACV,mutated_sequence,1.0,912.0,NP_072046.2.a2m,NP_072046.2.npy,ClinVar
+NP_075044.2,NP_075044.2.csv,MSRRKQGKPQHLSKREFSPEPLEAILTDDEPDHGPLGAPEGDHDLLTCGQCQMNFPLGDILIFIEHKRKQCNGSLCLEKAVDKPPSPSPIEMKKASNPVEVGIQVTPEDDDCLSTSSRGICPKQEHIADKLLHWRGLSSPRSAHGALIPTPGMSAEYAPQGICKDEPSSYTCTTCKQPFTSAWFLLQHAQNTHGLRIYLESEHGSPLTPRVGIPSGLGAECPSQPPLHGIHIADNNPFNLLRIPGSVSREASGLAEGRFPPTPPLFSPPPRHHLDPHRIERLGAEEMALATHHPSAFDRVLRLNPMAMEPPAMDFSRRLRELAGNTSSPPLSPGRPSPMQRLLQPFQPGSKPPFLATPPLPPLQSAPPPSQPPVKSKSCEFCGKTFKFQSNLVVHRRSHTGEKPYKCNLCDHACTQASKLKRHMKTHMHKSSPMTVKSDDGLSTASSPEPGTSDLVGSASSALKSVVAKFKSENDPNLIPENGDEEEEEDDEEEEEEEEEEEEELTESERVDYGFGLSLEAARHHENSSRGAVVGVGDESRALPDVMQGMVLSSMQHFSEAFHQVLGEKHKRGHLAEAEGHRDTCDEDSVAGESDRIDDGTVNGRGCSPGESASGGLSKKLLLGSPSSLSPFSKRIKLEKEFDLPPAAMPNTENVYSQWLAGYAASRQLKDPFLSFGDSRQSPFASSSEHSSENGSLRFSTPPGELDGGISGRSGTGSGGSTPHISGPGPGRPSSKEGRRSDTCEYCGKVFKNCSNLTVHRRSHTGERPYKCELCNYACAQSSKLTRHMKTHGQVGKDVYKCEICKMPFSVYSTLEKHMKKWHSDRVLNNDIKTE,mutated_sequence,1.0,835.0,NP_075044.2.a2m,NP_075044.2.npy,ClinVar
+NP_075555.1,NP_075555.1.csv,MMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGKGGGGGGGTAPEKPDPAQKPPYSYVALIAMAIRESAEKRLTLSGIYQYIIAKFPFYEKNKKGWQNSIRHNLSLNECFIKVPREGGGERKGNYWTLDPACEDMFEKGNYRRRRRMKRPFRPPPAHFQPGKGLFGAGGAAGGCGVAGAGADGYGYLAPPKYLQSGFLNNSWPLPQPPSPMPYASCQMAAAAAAAAAAAAAAGPGSPGAAAVVKGLAGPAASYGPYTRVQSMALPPGVVNSYNGLGGPPAAPPPPPHPHPHPHAHHLHAAAAPPPAPPHHGAAAPPPGQLSPASPATAAPPAPAPTSAPGLQFACARQPELAMMHCSYWDHDSKTGALHSRLDL,mutated_sequence,1.0,376.0,NP_075555.1.a2m,NP_075555.1.npy,ClinVar
+NP_075598.2,NP_075598.2.csv,MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFLVHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACVTSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNRMPVAPYWTSPEKMEKKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNGKEFKPDHRIGGYKVRYATWSIIMDSVVPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFMCKVYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFEDAGEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVGSVIVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLSSSGTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRVTKVAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDLAARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSDVWSFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSHEPLPEEPCLPRHPAQLANGGLKRR,mutated_sequence,1.0,822.0,NP_075598.2.a2m,NP_075598.2.npy,ClinVar
+NP_076872.1,NP_076872.1.csv,MDWKTLQALLSGVNKYSTAFGRIWLSVVFVFRVLVYVVAAERVWGDEQKDFDCNTKQPGCTNVCYDNYFPISNIRLWALQLIFVTCPSLLVILHVAYREERERRHRQKHGDQCAKLYDNAGKKHGGLWWTYLFSLIFKLIIEFLFLYLLHTLWHGFNMPRLVQCANVAPCPNIVDCYIARPTEKKIFTYFMVGASAVCIVLTICELCYLICHRVLRGLHKDKPRGGCSPSSSASRASTCRCHHKLVEAGEVDPDPGNNKLQASAPNLTPI,mutated_sequence,1.0,270.0,NP_076872.1.a2m,NP_076872.1.npy,ClinVar
+NP_076932.1,NP_076932.1.csv,MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM,mutated_sequence,1.0,271.0,NP_076932.1.a2m,NP_076932.1.npy,ClinVar
+NP_077277.1,NP_077277.1.csv,MRLTRCQAALAAAITLNLLVLFYVSWLQHQPRNSRARGPRRASAAGPRVTVLVREFEAFDNAVPELVDSFLQQDPAQPVVVAADTLPYPPLALPRIPNVRLALLQPALDRPAAASRPETYVATEFVALVPDGARAEAPGLLERMVEALRAGSARLVAAPVATANPARCLALNVSLREWTARYGAAPAAPRCDALDGDAVVLLRARDLFNLSAPLARPVGTSLFLQTALRGWAVQLLDLTFAAARQPPLATAHARWKAEREGRARRAALLRALGIRLVSWEGGRLEWFGCNKETTRCFGTVVGDTPAYLYEERWTPPCCLRALRETARYVVGVLEAAGVRYWLEGGSLLGAARHGDIIPWDYDVDLGIYLEDVGNCEQLRGAEAGSVVDERGFVWEKAVEGDFFRVQYSESNHLHVDLWPFYPRNGVMTKDTWLDHRQDVEFPEHFLQPLVPLPFAGFVAQAPNNYRRFLELKFGPGVIENPQYPNPALLSLTGSG,mutated_sequence,1.0,495.0,NP_077277.1.a2m,NP_077277.1.npy,ClinVar
+NP_077288.2,NP_077288.2.csv,MLFKLLQRQTYTCLSHRYGLYVCFLGVVVTIVSAFQFGEVVLEWSRDQYHVLFDSYRDNIAGKSFQNRLCLPMPIDVVYTWVNGTDLELLKELQQVREQMEEEQKAMREILGKNTTEPTKKSEKQLECLLTHCIKVPMLVLDPALPANITLKDLPSLYPSFHSASDIFNVAKPKNPSTNVSVVVFDSTKDVEDAHSGLLKGNSRQTVWRGYLTTDKEVPGLVLMQDLAFLSGFPPTFKETNQLKTKLPENLSSKVKLLQLYSEASVALLKLNNPKDFQELNKQTKKNMTIDGKELTISPAYLLWDLSAISQSKQDEDISASRFEDNEELRYSLRSIERHAPWVRNIFIVTNGQIPSWLNLDNPRVTIVTHQDVFRNLSHLPTFSSPAIESHIHRIEGLSQKFIYLNDDVMFGKDVWPDDFYSHSKGQKVYLTWPVPNCAEGCPGSWIKDGYCDKACNNSACDWDGGDCSGNSGGSRYIAGGGGTGSIGVGQPWQFGGGINSVSYCNQGCANSWLADKFCDQACNVLSCGFDAGDCGQDHFHELYKVILLPNQTHYIIPKGECLPYFSFAEVAKRGVEGAYSDNPIIRHASIANKWKTIHLIMHSGMNATTIHFNLTFQNTNDEEFKMQITVEVDTREGPKLNSTAQKGYENLVSPITLLPEAEILFEDIPKEKRFPKFKRHDVNSTRRAQEEVKIPLVNISLLPKDAQLSLNTLDLQLEHGDITLKGYNLSKSALLRSFLMNSQHAKIKNQAIITDETNDSLVAPQEKQVHKSILPNSLGVSERLQRLTFPAVSVKVNGHDQGQNPPLDLETTARFRVETHTQKTIGGNVTKEKPPSLIVPLESQMTKEKKITGKEKENSRMEENAENHIGVTEVLLGRKLQHYTDSYLGFLPWEKKKYFQDLLDEEESLKTQLAYFTDSKNTGRQLKDTFADSLRYVNKILNSKFGFTSRKVPAHMPHMIDRIVMQELQDMFPEEFDKTSFHKVRHSEDMQFAFSYFYYLMSAVQPLNISQVFDEVDTDQSGVLSDREIRTLATRIHELPLSLQDLTGLEHMLINCSKMLPADITQLNNIPPTQESYYDPNLPPVTKSLVTNCKPVTDKIHKAYKDKNKYRFEIMGEEEIAFKMIRTNVSHVVGQLDDIRKNPRKFVCLNDNIDHNHKDAQTVKAVLRDFYESMFPIPSQFELPREYRNRFLHMHELQEWRAYRDKLKFWTHCVLATLIMFTIFSFFAEQLIALKRKIFPRRRIHKEASPNRIRV,mutated_sequence,1.0,1256.0,NP_077288.2.a2m,NP_077288.2.npy,ClinVar
+NP_077744.4,NP_077744.4.csv,MDFLLLQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRGASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLGGPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPFGPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFPNHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHGVFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEKPFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL,mutated_sequence,1.0,522.0,NP_077744.4.a2m,NP_077744.4.npy,ClinVar
+NP_078853.2,NP_078853.2.csv,MGGCFCIPRERSLTRGPGKETPSKDPTVSSECIASSEYKEKCFLPQNINPDLTLSFCVKSRSRRCVNGPLQEAARRRLWALENEDQEVRMLFKDLSARLVSIQSQRAQFLITFKTMEEIWKFSTYLNLGYVSMCLEHLLFDHKYWLNCILVEDTEIQVSVDDKHLETIYLGLLIQEGHFFCRALCSVTPPAEKEGECLTLCKNELISVKMAEAGSELEGVSLVTGQRGLVLVSALEPLPLPFHQWFLKNYPGSCGLSRKRDWTGSYQIGRGRCKALTGYEPGEKDELNFYQGESIEIIGFVIPGLQWFIGKSTSSGQVGFVPTRNIDPDSYSPMSRNSAFLSDEERCSLLALGSDKQTECSSFLHTLARTDITSVYRLSGFESIQNPPNDLSASQPEGFKEVRPGRAWEEHQAVGSRQSSSSEDSSLEEELLSATSDSYRLPEPDDLDDPELLMDLSTGQEEEAENFAPILAFLDHEGYADHFKSLYDFSFSFLTSSFYSFSEEDEFVAYLEASRKWAKKSHMTWAHARLCFLLGRLSIRKVKLSQARVYFEEAIHILNGAFEDLSLVATLYINLAAIYLKQRLRHKGSALLEKAGALLACLPDRESSAKHELDVVAYVLRQGIVVGSSPLEARACFLAIRLLLSLGRHEEVLPFAERLQLLSGHPPASEAVASVLSFLYDKKYLPHLAVASVQQHGIQSAQGMSLPIWQVHLVLQNTTKLLGFPSPGWGEVSALACPMLRQALAACEELADRSTQRALCLILSKVYLEHRSPDGAIHYLSQALVLGQLLGEQESFESSLCLAWAYLLASQAKKALDVLEPLLCSLKETESLTQRGVIYNLLGLALQGEGRVNRAAKSYLRALNRAQEVGDVHNQAVAMANLGHLSLKSWAQHPARNYLLQAVRLYCELQASKETDMELVQVFLWLAQVLVSGHQLTHGLLCYEMALLFGLRHRHLKSQLQATKSLCHFYSSVSPNPEACITYHEHWLALAQQLRDREMEGRLLESLGQLYRNLNTARSLRRSLTCIKESLRIFIDLGETDKAAEAWLGAGRLHYLMQEDELVELCLQAAIQTALKSEEPLLALKLYEEAGDVFFNGTRHRHHAVEYYRAGAVPLARRLKAVRTELRIFNKLTELQISLEGYEKALEFATLAARLSTVTGDQRQELVAFHRLATVYYSLHMYEMAEDCYLKTLSLCPPWLQSPKEALYYAKVYYRLGRLTFCQLKDAHDATEYFLLALAAAVLLGDEELQDTIRSRLDNICQSPLWHSRPSGCSSERARWLSGGGLAL,mutated_sequence,1.0,1288.0,NP_078853.2.a2m,NP_078853.2.npy,ClinVar
+NP_078916.3,NP_078916.3.csv,MSPARRCRGMRAAVAASVGLSEGPAGSRSGRLFRPPSPAPAAPGARLLRLPGSGAVQAASPERAGWTEALRAAVAELRAGAVVAVPTDTLYGLACAASCSAALRAVYRLKGRSEAKPLAVCLGRVADVYRYCRVRVPEGLLKDLLPGPVTLVMERSEELNKDLNPFTPLVGIRIPDHAFMQDLAQMFEGPLALTSANLSSQASSLNVEEFQDLWPQLSLVIDGGQIGDGQSPECRLGSTVVDLSVPGKFGIIRPGCALESTTAILQQKYGLLPSHASYL,mutated_sequence,1.0,279.0,NP_078916.3.a2m,NP_078916.3.npy,ClinVar
+NP_078941.2,NP_078941.2.csv,MSISSDEVNFLVYRYLQESGFSHSAFTFGIESHISQSNINGALVPPAALISIIQKGLQYVEAEVSINEDGTLFDGRPIESLSLIDAVMPDVVQTRQQAYRDKLAQQQAAAAAAAAAAASQQGSAKNGENTANGEENGAHTIANNHTDMMEVDGDVEIPPNKAVVLRGHESEVFICAWNPVSDLLASGSGDSTARIWNLSENSTSGSTQLVLRHCIREGGQDVPSNKDVTSLDWNSEGTLLATGSYDGFARIWTKDGNLASTLGQHKGPIFALKWNKKGNFILSAGVDKTTIIWDAHTGEAKQQFPFHSAPALDVDWQSNNTFASCSTDMCIHVCKLGQDRPIKTFQGHTNEVNAIKWDPTGNLLASCSDDMTLKIWSMKQDNCVHDLQAHNKEIYTIKWSPTGPGTNNPNANLMLASASFDSTVRLWDVDRGICIHTLTKHQEPVYSVAFSPDGRYLASGSFDKCVHIWNTQTGALVHSYRGTGGIFEVCWNAAGDKVGASASDGSVCVLDLRK,mutated_sequence,1.0,514.0,NP_078941.2.a2m,NP_078941.2.npy,ClinVar
+NP_078961.3,NP_078961.3.csv,MLSSMAAAGSVKAALQVAEVLEAIVSCCVGPEGRQVLCTKPTGEVLLSRNGGRLLEALHLEHPIARMIVDCVSSHLKKTGDGAKTFIIFLCHLLRGLHAITDREKDPLMCENIQTHGRHWKNCSRWKFISQALLTFQTQILDGIMDQYLSRHFLSIFSSAKERTLCRSSLELLLEAYFCGRVGRNNHKFISQLMCDYFFKCMTCKSGIGVFELVDDHFVELNVGVTGLPVSDSRIIAGLVLQKDFSVYRPADGDMRMVIVTETIQPLFSTSGSEFILNSEAQFQTSQFWIMEKTKAIMKHLHSQNVKLLISSVKQPDLVSYYAGVNGISVVECLSSEEVSLIRRIIGLSPFVPPQAFSQCEIPNTALVKFCKPLILRSKRYVHLGLISTCAFIPHSIVLCGPVHGLIEQHEDALHGALKMLRQLFKDLDLNYMTQTNDQNGTSSLFIYKNSGESYQAPDPGNGSIQRPYQDTVAENKDALEKTQTYLKVHSNLVIPDVELETYIPYSTPTLTPTDTFQTVETLTCLSLERNRLTDYYEPLLKNNSTAYSTRGNRIEISYENLQVTNITRKGSMLPVSCKLPNMGTSQSYLSSSMPAGCVLPVGGNFEILLHYYLLNYAKKCHQSEETMVSMIIANALLGIPKVLYKSKTGKYSFPHTYIRAVHALQTNQPLVSSQTGLESVMGKYQLLTSVLQCLTKILTIDMVITVKRHPQKVHNQDSEDEL,mutated_sequence,1.0,723.0,NP_078961.3.a2m,NP_078961.3.npy,ClinVar
+NP_079033.4,NP_079033.4.csv,MAAADAEAVPARGEPQQDCCVKTELLGEETPMAADEGSAEKQAGEAHMAADGETNGSCENSDASSHANAAKHTQDSARVNPQDGTNTLTRIAENGVSERDSEAAKQNHVTADDFVQTSVIGSNGYILNKPALQAQPLRTTSTLASSLPGHAAKTLPGGAGKGRTPSAFPQTPAAPPATLGEGSADTEDRKLPAPGADVKVHRARKTMPKSVVGLHAASKDPREVREARDHKEPKEEINKNISDFGRQQLLPPFPSLHQSLPQNQCYMATTKSQTACLPFVLAAAVSRKKKRRMGTYSLVPKKKTKVLKQRTVIEMFKSITHSTVGSKGEKDLGASSLHVNGESLEMDSDEDDSEELEEDDGHGAEQAAAFPTEDSRTSKESMSEADRAQKMDGESEEEQESVDTGEEEEGGDESDLSSESSIKKKFLKRKGKTDSPWIKPARKRRRRSRKKPSGALGSESYKSSAGSAEQTAPGDSTGYMEVSLDSLDLRVKGILSSQAEGLANGPDVLETDGLQEVPLCSCRMETPKSREITTLANNQCMATESVDHELGRCTNSVVKYELMRPSNKAPLLVLCEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESSISHRFHKDCASRVNNASYCPHCGEESSKAKEVTIAKADTTSTVTPVPGQEKGSALEGRADTTTGSAAGPPLSEDDKLQGAASHVPEGFDPTGPAGLGRPTPGLSQGPGKETLESALIALDSEKPKKLRFHPKQLYFSARQGELQKVLLMLVDGIDPNFKMEHQNKRSPLHAAAEAGHVDICHMLVQAGANIDTCSEDQRTPLMEAAENNHLEAVKYLIKAGALVDPKDAEGSTCLHLAAKKGHYEVVQYLLSNGQMDVNCQDDGGWTPMIWATEYKHVDLVKLLLSKGSDINIRDNEENICLHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIAARENRYDCVVLFLSRDSDVTLKNKEGETPLQCASLNSQVWSALQMSKALQDSAPDRPSPVERIVSRDIARGYERIPIPCVNAVDSEPCPSNYKYVSQNCVTSPMNIDRNITHLQYCVCIDDCSSSNCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREEDSYLFDLDNKDGEVYCIDARFYGNVSRFINHHCEPNLVPVRVFMAHQDLRFPRIAFFSTRLIEAGEQLGFDYGERFWDIKGKLFSCRCGSPKCRHSSAALAQRQASAAQEAQEDGLPDTSSAAAADPL,mutated_sequence,1.0,1298.0,NP_079033.4.a2m,NP_079033.4.npy,ClinVar
+NP_079093.2,NP_079093.2.csv,MVSHFMGSLSVLCFLLLLGFQFVCPQPSTQHRKVPQRMAAEGAPEDDGGGGAPGVWGAWGPWSACSRSCSGGVMEQTRPCLPRSYRLRGGQRPGAPARAFADHVVSAVRTSVPLHRSRDETPALAGTDASRQGPTVLRGSRHPQPQGLEVTGDRRSRTRGTIGPGKYGYGKAPYILPLQTDTAHTPQRLRRQKLSSRHSRSQGASSARHGYSSPAHQVPQHGPLYQSDSGPRSGLQAAEAPIYQLPLTHDQGYPAASSLFHSPETSNNHGVGTHGATQSFSQPARSTAISCIGAYRQYKLCNTNVCPESSRSIREVQCASYNNKPFMGRFYEWEPFAEVKGNRKCELNCQAMGYRFYVRQAEKVIDGTPCDQNGTAICVSGQCKSIGCDDYLGSDKVVDKCGVCGGDNTGCQVVSGVFKHALTSLGYHRVVEIPEGATKINITEMYKSNNYLALRSRSGRSIINGNWAIDRPGKYEGGGTMFTYKRPNEISSTAGESFLAEGPTNEILDVYMIHQQPNPGVHYEYVIMGTNAISPQVPPHRRPGEPFNGQMVTEGRSQEEGEQKGRNEEKEDLRGEAPEMFTSESAQTFPVRHPDRFSPHRPDNLVPPAPQPPRRSRDHNWKQLGTTECSTTCGKGSQYPIFRCVHRSTHEEAPESYCDSSMKPTPEEEPCNIFPCPAFWDIGEWSECSKTCGLGMQHRQVLCRQVYANRSLTVQPYRCQHLEKPETTSTCQLKICSEWQIRTDWTSCSVPCGVGQRTRDVKCVSNIGDVVDDEECNMKLRPNDIENCDMGPCAKSWFLTEWSERCSAECGAGVRTRSVVCMTNHVSSLPLEGCGNNRPAEATPCDNGPCTGKVEWFAGSWSQCSIECGSGTQQREVICVRKNADTFEVLDPSECSFLEKPPSQQSCHLKPCGAKWFSTEWSMCSKSCQGGFRVREVRCLSDDMTLSNLCDPQLKPEERESCNPQDCVPEVDENCKDKYYNCNVVVQARLCVYNYYKTACCASCTRVANRQTGFLGSR,mutated_sequence,1.0,1018.0,NP_079093.2.a2m,NP_079093.2.npy,ClinVar
+NP_079130.2,NP_079130.2.csv,MEAARPPPTAGKFVVVGGGIAGVTCAEQLATHFPSEDILLVTASPVIKAVTNFKQISKILEEFDVEEQSSTMLGKRFPNIKVIESGVKQLKSEEHCIVTEDGNQHVYKKLCLCAGAKPKLICEGNPYVLGIRDTDSAQEFQKQLTKAKRIMIIGNGGIALELVYEIEGCEVIWAIKDKAIGNTFFDAGAAEFLTSKLIAEKSEAKIAHKRTRYTTEGRKKEARSKSKADNVGSALGPDWHEGLNLKGTKEFSHKIHLETMCEVKKIYLQDEFRILKKKSFTFPRDHKSVTADTEMWPVYVELTNEKIYGCDFIVSATGVTPNVEPFLHGNSFDLGEDGGLKVDDHMHTSLPDIYAAGDICTTSWQLSPVWQQMRLWTQARQMGWYAAKCMAAASSGDSIDMDFSFELFAHVTKFFNYKVVLLGKYNAQGLGSDHELMLRCTKGREYIKVVMQNGRMMGAVLIGETDLEETFENLILNQMNLSSYGEDLLDPNIDIEDYFD,mutated_sequence,1.0,500.0,NP_079130.2.a2m,NP_079130.2.npy,ClinVar
+NP_079265.2,NP_079265.2.csv,MFLHSVNLWNLAFYVFMVFLATLGLWDVFFGFEENKCSMSYMFEYPEYQKIELPKKLAKRYPAYELYLYGEGSYAEEHKILPLTGIPVLFLPGNAGSYKQVRSIGSIALRKAEDIDFKYHFDFFSVNFNEELVALYGGSLQKQTKFVHECIKTILKLYKGQEFAPKSVAIIGHSMGGLVARALLTLKNFKHDLINLLITQATPHVAPVMPLDRFITDFYTTVNNYWILNARHINLTTLSVAGGFRDYQVRSGLTFLPKLSHHTSALSVVSSAVPKTWVSTDHLSIVWCKQLQLTTVRAFFDLIDADTKQITQNSKKKLSVLYHHFIRHPSKHFEENPAIISDLTGTSMWVLVKVSKWTYVAYNESEKIYFTFPLENHRKIYTHVYCQSTMLDTNSWIFACINSTSMCLQGVDLSWKAELLPTIKYLTLRLQDYPSLSHLVVYVPSVRGSKFVVDCEFFKKEKRYIQLPVTHLFSFGLSSRKVVLNTNGLYYNLELLNFGQIYQAFKINVVSKCSAVKEEITSIYRLHIPWSYEDSLTIAQAPSSTEISLKLHIAQPENNTHVALFKMYTSSDCRYEVTVKTSFSQILGQVVRFHGGALPAYVVSNILLAYRGQLYSLFSTGCCLEYATMLDKEAKPYKVDPFVIIIKFLLGYKWFKELWDVLLLPELDAVILTCQSMCFPLISLILFLFGTCTAYWSGLLSSASVRLLSSLWLALKRPSELPKDIKMISPDLPFLTIVLIIVSWTTCGALAILLSYLYYVFKVVHLQASLTTFKNSQPVNPKHSRRSEKKSNHHKDSSIHHLRLSANDAEDSLRMHSTVINLLTWIVLLSMPSLIYWLKNLRYYFKLNPDPCKPLAFILIPTMAILGNTYTVSIKSSKLLKTTSQFPLPLAVGVIAFGSAHLYRLPCFVFIPLLLHALCNFM,mutated_sequence,1.0,922.0,NP_079265.2.a2m,NP_079265.2.npy,ClinVar
+NP_079375.3,NP_079375.3.csv,MAAGRAQVPSSEQAWLEDAQVFIQKTLCPAVKEPNVQLTPLVIDCVKTVWLSQGRNQGSTLPLSYSFVSVQDLKTHQRLPCCSHLSWSSSAYQAWAQEAGPNGNPLPREQLLLLGTLTDLSADLEQECRNGSLYVRDNTGVLSCELIDLDLSWLGHLFLFPRWSYLPPARWNSSGEGHLELWDAPVPVFPLTISPGPVTPIPVLYPESASCLLRLRNKLRGVQRNLAGSLVRLSALVKSKQKAYFILSLGRSHPAVTHVSIIVQVPAQLVWHRALRPGTAYVLTELRVSKIRGQRQHVWMTSQSSRLLLLKPECVQELELELEGPLLEADPKPLPMPSNSEDKKDPESLVRYSRLLSYSGAVTGVLNEPAGLYELDGQLGLCLAYQQFRGLRRVMRPGVCLQLQDVHLLQSVGGGTRRPVLAPCLRGAVLLQSFSRQKPGAHSSRQAYGASLYEQLVWERQLGLPLYLWATKALEELACKLCPHVLRHHQFLQHSSPGSPSLGLQLLAPTLDLLAPPGSPVRNAHNEILEEPHHCPLQKYTRLQTPSSFPTLATLKEEGQRKAWASFDPKALLPLPEASYLPSCQLNRRLAWSWLCLLPSAFCPAQVLLGVLVASSHKGCLQLRDQSGSLPCLLLAKHSQPLSDPRLIGCLVRAERFQLIVERDVRSSFPSWKELSMPGFIQKQQARVYVQFFLADALILPVPRPCLHSATPSTPQTDPTGPEGPHLGQSRLFLLCHKEALMKRNFCVPPGASPEVPKPALSFYVLGSWLGGTQRKEGTGWGLPEPQGNDDNDQKVHLIFFGSSVRWFEFLHPGQVYRLIAPGPATPMLFEKDGSSCISRRPLELAGCASCLTVQDNWTLELESSQDIQDVLDANKSLPESSLTDLLSDNFTDSLVSFSAEILSRTLCEPLVASLWMKLGNTGAMRRCVKLTVALETAECEFPPHLDVYIEDPHLPPSLGLLPGARVHFSQLEKRVSRSHNVYCCFRSSTYVQVLSFPPETTISIPLPHIYLAELLQGGQSPFQATASCHIVSVFSLQLFWVCAYCTSICRQGKCTRLGSTCPTQTAISQAIIRLLVEDGTAEAVVTCRNHHVAAALGLCPREWASLLDFVQVPGRVVLQFAGPGAQLESSARVDEPMTMFLWTLCTSPSVLRPIVLSFELERKPSKIVPLEPPRLQRFQCGELPFLTHVNPRLRLSCLSIRESEYSSSLGILASSC,mutated_sequence,1.0,1217.0,NP_079375.3.a2m,NP_079375.3.npy,ClinVar
+NP_079483.3,NP_079483.3.csv,MGWDLGTRLFQRQEQRSRLSRIWLEKTRVFLEGSTRTPALPHCLFWLLQVPSTQDPLFPGYGPQCPVDLAGPPCLRPLFGGLGGYWRALQRGREGRTMTSRASELSPGRSVTAGIIIVGDEILKGHTQDTNTFFLCRTLRSLGVQVCRVSVVPDEVATIAAEVTSFSNRFTHVLTAGGIGPTHDDVTFEAVAQAFGDELKPHPKLEAATKALGGEGWEKLSLVPSSARLHYGTDPCTGQPFRFPLVSVRNVYLFPGIPELLRRVLEGMKGLFQNPAVQFHSKELYVAADEASIAPILAEAQAHFGRRLGLGSYPDWGSNYYQVKLTLDSEEEGPLEECLAYLTARLPQGSLVPYMPNAVEQASEAVYKLAESGSSLGKKVAGALQTIETSLAQYSLTQLCVGFNGGKDCTALLHLFHAAVQRKLPDVPNPLQILYIRSISPFPELEQFLQDTIKRYNLQMLEAEGSMKQALGELQARHPQLEAVLMGTRRTDPYSCSLCPFSPTDPGWPAFMRINPLLDWTYRDIWDFLRQLFVPYCILYDRGYTSLGSRENTVRNPALKCLSPGGHPTYRPAYLLENEEEERNSRT,mutated_sequence,1.0,587.0,NP_079483.3.a2m,NP_079483.3.npy,ClinVar
+NP_079495.1,NP_079495.1.csv,MADQRQRSLSTSGESLYHVLGLDKNATSDDIKKSYRKLALKYHPDKNPDNPEAADKFKEINNAHAILTDATKRNIYDKYGSLGLYVAEQFGEENVNTYFVLSSWWAKALFVFCGLLTCCYCCCCLCCCFNCCCGKCKPKAPEGEETEFYVSPEDLEAQLQSDEREATDTPIVIQPASATETTQLTADSHPSYHTDGFN,mutated_sequence,1.0,198.0,NP_079495.1.a2m,NP_079495.1.npy,ClinVar
+NP_079519.1,NP_079519.1.csv,MDCYRTSLSSSWIYPTVILCLFGFFSMMRPSEPFLIPYLSGPDKNLTSAEITNEIFPVWTYSYLVLLLPVFVLTDYVRYKPVIILQGISFIITWLLLLFGQGVKTMQVVEFFYGMVTAAEVAYYAYIYSVVSPEHYQRVSGYCRSVTLAAYTAGSVLAQLLVSLANMSYFYLNVISLASVSVAFLFSLFLPMPKKSMFFHAKPSREIKKSSSVNPVLEETHEGEAPGCEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWSLWWAFATAGFNQVLNYVQILWDYKAPSQDSSIYNGAVEAIATFGGAVAAFAVGYVKVNWDLLGELALVVFSVVNAGSLFLMHYTANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVERYALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMRSMYITYSTKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL,mutated_sequence,1.0,496.0,NP_079519.1.a2m,NP_079519.1.npy,ClinVar
+NP_079541.1,NP_079541.1.csv,MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEMINNNVIVRNAEDIEQLYGKGYFGKGILSRSRPSFTISDPKLVAKWKDMKTNMPIITSKRYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKLNSGMVSNMEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVREDASPLPHVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAAPNEELVQRNRLICRRNPYRIFEYLQLSLEEAFFLVYALGCLSIYYEKEPLTIVKLWKAFTVVQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFYHASYSVIIELVDDHFEGSLRRPLSWKSLAALSRVSVNVSKELMLCYLIKPSTMTDKEMESPECMKRIKVQEVILSRWVSSRERSDQDDL,mutated_sequence,1.0,465.0,NP_079541.1.a2m,NP_079541.1.npy,ClinVar
+NP_085911.2,NP_085911.2.csv,MANETQKVGAIHFPFPFTPYSIQEDFMAELYRVLEAGKIGIFESPTGTGKSLSLICGALSWLRDFEQKKREEEARLLETGTGPLHDEKDESLCLSSSCEGAAGTPRPAGEPAWVTQFVQKKEERDLVDRLKAEQARRKQREERLQQLQHRVQLKYAAKRLRQEEEERENLLRLSREMLETGPEAERLEQLESGEEELVLAEYESDEEKKVASRVDEDEDDLEEEHITKIYYCSRTHSQLAQFVHEVKKSPFGKDVRLVSLGSRQNLCVNEDVKSLGSVQLINDRCVDMQRSRHEKKKGAEEEKPKRRRQEKQAACPFYNHEQMGLLRDEALAEVKDMEQLLALGKEARACPYYGSRLAIPAAQLVVLPYQMLLHAATRQAAGIRLQDQVVIIDEAHNLIDTITGMHSVEVSGSQLCQAHSQLLQYVERYGKRLKAKNLMYLKQILYLLEKFVAVLGGNIKQNPNTQSLSQTGTELKTINDFLFQSQIDNINLFKVQRYCEKSMISRKLFGFTERYGAVFSSREQPKLAGFQQFLQSLQPRTTEALAAPADESQASTLRPASPLMHIQGFLAALTTANQDGRVILSRQGSLSQSTLKFLLLNPAVHFAQVVKECRAVVIAGGTMQPVSDFRQQLLACAGVEAERVVEFSCGHVIPPDNILPLVICSGISNQPLEFTFQKRELPQMMDEVGRILCNLCGVVPGGVVCFFPSYEYLRQVHAHWEKGGLLGRLAARKKIFQEPKSAHQVEQVLLAYSRCIQACGQERGQVTGALLLSVVGGKMSEGINFSDNLGRCVVMVGMPFPNIRSAELQEKMAYLDQTLPRAPGQAPPGKALVENLCMKAVNQSIGRAIRHQKDFASVVLLDQRYARPPVLAKLPAWIRARVEVKATFGPAIAAVQKFHREKSASS,mutated_sequence,1.0,906.0,NP_085911.2.a2m,NP_085911.2.npy,ClinVar
+NP_109587.1,NP_109587.1.csv,MLARRKPVLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEELELDEQQKKRLEAFLTQKAKVGELKDDDFERISELGAGNGGVVTKVQHRPSGLIMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKEAKRIPEEILGKVSIAVLRGLAYLREKHQIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFVGTRSYMAPERLQGTHYSVQSDIWSMGLSLVELAVGRYPIPPPDAKELEAIFGRPVVDGEEGEPHSISPRPRPPGRPVSGHGMDSRPAMAIFELLDYIVNEPPPKLPNGVFTPDFQEFVNKCLIKNPAERADLKMLTNHTFIKRSEVEEVDFAGWLCKTLRLNQPGTPTRTAV,mutated_sequence,1.0,400.0,NP_109587.1.a2m,NP_109587.1.npy,ClinVar
+NP_110426.4,NP_110426.4.csv,MSGFENLNTDFYQTSYSIDDQSQQSYDYGGSGGPYSKQYAGYDYSQQGRFVPPDMMQPQQPYTGQIYQPTQAYTPASPQPFYGNNFEDEPPLLEELGINFDHIWQKTLTVLHPLKVADGSIMNETDLAGPMVFCLAFGATLLLAGKIQFGYVYGISAIGCLGMFCLLNLMSMTGVSFGCVASVLGYCLLPMILLSSFAVIFSLQGMVGIILTAGIIGWCSFSASKIFISALAMEGQQLLVAYPCALLYGVFALISVF,mutated_sequence,1.0,257.0,NP_110426.4.a2m,NP_110426.4.npy,ClinVar
+NP_112506.2,NP_112506.2.csv,MDEKTKKAEEMALSLTRAVAGGDEQVAMKCAIWLAEQRVPLSVQLKPEVSPTQDIRLWVSVEDAQMHTVTIWLTVRPDMTVASLKDMVFLDYGFPPVLQQWVIGQRLARDQETLHSHGVRQNGDSAYLYLLSARNTSLNPQELQRERQLRMLEDLGFKDLTLQPRGPLEPGPPKPGVPQEPGRGQPDAVPEPPPVGWQCPGCTFINKPTRPGCEMCCRARPEAYQVPASYQPDEEERARLAGEEEALRQYQQRKQQQQEGNYLQHVQLDQRSLVLNTEPAECPVCYSVLAPGEAVVLRECLHTFCRECLQGTIRNSQEAEVSCPFIDNTYSCSGKLLEREIKALLTPEDYQRFLDLGISIAENRSAFSYHCKTPDCKGWCFFEDDVNEFTCPVCFHVNCLLCKAIHEQMNCKEYQEDLALRAQNDVAARQTTEMLKVMLQQGEAMRCPQCQIVVQKKDGCDWIRCTVCHTEICWVTKGPRWGPGGPGDTSGGCRCRVNGIPCHPSCQNCH,mutated_sequence,1.0,510.0,NP_112506.2.a2m,NP_112506.2.npy,ClinVar
+NP_112553.1,NP_112553.1.csv,METEQPEETFPNTETNGEFGKRPAEDMEEEQAFKRSRNTDEMVELRILLQSKNAGAVIGKGGKNIKALRTDYNASVSVPDSSGPERILSISADIETIGEILKKIIPTLEEGLQLPSPTATSQLPLESDAVECLNYQHYKGSDFDCELRLLIHQSLAGGIIGVKGAKIKELRENTQTTIKLFQECCPHSTDRVVLIGGKPDRVVECIKIILDLISESPIKGRAQPYDPNFYDETYDYGGFTMMFDDRRGRPVGFPMRGRGGFDRMPPGRGGRPMPPSRRDYDDMSPRRGPPPPPPGRGGRGGSRARNLPLPPPPPPRGGDLMAYDRRGRPGDRYDGMVGFSADETWDSAIDTWSPSEWQMAYEPQGGSGYDYSYAGGRGSYGDLGGPIITTQVTIPKDLAGSIIGKGGQRIKQIRHESGASIKIDEPLEGSEDRIITITGTQDQIQNAQYLLQNSVKQYADVEGF,mutated_sequence,1.0,464.0,NP_112553.1.a2m,NP_112553.1.npy,ClinVar
+NP_112586.1,NP_112586.1.csv,MAEEMESSLEASFSSSGAVSGASGFLPPARSRIFKIIVIGDSNVGKTCLTYRFCAGRFPDRTEATIGVDFRERAVEIDGERIKIQLWDTAGQERFRKSMVQHYYRNVHAVVFVYDMTNMASFHSLPSWIEECKQHLLANDIPRILVGNKCDLRSAIQVPTDLAQKFADTHSMPLFETSAKNPNDNDHVEAIFMTLAHKLKSHKPLMLSQPPDNGIILKPEPKPAMTCWC,mutated_sequence,1.0,229.0,NP_112586.1.a2m,NP_112586.1.npy,ClinVar
+NP_113584.3,NP_113584.3.csv,MKVDRTKLKKTPTEAPADCRALIDKLKVCNDEQLLLELQQIKTWNIGKCELYHWVDLLDRFDGILADAGQTVENMSWMLVCDRPEREQLKMLLLAVLNFTALLIEYSFSRHLYSSIEHLTTLLASSDMQVVLAVLNLLYVFSKRSNYITRLGSDKRTPLLTRLQHLAESWGGKENGFGLAECCRDLHMMKYPPSATTLHFEFYADPGAEVKIEKRTTSNTLHYIHIEQLDKISESPSEIMESLTKMYSIPKDKQMLLFTHIRLAHGFSNHRKRLQAVQARLHAISILVYSNALQESANSILYNGLIEELVDVLQITDKQLMEIKAASLRTLTSIVHLERTPKLSSIIDCTGTASYHGFLPVLVRNCIQAMIDPSMDPYPHQFATALFSFLYHLASYDAGGEALVSCGMMEALLKVIKFLGDEQDQITFVTRAVRVVDLITNLDMAAFQSHSGLSIFIYRLEHEVDLCRKECPFVIKPKIQRPNTTQEGEEMETDMDGVQCIPQRAALLKSMLNFLKKAIQDPAFSDGIRHVMDGSLPTSLKHIISNAEYYGPSLFLLATEVVTVFVFQEPSLLSSLQDNGLTDVMLHALLIKDVPATREVLGSLPNVFSALCLNARGLQSFVQCQPFERLFKVLLSPDYLPAMRRRRSSDPLGDTASNLGSAVDELMRHQPTLKTDATTAIIKLLEEICNLGRDPKYICQKPSIQKADGTATAPPPRSNHAAEEASSEDEEEEEVQAMQSFNSTQQNETEPNQQVVGTEERIPIPLMDYILNVMKFVESILSNNTTDDHCQEFVNQKGLLPLVTILGLPNLPIDFPTSAACQAVAGVCKSILTLSHEPKVLQEGLLQLDSILSSLEPLHRPIESPGGSVLLRELACAGNVADATLSAQATPLLHALTAAHAYIMMFVHTCRVGQSEIRSISVNQWGSQLGLSVLSKLSQLYCSLVWESTVLLSLCTPNSLPSGCEFGQADMQKLVPKDEKAGTTQGGKRSDGEQDGAAGSMDASTQGLLEGIGLDGDTLAPMETDEPTASDSKGKSKITPAMAARIKQIKPLLSASSRLGRALAELFGLLVKLCVGSPVRQRRSHHAASTTTAPTPAARSTASALTKLLTKGLSWQPPPYTPTPRFRLTFFICSVGFTSPMLFDERKYPYHLMLQKFLCSGGHNALFETFNWALSMGGKVPVSEGLEHSDLPDGTGEFLDAWLMLVEKMVNPTTVLESPHSLPAKLPGGVQNFPQFSALRFLVVTQKAAFTCIKNLWNRKPLKVYGGRMAESMLAILCHILRGEPVIRERLSKEKEGSRGEEDTGQEEGGSRREPQVNQQQLQQLMDMGFTREHAMEALLNTSTMEQATEYLLTHPPPIMGGVVRDLSMSEEDQMMRAIAMSLGQDIPMDQRAESPEEVACRKEEEERKAREKQEEEEAKCLEKFQDADPLEQDELHTFTDTMLPGCFHLLDELPDTVYRVCDLIMTAIKRNGADYRDMILKQVVNQVWEAADVLIKAALPLTTSDTKTVSEWISQMATLPQASNLATRILLLTLLFEELKLPCAWVVESSGILNVLIKLLEVVQPCLQAAKEQKEVQTPKWITPVLLLIDFYEKTAISSKRRAQMTKYLQSNSNNWRWFDDRSGRWCSYSASNNSTIDSAWKSGETSVRFTAGRRRYTVQFTTMVQVNEETGNRRPVMLTLLRVPRLNKNSKNSNGQELEKTLEESKEMDIKRKENKGNDTPLALESTNTEKETSLEETKIGEILIQGLTEDMVTVLIRACVSMLGVPVDPDTLHATLRLCLRLTRDHKYAMMFAELKSTRMILNLTQSSGFNGFTPLVTLLLRHIIEDPCTLRHTMEKVVRSAATSGAGSTTSGVVSGSLGSREINYILRVLGPAACRNPDIFTEVANCCIRIALPAPRGSGTASDDEFENLRIKGPNAVQLVKTTPLKPSPLPVIPDTIKEVIYDMLNALAAYHAPEEADKSDPKPGVMTQEVGQLLQDMGDDVYQQYRSLTRQSSDFDTQSGFSINSQVFAADGASTETSASGTSQGEASTPEESRDGKKDKEGDRASEEGKQKGKGSKPLMPTSTILRLLAELVRSYVGIATLIANYSYTVGQSELIKEDCSVLAFVLDHLLPHTQNAEDKDTPALARLFLASLAAAGSGTDAQVALVNEVKAALGRALAMAESTEKHARLQAVMCIISTIMESCPSTSSFYSSATAKTQHNGMNNIIRLFLKKGLVNDLARVPHSLDLSSPNMANTVNAALKPLETLSRIVNQPSSLFGSKSASSKNKSEQDAQGASQDSSSNQQDPGEPGEAEVQEEDHDVTQTEVADGDIMDGEAETDSVVIAGQPEVLSSQEMQVENELEDLIDELLERDGGSGNSTIIVSRSGEDESQEDVLMDEAPSNLSQASTLQANREDSMNILDPEDEEEHTQEEDSSGSNEDEDDSQDEEEEEEEDEEDDQEDDEGEEGDEDDDDDGSEMELDEDYPDMNASPLVRFERFDREDDLIIEFDNMFSSATDIPPSPGNIPTTHPLMVRHADHSSLTLGSGSSTTRLTQGIGRSQRTLRQLTANTGHTIHVHYPGNRQPNPPLILQRLLGPSAAADILQLSSSLPLQSRGRARLLVGNDDVHIIARSDDELLDDFFHDQSTATSQAGTLSSIPTALTRWTEECKVLDAESMHDCVSVVKVSIVNHLEFLRDEELEERREKRRKQLAEEETKITDKGKEDKENRDQSAQCTASKSNDSTEQNLSDGTPMPDSYPTTPSSTDAATSESKETLGTLQSSQQQPTLPTPPALGEVPQELQSPAGEGGSSTQLLMPVEPEELGPTRPSGEAETTQMELSPAPTITSLSPERAEDSDALTAVSSQLEGSPMDTSSLASCTLEEAVGDTSAAGSSEQPRAGSSTPGDAPPAVAEVQGRSDGSGESAQPPEDSSPPASSESSSTRDSAVAISGADSRGILEEPLPSTSSEEEDPLAGISLPEGVDPSFLAALPDDIRREVLQNQLGIRPPTRTAPSTNSSAPAVVGNPGVTEVSPEFLAALPPAIQEEVLAQQRAEQQRRELAQNASSDTPMDPVTFIQTLPSDLRRSVLEDMEDSVLAVMPPDIAAEAQALRREQEARQRQLMHERLFGHSSTSALSAILRSPAFTSRLSGNRGVQYTRLAVQRGGTFQMGGSSSHNRPSGSNVDTLLRLRGRLLLDHEALSCLLVLLFVDEPKLNTSRLHRVLRNLCYHAQTRHWVIRSLLSILQRSSESELCIETPKLTTSEEKGKKSSKSCGSSSHENRPLDLLHKMESKSSNQLSWLSVSMDAALGCRTNIFQIQRSGGRKHTEKHASGGSTVHIHPQAAPVVCRHVLDTLIQLAKVFPSHFTQQRTKETNCESDRERGNKACSPCSSQSSSSGICTDFWDLLVKLDNMNVSRKGKNSVKSVPVSAGGEGETSPYSLEASPLGQLMNMLSHPVIRRSSLLTEKLLRLLSLISIALPENKVSEAQANSGSGASSTTTATSTTSTTTTTAASTTPTPPTAPTPVTSAPALVAATAISTIVVAASTTVTTPTTATTTVSISPTTKGSKSPAKVSDGGSSSTDFKMVSSGLTENQLQLSVEVLTSHSCSEEGLEDAANVLLQLSRGDSGTRDTVLKLLLNGARHLGYTLCKQIGTLLAELREYNLEQQRRAQCETLSPDGLPEEQPQTTKLKGKMQSRFDMAENVVIVASQKRPLGGRELQLPSMSMLTSKTSTQKFFLRVLQVIIQLRDDTRRANKKAKQTGRLGSSGLGSASSIQAAVRQLEAEADAIIQMVREGQRARRQQQAATSESSQSEASVRREESPMDVDQPSPSAQDTQSIASDGTPQGEKEKEERPPELPLLSEQLSLDELWDMLGECLKELEESHDQHAVLVLQPAVEAFFLVHATERESKPPVRDTRESQLAHIKDEPPPLSPAPLTPATPSSLDPFFSREPSSMHISSSLPPDTQKFLRFAETHRTVLNQILRQSTTHLADGPFAVLVDYIRVLDFDVKRKYFRQELERLDEGLRKEDMAVHVRRDHVFEDSYRELHRKSPEEMKNRLYIVFEGEEGQDAGGLLREWYMIISREMFNPMYALFRTSPGDRVTYTINPSSHCNPNHLSYFKFVGRIVAKAVYDNRLLECYFTRSFYKHILGKSVRYTDMESEDYHFYQGLVYLLENDVSTLGYDLTFSTEVQEFGVCEVRDLKPNGANILVTEENKKEYVHLVCQMRMTGAIRKQLAAFLEGFYEIIPKRLISIFTEQELELLISGLPTIDIDDLKSNTEYHKYQSNSIQIQWFWRALRSFDQADRAKFLQFVTGTSKVPLQGFAALEGMNGIQKFQIHRDDRSTDRLPSAHTCFNQLDLPAYESFEKLRHMLLLAIQECSEGFGLA,mutated_sequence,1.0,4374.0,NP_113584.3.a2m,NP_113584.3.npy,ClinVar
+NP_113663.2,NP_113663.2.csv,MALEQALQAARQGELDVLRSLHAAGLLGPSLRDPLDALPVHHAARAGKLHCLRFLVEEAALPAAARARNGATPAHDASATGHLACLQWLLSQGGCRVQDKDNSGATVLHLAARFGHPEVVNWLLHHGGGDPTAATDMGALPIHYAAAKGDFPSLRLLVEHYPEGVNAQTKNGATPLYLACQEGHLEVTQYLVQECGADPHARAHDGMTPLHAAAQMGHSPVIVWLVSCTDVSLSEQDKDGATAMHFAASRGHTKVLSWLLLHGGEISADLWGGTPLHDAAENGELECCQILVVNGAELDVRDRDGYTAADLSDFNGHSHCTRYLRTVENLSVEHRVLSRDPSAELEAKQPDSGMSSPNTTVSVQPLNFDLSSPTSTLSNYDSCSSSHSSIKGQHPPCGLSSARAADIQSYMDMLNPELGLPRGTIGKPTPPPPPPSFPPPPPPPGTQLPPPPPGYPAPKPPVGPQAADIYMQTKNKLRHVETEALKKELSSCDGHDGLRRQDSSRKPRAFSKQPSTGDYYRQLGRCPGETLAARPGMAHSEEVRARQPARAGCPRLGPAARGSLEGPSAPPQAALLPGNHVPNGCAADPKASRELPPPPPPPPPPLPEAASSPPPAPPLPLESAGPGCGQRRSSSSTGSTKSFNMMSPTGDNSELLAEIKAGKSLKPTPQSKGLTTVFSGIGQPAFQPDSPLPSVSPALSPVRSPTPPAAGFQPLLNGSLVPVPPTTPAPGVQLDVEALIPTHDEQGRPIPEWKRQVMVRKMQLKMQEEEEQRRKEEEEEARLASMPAWRRDLLRKKLEEEREQKRKEEERQKQEELRREKEQSEKLRTLGYDESKLAPWQRQVILKKGDIAKY,mutated_sequence,1.0,854.0,NP_113663.2.a2m,NP_113663.2.npy,ClinVar
+NP_114032.2,NP_114032.2.csv,MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDDEEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGDQMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGEQQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGKKKAEGGGGGGRPGAPAAGDGKTEQKGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNCDLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKDIDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFANFESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPIPEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNILGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVSAAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLPEVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFNRGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVFPGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWSQHYHQGYY,mutated_sequence,1.0,825.0,NP_114032.2.a2m,NP_114032.2.npy,ClinVar
+NP_115501.2,NP_115501.2.csv,MAEAEGSSLLLLPPPPPPPRMAEVEAPTAAETDMKQYQGSGGVAMDVERSRFPYCVVWTPIPVLTWFFPIIGHMGICTSTGVIRDFAGPYFVSEDNMAFGKPAKYWKLDPAQVYASGPNAWDTAVHDASEEYKHRMHNLCCDNCHSHVALALNLMRYNNSTNWNMVTLCFFCLLYGKYVSVGAFVKTWLPFILLLGIILTVSLVFNLR,mutated_sequence,1.0,208.0,NP_115501.2.a2m,NP_115501.2.npy,ClinVar
+NP_115785.1,NP_115785.1.csv,MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVGLGLPNRLRFFRQSVAGLAARLQRQFVVRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRRAVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQNLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPLAIKMMWNISAGSSSEAILNTMSQELVPASRVALAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLPSRLHPEGLGHGRTLFLVMKNYPCTLRQYLCVNTPSPRLAAMMLLQLLEGVDHLVQQGIAHRDLKSDNILVELDPDGCPWLVIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVSTARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPPDVRQLVRALLQREASKRPSARVAANVLHLSLWGEHILALKNLKLDKMVGWLLQQSAATLLANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL,mutated_sequence,1.0,581.0,NP_115785.1.a2m,NP_115785.1.npy,ClinVar
+NP_115791.3,NP_115791.3.csv,MPGGGPEMDDYMETLKDEEDALWENVECNRHMLSRYINPAKLTPYLRQCKVIDEQDEDEVLNAPMLPSKINRAGRLLDILHTKGQRGYVVFLESLEFYYPELYKLVTGKEPTRRFSTIVVEEGHEGLTHFLMNEVIKLQQQMKAKDLQRCELLARLRQLEDEKKQMTLTRVELLTFQERYYKMKEERDSYNDELVKVKDDNYNLAMRYAQLSEEKNMAVMRSRDLQLEIDQLKHRLNKMEEECKLERNQSLKLKNDIENRPKKEQVLELERENEMLKTKNQELQSIIQAGKRSLPDSDKAILDILEHDRKEALEDRQELVNRIYNLQEEARQAEELRDKYLEEKEDLELKCSTLGKDCEMYKHRMNTVMLQLEEVERERDQAFHSRDEAQTQYSQCLIEKDKYRKQIRELEEKNDEMRIEMVRREACIVNLESKLRRLSKDSNNLDQSLPRNLPVTIISQDFGDASPRTNGQEADDSSTSEESPEDSKYFLPYHPPQRRMNLKGIQLQRAKSPISLKRTSDFQAKGHEEEGTDASPSSCGSLPITNSFTKMQPPRSRSSIMSITAEPPGNDSIVRRYKEDAPHRSTVEEDNDSGGFDALDLDDDSHERYSFGPSSIHSSSSSHQSEGLDAYDLEQVNLMFRKFSLERPFRPSVTSVGHVRGPGPSVQHTTLNGDSLTSQLTLLGGNARGSFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWTIQRCSGPVTLHYKVNHEGYRKLVKDMEDGLITSGDSFYIRLNLNISSQLDACTMSLKCDDVVHVRDTMYQDRHEWLCARVDPFTDHDLDMGTIPSYSRAQQLLLVKLQRLMHRGSREEVDGTHHTLRALRNTLQPEEALSTSDPRVSPRLSRASFLFGQLLQFVSRSENKYKRMNSNERVRIISGSPLGSLARSSLDATKLLTEKQEELDPESELGKNLSLIPYSLVRAFYCERRRPVLFTPTVLAKTLVQRLLNSGGAMEFTICKSDIVTRDEFLRRQKTETIIYSREKNPNAFECIAPANIEAVAAKNKHCLLEAGIGCTRDLIKSNIYPIVLFIRVCEKNIKRFRKLLPRPETEEEFLRVCRLKEKELEALPCLYATVEPDMWGSVEELLRVVKDKIGEEQRKTIWVDEDQL,mutated_sequence,1.0,1154.0,NP_115791.3.a2m,NP_115791.3.npy,ClinVar
+NP_115909.1,NP_115909.1.csv,MAAGLARLLLLLGLSAGGPAPAGAAKMKVVEEPNAFGVNNPFLPQASRLQAKRDPSPVSGPVHLFRLSGKCFSLVESTYKYEFCPFHNVTQHEQTFRWNAYSGILGIWHEWEIANNTFTGMWMRDGDACRSRSRQSKVELACGKSNRLAHVSEPSTCVYALTFETPLVCHPHALLVYPTLPEALQRQWDQVEQDLADELITPQGHEKLLRTLFEDAGYLKTPEENEPTQLEGGPDSLGFETLENCRKAHKELSKEIKRLKGLLTQHGIPYTRPTETSNLEHLGHETPRAKSPEQLRGDPGLRGSL,mutated_sequence,1.0,305.0,NP_115909.1.a2m,NP_115909.1.npy,ClinVar
+NP_116027.2,NP_116027.2.csv,MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDEVDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGKAALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLTPTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLTESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPASSFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHPSSSLSFGHPHPSSMVTAMG,mutated_sequence,1.0,480.0,NP_116027.2.a2m,NP_116027.2.npy,ClinVar
+NP_116250.3,NP_116250.3.csv,MSLAAYCVICCRRIGTSTSPPKSGTHWRDIRNIIKFTGSLILGGSLFLTYEVLALKKAVTLDTQVVEREKMKSYIYVHTVSLDKGENHGIAWQARKELHKAVRKVLATSAKILRNPFADPFSTVDIEDHECAVWLLLRKSKSDDKTTRLEAVREMSETHHWHDYQYRIIAQACDPKTLIGLARSEESDLRFFLLPPPLPSLKEDSSTEEELRQLLASLPQTELDECIQYFTSLALSESSQSLAAQKGGLWCFGGNGLPYAESFGEVPSATVEMFCLEAIVKHSEISTHCDKIEANGGLQLLQRLYRLHKDCPKVQRNIMRVIGNMALNEHLHSSIVRSGWVSIMAEAMKSPHIMESSHAARILANLDRETVQEKYQDGVYVLHPQYRTSQPIKADVLFIHGLMGAAFKTWRQQDSEQAVIEKPMEDEDRYTTCWPKTWLAKDCPALRIISVEYDTSLSDWRARCPMERKSIAFRSNELLRKLRAAGVGDRPVVWISHSMGGLLVKKMLLEASTKPEMSTVINNTRGIIFYSVPHHGSRLAEYSVNIRYLLFPSLEVKELSKDSPALKTLQDDFLEFAKDKNFQVLNFVETLPTYIGSMIKLHVVPVESADLGIGDLIPVDVNHLNICKPKKKDAFLYQRTLQFIREALAKDLEN,mutated_sequence,1.0,654.0,NP_116250.3.a2m,NP_116250.3.npy,ClinVar
+NP_149078.1,NP_149078.1.csv,MAEEQGRERDSVPKPSVLFLHPDLGVGGAERLVLDAALALQARGCSVKIWTAHYDPGHCFAESRELPVRCAGDWLPRGLGWGGRGAAVCAYVRMVFLALYVLFLADEEFDVVVCDQVSACIPVFRLARRRKKILFYCHFPDLLLTKRDSFLKRLYRAPIDWIEEYTTGMADCILVNSQFTAAVFKETFKSLSHIDPDVLYPSLNVTSFDSVVPEKLDDLVPKGKKFLLLSINRYERKKNLTLALEALVQLRGRLTSQDWERVHLIVAGGYDERVLENVEHYQELKKMVQQSDLGQYVTFLRSFSDKQKISLLHSCTCVLYTPSNEHFGIVPLEAMYMQCPVIAVNSGGPLESIDHSVTGFLCEPDPVHFSEAIEKFIREPSLKATMGLAGRARVKEKFSPEAFTEQLYRYVTKLLV,mutated_sequence,1.0,416.0,NP_149078.1.a2m,NP_149078.1.npy,ClinVar
+NP_203123.1,NP_203123.1.csv,MMAEEHTDLEAQIVKDIHCKEIDLVNRDPKNINEDIVKVDFEDVIAEPVGTYSFDGVWKVSYTTFTVSKYWCYRLLSTLLGVPLALLWGFLFACISFCHIWAVVPCIKSYLIEIQCISHIYSLCIRTFCNPLFAALGQVCSSIKVVLRKEV,mutated_sequence,1.0,151.0,NP_203123.1.a2m,NP_203123.1.npy,ClinVar
+NP_203524.1,NP_203524.1.csv,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKRVKDSEDVPMVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQRVEDAFYTLVREIRQYRLKKISKEEKTPGCVKIKKCIIM,mutated_sequence,1.0,189.0,NP_203524.1.a2m,NP_203524.1.npy,ClinVar
+NP_203699.1,NP_203699.1.csv,MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPGLPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQGIPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPGIQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGPPGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGNIGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPGPPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDTCFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGFPGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKGERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTITQPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGIPGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFPGPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHGIPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGMMGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLLGSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLIGPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPGPKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPGNPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGPTGFQGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGIPGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGPPGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPFISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQVCMKRT,mutated_sequence,1.0,1691.0,NP_203699.1.a2m,NP_203699.1.npy,ClinVar
+NP_338599.1,NP_338599.1.csv,MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE,mutated_sequence,1.0,314.0,NP_338599.1.a2m,NP_338599.1.npy,ClinVar
+NP_443077.1,NP_443077.1.csv,MAVCGLGSRLGLGSRLGLRGCFGAARLLYPRFQSRGPQGVEDGDRPQPSSKTPRIPKIYTKTGDKGFSSTFTGERRPKDDQVFEAVGTTDELSSAIGFALELVTEKGHTFAEELQKIQCTLQDVGSALATPCSSAREAHLKYTTFKAGPILELEQWIDKYTSQLPPLTAFILPSGGKISSALHFCRAVCRRAERRVVPLVQMGETDANVAKFLNRLSDYLFTLARYAAMKEGNQEKIYMKNDPSAESEGL,mutated_sequence,1.0,250.0,NP_443077.1.a2m,NP_443077.1.npy,ClinVar
+NP_443106.1,NP_443106.1.csv,MKDRTQELRSAKDSDDEEEVVHVDRDHFMDEFFEQVEEIRGCIEKLSEDVEQVKKQHSAILAAPNPDEKTKQELEDLTADIKKTANKVRSKLKAIEQSIEQEEGLNRSSADLRIRKTQHSTLSRKFVEVMTEYNATQSKYRDRCKDRIQRQLEITGRTTTNEELEDMLESGKLAIFTDDIKMDSQMTKQALNEIETRHNEIIKLETSIRELHDMFVDMAMLVESQGEMIDRIEYNVEHSVDYVERAVSDTKKAVKYQSKARRKKIMIIICCVVLGVVLASSIGGTLGL,mutated_sequence,1.0,288.0,NP_443106.1.a2m,NP_443106.1.npy,ClinVar
+NP_443715.1,NP_443715.1.csv,MRAVLTWRDKAEHCINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSVIIWTSKLEGILKYTHNDAIQCVSYNPITHQLASCSSSDFGLWSPEQKSVSKHKSSSKIICCSWTNDGQYLALGMFNGIISIRNKNGEEKVKIERPGGSLSPIWSICWNPSSRWESFWMNRENEDAEDVIVNRYIQEIPSTLKSAVYSSQGSEAEEEEPEEEDDSPRDDNLEERNDILAVADWGQKVSFYQLSGKQIGKDRALNFDPCCISYFTKGEYILLGGSDKQVSLFTKDGVRLGTVGEQNSWVWTCQAKPDSNYVVVGCQDGTISFYQLIFSTVHGLYKDRYAYRDSMTDVIVQHLITEQKVRIKCKELVKKIAIYRNRLAIQLPEKILIYELYSEDLSDMHYRVKEKIIKKFECNLLVVCANHIILCQEKRLQCLSFSGVKEREWQMESLIRYIKVIGGPPGREGLLVGLKNGQILKIFVDNLFAIVLLKQATAVRCLDMSASRKKLAVVDENDTCLVYDIDTKELLFQEPNANSVAWNTQCEDMLCFSGGGYLNIKASTFPVHRQKLQGFVVGYNGSKIFCLHVFSISAVEVPQSAPMYQYLDRKLFKEAYQIACLGVTDTDWRELAMEALEGLDFETAKKAFIRVQDLRYLELISSIEERKKRGETNNDLFLADVFSYQGKFHEAAKLYKRSGHENLALEMYTDLCMFEYAKDFLGSGDPKETKMLITKQADWARNIKEPKAAVEMYISAGEHVKAIEICGDHGWVDMLIDIARKLDKAEREPLLLCATYLKKLDSPGYAAETYLKMGDLKSLVQLHVETQRWDEAFALGEKHPEFKDDIYMPYAQWLAENDRFEEAQKAFHKAGRQREAVQVLEQLTNNAVAESRFNDAAYYYWMLSMQCLDIAQDPAQKDTMLGKFYHFQRLAELYHGYHAIHRHTEDPFSVHRPETLFNISRFLLHSLPKDTPSGISKVKILFTLAKQSKALGAYRLARHAYDKLRGLYIPARFQKSIELGTLTIRAKPFHDSEELVPLCYRCSTNNPLLNNLGNVCINCRQPFIFSASSYDVLHLVEFYLEEGITDEEAISLIDLEVLRPKRDDRQLEIANNSSQILRLVETKDSIGDEDPFTAKLSFEQGGSEFVPVVVSRLVLRSMSRRDVLIKRWPPPLRWQYFRSLLPDASITMCPSCFQMFHSEDYELLVLQHGCCPYCRRCKDDPGP,mutated_sequence,1.0,1241.0,NP_443715.1.a2m,NP_443715.1.npy,ClinVar
+NP_444504.1,NP_444504.1.csv,MAVEELQSIIKRCQILEEQDFKEEDFGLFQLAGQRCIEEGHTDQLLEIIQNEKNKVIIKNMGWNLVGPVVRCLLCKDKEDSKRKVYFLIFDLLVKLCNPKELLLGLLELIEEPSGKQISQSILLLLQPLQTVIQKLHNKAYSIGLALSTLWNQLSLLPVPYSKEQIQMDDYGLCQCCKALIEFTKPFVEEVIDNKENSLENEKLKDELLKFCFKSLKCPLLTAQFFEQSEEGGNDPFRYFASEIIGFLSAIGHPFPKMIFNHGRKKRTWNYLEFEEEENKQLADSMASLAYLVFVQGIHIDQLPMVLSPLYLLQFNMGHIEVFLQRTEESVISKGLELLENSLLRIEDNSLLYQYLEIKSFLTVPQGLVKVMTLCPIETLRKKSLAMLQLYINKLDSQGKYTLFRCLLNTSNHSGVEAFIIQNIKNQIDMSLKRTRNNKWFTGPQLISLLDLVLFLPEGAETDLLQNSDRIMASLNLLRYLVIKDNENDNQTGLWTELGNIENNFLKPLHIGLNMSKAHYEAEIKNSQEAQKSKDLCSITVSGEEIPNMPPEMQLKVLHSALFTFDLIESVLARVEELIEIKTKSTSEENIGIK,mutated_sequence,1.0,594.0,NP_444504.1.a2m,NP_444504.1.npy,ClinVar
+NP_473368.1,NP_473368.1.csv,MVKFPALTHYWPLIRFLVPLGITNIAIDFGEQALNRGIAAVKEDAVEMLASYGLAYSLMKFFTGPMSDFKNVGLVFVNSKRDRTKAVLCMVVAGAIAAVFHTLIAYSDLGYYIINKLHHVDESVGSKTRRAFLYLAAFPFMDAMAWTHAGILLKHKYSFLVGCASISDVIAQVVFVAILLHSHLECREPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLALILATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYPAFDKNNPSNKLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCVVPLRIFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLVVLPYLGVHGATLGVGSLLAGFVGESTMVAIAACYVYRKQKKKMENESATEGEDSAMTDMPPTEEVTDIVEMREENE,mutated_sequence,1.0,492.0,NP_473368.1.a2m,NP_473368.1.npy,ClinVar
+NP_477352.3,NP_477352.3.csv,MAAAPARGGGGGGGGGGGCSGSGSSASRGFYFNTVLSLARSLAVQRPASLEKVQKLLCMCPVDFHGIFQLDERRRDAVIALGIFLIESDLQHKDCVVPYLLRLLKGLPKVYWVEESTARKGRGALPVAESFSFCLVTLLSDVAYRDPSLRDEILEVLLQVLHVLLGMCQALEIQDKEYLCKYAIPCLIGISRAFGRYSNMEESLLSKLFPKIPPHSLRVLEELEGVRRRSFNDFRSILPSNLLTVCQEGTLKRKTSSVSSISQVSPERGMPPPSSPGGSAFHYFEASCLPDGTALEPEYYFSTISSSFSVSPLFNGVTYKEFNIPLEMLRELLNLVKKIVEEAVLKSLDAIVASVMEANPSADLYYTSFSDPLYLTMFKMLRDTLYYMKDLPTSFVKEIHDFVLEQFNTSQGELQKILHDADRIHNELSPLKLRCQANAACVDLMVWAVKDEQGAENLCIKLSEKLQSKTSSKVIIAHLPLLICCLQGLGRLCERFPVVVHSVTPSLRDFLVIPSPVLVKLYKYHSQYHTVAGNDIKISVTNEHSESTLNVMSGKKSQPSMYEQLRDIAIDNICRCLKAGLTVDPVIVEAFLASLSNRLYISQESDKDAHLIPDHTIRALGHIAVALRDTPKVMEPILQILQQKFCQPPSPLDVLIIDQLGCLVITGNQYIYQEVWNLFQQISVKASSVVYSATKDYKDHGYRHCSLAVINALANIAANIQDEHLVDELLMNLLELFVQLGLEGKRASERASEKGPALKASSSAGNLGVLIPVIAVLTRRLPPIKEAKPRLQKLFRDFWLYSVLMGFAVEGSGLWPEEWYEGVCEIATKSPLLTFPSKEPLRSVLQYNSAMKNDTVTPAELSELRSTIINLLDPPPEVSALINKLDFAMSTYLLSVYRLEYMRVLRSTDPDRFQVMFCYFEDKAIQKDKSGMMQCVIAVADKVFDAFLNMMADKAKTKENEEELERHAQFLLVNFNHIHKRIRRVADKYLSGLVDKFPHLLWSGTVLKTMLDILQTLSLSLSADIHKDQPYYDIPDAPYRITVPDTYEARESIVKDFAARCGMILQEAMKWAPTVTKSHLQEYLNKHQNWVSGLSQHTGLAMATESILHFAGYNKQNTTLGATQLSERPACVKKDYSNFMASLNLRNRYAGEVYGMIRFSGTTGQMSDLNKMMVQDLHSALDRSHPQHYTQAMFKLTAMLISSKDCDPQLLHHLCWGPLRMFNEHGMETALACWEWLLAGKDGVEVPFMREMAGAWHMTVEQKFGLFSAEIKEADPLAASEASQPKPCPPEVTPHYIWIDFLVQRFEIAKYCSSDQVEIFSSLLQRSMSLNIGGAKGSMNRHVAAIGPRFKLLTLGLSLLHADVVPNATIRNVLREKIYSTAFDYFSCPPKFPTQGEKRLREDISIMIKFWTAMFSDKKYLTASQLVPPDNQDTRSNLDITVGSRQQATQGWINTYPLSSGMSTISKKSGMSKKTNRGSQLHKYYMKRRTLLLSLLATEIERLITWYNPLSAPELELDQAGENSVANWRSKYISLSEKQWKDNVNLAWSISPYLAVQLPARFKNTEAIGNEVTRLVRLDPGAVSDVPEAIKFLVTWHTIDADAPELSHVLCWAPTDPPTGLSYFSSMYPPHPLTAQYGVKVLRSFPPDAILFYIPQIVQALRYDKMGYVREYILWAASKSQLLAHQFIWNMKTNIYLDEEGHQKDPDIGDLLDQLVEEITGSLSGPAKDFYQREFDFFNKITNVSAIIKPYPKGDERKKACLSALSEVKVQPGCYLPSNPEAIVLDIDYKSGTPMQSAAKAPYLAKFKVKRCGVSELEKEGLRCRSDSEDECSTQEADGQKISWQAAIFKVGDDCRQDMLALQIIDLFKNIFQLVGLDLFVFPYRVVATAPGCGVIECIPDCTSRDQLGRQTDFGMYDYFTRQYGDESTLAFQQARYNFIRSMAAYSLLLFLLQIKDRHNGNIMLDKKGHIIHIDFGFMFESSPGGNLGWEPDIKLTDEMVMIMGGKMEATPFKWFMEMCVRGYLAVRPYMDAVVSLVTLMLDTGLPCFRGQTIKLLKHRFSPNMTEREAANFIMKVIQSCFLSNRSRTYDMIQYYQNDIPY,mutated_sequence,1.0,2102.0,NP_477352.3.a2m,NP_477352.3.npy,ClinVar
+NP_510965.1,NP_510965.1.csv,MATATIALQVNGQQGGGSEPAAAAAVVAAGDKWKPPQGTDSIKMENGQSTAAKLGLPPLTPEQQEALQKAKKYAMEQSIKSVLVKQTIAHQQQQLTNLQMAAVTMGFGDPLSPLQSMAAQRQRALAIMCRVYVGSIYYELGEDTIRQAFAPFGPIKSIDMSWDSVTMKHKGFAFVEYEVPEAAQLALEQMNSVMLGGRNIKVGRPSNIGQAQPIIDQLAEEARAFNRIYVASVHQDLSDDDIKSVFEAFGKIKSCTLARDPTTGKHKGYGFIEYEKAQSSQDAVSSMNLFDLGGQYLRVGKAVTPPMPLLTPATPGGLPPAAAVAAAAATAKITAQEAVAGAAVLGTLGTPGLVSPALTLAQPLGTLPQAVMAAQAPGVITGVTPARPPIPVTIPSVGVVNPILASPPTLGLLEPKKEKEEEELFPESERPEMLSEQEHMSISGSSARHMVMQKLLRKQESTVMVLRNMVDPKDIDDDLEGEVTEECGKFGAVNRVIIYQEKQGEEEDAEIIVKIFVEFSIASETHKAIQALNGRWFAGRKVVAEVYDQERFDNSDLSA,mutated_sequence,1.0,559.0,NP_510965.1.a2m,NP_510965.1.npy,ClinVar
+NP_542172.2,NP_542172.2.csv,MKLLRRAWRRRAALGLGTLALCGAALLYLARCAAEPGDPRAMSGRSPPPPAPARAAAFLAVLVASAPRAAERRSVIRSTWLARRGAPGDVWARFAVGTAGLGAEERRALEREQARHGDLLLLPALRDAYENLTAKVLAMLAWLDEHVAFEFVLKADDDSFARLDALLAELRAREPARRRRLYWGFFSGRGRVKPGGRWREAAWQLCDYYLPYALGGGYVLSADLVHYLRLSRDYLRAWHSEDVSLGAWLAPVDVQREHDPRFDTEYRSRGCSNQYLVTHKQSLEDMLEKHATLAREGRLCKREVQLRLSYVYDWSAPPSQCCQRREGIP,mutated_sequence,1.0,329.0,NP_542172.2.a2m,NP_542172.2.npy,ClinVar
+NP_542411.2,NP_542411.2.csv,MERCSRCHRLLLLLPLVLGLSAAPGWAGAPPVDVLRALRFPSLPDGVRRAKGICPADVAYRVARPAQLSAPTRQLFPGGFPKDFSLLTVVRTRPGLQAPLLTLYSAQGVRQLGLELGRPVRFLYEDQTGRPQPPSQPVFRGLSLADGKWHRVAVAVKGQSVTLIVDCKKRVTRPLPRSARPVLDTHGVIIFGARILDEEVFEGDVQELAIVPGVQAAYESCEQKELECEGGQRERPQNQQPHRAQRSPQQQPSRLHRPQNQEPQSQPTESLYYDYEPPYYDVMTTGTTPDYQDPTPGEEEEILESSLLPPLEEEQTDLQVPPTADRFQAEEYGEGGTDPPEGPYDYTYGYGDDYREETELGPALSAETAHSGAAAHGPRGLKGEKGEPAVLEPGMLVEGPPGPEGPAGLIGPPGIQGNPGPVGDPGERGPPGRAGLPGSDGAPGPPGTSLMLPFRFGSGGGDKGPVVAAQEAQAQAILQQARLALRGPPGPMGYTGRPGPLGQPGSPGLKGESGDLGPQGPRGPQGLTGPPGKAGRRGRAGADGARGMPGDPGVKGDRGFDGLPGLPGEKGHRGDTGAQGLPGPPGEDGERGDDGEIGPRGLPGESGPRGLLGPKGPPGIPGPPGVRGMDGPQGPKGSLGPQGEPGPPGQQGTPGTQGLPGPQGAIGPHGEKGPQGKPGLPGMPGSDGPPGHPGKEGPPGTKGNQGPSGPQGPLGYPGPRGVKGVDGIRGLKGHKGEKGEDGFPGFKGDIGVKGDRGEVGVPGSRGEDGPEGPKGRTGPTGDPGPPGLMGEKGKLGVPGLPGYPGRQGPKGSLGFPGFPGASGEKGARGLSGKSGPRGERGPTGPRGQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPMGERGHPGPPGPPGEQGLPGTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSPGERGAAGSGGPIGPPGRPGPQGPPGAAGEKGVPGEKGPIGPTGRDGVQGPVGLPGPAGPPGVAGEDGDKGEVGDPGQKGTKGNKGEHGPPGPPGPIGPVGQPGAAGADGEPGARGPQGHFGAKGDEGTRGFNGPPGPIGLQGLPGPSGEKGETGDVGPMGPPGPPGPRGPAGPNGADGPQGPPGGVGNLGPPGEKGEPGESGSPGIQGEPGVKGPRGERGEKGESGQPGEPGPPGPKGPTGDDGPKGNPGPVGFPGDPGPPGEGGPRGQDGAKGDRGEDGEPGQPGSPGPTGENGPPGPLGKRGPAGSPGSEGRQGGKGAKGDPGAIGAPGKTGPVGPAGPAGKPGPDGLRGLPGSVGQQGRPGATGQAGPPGPVGPPGLPGLRGDAGAKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSPGQKGEMGIPGASGPIGPGGPPGLPGPAGPKGAKGATGPGGPKGEKGVQGPPGHPGPPGEVIQPLPIQMPKKTRRSVDGSRLMQEDEAIPTGGAPGSPGGLEEIFGSLDSLREEIEQMRRPTGTQDSPARTCQDLKLCHPELPDGEYWVDPNQGCARDAFRVFCNFTAGGETCVTPRDDVTQFSYVDSEGSPVGVVQLTFLRLLSVSAHQDVSYPCSGAARDGPLRLRGANEDELSPETSPYVKEFRDGCQTQQGRTVLEVRTPVLEQLPVLDASFSDLGAPPRRGGVLLGPVCFMG,mutated_sequence,1.0,1736.0,NP_542411.2.a2m,NP_542411.2.npy,ClinVar
+NP_570850.2,NP_570850.2.csv,MWRLRRAAVACEVCQSLVKHSSGIKGSLPLQKLHLVSRSIYHSHHPTLKLQRPQLRTSFQQFSSLTNLPLRKLKFSPIKYGYQPRRNFWPARLATRLLKLRYLILGSAVGGGYTAKKTFDQWKDMIPDLSEYKWIVPDIVWEIDEYIDFEKIRKALPSSEDLVKLAPDFDKIVESLSLLKDFFTSGHKLVSEVIGASDLLLLLGSPEETAFRATDRGSESDKHFRKGLLGELILLQQQIQEHEEEARRAAGQYSTSYAQQKRKVSDKEKIDQLQEELLHTQLKYQRILERLEKENKELRKLVLQKDDKGIHHRKLKKSLIDMYSEVLDVLSDYDASYNTQDHLPRVVVVGDQSAGKTSVLEMIAQARIFPRGSGEMMTRSPVKVTLSEGPHHVALFKDSSREFDLTKEEDLAALRHEIELRMRKNVKEGCTVSPETISLNVKGPGLQRMVLVDLPGVINTVTSGMAPDTKETIFSISKAYMQNPNAIILCIQDGSVDAERSIVTDLVSQMDPHGRRTIFVLTKVDLAEKNVASPSRIQQIIEGKLFPMKALGYFAVVTGKGNSSESIEAIREYEEEFFQNSKLLKTSMLKAHQVTTRNLSLAVSDCFWKMVRESVEQQADSFKATRFNLETEWKNNYPRLRELDRNELFEKAKNEILDEVISLSQVTPKHWEEILQQSLWERVSTHVIENIYLPAAQTMNSGTFNTTVDIKLKQWTDKQLPNKAVEVAWETLQEEFSRFMTEPKGKEHDDIFDKLKEAVKEESIKRHKWNDFAEDSLRVIQHNALEDRSISDKQQWDAAIYFMEEALQARLKDTENAIENMVGPDWKKRWLYWKNRTQEQCVHNETKNELEKMLKCNEEHPAYLASDEITTVRKNLESRGVEVDPSLIKDTWHQVYRRHFLKTALNHCNLCRRGFYYYQRHFVDSELECNDVVLFWRIQRMLAITANTLRQQLTNTEVRRLEKNVKEVLEDFAEDGEKKIKLLTGKRVQLAEDLKKVREIQEKLDAFIEALHQEK,mutated_sequence,1.0,1015.0,NP_570850.2.a2m,NP_570850.2.npy,ClinVar
+NP_570854.1,NP_570854.1.csv,MATACKRSGEPQSDDIEASRMKRAAAKHLIERYYHQLTEGCGNEACTNEFCASCPTFLRMDNNAAAIKALELYKINAKLCDPHPSKKGASSAYLENSKGAPNNSCSEIKMNKKGARIDFKDVTYLTEEKVYEILELCREREDYSPLIRVIGRVFSSAEALVQSFRKVKQHTKEELKSLQAKDEDKDEDEKEKAACSAAAMEEDSEASSSRIGDSSQGDNNLQKLGPDDVSVDIDAIRRVYTRLLSNEKIETAFLNALVYLSPNVECDLTYHNVYSRDPNYLNLFIIVMENRNLHSPEYLEMALPLFCKAMSKLPLAAQGKLIRLWSKYNADQIRRMMETFQQLITYKVISNEFNSRNLVNDDDAIVAASKCLKMVYYANVVGGEVDTNHNEEDDEEPIPESSELTLQELLGEERRNKKGPRVDPLETELGVKTLDCRKPLIPFEEFINEPLNEVLEMDKDYTFFKVETENKFSFMTCPFILNAVTKNLGLYYDNRIRMYSERRITVLYSLVQGQQLNPYLRLKVRRDHIIDDALVRLEMIAMENPADLKKQLYVEFEGEQGVDEGGVSKEFFQLVVEEIFNPDIGMFTYDESTKLFWFNPSSFETEGQFTLIGIVLGLAIYNNCILDVHFPMVVYRKLMGKKGTFRDLGDSHPVLYQSLKDLLEYEGNVEDDMMITFQISQTDLFGNPMMYDLKENGDKIPITNENRKEFVNLYSDYILNKSVEKQFKAFRRGFHMVTNESPLKYLFRPEEIELLICGSRNLDFQALEETTEYDGGYTRDSVLIREFWEIVHSFTDEQKRLFLQFTTGTDRAPVGGLGKLKMIIAKNGPDTERLPTSHTCFNVLLLPEYSSKEKLKERLLKAITYAKGFGML,mutated_sequence,1.0,872.0,NP_570854.1.a2m,NP_570854.1.npy,ClinVar
+NP_573566.2,NP_573566.2.csv,MAALLRSARWLLRAGAAPRLPLSLRLLPGGPGRLHAASYLPAARAGPVAGGLLSPARLYAIAAKEKDIQEESTFSSRKISNQFDWALMRLDLSVRRTGRIPKKLLQKVFNDTCRSGGLGGSHALLLLRSCGSLLPELKLEERTEFAHRIWDTLQKLGAVYDVSHYNALLKVYLQNEYKFSPTDFLAKMEEANIQPNRVTYQRLIASYCNVGDIEGASKILGFMKTKDLPVTEAVFSALVTGHARAGDMENAENILTVMRDAGIEPGPDTYLALLNAYAEKGDIDHVKQTLEKVEKSELHLMDRDLLQIIFSFSKAGYPQYVSEILEKVTCERRYIPDAMNLILLLVTEKLEDVALQILLACPVSKEDGPSVFGSFFLQHCVTMNTPVEKLTDYCKKLKEVQMHSFPLQFTLHCALLANKTDLAKALMKAVKEEGFPIRPHYFWPLLVGRRKEKNVQGIIEILKGMQELGVHPDQETYTDYVIPCFDSVNSARAILQENGCLSDSDMFSQAGLRSEAANGNLDFVLSFLKSNTLPISLQSIRSSLLLGFRRSMNINLWSEITELLYKDGRYCQEPRGPTEAVGYFLYNLIDSMSDSEVQAKEEHLRQYFHQLEKMNVKIPENIYRGIRNLLESYHVPELIKDAHLLVESKNLDFQKTVQLTSSELESTLETLKAENQPIRDVLKQLILVLCSEENMQKALELKAKYESDMVTGGYAALINLCCRHDKVEDALNLKEEFDRLDSSAVLDTGKYVGLVRVLAKHGKLQDAINILKEMKEKDVLIKDTTALSFFHMLNGAALRGEIETVKQLHEAIVTLGLAEPSTNISFPLVTVHLEKGDLSTALEVAIDCYEKYKVLPRIHDVLCKLVEKGETDLIQKAMDFVSQEQGEMVMLYDLFFAFLQTGNYKEAKKIIETPGIRARSARLQWFCDRCVANNQVETLEKLVELTQKLFECDRDQMYYNLLKLYKINGDWQRADAVWNKIQEENVIPREKTLRLLAEILREGNQEVPFDVPELWYEDEKHSLNSSSASTTEPDFQKDILIACRLNQKKGAYDIFLNAKEQNIVFNAETYSNLIKLLMSEDYFTQAMEVKAFAETHIKGFTLNDAANSRLIITQVRRDYLKEAVTTLKTVLDQQQTPSRLAVTRVIQALAMKGDVENIEVVQKMLNGLEDSIGLSKMVFINNIALAQIKNNNIDAAIENIENMLTSENKVIEPQYFGLAYLFRKVIEEQLEPAVEKISIMAERLANQFAIYKPVTDFFLQLVDAGKVDDARALLQRCGAIAEQTPILLLFLLRNSRKQGKASTVKSVLELIPELNEKEEAYNSLMKSYVSEKDVTSAKALYEHLTAKNTKLDDLFLKRYASLLKYAGEPVPFIEPPESFEFYAQQLRKLRENSS,mutated_sequence,1.0,1394.0,NP_573566.2.a2m,NP_573566.2.npy,ClinVar
+NP_597677.2,NP_597677.2.csv,MNGDMPHVPITTLAGIASLTDLLNQLPLPSPLPATTTKSLLFNARIAEEVNCLLACRDDNLVSQLVHSLNQVSTDHIELKDNLGSDDPEGDIPVLLQAVLARSPNVFREKSMQNRYVQSGMMMSQYKLSQNSMHSSPASSNYQQTTISHSPSSRFVPPQTSSGNRFMPQQNSPVPSPYAPQSPAGYMPYSHPSSYTTHPQMQQASVSSPIVAGGLRNIHDNKVSGPLSGNSANHHADNPRHGSSEDYLHMVHRLSSDDGDSSTMRNAASFPLRSPQPVCSPAGSEGTPKGSRPPLILQSQSLPCSSPRDVPPDILLDSPERKQKKQKKMKLGKDEKEQSEKAAMYDIISSPSKDSTKLTLRLSRVRSSDMDQQEDMISGVENSNVSENDIPFNVQYPGQTSKTPITPQDINRPLNAAQCLSQQEQTAFLPANQVPVLQQNTSVAAKQPQTSVVQNQQQISQQGPIYDEVELDALAEIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGRVDSQASITQDSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMKQSESRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERHRGDQSRVRRPETLRSSSRNEHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRPDSPRVKQGDSNKSRSDKLGFKSPTSKDDKRTEGNKSKVDTNKAHPDNKAEFPSYLLGGRSGALKNFVIPKIKRDKDGNVTQETKKMEMKGEPKDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVNEKPKYAEISSDEDNDSDEAFESSRKRHKKDDDKAWEYEERDRRSSGDHRRSGHSHEGRRSSGGGRYRNRSPSDSDMEDYSPPPSLSEVARKMKKKEKQKKRKAYEPKLTPEEMMDSSTFKRFTASIENILDNLEDMDFTAFGDDDEIPQELLLGKHQLNELGSESAKIKAMGIMDKLSTDKTVKVLNILEKNIQDGSKLSTLLNHNNDTEEEERLWRDLIMERVTKSADACLTTINIMTSPNMPKAVYIEDVIERVIQYTKFHLQNTLYPQYDPVYRLDPHGGGLLSSKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPFFVENVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDMDGEPMYIQMVTALVLQLIQCVVHLPSSEKDSNAEEDSNKKIDQDVVITNSYETAMRTAQNFLSIFLKKCGSKQGEEDYRPLFENFVQDLLSTVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTSKMDQGSIERILKQVSGGEDEIQQLQKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKSQKDEESSEGTHHAKEIETTGQIMHRAENRKKFLRSIIKTTPSQFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLDMQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVKMIRRVNDEEGIKKLVNETFQKLWFTPTPHNDKEAMTRKILNITDVVAACRDTGYDWFEQLLQNLLKSEEDSSYKPVKKACTQLVDNLVEHILKYEESLADSDNKGVNSGRLVACITTLFLFSKIRPQLMVKHAMTMQPYLTTKCSTQNDFMVICNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQHQEDPNNTSLLTNKPALLRSLFTVGALCRHFDFDLEDFKGNSKVNIKDKVLELLMYFTKHSDEEVQTKAIIGLGFAFIQHPSLMFEQEVKNLYNNILSDKNSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQEDLKEMGDVSSGMSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIHMKAVAGMKMSYQVQQAINTCLKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDDTAKTDVTMLLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSFKESMVKDKRKERKSSPSKENESSDSEEEVSRPRKSRKRVDSDSDSDSEDDINSVMKCLPENSAPLIEFANVSQGILLLLMLKQHLKNLCGFSDSKIQKYSPSESAKVYDKAINRKTGVHFHPKQTLDFLRSDMANSKITEEVKRSIVKQYLDFKLLMEHLDPDEEEEEGEVSASTNARNKAITSLLGGGSPKNNTAAETEDDESDGEDRGGGTSGSLRRSKRNSDSTELAAQMNESVDVMDVIAICCPKYKDRPQIARVVQKTSSGFSVQWMAGSYSGSWTEAKRRDGRKLVPWVDTIKESDIIYKKIALTSANKLTNKVVQTLRSLYAAKDGTSS,mutated_sequence,1.0,2804.0,NP_597677.2.a2m,NP_597677.2.npy,ClinVar
+NP_598004.1,NP_598004.1.csv,MLKQSERRRSWSYRPWNTTENEGSQHRRSICSLGARSGSQASIHGWTEGNYNYYIEEDEDGEEEDQWKDDLAEEDQQAGEVTTAKPEGPSDPPALLSTLNVNVGGHSYQLDYCELAGFPKTRLGRLATSTSRSRQLSLCDDYEEQTDEYFFDRDPAVFQLVYNFYLSGVLLVLDGLCPRRFLEELGYWGVRLKYTPRCCRICFEERRDELSERLKIQHELRAQAQVEEAEELFRDMRFYGPQRRRLWNLMEKPFSSVAAKAIGVASSTFVLVSVVALALNTVEEMQQHSGQGEGGPDLRPILEHVEMLCMGFFTLEYLLRLASTPDLRRFARSALNLVDLVAILPLYLQLLLECFTGEGHQRGQTVGSVGKVGQVLRVMRLMRIFRILKLARHSTGLRAFGFTLRQCYQQVGCLLLFIAMGIFTFSAAVYSVEHDVPSTNFTTIPHSWWWAAVSISTVGYGDMYPETHLGRFFAFLCIAFGIILNGMPISILYNKFSDYYSKLKAYEYTTIRRERGEVNFMQRARKKIAECLLGSNPQLTPRQEN,mutated_sequence,1.0,545.0,NP_598004.1.a2m,NP_598004.1.npy,ClinVar
+NP_612396.1,NP_612396.1.csv,MESTLGAGIVIAEALQNQLAWLENVWLWITFLGDPKILFLFYFPAAYYASRRVGIAVLWISLITEWLNLIFKWFLFGDRPFWWVHESGYYSQAPAQVHQFPSSCETGPGSPSGHCMITGAALWPIMTALSSQVATRARSRWVRVMPSLAYCTFLLAVGLSRIFILAHFPHQVLAGLITGAVLGWLMTPRVPMERELSFYGLTALALMLGTSLIYWTLFTLGLDLSWSISLAFKWCERPEWIHVDSRPFASLSRDSGAALGLGIALHSPCYAQVRRAQLGNGQKIACLVLAMGLLGPLDWLGHPPQISLFYIFNFLKYTLWPCLVLALVPWAVHMFSAQEAPPIHSS,mutated_sequence,1.0,346.0,NP_612396.1.a2m,NP_612396.1.npy,ClinVar
+NP_612422.2,NP_612422.2.csv,MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENLHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVEMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPVDAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGGPCRAPLQELSPAEEEALRMDFTSNGWL,mutated_sequence,1.0,327.0,NP_612422.2.a2m,NP_612422.2.npy,ClinVar
+NP_612486.2,NP_612486.2.csv,MAAVLESLLREEVSVAAVVRWIARSTQGSEDNAGEAAALSSLRALRKEFVPFLLNFLREQSSRVLPQGPPTPAKTPGASAALPGRPGGPPRGSRGARSQLFPPTEAQSTAAEAPLARRGGRRRGPGPARERGGRGLEEGVSGESLPGAGGRRLRGSGSPSRPSLTLSDPPNLSNLEEFPPVGSVPPGPTGTKPSRRINPTPVSEERSLSKPKTCFTSPPISCVPSSQPSALDTSPWGLGLPPGCRSLQEEREMLRKERSKQLQQSPTPTCPTPELGSPLPSRTGSLTDEPADPARVSSRQRLELVALVYSSCIAENLVPNLFLELFFVFQLLTARRMVTAKDSDPELSPAVLDSLESPLFQSIHDCVFFAVQVLECHFQVLSNLDKGTLKLLAENERLLCFSPALQGRLRAAYEGSVAKVSLVMPPSTQAVSFQPETDNRANFSSDRAFHTFKKQRDVFYEVLREWEDHHEEPGWDFEKGLGSRIRAMMGQLSAACSHSHFVRLFQKQLLQMCQSPGGAGGTVLGEAPDVLSMLGADKLGRLWRLQERLMAPQSSGGPCPPPTFPGCQGFFRDFILSASSFQFNQHLMDSLSLKIQELNGLALPQHEPNDEDGESDVDWQGERKQFAVVLLSLRLLAKFLGFVAFLPYRGPEPPPTGELQDSILALRSQVPPVLDVRTLLQRGLQARRAVLTVPWLVEFLSFADHVVPLLEYYRDIFTLLLRLHRSLVLSQESEGKMCFLNKLLLLAVLGWLFQIPTVPEDLFFLEEGPSYAFEVDTVAPEHGLDNAPVVDQQLLYTCCPYIGELRKLLASWVSGSSGRSGGFMRKITPTTTTSLGAQPSQTSQGLQAQLAQAFFHNQPPSLRRTVEFVAERIGSNCVKHIKATLVADLVRQAESLLQEQLVTQGEEGGDPAQLLEILCSQLCPHGAQALALGREFCQRKSPGAVRALLPEETPAAVLSSAENIAVGLATEKACAWLSANITALIRREVKAAVSRTLRAQGPEPAARGERRGCSRACEHHAPLPSHLISEIKDVLSLAVGPRDPDEGVSPEHLEQLLGQLGQTLRCRQFLCPPAEQHLAKCSVELASLLVADQIPILGPPAQYRLERGQARRLLHMLLSLWKEDFQGPVPLQLLLSPRNVGLLADTRPREWDLLLFLLRELVEKGLMGRMEIEACLGSLHQAQWPGDFAEELATLSNLFLAEPHLPEPQLRACELVQPNRGTVLAQS,mutated_sequence,1.0,1227.0,NP_612486.2.a2m,NP_612486.2.npy,ClinVar
+NP_620130.2,NP_620130.2.csv,MDSNHQSNYKLSKTEKKFLRKQIKAKHTLLRHEGIETVSYATQSLVVANGGLGNGVSRNQLLPVLEKCGLVDALLMPPNKPYSFARYRTTEESKRAYVTLNGKEVVDDLGQKITLYLNFVEKVQWKELRPQALPPGLMVVEEIISSEEEKMLLESVDWTEDTDNQNSQKSLKHRRVKHFGYEFHYENNNVDKDKPLSGGLPDICESFLEKWLRKGYIKHKPDQMTINQYEPGQGIPAHIDTHSAFEDEIVSLSLGSEIVMDFKHPDGIAVPVMLPRRSLLVMTGESRYLWTHGITCRKFDTVQASESLKSGIITSDVGDLTLSKRGLRTSFTFRKVRQTPCNCSYPLVCDSQRKETPPSFPESDKEASRLEQEYVHQVYEEIAGHFSSTRHTPWPHIVEFLKALPSGSIVADIGCGNGKYLGINKELYMIGCDRSQNLVDICRERQFQAFVCDALAVPVRSGSCDACISIAVIHHFATAERRVAALQEIVRLLRPGGKALIYVWAMEQEYNKQKSKYLRGNRNSQGKKEEMNSDTSVQRSLVEQMRDMGSRDSASSVPRINDSQEGGCNSRQVSNSKLPVHVNRTSFYSQDVLVPWHLKGNPDKGKPVEPFGPIGSQDPSPVFHRYYHVFREGELEGACRTVSDVRILQSYYDQGNWCVILQKA,mutated_sequence,1.0,664.0,NP_620130.2.a2m,NP_620130.2.npy,ClinVar
+NP_620596.2,NP_620596.2.csv,MHQRHPRARCPPLCVAGILACGFLLGCWGPSHFQQSCLQALEPQAVSSYLSPGAPLKGRPPSPGFQRQRQRQRRAAGGILHLELLVAVGPDVFQAHQEDTERYVLTNLNIGAELLRDPSLGAQFRVHLVKMVILTEPEGAPNITANLTSSLLSVCGWSQTINPEDDTDPGHADLVLYITRFDLELPDGNRQVRGVTQLGGACSPTWSCLITEDTGFDLGVTIAHEIGHSFGLEHDGAPGSGCGPSGHVMASDGAAPRAGLAWSPCSRRQLLSLLSAGRARCVWDPPRPQPGSAGHPPDAQPGLYYSANEQCRVAFGPKAVACTFAREHLDMCQALSCHTDPLDQSSCSRLLVPLLDGTECGVEKWCSKGRCRSLVELTPIAAVHGRWSSWGPRSPCSRSCGGGVVTRRRQCNNPRPAFGGRACVGADLQAEMCNTQACEKTQLEFMSQQCARTDGQPLRSSPGGASFYHWGAAVPHSQGDALCRHMCRAIGESFIMKRGDSFLDGTRCMPSGPREDGTLSLCVSGSCRTFGCDGRMDSQQVWDRCQVCGGDNSTCSPRKGSFTAGRAREYVTFLTVTPNLTSVYIANHRPLFTHLAVRIGGRYVVAGKMSISPNTTYPSLLEDGRVEYRVALTEDRLPRLEEIRIWGPLQEDADIQVYRRYGEEYGNLTRPDITFTYFQPKPRQAWVWAAVRGPCSVSCGAGLRWVNYSCLDQARKELVETVQCQGSQQPPAWPEACVLEPCPPYWAVGDFGPCSASCGGGLRERPVRCVEAQGSLLKTLPPARCRAGAQQPAVALETCNPQPCPARWEVSEPSSCTSAGGAGLALENETCVPGADGLEAPVTEGPGSVDEKLPAPEPCVGMSCPPGWGHLDATSAGEKAPSPWGSIRTGAQAAHVWTPAAGSCSVSCGRGLMELRFLCMDSALRVPVQEELCGLASKPGSRREVCQAVPCPARWQYKLAACSVSCGRGVVRRILYCARAHGEDDGEEILLDTQCQGLPRPEPQEACSLEPCPPRWKVMSLGPCSASCGLGTARRSVACVQLDQGQDVEVDEAACAALVRPEASVPCLIADCTYRWHVGTWMECSVSCGDGIQRRRDTCLGPQAQAPVPADFCQHLPKPVTVRGCWAGPCVGQGACGRQHLEPTGTIDMRGPGQADCAVAIGRPLGEVVTLRVLESSLNCSAGDMLLLWGRLTWRKMCRKLLDMTFSSKTNTLVVRQRCGRPGGGVLLRYGSQLAPETFYRECDMQLFGPWGEIVSPSLSPATSNAGGCRLFINVAPHARIAIHALATNMGAGTEGANASYILIRDTHSLRTTAFHGQQVLYWESESSQAEMEFSEGFLKAQASLRGQYWTLQSWVPEMQDPQSWKGKEGT,mutated_sequence,1.0,1371.0,NP_620596.2.a2m,NP_620596.2.npy,ClinVar
+NP_644805.1,NP_644805.1.csv,MAQWNQLQQLDTRYLEQLHQLYSDSFPMELRQFLAPWIESQDWAYAASKESHATLVFHNLLGEIDQQYSRFLQESNVLYQHNLRRIKQFLQSRYLEKPMEIARIVARCLWEESRLLQTAATAAQQGGQANHPTAAVVTEKQQMLEQHLQDVRKRVQDLEQKMKVVENLQDDFDFNYKTLKSQGDMQDLNGNNQSVTRQKMQQLEQMLTALDQMRRSIVSELAGLLSAMEYVQKTLTDEELADWKRRQQIACIGGPPNICLDRLENWITSLAESQLQTRQQIKKLEELQQKVSYKGDPIVQHRPMLEERIVELFRNLMKSAFVVERQPCMPMHPDRPLVIKTGVQFTTKVRLLVKFPELNYQLKIKVCIDKDSGDVAALRGSRKFNILGTNTKVMNMEESNNGSLSAEFKHLTLREQRCGNGGRANCDASLIVTEELHLITFETEVYHQGLKIDLETHSLPVVVISNICQMPNAWASILWYNMLTNNPKNVNFFTKPPIGTWDQVAEVLSWQFSSTTKRGLSIEQLTTLAEKLLGPGVNYSGCQITWAKFCKENMAGKGFSFWVWLDNIIDLVKKYILALWNEGYIMGFISKERERAILSTKPPGTFLLRFSESSKEGGVTFTWVEKDISGKTQIQSVEPYTKQQLNNMSFAEIIMGYKIMDATNILVSPLVYLYPDIPKEEAFGKYCRPESQEHPEADPGSAAPYLKTKFICVTPTTCSNTIDLPMSPRTLDSLMQFGNNGEGAEPSAGGQFESLTFDMELTSECATSPM,mutated_sequence,1.0,770.0,NP_644805.1.a2m,NP_644805.1.npy,ClinVar
+NP_653197.2,NP_653197.2.csv,MSSEMEPLLLAWSYFRRRKFQLCADLCTQMLEKSPYDQEPDPELPVHQAAWILKARALTEMVYIDEIDVDQEGIAEMMLDENAIAQVPRPGTSLKLPGTNQTGGPSQAVRPITQAGRPITGFLRPSTQSGRPGTMEQAIRTPRTAYTARPITSSSGRFVRLGTASMLTSPDGPFINLSRLNLTKYSQKPKLAKALFEYIFHHENDVKTALDLAALSTEHSQYKDWWWKVQIGKCYYRLGMYREAEKQFKSALKQQEMVDTFLYLAKVYVSLDQPVTALNLFKQGLDKFPGEVTLLCGIARIYEEMNNMSSAAEYYKEVLKQDNTHVEAIACIGSNHFYSDQPEIALRFYRRLLQMGIYNGQLFNNLGLCCFYAQQYDMTLTSFERALSLAENEEEAADVWYNLGHVAVGIGDTNLAHQCFRLALVNNNNHAEAYNNLAVLEMRKGHVEQARALLQTASSLAPHMYEPHFNFATISDKIGDLQRSYVAAQKSEAAFPDHVDTQHLIKQLRQHFAML,mutated_sequence,1.0,515.0,NP_653197.2.a2m,NP_653197.2.npy,ClinVar
+NP_658985.2,NP_658985.2.csv,MSRLRALLGLGLLVAGSRVPRIKSQTIACRSGPTWWGPQRLNSGGRWDSEVMASTVVKYLSQEEAQAVDQELFNEYQFSVDQLMELAGLSCATAIAKAYPPTSMSRSPPTVLVICGPGNNGGDGLVCARHLKLFGYEPTIYYPKRPNKPLFTALVTQCQKMDIPFLGEMPAEPMTIDELYELVVDAIFGFSFKGDVREPFHSILSVLKGLTVPIASIDIPSGWDVEKGNAGGIQPDLLISLTAPKKSATQFTGRYHYLGGRFVPPALEKKYQLNLPPYPDTECVYRLQ,mutated_sequence,1.0,288.0,NP_658985.2.a2m,NP_658985.2.npy,ClinVar
+NP_659434.2,NP_659434.2.csv,MNAIVALCHFCELHGPRTLFCTEVLHAPLPQGDGNEDSPGQGEQAEEEEGGIQMNSRMRAHSPAEGASVESSSPGPKKSDMCEGCRSLAAGHPGYISHDKETSIKYVSHQHPSHPQLFSIVRQACVRSLSCEVCPGREGPIFFGDEQHGFVFSHTFFIKDSLARGFQRWYSIITIMMDRIYLINSWPFLLGKVRGIIDELQGKALKVFEAEQFGCPQRAQRMNTAFTPFLHQRNGNAARSLTSLTSDDNLWACLHTSFAWLLKACGSRLTEKLLEGAPTEDTLVQMEKLADLEEESESWDNSEAEEEEKAPVLPESTEGRELTQGPAESSSLSGCGSWQPRKLPVFKSLRHMRQVLGAPSFRMLAWHVLMGNQVIWKSRDVDLVQSAFEVLRTMLPVGCVRIIPYSSQYEEAYRCNFLGLSPHVQIPPHVLSSEFAVIVEVHAAARSTLHPVGCEDDQSLSKYEFVVTSGSPVAADRVGPTILNKIEAALTNQNLSVDVVDQCLVCLKEEWMNKVKVLFKFTKVDSRPKEDTQKLLSILGASEEDNVKLLKFWMTGLSKTYKSHLMSTVRSPTASESRN,mutated_sequence,1.0,579.0,NP_659434.2.a2m,NP_659434.2.npy,ClinVar
+NP_660208.2,NP_660208.2.csv,MSSKKNRKRLNQSAENGSSLPSAASSCAEARAPSAGSDFAATSGTLTVTNLLEKVDDKIPKTFQNSLIHLGLNTMKSANICIGRPVLLTSLNGKQEVYTAWPMAGFPGGKVGLSEMAQKNVGVRPGDAIQVQPLVGAVLQAEEMDVALSDKDMEINEEELTGCILRKLDGKIVLPGNFLYCTFYGRPYKLQVLRVKGADGMILGGPQSDSDTDAQRMAFEQSSMETSSLELSLQLSQLDLEDTQIPTSRSTPYKPIDDRITNKASDVLLDVTQSPGDGSGLMLEEVTGLKCNFESAREGNEQLTEEERLLKFSIGAKCNTDTFYFISSTTRVNFTEIDKNSKEQDNQFKVTYDMIGGLSSQLKAIREIIELPLKQPELFKSYGIPAPRGVLLYGPPGTGKTMIARAVANEVGAYVSVINGPEIISKFYGETEAKLRQIFAEATLRHPSIIFIDELDALCPKREGAQNEVEKRVVASLLTLMDGIGSEVSEGQVLVLGATNRPHALDAALRRPGRFDKEIEIGVPNAQDRLDILQKLLRRVPHLLTEAELLQLANSAHGYVGADLKVLCNEAGLCALRRILKKQPNLPDVKVAGLVKITLKDFLQAMNDIRPSAMREIAIDVPNVSWSDIGGLESIKLKLEQAVEWPLKHPESFIRMGIQPPKGVLLYGPPGCSKTMIAKALANESGLNFLAIKGPELMNKYVGESERAVRETFRKARAVAPSIIFFDELDALAVERGSSLGAGNVADRVLAQLLTEMDGIEQLKDVTILAATNRPDRIDKALMRPGRIDRIIYVPLPDAATRREIFKLQFHSMPVSNEVDLDELILQTDAYSGAEIVAVCREAALLALEEDIQANLIMKRHFTQALSTVTPRIPESLRRFYEDYQEKSGLHTL,mutated_sequence,1.0,893.0,NP_660208.2.a2m,NP_660208.2.npy,ClinVar
+NP_660289.2,NP_660289.2.csv,MHGHGGYDSDFSDDERCGESSKRKKRTVEDDLLLQKPFQKEKHGKVAHKQVAAELLDREEARNRRFHLIAMDAYQRHTKFVNDYILYYGGKKEDFKRLGENDKTDLDVIRENHRFLWNEEDEMDMTWEKRLAKKYYDKLFKEYCIADLSKYKENKFGFRWRVEKEVISGKGQFFCGNKYCDKKEGLKSWEVNFGYIEHGEKRNALVKLRLCQECSIKLNFHHRRKEIKSKKRKDKTKKDCEESSHKKSRLSSAEEASKKKDKGHSSSKKSEDSLLRNSDEEESASESELWKGPLPETDEKSQEEEFDEYFQDLFL,mutated_sequence,1.0,315.0,NP_660289.2.a2m,NP_660289.2.npy,ClinVar
+NP_663304.1,NP_663304.1.csv,MSTASAASSSSSSSAGEMIEAPSQVLNFEEIDYKEIEVEEVVGRGAFGVVCKAKWRAKDVAIKQIESESERKAFIVELRQLSRVNHPNIVKLYGACLNPVCLVMEYAEGGSLYNVLHGAEPLPYYTAAHAMSWCLQCSQGVAYLHSMQPKALIHRDLKPPNLLLVAGGTVLKICDFGTACDIQTHMTNNKGSAAWMAPEVFEGSNYSEKCDVFSWGIILWEVITRRKPFDEIGGPAFRIMWAVHNGTRPPLIKNLPKPIESLMTRCWSKDPSQRPSMEEIVKIMTHLMRYFPGADEPLQYPCQYSDEGQSNSATSTGSFMDIASTNTSNKSDTNMEQVPATNDTIKRLESKLLKNQAKQQSESGRLSLGASRGSSVESLPPTSEGKRMSADMSEIEARIAATTAYSKPKRGHRKTASFGNILDVPEIVISGNGQPRRRSIQDLTVTGTEPGQVSSRSSSPSVRMITTSGPTSEKPTRSHPWTPDDSTDTNGSDNSIPMAYLTLDHQLQPLAPCPNSKESMAVFEQHCKMAQEYMKVQTEIALLLQRKQELVAELDQDEKDQQNTSRLVQEHKKLLDENKSLSTYYQQCKKQLEVIRSQQQKRQGTS,mutated_sequence,1.0,606.0,NP_663304.1.a2m,NP_663304.1.npy,ClinVar
+NP_689476.2,NP_689476.2.csv,MMEAIKKKMQMLKLDKENALDRAEQAEAEQKQAEERSKQLEDELAAMQKKLKGTEDELDKYSEALKDAQEKLELAEKKAADAEAEVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRALKDEEKMELQEIQLKEAKHIAEEADRKYEEVARKLVIIEGDLERTEERAELAESKCSELEEELKNVTNNLKSLEAQAEKYSQKEDKYEEEIKILTDKLKEAETRAEFAERSVAKLEKTIDDLEDELYAQKLKYKAISEELDHALNDMTSI,mutated_sequence,1.0,285.0,NP_689476.2.a2m,NP_689476.2.npy,ClinVar
+NP_689509.1,NP_689509.1.csv,MGDKKDDKDSPKKNKGKERRDLDDLKKEVAMTEHKMSVEEVCRKYNTDCVQGLTHSKAQEILARDGPNALTPPPTTPEWVKFCRQLFGGFSILLWIGAILCFLAYGIQAGTEDDPSGDNLYLGIVLAAVVIITGCFSYYQEAKSSKIMESFKNMVPQQALVIREGEKMQVNAEEVVVGDLVEIKGGDRVPADLRIISAHGCKVDNSSLTGESEPQTRSPDCTHDNPLETRNITFFSTNCVEGTARGVVVATGDRTVMGRIATLASGLEVGKTPIAIEIEHFIQLITGVAVFLGVSFFILSLILGYTWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTEDQSGTSFDKSSHTWVALSHIAGLCNRAVFKGGQDNIPVLKRDVAGDASESALLKCIELSSGSVKLMRERNKKVAEIPFNSTNKYQLSIHETEDPNDNRYLLVMKGAPERILDRCSTILLQGKEQPLDEEMKEAFQNAYLELGGLGERVLGFCHYYLPEEQFPKGFAFDCDDVNFTTDNLCFVGLMSMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPVSQVNPRDAKACVIHGTDLKDFTSEQIDEILQNHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGVAMGIAGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLLFIMANIPLPLGTITILCIDLGTDMVPAISLAYEAAESDIMKRQPRNPRTDKLVNERLISMAYGQIGMIQALGGFFSYFVILAENGFLPGNLVGIRLNWDDRTVNDLEDSYGQQWTYEQRKVVEFTCHTAFFVSIVVVQWADLIICKTRRNSVFQQGMKNKILIFGLFEETALAAFLSYCPGMDVALRMYPLKPSWWFCAFPYSFLIFVYDEIRKLILRRNPGGWVEKETYY,mutated_sequence,1.0,1013.0,NP_689509.1.a2m,NP_689509.1.npy,ClinVar
+NP_689777.3,NP_689777.3.csv,MLESYVTPILMSYVNRYIKNLKPSDLQLSLWGGDVVLSKLELKLDVLEQELKLPFTFLSGHIHELRIHVPWTKLGSEPVVITINTMECILKLKDGIQDDHESCGSNSTNRSTAESTKSSIKPRRMQQAAPTDPDLPPGYVQSLIRRVVNNVNIVINNLILKYVEDDIVLSVNITSAECYTVGELWDRAFMDISATDLVLRKVINFSDCTVCLDKRNASGKIEFYQDPLLYKCSFRTRLHFTYENLNSKMPSVIKIHTLVESLKLSITDQQLPMFIRIMQLGIALYYGEIGNFKEGEIEDLTCHNKDMLGNITGSEDETRIDMQYPAQHKGQELYSQQDEEQPQGWVSWAWSFVPAIVSYDDGEEDFVGNDPASTMHQQKAQTLKDPIVSIGFYCTKATVTFKLTEMQVESSYYSPQKVKSKEVLCWEQEGTTVEALMMGEPFFDCQIGFVGCRAMCLKGIMGVKDFEENMNRSETEACFFICGDNLSTKGFTYLTNSLFDYRSPENNGTRAEFILDSTHHKETYTEIAGMQRFGAFYMDYLYTMENTSGKGSTNQQDFSSGKSEDLGTVQEKSTKSLVIGPLDFRLDSSAVHRILKMIVCALEHEYEPYSRLKSDIKDENETILNPEEVALLEEYIPTRHTSVTLLKCTCTISMAEFNLLDHLLPVIMGEKNSSNFMNTTNFQSLRPLPSIRILVDKINLEHSVPMYAEQLVHVVSSLTQPSDNLLHYCYVHCYLKIFGFQAGLTSLDCSGSYCLPVPVIPSFSTALYGKLLKLPTCWTKRSQIAITEGIFELPNLTIQATRAQTLLLQAIYQSWSHLGNVSSSAVIEALINEIFLSIGVKSKNPLPTLEGSIQNVELKYCSTSLVKCASGTMGSIKICAKAPVDSGKEKLIPLLQGPSDTKDLHSTKWLNESRKPESLLAPDLMAFTIQVPQYIDYCHNSGAVLLCSIQGLAVNIDPILYTWLIYQPQKRTSRHMQQQPVVAVPLVMPVCRRKEDEVSIGSAPLAKQQSYQASEYASSPVKTKTVTESRPLSVPVKAMLNISESCRSPEERMKEFIGIVWNAVKHLTLQLEVQSCCVFIPNDSLPSPSTIVSGDIPGTVRSWYHGQTSMPGTLVLCLPQIKIISAGHKYMEPLQEIPFVIPRPILEEGDAFPWTISLHNFSIYTLLGKQVTLCLVEPMGCTSTLAVTSQKLLATGPDTRHSFVVCLHVDLESLEIKCSNPQVQLFYELTDIMNKVWNKIQKRGNLNLSPTSPETMAGPVPTSPVRSSIGTAPPDTSTCSPSADIGTTTEGDSIQAGEESPFSDSVTLEQTTSNIGGTSGRVSLWMQWVLPKITIKLFAPDPENKGTEVCMVSELEDLSASIDVQDVYTKVKCKIESFNIDHYRSRPGEGWQSGHFEGVFLQCKEKSVTTTKLLDGTHQQHGFLSLTYTKAVTKNVRHKLTSRNERRSFHKLSEGLMDGSPHFLHEILLSAQAFDIVLYFPLLNAIASIFQAKLPKTQKEKRKSPGQPMRTHTLTSRNLPLIYVNTSVIRIFIPKTEEMQPTVEANQAAKEDTVVLKIGSVAMAPQADNPLGRSVLRKDIYQRALNLGILRDPGSEIEDRQYQIDLQSINIGTAQWHQLKPEKESVSGGVVTETERNSQNPALEWNMASSIRRHQERRAILTPVLTDFSVRITGAPAVIFTKVVSPENLHTEEILVCGHSLEVNITTNLDFFLSVAQVQLLHQLIVANMTGLEPSNKAAEISKQEQKKVDIFDGGMAETSSRYSGAQDSGIGSDSVKIRIVQIEQHSGASQHRIARPSRQSSIVKNLNFIPFDIFITASRISLMTYSCMALSKSKSQEQKNNEKTDKSSLNLPEVDSDVAKPNQACISTVTAEDLLRSSISFPSGKKIGVLSLESLHASTRSSARQALGITIVRQPGRRGTGDLQLEPFLYFIVSQPSLLLSCHHRKQRVEVSIFDAVLKGVASDYKCIDPGKTLPEALDYCTVWLQTVPGEIDSKSGIPPSFITLQIKDFLNGPADVNLDISKPLKANLSFTKLDQINLFLKKIKNAHSLAHSEETSAMSNTMVNKDDLPVSKYYRGKLSKPKIHGDGVQKISAQENMWRAVSCFQKISVQTTQIVISMETVPHTSKPCLLASLSNLNGSLSVKATQKVPGIILGSSFLLSINDFLLKTSLKERSRILIGPCCATANLEAKWCKHSGNPGPEQSIPKISIDLRGGLLQVFWGQEHLNCLVLLHELLNGYLNEEGNFEVQVSEPVPQMSSPVEKNQTFKSEQSSDDLRTGLFQYVQDAESLKLPGVYEVLFYNETEDCPGMMLWRYPEPRVLTLVRITPVPFNTTEDPDISTADLGDVLQVPCSLEYWDELQKVFVAFREFNLSESKVCELQLPDINLVNDQKKLVSSDLWRIVLNSSQNGADDQSSASESGSQSTCDPLVTPTALAACTRVDSCFTPWFVPSLCVSFQFAHLEFHLCHHLDQLGTAAPQYLQPFVSDRNMPSELEYMIVSFREPHMYLRQWNNGSVCQEIQFLAQADCKLLECRNVTMQSVVKPFSIFGQMAVSSDVVEKLLDCTVIVDSVFVNLGQHVVHSLNTAIQAWQQNKCPEVEELVFSHFVICNDTQETLRFGQVDTDENILLASLHSHQYSWRSHKSPQLLHICIEGWGNWRWSEPFSVDHAGTFIRTIQYRGRTASLIIKVQQLNGVQKQIIICGRQIICSYLSQSIELKVVQHYIGQDGQAVVREHFDCLTAKQKLPSYILENNELTELCVKAKGDEDWSRDVCLESKAPEYSIVIQVPSSNSSIIYVWCTVLTLEPNSQVQQRMIVFSPLFIMRSHLPDPIIIHLEKRSLGLSETQIIPGKGQEKPLQNIEPDLVHHLTFQAREEYDPSDCAVPISTSLIKQIATKVHPGGTVNQILDEFYGPEKSLQPIWPYNKKDSDRNEQLSQWDSPMRVKLSIWKPYVRTLLIELLPWALLINESKWDLWLFEGEKIVLQVPAGKIIIPPNFQEAFQIGIYWANTNTVHKSVAIKLVHNLTSPKWKDGGNGEVVTLDEEAFVDTEIRLGAFPGHQKLCQFCISSMVQQGIQIIQIEDKTTIINNTPYQIFYKPQLSVCNPHSGKEYFRVPDSATFSICPGGEQPAMKSSSLPCWDLMPDISQSVLDASLLQKQIMLGFSPAPGADSSQCWSLPAIVRPEFPRQSVAVPLGNFRENGFCTRAIVLTYQEHLGVTYLTLSEDPSPRVIIHNRCPVKMLIKENIKDIPKFEVYCKKIPSECSIHHELYHQISSYPDCKTKDLLPSLLLRVEPLDEVTTEWSDAIDINSQGTQVVFLTGFGYVYVDVVHQCGTVFITVAPEGKAGPILTNTNRAPEKIVTFKMFITQLSLAVFDDLTHHKASAELLRLTLDNIFLCVAPGAGPLPGEEPVAALFELYCVEICCGDLQLDNQLYNKSNFHFAVLVCQGEKAEPIQCSKMQSLLISNKELEEYKEKCFIKLCITLNEGKSILCDINEFSFELKPARLYVEDTFVYYIKTLFDTYLPNSRLAGHSTHLSGGKQVLPMQVTQHARALVNPVKLRKLVIQPVNLLVSIHASLKLYIASDHTPLSFSVFERGPIFTTARQLVHALAMHYAAGALFRAGWVVGSLDILGSPASLVRSIGNGVADFFRLPYEGLTRGPGAFVSGVSRGTTSFVKHISKGTLTSITNLATSLARNMDRLSLDEEHYNRQEEWRRQLPESLGEGLRQGLSRLGISLLGAIAGIVDQPMQNFQKTSEAQASAGHKAKGVISGVGKGIMGVFTKPIGGAAELVSQTGYGILHGAGLSQLPKQRHQPSDLHADQAPNSHVKYVWKMLQSLGRPEVHMALDVVLVRGSGQEHEGCLLLTSEVLFVVSVSEDTQQQAFPVTEIDCAQDSKQNNLLTVQLKQPRVACDVEVDGVRERLSEQQYNRLVDYITKTSCHLAPSCSSMQIPCPVVAAEPPPSTVKTYHYLVDPHFAQVFLSKFTMVKNKALRKGFP,mutated_sequence,1.0,3997.0,NP_689777.3.a2m,NP_689777.3.npy,ClinVar
+NP_689831.2,NP_689831.2.csv,MVMACRVVNKRRHMGLQQLSSFAETGRTFLGPLKSSKFIIDEECHESVLISSTVRLLESLDLTSAVGQLLNEAVQAQNNTYRTGISTLLFLVGAWSSAVEECLHLGVPISIIVSVMSEGLNFCSEEVVSLHVPVHNIFDCMDSTKTFSQLETFSVSLCPFLQVPSDTDLIEELHGLKDVASQTLTISNLSGRPLKSYELFKPQTKVEADNNTSRTLKNSLLADTCCRQSILIHSRHFNRTDNTEGVSKPDGFQEHVTATHKTYRCNDLVELAVGLSHGDHSSMKLVEEAVQLQYQNACVQQGNCTKPFMFDISRIFTCCLPGLPETSSCVCPGYITVVSVSNNPVIKELQNQPVRIVLIEGDLTENYRHLGFNKSANIKTVLDSMRLQEDSSEELWANHVLQVLIQFKVNLVLVQGNVSERLIEKCINSKRLVIGSVNGSVMQAFAEAAGAVQVAYITQVNEDCVGDGVCVTFWRSSPLDVVDRNNRIAILLKTEGINLVTAVLTNPVTAQMQIKEDRFWTCAYRLYYALKEEKVFLGGGAVEFLCLSCLHILAEQSLKKENHACSGWLHNTSSWLASSLAIYRPTVLKFLANGWQKYLSTLLYNTANYSSEFEASTYIQHHLQNATDSGSPSSYILNEYSKLNSRIFNSDISNKLEQIPRVYDVVTPKIEAWRRALDLVLLVLQTDSEIITGHGHTQINSQELTGFLFL,mutated_sequence,1.0,710.0,NP_689831.2.a2m,NP_689831.2.npy,ClinVar
+NP_689945.2,NP_689945.2.csv,MDADSLLLSLELASGSGQGLSPDRRASLLTSLMLVKRDYRYDRVLFWGRILGLVADYYIAQGLSEDQLAPRKTLYSLNCTEWSLLPPATEEMVAQSSVVKGRFMGDPSYEYEHTELQKVNEGEKVFEEEIVVQIKEETRLVSVIDQIDKAVAIIPRGALFKTPFGPTHVNRTFEGLSLSEAKKLSSYFHFREPVELKNKTLLEKADLDPSLDFMDSLEHDIPKGSWSIQMERGNALVVLRSLLWPGLTFYHAPRTKNYGYVYVGTGEKNMDLPFML,mutated_sequence,1.0,276.0,NP_689945.2.a2m,NP_689945.2.npy,ClinVar
+NP_694950.2,NP_694950.2.csv,MAEAVLIDLFGLKLNSQKNCHQTLLKTLNAVQYHHAAKAKFLCIMCCSNISYERDGEQDNCEIETSNGLSALLEEFEIVSCPSMAATLYTIKQKIDEKNLSSIKVIVPRHRKTLMKAFIDQLFTDVYNFEFEDLQVTFRGGLFKQSIEINVITAQELRGIQNEIETFLRSLPALRGKLTIITSSLIPDIFIHGFTTRTGGISYIPTLSSFNLFSSSKRRDPKVVVQENLRRLANAAGFNVEKFYRIKTHHSNDIWIMGRKEPDSYDGITTNQRGVTIAALGADCIPIVFADPVKKACGVAHAGWKGTLLGVAMATVNAMIAEYGCSLEDIVVVLGPSVGPCCFTLPRESAEAFHNLHPACVQLFDSPNPCIDIRKATRILLEQGGILPQNIQDQNQDLNLCTSCHPDKFFSHVRDGLNFGTQIGFISIKE,mutated_sequence,1.0,430.0,NP_694950.2.a2m,NP_694950.2.npy,ClinVar
+NP_694972.3,NP_694972.3.csv,MGTASSLVSPAGGEVIEDTYGAGGGEACEIPVEVKPKARLLRNSFRRGAGAAAGAGPGSLPRGVGAGGLLGASFKSTGSSVPELEYAAAEYERLRKEYEIFRVSKNQELLSMGRREAKLDTENKRLRAELQALQKTYQKILREKESALEAKYQAMERAATFEHDRDKVKRQFKIFRETKENEIQDLLRAKRELESKLQRLQAQGIQVFDPGESDSDDNCTDVTAAGTQCEYWTGGALGSEPSIGSMIQLQQSFRGPEFAHSSIDVEGPFANVNRDDWDIAVASLLQVTPLFSHSLWSNTVRCYLIYTDETQPEMDLFLKDYSPKLKRMCETMGYFFHAVYFPIDVENQYLTVRKWEIEKSSLVILFIHLTLPSLLLEDCEEAFLKNPEGKPRLIFHRLEDGKVSSDSVQQLIDQVSNLNKTSKAKIIDHSGDPAEGVYKTYICVEKIIKQDILGFENTDLETKDLGSEDSIPEEDDFGDVLWDIHDEQEQMETFQQASNSAHELGFEKYYQRLNDLVAAPAPIPPLLVSGGPGSGKSLLLSKWIQLQQKNSPNTLILSHFVGRPMSTSSESSLIIKRLTLKLMQHSWSVSALTLDPAKLLEEFPRWLEKLSARHQGSIIIVIDSIDQVQQVEKHMKWLIDPLPVNVRVIVSVNVETCPPAWRLWPTLHLDPLSPKDAKSIIIAECHSVDIKLSKEQEKKLERHCRSATTCNALYVTLFGKMIARAGRAGNLDKILHQCFQCQDTLSLYRLVLHSIRESMANDVDKELMKQILCLVNVSHNGVSESELMELYPEMSWTFLTSLIHSLYKMCLLTYGCGLLRFQHLQAWETVRLEYLEGPTVTSSYRQKLINYFTLQLSQDRVTWRSADELPWLFQQQGSKQKLHDCLLNLFVSQNLYKRGHFAELLSYWQFVGKDKSAMATEYFDSLKQYEKNCEGEDNMSCLADLYETLGRFLKDLGLLSQAIVPLQRSLEIRETALDPDHPRVAQSLHQLASVYVQWKKFGNAEQLYKQALEISENAYGADHPYTARELEALATLYQKQNKYEQAEHFRKKSFKIHQKAIKKKGNLYGFALLRRRALQLEELTLGKDTPDNARTLNELGVLYYLQNNLETADQFLKRSLEMRERVLGPDHPDCAQSLNNLAALCNEKKQYDKAEELYERALDIRRRALAPDHPSLAYTVKHLAILYKKMGKLDKAVPLYELAVEIRQKSFGPKHPSVATALVNLAVLYSQMKKHVEALPLYERALKIYEDSLGRMHPRVGETLKNLAVLSYEGGDFEKAAELYKRAMEIKEAETSLLGGKAPSRHSSSGDTFSLKTAHSPNVFLQQGQR,mutated_sequence,1.0,1330.0,NP_694972.3.a2m,NP_694972.3.npy,ClinVar
+NP_710156.3,NP_710156.3.csv,MWGFRLLRSPPLLLLLPQLGIGNASSCSQARTMNPGGSGGARCSLSAEVRRRQCLQLSTVPGADPQRSNELLLLAAAGEGLERQDLPGDPAKEEPQPPPQHHVLYFPGDVQNYHEIMTRHPENYQWENWSLENVATILAHRFPNSYIWVIKCSRMHLHKFSCYDNFVKSNMFGAPEHNTDFGAFKHLYMLLVNAFNLSQNSLSKKSLNVWNKDSIASNCRSSPSHTTNGCQGEKVRTCEKSDESAMSFYPPSLNDASFTLIGFSKGCVVLNQLLFELKEAKKDKNIDAFIKSIRTMYWLDGGHSGGSNTWVTYPEVLKEFAQTGIIVHTHVTPYQVRDPMRSWIGKEHKKFVQILGDLGMQVTSQIHFTKEAPSIENHFRVHEVF,mutated_sequence,1.0,385.0,NP_710156.3.a2m,NP_710156.3.npy,ClinVar
+NP_714915.3,NP_714915.3.csv,MATRGGAGVAMAVWSLLSARAVTAFLLLFLPRFLQAQTFSFPFQQPEKCDNNQYFDISALSCVPCGANQRQDARGTSCVCLPGFQMISNNGGPAIICKKCPENMKGVTEDGWNCISCPSDLTAEGKCHCPIGHILVERDINGTLLSQATCELCDGNENSFMVVNALGDRCVRCEPTFVNTSRSCACSEPNILTGGLCFSSTGNFPLRRISAARYGEVGMSLTSEWFAKYLQSSAAACWVYANLTSCQALGNMCVMNMNSYDFATFDACGLFQFIFENTAGLSTVHSISFWRQNLPWLFYGDQLGLAPQVLSSTSLPTNFSFKGENQNTKLKFVAASYDIRGNFLKWQTLEGGVLQLCPDTETRLNAAYSFGTTYQQNCEIPISKILIDFPTPIFYDVYLEYTDENQHQYILAVPVLNLNLQHNKIFVNQDSNSGKWLLTRRIFLVDAVSGRENDLGTQPRVIRVATQISLSVHLVPNTINGNIYPPLITIAYSDIDIKDANSQSVKVSFSVTYEMDHGEAHVQTDIALGVLGGLAVLASLLKTAGWKRRIGSPMIDLQTVVKFLVYYAGDLANVFFIITVGTGLYWLIFFKAQKSVSVLLPMPIQEERFVTYVGCAFALKALQFLHKLISQITIDVFFIDWERPKGKVLKAVEGEGGVRSATVPVSIWRTYFVANEWNEIQTVRKINSLFQVLTVLFFLEVVGFKNLALMDSSSSLSRNPPSYIAPYSCILRYAVSAALWLAIGIIQVVFFAVFYERFIEDKIRQFVDLCSMSNISVFLLSHKCFGYYIHGRSVHGHADTNMEEMNMNLKREAENLCSQRGLVPNTDGQTFEIAISNQMRQHYDRIHETLIRKNGPARLLSSSASTFEQSIKAYHMMNKFLGSFIDHVHKEMDYFIKDKLLLERILGMEFMEPMEKSIFYNDEGYSFSSVLYYGNEATLLIFDLLFFCVVDLACQNFILASFLTYLQQEIFRYIRNTVGQKNLASKTLVDQRFLI,mutated_sequence,1.0,995.0,NP_714915.3.a2m,NP_714915.3.npy,ClinVar
+NP_714928.1,NP_714928.1.csv,MARGGAACKSDARLLLGRDALRPAPALLAPAVLLGAALGLGLGLWLGCRAGRQRTRHQKDDTQNLLKNLESNAQTPSETGSPSRRRKREVQMSKDKEAVDECEPPSNSNITAFALKAKVIYPINQKFRPLADGSSNPSLHENLKQAVLPHQPVEASPSSSLGSLSQGEKDDCSSSSSVHSATSDDRFLSRTFLRVNAFPEVLACESVDVDLCIYSLHLKDLLHLDTALRQEKHMMFIQIFKMCLLDLLPKKKSDDELYQKILSKQEKDLEELEKGLQVKLSNTEMSGAGDSEYITLADVEKKEREYSEQLIDNMEAFWKQMANIQHFLVDQFKCSSSKARQLMMTLTERMIAAEGLLCDSQELQALDALERTMGRAHMAKVIEFLKLQVQEETRCRLAAISHGLELLAGEGKLSGRQKEELLTQQHKAFWQEAERFSREFVQRGKDLVTASLAHQVEGTAKLTLAQEEEQRSFLAEAQPTADPEKFLEAFHEVLERQRLMQCDLEEEENVRATEAVVALCQELYFSTVDTFQKFVDALFLQTLPGMTGLPPEECDYLRQEVQENAAWQLGKSNRFRRQQWKLFQELLEQDQQVWMEECALSSVLQTHLREDHEGTIRGVLGRLGGLTEESTRCVLQGHDLLLRSALRRLALRGNALATLTQMRLSGKKHLLQELREQRALEQGSSQCLDEHQWQLLRALEARVLEEASRLEEEAQQTRLQLQQRLLAEAQEVGQLLQQHMECAIGQALLVHARNAATKSRAKDRDDFKRTLMEAAVESVYVTSAGVSRLVQAYYQQIGRIMEDHEERKLQHLKTLQGERMENYKLRKKQELSNPSSGSRTAGGAHETSQAVHQRMLSQQKRFLAQFPVHQQMRLHAQQQQAGVMDLLEAQLETQLQEAEQNFISELAALARVPLAESKLLPAKRGLLEKPLRTKRKKPLPQERGDLGVPNNEDLASGDQTSGSLSSKRLSQQESEAGDSGNSKKMLKRRSNL,mutated_sequence,1.0,992.0,NP_714928.1.a2m,NP_714928.1.npy,ClinVar
+NP_733751.2,NP_733751.2.csv,MSSEEDKSVEQPQPPPPPPEEPGAPAPSPAAADKRPRGRPRKDGASPFQRARKKPRSRGKTAVEDEDSMDGLETTETETIVETEIKEQSAEEDAEAEVDNSKQLIPTLQRSVSEESANSLVSVGVEAKISEQLCAFCYCGEKSSLGQGDLKQFRITPGFILPWRNQPSNKKDIDDNSNGTYEKMQNSAPRKQRGQRKERSPQQNIVSCVSVSTQTASDDQAGKLWDELSLVGLPDAIDIQALFDSTGTCWAHHRCVEWSLGVCQMEEPLLVNVDKAVVSGSTERCAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIFLLCPEHIDQAPERSKEDANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIECGTRSSSQWHHNCLICDNCYQQQDNLCPFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELDTQLKEEYICMYCKHLGAEMDRLQPGEEVEIAELTTDYNNEMEVEGPEDQMVFSEQAANKDVNGQESTPGIVPDAVQVHTEEQQKSHPSESLDTDSLLIAVSSQHTVNTELEKQISNEVDSEDLKMSSEVKHICGEDQIEDKMEVTENIEVVTHQITVQQEQLQLLEEPETVVSREESRPPKLVMESVTLPLETLVSPHEESISLCPEEQLVIERLQGEKEQKENSELSTGLMDSEMTPTIEGCVKDVSYQGGKSIKLSSETESSFSSSADISKADVSSSPTPSSDLPSHDMLHNYPSALSSSAGNIMPTTYISVTPKIGMGKPAITKRKFSPGRPRSKQGAWSTHNTVSPPSWSPDISEGREIFKPRQLPGSAIWSIKVGRGSGFPGKRRPRGAGLSGRGGRGRSKLKSGIGAVVLPGVSTADISSNKDDEENSMHNTVVLFSSSDKFTLNQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSSCPVCYRNYREEDLILQCRQCDRWMHAVCQNLNTEEEVENVADIGFDCSMCRPYMPASNVPSSDCCESSLVAQIVTKVKELDPPKTYTQDGVCLTESGMTQLQSLTVTVPRRKRSKPKLKLKIINQNSVAVLQTPPDIQSEHSRDGEMDDSREGELMDCDGKSESSPEREAVDDETKGVEGTDGVKKRKRKPYRPGIGGFMVRQRSRTGQGKTKRSVIRKDSSGSISEQLPCRDDGWSEQLPDTLVDESVSVTESTEKIKKRYRKRKNKLEETFPAYLQEAFFGKDLLDTSRQSKISLDNLSEDGAQLLYKTNMNTGFLDPSLDPLLSSSSAPTKSGTHGPADDPLADISEVLNTDDDILGIISDDLAKSVDHSDIGPVTDDPSSLPQPNVNQSSRPLSEEQLDGILSPELDKMVTDGAILGKLYKIPELGGKDVEDLFTAVLSPANTQPTPLPQPPPPTQLLPIHNQDAFSRMPLMNGLIGSSPHLPHNSLPPGSGLGTFSAIAQSSYPDARDKNSAFNPMASDPNNSWTSSAPTVEGENDTMSNAQRSTLKWEKEEALGEMATVAPVLYTNINFPNLKEEFPDWTTRVKQIAKLWRKASSQERAPYVQKARDNRAALRINKVQMSNDSMKRQQQQDSIDPSSRIDSELFKDPLKQRESEHEQEWKFRQQMRQKSKQQAKIEATQKLEQVKNEQQQQQQQQFGSQHLLVQSGSDTPSSGIQSPLTPQPGNGNMSPAQSFHKELFTKQPPSTPTSTSSDDVFVKPQAPPPPPAPSRIPIQDSLSQAQTSQPPSPQVFSPGSSNSRPPSPMDPYAKMVGTPRPPPVGHSFSRRNSAAPVENCTPLSSVSRPLQMNETTANRPSPVRDLCSSSTTNNDPYAKPPDTPRPVMTDQFPKSLGLSRSPVVSEQTAKGPIAAGTSDHFTKPSPRADVFQRQRIPDSYARPLLTPAPLDSGPGPFKTPMQPPPSSQDPYGSVSQASRRLSVDPYERPALTPRPIDNFSHNQSNDPYSQPPLTPHPAVNESFAHPSRAFSQPGTISRPTSQDPYSQPPGTPRPVVDSYSQSSGTARSNTDPYSQPPGTPRPTTVDPYSQQPQTPRPSTQTDLFVTPVTNQRHSDPYAHPPGTPRPGISVPYSQPPATPRPRISEGFTRSSMTRPVLMPNQDPFLQAAQNRGPALPGPLVRPPDTCSQTPRPPGPGLSDTFSRVSPSAARDPYDQSPMTPRSQSDSFGTSQTAHDVADQPRPGSEGSFCASSNSPMHSQGQQFSGVSQLPGPVPTSGVTDTQNTVNMAQADTEKLRQRQKLREIILQQQQQKKIAGRQEKGSQDSPAVPHPGPLQHWQPENVNQAFTRPPPPYPGNIRSPVAPPLGPRYAVFPKDQRGPYPPDVASMGMRPHGFRFGFPGGSHGTMPSQERFLVPPQQIQGSGVSPQLRRSVSVDMPRPLNNSQMNNPVGLPQHFSPQSLPVQQHNILGQAYIELRHRAPDGRQRLPFSAPPGSVVEASSNLRHGNFIPRPDFPGPRHTDPMRRPPQGLPNQLPVHPDLEQVPPSQQEQGHSVHSSSMVMRTLNHPLGGEFSEAPLSTSVPSETTSDNLQITTQPSDGLEEKLDSDDPSVKELDVKDLEGVEVKDLDDEDLENLNLDTEDGKVVELDTLDNLETNDPNLDDLLRSGEFDIIAYTDPELDMGDKKSMFNEELDLPIDDKLDNQCVSVEPKKKEQENKTLVLSDKHSPQKKSTVTNEVKTEVLSPNSKVESKCETEKNDENKDNVDTPCSQASAHSDLNDGEKTSLHPCDPDLFEKRTNRETAGPSANVIQASTQLPAQDVINSCGITGSTPVLSSLLANEKSDNSDIRPSGSPPPPTLPASPSNHVSSLPPFIAPPGRVLDNAMNSNVTVVSRVNHVFSQGVQVNPGLIPGQSTVNHSLGTGKPATQTGPQTSQSGTSSMSGPQQLMIPQTLAQQNRERPLLLEEQPLLLQDLLDQERQEQQQQRQMQAMIRQRSEPFFPNIDFDAITDPIMKAKMVALKGINKVMAQNNLGMPPMVMSRFPFMGQVVTGTQNSEGQNLGPQAIPQDGSITHQISRPNPPNFGPGFVNDSQRKQYEEWLQETQQLLQMQQKYLEEQIGAHRKSKKALSAKQRTAKKAGREFPEEDAEQLKHVTEQQSMVQKQLEQIRKQQKEHAELIEDYRIKQQQQCAMAPPTMMPSVQPQPPLIPGATPPTMSQPTFPMVPQQLQHQQHTTVISGHTSPVRMPSLPGWQPNSAPAHLPLNPPRIQPPIAQLPIKTCTPAPGTVSNANPQSGPPPRVEFDDNNPFSESFQERERKERLREQQERQRIQLMQEVDRQRALQQRMEMEQHGMVGSEISSSRTSVSQIPFYSSDLPCDFMQPLGPLQQSPQHQQQMGQVLQQQNIQQGSINSPSTQTFMQTNERRQVGPPSFVPDSPSIPVGSPNFSSVKQGHGNLSGTSFQQSPVRPSFTPALPAAPPVANSSLPCGQDSTITHGHSYPGSTQSLIQLYSDIIPEEKGKKKRTRKKKRDDDAESTKAPSTPHSDITAPPTPGISETTSTPAVSTPSELPQQADQESVEPVGPSTPNMAAGQLCTELENKLPNSDFSQATPNQQTYANSEVDKLSMETPAKTEEIKLEKAETESCPGQEEPKLEEQNGSKVEGNAVACPVSSAQSPPHSAGAPAAKGDSGNELLKHLLKNKKSSSLLNQKPEGSICSEDDCTKDNKLVEKQNPAEGLQTLGAQMQGGFGCGNQLPKTDGGSETKKQRSKRTQRTGEKAAPRSKKRKKDEEEKQAMYSSTDTFTHLKQQNNLSNPPTPPASLPPTPPPMACQKMANGFATTEELAGKAGVLVSHEVTKTLGPKPFQLPFRPQDDLLARALAQGPKTVDVPASLPTPPHNNQEELRIQDHCGDRDTPDSFVPSSSPESVVGVEVSRYPDLSLVKEEPPEPVPSPIIPILPSTAGKSSESRRNDIKTEPGTLYFASPFGPSPNGPRSGLISVAITLHPTAAENISSVVAAFSDLLHVRIPNSYEVSSAPDVPSMGLVSSHRINPGLEYRQHLLLRGPPPGSANPPRLVSSYRLKQPNVPFPPTSNGLSGYKDSSHGIAESAALRPQWCCHCKVVILGSGVRKSFKDLTLLNKDSRESTKRVEKDIVFCSNNCFILYSSTAQAKNSENKESIPSLPQSPMRETPSKAFHQYSNNISTLDVHCLPQLPEKASPPASPPIAFPPAFEAAQVEAKPDELKVTVKLKPRLRAVHGGFEDCRPLNKKWRGMKWKKWSIHIVIPKGTFKPPCEDEIDEFLKKLGTSLKPDPVPKDYRKCCFCHEEGDGLTDGPARLLNLDLDLWVHLNCALWSTEVYETQAGALINVELALRRGLQMKCVFCHKTGATSGCHRFRCTNIYHFTCAIKAQCMFFKDKTMLCPMHKPKGIHEQELSYFAVFRRVYVQRDEVRQIASIVQRGERDHTFRVGSLIFHTIGQLLPQQMQAFHSPKALFPVGYEASRLYWSTRYANRRCRYLCSIEEKDGRPVFVIRIVEQGHEDLVLSDISPKGVWDKILEPVACVRKKSEMLQLFPAYLKGEDLFGLTVSAVARIAESLPGVEACENYTFRYGRNPLMELPLAVNPTGCARSEPKMSAHVKRFVLRPHTLNSTSTSKSFQSTVTGELNAPYSKQFVHSKSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN,mutated_sequence,1.0,4911.0,NP_733751.2.a2m,NP_733751.2.npy,ClinVar
+NP_733765.1,NP_733765.1.csv,MENAHTKTVEEVLGHFGVNESTGLSLEQVKKLKERWGSNELPAEEGKTLLELVIEQFEDLLVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILVANAIVGVWQERNAENAIEALKEYEPEMGKVYRQDRKSVQRIKAKDIVPGDIVEIAVGDKVPADIRLTSIKSTTLRVDQSILTGESVSVIKHTDPVPDPRAVNQDKKNMLFSGTNIAAGKAMGVVVATGVNTEIGKIRDEMVATEQERTPLQQKLDEFGEQLSKVISLICIAVWIINIGHFNDPVHGGSWIRGAIYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFILDRVEGDTCSLNEFTITGSTYAPIGEVHKDDKPVNCHQYDGLVELATICALCNDSALDYNEAKGVYEKVGEATETALTCLVEKMNVFDTELKGLSKIERANACNSVIKQLMKKEFTLEFSRDRKSMSVYCTPNKPSRTSMSKMFVKGAPEGVIDRCTHIRVGSTKVPMTSGVKQKIMSVIREWGSGSDTLRCLALATHDNPLRREEMHLEDSANFIKYETNLTFVGCVGMLDPPRIEVASSVKLCRQAGIRVIMITGDNKGTAVAICRRIGIFGQDEDVTSKAFTGREFDELNPSAQRDACLNARCFARVEPSHKSKIVEFLQSFDEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNFSTIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAALGFPEALIPVQLLWVNLVTDGLPATALGFNPPDLDIMNKPPRNPKEPLISGWLFFRYLAIGCYVGAATVGAAAWWFIAADGGPRVSFYQLSHFLQCKEDNPDFEGVDCAIFESPYPMTMALSVLVTIEMCNALNSLSENQSLLRMPPWENIWLVGSICLSMSLHFLILYVEPLPLIFQITPLNVTQWLMVLKISLPVILMDETLKFVARNYLEPGKECVQPATKSCSFSACTDGISWPFVLLIMPLVIWVYSTDTNFSDMFWS,mutated_sequence,1.0,1042.0,NP_733765.1.a2m,NP_733765.1.npy,ClinVar
+NP_733775.1,NP_733775.1.csv,MAQRYDELPHYGGMDGVGVPASMYGDPHAPRPIPPVHHLNHGPPLHATQHYGAHAPHPNVMPASMGSAVNDALKRDKDAIYGHPLFPLLALVFEKCELATCTPREPGVAGGDVCSSDSFNEDIAVFAKQVRAEKPLFSSNPELDNLMIQAIQVLRFHLLELEKVHELCDNFCHRYISCLKGKMPIDLVIDERDGSSKSDHEELSGSSTNLADHNPSSWRDHDDATSTHSAGTPGPSSGGHASQSGDNSSEQGDGLDNSVASPGTGDDDDPDKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQDTGLTILQVNNWFINARRRIVQPMIDQSNRAGFLLDPSVSQGAAYSPEGQPMGSFVLDGQQHMGIRPAGLQSMPGDYVSQGGPMGMSMAQPSYTPPQMTPHPTQLRHGPPMHSYLPSHPHHPAMMMHGGPPTHPGMTMSAQSPTMLNSVDPNVGGQVMDIHAQ,mutated_sequence,1.0,477.0,NP_733775.1.a2m,NP_733775.1.npy,ClinVar
+NP_733821.1,NP_733821.1.csv,METPSQRRATRSGAQASSTPLSPTRITRLQEKEDLQELNDRLAVYIDRVRSLETENAGLRLRITESEEVVSREVSGIKAAYEAELGDARKTLDSVAKERARLQLELSKVREEFKELKARNTKKEGDLIAAQARLKDLEALLNSKEAALSTALSEKRTLEGELHDLRGQVAKLEAALGEAKKQLQDEMLRRVDAENRLQTMKEELDFQKNIYSEELRETKRRHETRLVEIDNGKQREFESRLADALQELRAQHEDQVEQYKKELEKTYSAKLDNARQSAERNSNLVGAAHEELQQSRIRIDSLSAQLSQLQKQLAAKEAKLRDLEDSLARERDTSRRLLAEKEREMAEMRARMQQQLDEYQELLDIKLALDMEIHAYRKLLEGEEERLRLSPSPTSQRSRGRASSHSSQTQGGGSVTKKRKLESTESRSSFSQHARTSGRVAVEEVDEEGKFVRLRNKSNEDQSMGNWQIKRQNGDDPLLTYRFPPKFTLKAGQVVTIWAAGAGATHSPPTDLVWKAQNTWGCGNSLRTALINSTGEEVAMRKLVRSVTVVEDDEDEDGDDLLHHHHGSHCSSSGDPAEYNLRSRTVLCGTCGQPADKASASGSGAQVGGPISSGSSASSVTVTRSYRSVGGSGGGSFGDNLVTRSYLLGNSSPRTQSPQNCSIM,mutated_sequence,1.0,664.0,NP_733821.1.a2m,NP_733821.1.npy,ClinVar
+NP_742105.1,NP_742105.1.csv,MVQKSRNGGVYPGPSGEKKLKVGFVGLDPGAPDSTRDGALLIAGSEAPKRGSILSKPRAGGAGAGKPPKRNAFYRKLQNFLYNVLERPRGWAFIYHAYVFLLVFSCLVLSVFSTIKEYEKSSEGALYILEIVTIVVFGVEYFVRIWAAGCCCRYRGWRGRLKFARKPFCVIDIMVLIASIAVLAAGSQGNVFATSALRSLRFLQILRMIRMDRRGGTWKLLGSVVYAHSKELVTAWYIGFLCLILASFLVYLAEKGENDHFDTYADALWWGLITLTTIGYGDKYPQTWNGRLLAATFTLIGVSFFALPAGILGSGFALKVQEQHRQKHFEKRRNPAAGLIQSAWRFYATNLSRTDLHSTWQYYERTVTVPMYSSQTQTYGASRLIPPLNQLELLRNLKSKSGLAFRKDPPPEPSPSKGSPCRGPLCGCCPGRSSQKVSLKDRVFSSPRGVAAKGKGSPQAQTVRRSPSADQSLEDSPSKVPKSWSFGDRSRARQAFRIKGAASRQNSEEASLPGEDIVDDKSCPCEFVTEDLTPGLKVSIRAVCVMRFLVSKRKFKESLRPYDVMDVIEQYSAGHLDMLSRIKSLQSRVDQIVGRGPAITDKDRTKGPAEAELPEDPSMMGRLGKVEKQVLSMEKKLDFLVNIYMQRMGIPPTETEAYFGAKEPEPAPPYHSPEDSREHVDRHGCIVKIVRSSSSTGQKNFSAPPAAPPVQCPPSTSWQPQSHPRQGHGTSPVGDHGSLVRIPPPPAHERSLSAYGGGNRASMEFLRQEDTPGCRPPEGNLRDSDTSISIPSVDHEELERSFSGFSISQSKENLDALNSCYAAVAPCAKVRPYIAEGESDTDSDLCTPCGPPPRSATGEGPFGDVGWAGPRK,mutated_sequence,1.0,872.0,NP_742105.1.a2m,NP_742105.1.npy,ClinVar
+NP_758440.1,NP_758440.1.csv,MASATEDPVLERYFKGHKAAITSLDLSPNGKQLATASWDTFLMLWNFKPHARAYRYVGHKDVVTSVQFSPHGNLLASASRDRTVRLWIPDKRGKFSEFKAHTAPVRSVDFSADGQFLATASEDKSIKVWSMYRQRFLYSLYRHTHWVRCAKFSPDGRLIVSCSEDKTIKIWDTTNKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGVNCISFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGADTQVLLWRTNFDELHCKGLTKRNLKRLHFDSPPHLLDIYPRTPHPHEEKVETVEINPKLEVIDLQISTPPVMDILSFDSTTTTETSGRTLPDKGEEACGYFLNPSLMSPECLPTTTKKKTEDMSDLPCESQRSIPLAVTDALEHIMEQLNVLTQTVSILEQRLTLTEDKLKDCLENQQKLFSAVQQKS,mutated_sequence,1.0,478.0,NP_758440.1.a2m,NP_758440.1.npy,ClinVar
+NP_775109.2,NP_775109.2.csv,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNVSVVQSTISSGYGGASGVGSGLGLGGGSSYSYGSGLGIGGGFSSSSGRAIGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,mutated_sequence,1.0,564.0,NP_775109.2.a2m,NP_775109.2.npy,ClinVar
+NP_775931.3,NP_775931.3.csv,MTEAALVEGQVKLRDGKKWKSRWLVLRKPSPVADCLLMLVYKDKSERIKGLRERSSLTLEDICGLEPGLPYEGLVHTLAIVCLSQAIMLGFDSHEAMCAWDARIRYALGEVHRFHVTVAPGTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTRCGYWAGVFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALETLQLEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQEGPRPAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGIATGSHSSYSSSLSSYAGSSLDVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTSLRAHYDTPRSLCLAPRDHSPPSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQGSEATLPGPAPGEPWEAGGPHAGPPPAFFSACPVCGGLKVNPPP,mutated_sequence,1.0,504.0,NP_775931.3.a2m,NP_775931.3.npy,ClinVar
+NP_777367.1,NP_777367.1.csv,MPSQQKKIIFCMAGVFSFACALGVVTALGTPLWIKATVLCKTGALLVNASGQELDKFMGEMQYGLFHGEGVRQCGLGARPFRFSFFPDLLKAIPVSIHVNVILFSAILIVLTMVGTAFFMYNAFGKPFETLHGPLGLYLLSFISGSCGCLVMILFASEVKIHHLSEKIANYKEGTYVYKTQSEKYTTSFWVIFFCFFVHFLNGLLIRLAGFQFPFAKSKDAETTNVAADLMY,mutated_sequence,1.0,232.0,NP_777367.1.a2m,NP_777367.1.npy,ClinVar
+NP_787110.2,NP_787110.2.csv,MVSVNAPLGAPVESSYDTSPSEGTNLNAPNSLGVSALCAICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI,mutated_sequence,1.0,452.0,NP_787110.2.a2m,NP_787110.2.npy,ClinVar
+NP_789794.1,NP_789794.1.csv,MDLILNRMDYLQVGVTSQKTMKLIPASRHRATQKVVIGDHDGVVMCFGMKKGEAAAVFKTLPGPKIARLELGGVINTPQEKIFIAAASEIRGFTKRGKQFLSFETNLTESIKAMHISGSDLFLSASYIYNHYCDCKDQHYYLSGDKINDVICLPVERLSRITPVLACQDRVLRVLQGSDVMYAVEVPGPPTVLALHNGNGGDSGEDLLFGTSDGKLALIQITTSKPVRKWEIQNEKKRGGILCIDSFDIVGDGVKDLLVGRDDGMVEVYSFDNANEPVLRFDQMLSESVTSIQGGCVGKDSYDEIVVSTYSGWVTGLTTEPIHKESGPGEELKINQEMQNKISSLRNELEHLQYKVLQERENYQQSSQSSKAKSAVPSFGINDKFTLNKDDASYSLILEVQTAIDNVLIQSDVPIDLLDVDKNSAVVSFSSCDSESNDNFLLATYRCQADTTRLELKIRSIEGQYGTLQAYVTPRIQPKTCQVRQYHIKPLSLHQRTHFIDHDRPMNTLTLTGQFSFAEVHSWVVFCLPEVPEKPPAGECVTFYFQNTFLDTQLESTYRKGEGVFKSDNISTISILKDVLSKEATKRKINLNISYEINEVSVKHTLKLIHPKLEYQLLLAKKVQLIDALKELQIHEGNTNFLIPEYHCILEEADHLQEEYKKQPAHLERLYGMITDLFIDKFKFKGTNVKTKVPLLLEILDSYDQNALISFFDAA,mutated_sequence,1.0,715.0,NP_789794.1.a2m,NP_789794.1.npy,ClinVar
+NP_796376.2,NP_796376.2.csv,MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKEKLFNEINKIPLPPWALIAIAVVAGLLLLTCCFCICKKCCCKKKKNKKEKGKGMKNAMNMKDMKGGQDDDDAETGLTEGEGEGEEEKEPENLGKLQFSLDYDFQANQLTVGVLQAAELPALDMGGTSDPYVKVFLLPDKKKKYETKVHRKTLNPAFNETFTFKVPYQELGGKTLVMAIYDFDRFSKHDIIGEVKVPMNTVDLGQPIEEWRDLQGGEKEEPEKLGDICTSLRYVPTAGKLTVCILEAKNLKKMDVGGLSDPYVKIHLMQNGKRLKKKKTTVKKKTLNPYFNESFSFEIPFEQIQKVQVVVTVLDYDKLGKNEAIGKIFVGSNATGTELRHWSDMLANPRRPIAQWHSLKPEEEVDALLGKNK,mutated_sequence,1.0,419.0,NP_796376.2.a2m,NP_796376.2.npy,ClinVar
+NP_803187.1,NP_803187.1.csv,MKSPALQPLSMAGLQLMTPASSPMGPFFGLPWQQEAIHDNIYTPRKYQVELLEAALDHNTIVCLNTGSGKTFIAVLLTKELSYQIRGDFSRNGKRTVFLVNSANQVAQQVSAVRTHSDLKVGEYSNLEVNASWTKERWNQEFTKHQVLIMTCYVALNVLKNGYLSLSDINLLVFDECHLAILDHPYREIMKLCENCPSCPRILGLTASILNGKCDPEELEEKIQKLEKILKSNAETATDLVVLDRYTSQPCEIVVDCGPFTDRSGLYERLLMELEEALNFINDCNISVHSKERDSTLISKQILSDCRAVLVVLGPWCADKVAGMMVRELQKYIKHEQEELHRKFLLFTDTFLRKIHALCEEHFSPASLDLKFVTPKVIKLLEILRKYKPYERQQFESVEWYNNRNQDNYVSWSDSEDDDEDEEIEEKEKPETNFPSPFTNILCGIIFVERRYTAVVLNRLIKEAGKQDPELAYISSNFITGHGIGKNQPRNKQMEAEFRKQEEVLRKFRAHETNLLIATSIVEEGVDIPKCNLVVRFDLPTEYRSYVQSKGRARAPISNYIMLADTDKIKSFEEDLKTYKAIEKILRNKCSKSVDTGETDIDPVMDDDDVFPPYVLRPDDGGPRVTINTAIGHINRYCARLPSDPFTHLAPKCRTRELPDGTFYSTLYLPINSPLRASIVGPPMSCVRLAERVVALICCEKLHKIGELDDHLMPVGKETVKYEEELDLHDEEETSVPGRPGSTKRRQCYPKAIPECLRDSYPRPDQPCYLYVIGMVLTTPLPDELNFRRRKLYPPEDTTRCFGILTAKPIPQIPHFPVYTRSGEVTISIELKKSGFMLSLQMLELITRLHQYIFSHILRLEKPALEFKPTDADSAYCVLPLNVVNDSSTLDIDFKFMEDIEKSEARIGIPSTKYTKETPFVFKLEDYQDAVIIPRYRNFDQPHRFYVADVYTDLTPLSKFPSPEYETFAEYYKTKYNLDLTNLNQPLLDVDHTSSRLNLLTPRHLNQKGKALPLSSAEKRKAKWESLQNKQILVPELCAIHPIPASLWRKAVCLPSILYRLHCLLTAEELRAQTASDAGVGVRSLPADFRYPNLDFGWKKSIDSKSFISISNSSSAENDNYCKHSTIVPENAAHQGANRTSSLENHDQMSVNCRTLLSESPGKLHVEVSADLTAINGLSYNQNLANGSYDLANRDFCQGNQLNYYKQEIPVQPTTSYSIQNLYSYENQPQPSDECTLLSNKYLDGNANKSTSDGSPVMAVMPGTTDTIQVLKGRMDSEQSPSIGYSSRTLGPNPGLILQALTLSNASDGFNLERLEMLGDSFLKHAITTYLFCTYPDAHEGRLSYMRSKKVSNCNLYRLGKKKGLPSRMVVSIFDPPVNWLPPGYVVNQDKSNTDKWEKDEMTKDCMLANGKLDEDYEEEDEEEESLMWRAPKEEADYEDDFLEYDQEHIRFIDNMLMGSGAFVKKISLSPFSTTDSAYEWKMPKKSSLGSMPFSSDFEDFDYSSWDAMCYLDPSKAVEEDDFVVGFWNPSEENCGVDTGKQSISYDLHTEQCIADKSIADCVEALLGCYLTSCGERAAQLFLCSLGLKVLPVIKRTDREKALCPTRENFNSQQKNLSVSCAAASVASSRSSVLKDSEYGCLKIPPRCMFDHPDADKTLNHLISGFENFEKKINYRFKNKAYLLQAFTHASYHYNTITDCYQRLEFLGDAILDYLITKHLYEDPRQHSPGVLTDLRSALVNNTIFASLAVKYDYHKYFKAVSPELFHVIDDFVQFQLEKNEMQGMDSELRRSEEDEEKEEDIEVPKAMGDIFESLAGAIYMDSGMSLETVWQVYYPMMRPLIEKFSANVPRSPVRELLEMEPETAKFSPAERTYDGKVRVTVEVVGKGKFKGVGRSYRIAKSAAARRALRSLKANQPQVPNS,mutated_sequence,1.0,1922.0,NP_803187.1.a2m,NP_803187.1.npy,ClinVar
+NP_808592.2,NP_808592.2.csv,MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDLPPYKRWHELMLDKAPVLKVIVNSLKNMINTFVPSGKIMQVVDEKLPGLLGNFPGPFEEEMKGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTWVITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGILEWILGKKDVMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRDRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW,mutated_sequence,1.0,395.0,NP_808592.2.a2m,NP_808592.2.npy,ClinVar
+NP_835361.1,NP_835361.1.csv,MDARWWAVVVLAAFPSLGAGGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYFRRKNYLETGRGLCFPLVKACVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV,mutated_sequence,1.0,432.0,NP_835361.1.a2m,NP_835361.1.npy,ClinVar
+NP_835464.1,NP_835464.1.csv,MEKYERIRVVGRGAFGIVHLCLRKADQKLVIIKQIPVEQMTKEERQAAQNECQVLKLLNHPNVIEYYENFLEDKALMIAMEYAPGGTLAEFIQKRCNSLLEEETILHFFVQILLALHHVHTHLILHRDLKTQNILLDKHRMVVKIGDFGISKILSSKSKAYTVVGTPCYISPELCEGKPYNQKSDIWALGCVLYELASLKRAFEAANLPALVLKIMSGTFAPISDRYSPELRQLVLSLLSLEPAQRPPLSHIMAQPLCIRALLNLHTDVGSVRMRRAEKSVAPSNTGSRTTSVRCRGIPRGPVRPAIPPPLSSVYAWGGGLGTPLRLPMLNTEVVQVAAGRTQKAGVTRSGRLILWEAPPLGAGGGSLLPGAVEQPQPQFISRFLEGQSGVTIKHVACGDFFTACLTDRGIIMTFGSGSNGCLGHGSLTDISQPTIVEALLGYEMVQVACGASHVLALSTERELFAWGRGDSGRLGLGTRESHSCPQQVPMPPGQEAQRVVCGIDSSMILTVPGQALACGSNRFNKLGLDHLSLGEEPVPHQQVEEALSFTLLGSAPLDQEPLLSIDLGTAHSAAVTASGDCYTFGSNQHGQLGTNTRRGSRAPCKVQGLEGIKMAMVACGDAFTVAIGAESEVYSWGKGARGRLGRRDEDAGLPRPVQLDETHPYTVTSVSCCHGNTLLAVRSVTDEPVPP,mutated_sequence,1.0,692.0,NP_835464.1.a2m,NP_835464.1.npy,ClinVar
+NP_849191.3,NP_849191.3.csv,MRPVALLLLPSLLALLAHGLSLEAPTVGKGQAPGIEETDGELTAAPTPEQPERGVHFVTTAPTLKLLNHHPLLEEFLQEGLEKGDEELRPALPFQPDPPAPFTPSPLPRLANQDSRPVFTSPTPAMAAVPTQPQSKEGPWSPESESPMLRITAPLPPGPSMAVPTLGPGEIASTTPPSRAWTPTQEGPGDMGRPWVAEVVSQGAGIGIQGTITSSTASGDDEETTTTTTIITTTITTVQTPGPCSWNFSGPEGSLDSPTDLSSPTDVGLDCFFYISVYPGYGVEIKVQNISLREGETVTVEGLGGPDPLPLANQSFLLRGQVIRSPTHQAALRFQSLPPPAGPGTFHFHYQAYLLSCHFPRRPAYGDVTVTSLHPGGSARFHCATGYQLKGARHLTCLNATQPFWDSKEPVCIAACGGVIRNATTGRIVSPGFPGNYSNNLTCHWLLEAPEGQRLHLHFEKVSLAEDDDRLIIRNGDNVEAPPVYDSYEVEYLPIEGLLSSGKHFFVELSTDSSGAAAGMALRYEAFQQGHCYEPFVKYGNFSSSTPTYPVGTTVEFSCDPGYTLEQGSIIIECVDPHDPQWNETEPACRAVCSGEITDSAGVVLSPNWPEPYGRGQDCIWGVHVEEDKRIMLDIRVLRIGPGDVLTFYDGDDLTARVLGQYSGPRSHFKLFTSMADVTIQFQSDPGTSVLGYQQGFVIHFFEVPRNDTCPELPEIPNGWKSPSQPELVHGTVVTYQCYPGYQVVGSSVLMCQWDLTWSEDLPSCQRVTSCHDPGDVEHSRRLISSPKFPVGATVQYICDQGFVLMGSSILTCHDRQAGSPKWSDRAPKCLLEQLKPCHGLSAPENGARSPEKQLHPAGATIHFSCAPGYVLKGQASIKCVPGHPSHWSDPPPICRAASLDGFYNSRSLDVAKAPAASSTLDAAHIAAAIFLPLVAMVLLVGGVYFYFSRLQGKSSLQLPRPRPRPYNRITIESAFDNPTYETGSLSFAGDERI,mutated_sequence,1.0,994.0,NP_849191.3.a2m,NP_849191.3.npy,ClinVar
+NP_851564.1,NP_851564.1.csv,MPRGWAAPLLLLLLQGGWGCPDLVCYTDYLQTVICILEMWNLHPSTLTLTWQDQYEELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYSQECGSFLLAESIKPAPPFNVTVTFSGQYNISWRSDYEDPAFYMLKGKLQYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVRAGPMPGSSYQGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPWSPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQNSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADGEDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTSPGDEGPPRSYLRQWVVIPPPLSSPGPQAS,mutated_sequence,1.0,538.0,NP_851564.1.a2m,NP_851564.1.npy,ClinVar
+NP_852091.1,NP_852091.1.csv,MSSEFLAELHWEDGFAIPVANEENKLLEDQLSKLKDERASLQDELREYEERINSMTSHFKNVKQELSITQSLCKARERETESEEHFKAIAQRELGRVKDEIQRLENEMASILEKKSDKENGIFKATQKLDGLKCQMNWDQQALEAWLEESAHKDSDALTLQKYAQQDDNKIRALTLQLERLTLECNQKRKILDNELTETISAQLELDKAAQDFRKIHNERQELIKQWENTIEQMQKRDGDIDNCALELARIKQETREKENLVKEKIKFLESEIGNNTEFEKRISVADRKLLKCRTAYQDHETSRIQLKGELDSLKATVNRTSSDLEALRKNISKIKKDIHEETARLQKTKNHNEIIQTKLKEITEKTMSVEEKATNLEDMLKEEEKDVKEVDVQLNLIKGVLFKKAQELQTETMKEKAVLSEIEGTRSSLKHLNHQLQKLDFETLKQQEIMYSQDFHIQQVERRMSRLKGEINSEEKQALEAKIVELRKSLEEKKSTCGLLETQIKKLHNDLYFIKKAHSKNSDEKQSLMTKINELNLFIDRSEKELDKAKGFKQDLMIEDNLLKLEVKRTREMLHSKAEEVLSLEKRKQQLYTAMEERTEEIKVHKTMLASQIRYVDQERENISTEFRERLSKIEKLKNRYEILTVVMLPPEGEEEKTQAYYVIKAAQEKEELQREGDCLDAKINKAEKEIYALENTLQVLNSCNNNYKQSFKKVTPSSDEYELKIQLEEQKRAVDEKYRYKQRQIRELQEDIQSMENTLDVIEHLANNVKEKLSEKQAYSFQLSKETEEQKPKLERVTKQCAKLTKEIRLLKDTKDETMEEQDIKLREMKQFHKVIDEMLVDIIEENTEIRIILQTYFQQSGLELPTASTKGSRQSSRSPSHTSLSARSSRSTSTSTSQSSIKVLELKFPASSSLVGSPSRPSSASSSSSNVKSKKSSK,mutated_sequence,1.0,941.0,NP_852091.1.a2m,NP_852091.1.npy,ClinVar
+NP_852259.1,NP_852259.1.csv,MADADEGFGLAHTPLEPDAKDLPCDSKPESALGAPSKSPSSPQAAFTQQGMEGIKVFLHERELWLKFHEVGTEMIITKAGRRMFPSYKVKVTGLNPKTKYILLMDIVPADDHRYKFADNKWSVTGKAEPAMPGRLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQPRLHIVKADENNGFGSKNTAFCTHVFPETAFIAVTSYQNHKITQLKIENNPFAKGFRGSDDMELHRMSRMQSKEYPVVPRSTVRQKVASNHSPFSSESRALSTSSNLGSQYQCENGVSGPSQDLLPPPNPYPLPQEHSQIYHCTKRKEEECSTTDHPYKKPYMETSPSEEDSFYRSSYPQQQGLGASYRTESAQRQACMYASSAPPSEPVPSLEDISCNTWPSMPSYSSCTVTTVQPMDRLPYQHFSAHFTSGPLVPRLAGMANHGSPQLGEGMFQHQTSVAHQPVVRQCGPQTGLQSPGTLQPPEFLYSHGVPRTLSPHQYHSVHGVGMVPEWSDNS,mutated_sequence,1.0,518.0,NP_852259.1.a2m,NP_852259.1.npy,ClinVar
+NP_852608.1,NP_852608.1.csv,MAFVPVIPESYSHVLAEFESLDPLLSALRLDSSRLKCTSIAVSRKWLALGSSGGGLHLIQKEGWKHRLFLSHREGAISQVACCLHDDDYVAVATSQGLVVVWELNQERRGKPEQMYVSSEHKGRRVTALCWDTAILRVFVGDHAGKVSAIKLNTSKQAKAAAAFVMFPVQTITTVDSCVVQLDYLDGRLLISSLTRSFLCDTEREKFWKIGNKERDGEYGACFFPGRCSGGQQPLIYCARPGSRMWEVNFDGEVISTHQFKKLLSLPPLPVITLRSEPQYDHTAGSSQSLSFPKLLHLSEHCVLTWTERGIYIFIPQNVQVLLWSEVKDIQDVAVCRNELFCLHLNGKVSHLSLISVERCVERLLRRGLWNLAARTCCLFQNSVIASRARKTLTADKLEHLKSQLDHGTYNDLISQLEELILKFEPLDSACSSRRSSISSHESFSILDSGIYRIISSRRGSQSDEDSCSLHSQTLSEDERFKEFTSQQEEDLPDQCCGSHGNEDNVSHAPVMFETDKNETFLPFGIPLPFRSPSPLVSLQAVKESVSSFVRKTTEKIGTLHTSPDLKVRPELRGDEQSCEEDVSSDTCPKEEDTEEEKEVTSPPPEEDRFQELKVATAEAMTKLQDPLVLFESESLRMVLQEWLSHLEKTFAMKDFSGVSDTDNSSMKLNQDVLLVNESKKGILDEDNEKEKRDSLGNEESVDKTACECVRSPRESLDDLFQICSPCAIASGLRNDLAELTTLCLELNVLNSKIKSTSGHVDHTLQQYSPEILACQFLKKYFFLLNLKRAKESIKLSYSNSPSVWDTFIEGLKEMASSNPVYMEMEKGDLPTRLKLLDDEVPFDSPLLVVYATRLYEKFGESALRSLIKFFPSILPSDIIQLCHHHPAEFLAYLDSLVKSRPEDQRSSFLESLLQPESLRLDWLLLAVSLDAPPSTSTMDDEGYPRPHSHLLSWGYSQLILHLIKLPADFITKEKMTDICRSCGFWPGYLILCLELERRREAFTNIVYLNDMSLMEGDNGWIPETVEEWKLLLHLIQSKSTRPAPQESLNGSLSDGPSPINVENVALLLAKAMGPDRAWSLLQECGLALELSEKFTRTCDILRIAEKRQRALIQSMLEKCDRFLWSQQA,mutated_sequence,1.0,1129.0,NP_852608.1.a2m,NP_852608.1.npy,ClinVar
+NP_852664.1,NP_852664.1.csv,MSAEGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEARPEEIGWLNGYNETTGERGDFPGTYVEYIGRKKISPPTPKPRPPRPLPVAPGSSKTEADVEQQALTLPDLAEQFAPPDIAPPLLIKLVEAIEKKGLECSTLYRTQSSSNLAELRQLLDCDTPSVDLEMIDVHVLADAFKRYLLDLPNPVIPAAVYSEMISLAPEVQSSEEYIQLLKKLIRSPSIPHQYWLTLQYLLKHFFKLSQTSSKNLLNARVLSEIFSPMLFRFSAASSDNTENLIKVIEILISTEWNERQPAPALPPKPPKPTTVANNGMNNNMSLQDAEWYWGDISREEVNEKLRDTADGTFLVRDASTKMHGDYTLTLRKGGNNKLIKIFHRDGKYGFSDPLTFSSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQDQVVKEDNIEAVGKKLHEYNTQFQEKSREYDRLYEEYTRTSQEIQMKRTAIEAFNETIKIFEEQCQTQERYSKEYIEKFKREGNEKEIQRIMHNYDKLKSRISEIIDSRRRLEEDLKKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSLVEDDEDLPHHDEKTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVINKTATGYGFAEPYNLYSSLKELVLHYQHTSLVQHNDSLNVTLAYPVYAQQRR,mutated_sequence,1.0,724.0,NP_852664.1.a2m,NP_852664.1.npy,ClinVar
+NP_859056.2,NP_859056.2.csv,MGRAVKVLQLFKTLHRTRQQVFKNDARALEAARIKINEEFKNNKSETSSKKIEELMKIGSDVELLLRTSVIQGIHTDHNTLKLVPRKDLLVENVPYCDAPTQKQ,mutated_sequence,1.0,104.0,NP_859056.2.a2m,NP_859056.2.npy,ClinVar
+NP_872282.1,NP_872282.1.csv,MAARLVSRCGAVRAAPHSGPLVSWRRWSGASTDTVYDVVVSGGGLVGAAMACALGYDIHFHDKKILLLEAGPKKVLEKLSETYSNRVSSISPGSATLLSSFGAWDHICNMRYRAFRRMQVWDACSEALIMFDKDNLDDMGYIVENDVIMHALTKQLEAVSDRVTVLYRSKAIRYTWPCPFPMADSSPWVHITLGDGSTFQTKLLIGADGHNSGVRQAVGIQNVSWNYDQSAVVATLHLSEATENNVAWQRFLPSGPIALLPLSDTLSSLVWSTSHEHAAELVSMDEEKFVDAVNSAFWSDADHTDFIDTAGAMLQYAVSLLKPTKVSARQLPPSVARVDAKSRVLFPLGLGHAAEYVRPRVALIGDAAHRVHPLAGQGVNMGFGDISSLAHHLSTAAFNGKDLGSVSHLTGYETERQRHNTALLAATDLLKRLYSTSASPLVLLRTWGLQATNAVSPLKEQIMAFASK,mutated_sequence,1.0,468.0,NP_872282.1.a2m,NP_872282.1.npy,ClinVar
+NP_877435.3,NP_877435.3.csv,MRTSLQAVALWGQKAPPHSITAIMITDDQRTIVTGSQEGQLCLWNLSHELKISAKELLFGHSASVTCLARARDFSKQPYIVSAAENGEMCVWNVTNGQCMEKATLPYRHTAICYYHCSFRMTGEGWLLCCGEYQDVLIIDAKTLAVVHSFRSSQFPDWINCMCIVHSMRIQEDSLLVVSVAGELKVWDLSSSINSIQEKQDVYEKESKFLESLNCQTIRFCTYTERLLLVVFSKCWKVYDYCDFSLLLTEVSRNGQFFAGGEVIAAHRILIWTEDGHSYIYQLLNSGLSKSIYPADGRVLKETIYPHLLCSTSVQENKEQSRPFVMGYMNERKEPFYKVLFSGEVSGRITLWHIPDVPVSKFDGSPREIPVTATWTLQDNFDKHDTMSQSIIDYFSGLKDGAGTAVVTSSEYIPSLDKLICGCEDGTIIITQALNAAKARLLEGGSLVKDSPPHKVLKGHHQSVTSLLYPHGLSSKLDQSWMLSGDLDSCVILWDIFTEEILHKFFLEAGPVTSLLMSPEKFKLRGEQIICCVCGDHSVALLHLEGKSCLLHARKHLFPVRMIKWHPVENFLIVGCADDSVYIWEIETGTLERHETGERARIILNCCDDSQLVKSVLPIASETLKHKSIEQRSSSPYQLGPLPCPGLQVESSCKVTDAKFCPRPFNVLPVKTKWSNVGFHILLFDLENLVELLLPTPLSDVDSSSSFYGGEVLRRAKSTVEKKTLTLRKSKTACGPLSAEALAKPITESLAQGDNTIKFSEENDGIKRQKKMKISKKMQPKPSRKVDASLTIDTAKLFLSCLLPWGVDKDLDYLCIKHLNILKLQGPISLGISLNEDNFSLMLPGWDLCNSGMIKDYSGVNLFSRKVLDLSDKYTATLPNQVGIPRGLENNCDSLRESDTIVYLLSRLFLVNKLVNMPLELACRVGSSFRMESIHNKMRGAGNDILNMSSFYSCLRNGKNESHVPEADLSLLKLISCWRDQSVQVTEAIQAVLLAEVQQHMKSLGKIPVNSQPVSMAENGNCEMKQMLPKLEWTEELELQCVRNTLPLQTPVSPVKHDSNSNSANFQDVEDMPDRCALEESESPGEPRHHSWIAKVCPCKVS,mutated_sequence,1.0,1102.0,NP_877435.3.a2m,NP_877435.3.npy,ClinVar
+NP_891555.2,NP_891555.2.csv,MQRGAALCLRLWLCLGLLDGLVSGYSMTPPTLNITEESHVIDTGDSLSISCRGQHPLEWAWPGAQEAPATGDKDSEDTGVVRDCEGTDARPYCKVLLLHEVHANDTGSYVCYYKYIKARIEGTTAASSYVFVRDFEQPFINKPDTLLVNRKDAMWVPCLVSIPGLNVTLRSQSSVLWPDGQEVVWDDRRGMLVSTPLLHDALYLQCETTWGDQDFLSNPFLVHITGNELYDIQLLPRKSLELLVGEKLVLNCTVWAEFNSGVTFDWDYPGKQAERGKWVPERRSQQTHTELSSILTIHNVSQHDLGSYVCKANNGIQRFRESTEVIVHENPFISVEWLKGPILEATAGDELVKLPVKLAAYPPPEFQWYKDGKALSGRHSPHALVLKEVTEASTGTYTLALWNSAAGLRRNISLELVVNVPPQIHEKEASSPSIYSRHSRQALTCTAYGVPLPLSIQWHWRPWTPCKMFAQRSLRRRQQQDLMPQCRDWRAVTTQDAVNPIESLDTWTEFVEGKNKTVSKLVIQNANVSAMYKCVVSNKVGQDERLIYFYVTTIPDGFTIESKPSEELLEGQPVLLSCQADSYKYEHLRWYRLNLSTLHDAHGNPLLLDCKNVHLFATPLAASLEEVAPGARHATLSLSIPRVAPEHEGHYVCEVQDRRSHDKHCHKKYLSVQALEAPRLTQNLTDLLVNVSDSLEMQCLVAGAHAPSIVWYKDERLLEEKSGVDLADSNQKLSIQRVREEDAGRYLCSVCNAKGCVNSSASVAVEGSEDKGSMEIVILVGTGVIAVFFWVLLLLIFCNMRRPAHADIKTGYLSIIMDPGEVPLEEQCEYLSYDASQWEFPRERLHLGRVLGYGAFGKVVEASAFGIHKGSSCDTVAVKMLKEGATASEHRALMSELKILIHIGNHLNVVNLLGACTKPQGPLMVIVEFCKYGNLSNFLRAKRDAFSPCAEKSPEQRGRFRAMVELARLDRRRPGSSDRVLFARFSKTEGGARRASPDQEAEDLWLSPLTMEDLVCYSFQVARGMEFLASRKCIHRDLAARNILLSESDVVKICDFGLARDIYKDPDYVRKGSARLPLKWMAPESIFDKVYTTQSDVWSFGVLLWEIFSLGASPYPGVQINEEFCQRLRDGTRMRAPELATPAIRRIMLNCWSGDPKARPAFSELVEILGDLLQGRGLQEEEEVCMAPRSSQSSEEGSFSQVSTMALHIAQADAEDSPPSLQRHSLAARYYNWVSFPGCLARGAETRGSSRMKTFEEFPMTPTTYKGSVDNQTDSGMVLASEEFEQIESRHRQESGFSCKGPGQNVAVTRAHPDSQGRRRRPERGARGGQVFYNSEYGELSEPSEEDHCSPSARVTFFTDNSY,mutated_sequence,1.0,1363.0,NP_891555.2.a2m,NP_891555.2.npy,ClinVar
+NP_898888.1,NP_898888.1.csv,MGRVSGLVPSRFLTLLAHLVVVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFAVELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFVTVFGLKKKPF,mutated_sequence,1.0,140.0,NP_898888.1.a2m,NP_898888.1.npy,ClinVar
+NP_899058.1,NP_899058.1.csv,MSDGDYDYLIKFLALGDSGVGKTSVLYQYTDGKFNSKFITTVGIDFREKRVVYRASGPDGATGRGQRIHLQLWDTAGQERFRSLTTAFFRDAMGFLLLFDLTNEQSFLNVRNWISQLQMHAYCENPDIVLCGNKSDLEDQRVVKEEEAIALAEKYGIPYFETSAANGTNISQAIEMLLDLIMKRMERCVDKSWIPEGVVRSNGHASTDQLSEEKEKGACGC,mutated_sequence,1.0,221.0,NP_899058.1.a2m,NP_899058.1.npy,ClinVar
+NP_899200.1,NP_899200.1.csv,MSGSKSVSPPGYAAQKTAAPAPRGGPEHRSAWGEADSRANGYPHAPGGSARGSTKKPGGAVTPQQQQRLASRWRSDDDDDPPLSGDDPLAGGFGFSFRSKSAWQERGGDDCGRGSRRQRRGAASGGSTRAPPAGGGGGSAAAAASAGGTEVRPRSVEVGLEERRGKGRAADELEAGAVEGGEGSGDGGSSADSGSGAGPGAVLSLGACCLALLQIFRSKKFPSDKLERLYQRYFFRLNQSSLTMLMAVLVLVCLVMLAFHAARPPLQLPYLAVLAAAVGVILIMAVLCNRAAFHQDHMGLACYALIAVVLAVQVVGLLLPQPRSASEGIWWTVFFIYTIYTLLPVRMRAAVLSGVLLSALHLAIALRTNAQDQFLLKQLVSNVLIFSCTNIVGVCTHYPAEVSQRQAFQETRECIQARLHSQRENQQQERLLLSVLPRHVAMEMKADINAKQEDMMFHKIYIQKHDNVSILFADIEGFTSLASQCTAQELVMTLNELFARFDKLAAENHCLRIKILGDCYYCVSGLPEARADHAHCCVEMGMDMIEAISLVREVTGVNVNMRVGIHSGRVHCGVLGLRKWQFDVWSNDVTLANHMEAGGKAGRIHITKATLNYLNGDYEVEPGCGGERNAYLKEHSIETFLILRCTQKRKEEKAMIAKMNRQRTNSIGHNPPHWGAERPFYNHLGGNQVSKEMKRMGFEDPKDKNAQESANPEDEVDEFLGRAIDARSIDRLRSEHVRKFLLTFREPDLEKKYSKQVDDRFGAYVACASLVFLFICFVQITIVPHSIFMLSFYLTCSLLLTLVVFVSVIYSCVKLFPSPLQTLSRKIVRSKMNSTLVGVFTITLVFLAAFVNMFTCNSRDLLGCLAQEHNISASQVNACHVAESAVNYSLGDEQGFCGSPWPNCNFPEYFTYSVLLSLLACSVFLQISCIGKLVLMLAIELIYVLIVEVPGVTLFDNADLLVTANAIDFFNNGTSQCPEHATKVALKVVTPIIISVFVLALYLHAQQVESTARLDFLWKLQATEEKEEMEELQAYNRRLLHNILPKDVAAHFLARERRNDELYYQSCECVAVMFASIANFSEFYVELEANNEGVECLRLLNEIIADFDEIISEDRFRQLEKIKTIGSTYMAASGLNDSTYDKVGKTHIKALADFAMKLMDQMKYINEHSFNNFQMKIGLNIGPVVAGVIGARKPQYDIWGNTVNVASRMDSTGVPDRIQVTTDMYQVLAANTYQLECRGVVKVKGKGEMMTYFLNGGPPLS,mutated_sequence,1.0,1261.0,NP_899200.1.a2m,NP_899200.1.npy,ClinVar
+NP_919224.1,NP_919224.1.csv,MALLIHLKTVSELRGRGDRIAKVTFRGQSFYSRVLENCEDVADFDETFRWPVASSIDRNEMLEIQVFNYSKVFSNKLIGTFRMVLQKVVEESHVEVTDTLIDDNNAIIKTSLCVEVRYQATDGTVGSWDDGDFLGDESLQEEEKDSQETDGLLPGSRPSSRPPGEKSFRRAGRSVFSAMKLGKNRSHKEEPQRPDEPAVLEMEDLDHLAIRLGDGLDPDSVSLASVTALTTNVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVVCVEVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVIHSKNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISSGLKGYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPERQWARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQVFFAGQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSDKVNDVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLLDEHQDLNEGLGEGVSFRARLLLGLAVEIVDTSNPELTSSTEVQVEQATPISESCAGKMEEFFLFGAFLEASMIDRRNGDKPITFEVTIGNYGNEVDGLSRPQRPRPRKEPGDEEEVDLIQNASDDEAGDAGDLASVSSTPPMRPQVTDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMDHIADKLEEGLNDIQEMIKTEKSYPERRLRGVLEELSCGCCRFLSLADKDQGHSSRTRLDRERLKSCMRELENMGQQARMLRAQVKRHTVRDKLRLCQNFLQKLRFLADEPQHSIPDIFIWMMSNNKRVAYARVPSKDLLFSIVEEETGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKVELYLWLGLSKQRKEFLCGLPCGFQEVKAAQGLGLHAFPPVSLVYTKKQAFQLRAHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWDQMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKPLVKMADEAYCPPRFPPQLEYYQIYRGNATAGDLLAAFELLQIGPAGKADLPPINGPVDVDRGPIMPVPMGIRPVLSKYRVEVLFWGLRDLKRVNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLPENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAPSWNTTVRLLRRCRVLCNGGSSSHSTGEVVVTMEPEVPIKKLETMVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPEEEEPDESMLDWWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNTEGLKGSMKGKEKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDELKVYPKELESEFDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPLPEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVVRATDLHPADINGKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLTVAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSTHGYNIWRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVANRVFTGPSEIEDENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEHVETRPLLNPDKPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIIWNTDEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGEGNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKIPARLTLQIWDADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLVSIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNPVGLARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTYRWLLLKLLLLLLLLLLLALFLYSVPGYLVKKILGA,mutated_sequence,1.0,1997.0,NP_919224.1.a2m,NP_919224.1.npy,ClinVar
+NP_919231.1,NP_919231.1.csv,MVPSSPAVEKQVPVEPGPDPELRSWRHLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFTREQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQLMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRVSFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLRSAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSLGAVGPASLEQRQSDPYLAQAPAPQAAEFLSPVTTPSPCTLCSAQASGPEAADETCPQLAVHPPGVSKLGLQCLPSDGVQNVNQ,mutated_sequence,1.0,591.0,NP_919231.1.a2m,NP_919231.1.npy,ClinVar
+NP_919253.1,NP_919253.1.csv,MLHLKVQFLDDSQKIFVVDQKSSGKALFNLSCSHLNLAEKEYFGLEFCSHSGNNVWLELLKPITKQVKNPKEIVFKFMVKFFPVDPGHLREELTRYLFTLQIKKDLALGRLPCSDNCTALMVSHILQSELGDFHEETDRKHLAQTRYLPNQDCLEGKIMHFHQKHIGRSPAESDILLLDIARKLDMYGIRPHPASDGEGMQIHLAVAHMGVLVLRGNTKINTFNWAKIRKLSFKRKHFLIKLHANILVLCKDTLEFTMASRDACKAFWKTCVEYHAFFRLSEEPKSKPKTLLCSKGSSFRYSGRTQRQLLEYGRKGRLKSLPFERKHYPSQYHERQCRSSPDLLSDVSKQVEDLRLAYGGGYYQNVNGVHASEPVLESRRRNSALEVTFATELEHSKPEADPTLLHQSQSSSSFPFIYMDPVFNTEPNPNPDPRDIFSERSSLSSFQTSCKFSGNHMSIYSGLTSKVRPAKQLTYTDVPYIPCTGQQVGIMPPQVFFYVDKPPQVPRWSPIRAEERTSPHSYVEPTAMKPAERSPRNIRMKSFQQDLQVLQEAIARTSGRSNINVGLEEEDPNLEDAFVCNIQEQTPKRSQSQSDMKTIRFPFGSEFRPLGPCPALSHKADLFTDMFAEQELPAVLMDQSTAERYVASESSDSESEILKPDYYALYGKEIRSPMARIRLSSGSLQLDEEDEDAYFNTPTAEDRTSLKPCNYFLA,mutated_sequence,1.0,714.0,NP_919253.1.a2m,NP_919253.1.npy,ClinVar
+NP_938148.1,NP_938148.1.csv,MAAVAAVAARRRRSWASLVLAFLGVCLGITLAVDRSNFKTCEESSFCKRQRSIRPGLSPYRALLDSLQLGPDSLTVHLIHEVTKVLLVLELQGLQKNMTRFRIDELEPRRPRYRVPDVLVADPPIARLSVSGRDENSVELTMAEGPYKIILTARPFRLDLLEDRSLLLSVNARGLLEFEHQRAPRVSQGSKDPAEGDGAQPEETPRDGDKPEETQGKAEKDEPGAWEETFKTHSDSKPYGPMSVGLDFSLPGMEHVYGIPEHADNLRLKVTEGGEPYRLYNLDVFQYELYNPMALYGSVPVLLAHNPHRDLGIFWLNAAETWVDISSNTAGKTLFGKMMDYLQGSGETPQTDVRWMSETGIIDVFLLLGPSISDVFRQYASLTGTQALPPLFSLGYHQSRWNYRDEADVLEVDQGFDDHNLPCDVIWLDIEHADGKRYFTWDPSRFPQPRTMLERLASKRRKLVAIVDPHIKVDSGYRVHEELRNLGLYVKTRDGSDYEGWCWPGSAGYPDFTNPTMRAWWANMFSYDNYEGSAPNLFVWNDMNEPSVFNGPEVTMLKDAQHYGGWEHRDVHNIYGLYVHMATADGLRQRSGGMERPFVLARAFFAGSQRFGAVWTGDNTAEWDHLKISIPMCLSLGLVGLSFCGADVGGFFKNPEPELLVRWYQMGAYQPFFRAHAHLDTGRREPWLLPSQHNDIIRDALGQRYSLLPFWYTLLYQAHREGIPVMRPLWVQYPQDVTTFNIDDQYLLGDALLVHPVSDSGAHGVQVYLPGQGEVWYDIQSYQKHHGPQTLYLPVTLSSIPVFQRGGTIVPRWMRVRRSSECMKDDPITLFVALSPQGTAQGELFLDDGHTFNYQTRQEFLLRRFSFSGNTLVSSSADPEGHFETPIWIERVVIIGAGKPAAVVLQTKGSPESRLSFQHDPETSVLVLRKPGINVASDWSIHLR,mutated_sequence,1.0,944.0,NP_938148.1.a2m,NP_938148.1.npy,ClinVar
+NP_940908.3,NP_940908.3.csv,MHLFACLCIVLSFLEGVGCLCPSQCTCDYHGRNDGSGSRLVLCNDMDMNELPTNLPVDTVKLRIEKTVIRRISAEAFYYLVELQYLWVTYNSVASIDPSSFYNLKQLHELRLDGNSLAAFPWASLLDMPLLRTLDLHNNKITSVPNEALRYLKNLAYLDLSSNRLTTLPPDFLESWTHLVSTPSGVLDLSPSRIILGLQDNPWFCDCHISKMIELSKVVDPAIVLLDPLMTCSEPERLTGILFQRAELEHCLKPSVMTSATKIMSALGSNVLLRCDATGFPTPQITWTRSDSSPVNYTVIQESPEEGVRWSIMSLTGISSKDAGDYKCKAKNLAGMSEAVVTVTVLGITTTPIPPDTSERTGDHPEWDVQPGSGRSTSVSSASSYLWSSSFSPTSSFSASTLSPPSTASFSLSPFSSSTVSSTTTLSTSISASTTMANKRSFQLHQGGKRNLKVAKNGSKLPPASTSKKEELALLDQTMLTETNAAIENLRVVSETKESVTLTWNMINTTHNSAVTVLYSKYGGKDLLLLNADSSKNQVTIDGLEPGGQYMACVCPKGVPPQKDQCITFSTERVEGDDSQWSLLLVVTSTACVVILPLICFLLYKVCKLQCKSEPFWEDDLAKETYIQFETLFPRSQSVGELWTRSHRDDSEKLLLCSRSSVESQVTFKSEGSRPEYYC,mutated_sequence,1.0,679.0,NP_940908.3.a2m,NP_940908.3.npy,ClinVar
+NP_940927.2,NP_940927.2.csv,MGLEAQRLPGAEEAPVRVALRVRPLLPKELLHGHQSCLQVEPGLGRVTLGRDRHFGFHVVLAEDAGQEAVYQACVQPLLEAFFEGFNATVFAYGQTGSGKTYTMGEASVASLLEDEQGIVPRAMAEAFKLIDENDLLDCLVHVSYLEVYKEEFRDLLEVGTASRDIQLREDERGNVVLCGVKEVDVEGLDEVLSLLEMGNAARHTGATHLNHLSSRSHTVFTVTLEQRGRAPSRLPRPAPGQLLVSKFHFVDLAGSERVLKTGSTGERLKESIQINSSLLALGNVISALGDPQRRGSHIPYRDSKITRILKDSLGGNAKTVMIACVSPSSSDFDETLNTLNYASRAQNIRNRATVNWRPEAERPPEETASGARGPPRHRSETRIIHRGRRAPGPATASAAAAMRLGAECARYRACTDAAYSLLRELQAEPGLPGAAARKVRDWLCAVEGERSALSSASGPDSGIESASVEDQAAQGAGGRKEDEGAQQLLTLQNQVARLEEENRDFLAALEDAMEQYKLQSDRLREQQEEMVELRLRLELVRPGWGGPRLLNGLPPGSFVPRPHTAPLGGAHAHVLGMVPPACLPGDEVGSEQRGEQVTNGREAGAELLTEVNRLGSGSSAASEEEEEEEEPPRRTLHLRRNRISNCSQRAGARPGSLPERKGPELCLEELDAAIPGSRAVGGSKARVQARQVPPATASEWRLAQAQQKIRELAINIRMKEELIGELVRTGKAAQALNRQHSQRIRELEQEAEQVRAELSEGQRQLRELEGKELQDAGERSRLQEFRRRVAAAQSQVQVLKEKKQATERLVSLSAQSEKRLQELERNVQLMRQQQGQLQRRLREETEQKRRLEAEMSKRQHRVKELELKHEQQQKILKIKTEEIAAFQRKRRSGSNGSVVSLEQQQKIEEQKKWLDQEMEKVLQQRRALEELGEELHKREAILAKKEALMQEKTGLESKRLRSSQALNEDIVRVSSRLEHLEKELSEKSGQLRQGSAQSQQQIRGEIDSLRQEKDSLLKQRLEIDGKLRQGSLLSPEEERTLFQLDEAIEALDAAIEYKNEAITCRQRVLRASASLLSQCEMNLMAKLSYLSSSETRALLCKYFDKVVTLREEQHQQQIAFSELEMQLEEQQRLVYWLEVALERQRLEMDRQLTLQQKEHEQNMQLLLQQSRDHLGEGLADSRRQYEARIQALEKELGRYMWINQELKQKLGGVNAVGHSRGGEKRSLCSEGRQAPGNEDELHLAPELLWLSPLTEGAPRTREETRDLVHAPLPLTWKRSSLCGEEQGSPEELRQREAAEPLVGRVLPVGEAGLPWNFGPLSKPRRELRRASPGMIDVRKNPL,mutated_sequence,1.0,1343.0,NP_940927.2.a2m,NP_940927.2.npy,ClinVar
+NP_944494.1,NP_944494.1.csv,MSSPNIWSTGSSVYSTPVFSQKMTVWILLLLSLYPGFTSQKSDDDYEDYASNKTWVLTPKVPEGDVTVILNNLLEGYDNKLRPDIGVKPTLIHTDMYVNSIGPVNAINMEYTIDIFFAQTWYDRRLKFNSTIKVLRLNSNMVGKIWIPDTFFRNSKKADAHWITTPNRMLRIWNDGRVLYTLRLTIDAECQLQLHNFPMDEHSCPLEFSSYGYPREEIVYQWKRSSVEVGDTRSWRLYQFSFVGLRNTTEVVKTTSGDYVVMSVYFDLSRRMGYFTIQTYIPCTLIVVLSWVSFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVSYVTAMDLFVSVCFIFVFSALVEYGTLHYFVSNRKPSKDKDKKKKNPLLRMFSFKAPTIDIRPRSATIQMNNATHLQERDEEYGYECLDGKDCASFFCCFEDCRTGAWRHGRIHIRIAKMDSYARIFFPTAFCLFNLVYWVSYLYL,mutated_sequence,1.0,475.0,NP_944494.1.a2m,NP_944494.1.npy,ClinVar
+NP_945345.2,NP_945345.2.csv,MAGIRVTKVDWQRSRNGAAHHTQEYPCPELVVRRGQSFSLTLELSRALDCEEILIFTMETGPRASEALHTKAVFQTSELERGEGWTAAREAQMEKTLTVSLASPPSAVIGRYLLSIRLSSHRKHSNRRLGEFVLLFNPWCAEDDVFLASEEERQEYVLSDSGIIFRGVEKHIRAQGWNYGQFEEDILNICLSILDRSPGHQNNPATDVSCRHNPIYVTRVISAMVNSNNDRGVVQGQWQGKYGGGTSPLHWRGSVAILQKWLKGRYKPVKYGQCWVFAGVLCTVLRCLGIATRVVSNFNSAHDTDQNLSVDKYVDSFGRTLEDLTEDSMWNFHVWNESWFARQDLGPSYNGWQVLDATPQEESEGVFRCGPASVTAIREGDVHLAHDGPFVFAEVNADYITWLWHEDESRERVYSNTKKIGRCISTKAVGSDSRVDITDLYKYPEGSRKERQVYSKAVNRLFGVEASGRRIWIRRAGGRCLWRDDLLEPATKPSIAGKFKVLEPPMLGHDLRLALCLANLTSRAQRVRVNLSGATILYTRKPVAEILHESHAVRLGPQEEKRIPITISYSKYKEDLTEDKKILLAAMCLVTKGEKLLVEKDITLEDFITIKVLGPAMVGVAVTVEVTVVNPLIERVKDCALMVEGSGLLQEQLSIDVPTLEPQERASVQFDITPSKSGPRQLQVDLVSPHFPDIKGFVIVHVATAK,mutated_sequence,1.0,706.0,NP_945345.2.a2m,NP_945345.2.npy,ClinVar
+NP_954712.1,NP_954712.1.csv,MATLLSHPQQRPPFLRQAIKIRRRRVRDLQDPPPQMAPEIQPPSHHFSPEQRALLYEDALYTVLHRLGHPEPNHVTEASELLRYLQEAFHVEPEEHQQTLQRVRELEKPIFCLKATVKQAKGILGKDVSGFSDPYCLLGIEQGVGVPGGSPGSRHRQKAVVRHTIPEEETHRTQVITQTLNPVWDETFILEFEDITNASFHLDMWDLDTVESVRQKLGELTDLHGLRRIFKEARKDKGQDDFLGNVVLRLQDLRCREDQWYPLEPRTETYPDRGQCHLQFQLIHKRRATSASRSQPSYTVHLHLLQQLVSHEVTQHEAGSTSWDGSLSPQAATVLFLHATQKDLSDFHQSMAQWLAYSRLYQSLEFPSSCLLHPITSIEYQWIQGRLKAEQQEELAASFSSLLTYGLSLIRRFRSVFPLSVSDSPARLQSLLRVLVQMCKMKAFGELCPNTAPLPQLVTEALQTGTTEWFHLKQQHHQPMVQGIPEAGKALLGLVQDVIGDLHQCQRTWDKIFHNTLKIHLFSMAFRELQWLVAKRVQDHTTVVGDVVSPEMGESLFQLYISLKELCQLRMSSSERDGVLALDNFHRWFQPAIPSWLQKTYNEALARVQRAVQMDELVPLGELTKHSTSAVDLSTCFAQISHTARQLDWPDPEEAFMITVKFVEDTCRLALVYCSLIKARARELSSGQKDQGQAANMLCVVVNDMEQLRLVIGKLPAQLAWEALEQRVGAVLEQGQLQNTLHAQLQSALAGLGHEIRTGVRTLAEQLEVGIAKHIQKLVGVRESVLPEDAILPLMKFLEVELCYMNTNLVQENFSSLLTLLWTHTLTVLVEAAASQRSSSLASNRLKIALQNLEICFHAEGCGLPPKALHTATFQALQRDLELQAASSRELIRKYFCSRIQQQAETTSEELGAVTVKASYRASEQKLRVELLSASSLLPLDSNGSSDPFVQLTLEPRHEFPELAARETQKHKKDLHPLFDETFEFLVPAEPCRKAGACLLLTVLDYDTLGADDLEGEAFLPLREVPGLSGSEEPGEVPQTRLPLTYPAPNGDPILQLLEGRKGDREAQVFVRLRRHRAKQASQHALRPAP,mutated_sequence,1.0,1090.0,NP_954712.1.a2m,NP_954712.1.npy,ClinVar
+NP_955631.1,NP_955631.1.csv,MEAFPWAPRSPRRGRAPPPMALVPSARYVSAPGPAHPQPFSSWNDYLGLATLITKAVDGEPRFGCARGGNGGGGSPPSSSSSSCCSPHTGAGPGALGPALGPPDYDEDDDDDSDEPGSRGRYLGSALELRALELCAGPAEAGLLEERFAELSPFAGRAAAVLLGCAPAAAAAATTTSEATPREERAPAWAAEPRLHAASGAAAARLLKPELQVCVFCRNNKEAMALYTTHILKGPDGRVLCPVLRRYTCPLCGASGDNAHTIKYCPLSKVPPPPARPPPRSARDGPPGKKLR,mutated_sequence,1.0,292.0,NP_955631.1.a2m,NP_955631.1.npy,ClinVar
+NP_957705.1,NP_957705.1.csv,MALKNINYLLIFYLSFSLLIYIKNSFCNKNNTRCLSNSCQNNSTCKDFSKDNDCSCSDTANNLDKDCDNMKDPCFSNPCQGSATCVNTPGERSFLCKCPPGYSGTICETTIGSCGKNSCQHGGICHQDPIYPVCICPAGYAGRFCEIDHDECASSPCQNGAVCQDGIDGYSCFCVPGYQGRHCDLEVDECASDPCKNEATCLNEIGRYTCICPHNYSGVNCELEIDECWSQPCLNGATCQDALGAYFCDCAPGFLGDHCELNTDECASQPCLHGGLCVDGENRYSCNCTGSGFTGTHCETLMPLCWSKPCHNNATCEDSVDNYTCHCWPGYTGAQCEIDLNECNSNPCQSNGECVELSSEKQYGRITGLPSSFSYHEASGYVCICQPGFTGIHCEEDVNECSSNPCQNGGTCENLPGNYTCHCPFDNLSRTFYGGRDCSDILLGCTHQQCLNNGTCIPHFQDGQHGFSCLCPSGYTGSLCEIATTLSFEGDGFLWVKSGSVTTKGSVCNIALRFQTVQPMALLLFRSNRDVFVKLELLSGYIHLSIQVNNQSKVLLFISHNTSDGEWHFVEVIFAEAVTLTLIDDSCKEKCIAKAPTPLESDQSICAFQNSFLGGLPVGMTSNGVALLNFYNMPSTPSFVGCLQDIKIDWNHITLENISSGSSLNVKAGCVRKDWCESQPCQSRGRCINLWLSYQCDCHRPYEGPNCLREYVAGRFGQDDSTGYVIFTLDESYGDTISLSMFVRTLQPSGLLLALENSTYQYIRVWLERGRLAMLTPNSPKLVVKFVLNDGNVHLISLKIKPYKIELYQSSQNLGFISASTWKIEKGDVIYIGGLPDKQETELNGGFFKGCIQDVRLNNQNLEFFPNPTNNASLNPVLVNVTQGCAGDNSCKSNPCHNGGVCHSRWDDFSCSCPALTSGKACEEVQWCGFSPCPHGAQCQPVLQGFECIANAVFNGQSGQILFRSNGNITRELTNITFGFRTRDANVIILHAEKEPEFLNISIQDSRLFFQLQSGNSFYMLSLTSLQSVNDGTWHEVTLSMTDPLSQTSRWQMEVDNETPFVTSTIATGSLNFLKDNTDIYVGDRAIDNIKGLQGCLSTIEIGGIYLSYFENVHGFINKPQEEQFLKISTNSVVTGCLQLNVCNSNPCLHGGNCEDIYSSYHCSCPLGWSGKHCELNIDECFSNPCIHGNCSDRVAAYHCTCEPGYTGVNCEVDIDNCQSHQCANGATCISHTNGYSCLCFGNFTGKFCRQSRLPSTVCGNEKTNLTCYNGGNCTEFQTELKCMCRPGFTGEWCEKDIDECASDPCVNGGLCQDLLNKFQCLCDVAFAGERCEVDLADDLISDIFTTIGSVTVALLLILLLAIVASVVTSNKRATQGTYSPSRQEKEGSRVEMWNLMPPPAMERLI,mutated_sequence,1.0,1406.0,NP_957705.1.a2m,NP_957705.1.npy,ClinVar
+NP_958780.1,NP_958780.1.csv,MAGPLPDEQDFIQAYEEVREKYKDERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSEDMTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPLLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLYDAMPRVPDVQDGVRANELQLRWQEYRELVLLLLQWMRHHTAAFEERRFPSSFEEIEILWSQFLKFKEMELPAKEADKNRSKGIYQSLEGAVQAGQLKVPPGYHPLDVEKEWGKLHVAILEREKQLRSEFERLECLQRIVTKLQMEAGLCEEQLNQADALLQSDVRLLAAGKVPQRAGEVERDLDKADSMIRLLFNDVQTLKDGRHPQGEQMYRRVYRLHERLVAIRTEYNLRLKAGVAAPATQVAQVTLQSVQRRPELEDSTLRYLQDLLAWVEENQHRVDGAEWGVDLPSVEAQLGSHRGLHQSIEEFRAKIERARSDEGQLSPATRGAYRDCLGRLDLQYAKLLNSSKARLRSLESLHSFVAAATKELMWLNEKEEEEVGFDWSDRNTNMTAKKESYSALMRELELKEKKIKELQNAGDRLLREDHPARPTVESFQAALQTQWSWMLQLCCCIEAHLKENAAYFQFFSDVREAEGQLQKLQEALRRKYSCDRSATVTRLEDLLQDAQDEKEQLNEYKGHLSGLAKRAKAVVQLKPRHPAHPMRGRLPLLAVCDYKQVEVTVHKGDECQLVGPAQPSHWKVLSSSGSEAAVPSVCFLVPPPNQEAQEAVTRLEAQHQALVTLWHQLHVDMKSLLAWQSLRRDVQLIRSWSLATFRTLKPEEQRQALHSLELHYQAFLRDSQDAGGFGPEDRLMAEREYGSCSHHYQQLLQSLEQGAQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQRIAEQQKAQAEVEGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRGTQGAEEVLRAHEEQLKEAQAVPATLPELEATKASLKKLRAQAEAQQPTFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVAQLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLQDARRRQEQIQAMPLADSQAVREQLRQEQALLEEIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESVIQEYVDLRTHYSELTTLTSQYIKFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAKELQQRMQEEVVRREEAAVDAQQQKRSIQEELQQLRQSSEAEIQAKARQAEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDESQRKRQAEVELASRVKAEAEAAREKQRALQALEELRLQAEEAERRLRQAEVERARQVQVALETAQRSAEAELQSKRASFAEKTAQLERSLQEEHVAVAQLREEAERRAQQQAEAERAREEAERELERWQLKANEALRLRLQAEEVAQQKSLAQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRQELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAARQRAEAERVLAEKLAAIGEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRRLEEQAAQHKADIEERLAQLRKASDSELERQKGLVEDTLRQRRQVEEEILALKASFEKAAAGKAELELELGRIRSNAEDTLRSKEQAELEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLDQLRGEAEAARRAAEEAEEARVQAEREAAQSRRQVEEAERLKQSAEEQAQARAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKKFAEQTLRQKAQVEQELTTLRLQLEETDHQKNLLDEELQRLKAEATEAARQRSQVEEELFSVRVQMEELSKLKARIEAENRALILRDKDNTQRFLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAEETQGFQRTLEAERQRQLEMSAEAERLKLRVAEMSRAQARAEEDAQRFRKQAEEIGEKLHRTELATQEKVTLVQTLEIQRQQSDHDAERLREAIAELEREKEKLQQEAKLLQLKSEEMQTVQQEQLLQETQALQQSFLSEKDSLLQRERFIEQEKAKLEQLFQDEVAKAQQLREEQQRQQQQMEQERQRLVASMEEARRRQHEAEEGVRRKQEELQQLEQQRRQQEELLAEENQRLREQLQLLEEQHRAALAHSEEVTASQVAATKTLPNGRDALDGPAAEAEPEHSFDGLRRKVSAQRLQEAGILSAEELQRLAQGHTTVDELARREDVRHYLQGRSSIAGLLLKATNEKLSVYAALQRQLLSPGTALILLEAQAASGFLLDPVRNRRLTVNEAVKEGVVGPELHHKLLSAERAVTGYKDPYTGQQISLFQAMQKGLIVREHGIRLLEAQIATGGVIDPVHSHRVPVDVAYRRGYFDEEMNRVLADPSDDTKGFFDPNTHENLTYLQLLERCVEDPETGLCLLPLTDKAAKGGELVYTDSEARDVFEKATVSAPFGKFQGKTVTIWEIINSEYFTAEQRRDLLRQFRTGRITVEKIIKIIITVVEEQEQKGRLCFEGLRSLVPAAELLESRVIDRELYQQLQRGERSVRDVAEVDTVRRALRGANVIAGVWLEEAGQKLSIYNALKKDLLPSDMAVALLEAQAGTGHIIDPATSARLTVDEAVRAGLVGPEFHEKLLSAEKAVTGYRDPYTGQSVSLFQALKKGLIPREQGLRLLDAQLSTGGIVDPSKSHRVPLDVACARGCLDEETSRALSAPRADAKAYSDPSTGEPATYGELQQRCRPDQLTGLSLLPLSEKAARARQEELYSELQARETFEKTPVEVPVGGFKGRTVTVWELISSEYFTAEQRQELLRQFRTGKVTVEKVIKILITIVEEVETLRQERLSFSGLRAPVPASELLASGVLSRAQFEQLKDGKTTVKDLSELGSVRTLLQGSGCLAGIYLEDTKEKVSIYEAMRRGLLRATTAALLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEQLLSAEKAVTGYRDPYSGSTISLFQAMQKGLVLRQHGIRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFSEEMNRVLADPSDDTKGFFDPNTHENLTYRQLLERCVEDPETGLRLLPLKGAEKAEVVETTQVYTEEETRRAFEETQIDIPGGGSHGGSTMSLWEVMQSDLIPEEQRAQLMADFQAGRVTKERMIIIIIEIIEKTEIIRQQGLASYDYVRRRLTAEDLFEARIISLETYNLLREGTRSLREALEAESAWCYLYGTGSVAGVYLPGSRQTLSIYQALKKGLLSAEVARLLLEAQAATGFLLDPVKGERLTVDEAVRKGLVGPELHDRLLSAERAVTGYRDPYTEQTISLFQAMKKELIPTEEALRLLDAQLATGGIVDPRLGFHLPLEVAYQRGYLNKDTHDQLSEPSEVRSYVDPSTDERLSYTQLLRRCRRDDGTGQLLLPLSDARKLTFRGLRKQITMEELVRSQVMDEATALQLREGLTSIEEVTKNLQKFLEGTSCIAGVFVDATKERLSVYQAMKKGIIRPGTAFELLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKLLSAERAVTGYKDPYSGKLISLFQAMKKGLILKDHGIRLLEAQIATGGIIDPEESHRLPVEVAYKRGLFDEEMNEILTDPSDDTKGFFDPNTEENLTYLQLMERCITDPQTGLCLLPLKEKKRERKTSSKSSVRKRRVVIVDPETGKEMSVYEAYRKGLIDHQTYLELSEQECEWEEITISSSDGVVKSMIIDRRSGRQYDIDDAIAKNLIDRSALDQYRAGTLSITEFADMLSGNAGGFRSRSSSVGSSSSYPISPAVSRTQLASWSDPTEETGPVAGILDTETLEKVSITEAMHRNLVDNITGQRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRINLAQKAFCGFEDPRTKTKMSAAQALKKGWLYYEAGQRFLEVQYLTGGLIEPDTPGRVPLDEALQRGTVDARTAQKLRDVGAYSKYLTCPKTKLKISYKDALDRSMVEEGTGLRLLEAAAQSTKGYYSPYSVSGSGSTAGSRTGSRTGSRAGSRRGSFDATGSGFSMTFSSSSYSSSGYGRRYASGSSASLGGPESAVA,mutated_sequence,1.0,4533.0,NP_958780.1.a2m,NP_958780.1.npy,ClinVar
+NP_958786.1,NP_958786.1.csv,MSQHQLRVPQPEGLGRKRTSSEDNLYLAVLRASEGKKDERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSEDMTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPLLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLYDAMPRVPDVQDGVRANELQLRWQEYRELVLLLLQWMRHHTAAFEERRFPSSFEEIEILWSQFLKFKEMELPAKEADKNRSKGIYQSLEGAVQAGQLKVPPGYHPLDVEKEWGKLHVAILEREKQLRSEFERLECLQRIVTKLQMEAGLCEEQLNQADALLQSDVRLLAAGKVPQRAGEVERDLDKADSMIRLLFNDVQTLKDGRHPQGEQMYRRVYRLHERLVAIRTEYNLRLKAGVAAPATQVAQVTLQSVQRRPELEDSTLRYLQDLLAWVEENQHRVDGAEWGVDLPSVEAQLGSHRGLHQSIEEFRAKIERARSDEGQLSPATRGAYRDCLGRLDLQYAKLLNSSKARLRSLESLHSFVAAATKELMWLNEKEEEEVGFDWSDRNTNMTAKKESYSALMRELELKEKKIKELQNAGDRLLREDHPARPTVESFQAALQTQWSWMLQLCCCIEAHLKENAAYFQFFSDVREAEGQLQKLQEALRRKYSCDRSATVTRLEDLLQDAQDEKEQLNEYKGHLSGLAKRAKAVVQLKPRHPAHPMRGRLPLLAVCDYKQVEVTVHKGDECQLVGPAQPSHWKVLSSSGSEAAVPSVCFLVPPPNQEAQEAVTRLEAQHQALVTLWHQLHVDMKSLLAWQSLRRDVQLIRSWSLATFRTLKPEEQRQALHSLELHYQAFLRDSQDAGGFGPEDRLMAEREYGSCSHHYQQLLQSLEQGAQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQRIAEQQKAQAEVEGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRGTQGAEEVLRAHEEQLKEAQAVPATLPELEATKASLKKLRAQAEAQQPTFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVAQLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLQDARRRQEQIQAMPLADSQAVREQLRQEQALLEEIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESVIQEYVDLRTHYSELTTLTSQYIKFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAKELQQRMQEEVVRREEAAVDAQQQKRSIQEELQQLRQSSEAEIQAKARQAEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDESQRKRQAEVELASRVKAEAEAAREKQRALQALEELRLQAEEAERRLRQAEVERARQVQVALETAQRSAEAELQSKRASFAEKTAQLERSLQEEHVAVAQLREEAERRAQQQAEAERAREEAERELERWQLKANEALRLRLQAEEVAQQKSLAQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRQELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAARQRAEAERVLAEKLAAIGEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRRLEEQAAQHKADIEERLAQLRKASDSELERQKGLVEDTLRQRRQVEEEILALKASFEKAAAGKAELELELGRIRSNAEDTLRSKEQAELEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLDQLRGEAEAARRAAEEAEEARVQAEREAAQSRRQVEEAERLKQSAEEQAQARAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKKFAEQTLRQKAQVEQELTTLRLQLEETDHQKNLLDEELQRLKAEATEAARQRSQVEEELFSVRVQMEELSKLKARIEAENRALILRDKDNTQRFLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAEETQGFQRTLEAERQRQLEMSAEAERLKLRVAEMSRAQARAEEDAQRFRKQAEEIGEKLHRTELATQEKVTLVQTLEIQRQQSDHDAERLREAIAELEREKEKLQQEAKLLQLKSEEMQTVQQEQLLQETQALQQSFLSEKDSLLQRERFIEQEKAKLEQLFQDEVAKAQQLREEQQRQQQQMEQERQRLVASMEEARRRQHEAEEGVRRKQEELQQLEQQRRQQEELLAEENQRLREQLQLLEEQHRAALAHSEEVTASQVAATKTLPNGRDALDGPAAEAEPEHSFDGLRRKVSAQRLQEAGILSAEELQRLAQGHTTVDELARREDVRHYLQGRSSIAGLLLKATNEKLSVYAALQRQLLSPGTALILLEAQAASGFLLDPVRNRRLTVNEAVKEGVVGPELHHKLLSAERAVTGYKDPYTGQQISLFQAMQKGLIVREHGIRLLEAQIATGGVIDPVHSHRVPVDVAYRRGYFDEEMNRVLADPSDDTKGFFDPNTHENLTYLQLLERCVEDPETGLCLLPLTDKAAKGGELVYTDSEARDVFEKATVSAPFGKFQGKTVTIWEIINSEYFTAEQRRDLLRQFRTGRITVEKIIKIIITVVEEQEQKGRLCFEGLRSLVPAAELLESRVIDRELYQQLQRGERSVRDVAEVDTVRRALRGANVIAGVWLEEAGQKLSIYNALKKDLLPSDMAVALLEAQAGTGHIIDPATSARLTVDEAVRAGLVGPEFHEKLLSAEKAVTGYRDPYTGQSVSLFQALKKGLIPREQGLRLLDAQLSTGGIVDPSKSHRVPLDVACARGCLDEETSRALSAPRADAKAYSDPSTGEPATYGELQQRCRPDQLTGLSLLPLSEKAARARQEELYSELQARETFEKTPVEVPVGGFKGRTVTVWELISSEYFTAEQRQELLRQFRTGKVTVEKVIKILITIVEEVETLRQERLSFSGLRAPVPASELLASGVLSRAQFEQLKDGKTTVKDLSELGSVRTLLQGSGCLAGIYLEDTKEKVSIYEAMRRGLLRATTAALLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEQLLSAEKAVTGYRDPYSGSTISLFQAMQKGLVLRQHGIRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFSEEMNRVLADPSDDTKGFFDPNTHENLTYRQLLERCVEDPETGLRLLPLKGAEKAEVVETTQVYTEEETRRAFEETQIDIPGGGSHGGSTMSLWEVMQSDLIPEEQRAQLMADFQAGRVTKERMIIIIIEIIEKTEIIRQQGLASYDYVRRRLTAEDLFEARIISLETYNLLREGTRSLREALEAESAWCYLYGTGSVAGVYLPGSRQTLSIYQALKKGLLSAEVARLLLEAQAATGFLLDPVKGERLTVDEAVRKGLVGPELHDRLLSAERAVTGYRDPYTEQTISLFQAMKKELIPTEEALRLLDAQLATGGIVDPRLGFHLPLEVAYQRGYLNKDTHDQLSEPSEVRSYVDPSTDERLSYTQLLRRCRRDDGTGQLLLPLSDARKLTFRGLRKQITMEELVRSQVMDEATALQLREGLTSIEEVTKNLQKFLEGTSCIAGVFVDATKERLSVYQAMKKGIIRPGTAFELLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKLLSAERAVTGYKDPYSGKLISLFQAMKKGLILKDHGIRLLEAQIATGGIIDPEESHRLPVEVAYKRGLFDEEMNEILTDPSDDTKGFFDPNTEENLTYLQLMERCITDPQTGLCLLPLKEKKRERKTSSKSSVRKRRVVIVDPETGKEMSVYEAYRKGLIDHQTYLELSEQECEWEEITISSSDGVVKSMIIDRRSGRQYDIDDAIAKNLIDRSALDQYRAGTLSITEFADMLSGNAGGFRSRSSSVGSSSSYPISPAVSRTQLASWSDPTEETGPVAGILDTETLEKVSITEAMHRNLVDNITGQRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRINLAQKAFCGFEDPRTKTKMSAAQALKKGWLYYEAGQRFLEVQYLTGGLIEPDTPGRVPLDEALQRGTVDARTAQKLRDVGAYSKYLTCPKTKLKISYKDALDRSMVEEGTGLRLLEAAAQSTKGYYSPYSVSGSGSTAGSRTGSRTGSRAGSRRGSFDATGSGFSMTFSSSSYSSSGYGRRYASGSSASLGGPESAVA,mutated_sequence,1.0,4547.0,NP_958786.1.a2m,NP_958786.1.npy,ClinVar
+NP_976035.1,NP_976035.1.csv,MAASQAVEEMRSRVVLGEFGVRNVHTTDFPGNYSGYDDAWDQDRFEKNFRVDVVHMDENSLEFDMVGIDAAIANAFRRILLAEVPTMAVEKVLVYNNTSIVQDEILAHRLGLIPIHADPRLFEYRNQGDEEGTEIDTLQFRLQVRCTRNPHAAKDSSDPNELYVNHKVYTRHMTWIPLGNQADLFPEGTIRPVHDDILIAQLRPGQEIDLLMHCVKGIGKDHAKFSPVATASYRLLPDITLLEPVEGEAAEELSRCFSPGVIEVQEVQGKKVARVANPRLDTFSREIFRNEKLKKVVRLARVRDHYIFSVESTGVLPPDVLVSEAIKVLMGKCRRFLDELDAVQMD,mutated_sequence,1.0,346.0,NP_976035.1.a2m,NP_976035.1.npy,ClinVar
+NP_981932.1,NP_981932.1.csv,MYFLTPILVAILCILVVWIFKNADRSMEKKKGEPRTRAEARPWVDEDLKDSSDLHQAEEDADEWQESEENVEHIPFSHNHYPEKEMVKRSQEFYELLNKRRSVRFISNEQVPMEVIDNVIRTAGTAPSGAHTEPWTFVVVKDPDVKHKIRKIIEEEEEINYMKRMGHRWVTDLKKLRTNWIKEYLDTAPILILIFKQVHGFAANGKKKVHYYNEISVSIACGILLAALQNAGLVTVTTTPLNCGPRLRVLLGRPAHEKLLMLLPVGYPSKEATVPDLKRKPLDQIMVTV,mutated_sequence,1.0,289.0,NP_981932.1.a2m,NP_981932.1.npy,ClinVar
+NP_982301.1,NP_982301.1.csv,MATFSRQEFFQQLLQGCLLPTAQQGLDQIWLLLAICLACRLLWRLGLPSYLKHASTVAGGFFSLYHFFQLHMVWVVLLSLLCYLVLFLCRHSSHRGVFLSVTILIYLLMGEMHMVDTVTWHKMRGAQMIVAMKAVSLGFDLDRGEVGTVPSPVEFMGYLYFVGTIVFGPWISFHSYLQAVQGRPLSCRWLQKVARSLALALLCLVLSTCVGPYLFPYFIPLNGDRLLRNKKRKARGTMVRWLRAYESAVSFHFSNYFVGFLSEATATLAGAGFTEEKDHLEWDLTVSKPLNVELPRSMVEVVTSWNLPMSYWLNNYVFKNALRLGTFSAVLVTYAASALLHGFSFHLAAVLLSLAFITYVEHVLRKRLARILSACVLSKRCPPDCSHQHRLGLGVRALNLLFGALAIFHLAYLGSLFDVDVDDTTEEQGYGMAYTVHKWSELSWASHWVTFGCWIFYRLIG,mutated_sequence,1.0,461.0,NP_982301.1.a2m,NP_982301.1.npy,ClinVar
+NP_991331.1,NP_991331.1.csv,MCPKGYEDSMEFPDHSRHLLQCLSEQRHQGFLCDCTVLVGDAQFRAHRAVLASCSMYFHLFYKDQLDKRDIVHLNSDIVTAPAFALLLEFMYEGKLQFKDLPIEDVLAAASYLHMYDIVKVCKKKLKEKATTEADSTKKEEDASSCSDKVESLSDGSSHIAGDLPSDEDEGEDEKLNILPSKRDLAAEPGNMWMRLPSDSAGIPQAGGEAEPHATAAGKTVASPCSSTESLSQRSVTSVRDSADVDCVLDLSVKSSLSGVENLNSSYFSSQDVLRSNLVQVKVEKEASCDESDVGTNDYDMEHSTVKESVSTNNRVQYEPAHLAPLREDSVLRELDREDKASDDEMMTPESERVQVEGGMESSLLPYVSNILSPAGQIFMCPLCNKVFPSPHILQIHLSTHFREQDGIRSKPAADVNVPTCSLCGKTFSCMYTLKRHERTHSGEKPYTCTQCGKSFQYSHNLSRHAVVHTREKPHACKWCERRFTQSGDLYRHIRKFHCELVNSLSVKSEALSLPTVRDWTLEDSSQELWK,mutated_sequence,1.0,531.0,NP_991331.1.a2m,NP_991331.1.npy,ClinVar
+NP_996816.3,NP_996816.3.csv,MNCPVLSLGSGFLFQVIEMLIFAYFASISLTESRGLFPRLENVGAFKKVSIVPTQAVCGLPDRSTFCHSSAAAESIQFCTQRFCIQDCPYRSSHPTYTALFSAGLSSCITPDKNDLHPNAHSNSASFIFGNHKSCFSSPPSPKLMASFTLAVWLKPEQQGVMCVIEKTVDGQIVFKLTISEKETMFYYRTVNGLQPPIKVMTLGRILVKKWIHLSVQVHQTKISFFINGVEKDHTPFNARTLSGSITDFASGTVQIGQSLNGLEQFVGRMQDFRLYQVALTNREILEVFSGDLLRLHAQSHCRCPGSHPRVHPLAQRYCIPNDAGDTADNRVSRLNPEAHPLSFVNDNDVGTSWVSNVFTNITQLNQGVTISVDLENGQYQVFYIIIQFFSPQPTEIRIQRKKENSLDWEDWQYFARNCGAFGMKNNGDLEKPDSVNCLQLSNFTPYSRGNVTFSILTPGPNYRPGYNNFYNTPSLQEFVKATQIRFHFHGQYYTTETAVNLRHRYYAVDEITISGRCQCHGHADNCDTTSQPYRCLCSQESFTEGLHCDRCLPLYNDKPFRQGDQVYAFNCKPCQCNSHSKSCHYNISVDPFPFEHFRGGGGVCDDCEHNTTGRNCELCKDYFFRQVGADPSAIDVCKPCDCDTVGTRNGSILCDQIGGQCNCKRHVSGRQCNQCQNGFYNLQELDPDGCSPCNCNTSGTVDGDITCHQNSGQCKCKANVIGLRCDHCNFGFKFLRSFNDVGCEPCQCNLHGSVNKFCNPHSGQCECKKEAKGLQCDTCRENFYGLDVTNCKACDCDTAGSLPGTVCNAKTGQCICKPNVEGRQCNKCLEGNFYLRQNNSFLCLPCNCDKTGTINGSLLCNKSTGQCPCKLGVTGLRCNQCEPHRYNLTIDNFQHCQMCECDSLGTLPGTICDPISGQCLCVPNRQGRRCNQCQPGFYISPGNATGCLPCSCHTTGAVNHICNSLTGQCVCQDASIAGQRCDQCKDHYFGFDPQTGRCQPCNCHLSGALNETCHLVTGQCFCKQFVTGSKCDACVPSASHLDVNNLLGCSKTPFQQPPPRGQVQSSSAINLSWSPPDSPNAHWLTYSLLRDGFEIYTTEDQYPYSIQYFLDTDLLPYTKYSYYIETTNVHGSTRSVAVTYKTKPGVPEGNLTLSYIIPIGSDSVTLTWTTLSNQSGPIEKYILSCAPLAGGQPCVSYEGHETSATIWNLVPFAKYDFSVQACTSGGCLHSLPITVTTAQAPPQRLSPPKMQKISSTELHVEWSPPAELNGIIIRYELYMRRLRSTKETTSEESRVFQSSGWLSPHSFVESANENALKPPQTMTTITGLEPYTKYEFRVLAVNMAGSVSSAWVSERTGESAPVFMIPPSVFPLSSYSLNISWEKPADNVTRGKVVGYDINMLSEQSPQQSIPMAFSQLLHTAKSQELSYTVEGLKPYRIYEFTITLCNSVGCVTSASGAGQTLAAAPAQLRPPLVKGINSTTIHLRWFPPEELNGPSPIYQLERRESSLPALMTTMMKGIRFIGNGYCKFPSSTHPVNTDFTGIKASFRTKVPEGLIVFAASPGNQEEYFALQLKKGRLYFLFDPQGSPVEVTTTNDHGKQYSDGKWHEIIAIRHQAFGQITLDGIYTGSSAILNGSTVIGDNTGVFLGGLPRSYTILRKDPEIIQKGFVGCLKDVHFMKNYNPSAIWEPLDWQSSEEQINVYNSWEGCPASLNEGAQFLGAGFLELHPYMFHGGMNFEISFKFRTDQLNGLLLFVYNKDGPDFLAMELKSGILTFRLNTSLAFTQVDLLLGLSYCNGKWNKVIIKKEGSFISASVNGLMKHASESGDQPLVVNSPVYVGGIPQELLNSYQHLCLEQGFGGCMKDVKFTRGAVVNLASVSSGAVRVNLDGCLSTDSAVNCRGNDSILVYQGKEQSVYEGGLQPFTEYLYRVIASHEGGSVYSDWSRGRTTGAAPQSVPTPSRVRSLNGYSIEVTWDEPVVRGVIEKYILKAYSEDSTRPPRMPSASAEFVNTSNLTGILTGLLPFKNYAVTLTACTLAGCTESSHALNISTPQEAPQEVQPPVAKSLPSSLLLSWNPPKKANGIITQYCLYMDGRLIYSGSEENYIVTDLAVFTPHQFLLSACTHVGCTNSSWVLLYTAQLPPEHVDSPVLTVLDSRTIHIQWKQPRKISGILERYVLYMSNHTHDFTIWSVIYNSTELFQDHMLQYVLPGNKYLIKLGACTGGGCTVSEASEALTDEDIPEGVPAPKAHSYSPDSFNVSWTEPEYPNGVITSYGLYLDGILIHNSSELSYRAYGFAPWSLHSFRVQACTAKGCALGPLVENRTLEAPPEGTVNVFVKTQGSRKAHVRWEAPFRPNGLLTHSVLFTGIFYVDPVGNNYTLLNVTKVMYSGEETNLWVLIDGLVPFTNYTVQVNISNSQGSLITDPITIAMPPGAPDGVLPPRLSSATPTSLQVVWSTPARNNAPGSPRYQLQMRSGDSTHGFLELFSNPSASLSYEVSDLQPYTEYMFRLVASNGFGSAHSSWIPFMTAEDKPGPVVPPILLDVKSRMMLVTWQHPRKSNGVITHYNIYLHGRLYLRTPGNVTNCTVMHLHPYTAYKFQVEACTSKGCSLSPESQTVWTLPGAPEGIPSPELFSDTPTSVIISWQPPTHPNGLVENFTIERRVKGKEEVTTLVTLPRSHSMRFIDKTSALSPWTKYEYRVLMSTLHGGTNSSAWVEVTTRPSRPAGVQPPVVTVLEPDAVQVTWKPPLIQNGDILSYEIHMPDPHITLTNVTSAVLSQKVTHLIPFTNYSVTIVACSGGNGYLGGCTESLPTYVTTHPTVPQNVGPLSVIPLSESYVVISWQPPSKPNGPNLRYELLRRKIQQPLASNPPEDLNRWHNIYSGTQWLYEDKGLSRFTTYEYMLFVHNSVGFTPSREVTVTTLAGLPERGANLTASVLNHTAIDVRWAKPTVQDLQGEVEYYTLFWSSATSNDSLKILPDVNSHVIGHLKPNTEYWIFISVFNGVHSINSAGLHATTCDGEPQGMLPPEVVIINSTAVRVIWTSPSNPNGVVTEYSIYVNNKLYKTGMNVPGSFILRDLSPFTIYDIQVEVCTIYACVKSNGTQITTVEDTPSDIPTPTIRGITSRSLQIDWVSPRKPNGIILGYDLLWKTWYPCAKTQKLVQDQSDELCKAVRCQKPESICGHICYSSEAKVCCNGVLYNPKPGHRCCEEKYIPFVLNSTGVCCGGRIQEAQPNHQCCSGYYARILPGEVCCPDEQHNRVSVGIGDSCCGRMPYSTSGNQICCAGRLHDGHGQKCCGRQIVSNDLECCGGEEGVVYNRLPGMFCCGQDYVNMSDTICCSASSGESKAHIKKNDPVPVKCCETELIPKSQKCCNGVGYNPLKYVCSDKISTGMMMKETKECRILCPASMEATEHCGRCDFNFTSHICTVIRGSHNSTGKASIEEMCSSAEETIHTGSVNTYSYTDVNLKPYMTYEYRISAWNSYGRGLSKAVRARTKEDVPQGVSPPTWTKIDNLEDTIVLNWRKPIQSNGPIIYYILLRNGIERFRGTSLSFSDKEGIQPFQEYSYQLKACTVAGCATSSKVVAATTQGVPESILPPSITALSAVALHLSWSVPEKSNGVIKEYQIRQVGKGLIHTDTTDRRQHTVTGLQPYTNYSFTLTACTSAGCTSSEPFLGQTLQAAPEGVWVTPRHIIINSTTVELYWSLPEKPNGLVSQYQLSRNGNLLFLGGSEEQNFTDKNLEPNSRYTYKLEVKTGGGSSASDDYIVQTPMSTPEEIYPPYNITVIGPYSIFVAWIPPGILIPEIPVEYNVLLNDGSVTPLAFSVGHHQSTLLENLTPFTQYEIRIQACQNGSCGVSSRMFVKTPEAAPMDLNSPVLKALGSACIEIKWMPPEKPNGIIINYFIYRRPAGIEEESVLFVWSEGALEFMDEGDTLRPFTLYEYRVRACNSKGSVESLWSLTQTLEAPPQDFPAPWAQATSAHSVLLNWTKPESPNGIISHYRVVYQERPDDPTFNSPTVHAFTVKGTSHQAHLYGLEPFTTYRIGVVAANHAGEILSPWTLIQTLESSPSGLRNFIVEQKENGRALLLQWSEPMRTNGVIKTYNIFSDGFLEYSGLNRQFLFRRLDPFTLYTLTLEACTRAGCAHSAPQPLWTDEAPPDSQLAPTVHSVKSTSVELSWSEPVNPNGKIIRYEVIRRCFEGKAWGNQTIQADEKIVFTEYNTERNTFMYNDTGLQPWTQCEYKIYTWNSAGHTCSSWNVVRTLQAPPEGLSPPVISYVSMNPQKLLISWIPPEQSNGIIQSYRLQRNEMLYPFSFDPVTFNYTDEELLPFSTYSYALQACTSGGCSTSKPTSITTLEAAPSEVSPPDLWAVSATQMNVCWSPPTVQNGKITKYLVRYDNKESLAGQGLCLLVSHLQPYSQYNFSLVACTNGGCTASVSKSAWTMEALPENMDSPTLQVTGSESIEITWKPPRNPNGQIRSYELRRDGTIVYTGLETRYRDFTLTPGVEYSYTVTASNSQGGILSPLVKDRTSPSAPSGMEPPKLQARGPQEILVNWDPPVRTNGDIINYTLFIRELFERETKIIHINTTHNSFGMQSYIVNQLKPFHRYEIRIQACTTLGCASSDWTFIQTPEIAPLMQPPPHLEVQMAPGGFQPTVSLLWTGPLQPNGKVLYYELYRRQIATQPRKSNPVLIYNGSSTSFIDSELLPFTEYEYQVWAVNSAGKAPSSWTWCRTGPAPPEGLRAPTFHVISSTQAVVNISAPGKPNGIVSLYRLFSSSAHGAETVLSEGMATQQTLHGLQAFTNYSIGVEACTCFNCCSKGPTAELRTHPAPPSGLSSPQIGTLASRTASFRWSPPMFPNGVIHSYELQFHVACPPDSALPCTPSQIETKYTGLGQKASLGGLQPYTTYKLRVVAHNEVGSTASEWISFTTQKELPQYRAPFSVDSNLSVVCVNWSDTFLLNGQLKEYVLTDGGRRVYSGLDTTLYIPRTADKTFFFQVICTTDEGSVKTPLIQYDTSTGLGLVLTTPGKKKGSRSKSTEFYSELWFIVLMAMLGLILLAIFLSLILQRKIHKEPYIRERPPLVPLQKRMSPLNVYPPGENHMGLADTKIPRSGTPVSIRSNRSACVLRIPSQNQTSLTYSQGSLHRSVSQLMDIQDKKVLMDNSLWEAIMGHNSGLYVDEEDLMNAIKDFSSVTKERTTFTDTHL,mutated_sequence,1.0,5202.0,NP_996816.3.a2m,NP_996816.3.npy,ClinVar
+NP_996820.1,NP_996820.1.csv,MAASQTSQTVASHVPFADLCSTLERIQKSKGRAEKIRHFREFLDSWRKFHDALHKNHKDVTDSFYPAMRLILPQLERERMAYGIKETMLAKLYIELLNLPRDGKDALKLLNYRTPTGTHGDAGDFAMIAYFVLKPRCLQKGSLTIQQVNDLLDSIASNNSAKRKDLIKKSLLQLITQSSALEQKWLIRMIIKDLKLGVSQQTIFSVFHNDAAELHNVTTDLEKVCRQLHDPSVGLSDISITLFSAFKPMLAAIADIEHIEKDMKHQSFYIETKLDGERMQMHKDGDVYKYFSRNGYNYTDQFGASPTEGSLTPFIHNAFKADIQICILDGEMMAYNPNTQTFMQKGTKFDIKRMVEDSDLQTCYCVFDVLMVNNKKLGHETLRKRYEILSSIFTPIPGRIEIVQKTQAHTKNEVIDALNEAIDKREEGIMVKQPLSIYKPDKRGEGWLKIKPEYVSGLMDELDILIVGGYWGKGSRGGMMSHFLCAVAEKPPPGEKPSVFHTLSRVGSGCTMKELYDLGLKLAKYWKPFHRKAPPSSILCGTEKPEVYIEPCNSVIVQIKAAEIVPSDMYKTGCTLRFPRIEKIRDDKEWHECMTLDDLEQLRGKASGKLASKHLYIGGDDEPQEKKRKAAPKMKKVIGIIEHLKAPNLTNVNKISNIFEDVEFCVMSGTDSQPKPDLENRIAEFGGYIVQNPGPDTYCVIAGSENIRVKNIILSNKHDVVKPAWLLECFKTKSFVPWQPRFMIHMCPSTKEHFAREYDCYGDSYFIDTDLNQLKEVFSGIKNSNEQTPEEMASLIADLEYRYSWDCSPLSMFRRHTVYLDSYAVINDLSTKNEGTRLAIKALELRFHGAKVVSCLAEGVSHVIIGEDHSRVADFKAFRRTFKRKFKILKESWVTDSIDKCELQEENQYLI,mutated_sequence,1.0,911.0,NP_996820.1.a2m,NP_996820.1.npy,ClinVar
+NP_997244.4,NP_997244.4.csv,MHSAGTPGLSSRRTGNSTSFQPGPPPPPRLLLLLLLLLSLVSRVPAQPAAFGRALLSPGLAGAAGVPAEEAIVLANRGLRVPFGREVWLDPLHDLVLQVQPGDRCAVSVLDNDALAQRPGRLSPKRFPCDFGPGEVRYSHLGARSPSRDRVRLQLRYDAPGGAVVLPLVLEVEVVFTQLEVVTRNLPLVVEELLGTSNALDARSLEFAFQPETEECRVGILSGLGALPRYGELLHYPQVPGGAREGGAPETLLMDCKAFQELGVRYRHTAASRSPNRDWIPMVVELRSRGAPVGSPALKREHFQVLVRIRGGAENTAPKPSFVAMMMMEVDQFVLTALTPDMLAAEDAESPSDLLIFNLTSPFQPGQGYLVSTDDRSLPLSSFTQRDLRLLKIAYQPPSEDSDQERLFELELEVVDLEGAASDPFAFMVVVKPMNTMAPVVTRNTGLILYEGQSRPLTGPAGSGPQNLVISDEDDLEAVRLEVVAGLRHGHLVILGASSGSSAPKSFTVAELAAGQVVYQHDDRDGSLSDNLVLRMVDGGGRHQVQFLFPITLVPVDDQPPVLNANTGLTLAEGETVPILPLSLSATDMDSDDSLLLFVLESPFLTTGHLLLRQTHPPHEKQELLRGLWRKEGAFYERTVTEWQQQDITEGRLFYRHSGPHSPGPVTDQFTFRVQDNHDPPNQSGLQRFVIRIHPVDRLPPELGSGCPLRMVVQESQLTPLRKKWLRYTDLDTDDRELRYTVTQPPTDTDENHLPAPLGTLVLTDNPSVVVTHFTQAQINHHKIAYRPPGQELGVATRVAQFQFQVEDRAGNVAPGTFTLYLHPVDNQPPEILNTGFTIQEKGHHILSETELHVNDVDTDVAHISFTLTQAPKHGHMRVSGQILHVGGLFHLEDIKQGRVSYAHNGDKSLTDSCSLEVSDRHHVVPITLRVNVRPVDDEVPILSHPTGTLESYLDVLENGATEITANVIKGTNEETDDLMLTFLLEDPPLYGEILVNGIPAEQFTQRDILEGSVVYTHTSGEIGLLPKADSFNLSLSDMSQEWRIGGNTIQGVTIWVTILPVDSQAPEIFVGEQLIVMEGDKSVITSVHISAEDVDSLNDDILCTIVIQPTSGYVENISPAPGSEKSRAGIAISAFNLKDLRQGHINYVQSVHKGVEPVEDRFVFRCSDGINFSERQFFPIVIIPTNDEQPEMFMREFMVMEGMSLVIDTPILNAADADVPLDDLTFTITQFPTHGHIMNQLINGTVLVESFTLDQIIESSSIIYEHDDSETQEDSFVIKLTDGKHSVEKTVLIIVIPVDDETPRMTINNGLEIEIGDTKIINNKILMATDLDSEDKSLVYIIRYGPGHGLLQRRKPTGAFENITLGMNFTQDEVDRNLIQYVHLGQEGIRDLIKFDVTDGINPLIDRYFYVSIGSIDIVFPDVISKGVSLKEGGKVTLTTDLLSTSDLNSPDENLVFTITRAPMRGHLECTDQPGVSITSFTQLQLAGNKIYYIHTADDEVKMDSFEFQVTDGRNPVFRTFRISISDVDNKKPVVTIHKLVVSESENKLITPFELTVEDRDTPDKLLKFTITQVPIHGHLLFNNTRPVMVFTKQDLNENLISYKHDGTESSEDSFSFTVTDGTHTDFYVFPDTVFETRRPQVMKIQVLAVDNSVPQIAVNKGASTLRTLATGHLGFMITSKILKVEDRDSLHISLRFIVTEAPQHGYLLNLDKGNHSITQFTQADIDDMKICYVLREGANATSDMFYFAVEDGGGNKLTYQNFRLNWAWISFEKEYYLVNEDSKFLDVVLKRRGYLGETSFISIGTRDRTAEKDKDFKGKAQKQVQFNPGQTRATWRVRILSDGEHEQSETFQVVLSEPVLAALEFPTVATVEIVDPGDEPTVFIPQSKYSVEEDVGELFIPIRRSGDVSQELMVVCYTQQGTATGTVPTSVLSYSDYISRPEDHTSVVRFDKDEREKLCRIVIIDDSLYEEEETFHVLLSMPMGGRIGSEFPGAQVTIVPDKDDEPIFYFGDVEYSVDESAGYVEVQVWRTGTDLSKSSSVTVRSRKTDPPSADAGTDYVGISRNLDFAPGVNMQPVRVVILDDLGQPALEGIEKFELVLRMPMNAALGEPSKATVSINDSVSDLPKMQFKERIYTGSESDGQIVTMIHRTGDVQYRSSVRCYTRQGSAQVMMDFEERPNTDTSIITFLPGETEKPCILELMDDVLYEEVEELRLVLGTPQSNSPFGAAVGEQNETLIRIRDDADKTVIKFGETKFSVTEPKEPGESVVIRIPVIRQGDTSKVSIVRVHTKDGSATSGEDYHPVSEEIEFKEGETQHVVEIEVTFDGVREMREAFTVHLKPDENMIAEMQLTKAIVYIEEMSSMADVTFPSVPQIVSLLMYDDTSKAKESAEPMSGYPVICITACNPKYSDYDKTGSICASENINDTLTRYRWLISAPAGPDGVTSPMREVDFDTFFTSSKMVTLDSIYFQPGSRVQCAARAVNTNGDEGLELMSPIVTISREEGLCQPRVPGVVGAEPFSAKLRYTGPEDADYTNLIKLTVTMPHIDGMLPVISTRELSNFELTLSPDGTRVGNHKCSNLLDYTEVKTHYGFLTDATKNPEIIGETYPYQYSLSIRGSTTLRFYRNLNLEACLWEFVSYYDMSELLADCGGTIGTDGQVLNLVQSYVTLRVPLYVSYVFHSPVGVGGWQHFDLKSELRLTFVYDTAILWNDGIGSPPEAELQGSLYPTSMRIGDEGRLAVHFKTEAQFHGLFVLSHPASFTSSVIMSADHPGLTFSLRLIRSEPTYNQPVQQWSFVSDFAVRDYSGTYTVKLVPCTAPSHQEYRLPVTCNPREPVTFDLDIRFQQVSDPVAAEFSLNTQMYLLSKKSLWLSDGSMGFGQESDVAFAEGDIIYGRVMVDPVQNLGDSFYCSIEKVFLCTGADGYVPKYSPMNAEYGCLADSPSLLYRFKIVDKAQPETQATSFGNVLFNAKLAVDDPEAILLVNQPGSDGFKVDSTPLFQVALGREWYIHTIYTVRSKDNANRGIGKRSVEYHSLVSQGKPQSTTKSRKKREIRSTPSLAWEIGAENSRGTNIQHIALDRTKRQIPHGRAPPDGILPWELNSPSSAVSLVTVVGGTTVGLLTICLTVIAVLMCRGKESFRGKDAPKGSSSSEPMVPPQSHHNDSSEV,mutated_sequence,1.0,3169.0,NP_997244.4.a2m,NP_997244.4.npy,ClinVar
+NP_997717.2,NP_997717.2.csv,MAAWSPAAAAPLLRGIRGLPLHHRMFATQTEGELRVTQILKEKFPRATAIKVTDISGGCGAMYEIKIESEEFKEKRTVQQHQMVNQALKEEIKEMHGLRIFTSVPKR,mutated_sequence,1.0,107.0,NP_997717.2.a2m,NP_997717.2.npy,ClinVar
+UPI0000125CAE,UPI0000125CAE.csv,MAATGTAAAAATGRLLLLLLVGLTAPALALAGYIEALAANAGTGFAVAEPQIAMFCGKLNMHVNIQTGKWEPDPTGTKSCFETKEEVLQYCQEMYPELQITNVMEANQRVSIDNWCRRDKKQCKSRFVTPFKCLVGEFVSDVLLVPEKCQFFHKERMEVCENHQHWHTVVKEACLTQGMTLYSYGMLLPCGVDQFHGTEYVCCPQTKIIGSVSKEEEEEDEEEEEEEDEEEDYDVYKSEFPTEADLEDFTEAAVDEDDEDEEEGEEVVEDRDYYYDTFKGDDYNEENPTEPGSDGTMSDKEITHDVKAVCSQEAMTGPCRAVMPRWYFDLSKGKCVRFIYGGCGGNRNNFESEDYCMAVCKAMIPPTPLPTNDVDVYFETSADDNEHARFQKAKEQLEIRHRNRMDRVKKEWEEAELQAKNLPKAERQTLIQHFQAMVKALEKEAASEKQQLVETHLARVEAMLNDRRRMALENYLAALQSDPPRPHRILQALRRYVRAENKDRLHTIRHYQHVLAVDPEKAAQMKSQVMTHLHVIEERRNQSLSLLYKVPYVAQEIQEEIDELLQEQRADMDQFTASISETPVDVRVSSEESEEIPPFHPFHPFPALPENEDTQPELYHPMKKGSGVGEQDGGLIGAEEKVINSKNKVDENMVIDETLDVKEMIFNAERVGGLEEERESVGPLREDFSLSSSALIGLLVIAVAIATVIVISLVMLRKRQYGTISHGIVEVDPMLTPEERHLNKMQNHGYENPTYKYLEQMQI,mutated_sequence,1.0,763.0,UPI0000125CAE.a2m,UPI0000125CAE.npy,gnomAD
+UPI000013C672,UPI000013C672.csv,MAKKVAVIGAGVSGLISLKCCVDEGLEPTCFERTEDIGGVWRFKENVEDGRASIYQSVVTNTSKEMSCFSDFPMPEDFPNFLHNSKLLEYFRIFAKKFDLLKYIQFQTTVLSVRKCPDFSSSGQWKVVTQSNGKEQSAVFDAVMVCSGHHILPHIPLKSFPGMERFKGQYFHSRQYKHPDGFEGKRILVIGMGNSGSDIAVELSKNAAQVFISTRHGTWVMSRISEDGYPWDSVFHTRFRSMLRNVLPRTAVKWMIEQQMNRWFNHENYGLEPQNKYIMKEPVLNDDVPSRLLCGAIKVKSTVKELTETSAIFEDGTVEENIDVIIFATGYSFSFPFLEDSLVKVENNMVSLYKYIFPAHLDKSTLACIGLIQPLGSIFPTAELQARWVTRVFKGLCSLPSERTMMMDIIKRNEKRIDLFGESQSQTLQTNYVDYLDELALEIGAKPDFCSLLFKDPKLAVRLYFGPCNSY,mutated_sequence,1.0,471.0,UPI000013C672.a2m,UPI000013C672.npy,gnomAD
+UPI000035AA82,UPI000035AA82.csv,MLRYLLKTLLQMNLFADSLAGDISNSSELLLGFNSSLAALNHTLLPPGDPSLNGSRVGPEDAMPRIVEQPPDLLVSRGEPATLPCRAEGRPRPNIEWYKNGARVATVREDPRAHRLLLPSGALFFPRIVHGRRARPDEGVYTCVARNYLGAAASRNASLEVAVLRDDFRQSPGNVVVAVGEPAVLECVPPRGHPEPSVSWRKDGARLKEEEGRITIRGGKLMMSHTLKSDAGMYVCVASNMAGERESAAAEVMVLERPSFLRRPVNQVVLADAPVTFLCEVKGDPPPRLRWRKEDGELPTGRYEIRSDHSLWIGHVSAEDEGTYTCVAENSVGRAEASGSLSVHVPPQLVTQPQDQMAAPGESVAFQCETKGNPPPAIFWQKEGSQVLLFPSQSLQPTGRFSVSPRGQLNITAVQRGDAGYYVCQAVSVAGSILAKALLEIKGASLDGLPPVILQGPANQTLVLGSSVWLPCRVTGNPQPSVRWKKDGQWLQGDDLQFKTMANGTLYIANVQEMDMGFYSCVAKSSTGEATWSGWLKMREDWGVSPDPPTEPSSPPGAPSQPVVTEITKNSITLTWKPNPQTGAAVTSYVIEAFSPAAGNTWRTVADGVQLETHTVSGLQPNTIYLFLVRAVGAWGLSEPSPVSEPVRTQDSSPSRPVEDPWRGQQGLAEVAVRLQEPIVLGPRTLQVSWTVDGPVQLVQGFRVSWRVAGPEGGSWTMLDLQSPSQQSTVLRGLPPGTQIQIKVQAQGQEGLGAESLSVTRSIPEEAPSGPPQGVAVALGGDGNSSITVSWEPPLPSQQNGVITEYQIWCLGNESRFHLNRSAAGWARSAMLRGLVPGLLYRTLVAAATSAGVGVPSAPVLVQLPSPPDLEPGLEVGAGLAVRLARVLREPAFLAGSGAACGALLLGLCAALYWRRKQRKELSHYTASFAYTPAVSFPHSEGLSGASSRPPMGLGPAPYSWLADSWPHPSRSPSAQEPRGSCCPSNPDPDDRYYNEAGISLYLAQTARGTAAPGEGPVYSTIDPAGEELQTFHGGFPQHPSGDLGPWSQYAPPEWSQGDSGAKGGKVKLLGKPVQMPSLNWPEALPPPPPSCELSCLEGPEEELEGSSEPEEWCPPMPERSHLTEPSSSGGCLVTPSRRETPSPTPSYGQQSTATLTPSPPDPPQPPTDMPHLHQMPRRVPLGPSSPLSVSQPMLGIREARPAGLGAGPAASPHLSPSPAPSTASSAPGRTWQGNGEMTPPLQGPRARFRKKPKALPYRRENSPGDLPPPPLPPPEEEASWALELRAAGSMSSLERERSGERKAVQAVPLAAQRVLHPDEEAWLPYSRPSFLSRGQGTSTCSTAGSNSSRGSSSSRGSRGPGRSRSRSQSRSQSQRPGQKRREEPR,mutated_sequence,1.0,1386.0,UPI000035AA82.a2m,UPI000035AA82.npy,gnomAD
+UPI00001D7B55,UPI00001D7B55.csv,MNYPGRGSPRSPEHNGRGGGGGAWELGSDARPAFGGGVCCFEHLPGGDPDDGDVPLALLRGEPGLHLAPGTDDHNHHLALDPCLSDENYDFSSAESGSSLRYYSEGESGGGGSSLSLHPPQQPPLVPTNSGGGGATGGSPGERKRTRLGGPAARHRYEVVTELGPEEVRWFYKEDKKTWKPFIGYDSLRIELAFRTLLQTTGARPQGGDRDGDHVCSPTGPASSSGEDDDEDRACGFCQSTTGHEPEMVELVNIEPVCVRGGLYEVDVTQGECYPVYWNQADKIPVMRGQWFIDGTWQPLEEEESNLIEQEHLNCFRGQQMQENFDIEVSKSIDGKDAVHSFKLSRNHVDWHSVDEVYLYSDATTSKIARTVTQKLGFSKASSSGTRLHRGYVEEATLEDKPSQTTHIVFVVHGIGQKMDQGRIIKNTAMMREAARKIEERHFSNHATHVEFLPVEWRSKLTLDGDTVDSITPDKVRGLRDMLNSSAMDIMYYTSPLYRDELVKGLQQELNRLYSLFCSRNPDFEEKGGKVSIVSHSLGCVITYDIMTGWNPVRLYEQLLQKEEELPDERWMSYEERHLLDELYITKRRLKEIEERLHGLKASSMTQTPALKFKVENFFCMGSPLAVFLALRGIRPGNTGSQDHILPREICNRLLNIFHPTDPVAYRLEPLILKHYSNISPVQIHWYNTSNPLPYEHMKPSFLNPAKEPTSVSENEGISTIPSPVTSPVLSRRHYGESITNIGKASILGAASIGKGLGGMLFSRFGRSSTTQSSETSKDSMEDEKKPVASPSATTVGTQTLPHSSSGFLDSAYFRLQESFFNLPQLLFPENVMQNKDNALVELDHRIDFELREGLVESRYWSAVTSHTAYWSSLDVALFLLTFMYKHEHDDDAKPNLDPI,mutated_sequence,1.0,900.0,UPI00001D7B55.a2m,UPI00001D7B55.npy,gnomAD
+UPI000059D713,UPI000059D713.csv,MEAVKAEAWEGAAVAQDLLALGYGGVPGAASRGASCPDFRGLCVRLAAELATLGALEQQREAGAEVLSAGDGPGAEEDFLRQLGSLLRELHCPDRALCGGDGAAALREPGAGLRLLRFLCSELQATRLLCLRSLLDPSPRPPLGEGVVEGAGMVQELDLTLQALGLPRPAPGTPASQLLQELHAKISELQPSLPPGSLQPLLSCSLDAPRWEALESLSQSLRDQYRCRRCLLLKRLDLTTSAFHWSDRAEAQGEAMRAVLIPIREVLTPESDISIAHVLAARADLSCLVPATSVAVRRGTCCAINKVLMGNVPDRGGRPNELEPPMPTWRSRREDGGPQCWGRKKKKKK,mutated_sequence,1.0,349.0,UPI000059D713.a2m,UPI000059D713.npy,gnomAD
+UPI0000246EE9,UPI0000246EE9.csv,MGNAAGSAEQPAGPAAPPPKQPAPPKQPMPAAGELEERFNRALNCMNLPPDKVQLLSQYDNEKKWELICDQERFQVKNPPAAYIQKLKSYVDTGGVSRKVAADWMSNLGFKRRVQESTQVLRELETSLRTNHIGWVQEFLNEENRGLDVLLEYLAFAQCSVTYDMESTDNGASNSEKNKPLEQSVEDLSKGPPSSVPKSRHLTIKLTPAHSRKALRNSRIVSQKDDVHVCIMCLRAIMNYQSGFSLVMNHPACVNEIALSLNNKNPRTKALVLELLAAVCLVRGGHDIILAAFDNFKEVCGEQHRFEKLMEYFRNEDSNIDFMVACMQFINIVVHSVENMNFRVFLQYEFTHLGLDLYLERLRLTESDKLQVQIQAYLDNIFDVGALLEDTETKNAVLEHMEELQEQVALLTERLRDAENESMAKIAELEKQLSQARKELETLRERFSESTAMGPSRRPPEPEKAPPAAPTRPSALELKVEELEEKGLIRILRGPGDAVSIEILPVAVATPSGGDAPTPGVPTGSPSPDLAPAAEPAPGAAPPPPPPLPGLPSPQEAPPSAPPQAPPLPGSPEPPPAPPLPGDLPPPPPPPPPPPGTDGPVPPPPPPPPPPPGGPPDALGRRDSELGPGVKAKKPIQTKFRMPLLNWVALKPSQITGTVFTELNDEKVLQELDMSDFEEQFKTKSQGPSLDLSALKSKAAQKAPSKATLIEANRAKNLAITLRKGNLGAERICQAIEAYDLQALGLDFLELLMRFLPTEYERSLITRFEREQRPMEELSEEDRFMLCFSRIPRLPERMTTLTFLGNFPDTAQLLMPQLNAIIAASMSIKSSDKLRQILEIVLAFGNYMNSSKRGAAYGFRLQSLDALLEMKSTDRKQTLLHYLVKVIAEKYPQLTGFHSDLHFLDKAGSVSLDSVLADVRSLQRGLELTQREFVRQDDCMVLKEFLRANSPTMDKLLADSKTAQEAFESVVEYFGENPKTTSPGLFFSLFSRFIKAYKKAEQEVEQWKKEAAAQEAGADTPGKGEPPAPKSPPKARRPQMDLISELKRRQQKEPLIYESDRDGAIEDIITVIKTVPFTARTGKRTSRLLCEASLGEEMPL,mutated_sequence,1.0,1100.0,UPI0000246EE9.a2m,UPI0000246EE9.npy,gnomAD
+UPI000003C716,UPI000003C716.csv,MKFLLLVLAALGFLTQVIPASAGGSKCVSNTPGYCRTCCHWGETALFMCNASRKCCISYSFLPKPDLPQLIGNHWQSRRRNTQRKDKKQQTTVTS,mutated_sequence,1.0,95.0,UPI000003C716.a2m,UPI000003C716.npy,gnomAD
+UPI0001D3B05C,UPI0001D3B05C.csv,AGLSPEPRAGVGSEFPAWFLGGSSQRRNMALLGSRAELEADEDVFEDALETISIWLKKKSGRREGKTSS,mutated_sequence,1.0,69.0,UPI0001D3B05C.a2m,UPI0001D3B05C.npy,gnomAD
+UPI000003C48A,UPI000003C48A.csv,MKRPKLKKASKRMTCHKRYKIQKKVREHHRKLRKEAKKRGHKKPRKDPGVPNSAPFKEALLREAELRKQRLEELKQQQKLDRQKELEKKRKLETNPDIKPSNVEPMEKEFGLCKTENKAKSGKQNSKKLYCQELKKVIEASDVVLEVLDARDPLGCRCPQVEEAIVQSGQKKLVLILNKSDLVPKENLESWLNYLKKELPTVVFRASTKPKDKGKITKRVKAKKNAAPFRSEVCFGKEGLWKLLGGFQETCSKAIRVGVIGFPNVGKSSIINSLKQEQMCNVGVSMGLTRSMQVVPLDKQITIIDSPSFIVSPLNSSSALALRSPASIEVVKPMEAASAILSQADARQVVLKYTVPGYRNSLEFFTVLAQRRGMHQKGGIPNVEGAAKLLWSEWTGASLAYYCHPPTSWTPPPYFNESIVVDMKSGFNLEELEKNNAQSIRAIKGPHLANSILFQSSGLTNGIIEEKDIHEELPKRKERKQEEREDDKDSDQETVDEEVDENSSGMFAAEETGEALSEETTAGEQSTRSFILDKIIEEDDAYDFSTDYV,mutated_sequence,1.0,549.0,UPI000003C48A.a2m,UPI000003C48A.npy,gnomAD
+UPI0000206D91,UPI0000206D91.csv,MAFSKGFRIYHKLDPPPFSLIVETRHKEECLMFESGAVAVLSSAEKEAIKGTYSKVLDAYGLLGVLRLNLGDTMLHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRIDSSDEDRISEVRKVLNSGNFYFAWSASGISLDLSLNAHRSMQEQTTDNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQAKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVVYLDDSVSSFIQIRGSVPLFWEQPGLQVGSHRVRMSRGFEANAPAFDRHFRTLKNLYGKQIIVNLLGSKEGEHMLSKAFQSHLKASEHAADIQMVNFDYHQMVKGGKAEKLHSVLKPQVQKFLDYGFFYFNGSEVQRCQSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKPQLVTRFQEVFRSMWSVNGDSISKIYAGTGALEGKAKLKDGARSVTRTIQNNFFDSSKQEAIDVLLLGNTLNSDLADKARALLTTGSLRVSEQTLQSASSKVLKSMCENFYKYSKPKKIRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVSASTTNQKLWAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRGFLEGKVTFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRAELKTSDHRPVVALIDIDIFEVEAEERQNIYKEVIAVQGPPDGTVLVSIKSSLPENNFFDDALIDELLQQFASFGEVILIRFVEDKMWVTFLEGSSALNVLSLNGKELLNRTITIALKSPDWIKNLEEEMSLEKISIALPSSTSSTLLGEDAEVAADFDMEGDVDDYSAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTISEGPVPSLPIRPSRAPSRTPGPPSAQSSPIDAQPATPLPQKDPAQPLEPKRPPPPRPVAPPTRPAPPQRPPPPSGARSPAPTRKEFGGIGAPPSPGVARREMEAPKSPGTTRKDNIGRSQPSPQAGLAGPGPAGYSTARPTIPPRAGVISAPQSHARASAGRLTPESQSKTSETSKGSTFLPEPLKPQAAFPPQSSLPPPAQRLQEPLVPVAAPMPQSGPQPNLETPPQPPPRSRSSHSLPSEASSQPQVKTNGISDGKRESPLKIDPFEDLSFNLLAVSKAQLSVQTSPVPTPDPKRLIQLPSATQSNVLSSVSCMPTMPPIPARSQSQENMRSSPNPFITGLTRTNPFSDRTAAPGNPFRAKSEESEATSWFSKEEPVTISPFPSLQPLGHNKSRASSSLDGFKDSFDLQGQSTLKISNPKGWVTFEEEEDFGVKGKSKSACSDLLGNQPSSFSGSNLTLNDDWNKGTNVSFCVLPSRRPPPPPVPLLPPGTSPPVDPFTTLASKASPTLDFTER,mutated_sequence,1.0,1573.0,UPI0000206D91.a2m,UPI0000206D91.npy,gnomAD
+UPI00000721F7,UPI00000721F7.csv,MGDMKTPDFDDLLAAFDIPDIDANEAIHSGPEENEGPGGPGKPEPGVGSESEDTAAASAGDGPGVPAQASDHGLPPPDISVVSVIVKNTVCPEQSEALAGGSAGDGAQAAGVTKEGPVGPHRMQNGFGSPEPSLPGTPHSPAPPSGGTWKEKGMEGKTPLDLFAHFGPEPGDHSDPLPPSAPSPTREGALTPPPFPSSFELAQENGPGMQPPVSSPPLGALKQESCSPHHPQVLAQQGSGSSPKATDIPASASPPPVAGVPFFKQSPGHQSPLASPKVPVCQPLKEEDDDEGPVDKSSPGSPQSPSSGAEAADEDSNDSPASSSSRPLKVRIKTIKTSCGNITRTVTQVPSDPDPPAPLAEGAFLAEASLLKLSPATPTSEGPKVVSVQLGDGTRLKGTVLPVATIQNASTAMLMAASVARKAVVLPGGTATSPKMIAKNVLGLVPQALPKADGRAGLGTGGQKVNGASVVMVQPSKTATGPSTGGGTVISRTQSSLVEAFNKILNSKNLLPAYRPNLSPPAEAGLALPPTGYRCLECGDAFSLEKSLARHYDRRSMRIEVTCNHCARRLVFFNKCSLLLHAREHKDKGLVMQCSHLVMRPVALDQMVGQPDITPLLPVAVPPVSGPLALPALGKGEGAITSSAITTVAAEAPVLPLSTEPPAAPATSAYTCFRCLECKEQCRDKAGMAAHFQQLGPPAPGATSNVCPTCPMMLPNRCSFSAHQRMHKNRPPHVCPECGGNFLQANFQTHLREACLHVSRRVGYRCPSCSVVFGGVNSIKSHIQTSHCEVFHKCPICPMAFKSGPSAHAHLYSQHPSFQTQQAKLIYKCAMCDTVFTHKPLLSSHFDQHLLPQRVSVFKCPSCPLLFAQKRTMLEHLKNTHQSGRLEETAGKGAGGALLTPKTEPEELAVSQGGAAPATEESSSSSEEEEVPSSPEPPRPAKRPRRELGSKGLKGGGGGPGGWTCGLCHSWFPERDEYVAHMKKEHGKSVKKFPCRLCERSFCSAPSLRRHVRVNHEGIKRVYPCRYCTEGKRTFSSRLILEKHVQVRHGLQLGAQSPGRGTTLARGSSARAQGPGRKRRQSSDSCSEEPDSTTPPAKSPRGGPGSGGHGPLRYRSSSSTEQSLMMGLRVEDGAQQCLDCGLCFASPGSLSRHRFISHKKRRGVGKASALGLGDGEEEAPPSRSDPDGGDSPLPASGGPLTCKVCGKSCDSPLNLKTHFRTHGMAFIRARQGAVGDN,mutated_sequence,1.0,1237.0,UPI00000721F7.a2m,UPI00000721F7.npy,gnomAD
+UPI000192952A,UPI000192952A.csv,MRVLACLLAALVGIQAVERLRLADGPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACRQLGCGGALAAPGGAFFGEGAGPVWLSELACRGNEGQLGLCHHRGWKAHICSHEEDAGVVCAGQRVANSRDDSTSPLDGAPWPGLLLELSPSTEEPLVTHAPRPAGNPQNASRKKSPRPKQAKSTRAPLLTTGAPRQERLRLVSGPHRCAGRLEVWHGGRWGTVCDDGWDLRDAAVACRELGCGGALAAPGGARFGPGAGPVWMDDVGCGGGEQALRDCPRSPWGRSNCDHSEDAGLVCTGPAPRLRLADGPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACRELGCGGALAAPGGAFFGEGSGPIILDDLRCRGNETALRFCPARPWGQHDCHHREDAGAVCDGMPLGYVPPTAPTDSNNSTPREAASRPPSTMTSQAPGTAGVSPPPASPTVLWEPGPEAGSPQLRLVAGPSKCSGRLEVWHDQRWGTVCDDSWDMRDSAVVCRELGCGGPQQPDPAAGRFGWGAGPIWLDDVGCVGTEASLSDCPAAPWGKHNCAHNEDVGVTCTGPPGLDSISDPFSWSWIPGLGRDRDAWLPGELATKPSASVTASVLEKTTTKAPGKMPKSTKKWVTKNAKRPTTQPPVMPTTKHSRAQSPPDLTSQTTAALTTEASRRPTSEFTRRPTTEAPQRWTSHTTATLTPQAPRERTTKTMAMLTTQGPQEMTSESTIKSIPQASLEPSAEIPEGSPESPKDPAPSPSVSTTGESGLFRVRLADGPNRCAGRLEVWHAGRWGTVCDDNWDLRDATVACWELGCGKVRPRVGKTHYGPGTGPIWLDDMGCKGSEASLSDCPSGAWGKHNCDHEEDVGLTCTGYTDYDDYPPWTWDPTSREDLAKGTTTAGVPGHTLPWRTTRRPGSSSPAIRRLPDTGSKDGYKLPWTWDTPSGRGLAEGTPTAGKLGPTLGAGTTRSPGSPPTLRVHGDTGSPRKPWPERRPPRPAATRTAPPTPSPGPSASPGPPGPALTSDSSRELTPHSALTSEATSDAPDTSPPTPDPASRTNPDLILTSPDFALSTPDSSVVPALTPEPSPTPLPTLPKELTSDPSTPSEVTSLSPTSEQVPESDTTPDLDTTPYSSTVSEYSRSPDPSPSPHPTTTPDPTMAPDPITTLNPTVTPHFPTTPHPTTTPHPTTITHSTMIPDPTTTPQPFTTITHSTMIPDPTTTPQPFTTMQPTTTPHSTTPHPTTTPHPTTITHSTMIPDPTTTPQPFTTMQPTTMPHPTTTPHPTTTPHPTTTPHPTTTPHPTMTPDPTTTPYPTTTPDPTTTPHPTTPDPSSTPVITTVSLPTSLGTELSSPTLAPTVKPSLHPQLTFTAPAPHTSTSQIPTLEPSPALESSPSRSSTATSMDPLSTEDFKPPRSQSPNLTPPPTHTPHSASDLTVSPDPLLSPTAHPLDHPPLDPLTLGPTPGQSPGPHGPCVAPTPPVRVMACEPPALVELVAAVRDVGGQLQRLTQVVEQERQERQALLLGLTQLVEAARGLGQLGEAVKRLAEMAWTTSMPAPTTTTPEEEERPLRGDV,mutated_sequence,1.0,1573.0,UPI000192952A.a2m,UPI000192952A.npy,gnomAD
+UPI000006EC9A,UPI000006EC9A.csv,MGSMFRSEEVALVQLFLPTAAAYTCVSRLGELGLVEFRDLNASVSAFQRRFVVDVRRCEELEKTFTFLQEEVRRAGLVLPPPKGRLPAPPPRDLLRIQEETERLAQELRDVRGNQQALRAQLHQLQLHAAVLRQGHEPQLAAAHTDGASERTPLLQAPGGPHQDLRVNFVAGAVEPHKAPALERLLWRACRGFLIASFRELEQPLEHPVTGEPATWMTFLISYWGEQIGQKIRKITDCFHCHVFPFLQQEEARLGALQQLQQQSQELQEVLGETERFLSQVLGRVLQLLPPGQVQVHKMKAVYLALNQCSVSTTHKCLIAEAWCSVRDLPALQEALRDSSMEEGVSAVAHRIPCRDMPPTLIRTNRFTASFQGIVDAYGVGRYQEVNPAPYTIITFPFLFAVMFGDVGHGLLMFLFALAMVLAENRPAVKAAQNEIWQTFFRGRYLLLLMGLFSIYTGFIYNECFSRATSIFPSGWSVAAMANQSGWSDAFLAQHTMLTLDPNVTGVFLGPYPFGIDPIWSLAANHLSFLNSFKMKMSVILGVVHMAFGVVLGVFNHVHFGQRHRLLLETLPELTFLLGLFGYLVFLVIYKWLCVWAARAASAPSILIHFINMFLFSHSPSNRLLYPRQEVVQATLVVLALAMVPILLLGTPLHLLHRHRRRLRRRPADRQEENKAGLLDLPDASVNGWSSDEEKAGGLDDEEEAELVPSEVLMHQAIHTIEFCLGCVSNTASYLRLWALSLAHAQLSEVLWAMVMRIGLGLGREVGVAAVVLVPIFAAFAVMTVAILLVMEGLSAFLHALRLHWVEFQNKFYSGTGYKLSPFTFAATDD,mutated_sequence,1.0,830.0,UPI000006EC9A.a2m,UPI000006EC9A.npy,gnomAD
+UPI00000726A9,UPI00000726A9.csv,MSSTLPALLCVGLCLSQRISAQQQTLPKPFIWAEPHFMVPKEKQVTICCQGNYGAVEYQLHFEGSLFAVDRPKPPERINKVKFYIPDMNSRMAGQYSCIYRVGELWSEPSNLLDLVVTEMYDTPTLSVHPGPEVISGEKVTFYCRLDTATSMFLLLKEGRSSHVQRGYGKVQAEFPLGPVTTAHRGTYRCFGSYNNHAWSFPSEPVKLLVTGDIENTSLAPEDPTFPADTWGTYLLTTETGLQKDHALWDHTAQNLLRMGLAFLVLVALVWFLVEDWLSRKRTRERASRASTWEGRRRLNTQTL,mutated_sequence,1.0,304.0,UPI00000726A9.a2m,UPI00000726A9.npy,gnomAD
+UPI000173A2B0,UPI000173A2B0.csv,MGEKVSEAPEPVPRGCSGHGSRTPASALVAASSPGASSAESSSGSETLSEEGEPGGFSREHQPPPPPPLGGTLGARAPAAWAPASVLLERGVLALPPPLPGGAVPPAPRGSSASQEEQDEELDHILSPPPMPFRKCSNPDVASGPGKSLKYKRQLSEDGRQLRRGSLGGALTGRYLLPNPVAGQAWPASAETSNLVRMRSQALGQSAPSLTASLKELSLPRRGSFCRTSNRKSLIGNGQSPALPRPHSPLSAHAGNSPQDSPRNFSPSASAHFSFARRTDGRRWSLASLPSSGYGTNTPSSTVSSSCSSQEKLHQLPYQPTPDELHFLSKHFCTTESIATENRCRNTPMRPRSRSLSPGRSPACCDHEIIMMNHVYKERFPKATAQMEERLKEIITSYSPDNVLPLADGVLSFTHHQIIELARDCLDKSHQGLITSRYFLELQHKLDKLLQEAHDRSESGELAFIKQLVRKILIVIARPARLLECLEFDPEEFYYLLEAAEGHAKEGQGIKTDIPRYIISQLGLNKDPLEEMAHLGNYDSGTAETPETDESVSSSNASLKLRRKPRESDFETIKLISNGAYGAVYFVRHKESRQRFAMKKINKQNLILRNQIQQAFVERDILTFAENPFVVSMYCSFETRRHLCMVMEYVEGGDCATLMKNMGPLPVDMARMYFAETVLALEYLHNYGIVHRDLKPDNLLVTSMGHIKLTDFGLSKVGLMSMTTNLYEGHIEKDAREFLDKQVCGTPEYIAPEVILRQGYGKPVDWWAMGIILYEFLVGCVPFFGDTPEELFGQVISDEINWPEKDEAPPPDAQDLITLLLRQNPLERLGTGGAYEVKQHRFFRSLDWNSLLRQKAEFIPQLESEDDTSYFDTRSEKYHHMETEEEDDTNDEDFNVEIRQFSSCSHRFSKVFSSIDRITQNSAEEKEDSVDKTKSTTLPSTETLSWSSEYSEMQQLSTSNSSDTESNRHKLSSGLLPKLAISTEGEQDEAASCPGDPHEEPGKPALPPEECAQEEPEVTTPASTISSSTLSVGSFSEHLDQINGRSECVDSTDNSSKPSSEPASHMARQRLESTEKKKISGKVTKSLSASALSLMIPGDMFAVSPLGSPMSPHSLSSDPSSSRDSSPSRDSSAASASPHQPIVIHSSGKNYGFTIRAIRVYVGDSDIYTVHHIVWNVEEGSPACQAGLKAGDLITHINGEPVHGLVHTEVIELLLKSGNKVSITTTPFENTSIKTGPARRNSYKSRMVRRSKKSKKKESLERRRSLFKKLAKQPSPLLHTSRSFSCLNRSLSSGESLPGSPTHSLSPRSPTPSYRSTPDFPSGTNSSQSSSPSSSAPNSPAGSGHIRPSTLHGLAPKLGGQRYRSGRRKSAGNIPLSPLARTPSPTPQPTSPQRSPSPLLGHSLGNSKIAQAFPSKMHSPPTIVRHIVRPKSAEPPRSPLLKRVQSEEKLSPSYGSDKKHLCSRKHSLEVTQEEVQREQSQREAPLQSLDENVCDVPPLSRARPVEQGCLKRPVSRKVGRQESVDDLDRDKLKAKVVVKKADGFPEKQESHQKSHGPGSDLENFALFKLEEREKKVYPKAVERSSTFENKASMQEAPPLGSLLKDALHKQASVRASEGAMSDGRVPAEHRQGGGDFRRAPAPGTLQDGLCHSLDRGISGKGEGTEKSSQAKELLRCEKLDSKLANIDYLRKKMSLEDKEDNLCPVLKPKMTAGSHECLPGNPVRPTGGQQEPPPASESRAFVSSTHAAQMSAVSFVPLKALTGRVDSGTEKPGLVAPESPVRKSPSEYKLEGRSVSCLKPIEGTLDIALLSGPQASKTELPSPESAQSPSPSGDVRASVPPVLPSSSGKKNDTTSARELSPSSLKMNKSYLLEPWFLPPSRGLQNSPAVSLPDPEFKRDRKGPHPTARSPGTVMESNPQQREGSSPKHQDHTTDPKLLTCLGQNLHSPDLARPRCPLPPEASPSREKPGLRESSERGPPTARSERSAARADTCREPSMELCFPETAKTSDNSKNLLSVGRTHPDFYTQTQAMEKAWAPGGKTNHKDGPGEARPPPRDNSSLHSAGIPCEKELGKVRRGVEPKPEALLARRSLQPPGIESEKSEKLSSFPSLQKDGAKEPERKEQPLQRHPSSIPPPPLTAKDLSSPAARQHCSSPSHASGREPGAKPSTAEPSSSPQDPPKPVAAHSESSSHKPRPGPDPGPPKTKHPDRSLSSQKPSVGATKGKEPATQSLGGSSREGKGHSKSGPDVFPATPGSQNKASDGIGQGEGGPSVPLHTDRAPLDAKPQPTSGGRPLEVLEKPVHLPRPGHPGPSEPADQKLSAVGEKQTLSPKHPKPSTVKDCPTLCKQTDNRQTDKSPSQPAANTDRRAEGKKCTEALYAPAEGDKLEAGLSFVHSENRLKGAERPAAGVGKGFPEARGKGPGPQKPPTEADKPNGMKRSPSATGQSSFRSTALPEKSLSCSSSFPETRAGVREASAASSDTSSAKAAGGMLELPAPSNRDHRKAQPAGEGRTHMTKSDSLPSFRVSTLPLESHHPDPNTMGGASHRDRALSVTATVGETKGKDPAPAQPPPARKQNVGRDVTKPSPAPNTDRPISLSNEKDFVVRQRRGKESLRSSPHKKAL,mutated_sequence,1.0,2623.0,UPI000173A2B0.a2m,UPI000173A2B0.npy,gnomAD
+UPI0000D62424,UPI0000D62424.csv,MVDPVPEEEKAGAEPGDSGGDEAVASVPPDSQGAQEPAASSASASASAAVPRKAEVPCAAAEGGRREQSPLLHLDLFNFDCPEAEGSRYVLTSPRSLEACARCAVKPVELLPRALADLVREAPGRSMRVATGLYEAYEAERRAKLQQCRAERERIMREEKRRLFTPLSPAAAAAAAAAAASAPSAGSSSSCSSASLPASPAPRAARKASPSPSSARTQPPPAGSRTGRKSHSLDSLSRRREGALSSESGASSSSYSGESLRELRWPPRASARNSCPAGSASSTTNAPGRPSALTLVPITGRSFSLGDLSHSPQTAQHVERIVRQVRAERGLRGVPERDRKIAALMLARHQEELLLLEQRAAAHGQWELQRVHAKQRREREEREKQRALEQGRRAWAAQVEERRGRRGREEREAARRRQRQYERSEERRRELAERQGLLRRERAERAAREDRLRKLQQEQNLKQREEGLQEGRERAEQIRRERAQRAARAKQRQEGQLQREKRELSRAERARHEALLQGRTRQQRQEREGLRSSLEASLGRAQENYEHLVEQRTRELRERARREELQGRRAKEAAERKEREHQAHLEALARAGERRLQHATQVAEEAVQQKARRVGQSRLEKERAQRANKEKVERDEDCRRRELLQAIGRKLERSEQLTRERRSALESARSTARASFHVREKVREETNTRSFDRMVREAQLHASLDRK,mutated_sequence,1.0,707.0,UPI0000D62424.a2m,UPI0000D62424.npy,gnomAD
+UPI000022DC3D,UPI000022DC3D.csv,MGVQGFQEFLEKRCPGAVVPVDLLKLARTVSRQQQQQHLHRQLPPTAALAPGAPRAARGSVPLQPPLPPAALGAYSGGAGPIRHHHPAHHFHHHGQAQPGLHPPLPPPPPPQLPGARVLVDAGSALPRLYGGYQTDWVCGGQWNAMLGYLSALCQACAYPGGDGLELVVMFPGGLGKDRLAEWGRRCQAERQTAQLIVGHVGNKGTPPPRAWFLPPACLSHCVRLALIRFRVKVFQSLEDHHLEVVAFFRENGFHGLLAHDSEYALYNIPSYYSSHALKLSWNGKNLTTNQFLMQEVAKQLGLKRMNFPIFAALLGNHILPDEDLAAFHWSLLGPEHPLASLKVRAHQLVLPPCDVVIKAVSEYVSSIKDPSNLDVVGKDVFKQSQSRTEDKIERFKKAVEYYSVTTKLSSLPVGPSFLGFRNNRLGNPPLPRNQVGTISAGKPMFSHQVPQKVKYPPPFPVGPNSSLLFSSHALGESHAFSEDPMLQNSPFANWAVSYDSSASQFPNYLPSKASPPLGPDSSHSSSSDGDEPNGASSDHITEAFHHQPEWGNPNRDRGSWAQPVDTGVSEASLGDGEPHIPSLLSMSTRNHMDITIPPLPPVAPEVLRVAEHRHRRGLMYPYIYHVLTKGEIKIPVCIEDECNMELPPAALLFRSARQYVYGVLFSLAETQRKMERLAMRRRLPVEVPSVILKEWSAYKGKSPQTPELVSALTFREWTCPNLKKLWLGKAVEDKNRRMRAFLACMKSDTPSMLNPANVPTHLLLMCCVLRYMVQWPGGRILHRHELDTFLAQAVSTQLYEPDRLQELKIEKLDARGIQLAALFMSGVDTALFANDACGQPVPWEHCCPWIYFDGKLFQSKLIKAGRERVSLVELCDGQADLATKVEKMRQSILEGVNMNHPPPSALLPSPTFVPPMVPSLYPVSLYSRAMGSMPLPPQGRSRGFAGLHPIPPQGGKLEIAGMVVGQWAGSRSSRGRGSFGMQVVSVGGPGKGHGKEQTGRGSKGHKKGNKQGSSDGVSKSLELHQGRSRSQVNGNSGALIKEEKSDHRLPAPSQCALSRDSNECNNGNRYLPMNNREKNHLQEQKLETVAQRKED,mutated_sequence,1.0,1096.0,UPI000022DC3D.a2m,UPI000022DC3D.npy,gnomAD
+UPI000006DE38,UPI000006DE38.csv,MEPRAVAEAVETGEEDVIMEALRSYNQEHSQSFTFDDAQQEDRKRLAELLVSVLEQGLPPSHRVIWLQSVRILSRDRNCLDPFTSRQSLQALACYADISVSEGSVPESADMDVVLESLKCLCNLVLSSPVAQMLAAEARLVVKLTERVGLYRERSFPHDVQFFDLRLLFLLTALRTDVRQQLFQELKGVRLLTDTLELTLGVTPEGNPPPTLLPSQETERAMEILKVLFNITLDSIKGEVDEEDAALYRHLGTLLRHCVMIATAGDRTEEFHGHAVNLLGNLPLKCLDVLLTLEPHGDSTEFMGVNMDVIRALLIFLEKRLHKTHRLKESVAPVLSVLTECARMHRPARKFLKAQVLPPLRDVRTRPEVGEMLRNKLVRLMTHLDTDVKRVAAEFLFVLCSESVPRFIKYTGYGNAAGLLAARGLMAGGRPEGQYSEDEDTDTDEYKEAKASINPVTGRVEEKPPNPMEGMTEEQKEHEAMKLVTMFDKLSRNRVIQPMGMSPRGHLTSLQDAMCETMEQQLSSDPDSDPD,mutated_sequence,1.0,531.0,UPI000006DE38.a2m,UPI000006DE38.npy,gnomAD
+UPI0000033466,UPI0000033466.csv,MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDGLLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREVNEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSSAQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRCKCGRQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHMVCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNRYMNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAFYLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLLQHVHEGYEKDLWEYIED,mutated_sequence,1.0,557.0,UPI0000033466.a2m,UPI0000033466.npy,gnomAD
+UPI000020E761,UPI000020E761.csv,MEPGKRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAERKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEETVEIEKEETDLENARADAYTASCEDDFEDYEDDFEVCDGDDDESSNEPESREKLEELPLAQKKEIQEIQRAINAENERIGELSLKLFQKRGRTEFEKEPRTDTNSSPSRASVCGIFVDFASASHRQKSRTQALKQKMRSTKLLRLIDLDFSFTFSLLDLPPVNEYDMYIRNFGKKNTKQAYVQCNEDNVERDIQTEEIETREVWTQHPGESTVVSGGSEQRDTSDAVVMPKIDTPRLCSFLRAACQVMAVLLEEDRLAAEPSWNLRAQDRALYFSDSSSQLNTSLPFLQNRKVSSLHTSRVQRQMVVSVHDLPEKSFVPLLDSKYVLCVWDIWQPSGPQKVLICESQVTCCCLSPLKAFLLFAGTAHGSVVVWDLREDSRLHYSVTLSDGFWTFRTATFSTDGILTSVNHRSPLQAVEPISTSVHKKQSFVLSPFSTQEEMSGLSFHIASLDESGVLNVWVVVELPKADIAGSISDLGLMPGGRVKLVHSALIQLGDSLSHKGNEFWGTTQTLNVKFLPSDPNHFIIGTDMGLISHGTRQDLRVAPKLFKPQQHGIRPVKVNVIDFSPFGEPIFLAGCSDGSIRLHQLSSAFPLLQWDSSTDSHAVTGLQWSPTRPAVFLVQDDTSNIYIWDLLQSDLGPVAKQQVSPNRLVAMAAVGEPEKAGGSFLALVLARASGSIDIQHLKRRWAAPEVDECNRLRLLLQEALWPEGKLHK,mutated_sequence,1.0,1066.0,UPI000020E761.a2m,UPI000020E761.npy,gnomAD
+UPI000013F0A1,UPI000013F0A1.csv,MLSRLSGLANVVLHELSGDDDTDQNMRAPLDPELHQESDMEFNNTTQEDVQERLAYAEQLVVELKDIIRQKDVQLQQKDEALQEERKAADNKIKKLKLHAKAKLTSLNKYIEEMKAQGGTVLPTEPQSEEQLSKHDKSSTEEEMEIEKIKHKLQEKEELISTLQAQLTQAQAEQPAQSSTEMEEFVMMKQQLQEKEEFISTLQAQLSQTQAEQAAQQVVREKDARFETQVRLHEDELLQLVTQADVETEMQQKLRVLQRKLEEHEESLVGRAQVVDLLQQELTAAEQRNQILSQQLQQMEAEHNTLRNTVETEREESKILLEKMELEVAERKLSFHNLQEEMHHLLEQFEQAGQAQAELESRYSALEQKHKAEMEEKTSHILSLQKTGQELQSACDALKDQNSKLLQDKNEQAVQSAQTIQQLEDQLQQKSKEISQFLNRLPLQQHETASQTSFPDVYNEGTQAVTEENIASLQKRVVELENEKGALLLSSIELEELKAENEKLSSQITLLEAQNRTGEADREVSEISIVDIANKRSSSAEESGQDVLENTFSQKHKELSVLLLEMKEAQEEIAFLKLQLQGKRAEEADHEVLDQKEMKQMEGEGIAPIKMKVFLEDTGQDFPLMPNEESSLPAVEKEQASTEHQSRTSEEISLNDAGVELKSTKQDGDKSLSAVPDIGQCHQDELERLKSQILELELNFHKAQEIYEKNLDEKAKEISNLNQLIEEFKKNADNNSSAFTALSEERDQLLSQVKELSMVTELRAQVKQLEMNLAEAERQRRLDYESQTAHDNLLTEQIHSLSIEAKSKDVKIEVLQNELDDVQLQFSEQSTLIRSLQSQLQNKESEVLEGAERVRHISSKVEELSQALSQKELEITKMDQLLLEKKRDVETLQQTIEEKDQQVTEISFSMTEKMVQLNEEKFSLGVEIKTLKEQLNLLSRAEEAKKEQVEEDNEVSSGLKQNYDEMSPAGQISKEELQHEFDLLKKENEQRKRKLQAALINRKELLQRVSRLEEELANLKDESKKEIPLSETERGEVEEDKENKEYSEKCVTSKCQEIEIYLKQTISEKEVELQHIRKDLEEKLAAEEQFQALVKQMNQTLQDKTNQIDLLQAEISENQAIIQKLITSNTDASDGDSVALVKETVVISPPCTGSSEHWKPELEEKILALEKEKEQLQKKLQEALTSRKAILKKAQEKERHLREELKQQKDDYNRLQEQFDEQSKENENIGDQLRQLQIQVRESIDGKLPSTDQQESCSSTPGLEEPLFKATEQHHTQPVLESNLCPDWPSHSEDASALQGGTSVAQIKAQLKEIEAEKVELELKVSSTTSELTKKSEEVFQLQEQINKQGLEIESLKTVSHEAEVHAESLQQKLESSQLQIAGLEHLRELQPKLDELQKLISKKEEDVSYLSGQLSEKEAALTKIQTEIIEQEDLIKALHTQLEMQAKEHDERIKQLQVELCEMKQKPEEIGEESRAKQQIQRKLQAALISRKEALKENKSLQEELSLARGTIERLTKSLADVESQVSAQNKEKDTVLGRLALLQEERDKLITEMDRSLLENQSLSSSCESLKLALEGLTEDKEKLVKEIESLKSSKIAESTEWQEKHKELQKEYEILLQSYENVSNEAERIQHVVEAVRQEKQELYGKLRSTEANKKETEKQLQEAEQEMEEMKEKMRKFAKSKQQKILELEEENDRLRAEVHPAGDTAKECMETLLSSNASMKEELERVKMEYETLSKKFQSLMSEKDSLSEEVQDLKHQIEGNVSKQANLEATEKHDNQTNVTEEGTQSIPGETEEQDSLSMSTRPTCSESVPSAKSANPAVSKDFSSHDEINNYLQQIDQLKERIAGLEEEKQKNKEFSQTLENEKNTLLSQISTKDGELKMLQEEVTKMNLLNQQIQEELSRVTKLKETAEEEKDDLEERLMNQLAELNGSIGNYCQDVTDAQIKNELLESEMKNLKKCVSELEEEKQQLVKEKTKVESEIRKEYLEKIQGAQKEPGNKSHAKELQELLKEKQQEVKQLQKDCIRYQEKISALERTVKALEFVQTESQKDLEITKENLAQAVEHRKKAQAELASFKVLLDDTQSEAARVLADNLKLKKELQSNKESVKSQMKQKDEDLERRLEQAEEKHLKEKKNMQEKLDALRREKVHLEETIGEIQVTLNKKDKEVQQLQENLDSTVTQLAAFTKSMSSLQDDRDRVIDEAKKWERKFSDAIQSKEEEIRLKEDNCSVLKDQLRQMSIHMEELKINISRLEHDKQIWESKAQTEVQLQQKVCDTLQGENKELLSQLEETRHLYHSSQNELAKLESELKSLKDQLTDLSNSLEKCKEQKGNLEGIIRQQEADIQNSKFSYEQLETDLQASRELTSRLHEEINMKEQKIISLLSGKEEAIQVAIAELRQQHDKEIKELENLLSQEEEENIVLEEENKKAVDKTNQLMETLKTIKKENIQQKAQLDSFVKSMSSLQNDRDRIVGDYQQLEERHLSIILEKDQLIQEAAAENNKLKEEIRGLRSHMDDLNSENAKLDAELIQYREDLNQVITIKDSQQKQLLEVQLQQNKELENKYAKLEEKLKESEEANEDLRRSFNALQEEKQDLSKEIESLKVSISQLTRQVTALQEEGTLGLYHAQLKVKEEEVHRLSALFSSSQKRIAELEEELVCVQKEAAKKVGEIEDKLKKELKHLHHDAGIMRNETETAEERVAELARDLVEMEQKLLMVTKENKGLTAQIQSFGRSMSSLQNSRDHANEELDELKRKYDASLKELAQLKEQGLLNRERDALLSETAFSMNSTEENSLSHLEKLNQQLLSKDEQLLHLSSQLEDSYNQVQSFSKAMASLQNERDHLWNELEKFRKSEEGKQRSAAQPSTSPAEVQSLKKAMSSLQNDRDRLLKELKNLQQQYLQINQEITELHPLKAQLQEYQDKTKAFQIMQEELRQENLSWQHELHQLRMEKSSWEIHERRMKEQYLMAISDKDQQLSHLQNLIRELRSSSSQTQPLKVQYQRQASPETSASPDGSQNLVYETELLRTQLNDSLKEIHQKELRIQQLNSNFSQLLEEKNTLSIQLCDTSQSLRENQQHYGDLLNHCAVLEKQVQELQAGPLNIDVAPGAPQEKNGVHRKSDPEELREPQQSFSEAQQQLCNTRQEVNELRKLLEEERDQRVAAENALSVAEEQIRRLEHSEWDSSRTPIIGSCGTQEQALLIDLTSNSCRRTRSGVGWKRVLRSLCHSRTRVPLLAAIYFLMIHVLLILCFTGHL,mutated_sequence,1.0,3259.0,UPI000013F0A1.a2m,UPI000013F0A1.npy,gnomAD
+UPI0000129A56,UPI0000129A56.csv,MLRVFILYAENVHTPDTDISDAYCSAVFAGVKKRTKVIKNSVNPVWNEGFEWDLKGIPLDQGSELHVVVKDHETMGRNRFLGEAKVPLREVLATPSLSASFNAPLLDTKKQPTGASLVLQVSYTPLPGAVPLFPPPTPLEPSPTLPDLDVVADTGGEEDTEDQGLTGDEAEPFLDQSGGPGAPTTPRKLPSRPPPHYPGIKRKRSAPTSRKLLSDKPQDFQIRVQVIEGRQLPGVNIKPVVKVTAAGQTKRTRIHKGNSPLFNETLFFNLFDSPGELFDEPIFITVVDSRSLRTDALLGEFRMDVGTIYREPRHAYLRKWLLLSDPDDFSAGARGYLKTSLCVLGPGDEAPLERKDPSEDKEDIESNLLRPTGVALRGAHFCLKVFRAEDLPQMDDAVMDNVKQIFGFESNKKNLVDPFVEVSFAGKMLCSKILEKTANPQWNQNITLPAMFPSMCEKMRIRIIDWDRLTHNDIVATTYLSMSKISAPGGEIEEEPAGAVKPSKASDLDDYLGFLPTFGPCYINLYGSPREFTGFPDPYTELNTGKGEGVAYRGRLLLSLETKLVEHSEQKVEDLPADDILRVEKYLRRRKYSLFAAFYSATMLQDVDDAIQFEVSIGNYGNKFDMTCLPLASTTQYSRAVFDGCHYYYLPWGNVKPVVVLSSYWEDISHRIETQNQLLGIADRLEAGLEQVHLALKAQCSTEDVDSLVAQLTDELIAGCSQPLGDIHETPSATHLDQYLYQLRTHHLSQITEAALALKLGHSELPAALEQAEDWLLRLRALAEEPQNSLPDIVIWMLQGDKRVAYQRVPAHQVLFSRRGANYCGKNCGKLQTIFLKYPMEKVPGARMPVQIRVKLWFGLSVDEKEFNQFAEGKLSVFAETYENETKLALVGNWGTTGLTYPKFSDVTGKIKLPKDSFRPSAGWTWAGDWFVCPEKTLLHDMDAGHLSFVEEVFENQTRLPGGQWIYMSDNYTDVNGEKVLPKDDIECPLGWKWEDEEWSTDLNRAVDEQGWEYSITIPPERKPKHWVPAEKMYYTHRRRRWVRLRRRDLSQMEALKRHRQAEAEGEGWEYASLFGWKFHLEYRKTDAFRRRRWRRRMEPLEKTGPAAVFALEGALGGVMDDKSEDSMSVSTLSFGVNRPTISCIFDYGNRYHLRCYMYQARDLAAMDKDSFSDPYAIVSFLHQSQKTVVVKNTLNPTWDQTLIFYEIEIFGEPATVAEQPPSIVVELYDHDTYGADEFMGRCICQPSLERMPRLAWFPLTRGSQPSGELLASFELIQREKPAIHHIPGFEVQETSRILDESEDTDLPYPPPQREANIYMVPQNIKPALQRTAIEILAWGLRNMKSYQLANISSPSLVVECGGQTVQSCVIRNLRKNPNFDICTLFMEVMLPREELYCPPITVKVIDNRQFGRRPVVGQCTIRSLESFLCDPYSAESPSPQGGPDDVSLLSPGEDVLIDIDDKEPLIPIQEEEFIDWWSKFFASIGEREKCGSYLEKDFDTLKVYDTQLENVEAFEGLSDFCNTFKLYRGKTQEETEDPSVIGEFKGLFKIYPLPEDPAIPMPPRQFHQLAAQGPQECLVRIYIVRAFGLQPKDPNGKCDPYIKISIGKKSVSDQDNYIPCTLEPVFGKMFELTCTLPLEKDLKITLYDYDLLSKDEKIGETVVDLENRLLSKFGARCGLPQTYCVSGPNQWRDQLRPSQLLHLFCQQHRVKAPVYRTDRVMFQDKEYSIEEIEAGRIPNPHLGPVEERLALHVLQQQGLVPEHVESRPLYSPLQPDIEQGKLQMWVDLFPKALGRPGPPFNITPRRARRFFLRCIIWNTRDVILDDLSLTGEKMSDIYVKGWMIGFEEHKQKTDVHYRSLGGEGNFNWRFIFPFDYLPAEQVCTIAKKDAFWRLDKTESKIPARVVFQIWDNDKFSFDDFLGSLQLDLNRMPKPAKTAKKCSLDQLDDAFHPEWFVSLFEQKTVKGWWPCVAEEGEKKILAGKLEMTLEIVAESEHEERPAGQGRDEPNMNPKLEDPRRPDTSFLWFTSPYKTMKFILWRRFRWAIILFIILFILLLFLAIFIYAFPNYAAMKLVKPFS,mutated_sequence,1.0,2080.0,UPI0000129A56.a2m,UPI0000129A56.npy,gnomAD
+UPI000013E95E,UPI000013E95E.csv,MYGRPQAEMEQEAGELSRWQAAHQAAQDNENSAPILNMSSSSGSSGVHTSWNQGLPSIQHFPHSAEMLGSPLVSVEAPGQNVNEGGPQFSMPLPERGMSYCPQATLTPSRMIYCQRMSPPQQEMTIFSGPQLMPVGEPNIPRVARPFGGNLRMPPNGLPVSASTGIPIMSHTGNPPVPYPGLSTVPSDETLLGPTVPSTEAQAVLPSMAQMLPPQDAHDLGMPPAESQSLLVLGSQDSLVSQPDSQEGPFLPEQPGPAPQTVEKNSRPQEGTGRRGSSEARPYCCNYENCGKAYTKRSHLVSHQRKHTGERPYSCNWESCSWSFFRSDELRRHMRVHTRYRPYKCDQCSREFMRSDHLKQHQKTHRPGPSDPQANNNNGEQDSPPAAGP,mutated_sequence,1.0,389.0,UPI000013E95E.a2m,UPI000013E95E.npy,gnomAD
+UPI0000EA87E6,UPI0000EA87E6.csv,MEPPGGSLGPGRGTRDKKKGRSPDELPSAGGDGGKSKKFTLKRLMADELERFTSMRIKKEKEKPNSAHRNSSASYGDDPTAQSLQDVSDEQVLVLFEQMLLDMNLNEEKQQPLREKDIIIKREMVSQYLYTSKAGMSQKESSKSAMMYIQELRSGLRDMPLLSCLESLRVSLNNNPVSWVQTFGAEGLASLLDILKRLHDEKEETAGSYDSRNKHEIIRCLKAFMNNKFGIKTMLETEEGILLLVRAMDPAVPNMMIDAAKLLSALCILPQPEDMNERVLEAMTERAEMDEVERFQPLLDGLKSGTTIALKVGCLQLINALITPAEELDFRVHIRSELMRLGLHQVLQDLREIENEDMRVQLNVFDEQGEEDSYDLKGRLDDIRMEMDDFNEVFQILLNTVKDSKAEPHFLSILQHLLLVRNDYEARPQYYKLIEECISQIVLHKNGADPDFKCRHLQIEIEGLIDQMIDKTKVEKSEAKAAELEKKLDSELTARHELQVEMKKMESDFEQKLQDLQGEKDALHSEKQQIATEKQDLEAEVSQLTGEVAKLTKELEDAKKEMASLSAAAITVPPSVPSRAPVPPAPPLPGDSGTIIPPPPAPGDSTTPPPPPPPPPPPPPLPGGVCISSPPSLPGGTAISPPPPLSGDATIPPPPPLPEGVGIPSPSSLPGGTAIPPPPPLPGSARIPPPPPPLPGSAGIPPPPPPLPGEAGMPPPPPPLPGGPGIPPPPPFPGGPGIPPPPPGMGMPPPPPFGFGVPAAPVLPFGLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDRFENNELFAKLTLTFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFGVVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSESFSNLLEITLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKFPDELAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSFVKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQSGAAFRRKRGPRQANRKAGCAVTSLLASELTKDDAMAAVPAKVSKNSETFPTILEEAKELVGRAS,mutated_sequence,1.0,1272.0,UPI0000EA87E6.a2m,UPI0000EA87E6.npy,gnomAD
+UPI000020E4D9,UPI000020E4D9.csv,MASLSRPSLPSCLCSFLLLLLLQVSSSYAGQFRVIGPRHPIRALVGDEVELPCRISPGKNATGMEVGWYRPPFSRVVHLYRNGKDQDGDQAPEYRGRTELLKDAIGEGKVTLRIRNVRFSDEGGFTCFFRDHSYQEEAAMELKVEDPFYWVSPGVLVLLAVLPVLLLQITVGLIFLCLQYRLRGKLRAEIENLHRTFDPHFLRVPCWKITLFVIVPVLGPLVALIICYNWLHRRLAGQFLEELRNPF,mutated_sequence,1.0,247.0,UPI000020E4D9.a2m,UPI000020E4D9.npy,gnomAD
+UPI0000072D37,UPI0000072D37.csv,MKPTSGPEEARRPASDIRVFASNCSMHGLGHVFGPGSLSLRRGMWAAAVVLSVATFLYQVAERVRYYREFHHQTALDERESHRLIFPAVTLCNINPLRRSRLTPNDLHWAGSALLGLDPAEHAAFLRALGRPPAPPGFMPSPTFDMAQLYARAGHSLDDMLLDCRFRGQPCGPENFTTIFTRMGKCYTFNSGADGAELLTTTRGGMGNGLDIMLDVQQEEYLPVWRDNEETPFEVGIRVQIHSQEEPPIIDQLGLGVSPGYQTFVSCQQQQLSFLPPPWGDCSSASLNPNYEPEPSDPLGSPSPSPSPPYTLMGCRLACETRYVARKCGCRMVYMPGDVPVCSPQQYKNCAHPAIDAMLRKDSCACPNPCASTRYAKELSMVRIPSRAAARFLARKLNRSEAYIAENVLALDIFFEALNYETVEQKKAYEMSELLGDIGGQMGLFIGASLLTILEILDYLCEVFRDKVLGYFWNRQHSQRHSSTNLLQEGLGSHRTQVPHLSLGPRPPTPPCAVTKTLSASHRTCYLVTQL,mutated_sequence,1.0,531.0,UPI0000072D37.a2m,UPI0000072D37.npy,gnomAD
+UPI000059D6C5,UPI000059D6C5.csv,MNVMLENYKNLVFLAGIAVSKQDPITSLEQEKEPWNMKICEMVDESPAMCSSFTRDLWPEQDIKDSFQQVILRRHGKCEHENLQLRKGSANVVECKVYKKGYELNQCLTTTQSKIFPCDKYIKVFHKIFNSNRHKTRHTGEKPFKCKKCDESFCMLLHLHQHKRIHIRENSYQCEECDKVFKRFSTLTRHKRVHTGEKPFKCEECGKAFKHSSTLTTHKMIHTGEKPYRCEECGKAFYHSSHLTTHKVIHTGEKPFKCEECGKAFNHPSALTTHKFIHVKEKPYKCEECDKAFNRFSYLTKHKIIHSGEKSYKCEQCGKGFNWSSTLTKHKRIHTGEKPYKCEECGKAFNVSSHLTTHKMIHTGEKPYKCEECGKAFNHSSKLTIHKIIHTGEKPYKCEECGKAFNQSSNLTKHKIIHTGEKLYKCEECGKAFNRSSNLTTHKRIHTGEKPYKCEECGKAFNRSSNLTKHNIIHTGEKSYKCEECGKAFNQSSTLTKHRKIQQGMVAHACNPNTLRGLGEQIARSGVQDQPGQHGKTPSLLKIQKFAGCGGRRL,mutated_sequence,1.0,554.0,UPI000059D6C5.a2m,UPI000059D6C5.npy,gnomAD
+UPI00001D7B9E,UPI00001D7B9E.csv,MIEPSEDSFETMMEHKNPSSKQMESSEGSSNTTEATSGSGVRGEAGPASGPAQEKKEPPSGPLQEMEELPTDLLQDMEEPSSGPRKEIEDPPNDLLQDLEESCNGSHQARGDPLSGASDRMKEASVNPSGAREEQEAHTDLKESGREETPQEQNQTEHSTAELMAMVRSIISLYFRMQDLKEQQRVAEEILIKGINAGQLPAPKHFSGDRREFHEFIVLCQLTLQSYPRMFYNDRLRVGYVINHLSGLALEWAKALLQENSPLIGDFPAFLEAMSEVFEYRQALRVAEEAMFTIRQGGRSATEYIDEFQSLVPILGWPDEVLQAHLCQGLNEEIRHYLFRVPQPDSLDSLIVLILQIEEKLAERRAMLRLPPEARPRNLTWIDSPAPERWMVSSWLPSEVHPDINRAHLFLLLMVRVNPYHSVAVQALVDSGADGNFMDEKFAQEHYVELYEKPYPQPVQSVDGSLIGNEPVWLYTEPLVCIHQNHQESIEFDIVPSPNFSVVLGIRWLRVHAPEVDWIKGRCTFHSPYCLKNCFRPPPPCIALERHGMSLLPGLPHPYSDLADVFNPKEADDETSDQPSSDGSDDLSESEPSELQQAGDSDHSETFYECPSTAPWEPVGARMQERARLQEEYWDLQDMLTNRQDYIQMIPELFDQLHGAEWFTKLELRGTIVEESVNGHRTEDVWKAAFGLELEEMKSYQPFALSPDPIIPQNVIHFILKDMLGFFVLSYGQEVLIYSMSQEEHLHHVRQVLVRFRHHNVYCSLDKSQFHRQTVEFLGFVVTPKGVKLNKNVMTIITGYPTPGSKLSLRNFIEFVFPYRHFVERFSIIAEPLVRQLLSSYQFYWGVEEQEAFECLKRAFRKAPLLHHPKPQNPFYLETGVTGTALHASLIQIDDQTGKRACCAFYSRNISPIEVEYSQAEMKILPIRAAFMVWCRYLENTEEPIMILLNTEDLASLNNDRLTVLLPGHWVFFFSHFNFDVMELPEQDGGRALPPVRNLRWRRAFQRNTAARQTLLLASRGFPRDPSTESGEEENEEQDELNEQILRQELLAMIPIDQILNSFLAHFSMAQIRAVILHFFRGLLYWKNTLALAAILVLLRVRQCLSLRPAPAMRVARPQPQRSLRLILDSSLIAGSSITTAITQLLTQMPALVGANTIPAQELAELFLGPGRWQRNALHSQAHRGLQFTPGFWLTLCEFFGVRVTPQEGHLPALRQNRYLELHVVGDEDVVLREALQDDLQRYRQCGLHDGLQDTSQDKQDNDVQEAPPSHTAATHPPRPRHLMDPQVLEFLGSRLLHIHSADGQLHLLSREQAARALSQFLTLIYRRALPIPAWESQPREQARLEELPDEDEDANLD,mutated_sequence,1.0,1358.0,UPI00001D7B9E.a2m,UPI00001D7B9E.npy,gnomAD
+UPI0000D61A6B,UPI0000D61A6B.csv,MHQPPESTAAAAAAADISARKMAHPAMFPRRGSGSGSASALNAAGTGVGSNATSSEDFPPPSLLQPPPPAASSTSGPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQISASISSNNSIAEDTESYDDLDESHTEDLSSSEILDVSLSRATDLGEPERSSSEETLNNFQEAETPGAVSPNQPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPSHVAVASASITGGPPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTGGIGINSVTGTSTVNNVNITAVGSFNPNVTSSMLGNVNISTSNIPSAAGVSVGPGVTSGVNVNILSGMGNGTISSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLDSSSEPFKKGRWTCTEFYEKENAVPATEGVLINKVVETVKQNPIEVTSERESTSGSSVSSSVSTLSHYTESVGSGEMGAPTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSISQSQISQVQLQSQELSYQQKQGLQPVPLQATMSAATGIQPSPVNVVGVTSALGQQPSISSLAQPQLPYSQAAPPVQTPLPGAPPPQQLQYGQQQPMVSTQMAPGHVKSVTQNPASEYVQQQPILQTAMSSGQPSSAGVGAGTTVIPVAQPQGIQLPVQPTAVPAQPAGASVQPVGQAPAAVSAVPTGSQIANIGQQANIPTAVQQPSTQVPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTSAPSLPQQLVIASQSSLLTVPPQPQGVEPVAQGIVSQQLPAVSSLPSASSISVTSQVSSTGPSGMPSAPTNLVPPQNIAQTPATQNGNLVQSVSQPPLIATNTNLPLAQQIPLSSTQFSAQSLAQAIGSQIEDARRAAEPSLVGLPQTISGDSGGMSAVSDGSSSSLAASASLFPLKVLPLTTPLVDGEDESSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA,mutated_sequence,1.0,1073.0,UPI0000D61A6B.a2m,UPI0000D61A6B.npy,gnomAD
+UPI000041A715,UPI000041A715.csv,MDDDSLDELVARSPGPDGHPQVGPADPAGDFEESSVGSSGDSGDDSDSEHGDGTDGEDEGASEEEDLEDRSGSEDSEDDGETLLEVAGTQGKLEAAGSFNSDDDAESCPICLNAFRDQAVGTPENCAHYFCLDCIVEWSKNANSCPVDRTLFKCICIRAQFGGKILRKIPVENTKASEEEEDPTFCEVCGRSDREDRLLLCDGCDAGYHMECLDPPLQEVPVDEWFCPECAAPGVVLAADAGPVSEEEVSLLLADVVPTTSRLRPRAGRTRAIARTRQSERVRATVNRNRISTARRVQHTPGRLGSSLLDEAIEAVATGLSTAVYQRPLTPRTPARRKRKTRRRKKVPGRKKTPSGPSAKSKSSATRSKKRQHRVKKRRGKKVKSEATTRSRIARTLGLRRPVHSSCIPSVLKPVEPSLGLLRADIGAASLSLFGDPYELDPFDSSEELSANPLSPLSAKRRALSRSALQSHQPVARPVSVGLSRRRLPAAVPEPDLEEEPVPDLLGSILSGQSLLMLGSSDVIIHRDGSLSAKRAAPVSFQRNSGSLSRGEEGFKGCLQPRALPSGSPAQGPSGNRPQSTGLSCQGRSRTPARTAGAPVRLDLPAAPGAVQARNLSNGSVPGFRQSHSPWFNGTNKHTLPLASAASKISSRDSKPPCRSVVPGPPLKPAPRRTDISELPRIPKIRRDDGGGRRDAAPAHGQSIEIPSACISRLTGREGTGQPGRGTRAESEASSRVPREPGVHTGSSRPPAPSSHGSLAPLGPSRGKGVGSTFESFRINIPGNMAHSSQLSSPGFCNTFRPVDDKEQRKENPSPLFSIKKTKQLRSEVYDPSDPTGSDSSAPGSSPERSGPGLLPSEITRTISINSPKAQTVQAVRCVTSYTVESIFGTEPEPPLGPSSAMSKLRGAVAAEGASDTEREEPTESQGLAARLRRPSPPEPWDEEDGASCSTFFGSEERTVTCVTVVEPEAPPSPDVLQAATHRVVELRPPSRSRSTSSSRSRKKAKRKRVSREHGRTRSGTRSESRDRSSRSASPSVGEERPRRQRSKAKSRRSSSDRSSSRERAKRKKAKDKSREHRRGPWGHSRRTSRSRSGSPGSSSYEHYESRKKKKRRSASRPRGRECSPTSSLERLCRHKHQRERSHERPDRKESVAWPRDRRKRRSRSPSSEHRAREHRRPRSREKWPQTRSHSPERKGAVREASPAPLAQGEPGREDLPTRLPALGEAHVSPEVATADKAPLQAPPVLEVAAECEPDDLDLDYGDSVEAGHVFDDFSSDAVFIQLDDMSSPPSPESTDSSPERDFPLKPALPPASLAVAAIQREVSLMHDEDPSQPPPLPEGTQEPHLLRPDAAEKAEAPSSPDVAPAGKEDSPSASGRVQEAARPEEVVSQTPLLRSRALVKRVTWNLQESESSAPAEDRAPRAPLHRPQKPREGAWDMEDVAPTGVRQVFSELPFPSHVLPEPGFPDTDPSQVYSPGLPPAPAQPSSIPPCALVSQPTVQFILQGSLPLVGCGAAQTLAPVPAALTPASEPASQATAASNSEEKTPAPRLAAEKTKKEEYMKKLHMQERAVEEVKLAIKPFYQKREVTKEEYKDILRKAVQKICHSKSGEINPVKVANLVKAYVDKYRHMRRHKKPEAGEEPPTQGAEG,mutated_sequence,1.0,1649.0,UPI000041A715.a2m,UPI000041A715.npy,gnomAD
+UPI0000001BCE,UPI0000001BCE.csv,MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQDKVLITELIQHSAKHKVDIDCSQKVVVYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHLLAGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAKASNGCVLVHCLAGISRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKNQTGASGPKSKLKLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASVPSVPSVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYSASMAASLHGFSSSEDALEYYKPSTTLDGTNKLCQFSPVQELSEQTPETSPDKEEASIPKKLQTARPSDSQSKRLHSVRTSSSGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDILAPQTSTPSLTSSWYFATESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSRRSWHEESPFEKQFKRRSCQMEFGESIMSENRSREELGKVGSQSSFSGSMEIIEVS,mutated_sequence,1.0,665.0,UPI0000001BCE.a2m,UPI0000001BCE.npy,gnomAD
+UPI0001B79057,UPI0001B79057.csv,MALPQGLLTFRDVAIEFSQEEWKCLDPAQRTLYRDVMLENYRNLVSLGSCCAVEVGLDLLGSRHPPRPPKKLGLQAPTPVPNYWLLFLNFLHMCVLRANMSLLWTTSSWFLLIESVFAF,mutated_sequence,1.0,119.0,UPI0001B79057.a2m,UPI0001B79057.npy,gnomAD
+UPI000006D1F4,UPI000006D1F4.csv,MKLNERSLAFYATCDAPVDNAGFLYKKGGRHAAYHRRWFVLRGNMLFYFEDAASREPVGVIILEGCTVELVEAAEEFAFAVRFAGTRARTYVLAAESQDAMEGWVKALSRASFDYLRLVVRELEQQLAAVRGGGGMALPQPQPQSLPLPPSLPSALAPVPSLPSAPAPVPALPLPRRPSALPPKENGCAVWSTEATFRPGPEPPPPPPRRRASAPHGPLDMAPFARLHECYGQEIRALRGQWLSSRVQP,mutated_sequence,1.0,249.0,UPI000006D1F4.a2m,UPI000006D1F4.npy,gnomAD
+UPI00015FA087,UPI00015FA087.csv,MGNQDGKLKRSAGDALHEGGGGAEDALGPRDVEATKKGSGGKKALGKHGKGGGGGGGGGESGKKKSKSDSRASVFSNLRIRKNLSKGKGAGGSREDVLDSQALQTGELDSAHSLLTKTPDLSLSADEAGLSDTECADPFEVTGPGGPGPAEARVGGRPIAEDVETAAGAQDGQRTSSGSDTDIYSFHSATEQEDLLSDIQQAIRLQQQQQQQLQLQLQQQQQQQQLQGAEEPAAPPTAVSPQPGAFLGLDRFLLGPSGGAGEAPGSPDTEQALSALSDLPESLAAEPREPQQPPSPGGLPVSEAPSLPAAQPAAKDSPSSTAFPFPEAGPGEEAAGAPVRGAGDTDEEGEEDAFEDAPRGSPGEEWAPEVGEDAPQRLGEEPEEEAQGPDAPAAASLPGSPAPSQRCFKPYPLITPCYIKTTTRQLSSPNHSPSQSPNQSPRIKRRPEPSLSRGSRTALASVAAPAKKHRADGGLAAGLSRSADWTEELGARTPRVGGSAHLLERGVASDSGGGVSPALAAKASGAPAAADGFQNVFTGRTLLEKLFSQQENGPPEEAEKFCSRIIAMGLLLPFSDCFREPCNQNAQTNAASFDQDQLYTWAAVSQPTHSLDYSEGQFPRRVPSMGPPSKPPDEEHRLEDAETESQSAVSETPQKRSDAVQKEVVDMKSEGQATVIQQLEQTIEDLRTKIAELERQYPALDTEVASGHQGLENGVTASGDVCLEALRLEEKEVRHHRILEAKSIQTSPTEEGGVLTLPPVDGLPGRPPCPPGAESGPQTKFCSEISLIVSPRRISVQLDSHQPTQSISQPPPPPSLLWSAGQGQPGSQPPHSISTEFQTSHEHSVSSAFKNSCNIPSPPPLPCTESSSSMPGLGMVPPPPPPLPGMTVPTLPSTAIPQPPPLQGTEMLPPPPPPLPGAGIPPPPPLPGAGILPLPPLPGAGIPPPPPLPGAAIPPPPPLPGAGIPLPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGVGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPRVGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGVGIPPPPPLPGVGIPPPPPLPGAGIPPPPPLPGMGIPPAPAPPLPPPGTGIPPPPLLPVSGPPLLPQVGSSTLPTPQVCGFLPPPLPSGLFGLGMNQDKGSRKQPIEPCRPMKPLYWTRIQLHSKRDSSTSLIWEKIEEPSIDCHEFEELFSKTAVKERKKPISDTISKTKAKQVVKLLSNKRSQAVGILMSSLHLDMKDIQHAVVNLDNSVVDLETLQALYENRAQSDELEKIEKHGRSSKDKENAKSLDKPEQFLYELSLIPNFSERVFCILFQSTFSESICSIRRKLELLQKLCETLKNGPGVMQVLGLVLAFGNYMNGGNKTRGQADGFGLDILPKLKDVKSSDNSRSLLSYIVSYYLRNFDEDAGKEQCLFPLPEPQDLFQASQMKFEDFQKDLRKLKKDLKACEVEAGKVYQVSSKEHMQPFKENMEQFIIQAKIDQEAEENSLTETHKCFLETTAYFFMKPKLGEKEVSPNAFFSIWHEFSSDFKDFWKKENKLLLQERVKEAEEVCRQKKGKSLYKIKPRHDSGIKAKISMKT,mutated_sequence,1.0,1722.0,UPI00015FA087.a2m,UPI00015FA087.npy,gnomAD
+UPI000047099D,UPI000047099D.csv,MHGAARAPATSVSADCCIPAGLRLGPVPGTFKLGKYLSDRREPGPKKKVRMVRGELVDESGGSPLEWIGLIRAARNSQEQTLEAIADLPGGQIFYRALRDVQPGEELTVWYSNSLAQWFDIPTTATPTHDEKGEERYICWYCWRTFRYPNSLKAHLRFHCVFSGGGGGAFLHHEHAARQGAVPAADGLGLSPKPPAPDFAAPSQAGTLRPHPLGPPPVQACGAREGIKREASSAPSATSPTPGKWGQPKKGKEQLDRALDMSGAARGQGHFLGIVGGSSAGVGSLAFYPGVRSAFKPAGLARAAAAAHGDPYREESSSKQGAGLALGRLLGGGRACGRPGSGENSAAGGAGHHHHHHAHHHHHPKCLLAGDPPPPPPPGLPCSGALRGFPLLSVPPEEASAFKHVERAPPAAAALPGARYAQLPPAPGLPLERCALPPLDPGGLKAYPGGECSHLPAVMPAFTVYNGELLYGSPATTAYYPLKLHFGGLLKYPESISYFSGPAAAALSPAELGSLASIDREIAMHNQQLSEMAAGKGRGRLDSGTLPPAVAAAGGTGGGGSGGSGAGKPKTGHLCLYCGKLYSRKYGLKIHMRTHTGYKPLKCKVCLRPFGDPSNLNKHIRLHAEGNTPYRCEFCGKVLVRRRDLERHVKSRHPGQSLLAKAGDGPGAEPGYPPEPGDPKSDDSDVDVCFTDDQSDPEVGGGGERDL,mutated_sequence,1.0,707.0,UPI000047099D.a2m,UPI000047099D.npy,gnomAD
+UPI000000D866,UPI000000D866.csv,MALGLEQAEEQRLYQQTLLQDGLKDMLDHGKFLDCVVRAGEREFPCHRLVLAACSPYFRARFLAEPERAGELHLEEVSPDVVAQVLHYLYTSEIALDEASVQDLFAAAHRFQIPSIFTICVSFLQKRLCLSNCLAVFRLGLLLDCARLAVAARDFICAHFTLVARDADFLGLSADELIAIISSDGLNVEKEEAVFEAVMRWAGSGDAEAQAERQRALPTVFESVRCRLLPRAFLESRVERHPLVRAQPELLRKVQMVKDAHEGRITTLRKKKKGKDGAGAKEADKGTSKAKAEEDEEAERILPGILNDTLRFGMFLQDLIFMISEEGAVAYDPAANECYCASLSNQVPKNHVSLVTKENQVFVAGGLFYNEDNKEDPMSAYFLQFDHLDSEWLGMPPLPSPRCLFGLGEALNSIYVVGGREIKDGERCLDSVMCYDRLSFKWGESDPLPYVVYGHTVLSHMDLVYVIGGKGSDRKCLNKMCVYDPKKFEWKELAPMQTARSLFGATVHDGRIIVAAGVTDTGLTSSAEVYSITDNKWAPFEAFPQERSSLSLVSLVGTLYAIGGFATLETESGELVPTELNDIWRYNEEEKKWEGVLREIAYAAGATFLPVRLNVLCLTKM,mutated_sequence,1.0,621.0,UPI000000D866.a2m,UPI000000D866.npy,gnomAD
+UPI0000DBEE85,UPI0000DBEE85.csv,MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAADQVDPGAAAAAAAAAAAAPPAPPAPAFPPQLPPHIATEIDRRKKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDFDDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASIPTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKCSTQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVALITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKELEGLINTKRKRLKYFAKLWTMHNAQDLQKWTEEFNMEETQTSLLH,mutated_sequence,1.0,1137.0,UPI0000DBEE85.a2m,UPI0000DBEE85.npy,gnomAD
+UPI000000DCCB,UPI000000DCCB.csv,MSASKIPLFKMKDLILILCLLEMSFAVPFFPQQSGTPGMASLSLETMRQLGSLQRLNTLSQYSRYGFGKSFNSLWMHGLLPPHSSLPWMRPREHETQQYEYSLPVHPPPLPSQPSLKPQQPGLKPFLQSAAATTNQATALKEALQPPIHLGHLPLQEGELPLVQQQVAPSDKPPKPELPGVDFADPQGPSLPGMDFPDPQGPSLPGLDFADPQGSTIFQIARLISHGPMPQNKQSPLYPGMLYVPFGANQLNAPARLGIMSSEEVAGGREDPMAYGAMFPGFGGMRPGFEGMPHNPAMGGDFTLEFDSPVAATKGPENEEGGAQGSPMPEANPDNLENPAFLTELEPAPHAGLLALPKDDIPGLPRSPSGKMKGLPSVTPAAADPLMTPELADVYRTYDADMTTSVDFQEEATMDTTMAPNSLQTSMPGNKAQEPEMMHDAWHFQEP,mutated_sequence,1.0,447.0,UPI000000DCCB.a2m,UPI000000DCCB.npy,gnomAD
+UPI000013C495,UPI000013C495.csv,MEIPKLLPARGTLQGGGGGGIPAGGGRVHRGPDSPAGQVPTRRLLLLRGPQDGGPGRRREEASTASRGPGPSLLAPRTDQPSGGGGGGGDDFFLVLLDPVGGDVETAGSGQAAGPVLREEAEEGPGLQGGESGANPAGPTALGPRCLSAVPTPAPISAPGPAAAFAGTVTIHNQDLLLRFENGVLTLATPPPHAWEPGAAPAQQPGCLIAPQAGFPHAAHPGDCPELPPDLLLAEPAEPAPAPAPEEEAEGPAAALGPRGPLGSGPGVVLYLCPEAQCGQTFAKKHQLKVHLLTHSSSQGQRPFKCPLGGCGWTFTTSYKLKRHLQSHDKLRPFGCPAEGCGKSFTTVYNLKAHMKGHEQENSFKCEVCEESFPTQAKLSAHQRSHFEPERPYQCAFSGCKKTFITVSALFSHNRAHFREQELFSCSFPGCSKQYDKACRLKIHLRSHTGERPFLCDFDGCGWNFTSMSKLLRHKRKHDDDRRFMCPVEGCGKSFTRAEHLKGHSITHLGTKPFVCPVAGCCARFSARSSLYIHSKKHLQDVDTWKSRCPISSCNKLFTSKHSMKTHMVKRHKVGQDLLAQLEAANSLTPSSELTSQRQNDLSDAEIVSLFSDVPDSTSAALLDTALVNSGILTIDVASVSSTLAGHLPANNNNSVGQAVDPPSLMATSDPPQSLDTSLFFGTAATGFQQSSLNMDEVSSVSVGPLGSLDSLAMKNSSPEPQALTPSSKLTVDTDALTPSSTLCENSVSELLTPTKAEWNVHPDSDFFGQEGETQFGFPNAAGNHGSQKETDLITVTGSSFLV,mutated_sequence,1.0,803.0,UPI000013C495.a2m,UPI000013C495.npy,gnomAD
+UPI000016A402,UPI000016A402.csv,MNAETCVSYCESPAAAMDAYYSPVSQSREGSSPFRAFPGGDKFGTTFLSAAAKAQGFGDAKSRARYGAGQQDLATPLESGAGARGSFNKFQPQPSTPQPQPPPQPQPQQQQPQPQPPAQPHLYLQRGACKTPPDGSLKLQEGSSGHSAALQVPCYAKESSLGEPELPPDSDTVGMDSSYLSVKEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRNRTTFTSYQLEELEKVFQKTHYPDVYAREQLAMRTDLTEARVQVWFQNRRAKWRKRERFGQMQQVRTHFSTAYELPLLTRAENYAQIQNPSWLGNNGAASPVPACVVPCDPVPACMSPHAHPPGSGASSVTDFLSVSGAGSHVGQTHMGSLFGAASLSPGLNGYELNGEPDRKTSSIAALRMKAKEHSAAISWAT,mutated_sequence,1.0,411.0,UPI000016A402.a2m,UPI000016A402.npy,gnomAD
+UPI00001AE452,UPI00001AE452.csv,MPPPSPDSENGFYPGLPSSMNPAFFPSFSPVSPHGCTGLSVPTSGGGGGGFGGPFSATAVPPPPPPAMNIPQQQPPPPAAPQQPQSRRSPVSPQLQQQHQAAAAAFLQQRNSYNHHQPLLKQSPWSNHQSSGWGTGSMSWGAMHGRDHRRTGNMGIPGTMNQISPLKKPFSGNVIAPPKFTRSTPSLTPKSWIEDNVFRTDNNSNTLLPLQVRSSLQLPAWGSDSLQDSWCTAAGTSRIDQDRSRMYDSLNMHSLENSLIDIMRAEHDPLKGRLSYPHPGTDNLLMLNGRSSLFPIDDGLLDDGHSDQVGVLNSPTCYSAHQNGERIERFSRKVFVGGLPPDIDEDEITASFRRFGPLVVDWPHKAESKSYFPPKGYAFLLFQEESSVQALIDACIEEDGKLYLCVSSPTIKDKPVQIRPWNLSDSDFVMDGSQPLDPRKTIFVGGVPRPLRAVELAMIMDRLYGGVCYAGIDTDPELKYPKGAGRVAFSNQQSYIAAISARFVQLQHGDIDKRVEVKPYVLDDQMCDECQGARCGGKFAPFFCANVTCLQYYCEFCWANIHSRAGREFHKPLVKEGADRPRQIHFRWN,mutated_sequence,1.0,589.0,UPI00001AE452.a2m,UPI00001AE452.npy,gnomAD
+UPI0000070A05,UPI0000070A05.csv,MFVQEEKIFAGKVLRLHICASDGAEWLEEATEDTSVEKLKERCLKHCAHGSLEDPKSITHHKLIHAASERVLSDARTILEENIQDQDVLLLIKKRAPSPLPKMADVSAEEKKKQDQKAPDKEAILRATANLPSYNMDRAAVQTNMRDFQTELRKILVSLIEVAQKLLALNPDAVELFKKANAMLDEDEDERVDEAALRQLTEMGFPENRATKALQLNHMSVPQAMEWLIEHAEDPTIDTPLPGQAPPEAEGATAAASEAAAGASATDEEARDELTEIFKKIRRKREFRADARAVISLMEMGFDEKEVIDALRVNNNQQNAACEWLLGDRKPSPEELDKGIDPDSPLFQAILDNPVVQLGLTNPKTLLAFEDMLENPLNSTQWMNDPETGPVMLQISRIFQTLNRT,mutated_sequence,1.0,405.0,UPI0000070A05.a2m,UPI0000070A05.npy,gnomAD
+UPI000013C9CD,UPI000013C9CD.csv,MNQNTTEPVAATETLAEVPEHVLRGLPEEVRLFPSAVDKTRIGVWATKPILKGKKFGPFVGDKKKRSQVKNNVYMWEVYYPNLGWMCIDATDPEKGNWLRYVNWACSGEEQNLFPLEINRAIYYKTLKPIAPGEELLVWYNGEDNPEIAAAIEEERASARSKRSSPKSRKGKKKSQENKNKGNKIQDIQLKTSEPDFTSANMRDSAEGPKEDEEKPSASALEQPATLQEVASQEVPPELATPAPAWEPQPEPDERLEAAACEVNDLGEEEEEEEEEDEEEEEDDDDDELEDEGEEEASMPNENSVKEPEIRCDEKPEDLLEEPKTTSEETLEDCSEVTPAMQIPRTKEEANGDVFETFMFPCQHCERKFTTKQGLERHMHIHISTVNHAFKCKYCGKAFGTQINRRRHERRHEAGLKRKPSQTLQPSEDLADGKASGENVASKDDSSPPSLGPDCLIMNSEKASQDTINSSVVEENGEVKELHPCKYCKKVFGTHTNMRRHQRRVHERHLIPKGVRRKGGLEEPQPPAEQAQATQNVYVPSTEPEEEGEADDVYIMDISSNISENLNYYIDGKIQTNNNTSNCDVIEMESASADLYGINCLLTPVTVEITQNIKTTQVPVTEDLPKEPLGSTNSEAKKRRTASPPALPKIKAETDSDPMVPSCSLSLPLSISTTEAVSFHKEKSVYLSSKLKQLLQTQDKLTPAGISATEIAKLGPVCVSAPASMLPVTSSRFKRRTSSPPSSPQHSPALRDFGKPSDGKAAWTDAGLTSKKSKLESHSDSPAWSLSGRDERETVSPPCFDEYKMSKEWTASSAFSSVCNQQPLDLSSGVKQKAEGTGKTPVQWESVLDLSVHKKHCSDSEGKEFKESHSVQPTCSAVKKRKPTTCMLQKVLLNEYNGIDLPVENPADGTRSPSPCKSLEAQPDPDLGPGSGFPAPTVESTPDVCPSSPALQTPSLSSGQLPPLLIPTDPSSPPPCPPVLTVATPPPPLLPTVPLPAPSSSASPHPCPSPLSNATAQSPLPILSPTVSPSPSPIPPVEPLMSAASPGPPTLSSSSSSSSSSSSFSSSSSSSSPSPPPLSAISSVVSSGDNLEASLPMISFKQEELENEGLKPREEPQSAAEQDVVVQETFNKNFVCNVCESPFLSIKDLTKHLSIHAEEWPFKCEFCVQLFKDKTDLSEHRFLLHGVGNIFVCSVCKKEFAFLCNLQQHQRDLHPDKVCTHHEFESGTLRPQNFTDPSKAHVEHMQSLPEDPLETSKEEEELNDSSEELYTTIKIMASGIKTKDPDVRLGLNQHYPSFKPPPFQYHHRNPMGIGVTATNFTTHNIPQTFTTAIRCTKCGKGVDNMPELHKHILACASASDKKRYTPKKNPVPLKQTVQPKNGVVVLDNSGKNAFRRMGQPKRLNFSVELSKMSSNKLKLNALKKKNQLVQKAILQKNKSAKQKADLKNACESSSHICPYCNREFTYIGSLNKHAAFSCPKKPLSPPKKKVSHSSKKGGHSSPASSDKNSNSNHRRRTADAEIKMQSMQTPLGKTRARSSGPTQVPLPSSSFRSKQNVKFAASVKSKKPSSSSLRNSSPIRMAKITHVEGKKPKAVAKNHSAQLSSKTSRSLHVRVQKSKAVLQSKSTLASKKRTDRFNIKSRERSGGPVTRSLQLAAAADLSENKREDGSAKQELKDFSYSLRLASRCSPPAAPYITRQYRKVKAPAAAQFQGPFFKE,mutated_sequence,1.0,1718.0,UPI000013C9CD.a2m,UPI000013C9CD.npy,gnomAD
+UPI000004F239,UPI000004F239.csv,MEQSNYSVYADFILLGLFSNARFPWLLFALILLVFLTSIASNVVKIILIHIDSRLHTPMYFLLSQLSLRDILYISTIVPKMLVDQVMSQRAISFAGCTAQHFLYLTLAGAEFFLLGLMSYDRYVAICNPLHYPVLMSRKICWLIVAAAWLGGSIDGFLLTPVTMQFPFCASREINHFFCEVPALLKLSCTDTSAYETAMYVCCIMMLLIPFSVISGSYTRILITVYRMSEAEGRGKAVATCSSHMVVVSLFYGAAMYTYVLPHSYHTPEQDKAVSAFYTILTPMLNPLIYSLRNKDVTGALQKVVGRCVSSGKVTTF,mutated_sequence,1.0,317.0,UPI000004F239.a2m,UPI000004F239.npy,gnomAD
+UPI0000129FBA,UPI0000129FBA.csv,MRALWVLGLCCVLLTFGSVRADDEVDVDGTVEEDLGKSREGSRTDDEVVQREEEAIQLDGLNASQIRELREKSEKFAFQAEVNRMMKLIINSLYKNKEIFLRELISNASDALDKIRLISLTDENALSGNEELTVKIKCDKEKNLLHVTDTGVGMTREELVKNLGTIAKSGTSEFLNKMTEAQEDGQSTSELIGQFGVGFYSAFLVADKVIVTSKHNNDTQHIWESDSNEFSVIADPRGNTLGRGTTITLVLKEEASDYLELDTIKNLVKKYSQFINFPIYVWSSKTETVEEPMEEEEAAKEEKEESDDEAAVEEEEEEKKPKTKKVEKTVWDWELMNDIKPIWQRPSKEVEEDEYKAFYKSFSKESDDPMAYIHFTAEGEVTFKSILFVPTSAPRGLFDEYGSKKSDYIKLYVRRVFITDDFHDMMPKYLNFVKGVVDSDDLPLNVSRETLQQHKLLKVIRKKLVRKTLDMIKKIADDKYNDTFWKEFGTNIKLGVIEDHSNRTRLAKLLRFQSSHHPTDITSLDQYVERMKEKQDKIYFMAGSSRKEAESSPFVERLLKKGYEVIYLTEPVDEYCIQALPEFDGKRFQNVAKEGVKFDESEKTKESREAVEKEFEPLLNWMKDKALKDKIEKAVVSQRLTESPCALVASQYGWSGNMERIMKAQAYQTGKDISTNYYASQKKTFEINPRHPLIRDMLRRIKEDEDDKTVLDLAVVLFETATLRSGYLLPDTKAYGDRIERMLRLSLNIDPDAKVEEEPEEEPEETAEDTTEDTEQDEDEEMDVGTDEEEETAKESTAEKDEL,mutated_sequence,1.0,803.0,UPI0000129FBA.a2m,UPI0000129FBA.npy,gnomAD
+UPI0000141B94,UPI0000141B94.csv,MDPPRPALLALLALPALLLLLLAGARAEEEMLENVSLVCPKDATRFKHLRKYTYNYEAESSSGVPGTADSRSATRINCKVELEVPQLCSFILKTSQCTLKEVYGFNPEGKALLKKTKNSEEFAAAMSRYELKLAIPEGKQVFLYPEKDEPTYILNIKRGIISALLVPPETEEAKQVLFLDTVYGNCSTHFTVKTRKGNVATEISTERDLGQCDRFKPIRTGISPLALIKGMTRPLSTLISSSQSCQYTLDAKRKHVAEAICKEQHLFLPFSYKNKYGMVAQVTQTLKLEDTPKINSRFFGEGTKKMGLAFESTKSTSPPKQAEAVLKTLQELKKLTISEQNIQRANLFNKLVTELRGLSDEAVTSLLPQLIEVSSPITLQALVQCGQPQCSTHILQWLKRVHANPLLIDVVTYLVALIPEPSAQQLREIFNMARDQRSRATLYALSHAVNNYHKTNPTGTQELLDIANYLMEQIQDDCTGDEDYTYLILRVIGNMGQTMEQLTPELKSSILKCVQSTKPSLMIQKAAIQALRKMEPKDKDQEVLLQTFLDDASPGDKRLAAYLMLMRSPSQADINKIVQILPWEQNEQVKNFVASHIANILNSEELDIQDLKKLVKEALKESQLPTVMDFRKFSRNYQLYKSVSLPSLDPASAKIEGNLIFDPNNYLPKESMLKTTLTAFGFASADLIEIGLEGKGFEPTLEALFGKQGFFPDSVNKALYWVNGQVPDGVSKVLVDHFGYTKDDKHEQDMVNGIMLSVEKLIKDLKSKEVPEARAYLRILGEELGFASLHDLQLLGKLLLMGARTLQGIPQMIGEVIRKGSKNDFFLHYIFMENAFELPTGAGLQLQISSSGVIAPGAKAGVKLEVANMQAELVAKPSVSVEFVTNMGIIIPDFARSGVQMNTNFFHESGLEAHVALKAGKLKFIIPSPKRPVKLLSGGNTLHLVSTTKTEVIPPLIENRQSWSVCKQVFPGLNYCTSGAYSNASSTDSASYYPLTGDTRLELELRPTGEIEQYSVSATYELQREDRALVDTLKFVTQAEGAKQTEATMTFKYNRQSMTLSSEVQIPDFDVDLGTILRVNDESTEGKTSYRLTLDIQNKKITEVALMGHLSCDTKEERKIKGVISIPRLQAEARSEILAHWSPAKLLLQMDSSATAYGSTVSKRVAWHYDEEKIEFEWNTGTNVDTKKMTSNFPVDLSDYPKSLHMYANRLLDHRVPQTDMTFRHVGSKLIVAMSSWLQKASGSLPYTQTLQDHLNSLKEFNLQNMGLPDFHIPENLFLKSDGRVKYTLNKNSLKIEIPLPFGGKSSRDLKMLETVRTPALHFKSVGFHLPSREFQVPTFTIPKLYQLQVPLLGVLDLSTNVYSNLYNWSASYSGGNTSTDHFSLRARYHMKADSVVDLLSYNVQGSGETTYDHKNTFTLSYDGSLRHKFLDSNIKFSHVEKLGNNPVSKGLLIFDASSSWGPQMSASVHLDSKKKQHLFVKEVKIDGQFRVSSFYAKGTYGLSCQRDPNTGRLNGESNLRFNSSYLQGTNQITGRYEDGTLSLTSTSDLQSGIIKNTASLKYENYELTLKSDTNGKYKNFATSNKMDMTFSKQNALLRSEYQADYESLRFFSLLSGSLNSHGLELNADILGTDKINSGAHKATLRIGQDGISTSATTNLKCSLLVLENELNAELGLSGASMKLTTNGRFREHNAKFSLDGKAALTELSLGSAYQAMILGVDSKNIFNFKVSQEGLKLSNDMMGSYAEMKFDHTNSLNIAGLSLDFSSKLDNIYSSDKFYKQTVNLQLQPYSLVTTLNSDLKYNALDLTNNGKLRLEPLKLHVAGNLKGAYQNNEIKHIYAISSAALSASYKADTVAKVQGVEFSHRLNTDIAGLASAIDMSTNYNSDSLHFSNVFRSVMAPFTMTIDAHTNGNGKLALWGEHTGQLYSKFLLKAEPLAFTFSHDYKGSTSHHLVSRKSISAALEHKVSALLTPAEQTGTWKLKTQFNNNEYSQDLDAYNTKDKIGVELTGRTLADLTLLDSPIKVPLLLSEPINIIDALEMRDAVEKPQEFTIVAFVKYDKNQDVHSINLPFFETLQEYFERNRQTIIVVLENVQRNLKHINIDQFVRKYRAALGKLPQQANDYLNSFNWERQVSHAKEKLTALTKKYRITENDIQIALDDAKINFNEKLSQLQTYMIQFDQYIKDSYDLHDLKIAIANIIDEIIEKLKSLDEHYHIRVNLVKTIHDLHLFIENIDFNKSGSSTASWIQNVDTKYQIRIQIQEKLQQLKRHIQNIDIQHLAGKLKQHIEAIDVRVLLDQLGTTISFERINDILEHVKHFVINLIGDFEVAEKINAFRAKVHELIERYEVDQQIQVLMDKLVELAHQYKLKETIQKLSNVLQQVKIKDYFEKLVGFIDDAVKKLNELSFKTFIEDVNKFLDMLIKKLKSFDYHQFVDETNDKIREVTQRLNGEIQALELPQKAEALKLFLEETKATVAVYLESLQDTKITLIINWLQEALSSASLAHMKAKFRETLEDTRDRMYQMDIQQELQRYLSLVGQVYSTLVTYISDWWTLAAKNLTDFAEQYSIQDWAKRMKALVEQGFTVPEIKTILGTMPAFEVSLQALQKATFQTPDFIVPLTDLRIPSVQINFKDLKNIKIPSRFSTPEFTILNTFHIPSFTIDFVEMKVKIIRTIDQMLNSELQWPVPDIYLRDLKVEDIPLARITLPDFRLPEIAIPEFIIPTLNLNDFQVPDLHIPEFQLPHISHTIEVPTFGKLYSILKIQSPLFTLDANADIGNGTTSANEAGIAASITAKGESKLEVLNFDFQANAQLSNPKINPLALKESVKFSSKYLRTEHGSEMLFFGNAIEGKSNTVASLHTEKNTLELSNGVIVKINNQLTLDSNTKYFHKLNIPKLDFSSQADLRNEIKTLLKAGHIAWTSSGKGSWKWACPRFSDEGTHESQISFTIEGPLTSFGLSNKINSKHLRVNQNLVYESGSLNFSKLEIQSQVDSQHVGHSVLTAKGMALFGEGKAEFTGRHDAHLNGKVIGTLKNSLFFSAQPFEITASTNNEGNLKVRFPLRLTGKIDFLNNYALFLSPSAQQASWQVSARFNQYKYNQNFSAGNNENIMEAHVGINGEANLDFLNIPLTIPEMRLPYTIITTPPLKDFSLWEKTGLKEFLKTTKQSFDLSVKAQYKKNKHRHSITNPLAVLCEFISQSIKSFDRHFEKNRNNALDFVTKSYNETKIKFDKYKAEKSHDELPRTFQIPGYTVPVVNVEVSPFTIEMSAFGYVFPKAVSMPSFSILGSDVRVPSYTLILPSLELPVLHVPRNLKLSLPDFKELCTISHIFIPAMGNITYDFSFKSSVITLNTNAELFNQSDIVAHLLSSSSSVIDALQYKLEGTTRLTRKRGLKLATALSLSNKFVEGSHNSTVSLTTKNMEVSVATTTKAQIPILRMNFKQELNGNTKSKPTVSSSMEFKYDFNSSMLYSTAKGAVDHKLSLESLTSYFSIESSTKGDVKGSVLSREYSGTIASEANTYLNSKSTRSSVKLQGTSKIDDIWNLEVKENFAGEATLQRIYSLWEHSTKNHLQLEGLFFTNGEHTSKATLELSPWQMSALVQVHASQPSSFHDFPDLGQEVALNANTKNQKIRWKNEVRIHSGSFQSQVELSNDQEKAHLDIAGSLEGHLRFLKNIILPVYDKSLWDFLKLDVTTSIGRRQHLRVSTAFVYTKNPNGYSFSIPVKVLADKFIIPGLKLNDLNSVLVMPTFHVPFTDLQVPSCKLDFREIQIYKKLRTSSFALNLPTLPEVKFPEVDVLTKYSQPEDSLIPFFEITVPESQLTVSQFTLPKSVSDGIAALDLNAVANKIADFELPTIIVPEQTIEIPSIKFSVPAGIVIPSFQALTARFEVDSPVYNATWSASLKNKADYVETVLDSTCSSTVQFLEYELNVLGTHKIEDGTLASKTKGTFAHRDFSAEYEEDGKYEGLQEWEGKAHLNIKSPAFTDLHLRYQKDKKGISTSAASPAVGTVGMDMDEDDDFSKWNFYYSPQSSPDKKLTIFKTELRVRESDEETQIKVNWEEEAASGLLTSLKDNVPKATGVLYDYVNKYHWEHTGLTLREVSSKLRRNLQNNAEWVYQGAIRQIDDIDVRFQKAASGTTGTYQEWKDKAQNLYQELLTQEGQASFQGLKDNVFDGLVRVTQEFHMKVKHLIDSLIDFLNFPRFQFPGKPGIYTREELCTMFIREVGTVLSQVYSKVHNGSEILFSYFQDLVITLPFELRKHKLIDVISMYRELLKDLSKEAQEVFKAIQSLKTTEVLRNLQDLLQFIFQLIEDNIKQLKEMKFTYLINYIQDEINTIFSDYIPYVFKLLKENLCLNLHKFNEFIQNELQEASQELQQIHQYIMALREEYFDPSIVGWTVKYYELEEKIVSLIKNLLVALKDFHSEYIVSASNFTSQLSSQVEQFLHRNIQEYLSILTDPDGKGKEKIAELSATAQEIIKSQAIATKKIISDYHQQFRYKLQDFSDQLSDYYEKFIAESKRLIDLSIQNYHTFLIYITELLKKLQSTTVMNPYMKLAPGELTIIL,mutated_sequence,1.0,4563.0,UPI0000141B94.a2m,UPI0000141B94.npy,gnomAD
+UPI000046D388,UPI000046D388.csv,MNMSQASVSFQDVTVEFTREEWQHLGPVERTLYRDVMLENYSHLISVGYCITKPKVISKLEKGEEPWSLEDEFLNQRYPGYFKVDHIKGIREKQEKPLWQEIFISDADKTLSKEGQKVLEKPFNLEIAPELSEKISCKCDSHRMNLPVASQLIISERKYSRKKTEYMNVCEKLQLDIKHEKAHAEEKSYEHGENAKAFSYKKDQHWKFQTLEESFECDGSGQGLYDKTICITPQSFLTGEKSCKDDEFRKNFDKITLFNHMRTDTRGKCSDLNEYGTSCDKTTAVEYNKVHMAMTHYECNERGINFSRKSPLTQSQRTITGWSAFESNKCEENFSQSSAHIVHQKTQAGDKFGEHNECTDALYQKLDFTAHQRIHTEDKFYLSDEHGKCRKSFYRKAHLIQHQRPHSGEKTYQYEECAKSFCSSSHPIQHPGTYVGFKLYECNECGKAFCQNSNLSKHLRIHTKEKPCDNNGCGRSYKSPLIGHQKTDAEMELCGGSEYGKTSHLKGHQRILMGEKPYECIECGKTFSKTSHLRAHQRIHTGEKPYECVECEKTFSHKTHLSVHQRVHTGEKPYECNDCGKSFTYNSALRAHQRIHTGEKPYECSDCEKTFAHNSALRAHHRIHTGEKPYECNECGRSFAHISVLKAHQRIHTGEKPYECNECGRSFTYNSALRAHQRIHTGRKPYECSDCEKTFAHNSALKIHQRIHTGEKPYECNECEKTFAHNSALRAHQNIHTGEKLYECSECGKTFFQKTRLSTHRRIHTGEKPYECSKCGKTFSQKSYLSGHERIHTGEKPYECNVCGKTFVYKAALIVHQRIHTGEKPYECNQCGKTFSQRTHLCAHQRIHTGEKPYECNECGKTFADNSALRAHHRIHTGEKPYECNDCGKTFSKTSHLRAHLRTRSGEKPYECSECGKTFSEKSYVSAHQRVHTGEKPYECNVCGKPFAHNSTLRVHQRIHTGEKSYECNDCGKTFSQKSHLSAHQRIHTGEKPYECNECGKAFAQNSTLRVHQRIHTGEKPYECDECGKTFVRKAALRVHHTRMHTREKTLACNGFGKS,mutated_sequence,1.0,1059.0,UPI000046D388.a2m,UPI000046D388.npy,gnomAD
+UPI00001AEF8F,UPI00001AEF8F.csv,MKAIIHLTLLALLSVNTATNQGNSADAVTTTETATSGPTVAAADTTETNFPETASTTANTPFPTATSPAPPIISTHSSSTIPTPAPPIISTHSSSTIPIPTAADSESTTNVNSLATSDIITASSPNDGLITMVPSETQSNNEMSPTTEDNQSSGPPTGTALLETSTLNSTGPSNPCQDDPCADNSLCVKLHNTSFCLCLEGYYYNSSTCKKGKVFPGKISVTVSETFDPEEKHSMAYQDLHSEITSLFKDVFGTSVYGQTVILTVSTSLSPRSEMRADDKFVNVTIVTILAETTSDNEKTVTEKINKAIRSSSSNFLNYDLTLRCDYYGCNQTADDCLNGLACDCKSDLQRPNPQSPFCVASSLKCPDACNAQHKQCLIKKSGGAPECACVPGYQEDANGNCQKCAFGYSGLDCKDKFQLILTIVGTIAGIVILSMIIALIVTARSNNKTKHIEEENLIDEDFQNLKLRSTGFTNLGAEGSVFPKVRITASRDSQMQNPYSRHSSMPRPDY,mutated_sequence,1.0,511.0,UPI00001AEF8F.a2m,UPI00001AEF8F.npy,gnomAD
+UPI000013ECF6,UPI000013ECF6.csv,MAGCIPEEKTYRRFLELFLGEFRGPCGGGEPEPEPEPEPEPEPESEPEPEPELVEAEAAEASVEEPGEEAATVAATEEGDQEQDPEPEEEAAVEGEEEEEGAATAAAAPGHSAVPPPPPQLPPLPPLPRPLSERITREEVEGESLDLCLQQLYKYNCPSFLAAALARATSDEVLQSDLSAHYIPKETDGTEGTVEIETVKLARSVFSKLHEICCSWVKDFPLRRRPQLYYETSIHAIKNMRRKMEDKHVCIPDFNMLFNLEDQEEQAYFAVFDGHGGVDAAIYASIHLHVNLVRQEMFPHDPAEALCRAFRVTDERFVQKAARESLRCGTTGVVTFIRGNMLHVAWVGDSQVMLVRKGQAVELMKPHKPDREDEKQRIEALGGCVVWFGAWRVNGSLSVSRAIGDAEHKPYICGDADSASTVLDGTEDYLILACDGFYDTVNPDEAVKVVSDHLKENNGDSSMVAHKLVASARDAGSSDNITVIVVFLRDMNKAVNVSEESDWTENSFQGGQEDGGDDKENHGECKRPWPQHQCSAPADLGYDGRVDSFTDRTSLSPGSQINVLEDPGYLDLTQIEASKPHSAQFLLPVEMFGPGAPKKANLINELMMEKKSVQSSLPEWSGAGEFPTAFNLGSTGEQIYRMQSLSPVCSGLENEQFKSPGNRVSRLSHLRHHYSKKWHRFRFNPKFYSFLSAQEPSHKIGTSLSSLTGSGKRNRIRSSLPWRQNSWKGYSENMRKLRKTHDIPCPDLPWSYKIE,mutated_sequence,1.0,755.0,UPI000013ECF6.a2m,UPI000013ECF6.npy,gnomAD
+UPI0000061E6E,UPI0000061E6E.csv,MEGNKTWITDITLPRFQVGPALEILLCGLFSAFYTLTLLGNGVIFGIICLDCKLHTPMYFFLSHLAIVDISYASNYVPKMLTNLMNQESTISFFPCIMQTFLYLAFAHVECLILVVMSYDRYADICHPLRYNSLMSWRVCTVLAVASWVFSFLLALVPLVLILSLPFCGPHEINHFFCEILSVLKLACADTWLNQVVIFAACVFILVGPLCLVLVSYLRILAAILRIQSGEGRRKAFSTCSSHLCVVGLFFGSAIVTYMAPKSRHPEEQQKVLSLFYSLFNPMLNPLIYSLRNAEVKGALRRALRKERLT,mutated_sequence,1.0,310.0,UPI0000061E6E.a2m,UPI0000061E6E.npy,gnomAD
+UPI00001FD815,UPI00001FD815.csv,MNLQAQPKAQNKRKRCLFGGQEPAPKEQPPPLQPPQQSIRVKEEQYLGHEGPGGAVSTSQPVELPPPSSLALLNSVVYGPERTSAAMLSQQVASVKWPNSVMAPGRGPERGGGGGVSDSSWQQQPGQPPPHSTWNCHSLSLYSATKGSPHPGVGVPTYYNHPEALKREKAGGPQLDRYVRPMMPQKVQLEVGRPQAPLNSFHAAKKPPNQSLPLQPFQLAFGHQVNRQVFRQGPPPPNPVAAFPPQKQQQQQQPQQQQQQQQAALPQMPLFENFYSMPQQPSQQPQDFGLQPAGPLGQSHLAHHSMAPYPFPPNPDMNPELRKALLQDSAPQPALPQVQIPFPRRSRRLSKEGILPPSALDGAGTQPGQEATGNLFLHHWPLQQPPPGSLGQPHPEALGFPLELRESQLLPDGERLAPNGREREAPAMGSEEGMRAVSTGDCGQVLRGGVIQSTRRRRRASQEANLLTLAQKAVELASLQNAKDGSGSEEKRKSVLASTTKCGVEFSEPSLATKRAREDSGMVPLIIPVSVPVRTVDPTEAAQAGGLDEDGKGPEQNPAEHKPSVIVTRRRSTRIPGTDAQAQAEDMNVKLEGEPSVRKPKQRPRPEPLIIPTKAGTFIAPPVYSNITPYQSHLRSPVRLADHPSERSFELPPYTPPPILSPVREGSGLYFNAIISTSTIPAPPPITPKSAHRTLLRTNSAEVTPPVLSVMGEATPVSIEPRINVGSRFQAEIPLMRDRALAAADPHKADLVWQPWEDLESSREKQRQVEDLLTAACSSIFPGAGTNQELALHCLHESRGDILETLNKLLLKKPLRPHNHPLATYHYTGSDQWKMAERKLFNKGIAIYKKDFFLVQKLIQTKTVAQCVEFYYTYKKQVKIGRNGTLTFGDVDTSDEKSAQEEVEVDIKTSQKFPRVPLPRRESPSEERLEPKREVKEPRKEGEEEVPEIQEKEEQEEGRERSRRAAAVKATQTLQANESASDILILRSHESNAPGSAGGQASEKPREGTGKSRRALPFSEKKKKTETFSKTQNQENTFPCKKCGR,mutated_sequence,1.0,1045.0,UPI00001FD815.a2m,UPI00001FD815.npy,gnomAD
+UPI0000236D8A,UPI0000236D8A.csv,MGTFDLWTDYLGLAHLVRALSGKEGPETRLSPQPEPEPMLEPDQKRSLESSPAPERLCSFCKHNGESRAIYQSHVLKDEAGRVLCPILRDYVCPQCGATRERAHTRRFCPLTGQGYTSVYSHTTRNSAGKKLVRPDKAKTQDTGHRRGGGGGAGFRGAGKSEPSPSCSPSMST,mutated_sequence,1.0,173.0,UPI0000236D8A.a2m,UPI0000236D8A.npy,gnomAD
+UPI0000237947,UPI0000237947.csv,MAKDSPSPLGASPKKPGCSSPAAAVLENQRRELEKLRAELEAERAGWRAERRRFAARERQLREEAERERRQLADRLRSKWEAQRSRELRQLQEEMQREREAEIRQLLRWKEAEQRQLQQLLHRERDGVVRQARELQRQLAEELVNRGHCSRPGASEVSAAQCRCRLQEVLAQLRWQTDGEQAARIRYLQAALEVERQLFLKYILAHFRGHPALSGSPDPQAVHSLEEPLPQTSSGSCHAPKPACQLGSLDSLSAEVGVRSRSLGLVSSACSSSPDGLLSTHASSLDCFAPACSRSLDSTRSLPKASKSEERPSSPDTSTPGSRRLSPPPSPLPPPPPPSAHRKLSNPRGGEGSESQPCEVLTPSPPGLGHHELIKLNWLLAKALWVLARRCYTLQEENKQLRRAGCPYQADEKVKRLKVKRAELTGLARRLADRARELQETNLRAVSAPIPGESCAGLELCQVFARQRARDLSEQASAPLAKDKQIEELRQECHLLQARVASGPCSDLHTGRGGPCTQWLNVRDLDRLQRESQREVLRLQRQLMLQQGNGGAWPEAGGQSATCEEVRRQMLALERELDQRRRECQELGTQAAPARRRGEEAETQLQAALLKNAWLAEENGRLQAKTDWVRKVEAENSEVRGHLGRACQERDASGLIAEQLLQQAARGQDRQQQLQRDPQKALCDLHPSWKEIQALQCRPGHPPEQPWETSQMPESQVKGSRRPKFHARPEDYAVSQPNRDIQEKREASLEESPVALGESASVPQVSETVPASQPLSKKTSSQSNSSSEGSMWATVPSSPTLDRDTASEVDDLEPDSVSLALEMGGSAAPAAPKLKIFMAQYNYNPFEGPNDHPEGELPLTAGDYIYIFGDMDEDGFYEGELDDGRRGLVPSNFVEQIPDSYIPGCLPAKSPDLGPSQLPAGQDEALEEDSLLSGKAQGMVDRGLCQMVRVGSKTEVATEILDTKTEACQLGLLQSMGKQGLSRPLLGTKGVLRMAPMQLHLQNVTATSANITWVYSSHRHPHVVYLDDREHALTPAGVSCYTFQGLCPGTHYRVRVEVRLPWDLLQVYWGTMSSTVTFDTLLAGPPYPPLDVLVERHASPGVLVVSWLPVTIDSAGSSNGVQVTGYAVYADGLKVCEVADATAGSTVLEFSQLQVPLTWQKVSVRTMSLCGESLDSVPAQIPEDFFMCHRWPETPPFSYTCGDPSTYRVTFPVCPQKLSLAPPSAKASPHNPGSCGEPQAKFLEAFFEEPPRRQSPVSNLGSEGECPSSGAGSQAQELAEAWEGCRKDLLFQKSPQNHRPPSVSDQPGEKENCYQHMGTSKSPAPGFIHLRTECGPRKEPCQEKAALERVLRQKQDAQGFTPPQLGASQQYASDFHNVLKEEQEALCLDLRGTERREERREPEPHSRQGQALGVKRGCQLHEPSSALCPAPSAKVIKMPRGGPQQLGTGANTPARVFVALSDYNPLVMSANLKAAEEELVFQKRQLLRVWGSQDTHDFYLSECNRQVGNIPGRLVAEMEVGTEQTDRRWRSPAQGHLPSVAHLEDFQGLIIPQGSSLVLQGNSKRLPLWTPKIMIAALDYDPGDGQMGGQGKGRLALRAGDVVMVYGPMDDQGFYYGELGGHRGLVPAHLLDHMSLHGH,mutated_sequence,1.0,1639.0,UPI0000237947.a2m,UPI0000237947.npy,gnomAD
+UPI0000367804,UPI0000367804.csv,MSCQISCKSRGRGGGGGGFRGFSSGSAVVSGGSRRSTSSFSCLSRHGGGGGGFGGGGFGSRSLVGLGGTKSISISVAGGGGGFGAAGGFGGRGGGFGGGSSFGGGSGFSGGGFGGGGFGGGRFGGFGGPGGVGGLGGPGGFGPGGYPGGIHEVSVNQSLLQPLNVKVDPEIQNVKAQEREQIKTLNNKFASFIDKVRFLEQQNQVLQTKWELLQQMNVGTRPINLEPIFQGYIDSLKRYLDGLTAERTSQNSELNNMQDLVEDYKKKYEDEINKRTAAENDFVTLKKDVDNAYMIKVELQSKVDLLNQEIEFLKVLYDAEISQIHQSVTDTNVILSMDNSRNLDLDSIIAEVKAQYEEIAQRSKEEAEALYHSKYEELQVTVGRHGDSLKEIKIEISELNRVIQRLQGEIAHVKKQCKNVQDAIADAEQRGEHALKDARNKLNDLEEALQQAKEDLARLLRDYQELMNVKLALDVEIATYRKLLEGEECRMSGDLSSNVTVSVTSSTISSNVASKAAFGGSGGRGSSSGGGYSSGSSSYGSGGRQSGSRGGSGGGGSISGGGYGSGGGSGGRYGSGGGSKGGSISGGGYGSGGGKHSSGGGSRGGSSSGGGYGSGGGGSSSVKGSSGEAFGSSVTFSFR,mutated_sequence,1.0,639.0,UPI0000367804.a2m,UPI0000367804.npy,gnomAD
+UPI00006C069B,UPI00006C069B.csv,MASPRASRWPPPLLLLLLPLLLLPPAAPGTRDPPPSPARRALSLAPLAGAGLELQLERRPEREPPPTPPRERRGPATPGPSYRAPEPGAATQRGPSGRAPRGGSADAAWKHWPESNTEAHVENITFYQNQEDFSTVSSKEGVMVQTSGKSHAASDAPENLTLLAETADARGRSGSSSRTNFTILPVGYSLEIATALTSQSGNLASESLHLPSSSSEFDERIAAFQTKSGTASEMGTERAMGLSEEWTVHSQEATTSAWSPSFLPALEMGELTTPSRKRNSSGPDLSWLHFYRTAASSPLLDLSSSSESTEKLNNSTGLQSSSVSQTKTMHVATVFTDGGPRTLRSLTVSLGPVSKTEGFPKDSRIATTSSSVLLSPSAVESRRNSRVTGNPGDEEFIEPSTENEFGLTSLRWQNDSPTFGEHQLASSSEVQNGSPMSQTETVSRSVAPMRGGEITAHWLLTNSTTSADVTGSSASYPEGVNASVLTQFSDSTVQSGGSHTALGDRSYSESSSTSSSESLNSSAPRGERSIAGISYGQVRGTAIEQRTSSDHTDHTYLSSTFTKGERALLSITDNSSSSDIVESSTSYIKISNSSHSEYSSFFHAQTERSNISSYDGEYAQPSTESPVLHTSNLPSYTPTINMPNTSVVLDTDAEFVSDSSSSSSSSSSSSSSGPPLPLPSVSQSHHLFSSILPSTRASVHLLKSTSDASTPWSSSPSPLPVSLTTSTSAPLSVSQTTLPQSSSTPVLPRARETPVTSFQTSTMTSFMTMLHSSQTADLKSQSTPHQEKVITESKSPSLVSLPTESTKAVTTNSPLPPSLTESSTEQTLPATSTNLAQMSPTFTTTILKTSQPLMTTPGTLSSTASLVTGPIAVQTTAGKQLSLTHPEILVPQISTEGGISTERNRVIVDATTGLIPLTSVPTSAKEMTTKLGVTAEYSPASRSLGTSPSPQTTVVSTAEDLAPKSATFAVQSSTQSPTTVSSSASVNSCAVNPCLHNGECVADNTSRGYHCRCPPSWQGDDCSVDVNECLSNPCPSTAMCNNTQGSFICKCPVGYQLEKGICNLVRTFVTEFKLKRTFLNTTVEKHSDLQEVENEITKTLNMCFSALPSYIRSTVHASRESNAVVISLQTTFSLASNVTLFDLADRMQKCVNSCKSSAEVCQLLGSQRRIFRAGSLCKRKSPECDKDTSICTDLDGVALCQCKSGYFQFNKMDHSCRACEDGYRLENETCMSCPFGLGGLNCGNPYQLITVVIAAAGGGLLLILGIALIVTCCRKNKNDISKLIFKSGDFQMSPYAEYPKNPRSQEWGREAIEMHENGSTKNLLQMTDVYYSPTSVRNPELERNGLYPAYTGLPGSRHSCIFPGQYNPSFISDESRRRDYF,mutated_sequence,1.0,1381.0,UPI00006C069B.a2m,UPI00006C069B.npy,gnomAD
+UPI000013DBF5,UPI000013DBF5.csv,MRPKTFPATTYSGNSRQRLQEIREGLKQPSKSSVQGLPAGPNSDTSLDAKVLGSKDATRQQQQMRATPKFGPYQKALREIRYSLLPFANESGTSAAAEVNRQMLQELVNAGCDQEMAGRALKQTGSRSIEAALEYISKMGYLDPRNEQIVRVIKQTSPGKGLMPTPVTRRPSFEGTGDSFASYHQLSGTPYEGPSFGADGPTALEEMPRPYVDYLFPGVGPHGPGHQHQHPPKGYGASVEAAGAHFPLQGAHYGRPHLLVPGEPLGYGVQRSPSFQSKTPPETGGYASLPTKGQGGPPGAGLAFPPPAAGLYVPHPHHKQAGPAAHQLHVLGSRSQVFASDSPPQSLLTPSRNSLNVDLYELGSTSVQQWPAATLARRDSLQKPGLEAPPRAHVAFRPDCPVPSRTNSFNSHQPRPGPPGKAEPSLPAPNTVTAVTAAHILHPVKSVRVLRPEPQTAVGPSHPAWVPAPAPAPAPAPAPAAEGLDAKEEHALALGGAGAFPLDVEYGGPDRRCPPPPYPKHLLLRSKSEQYDLDSLCAGMEQSLRAGPNEPEGGDKSRKSAKGDKGGKDKKQIQTSPVPVRKNSRDEEKRESRIKSYSPYAFKFFMEQHVENVIKTYQQKVNRRLQLEQEMAKAGLCEAEQEQMRKILYQKESNYNRLKRAKMDKSMFVKIKTLGIGAFGEVCLACKVDTHALYAMKTLRKKDVLNRNQVAHVKAERDILAEADNEWVVKLYYSFQDKDSLYFVMDYIPGGDMMSLLIRMEVFPEHLARFYIAELTLAIESVHKMGFIHRDIKPDNILIDLDGHIKLTDFGLCTGFRWTHNSKYYQKGSHVRQDSMEPSDLWDDVSNCRCGDRLKTLEQRARKQHQRCLAHSLVGTPNYIAPEVLLRKGYTQLCDWWSVGVILFEMLVGQPPFLAPTPTETQLKVINWENTLHIPAQVKLSPEARDLITKLCCSADHRLGRNGADDLKAHPFFSAIDFSSDIRKQPAPYVPTISHPMDTSNFDPVDEESPWNDASEGSTKAWDTLTSPNNKHPEHAFYEFTFRRFFDDNGYPFRCPKPSGAEASQAESSDLESSDLVDQTEGCQPVYV,mutated_sequence,1.0,1088.0,UPI000013DBF5.a2m,UPI000013DBF5.npy,gnomAD
+UPI00015CE746,UPI00015CE746.csv,MSHLPAVSPVFFQLPAPHPPTVLRPQLGLHPNPECDREKMSVRDHDPEVLTRNSACKPRGQLSGHLLKPRAPLEAA,mutated_sequence,,,UPI00015CE746.a2m,UPI00015CE746.npy,gnomAD
+UPI0000EE57E0,UPI0000EE57E0.csv,MMTMTTMADGLEGQDSSKSAFMEFGQQQQQQQQQQQQQQQQQQQPPPPPPPPPQPHSQQSSPAMAGAHYPLHCLHSAAAAAAAGSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYMSHSQHSPYLQSYHNSSAAAQTRGDDTDQQKTTVIENGEIRFNGKGKKIRKPRTIYSSLQLQALNHRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLLKQGSNPHESDPLQGSAALSPRSPALPPVWDVSASAKGVSMPPNSYMPGYSHWYSSPHQDTMQRPQMM,mutated_sequence,1.0,293.0,UPI0000EE57E0.a2m,UPI0000EE57E0.npy,gnomAD
+UPI000013E9AB,UPI000013E9AB.csv,MFARKPPGAAPLGAMPVPDQPSSASEKTSSLSPGLNTSNGDGSETETTSAILASVKEQELQFERLTRELEAERQIVASQLERCKLGSETGSMSSMSSAEEQFQWQSQDGQKDIEDELTTGLELVDSCIRSLQESGILDPQDYSTGERPSLLSQSALQLNSKPEGSFQYPASYHSNQTLALGETTPSQLPARGTQARATGQSFSQGTTSRAGHLAGPEPAPPPPPPPREPFAPSLGSAFHLPDAPPAAAAAALYYSSSTLPAPPRGGSPLAAPQGGSPTKLQRGGSAPEGATYAAPRGSSPKQSPSRLAKSYSTSSPINIVVSSAGLSPIRVTSPPTVQSTISSSPIHQLSSTIGTYATLSPTKRLVHASEQYSKHSQELYATATLQRPGSLAAGSRASYSSQHGHLGPELRALQSPEHHIDPIYEDRVYQKPPMRSLSQSQGDPLPPAHTGTYRTSTAPSSPGVDSVPLQRTGSQHGPQNAAAATFQRASYAAGPASNYADPYRQLQYCPSVESPYSKSGPALPPEGTLARSPSIDSIQKDPREFGWRDPELPEVIQMLQHQFPSVQSNAAAYLQHLCFGDNKIKAEIRRQGGIQLLVDLLDHRMTEVHRSACGALRNLVYGKANDDNKIALKNCGGIPALVRLLRKTTDLEIRELVTGVLWNLSSCDALKMPIIQDALAVLTNAVIIPHSGWENSPLQDDRKIQLHSSQVLRNATGCLRNVSSAGEEARRRMRECDGLTDALLYVIQSALGSSEIDSKTVENCVCILRNLSYRLAAETSQGQHMGTDELDGLLCGEANGKDAESSGCWGKKKKKKKSQDQWDGVGPLPDCAEPPKGIQMLWHPSIVKPYLTLLSECSNPDTLEGAAGALQNLAAGSWKWSVYIRAAVRKEKGLPILVELLRIDNDRVVCAVATALRNMALDVRNKELIGKYAMRDLVHRLPGGNNSNNTASKAMSDDTVTAVCCTLHEVITKNMENAKALRDAGGIEKLVGISKSKGDKHSPKVVKAASQVLNSMWQYRDLRSLYKKDGWSQYHFVASSSTIERDRQRPYSSSRTPSISPVRVSPNNRSASAPASPREMISLKERKTDYECTGSNATYHGAKGEHTSRKDAMTAQNTGISTLYRNSYGAPAEDIKHNQVSAQPVPQEPSRKDYETYQPFQNSTRNYDESFFEDQVHHRPPASEYTMHLGLKSTGNYVDFYSAARPYSELNYETSHYPASPDSWV,mutated_sequence,1.0,1225.0,UPI000013E9AB.a2m,UPI000013E9AB.npy,gnomAD
+UPI0000070847,UPI0000070847.csv,MEEPQAGDAARFSCPPNFTAKPPASESPRFSLEALTGPDTELWLIQAPADFAPECFNGRHVPLSGSQIVKGKLAGKRHRYRVLSSCPQAGEATLLAPSTEAGGGLTCASAPQGTLRILEGPQQSLSGSPLQPIPASPPPQIPPGLRPRFCAFGGNPPVTGPRSALAPNLLTSGKKKKEMQVTEAPVTQEAVNGHGALEVDMALGSPEMDVRKKKKKKNQQLKEPEAAGPVGTEPTVETLEPLGVLFPSTTKKRKKPKGKETFEPEDKTVKQEQINTEPLEDTVLSPTKKRKRQKGTEGMEPEEGVTVESQPQVKVEPLEEAIPLPPTKKRKKEKGQMAMMEPGTEAMEPVEPEMKPLESPGGTMAPQQPEGAKPQAQAALAAPKKKTKKEKQQDATVEPETEVVGPELPDDLEPQAAPTSTKKKKKKKERGHTVTEPIQPLEPELPGEGQPEARATPGSTKKRKKQSQESRMPETVPQEEMPGPPLNSESGEEAPTGRDKKRKQQQQQPV,mutated_sequence,1.0,510.0,UPI0000070847.a2m,UPI0000070847.npy,gnomAD
+UPI000044CC1A,UPI000044CC1A.csv,MEARSRSAEELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVRELREDSPAARSLSLQEGDQLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDLALRPGTVSGYEIKGPRAKVAKLNIQSLSPVKKKKMVPGALGVPADLAPVDVEFSFPKFSRLRRGLKAEAVKGPVPAAPARRRLQLPRLRVREVAEEAQAARLAAAAPPPRKAKVEAEVAAGARFTAPQVELVGPRLPGAEVGVPQVSAPKAAPSAEAAGGFALHLPTLGLGAPAPPAVEAPAVGIQVPQVELPALPSLPTLPTLPCLETREGAVSVVVPTLDVAAPTVGVDLALPGAEVEARGEAPEVALKMPRLSFPRFGARAKEVAEAKVAKVSPEARVKGPRLRMPTFGLSLLEPRPAAPEVVESKLKLPTIKMPSLGIGVSGPEVKVPKGPEVKLPKAPEVKLPKVPEAALPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVQLLKVSEMKLPKVPEMAVPEVRLPEVQLPKVSEMKLPEVSEVAVPEVRLPEVQLPKVPEMKVPEMKLPKVPEMKLPEMKLPEVQLPKVPEMAVPDVHLPEVQLPKVPEMKLPEMKLPEVKLPKVPEMAVPDVHLPEVQLPKVPEMKLPKMPEMAVPEVRLPEVQLPKVSEMKLPKVPEMAVPDVHLPEVQLPKVCEMKVPDMKLPEIKLPKVPEMAVPDVHLPEVQLPKVSEIRLPEMQVPKVPDVHLPKAPEVKLPRAPEVQLKATKAEQAEGMEFGFKMPKMTMPKLGRAESPSRGKPGEAGAEVSGKLVTLPCLQPEVDGEAHVGVPSLTLPSVELDLPGALGLQGQVPAAKMGKGERVEGPEVAAGVREVGFRVPSVEIVTPQLPAVEIEEGRLEMIETKVKPSSKFSLPKFGLSGPKVAKAEAEGAGRATKLKVSKFAISLPKARVGAEAEAKGAGEAGLLPALDLSIPQLSLDAHLPSGKVEVAGADLKFKGPRFALPKFGVRGRDTEAAELVPGVAELEGKGWGWDGRVKMPKLKMPSFGLARGKEAEVQGDRASPGEKAESTAVQLKIPEVELVTLGAQEEGRAEGAVAVSGMQLSGLKVSTAGQVVTEGHDAGLRMPPLGISLPQVELTGFGEAGTPGQQAQSTVPSAEGTAGYRVQVPQVTLSLPGAQVAGGELLVGEGVFKMPTVTVPQLELDVGLSREAQAGEAATGEGGLRLKLPTLGARARVGGEGAEEQPPGAERTFCLSLPDVELSPSGGNHAEYQVAEGEGEAGHKLKVRLPRFGLVRAKEGAEEGEKAKSPKLRLPRVGFSQSEMVTGEGSPSPEEEEEEEEEGSGEGASGRRGRVRVRLPRVGLAAPSKASRGQEGDAAPKSPVREKSPKFRFPRVSLSPKARSGSGDQEEGGLRVRLPSVGFSETGAPGPARMEGAQAAAV,mutated_sequence,1.0,1461.0,UPI000044CC1A.a2m,UPI000044CC1A.npy,gnomAD
+UPI0000036166,UPI0000036166.csv,MAFNFGAPSGTSGTAAATAAPAGGFGGFGTTSTTAGSAFSFSAPTNTGTTGLFGGTQNKGFGFGTGFGTTTGTSTGLGTGLGTGLGFGGFNTQQQQQTTLGGLFSQPTQAPTQSNQLINTASALSAPTLLGDERDAILAKWNQLQAFWGTGKGYFNNNIPPVEFTQENPFCRFKAVGYSCMPSNKDEDGLVVLVFNKKETEIRSQQQQLVESLHKVLGGNQTLTVNVEGTKTLPDDQTEVVIYVVERSPNGTSRRVPATTLYAHFEQANIKTQLQQLGVTLSMTRTELSPAQIKQLLQNPPAGVDPIIWEQAKVDNPDSEKLIPVPMVGFKELLRRLKVQDQMTKQHQTRLDIISEDISELQKNQTTSVAKIAQYKRKLMDLSHRTLQVLIKQEIQRKSGYAIQADEEQLRVQLDTIQGELNAPTQFKGRLNELMSQIRMQNHFGAVRSEERYYIDADLLREIKQHLKQQQEGLSHLISIIKDDLEDIKLVEHGLNETIHIRGGVFS,mutated_sequence,1.0,507.0,UPI0000036166.a2m,UPI0000036166.npy,gnomAD
+UPI000015FC7D,UPI000015FC7D.csv,MRRSRSSAAAKLRGQKRSGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPPSEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWETFQKRQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNRHFIVPASRFKLLKGAEHITTYTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSMVTEEFNGSDWEKAMKEHKTIKNMSKE,mutated_sequence,1.0,272.0,UPI000015FC7D.a2m,UPI000015FC7D.npy,gnomAD
+UPI0000061EA8,UPI0000061EA8.csv,MDGTNGSTQTHFILLGFSDRPHLERILFVVILIAYLLTLVGNTTIILVSRLDPHLHTPMYFFLAHLSFLDLSFTTSSIPQLLYNLNGCDKTISYMGCAIQLFLFLGLGGVECLLLAVMAYDRCVAICKPLHYMVIMNPRLCRGLVSVTWGCGVANSLAMSPVTLRLPRCGHHEVDHFLREMPALIRMACVSTVAIEGTVFVLAVGVVLSPLVFILLSYSYIVRAVLQIRSASGRQKAFGTCGSHLTVVSLFYGNIIYMYMQPGASSSQDQGMFLMLFYNIVTPLLNPLIYTLRNREVKGALGRLLLGKRELGKE,mutated_sequence,1.0,314.0,UPI0000061EA8.a2m,UPI0000061EA8.npy,gnomAD
+UPI000014083E,UPI000014083E.csv,MFRQFYLWTCLASGIILGSLFEICLGQYDDDCKLARGGPPATIVAIDEESRNGTILVDNMLIKGTAGGPDPTIELSLKDNVDYWVLMDPVKQMLFLNSTGRVLDRDPPMNIHSIVVQVQCINKKVGTIIYHEVRIVVRDRNDNSPTFKHESYYATVNELTPVGTTIFTGFSGDNGATDIDDGPNGQIEYVIQYNPDDPTSNDTFEIPLMLTGNIVLRKRLNYEDKTRYFVIIQANDRAQNLNERRTTTTTLTVDVLDGDDLGPMFLPCVLVPNTRDCRPLTYQAAIPELRTPEELNPIIVTPPIQAIDQDRNIQPPSDRPGILYSILVGTPEDYPRFFHMHPRTAELSLLEPVNRDFHQKFDLVIKAEQDNGHPLPAFAGLHIEILDENNQSPYFTMPSYQGYILESAPVGATISDSLNLTSPLRIVALDKDIEDTKDPELHLFLNDYTSVFTVTQTGITRYLTLLQPVDREEQQTYTFSITAFDGVQESEPVIVNIQVMDANDNTPTFPEISYDVYVYTDMRPGDSVIQLTAVDADEGSNGEITYEILVGAQGDFIINKTTGLITIAPGVEMIVGRTYALTVQAADNAPPAERRNSICTVYIEVLPPNNQSPPRFPQLMYSLEISEAMRVGAVLLNLQATDREGDSITYAIENGDPQRVFNLSETTGILTLGKALDRESTDRYILIITASDGRPDGTSTATVNIVVTDVNDNAPVFDPYLPRNLSVVEEEANAFVGQVKATDPDAGINGQVHYSLGNFNNLFRITSNGSIYTAVKLNREVRDYYELVVVATDGAVHPRHSTLTLAIKVLDIDDNSPVFTNSTYTVLVEENLPAGTTILQIEAKDVDLGANVSYRIRSPEVKHFFALHPFTGELSLLRSLDYEAFPDQEASITFLVEAFDIYGTMPPGIATVTVIVKDMNDYPPVFSKRIYKGMVAPDAVKGTPITTVYAEDADPPGLPASRVRYRVDDVQFPYPASIFEVEEDSGRVITRVNLNEEPTTIFKLVVVAFDDGEPVMSSSATVKILVLHPGEIPRFTQEEYRPPPVSELATKGTMVGVISAAAINQSIVYSIVSGNEEDTFGINNITGVIYVNGPLDYETRTSYVLRVQADSLEVVLANLRVPSKSNTAKVYIEIQDENNHPPVFQKKFYIGGVSEDARMFTSVLRVKATDKDTGNYSVMAYRLIIPPIKEGKEGFVVETYTGLIKTAMLFHNMRRSYFKFQVIATDDYGKGLSGKADVLVSVVNQLDMQVIVSNVPPTLVEKKIEDLTEILDRYVQEQIPGAKVVVESIGARRHGDAFSLEDYTKCDLTVYAIDPQTNRAIDRNELFKFLDGKLLDINKDFQPYYGEGGRILEIRTPEAVTSIKKRGESLGYTEGALLALAFIIILCCIPAILVVLVSYRQFKVRQAECTKTARIQAALPAAKPAVPAPAPVAAPPPPPPPPPGAHLYEELGDSSILFLLYHFQQSRGNNSVSEDRKHQQVVMPFSSNTIEAHKSAHVDGSLKSNKLKSARKFTFLSDEDDLSAHNPLYKENISQVSTNSDISQRTDFVDPFSPKIQAKSKSLRGPREKIQRLWSQSVSLPRRLMRKVPNRPEIIDLQQWQGTRQKAENENTGICTNKRGSSNPLLTTEEANLTEKEEIRQGETLMIEGTEQLKSLSSDSSFCFPRPHFSFSTLPTVSRTVELKSEPNVISSPAECSLELSPSRPCVLHSSLSRRETPICMLPIETERNIFENFAHPPNISPSACPLPPPPPISPPSPPPAPAPLAPPPDISPFSLFCPPPSPPSIPLPLPPPTFFPLSVSTSGPPTPPLLPPFPTPLPPPPPSIPCPPPPSASFLSTECVCITGVKCTTNLMPAEKIKSSMTQLSTTTVCKTDPQREPKGILRHVKNLAELEKSVANMYSQIEKNYLRTNVSELQTMCPSEVTNMEITSEQNKGSLNNIVEGTEKQSHSQSTSL,mutated_sequence,1.0,1955.0,UPI000014083E.a2m,UPI000014083E.npy,gnomAD
+UPI000012CF7F,UPI000012CF7F.csv,MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDGLGSSCPASHCRDLLPHPVLGRPPAPLGAPQGAVYTDIPAPEAARQCAPPPAPPTSSSATLGYGYPFGGSYYGCRLSHNVNLQQKPCAYHPGDKYPEPSGALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVSVVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFPDVVPLQPEVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPHLHST,mutated_sequence,1.0,330.0,UPI000012CF7F.a2m,UPI000012CF7F.npy,gnomAD
+UPI000006EEBC,UPI000006EEBC.csv,MGRPLLLPLLLLLQPPAFLQPGGSTGSGPSYLYGVTQPKHLSASMGGSVEIPFSFYYPWELAIVPNVRISWRRGHFHGQSFYSTRPPSIHKDYVNRLFLNWTEGQESGFLRISNLRKEDQSVYFCRVELDTRRSGRQQLQSIKGTKLTITQAVTTTTTWRPSSTTTIAGLRVTESKGHSESWHLSLDTAIRVALAVAVLKTVILGLLCLLLLWWRRRKGSRAPSSDF,mutated_sequence,1.0,227.0,UPI000006EEBC.a2m,UPI000006EEBC.npy,gnomAD
+UPI000003117A,UPI000003117A.csv,MQQDGLGVGTRNGSGKGRSVHPSWPWCAPRPLRYFGRDARARRAQTAAMALLAGGLSRGLGSHPAAAGRDAVVFVWLLLSTWCTAPARAIQVTVSNPYHVVILFQPVTLPCTYQMTSTPTQPIVIWKYKSFCRDRIADAFSPASVDNQLNAQLAAGNPGYNPYVECQDSVRTVRVVATKQGNAVTLGDYYQGRRITITGNADLTFDQTAWGDSGVYYCSVVSAQDLQGNNEAYAELIVLGRTSGVAELLPGFQAGPIEDWLFVVVVCLAAFLIFLLLGICWCQCCPHTCCCYVRCPCCPDKCCCPEALYAAGKAATSGVPSIYAPSTYAHLSPAKTPPPPAMIPMGPAYNGYPGGYPGDVDRSSSAGGQGSYVPLLRDTDSSVASEVRSGYRIQASQQDDSMRVLYYMEKELANFDPSRPGPPSGRVERAMSEVTSLHEDDWRSRPSRGPALTPIRDEEWGGHSPRSPRGWDQEPAREQAGGGWRARRPRARSVDALDDLTPPSTAESGSRSPTSNGGRSRAYMPPRSRSRDDLYDQDDSRDFPRSRDPHYDDFRSRERPPADPRSHHHRTRDPRDNGSRSGDLPYDGRLLEEAVRKKGSEERRRPHKEEEEEAYYPPAPPPYSETDSQASRERRLKKNLALSRESLVV,mutated_sequence,1.0,649.0,UPI000003117A.a2m,UPI000003117A.npy,gnomAD
+UPI0000D61701,UPI0000D61701.csv,MFRSPGTLEILTPLRDTAFPAFDFQKCLVNIQALLMDPELHVGKCKIEFLTGEFIYRMYTIDMHSQLELTASLIPQPGTSLIPLVMVSNPHSLGFQATFYENGYTSDGNTKYKLDIFLKQQQHWGRTDSNFTSSLKKATMSTLTVDIANKEISCVDIKPLSTLISVGCDLDKKIVIQNKVSACSMGILDPLTLQDNYSFIIEKEFYDPGFQGQQSSEDLHVFYSYQQLGCPLLVYYDTLWKPVVELLESSGMNTVHCSLNLPGSSGPPASASQVAGTAVAYHNARLIFFFFFFFFFLRQSLALSPRLECSGVILAHRKLCLPGSRHSPASASRVAGITGARHCARLIFFIFSRDRVSPC,mutated_sequence,1.0,359.0,UPI0000D61701.a2m,UPI0000D61701.npy,gnomAD
+UPI000003777D,UPI000003777D.csv,MELLQVTILFLLPSICSSNSTGVLEAANNSLVVTTTKPSITTPNTESLQKNVVTPTTGTTPKGTITNELLKMSLMSTATFLTSKDEGLKATTTDVRKNDSIISNVTVTSVTLPNAVSTLQSSKPKTETQSSIKTTEIPGSVLQPDASPSKTGTLTSIPVTIPENTSQSQVIGTEGGKNASTSATSRSYSSIILPVVIALIVITLSVFVLVGLYRMCWKADPGTPENGNDQPQSDKESVKLLTVKTISHESGEHSAQGKTKN,mutated_sequence,1.0,261.0,UPI000003777D.a2m,UPI000003777D.npy,gnomAD
+UPI000000165C,UPI000000165C.csv,MSPSAKKRPKNSRVSKMQDEKLRDETEQPVSKVIERNRLRTVLKNLSLLKLLKSSNRRIQELHKLAKRCWHSLLSVPKILRISSGENSACNKTKQNNEEFQEIGCSEKELKSKKLESTGDPKKKEYKEWKSQVQSGMRNKEKTSLAAMPRKEKHIEPEVPRTSRDDSLNPGVQGRQPLTEGPRVIFIKPYRNRTPMGHMKQLDVADQWIWFEGLPTRIHLPAPRVMCRSSTLRWVKRRCTRFCSASLEMPMWHPYKVDVTWTRARGASRGWRSRHQLKGRNGWRNSRVYK,mutated_sequence,1.0,290.0,UPI000000165C.a2m,UPI000000165C.npy,gnomAD
+UPI000013D46D,UPI000013D46D.csv,MAEPSGSPVHVQLPQQAAPVTAAAAAAPAAATAAPAPAAPAAPAPAPAPAAQAVGWPICRDAYELQEVIGSGATAVVQAALCKPRQERVAIKRINLEKCQTSMDELLKEIQAMSQCSHPNVVTYYTSFVVKDELWLVMKLLSGGSMLDIIKYIVNRGEHKNGVLEEAIIATILKEVLEGLDYLHRNGQIHRDLKAGNILLGEDGSVQIADFGVSAFLATGGDVTRNKVRKTFVGTPCWMAPEVMEQVRGYDFKADMWSFGITAIELATGAAPYHKYPPMKVLMLTLQNDPPTLETGVEDKEMMKKYGKSFRKLLSLCLQKDPSKRPTAAELLKCKFFQKAKNREYLIEKLLTRTPDIAQRAKKVRRVPGSSGHLHKTEDGDWEWSDDEMDEKSEEGKAAFSQEKSRRVKEENPEIAVSASTIPEQIQSLSVHDSQGPPNANEDYREASSCAVNLVLRLRNSRKELNDIRFEFTPGRDTADGVSQELFSAGLVDGHDVVIVAANLQKIVDDPKALKTLTFKLASGCDGSEIPDEVKLIGFAQLSVS,mutated_sequence,1.0,545.0,UPI000013D46D.a2m,UPI000013D46D.npy,gnomAD
+UPI0000132189,UPI0000132189.csv,MEEAEELLLEGKKALQLAREPRLGLDLGWNPSGEGCTQGLKDVPPEPTRDILALKSLPRGLALGPSLAKEQRLGVWCVGDPLQPGLLWGPLEEESASKEKGEGVKPRQEENLSLGPWGDVCACEQSSGWTSLVQRGRLESEGNVAPVRISERLHLQVYQLVLPGSELLLWPQPSSEGPSLTQPGLDKEAAVAVVTEVESAVQQEVASPGEDAAEPCIDPGSQSPSGIQAENMVSPGLKFPTQDRISKDSQPLGPLLQDGDVDEECPAQAQMPPELQSNSATQQDPDGSGASFSSSARGTQPHGYLAKKLHSPSDQCPPRAKTPEPGAQQSGFPTLSRSPPGPAGSSPKQGRRYRCGECGKAFLQLCHLKKHAFVHTGHKPFLCTECGKSYSSEESFKAHMLGHRGVRPFPCPQCDKAYGTQRDLKEHQVVHSGARPFACDQCGKAFARRPSLRLHRKTHQVPAAPAPCPCPVCGRPLANQGSLRNHMRLHTGEKPFLCPHCGRAFRQRGNLRGHLRLHTGERPYRCPHCADAFPQLPELRRHLISHTGEAHLCPVCGKALRDPHTLRAHERLHSGERPFPCPQCGRAYTLATKLRRHLKSHLEDKPYRCPTCGMGYTLPQSLRRHQLSHRPEAPCSPPSVPSAASEPTVVLLQAEPQLLDTHREEEVSPARDVVEVTISESQEKCFVVPEEPDAAPSLVLIHKDMGLGAWAEVVEVEMGT,mutated_sequence,1.0,720.0,UPI0000132189.a2m,UPI0000132189.npy,gnomAD
+UPI000012ADCB,UPI000012ADCB.csv,MTLSGGGSASDMSGQTVLTAEDVDIDVVGEGDDGLEEKDSDAGCDSPAGPPELRLDEADEVPPAAPHHGQPQPPHQQPLTLPKEAAGAGAGPGGDVGAPEADGCKGGVGGEEGGASGGGPGAGSGSAGGLAPSKPKNSLVKPPYSYIALITMAILQSPQKKLTLSGICEFISNRFPYYREKFPAWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPQSEDMFDNGSFLRRRKRFKRHQQEHLREQTALMMQSFGAYSLAAAAGAAGPYGRPYGLHPAAAAGAYSHPAAAAAAAAAAALQYPYALPPVAPVLPPAVPLLPSGELGRKAAAFGSQLGPGLQLQLNSLGAAAAAAGTAGAAGTTASLIKSEPSARPSFSIENIIGGGPAAPGGSAVGAGVAGGTGGSGGGSTAQSFLRPPGTVQSAALMATHQPLSLSRTTATIAPILSVPLSGQFLQPAASAAAAAAAAAQAKWPAQ,mutated_sequence,1.0,478.0,UPI000012ADCB.a2m,UPI000012ADCB.npy,gnomAD
+UPI000013E315,UPI000013E315.csv,MSEIPSTIVSKNMTNDKNSLESMNISSSSSTEENPKKQARKNEEHGPDPSANPFHLSGDVDFFLLRDQERNKALSERQQQKTMRVHQKMTYSSKVSAKHTSLRRQLQLEDKQEDLEARAEAEHQRAFRDYTTWKLTLTKEKNVEPENMSGYIKQKRQMFLLQYALDVKRREIQRLETLATKEEARLERAEKSLEKDAALFDEFVRENDCSSVQAMRAAEKETKAKIEKILEIRDLTTQIVNIKSEISRFEDTLKHYKVYKDFLYKLSPKEWLEEQEKKHSFLKKAKEVSEASKESSVNSTPGDKGPGIKGKASSMWAKEGQGTKKPWRFLQTMRLGRSPSYLSSPQQGSQPSESSGGDSRGSNSPIPPTQEDTDSDGEEPQLYFTEPQQLLDVFRELEEQNLSLIQNSQETEKTLEELSHTLKHTQIRMDREVNQLKQWVTTMMMSITKEEDTAAELELKARVFHFGEYKGDQQDKLLESLNCKVLDVYRHCTGTQQEANLGTVQMLTIIEHQLDELLENLEHVPQVKIEQAERAKEKERRIRLREEKLQMQKILQEEHLQRARARAQAEIKKKRGRTLVCRSRPPAHRIKQQSEHTLMDKEEEELLFFFT,mutated_sequence,1.0,611.0,UPI000013E315.a2m,UPI000013E315.npy,gnomAD
+UPI00001402C1,UPI00001402C1.csv,MAASRLDFGEVETFLDRHPELFEDYLMRKGKQEMVEKWLQRHSQGQGALGPRPSLAGTSSLAHSTCRGGSSVGGGTGPNGSAHSQPLPGGGDCGGVPLSPSWAGGSRGDGNLQRRASQKELRKSFARSKAIHVNRTYDEQVTSRAQEPLSSVRRRALLRKASSLPPTTAHILSALLESRVNLPRYPPTAIDYKCHLKKHNERQFFLELVKDISNDLDLTSLSYKILIFVCLMVDADRCSLFLVEGAAAGKKTLVSKFFDVHAGTPLLPCSSTENSNEVQVPWGKGIIGYVGEHGETVNIPDAYQDRRFNDEIDKLTGYKTKSLLCMPIRSSDGEIIGVAQAINKIPEGAPFTEDDEKVMQMYLPFCGIAISNAQLFAASRKEYERSRALLEVVNDLFEEQTDLEKIVKKIMHRAQTLLKCERCSVLLLEDIESPVVKFTKSFELMSPKCSADAENSFKESMEKSSYSDWLINNSIAELVASTGLPVNISDAYQDPRFDAEADQISGFHIRSVLCVPIWNSNHQIIGVAQVLNRLDGKPFDDADQRLFEAFVIFCGLGINNTIMYDQVKKSWAKQSVALDVLSYHATCSKAEVDKFKAANIPLVSELAIDDIHFDDFSLDVDAMITAALRMFMELGMVQKFKIDYETLCRWLLTVRKNYRMVLYHNWRHAFNVCQLMFAMLTTAGFQDILTEVEILAVIVGCLCHDLDHRGTNNAFQAKSGSALAQLYGTSATLEHHHFNHAVMILQSEGHNIFANLSSKEYSDLMQLLKQSILATDLTLYFERRTEFFELVSKGEYDWNIKNHRDIFRSMLMTACDLGAVTKPWEISRQVAELVTSEFFEQGDRERLELKLTPSAIFDRNRKDELPRLQLEWIDSICMPLYQALVKVNVKLKPMLDSVATNRSKWEELHQKRLLASTASSSPASVMVAKEDRN,mutated_sequence,1.0,933.0,UPI00001402C1.a2m,UPI00001402C1.npy,gnomAD
+UPI0000237200,UPI0000237200.csv,MSHSHPAGLLAAYNSLMDKHLAGYFNNTRIRRHLLRSGLITRSGRILSEKEYKLNMMKRDHQKYIRECLAQAIFHKVLDMERYHQLEIKKKLETLARKERIQRFKGEHTRRSVENNMPILSPHPPVGPKSNRGHSVLVDEGHSSPLALTAPRPYTAPGNMQPPIRLQPLPSNPAVETVPKVTSRSRSKTSLLENEALFPIGGKKAVMKFRNSIGNSQRMNSYQLPNINSYMMPIPPPLPPTGKITRENRSETWRRRRFRPTTAPNGLEPLLTKDSRRIHKTSLHSNAAITMIYLGKNVHLSSDNPDFRDEIKVYQQHCGGENLCVYKGKLLEKETFQFISKRHHGFPFSLTFFLNGMQVNRLSSCCEYKHRKGSRLGGKRGYFGFVCVERSSPCYKCIIAMGLDKKPSLPKSRKEKSTEKGEELKKAEGKVRKEREYVIPKRNEIKENKTSVSAKFSAQEIKTGLKEVVTAVEEMTSKGKPGQEVLEDDQENTLKYEYEEDFEVDEEKQGEKSNEEGQADVQMNGIPQSPLDDKKDNLDPEKESETSSQKAPDARDNVKDENDGCSESELEEDKQDMKTASSTSSRSHPYSSDSEDESAVGDREAHTDSSTDESARRSSSQELSENDKPRKSHLPIEESLEIEIEDQEITKADVETKPMPIDESFENVLKEGTEKGTQEIAEGLSEKSGKHVSAEEKEKDKSKLWEESTAQVKDKKAGLPGLEEGGKDSLPLAYVLALGAPTMNFMVDETAAINSNKESQQLVQKTYTLEKKEAMEEDEAPQHRDADIVQGKGEAALWGEAGAVHEAPLRAWKPTAEQPELAEEFTEKREIPPGIERGAEGAAEAEGVRRLGEGGSDPIGQAAAKDAVGLSKDEAPEKQALMLTVLETDKAASEGEQGLEKAVLANEAAALNLEHLHEVAALREAATSEEGEAEGGVAVSDVGESEEEASIDLEDTGPMEDTASKREDGSEEAILGGEEPAKERKEVMRTETRLSPFTGEAEASRMQVSEGSPEEGSLAKEAFLCKEDVEGEEMVTEAEANREDDRKEILPKELDLARERRKAERPKTSLRKTDSEREEVTRANALKDEDAFKEEQKLKAEEGETETEVRAEEETKAPPNEMGSDAENEAPVEASELSDNPGLLGEDSLKETVVPIFEATPGFEKSLENITALRKEGGGERLSEARDTEHKDREELSSRENRALKEGHRQDGEGALAAPEAEPAGKVQAPEGLIPATGQAEELAAKDHDSCAGLEGRAEGQGGVDVVLRTQEAVAEEDPIMAEKFREEAVDEDPEEEEDKECTLETEAMQDRNSEGDGDMEGEGNTQKNEGMGGGRVVAVEVLHGGGETAETAAEEREVLAGSETAEEKTIANKASSFSDVAEEETWHQQDELVGKTAAAGKVVVEELARSGEEVPAAEEMTVTYTTEAGVGTPGALERKTSGLGQEQEEGSEGQEAATGSGDGRQETGAAEKFRLGLSREGERELSPESLQAMATLPVKPDFTETREKQQHMVQGESETADVSPNNVQV,mutated_sequence,1.0,1530.0,UPI0000237200.a2m,UPI0000237200.npy,gnomAD
+UPI000013E1E1,UPI000013E1E1.csv,MKQRFSALQLLKLLLLLQPPLPRALREALCPEPCNCVPDGALRCPGPTAGLTRLSLAYLPVKVIPSQAFRGLNEVIKIEISQIDSLERIEANAFDNLLNLSEILIQNTKNLRYIEPGAFINLPRLKYLSICNTGIRKFPDVTKVFSSESNFILEICDNLHITTIPGNAFQGMNNESVTLKLYGNGFEEVQSHAFNGTTLTSLELKENVHLEKMHNGAFRGATGPKTLDISSTKLQALPSYGLESIQRLIATSSYSLKKLPSRETFVNLLEATLTYPSHCCAFRNLPTKEQNFSHSISENFSKQCESTVRKVNNKTLYSSMLAESELSGWDYEYGFCLPKTPRCAPEPDAFNPCEDIMGYDFLRVLIWLINILAIMGNMTVLFVLLTSRYKLTVPRFLMCNLSFADFCMGLYLLLIASVDSQTKGQYYNHAIDWQTGSGCSTAGFFTVFASELSVYTLTVITLERWHTITYAIHLDQKLRLRHAILIMLGGWLFSSLIAMLPLVGVSNYMKVSICFPMDVETTLSQVYILTILILNVVAFFIICACYIKIYFAVRNPELMATNKDTKIAKKMAILIFTDFTCMAPISFFAISAAFKVPLITVTNSKVLLVLFYPINSCANPFLYAIFTKTFQRDFFLLLSKFGCCKRRAELYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQGTALLDKTRYTEC,mutated_sequence,1.0,699.0,UPI000013E1E1.a2m,UPI000013E1E1.npy,gnomAD
+UPI0000186945,UPI0000186945.csv,MLEGLGSPASPRAAASASVAGSSGPAACSPPSSSAPRSPESPAPRRGGVRASVPQKLAEMLSSQYGLIVFVAGLLLLLAWAVHAAGVSKSDLLCFLTALMLLQLLWMLWYVGRSSAHRRLFRLKDTHAGAGWLRGSITLFAVITVILGCLKIGYFIGFSECLSATEGVFPVTHSVHTLLQVYFLWGHAKDIIQSFKTLERFGVIHSVFTNLLLWANGVLNESKHQLNEHKERLITLGFGNITTVLDDHTPQCNCTPPTLCTAISHGIYYLYPFNIEYQILASTMLYVLWKNIGRKVDSHQHQKMQFKSDGVMVGAVLGLTVLAATIAVVVVYLIHIGRSKTKSESALIMFYLYAITLLMLMGAAGLAGIRIYRIDEKSLDESKNPARKLDSDLLVGTASGSWLISWGSILAILCAEGHPRYTWYNLPYSILAIVEKYIQNLFIFESIHREPEKLSEDIQTLRVVTVCNGNTMPLASSCPKSGGVARDVAPQGKDMPPAANGNVCMRESHDKEEEKQEESSWGGSPSPVRLPRFLQGNAKRKVLRNIAAFLFLCNISLWIPPAFGCRPEYDNGLEEIVFGFEPWIIVVNLAMPFSIFYRMHAAASLFEVYCKI,mutated_sequence,1.0,612.0,UPI0000186945.a2m,UPI0000186945.npy,gnomAD
+UPI000013D6EF,UPI000013D6EF.csv,MAYSWQTDPNPNESHEKQYEHQEFLFVNQPHSSSQVSLGFDQIVDEISGKIPHYESEIDENTFFVPTAPKWDSTGHSLNEAHQISLNEFTSKSRELSWHQVSKAPAIGFSPSVLPKPQNTNKECSWGSPIGKHHGADDSRFSILAPSFTSLDKINLEKELENENHNYHIGFESSIPPTNSSFSSDFMPKEENKRSGHVNIVEPSLMLLKGSLQPGMWESTWQKNIESIGCSIQLVEVPQSSNTSLASFCNKVKKIRERYHAADVNFNSGKIWSTTTAFPYQLFSKTKFNIHIFIDNSTQPLHFMPCANYLVKDLIAEILHFCTNDQLLPKDHILSVCGSEEFLQNDHCLGSHKMFQKDKSVIQLHLQKSREAPGKLSRKHEEDHSQFYLNQLLEFMHIWKVSRQCLLTLIRKYDFHLKYLLKTQENVYNIIEEVKKICSVLGCVETKQITDAVNELSLILQRKGENFYQSSETSAKGLIEKVTTELSTSIYQLINVYCNSFYADFQPVNVPRCTSYLNPGLPSHLSFTVYAAHNIPETWVHRINFPLEIKSLPRESMLTVKLFGIACATNNANLLAWTCLPLFPKEKSILGSMLFSMTLQSEPPVEMITPGVWDVSQPSPVTLQIDFPATGWEYMKPDSEENRSNLEEPLKECIKHIARLSQKQTPLLLSEEKKRYLWFYRFYCNNENCSLPLVLGSAPGWDERTVSEMHTILRRWTFSQPLEALGLLTSSFPDQEIRKVAVQQLDNLLNDELLEYLPQLVQAVKFEWNLESPLVQLLLHRSLQSIQVAHRLYWLLKNAENEAYFKSWYQKLLAALQFCAGKALNDEFSKEQKLIKILGDIGERVKSASDHQRQEVLKKEIGRLEEFFQDVNTCHLPLNPALCIKGIDHDACSYFTSNALPLKITFINANPMGKNISIIFKAGDDLRQDMLVLQLIQVMDNIWLQEGLDMQMIIYRCLSTGKDQGLVQMVPDAVTLAKIHRHSGLIGPLKENTIKKWFSQHNHLKADYEKALRNFFYSCAGWCVVTFILGVCDRHNDNIMLTKSGHMFHIDFGKFLGHAQTFGGIKRDRAPFIFTSEMEYFITEGGKNPQHFQDFVELCCRAYNIIRKHSQLLLNLLEMMLYAGLPELSGIQDLKYVYNNLRPQDTDLEATSHFTKKIKESLECFPVKLNNLIHTLAQMSAISPAKSTSQTFPQESCLLSTTRSIERATILGFSKKSSNLYLIQVTHSNNETSLTEKSFEQFSKLHSQLQKQFASLTLPEFPHWWHLPFTNSDHRRFRDLNHYMEQILNVSHEVTNSDCVLSFFLSEAVQQTVEESSPVYLGEKFPDKKPKVQLVISYEDVKLTILVKHMKNIHLPDGSAPSAHVEFYLLPYPSEVRRRKTKSVPKCTDPTYNEIVVYDEVTELQGHVLMLIVKSKTVFVGAINIRLCSVPLDKEKWYPLGNSII,mutated_sequence,1.0,1445.0,UPI000013D6EF.a2m,UPI000013D6EF.npy,gnomAD
+UPI00015DF949,UPI00015DF949.csv,WLTPVIPALGEAEVGGLLESRSSRRSCAMIAPLHSSLGDRARPRLKKKKKKKKRKKRSLILRPCLTLSPRLTCNGTILAHCNLRLPGSSDSPASASQVAGITGTHHHTRLIFVFLVESPCWSAIVSSA,mutated_sequence,1.0,128.0,UPI00015DF949.a2m,UPI00015DF949.npy,gnomAD
+UPI0000072854,UPI0000072854.csv,MSESEGGKDTTPEPSPANGAGPGPEWGLCPGPPAVEGESSGASGLGTPKRRNQHSKHKTVAVASAQRSPRALFCLTLANPLRRSCISIVEWKPFDILILLTIFANCVALGVYIPFPEDDSNTANHNLEQVEYVFLVIFTVETVLKIVAYGLVLHPSAYIRNGWNLLDFIIVVVGLFSVLLEQGPGRPGDAPHTGGKPGGFDVKALRAFRVLRPLRLVSGVPSLHIVLNSIMKALVPLLHIALLVLFVIIIYAIIGLELFLGRMHKTCYFLGSDMEAEEDPSPCASSGSGRACTLNQTECRGRWPGPNGGITNFDNFFFAMLTVFQCVTMEGWTDVLYWMQDAMGYELPWVYFVSLVIFGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKQREKQQMEEDLRGYLDWITQAEELDMEDPSADDNLGSMAEEGRAGHRPQLAELTNRRRGRLRWFSHSTRSTHSTSSHASLPASDTGSMTETQGDEDEEEGALASCTRCLNKIMKTRVCRRLRRANRVLRARCRRAVKSNACYWAVLLLVFLNTLTIASEHHGQPVWLTQIQEYANKVLLCLFTVEMLLKLYGLGPSAYVSSFFNRFDCFVVCGGILETTLVEVGAMQPLGISVLRCVRLLRIFKVTRHWASLSNLVASLLNSMKSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDQTHTKRSTFDTFPQALLTVFQILTGEDWNVVMYDGIMAYGGPFFPGMLVCIYFIILFICGNYILLNVFLAIAVDNLASGDAGTAKDKGGEKSNEKDLPQENEGLVPGVEKEEEEGARREGADMEEEEEEEEEEEEEEEEEGAGGVELLQEVVPKEKVVPIPEGSAFFCLSQTNPLRKGCHTLIHHHVFTNLILVFIILSSVSLAAEDPIRAHSFRNHILGYFDYAFTSIFTVEILLKMTVFGAFLHRGSFCRSWFNMLDLLVVSVSLISFGIHSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYTCTDEAKHTPQECKGSFLVYPDGDVSRPLVRERLWVNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDAYAEDHGPIYNYRVEISVFFIVYIIIIAFFMMNIFVGFVIITFRAQGEQEYQNCELDKNQRQCVEYALKAQPLRRYIPKNPHQYRVWATVNSAAFEYLMFLLILLNTVALAMQHYEQTAPFNYAMDILNMVFTGLFTIEMVLKIIAFKPKHYFTDAWNTFDALIVVGSIVDIAVTEVNNGGHLGESSEDSSRISITFFRLFRVMRLVKLLSKGEGIRTLLWTFIKSFQALPYVALLIAMIFFIYAVIGMQMFGKVALQDGTQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLASLPGNRCDPESDFGPGEEFTCGSNFAIAYFISFFMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWSEYDPGAKGRIKHLDVVALLRRIQPPLGFGKLCPHRVACKRLVAMNMPLNSDGTVTFNATLFALVRTSLKIKTEGNLEQANQELRIVIKKIWKRMKQKLLDEVIPPPDEEEVTVGKFYATFLIQDYFRKFRRRKEKGLLGNDAAPSTSSALQAGLRSLQDLGPEMRQALTCDTEEEEEEGQEGVEEEDEKDLETNKATMVSQPSARRGSGISVSLPVGDRLPDSLSFGPSDDDRGTPTSSQPSVPQAGSNTHRRGSGALIFTIPEEGNSQPKGTKGQNKQDEDEEVPDRLSYLDEQAGTPPCSVLLPPHRAQRYMDGHLVPRRRLLPPTPAGRKPSFTIQCLQRQGSCEDLPIPGTYHRGRNSGPNRAQGSWATPPQRGRLLYAPLLLVEEGAAGEGYLGRSSGPLRTFTCLHVPGTHSDPSHGKRGSADSLVEAVLISEGLGLFARDPRFVALAKQEIADACRLTLDEMDNAASDLLAQGTSSLYSDEESILSRFDEEDLGDEMACVHAL,mutated_sequence,1.0,1977.0,UPI0000072854.a2m,UPI0000072854.npy,gnomAD
+UPI00003E5903,UPI00003E5903.csv,MFSAGAESLLHQAREIQDEELKKFCSRICKLLQAEDLGPDTLDSLQRLFLIISATKYSRRLEKTCVDLLQATLGLPACPEQLQVLCAAILREMSPSDSLSLAWDHTQNSRQLSLVASVLLAQGDRNEEVRAVGQGVLRALESRQPEGPSLRHLLPVMAKVVVLSPGTLQEDQATLLSKRLVDWLRYASLQQGLPHSGGFFSTPRARQPGPVTEVDGAVATDFFTVLSSGHRFTDDQWLNVQAFSMLRAWLLHSGPEGPGTLDTDDRSEQEGSTLSVISATSSAGRLLPPRERLREVAFEYCQRLIEQSNRRALRKGDSDLQKACLVEAVLVLDVLCRQDPSFLYRSLSCLKALHGRVRGDPASVRVLLPLAHFFLSHGEAAAVDSEAVYQHLFTRIPVEQFHSPMLAFEFIQFCRDNLHLFSGHLSTLRLSFPNLFKFLAWNSPPLTSEFVALLPALVDAGTALEMLHALLDLPCLTAVLDLQLRSAPAASERPLWDTSLRAPSCLEAFRDPQFQGLFQYLLRPKASGATERLAPLHQLLQPMAGCARVAQCAQAVPTLLQAFFSAVTQVADGSLINQLALLLLGRSDSLYPAPGYAAGVHSVLSSQFLALCTLKPSLVVELARDLLEFLGSVNGLCSRASLVTSVVWAIGEYLSVTYDRRCTVEQINKFFEALEALLFEVTQCRPSAALPRCPPQVVTVLMTTLTKLASRSQDLIPRASLLLSKMRTLAHSPATSSTHSEEGAEAIRTRATELLTLLKMPSVAQFVLTPSTEVCSPRYHRDANTALPLALRTVSRLVEREAGLMPG,mutated_sequence,1.0,807.0,UPI00003E5903.a2m,UPI00003E5903.npy,gnomAD
+UPI000015D1F5,UPI000015D1F5.csv,MTPSEGARAGTGRELEMLDSLLALGGLVLLRDSVEWEGRSLLKALVKKSALCGEQVHILGCEVSEEEFREGFDSDINNRLVYHDFFRDPLNWSKTEEAFPGGPLGALRAMCKRTDPVPVTIALDSLSWLLLRLPCTTLCQVLHAVSHQDSCPGETPPSLFPLIHLPLPRSVPLFLSTLE,mutated_sequence,1.0,179.0,UPI000015D1F5.a2m,UPI000015D1F5.npy,gnomAD
+UPI000013EA48,UPI000013EA48.csv,DAEQPRGPSGAERGGLELGDAGAAGQLVLTNPWNIMIKHRQVQRRGRRSQMTTSFTDPAISMDLLRAVLQPSINEEIQTVFNKYMKFFQKAALNVRDNVGEEVDAEQLIQEACRSCLEQVRGGSSYHGPIWYHGDEHPAASLPLWCGQEA,mutated_sequence,1.0,150.0,UPI000013EA48.a2m,UPI000013EA48.npy,gnomAD
+UPI0000137929,UPI0000137929.csv,MAAPEKMTFPEKPSHKKYRAALKKEKRKKRRQELARLRDSGLSQKEEEEDTFIEEQQLEEEKLLERERQRLHEEWLLREQKAQEEFRIKKEKEEAAKKRQEEQERKLKEQWEEQQRKEREEEEQKRQEKKEKEEALQKMLDQAENELENGTTWQNPEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSPTLLIKSMFTTFGMEQCRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKNVGKVIQFKVSCNLEPHLRGNVYVQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEIQQCPRGKHCNFLHVFRNPNNEFWEANRDIYLSPDRTGSSFGKNSERRERMGHHDDYYSRLRGRRNPSPDHSYKRNGESERKSSRHRGKKSHKRTSKSRERHNSRSRGRNRDRSRDRSRGRGSRSRSRSRSRRSRRSRSQSSSRSRSRGRRRSGNRDRTVQSPKSK,mutated_sequence,1.0,482.0,UPI0000137929.a2m,UPI0000137929.npy,gnomAD
+UPI00001983C7,UPI00001983C7.csv,MAGAGRGAAVSRVQAGPGSPRRARGRQQVQPLGKQRPAPWPGLRSKEKKKVNCKPKNQDEQEIPFRLREIMRSRQEMKNPISNKKRKKAAQVTFRKTLEKEAKGEEPDIAVPKFKQRKGESDGAYIHRMQQEAQHVLFLSKNQAIRQPEVQAAPKEKSEQKKAKKAFQKRRLDKVRRKKEEKAADRLEQELLRDTVKFGEVVLQPPELTARPQRSVSKDQPGRRSQMLRMLLSPGGVSQPLTASLARQRIVEEERERAVQAYRALKQRQQQLHGERPHLTSRKKPEPQL,mutated_sequence,1.0,289.0,UPI00001983C7.a2m,UPI00001983C7.npy,gnomAD
+UPI000022AA0A,UPI000022AA0A.csv,MAAAEPMGPAQVPMNSEVIVDPIQGQVNFEDVFVYFSQEEWVLLDEAQRLLYRDVMLENFALMASLGHTSFMSHIVASLVMGSEPWVPDWVDMTLAVATETPGGSDPGCWHGMEDEEIPFEQSFSIGMSQIRIPKGGPSTQKAYPCGTCGLVLKDILHLAEHQETHPGQKPYMCVLCGKQFCFSANLHQHQKQHSGEKPFRSDKSRPFLLNNCAVQSMEMSFVTGEACKDFLASSSIFEHHAPHNEWKPHSNTKCEEASHCGKRHYKCSECGKTFSRKDSLVQHQRVHTGERPYECGECGKTFSRKPILAQHQRIHTGEMPYECGICGKVFNHSSNLIVHQRVHTGARPYKCSECGKAYSHKSTLVQHESIHTGERPYECSECGKYFGHKYRLIKHWSVHTGARPYECIACGKFFSQSSDLIAHQRVHNGEKPYVCSECGKAFSHKHVLVQHHRIHTGERPYKCSECGKAFRQRASLIRHWKIHTGERP,mutated_sequence,1.0,489.0,UPI000022AA0A.a2m,UPI000022AA0A.npy,gnomAD
+UPI000013D193,UPI000013D193.csv,MKLSRQFTVFGSAIFCVVIFSLYLMLDRGHLDYPRNPRREGSFPQGQLSMLQEKIDHLERLLAENNEIISNIRDSVINLSESVEDGPKSSQSNFSQGAGSHLLPSQLSLSVDTADCLFASQSGSHNSDVQMLDVYSLISFDNPDGGVWKQGFDITYESNEWDTEPLQVFVVPHSHNDPGWLKTFNDYFRDKTQYIFNNMVLKLKEDSRRKFIWSEISYLSKWWDIIDIQKKDAVKSLIENGQLEIVTGGWVMPDEATPHYFALIDQLIEGHQWLENNIGVKPRSGWAIDPFGHSPTMAYLLNRAGLSHMLIQRVHYAVKKHFALHKTLEFFWRQNWDLGSVTDILCHMMPFYSYDIPHTCGPDPKICCQFDFKRLPGGRFGCPWGVPPETIHPGNVQSRARMLLDQYRKKSKLFRTKVLLAPLGDDFRYCEYTEWDLQFKNYQQLFDYMNSQSKFKVKIQFGTLSDFFDALDKADETQRDKGQSMFPVLSGDFFTYADRDDHYWSGYFTSRPFYKRMDRIMESHLRAAEILYYFALRQAHKYKINKFLSSSLYTALTEARRNLGLFQHHDAITGTAKDWVVVDYGTRLFHSLMVLEKIIGNSAFLLILKDKLTYDSYSPDTFLEMDLKQKSQDSLPQKNIIRLSAEPRYLVVYNPLEQDRISLVSVYVSSPTVQVFSASGKPVEVQVSAVWDTANTISETAYEISFRAHIPPLGLKVYKILESASSNSHLADYVLYKNKVEDSGIFTIKNMINTEEGITLENSFVLLRFDQTGLMKQMMTKEDGKHHEVNVQFSWYGTTIKRDKSGAYLFLPDGNAKPYVYTTPPFVRVTHGRIYSEVTCFFDHVTHRVRLYHIQGIEGQSVEVSNIVDIRKVYNREIAMKISSDIKSQNRFYTDLNGYQIQPRMTLSKLPLQANVYPMTTMAYIQDAKHRLTLLSAQSLGVSSLNSGQIEVIMDRRLMQDDNRGLEQGIQDNKITANLFRILLEKRSAVNTEEEKKSVSYPSLLSHITSSLMNHPVIPMANKFSSPTLELQGEFSPLQSSLPCDIHLVNLRTIQSKVGNGHSNEAALILHRKGFDCRFSSKGTGLFCSTTQGKILVQKLLNKFIVESLTPSSLSLMHSPPGTQNISEINLSPMEISTFRIQLR,mutated_sequence,1.0,1144.0,UPI000013D193.a2m,UPI000013D193.npy,gnomAD
+UPI000006CD59,UPI000006CD59.csv,MAVSRKDWSALSSLARQRTLEDEEEQERERRRRHRNLSSTTDDEAPRLSQNGDRQASASERLPSVEEAEVPKPLPPASKDEDEDIQSILRTRQERRQRRQVVEAAQAPIQERLEAEEGRNSLSPVQATQKPLVSKKELEIPPRRRLSREQRGPWALEEESLVGREPEERKKGVPEKSPVLEKSSMPKKTAPEKSLVSDKTSISEKVLASEKTSLSEKIAVSEKRNSSEKKSVLEKTSVSEKSLAPGMALGSGRRLVSEKASIFEKALASEKSPTADAKPAPKRATASEQPLAQEPPASGGSPATTKEQRGRALPGKNLPSLAKQGASDPPTVASRLPPVTLQVKIPSKEEEADMSSPTQRTYSSSLKRSSPRTISFRMKPKKENSETTLTRSASMKLPDNTVKLGEKLERYHTAIRRSESVKSRGLPCTELFVAPVGVASKRHLFEKELAGQSRAEPASSRKENLRLSGVVTSRLNLWISRTQESGDQDPQEAQKASSATERTQWGQKSDSSLDAEV,mutated_sequence,1.0,517.0,UPI000006CD59.a2m,UPI000006CD59.npy,gnomAD
+UPI000006DC1A,UPI000006DC1A.csv,MEFDLGAALEPTSQKPGVGAGHGGDPKLSPHKVQGRSEAGAGPGPKQGHHSSSDSSSSSSDSDTDVKSHAAGSKQHESIPGKAKKPKVKKKEKGKKEKGKKKEAPH,mutated_sequence,1.0,106.0,UPI000006DC1A.a2m,UPI000006DC1A.npy,gnomAD
+UPI00001AEEF8,UPI00001AEEF8.csv,MGVAGRNRPGAAWAVLLLLLLLPPLLLLAGAVPPGRGRAAGPQEDVDECAQGLDDCHADALCQNTPTSYKCSCKPGYQGEGRQCEDIDECGNELNGGCVHDCLNIPGNYRCTCFDGFMLAHDGHNCLDVDECLENNGGCQHTCVNVMGSYECCCKEGFFLSDNQHTCIHRSEEGLSCMNKDHGCSHICKEAPRGSVACECRPGFELAKNQRDCILTCNHGNGGCQHSCDDTADGPECSCHPQYKMHTDGRSCLEREDTVLEVTESNTTSVVDGDKRVKRRLLMETCAVNNGGCDRTCKDTSTGVHCSCPVGFTLQLDGKTCKDIDECQTRNGGCDHFCKNIVGSFDCGCKKGFKLLTDEKSCQDVDECSLDRTCDHSCINHPGTFACACNRGYTLYGFTHCGDTNECSINNGGCQQVCVNTVGSYECQCHPGYKLHWNKKDCVEVKGLLPTSVSPRVSLHCGKSGGGDGCFLRCHSGIHLSSDVTTIRTSVTFKLNEGKCSLKNAELFPEGLRPALPEKHSSVKESFRYVNLTCSSGKQVPGAPGRPSTPKEMFITVEFELETNQKEVTASCDLSCIVKRTEKRLRKAIRTLRKAVHREQFHLQLSGMNLDVAKKPPRTSERQAESCGVGQGHAENQCVSCRAGTYYDGARERCILCPNGTFQNEEGQMTCEPCPRPGNSGALKTPEAWNMSECGGLCQPGEYSADGFAPCHLCALGTFQPEAGRTSCFPCGGGLATKHQGATSFQDCETRVQCSPGHFYNTTTHRCIRCPVGTYQPEFGKNNCVSCPGNTTTDFDGSTNITQCKNRRCGGELGDFTGYIESPNYPGNYPANTECTWTINPPPKRRILIVVPEIFLPIEDDCGDYLVMRKTSSSNSVTTYETCQTYERPIAFTSRSKKLWIQFKSNEGNSARGFQVPYVTYDEDYQELIEDIVRDGRLYASENHQEILKDKKLIKALFDVLAHPQNYFKYTAQESREMFPRSFIRLLRSKVSRFLRPYK,mutated_sequence,1.0,999.0,UPI00001AEEF8.a2m,UPI00001AEEF8.npy,gnomAD
+UPI0000D61944,UPI0000D61944.csv,CRQHHRSHPIQLASPPSPHTVGITTVTPYGQHHHRHPIWLASPPSPHAVGITEVTPYSWHHHRHPMQSASPKSPHTVGITAVTPCIGITEVTPYGRHHHRHPIQWASPPSPHTVGITAVTPYGRHHRRHPIQWASPPSPHTVGITAVTPCGRHHHRHPIQWASPKSPHAVGITLVAPCSQHHLGHPTQLASLPSPHAVGITTVTPYSGHHHHHPMQSAPLLSPHRVGITSVTPCSRHHLSHPMQS,mutated_sequence,,,UPI0000D61944.a2m,UPI0000D61944.npy,gnomAD
+UPI0000127786,UPI0000127786.csv,MEQAGTRPAATEHPRLRRPMPWLLLLPLLLLLLLLLPGPAASQLRYSVPEEQAPGALVGNVARALGLELRRLGPGCLRINHLGAPSPRYLELDLTSGALFVNERIDREALCEQRPRCLLSLEVLAHNPVAVSAVEVEILDINDNSPRFPRPNYQLQVSESVAPGARFHIESAQDPDVGANSVQTYELSPSEHFELDLKPLQENSKVLELVLRKGLDREQAALHHLVLTAVDGGIPARSGTAQISVRVLDTNDNSPAFDQSTYRVQLREDSPPGTLVVKLNASDPDEGSNGELRYSLSSYTSDRERQLFSIDASTGEVRVIGGLDYEEASSYQIYVQATDRGPVPMAGHCKVLVDIVDVNDNAPEVVLTDLYSPVPENATPNTIVAVLSVNDQDSGPNRKVSLGLEATLPFRLNGFGNSYTLVVSGPLDRERVAVYNITVTATDGGIPQLTSLRTLKVEISDINDNPPSFLEDSYSIYIQENNLPGVLLCTVQATDPDEKENAEVTYSLLEREIQGLPVTSYVSINSASGSLYAVNSFDYEKFREFFVTVEAQDKGSPPLSSTVTANVYVVDMNDHAPHILYPTSTNSSAAFEMVPRTAPAGYLVTKVIAMDSDSGQNAWLFYHLAQTSDLDLFKVELHTGEIRTTRKMGDESGSTFNLTVVVRDNGEPSLSASVAITVAVVDRVSKILPDTQRHVKSPRTYSEITLYLIIALSTVSFIFLLTIIILSIIKCYRYTAYGTACCGGFCGVRERSPAELYKQANNNIDARIPHGLKVQPHFIEVRGNGSLTKTYCYKACLTAGSGSDTFMFYNTGAQTGPGPSGAQAAVTDSRNLTGQSGQNAGNLIILKNEAVSQNEPRQPNPDWRYSASLRAGMHSSVHLEEAGILRAGPGGPDQQWPTVSSATPEPEAGEVSPPVGAGVNSNSWTFKYGPGNPKQSGPGELPDKFIIPGSPAIISIRQEPTNSQIDKSDFITFGKKEETKKKKKKKKGNKTQEKKEKGNSTTDNSDQ,mutated_sequence,1.0,1007.0,UPI0000127786.a2m,UPI0000127786.npy,gnomAD
+UPI0000458963,UPI0000458963.csv,MLAEWGACLLLAVALLGPGLQAQAMEGVKCGGVLSAPSGNFSSPNFPRLYPYNTECSWLIVVAEGSSVLLTFHAFDLEYHDTCSFDFLEIYNGASPDKGNLLGRFCGKVPPPPFTSSWHVMSVIFHSDKHVASHGFSAGYQKDVCGGVLTGLSGVLTSPEYPNNYPNSMECHWVIRAAGPAHVKLVFVDFQVEGNEECTYDYVAVLGGPGPTRGHHYCGSTRPPTLVSLGHELQVVFKSDFNIGGRGFKAYYFSGECQEVYMAMRGNFSSPQYPSSYPNNIRCHWTIRLPPGYQVKVFFLDLDLEEPNSLTKTCDFDHLAAFDGASEEAPLLGNWCGHHLPPPVTSSHNQLLLLLHTDRSTTRRGFSVAYIGGQLGCGSGSTEGEGEALQPQSLQSPSSIPPVCPAPPMNGLLQLLLHWLHPCPLSGPLRLDGTAPACFHYCRASFPSF,mutated_sequence,1.0,449.0,UPI0000458963.a2m,UPI0000458963.npy,gnomAD
+UPI0000458979,UPI0000458979.csv,MGWGSRCCCPGRLDLLCVLALLGGCLLPVCRTRVYTNHWAVKIAGGFPEANRIASKYGFINIGQIGALKDYYHFYHSRTIKRSVISSRGTHSFISMEPKVEWIQQQVVKKRTKRDYDFSRAQSTYFNDPKWPSMWYMHCSDNTHPCQSDMNIEGAWKRGYTGKNIVVTILDDGIERTHPDLMQNYDALASCDVNGNDLDPMPRYDASNENKHGTRCAGEVAAAANNSHCTVGIAFNAKIGGVRMLDGDVTDMVEAKSVSFNPQHVHIYSASWGPDDDGKTVDGPAPLTRQAFENGVRMGRRGLGSVFVWASGNGGRSKDHCSCDGYTNSIYTISISSTAESGKKPWYLEECSSTLATTYSSGESYDKKIITTDLRQRCTDNHTGTSASAPMAAGIIALALEANPFLTWRDVQHVIVRTSRAGHLNANDWKTNAAGFKVSHLYGFGLMDAEAMVMEAEKWTTVPRQHVCVESTDRQIKTIRPNSAVRSIYKASGCSDNPNRHVNYLEHVVVRITITHPRRGDLAIYLTSPSGTRSQLLANRLFDHSMEGFKNWEFMTIHCWGERAAGDWVLEVYDTPSQLRNFKTPGKLKEWSLVLYGTSVQPYSPTNEFPKVERFRYSRVEDPTDDYGTEDYAGELASSGTQAKKRQLMHHPTTWENRMEWNGMKWNGMKWNGMEWNGMEWNRIESNRIE,mutated_sequence,1.0,690.0,UPI0000458979.a2m,UPI0000458979.npy,gnomAD
+UPI000041EA63,UPI000041EA63.csv,MSSSFFNPSFAFSSHFDPDGAPLSELSWPSSLAVVAVSFSGLFAVIVLMLACLCCKKGGIGFKEFENAEGDEYAADLAQGSPATAAQNGPDVYVLPLTEVSLPMAKQPGRSVQLLKSTDVGRHSLLYLKEIGRGWFGKVFLGEVNSGISSAQVVVKELQASASVQEQMQFLEEVQPYRALKHSNLLQCLAQCAEVTPYLLVMEFCPLGDLKGYLRSCRVAESMAPDPRTLQRMACEVACGVLHLHRNNFVHSDLALRNCLLTADLTVKIGDYGLAHCKYREDYFVTADQLWVPLRWIAPELVDEVHSNLLVVDQTKSGNVWSLGVTIWELFELGTQPYPQHSDQQVLAYTVREQQLKLPKPQLQLTLSDRWYEVMQFCWLQPEQRPTAEEVHLLLSYLCAKGATEAEEEFERRWRSLRPGGGGVGPGPGAAGPMLGGVVELAAASSFPLLEQFAGDGFHADGDDVLTVTETSRGLNFEYKWEAGRGAEAFPATLSPGRTARLQELCAPDGAPPGVVPVLSAHSPSLGSEYFIRLEEAAPAAGHDPDCAGCAPSPPATADQDDDSDGSTAASLAMEPLLGHGPPVDVPWGRGDHYPRRSLARDPLCPSRSPSPSAGPLSLAEGGAEDADWGVAAFCPAFFEDPLGTSPLGSSGAPPLPLTGEDELEEVGARRAAQRGHWRSNVSANNNSGSRCPESWDPVSAGGHAEGCPSPKQTPRASPEPGYPGEPLLGLQAASAQEPGCCPGLPHLCSAQGLAPAPCLVTPSWTETASSGGDHPQAEPKLATEAEGTTGPRLPLPSVPSPSQEGAPLPSEEASAPDAPDALPDSPTPATGGEVSAIKLASALNGSSSSPEVEAPSSEDEDTAEATSGIFTDTSSDGLQARRPDVVPAFRSLQKQVGTPDSLDSLDIPSSASDGGYEVFSPSATGPSGGQPRALDSGYDTENYESPEFVLKEAQEGCEPQAFAELASEGEGPGPETRLSTSLSGLNEKNPYRDSAYFSDLEAEAEATSGPEKKCGGDRAPGPELGLPSTGQPSEQVCLRPGVSGEAQGSGPGEVLPPLLQLEGSSPEPSTCPSGLVPEPPEPQGPAKVRPGPSPSCSQFFLLTPVPLRSEGNSSEFQGPPGLLSGPAPQKRMGGPGTPRAPLRLALPGLPAALEGRPEEEEEDSEDSDESDEELRCYSVQEPSEDSEEEAPAVPVVVAESQSARNLRSLLKMPSLLSETFCEDLERKKKAVSFFDDVTVYLFDQESPTRELGEPFPGAKESPPTFLRGSPGSPSAPNRPQQADGSPNGSTAEEGGGFAWDDDFPLMTAKAAFAMALDPAAPAPAAPTPTPAPFSRFTVSPAPTSRFSITHVSDSDAESKRGPEAGAGGESKEA,mutated_sequence,1.0,1374.0,UPI000041EA63.a2m,UPI000041EA63.npy,gnomAD
+UPI0000D4C11A,UPI0000D4C11A.csv,MWPQPRLPPRPAMSEETRQSKLAAAKKKLREYQQRNSPGVPTGAKKKKKIKNGSNPETTTSGGCHSPEDTPKDNAATLQPSDDTVLPGGVPSPGASLTSMAASQNHDADNVPNLMDETKTFSSTESLRQLSQQLNGLVCESATCVNGEGPASSANLKDLESRYQQLAVALDSSYVTNKQLNITIEKLKQQNQEITDQLEEEKKECHQKQGALREQLQVHIQTIGILVSEKAELQTALAHTQHAARQKEGESEDLASRLQYSRRRVGELERALSAVSTQQKKADRYNKELTKERDALRLELYKNTQSNEDLKQEKSELEEKLRVLVTEKAGMQLNLEELQKKLEMTELLLQQFSSRCEAPDANQQLQQAMEERAQLEAHLGQVMESVRQLQMERDKYAENLKGESAMWRQRMQQMSEQVHTLREEKECSMSRVQELETSLAELRNQMAEPPPPEPPAGPSEVEQQLQAEAEHLRKELEGLAGQLQAQVQDNEGLSRLNREQEERLLELERAAELWGEQAEARRQILETMQNDRTTISRALSQNRELKEQLAELQSGFVKLTNENMEITSALQSEQHVKRELGKKLGELQEKLSELKETVELKSQEAQSLQQQRDQYLGHLQQYVAAYQQLTSEKEVLHNQLLLQTQLVDQLQQQEAQGKAVAEMARQELQETQERLEAATQQNQQLRAQLSLMAHPGEGDGLDREEEEDEEEEEEEAVAVPQPMPSIPEDLESREAMVAFFNSAVASAEEEQARLRGQLKEQRVRCRRLAHLLASAQKEPEAAAPAPGTGGDSVCGETHRALQGAMEKLQSRFMELMQEKADLKERVEELEHRCIQLSGETDTIGEYIALYQSQRAVLKERHREKEEYISRLAQDKEEMKVKLLELQELVLRLVGDRNEWHGRFLAAAQNPADEPTSGAPAPQELGAANQQGDLCEVSLAGSVEPAQGEAREGSPRDNPTAQQIMQLLREMQNPRERPGLGSNPCIPFFYRADENDEVKITVI,mutated_sequence,1.0,1002.0,UPI0000D4C11A.a2m,UPI0000D4C11A.npy,gnomAD
+UPI0000135F21,UPI0000135F21.csv,MGHHRPWLHASVLWAGVASLLLPPAMTQQLRGDGLGFRNRNNSTGVAGLSEEASAELRHHLHSPRDHPDENKDVSTENGHHFWSHPDREKEDEDVSKEYGHLLPGHRSQDHKVGDEGVSGEEVFAEHGGQARGHRGHGSEDTEDSAEHRHHLPSHRSHSHQDEDEDEVVSSEHHHHILRHGHRGHDGEDDEGEEEEEEEEEEEEASTEYGHQAHRHRGHGSEEDEDVSDGHHHHGPSHRHQGHEEDDDDDDDDDDDDDDDDVSIEYRHQAHRHQGHGIEEDEDVSDGHHHRDPSHRHRSHEEDDNDDDDVSTEYGHQAHRHQDHRKEEVEAVSGEHHHHVPDHRHQGHRDEEEDEDVSTERWHQGPQHVHHGLVDEEEEEEEITVQFGHYVASHQPRGHKSDEEDFQDEYKTEVPHHHHHRVPREEDEEVSAELGHQAPSHRQSHQDEETGHGQRGSIKEMSHHPPGHTVVKDRSHLRKDDSEEEKEKEEDPGSHEEDDESSEQGEKGTHHGSRDQEDEEDEEEGHGLSLNQEEEEEEDKEEEEEEEDEERREERAEVGAPLSPDHSEEEEEEEEGLEEDEPRFTIIPNPLDRREEAGGASSEEESGEDTGPQDAQEYGNYQPGSLCGYCSFCNRCTECESCHCDEENMGEHCDQCQHCQFCYLCPLVCETVCAPGSYVDYFSSSLYQALADMLETPEP,mutated_sequence,1.0,699.0,UPI0000135F21.a2m,UPI0000135F21.npy,gnomAD
+UPI00006EB135,UPI00006EB135.csv,METRSSKTRRSLASRTNECQGTMWAPTSPPAGSSSPSQPTWKSSLYSSLAYSEAFHYSFAARPRRLTQLALAQRPEPQLLRLRPSSLRTQDISHLLTGVFRNLYSAEVIGDEVSASLIKARGSENERHEEFVDQLQQIRELYKQRLDEFEMLERHITQAQARAIAENERVMSQAGVQDLESLVRLPPVKSVSRWCIDSELLRKHHLISPEDYYTDTVPFHSAPKGISLPGCSKLTFSCEKRSVQKKELNKKLEDSCRKKLAEFEDELDHTVDSLTWNLTPKAKERTREPLKKASQPRNKNWMNHLRVPQRELDRLLLARMESRNHFLKNPRFFPPNTRYGGKSLVFPPKKPAPIGEFQSTEPEQSCADTPVFLAKPPIGFFTDYEIGPVYEMVIALQNTTTTSRYLRVLPPSTPYFALGLGMFPGKGGMVAPGMTCQYIVQFFPDCLGDFDDFILVETQSAHTLLIPLQARRPPPVLTLSPVLDCGYCLIGGVKMTRFICKNVGFSVGRFCIMPKTSWPPLSFKAIATVGFVEQPPFGILPSVFELAPGHAILVEVLFSPKSLGKAEQTFIIMCDNCQIKELVTIGIGQLIALDLIYISGEKSQPDPGELTDLTAQHFIRFEPENLRSTARKQLIIRNATHVELAFYWQIMKPNLQPLMPGETFSMDSIKCYPDKETAFSIMPRKGVLSPHTDHEFILSFSPHELRDFHSVLQMVLEEVPEPVSSEAESLGHSSYSVDDVIVLEIEVKGSVEPFQVLLEPYALIIPGENYIGINVKKAFKMWNNSKSPIRYLWGKISDCHIIEVEPGTGVIEPSEVGDFELNFTGGVPGPTSQDLLCEIEDSPSPVVLHIEAVFKGPALIINVSALQFGLLRLGQKATNSIQIRNVSQLPATWRMKESPVSLQERPEDVSPFDIEPSSGQLHSLGECRVDITLEALHCQHLETVLELEVENGAWSYLPVYAEVQKPHVYLQSSQVEVRNLYLGVPTKTTITLINGTLLPTQFHWGKLLGHQAEFCMVTVSPKHGLLGPSEECQLKLELTAHTQEELTHLALPCHVSGMKKPLVLGISGKPQGLQVAITISKESSDCSTEQWPGHPKELRLDFGSAVPLRTRVTRQLILTNRSPIRTRFSLKFEYFGSPQNSLSKKTSLPNMPPALLKTVRMQEHLAKREQLDFMESMLSHGKGAAFFPHFSQGMLGPYQQLCIDITGCANMWGEYWDNLICTVGDLLPEVIPVHMAAVGCPISSLRTTSYTIDQAQKEPAMRFGTQVSGGDTVTRTLRLNNSSPCDIRLDWETYVPEDKEDRLVELLVFYGPPFPLRDQAGNELVCPDTPEGGCLLWSPGPSSSSEFSHETDSSVEGSSSASNRVAQKLISVILQAHEGVPSGHLYCISPKQVVVPAGGSSTIYISFTPMVLSPEILHKVECTGYALGFMSLDSKVEREIPGKRHRLQDFAVGPLKLDLHSYVRPAQLSVELDYGGSMEFQCQASDLIPEQPCSGVLSELVTTHHLKLTNTTEIPHYFRLMVSRPFSVSQDGASQDHRAPGPGQKQECEEETASADKQLVLQAQENMLVNVSFSLSLELLSYQKLPADQTLPGVDIQQSASGEREMVFTQNLLLEYTNQTTQVVPLRAVVAVPELQLSTSWVDFGTCFVSQQRVREVYLMNLSGCRSYWTMLMGVVSCTSPRWWWKVCSVRSPAPCGSGAKAPMMRDTCCLTSPEAPPQPSAPGPSWRKNIAQGLGAALQHKDTDLGTWGPLGSSWNGRTPFHNGLSLGPHDMSSELT,mutated_sequence,1.0,1778.0,UPI00006EB135.a2m,UPI00006EB135.npy,gnomAD
+UPI00003667EB,UPI00003667EB.csv,MPGPRGAAGGLAPEMRGAGAAGLLALLLLLLLLLLGLGGRVEGGPAGERGAGGGGALARERFKVVFAPVICKRTCLKGQCRDSCQQGSNMTLIGENGHSTDTLTGSGFRVVVCPLPCMNGGQCSSRNQCLCPPDFTGRFCQVPAGGAGGGTGGSGPGLSRTGALSTGALPPLAPEGDSVASKHAIYAVQVIADPPGPGEGPPAQHAAFLVPLGPGQISAEVQAPPPVVNVRVHHPPEASVQVHRIESSNAESAAPSQHLLPHPKPSHPRPPTQKPLGRCFQDTLPKQPCGSNPLPGLTKQEDCCGSIGTAWGQSKCHKCPQLQYTGVQKPGPVRGEVGADCPQGYKRLNSTHCQDINECAMPGVCRHGDCLNNPGSYRCVCPPGHSLGPSRTQCIADKPEEKSLCFRLVSPEHQCQHPLTTRLTRQLCCCSVGKAWGARCQRCPTDGTAAFKEICPAGKGYHILTSHQTLTIQGESDFSLFLHPDGPPKPQQLPESPSQAPPPEDTEEERGVTTDSPVSEERSVQQSHPTATTTPARPYPELISRPSPPTMRWFLPDLPPSRSAVEIAPTQVTETDECRLNQNICGHGECVPGPPDYSCHCNPGYRSHPQHRYCVDVNECEAEPCGPGRGICMNTGGSYNCHCNRGYRLHVGAGGRSCVDLNECAKPHLCGDGGFCINFPGHYKCNCYPGYRLKASRPPVCEDIDECRDPSSCPDGKCENKPGSFKCIACQPGYRSQGGGACRDVNECAEGSPCSPGWCENLPGSFRCTCAQGYAPAPDGRSCLDVDECEAGDVCDNGICSNTPGSFQCQCLSGYHLSRDRSHCEDIDECDFPAACIGGDCINTNGSYRCLCPQGHRLVGGRKCQDIDECSQDPSLCLPHGACKNLQGSYVCVCDEGFTPTQDQHGCEEVEQPHHKKECYLNFDDTVFCDSVLATNVTQQECCCSLGAGWGDHCEIYPCPVYSSAEFHSLCPDGKGYTQDNNIVNYGIPAHRDIDECMLFGSEICKEGKCVNTQPGYECYCKQGFYYDGNLLECVDVDECLDESNCRNGVCENTRGGYRCACTPPAEYSPAQRQCLSPEEMDVDECQDPAACRPGRCVNLPGSYRCECRPPWVPGPSGRDCQLPESPAERAPERRDVCWSQRGEDGMCAGPLAGPALTFDDCCCRQGRGWGAQCRPCPPRGAGSHCPTSQSESNSFWDTSPLLLGKPPRDEDSSEEDSDECRCVSGRCVPRPGGAVCECPGGFQLDASRARCVDIDECRELNQRGLLCKSERCVNTSGSFRCVCKAGFARSRPHGACVPQRRR,mutated_sequence,1.0,1303.0,UPI00003667EB.a2m,UPI00003667EB.npy,gnomAD
+UPI000020198E,UPI000020198E.csv,MIATGGLLRISARKQDPLRPPSQIPKRKRKAKKRRKNDVVVVKGKLKLCSISGLIALCGILVLLVGIAMAVVGYWPKATGTNREGGKQLPPAGSSHRVPTTANSSSSGSKNRSRSHPRAPGGVNSSSAGAPRSTPPARAASPSSSSTSVGFFFRIFSGYLHSDKLKVFGPLIMGIGIFLFICANAVLHENRDKKTKIINLRDLYSTVIDVHSLRAKDLAAAAAAAAAAAASSSSSAPAAAPPGAIPLNGFLSYVQSRGLELKPGGCGGSGDAFGAAAMLAKGSWPPHPAAPSGGRPRGAASPPDLASSPRCPREPPSLAEAVYSVYRERSGVAGSRRAAAATAAAAASSCSSPAPCSPPESWGRQSTASSFVDSSLSAFALLPLQGGRDRGGDAEGASCSWQRPPGERGSQEIPRGELDLSMTNLRGAEGSMRGARREPEEPEGAVAARAARGQGGRLPRTGRYAALRRRSTSGLPDYRAPPSPEPPPSPGSADPDSSPLAKAASPSPPLRLEGSPPTRRDSGSSQSDDPSSSNKGYTPLREAGTSTESVLDAVAGQTRDSAVAAPVLGAEQSSPEGASQEPPTAEQPQPVQRQFTNKEKLIMISRSHAIGVEEELESTGI,mutated_sequence,1.0,621.0,UPI000020198E.a2m,UPI000020198E.npy,gnomAD
+UPI00001C1FC7,UPI00001C1FC7.csv,MHRAVDPPGARAAREAFALGGLSCAGAWSSCPPHPPPRSAWLPGGRCSASIGQPPLPAPLPPSHGSSSGHPSKPYYAPGAPTPRPLHGKLESLHGCVQALLREPAQPGLWEQLGQLYESEHDSEEATRCYHSALRYGGSFAELGPRIGRLQQAQLWNFHTGSCQHRAKVLPPLEQVWNLLHLEHKRNYGAKRGGPPVKRAAEPPVVQPVPPAALSGPSGEEGLSPGGKRRRGCNSEQTGLPPGLPLPPPPLPPPPPPPPPPPPPLPGLATSPPFQLTKPGLWSTLHGDAWGPERKGSAPPERQEQRHSLPHPYPYPAPAYTAHPPGHRLVPAAPPGPGPRPPGAESHGCLPATRPPGSDLRESRVQRSRMDSSVSPAATTACVPYAPSRPPGLPGTTTSSSSSSSSNTGLRGVEPNPGIPGADHYQTPALEVSHHGRLGPSAHSSRKPFLGAPAATPHLSLPPGPSSPPPPPCPRLLRPPPPPAWLKGPACRAAREDGEILEELFFGTEGPPRPAPPPLPHREGFLGPPASRFSVGTQDSHTPPTPPTPTTSSSNSNSGSHSSSPAGPVSFPPPPYLARSIDPLPRPPSPAQNPQDPPLVPLTLALPPAPPSSCHQNTSGSFRRPESPRPRVSFPKTPEVGPGPPPGPLSKAPQPVPPGVGELPARGPRLFDFPPTPLEDQFEEPAEFKILPDGLANIMKMLDESIRKEEEQQQHEAGVAPQPPLKEPFASLQSPFPTDTAPTTTAPAVAVTTTTTTTTTTTATQEEEKKPPPALPPPPPLAKFPPPSQPQPPPPPPPSPASLLKSLASVLEGQKYCYRGTGAAVSTRPGPLPTTQYSPGPPSGATALPPTSAAPSAQGSPQPSASSSSQFSTSGGPWARERRAGEEPVPGPMTPTQPPPPLSLPPARSESEVLEEISRACETLVERVGRSATDPADPVDTAEPADSGTERLLPPAQAKEEAGGVAAVSGSCKRRQKEHQKEHRRHRRACKDSVGRRPREGRAKAKAKVPKEKSRRVLGNLDLQSEEIQGREKSRPDLGGASKAKPPTAPAPPSAPAPSAQPTPPSASVPGKKAREEAPGPPGVSRADMLKLRSLSEGPPKELKIRLIKVESGDKETFIASEVEERRLRMADLTISHCAADVVRASRNAKVKGKFRESYLSPAQSVKPKINTEEKLPREKLNPPTPSIYLESKRDAFSPVLLQFCTDPRNPITVIRGLAGSLRLNLGLFSTKTLVEASGEHTVEVRTQVQQPSDENWDLTGTRQIWPCESSRSHTTIAKYAQYQASSFQESLQEEKESEDEESEEPDSTTGTPPSSAPDPKNHHIIKFGTNIDLSDAKRWKPQLQELLKLPAFMRVTSTGNMLSHVGHTILGMNTVQLYMKVPGSRTPGHQENNNFCSVNINIGPGDCEWFAVHEHYWETISAFCDRHGVDYLTGSWWPILDDLYASNIPVYRFVQRPGDLVWINAGTVHWVQATGWCNNIAWNVGPLTAYQYQLALERYEWNEVKNVKSIVPMIHVSWNVARTVKISDPDLFKMIKFCLLQSMKHCQVQRESLVRAGKKIAYQGRVKDEPAYYCNECDVEVFNILFVTSENGSRNTYLVHCEGCARRRSAGLQGVVVLEQYRTEELAQAYDAFTLAPASTSR,mutated_sequence,1.0,1643.0,UPI00001C1FC7.a2m,UPI00001C1FC7.npy,gnomAD
+UPI00005C3036,UPI00005C3036.csv,MESGPRAELGAGAPPAVVARTPPEPRPSPEGDPSPPPPPMSALVPDTPPDTPPAMKNATSSKQLPLEPESPSGQVGPRPAPPQEESPSSEAKSRGPTPPAMGPRDARPPRRSSQPSPTAVPASDSPPTKQEVKKAGERHKLAKERREERAKYLAAKKAVWLEKEEKAKALREKQLQERRRRLEEQRLKAEQRRAALEERQRQKLEKNKERYEAAIQRSVKKTWAEIRQQRWSWAGALHHSSPGHKTSGSRCSVSAVNLPKHVDSIINKRLSKSSATLWNSPSRNRSLQLSAWESSIVDRLMTPTLSFLARSRSAVTLPRNGRDQGRGCDPGRGPTWGRAGASLARGPQPDRTHPSAAVPVCPRSASASPLTPCSVTRSVHRCAPAGERGERRKPNAGGSPAPVRRRPEASPVQKKEKKDKERENEKEKSALARERSLKKRQSLPASPRARLSASTASELSPKSKARPSSPSTSWHRPASPCPSPGPGHTLPPKPPSPRGTTASPKGRVRRKEEAKESPSAAGPEDKSQSKRRASNEKESAAPASPAPSPAPSPTPAPPQKEQPPAETPTDAAVLTSPPAPAPPVTPSKPMAGTTDREEATRLLAEKRRQAREQREREEQERRLQAERDKRMREEQLAREAEARAEREAEARRREEQEAREKAQAEQEEQERLQKQKEEAEARSREEAERQRLEREKHFQQQEQERQERRKRLEEIMKRTRKSEVSETKQKQDSKEANANGSSPEPVKAVEARSPGLQKEAVQKEEPIPQEPQWSLPSKELPASLVNGLQPLPAHQENGFSTNGPSGDKSLSRTPETLLPFAEAEAFLKKAVVQSPQVTEVL,mutated_sequence,1.0,841.0,UPI00005C3036.a2m,UPI00005C3036.npy,gnomAD
+UPI0001E92A31,UPI0001E92A31.csv,MRRGNISPAFWFLWLLLFGLLGPSSENTTAFTKGSDTTTASITGSETTMASTMASTSALTTGSKITTDSTTGSETTSASTMASTAAFTTGSETNTASTTDSGTTIASTRTFTTGSDTTTGSTAGSETIVASTTVSGTTTTFTIASTTVPETTMASSTTSTAGSEKTMASSIISETTMASTTGSETATVSTTGSETTTTSTASSEATKVSTTGSETTTASTAGSETTTTSTSMAGSEATTTSTADSKVITASSMSSETTVAPAAGSNTTTASTTGSETTTILIKASETTTASTAGSETTTPSPTGSQTTIVSISGSEITTTSTAGSENTTVSSAGSGTTTASMAGSETTVSTAGSETTTVSITGTETTMVSAMGSETTTNSTTSSETTVTSTAGSETTTVSTVGSETTTAYTADSETTAASTTGSEMTTVFTAGSETITPSTAGSETTTVSTAGSETTTVSTTGSETTTASTAHSETTAASTMGSETTKVSTAGSETTVSTAGSETTAASTEDSETNTAFTEDSKTTTASTTGFETTAASTTGSEPTMASTMGSETTMASTIGPETTKVSTASSEVTTVFAAGSETIRASTVGSETTTVSTTGSETTTASIMGSETSTDSTTGSETTTASTEGSETTTASTEGSEATTVSTTGSETTTVSITDSETTTTCTEGSEMTAVSTTVFETTTASTEGSEITIASTSDSETTTASTEGSETTTVTTAGSETKTAYTTGSETTTASNTGLETTTVFTIGSDTTTASTEGSETTAVSATGSEMTTVSTEGSENTTVSTTGSETTTVSTTGLETTTTSTEGSEMTTVSTTGAETTTDSTEGSGTTAASTAGSETTTVSTADSENTTASTADSETTSASTTGSETTTASTTSSETTTASTEGSETTTVSTTDSETTMVSTTGSERTITSTEGSETTTVSATGSETTVSTEGSGTTTVSITGSETTKVSTTGSETTTTSTEGSEITTASITGSETTTASTEGSETTTASTEGSETTSASTTGSETTTASTTSSETTMASIMGSETTMASTIGSETTKVSTASSKMTTVFTENSETTIASTTASETTTVSTAGSETIPASTAGSETTTTTSTEGSETTTASTEGSETTTASTESSETTTATTIGSETTTASTEGSETTTTSTEGSETTTASTEGSEITTVSTTGSETTTASTEGSETTTASTEGSELTTVSTTGSETITVSAEGSETTTVTTMGSETTTASTAGSETTTVSTAGSETTTASIEGSETTTVSSTGSETTTVSTTGTETTITSTEGSETTTVTTAGSETTAVYTTGSETTTTSTEGSETTTVSTTGSETTTASTADLETTTVSTSGSGTTTASTAGSETTTVYITGSKTTTASTEGSEATTVSTTSSETTTASTTGSEMTTVFTTVSETTTVSTIGSEATTSSAAGSEATTTSTEGSETTTASTAGSETTTASTAGSETTTASTSGSETNTACTTGSETSTPSSAGSETNTAFIIGSESTIASTASLEPTATSLTGSETTTVSITASGATAASTTVSSTTFVLTKATDVSIQPITNTPMSGTRTTGTRLTASSSVTMAPGMDFTASAASHTVPGIVLNTSGLGTSTMGASSTTSAHGVRTTTGSTREPTSSTFQETGPVSMGTNTVSMSHTPTNVIKPSGYLQPWAIILISLAAVVAAVGLSVGLSFCLRNLFFPLRYCGIYYPHGHSHSLGLDLNLGLGSGTFHSLGNALVHGGELEMGHGGTHGFGYGVGHGLSHIHGDGYGVNHGGHYGHGGGH,mutated_sequence,1.0,1773.0,UPI0001E92A31.a2m,UPI0001E92A31.npy,gnomAD
+UPI00001D7CA5,UPI00001D7CA5.csv,MCSGNQTSQNQTASTDFTLTGLFAESKHAALLYTVTFLLFLMALTGNALLILLIHSEPRLHTPMYFFISQLALMDLMYLCVTVPKMLVGQVTGDDTISPSGCGIQMFFYLTLAGAEVFLLAAMAYDRYAAVCRPLHYPLLMNQRVCQLLVSACWVLGMVDGLLLTPITMSFPFCQSRKILSFFCETPALLKLSCSDVSLYKTLMYLCCILMLLAPIMVISSSYTLILHLIHRMNSAAGHRKALATCSSHMIIVLLLFGASFYTYMLPSSYHTAEQDMMVSAFYTIFTPVLNPLIYSLRNKDVTRALRSMMQSRMNQEK,mutated_sequence,1.0,318.0,UPI00001D7CA5.a2m,UPI00001D7CA5.npy,gnomAD
+UPI00004DDA7C,UPI00004DDA7C.csv,MASPLRDEEEEEEEMVVSEEEEEEEEEGDEEEEEEVEAADEDDEEDDDEGVLGRGPGHDRGRDRHSPPGCHLFPPPPPPPPPLPPPPPPPPPDKDDIRLLPSALGVKKRKRGPKKQKENKPGKPRKRKKRDSEEEFGSERDEYREKSESGGSEYGTGPGRKRRRKHREKKEKKTKRRKKGEGDGGQKQVEQKSSATLLLTWGLEDVEHVFSEEDYHTLTNYKAFSQFMRPLIAKKNPKIPMSKMMTILGAKWREFSANNPFKGSAAAVAAAAAAAAAAVAEQVSAAVSSATPIAPSGPPALPPPPAADIQPPPIRRAKTKEGKGPGHKRRSKSPRVPDGRKKLRGKKMAPLKIKLGLLGGKRKKGGSYVFQSDEGPEPEAEESDLDSGSVHSASGRPDGPVRTKKLKRGRPGRKKKKVLGCPAVAGEEEVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAVPAPQQADGNPDVPPPRPLQGRSEREFFVKWVGLSYWHCSWAKELQLEIFHLVMYRNYQRKNDMDEPPPLDYGSGEDDGKSDKRKVKDPHYAEMEEKYYRFGIKPEWMTVHRIINHSVDKKGNYHYLVKWRDLPYDQSTWEEDEMNIPEYEEHKQSYWRHRELIMGEDPAQPRKYKKKKKELQGDGPPSSPTNDPTVKYETQPRFITATGGTLHMYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGHTKGPFLVSAPLSTIINWEREFQMWAPKFYVVTYTGDKDSRAIIRENEFSFEDNAIKGGKKAFKMKREAQVKFHVLLTSYELITIDQAALGSIRWACLVVDEAHRLKNNQSKFFRVLNGYKIDHKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRRLKADVFKNMPAKTELIVRVELSPMQKKYYKYILTRNFEALNSRGGGNQVSLLNIMMDLKKCCNHPYLFPVAAMESPKLPSGAYEGGALIKSSGKLMLLQKMLRKLKEQGHRVLIFSQMTKMLDLLEDFLDYEGYKYERIDGGITGALRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIFDSDWNPHNDIQAFSRAHRIGQANKVMIYRFVTRASVEERITQVAKRKMMLTHLVVRPGLGSKAGSMSKQELDDILKFGTEELFKDENEGENKEEDSSVIHYDNEAIARLLDRNQDATEDTDVQNMNEYLSSFKVAQYVVREEDKIEEIEREIIKQEENVDPDYWEKLLRHHYEQQQEDLARNLGKGKRVRKQVNYNDAAQEDQDNQSEYSVGSEEEDEDFDERPEGRRQSKRQLRNEKDKPLPPLLARVGGNIEVLGFNTRQRKAFLNAVMRWGMPPQDAFTTQWLVRDLRGKTEKEFKAYVSLFMRHLCEPGADGSETFADGVPREGLSRQQVLTRIGVMSLVKKKVQEFEHINGRWSMPELMPDPSADSKRSSRASSPTKTSPTTPEASATNSPCTSKPATPAPSEKGEGIRTPLEKEEAENQEEKPEKNSRIGEKMETEADAPSPAPSLGERLEPRKIPLEDEVPGVPGEMEPEPGYRGDREKSATESTPGERGEEKPLDGQEHRERPEGETGDLGKREDVKGDRELRPGPRDEPRSNGRREEKTEKPRFMFNIADGGFTELHTLWQNEERAAISSGKLNEIWHRRHDYWLLAGIVLHGYARWQDIQNDAQFAIINEPFKTEANKGNFLEMKNKFLARRFKLLEQALVIEEQLRRAAYLNLSQEPAHPAMALHARFAEAECLAESHQHLSKESLAGNKPANAVLHKVLNQLEELLSDMKADVTRLPATLSRIPPIAARLQMSERSILSRLASKGTEPHPTPAYPPGPYATPPGYGAAFSAAPVGALAAAGANYSQMPAGSFITAATNGPPVLVKKEKEMVGALVSDGLDRKEPRAGEVICIDD,mutated_sequence,1.0,2059.0,UPI00004DDA7C.a2m,UPI00004DDA7C.npy,gnomAD
+UPI000013C519,UPI000013C519.csv,MCGPDDRCPARWPGPGRAVKCGKGLAAARPGRVERGGAQRGGAGLELHPLLGGRTWRAARDADGCEALGTVAVPFDDDDKIVGGYTCEENSLPYQVSLNSGSHFCGGSLISEQWVVSAAHCYKTRIQVRLGEHNIKVLEGNEQFINAAKIIRHPKYNRDTLDNDIMLIKLSSPAVINARVSTISLPTTPPAAGTECLISGWGNTLSFGADYPDELKCLDAPVLTQAECKASYPGKITNSMFCVGFLEGGKDSCQRDSGGPVVCNGQLQGVVSWGHGCAWKNRPGVYTKVYNYVDWIKDTIAANS,mutated_sequence,1.0,304.0,UPI000013C519.a2m,UPI000013C519.npy,gnomAD
+UPI000015FEF2,UPI000015FEF2.csv,MYTVLTGTPPFMASPLSEMYQNIREGHYPEPAHLSANARRLIVHLLAPNPAERPSLDHLLQDDFFTQGFTPDRLPAHSCHSPPIFAIPPPLGRIFRKVGQRLLTQCRPPCPFTPKEASGPGEGGPDPDSMEWDGESSLSAKEVPCLEGPIHLVAQGTLQSDLAGPEGSRRPEVEAALRHLQLCLDVGPPATQDPLGEQQPILWAPKWVDYSSKYGFGYQLLDGGRTGRHPHGPATPRREGTLPTPVPPAGPGLCLLRFLASEHALLLLFSNGMVQVSFSGVPAQLVLSGEGEGLQLTLWEQGSPGTSYSLDVPRSHGCAPTTGQHLHHALRMLQSI,mutated_sequence,1.0,336.0,UPI000015FEF2.a2m,UPI000015FEF2.npy,gnomAD
+UPI000012500C,UPI000012500C.csv,MASPALAAALAVAAAAGPNASGAGERGSGGVANASGASWGPPRGQYSAGAVAGLAAVVGFLIVFTVVGNVLVVIAVLTSRALRAPQNLFLVSLASADILVATLVMPFSLANELMAYWYFGQVWCGVYLALDVLFCTSSIVHLCAISLDRYWSVTQAVEYNLKRTPRRVKATIVAVWLISAVISFPPLVSLYRQPDGAAYPQCGLNDETWYILSSCIGSFFAPCLIMGLVYARIYRVAKLRTRTLSEKRAPVGPDGASPTTENGLGAAAGAGENGHCAPPPADVEPDESSAAAERRRRRGALRRGGRRRAGAEGGAGGADGQGAGPGAAESGALTASRSPGPGGRLSRASSRSVEFFLSRRRRARSSVCRRKVAQAREKRFTFVLAVVMGVFVLCWFPFFFSYSLYGICREACQVPGPLFKFFFWIGYCNSSLNPVIYTVFNQDFRRSFKHILFRRRRRGFRQ,mutated_sequence,1.0,462.0,UPI000012500C.a2m,UPI000012500C.npy,gnomAD
+UPI000204AD4D,UPI000204AD4D.csv,SPLRRTFKSKVLAHYPQNIEWNPFDQDAVNMLCMPKGLSFRTQTDNKDPQFHSFIITREDGSRTYGFVLTFYEEVTSKQICTAMQTLYQMHNAEHYSSVYASSSCSMDSLASSLDEGDTTSLLKLQRYNSYDISRDTLYVSKSICLITPLPFMQACKKFLIQLYKAVTSQQPPPLPLESYIHNILYEVPLPPPGRSLKFYGVYEPVICQRPGPSELPLSDYPLREAFELLGLENLVQVFTCVLLEMQILLYSQDYQRLMTVAEGITTLLFPFQWQHVYVPILPASLLHFLDAPVPYLMGLQSKEGTDRSKLELPQEANLCFVDIDNHFIELPEEFPQFPNKVDFIQELSEVLVQFGIPPEGSLHCSESTSKLKNMVLKDLVNDKKNGNVCTNNISMYELLKGNETIARLQALAKRTGVAVEKMDLSASLGEKDKDLKLHCEEAELRDYQLNVQLREVFANRFTQMFADYEAFVIQTAQDMESWLTNREQMQNFDKVKRTIVFLLWLLVALLLILTFLNFLFFCAYAYPYIIDFVNLF,mutated_sequence,1.0,537.0,UPI000204AD4D.a2m,UPI000204AD4D.npy,gnomAD
+UPI000013D247,UPI000013D247.csv,MEHPLFGCLRSPHATAQGLHPFSQSSLALHGRSDHMSYPELSTSSSSCIIAGYPNEEGMFASQHHRGHHHHHHHHHHHHHQQQQHQALQTNWHLPQMSSPPSAARHSLCLQPDSGGPPELGSSPPVLCSNSSSLGSSTPTGAACAPGDYGRQALSPAEAEKRSGGKRKSDSSDSQEGNYKSEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRYEIAVNLDLTERQVKVWFQNRRMKWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGAATLQQTGDSIANEDSHDSDHSSEHAHL,mutated_sequence,1.0,304.0,UPI000013D247.a2m,UPI000013D247.npy,gnomAD
+UPI000012ECF8,UPI000012ECF8.csv,MFPVFPCTLLAPPFPVLGLDSRGVGGLMNSFPPPQGHAQNPLQVGAELQSRFFASQGCAQSPFQAAPAPPPTPQAPAAEPLQVDLLPVLAAAQESAAAAAAAAAAAAAVAAAPPAPAAASTVDTAALKQPPAPPPPPPPVSAPAAEAAPPASAATIAAAAATAVVAPTSTVAVAPVASALEKKTKSKGPYICALCAKEFKNGYNLRRHEAIHTGAKAGRVPSGAMKMPTMVPLSLLSVPQLSGAGGGGGEAGAGGGAAAVAAGGVVTTTASGKRIRKNHACEMCGKAFRDVYHLNRHKLSHSDEKPYQCPVCQQRFKRKDRMSYHVRSHDGAVHKPYNCSHCGKSFSRPDHLNSHVRQVHSTERPFKCEKCEAAFATKDRLRAHTVRHEEKVPCHVCGKMLSSAYISDHMKVHSQGPHHVCELCNKGTGEVCPMAAAAAAAAAAAAAAVAAPPTAVGSLSGAEGVPVSSQPLPSQPW,mutated_sequence,1.0,477.0,UPI000012ECF8.a2m,UPI000012ECF8.npy,gnomAD
+UPI0000167B81,UPI0000167B81.csv,MAKSKNHTTHNQSRKWHRNGIKKPRSQRYESLKGVDPKFLRNMRFAKKHNKKGLKKMQANNAKAMSARAEAIKALVKPKEVKPKIPKGVSRKLDRLAYIAHPKLGKRARARIAKGLRLCRPKAKAKAKAKDQTKAQAAAPASVPAQAPKRTQAPTKASE,mutated_sequence,1.0,159.0,UPI0000167B81.a2m,UPI0000167B81.npy,gnomAD
+UPI000013EF3A,UPI000013EF3A.csv,MKITRQKHAKKHLGFFRNNFGVREPYQILLDGTFCQAALRGRIQLREQLPRYLMGETQLCTTRCVLKELETLGKDLYGAKLIAQKCQVRNCPHFKNAVSGSECLLSMVEEGNPHHYFVATQDQNLSVKVKKKPGVPLMFIIQNTMVLDKPSPKTIAFVKAVESGQLVSVHEKESIKHLKEEQGLVKNTEQSRRKKRKKISGPNPLSCLKKKKKAPDTQSSASEKKRKRKRIRNRSNPKVLSEKQNAEGE,mutated_sequence,1.0,249.0,UPI000013EF3A.a2m,UPI000013EF3A.npy,gnomAD
+UPI000046FEDB,UPI000046FEDB.csv,MTCFGPYPIGTGLQGITLCCFPSFCKMKSRMCMAISICQMLSMLSFVVCAFRYRHMFKRGWPMGTCCLFLPTAAPVLSCEAATQTERRLDLAAVTLRRGLRSRASRCRPRSLIDYKSYMDTKLLVARFLEQSSCTMTPDIHELVENIKSVLKSDEEHMEEAITSASFLEQIMAPLQPSTSRAHKLPSRRQPGLLHLQSCGDLHTFTPAGRPRAERRPRRVEAERPHSLIGVIRETVL,mutated_sequence,1.0,237.0,UPI000046FEDB.a2m,UPI000046FEDB.npy,gnomAD
+UPI000007007F,UPI000007007F.csv,MLLPLLLLLPMCWAVEVKRPRGVSLTNHHFYDESKPFTCLDGSATIPFDQVNDDYCDCKDGSDEPGTAACPNGSFHCTNTGYKPLYIPSNRVNDGVCDCCDGTDEYNSGVICENTCKEKGRKERESLQQMAEVTREGFRLKKILIEDWKKAREEKQKKLIELQAGKKSLEDQVEMLRTVKEEAEKPEREAKEQHQKLWEEQLAAAKAQQEQELAADAFKELDDDMDGTVSVTELQTHPELDTDGDGALSEAEAQALLSGDTQTDATSFYDRVWAAIRDKYRSEALPTDLPAPSAPDLTEPKEEQPPVPSSPTEEEEEEEEEEEEEAEEEEEEEDSEEAPPPLSPPQPASPAEEDKMPPYDEQTQAFIDAAQEARNKFEEAERSLKDMEESIRNLEQEISFDFGPNGEFAYLYSQCYELTTNEYVYRLCPFKLVSQKPKLGGSPTSLGTWGSWIGPDHDKFSAMKYEQGTGCWQGPNRSTTVRLLCGKETMVTSTTEPSRCEYLMELMTPAACPEPPPEAPTEDDHDEL,mutated_sequence,1.0,528.0,UPI000007007F.a2m,UPI000007007F.npy,gnomAD
+UPI0001D3B409,UPI0001D3B409.csv,XLHNFSARLWEQLVHFHVMRLTDSLFLWVGATPHLRNLAVAMCSRYVPCPHHLHDADLLGPPGKLFLQLCKETDPVSPGQAAHRGQDSSCSHSPGSPVRVPQKGERSFFFFFFFASG,mutated_sequence,1.0,117.0,UPI0001D3B409.a2m,UPI0001D3B409.npy,gnomAD
+UPI00001BD8AE,UPI00001BD8AE.csv,MAAETQTLNFGPEWLRALSSGGSITSPPLSPALPKYKLADYRYGREEMLALFLKDNKIPSDLLDKEFLPILQEEPLPPLALVPFTEEEQRNFSMSVNSAAVLRLTGRGGGGTVVGAPRGRSSSRGRGRGRGECGFYQRSFDEVEGVFGRGGGREMHRSQSWEERGDRRFEKPGRKDVGRPNFEEGGPTSVGRKHEFIRSESENWRIFREEQNGEDEDGGWRLAGSRRDGERWRPHSPDGPRSAGWREHMERRRRFEFDFRDRDDERGYRRVRSGSGSIDDDRDSLPEWCLEDAEEEMGTFDSSGAFLSLKKVQKEPIPEEQEMDFRPVDEGEECSDSEGSHNEEAKEPDKTNKKEGEKTDRVGVEASEETPQTSSSSARPGTPSDHQSQEASQFERKDEPKTEQTEKAEEETRMENSLPAKVPSRGDEMVADVQQPLSQIPSDTASPLLILPPPVPNPSPTLRPVETPVVGAPGMGSVSTEPDDEEGLKHLEQQAEKMVAYLQDSALDDERLASKLQEHRAKGVSIPLMHEAMQKWYYKDPQGEIQGPFNNQEMAEWFQAGYFTMSLLVKRACDESFQPLGDIMKMWGRVPFSPGPAPPPHMGELDQERLTRQQELTALYQMQHLQYQQFLIQQQYAQVLAQQQKAALSSQQQQQLALLLQQFQTLKMRISDQNIIPSVTRSVSVPDTGSIWELQPTASQPTVWEGGSVWDLPLDTTTPGPALEQLQQLEKAKAAKLEQERREAEMRAKREEEERKRQEELRRQQEEILRRQQEEERKRREEEELARRKQEEALRRQREQEIALRRQREEEERQQQEEALRRLEERRREEEERRKQEELLRKQEEEAAKWAREEEEAQRRLEENRLRMEEEAARLRHEEEERKRKELEVQRQKELMRQRQQQQEALRRLQQQQQQQQLAQMKLPSSSTWGQQSNTTACQSQATLSLAEIQKLEEERERQLREEQRRQQRELMKALQQQQQQQQQKLSGWGNVSKPSGTTKSLLEIQQEEARQMQKQQQQQQQHQQPNRARNNTHSNLHTSIGNSVWGSINTGPPNQWASDLVSSIWSNADTKNSNMGFWDDAVKEVGPRNSTNKNKNNASLSKSVGVSNRQNKKVEEEEKLLKLFQGVNKAQDGFTQWCEQMLHALNTANNLDVPTFVSFLKEVESPYEVHDYIRAYLGDTSEAKEFAKQFLERRAKQKANQQRQQQQLPQQQQQQPPQQPPQQPQQQDSVWGMNHSTLHSVFQTNQSNNQQSNFEAVQSGKKKKKQKMVRADPSLLGFSVNASSERLNMGEIETLDDY,mutated_sequence,1.0,1299.0,UPI00001BD8AE.a2m,UPI00001BD8AE.npy,gnomAD
+UPI00015ADD19,UPI00015ADD19.csv,MWLKPEEVLLKNALKLWVTQKSSCYFILQRRRGHGEGGGRLTGRLVGALDAVLDSNARVAPFRILLQVPGSQVYSPIACGATLEEINQHWDWLEQNLLHTLSVFDNKDDIASFVKGKVKALIAEETSSRLAEQEEEPEKFREALVKFEARFNFPEAEKLVTYYSCCCWKGRVPRQGWLYLSINHLCFYSFFLGKELKLVVPWVDIQKLERTSNVFLTDTIRITTQNKERDFSMFLNLDEVFKVMEQLADVTLRRLLDNEVFDLDPDLQEPSQITKRDLEARAQNEFFRAFFRLPRKEKLHAVVDCSLWTPFSRCHTTGRMFASDSYICFASREDGCCKIILPLREVVSIEKMEDTSLLPHPIIVSIRSKVAFQFIELRDRDSLVEALLARLKQVHANHPVHYDTSADDDMASLVFHSTSMCSDHRFGDLEMMSSQNSEESEKEKSPLMHPDALVTAFQQSGSQSPDSRMSREQIKISLWNDHFVEYGRTVCMFRTEKIRKLVAMGIPESLRGRLWLLFSDAVTDLASHPGYYGNLVEESLGKCCLVTEEIERDLHRSLPEHPAFQNETGIAALRRVLTAYAHRNPKIGYCQSMNILTSVLLLYTKEEEAFWLLVAVCERMLPDYFNHRVIGAQVDQSVFEELIKGHLPELAEHMNDLSALASVSLSWFLTLFLSIMPLESAVNVVDCFFYDGIKAIFQLGLAVLEANAEDLCSSKDDGQALMILSRFLDHIKNEDSPGPPVGSHHAFFSDDQEPYPVTDISDLIRDSYEKFGDQSVEQIEHLRYKHRIRVLQGHEDTTKQNVLRVVIPEVSILPEDLEELYDLFKREHMMSCYWEQPRPMASRHDPSRPYAEQYRIDARQFAHLFQLVSPWTCGAHTEILAERTFRLLDDNMDQLIEFKAFVSCLDIMYNGEMNEKIKLLYRLHIPPALTENDRDSQSPLRNPLLSTSRPLVFGKPNGDAVDYQKQLKQMIKDLAKEKDKTEKELPKMSQREFIQFCKTLYSMFHEDPEENDLYQAIATVTTLLLQIGEVGQRGSSSGSCSQECGEELRASAPSPEDSVFADTGKTPQDSQAFPEAAERDWTVSLEHILASLLTEQSLVNFFEKPLDMKSKLENAKINQYNLKTFEMSHQSQSELKLSNL,mutated_sequence,1.0,1140.0,UPI00015ADD19.a2m,UPI00015ADD19.npy,gnomAD
+UPI00001604C8,UPI00001604C8.csv,MEAPTVETPPDPSPPSAPAPALVPLRAPDVARLREEQEKVVTNCQERIQHWKKVDNDYNALRERLSTLPDKLSYNIMVPFGPFAFMPGKLVHTNEVTVLLGDNWFAKCSAKQAVGLVEHRKEHVRKTIDDLKKVMKNFESRVEFTEDLQKMSDAAGDIVDIREEIKCDFEFKAKHRIAHKPHSKPKTSDIFEADIANDVKSKDLLADKELWARLEELERQEELLGELDSKPDTVIANGEDTTSSEEEKEDRNTNVNAMHQVTDSHTPCHKDVASSEPFSGQVNSQLNCSVNGSSSYHSDDDDDDDDDDDDDNIDDDDGDNDHEALGVGDNSIPTIYFSHTVEPKRVRINTGKNTTLKFSEKKEEAKRKRKNSTGSGHSAQELPTIRTPADIYRAFVDVVNGEYVPRKSILKSRSRENSVCSDTSESSAAEFDDRRGVLRSISCEEATCSDTSESILEEEPQENQKKLLPLSVTPEAFSGTVIEKEFVSPSLTPPPAIAHPALPTIPERKEVLLEASEETGKRVSKFKAARLQQKD,mutated_sequence,1.0,535.0,UPI00001604C8.a2m,UPI00001604C8.npy,gnomAD
+UPI00005956CD,UPI00005956CD.csv,MPSGSSAALALAAAPAPLPQPPPPPPPPPPPLPPPSGGPELEGDGLLLRERLAALGLDDPSPAEPGAPALRAPAAAAQGQARRAAELSPEERAPPGRPGAPEAAELELEEDEEEGEEAELDGDLLEEEELEEAEEEDRSSLLLLSPPAATASQTQQIPGGSLGSVLLPAARFDAREAAAAAAAAGVLYGGDDAQGMMAAMLSHAYGPGGCGAAAAALNGEQAALLRRKSVNTTECVPVPSSEHVAEIVGRQGCKIKALRAKTNTYIKTPVRGEEPIFVVTGRKEDVAMAKREILSAAEHFSMIRASRNKNGPALGGLSCSPNLPGQTTVQVRVPYRVVGLVVGPKGATIKRIQQQTHTYIVTPSRDKEPVFEVTGMPENVDRAREEIEMHIAMRTGNYIELNEENDFHYNGTDVSFEGGTLGSAWLSSNPVPPSRARMISNYRNDSSSSLGSGSTDSYFGSNRLADFSPTSPFSTGNFWFGDTLPSVGSEDLAVDSPAFDSLPTSAQTIWTPFEPVNPLSGFGSDPSGNMKTQRRGSQPSTPRLSPTFPESIEHPLARRVRSDPPSTGNHVGLPIYIPAFSNGTNSYSSSNGGSTSSSPPESRRKHDCVICFENEVIAALVPCGHNLFCMECANKICEKRTPSCPVCQTAVTQAIQIHS,mutated_sequence,1.0,659.0,UPI00005956CD.a2m,UPI00005956CD.npy,gnomAD
+UPI0000071CBA,UPI0000071CBA.csv,MSQKSWIESTLTKRECVYIIPSSKDPHRCLPGCQICQQLVRCFCGRLVKQHACFTASLAMKYSDVKLGDHFNQAIEEWSVEKHTEQSPTDAYGVINFQGGSHSYRAKYVRLSYDTKPEVILQLLLKEWQMELPKLVISVHGGMQKFELHPRIKQLLGKGLIKAAVTTGAWILTGGVNTGVAKHVGDALKEHASRSSRKICTIGIAPWGVIENRNDLVGRDVVAPYQTLLNPLSKLNVLNNLHSHFILVDDGTVGKYGAEVRLRRELEKTINQQRIHARIGQGVPVVALIFEGGPNVILTVLEYLQESPPVPVVVCEGTGRAADLLAYIHKQTEEGGNLPDAAEPDIISTIKKTFNFGQNEALHLFQTLMECMKRKELITVFHIGSDEHQDIDVAILTALLKGTNASAFDQLILTLAWDRVDIAKNHVFVYGQQWLVGSLEQAMLDALVMDRVAFVKLLIENGVSMHKFLTIPRLEELYNTKQGPTNPMLFHLVRDVKQGNLPPGYKITLIDIGLVIEYLMGGTYRCTYTRKRFRLIYNSLGGNNRRSGRNTSSSTPQLRKSHESFGNRADKKEKMRHNHFIKTAQPYRPKIDTVMEEGKKKRTKDEIVDIDDPETKRFPYPLNELLIWACLMKRQVMARFLWQHGEESMAKALVACKIYRSMAYEAKQSDLVDDTSEELKQYSNDFGQLAVELLEQSFRQDETMAMKLLTYELKNWSNSTCLKLAVSSRLRPFVAHTCTQMLLSDMWMGRLNMRKNSWYKVILSILVPPAILLLEYKTKAEMSHIPQSQDAHQMTMDDSENNFQNITEEIPMEVFKEVRILDSNEGKNEMEIQMKSKKLPITRKFYAFYHAPIVKFWFNTLAYLGFLMLYTFVVLVQMEQLPSVQEWIVIAYIFTYAIEKVREIFMSEAGKVNQKIKVWFSDYFNISDTIAIISFFIGFGLRFGAKWNFANAYDNHVFVAGRLIYCLNIIFWYVRLLDFLAVNQQAGPYVMMIGKMVANMFYIVVIMALVLLSFGVPRKAILYPHEAPSWTLAKDIVFHPYWMIFGEVYAYEIDVCANDSVIPQICGPGTWLTPFLQAVYLFVQYIIMVNLLIAFFNNVYLQVKAISNIVWKYQRYHFIMAYHEKPVLPPPLIILSHIVSLFCCICKRRKKDKTSDGPKLFLTEEDQKKLHDFEEQCVEMYFNEKDDKFHSGSEERIRVTFERVEQMCIQIKEVGDRVNYIKRSLQSLDSQIGHLQDLSALTVDTLKTLTAQKASEASKVHNEITRELSISKHLAQNLIDDGPVRPSVWKKHGVVNTLSSSLPQGDLESNNPFHCNILMKDDKDPQCNIFGQDLPAVPQRKEFNFPEAGSSSGALFPSAVSPPELRQRLHGVELLKIFNKNQKLGSSSTSIPHLSSPPTKFFVSTPSQPSCKSHLETGTKDQETVCSKATEGDNTEFGAFVGHRDSMDLQRFKETSNKIKILSNNNTSENTLKRVSSLAGFTDCHRTSIPVHSKQAEKISRRPSTEDTHEVDSKAALIPDWLQDRPSNREMPSEEGTLNGLTSPFKPAMDTNYYYSAVERNNLMRLSQSIPFTPVPPRGEPVTVYRLEESSPNILNNSMSSWSQLGLCAKIEFLSKEEMGGGLRRAVKVQCTWSEHDILKSGHLYIIKSFLPEVVNTWSSIYKEDTVLHLCLREIQQQRAAQKLTFAFNQMKPKSIPYSPRFLEVFLLYCHSAGQWFAVEECMTGEFRKYNNNNGDEIIPTNTLEEIMLAFSHWTYEYTRGELLVLDLQGVGENLTDPSVIKAEEKRSCDMVFGPANLGEDAIKNFRAKHHCNSCCRKLKLPDLKRNDYTPDKIIFPQDEPSDLNLQPGNSTKESESTNSVRLML,mutated_sequence,1.0,1865.0,UPI0000071CBA.a2m,UPI0000071CBA.npy,gnomAD
+UPI000013E29D,UPI000013E29D.csv,MQLGEQLLVSSVNLPGAHFYPLESARGGSGGSAGHLPSAAPSPQKLDLDKASKKFSGSLSCEAVSGEPAAASAGAPAAMLSDTDAGDAFASAAAVAKPGPPDGRKGSPCGEEELPSAAAAAAAAAAAAAATARYSMDSLSSERYYLQSPGPQGSELAAPCSLFPYQAAAGAPHGPVYPAPNGARYPYGSMLPPGGFPAAVCPPGRAQFGPGAGAGSGAGGSSGGGGGPGTYQYSQGAPLYGPYPGAAAAGSCGGLGGLGVPGSGFRAHVYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNINGLNPTAHYNVFVEVVLADPNHWRFQGGKWVTCGKADNNMQGNKMYVHPESPNTGSHWMRQEISFGKLKLTNNKGANNNNTQMIVLQSLHKYQPRLHIVEVTEDGVEDLNEPSKTQTFTFSETQFIAVTAYQNTDITQLKIDHNPFAKGFRDNYDSSHQIVPGGRYGVQSFFPEPFVNTLPQARYYNGERTVPQTNGLLSPQQSEEVANPPQRWLVTPVQQPGTNKLDISSYESEYTSSTLLPYGIKSLPLQTSHALGYYPDPTFPAMAGWGGRGSYQRKMAAGLPWTSRTSPTVFSEDQLSKEKVKEEIGSSWIETPPSIKSLDSNDSGVYTSACKRRRLSPSNSSNENSPSIKCEDINAEEYSKDTSKGMGGYYAFYTTP,mutated_sequence,1.0,686.0,UPI000013E29D.a2m,UPI000013E29D.npy,gnomAD
+UPI000003F57A,UPI000003F57A.csv,MFSVLSYGRLVARAVLGGLSQTDPRAGGGGGGDYGLVTAGCGFGKDFRKGLLKKGACYGDDACFVARHRSADVLGVADGVGGWRDYGVDPSQFSGTLMRTCERLVKEGRFVPSNPIGILTTSYCELLQNKVPLLGSSTACIVVLDRTSHRLHTANLGDSGFLVVRGGEVVHRSDEQQHYFNTPFQLSIAPPEAEGVVLSDSPDAADSTSFDVQLGDIILTATDGLFDNMPDYMILQELKKLKNSNYESIQQTARSIAEQAHELAYDPNYMSPFAQFACDNGLNVRGGKPDDITVLLSIVAEYTD,mutated_sequence,1.0,304.0,UPI000003F57A.a2m,UPI000003F57A.npy,gnomAD
+UPI0000160243,UPI0000160243.csv,MELSSMKICAAIPTSRALPEVVRRMPRKRISGLEWLLQQDPGFSLVNTVKAGMIISFPSNNIYSSVCCCQSEIFKYEFSNSKKSSWIQEERHLGKNNVLYSAHDVSPEKVTSALKKTNKQTTTINNFPLQYLPGSKLLDRFLSLSRSLLCLNSWSSSLPLAPQVKKK,mutated_sequence,1.0,167.0,UPI0000160243.a2m,UPI0000160243.npy,gnomAD
+UPI0001F784C6,UPI0001F784C6.csv,XFSSARLEEVGWLSGVGGLRSGGWCVQSSRTPGQSRVLGRGLGYKQQVRGWGHCEAVLVITLLIAAYLKLSVSLTHLKGRKETPFENLFVHLKVFTQVIHQVTHLLNQYH,mutated_sequence,1.0,110.0,UPI0001F784C6.a2m,UPI0001F784C6.npy,gnomAD
+UPI00004708E6,UPI00004708E6.csv,MARACLIQRLPIKRDCTPVFVRGLSSPSAAMALLSEGLDEVPAACLSPCGPPNPTELFSESRRLALEELVAGGPEAFAAFLRRERLARFLNPDEVHAILRAAERPGEEGAAAAAAAEDSFGSSHDCSSGTYFPEQSDLEPPLLELGWPAFYQGAYRGATRVETHFQPRGAGEGGPYGCKDALRQQLRSAREVIAVVMDVFTDIDIFRDLQEICRKQGVAVYILLDQALLSQFLDMCMDLKVHPEQEKLMTVRTITGNIYYARSGTKIIGKVHEKFTLIDGIRVATGSYSFTWTDGKLNSSNLVILSGQVVEHFDLEFRILYAQSKPISPKLLSHFQSSNKFDHLTNRKPQSKELTLGNLLRMRLARLSSTPRKADLDPEMPAEGKAERKPHDCESSTVSEEDYFSSHRDELQSRKAIDAATQTEPGEEMPGLSVSEVGTQTSITTACAGTQTAVITRIASSQTTIWSRSTTTQTDMDENILFPRGTQSTEGSPVSKMSVSRSSSLKSSSSVSSQGSVASSTGSPASIRTTDFHNPGYPKYLGTPHLELYLSDSLRNLNKERQFHFAGIRSRLNHMLAMLSRRTLFTENHLGLHSGNFSRVNLLAVRDVALYPSYQ,mutated_sequence,1.0,615.0,UPI00004708E6.a2m,UPI00004708E6.npy,gnomAD
+UPI00001AE6F8,UPI00001AE6F8.csv,MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVSLPSGISMDTESKSDQLTPHSQASVTQNITVVPVPSTGLMTAGVSCSQRWRREGSQSRGPGLVITSPSGSLVTTASSAQTFPISAPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGSVAPKPPRGRKKKRMLESGLPEMNDPYVLSPEDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKPHKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIKPHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKCAHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEVHLSTHTVKHAKVYTCTICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQQQQQQQQQQQPPPHFQSPGAAPQGGGGGDSNPNPPPQCSFDLTPYKTAEHHKDICLTVTTSTIQVEHLASS,mutated_sequence,1.0,577.0,UPI00001AE6F8.a2m,UPI00001AE6F8.npy,gnomAD
+UPI000013F990,UPI000013F990.csv,MIPGKYRSVSGRAANNVNCGLHLVIQTSSLPEKNKVEFKLNKDTSSFPGRLLQHDLERNYSSRQGDHINLVSSSLSSFPILQRSSEEKILYSDRLSLERQKLTVCPIINGEDHLRLLNFQHNFITRIQNISNLQKLISLDLYDNQIEEISGLSTLRCLRVLLLGKNRIKKISNLENLKSLDVLDLHGNQITKIENINHLCELRVLNLARNFLSHVDNLNGLDSLTELNLRHNQITFVRDVDNLPCLQHLFLSFNNISSFDSVSCLADSSSLSDITFDGNPIAQESWYKHTVLQNMMQLRQLDMKRITEEERRMASVLAKKEEEKKRESHKQSLLKEKKRLTINNVARQWDLQQQRVANIATNEDRKDSDSPQDPCQIDGSTLSAFPEETGPLDSGLNNALQGLSVIDTYLVEVDGDTLSLYGSGALESLDRNWSVQTAGMITTVSFTFIEFDEIVQVLPKLKIKFPNSLHLKFKETNLVMLQQFNALAQLRRIDQLTIDPQGNPVVNFTLWKYYVLFRLSHFSMQKINGTEVTQNDMIMAERLFGILAHVASSELPQYRLISILGDARKKQFRYLLESKGKKPGIINEENNDSKRLVGENTNRATLNYTTRDFYNEKLEEIKEKKKFCKTYIEDLVKEATEINMKNEALQKLWPQMFIELVRDAVIEIRNKNSYMKLCLQQITDQK,mutated_sequence,1.0,686.0,UPI000013F990.a2m,UPI000013F990.npy,gnomAD
+UPI000156FA8B,UPI000156FA8B.csv,MMRRTLENRNAQTKQLQTAVSNVEKHFGELCQIFAAYVRKTARLRDKADLLVNEINAYAATETPHLKLGLMNFADEFAKLQDYRQAEVERLEAKVVEPLKTYGTIVKMKRDDLKATLTARNREAKQLTQLERTRQRNPSDRHVISQAETELQRAAMDASRTSRHLEETINNFERQKMKDIKTIFSEFITIEMLFHGKALEVYTAAYQNIQNIDEDEDLEVFRNSLYAPDYSSRLDIVRANSKSPLQRSLSAKCVSGTGQVSTCRLRKDQQAEDDEDDELDVTEEENFLK,mutated_sequence,1.0,289.0,UPI000156FA8B.a2m,UPI000156FA8B.npy,gnomAD
+UPI00001FF5EB,UPI00001FF5EB.csv,MVFKLKTKEEQHSMLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFEELKKKT,mutated_sequence,1.0,379.0,UPI00001FF5EB.a2m,UPI00001FF5EB.npy,gnomAD
+UPI00016632FD,UPI00016632FD.csv,MDGRDFGPQRSVHGPPPPLLSGLAMDSHRVGAATAGRLPASGLPGPLPPGKYMAGLNLHPHPGEAFLGSFVASGMGPSASSHGSPVPLPSDLSFRSPTPSNLPMVQLWAAHAHEGFSHLPSGLYPSYLHLNHLEPPSSGSPLLSQLGQPSIFDTQKGQGPGGDGFYLPTAGAPGSLHSHAPSARTPGGGHSSGAPAKGSSSRDGPAKERAGRGGEPPPLFGKKDPRARGEEASGPRGVVDLTQEARAEGRQDRGPPRLAERLSPFLAESKTKNAALQPSVLTMCNGGAGDVGLPALVAEAGRGGAKEAARQDEGARLLRRTETLLPGPRPCPSPLPPPPAPPKGPPAPPAATPAGVYTVFREQGREHRVVAPTFVPSVEAFDERPGPIQIASQARDARAREREAGRPGVLQAPPGSPRPLDRPEGLREKNSVIRSLKRPPPADAPTVRATRASPDPRAYVPAKELLKPEADPRPCERAPRGPAGPAAQQAAKLFGLEPGRPPPTGPEHKWKPFELGNFAATQMAVLAAQHHHSRAEEEAAVVAASSSKKAYLDPGAVLPRSAATCGRPVADMHSAAHGSGEASAMQSLIKYSGSFARDAVAVRPGGCGKKSPFGGLGTMKPEPAPTSAGASRAQARLPHSGGPAAGGGRQLKRDPERPESAKAFGREGSGAQGEAEVRHPPVGIAVAVARQKDSGGSGRLGPGLVDQERSLSLSNVKGHGRADEDCVDDRARHREERLLGARLDRDQEKLLRESKELADLARLHPTSCAPNGLNPNLMVTGGPALAGSGRWSADPAAHLATHPWLPRSGNASMWLAGHPYGLGPPSLHQGMAPAFPPGLGGSLPSAYQFVRDPQSGQLVVIPSDHLPHFAELMERATVPPLWPALYPPGRSPLHHAQQLQLFSQQHFLRQQEFLYLQQQAAQALELQRSAQLVQERLKAQEHRAEMEEKGSKRGLEAAGKAGLATAGPGLLPRKPPGLAAGPAGTYGKAVSPPPSPRASPVAALKAKVIQKLEDVSKPPAYAYPATPSSHPTSPPPASPPPTPGITRKEEAPENVVEKKDLELEKEAPSPFQALFSDIPPRYPFQALPPHYGRPYPFLLQPTAAADADGLAPDVPLPADGPERLALSPEDKPIRLSPSKITEPLREGPEEEPLAEREVKAEVEDMDEGPTELPPLESPLPLPAAEAMATPSPAGGCGGGLLEAQALSATGQSCAEPSECPDFVEGPEPRVDSPGRTEPCTAALDLGVQLTPETLVEAKEEPVEVPVAVPVVEAVPEEGLAQVAPSESQPTLEMSDCDVPAGEGQCPSLEPQEAVPVLGSTCFLEEASSDQFLPSLEDPLAGMNALAAAAELPQARPLPSPGAAGAQALEKLEAAESLVLEQSFLHGITLLSEIAELELERRSQEMGGAERALVARPSLESLLAAGSHMLREVLDGPVVDPLKNLRLPRELKPNKKYSWMRKKEERMYAMKSSLEDMDALELDFRMRLAEVQRQYKEKQRELVKLQRRRDSEDRREEPHRSLARRGPGRPRKRTHAPSALSPPRKRGKSGHSSGKLSSKSLLTSDDYELGAGIRKRHKGSEEEHDALIGMGKARGRNQTWDEHEASSDFISQLKIKKKKMASDQEQLASKLDKALSLTKQDKLKSPFKFSDSAGGKSKTSGGCGRYLTPYDSLLGKNRKALAKGLGLSLKSSREGKHKRAAKTRKMEVGFKARGQPKSAHSPFASEVSSYSYNTDSEEDEEFLKDEWPAQGPSSSKLTPSLLCSMVAKNSKAAGGPKLTKRGLAAPRTLKPKPATSRKQPFCLLLREAEARSSFSDSSEESFDQDESSEEEDEEEELEEEDEASGGGYRLGARERALSPGLEESGLGLLARFAASALPSPTVGPSLSVVQLEAKQKARKKEERQSLLGTEFEYTDSESEVKVRKRSPAGLLRPKKGLGEPGPSLAAPTPGARGPDPSSPDKAKLAVEKGRKARKLRGPKEPGFEAGPEASDDDLWTRRRSERIFLHDASAAAPAPVSTAPATKTSRCAKGGPLSPRKDAGRAKDRKDPRKKKKGKEAGPGAGLPPPRAPALPSEARAPHASSLTAAKRSKAKAKGKEVKKENRGKGGAVSKLMESMAAEEDFEPNQDSSFSEDEHLPRGGAVERPLTPAPRSCIIDKDELKDGLRVLIPMDDKLLYAGHVQTVHSPDIYRVVVEGERGNRPHIYCLEQLLQEAIIDVRPASTRFLPQGTRIAAYWSQQYRCLYPGTVVRGLLDLEDDGDLITVEFDDGDTGRIPLSHIRLLPPDYKIQCAEPSPALLVPSAKRRSRKTSKDTGEGKDGGTAGSEEPGAKARGRGRKPSAKAKGDRAATLEEGNPTDEVPSTPLALEPSSTPGSKKSPPEPVDKRAKAPKARPAPPQPSPAPPAFTSCPAPEPFAELPAPATSLAPAPLITMPATRPKPKKARAAEESGAKGPRRPGEEAELLVKLDHEGVTSPKSKKAKEALLLREDPGAGGWQEPKSLLSLGSYPPAAGSSEPKAPWPKATDGDLAQEPGPGLTFEDSGNPKSPDKAQAEQDGAEESESSSSSSSGSSSSSSSSSSSGSETEGEEEGDKNGDGGCGTGGRNCSAASSRAASPASSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSTTDEDSSCSSDDEAAPAPTAGPSAQAALPTKATKQAGKARPSAHSPGKKTPAPQPQAPPPQPTQPLQPKAQAGAKSRPKKREGVHLPTTKELAKRQRLPSVENRPKIAAFLPARQLWKWFGKPTQRRGMKGKARKLFYKAIVRGKEMIRIGDCAVFLSAGRPNLPYIGRIQSMWESWGNNMVVRVKWFYHPEETSPGKQFHQGQHWDQKSSRSLPAALRVSSQRKDFMERALYQSSHVDENDVQTVSHKCLVVGLEQYEQMLKTKKYQDSEGLYYLAGTYEPTTGMIFSTDGVPVLC,mutated_sequence,1.0,2968.0,UPI00016632FD.a2m,UPI00016632FD.npy,gnomAD
+UPI0000199CEF,UPI0000199CEF.csv,MSRPRNNPQTSSPQDSTKDGSSFHYFQGRFELSGKSRQYPADALEPQPGIGDVKVIEKATKSMLDPAQRSHFYLVTPSLVFLCFIFDGLHKALLSVGVSKRSNIVIGNENKETGTLYASKFEDVLPTFTALEMSSILRHCCDLIGIAAGSSDPICTNSLQVQRQFKAMMISIGRPLHSESADLLISYNAGPAIDWINSRPWVGGLMFTFLFGEFESPACELLDQVKVVASKAQMMTYYTVRMFLDQCVDGSTALPAVVLEIPVFEQKKPLAKKVLGDFFEFGGVLRHPVIGVLSPQMFPNLATAANYWAKRRNSTFSGFEALDIIPGSTITFPVLQMASAQKISRGSDMDPYTLNILRGYGISGFE,mutated_sequence,1.0,366.0,UPI0000199CEF.a2m,UPI0000199CEF.npy,gnomAD
+UPI00021CF38C,UPI00021CF38C.csv,MRTAAGAVSPDSRPETRRQTRKNEEAAWGPRVCRAEREDNRKCPPSILKRSRPEHHRPEAKPQRTSRRVWFREPPAVTVHYIADKNATATVRAVCVRPARAGPGPILRPGQARGNGTGGPAGPAPRPCPAPAARGPHLLARPPAALTGRTGRTGRASRRAQALRGLRDSLSPGPAAW,mutated_sequence,1.0,177.0,UPI00021CF38C.a2m,UPI00021CF38C.npy,gnomAD
+UPI000013D80E,UPI000013D80E.csv,MAAAVLSGPSAGSAAGVPGGTGGLSAVSSGPRLRLLLLESVSGLLQPRTGSAVAPVHPPNRSAPHLPGLMCLLRLHGSVGGAQNLSALGALVSLSNARLSSIKTRFEGLCLLSLLVGESPTELFQQHCVSWLRSIQQVLQTQDPPATMELAVAVLRDLLRYAAQLPALFRDISMNHLPGLLTSLLGLRPECEQSALEGMKACMTYFPRACGSLKGKLASFFLSRVDALSPQLQQLACECYSRLPSLGAGFSQGLKHTESWEQELHSLLASLHTLLGALYEGAETAPVQNEGPGVEMLLSSEDGDAHVLLQLRQRFSGLARCLGLMLSSEFGAPVSVPVQEILDFICRTLSVSSKNISLHGDGPLRLLLLPSIHLEALDLLSALILACGSRLLRFGILIGRLLPQVLNSWSIGRDSLSPGQERPYSTVRTKVYAILELWVQVCGASAGMLQGGASGEALLTHLLSDISPPADALKLRSPRGSPDGSLQTGKPSAPKKLKLDVGEAMAPPSHRKGDSNANSDVCAAALRGLSRTILMCGPLIKEETHRRLHDLVLPLVMGVQQGEVLGSSPYTSSRCRRELYCLLLALLLAPSPRCPPPLACALQAFSLGQREDSLEVSSFCSEALVTCAALTHPRVPPLQPMGPTCPTPAPVPPPEAPSPFRAPPFHPPGPMPSVGSMPSAGPMPSAGPMPSAGPVPSARPGPPTTANHLGLSVPGLVSVPPRLLPGPENHRAGSNEDPILAPSGTPPPTIPPDETFGGRVPRPAFVHYDKEEASDVEISLESDSDDSVVIVPEGLPPLPPPPPSGATPPPIAPTGPPTASPPVPAKEEPEELPAAPGPLPPPPPPPPPVPGPVTLPPPQLVPEGTPGGGGPPALEEDLTVININSSDEEEEEEEEEEEEEEEEEEEEEDFEEEEEDEEEYFEEEEEEEEEFEEEFEEEEGELEEEEEEEDEEEEEELEEVEDLEFGTAGGEVEEGAPPPPTLPPALPPPESPPKVQPEPEPEPGLLLEVEEPGTEEERGADTAPTLAPEALPSQGEVEREGESPAAGPPPQELVEEEPSAPPTLLEEETEDGSDKVQPPPETPAEEEMETETEAEALQEKEQDDTAAMLADFIDCPPDDEKPPPPTEPDS,mutated_sequence,1.0,1130.0,UPI000013D80E.a2m,UPI000013D80E.npy,gnomAD
+UPI000013D6CB,UPI000013D6CB.csv,MRAAAAGGGVRTAALALLLGALHWAPARCEEYDYYGWQAEPLHGRSYSKPPQCLDIPADLPLCHTVGYKRMRLPNLLEHESLAEVKQQASSWLPLLAKRCHSDTQVFLCSLFAPVCLDRPIYPCRSLCEAVRAGCAPLMEAYGFPWPEMLHCHKFPLDNDLCIAVQFGHLPATAPPVTKICAQCEMEHSADGLMEQMCSSDFVVKMRIKEIKIENGDRKLIGAQKKKKLLKPGPLKRKDTKRLVLHMKNGAGCPCPQLDSLAGSFLVMGRKVDGQLLLMAVYRWDKKNKEMKFAVKFMFSYPCSLYYPFFYGAAEPH,mutated_sequence,1.0,317.0,UPI000013D6CB.a2m,UPI000013D6CB.npy,gnomAD
+UPI000013D7E3,UPI000013D7E3.csv,MEQLTTLPRPGDPGAMEPWALPTWHSWTPGRGGEPSSAAPSIADTPPAALQLQELRSEESSKPKGDGSSRPVGGTDPEGAEACLPSLGQQASSSGPACQRPEDEEVEAFLKAKLNMSFGDRPNLELLRALGELRQRCAILKEENQMLRKSSFPETEEKVRRLKRKNAELAVIAKRLEERARKLQETNLRVVSAPLPRPGTSLELCRKALARQRARDLSETASALLAKDKQIAALQRECRELQARLTLVGKEGPQWLHVRDFDRLLRESQREVLRLQRQIALRNQRETLPLPPSWPPGPALQARAGAPAPGAPGEATPQEDADNLPVILGEPEKEQRVQQLESELSKKRKKCESLEQEARKKQRRCEELELQLRQAQNENARLVEENSRLSGRATEKEQVEWENAELRGQLLGVTQERDSALRKSQGLQSKLESLEQVLKHMREVAQRRQQLEVEHEQARLSLREKQEEVRRLQQAQAEAQREHEGAVQLLESTLDSMQARVRELEEQCRSQTEQFSLLAQELQAFRLHPGPLDLLTSALDCGSLGDCPPPPCCCSIPQPCRGSGPKDLDLPPGSPGRCTPKSSEPAPATLTGVPRRTAKKAESLSNSSHSESIHNSPKSCPTPEVDTASEVEELEADSVSLLPAAPEGSRGGARIQVFLARYSYNPFEGPNENPEAELPLTAGEYIYIYGNMDEDGFFEGELMDGRRGLVPSNFVERVSDDDLLTSLPPELADLSHSSGPELSFLSVGGGGSSSGGQSSVGRSQPRPEEEDAGDELSLSPSPEGLGEPPAVPYPRRLVVLKQLAHSVVLAWEPPPEQVELHGFHICVNGELRQALGPGAPPKAVLENLDLWAGPLHISVQALTSRGSSDPLRCCLAVGARAGVVPSQLRVHRLTATSAEITWVPGNSNLAHAIYLNGEECPPASPSTYWATFCHLRPGTPYQAQVEAQLPPQGPWEPGWERLEQRAATLQFTTLPAGPPDAPLDVQIEPGPSPGILIISWLPVTIDAAGTSNGVRVTGYAIYADGQKIMEVASPTAGSVLVELSQLQLLQVCREVVVRTMSPHGESADSIPAPITPALAPASLPARVSCPSPHPSPEARAPLASASPGPGDPSSPLQHPAPLGTQEPPGAPPASPSREMAKGSHEDPPAPCSQEEAGAAVLGTSEERTASTSTLGEKDPGPAAPSLAKQEAEWTAGEACPASSSTQGARAQQAPNTEMCQGGDPGSGLRPRAEKEDTAELGVHLVNSLVDHGRNSDLSDIQEEEEEEEEEEEEELGSRTCSFQKQVAGNSIRENGAKSQPDPFCETDSDEEILEQILELPLQQFCSKKLFSIPEEEEEEEEDEEEEKSGAGCSSRDPGPPEPALLGLGCDSGQPRRPGQCPLSPESSRAGDCLEDMPGLVGGSSRRRGGGSPEKPPSRRRPPDPREHCSRLLSNNGPQASGRLGPTRERGGLPVIEGPRTGLEASGRGRLGPSRRCSRGRALEPGLASCLSPKCLEISIEYDSEDEQEAGSGGISITSSCYPGDGEAWGTATVGRPRGPPKANSGPKPYPRLPAWEKGEPERRGRSATGRAKEPLSRATETGEARGQDGSGRRGPQKRGVRVLRPSTAELVPARSPSETLAYQHLPVRIFVALFDYDPVSMSPNPDAGEEELPFREGQILKVFGDKDADGFYQGEGGGRTGYIPCNMVAEVAVDSPAGRQQLLQRGYLSPDILLEGSGNGPFVYSTAHTTGPPPKPRRSKKAESEGPAQPCPGPPKLVPSADLKAPHSMVAAFDYNPQESSPNMDVEAELPFRAGDVITVFGGMDDDGFYYGELNGQRGLVPSNFLEGPGPEAGGLDREPRTPQAESQRTRRRRVQC,mutated_sequence,1.0,1857.0,UPI000013D7E3.a2m,UPI000013D7E3.npy,gnomAD
+UPI000013CCDF,UPI000013CCDF.csv,MLGWVQRVLPQPPGTPRKTKMQEEEEVEPEPEMEAEVEPEPNPEEAETESESMPPEESFKEEEVAVADPSPQETKEAALTSTISLRAQGAEISEMNSPSRRVLTWLMKGVEKVIPQPVHSITEDPAQILGHGSTGDTGCTDEPNEALEAQDTRPGLRLLLWLEQNLERVLPQPPKSSEVWRDEPAVATGAASDPAPPGRPQEMGPKLQARETPSLPTPIPLQPKEEPKEAPAPEPQPGSQAQTSSLPPTRDPARLVAWVLHRLEMALPQPVLHGKIGEQEPDSPGICDVQTISILPGGQVEPDLVLEEVEPPWEDAHQDVSTSPQGTEVVPAYEEENKAVEKMPRELSRIEEEKEDEEEEEEEEEEEEEEEVTEVLLDSCVVSQVGVGQSEEDGTRPQSTSDQKLWEEVGEEAKKEAEEKAKEEAEEVAEEEAEKEPQDWAETKEEPEAEAEAASSGVPATKQHPEVQVEDTDADSCPLMAEENPPSTVLPPPSPAKSDTLIVPSSASGTHRKKLPSEDDEAEELKALSPAESPVVAWSDPTTPKDTDGQDRAASTASTNSAIINDRLQELVKLFKERTEKVKEKLIDPDVTSDEESPKPSPAKKAPEPAPDTKPAEAEPVEEEHYCDMLCCKFKHRPWKKYQFPQSIDPLTNLMYVLWLFFVVMAWNWNCWLIPVRWAFPYQTPDNIHHWLLMDYLCDLIYFLDITVFQTRLQFVRGGDIITDKKDMRNNYLKSRRFKMDLLSLLPLDFLYLKVGVNPLLRLPRCLKYMAFFEFNSRLESILSKAYVYRVIRTTAYLLYSLHLNSCLYYWASAYQGLGSTHWVYDGVGNSYIRCYYFAVKTLITIGGLPDPKTLFEIVFQLLNYFTGVFAFSVMIGQMRDVVGAATAGQTYYRSCMDSTVKYMNFYKIPKSVQNRVKTWYEYTWHSQGMLDESELMVQLPDKMRLDLAIDVNYNIVSKVALFQGCDRQMIFDMLKRLRSVVYLPNDYVCKKGEIGREMYIIQAGQVQVLGGPDGKSVLVTLKAGSVFGEISLLAVGGGNRRTANVVAHGFTNLFILDKKDLNEILVHYPESQKLLRKKARRMLRSNNKPKEEKSVLILPPRAGTPKLFNAALAMTGKMGGKGAKGGKLAHLRARLKELAALEAAAKQQELVEQAKSSQDVKGEEGSAAPDQHTHPKEAATDPPAPRTPPEPPGSPPSSPPPASLGRPEGEEEGPAEPEEHSVRICMSPGPEPGEQILSVKMPEEREEKAE,mutated_sequence,1.0,1251.0,UPI000013CCDF.a2m,UPI000013CCDF.npy,gnomAD
+UPI00000503E3,UPI00000503E3.csv,MLLETQDALYVALELVIAALSVAGNVLVCAAVGTANTLQTPTNYFLVSLAAADVAVGLFAIPFAITISLGFCTDFYGCLFLACFVLVLTQSSIFSLLAVAVDRYLAICVPLRYKSLVTGTRARGVIAVLWVLAFGIGLTPFLGWNSKDSATNNCTEPWDGTTNESCCLVKCLFENVVPMSYMVYFNFFGCVLPPLLIMLVIYIKIFLVACRQLQRTELMDHSRTTLQREIHAAKSLAMIVGIFALCWLPVHAVNCVTLFQPAQGKNKPKWAMNMAILLSHANSVVNPIVYAYRNRDFRYTFHKIISRYLLCQADVKSGNGQAGVQPALGVGL,mutated_sequence,1.0,332.0,UPI00000503E3.a2m,UPI00000503E3.npy,gnomAD
+UPI000006D8B0,UPI000006D8B0.csv,MGARASGGPLARAGLLLLLLLLLLLGLLAPGAQGARGRGGAEKNSYRRTVNTFSQSVSSLFGEDNVRAAQKFLARLTERFVLGVDMFVETLWKVWTELLDVLGLDVSNLSQYFSPASVSSSPARALLLVGVVLLAYWFLSLTLGFTFSVLHVVFGRFFWIVRVVLFSMSCVYILHKYEGEPENAVLPLCFVVAVYFMTGPMGFYWRSSPSGPSNPSNPSVEEKLEHLEKQVRLLNIRLNRVLESLDRSKDK,mutated_sequence,1.0,251.0,UPI000006D8B0.a2m,UPI000006D8B0.npy,gnomAD
+UPI0001B794AD,UPI0001B794AD.csv,XLPAPAAALPGAPPPPPGVRGYPARARGLQGASGGARFPSSLSLPVPGKGGWVVSRAKFPALDSCEITTAHPRLCFAVSPPFSCSLPLPSLFPHSPLLGRPKRQAKPAADEGFWDCSVCTFRNSAEAFKCSICDVRKGTSTRKPRINSQLVAQQVAQQYATPPPPKKEKKEKVEKQDKEKPEKDKEISPSVTKKNTNKKTKPKSDILKDPPSEANSIQSANATTKTSETNHTSRPRLKNVDRSTAQQLAVTVGNVTVIITDFKEKTRSSSTSSSTVTSSAGSEQQNQSSSGSESTDKGSSRSSTPKGDMSAVNDESF,mutated_sequence,1.0,317.0,UPI0001B794AD.a2m,UPI0001B794AD.npy,gnomAD
+UPI000007413A,UPI000007413A.csv,MEGPSLRGPALRLAGLPTQQDCNIQEKIDLEIRMREGIWKLLSLSTQKDQVLHAVKNLMVCNARLMAYTSELQKLEEQIANQTGRCDVKFESKERTACKGKIAISDIRIPLMWKDSDHFSNKERSRRYAIFCLFKMGANVFDTDVVNVDKTITDICFENVTIFNEAGPDFQIKVEVYSCCTEESSITNTPKKLAKKLKTSISKATGKKISSVLQEEDDEMCLLLSSAVFGVKYNLLAHTTLTLESAEDSFKTHNLSINGNEESSFWLPLYGNMCCRLVAQPACMAEDAFAGFLNQQQMVEGLISWRRLYCVLRGGKLYCFYSPEEIEAKVEPALVVPINKETRIRAMDKDAKKRIHNFSVINPVPGQAITQIFAVDNREDLQKWMEAFWQHFFDLSQWKHCCEELMKIEIMSPRKPPLFLTKEATSVYHDMSIDSPMKLESLTDIIQKKIEETNGQFLIGQHEESLPPPWATLFDGNHQMVIQKKVLYPASEPLHDEKGKKRQAPLPPSDKLPFSLKSQSNTDQLVKDNWGKTSVSQTSSLDTKLSTLMHHLQKPMAAPRKLLPARRNRLSDGEHTDTKTNFEAKPVPAPRQKSIKDILDPRSWLQAQV,mutated_sequence,1.0,609.0,UPI000007413A.a2m,UPI000007413A.npy,gnomAD
+UPI000012CF8B,UPI000012CF8B.csv,MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAGFPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPPPPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN,mutated_sequence,1.0,290.0,UPI000012CF8B.a2m,UPI000012CF8B.npy,gnomAD
+UPI000020A0AE,UPI000020A0AE.csv,MGDVKLVASSHISKTSLSVDPSRVDSMPLTEAPAFILPPRNLCIKEGATAKFEGRVRGYPEPQVTWHRNGQPITSGGRFLLDCGIRGTFSLVIHAVHEEDRGKYTCEATNGSGARQVTVELTVEGSFAKQLGQPVVSKTLGDRFSAPAVETRPSIWGECPPKFATKLGRVVVKEGQMGRFSCKITGRPQPQVTWLKGNVPLQPSARVSVSEKNGMQVLEIHGVNQDDVGVYTCLVVNGSGKASMSAELSIQGLDSANRSFVRETKATNSDVRKEVTNVISKESKLDSLEAAAKSKNCSSPQRGGSPPWAANSQPQPPRESKLESCKDSPRTAPQTPVLQKTSSSITLQAARVQPEPRAPGLGVLSPSGEERKRPAPPRPATFPTRQPGLGSQDVVSKAANRRIPMEGQRDSAFPKFESKPQSQEVKENQTVKFRCEVSGIPKPEVAWFLEGTPVRRQEGSIEVYEDAGSHYLCLLKARTRDSGTYSCTASNAQGQLSCSWTLQVERLAVMEVAPSFSSVLKDCAVIEGQDFVLQCSVRGTPVPRITWLLNGQPIQYARSTCEAGVAELHIQDALPEDHGTYTCLAENALGQVSCSAWVTVHEKKSSRKSEYLLPVAPSKPTAPIFLQGLSDLKVMDGSQVTMTVQVSGNPPPEVIWLHNGNEIQESEDFHFEQRGTQHSLCIQEVFPEDTGTYTCEAWNSAGEVRTQAVLTVQEPHDGTQPWFISKPRSVTASLGQSVLISCAIAGDPFPTVHWLRDGKALCKDTGHFEVLQNEDVFTLVLKKVQPWHAGQYEILLKNRVGECSCQVSLMLQNSSARALPRGREPASCEDLCGGGVGADGGGSDRYGSLRPGWPARGQGWLEEEDGEDVRGVLKRRVETRQHTEEAIRQQEVEQLDFRDLLGKKVSTKTLSEDDLKEIPAEQMDFRANLQRQVKPKTVSEEERKVHSPQQVDFRSVLAKKGTSKTPVPEKVPPPKPATPDFRSVLGGKKKLPAENGSSSAETLNAKAVESSKPLSNAQPSGPLKPVGNAKPAETLKPMGNAKPAETLKPMGNAKPDENLKSASKEELKKDVKNDVNCKRGHAGTTDNEKRSESQGTAPAFKQKLQDVHVAEGKKLLLQCQVSSDPPATIIWTLNGKTLKTTKFIILSQEGSLCSVSIEKALPEDRGLYKCVAKNDAGQAECSCQVTVDDAPASENTKAPEMKSRRPKSSLPPVLGTESDATVKKKPAPKTPPKAAMPPQIIQFPEDQKVRAGESVELFGKVTGTQPITCTWMKFRKQIQESEHMKVENSENGSKLTILAARQEHCGCYTLLVENKLGSRQAQVNLTVVDKPDPPAGTPCASDIRSSSLTLSWYGSSYDGGSAVQSYSIEIWDSANKTWKELATCRSTSFNVQDLLPDHEYKFRVRAINVYGTSEPSQESELTTVGEKPEEPKDEVEVSDDDEKEPEVDYRTVTINTEQKVSDFYDIEERLGSGKFGQVFRLVEKKTRKVWAGKFFKAYSAKEKENIRQEISIMNCLHHPKLVQCVDAFEEKANIVMVLEIVSGGELFERIIDEDFELTERECIKYMRQISEGVEYIHKQGIVHLDLKPENIMCVNKTGTRIKLIDFGLARRLENAGSLKVLFGTPEFVAPEVINYEPIGYATDMWSIGVICYILVSGLSPFMGDNDNETLANVTSATWDFDDEAFDEISDDAKDFISNLLKKDMKNRLDCTQCLQHPWLMKDTKNMEAKKLSKDRMKKYMARRKWQKTGNAVRAIGRLSSMAMISGLSGRKSSTGSPTSPLNAEKLESEEDVSQAFLEAVAEEKPHVKPYFSKTIRDLEVVEGSAARFDCKIEGYPDPEVVWFKDDQSIRESRHFQIDYDEDGNCSLIISDVCGDDDAKYTCKAVNSLGEATCTAELIVETMEEGEGEGEEEEE,mutated_sequence,1.0,1914.0,UPI000020A0AE.a2m,UPI000020A0AE.npy,gnomAD
+UPI00001323E1,UPI00001323E1.csv,MIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLAKQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFRLFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFAYYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPY,mutated_sequence,1.0,480.0,UPI00001323E1.a2m,UPI00001323E1.npy,gnomAD
+UPI0000136846,UPI0000136846.csv,MPQLGGGGGGGGGGSGGGGGSSAGAAGGGDDLGANDELIPFQDEGGEEQEPSSDSASAQRDLDEVKSSLVNESENQSSSSDSEAERRPQPVRDTFQKPRDYFAEVRRPQDSAFFKGPPYPGYPFLMIPDLSSPYLSNGPLSPGGARTYLQMKWPLLDVPSSATVKDTRSPSPAHLSNKVPVVQHPHHMHPLTPLITYSNDHFSPGSPPTHLSPEIDPKTGIPRPPHPSELSPYYPLSPGAVGQIPHPLGWLVPQQGQPMYSLPPGGFRHPYPALAMNASMSSLVSSRFSPHMVAPAHPGLPTSGIPHPAIVSPIVKQEPAPPSLSPAVSVKSPVTVKKEEEKKPHVKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRKWHNLSREEQAKYYELARKERQLHSQLYPTWSARDNYGKKKKRKREKQLSQTQSQQQVQEAEGALASKSKKPCVQYLPPEKPCDSPASSHGSMLDSPATPSAALASPAAPAATHSEQAQPLSLTTKPETRAQLALHSAAFLSAKAAASSSGQMGSQPPLLSRPLPLGSMPTALLASPPSFPATLHAHQALPVLQAQPLSLVTKSAH,mutated_sequence,1.0,588.0,UPI0000136846.a2m,UPI0000136846.npy,gnomAD
+UPI00000742DC,UPI00000742DC.csv,MGKLRRRYNIKGRQQAGPGPSKGPPEPPPVQLELEDKDTLKGVDASNALVLPGKKKKKTKAPPLSKKEKKPLTKKEKKVLQKILEQKEKKSQRAEMLQKLSEVQASEAEMRLFYTTSKLGTGNRMYHTKEKADEVVAPGQEKISSLSGAHRKRRRWPSAEEEEEEEEESESELEEESELDEDPAAEPAEAGVGTTVAPLPPAPAPSSQPVPAGMTVPPPPAAAPPLPRALAKPAVFIPVNRSPEMQEERLKLPILSEEQVIMEAVAEHPIVIVCGETGSGKTTQVPQFLYEAGFSSEDSIIGVTEPRRVAAVAMSQRVAKEMNLSQRVVSYQIRYEGNVTEETRIKFMTDGVLLKEIQKDFLLLRYKVVIIDEAHERSVYTDILIGLLSRIVTLRAKRNLPLKLLIMSATLRVEDFTQNPRLFAKPPPVIKVESRQFPVTVHFNKRTPLEDYSGECFRKVCKIHRMLPAGGILVFLTGQAEVHALCRRLRKAFPPSRARPQEKDDDQKDSVEEMRKFKKSRARAKKARAEVLPQINLDHYSVLPAGEGDEDREAEVDEEEGALDSDLDLDLGDGGQDGGEQPDASLPLHVLPLYSLLAPEKQAQVFKPPPEGTRLCVVATNVAETSLTIPGIKYVVDCGKVKKRYYDRVTGVSSFRVTWVSQASADQRAGRAGRTEPGHCYRLYSSAVFGDFEQFPPPEITRRPVEDLILQMKALNVEKVINFPFPTPPSVEALLAAEELLIALGALQPPQKAERVKQLQENRLSCPITALGRTMATFPVAPRYAKMLALSRQHGCLPYAITIVASMTVRELFEELDRPAASDEELTRLKSKRARVAQMKRTWAGQGASLKLGDLMVLLGAVGACEYASCTPQFCEANGLRYKAMMEIRRLRGQLTTAVNAVCPEAELFVDPKMQPPTESQVTYLRQIVTAGLGDHLARRVQSEEMLEDKWRNAYKTPLLDDPVFIHPSSVLFKELPEFVVYQEIVETTKMYMKGVSSVEVQWIPALLPSYCQFDKPLEEPAPTYCPERGRVLCHRASVFYRVGWPLPAIEVDFPEGIDRYKHFARFLLEGQVFRKLASYRSCLLSSPGTMLKTWARLQPRTESLLRALVAEKADCHEALLAAWKKNPKYLLAEYCEWLPQAMHPDIEKAWPPTTVH,mutated_sequence,1.0,1157.0,UPI00000742DC.a2m,UPI00000742DC.npy,gnomAD
+UPI0000203ED6,UPI0000203ED6.csv,MKEPLLGGECDKAVASQLGLLDEIKTEPDNAQEYCHRQQSRTQENELKINAVFSESASQLTAGIQLSLASSGVNKMLPSVSTTAIQVSCAGCKKILQKGQTAYQRKGSAQLFCSIPCITEYISSASSPVPSKRTCSNCSKDILNPKDVISVQLEDTTSCKTFCSLSCLSSYEEKRKPFVTICTNSILTKCSMCQKTAIIQYEVKYQNVKHNLCSNACLSKFHSANNFIMNCCENCGTYCYTSSSLSHILQMEGQSHYFNSSKSITAYKQKPAKPLISVPCKPLKPSDEMIETTSDLGKTELFCSINCFSAYSKAKMESSSVSVVSVVHDTSTELLSPKKDTTPVISNIVSLADTDVALPIMNTDVLQDTVSSVTATADVIVDLSKSSPSEPSNAVASSSTEQPSVSPSSSVFSQHAIGSSTEVQKDNMKSMKISDELCHPKCTSKVQKVKGKSRSIKKSCCADFECLENSKKDVAFCYSCQLFCQKYFSCGRESFATHGTSNWKKTLEKFRKHEKSEMHLKSLEFWREYQFCDGAVSDDLSIHSKQIEGNKKYLKLIIENILFLGKQCLPLRGNDQSVSSVNKGNFLELLEMRAKDKGEETFRLMNSQVDFYNSTQIQSDIIEIIKTEMLQDIVNEINDSSAFSIICDETINSAMKEQLSICVRYPQKSSKAILIKERFLGFVDTEEMTGTHLHRTIKTYLQQIGVDMDKIHGQAYDSTTNLKIKFNKIAAEFKKEEPRALYIHCYAHFLDLSIIRFCKEVKELRSALKTLSSLFNTICMSGEMLANFRNIYRLSQNKTCKKHISQSCWTVHDRTLLSVIDSLPEIIETLEVIASHSSNTSFADELSHLLTLVSKFEFVFCLKFLYRVLSVTGILSKELQNKTIDIFSLSSKIEAILECLSSERNDVYFKTIWDGTEEICQKITCKGFKVEKPSLQKRRKIQKSVDLGNSDNMFFPTSTEEQYKINIYYQGLDTILQNLKLCFSEFDYCKIKQISELLFKWNEPLNETTAKHVQEFYKLDEDIIPELRFYRHYAKLNFVIDDSCINFVSLGCLFIQHGLHSNIPCLSKLLYIALSWPITSASTENSFSTLPRLKTYLCNTMGQEKLTGPALMAVEQELVNKLMEPERLNEIVEKFISQMKEI,mutated_sequence,1.0,1142.0,UPI0000203ED6.a2m,UPI0000203ED6.npy,gnomAD
+UPI000178DEEC,UPI000178DEEC.csv,MPTLSFWVCSATPVSPGFFALILLVFVTSIASNVVKIILIHIDSRLHTPMYFLLSQLSLRDILYISTIVPKMLVDQVMSQRAISFAGCTAQHFLYLTLAGAEFFLLGLMSCDRYVAICNPLHYPDLMSRKICWLIVAAAWLGGSIDGFLLTPVTMQFPFCASREINHFFCEVPALLKLSCTDTSAYETAMYVCCIMMLLIPFSVISGSYTRILITVYRMSEAEGRRKAVATCSSHMVVVSLFYGAAMYTYVLPHSYHTPEQDKAVSAFYTILTPMLNPLIYSLRNKDVTGALQKVVGRCVSSGKVTTF,mutated_sequence,1.0,308.0,UPI000178DEEC.a2m,UPI000178DEEC.npy,gnomAD
+UPI000045725F,UPI000045725F.csv,MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRDTGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDRSARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAPEGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWVDPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLCILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKMRKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHLHQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNTKVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLTRPSMEKPCT,mutated_sequence,1.0,1244.0,UPI000045725F.a2m,UPI000045725F.npy,gnomAD
+UPI00033351D1,UPI00033351D1.csv,XLGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDTLDITGSAEIRAHATAKVGPAPHCSWCLFNCLVELNISFMPVISVSSLMIFAKSMCHLTISK,mutated_sequence,1.0,115.0,UPI00033351D1.a2m,UPI00033351D1.npy,gnomAD
+UPI00001D8160,UPI00001D8160.csv,MTTYRAIPSDGVDLAASCGARVGDVLPGPHTGDYAPLGFWAQNGSMSQPLGESPATATATATATTRPSPTTPAMPKMGVRARVADWPPKREALREHSNPSPSQDTDGTKATKMAHSMRSIQNGQPPTSTPASSGSKAFHRLSRRRSKDVEFQDGWPRSPGRAFLPLRHRSSSEITLSECDAEDAGEPRGARHTGALPLFREYGSTSSIDVQGMPEQSFFDILNEFRSEQPDARGCQALTELLRADPGPHLMGGGGGAKGDSHNGQPAKDSLLPLQPTKEKEKARKKPARGLGGGDTVDSSIFRKLRSSKPEGEAGRSPGEADEGRSPPEASRPWVCQKSFAHFDVQSMLFDLNEAAANRVSVSQRRNTTTGASAASAASAMASLTASRAHSLGGLDPAFTSTEDLNCKENLEQDLGDDNSNDLLLSCPHFRNEIGGECERNVSFSRASVGSPSSGEGHLAEPALSAYRTNASISVLEVPKEQQRTQSRPRQYSIEHVDLGARYYQDYFVGKEHANYFGVDEKLGPVAVSIKREKLEDHKEHGPQYQYRIIFRTRELITLRGSILEDATPTATKHGTGRGLPLKDALEYVIPELNIHCLRLALNTPKVTEQLLKLDEQGLCRKHKVGILYCKAGQSSEEEMYNNEEAGPAFEEFLSLIGEKVCLKGFTKYAAQLDVKTDSTGTHSLYTMYQDYEIMFHVSTLLPYTPNNRQQLLRKRHIGNDIVTIIFQEPGALPFTPKNIRSHFQHVFIIVRVHNPCTDNVCYSMAVTRSKDAPPFGPPIPSGTTFRKSDVFRDFLLAKVINAENAAHKSDKFHTMATRTRQEYLKDLAENCVSNTPIDSTGKFNLISLTSKKKEKTKARAGAEQHSAGAIAWRVVAQDYAQGVEIDCILGISNEFVVLLDLRTKEVVFNCYCGDVIGWTPDSSTLKIFYGRGDHIFLQATEGSVEDIREIVQRLKVMTSGWETVDMTLRRNGLGQLGFHVKYDGTVAEVEDYGFAWQAGLRQGSRLVEICKVAVVTLTHDQMIDLLRTSVTVKVVIIPPFEDGTPRRGWPETYDMNTSEPKTEQESITPGGRPPYRSNAPWQWSGPASHNSLPASKWATPTTPGHAQSLSRPLKQTPIVPFRESQPLHSKRPVSFPETPYTVSPAGADRVPPYRQPSGSFSTPGSATYVRYKPSPERYTAAPHPLLSLDPHFSHDGTSSGDSSSGGLTSQESTMERQKPEPLWHVPAQARLSAIAGSSGNKHPSRQDAAGKDSPNRHSKGEPQYSSHSSSNTLSSNASSSHSDDRWFDPLDPLEPEQDPLSKGGSSDSGIDTTLYTSSPSCMSLAKAPRPAKPHKPPGSMGLCGGGREAAGRSHHADRRREVSPAPAVAGQSKGYRPKLYSSGSSTPTGLAGGSRDPPRQPSDMGSRVGYPAQVYKTASAETPRPSQLAQPSPFQLSASVPKSFFSKQPVRNKHPTGWKRTEEPPPRPLPFSDPKKQVDTNTKNVFGQPRLRASLRDLRSPRKNYKSTIEDDLKKLIIMDNLGPEQERDTGQSPQKGLQRTLSDESLCSGRREPSFASPAGLEPGLPSDVLFTSTCAFPSSTLPARRQHQHPHPPVGPGATPAAGSGFPEKKSTISASELSLADGRDRPLRRLDPGLMPLPDTAAGLEWSSLVNAAKAYEVQRAVSLFSLNDPALSPDIPPAHSPVHSHLSLERGPPTPRTTPTMSEEPPLDLTGKVYQLEVMLKQLHTDLQKEKQDKVVLQSEVASLRQNNQRLQEESQAASEQLRKFAEIFCREKKEL,mutated_sequence,1.0,1781.0,UPI00001D8160.a2m,UPI00001D8160.npy,gnomAD
+UPI0000038CE3,UPI0000038CE3.csv,MGIGRSEGGRRGAALGVLLALGAALLAVGSASEYDYVSFQSDIGPYQSGRFYTKPPQCVDIPADLRLCHNVGYKKMVLPNLLEHETMAEVKQQASSWVPLLNKNCHAGTQVFLCSLFAPVCLDRPIYPCRWLCEAVRDSCEPVMQFFGFYWPEMLKCDKFPEGDVCIAMTPPNATEASKPQGTTVCPPCDNELKSEAIIEHLCASEFALRMKIKEVKKENGDKKIVPKKKKPLKLGPIKKKDLKKLVLYLKNGADCPCHQLDNLSHHFLIMGRKVKSQYLLTAIHKWDKKNKEFKNFMKKMKNHECPTFQSVFK,mutated_sequence,1.0,314.0,UPI0000038CE3.a2m,UPI0000038CE3.npy,gnomAD
+UPI0000070D27,UPI0000070D27.csv,MIIKEYRIPLPMTVEEYRIAQLYMIQKKSRNETYGEGSGVEILENRPYTDGPGGSGQYTHKVYHVGMHIPSWFRSILPKAALRVVEESWNAYPYTRTRFTCPFVEKFSIDIETFYKTDAGENPDVFNLSPVEKNQLTIDFIDIVKDPVPHNEYKTEEDPKLFQSTKTQRGPLSENWIEEYKKQVFPIMCAYKLCKVEFRYWGMQSKIERFIHDTGLRRVMVRAHRQAWCWQDEWYGLSMENIRELEKEAQLMLSRKMAQFNEDGEEATELVKHEAVSDQTSGEPPEPSSSNGEPLVGRGLKKQWSTSSKSSRSSKRGASPSRHSISEWRMQSIARDSDESSDDEFFDAHEDLSDTEEMFPKDITKWSSNDLMDKIESPEPEDTQDGLYRQGAPEFRVASSVEQLNIIEDEVSQPLAAPPSKIHVLLLVLHGGTILDTGAGDPSSKKGDANTIANVFDTVMRVHYPSALGRLAIRLVPCPPVCSDAFALVSNLSPYSHDEGCLSSSQDHIPLAALPLLATSSPQYQEAVATVIQRANLAYGDFIKSQEGMTFNGQVCLIGDCVGGILAFDALCYSNQPVSESQSSSRRGSVVSMQDNDLLSPGILMNAAHCCGGGGGGGGGGGSSGGGGSSGGSSLESSRHLSRSNVDIPRSNGTEDPKRQLPRKRSDSSTYELDTIQQHQAFLSSLHASVLRTEPCSRHSSSSTMLDGTGALGRFDFEITDLFLFGCPLGLVLALRKTVIPALDVFQLRPACQQVYNLFHPADPSASRLEPLLERRFHALPPFSVPRYQRYPLGDGCSTLLADVLQTHNAAFQEHGAPSSPGTAPASRGFRRASEISIASQVSGMAESYTASSIAQKAPDALSHTPSVRRLSLLALPAPSPTTPGPHPPARKASPGLERAPGLPELDIGEVAAKWWGQKRIDYALYCPDALTAFPTVALPHLFHASYWESTDVVSFLLRQVMRHDNSSILELDGKEVSVFTPSKPREKWQRKRTHVKLRNVTANHRINDALANEDGPQVLTGRFMYGPLDMVTLTGEKVDVHIMTQPPSGEWLYLDTLVTNNSGRVSYTIPESHRLGVGVYPIKMVVRGDHTFADSYITVLPKGTEFVVFSIDGSFAASVSIMGSDPKVRAGAVDVVRHWQDLGYLIIYVTGRPDMQKQRVVAWLAQHNFPHGVVSFCDGLVHDPLRHKANFLKLLISELHLRVHAAYGSTKDVAVYSAISLSPMQIYIVGRPTKKLQQQCQFITDGYAAHLAQLKYSHRARPARNTATRMALRKGSFGLPGQGDFLRSRNHLLRTISAQPSGPSHRHERTQSQADGEQRGQRSMSVAAGCWGRAMTGRLEPGAAAGPK,mutated_sequence,1.0,1349.0,UPI0000070D27.a2m,UPI0000070D27.npy,gnomAD
+UPI0001B79456,UPI0001B79456.csv,MQGPDAPLLLSARELGPGRRGSASWYRQEGGAVCNWLRKPQPLEPRTSFPSARRSEFRPPRRLPWAGPASAQSEEGHAGGRCQAGSPAPARRLSGAFKTLVPGTHAQGASDTHPLGPPHTPVLDPSPQDGFPVLWRGSSKASHMNRLRNAKIYVERAVKQKKIFTIQGCYPVIRCLLRRRGWVEKKMVHRSGPTLLPPQKDLDSSAMGDSDTTEDEDEDEDEEFQPSQLFDFDDLLKFDDLDGTHALMVGLCLNLRNLPWFDEVDANSFFPRCYCLGAEDDKKAFIEDFWLTAARNVLKLVVKSEWKSYPIQAVEEEASGDKQPKKQEKNPVLVSPEFVDEALCACEEYLSNLAHMDIDKDLEAPLYLTPEGWSLFLQRYYQVVHEGAELRHLDTQVQRCEDILQQLQAVVPQIDMEGDRNIWIVKPGAKSRGRGIMCMDHLEEMLKLVNGNPVVMKDGKWVVQKYIERPLLIFGTKFDLRQWFLVTDWNPLTVWFYRDSYIRFSTQPFSLKNLDNSVHLCNNSIQKHLENSCHRHPLLPPDNMWSSQRFQAHLQEMGAPNAWSTIIVPGMKDAVIHALQTSQDTVQCRKASFELYGADFVFGEDFQPWLIEINASPTMAPSTAVTARLCAGVQADTLRVVIDRMLDRNCDTGAFELIYKQPAVEVPQYVGIRLLVEGFTIKKPMAMCHRRMGVRPAVPLLTQRGSGEARHHFPSLHTKAQLPSPHVLRHQGQVLRRQHSKLVGTKALSTTGKALRTLPTAKVFISLPPNLDFKVAPSILKPRKAPALLCLRGPQLEVPCCLCPLKSEQFLAPVGRSRPKANSRPDCDKPRAEACPMKRLSPLKPLPLVGTFQRRRGLGDMKLGKPLLRFPTALVLDPTPNKKKQVKYLGLDSIAVGGSRVDGARPCTPGSTARA,mutated_sequence,1.0,915.0,UPI0001B79456.a2m,UPI0001B79456.npy,gnomAD
+UPI00002021CA,UPI00002021CA.csv,MAHELVMFRDVAIDVSQEEWECLNPAQRNLYKEVMLENYSNLVSLGLSVSKPAVISSLEQGKEPWMVVREETGRWCPGTWKTWGFHNNFLDNNEATDINADLASRDEPQKLSPKRDIYETELSQWVNMEEFKSHSPERSIFSAIWEGNCHFEQHQGQEEGYFRQLMINHENMPIFSQHTLLTQEFYDREKISECKKCRKIFSYHLFFSHHKRTHSKELSECKECTEIVNTPCLFKQQTIQNGDKCNECKECWKAFVHCSQLKHLRIHNGEKRYECNECGKAFNYGSELTLHQRIHTGEKPYECKECGKAFRQRSQLTQHQRLHTGEKPYECKQCGKAFIRGFQLTEHLRLHTGEKPYECKECGKTFRHRSHLTIHQRIHTGEKPYECRECGKAFSYHSSFSHHQKIHSGKKPYECHECGKAFCDGLQLTLHQRIHTGEKPYECKECGKTFRQCSHLKRHQRIHTGEKPHECMICGKAFRLHSHLIQHQRIHTGEKPYECKECGKAFSYHSSFSHHQRIHSGKKPYQCGKAFNHRLQLNLHQTLHTGEKPVRFPLLPPHPSLAS,mutated_sequence,1.0,563.0,UPI00002021CA.a2m,UPI00002021CA.npy,gnomAD
+UPI000013CD7C,UPI000013CD7C.csv,MVLLSILRILFLCELVLFMEHRAQMAEGGQSSIALLAEAPTLPLIEELLEESPGEQPRKPRLLGHSLRYMLELYRRSADSHGHPRENRTIGATMVRLVKPLTNVARPHRGTWHIQILGFPLRPNRGLYQLVRATVVYRHHLQLTRFNLSCHVEPWVQKNPTNHFPSSEGDSSKPSLMSNAWKEMDITQLVQQRFWNNKGHRILRLRFMCQQQKDSGGLELWHGTSSLDIAFLLLYFNDTHKSIRKAKFLPRGMEEFMERESLLRRTRQADGISAEVTASSSKHSGPENNQCSLHPFQISFRQLGWDHWIIAPPFYTPNYCKGTCLRVLRDGLNSPNHAIIQNLINQLVDQSVPRPSCVPYKYVPISVLMIEANGSILYKEYEGMIAESCTCR,mutated_sequence,1.0,392.0,UPI000013CD7C.a2m,UPI000013CD7C.npy,gnomAD
+UPI0000071891,UPI0000071891.csv,MEEEDESRGKTEESGEDRGDGPPDRDPTLSPSAFILRAIQQAVGSSLQGDLPNDKDGSRCHGLRWRRCRSPRSEPRSQESGGTDTATVLDMATDSFLAGLVSVLDPPDTWVPSRLDLRPGESEDMLELVAEVRIGDRDPIPLPVPSLLPRLRAWRTGKTVSPQSNSSRPTCARHLTLGTGDGGPAPPPAPSSASSSPSPSPSSSSPSPPPPPPPPAPPAPPAPRFDIYDPFHPTDEAYSPPPAPEQKYDPFEPTGSNPSSSAGTPSPEEEEEEEEEEEEEEEDEEEEEGLSQSISRISETLAGIYDDNSLSQDFPGDESPRPDAQPTQPTPAPGTPPQVDSTRADGAMRRRVFVVGTEAEACREGKVSVEVVTAGGAALPPPLLPPGDSEIEEGEIVQPEEEPRLALSLFRPGGRAARPTPAASATPTAQPLPQPPAPRAPEGDDFLSLHAESDGEGALQVDLGEPAPAPPAADSRWGGLDLRRKILTQRRERYRQRSPSPAPAPAPAAAAGPPTRKKSRRERKRSGEAKEAASSSSGTQPAPPAPASPWDSKKHRSRDRKPGSHASSSARRRSRSRSRSRSTRRRSRSTDRRRGGSRRSRSREKRRRRRRSASPPPATSSSSSSRRERHRGKHRDGGGSKKKKKRSRSRGEKRSGDGSEKAPAPAPPPSGSTSCGDRDSRRRGAVPPSIQDLTDHDLFAIKRTITVGRLDKSDPRGPSPAPASSPKREVLYDSEGLSGEERGGKSSQKDRRRSGAASSSSSSREKGSRRKALDGGDRDRDRDRDRDRDRSSKKARPPKESAPSSGPPPKPPVSSGSGSSSSSSSCSSRKVKLQSKVAVLIREGVSSTTPAKDAASAGLGSIGVKFSRDRESRSPFLKPDERAPTEMAKAAPGSTKPKKTKVKAKAGAKKTKGTKGKTKPSKTRKKVRSGGGSGGSGGQVSLKKSKADSCSQAAGTKGAEETSWSGEERAAKVPSTPPPKAAPPPPALTPDSQTVDSSCKTPEVSFLPEEATEEAGVRGGAEEEEEEEEEEEEEEEEEEQQPATTTATSTAAAAPSTAPSAGSTAGDSGAEDGPASRVSQLPTLPPPMPWNLPAGVDCTTSGVLALTALLFKMEEANLASRAKAQELIQATNQILSHRKPPSSLGMTPAPVPTSLGLPPGPSSYLLPGSLPLGGCGSTPPTPTGLAATSDKREGSSSSEGRGDTDKYLKKLHTQERAVEEVKLAIKPYYQKKDITKEEYKDILRKAVHKICHSKSGEINPVKVSNLVRAYVQRYRYFRKHGRKPGDPPGPPRPPKEPGPPDKGGPGLPLPPL,mutated_sequence,1.0,1312.0,UPI0000071891.a2m,UPI0000071891.npy,gnomAD
+UPI000013D6B1,UPI000013D6B1.csv,MRWRTILLQYCFLLITCLLTALEAVPIDIDKTKVQNIHPVESAKIEPPDTGLYYDEYLKQVIDVLETDKHFREKLQKADIEEIKSGRLSKELDLVSHHVRTKLDELKRQEVGRLRMLIKAKLDSLQDIGMDHQALLKQFDHLNHLNPDKFESTDLDMLIKAATSDLEHYDKTRHEEFKKYEMMKEHERREYLKTLNEEKRKEEESKFEEMKKKHENHPKVNHPGSKDQLKEVWEETDGLDPNDFDPKTFFKLHDVNSDGFLDEQELEALFTKELEKVYDPKNEEDDMVEMEEERLRMREHVMSEVDTNKDRLVTLEEFLKATEKKEFLEPDSWETLDQQQFFTEEELKEYENIIALQENELKKKADELQKQKEELQRQHDQLEAQKLEYHQVIQQMEQKKLQQGIPPSGPAGELKFEPHI,mutated_sequence,1.0,420.0,UPI000013D6B1.a2m,UPI000013D6B1.npy,gnomAD
+UPI000013DEA0,UPI000013DEA0.csv,MVKLANPLYTEWILEAIQKIKKQKQRPSEERICHAVSTSHGLDKKTVSEQLELSVQDGSVLKVTNKGLASYKDPDNPGRFSSVKPGTFPKSAKGSRGSCNDLRNVDWNKLLRRAIEGLEEPNGSSLKNIEKYLRSQSDLTSTTNNPAFQQRLRLGAKRAVNNGRLLKDGPQYRVNYGSLDGKGAPQYPSAFPSSLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWICQVCRPKKKGRKLLHEKAAQIKRRYAKPIGRPKNKLKQRLLSVTSDEGSMNAFTGRGSPGRGQKTKVCTTPSSGHAASGKDSSSRLAVTDPTRPGATTKITTTSTYISASTLKVNKKTKGLIDGLTKFFTPSPDGRRSRGEIIDFSKHYRPRKKVSQKQSCTSHVLATGTTQKLKPPPSSLPPPTPISGQSPSSQKSSTATSSPSPQSSSSQCSVPSLSSLTTNSQLKALFDGLSHIYTTQGQSRKKGHPSYAPPKRMRRKTELSSTAKSKAHFFGKRDIRSRFISHSSSSSWGMARGSIFKAIAHFKRTTFLKKHRMLGRLKYKVTPQMGTPSPGKGSLTDGRIKPDQDDDTEIKINIKQESADVNVIGNKDVVTEEDLDVFKQAQELSWEKIECESGVEDCGRYPSVIEFGKYEIQTWYSSPYPQEYARLPKLYLCEFCLKYMKSKNILLRHSKKCGWFHPPANEIYRRKDLSVFEVDGNMSKIYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTKNDEKGCHLVGYFSKEKLCQQKYNVSCIMIMPQHQRQGFGRFLIDFSYLLSRREGQAGSPEKPLSDLGRLSYLAYWKSVILEYLYHHHERHISIKAISRATGMCPHDIATTLQHLHMIDKRDGRFVIIRREKLILSHMEKLKTCSRANELDPDSLRWTPILISNAAVSEEEREAEKEAERLMEQASCWEKEEQEILSTRANSRQSPAKVQSKNKYLHSPESRPVTGERGQLLELSKESSEEEEEEEDEEEEEEEEEEEEDEEEEEEEEEEEEEENIQSSPPRLTKPQSVAIKRKRPFVLKKKRGRKRRRINSSVTTETISETTEVLNEPFDNSDEERPMPQLEPTCEIEVEEDGRKPVLRKAFQHQPGKKRQTEEEEGKDNHCFKNADPCRNNMNDDSSNLKEGSKDNPEPLKCKQVWPKGTKRGLSKWRQNKERKTGFKLNLYTPPETPMEPDEQVTVEEQKETSEGKTSPSPIRIEEEVKETGEALLPQEENRREETCAPVSPNTSPGEKPEDDLIKPEEEEEEEEEEEEEEEEEEGEEEEGGGNVEKDPDGAKSQEKEEPEISTEKEDSARLDDHEEEEEEDEEPSHNEDHDADDEDDSHMESAEVEKEELPRESFKEVLENQETFLDLNVQPGHSNPEVLMDCGVDLTASCNSEPKELAGDPEAVPESDEEPPPGEQAQKQDQKNSKEVDTEFKEGNPATMEIDSETVQAVQSLTQESSEQDDTFQDCAETQEACRSLQNYTRADQSPQIATTLDDCQQSDHSSPVSSVHSHPGQSVRSVNSPSVPALENSYAQISPDQSAISVPSLQNMETSPMMDVPSVSDHSQQVVDSGFSDLGSIESTTENYENPSSYDSTMGGSICGNGSSQNSCSYSNLTSSSLTQSSCAVTQQMSNISGSCSMLQQTSISSPPTCSVKSPQGCVVERPPSSSQQLAQCSMAANFTPPMQLAEIPETSNANIGLYERMGQSDFGAGHYPQPSATFSLAKLQQLTNTLIDHSLPYSHSAAVTSYANSASLSTPLSNTGLVQLSQSPHSVPGGPQAQATMTPPPNLTPPPMNLPPPLLQRNMAASNIGISHSQRLQTQIASKGHISMRTKSASLSPAAATHQSQIYGRSQTVAMQGPARTLTMQRGMNMSVNLMPAPAYNVNSVNMNMNTLNAMNGYSMSQPMMNSGYHSNHGYMNQTPQYPMQMQMGMMGTQPYAQQPMQTPPHGNMMYTAPGHHGYMNTGMSKQSLNGSYMRR,mutated_sequence,1.0,2073.0,UPI000013DEA0.a2m,UPI000013DEA0.npy,gnomAD
+UPI000000D799,UPI000000D799.csv,MESGDEAAIERHRVHLRSATLRDAVPATLHLLPCEVAVDGPAPVGRFFTPAIRQGPEGLEVSFRGRCLRGEEVAVPPGLVGYVMVTEEKKVSMGKPDPLRDSGTDDQEEEPLERDFDRFIGATANFSRFTLWGLETIPGPDAKVRGALTWPSLAAAIHAQVPED,mutated_sequence,1.0,164.0,UPI000000D799.a2m,UPI000000D799.npy,gnomAD
+UPI000013C5D0,UPI000013C5D0.csv,MSRSLDSARSFLERLEARGGREGAVLAGEFSDIQACSAAWKADGVCSTVAGSRPENVRKNRYKDVLPYDQTRVILSLLQEEGHSDYINGNFIRGVDGSLAYIATQGPLPHTLLDFWRLVWEFGVKVILMACREIENGRKRCERYWAQEQEPLQTGLFCITLIKEKWLNEDIMLRTLKVTFQKESRSVYQLQYMSWPDRGVPSSPDHMLAMVEEARRLQGSGPEPLCVHCSAGCGRTGVLCTVDYVRQLLLTQMIPPDFSLFDVVLKMRKQRPAAVQTEEQYRFLYHTVAQMFCSTLQNASPHYQNIKENCAPLYDDALFLRTPQALLAIPRPPGGVLRSISVPGSPGHAMADTYAVVQKRGAPAGAGSGTQTGTGTGTGARSAEEAPLYSKVTPRAQRPGAHAEDARGTLPGRVPADQSPAGSGAYEDVAGGAQTGGLGFNLRIGRPKGPRDPPAEWTRV,mutated_sequence,1.0,460.0,UPI000013C5D0.a2m,UPI000013C5D0.npy,gnomAD
+UPI000013C34D,UPI000013C34D.csv,MEGLLHYINPAHAISLLSALNEERLKGQLCDVLLIVGDQKFRAHKNVLAASSEYFQSLFTNKENESQTVFQLDFCEPDAFDNVLNYIYSSSLFVEKSSLAAVQELGYSLGISFLTNIVSKTPQAPFPTCPNRKKVFVEDDENSSQKRSVIVCQSRNEAQGKTVSQNQPDVSHTSRPSPSIAVKANTNKPHVPKPIEPLHNLSLTEKSWPKDSSVVYAKSLEHSGSLDDPNRISLVKRNAVLPSKPLQDREAMDDKPGVSGQLPKGKALELALKRPRPPVLSVCSSSETPYLLKETNKGNGQGEDRNLLYYSKLGLVIPSSGSGSGNQSIDRSGPLVKSLLRRSLSMDSQVPVYSPSIDLKSSQGSSSVSSDAPGNVLCALSQKSSLKDCSEKTALDDRPQVLQPHRLRSFSASQSTDREGASPVTEVRIKTEPSSPLSDPSDIIRVTVGDAATTAAASSSSVTRDLSLKTEDDQKDMSRLPAKRRFQADRRLPFKKLKVNEHGSPVSEDNFEEGSSPTLLDADFPDSDLNKDEFGELEGTRPNKKFKCKHCLKIFRSTAGLHRHVNMYHNPEKPYACDICHKRFHTNFKVWTHCQTQHGIVKNPSPASSSHAVLDEKFQRKLIDIVREREIKKALIIKLRRGKPGFQGQSSSQAQQVIKRNLRSRAKGAYICTYCGKAYRFLSQFKQHIKMHPGEKPLGVNKVAKPKEHAPLASPVENKEVYQCRLCNAKLSSLLEQGSHERLCRNAAVCPYCSLRFFSPELKQEHESKCEYKKLTCLECMRTFKSSFSIWRHQVEVHNQNNMAPTENFSLPVLDHNGDVTGSSRPQSQPEPNKVNHIVTTKDDNVFSDSSEQVNFDSEDSSCLPEDLSLSKQLKIQVKEEPVEEAEEEAPEASTAPKEAGPSKEASLWPCEKCGKMFTVHKQLERHQELLCSVKPFICHVCNKAFRTNFRLWSHFQSHMSQASEESAHKESEVCPVPTNSPSPPPLPPPPPLPKIQPLEPDSPTGLSENPTPATEKLFVPQESDTLFYHAPPLSAITFKRQFMCKLCHRTFKTAFSLWSHEQTHN,mutated_sequence,1.0,1066.0,UPI000013C34D.a2m,UPI000013C34D.npy,gnomAD
+UPI00000743C2,UPI00000743C2.csv,MAPTKPSFQQDPSRRERITAQHPLPNQSECRKIYRYDGIYCESTYQNLQALRKEKSRDAARSRRGKENFEFYELAKLLPLPAAITSQLDKASIIRLTISYLKMRDFANQGDPPWNLRMEGPPPNTSVKVIGAQRRRSPSALAIEVFEAHLGSHILQSLDGFVFALNQEGKFLYISETVSIYLGLSQVELTGSSVFDYVHPGDHVEMAEQLGMKLPPGRGLLSQGTAEDGASSASSSSQSETPEPVESTSPSLLTTDNTLERSFFIRMKSTLTKRGVHIKSSGYKVIHITGRLRLRVSLSHGRTVPSQIMGLVVVAHALPPPTINEVRIDCHMFVTRVNMDLNIIYCENRISDYMDLTPVDIVGKRCYHFIHAEDVEGIRHSHLDLLNKGQCVTKYYRWMQKNGGYIWIQSSATIAINAKNANEKNIIWVNYLLSNPEYKDTPMDIAQLPHLPEKTSESSETSDSESDSKDTSGITEDNENSKSDEKGNQSENSEDPEPDRKKSGNACDNDMNCNDDGHSSSNPDSRDSDDSFEHSDFENPKAGEDGFGALGAMQIKVERYVESESDLRLQNCESLTSDSAKDSDSAGEAGAQASSKHQKRKKRRKRQKGGSASRRRLSSASSPGGLDAGLVEPPRLLSSPNSASVLKIKTEISEPINFDNDSSIWNYPPNREISRNESPYSMTKPPSSEHFPSPQGGGGGGGGGGGLHVAIPDSVLTPPGADGAAARKTQFGASATAALAPVASDPLSPPLSASPRDKHPGNGGGGGGGGGGAGGGGPSASNSLLYTGDLEALQRLQAGNVVLPLVHRVTGTLAATSTAAQRVYTTGTIRYAPAEVTLAMQSNLLPNAHAVNFVDVNSPGFGLDPKTPMEMLYHHVHRLNMSGPFGGAVSAASLTQMPAGNVFTTAEGLFSTLPFPVYSNGIHAAQTLERKED,mutated_sequence,1.0,933.0,UPI00000743C2.a2m,UPI00000743C2.npy,gnomAD
+UPI0000000A01,UPI0000000A01.csv,MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIKSSNEATNITPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNLPGGGVQRGNILNLNGAGDPLTPGYPANEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGAAVVHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSGMPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYETYELVEKFYDPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQEMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQAAAETLSEVA,mutated_sequence,1.0,750.0,UPI0000000A01.a2m,UPI0000000A01.npy,gnomAD
+UPI000013C6CB,UPI000013C6CB.csv,MEGGAYGAGKAGGAFDPYTLVRQPHTILRVVSWLFSIVVFGSIVNEGYLNSASEGEEFCIYNRNPNACSYGVAVGVLAFLTCLLYLALDVYFPQISSVKDRKKAVLSDIGVSAFWAFLWFVGFCYLANQWQVSKPKDNPLNEGTDAARAAIAFSFFSIFTWAGQAVLAFQRYQIGADSALFSQDYMDPSQDSSMPYAPYVEPTGPDPAGMGGTYQQPANTFDTEPQGYQSQGY,mutated_sequence,1.0,233.0,UPI000013C6CB.a2m,UPI000013C6CB.npy,gnomAD
+UPI00001B4EFC,UPI00001B4EFC.csv,MGDTAPPQAPAGGLGGASGAGLLGGGSVTPRVHSAIVERLRARIAVCRQHHLSCEGRYERGRAESSDRERESTLQLLSLVQHGQGARKAGKHTKATATAATTTAPPPPPAAPPAASQAAATAAPPPPPDYHHHHQQHLLNSSNNGGSGGINGEQQPPASTPGDQRNSALIALQGSLKRKQVVNLSPANSKRPNGFVDNSFLDIKRIRVGENLSAGQGGLQINNGQSQIMSGTLPMSQAPLRKTNTLPSHTHSPGNGLFNMGLKEVKKEPGETLSCSKHMDGQMTQENIFPNRYGDDPGEQLMDPELQELFNELTNISVPPMSDLELENMINATIKQDDPFNIDLGQQSQRSTPRPSLPMEKIVIKSEYSPGLTQGPSGSPQLRPPSAGPAFSMANSALSTSSPIPSVPQSQAQPQTGSGASRALPSWQEVSHAQQLKQIAANRQQHARMQQHQQQHQPTNWSALPSSAGPSPGPFGQEKIPSPSFGQQTFSPQSSPMPGVAGGSGQSKVMANYMYKAGPSAQGGHLDVLMQQKPQDLSRSFINNPHPAMEPRQGNTKPLFHFNSDQANQQMPSVLPSQNKPSLLHYTQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQSSISAQQQQQQQSSISAQQQQQQQQQQQQQQQQQQQQQQQQQQQPSSQPAQSLPSQPLLRSPLPLQQKLLLQQMQNQPIAGMGYQVSQQQRQDQHSVVGQNTGPSPSPNPCSNPNTGSGYMNSQQSLLNQQLMGKKQTLQRQIMEQKQQLLLQQQMLADAEKIAPQDQINRHLSRPPPDYKDQRRNVGNMQPTAQYSGGSSTISLNSNQALANPVSTHTILTPNSSLLSTSHGTRMPSLSTAVQNMGMYGNLPCNQPNTYSVTSGMNQLTQQRNPKQLLANQNNPMMPRPPTLGPSNNNNVATFGAGSVGNSQQLRPNLTHSMASMPPQRTSNVMITSNTTAPNWASQEGTSKQQEALTSAGVRFPTGTPAAYTPNQSLQQAVGSQQFSQRAVAPPNQLTPAVQMRPMNQMSQTLNGQTMGPLRGLNLRPNQLSTQILPNLNQSGTGLNQSRTGINQPPSLTPSNFPSPNQSSRAFQGTDHSSDLAFDFLSQQNDNMGPALNSDADFIDSLLKTEPGNDDWMKDINLDEILGNNS,mutated_sequence,1.0,1156.0,UPI00001B4EFC.a2m,UPI00001B4EFC.npy,gnomAD
+UPI00001FDCF7,UPI00001FDCF7.csv,MSATSWFLVSSSGARHRLPRELIFVGREECELMLQSRSVDKQHAVINYDQDRDEHWVKDLGSLNGTFVNDMRIPDQKYVTLKLNDVIRFGYDSNMYVLERVQHRVPEEALKHEKYTSQLQVSVKGLAPKRSEALPEHTPYCEASNPRPEKGDRRPGTEAASYRTPLYGQPSWWGEDDGSTLPDAQRQGEPYPERPKGPVQQDGELHGFRAPAEPQGCSFRREPSYFEIPTKETPQPSQPPEVPAHEMPTKDAEAGGGGAAPVVQSHASFTIEFDDCSPGKMKIKDHITKFSLRQRRPPGKEATPGEMVSAETKVADWLVQNDPSLLHRVGPGDDRHSTKSDLPVHTRTLKGHKHEDGTQSDSEDPLAKAASAAGVPLEASGEQVRLQRQIKRDPQELLHNQQAFVIEFFDEDTPRKKRSQSFTHSPSGDPKADKRRGPTPADRDRPSVPAPVQAGGRSSGPQRAGSLKREKTEERLGSPSPASRTPARPFGSVGRRSRLAQDFMAQCLRESSPAARPSPEKVPPVLPAPLTPHGTSPVGPPTPPPAPTDPQLTKARKQEEDDSLSDAGTYTIETEAQDTEVEEARKMIDQVFGVLESPELSRASSATFRPVIRGDRDESDDGGVAQRMALLQEFASRPLGAAPQAEHQGLPVPGSPGGQKWVSRWASLADSYSDPGLTEDGLGRRGGEPEGSLPVRMRRRLPQLPSERADSPAGPESSRRSGPGPPELDSEQPSRLFGQEELDPDSLSDASGSDGGRGPEPGVEPQDSRRRSPQEGPTWSRGRRSPRAPGEPTPASFFIGDQNGDAVLSRKPLAAPGDGEGLGQTAQPSPPARDGVYVSANGRMVIQLRPGRSPEPDGPAPAFLRQESFTKEPASGPPAPGKPPHISSHPLLQDLAATRAARMDFHSQDTHLILKETETALAALEARLLSNSVDAECEGGSTPRPPEDALSGDSDVDTASTVSLRSGKSGPSPTTPQPLRAQKEMSPSPPAAQDPGGTALVSAREQSSERQHHPLGPTDMGRGEPVRRSAIRRGHRPRGSLDWPSEERGPVLAHLPSSDVMASNHETPEATGAGRLGSRRKPAAPPPSPAAREEQSRSSASSQKGPQALTRSNSLSTPRPTRASRLRRARLGDASDTEAADGERGSLGNPEPVGRPAAEQAKKLSRLDILAMPRKRAGSFTGTSDPEAAPARTSFSGRSVELCCASRKPTMAEARAVSRKAANTATTTGPRQPFSRARSGSARYTSTTQTPRAGSSSRARSRAPGPRDTDDDEEEPDPYGFIVQTAEIAEIARLSQTLVKDVAILAQEIHDVAGDGDTLGSSEPAHSASLSNMPSTPASTISAREELVQRIPEASLNFQKVPPGSLNSRDFDQNMNDSCEDALANKTRPRNREEVIFDNLMLNPVSQLSQAIRENTEHLAEKMKILFQNTGRAWEDLEARINAENEVPILKTSNKEISSILKELRRVQKQLEVINAIVDPSGSLDLLTGNRSLASSAQPGLGKGRVAAQSPPSPASAEALLPALPLRNFPQRASCGPPSLPDPTFLPDAERFLI,mutated_sequence,1.0,1554.0,UPI00001FDCF7.a2m,UPI00001FDCF7.npy,gnomAD
+UPI00002263B3,UPI00002263B3.csv,MPGGPSPRSPAPLLRPLLLLLCALAPGAPGPAPGRATEGRAALDIVHPVRVDAGGSFLSYELWPRALRKRDVSVRRDAPAFYELQYRGRELRFNLTANQHLLAPGFVSETRRRGGLGRAHIRAHTPACHLLGEVQDPELEGGLAAISACDGLKGVFQLSNEDYFIEPLDSAPARPGHAQPHVVYKRQAPERLAQRGDSSAPSTCGVQVYPELESRRERWEQRQQWRRPRLRRLHQRSVSKEKWVETLVVADAKMVEYHGQPQVESYVLTIMNMVAGLFHDPSIGNPIHITIVRLVLLEDEEEDLKITHHADNTLKSFCKWQKSINMKGDAHPLHHDTAILLTRKDLCAAMNRPCETLGLSHVAGMCQPHRSCSINEDTGLPLAFTVAHELGHSFGIQHDGSGNDCEPVGKRPFIMSPQLLYDAAPLTWSRCSRQYITRFLDRGWGLCLDDPPAKDIIDFPSVPPGVLYDVSHQCRLQYGAYSAFCEDMDNVCHTLWCSVGTTCHSKLDAAVDGTRCGENKWCLSGECVPVGFRPEAVDGGWSGWSAWSICSRSCGMGVQSAERQCTQPTPKYKGRYCVGERKRFRLCNLQACPAGRPSFRHVQCSHFDAMLYKGQLHTWVPVVNDVNPCELHCRPANEYFAEKLRDAVVDGTPCYQVRASRDLCINGICKNVGCDFEIDSGAMEDRCGVCHGNGSTCHTVSGTFEEAEGLGYVDVGLIPAGAREIRIQEVAEAANFLALRSEDPEKYFLNGGWTIQWNGDYQVAGTTFTYARRGNWENLTSPGPTKEPVWIQLLFQESNPGVHYEYTIHREAGGHDEVPPPVFSWHYGPWTKCTVTCGRGVQRQNVYCLERQAGPVDEEHCDPLGRPDDQQRKCSEQPCPARWWAGEWQLCSSSCGPGGLSRRAVLCIRSVGLDEQSALEPPACEHLPRPPTETPCNRHVPCPATWAVGNWSQCSVTCGEGTQRRNVLCTNDTGVPCDEAQQPASEVTCSLPLCRWPLGTLGPEGSGSGSSSHELFNEADFIPHHLAPRPSPASSPKPGTMGNAIEEEAPELDLPGPVFVDDFYYDYNFINFHEDLSYGPSEEPDLDLAGTGDRTPPPHSHPAAPSTGSPVPATEPPAAKEEGVLGPWSPSPWPSQAGRSPPPPSEQTPGNPLINFLPEEDTPIGAPDLGLPSLSWPRVSTDGLQTPATPESQNDFPVGKDSQSQLPPPWRDRTNEVFKDDEEPKGRGAPHLPPRPSSTLPPLSPVGSTHSSPSPDVAELWTGGTVAWEPALEGGLGPVDSELWPTVGVASLLPPPIAPLPEMKVRDSSLEPGTPSFPTPGPGSWDLQTVAVWGTFLPTTLTGLGHMPEPALNPGPKGQPESLSPEVPLSSRLLSTPAWDSPANSHRVPETQPLAPSLAEAGPPADPLVVRNAGWQAGNWSECSTTCGLGAVWRPVRCSSGRDEDCAPAGRPQPARRCHLRPCATWHSGNWSKCSRSCGGGSSVRDVQCVDTRDLRPLRPFHCQPGPAKPPAHRPCGAQPCLSWYTSSWRECSEACGGGEQQRLVTCPEPGLCEEALRPNTTRPCNTHPCTQWVVGPWGQCSGPCGGGVQRRLVKCVNTQTGLPEEDSDQCGHEAWPESSRPCGTEDCEPVEPPRCERDRLSFGFCETLRLLGRCQLPTIRTQCCRSCSPPSHGAPSRGHQRVARR,mutated_sequence,1.0,1686.0,UPI00002263B3.a2m,UPI00002263B3.npy,gnomAD
+UPI0000197460,UPI0000197460.csv,MRVLCEQRYCAVCREELRQVVFGKKLPAFATIPIHQLQHEKKYDIYFADGKVYALYRQLLQHECPRCPELPPFSLFGDLEQHMRRQHELFCCRLCLQHLQIFTYERKWYSRKDLARHRMQGDPDDTSHRGHPLCKFCDERYLDNDELLKHLRRDHYFCHFCDSDGAQDYYSDYAYLREHFREKHFLCEEGRCSTEQFTHAFRTEIDLKAHRTACHSRSRAEARQNRHIDLQFSYAPRHSRRNEGVVGGEDYEEVDRYSRQGRVARAGTRGAQQSRRGSWRYKREEEDREVAAAVRASVAAQQQEEARRSEDQEEGGRPKKEEAAARGPEDPRGPRRSPRTQGEGPGPKETSTNGPVSQEAFSVTGPAAPGCVGVPGALPPPSPKLKDEDFPSLSASTSSSCSTAATPGPVGLALPYAIPARGRSAFQEEDFPALVSSVPKPGTAPTSLVSAWNSSSSSKKVAQPPLSAQATGSGQPTRKAGKGSRGGRKGGPPFTQEEEEDGGPALQELLSTRPTGSVSSTLGLASIQPSKVGKKKKVGSEKPGTTLPQPPPATCPPGALQAPEAPASRAEGPVAVVVNGHMEGPAPARSAPKEPPGLPRPLGSFPCPTPQEDFPALGGPCPPRMPPPPGFSAVVLLKGTPPPPPPGLVPPISKPPPGFSGLLPSPHPACVPSPATTTTTKAPRLLPAPRAYLVPENFRERNLQLIQSIRDFLQSDEARFSEFKSHSGEFRQGLISAAQYYKSCRDLLGENFQKVFNELLVLLPDTAKQQELLSAHTDFCNREKPLSTKSKKNKKSAWQATTQQAGLDCRVCPTCQQVLAHGDASSHQALHAARDDDFPSLQAIARIIT,mutated_sequence,1.0,849.0,UPI0000197460.a2m,UPI0000197460.npy,gnomAD
+UPI000013E592,UPI000013E592.csv,MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLVLALALALALALALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPICKGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFEDDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGSGLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWTGDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHESTPVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTHVDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGSSHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAWHNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQLSARADSPALCRHLMPDGSLPEAQSLWPRPLFC,mutated_sequence,1.0,631.0,UPI000013E592.a2m,UPI000013E592.npy,gnomAD
+UPI00004DDD95,UPI00004DDD95.csv,MEKLGVEPEEEGGGDDDEEDAEAWAMELADVGAAASSQGVHDQVLPTPNASSRVIVHVDLDCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVNGEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSGHVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLLAKLVSGVFKPNQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFSPKILEKELGISVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCSSEVEAKNKIEELLASLLNRVCQDGRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQKLGTGNYDVMTPMVDILMKLFRNMVNVKMPFHLTLLSVCFCNLKALNTAKKGLIDYYLMPSLSTTSRSGKHSFKMKDTHMEDFPKDKETNRDFLPSGRIESTRTRESPLDTTNFSKEKDINEFPLCSLPEGVDQEVFKQLPVDIQEEILSGKSREKFQGKGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPCEPGTSGFNSSSSSYMSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFPNLQSEQLFSRNHTTDSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAVQKELLAEWKRAGSDFHIGHK,mutated_sequence,1.0,740.0,UPI00004DDD95.a2m,UPI00004DDD95.npy,gnomAD
+UPI0000D616C3,UPI0000D616C3.csv,MDPLETPIKDGILYQQHVKFGKKCWRKVWALLYAGGPSGVARLESWEVRDGGLGAAGDRSAGPGRRGERRVIRLADCVSVLPADGESCPRDTGAFLLTTTERSHLLAAQHRQAWMGPICQLAFPGTGEASSGSTDAQSPKRGLVPMEENSIYSSWQEVGEFPVVVQRTEAATRCQLKGPALLVLGPDAIQLREAKGTQALYSWPYHFLRKFGSDKILLGTPGVSLLICKGERTDDVSGIILDESLLRAYSVPGAGGHSRVQDSLGPVLREPTFQGERSFLKTSMLRSLLCSCSWRHPRSQPRTQASCLQGSDCPAPHRNSTSAAHTLGTS,mutated_sequence,1.0,330.0,UPI0000D616C3.a2m,UPI0000D616C3.npy,gnomAD
+UPI0000129B6F,UPI0000129B6F.csv,MTILTYPFKNLPTASKWALRFSIRPLSCSSQLRAAPAVQTKTKKTLAKPNIRNVVVVDGVRTPFLLSGTSYKDLMPHDLARAALTGLLHRTSVPKEVVDYIIFGTVIQEVKTSNVAREAALGAGFSDKTPAHTVTMACISANQAMTTGVGLIASGQCDVIVAGGVELMSDVPIRHSRKMRKLMLDLNKAKSMGQRLSLISKFRFNFLAPELPAVSEFSTSETMGHSADRLAAAFAVSRLEQDEYALRSHSLAKKAQDEGLLSDVVPFKVPGKDTVTKDNGIRPSSLEQMAKLKPAFIKPYGTVTAANSSFLTDGASAMLIMAEEKALAMGYKPKAYLRDFMYVSQDPKDQLLLGPTYATPKVLEKAGLTMNDIDAFEFHEAFSGQILANFKAMDSDWFAENYMGRKTKVGLPPLEKFNNWGGSLSLGHPFGATGCRLVMAAANRLRKEGGQYGLVAACAAGGQGHAMIVEAYPK,mutated_sequence,1.0,474.0,UPI0000129B6F.a2m,UPI0000129B6F.npy,gnomAD
+UPI00004588F6,UPI00004588F6.csv,MGDTAKPYFVKRTKDRGTMDDDDFRRGHPQQDYLIIDDHAKGHGSKMEKGLQKKKITPGNYGNTPRKGPCAVSSNPYAFKNPIYSQPAWMNDSHKDQSKRWLSDEHTGNSDNWREFKPGPRIPVINRQRKDSFQENEDGYRWQDTRGCRTVRRLFHKDLTSLETTSEMEAGSPENKKQRSRPRKPRKTRNEENEQDGDLEGPVIDESVLSTKELLGLQQAEERLKRDCIDRLKRRPRNYPTAKYTCRLCDVLIESIAFAHKHIKEKRHKKNIKEKQEEELLTTLPPPTPSQINAVGIAIDKVVQEFGLHNENLEQRLEIKRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFPAIMSQPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACLTTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLLPVYLGSWIEGFSLSKLGNFNLQDIEKDVVIWEHTDSAAGDTGITKEEAPRETPIKRGQVSLILDVKHQPSVPVGQLWVELLRFYALEFNLADLVISIRVKELVSRELKDWPKKRIAIEDPYSVKRNVARTLNSQPVFEYILHCLRTTYKYFALPHKITKSSLLKPLNAITCISEHSKEVINHHPDVQTKDDKLKNSVLAQGPGATSSAANTCKVQPLTLKETAESFGSPPKEEMGNEHISVHPENSDCIQADVNSDDYKGDKVYHPETGRKNEKEKVGRKGKHLLTVDQKRGEHVVCGSTRNNESESTLDLEGFQNPTAKECEGLATLDNKADLDGESTEGTEELEDSLNHFTHSVQGQTSEMIPSDEEEEDDEEEEEEEEPRLTINQREDEDGMANEDELDNTYTGSGDEDALSEEDDELGEAAKYEDVKECGKHVERALLVELNKISLKEENVCEEKNSPVDQSDFFYEFSKLIFTKGKSPTVVCSLCKREGHLKKDCPEDFKRIQLEPLPPLTPKFLNILDQVCIQCYKDFSPTIIEDQAREHIRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDVCMTINGLETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGEKKPEIFVDGWNIYFFDQIDELPTYWSECGKNTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSLLTTFKKQWTSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAFINGRRVFGIPVKGFPKDYPSKMEYFFDPDVLTEGELAPNDRCCRICGKIGHFMKDCPMRRKVRRRRDQEDALNQRYPENKEKRSKEDKEIHNKYTEREVSTKEDKPIQCTPQKAKPMRAAADLGREKILRPPVEKWKRQDDKDLREKRCFICGREGHIKKECPQFKGSSGSLSSKYMTQGKASAKRTQQES,mutated_sequence,1.0,1495.0,UPI00004588F6.a2m,UPI00004588F6.npy,gnomAD
+UPI000035B018,UPI000035B018.csv,MRLDEITKSENEDGEKKMFTQQFESTDACVYEVYVKINSIVKSQSQCGLKRGSPETELWGAIVFRSQEENAKPVKETEVTASDADSGLYGFIEYSLYDGFLSYEAPQAFRIDPHDGQICVSQDIDRERDPATYDLLVEAKDGDEQRMTHLALVKGGLSAQAFVRVDLEDVNDNHPVFNPSTYVTSISDETQPGTEIINVLATDQDSGIYGTVAYELIPGNVSSLFTIDSTTGLYSPEVEILSAVNFSADKEVMNSLEMFLPLLRHFKKVERDEEAAEKKFEASRGWFMRFKGRRHLHYLKVQDEAASADGEAAARYVANLAKILGEGIIYLTLPLSHLESTTLSLMVSAQDGGGLTAVINADVTIHIFQTTLAPAEFERPKYTFLVYEDVPEDSPIGTVKAREPLNSSEPIFYRISSGDLGGKFSIHPRLGTIRTRKPLDHETQPVVVLTVQAQLGSAPACSSTEVNITVMDVNDNHPAFLRTSDEIRISQTTPPGTALYLARAEDRDSGRNGLIRYSIASPQPGVFAIDRALGVLFLNGSLGAGEQRELTLTLRAEDQGVHPQAALLVLTVVIEKREHSPSWTFEHLVYQVEVSESLSPMTQMLQTQAHPLGPQRAASPLRYSLEPSVDSAMFGIRPYTGWIYLRRQFDYESTQTYNFRVFAWIPEDGFLQNVSTTVIVRVWDENDNSPTFLHDVLFLKVEESPVPQGVIGKITAIDMDSGKNGQLLYFLLSDGKFFKMNPNTGPAGTIYVITWADGAAAFSGTDFAFSSDELQAFVLKSLFCELGEGELINWVALDREHRGHHEMTVLVTDRGSPPRNATMAVYVSVTDINDNRPFFPQCLPGKELHVKVLEGQPVNMLVTTVFAKDPDEGNNAEVTYSVSSEDSSDHFKIDANNGEIRTTTILSYDYRPSYRMSVIATDQGVPPLQGQAVVNIQVIPLSKGRAIMSQNIRHLIIPENLKPTKIMSLIKSSDHLQQHYNGKLHFSIVADDKDGHFEIDSSTGDLFLSKELDYETTSHYLFRVITTDHSKNLSLSSTVFLSIDVEDQNDHSPSFQDELIVISVEENVPIGTLVYVFNAKDDDGSFLNSRIQYYIESHNPGTNPFLIHPSFGTLVTVSRLDRESIPTVILTVTASDQAVNVTDRRLRSLTAQIVILDVNDHNPTFISFPNAHVKEDVTVGSLVHHITAHDPDEGRNGKVTYSILSGNENMTFMLDESSGLLTTTCPLDYEMKTQHILTVLALDDGTPALSSSQTLTVTVLDVNDEAPVFKQHLYEASVKENQNPGEFVTRVEALDRDSGVNSKLQFEIMPGASFELFEINSDTGEVVTTTILDREIQEVFTLRVLVRDGGFPSLSSTTTILCTVEDENDHAPEFIVSSYDIEVLENQEPEVVYTVLASDMDAGNNRAVEYHIIDGNTDECFTINEMSGELSTTRALDREQISNFTLVILCSDLGDPPRSSVIHLQVRVLDANDHSPSFPTLYYQSSVREDAEVGTVVLVLSAVDKDEGLNGQTEYFLTDEASGAFTIDPMSGTLKTSNTLDREARSQHTFSAVARDCSIQGSRSTTVIIKVYVTDVNDNDPVLEQNPFDVFLSPESPTNQTTVIVRADDLDLGPNGTVVFSFAETQSMFSIDKYTGEIQFQQNPSSEYFPIWLQLKVTDQGIPARTTTGLLVIHMEGEDVKISFSHHLYKGLVTENCEAGTSIVTVKAFAPDSIQDSMKYSIFSGNEDGVLSLCSKSGQLTVKEPKFLDFEVRNEVQLIVLAESSGHRAYCKVAVLIQDENDNSPCFEQSIYQASVSESQLYNAHVIQVFATDLDSGLNGLIEYSILSGNQEEAFQIDALSGVITTKAILDYELTSSYSLIVQATDKGMPRLSNTTVIKVQVTDINDNAPAFLPSEAVEITEDSLPGVIVTHVSVHDVDLNSAFIFSFAKESNPGTKFAIDQNTGVVVLVKTLDFEEMTEYELLIQISDSVHYTEGALVVRVLDVNDNPPVFSQDFYQVTVPESIPVGYSVLTLSATDLESNENISYRILSSSKEFSIDPKNGTIFTISPVLLLDTISTTQFLVEASDGGNPDLRALTLVEIGIEDMNNYAPEFTVKSYNLSLSEDALVGSTLVTFSNIDHDWTRENTYVEYSIISGNSQNNFHVETKFFHSEYPYKQVGYLVLLHSLDREASASHELVILASDSGCPPLSSTAVISIQVLDVNDNPPNFSSLSYHTHVKESTPLGSHITVVSANDRDTGSHAEIIYNIISGNEKGHFYLEENTGVLYLIKPLDYEKMTKFTLTVQASDAEKKHFSFAVVFVSVLDDNDHAPQFMFSSFSCIVPENLPISSTICSINALDFDAGPYGELTYSIVSPCFLTHGMSYDHDLFLIDPLTGDIHAKQILDYENGNKYCLTVQAKDKGDATASLVVWVDIEGIDEFEPIFTQDQYFFTLPEKNKDRQLIGRVEASDADAGIDGVILYSLGTSSPFFSVNKTNGNIYLIRALPLIKSQLNKEDTLEMKIIAHSPKSDSKFASCTVFVNVSFSSEGTPLAVFASSFSISLVVSFLVFLILICILIVMILRHKQKDTINNYEEKKTSSLDADLRVTRDASVLKAFQKTDDCSNEVVPVDATPEWLSLISIMEKDIVNLYRHSNSSGHCSVEGETAEDKEIQRINEHPYRKCSDSALSDHESRVPDSGIPRDSDQLSCLSGETDVMVTAETAEASQTFGEGDQGEGCSTTCAQNNVLPQTVQKREAKESILADVRKESVFISGDQEVRCAALSTQTTSDHDGKDNYHWNYLLSWEPKFQPLASVFNDIAKLKDEHLHMPGIPKEKKSFVFPPPLITAVAQPGIKAVPPRMPAVNLGQVPPKHPRSPIPYHLGSLPEGMTPNFSPSLSLLTMQPPALSPLLREGELLGTHISGTCHELKAEDEVQI,mutated_sequence,1.0,2916.0,UPI000035B018.a2m,UPI000035B018.npy,gnomAD
+UPI0000071C18,UPI0000071C18.csv,MHACSSTALHPRPFIHGFSFTTFLQQLFPHGSSTALHPWPFIHGSSPMALHPRLFTHGPSSTALHPCPFTHGSSPMSLHPRLFTHSPSSMPLHPRPFVHASSPTALRPCLFTHGPSSMPLHPRPFVHASSPTALPPCLFTHGPSSMPLHPRPFLHASSPTALPPCLFTHGPSSMPLHPRPFVHASSPTALRPCLFTHGPSSMPLHPRPFVHASSPTALPPCLFTHGPSSMPLHPRPFVHASSPTALRPCLFTHGPSSMPLHPRPFLHASSPTALRPCLFTHGPSSMPLHPRPFVHASSPTALRPCLFTHGPSSMPLHPRPFVHASSSTSLHPRPFIHASSSKSLHLRLFSHSSCLVGFSQQGNVLLLSARATFNTCLLVSNMYFLIISY,mutated_sequence,1.0,389.0,UPI0000071C18.a2m,UPI0000071C18.npy,gnomAD
+UPI00001C1EF8,UPI00001C1EF8.csv,MASDASHALEAALEQMDGIIAGTKTGADLSDGTCEPGLASPASYMNPFPVLHLIEDLRLALEMLELPQERAALLSQIPGPTAAYIKEWFEESLSQVNHHSAASNETYQERLARLEGDKESLILQVSVLTDQVEAQGEKIRDLEVCLEGHQVKLNAAEEMLQQELLSRTSLETQKLDLMTEVSELKLKLVGMEKEQREQEEKQRKAEELLQELRHLKIKVEELENERNQYEWKLKATKAEVAQLQEQVALKDAEIERLHSQLSRTAALHSESHTERDQEIQRLKMGMETLLLANEDKDRRIEELTGLLNQYRKVKEIVMVTQGPSERTLSINEEEPEGGFSKWNATNKDPEELFKQEMPPRCSSPTVGPPPLPQKSLETRAQKKLSCSLEDLRSESVDKCMDGNQPFPVLEPKDSPFLAEHKYPTLPGKLSGATPNGEAAKSPPTICQPDATGSSLLRLRDTESGWDDTAVVNDLSSTSSGTESGPQSPLTPDGKRNPKGIKKFWGKIRRTQSGNFYTDTLGMAEFRRGGLRATAGPRLSRTRDSKGQKSDANAPFAQWSTERVCAWLEDFGLAQYVIFARQWVSSGHTLLTATPQDMEKELGIKHPLHRKKLVLAVKAINTKQEEKSALLDHIWVTRWLDDIGLPQYKDQFHESRVDRRMLQYLTVNDLLFLKVTSQLHHLSIKCAIHVLHVNKFNPHCLHRRPADESNLSPSEVVQWSNHRVMEWLRSVDLAEYAPNLRGSGVHGGLIILEPRFTGDTLAMLLNIPPQKTLLRRHLTTKFNALIGPEAEQEKREKMASPAYTPLTTTAKVRPRKLGFSHFGNIRKKKFDESTDYICPMEPSDGVSDSHRVYSGYRGLSPLDAPELDGLDQVGQIS,mutated_sequence,1.0,876.0,UPI00001C1EF8.a2m,UPI00001C1EF8.npy,gnomAD
+UPI0000405B22,UPI0000405B22.csv,MGSQVLQILRQGVWASLTGGWFFDPHQSTFSNCFHLYVWIFLLIFPFLLYMVLPPSLMVAGVYCLVVAVIFATIKTVNYRLHAMFDQGEIVEKRSSTMGELEEEPAQGDSNPPRDPGVEMTVFRKVSSTPPVRCSSQHSVFGFNQVSELLPRMEDSGPLRDIKELVREQGSNNVIVTSADREMLKLSSQEKLIGDLPQTPPGAVPDPSLASTDSSEPSPLAGDGAPWSGSSMADTPMSPLLKGSLSQELSKSFLTLTQPDRALVRTSSRREQRRGAGGYQPLDRRGSGEPTPQKAGSSDSCFSGTDRETLSSFKSEKTNSTHLDSPPGGPAPEGSDTDPPSEAELPASPDAGVPSDDTLRSFDTVIGAGTPPGLAEPLLVVRPKDLALLRPSKRQPPLRRHSPPGRAPRRPLLEGGGFFEDEDTSEGSELSPASSLRSQRRYSTDSSSSTSCYSPESSRGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHARVLSMDGAGGDVLRPPLAGCKAELEAQVGVEQAASEPVVLPAEARRGPAANQPGWRGELQEEGAVGGAAEETGRRDRSSSVRRTQAIRRRHNAGSNPTPPASVMGSPPSSLQEAQRGRAASHSRALTLPSALHFASSLLLTRAGANVHEACTFDDTSEGAVHYFYDESGVRRSYTFGLAGGGYENPVGQQGEQTANGAWDRHSHSSSFHSADVPEATGGLNLLQPRPVVLQGMQVRRVPLEIPEEQTLMEEAPPRAQHSYKYWLLPGRWTSVRYERLALLALLDRTRGVLENIFGVGLSSLVAFLGYLLLLKGFFTDIWVFQFCLVIASCQYSLLKSVQPDAASPMHGHNWVIAYSRPVYFCICCLLIWLLDALGSAQPFPPVSLYGLTLFSASFFFCARDVATVFTLCFPFVFLLGLLPQVNTCLMYLLEQIDMHGFGGTAATSPLTAVFSLSRSLLAAALLYGFCLGAIKTPWPEQHVPVLFSVFCGLLVALSYHLSRQSSDPTVLWSLIRSKLFPELEERSLETARAEPPDPLPDKMRQSVREVLHSDLVMCVVIAVLTFAISASTVFIALKSVLGFVLYALAGAVGFFTHYLLPQLRKQLPWFCLSQPVLKPLEYSQYEVRGAAQVMWFEKLYAGLQCVEKYLIYPAVVLNALTVDAHTVVSHPDKYCFYCRALLMTVAGLKLLRSAFCCPPQQYLTLAFTVLLFHFDYPRLSQGFLLDYFLMSLLCSKLWDLLYKLRFVLTYIAPWQITWGSAFHAFAQPFAVPHSAMLFVQALLSGLFSTPLNPLLGSAVFIMSYARPLKFWERDYNTKRVDHSNTRLVTQLDRNPGADDNNLNSIFYEHLTRSLQHTLCGDLVLGRWGNYGPGDCFVLASDYLNALVHLIEVGNGLVTFQLRGLEFRGTYCQQREVEAITEGVEEDEGCCCCEPGHLPRVLSFNAAFGQRWLAWEVTASKYVLEGYSISDNNAASMLQVFDLRKILITYYVKSIIYYVSRSPKLEVWLSHEGITAALRPVRVPGYADSDPTFSLSVDEDYDLRLSGLSLPSFCAVHLEWIQYCASRRSQPVDQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKLHQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVLDDASDEYKIIMLNRRHLSFRVIKVNRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQLRALWGGPISLGAIAHWLLRTWERLHKGCGAGCNSGGNVDDSDCSGGGGLTSLSNNPPVAHPTPENTAGNGDQPLPPGPGWGPRSSLSGSGDGRPPPLLQWPPPRLPGPPPASPIPTEGPRTSRPPGPGLLSSEGPSGKWSLGGRKGLGGSDGEPASGSPKGGTPKSQAPLDLSLSLSLSLSPDVSTEASPPRASQDIPCLDSSAPESGTPMGALGDWPAPIEERESPAAQPLLEHQY,mutated_sequence,1.0,2034.0,UPI0000405B22.a2m,UPI0000405B22.npy,gnomAD
+UPI000012B3DF,UPI000012B3DF.csv,MAEAEGESLESWLNKATNPSNRQEDWEYIIGFCDQINKELEGPQIAVRLLAHKIQSPQEWEALQALTVLEACMKNCGRRFHNEVGKFRFLNELIKVVSPKYLGDRVSEKVKTKVIELLYSWTMALPEEAKIKDAYHMLKRQGIVQSDPPIPVDRTLIPSPPPRPKNPVFDDEEKSKLLAKLLKSKNPDDLQEANKLIKSMVKEDEARIQKVTKRLHTLEEVNNNVRLLSEMLLHYSQEDSSDGDRELMKELFDQCENKRRTLFKLASETEDNDNSLGDILQASDNLSRVINSYKTIIEGQVINGEVATLTLPDSEGNSQCSNQGTLIDLAELDTTNSLSSVLAPAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNALSWLDEELLCLGLADPAPNVPPKESAGNSQWHLLQREQSDLDFFSPRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPVVPASVPAPSAGSSLFSTGVAPALAPKVEPAVPGHHGLALGNSALHHLDALDQLLEEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLFQPLSFQSQGSPPKGPELSLASIHVPLESIKPSSALPVTAYDKNGFRILFHFAKECPPGRPDVLVVVVSMLNTAPLPVKSIVLQAAVPKSMKVKLQPPSGTELSPFSPIQPPAAITQVMLLANPLKEKVRLRYKLTFALGEQLSTEVGEVDQFPPVEQWGNL,mutated_sequence,1.0,723.0,UPI000012B3DF.a2m,UPI000012B3DF.npy,gnomAD
+UPI000006FBBD,UPI000006FBBD.csv,MKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPNIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSFFSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENAEDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRLYKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPICRQYVVRAVHVFKS,mutated_sequence,1.0,372.0,UPI000006FBBD.a2m,UPI000006FBBD.npy,gnomAD
+UPI0000161087,UPI0000161087.csv,MTGEVGSEVHLEINDPNVISQEEADSPSDSGQGSYETIGPLSEGDSDEEIFVSKKLKNRKVLQDSDSETEDTNASPEKTTYDSAEEENKENLYAGKNTKIKRIYKTVADSDESYMEKSLYQENLEAQVKPCLELSLQSGNSTDFTTDRKSSKKHIHDKEGTAGKAKVKSKRRLEKEERKMEKIRQLKKKETKNQEDDVEQPFNDSGCLLVDKDLFETGLEDENNSPLEDEESLESIRAAVKNKVKKHKKKEPSLESGVHSFEEGSELSKGTTRKERKAARLSKEALKQLHSETQRLIRESALNLPYHMPENKTIHDFFKRKPRPTCHGNAMALLKSSKYQSSHHKEIIDTANTTEMNSDHHSKGSEQTTGAENEVETNALPVVSKETQIITGSDESCRKDLVKNEELEIQEKQKQSDIRPSPGDSSVLQQESNFLGNNHSEECQVGGLVAFEPHALEGEGPQNPEETDEKVEEPEQQNKSSAVGPPEKVRRFTLDRLKQLGVDVSIKPRLGADEDSFVILEPETNRELEALKQRFWKHANPAAKPRAGQTVNVNVIVKDMGTDGKEELKADVVPVTLAPKKLDGASHTKPGEKLQVLKAKLQEAMKLRRFEERQKRQALFKLDNEDGFEEEEEEEEEMTDESEEDGEEKVEKEEKEEELEEEEEKEEEEEEEGNQETAEFLLSSEEIETKDEKEMDKENNDGSSEIGKAVGFLSVPKSLSSDSTLLLFKDSSSKMGYFPTEEKSETDENSGKQPSKLDEDDSCSLLTKESSHNSSFELIGSTIPSYQPCNRQTGRGTSFFPTAGGFRSPSPGLFRASLVSSASKSSGKLSEPSLPIEDSQDLYNASPEPKTLFLGAGDFQFCLEDDTQSQLLDADGFLNVRNHRNQYQALKPRLPLASMDENAMDANMDELLDLCTGKFTSQAEKHLPRKSDKKENMEELLNLCSGKFTSQDASTPASSELNKQEKESSMGDPMEEALALCSGSFPTDKEEEDEEEEFGDFRLVSNDNEFDSDEDEHSDSGNDLALEDHEDDDEEELLKRSEKLKRQMRLRKYLEDEAEVSGSDVGSEDEYDGEEIDEYEEDVIDEVLPSDEELQSQIKKIHMKTMLDDDKRQLRLYQERYLADGDLHSDGPGRMRKFRWKNIDDASQMDLFHRDSDDDQTEEQLDESEARWRKERIEREQWLRDMAQQGKITAEEEEEIGEDSQFMILAKKVTAKALQKNASRPMVIQESKSLLRNPFEAIRPGSAQQVKTGSLLNQPKAVLQKLAALSDHNPSAPRNSRNFVFHTLSPVKAEAAKESSKSQVKKRGPSFMTSPSPKHLKTDDSTSGLTRSIFKYLES,mutated_sequence,1.0,1339.0,UPI0000161087.a2m,UPI0000161087.npy,gnomAD
+UPI000020630A,UPI000020630A.csv,MAAGSDLLDEVFFNSEVDEKVVSDLVGSLESQLAASAAHHHHLAPRTPEVRAAAAGALGNHVVSGSPAGAAGAGPAAPAEGAPGAAPEPPPAGRARPGGGGPQRPGPPSPRRPLVPAGPAPPAAKLRPPPEGSAGSCAPVPAAAAVAAGPEPAPAGPAKPAGPAALAARAGPGPGPGPGPGPGPGPGKPAGPGAAQTLNGSAALLNSHHAAAPAVSLVNNGPAALLPLPKPAAPGTVIQTPPFVGAAAPPAPAAPSPPAAPAPAAPAAAPPPPPPAPATLARPPGHPAGPPTAAPAVPPPAAAQNGGSAGAAPAPAPAAGGPAGVSGQPGPGAAAAAPAPGVKAESPKRVVQAAPPAAQTLAASGPASTAASMVIGPTMQGALPSPAAVPPPAPGTPTGLPKGAAGAVTQSLSRTPTATTSGIRATLTPTVLAPRLPQPPQNPTNIQNFQLPPGMVLVRSENGQLLMIPQQALAQMQAQAHAQPQTTMAPRPATPTSAPPVQISTVQAPGTPIIARQVTPTTIIKQVSQAQTTVQPSATLQRSPGVQPQLVLGGAAQTASLGTATAVQTGTPQRTVPGATTTSSAATETMENVKKCKNFLSTLIKLASSGKQSTETAANVKELVQNLLDGKIEAEDFTSRLYRELNSSPQPYLVPFLKRSLPALRQLTPDSAAFIQQSQQQPPPPTSQATTALTAVVLSSSVQRTAGKTAATVTSALQPPVLSLTQPTQVGVGKQGQPTPLVIQQPPKPGALIRPPQVTLTQTPMVALRQPHNRIMLTTPQQIQLNPLQPVPVVKPAVLPGTKALSAVSAQAAAAQKNKLKEPGGGSFRDDDDINDVASMAGVNLSEESARILATNSELVGTLTRSCKDETFLLQAPLQRRILEIGKKHGITELHPDVVSYVSHATQQRLQNLVEKISETAQQKNFSYKDDDRYEQASDVRAQLKFFEQLDQIEKQRKDEQEREILMRAAKSRSRQEDPEQLRLKQKAKEMQQQELAQMRQRDANLTALAAIGPRKKRKVDCPGPGSGAEGSGPGSVVPGSSGVGTPRQFTRQRITRVNLRDLIFCLENERETSHSLLLYKAFLK,mutated_sequence,1.0,1085.0,UPI000020630A.a2m,UPI000020630A.npy,gnomAD
+UPI00003D7962,UPI00003D7962.csv,MELRSELPSVPGAATAAAATATGPPVASVASVAAAAAAAASLPVSVAGGLLRGPPLLLRAAEKYPRTPKCARCRNHGVVSALKGHKRYCRWKDCLCAKCTLIAERQRVMAAQVALRRQQAQEENEARELQLLYGTAEGLALAAANGIIPPRPAYEVFGSVCAADGGGPGAGAPAGTGGGAAGAGGSEAKLQKFDLFPKTLLQAGRPGSPLPPPVKPLSPDGADSGPGTSSPEVRPGSGSENGDGESFSGSPLARASKEAGGSCPGSAGPGGGGEEDSPGSASPLGSESGSEADKEEGEAAPAPGLGGGSGPRQRTPLDILTRVFPGHRRGVLELVLQGCGGDVVQAIEQVLNHHRGGLAAGLGPAAPPDKAAVGAAAAADDAWPSRVDAAAAAAAAAGGPGLPAPLQAGPAAPPHHRPLLAGAMAPGALGSLSSRSAFSPLQPNASHFGADAGAYPLGAPLGLSPLRLAYSAAAAHSRGLAFMAPYSTAGLVPTLGFRPPMDYAFSDLMRDRSAAAAAAVHKEPTYGGGLYGPMVNGAPEKQ,mutated_sequence,1.0,542.0,UPI00003D7962.a2m,UPI00003D7962.npy,gnomAD
+UPI000020E164,UPI000020E164.csv,MIWRNNWKSTTGRLNVKLQSDKLQHGCGPDYSSAWLPANESLWQATTVPSNHRNNHIRRHSIASDSGDTGIGTSCSDSVEDHSTSSGTLSFKPSQSLITLPTAHVMPSNSSASISKLRESLTPDGSKWSTSLMQTLGNHSRGEQDSSLDMKDFRPLRKWSSLSKLTAPDNCGQGGTVCREESRNGLEKIGKAKALTSQLRTIGPSCLHDSMEMLRLEDKEINKKRSSTLDCKYKFESCSKEDFRASSSTLRRQPVDMTYSALPESKPIMTSSEAFEPPKYLMLGQQAVGGVPIQPSVRTQMWLTEQLRTNPLEGRNTEDSYSLAPWQQQQIEDFRQGSETPMQVLTGSSRQSYSPGYQDFSKWESMLKIKEGLLRQKEIVIDRQKQQITHLHERIRDNELRAQHAMLGHYVNCEDSYVASLQPQYENTSLQTPFSEESVSHSQQGEFEQKLASTEKEVLQLNEFLKQRLSLFSEEKKKLEEKVGFSNKVELGQQHFLSI,mutated_sequence,1.0,499.0,UPI000020E164.a2m,UPI000020E164.npy,gnomAD
+UPI00000738BA,UPI00000738BA.csv,MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQQQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSLGSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLPNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLGGTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEIATILAGDVKVKKERDP,mutated_sequence,1.0,796.0,UPI00000738BA.a2m,UPI00000738BA.npy,gnomAD
+UPI0001B300F3,UPI0001B300F3.csv,MEPHVLGAVLYWLLLPCALLAACLLRFSGLSLVYLLFLLLLPWFPGPTRCGLQGHTGRLLRALLGLSLLFLVAHLALQICLHIVPRLDQLLGPSCSRWETLSRHIGVTRLDLKDIPNAIRLVAPDLGILVVSSVCLGICGRLARNTRQSPHPRELDDDERDVDASPTAGLQEAATLAPTRRSRLAARFRVTAHWLLVAAGRVLAVTLLALAGIAHPSALSSVYLLLFLALCTWWACHFPISTRGFSRLCVAVGCFGAGHLICLYCYQMPLAQALLPPAGIWARVLGLKDFVGPTNCSSPHALVLNTGLDWPVYASPGVLLLLCYATASLRKLRAYRPSGQRKEAAKGYEARELELAELDQWPQERESDQHVVPTAPDTEADNCIVHELTGQSSVLRRPVRPKRAEPREASPLHSLGHLIMDQSYVCALIAMMVWSITYHSWLTFVLLLWACLIWTVRSRHQLAMLCSPCILLYGMTLCCLRYVWAMDLRPELPTTLGPVSLRQLGLEHTRYPCLDLGAMLLYTLTFWLLLRQFVKEKLLKWAESPAALTEVTVADTEPTRTQTLLQSLGELVKGVYAKYWIYVCAGMFIVVSFAGRLVVYKIVYMFLFLLCLTLFQVYYSLWRKLLKAFWWLVVAYTMLVLIAVYTFQFQDFPAYWRNLTGFTDEQLGDLGLEQFSVSELFSSILVPGFFLLACILQLHYFHRPFMQLTDMEHVSLPGTRLPRWAHRQDAVSGTPLLREEQQEHQQQQQEEEEEEEDSRDEGLGVATPHQATQVPEGAAKWGLVAERLLELAAGFSDVLSRVQVFLRRLLELHVFKLVALYTVWVALKEVSVMNLLLVVLWAFALPYPRFRPMASCLSTVWTCVIIVCKMLYQLKVVNPQEYSSNCTEPFPNSTNLLPTEISQSLLYRGPVDPANWFGVRKGFPNLGYIQNHLQVLLLLVFEAIVYRRQEHYRRQHQLAPLPAQAVFASGTRQQLDQDLLGCLKYFINFFFYKFGLEICFLMAVNVIGQRMNFLVTLHGCWLVAILTRRHRQAIARLWPNYCLFLALFLLYQYLLCLGMPPALCIDYPWRWSRAVPMNSALIKWLYLPDFFRAPNSTNLISDFLLLLCASQQWQVFSAERTEEWQRMAGVNTDRLEPLRGEPNPVPNFIHCRSYLDMLKVAVFRYLFWLVLVVVFVTGATRISIFGLGYLLACFYLLLFGTALLQRDTRARLVLWDCLILYNVTVIISKNMLSLLACVFVEQMQTGFCWVIQLFSLVCTVKGYYDPKEMMDRDQDCLLPVEEAGIIWDSVCFFFLLLQRRVFLSHYYLHVRADLQATALLASRGFALYNAANLKSIDFHRRIEEKSLAQLKRQMERIRAKQEKHRQGRVDRSRPQDTLGPKDPGLEPGPDSPGGSSPPRRQWWRPWLDHATVIHSGDYFLFESDSEEEEEAVPEDPRPSAQSAFQLAYQAWVTNAQAVLRRRQQEQEQARQEQAGQLPTGGGPSQEVEPAEGPEEAAAGRSHVVQRVLSTAQFLWMLGQALVDELTRWLQEFTRHHGTMSDVLRAERYLLTQELLQGGEVHRGVLDQLYTSQAEATLPGPTEAPNAPSTVSSGLGAEEPLSSMTDDMGSPLSTGYHTRSGSEEAVTDPGEREAGASLYQGLMRTASELLLDRRLRIPELEEAELFAEGQGRALRLLRAVYQCVAAHSELLCYFIIILNHMVTASAGSLVLPVLVFLWAMLSIPRPSKRFWMTAIVFTEIAVVVKYLFQFGFFPWNSHVVLRRYENKPYFPPRILGLEKTDGYIKYDLVQLMALFFHRSQLLCYGLWDHEEDSPSKEHDKSGEEEQGAEEGPGVPAATTEDHIQVEARVGPTDGTPEPQVELRPRDTRRISLRFRRRKKEGPARKGAAAIEAEDREEEEGEEEKEAPTGREKRPSRSGGRVRAAGRRLQGFCLSLAQGTYRPLRRFFHDILHTKYRAATDVYALMFLADVVDFIIIIFGFWAFGKHSAATDITSSLSDDQVPEAFLVMLLIQFSTMVVDRALYLRKTVLGKLAFQVALVLAIHLWMFFILPAVTERMFNQNVVAQLWYFVKCIYFALSAYQIRCGYPTRILGNFLTKKYNHLNLFLFQGFRLVPFLVELRAVMDWVWTDTTLSLSSWMCVEDIYANIFIIKCSRETEKKYPQPKGQKKKKIVKYGMGGLIILFLIAIIWFPLLFMSLVRSVVGVVNQPIDVTVTLKLGGYEPLFTMSAQQPSIIPFTAQAYEELSRQFDPQPLAMQFISQYSPEDIVTAQIEGSSGALWRISPPSRAQMKRELYNGTADITLRFTWNFQRDLAKGGTVEYANEKHMLALAPNSTARRQLASLLEGTSDQSVVIPNLFPKYIRAPNGPEANPVKQLQPNEEADYLGVRIQLRREQGAGATGFLEWWVIELQECRTDCNLLPMVIFSDKVSPPSLGFLAGYGIMGLYVSIVLVIGKFVRGFFSEISHSIMFEELPCVDRILKLCQDIFLVRETRELELEEELYAKLIFLYRSPETMIKWTREKE,mutated_sequence,1.0,2521.0,UPI0001B300F3.a2m,UPI0001B300F3.npy,gnomAD
+UPI00004193C9,UPI00004193C9.csv,MVSRMGWGGRRRRLGRWGDLGPGSVPLLPMPLPPPPPPSCRGPGGGRISIFSLSPAPHTRSSPSSFSPPTAGPPCSVLQGTGASQSCHSALPIPATPPTQAQPAMTPASASPSWGSHSTPPLAPATPTPSQQCPQDSPGLRVGPLIPEQDYERLEDCDPEGSQDSPIHGEEQQPLLHVPEGLRGSWHHIQNLDSFFTKIYSYHQRNGFACILLEDVFQLGQFIFIVTFTTFLLRCVDYNVLFANQPSNHTRPGPFHSKVTLSDAILPSAQCAERIRSSPLLVLLLVLAAGFWLVQLLRSVCNLFSYWDIQVFYREALHIPPGKGREDTGMYWRGQPGGLD,mutated_sequence,1.0,340.0,UPI00004193C9.a2m,UPI00004193C9.npy,gnomAD
+UPI000013D01B,UPI000013D01B.csv,MEANWTAFLFQAHEASHHQQQAAQNSLLPLLSSAVEPPDQKPLLPIPITQKPQGAPETLKDAIGIKKEKPKTSFVCTYCSKAFRDSYHLRRHESCHTGIKLVSRPKKTPTTVVPLISTIAGDSSRTSLVSTIAGILSTVTTSSSGTNPSSSASTTAMPVTQSVKKPSKPVKKNHACEMCGKAFRDVYHLNRHKLSHSDEKPFECPICNQRFKRKDRMTYHVRSHEGGITKPYTCSVCGKGFSRPDHLSCHVKHVHSTERPFKCQTCTAAFATKDRLRTHMVRHEGKVSCNICGKLLSAAYITSHLKTHGQSQSINCNTCKQGISKTCMSEETSNQKQQQQQQQQQQQQQQQQQQHVTSWPGKQVETLRLWEEAVKARKKEAANLCQTSTAATTPVTLTTPFSITSSVSSGTMSNPVTVAAAMSMRSPVNVSSAVNITSPMNIGHPVTITSPLSMTSPLTLTTPVNLPTPVTAPVNIAHPVTITSPMNLPTPMTLAAPLNIAMRPVESMPFLPQALPTSPPW,mutated_sequence,1.0,521.0,UPI000013D01B.a2m,UPI000013D01B.npy,gnomAD
+UPI0000366BAD,UPI0000366BAD.csv,MSAAKENPCRKFQANIFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTINMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSSSIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLKAEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHRRAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKEGEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEFRPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQRWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSKSVIEQVSWDT,mutated_sequence,1.0,1025.0,UPI0000366BAD.a2m,UPI0000366BAD.npy,gnomAD
+UPI000012DA10,UPI000012DA10.csv,MPRAPAPLYACLLGLCALLPRLAGLNICTSGSATSCEECLLIHPKCAWCSKEDFGSPRSITSRCDLRANLVKNGCGGEIESPASSFHVLRSLPLSSKGSGSAGWDVIQMTPQEIAVNLRPGDKTTFQLQVRQVEDYPVDLYYLMDLSLSMKDDLDNIRSLGTKLAEEMRKLTSNFRLGFGSFVDKDISPFSYTAPRYQTNPCIGYKLFPNCVPSFGFRHLLPLTDRVDSFNEEVRKQRVSRNRDAPEGGFDAVLQAAVCKEKIGWRKDALHLLVFTTDDVPHIALDGKLGGLVQPHDGQCHLNEANEYTASNQMDYPSLALLGEKLAENNINLIFAVTKNHYMLYKNFTALIPGTTVEILDGDSKNIIQLIINAYNSIRSKVELSVWDQPEDLNLFFTATCQDGVSYPGQRKCEGLKIGDTASFEVSLEARSCPSRHTEHVFALRPVGFRDSLEVGVTYNCTCGCSVGLEPNSARCNGSGTYVCGLCECSPGYLGTRCECQDGENQSVYQNLCREAEGKPLCSGRGDCSCNQCSCFESEFGKIYGPFCECDNFSCARNKGVLCSGHGECHCGECKCHAGYIGDNCNCSTDISTCRGRDGQICSERGHCLCGQCQCTEPGAFGEMCEKCPTCPDACSTKRDCVECLLLHSGKPDNQTCHSLCRDEVITWVDTIVKDDQEAVLCFYKTAKDCVMMFTYVELPSGKSNLTVLREPECGNTPNAMTILLAVVGSILLVGLALLAIWKLLVTIHDRREFAKFQSERSRARYEMASNPLYRKPISTHTVDFTFNKFNKSYNGTVD,mutated_sequence,1.0,799.0,UPI000012DA10.a2m,UPI000012DA10.npy,gnomAD
+UPI0001E8F4C1,UPI0001E8F4C1.csv,XGLGLGLWLGAGLSAARRLEQVPSLLGRRRKLHLPDPDLASWGPGRSGSGGGRWDCMCECLLRTDPVPEEGEDVAATISATETLSEEEQEELRRELAKVEEEIQTLSQVLAAKEKHLAEIKRKLGINSLQELKQNIAKGWQDVTATSA,mutated_sequence,1.0,148.0,UPI0001E8F4C1.a2m,UPI0001E8F4C1.npy,gnomAD
+UPI0002A47099,UPI0002A47099.csv,MTGCFLWQYHLPKLKTGSLGPEETSAPVRMCPRHPEPVPLAHPLPVLKEALEKVDQILRQAMSAPGVAAMSAVVIHNDTVLWTGNFGKKNGSDPASGAPNEYTMYRISSISKIFPVLMLYRLWEEGIVASLDDPLERYASTFTINNPLGLASAEQQGLMDGLEQVGPAPRPSPVTLRRMASQLSGLPRRLRSTSLLWKGSTQEALNLLKDDVLVVDPGTRCHYSTLAFSLLAHVLAAHTAQGDYQRWVSENVLEPLGMADTGFDLTPDVRARLAAGFYGSGRPAPLYDLGWYRPSGQMYSTAADLAKLAVALLGGGPRRLLRPDAAKTLLAPLLACPGAYFANETGTPWEFHAQRGYRVVRKDGDLDGYAATFSLVPPLRLGLVLLLAGPRPPGPDLVARAYDELLPALERALREAEPGPAPPPTAHPFAGYFTFANLTFYEVRAGPAGELRLRQFGPRVEALVPPAFRTLALRHLHGRVFQLHVAHEFPCALPLGDAWLSLEAQHGQLVNFYPLDHHGLSPGFDVPGLNTYRVLRLRGKPVFKTQ,mutated_sequence,1.0,546.0,UPI0002A47099.a2m,UPI0002A47099.npy,gnomAD
+UPI0002840CC6,UPI0002840CC6.csv,MFGSSRYLGSSEQPRANSLGPSDRTLVLCSLVEGEDKVNPSEPHGLRMEEKWLLKGKLRNQRNQNKLLSPNKKQRKNHTSKLQELALLLPIALKTGTKKLTKKEILVHVLQYIQYLQRNIDAAKALFKCHITTGEGGLAGLGQKPAWGPARRRRHSTPSSSPSSQKSCLQGACQKPRKKKLTQASESQTRTPKPRRSLALNKPEKLVAPSPDQKGSGTGGTTTPPRCPDSCGHPRPASSSPPGDRKGGQSQLTLLDLAEDTIHCDISSCWCQGSVQDDAPFPALLAQEDVARIHFLNKTQPHPRQKLVFYDSSEDVDKGSLDADPWLPAWTPENSPQGSPLFLGPPQIDVWSGTGHPSEILGLSPSLFSSPGKLLPDEILEDDMEYLTQAAFFEEVCLDLESSPSAYTQEAPQEKDTASKAPKDPPESHSLHRSSVSLDHCYLSLSGNSKAPSSSSSSSSSSSSSEDSDSEPLWKQREDMQANPVGTPGSSEEDEDTTWTPTRLASPLLAAEKKATKGQVARAPVKPKEKKKGPCPPQMKKKCVNGFIMFCRMNRKQYIRSCPGTASTAATKELAQLWRVMTQQERRPYCTKARRFSRQHNRIVKQDGSSSEAEDWETPKPFYQLLAEKALPLPPHLQ,mutated_sequence,1.0,638.0,UPI0002840CC6.a2m,UPI0002840CC6.npy,gnomAD
+UPI0000160791,UPI0000160791.csv,MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPVKEPPEINLVLYPQGLTGEEVYVKVDLRVKCPPTYPDVVPEIELKNAKGLSNESVNLLKSRLEELAKKHCGEVMIFELAYHVQSFLSEHNKPPPKSFHEEMLERRAQEEQQRLLEAKRKEEQEQREILHEIQRRKEEIKEEKKRKEMAKQERLEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHRANSSGRSRRERQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGKLVYNALETATGGFVLLYEWVLQWQKKMGPFLTSQEKEKIDKCKKQIQGTETEFNSLVKLSHPNVVRYLAMNLKEQDDSIVVDILVEHISGVSLAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNSVVHKVLSASNVLVDAEGTVKITDYSISKRLADICKEDVFEQTRVRFSDNALPYKTGKKGDVWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFINPQPKMPLVEQSPEDSEGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGAFGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHERPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSASARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDIIFDNEDENSKSQNQDEDCNEKNGCHESEPSVTTEAVHYLYIQMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIHEKGMIHRDLKPVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTGMVGTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSPKFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLTNVDGKAYRTMMAQIFSQRISPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIRIFKRHGAVQLCTPLLLPRNRQIYEHNEAALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIERVFRPRKLDRFHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQERNYSIYLNHTMLLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKFIEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEVVGLLKKLGIKLQVLINLGLVYKVQQHNGIIFQFVAFIKRRQRAVPEILAAGGRYDLLIPQFRGPQALGPVPTAIGVSIAIDKISAAVLNMEESVTISSCDLLVVSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQEYCRHHEITYVALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNGREASDNLAVQNLKGSFSNASGLFEIHGATVVPIVSVLAPEKLSASTRRRYETQVQTRLQTSLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFNTTVKQLLSRLPKQRYLKLVCDEIYNIKVEKKVSVLFLYSYRDDYYRILF,mutated_sequence,1.0,1649.0,UPI0000160791.a2m,UPI0000160791.npy,gnomAD
+UPI000041A871,UPI000041A871.csv,MWPQPRLPPHPAMSEKTQQGKLAAAKKKLKAYWQRKSPGIPAGANRKKKVNGSSPDTATSGGYHSPGDSATGIYGEGRASSTTLQDLESQYQELAVALDSSSAIISQLTENINSLVRTSKEEKKHEIHLVQKLGRSLFKLKNQTAEPLAPEPPAGPSKVEQLQDETNHLRKELESVGRQLQAEVENNQMLSLLNRRQEERLREQEERLHEQEERLHEQEERLCEQEERLCEQEERLREQEERLREQEERLHEQEERLCEQEERLCEQEERLCEQEKLPGQERLLEEVEKLLEQERRQEEQERLLERERLLDEVEELLEQERLRQQDERLWQQETLRELERLRELERLRELERLRELERMLELGWEALYEQRAEPRSGFEELNNENKSTLQLEQQVKELEKSGGAEEPRGSESAAAARPVPGAPVPQGAWMCGQAGWTPQEHPGLSGEAVGTGEAAGGAEEAACHSFRAAENRELNITII,mutated_sequence,1.0,479.0,UPI000041A871.a2m,UPI000041A871.npy,gnomAD
+UPI000049DE26,UPI000049DE26.csv,MSRRKQSNPRQIKRSLGDMEAREEVQLVGASHMEQKATAPEAPSPPSADVNSPPPLPPPTSPGGPKELEGQEPEPRPTEEEPGSPWSGPDELEPVVQDGQRRIRARLSLATGLSWGPFHGSVQTRASSPRQAEPSPALTLLLVDEACWLRTLPQALTEAEANTEIHRKDDALWCRVTKPVPAGGLLSVLLTAEPHSTPGHPVKKEPAEPTCPAPAHDLQLLPQQAGMASILATAVINKDVFPCKDCGIWYRSERNLQAHLLYYCASRQGTGSPAAAATDEKPKETYPNERVCPFPQCRKSCPSASSLEIHMRSHSGERPFVCLICLSAFTTKANCERHLKVHTDTLSGVCHSCGFISTTRDILYSHLVTNHMVCQPGSKGEIYSPGAGHPATKLPPDSLGSFQQQHTALQGPLASADLGLAPTPSPGLDRKALAEATNGEARAEPLAQNGGSSEPPAAPRSIKVEAVEEPEAAPILGPGEPGPQAPSRTPSPRSPAPARVKAELSSPTPGSSPVPGELGLAGALFLPQYVFGPDAAPPASEILAKMSELVHSRLQQGAGAGAGGAQTGLFPGAPKGATCFECEITFSNVNNYYVHKRLYCSGRRAPEDAPAARRPKAPPGPARAPPGQPAEPDAPRSSPGPGAREEGAGGAATPEDGAGGRGSEGSQSPGSSVDDAEDDPSRTLCEACNIRFSRHETYTVHKRYYCASRHDPPPRRPAAPPGPPGPAAPPAPSPAAPVRTRRRRKLYELHAAGAPPPPPPGHAPAPESPRPGSGSGSGPGLAPARSPGPAADGPIDLSKKPRRPLPGAPAPALADYHECTACRVSFHSLEAYLAHKKYSCPAAPPPGALGLPAAACPYCPPNGPVRGDLLEHFRLAHGLLLGAPLAGPGVEARTPADRGPSPAPAPAASPQPGSRGPRDGLGPEPQEPPPGPPPSPAAAPEAVPPPPAPPSYSDKGVQTPSKGTPAPLPNGNHRYCRLCNIKFSSLSTFIAHKKYYCSSHAAEHVK,mutated_sequence,1.0,1006.0,UPI000049DE26.a2m,UPI000049DE26.npy,gnomAD
+UPI000012DC9B,UPI000012DC9B.csv,MEPWPCSPGGGGGTRARHVIINVGGCRVRLAWAALARCPLARLERLRACRGHDDLLRVCDDYDVSRDEFFFDRSPCAFRAIVALLRAGKLRLLRGPCALAFRDELAYWGIDEARLERCCLRRLRRREEEAAEARAGPTERGAQGSPARALGPRGRLQRGRRRLRDVVDNPHSGLAGKLFACVSVSFVAVTAVGLCLSTMPDIRAEEERGECSPKCRSLFVLETVCVAWFSFEFLLRSLQAESKCAFLRAPLNIIDILALLPFYVSLLLGLAAGPGGTKLLERAGLVLRLLRALRVLYVMRLARHSLGLRSLGLTMRRCAREFGLLLLFLCVAMALFAPLVHLAERELGARRDFSSVPASYWWAVISMTTVGYGDMVPRSLPGQVVALSSILSGILLMAFPVTSIFHTFSRSYSELKEQQQRAASPEPALQEDSTHSATATEDSSQGPDSAGLADDSADALWVRAGR,mutated_sequence,1.0,466.0,UPI000012DC9B.a2m,UPI000012DC9B.npy,gnomAD
+UPI000013FD3E,UPI000013FD3E.csv,MRGQGRRKGHRGSPARPSGAVLRVLRGCSTCGIPRAGSLLWRGKEGASGSEPSSAAHRSLPAQLAAPGGGARPGSRGVTWSSGRCNPGRLWGHVREVSYPCVSAGPLGHWELNEVVMSQVFSKIRTRHQTEEAGAPESTQGRVGEEGGRGCSAGAGRPQPRGTRRRAKTCHPAREGREGPREGAPFQPGETRVLAASLSSRQRAPVLTPAPFPRSMLLTKWPGEECRSPRGQALPKH,mutated_sequence,1.0,237.0,UPI000013FD3E.a2m,UPI000013FD3E.npy,gnomAD
+UPI00004191D3,UPI00004191D3.csv,MPRAQLPEDSSAVDMDILFPLDSVIGTELCPSPIPQIIHFVLFVVFSLVILIILRLYIPREPSSVPPREEDSENDQAEVGEWLRIGNKYITLKDYRILLKELENLEIYTFLSKKCLKKLSREGSSHHLPRQVRPGPVYKPAPARNHRPRGGRGKASPTSFHVSPRAPLAPLASMPSSVPKTSVESLGSPSSLSSSKPREPLCPLKHPSHQPPASTLSPNPTSSTESLGETPHAGRWRQGSRFPQPGCANAAGRIRHQNPRHSHGHRISDIHEQLGIS,mutated_sequence,1.0,277.0,UPI00004191D3.a2m,UPI00004191D3.npy,gnomAD
+UPI0000359605,UPI0000359605.csv,MAAAGARLSPGPGSGLRGRPRLCFHPGPPPLLPLLLLFLLLLPPPPLLAGATAAASREPDSPCRLKTVTVSTLPALRESDIGWSGARAGAGAGTGAGAAAAAASPGSPGSAGTAAESRLLLFVRNELPGRIAVQDDLDNTELPFFTLEMSGTAADISLVHWRQQWLENGTLYFHVSMSSSGQLAQATAPTLQEPSEIVEEQMHILHISVMGGLIALLLLLLVFTVALYAQRRWQKRRRIPQKSASTEATHEIHYIPSVLLGPQARESFRSSRLQTHNSVIGVPIRETPILDDYDCEEDEEPPRRANHVSREDEFGSQVTHTLDSLGHPGEEKVDFEKKAAAEATQETVESLMQKFKESFRANTPIEIGQLQPPLRSTSAGKRKRRSKSRGGISFGRAKGTSGSEADDETQLTFYTEQYRSRRRSKGLLKSPVNKTALTLIAVSSCILAMVCGSQMSCPLTVKVTLHVPEHFIADGSSFVVSEGSYLDISDWLNPAKLSLYYQINATSPWVRDLCGQRTTDACEQLCDPETGECSCHEGYAPDPVHRHLCVRSDWGQSEGPWPYTTLERGYDLVTGEQAPEKILRSTFSLGQGLWLPVSKSFVVPPVELSINPLASCKTDVLVTEDPADVREEAMLSTYFETINDLLSSFGPVRDCSRNNGGCTRNFKCVSDRQVDSSGCVCPEELKPMKDGSGCYDHSKGIDCSDGFNGGCEQLCLQQTLPLPYDATSSTIFMFCGCVEEYKLAPDGKSCLMLSDVCEGPKCLKPDSKFNDTLFGEMLHGYNNRTQHVNQGQVFQMTFRENNFIKDFPQLADGLLVIPLPVEEQCRGVLSEPLPDLQLLTGDIRYDEAMGYPMVQQWRVRSNLYRVKLSTITLAAGFTNVLKILTKESSREELLSFIQHYGSHYIAEALYGSELTCIIHFPSKKVQQQLWLQYQKETTELGSKKELKSMPFITYLSGLLTAQMLSDDQLISGVEIRCEEKGRCPSTCHLCRRPGKEQLSPTPVLLEINRVVPLYTLIQDNGTKEAFKSALMSSYWCSGKGDVIDDWCRCDLSAFDANGLPNCSPLLQPVLRLSPTVEPSSTVVSLEWVDVQPAIGTKVSDYILQHKKVDEYTDTDLYTGEFLSFADDLLSGLGTSCVAAGRSHGEVPEVSIYSVIFKCLEPDGLYKFTLYAVDTRGRHSELSTVTLRTACPLVDDNKAEEIADKIYNLYNGYTSGKEQQMAYNTLMEVSASMLFRVQHHYNSHYEKFGDFVWRSEDELGPRKAHLILRRLERVSSHCSSLLRSAYIQSRVETVPYLFCRSEEVRPAGMVWYSILKDTKITCEEKMVSMARNTYGESKGR,mutated_sequence,1.0,1339.0,UPI0000359605.a2m,UPI0000359605.npy,gnomAD
+UPI000013CC65,UPI000013CC65.csv,MKRLSLVTTNRLSPHGNFLPLCTFPLAVDMAALFQEASSCPVCSDYLEKPMSLECGCAVCFKCINSLQKEPHGEDLLCCCCSMVSQKNKIRPSWQLERLASHIKELEPKLKKILQMNPRMRKFQVDMTLDADTANNFLLISDDLRSVRSGCITQNRQDLAERFDVSICILGSPRFTCGRHYWEVDVGTSTEWDLGVCRESVHRKGRIHLTTERGFWTVSLRDGSRLSASTVPLTFLFVDRKLQRVGIFLDMGMQNVSFFDAEGGSHVYTFRSVSAEEPLHLFFAPPSPPNGDKSVLSICPVINPGTTDAPVHPGEAK,mutated_sequence,1.0,317.0,UPI000013CC65.a2m,UPI000013CC65.npy,gnomAD
+UPI00001B03C6,UPI00001B03C6.csv,MPLLLLGETEPLKLERDCRSPVDPWAAASPDLALACLCHCQDLSSGAFPDRGVLGGVLFPTVEMVIKVFVATSSGSIAIRKKQQEVVGFLEANKIDFKELDIAGDEDNRRWMRENVPGEKKPQNGIPLPPQIFNEEQYCGDFDSFFSAKEENIIYSFLGLAPPPDSKGSEKAEEGGETEAQKEGSEDVGNLPEAQEKNEEEGETATEETEEIAMEGAEGEAEEEEETAEGEEPGEDEDS,mutated_sequence,1.0,239.0,UPI00001B03C6.a2m,UPI00001B03C6.npy,gnomAD
+UPI000059DB31,UPI000059DB31.csv,XVSCGMMEALLKVIKFLGDEQDQITFVTRAVRVVDLITNLDMAAFQSHSGLSIFIYRLEHEVDLCRKECPFVIKPKIQRPNTTQEGEEMETDMDVADVAMESSPGSSISMEHRLDVELRASGSSSSTNISSGPSPGPSPGTGPGPGPGPGPGPGPGPGPGPGPGPGPGPGPGPRPGVQCIPQRAALLKSMLNFLKKAIQDPAFSDGIRHVMDGSLPTSLKHIISNAEY,mutated_sequence,1.0,228.0,UPI000059DB31.a2m,UPI000059DB31.npy,gnomAD
+UPI0000E5AEF3,UPI0000E5AEF3.csv,MSFSEMNRRTLAFRGGGLVTASGGGSTNNNAGGEASAWPPQPQPRQPPPPAPPALQPPNGRGADEEVELEGLEPQDLEASAGPAAGAAEEAKELLLPQDAGGPTSLGGGAGGPLLAERNRRTLAFRGGGGGGLGNNGSSRGRPETSVWPLRHFNGRGPATVDLELDALEGKELMQDGASLSDSTEDEEEGASLGDGSGAEGGSCSSSRRSGGDGGDEVEGSGVGAGEGETVQHFPLARPKSLMQKLQCSFQTSWLKDFPWLRYSKDTGLMSCGWCQKTPADGGSVDLPPVGHDELSRGTRNYKKTLLLRHHVSTEHKLHEANAQESEIPSEEGYCDFNSRPNENSYCYQLLRQLNEQRKKGILCDVSIVVSGKIFKAHKNILVAGSRFFKTLYCFSNKESPNQNNTTHLDIAAVQGFSVILDFLYSGNLVLTSQNAIEVMTVASYLQMSEVVQTCRNFIKDALNISIKSEAPESVVVDYNNRKPVNRDGLSSSRDQKIASFWATRNLTNLASNVKIENDGCNVDEGQIENYQMNDSSWVQDGSPEMAENESEGQTKVFIWNNMGSQGIQETGKTRRKNQTTKRFIYNIPPNNETNLEDCSVMQPPVAYPEENTLLIKEEPDLDGALLSGPDGDRNVNANLLAEAGTSQDGGDAGTSHDFKYGLMPGPSNDFKYGLIPGTSNDFKYGLIPGASNDFKYGLLPESWPKQETWENGESSLIMNKLKCPHCSYVAKYRRTLKRHLLIHTGVRSFSCDICGKLFTRREHVKRHSLVHKKDKKYKCMVCKKIFMLAASVGIRHGSRRYGVCVDCADKSQPGGQEGVDQGQDTEFPRDEEYEENEVGEADEELVDDGEDQNDPSRWDESGEVCMSLDD,mutated_sequence,1.0,871.0,UPI0000E5AEF3.a2m,UPI0000E5AEF3.npy,gnomAD
+UPI000155B91C,UPI000155B91C.csv,MESRSVAQAGVQWCDLGSLQAPPPGFTLFSCLSLLSSWDYSSGFSGFCASPIEESHGALISSCNSRTMTDGLVTFRDVAIDFSQEEWECLDPAQRDLYVDVMLENYSNLVSLDLESKTYETKKIFSENDIFEINFSQWEMKDKSKTLGLEASIFRNNWKCKSIFEGLKGHQEGYFSQMIISYEKIPSYRKSKSLTPHQRIHNTEKSYVCKECGKACSHGSKLVQHERTHTAEKHFECKECGKNYLSAYQLNVHQRFHTGEKPYECKECGKTFSWGSSLVKHERIHTGEKPYECKECGKAFSRGYHLTQHQKIHTGVKSYKCKECGKAFFWGSSLAKHEIIHTGEKPYKCKECGKAFSRGYQLTQHQKIHTGKKPYECKICGKAFCWGYQLTRHQIFHTGEKPYECKECGKAFNCGSSLIQHERIHTGEKPYECKECGKAFSRGYHLSQHQKIHTGEKPFECKECGKAFSWGSSLVKHERVHTGEKSHECKECGKTFCSGYQLTRHQVFHTGEKPYECKECGKAFNCGSSLVQHERIHTGEKPYECKECGKAFSRGYHLTQHQKIHTGEKPFKCKECGKAFSWGSSLVKHERVHTNEKSYECKDCGKAFGSGYQLSVHQRFHTGEKLYQRKEFGKTFTCGSKLVHERTHSNDKPYKYNECGEAFLWTTYSNEKIDTDETL,mutated_sequence,1.0,679.0,UPI000155B91C.a2m,UPI000155B91C.npy,gnomAD
+UPI000013E535,UPI000013E535.csv,MEASALTSSAVTSVAKVVRVASGSAVVLPLARIATVVIGGVVAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY,mutated_sequence,1.0,119.0,UPI000013E535.a2m,UPI000013E535.npy,gnomAD
+UPI0001B3CB30,UPI0001B3CB30.csv,MKGARWRRVPWVSLSCLCLCLLPHVVPGTTEDTLITGSKTAAPVTSTGSTTATLEGQSTAASSRTSNQDISASSQNHQTKSTETTSKAQTDTLTQMMTSTLFSSPSVHNVMETAPPDEMTTSFPSSVTNTLMMTSKTITMTTSTDSTLGNTEETSTAGTESSTPVTSAVSITAGQEGQSRTTSWRTSIQDTSASSQNHWTRSTQTTRESQTSTLTHRTTSTPSFSPSVHNVTGTVSQKTSPSGETATSSLCSVTNTSMMTSEKITVTTSTGSTLGNPGETSSVPVTGSLMPVTSAALVTFDPEGQSPATFSRTSTQDTTAFSKNHQTQSVETTRVSQINTLNTLTPVTTSTVLSSPSGFNPSGTVSQETFPSGETTTSSPSSVSNTFLVTSKVFRMPTSRDSTLGNTEETSLSVSGTISAITSKVSTIWWSDTLSTALSPSSLPPKISTAFHTQQSEGAETTGRPHERSSFSPGVSQEIFTLHETTTWPSSFSSKGHTTWSQTELPSTSTGAATRLVTGNPSTGTAGTIPRVPSKVSAIGEPGEPTTYSSHSTTLPKTTGAGAQTQWTQETGTTGEALLSSPSYSVTQMIKTATSPSSSPMLDRHTSQQITTAPSTNHSTIHSTSTSPQESPAVSQRGHTQAPQTTQESQTTRSVSPMTDTKTVTTPGSSFTASGHSPSEIVPQDAPTISAATTFAPAPTGDGHTTQAPTTALQAAPSSHDATLGPSGGTSLSKTGALTLANSVVSTPGGPEGQWTSASASTSPDTAAAMTHTHQAESTEASGQTQTSEPASSGSRTTSAGTATPSSSGASGTTPSGSEGISTSGETTRFSSNPSRDSHTTQSTTELLSASASHGAIPVSTGMASSIVPGTFHPTLSEASTAGRPTGQSSPTSPSASPQETAAISRMAQTQRTRTSRGSDTISLASQATDTFSTVPPTPPSITSTGLTSPQTETHTLSPSGSGKTFTTALISNATPLPVTYASSASTGHTTPLHVTDASSVSTGHATPLPVTSPSSVSTGHTTPLPVTDTSSESTGHVTPLPVTSFSSASTGDSTPLPVTDTSSASTGHVTPLPVTSLSSASTGDTTPLPVTDTSSASTGHATSLPVTDTSSVSTGHTTPLPVTDTSSASTGHATSLPVTDTSSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTGHVTPLHVTSPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTGHVTPLHVTSPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLPVTSPSSASTGHTTPLPVTDTSSASKGDTTPLPVTSPSSASTGHTTPLPVTDTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASTGHATPLPVTSTSSASTGHATPLPVTGLSSATTDDTTRLPVTDVSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHASPLLVTDASSASTGQATPLPVTDTSSVSTAHATPLPVTGLSSASTDDTTRLPVTDVSSASTGQAIPLPVTSPSSASTGDTTPLPVTDASSASTGDTTSLPVTIPSSASSGHTTSLPVTDASSVSTGHATSLLVTDASSVSTGDTTPLPVTDTNSASTGDTTPLHVTDASSVSTGHATSLPVTSLSSASTGDTTPLPVTSPSSASSGHTTPLPVTDASSVPTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATPLPVTDTSSVSTGQATPLPVTSLSSASTGDTTPLPVTDTSSASTGQDTPLPVTSLSSVSTGDTTPLPVTNPSSASTGHATPLLVTDASSISTGHATSLLVTDASSVSTGHATALHDTDASSLSTGDTTPLPVTSPSSTSTGDTTPLPVTETSSVSTGHATSLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPVTDTSSASTGQATPLPVTSPSSASTGHAIPLLVTDTSSASTGQATPLPVTSLSSASTGDTTPLPVTDASSVSTGHATSLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLHVTDASSASTGHATPLPVTSLSSASTGDTTPLPVTSPSSASTGHATPLHVTDASSVSTGDTTPLPVTSSSSASSGHTTPLPVTDASSASTGDTTPLPVTDTSSASTGHATHLPVTGLSSASTGDTTRLPVTNVSSASTGHATPLPVTSTSSASTGDTTPLPGTDTSSVSTGHTTPLLVTDASSVSTGDTTRLPVTSPSSASTGHTTPLPVTDTPSASTGDTTPLPVTNASSLSTRHATSLHVTSPSSASTGHATSLPVTDTSAASTGHATPLPVTSTSSASTGDTTPLPVTDTYSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTGDTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVTETSSVSTGDTTPLPVTDTSSASTGHATPLPVTNTSSVSTGHATPLHVTSPSSASTGHTTPLPVTDASSVSTGHATSLPVTDASSVFTGHATSLPVTIPSSASSGHTTPLPVTDASSVSTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATPLPLTSLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPDTDTSSASTGHATLLPVTDTSSASIGHATSLPVTDTSSISTGHATPLHVTSPSSASTGHATPLPVTDTSSASTGHANPLHVTSPSSASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHTTPLPVTDTSSASTGQATALPVTSTSSASTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTGDTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVTETSSVSTGHATPLLVTDASSASTGHATPLHVTSPSSASTGDTTPVPVTDTSSVSTGHATPLPVTGLSSASTGDTTRLPVTDISSASTGQATPLPVTNTSSVSTGDTMPLPVTSPSSASTGHATPLPVTSTSSASTGHATPVPVTSTSSASTGHTTPLPVTDTSSASTGDTTPLPVTSPSSASTGHTTPLHVTIPSSASTGDTSTLPVTGASSASTGHATPLPVTDTSSVSTGHATPLPVTSLSSVSTGDTTPLPVTDASSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATPLPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATPLPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVTIPSSSSSGHTTPLPVTSTSSVSTGHVTPLHVTSPSSASTGHVTPLPVTSTSSASTGHATPLLVTDASSVSTGHATPLPVTDASSASTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTDASSASTGHATPLPVTIPSSVSTGDTMPLPVTSPSSASTGHATPLPVTGLSSASTGDTTPLPVTDTSSASTRHATPLPVTDTSSASTDDTTRLPVTDVSSASTGHATPLPVTSTSSASTGDTTPLPVTDTSSVSTGHATSLPVTSRSSASTGHATPLPVTDTSSVSTGHATPLPVTSTSSVSTGHATPLPVTSPSSASTGHATPVPVTSTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASRGDTSTLPVTDASSASTGHATPLPLTSLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTIPSSASSGHTTSLPVTDASSVSTGHGTPLPVTSTSSASTGDTTPLPVTDTSSASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTGHATPLAVSSATSASTVSSDSPLKMETPGMTTPSLKTDGGRRTATSPPPTTSQTIISTIPSTAMHTRSTAAPIPILPERGVSLFPYGAGAGDLEFVRRTVDFTSPLFKPATGFPLGSSLRDSLYFTDNGQIIFPESDYQIFSYPNPLPTGFTGRDPVALVAPFWDDADFSTGRGTTFYQEYETFYGEHSLLVQQAESWIRKMTNNGGYKARWALKVTWVNAHAYPAQWTLGSNTYQAILSTDGSRSYALFLYQSGGMQWDVAQRSGNPVLMGFSSGDGYFENSPLMSQPVWERYRPDRFLNSNSGLQGLQFYRLHREERPNYRLECLQWLKSQPRWPSWGWNQVSCPCSWQQGRRDLRFQPVSIGRWGLGSRQLCSFTSWRGGVCCSYGPWGEFREGWHVQRPWQLAQELEPQSWCCRWNDKPYLCALYQQRRPHVGCATYRPPQPAWMFGDPHITTLDGVSYTFNGLGDFLLVGAQDGNSSFLLQGRTAQTGSAQATNFIAFAAQYRSSSLGPVTVQWLLEPHDAIRVLLDNQTVTFQPDHEDGGGQETFNATGVLLSRNGSEVSASFDGWATVSVIALSNILHASASLPPEYQNRTEGLLGVWNNNPEDDFRMPNGSTIPPGSPEEMLFHFGMTWQINGTGLLGKRNDQLPSNFTPVFYSQLQKNSSWAEHLISNCDGDSSCIYDTLALRNASIGLHTREVSKNYEQANATLNQYPPSINGGRVIEAYKGQTTLIQYTSNAEDANFTLRDSCTDLELFENGTLLWTPKSLEPFTLEILARSAKIGLASALQPRTVVCHCNAESQCLYNQTSRVGNSSLEVAGCKCDGGTFGRYCEGSEDACEEPCFPSVHCVPGKGCEACPPNLTGDGRHCAALGSSFLCQNQSCPVNYCYNQGHCYISQTLGCQPMCTCPPAFTDSRCFLAGNNFSPTVNLELPLRVIQLLLSEEENASMAEVNASVAYRLGTLDMRAFLRNSQVERIDSAAPASGSPIQHWMVISEFQYRPRGPVIDFLNNQLLAAVVEAFLYHVPRRSEEPRNDVVFQPISGEDVRDVTALNVSTLKAYFRCDGYKGYDLVYSPQSGFTCVSPCSRGYCDHGGQCQHLPSGPRCSCVSFSIYTAWGEHCEHLSMKLDAFFGIFFGALGGLLLLGVGTFVVLRFWGCSGARFSYFLNSAEALP,mutated_sequence,1.0,5412.0,UPI0001B3CB30.a2m,UPI0001B3CB30.npy,gnomAD
+UPI0002064E1F,UPI0002064E1F.csv,MGEMWRKREEVIFREYSFSFYPLNIFAPFVLFPDYFCLLGFIWLSMNVLLFWKTFLLYNQGPEYHYLHQMLGLGLCLSRASASVLNLNCSLILLPMCRTLLAYLRGSQKVPSRRTRRLLDKSRTFHITCGVTICIFSGVHVAAHLVNALNFSVNYSEDFVELNAARYRDEDPRKLLFTTVPGLTGVCMVVVLFLMITASTYAIRVSNYDIFWYTHNLFFVFYMLLTLHVSGGLLKYQTNLDTHPPGCISLNRTSSQNISLPEYFSEHFHEPFPEGFSKPAEFTQHKFVKICMEEPRFQANFPQTWLWISGPLCLYCAERLYRYIRSNKPVTIISVMSHPSDVMEIRMVKENFKARPGQYITLHCPSVSALENHPFTLTMCPTETKATFGVHLKIVGDWTERFRDLLLPPSSQDSEILPFIQSRNYPKLYIDGPFGSPFEESLNYEVSLCVAGGIGVTPFASILNTLLDDWKPYKLRRLYFIWVCRDIQSFRWFADLLCMLHNKFWQENRPDYVNIQLYLSQTDGIQKIIGEKYHALNSRLFIGRPRWKLLFDEIAKYNRGKTVGVFCCGPNSLSKTLHKLSNQNNSYGTRFEYNKESFS,mutated_sequence,1.0,599.0,UPI0002064E1F.a2m,UPI0002064E1F.npy,gnomAD
+UPI0000132184,UPI0000132184.csv,MMGSVLPAEALVLKTGLKAPGLALAEVITSDILHSFLYGRWRNVLGEQLFEDKSHHASPKTAFTAEVLAQSFSGEVQKLSSLVLPAEVIIAQSSIPGEGLGIFSKTWIKAGTEMGPFTGRVIAPEHVDICKNNNLMWEVFNEDGTVRYFIDASQEDHRSWMTYIKCARNEQEQNLEVVQIGTSIFYKAIEMIPPDQELLVWYGNSHNTFLGIPGVPGLEEDQKKNKHEDFHPADSAAGPAGRMRCVICHRGFNSRSNLRSHMRIHTLDKPFVCRFCNRRFSQSSTLRNHVRLHTGERPYKCQVCQSAYSQLAGLRAHQKSARHRPPSTALQAHSPALPAPHAHAPALAAAAAAAAAAAAHHLPAMVL,mutated_sequence,1.0,367.0,UPI0000132184.a2m,UPI0000132184.npy,gnomAD
+UPI0000141565,UPI0000141565.csv,MARFGDEMPARYGGGGSGAAAGVVVGSGGGRGAGGSRQGGQPGAQRMYKQSMAQRARTMALYNPIPVRQNCLTVNRSLFLFSEDNVVRKYAKKITEWPPFEYMILATIIANCIVLALEQHLPDDDKTPMSERLDDTEPYFIGIFCFEAGIKIIALGFAFHKGSYLRNGWNVMDFVVVLTGILATVGTEFDLRTLRAVRVLRPLKLVSGIPSLQVVLKSIMKAMIPLLQIGLLLFFAILIFAIIGLEFYMGKFHTTCFEEGTDDIQGESPAPCGTEEPARTCPNGTKCQPYWEGPNNGITQFDNILFAVLTVFQCITMEGWTDLLYNSNDASGNTWNWLYFIPLIIIGSFFMLNLVLGVLSGEFAKERERVENRRAFLKLRRQQQIERELNGYMEWISKAEEVILAEDETDGEQRHPFDALRRTTIKKSKTDLLNPEEAEDQLADIASVGSPFARASIKSAKLENSTFFHKKERRMRFYIRRMVKTQAFYWTVLSLVALNTLCVAIVHYNQPEWLSDFLYYAEFIFLGLFMSEMFIKMYGLGTRPYFHSSFNCFDCGVIIGSIFEVIWAVIKPGTSFGISVLRALRLLRIFKVTKYWASLRNLVVSLLNSMKSIISLLFLLFLFIVVFALLGMQLFGGQFNFDEGTPPTNFDTFPAAIMTVFQILTGEDWNEVMYDGIKSQGGVQGGMVFSIYFIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEQEEEEAANQKLALQKAKEVAEVSPLSAANMSIAVKEQQKNQKPAKSVWEQRTSEMRKQNLLASREALYNEMDPDERWKAAYTRHLRPDMKTHLDRPLVVDPQENRNNNTNKSRAAEPTVDQRLGQQRAEDFLRKQARYHDRARDPSGSAGLDARRPWAGSQEAELSREGPYGRESDHHAREGSLEQPGFWEGEAERGKAGDPHRRHVHRQGGSRESRSGSPRTGADGEHRRHRAHRRPGEEGPEDKAERRARHREGSRPARGGEGEGEGPDGGERRRRHRHGAPATYEGDARREDKERRHRRRKENQGSGVPVSGPNLSTTRPIQQDLGRQDPPLAEDIDNMKNNKLATAESAAPHGSLGHAGLPQSPAKMGNSTDPGPMLAIPAMATNPQNAASRRTPNNPGNPSNPGPPKTPENSLIVTNPSGTQTNSAKTARKPDHTTVDIPPACPPPLNHTVVQVNKNANPDPLPKKEEEKKEEEEDDRGEDGPKPMPPYSSMFILSTTNPLRRLCHYILNLRYFEMCILMVIAMSSIALAAEDPVQPNAPRNNVLRYFDYVFTGVFTFEMVIKMIDLGLVLHQGAYFRDLWNILDFIVVSGALVAFAFTGNSKGKDINTIKSLRVLRVLRPLKTIKRLPKLKAVFDCVVNSLKNVFNILIVYMLFMFIFAVVAVQLFKGKFFHCTDESKEFEKDCRGKYLLYEKNEVKARDREWKKYEFHYDNVLWALLTLFTVSTGEGWPQVLKHSVDATFENQGPSPGYRMEMSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKMMEEYSLEKNERACIDFAISAKPLTRHMPQNKQSFQYRMWQFVVSPPFEYTIMAMIALNTIVLMMKFYGASVAYENALRVFNIVFTSLFSLECVLKVMAFGILNYFRDAWNIFDFVTVLGSITDILVTEFGNNFINLSFLRLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVFGNIGIDVEDEDSDEDEFQITEHNNFRTFFQALMLLFRSATGEAWHNIMLSCLSGKPCDKNSGILTRECGNEFAYFYFVSFIFLCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHLDEYVRVWAEYDPAAWGRMPYLDMYQMLRHMSPPLGLGKKCPARVAYKRLLRMDLPVADDNTVHFNSTLMALIRTALDIKIAKGGADKQQMDAELRKEMMAIWPNLSQKTLDLLVTPHKSTDLTVGKIYAAMMIMEYYRQSKAKKLQAMREEQDRTPLMFQRMEPPSPTQEGGPGQNALPSTQLDPGGALMAHESGLKESPSWVTQRAQEMFQKTGTWSPEQGPPTDMPNSQPNSQSVEMREMGRDGYSDSEHYLPMEGQGRAASMPRLPAENQRRRGRPRGNNLSTISDTSPMKRSASVLGPKARRLDDYSLERVPPEENQRHHQRRRDRSHRASERSLGRYTDVDTGLGTDLSMTTQSGDLPSKERDQERGRPKDRKHRQHHHHHHHHHHPPPPDKDRYAQERPDHGRARARDQRWSRSPSEGREHMAHRQGSSSVSGSPAPSTSGTSTPRRGRRQLPQTPSTPRPHVSYSPVIRKAGGSGPPQQQQQQQQQQQQQAVARPGRAATSGPRRYPGPTAEPLAGDRPPTGGHSSGRSPRMERRVPGPARSESPRACRHGGARWPASGPHVSEGPPGPRHHGYYRGSDYDEADGPGSGGGEEAMAGAYDAPPPVRHASSGATGRSPRTPRASGPACASPSRHGRRLPNGYYPAHGLARPRGPGSRKGLHEPYSESDDDWC,mutated_sequence,1.0,2506.0,UPI0000141565.a2m,UPI0000141565.npy,gnomAD
+UPI0001711D28,UPI0001711D28.csv,MGSRCRAWAWTRAWALAEFQARAEEGAAAAAAAAGGYPSTGRCRCSLRGMEGTAVAVFEILRFLIIHWKCDIDVSKGALLEGQLVISIEGLNSKHQANALHCVTTRWSLTLLPRPECSGAVSAHCNLHLPGSSDSHASVPRVAGITDAHHHAWLIMVASAGSLFGGMVLKKFLKEIQSILPGISAKLTWTSEEGSYSQDMTGVTPFQMIFEVDEKPRTLMTDCLVIKHFLRKIIMVHPKVRFHFSVKVNGILSTEIFGVENEPTLNLGNGIALLVDSQHYVRPNFGTIESHCSRIHPVLGHPVMLFIPEDVAGMDLLGELILTPAAALCPSPKVSSNQLNRISSVSIFLYGPLGLPLILSTWEQPMTTFFKDTSSLVDWKKYHLCMIPNLDLNLDRDLVLPDVSYQVESSEEDQSQTMDPQGQTLLLFLFVDFHSAFPVQQMEIWGVYTLLTTHLNAILVESHSVVQGSIQFTVDKVLEQHHQAAKAQQKLQASLSVAVNSIMSILTGSTRSSFRKMCLQTLQAADTQEFRTKLHKVFREITQHQFLHHCSCEVKQQLTLEKKDSAQGTEDAPDNSSLELLADTSGQAENKRLKRGSPRIEEMRALRSARAPSPSEAAPRRPEATAAPLTPRGREHREAHGRALAPGRASLGSRLEDVLWLQEVSNLSEWLSPSPGP,mutated_sequence,1.0,677.0,UPI0001711D28.a2m,UPI0001711D28.npy,gnomAD
+UPI000059D669,UPI000059D669.csv,MRAGACARRPVGKWGVEWPRRLTWGGASRLCPAVKDRYGDRDSSSDSSSESDSSDERVEFDPQQERDFYKTLSLLKKKDPRIYQKDATFYNRTASSSDSEEDPEALEKQKKVRPMYLKDYERKVILEKAGKYVDEENSDGETSNHRLQETSSQSYVEEQKQLKESFRAFVEDSEDEDGAGEGGSSLLQKRAKTRQEKAQEEADYIEWLKGQKEIRNPDSLKELTHLKEYWNDPELDEGERFLRDYILNKRYEEEEEEEEDEEEMEEEEGVHGPPVQLAVDDSSDEGELFLKKQEDFEQKYNFRFEEPDSASVKTYPRSIASSVRRKDERRKEKREETRERKKREKAKKQEELKQLKNLKRKEILAKLEKLRKVTGNEMLGLEEGDLEDDFDPAQHDQLMQKCFGDEYYGAVEEEKPQFEEEEGLEDDWNWDTWDGPEQEGDWSQQELHCEDPNFNMDADYDPSQPRKKKREAPLTGKKKRKSPFAAAVGQEKPVFEPGDKTFEEYLDEYYRLDYEDIIDDLPCRFKYRTVVPCDFGLSTEEILAADDKELNRWCSLKKTCMYRSEQEELRDKRAYSQKAQNSWKKRQVFKSLCREEAETPAEATGKPQRDEAGPQRQLPALDGSLMGPESPPAQEEEAPVSPHKKPAPQKRRRAKKARLLGPTVMLGGCEFSRQRLQAFGLNPKRLHFRQLGRQRRKQQGPKNSS,mutated_sequence,1.0,705.0,UPI000059D669.a2m,UPI000059D669.npy,gnomAD
+UPI000012DDC2,UPI000012DDC2.csv,MSQTQDYECRSHNVDLPESRIPGSNTRLEWVEIIEPRTRERMYANLVTGECVWDPPAGVRIKRTSENQWWELFDPNTSRFYYYNASTQRTVWHRPQGCDIIPLAKLQTLKQNTESPRASAESSPGRGSSVSREGSTSSSLEPEPDTEKAQELPARAGRPAAFGTVKEDSGSSSPPGVFLEKDYEIYRDYSADGQLLHYRTSSLRWNSGAKERMLIKVADREPSFLAAQGNGYAPDGPPGVRSRRPSGSQHSPSLQTFAPEADGTIFFPERRPSPFLKRAELPGSSSPLLAQPRKPSGDSQPSSPRYGYEPPLYEEPPVEYQAPIYDEPPMDVQFEAGGGYQAGSPQRSPGRKPRPFLQPNKQGPPSPCQQLVLTKQKCPERFLSLEYSPAGKEYVRQLVYVEQAGSSPKLRAGPRHKYAPNPGGGSYSLQPSPCLLRDQRLGVKSGDYSTMEGPELRHSQPPTPLPQAQEDAMSWSSQQDTLSSTGYSPGTRKRKSRKPSLCQATSATPTEGPGDLLVEQPLAEEQPPCGTSLAPVKRAEGEAEGARGAAEPFLAQARLAWEAQQAHFHMKQRSSWDSQQDGSGYESDGALPLPMPGPVVRAFSEDEALAQQENRHWRRGTFEKLGFPQILLEKSVSVQTNLASPEPYLHPSQSEDLAACAQFESSRQSRSGVPSSSCVFPTFTLRKPSSETDIENWASKHFNKHTQGLFRRKVSIANMLAWSSESIKKPMIVTSDRHVKKEACELFKLIQMYMGDRRAKADPLHVALEVATKGWSVQGLRDELYIQLCRQTTENFRLESLARGWELMAICLAFFPPTPKFHSYLEGYIYRHMDPVNDTKGVAISTYAKYCYHKLQKAALTGAKKGLKKPNVEEIRHAKNAVFSPSMFGSALQEVMGMQRERYPERQLPWVQTRLSEEVLALNGDQTEGIFRVPGDIDEVNALKLQVDQWKVPTGLEDPHVPASLLKLWYRELEEPLIPHEFYEQCIAHYDSPEAAVAVVHALPRINRMVLCYLIRFLQVFVQPANVAVTKMDVSNLAMVMAPNCLRCQSDDPRVIFENTRKEMSFLRVLIQHLDTSFMEGVL,mutated_sequence,1.0,1083.0,UPI000012DDC2.a2m,UPI000012DDC2.npy,gnomAD
+UPI000013E37F,UPI000013E37F.csv,MNRKWEAKLKQIEERASHYERKPLSSVYRPRLSKPEEPPSIWRLFHRQAQAFNFVKSCKEDVHVFALECKVGDGQRIYLVTTYAEFWFYYKSRKNLLHCYEVIPENAVCKLYFDLEFNKPANPGADGKKMVALLIEYVCKALQELYGVNCSAEDVLNLDSSTDEKFSQHLIFQLHDVAFKDNIHVGNFLRKILQPALDLLGSEDDDSAPETTGHGFPHFSEAPARQGFSFNKMFTEKATEESWTSNSKKLERLGSAEQSSPDLSFLVVKNNMGEKHLFVDLGVYTRNRNFRLYKSSKIGKRVALEVTEDNKFFPIQSKDVSDEYQYFLSSLVSNVRFSDTLRILTCEPSQNKQKGVGYFNSIGTSVETIEGFQCSPYPEVDHFVLSLVNKDGIKGGIRRWNYFFPEELLVYDICKYRWCENIGRAHKSNNIMILVDLKNEVWYQKCHDPVCKAENFKSDCFPLPAEVCLLFLFKEEEEFTTDEADETRSNETQNPHKPSPSRLSTGASADAVWDNGIDDAYFLEATEDAELAEAAENSLLSYNSEVDEIPDELIIEVLQE,mutated_sequence,1.0,560.0,UPI000013E37F.a2m,UPI000013E37F.npy,gnomAD
+UPI00000557D8,UPI00000557D8.csv,MPMPSRDGGLHPRHHHYGSHSPWSQLLSSPMETPSIKGLYYRRVRKVGALDASPVDLKKEILINVGGRRYLLPWSTLDRFPLSRLSKLRLCRSYEEIVQLCDDYDEDSQEFFFDRSPSAFGVIVSFLAAGKLVLLQEMCALSFQEELAYWGIEEAHLERCCLRKLLRKLEELEELAKLHREDVLRQQRETRRPASHSSRWGLCMNRLREMVENPQSGLPGKVFACLSILFVATTAVSLCVSTMPDLRAEEDQGECSRKCYYIFIVETICVAWFSLEFCLRFVQAQDKCQFFQGPLNIIDILAISPYYVSLAVSEEPPEDGERPSGSSYLEKVGLVLRVLRALRILYVMRLARHSLGLQTLGLTVRRCTREFGLLLLFLAVAITLFSPLVYVAEKESGRVLEFTSIPASYWWAIISMTTVGYGDMVPRSVPGQMVALSSILSGILIMAFPATSIFHTFSHSYLELKKEQEQLQARLRHLQNTGPASECELLDPHVASEHELMNDVNDLILEGPALPIMHM,mutated_sequence,1.0,519.0,UPI00000557D8.a2m,UPI00000557D8.npy,gnomAD
+UPI000007206C,UPI000007206C.csv,MKHLKRWWSAGGGLLHLTLLLSLAGLRVDLDLYLLLPPPTLLQDELLFLGGPASSAYALSPFSASGGWGRAGHLHPKGRELDPAAPPEGQLLREVRALGVPFVPRTSVDAWLVHSVAAGSADEAHGLLGAAAASSTGGAGASVDGGSQAVQGGGGDPRAARSGPLDAGEEEKAPAEPTAQVPDAGGCASEENGVLREKHEAVDHSSQHEENEERVSAQKENSLQQNDDDENKIAEKPDWEAEKTTESRNERHLNGTDTSFSLEDLFQLLSSQPENSLEGISLGDIPLPGSISDGMNSSAHYHVNFSQAISQDVNLHEAILLCPNNTFRRDPTARTSQSQEPFLQLNSHTTNPEQTLPGTNLTGFLSPVDNHMRNLTSQDLLYDLDINIFDEINLMSLATEDNFDPIDVSQLFDEPDSDSGLSLDSSHNNTSVIKSNSSHSVCDEGAIGYCTDHESSSHHDLEGAVGGYYPEPSKLCHLDQSDSDFHGDLTFQHVFHNHTYHLQPTAPESTSEPFPWPGKSQKIRSRYLEDTDRNLSRDEQRAKALHIPFSVDEIVGMPVDSFNSMLSRYYLTDLQVSLIRDIRRRGKNKVAAQNCRKRKLDIILNLEDDVCNLQAKKETLKREQAQCNKAINIMKQKLHDLYHDIFSRLRDDQGRPVNPNHYALQCTHDGSILIVPKELVASGHKKETQKGKRK,mutated_sequence,1.0,694.0,UPI000007206C.a2m,UPI000007206C.npy,gnomAD
+UPI00000723B9,UPI00000723B9.csv,MPSAKQRGSKGGHGAASPSEKGAHPSGGADDVAKKPPPAPQQPPPPPAPHPQQHPQQHPQNQAHGKGGHRGGGGGGGKSSSSSSASAAAAAAAASSSASCSRRLGRALNFLFYLALVAAAAFSGWCVHHVLEEVQQVRRSHQDFSRQREELGQGLQGVEQKVQSLQATFGTFESILRSSQHKQDLTEKAVKQGESEVSRISEVLQKLQNEILKDLSDGIHVVKDARERDFTSLENTVEERLTELTKSINDNIAIFTEVQKRSQKEINDMKAKVASLEESEGNKQDLKALKEAVKEIQTSAKSREWDMEALRSTLQTMESDIYTEVRELVSLKQEQQAFKEAADTERLALQALTEKLLRSEESVSRLPEEIRRLEEELRQLKSDSHGPKEDGGFRHSEAFEALQQKSQGLDSRLQHVEDGVLSMQVASARQTESLESLLSKSQEHEQRLAALQGRLEGLGSSEADQDGLASTVRSLGETQLVLYGDVEELKRSVGELPSTVESLQKVQEQVHTLLSQDQAQAARLPPQDFLDRLSSLDNLKASVSQVEADLKMLRTAVDSLVAYSVKIETNENNLESAKGLLDDLRNDLDRLFVKVEKIHEKV,mutated_sequence,1.0,602.0,UPI00000723B9.a2m,UPI00000723B9.npy,gnomAD
+UPI000013C350,UPI000013C350.csv,MMKSQGLVSFKDVAVDFTQEEWQQLDPSQRTLYRDVMLENYSHLVSMGYPVSKPDVISKLEQGEEPWIIKGDISNWIYPDEYQADGRQDRKSNLHNSQSCILGTVSFHHKILKGVTRDGSLCSILKVCQGDGQLQRFLENQDKLFRQVTFVNSKTVTEASGHKYNPLGKIFQECIETDISIQRFHKYDAFKKNLKPNIDLPSCYKSNSRKKPDQSFGGGKSSSQSEPNSNLEKIHNGVIPFDDNQCGNVFRNTQSLIQYQNVETKEKSCVCVTCGKAFAKKSQLIVHQRIHTGKKPYDCGACGKAFSEKFHLVVHQRTHTGEKPYDCSECGKAFSQKSSLIIHQRVHTGEKPYECSECGKAFSQKSPLIIHQRIHTGEKPYECRECGKAFSQKSQLIIHHRAHTGEKPYECTECGKAFCEKSHLIIHKRIHTGEKPYKCAQCEEAFSRKTELITHQLVHTGEKPYECTECGKTFSRKSQLIIHQRTHTGEKPYKCSECGKAFCQKSHLIGHQRIHTGEKPYICTECGKAFSQKSHLPGHQRIHTGEKPYICAECGKAFSQKSDLVLHQRIHTGERPYQCAICGKAFIQKSQLTVHQRIHTVVKS,mutated_sequence,1.0,604.0,UPI000013C350.a2m,UPI000013C350.npy,gnomAD
+UPI00001301E8,UPI00001301E8.csv,MMLPSPVTSTPFSVKDILNLEQQHQHFHGAHLQADLEHHFHSAPCMLAAAEGTQFSDGGEEDEEDEGEKLSYLNSLAAADGHGDSGLCPQGYVHTVLRDSCSEPKEHEEEPEVVRDRSQKSCQLKKSLETAGDCKAAEESERPKPRSRRKPRVLFSQAQVFELERRFKQQRYLSAPEREHLASSLKLTSTQVKIWFQNRRYKCKRQRQDKSLELGAHAPPPPPRRVAVPVLVRDGKPCVTPSAQAYGAPYSVGASAYSYNSFPAYGYGNSAAAAAAAAAAAAAAAAYSSSYGCAYPAGGGGGGGGTSAATTAMQPACSAAGGGPFVNVSNLGGFGSGGSAQPLHQGTAAGAACAQGTLQGIRAW,mutated_sequence,1.0,364.0,UPI00001301E8.a2m,UPI00001301E8.npy,gnomAD
+UPI0000198666,UPI0000198666.csv,MLSLREQQLQVWFKNRRAKLARERRLQQQPQRVPGQRGRGARAAPLVPAASASAPQRGPSGILPAAEPTICSLHQAWGGPGCRAQKGIPAALSPGPGPIPAPIPGPAQIPGPLPGSIPGPIPGPAQIPSPIPAPIPGPISGPVQIPGPFRGPIPGPISGPAPIPGPISGPFSGPNPGPIPGPNPGPIPGPISGPIPGPISVPIPGPIPGPISGPISGPNPGPIPGPIPGPISGPNPGPIPGPISGPNPGLIPGPIPGPISGPGPIIGPIPSPAQIPGPGRLQGPGPILSPGRMRSPGSLPGLAPILGPGSGPGSGSVPAPIPGPGSLPAPAPLWPQSPDASDFLPDTQLFPHFTELLLPLDPLEGSSVSTMTSQYQEGDDSMGKKHSGSQPQEEGGSVNENHSGPRLLLDL,mutated_sequence,1.0,411.0,UPI0000198666.a2m,UPI0000198666.npy,gnomAD
+UPI000013E667,UPI000013E667.csv,MHLALTTVLLWAWGLQAFEIVEKENIFQRTPCPAFLMFENAAYLADMSFELPCHCKPEEVPAVVWFYQKHLGSSHTKVLTDFDGRVLTEAAQVRVGSDMLTRFSIRMFSLLVFRAQSEDSGLYFCGTRKGDYFYAYDVDIQNSEGMVATFQDKGQEPFADEYYGHLHVFTTFWEWTPCDRCGVRGEQWRIGLCYLQSPDLSPRYLKAVPDVVSCGSRAVPRKLRTKARDHTPEVLVRSCLVPCEKTKTIREGVLAIINYVSKVGSRPWVPQVPIQFHQQRLGHGLIISCPGARPEHAVAWDKDRQHLYRTQYLKGVNRSMRVFIDHGNQLHIRFTQLDDRGIYYCWRQGVLVAGFRLGVTSHGHYPASFSDPETRSAVELTLIGYLLITAVFVTIHFCRCCCYLFHCCPSFSP,mutated_sequence,1.0,413.0,UPI000013E667.a2m,UPI000013E667.npy,gnomAD
+UPI0000212176,UPI0000212176.csv,MECPEGQLPISSENDSTPTVSTSEVTSQQEPQILVDRGSETTYESSADIAGDEGTQIPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRNLARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTADISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRKTQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGFLISSGMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEEDCSEEYLERQLQAEFIESGQYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK,mutated_sequence,1.0,462.0,UPI0000212176.a2m,UPI0000212176.npy,gnomAD
+UPI00001615E1,UPI00001615E1.csv,MGTVSSRRSWWPLPLLLLLLLLLGPAGARAQEDEDGDYEELVLALRSEEDGLAEAPEHGTTATFHRCAKDPWRLPGTYVVVLKEETHLSQSERTARRLQAQAARRGYLTKILHVFHGLLPGFLVKMSGDLLELALKLPHVDYIEEDSSVFAQSIPWNLERITPPRYRADEYQPPDGGSLVEVYLLDTSIQSDHREIEGRVMVTDFENVPEEDGTRFHRQASKCDSHGTHLAGVVSGRDAGVAKGASMRSLRVLNCQGKGTVSGTLIGLEFIRKSQLVQPVGPLVVLLPLAGGYSRVLNAACQRLARAGVVLVTAAGNFRDDACLYSPASAPEVITVGATNAQDQPVTLGTLGTNFGRCVDLFAPGEDIIGASSDCSTCFVSQSGTSQAAAHVAGIAAMMLSAEPELTLAELRQRLIHFSAKDVINEAWFPEDQRVLTPNLVAALPPSTHGAGWQLFCRTVWSAHSGPTRMATAVARCAPDEELLSCSSFSRSGKRRGERMEAQGGKLVCRAHNAFGGEGVYAIARCCLLPQANCSVHTAPPAEASMGTRVHCHQQGHVLTGCSSHWEVEDLGTHKPPVLRPRGQPNQCVGHREASIHASCCHAPGLECKVKEHGIPAPQEQVTVACEEGWTLTGCSALPGTSHVLGAYAVDNTCVVRSRDVSTTGSTSEGAVTAVAICCRSRHLAQASQELQ,mutated_sequence,1.0,692.0,UPI00001615E1.a2m,UPI00001615E1.npy,gnomAD
+UPI0000197DF0,UPI0000197DF0.csv,MKEGRGSFSVERGPRKERETAQSGMWKGNSPAGSQGAAMEGTGGELGGQGNWGPEDAPGLLARASLIMLPWPLPLASSALTLLFGALTSLFLWYCYRLGSQDMQALGAGSRAGGVRGGPVGCSEAGGPSPGGPGDPGEGPRTEGLVSRRLRAYARRYSWAGMGRVRRAAQGGPGPGRGPGVLGIQRPGLLFLPDLPSAPFVPRDAQRHDVELLESSFPAILRDFGAVSWDFSGTTPPPRGWSPPLAPGCYQLLLYQAGRCQPSNCRRCPGAYRALRGLRSFMSANTFGNAGFSVLLPGARLEGRCGPTNARVRCHLGLKIPPGCELVVGGEPQCWAEGHCLLVDDSFLHTVAHNGSPEDGPRVVFIVDLWHPNVAGAERQALDFVFAPDP,mutated_sequence,1.0,390.0,UPI0000197DF0.a2m,UPI0000197DF0.npy,gnomAD
+UPI00003519AE,UPI00003519AE.csv,MEDAGAAGPGPEPEPEPEPEPEPAPEPEPEPKPGAGTSEAFSRLWTDVMGILDGSLGNIDDLAQQYADYYNTCFSDVCERMEELRKRRVSQDLEVEKPDASPTSLQLRSQIEESLGFCSAVSTPEVERKNPLHKSNSEDSSVGKGDWKKKNKYFWQNFRKNQKGIMRQTSKGEDVGYVASEITMSDEERIQLMMMVKEKMITIEEALARLKEYEAQHRQSAALDPADWPDGSYPTFDGSSNCNSREQSDDETEESVKFKRLHKLVNSTRRVRKKLIRVEEMKKPSTEGGEEHVFENSPVLDERSALYSGVHKKPLFFDGSPEKPPEDDSDSLTTSPSSSSLDTWGAGRKLVKTFSKGESRGLIKPPKKMGTFFSYPEEEKAQKVSRSLTEGEMKKGLGSLSHGRTCSFGGFDLTNRSLHVGSNNSDPMGKEGDFVYKEVIKSPTASRISLGKKVKSVKETMRKRMSKKYSSSVSEQDSGLDGMPGSPPPSQPDPEHLDKPKLKAGGSVESLRSSLSGQSSMSGQTVSTTDSSTSNRESVKSEDGDDEEPPYRGPFCGRARVHTDFTPSPYDTDSLKLKKGDIIDIISKPPMGTWMGLLNNKVGTFKFIYVDVLSEDEEKPKRPTRRRRKGRPPQPKSVEDLLDRINLKEHMPTFLFNGYEDLDTFKLLEEEDLDELNIRDPEHRAVLLTAVELLQEYDSNSDQSGSQEKLLVDSQGLSGCSPRDSGCYESSENLENGKTRKASLLSAKSSTEPSLKSFSRNQLGNYPTLPLMKSGDALKQGQEEGRLGGGLAPDTSKSCDPPGVTGLNKNRRSLPVSICRSCETLEGPQTVDTWPRSHSLDDLQVEPGAEQDVPTEVTEPPPQIVPEVPQKTTASSTKAQPLEQDSAVDNALLLTQSKRFSEPQKLTTKKLEGSIAASGRGLSPPQCLPRNYDAQPPGAKHGLARTPLEGHRKGHEFEGTHHPLGTKEGVDAEQRMQPKIPSQPPPVPAKKSRERLANGLHPVPMGPSGALPSPDAPCLPVKRGSPASPTSPSDCPPALAPRPLSGQAPGSPPSTRPPPWLSELPENTSLQEHGVKLGPALTRKVSCARGVDLETLTENKLHAEGIDLTEEPYSDKHGRCGIPEALVQRYAEDLDQPERDVAANMDQIRVKQLRKQHRMAIPSGGLTEICRKPVSPGCISSVSDWLISIGLPMYAGTLSTAGFSTLSQVPSLSHTCLQEAGITEERHIRKLLSAARLFKLPPGPEAM,mutated_sequence,1.0,1247.0,UPI00003519AE.a2m,UPI00003519AE.npy,gnomAD
+UPI00001279CC,UPI00001279CC.csv,MAQALLVPPGPESFRLFTRESLAAIEKRAAEEKAKKPKKEQDNDDENKPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVMNKGKAIFRFSATSALYILTPLNPVRKIAIKILVHSLFSMLIMCTILTNCVFMTLSNPPDWTKNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFSVIVMAYVTEFVSLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPSDSAFETNTTSYFNGTMDSNGTFVNVTMSTFNWKDYIGDDSHFYVLDGQKDPLLCGNGSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDYWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLVNLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAVAAASAASRDFSGIGGLGELLESSSEASKLSSKSAKEWRNRRKKRRQREHLEGNNKGERDSFPKSESEDSVKRSSFLFSMDGNRLTSDKKFCSPHQSLLSIRGSLFSPRRNSKTSIFSFRGRAKDVGSENDFADDEHSTFEDSESRRDSLFVPHRHGERRNSNVSQASMSSRMVPGLPANGKMHSTVDCNGVVSLVGGPSALTSPTGQLPPEGTTTETEVRKRRLSSYQISMEMLEDSSGRQRAVSIASILTNTMEELEESRQKCPPCWYRFANVFLIWDCCDAWLKVKHLVNLIVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLTVGNLVFTGIFTAEMVLKIIAMDPYYYFQEGWNIFDGIIVSLSLMELGLSNVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLIVFMLVMVIGNLVVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDYVKNKMRECFQKAFFRKPKVIEIHEGNKIDSCMSNNTGIEISKELNYLRDGNGTTSGVGTGSSVEKYVIDENDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESELEESKEKLNATSSSEGSTVDVVLPREGEQAETEPEEDLKPEACFTEGCIKKFPFCQVSTEEGKGKIWWNLRKTCYSIVEHNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQTYFTNAWCWLDFLIVDVSLVSLVANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCVNMTTGNMFDISDVNNLSDCQALGKQARWKNVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRDVKLQPVYEENLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPANKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQGKYMTLVLSRINLVFIVLFTGEFVLKLVSLRHYYFTIGWNIFDFVVVILSIVGMFLAEMIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEAGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSAPPDCDPDTIHPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEDRFMASNPSKVSYEPITTTLKRKQEEVSAAIIQRNFRCYLLKQRLKNISSNYNKEAIKGRIDLPIKQDMIIDKLNGNSTPEKTDGSSSTTSPPSYDSVTKPDKEKFEKDKPEKESKGKEVRENQK,mutated_sequence,1.0,2000.0,UPI00001279CC.a2m,UPI00001279CC.npy,gnomAD
+UPI000006F3AC,UPI000006F3AC.csv,MPGANYRAGAGAGAGARRPRGARDREEDGGGLEPAAVARDLLRGTSNMSFEELLELQSQVGTKTYKQLVAGNSPKKQASRPPIQNACVADKHRPLEMSAKIRVPFLRQVVPISKKVARDPRFDDLSGEYNPEVFDKTYQFLNDIRAKEKELVKKQLKKHLSGEEHEKLQQLLQRMEQQEMAQQERKQQQELHLALKQERRAQAQQGHRPYFLKKSEQRQLALAEKFKELKRSKKLENFLSRKRRRNAGKDRRHLPLSKE,mutated_sequence,1.0,259.0,UPI000006F3AC.a2m,UPI000006F3AC.npy,gnomAD
+UPI000040C55F,UPI000040C55F.csv,MEENPTLESEAWGSSRGWLAPREARGAPCSSPGPSLSSVLNELPSAATLRYRDPGVLPWGALEEEEEDGGRSRKAFTEVTQTELQDPHPSRELPWPMQARRAHRQRNASRDQVVYGSGTKTDRWARLLRRSKEKTKEGLRSLQPWAWTLKRIGGQFGAGTESYFSLLRFLLLLNVLASVLMACMTLLPTWLGGAPPGPPGPDISSPCGSYNPHSQGLVTFATQLFNLLSGEGYLEWSPLFYGFYPPRPRLAVTYLCWAFAVGLICLLLILHRSVSGLKQTLLAESEALTSYSHRVFSAWDFGLCGDVHVRLRQRIILYELKVELEETVVRRQAAVRTLGQQARVWLVRVLLNLLVVALLGAAFYGVYWATGCTVELQEMPLVQELPLLKLGVNYLPSIFIAGVNFVLPPVFKLIAPLEGYTRSRQIVFILLRTVFLRLASLVVLLFSLWNQITCGGDSEAEDCKTCGYNYKQLPCWETVLGQEMYKLLLFDLLTVLAVALLIQFPRKLLCGLCPGALGRLAGTQEFQVPDEVLGLIYAQTVVWVGSFFCPLLPLLNTVKFLLLFYLKKLTLFSTCSPAARTFRASAANFFFPLVLLLGLAISSVPLLYSIFLIPPSKLCGPFRGQSSIWAQIPESISSLPETTQNFLFFLGTQAFAVPLLLISSILMAYTVALANSYGRLISELKRQRQTEAQNKVFLARRAVALTSTKPAL,mutated_sequence,1.0,712.0,UPI000040C55F.a2m,UPI000040C55F.npy,gnomAD
+UPI00001AE729,UPI00001AE729.csv,MDPPAGAARRLLCPALLLLLLLLPPPLLPPPPPPANARLAAAADPPGGPLGHGAERILAVPVRTDAQGRLVSHVVSAATSRAGVRARRAAPVRTPSFPGGNEEEPGSHLFYNVTVFGRDLHLRLRPNARLVAPGATMEWQGEKGTTRVEPLLGSCLYVGDVAGLAEASSVALSNCDGLAGLIRMEEEEFFIEPLEKGLAAQEAEQGRVHVVYRRPPTSPPLGGPQALDTGASLDSLDSLSRALGVLEEHANSSRRRARRHAADDDYNIEVLLGVDDSVVQFHGKEHVQKYLLTLMNIVNEIYHDESLGAHINVVLVRIILLSYGKSMSLIEIGNPSQSLENVCRWAYLQQKPDTGHDEYHDHAIFLTRQDFGPSGMQGYAPVTGMCHPVRSCTLNHEDGFSSAFVVAHETGHVLGMEHDGQGNRCGDEVRLGSIMAPLVQAAFHRFHWSRCSQQELSRYLHSYDCLLDDPFAHDWPALPQLPGLHYSMNEQCRFDFGLGYMMCTAFRTFDPCKQLWCSHPDNPYFCKTKKGPPLDGTMCAPGKHCFKGHCIWLTPDILKRDGSWGAWSPFGSCSRTCGTGVKFRTRQCDNPHPANGGRTCSGLAYDFQLCSRQDCPDSLADFREEQCRQWDLYFEHGDAQHHWLPHEHRDAKERCHLYCESRETGEVVSMKRMVHDGTRCSYKDAFSLCVRGDCRKVGCDGVIGSSKQEDKCGVCGGDNSHCKVVKGTFTRSPKKHGYIKMFEIPAGARHLLIQEVDATSHHLAVKNLETGKFILNEENDVDASSKTFIAMGVEWEYRDEDGRETLQTMGPLHGTITVLVIPVGDTRVSLTYKYMIHEDSLNVDDNNVLEEDSVVYEWALKKWSPCSKPCGGGSQFTKYGCRRRLDHKMVHRGFCAALSKPKAIRRACNPQECSQPVWVTGEWEPCSQTCGRTGMQVRSVRCIQPLHDNTTRSVHAKHCNDARPESRRACSRELCPGRWRAGPWSQCSVTCGNGTQERPVLCRTADDSFGICQEERPETARTCRLGPCPRNISDPSKKSYVVQWLSRPDPDSPIRKISSKGHCQGDKSIFCRMEVLSRYCSIPGYNKLCCKSCNLYNNLTNVEGRIEPPPGKHNDIDVFMPTLPVPTVAMEVRPSPSTPLEVPLNASSTNATEDHPETNAVDEPYKIHGLEDEVQPPNLIPRRPSPYEKTRNQRIQELIDEMRKKEMLGKF,mutated_sequence,1.0,1211.0,UPI00001AE729.a2m,UPI00001AE729.npy,gnomAD
+UPI0000DA182F,UPI0000DA182F.csv,MRGPEPGPQPTMEGDVLDTLEALGYKGPLLEEQALTKAAEGGLSSPEFSELCIWLGSQIKSLCNLEESITSAGRDDLESFQLEISGFLKEMACPYSVLISGDIKDRLKKKEDCLKLLLFLSTELQASQILQNKKHKNSQLDKNSEVYQEVQAMFDTLGIPKSTTSDIPHMLNQVESKVKDILSKVQKNHVGKPLLKMDLNSEQAEQLERINDALSCEYECRRRMLMKRLDVTVQSFGWSDRAKVKTDDIARIYQPKRYALSPKTTITMAHLLAAREDLSKIIRTSSGTSREKTACAINKVLMGRVPDRGGRPNEIEPPPPEMPPWQKRQEGGGGRGGWGGGGGGGGRGGGGGGGGRGGWGGGGGGWGGGGGGGGGWGGGGGGGRGGFQGRGDYGGRGGYGGRGGYGGRGYGDPYGGGGGGGGGGGGGGGYRRY,mutated_sequence,1.0,433.0,UPI0000DA182F.a2m,UPI0000DA182F.npy,gnomAD
+UPI0000E5ADF3,UPI0000E5ADF3.csv,MWKLGRGRVLLDEPPEEEDGLRGGPPPAAAAAAQAQVQGASFRGWKEVTSLFNKDDEQHLLERCKSPKSKGTNLRLKEELKAEKKSGFWDNLVLKQNIQSKKPDEIEGWEPPKLALEDISADPEDTVGGHPSWSGWEDDAKGSTKYTSLASSANSSRWSLRAAGRLVSIRRQSKGHLTDSPEEAE,mutated_sequence,1.0,185.0,UPI0000E5ADF3.a2m,UPI0000E5ADF3.npy,gnomAD
+UPI0001D0434B,UPI0001D0434B.csv,MRDFGFGVLQTAPLRSSSPGPLFCGEAYGPYAVGSVNPLPSATPFGPLSPPPLPVTGFLEAASPFSVPLGGGAGSPAAAASSSSPFLAHQQTMQDELLLGLTQQPARPLSGAAATEKLPDHHPGGGTIAGVTHLLPSQDFKPSLHHPSSSSASSCCCCRTSSPQDFSKRQQQQLSSQKRKEFSPPHLPHPPDSKPPPPPPPLHCPGRFSPPPPPAGPLLQPAQLAQRQQQQPPQQFSLLHQQHLSPQDFAPRQRPADLPPLPQLPPSPPAAPRRRHGGAGSPRKTPAAGEGSAAESPNAGLASSTPVNPAPGSMESPNHPLLNSPSNLLPGGALGAGAFSSLQSPDLPHPGGGGGGGGGGPPGGGGGGGSASPPPLPGFGTPWSVQTASPPPQPQQPPPTQPQQQPPPPQQPPQPQPQPPGSSATTPGGGSGGSLSAMPPPSPDSENGFYPGLPSSMNPAFFPSFSPVSPHGCTGLSVPTSGGGGGGFGGPFSATAVPPPPPPAMNIPQQQPPPPAAPQQPQSRRSPVSPQLQQQHQAAAAAFLQQRNSYNHHQPLLKQSPWSNHQSSGWGTGSMSWGAMHGRDHRRTGNMGIPGTMNQISPLKKPFSGNVIAPPKFTRSTPSLTPKSWIEDNVFRTDNNSNTLLPLQVRSSLQLPAWGSDSLQDSWCTAAGTSRIDQDRSRMYDSLNMHSLENSLIDIMRAEHDPLKGRLSYPHPGTDNLLMLNARSYGRRRGRSSLFPIDDGLLDDGHSDQVGVLNSPTCYSAHQNGERIERFSRKVFVGGLPPDIDEDEITASFRRFGPLVVDWPHKAESKSYFPPKGYAFLLFQEESSVQALIDACIEEDGKLYLCVSSPTIKDKPVQIRPWNLSDSDFVMDGSQPLDPRKTIFVGGVPRPLRAVELAMIMDRLYGGVCYAGIDTDPELKYPKGAGRVAFSNQQSYIAAISARFVQLQHGDIDKRVEVKPYVLDDQMCDECQGARCGGKFAPFFCANVTCLQYYCEFCWANIHSRAGREFHKPLVKEGADRPRQIHFRWN,mutated_sequence,1.0,1034.0,UPI0001D0434B.a2m,UPI0001D0434B.npy,gnomAD
+UPI0001662AC1,UPI0001662AC1.csv,MAESSLYRQRLEVIAEKRRLQEEIRAARREVEEEKLRVERLKRKSLRERWLMDGAAAVPEPSEDPTSKDPQSPEGQAQARIRNLEDSLFTLQSQLQLLQSASTGAQHKPSGRPSWRRQGHRPLSQSIVEAGSVGQTDLNKRASLPAGLVGTPPESPSEPREDVLGFLPGPRQVPGAAGDSSEANGPCPSPIPTPEQGLSQRAVPSEGRVGEAKGGGVVSVVWEGLRATEDCATGATGPELEAKVEEVVLEAIGDRKGAGSLELPAWVKEDRGIVEVVWEGVGGSDAEAMGEIGRVPEVVQTSSPRLQERLEAAASIEGEDVPQGSPEGDGQGGSGGEEGSFIWVERVTLSEEWEELLVEGLEGPEVAGRERGDESPLGAEGAKTGGGEETWEAEKRKAEESMGIGSEEKPGTGRDEAEMSPVVERKGGEKKLELESRGSAEKLGTEREGGEEPLGIERKVEGHLRAEKEGDEEKRGAEEEEVEEPLGVEKKGGEEEPEATKEPLEAERKGGEETLEAEKRGGEESLETEKTQGTEGDLNLEQGSREGSESQAEEMNEAGPPLEANTETRPEKEGPQPQEKPVGALEEEGVKPQTAAEGQGPLGDATPLLAETPAPEQPAECQPLLQGEGPSANPSAHPVPTYAPARQPEPSAPTEGEEASGPKQKTCQCCAVM,mutated_sequence,1.0,673.0,UPI0001662AC1.a2m,UPI0001662AC1.npy,gnomAD
+UPI000011785A,UPI000011785A.csv,MAEEQPQVELFVKAGSDGAKIGNCPFSQRLFMVLWLKGVTFNVTTVDTKRRTETVQKLCPGGQLPFLLYGTEVHTDTNKIEEFLEAVLCPPRYPKLAALNPESNTAGLDIFAKFSAYIKNSNPALNDNLEKGLLKALKVLDNYLTSPLPEEVDETSAEDEGVSQRKFLDGNELTLADCNLLPKLHIVQVVCKKYRGFTIPEAFRGVHRYLSNAYAREEFASTCPDDEEIELAYEQVAKALK,mutated_sequence,1.0,241.0,UPI000011785A.a2m,UPI000011785A.npy,gnomAD
+UPI0001837EA2,UPI0001837EA2.csv,MAERGQQPPPAKRLCCRPGGGGGGGGSSGGGGGAGGGYSSACRPGPRAGGAAAAAACGGGAALGLLPPGKTQSPESLLDIAARRVAEKWPFQRVEERFERIPEPVQRRIVYWSFPRSEREICMYSSFNTGGGAAGGPGDDSGGGGGAGGGGGGGSSSSPAATSAAATSAAAAAAAAAAAAAAAAGAGAPSVGAAGAADGGDETRLPFRRGIALLESGCVDNVLQVGFHLSGTVTEPAIQSEPETVCNVAISFDRCKITSVTCSCGNKDIFYCAHVVALSLYRIRKPDQVKLHLPISETLFQMNRDQLQKFVQYLITVHHTEVLPTAQKLADEILSQNSEINQVHGAPDPTAGASIDDENCWHLDEEQVQEQVKLFLSQGGYHGSGKQLNLLFAKVREMLKMRDSNGARMLTLITEQFMADPRLSLWRQQGTAMTDKYRQLWDELGALWMCIVLNPHCKLEQKASWLKQLKKWNSVDVCPWEDGNHGSELPNLTNALPQGANANQDSSNRPHRTVFTRAIEACDLHWQDSHLQHIISSDLYTNYCYHDDTENSLFDSRGWPLWHEHVPTACARVDALRSHGYPREALRLAIAIVNTLRRQQQKQLEMFRTQKKELPHKNITSITNLEGWVGHPLDPVGTLFSSLMEACRIDDENLSGFSDFTENMGQCKSLEYQHLPAHKFLEEGESYLTLAVEVALIGLGQQRIMPDGLYTQEKVCRNEEQLISKLQEIELDDTLVKIFRKQAVFLLEAGPYSGLGEIIHRESVPMHTFAKYLFTSLLPHDAELAYKIALRAMRLLVLESTAPSGDLTRPHHIASVVPNRYPRWFTLSHIESQQCELASTMLTAAKGDVRRLETVLESIQKNIHSSSHIFKLAQDAFKIATLMDSLPDITLLKVSLELGLQVMRMTLSTLNWRRREMVRWLVTCATEVGVYALDSIMQTWFTLFTPTEATSIVATTVMSNSTIVRLHLDCHQQEKLASSARTLALQCAMKDPQNCALSALTLCEKDHIAFETAYQIVLDAATTGMSYTQLFTIARYMEHRGYPMRAYKLATLAMTHLNLSYNQDTHPAINDVLWACALSHSLGKNELAAIIPLVVKSVKCATVLSDILRRCTLTTPGMVGLHGRRNSGKLMSLDKAPLRQLLDATIGAYINTTHSRLTHISPRHYSEFIEFLSKARETFLMAHDGHIQFTQFIDNLKQIYKGKKKLMMLVRERFG,mutated_sequence,1.0,1215.0,UPI0001837EA2.a2m,UPI0001837EA2.npy,gnomAD
+UPI00001320B0,UPI00001320B0.csv,MEMEQEKMTMNKELSPDAAAYCCSACHGDETWSYNHPIRGRAKSRSLSASPALGSTKEFRRTRSLHGPCPVTTFGPKACVLQNPQTIMHIQDPASQRLTWNKSPKSVLVIKKMRDASLLQPFKELCTHLMEENMIVYVEKKVLEDPAIASDESFGAVKKKFCTFREDYDDISNQIDFIICLGGDGTLLYASSLFQGSVPPVMAFHLGSLGFLTPFSFENFQSQVTQVIEGNAAVVLRSRLKVRVVKELRGKKTAVHNGLGENGSQAAGLDMDVGKQAMQYQVLNEVVIDRGPSSYLSNVDVYLDGHLITTVQGDGVIVSTPTGSTAYAAAAGASMIHPNVPAIMITPICPHSLSFRPIVVPAGVELKIMLSPEARNTAWVSFDGRKRQEIRHGDSISITTSCYPLPSICVRDPVSDWFESLAQCLHWNVRKKQAHFEEEEEEEEEG,mutated_sequence,1.0,446.0,UPI00001320B0.a2m,UPI00001320B0.npy,gnomAD
+UPI0000052A8C,UPI0000052A8C.csv,MVVLRSSLELHNHSAASATGSLDLSSDFLSLEHIGRRRLRSAGAAQKKPAATTAKAGDGSSVKEVETYHRTRALRSLRKDAQNSSDSSFEKNVEITEQLANGRHFTRQLARQQADKKKEEHREDKVIPVTRSLRARNIVQSTEHLHEDNGDVEVRRSCRIRSRYSGVNQSMLFDKLITNTAEAVLQKMDDMKKMRRQRMRELEDLGVFNETEESNLNMYTRGKQKDIQRTDEETTDNQEGSVESSEEGEDQEHEDDGEDEDDEDDDDDDDDDDDDDDEDDEDEEDGEEENQKRYYLRQRKATVYYQAPLEKPRHQRKPNIFYSGPASPARPRYRLSSAGPRSPYCKRMNRRRHAIHSSDSTSSSSSEDEQHFERRRKRSRNRAINRCLPLNFRKDELKGIYKDRMKIGASLADVDPMQLDSSVRFDSVGGLSNHIAALKEMVVFPLLYPEVFEKFKIQPPRGCLFYGPPGTGKTLVARALANECSQGDKRVAFFMRKGADCLSKWVGESERQLRLLFDQAYQMRPSIIFFDEIDGLAPVRSSRQDQIHSSIVSTLLALMDGLDSRGEIVVIGATNRLDSIDPALRRPGRFDREFLFSLPDKEARKEILKIHTRDWNPKPLDTFLEELAENCVGYCGADIKSICAEAALCALRRRYPQIYTTSEKLQLDLSSINISAKDFEVAMQKMIPASQRAVTSPGQALSTVVKPLLQNTVDKILEALQRVFPHAEFRTNKTLDSDISCPLLESDLAYSDDDVPSVYENGLSQKSSHKAKDNFNFLHLNRNACYQPMSFRPRILIVGEPGFGQGSHLAPAVIHALEKFTVYTLDIPVLFGVSTTSPEETCAQVIREAKRTAPSIVYVPHIHVWWEIVGPTLKATFTTLLQNIPSFAPVLLLATSDKPHSALPEEVQELFIRDYGEIFNVQLPDKEERTKFFEDLILKQAAKPPISKKKAVLQALEVLPVAPPPEPRSLTAEEVKRLEEQEEDTFRELRIFLRNVTHRLAIDKRFRVFTKPVDPDEVPDYVTVIKQPMDLSSVISKIDLHKYLTVKDYLRDIDLICSNALEYNPDRDPGDRLIRHRACALRDTAYAIIKEELDEDFEQLCEEIQESRKKRGCSSSKYAPSYYHVMPKQNSTLVGDKRSDPEQNEKLKTPSTPVACSTPAQLKRKIRKKSNWYLGTIKKRRKISQAKDDSQNAIDHKIESDTEETQDTSVDHNETGNTGESSVEENEKQQNASESKLELRNNSNTCNIENELEDSRKTTACTELRDKIACNGDASSSQIIHISDENEGKEMCVLRMTRARRSQVEQQQLITVEKALAILSQPTPSLVVDHERLKNLLKTVVKKSQNYNIFQLENLYAVISQCIYRHRKDHDKTSLIQKMEQEVENFSCSR,mutated_sequence,1.0,1390.0,UPI0000052A8C.a2m,UPI0000052A8C.npy,gnomAD
+UPI00021CF3CC,UPI00021CF3CC.csv,RRLCPLHTHTHTHTHTHMHAHTRTHTKLSCLLVCPPEAGTCWSLRSTWSGPDVRCVYWLNLSRPIEEQGPLDVIIHKLTDVILEADQNDSQSLELVHRFQEYIDAHPETIVLDPLPAIRTLLDRSKSYELIRKIEAYMEDDRICSPPFMELTSLCGDDTMRLLEKNGLTFPFICKTRVAHGTNSHEMAIVFNQEGLNAIQPPCVVQNFINHNAVLYKVFVVGESYTVVQR,mutated_sequence,1.0,230.0,UPI00021CF3CC.a2m,UPI00021CF3CC.npy,gnomAD
+UPI000002B371,UPI000002B371.csv,MARGRKMSKPRAVEAAAAAAAVAATAPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSPNKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDARKGPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRKLTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH,mutated_sequence,1.0,352.0,UPI000002B371.a2m,UPI000002B371.npy,gnomAD
+UPI0002840938,UPI0002840938.csv,MLLPLLLLLPMCWAVEVKRPRGVSLTSESSCSPSRQAGGGRGQHWALGIGIYSLLTQFPGSAPLVGMGGQRWYLVEKALTCPGLSFLYPADHHFYDESKPFTCLDGSATIPFDQVNDDYCDCKDGSDEPGTA,mutated_sequence,1.0,132.0,UPI0002840938.a2m,UPI0002840938.npy,gnomAD
+UPI0001747A7C,UPI0001747A7C.csv,MHNLYSITGYPDPPGTMEEEEEDDDYENSTPPYKDLPPKPGTMEEEEEDDDYENSTPPYKDLPPKPGTMEEEEEDDDYENSTPPYKDLPPKPGSSAPPRPPRAAKETEKPPLPCKPRNMTGLDLAAVTCPPPQLAVNLEPSPLQPSLAATPVPWLNQRSGGPGCCQKRWMVYLCLLVVTSLFLGCLGLTVTLIKYQELMEELRMLSFQQMTWRTNMTGMAGLAGLKHDIARVRADTNQSLVELWGLLDCRRITCPEGWLPFEGKCYYFSPSTKSWDEARMFCQENYSHLVIINSFAEHNFVAKAHGSPRVYWLGLNDRAQEGDWRWLDGSPVTLSFWEPEEPNNIHDEDCATMNKGGTWNDLSCYKTTYWICERKCSC,mutated_sequence,1.0,378.0,UPI0001747A7C.a2m,UPI0001747A7C.npy,gnomAD
+UPI0001C0B37D,UPI0001C0B37D.csv,MASPLPSGFPARRNSRLDVFLRRHLPPEVYDAVRAYEPCIVVSNSENHILKYVVLSDRLVYLTENPPKSIRRVVALRDVVAIDLIDDYPEFLSSPDREISQHIRIIYSSTVLKKECKKSNSVRKFLFPFHHTKANNKKVKEEKNGLAFWRSKESRSLKESPLRDQQESSTPSKDSTLCPRPGLKKLSLHGQGAFRPLPSPSRRSSQSAPTTGKAVSEPSCTTNTKEPQGLPDHNSISEIPFKCNGNGNEFYLGNSLLDSPSQSNSNLEKKESELHLYVISTTSSIFLHLKSSWNNYIIKATLLQDPFYASEFSPAIGSQKPYRSEEKIKHFSQLKSELFLKDNSLRRILSLLMELKVAAQKNFILKRLFWKTSDLFYFIVNKLHEYLPESRDKNALQNQSQRVDELVACIEIIQTLVLMFRETETESSRLNTLAAKKGALFNLLVILISEPQIPKSCPVFDIQLVADSALVRMSFDAELQKLILEYTNTATALLYEILLVFQQGNLGLGSTKFAISWIMSFLQSCPPIITFVASIVKQVVRGLSASFQLLSPCQAVLLYQQFYILKSCLRHSRTLAEYIRNNYREEFRYFIHMPALQKRLPLCYPITQPTIQLFHEVLKLVE,mutated_sequence,1.0,622.0,UPI0001C0B37D.a2m,UPI0001C0B37D.npy,gnomAD
+UPI0000136B58,UPI0000136B58.csv,MLRLQMTDGHISCTAVEFSYMSKISLNTPPGTKVKLSGIVDIKNGFLLLNDSNTTVLGGEVEHLIEKWELQRSLSKHNRSNIGTEGGPPPFVPFGQKCVSHVQVDSRELDRRKTLQVTMPVKPTNDNDEFEKQRTAAIAEVAKSKETKTFGGGGGGARSNLNMNAAGNRNREVLQKEKSTKSEGKHEGVYRELVDEKALKHITEMGFSKEASRQALMDNGNNLEAALNVLLTSNKQKPVMGPPLRGRGKGRGRIRSEDEEDLGNARPSAPSTLFDFLESKMGTLNVEEPKSQPQQLHQGQYRSSNTEQNGVKDNNHLRHPPRNDTRQPRNEKPPRFQRDSQNSKSVLEGSGLPRNRGSERPSTSSVSEVWAEDRIKCDRPYSRYDRTKDTSYPLGSQHSDGAFKKRDNSMQSRSGKGPSFAEAKENPLPQGSVDYNNQKRGKRESQTSIPDYFYDRKSQTINNEAFSGIKIEKHFNVNTDYQNPVRSNSFIGVPNGEVEMPLKGRRIGPIKPAGPVTAVPCDDKIFYNSGPKRRSGPIKPEKILESSIPMEYAKMWKPGDECFALYWEDNKFYRAEVEALHSSGMTAVVKFIDYGNYEEVLLSNIKPIQTEAWEEEGTYDQTLEFRRGGDGQPRRSTRPTQQFYQPPRARN,mutated_sequence,1.0,651.0,UPI0000136B58.a2m,UPI0000136B58.npy,gnomAD
+UPI000013DF8A,UPI000013DF8A.csv,MGPRIGPAGEVPQVPDKETKATMGTENTPGGKASPDPQDVRPSVFHNIKLFVLCHSLLQLAQLMISGYLKSSISTVEKRFGLSSQTSGLLASFNEVGNTALIVFVSYFGSRVHRPRMIGYGAILVALAGLLMTLPHFISEPYRYDNTSPEDMPQDFKASLCLPTTSAPASAPSNGNCSSYTETQHLSVVGIMFVAQTLLGVGGVPIQPFGISYIDDFAHNSNSPLYLGILFAVTMMGPGLAFGLGSLMLRLYVDINQMPEGGISLTIKDPRWVGAWWLGFLIAAGAVALAAIPYFFFPKEMPKEKRELQFRRKVLAVTDSPARKGKDSPSKQSPGESTKKQDGLVQIAPNLTVIQFIKVFPRVLLQTLRHPIFLLVVLSQVCLSSMAAGMAIFLPKFLERQFSITASYANLLIGCLSFPSVIVGIVVGGVLVKRLHLGPVGCGALCLLGMLLCLFFSLPLFFIGCSSHQIAGITHQTSAHPGLELSPSCMEACSCPLDGFNPVCDPSTRVEYITPCHAGCSSWVVQDALDNSQVFYTNCSCVVEGNPVLAGSCDSTCSHLVVPFLLLVSLGSALACLTHTPSFMLILRGVKKEDKTLAVGIQFMFLRILAWMPSPVIHGSAIDTTCVHWALSCGRRAVCRYYNNDLLRNRFIGLQFFFKTGSVICFALVLAVLRQQDKEARTKESRSSPAVEQQLLVSGPGKKPEDSRV,mutated_sequence,1.0,709.0,UPI000013DF8A.a2m,UPI000013DF8A.npy,gnomAD
+UPI000013EFE9,UPI000013EFE9.csv,MSSAPRRPAKGADSFCTPEPESLGPGTPGFPEQEEDELHRTLGVERFEEILQEAGSRGGEEPGRSYGEEDFEYHRQSSHHIHHPLSTHLPPDARRRKTPQGPGRKPRRRPGASPTGETPTIEEGEEDEDEASEAEGARALTQPSPVSTPSSVQFFLQEDDSADRKAERTSPSSPAPLPHQEATPRASKGAQAGTQVEEAEAEAVAVASGTAGGDDGGASGRPLPKAQPGHRSYNLQERRRIGSMTGAEQALLPRVPTDEIEAQTLATADLDLMKSHRFEDVPGVRRHLVRKNAKGSTQSGREGREPGPTPRARPRAPHKPHEVFVELNELLLDKNQEPQWRETARWIKFEEDVEEETERWGKPHVASLSFRSLLELRRTLAHGAVLLDLDQQTLPGVAHQVVEQMVISDQIKAEDRANVLRALLLKHSHPSDEKDFSFPRNISAGSLGSLLGHHHGQGAESDPHVTEPLMGGVPETRLEVERERELPPPAPPAGITRSKSKHELKLLEKIPENAEATVVLVGCVEFLSRPTMAFVRLREAVELDAVLEVPVPVRFLFLLLGPSSANMDYHEIGRSISTLMSDKQFHEAAYLADEREDLLTAINAFLDCSVVLPPSEVQGEELLRSVAHFQRQMLKKREEQGRLLPTGAGLEPKSAQDKALLQMVEAAGAAEDDPLRRTGRPFGGLIRDVRRRYPHYLSDFRDALDPQCLAAVIFIYFAALSPAITFGGLLGEKTQDLIGVSELIMSTALQGVVFCLLGAQPLLVIGFSGPLLVFEEAFFSFCSSNHLEYLVGRVWIGFWLVFLALLMVALEGSFLVRFVSRFTQEIFAFLISLIFIYETFYKLVKIFQEHPLHGCSASNSSEVDGGENMTWAGARPTLGPGNRSLAGQSGQGKPRGQPNTALLSLVLMAGTFFIAFFLRKFKNSRFFPGRIRRVIGDFGVPIAILIMVLVDYSIEDTYTQKLSVPSGFSVTAPEKRGWVINPLGEKSPFPVWMMVASLLPAILVFILIFMETQITTLIISKKERMLQKGSGFHLDLLLIVAMGGICALFGLPWLAAATVRSVTHANALTVMSKAVAPGDKPKIQEVKEQRVTGLLVALLVGLSIVIGDLLRQIPLAVLFGIFLYMGVTSLNGIQFYERLHLLLMPPKHHPDVTYVKKVRTLRMHLFTALQLLCLALLWAVMSTAASLAFPFILILTVPLRMVVLTRIFTDREMKCLDANEAEPVFDEREGVDEYNEMPMPV,mutated_sequence,1.0,1241.0,UPI000013EFE9.a2m,UPI000013EFE9.npy,gnomAD
+UPI00015B3C70,UPI00015B3C70.csv,MTRGRAWGMRRAAAGAGGARAAGPTGGASRLHPNAGRRSGARAGAQGCGGPRVGSADSRALPAQPLACARGRSQRLVCDPKAASALPDLAPDVFVLRVRLEETGEMFRVANCRGDMTVRELKEELDLMVGIPFNLQRLQYLDEGVLMDDTTLKFHDVVPGGIISLCIWHHDGWTELVLAAVEGDPSKLSCLGLTEDSFYRTANSEHFEGEKWKHWTSQRAFVALYVASHRGHFDAVQYLLEHGASCLSRSPLGRTPLHVAAAMGRSDCIILLLQHGASIHDRDAKGETPISIAHRLNHTLSERQMVLLHRIAKSGIRDLNDLVMKNALQRVKSGFRSEKMTMTPH,mutated_sequence,1.0,345.0,UPI00015B3C70.a2m,UPI00015B3C70.npy,gnomAD
+UPI00006E232B,UPI00006E232B.csv,MAQSPPPQSLLGHDHWIFAQGWGWAGHWDSTSPASSSDSSGSCPCDGARGLPQPQPPSCSSRAAEAAATTPRRARTGPAGGQRQSASEREKLRMRTLARALHELRRFLPPSLAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEESLQCRRRQRGDAGSPWGCPLCPDRGPAEAQTQAEGQGQGQGQGQGQGQGQGQGQGQGQGQGRRPGLVSAVLAEASWGSPSACPGAQAAPERLGRGVHDTDPWATPPYCPKIQSPPYSSQGTTSDASLWTPPQGCPWTQSSPEPRNPPVPWTAAPATLELAAVYQGLSVSPEPCLSLGAPSLLPHPSCQRLQPQTPGRCWSHSAEVVPNSEDQGPGAAFQLSEASPPQSSGLRFSGCPELWQEDLEGARLGIFY,mutated_sequence,1.0,397.0,UPI00006E232B.a2m,UPI00006E232B.npy,gnomAD
+UPI000016021B,UPI000016021B.csv,MAELSEPEGPVDWKERCVALESQLMKFRVQASKIRELLAEKMQQLERQVIDAERQAEKAFQQVQVMEDKLKAANIQTSESETRLYNKCQDLESLIQEKDDVIQNLELQLEEQKQIRIQEAKIIEEKAAKIKEWVTVKLNELELENQNLRLINQNQTEEIRTMQSKLQEVQGKKSSTVSTLKLSEGQRLSSLTFGCFLSRARSPPQVVKSEEMSKISSKEPEFTEGKDMEEMEIPEKSVDNQVLENNRGQRTLHQTPCGSEQNRKTRTSFATDGGISQNSGAPVSDWSSDEEDGSKGRSKSRCTSTLSSHTSEEGVQCSRMGSEMYLTASDDSSSIFEEETFGIKRPEHKKLYSWQQEAQWKALNSPLGKGNSELSKKEQDSSSDELNKKFQSQRLDYSSSSSEANTPSPILTPALMPKHPNSLSGKGTQLVPSSHLPPPKLRIPNVFSISVALAKRHLSQPQLSSDRMFGTNRNAISMIRPLRPQETDLDLVDGDSTEVLENMDTSCDDGLFSYDSLDSPNSDDQEHCDSAKKVAYSKPPTPPLHRFPSWESRIYAVAKSGIRMSEAFNMESVNKNSAATLSYTTSGLYTSLIYKNMTTPVYTTLKGKATQISSSPFLDDSSGSEEEDSSRSSSRTSESDSRSRSGPGSPRAMKRGVSLSSVASESDYAIPPDAYSTDTEYSQPEQKLPKTCSSSSDNGKNEPLEKSGYLLKMSGKVKSWKRRWFVLKGGELLYYKSPSDVIRKPQGHIELSASCSILRGDNKQTVQLTTEKHTYYLTADSPNILEEWIKVLQNVLRVQAANPLSLQPEGKPTMKGLLTKVKHGYSKRVWCTLIGKTLYYFRSQEDKFPLGQIKLWEAKVEEVDRSCDSDEDYEASGRSLLSTHYTIVIHPKDQGPTYLLIGSKHEKDTWLYHLTVAAGSNNVNVGSEFEQLVCKLLNIDGEPSSQIWRHPTLCHSKEGIISPLTTLPSEALQTEAIKLFKTCQLFINAAVDSPAIDYHISLAQSALQICLTHPELQNEICCQLIKQTRRRQPQNQPGPLQGWQLLALCVGLFLPHHPFLWLLRLHLKRNADSRTEFGKYAIYCQRCVERTQQNGDREARPSRMEILSTLLRNPYHHSLPFSIPVHFMNGIYQVVGFDASTTVEEFLNTLNQDTGMRKPAQSGFALFTDDPSGRDLEHCLQGNIKICDIISKWEQASKEQQPGKCEGTRTVRLTYKNRLYFSVQARGETDREKLLLMYQTNDQIINGLFPLNKDLALEMAALLSQVEIGDFERPFSTPAGHVTNQCKVNQTLKQVIEKFYPKRYRDGCSEEQLRQLCQRLSTRWMALRGHSAADCVRIYLTVARKWPFFGAKLFLAKPITPSSLGSTFLWLAVHEDGLSLLEYNSMRLIVSYVYKSLMTFGGYQDDFMVVINNTHSKDKPTEKLLFAMAKPKILEITLLIASYINNFHQQKAAFHHLSAPALLSAQTRGPQARMMGSQPLLSSSRPTKGPTLL,mutated_sequence,1.0,1493.0,UPI000016021B.a2m,UPI000016021B.npy,gnomAD
+UPI000013D057,UPI000013D057.csv,MRGLEESGPRPTATPCGCVKPALETGNLLTEPVGYLESCFSAKNGTPRQPSICSYSRACLRIRKRIFNNPEHSLMGLEQFSHVWILFVFHKNGHLSCKAKVQPPRLNGAKTGVFSTRSPHRPNAIGLTLAKLEKVEGGAIYLSGIDMIHGTPVLDIKPYIAEYDSPQNVMEPLADFNLQNNQHTPNTVSQSDSKTDSCDQRQLSGCDEPQPHHSTKRKPKCPEDRTSEENYLTHSDTARIQQAFPMHREIAVDFGLESRRDQSSSVAEEQIGPYCPEKSFSEKGTDKKLERVEGAAVLQGSRAETQPMAPHCPAGRADGAPRSVVPAWVTEAPVATLEVRFTPHAEMDLGQLSSQDVGQASFKYFQSAEEAKRAIEAVLSADPRSVYRRKLCQDRLFYFTVDIAHVTCWFGDGFAEVLRIKPASEPVHMTGPVGSLVSLGS,mutated_sequence,1.0,441.0,UPI000013D057.a2m,UPI000013D057.npy,gnomAD
+UPI000000127D,UPI000000127D.csv,MLVIPPGLSEEEEALQKKFNKLKKKKKALLALKKQSSSSTTSQGGVKRSLSEQPVMDTATATEQAKQLVKSGAISAIKAETKNSGFKRSRTLEGKLKDPEKGPVPTFQPFQRSISADDDLQESSRRPQRKSLYESFVSSSDRLRELGPDGEEAEGPGAGDGPPRSFDWGYEERSGAHSSASPPRSRSRDRSHERNRDRDRDRERDRDRDRDRDRERDRDRDRDRDRDRERDRDRERDRDRDREGPFRRSDSFPERRAPRKGNTLYVYGEDMTPTLLRGAFSPFGNIIDLSMDPPRNCAFVTYEKMESADQAVAELNGTQVESVQLKVNIARKQPMLDAATGKSVWGSLAVQNSPKGCHRDKRTQIVYSDDVYKENLVDGF,mutated_sequence,1.0,380.0,UPI000000127D.a2m,UPI000000127D.npy,gnomAD
+UPI00003588F0,UPI00003588F0.csv,MAADKGPAAGPRSRAAMAQWRKKKGLRKRRGAASQARGSDSEDGEFEIQAEDDARARKLGPGRPLPTFPTSECTSDVEPDTREMVRAQNKKKKKSGGFQSMGLSYPVFKGIMKKGYKVPTPIQRKTIPVILDGKDVVAMARTGSGKTACFLLPMFERLKTHSAQTGARALILSPTRELALQTLKFTKELGKFTGLKTALILGGDRMEDQFAALHENPDIIIATPGRLVHVAVEMSLKLQSVEYVVFDEADRLFEMGFAEQLQEIIARLPGGHQTVLFSATLPKLLVEFARAGLTEPVLIRLDVDTKLNEQLKTSFFLVREDTKAAVLLHLLHNVVRPQDQTVVFVATKHHAEYLTELLTTQRVSCAHIYSALDPTARKINLAKFTLGKCSTLIVTDLAARGLDIPLLDNVINYSFPAKGKLFLHRVGRVARAGRSGTAYSLVAPDEIPYLLDLHLFLGRSLTLARPLKEPSGVAGVDGMLGRVPQSVVDEEDSGLQSTLEASLELRGLARVADNAQQQYVRSRPAPSPESIKRAKEMDLVGLGLHPLFSSRFEEEELQRLRLVDSIKNYRSRATIFEINASSRDLCSQVMRAKRQKDRKAIARFQQGQQGRQEQQEGPVGPAPSRPALQEKQPEKEEEEEAGESVEDIFSEVVGRKRQRSGPNRGAKRRREEARQRDQEFYIPYRPKDFDSERGLSISGEGGAFEQQAAGAVLDLMGDEAQNLTRGRQQLKWDRKKKRFVGQSGQEDKKKIKTESGRYISSSYKRDLYQKWKQKQKIDDRDSDEEGASDRRGPERRGGKRDRGQGASRPHAPGTPAGRVRPELKTKQQILKQRRRAQKLHFLQRGGLKQLSARNRRRVQELQQGAFGRGARSKKGKMRKRM,mutated_sequence,1.0,881.0,UPI00003588F0.a2m,UPI00003588F0.npy,gnomAD
+UPI0000E671FE,UPI0000E671FE.csv,MGLKNKKNTEDPEEPLIASQSTEPEIGHLSPSKKETIMVTLHGATNLPACKDGSEPWPYVVVKSTSEEKNNQSSKAVTSVTSEPTRAPIWGDTVNVEIQAEDAGQEDVILKVVDNRKKQELLSYKIPIKYLRVFHPYHFELVKPTESGKADEATAKTQLYATVVRKSSFIPRYIGCNHMALEIFLRGVNEPLANNPNPIVVIARVVPNYKEFKVSQANRDLASVGLPITPLSFPIPSMMNFDVPRVSQNGCPQLSKPGGPPEQPLWNQSFLFQGRDGATSFSEDTALVLEYYSSTSMKGSQPWTLNQPLGISVLPLKSRLYQKMLTGKGLDGLHVERLPIMDTSLKTINDEAPTVALSFQLLSSERPENFLTPNNSKALPTLDPKILDKKLRTIQESWSKDTVSSTMDLSTSTPREAEEEPLVPEMSHDTEMNNYRRAMQKMAEDILSLRRQASILEGENRILRSRLAQQEEEEGQGKASEAQNTVSMKQKLLLSELDMKKLRDRVQHLQNELIRKNDREKELLLLYQAQQPQAALLKQYQGKLQKMKALEETVRHQEKVIEKMERVLEDRLQDRSKPPPLNRQQGKPYTGFPMLSASGLPLGSMGENLPVELYSVLLAENAKLRTELDKNRHQQAPIILQQQALPDLLSGTSDKFNLLAKLEHAQSRILSLESQLEDSARRWGREKQDLATRLQEQEKGFRHPSNSIIIEQPSALTHSMDLKQPSELEPLLPSSDSKLNKPLSPQKETANSQQT,mutated_sequence,1.0,755.0,UPI0000E671FE.a2m,UPI0000E671FE.npy,gnomAD
+UPI0000DBEF37,UPI0000DBEF37.csv,MGKKSRAVPGRRPILQLSPPGPRGSTPGRDPEPEPDTEPDSTAAVPSQPAPSAATTTTTAVTAAAASDDSPSEDEQEAVQEVPRVVQNPPKPVMTTRPTAVKATGGLCLLGAYADSDDDDNDVSEKLAQSKETNGNQSTDIDSTLANFLAEIDAITAPQPAAPVGASAPPPTPPRPEPKEAATSTLSSSTSNGTDSTQTSGWQYDTQCSLAGVGIEMGDWQEVWDENTGCYYYWNTQTNEVTWELPQYLATQVQGLQHYQPSSVPGAETSFVVNTDIYSKEKTISVSSSKSGPVIAKREVKKEVNEGIQALSNSEEEKKGVAASLLAPLLPEGIKEEEERWRRKVICKEEPVSEVKETSTTVEEATTIVKPQEIMLDNIEDPSQEDLCSVVQSGESEEEEEQDTLELELVLERKKAELRALEEGDGSVSGSSPRSDISQPASQDGMRRLMSKRGKWKMFVRATSPESTSRSSSKTGRDTPENGETAIGAENSEKIDENSDKEMEVEESPEKIKVQTTPKVEEEQDLKFQIGELANTLTSKFEFLGINRQSISNFHVLLLQTETRIADWREGALNGNYLKRKLQDAAEQLKQYEINATPKGWSCHWDRDHRRYFYVNEQSGESQWEFPDGEEEEEESQAQENRDETLAKQTLKDKTGTDSNSTESSETSTGSLCKESFSGQVSSSSLMPLTPFWTLLQSNVPVLQPPLPLEMPPPPPPPPESPPPPPPPPPPAEDGEIQEVEMEDEGSEEPPAPGTEEDTPLKPSAQTTVVTSQSSVDSTISSSSSTKGIKRKATEISTAVVQRSATIGSSPVLYSQSAIATGHQAAGIGNQATGIGHQTIPVSLPAAGMGHQARGMSLQSNYLGLAAAPAIMSYAECSVPIGVTAPSLQPVQARGAVPTATIIEPPPPPPPPPPPPPPAPKMPPPEKTKKGRKDKAKKSKTKMPSLVKKWQSIQRELDEEDNSSSSEEDRESTAQKRIEEWKQQQLVSGMAERNANFEALPEDWRARLKRRKMAPNT,mutated_sequence,1.0,1017.0,UPI0000DBEF37.a2m,UPI0000DBEF37.npy,gnomAD
+UPI00001A832D,UPI00001A832D.csv,MGEKSRRKGPAPRHADGKLGRTCDHPYAPWSFTPSSRAPTAWVRPPCPVWASRLQEHSPEPRRARAPPTRRAQAALYAPALRLRDHLDRFSILMTSCTSWLQAPQAPGLCRDEQSSRISVPQLSGAPILLPDLEGTKLSNFQESSPLPHKHERKDKRSTPEEEGRSAPEKIIQSLKLCPGGHRPASLSSGCPAGCRLSFNLPPSMLLSVQKCCMPSSLKTC,mutated_sequence,1.0,221.0,UPI00001A832D.a2m,UPI00001A832D.npy,gnomAD
+UPI000004C619,UPI000004C619.csv,MRLLPEWFLLLFGPWLLRKAVSAQIPESGRPQYLGLRPAAAGAGAPGQQLPEPRSSDGLGVGRAWSWAWPTNHTGALARAGAAGALPAQRTKRKPSIKAARAKKIFGWGDFYFRVHTLKFSLLVTGKIVDHVNGTFSVYFRHNSSSLGNLSVSIVPPSKRVEFGGVWLPGPVPHPLQSTLALEGVLPGLGPPLGMAAAAAGPGLGGSLGGALAGPLGGALGVPGAKESRAFNCHVEYEKTNRARKHRPCLYDPSQVCFTEHTQSQAAWLCAKPFKVICIFVSFLSFDYKLVQKVCPDYNFQSEHPYFG,mutated_sequence,1.0,308.0,UPI000004C619.a2m,UPI000004C619.npy,gnomAD
+UPI0000046406,UPI0000046406.csv,MASPSLPGSDCSQIIDHSHVPEFEVATWIKITLILVYLIIFVMGLLGNSATIRVTQVLQKKGYLQKEVTDHMVSLACSDILVFLIGMPMEFYSIIWNPLTTSSYTLSCKLHTFLFEACSYATLLHVLTLSFERYIAICHPFRYKAVSGPCQVKLLIGFVWVTSALVALPLLFAMGTEYPLVNVPSHRGLTCNRSSTRHHEQPETSNMSICTNLSSRWTVFQSSIFGAFVVYLVVLLSVAFMCWNMMQVLMKSQKGSLAGGTRPPQLRKSESEESRTARRQTIIFLRLIVVTLAVCWMPNQIRRIMAAAKPKHDWTRSYFRAYMILLPFSETFFYLSSVINPLLYTVSSQQFRRVFVQVLCCRLSLQHANHEKRLRVHAHSTTDSARFVQRPLLFASRRQSSARRTEKIFLSTFQSEAEPQSKSQSLSLESLEPNSGAKPANSAAENGFQEHEV,mutated_sequence,1.0,453.0,UPI0000046406.a2m,UPI0000046406.npy,gnomAD
+UPI00003D4D6C,UPI00003D4D6C.csv,MAASTMSVCSSACSDSWQVDACPESCCEPHCCALSCCAPAPCLTLVCTPVSRVSSPCCQAACEPSPCQSGCTSSCTPSCCQQSSCQPACCTSSPCQQACCVPVCCKPVCCLPTCSKDSSSCCQQSSCQPTCCASSSSQQSCCVPVCCKPVCYVPTCSEDSSSCCQQSSCHPACCTSSPCQQACCVPVRCKPVCCKPICCVPVCSGASTSCCQQSSCQPACCTTSCCRPSSSVSLLCRPVCRPACCMPVSSCCAPASSCQASCCRPASCVSLLCRPACSRPAC,mutated_sequence,1.0,282.0,UPI00003D4D6C.a2m,UPI00003D4D6C.npy,gnomAD
+UPI00002020EB,UPI00002020EB.csv,MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVSEALGQGTREAVGTGVRQVPGFGVADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVPGHNGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNFGTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNEARGSGESGIQNSETSPGMFNFDTFWKNFKSKLGFINWDAINKNQVPPPSTRALLYFSRLWEDFKQNTPFLNWKAIIEGADASSLQKRAGRDDQNYNYNQHAYPTAYGGKYSVKTPAKGGVSPSSSASRVQPGLLQWVKFW,mutated_sequence,1.0,476.0,UPI00002020EB.a2m,UPI00002020EB.npy,gnomAD
+UPI0000039B1A,UPI0000039B1A.csv,MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHMTQPNRYQVFFFGTHETAFLSPKRLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEGDEDKPTHAGGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEAEAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVADEEASQEWHAEAPGGGDRDSL,mutated_sequence,1.0,251.0,UPI0000039B1A.a2m,UPI0000039B1A.npy,gnomAD
+UPI0000039E60,UPI0000039E60.csv,MLTEVMEVWHGLVIAVVSLFLQACFLTAINYLLSRHMAHKSEQILKAASLQVPRPSPGHHHPPAVKEMKETQTERDIPMSDSLYRHDSDTPSDSLDSSCSSPPACQATEDVDYTQVVFSDPGELKNDSPLDYENIKEITDYVNVNPERHKPSFWYFVNPALSEPAEYDQVAM,mutated_sequence,1.0,172.0,UPI0000039E60.a2m,UPI0000039E60.npy,gnomAD
+UPI0000070C71,UPI0000070C71.csv,MTENVVCTGAVNAVKEVWEKRIKKLNEDLKREKEFQHKLVRIWEERVSLTKLREKVTREDGRVILKIEKEEWKTLPSSLLKLNQLQEWQLHRTGLLKIPEFIGRFQNLIVLDLSRNTISEIPPGIGLLTRLQELILSYNKIKTVPKELSNCASLEKLELAVNRDICDLPQELSNLLKLTHLDLSMNDFTTIPLAVLNMPALEWLDMGSNKLEQLPDTIERMQNLHTLWLQRNEITCLPQTISNMKNLGTLVLSNNKLQDIPVCMEEMANLRFVNFRDNPLKLKVSLPPSEGTDEEEERELFGLQFMHTYIQESRRRADHQVNGSTTLPISINTDG,mutated_sequence,1.0,335.0,UPI0000070C71.a2m,UPI0000070C71.npy,gnomAD
+UPI0001612CC0,UPI0001612CC0.csv,MEEFLQRAKSKLNRSKRLEKVHVVIGPKSCDLDSLISTFTYAYFLDKVSPPGVLCLPVLNIPRTEFNYFTETRFILEELNISESFHIFRDEINLHQLNDEGKLSITLVGSSVLASEDKTLESAVVKVINPVEQSDANVEFRESSSSLVLKEILQEAPELITEQLAHRLRGSILFKWMTMESEKISEKQEEILSILEEKFPNLPPREDIINVLQETQFSAQGLSIEQTMLKDLKELSDGEIKVAISTVSMNLENCLFHSNITSDLKAFTDKFGFDVLILFSSYLSEEQQPRRQIAVYSENMELCSQICCELEECQNPCLELEPFDCGCDEILVYQQEDPSVTCDQVVLVVKEVINRRCPEMVSNSRTSSTEAVAGSAPLSQGSSGIMELYGSDIEPQPSSVNFIENPPDLNDSNQAQVDANVDLVSPDSGLATIRSSRSSKESSVFLSDDSPVGEGAGPHHTLLPGLDSYSPIPEGAVAEEHAWSGEHGEHFDLFNFDPAPMASGQSQQSSHSADYSPADDFFPNSDLSEGQLPAGPEGLDGMGTNMSNYSSSSLLSGAGKDSLVEHDEEFVQRQDSPRDNSERNLSLTDFVGDESPSPERLKNTGKRIPPTPMNSLVESSPSTEEPASLYTEDMTQKATDTGHMGPPQTHARCSSWWGGLEIDSKNIADAWSSSEQESVFQSPESWKEHKPSSIDRRASDSVFQPKSLEFTKSGPWESEFGQPELGSNDIQDKNEESLPFQNLPMEKSPLPNTSPQGTNHLIEDFASLWHSGRSPTAMPEPWGNPTDDGEPAAVAPFPAWSAFGKEDHDEALKNTWNLHPTSSKTPSVRDPNEWAMAKSGFAFSSSELLDNSPSEINNEAAPEIWGKKNNDSRDHIFAPGNPSSDLDHTWTNSKPPKEDQNGLVDPKTRGKVYEKVDSWNLFEENMKKGGSDVLVPWEDSFLSYKCSDYSASNLGEDSVPSPLDTNYSTSDSYTSPTFAGDEKETEHKPFAKEEGFESKDGNSTAEETDIPPQSLQQSSRNRISSGPGNLDMWASPHTDNSSEINTTHNLDENELKTEHTDGKNISMEDDVGESSQSSYDDPSMMQLYNETNRQLTLLHSSTNSRQTAPDSLDLWNRVILEDTQSTATISDMDNDLDWDDCSGGAAIPSDGQTEGYMAEGSEPETRFTVRQLEPWGLEYQEANQVDWELPASDEHTKDSAPSEHHTLNEKSGQLIANSIWDSVMRDKDMSSFMLPGSSHITDSEQRELPPEIPSHSANVKDTHSPDAPAASGTSESEALISHLDKQDTERETLQSDAASLATRLENPGYFPHPDPWKGHGDGQSESEKEAQGATDRGHLDEEEVIASGVENASGISEKGQSDQELSSLVASEHQEICIKSGKISSLAVTFSPQTEEPEEVLEYEEGSYNLDSRDVQTGMSADNLQPKDTHEKHLMSQRNSGETTETSDGMNFTKYVSVPEKDLEKTEECNFLEPENVGGGPPHRVPRSLDFGDVPIDSDVHVSSTCSEITKNLDVKGSENSLPGAGSSGNFDRDTISSEYTHSSASSPELNDSSVALSSWGQQPSSGYQEENQGNWSEQNHQESELITTDGQVEIVTKVKDLEKNRINEFEKSFDRKTPTFLEIWNDSVDGDSFSSLSSPETGKYSEHSGTHQESNLIASYQEKNEHDISATVQPEDARVISTSSGSDDDSVGGEESIEEEIQVANCHVAEDESRAWDSLNESNKFLVTADPKSENIYDYLDSSEPAENENKSNPFCDNQQSSPDPWTFSPLTETEMQITAVEKEKRSSPETGTTGDVAWQISPKASFPKNEDNSQLEMLGFSADSTEWWKASPQEGRLIESPFERELSDSSGVLEINSSVHQNASPWGVPVQGDIEPVETHYTNPFSDNHQSPFLEGNGKNSHEQLWNIQPRQPDPDADKFSQLVKLDQIKEKDSREQTFVSAAGDELTPETPTQEQCQDTMLPVCDHPDTAFTHAEENSCVTSNVSTNEGQETNQWEQEKSYLGEMTNSSIATENFPAVSSPTQLIMKPGSEWDGSTPSEDSRGTFVPDILHGNFQEGGQLASAAPDLWIDAKKPFSLKADGENPDILTHCEHDSNSQASDSPDICHDSEAKQETEKHLSACMGPEVESSELCLTEPEIDEEPIYEPGREFVPSNAELDSENATVLPPIGYQADIKGSSQPASHKGSPEPSEINGDNSTGLQVSEKGASPDMAPILEPVDRRIPRIENVATSIFVTHQEPTPEGDGSWISDSFSPESQPGARALFDGDPHLSTENPALVPDALLASDTCLDISEAAFDHSFSDASGLNTSTGTIDDMSKLTLSEGHPETPVDGDLGKQDICSSEASWGDFEYDVMGQNIDEDLLREPEHFLYGGDPPLEEDSLKQSLAPYTPPFDLSYLTEPAQSAETIEEAGSPEDESLGCRAAEIVLSALPDRRSEGNQAETKNRLPGSQLAVLHIREDPESVYLPVGAGSNILSPSNVDWEVETDNSDLPAGGDIGPPNGASKEISELEEEKTIPTKEPEQIKSEYKEERCTEKNEDRHALHMDYILVNREENSHSKPETCEERESIAELELYVGSKETGLQGTQLASFPDTCQPASLNERKGLSAEKMSSKSDTRSSFESPAQDQSWMFLGHSEVGDPSLDARDSGPGWSGKTVEPFSELGLGEGPQLQILEEMKPLESLALEEASGPVSQSQKSKSRGRAGPDAVTLQAVTHDNEWEMLSPQPVQKNMIPDTEMEEETEFLELGTRISRPNGLLSEDVGMDIPFEEGVLSPSAADMRPEPPNSLDLNDTHPRRIKLTAPNINLSLDQSEGSILSDDNLDSPDEIDINVDELDTPDEADSFEYTGHDPTANKDSGQESESIPEYTAEEEREDNRLWRTVVIGEQEQRIDMKVIEPYRRVISHGGYYGDGLNAIIVFAACFLPDSSRADYHYVMENLFLYVISTLELMVAEDYMIVYLNGATPRRRMPGLGWMKKCYQMIDRRLRKNLKSFIIVHPSWFIRTILAVTRPFISSKFSSKIKYVNSLSELSGLIPMDCIHIPESIIKLDEELREASEAAKTSCLYNDPEMSSMEKDIDLKLKEKP,mutated_sequence,1.0,3088.0,UPI0001612CC0.a2m,UPI0001612CC0.npy,gnomAD
+UPI00001AE77D,UPI00001AE77D.csv,MSSNSFPYNEQSGGGEATELGQEATSTISPSGAFGLFSSDLKKNEDLKQMLESNKDSAKLDAMKRIVGMIAKGKNASELFPAVVKNVASKNIEIKKLVYVYLVRYAEEQQDLALLSISTFQRALKDPNQLIRASALRVLSSIRVPIIVPIMMLAIKEASADLSPYVRKNAAHAIQKLYSLDPEQKEMLIEVIEKLLKDKSTLVAGSVVMAFEEVCPDRIDLIHKNYRKLCNLLVDVEEWGQVVIIHMLTRYARTQFVSPWKEGDELEDNGKNFYESDDDQKEKTDKKKKPYTMDPDHRLLIRNTKPLLQSRNAAVVMAVAQLYWHISPKSEAGIISKSLVRLLRSNREVQYIVLQNIATMSIQRKGMFEPYLKSFYVRSTDPTMIKTLKLEILTNLANEANISTLLREFQTYVKSQDKQFAAATIQTIGRCATNILEVTDTCLNGLVCLLSNRDEIVVAESVVVIKKLLQMQPAQHGEIIKHMAKLLDSITVPVARASILWLIGENCERVPKIAPDVLRKMAKSFTSEDDLVKLQILNLGAKLYLTNSKQTKLLTQYILNLGKYDQNYDIRDRTRFIRQLIVPNVKSGALSKYAKKIFLAQKPAPLLESPFKDRDHFQLGTLSHTLNIKATGYLELSNWPEVAPDPSVRNVEVIELAKEWTPAGKAKQENSAKKFYSESEEEEDSSDSSSDSESESGSESGEQGESGEEGDSNEDSSEDSSSEQDSESGRESGLENKRTAKRNSKAKGKSDSEDGEKENEKSKTSDSSNDESSSIEDSSSDSESESEPESESESRRVTKEKEKKTKQDRTPLTKDVSLLDLDDFNPVSTPVALPTPALSPSLMADLEGLHLSTSSSVISVSTPAFVPTKTHVLLHRMSGKGLAAHYFFPRQPCIFGDKMVSIQITLNNTTDRKIENIHIGEKKLPIGMKMHVFNPIDSLEPEGSITVSMGIDFCDSTQTASFQLCTKDDCFNVNIQPPVGELLLPVAMSEKDFKKEQGVLTGMNETSAVIIAAPQNFTPSVIFQKVVNVANVGAVPSGQDNIHRFAAKTVHSGSLMLVTVELKEGSTAQLIINTEKTVIGSVLLRELKPVLSQG,mutated_sequence,1.0,1094.0,UPI00001AE77D.a2m,UPI00001AE77D.npy,gnomAD
+UPI0001AE68B4,UPI0001AE68B4.csv,MEGTHCTLQLHKPITELCYISFCLPKGEVRGFSYKGTVTLDRSNKGFHNCYQVREESDIISLSQEPDEHPGDIFFKQTPTKDILTELYKLTTERERLLTNLLSSDHILGITMGNQEGKLQELSVSLAPEDDCFQSAGDWQGELPVGPLNKRSTHGNKKPRRSSGRRESFGALPQKRTKRKGRGGRESAPLMGKDKICSSHSLPLSRTRPNLWVLEEKGNLLPNGALACSLQRRESCPPDIPKTPDTDLGFGSFETAFKDTGLGREVLPPDCSSTEAGGDGIRRPPSGLEHQQTGLSESHQDPEKHPEAEKDEMEKPAKRTCKQKPVSKVVAKVQDLSSQVQRVVKTHSKGKETIAIRPAAHAEFVPKADLLTLPGAEAGAHGSRRQGKERQGDRSSQSPAGETASISSVSASAEGAVNKVPLKVIESEKLDEAPEGKRLGFPVHTSVPHTRPETRNKRRAGLPLGGHKSLFLDLPHKVGPDSSQPRGDKKKPSPPAPAALGKVFNNSASQSSTHKQTSPVPSPLSPRLPSPQQHHRILRLPALPGEREAALNDSPCRKSRVFSGCVSADTLEPPSSAKVTETKGASPAFLRAGQPRLVPGETLEKSLGPGKTTAEPQHQSPPGISSEGFPWDGFNEQTPKDLPNRDGGAWVLGYRAGPACPFLLHEEREKSNRSELYLDLHPDHSLTEQDDRTPGRLQAVWPPPKTKDTEEKVGLKYTEAEYQAAILHLKREHKEEIENLQAQFELRAFHIRGEHAMITARLEETIENLKHELEHRWRGGCEERKDVCISTDDDCPPKTFRNVCVQTDRETFLKPCESESKTTRSNQLVPKKLNISSLSQLSPPNDHKDIHAALQPMEGMASNQQKALPPPPASIPPPPPLPSGLGSLSPAPPMPPVSAGPPLPPPPPPPPPLPPPSSAGPPPPPPPPPLPNSPAPPNPGGPPPAPPPPGLAPPPPPGLFFGLGSSSSQCPRKPAIEPSCPMKPLYWTRIQISDRSQNATPTLWDSLEEPDIRDPSEFEYLFSKDTTQQKKKPLSETYEKKNKVKKIIKLLDGKRSQTVGILISSLHLEMKDIQQAIFNVDDSVVDLETLAALYENRAQEDELVKIRKYYETSKEEELKLLDKPEQFLHELAQIPNFAERAQCIIFRSVFSEGITSLHRKVEIITRASKDLLHVKSVKDILALILAFGNYMNGGNRTRGQADGYSLEILPKLKDVKSRDNGINLVDYVVKYYLRYYDQEAGTEKSVFPLPEPQDFFLASQVKFEDLIKDLRKLKRQLEASEKQMVVVCKESPKEYLQPFKDKLEEFFQKAKKEHKMEESHLENAQKSFETTVRYFGMKPKSGEKEITPSYVFMVWYEFCSDFKTIWKRESKNISKERLKMAQESVSKLTSEKKVETKKINPTASLKERLRQKEASVTTN,mutated_sequence,1.0,1419.0,UPI0001AE68B4.a2m,UPI0001AE68B4.npy,gnomAD
+UPI000204A78B,UPI000204A78B.csv,MERPEAGINSNECENVSRKKKMSEEFEANTMDSLVDMPFATVDIQDDCGITDEPQINLKRSQENEWVKSDQVKKRKKKRKDYQPNYFLSIPITNKEIIKGIKILQNAIIQQDERLAKAMVSDGSFHITLLVMQLLNEDEVNIGIDALLELKPFIEELLQGKHLTLPFQGIGTFGNQVGFVKLAEGDHVNSLLEIAETANRTFQEKGILVGESRSFKPHLTFMKLSKSPWLRKNGVKKIDPDLYEKFISHRFGEEILYRIDLCSMLKKKQSNGYYHCESSIVIGKKPIGIRDLINEALHRETMGLKSKVKQIKELLLKPETQARIRRELFEGRLINNSNSANDVDFSTTLT,mutated_sequence,1.0,350.0,UPI000204A78B.a2m,UPI000204A78B.npy,gnomAD
+UPI0000167E20,UPI0000167E20.csv,MNTNDAKEYLARREIPQLFESLLNGLMCSKPEDPVEYLESCLQKVKELGGCDKVKWDTFVSQEKKTLPPLNGGQSRRSFLRNVMPENSNFPYRRYDRLPPIHQFSIESDTDLSETAELIEEYEVFDPTRPRPKIILVIGGPGSGKGTQSLKIAERYGFQYISVGELLRKKIHSTSSNRKWSLIAKIITTGELAPQETTITEIKQKLMQIPDEEGIVIDGFPRDVAQALSFEDQICTPDLVVFLACANQRLKERLLKRAEQQGRPDDNVKATQRRLMNFKQNAAPLVKYFQEKGLIMTFDADRDEDEVFYDISMAVDNKLFPNKEAAAGSSDLDPSMILDTGEIIDTGSDYEDQGDDQLNVFGEDTMGGFMEDLRKCKIIFIIGGPGSGKGTQCEKLVEKYGFTHLSTGELLREELASESERSKLIRDIMERGDLVPSGIVLELLKEAMVASLGDTRGFLIDGYPREVKQGEEFGRRIGDPQLVICMDCSADTMTNRLLQRSRSSLPVDDTTKTIAKRLEAYYRASIPVIAYYETKTQLHKINAEGTPEDVFLQLCTAIDSIF,mutated_sequence,1.0,562.0,UPI0000167E20.a2m,UPI0000167E20.npy,gnomAD
+UPI000003B115,UPI000003B115.csv,MGPVRLGILLFLFLAVHEAWAGMLKEEDDDTERLPSKCEVCKLLSTELQAELSRTGRSREVLELGQVLDTGKRKRHVPYSVSETRLEEALENLCERILDYSVHAERKGSLRYAKGQSQTMATLKGLVQKGVKVDLGIPLELWDEPSVEVTYLKKQCETMLEEFEDIVGDWYFHHQEQPLQNFLCEGHVLPAAETACLQETWTGKEITDGEEKTEGEEEQEEEEEEEEEEGGDKMTKTGSHPKLDREDL,mutated_sequence,1.0,248.0,UPI000003B115.a2m,UPI000003B115.npy,gnomAD
+UPI000006EEEC,UPI000006EEEC.csv,MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVPTTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLLTANTTKGIYAGVCISVLVLLALLGVIIAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIYIENSLYATD,mutated_sequence,1.0,364.0,UPI000006EEEC.a2m,UPI000006EEEC.npy,gnomAD
+UPI00001BDC7C,UPI00001BDC7C.csv,MKKFSRMPKSEGGSGGGAAGGGAGGAGAGAGCGSGGSSVGVRVFAVGRHQVTLEESLAEGGFSTVFLVRTHGGIRCALKRMYVNNMPDLNVCKREITIMKELSGHKNIVGYLDCAVNSISDNVWEVLILMEYCRAGQVVNQMNKKLQTGFTEPEVLQIFCDTCEAVARLHQCKTPIIHRDLKVENILLNDGGNYVLCDFGSATNKFLNPQKDGVNVVEEEIKKYTTLSYRAPEMINLYGGKPITTKADIWALGCLLYKLCFFTLPFGESQVAICDGNFTIPDNSRYSRNIHCLIRFMLEPDPEHRPDIFQVSYFAFKFAKKDCPVSNINNSSIPSALPEPMTASEAAARKSQIKARITDTIGPTETSIAPRQRPKANSATTATPSVLTIQSSATPVKVLAPGEFGNHRPKGALRPGNGPEILLGQGPPQQPPQQHRVLQQLQQGDWRLQQLHLQHRHPHQQQQQQQQQQQQQQQQQQQQQQQQQQQHHHHHHHHLLQDAYMQQYQHATQQQQMLQQQFLMHSVYQPQPSASQYPTMMPQYQQAFFQQQMLAQHQPSQQQASPEYLTSPQEFSPALVSYTSSLPAQVGTIMDSSYSANRSVADKEAIANFTNQKNISNPPDMSGWNPFGEDNFSKLTEEELLDREFDLLRSNRLEERASSDKNVDSLSAPHNHPPEDPFGSVPFISHSGSPEKKAEHSSINQENGTANPIKNGKTSPASKDQRTGKKTSVQGQVQKGNDESESDFESDPPSPKSSEEEEQDDEEVLQGEQGDFNDDDTEPENLGHRPLLMDSEDEEEEEKHSSDSDYEQAKAKYSDMSSVYRDRSGSGPTQDLNTILLTSAQLSSDVAVETPKQEFDVFGAVPFFAVRAQQPQQEKNEKNLPQHRFPAAGLEQEEFDVFTKAPFSKKVNVQECHAVGPEAHTIPGYPKSVDVFGSTPFQPFLTSTSKSESNEDLFGLVPFDEITGSQQQKVKQRSLQKLSSRQRRTKQDMSKSNGKRHHGTPTSTKKTLKPTYRTPERARRHKKVGRRDSQSSNEFLTISDSKENISVALTDGKDRGNVLQPEESLLDPFGAKPFHSPDLSWHPPHQGLSDIRADHNTVLPGRPRQNSLHGSFHSADVLKMDDFGAVPFTELVVQSITPHQSQQSQPVELDPFGAAPFPSKQ,mutated_sequence,1.0,1161.0,UPI00001BDC7C.a2m,UPI00001BDC7C.npy,gnomAD
+UPI000045779F,UPI000045779F.csv,MNGGAERAMRSLPSLGGLALLCCAAAAAAAAVASAASAGNVTGGGGAAGQVDASPGPGLRGEPSHPFPRATAPTAQAPRTGPPRATVHRPLAATSPAQSPETTPLWATAGPSSTTFQAPLGPSPTTPPAAERTSTTSQAPTRPAPTTLSTTTGPAPTTPVATTVPAPTTPRTPTPDLPSSSNSSVLPTPPATEAPSSPPPEYVCNCSVVGSLNVNRCNQTTGQCECRPGYQGLHCETCKEGFYLNYTSGLCQPCDCSPHGALSIPCNSSGKCQCKVGVIGSICDRCQDGYYGFSKNGCLPCQCNNRSASCDALTGACLNCQENSKGNHCEECKEGFYQSPDATKECLRCPCSAVTSTGSCSIKSSELEPECDQCKDGYIGPNCNKCENGYYNFDSICRKCQCHGHVDPVKTPKICKPESGECINCLHNTTGFWCENCLEGYVHDLEGNCIKKEVILPTPEGSTILVSNASLTTSVPTPVINSTFTPTTLQTIFSVSTSENSTSALADVSWTQFNIIILTVIIIVVVLLMGFVGAVYMYREYQNRKLNAPFWTIELKEDNISFSSYHDSIPNADVSGLLEDDGNEVAPNGQLTLTTPIHNYKA,mutated_sequence,1.0,602.0,UPI000045779F.a2m,UPI000045779F.npy,gnomAD
+UPI0002840E7F,UPI0002840E7F.csv,XIGLDSCKELLKPDRKSGALRCVRAF,mutated_sequence,1.0,26.0,UPI0002840E7F.a2m,UPI0002840E7F.npy,gnomAD
+UPI000020ADBC,UPI000020ADBC.csv,MNLGDGLKLETELLDGKTKLILSPYEHKSKISVKMGNKAKIAKCPLRTKTGHILKSTQDTCIGSEKLLQKKPVGSETSQAKGEKNGMTFSSTKDLCKQCIDKDCLHIQKEISPATPNMQKTRNTVNTSLVGKQKPHKKHITAENMKSSLVCLTQDQLQQILMTVNQGNRSLSLTENGKEAKSQYSLYLNSISNQPKDENIMGLFKKTEMVSSVPAENKSVLNEHQETSKQCEQKIAIENEWKPADIFSTLGERECDRSSLEAKKAQWRKELDEQVALKKKEKEVSEKWNDPWKKSESDKIIWEKHQILDQSRETVLLEHPFSAVKQELQRKWIEELNKQIEDDRQRKIEEKIIYSKGEEHDRWAMHFDSLKSYPGSQSQLFSQSTHKQPEYFCVSPDTQELADVSSVCTPTTGSQVEPSEEEHIAKPIKDVVMANSKKTNFLRSMTALLDPAQIEERDRRRQKQLEHQKAITAQVEEKRRKKQLEEEQRKKEEQEEELRLAQEREEMQKQYEEDILKQKQKEEIMTLKTNELFQTMQRAQELAQRLKQEQRIRELAQKGHDTSRLIKNLGVDTIQMEYNASNISNSRHDSDEISGKMNTYMNSTTSKKDTGVQTDDLNIGIFTNAESHCGSLMERDITNCSSPEISAELIGQFSTKKNKQELTQDKGASLEKENNRCNDQCNQFTRIEKQTKHMKKYPKRPDWNINKPPKRYIPASEKYPKQLQKQREEKKVRRQMELLHLVEKNNPGHLSQNRGISPEIFHSSHQETESKLRWHLVKKEEEPLNIHSFSKERSPSSPVPVVKNRTQQTQNTLHLPLKNSSYERENLISGSNQTELSSGISESSHFIPYVRTNEIYYLDPDAPLSGPSTQDPQYQNSQDCGQKRQLFDSDCVRDPLLNPNMVKNRDRQQAILKGLSELRQGLLQKQKELESSLLPLAENQEESFGSSF,mutated_sequence,1.0,948.0,UPI000020ADBC.a2m,UPI000020ADBC.npy,gnomAD
+UPI0000457614,UPI0000457614.csv,MEVLESGEQGVLQWDRKLSELSEPGDGEALMYHTHFSELLDEFSQNVLGQLLNDPFLSEKSVSMEVEPSPTSPAPLIQAEHSYSLCEEPRAQSPFTHITTSDSFNDDEVESEKWYLSTDFPSTSIKTEPVTDEPPPGLVPSVTLTITAISTPLEKEEPPLEMNTGVDSSCQTIIPKIKLEPHEVDQFLNFSPKEAPVDHLHLPPTPPSSHGSDSEGSLSPNPRLHPFSLPQTHSPSRAAPRAPSALSSSPLLTAPHKLQGSGPLVLTEEEKRTLIAEGYPIPTKLPLSKSEEKALKKIRRKIKNKISAQESRRKKKEYMDSLEKKVESCSTENLELRKKVEVLENTNRTLLQQLQKLQTLVMGKVSRTCKLAGTQTGTCLMVVVLCFAVAFGSFFQGYGPYPSATKMALPSQHSLQEPYTASVVRSRNLLIYEEHSPPEESSSPGSAGELGGWDRGSSLLRVSGLESRPDVDLPHFIISNETSLEKSVLLELQQHLVSAKLEGNETLKVVELDRRVNTTF,mutated_sequence,1.0,520.0,UPI0000457614.a2m,UPI0000457614.npy,gnomAD
+UPI000013CF5D,UPI000013CF5D.csv,MLPAPAAPRWPPLLLLLLLLLPLARGAPARPAAGGQASELVVPTRLPGSAGELALHLSAFGKGFVLRLAPDDSFLAPEFKIERLGGSGRATGGERGLRGCFFSGTVNGEPESLAAVSLCRGLSGSFLLDGEEFTIQPQGAGGSLAQPHRLQRWGPAGARPLPRGPEWEVETGEGQRQERGDHQEDSEEESQEEEAEGASEPPPPLGATSRTKRFVSEARFVETLLVADASMAAFYGADLQNHILTLMSVAARIYKHPSIKNSINLMVVKVLIVEDEKWGPEVSDNGGLTLRNFCNWQRRFNQPSDRHPEHYDTAILLTRQNFCGQEGLCDTLGVADIGTICDPNKSCSVIEDEGLQAAHTLAHELGHVLSMPHDDSKPCTRLFGPMGKHHVMAPLFVHLNQTLPWSPCSAMYLTELLDGGHGDCLLDAPAAALPLPTGLPGRMALYQLDQQCRQIFGPDFRHCPNTSAQDVCAQLWCHTDGAEPLCHTKNGSLPWADGTPCGPGHLCSEGSCLPEEEVERPKPVADGGWAPWGPWGECSRTCGGGVQFSHRECKDPEPQNGGRYCLGRRAKYQSCHTEECPPDGKSFREQQCEKYNAYNYTDMDGNLLQWVPKYAGVSPRDRCKLFCRARGRSEFKVFEAKVIDGTLCGPETLAICVRGQCVKAGCDHVVDSPRKLDKCGVCGGKGNSCRKVSGSLTPTNYGYNDIVTIPAGATNIDVKQRSHPGVQNDGNYLALKTADGQYLLNGNLAISAIEQDILVKGTILKYSGSIATLERLQSFRPLPEPLTVQLLTVPGEVFPPKVKYTFFVPNDVDFSMQSSKERATTNIIQPLLHAQWVLGDWSECSSTCGAGWQRRTVECRDPSGQASATCNKALKPEDAKPCESQLCPL,mutated_sequence,1.0,889.0,UPI000013CF5D.a2m,UPI000013CF5D.npy,gnomAD
+UPI0000070A3C,UPI0000070A3C.csv,MERGMHLGAAAAGEDDLFLHKSLSASTSKRLEAAFRSTPPGMDLSLAPPPRERPASSSSSPLGCFEPADPEGAGLLLPPPGGGGGGSAGSGGGGGGGVGVPGLLVGSAGVGGDPSLSSLPAGAALCLKYGESASRGSVAESSGGEQSPDDDSDGRCELVLRAGVADPRASPGAGGGGAKAAEGCSNAHLHGGASVPPGGLGGGGGGGSSSGSSGGGGGSGSGSGGSSSSSSSSSKKSKEQKALRLNINARERRRMHDLNDALDELRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALEEMRRLVAYLNQGQAISAASLPSSAAAAAAAAALHPALGAYEQAAGYPFSAGLPPAASCPEKCALFNSVSSSLCKQCTEKP,mutated_sequence,1.0,381.0,UPI0000070A3C.a2m,UPI0000070A3C.npy,gnomAD
+UPI0000198BEF,UPI0000198BEF.csv,MAVETRAELVGKRFLCVAVGDEARSERWESGRGWRSWRAGVIRAVSHRDSRNPDLAVYVEFDDLEWDKREWVKVYEDFSTFLVEYHLIWAKRNDPSQTQGSKSKQIQWPALTFKPLVERNIPSSVTAVEFLVDKQLDFLTEDSAFQPYQDDIDSLNPVLRDNPQLHEEVKVWVKEQKVQEIFMQGPYSLNGYRVRVYRQDSATQWFTGIITHHDLFTRTMIVMNDQVLEPQNVDPSMVQMTFLDDVVHSLLKGENIGITSRRRSRANQNVNAVHSHYTRAQANSPRPAMNSQAAVPKQNTHQQQQQRSIRPNKRKGSDSSIPDEEKMKEEKYDYISRGENPKGKNKHLMNKRRKPEEDEKKLNMKRLRTDNVSDFSESSDSENSNKRIIDNSSEQKPENELKNKNTSKINGEEGKPHNNEKAGEETLKNSQPPWDQIQEDKKHEEAEKRKSVDTQLQEDMIIHSSEQSTVSDHNSNDLLPQECNMDKTHTMELLPKEKFVSRPPTPKCVIDITNDTNLEKVAQENSSTFGLQTLQKMDPNVSDSKHSIANAKFLETAKKDSDQSWVSDVVKVDLTQSSVTNASSGNDHLNMEKEKYVSYISPLSAVSVMEDKLHKRSPPPETIKSKLNTSVDTHKIKSSPSPEVVKPKITHSPDSVKSKATYVNSQATGERRLANKIEHELSRCSFHPIPTRSSTLETTKSPLIIDKNEHFTVYRDPALIGSETGANHISPFLSQHPFPLHSSSHRTCLNPGTHHPALTPAPHLLAGSSSQTPLPTINTHPLTSGPHHAVHHPHLLPTVLPGVPTASLLGGHPRLESAHASSLSHLALAHQQQQQLLQHQSPHLLGQAHPSASYNQLGLYPIIWQYPNGTHAYSGLGLPSSKWVHPENAVNAEASLRRNSPSPWLHQPTPVTSADGIGLLSHIPVRPSSAEPHRPLKITAHSSPPLTKTLVDHHKEELERKAFMEPLRSVASTSAKNDLDLNRSQTGKDCHLHRHFVDPVLNQLQRPPQETGERLNKYKEEHRRILQESIDVAPFTTKIKGLEGERENYSRVASSSSSPKSHIIKQDMDVERSVSDLYKMKHSVPQSLPQSNYFTTLSNSVVNEPPRSYPSKEVSNIYGDKQSNALAAAAANPQTLTSFITSLSKPPPLIKHQPESEGLVGKIPEHLPHQIASHSVTTFRNDCRSPTHLTVSSTNTLRSMPALHRAPVFHPPIHHSLERKEGSYSSLSPPTLTPVMPVNAGGKVQESQKPPTLIPEPKDSQANFKSSSEQSLTEMWRPNNNLSKEKTEWHVEKSSGKLQAAMASVIVRPSSSTKTDSMPAMQLASKDRVSERSSAGAHKTDCLKLAEAGETGRIILPNVNSDSVHTKSEKNFQAVSQGSVPSSVMSAVNTMCNTKTDVITSAADTTSVSSWGGSEVISSLSNTILASTSSECVSSKSVSQPVAQKQECKVSTTAPVTLASSKTGSVVQPSSGFSGTTDFIHLKKHKAALAAAQYKSSNASETEPNAIKNQTLSASLPLDSTVICSTINKANSVGNGQASQTSQPNYHTKLKKAWLTRHSEEDKNTNKMENSGNSVSEIIKPCSVNLIASTSSDIQNSVDSKIIVDKYVKDDKVNRRKAKRTYESGSESGDSDESESKSEQRTKRQPKPTYKKKQNDLQKRKGEIEEDLKPNGVLSRSAKERSKLKLQSNSNTGIPRSVLKDWRKVKKLKQTGESFLQDDSCCEIGPNLQKCRECRLIRSKKGEEPAHSPVFCRFYYFRRLSFSKNGVVRIDGFSSPDQYDDEAMSLWTHENFEDDELDIETSKYILDIIGDKFCQLVTSEKTALSWVKKDAKIAWKRAVRGVREMCDACEATLFNIHWVCQKCGFVVCLDCYKAKERKSSRDKELYAWMKCVKGQPHDHKHLMPTQIIPGSVLTDLLDAMHTLREKYGIKSHCHCTNKQNLQVGNFPTMNGVSQVLQNVLNHSNKISLCMPESQQQNTPPKSEKNGGSSPESDVGTDNKLTPPESQSPLHWLADLAEQKAREEKKENKELTLENQIKEEREQDNSESPNGRTSPLVSQNNEQGSTLRDLLTTTAGKLRVGSTDAGIAFAPVYSMGAPSSKSGRTMPNILDDIIASVVENKIPPSKTSKINVKPELKEEPEESIISAVDENNKLYSDIPHSWICEKHILWLKDYKNSSNWKLFKECWKQGQPAVVSGVHKKMNISLWKAESISLDFGDHQADLLNCKDSIISNANVKEFWDGFEEVSKRQKNKSGETVVLKLKDWPSGEDFKTMMPARYEDLLKSLPLPEYCNPEGKFNLASHLPGFFVRPDLGPRLCSAYGVVAAKDHDIGTTNLHIEVSDVVNILVYVGIAKGNGILSKAGILKKFEEEDLDDILRKRLKDSSEIPGALWHIYAGKDVDKIREFLQKISKEQGLEVLPEHDPIRDQSWYVNKKLRQRLLEEYGVRTCTLIQFLGDAIVLPAGALHQVQNFHSCIQVTEDFVSPEHLVESFHLTQELRLLKEEINYDDKLQVKNILYHAVKEMVRALKIHEDEVEDMEEN,mutated_sequence,1.0,2540.0,UPI0000198BEF.a2m,UPI0000198BEF.npy,gnomAD
+UPI0000136C3F,UPI0000136C3F.csv,MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQLFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT,mutated_sequence,1.0,339.0,UPI0000136C3F.a2m,UPI0000136C3F.npy,gnomAD
+UPI00000719CD,UPI00000719CD.csv,MRVVTIVILLCFCKAAELRKASPGSVRSRVNHGRAGGGRRGSNPVKRYAPGLPCDVYTYLHEKYLDCQERKLVYVLPGWPQDLLHMLLARNKIRTLKNNMFSKFKKLKSLDLQQNEISKIESEAFFGLNKLTTLLLQHNQIKVLTEEVFIYTPLLSYLRLYDNPWHCTCEIETLISMLQIPRNRNLGNYAKCESPQEQKNKKLRQIKSEQLCNEEEKEQLDPKPQVSGRPPVIKPEVDSTFCHNYVFPIQTLDCKRKELKKVPNNIPPDIVKLDLSYNKINQLRPKEFEDVHELKKLNLSSNGIEFIDPAAFLGLTHLEELDLSNNSLQNFDYGVLEDLYFLKLLWLRDNPWRCDYNIHYLYYWLKHHYNVHFNGLECKTPEEYKGWSVGKYIRSYYEECPKDKLPAYPESFDQDTEDDEWEKKHRDHTAKKQSVIITIVG,mutated_sequence,1.0,441.0,UPI00000719CD.a2m,UPI00000719CD.npy,gnomAD
+UPI00000725EE,UPI00000725EE.csv,MDLPDESQWDETTCGLAVCQHPQCWATIRRIERGHPRILGSSCKTPLDAEDKLPVLTVVDILDSGFAAHHLPECTFTKAHSLLSQSSKFYSKFHGRPPKGLPDKSLINCTNRLPKFPVLNLNETQLPCPEDVRNMVVLWIPEETEIHVSQHGKKKRKNSAVKSKSFLGLSGNQSAGTRVGTPGMIVPPPTPVQLSEQFSSDFLPLWAQSEALPQDLLKELLPGGKQTMLCPEMKIKLAMMKKNLPLEKNRPDSVISSKMFLSIHRLTLERPALRYPERLKKLHNLKTEGYRKQQQRQQQQQQQQKKVKTPIKKQEAKKKAKSDPGIQSTSHKHPVTTVHDRLYGYRTLPGQNSDMKQQQQMEKGTTSKQDSTERPKMNYYDHADFHHSVKSPELYETEPTNKDISAPVDAVPEAQAARQKKISFNFSEIMASTGWNSELKLLRILQDTDDEDEEDQSSGAE,mutated_sequence,1.0,461.0,UPI00000725EE.a2m,UPI00000725EE.npy,gnomAD
+UPI0000D617B1,UPI0000D617B1.csv,MGPPSACPHRECIPWQGLLLTASLLTFWNAPTTAWLFIASAPFEVAEGENVHLSVVYLPENLYSYGWYKGKTVEPNQLIAAYVIDTHVRTPGPAYSGRETISPSGDLHFQNVTLEDTGYYTLQVTYRNSQIEQASHHLRVYESVAQPSIQASSTTVTEKGSVVLTCHTNNTGTSFQWIFNNQRLQVTKRMKLSWFNHMLTIDPIRQEDAGEYQCEVSNPVSSNRSDPLKLTVKSDDNTLGILIGVLVGSLLVAALVCFLLLRKTGRASDQSDFREQQPPASTPGHGPSDSSIS,mutated_sequence,1.0,293.0,UPI0000D617B1.a2m,UPI0000D617B1.npy,gnomAD
+UPI000003166A,UPI000003166A.csv,MRAHPGGGRCCPEQEEGESAAGGSGAGGDSAIEQGGQGSALAPSPVSGVRREGARGGGRGRGRWKQAGRGGGVCGRGRGRGRGRGRGRGRGRGRGRPPSGGSGLGGDGGGCGGGGSGGGGAPRREPVPFPSGSAGPGPRGPRATESGKRMDCPALPPGWKKEEVIRKSGLSAGKSDVYYFSPSGKKFRSKPQLARYLGNTVDLSSFDFRTGKMMPSKLQKNKQRLRNDPLNQNKGKPDLNTTLPIRQTASIFKQPVTKVTNHPSNKVKSDPQRMNEQPRQLFWEKRLQGLSASDVTEQIIKTMELPKGLQGVGPGSNDETLLSAVASALHTSSAPITGQVSAAVEKNPAVWLNTSQPLCKAFIVTDEDIRKQEERVQQVRKKLEEALMADILSRAADTEEMDIEMDSGDEA,mutated_sequence,1.0,411.0,UPI000003166A.a2m,UPI000003166A.npy,gnomAD
+UPI000023271E,UPI000023271E.csv,MMQHASPAPALTMMATQNVPPPPYQDSPQMTATAQPPSKAQAVHISAPSAAASTPVPSAPIDPQAQLEADKRAVYRHPLFPLLTLLFEKCEQATQGSECITSASFDVDIENFVHQQEQEHKPFFSDDPELDNLMVKAIQVLRIHLLELEKVNELCKDFCNRYITCLKTKMHSDNLLRNDLGGPYSPNQPSINLHSQDLLQNSPNSMSGVSNNPQGIVVPASALQQGNIAMTTVNSQVVSGGALYQPVTMVTSQGQVVTQAIPQGAIQIQNTQVNLDLTSLLDNEDKKSKNKRGVLPKHATNIMRSWLFQHLMHPYPTEDEKRQIAAQTNLTLLQVNNWFINARRRILQPMLDASNPDPAPKAKKIKSQHRPTQRFWPNSIAAGVLQQQGGAPGTNPDGSINLDNLQSLSSDSATMAMQQAMMAAHDDSLDGTEEEDEDEMEEEEEEELEEEVDELQTTNVSDLGLEHSDSLE,mutated_sequence,1.0,472.0,UPI000023271E.a2m,UPI000023271E.npy,gnomAD
+UPI0000129CA7,UPI0000129CA7.csv,MLTNLRIFAMSHQTIPSVYINNICCYKIRASLKRLKPHVPLGRNCSSLPGLIGNDIKSLHSIINPPIAKIRNIGIMAHIDAGKTTTTERILYYSGYTRSLGDVDDGDTVTDFMAQERERGITIQSAAVTFDWKGYRVNLIDTPGHVDFTLEVERCLRVLDGAVAVFDASAGVEAQTLTVWRQADKHNIPRICFLNKMDKTGASFKYAVESIREKLKAKPLLLQLPIGEAKTFKGVVDVVMKEKLLWNCNSNDGKDFERKPLLEMNDPELLKETTEARNALIEQVADLDDEFADLVLEEFSENFDLLPAEKLQTAIHRVTLAQTAVPVLCGSALKNKGIQPLLDAVTMYLPSPEERNYEFLQWYKDDLCALAFKVLHDKQRGPLVFMRIYSGTIKPQLAIHNINGNCTERISRLLLPFADQHVEIPSLTAGNIALTVGLKHTATGDTIVSSKSSALAAARRAEREGEKKHRQNNEAERLLLAGVEIPEPVFFCTIEPPSLSKQPDLEHALKCLQREDPSLKVRLDPDSGQTVLCGMGELHIEIIHDRIKREYGLETYLGPLQVAYRETILNSVRATDTLDRTLGDKRHLVTVEVEARPIETSSVMPVIEFEYAESINEGLLKVSQEAIENGIHSACLQGPLLGSPIQDVAITLHSLTIHPGTSTTMISACVSRCVQKALKKADKQVLEPLMNLEVTVARDYLSPVLADLAQRRGNIQEIQTRQDNKVVIGFVPLAEIMGYSTVLRTLTSGSATFALELSTYQAMNPQDQNTLLNRRSGLT,mutated_sequence,1.0,779.0,UPI0000129CA7.a2m,UPI0000129CA7.npy,gnomAD
+UPI0000456F25,UPI0000456F25.csv,MVKLAKAGKNQGDPKKMAPPPKEVEEDSEDEEMSEDEEDDSSGEEVVIPQKKGKKAAATSAKKVVVSPTKKVAVATPAKKAAVTPGKKAAATPAKKTVTPAKAVTTPGKKGATPGKALVATPGKKGAAIPAKGAKNGKNAKKEDSDEEEDDDSEEDEEDDEDEDEDEDEIEPAAMKAAAAAPASEDEDDEDDEDDEDDDDDEEDDSEEEAMETTPAKGKKAAKVVPVKAKNVAEDEDEEEDDEDEDDDDDEDDEDDDDEDDEEEEEEEEEEPVKEAPGKRKKEMAKQKAAPEAKKQKVEGTEPTTAFNLFVGNLNFNKSAPELKTGISDVFAKNDLAVVDVRIGMTRKFGYVDFESAEDLEKALELTGLKVFGNEIKLEKPKGKDSKKERDARTLLAKNLPYKVTQDELKEVFEDAAEIRLVSKDGKSKGIAYIEFKTEADAEKTFEEKQGTEIDGRSISLYYTGEKGQNQDYRGGKNSTWSGESKTLVLSNLSYSATEETLQEVFEKATFIKVPQNQNGKSKGYAFIEFASFEDAKEALNSCNKREIEGRAIRLELQGPRGSPNARSQPSKTLFVKGLSEDTTEETLKESFDGSVRARIVTDRETGSSKGFGFVDFNSEEDAKAAKEAMEDGEIDGNKVTLDWAKPKGEGGFGGRGGGRGGFGGRGGGRGGRGGFGGRGRGGFGGRGGFRGGRGGGGDHKPQGKKTKFE,mutated_sequence,1.0,710.0,UPI0000456F25.a2m,UPI0000456F25.npy,gnomAD
+UPI00001C1C97,UPI00001C1C97.csv,MIRHAGAPARGDPTGPVPVVGKGEEEEEEDGMRLCLPANPKNCLPHRRGISILEKLIKTCPVWLQLSLGQAEVARILHRVVAGMFLVRRDSSSKQLVLCVHFPSLNESSAEVLEYTIKEEKSILYLEGSALVFEDIFRLIAFYCVSRDLLPFTLRLPQAILEASSFTDLETIANLGLGFWDSSLNPPQERGKPAEPPRDRAPGFPLVSSLRPTAHDANCACEIELSVGNDRLWFVNPIFIEDCSSALPTDQPPLGNCPARPLPPTSDATSPTSRWAPRRPPPPPPVLPLQPCSPAQPPVLPALAPAPACPLPTSPPVPAPHVTPHAPGPPDHPNQPPMMTCERLPCPTAGLGPLREEAMKPGAASSPLQQVPAPPLPAKKNLPTAPPRRRVSERVSLEDQSPGMAAEGDQLSLPPQGTSDGPEDTPRESTEQGQDTEVKASDPHSMPELPRTAKQPPVPPPRKKRISRQLASTLPAPLENAELCTQAMALETPTPGPPREGQSPASQAGTQHPPAQATAHSQSSPEFKGSLASLSDSLGVSVMATDQDSYSTSSTEEELEQFSSPSVKKKPSMILGKARHRLSFASFSSMFHAFLSNNRKLYKKVVELAQDKGSYFGSLVQDYKVYSLEMMARQTSSTEMLQEIRTMMTQLKSYLLQSTELKALVDPALHSEEELEAIVESALYKCVLKPLKEAINSCLHQIHSKDGSLQQLKENQLVILATTTTDLGVTTSVPEVPMMEKILQKFTSMHKAYSPEKKISILLKTCKLIYDSMALGNPGKPYGADDFLPVLMYVLARSNLTEMLLNVEYMMELMDPALQLGEGSYYLTTTYGALEHIKSYDKITVTRQLSVEVQDSIHRWERRRTLNKARASRSSVQDFICVSYLEPEQQARTLASRADTQAQALCAQCAEKFAVERPQAHRLFVLVDGRCFQLADDALPHCIKGYLLRSEPKRDFHFVYRPLDGGGGGGGGSPPCLVVREPNFL,mutated_sequence,1.0,985.0,UPI00001C1C97.a2m,UPI00001C1C97.npy,gnomAD
+UPI0000001C9F,UPI0000001C9F.csv,MKTLMRHGLAVCLALTTMCTSLLLVYSSLGGQKERPPQQQQQQQQQQQQASATGSSQPAAESSTQQRPGVPAGPRPLDGYLGVADHKPLKMHCRDCALVTSSGHLLHSRQGSQIDQTECVIRMNDAPTRGYGRDVGNRTSLRVIAHSSIQRILRNRHDLLNVSQGTVFIFWGPSSYMRRDGKGQVYNNLHLLSQVLPRLKAFMITRHKMLQFDELFKQETGKDRKISNTWLSTGWFTMTIALELCDRINVYGMVPPDFCRDPNHPSVPYHYYEPFGPDECTMYLSHERGRKGSHHRFITEKRVFKNWARTFNIHFFQPDWKPESLAINHPENKPVF,mutated_sequence,1.0,336.0,UPI0000001C9F.a2m,UPI0000001C9F.npy,gnomAD
+UPI00000467CA,UPI00000467CA.csv,MNIRNAQPDDLMNMQHCNLLCLPENYQMKYYLYHGLSWPQLSYIAEDEDGKIVGYVLAKMEEEPDDVPHGHITSLAVKRSHRRLGLAQKLMDQASRAMIENFNAKYVSLHVRKSNRPALHLYSNTLNFQISEVEPKYYADGEDAYAMKRDLSQMADELRRQMDLKKGGYVVLGSRENQETQGSTLSDSEEACQQKNPATEESGSDSKEPKESVESTNVQDSSESSDSTS,mutated_sequence,1.0,229.0,UPI00000467CA.a2m,UPI00000467CA.npy,gnomAD
+UPI0000EBB7D6,UPI0000EBB7D6.csv,MDLGTAEGTRCTDPPAGKPAMAPKRKGGLKLNAICAKLSRQVVVEKRADAGSHTEGSPSQPRDQERSGPESGAARAPRSEEDKRRAVIEKWVNGEYSEEPAPTPVLGRIAREGLELPPEGVYMVQPQGCSDEEDHAEEPSKDGGALEEKDSDGAASKEDSGPSTRQASGEASSLRDYAASTMTEFLGMFGYDDQNTRDELARKISFEKLHAGSTPEAATSSMLPTSEDTLSKRARFSKYEEYIRKLKAGEQLSWPAPSTKTEERVGKEVVGTLPGLRLPSSTAHLETKATILPLPSHSSVQMQNLVARASKYDFFIQKLKTGENLRPQNGSTYKKPSKYDLENVKYLHLFKPGEGSPDMGGAIAFKTGKVGRPSKYDVRGIQKPGPAKVPPTPSLAPAPLASVPSAPSAPGPGPEPPASLSFNTPEYLKSTFSKTDSITTGTVSTVKNGLPTDKPAVTEDVNIYQKYIARFSGSQHCGHIHCAYQYREHYHCLDPECNYQRFTSKQDVIRHYNMHKKRDNSLQHGFMRFSPLDDCSVYYHGCHLNGKSTHYHCMQVGCNKVYTSTSDVMTHENFHKKNTQLINDGFQRFRATEDCGTADCQFYGQKTTHFHCRRPGCTFTFKNKCDIEKHKSYHIKDDAYAKDGFKKFYKYEECKYEGCVYSKATNHFHCIRAGCGFTFTSTSQMTSHKRKHERRHIRSSGALGLPPSLLGAKDTEHEESSNDDLVDFSALSSKNSSLSASPTSQQSSASLAAATAATEAGPSATKPPNSKISGLLPQGLPGSIPLALALSNSGLPTPTPYFPILAGRGSTSLPVGTPSLLGAVSSGSAASATPDTPTLVASGAGDSAPVAAASVPAPPASIMERISASKGLISPMMARLAAAALKPSATFDPGSGQQVTPARFPPAQVKPEPGESTGAPGPHEASQDRSLDLTVKEPSNESNGHAVPANSSLLSSLMNKMSQGNPGLGSLLNIKAEAEGSPAAEPSPFLGKAVKALVQEKLAEPWKVYLRRFGTKDFCDGQCDFLHKAHFHCVVEECGALFSTLDGAIKHANFHFRTEGGAAKGNTEAAFPASAAETKPPMAPSSPPVPPVTTATVSSLEGPAPSPASVPSTPTLLAWKQLASTIPQMPQIPASVPHLPASPLATTSLENAKPQVKPGFLQFQENDPCLATDCKYANKFHFHCLFGNCKYVCKTSGKAESHCLDHINPNNNLVNVRDQFAYYSLQCLCPNQHCEFRMRGHYHCLRTGCYFVTNITTKLPWHIKKHEKAERRAANGFKYFTKREECGRLGCKYNQVNSHFHCIREGCQFSFLLKHQMTSHARKHMRRMLGKNFDRVPPSQGPPGLMDAETDECMDYTGCSPGAMSSESSTMDRSCSSTPVGNESTAAGNTISMPTASGAKKRFWIIEDMSPFGKRRKTASSRKMLDEGMMLEGFRRFDLYEDCKDAACQFSLKVTHYHCTRENCGYKFCGRTHMYKHAQHHDRVDNLVLDDFKRFKASLSCHFADCPFSGTSTHFHCLRCRFRCTDSTKVTAHRKHHGKQDVISAAGFCQFSSSADCAVPDCKYKLKCSHFHCTFPGCRHTVVGMSQMDSHKRKHEKQERGEPAAEGPAPGPPISLDGSLSLGAEPGSLLFLQSAAAGLGLALGDAGDPGPPDAAAPGPREGAAAAAAAAGESSQEDEEEELELPEEEAEDDEDEDDDEDDDDEDDDEDDDDEDLRTDSEESLPEAAAEAAGAGARTPALAALAALGAPGPAPTAASSP,mutated_sequence,1.0,1759.0,UPI0000EBB7D6.a2m,UPI0000EBB7D6.npy,gnomAD
+UPI000016783D,UPI000016783D.csv,MTTKDYPSLWGFGTTKTFKIPIEHLDFKYIEKCSDVKHLEKILCVLRSGEEGYYPELTEFCEKHLQALAPESRALRKDKPAATAASFTAEEWEKIDGDIKSWVSEIKKEEDKMHFHETETFPAMKDNLPPVRGSNSCLHVGKEKYSKRPTKKKTPRDYAEWDKFDVEKECLKIDEDYKEKTVIDKSHLSKIETRIDTAGLTEKEKDFLATREKEKGNEAFNSGDYEEAVMYYTRSISALPTVVAYNNRAQAEIKLQNWNSAFQDCEKVLELEPGNVKALLRRATTYKHQNKLREATEDLSKVLDVEPDNDLAKKTLSEVERDLKNSEAASETQTKGKRMVIQEIENSEDEEGKSGRKHEDGGGDKKPAEPAGAARAAQPCVMGNIQKKLTGKAEGGKRPARGAPQRGQTPEAGADKRSPRRASAAAAAGGGATGHPGGGQGAENPAGLKSQGNELFRSGQFAEAAGKYSAAIALLEPAGSEIADDLSILYSNRAACYLKEGNCSGCIQDCNRALELHPFSMKPLLRRAMAYETLEQYGKAYVDYKTVLQIDCGLQLANDSVNRLSRILMELDGPNWREKLSPIPAVPASVPLQAWHPAKEMISKQAGDSSSHRQQGITDEKTFKALKEEGNQCVNDKNYKDALSKYSECLKINNKECAIYTNRALCYLKLCQFEEAKQDCDQALQLADGNVKAFYRRALAHKGLKNYQKSLIDLNKVILLDPSIIEAKMELEEVTRLLNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGRPAGEVSMGCLASEKGGKSSRSPEDPEKLPIAKPNNAYEFGQIINALSTRKDKEACAHLLAITAPKDLPMFLSNKLEGDTFLLLIQSLKNNLIEKDPSLVYQHLLYLSKAERFKMMLTLISKGQKELIEQLFEDLSDTPNNHFTLEDIQALKRQYEL,mutated_sequence,1.0,926.0,UPI000016783D.a2m,UPI000016783D.npy,gnomAD
+UPI00000437FE,UPI00000437FE.csv,MKALDEPPYLTVGTDVSAKYRGAFCEAKIKTAKRLVKVKVTFRHDSSTVEVQDDHIKGPLKVGAIVEVKNLDGAYQEAVINKLTDASWYTVVFDDGDEKTLRRSSLCLKGERHFAESETLDQLPLTNPEHFGTPVIGKKTNRGRRSNHIPEEESSSSSSDEDEDDRKQIDELLGKVVCVDYISLDKKKALWFPALVVCPDCSDEIAVKKDNILVRSFKDGKFTSVPRKDVHEITSDTAPKPDAVLKQAFEQALEFHKSRTIPANWKTELKEDSSSSEAEEEEEEEDDEKEKEDNSSEEEEEIEPFPEERENFLQQLYKFMEDRGTPINKRPVLGYRNLNLFKLFRLVHKLGGFDNIESGAVWKQVYQDLGIPVLNSAAGYNVKCAYKKYLYGFEEYCRSANIEFQMALPEKVVNKQCKECENVKEIKVKEENETEIKEIKMEEERNIIPREEKPIEDEIERKENIKPSLGSKKNLLESIPTHSDQEKEVNIKKPEDNENLDDKDDDTTRVDESLNIKVEAEEEKAKSGDETNKEEDEDDEEAEEEEEEEEEEEDEDDDDNNEEEEFECYPPGMKVQVRYGRGKNQKMYEASIKDSDVEGGEVLYLVHYCGWNVRYDEWIKADKIVRPADKNVPKIKHRKKIKNKLDKEKDKDEKYSPKNCKLRRLSKPPFQTNPSPEMVSKLDLTDAKNSDTAHIKSIEITSILNGLQASESSAEDSEQEDERGAQDMDNNGKEESKIDHLTNNRNDLISKEEQNSSSLLEENKVHADLVISKPVSKSPERLRKDIEVLSEDTDYEEDEVTKKRKDVKKDTTDKSSKPQIKRGKRRYCNTEECLKTGSPGKKEEKAKNKESLCMENSSNSSSDEDEEETKAKMTPTKKYNGLEEKRKSLRTTGFYSGFSEVAEKRIKLLNNSDERLQNSRAKDRKDVWSSIQGQWPKKTLKELFSDSDTEAAASPPHPAPEEGVAEESLQTVAEEESCSPSVELEKPPPVNVDSKPIEEKTVEVNDRKAEFPSSGSNSVLNTPPTTPESPSSVTVTEGSRQQSSVTVSEPLAPNQEEVRSIKSETDSTIEVDSVAGELQDLQSEGNSSPAGFDASVSSSSSNQPEPEHPEKACTGQKRVKDAQGGGSSSKKQKRSHKATVVNNKKKGKGTNSSDSEELSAGESITKSQPVKSVSTGMKSHSTKSPARTQSPGKCGKNGDKDPDLKEPSNRLPKVYKWSFQMSDLENMTSAERITILQEKLQEIRKHYLSLKSEVASIDRRRKRLKKKERESAATSSSSSSPSSSSITAAVMLTLAEPSMSSASQNGMSVECR,mutated_sequence,1.0,1312.0,UPI00000437FE.a2m,UPI00000437FE.npy,gnomAD
+UPI00001B07C3,UPI00001B07C3.csv,MCAARLAAAAAAAQSVYAFSARPLAGGEPVSLGSLRGKVLLIENVASLUGTTVRDYTQMNELQRRLGPRGLVVLGFPCNQFGHQENAKNEEILNSLKYVRPGGGFEPNFMLFEKCEVNGAGAHPLFAFLREALPAPSDDATALMTDPKLITWSPVCRNDVAWNFEKFLVGPDGVPLRRYSRRFQTIDIEPDIEALLSQGPSCA,mutated_sequence,1.0,203.0,UPI00001B07C3.a2m,UPI00001B07C3.npy,gnomAD
+UPI000013C3DC,UPI000013C3DC.csv,MLLDAGPQFPAIGVGSFARHHHHSAAAAAAAAAEMQDRELSLAAAQNGFVDSAAAHMGAFKLNPGAHELSPGQSSAFTSQGPGAYPGSAAAAAAAAALGPHAAHVGSYSGPPFNSTRDFLFRSRGFGDSAPGGGQHGLFGPGAGGLHHAHSDAQGHLLFPGLPEQHGPHGSQNVLNGQMRLGLPGEVFGRSEQYRQVASPRTDPYSAAQLHNQYGPMNMNMGMNMAAAAAHHHHHHHHHPGAFFRYMRQQCIKQELICKWIDPEQLSNPKKSCNKTFSTMHELVTHVSVEHVGGPEQSNHVCFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFQCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSPQGSESSPAASSGYESSTPPGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSAVHRGGGSGSGGAGGGSGGGSGSGGGGGGAGGGGGGSSGGGSGTAGGHSGLSSNFNEWYV,mutated_sequence,1.0,532.0,UPI000013C3DC.a2m,UPI000013C3DC.npy,gnomAD
+UPI0000359594,UPI0000359594.csv,MRPGGALLALLASLLLLLLLRLLWCPADAPGRARILVEESREATHGTPAALRTLRSPATAVPRATNSTYLNEKSLQLTEKCKNLQYGIESFSNKTKGYSENDYLQIITDIQSCPWKRQAEEYANFRAKLASCCDAVQNFVVSQNNTPVGTNMSYEVESKKEIPIKKNIFHMFPVSQPFVDYPYNQCAVVGNGGILNKSLCGTEIDKSDFVFRCNLPPTTGDVSKDVGSKTNLVTINPSIITLKYGNLKEKKALFLEDIATYGDAFFLLPAFSFRANTGTSFKVYYTLEESKARQKVLFFHPKYLKDLALFWRTKGVTAYRLSTGLMITSVAVELCKNVKLYGFWPFSKTVEDIPVSHHYYDNKLPKHGFHQMPKEYSQILQLHMKGILKLQFSKCEVA,mutated_sequence,1.0,398.0,UPI0000359594.a2m,UPI0000359594.npy,gnomAD
+UPI0000423DF4,UPI0000423DF4.csv,MAAPVLLRVSVPRWERVARYAVCAAGILLSIYAYHVEREKERDPEHRALCDLGPWVKCSAALASRHDSKRCGGFDPHDVLHHVGRGVPVPGLHSVLCAEGVLHHLHRHVRAELPSSHYQLQTTSLLERGLEAAAATQAGLTPDRLHPNSLKPLSIQFILQQVFIIIIIIIIHNRHFP,mutated_sequence,1.0,177.0,UPI0000423DF4.a2m,UPI0000423DF4.npy,gnomAD
+UPI000012ADC5,UPI000012ADC5.csv,MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLYRTSGAFVYDCSKF,mutated_sequence,1.0,553.0,UPI000012ADC5.a2m,UPI000012ADC5.npy,gnomAD
+UPI00006C181B,UPI00006C181B.csv,MAEPGGAAGRSHPEDGSASEGEKEGNNESHMVSPPEKDDGQKGEEAVGSTEHPEEVTTQAEAAIEEGEVETEGEAAVEGEEEAVSYGDAESEEEYYYTETSSPEGQISAADTTYPYFSPPQELPGEEAYDSVSGEAGLQGFQQEATGPPESRERRVTSPEPSHGVLGPSEQMGQVTSGPAVGRLTGSTEEPQGQVLPMGVQHRFRLSHGSDIESSDLEEFVSQEPVIPPGVPDAHPREGDLPVFQDQIQQPSTEEGAMAERVESEGSDEEAEDEGSQLVVLDPDHPLMVRFQAALKNYLNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCAAANEERKKLAALQTEMENLALHLFYMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIRAMKGEIHRMKKYCRTTQDAHRHVHEQHGTHAGTCTKNTGRAQARARTTRDARGHVHEQHGTRAGTCTNNTGRAQARARTRDARRHVHEHRTHTARA,mutated_sequence,1.0,1030.0,UPI00006C181B.a2m,UPI00006C181B.npy,gnomAD
+UPI000004A07B,UPI000004A07B.csv,MAPVSGSRSPDREASGSGGRRRSSSKSPKPSKSARSPRGRRSRSHSCSRSGDRNGLTHQLGGLSQGSRNQSYRSRSRSRSRERPSAPRGIPFASASSSVYYGSYSRPYGSDKPWPSLLDKEREESLRQKRLSERERIGELGAPEVWGLSPKNPEPDSDEHTPVEDEEPKKSTTSASTSEEEKKKKSSRSKERSKKRRKKKSSKRKHKKYSEDSDSDSDSETDSSDEDNKRRAKKAKKKEKKKKHRSKKYKKKRSKKSRKESSDSSSKESQEEFLENPWKDRTKAEEPSDLIGPEAPKTLTSQDDKPLNYGHALLPGEGAAMAEYVKAGKRIPRRGEIGLTSEEIASFECSGYVMSGSRHRRMEAVRLRKENQIYSADEKRALASFNQEERRKRENKILASFREMVYRKTKGKDDK,mutated_sequence,1.0,415.0,UPI000004A07B.a2m,UPI000004A07B.npy,gnomAD
+UPI0000E5AFF9,UPI0000E5AFF9.csv,MKCVFVTVGTTSFDDLIACVSAPDSLQKIESLGYNRLILQIGRGTVVPEPFSTESFTLDVYRYKDSLKEDIQKADLVISHAGAGSCLETLEKGKPLVVVINEKLMNNHQLELAKQLHKEGHLFYCTCRVLTCPGQAKSIASAPGKCQDSAALTSTAFSGLDFGLLSGYLHKQALVTATHPTCTLLFPSCHAFFPLPLTPTLYKMHKGWKNYCSQKSLNEASMDEYLGSLGLFRKLTAKDASCLFRAISEQLFCSQVHHLEIRKACVSYMRENQQTFESYVEGSFEKYLERLGDPKESAGQLEIRALSLIYNRDFILYRFPGKPPTYVTDNGYEDKILLCYSSSGHYDSVYSKQFQSSAAVCQAVLYEILYKDVFVVDEEELKTAIKLFRSGSKKNRNNAVTGSEDAHTDYKSSNQNRMEEWGACYNAENIPEGYNKGTEETKSPENPSKMPFPYKVLKALDPEIYRNVEFDVWLDSRKELQKSDYMEYAGRQYYLGDKCQVCLESEGRYYNAHIQEVGNENNSVTVFIEELAEKHVVPLANLKPVTQVMSVPAWNAMPSRKGRGYQKMPGGYVPEIVISEMDIKQQKKMFKKIRGKEVYMTMAYGKGDPLLPPRLQHSMHYGHDPPMHYSQTAGNVMSNEHFHPQHPSPRQGRGYGMPRNSSRFINRHNMPGPKVDFYPGPGKRCCQSYDNFSYRSRSFRRSHRQMSCVNKESQYGFTPGNGQMPRGLEETITFYEVEEGDETAYPTLPNHGGPSTMVPATSGYCVGRRGHSSGKQTLNLEEGNGQSENGRYHEEYLYRAEPDYETSGVYSTTASTANLSLQDRKSCSMSPQDTVTSYNYPQKMMGNIAAVAASCANNVPAPVLSNGAAANQAISTTSVSSQNAIQPLFVSPPTHGRPVIASPSYPCHSAIPHAGASLPPPPPPPPPPPPPPPPPPPPPPPPPPPALDVGETSNLQPPPPLPPPPYSCDPSGSDLPQDTKVLQYYFNLGLQCYYHSYWHSMVYVPQMQQQLHVENYPVYTEPPLVDQTVPQCYSEVRREDGIQAEASANDTFPNADSSSVPHGAVYYPVMSDPYGQPPLPGFDSCLPVVPDYSCVPPWHPVGTAYGGSSQIHGAINPGPIGCIAPSPPASHYVPQGM,mutated_sequence,1.0,1137.0,UPI0000E5AFF9.a2m,UPI0000E5AFF9.npy,gnomAD
+UPI000023281A,UPI000023281A.csv,MAHLRSPSGFGDPGKKDQKESEEELEEEEEEEEVEEEEEEVEEEEEEVEEEEEEVVEEELVGEEQELEAPETFSEEYLWKVTDIGDYDDDFPDVRPRLASIVSPSLTSTFVPSQSATSTETPSASPPSSTSSHKSFPKIFQTFRKDMSEMSIDRNIHRNLSPGIPVSVQTEESWLQDLSDKVQSRKKASKEKAEPECLASKLREKWVINPEESKLNILYELEFKEDFITLFEPSLRTLPSIGPPSILAYKEESSNLGINFKDEEEETSPKCEFCGSDLRAFFSNVDVSSEPKGHASCCIAFQNLIDYIYEEQIKTKPPKAELIAIDPHAAHGSEVDRLKAKEKALQRKQEQRMARHFAIISREQTHFSEDDSKRLKTISYQLSVDIPEKQIIDDIVFDFQLRNSNMSIICCDSRIACGKVVRNELLEKHYKHGSKFLTSFPDGTTQIFYPSGNLAIIRVPNKVNGFTCIVQEDMPTNPAILAVLDSSGRSSCYHPNGNVWVYINILGGQYSDQAGNRIRAWNWSNSITSSPFVSFKPVFLALNRYIGVRILEQDKISITFLAMGQQARISVGTKVKLPNPEEIPILRYVSGDDLLLLASLIKIRRLFHKLEGCVNFPSSQVWEKLKQPSYLSSLSLKLIALCHSSGIKQDIMKTIRNIINEEI,mutated_sequence,1.0,663.0,UPI000023281A.a2m,UPI000023281A.npy,gnomAD
+UPI000014153E,UPI000014153E.csv,MTLGSCCCEIMSSESSPAALSEADADIDVVGGGSGGGELPARSGPRAPRDVLPHGHEPPAEEAEADLAEDEEESGGCSDGEPRALASRGAAAAAGSPGPGAAAARGAAGPGPGPPSGGAATRSPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYYREKFPAWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFKRQPLPPPHPHPHPHPELLLRGGAAAAGDPGAFLPGFAAYGAYGYGYGLALPAYGAPPPGPAPHPHPHPHAFAFAAAAAAAPCQLSVPPGRAAAPPPGPPTASVFAGAGSAPAPAPASGSGPGPGPAGLPAFLGAELGCAKAFYAASLSPPAAGTAAGLPTALLRQGLKTDAGGGAGGGGAGAGQRPSFSIDHIMGHGGGGAAPPGAGEGSPGPPFAAAAGPGGQAQVLAMLTAPALAPVAGHIRLSHPGDALLSSGSRFASKVAGLSGCHF,mutated_sequence,1.0,495.0,UPI000014153E.a2m,UPI000014153E.npy,gnomAD
+UPI000013CB8B,UPI000013CB8B.csv,MKSNQERSNECLPPKKREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGNPGGRGHGGGRHGPAGTSVELGLQQGIGLHKALSTGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSPVQYAHLPHTFQFIGSSQYSGTYASFIPSQLIPPTANPVTSAVASAAGATTPSQRSQLEAYSTLLANMGSLSQTPGHKAEQQQQQQQQQQQQHQHQQQQQQQQQQQQQQHLSRAPGLITPGSPPPAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQTMIPHTLTLGPPSQVVMQYADSGSHFVPREATKKAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSADLGLGKAGGKSVPHPYESRHVVVHPSPSDYSSRDPSGVRASVMVLPNSNTPAADLEVQQATHREASPSTLNDKSGLHLGKPGHRSYALSPHTVIQTTHSASEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITYAGSLPQHLVIPGTQPLLIPVGSTDMEASGAAPAIVTSSPQFAAVPHTFVTTALPKSENFNPEALVTQAAYPAMVQAQIHLPVVQSVASPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIEDSHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGDVCISLTLKNLKNGSVKKGQPVDPASVLLKHSKADGLAGSRHRYAEQENGINQGSAQMLSENGELKFPEKMGLPAAPFLTKIEPSKPAATRKRRWSAPESRKLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK,mutated_sequence,1.0,815.0,UPI000013CB8B.a2m,UPI000013CB8B.npy,gnomAD
+UPI00004575C6,UPI00004575C6.csv,MVPPVWTLLLLVGAALFRKEKPPDQKLVVRSSRDNYVLTQCDFEDDAKPLCDWSQVSADDEDWVRASGPSPTGSTGAPGGYPNGEGSYLHMESNSFHRGGVARLLSPDLWEQGPLCVHFAHHMFGLSWGAQLRLLLLSGEEGRRPDVLWKHWNTQRPSWMLTTVTVPAGFTLPTRLMFEGTRGSTAYLDIALDALSIRRGSCNRVCMMQTCSFDIPNDLCDWTWIPTASGAKWTQKKGSSGKPGVGPDGDFSSPGSGCYMLLDPKNARPGQKAVLLSPVSLSSGCLSFSFHYILRGQSPGAALHIYASVLGSIRKHTLFSGQPGPNWQAVSVNYTAVGRIQFAVVGVFGKTPEPAVAVDATSIAPCGEGFPQCDFEDNAHPFCDWVQTSGDGGHWALGHKNGPVHGMGPAGGFPNAGGHYIYLEADEFSQAGQSVRLVSRPFCAPGDICVEFAYHMYGLGEGTMLELLLGSPAGSPPIPLWKRVGSQRPYWQNTSVTVPSGHQQPMQLIFKGIQGSNTASVVAMGFILINPGTCPVKVLPELPPVSPVSSTGPSETTGLTENPTISTKKPTVSIEKPSVTTEKPTVPKEKPTIPTEKPTISTEKPTIPSEKPNMPSEKPTIPSEKPTILTEKPTIPSEKPTIPSEKPTISTEKPTVPTEEPTTPTEETTTSMEEPVIPTEKPSIPTEKPSIPTEKPTISMEETIISTEKPTISPEKPTIPTEKPTIPTEKSTISPEKPTTPTEKPTIPTEKPTISPEKPTTPTEKPTISPEKLTIPTEKPTIPTEKPTIPTEKPTISTEEPTTPTEETTISTEKPSIPMEKPTLPTEETTTSVEETTISTEKLTIPMEKPTISTEKPTIPTEKPTISPEKLTIPTEKLTIPTEKPTIPIEETTISTEKLTIPTEKPTISPEKPTISTEKPTIPTEKPTIPTEETTISTEKLTIPTEKPTISPEKLTIPTEKPTISTEKPTIPTEKLTIPTEKPTIPTEKPTIPTEKLTALRPPHPSPTATGLAALVMSPHAPSTPMTSVILGTTTTSRSSTERCPPNARYESCACPASCKSPRPSCGPLCREGCVCNPGFLFSDNHCIQASSCNCFYNNDYYEPGAEWFSPNCTEHCRCWPGSRVECQISQCGTHTVCQLKNGQYGCHPYAGTATCLVYGDPHYVTFDGRHFGFMGKCTYILAQPCGNSTDPFFRVTAKNEEQGQEGVSCLSKVYVTLPESTVTLLKGRRTLVGGQQVTLPAIPSKGVFLGASGRFVELQTEFGLRVRWDGDQQLYVTVSSTYSGKLCGLCGNYDGNSDNDHLKLDGSPAGDKEELGNSWQTDQDEDQECQKYQVVNSPSCDSSLQSSMSGPGFCGRLVDTHGPFETCLLHVKAASFFDSCMLDMCGFQGLQHLLCTHMSTMTTTCQDAGHAVKPWREPHFCPMACPPNSKYSLCAKPCPDTCHSGFSGMFCSDRCVEACECNPGFVLSGLECIPRSQCGCLHPAGSYFKVGERWYKPGCKELCVCESNNRIRCQPWRCRAQEFCGQQDGIYGCHAQGAATCTASGDPHYLTFDGALHHFMGTCTYVLTRPCWSRSQDSYFVVSATNENRGGILEVSYIKAVHVTVFDLSISLLRGCKVMLNGHRVALPVWLAQGRVTIRLSSNLVLLYTNFGLQVRYDGSHLVEVTVPSSYGGQLCGLCGNYNNNSLDDNLRPDRKLAGDSMQLGAAWKLPESSEPGCFLVGGKPSSCQENSMADAWNKNCAILINPQGPFSQCHQVVPPQSSFASCVHGQCGTKGDTTALCRSLQAYASLCAQAGQAPAWRNRTFCPMRCPPGSSYSPCSSPCPDTCSSINNPRDCPKALPCAESCECQKGHILSGTSCVPLGQCGCTDPAGSYHPVGERWYTENTCTRLCTCSVHNNITCFQSTCKPNQICWALDGLLRRASGVGVCQLPGESHYVSFDGSNHSIPDACTLVLVKVCHPAMALPFFKISAKHEKEEGGTEAFRLHEVYIDIYDAQVTLQKGHRVLINSKQVTLPAISQIPGVSVKSSSIYSIVNIKIGVQVKFDGNHLLEIEIPTTYYGKVCGMCGNFNDEEEDELMMPSDEVANSDSEFVNSWKDKDIDPSCQSLLVDEQQIPAEQQENPSGNCRAADLRRAREKCEAALRAPVWAQCASRIDLTPFLVDCANTLCEFGGLYQALCQALQAFGATCQSQGLKPPLWRNSSFCPLECPAYSSYTNCLPSCSPSCWDLDGRCEGAKVPSACAEGCICQPGYVLSEDKCVPRSQCGCKDAHGGSIPLGKSWVSSGCTEKCVCTGGAIQCGDFRCPSGSHCQLTSDNSNSNCVSDKSEQCSVYGDPRYLTFDGFSYRLQGRMTYVLIKTVDVLPEGVEPLLVEGRNKMDPPRSSIFLQEVITTVYGYKVQLQAGLELVVNNQKMAVPYRPNEHLRVTLWGQRLYLVTDFELVVSFGGRKNAVISLPSMYEGLVSGLCGNYDKNRKNDMMLPSGALTQNLNTFGNSWEVKTEDALLRFPRAIPAEEEGQGAELGLRTGLQVSECSPEQLASNSTQACRVLADPQGPFAACHQTVAPEPFQEHCVLDLCSAQDPREQEELRCQVLSGYAILCQEAGAALAGWRDRTLCESPCLQNPCQNDGQCREQGATFTCECEVGYGGGLCMEPRDAPPPRKPASNLVGVLLGLLVPVVVVLLAVTRECIYRTRRKREKTQEGDRLARLVDTDTVLDCAC,mutated_sequence,1.0,2720.0,UPI00004575C6.a2m,UPI00004575C6.npy,gnomAD
+UPI00022F843B,UPI00022F843B.csv,XFSTHLSPNPSWCFKRVKIPDRAPGWTRNYCPAGLHFHSGRWLVVPDKEDPEGNHTLPCRAPGFSSAKLTLTRLQEGKEPTPDSRLKGTRTREMRHTRAGQLWGSFQRGAEIRLPGGAPGPGGAPQCDWVKCFQRTRVEVGGISSRPCRHLPQNPRAPPSSLLNGNAGDVLSHPLSTHTSPLIWKFFSL,mutated_sequence,1.0,189.0,UPI00022F843B.a2m,UPI00022F843B.npy,gnomAD
+UPI0000206B64,UPI0000206B64.csv,MDEEENHYVSQLREVYSSCDTTGTGFLDRQELTQLCLKLHLEQQLPVLLQTLLGNDHFARVNFEEFKEGFVAVLSSNAGVRPSDEDSSSLESAASSAIPPKYVNGSKWYGRRSRPELCDAATEARRVPEQQTQASLKSHLWRSASLESVESPKSDEEAESTKEAQNELFEAQGQLQTWDSEDFGSPQKSCSPSFDTPESQIRGVWEELGVGSSGHLSEQELAVVCQSVGLQGLEKEELEDLFNKLDQDGDGKVSLEEFQLGLFSHEPALLLESSTRVKPSKAWSHYQVPEESGCHTTTTSSLVSLCSSLRLFSSIDDGSGFAFPDQVLAMWTQEGIQNGREILQSLDFSVDEKVNLLELTWALDNELMTVDSAVQQAALACYHQELSYQQGQVEQLARERDKARQDLERAEKRNLEFVKEMDDCHSTLEQLTEKKIKHLEQGYRERLSLLRSEVEAERELFWEQAHRQRAALEWDVGRLQAEEAGLREKLTLALKENSRLQKEIVEVVEKLSDSERLALKLQKDLEFVLKDKLEPQSAELLAQEERFAAVLKEYELKCRDLQDRNDELQAELEGLWARLPKNRHSPSWSPDGRRRQLPGLGPAGISFLGNSAPVSIETELMMEQVKEHYQDLRTQLETKVNYYEREIAALKRNFEKERKDMEQARRREVSVLEGQKADLEELHEKSQEVIWGLQEQLQDTARGPEPEQMGLAPCCTQALCGLALRHHSHLQQIRREAEAELSGELSGLGALPARRDLTLELEEPPQGPLPRGSQRSEQLELERALKLQPCASEKRAQMCVSLALEEEELELARGKRVDGPSLEAEMQALPKDGLVAGSGQEGTRGLLPLRPGCGERPLAWLAPGDGRESEEAAGAGPRRRQAQDTEATQSPAPAPAPASHGPSERWSRMQPCGVDGDIVPKEPEPFGASAAGLEQPGARELPLLGTERDASQTQPRMWEPPLRPAASCRGQAERLQAIQEERARSWSRGTQEQASEQQARAEGALEPGCHKHSVEVARRGSLPSHLQLADPQGSWQEQLAAPEEGETKIALEREKDDMETKLLHLEDVVRALEKHVDLRENDRLEFHRLSEENTLLKNDLGRVRQELEAAESTHDAQRKEIEVLKKDKEKACSEMEVLNRQNQNYKDQLSQLNVRVLQLGQEASTHQAQNEEHRVTIQMLTQSLEEVVRSGQQQSDQIQKLRVELECLNQEHQSLQLPWSELTQTLEESQDQVQGAHLRLRQAQAQHLQEVRLVPQDRVAELHRLLSLQGEQARRRLDAQREEHEKQLKATEERVEEAEMILKNMEMLLQEKVDKLKEQFEKNTKSDLLLKELYVENAHLVRALQATEEKQRGAEKQSRLLEEKVRALNKLVSRIAPAALSV,mutated_sequence,1.0,1382.0,UPI0000206B64.a2m,UPI0000206B64.npy,gnomAD
+UPI0000E599FA,UPI0000E599FA.csv,MCQCVRACVCVCVCACATQRASHSALPGTTISVKDWRLCLLDQFDACARSGLSEPRSLTLRVPSCGKPLPGPGARLGREVTPCLSFAFAWCWLKMCQEEQTSYMVVQTSEEGLAADAELPGPLLMLAQNCAVMHNLLGPACIFLRKGFAENRQPDRSLRPEEIEELREAFREFDKDKDGYINCRDLGNCMRTMGYMPTEMELIELSQQINMNLGGHVDFDDFVELMGPKLLAETADMIGVKELRDAFREFDTNGDGEISTSELREAMRKLLGHQVGHRDIEEIIRDVDLNGDGRVDFEEFVRMMSR,mutated_sequence,1.0,306.0,UPI0000E599FA.a2m,UPI0000E599FA.npy,gnomAD
+UPI00015E00D0,UPI00015E00D0.csv,MLLLLGLCLGLSLCVGSQEEAQSWGHSSEQDGLRVPRQVRLLQRLKTKPLMTEFSVKSTIISRYAFTTVSCRMLNRASEDQDIEFQMQIPAAAFITNFTMLIGDKVYQGEITEREKKSGDRVKEKRNKTTEENGEKGTEIFRASAVIPSKDKAAFFLSYEELLQRRLGKYEHSISVRPQQLSGRLSVDVNILESAGIASLEVLPLHNSRQRGSGRGEDDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPSSVVQATKTLFPNYFNGSEIIIAGKLVDRKLDHLHVEVTASNSKKFIILKTDVPVRPQKAGKDVTGSPRPGGDGEGDTNHIERLWSYLTTKELLSSWLQSDDEPEKERLRQRAQALAVSYRFLTPFTSMKLRGPVPRMDGLEEAHGMSAAMGPEPVVQSVRGAGTQPGPLLKKPPNSVKKKQNKTKKRHGRDGVFPLHHLGIR,mutated_sequence,1.0,695.0,UPI00015E00D0.a2m,UPI00015E00D0.npy,gnomAD
+UPI0000EE0485,UPI0000EE0485.csv,MFRQFYLWTCLASGIILGSLFEICLGQYDDDCKLARGGPPATIVAIDEESRNGTILVDNMLIKGTAGGPDPTIELSLKDNVDYWVLMDPVKQMLFLNSTGRVLDRDPPMNIHSIVVQVQCINKKVGTIIYHEVRIVVRDRNDNSPTFKHESYYATVNELTPVGTTIFTGFSGDNGATDIDDGPNGQIEYVIQYNPDDPTSNDTFEIPLMLTGNIVLRKRLNYEDKTRYFVIIQANDRAQNLNERRTTTTTLTVDVLDGDDLGPMFLPCVLVPNTRDCRPLTYQAAIPELRTPEELNPIIVTPPIQAIDQDRNIQPPSDRPGILYSILVGTPEDYPRFFHMHPRTAELSLLEPVNRDFHQKFDLVIKAEQDNGHPLPAFAGLHIEILDENNQSPYFTMPSYQGYILESAPVGATISDSLNLTSPLRIVALDKDIEDVPPSGVPTKDPELHLFLNDYTSVFTVTQTGITRYLTLLQPVDREEQQTYTFSITAFDGVQESEPVIVNIQVMDANDNTPTFPEISYDVYVYTDMRPGDSVIQLTAVDADEGSNGEITYEILVGAQGDFIINKTTGLITIAPGVEMIVGRTYALTVQAADNAPPAERRNSICTVYIEVLPPNNQSPPRFPQLMYSLEISEAMRVGAVLLNLQATDREGDSITYAIENGDPQRVFNLSETTGILTLGKALDRESTDRYILIITASDGRPDGTSTATVNIVVTDVNDNAPVFDPYLPRNLSVVEEEANAFVGQVKATDPDAGINGQVHYSLGNFNNLFRITSNGSIYTAVKLNREVRDYYELVVVATDGAVHPRHSTLTLAIKVLDIDDNSPVFTNSTYTVLVEENLPAGTTILQIEAKDVDLGANVSYRIRSPEVKHFFALHPFTGELSLLRSLDYEAFPDQEASITFLVEAFDIYGTMPPGIATVTVIVKDMNDYPPVFSKRIYKGMVAPDAVKGTPITTVYAEDADPPGLPASRVRYRVDDVQFPYPASIFEVEEDSGRVITRVNLNEEPTTIFKLVVVAFDDGEPVMSSSATVKILVLHPGEIPRFTQEEYRPPPVSELATKGTMVGVISAAAINQSIVYSIVSGNEEDTFGINNITGVIYVNGPLDYETRTSYVLRVQADSLEVVLANLRVPSKSNTAKVYIEIQDENNHPPVFQKKFYIGGVSEDARMFTSVLRVKATDKDTGNYSVMAYRLIIPPIKEGKEGFVVETYTGLIKTAMLFHNMRRSYFKFQVIATDDYGKGLSGKADVLVSVVNQLDMQVIVSNVPPTLVEKKIEDLTEILDRYVQEQIPGAKVVVESIGARRHGDAFSLEDYTKCDLTVYAIDPQTNRAIDRNELFKFLDGKLLDINKDFQPYYGEGGRILEIRTPEAVTSIKKRGESLGYTEGALLALAFIIILCCIPAILVVLVSYRQFKVRQAECTKTARIQAALPAAKPAVPAPAPVAAPPPPPPPPPGAHLYEELGDSSMHKYEMPQYGSRRRLLPPAGQEEYGEVVGEAEEEYEEEEWARKRMIKLVVDREYETSSTGEDSAPECQRNRLHHPSIHSNINGNIYIAQNGSVVRTRRACLTDNLKVASPVRLGGPFKKLDKLAVTHEENVPLNTLSKGPFSTEKMNARPTLVTFAPCPVGTDNTAVKPLRNRLKSTVEQESMIDSKNIKEALEFHSDHTQSDDEELWMGPWNNLHIPMTKL,mutated_sequence,1.0,1684.0,UPI0000EE0485.a2m,UPI0000EE0485.npy,gnomAD
+UPI00004589BB,UPI00004589BB.csv,MAPPAHKSILERSENVLMSPWKGKLIVQDRMLCDIALWSTYGAMIPTQLPQELDFKYVMKVSSLKKRLPEAAFRKQNYLEEKVCFQDLCFNLYEVELSNRQGENIDKLTECIKNKQLAIIKCLEDRGFFILLTSSALLSEPDFGGKQMGLHGLHLFRSPLSTGVKDLKVEDDISMKVIPILSTLNCALLETKKSLPEERIHPNTLVKRHFQELYKADRSPSLSVAPQDRMKDPTFLGKLPSGFDLIPPAEKCPSESLTQLNSYFSDPSAYILEVSTALDLLAEHPQSPCVSDGICDAGFSLVMTPDPEFLVSEAEVRKETETKKDSEEMLKAKKRVFPLSPASNLRVQPKRKASMPHMVQSKKVNLCRPFPKRTASRADNSSDSPTTLKLVKGQFPQKRKRGAEVLTAQFVQKTKLDRKNQEAPISKDVPVPTNAKRARKQEKSPVKTVPRAKPPVKKSPQKQRVNIVKGNENPRNRKQLQPVKGETASKLQSEISRGCQEDGISINSVQPENTTAAHNDLPENSIVNYDSQALNMLADLALSSATSSTPVSEARNLHCSSELPQNDVLLSKENSLRGTSDHEYHRGVKTQKGELLPNPSSDRKSNSGSDLTVSQDEESLVPCSQAPAKAQSALTEEMLESSDASQSSSVSVEHSYALLLTEHSKKHLQEREILSPLFPRNGTKSPEAATPVGKVMPFRHQPGLLLQQKPPDDPVVKPKDRPPSARVKKSSCSRIVLSCDDSVKITFKCETEYAFSLDSKYTNNPLEKTVVRALHGPWNTDLPDNVEEVKLLLHMWVALFYSNQNKIIRSSRKVVEHSNPAKYVSINSTLESCELREIEESLGLEKCSADSLLETNEISRAHAAEVSFRDPNCLLPFIKTPLTQGLELCVQNEQKKTFARECDPDTQEDQNFICSYNNEVTGEEAKQESLETSNLVLSGIGSTQTNGPSVPSEEEIVQPLDSTRVASYSGTVTQATFTRTYDGPGSQPVICQSSVYGTLENKVDILDAAVQTKTGTLQDLIQHGSPINNECHPSLERKDDNMGCAVINPEPITLTFEKNAHVPIQTEGVNTADERTTFKKELIKQVSPAASLRHPVSTSENARTQGLRDIPSLVVAGQKGTKYLCASSVGGETLDKAVCSLQKETPLPVSLPSDKTMVMEALSLAKSSSHLSPSEEVRCTQDFLSQTQSLLGLSSEGLLELTQVEVDSSSASTTLGRQCSLNCISSGCHTSGDSLELRKNHKNGPNTENMNLEAFDSVFIKQTSLSVSREVSLELSEEDSDIDLALTISPPTSPREEMPAGEIEQFEEAPFSNLELQDVAEEIGEPEEVALTESREVSSADNVSVYPSVSEEPVENKERKGDNLQPVTLILSKENCTLEIAEEINVTSDFPFDSVIEEVSPASSPEPPVPVKETRPYQAVTPCILKLHGTQCEKSNQISQCESEDLGITEKENVFVGPTHPVGQDNFTQVQQMQVSAEMPLILTDHPGRTGRPTLPGKVTEEIVSSEHDEGLSFSGKVQCYGRELNQPASAAKCTGDFSPSPEKLVKSGNPLQPVSIENRNLDLKHLVLESSEPPFGPRNVIENKSLSDTLVSTTAPSGIVNVSVKQQTSPKSSQNHLFPGDLKTDEGIYLQVKSLTAASVDGAYSTQGCMCSVVPTLCSSSDNATLTHYVRPINAEPVFQAQEIPAGRMASLLKNGEPEAELHKETTGPGTAGPQSNTTSSLKGERKAIHTLQDVSTCETKELLNVGVSSLCAGPYQNTADTKENLSKEPLASFVSESFDTSVCGIATEHVEIENSGEGLRAEAGSETLGRDGEVGVNSDMHYELSGDSDLDLLGDCRNPRLDLEDSYTLRGSYTRKKDVPTDGYESSLNFHNNNQEDWGCSSWVPGMETSLPPGHWTAAVKKEEKCVPPYVQIRDLHGILRTYANFSITKELKDTMRTSHGLRRHPSFSANCGLPSSWTSTWQVADDLTQNTLDLEYLRFAHKLKQTIKNGDSQHSASSANVFPKESPTQISIGAFPSTKISEAPFLHPAPRSRSPLLVTVVESDPRPQGQPRRGYTASSLDSSSSWRERCSHNRDLRNSQRNHTVSFHLNKLKYNSTVKESRNDISLILNEYAEFNKVMKNSNQFIFQDKELNDVSGEATAQEMYLPFPGRSASYEDIIIDVCTNLHVKLRSVVKEACKSTFLFYLVETEDKSFFVRTKNLLRKGGHTEIEPQHFCQAFHRENDTLIIIIRNEDISSHLHQIPSLLKLKHFPSVIFAGVDSPGDVLDHTYQELFRAGGFVISDDKILEAVTLVQLKEIIKILEKLNGNGRWKWLLHYRENKKLKEDERVDSTAHKKNIMLKSFQSANIIELLHYHQCDSRSSTKAEILKCLLNLQIQHIDARFAVLLTDKPTIPREVFENSGILVTDVNNFIENIEKIAAPFRSSYW,mutated_sequence,1.0,2430.0,UPI00004589BB.a2m,UPI00004589BB.npy,gnomAD
+UPI0000130C80,UPI0000130C80.csv,MYYAVSQARVNAVPGTMLRPQRPGDLQLGASLYELVGYRQPPSSSSSSTSSTSSTSSSSTTAPLLPKAAREKPEAPAEPPGPGPGSGAHPGGSARPDAKEEQQQQLRRKINSRERKRMQDLNLAMDALREVILPYSAAHCQGAPGRKLSKIATLLLARNYILLLGSSLQELRRALGEGAGPAAPRLLLAGLPLLAAAPGSVLLAPGAVGPPDALRPAKYLSLALDEPPCGQFALPGGGAGGPGLCTCAVCKFPHLVPASLGLAAVQAQFSK,mutated_sequence,1.0,271.0,UPI0000130C80.a2m,UPI0000130C80.npy,gnomAD
+UPI00001C1E1E,UPI00001C1E1E.csv,MGDFAAPAAAANGSSICINSSLNSSLGGAGIGVNNTPNSTPAAPSSNHPAAGGCGGSGGPGGGSAAVPKHSTVVERLRQRIEGCRRHHVNCENRYQQAQVEQLELERRDTVSLYQRTLEQRAKKSGAGTGKQQHPSKPQQDAEAASAEQRNHTLIMLQETVKRKLEGARSPLNGDQQNGACDGNFSPTSKRIRKDISAGMEAINNLPSNMPLPSASPLHQLDLKPSLPLQNSGTHTPGLLEDLSKNGRLPEIKLPVNGCSDLEDSFTILQSKDLKQEPLDDPTCIDTSETSLSNQNKLFSDINLNDQEWQELIDELANTVPEDDIQDLFNEDFEEKKEPEFSQPATETPLSQESASVKSDPSHSPFAHVSMGSPQARPSSSGPPFSTVSTATSLPSVASTPAAPNPASSPANCAVQSPQTPNQAHTPGQAPPRPGNGYLLNPAAVTVAGSASGPVAVPSSDMSPAEQLKQMAAQQQQRAKLMQQKQQQQQQQQQQQQQQQQQQQQQQQQQHSNQTSNWSPLGPPSSPYGAAFTAEKPNSPMMYPQAFNNQNPIVPPMANNLQKTTMNNYLPQNHMNMINQQPNNLGTNSLNKQHNILTYGNTKPLTHFNADLSQRMTPPVANPNKNPLMPYIQQQQQQQQQQQQQQQQQQPPPPQLQAPRAHLSEDQKRLLLMKQKGVMNQPMAYAALPSHGQEQHPVGLPRTTGPMQSSVPPGSGGMVSGASPAGPGFLGSQPQAAIMKQMLIDQRAQLIEQQKQQFLREQRQQQQQQQQQILAEQQLQQSHLPRQHLQPQRNPYPVQQVNQFQGSPQDIAAVRSQAALQSMRTSRLMAQNAGMMGIGPSQNPGTMATAAAQSEMGLAPYSTTPTSQPGMYNMSTGMTQMLQHPNQSGMSITHNQAQGPRQPASGQGVGMVSGFGQSMLVNSAITQQHPQMKGPVGQALPRPQAPPRLQSLMGTVQQGAQSWQQRSLQGMPGRTSGELGPFNNGASYPLQAGQPRLTKQHFPQGLSQSVVDANTGTVRTLNPAAMGRQMMPSLPGQQGTSQARPMVMSGLSQGVPGMPAFSQPPAQQQIPSGSFAPSSQSQAYERNAPQDVSYNYSGDGAGGSFPGLPDGADLVDSIIKGGPGDEWMQELDELFGNP,mutated_sequence,1.0,1138.0,UPI00001C1E1E.a2m,UPI00001C1E1E.npy,gnomAD
+UPI000006FF4C,UPI000006FF4C.csv,MADKMVRTPKCSRCRNHGFLVPVKGHAGKCRWKQCLCEKCYLISERQKIMAAQKVLKTQAAEEEQEAALCAQGPKQASGAAAAAPAPVPVPAASLRPLSPGTPSGDADPGPEGRAAACFFEQPPRGRNPGPRALQPVLGGRSHVEPSERAAVAMPSLAGPPFGAEAAGSGYPGPLDLRRPMRTVPGPLFTDFVRPLNINPDRALGPEYPGGSSMHPYCPFPLGYLDAPPGVPLQQGFRHVSRSQYQGGGLVSEPGGDFQPSYYLPPPPPPLPPLPPLPPQPQFLPPGYLSALHFLPPPPPPPPPSSFSLTVLFDTDKENTDDQDAEVLSGEPSQPSSQEQSD,mutated_sequence,1.0,342.0,UPI000006FF4C.a2m,UPI000006FF4C.npy,gnomAD
+UPI0000071B6E,UPI0000071B6E.csv,MEAARPFAREWRAQSLPLAVGGVLKLRLCELWLLLLGSSLNARFLPDEEDVDFINEYVNLHNELRGDVIPRGSNLRFMTWDVALSRTARAWGKKCLFTHNIYLQDVQMVHPKFYGIGENMWVGPENEFTASIAIRSWHAEKKMYNFENGSCSGDCSNYIQLVWDHSYKVGCAVTPCSKIGHIIHAAIFICNYAPGGTLTRRPYEPGIFCTRCGRRDKCTDFLCSKIKKINMKKMHNGLDKKNKRLNTSFLWSC,mutated_sequence,1.0,253.0,UPI0000071B6E.a2m,UPI0000071B6E.npy,gnomAD
+UPI000012ADEC,UPI000012ADEC.csv,MMQESATETISNSSMNQNGMSTLSSQLDAGSRDGRSSGDTSSEVSTVELLHLQQQQALQAARQLLLQQQTSGLKSPKSSDKQRPLQVPVSVAMMTPQVITPQQMQQILQQQVLSPQQLQALLQQQQAVMLQQQQLQEFYKKQQEQLHLQLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHPGKQAKEQQQQQQQQQQLAAQQLVFQQQLLQMQQLQQQQHLLSLQRQGLISIPPGQAALPVQSLPQAGLSPAEIQQLWKEVTGVHSMEDNGIKHGGLDLTTNNSSSTTSSNTSKASPPITHHSIVNGQSSVLSARRDSSSHEETGASHTLYGHGVCKWPGCESICEDFGQFLKHLNNEHALDDRSTAQCRVQMQVVQQLEIQLSKERERLQAMMTHLHMRPSEPKPSPKPLNLVSSVTMSKNMLETSPQSLPQTPTTPTAPVTPITQGPSVITPASVPNVGAIRRRHSDKYNIPMSSEIAPNYEFYKNADVRPPFTYATLIRQAIMESSDRQLTLNEIYSWFTRTFAYFRRNAATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEVEYQKRRSQKITGSPTLVKNIPTSLGYGAALNASLQAALAESSLPLLSNPGLINNASSGLLQAVHEDLNGSLDHIDSNGNSSPGCSPQPHIHSIHVKEEPVIAEDEDCPMSLVTTANHSPELEDDREIEEEPLSEDLE,mutated_sequence,1.0,715.0,UPI000012ADEC.a2m,UPI000012ADEC.npy,gnomAD
+UPI000013C70B,UPI000013C70B.csv,MFYYPNVLQRHTGCFATIWLAATRGSRLVKREYLRVNVVKTCEEILNYVLVRVQPPQPGLPRPRFSLYLSAQLQIGVIRVYSQQCQYLVEDIQHILERLHRAQLQIRIDMETELPSLLLPNHLAMMETLEDAPDPFFGMMSVDPRLPSPFDIPQIRHLLEAAIPERVEEIPPEVPTEPREPERIPVTVLPPEAITILEAEPIRMLEIEGERELPEVSRRELDLLIAEEEAILLEIPRLPPPAPAEVEGIGEALGPEELRLTGWEPGALLMEVTPPEELRLPAPPSPERRPPVPPPPRRRRRRRLLFWDKETQISPEKFQEQLQTRAHCWECPMVQPPERTIRGPAELFRTPTLSGWLPPELLGLWTHCAQPPPKALRRELPEEAAAEEERRKIEVPSEIEVPREALEPSVPLMVSLEISLEAAEEEKSRISLIPPEERWAWPEVEAPEAPALPVVPELPEVPMEMPLVLPPELELLSLEAVHRAVALELQANREPDFSSLVSPLSPRRMAARVFYLLLVLSAQQILHVKQEKPYGRLLIQPGPRFH,mutated_sequence,1.0,546.0,UPI000013C70B.a2m,UPI000013C70B.npy,gnomAD
+UPI0000131BD3,UPI0000131BD3.csv,MELWPCLAAALLLLLLLVQLSRAAEFYAKVALYCALCFTVSAVASLVCLLRHGGRTVENMSIIGWFVRSFKYFYGLRFEVRDPRRLQEARPCVIVSNHQSILDMMGLMEVLPERCVQIAKRELLFLGPVGLIMYLGGVFFINRQRSSTAMTVMADLGERMVRENLKVWIYPEGTRNDNGDLLPFKKGAFYLAVQAQVPIVPVVYSSFSSFYNTKKKFFTSGTVTVQVLEAIPTSGLTAADVPALVDTCHRAMRTTFLHISKTPQENGATAGSGVQPAQ,mutated_sequence,1.0,278.0,UPI0000131BD3.a2m,UPI0000131BD3.npy,gnomAD
+UPI0000E5A38D,UPI0000E5A38D.csv,XCRSFLLRPSLCAVLILQFFLFHHFGTGSIRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVVVKILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCA,mutated_sequence,1.0,200.0,UPI0000E5A38D.a2m,UPI0000E5A38D.npy,gnomAD
+UPI00001AE868,UPI00001AE868.csv,MGLGLLLPLLLLWTRGTQGSELDPKGQHVCVASSPSAELQCCAGWRQKDQECTIPICEGPDACQKDEVCVKPGLCRCKPGFFGAHCSSRCPGQYWGPDCRESCPCHPHGQCEPATGACQCQADRWGARCEFPCACGPHGRCDPATGVCHCEPGWWSSTCRRPCQCNTAAARCEQATGACVCKPGWWGRRCSFRCNCHGSPCEQDSGRCACRPGWWGPECQQQCECVRGRCSAASGECTCPPGFRGARCELPCPAGSHGVQCAHSCGRCKHNEPCSPDTGSCESCEPGWNGTQCQQPCLPGTFGESCEQQCPHCRHGEACEPDTGHCQRCDPGWLGPRCEDPCPTGTFGEDCGSTCPTCVQGSCDTVTGDCVCSAGYWGPSCNASCPAGFHGNNCSVPCECPEGLCHPVSGSCQPGSGSRDTALIAGSLVPLLLLFLGLACCACCCWAPRSDLKDRPARDGATVSRMKLQVWGTLTSLGSTLPCRSLSSHKLPWVTVSHHDPEVPFNHSFIEPPSAGWATDDSFSSDPESGEADEVPAYCVPPQEGMVPVAQAGSSEASLAAGAFPPPEDASTPFAIPRTSSLARAKRPSVSFAEGTKFAPQSRRSSGELSSPLRKPKRLSRGAQSGPEGREAEESTGPEEAEAPESFPAAASPGDSATGHRRPPLGGRTVAEHVEAIEGSVQESSGPVTTIYMLAGKPRGSEGPVRSVFRHFGSFQKGQAEAKVKRAIPKPPRQALNRKKGSPGLASGSVGQSPNSAPKAGLPGATGPMAVRPEEAVRGLGAGTESSRRAQEPVSGCGSPEQDPQKQAEEERQEEPEYENVVPISRPPEP,mutated_sequence,1.0,830.0,UPI00001AE868.a2m,UPI00001AE868.npy,gnomAD
+UPI0000203C32,UPI0000203C32.csv,MERISAFFSSIWDTILTKHQEGIYNTICLGVLLGLPLLVIITLLFICCHCCWSPPGKRGQQPEKNKKKKKKKKKKDEEDLWISAQPKLLQMEKRPSLPV,mutated_sequence,1.0,99.0,UPI0000203C32.a2m,UPI0000203C32.npy,gnomAD
+UPI000006F1B6,UPI000006F1B6.csv,MATVAELKAVLKDTLEKKGVLGHLKARIRAEVFNALDDDREPRPSLSHENLLINELIREYLEFNKYKYTASVLIAESGQPVVPLDRQFLIHELNAFEESKDNTIPLLYGILAHFLRGTKDGIQNAFLKGPSLQPSDPSLGRQPSRRKPMDDHLRKEEQKSTNIEDLHVSQAVNR,mutated_sequence,1.0,174.0,UPI000006F1B6.a2m,UPI000006F1B6.npy,gnomAD
+UPI000004A077,UPI000004A077.csv,MAQEVSEYLSQNPRVAAWVEALRCDGETDKHWRHRRDFLLRNAGDLAPAGGAASASTDEAADAESGTRNRQLQQLISFSMAWANHVFLGCRYPQKVMDKILSMAEGIKVTDAPTYTTRDELVAKVKKRGISSSNEGVEEPSKKRVIEGKNSSAVEQDHAKTSAKTERASAQQENSSTCIGSAIKSESGNSARSSGISSQNSSTSDGDRSVSSQSSSSVSSQVTTAGSGKASEAEAPDKHGSASFVSLLKSSVNSHMTQSTDSRQQSGSPKKSALEGSSASASQSSSEIEVPLLGSSGSSEVELPLLSSKPSSETASSGLTSKTSSEASVSSSVAKNSSSSGTSLLTPKSSSSTNTSLLTSKSTSQVAASLLASKSSSQTSGSLVSKSTSLASVSQLASKSSSQTSTSQLPSKSTSQSSESSVKFSCKLTNEDVKQKQPFFNRLYKTVAWKLVAVGGFSPNVNHGELLNAAIEALKATLDVFFVPLKELADLPQNKSSQESIVCELRCKSVYLGTGCGKSKENAKAVASREALKLFLKKKVVVKICKRKYRGSEIEDLVLLDEESRPVNLPPALKHPQELL,mutated_sequence,1.0,580.0,UPI000004A077.a2m,UPI000004A077.npy,gnomAD
+UPI000013E694,UPI000013E694.csv,MALAALMIALGSLGLHTWQAQAVPILPLGLAPDTFDDTYVGCAEEMEEKAAPLLKEEMAHHALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSRELYMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEVVFRGVGSLRFEPKRLGDSVRLGQFASSSLDKAVAHRFGNATLFSLTTCFGAPIQAFSVFPKEREVLIPPHEVFLVTRFSQDGAQSLVTLWSYNQTCSHFNCAYLGGEKRRGCVSAPGALGTGDLHMTKRHLQQP,mutated_sequence,1.0,291.0,UPI000013E694.a2m,UPI000013E694.npy,gnomAD
+UPI0000488BDC,UPI0000488BDC.csv,MVSYWDTGVLLCALLSCLLLTGSSSGSKLKDPELSLKGTQHIMQAGQTLHLQCRGEAAHKWSLPEMVSKESERLSITKSACGRNGKQFCSTLTLNTAQANHTGFYSCKYLAVPTSKKKETESAIYIFISDTGRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQTNTIIDVQISTPRPVKLLRGHTLVLNCTATTPLNTRVQMTWSYPDEKNKRASVRRRIDQSNSHANIFYSVLTIDKMQNKDKGLYTCRVRSGPSFKSVNTSVHIYDKAFITVKHRKQQVLETVAGKRSYRLSMKVKAFPSPEVVWLKDGLPATEKSARYLTRGYSLIIKDVTEEDAGNYTILLSIKQSNVFKNLTATLIVNVKPQIYEKAVSSFPDPALYPLGSRQILTCTAYGIPQPTIKWFWHPCNHNHSEARCDFCSNNEESFILDADSNMGNRIESITQRMAIIEGKNKMASTLVVADSRISGIYICIASNKVGTVGRNISFYITDVPNGFHVNLEKMPTEGEDLKLSCTVNKFLYRDVTWILLRTVNNRTMHYSISKQKMAITKEHSITLNLTIMNVSLQDSGTYACRARNVYTGEEILQKKEITIRDQEAPYLLRNLSDHTVAISSSTTLDCHANGVPEPQITWFKNNHKIQQEPELYTSTSPSSSSSSPLSSSSSSSSSSSS,mutated_sequence,1.0,733.0,UPI0000488BDC.a2m,UPI0000488BDC.npy,gnomAD
+UPI0000071E24,UPI0000071E24.csv,MRLPLSHSPEHVEMALLSNILAAYSFVSENPERAALYFVSGVCIGLVLTLAALVIRISCHTDCRRRPGKKFLQDRESSSDSSDSEDGSEDTVSDLSVRRHRRFERTLNKNVFTSAEELERAQRLEERERIIREIWMNGQPEVPGTRSLNRYY,mutated_sequence,1.0,152.0,UPI0000071E24.a2m,UPI0000071E24.npy,gnomAD
+UPI000059D33E,UPI000059D33E.csv,MRELEAKATKDVERNLSRDLVQEEEQLMEEKKKKKDDKKKKEAAQKKATEQKIKVPEQIKPSVSQPQPANSNNGTSTATSTNNNAKRATANNQQPQQQQQQQQPQQQQPQQQPQPQPQQQQPQQQPQALPRYPREVPPRFRHQEHKQLLKRGQHFPVIAANLGSAVKVLNSQSESSALTNQQPQNNGEVQNSKNQSDINHSTSGSHYENSQRGPVSSTSDSSTNCKNAVVSDLSEKEAWPSAPGSDPELASECMDADSASSSESERNITIMASGNTGGEKDGLRNSTGLGSQNKFVVGSSSNNVGHGSSTGPWGFSHGAIISTCQVSVDAPESKSESSNNRMNAWGTVSSSSNGGLNPSTLNSASNHGAWPVLENNGLALKGPVGSGSSGINIQCSTIGQMPNNQSINSKVSGGSTHGTWGSLQETCESEVSGTQKVSFSGQPQNITTEMTGPNNTTNFMTSSLPNSGSVQNNELPSSNTGAWRVSTMNHPQMQAPSGMNGTSLSHLSNGESKSGGSYGTTWGAYGSNYSGDKCSGPNGQANGDTVNATLMQPGVNGPMGTNFQVNTNKGGGVWESGAANSQSTSWGSGNGANSGGSRRGWGTPAQNTGTNLPSVEWNKLPSNQHSNDSANGNGKTFTNGWKSTEEEDQGSATSQTNEQSSVWAKTGGTVESDGSTESTGRLEEKGTGESQSRDRRKIDQHTLLQSIVNRTDLDPRVLSNSGWGQTPIKQNTAWDTETSPRGERKTDNGTEAWGSSATQTFNSGACIDKTSPNGNDTSSVSGWGDPKPALRWGDSKGSNCQGGWEDDSAATGMVKSNQWGNCKEEKAAWNDSQKNKQGWGDGQKSSQGWSVSASDNWGETSRNNHWGEANKKSSSGGSDSDRSVSGWNELGKTSSFTWGNNINPNNSSGWDESSKPTPSQGWGDPPKSNQSLGWGDSSKPVSSPDWNKQQDIVGSWGIPPATGKPPGTGWLGGPIPAPAKEEEPTGWEEPSPESIRRKMEIDDGTSAWGDPSKYNYKNVNMWNKNVPNGNSRSDQQAQVHQLLTPASAISNKEASSGSGWGEPWGEPSTPATTVDNGTSAWGKPIDSGPSWGEPIAAASSTSTWGSSSVGPQALSKSGPKSMQDGWCGDDMPLPGNRPTGWEEEEDVEIGMWNSNSSQELNSSLNWPPYTKKMSSKGLSGKKRRRERGMMKGGNKQEEAWINPFVKQFSNISFSRDSPEENVQSNKMDLSGGMLQDKRMEIDKHSLNIGDYNRTVGKGPGSRPQISKESSMERNPYFDKDGIVADESQNMQFMSSQSMKLPPSNSALPNQALGSIAGLGMQNLNSVRQNGNPSMFGVGNTAAQPRGMQQPPAQPLSSSQPNLRAQVPPPLLSPQVPVSLLKYAPNNGGLNPLFGPQQVAMLNQLSQLNQLSQISQLQRLLAQQQRAQSQRSVPSGNRPQQDQQGRPLSVQQQMMQQSRQLDPNLLVKQQTPPSQQQPLHQPAMKSFLDNVMPHTTPELQKGPSPINAFSNFPIGLNSNLNVNMDMNSIKEPQSRLRKWTTVDSISVNTSLDQNSSKHGAISSGFRLEESPFVPYDFMNSSTSPASPPGSIGDGWPRAKSPNGSSSVNWPPEFRPGEPWKGYPNIDPETDPYVTPGSVINNLSINTVREVDHLRDRNSGSSSSLNTTLPSTSAWSSIRASNYNVPLSSTAQSTSARNSDSKLTWSPGSVTNTSLAHELWKVPLPPKNITAPSRPPPGLTGQKPPLSTWDNSPLRIGGGWGNSDARYTPGSSWGESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYSTSLWGPPSSSDPRGISSPSPINAFLSVDHLGGGGESM,mutated_sequence,1.0,1962.0,UPI000059D33E.a2m,UPI000059D33E.npy,gnomAD
+UPI000013D842,UPI000013D842.csv,MSVRYSSSKHYSSSRSGGGGGGGGCGGGGGVSSLRISSSKGSLGGGFSSGGFSGGSFSRGSSGGGCFGGSSGGYGGLGGFGGGSFRGSYGSSSFGGSYGGIFGGGSFGGGSFGGGSFGGGGFGGGGFGGGFGGGFGGDGGLLSGNEKVTMQNLNDRLASYLDKVRALEESNYELEGKIKEWYEKHGNSHQGEPRDYSKYYKTIDDLKNQILNLTTDNANILLQIDNARLAADDFRLKYENEVALRQSVEADINGLRRVLDELTLTKADLEMQIESLTEELAYLKKNHEEEMKDLRNVSTGDVNVEMNAAPGVDLTQLLNNMRSQYEQLAEQNRKDAEAWFNEKSKELTTEIDNNIEQISSYKSEITELRRNVQALEIELQSQLALKQSLEASLAETEGRYCVQLSQIQAQISALEEQLQQIRAETECQNTEYQQLLDIKIRLENEIQTYRSLLEGEGSSGGGGRGGGSFGGGYGGGSSGGGSSGGGHGGGHGGSSGGGYGGGSSGGGSSGGGYGGGSSSGGHGGSSSGGYGGGSSGGGGGGYGGGSSGGGSSSGGGYGGGSSSGGHKSSSSGSVGESSSKGPRY,mutated_sequence,1.0,584.0,UPI000013D842.a2m,UPI000013D842.npy,gnomAD
+UPI00001378FC,UPI00001378FC.csv,MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEEEEDDDDEDGGGGDHGGGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGKSGGGGSSSSGGGRVKKGGGKKSGKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIKTLEGEFSVTMWSSDEKKDIDHETVVEEQIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIACPHKGCTKMFRDNSAMRKHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHAKAKNNQ,mutated_sequence,1.0,414.0,UPI00001378FC.a2m,UPI00001378FC.npy,gnomAD
+UPI0001838820,UPI0001838820.csv,MAPYPCGCHILLLLFCCLAAARANLLNLNWLWFNNEDTSHAATTIPEPQGPLPVQPTADTTTHVTPRNGSTEPATAPGSPEPPSELLEDGQDTPTSAESPDAPEENIAGVGAEILNVAKGIRSFVQLWNDTVPTESLARAETLVLETPVGPLALAGPSSTPQENGTTLWPSRGIPSSPGAHTTEAGTLPAPTPSPPSLGRPWAPLTGPSVPPPSSGRASLSSLLGGAPPWGSLQDPDSQGLSPAAAAPSQQLQRPDVRLRTPLLHPLVMGSLGKHAAPSAFSSGLPGALSQVAVTTLTRDSGAWVSHVANSVGPGLANNSALLGADPEAPAGRCLPLPPSLPVCGHLGISRFWLPNHLHHESGEQVRAGARAWGGLLQTHCHPFLAWFFCLLLVPPCGSVPPPAPPPCCQFCEALQDACWSRLGGGRLPVACASLPTQEDGYCVLIGPAAERISEEVGLLQLLGDPPPQQVTQTDDPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPPGPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGPPGPRGYPGIPGPKGESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAYIVLCIENSFMTASK,mutated_sequence,1.0,1754.0,UPI0001838820.a2m,UPI0001838820.npy,gnomAD
+UPI00001AFE92,UPI00001AFE92.csv,MGSSRAPWMGRVGGHGMMALLLAGLLLPGTLAKSIGTFSDPCKDPTRITSPNDPCLTGKGDSSGFSSYSGSSSSGSSISSARSSGGGSSGSSSGSSIAQGGSAGSFKPGTGYSQVSYSSGSGSSLQGASGSSQLGSSSSHSGNSGSHSGSSSSHSSSSSSFQFSSSSFQVGNGSALPTNDNSYRGILNPSQPGQSSSSSQTFGVSSSGQSVSSNQRPCSSDIPDSPCSGGPIVSHSGPYIPSSHSVSGGQRPVVVVVDQHGSGAPGVVQGPPCSNGGLPGKPCPPITSVDKSYGGYEVVGGSSDSYLVPGMTYSKGKIYPVGYFTKENPVKGSPGVPSFAAGPPISEGKYFSSNPIIPSQSAASSAIAFQPVGTGGVQLCGGGSTGSKGPCSPSSSRVPSSSSISSSSGLPYHPCGSASQSPCSPPGTGSFSSSSSSQSSGKIILQPCGSKSSSSGHPCMSVSSLTLTGGPDGSPHPDPSAGAKPCGSSSAGKIPCRSIRDILAQVKPLGPQLADPEVFLPQGELLNSP,mutated_sequence,1.0,529.0,UPI00001AFE92.a2m,UPI00001AFE92.npy,gnomAD
+UPI000013C8B1,UPI000013C8B1.csv,MMFPGLLAPPAGYPSLLRPTPTLTLPQSLQSAFSGHSSFLVEDLIRISRPPAYLPRSVPTASMSPPRQGAPTALTDTGASDLGSPGPGSRRGGSPPTAFSPASETTFLKFGVNAILSSGPRTETSPALLQSVPPKTFAFPYFEGSFQPFIRSSYFPASSSVVPIPGTFSWPLAARGKPRRGMLRRAVFSDVQRKALEKMFQKQKYISKPDRKKLAAKLGLKDSQVKIWFQNRRMKWRNSKERELLSSGGCREQTLPTKLNPHPDLSDVGQKGPGNEEEEEGPGSPSHRLAYHASSDPQHLRDPRLPGPLPPSPAHSSSPGKPSDFSDSEEEEEGEEQEEITVS,mutated_sequence,1.0,343.0,UPI000013C8B1.a2m,UPI000013C8B1.npy,gnomAD
+UPI000004B1EC,UPI000004B1EC.csv,MFLYLCFIFQRTCSEEMEEENATLLTEFVLTGFLHQPDCKIPLFLAFLVIYLITIMGNLGLIVLIWKDPHLHIPMYLFLGSLAFVDASLSSTVTPKMLINFLAKSKMISLSECMVQFFSLVTTVTTECFLLATMAYDRYVAICKALLYPVIMTNELCIQLLVLSFIGGLLHALIHEAFSFRLTFCNSNIIQHFYCDIIPLLKISCTDSSINFLMVFIFAGSVQVFTIGTILISYTIILFTILEKKSIKGIRKAVSTCGAHLLSVSLYYGPLTFKYLGSASPQADDQDMMESLFYTVIVPLLNPMIYSLRNKQVIASFTKMFKSNV,mutated_sequence,1.0,325.0,UPI000004B1EC.a2m,UPI000004B1EC.npy,gnomAD
+UPI00001D7FAE,UPI00001D7FAE.csv,MEEGVQAPDWDSDETVIEGSVTESDLEEKELPWRRLLFDQDASLKSEFSLHPDTRGMCKGMPSPEIQLGFKLREDLQEQMNKNKMMPVLSEDTILQSQDETERNQALLQTRKNCSMFIGSFRQSGLSLNHQNIEGPEAESPEVLPHIEKELSEGRDSPEVSLLSGTAITVSDTVAVKETSLVEPEKILAAPNTFFEPRKEVTMTMTSEETKDEESSLETFVSALESLLTSPESTQEERLFELVSDFDRKELMNPLSDSLSSISIPLNSWSACHRDLLEDAKDDALPAELLEALNTLSEAKVETICHRKEGGSSLIARNECLEVEFNTSQTNEDCTQIAETLQDPNPSGLQTLAHQNITSCEPLSNKRNSNSVTNSSDQETACVLRRSSRLEKLKVSRDAKYSDHMYKMPEKILPKILGCEDLTNNNSSAQNFRMQDPALMIDGKEKNMHSARFKNGKQIRKNEQFSGKKEKMKVNKISLHSINRRNIFGENLVYKAALHDDADLVHHCIKKGGNVNQPSYAGWTALHEASVGGFYRTASELLKGGADVNIKGLYQITPLHDAVMNGHYKVAELLLLNGADPLFRNDDGKCALDEAKDLCMKRLLERYIPKHQKCLTSAQRSSIDPLDIEDVYQHKKPKFSSKSHIWHVYNENSNRQKLEHVKVNKGSKASLFINKEDVYEYYQKDPKNTKFGKSKHKQSTLDQIYSTGLRKGNLHNVKDPNTNVPKGIGRRKTQHKRTQVDDVDCNPRKILAVSPSRRINRLVTYQQHIPETHNDLPEELCEPSSLTLSSLRNGLDSSTEACSVSKEKHIQNLDLSDSQEVQCLELESVDQTEAVSFPGLLLHKEIKLPVVTTDKQPHTLQEQHHVLYKSHENSNLVPKDERFNKWENSFLSFVKENSDNDDDDDCSTSEKAITSKKVLCSTGGKKHYNFKENLTNKKEMGFQQFLLSEDHLSQENELKAVSLTTLPEQEAVNFSYSDNAVISEHVANYEQCIFGPSFDHSNGNPEQNSLACMRTLLTHEASKLTNHVELFKKPQDYIPRAPTFLMNQTDTHIVEKMAKNCDTERNYIDRDQKIIYSNEPLSIVAHSQVIETTKVEKRRQNHLESETIHNIDSHSTDNMSKELANISKLSQREKKEISHKPGMKAGRINKRNARGESQLHLAVRRGNLPLVKALIESGADVNLNDNAGWTPLHEASNEGSIDIIVELLKAGAKVNCENIDGILPLHDAVANNHLKAAEILLQNGANPNQKDQKQKSALDEADDEKMKELLRSYGAIETVNRDESDAIVNEKIPAVRSKRHKQCFCDDGKTIDSSSLSHQERSRESLSVHQTLSAILQDIEEKQEYLLEFEIRNPEDAEQYIEKMLKIKKIMDNVLAKQKAERDDLAKKYRVSIESFKHGALREQLANLAARQKSLLVVAKKQKKISLKIQNCRNVTSLPCLSLRKLPPRSEISSEKDSQELTSLENLEHPQSGSLSPVSGSMQETQLSLETWNYSQNTNICLNSEAVRRGEFSGNDMNSKQNGSDCTLDGFPKSRHSDGTEKNKLPSQPVAFIGQTEYSQKENDLTEATDKDHEFYVSSPVIGKLNISETASVLAENAAHPSNIICDQDLSNYDPKRGNRKTSSQQSPTGASESLAHQGIAVLGSDTVHQMKPYLKKSVSVVPCADDSQISSSSGSGQQDTIKKALNYSTAPKKKCIQIKDLILLGRINPGNNILEFKTQETTHKASILLNGKLKVESGQIYKNPVTWLKDLLGGNSYVTWNYAWSKVTYLGKELLRYVSEDAPILPEPNSVPQQYQPCLPEVACLDDPVQEPNKSMFEKTKFGQGTSRESMQSSPRYLQINEILLISDQEFLPCHIMDQHWKFCVECEELTP,mutated_sequence,1.0,1873.0,UPI00001D7FAE.a2m,UPI00001D7FAE.npy,gnomAD
+UPI0000140198,UPI0000140198.csv,MLQTPESRGLPVPQAEGEKDGGHDGETRAPTASQERPKEELGAGREEGAAEPALTRKGARALAAKALARRRAYRRLNRTVAELVQFLLVKDKKKSPITRSEMVKYVIGDLKILFPDIIARAAEHLRYVFGFELKQFDRKHHTYILINKLKPLEEEEEEDLGGDGPRLGLLMMILGLIYMRGNSAREAQVWEMLRRLGVQPSKYHFLFGYPKRLIMEDFVQQRYLSYRRVPHTNPPEYEFSWGPRSNLEISKMEVLGFVAKLHKKEPQHWPVQYREALADEADRARAKARAEASMRARASARAGIHLW,mutated_sequence,1.0,307.0,UPI0000140198.a2m,UPI0000140198.npy,gnomAD
+UPI0000135F5C,UPI0000135F5C.csv,MGSPAHRPALLLLLPPLLLLLLLRVPPSRSFPGSGDSPLEDDEVGYSHPRYKDTPWCSPIKVKYGDVYCRAPQGGYYKTALGTRCDIRCQKGYELHGSSLLICQSNKRWSDKVICKQKRCPTLAMPANGGFKCVDGAYFNSRCEYYCSPGYTLKGERTVTCMDNKAWSGRPASCVDMEPPRIKCPSVKERIAEPNKLTVRVSWETPEGRDTADGILTDVILKGLPPGSNFPEGDHKIQYTVYDRAENKGTCKFRVKVRVKRCGKLNAPENGYMKCSSDGDNYGATCEFSCIGGYELQGSPARVCQSNLAWSGTEPTCAAMNVNVGVRTAAALLDQFYEKRRLLIVSTPTARNLLYRLQLGMLQQAQCGLDLRHITVVELVGVFPTLIGRIGAKIMPPALALQLRLLLRIPLYSFSMVLVDKHGMDKERYVSLVMPVALFNLIDTFPLRKEEMVLQAEMSQTCNT,mutated_sequence,1.0,464.0,UPI0000135F5C.a2m,UPI0000135F5C.npy,gnomAD
+UPI000000D824,UPI000000D824.csv,MNGDHMVLGSSVTDKKAIILVTILLLLRLVAIAGNGFITAALGVEWVLRRMLLPCDKLLVSLGASRFCLQSVVMGKTIYVFLHPMAFPYNPVLQFLAFQWDFLNAATLWSSTWLSVFYCVKIATFTHPVFFWLKHKLSGWLPWMLFSSVGLSSFTTILFFIGNHRMYQNYLRNHLQPWNVTGDSIRSYCEKFYLFPLKMITWTMPTAVFFICMILLITSLGRHRKKALLTTSGFREPSVQAHIKALLALLSFAMLFISYFLSLVFSAAGIFPPLDFKFWVWESVIYLCAAVHPIILLFSNCRLRAVLKSRRSSRCGTP,mutated_sequence,1.0,318.0,UPI000000D824.a2m,UPI000000D824.npy,gnomAD
+UPI0001F7848D,UPI0001F7848D.csv,MKKNTSKTTMRINKQDALCTPHSHDPRDLQNMLDGGEYAPFVSPPMLESNFIQVNRRGESIYLHNRANWVTVGICFS,mutated_sequence,1.0,77.0,UPI0001F7848D.a2m,UPI0001F7848D.npy,gnomAD
+UPI000004673A,UPI000004673A.csv,MGKGDVLEAAPTTTAYHSLMDEYGYEVGKAIGHGSYGSVYEAFYTKQKVMVAVKIISKKKASDDYLNKFLPREIQVMKVLRHKYLINFYRAIESTSRVYIILELAQGGDVLEWIQRYGACSEPLAGKWFSQLTLGIAYLHSKSIVHRDLKLENLLLDKWENVKISDFGFAKMVPSNQPVGCSPSYRQVNCFSHLSQTYCGSFAYACPEILRGLPYNPFLSDTWSMGVILYTLVVAHLPFDDTNLKKLLRETQKEVTFPANHTISQECKNLILQMLRQATKRATILDIIKDSWVLKFQPEQPTHEIRLLEAMCQLHNTTKQHQSLQITT,mutated_sequence,1.0,328.0,UPI000004673A.a2m,UPI000004673A.npy,gnomAD
+UPI00001DFBF3,UPI00001DFBF3.csv,MPFAKRIVEPQWLCRQRRPAPGPAVDASGGSAEPPPPLQPPGRRDLDEVEAPGPEEPARAVPAPSGLPPPPPPLPAPADQTQPPHGEASVAGEESTAGIPEAAPAAGEASSAAAAAAVLLMLDLCAVSNAALARVLRQLSDVARHACSLFQELESDIQLTHRRVWALQGKLGGVQRVLSTLDPKQEAVPVSNLDIESKLSVYYRAPWHQQRNIFLPATRPPCVEELHRHARQSLQALRREHRSRSDRREQRAAAPLSIAAPPLPAYPPAHSQRRREFKDRHFLTSHPPEDEDTDVMLGQRPKNPIHNIPSTLDKQTNWSKALPLPTPEEKMKQDAQVISSCIIPINVTGVGFDREASIRCSLVHSQSVLQRRRKLRRRKTISGIPRRVQQEIDSDESPVARERNVIVHTNPDPSNTVNRISGTRDSECQTEDILIAAPSRRRIRAQRGQSIAASLSHSAGNISALADKGDTMFTPAVSSRTRSRSLPREGNRGGDAEPKVGAKPSAYEEGESFVGDHERTPNDFSEAPSSPSAQDHQPTLGLACSQHLHSPQHKLSERGRSRLSRMAADSGSCDISSNSDTFGSPIHCISTAGVLLSSHMDQKDDHQSSSGNWSGSSSTCPSQTSETIPPAASPPLTGSSHCDSELSLNTAPHANEDASVFVTEQYNDHLDKVRGHRANSFTSTVADLLDDPNNSNTSDSEWNYLHHHHDASCRQDFSPERPKADSLGCPSFTSMATYDSFLEKSPSDKADTSSHFSVDTEGYYTSMHFDCGLKGNKSYVCHYAALGPENGQGVGASPGLPDCAWQDYLDHKRQGRPSISFRKPKAKPTPPKRSSSLRKSDGNADISEKKEPKISSGQHLPHSSREMKLPLDFANTPSRMENANLPTKQEPSWINQSEQGIKEPQLDASDIPPFKDEVAESTHYADLWLLNDLKTNDPYRSLSNSSTATGTTVIECIKSPESSESQTSQSESRATTPSLPSVDNEFKLASPEKLAGLASPSSGYSSQSETPTSSFPTAFFSGPLSPGGSKRKPKVPERKSSLQQPSLKDGTISLSKDLELPIIPPTHLDLSALHNVLNKPFHHRHPLHVFTHNKQNTVGETLRSNPPPSLAITPTILKSVNLRSINKSEEVKQKEENNTDLPYLEESTLTTAALSPSKIRPHTANKSVSRQYSTEDTILSFLDSSAVEMGPDKLHLEKNSTFDVKNRCDPETITSAGSSLLDSNVTKDQVRTETEPIPENTPTKNCAFPTEGFQRVSAARPNDLDGKIIQYGPGPDETLEQVQKAPSAGLEEVAQPESVDVITSQSDSPTRATDVSNQFKHQFVMSRHHDKVPGTISYESEITSVNSFPEKCSKQENIASGISAKSASDNSKAEETQGNVDEASLKESSPSDDSIISPLSEDSQAEAEGVFVSPNKPRTTEDLFAVIHRSKRKVLGRKDSGDMSVRSKSRAPLSSSSSSASSITSPSSNVTTPNSQRSPGLIYRNAKKSNTSNEEFKLLLLKKGSRSDSSYRMSATEILKSPILPKPPGELTAESPQSTDDAHQGSQGAEALSPLSPCSPRVNAEGFSSKSFATSASARVGRSRAPPAASSSRYSVRCRLYNTPMQAISEGETENSDGSPHDDRSSQSST,mutated_sequence,1.0,1630.0,UPI00001DFBF3.a2m,UPI00001DFBF3.npy,gnomAD
+UPI00001D68E9,UPI00001D68E9.csv,MEAFPWAPRSPRRGRAPPPMALVPSARYVSAPGPAHPQPFSSWNDYLGLATLITKAVDGEPRFGCARGGNGGGGSPPSSSSSSCCSPHTGAGPGALGPALGPPDYDEDDDDDSDEPGSRGRYLGSALELRALELCAGPAEAGLLEERFAELSPFAGRAAAVLLGCAPAAAAAATTTSEATPREERAPAWAAEPRLHAASGAAAARLLKPELQVCVFCRNNKEAMALYTTHILKGPDGRVLCPVLRRYTCPLCGASGDNAHTIKYCPLSKVPPPPARPPPRSARDGPPGKKLR,mutated_sequence,1.0,292.0,UPI00001D68E9.a2m,UPI00001D68E9.npy,gnomAD
+UPI00001AED69,UPI00001AED69.csv,MEGGGKPNSSSNSRDDGNSVFPAKASATGAGPAAAEKRLGTPPGGGGAGAKEHGNSVCFKVDGGGGGGGGGGGGEEPAGGFEDAEGPRRQYGFMQRQFTSMLQPGVNKFSLRMFGSQKAVEKEQERVKTAGFWIIHPYSDFRFYWDLIMLIMMVGNLVIIPVGITFFTEQTTTPWIIFNVASDTVFLLDLIMNFRTGTVNEDSSEIILDPKVIKMNYLKSWFVVDFISSIPVDYIFLIVEKGMDSEVYKTARALRIVRFTKILSLLRLLRLSRLIRYIHQWEEIFHMTYDLASAVVRIFNLIGMMLLLCHWDGCLQFLVPLLQDFPPDCWVSLNEMVNDSWGKQYSYALFKAMSHMLCIGYGAQAPVSMSDLWITMLSMIVGATCYAMFVGHATALIQSLDSSRRQYQEKYKQVEQYMSFHKLPADMRQKIHDYYEHRYQGKIFDEENILNELNDPLREEIVNFNCRKLVATMPLFANADPNFVTAMLSKLRFEVFQPGDYIIREGAVGKKMYFIQHGVAGVITKSSKEMKLTDGSYFGEICLLTKGRRTASVRADTYCRLYSLSVDNFNEVLEEYPMMRRAFETVAIDRLDRIGKKNSILLQKFQKDLNTGVFNNQENEILKQIVKHDREMVQAIAPINYPQMTTLNSTSSTTTPTSRMRTQSPPVYTATSLSHSNLHSPSPSTQTPQPSAILSPCSYTTAVCSPPVQSPLAARTFHYASPTASQLSLMQQQPQQQVQQSQPPQTQPQQPSPQPQTPGSSTPKNEVHKSTQALHNTNLTREVRPLSASQPSLPHEVSTLISRPHPTVGESLASIPQPVTAVPGTGLQAGGRSTVPQRVTLFRQMSSGAIPPNRGVPPAPPPPAAALPRESSSVLNTDPDAEKPRFASNL,mutated_sequence,1.0,890.0,UPI00001AED69.a2m,UPI00001AED69.npy,gnomAD
+UPI00004C3E05,UPI00004C3E05.csv,MRGGKCNMLSSLGCLLLCGSITLALGNAQKLPKGKRPNLKVHINTTSDSILLKFLRPSPNVKLEGLLLGYGSNVSPNQYFPLPAEGKFTEAIVDAEPKYLIVVRPAPPPSQKKSCSGKTRSRKPLQLVVGTLTPSSVFLSWGFLINPHHDWTLPSHCPNDRFYTIRYREKDKEKKWIFQICPATETIVENLKPNTVYEFGVKDNVEGGIWSKIFNHKTVVGSKKVNGKIQSTYDQDHTVPAYVPRKLIPITIIKQVIQNVTHKDSAKSPEKAPLGGVILVHLIIPGLNETTVKLPASLMFEISDALKTQLAKNETLALPAESKTPEVEKISARPTTVTPETVPRSTKPTTSSALDVSETTLVLSKRTPETLQTILIPQFELPLSTLAPKSLPEFPEAKTPFPFEKPRGTLASSEKPWIVPTAKISEDSKVLQPQTATYDVFSSPTTSDEPEISDSYTATSDRILDSIPPKTSRTLEQPRATLAPSETPFVPQKLEIFTSPEMQPTTPAPQQTTSIPSTPKRRPRPKPPRTKPERTTSAGTITPKISKSPEPTWTTPAPGKTQFISLKPKIPLSPEVTHTKPAPEPQTLLPSQSTIGPETPGTKPSTTLAPRKTKRPGRRPRPRPRPKTTPSPEVPKSKPALEPATIQPEPLVPTTASKPSERPKTTHRPDAPQIQPGSKPPKQLLPKPQTTAEPDMPPTKSVSEPVPFETEAPSMTIVPTTDIEPVTVRTEATVTTLAPKTSQRTRTRRPRPKHKTTPRPETLQTKLDFGPITPGTSSAPTTTTKRTRRPHPKPKTTPHPEVPQTKLATKTSKRTRPPRPRPKTTPSPQAPETKPVPATVLEPVTLRPEASTTLASKTSQRTRRPRLRTKTTPRPEAPESKPVPTAELKPVTLRTETWVTTQAPKTSQRTRRPRPKTKTTPSPEVPQTKLVPSTDLEPGTLRTEAPKTMVVTTVLEPDTFRTKFPETTLAPKTQRTRRPRPRPKTTSSPEVPQNKSGKCMNPWLK,mutated_sequence,1.0,1005.0,UPI00004C3E05.a2m,UPI00004C3E05.npy,gnomAD
+UPI000012ADD0,UPI000012ADD0.csv,MNLPRAERPRSTPQRSLRDSDGEDGKIDVLGEEEDEDEVEDEEEEASQKFLEQSLQPGLQVARWGGVALPREHIEGGGPSDPSEFGTEFRAPPRSAAASEDARQPAKPPYSYIALITMAILQSPHKRLTLSGICAFISGRFPYYRRKFPAWQNSIRHNLSLNDCFVKIPREPGHPGKGTYWSLDPASQDMFDNGSFLRRRKRFKRHQLTPGAHLPHPFPLPAAHAALHNPRPGPLLGAPALPQPVPGAYPNTAPGRRPYALLHPHPPRYLLLSAPAYAGAPKKAEGADLATPGTLPVLQPSLGPQPWEEGKGLASPPGGGCISFSIESIMQGVRGAGTGAAQSLSPTAWSYCPLLQRPSSLSDNFAATAAASGGGLRQRLRSHQGRGAGRAPVGRVGAAAVSGGGRGL,mutated_sequence,1.0,408.0,UPI000012ADD0.a2m,UPI000012ADD0.npy,gnomAD
+UPI000012BB5B,UPI000012BB5B.csv,MAARKGRRRTCETGEPMEAESGDTSSEGPAQVYLPGRGPPLREGEELVMDEEAYVLYHRAQTGAPCLSFDIVRDHLGDNRTELPLTLYLCAGTQAESAQSNRLMMLRMHNLHGTKPPPSEGSDEEEEEEDEEDEEERKPQLELAMVPHYGGINRVRVSWLGEEPVAGVWSEKGQVEVFALRRLLQVVEEPQALAAFLRDEQAQMKPIFSFAGHMGEGFALDWSPRVTGRLLTGDCQKNIHLWTPTDGGSWHVDQRPFVGHTRSVEDLQWSPTENTVFASCSADASIRIWDIRAAPSKACMLTTATAHDGDVNVISWSRREPFLLSGGDDGALKIWDLRQFKSGSPVATFKQHVAPVTSVEWHPQDSGVFAASGADHQITQWDLAVERDPEAGDVEADPGLADLPQQLLFVHQGETELKELHWHPQCPGLLVSTALSGFTIFRTISV,mutated_sequence,1.0,446.0,UPI000012BB5B.a2m,UPI000012BB5B.npy,gnomAD
+UPI000003032B,UPI000003032B.csv,MSRSAAASGGPRRPERHLPPAPCGAPGPPETCRTEPDGAGTMNKLRQSLRRRKPAYVPEASRPHQWQADEDAVRKGTCSFPVRYLGHVEVEESRGMHVCEDAVKKLKAMGRKSVKSVLWVSADGLRVVDDKTKDLLVDQTIEKVSFCAPDRNLDKAFSYICRDGTTRRWICHCFLALKDSGERLSHAVGCAFAACLERKQRREKECGVTAAFDASRTSFAREGSFRLSGGGRPAEREAPDKKKAEAAAAPTVAPGPAQPGHVSPTPATTSPGEKGEAGTPVAAGTTAAAIPRRHAPLEQLVRQGSFRGFPALSQKNSPFKRQLSLRLNELPSTLQRRTDFQVKGTVPEMEPPGAGDSDSINALCTQISSSFASAGAPAPGPPPATTGTSAWGEPSVPPAAAFQPGHKRTPSEAERWLEEVSQVAKAQQQQQQQQQQQQQQQQQQQQAASVAPVPTMPPALQPFPAPVGPFDAAPAQVAVFLPPPHMQPPFVPAYPGLGYPPMPRVPVVGITPSQMVANAFCSAAQLQPQPATLLGKAGAFPPPAIPSAPGSQARPRPNGAPWPPEPAPAPAPELDPFEAQWAALEGKATVEKPSNPFSGDLQKTFEIEL,mutated_sequence,1.0,609.0,UPI000003032B.a2m,UPI000003032B.npy,gnomAD
+UPI000004A951,UPI000004A951.csv,MSTAPSLSALRSSKHSGGGGGGGGGGGADPAWTSALSGNSSGPGPGSSPAGSTKPFVHAVPPSDPLRQANRLPIKVLKMLTARTGHILHPEYLQPLPSTPVSPIELDAKKSPLALLAQTCSQIGKPDPSPSSKLSSVASNGGGAGGAGGGAAGDKDTKSGPLKLSDIGVEDKSSFKPYSKPGSDKKEPGGGGGGGGGGGGGGGGVSSEKSGFRVPSATCQPFTPRTGSPSSSASACSPGGMLSSAGGAPEGKDDKKDTDVGGGGKGTGGASAEGGPTGLAHGRISCGGGINVDVNQHPDGGPGGKALGSDCGGSSGSSSGSGPSAPTSSSVLGSGLVAPVSPYKPGQTVFPLPPAGMTYPGSLAGAYAGYPPQFLPHGVALDPTKPGSLVGAQLAAAAAGSLGCSKPAGSSPLAGASPPSVMTASLCRDPYCLSYHCASHLAGAAAASASCAHDPAAAAAALKSGYPLVYPTHPLHGVHSSLTAAAAAGATPPSLAGHPLYPYGFMLPNDPLPHICNWVSANGPCDKRFATSEELLSHLRTHTAFPGTDKLLSGYPSSSSLASAAAAAMACHMHIPTSGAPGSPGTLALRSPHHALGLSSRYHPYSKSPLPTPGAPVPVPAATGPYYSPYALYGQRLTTASALGYQ,mutated_sequence,1.0,646.0,UPI000004A951.a2m,UPI000004A951.npy,gnomAD
+UPI0000D61567,UPI0000D61567.csv,MESGAGKHWTEEEVKALLSVWAEKNIRKQLYGTLRNKGIFIYIAKRLQSLGVYRDWKQCWAKYKNLKYEYRTVKYAHNSGDSSKTMKFFHDLDVILQYEPATQFTEEDANGRYLETLSPSTAPETTEGKKKKKKKKKKKSTLVFLLFYSF,mutated_sequence,1.0,150.0,UPI0000D61567.a2m,UPI0000D61567.npy,gnomAD
+UPI0000037B1F,UPI0000037B1F.csv,MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI,mutated_sequence,1.0,228.0,UPI0000037B1F.a2m,UPI0000037B1F.npy,gnomAD
+UPI00004565DA,UPI00004565DA.csv,MTRGGPGGRPGLPQPPPLLLLLLLLPLLLVTAEPPKPAGVYYATAYWMPAEKTVQVKNVMDKNGDAYGFYNNSVKTTGWGILEIRAGYGSQTLSNEIIMFVAGFLEGYLTAPHMNDHYTNLYPQLITKPSIMDKVQDFMEKQDKWTRKNIKEYKTDSFWRHTGYVMAQIDGLYVGAKKRAILEGTKPMTLFQIQFLNSVGDLLDLIPSLSPTKNGSLKVFKRWDMGHCSALIKVLPGFENILFAHSSWYTYAAMLRIYKHWDFNVIDKDTSSSRLSFSSYPGFLESLDDFYILSSGLILLQTTNSVFNKTLLKQVIPETLLSWQRVRVANMMADSGKRWADIFSKYNSGTYNNQYMVLDLKKVKLNHSLDKGTLYIVEQIPTYVEYSEQTDVLRKGYWPSYNVPFHEKIYNWSGYPLLVQKLGLDYSYDLAPRAKIFRRDQGKVTDTASMKYIMRYNNYKKDPYSRGDPCNTICCREDLNSPNPSPGGCYDTKVADIYLASQYTSYAISGPTVQGGLPVFRWDRFNKTLHQGMPEVYNFDFITMKPILKLDIK,mutated_sequence,1.0,553.0,UPI00004565DA.a2m,UPI00004565DA.npy,gnomAD
+UPI000013DD75,UPI000013DD75.csv,MNGHSDEESVRNSSGESSQSDDDSGSASGSGSGSSSGSSSDGSSSQSGSSDSDSGSESGSQSESESDTSRENKVQAKPPKVDGAEFWKSSPSILAVQRSAILKKQQQQQQQQQHQASSNSGSEEDSSSSEDSDDSSSEVKRKKHKDEDWQMSGSGSPSQSGSDSESEEEREKSSCDETESDYEPKNKVKSRKPQNRSKSKNGKKILGQKKRQIDSSEEDDDEEDYDNDKRSSRRQATVNVSYKEDEEMKTDSDDLLEVCGEDVPQPEEEEFETIERFMDCRIGRKGATGATTTIYAVEADGDPNAGFEKNKEPGEIQYLIKWKGWSHIHNTWETEETLKQQNVRGMKKLDNYKKKDQETKRWLKNASPEDVEYYNCQQELTDDLHKQYQIVERIIAHSNQKSAAGYPDYYCKWQGLPYSECSWEDGALISKKFQACIDEYFSRNQSKTTPFKDCKVLKQRPRFVALKKQPSYIGGHEGLELRDYQLNGLNWLAHSWCKGNSCILADEMGLGKTIQTISFLNYLFHEHQLYGPFLLVVPLSTLTSWQREIQTWASQMNAVVYLGDINSRNMIRTHEWTHHQTKRLKFNILLTTYEILLKDKAFLGGLNWAFIGVDEAHRLKNDDSLLYKTLIDFKSNHRLLITGTPLQNSLKELWSLLHFIMPEKFSSWEDFEEEHGKGREYGYASLHKELEPFLLRRVKKDVEKSLPAKVEQILRMEMSALQKQYYKWILTRNYKALSKGSKGSTSGFLNIMMELKKCCNHCYLIKPPDNNEFYNKQEALQHLIRSSGKLILLDKLLIRLRERGNRVLIFSQMVRMLDILAEYLKYRQFPFQRLDGSIKGELRKQALDHFNAEGSEDFCFLLSTRAGGLGINLASADTVVIFDSDWNPQNDLQAQARAHRIGQKKQVNIYRLVTKGSVEEDILERAKKKMVLDHLVIQRMDTTGKTVLHTGSAPSSSTPFNKEELSAILKFGAEELFKEPEGEEQEPQEMDIDEILKRAETHENEPGPLTVGDELLSQFKVANFSNMDEDDIELEPERNSKNWEEIIPEDQRRRLEEEERQKELEEIYMLPRMRNCAKQISFNGSEGRRSRSRRYSGSDSDSISEGKRPKKRGRPRTIPRENIKGFSDAEIRRFIKSYKKFGGPLERLDAIARDAELVDKSETDLRRLGELVHNGCIKALKDSSSGTERTGGRLGKVKGPTFRISGVQVNAKLVISHEEELIPLHKSIPSDPEERKQYTIPCHTKAAHFDIDWGKEDDSNLLIGIYEYGYGSWEMIKMDPDLSLTHKILPDDPDKKPQAKQLQTRADYLIKLLSRDLAKKEALSGAGSSKRRKARAKKNKAMKSIKVKEEIKSDSSPLPSEKSDEDDDKLSESKSDGRERSKKSSVSDAPVHITASGEPVPISEESEELDQKTFSICKERMRPVKAALKQLDRPEKGLSEREQLEHTRQCLIKIGDHITECLKEYTNPEQIKQWRKNLWIFVSKFTEFDARKLHKLYKHAIKKRQESQQNSDQNSNLNPHVIRNPDVERLKENTNHDDSSRDSYSSDRHLTQYHDHHKDRHQGDSYKKSDSRKRPYSSFSNGKDHRDWDHYKQDSRYYSDREKHRKLDDHRSRDHRSNLEGSLKDRSHSDHRSHSDHRLHSDHRSSSEYTHHKSSRDYRYHSDWQMDHRASSSGPRSPLDQRSPYGSRSPFEHSVEHKSTPEHTWSSRKT,mutated_sequence,1.0,1710.0,UPI000013DD75.a2m,UPI000013DD75.npy,gnomAD
+UPI00001C2000,UPI00001C2000.csv,MEAAPGTPPPPPSESPPPPSPPPPSTPSPPPCSPDARPATPHLLHHRLPLPDDREDGELEEGELEDDGAEETQDTSGGPERSRKEKGEKHHSDSDEEKSHRRLKRKRKKEREKEKRRSKKRRKSKHKRHASSSDDFSDFSDDSDFSPSEKGHRKYREYSPPYAPSHQQYPPSHATPLPKKAYSKMDSKSYGMYEDYENEQYGEYEGDEEEDMGKEDYDDFTKELNQYRRAKEGSSRGRGSRGRGRGYRGRGSRGGSRGRGMGRGSRGRGRGSMGGDHPEDEEDFYEEEMDYGESEEPMGDDDYDEYSKELNQYRRSKDSRGRGLSRGRGRGSRGRGKGMGRGRGRGGSRGGMNKGGMNDDEDFYDEDMGDGGGGSYRSRDHDKPHQQSDKKGKVICKYFVEGRCTWGDHCNFSHDIELPKKRELCKFYITGFCARAENCPYMHGDFPCKLYHTTGNCINGDDCMFSHDPLTEETRELLDKMLADDAEAGAEDEKEVEELKKQGINPLPKPPPGVGLLPTPPRPPGPQAPTSPNGRPMQGGPPPPPPPPPPPPGPPQMPMPVHEPLSPQQLQQQDMYNKKIPSLFEIVVRPTGQLAEKLGVRFPGPGGPPGPMGPGPNMGPPGPMGGPMHPDMHPDMHPDMHPDMHADMHADMPMGPGMNPGPPMGPGGPPMMPYGPGDSPHSGMMPPIPPAQNFYENFYQQQEGMEMEPGLLGDAEDYGHYEELPGEPGEHLFPEHPLEPDSFSEGGPPGRPKPGAGVPDFLPSAQRALYLRIQQKQQEEEERARRLAESSKQDRENEEGDTGNWYSSDEDEGGSSVTSILKTLRQQTSSRPPASVGELSSSGLGDPRLQKGHPTGSRLADPRLSRDPRLTRHVEASGGSGPGDSGPSDPRLARALPTSKPEGSLHSSPVGPSSSKGSGPPPTEEEEGERALREKAVNIPLDPLPGHPLRDPRSQLQQFSHIKKDVTLSKPSFARTVLWNPEDLIPLPIPKQDAVPPVPAALQSMPTLDPRLHRAATAGPPNARQRPGASTDSSTQGANLPDFELLSRILKTVNATGSSAAPGSSDKPSDPRVRKAPTDPRLQKPTDSTASSRAAKPGPAEAPSPTASPSGDASPPATAPYDPRVLAAGGLGQGGGGGQSSVLSGISLYDPRTPNAGGKATEPAADTGAQPKGAEGNGKSSASKAKEPPFVRKSALEQPETGKAGADGGTPTDRYNSYNRPRPKAAAAPAATTATPPPEGAPPQPGVHNLPVPTLFGTVKQTPKTGSGSPFAGNSPAREGEQDAASLKDVFKGFDPTASPFCQ,mutated_sequence,1.0,1303.0,UPI00001C2000.a2m,UPI00001C2000.npy,gnomAD
+UPI0001A5E4A3,UPI0001A5E4A3.csv,MKKKQTVQGTFSKLFGKKHTTTPSTSLYATNPPWIFTQEAPEEGTGGFDGIYYGDNRFNTVSESGTATLKARPRVRPLLTFLPLNAQENHGLAVPTPSVPDDFADKEVTGTSSLVNGNLRLYSSVGDLRPGQYGQDLLIPPPPPGPAPGPPQDISEPPGGSPLPSPPSTAPPPPPLLLEPPPPPSMAPPPPPVLEALSPPHTLSSPSIPTPPDFIPPAPPLAFLAPPPPPVPAPAPPAPASPHTVGTRLFPPGGVTKWKSDVALNGRQAEATRASPPRSPAEPKGSALGPNPEPHLTFPRSFKVPPPTPVRTSSIPVQEAQEAPRKEEGATKKAPSRLPLPPSFHIRPASQVYPDRAPEPDCPGELKATAPASPRLGQSQSQADERAGTPPPAPPLPPPAPPLPPPAPPLPPAAPPLPCAQKAAHPPAGFTKTPKSSSPALKPKPNPPSPENTASSAPVDWRDPSQMEKLRNELAAYLCGSRREDRFLSHRPGPTVAPQSKEGKKGPRLPEKETLLSLPAKDTPPGVPEKSLGGSSLTETEAAPSLTLPSVDYIPQDSPTPSVRQIRNELEARLSSAAEKEAKPSIGSLPPKPRLEGGRICENGADDDKLSKPVAKNLPPQSTTLLPTTSLQPKAMLGPAIPPKATPEPAIPPKATLWPATPPKATLGPATPLKATSGPTTPLKATSGPAIASTATTLPTTTSQLMAEKDSGPAGQPEKPASQEVSTPSQARGEGSPSEATRLPTQGARSSAAFPPKTSPGGGEVPCLYKPHCHQSSLSREVAVVMPTLARGGAAGPGEPVEVKEPPGLPAKPPASAQPTDELLRHPVTGEVVERGSPMALLLAARQRAQKGRSVGAALGRSSLPGSLRDHSHQAEASSDSIFHSQGTPNSFTVVPKLPKEAEKDSPLTTEIPNKWGPRLGRDAEGTELSRRHNWTKPEPQAPVAWERVAPSNLPQGHPLPKSFSSPPSPSNKREEEEEEFNFEVIPPPPEFSNDPEPPAPALQYLGRQSSPPRNNYSDLRQLPNAGPGAPPALGFSRFPAGARYAGAGGLERFSGGGRSLIKKRLYVGEPHRGPGLPHGGTGRSLSSPNCFGPQPGGPEMRRVNSAGRAPPGGLHAPRLSLEGAARGAAEAKHKAPGSADYGFAPAAGRSPYTTTRYGSPINTFTVRPGTRHPISYVCSGAHRKATS,mutated_sequence,1.0,1188.0,UPI0001A5E4A3.a2m,UPI0001A5E4A3.npy,gnomAD
+UPI00001AECDC,UPI00001AECDC.csv,MAPEMDQFYRSTMAIYKSIMEQFNPALENLVYLGNNYLRAFHALSEAAEVYFSAIQKIGERALQSPTSQILGEILVQMSDTQRHLNSDLEVVVQTFHGGLLQHMEKNTKLDMQFIKDSRQHYELEYRHRAANLEKCMSELWRMERKRDKNVREMKESVNRLHAQMQAFVSESQRAAELEEKRRYRFLAEKHLLLSNTFLQFFGRARGMLQNRVLLWKEQSEASRSPSRAHSPGLLGPALGPPYPSGRLTPTCLDMPPRPLGEFSSPRSRHGSGSYGTEPDARPASQLEPDRRSLPRTPSASSLYSGSAQSSRSNSFGERPGGGGGARRVRALVSHSEGANHTLLRFSAGDVVEVLVPEAQNGWLYGKLEGSSASGWFPEAYVKALEEGPVNPMTPVTPMTSMTSMSPMTPMNPGNELPSRSYPLRGSHSLDDLLDRPGNSIAPSEYWDGQSRSRTPSRVPSRAPSPAPPPLPSSRRSSMGSTAVATDVKKLMSSEQYPPQELFPRGTNPFATVKLRPTITNDRSAPLIR,mutated_sequence,1.0,529.0,UPI00001AECDC.a2m,UPI00001AECDC.npy,gnomAD
+UPI000002ADD2,UPI000002ADD2.csv,MGRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIALIIFNSSNKLFQYASTDMDKVLLKYTEYNEPHESRTNSDIVETLRKKGLNGCESPDADDYFEHSPLSEDRFSKLNEDSDFIFKRGPPGLPPQNFSMSVTVPVTSPNALSYTNPGSSLVSPSLAASSTLTDSSMLSPPQTTLHRNVSPGAPQRPPSTGNAGGMLSTTDLTVPNGAGSSPVGNGFVNSRASPNLIGATGANSLGKVMPTKSPPPPGGGNLGMNSRKPDLRVVIPPSSKGMMPPLSEEEELELNTQRISSSQATQPLATPVVSVTTPSLPPQGLVYSAMPTAYNTDYSLTSADLSALQGFNSPGMLSLGQVSAWQQHHLGQAALSSLVAGGQLSQGSNLSINTNQNISIKSEPISPPRDRMTPSGFQQQQQQQQQQQPPPPPQPQPQPPQPQPRQEMGRSPVDSLSSSSSSYDGSDREDPRGDFHSPIVLGRPPNTEDRESPSVKRMRMDAWVT,mutated_sequence,1.0,505.0,UPI000002ADD2.a2m,UPI000002ADD2.npy,gnomAD
+UPI0000041CE2,UPI0000041CE2.csv,MANLTIVTEFILMGFSTNKNMCILHSILFLLIYLCALMGNVLIIMITTLDHHLHTPVYFFLKNLSFLDLCLISVTAPKSIANSLIHNNSISFLGCVSQVFLLLSSASAELLLLTVMSFDRYTAICHPLHYDVIMDRSTCVQRATVSWLYGGLIAVMHTAGTFSLSYCGSNMVHQFFCDIPQLLAISCSENLIREIALILINVVLDFCCFIVIIITYVHVFSTVKKIPSTEGQSKAYSICLPHLLVVLFLSTGFIAYLKPASESPSILDAVISVFYTMLPPTFNPIIYSLRNKAIKVALGMLIKGKLTKK,mutated_sequence,1.0,309.0,UPI0000041CE2.a2m,UPI0000041CE2.npy,gnomAD
+UPI00001A9480,UPI00001A9480.csv,MDAPRASAAKPPTGRKMKARAPPPPGKAATLHVHSDQKPPHDGALGSQQNLVRMKEALRASTMDVTVVLPSGLEKRSVLNGSHAMMDLLVELCLQNHLNPSHHALEIRSSETQQPLSFKPNTLIGTLNVHTVFLKEKVPEEKVKPGPPKVPEKSVRLVVNYLRTQKAVVRVSPEVPLQNILPVICAKCEVSPEHVVLLRDNIAGEELELSKSLNELGIKELYAWDNRRETFRKSSLGNDETDKEKKKFLGFFKVNKRSNSKGCLTTPNSPSMHSRSLTLGPSLSLGSISGVSVKSEMKKRRAPPPPGSGPPVQDKASEKVSLGSQIDLQKKKRRAPAPPPPQPPPPSPLIPNRTEDKEENRKSTMVSLPLGSGSHCSPDGAPQVLSEAEETVSVGSCFASEDTTEDSGVMSSPSDIVSLDSQQDSMKYKDKWATDQEDCSDQDLAGTPDLGPQKSPLWEKNGSENSHLRTEKAVTASNDEEDLLIAGEFRKTLAELDEDLEEMEDSYETDTSSLTSSIHGASNHCPQDAMIPHGDTDAIPVTFIGEVSDDPVDSGLFSNRNNNAGSFDSEGVASRRDSLAPLQAEHSQPHEKAREEVPALHPASHDVGKGIRVALSNISKDGNLMETAPRVTSFASNLHTDNLNAKVKDKVYGCADGERTQATERVNSQPVNEKDSNDKNAALAPTSWHQRGQNPGKSYRLKHGLTTYKIIPPKSEMRCYDRDVSLSTGAIKIDELGNLVSPHATGIRIISLSSSVPEAESQPIGKVREFWRCNSVEKHLGRPSESSARGPPSTPVPTQTQNPESRLQADPKPISPQQKSAHHEGRNPLGEGRNQPPTMGMGHVRVPAAHTTEVTFLKPQRRTSSQYVASAIAKRIGAPKVHADVVRPHGYAEKGYAGKAPVLAAPPVTVKDDRTSSPHSETQGWKDGAQWPCVTPPNNHGEDLAVGAPPRGEVIGPHRKLSTQDRPAAIHRSSCFSLVQSSQRDRVSVGQSCGFSGKQSTSSQEASSASEPRRAPDGTDPPPPHTSDTQACSRELVNGSVRAPGHGEPSHPPGGSGTESHILLEREEKPSVFSTDGNETDSIWPPSIFGPKKKFKPVVQRPVPKDTSLHSALMEAIHSAGGKDRLRKTAEHTGEGRPAKLSYTEAEGERSALLAAIRGHSGTCSLRKVASSASEELQSFRDAALSAQGSESPLLEDLGLLSPPAIPPPPPPPSQALSAPRTASRFSTGTLSNTADARQALMDAIRSGTGAARLRKVPLLV,mutated_sequence,1.0,1261.0,UPI00001A9480.a2m,UPI00001A9480.npy,gnomAD
+UPI00025A2EC3,UPI00025A2EC3.csv,MAPKKPEPKKEAAKPAPAPAPAPAPAPAPAPEAPKEPAFDPKSVKIDFTADQIEGEYGQPHLSVSIALSWRRRRRRKKRRSSCPHEFKEAFSLFDRTPTGEMKITYGQCGDVLRALGQNPTNAEVLRVLGKPKPEEMNVKMLDFETFLPILQHISRNKEQGTYEDFVEGLRVF,mutated_sequence,1.0,173.0,UPI00025A2EC3.a2m,UPI00025A2EC3.npy,gnomAD
+UPI0001B79574,UPI0001B79574.csv,MTHEVLSTFIPSEGSSSKVEEVDLQDFKLDVQHVIEVDEGNTAVIACHLPESHPKAQVRYSVKQE,mutated_sequence,1.0,65.0,UPI0001B79574.a2m,UPI0001B79574.npy,gnomAD
+UPI000022A768,UPI000022A768.csv,MKDSEGPQRPPLCFLSTLLSQKVPEKSDAVLRCIISGQPKPEVTWYKNGQAIDGSGIISNYEFFENQYIHVLHLSCCTKNDAAVYQISAKNSFGMICCSASVEVECSSENPQLSPNLEDDRDRGWKHETGTHEEERANQIDEKEHPYKEEESISPGTPRSADSSPSKSNHSLSLQSLGNLDISVSSSENPLGVKGTRHTGEAYDPSNTEEIANGLLFLNSSHIYEKQDRCCHKTVHSMASKFTDGDLNNDGPHDEGLRSSQQNPKVQKYISFSLPLSEATAHIYPGDSAVANKQPSPQLSSEDSDSDYELCPEITLTYTEEFSDDDLEYLECSDVMTDYSNAVWQRNLLGTEHVFLLESDDEEMEFGEHCLGGCEHFLSGMGCGSRVSGDAGPMVATAGFCGHHSQPQEVGVRSSRVSKHGPSSPQTGMTLILGPHQDGTSSVTEQGRYKLPTAPEAAENDYPGIQGETRDSHQAREEFASDNLLNMDESVRETEMKLLSGESENSGMSQCWETAADKRVGGKDLWSKRGSRKSARVRQPGMKGNPKKPNANLRESTTEGTLHLCSAKESAEPPLTQSDKRETSHTTAAATGRSSHADARECAISTQAEQEAKTLQTSTDSVSKEGNTNCKGEGMQVNTLFETSQVPDWSDPPQVQVQETVRETISCSQMPAFSEPAGEESPFTGTTTISFSNLGGVHKENASLAQHSEVKPCTCGPQHEEKQDRDGNIPDNFREDLKYEQSISEANDETMSPGVFSRHLPKDARADFREPVAVSVASPEPTDTALTLENVCDEPRDREAVCAMECFEAGDQGTCFDTIDSLVGRPVDKYSPQEICSVDTELAEGQNKVSDLCSSNDKTLEVFFQTQVSETSVSTCKSSKDGNSVMSPLFTSTFTLNISHTASEGATGENLAKVENSTYPLASTVHAGQEQPSPSNSGGLDETQLLSSENNPLVQFKEGGDKSPSPSAADTTATPASYSSIVSFPWEKPTTLTANNECFQATRETEDTSTVTIATEVHPAKYLAVSIPEDKHAGGTEERFPRASHEKVSQFPSQVQLDHILSGATIKSTKELLCRAPSVPGVPHHVLQLPEGEGFCSNSPLQVDNLSGDKSQTVDRADFRSYEENFQERGSETKQGVQQQSLSQQGSLSAPDFQQSLPTTSAAQEERNLVPTAHSPASSREGAGQRSGWGTRVSVVAETAGEEDSQALSNVPSLSDILLEESKEYRPGNWEAGNKLKIITLEASASEIWPPRQLTNSESKASDGGLIIPDKVWAVPDSLKADAVVPELAPSEIAALAHSPEDAESALADSRESHKGEEPTISVHWRSLSSRGFSQPRLLESSVDPVDEKELSVTDSLSAASETGGKENVNNVSQDQEEKQLKMDHTAFFKKFLTCPKILESSVDPIDEISVIEYTRAGKPEPSETTPQGAREGGQSNDGNMGHEAEIQPAILQVPCLQGTILSENRISRSQEGSMKQEAEQIQPEEAKTAIWQVLQPSEGGERIPSGCSIGQIQESSDGSLGEAEQSKKDKAELISPTSPLSSCLPIMTHASLGVDTHNSTGQIHDVPENDIVEPRKRQYVFPVSQKRGTIENERGKPLPSSPDLTRFPCTSSPEGNVTDFLISHKMEEPKIEVLQIGETKPPSSSSSSAKTLAFISGERELEKAPKLLQDPCQKGTLGCAKKSREREKSLEARAGKSPGTLTAVTGSEEVKRKPEAPGSGHLAEGVKKKILSRVAALRLKLEEKENIRKNSAFLKKMPKLETSLSHTEEKQDPKKPSCKREGRAPVLLKKIQAEMFPEHSGNVKLSCQFAEIHEDSTICWTKDSKSIAQVQRSAGDNSTVSFAIVQASPKDQGLYYCCIKNSYGKVTAEFNLTAEVLKQLSSRQDTKGCEEIEFSQLIFKEDFLHDSYFGGRLRGQIATEELHFGEGVHRKAFRSTVMHGLMPVFKPGHACVLKVHNAIAYGTRNNDELIQRNYKLAAQECYVQNTARYYAKIYAAEAQPLEGFGEVPEIIPIFLIHRPENNIPYATVEEELIGEFVKYSIRDGKEINFLRRESEAGQKCCTFQHWVYQKTSGCLLVTDMQGVGMKLTDVGIATLAKGYKGFKGNCSMTFIDQFKALHQCNKYCKMLGLKSLQNNNQKQKQPSIGKSKVQTNSMTIKKAGPETPGEKKT,mutated_sequence,1.0,2170.0,UPI000022A768.a2m,UPI000022A768.npy,gnomAD
+UPI000046FD46,UPI000046FD46.csv,TEEEEKIKSQGQDVTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLKKFLEESVSMSPEERARYLENYDAIRVTHETSAHEGQTESSSPSSSQPHSSHCRTKASSLCHHASLPWVKRNHVGPAKATSPSLRLRRWRPFLRLPSLGLHSVAPSIDEKVDLHFIALVHVDGHLYELDGRKPFPINHGETSDETLLEDAIEVCKKFMERDPDELRFNAIALSAA,mutated_sequence,1.0,228.0,UPI000046FD46.a2m,UPI000046FD46.npy,gnomAD
+UPI0000126640,UPI0000126640.csv,MSERAADDVRGEPRRAAAAAGGAAAAAARQQQQQQQQQQPPPPQPQRQQHPPPPPRRTRPEDGGPGAASTSAAAMATVGERRPLPSPEVMLGQSWNLWVEASKLPGKDGTELDESFKEFGKNREVMGLCREDMPIFGFCPAHDDFYLVVCNDCNQVVKPQAFQSHYERRHSSSSKPPLAVPPTSVFSFFPSLSKSKGGSASGSNRSSSGGVLSASSSSSKLLKSPKEKLQLRGNTRPMHPIQQSRVPHGRIMTPSVKVEKIHPKMDGTLLKSAVGPTCPATVSSLVKPGLNCPSIPKPTLPSPGQILNGKGLPAPPTLEKKPEDNSNNRKFLNKRLSEREFDPDIHCGVIDLDTKKPCTRSLTCKTHSLTQRRAVQGRRKRFDVLLAEHKNKTREKELIRHPDSQQPPQPLRDPHPAPPRTSQEPHQNPHGVIPSESKPFVASKPKPHTPSLPRPPGCPAQQGGSAPIDPPPVHESPHPPLPATEPASRLSSEEGEGDDKEESVEKLDCHYSGHHPQPASFCTFGSRQIGRGYYVFDSRWNRLRCALNLMVEKHLNAQLWKKIPPVPSTTSPISTRIPHRTNSVPTSQCGVSYLAAATVSTSPVLLSSTCISPNSKSVPAHGTTLNAQPAASGAMDPVCSMQSRQVSSSSSSPSTPSGLSSVPSSPMSRKPQKLKSSKSLRPKESSGNSTNCQNASSSTSGGSGKKRKNSSPLLVHSSSSSSSSSSSSHSMESFRKNCVAHSGPPYPSTVTSSHSIGLNCVTNKANAVNVRHDQSGRGPPTGSPAESIKRMSVMVNSSDSTLSLGPFIHQSNELPVNSHGSFSHSHTPLDKLIGKKRKCSPSSSSINNSSSKPTKVAKVPAVNNVHMKHTGTIPGAQGLMNSSLLHQPKARP,mutated_sequence,1.0,892.0,UPI0000126640.a2m,UPI0000126640.npy,gnomAD
+UPI00000389E8,UPI00000389E8.csv,MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSVVEFCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFLITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSRAANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFVVSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDNFDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSVVIVNFILSTTI,mutated_sequence,1.0,919.0,UPI00000389E8.a2m,UPI00000389E8.npy,gnomAD
+UPI000020E71F,UPI000020E71F.csv,MAASRSAGEAGPGGSQGRVVRMKRRGGRGPRRGPAGGGEKALKRLKLAVEEFVHATSEGEAPGGCEGRGAPVSFRPGGRKSRKELRKEKRHLRKARRLQRTAGPEQGPGLGGRSGAEEASGHRQDTEERARPAPSRDPSPPRKPRPSRVKAKATAATAKTRPSAAATAAARKRALLAANEEEDREIRKLERCLGLNKRKKKDGSSSVPLSFARDGLDYILGALESGKNSGLYDSSGEEEEDAGQTLPESDLESDSQDESEEEEEGDVEKEKKAQEAEAQSEDDDEDTEEEQGEEKEKGAQEKRRGKRVRFAEDEEKSENSSEDGDITDKSLCGSGEKYIPPHVRQAEETVDFKKKEELERLKKHVKGLLNRLSEPNMASISGQLEELYMAHSRKDMNDTLTSALMGACVTASAMPSRLMMEHVLLVSILHHTVGIEVGAHFLEAVVRKFDAIYKYGSEGKECDNLFTVIAHLYNFHVVQSLLIFDILKKLIGTFTEKDIELILLMLKNVGFSLRKDDALSLKELITEAQTKASGAGSEFQDQTRIRFMLETMLALKNNDMRKIPGYDPEPVEKLRKLQRALVRNAGSGSETQLRVSWDSVLSAEQTGRWWIVGSAWSGAPMIDNSHHTHLQKQLVGTVSSKILELARKQRMNTDIRRNIFCTIMTSEDFLDAFEKLLKLGLKDQQEREIIHVLMDCCLQEKTYNPFYAFLASKFCEYERRFQMTFQFSIWDKFRDLENLPATNFSNLVHLVAHLLKTKSLSLSILKVVEFSELDKPRVRFLRKVLSILLMETEVEDLSLIFTRVSDNPKLGVLREGLKLFISHFLLKNAQAHRSADEANVLREKADLATKCLQGKASLRM,mutated_sequence,1.0,860.0,UPI000020E71F.a2m,UPI000020E71F.npy,gnomAD
+UPI0000044645,UPI0000044645.csv,MPGRAEAGEAEEEAGAGSGSEAEEDALWERIEGVRHRLARALNPAKLTPYLRQCRVIDEQDEEEVLSTYRFPCRVNRTGRLMDILRCRGKRGYEAFLEALEFYYPEHFTLLTGQEPAQRCSMILDEEGPEGLTQFLMTEVRRLREARKSQLQREQQLQARGRVLEEERAGLEQRLRDQQQAQERCQRLREDWEAGSLELLRLKDENYMIAMRLAQLSEEKNSAVLRSRDLQLAVDQLKLKVSRLEEECALLRRARGPPPGAEEKEKEKEKEKEPDNVDLVSELRAENQRLTASLRELQEGLQQEASRPGAPGSERILLDILEHDWREAQDSRQELCQKLHAVQGELQWAEELRDQYLQEMEDLRLKHRTLQKDCDLYKHRMATVLAQLEEIEKERDQAIQSRDRIQLQYSQSLIEKDQYRKQVRGLEAERDELLTTLTSLEGTKALLEVQLQRAQGGTCLKACASSHSLCSNLSSTWSLSEFPSPLGGPEATGEAAVMGGPEPHNSEEATDSEKEINRLSILPFPPSAGSILRRQREEDPAPPKRSFSSMSDITGSVTLKPWSPGLSSSSSSDSVWPLGKPEGLLARGCGLDFLNRSLAIRVSGRSPPGGPEPQDKGPDGLSFYGDRWSGAVVRRVLSGPGSARMEPREQRVEAAGLEGACLEAEAQQRTLLWNQGSTLPSLMDSKACQSFHEALEAWAKGPGAEPFYIRANLTLPERADPHALCVKAQEILRLVDSAYKRRQEWFCTRVDPLTLRDLDRGTVPNYQRAQQLLEVQEKCLPSSRHRGPRSNLKKRALDQLRLVRPKPVGAPAGDSPDQLLLEPCAEPERSLRPYSLVRPLLVSALRPVVLLPECLAPRLIRNLLDLPSSRLDFQVCPAESLSGEELCPSSAPGAPKAQPATPGLGSRIRAIQESVGKKHCLLELGARGVRELVQNEIYPIVIHVEVTEKNVREVRGLLGRPGWRDSELLRQCRGSEQVLWGLPCSWVQVPAHEWGHAEELAKVVRGRILQEQARLVWVECGSSRGCPSSSEA,mutated_sequence,1.0,1032.0,UPI0000044645.a2m,UPI0000044645.npy,gnomAD
+UPI0001AE67FB,UPI0001AE67FB.csv,MFQRAQELRRRAEDYHKCKIPPSARKALCNWVRMAAAEHRHSSGLPYWPYLTAETLKNRMGHQPPPPTQQHSITDNSLSLKTPPECLLTPLPPSADDNLKTPPECVLTPLPPSADDNLKTPPECVLTPLPPSADDNLKTPPECLLTPLPPSADDNLKTPPECLLTPLPPSALPSAPPSADDNLKTRAECLLHPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERRTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERHIMPPPSADDNINTPAERLRGPLPPSADDNLKTPSERQLTPLPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKIPAERLRIPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPGFHPERMIISRHLPSVSSLPFHPQLHPQQMIISRHLPSVCGGRFHPQRMIISRHLPSVSSLPFHPQLHPQQMIISRHLPSVCGGRFHPQQMIISRHLPSVSSLPFHPQLHPQQMIISRHLPSVCGGRFHPQRMIISRHLPSVSSLPFHPQLHPQQMIISRHLPSVCGGRFHPQRMIISRHLPSVSSLPFHPQLHPQQMIISRHLPSVCGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPSERQLTPLPPSAPPSADDNIKTPAERLRGPLPPSADDNLKTPPLATQEAEAEKPRKPKRQRAAEMEPPPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGDVEPSRKPKRRRAADVEPSSPEPKRRRVGDVEPSRKPKRRRAADVEPSLPEPKRRRLS,mutated_sequence,1.0,982.0,UPI0001AE67FB.a2m,UPI0001AE67FB.npy,gnomAD
+UPI0000072AD5,UPI0000072AD5.csv,MAELGAGGDGHRGGDGAVRSETAPDSYKVQDKKNASSRPASAISGQNNNHSGNKPDPPPVLRVDDRQRLARERREEREKQLAAREIVWLEREERARQHYEKHLEERKKRLEEQRQKEERRRAAVEEKRRQRLEEDKERHEAVVRRTMERSQKPKQKHNRWSWGGSLHGSPSIHSADPDRRSVSTMNLSKYVDPVISKRLSSSSATLLNSPDRARRLQLSPWESSVVNRLLTPTHSFLARSKSTAALSGEAASCSPIIMPYKAAHSRNSMDRPKLFVTPPEGSSRRRIIHGTASYKKERERENVLFLTSGTRRAVSPSNPKARQPARSRLWLPSKSLPHLPGTPRPTSSLPPGSVKAAPAQVRPPSPGNIRPVKREVKVEPEKKDPEKEPQKVANEPSLKGRAPLVKVEEATVEERTPAEPEVGPAAPAMAPAPASAPAPASAPAPAPVPTPAMVSAPSSTVNASASVKTSAGTTDPEEATRLLAEKRRLAREQREKEERERREQEELERQKREELAQRVAEERTTRREEESRRLEAEQAREKEEQLQRQAEERALREREEAERAQRQKEEEARVREEAERVRQEREKHFQREEQERLERKKRLEEIMKRTRRTEATDKKTSDQRNGDIAKGALTGGTEVSALPCTTNAPGNGKPVGSPHVVTSHQSKVTVESTPDLEKQPNENGVSVQNENFEEIINLPIGSKPSRLDVTNSESPEIPLNPILAFDDEGTLGPLPQVDGVQTQQTAEVI,mutated_sequence,1.0,749.0,UPI0000072AD5.a2m,UPI0000072AD5.npy,gnomAD
+UPI0000246F54,UPI0000246F54.csv,MTQQPQDDFDRSVEDAQAWMKAVQDQLQVNDNTQGPRAALEARLWETEKICQLEPEGRVRVDLVLRMAEALLACCPGDQKPGILARLKDIKAQWEETVTYMTHCHSRIEWVWLHWSEYLLARDEFYRWFQKMMVTLEPHIELQLGLKEKQWQLSHAQVLLHNVDNQAVLLDRLLEEAASLFNRIGDPSVDEDAQKRMKAEYDAVKAKAQKRVDLLEQVAREHEEYQAGVDEFQLWLKAVVEKVNGCLGRNCKLPITQRLSTLQDIAKDFPRGEESLETLEEQSAGVIRNTSPLGAEKITGELEEMRKVLEKLRALWEEEEERLRGLLRSRGAWEQQIKQLEAELSEFRMVLQRLAQEGLQPAAKAGTEDELVAHWRRYSATRAALASEEPRVDRLQAQLKELIVFPHNLKPLSDSVIATIQEYQSLKVKSARLRNAAAVELWQHFQRPLQDLQLWKALAQRLLEVTASLPDLPSLHTFLPQIEAALMESSRLKELLTMLQLKKDLLIGIFGQERATALLEQVAGSMRDRDLLHNSLLQRKSKLQSLLAQHKDFGAAFEPLQRKLLDLQVRVQAEKGLQRDLPGKQAQLSRLQGLQEEGLDLGAQMEAARPLVQENPNHQHKMDQLSSDFQALQRSLEDLVDRCRQSVQEHCTFSHQLLELRQWIVVTTQKLEAHRGEAGPGDAESQEAEFERLVAEFPEKEAQLSLVEAQGWLVMEKSSPEGAAVVQEELRELAESWRALRLLEESLLSLIRNWHLQRMEVDSGKKMVFTNNIPKSGFLINPMDPIPRHRRRANLLQEEEGSHEDFSQLLRNFGQWLQVENSKLVRIIAMRTSTAEDLRTRKSKLQELEARVPEGQHLFENLLRLGPARGTSDELEDLRYQWMLYKSKLKDSGHLLTQSSPGEPTGFQKTRRWRGLGSLFRRACCVALPLQLLLLLFLLLLFLLPIREEDRSCTLANNFARSFTLMLRYNGPPPT,mutated_sequence,1.0,975.0,UPI0000246F54.a2m,UPI0000246F54.npy,gnomAD
+UPI0000E03260,UPI0000E03260.csv,MAFSQVQCLDDNHVNWRSSESKPEFFYSEEQRLALEALVARGRDAFYEVLKRENIRDFLSELELKRILETIEVYDPGSEDPRGTGPSQGPEDNGVGDGEEASGADGVPIEAEPLPSLEYWPQKSDRSIPQLDLGWPDTIAYRGVTRASVYMQPPIDGQAHIKEVVRKMISQAQKVIAVVMDMFTDVDIFKDLLDAGFKRKVAVYIIVDESNVKYFLHMCERACMHLGHLKNLRVRSSGGTEFFTRSATKFKGALAQKFMFVDGDRAVCGSYSFTWSAARTDRNVISVLSGQVVEMFDRQFQELYLMSHSVSLKGIPMEKEPEPEPIVLPSVVPLVPAGTVAKKLVNPKYALVKAKSVDEIAKISSEKQEAKKPLGLKGPALAEHPGELPELLPPIHPGLLHLERANMFEYLPTWVEPDPEPGSDILGYINIIDPNIWNPQPSQMNRIKIRDTSQASAQHQLWKQSQDSRPRPEPCPPPEPSAPQDGVPAENGLPQGDPEPLPPVPKPRTVPVADVLARDSSDIGWVLELPKEEAPQNGTDHRLPRMAGPGHAPLQRQLSVTQDDPESLGVGLPNGLDGVEEEDDDDYVTLSDQDSHSGSSGRGPGPRRPSVASSVSEEYFEVREHSVPLRRRHSEQVANGPTPPPRRQLSAPHITRGTFVGPQGGSPWAQSRGREEADALKRMQAQRSTDKEAQGQQFHHHRVPASGTRDKDGFPGPPRYRSAADSVQSSTRNAGPAMAGPHHWQAKGGQVPRLLPDPGSPRLAQNARPMTDGRATEEHPSPFGIPYSKLSQSKHLKARTGGSQWASSDSKRRAQAPRDRKDP,mutated_sequence,1.0,823.0,UPI0000E03260.a2m,UPI0000E03260.npy,gnomAD
+UPI0000424CC7,UPI0000424CC7.csv,METCDSPPISRQENGQSTSKLCGTTQLDNEVPEKVAGMEPDRENSSTDDNLKTDERKSEALLGFSVENAAATQVTSAKEIPCNECATSFPSLQKYMEHHCPNARLPVLKDDNESEISELEDSDVENLTGEIVYQPDGSAYIIEDSKESGQNAQTGANSKLFSTAMFLDSLASAGEKSDQSASAPMSFYPQIINTFHIASSLGKPFTADQAFPNTSALAGVGPVLHSFRVYDLRHKREKDYLTSDGSAKNSCVSKDVPNNVDLSKFDGCVSDGKRKPVLMCFLCKLSFGYIRSFVTHAVHDHRMTLNDEEQKLLSNKCVSAIIQGIGKDKEPLISFLEPKKSTSVYPHFSTTNLIGPDPTFRGLWSAFHVENGDSLPAGFAFLKGSASTSSSAEQPLGITQMPKAEVNLGGLSSLVVNTPITSVSLSHSSSESSKMSESKDQENNCERPKESNVLHPNGECPVKSEPTEPGDEDEEDAYSNELDDEEVLGELTDSIGNKDFPLLNQSISPLSSSVLKFIEKGTSSSSATVSDDTEKKKQTAAVRASGSVASNYGISGKDFADASASKDSATAAHPSEIARGDEDSSATPHQHGFTPSTPGTPGPGGDGSPGSGIECPKCDTVLGSSRSLGGHMTMMHSRNSCKTLKCPKCNWHYKYQQTLEAHMKEKHPEPGGSCVYCKTGQPHPRLARGESYTCGYKPFRCEVCNYSTTTKGNLSIHMQSDKHLNNVQNLQNGNGEQVFGHSAPAPNTSLSGCGTPSPSKPKQKPTWRCEVCDYETNVARNLRIHMTSEKHMHNMMLLQQNMKQIQHNLHLGLAPAEAELYQYYLAQNIGLTGMKLENPADPQLMINPFQLDPATAAALAPGLVNNELPPEIRLASGQLMGDDLSLLTAGELSPYISDPALKLFQCAVCNKFTSDSLEALSVHVSSERSLPEEEWRAVIGDIYQCKLCNYNTQLKANFQLHCKTDKHMQKYQLVAHIKEGGKSNEWRLKCIAIGNPVHLKCNACDYYTNSVDKLRLHTTNHRHEAALKLYKHLQKQEGAVNPESCYYYCAVCDYTTKVKLNLVQHVRSVKHQQTEGLRKLQLHQQGLAPEEDNLSEIFFVKDCPPNELETASLGARTCDDDLTEQQLRSTSEEQSEEAEGAIKPTAVAEDDEKDTSERDNSEGKNSNKDSGIITPEKELKVSVAGGTQPLLLAKEEDVATKRSKPTEDNKFCHEQFYQCPYCNYNSRDQSRIQMHVLSQHSVQPVICCPLCQDVLSNKMHLQLHLTHLHSVSPDCVEKLLMTVPVPDVMMPNSMLLPAAASEKSERDTPAAVTAEGSGKYSGESPMDDKSMAGLEDSKANVEVKNEEQKPTKEPLEVSEWNKNSSKDVKIPDTLQDQLNEQQKRQPLSVSDRHVYKYRCNHCSLAFKTMQKLQIHSQYHAIRAATMCNLCQRSFRTFQALKKHLEAGHPELSEAELQQLYASLPVNGELWAESETMSQDDHGLEQEMEREYEVDHEGKASPVGSDSSSIPDDMGSEPKRTLPFRKGPNFTMEKFLDPSRPYKCTVCKESFTQKNILLVHYNSVSHLHKLKKVLQEASSPVPQETNSNTDNKPYKCSICNVAYSQSSTLEIHMRSVLHQTKARAAKLEPSGHVAGGHSIAANVNSPGQGMLDSMSLAAVNSKDTHLDAKELNKKQTPDLISAQPAHHPPQSPAQIQMQLQHELQQQAAFFQPQFLNPAFLPHFPMTPEALLQFQQPQFLFPFYIPGTEFSLGPDLGLPGSATFGMPGMTGMAGSLLEDLKQQIQTQHHVGQTQLQILQQQAQQYQATQPQLQPQKQQQQPPPPQQQQQQQASKLLKQEQSNIVSADCQIMKDVPSYKEAEDISEKPEKPKQEFISEGEGLKEGKDTKKQKSLEPSIPPPRIASGARGNAAKALLENFGFELVIQYNENRQKVQKKGKSGEGENTDKLECGTCGKLFSNVLILKSHQEHVHGQFFPYAALEKFARQYREAYDKLYPISPSSPETPPPPPPPPPLPPAPPQPSSMGPVKIPNTVSTPLQAPPPTPPPPPPPPPPPPPPPPPPPPSAPPQVQLPVSLDLPLFPSIMMQPVQHPALPPQLALQLPQMDALSADLTQLCQQQLGLDPNFLRHSQFKRPRTRITDDQLKILRAYFDINNSPSEEQIQEMAEKSGLSQKVIKHWFRNTLFKERQRNKDSPYNFSNPPITVLEDIRIDPQPTSLEHYKSDASFSKRSSRTRFTDYQLRVLQDFFDTNAYPKDDEIEQLSTVLNLPTRVIVVWFQNARQKARKSYENQAETKDNEKRELTNERYIRTSNMQYQCKKCNVVFPRIFDLITHQKKQCYKDEDDDAQDESQTEDSMDATDQVVYKHCTVSGQTDAAKNAAAPAASSGSGTSTPLIPSPKPEPEKTSPKPEYPAEKPKQSDPSPPSQGTKPALPLASTSSDPPQASTAQPQPQPQPPKQPQLIGRPPSASQTPVPSSPLQISMTSLQNSLPPQLLQYQCDQCTVAFPTLELWQEHQHMHFLAAQNQFLHSPFLERPMDMPYMIFDPNNPLMTGQLLGSSLTQMPPQASSSHTTAPTTVAASLKRKLDDKEDNNCSEKEGGNSGEDQHRDKRLRTTITPEQLEILYEKYLLDSNPTRKMLDHIAREVGLKKRVVQVWFQNTRARERKGQFRAVGPAQSHKRCPFCRALFKAKSALESHIRSRHWNEGKQAGYSLPPSPLISTEDGGESPQKYIYFDYPSLPLTKIDLSSENELASTVSTPVSKTAELSPKNLLSPSSFKAECSEDVENLNAPPAEAGYDQNKTDFDETSSINTAISDATTGDEGNTEMESTTGSSGDVKPALSPKEPKTLDTLPKPATTPTTEVCDDKFLFSLTSPSIHFNDKDGDHDQSFYITDDPDDNADRSETSSIADPSSPNPFGSSNPFKSKSNDRPGHKRFRTQMSNLQLKVLKACFSDYRTPTMQECEMLGNEIGLPKRVVQVWFQNARAKEKKFKINIGKPFMINQGGTEGTKPECTLCGVKYSARLSIRDHIFSKQHISKVRETVGSQLDREKDYLAPTTVRQLMAQQELDRIKKASDVLGLTVQQPGMMDSSSLHGISLPTAYPGLPGLPPVLLPGMNGPSSLPGFPQNSNTLTPPGAGMLGFPTSATSSPALSLSSAPTKPLLQTPPPPPPPPPPPPSSSLSGQQTEQQNKESEKKQTKPNKVKKIKEEELEATKPEKHPKKEEKISSALSVLGKVVGETHVDPIQLQALQNAIAGDPASFIGGQFLPYFIPGFASYFTPQLPGTVQGGYFPPVCGMESLFPYGPTMPQTLAGLSPGALLQQYQQYQQNLQESLQKQQKQQQEQQQKPVQAKTSKVESDQPQNSNDASETKEDKSTATESTKEEPQLESKSADFSDTYVVPFVKYEFICRKCQMMFTDEDAAVNHQKSFCYFGQPLIDPQETVLRVPVSKYQCLACDVAISGNEALSQHLQSSLHKEKTIKQAMRNAKEHVRLLPHSVCSPNPNTTSTSQSAASSNNTYPHLSCFSMKSWPNILFQASARRAASPPSSPPSLSLPSTVTSSLCSTSGVQTSLPTESCSDESDSELSQKLEDLDNSLEVKAKPASGLDGNFNSIRMDMFSV,mutated_sequence,1.0,3616.0,UPI0000424CC7.a2m,UPI0000424CC7.npy,gnomAD
+UPI0000E5924D,UPI0000E5924D.csv,MRSPPCGLLRWFGGPLLASWCRCHLRFRAFGTSAGWYRAFPAPPPLLPPACPSPRDYRPHVSLSPFLSRPSRGGSSSSSSSRRRSPVAAVAGEPMAYSQGGGKKKVCYYYDGDIGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKATAEEMTKYHSDEYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVAGAVKLNRQQTDMAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNFPMRDGIDDESYGQIFKPIISKVMEMYQPSAVVLQCGADSLSGDRLGCFNLTVKGHAKCVEVVKTFNLPLLMLGGGGYTIRNVARCWTYETAVALDCEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTPEYMEKIKQRLFENLRMLPHAPGVQMQAIPEDAVHEDSGDEDGEDPDKRISIRASDKRIACDEEFSDSEDEGEGGRRNVADHKKGAKKARIEEDKKETEDKKTDVKEEDKSKDNSGEKTDTKGTKSEQLSNP,mutated_sequence,1.0,582.0,UPI0000E5924D.a2m,UPI0000E5924D.npy,gnomAD
+UPI0000231C71,UPI0000231C71.csv,MARSRLPATSLRKPWKLDRQKLPSPDSGHSLLCGWSPGGKARPAGNTGAWAPAEQFFPASNRTREGGGLWPPLPLQSSPAAPTMLDSSAAEQVTRLTLKLLGQKLEQERQNVEGGPEGLHLEPGNEDRPDDALQTALKRRRDLLQRLREQHLLDELSRAQAWSGPSRGALGSALPPELPPTGILPTASPSPLAPDPPRIILPTVPQPPATIIQQLPQQPLIAQIPPPQAFPTQRSGSIKEDMVELLLLQNAQVHQLVLQNWMLKALPPALQDPPHVPPRVPRAARPRLPAVHHHHHHHHAVWPPGAATVLQPAPSLWTPGPP,mutated_sequence,1.0,322.0,UPI0000231C71.a2m,UPI0000231C71.npy,gnomAD
+UPI000012666A,UPI000012666A.csv,MQAERGARGGRGRRPGRGRPGGDRHSERPGAAAAVARGGGGGGGGDGGGRRGRGRGRGFRGARGGRGGGGAPRGSRREPGGWGAGASAPVEDDSDAETYGEENDEQGNYSKRKIVSNWDRYQDIEKEVNNESGESQRGTDFSVLLSSAGDSFSQFRFAEEKEWDSEASCPKQNSAFYVDSELLVRALQELPLCLRLNVAAELVQGTVPLEVPQVKPKRTDDGKGLGMQLKGPLGPGGRGPIFELKSVAAGCPVLLGKDNPSPGPSRDSQKPTSPLQSAGDHLEEELDLLLNLDAPIKEGDNILPDQTSQDLKSKEDGEVVQEEEVCAKPSVTEEKNMEPEQPSTSKNVTEEELEDWLDSMIS,mutated_sequence,1.0,362.0,UPI000012666A.a2m,UPI000012666A.npy,gnomAD
+UPI000013E87D,UPI000013E87D.csv,MERAISPGLLVRALLLLLLLLGLAARTVAAGRARGLPAPTAEAAFGLGAAAAPTSATRVPAAGAVAAAEVTVEDAEALPAAAGEQEPRGPEPDDETELRPRGRSLVIISTLDGRIAALDPENHGKKQWDLDVGSGSLVSSSLSKPEVFGNKMIIPSLDGALFQWDQDRESMETVPFTVESLLESSYKFGDDVVLVGGKSLTTYGLSAYSGKVRYICSALGCRQWDSDEMEQEEDILLLQRTQKTVRAVGPRSGNEKWNFSVGHFELRYIPDMETRAGFIESTFKPNENTEESKIISDVEEQEAAIMDIVIKVSVADWKVMAFSKKGGHLEWEYQFCTPIASAWLLKDGKVIPISLFDDTSYTSNDDVLEDEEDIVEAARGATENSVYLGMYRGQLYLQSSVRISEKFPSSPKALESVTNENAIIPLPTIKWKPLIHSPSRTPVLVGSDEFDKCLSNDKFSHEEYSNGALSILQYPYDNGYYLPYYKRERNKRSTQITVRFLDNPHYNKNIRKKDPVLLLHWWKEIVATILFCIIATTFIVRRLFHPHPHRQRKESETQCQTENKYDSVSGEANDSSWNDIKNSGYISRYLTDFEPIQCLGRGGFGVVFEAKNKVDDCNYAIKRIRLPNRELAREKVMREVKALAKLEHPGIVRYFNAWLEAPPEKWQEKMDEIWLKDESTDWPLSSPSPMDAPSVKIRRMDPFATKEHIEIIAPSPQRSRSFSVGISCDQTSSSESQFSPLEFSGMDHEDISESVDAAYNLQDSCLTDCDVEDGTMDGNDEGHSFELCPSEASPYVRSRERTSSSIVFEDSGCDNASSKEEPKTNRLHIGNHCANKLTAFKPTSSKSSSEATLSISPPRPTTLSLDLTKNTTEKLQPSSPKVYLYIQMQLCRKENLKDWMNGRCTIEERERSVCLHIFLQIAEAVEFLHSKGLMHRDLKPSNIFFTMDDVVKVGDFGLVTAMDQDEEEQTVLTPMPAYARHTGQVGTKLYMSPEQIHGNSYSHKVDIFSLGLILFELLYPFSTQMERVRTLTDVRNLKFPPLFTQKYPCEYVMVQDMLSPSPMERPEAINIIENAVFEDLDFPGKTVLRQRSRSLSSSGTKHSRQSNNSHSPLPSN,mutated_sequence,1.0,1116.0,UPI000013E87D.a2m,UPI000013E87D.npy,gnomAD
+UPI0000072AAB,UPI0000072AAB.csv,MAAVKEPLEFHAKRPWRPEEAVEDPDEEDEDNTSEAENGFSLEEVLRLGGTKQDYLMLATLDENEEVIDGGKKGAIDDLQQGELEAFIQNLNLAKYTKASLVEEDEPAEKENSSKKEVKIPKINNKNTAESQRTSVNKVKNKNRPEPHSDENGSTTPKVKKDKQNIFEFFERQTLLLRPGGKWYDLEYSNEYSLKPQPQDVVSKYKTLAQKLYQHEINLFKSKTNSQKGASSTWMKAIVSSGTLGDRMAAMILLIQDDAVHTLQFVETLVNLVKKKGSKQQCLMALDTFKELLITDLLPDNRKLRIFSQRPFDKLEQLSSGNKDSRDRRLILWYFEHQLKHLVAEFVQVLETLSHDTLVTTKTRALTVAHELLCNKPEEEKALLVQVVNKLGDPQNRIATKASHLLETLLCKHPNMKGVVSGEVERLLFRSNISSKAQYYAICFLNQMALSHEESELANKLITVYFCFFRTCVKKKDVESKMLSALLTGVNRAYPYSQTGDDKVREQIDTLFKVLHIVNFNTSVQALMLLFQVMNSQQTISDRYYTALYRKMLDPGLMTCSKQAMFLNLVYKSLKADIVLRRVKAFVKRLLQVTCQQMPPFICGALYLVSEILKAKPGLRSQLDDHPESDDEENFIDANDDEDMEKFTDADKETEIVKKLETEETVPETDVETKKPEVASWVHFDNLKGGKQLNKYDPFSRNPLFCGAENTSLWELKKLSVHFHPSVALFAKTILQGNYIQYSGDPLQDFTLMRFLDRFVYRNPKPHKGKENTDSVVMQPKRKHFIKDIRHLPVNSKEFLAKEESQIPVDEVFFHRYYKKVAVKEKQKRDADEESIEDVDDEEFEELIDTFEDDNCFSSGKDDMDFAGNVKKRTKGAKDNTLDEDSEGSDDELGNLDDDEVSLGSMDDEEFAEVDEDGGTFMDVLDDESESVPELEVHSKVSTKKSKRKGTDDFDFAGSFQGPRKKKRNLNDSSLFVSAEEFGHLLDENMGSKFDNIGMNAMANKDNASLKQLRWEAERDDWLHNRDAKSIIKKKKHFKKKRIKTTQKTKKQRK,mutated_sequence,1.0,1054.0,UPI0000072AAB.a2m,UPI0000072AAB.npy,gnomAD
+UPI0000205CB5,UPI0000205CB5.csv,MYHGMNPSNGDGFLEQQQQQQQPQSPQRLLAVILWFQLALCFGPAQLTGGFDDLQVCADPGIPENGFRTPSGGVFFEGSVARFHCQDGFKLKGATKRLCLKHFNGTLGWIPSDNSICVQEDCRIPQIEDAEIHNKTYRHGEKLIITCHEGFKIRYPDLHNMVSLCRDDGTWNNLPICQGCLRPLASSNGYVNISELQTSFPVGTVISYRCFPGFKLDGSAYLECLQNLIWSSSPPRCLALEVCPLPPMVSHGDFVCHPRPCERYNHGTVVEFYCDPGYSLTSDYKYITCQYGEWFPSYQVYCIKSEQTWPSTHETLLTTWKIVAFTATSVLLVLLLVILARMFQTKFKAHFPPRGPPRSSSSDPDFVVVDGVPVMLPSYDEAVSGGLSALGPGYMASVGQGCPLPVDDQSPPAYPGSGDTDTGPGESETCDSVSGSSELLQSLYSPPRCQESTHPASDNPDIIASTAEEVASTSPGIDIADEIPLMEEDP,mutated_sequence,1.0,490.0,UPI0000205CB5.a2m,UPI0000205CB5.npy,gnomAD
+UPI0000212786,UPI0000212786.csv,MEPEDLPWPGELEEEEEEEEEEEEEEEEAAAAAAANVDDVVVVEEVEEEAGRELDSDSHYGPQHLESIDDEEDEEAKAWLQAHPGRILPPLSPPQHRYSEGERTSLEKIVPLTCHVWQQIVYQGNSRTQISDTNVVCLETTAQRGSGDDQKTESWHCLPQEMDSSQTLDTSQTRFNVRTEDTEVTDFPSLEEGILTQSENQVKEPNRDLFCSPLLVIQDSFASPDLPLLTCLTQDQEFAPDSLFHQSELSFAPLRGIPDKSEDTEWSSRPSEVSEALFQATAEVASDLASSRFSVSQHPLIGSTAVGSQCPFLPSEQGNNEETISSVDELKIPKDCDRYDDLCSYMSWKTRKDTQWPENNLADKDQVSVATSFDITDENIATKRSDHFDAARSYGQYWTQEDSSKQAETYLTKGLQGKVESDVITLDGLNENAVVCSERVAELQRKPTRESEYHSSDLRMLRMSPDTVPKAPKHLKAGDTSKGGIAKVTQSNLKSGITTTPVDSDIGSHLSLSLEDLSQLAVSSLETTTGQHTDTLNQKTLADTHLTEETLKVTAIPEPADQKTATPTVLSSSHSHRGKPSIFYQQGLPDSHLTEEALKVSAAPGLADQTTGMSTLTSTSYSHREKPGTFYQQELPESNLTEEPLEVSAAPGPVEQKTGIPTVSSTSHSHVEDLLFFYRQTLPDGHLTDQALKVSAVSGPADQKTGTATVLSTPHSHREKPGIFYQQEFADSHQTEETLTKVSATPGPADQKTEIPAVQSSSYSQREKPSILYPQDLADSHLPEEGLKVSAVAGPADQKTGLPTVPSSAYSHREKLLVFYQQALLDSHLPEEALKVSAVSGPADGKTGTPAVTSTSSASSSLGEKPSAFYQQTLPNSHLTEEALKVSIVPGPGDQKTGIPSAPSSFYSHREKPIIFSQQTLPDFLFPEEALKVSAVSVLAAQKTGTPTVSSNSHSHSEKSSVFYQQELPDSDLPRESLKMSAIPGLTDQKTVPTPTVPSGSFSHREKPSIFYQQEWPDSYATEKALKVSTGPGPADQKTEIPAVQSSSYPQREKPSVLYPQVLSDSHLPEESLKVSAFPGPADQMTDTPAVPSTFYSQREKPGIFYQQTLPESHLPKEALKISVAPGLADQKTGTPTVTSTSYSQHREKPSIFHQQALPGTHIPEEAQKVSAVTGPGNQKTWIPRVLSTFYSQREKPGIFYQQTLPGSHIPEEAQKVSPVLGPADQKTGTPTPTSASYSHTEKPGIFYQQVLPDNHPTEEALKISVASEPVDQTTGTPAVTSTSYSQYREKPSIFYQQSLPSSHLTEEAKNVSAVPGPADQKTVIPILPSTFYSHTEKPGVFYQQVLPHSHPTEEALKISVASEPVDQTTGTPTVTSTSYSQHTEKPSIFYQQSLPGSHLTEEAKNVSAVPGPGDRKTGIPTLPSTFYSHTEKPGSFYQQVLPHSHLPEEALEVSVAPGPVDQTIGTPTVTSPSSSFGEKPIVIYKQAFPEGHLPEESLKVSVAPGPVGQTTGAPTITSPSYSQHRAKSGSFYQLALLGSQIPEEALRVSSAPGPADQTTGIPTITSTSYSFGEKPIVNYKQAFPDGHLPEEALKVSIVSGPTEKKTDIPAGPLGSSALGEKPITFYRQALLDSPLNKEVVKVSAAPGPADQKTETLPVHSTSYSNRGKPVIFYQQTLSDSHLPEEALKVPPVPGPDAQKTETPSVSSSLYSYREKPIVFYQQALPDSELTQEALKVSAVPQPADQKTGLSTVTSSFYSHTEKPNISYQQELPDSHLTEEALKVSNVPGPADQKTGVSTVTSTSYSHREKPIVSYQRELPHFTEAGLKILRVPGPADQKTGINILPSNSYPQREHSVISYEQELPDLTEVTLKAIGVPGPADQKTGIQIASSSSYSNREKASIFHQQELPDVTEEALNVFVVPGQGDRKTEIPTVPLSYYSRREKPSVISQQELPDSHLTEEALKVSPVSIPAEQKTGIPIGLSSSYSHSHKEKLKISTVHIPDDQKTEFPAATLSSYSQIEKPKISTVIGPNDQKTPSQTAFHSSYSQTVKPNILFQQQLPDRDQSKGILKISAVPELTDVNTGKPVSLSSSYFHREKSNIFSPQELPGSHVTEDVLKVSTIPGPAGQKTVLPTALPSSFSHREKPDIFYQKDLPDRHLTEDALKISSALGQADQITGLQTVPSGTYSHGENHKLVSEHVQRLIDNLNSSDSSVSSNNVLLNSQADDRVVINKPESAGFRDVGSEEIQDAENSAKTLKEIRTLLMEAENMALKRCNFPAPLARFRDISDISFIQSKKVVCFKEPSSTGVSNGDLLHRQPFTEESPSSRCIQKDIGTQTNLKCRRGIENWEFISSTTVRSPLQEAESKVSMALEETLRQYQAAKSVMRSEPEGCSGTIGNKIIIPMMTVIKSDSSSDASDGNGSCSWDSNLPESLESVSDVLLNFFPYVSPKTSITDSREEEGVSESEDGGGSSVDSLAAHVKNLLQCESSLNHAKEILRNAEEEESRVRAHAWNMKFNLAHDCGYSISELNEDDRRKVEEIKAELFGHGRTTDLSKGLQSPRGMGCKPEAVCSHIIIESHEKGCFRTLTSEHPQLDRHPCAFRSAGPSEMTRGRQNPSSCRAKHVNLSASLDQNNSHFKVWNSLQLKSHSPFQNFIPDEFKISKGLRMPFDEKMDPWLSELVEPAFVPPKEVDFHSSSQMPSPEPMKKFTTSITFSSHRHSKCISNSSVVKVGVTEGSQCTGASVGVFNSHFTEEQNPPRDLKQKTSSPSSFKMHSNSQDKEVTILAEGRRQSQKLPVDFERSFQEEKPLERSDFTGSHSEPSTRANCSNFKEIQISDNHTLISMGRPSSTLGVNRSSSRLGVKEKNVTITPDLPSCIFLEQRELFEQSKAPRADDHVRKHHSPSPQHQDYVAPDLPSCIFLEQRELFEQCKAPYVDHQMRENHSPLPQGQDSIASDLPSPISLEQCQSKAPGVDDQMNKHHFPLPQGQDCVVEKNNQHKPKSHISNINVEAKFNTVVSQSAPNHCTLAASASTPPSNRKALSCVHITLCPKTSSKLDSGTLDERFHSLDAASKARMNSEFNFDLHTVSSRSLEPTSKLLTSKPVAQDQESLGFLGPKSSLDFQVVQPSLPDSNTITQDLKTIPSQNSQIVTSRQIQVNISDFEGHSNPEGTPVFADRLPEKMKTPLSAFSEKLSSDAVTQITTESPEKTLFSSEIFINAEDRGHEIIEPGNQKLRKAPVKFASSSSVQQVTFSRGTDGQPLLLPYKPSGSTKMYYVPQLRQIPPSPDSKSDTTVESSHSGSNDAIAPDFPAQVLGTRDDDLSATVNIKHKEGIYSKRVVTKASLPVGEKPLQNENADASVQVLITGDENLSDKKQQEIHSTRAVTEAAQAKEKESLQKDTADSSAAAAAEHSAQVGDPEMKNLPDTKAITQKEEIHRKKTVPEEAWPNNKESLQINIEESECHSEFENTTRSVFRSAKFYIHHPVHLPSDQDICHESLGKSVFMRHSWKDFFQHHPDKHREHMCLPLPYQNMDKTKTDYTRIKSLSINVNLGNKEVMDTTKSQVRDYPKHNGQISDPQRDQKVTPEQTTQHTVSLNELWNKYRERQRQQRQPELGDRKELSLVDRLDRLAKILQNPITHSLQVSESTHDDSRGERSVKEWSGRQQQRNKLQKKKRFKSLEKSHKNTGELKKSKVLSHHRAGRSNQIKIEQIKFDKYILSKQPGFNYISNTSSDCRPSEESELLTDTTTNILSGTTSTVESDILTQTDREVALHERSSSVSTIDTARLIQAFGHERVCLSPRRIKLYSSITNQQRRYLEKRSKHSKKVLNTGHPLVTSEHTRRRHIQVANHVISSDSISSSASSFLSSNSTFCNKQNVHMLNKGIQAGNLEIVNGAKKHTRDVGITFPTPSSSEAKLEENSDVTSWSEEKREEKMLFTGYPEDRKLKKNKKNSHEGVSWFVPVENVESRSKKENVPNTCGPGISWFEPITKTRPWREPLREQNCQGQHLDGRGYLAGPGREAGRDLLRPFVRATLQESLQFHRPDFISRSGERIKRLKLIVQERKLQSMLQTERDALFNIDRERQGHQNRMCPLPKRVFLAIQKNKPISKKEMIQRSKRIYEQLPEVQKKREEEKRKSEYKSYRLRAQLYKKRVTNQLLGRKVPWD,mutated_sequence,1.0,4167.0,UPI0000212786.a2m,UPI0000212786.npy,gnomAD
+UPI00000421FD,UPI00000421FD.csv,MAALFLRGFVQIGNCKTGISKSKEAFIEAVERKKKDRLVLYFKSGKYSTFRLSDNIQNVVLKSYRGNQNHLHLTLQNNNGLFIEGLSSTDAEQLKIFLDRVHQNEVQPPVRPGKGGSVFSSTTQKEINKTSFHKVDEKSSSKSFEIAKGSGTGVLQRMPLLTSKLTLTCGELSENQHKKRKRMLSSSSEMNEEFLKENNSVEYKKSKADCSRCVSYNREKQLKLKELEENKKLECESSCIMNATGNPYLDDIGLLQALTEKMVLVFLLQQGYSDGYTKWDKLKLFFELFPEKICHGLPNLGNTCYMNAVLQSLLSIPSFADDLLNQSFPWGKIPLNALTMCLARLLFFKDTYNIEIKEMLLLNLKKAISAAAEIFHGNAQNDAHEFLAHCLDQLKDNMEKLNTIWKPKSEFGEDNFPKQVFADDPDTSGFSCPVITNFELELLHSIACKACGQVILKTELNNYLSINLPQRIKAHPSSIQSTFDLFFGAEELEYKCAKCEHKTSVGVHSFSRLPRILIVHLKRYSLNEFCALKKNDQEVIISKYLKVSSHCNEGTRPPLPLSEDGEITDFQLLKVIRKMTSGNISVSWPATKESKDILAPHIGSDKESEQKKGQTVFKGASRRQQQKYLGKNSKPNELESVYSGDRAFIEKEPLAHLMTYLEDTSLCQFHKAGGKPASSPGTPLSKVDFQTVPENPKRKKYVKTSKFVAFDRIINPTKDLYEDKNIRIPERFQKVSEQTQQCDGMRICEQAPQQALPQSFPKPGTQGHTKNLLRPTKLNLQKSNRNSLLALGSNKNPRNKDILDKIKSKAKETKRNDDKGDHTYRLISVVSHLGKTLKSGHYICDAYDFEKQIWFTYDDMRVLGIQEAQMQEDRRCTGYIFFYMHNEIFEEMLKREENAQLNSKEVEETLQKE,mutated_sequence,1.0,913.0,UPI00000421FD.a2m,UPI00000421FD.npy,gnomAD
+UPI000013DDDC,UPI000013DDDC.csv,MALKKSSPSLDSGDSDSEELPTFAFLKKEPSSTKRRQPEREEKIVVVDISDCEASCPPAPELFSPPVPEIAETVTQTQPVRLLSSESEDEEEFIPLAQRLTCKFLTHKQLSPEDSSSPVKSVLDHQNNEGASCDWKKPFPKIPEVPLHDTPERSAADNKDLILDPCCQLPAYLSTCPGQSSSLAVTKTNSDILPPQKKTKPSQKVQGRGSHGCRQQRQARQKESTLRRQERKNAALVTRMKAQRPEECLKHIIVVLDPVLLQMEGGGQLLGALQTMECRCVIEAQAVPCSVTWRRRAGPSEDREDWVEEPTVLVLLRAEAFVSMIDNGKQGSLDSTMKGKETLQGFVTDITAKTAGKALSLVIVDQEKCFSAQNPPRRGKQGANKQTKKQQQRQPEASIGSMVSRVDAEEALVDLQLHTEAQAQIVQSWKELADFTCAFTKAVAEAPFKKLRDETTFSFCLESDWAGGVKVDLAGRGLALVWRRQIQQLNRVSLEMASAVVNAYPSPQLLVQAYQQCFSDKERQNLLADIQVRRGEGVTSTSRRIGPELSRRIYLQMTTLQPHLSLDSAD,mutated_sequence,1.0,570.0,UPI000013DDDC.a2m,UPI000013DDDC.npy,gnomAD
+UPI0001662BAD,UPI0001662BAD.csv,MAAAATLRLSAQGTVTFEDVAVNFTWEEWNLLSEAQRCLYRDVTLENLALISSLGCWCGVEDEAAPSKQSIYIQRETQVRTPMAGVSPKKAHPCEMCGPILGDILHVADHQGTHHKQKLHRCEAWGNKLYDSGNFHQHQNEHIGEKPYRGSVEEALFAKRCKLHVSGESSVFSESGKDFLPRSGLLQQEASHTGEKSNSKTECVSPIQCGGAHYSCGESMKHFSTKHILSQHQRLLTREECYVCCECGKSFSKYASLSNHQRVHTEKKHECGECGKSFSKYVSFSNHQRVHTEKKHECGECGKSFSKYVSFSNHQRVHTGKRPYECGECGKSFSKYASFSNHQRVHTEKKHYECGECGKSFSKYVSFSNHQRVHTGKRPYECGECGKSFSKYASFSNHQRVHTDKKHYECGECGKSFSQKSSLIQHQRFHTGEKPYGCEECGKSFSSEGHLRSHQRVHAGERPFKCGECVKSFSHKRSLVHHQRVHSGERPYQCGECGKSFSQKGNLVLHQRVHTGARPYECGECGKSFSSKGHLRNHQQIHTGDRLYECGECGKSFSHKGTLILHQRVHPRERSYGCGECGKSFSSIGHLRSHQRVHTGERPYECGECGKSFSHKRSLVHHQRMHTGERPYKCGDCGKSFNEKGHLRNHQRVHTTERPFKCGECGKCFSHKGNLILHQHGHTGERPYVCRECGKLFKKKSHLLVHQRIHNGEKPYACEACQKFFRNKYQLIAHQRVHTGERPYECNDCGKSFTHSSTFCVHKRIHTGEKPYECSECGKSFAESSSFTKHKRVHTGEKPYECSECGKSFAESSSLTKHKRVHTGEKPYKCEKCGKLFNKKSHLLVHQSSHWRKAI,mutated_sequence,1.0,855.0,UPI0001662BAD.a2m,UPI0001662BAD.npy,gnomAD
+UPI0000201E92,UPI0000201E92.csv,MVLAGLIRKLGHQLAEIRERALKSILCKIEHNLICYADLIQERQLFLHLLEWFNFPSVPMKEEVLNLLSRLVKYPPAVQHLVDVGAVEFLSKLRSNVEPNLQAEIDGILDGLFLLPSEVPALSSASYQTNQTELSKNPEILTGYFPQDKSNFQQMEVPPRPVVNQTVKCLKFSTFPWLPLTTTDRHVLSSNESSLRSSNHTLIWNTCELLKDVIMQDFPAEIFLQRPKIVQSLLSLLKLAFGDGKHRLALQSVSCLQQLCMYLRNRLNFHRDPGFFSNKHDTVSQNSSLSYCHEARGTHHSQNPSPGSSSPRPSVVGRTGQRPRGDGQDWDAASSSGSSSHAHVNSRISVHSPLDMGHIDLPELETEDTLELQFQQLSLPQFCVSILESAVPLLRTGSRQVIIRVLELLTEDMTLIGEAISTDIWDDSSLFGIDMKEKLLLVLGALGETMCYHKSSISLEQPEVMLVHHRMAFISISLFAVRLLQTLLPVEKASEFLSEPMSTALFLLSLDMPISLEYPNIHEAVVAYLEQLNSENYSIYKRTAEAVYSIECTCNFLSDIGKEGEKNLLELVELADQALRSFSYHQHFPLIKEIISICSKIWKSAQASPLLQGESQKVLLHMLSHPLPRVKAETYHCCLEITKECLGVHNVTKPVSSLCNGIHFLLHPKVLYEISVFGIQEPESEVNTAAKAILLYLLQGRLMMTALTWNKFIESLCPVIPILQGYADTEDPLGNCILLLSKASSDTEEMLPCTTRLKSMLRLLLVKKPSVRSLALKLLAFHLTSEEGADTKRPLIDARVLSRVTDLFIGKKPIELRLDDRRELVIKLETVEKVYEIFTSDDVDLVLRKSAAEQLAVIMQDIKMHAVVKKLCLIDKIIEYLNECVSQDGKVVECLVQPCLTLLRKVLCGDPVMRVSLSQQSSLLTVLFRVSLIFHEDCSVVTEVGALFCLLLFDEVSRMDMWSVNPSNKPSLPSVFSLPVSVFRRYHLPVHVIGHHAVSPYSIVLPLSADCLALKPVSDMLRIAWNLSWYHGSDNLLKQMNSETKTQEILDALKLSTEDILTLKITHMASGLQDCLHSIVQAATHREVRAAVTRMSFYLLNDRLSLKGCPGPCGVTLKSLAWHTALNRFLQVLPACTEDEKLLIDIIHFLNKLIKEQRKNSSLELLNWILELLLRHSANPLLDLLVLTESQAREETDDIRTAVRQQLQKELIALFDTLLLNFMEVTDRKCSELLYVFQTQLALKLLQCLKVTDAPHFYGLPSLERTLRGMANLTAFPGWSSHSPLTKPLDICVKYLSGLLEVITSFYVERGGNAMSFMGKGVTKSTILCLLHLSHEMMAQAGSLEWMSLWFLPLGSHSEEHIPTQQGLAWLIPLWVDRDPEVRFTSLGLGSALTTLETGCVALANSCQNISGGLWGTVVNILLDQSECSMVRREAAFILQNLLVIPMPTEIIKDYTWQGPCVHDEDSGLSLIGKPALQALLYHCHFYEHLNQMVKHCYLGRCMFDLNFSAFDRNSESNDLNGLDDSFKFWRAPSRTSQDRDPSSLSTSETTVAPSLGSTEFQPLVQSTTLLPEASHDQFVAQGHQESTSPRPPHDSSLSAPLPKLCVFVTPSLLSAMCSLLDNLLTIAPRDTAKAFRQAHLIELLCSIADATLIQTCVQELRALLPSSPPAEHTQAQVSFLLEYLSSLSRLLQSCLLVEPDLVIQDELVKPLITNIIGILTICTKDVLDKELISAFYHTWTHLFNLLAMLLRKAGAITLPFVTVALAKHWTAAIDMFCTCAGLSATCPALYTASLQFLSVLLTEEAKGHLQAKSKTHLCCSPTVASLLDDSQENQKSLEQLSDVILQCYEGKSSKDILKRVAANALMSLLAVSRRAQKHALKANLIDNCMEQMKHINAQLNLDSLRPGKAALKKKEDGVIKELSIAMQLLRNCLYQNEECKEAALEAHLVPVLHSLWPWILMDDSLMQISLQLLCVYTANFPNGCSSLCWSSCGQHPVQATHRGAVSNSLMLCILKLASQMPLENTTVQQMVFMLLSNLALSHDCKGVIQKSNFLQNFLSLALPKGGNKHLSNLTILWLKLLLNISSGEDGQQMILRLDGCLDLLTEMSKYKHKSSPLLPLLIFHNVCFSPANKPKILANEKVITVLAACLESENQNAQRIGAAALWALIYNYQKAKTALKSPSVKRRVDEAYSLAKKTFPNSEANPLNAYYLKCLENLVQLLNSS,mutated_sequence,1.0,2226.0,UPI0000201E92.a2m,UPI0000201E92.npy,gnomAD
+UPI00001BDAC0,UPI00001BDAC0.csv,MMPLAEAGALAQGGGPSATEWACILRRKTPRHKQPTLLMVRASRRSGKTSAVLKAGRQSVSGRKNSTSKDLVTLGASSLREERGHPLHPRHRKAVHLRTRGRTRGWVQTLARMSRRTRGPVERAAAAAAAAAGGDAGHAPFPPPPAADGARAPRSPGQVTPRGLRLRLPRRESLLRGLCRPLRPLLGFRESDSAKPASLRLLQHTPSARRNYRIAGARLMRSNYPPPLSSAALRGAGPTRRN,mutated_sequence,1.0,242.0,UPI00001BDAC0.a2m,UPI00001BDAC0.npy,gnomAD
+UPI000012ADD6,UPI000012ADD6.csv,MTTEGGPPPAPLRRACSPVPGALQAALMSPPPAAAAAAAAAPETTSSSSSSSSASCASSSSSSNSASAPSAACKSAGGGGAGAGSGGAKKASSGLRRPEKPPYSYIALIVMAIQSSPSKRLTLSEIYQFLQARFPFFRGAYQGWKNSVRHNLSLNECFIKLPKGLGRPGKGHYWTIDPASEFMFEEGSFRRRPRGFRRKCQALKPMYHRVVSGLGFGASLLPQGFDFQAPPSAPLGCHSQGGYGGLDMMPAGYDAGAGAPSHAHPHHHHHHHVPHMSPNPGSTYMASCPVPAGPGGVGAAGGGGGGDYGPDSSSSPVPSSPAMASAIECHSPYTSPAAHWSSPGASPYLKQPPALTPSSNPAASAGLHSSMSSYSLEQSYLHQNAREDLSVGLPRYQHHSTPVCDRKDFVLNFNGISSFHPSASGSYYHHHHQSVCQDIKPCVM,mutated_sequence,1.0,444.0,UPI000012ADD6.a2m,UPI000012ADD6.npy,gnomAD
+UPI000022AC91,UPI000022AC91.csv,MSLPKTPSTPLNSTSTSESKKLKVSVAKEGTRGLPELKEKKNMVDRSKLLPTSLQNEFIPKEVLLSLTYAANAGPCPENLLPPKKIKTPKGTLPRLVDHVWHHPVRRNKFKYLIDHPVSLTGAGRDISFLYDVTYAKGQTREKAVCPPHLARSLQSHDGVIVPHKPKTLTDTLIPEEFHIVSSTGVSGLECYDDKYTTLLTDSENRLLLFPSMKPNKRVEVAQLNDVMDTMLERAGVENQEYTGPTKMHKLLHILKKEQTIYNMIFHELIRQVSVDCADRGELLSKVRERYVQMLDQIARQMIDFYKDLVTQRVMDQRILEELYNFKHVIEELTRELCLVRAHDVKLTKETEKAHKDLAQALLNAEKNAKIVEEYHDLYTLQRERMENDMKKLVAERDIWSSATYELALKVIERNRVILARRLYLNEKGWNKYTKHFIILLSNKDTEDLALLQKLTQKWRNLVNKLKQEVEQMEESTSETLKIVKDGLIKWQEFFNEKDILSPNKGNIFNSVLLDFKQWQKILNEKKEEFTGDVLLSKYDTLKIIKHLQENWADIGLGIFNRHKSLEGEMPSERQYMEEIIKNIQKLYKEYEIRINGDNGYSKILPSLISSLDFCSFKLENLEFPDTPLEEWQEIDEKINEMKSHLDILLNLTGIVPQHIDVDSVSVLQAYIFNMIQQWLLKIGNEINNGNIELQHHMDELHISMIQWMVNLLILMIPNFTDQDCLLKLEEESAEKHDIGVARLELDAIELTRKLYQYSSYLSSCCKGMVTAMALSKSTNSHKNATEDLYEVDKLKKECYEWINTCSCLLSNIKGRKITLLTYEEIERLLEEEAVKEFIEPEIDESFKEDEEESKEDRKLQEENKERAEEQPSTSTEKEKLIRFIGEDENVHSKPLFETDVLSSWRESAKQGTLAQKYLEAMAVIEHMQEKLLEVENRARQAEEKFEDAYEKLHHTLIKNKDLEELVMTSRKESKEEKENQDEREVKEEEEQQEEEEVRSAENSSKSPKKGH,mutated_sequence,1.0,1012.0,UPI000022AC91.a2m,UPI000022AC91.npy,gnomAD
+UPI000006F1A1,UPI000006F1A1.csv,MKVFNPIPCFHTKDKESLNFPFPFFWAPIGSSIYNVSGLVGGRLSIEVSCVFTCLSCPISLVAINFLLLKYLDFWLPIWLPSLVFISVWF,mutated_sequence,1.0,90.0,UPI000006F1A1.a2m,UPI000006F1A1.npy,gnomAD
+UPI00001376B5,UPI00001376B5.csv,MAAAAGGGSCPGPGSARGRFPGRPRGAGGGGGRGGRGNGAERVRVALRRGGGATGPGGAEPGEDTALLRLLGLRRGLRRLRRLWAGPRVQRGRGRGRGRGWGPSRGCVPEEESSDGESDEEEFQGFHSDEDVAPSSLRSALRSQRGRAPRGRGRKHKTTPLPPPRLADVAPTPPKTPARKRGEEGTERMVQALTELLRRAQAPQAPRSRACEPSTPRRSRGRPPGRPAGPCRRKQQAVVVAEAAVTIPKPEPPPPVVPVKHQTGSWKCKEGPGPGPGTPRRGGQSSRGGRGGRGRGRGGGLPFVIKFVSRAKKVKMGQLSLGLESGQGQGQHEESWQDVPQRRVGSGQGGSPCWKKQEQKLDDEEEEKKEEEEKDKEGEEKEERAVAEEMMPAAEKEEAKLPPPPLTPPAPSPPPPLPPPSTSPPPPLCPPPPPPVSPPPLPSPPPPPAQEEQEESPPPVVPATCSRKRGRPPLTPSQRAEREAARAGPEGTSPPTPTPSTATGGPPEDSPTVAPKSTTFLKNIRQFIMPVVSARSSRVIKTPRRFMDEDPPKPPKVEVSPVLRPPITTSPPVPQEPAPVPSPPRAPTPPSTPVPLPEKRRSILREPTFRWTSLTRELPPPPPAPPPPPAPSPPPAPATSSRRPLLLRAPQFTPSEAHLKIYESVLTPPPLGAPEAPEPEPPPADDSPAEPEPRAVGRTNHLSLPRFAPVVTTPVKAEVSPHGAPALSNGPQTQAQLLQPLQALQTQLLPQALPPPQPQLQPPPSPQQMPPLEKARIAGVGSLPLSGVEEKMFSLLKRAKVQLFKIDQQQQQKVAASMPLSPGGQMEEVAGAVKQISDRGPVRSEDESVEAKRERPSGPESPVQGPRIKHVCRHAAVALGQARAMVPEDVPRLSALPLRDRQDLATEDTSSASETESVPSRSRRGKVEAAGPGGESEPTGSGGTLAHTPRRSLPSHHGKKMRMARCGHCRGCLRVQDCGSCVNCLDKPKFGGPNTKKQCCVYRKCDKIEARKMERLAKKGRTIVKTLLPWDSDESPEASPGPPGPRRGAGAGGPREEVVAHPGPEEQDSLLQRKSARRCVKQRPSYDIFEDSDDSEPGGPPAPRRRTPRENELPLPEPEEQSRPRKPTLQPVLQLKARRRLDKDALAPGPFASFPNGWTGKQKSPDGVHRVRVDFKEDCDLENVWLMGGLSVLTSVPGGPPMVCLLCASKGLHELVFCQVCCDPFHPFCLEEAERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRHAYHPACLGPSYPTRATRKRRHWICSACVRCKSCGATPGKNWDVEWSGDYSLCPRCTQLYEKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAGAAQPRWREALSGALQGGLRQVLQGLLSSKVVGPLLLCTQCGPDGKQLHPGPCGLQAVSQRFEDGHYKSVHSFMEDMVGILMRHSEEGETPDRRAGGQMKGLLLKLLESAFGWFDAHDPKYWRRSTRLPNGVLPNAVLPPSLDHVYAQWRQQEPETPESGQPPGDPSAAFQGKDPAAFSHLEDPRQCALCLKYGDADSKEAGRLLYIGQNEWTHVNCAIWSAEVFEENDGSLKNVHAAVARGRQMRCELCLKPGATVGCCLSSCLSNFHFMCARASYCIFQDDKKVFCQKHTDLLDGKEIVNPDGFDVLRRVYVDFEGINFKRKFLTGLEPDAINVLIGSIRIDSLGTLSDLSDCEGRLFPIGYQCSRLYWSTVDARRRCWYRCRILEYRPWGPREEPAHLEAAEENQTIVHSPAPSSEPPGGEDPPLDTDVLVPGAPERHSPIQNLDPPLRPDSGSAPPPAPRSFSGARIKVPNYSPSRRPLGGVSFGPLPSPGSPSSLTHHIPTVGDPDFPAPPRRSRRPSPLAPRPPPSRWASPPLKTSPQLRVPPPTSVVTALTPTSGELAPPGPAPSPPPPEDLGPDFEDMEVVSGLSAADLDFAASLLGTEPFQEEIVAAGAMGSSHGGPGDSSEEESSPTSRYIHFPVTVVSAPGLAPSATPGAPRIEQLDGVDDGTDSEAEAVQQPRGQGTPPSGPGVVRAGVLGAAGDRARPPEDLPSEIVDFVLKNLGGPGDGGAGPREESLPPAPPLANGSQPSQGLTASPADPTRTFAWLPGAPGVRVLSLGPAPEPPKPATSKIILVNKLGQVFVKMAGEGEPVPPPVKQPPLPPTISPTAPTSWTLPPGPLLGVLPVVGVVRPAPPPPPPPLTLVLSSGPASPPRQAIRVKRVSTFSGRSPPAPPPYKAPRLDEDGEASEDTPQVPGLGSGGFSRVRMKTPTVRGVLDLDRPGEPAGEESPGPLQERSPLLPLPEDGPPQVPDGPPDLLLESQWHHYSGEASSSEEEPPSPDDKENQAPKRTGPHLRFEISSEDGFSVEAESLEGAWRTLIEKVQEARGHARLRHLSFSGMSGARLLGIHHDAVIFLAEQLPGAQRCQHYKFRYHQQGEGQEEPPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPEGATCDEEEDEVQLRSTRRATSLELPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN,mutated_sequence,1.0,2715.0,UPI00001376B5.a2m,UPI00001376B5.npy,gnomAD
+UPI0000161FDC,UPI0000161FDC.csv,MAKRLCAGSALCVRGPRGPAPLLLVGLALLGAARAREEAGGGFSLHPPYFNLAEGARIAASATCGEEAPARGSPRPTEDLYCKLVGGPVAGGDPNQTIRGQYCDICTAANSNKAHPASNAIDGTERWWQSPPLSRGLEYNEVNVTLDLGQVFHVAYVLIKFANSPRPDLWVLERSMDFGRTYQPWQFFASSKRDCLERFGPQTLERITRDDAAICTTEYSRIVPLENGEIVVSLVNGRPGAMNFSYSPLLREFTKATNVRLRFLRTNTLLGHLMGKALRDPTVTRRYYYSIKDISIGGRCVCHGHADACDAKDPTDPFRLQCTCQHNTCGGTCDRCCPGFNQQPWKPATANSANECQSCNCYGHATDCYYDPEVDRRRASQSLDGTYQGGGVCIDCQHHTTGVNCERCLPGFYRSPNHPLDSPHVCRRCNCESDFTDGTCEDLTGRCYCRPNFSGERCDVCAEGFTGFPSCYPTPSSSNDTREQVLPAGQIVNCDCSAAGTQGNACRKDPRVGRCLCKPNFQGTHCELCAPGFYGPGCQPCQCSSPGVADDRCDPDTGQCRCRVGFEGATCDRCAPGYFHFPLCQLCGCSPAGTLPEGCDEAGRCLCQPEFAGPHCDRCRPGYHGFPNCQACTCDPRGALDQLCGAGGLCRCRPGYTGTACQECSPGFHGFPSCVPCHCSAEGSLHAACDPRSGQCSCRPRVTGLRCDTCVPGAYNFPYCEAGSCHPAGLAPVDPALPEAQVPCMCRAHVEGPSCDRCKPGFWGLSPSNPEGCTRCSCDLRGTLGGVAECQPGTGQCFCKPHVCGQACASCKDGFFGLDQADYFGCRSCRCDIGGALGQSCEPRTGVCRCRPNTQGPTCSEPARDHYLPDLHHLRLELEEAATPEGHAVRFGFNPLEFENFSWRGYAQMAPVQPRIVARLNLTSPDLFWLVFRYVNRGAMSVSGRVSVREEGRSATCANCTAQSQPVAFPPSTEPAFITVPQRGFGEPFVLNPGTWALRVEAEGVLLDYVVLLPSAYYEAALLQLRVTEACTYRPSAQQSGDNCLLYTHLPLDGFPSAAGLEALCRQDNSLPRPCPTEQLSPSHPPLITCTGSDVDVQLQVAVPQPGRYALVVEYANEDARQEVGVAVHTPQRAPQQGLLSLHPCLYSTLCRGTARDTQDHLAVFHLDSEASVRLTAEQARFFLHGVTLVPIEEFSPEFVEPRVSCISSHGAFGPNSAACLPSRFPKPPQPIILRDCQVIPLPPGLPLTHAQDLTPAMSPAGPRPRPPTAVDPDAEPTLLREPQATVVFTTHVPTLGRYAFLLHGYQPAHPTFPVEVLINAGRVWQGHANASFCPHGYGCRTLVVCEGQALLDVTHSELTVTVRVPKGRWLWLDYVLVVPENVYSFGYLREEPLDKSYDFISHCAAQGYHISPSSSSLFCRNAAASLSLFYNNGARPCGCHEVGATGPTCEPFGGQCPCHAHVIGRDCSRCATGYWGFPNCRPCDCGARLCDELTGQCICPPRTIPPDCLLCQPQTFGCHPLVGCEECNCSGPGIQELTDPTCDTDSGQCKCRPNVTGRRCDTCSPGFHGYPRCRPCDCHEAGTAPGVCDPLTGQCYCKENVQGPKCDQCSLGTFSLDAANPKGCTRCFCFGATERCRSSSYTRQEFVDMEGWVLLSTDRQVVPHERQPGTEMLRADLRHVPEAVPEAFPELYWQAPPSYLGDRVSSYGGTLRYELHSETQRGDVFVPMESRPDVVLQGNQMSITFLEPAYPTPGHVHRGQLQLVEGNFRHTETRNTVSREELMMVLASLEQLQIRALFSQISSAVFLRRVALEVASPAGQGALASNVELCLCPASYRGDSCQECAPGFYRDVKGLFLGRCVPCQCHGHSDRCLPGSGVCVDCQHNTEGAHCERCQAGFVSSRDDPSAPCVSCPCPLSVPSNNFAEGCVLRGGRTQCLCKPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNLLFSDCDPLTGACRGCLRHTTGPRCEICAPGFYGNALLPGNCTRCDCTPCGTEACDPHSGHCLCKAGVTGRRCDRCQEGHFGFDGCGGCRPCACGPAAEGSECHPQSGQCHCRPGTMGPQCRECAPGYWGLPEQGCRRCQCPGGRCDPHTGRCNCPPGLSGERCDTCSQQHQVPVPGGPVGHSIHCEVCDHCVVLLLDDLERAGALLPAIHEQLRGINASSMAWARLHRLNASIADLQSQLRSPLGPRHETAQQLEVLEQQSTSLGQDARRLGGQAVGTRDQASQLLAGTEATLGHAKTLLAAIRAVDRTLSELMSQTGHLGLANASAPSGEQLLRTLAEVERLLWEMRARDLGAPQAAAEAELAAAQRLLARVQEQLSSLWEENQALATQTRDRLAQHEAGLMDLREALNRAVDATREAQELNSRNQERLEEALQRKQELSRDNATLQATLHAARDTLASVFRLLHSLDQAKEELERLAASLDGARTPLLQRMQTFSPAGSKLRLVEAAEAHAQQLGQLALNLSSIILDVNQDRLTQRAIEASNAYSRILQAVQAAEDAAGQALQQADHTWATVVRQGLVDRAQQLLANSTALEEAMLQEQQRLGLVWAALQGARTQLRDVRAKKDQLEAHIQAAQAMLAMDTDETSKKIAHAKAVAAEAQDTATRVQSQLQAMQENVERWQGQYEGLRGQDLGQAVLDAGHSVSTLEKTLPQLLAKLSILENRGVHNASLALSASIGRVRELIAQARGAASKVKVPMKFNGRSGVQLRTPRDLADLAAYTALKFYLQGPEPEPGQGTEDRFVMYMGSRQATGDYMGVSLRDKKVHWVYQLGEAGPAVLSIDEDIGEQFAAVSLDRTLQFGHMSVTVERQMIQETKGDTVAPGAEGLLNLRPDDFVFYVGGYPSTFTPPPLLRFPGYRGCIEMDTLNEEVVSLYNFERTFQLDTAVDRPCARSKSTGDPWLTDGSYLDGTGFARISFDSQISTTKRFEQELRLVSYSGVLFFLKQQSQFLCLAVQEGSLVLLYDFGAGLKKAVPLQPPPPLTSASKAIQVFLLGGSRKRVLVRVERATVYSVEQDNDLELADAYYLGGVPPDQLPPSLRRLFPTGGSVRGCVKGIKALGKYVDLKRLNTTGVSAGCTADLLVGRAMTFHGHGFLRLALSNVAPLTGNVYSGFGFHSAQDSALLYYRASPDGLCQVSLQQGRVSLQLLRTEVKTQAGFADGAPHYVAFYSNATGVWLYVDDQLQQMKPHRGPPPELQPQPEGPPRLLLGGLPESGTIYNFSGCISNVFVQRLLGPQRVFDLQQNLGSVNVSTGCAPALQAQTPGLGPRGLQATARKASRRSRQPARHPACMLPPHLRTTRDSYQFGGSLSSHLEFVGILARHRNWPSLSMHVLPRSSRGLLLFTARLRPGSPSLALFLSNGHFVAQMEGLGTRLRAQSRQRSRPGRWHKVSVRWEKNRILLVTDGARAWSQEGPHRQHQGAEHPQPHTLFVGGLPASSHSSKLPVTVGFSGCVKRLRLHGRPLGAPTRMAGVTPCILGPLEAGLFFPGSGGVITLDLPGATLPDVGLELEVRPLAVTGLIFHLGQARTPPYLQLQVTEKQVLLRADDGAGEFSTSVTRPSVLCDGQWHRLAVMKSGNVLRLEVDAQSNHTVGPLLAAAAGAPAPLYLGGLPEPMAVQPWPPAYCGCMRRLAVNRSPVAMTRSVEVHGAVGASGCPAA,mutated_sequence,1.0,3695.0,UPI0000161FDC.a2m,UPI0000161FDC.npy,gnomAD
+UPI0000140E62,UPI0000140E62.csv,MGKTANSPGSGARPDPVRSFNRWKKKHSHRQNKKKQLRKQLKKPEWQVERESISRLMQNYEKINVNEITRFSDFPLSKKTLKGLQEAQYRLVTEIQKQTIGLALQGKDVLGAAKTGSGKTLAFLVPVLEALYRLQWTSTDGLGVLIISPTRELAYQTFEVLRKVGKNHDFSAGLIIGGKDLKHEAERINNINILVCTPGRLLQHMDETVSFHATDLQMLVLDEADRILDMGFADTMNAVIENLPKKRQTLLFSATQTKSVKDLARLSLKNPEYVWVHEKAKYSTPATLEQNYIVCELQQKISVLYSFLRSHLKKKSIVFFSSCKEVQYLYRVFCRLRPGVSILALHGRQQQMRRMEVYNEFVRKRAAVLFATDIAARGLDFPAVNWVLQFDCPEDANTYIHRAGRTARYKEDGEALLILLPSEKAMVQQLLQKKVPVKEIKINPEKLIDVQKKLESILAQDQDLKERAQRCFVSYVRSVYLMKDKEVFDVSKLPIPEYALSLGLAVAPRVRFLQKMQKQPTKELVRSQADKVIEPRAPSLTNDEVEEFRAYFNEKMSILQKGGKRLEGTEHRQDNDTGNEEQEEEEDDEEEMEEKLAKAKGSQAPSLPNTSEAQKIKEVPTQFLDRDEEEEDADFLKVKRHNVFGLDLKDEKTLQKKEPSKSSIKKKMTKVAEAKKVMKRNFKVNKKITFTDEGELVQQWPQMQKSAIKDAEEDDDTGGINLHKAKERLQEEDKFDKEEYRKKIKAKHREKRLKEREARREANKRQAKAKDEEEAFLDWSDDDDDDDDGFDPSTLPDPDKYRSSEDSDSEDMENKISDTKKKQGMKKRSNSEVEDVGPTSHNRKKARWDTLEPLDTGLSLAEDEELVLHLLRSQS,mutated_sequence,1.0,875.0,UPI0000140E62.a2m,UPI0000140E62.npy,gnomAD
+UPI000013E44F,UPI000013E44F.csv,MLLRGVLLALQALQLAGALDLPAGSCAFEESTCGFDSVLASLPWILNEEGHYIYVDTSFGKQGEKAVLLSPDLQAEEWSCLRLVYQITTSSESLSDPSQLNLYMRFEDESFDRLLWSAKEPSDSWLIASLDLQNSSKKFKILIEGVLGQGNTASIALFEIKMTTGYCIECDFEENHLCGFVNRWNPNVNWFVGGGSIRNVHSILPQDHTFKSELGHYMYVDSVYVKHFQEVAQLISPLTTAPMAGCLSFYYQIQQGNDNVFSLYTRDVAGLYEEIWKADRPGNAAWNLAEVEFSAPYPMEVIFEVAFNGPKGGYVALDDISFSPVHCQNQTELLFSAVEASCNFEQDLCNFYQDKEGPGWTRVKVKPNMYRAGDHTTGLGYYLLANTKFTSQPGYIGRLYGPSLPGNLQYCLRFHYAIYGFLKMSDTLAVYIFEENHVVQEKIWSVLESPRGVWMQAEITFKKPMPTKVVFMSLCKSFWDCGLVALDDITIQLGSCSSSEKLPPPPGECTFEQDECTFTQEKRNRSSWHRRRGETPTSYTGPKGDHTTGVGYYMYIEASHMVYGQKARLLSRPLRGVSGKHCLTFFYHMYGGGTGLLSVYLKKEEDSEESLLWRRRGEQSISWLRALIEYSCERQHQIIFEAIRGVSIRSDIAIDDVKFQAGPCGEMEDTTQQSSGYSEDLNEIEY,mutated_sequence,1.0,686.0,UPI000013E44F.a2m,UPI000013E44F.npy,gnomAD
+UPI0000209875,UPI0000209875.csv,MAAFGRQVLDWHRLIPLTWACMARQTPHLGEQRRTTASLLRKLTTASNGGVIEELSCVRSNNYVQEPECRRNLVQCLLEKQGTPVVQGSLELERVMSSLLDMGFSNAHINELLSVRRGASLQQLLDIISEFILLGLNPEPVCVVLKKSPQLLKLPIMQMRKRSSYLQKLGLGEGKLKRVLYCCPEIFTMRQQDINDTVRLLKEKCLFTVQQVTKILHSCPSVLREDLGQLEYKFQYAYFRMGIKHPDIVKSEYLQYSLTKIKQRHIYLERLGRYQTPDKKGQTQIPNPLLKDILRVSEAEFLARTACTSVEEFQVFKKLLAREEEESESSTSDDKRASLDEDEDDDDEEDNDEDDNDEDDDDEDDDEAEDNDEDEDDDEEE,mutated_sequence,1.0,381.0,UPI0000209875.a2m,UPI0000209875.npy,gnomAD
+UPI0000251D78,UPI0000251D78.csv,MLREEATKKSKEKEPGMALPQGRLTFRDVAIEFSLEEWKCLNPAQRALYRAVMLENYRNLEFVDSSLKSMMEFSSTRHSITGEVIHTGTLQRHKSHHIGDFCFPEMKKDIHHFEFQWQEVERNGHEAPMTKIKKLTGSTDRSDHRHAGNKPIKDQLGLSFHSHLPELHMFQTKGKISNQLDKSIGASSASESQRISCRLKTHISNKYGKNFLHSSFTQIQEICMREKPCQSNECGKAFNYSSLLRRHHITHSREREYKCDVCGKIFNQKQYIVYHHRCHTGEKTYKCNECGKTFTQMSSLVCHRRLHTGEKPYKCNECGKTFSEKSSLRCHRRLHTGEKPYKCNECGKTFGRNSALVIHKAIHTGEKPYKCNECGKTFSQKSSLQCHHILHTGEKPYKCEECDNVYIRRSHLERHRKIHTGEGSYKCKVCDKVFRSDSYLAEHQRVHTGEKPYKCNKCGRSFSRKSSLQYHHTLHTGEKPYTCNECGKVFSRRENLARHHRLHAGEKPYKCEECDKVFSRRSHLERHRRIHTGEKPYKCKVCDKAFRSDSCLANHTRVHTGEKPYKCNKCAKVFNQKGILAQHQRVHTGEKPYKCNECGKVFNQKASLAKHQRVHTAEKPYKCNECGKAFTGQSTLIHHQAIHGCRETLQM,mutated_sequence,1.0,651.0,UPI0000251D78.a2m,UPI0000251D78.npy,gnomAD
+UPI000059D3CC,UPI000059D3CC.csv,MSSAPEKQQPPHGGGGGGGGGGGAAMDPASSGPSKAKKTNAGIRRPEKPPYSYIALIVMAIQSSPTKRLTLSEIYQFLQSRFPFFRGSYQGWKNSVRHNLSLNECFIKLPKGLGRPGKGHYWTIDPASEFMFEEGSFRRRPRGFRRKCQALKPMYSMMNGLGFNHLPDTYGFQGSAGGLSCPPNSLALEGGLGMMNGHLPGNVDGMALPSHSVPHLPSNGGHSYMGGCGGAAAGEYPHHDSSVPASPLLPTGAGGVMEPHAVYSGSAAAWPPSASAALNSGASYIKQQPLSPCNPAANPLSGSLSTHSLEQPYLHQNSHNAPAELQGIPRYHSQSPSMCDRKEFVFSFNAMASSSMHSAGGGSYYHQQVTYQDIKPCVM,mutated_sequence,1.0,379.0,UPI000059D3CC.a2m,UPI000059D3CC.npy,gnomAD
+UPI0001929538,UPI0001929538.csv,MSFSLTFTELANIAIPQCGVLNFKALHLLLHGILEHIHMAELKKVLSGDEDFLQTSQVVIMPREGDAQPILNPMKRLSNVFDHVVSRLDKLENQLALLQDLPSTAQLLEASQGTARPVQDLWHLIKLRKMVEGHDEVMAKSMQTLQDLLTDLHALQVTITALRKEVDMLKNMLDKVHPERMDIFAEDFKIQNWKMVALQREVASLQNKFKTIPKTEDMVLWSGLHDAMFTSEIGSSPLDLWQSVEQLPEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEAQPKAPQSALHRLKTTAAIAAAAAAAYAAATSSAAQAAKVAAKFVKDAPATKMAAIATDTAAAGPLGVFADVLGAGPSRGATESQILGDDSEIYEILSPSYSAASIGPDPALSQAMVATKQAMSPEDKKRAVKYSMSHIAQIPVKHDSLKEEFAQLSCNLNQRLSYLANMGGPSSLGTTVDILQKKIGSLQKSRLKEEELERIWGNQIEMMKDRYITLDKAVENLQIRMDEFKTLQAQIKRLEMNKVNKSTMEEELREKADRSALAGKASRVDLETVALELNEMIQGILFKVTIHEDSWKKAMEELSKDVNTKLVHSDLDPLKKEMEEVWKIVRKLLIEGLRLDPDSAAGFRRKLFKRVKCISCDRPVEMMTGPQLITIRKAHLLSRLRPASANSCEYLQRQQMREQQWLQLQDLGIQEDCQQDWGDGPQNATSLKCKSCNLLTLYPYGDPHVIDYDSAEVDILGVDGILYKGRVNSQRGAQPLAVAKELAAVKAPSPPSQSLYDRVHSSALFGAICPPLCPRSSACSAASGPHLTMPARPPSLPPLLLLPPLIPSLRDPQQAPGSTRLSRAPHIESRVGRKPPEEPANP,mutated_sequence,1.0,1141.0,UPI0001929538.a2m,UPI0001929538.npy,gnomAD
+UPI000013E413,UPI000013E413.csv,MSNVSGILETAGVPLVSANWPQPSPPPAVPAGPQMDHMGNSSQGAPWLFLTSALARGVSGIFVWTALVLTCHQIYLHLRSYTVPQEQRYIIRLLLIVPIYAFDSWLSLLLLGDHQYYVYFDSVRDCYEAFVIYSFLSLCFQYLGGEGAIMAEIRGKPIKSSCLYGTCCLRGMTYSIGFLRFCKQATLQFCLVKPVMAVTTIILQAFGKYHDGDFNVRSGYLYVTLIYNASVSLALYALFLFYFTTRELLRPFQPVLKFLTIKAVIFLSFWQGLLLAILERCGVIPEVETSGGNKLGAGTLAAGYQNFIICVEMLFASVALRYAFPCQVYAEKKENSPAPPAPMQSISSGIRETVSPQDIVQDAIHNFSPAYQHYTQQATHEAPRPGTHPSGGSGGSRKSRSLEKRMLIPSEDL,mutated_sequence,1.0,413.0,UPI000013E413.a2m,UPI000013E413.npy,gnomAD
+UPI000013D308,UPI000013D308.csv,MWRRKHPRTSGGTRGVLSGNRGVEYGSGRGHLGTFEGRWRKLPKMPEAVGTDPSTSRKMAELEEVTLDGKPLQALRVTDLKAALEQRGLAKSGQKSALVKRLKGALMLENLQKHSTPHAAFQPNSQIGEEMSQNSFIKQYLEKQQELLRQRLEREAREAAELEEASAESEDEMIHPEGVASLLPPDFQSSLERPELELSRHSPRKSSSISEEKGDSDDEKPRKGERRSSRVRQARAAKLSEGSQPAEEEEDQETPSRNLRVRADRNLKTEEEEEEEEEEEEDDEEEEGDDEGQKSREAPILKEFKEEGEEIPRVKPEEMMDERPKTRSQEQEVLERGGRFTRSQEEARKSHLARQQQEKEMKTTSPLEEEEREIKSSQGLKEKSKSPSPPRLTEDRKKASLVALPEQTASEEETPPPLLTKEASSPPPHPQLHSEEEIEPMEGPAPAVLIQLSPPNTDADTRELLVSQHTVQLVGGLSPLSSPSDTKAESPAEKVPEESVLPLVQKSTLADYSAQKDLEPESDRSAQPLPLKIEELALAKGITEECLKQPSLEQKEGRRASHTLLPSHRLKQSADSSSSRSSSSSSSSSRSRSRSPDSSGSRSHSPLRSKQRDVAQARTHANPRGRPKMGSRSTSESRSRSRSRSRSASSNSRKSLSPGVSRDSSTSYTETKDPSSGQEVATPPVPQLQVCEPKERTSTSSSSVQARRLSQPESAEKHVTQRLQPERGSPKKCEAEEAEPPAATQPQTSETQTSHLPESERIHHTVEEKEEVTMDTSENRPENDVPEPPMPIADQVSNDDRPEGSVEDEEKKESSLPKSFKRKISVVSATKGVPAGNSDTEGGQPGRKRRWGASTATTQKKPSISITTESLKSLIPDIKPLAGQEAVVDLHADDSRISEDETERNGDDGTHDKGLKICRTVTQVVPAEGQENGQREEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVKKVTLGDTLTRRSISQQKSGVSITIDDPVRTAQVPSPPRGKISNIVHISNLVRPFTLGQLKELLGRTGTLVEEAFWIDKIKSHCFVTYSTVEEAVATRTALHGVKWPQSNPKFLCADYAEQDELDYHRGLLVDRPSETKTEEQGIPRPLHPPPPPPVQPPQHPRAEQREQERAVREQWAEREREMERRERTRSEREWDRDKVREGPRSRSRSRDRRRKERAKSKEKKSEKKEKAQEEPPAKLLDDLFRKTKAAPCIYWLPLTDSQIVQKEAERAERAKEREKRRKEQEEEEQKEREKEAERERNRQLEREKRREHSRERDRERERERERDRGDRDRDRERDRERGRERDRRDTKRHSRSRSRSTPVRDRGGRR,mutated_sequence,1.0,1341.0,UPI000013D308.a2m,UPI000013D308.npy,gnomAD
+UPI00001AF305,UPI00001AF305.csv,MRRRAARGPGPPPPGPGLSRLPLPLLLLLALGTRGGCAAPAPAPRAEDLSLGVEWLSRFGYLPPADPTTGQLQTQEELSKAITAMQQFGGLEATGILDEATLALMKTPRCSLPDLPVLTQARRRRQAPAPTKWNKRNLSWRVRTFPRDSPLGHDTVRALMYYALKVWSDIAPLNFHEVAGSAADIQIDFSKADHNDGYPFDGPGGTVAHAFFPGHHHTAGDTHFDDDEAWTFRSSDAHGMDLFAVAVHEFGHAIGLSHVAAAHSIMRPYYQGPVGDPLRYGLPYEDKVRVWQLYGVRESVSPTAQPEEPPLLPEPPDNRSSAPPRKDVPHRCSTHFDAVAQIRGEAFFFKGKYFWRLTRDRHLVSLQPAQMHRFWRGLPLHLDSVDAVYERTSDHKIVFFKGDRYWVFKDNNVEEGYPRPVSDFSLPPGGIDAAFSWAHNDRTYFFKDQLYWRYDDHTRHMDPGYPAQSPLWRGVPSTLDDAMRWSDGASYFFRGQEYWKVLDGELEVAPGYPQSTARDWLVCGDSQADGSVAAGVDAAEGPRAPPGQHDQSRSEDGYEVCSCTSGASSPPGAPGPLVAATMLLLLPPLSPGALWTAAQALTL,mutated_sequence,1.0,603.0,UPI00001AF305.a2m,UPI00001AF305.npy,gnomAD
+UPI0001DD21C7,UPI0001DD21C7.csv,MGAPSACRTLVLALAAMLVVPQAETQGPVEPSWENAGHTMDGGAPTSSPTRRVSFVPPVTVFPSLSPLNPAHNGRVCSTWGDFHYKTFDGDVFRFPGLCNYVFSEHCRAAYEDFNVQLRRGLVGSRPVVTRVVIKAQGLVLEASNGSVLINGQREELPYSRTGLLVEQSGDYIKVSIRLVLTFLWNGEDSALLELDPKYANQTCGLCGDFNGLPAFNEFYAHNARLTPLQFGNLQKLDGPTEQCPDPLPLPAGNCTDEEGICHRTLLGPAFAECHALVDSTAYLAACAQDLCRCPTCPCATFVEYSRQCAHAGGQPRNWRCPELCPRTCPLNMQHQECGSPCTDTCSNPQRAQLCEDHCVDGCFCPPGTVLDDITHSGCLPLGQCPCTHGGRTYSPGTSFNTTCSSCTCSGGLWQCQDLPCPGTCSVQGGAHISTYDEKLYDLHGDCSYVLSKKCADSSFTVLAELRKCGLTDNENCLKAVTLSLDGGDTAIRVQADGGVFLNSIYTQLPLSAANITLFTPSSFFIVVQTGLGLQLLVQLVPLMQVFVRLDPAHQGQMCGLCGNFNQNQADDFTALSGVVEATGAAFANTWKAQAACANARNSFEDPCSLSVENENYARHWCSRLTDPNSAFSRCHSIINPKPFHSNCMFDTCNCERSEDCLCAALSSYVHACAAKGVQLSDWRDGVCTKYMQNCPKSQRYAYVVDACQPTCRGLSEADVTCSVSFVPVDGCTCPAGTFLNDAGACVPAQECPCYAHGTVLAPGEVVHDEGAVCSCTGGKLSCLGASLQKSTGCAAPMVYLDCSNSSAGTPGAECLRSCHTLDVGCFSTHCVSGCVCPPGLVSDGSGGCIAEEDCPCVHNEATYKPGETIRVDCNTCTCRNRRWECSHRLCLGTCVAYGDGHFITFDGDRYSFEGSCEYILAQDYCGDNTTHGTFRIVTENIPCGTTGTTCSKAIKLFVESYELILQEGTFKAVARGPGGDPPYKIRYMGIFLVIETHGMAVSWDRKTSVFIRLHQDYKGRVCGLCGNFDDNAINDFATRSRSVVGDALEFGNSWKLSPSCPDALAPKDPCTANPFRKSWAQKQCSILHGPTFAACRSQVDSTKYYEACVNDACACDSGGDCECFCTAVAAYAQACHDAGLCVSWRTPDTCPLFCDFYNPHGGCEWHYQPCGAPCLKTCRNPSGHCLVDLPGLEGCYPKCPPSQPFFNEDQMKCVAQCGCYDKDGNYYDVGARVPTAENCQSCNCTPSGIQCAHSLEACTCTYEDRTYSYQDVIYNTTDGLGACLIAICGSNGTIIRKAVACPGTPATTPFTFTTAWVPHSTTSPALPVSTVCVREVCRWSSWYNGHRPEPGLGGGDFETFENLRQRGYQVCPVLADIECRAAQLPDMPLEELGQQVDCDRMRGLMCANSQQSPPLCHDYELRVLCCEYVPCGPSPAPGTSPQPSLSASTEPAVPTPTQTTATEKTTLWVTPSIRSTAALTSQTGSSSGPVTVTPSAPGTTTCQPRCQWTEWFDEDYPKSEQLGGDVESYDKIRAAGGHLCQQPKDIECQAESFPNWTLAQVGQKVHCDVHFGLVCRNWEQEGVFKMCYNYRIRVLCCSDDHCRGRATTPPPTTELETATTTTTQALFSTPQPTSSPGLTRAPPASTTAVPTLSEGLTSPRYTSTLGTATTGGPTTPAGSTEPTVPGVATSTLPTRSALPGTTGSLGTWRPSQPPTLAPTTMATSRARPTGTASTASKEPLTTSLAPTLTSELSTSQAETSTPRTETTMSPLTNTTTSQGTTRCQPKCEWTEWFDVDFPTSGVAGGDMETFENIRAAGGKMCWAPKSIECRAENYPEVSIDQVGQVLTCSLETGLTCKNEDQTGRFNMCFNYNVRVLCCDDYSHCPSTPATSSTATPSSTPGTTWILTKPTTTATTTASTGSTATPTSTLRTAPPPKVLTTTATTPTVTSSKATPSSSPGTATALPALRSTATTPTATSVTPIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETAHTSTVLTATATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALTPPVWISTTTTPTTRGSTVTPSSIPGTTHTATVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSNTVTPSSALGTTHTPPVPNTMATTHGRSLPPSSPHTVRTAWTSATSGILGTTHITEPSTVTSHTLAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTTATSRTTATATPSKTRTSTLLPSSPTSAPITTVVTMGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTAMPSSTPGTTWILTELTTTATTTESTGSTATPSSTPGTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATTPTVTSSKATPFSSPGTATALPALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETVHTSTVLTTTATTTGATGSVATPSSTPGTAHTTKVLTTTTTGFTATPSSSPGTARTLPVWISTTTTPTTRGSTVTPSSIPGTTHTPTVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLSPSSPHTVRTAWTSATSGTLGTTHITEPSTGTSHTPAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTRATSRTTATATPSKTRTSTLLPSSPTSAPITTVVTMGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTATPSSTPGTTWILTEQTTAATTTATTGSTAIPSSTPGTAPPPKVLTSTATTPTATSSKATSSSSPRTATTLPVLTSTATKSTATSFTPIPSFTLGTTGTLPEQTTTPMATMSTIHPSSTPETTHTSTVLTTKATTTRATSSMSTPSSTPGTTWILTELTTAATTTAATGPTATPSSTPGTTWILTEPSTTATVTVPTGSTATASSTRATAGTLKVLTSTATTPTVISSRATPSSSPGTATALPALRSTATTPTATSVTAIPSSSLGTAWTRLSQTTTPTATMSTATPSSTPETVHTSTVLTTTTTTTRATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALTPPVWISTTTTPTTRGSTVTPSSIPGTTHTATVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLPPSSPHTVRTAWTSATSGILGTTHITEPSTVTSHTPAATTSTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTRGTSRTTATATPSKTRTSTLLPSSPTSAPITTVVTTGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTATPSSTPGTTWILTKLTTTATTTESTGSTATPSSTPGTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATTPTVTSSKATPFSSPGTATALPALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETAHTSTVLTTTATTTRATGSVATPSSTPGTAHTTKVPTTTTTGFTVTPSSSPGTARTPPVWISTTTTPTTSGSTVTPSSVPGTTHTPTVLTTTTTTVATGSMATPSSSTQTSGTPPSLITTATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLSPSSPHTVRTAWTSATSGTLGTTHITEPSTGTSHTPAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTTATSRTTATATPSKTRTSTLLPSSPTSAPITTVVTTGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLGELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPATSSTAMPSSTPGTTWILTELTTTATTTASTGSTATPSSTPGTAPPPKVLTSPATTPTATSSKATSSSSPRTATTLPVLTSTATKSTATSVTPIPSSTLGTTGTLPEQTTTPVATMSTIHPSSTPETTHTSTVLTTKATTTRATSSTSTPSSTPGTTWILTELTTAATTTAATGPTATPSSTPGTTWILTELTTTATTTASTGSTATPSSTPGTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATTPTVTSSKATPSSSPGTATALPALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETVHTSTVLTATATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALTPPVWISTTTTPTTTTPTTSGSTVTPSSIPGTTHTARVLTTTTTTVATGSMATPSSSTQTSGTPPSLTTTATTITATGSTTNPSSTPGTTPITPVLTSTATTPAATSSKATSSSSPRTATTLPVLTSTATKSTATSFTPIPSSTLWTTWTVPAQTTTPMSTMSTIHTSSTPETTHTSTVLTTTATMTRATNSTATPSSTLGTTRILTELTTTATTTAATGSTATLSSTPGTTWILTEPSTIATVMVPTGSTATASSTLGTAHTPKVVTTMATMPTATASTVPSSSTVGTTRTPAVLPSSLPTFSVSTVSSSVLTTLRPTGFPSSHFSTPCFCRAFGQFFSPGEVIYNKTDRAGCHFYAVCNQHCDIDRFQGACPTSPPPVSSAPLSSPSPAPGCDNAIPLRQVNETWTLENCTVARCVGDNRVVLLDPKPVANVTCVNKHLPIKVSDPSQPCDFHYECECICSMWGGSHYSTFDGTSYTFRGNCTYVLMREIHARFGNLSLYLDNHYCTASATAAAARCPRALSIHYKSMDIVLTVTMVHGKEEGLILFDQIPVSSGFSKNGVLVSVLGTTTMRVDIPALGVSVTFNGQVFQARLPYSLFHNNTEGQCGTCTNNQRDDCLQRDGTTAASCKDMAKTWLVPDSRKDGCWAPTGTPPTASPAAPVSSTPTPTPCPPQPLCDLMLSQVFAECHNLVPPGPFFNACISDHCRGRLEVPCQSLEAYAELCRARGVCSDWRGATGGLCDLTCPPTKVYKPCGPIQPATCNSRNQSPQLEGMAEGCFCPEDQILFNAHMGICVQACPCVGPDGFPKFPGERWVSNCQSCVCDEGSVSVQCKPLPCDAQGQPPPCNRPGFVTVTRPRAENPCCPETVCVCNTTTCPQSLPVCPPGQESICTQEEGDCCPTFRCRPQLCSYNGTFYGVGATFPGALPCHMCTCLSGDTQDPTVQCQEDACNNTTCPQGFEYKRVAGQCCGECVQTACLTPDGQPVQLNETWVNSHVDNCTVYLCEAEGGVHLLTPQPASCPDVSSCRGSLRKTGCCYSCEEDSCQVRINTTILWHQGCETEVNITFCEGSCPGASKYSAEAQAMQHQCTCCQERRVHEETVPLHCPNGSAILHTYTHVDECGCTPFCVPAPMAPPHTRGFPAQEATAV,mutated_sequence,1.0,5762.0,UPI0001DD21C7.a2m,UPI0001DD21C7.npy,gnomAD
+UPI000004A10E,UPI000004A10E.csv,MLGGSAGRLKMSSSGTLSNYYVDSLIGHEGDEVFAARFGPPGPGAQGRPAGVADGPAATAAEFASCSFAPRSAVFSASWSAVPSQPPAAAAMSGLYHPYVPPPPLAASASEPGRYVRSWMEPLPGFPGGAGGGGGGGGGGPGRGPSPGPSGPANGRHYGIKPETRAAPAPATAASTTSSSSTSLSSSSKRTECSVARESQGSSGPEFSCNSFLQEKAAAATGGTGPGAGIGAATGTGGSSEPSACSDHPIPGCSLKEEEKQHSQPQQQQLDPNNPAANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKIWFQNRRMKMKKMSKEKCPKGD,mutated_sequence,1.0,352.0,UPI000004A10E.a2m,UPI000004A10E.npy,gnomAD
+UPI000007443D,UPI000007443D.csv,MPRPELPLPEGWEEARDFDGKVYYIDHTNRTTSWIDPRDRYTKPLTFADCISDELPLGWEEAYDPQVGDYFIDHNTKTTQIEDPRVQWRREQEHMLKDYLVVAQEALSAQKEIYQVKQQRLELAQQEYQQLHAVWEHKLGSQVSLVSGSSSSSKYDPEILKAEIATAKSRVNKLKREMVHLQHELQFKERGFQTLKKIDKKMSDAQGSYKLDEAQAVLRETKAIKKAITCGEKEKQDLIKSLAMLKDGFRTDRGSHSDLWSSSSSLESSSFPLPKQYLDVSSQTDISGSFGINSNNQLAEKVRLRLRYEEAKRRIANLKIQLAKLDSEAWPGVLDSERDRLILINEKEELLKEMRFISPRKWTQGEVEQLEMARKRLEKDLQAARDTQSKALTERLKLNSKRNQLVRELEEATRQVATLHSQLKSLSSSMQSLSSGSSPGSLTSSRGSLVASSLDSSTSASFTDLYYDPFEQLDSELQSKVEFLLLEGATGFRPSGCITTIHEDEVAKTQKAEGGGRLQALRSLSGTPKSMTSLSPRSSLSSPSPPCSPLMADPLLAGDAFLNSLEFEDPELSATLCELSLGNSAQERYRLEEPGTEGKQLGQAVNTAQGCGLKVACVSAAVSDESVAGDSGVYEASVQRLGASEAAAFDSDESEAVGATRIQIALKYDEKNKQFAILIIQLSNLSALLQQQDQKVNIRVAVLPCSESTTCLFRTRPLDASDTLVFNEVFWVSMSYPALHQKTLRVDVCTTDRSHLEECLGGAQISLAEVCRSGERSTRWYNLLSYKYLKKQSRELKPVGVMAPASGPASTDAVSALLEQTAVELEKRQEGRSSTQTLEDSWRYEETSENEAVAEEEEEEVEEEEGEEDVFTEKASPDMDGYPALKVDKETNTETPAPSPTVVRPKDRRVGTPSQGPFLRGSTIIRSKTFSPGPQSQYVCRLNRSDSDSSTLSKKPPFVRNSLERRSVRMKRPSSVKSLRSERLIRTSLDLELDLQATRTWHSQLTQEISVLKELKEQLEQAKSHGEKELPQWLREDERFRLLLRMLEKRQMDRAEHKGELQTDKMMRAAAKDVHRLRGQSCKEPPEVQSFREKMAFFTRPRMNIPALSADDV,mutated_sequence,1.0,1113.0,UPI000007443D.a2m,UPI000007443D.npy,gnomAD
+UPI0000367134,UPI0000367134.csv,MVFRRFVEVGRVAYVSFGPHAGKLVAIVDVIDQNRALVDGPCTQVRRQAMPFKCMQLTDFILKFPHSAHQKYVRQAWQKADINTKWAATRWAKKIEARERKAKMTDFDRFKVMKAKKMRNRIIKNEVKKLQKAALLKASPKKAPGTKGTAAAAAAAAAAKVPAKKITAASKKAPAQKVPAQKATGQKAAPAPKAQKGQKAPAQKAPAPKASGKKA,mutated_sequence,1.0,215.0,UPI0000367134.a2m,UPI0000367134.npy,gnomAD
+UPI0000048EB8,UPI0000048EB8.csv,MQPRRAQAPGAQLLPALALLLLLLGAGPRGSSLANPVPAAPLSAPGPCAAQPCRNGGVCTSRPEPDPQHPAPAGEPGYSCTCPAGISGANCQLVADPCASNPCHHGNCSSSSSSSSDGYLCICNEGYEGPNCEQALPSLPATGWTESMAPRQLQPVPATQEPDKILPRSQATVTLPTWQPKTGQKVVEMKWDQVEVIPDIACGNASSNSSAGGRLVSFEVPQNTSVKIRQDATASLILLWKVTATGFQQCSLIDGRSVTPLQASGGLVLLEEMLALGNNHFIGFVNDSVTKSIVALRLTLVVKVSTCVPGESHANDLECSGKGKCTTKPSEATFSCTCEEQYVGTFCEEYDACQRKPCQNNASCIDANEKQDGSNFTCVCLPGYTGELCQSKIDYCILDPCRNGATCISSLSGFTCQCPEGYFGSACEEKVDPCASSPCQNNGTCYVDGVHFTCNCSPGFTGPTCAQLIDFCALSPCAHGTCRSVGTSYKCLCDPGYHGLYCEEEYNECLSAPCLNAATCRDLVNGYECVCLAEYKGTHCELYKDPCANVSCLNGATCDSDGLNGTCICAPGFTGEECDIDINECDSNPCHHGGSCLDQPNGYNCHCPHGWVGANCEIHLQWKSGHMAESLTNMPRHSLYIIIGALCVAFILMLIILIVGICRISRIEYQGSSRPAYEEFYNCRSIDSEFSNAIASIRHARFGKKSRPAMYDVSPIAYEDYSPDDKPLVTLIKTKDL,mutated_sequence,1.0,737.0,UPI0000048EB8.a2m,UPI0000048EB8.npy,gnomAD
+UPI00015E04B4,UPI00015E04B4.csv,RSPTLRKPLKHSTPEEAALGWSPRPSGGASYLSGSPMPAHFSQDLASHPAGVSPPATVRKRRLSTLWASKESSLDLSAPGEEPPTSASLTQRQRQRQQQQQQQESLRAKSWAQNPGLPGILNTTGRKRRDPKKRAAAMERVRQWEIYVLQNIEEATQHELTIEDD,mutated_sequence,1.0,165.0,UPI00015E04B4.a2m,UPI00015E04B4.npy,gnomAD
+UPI00015E06EA,UPI00015E06EA.csv,MMAAAAAAAAGSSSSGGGGGGSGSSSSSSDTSSTGEEERMRRLFQTCDGDGDGYISRNDLLMVCRQLNMEESVAEIMNQLGADENGKISFQDFTRCRMQLVREIRKEEVDLSAKSDNSCTKKLRDRIASWPTSSDNSLGALSAARESWEYDSGARDLQSPDVQSQSALQKLLEYGGSSLHQQAALHKLLTQSPHIGNSVGGSYLELANTLHSAALASLKGDIVELNKRLQQTERERDLLEKKLAKAQCEQSHLMREHEDVQERTTLRYEERITELHSVIAELNKKIDRLQGTTIREEDEYSELRSELSQSQHEVNEDSRSMDQDQTSVSIPENQSTMVTADMDNCSDLNSELQRVLTGLENVVCGRKKSSCSLSVAEVDKHIEQLTTASEHCDLAIKTVEEIEGVLGRDLYPNLAEERSRWEKELAGLREENESLTAMLCSKEEELNRTKATMNAIREERDRLRRRVRELQTRLQSVQATGPSSPGRLTSTNRPINPSTGELSTSSSSNDIPIAKIAERVKLSKTRSESSSSDRPVLGSEISSIGVSSSVAEHLAHSLQDCSNIQEIFQTLYSHGSAISESKIREFEVETERLNSRIEHLKSQNDLLTITLEECKSNAERMSMLVGKYESNATALRLALQYSEQCIEAYELLLALAESEQSLILGQFRAAGVGSSPGDQSGDENITQMLKRAHDCRKTAENAAKALLMKLDGSCGGAFAVAGCSVQPWESLSSNSHTSTTSSTASSCDTEFTKEDEQRLKDYIQQLKNDRAAVKLTMLELESIHIDPLSYDVKPRGDSQRLDLENAVLMQELMAMKEEMAELKAQLYLLEKEKKALELKLSTREAQEQAYLVHIEHLKSEVEEQKEQRMRSLSSTSSGSKDKPGKECADAASPALSLAELRTTCSENELAAEFTNAIRREKKLKARVQELVSALERLTKSSEIRHQQSAEFVNDLKRANSNLVAAYEKAKKKHQNKLKKLESQMMAMVERHETQVRMLKQRIALLEEENSRPHTNETSL,mutated_sequence,1.0,1019.0,UPI00015E06EA.a2m,UPI00015E06EA.npy,gnomAD
+UPI0002065974,UPI0002065974.csv,MSVSARSAAAEERSVNSSTMGQQKNLEGYVGFANLPNQVYRKSVKRGFEFTLMVVGESGLGKSTLINSLFLTDLYSPEYPGPSHRIKKTVQVEQSKVLIKEGGVQLLLTIVDTPGFGDAVDNSNCWQPVIDYIDSKFEDYLNAESRVNRRQMPDNRVQCCLYFIAPSGHGLKPLDIEFMKRLHEKVNIIPLIAKADTLTPEECQQFKKQIMKEIQEHKIKIYEFPETAKKKKKKK,mutated_sequence,1.0,235.0,UPI0002065974.a2m,UPI0002065974.npy,gnomAD
+UPI0000073AA5,UPI0000073AA5.csv,MVSSVLPNPTSAECWAALLHDPMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAALSDYEQLRQVHTANLPHVFNEGRGPKQPEREPQPGHKVERPCLQRQDDIAQEKRLSRGISHASSAIVSLARSHVASECNNEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDLIEQATMVALEQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRKALYTMMRTGAEREALKRRWRWQQTQQNKEEEWEREWTELLKLASSEPRTHFSKNGGTGGGVDNSEDPVYESLEEFHVFVLAHILRRPIVVVADTMLRDSGGEAFAPIPFGGIYLPLEVPPNRCHCSPLVLAYDQAHFSALVSMEQRDQQREQAVIPLTDSEHKLLPLHFAVDPGKDWEWGKDDNDNARLAHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTASAGEDVQSLADSLDSDRDSVCSNSNSNNGKNGKDKEKEKQRKEKDKTRADSVANKLGSFSKTLGIKLKKNMGGLGGLVHGKMGRANSANGKNGDSAERGKEKKAKSRKGSKEESGASASTSPSEKTTPSPTDKAAGASPAEKGGGPRGDAWKYSTDVKLSLNILRAAMQGERKFIFAGLLLTSHRHQFHEEMIGYYLTSAQERFSAEQEQRRRDAATAAAAAAAAAAATAKRPPRRPETEGVPVPERASPGPPTQLVLKLKERPSPGPAAGRAARAAAGGTASPGGGARRASASGPVPGRSPPAPARQSVIHVQASGARDEACAPAVGALRPCATYPQQNRSLSSQSYSPARAAALRTVNTVESLARAVPGALPGAAGTAGAAEHKSQTYTNGFGALRDGLEFADADAPTARSNGECGRGGPGPVQRRCQRENCAFYGRAETEHYCSYCYREELRRRREARGARP,mutated_sequence,1.0,926.0,UPI0000073AA5.a2m,UPI0000073AA5.npy,gnomAD
+UPI00001AFDEB,UPI00001AFDEB.csv,MVFRNVGRPPEEEDVEAAPEPGPSELLCPRHRCALDPKALPPGLALERTWGPAAGLEAQLAALGLGQPAGPGVKTVGGGCCPCPCPPQPPPPQPQPPAAAPQAGEDPTETSDALLVLEGLESEAESLETNSCSEEELSSPGRGGGGGGRLLLQPPGPELPPVPFPLQDLVPLGRLSRGEQQQQQQQQPPPPPPPPGPLRPLAGPSRKGSFKIRLSRLFRTKSCNGGSGGGDGTGKRPSGELAASAASLTDMGGSAGRELDAGRKPKLTRTQSAFSPVSFSPLFTGETVSLVDVDISQRGLTSPHPPTPPPPPRRSLSLLDDISGTLPTSVLVAPMGSSLQSFPLPPPPPPHAPDAFPRIAPIRAAESLHSQPPQHLQCPLYRPDSSSFAASLRELEKCGWYWGPMNWEDAEMKLKGKPDGSFLVRDSSDPRYILSLSFRSQGITHHTRMEHYRGTFSLWCHPKFEDRCQSVVEFIKRAIMHSKNGKFLYFLRSRVPGLPPTPVQLLYPVSRFSNVKSLQHLCRFRIRQLVRIDHIPDLPLPKPLISYIRKFYYYDPQEEVYLSLKEAQLISKQKQEVEPST,mutated_sequence,1.0,581.0,UPI00001AFDEB.a2m,UPI00001AFDEB.npy,gnomAD
+UPI0000073F52,UPI0000073F52.csv,MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVSRNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVAIEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQSQ,mutated_sequence,1.0,495.0,UPI0000073F52.a2m,UPI0000073F52.npy,gnomAD
+UPI00002326B6,UPI00002326B6.csv,MPRLPVKKIRKQMKLLLLLLLLSCAAWLTYVHLGLVRQGRALRQRLGYGRDGEKLTSETDGRGVHAAPSTQRAEDSSESREEEQAPEGRDLDMLFPGGAGRLPLNFTHQTPPWREEYKGQVNLHVFEDWCGGAVGHLRRNLHFPLFPHTRTTVKKLAVSPKWKNYGLRIFGFIHPARDGDVQFSVASDDNSEFWLSLDESPAAAQLVAFVGKTGSEWTAPGEFTKFSSQVSKPRRLMASRRYYFELLHKQDDRGSDHVEVGWRAFLPGLKFEVISSAHISLYTDESALKMDHVAHVPQSPASHVGGRPPQEETSADMLRPDPRDTFFLTPRMESSSLENVLEPCAYAPTYVVKDFPIARYQGLQFVYLSFVYPNDYTRLTHMETDNKCFYRESPLYLERFGFYKYMKMDKEEGDEDEEDEVQRRAFLFLNPDDFLDDEDEGELLDSLEPTEAAPPRSGPQSPAPAAPAQPGATLAPPTPPRPRDGGTPRHSRALSWAARAARPLPLFLGRAPPPRPAVEQPPPKVYVTRVRPGQRASPRAPAPRAPWPPFPGVFLHPRPLPRVQLRAPPRPPRPHGRRTGGPQATQPRPPARAQATQGGREGQARTLGPAAPTVDSNLSSEARPVTSFLSLSQVSGPQLPGEGEEEEEGEDDGAPGDEAASEDSEEAAGPALGRWREDAIDWQRTFSVGAVDFELLRSDWNDLRCNVSGNLQLPEAEAVDVTAQYMERLNARHGGRFALLRIVNVEKRRDSARGSRFLLELELQERGGGRLRLSEYVFLRLPGARVGDADGESPEPAPAASVRPDGRPELCRPLRLAWRQDVMVHFIVPVKNQARWVAQFLADMAALHARTGDSRFSVVLVDFESEDMDVERALRAARLPRYQYLRRTGNFERSAGLQAGVDAVEDASSIVFLCDLHIHFPPNILDGIRKHCVEGRLAFAPVVMRLSCGSSPRDPHGYWEVNGFGLFGIYKSDFDRVGGMNTEEFRDQWGGEDWELLDRVLQAGLEVERLRLRNFYHHYHSKRGMWSVRSRKGSRTGAS,mutated_sequence,1.0,1039.0,UPI00002326B6.a2m,UPI00002326B6.npy,gnomAD
+UPI0000129881,UPI0000129881.csv,MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPDEDREPESARMQRAQMAALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSPGRGREGPGEEHFEDMASDEDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEEDEEGLGPPGPASLGTTALFPRKAQPPQAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPPDHGDWTYEEQFKQLYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLFMLYVLVTEKGGLVEVINKKLWREITKGLNLPTSITSAAFTLRTQYMKYLYPYECEKRGLSNPNELQAAIDSNRREGRRQSFGGSLFAYSPGGAHGMLSSPKLPVSSLGLAASTNGSSITPAPKIKKEEDSAIPITVPGRLPVSLAGHPVVAAQAAAVQAAAAQAAVAAQAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQNFLAMAAQLPMSIRINSQASESRQDSAVNLTGTNGSNSISMSVEINGIMYTGVLFAQPPAPTPTSAPNKGGGGGGGSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP,mutated_sequence,1.0,593.0,UPI0000129881.a2m,UPI0000129881.npy,gnomAD
+UPI000007385F,UPI000007385F.csv,MAAGGSAPEPRVLVCLGALLAGWVAVGLEAVVIGEVHENVTLHCGNISGLRGQVTWYRNNSEPVFLLSSNSSLRPAEPRFSLVDATSLHIESLSLGDEGIYTCQEILNVTQWFQVWLQVASGPYQIEVHIVATGTLPNGTLYAARGSQVDFSCNSSSRPPPVVEWWFQALNSSSESFGHNLTVNFFSLLLISPNLQGNYTCLALNQLSKRHRKVTTELLVYYPPPSAPQCWAQMASGSFMLQLTCRWDGGYPDPDFLWIEEPGGVIVGKSKLGVEMLSESQLSDGKKFKCVTSHIVGPESGASCMVQIRGPSLLSEPMKTCFTGGNVTLTCQVSGAYPPAKILWLRNLTQPEVIIQPSSRHLITQDGQNSTLTIHNCSQDLDEGYYICRADSPVGVREMEIWLSVKEPLNIGGIVGTIVSLLLLGLAIISGLLLHYSPVFCWKVGNTSRGQNMDDVMVLVDSEEEEEEEEEEEEDAAVGEQEGAREREELPKEIPKQDHIHRVTALVNGNIEQMGNGFQDLQDDSSEEQSDIVQEEDRPV,mutated_sequence,1.0,540.0,UPI000007385F.a2m,UPI000007385F.npy,gnomAD
+UPI0001610E83,UPI0001610E83.csv,MLKPGDPGGSAFLKVDPAYLQHWQQLFPHGGAGPLKGSGAAGLLSAPQPLQPPPPPPPPERAEPPPDSLRPRPASLSSASSTPASSSTSASSASSCAAAAAAAALAGLSALPVSQLPVFAPLAAAAVAAEPLPPKELCLGATSGPGPVKCGGGGGGGGEGRGAPRFRCSAEELDYYLYGQQRMEIIPLNQHTSDPNNRCDMCADNRNGECPMHGPLHSLRRLVGTSSAAAAAPPPELPEWLRDLPREVCLCTSTVPGLAYGICAAQRIQQGTWIGPFQGVLLPPEKVQAGAVRNTQHLWEIYDQDGTLQHFIDGGEPSKSSWMRYIRCARHCGEQNLTVVQYRSNIFYRACIDIPRGTELLVWYNDSYTSFFGIPLQCIAQDENLNVPSTVMEAMCRQDALQPFNKSSKLAPTTQQRSVVFPQTPCSRNFSLLDKSGPIESGFNQINVKNQRVLASPTSTSQLHSEFSDWHLWKCGQCFKTFTQRILLQMHVCTQNPDRPYQCGHCSQSFSQPSELRNHVVTHSSDRPFKCGYCGRAFAGATTLNNHIRTHTGEKPFKCERCERSFTQATQLSRHQRMPNECKPITESPESIEVD,mutated_sequence,1.0,595.0,UPI0001610E83.a2m,UPI0001610E83.npy,gnomAD
+UPI000006F554,UPI000006F554.csv,MKTRQNKDSMSMRSGRKKEAPGPREELRSRGRASPGGVSTSSSDGKAEKSRQTAKKARVEEASTPKVNKQGRSEEISESESEETNAPKKTKTEQELPRPQSPSDLDSLDGRSLNDDGSSDPRDIDQDNRSTSPSIYSPGSVENDSDSSSGLSQGPARPYHPPPLFPPSPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGAPPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAPPTKPPTTPVGGGNLPSAPPPANFPHVTPNLPPPPALRPLNNASASPPGLGAQPLPGHLPSPHAMGQGMGGLPPGPEKGPTLAPSPHSLPPASSSAPAPPMRFPYSSSSSSSAAASSSSSSSSSSASPFPASQALPSYPHSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGRLLANSNAHPGPFPPSTGAQSTAHPPVSTHHHHHQQQQQQQQQQQQQQQQQQQHHGNSGPPPPGAFPHPLEGGSSHHAHPYAMSPSLGSLRPYPPGPAHLPPPHSQVSYSQAGPNGPPVSSSSNSSSSTSQGSYPCSHPSPSQGPQGAPYPFPPVPTVTTSSATLSTVIATVASSPAGYKTASPPGPPPYGKRAPSPGAYKTATPPGYKPGSPPSFRTGTPPGYRGTSPPAGPGTFKPGSPTVGPGPLPPAGPSGLPSLPPPPAAPASGPPLSATQIKQEPAEEYETPESPVPPARSPSPPPKVVDVPSHASQSARFNKHLDRGFNSCARSDLYFVPLEGSKLAKKRADLVEKVRREAEQRAREEKEREREREREKEREREKERELERSVKLAQEGRAPVECPSLGPVPHRPPFEPGSAVATVPPYLGPDTPALRTLSEYARPHVMSPGNRNHPFYVPLGAVDPGLLGYNVPALYSSDPAAREREREARERDLRDRLKPGFEVKPSELEPLHGVPGPGLDPFPRHGGLALQPGPPGLHPFPFHPSLGPLERERLALAAGPALRPDMSYAERLAAERQHAERVAALGNDPLARLQMLNVTPHHHQHSHIHSHLHLHQQDAIHAASASVHPLIDPLASGSHLTRIPYPAGTLPNPLLPHPLHENEVLRHQLFAAPYRDLPASLSAPMSAAHQLQAMHAQSAELQRLALEQQQWLHAHHPLHSVPLPAQEDYYSHLKKESDKPL,mutated_sequence,1.0,1190.0,UPI000006F554.a2m,UPI000006F554.npy,gnomAD
+UPI00015E05B9,UPI00015E05B9.csv,MWITSLITMKDEDIFDLAGDPQDSLIPGPSPTPKQNPVCNEAFFGHQTFSLRETGLKSVHCQGQEAENMEREKFQQKALKQTKQKKSKSAEFLMVKEDREATEGTGNPAFNMSSPDLSACQTAEKKVIRHDMPDRTLAAHQQKFRLPASAEPKGNEYGRNYFDPLMDEEINPRQCATEVSREDESGREETLNSEAPGSSNKSHEIHKEASEATTAHLEEFQRSQKTIILLGSSPLEQEIRSTSLHCMEDEMSHPWILLLKVTAVIRSRRYYREQRF,mutated_sequence,1.0,276.0,UPI00015E05B9.a2m,UPI00015E05B9.npy,gnomAD
+UPI000004F638,UPI000004F638.csv,MEPRCPPPCGCCERLVLNVAGLRFETRARTLGRFPDTLLGDPARRGRFYDDARREYFFDRHRPSFDAVLYYYQSGGRLRRPAHVPLDVFLEEVAFYGLGAAALARLREDEGCPVPPERPLPRRAFARQLWLLFEFPESSQAARVLAVVSVLVILVSIVVFCLETLPDFRDDRDGTGLAAAAAAGPFPAPLNGSSQMPGNPPRLPFNDPFFVVETLCICWFSFELLVRLLVCPSKAIFFKNVMNLIDFVAILPYFVALGTELARQRGVGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLRASMRELGLLIFFLFIGVVLFSSAVYFAEVDRVDSHFTSIPESFWWAVVTMTTVGYGDMAPVTVGGKIVGSLCAIAGVLTISLPVPVIVSNFSYFYHRETEGEEAGMFSHVDMQPCGPLEGKANGGLVDGEVPELPPPLWAPPGKHLVTEV,mutated_sequence,1.0,456.0,UPI000004F638.a2m,UPI000004F638.npy,gnomAD
+UPI000006FECD,UPI000006FECD.csv,MKDAAEELSFARVLLQRVDELEKLFKDREQFLELVSRKLSLVPGAEEVTMVTWEELEQAITDGWRASQAGSETLMGFSKHGGFTSLTSPEGTLSGDSTKQPSIEQALDSASGLGPDRTASGSGGTAHPSDGVSSREQSKVPSGTGRQQQPRARDEAGVPRLHQSSTFQFKSDSDRHRSREKLTSTQPRRNARPGPVQQDLPLARDQPSSVPASQSQVHLRPDRRGLEPTGMNQPGLVPASTYPHGVVPLSMGQLGVPPPEMDDRELIPFVVDEQRMLPPSVPGRDQQGLELPSTDQHGLVSVSAYQHGMTFPGTDQRSMEPLGMDQRGCVISGMGQQGLVPPGIDQQGLTLPVVDQHGLVLPFTDQHGLVSPGLMPISADQQGFVQPSLEATGFIQPGTEQHDLIQSGRFQRALVQRGAYQPGLVQPGADQRGLVRPGMDQSGLAQPGADQRGLVWPGMDQSGLAQPGRDQHGLIQPGTGQHDLVQSGTGQGVLVQPGVDQPGMVQPGRFQRALVQPGAYQPGLVQPGADQIDVVQPGADQHGLVQSGADQSDLAQPGAVQHGLVQPGVDQRGLAQPRADHQRGLVPPGADQRGLVQPGADQHGLVQPGVDQHGLAQPGEVQRSLVQPGIVQRGLVQPGAVQRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQPGADQPGLVQPGAGQLGMVQPGIGQQGMVQPQADPHGLVQPGAYPLGLVQPGAYLHDLSQSGTYPRGLVQPGMDQYGLRQPGAYQPGLIAPGTKLRGSSTFQADSTGFISVRPYQHGMVPPGREQYGQVSPLLASQGLASPGIDRRSLVPPETYQQGLMHPGTDQHSPIPLSTGLGSTHPDQQHVASPGPGEHDQVYPDAAQHGHAFSLFDSHDSMYPGYRGPGYLSADQHGQEGLDPNRTRASDRHGIPAQKAPGQDVTLFRSPDSVDRVLSEGSEVSSEVLSERRNSLRRMSSSFPTAVETFHLMGELSSLYVGLKESMKDLDEEQAGQTDLEKIQFLLAQMVKRTIPPELQEQLKTVKTLAKEVWQEKAKVERLQRILEGEGNQEAGKELKAGELRLQLGVLRVTVADIEKELAELRESQDRGKAAMENSVSEASLYLQDQLDKLRMIIESMLTSSSTLLSMSMAPHKAHTLAPGQIDPEATCPACSLDVSHQVSTLVRRYEQLQDMVNSLAVSRPSKKAKLQRQDEELLGRVQSAILQVQGDCEKLNITTSNLIEDHRQKQKDIAMLYQGLEKLEKEKANREHLEMEIDVKADKSALATKVSRVQFDATTEQLNHMMQELVAKMSGQEQDWQKMLDRLLTEMDNKLDRLELDPVKQLLEDRWKSLRQQLRERPPLYQADEAAAMRRQLLAHFHCLSCDRPLETPVTGHAIPVTPAGPGLPGHHSIRPYTVFELEQVRQHSRNLKLGSAFPRGDLAQMEQSVGRLRSMHSKMLMNIEKVQIHFGGSTKASSQIIRELLHAQCLGSPCYKRVTDMADYTYSTVPRRCGGSHTLTYPYHRSRPQHLPRGLYPTEEIQIAMKHDEVDILGLDGHIYKGRMDTRLPGILRKDSSGTSKRKSQQPRPHVHRPPSLSSNGQLPSRPQSAQISAGNTSER,mutated_sequence,1.0,1663.0,UPI000006FECD.a2m,UPI000006FECD.npy,gnomAD
+UPI00025A2F44,UPI00025A2F44.csv,MGQPSLTWMLMVVVASWFITTAATDTSEARTKYNCPARWSWRTPQRGGDTEQGPDEDFTSQGSRKHKMCKEREGQQVHRKSYQRWCSECHSNATCTEDEAVTTCTCQEGFTGDGLTCVDLDECAIPGAHNCSANSSCVNTPGSFSCVCPEGFRLSPGLGCT,mutated_sequence,1.0,161.0,UPI00025A2F44.a2m,UPI00025A2F44.npy,gnomAD
+UPI000013D104,UPI000013D104.csv,MESNFNQEGVPRPSYVFSADPIARPSEINFDGIKLDLSHEFSLVAPNTEANSFESKDYLQVCLRIRPFTQSEKELESEGCVHILDSQTVVLKEPQCILGRLSEKSSGQMAQKFSFSKVFGPATTQKEFFQGCIMQPVKDLLKGQSRLIFTYGLTNSGKTYTFQGTEENIGILPRTLNVLFDSLQERLYTKMNLKPHRSREYLRLSSEQEKEEIASKSALLRQIKEVTVHNDSDDTLYGSLTNSLNISEFEESIKDYEQANLNMANSIKFSVWVSFFEIYNEYIYDLFVPVSSKFQKRKMLRLSQDVKGYSFIKDLQWIQVSDSKEAYRLLKLGIKHQSVAFTKLNNASSRSHSIFTVKILQIEDSEMSRVIRVSELSLCDLAGSERTMKTQNEGERLRETGNINTSLLTLGKCINVLKNSEKSKFQQHVPFRESKLTHYFQSFFNGKGKICMIVNISQCYLAYDETLNVLKFSAIAQKVCVPDTLNSSQEKLFGPVKSSQDVSLDSNSNSKILNVKRATISWENSLEDLMEDEDLVEELENAEETQNVETKLLDEDLDKTLEENKAFISHEEKRKLLDLIEDLKKKLINEKKEKLTLEFKIREEVTQEFTQYWAQREADFKETLLQEREILEENAERRLAIFKDLVGKCDTREEAAKDICATKVETEETHNYVGFEDIIDSLQDNVADIKKQAEIAHLYIASLPDPQEATACLELKFNQIKAELAKTKGELIKTKEELKKRENESDSLIQELETSNKKIITQNQRIKELINIIDQKEDTINEFQNLKSHMENTFKCNDKADTSSLIINNKLICNETVEVPKDSKSKICSERKRVNENELQQDEPPAKKGSIHVSSAITEDQKKSEEVRPNIAEIEDIRVLQENNEGLRAFLLTIENELKNEKEEKAELNKQIVHFQQELSLSEKKNLTLSKEVQQIQSNYDIAIAELHVQKSKNQEQEEKIMKLSNEIETATRSITNNVSQIKLMHTKIDELRTLDSVSQISNIDLLNLRDLSNGSEEDNLPNTQLDLLGNDYLVSKQVKEYRIQEPNRENSFHSSIEAIWEECKEIVKASSKKSHQIEELEQQIEKLQAEVKGYKDENNRLKEKEHKNQDDLLKEKETLIQQLKEELQEKNVTLDVQIQHVVEGKRALSELTQGVTCYKAKIKELETILETQKVECSHSAKLEQDILEKESIILKLERNLKEFQEHLQDSVKNTKDLNVKELKLKEEITQLTNNLQDMKHLLQLKEEEEETNRQETEKLKEELSASSARTQNLKADLQRKEEDYADLKEKLTDAKKQIKQVQKEVSVMRDEDKLLRIKINELEKKKNQCSQELDMKQRTIQQLKEQLNNQKVEEAIQQYERACKDLNVKEKIIEDMRMTLEEQEQTQVEQDQVLEAKLEEVERLATELEKWKEKCNDLETKNNQRSNKEHENNTDVLGKLTNLQDELQESEQKYNADRKKWLEEKMMLITQAKEAENIRNKEMKKYAEDRERFFKQQNEMEILTAQLTEKDSDLQKWREERDQLVAALEIQLKALISSNVQKDNEIEQLKRIISETSKIETQIMDIKPKRISSADPDKLQTEPLSTSFEISRNKIEDGSVVLDSCEVSTENDQSTRFPKPELEIQFTPLQPNKMAVKHPGCTTPVTVKIPKARKRKSNEMEEDLVKCENKKNATPRTNLKFPISDDRNSSVKKEQKVAIRPSSKKTYSLRSQASIIGVNLATKKKEGTLQKFGDFLQHSPSILQSKAKKIIETMSSSKLSNVEASKENVSQPKRAKRKLYTSEISSPIDISGQVILMDQKMKESDHQIIKRRLRTKTAK,mutated_sequence,1.0,1820.0,UPI000013D104.a2m,UPI000013D104.npy,gnomAD
+UPI000066DA6A,UPI000066DA6A.csv,MTAAAASNWGLITNIVNSIVGVSVLTMPFCFKQCGIVLGALLLVFCSWMTHQSCMFLVKSASLSKRRTYAGLAFHAYGKAGKMLVETSMIGLMLGTCIAFYVVIGDLGSNFFARLFGFQVGGTFRMFLLFAVSLCIVLPLSLQRNMMASIQSFSAMALLFYTVFMFVIVLSSLKHGLFSGQWLRRVSYVRWEGVFRCIPIFGMSFACQSQVLPTYDSLDEPSVKTMSSIFASSLNVVTTFYVMVGFFGYVSFTEATAGNVLMHFPSNLVTEMLRVGFMMSVAVGFPMMILPCRQALSTLLCEQQQKDGTFAAGGYMPPLRFKALTLSVVFGTMVGGILIPNVETILGLTGATMGSLICFICPALIYKKIHKNALSSQVVLWVGLGVLVVSTVTTLSVSEEVPEDLAEEAPGGRLGEAEGLMKVEAARLSAQDPVVAVAEDGREKPKLPKEREELEQAQIKGPVDVPGREDGKEAPEEAQLDRPGQGIAVPVGEAHRHEPPVPHDKVVVDEGQDREVPEENKPPSRHAGGKAPGVQGQMAPPLPDSEREKQEPEQGEVGKRPGQAQALEEAGDLPEDPQKVPEADGQPAVQPAKEDLGPGDRGLHPRPQAVLSEQQNGLAVGGGEKAKGGPPPGNAAGDTGQPAEDSDHGGKPPLPAEKPAPGPGLPPEPREQRDVERAGGNQAASQLEEAGRAEMLDHAVLLQVIKEQQVQQKRLLDQQEKLLAVIEEQHKEIHQQRQEDEEDKPRQVEVHQEPGAAVPRGQEAPEGKARETVENLPPLPLDPVLRAPGGRPAPSQDLNQRSLEHSEGPVGRDPAGPPDGGPDTEPRAAQAKLRDGQKDAAPRAAGTVKELPKGPEQVPVPDPAREAGGPEERLAEEFPGQSQDVTGGSQDRKKPGKEVAATGTSILKEANWLVAGPGAETGDPRMKPKQVSRDLGLAADLPGGAEGAAAQPQAVLRQPELRVISDGEQGGQQGHRLDHGGHLEMRKARGGDHVPVSHEQPRGGEDAAVQEPRQRPEPELGLKRAVPGGQRPDNAKPNRDLKLQAGSDLRRRRRDLGPHAEGQLAPRDGVIIGLNPLPDVQVNDLRGALDAQLRQAAGGALQVVHSRQLRQAPGPPEES,mutated_sequence,1.0,1119.0,UPI000066DA6A.a2m,UPI000066DA6A.npy,gnomAD
+UPI00004C8020,UPI00004C8020.csv,MFGDLFEEEYSTVSNNQYGKGKKLKTKALEPPAPREFTNLSGIRNQGGTCYLNSLLQTLHFTPEFREALFSLGPEELGLFEDKDKPDAKVRIIPLQLQRLFAQLLLLDQEAASTADLTDSFGWTSNEEMRQHDVQELNRILFSALETSLVGTSGHDLIYRLYHGTIVNQIVCKECKNVSERQEDFLDLTVAVKNVSGLEDALWNMYVEEEVFDCDNLYHCGTCDRLVKAAKSAKLRKLPPFLTVSLLRFNFDFVKCERYKETSCYTFPLRINLKPFCEQSELDDLEYIYDLFSVIIHKGGCYGGHYHVYIKDVDHLGNWQFQEEKSKPDVNLKDLQSEEEIDHPLMILKAILLEENNLIPVDQLGQKLLKKIGISWNKKYRKQHGPLRKFLQLHSQIFLLSSDESTVRLLKNSSLQAESDFQRNDQQIFKMLPPESPGLNNSISCPHWFDINDSKVQPIREKDIEQQFQGKESAYMLFYRKSQLQRPPEARANPRYGVPCHLLNEMDAANIELQTKRAECDSANNTFELHLHLGPQYHFFNGALHPVVSQTESVWDLTFDKRKTLGDLRQSIFQLLEFWEGDMVLSVAKLVPAGLHIYQSLGGDELTLCETEIADGEDIFVWNGVEVGGVHIQTGIDCEPLLLNVLHLDTSSDGEKCCQVIESPHVFPANAEVGTVLTALAIPAGVIFINSAGCPGGEGWTAIPKEDMRKTFREQGLRNGSSILIQDSHDDNSLLTKEEKWVTSMNEIDWLHVKNLCQLESEEKQVKISATVNTMVFDIRIKAIKELKLMKELADNSCLRPIDRNGKLLCPVPDSYTLKEAELKMGSSLGLCLGKAPSSSQLFLFFAMGSDVQPGTEMEIVVEETISVRDCLKLMLKKSGLQGDAWHLRKMDWCYEAGEPLCEEDATLKELLICSGDTLLLIEGQLPPLGFLKVPIWWYQLQGPSGHWESHQDQTNCTSSWGRVWRATSSQGASGNEPAQVSLLYLGDIEISEDATLAELKSQAMTLPPFLEFGVPSPAHLRAWTVERKRPGRLLRTDRQPLREYKLGRRIEICLEPLQKGENLGPQDVLLRTQVRIPGERTYAPALDLVWNAAQGGTAGSLRQRVADFYRLPVEKIEIAKYFPEKFEWLPISSWNQQITKRKKKKKQDYLQGAPYYLKDGDTIGVKNLLIDDDDDFSTIRDDTGKEKQKQRALGRRKSQEALHEQSSYILSSAETPARPRAPETSLSIHVGSFR,mutated_sequence,1.0,1235.0,UPI00004C8020.a2m,UPI00004C8020.npy,gnomAD
+UPI0000072C2E,UPI0000072C2E.csv,MSGGKSAQGPEEGGVCITEALITKRNLTFPEDGELSEKMFHTLDELQTVRLDREGITTIRNLEGLQNLHSLYLQGNKIQQIENLACIPSLRFLSLAGNQIRQVENLLDLPCLQFLDLSENLIETLKLDEFPQSLLILNLSGNSCTNQDGYRELVTEALPLLLDLDGQPVVERWISDEEDEASSDEEFPELSGPFCSERGFLKELEQELSRHREHRQQTALTEHLLRMEMQPTLTDLPLLPGVPMAGDSSPSATPAQGEETVPEAVSSPQASSPTKKPCSLIPRGHQSSFWGRKGARAATAPKASVAEAPSTTKTTAKRSKK,mutated_sequence,1.0,321.0,UPI0000072C2E.a2m,UPI0000072C2E.npy,gnomAD
+UPI0000423C4C,UPI0000423C4C.csv,MKERKRHLGDTKHFCPVVLKENFILQPGNTEEAAKYREKIYYFSSAEAKEKFLEHPEDYVAHEEPLKAPPLRICLVGPQGSGKTMCGRQLAEKLNIFHIQFEEVLQEKLLLKTEKKVGPEFEEDSENEQAAKQELEELAIQANVKVEEENTKKQLPEVQLTEEEEVIKSSLMENEPLPPEILEVILSEWWLKEPIRSTGFILDGFPRYPEEAQFLGDRGFFPDAAVFIQVDDQDIFDRLLPAQIEKWKLKQKKKLERKKLIKDMKAKIRVDTIAKRRAELILERDKKRREPNKEKRKRGISENTTCYKCNHMSPWCEYAGLPYRGLLILYSATATATAGTKHCRSGAKVIVALSKQNIHNQK,mutated_sequence,1.0,362.0,UPI0000423C4C.a2m,UPI0000423C4C.npy,gnomAD
+UPI00001C1E94,UPI00001C1E94.csv,MAKNKLRGPKSRNVFHIASQKNFKAKNKAKPVTTNLKKINIMNEEKVNRVNKAFVNVQKELAHFAKSISLEPLQKELIPQQRHESKPVDEATRLMALL,mutated_sequence,1.0,98.0,UPI00001C1E94.a2m,UPI00001C1E94.npy,gnomAD
+UPI00001B647D,UPI00001B647D.csv,MGPSASSHGSPVPLPSDLSFRSPTPSNLPMVQLWAAHAHEGFSHLPSGLYPSYLHLNHLEPPSSGSPLLSQLGQPSIFDTQKGQGPGGGKTGVRGWGWHRGLLRGFWGLWQHPGSPQLCPGCWLFFFFFF,mutated_sequence,1.0,130.0,UPI00001B647D.a2m,UPI00001B647D.npy,gnomAD
+UPI0000073CB8,UPI0000073CB8.csv,MRTHTRGAPSVFFIYLLCFVSAYITDENPEVMIPFTNANYDSHPMLYFSRAEVAELQLRAASSHEHIAARLTEAVHTMLSSPLEYLPPWDPKDYSARWNEIFGNNLGALAMFCVLYPENIEARDMAKDYMERMAAQPSWLVKDAPWDEVPLAHSLVGFATAYDFLYNYLSKTQQEKFLEVIANASGYMYETSYRRGWGFQYLHNHQPTNCMALLTGSLVLMNQGYLQEAYLWTKQVLTIMEKSLVLLREVTDGSLYEGVAYGSYTTRSLFQYMFLVQRHFNINHFGHPWLKQHFAFMYRTILPGFQRTVAIADSNYNWFYGPESQLVFLDKFVMRNGSGNWLADQIRRNRVVEGPGTPSKGQRWCTLHTEFLWYDGSLKSVPPPDFGTPTLHYFEDWGVVTYGSALPAEINRSFLSFKSGKLGGRAIYDIVHRNKYKDWIKGWRNFNAGHEHPDQNSFTFAPNGVPFITEALYGPKYTFFNNVLMFSPAVSKSCFSPWVGQVTEDCSSKWSKYKHDLAASCQGRVVAAEEKNGVVFIRGEGVGAYNPQLNLKNVQRNLILLHPQLLLLVDQIHLGEESPLETAASFFHNVDVPFEETVVDGVHGAFIRQRDGLYKMYWMDDTGYSEKATFASVTYPRGYPYNGTNYVNVTMHLRSPITRAAYLFIGPSIDVQSFTVHGDSQQLDVFIATSKHAYATYLWTGEATGQSAFAQVIADRHKILFDRNSAIKSSIVPEVKDYAAIVEQNLQHFKPVFQLLEKQILSRVRNTASFRKTAERLLRFSDKRQTEEAIDRIFAISQQQQQQSKSKKNRRAGKRYKFVDAVPDIFAQIEVNEKKIRQKAQILAQKELPIDEDEEMKDLLDFADVTYEKHKNGGLIKGRFGQARMVTTTHSRAPSLSASYTRLFLILNIAIFFVMLAMQLTYFQRAQSLHGQRCLYAVLLIDSCILLWLYSSCSQSQC,mutated_sequence,1.0,958.0,UPI0000073CB8.a2m,UPI0000073CB8.npy,gnomAD
+UPI000013CDC7,UPI000013CDC7.csv,MGAGSSTEQRSPEQPPEGSSTPAEPEPSGGGPSAEAAPDTTADPAIAASDPATKLLQKNGQLSTINGVAEQDELSLQEGDLNGQKGALNGQGALNSQEEEEVIVTEVGQRDSEDVSKRDSDKEMATKSAVVHDITDDGQEETPEIIEQIPSSESNLEELTQPTESQANDIGFKKVFKFVGFKFTVKKDKTEKPDTVQLLTVKKDEGEGAAGAGDHKDPSLGAGEAASKESEPKQSTEKPEETLKREQSHAEISPPAESGQAVEECKEEGEEKQEKEPSKSAESPTSPVTSETGSTFKKFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVDTEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEYEKVELPSEEQVSGSQGPSEEKPAPLATEVFDEKIEVHQEEVVAEVHVSTVEERTEEQKTEVEETAGSVPAEELVEMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGVVSEVEMLSSQERMKVQGSPLKKLFTSTGLKKLSGKKQKGKRGGGDEESGEHTQVPADSPDSQEEQKGESSASSPEEPEEITCLEKGLAEVQQDGEAEEGATSDGEKKREGVTPWASFKKMVTPKKRVRRPSESDKEDELDKVKSATLSSTESTASEMQEEMKGSVEEPKPEEPKRKVDTSVSWEALICVGSSKKRARRGSSSDEEGGPKAMGGDHQKADEAGKDKETGTDGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKSEDSIAGSGVEHSTPDTEPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTGANEDDSDVPAVVPLSEYDAVEREKMEAQQAQKSAEQPEQKAATEVSKELSESQVHMMAAAVADGTRAATIIEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEPLPENREARGDTVVSEAELTPEAVTAAETAGPLGAEEGTEASAAEETTEMVSAVSQLTDSPDTTEEATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTGGPEDVLQPVQRAEAERPEEQAEASGLKKETDVVLKVDAQEAKTEPFTQGKVVGQTTPESFEKAPQVTESIESSELVTTCQAETLAGVKSQEMVMEQAIPPDSVETPTDSETDGSTPVADFDAPGTTQKDEIVEIHEENEVASGTQSGGTEAEAVPAQKERPPAPSSFVFQEETKEQSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYADEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEEAECKKDDALELQSHAKSPPSPVEREMVVQVEREKTEAEPTHVNEEKLEHETAVTVSEEVSKQLLQTVNVPIIDGAKEVSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKVLGETANILETGETLEPAGAHLVLEEKSSEKNEDFAAHPGEDAVPTGPDCQAKSTPVIVSATTKKGLSSDLEGEKTTSLKWKSDEVDEQVACQEVKVSVAIEDLEPENGILELETKSSKLVQNIIQTAVDQFVRTEETATEMLTSELQTQAHVIKADSQDAGQETEKEGEEPQASAQDETPITSAKEESESTAVGQAHSDISKDMSEASEKTMTVEVEGSTVNDQQLEEVVLPSEEEGGGAGTKSVPEDDGHALLAERIEKSLVEPKEDEKGDDVDDPENQNSALADTDASGGLTKESPDTNGPKQKEKEDAQEVELQEGKVHSESDKAITPQAQEELQKQERESAKSELTES,mutated_sequence,1.0,1782.0,UPI000013CDC7.a2m,UPI000013CDC7.npy,gnomAD
+UPI0001A48FC8,UPI0001A48FC8.csv,MTSQEKTEEYPFADIFDEDETERNFLLSKPVCFVVFGKPGVGKTTLARYITQAWKCIRVEALPILEEQIAAETESGVMLQSMLISGQSIPDELVIKLMLEKLNSPEVCHFGYIITEIPSLSQDAMTTLQQIELIKNLNLKPDVIINIKCPDYDLCQRISGQRQHNNTGYIYSRDQWDPEVIENHRKKKKEAQKDGKGEEEEEEEEQEEEEAFIAEMQMVAEILHHLVQRPEDYLENVENIVKLYKETILQTLEEVMAEHNPQYLIELNGNKPAEELFMIVMDRLKYLNLKRAAILTKLQGAEEEINDTMENDELFRTLASYKLIAPRYRWQRSKWGRTCPVNLKDGNIYSGLPDYSVSFLGKIYCLSSEEALKPFLLNPRPYLLPPMPGPPCKVFILGPQYSGKTTLCNMLAENYKGKVVDYAQLVQPRFDKARETLVENTIAEATAAAIKVVKEKLLRELQARKQAETALREFQRQYEKMEFGVFPMEATHSSIDEEGYIQGSQRDRGSSLVDTEEAKTKSENVLHDQAAKVDKDDGKETGETFTFKRHSQDASQDVKLYSDTAPTEDLIEEVTADHPEVVTMIEETIKMSQDINFEQPYEKHAEILQEVLGEVMEENKDRFPGAPKYGGWIVDNCPIVKELWMALIKKGIIPDLVIYLSDTENNGKCLFNRIYLQKKSEIDSKILERLLEELQKKKKEEEEARKATEEELRLEEENRRLLELMKVKAKEAEETDNEDEEEIEGDELEVHEEPEASHDTRGSWLPEEFEASEVPETEPEAVSEPIEETTVETEIPKGSKEGLEIEKLSETVVLPEFPEDSYPDVPEMEPFKEKIGSFIILWKQLEATISEAYIKILNLEIADRTPQELLQKVVETMEKPFQYTAWELTGEDYEEETEDYQTEAEVDEELEEEEEEEGEDKMKERKRHLGDTKHFCPVVLKENFILQPGNTEEAAKYREKIYYFSSAEAKEKFLEHPEDYVAHEEPLKAPPLRICLVGPQGSGKTMCGRQLAEKLNIFHIQFEEVLQEKLLLKTEKKVGPEFEEDSENEQAAKQELEELAIQANVKVEEENTKKQLPEVQLTEEEEVIKSSLMENEPLPPEILEVILSEWWLKEPIRSTGFILDGFPRYPEEAQFLGDRGFFPDAAVFIQVDDQDIFDRLLPAQIEKWKLKQKKKLERKKLIKDMKAKIRVDTIAKRRAELILERDKKRRENVVRDDEEISEEELEEDNDDIENILEDEFPKDEEEMSGEEDEEQETDAIERLRGELGEKFEADTHNLQIIQDELERYLIPIISINGARRNHIVQYTLNMKLKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPIKYIRQPKPKPTVPIRIIIVGPPKSGKTTVAKKITSEYGLKHLSIGGALRYVLNNHPETELALMLNWHLHKGMTAPDELAIQALELSLMESVCNTAGVVIDGYPVTKHQMNLLEARSIIPMVIFELSVPSKEIFKRLLLEKENEQRLPYPLHNSAQIVAVNNVKYRKNIGEIRQYYQEQHQNWYVIDGFHSKWWVWNEVIKNVQMVNKYMQTYLERIKAGKAACIDKLCITPQELLSRLGEFEQFCPVSLAESQELFDCSATDSLEFAAEFRGHYYKMSSQEKLNKFLENPELYVPPLAPHPLPSADMIPKRLTLSELKSRFPKCAELQGYCPVTYKDGNQRYEALVPGSINYALEYHNRIYICENKEKLQKFLRSPLKYWEQKLPHKLPPLREPILLTSLPLPGYLEQGIATSLIKAMNAAGCLKPKFPFLSIRRSALLYIALHLKAFNPKGSEYTRKKYKKKMEQFMESCELITYLGAKMTRKYKEPQFRAIDFDHKLKTFLSLRNIDPING,mutated_sequence,1.0,1911.0,UPI0001A48FC8.a2m,UPI0001A48FC8.npy,gnomAD
+UPI000195170D,UPI000195170D.csv,MSPPRLSSSLGAHLPCIVPSPVLDCIRPRLLRKWDLGSWPAPGLSAPHRHCKSLLSDFRDTVSVTRGEIEVWSYEALTVTSFQTKDFLRNIFLQITPSVT,mutated_sequence,,,UPI000195170D.a2m,UPI000195170D.npy,gnomAD
+UPI0000141917,UPI0000141917.csv,MAGWPGAGPLCVLGGAALGVCLAGVAGQLVEPSTAPPKPKPPPLTKETVVFWDMRLWHVVGIFSLFVLSIIITLCCVFNCRVPRTRKEIEARYLQRKAAKMYTDKLETVPPLNELTEVPGEDKKKKKKKKKDSVDTVAIKVEEDEKNEAKKKKGEK,mutated_sequence,1.0,156.0,UPI0000141917.a2m,UPI0000141917.npy,gnomAD
+UPI000013DFEA,UPI000013DFEA.csv,MEGAALLRVSVLCIWMSALFLGVGVRAEEAGARVQQNVPSGTDTGDPQSKPLGDWAAGTMDPESSIFIEDAIKYFKEKVSTQNLLLLLTDNEAWNGFVAAAELPRNEADELRKALDNLARQMIMKDKNWHDKGQQYRNWFLKEFPRLKSELEDNIRRLRALADGVQKVHKGTTIANVVSGSLSISSGILTLVGMGLAPFTEGGSLVLLEPGMELGITAALTGITSSTMDYGKKWWTQAQAHDLVIKSLDKLKEVREFLGENISNFLSLAGNTYQLTRGIGKDIRALRRARANLQSVPHASASRPRVTEPISAESGEQVERVNEPSILEMSRGVKLTDVAPVSFFLVLDVVYLVYESKHLHEGAKSETAEELKKVAQELEEKLNILNNNYKILQADQEL,mutated_sequence,1.0,398.0,UPI000013DFEA.a2m,UPI000013DFEA.npy,gnomAD
+UPI00004700E1,UPI00004700E1.csv,MSSPLASLSKTRKVPLPSEPMNPGRRGIRIYGDEDEVDMLSDGCGSEEKISVPSCYGGIGAPVSRQVPASHDSELMAFMTRKLWDLEQQVKAQTDEILSKDQKIAALEDLVQTLRPHPAEATLQRQEELETMCVQLQRQVREMERFLSDYGLQWVGEPMDQEDSESKTVSEHGERDWMTAKKFWKPGDSLAPPEVDFDRLLASLQDLSELVVEGDTQVTPVPGGARLRTLEPIPLKLYRNGIMMFDGPFQPFYDPSTQRCLRDILDGFFPSELQRLYPNGVPFKVSDLRNQVYLEDGLDPFPGEGRVVGRQLMHKALDRVEEHPGSRMTAEKFLNRLPKFVIRQGEVIDIRGPIRDTLQNCCPLPARIQEIVVETPTLAAERERSQESPNTPAPPLSMLRIKSENGEQAFLLMMQPDNTIGDVRALLAQARVMDASAFEIFSTFPPTLYQDDTLTLQAAGLVPKAALLLRARRAPKSSLKFSPGPCPGPGPGPSPGPGPGPSPGPGPGPSPCPGPSPSPQ,mutated_sequence,1.0,520.0,UPI00004700E1.a2m,UPI00004700E1.npy,gnomAD
+UPI000050ED33,UPI000050ED33.csv,MRNSEEQPSGGTTVLQRLLQEQLRYGNPSENRSLLAIHQQATGNGPPFPSGSGNPGPQSDVLSPQDHHQQLVAHAARQEPQGQEIQSENLIMEKQLSPRMQNNEELPTYEEAKVQSQYFRGQQHASVGAAFYVTGVTNQKMRTEGRPSVQRLNPGKMHQDEGLRDLKQGHVRSLSERLMQMSLATSGVKAHPPVTSAPLSPPQPNDLYKNPTSSSEFYKAQGPLPNQHSLKGMEHRGPPPEYPFKGMPPQSVVCKPQEPGHFYSEHRLNQPGRTEGQLMRYQHPPEYGAARPAQDISLPLSARNSQPHSPTSSLTSGGSLPLLQSPPSTRLSPARHPLVPNQGDHSAHLPRPQQHFLPNQAHQGDHYRLSQPGLSQQQQQQQQQHHHHHHHQQQQQQQPQQQPGEAYSAMPRAQPSSASYQPVPADPFAIVSRAQQMVEILSDENRNLRQELEGCYEKVARLQKVETEIQRVSEAYENLVKSSSKREALEKAMRNKLEGEIRRMHDFNRDLRERLETANKQLAEKEYEGSEDTRKTISQLFAKNKESQREKEKLEAELATARSTNEDQRRHIEIRDQALSNAQAKVVKLEEELKKKQVYVDKVEKMQQALVQLQAACEKREQLEHRLRTRLERELESLRIQQRQGNCQPTNVSEYNAAALMELLREKEERILALEADMTKWEQKYLEENVMRHFALDAAATVAAQRDTTVISHSPNTSYDTALEARIQKEEEEILMANKRCLDMEGRIKTLHAQIIEKDAMIKVLQQRSRKEPSKTEQLSCMRPAKSLMSISNAGSGLLSHSSTLTGSPIMEEKRDDKSWKGSLGILLGGDYRAEYVPSTPSPVPPSTPLLSAHSKTGSRDCSTQTERGTESNKTAAVAPISVPAPVAAAATAAAITATAATITTTMVAAAPVAVAAAAAPAAAAAPSPATAAATAAAVSPAAAGQIPAAASVASAAAVAPSAAAAAAVQVAPAAPAPVPAPALVPVPAPAAAQASAPAQTQAPTSAPAVAPTPAPTPTPAVAQAEVPASPATGPGPHRLSIPSLTCNPDKTDGPVFHSNTLERKTPIQILGQEPDAEMVEYLI,mutated_sequence,1.0,1084.0,UPI000050ED33.a2m,UPI000050ED33.npy,gnomAD
+UPI000004E7FC,UPI000004E7FC.csv,MFPFYSCWRTGLLLLLLAVAVRESWQTEEKTCDLVGEKGKESEKELALVKRLKPLFNKSFESTVGQGSDTYIYIFRVCREAGNHTSGAGLVQINKSNGKETVVGRLNETHIFNGSNWIMLIYKGGDEYDNHCGKEQRRAVVMISCNRHTLADNFNPVSEERGKVQDCFYLFEMDSSLACSPEISHLSVGSILLVTFASLVAVYVVGGFLYQRLVVGAKGMEQFPHLAFWQDLGNLVADGCDFVCRSKPRNVPAAYRGVGDDQLGEESEERDDHLLPM,mutated_sequence,1.0,277.0,UPI000004E7FC.a2m,UPI000004E7FC.npy,gnomAD
+UPI000204A77C,UPI000204A77C.csv,TLVNNVRLPRGHRLELSDGDLLTFGPEGPPGTSPSEFYFMFQQVRVKPQDFAAITIPRSRGEARVGAGFRPMLPSQGAPQRPLSTFSPAPKATLILNSIGSLSKLRPQPLTFSPSWGGPKSLPVPAPPGEMGTTPSAPPQRNRRKSVHRVLAELDDESEPPENPPPVLMEPRKKLRVDKAPLTPTGNWHIQGMQSIVKGQKERPWPREHLSGYEEAS,mutated_sequence,1.0,217.0,UPI000204A77C.a2m,UPI000204A77C.npy,gnomAD
+UPI0001662BC1,UPI0001662BC1.csv,MSSPRLIPLWKDLKLLLNDTINKSKQPSEDPKNCLIVLSDRSQAVAWMKSKTEDMVEKRTFSMTERLPPIQSMVHAGSFHILVVYCGDLILRLFGDHFRAFKPLGKVPCRFNISCLCYDPEMKMLLSGILGAVVTWVIELGGTGLQIAHMVSMPGDELVQDIVLNGPSGSLLALCETVVRVLMHQGKGQLGEVKRFTSTSSGSSITCCFTCFDQGFLYAGNQAGEIQVWSLQQGHPLHSFQAHQSGVICIRSRPEAHTLLTAGSDSLIKEWNLTSGSLLRRLELGEELYRLQFIDSITFFCQTAHSFSLHRLPCFYSLFNVCGSAPQQLRRVCCGNNWFRILCTTEDGLLRFVSPVTGDLLVITWPFSILDQAVDWAYDPGKEELFVATGSSEVLVFDTTRCPCPAKYLLGTSPNSQDFVQCLAYGHFNLGRGLEGLIFSGHQSGVIRVLSQHSCARLEKFMHFGAVLALSTLSGGIFGGQGNSLLCSYGMDDYVHLSEAVLDGVKVQLRPLASILSSCHLTHLILLPKSVGAITETNCLRLWKFHDFLSSGSQNGLKFIETLPLHLCAITSFDVCLSLSLFVTGSADGSVRIWDFHGRLIGILDSSLHFGPVCFANDRGDLLVTFNQSLYLVSCLKLLPPALLTRLSFMSISDEVLEVPKPFIPSFFFSFETMFVPKYIYPGQAQQKLVGLEKLVNNRAIAFDHSVPHVIEEDEEGSPVLLRSSMHYSLQDMEDWMQVSKRYQCHYVLPPQLQLTSWDGLNPYQILRYYFGHGREWLFAPDCYIPNSVIRARLWPEGTPIYLQCNLHAPQRELEWDRSQEFFFWHSRVRAISNTEYPKNKEEDEHFLEMRLSKDVTYSVLTDGANRSWLGRKMSEITINSMIETMLNIMVHASLLKYQCCVGALGQIFASYQVSPALRSETARRLLNDTTNSNPLIRELAWEGLKRLGMITHLFAMPLAQGLMDKDERVRIKTLSLMAEIGIHSRTSLLQLTQKQETFREMQQQMIGEEPLDHLLGMRATDLQILSTQVEQRLNENLTLSHRDEKPAFSLDVSMPSELKSSLKPPTVSEESEVAIKPSKGQRRGQAGVKKHSQKWLRGLKKTKERDSKQMSTEPGLLEDESGTEAAPIEMEEASVYSQWSSSTSVIKLSKDVDSQEKDISKDHIALTLKRLQKIRDKRDKKATAQKLKKKHKKKGKEAKVINEETTPPVMEQPVTKKVKIQGRGASGISGRRSTAGDGSSWRDDLCRLMALRISGSQTKMSENLNAELVTFAQEMLVDRHPSWELFQEICPLLKKESKVLLEDLDWDVVPPEKKPIFIQEGAIREDMIQGVTQEVIRHKEVMPREEEQAQKKARDMLGLEETQVILKKGKKVIFLEPGNVTMGKEISKKEEKKTFQKSPKQGRKAVQKERKVGKIKREMTKEERDMSEEVEEMATLEEKVVKQEGKLVMIERTPSWQDWKKAWDEWKQVHGETRKSWKAWKEEWEKRLLQEEEKLHQAGEKLSPEEEMLQEDKKLKWEEWKQVWENMLSSKSKEQQYKDEEEVTLEEEVSREGEEKEQQVTEEQRHIQEEHKWARIHRKRARAEKKRAQEERKLAQEEEKLAQEERQLAQEERKLAQAYVKITQDDREMAQAEGKFAQKEETLAQRGEKLSQEAEKLAQKRKKLAKKWEKVAREEEKLAKKGGKLAEVKNILAQKVEELPQREQNLDWQEKELAQELEELEWDMEELSWKEEELNQEEGKLVEEKKKLAEEEEALAWQREKLSEEETKLAQEEELLIQEKEKLAQHKEKMPEEEERLGRKREQLIEKKMKLAQKRERWINSMEELTKNKMILYQKKNLAQEKKNLAQEKEKLAQRKENLLYNKERLTHSKKQLVQVKNKLGMFNKILAQVEEKLTQEKETVIKKKEKLAETEKKLVQVEDSLAKKQEKLAQEKMKLALEKAMVQGKKRLRGELDIAKEEKALNLEMKRLAEEKMRLVEGKETLSKGETPETSRQRKMTQVEQELFERKLSLEEKILLHEDRILAMEESEIAKGKLEFTRGQRIFVQGQRKLAKASRKLIKKRESLSKEPAKLNKILKALQKLTRDERKLTQEEIKMTKMKRALFVKERRLSIEQSKLDIKEWDFSEKRSELTKDEKKLARKQRKLANKMRRMINKEEKMTEEESKLARKHSEVILDDEEEGGIEEEEVIPFLKRRWRKRKEAKRGDKPKEKFSSQVDEVESEEHFSEEMESLLDELEKQESLSSEEEEEREEEEEREEEEVREEEEERKEEEEGEEKQVEKEEEEKKKKKKEKKKEEVQEKEEVFEEKEEIMSEEETESLSDEEEEEESCSLEEEVDREKEILKKEKQFKLQEQRRKSLRGRERVLSILRGVPHGKGRAIRLGVLKSPLKKLMSTALEMKEKTPVPVPEKQISWEDKKATVVEIPRKFLGTMDKEREVMGKYEPIPPHVLGTVLESQAQDLKTPFMSHILRRTVEAEELQHKPLGAWWKWFLQHPPLMGQTEVQLPLSQIPAKEQHADVSLSDVEWIRHVLERMEAGEQLSRDGFHRLCQLLKDLASKGNLEWLHLAKHEAIVYRHRQALESQDTRISSRQSMSPKYLKVIPPIKAKEKESWPKPLAVPTQKSPLATKRIPDPRAKNWHLLGEPYRSERAQQISIAHKEMEMQYFYPATRDIFPSAHASVEKQTLALMFQKDFWDFKDKRRFPKLPKLEKKTQPISKKKEELPLWETFVALYHVLRMLQQRYPKDSTAWMEQFYQLMDLYQLKSPRIQKLLQELLMREEPQPQEIIYEEALKATELVPGERLFCCLFCGSSHTPRSPQEFQGAVPLPWQNCVRTILPVGIARYGILELAWKSLPEADLHLTKALTHTVAPTL,mutated_sequence,1.0,2873.0,UPI0001662BC1.a2m,UPI0001662BC1.npy,gnomAD
+UPI0000167B08,UPI0000167B08.csv,MEVQLGLGRVYPRPPSKTYRGAFQNLFQSVREVIQNPGPRHPEAASAAPPGASLLLLQQQQQQQQQQQQQQQQQQQQQQQETSPRQQQQQQGEDGSPQAHRRGPTGYLVLDEEQQPSQPQSALECHPERGCVPEPGAAVAASKGLPQQLPAPPDEDDSAAPSTLSLLGPTFPGLSSCSADLKDILSEASTMQLLQQQQQEAVSEGSSSGRAREASGAPTSSKDNYLGGTSTISDNAKELCKAVSVSMGLGVEALEHLSPGEQLRGDCMYAPLLGVPPAVRPTPCAPLAECKGSLLDDSAGKSTEDTAEYSPFKGGYTKGLEGESLGCSGSAAAGSSGTLELPSTLSLYKSGALDEAAAYQSRDYYNFPLALAGPPPPPPPPHPHARIKLENPLDYGSAWAAAAAQCRYGDLASLHGAGAAGPGSGSPSAAASSSWHTLFTAEEGQLYGPCGGGGGGGGGGGGGGGGGGGGGGGEAGAVAPYGYTRPPQGLAGQESDFTAPDVWYPGGMVSRVPYPSPTCVKSEMGPWMDSYSGPYGDMRLETARDHVLPIDYYFPPQKTCLICGDEASGCHYGALTCGSCKVFFKRAAEGKQKYLCASRNDCTIDKFRRKNCPSCRLRKCYEAGMTLGARKLKKLGNLKLQEEGEASSTTSPTEETTQKLTVSHIEGYECQPIFLNVLEAIEPGVVCAGHDNNQPDSFAALLSSLNELGERQLVHVVKWAKALPGFRNLHVDDQMAVIQYSWMGLMVFAMGWRSFTNVNSRMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRFYQLTKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSGKVKPIYFHTQ,mutated_sequence,1.0,920.0,UPI0000167B08.a2m,UPI0000167B08.npy,gnomAD
+UPI0001EE4B77,UPI0001EE4B77.csv,MAVQISKKRKFVADGIFKAELNEFLTRELAEDGYSGVEVRVTPTRTEIIILATRTQNVLGEKGRRIRELTAVVQKRFGFPEGSVELKIMVMVTGYPLLPLKLYAEKVATRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFIMESGAKGCEVVVSGKLRGQRAKSMKFVDGLMIHSGDPVNYYVDTAVRHVLLRQGVLGIKVKIMLPWDPTGKIGPKKPLPDHVSIVEPKDEILPTTPISEQKGGKPEPPAMPQPVPTA,mutated_sequence,1.0,259.0,UPI0001EE4B77.a2m,UPI0001EE4B77.npy,gnomAD
+UPI000006FC41,UPI000006FC41.csv,MKKIFSKKGESPLGSFARRQRSSAGGGGEPGEGAYSQPGYHVRDRDLGKIHKAASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAVDKLESSHQLISEYKEERIPKHSSQNSNSVDESSEDSLSRLSGKPGVDDSWPTSDDEDLNFDTKNVPKPSLAKLMTASQQSRKNLEATYGTVRTGNRTLFEDRDSDSQDEVVVESLPTTSIKVQCFSHPTYQSPDLLPKPSHKSLANPGLMKEEPTKPGIAKKENGIDIIESAPLEQTNNDNLTYVDEVHKNNRSDMMSALGLGQEEDIESPWDSESISENFPQKYVDPLAGAADGKEKNIGNEQAEDVFYIPSCMSGSRNFKMAKLEDTRNVGMPVAHMESPERYLHLKPTIEMKDSVPNKAGGMKDVQTSKAAEHDLEVASEEEQEREGSENNQPQVEEERKKHRNNEMEVSANIHDGATDDAEDDDDDDGLIQKRKSGETDHQQFPRKENKEYASSGPALQMKEVKSTEKEKRTSKESVNSPVFGKASLLTGGLLQVDDDSSLSEIDEDEGRPTKKTSNEKNKVKNQIQSMDDVDDLTQSSETASEDCELPHSSYKNFMLLIEQLGMECKDSVSLLKIQDAALSCERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEVNLFLQAQAASQENLEQFRENNFASMKSQMELRIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSRSLFTTLTTRPVMEPPCVGNLNNSLDLNRKLIPRENLVISTSNPRASNNSMENYLSKMQQELEKNITRELKEAAAELESGSIASPLGSTDESNLNQDLVWKASREYVQVLKKNYMI,mutated_sequence,1.0,1710.0,UPI000006FC41.a2m,UPI000006FC41.npy,gnomAD
+UPI0000070AF2,UPI0000070AF2.csv,MTCGFNSIGCGFRPGNFSCVSACGPRPSRCCITAAPYRGISCYRGLTGGFGSHSVCGGFRAGSCGRSFGYRSGGVCGPSPPCITTVSVNESLLTPLNLEIDPNAQCVKQEEKEQIKSLNSRFAAFIDKVRFLEQQNKLLETKLQFYQNRECCQSNLEPLFAGYIETLRREAECVEADSGRLASELNHVQEVLEGYKKKYEEEVALRATAENEFVALKKDVDCAYLRKSDLEANVEALIQEIDFLRRLYEEEIRILQSHISDTSVVVKLDNSRDLNMDCIVAEIKAQYDDIATRSRAEAESWYRSKCEEMKATVIRHGETLRRTKEEINELNRMIQRLTAEVENAKCQNSKLEAAVAQSEQQGEAALSDARCKLAELEGALQKAKQDMACLIREYQEVMNSKLGLDIEIATYRRLLEGEEQRLCEGVEAVNVCVSSSRGGVVCGDLCVSGSRPVTGSVCSAPCNGNLVVSTGLCKPCGQLNTTCGGGSCGQGRH,mutated_sequence,1.0,493.0,UPI0000070AF2.a2m,UPI0000070AF2.npy,gnomAD
+UPI000034ECE0,UPI000034ECE0.csv,MSLRDKGGEEECFEYDCQDEERKPTHRQHDTQDLLEEVLCAERVGQMTKTYNDIDAVTRLLEEKERDLELAARIGQSLLKKNKTLTERNELLEEQVEHIREEVSQLRHELSMKDELLQFYTSAAEESEPESVCSTPLKRNESSSSVQNYFHLDSLQKKLKDLEEENVVLRSEASQLKTETITYEEKEQQLVNDCVKELRDANVQIASISEELAKKTEDAARQQEEITHLLSQIVDLQKKAKACAVENEELVQHLGAAKDAQRQLTAELRELEDKYAECMEMLHEAQEELKNLRNKTMPNTTSRRYHSLGLFPMDSLAAEIEGTMRKELQLEEAESPDITHQKRVFETVRNINQVVKQRSLTPSPMNIPGSNQSSAMNSLLSSCVSTPRSSFYGSDIGNVVLDNKTNSIILETEAADLGNDERSKKPGTPGTPGSHDLETALRRLSLRRENYLSERRFFEEEQERKLQELAEKGELRSGSLTPTESIMSLGTHSRFSEFTGFSGMSFSSRSYLPEKLQIVKPLEGSATLHHWQQLAQPHLGGILDPRPGVVTKGFRTLDVDLDEVYCLNDFEEDDTGDHISLPRLATSTPVQHPETSGERSQARVTVSGSRSYPSRPQASPEEMQEPPAATEEEEEEEEEEGSGEGTTISPVNLAPFPEAEFWAILTSVPGTIRSGSLSVASARLCG,mutated_sequence,1.0,686.0,UPI000034ECE0.a2m,UPI000034ECE0.npy,gnomAD
+UPI000058E2EA,UPI000058E2EA.csv,MAHNAGAAAAAGTHSAKSGGSEAALKEGGSAAALSSSSSSSAAAAAASSSSSSGPGSAMETGLLPNHKLKTVGEAPAAPPHQQHHHHHHAHHHHHHAHHLHHHHALQQQLNQFQQQQQQQQQQQQQQQQQQHPISNNNSLGGAGGGAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAGGRYEHPGLGALGTQQPPVAVPGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAPGSMDPLQNSHEGYPNSQCNHYPGYSRPGAGGGGGGGGGGGGGSGGGGGGGGAGAGGAGAGAVAAAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLTSPSPMMRSYGGSYPEYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQYAAASPAWAAAQQRSHPAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGSNPHSQPQQSSPYPGGSYGPPGPQRYPIGIQGRTPGAMAGMQYPQQQMPPQYGQQGVSGYCQQGQQPYYSQQPQPPHLPPQAQYLPSQSQQRYQPQQDMSQEGYGTRSQPPLAPGKPNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISSFQQSNSSGTYGPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQPCGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFPGMNQSGLMASSSPYSQPMNNSSSLMNTQAPPYSMAPAMVNSSAASVGLADMMSPGESKLPLPLKADGKEEGTPQPESKSKKSSSSTTTGEKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFECKIERGEEPPPEVFSTGDTKKQPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPGDLKPPTPASTPHGQMTPMQGGRSSTISVHDPFSDVSDSSFPKRNSMTPNAPYQQGMSMPDVMGRMPYEPNKDPFGGMRKVPGSSEPFMTQGQMPNSSMQDMYNQSPSGAMSNLGMGQRQQFPYGASYDRRHEPYGQQYPGQGPPSGQPPYGGHQPGLYPQQPNYKRHMDGMYGPPAKRHEGDMYNMQYSSQQQEMYNQYGGSYSGPDRRPIQGQYPYPYSRERMQGPGQIQTHGIPPQMMGGPLQSSSSEGPQQNMWAARNDMPYPYQNRQGPGGPTQAPPYPGMNRTDDMMVPDQRINHESQWPSHVSQRQPYMSSSASMQPITRPPQPSYQTPPSLPNHISRAPSPASFQRSLENRMSPSKSPFLPSMKMQKVMPTVPTSQVTGPPPQPPPIRREITFPPGSVEASQPVLKQRRKITSKDIVTPEAWRVMMSLKSGLLAESTWALDTINILLYDDSTVATFNLSQLSGFLELLVEYFRKCLIDIFGILMEYEVGDPSQKALDHNAARKDDSQSLADDSGKEEEDAECIDDDEEDEEDEEEDSEKTESDEKSSIALTAPDAAADPKEKPKQASKFDKLPIKIVKKNNLFVVDRSDKLGRVQEFNSGLLHWQLGGGDTTEHIQTHFESKMEIPPRRRPPPPLSSAGRKKEQEGKGDSEEQQEKSIIATIDDVLSARPGALPEDANPGPQTESSKFPFGIQQAKSHRNIKLLEDEPRSRDETPLCTIAHWQDSLAKRCICVSNIVRSLSFVPGNDAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEVLRDNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLSIQDNNVDLILATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDALAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNLMHMQPPPLEPPSVDMMCRAAKALLAMARVDENRSEFLLHEGRLLDISISAVLNSLVASVICDVLFQIGQL,mutated_sequence,1.0,2236.0,UPI000058E2EA.a2m,UPI000058E2EA.npy,gnomAD
+UPI00004C2CAA,UPI00004C2CAA.csv,MAAAAAAATTAACSSGSAGTDAAGASGLQQPPPQPQPQPAAAAPAQPPPEPPRKPRMDPRRRQAALSFLTNISLDGRLPPQDAEWGGGEEGGAAKPGAGGACGARTRFSLLAAAERGGCIALAAPGTPAAGLAAGSGPCLPQPSSLPPLIPGGHATVSGPGVARGFASPLGAGRASGEQWQPPRPAPLAACAQLQLLDGSGAAGQEELEEDDAFISVQVPAAAFLGSGTPGSGSGSRGRLNSFTQGILPIAFSRPTSQNYCSLEQPGQGGSTSAFEQLQRSRRRLISQRSSLETLEDIEENAPLRRCRTLSGSPRPKNFKKIHFIKNMRQHDTRNGRIVLISGRRSFCSIFSVLPYRDSTQVGDLKLDGGRQSTGAVSLKEIIGLEGVELGADGKTVSYTQFLLPTNAFGARRNTIDSTSSFSQFRNLSHRSLSIGRASGTQGSLDTGSDLGDFMDYDPNLLDDPQWPCGKHKRVLIFPSYMTTVIDYVKPSDLKKDMNETFKEKFPHIKLTLSKIRSLKREMRKLAQEDCGLEEPTVAMAFVYFEKLALKGKLNKQNRKLCAGACVLLAAKIGSDLKKHEVKHLIDKLEEKFRLNRRELIAFEFPVLVALEFALHLPEHEVMPHYRRLVQSS,mutated_sequence,1.0,633.0,UPI00004C2CAA.a2m,UPI00004C2CAA.npy,gnomAD
+UPI0000072F87,UPI0000072F87.csv,MESRKLISATDIQYSGSLLNSLNEQRGHGLFCDVTVIVEDRKFRAHKNILSASSTYFHQLFSVAGQVVELSFIRAEIFAEILNYIYSSKIVRVRSDLLDELIKSGQLLGVKFIAELGVPLSQVKSISGTAQDGNTEPLPPDSGDKNLVIQKSKDEAQDNGATIMPIITESFSLSAEDYEMKKIIVTDSDDDDDDVIFCSEILPTKETLPSNNTVAQVQSNPGPVAISDVAPSASNNSPPLTNITPTQKLPTPVNQATLSQTQGSEKLLVSSAPTHLTPNIILLNQTPLSTPPNVSSSLPNHMPSSINLLVQNQQTPNSAILTGNKANEEEEEEIIDDDDDTISSSPDSAVSNTSLVPQADTSQNTSFDGSLIQKMQIPTLLQEPLSNSLKISDIITRNTNDPGVGSKHLMEGQKIITLDTATEIEGLSTGCKVYANIGEDTYDIVIPVKDDPDEGEARLENEIPKTSGSEMANKRMKVKHDDHYELIVDGRVYYICIVCKRSYVCLTSLRRHFNIHSWEKKYPCRYCEKVFPLAEYRTKHEIHHTGERRYQCLACGKSFINYQFMSSHIKSVHSQDPSGDSKLYRLHPCRSLQIRQYAYLSDRSSTIPAMKDDGIGYKVDTGKEPPVGTTTSTQNKPMTWEDIFIQQENDSIFKQNVTDGSTEFEFIIPESY,mutated_sequence,1.0,672.0,UPI0000072F87.a2m,UPI0000072F87.npy,gnomAD
+UPI000161137B,UPI000161137B.csv,MSPDVPLLNDYKQDFFLKRFPQTVLGGPRFKLGYCAPPYIYVNQIILFLMPWVWGGVGTLLYQLGILKDYYTAALSGGLMLFTAFVIQFTSLYAKNKSTTVERILTTDILAEEDEHEFTSCTGAETVKFLIPGKKYVANTVFHSILAGLACGLGTWYLLPNRITLLYGSTGGTALLFFFGWMTLCIAEYSLIVNTATETATFQTQDTYEIIPLMRPLYIFFFVSVDLAHRFVVNMPALEHMNQILHILFVFLPFLWALGTLPPPDALLLWAMEQVLEFGLGGSSMSTHLRLLVMFIMSAGTAIASYFIPSTVGVVLFMTGFGFLLSLNLSDMGHKIGTKSKDLPSGPEKHFSWKECLFYIIILVLALLETSLLHHFAGFSQISKSNSQAIVGYGLMILLIILWILREIQSVYIIGIFRNPFYPKDVQTVTVFFEKQTRLMKIGIVRRILLTLVSPFAMIAFLSLDSSLQGLHSVSVCIGFTRAFRMVWQNTENALLETVIVSTVHLISSTDIWWNRSLDTGLRLLLVGIIRDRLIQFISKLQFAVTVLLTSWTEKKQRRKTTATLCILNIVFSPFVLVIIVFSTLLSSPLLPLFTLPVFLVGFPRPIQSWPGAAGTTACVCADTVYYYQMVPRLTAVLQTAMAAGSLGLLLPGSHYLGRFQDRLMWIMILECGYTYCSINIKGLELQETSCHTAEARRVDEVFEDAFEQEYTRVCSLNEHFGNVLTPCTVLPVKLYSDARNVLSGIIDSHENLKEFKGDLIKVLVWILVQYCSKRPGMKENVHNTENKGKAPLMLPALNTLPPPKSPEDIDSLNSETFNDWSDDNIFDDEPTIKKVIEEKHQLKDLPGTNLFIPGSVESQRVGDHSTGTVPENDLYKAVLLGYPAVDKGKQEDMPYIPLMEFSCSHSHLVCLPAEWRTSCMPSSKMKEMSSLFPEDWYQFVLRQLECYHSEEKASNVLEEIAKDKVLKDFYVHTVMTCYFSLFGIDNMAPSPGHILRVYGGVLPWSVALDWLTEKPELFQLALKAFRYTLKLMIDKASLGPIEDFRELIKYLEEYERDWYIGLVSDEKWKEAILQEKPYLFSLGYDSNMGIYTGRVLSLQELLIQVGKLNPEAVRGQWANLSWELLYATNDDEERYSIQAHPLLLRNLTVQAAEPPLGYPIYSSKPLHIHLY,mutated_sequence,1.0,1172.0,UPI000161137B.a2m,UPI000161137B.npy,gnomAD
+UPI00001294AC,UPI00001294AC.csv,MTGVFDSLVADMHSTQIAASSTYHQHQQPPSGGGAGPGGNSSSSSSLHKPQESPTLPVSTATDSSYYTNQQHPAGGGGGGGSPYAHMGSYQYQASGLNNVPYSAKSSYDLGYTAAYTSYAPYGTSSSPANNEPEKEDLEPEIRIVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQVKIWFQNRRSKFKKMWKSGEIPSEQHPGASASPPCASPPVSAPASWDFGVPQRMAGGGGPGSGGSGAGSSGSSPSSAASAFLGNYPWYHQTSGSASHLQATAPLLHPTQTPQPHHHHHHHGGGGAPVSAGTIF,mutated_sequence,1.0,328.0,UPI00001294AC.a2m,UPI00001294AC.npy,gnomAD
+UPI00001B6532,UPI00001B6532.csv,MEVKGPSGRSFCCESEGQFKSCLKRHTPSLLLPSSWKGNSGSCLMAKALHRMSPTPNSCPLPLPLCRMSGVLCSRNLFTFKFSLFQLDSGASGEPGHSLGLTLGFSHCGNCQTAVVSAQPEGMASNGAYPALGPGVTANPGTSLSVFTALPFTTPAPGPAHGPLLVTAGAPPGGPLVLSTLPSTPLVTEQDGCGPSGAGASNVFVQMRTEVGPVKAAQAQTLVLTQAPLVWQAPGALCGGVVCPPPLLLAAAPVVPVMAAQVVGGTQACEGGWSQGLPLPPPPPPAAQLPPIVSQGNAGPWPQGAHGEGSLASSQAKAPPDDSCNPRSVYENFRLWQHYKPLARRHLPQSPDTEALSCFLIPVLRSLARRKPTMTLEEGLWRAMREWQHTSNFDRMIFYEMAEKFLEFEAEEEMQIQKSQWMKGPQCLPPPATPRLEPRGPPAPEVVKQPVYLPSKAGPKAPTACLPPPRPQRPVTKARRPPPRPHRRAETKARLPPPRPQRPAETKVPEEIPPEVVQEYVDIMEELLGPSLGATGEPEKQREEGEVKQPQEEDWTPPDPGLLSYTDKLCSQKDFVTKVEAVIHPQFLEELLSPDPQMDFLALSQELEQEEGLTLAQLVEKRLLPLKEKQHARAAPSRGTARLDSSSSKFAAGQGAERDVPVPQQGVGMETCPPQTTARDSQGRGRAHTGMARSKDSVVLLGCQDSPGLRAARPTSPPQDHRPTCPGVGTKDALDLPGGSPVRESHGLAQGSSEEEELPSLAFLLGSQHKLLPWWLPQSPVPASGLLSPEKWGPQGTHQFPSAERRGLNLAPSPANKAKKRPLFGSLSPAEKTPHPGPGLRVSGEQSLTWGLGGPSQSQKRKGDPLVSRKEKKQRCSQ,mutated_sequence,1.0,878.0,UPI00001B6532.a2m,UPI00001B6532.npy,gnomAD
+UPI0001881663,UPI0001881663.csv,MKKFFDSRREQGGSGLGSGSSGGGGSTSGLGSGYIGRVFGIGRQQVTVDEVLAEGGFAIVFLVRTSNGMKCALKRMFVNNEHDLQVCKREIQIMRDLSGHKNIVGYIDSSINNVSSGDVWEVLILMDFCRGGQVVNLMNQRLQTGFTENEVLQIFCDTCEAVARLHQCKTPIIHRDLKVENILLHDRGHYVLCDFGSATNKFQNPQTEGVNAVEDEIKKYTTLSYRAPEMVNLYSGKIITTKADIWALGCLLYKLCYFTLPFGESQVAICDGNFTIPDNSRYSQDMHCLIRYMLEPDPDKRPDIYQVSYFSFKLLKKECPIPNVQNSPIPAKLPEPVKASEAAAKKTQPKARLTDPIPTTETSIAPRQRPKAGQTQPNPGILPIQPALTPRKRATVQPPPQAAGSSNQPGLLASVPQPKPQAPPSQPLPQTQAKQPQAPPTPQQTPSTQAQGLPAQAQATPQHQQQLFLKQQQQQQQPPPAQQQPAGTFYQQQQAQTQQFQAVHPATQKPAIAQFPVVSQGGSQQQLMQNFYQQQQQQQQQQQQQQLATALHQQQLMTQQAALQQKPTMAAGQQPQPQPAAAPQPAPAQEPAIQAPVRQQPKVQTTPPPAVQGQKVGSLTPPSSPKTQRAGHRRILSDVTHSAVFGVPASKSTQLLQAAAAEASLNKSKSATTTPSGSPRTSQQNVYNPSEGSTWNPFDDDNFSKLTAEELLNKDFAKLGEGKHPEKLGGSAESLIPGFQSTQGDAFATTSFSAGTAEKRKGGQTVDSGLPLLSVSDPFIPLQVPDAPEKLIEGLKSPDTSLLLPDLLPMTDPFGSTSDAVIEKADVAVESLIPGLEPPVPQRLPSQTESVTSNRTDSLTGEDSLLDCSLLSNPTTDLLEEFAPTAISAPVHKAAEDSNLISGFDVPEGSDKVAEDEFDPIPVLITKNPQGGHSRNSSGSSESSLPNLARSLLLVDQLIDL,mutated_sequence,1.0,961.0,UPI0001881663.a2m,UPI0001881663.npy,gnomAD
+UPI000020187D,UPI000020187D.csv,MRSAAKPWNPAIRAGGHGPDRVRPLPAASSGMKSSKSSTSLAFESRLSRLKRASSEDTLNKPGSTAASGVVRLKKTATAGAISELTESRLRSGTGAFTTTKRTGIPAPREFSVTVSRERSVPRGPSNPRKSVSSPTSSNTPTPTKHLRTPSTKPKQENEGGEKAALESQVRELLAEAKAKDSEINRLRSELKKYKEKRTLNAEGTDALGPNVDGTSVSPGDTEPMIRALEEKNKNFQKELSDLEEENRVLKEKLIYLEHSPNSEGAASHTGDSSCPTSITQESSFGSPTGNQMSSDIDEYKKNIHGNALRTSGSSSSDVTKASLSPDASDFEHITAETPSRPLSSTSNPFKSSKCSTAGSSPNSVSELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSRAAPPPVDEEPESSEVDAAGRWPGVCVSRTSPTPPESATTVKSLIKSFDLGRPGGAGQNISVHKTPRSPLSGIPVRTAPAAAVSPMQRHSTYSSVRPASRGVTQRLDLPDLPLSDILKGRTETLKPDPHLRKSPSLESLSRPPSLGFGDTRLLSASTRAWKPQSKLSVERKDPLAALAREYGGSKRNALLKWCQKKTQGYANIDITNFSSSWSDGLAFCALLHTYLPAHIPYQELNSQEKKRNLLLAFEAAESVGIKPSLELSEMLYTDRPDWQSVMQYVAQIYKYFET,mutated_sequence,1.0,1068.0,UPI000020187D.a2m,UPI000020187D.npy,gnomAD
+UPI000006D077,UPI000006D077.csv,MARTDQKPPCRGGCWGQPGHPNTGGAAAHPTYHPMGHRPRTCILLRGDQTTGGQAPSREISLGPWAAGTHFLAISTTPWGRKTPACISELPTSSGTAQPLANAVCEVQTVPGPGLRPQGTPAMRAPSHKGTPPTPNPWGPEQPQNRHKHPKKGVTGGPSPPPPAASRYGQTPGREPRVQAPGLGPCGRPASGRLLSLHLEKGDGKGTRQRIPLTDAAVGGDRTDIPSAIAAGPARTPDRHGLPIPGSTPTPMVGSGRLGAPVGRSGGGASARSSRPSCANVLLRADASLGTVLSVLWTGQLSRGWALLPPGDAGRHLETSVISAGVAAGIWLVEPGEAAQDPATRRTAPPRRTASPEPPAPGAPLPACPGRIPGAARFGPRSCPLGSPAVLAVTTGWSHRSV,mutated_sequence,1.0,402.0,UPI000006D077.a2m,UPI000006D077.npy,gnomAD
+UPI000013DC6C,UPI000013DC6C.csv,MMMMSLNSKQAFSMPHGGSLHVEPKYSALHSTSPGSSAPIAPSASSPSSSSNAGGGGGGGGGGGGGGGRSSSSSSSGSSGGGGSEAMRRACLPTPPSNIFGGLDESLLARAEALAAVDIVSQSKSHHHHPPHHSPFKPDATYHTMNTIPCTSAASSSSVPISHPSALAGTHHHHHHHHHHHHQPHQALEGELLEHLSPGLALGAMAGPDGAVVSTPAHAPHMATMNPMHQAALSMAHAHGLPSHMGCMSDVDADPRDLEAFAERFKQRRIKLGVTQADVGSALANLKIPGVGSLSQSTICRFESLTLSHNNMIALKPILQAWLEEAEKSHREKLTKPELFNGAEKKRKRTSIAAPEKRSLEAYFAIQPRPSSEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKYSAGI,mutated_sequence,1.0,409.0,UPI000013DC6C.a2m,UPI000013DC6C.npy,gnomAD
+UPI000013C755,UPI000013C755.csv,MGPTLAVPTPYGCIGCKLPQPEYPPALIIFMFCAMVITIVVDLIGNSMVILAVTKNKKLRNSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQMVGFITGLSVVGSIFNIVAIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAVLPNMYIGTIEYDPRTYTCIFNYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAARDPAGQNPDNQLAEVRNFLTMFVIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLYLAAYFIAYFNSCLNAVIYGLLNENFRREYWTIFHAMRHPIIFFSGLISDIREMQEARTLARARAHARDQAREQDRAHACPAVEETPMNVRNVPLPGDAAAGHPDRASGHPKPHSRSSSAYRKSASTHHKSVFSHSKAASGHLKPVSGHSKPASGHPKSATVYPKPASVHFKADSVHFKGDSVHFKPDSVHFKPASSNPKPITGHHVSAGSHSKSAFSAATSHPKPTTGHIKPATSHAEPTTADYPKPATTSHPKPTAADNPELSASHCPEIPAIAHPVSDDSDLPESASSPAAGPTKPAASQLESDTIADLPDPTVVTTSTNDYHDVVVIDVEDDPDEMAV,mutated_sequence,1.0,617.0,UPI000013C755.a2m,UPI000013C755.npy,gnomAD
+UPI0000DAC777,UPI0000DAC777.csv,MNFQAGGGQSPQQQQSLAAPGGGGAAAQQLVCGGQFGGAGPGAGGGGGPSQQLAGGPPQQFALSNSAAIRAEIQRFESVHPNIYAIYDLIERIEDLALQNQIREHVISIEDSFVNSQEWTLSRSVPELKVGIVGNLSSGKSALVHRYLTGTYVQEESPEGGRFKKEIVVDGQSYLLLIRDEGGPPELQFAAWVDAVVFVFSLEDEISFQTVYNYFLRLCSFRNASEVPMVLVGTQDAISAANPRVIDDSRARKLSTDLKRCTYYETCATYGLNVERVFQDVAQKVVALRKKQQLAIGPCKSLPNSPSHSAVSAASIPAVHINQATNGGGSAFSDYSSSVPSTPSISQRELRIETIAASSTPTPIRKQSKRRSNIFTSRKGADLDREKKAAECKVDSIGSGRAIPIKQGILLKRSGKSLNKEWKKKYVTLCDNGLLTYHPSLHDYMQNIHGKEIDLLRTTVKVPGKRLPRATPATAPGTSPRANGLSVERSNTQLGGGTGAPHSASSASLHSERPLSSSAWAGPRPEGLHQRSCSVSSADQWSEATTSLPPGMQHPASGPAEVLSSSPKLDPPPSPHSNRKKHRRKKSTGTPRPDGPSSATEEAEESFEFVVVSLTGQTWHFEASTAEERELWVQSVQAQILASLQGCRSAKDKTRLGNQNAALAVQAVRTVRGNSFCIDCDAPNPDWASLNLGALMCIECSGIHRHLGAHLSRVRSLDLDDWPPELLAVMTAMGNALANSVWEGALGGYSKPGPDACREEKERWIRAKYEQKLFLAPLPSSDVPLGQQLLRAVVEDDLRLLVMLLAHGSKEEVNETYGDGDGRTALHLSSAMANVVFTQLLIWYGVDVRSRDARGLTPLAYARRAGSQECADILIQHGCPGEGCGLAPTPNREPANGTNPSAELHRSPSLL,mutated_sequence,1.0,911.0,UPI0000DAC777.a2m,UPI0000DAC777.npy,gnomAD
+UPI000042467C,UPI000042467C.csv,MRCALALSALLLLLSTPPLLPSSPSPSPSPSQNATQTTTDSSNKTAPTPASSVTIMATDTAQQSTVPTSKANEILASVKATTLGVSSDSPGTTTLAQQVSGPVNTTVARGGGSGNPTTTIESPKSTKSADTTTVATSTATAKPNTTSSQNGAEDTTNSGGKSSHSVTTDLTSTKAEHLTTPHPTSPLSPRQPTSTHPVATPTSSGHDHLMKISSSSSTVAIPGYTFTSPGMTTTLLETVFHHVSQAGLELLTSGDLPTLASQSAGITASSVISQRTQQTSSQMPASSTAPSSQETVQPTSPATALRTPTLPETMSSSPTAASTTHRYPKTPSPTVAHESNWAKCEDLETQTQSEKQLVLNLTGNTLCAGGASDEKLISLICRAVKATFNPAQDKCGIRLASVPGSQTVVVKEITIHTKLPAKDVYERLKDKWDELKEAGVSDMKLGDQGPPEEAEDRFSMPLIITIVCMASFLLLVAALYGCCHQRLSQRKDQQRLTEELQTVENGYHDNPTLEVMETSSEMQEKKVVSLNGELGDSWIVPLDNLTKDDLDEEEDTHL,mutated_sequence,1.0,558.0,UPI000042467C.a2m,UPI000042467C.npy,gnomAD
+UPI00006AB830,UPI00006AB830.csv,METNESTEGSRSRSRSLDIQPSSEGLGPTSEPFPSSDDSPRSALAAATAAAAAAASAAAATAAFTTAKAAALSTKTPAPCSEFMEPSSDPSLLGEPCAGPGFTHNIAHGSLGFEPVYVSCIAQDTCTTTDHSSNPGPVPGSSSGPVLGSSSGAGHGSGSGSGPGCGSVPGSGSGPGPGSGPGSGPGHGSGSHPGPASGPGPDTGPDSELSPCIPPGFRNLVADRVPNYTSWSQHCPWEPQKQPPWEFLQVLEPGARGLWKPPDIKGKLMVCYETLPRGQCLLYNWEEERATNHLDQVPSMQDGSESFFFRHGHRGLLTMQLKSPMPSSTTQKDSYQPPGNVYWPLRGKREAMLEMLLQHQICKEVQAEQEPTRKLFEVESVTHHDYRMELAQAGTPAPTKPHDYRQEQPETFWIQRAPQLPGVSNIRTLDTPFRKNCSFSTPVPLSLGKLLPYEPENYPYQLGEISSLPCPGGRLGGGGGRMTPF,mutated_sequence,1.0,485.0,UPI00006AB830.a2m,UPI00006AB830.npy,gnomAD
+UPI00002077A2,UPI00002077A2.csv,MALLTAETFRLQFNNKRRLRRPYYPRKALLCYQLTPQNGSTPTRGYFENKKKCHAEICFINEIKSMGLDETQCYQVTCYLTWSPCSSCAWELVDFIKAHDHLNLGIFASRLYYHWCKPQQKGLRLLCGSQVPVEVMGFPKFADCWENFVDHEKPLSFNPYKMLEELDKNSRAIKRRLERIKIPGVRAQGRYMDILCDAEV,mutated_sequence,1.0,200.0,UPI00002077A2.a2m,UPI00002077A2.npy,gnomAD
+UPI0001AE641D,UPI0001AE641D.csv,MLAGKHVALTPLKAYSQDAGVICPENLHLRRNSREGGDQKEKKKGLQGPLQFRDVAIEFSLEEWHCLDMAQRNLYRDVMLENYRNLVFLGIVVSKPDLITHLEQGKKPSTMQRHEMVANPSVLCSHFNQDLWPEQSIKDSFQKLILRRHKKCGHDNLQLKKGCESVDKCKVHKRGYNGLNQCLTTTQSKMFQCDKHGKVFHQFSNTNRHKIRHTGKNPCKFTECGKAFNRSSTFTTHKKIHTGEKPYKCIECGKAFNRSSHLTTHKIIHTGEKRYKCEDCGKAFNRSSNLTTHKKIHTGEKPYKCEECGKAFKRSSILTTHKRIHTGEKPYKCEECGKVFKYLSSLSTHKIIHTGEKPYKCEECGKAFNWSSHLTTHKRIHTGEKPYKCEECGKGFKYSSTLTKHKIIHTGEKPYKCEECRSLRSQCDQLEERVSVMEDEMNEMK,mutated_sequence,1.0,445.0,UPI0001AE641D.a2m,UPI0001AE641D.npy,gnomAD
+UPI0000229786,UPI0000229786.csv,MMFRDQVGVLAGWFKGWNECEQTVALLSLLKRVSQTQARFLQLCLEHSLADCAELHVLEREANSPGIINQWQQESKDKVISLLLTHLPLLKPGNLDAKVEYMKLLPKILAHSIEHNQHIEESRQLLSYALIHPATSLEDRSALAMWLNHLEDRTSTSFGGQNRGRSDSVDYGQTHYYHQRQNSDDKLNGWQNSRDSGICINASNWQDKSMGCENGHVPLYSSSSVPTTINTIGTSTSTILSGQAHHSPLKRSVSLTPPMNVPNQPLGHGWMSHEDLRARGPQCLPSDHAPLSPQSSVASSGSGGSEHLEDQTTARNTFQEEGSGMKDVPAWLKSLRLHKYAALFSQMTYEEMMALTECQLEAQNVTKGARHKIVISIQKLKERQNLLKSLERDIIEGGSLRIPLQELHQMILTPIKAYSSPSTTPEARRREPQAPRQPSLMGPESQSPDCKDGAAATGATATPSAGASGGLQPHQLSSCDGELAVAPLPEGDLPGQFTRVMGKVCTQLLVSRPDEENISSYLQLIDKCLIHEAFTETQKKRLLSWKQQVQKLFRSFPRKTLLDISGYRQQRNRGFGQSNSLPTAGSVGGGMGRRNPRQYQIPSRNVPSARLGLLGTSGFVSSNQRNTTATPTIMKQGRQNLWFANPGGSNSMPSRTHSSVQRTRSLPVHTSPQNMLMFQQPEFQLPVTEPDINNRLESLCLSMTEHALGDGVDRTSTI,mutated_sequence,1.0,718.0,UPI0000229786.a2m,UPI0000229786.npy,gnomAD
+UPI00018848F1,UPI00018848F1.csv,MARMPGSGDCNTSAGGSASAAAAAAENNGERGEGERGAGGRGRRHSRPHYCSAGEEEEEEEEEDEIQEVQITGDEEEEEDGGGGLEEDEEEEEEEEMGLDWDEPLEPEDSAGEELEPEPVHMINMDQSAALEPEAPPRLLAPRARGGPPGDGSELDPDVLQRPERARLSENTRLATRYAVRIFREYLSEKAQSPDFETMDKGALCRVLRSFYAEARSKSGQLYSKSSLISIRSSLNRYLNEPPYCRTLDLTKDPELRSANLTLAAVIRKLEEQGAGPVVQKQAITRADLRKLYTSSVFSTNTPFGLLNKVWFETCMYFCTRGRENQRELEEDSFGLAMDEDGRKFVYFKSLGPYHKSRSSSWSKKRAESSDEENLPRMYETGTEFCPYASFVKYLSKRNPLCKAFFQRPRDHCSEGDVTWYENKAIGKNLLGTRMQMLSKAAKLSKTYTNHCIGAVSIATLNSIAGIGTKLGSPAPQGCYAEALNGAARHHSHHPPTHPSHHHRPQPPSLGNTYILPKDSQVGPDVKSEAAPKRALYESVFGSGEICGPTSPKRLCIRPSEPVDAVVVVSVKHDPLPLLPEANGHRSTNSPTIVSPAIVSPTQDSRPNMSRPLITRSPASPLNNQGIPTPAQLTKSNAPVHIDVGGHMYTSSLATLTKYPESRIGRLFDGTEPIVLDSLKQHYFIDRDGQMFRYILNFLRTSKLLIPDDFKDYTLLYEEAKYFQLQPMLLEMERWKQDRETGRFSRPCECLVVRVAPDLGERITLSGDKSLIEEVFPEIGDVMCNSVNAGWNHDSTHVIRFPLNGYCHLNSVQVLERLQQRGFEIVGSCGGGVDSSQFSEYVLRRELRRTPRVPSVIRIKQEPLD,mutated_sequence,1.0,865.0,UPI00018848F1.a2m,UPI00018848F1.npy,gnomAD
+UPI00001AEE5A,UPI00001AEE5A.csv,MSSAMLVTCLPDPSSSFREDAPRPPVPGEEGETPPCQPGVGKGQVTKPMPVSSNTRRNEDGLGEPEGRASPDSPLTRWTKSLHSLLGDQDGAYLFRTFLEREKCVDTLDFWFACNGFRQMNLKDTKTLRVAKAIYKRYIENNSIVSKQLKPATKTYIRDGIKKQQIDSIMFDQAQTEIQSVMEENAYQMFLTSDIYLEYVRSGGENTAYMSNGGLGSLKVVCGYLPTLNEEEEWTCADFKCKLSPTVVGLSSKTLRATASVRSTETVDSGYRSFKRSDPVNPYHIGSGYVFAPATSANDSEISSDALTDDSMSMTDSSVDGIPPYRVGSKKQLQREMHRSVKANGQVSLPHFPRTHRLPKEMTPVEPATFAAELISRLEKLKLELESRHSLEERLQQIREDEEREGSELTLNSREGAPTQHPLSLLPSGSYEEDPQTILDDHLSRVLKTPGCQSPGVGRYSPRSRSPDHHHHHHSQYHSLLPPGGKLPPAAASPGACPLLGGKGFVTKQTTKHVHHHYIHHHAVPKTKEEIEAEATQRVHCFCPGGSEYYCYSKCKSHSKAPETMPSEQFGGSRGSTLPKRNGKGTEPGLALPAREGGAPGGAGALQLPREEGDRSQDVWQWMLESERQSKPKPHSAQSTKKAYPLESARSSPGERASRHHLWGGNSGHPRTTPRAHLFTQDPAMPPLTPPNTLAQLEEACRRLAEVSKPPKQRCCVASQQRDRNHSATVQTGATPFSNPSLAPEDHKEPKKLAGVHALQASELVVTYFFCGEEIPYRRMLKAQSLTLGHFKEQLSKKGNYRYYFKKASDEFACGAVFEEIWEDETVLPMYEGRILGKVERID,mutated_sequence,1.0,843.0,UPI00001AEE5A.a2m,UPI00001AEE5A.npy,gnomAD
+UPI00017A8741,UPI00017A8741.csv,MLRLAQTPRRLPSPFRVTYRPWGTRAFGAFPVFGTQISSPPENLDTAHLPHPEGAFEEWMRSGWKKRDLTQQAKDIQNITVQETNKNNSESIECSKITMDLKFNNSRKYISITVPSKTQTMSPHIKSVDDVVVLGMNLSKFNKLTQFFICVAGVFVFYLIYGYLQELIFSVEGFKSCGWYLTLVQFAFYSIFGLIELQLIQDKRRRYVVCFYFLIFLY,mutated_sequence,1.0,218.0,UPI00017A8741.a2m,UPI00017A8741.npy,gnomAD
+UPI0000141054,UPI0000141054.csv,MDPKLGRMAASLLAVLLLLLERGMFSSPSPPPALLEKVFQYIDLHQDEFVQTLKEWVAIESDSVQPVPRFRQELFRMMAVAADTLQRLGARVASVDMGPQQLPDGQSLPIPPIILAELGSDPTKGTVCFYGHLDVQPADRGDGWLTDPYVLTEVDGKLYGRGATDNKGPVLAWINAVSAFRALEQDLPVNIKFIIEGMEEAGSVALEELVEKEKDRFFSGVDYIVISDNLWISQRKPAITYGTRGNSYFMVEVKCRDQDFHSGTFGGILHEPMADLVALLGSLVDSSGHILVPGIYDEVVPLTEEEINTYKAIHLDLEEYRNSSRVEKFLFDTKEEILMHLWRYPSLSIHGIEGAFDEPGTKTVIPGRVIGKFSIRLVPHMNVSAVEKQVTRHLEDVFSKRNSSNKMVVSMTLGLHPWIANIDDTQYLAAKRAIRTVFGTEPDMIRDGSTIPIAKMFQEIVHKSVVLIPLGAVDDGEHSQNEKINRWNYIEGTKLFAAFFLEMAQLH,mutated_sequence,1.0,507.0,UPI0000141054.a2m,UPI0000141054.npy,gnomAD
+UPI000013D1EC,UPI000013D1EC.csv,MSRLIVKNLPNGMKEERFRQLFAAFGTLTDCSLKFTKDGKFRKFGFIGFKSEEEAQKAQKHFNKSFIDTSRITVEFCKSFGDPAKPRAWSKHAQKPSQPKQPPKDSTTPEIKKDEKKKKVAGQLEKLKEDTEFQEFLSVHQRRAQAATWANDGLDAEPSKGKSKPASDYLNFDSDSGQESEEEGAGEDLEEEASLEPKAAVQKELSDMDYLKSKMVKAGSSSSSEEEESEDEAVHCDEGSEAEEEDSSATPVLQERDSKGAGQEQGMPAGKKRPPEARAETEKPANQKEPTTCHTVKLRGAPFNVTEKNVMEFLAPLKPVAIRIVRNAHGNKTGYIFVDFSNEEEVKQALKCNREYMGGRYIEVFREKNVPTTKGAPKNTTKSWQGRILGENEEEEDLAESGRLFVRNLPYTSTEEDLEKLFSKYGPLSELHYPIDSLTKKPKGFAFITFMFPEHAVKAYSEVDGQVFQGRMLHVLPSTIKKEASEDASALGSSSYKKKKEAQDKANSASSHNWNTLFMGPNAVADAIAQKYNATKSQVFDHETKGSVAVRVALGETQLVQEVRRFLIDNGVSLDSFSQAAAERSKTVILVKNLPAGTLAAQLQETFGHFGSLGRVLLPEGGITAIVEFLEPLEARKAFRHLAYSKFHHVPLYLEWAPVGVFSSTAPQKKKLQDTPSEPMEKDPAEPETVPDGETPEDENPTEEGADNSSAKMEEEEEEEEEEEESLPGCTLFIKNLNFDTTEEKLKEVFSKVGTVKSCSISKKKNKAGVLLSMGFGFVEYRKPEQAQKALKQLQGHVVDGHKLEVRISERATKPAVTLARKKQVPRKQTTSKILVRNIPFQAHSREIRELFSTFGELKTVRLPKKMTGTGTHRGFGFVDFLTKQDAKRAFNALCHSTHLYGRRLVLEWADSEVTLQALRRKTAAHFHEPPKKKRSVVLDEILEQLEGSDSDSEEQTLQL,mutated_sequence,1.0,960.0,UPI000013D1EC.a2m,UPI000013D1EC.npy,gnomAD
+UPI0000169571,UPI0000169571.csv,MPPPQQGPCGHHLLLLLALLLPSLPLTRAPVPPGPAAALLQALGLRDEPQGAPRLRPVPPVMWRLFRRRDPQETRSGSRRTSPGVTLQPCHVEELGVAGNIVRHIPDRGAPTRASEPASAAGHCPEWTVVFDLSAVEPAERPSRARLELRFAAAAAAAPEGGWELSVAQAGQGAGADPGPVLLRQLVPALGPPVRAELLGAAWARNASWPRSLRLALALRPRAPAACARLAEASLLLVTLDPRLCHPLARPRRDAEPVLGGGPGGACRARRLYVSFREVGWHRWVIAPRGFLANYCQGQCALPVALSGSGGPPALNHAVLRALMHAAAPGAADLPCCVPARLSPISVLFFDNSDNVVLRQYEDMVVDECGCR,mutated_sequence,1.0,372.0,UPI0000169571.a2m,UPI0000169571.npy,gnomAD
+UPI0002065925,UPI0002065925.csv,MERYKALEQLLTELDDFLKILDQENLSSTALVKKSCLAELLRLYTKSSSSDEEYIYMNKVTINKQQNAESQGKAPEEQGLLPNGEPSQHSSAPQKSLPDLPPPKMRILTLGQISGAQSFSPDGTHLTGTEFLGGWSWRPSQIRLMSPGTHWGDLTATEIPERKQLAIPKTESPEGYYEEAEPYDTSLNEDGEAVSSSYESYDEEDGSKGKSAPYQWPSPEAGIELMRDARICAFLWRKKWLGQWAKQLCVIKDNRLLCYKSSKDHSPQLDVNLLGSSVIHKEKQVRKKEHKLKITPMNADVIVLGLQSKDQAEQWLRVIQEVSGLPSEGASEGNQYTPDAQRFNCQKPDIAEKYLSASEYGSSVDGHPEVPETKDVKKKCSAGLKLSNLMNLGRKKSTSLEPVERSLETSSYLNVLVNSQWKSRWCSVRDNHLHFYQDRNRSKVAQQPLSLVGCEVVPDPSPDHLYSFRILHKGEELAKLEAKSSEEMGHWLGLLLSESGSKTDPEEFTYDYVDADRVSCIVSAAKNSLLLMQRKFSEPNTYIDGLPSQDRQEELYDDVDLSELTAAVEPTEEATPVADDPNERESDRVYLDLTPVKSFLHGPSSAQAQASSPTLSCLDNATEALPADSGPGPTPDEPCIKCPENLGEQQLESLEPEDPSLRITTVKIQTEQQRISFPPSCPDAVVATPPGASPPVKDRLRVTSAEIKLGKNRTEAEVKRYTEEKERLEKKKEEIRGHLAQLRKEKRELKETLLKCTDKEVLASLEQKLKEIDEECRGEESRRVDLELSIMEVKDNLKKAEAGPVTLGTTVDTTHLENVSPRPKAVTPASAPDCTPVNSATTLKNRPLSVVVTGKGTVLQKAKEWEKKGAS,mutated_sequence,1.0,871.0,UPI0002065925.a2m,UPI0002065925.npy,gnomAD
+UPI000067CB88,UPI000067CB88.csv,MEEVPGDALCEHFEANILTQNRCQNCFHPEEAHGARYQELRSPSGAEVPYCDLPRCPPAPEDPLSASTSGCQSVVDPGLRPGPKRGPSPSAGLPEEGPTAAPRSRSRELEAVPYLEGLTTSLCGSCNEDPGSDPTSSPDSATPDDTSNSSSVDWDTVERQEEEAPSWDELAVMIPRRPREGPRADSSQRAPSLLTRSPVGGDAAGQKKEDTGGGGRSAGQHWARLRGESGLSLERHRSTLTQASSMTPHSGPRSTTSQASPAQRDTAQAASTREIPRASSPHRITQRDTSRASSTQQEISRASSTQQETSRASSTQEDTPRASSTQEDTPRASSTQWNTPRASSPSRSTQLDNPRTSSTQQDNPQTSFPTCTPQRENPRTPCVQQDDPRASSPNRTTQRENSRTSCAQRDNPKASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRAARDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRTTQQDSPRTSCARRDDPRASSPNRTIQQENPRTSCALRDNPRASSPSRTIQQENPRTSCAQRDDPRASSPNRTTQQENPRTSCARRDNPRASSRNRTIQRDNPRTSCAQRDNPRASSPNRTIQQENLRTSCTRQDNPRTSSPNRATRDNPRTSCAQRDNLRASSPIRATQQDNPRTCIQQNIPRSSSTQQDNPKTSCTKRDNLRPTCTQRDRTQSFSFQRDNPGTSSSQCCTQKENLRPSSPHRSTQWNNPRNSSPHRTNKDIPWASFPLRPTQSDGPRTSSPSRSKQSEVPWASIALRPTQGDRPQTSSPSRPAQHDPPQSSFGPTQYNLPSRATSSSHNPGHQSTSRTSSPVYPAAYGAPLTSPEPSQPPCAVCIGHRDAPRASSPPRYLQHDPFPFFPEPRAPESEPPHHEPPYIPPAVCIGHRDAPRASSPPRHTQFDPFPFLPDTSDAEHQCQSPQHEPLQLPAPVCIGYRDAPRASSPPRQAPEPSLLFQDLPRASTESLVPSMDSLHECPHIPTPVCIGHRDAPSFSSPPRQAPEPSLFFQDPPGTSMESLAPSTDSLHGSPVLIPQVCIGHRDAPRASSPPRHPPSDLAFLAPSPSPGSSGGSRGSAPPGETRHNLEREEYTVLADLPPPRRLAQRQPGPQAQCSSGGRTHSPGRAEVERLFGQERRKSEAAGAFQAQDEGRSQQPSQGQSQLLRRQSSPAPSRQVTMLPAKQAELTRRSQAEPPHPWSPEKRPEGDRQLQGSPLPPRTSARTPERELRTQRPLESGQAGPRQPLGVWQSQEEPPGSQGPHRHLERSWSSQEGGLGPGGWWGCGEPSLGAAKAPEGAWGGTSREYKESWGQPEAWEEKPTHELPRELGKRSPLTSPPENWGGPAESSQSWHSGTPTAVGWGAEGACPYPRGSERRPELDWRDLLGLLRAPGEGVWARVPSLDWEGLLELLQARLPRKDPAGHRDDLARALGPELGPPGTNDVPEQESHSQPEGWAEATPVNGHSPALQSQSPVQLPSPACTSTQWPKIKVTRGPATATLAGLEQTGPLGSRSTAKGPSLPELQFQPEEPEESEPSRGQDPLTDQKQADSADKRPAEGKAGSPLKGRLVTSWRMPGDRPTLFNPFLLSLGVLRWRRPDLLNFKKGWMSILDEPGEPPSPSLTTTSTSQWKKHWFVLTDSSLKYYRDSTAEEADELDGEIDLRSCTDVTEYAVQRNYGFQIHTKDAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHSYSTQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQASPQRARTPARTPDRLAKQEELERDLAQRSEERRKWFEATDSRTPEVPAGEGPRRGLGAPLTEDQQNRLSEEIEKKWQELEKLPLRENKRVPLTALLNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQSALRSQEDGHIPPGYISQEACERSLAEMESSHQQVMEELQRHHERELQRLQQEKEWLLAEETAATASAIEAMKKAYQEELSRELSKTRSLQQGPDGLRKQHQSDVEALKRELQVLSEQYSQKCLEIGALMRQAEEREHTLRRCQQEGQELLRHNQELHGRLSEEIDQLRGFIASQGMGNGCGRSNERSSCELEVLLRVKENELQYLKKEVQCLRDELQMMQKDKRFTSGKYQDVYVELSHIKTRSEREIEQLKEHLRLAMAALQEKESMRNSLAE,mutated_sequence,1.0,2365.0,UPI000067CB88.a2m,UPI000067CB88.npy,gnomAD
+UPI000006F773,UPI000006F773.csv,MGSGAGELGRAERLPVLFLFLLSLFCPALCEQIRYRIPEEMPKGSVVGNLATDLGFSVQELPTRKLRVSSEKPYFTVSAESGELLVSSRLDREEICGKKPACALEFEAVAENPLNFYHVNVEIEDINDHTPKFTQNSFELQISESAQPGTRFILGSAHDADIGSNTLQNYQLSPSDHFSLINKEKSDGSKYPEMVLKTPLDREKQKSYHLTLTALDFGAPPLSSTAQIHVLVTDANDNAPVFSQDVYRVSLSENVYPGTTVLQVTATDQDEGVNAEITFSFSEASQITQFDLNSNTGEITVLNTLDFEEVKEYSIVLEARDGGGMIAQCTVEVEVIDENDNAPEVIFQSLPNLIMEDAELGTHIALLKVRDKDSRHNGEVTCKLEGDVPFKILTSSRNTYKLVTDAVLDREQNPEYNITVTATDRGKPPLSSSSSITLHIGDVNDNAPVFSQSSYIVHVAENNPPGASISQVRASDPDLGPNGQVSYCIMASDLEQRELSSYVSISAESGVVFAQRAFDHEQLRAFELTLQARDQGSPALSANVSLRVLVDDRNDNAPRVLYPALGPDGSALFDMVPHAAEPGYLVTKVVAVDADSGHNAWLSYHVLQASEPGLFSLGLRTGEVRTARALGDRDAVRQRLLVAVRDGGQPPLSATATLHLVFADSLQEVLPDITDRPDPSDLQAELQFYLVVALALISVLFLVAMILAIALRLRRSSSPASWSCFQPGLCVKSESVVPPNYSEGTLPYSYNLCVAHTGKTEFNFLKCSEQLSSGQDILCGDSSGALFPLCNSSELTSHQQAPPNTDWRFSQAQRPGTSGSQNGDDTGTWPNNQFDTEMLQAMILASASEAADGSSTLGGGAGTMGLSARYGPQFTLQHVPDYRQNVYIPGSNATLTNAAGKRDGKAPAGGNGNKKKSGKKEKK,mutated_sequence,1.0,923.0,UPI000006F773.a2m,UPI000006F773.npy,gnomAD
+UPI0000251DD8,UPI0000251DD8.csv,MGCTPSHSDLVNSVAKSGIQFLKKPKAIRPGCQGGSERGSIPLLVKNSTCYDAGEGLAEEQPSPRRNQTTAKGLCQLMGDPASGKRKDMEGLIPGTKTSSSQLNKSQSHMAKDIPFKTQGSHGSQGADFSGDESEESSTQDTSKWKRTAKCHTSSTQSHCYQTIHPAHEPEGKVDFPEPLVKAHQQAYTYLHSSLSKYEAILCIIHQATQTRELLQPMVSFLLLCFEEISQLLGEISKDGEVLLQEVREDLAWPLKKREPQEQPNLLQQLLQYTVSKLQVLNGTVASLTGSFLEGSSSYLHSTATHLENKLSTKRNVDERLLRALRQLESLASGCGDPGVQGLPLCSEDSGIGADNESVQSVDKLGKQTSWDLAPEPEEWKSVTSPHTEARQSGHTWQQSPFCLGSGRPQDCLLSGAPMAKVQPRAQDEARSPCLSSTSPENITSPPLKLGTSTPCDSFGIGVSVEPHLSKTSRPMDASSLSDSEDSSPEEEEEDKMSSMSLCAWQEKTPHSRPQSSPADRESPFQARTRRLRSLQAQEMILKMKESISERIKFVPVPCGHQDWSEEEEGRTVVPPRPSTVSGSRRAPERQTRSQSESCLQSHVEDPTFQELRRVQRDLSQKLEAFYALGAKGQGQSQEQILQPRAAAVWPNGTCRVSPSNTTSRLKASLTKNFSILPSQDKSILQKCNPHPEDEQGKAGKLPNAIPSGEVSEAAKATDWNVRGCPTRTSVKKLIETFSPTESLRMLGDSKDAGASPCLRNCIMPPRFPKYTGLAPLYPKPQISPASGRESLKMGIGWKPLAPIFPPLPKAEAAKSEELSCEMEGNLEHLPPPPMEVLMDKSFASLESPESSKSTENSPKETQEPGPGEAGPTRRTWASPKLRASVSPLDLLPSKSTASLTKPHSTGPGSGRSSCQPRKPALDLSSPPATSQSPEVKGGTWSQAEKATSLYRQPRKAIAWHHSGPPSGQNRTSESSLARPRQSRERSPPVGRKASPTRTHWVPQADKRRRSLPSSYRPAQPSPSAVQTPPSPPVSPRVLSPPTTKRRTSPPHQPKLPNPPPESAPAQCKVPSPPTQHPEASPPFSIPSPSPPMSPSQEHKETRDSEDSQAVIAKVSGNTHSIFCPATSSLFEAKPPLSTAHPLTPPSLPPEAGGPLGNPAECWKNSSGPWLRADSQRRAALCALNPLPFLRRTASDRQPGGRPQPPTLDPTSTSYESQLGQNSSSEESPKKDTEPGSSPCSPELQGGTRRASPPEFCVLGHGLQPEPRTGHIQDKSQPEAQPQQEEVS,mutated_sequence,1.0,1288.0,UPI0000251DD8.a2m,UPI0000251DD8.npy,gnomAD
+UPI0000470B60,UPI0000470B60.csv,MNSASVDGHLSGCRLFLFLSPLFRFYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEEQAQSLIDKTTAAFQQGKIPPTPFSAPPPAGAMIPPPPSLPGPPRPGMMPAPHMGGPPMMPMMGPPPPGMMPVGPAPGMRPPMGGHMPMMPGPPMMRPPARPMMVPTRPGMTRPDR,mutated_sequence,1.0,180.0,UPI0000470B60.a2m,UPI0000470B60.npy,gnomAD
+UPI0000073246,UPI0000073246.csv,MEAAPPGPPWPLLLLLLLLLALCGCPAPAAASPLLLFANRRDVRLVDAGGVKLESTIVVSGLEDAAAVDFQFSKGAVYWTDVSEEAIKQTYLNQTGAAVQNVVISGLVSPDGLACDWVGKKLYWTDSETNRIEVANLNGTSRKVLFWQDLDQPRAIALDPAHGYMYWTDWGETPRIERAGMDGSTRKIIVDSDIYWPNGLTIDLEEQKLYWADAKLSFIHRANLDGSFRQKVVEGSLTHPFALTLSGDTLYWTDWQTRSIHACNKRTGGKRKEILSALYSPMDIQVLSQERQPFFHTRCEEDNGGCSHLCLLSPSEPFYTCACPTGVQLQDNGRTCKAGAEEVLLLARRTDLRRISLDTPDFTDIVLQVDDIRHAIAIDYDPLEGYVYWTDDEVRAIRRAYLDGSGAQTLVNTEINDPDGIAVDWVARNLYWTDTGTDRIEVTRLNGTSRKILVSEDLDEPRAIALHPVMGLMYWTDWGENPKIECANLDGQERRVLVNASLGWPNGLALDLQEGKLYWGDAKTDKIEVINVDGTKRRTLLEDKLPHIFGFTLLGDFIYWTDWQRRSIERVHKVKASRDVIIDQLPDLMGLKAVNVAKVVGTNPCADRNGGCSHLCFFTPHATRCGCPIGLELLSDMKTCIVPEAFLVFTSRAAIHRISLETNNNDVAIPLTGVKEASALDFDVSNNHIYWTDVSLKTISRAFMNGSSVEHVVEFGLDYPEGMAVDWMGKNLYWADTGTNRIEVARLDGQFRQVLVWRDLDNPRSLALDPTKGYIYWTEWGGKPRIVRAFMDGTNCMTLVDKVGRANDLTIDYADQRLYWTDLDTNMIESSNMLGQERVVIADDLPHPFGLTQYSDYIYWTDWNLHSIERADKTSGRNRTLIQGHLDFVMDILVFHSSRQDGLNDCMHNNGQCGQLCLAIPGGHRCGCASHYTLDPSSRNCSPPTTFLLFSQKSAISRMIPDDQHSPDLILPLHGLRNVKAIDYDPLDKFIYWVDGRQNIKRAKDDGTQPFVLTSLSQGQNPDRQPHDLSIDIYSRTLFWTCEATNTINVHRLSGEAMGVVLRGDRDKPRAIVVNAERGYLYFTNMQDRAAKIERAALDGTEREVLFTTGLIRPVALVVDNTLGKLFWVDADLKRIESCDLSGANRLTLEDANIVQPLGLTILGKHLYWIDRQQQMIERVEKTTGDKRTRIQGRVAHLTGIHAVEEVSLEEFSAHPCARDNGGCSHICIAKGDGTPRCSCPVHLVLLQNLLTCGEPPTCSPDQFACATGEIDCIPGAWRCDGFPECDDQSDEEGCPVCSAAQFPCARGQCVDLRLRCDGEADCQDRSDEADCDAICLPNQFRCASGQCVLIKQQCDSFPDCIDGSDELMCEITKPPSDDSPAHSSAIGPVIGIILSLFVMGGVYFVCQRVVCQRYAGANGPFPHEYVSGTPHVPLNFIAPGGSQHGPFTGIACGKSMMSSVSLMGGRGGVPLYDRNHVTGASSSSSSSTKATLYPPILNPPPSPATDPSLYNMDMFYSSNIPATARPYRPYIIRGMAPPTTPCSTDVCDSDYSASRWKASKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCPPSPATERSYFHLFPPPPSPCTDSS,mutated_sequence,1.0,1615.0,UPI0000073246.a2m,UPI0000073246.npy,gnomAD
+UPI000013E18E,UPI000013E18E.csv,MSDVSTSVQSKFARLAKKKENITYMKREQLTETDKDIAPVLDLKCKDVSAIMNKFKVLMEIQDLMFEEMRETLKNDLKAVLGGKATIPEVKNSENSSSRTEFQQIINLALQKTGMVGKIEGENSKIGDDNENLTFKLEVNELSGKLDNTNEYNSNDGKKLPQGESRSYEVMGSMEETLCNIDDRDGNRNVHLEFTERESRKDGEDEFVKEMREERKFQKLKNKEEVLKASREEKVLMDEGAVLTLVADLSSATLDISKQWSNVFNILRENDFEPKFLCEVKLAFKCDGEIKTFSDLQSLRKFASQKSSVKELLKDVLPQKEEINQGGRKYGIQEKRDKTLIDSKHRAGEITSDGLSFLFLKEVKVAKPEEMKNLETQEEEFSELEELDEEASGMEDDEDTSGLEEEEEEPSGLEEEEEEEASGLEEDEASGLEEEEEQTSEQDSTFQGHTLVDAKHEVEITSDGMETTFIDSVEDSESEEEEEGKSSETGKVKTTSLTEKKASRRQKEIPFSYLVGDSGKKKLVKHQVVHKTQEEEETAVPTSQGTGTPCLTLCLASPSKSLEMSHDEHKKHSHTNLSISTGVTKLKKTEEKKHRTLHTEELTSKEADLTEETEENLRSSVINSIREIKEEIGNLKSSHSGVLEIENSVDDLSSRMDILEERIDSLEDQIEEFSKDTMQMTKQIISKERQRDIEERSRSCNIRLIGIPEKESYENRAEDIIKEIIDENFAELKKGSSLEIVSACRVPSKIDEKRLTPRHILVKFWNSSDKEKIIRASRERREITYQGTRIRLTADLSLDTLDARSKWSNVFKVLLEKGFNPRILYPAKMAFDFRGKTKVFLSIEEFRDYVLHMPTLRELLGNNIP,mutated_sequence,1.0,865.0,UPI000013E18E.a2m,UPI000013E18E.npy,gnomAD
+UPI00001C1F20,UPI00001C1F20.csv,MATEHPEPPKAELQLPPPPPPGHYGAWAAQELQAKLAEIGAPIQGNREELVERLQSYTRQTGIVLNRPVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPLQPPPPPPPPPPGLGLGFPMAHPPNLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQEERAKQQGDHSLKEHELLEQQKRAAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTPLGPRVAAPVGPVGPTPTVLPMGAPVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKILQLKESRQEEMNSQQEEEEMETDARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKKKPQRVRGVSSESSGDREKDSTRSRGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEAFKLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLSKKKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKRKYLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDIDYQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPNAHKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPLYGDVFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDKPDETGFITPADSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGAMMGSTHIYDMSTVMSRKGPAPELQGVEVALAPEELELDPMAMTQKYEEHVREQQAQVEKEDFSDMVAEHAAKQKQKKRKAQPQDSRGGSKKYKEFKF,mutated_sequence,1.0,895.0,UPI00001C1F20.a2m,UPI00001C1F20.npy,gnomAD
+UPI0000185F04,UPI0000185F04.csv,MKGMSHEPKSPSLGMLSTATRTTATVNPLTPSPLNGALVPSGSPATSSALSAQAAPSSSFAAALRKLAKQAEEPRGSSLSSESSPVSSPATNHSSPASTPKRVPMGPIIVPPGGHSVPSTPPVVTIAPTKTVNGVWRSESRQDAGSRSSSGGRERLIVEPPLPQEKAGGPAIPSHLLSTPYPFGLSPSSVVQDSRFPPLNLQRPVHHVVPPSTVTEDYLRSFRPYHTTDDLRMSSLPPLGLDPATAAAYYHPSYLAPHPFPHPAFRMDDSYCLSALRSPFYPIPTPGSLPPLHPSAMHLHLSGVRYPPELSHSSLAALHSERMSGLSAERLQMDEELRREREREREREREREADREREKEREREREKEREQEKEREREKERERELERQREQRAREKELLAAKALEPSFLPVAELHGLRGHATEERGKPSEQLTPTRAEKLKDAGLQAPKPVQHPLHPVPTPHHTVPSLISNHGIFSLPSSSAATALLIQRTNEEEKWLARQRRLRQEKEDRQSQVSEFRQQVLEQHLDMGRPPVPAEAEHRPESTTRPGPNRHEPGGRDPPQHFGGPPPLISPKPQLHAAPTALWNPVSLMDNTLETRRAESHSLHSHPAAFEPSRQAAVPLVKVERVFCPEKAEEGPRKREPAPLDKYQPPPPPPREGGSLEHQPFLPGPGPFLAELEKSTQTILGQQRASLPQAATFGELSGPLKPGSPYRPPVPRAPDPAYIYDEFLQQRRRLVSKLDLEERRRREAQEKGYYYDLDDSYDESDEEEVRAHLRCVAEQPPLKLDTSSEKLEFLQLFGLTTQQQKEELVAQKRRKRRRMLRERSPSPPTIQSKRQTPSPRLALSTRYSPDEMNNSPNFEEKKKFLTIFNLTHISAEKRKDKERLVEMLRAMKQKALSAAVADSLTNSPRDSPAVSLSEPATQQASLDVEKPVGVAASLSDIPKAAEPGKLEQVRPQELSRVQELAPASGEKARLSEAPGGKKSLSMLHYIRGAAPKDIPVPLSHSTNGKSKPWEPFVAEEFAHQFHESVLQSTQKALQKHKGSVAVLSAEQNHKVDTSVHYNIPELQSSSRAPPPQHNGQQEPPTARKGPPTQELDRDSEEEEEEDDEDGEDEEEVPKRKWQGIEAVFEAYQEHIEEQNLERQVLQTQCRRLEARHYSLSLTAEQLSHSVAELRSQKQKMVSERERLQAELDHLRKCLALPAMHWPRGYLKGYPR,mutated_sequence,1.0,1217.0,UPI0000185F04.a2m,UPI0000185F04.npy,gnomAD
+UPI000004A2BE,UPI000004A2BE.csv,MARMGLAGAAGRWWGLALGLTAFFLPGVHSQVVQVNDSMYGFIGTDVVLHCSFANPLPSVKITQVTWQKSTNGSKQNVAIYNPSMGVSVLAPYRERVEFLRPSFTDGTIRLSRLELEDEGVYICEFATFPTGNRESQLNLTVMAKPTNWIEGTQAVLRAKKGQDDKVLVATCTSANGKPPSVVSWETRLKGEAEYQEIRNPNGTVTVISRYRLVPSREAHQQSLACIVNYHMDRFKESLTLNVQYEPEVTIEGFDGNWYLQRMDVKLTCKADANPPATEYHWTTLNGSLPKGVEAQNRTLFFKGPINYSLAGTYICEATNPIGTRSGQVEVNITEFPYTPSPPEHGRRAGPVPTAIIGGVAGSILLVLIVVGGIVVALRRRRHTFKGDYSTKKHVYGNGYSKAGIPQHHPPMAQNLQYPDDSDDEKKAGPLGGSSYEEEEEEEEGGGGGERKVGGPHPKYDEDAKRPYFTVDEAEARQDGYGDRTLGYQYDPEQLDLAENMVSQNDGSFISKKEWYV,mutated_sequence,1.0,517.0,UPI000004A2BE.a2m,UPI000004A2BE.npy,gnomAD
+UPI0000127442,UPI0000127442.csv,MSDASLRSTSTMERLVARGTFPVLVRTSACRSLFGPVDHEELSRELQARLAELNAEDQNRWDYDFQQDMPLRGPGRLQWTEVDSDSVPAFYRETVQVGRCRLLLAPRPVAVAVAVSPPLEPAAESLDGLEEAPEQLPSVPVPAPASTPPPVPVLAPAPAPAPAPVAAPVAAPVAVAVLAPAPAPAPAPAPAPAPVAAPAPAPAPAPAPAPAPAPAPDAAPQESAEQGANQGQRGQEPLADQLHSGISGRPAAGTAAASANGAAIKKLSGPLISDFFAKRKRSAPEKSSGDVPAPCPSPSAAPGVGSVEQTPRKRLR,mutated_sequence,1.0,316.0,UPI0000127442.a2m,UPI0000127442.npy,gnomAD
+UPI000013CDEE,UPI000013CDEE.csv,MSSPTTSSLDTPLPGNGPPQPGAPSSSPTVKEEGPEPWPGGPDPDVPGTDEASSACSTDWVIPDPEEEPERKRKKGPAPKMLGHELCRVCGDKASGFHYNVLSCEGCKGFFRRSVVRGGARRYACRGGGTCQMDAFMRRKCQQCRLRKCKEAGMREQCVLSEEQIRKKKIRKQQQESQSQSQSPVGPQGSSSSASGPGASPGGSEAGSQGSGEGEGVQLTAAQELMIQQLVAAQLQCNKRSFSDQPKVTPWPLGADPQSRDARQQRFAHFTELAIISVQEIVDFAKQVPGFLQLGREDQIALLKASTIEIMLLETARRYNHETECITFLKDFTYSKDDFHRAGLQVEFINPIFEFSRAMRRLGLDDAEYALLIAINIFSADRPNVQEPGRVEALQQPYVEALLSYTRIKRPQDQLRFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDVHE,mutated_sequence,1.0,460.0,UPI000013CDEE.a2m,UPI000013CDEE.npy,gnomAD
+UPI00001D7B21,UPI00001D7B21.csv,MSVGELYSQCTRVWIPDPDEVWRSAELTKDYKEGDKSLQLRLEDETILEYPIDVQRNQLPFLRNPDILVGENDLTALSYLHEPAVLHNLKVRFLESNHIYTYCGIVLVAINPYEQLPIYGQDVIYTYSGQNMGDMDPHIFAVAEEAYKQMARDEKNQSIIVSGESGAGKTVSAKYAMRYFATVGGSASETNIEEKVLASSPIMEAIGNAKTTRNDNSSRFGKYIQIGFDKRYHIIGANMRTYLLEKSRVVFQADDERNYHIFYQLCAAAGLPEFKELALTSAEDFFYTSQGGDTSIEGVDDAEDFEKTRQAFTLLGVKESHQMSIFKIIASILHLGSVAIQAERDGDSCSISPQDVYLSNFCRLLGVEHSQMEHWLCHRKLVTTSETYVKTMSLQQVINARNALAKHIYAQLFGWIVEHINKALHTSLKQHSFIGVLDIYGFETFEVNSFEQFCINYANEKLQQQFNSHVFKLEQEEYMKEQIPWTLIDFYDNQPCIDLIEAKLGILDLLDEECKVPKGTDQNWAQKLYDRHSSSQHFQKPRMSNTAFIIVHFADKVEYLSDGFLEKNRDTVYEEQINILKASKFPLVADLFHDDKDPVPATTPGKGSSSKISVRSARPPMKVSNKEHKKTVGHQFRTSLHLLMETLNATTPHYVRCIKPNDEKLPFHFDPKRAVQQLRACGVLETIRISAAGYPSRWAYHDFFNRYRVLVKKRELANTDKKAICRSVLENLIKDPDKFQFGRTKIFFRAGQVAYLEKLRADKFRTATIMIQKTVRGWLQKVKYHRLKGATLTLQRYCRGHLARRLAEHLRRIRAAVVLQKHYRMQRARQAYQRVRRAAVVIQAFTRAMFVRRTYRQVLMEHKATTIQKHVRGWMARRHFQRLRDAAIVIQCAFRMLKARRELKALRIEARSAEHLKRLNVGMENKVVQLQRKIDEQNKEFKTLSEQLSVTTSTYTMEVERLKKELVHYQQSPGEDTSLRLQEEVESLRTELQRAHSERKILEDAHSREKDELRKRVADLEQENALLKDEKEQLNNQILCQSKDEFAQNSVKENLMKKELEEERSRYQNLVKEYSQLEQRYDNLRDEMTIIKQTPGHRRNPSNQSSLESDSNYPSISTSEIGDTEDALQQVEEIGLEKAAMDMTVFLKLQKRVRELEQERKKLQVQLEKREQQDSKKVQAEPPQTDIDLDPNADLAYNSLKRQELESENKKLKNDLNELRKAVADQATQNNSSHGSPDSYSLLLNQLKLAHEELEVRKEEVLILRTQIVSADQRRLAGRNAEPNINARSSWPNSEKHVDQEDAIEAYHGVCQTNSKTEDWGYLNEDGELGLAYQGLKQVARLLEAQLQAQSLEHEEEVEHLKAQLEALKEEMDKQQQTFCQTLLLSPEAQVEFGVQQEISRLTNENLDLKELVEKLEKNERKLKKQLKIYMKKAQDLEAAQALAQSERKRHELNRQVTVQRKEKDFQGMLEYHKEDEALLIRNLVTDLKPQMLSGTVPCLPAYILYMCIRHADYTNDDLKVHSLLTSTINGIKKVLKKHNDDFEMTSFWLSNTCRLLHCLKQYSGDEGFMTQNTAKQNEHCLKNFDLTEYRQVLSDLSIQIYQQLIKIAEGVLQPMIVSAMLENESIQGLSGVKPTGYRKRSSSMADGDNSYCLEAIIRQMNAFHTVMCDQGLDPEIILQVFKQLFYMINAVTLNNLLLRKDVCSWSTGMQLRYNISQLEEWLRGRNLHQSGAVQTMEPLIQAAQLLQLKKKTQEDAEAICSLCTSLSTQQIVKILNLYTPLNEFEERVTVAFIRTIQAQLQERNDPQQLLLDAKHMFPVLFPFNPSSLTMDSIHIPACLNLEFLNEV,mutated_sequence,1.0,1848.0,UPI00001D7B21.a2m,UPI00001D7B21.npy,gnomAD
+UPI000013F27D,UPI000013F27D.csv,MDARRMKKEGLTENTGLPRKLLEKHDPWPAYVTYTSQTVKRLIEKSKTRELECMRALEERPWASRQNKPSSVIQPKRRKSSKSSGKAVFRDTLSESTLSMWGAYSVLAMAPTMIPEPTHLHADSRDCPTENYNKIIFARKPMMRMLPTVRY,mutated_sequence,1.0,151.0,UPI000013F27D.a2m,UPI000013F27D.npy,gnomAD
+UPI0000037B54,UPI0000037B54.csv,MALRSRFWGLFSVCRNPGCRFAALSTSSEPAAKPEVDPVENEAVAPEFTNRNPRNLELLSVARKERGWRTVFPSREFWHRLRVIRTQHHVEALVEHQNGKVVVSASTREWAIKKHLYSTRNVVACESIGRVLAQRCLEAGINFMVYQPTPWEAASDSMKRLQSAMTEGGVVLREPQRIYE,mutated_sequence,1.0,180.0,UPI0000037B54.a2m,UPI0000037B54.npy,gnomAD
+UPI0000E0A787,UPI0000E0A787.csv,MFYGTHFIMSPPTKSKLKRQSQLLSSMLSRTLSYKYRDLDSTFSSLGASDDPAELSTQLSAPGVLKVFGDSVCTGTHYKSVLATGTSSARELVKEALERYALDPRQAGQYVLCDVVGQAGDAGQRWQARCFRVFGDSEKPLLIQELWKPREGLSRRFELRKRSDVEELAAKEVDTITAGINAQARRLQRSRAKGTPTPALGDARSSPPPRLRRTVSETSLSPVNALPAAAQGPEEPGPDAMRYSLYQSPHLLLLQGYSQQHDSLVYVLNRDRHTVGQRTPSSKPSISLSAPDILPLHCTIRRQPLPDSGQAAGRLVLEPIPGAHISVNFSEVGHRTVVLHHGDLLSLGLYYLLLFKDPAQAQPLPARALARLRAVPQSCRLCGAALGARGAASPTQAALPRRQQLLLEFEPHLEDTLLQRIMTLIEPGGDDHKLTPAFLLCLCIQHSATHFQPGTFGQLLLKIARLIRETVWEKTKELAEKQAQLQEPISLASCAMADLVPDLQPILFWMSNSIELLYFIQQKCPLYMQSMEEQLDITGSKESLFSCTLTASEEAMAVLEEVVLYAFQQCVYYVSKSLYICLPALLECPPFQTERRESWSSAPELPEELRRVVSVYQAALDLLRQLQVHPEVASQMLAYLFFFSGTLLLNQLLDRGPSLSCFHWPRGVQACARLQQLLEWMRSAGFGAAGEHFFQKLSCTLNLLATPRAQLIQMSWTALRAAFPALSPAQLHRLLTHYQLASAMGPMSTWEPGAQDSPEAFRSEDVLESYENPPPIVLPSDGFQVDLEANCLDDSIYQHLLYVRHFLWGLRSRASPGSPGRPGSGASQPVCPEGMHHVVLDGHLEAPSCPLAPRDPGPAAREVAPERTLPLRGAPWAQAPPGRQPSRGGSQAGPPHTDSSCLLTPPSTPLGPEPGDPDWPESGGPCGKALPERQRNGLSGLRGAAPEGDSAALAEESPPAPSSRSSSTEDFCYVFTVELERGPSGLGMGLIDGMHTHLGAPGLYIQTLLPGSPAAADGRLSLGDRILEVNGSSLLGLGYLRAVDLIRHGGKKMRFLVAKSDVETAKKIHFRTPPL,mutated_sequence,1.0,1075.0,UPI0000E0A787.a2m,UPI0000E0A787.npy,gnomAD
+UPI00001C1FDF,UPI00001C1FDF.csv,MKEWKSKMEISEEKKSARAASEKLQRQITQECELVETSNSEDRLLKHWVSPLKDAMRHLPSQESGIREMHIIPQKAIVGEIGHGCNEGEKILSAGESSHRYEVSGQNFKQKSGLTEHQKIHNINKTYECKECEKTFNRSSNLIIHQRIHTGNKPYVCNECGKDSNQSSNLIIHQRIHTGKKPYICHECGKDFNQSSNLVRHKQIHSGGNPYECKECGKAFKGSSNLVLHQRIHSRGKPYLCNKCGKAFSQSTDLIIHHRIHTGEKPYECYDCGQMFSQSSHLVPHQRIHTGEKPLKCNECEKAFRQHSHLTEHQRLHSGEKPYECHRCGKTFSGRTAFLKHQRLHAGEKIEECEKTFSKDEELREEQRIHQEEKAYWCNQCGRNFQGTSDLIRHQVTHTGEKPYECKECGKTFNQSSDLLRHHRIHSGEKPCVCSKCGKSFRGSSDLIRHHRVHTGEKPYECSECGKAFSQRSHLVTHQKIHTGEKPYQCTECGKAFRRRSLLIQHRRIHSGEKPYECKECGKLFIWRTAFLKHQSLHTGEKLECEKTFSQDEELRGEQKIHQEAKAYWCNQCGRAFQGSSDLIRHQVTHTREKPYECKECGKTFNQSSDLLRHHRIHSGEKPYVCNKCGKSFRGSSDLIKHHRIHTGEKPYECSECGKAFSQRSHLATHQKIHTGEKPYQCSECGNAFRRRSLLIQHRRLHSGEKPYECKECGKLFMWHTAFLKHQRLHAGEKLEECEKTFSKDEELRKEQRTHQEKKVYWCNQCSRTFQGSSDLIRHQVTHTREKPYECKECGKTQSELRPSETS,mutated_sequence,1.0,807.0,UPI00001C1FDF.a2m,UPI00001C1FDF.npy,gnomAD
+UPI00002004B5,UPI00002004B5.csv,MPKLQGFEFWSRTLRGARHVVAPMVDQSELAWRLLSRRHGAQLCYTPMLHAQVFVRDANYRKENLYCEVCPEDRPLIVQFCANDPEVFVQAALLAQDYCDAIDLNLGCPQMIAKRGHYGAFLQDEWDLLQRMILLAHEKLSVPVTCKIRVFPEIDKTVRYAQMLEKAGCQLLTVHGRTKEQKGPLSGAASWEHIKAVRKAVAIPVFANGNIQCLQDVERCLRDTGVQGVMSAEGNLHNPALFEGRSPAVWELAEEYLDIVREHPCPLSYVRAHLFKLWHHTLQVHQELREELAKVKTLEGIAAVSQELKLRCQEEISRQEGAKPTGDLPFHWICQPYIRPGPREGSKEKAGARSKRALEEEEGGTEVLSKNKQKKQLRNPHKTFDPSLKPKYAKCDQCGNPKGNRCVFSLCRGCCKKRASKETADCPGHGLLFKTKLEKSLAWKEAQPELQEPQPAAPGTPGGFSEVMGSALA,mutated_sequence,1.0,473.0,UPI00002004B5.a2m,UPI00002004B5.npy,gnomAD
+UPI0000041DD3,UPI0000041DD3.csv,MSDSNLSDNHLPDTFFLTGIPGLEAAHFWIAIPFCAMYLVALVGNAALILVIAMDNALHAPMYLFLCLLSLTDLALSSTTVPKMLAILWLHAGEISFGGCLAQMFCVHSIYALESSILLAMAFDRYVAICNPLRYTTILNHAVIGRIGFVGLFRSVAIVSPFIFLLRRLPYCGHRVMTHTYCEHMGIARLACANITVNIVYGLTVALLAMGLDSILIAISYGFILHAVFHLPSHDAQHKALSTCGSHIGIILVFYIPAFFSFLTHRFGHHEVPKHVHIFLANLYVLVPPVLNPILYGARTKEIRSRLLKLLHLGKTSI,mutated_sequence,1.0,318.0,UPI0000041DD3.a2m,UPI0000041DD3.npy,gnomAD
+UPI0000073ADE,UPI0000073ADE.csv,MEPAGPAPGRLGPLLCLLLAASCAWSGVAGEEELQVIQPDKSVLVAAGETATLRCTATSLIPVGPIQWFRGAGPGRELIYNQKEGHFPRVTTVSDLTKRNNMDFSIRIGNITPADAGTYYCVKFRKGSPDDVEFKSGAGTELSVRAKPSAPVVSGPAARATPQHTVSFTCESHGFSPRDITLKWFKNGNELSDFQTNVDPVGESVSYSIHSTAKVVLTREDVHSQVICEVAHVTLQGDPLRGTANLSETIRVPPTLEVTQQPVRAENQVNVTCQVRKFYPQRLQLTWLENGNVSRTETASTVTENKDGTYNWMSWLLVNVSAHRDDVKLTCQVEHDGQPAVSKSHDLKVSAHPKEQGSNTAAENTGSNERNIYIVVGVVCTLLVALLMAALYLVRIRQKKAQGSTSSTRLHEPEKNAREITQDTNDITYADLNLPKGKKPAPQAAEPNNHTEYASIQTSPQPASEDTLTYADLDMVHLNRTPKQPAPKPEPSFSEYASVQVPRK,mutated_sequence,1.0,504.0,UPI0000073ADE.a2m,UPI0000073ADE.npy,gnomAD
+UPI000013DE11,UPI000013DE11.csv,MVTWLYRFLPTSNMAAKLRSLLPPDLRLQFWLHARLQKCFLSRGCGSYCAGAKASPLPGKMAMGLMCGRRELLRLLQSGRRVHSVAGPSQWLGKPLTTRLLFPAAPCCCRPHYLFLAASGPRSLSTSAISFAEVQVQAPPVVAATPSPTAVPEVASGETADVVQTAAEQSFAELGLGSYTPVGLIQNLLEFMHVDLGLPWWGAIAACTVFARCLIFPLIVTGQREAARIHNHLPEIQKFSSRIREAKLAGDHIEYYKASSEMALYQKKHGIKLYKPLILPVTQAPIFISFFIALREMANLPVPSLQTGGLWWFQDLTVSDPIYILPLAVTATMWAVLELGAETGVQSSDLQWMRNVIRMMPLITLPITMHFPTAVFMYWLSSNLFSLVQVSCLRIPAVRTVLKIPQRVVHDLDKLPPREGFLESFKKGWKNAEMTRQLREREQRMRNQLELAARGPLRQTFTHNPLLQPGKDNPPNIPSSSSKPKSKYPWHDTLG,mutated_sequence,1.0,495.0,UPI000013DE11.a2m,UPI000013DE11.npy,gnomAD
+UPI00015E09B8,UPI00015E09B8.csv,SRRRSKSSRRSSRRSSRRSRSKRSRSRRRSKSSRRSSRRSRRRSRTRRSRSSSRRSSTNRSSSK,mutated_sequence,,,UPI00015E09B8.a2m,UPI00015E09B8.npy,gnomAD
+UPI0000049802,UPI0000049802.csv,MANASEPGGSGGGEAAALGLKLATLSLLLCVSLAGNVLFALLIVRERSLHRAPYYLLLDLCLADGLRALACLPAVMLAARRAAAAAGAPPGALGCKLLAFLAALFCFHAAFLLLGVGVTRYLAIAHHRFYAERLAGWPCAAMLVCAAWALALAAAFPPVLDGGGDDEDAPCALEQRPDGAPGALGFLLLLAVVVGATHLVYLRLLFFIHDRRKMRPARLVPAVSHDWTFHGPGATGQAAANWTAGFGRGPTPPALVGIRPAGPGRGARRLLVLEEFKTEKRLCKMFYAVTLLFLLLWGPYVVASYLRVLVRPGAVPQAYLTASVWLTFAQAGINPVVCFLFNRELRDCFRAQFPCCQSPRTTQATHPCDLKGIGL,mutated_sequence,1.0,375.0,UPI0000049802.a2m,UPI0000049802.npy,gnomAD
+UPI000013DA0D,UPI000013DA0D.csv,MRLTHICCCCLLYQLGFLSNGIVSELQFAPDREEWEVVFPALWRREPVDPAGGSGGSADPGWVRGVGGGGSARAQAAGSSREVRSVAPVPLEEPVEGRSESRLRPPPPSEGEEDEELESQELPRGSSGAAALSPGAPASWQPPPPPQPPPSPPPAQHAEPDGDEVLLRIPAFSRDLYLLLRRDGRFLAPRFAVEQRPNPGPGPTGAASAPQPPAPPDAGCFYTGAVLRHPGSLASFSTCGGGLMGFIQLNEDFIFIEPLNDTMAITGHPHRVYRQKRSMEEKVTEKSALHSHYCGIISDKGRPRSRKIAESGRGKRYSYKLPQEYNIETVVVADPAMVSYHGADAARRFILTILNMVFNLFQHKSLSVQVNLRVIKLILLHETPPELYIGHHGEKMLESFCKWQHEEFGKKNDIHLEMSTNWGEDMTSVDAAILITRKDFCVHKDEPCDTVGIAYLSGMCSEKRKCIIAEDNGLNLAFTIAHEMGHNMGINHDNDHPSCADGLHIMSGEWIKGQNLGDVSWSRCSKEDLERFLRSKASNCLLQTNPQSVNSVMVPSKLPGMTYTADEQCQILFGPLASFCQEMQHVICTGLWCKVEGEKECRTKLDPPMDGTDCDLGKWCKAGECTSRTSAPEHLAGEWSLWSPCSRTCSAGISSRERKCPGLDSEARDCNGPRKQYRICENPPCPAGLPGFRDWQCQAYSVRTSSPKHILQWQAVLDEEKPCALFCSPVGKEQPILLSEKVMDGTSCGYQGLDICANGRCQKVGCDGLLGSLAREDHCGVCNGNGKSCKIIKGDFNHTRGAGYVEVLVIPAGARRIKVVEEKPAHSYLALRDAGKQSINSDWKIEHSGAFNLAGTTVHYVRRGLWEKISAKGPTTAPLHLLVLLFQDQNYGLHYEYTIPSDPLPENQSSKAPEPLFMWTHTSWEDCDATCGGGERKTTVSCTKIMSKNISIVDNEKCKYLTKPEPQIRKCNEQPCQTRWMMTEWTPCSRTCGKGMQSRQVACTQQLSNGTLIRARERDCIGPKPASAQRCEGQDCMTVWEAGVWSECSVKCGKGIRHRTVRCTNPRKKCVLSTRPREAEDCEDYSKCYVWRMGDWSKCSITCGKGMQSRVIQCMHKITGRHGNECFSSEKPAAYRPCHLQPCNEKINVNTITSPRLAALTFKCLGDQWPVYCRVIREKNLCQDMRWYQRCCETCRDFYAQKLQQKS,mutated_sequence,1.0,1207.0,UPI000013DA0D.a2m,UPI000013DA0D.npy,gnomAD
+UPI0000374562,UPI0000374562.csv,MLLLLSDQLLLTALRKPNPQAMAALFLSAPPQAEVTFEDVAVYLSREEWGRLGPAQRGLYRDVMLETYGNLVSLGVGPAGPKPGVISQLERGDEPWVLDVQGTSGKEHLRVNSPALGTRTEYKELTSQETFGEEDPQGSEPVEACDHISKSEGSLEKLVEQRGPRAVTLTNGESSRESGGNLRLLSRPVPDQRPHKCDICEQSFEQRSYLNNHKRVHRSKKTNTVRNSGEIFSANLVVKEDQKIPTGKKLHYCSYCGKTFRYSANLVKHQRLHTEEKPYKCDECGKAFSQSCEFINHRRMHSGEIPYRCDECGKTFTRRPNLMKHQRIHTGEKPYKCGECGKHFSAYSSLIYHQRIHTGEKPYKCNDCGKAFSDGSILIRHRRTHTGEKPFECKECGKGFTQSSNLIQHQRIHTGEKPYKCNECEKAFIQKTKLVEHQRSHTGEKPYECNDCGKVFSQSTHLIQHQRIHTGEKPYKCSECGKAFHNSSRLIHHQRLHHGEKPYRCSDCKKAFSQSTYLIQHRRIHTGEKPYKCSECGKAFRHSSNMCQHQRIHLREDFSM,mutated_sequence,1.0,560.0,UPI0000374562.a2m,UPI0000374562.npy,gnomAD
+UPI000013CCD8,UPI000013CCD8.csv,MSRRKQAKPQHFQSDPEVASLPRRDGDTEKGQPSRPTKSKDAHVCGRCCAEFFELSDLLLHKKNCTKNQLVLIVNENPASPPETFSPSPPPDNPDEQMNDTVNKTDQVDCSDLSEHNGLDREESMEVEAPVANKSGSGTSSGSHSSTAPSSSSSSSSSSGGGGSSSTGTSAITTSLPQLGDLTTLGNFSVINSNVIIENLQSTKVAVAQFSQEARCGGASGGKLAVPALMEQLLALQQQQIHQLQLIEQIRHQILLLASQNADLPTSSSPSQGTLRTSANPLSTLSSHLSQQLAAAAGLAQSLASQSASISGVKQLPPIQLPQSSSGNTIIPSNSGSSPNMNILAAAVTTPSSEKVASSAGASHVSNPAVSSSSSPAFAISSLLSPASNPLLPQQASANSVFPSPLPNIGTTAEDLNSLSALAQQRKSKPPNVTAFEAKSTSDEAFFKHKCRFCAKVFGSDSALQIHLRSHTGERPFKCNICGNRFSTKGNLKVHFQRHKEKYPHIQMNPYPVPEHLDNIPTSTGIPYGMSIPPEKPVTSWLDTKPVLPTLTTSVGLPLPPTLPSLIPFIKTEEPAPIPISHSATSPPGSVKSDSGGPESATRNLGGLPEEAEGSTLPPSGGKSEESGMVTNSVPTASSSVLSSPAADCGPAGSATTFTNPLLPLMSEQFKAKFPFGGLLDSAQASETSKLQQLVENIDKKATDPNECIICHRVLSCQSALKMHYRTHTGERPFKCKICGRAFTTKGNLKTHYSVHRAMPPLRVQHSCPICQKKFTNAVVLQQHIRMHMGGQIPNTPVPDSYSESMESDTGSFDEKNFDDLDNFSDENMEDCPEGSIPDTPKSADASQDSLSSSPLPLEMSSIAALENQMKMINAGLAEQLQASLKSVENGSIEGDVLTNDSSSVGGDMESQSAGSPAISESTSSMQALSPSNSTQEFHKSPSIEEKPQRAVPSEFANGLSPTPVNGGALDLTSSHAEKIIKEDSLGILFPFRDRGKFKNTACDICGKTFACQSALDIHYRSHTKERPFICTVCNRGFSTKGNLKQHMLTHQMRDLPSQLFEPSSNLGPNQNSAVIPANSLSSLIKTEVNGFVHVSPQDSKDTPTSHVPSGPLSSSATSPVLLPALPRRTPKQHYCNTCGKTFSSSSALQIHERTHTGEKPFACTICGRAFTTKGNLKVHMGTHMWNSTPARRGRRLSVDGPMTFLGGNPVKFPEMFQKDLAARSGSGDPSSFWNQYAAALSNGLAMKANEISVIQNGGIPPIPGSLGSGNSSPVSGLTGNLERLQNSEPNAPLAGLEKMASSENGTNFRFTRFVEDSKEIVTS,mutated_sequence,1.0,1324.0,UPI000013CCD8.a2m,UPI000013CCD8.npy,gnomAD
+UPI0000073D30,UPI0000073D30.csv,MSDSPAGSNPRTPESSGSGSGGGGKRPAVPAAVSLLPPADPLRQANRLPIRVLKMLSAHTGHLLHPEYLQPLSSTPVSPIELDAKKSPLALLAQTCSQIGKPDPPPSSKLNSVAAAANGLGAEKDPGRSAPGAASAAAALKQLGDSPAEDKSSFKPYSKGSGGGDSRKDSGSSSVSSTSSSSSSSPGDKAGFRVPSAACPPFPPHGAPVSASSSSSSPGGSRGGSPHHSDCKNGGGVGGGELDKKDQEPKPSPEPAAVSRGGGGEPGAHGGAESGASGRKSEPPSALVGAGHVAPVSPYKPGHSVFPLPPSSIGYHGSIVGAYAGYPSQFVPGLDPSKSGLVGGQLSGGLGLPPGKPPSSSPLTGASPPSFLQGLCRDPYCLGGYHGASHLGGSSCSTCSAHDPAGPSLKAGGYPLVYPGHPLQPAALSSSAAQAALPGHPLYTYGFMLQNEPLPHSCNWVAASGPCDKRFATSEELLSHLRTHTALPGAEKLLAAYPGASGLGSAAAAAAAAASCHLHLPPPAAPGSPGSLSLRNPHTLGLSRYHPYGKSHLSTAGGLAVPSLPTAGPYYSPYALYGQRLASASALGYQ,mutated_sequence,1.0,590.0,UPI0000073D30.a2m,UPI0000073D30.npy,gnomAD
+UPI000015F97E,UPI000015F97E.csv,MHYDGHVRFDLPPQGSVLARNVSTRSCPPRTSPAVDLEEEEEESSVDGKGDRKSTGLKLSKKKARRRHTDDPSKECFTLKFDLNVDIETEIVPAMKKKSLGEVLLPVFERKGIALGKVDIYLDQSNTPLSLTFEAYRFGGHYLRVKAPAKPGDEGKVEQGMKDSKSLSLPILRPAGTGPPALERVDAQSRRESLDILAPGRRRKNMSEFLGEASIPGQEPPTPSSCSLPSGSSGSTNTGDSWKNRAASRFSGFFSSGPSTSAFGREVDKMEQLEGKLHTYSLFGLPRLPRGLRFDHDSWEEEYDEDEDEDNACLRLEDSWRELIDGHEKLTRRQCHQQEAVWELLHTEASYIRKLRVIINLFLCCLLNLQESGLLCEVEAERLFSNIPEIAQLHRRLWASVMAPVLEKARRTRALLQPGDFLKGFKMFGSLFKPYIRYCMEEEGCMEYMRGLLRDNDLFRAYITWAEKHPQCQRLKLSDMLAKPHQRLTKYPLLLKSVLRKTEEPRAKEAVVAMIGSVERFIHHVNACMRQRQERQRLAAVVSRIDAYEVVESSSDEVDKLLKEFLHLDLTAPIPGASPEETRQLLLEGSLRMKEGKDSKMDVYCFLFTDLLLVTKAVKKAERTRVIRPPLLVDKIVCRELRDPGSFLLIYLNEFHSAVGAYTFQASGQALCRGWVDTIYNAQNQLQQLRAQEPPGSQQPLQSLEEEEDEQEEEEEEEEEEEEGEDSGTSAASSPTIMRKSSGSPDSQHCASDGSTETLAMVVVEPGDTLSSPEFDSGPFSSQSDETSLSTTASSATPTSELLPLGPVDGRSCSMDSAYGTLSPTSLQDFVAPGPMAELVPRAPESPRVPSPPPSPRLRRRTPVQLLSCPPHLLKSKSEASLLQLLAGAGTHGTPSAPSRSLSELCLAVPAPGIRTQGSPQEAGPSWDCRGAPSPGSGPGLVGCLAGEPAGSHRKRCGDLPSGASPRVQPEPPPGVSAQHRKLTLAQLYRIRTTLLLNSTLTASEV,mutated_sequence,1.0,1006.0,UPI000015F97E.a2m,UPI000015F97E.npy,gnomAD
+UPI00001D818D,UPI00001D818D.csv,MLRRGHLAFRDVAIEFPQEEWKCLDPAQRTLYREVMVENYRNLVFLGICLPDLSVISMLEQRRDPRNLQSEVKIANNPGGRECIKGVNAESSSKLGSNAGNKSLKNQLGLTFQLHLSELQLFQAERNISGCKHVEKPINNSLVSPLQKIYSSVKSHILNKYRNDFDDSPFLPQEQKAQIREKPCECNEHGKAFRVSSRLANNQVIHTADNPYKCNECDKVFSNSSNLVQHQRIHTGEKPYKCHECGKLFNRISLLARHQRIHTGEKPYKCHECGKVFTQNSHLANHHRIHTGEKPYKCNECGKVFNRNAHLARHQKIHSGEKPYKCKECGKAFSGGSGLTAHLVIHTGEKLYKCNKCGKVFNRNAHLTRHQRIHTGEKPYECKECGKVFRHKFCLTNHHRMHTGEQPYKCNECGKAFRDCSGLTAHLLIHTGEKPYKCKECAKVFRHRLSLSNHQRFHTGEKPYRCDECGKDFTRNSNLANHHRIHTGEKPYKCSECHKVFSHNSHLARHRQIHTGEKSYKCNECGKVFSHKLYLKKHERIHTGEKPYRCHECGKDFTRNSNLANHHRIHTGEKPYR,mutated_sequence,1.0,577.0,UPI00001D818D.a2m,UPI00001D818D.npy,gnomAD
+UPI00001FEBF9,UPI00001FEBF9.csv,MRAVPLPAPLLPLLLLALLAAPAARASRAESVSAPWPEPERESRPPPGPGPGNTTRFGSGAAGGSGSSSSNSSGDALVTRISILLRDLPTLKAAVIVAFAFTTLLIACLLLRVFRSGKRLKKTRKYDIITTPAERVEMAPLNEEDDEDEDSTVFDIKYR,mutated_sequence,1.0,159.0,UPI00001FEBF9.a2m,UPI00001FEBF9.npy,gnomAD
+UPI0000073E41,UPI0000073E41.csv,MAATLDLKSKEEKDAELDKRIEALRRKNEALIRRYQEIEEDRKKAELEGVAVTAPRKGRSVEKENVAVESEKNLGPSRRSPGTPRPPGASKGGRTPPQQGGRAGMGRASRSWEGSPGEQPRGGGAGGRGRRGRGRGSPHLSGAGDTSISDRKSKEWEERRRQNIEKMNEEMEKIAEYERNQREGVLEPNPVRNFLDDPRRRSGPLEESERDRREESRRHGRNWGGPDFERVRCGLEHERQGRRAGLGSAGDMTLSMTGRERSEYLRWKQEREKIDQERLQRHRKPTGQWRREWDAEKTDGMFKDGPVPAHEPSHRYDDQAWARPPKPPTFGEFLSQHKAEASSRRRRKSSRPQAKAAPRAYSDHDDRWETKEGAASPAPETPQPTSPETSPKETPMQPPEIPAPAHRPPEDEGEENEGEEDEEWEDISEDEEEEEIEVEEGDEEEPAQDHQAPEAAPTGIPCSEQAHGVPFSPEEPLLEPQAPGTPSSPFSPPSGHQPVSDWGEEVELNSPRTTHLAGALSPGEAWPFESV,mutated_sequence,1.0,531.0,UPI0000073E41.a2m,UPI0000073E41.npy,gnomAD
+UPI0000246F73,UPI0000246F73.csv,MYQASAVSLLPRDIPSCHSPSPGFSHLPTSSSQLAPDLLQFPLGQDPSFLAIPILTLPPSDSLVPPYIVWYIVWPSALISFLGCTLTVQFSNGKLQSPGNMRFTLYENKDSTNPRKRNQRILAAETDRLSYVGNNFGTGALKCNTLCRHFVGILNKTSGQMEVYDAELFNMQPLFSDVSVESELALESQTKTYREKMDSCIEAFGTTKQKRALNTRRMNRVGNESLNRAVAKAAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVIEALKSLPSDVESRDRQARCIWFLDTLIKFRAHRVVKRKSALGPGVPHIINTKLLKHFTCLTYNNGRLRNLISDSMKAKITAYVIILALHIHDFQIDLTVLQRDLKLSEKRMMEIAKAMRLKISKRRVSVAAGSEEDHKLGTLSLPLPPAQTSDRLAKRRKIT,mutated_sequence,1.0,481.0,UPI0000246F73.a2m,UPI0000246F73.npy,gnomAD
+UPI0001642876,UPI0001642876.csv,MGNSYAGQLKSARFEEALHNSIEASLRCSSVVPRPIFSQLYLDPDQHPFSSADVKPKVEDLDKDLVNRYTQNGSLDFSNNLTVNEMEDDEDDEEMSDSNSPPIPYSQKPAPEGSCTTDGFCQAGKDLRLVSLCMEQIDIPAGFLLVGAKSPNLPEHILVCAVDKRFLPDDHGKNALLGFSGNCIGCGERGFRYFTEFSNHINLKLTTQPKKQKHLKYYLVRSSQGVLSKGPLICWKECRSRQSSASCHSIKPSSSVSSTVTPENGTTNGYKSGFTQTDAANGNSSHGGKGSASSSTPAHTGNYSLSPRPSYASGDQATMFISGPPKKRHRGWYPGSPLPQPGLVVPVPTVRPLSRTEPLLSAPVPQTPLTGILQPRPIPAGETVIVPENLLSNSGVRPVILIGYGTLPYFYGNVGDIVVSPLLVNCYKIPQLENKDLEKLGLTGSQFLSVENMILLTIQYLVRLGPDQVPLREEFEQIMLKAMQEFTLRERALQIGAQCVPVSPGQLPWLARLIASVSQDLVHVVVTQNSLAEGISETLRTLSEMRHYQRLPDYVVVICASKIRGNEFCVVVLGQHQSRALAESMLTTSEFLKEISYELITGKVSFLASHFKTTSLGDDLDKLLEKMQQRRGDSVVTPFDGDLNECVSPQEAAAMIPTQNLDLDNETFHIYQPQLTVARKLLSQVCAIADSGSQSLDLGHFSKVDFIIIVPRSEVLVQQTLQRIRQSGVLVDLGLEENGTAHQRAEKYVVRLDNEIQTKFEVFMRRVKQNPYTLFVLVHDNSHVELTSVISGSLSHSEPSHGLADRVINCREVLEAFNLLVLQVSSFPYTLQTQQSRISSSNEVHWIQLDTGEDVGCEEKLYFGLSEYSKSLQWGITSPLLRCDETFEKMVNTLLERYPRLHSMVVRCYLLIQQYSEALMALTTMASLRDHSTPETLSIMDDLISSPGKNKSGRGHMLIIRVPSVQLAMLAKERLQEVRDKLGLQYRFEIILGNPATELSVATHFVARLKSWRGNEPEEWIPRTYQDLDGLPCIVILTGKDPLGETFPRSLKYCDLRLIDSSYLTRTALEQEVGLACCYVSKEVIRGPTVALDLSGKEQERAAVSENDSDELLIDLERPQSNSSAVTGTSGSIMENGVSSSSTADKSQKQSLTPSFQSPATSLGLDEGVSASSAGAGAGETLKQECDSLGPQMASSTTSKPSSSSSGPRTLPWPGQPIRGCRGPQAALPPVVILSKAAYSLLGSQKSGKLPSSSSLLPHADVAWVSSLRPLLNKDMSSEEQSLYYRQWTLARQHHADYSNQLDPASGTRNFHPRRLLLTGPPQVGKTGSYLQFLRILFRMLIRLLEVDVYDEEEINTDHNESSEVSQSEGEPWPDIESFSKMPFDVSVHDPKYSLMSLVYTEKLAGVKQEVIKESKVEEPRKRETVSIMLTKYAAYNTFHHCEQCRQYMDFTSASQMSDSTLHAFTFSSSMLGEEVQLYFIIPKSKESHFVFSKQGKHLESMRLPLVSDKNLNAVKSPIFTPSSGRHEHGLLNLFHAMEGISHLHLLVVKEYEMPLYRKYWPNHIMLVLPGMFNNAGVGAARFLIKELSYHNLELERNRLEELGIKRQCVWPFIVMMDDSCVLWNIHSVQEPSSQPMEVGVSSKNVSLKTVLQHIEATPKIVHYAILGIQKWSSKLTSQSLKAPFSRCHVHDFILLNTDLTQNVQYDFNRYFCEDADFNLRTNSSGLLICRFNNFSLMKKHVQVGGQRDFIIKPKIMVSESLAPILPLQYICAPDSEHTLLAAPAQFLLEKFLQHASYKLFPKAIHNFRSPVLAIDCYLNIGPEVAICYISSRPHSSNVNCEGVFFSGLLLYLCDSFVGADLKKFKFLKGATLCVICQDRSSLRQTIVRLELEDEWQFRLRDEFQTANSSDDKPLYFLTGRHV,mutated_sequence,1.0,1923.0,UPI0001642876.a2m,UPI0001642876.npy,gnomAD
+UPI00001D771D,UPI00001D771D.csv,MASNGAYPVLGPGVTVNPGTSLSVFTALPFATPAPGPAHRPPLVTAVVPPAGPLVLSAFPSTPLVAGQDGRGPSGAGASNVFVQMRTEVGPVKPPQAQTLILTQAPLVWQAPGTLCGGVMCPPPLLLAAAPGVPVTSAQVVGGTQACEGGWSHGLPLPPPPPAAQVAPIVSPGNARPWPQGAHGEGSLAPSQAKARPDDSCKPKSVYENFRLWQHYKPLARRHLPQSPDTEALSCFLIPVLRSLARRKPTMTLEEGLWQAMREWQHTSNFDRMIFYEMAEKFLEFEAEEEMQIQKSQWMKGPQSLPPPAPPRLEPRGPPAPEVVKQPVYLPSKDGPKAPTACLPPPRPQRPAETKAHLPPPRPQRPAETNAHLPPPRPQRPAETKVPEEIPPEVVQEYVDIMEELLGSHPGDTGEPEGQREKGKVEQPQEEDGITSDPGLLSYIDKLCSQEDFVTKVEAVIHPRFLEELLSPDPQMDFLALSQELEQEEGLTLAQLVEKRLLSLKEKGCGRAAPRHGTARLDSSPSEFAAGQEAAREVPDPQQRVSVETSPPQTAAQDPQGQGRVRTGMARSEDPAVLLGCQDSPRLKAVRPTSPPQDHRPTCPGLGTKDALGLPGESPVKESHGLAKGSSEETELPGMVYVVGSHHRLRPWRLSQSPVPSSGLLSPGGRGPQGALQSPSAQKRGLSPSPSPASKSKKRPLFGSPSPAEKTPHPGPGLRVSGEQSLAWGLGGPSQSQKRKGDPLASRRKKKRHCSQ,mutated_sequence,1.0,756.0,UPI00001D771D.a2m,UPI00001D771D.npy,gnomAD
+UPI000013E4EC,UPI000013E4EC.csv,MDVDAEREKITQEIKELERILDPGSSGSHVEISESSLESDSEADSLPSEDLDPADPPISEEERWGEASNDEDDPKDKTLPEDPETCLQLNMVYQEVIQEKLAEANLLLAQNREQQEELMRDLAGSKGTKVKDGKSLPPSTYMGHFMKPYFKDKVTGVGPPANEDTREKAAQGIKAFEELLVTKWKNWEKALLRKSVVSDRLQRLLQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQDINQLPEEALLGNRLDSHDWEKISNINFEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLEWQKIAEELGTSRSAFQCLQKFQQHNKALKRKEWTEEEDRMLTQLVQEMRVGSHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVGHWAKIASELPHRSGSQCLSKWKIMMGKKQGLRRRRRRARHSVRWSSTSSSGSSSGSSGGSSSSSSSSSEEDEPEQAQAGEGDRALLSPQYMVPDMDLWVPARQSTSQPWRGGAGAWLGGPAASLSPPKGSSASQGGSKEASTTAAAPGEETSPVQVPARAHGPVPRSAQASHSADTRPAGAEKQALEGGRRLLTVPVETVLRVLRANTAARSCTQKEQLRQPPLPTSSPGVSSGDSVARSHVQWLRHRATQSGQRRWRHALHRRLLNRRLLLAVTPWVGDVVVPCTQASQRPAVVQTQADGLREQLQQARLASTPVFTLFTQLFHIDTAGCLEVVRERKALPPRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLLHKKALEHKATSLVVGGEAERPAGALQASLGLVRGQLQDNPAYLLLRARFLAAFTLPALLATLAPQGVRTTLSVPSRVGSESEDEDLLSELELADRDGQPGCTTATCPIQGAPDSGKCSASSCLDTSNDPDDLDVLRTRHARHTRKRRRLV,mutated_sequence,1.0,1469.0,UPI000013E4EC.a2m,UPI000013E4EC.npy,gnomAD
+UPI00001C1FA8,UPI00001C1FA8.csv,MARAGSCGGAAAGAGRPEPWELSLEEVLKAYEQPLNEEQAWAVCFQGCRGLRGSPGRRLRDTGDLLLRGDGSVGAREPEAAEPATMVVPLASSEAQTVQSLGFAIYRALDWGLDESEERELSPQLERLIDLMANNDSEDSGCGAADEGYGGPEEEEEAEGVPRSVRTFAQAMRLCAARLTDPRGAQAHYQAVCRALFVETLELRAFLARVREAKEMLQKLREDEPHLETPRAELDSLGHTDWARLWVQLMRELRRGVKLKKVQEQEFNPLPTEFQLTPFEMLMQDIRARNYKLRKVMVDGDIPPRVKKDAHELILDFIRSRPPLKQVSERRLRPLPPKQRSLHEKILEEIKQERRLRPVRGEGWAARGFGSLPCILNACSGDAKSTSCINLSVTDAGGSAQRPRPRVLLKAPTLAEMEEMNTSEEEESPCGEVTLKRDRSFSEHDLAQLRSEVASGLQSATHPPGGTEPPRPRAGSAHVWRPGSRDQGTCPASVSDPSHPLLSNRGSSGDRPEASMTPDAKHLWLEFSHPVESLALTVEEVMDVRRVLVKAEMEKFLQNKELFSSLKKGKICCCCRAKFPLFSWPPSCLFCKRAVCTSCSIKMKMPSKKFGHIPVYTLGFESPQRVSAAKTAPIQRRDIFQSLQGPQWQSVEEAFPHIYSHGCVLKDVCSECTSFVADVVRSSRKSVDVLNTTPRRSRQTQSLYIPNTRTLDFK,mutated_sequence,1.0,714.0,UPI00001C1FA8.a2m,UPI00001C1FA8.npy,gnomAD
+UPI0000458A54,UPI0000458A54.csv,MLRRVTVAAVCATRRKLCEAGRELAALWGIETRGRCEDSAAARPFPILAMPGRNKAKSTCSCPDLQPNGQDLGENSRVARLGADESEEEGRRGSLSNAGDPEIVKSPSDPKQYRYIKLQNGLQALLISDLSNMEGKTGNTTDDEEEEEVEEEEEDDDEDSGAEIEDDDEEGFDDEDEFDDEHDDDLDTEDNELEELEERAEARKKTTEKQSAAALCVGVGSFADPDDLPGLAHFLEHMVFMGSLKYPDENGFDAFLKKHGGSDNASTDCERTVFQFDVQRKYFKEALDRWAQFFIHPLMIRDAIDREVEAVDSEYQLARPSDANRKEMLFGSLARPGHPMGKFFWGNAETLKHEPRKNNIDTHARLREFWMRYYSSHYMTLVVQSKETLDTLEKWVTEIFSQIPNNGLPRPNFGHLTDPFDTPAFNKLYRVVPIRKIHALTITWALPPQQQHYRVKPLHYISWLVGHEGKGSILSFLRKKCWALALFGGNGETGFEQNSTYSVFSISITLTDEGYEHFYEVAYTVFQYLKMLQKLGPEKRIFEEIRKIEDNEFHYQEQTDPVEYVENMCENMQLYPLQDILTGDQLLFEYKPEVIGEALNQLVPQKANLVLLSGANEGKCDLKEKWFGTQYSIEDIENSWAELWNSNFELNPDLHLPAENKYIATDFTLKAFDCPETEYPVKIVNTPQGCLWYKKDNKFKIPKAYIRFHLISPLIQKSAANVVLFDIFVNILTHNLAEPAYEADVAQLEYKLVAGEHGLIIRVKGFNHKLPLLFQLIIDYLAEFNSTPAVFTMITEQLKKTYFNILIKPETLAKDVRLLILEYARWSMIDKYQALMDGLSLESLLSFVKEFKSQLFVEGLVQGNVTSTESMDFLKYVVDKLNFKPLEQEMPVQFQVVELPSGHHLCKVKALNKGDANSEVTVYYQSGTRSLREYTLMELLVMHMEEPCFDFLRTKQTLGYHVYPTCRNTSGILGFSVTVGTQATKYNSEVVDKKIEEFLSSFEEKIENLTEEAFNTQVTALIKLKECEDTHLGEEVDRNWNEVVTQQYLFDRLAHEIEALKSFSKSDLVNWFKAHRGPGSKMLSVHVVGYGKYELEEDGTPSSEDSNSSCEVMQLTYLPTSPLLADCIIPITDIRAFTTTLNLLPYHKIVK,mutated_sequence,1.0,1151.0,UPI0000458A54.a2m,UPI0000458A54.npy,gnomAD
+UPI000066D9E3,UPI000066D9E3.csv,MATNPQPQPPPPAPPPPPPQPQPQPPPPPPGPGAGPGAGGAGGAGAGAGDPQLVAMIVNHLKSQGLFDQFRRDCLADVDTKPAYQNLRQRVDNFVANHLATHTWSPHLNKNQLRNNIRQQVLKSGMLESGIDRIISQVVDPKINHTFRPQVEKAVHEFLATLNHKEEGSGNTAPDDEKPDTSLITQGVPTPGPSANVANDAMSILETITSLNQEASAARASTETSNAKTSERASKKLPSQPTTDTSTDKERTSEDMADKEKSTADSGGEGLETAPKSEEFSDLPCPVEEIKNYTKEHNNLILLNKDVQQESSEQKNKSTDKGEKKPDSNEKGERKKEKKEKTEKKFDHSKKSEDTQKVKDEKQAKEKEVESLKLPSEKNSNKAKTVEGTKEDFSLIDSDVDGLTDITVSSVHTSDLSSFEEDTEEEVVTSDSMEEGEITSDDEEKNKQNKTKTQTSDSSEGKTKSVRHAYVHKPYLYSKYYSDSDDELTVEQRRQSIAKEKEERLLRRQINREKLEEKRKQKAEKTKSSKTKGQGRSSVDLEESSTKSLEPKAARIKEVLKERKVLEKKVALSKKRKKDSRNVEENSKKKQQYEEDSKETLKTSEHCEKEKISSSKELKHVHAKSEPSKPARRLSESLHVVDENKNESKLEREHKRRTSTPVIMEGVQEETDTRDVKRQVERSEICTEEPQKQKSTLKNEKHLKKDDSETPHLKSLLKKEVKSSKEKPEREKTPSEDKLSVKHKYKGDCMHKTGDETELHSSEKGLKVEENIQKQSQQTKLSSDDKTERKSKHRNERKLSVLGKDGKPVSEYIIKTDENVRKENNKKERRLSAEKTKAEHKSRRSSDSKIQKDSLGSKQHGITLQRRSESYSEDKCDMDSTNMDSNLKPEEVVHKEKRRTKSLLEEKLVLKSKSKTQGKQVKVVETELQEGATKQATTPKPDKEKNTEENDSEKQRKSKVEDKPFEETGVEPVLETASSSAHSTQKDSSHRAKLPLAKEKYKSDKDSTSTRLERKLSDGHKSRSLKHSSKDIKKKDENKSDDKDGKEVDSSHEKARGNSSLMEKKLSRRLCENRRGSLSQEMAKGEEKLAANTLSTPSGSSLQRPKKSGDMTLIPEQEPMEIDSEPGVENVFEVSKTQDNRNNNSQQDIDSENMKQKTSATVQKDELRTCTADSKATAPAYKPGRGTGVNSNSEKHADHRSTLTKKMHIQSAVSKMNPGEKEPIHRGTTEVNIDSETVHRMLLSAPSENDRVQKNLKNTAAEEHVAQGDATLEHSTNLDSSPSLSSVTVVPLRESYDPDVIPLFDKRTVLEGSTASTSPADHSALPNQSLTVRESEVLKTSDSKEGGEGFTVDTPAKASITSKRHIPEAHQATLLDGKQGKVIMPLGSKLTGVIVENENITKEGGLVDMAKKENDLNAEPNLKQTIKATVENGKKDGIAVDHVVGLNTEKYAETVKLKHKRSPGKVKDISIDVERRNENSEVDTSAGSGSAPSVLHQRNGQTEDVATGPRRAEKTSVATSTEGKDKDVTLSPVKAGPATTTSSETRQSEVALPCTSIEADEGLIIGTHSRNNPLHVGAEASECTVFAAAEEGGAVVTEGFAESETFLTSTKEGESGECAVAESEDRAADLLAVHAVKIEANVNSVVTEEKDDAVTSAGSEEKCDGSLSRDSEIVEGTITFISEVESDGAVTSAGTEIRAGSISSEEVDGSQGNMMRMGPKKETEGTVTCTGAEGRSDNFVICSVTGAGPREERMVTGAGVVLGDNDAPPGTSASQEGDGSVNDGTEGESAVTSTGITEDGEGPASCTGSEDSSEGFAISSESEENGESAMDSTVAKEGTNVPLVAAGPCDDEGIVTSTGAKEEDEEGEDVVTSTGRGNEIGHASTCTGLGEESEGVLICESAEGDSQIGTVVEHVEAEAGAAIMNANENNVDSMSGTEKGSKDTDICSSAKGIVESSVTSAVSGKDEVTPVPGGCEGPMTSAASDQSDSQLEKVEDTTISTGLVGGSYDVLVSGEVPECEVAHTSPSEKEDEDIITSVENEECDGLMATTASGDITNQNSLAGGKNQGKVLIISTSTTNDYTPQVSAITDVEGGLSDALRTEENMEGTRVTTEEFEAPMPSAVSGDDSQLTASRSEEKDECAMISTSIGEEFELPISSATTIKCAESLQPVAAAVEERATGPVLISTADFEGPMPSAPPEAESPLASTSKEEKDECALISTSIAEECEASVSGVVVESENERAGTVMEEKDGSGIISTSSVEDCEGPVSSAVPQEEGDPSVTPAEEMGDTAMISTSTSEGCEAVMIGAVLQDEDRLTITRVEDLSDAAIISTSTAECMPISASIDRHEENQLTADNPEGNGDLSATEVSKHKVPMPSLIAENNCRCPGPVRGGKEPGPVLAVSTEEGHNGPSVHKPSAGQGHPSAVCAEKEEKHGKECPEIGPFAGRGQKESTLHLINAEEKNVLLNSLQKEDKSPETGTAGGSSTASYSAGRGLEGNANSPAHLRGPEQTSGQTAKDPSVSIRYLAAVNTGAIKADDMPPVQGTVAEHSFLPAEQQGSEDNLKTSTTKCITGQESKIAPSHTMIPPATYSVALLAPKCEQDLTIKNDYSGKWTDQASAEKTGDDNSTRKSFPEEGDIMVTVSSEENVCDIGNEESPLNVLGGLKLKANLKMEAYVPSEEEKNGEILAPPESLCGGKPSGIAELQREPLLVNESLNVENSGFRTNEEIHSESYNKGEISSGRKDNAEAISGHSVEADPKEVEEEERHMPKRKRKQHYLSSEDEPDDNPDVLDSRIETAQRQCPETEPHDTKEENSRDLEELPKTSSETNSTTSRVMEEKDEYSSSETTGEKPEQNDDDTIKSQEEDQPIIIKRKRGRPRKYPVETTLKMKDDSKTDTGIVTVEQSPSSSKLKVMQTDESNKETANLQERSISNDDGEEKIVTSVRRRGRKPKRSLTVSDDAESSEPERKRQKSVSDPVEDKKEQESDEEEEEEEEDEPSGATTRSTTRSEAQRSKTQLSPSIKRKREVSPPGARTRGQQRVEEAPVKKAKR,mutated_sequence,1.0,3051.0,UPI000066D9E3.a2m,UPI000066D9E3.npy,gnomAD
+UPI0000126665,UPI0000126665.csv,MDGPTRGHGLRKKRRSRSQRDRERRSRGGLGAGAAGGGGAGRTRALSLASSSGSDKEDNGKPPSSAPSRPRPPRRKRRESTSAEEDIIDGFAMTSFVTFEALEKDVALKPQERVEKRQTPLTKKKREALTNGLSFHSKKSRLSHPHHYSSDRENDRNLCQHLGKRKKMPKALRQLKPGQNSCRDSDSESASGESKGFHRSSSRERLSDSSAPSSLGTGYFCDSDSDQEEKASDASSEKLFNTVIVNKDPELGVGTLPEHDSQDAGPIVPKISGLERSQEKSQDCCKEPIFEPVVLKDPCPQVAQPIPQPQTEPQLRAPSPDPDLVQRTEAPPQPPPLSTQPPQGPPEAQLQPAPQPQVQRPPRPQSPTQLLHQNLPPVQAHPSAQSLSQPLSAYNSSSLSLNSLSSSRSSTPAKTQPAPPHISHHPSASPFPLSLPNHSPLHSFTPTLQPPAHSHHPNMFAPPTALPPPPPLTSGSLQVAGHPAGSTYSEQDILRQELNTRFLASQSADRGASLGPPPYLRTEFHQHQHQHQHTHQHTHQHTFTPFPHAIPPTAIMPTPAPPMFDKYPTKVDPFYRHSLFHSYPPAVSGIPPMIPPTGPFGSLQGAFQPKTSNPIDVAARPGTVPHTLLQKDPRLTDPFRPMLRKPGKWCAMHVHIAWQIYHHQQKVKKQMQSDPHKLDFGLKPEFLSRPPGPSLFGAIHHPHDLARPSTLFSAAGAAHPTGTPFGPPPHHSNFLNPAAHLEPFNRPSTFTGLAAVGGNAFGGLGNPSVTPNSMFGHKDGPSVQNFSNPHEPWNRLHRTPPSFPTPPPWLKPGELERSASAAAHDRDRDVDKRDSSVSKDDKERESVEKRHSSHPSPAPVLPVNALGHTRSSTEQIRAHLNTEAREKDKPKERERDHSESRKDLAADEHKAKEGHLPEKDGHGHEGRAAGEEAKQLARVPSPYVRTPVVESARPNSTSSREAEPRKGEPAYENPKKSSEVKVKEERKEDHDLPPEAPQTHRASEPPPPNSSSSVHPGPLASMPMTVGVTGIHPMNSISSLDRTRMMTPFMGISPLPGGERFPYPSFHWDPIRDPLRDPYRELDIHRRDPLGRDFLLRNDPLHRLSTPRLYEADRSFRDREPHDYSHHHHHHHHPLSVDPRREHERGGHLDERERLHMLREDYEHTRLHSVHPASLDGHLPHPSLITPGLPSMHYPRISPTAGNQNGLLNKTPPTAALSAPPPLISTLGGRPVSPRRTTPLSAEIRERPPSHTLKDIEAR,mutated_sequence,1.0,1259.0,UPI0000126665.a2m,UPI0000126665.npy,gnomAD
+UPI00000015B6,UPI00000015B6.csv,MRSPATGVPLPTPPPPLLLLLLLLLPPPLLGDQVGPCRSLGSRGRGSSGACAPMGWLCPSSASNLWLYTSRCRDAGTELTGHLVPHHDGLRVWCPESEAHIPLPPAPEGCPWSCRLLGIGGHLSPQGKLTLPEEHPCLKAPRLRCQSCKLAQAPGLRAGERSPEESLGGRRKRNVNTAPQFQPPSYQATVPENQPAGTPVASLRAIDPDEGEAGRLEYTMDALFDSRSNQFFSLDPVTGAVTTAEELDRETKSTHVFRVTAQDHGMPRRSALATLTILVTDTNDHDPVFEQQEYKESLRENLEVGYEVLTVRATDGDAPPNANILYRLLEGSGGSPSEVFEIDPRSGVIRTRGPVDREEVESYQLTVEASDQGRDPGPRSTTAAVFLSVEDDNDNAPQFSEKRYVVQVREDVTPGAPVLRVTASDRDKGSNAVVHYSIMSGNARGQFYLDAQTGALDVVSPLDYETTKEYTLRVRAQDGGRPPLSNVSGLVTVQVLDINDNAPIFVSTPFQATVLESVPLGYLVLHVQAIDADAGDNARLEYRLAGVGHDFPFTINNGTGWISVAAELDREEVDFYSFGVEARDHGTPALTASASVSVTVLDVNDNNPTFTQPEYTVRLNEDAAVGTSVVTVSAVDRDAHSVITYQITSGNTRNRFSITSQSGGGLVSLALPLDYKLERQYVLAVTASDGTRQDTAQIVVNVTDANTHRPVFQSSHYTVNVNEDRPAGTTVVLISATDEDTGENARITYFMEDSIPQFRIDADTGAVTTQAELDYEDQVSYTLAITARDNGIPQKSDTTYLEILVNDVNDNAPQFLRDSYQGSVYEDVPPFTSVLQISATDRDSGLNGRVFYTFQGGDDGDGDFIVESTSGIVRTLRRLDRENVAQYVLRAYAVDKGMPPARTPMEVTVTVLDVNDNPPVFEQDEFDVFVEENSPIGLAVARVTATDPDEGTNAQIMYQIVEGNIPEVFQLDIFSGELTALVDLDYEDRPEYVLVIQATSAPLVSRATVHVRLLDRNDNPPVLGNFEILFNNYVTNRSSSFPGGAIGRVPAHDPDISDSLTYSFERGNELSLVLLNASTGELKLSRALDNNRPLEAIMSVLVSDGVHSVTAQCALRVTIITDEMLTHSITLRLEDMSPERFLSPLLGLFIQAVAATLATPPDHVVVFNVQRDTDAPGGHILNVSLSVGQPPGPGGGPPFLPSEDLQERLYLNRSLLTAISAQRVLPFDDNICLREPCENYMRCVSVLRFDSSAPFIASSSVLFRPIHPVGGLRCRCPPGFTGDYCETEVDLCYSRPCGPHGRCRSREGGYTCLCRDGYTGEHCEVSARSGRCTPGVCKNGGTCVNLLVGGFKCDCPSGDFEKPYCQVTTRSFPAHSFITFRGLRQRFHFTLALSFATKERDGLLLYNGRFNEKHDFVALEVIQEQVQLTFSAGESTTTVSPFVPGGVSDGQWHTVQLKYYNKPLLGQTGLPQGPSEQKVAVVTVDGCDTGVALRFGSVLGNYSCAAQGTQGGSKKSLDLTGPLLLGGVPDLPESFPVRMRQFVGCMRNLQVDSRHIDMADFIANNGTVPGCPAKKNVCDSNTCHNGGTCVNQWDAFSCECPLGFGGKSCAQEMANPQHFLGSSLVAWHGLSLPISQPWYLSLMFRTRQADGVLLQAITRGRSTITLQLREGHVMLSVEGTGLQASSLRLEPGRANDGDWHHAQLALGASGGPGHAILSFDYGQQRAEGNLGPRLHGLHLSNITVGGIPGPAGGVARGFRGCLQGVRVSDTPEGVNSLDPSHGESINVEQGCSLPDPCDSNPCPANSYCSNDWDSYSCSCDPGYYGDNCTNVCDLNPCEHQSVCTRKPSAPHGYTCECPPNYLGPYCETRIDQPCPRGWWGHPTCGPCNCDVSKGFDPDCNKTSGECHCKENHYRPPGSPTCLLCDCYPTGSLSRVCDPEDGQCPCKPGVIGRQCDRCDNPFAEVTTNGCEVNYDSCPRAIEAGIWWPRTRFGLPAAAPCPKGSFGTAVRHCDEHRGWLPPNLFNCTSITFSELKGFAERLQRNESGLDSGRSQQLALLLRNATQHTAGYFGSDVKVAYQLATRLLAHESTQRGFGLSATQDVHFTENLLRVGSALLDTANKRHWELIQQTEGGTAWLLQHYEAYASALAQNMRHTYLSPFTIVTPNIVISVVRLDKGNFAGAKLPRYEALRGEQPPDLETTVILPESVFRETPPVVRPAGPGEAQEPEELARRQRRHPELSQGEAVASVIIYRTLAGLLPHNYDPDKRSLRVPKRPIINTPVVSISVHDDEELLPRALDKPVTVQFRLLETEERTKPICVFWNHSILVSGTGGWSARGCEVVFRNESHVSCQCNHMTSFAVLMDVSRRENGEILPLKTLTYVALGVTLAALLLTFFFLTLLRILRSNQHGIRRNLTAALGLAQLVFLLGINQADLPFACTVIAILLHFLYLCTFSWALLEALHLYRALTEVRDVNTGPMRFYYMLGWGVPAFITGLAVGLDPEGYGNPDFCWLSIYDTLIWSFAGPVAFAVSMSVFLYILAARASCAAQRQGFEKKGPVSGLQPSFAVLLLLSATWLLALLSVNSDTLLFHYLFATCNCIQGPFIFLSYVVLSKEVRKALKLACSRKPSPDPALTTKSTLTSSYNCPSPYADGRLYQPYGDSAGSLHSTSRSGKSQPSYIPFLLREESALNPGQGPPGLGDPGSLFLEGQDQQHDPDTDSDSDLSLEDDQSGSYASTHSSDSEEEEEEEEEEAAFPGEQGWDSLLGPGAERLPLHSTPKDGGPGPGKAPWPGDFGTTAKESSGNGAPEERLRENGDALSREGSLGPLPGSSAQPHKGILKKKCLPTISEKSSLLRLPLEQCTGSSRGSSASEGSRGGPPPRPPPRQSLQEQLNGVMPIAMSIKAGTVDEDSSGSEFLFFNFLH,mutated_sequence,1.0,2923.0,UPI00000015B6.a2m,UPI00000015B6.npy,gnomAD
+UPI0000070598,UPI0000070598.csv,MVVPEKEQSWIPKIFKKKTCTTFIVDSTDPGGTLCQCGRPRTAHPAVAMEDAFGAAVVTVWDSDAHTTEKPTDAYGELDFTGAGRKHSNFLRLSDRTDPAAVYSLVTRTWGFRAPNLVVSVLGGSGGPVLQTWLQDLLRRGLVRAAQSTGAWIVTGGLHTGIGRHVGVAVRDHQMASTGGTKVVAMGVAPWGVVRNRDTLINPKGSFPARYRWRGDPEDGVQFPLDYNYSAFFLVDDGTHGCLGGENRFRLRLESYISQQKTGVGGTGIDIPVLLLLIDGDEKMLTRIENATQAQLPCLLVAGSGGAADCLAETLEDTLAPGSGGARQGEARDRIRRFFPKGDLEVLQAQVERIMTRKELLTVYSSEDGSEEFETIVLKALVKACGSSEASAYLDELRLAVAWNRVDIAQSELFRGDIQWRSFHLEASLMDALLNDRPEFVRLLISHGLSLGHFLTPMRLAQLYSAAPSNSLIRNLLDQASHSAGTKAPALKGGAAELRPPDVGHVLRMLLGKMCAPRYPSGGAWDPHPGQGFGESMYLLSDKATSPLSLDAGLGQAPWSDLLLWALLLNRAQMAMYFWEMGSNAVSSALGACLLLRVMARLEPDAEEAARRKDLAFKFEGMGVDLFGECYRSSEVRAARLLLRRCPLWGDATCLQLAMQADARAFFAQDGVQSLLTQKWWGDMASTTPIWALVLAFFCPPLIYTRLITFRKSEEEPTREELEFDMDSVINGEGPVGTADPAEKTPLGVPRQSGRPGCCGGRCGGRRCLRRWFHFWGAPVTIFMGNVVSYLLFLLLFSRVLLVDFQPAPPGSLELLLYFWAFTLLCEELRQGLSGGGGSLASGGPGPGHASLSQRLRLYLADSWNQCDLVALTCFLLGVGCRLTPGLYHLGRTVLCIDFMVFTVRLLHIFTVNKQLGPKIVIVSKMMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRVFYRPYLQIFGQIPQEDMDVALMEHSNCSSEPGFWAHPPGAQAGTCVSQYANWLVVLLLVIFLLVANILLVNLLIAMFSYTFGKVQGNSDLYWKAQRYRLIREFHSRPALAPPFIVISHLRLLLRQLCRRPRSPQPSSPALEHFRVYLSKEAERKLLTWESVHKENFLLARARDKRESDSERLKRTSQKVDLALKQLGHIREYEQRLKVLEREVQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD,mutated_sequence,1.0,1214.0,UPI0000070598.a2m,UPI0000070598.npy,gnomAD
+UPI00003FEC88,UPI00003FEC88.csv,MKCFFPVLSCLAVLGVVSAQRQVTVQEGPLYRTEGSHITIWCNVSGYQGPSEQNFQWSIYLPSSPEREVQIVSTMDSSFPYAIYTQRVRGGKIFIERVQGNSTLLHITDLQARDAGEYECHTPSTDKQYFGSYSAKMNLVVIPDSLQTTAMPQTLHRVEQDPLELTCEVASETIQHSHLSVAWLRQKVGEKPVEVISLSRDFMLHSSSEYAQRQSLGEVRLDKLGRTTFRLTIFHLQPSDQGEFYCEAAEWIQDPDGSWYAMTRKRSEGAVVNVQPTDKEFTVRLETEKRLHTVGEPVEFRCILEAQNVPDRYFAVSWAFNSSLIATMGPNAVPVLNSEFAHREARGQLKVAKESDSVFVLKIYHLRQEDSGKYNCRVTEREKTVTGEFIDKESKRPKNIPIIVLPLKSSISVEVASNASVILEGEDLRFSCSVRTAGRPQGRFSVIWQLVDRQNRRSNIMWLDRDGTVQPGSSYWERSSFGGVQMEQVQPNSFSLGIFNSRKEDEGQYECHVTEWVRAVDGEWQIVGERRASTPISITALEMGFAVTAISRTPGVTYSDSFDLQCIIKPHYPAWVPVSVTWRFQPVGTVEFHDLVTFTRDGGVQWGDRSSSFRTRTAIEKAESSNNVRLSISRASDTEAGKYQCVAELWRKNYNNTWTRLAERTSNLLEIRVLQPVTKLQVSKSKRTLTLVENKPIQLNCSVKSQTSQNSHFAVLWYVHKPSDADGKLILKTTHNSAFEYGTYAEEEGLRARLQFERHVSGGLFSLTVQRAEVSDSGSYYCHVEEWLLSPNYAWYKLAEEVSGRTEVTVKQPDSRLRLSQAQGNLSVLETRQVQLECVVLNRTSITSQLMVEWFVWKPNHPERETVARLSRDATFHYGEQAAKNNLKGRLHLESPSPGVYRLFIQNVAVQDSGTYSCHVEEWLPSPSGMWYKRAEDTAGQTALTVMRPDASLQVDTVVPNATVSEKAAFQLDCSIVSRSSQDSRFAVAWYSLRTKAGGKRSSPGLEEQEEEREEEEEEDDDDDDDPTERTALLSVGPDAVFGPEGSPWEGRLRFQRLSPVLYRLTVLQASPQDTGNYSCHVEEWLPSPQKEWYRLTEEESAPIGIRVLDTSPTLQSIICSNDALFYFVFFYPFPIFGILIITILLVRFKSRNSSKNSDGKNGVPLLWIKEPHLNYSPTCLEPPVLSIHPGAID,mutated_sequence,1.0,1194.0,UPI00003FEC88.a2m,UPI00003FEC88.npy,gnomAD
+UPI0001CE93AE,UPI0001CE93AE.csv,MVVSAGPWSSEKAEMNILEINETLRPQLAEKKQQFRNLKEKCFLTQLAGFLANRQKKYKYEECKDLIKFMLRNERQFKEEKLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGRDASRSLYEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPENDEDEDEDVQVEEAEKVLESSAPREVQKAEESKVPEDSLEECAITCSNSHGPCDSNQPHKNIKITFEEDEVNSTLVVDRESSHDECQDALNILPVPGPTSSATNVSMVVSAGPLSSEKAEMNILEINEKLRPQLAEKKQQFRNLKEKCFLTQLSGFLANQQKKYKYEECKDLIKFMLRNERQFKEEKLAEQLKQAEELRQYKVLVHAQERELTQLKEKLREGRDASRSLNEHLQALLTPYEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPENDNDDDEDVQVEVAEKVQKSSAPREMQKAEEKEVPEDSLEECAITYSNSHGSYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEWEDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEGYSTLSIPPEMLASYQSYSSTFHSLEEQQVCMAVDIGRHRWDQVKKEDQEATGPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQRVGFAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQRVGFAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDEEEEEDQDPPCPRLSRELLEVVAPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQRVGMAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLNRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDEDQDPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEENHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEENHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEENHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEENHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEENHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPKVLQDSLDRCYSTPSGCLELCDSCQPYRSAFYVLEQQRVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELCDSCQPYRSAFYILEQQRVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELCDSCQPYRSAFYVLEQQHVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVDKIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLNGVLMEVEEREVLQDSLDRCYSTPSMYFELPDSFQHYRSVFYSFEEQHISFALYVDNRFFTLTVTSLHLVFQMGVIFPQ,mutated_sequence,1.0,3553.0,UPI0001CE93AE.a2m,UPI0001CE93AE.npy,gnomAD
+UPI00004CEC5B,UPI00004CEC5B.csv,MKEHIIYQKLYGLILMSSFIFLSDTLSLKGKKLDFFGRGDTYVSLIDTIPELSRFTACIDLVFMDDNSRYWMAFSYITNNALLGREDIDLGLAGDHQQLILYRLGKTFSIRHHLASFQWHTICLIWDGVKGKLELFLNKERILEVTDQPHNLTPHGTLFLGHFLKNESSEVKSMMRSFPGSLYYFQLWDHILENEEFMKCLDGNIVSWEEDVWLVNKIIPTVDRTLRCFVPENMTIQEKSTTVSQQIDMTTPSQITGVKPQNTAHSSTLLSQSIPIFATDYTTISYSNTTSPPLETMTAQKILKTLVDETATFAVDVLSTSSAISLPTQSISIDNTTNSMKKTKSPSSESTKTTKMVEAMATEIFQPPTPSNFLSTSRFTKNSVVSTTSAIKSQSAVTKTTSLFSTIESTSMSTTPCLKQKSTNTGALPISTAGQEFIESTAAGTVPWFTVEKTSPASTHVGTASSFPPEPVLISTAAPVDSVFPRNQTAFPLATTDMKIAFTVHSLTLPTRLIETTPAPRTAETELTSTNFQDVSLPRVEDAMSTSMSKETSSKTFSFLTSFSFTGTESVQTVIDAEATRTALTPEITLASTVAETMLSSTITGRVYTQNTPTADGHLLTLMSTRSASTSKAPESGPTSTTDEAAHLFSSNETIWTSRPDQALLASMNTTTILTFVPNENFTSAFHENTTYTEYLSATTNITPLKASPEGKGTTANDATTARYTTAVSKLTSPWFANFSIVSGTTSITNMPEFKLTTLLLKTIPMSTKPANELPLTPRETVVPSVDIISTLACIQPNFSTEESASETTQTEINGAIVFGGTTTPVPKSATTQRLNATVTRKEATSHYLMRKSTIAAVAEVSPFSTMLEVTDESAQRVTASVTVSSFPDIEKLSTPLDNKTATTEVRESWLLTKLVKTTPRSSYNEMTEMFNFNHTYVAHWTSETSEGISAGSPTSGSTHIFGEPLGASTTRISETSFSTTPTDRTATSLSDGILPPQPTAAHSSATPVPVTHMFSLPVNGSSVVAEETEVTMSEPSTLARAFSTSVLSDVSNLSSTTMTTALVPPLDQTASTTIVIVPTHGDLIRTTSEATVISVRKTSMAVPSLTETPFHSLRLSTPVTAKAETTLFSTSVDTVTPSTHTLVCSKPPPDNIPPASSTHVISTTSTPEATQPISQVEETSTYALSFPYTFSGGGVVASLATGTTETSVVDETTPSHISANKLTTSVNSHISSSATYRVHTPVSIQLVTSTSVLSSDKDQMTISLGKTPRTMEVTEMSPSKNSFISYSRGTPSLEMTDTGFPETTKISSHQTHSPSEIPLGTPSDGNLASSPTSGSTQITPTLTSSNTVGVHIPEMSTSLGKTALPSQALTITTFLCPEKESTSALPAYTPRTVEMIVNSTYVTHSVSYGQDTSFVDTTTSSSTRISNPMDINTTFSHLHSLRTQPEVTSVASFISESTQTFPESLSLSTAGLYNDGFTVLSDRITTAFSVPNVPTMLPRESSMATSTPIYQMSSLPVNVTAFTSKKVSDTPPIVITKSSKTMHPGCLKSPCTATSGPMSEMSSIPVNNSAFTPATVSSDTSTRVGLFSTLLSSVTPRTTMTMQTSTLDVTPVIYAGATSKNKMVSSAFTTEMIEAPSRITPTTFLSPTEPTLPFVKTVPTTIMAGIVTPFVGTTAFSPLSSKSTGAISSIPKTTFSPFLSATQQSSQADEATTLGILSGITNRSLSTVNSGTGVALTDTYSRITVPENMLSPTHADSLHTSFNIQVSPSLTSFKSASGPTKNVKTTTNCFSSNTRKMTSLLEKTSLTNYATSLNTPVSYPPWTPSSATLPSLTSFVYSPHSTEAEISTPKTSPPPTSQMVEFPVLGTRMTSSNTQPLLMTSWNIPTAEGSQFPISTTINVPTSNEMETETLHLVPGPLSTFTASQTGLVSKDVMAMSSIPMSGILPNHGLSENPSLSTSLRAITSTLADVKHTFEKMTTSVTPGTTLPSILSGATSGSVISKSPILTWLLSSLPSGSPPATVSNAPHVMTSSTVEVSKSTFLTSDMISAHPFTNLTTLPSATMSTILTRTIPTPTLGGITTGFPTSLPMSINVTDDIVYISTHPEASSRTTITANPRTVSHPSSFSRKTMSPSTTDHTLSVGAMPLPSSTITSSWNRIPTASSPSTLIIPKPTLDSLLNIMTTTSTVPGASFPLISTGVTYPFTATVSSPISSFFETTWLDSTPSFLSTEASTSPTATKSTVSFYNVEMSFSVFVEEPRIPITSVINEFTENSLNSIFQNSEFSLATLETQIKSRDISEEEMVMDRAILEQREGQEMATISYVPYSCVCQVIIKASSSLASSELMRKIKSKIHGNFTHGNFTQDQLTLLVNCEHVAVKKLEPGNCKADETASKYKGTYKWLLTNPTETAQTRCIKNEDGNATRFCSISINTGKSQWEKPKFKQCKLLQELPDKIVDLANITISDENAEDVAEHILNLINESPALGKEETKIIVSKISDISQCDEISMNLTHVMLQIINVVLEKQNNSASDLHEISNEILRIIERTGHKMEFSGQIANLTVAGLALAVLRGDHTFDGMAFSIHSYEEGTDPEIFLGNVPVGGILASIYLPKSLTERIPLSNLQTILFNFFGQTSLFKTKNVTKALTTYVVSASISDDMFIQNLADPVVITLQHIGGNQNYGQVHCAFWDFENNNGLGGWNSSGCKVKETNVNYTICQCDHLTHFGVLMDLSRSTVDSVNEQILALITYTGCGISSIFLGVAVVTYIAFHKLRKDYPAKILINLCTALLMLNLVFLINSWLSSFQKVGVCITAAVALHYFLLVSFTWMGLEAVHMYLALVKVFNIYIPNYILKFCLVGWGIPAIMVAITVSVKKDLYGTLSPTTPFCWIKDDSIFYISVVAYFCLIFLMNLSMFCTVLVQLNSVKSQIQKTRRKMILHDLKGTMSLTFLLGLTWGFAFFAWGPMRNFFLYLFAIFNTLQGFFIFVFHCVMKESVREQWQIHLCCGWLRLDNSSDGSSRCQIKVGYKQEGLKKIFEHKLLTPSLKSTATSSTFKSLGSAQGTPSEISFPNDDFDKDPYCSSP,mutated_sequence,1.0,3080.0,UPI00004CEC5B.a2m,UPI00004CEC5B.npy,gnomAD
+UPI000007315E,UPI000007315E.csv,MKGSNRNKDHSAEGEGVGKRPKRKCLQWHPLLAKKLLDFSEEEEEEDEEEDIDKVQLLGADGLEQDVGETEDDESPEQRARRPMNAFLLFCKRHRSLVRQEHPRLDNRGATKILADWWAVLDPKEKQKYTDMAKEYKDAFMKANPGYKWCPTTNKPVKSPTPTVNPRKKLWAFPSDSSRDLPSPKKAKTEEMPQLNFGMADPTQMGGLSMLLLAGEHALGTPEVSSGTCRPDVSESPELRQKSPLFQFAEISSSTSHSDASTKQCQTSALFQFAEISSNTSQLGGAEPVKRCGKSALFQLAEMCLASEGMKMEESKLIKAKESDGGRIKELEKGKEEKEIKMEKTDETRLQKEAEFEKSAKENLRDSKELRNFEALQIDDIMAIKMEDPKEIRKEELEEDHKCSHFPDFSYSASSKIIISDVPSRKDHMCHPHGIMIIEDPAALNKPEKLKKKKKKSKMDRHGNDKSTPKKTCKKRQSSESDIESVIYTIEAVAKGDWGIEKLGDTPRKKVRTSSSGKGSILDAKPPKKKVKSREKKMSKEKSSDTTKESRPPDFISISASKNISGETPEGIKAEPLTPMEDALPPSLSGQAKPEDSDCHRKIETCGSRKSERSCKGALYKTLVSEGMLTSLRANVDRGKRSSGKGNSSDHEGCWNEESWTFSQSGTSGSKKFKKTKPKEDCLLGSAKLDEEFEKKFNSLPQYSPVTFDRKCVPVPRKKKKTGNVSSEPTKTSKGPFQSQKKNLFHKIVSKYKHKKEKPNVPEKGSGDKWSNKQLFLDAIHPTEAIFSEDRNTMEPVHKVKNIPSIFNTPEPTTTQEPLVGSQKRKARKTKITHLVRTADGRVSPAGGTLDDKPKEQLQRSLPKATETDCNDKCSHNTEVGETRSSTPEMPAVSAFFSLAALAEVAAMENVHRGQRSTPLTHDGQPKEMPQAPVLISCADQ,mutated_sequence,1.0,941.0,UPI000007315E.a2m,UPI000007315E.npy,gnomAD
+UPI0000E0BF7B,UPI0000E0BF7B.csv,MAAAASVTGRVTWAASPMRSLGLGRRLSLPGPRLDAVTAAVNPSLSDHGNGLGRGTRGSGCSGGSLVADWGGGAAAAAAVALALAPALSTMRRGSSESELAARWEAEAVAAAKAAAKAEAEATAETVAEQVRVDAGAAGEPECKAGEEQPKVLAPAPAQPSAAEEGNTQVLQRPPPTLPPSKPKPVQGLCPHGKPRDKGRSCKRSSGHGSGENGSQRPVTVDSSKARTSLDALKISIRQLKWKEFPFGRRLPCDIYWHGVSFHDNDIFSGQVNKFPGMTEMVRKITLSRAVRTMQNLFPEEYNFYPRSWILPDEFQLFVAQVQMVKDDDPSWKPTFIVKPDGGCQGDGIYLIKDPSDIRLAGTLQSRPAVVQEYICKPLLIDKLKFDIRLYVLLKSLDPLEIYIAKDGLSRFCTEPYQEPTPKNLHRIFMHLTNYSLNIHSGNFIHSDSASTGSKRTFSSILCRLSSKGVDIKKVWSDIISVVIKTVIALTPELKVFYQSDIPTGRPGPTCFQILGFDILLMKNLKPILLEVNANPSMRIEHEHELSPGVFENVPSLVDEEVKVAVIRDTLRLMDPLKKKRENQSQQLEKPFAGKEDALDGELTSAPDCNANPEAHLPSICLKQVFPKYAKQFNYLRLVDRMANLFIRFLGIKGTMKLGPTGFRTFIRSCKLSSSSLSMAAVDILYIDITRRWNSMTLDQRDSGMCLQAFVEAFFFLAQRKFKMLPLHEQVASLIDLCEYHLSLLDEKRLVCGRGVPSGGRPPHRGPPQEPSPSAQPAGDNPPPRTSCANKLSHPRHTLS,mutated_sequence,1.0,800.0,UPI0000E0BF7B.a2m,UPI0000E0BF7B.npy,gnomAD
+UPI00001F9669,UPI00001F9669.csv,MEGVAVVTAGSVGAAKTEGAAALPPPPPVSPPALTPAPAAGEEGPAPLSETGAPGCSGSRPPELEPERSLGRFRGRFEDEDEELEEEEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEERLINLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQPLYNIRLDRQLQDIVYKLVINLEEREKKQMHDFYKERGLEVPKPAVPQPVPSSKGRSKKVLESVFRIPPELDMSLLLEFIGANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDPACQVDIICGDHLLEQYQTLREIRRAIGDAAMQDGLLVLHYGLVVSPLKIT,mutated_sequence,1.0,350.0,UPI00001F9669.a2m,UPI00001F9669.npy,gnomAD
+UPI000004FDF5,UPI000004FDF5.csv,MAKFALNQNLPDLGGPRLCPVPAAGGARSPSSPYSVETPYGFHLDLDFLKYIEELERGPAARRAPGPPTSRRPRAPRPGLAGARSPGAWTSSESLASDDGGAPGILSQGAPSGLLMQPLSPRAPVRNPRVEHTLRETSRRLELAQTHERAPSPGRGVPRSPRGSGRSSPAPNLAPASPGPAQLQLVREQMAAALRRLRELEDQARTLPELQEQVRALRAEKARLLAGRAQPEPDGEAETRPDKLAQLRRLTERLATSERGGRARASPRADSPDGLAAGRSEGALQVLDGEVGSLDGTPQTREVAAEAVPETREAGAQAVPETREAGVEAAPETVEADAWVTEALLGLPAAAERELELLRASLEHQRGVSELLRGRLRELEEAREAAEEAAAGARAQLREATTQTPWSCAEKAAQTESPAEAPSLTQESSPGSMDGDRAVAPAGILKSIMKKRDGTPGAQPSSGPKSLQFVGVLNGEYESSSSEDASDSDGDSENGGAEPPGSSSGSGDDSGGGSDSGTPGPPSGGDIRDPEPEAEAEPQQVAQGRCELSPRLREACVALQRQLSRPRGVASDGGAVRLVAQEWFRVSSQRRSQAEPVARMLEGVRRLGPELLAHVVNLADGNGNTALHYSVSHGNLAIASLLLDTGACEVNRQNRAGYSALMLAALTSVRQEEEDMAVVQRLFCMGDVNAKASQTGQTALMLAISHGRQDMVATLLACGADVNAQDADGATALMCASEYGRLDTVRLLLTQPGCDPAILDNEGTSALAIALEAEQDEVAALLHAHLSSGQPDTQSESPPGSQTATPGEGECGDNGENPQVQ,mutated_sequence,1.0,821.0,UPI000004FDF5.a2m,UPI000004FDF5.npy,gnomAD
+UPI000000CC09,UPI000000CC09.csv,MASDDFDIVIEAMLEAPYKKEEDEQQRKEVKKDYPSNTTSSTSNSGNETSGSSTIGETSKKKRSRSHNKSRDRKRSRSRDRDRYRRRNSRSRSPGRQCRHRSRSWDRRHGSESRSRDHRREDRVHYRSPPLATGYRYGHSKSPHFREKSPVREPVDNLSPEERDARTVFCMQLAARIRPRDLEDFFSAVGKVRDVRIISDRNSRRSKGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAMANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFITFSDSECARRALEQLNGFELAGRPMRVGHVTERLDGGTDITFPDGDQELDLGSAGGRFQLMAKLAEGAGIQLPSTAAAAAAAAAQAAALQLNGAVPLGALNPAALTALSPALNLASQCFQLSSLFTPQTM,mutated_sequence,1.0,439.0,UPI000000CC09.a2m,UPI000000CC09.npy,gnomAD
+UPI000022D996,UPI000022D996.csv,MFGAGDEDDTDFLSPSGGARLASLFGLDQAAAGHGNEFFQYTAPKQPKKGQGTAATGNQATPKTAPATMSTPTILVATAVHAYRYTNGQYVKQGKFGAAVLGNHTAREYRILLYISQQQPVTVARIHVNFELMVRPNNYSTFYDDQRQNWSIMFESEKAAVEFNKQVCIAKCNSTSSLDAVLSQDLIVADGPAVEVGDSLEVAYTGWLFQNHVLGQVFDSTANKDKLLRLKLGSGKVIKGWEDGMLGMKKGGKRLLIVPPACAVGSEGVIGWTQATDSILVFEVEVRRVKFARDSGSDGHSVSSRDSAAPSPIPGADNLSADPVVSPPTSIPFKSGEPALRTKSNSLSEQLAINTSPDAVKAKLISRMAKMGQPMLPILPPQLDSNDSEIEDVNTLQGGGQPVVTPSVQPSLHPAHPALPQMTSQAPQPSVTGLQAPSAALMQVSSLDSHSAVSGNAQSFQPYAGMQAYAYPQASAVTSQLQPVRPLYPAPLSQPPHFQGSGDMASFLMTEARQHNTEIRMAVSKVADKMDHLMTKVEELQKHSAGNSMLIPSMSVTMETSMIMSNIQRIIQENERLKQEILEKSNRIEEQNDKISELIERNQRYVEQSNLMMEKRNNSLQTATENTQARVLHAEQEKAKVTEELAAATAQVSHLQLKMTAHQKKETELQMQLTESLKETDLLRGQLTKVQAKLSELQETSEQAQSKFKSEKQNRKQLELKVTSLEEELTDLRVEKESLEKNLSERKKKSAQERSQAEEEIDEIRKSYQEELDKLRQLLKKTRVSTDQAAAEQLSLVQAELQTQWEAKCEHLLASAKDEHLQQYQEVCAQRDAYQQKLVQLQEKCLALQAQITALTKQNEQHIKELEKNKSQMSGVEAAASDPSEKVKKIMNQVFQSLRREFELEESYNGRTILGTIMNTIKMVTLQLLNQQEQEKEESSSEEEEEKAEERPRRPSQEQSASASSGQPQAPLNRERPESPMVPSEQVVEEAVPLPPQALTTSQDGHRRKGDSEAEALSEIKDGSLPPELSCIPSHRVLGPPTSIPPEPLGPVSMDSECEESLAASPMAAKPDNPSGKVCVREVAPDGPLQESSTRLSLTSDPEEGDPLALGPESPGEPQPPQLKKDDVTSSTGPHKELSSTEAGSTVAGAALRPSHHSQRSSLSGDEEDELFKGATLKALRPKAQPEEEDEDEVSMKGRPPPTPLFGDDDDDDDIDWLG,mutated_sequence,1.0,1219.0,UPI000022D996.a2m,UPI000022D996.npy,gnomAD
+UPI0000231C2D,UPI0000231C2D.csv,MGYFLKLYAYVNSHSLFVWVCDRSYKRSFRPMILNKIKELSRNQFSTMSHLRKDSQPSSPGDDAMDRSGLPDLQGRFELSGKNRQYPLDALEPQPSIGDIKDIKKAAKSMLDPAHKSHFHPVTPSLVFLCFIFDGLHQALLSVGVSKRSNTVVGNENEERGTPYASRFKDMPNFIALEKSSVLRHCCDLLIGIAAGSSDKICTSSLQVQRRFKAMMASIGRLSHGESADLLISCNAESAIGWISSRPWVGELMFTLLFGDFESPLHKLRKSS,mutated_sequence,1.0,272.0,UPI0000231C2D.a2m,UPI0000231C2D.npy,gnomAD
+UPI000015F240,UPI000015F240.csv,MTEFHLQSQMPSIRLIFRRLSLGRIKPSQSPRCSTSFMVVPSFSIAEHWRRMKGANLSQGMEFELLGLTTDPQLQRLLFVVFLGMYTATLLGNLVMFLLIHVSATLHTPMYSLLKSLSFLDFCYSSTVVPQTLVNFLAKRKVISYFGCMTQMFFYAGFATSECYLIAAMAYDRYAAICNPLLYSTIMSPEVCASLIVGSYSAGFLNSLIHTGCIFSLKFCGAHVVTHFFCDGPPILSLSCVDTSLCEILLFIFAGFNLLSCTLTILISYFLILNTILKMSSAQGRFKAFSTCASHLTAICLFFGTTLFMYLRPRSSYSLTQDRTVAVIYTVVIPVLNPLMYSLRNKDVKKALIKVWGRKTME,mutated_sequence,1.0,362.0,UPI000015F240.a2m,UPI000015F240.npy,gnomAD
+UPI00021CF3BF,UPI00021CF3BF.csv,XPTPGIVINRPNGTDVYQGVPKDYTGEDVTPQNFLAVLRGGVRSLRCSGPPGEPPLQGSLQDLLHKMTCSLLPASPVFSEKLQIRVGKALY,mutated_sequence,1.0,91.0,UPI00021CF3BF.a2m,UPI00021CF3BF.npy,gnomAD
+UPI000011D62A,UPI000011D62A.csv,MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSSKTGSVTTTYCCNQDHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTTLKDLIYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKLRPNIPNRWQSCEALRVMAKIMRECWYANGAARLTALRIKKTLSQLSQQEGIKM,mutated_sequence,1.0,503.0,UPI000011D62A.a2m,UPI000011D62A.npy,gnomAD
+UPI000013E507,UPI000013E507.csv,MEGSRPRAPSGHLAPSPPAFDGELDLQRYSNGPAVSAGSLGMGAVSWSESRAGERRFPCPVCGKRFRFNSILALHLRAHPGAQAFQCPHCGHRAAQRALLRSHLRTHQPERPRSPAARLLLELEERALLREARLGRARSSGGMQATPATEGLARPQAPSSSAFRCPYCKGKFRTSAERERHLHILHRPWKCGLCSFGSSQEEELLHHSLTAHGAPERPLAATSAAPPPQPQPQPPPQPEPRSVPQPEPEPEPEREATPTPAPAAPEEPPAPPEFRCQVCGQSFTQSWFLKGHMRKHKASFDHACPVCGRCFKEPWFLKNHMKVHASKLGPLRAPGPASGPARAPQPPDLGLLAYEPLGPALLLAPAPTPAERREPPSLLGYLSLRAGEGRPNGEGAEPGPGRSFGGFRPLSSALPARARRHRAEEPEEEEEVVEAEEETWARGRSLGSLASLHPRPGEGPGHSASAAGAQARSTATQEENGLLVGGTRPEGGRGATGKDCPFCGKSFRSAHHLKVHLRVHTGERPYKCPHCDYAGTQSGSLKYHLQRHHREQRSGAGPGPPPEPPPPSQRGSAPQSGAKPSPQPATWVEGASSPRPPSSGAGPGSRRKPASPGRTLRNGRGGEAEPLDLSLRAGPGGEAGPGGALHRCLFCPFATGAPELMALHLQVHHSRRARGRRPPQADASPPYARVPSGETPPSPSQEGEEGSGLSRPGEAGLGGQER,mutated_sequence,1.0,722.0,UPI000013E507.a2m,UPI000013E507.npy,gnomAD
+UPI000013DCDA,UPI000013DCDA.csv,MSTTLLSAFYDVDFLCKTEKSLANLNLNNMLDKKAVGTPVAAAPSSGFAPGFLRRHSASNLHALAHPAPSPGSCSPKFPGAANGSSCGSAAAGGPTSYGTLKEPSGGGGTALLNKENKFRDRSFSENGDRSQHLLHLQQQQKGGGGSQINSTRYKTELCRPFEESGTCKYGEKCQFAHGFHELRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFIHNADERRPAPSGGASGDLRAFGTRDALHLGFPREPRPKLHHSLSFSGFPSGHHQPPGGLESPLLLDSPTSRTPPPPSCSSASSCSSSASSCSSASAASTPSGAPTCCASAAAAAAAALLYGTGGAEDLLAPGAPCAACSSASCANNAFAFGPELSSLITPLAIQTHNFAAVAAAAYYRSQQQQQQQGLAPPAQPPAPPSATLPAGAAAPPSPPFSFQLPRRLSDSPVFDAPPSPPDSLSDRDSYLSGSLSSGSLSGSESPSLDPGRRLPIFSRLSISDD,mutated_sequence,1.0,494.0,UPI000013DCDA.a2m,UPI000013DCDA.npy,gnomAD
+UPI0001881A85,UPI0001881A85.csv,MAQGSGGREGALRTPAGGWHSPPSPDMQELLRSVERDLSIDPRQLAPAPGGTHVVALVPARWLASLRDRRLPLGPCPRAEGLGEAEVRTLLQRSVQRLPAGWTRVEVHGLRKRRLSYPLGGGLPFEDGSCGPETLTRFMQEVAAQNYRNLWRHAYHTYGQPYSHSPAPSAVPALDSVRQALQRVYGCSFLPVGETTQCPSYAREGPCPPRGSPACPSLLRAEALLESPEMLYVVHPYVQFSLHDVVTFSPAKLTNSQAKVLFILFRVLRAMDACHRQGLACGALSLYHIAVDEKLCSELRLDLSAYERPEEDENEEAPVARDEAGIVSQEEQGGQPGQPTGQEELRSLVLDWVHGRISNFHYLMQLNRLAGRRQGDPNYHPVLPWVVDFTTPHGRFRDLRKSKFRLNKGDKQLDFTYEMTRQAFVAGGAGGGEPPHVPHHISDVLSDITYYVYKARRTPRSVLCGHVRAQWEPHEYPASMERMQNWTPDECIPEFYTDPSIFRSIHPDMPDLDVPAWCSSSQEFVAAHRALLESREVSRDLHHWIDLTFGYKLQGKEAVKEKNVCLHLVDAHTHLASYGVVQLFDQPHPQRLAGAPALAPEPPLIPKLLVQTIQETTGREDFTENPGQLPNGVGRPVLEATPCEASWTRDRPVAGEDDLEQATEALDSISLAGKAGDQLGSSSQASPGLLSFSVASASRPGRRNKAAGADPGEGEEGRILLPEGFNPMQALEELEKTGNFLAKGLGGLLEVPEQPRVQPAVPLQCLLHRDMQALGVLLAEMVFATRVRTLQPDAPLWVRFQAVRGLCTRHPKEVPVSLQPVLDTLLQMSGPEVPMGAERGKLDQLFEYRPVSQGLPPPCPSQLLSPFSSVVPFPPYFPALHRFILLYQARRVEDEAQGRELVFALWQQLGAVLKDITPEGLEILLPFVLSLMSEEHTAVYTAWYLFEPVAKALGPKNANKYLLKPLIGAYESPCQLHGRFYLYTDCFVAQLMVRLGLQAFLTHLLPHVLQVLAGAEASQEESKDLAGAAEEEESGLPGAGPGSCAFGEEIPMDGEPPASSGLGLPDYTSGVSFHDQADLPETEDFQAGLYVTESPQPQEAEAVSLGRLSDKSSTSETSLGEERAPDEGGAPVDKSSLRSGDSSQDLKQSEGSEEEEEEEDSCVVLEEEEGEQEEVTGASELTLSDTVLSMETVVAGGSGGDGEEEEEALPEQSEGKEQKILLDTACKMVRWLSAKLGPTVASRHVARNLLRLLTSCYVGPTRQQFTVSSGESPPLSAGNIYQKRPVLGDIVSGPVLSCLLHIARLYGEPVLTYQYLPYISYLVAPGSASGPSRLNSRKEAGLLAAVTLTQKIIVYLSDTTLMDILPRISHEVLLPVLSFLTSLVTGFPSGAQARTILCVKTISLIALICLRIGQEMVQQHLSEPVATFFQVFSQLHELRQQDLKLDPAGRGEGQLPQVVFSDGQQRPVDPALLDELQKVFTLEMAYTIYVPFSCLLGDIIRKIIPNHELVGELAALYLESISPSSRNPASVEPTMPGTGPEWDPHGGGCPQDDGHSGTFGSVLVGNRIQIPNDSRPENPGPLGPISGVGGGGLGSGSDDNALKQELPRSVHGLSGNWLAYWQYEIGVSQQDAHFHFHQIRLQSFPGHSGAVKCVAPLSSEDFFLSGSKDRTVRLWPLYNYGDGTSETAPRLVYTQHRKSVFFVGQLEAPQHVVSCDGAVHVWDPFTGKTLRTVEPLDSRVPLTAVAVMPAPHTSITMASSDSTLRFVDCRKPGLQHEFRLGGGLNPGLVRALAISPSGRSVVAGFSSGFMVLLDTRTGLVLRGWPAHEGDILQIKAVEGSVLVSSSSDHSLTVWKELEQKPTHHYKSASDPIHTFDLYGSEVVTGTVSNKIGVCSLLEPPSQATTKLSSENFRGTLTSLALLPTKRHLLLGSDNGVIRLLA,mutated_sequence,1.0,1941.0,UPI0001881A85.a2m,UPI0001881A85.npy,gnomAD
+UPI0001AE62FE,UPI0001AE62FE.csv,MHSWTWDPHSSEQLDLAQSLSEEDPLSSLRGSIQPQPPAEAPDKGGSQANNDPGGWDPGCSSSQVLGRTRGPCLGVRVREEEAGTRVKENLPVWTVTGELQGKPLGNPAAGTMNPESSIFIEDYLKYFQDQVSRENLLQLLTDDEAWNGFVAAAELPRDEADELRKALNKLASHMVMKDKNRHDKDQQHRQWFLKEFPRLKRELEDHIRKLRALAEEVEQVHRGTTIANVVSNSVGTTSGILTLLGLGLAPFTEGISFVLLDTGMGLGAAAAVAGITCSVVELVNKLRARAQARNLDQSGTNVAKVMKEFVGGNTPNVLTLVDNWYQVTQGIGRNIRAIRRARANPQLGAYAPPPHIIGRISAEGGEQVERVVEGPAQAMSRGTMIVGAATGGILLLLDVVSLAYESKHLLEGAKSESAEELKKRAQELEGKLNFLTKIHEMLQPGQDQ,mutated_sequence,1.0,449.0,UPI0001AE62FE.a2m,UPI0001AE62FE.npy,gnomAD
+UPI0000418EA1,UPI0000418EA1.csv,MRVKPQGLVVTSSAVCSSPDYLREPKYYPGGPPTPRPLLPTRPPASPPDKAFSTHAFSENPRPPPRRDPSTRRPPVLAKGDDPLPPRAARPVSQARCPTPVGDGSSSRRCWDNGRVNLRPVVQLIDIMKDLTRLSQDLQHSGVHLDCGGLRLSRPPAPPPGDLQYSFFSSPSLANSIRSPEERATPHAKSERPSHPLYEPEPEPRDSPQPGQGHSPGATAAATGLPPEPEPDSTDYSELADADILSELASLTCPEAQLLEAQALEPPSPEPEPQLLDPQPRFLDPQALEPLGEALELPPLQPLADPLGLPGLALQALDTLPDSLESQLLDPQALDPLPKLLDVPGRRLEPQQPLGHCPLAEPLRLDLCSPHGPPGPEGHPKYALRRTDRPKILCRRRKAGRGRKADAGPEGRLLPLPMPTGLVAALAEPPPPPPPPPPALPGPGPVSVPELKPESSQTPVVSTRKGKCRGVRRMVVKMAKIPVSLGRRNKTTYKVSSLSSSLSVEGKELGLRVSAEPTPLLKMKNNGRNVVVVFPPGEMPIILKRKRGRPPKNLLLGPGKPKEPAVVAAEAATVAAATMAMPEVKKRRRRKQKLASPQPSYAADANDSKAEYSDVLAKLAFLNRQSQCAGRCSPPRCWTPSEPESVHQAPDTQSISHFLHRVQGFRRRGGKAGGFGGRGGGHAAKSARCSFSDFFEGIGKKKKVVAVAAAGVGGPGLTELGHPRKRGRGEVDAVTGKPKRKRRSRKNGTLFPEQVPSGPGFGEAGAEWAGDKGGGWAPHHGHPGGQAGRNCGFQGTEARAFASTGLESGASGRGSYYSTGAPSGQTELSQERQNLFTGYFRSLLDSDDSSDLLDFALSASRPESRKASGTYAGPPTSALPAQRGLATFPSRGAKASPVAVGSSGAGADPSFQPVLSARQTFPPGRAASYGLTPAASDCRAAETFPKLVPPPSAMARSPTTHPPANTYLPQYGGYGAGQSVFAPTKPFTGQDCANSKDCSFAYGSGNSLPASPSSAHSAGYAPPPTGGPCLPPSKASFFSSSEGAPFSGSAPTPLRCDSRASTVSPGGYMVPKGTTASATSAASAASSSSSSFQPSPENCRQFAGASQWPFRQGYGGLDWASEAFSQLYNPSFDCHVSEPNVILDISNYTPQKVKQQTAVSETFSESSSDSTQFNQPVGGGGFRRANSEASSSEGQSSLSSLEKLMMDWNEASSAPGYNWNQSVLFQSSSKPGRGRRKKVDLFEASHLGFPTSASAAASGYPSKRSTGPRQPRGGRGGGACSAKKERGGAAAKAKFIPKPQPVNPLFQDSPDLGLDYYSGDSSMSPLPSQSRAFGVGERDPCDFIGPYSMNPSTPSDGTFGQGFHCDSPSLGAPELDGKHFPPLAHPPTVFDAGLQKAYSPTCSPTLGFKEELRPPPTKLAACEPLKHGLQGASLGHAAAAQAHLSCRDLPLGQPHYDSPSCKGTAYWYPPGSAARSPPYEGKVGTGLLADFLGRTEAACLSAPHLASPPATPKADKEPLEMARPPGPPRGPAAAAAGYGCPLLSDLTLSPVPRDSLLPLQDTAYRYPGFMPQAHPGLGGGPKSGFLGPMAEPHPEDTFTVTSL,mutated_sequence,1.0,1603.0,UPI0000418EA1.a2m,UPI0000418EA1.npy,gnomAD
+UPI00001D9756,UPI00001D9756.csv,MGTAAAAAAAAAAAAAGEGARSPSPAAVSLGLGVAVVSSLVNGSTFVLQKKGIVRAKRRGTSYLTDIVWWAGTIAMAVGQIGNFLAYTAVPTVLVTPLGALGVPFGSILASYLLKEKLNILGKLGCLLSCAGSVVLIIHSPKSESVTTQAELEEKLTNPVFVGYLCIVLLMLLLLIFWIAPAHGPTNIMVYISICSLLGSFTVPSTKGIGLAAQDILHNNPSSQRALCLCLVLLAVLGCSIIVQFRYINKALECFDSSVFGAIYYVVFTTLVLLASAILFREWSNVGLVDFLGMACGFTTVSVGIVLIQVFKEFNFNLGEMNKSNMKTD,mutated_sequence,1.0,329.0,UPI00001D9756.a2m,UPI00001D9756.npy,gnomAD
+UPI000013E280,UPI000013E280.csv,MEEPPQEALAEPLKHESPAAPSSAGHTKGQEEDDQKNQAERKADNHTAHRIADQTALRVPSQAESSIFSQATNGVAEQNGHSTPGQAGRRASNPADVSDLRADDQVNQTPSEQTKGKASSQANNVQHEQSDGQVSGLTEERTAEQTERRLPTQAERRTSGQIDGRLAMPSDQRGSRQTDHRMAGQSERRASEQMDRRMSGEAERRTSEQITHRLSKLSERRPSVQIDSGSSVPSDQSPSVQIDSGSSVPSDQRPSVQIDRRMSGKVRRRSSEKTDYRLAGLADPGTSEQTDLRLYGLVDHKTSVKTHHQVYGQATELAEHQAIDQAHSNADQPPVDNAHYTESDQTDHLADRQANHKDQLSYYETRGQSEDRIFPQLGNSKEDKEADYRVQPCKFEDSQVDLNSKPSVEMETQNATTIPPYNPVDARFTSNFQAKDQALFPRLPSISSKLNYTSSQEKTQAIVTKSDEFSEIDQGKGYHIRNQTYRRFPSIVYEDPYQVSLQYMEKHHILQIFQQITENLVYEKPEDPLNFMLCQV,mutated_sequence,1.0,536.0,UPI000013E280.a2m,UPI000013E280.npy,gnomAD
+UPI000016864A,UPI000016864A.csv,MGLGRCIWEGWTLESEALRRDMGTWLLACICICTCVCLGVSVTGEGQGPRSRTFTCLTNNILRIDCHWSAPELGQGSSPWLLFTSNQAPGGTHKCILRGSECTVVLPPEAVLVPSDNFTITFHHCMSGREQVSLVDPEYLPRRHVKLDPPSDLQSNISSGHCILTWSISPALEPMTTLLSYELAFKKQEEAWEQAQHRDHIVGVTWLILEAFELDPGFIHEARLRVQMATLEDDVVEEERYTGQWSEWSQPVCFQAPQRQGPLIPPWGWPGNTLVAVSIFLLLTGPTYLLFKLSPRVKRIFYQNVPSPAMFFQPLYSVHNGNFQTWMGAHGAGVLLSQDCAGTPQGALEPCVQEATALLTCGPARPWKSVALEEEQEGPGTRLPGNLSSEDVLPAGCTEWRVQTLAYLPQEDWAPTSLTRPAPPDSEGSRSSSSSSSSNNNNYCALGCYGGWHLSALPGNTQSSGPIPALACGLSCDHQGLETQQGVAWVLAGHCQRPGLHEDLQGMLLPSVLSKARSWTF,mutated_sequence,1.0,521.0,UPI000016864A.a2m,UPI000016864A.npy,gnomAD
+UPI00001AE786,UPI00001AE786.csv,MAHHLPAAMESHQDFRSIKAKFQASQPEPSDLPKKPPKPEFGKLKKFSQPELSEHPKKAPLPEFGAVSLKPPPPEVTDLPKKPPPPEVTDLPKKPPPPEVTDLPKKPPPPEVTDLPKKPSKLELSDLSKKFPQLGATPFPRKPLQPEVGEAPLKASLPEPGAPARKPLQPDELSHPARPPSEPKSGAFPRKLWQPEAGEATPRSPQPELSTFPKKPAQPEFNVYPKKPPQPQVGGLPKKSVPQPEFSEAAQTPLWKPQSSEPKRDSSAFPKKASQPPLSDFPKKPPQPELGDLTRTSSEPEVSVLPKRPRPAEFKALSKKPPQPELGGLPRTSSEPEFNSLPRKLLQPERRGPPRKFSQPEPSAVLKRHPQPEFFGDLPRKPPLPSSASESSLPAAVAGFSSRHPLSPGFGAAGTPRWRSGGLVHSGGARPGLRPSHPPRRRPLPPASSLGHPPAKPPLPPGPVDMQSFRRPSAASIDLRRTRSAAGLHFQDRQPEDIPQVPDEIYELYDDVEPRDDSSPSPKGRDEAPSVQQAARRPPQDPALRKEKDPQPQQLPPMDPKLLKQLRKAEKAEREFRKKFKFEGEIVVHTKMMIDPNAKTRRGGGKHLGIRRGEILEVIEFTSNEEMLCRDPKGKYGYVPRTALLPLETEVYDDVDFCDPLENQPLPLGR,mutated_sequence,1.0,670.0,UPI00001AE786.a2m,UPI00001AE786.npy,gnomAD
+UPI000166C19F,UPI000166C19F.csv,MPQIIKMTRILTAFKVVRTLKTGFGFTNVTAHQKWKFSRPGIRLLSVKAQTAHIVLEDGTKMKGYSFGHPSSVAGEVVFNTGLGGYPEAITDPAYKGQILTMANPIIGNGGAPDTTALDELGLSKYLESNGIKVSGLLVLDYSKDYNHWLATKSLGQWLQEEKVPAIYGVDTRMLTKIIRDKGTMLGKIEFEGQPVDFVDPNKQNLIAEVSTKDVKVYGKGNPTKVVAVDCGIKNNVIRLLVKRGAEVHLVPWNHDFTKMEYDGILIAGGPGNPALAEPLIQNVRKILESDRKEPLFGISTGNLITGLAAGAKTYKMSMANRGQNQPVLNITNKQAFITAQNHGYALDNTLPAGWKPLFVNVNDQTNEGIMHESKPFFAVQFHPEVTPGPIDTEYLFDSFFSLIKKGKATTITSVLPKPALVASRVEVSKVLILGSGGLSIGQAGEFDYSGSQAVKAMKEENVKTVLMNPNIASVQTNEVGLKQADTVYFLPITPQFVTEVIKAEQPDGLILGMGGQTALNCGVELFKRGVLKEYGVKVLGTSVESIMATEDRQLFSDKLNEINEKIAPSFAVESIEDALKAADTIGYPVMIRSAYALGGLGSGICPNRETLMDLSTKAFAMTNQILVEKSVTGWKEIEYEVVRDADDNCVTVCNMENVDAMGVHTGDSVVVAPAQTLSNAEFQMLRRTSINVVRHLGIVGECNIQFALHPTSMEYCIIEVNARLSRSSALASKATGYPLAFIAAKIALGIPLPEIKNVVSGKTSACFEPSLDYMVTKIPRWDLDRFHGTSSRIGSSMKSVGEVMAIGRTFEESFQKALRMCHPSIEGFTPRLPMNKEWPSNLDLRKELSEPSSTRIYAIAKAIDDNMSLDEIEKLTYIDKWFLYKMRDILNMEKTLKGLNSESMTEETLKRAKEIGFSDKQISKCLGLTEAQTRELRLKKNIHPWVKQIDTLAAEYPSVTNYLYVTYNGQEHDVNFDDHGMMVLGCGPYHIGSSVEFDWCAVSSIRTLRQLGKKTVVVNCNPETVSTDFDECDKLYFEELSLERILDIYHQEACGGCIISVGGQIPNNLAVPLYKNGVKIMGTSPLQIDRAEDRSIFSAVLDELKVAQAPWKAVNTLNEALEFAKSVDYPCLLRPSYVLSGSAMNVVFSEDEMKKFLEEATRVSQEHPVVLTKFVEGAREVEMDAVGKDGRVISHAISEHVEDAGVHSGDATLMLPTQTISQGAIEKVKDATRKIAKAFAISGPFNVQFLVKGNDVLVIECNLRASRSFPFVSKTLGVDFIDVATKVMIGENVDEKHLPTLDHPIIPADYVAIKAPMFSWPRLRDADPILRCEMASTGEVACFGEGIHTAFLKAMLSTGFKIPQKGILIGIQQSFRPRFLGVAEQLHNEGFKLFATEATSDWLNANNVPATPVAWPSQEGQNPSLSSIRKLIRDGSIDLVINLPNNNTKFVHDNYVIRRTAVDSGIPLLTNFQVTKLFAEAVQKSRKVDSKSLFHYRQYSAGKAA,mutated_sequence,1.0,1506.0,UPI000166C19F.a2m,UPI000166C19F.npy,gnomAD
+UPI000022BFB0,UPI000022BFB0.csv,MDISKGLPGMQGGLHIWISENRKMVPVPEGAYGNFFEEHCYVILHVPQSPKATQGASSDLHYWVGKQAGAEAQGAAEAFQQRLQDELGGQTVLHREAQGHESDCFCSYFRPGIIYRKGGLASDLKHVETNLFNIQRLLHIKGRKHVSATEVELSWNSFNKGDIFLLDLGKMMIQWNGPKTSISEKARGLALTYSLRDRERGGGRAQIGVVDDEAKAPDLMQIMEAVLGRRVGSLRAATPSKDINQLQKANVRLYHVYEKGKDLVVLELATPPLTQDLLQEEDFYILDQGGFKIYVWQGRMSSLQERKAAFSRAVGFIQAKGYPTYTNVEVVNDGAESAAFKQLFRTWSEKRRRNQKLGGRDKSIHVKLDVGKLHTQPKLAAQLRMVDDGSGKVEVWCIQDLHRQPVDPKRHGQLCAGNCYLVLYTYQRLGRVQYILYLWQGHQATADEIEALNSNAEELDVMYGGVLVQEHVTMGSEPPHFLAIFQGQLVIFQERAGHHGKGQSASTTRLFQVQGTDSHNTRTMEVPARASSLNSSDIFLLVTASVCYLWFGKGCNGDQREMARVVVTVISRKNEETVLEGQEPPHFWEALGGRAPYPSNKRLPEEVPSFQPRLFECSSHMGCLVLAEVGFFSQEDLDKYDIMLLDTWQEIFLWLGEAASEWKEAVAWGQEYLKTHPAGRSPATPIVLVKQGHEPPTFIGWFFTWDPYKWTSHPSHKEVVDGSPAAASTISEITAEVNNLRLSRWPGNGRAGAVALQALKGSQDSSENDLVRSPKSAGSRTSSSVSSTSATINGGLRREQLMHQAVEDLPEGVDPARREFYLSDSDFQDIFGKSKEEFYSMATWRQRQEKKQLGFF,mutated_sequence,1.0,856.0,UPI000022BFB0.a2m,UPI000022BFB0.npy,gnomAD
+UPI0000413F35,UPI0000413F35.csv,MALAAAAAAAAAGVSQAAVLGFLQEHGGKVRNSELLSRFKPLLDAGDPRGRAARRDRFKQFVNNVAVVKELDGVKFVVLRKKPRPPEPEPAPFGPPGAAAQPSKPTSTVLPRSASAPGAPPLVRVPRPVEPPGDLGLPTEPQDTPGGPASEPAQPPGERSADPPLPALELAQATERPSADAAPPPRAPSEAASPCSDPPDAEPGPGAAKGPPQQKPCMLPVRCVPAPATLRLRAEEPGLRRQLSEEPSPRSSPLLLRRLSVEESGLGLGLGPGRSPHLRRLSRAGPRLLSPDAEELPAAPPPSAVPLEPSEHEWLVRTAGGRWTHQLHGLLLRDRGLAAKRDFMSGFTALHWAAKSGDGEMALQLVEVARRSGAPVDVNARSHGGYTPLHLAALHGHEDAAVLLVVRLGAQVHVRDHSGRRAYQYLRPGSSYALRRLLGDPGLRGTTEPDATGGGSGSLAARRPVQVAATILSSTTSAFLGVLADDLMLQDLARGLKKSSSFSKFLSASPMAPRKKTKIRGGLPAFSEISRRPTPGPLAGLVPSFPPTT,mutated_sequence,1.0,549.0,UPI0000413F35.a2m,UPI0000413F35.npy,gnomAD
+UPI000013CFAC,UPI000013CFAC.csv,MKPNFSLRLRIFNLNCWGIPYLSKHRADRMRRLGDFLNQESFDLALLEEVWSEQDFQYLRQKLSPTYPAAHHFRSGIIGSGLCVFSKHPIQELTQHIYTLNGYPYMIHHGDWFSGKAVGLLVLHLSGMVLNAYVTHLHAEYNRQKDIYLAHRVAQAWELAQFIHHTSKKADVVLLCGDLNMHPEDLGCCLLKEWTGLHDAYLETRDFKGSEEGNTMVPKNCYVSQQELKPFPFGVRIDYVLYKAVSGFYISCKSFETTTGFDPHRGTPLSDHEALMATLFVRHSPPQQNPSSTHGPAERSPLMCVLKEAWTELGLGMAQARWWATFASYVIGLGLLLLALLCVLAAGGGAGEAAILLWTPSVGLVLWAGAFYLFHVQEVNGLYRAQAELQHVLGRAREAQDLGPEPQPALLLGQQEGDRTKEQ,mutated_sequence,1.0,423.0,UPI000013CFAC.a2m,UPI000013CFAC.npy,gnomAD
+UPI0000205DDE,UPI0000205DDE.csv,MAIEGGERTCGVHELICIRKVSPEAVGFLSAVGVFIILMLLLFLYINKKFCFENVGGFPDLGSEYSTRKNSQDKIYNSYMDKDEHGSSSESEDEALGKYHEALSRTHNSRLPLADSRQRNYAWETRQKYSPLSAEYDGYSSEASIDEGNCIQRMRRTPPLDELQPPPYQDDSGSPHLSCTPSEIGDSKCEFSHCSNSPRCSYNKCPSEGSTGHEIESFHNKGYEEDVPSDSTAVLSPEDMSAQGSSSQLPKPFDPEPEAKYGTLDVTFDYDSQEQKLLVTVTAVTDIPTYNRTGGNSWQVHLVLLPIKKQRAKTSIQRGPCPVFTETFKFNHVESEMIGNYAVRFRLYGVHRMKKEKIVGEKIFYLTKLNLQGKMSLPVILEPSYNHSGCDSQMSVSEMSCSESTSSCQSLEHGSVPEILIGLLYNATTGRLSAEVIKGSHFKNLAANRPPNTYVKLTLLNSMGQEMSKCKTSIRRGQPNPVYKETFVFQVALFQLSDVTLILSVYNKRSMKRKEMIGWISLGLNSSGEEELNHWTEMKESKGQQVCRWHALLES,mutated_sequence,1.0,555.0,UPI0000205DDE.a2m,UPI0000205DDE.npy,gnomAD
+UPI0000046C44,UPI0000046C44.csv,MKLLLLHPAFQSCLLLTLLGLWRTTPEAHASSLGAPAISAASFLQDLIHRYGEGDSLTLQQLKALLNHLDVGVGRGNVTQHVQGHRNLSTCFSSGDLFTAHNFSEQSRIGSSELQEFCPTILQQLDSRACTSENQENEENEQTEEGRPSAVEVWGYGLLCVTVISLCSLLGASVVPFMKKTFYKRLLLYFIALAIGTLYSNALFQLIPEAFGFNPLEDYYVSKSAVVFGGFYLFFFTEKILKILLKQKNEHHHGHSHYASESLPSKKDQEEGVMEKLQNGDLDHMIPQHCSSELDGKAPMVDEKVIVGSLSVQDLQASQSACYWLKGVRYSDIGTLAWMITLSDGLHNFIDGLAIGASFTVSVFQGISTSVAILCEEFPHELGDFVILLNAGMSIQQALFFNFLSACCCYLGLAFGILAGSHFSANWIFALAGGMFLYISLADMFPEMNEVCQEDERKGSILIPFIIQNLGLLTGFTIMVVLTMYSGQIQIG,mutated_sequence,1.0,492.0,UPI0000046C44.a2m,UPI0000046C44.npy,gnomAD
+UPI0000037465,UPI0000037465.csv,MGRWCQTVARGQRPRTSAPSRAGALLLLLLLLRSAGCWGAGEAPGALSTADPADQSVQCVPKATCPSSRPRLLWQTPTTQTLPSTTMETQFPVSEGKVDPYRSCGFSYEQDPTLRDPEAVARRWPWMVSVRANGTHICAGTIIASQWVLTVAHCLIWRDVIYSVRVGSPWIDQMTQTASDVPVLQVIMHSRYRAQRFWSWVGQANDIGLLKLKQELKYSNYVRPICLPGTDYVLKDHSRCTVTGWGLSKADGMWPQFRTIQEKEVIILNNKECDNFYHNFTKIPTLVQIIKSQMMCAEDTHREKFCYELTGEPLVCSMEGTWYLVGLVSWGAGCQKSEAPPIYLQVSSYQHWIWDCLNGQALALPAPSRTLLLALPLPLSLLAAL,mutated_sequence,1.0,385.0,UPI0000037465.a2m,UPI0000037465.npy,gnomAD
+UPI0000130507,UPI0000130507.csv,MRQVCCSALPPPPLEKGRCSSYSDSSSSSSERSSSSSSSSSESGSSSRSSSNNSSISRPAAPPEPRPQQQPQPRSPAARRAAARSRAAAAGGMRRDPAPGFSMLLFGVSLACYSPSLKSVQDQAYKAPVVVEGKVQGLVPAGGSSSNSTREPPASGRVALVKVLDKWPLRSGGLQREQVISVGSCVPLERNQRYIFFLEPTEQPLVFKTAFAPLDTNGKNLKKEVGKILCTDCATRPKLKKMKSQTGQVGEKQSLKCEAAAGNPQPSYRWFKDGKELNRSRDIRIKYGNGRKNSRLQFNKVKVEDAGEYVCEAENILGKDTVRGRLYVNSVSTTLSSWSGHARKCNETAKSYCVNGGVCYYIEGINQLSCKCPNGFFGQRCLEKLPLRLYMPDPKQKAEELYQKRVLTITGICVALLVVGIVCVVAYCKTKKQRKQMHNHLRQNMCPAHQNRSLANGPSHPRLDPEEIQMADYISKNVPATDHVIRRETETTFSGSHSCSPSHHCSTATPTSSHRHESHTWSLERSESLTSDSQSGIMLSSVGTSKCNSPACVEARARRAAAYNLEERRRATAPPYHDSVDSLRDSPHSERYVSALTTPARLSPVDFHYSLATQVPTFEITSPNSAHAVSLPPAAPISYRLAEQQPLLRHPAPPGPGPGPGPGPGPGADMQRSYDSYYYPAAGPGPRRGTCALGGSLGSLPASPFRIPEDDEYETTQECAPPPPPRPRARGASRRTSAGPRRWRRSRLNGLAAQRARAARDSLSLSSGSGGGSASASDDDADDADGALAAESTPFLGLRGAHDALRSDSPPLCPAADSRTYYSLDSHSTRASSRHSRGPPPRAKQDSAPL,mutated_sequence,1.0,850.0,UPI0000130507.a2m,UPI0000130507.npy,gnomAD
+UPI000013E60F,UPI000013E60F.csv,MALSLWPLLLLLLLLLLLSFAVTLAPTGPHSLDPGLSFLKSLLSTLDQAPQGSLSRSRFFTFLANISSSFEPGRMGEGPVGEPPPLQPPALRLHDFLVTLRGSPDWEPMLGLLGDMLALLGQEQTPRDFLVHQAGVLGGLVEVLLGALVPGGPPTPTRPPCTRDGPSDCVLAADWLPSLLLLLEGTRWQALVQVQPSVDPTNATGLDGREAAPHFLQGLLGLLTPTGELGSKEALWGGLLRTVGAPLYAAFQEGLLRVTHSLQDEVFSILGQPEPDTNGQCQGGNLQQLLLWGVRHNLSWDVQALGFLSGSPPPPPALLHCLSTGVPLPRASQPSAHISPRQRRAITVEALCENHLGPAPPYSISNFSIHLLCQHTKPATPQPHPSTTAICQTAVWYAVSWAPGAQGWLQACHDQFPDEFLDAICSNLSFSALSGSNRRLVKRLCAGLLPPPTSCPEGLPPVPLTPDIFWGCFLENETLWAERLCGEASLQAVPPSNQAWVQHVCQGPTPDVTASPPCHIGPCGERCPDGGSFLVMVCANDTMYEVLVPFWPWLAGQCRISRGGNDTCFLEGLLGPLLPSLPPLGPSPLCLTPGPFLLGMLSQLPRCQSSVPALAHPTRLHYLLRLLTFLLGPGAGGAEAQGMLGRALLLSSLPDNCSFWDAFRPEGRRSVLRTIGEYLEQDEEQPTPSGFEPTVNPSSGISKMELLACFSPVLWDLLQREKSVWALQILVQAYLHMPPENLQQLVLSAEREAAQGFLTLMLQGKLQGKLQVPPSEEQALGRLTALLLQRYPRLTSQLFIDLSPLIPFLAVSDLMRFPPSLLANDSVLAAIRDYSPGMRPEQKEALAKRLLAPELFGEVPAWPQELLWAVLPLLPHLPLENFLQLSPHQIQALEDSWPAAGLGPGHARHVLRSLVNQSVQDGEEQVRRLGPLACFLSPEELQSLVPLSDPTGPVERGLLECAANGTLSPEGRVAYELLGVLRSSGGAVLSPRELRVWAPLFSQLGLRFLQELSEPQLRAMLPVLQGTSVTPAQAVLLLGRLLPRHDLSLEELCSLHLLLPGLSPQTLQAIPRRVLVGACSCLAPELSRLSACQTAALLQTFRVKDGVKNMGTTGAGPAVCIPGQPIPTTWPDCLLPLLPLKLLQLDSLALLANRRRYWELPWSEQQAQFLWKKMQVPTNLTLRNLQALGTLAGGMSCEFLQQINSMVDFLEVVHMIYQLPTRVRGSLRACIWAELQRRMAMPEPEWTTVGPELNGLDSKLLLDLPIQLMDRLSNESIMLVVELVQRAPEQLLALTPLHQAALAERALQNLAPKETPVSGEVLETLGPLVGFLGTESTRQIPLQILLSHLSQLQGFCLGETFATELGWLLLQESVLGKPELWSQDEVEQAGRLVFTLSTEAISLIPREALGPETLERLLEKQQSWEQSRVGQLCREPQLAAKKAALVAGVVRPAAEDLPEPVPNCADVRGTFPAAWSATQIAEMELSDFEDCLTLFAGDPGLGPEELRAAMGKAKQLWGPPRGFRPEQILQLGRLLIGLGDRELQELILVDWGVLSTLGQIDGWSTTQLRIVVSSFLRQSGRHVSHLDFVHLTALGYTLCGLRPEELQHISSWEFSQAALFLGTLHLQCSEEQLEVLAHLLVLPGGFGPISNWGPEIFTEIGTIAAGIPDLALSALLRGQIQGVTPLAISVIPPPKFAVVFSPIQLSSLTSAQAVAVTPEQMAFLSPEQRRAVAWAQHEGKESPEQQGRSTAWGLQDWSRPSWSLVLTISFLGHLL,mutated_sequence,1.0,1775.0,UPI000013E60F.a2m,UPI000013E60F.npy,gnomAD
+UPI0000041BDE,UPI0000041BDE.csv,MVATNNVTEIIFVGFSQNWSEQRVISVMFLLMYTAVVLGNGLIVVTILASKVLTSPMYFFLSYLSFVEICYCSVMAPKLIFDSFIKRKVISLKGCLTQMFSLHFFGGTEAFLLMVMAYDRYVAICKPLHYMAIMNQRMCGLLVRIAWGGGLLHSVGQTFLIFQLPFCGPNIMDHYFCDVHPVLELACADTFFISLLIITNGGSISVVSFFVLMASYLIILHFLRSHNLEGQHKALSTCASHVTVVDLFFIPCSLVYIRPCVTLPADKIVAVFYTVVTPLLNPVIYSFRNAEVKNAMRRFIGGKVI,mutated_sequence,1.0,305.0,UPI0000041BDE.a2m,UPI0000041BDE.npy,gnomAD
+UPI000006EEED,UPI000006EEED.csv,MLQAADFIERTETAGELSRGLIGVLSSQISWCLLNVNLSKLPTRLQRLSCSVLNSSPAMRGGARGRPQLTLERPLRPGCRLHSCSEAEKGGFVRRKEIILFPPCEDPARGWLSANPGREPSPGICWHLNLGLPSLHNCEE,mutated_sequence,1.0,140.0,UPI000006EEED.a2m,UPI000006EEED.npy,gnomAD
+UPI0000EADF0B,UPI0000EADF0B.csv,MEIEVSVAECKSVPGITSTPHPMDHPSAFYSPPHNGLLTDHHESLDNDVAREIRYLDEVLEANCCDSAVDGTYNGTSSPEPGAVVLVGGLSPPVHEATQPEPTERTASRQAPPHIELSNSSPDPMAEAERTNGHSPSQPRDALGDSLQVPVSPSSTTSSRCSSRDGEFTLTTLKKEAKFELRAFHEDKKPSKLFEDDEHEKEQYCIRKVRPSEEMLELEKERRELIRSQAVKKNPGIAAKWWNPPQEKTIEEQLDEEHLESHKKYKERKERRAQQEQLLLQKQLQQQQQQPPSQLCTAPASSHERASMIDKAKEDIVTEQIDFSAARKQFQLMENSRQAVAKGQSTPRLFSIKPFYRPLGSVNSDKPLTNPRPPSVGGPPEDSGASAAKGQKSPGALETPSAAGSQGNTASQGKEGPYSEPSKRGPLSKLWAEDGEFTSARAVLTVVKDDDHGILDQFSRSVNVSLTQEELDSGLDELSVRSQDTTVLETLSNDFSMDNISDSGASNETTNALQENSLADFSLPQTPQTDNPSEGRGEGVSKSFSDHGFYSPSSTLGDSPLVDDPLEYQAGLLVQNAIQQAIAEQVDKAVSKTSRDGAEQQGPEATVEEAEAAAFGSEKPQSMFEPPQVSSPVQEKRDVLPKILPAEDRALRERGPPQPLPAVQPSGPINMEETRPEGSYFSKYSEAAELRSTASLLATQESDVMVGPFKLRSRKQRTLSMIEEEIRAAQEREEELKRQRQVLQSTQSPRTKNAPSLPSRTCYKTAPGKIEKVKPPPSPTTEGPSLQPDLAPEEAAGTQRPKNLMQTLMEDYETHKSKRRERMDDSSVLEATRVNRRKSALALRWEAGIYANQEEEDNE,mutated_sequence,1.0,859.0,UPI0000EADF0B.a2m,UPI0000EADF0B.npy,gnomAD
+UPI0000D820CC,UPI0000D820CC.csv,MTEITAEGNASTTTTVIDSKNGSVPKSPGKVLKRTVTEDIVTTFSSPAAWLLVIALIITWSAVAIVMFDLVDYKNFSASSIAKIGSDPLKLVRDAMEETTDWIYGFFSLLSDIISSEDEEDDDGDEDTDKGEIDEPPLRKKEIHKDKTEKQEKPERKIQTKVTHKEKEKGKEKVREKEKPEKKATHKEKIEKKEKPETKTLAKEQKKAKTAEKSEEKTKKEVKGGKQEKVKQTAAKVKEVQKTPSKPKEKEDKEKAAVSKHEQKDQYAFCRYMIDIFVHGDLKPGQSPAIPPPLPTEQASRPTPASPALEEKEGEKKKAEKKVTSETKKKEKEDIKKKSEKETAIDVEKKEPGKASETKQGTVKIAAQAAAKKDEKKEDSKKTKKPAEVEQPKGKKQEKKEKHVEPAKSPKKEHSVPSDKQVKAKTERAKEEIGAVSIKKAVPGKKEEKTTKTVEQEIRKEKSGKTSSILKDKEPIKGKEEKVPASLKEKEPETKKDEKMSKAGKEVKPKPPQLQGKKEEKPEPQIKKEAKPAISEKVQIHKQDIVKPEKTVSHGKPEEKVLKQVKAVTIEKTAKPKPTKKAEHREREPPSIKTDKPKPTPKGTSEVTESGKKKTEISEKESKEKADMKHLREEKVSTRKESLQLHNVTKAEKPARVSKDVEDVPASKKAKEGTEDVSPTKQKSPISFFQCVYLDGYNGYGFQFPFTPADRPGESSGQANSPGQKQQGQ,mutated_sequence,1.0,729.0,UPI0000D820CC.a2m,UPI0000D820CC.npy,gnomAD
+UPI000013CE13,UPI000013CE13.csv,MTHCCSPCCQPTCCRTTCCRTTCWKPTTVTTCSSTPCCQPACCVSSCCQPCCRPTCCQNTCCRTTCCQPTCVTSCCQPSCCSTPCCQPTCCGSSCCGQTSCGSSCGQSSSCAPVYCRRTCYYPTTVCLPGCLNQSCGSNCCQPCCRPACCETTCCRTTCFQPTCVSSCCQPSCC,mutated_sequence,1.0,174.0,UPI000013CE13.a2m,UPI000013CE13.npy,gnomAD
+UPI0000074578,UPI0000074578.csv,MSGGFELQPRDGGPRVALAPGETVIGRGPLLGITDKRVSRRHAILEVAGGQLRIKPIHTNPCFYQSSEKSQLLPLKPNLWCYLNPGDSFSLLVDKYIFRILSIPSEVEMQCTLRNSQVLDEDNILNETPKSPVINLPHETTGASQLEGSTEIAKTQMTPTNSVSFLGENRDCNKQQPILAERKRILPTWMLAEHLSDQNLSVPAISGGNVIQGSGKEEICKDKSQLNTTQQGRRQLISSGSSENTSAEQDTGEECKNTDQEESTISSKEMPQSFSAITLSNTEMNNIKTNAQRNKLPIEELGKVSKHKIATKRTPHKEDEAMSCSENCSSAQGDSLQDESQGSHSESSSNPSNPETLHAKATDSVLQGSEGNKVKRTSCMYGANCYRKNPVHFQHFSHPGDSDYGGVQIVGQDETDDRPECPYGPSCYRKNPQHKIEYRHNTLPVRNVLDEDNDNVGQPNEYDLNDSFLDDEEEDYEPTDEDSDWEPGKEDEEKEDVEELLKEAKRFMKRK,mutated_sequence,1.0,511.0,UPI0000074578.a2m,UPI0000074578.npy,gnomAD
+UPI000005104D,UPI000005104D.csv,MAEEEAPKKSRAAGGGASWELCAGALSARLAEEGSGDAGGRRRPPVDPRRLARQLLLLLWLLEAPLLLGVRAQAAGQGPGQGPGPGQQPPPPPQQQQSGQQYNGERGISVPDHGYCQPISIPLCTDIAYNQTIMPNLLGHTNQEDAGLEVHQFYPLVKVQCSAELKFFLCSMYAPVCTVLEQALPPCRSLCERARQGCEALMNKFGFQWPDTLKCEKFPVHGAGELCVGQNTSDKGTPTPSLLPEFWTSNPQHGGGGHRGGFPGGAGASERGKFSCPRALKVPSYLNYHFLGEKDCGAPCEPTKVYGLMYFGPEELRFSRTWIGIWSVLCCASTLFTVLTYLVDMRRFSYPERPIIFLSGCYTAVAVAYIAGFLLEDRVVCNDKFAEDGARTVAQGTKKEGCTILFMMLYFFSMASSIWWVILSLTWFLAAGMKWGHEAIEANSQYFHLAAWAVPAIKTITILALGQVDGDVLSGVCFVGLNNVDALRGFVLAPLFVYLFIGTSFLLAGFVSLFRIRTIMKHDGTKTEKLEKLMVRIGVFSVLYTVPATIVIACYFYEQAFRDQWERSWVAQSCKSYAIPCPHLQAGGGAPPHPPMSPDFTVFMIKYLMTLIVGITSGFWIWSGKTLNSWRKFYTRLTNSKQGETTV,mutated_sequence,1.0,647.0,UPI000005104D.a2m,UPI000005104D.npy,gnomAD
+UPI00001C2011,UPI00001C2011.csv,MAEVVAEVAEMPTQMSPGAVEMSTPMSAEMMEMSTEVTEMTPGEALASSLFFQHHQFMCSECGSLYNTLEEVLSHQEQHMLAVSEEEALTTQNVGLEPELVPGAEGPFQCGECSQLILSPGELLAHQDAHLRESANQIQYQCWDCQELFPSPELWVAHRKAQHLSATVAEPPVPPPLPPPTPLPPPSPPSEVKMEPYECPECSTLCATPEEFLEHQGTHFDSLEKEERNGLEEEEEDDEEDEEDDEEMEDEEAMAEVGDDAVGGDESTAGWAQGCGDCPQHQPSAGARRQHRRTAHSPASATHPFHCSQCQRSFSSANRLQAHGRAHVGGTHECTTCSKVFKKAASLEQHLRLHRGEARYLCVDCGRGFGTELTLVAHRRAHTANPLHRCRCGKTFSNMTKFLYHRRTHAGKSGAPPTGATAPPAPAEPTPPPPPPAPPAQLPCPQCSKSFASASRLSRHRRAVHGPPERRHRCGVCGKGFKKLIHVRNHLRTHTGERPFQCHSCGKTFASLANLSRHQLTHTGARPYQCLDCGKRFTQSSNLQQHRRLHLRPVAFARAPRLPITGLYNKSPYYCGTCGRWFRAMAGLRLHQRVHARARTLTLQPPRSPSPAPPPPPEPQQTIMCTELGETIAIIETSQPLALEDTLQLCQAALGASEAGGLLQLDTAFV,mutated_sequence,1.0,670.0,UPI00001C2011.a2m,UPI00001C2011.npy,gnomAD
+UPI000198C8A7,UPI000198C8A7.csv,MACRRRYFVEGEAPSSETGTSLDSPSAYPQGPLVPGSSLSPDHYEHTSVGAYGLYSGPPGQQQRTRRPKLQHSTSILRKQAEEEAIKRSRSLSESYELSSDLQDKQVEMLERKYGGRLVTRHAARTIQTAFRQYQMNKNFERLRSSMSENRMSRRIVLSNMRMQFSFEGPEKVHSSYFEGKQVSVTNDGSQLGALVSPECGDLSEPTTLKSPAPSSDFADAITELEDAFSRQVKSLAESIDDALNCRSLHTEEAPALDAARARDTEPQTALHGMDHRKLDEMTASYSDVTLYIDEEELSPPLPLSQAGDRPSSTESDLRLRAGGAAPDYWALAHKEDKADTDTSCRSTPSLERQEQRLRVEHLPLLTIEPPSDSSVDLSDRSERGSLKRQSAYERSLGGQQGSPKHGPHSGAPKSLPREEPELRPRPPRPLDSHLAINGSANRQSKSESDYSDGDNDSINSTSNSNDTINCSSESSSRDSLREQTLSKQTYHKEARNSWDSPAFSNDVIRKRHYRIGLNLFNKKPEKGVQYLIERGFVPDTPVGVAHFLLQRKGLSRQMIGEFLGNRQKQFNRDVLDCVVDEMDFSTMELDEALRKFQAHIRVQGEAQKVERLIEAFSQRYCICNPGVVRQFRNPDTIFILAFAIILLNTDMYSPNVKPERKMKLEDFIKNLRGVDDGEDIPREMLMGIYERIRKRELKTNEDHVSQVQKVEKLIVGKKPIGSLHPGLGCVLSLPHRRLVCYCRLFEVPDPNKPQKLGLHQREIFLFNDLLVVTKIFQKKKNSVTYSFRQSFSLYGMQVLLFENQYYPNGIRLTSSVPGADIKVLINFNAPNPQDRKKFTDDLRESIAEVQEMEKHRIESELEKQKGVVRPSMSQCSSLKKESGNGTLSRACLDDSYASGEGLKRSALSSSLRDLSEAGKRGRRSSAGSLESNVEGSIISSPHMRRRATSTRECPSRPHQTMPNSSSLLGSLFGSKRGKPPPQAHLPSAPALATPPPTGGPASLAALCGWPPPGAPRGAAAGRHARASHPVLPHAEPSPVPPSPPPPPTPAHPARTPVPPRPPWGPPSLRGPCPRPPAAALGPRGAHSAPPWAAPCPAAPHQQQGQTQRHQHNCVDSLGRGPRLPETPAHHTGHARGSPAAHQTRGTSVAISPLCPSRPN,mutated_sequence,1.0,1160.0,UPI000198C8A7.a2m,UPI000198C8A7.npy,gnomAD
+UPI0002065B81,UPI0002065B81.csv,MAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPKETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLCEDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATNMNETSSRSHAVFNIIFTQKRHDAETNITTEKVSKISLVDLAGSERADSTGAKGTRLKEGANINKSLTTLGKVISALAEMDSGPNKNKKKKKTDFIPYRDSVLTWLLRENLGGNSRTAMVAALSPADINYDETLSTLRYADRAKQIRCNAVINEDPNNKLIRELKDEVTRLRDLLYAQGLGDITDTNTVPGGPKLTNALVGMSPSSSLSALSSRAASVSSLHERILFAPGSEEAIERLKETEKIIAELNETWEEKLRRTEAIRMEREALLAEMGVAMREDGGTLGVFSPKKTPHLVNLNEDPLMSECLLYYIKDGITRVGREDGERRQDIVLSGHFIKEEHCVFRSDSRGGSEAVVTLEPCEGADTYVNGKKVTEPSILRSGNRIIMGKSHVFRFNHPEQARQERERTPCAETPAEPVDWAFAQRELLEKQGIDMKQEMEQRLQELEDQYRREREEATYLLEQQRLDYESKLEALQKQMDSRYYPEVNEEEEEPEDEVQWTERECELALWAFRKWKWYQFTSLRDLLWGNAIFLKEANAISVELKKKVQFQFVLLTDTLYSPLPPDLLPPEAAKDRETRPFPRTIVAVEVQDQKNGATHYWTLEKLRQRLDLMREMYDRAAEVPSSVIEDCDNVVTGGDPFYDRFPWFRLVGSSAISGCNSYPLLNTCMSERMAALTPSPTFSSPDSDATEPAEEQSVGEEEEEEEEEEDEEEEDLEDDVFPEHALCDGRDPFYDRPPLFSLVGRAFVYLSNLLYPVPLVHRVAIVSEKGEVKGFLRVAVQAISADEEAPDYGSGVRQSGTAKISFDDQHFEKFQSESCPVVGMSRSGTSQEELRIVEGQGQGADVGPSADEVNNNTCSAVPPEGLLLDSSEKAALDGPLDAALDHLRLGNTFTFRVTVLQASSISAEYADIFCQFNFIHRHDEAFSTEPLKNTGRGPPLGFYHVQNIAVEVTKSFIEYIKSQPIVFEVFGHYQQHPFPPLCKDVLSPLRPSRRHFPRVMPLSKPVPATKLSTLTRPCPGPCHCKYDLLVYFEICELEANGDYIPAVVDHRGGMPCMGTFLLHQGIQRRITVTLLHETGSHIRWKEVRELVVGRIRNTPETDESLIDPNILSLNILSSGYIHPAQDDRTFYQFEAAWDSSMHNSLLLNRVTPYREKIYMTLSAYIEMENCTQPAVVTKDFCMVFYSRDAKLPASRSIRNLFGSGSLRASESNRVTGVYELSLCHVADAGSPGMQRRRRRVLDTSVAYVRGEENLAGWRPRSDSLILDHQWELEKLSLLQEVEKTRHYLLLREKLETAQRPVPEALSPAFSEDSESHGSSSASSPLSAEGRPSPLEAPNERQRELAVKCLRLLTHTFNREYTHSHVCVSASESKLSEMSVTLLRDPSMSPLGVATLTPSSTCPSLVEGRYGATDLRTPQPCSRPASPEPELLPEADSKKLPSPARATETDKEPQRLLVPDIQEIRVSPIVSKKGYLHFLEPHTSGWARRFVVVRRPYAYMYNSDKDTVERFVLNLATAQVEYSEDQQAMLKTPNTFAVCTEHRGILLQAASDKDMHDWLYAFNPLLAGTIRSKLSRRRSAQMRV,mutated_sequence,1.0,1791.0,UPI0002065B81.a2m,UPI0002065B81.npy,gnomAD
+UPI0001AE795F,UPI0001AE795F.csv,KERKKERKKERKEREREKERKKKYPKSSLSSSKFLRFLGQSQN,mutated_sequence,,,UPI0001AE795F.a2m,UPI0001AE795F.npy,gnomAD
+UPI000013811E,UPI000013811E.csv,MMAAMATARVRMGPRCAQALWRMPWLPVFLSLAAAAAAAAAEQQVPLVLWSSDRDLWAPAADTHEGHITSDLQLSTYLDPALELGPRNVLLFLQDKLSIEDFTAYGGVFGNKQDSAFSNLENALDLAPSSLVLPAVDWYAVSTLTTYLQEKLGASPLHVDLATLRELKLNASLPALLLIRLPYTASSGLMAPREVLTGNDEVIGQVLSTLKSEDVPYTAALTAVRPSRVARDVAVVAGGLGRQLLQKQPVSPVIHPPVSYNDTAPRILFWAQNFSVAYKDQWEDLTPLTFGVQELNLTGSFWNDSFARLSLTYERLFGTTVTFKFILANRLYPVSARHWFTMERLEVHSNGSVAYFNASQVTGPSIYSFHCEYVSSLSKKGSLLVARTQPSPWQMMLQDFQIQAFNVMGEQFSYASDCASFFSPGIWMGLLTSLFMLFIFTYGLHMILSLKTMDRFDDHKGPTISLTQIV,mutated_sequence,1.0,470.0,UPI000013811E.a2m,UPI000013811E.npy,gnomAD
+UPI000006D7C1,UPI000006D7C1.csv,MADNLQATFGESLLSDMLDDFPDTLPSPEALKFKILVKNKKIGTLKETHERKGSDKRGKVEEWEEEVADGEEEEEEEEEEEEEEEDKFKESEVLESVLGDNQDKETGVKKLPGVMLFKKKKTRKLKIALALSDLVIYTKAEKFKSFQHSRLYQQFNENNSIGETQARKLSKLRVHEFIFHTRKFITRIYPKATRADSSNFNPQEFWNIGCQMVALNFQTPGLPMDLQNGKFLDNGGSGYILKPHFLRESKSYFNPSNIKEGMPITLTIRLISGIQLPLTHSSSNKGDSLVIIEVFGVPNDQMKQQTRVIKKNAFSPRWNETFTFIIHVPELALIRFVVEGQGLIAGNEFLGQYTLPLLCMNKGYRRIPLFSRMGESLEPASLFVYVWYVR,mutated_sequence,1.0,390.0,UPI000006D7C1.a2m,UPI000006D7C1.npy,gnomAD
+UPI000013EFF9,UPI000013EFF9.csv,MAFLIILITCFVIILATSQPCQTPDDFVAATSPGHIIIGGLFAIHEKMLSSEDSPRRPQIQECVGFEISVFLQTLAMIHSIEMINNSTLLPGVKLGYEIYDTCTEVTVAMAATLRFLSKFNCSRETVEFKCDYSSYMPRVKAVIGSGYSEITMAVSRMLNLQLMPQVGYESTAEILSDKIRFPSFLRTVPSDFHQIKAMAHLIQKSGWNWIGIITTDDDYGRLALNTFIIQAEANNVCIAFKEVLPAFLSDNTIEVRINRTLKKIILEAQVNVIVVFLRQFHVFDLFNKAIEMNINKMWIASDNWSTATKITTIPNVKKIGKVVGFAFRRGNISSFHSFLQNLHLLPSDSHKLLHEYAMHLSACAYVKDTDLSQCIFNHSQRTLAYKANKAIERNFVMRNDFLWDYAEPGLIHSIQLAVFALGYAIRDLCQARDCQNPNAFQPWELLGVLKNVTFTDGWNSFHFDAHGDLNTGYDVVLWKEINGHMTVTKMAEYDLQNDVFIIPDQETKNEFRNLKQIQSKCSKECSPGQMKKTTRSQHICCYECQNCPENHYTNQTDMPHCLLCNNKTHWAPVRSTMCFEKEVEYLNWNDSLAILLLILSLLGIIFVLVVGIIFTRNLNTPVVKSSGGLRVCYVILLCHFLNFASTSFFIGEPQDFTCKTRQTMFGVSFTLCISCILTKSLKILLAFSFDPKLQKFLKCLYRPILIIFTCTGIQVVICTLWLIFAAPTVEVNVSLPRVIILECEEGSILAFGTMLGYIAILAFICFIFAFKGKYENYNEAKFITFGMLIYFIAWITFIPIYATTFGKYVPAVEIIVILISNYGILYCTFIPKCYVIICKQEINTKSAFLKMIYSYSSHSVSSIALSPASLDSMSGNVTMTNPSSSGKSATWQKSKDLQAQAFAHICRENATSVSKTLPRKRMSSI,mutated_sequence,1.0,926.0,UPI000013EFF9.a2m,UPI000013EFF9.npy,gnomAD
+UPI000013D82E,UPI000013D82E.csv,MELPAVGEHVFAVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNRERQEQLMGYRKRGPKPKPLVVQVPTFARRSNVLTGLQDSSTDNRAKLDLGAQGKGQGHQYELNSKKHHQYQPHSKERAGKPPPPGKSGKYYYQLNSKKHHPYQPDPKMYDLQYQGGHKEAPSPTCPDLGAKSHPPDKWAQGAGAKGYLGAVKPLAGAAGAPGKGSEKGPPNGMMPAPKEAVTGNGIGGKMKIVKNKNKNGRIVIVMSKYMENGMQAVKIKSGEVAEGEARSPSHKKRAADERHPPADRTFKKAAGAEEKKVEAPPKRREEEVSGVSDPQPQDAGSRKLSPTKEAFGEQPLQLTTKPDLLAWDPARNTHPPSHHPHPHPHHHHHHHHHHHHAVGLNLSHVRKRCLSETHGEREPCKKRLTARSISTPTCLGGSPAAERPADLPPAAALPQPEVILLDSDLDEPIDLRCVKTRSEAGEPPSSLQVKPETPASAAVAVAAAAAPTTTAEKPPAEAQDEPAESLSEFKPFFGNIIITDVTANCLTVTFKEYVTV,mutated_sequence,1.0,560.0,UPI000013D82E.a2m,UPI000013D82E.npy,gnomAD
+UPI00004EC29C,UPI00004EC29C.csv,MEKEETTRELLLPNWQGSGSHGLTIAQRDDGVFVQEVTQNSPAARTGVVKEGDQIVGATIYFDNLQSGEVTQLLNTMGHHTVGLKLHRKGDRSPEPGQTWTREVFSSCSSEVVLSGDDEEYQRIYTTKIKPRLKSEDGVEGDLGETQSRTITVTRRVTAYTVDVTGREGAKDIDISSPEFKIKIPRHELTEISNVDVETQSGKTVIRLPSGSGAASPTGSAVDIRAGAISASGPELQGAGHSKLQVTMPGIKVGGSGVNVNAKGLDLGGRGGVQVPAVDISSSLGGRAVEVQGPSLESGDHGKIKFPTMKVPKFGVSTGREGQTPKAGLRVSAPEVSVGHKGGKPGLTIQAPQLEVSVPSANIEGLEGKLKGPQITGPSLEGDLGLKGAKPQGHIGVDASAPQIGGSITGPSVEVQAPDIDVQGPGSKLNVPKMKVPKFSVSGAKGEETGIDVTLPTGEVTVPGVSGDVSLPEIATGGLEGKMKGTKVKTPEMIIQKPKISMQDVDLSLGSPKLKGDIKVSAPGVQGDVKGPQVALKGSRVDIETPNLEGTLTGPRLGSPSGKTGTCRISMSEVDLNVAAPKVKGGVDVTLPRVEGKVKVPEVDVRGPKVDVSAPDVEAHGPEWNLKMPKMKMPTFSTPGAKGEGPDVHMTLPKGDISISGPKVNVEAPDVNLEGLGGKLKGPDVKLPDMSVKTPKISMPDVDLHVKGTKVKGEYDVTVPKLEGELKGPKVDIDAPDVDVHGPDWHLKMPKMKMPKFSVPGFKAEGPEVDVNLPKADVDISGPKIDVTAPDVSIEEPEGKLKGPKFKMPEMNIKVPKISMPDVDLHLKGPNVKGEYDVTMPKVESEIKVPDVELKSAKMDIDVPDVEVQGPDWHLKMPKMKMPKFSMPGFKAEGPEVDVNLPKADVDISGPKVGVEVPDVNIEGPEGKLKGPKFKMPEMNIKAPKISMPDVDLHMKGPKVKGEYDMTVPKLEGDLKGPKVDVSAPDVEMQGPDWNLKMPKIKMPKFSMPSLKGEGPEFDVNLSKANVDISAPKVDTNAPDLSLEGPEGKLKGPKFKMPEMHFRAPKMSLPDVDLDLKGPKMKGNVDISAPKIEGEMQVPDVDIRGPKVDIKAPDVEGQGLDWSLKIPKMKMPKFSMPSLKGEGPEVDVNLPKADVVVSGPKVDIEAPDVSLEGPEGKLKGPKFKMPEMHFKTPKISMPDVDLHLKGPKVKGDVDVSVPKVEGEMKVPDVEIKGPKMDIDAPDVEVQGPDWHLKMPKMKMPKFSMPGFKGEGREVDVNLPKADIDVSGPKVDVEVPDVSLEGPEGKLKGPKFKMPEMHFKAPKISMPDVDLNLKGPKLKGDVDVSLPEVEGEMKVPDVDIKGPKVDISAPDVDVHGPDWHLKMPKVKMPKFSMPGFKGEGPEVDVKLPKADVDVSGPKMDAEVPDVNIEGPDAKLKGPKFKMPEMSIKPQKISIPDVGLHLKGPKMKGDYDVTVPKVEGEIKAPDVDIKGPKVDINAPDVEVHGPDWHLKMPKVKMPKFSMPGFKGEGPEVDMNLPKADLGVSGPKVDIDVPDVNLEAPEGKLKGPKFKMPSMNIQTHKISMPDVGLNLKAPKLKTDVDVSLPKVEGDLKGPEIDVKAPKMDVNVGDIDIEGPEGKLKGPKFKMPEMHFKAPKISMPDVDLHLKGPKVKGDMDVSVPKVEGEMKVPDVDIKGPKVDIDAPDVEVHDPDWHLKMPKMKMPKFSMPGFKAEGPEVDVNLPKADIDVSGPSVDTDAPDLDIEGPEGKLKGSKFKMPKLNIKAPKVSMPDVDLNLKGPKLKGEIDASVPELEGDLRGPQVDVKGPFVEAEVPDVDLECPDAKLKGPKFKMPEMHFKAPKISMPDVDLHLKGPKVKGDADVSVPKLEGDLTGPSVGVEVPDVELECPDAKLKGPKFKMPDMHFKAPKISMPDVDLHLKGPKVKGDVDVSVPKLEGDLTGPSVGVEVPDVELECPDAKLKGPKFKMPEMHFKTPKISMPDVDLHLKGPKVKGDMDVSVPKVEGEMKVPDVDIKGPKMDIDAPDVDVHGPDWHLKMPKMKMPKFSMPGFKAEGPEVDVNLPKADVVVSGPKVDVEVPDVSLEGPEGKLKGPKLKMPEMHFKAPKISMPDVDLHLKGPKVKGDVDVSLPKLEGDLTGPSVDVEVPDVELECPDAKLKGPKFKMPEMHFKTPKISMPDVNLNLKGPKVKGDMDVSVPKVEGEMKVPDVDIRGPKVDIDAPDVDVHGPDWHLKMPKMKMPKFSMPGFKGEGPEVDVNLPKADVDVSGPKVDVEVPDVSLEGPEGKLKGPKFKMPEMHFKTPKISMPDVDFNLKGPKIKGDVDVSAPKLEGELKGPELDVKGPKLDADMPEVAVEGPNGKWKTPKFKMPDMHFKAPKISMPDLDLHLKSPKAKGEVDVDVPKLEGDLKGPHVDVSGPDIDIEGPEGKLKGPKFKMPDMHFKAPNISMPDVDLNLKGPKIKGDVDVSVPEVEGKLEVPDMNIRGPKVDVNAPDVQAPDWHLKMPKMKMPKFSMPGFKAEGPEVDVNLPKADVDISGPKVDIEGPDVNIEGPEGKLKGPKLKMPEMNIKAPKISMPDFDLHLKGPKVKGDVDVSLPKVEGDLKGPEVDIKGPKVDINAPDVGVQGPDWHLKMPKVKMPKFSMPGFKGEGPDGDVKLPKADIDVSGPKVDIEGPDVNIEGPEGKLKGPKFKMPEMNIKAPKISMPDIDLNLKGPKVKGDVDVSLPKVEGDLKGPEVDIKGPKVDIDAPDVDVHGPDWHLKMPKIKMPKISMPGFKGEGPDVDVNLPKADIDVSGPKVDVECPDVNIEGPEGKWKSPKFKMPEMHFKTPKISMPDIDLNLTGPKIKGDVDVTGPKVEGDLKGPEVDLKGPKVDIDVPDVNVQGPDWHLKMPKMKMPKFSMPGFKAEGPEVDVNLPKADVDVSGPKVDVEGPDVNIEGPEGKLKGPKFKMPEMNIKAPKIPMPDFDLHLKGPKVKGDVDISLPKVEGDLKGPEVDIRGPQVDIDVPDVGVQGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVNLPKADLDVSGPKVDIDVPDVNIEGPEGKLKGPKFKMPEMNIKAPKISMPDIDLNLKGPKVKGDMDVSLPKVEGDMKVPDVDIKGPKVDINAPDVDVQGPDWHLKMPKIKMPKISMPGFKGEGPEVDVNLPKADLDVSGPKVDVDVPDVNIEGPDAKLKGPKFKMPEMNIKAPKISMPDLDLNLKGPKMKGEVDVSLANVEGDLKGPALDIKGPKIDVDAPDIDIHGPDAKLKGPKLKMPDMHVNMPKISMPEIDLNLKGSKLKGDVDVSGPKLEGDIKAPSLDIKGPEVDVSGPKLNIEGKSKKSRFKLPKFNFSGSKVQTPEVDVKGKKPDIDITGPKVDINAPDVEVQGKVKGSKFKMPFLSISSPKVSMPDVELNLKSPKVKGDLDIAGPNLEGDFKGPKVDIKAPEVNLNAPDVDVHGPDWNLKMPKMKMPKFSVSGLKAEGPDVAVDLPKGDINIEGPSMNIEGPDLNVEGPEGGLKGPKFKMPDMNIKAPKISMPDIDLNLKGPKVKGDVDISLPKLEGDLKGPEVDIKGPKVDINAPDVDVHGPDWHLKMPKVKMPKFSMPGFKGEGPEVDVTLPKADIDISGPNVDVDVPDVNIEGPDAKLKGPKFKMPEMNIKAPKISMPDFDLNLKGPKMKGDVVVSLPKVEGDLKGPEVDIKGPKVDIDTPDINIEGSEGKFKGPKFKIPEMHLKAPKISMPDIDLNLKGPKVKGDVDVSLPKMEGDLKGPEVDIKGPKVDINAPDVDVQGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVNLPKADLDVSGPKVDIDVPDVNIEGPEGKLKGPKFKMPEMNIKAPKISMPDIDLNLKGPKVKGDMDVSLPKVEGDMQVPDLDIKGPKVDINAPDVDVRGPDWHLKMPKIKMPKISMPGFKGEGPEVDVNLPKADLDVSGPKVDVDVPDVNIEGPDAKLKGPKFKMPEMNIKAPKISMPDFDLHLKGPKVKGDVDVSLPKMEGDLKAPEVDIKGPKVDIDAPDVDVHGPDWHLKMPKVKMPKFSMPGFKGEGPEVDVNLPKADIDVSGPKVDIDTPDIDIHGPEGKLKGPKFKMPDLHLKAPKISMPEVDLNLKGPKMKGDVDVSLPKVEGDLKGPEVDIKGPKVDIDVPDVDVQGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVNLPKADLDVSGPKVDIDVPDVNIEGPDAKLKGPKFKMPEMNIKAPKISMPDFDLHLKGPKVKGDVDVSLPKVEGDLKGPEVDIKGPKVDIDAPDVDVHGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVTLPKADIEISGPKVDIDAPDVSIEGPDAKLKGPKFKMPEMNIKAPKISMPDIDFNLKGPKVKGDVDVSLPKVEGDLKGPEIDIKGPSLDIDTPDVNIEGPEGKLKGPKFKMPEMNIKAPKISMPDFDLHLKGPKVKGDVDVSLPKVESDLKGPEVDIEGPEGKLKGPKFKMPDVHFKSPQISMSDIDLNLKGPKIKGDMDISVPKLEGDLKGPKVDVKGPKVGIDTPDIDIHGPEGKLKGPKFKMPDLHLKAPKISMPEVDLNLKGPKVKGDMDISLPKVEGDLKGPEVDIRDPKVDIDVPDVDVQGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVNLPKADIDVSGPKVDVDVPDVNIEGPDAKLKGPKFKMPEMSIKAPKISMPDIDLNLKGPKVKGDVDVTLPKVEGDLKGPEADIKGPKVDINTPDVDVHGPDWHLKMPKVKMPKFSMPGFKGEGPDVDVSLPKADIDVSGPKVDVDIPDVNIEGPDAKLKGPKFKMPEINIKAPKISIPDVDLDLKGPKVKGDFDVSVPKVEGTLKGPEVDLKGPRLDFEGPDAKLSGPSLKMPSLEISAPKVTAPDVDLHLKAPKIGFSGPKLEGGEVDLKGPKVEAPSLDVHMDSPDINIEGPDVKIPKFKKPKFGFGAKSPKADIKSPSLDVTVPEAELNLETPEISVGGKGKKSKFKMPKIHMSGPKIKAKKQGFDLNVPGGEIDASLKAPDVDVNIAGPDAALKVDVKSPKTKKTMFGKMYFPDVEFDIKSPKFKAEAPLPSPKLEGELQAPDLELSLPAIHVEGLDIKAKAPKVKMPDVDISVPKIEGDLKGPKVQANLGAPDINIEGLDAKVKTPSFGISAPQVSIPDVNVNLKGPKIKGDVPSVGLEGPDVDLQGPEAKIKFPKFSMPKIGIPGVKMEGGGAEVHAQLPSLEGDLRGPDVKLEGPDVSLKGPGVDLPSVNLSMPKVSGPDLDLNLKGPSLKGDLDASVPSMKVHAPGLNLSGVGGKMQVGGDGVKVPGIDATTKLNVGAPDVTLRGPSLQGDLAVSGDIKCPKVSVGAPDLSLEASEGSIKLPKMKLPQFGISTPGSDLHVNAKGPQVSGELKGPGVDVNLKGPRISAPNVDFNLEGPKVKGSLGATGEIKGPTVGGGLPGIGVQGLEGNLQMPGIKSSGCDVNLPGVNVKLPTGQISGPEIKGGLKGSEVGFHGAAPDISVKGPAFNMASPESDFGINLKGPKIKGGADVSGGVSAPDISLGEGHLSVKGSGGEWKGPQVSSALNLDTSKFAGGLHFSGPKVEGGVKGGQIGLQAPGLSVSGPQGHLESGSGKVTFPKMKIPKFTFSGRELVGREMGVDVHFPKAEASIQAGAGDGEWEESEVKLKKSKIKMPKFNFSKPKGKGGVTGSPEASISGSKGDLKSSKASLGSLEGEAEAEASSPKGKFSLFKSKKPRHRSNSFSDEREFSGPSTPTGTLEFEGGEVSLEGGKVKGKHGKLKFGTFGGLGSKSKGHYEVTGSDDETGKLQGSGVSLASKKSRLSSSSSNDSGNKVGIQLPEVELSVSTKKE,mutated_sequence,1.0,5890.0,UPI00004EC29C.a2m,UPI00004EC29C.npy,gnomAD
+UPI000051AE2E,UPI000051AE2E.csv,MEPAAAATVQRLPELGREDRASAPAAAAAAAAAAAAAAAALAAAAGGGRSPEPALTPAAPSGGNGSGSGAREEAPGEAPPGPLPGRAGGAGRRRRRGAPQPIAGGAAPVPGAGGGANSLLLRRGRLKRNLSAAAAAASSSSSSSAAAASHSPGAAGLPASCSASASLCTRSLDRKTLLLKHRQTLQLQPSDRDWVRHQLQRGCVHVFDRHMASTYLRPVLCTLDTTAGEVAARLLQLGHKGGGVVKVLGQGPGAAAAREPAEPPPEAGPRLAPPEPRDSEVPPARSAPGAFGGPPRAPPADLPLPVGGPGGWSRRASPAPSDSSPGEPFVGGPVSSPRAPRPVVSDTESFSLSPSAESVSDRLDPYSSGGGSSSSSEELEADAASAPTGVPGQPRRPGHPAQPLPLPQTASSPQPQQKAPRAIDSPGGAVREGSCEEKAAAAVAPGGLQSTPGRSGVTAEKAPPPPPPPTLYVQLHGETTRRLEAEEKPLQIQNDYLFQLGFGELWRVQEEGMDSEIGCLIRFYAGKPHSTGSSERIQLSGMYNVRKGKMQLPVNRWTRRQVILCGTCLIVSSVKDSLTGKMHVLPLIGGKVEEVKKHQHCLAFSSSGPQSQTYYICFDTFTEYLRWLRQVSKVASQRISSVDLSCCSLEHLPANLFYSQDLTHLNLKQNFLRQNPSLPAARGLNELQRFTKLKSLNLSNNHLGDFPLAVCSIPTLAELNVSCNALRSVPAAVGVMHNLQTFLLDGNFLQSLPAELENMKQLSYLGLSFNEFTDIPEVLEKLTAVDKLCMSGNCVETLRLQALRKMPHIKHVDLRLNVIRKLIADEVDFLQHVTQLDLRDNKLGDLDAMIFNNIEVLHCERNQLVTLDICGYFLKALYASSNELVQLDVYPVPNYLSYMDVSRNRLENVPEWVCESRKLEVLDIGHNQICELPARLFCNSSLRKLLAGHNQLARLPERLERTSVEVLDVQHNQLLELPPNLLMKADSLRFLNASANKLESLPPATLSEETNSILQELYLTNNSLTDKCVPLLTGHPHLKILHMAYNRLQSFPASKMAKLEELEEIDLSGNKLKAIPTTIMNCRRMHTVIAHSNCIEVFPEVMQLPEIKCVDLSCNELSEVTLPENLPPKLQELDLTGNPRLVLDHKTLELLNNIRCFKIDQPSTGDASGAPAVWSHGYTEASGVKNKLCVAALSVNNFCDNREALYGVFDGDRNVEVPYLLQCTMSDILAEELQKTKNEEEYMVNTFIVMQRKLGTAGQKLGGAAVLCHIKHDPVDPGGSFTLTSANVGKCQTVLCRNGKPLPLSRSYIMSCEEELKRIKQHKAIITEDGKVNGVTESTRILGYTFLHPSVVPRPHVQSVLLTPQDEFFILGSKGLWDSLSVEEAVEAVRNVPDALAAAKKLCTLAQSYGCHDSISAVVVQLSVTEDSFCCCELSAGGAVPPPSPGIFPPSVNMVIKDRPSDGLGVPSSSSGMASEISSELSTSEMSSEVGSTASDEPPPGALSENSPAYPSEQRCMLHPICLSNSFQRQLSSATFSSAFSDNGLDSDDEEPIEGVFTNGSRVEVEVDIHCSRAKEKEKQQHLLQVPAEASDEGIVISANEDEPGLPRKADFSAVGTIGRRRANGSVAPQERSHNVIEVATDAPLRKPGGYFAAPAQPDPDDQFIIPPELEEEVKEIMKHHQEQQQQQQPPPPPQLQPQLPRHYQLDQLPDYYDTPL,mutated_sequence,1.0,1717.0,UPI000051AE2E.a2m,UPI000051AE2E.npy,gnomAD
+UPI000013D704,UPI000013D704.csv,MRRAPAAERLLELGFPPRCGRQEPPFPLGVTRGWGRWPIQKRREGARPVPFSERSQEDGRGPAARSSGTLWRIRTRLSLCRDPEPPPPLCLLRVSLLCALRAGGRGSRWGEDGARLLLLPPARAAGNGEAEPSGGPSYAGRMLESSGCKALKEGVLEKRSDGLLQLWKKKCCILTEEGLLLIPPKQLQHQQQQQQQQQQQQQQQPGQGPAEPSQPSGPAVASLEPPVKLKELHFSNMKTVDCVERKGKYMYFTVVMAEGKEIDFRCPQDQGWNAEITLQMVQYKNRQAILAVKSTRQKQQHLVQQQPPSQPQPQPQLQPQPQPQPQPQPQPQSQPQPQPQPKPQPQQLHPYPHPHPHPHSHPHSHPHPHPHPHPHQIPHPHPQPHSQPHGHRLLRSTSNSA,mutated_sequence,1.0,401.0,UPI000013D704.a2m,UPI000013D704.npy,gnomAD
+UPI000016ABE3,UPI000016ABE3.csv,MKLVFLVLLFLGALGLCLAGRRRSVQWCAVSQPEATKCFQWQRNMRKVRGPPVSCIKRDSPIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVYGTERQPRTHYYAVAVVKKGGSFQLNELQGLKSCHTGLRRTAGWNVPIGTLRPFLNWTGPPEPIEAAVARFFSASCVPGADKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKCLRDGAGDVAFIRESTVFEDLSDEAERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVVARSVNGKEDAIWNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRIDSGLYLGSGYFTAIQNLRKSEEEVAARRARVVWCAVGEQELRKCNQWSGLSEGSVTCSSASTTEDCIALVLKGEADAMSLDGGYVYTAGKCGLVPVLAENYKSQQSSDPDPNCVDRPVEGYLAVAVVRRSDTSLTWNSVKGKKSCHTAVDRTAGWNIPMGLLFNQTGSCKFDEYFSQSCAPGSDPRSNLCALCIGDEQGENKCVPNSNERYYGYTGAFRCLAENAGDVAFVKDVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRKPVTEARSCHLAMAPNHAVVSRMDKVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNLLFNDNTECLARLHGKTTYEKYLGPQYVAGITNLKKCSTSPLLEACEFLRK,mutated_sequence,1.0,710.0,UPI000016ABE3.a2m,UPI000016ABE3.npy,gnomAD
+UPI000019B3C1,UPI000019B3C1.csv,MNRQVCKKSFSGRSQGFSGRSAVVSGSSRMSCVARSGGAGGGACGFRSGAGSFGSRSLYNLGSNKSISISVAAGSSRAGGFGGGRSSCGFAGGYGGGFGGSYGGGFGGGRGVGSGFGGAGGFGGAGGFGGPGVFGGPGSFGGPGGFGPGGFPGGIQEVIVNQSLLQPLNVEIDPQIGQVKAQEREQIKTLNNKFASFIDKVRFLEQQNKVLETKWELLQQQTTGSGPSSLEPCFESYISFLCKQLDSLLGERGNLEGELKSMQDLVEDFKKKYEDEINKRTAAENEFVGLKKDVDAAFMNKVELQAKVDSLTDEVSFLRTLYEMELSQMQSHASDTSVVLSMDNNRCLDLGSIIAEVRAQYEEIAQRSKSEAEALYQTKLGELQTTAGRHGDDLRNTKSEIMELNRMIQRLRAEIENVKKQNANLQTAIAEAEQRGEMALKDANAKLQDLQTALQKAKDDLARLLRDYQELMNVKLALDVEIATYRKLLEGEECRMSGECQSAVCISVVSNVTSTSGSSGSSRGVFGGVSGSGSGGYKGGSSSSSSSGYGVSGGSGSGYGGVSSGSTGGRGSSGSYQSSSSGSRLGGAGSISVSHSGMGSSSGSIQTSGGSGYKSGGGGSTSIRFSQTTSSSQHSSTK,mutated_sequence,1.0,638.0,UPI000019B3C1.a2m,UPI000019B3C1.npy,gnomAD
+UPI000013C57B,UPI000013C57B.csv,MEVPAAGRVPAEGAPTAAVAEVRCPGPAPLRLLEWRVAAGAAVRIGSVLAVFEAAASAQSSGASQSRVASGGCVRPARPERRLRSERAGVVRELCAQPGQVVAPGAVLVRLEGCSHPVVMKGLCAECGQDLTQLQSKNGKQQVPLSTATVSMVHSVPELMVSSEQAEQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIFHFQLGRGEPMLHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCGDSMVCIIDDREDVWKFAPNLITVKKYVYFQGTGDMNAPPGSRESQTRKKVNHSRGTEVSEPSPPVRDPEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAGAPEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSSDSESSSESEGTKSSSSASDGESEGKRGRQKPKAAPEGAGALAQGSSLEPGRPAAPSLPGEAEPGAHAPDKEPELGGQEEGERDGLCGLGNGCADRKEAETESQNSELSGVTAGESLDQSMEEEEEEDTDEDDHLIYLEEILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYHATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQLFPLRDDHTKAQRENSPAAFPDREGVPPTALFHPMPVLPKAQPGPEVRIYDSNTGKLIRTGARGPPAPSSSLPIRQEPSSFRAVPPPQPQMFGEELPDAQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKRRPEEQEEEPQPRKPGTRRERTLGAPASSERSAAGGRGPRGHKRKLNEEDAASESSRESSNEDEGSSSEADEMAKALEAELNDLM,mutated_sequence,1.0,961.0,UPI000013C57B.a2m,UPI000013C57B.npy,gnomAD
+UPI0000141A8E,UPI0000141A8E.csv,MTHDKSWRRCSISGSTKCRCGSRIAGPNALGSGGSRSSSSSSRSILSSSILSSSIPSSSSSSSSPSSSPSSSSPSSSHSSSSPSSSSSTSSPSSSSSSSSSSPSSSNSSSSSSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSSSSSSSPSSSSPSSSGSSPSSSNSSPSSSSSSPSSSSSSPSPRSSSPSSSSSSTSSPSSSSPSSSSPSSSCPSAALGRRPQSPQSSHCAPFP,mutated_sequence,1.0,252.0,UPI0000141A8E.a2m,UPI0000141A8E.npy,gnomAD
+UPI000020E56F,UPI000020E56F.csv,MSDRSGPTAKGKDGKKYSSLNLFDTYKGKSLEIQKPAVAPRHGLQSLGKVAIARRMPPPANLPSLKAENKGNDPNVSLVPKDGTGWASKQEQSDPKSSDASTAQPPESQPLPASQTPASNQPKRPPAAPENTPLVPSGVKSWAQASVTHGAHGDGGRASSLLSRFSREEFPTLQAAGDQDKAAKERESAEQSSGPGPSLRPQNSTTWRDGGGRGPDELEGPDSKLHHGHDPRGGLQPSGPPQFPPYRGMMPPFMYPPYLPFPPPYGPQGPYRYPTPDGPSRFPRVAGPRGSGPPMRLVEPVGRPSILKEDNLKEFDQLDQENDDGWAGAHEEVDYTEKLKFSDEEDGRDSDEEGAEGHRDSQSASGEERPPEADGKKGNSPNSEPPTPKTAWAETSRPPETEPGPPAPKPPLPPPHRGPAGNWGPPGDYPDRGGPPCKPPAPEDEDEAWRQRRKQSSSEISLAVERARRRREEEERRMQEERRAACAEKLKRLDEKFGAPDKRLKAEPAAPPAAPSTPAPPPAVPKELPAPPAPPPASAPTPETEPEEPAQAPPAQSTPTPGVAAAPTLVSGGGSTSSTSSGSFEASPVEPQLPSKEGPEPPEEVPPPTTPPVPKVEPKGDGIGPTRQPPSQGLGYPKYQKSLPPRFQRQQQEQLLKQQQQHQWQQHQQGSAPPTPVPPSPPQPVTLGAVPAPQAPPPPPKALYPGALGRPPPMPPMNFDPRWMMIPPYVDPRLLQGRPPLDFYPPGVHPSGLVPRERSDSGGSSSEPFDRHAPAMLRERGTPPVDPKLAWVGDVFTATPAEPRPLTSPLRQAADEDDKGMRSETPPVPPPPPYLASYPGFPENGAPGPPISRFPLEEPGPRPLPWPPGSDEVAKIQTPPPKKEPPKEETAQLTGPEAGRKPARGVGSGGQGPPPPRRESRTETRWGPRPGSSRRGIPPEEPGAPPRRAGPIKKPPPPTKVEELPPKPLEQGDETPKPPKPDPLKITKGKLGGPKETPPNGNLSPAPRLRRDYSYERVGPTSCRGRGRGEYFARGRGFRGTYGGRGRGARSREFRSYREFRGDDGRGGGTGGPNHPPAPRGRTASETRSEGSEYEEIPKRRRQRGSETGSETHESDLAPSDKEAPTPKEGTLTQVPLAPPPPGAPPSPAPARFTARGGRVFTPRGVPSRRGRGGGRPPPQVCPGWSPPAKSLAPKKPPTGPLPPSKEPLKEKLIPGPLSPVARGGSNGGSNVGMEDGERPRRRRHGRAQQQDKPPRFRRLKQERENAARGSEGKPSLTLPASAPGPEEALTTVTVAPAPRRAAAKSPDLSNQNSDQANEEWETASESSDFTSERRGDKEAPPPVLLTPKAVGTPGGGGGGAVPGISAMSRGDLSQRAKDLSKRSFSSQRPGMERQNRRPGPGGKAGSSGSSSGGGGGGPGGRTGPGRGDKRSWPSPKNRSRPPEERPPGLPLPPPPPSSSAVFRLDQVIHSNPAGIQQALAQLSSRQGSVTAPGGHPRHKPGLPQAPQGPSPRPPTRYEPQRVNSGLSSDPHFEEPGPMVRGVGGTPRDSAGVSPFPPKRRERPPRKPELLQEESLPPPHSSGFLGSKPEGPGPQAESRDTGTEALTPHIWNRLHTATSRKSYRPSSMEPWMEPLSPFEDVAGTEMSQSDSGVDLSGDSQVSSGPCSQRSSPDGGLKGAAEGPPKRPGGSSPLNAVPCEGPPGSEPPRRPPPAPHDGDRKELPREQPLPPGPIGTERSQRTDRGTEPGPIRPSHRPGPPVQFGTSDKDSDLRLVVGDSLKAEKELTASVTEAIPVSRDWELLPSAAASAEPQSKNLDSGHCVPEPSSSGQRLYPEVFYGSAGPSSSQISGGAMDSQLHPNSGGFRPGTPSLHPYRSQPLYLPPGPAPPSALLSGLALKGQFLDFSTMQATELGKLPAGGVLYPPPSFLYSPAFCPSPLPDTSLLQVRQDLPSPSDFYSTPLQPGGQSGFLPSGAPAQQMLLPMVDSQLPVVNFGSLPPAPPPAPPPLSLLPVGPALQPPSLAVRPPPAPATRVLPSPARPFPASLGRAELHPVELKPFQDYQKLSSNLGGPGSSRTPPTGRSFSGLNSRLKATPSTYSGVFRTQRVDLYQQASPPDALRWIPKPWERTGPPPREGPSRRAEEPGSRGDKEPGLPPPR,mutated_sequence,1.0,2157.0,UPI000020E56F.a2m,UPI000020E56F.npy,gnomAD
+UPI00001AADF3,UPI00001AADF3.csv,MLAATCNKIGSPSPSPSSLSDSSSSFGKGFHPWKRSSSSSSASCNVVGSSLSSFGVSGASRNGGSSSAAAAAAAAAAAAAALVSDSFSCGGSPGSSAFSLTSSSAAAAAAAAAAAASSSPFANDYSVFQAPGVSGGSGGGGGGGGGGSSAHSQDGSHQPVFISKVHTSVDGLQGIYPRVGMAHPYESWFKPSHPGLGAAGEVGSAGASSWWDVGAGWIDVQNPNSAAALPGSLHPAAGGLQTSLHSPLGGYNSDYSGLSHSAFSSGASSHLLSPAGQHLMDGFKPVLPGSYPDSAPSPLAGAGGSMLSAGPSAPLGGSPRSSARRYSGRATCDCPNCQEAERLGPAGASLRRKGLHSCHIPGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQRHLRTHTGEKRFACPVCNKRFMRSDHLSKHVKTHSGGGGGGGSAGSGSGGKKGSDTDSEHSAAGSPPCHSPELLQPPEPGHRNGLE,mutated_sequence,1.0,490.0,UPI00001AADF3.a2m,UPI00001AADF3.npy,gnomAD
+UPI0000071679,UPI0000071679.csv,MEMKKKINLELRNRSPEEVTELVLDNCLCVNGEIEGLNDTFKELEFLSMANVELSSLARLPSLNKLRKLELSDNIISGGLEVLAEKCPNLTYLNLSGNKIKDLSTVEALQNLKNLKSLDLFNCEITNLEDYRESIFELLQQITYLDGFDQEDNEAPDSEEEDDEDGDEDDEEEEENEAGPPEGYEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDDDDYVEEGEEEEEEEEGGLRGEKRKRDAEDDGEEEDD,mutated_sequence,1.0,268.0,UPI0000071679.a2m,UPI0000071679.npy,gnomAD
+UPI0000126A2D,UPI0000126A2D.csv,MPGLGRRAQWLCWWWGLLCSCCGPPPLRPPLPAAAAAAAGGQLLGDGGSPGRTEQPPPSPQSSSGFLYRRLKTQEKREMQKEILSVLGLPHRPRPLHGLQQPQPPALRQQEEQQQQQQLPRGEPPPGRLKSAPLFMLDLYNALSADNDEDGASEGERQQSWPHEAASSSQRRQPPPGAAHPLNRKSLLAPGSGSGGASPLTSAQDSAFLNDADMVMSFVNLVEYDKEFSPRQRHHKEFKFNLSQIPEGEVVTAAEFRIYKDCVMGSFKNQTFLISIYQVLQEHQHRDSDLFLLDTRVVWASEEGWLEFDITATSNLWVVTPQHNMGLQLSVVTRDGVHVHPRAAGLVGRDGPYDKQPFMVAFFKVSEVHVRTTRSASSRRRQQSRNRSTQSQDVARVSSASDYNSSELKTACRKHELYVSFQDLGWQDWIIAPKGYAANYCDGECSFPLNAHMNATNHAIVQTLVHLMNPEYVPKPCCAPTKLNAISVLYFDDNSNVILKKYRNMVVRACGCH,mutated_sequence,1.0,513.0,UPI0000126A2D.a2m,UPI0000126A2D.npy,gnomAD
+UPI0000EE4A53,UPI0000EE4A53.csv,MVRGWEPPPGLDCAISEGHKSEGTMPPNKEASGLSSSPAGLICLPPISEELQLVWTQAAQTSELDSNEHLLKTFSYFPYPSLADIALLCLRYGLQMEKVKTWFMAQRLRCGISWSSEEIEETRARVVYRRDQLHFKSLLSFTHHAGRPPEEVPPPPVPAPEQVGIGIGPPTLSKPTQTKGLKVEPEEPSQMPPLPQSHQKLKESLMTPGSGAFPYQSDFWQHLQSSGLSKEQAGRGPNQSHGIGTASWNHSTTVPQPQARDKPPPIALIASSCKEESASSVTPSSSSTSSSFQVLANGATAASKPLQPLGCVPQSVSPSEQALPPHLEPAWPQGLRHNSVPGRVGPTEYLSPDMQRQRKTKRKTKEQLAILKSFFLQCQWARREDYQKLEQITGLPRPEIIQWFGDTRYALKHGQLKWFRDNAVPGAPSFQDPAIPTPPPSTRSLNERAETPPLPIPPPPPDIQPLERYWAAHQQLRETDIPQLSQASRLSTQQVLDWFDSRLPQPAEVVVCLDEEEEEEEEELPEDDEEEEEEEEEDDDDDDDDVIIQD,mutated_sequence,1.0,550.0,UPI0000EE4A53.a2m,UPI0000EE4A53.npy,gnomAD
+UPI00001A96B8,UPI00001A96B8.csv,MSCVHYKFSSKLNYDTVTFDGLHISLCDLKKQIMGREKLKAADCDLQITNAQTKEEYTDDNALIPKNSSVIVRRIPIGGVKSTSKTYVISRTEPAMATTKAIDDSSASISLAQLTKTANLAEANASEEDKIKAMMSQSGHEYDPINYMKKPLGPPPPSYTCFRCGKPGHYIKNCPTNGDKNFESGPRIKKSTGIPRSFMMEVKDPNMKGAMLTNTGKYAIPTIDAEAYAIGKKEKPPFLPEEPSSSSEEDDPIPDELLCLICKDIMTDAVVIPCCGNSYCDECIRTALLESDEHTCPTCHQNDVSPDALIANKFLRQAVNNFKNETGYTKRLRKQLPPPPPPIPPPRPLIQRNLQPLMRSPISRQQDPLMIPVTSSSTHPAPSISSLTSNQSSLAPPVSGNPSSAPAPVPDITATVSISVHSEKSDGPFRDSDNKILPAAALASEHSKGTSSIAITALMEEKGYQVPVLGTPSLLGQSLLHGQLIPTTGPVRINTARPGGGRPGWEHSNKLGYLVSPPQQIRRGERSCYRSINRGRHHSERSQRTQGPSLPATPVFVPVPPPPLYPPPPHTLPLPPGVPPPQFSPQFPPGQPPPAGYSVPPPGFPPAPANLSTPWVSSGVQTAHSNTIPTTQAPPLSREEFYREQRRLKEEEKKKSKLDEFTNDFAKELMEYKKIQKERRRSFSRSKSPYSGSSYSRSSYTYSKSRSGSTRSRSYSRSFSRSHSRSYSRSPPYPRRGRGKSRNYRSRSRSHGYHRSRSRSPPYRRYHSRSRSPQAFRGQSPNKRNVPQGETEREYFNRYREVPPPYDMKAYYGRSVDFRDPFEKERYREWERKYREWYEKYYKGYAAGAQPRPSANRENFSPERFLPLNIRNSPFTRGRREDYVGGQSHRSRNIGSNYPEKLSARDGHNQKDNTKSKEKESENAPGDGKGNKHKKHRKRRKGEESEGFLNPELLETSRKSREPTGVEENKTDSLFVLPSRDDATPVRDEPMDAESITFKSVSEKDKRERDKPKAKGDKTKRKNDGSAVSKKENIVKPAKGPQEKVDGERERSPRSEPPIKKAKEETPKTDNTKSSSSSQKDEKITGTPRKAHSKSAKEHQETKPVKEEKVKKDYSKDVKSEKLTTKEEKAKKPNEKNKPLDNKGEKRKRKTEEKGVDKDFESSSMKISKLEVTEIVKPSPKRKMEPDTEKMDRTPEKDKISLSAPAKKIKLNRETGKKIGSTENISNTKEPSEKLESTSSKVKQEKVKGKVRRKVTGTEGSSSTLVDYTSTSSTGGSPVRKSEEKTDTKRTVIKTMEEYNNDNTAPAEDVIIMIQVPQSKWDKDDFESEEEDVKSTQPISSVGKPASVIKNVSTKPSNIVKYPEKESEPSEKIQKFTKDVSHEIIQHEVKSSKNSASSEKGKTKDRDYSVLEKENPEKRKNSTQPEKESNLDRLNEQGNFKSLSQSSKEARTSDKHDSTRASSNKDFTPNRDKKTDYDTREYSSSKRRDEKNELTRRKDSPSRNKDSASGQKNKPREERDLPKKGTGDSKKSNSSPSRDRKPHDHKATYDTKRPNEETKSVDKNPCKDREKHVLEARNNKESSGNKLLYILNPPETQVEKEQITGQIDKSTVKPKPQLSHSSRLSSDLTRETDEAAFEPDYNESDSESNVSVKEEESSGNISKDLKDKIVEKAKESLDTAAVVQVGISRNQSHSSPSVSPSRSHSPSGSQTRSHSSSASSAESQDSKKKKKKKEKKKHKKHKKHKKHKKHAGTEVELEKSQKHKHKKKKSKKNKDKEKEKEKDDQKVKSVTV,mutated_sequence,1.0,1792.0,UPI00001A96B8.a2m,UPI00001A96B8.npy,gnomAD
+UPI0000161631,UPI0000161631.csv,MARLCRRVPCTLLLGLAVVLLKARLVPAAARAELSRSDLSLIQQQQQQQQQQQQQQKQLEEAEEERTEVPGATSTLTVPVSVFMLKVQVNDIISRQYLSQAVVEVFVNYTKTNSTVTKSNGAVLIKVPYKLGLSLTIIAYKDGYVLTPLPWKTRRMPIYSSVTLSLFPQSQANIWLFEDTVLITGKLADAKSQPSVQFSKALIKLPDNHHISNVTGYLTVLQQFLKVDNFLHTTGITLNKPGFENIELTPLAAICVKIYSGGKELKVNGSIQVSLPLLRLNDISAGDRIPAWTFDMNTGAWVNHGRGMVKEHNNHLIWTYDAPHLGYWIAAPLPGTRGSGINEDSKDITAYHTVFLTAILGGTIVIVIGFFAVLLCYCRDKCGTPQKRERNITKLEVLKRDQTTSTTHINHISTVKVALKAEDKSQLFNAKNSSYSPQKKEPSKAETEERVSMVKTRDDFKIYNEDVSFLSVNQNNYSRNPTQSLEPNVGSKQPKHINNNLSSSLGDAQDEKRYLTGNEEAYGRSHIPEQLMHIYSQPIAILQTSDLFSTPEQLHTAKSATLPRKGQLVYGQLMEPVNRENFTQTLPKMPIHSHAQPPDAREEDIILEGQQSLPSQASDWSRYSSSLLESVSVPGTLNEAVVMTPFSSELQGISEQTLLELSKGKPSPHPRAWFVSLDGKPVAQVRHSFIDLKKGKRTQSNDTSLDSGVDMNELHSSRKLEREKTFIKSMHQPKILYLEDLDLSSSESGTTVCSPEDPALRHILDGGSGVIMEHPGEESPGRKSTVEDFEANTSPTKRRGRPPLAKRDSKTNIWKKREERPLIPIN,mutated_sequence,1.0,826.0,UPI0000161631.a2m,UPI0000161631.npy,gnomAD
+UPI00015E0572,UPI00015E0572.csv,MLETLRERLLSVQQDFTSGLKTLSDKSREAKVKSKPRTVPFLPKYSAGLELLSRYEDTWAALHRRAKDCASAGELVDSEVVMLSAHWEKKKTSLVELQEQLQQLPALIADLESMTANLTHLEASFEEVENNLLHLEDLCGQCELERCKHMQSQQLENYKKNKRKELEVTFRSELDAEHAQKVLEMEHTQQMKLKERQKFFEEAFQQDMEQYLSTGYLQIAERREPIGSMSSMEVNVDMLEQMDLMDISDQEALDVFLNSGGEENTVLSPALGPESSTCQNEITLQVPNPSELRAKPPSSSSTCTDSATRDISEGGESPVVQSDEEEVQVDTALATSHTDREATPDGGEDSDS,mutated_sequence,1.0,352.0,UPI00015E0572.a2m,UPI00015E0572.npy,gnomAD
+UPI0002065466,UPI0002065466.csv,VHQEIHTIEKTFECSHGKKSFCQESHFTEHHRTCTREKPCESSDCKKRFCHSSVLRVHQRTHTGEKPYECKECRKSFHVKPNLTKHQKTHIGEKPFKCTACGRTFFQQSTLNVHQRTHTGEKPCGCNECEKSFYNKDALIIHQRTHTGEKPYKCNECEKSYLKKSHLNIHLRNHTGKRPHVCNECGKAFSMKSTLTVHQRTHGEKPYKCNECRKSFYMKSALSQHQRIHIWEEHCECKECGKTYQRSHLAKHHRKYTKKPYECKQCGKTFQKSHLIEHQRTHPGEIPHECNKCGKSFCYKSPLTIHQRTHIEEKPYKCSKSVTYFCMKSHLTVHQRPHTRKNPFECNECRKMFYVKSNLINHQRTHTGEKPYEGNTCGKS,mutated_sequence,1.0,380.0,UPI0002065466.a2m,UPI0002065466.npy,gnomAD
+UPI0000167B2F,UPI0000167B2F.csv,MTAESGPPPPQPEVLATVKEERGETAAGAGVPGEATGRGAGGRRRKRPLQRGKPPYSYIALIAMAIAHAPERRLTLGGIYKFITERFPFYRDNPKKWQNSIRHNLTLNDCFLKIPREAGRPGKGNYWALDPNAEDMFESGSFLRRRKRFKRSDLSTYPAYMHDAAAAAAAAAAAAAAAAIFPGAVPAARPPYPGAVYAGYAPPSLAAPPPVYYPAASPGPCRVFGLVPERPLSPELGPAPSGPGGSCAFASAGAPATTTGYQPAGCTGARPANPSAYAAAYAGPDGAYPQGAGSAIFAAAGRLAGPASPPAGGSSGGVETTVDFYGRTSPGQFGALGACYNPGGQLGGASAGAYHARHAAAYPGGIDRFVSAM,mutated_sequence,1.0,373.0,UPI0000167B2F.a2m,UPI0000167B2F.npy,gnomAD
+UPI0000456EEB,UPI0000456EEB.csv,MSVSRTMEDSCELDLVYVTERIIAVSFPSTANEENFRSNLREVAQMLKSKHGGNYLLFNLSERRPDITKLHAKVLEFGWPDLHTPALEKICSICKAMDTWLNADPHNVVVLHNKGNRGRIGVVIAAYMHYSNISASADQALDRFAMKRFYEDKIVPIGQPSQRRYVHYFSGLLSGSIKMNNKPLFLHHVIMHGIPNFESKGGCRPFLRIYQAMQPVYTSGIYNIPGDSQTSVCITIEPGLLLKGDILLKCYHKKFRSPARDVIFRVQFHTCAIHDLGVVFGKEDLDDAFKDDRFPEYGKVEFVFSYGPEKIQGMEHLENGPSVSVDYNTSDPLIRWDSYDNFSGHRDDGMEEVVGHTQGPLDGSLYAKVKKKDSLHGSTGAVNATRPTLSATPNHVEHTLSVSSDSGNSTASTKTDKTDEPVPGASSATAALSPQEKRELDRLLSGFGLEREKQGAMYHTQHLRSRPAGGSAVPSSGRHVVPAQVHVNGGALASERETDILDDELPNQDGHSAGSMGTLSSLDGVTNTSEGGYPEALSPLTNGLDKSYPMEPMVNGGGYPYESASRAGPAHAGHTAPMRPSYSAQEGLAGYQREGPHPAWPQPVTTSHYAHDPSGMFRSQSFSEAEPQLPPAPVRGGSSREAVQRGLNSWQQQQQQQQQPRPPPRQQERAHLESLVASRPSPQPLAETPIPSLPEFPRAASQQEIEQSIETLNMLMLDLEPASAAAPLHKSQSVPGAWPGASPLSSQPLSGSSRQSHPLTQSRSGYIPSGHSLGTPEPAPRASLESVPPGRSYSPYDYQPCLAGPNQDFHSKSPASSSLPAFLPTTHSPPGPQQPPASLPGLTAQPLLSPKEATSDPSRTPEEEPLNLEGLVAHRVAGVQAREKQPAEPPAPLRRRAASDGQYENQSPEATSPRSPGVRSPVQCVSPELALTIALNPGGRPKEPHLHSYKEAFEEMEGTSPSSPPPSGVRSPPGLAKTPLSALGLKPHNPADILLHPTGVTRRRIQPEEDEGKVVVRLSEEPRSYVESVARTAVAGPRAQDSEPKSFSAPATQAYGHEIPLRNGTLGGSFVSPSPLSTSSPILSADSTSVGSFPSGESSDQGPRTPTQPLLESGFRSGSLGQPSPSAQRNYQSSSPLPTVGSSYSSPDYSLQHFSSSPESQARAQFSVAGVHTVPGSPQARHRTVGTNTPPSPGFGWRAINPSMAAPSSPSLSHHQMMGPPGTGFHGSTVSSPQSSAATTPGSPSLCRHPAGVYQVSGLHNKVATTPGSPSLGRHPGAHQGNLASGLHSNAIASPGSPSLGRHLGGSGSVVPGSPCLDRHVAYGGYSTPEDRRPTLSRQSSASGYQAPSTPSFPVSPAYYPGLSSPATSPSPDSAAFRQGSPTPALPEKRRMSVGDRAGSLPNYATINGKVSSPVASGMSSPSGGSTVSFSHTLPDFSKYSMPDNSPETRAKVKFVQDTSKYWYKPEISREQAIALLKDQEPGAFIIRDSHSFRGAYGLAMKVSSPPPTIMQQNKKGDMTHELVRHFLIETGPRGVKLKGCPNEPNFGSLSALVYQHSIIPLALPCKLVIPNRDPTDESKDSSGPANSTADLLKQGAACNVLFVNSVDMESLTGPQAISKATSETLAADPTPAATIVHFKVSAQGITLTDNQRKLFFRRHYPLNTVTFCDLDPQERKWMKTEGGAPAKLFGFVARKQGSTTDNACHLFAELDPNQPASAIVNFVSKVMLNAGQKR,mutated_sequence,1.0,1735.0,UPI0000456EEB.a2m,UPI0000456EEB.npy,gnomAD
+UPI00006E2246,UPI00006E2246.csv,MSPQGPAVLSLGSLCLDTNQAPNWTGLQTLLQQLPPQDIDERYCLALGEEERAELQLFCARRKQEALGQGVARLVLPKLEGHTCEKCRELLKPGEYGVFAARAGEQRCWHQPCFACQACGQALINLIYFYHDGQLYCGRHHAELLRPRCPACDQLIFSWRCTEAEGQRWHENHFCCQDCAGPLGGGRYALPGGSPCCPSCFENRYSDAGSSWAGALEGQAFLGETGLDRTEGRDQTSVNSATLSRTLLAAAGGSSLQTQRGLPGSSPQQENRPGDKAEAPKGQEQCRLETIRDPKDTPFSTCSSSSDSEPEGFFLGERLPQSWKTPGSLQAEDSNASKTHCTMC,mutated_sequence,1.0,344.0,UPI00006E2246.a2m,UPI00006E2246.npy,gnomAD
+UPI000013C98D,UPI000013C98D.csv,MLPRVGCPALPLPPPPLLPLLLLLLGASGGGGGARAEVLFRCPPCTPERLAACGPPPVAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACGVYTPRCGQGLRCYPHPGSELPLQALVMGEGTCEKRRDAEYGASPEQVADNGDDHSEGGLVENHVDSTMNMLGGGGSAGRKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEPKKLRPPPARTPCQQELDQVLERISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCKMSLNGQRGECWCVNPNTGKLIQGAPTIRGDPECHLFYNEQQEARGVHTQRMQ,mutated_sequence,1.0,325.0,UPI000013C98D.a2m,UPI000013C98D.npy,gnomAD
+UPI000015153B,UPI000015153B.csv,MAAAAGNRASSSGFPGARATSPEAGGGGGALKASSAPAAAAGLLREAGSGGRERADWRRRQLRKVRSVELDQLPEQPLFLAASPPASSTSPSPEPADAAGSGTGFQPVAVPPPHGAASRGGAHLTESVAAPDSGASSPAAAEPGEKRAPAAEPSPAAAPAGREMENKETLKGLHKMDDRPEERMIREKLKATCMPAWKHEWLERRNRRGPVVVKPIPVKGDGSEMNHLAAESPGEVQASAASPASKGRRSPSPGNSPSGRTVKSESPGVRRKRVSPVPFQSGRITPPRRAPSPDGFSPYSPEETNRRVNKVMRARLYLLQQIGPNSFLIGGDSPDNKYRVFIGPQNCSCARGTFCIHLLFVMLRVFQLEPSDPMLWRKTLKNFEVESLFQKYHSRRSSRIKAPSRNTIQKFVSRMSNSHTLSSSSTSTSSSENSIKDEEEQMCPICLLGMLDEESLTVCEDGCRNKLHHHCMSIWAEECRRNREPLICPLCRSKWRSHDFYSHELSSPVDSPSSLRAAQQQTVQQQPLAGSRRNQESNFNLTHYGTQQIPPAYKDLAEPWIQVFGMELVGCLFSRNWNVREMALRRLSHDVSGALLLANGESTGNSGGSSGSSPSGGATSGSSQTSISGDVVEACCSVLSMVCADPVYKVYVAALKTLRAMLVYTPCHSLAERIKLQRLLQPVVDTILVKCADANSRTSQLSISTLLELCKGQAGELAVGREILKAGSIGIGGVDYVLNCILGNQTESNNWQELLGRLCLIDRLLLEFPAEFYPHIVSTDVSQAEPVEIRYKKLLSLLTFALQSIDNSHSMVGKLSRRIYLSSARMVTTVPHVFSKLLEMLSVSSSTHFTRMRRRLMAIADEVEIAEAIQLGVEDTLDGQQDSFLQASVPNNYLETTENSSPECTVHLEKTGKGLCATKLSASSEDISERLASISVGPSSSTTTTTTTTEQPKPMVQTKGRPHSQCLNSSPLSHHSQLMFPALSTPSSSTPSVPAGTATDVSKHRLQGFIPCRIPSASPQTQRKFSLQFHRNCPENKDSDKLSPVFTQSRPLPSSNIHRPKPSRPTPGNTSKQGDPSKNSMTLDLNSSSKCDDSFGCSSNSSNAVIPSDETVFTPVEEKCRLDVNTELNSSIEDLLEASMPSSDTTVTFKSEVAVLSPEKAENDDTYKDDVNHNQKCKEKMEAEEEEALAIAMAMSASQDALPIVPQLQVENGEDIIIIQQDTPETLPGHTKAKQPYREDTEWLKGQQIGLGAFSSCYQAQDVGTGTLMAVKQVTYVRNTSSEQEEVVEALREEIRMMSHLNHPNIIRMLGATCEKSNYNLFIEWMAGGSVAHLLSKYGAFKESVVINYTEQLLRGLSYLHENQIIHRDVKGANLLIDSTGQRLRIADFGAAARLASKGTGAGEFQGQLLGTIAFMAPEVLRGQQYGRSCDVWSVGCAIIEMACAKPPWNAEKHSNHLALIFKIASATTAPSIPSHLSPGLRDVALRCLELQPQDRPPSRELLKHPVFRTTW,mutated_sequence,1.0,1512.0,UPI000015153B.a2m,UPI000015153B.npy,gnomAD
+UPI0000070FB1,UPI0000070FB1.csv,MPTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFIALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT,mutated_sequence,1.0,554.0,UPI0000070FB1.a2m,UPI0000070FB1.npy,gnomAD
+UPI00001293CB,UPI00001293CB.csv,MSRPQLRRWRLVSSPPSGVPGLALLALLALLALRLAAGTDCPCPEPELCRPIRHHPDFEVFVFDVGQKTWKSYDWSQITTVATFGKYDSELMCYAHSKGARVVLKGDVSLKDIIDPAFRASWIAQKLNLAKTQYMDGINIDIEQEVNCLSPEYDALTALVKETTDSFHREIEGSQVTFDVAWSPKNIDRRCYNYTGIADACDFLFVMSYDEQSQIWSECIAAANAPYNQTLTGYNDYIKMSINPKKLVMGVPWYGYDYTCLNLSEDHVCTIAKVPFRGAPCSDAAGRQVPYKTIMKQINSSISGNLWDKDQRAPYYNYKDPAGHFHQVWYDNPQSISLKATYIQNYRLRGIGMWNANCLDYSGDAVAKQQTEEMWEVLKPKLLQR,mutated_sequence,1.0,385.0,UPI00001293CB.a2m,UPI00001293CB.npy,gnomAD
+UPI00001F9E3B,UPI00001F9E3B.csv,MDRNPSPPPPGRDKEEEEEVAGGDCIGSTVYSKHWLFGVLSGLIQIVSPENTKSSSDDEEQLTELDEEMENEICRVWDMSMDEDVALFLQEFNAPDIFMGVLAKSKCPRLREICVGILGNMACFQEICVSISSDKNLGQVLLHCLYDSDPPTLLETSRLLLTCLSQAEVASVWVERIQEHPAIYDSICFIMSSSTNVDLLVKVGEVVDKLFDLDEKLMLEWVRNGAAQPLDQPQEESEEQPVFRLVPCILEAAKQVRSENPEWLDVYMHILQLLTTVDDGIQAIVHCPDTGKDIWNLLFDLVCHEFCQSDDPPIILQEQKTVLASVFSVLSAIYASQTEQEYLKIEKVDLPLIDSLIRVLQNMEQCQKKPENSAESNTEETKRTDLTQDDFHLKILKDILCEFLSNIFQALTKETVAQGVKEGQLSKQKCSSAFQNLLPFYSPVVEDFIKILREVDKALADDLEKNFPSLKVQT,mutated_sequence,1.0,474.0,UPI00001F9E3B.a2m,UPI00001F9E3B.npy,gnomAD
+UPI0000208B29,UPI0000208B29.csv,MDLSAAAALCLWLLSACRPRDGLEAAAVLRAAGAGPVRSPGGGGGGGGGGRTLAQAAGAAAVPAAAVPRARAARRAAGSGFRNGSVVPHHFMMSLYRSLAGRAPAGAAAVSASGHGRADTITGFTDQATQDESAAETGQSFLFDVSSLNDADEVVGAELRVLRRGSPESGPGSWTSPPLLLLSTCPGAARAPRLLYSRAAEPLVGQRWEAFDVADAMRRHRREPRPPRAFCLLLRAVAGPVPSPLALRRLGFGWPGGGGSAAEERAVLVVSSRTQRKESLFREIRAQARALGAALASEPLPDPGTGTASPRAVIGGRRRRRTALAGTRTAQGSGGGAGRGHGRRGRSRCSRKPLHVDFKELGWDDWIIAPLDYEAYHCEGLCDFPLRSHLEPTNHAIIQTLLNSMAPDAAPASCCVPARLSPISILYIDAANNVVYKQYEDMVVEACGCR,mutated_sequence,1.0,450.0,UPI0000208B29.a2m,UPI0000208B29.npy,gnomAD
+UPI00001B4B18,UPI00001B4B18.csv,MECYYIVISSTHLSNGHFRNIKGVFRGPLSKNGNKTLDYAEKENTIAKALEDLKANFYCELCDKQYYKHQEFDNHINSYDHAHKQRLKELKQREFARNVASKSRKDERKQEKALQRLHKLAELRKETVCAPGSGPMFKSTTVTVRENCNEISQRVVVDSVNNQQDFKYTLIHSEENTKDATTVAEDPESANNYTAKNNQVGDQAQGIHRHKIGFSFAFPKKASVKLESSAAAFSEYSDDASVGKGFSRKSRFVPSACHLQQSSPTDVLLSSEEKTNSFHPPEAMCRDKETVQTQEIKEVSSEKDALLLPSFCKFQLQLSSDADNCQNSVPLADQIPLESVVINEDIPVSGNSFELLGNKSTVLDMSNDCISVQATTEENVKHNEASTTEVENKNGPETLAPSNTEEVNITIHKKTNFCKRQCEPFVPVLNKHRSTVLQWPSEMLVYTTTKPSISYSCNPLCFDFKSTKVNNNLDKNKPDLKDLCSQQKQEDICMGPLSDYKDVSTEGLTDYEIGSSKNKCSQVTPLLADDILSSSCDSGKNENTGQRYKNISCKIRETEKYNFTKSQIKQDTLDEKYNKIRLKETHEYWFHKSRRKKKRKKLCQHHHMEKTKESETRCKMEAENSYTENAGKYLLEPISEKQYLAAEQLLDSHQLLDKRPKSESISLSDNEEMCKTWNTEYNTYDTISSKNHCKKNTILLNGQSNATMIHSGKHNLTYSRTYCCWKTKMSSCSQDHRSLVLQNDMKHMSQNQAVKRGYNSVMNESERFYRKRRQHSHSYSSDESLNRQNHLPEEFLRPPSTSVAPCKPKKKRRRKRGRFHPGFETLELKENTDYPVKDNSSLNPLDRLISEDKKEKMKPQEVAKIERNSEQTNQLRNKLSFHPNNLLPSETNGETEHLEMETTSGELSDVSNDPTTSVCVASAPTKEAIDNTLLEHKERSENINLNEKQIPFQVPNIERNFRQSQPKSYLCHYELAEALPQGKMNETPTEWLRYNSGILNTQPPLPFKEAHVSGHTFVTAEQILAPLALPEQALLIPLENHDKFKNVPCEVYQHILQPNMLANKVKFTFPPAALPPPSTPLQPLPLQQSLCSTSVTTIHHTVLQQHAAAAAAAAAAAAAGTFKVLQPHQQFLSQIPALTRTSLPQLSVGPVGPRLCPGNQPTFVAPPQMPIIPASVLHPSHLAFPSLPHALFPSLLSPHPTVIPLQPLF,mutated_sequence,1.0,1209.0,UPI00001B4B18.a2m,UPI00001B4B18.npy,gnomAD
+UPI00001273CC,UPI00001273CC.csv,MEFSWGSGQESRRLLLLLLLLAAWEAGNGQLHYSVSEEAKHGTFVGRIAQDLGLELAELVPRLFRVASKGRGGLLEVNLQNGILFVNSRIDREELCRRSAECSIHLEVIVDRPLQVFHVDVEVRDINDNPPVFPATQKNLSIAESRPLDSRFPLEGASDADIGENALLTYRLSPNEYFSLEKPPDDELVKGLGLILRKSLDREEAPEIFLVLTATDGGKPELTGTVQLLITVLDANDNAPAFDRTIYKVRLLENVPNGTLVIKLNASDLDEGLNGDIVYSFSNDISPNVKSKFHIDPITGQIIVKGYIDFEESKSYEIIVEGIDKGQLPLSGHCRVIVEVEDNNDNVPDLEFKSLSLPIREDAPLGTVIALISVSDKDMGVNGLVTCSLTSHVPFKLVSTFKNYYSLVLDSALDRESVSAYELVVTARDGGSPSLWATASVSVEVADVNDNAPAFAQPEYTVFVKENNPPGCHIFTVSAWDADAQENALVSYSLVERRVGERALSSYVSVHAESGKVYALQPLDHEELELLQFQVTARDAGVPPLGSNVTLQVFVLDENDNAPALLAPRAGGTGGAVSELVPWSVGVGHVVAKVRAVDADSGYNAWLSYELQPGTGGARIPFRVGLYTGEISTTRALDETDAPRHRLLVLVKDHGEPALTATATVLVSLVESGQAPKASSRALVGAVGPDAALVDVNVYLIIAICAVSSLLVLTLLLYTALRCSALPTEGACAPGKPTLVCSSAVGSWSYSQQRRPRVCSGEGPPKTDLMAFSPSLPDSRDREDQLQTTEESFAKPRQPNPDWRYSASLRAGMHSSVHLEEAGILRAGPGGPDQQWPTVSSATPEPEAGEVSPPVGAGVNSNSWTFKYGPGNPKQSGPGELPDKFIIPGSPAIISIRQEPTNSQIDKSDFITFGKKEETKKKKKKKKGNKTQEKKEKGNSTTDNSDQ,mutated_sequence,1.0,947.0,UPI00001273CC.a2m,UPI00001273CC.npy,gnomAD
+UPI0000470BBD,UPI0000470BBD.csv,MSYQKKQPTPQPPVDCVKTSGGGGGGGGSGGGGCGFFGGGGSGGGSSGSGCGYSGGGGYSGGGCGGGSSGGGGGGGIGGCGGGSGGSVKYSGGGGSSGGGSGCFSSGGGGSGCFSSGGGGSSGGGSGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQSYGGVSSGGSSGGGSGCFSSGGGGGSVCGYSGGGSGCGGGSSGGSGSGYVSSQQVTQTSCAPQPSYGGGSSGGGGSGGSGCFSSGGGGGSSGCGGGSSGIGSGCIISGGGSVCGGGSSGGGGGGSSVGGSGSGKGVPICHQTQQKQAPTWPSK,mutated_sequence,1.0,312.0,UPI0000470BBD.a2m,UPI0000470BBD.npy,gnomAD
+UPI00000718AE,UPI00000718AE.csv,MEVVTFGDVAVHFSREEWQCLDPGQRALYREVMLENHSSVAGLAGFLVFKPELISRLEQGEEPWVLDLQGAEGTEAPRTSKTGFLGRPTMGQEPRHPHAPPATPVPGLPKHCSQRLTLPPPGLSSSPLGHFLVHDQDRRRGTSAIWMV,mutated_sequence,1.0,148.0,UPI00000718AE.a2m,UPI00000718AE.npy,gnomAD
+UPI000013D567,UPI000013D567.csv,MATLEKLMKAFESLKSFQQQQQQQQQQQQQQQQQQQQQPPPPPPPPPPPQLPQPPPQAQPLLPQPQPPPPPPPPPPGPAVAEEPLHRPKKELSATKKDRVNHCLTICENIVAQSVRNSPEFQKLLGIAMELFLLCSDDAESDVRMVADECLNKVIKALMDSNLPRLQLELYKEIKKNGAPRSLRAALWRFAELAHLVRPQKCRPYLVNLLPCLTRTSKRPEESVQETLAAAVPKIMASFGNFANDNEIKVLLKAFIANLKSSSPTIRRTAAGSAVSICQHSRRTQYFYSWLLNVLLGLLVPVEDEHSTLLILGVLLTLRYLVPLLQQQVKDTSLKGSFGVTRKEMEVSPSAEQLVQVYELTLHHTQHQDHNVVTGALELLQQLFRTPPPELLQTLTAVGGIGQLTAAKEESGGRSRSGSIVELIAGGGSSCSPVLSRKQKGKVLLGEEEALEDDSESRSDVSSSALTASVKDEISGELAASSGVSTPGSAGHDIITEQPRSQHTLQADSVDLASCDLTSSATDGDEEDILSHSSSQVSAVPSDPAMDLNDGTQASSPISDSSQTTTEGPDSAVTPSDSSEIVLDGTDNQYLGLQIGQPQDEDEEATGILPDEASEAFRNSSMALQQAHLLKNMSHCRQPSDSSVDKFVLRDEATEPGDQENKPCRIKGDIGQSTDDDSAPLVHCVRLLSASFLLTGGKNVLVPDRDVRVSVKALALSCVGAAVALHPESFFSKLYKVPLDTTEYPEEQYVSDILNYIDHGDPQVRGATAILCGTLICSILSRSRFHVGDWMGTIRTLTGNTFSLADCIPLLRKTLKDESSVTCKLACTAVRNCVMSLCSSSYSELGLQLIIDVLTLRNSSYWLVRTELLETLAEIDFRLVSFLEAKAENLHRGAHHYTGLLKLQERVLNNVVIHLLGDEDPRVRHVAAASLIRLVPKLFYKCDQGQADPVVAVARDQSSVYLKLLMHETQPPSHFSVSTITRIYRGYNLLPSITDVTMENNLSRVIAAVSHELITSTTRALTFGCCEALCLLSTAFPVCIWSLGWHCGVPPLSASDESRKSCTVGMATMILTLLSSAWFPLDLSAHQDALILAGNLLAASAPKSLRSSWASEEEANPAATKQEEVWPALGDRALVPMVEQLFSHLLKVINICAHVLDDVAPGPAIKAALPSLTNPPSLSPIRRKGKEKEPGEQASVPLSPKKGSEASAASRQSDTSGPVTTSKSSSLGSFYHLPSYLKLHDVLKATHANYKVTLDLQNSTEKFGGFLRSALDVLSQILELATLQDIGKCVEEILGYLKSCFSREPMMATVCVQQLLKTLFGTNLASQFDGLSSNPSKSQGRAQRLGSSSVRPGLYHYCFMAPYTHFTQALADASLRNMVQAEQENDTSGWFDVLQKVSTQLKTNLTSVTKNRADKNAIHNHIRLFEPLVIKALKQYTTTTCVQLQKQVLDLLAQLVQLRVNYCLLDSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFFFLVLLSYERYHSKQIIGIPKIIQLCDGIMASGRKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQKEVVVSMLLRLIQYHQVLEMFILVLQQCHKENEDKWKRLSRQIADIILPMLAKQQMHIDSHEALGVLNTLFEILAPSSLRPVDMLLRSMFVTPNTMASVSTVQLWISGILAILRVLISQSTEDIVLSRIQELSFSPYLISCTVINRLRDGDSTSTLEEHSEGKQIKNLPEETFSRFLLQLVGILLEDIVTKQLKVEMSEQQHTFYCQELGTLLMCLIHIFKSGMFRRITAAATRLFRSDGCGGSFYTLDSLNLRARSMITTHPALVLLWCQILLLVNHTDYRWWAEVQQTPKRHSLSSTKLLSPQMSGEEEDSDLAAKLGMCNREIVRRGALILFCDYVCQNLHDSEHLTWLIVNHIQDLISLSHEPPVQDFISAVHRNSAASGLFIQAIQSRCENLSTPTMLKKTLQCLEGIHLSQSGAVLTLYVDRLLCTPFRVLARMVDILACRRVEMLLAANLQSSMAQLPMEELNRIQEYLQSSGLAQRHQRLYSLLDRFRLSTMQDSLSPSPPVSSHPLDGDGHVSLETVSPDKDWYVHLVKSQCWTRSDSALLEGAELVNRIPAEDMNAFMMNSEFNLSLLAPCLSLGMSEISGGQKSALFEAAREVTLARVSGTVQQLPAVHHVFQPELPAEPAAYWSKLNDLFGDAALYQSLPTLARALAQYLVVVSKLPSHLHLPPEKEKDIVKFVVATLEALSWHLIHEQIPLSLDLQAGLDCCCLALQLPGLWSVVSSTEFVTHACSLIYCVHFILEAVAVQPGEQLLSPERRTNTPKAISEEEEEVDPNTQNPKYITAACEMVAEMVESLQSVLALGHKRNSGVPAFLTPLLRNIIISLARLPLVNSYTRVPPLVWKLGWSPKPGGDFGTAFPEIPVEFLQEKEVFKEFIYRINTLGWTSRTQFEETWATLLGVLVTQPLVMEQEESPPEEDTERTQINVLAVQAITSLVLSAMTVPVAGNPAVSCLEQQPRNKPLKALDTRFGRKLSIIRGIVEQEIQAMVSKRENIATHHLYQAWDPVPSLSPATTGALISHEKLLLQINPERELGSMSYKLGQVSIHSVWLGNSITPLREEEWDEEEEEEADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFLLELYSRWILPSSSARRTPAILISEVVRSLLVVSDLFTERNQFELMYVTLTELRRVHPSEDEILAQYLVPATCKAAAVLGMDKAVAEPVSRLLESTLRSSHLPSRVGALHGVLYVLECDLLDDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCATAFYLIENYPLDVGPEFSASIIQMCGVMLSGSEESTPSIIYHCALRGLERLLLSEQLSRLDAESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKVSPGRTSDPNPAAPDSESVIVAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDIMNKVIGEFLSNQQPYPQFMATVVYKVFQTLHSTGQSSMVRDWVMLSLSNFTQRAPVAMATWSLSCFFVSASTSPWVAAILPHVISRMGKLEQVDVNLFCLVATDFYRHQIEEELDRRAFQSVLEVVAAPGSPYHRLLTCLRNVHKVTTC,mutated_sequence,1.0,3142.0,UPI000013D567.a2m,UPI000013D567.npy,gnomAD
+UPI0000140E19,UPI0000140E19.csv,MILNKALLLGALALTTVMSPCGGEDIVADHVASCGVNLYQFYGPSGQYTHEFDGDEQFYVDLERKETAWRWPEFSKFGGFDPQGALRNMAVAKHNLNIMIKRYNSTAATNEVPEVTVFSKSPVTLGQPNTLICLVDNIFPPVVNITWLSNGQSVTEGVSETSFLSKSDHSFFKISYLTFLPSADEIYDCKVEHWGLDQPLLKHWEPEIPAPMSELTETVVCALGLSVGLMGIVVGTVFIIQGLRSVGASRHQGPL,mutated_sequence,1.0,255.0,UPI0000140E19.a2m,UPI0000140E19.npy,gnomAD
+UPI0000141657,UPI0000141657.csv,MDPQPPPPAQGSPPHRGRGRGRGRGRGRGRGRGRGGAGAPRAPLPCPTCGRLFRFPYYLSRHRLSHSGLRPHACPLCPKAFRRPAHLSRHLRGHGPQPPLRCAACPRTFPEPAQLRRHLAQEHAGGEVELAIERVAKETAEPSWGPQDEGSEPPTTAAAGATEEEAVAAWPETWPAGEPSTLAAPTSAAEPRESESEEAEAGAAELRAELALAAGRQEEKQVLLQADWTLLCLRCREAFATKGELKAHPCLRPEGEQEGEGGPPPRPKRHQCSICLKAFARPWSLSRHRLVHSTDRPFVCPDCGLAFRLASYLRQHRRVHGPLSLLAPLPAAGKKDDKASGARNSAKGPEGGEGAECGGASEGGEGQNGGDAAPARPPAGEPRFWCPECGKGFRRRAHLRQHGVTHSGARPFQCVRCQREFKRLADLARHAQVHAGGPAPHPCPRCPRRFSRAYSLLRHQRCHRAELERAAALQALQAQAPTSPPPPPPPLKAEQEEEGLPLPLANIKEEPPSPGTPPQSPPAPPVFLSASCFDSQDHSAFEMEEEEVDSKAHLRGLGGLAS,mutated_sequence,1.0,562.0,UPI0000141657.a2m,UPI0000141657.npy,gnomAD
+UPI000013D783,UPI000013D783.csv,MGSGPLSLPLALSPPRLLLLLLLSLLPVARASEAEHRLFERLFEDYNEIIRPVANVSDPVIIHFEVSMSQLVKVDEVNQIMETNLWLKQIWNDYKLKWNPSDYGGAEFMRVPAQKIWKPDIVLYNNAVGDFQVDDKTKALLKYTGEVTWIPPAIFKSSCKIDVTYFPFDYQNCTMKFGSWSYDKAKIDLVLIGSSMNLKDYWESGEWAIIKAPGYKHDIKYNCCEEIYPDITYSLYIRRLPLFYTINLIIPCLLISFLTVLVFYLPSDCGEKVTLCISVLLSLTVFLLVITETIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHYRTPTTHTMPSWVKTVFLNLLPRVMFMTRPTSNEGNAQKPRPLYGAELSNLNCFSRAESKGCKEGYPCQDGMCGYCHHRRIKISNFSANLTRSSSSESVDAVLSLSALSPEIKEAIQSVKYIAENMKAQNEAKEIQDDWKYVAMVIDRIFLWVFTLVCILGTAGLFLQPLMAREDA,mutated_sequence,1.0,505.0,UPI000013D783.a2m,UPI000013D783.npy,gnomAD
+UPI000013EF6E,UPI000013EF6E.csv,MLTVALLALLCASASGNAIQARSSSYSGEYGGGGGKRFSHSGNQLDGPITALRVRVNTYYIVGLQVRYGKVWSDYVGGRNGDLEEIFLHPGESVIQVSGKYKWYLKKLLFVTDKGRYLSFGKDSGTSFNAVPLHPNTVLRFISGRSGSLIDAIGLHWDVYPSSCSRC,mutated_sequence,1.0,167.0,UPI000013EF6E.a2m,UPI000013EF6E.npy,gnomAD
+UPI00001AEBA4,UPI00001AEBA4.csv,MGGALGPALLLTSLFGAWAGLGPGQGEQGMTVAVVFSSSGPPQAQFRARLTPQSFLDLPLEIQPLTVGVNTTNPSSLLTQICGLLGAAHVHGIVFEDNVDTEAVAQILDFISSQTHVPILSISGGSAVVLTPKEPGSAFLQLGVSLEQQLQVLFKVLEEYDWSAFAVITSLHPGHALFLEGVRAVADASHVSWRLLDVVTLELGPGGPRARTQRLLRQLDAPVFVAYCSREEAEVLFAEAAQAGLVGPGHVWLVPNLALGSTDAPPATFPVGLISVVTESWRLSLRQKVRDGVAILALGAHSYWRQHGTLPAPAGDCRVHPGPVSPAREAFYRHLLNVTWEGRDFSFSPGGYLVQPTMVVIALNRHRLWEMVGRWEHGVLYMKYPVWPRYSASLQPVVDSRHLTVATLEERPFVIVESPDPGTGGCVPNTVPCRRQSNHTFSSGDVAPYTKLCCKGFCIDILKKLARVVKFSYDLYLVTNGKHGKRVRGVWNGMIGEVYYKRADMAIGSLTINEERSEIVDFSVPFVETGISVMVARSNGTVSPSAFLEPYSPAVWVMMFVMCLTVVAITVFMFEYFSPVSYNQNLTRGKKSGGPAFTIGKSVWLLWALVFNNSVPIENPRGTTSKIMVLVWAFFAVIFLASYTANLAAFMIQEQYIDTVSGLSDKKFQRPQDQYPPFRFGTVPNGSTERNIRSNYRDMHTHMVKFNQRSVEDALTSLKMGKLDAFIYDAAVLNYMAGKDEGCKLVTIGSGKVFATTGYGIAMQKDSHWKRAIDLALLQFLGDGETQKLETVWLSGICQNEKNEVMSSKLDIDNMAGVFYMLLVAMGLALLVFAWEHLVYWKLRHSVPNSSQLDFLLAFSRGIYSCFSGVQSLASPPRQASPDLTASSAQASVLKMLQAARDMVTTAGVSSSLDRATRTIENWGGGRRAPPPSPCPTPRSGPSPCLPTPDPPPEPSPTGWGPPDGGRAALVRRAPQPPGRPPTPGPPLSDVSRVSRRPAWEARWPVRTGHCGRHLSASERPLSPARCHYSSFPRADRSGRPFLPLFPELEDLPLLGPEQLARREALLHAAWARGSRPRHASLPSSVAEAFARPSSLPAGCTGPACARPDGHSACRRLAQAQSMCLPIYREACQEGEQAGAPAWQHRQHVCLHAHAHLPFCWGAVCPHLPPCASHGSWLSGAWGPLGHRGRTLGLGTGYRDSGGLDEISRVARGTQGFPGPCTWRRISSLESEV,mutated_sequence,1.0,1233.0,UPI00001AEBA4.a2m,UPI00001AEBA4.npy,gnomAD
+UPI00000711D1,UPI00000711D1.csv,MDGDGDPESVGQPEEASPEEQPEEASAEEERPEDQQEEEAAAAAAYLDELPEPLLLRVLAALPAAELVQACRLVCLRWKELVDGAPLWLLKCQQEGLVPEGGVEEERDHWQQFYFLSKRRRNLLRNPCGEEDLEGWCDVEHGGDGWRVEELPGDSGVEFTHDESVKKYFASSFEWCRKAQVIDLQAEGYWEELLDTTQPAIVVKDWYSGRSDAGCLYELTVKLLSEHENVLAEFSSGQVAVPQDSDGGGWMEISHTFTDYGPGVRFVRFEHGGQDSVYWKGWFGARVTNSSVWVEP,mutated_sequence,1.0,296.0,UPI00000711D1.a2m,UPI00000711D1.npy,gnomAD
+UPI00001313D9,UPI00001313D9.csv,MAAYSYRPGPGAGPGPAAGAALPDQSFLWNVFQRVDKDRSGVISDTELQQALSNGTWTPFNPVTVRSIISMFDRENKAGVNFSEFTGVWKYITDWQNVFRTYDRDNSGMIDKNELKQALSGFGYRLSDQFHDILIRKFDRQGRGQIAFDDFIQGCIVLQRLTDIFRRYDTDQDGWIQVSYEQYLSMVFSIV,mutated_sequence,1.0,191.0,UPI00001313D9.a2m,UPI00001313D9.npy,gnomAD
+UPI0000D6DA73,UPI0000D6DA73.csv,MSEARGEPGSGPEAGARFFCTAGRGLEPFVMREVRARLAATQVEYISGKVFFTTCSDLNMLKKLKSAERLFLLIKKQFPLIISSVSKGKIFNEMQRLINEDPGSWLNAISIWKNLLELDAKKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQKIEENRDCQLEKQIKEETLEQRDFTTKSEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCRCSGTIGKAFTAQEVGKVIGIAIMKHFGWKADLRNPQLEIFIHLNDIYSVVGIPVFRVSLASRAYIKTAGLRSTIAWAMASLADIKAGAFVLDPMCGLGTILLEAAKEWPDVYYVGADVSDSQLLGTWDNLKAAGLEDKIELLKISVIELPLPSESVDIIISDIPFGKKFKLGKDIKSILQEMERVLHVGGTIVLLLSEDHHRRLTDCKESNIPFNSKDSHTDEPGIKKCLNPEEKTGAFKTASTSFEASNHKFLDRMSPFGSLVPVECYKVSLGKTDAFICKYKKSHSSGL,mutated_sequence,1.0,503.0,UPI0000D6DA73.a2m,UPI0000D6DA73.npy,gnomAD
+UPI00015E040A,UPI00015E040A.csv,MPEDLVTFDDVAWYLTTREWFKLDPEQRALEYYGNVTSVAGVPVLNPALVPHLAQGQVLLVSDPSPNTDPAKYSESTSATRHQMKGEDAQPQEMASTSFPRASGPSPEFRQHGDSDGKRGSPQNLPIEHHFACKECGDTFRLKVLLVQHQRVHSEEKGWECGDCGKVFRGVAEFNEHRKSHVAAEPQPGPSRALENAAEKREQMEREAKPFECEECGKRFKKNAGLSQHLRVHSREKPFDCEECGRSFKVNTHLFRHQKLHTSEKPFACKACSRDFLDRQELLKHQRMHTGHLPFDCDDCGKSFRGVNGLAEHQRIHSGAKPYGCPHCGKLFRRSSELTKHRRIHTGEKPYACGQCGKAFRQSSSLLEHARIHSGERPYACGECGKAFRGPSDLIKHRRIHSGLKPYECDKCGKAFRRSSGLSRHRRIHSGARRCECSQCGRVFKRRSALQKHQPTHHE,mutated_sequence,1.0,459.0,UPI00015E040A.a2m,UPI00015E040A.npy,gnomAD
+UPI000065CA24,UPI000065CA24.csv,MLKPSGLPGSSSPTRSLMTGSRSTKATPEMDSGLTGATLSPKTSTGAIVVTEHTLPFTSPDKTLASPTSSVVGRTTQSLGVMSSALPESTSRGMTHSEQRTSPSLSPQVNGTPSRNYPATSMVSGLSSPRTRTSSTEGNFTKEASTYTLTVETTSGPVTEKYTVPTETSTTEGDSTETPWDTRYIPVKITSPMKTFADSTASKENAPVSMTPAETTVTDSHTPGRTNPSFGTLYSSFLDLSPKGTPNSRGETSLELILSTTGYPFSSPEPGSAGHSRISTSAPLSSSASVLDNKISETSIFSGQSLTSPLSPGVPEARASTMPNSAIPFSMTLSNAETSAERVRSTISSLGTPSISTKQTAETILTFHAFAETMDIPSTHIAKTLASEWLGSPGTLGGTSTSALTTTSPSTTLVSEETNTHHSTSGKETEGTLNTSMTPLETSAPGEESEMTATLVPTLGFTTLDSKIRSPSQVSSSHPTRELRTTGSTSGRQSSSTAAHGSSDILRATTSSTSKASSWTSESTAQQFSEPQHTQWVETSPSMKTERPPASTSVAAPITTSVPSVVSGFTTLKTSSTKGIWLEETSADTLIGESTAGPTTHQFAVPTGISMTGGSSTRGSQGTTHLLTRATASSETSADLTLATNGVPVSVSPAVSKTAAGSSPPGGTKPSYTMVSSVIPETSSLQSSAFREGTSLGLTPLNTRHPFSSPEPDSAGHTKISTSIPLLSSASVLEDKVSATSTFSHHKATSSITTGTPEISTKTKPSSAVLSSMTLSNAATSPERVRNATSPLTHPSPSGEETAGSVLTLSTSAETTDSPNIHPTGTLTSESSESPSTLSLPSVSGVKTTFSSSTPSTHLFTSGEETEETSNPSVSQPETSVSRVRTTLASTSVPTPVFPTMDTWPTRSAQFSSSHLVSELRATSSTSVTNSTGSALPKISHLTGTATMSQTNRDTFNDSAAPQSTTWPETSPRFKTGLPSATTTVSTSATSLSATVMVSKFTSPATSSMEATSIREPSTTILTTETTNGPGSMAVASTNIPIGKGYITEGRLDTSHLPIGTTASSETSMDFTMAKESVSMSVSPSQSMDAAGSSTPGRTSQFVDTFSDDVYHLTSREITIPRDGTSSALTPQMTATHPPSPDPGSARSTWLGILSSSPSSPTPKVTMSSTFSTQRVTTSMIMDTVETSRWNMPNLPSTTSLTPSNIPTSGAIGKSTLVPLDTPSPATSLEASEGGLPTLSTYPESTNTPSIHLGAHASSESPSTIKLTMASVVKPGSYTPLTFPSIETHIHVSTARMAYSSGSSPEMTAPGETNTGSTWDPTTYITTTDPKDTSSAQVSTPHSVRTLRTTENHPKTESATPAAYSGSPKISSSPNLTSPATKAWTITDTTEHSTQLHYTKLAEKSSGFETQSAPGPVSVVIPTSPTIGSSTLELTSDVPGEPLVLAPSEQTTITLPMATWLSTSLTEEMASTDLDISSPSSPMSTFAIFPPMSTPSHELSKSEADTSAIRNTDSTTLDQHLGIRSLGRTGDLTTVPITPLTTTWTSVIEHSTQAQDTLSATMSPTHVTQSLKDQTSIPASASPSHLTEVYPELGTQGRSSSEATTFWKPSTDTLSREIETGPTNIQSTPPMDNTTTGSSSSGVTLGIAHLPIGTSSPAETSTNMALERRSSTATVSMAGTMGLLVTSAPGRSISQSLGRVSSVLSESTTEGVTDSSKGSSPRLNTQGNTALSSSLEPSYAEGSQMSTSIPLTSSPTTPDVEFIGGSTFWTKEVTTVMTSDISKSSARTESSSATLMSTALGSTENTGKEKLRTASMDLPSPTPSMEVTPWISLTLSNAPNTTDSLDLSHGVHTSSAGTLATDRSLNTGVTRASRLENGSDTSSKSLSMGNSTHTSMTYTEKSEVSSSIHPRPETSAPGAETTLTSTPGNRAISLTLPFSSIPVEEVISTGITSGPDINSAPMTHSPITPPTIVWTSTGTIEQSTQPLHAVSSEKVSVQTQSTPYVNSVAVSASPTHENSVSSGSSTSSPYSSASLESLDSTISRRNAITSWLWDLTTSLPTTTWPSTSLSEALSSGHSGVSNPSSTTTEFPLFSAASTSAAKQRNPETETHGPQNTAASTLNTDASSVTGLSETPVGASISSEVPLPMAITSRSDVSGLTSESTANPSLGTASSAGTKLTRTISLPTSESLVSFRMNKDPWTVSIPLGSHPTTNTETSIPVNSAGPPGLSTVASDVIDTPSDGAESIPTVSFSPSPDTEVTTISHFPEKTTHSFRTISSLTHELTSRVTPIPGDWMSSAMSTKPTGASPSITLGERRTITSAAPTTSPIVLTASFTETSTVSLDNETTVKTSDILDARKTNELPSDSSSSSDLINTSIASSTMDVTKTASISPTSISGMTASSSPSLFSSDRPQVPTSTTETNTATSPSVSSNTYSLDGGSNVGGTPSTLPPFTITHPVETSSALLAWSRPVRTFSTMVSTDTASGENPTSSNSVVTSVPAPGTWTSVGSTTDLPAMGFLKTSPAGEAHSLLASTIEPATAFTPHLSAAVVTGSSATSEASLLTTSESKAIHSSPQTPTTPTSGANWETSATPESLLVVTETSDTTLTSKILVTDTILFSTVSTPPSKFPSTGTLSGASFPTLLPDTPAIPLTATEPTSSLATSFDSTPLVTIASDSLGTVPETTLTMSETSNGDALVLKTVSNPDRSIPGITIQGVTESPLHPSSTSPSKIVAPRNTTYEGSITVALSTLPAGTTGSLVFSQSSENSETTALVDSSAGLERASVMPLTTGSQGMASSGGIRSGSTHSTGTKTFSSLPLTMNPGEVTAMSEITTNRLTATQSTAPKGIPVKPTSAESGLLTPVSASSSPSKAFASLTTAPPTWGIPQSTLTFEFSEVPSLDTKSASLPTPGQSLNTIPDSDASTASSSLSKSPEKNPRARMMTSTKAISASSFQSTGFTETPEGSASPSMAGHEPRVPTSGTGDPRYASESMSYPDPSKASSAMTSTSLASKLTTLFSTGQAARSGSSSSPISLSTEKETSFLSPTASTSRKTSLFLGPSMARQPNILVHLQTSALTLSPTSTLNMSQEEPPELTSSQTIAEEEGTTAETQTLTFTPSETPTSLLPVSSPTEPTARRKSSPETWASSISVPAKTSLVETTDGTLVTTIKMSSQAAQGNSTWPAPAEETGSSPAGTSPGSPEMSTTLKIMSSKEPSISPEIRSTVRNSPWKTPETTVPMETTVEPVTLQSTALGSGSTSISHLPTGTTSPTKSPTENMLATERVSLSPSPPEAWTNLYSGTPGGTRQSLATMSSVSLESPTARSITGTGQQSSPELVSKTTGMEFSMWHGSTGGTTGDTHVSLSTSSNILEDPVTSPNSVSSLTDKSKHKTETWVSTTAIPSTVLNNKIMAAEQQTSRSVDEAYSSTSSWSDQTSGSDITLGASPDVTNTLYITSTAQTTSLVSLPSGDQGITSLTNPSGGKTSSASSVTSPSIGLETLRANVSAVKSDIAPTAGHLSQTSSPAEVSILDVTTAPTPGISTTITTMGTNSISTTTPNPEVGMSTMDSTPATERRTTSTEHPSTWSSTAASDSWTVTDMTSNLKVARSPGTISTMHTTSFLASSTELDSMSTPHGRITVIGTSLVTPSSDASAVKTETSTSERTLSPSDTTASTPISTFSRVQRMSISVPDILSTSWTPSSTEAEDVPVSMVSTDHASTKTDPNTPLSTFLFDSLSTLDWDTGRSLSSATATTSAPQGATTPQELTLETMISPATSQLPFSIGHITSAVTPAAMARSSGVTFSRPDPTSKKAEQTSTQLPTTTSAHPGQVPRSAATTLDVIPHTAKTPDATFQRQGQTALTTEARATSDSWNEKEKSTPSAPWITEMMNSVSEDTIKEVTSSSSVLRTLNTLDINLESGTTSSPSWKSSPYERIAPSESTTDKEAIHPSTNTVETTGWVTSSEHASHSTIPAHSASSKLTSPVVTTSTREQAIVSMSTTTWPESTRARTEPNSFLTIELRDVSPYMDTSSTTQTSIISSPGSTAITKGPRTEITSSKRISSSFLAQSMRSSDSPSEAITRLSNFPAMTESGGMILAMQTSPPGATSLSAPTLDTSATASWTGTPLATTQRFTYSEKTTLFSKGPEDTSQPSPPSVEETSSSSSLVPIHATTSPSNILLTSQGHSPSSTPPVTSVFLSETSGLGKTTDMSRISLEPGTSLPPNLSSTAGEALSTYEASRDTKAIHHSADTAVTNMEATSSEYSPIPGHTKPSKATSPLVTSHIMGDITSSTSVFGSSETTEIETVSSVNQGLQERSTSQVASSATETSTVITHVSSGDATTHVTKTQATFSSGTSISSPHQFITSTNTFTDVSTNPSTSLIMTESSGVTITTQTGPTGAATQGPYLLDTSTMPYLTETPLAVTPDFMQSEKTTLISKGPKDVSWTSPPSVAETSYPSSLTPFLVTTIPPATSTLQGQHTSSPVSATSVLTSGLVKTTDMLNTSMEPVTNSPQNLNNPSNEILATLAATTDIETIHPSINKAVTNMGTASSAHVLHSTLPVSSEPSTATSPMVPASSMGDALASISIPGSETTDIEGEPTSSLTAGRKENSTLQEMNSTTESNIILSNVSVGAITEATKMEVPSFDATFIPTPAQSTKFPDIFSVASSRLSNSPPMTISTHMTTTQTGSSGATSKIPLALDTSTLETSAGTPSVVTEGFAHSKITTAMNNDVKDVSQTNPPFQDEASSPSSQAPVLVTTLPSSVAFTPQWHSTSSPVSMSSVLTSSLVKTAGKVDTSLETVTSSPQSMSNTLDDISVTSAATTDIETTHPSINTVVTNVGTTGSAFESHSTVSAYPEPSKVTSPNVTTSTMEDTTISRSIPKSSKTTRTETETTSSLTPKLRETSISQEITSSTETSTVPYKELTGATTEVSRTDVTSSSSTSFPGPDQSTVSLDISTETNTRLSTSPIMTESAEITITTQTGPHGATSQDTFTMDPSNTTPQAGIHSAMTHGFSQLDVTTLMSRIPQDVSWTSPPSVDKTSSPSSFLSSPAMTTPSLISSTLPEDKLSSPMTSLLTSGLVKITDILRTRLEPVTSSLPNFSSTSDKILATSKDSKDTKEIFPSINTEETNVKANNSGHESHSPALADSETPKATTQMVITTTVGDPAPSTSMPVHGSSETTNIKREPTYFLTPRLRETSTSQESSFPTDTSFLLSKVPTGTITEVSSTGVNSSSKISTPDHDKSTVPPDTFTGEIPRVFTSSIKTKSAEMTITTQASPPESASHSTLPLDTSTTLSQGGTHSTVTQGFPYSEVTTLMGMGPGNVSWMTTPPVEETSSVSSLMSSPAMTSPSPVSSTSPQSIPSSPLPVTALPTSVLVTTTDVLGTTSPESVTSSPPNLSSITHERPATYKDTAHTEAAMHHSTNTAVTNVGTSGSGHKSQSSVLADSETSKATPLMSTTSTLGDTSVSTSTPNISQTNQIQTEPTASLSPRLRESSTSEKTSSTTETNTAFSYVPTGAITQASRTEISSSRTSISDLDRPTIAPDISTGMITRLFTSPIMTKSAEMTVTTQTTTPGATSQGILPWDTSTTLFQGGTHSTVSQGFPHSEITTLRSRTPGDVSWMTTPPVEETSSGFSLMSPSMTSPSPVSSTSPESIPSSPLPVTALLTSVLVTTTNVLGTTSPEPVTSSPPNLSSPTQERLTTYKDTAHTEAMHASMHTNTAVANVGTSISGHESQSSVPADSHTSKATSPMGITFAMGDTSVSTSTPAFFETRIQTESTSSLIPGLRDTRTSEEINTVTETSTVLSEVPTTTTTEVSRTEVITSSRTTISGPDHSKMSPYISTETITRLSTFPFVTGSTEMAITNQTGPIGTISQATLTLDTSSTASWEGTHSPVTQRFPHSEETTTMSRSTKGVSWQSPPSVEETSSPSSPVPLPAITSHSSLYSAVSGSSPTSALPVTSLLTSGRRKTIDMLDTHSELVTSSLPSASSFSGEILTSEASTNTETIHFSENTAETNMGTTNSMHKLHSSVSIHSQPSGHTPPKVTGSMMEDAIVSTSTPGSPETKNVDRDSTSPLTPELKEDSTALVMNSTTESNTVFSSVSLDAATEVSRAEVTYYDPTFMPASAQSTKSPDISPEASSSHSNSPPLTISTHKTIATQTGPSGVTSLGQLTLDTSTIATSAGTPSARTQDFVDSETTSVMNNDLNDVLKTSPFSAEEANSLSSQAPLLVTTSPSPVTSTLQEHSTSSLVSVTSVPTPTLAKITDMDTNLEPVTRSPQNLRNTLATSEATTDTHTMHPSINTAVANVGTTSSPNEFYFTVSPDSDPYKATSAVVITSTSGDSIVSTSMPRSSAMKKIESETTFSLIFRLRETSTSQKIGSSSDTSTVFDKAFTAATTEVSRTELTSSSRTSIQGTEKPTMSPDTSTRSVTMLSTFAGLTKSEERTIATQTGPHRATSQGTLTWDTSITTSQAGTHSAMTHGFSQLDLSTLTSRVPEYISGTSPPSVEKTSSSSSLLSLPAITSPSPVPTTLPESRPSSPVHLTSLPTSGLVKTTDMLASVASLPPNLGSTSHKIPTTSEDIKDTEKMYPSTNIAVTNVGTTTSEKESYSSVPAYSEPPKVTSPMVTSFNIRDTIVSTSMPGSSEITRIEMESTFSLAHGLKGTSTSQDPIVSTEKSAVLHKLTTGATETSRTEVASSRRTSIPGPDHSTESPDISTEVIPSLPISLGITESSNMTIITRTGPPLGSTSQGTFTLDTPTTSSRAGTHSMATQEFPHSEMTTVMNKDPEILSWTIPPSIEKTSFSSSLMPSPAMTSPPVSSTLPKTIHTTPSPMTSLLTPSLVMTTDTLGTSPEPTTSSPPNLSSTSHEILTTDEDTTAIEAMHPSTSTAATNVETTSSGHGSQSSVLADSEKTKATAPMDTTSTMGHTTVSTSMSVSSETTKIKRESTYSLTPGLRETSISQNASFSTDTSIVLSEVPTGTTAEVSRTEVTSSGRTSIPGPSQSTVLPEISTRTMTRLFASPTMTESAEMTIPTQTGPSGSTSQDTLTLDTSTTKSQAKTHSTLTQRFPHSEMTTLMSRGPGDMSWQSSPSLENPSSLPSLLSLPATTSPPPISSTLPVTISSSPLPVTSLLTSSPVTTTDMLHTSPELVTSSPPKLSHTSDERLTTGKDTTNTEAVHPSTNTAASNVEIPSSGHESPSSALADSETSKATSPMFITSTQEDTTVAISTPHFLETSRIQKESISSLSPKLRETGSSVETSSAIETSAVLSEVSIGATTEISRTEVTSSSRTSISGSAESTMLPEISTTRKIIKFPTSPILAESSEMTIKTQTSPPGSTSESTFTLDTSTTPSLVITHSTMTQRLPHSEITTLVSRGAGDVPRPSSLPVEETSPPSSQLSLSAMISPSPVSSTLPASSHSSSASVTSLLTPGQVKTTEVLDASAEPETSSPPSLSSTSVEILATSEVTTDTEKIHPFSNTAVTKVGTSSSGHESPSSVLPDSETTKATSAMGTISIMGDTSVSTLTPALSNTRKIQSEPASSLTTRLRETSTSEETSLATEANTVLSKVSTGATTEVSRTEAISFSRTSMSGPEQSTMSQDISIGTIPRISASSVLTESAKMTITTQTGPSESTLESTLNLNTATTPSWVETHSIVIQGFPHPEMTTSMGRGPGGVSWPSPPFVKETSPPSSPLSLPAVTSPHPVSTTFLAHIPPSPLPVTSLLTSGPATTTDILGTSTEPGTSSSSSLSTTSHERLTTYKDTAHTEAVHPSTNTGGTNVATTSSGYKSQSSVLADSSPMCTTSTMGDTSVLTSTPAFLETRRIQTELASSLTPGLRESSGSEGTSSGTKMSTVLSKVPTGATTEISKEDVTSIPGPAQSTISPDISTRTVSWFSTSPVMTESAEITMNTHTSPLGATTQGTSTLDTSSTTSLTMTHSTISQGFSHSQMSTLMRRGPEDVSWMSPPLLEKTRPSFSLMSSPATTSPSPVSSTLPESISSSPLPVTSLLTSGLAKTTDMLHKSSEPVTNSPANLSSTSVEILATSEVTTDTEKTHPSSNRTVTDVGTSSSGHESTSFVLADSQTSKVTSPMVITSTMEDTSVSTSTPGFFETSRIQTEPTSSLTLGLRKTSSSEGTSLATEMSTVLSGVPTGATAEVSRTEVTSSSRTSISGFAQLTVSPETSTETITRLPTSSIMTESAEMMIKTQTDPPGSTPESTHTVDISTTPNWVETHSTVTQRFSHSEMTTLVSRSPGDMLWPSQSSVEETSSASSLLSLPATTSPSPVSSTLVEDFPSASLPVTSLLNPGLVITTDRMGISREPGTSSTSNLSSTSHERLTTLEDTVDTEDMQPSTHTAVTNVRTSISGHESQSSVLSDSETPKATSPMGTTYTMGETSVSISTSDFFETSRIQIEPTSSLTSGLRETSSSERISSATEGSTVLSEVPSGATTEVSRTEVISSRGTSMSGPDQFTISPDISTEAITRLSTSPIMTESAESAITIETGSPGATSEGTLTLDTSTTTFWSGTHSTASPGFSHSEMTTLMSRTPGDVPWPSLPSVEEASSVSSSLSSPAMTSTSFFSTLPESISSSPHPVTALLTLGPVKTTDMLRTSSEPETSSPPNLSSTSAEILATSEVTKDREKIHPSSNTPVVNVGTVIYKHLSPSSVLADLVTTKPTSPMATTSTLGNTSVSTSTPAFPETMMTQPTSSLTSGLREISTSQETSSATERSASLSGMPTGATTKVSRTEALSLGRTSTPGPAQSTISPEISTETITRISTPLTTTGSAEMTITPKTGHSGASSQGTFTLDTSSRASWPGTHSAATHRSPHSGMTTPMSRGPEDVSWPSRPSVEKTSPPSSLVSLSAVTSPSPLYSTPSESSHSSPLRVTSLFTPVMMKTTDMLDTSLEPVTTSPPSMNITSDESLATSKATMETEAIQLSENTAVTQMGTISARQEFYSSYPGLPEPSKVTSPVVTSSTIKDIVSTTIPASSEITRIEMESTSTLTPTPRETSTSQEIHSATKPSTVPYKALTSATIEDSMTQVMSSSRGPSPDQSTMSQDISTEVITRLSTSPIKTESTEMTITTQTGSPGATSRGTLTLDTSTTFMSGTHSTASQGFSHSQMTALMSRTPGDVPWLSHPSVEEASSASFSLSSPVMTSSSPVSSTLPDSIHSSSLPVTSLLTSGLVKTTELLGTSSEPETSSPPNLSSTSAEILAITEVTTDTEKLEMTNVVTSGYTHESPSSVLADSVTTKATSSMGITYPTGDTNVLTSTPAFSDTSRIQTKSKLSLTPGLMETSISEETSSATEKSTVLSSVPTGATTEVSRTEAISSSRTSIPGPAQSTMSSDTSMETITRISTPLTRKESTDMAITPKTGPSGATSQGTFTLDSSSTASWPGTHSATTQRFPQSVVTTPMSRGPEDVSWPSPLSVEKNSPPSSLVSSSSVTSPSPLYSTPSGSSHSSPVPVTSLFTSIMMKATDMLDASLEPETTSAPNMNITSDESLAASKATTETEAIHVFENTAASHVETTSATEELYSSSPGFSEPTKVISPVVTSSSIRDNMVSTTMPGSSGITRIEIESMSSLTPGLRETRTSQDITSSTETSTVLYKMPSGATPEVSRTEVMPSSRTSIPGPAQSTMSLDISDEVVTRLSTSPIMTESAEITITTQTGYSLATSQVTLPLGTSMTFLSGTHSTMSQGLSHSEMTNLMSRGPESLSWTSPRFVETTRSSSSLTSLPLTTSLSPVSSTLLDSSPSSPLPVTSLILPGLVKTTEVLDTSSEPKTSSSPNLSSTSVEIPATSEIMTDTEKIHPSSNTAVAKVRTSSSVHESHSSVLADSETTITIPSMGITSAVDDTTVFTSNPAFSETRRIPTEPTFSLTPGFRETSTSEETTSITETSAVLYGVPTSATTEVSMTEIMSSNRIHIPDSDQSTMSPDIITEVITRLSSSSMMSESTQMTITTQKSSPGATAQSTLTLATTTAPLARTHSTVPPRFLHSEMTTLMSRSPENPSWKSSLFVEKTSSSSSLLSLPVTTSPSVSSTLPQSIPSSSFSVTSLLTPGMVKTTDTSTEPGTSLSPNLSGTSVEILAASEVTTDTEKIHPSSSMAVTNVGTTSSGHELYSSVSIHSEPSKATYPVGTPSSMAETSISTSMPANFETTGFEAEPFSHLTSGFRKTNMSLDTSSVTPTNTPSSPGSTHLLQSSKTDFTSSAKTSSPDWPPASQYTEIPVDIITPFNASPSITESTGITSFPESRFTMSVTESTHHLSTDLLPSAETISTGTVMPSLSEAMTSFATTGVPRAISGSGSPFSRTESGPGDATLSTIAESLPSSTPVPFSSSTFTTTDSSTIPALHEITSSSATPYRVDTSLGTESSTTEGRLVMVSTLDTSSQPGRTSSSPILDTRMTESVELGTVTSAYQVPSLSTRLTRTDGIMEHITKIPNEAAHRGTIRPVKGPQTSTSPASPKGLHTGGTKRMETTTTALKTTTTALKTTSRATLTTSVYTPTLGTLTPLNASMQMASTIPTEMMITTPYVFPDVPETTSSLATSLGAETSTALPRTTPSVFNRESETTASLVSRSGAERSPVIQTLDVSSSEPDTTASWVIHPAETIPTVSKTTPNFFHSELDTVSSTATSHGADVSSAIPTNISPSELDALTPLVTISGTDTSTTFPTLTKSPHETETRTTWLTHPAETSSTIPRTIPNFSHHESDATPSIATSPGAETSSAIPIMTVSPGAEDLVTSQVTSSGTDRNMTIPTLTLSPGEPKTIASLVTHPEAQTSSAIPTSTISPAVSRLVTSMVTSLAAKTSTTNRALTNSPGEPATTVSLVTHPAQTSPTVPWTTSIFFHSKSDTTPSMTTSHGAESSSAVPTPTVSTEVPGVVTPLVTSSRAVISTTIPILTLSPGEPETTPSMATSHGEEASSAIPTPTVSPGVPGVVTSLVTSSRAVTSTTIPILTFSLGEPETTPSMATSHGTEAGSAVPTVLPEVPGMVTSLVASSRAVTSTTLPTLTLSPGEPETTPSMATSHGAEASSTVPTVSPEVPGVVTSLVTSSSGVNSTSIPTLILSPGELETTPSMATSHGAEASSAVPTPTVSPGVSGVVTPLVTSSRAVTSTTIPILTLSSSEPETTPSMATSHGVEASSAVLTVSPEVPGMVTSLVTSSRAVTSTTIPTLTISSDEPETTTSLVTHSEAKMISAIPTLAVSPTVQGLVTSLVTSSGSETSAFSNLTVASSQPETIDSWVAHPGTEASSVVPTLTVSTGEPFTNISLVTHPAESSSTLPRTTSRFSHSELDTMPSTVTSPEAESSSAISTTISPGIPGVLTSLVTSSGRDISATFPTVPESPHESEATASWVTHPAVTSTTVPRTTPNYSHSEPDTTPSIATSPGAEATSDFPTITVSPDVPDMVTSQVTSSGTDTSITIPTLTLSSGEPETTTSFITYSETHTSSAIPTLPVSPGASKMLTSLVISSGTDSTTTFPTLTETPYEPETTAIQLIHPAETNTMVPRTTPKFSHSKSDTTLPVAITSPGPEASSAVSTTTISPDMSDLVTSLVPSSGTDTSTTFPTLSETPYEPETTATWLTHPAETSTTVSGTIPNFSHRGSDTAPSMVTSPGVDTRSGVPTTTIPPSIPGVVTSQVTSSATDTSTAIPTLTPSPGEPETTASSATHPGTQTGFTVPIRTVPSSEPDTMASWVTHPPQTSTPVSRTTSSFSHSSPDATPVMATSPRTEASSAVLTTISPGAPEMVTSQITSSGAATSTTVPTLTHSPGMPETTALLSTHPRTETSKTFPASTVFPQVSETTASLTIRPGAETSTALPTQTTSSLFTLLVTGTSRVDLSPTASPGVSAKTAPLSTHPGTETSTMIPTSTLSLGLLETTGLLATSSSAETSTSTLTLTVSPAVSGLSSASITTDKPQTVTSWNTETSPSVTSVGPPEFSRTVTGTTMTLIPSEMPTPPKTSHGEGVSPTTILRTTMVEATNLATTGSSPTVAKTTTTFNTLAGSLFTPLTTPGMSTLASESVTSRTSYNHRSWISTTSSYNRRYWTPATSTPVTSTFSPGISTSSIPSSTAATVPFMVPFTLNFTITNLQYEEDMRHPGSRKFNATERELQGLLKPLFRNSSLEYLYSGCRLASLRPEKDSSATAVDAICTHRPDPEDLGLDRERLYWELSNLTNGIQELGPYTLDRNSLYVNGFTHRSSMPTTSTPGTSTVDVGTSGTPSSSPSPTTAGPLLMPFTLNFTITNLQYEEDMRRTGSRKFNTMESVLQGLLKPLFKNTSVGPLYSGCRLTLLRPEKDGAATGVDAICTHRLDPKSPGLNREQLYWELSKLTNDIEELGPYTLDRNSLYVNGFTHQSSVSTTSTPGTSTVDLRTSGTPSSLSSPTIMAAGPLLVPFTLNFTITNLQYGEDMGHPGSRKFNTTERVLQGLLGPIFKNTSVGPLYSGCRLTSLRSEKDGAATGVDAICIHHLDPKSPGLNRERLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHRTSVPTSSTPGTSTVDLGTSGTPFSLPSPATAGPLLVLFTLNFTITNLKYEEDMHRPGSRKFNTTERVLQTLLGPMFKNTSVGLLYSGCRLTLLRSEKDGAATGVDAICTHRLDPKSPGVDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHWIPVPTSSTPGTSTVDLGSGTPSSLPSPTTAGPLLVPFTLNFTITNLKYEEDMHCPGSRKFNTTERVLQSLLGPMFKNTSVGPLYSGCRLTLLRSEKDGAATGVDAICTHRLDPKSPGVDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHQTSAPNTSTPGTSTVDLGTSGTPSSLPSPTSAGPLLVPFTLNFTITNLQYEEDMHHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGAATGMDAICSHRLDPKSPGLNREQLYWELSQLTHGIKELGPYTLDRNSLYVNGFTHRSSVAPTSTPGTSTVDLGTSGTPSSLPSPTTAVPLLVPFTLNFTITNLQYGEDMRHPGSRKFNTTERVLQGLLGPLFKNSSVGPLYSGCRLISLRSEKDGAATGVDAICTHHLNPQSPGLDREQLYWQLSQMTNGIKELGPYTLDRNSLYVNGFTHRSSGLTTSTPWTSTVDLGTSGTPSPVPSPTTTGPLLVPFTLNFTITNLQYEENMGHPGSRKFNITESVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKDGVATRVDAICTHRPDPKIPGLDRQQLYWELSQLTHSITELGPYTLDRDSLYVNGFTQRSSVPTTSTPGTFTVQPETSETPSSLPGPTATGPVLLPFTLNFTITNLQYEEDMRRPGSRKFNTTERVLQGLLMPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTHRPDPKSPGLDRERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTMHLATSRTPASLSGPMTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRPVFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHSITELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLFTLNFTITNLRYEENMQHPGSRKFNTTERVLQGLLRSLFKSTSVGPLYSGCRLTLLRPEKDGTATGVDAICTHHPDPKSPRLDREQLYWELSQLTHNITELGPYALDNDSLFVNGFTHRSSVSTTSTPGTPTVYLGASKTPASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNTTERVLQGLLRPLFKNTSVGPLYSGCRLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQLYLELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGVVSEEPFTLNFTINNLRYMADMGQPGSLKFNITDNVMQHLLSPLFQRSSLGARYTGCRVIALRSVKNGAETRVDLLCTYLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPGPDEPPTTPKPATTFLPPLSEATTAMGYHLKTLTLNFTISNLQYSPDMGKGSATFNSTEGVLQHLLRPLFQKSSMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQLTHGVTQLGFYVLDRDSLFINGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLLRDIQDKVTTLYKGSQLHDTFRFCLVTNLTMDSVLVTVKALFSSNLDPSLVEQVFLDKTLNASFHWLGSTYQLVDIHVTEMESSVYQPTSSSSTQHFYLNFTITNLPYSQDKAQPGTTNYQRNKRNIEDALNQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLCNFSPLARRVDRVAIYEEFLRMTRNGTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIGLAGLLGVITCLICGVLVTTRRRKKEGEYNVQQQCPGYYQSHLDLEDLQ,mutated_sequence,1.0,14507.0,UPI000065CA24.a2m,UPI000065CA24.npy,gnomAD
+UPI000006F76F,UPI000006F76F.csv,MNALLEQKEQQERLREAAALGDIREVQKLVESGVDVNSQNEVNGWTCLHWACKRNHGQVVSYLLKSGADKEILTTKGEMPVQLTSRREIRKIMGVEEEDDDDDDDDNLPQLKKESELPFVPNYLANPAFPFIYTPTAEDSAQMQNGGPSTPPASPPADGSPPLLPPGEPPLLGTFPRDHTSLALVQNGDVSAPSAILRTPESTKPGPVCQPPVSQSRSLFSSVPSKPPMSLEPQNGTYAGPAPAFQPFFFTGAFPFNMQELVLKVRIQNPSLRENDFIEIELDRQELTYQELLRVCCCELGVNPDQVEKIRKLPNTLLRKDKDVARLQDFQELELVLMISENNFLFRNAASTLTERPCYNRRASKLTY,mutated_sequence,1.0,368.0,UPI000006F76F.a2m,UPI000006F76F.npy,gnomAD
+UPI00001BE8EA,UPI00001BE8EA.csv,MTAVGVQAQRPLGQRQPRRSFFESFIRTLIITCVALAVVLSSVSICDGHWLLAEDRLFGLWHFCTTTNQTICFRDLGQAHVPGLAVGMGLVRSVGALAVVAAIFGLEFLMVSQLCEDKHSQCKWVMGSILLLVSFVLSSGGLLGFVILLRNQVTLIGFTLMFWCEFTASFLLFLNAISGLHINSITHPWE,mutated_sequence,1.0,190.0,UPI00001BE8EA.a2m,UPI00001BE8EA.npy,gnomAD
+UPI00001313B6,UPI00001313B6.csv,MQKELGIVPSCPGMKSPRPHLLLPLLLLLLLLLGAGVPGAWGQAGSLDLQIDEEQPAGTLIGDISAGLPAGTAAPLMYFISAQEGSGVGTDLAIDEHSGVVRTARVLDREQRDRYRFTAVTPDGATVEVTVRVADINDHAPAFPQARAALQVPEHTAFGTRYPLEPARDADAGRLGTQGYALSGDGAGETFRLETRPGPDGTPVPELVVTGELDRENRSHYMLQLEAYDGGSPPRRAQALLDVTLLDINDHAPAFNQSRYHAVVSESLAPGSPVLQVFASDADAGVNGAVTYEINRRQSEGDGPFSIDAHTGLLQLERPLDFEQRRVHELVVQARDGGAHPELGSAFVTVHVRDANDNQPSMTVIFLSADGSPQVSEAAPPGQLVARISVSDPDDGDFAHVNVSLEGGEGHFALSTQDSVIYLVCVARRLDREERDAYNLRVTATDSGSPPLRAEAAFVLHVTDVNDNAPAFDRQLYRPEPLPEVALPGSFVVRVTARDPDQGTNGQVTYSLAPGAHTHWFSIDPTSGIITTAASLDYELEPQPQLIVVATDGGLPPLASSATVSVALQDVNDNEPQFQRTFYNASLPEGTQPGTCFLQVTATDADSGPFGLLSYSLGAGLGSSGSPPFRIDAHSGDVCTTRTLDRDQGPSSFDFTVTAVDGGGLKSMVYVKVFLSDENDNPPQFYPREYAASISAQSPPGTAVLRLRAHDPDQGSHGRLSYHILAGNSPPLFTLDEQSGLLTVAWPLARRANSVVQLEIGAEDGGGLQAEPSARVDISIVPGTPTPPIFEQLQYVFSVPEDVAPGTSVGIVQAHNPPGRLAPVTLSLSGGDPRGLFSLDAVSGLLQTLRPLDRELLGPVLELEVRAGSGVPPAFAVARVRVLLDDVNDNSPAFPAPEDTVLLPPNTAPGTPIYTLRALDPDSGVNSRVTFTLLAGGGGAFTVDPTTGHVRLMRPLGPSGGPAHELELEARDGGSPPRTSHFRLRVVVQDVGTRGLAPRFNSPTYRVDLPSGTTAGTQVLQVQAQAPDGGPITYHLAAEGASSPFGLEPQSGWLWVRAALDREAQELYILKVMAVSGSKAELGQQTGTATVRVSILNQNEHSPRLSEDPTFLAVAENQPPGTSVGRVFATDRDSGPNGRLTYSLQQLSEDSKAFRIHPQTGEVTTLQTLDREQQSSYQLLVQVQDGGSPPRSTTGTVHVAVLDLNDNSPTFLQASGAAGGGLPIQVPDRVPPGTLVTTLQAKDPDEGENGTILYTLTGPGSELFSLHPHSGELLTAAPLIRAERPHYVLTLSAHDQGSPPRSASLQLLVQVLPSARLAEPPPDLAERDPAAPVPVVLTVTAAEGLRPGSLLGSVAAPEPAGVGALTYTLVGGADPEGTFALDAASGRLYLARPLDFEAGPPWRALTVRAEGPGGAGARLLRVQVQVQDENEHAPAFARDPLALALPENPEPGAALYTFRASDADGPGPNSDVRYRLLRQEPPVPALRLDARTGALSAPRGLDRETTPALLLLVEATDRPANASRRRAARVSARVFVTDENDNAPVFASPSRVRLPEDQPPGPAALHVVARDPDLGEAARVSYRLASGGDGHFRLHSSTGALSVVRPLDREQRAEHVLTVVASDHGSPPRSATQVLTVSVADVNDEAPTFQQQEYSVLLRENNPPGTSLLTLRATDPDVGANGQVTYGGVSSESFSLDPDTGVLTTLRALDREEQEEINLTVYAQDRGSPPQLTHVTVRVAVEDENDHAPTFGSAHLSLEVPEGQDPQTLTMLRASDPDVGANGQLQYRILDGDPSGAFVLDLASGEFGTMRPLDREVEPAFQLRIEARDGGQPALSATLLLTVTVLDANDHAPAFPVPAYSVEVPEDVPAGTLLLQLQAHDPDAGANGHVTYYLGAGTAGAFLLEPSSGELRTAAALDREQCPSYTFSVSAVDGAAAGPLSTTVSVTITVRDVNDHAPTFPTSPLRLRLPRPGPSFSTPTLALATLRAEDRDAGANASILYRLAGTPPPGTTVDSYTGEIRVARSPVALGPRDRVLFIVATDLGRPARSATGVIIVGLQGEAERGPRFPRASSEATIRENAPPGTPIVSPRAVHAGGTNGPITYSILSGNEKGTFSIQPSTGAITVRSAEGLDFEVSPRLRLVLQAESGGAFAFTVLTLTLQDANDNAPRFLRPHYVAFLPESRPLEGPLLQVEADDLDQGSGGQISYSLAASQPARGLFHVDPTTGTITTTAILDREIWAETRLVLMATDRGSPALVGSATLTVMVIDTNDNRPTIPQPWELRVSEDALLGSEIAQVTGNDVDSGPVLWYVLSPSGPQDPFSVGRYGGRVSLTGPLDFEQCDRYQLQLLAHDGPHEGRANLTVLVEDVNDNAPAFSQSLYQVMLLEHTPPGSAILSVSATDRDSGANGHISYHLASPADGFSVDPNNGTLFTIVGTVALGHDGSGAVDVVLEARDHGAPGRAARATVHVQLQDQNDHAPSFTLSHYRVAVTEDLPPGSTLLTLEATDADGSRSHAAVDYSIISGNWGRVFQLEPRLAEAGESAGPGPRALGCLVLLEPLDFESLTQYNLTVAAADRGQPPQSSVVPVTVTVLDVNDNPPVFTRASYRVTVPEDTPVGAELLHVEASDADPGPHGLVRFTVSSGDPSGLFELDESSGTLRLAHALDCETQARHQLVVQAADPAGAHFALAPVTIEVQDVNDHGPAFPLNLLSTSVAENQPPGTLVTTLHAIDGDAGAFGRLRYSLLEAGPGPEGREAFALNSSTGELRARVPFDYEHTESFRLLVGAADAGNLSASVTVSVLVTGEDEYDPVFLAPAFHFQVPEGARRGHSLGHVQATDEDGGADGLVLYSLATSSPYFGINQTTGALYLRVDSRAPGSGTATSGGGGRTRREAPRELRLEVIARGPLPGSRSATVPVTVDITHTALGLAPDLNLLLVGAVAASLGVVVVLALAALVLGLVRARSRKAEAAPGPMSQAAPLASDSLQKLGREPPSPPPSEHLYHQTLPSYGGPGAGGPYPRGGSLDPSHSSGRGSAEAAEDDEIRMINEFPRVASVASSLAARGPDSGIQQDADGLSDTSCEPPAPDTWYKGRKAGLLLPGAGATLYREEGPPATATAFLGGCGLSPAPTGDYGFPADGKPCVAGALTAIVAGEEELRGSYNWDYLLSWCPQFQPLASVFTEIARLKDEARPCPPAPRIDPPPLITAVAHPGAKSVPPKPANTAAARAIFPPASHRSPISHEGSLSSAAMSPSFSPSLSPLAARSPVVSPFGVAQGPSASALSAESGLEPPDDTELHI,mutated_sequence,1.0,3298.0,UPI00001313B6.a2m,UPI00001313B6.npy,gnomAD
+UPI000013D92B,UPI000013D92B.csv,MAVYCYALNSLVIMNSANEMKSGGGPGPSGSETPPPPRRAVLSPGSVFSPGRGASFLFPPAESLSPEEPRSPGGWRSGRRRLNSSSGSGSGSSGSSVSSPSWAGRLRGDRQQVVAAGTLSPPGPEEAKRKLRILQRELQNVQVNQKVGMFEAHIQAQSSAIQAPRSPRLGRARSPSPCPFRSSSQPPGRVLVQGARSEERRTKSWGEQCPETSGTDSGRKGGPSLCSSQVKKGMPPLPGRAAPTGSEAQGPSAFVRMEKGIPASPRCGSPTAMEIDKRGSPTPGTRSCLAPSLGLFGASLTMATEVAARVTSTGPHRPQDLALTEPSGRARELEDLQPPEALVERQGQFLGSETSPAPERGGPRDGEPPGKMGKGYLPCGMPGSGEPEVGKRPEETTVSVQSAESSDSLSWSRLPRALASVGPEEARSGAPVGGGRWQLSDRVEGGSPTLGLLGGSPSAQPGTGNVEAGIPSGRMLEPLPCWDAAKDLKEPQCPPGDRVGVQPGNSRVWQGTMEKAGLAWTRGTGVQSEGTWESQRQDSDALPSPELLPQDPDKPFLRKACSPSNIPAVIITDMGTQEDGALEETQGSPRGNLPLRKLSSSSASSTGFSSSYEDSEEDISSDPERTLDPNSAFLHTLDQQKPRVSKSWRKIKNMVHWSPFVMSFKKKYPWIQLAGHAGSFKAAANGRILKKHCESEQRCLDRLMVDVLRPFVPAYHGDVVKDGERYNQMDDLLADFDSPCVMDCKMGIRTYLEEELTKARKKPSLRKDMYQKMIEVDPEAPTEEEKAQRAVTKPRYMQWRETISSTATLGFRIEGIKKEDGTVNRDFKKTKTREQVTEAFREFTKGNHNILIAYRDRLKAIRTTLEVSPFFKCHEVIGSSLLFIHDKKEQAKVWMIDFGKTTPLPEGQTLQHDVPWQEGNREDGYLSGLNNLVDILTEMSQDAPLA,mutated_sequence,1.0,946.0,UPI000013D92B.a2m,UPI000013D92B.npy,gnomAD
+UPI000004B634,UPI000004B634.csv,MVAACRSVAGLLPRRRRCFPARAPLLRVALCLLCWTPAAVRAVPELGLWLETVNDKSGPLIFRKTMFNSTDIKLSVKSFHCSGPVKFTIVWHLKYHTCHNEHSNLEELFQKHKLSVDEDFCHYLKNDNCWTTKNENLDCNSDSQVFPSLNNKELINIRNVSNQERSMDVVARTQKDGFHIFIVSIKTENTDASWNLNVSLSMIGPHGYISASDWPLMIFYMVMCIVYILYGILWLTWSACYWKDILRIQFWIAAVIFLGMLEKAVFYSEYQNISNTGLSTQGLLIFAELISAIKRTLARLLVIIVSLGYGIVKPRLGTVMHRVIGLGLLYLIFAAVEGVMRVIGGSNHLAVVLDDIILAVIDSIFVWFIFISLAQTMKTLRLRKNTVKFSLYRHFKNTLIFAVLASIVFMGWTTKTFRIAKCQSDWMERWVDDAFWSFLFSLILIVIMFLWRPSANNQRYAFMPLIDDSDDEIEEFMVTSENLTEGIKLRASKSVSNGTAKPATSENFDEDLKWVEENIPSSFTDVALPVLVDSDEEIMTRSEMAEKMFSSEKIM,mutated_sequence,1.0,555.0,UPI000004B634.a2m,UPI000004B634.npy,gnomAD
+UPI00004574E1,UPI00004574E1.csv,MREAAAALVPPPAFAVTPAAAMEEPPPPPPPPPPPPEPETESEPECCLAARQEGTLGDSACKSPESDLEDFSDETNTENLYGTSPPSTPRQMKRMSTKHQRNNVGRPASRSNLKEKMNAPNQPPHKDTGKTVENVEEYSYKQEKKIRAALRTTERDRKKNVQCSFMLDSVGGSLPKKSIPDVDLNKPYLSLGCSNAKLPVSVPMPIARPARQTSRTDCPADRLKFFETLRLLLKLTSVSKKKDREQRGQENTSGFWLNRSNELIWLELQAWHAGRTINDQDFFLYTARQAIPDIINEILTFKVDYGSFAFVRDRAGFNGTSVEGQCKATPGTKIVGYSTHHEHLQRQRVSFEQVKRIMELLEYIEALYPSLQALQKDYEKYAAKDFQDRVQALCLWLNITKDLNQKLRIMGTVLGIKNLSDIGWPVFEIPSPRPSKGNEPEYEGDDTEGELKELESSTDESEEEQISDPRVPEIRQPIDNSFDIQSRDCISKKLERLESEDDSLGWGAPDWSTEAGFSRHCLTSIYRPFVDKALKQMGLRKLILRLHKLMDGSLQRARIALVKNDRPVEFSEFPDPMWGSDYVQLSRTPPSSEEKCSAVSWEELKAMDLPSFEPAFLVLCRVLLNVIHECLKLRLEQRPAGEPSLLSIKQLVRECKEVLKGGLLMKQYYQFMLQEVLEDLEKPDCNIDAFEEDLHKMLMVYFDYMRSWIQMLQQLPQASHSLKNLLEEEWNFTKEITHYIRGGEAQAGKLFCDIAGMLLKSTGSFLEFGLQESCAEFWTSADDSSASDEIRRSVIEISRALKELFHEARERASKALGFAKMLRKDLEIAAEFRLSAPVRDLLDVLKSKQYVKVQIPGLENLQMFVPDTLAEEKSIILQLLNAAAGKDCSKDSDDVLIDAYLLLTKHGDRARDSEDSWGTWEAQPVKVVPQVETVDTLRSMQVDNLLLVVMQSAHLTIQRKAFQQSIEGLMTLCQEQTSSQPVIAKALQQLKNDALELCNRISNAIDRVDHMFTSEFDAEVDESESVTLQQYYREAMIQGYNFGFEYHKEVVRLMSGEFRQKIGDKYISFARKWMNYVLTKCESGRGTRPRWATQGFDFLQAIEPAFISALPEDDFLSLQALMNECIGHVIGKPHSPVTGLYLAIHRNSPRPMKVPRCHSDPPNPHLIIPTPEGFSTRSMPSDARSHGSPAAAAAAAAAAVAASRPSPSGGDSVLPKSISSAHDTRGSSVPENDRLASIAAELQFRSLSRHSSPTEERDEPAYPRGDSSGSTRRSWELRTLISQSKDTASKLGPIEAIQKSVRLFEEKRYREMRRKNIIGQVCDTPKSYDNVMHVGLRKVTFKWQRGNKIGEGQYGKVYTCISVDTGELMAMKEIRFQPNDHKTIKETADELKIFEGIKHPNLVRYFGVELHREEMYIFMEYCDEGTLEEVSRLGLQEHVIRLYSKQITIAINVLHEHGIVHRDIKGANIFLTSSGLIKLGDFGCSVKLKNNAQTMPGEVNSTLGTAAYMAPEVITRAKGEGHGRAADIWSLGCVVIEMVTGKRPWHEYEHNFQIMYKVGMGHKPPIPERLSPEGKDFLSHCLESDPKMRWTASQLLDHSFVKVCTDEE,mutated_sequence,1.0,1608.0,UPI00004574E1.a2m,UPI00004574E1.npy,gnomAD
+UPI00001B4B16,UPI00001B4B16.csv,MDNRNTQMYTEEEKTVNPFLPSTPGPKKAKGGGEAVETHPAPGPLPPPEVRDIGERREPDRAQQQPQKPAVAAGTQSLGNFRQGFMKCLLEVEKMEASHRRASKARSQTAQKSPRTLTPVPTSAPSLPQTPASVPASGPSWARLPAPGPEPAPMGAPVPTSMPCPVLLGPALDLGWRRMELLHQSSERTLSYAKARQEPEEQSLQKLYQNREKSEEQLTLKQEEAFRSYFEIFNGPGEVDAQSLKNILLLMGFSVTLAQVEDALMSADVNGDGRVDFKDFLAVMTDTRRFFCSVEQNALSDMAPHNPHTLLFEILSLLVEMLALPEAVLEEITNYYQKKLKEGTCKAQEMEAAVGRLRLQKLPYNPQQEESSEVPERKVLSILSRLKQQNYAPNLQSPYAQVPCILLCPQLDKKMVRRQPSNHYALDQCTPPGLDPDIRSPFFQSGSQGNREHNSDSRKWLSSVPARTH,mutated_sequence,1.0,469.0,UPI00001B4B16.a2m,UPI00001B4B16.npy,gnomAD
+UPI000013D915,UPI000013D915.csv,MDTSGHFHDSGVGDLDEDPKCPCPSSGDEQQQQQQQQQQQQPPPPAPPAAPQQPLGPSLQPQPPQLQQQQQQQQQQQQQQPPHPLSQLAQLQSQPVHPGLLHSSPTAFRAPPSSNSTAILHPSSRQGSQLNLNDHLLGHSPSSTATSGPGGGSRHRQASPLVHRRDSNPFTEIAMSSCKYSGGVMKPLSRLSASRRNLIEAETEGQPLQLFSPSNPPEIVISSREDNHAHQTLLHHPNATHNHQHAGTTASSTTFPKANKRKNQNIGYKLGHRRALFEKRKRLSDYALIFGMFGIVVMVIETELSWGLYSKDSMFSLALKCLISLSTIILLGLIIAYHTREVQLFVIDNGADDWRIAMTYERILYISLEMLVCAIHPIPGEYKFFWTARLAFSYTPSRAEADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFVMKTLMTICPGTVLLVFSISLWIIAAWTVRVCERYHDQQDVTSNFLGAMWLISITFLSIGYGDMVPHTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVHNFMMDTQLTKRIKNAAANVLRETWLIYKHTKLLKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLSDQANTLVDLSKMQNVMYDLITELNDRSEDLEKQIGSLESKLEHLTASFNSLPLLIADTLRQQQQQLLSAIIEARGVSVAVGTTHTPISDSPIGVSSTSFPTPYTSSSSC,mutated_sequence,1.0,731.0,UPI000013D915.a2m,UPI000013D915.npy,gnomAD
+UPI00001B0366,UPI00001B0366.csv,MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMSGYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDMGAGHHHGAHHAAHHHHAAHHHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL,mutated_sequence,1.0,353.0,UPI00001B0366.a2m,UPI00001B0366.npy,gnomAD
+UPI00015B3B88,UPI00015B3B88.csv,MSQRVRRNGSPTPAGSLGGGAVATAGGPGSRLQPMRATVPFQLKQQQQQQHGSPTRSGGGGGGNNNGGCCGGASGPAGGGGGGGPRTASRSTSPTRGGGNAAARTSPTVATQTGASATSTRGTSPTRSAAPGARGSPPRPPPPPPLLGTVSSPSSSPTHLWTGEVSAAPPPARVRHRRRSPEQSRSSPEKRSPSAPVCKAGDKTRQPSSSPSSIIRRTSSLDTLAAPYLAGHWPRDSHGQAAPCMRDKATQTESAWAEEYSEKKKGSHKRSASWGSTDQLKEIAKLRQQLQRSKHSSRHHRDKERQSPFHGNHAAINQCQAPVPKSALIPVIPITKSTGSRFRNSVEGLNQEIEIIIKETGEKEEQLIPQDIPDGHRAPPPLVQRSSSTRSIDTQTPGGADRGSNNSSRSQSVSPTSFLTISNEGSEESPCSADDLLVDPRDKENGNNSPLPKYATSPKPNNSYMFKREPPEGCERVKVFEECSPKQLHEIPAFYCPDKNKVNFIPKSGSAFCLVSILKPLLPTPDLTLKGSGHSLTVTTGMTTTLLQPIAVASLSTNTEQDRVSRGTSTVMPSASLLPPPEPIEEAEG,mutated_sequence,1.0,589.0,UPI00015B3B88.a2m,UPI00015B3B88.npy,gnomAD
+UPI000004CB0B,UPI000004CB0B.csv,MEIISSKLFILLTLATSSLLTSNIFCADELVISNLHSKENYDKYSEPRGYPKGERSLNFEELKDWGPKNVIKMSTPAVNKMPHSFANLPLRFGRNVQEERSAGATANLPLRSGRNMEVSLVRRVPNLPQRFGRTTTAKSVCRMLSDLCQGSMHSPCANDLFYSMTCQHQEIQNPDQKQSRRLLFKKIDDAELKQEK,mutated_sequence,1.0,196.0,UPI000004CB0B.a2m,UPI000004CB0B.npy,gnomAD
+UPI00001A95DE,UPI00001A95DE.csv,MDPFTEKLLERTRARRENLQRKMAERPTAAPRSMTHAKRARQPLSEASNQQPLSGGEEKSCTKPSPSKKRCSDNTEVEVSNLENKQPVESTSAKSCSPSPVSPQVQPQAADTISDSVAVPASLLGMRRGLNSRLEATAASSVKTRMQKLAEQRRRWDNDDMTDDIPESSLFSPMPSEEKAASPPRPLLSNASATPVGRRGRLANLAATICSWEDDVNHSFAKQNSVQEQPGTACLSKFSSASGASARINSSSVKQEATFCSQRDGDASLNKALSSSADDASLVNASISSSVKATSPVKSTTSITDAKSCEGQNPELLPKTPISPLKTGVSKPIVKSTLSQTVPSKGELSREICLQSQSKDKSTTPGGTGIKPFLERFGERCQEHSKESPARSTPHRTPIITPNTKAIQERLFKQDTSSSTTHLAQQLKQERQKELACLRGRFDKGNIWSAEKGGNSKSKQLETKQETHCQSTPLKKHQGVSKTQSLPVTEKVTENQIPAKNSSTEPKGFTECEMTKSSPLKITLFLEEDKSLKVTSDPKVEQKIEVIREIEMSVDDDDINSSKVINDLFSDVLEEGELDMEKSQEEMDQALAESSEEQEDALNISSMSLLAPLAQTVGVVSPESLVSTPRLELKDTSRSDESPKPGKFQRTRVPRAESGDSLGSEDRDLLYSIDAYRSQRFKETERPSIKQVIVRKEDVTSKLDEKNNAFPCQVNIKQKMQELNNEINMQQTVIYQASQALNCCVDEEHGKGSLEEAEAERLLLIATGKRTLLIDELNKLKNEGPQRKNKASPQSEFMPSKGSVTLSEIRLPLKADFVCSTVQKPDAANYYYLIILKAGAENMVATPLASTSNSLNGDALTFTTTFTLQDVSNDFEINIEVYSLVQKKDPSGLDKKKKTSKSKAITPKRLLTSITTKSNIHSSVMASPGGLSAVRTSNFALVGSYTLSLSSVGNTKFVLDKVPFLSSLEGHIYLKIKCQVNSSVEERGFLTIFEDVSGFGAWHRRWCVLSGNCISYWTYPDDEKRKNPIGRINLANCTSRQIEPANREFCARRNTFELITVRPQREDDRETLVSQCRDTLCVTKNWLSADTKEERDLWMQKLNQVLVDIRLWQPDACYKPIGKP,mutated_sequence,1.0,1124.0,UPI00001A95DE.a2m,UPI00001A95DE.npy,gnomAD
+UPI000003B052,UPI000003B052.csv,MASYLYGVLFAVGLCAPIYCVSPANAPSAYPRPSSTKSTPASQVYSLNTDFAFRLYRRLVLETPSQNIFFSPVSVSTSLAMLSLGAHSVTKTQILQGLGFNLTHTPESAIHQGFQHLVHSLTVPSKDLTLKMGSALFVKKELQLQANFLGNVKRLYEAEVFSTDFSNPSIAQARINSHVKKKTQGKVVDIIQGLDLLTAMVLVNHIFFKAKWEKPFHPEYTRKNFPFLVGEQVTVHVPMMHQKEQFAFGVDTELNCFVLQMDYKGDAVAFFVLPSKGKMRQLEQALSARTLRKWSHSLQKRWIEVFIPRFSISASYNLETILPKMGIQNVFDKNADFSGIAKRDSLQVSKATHKAVLDVSEEGTEATAATTTKFIVRSKDGPSYFTVSFNRTFLMMITNKATDGILFLGKVENPTKS,mutated_sequence,1.0,417.0,UPI000003B052.a2m,UPI000003B052.npy,gnomAD
+UPI000017DF81,UPI000017DF81.csv,MGLCYSLRPLLFGGPGDDPCAASEPPVEDAQPAPAPALAPVRAAARDTARTLLPRGGEGSPACARPKADKPKEKRQRTEQLSAEEREAAKEREAVKEARKVSRGIDRMLRDQKRDLQQTHRLLLLGAGESGKSTIVKQMRILHVNGFNPEEKKQKILDIRKNVKDAIVTIVSAMSTIIPPVPLANPENQFRSDYIKSIAPITDFEYSQEFFDHVKKLWDDEGVKACFERSNEYQLIDCAQYFLERIDSVSLVDYTPTDQDLLRCRVLTSGIFETRFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIYVAACSSYNMVIREDNNTNRLRESLDLFESIWNNRWLRTISIILFLNKQDMLAEKVLAGKSKIEDYFPEYANYTVPEDATPDAGEDPKVTRAKFFIRDLFLRISTATGDGKHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLKQYELL,mutated_sequence,1.0,458.0,UPI000017DF81.a2m,UPI000017DF81.npy,gnomAD
+UPI0001611034,UPI0001611034.csv,XVSAACQRRNTGTVGLKLSKVVVVGDLYVGKTSLIHRFCKNVFDRDYKATIGVDFEIERFEIAGIPYSLQIWDTAGQEKFKCIASAYYRGAQGEQHATSGLNCMQQRGPCRAPQWESRVLCTGTSFLSSKTH,mutated_sequence,1.0,132.0,UPI0001611034.a2m,UPI0001611034.npy,gnomAD
+UPI0000202F63,UPI0000202F63.csv,MLRSTGFFRAIDCPYWSGAPGGPCRRPYCHFRHRGARGSGAPGDGGEAPPAAGLGYDPYNPELPKPPAQRENGTLGLGEEPRPDVLELELVNQAIEAVRSEVELEQRRYRELLETTREHRSAEAPALAPRGPNASPTVGPDEDAFPLAFDYSPGSHGLLSPDAGYQPTPLAAPAEPGSKYSLASLDRGQGRGGGGGGALEYVPKAVSQPRRHSRPVPSGKYVVDNSRPPTDLEYDPLSNYSARHLSRASSRDERAAKRPRGSRGSEPYTPAPKKLCDPFGSCDARFSDSEDEAATVPGNEPTTASTPKARADPEIKATGQPPSKEGLEAEGGGLRETKETAVQCDVGDLQPPPAKPASPAQVQSSQDGGCPKEGKPKKKKTGAPPAPSCKDGAQGKDKTKDKGRGRPVEKPRADKKGPQASSPRRKAERPEGTKKKPSSATPVATSGKGRPDRPARRPSPTSGDSRPAAGRGPPRPLQLPDRKSTKAPSGKLVERKARSLDEGASQDAPKLKKRALSHADLFGDESEDEAAGPGVPSVWPSALPSLSSDSDSDSDSSLGFPEAQGPPKRLKASPPPSPAPSSSSSSSSSTSSAGADVDYSALEKEVDFDSDPMEECLRIFNESTSVKTEDRGRLARQPPKEEKSEEKGLSGLTTLFPGQKRRISHLSKQGQEVEPPRRGPAVPPARPPTAQEVCYLRAQQAQRASASLLQAPARLAEKSPSVHISAPGEKRRIAHIPNPRLAAAPTGAKRTLAASGSQSSNGPEPGGQQLKTRTLSGMASKTTTTIIPKRIAHSPSLQSLKKPIIPKEFGGKVPTVIRQRYLNLFIEECLKFCTSNQEAIEKALNEEKVAYDRSPSKNIYLNVAVNTLKKLRGLAPSAVPGLSKTSGRRVVSHEVVLGGRLAAKTSFSLSRPSSPRVEDLKGAALYSRLREYLLTQDQLKENGYPFPHPERPGGAIIFTAEEKRPKDSSCRTCCRCGTEYLVSSSGRCIRDEECYYHWGRLRRNRVAGGWETQYMCCSAAAGSVGCQVAKQHVQDGRKERLEGFVKTFEKELSGDTHPGIYALDCEMSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFPHRLGLPYKRSLRNLMADYLRQIIQDNVDGHSSSEDAGACMHLVIWKVREDAKTKR,mutated_sequence,1.0,1221.0,UPI0000202F63.a2m,UPI0000202F63.npy,gnomAD
+UPI00000728B5,UPI00000728B5.csv,MARGPLAARGLRLLLPLLPLLPLLPLPQVALGFADGSCDPSDQCPPQARWSSLWHVGLILLAVLLLLLCGVTAGCVRFCCLRKQAQAQPHLPPARQPCDVAVIPMDSDSPVHSTVTSYSSVQYPLGMRLPLPFGELDLDSMAPPAYSLYTPEPPPSYDEAVKMAKPREEGPALSQKPSPLLGASGLETTPVPQESGPNTQLPPCSPGAP,mutated_sequence,1.0,209.0,UPI00000728B5.a2m,UPI00000728B5.npy,gnomAD
+UPI0002A47704,UPI0002A47704.csv,XDSGSDEEEGEAEASSSTPATGVFLKSWVYQPGEDTEEEEDEDSDTGSAEDEREAETSASTPPASAFLKAWVYRPGEDTEEEEDEDVDSEDKEDDSEAALGEAESDPHPSHPDQRAHFRGWGYRPGKETEEEEAAEDWGEAEPCPFRVAIYVPGEKPPPPWAPPRLPLRLQRRLKRPETPTHDPDPETPLKARKVGAESPDSIFFFFLIELESWAVTQAGVQRHDLGSLPTSAAQVQAILVPQPPEYLGLQAHDTTPS,mutated_sequence,1.0,258.0,UPI0002A47704.a2m,UPI0002A47704.npy,gnomAD
+UPI0000074451,UPI0000074451.csv,MGLTLLLLLLLGLEGQGIVGSLPEVLQAPVGSSILVQCHYRLQDVKAQKVWCRFLPEGCQPLVSSAVDRRAPAGRRTFLTDLGGGLLQVEMVTLQEEDAGEYGCMVDGARGPQILHRVSLNILPPEEEEETHKIGSLAENAFSDPAGSANPLEPSQDEKSIPLIWGAVLLVGLLVAAVVLFAVMAKRKQGNRLGVCGRFLSSRVSGMNPSSVVHHVSDSGPAAELPLDVPHIRLDSPPSFDNTTYTSLPLDSPSGKPSLPAPSSLPPLPPKVLVCSKPVTYATVIFPGGNKGGGTSCGPAQNPPNNQTPSS,mutated_sequence,1.0,311.0,UPI0000074451.a2m,UPI0000074451.npy,gnomAD
+UPI000049DE01,UPI000049DE01.csv,MDPGSGGGGGGGGGGGSSSGSSSSDSAPDCWDQADMEAPGPGPCGGGGSLAAAAEAQRENLSAAFSRQLNVNAKPFVPNVHAAEFVPSFLRGPAAPPPPVGGAANNHGAGSGAGGRAAPVESSQEEQSLCEGSNSAVSMELSEPIVENGETEMSPEESWEHKEEISEAEPGGGSLGDGRPPEESAHEMMEEEEEIPKPKSVVAPPGAPKKEHVNVVFIGHVDAGKSTIGGQIMYLTGMVDKRTLEKYEREAKEKNRETWYLSWALDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGASQADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVLINKMDDPTVNWSNERYEECKEKLVPFLKKVGFNPKKDIHFMPCSGLTGANLKEQSDFCPWYIGLPFIPYLDNLPNFNRSVDGPIRLPIVDKYKDMGTVVLGKLESGSICKGQQLVMMPNKHNVEVLGILSDDVETDTVAPGENLKIRLKGIEEEEILPGFILCDPNNLCHSGRTFDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITALICLVDKKSGEKSKTRPRFVKQDQVCIARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD,mutated_sequence,1.0,637.0,UPI000049DE01.a2m,UPI000049DE01.npy,gnomAD
+UPI0000073639,UPI0000073639.csv,MVLPPPDRRHVCLTTLVIMGSMAVMDAYLVEQNQGPRKIGVCIIVLVGDVCFLLVLRYVAVWVGAEVRTAKRGYAMILWFLYIFVLEIKLYFIFQNYKAARRGAADPVARKALTLLLSVCVPGLFLLLVALDRMEYVRTFRKREDLRGRLFWVALDLLDLLDMQASLWEPPRSGLPLWAEGLTFFYCYMLLLVLPCVALSEVSMQGEHIAPQKMMLYPVLSLATVNVVAVLARAANMALFRDSRVSAIFVGKNVVALATKACTFLEYRRQVRDFPPPALSLELQPPPPQRNSVPPPPPPLHGPPGRPHMSSPTRDPLDT,mutated_sequence,1.0,319.0,UPI0000073639.a2m,UPI0000073639.npy,gnomAD
+UPI0000E59EF5,UPI0000E59EF5.csv,MNESASQEELRPAQENRKEDKERKWNLTEVKELHETLQSVPDVPVKEDTNSVVEKAMDEIKSQELNLEGQRKISPGSIKDSKTEASGNIAIRKSAKVIFALDETELKSKPEHTWKKNLFERMEARAQAMQQKIIDKENLKKELEKKAEKKLPRDNLAKEWFNTDSMTLNNTAYLLDKLLPTLVPGVENMLTQVEKKKVLTEADTPSKFDPINYLGEYLIRNNPNYIKDPGMSGYQRLMKEVTEDLKIYVPDTICNRVSKMKENVKQNRKQRESIDKIIVKVANTRKQALQEQFDEWILDPKGMIPKSVIQNVLQEFFQNPDFKLGSHCKQLDITDSTEPRLNKMEFTEYISSHIKDLKSEMFEELLKHLCHSADEFREVIKADMRRQMFAELFLHCDHGKVGFLDRQRTLALLELFYDHSSQMLRSLLRNPRQWPFIEFEEINLTELWGDMDNQKHIYEGFDKVLLEMNTLLSANHASKTQSKLLESPDQPKLNEQRTSTPSPNPPEQQRGVTAEQGPQRISIEEQQQGKKPTAEQELYIESVIEPGTHTESTLEQGSSRRLLTEQETHRESTTEQGQHKGSIEGQGPRRVSVSEQGSSRESVAEQGSRRESIAEQDRHKGSVAEQGSRRMSAAEQGSLRESVIEEPYQKSEQGPYGEIISEEQEDIGSTSQSRKDSILKSTKYGEPITSEYIEVPLQEKRSWEQTYEEEIFLSSELQEEVPTLSRKDHFPETTKKEVQKDKPCEPKSQKIEGKSWSGEFFTCNWKMKYVTFEDEEQANLIYGNSRFTDLHSIIRNIQSCKEVKGRTAFNGVSFNLLQFVQLLETFVGEDAPLSVSETLTSFFKEGYVETEQEKMNALEQFSQNAFQVRQRLLLEAIFQKWDSDGSGFLDLKEVDELLYTYKEGMEKESMKKAKLHIQFPKPHPGHEVRLSSKQFQNYIELVVSELRGNEDQVLESVVEFLMNALERSHIESLRNSARRKWLHQIQCAAETSGVSLEPVYSETFKALMQDAEAHGNKKISAHISLLEENLLLPEKGNVLLRNVACTLDDAQFVLNRVLYRDMKGISFTVVDEGKPIHVPQVQYHGNIFFWNQSRNKHDYNGSFLALPLQDAYMRIFGVLAVDTLRDPHEINIFLPHEIRFYQGVANVFSTAYHYVHSREHILHIVITGIGWLYDVTSSITSITTYFVEPSPAQDSDYVLRNMMVTGQLGLTEIHKNPPTIHRKSCIFRDFLFKCTDSSEVVLASACGETHIVVPLRERTGEALGVLDFNIGQNRMLLCQEYKDLQKMMKVVQVACYEILGEFSGEIKKKYILEIENVREVQRAGILFFRIMLLELQESIQLLNSMEFVSLLLYDHTLVTEPNSPQDSKSMELEANVKLVRDILKAVILFFHPELEFSSDFGSWDKCKFYVNKYLVNNICAFDPTAKHVEVNVQLIDEYIRDHSRTEVWKFGNVVIEHLYHWIHICSALMKITKQLNSGITPPLPSKTDNYMYAKMPGEGLQEK,mutated_sequence,1.0,1503.0,UPI0000E59EF5.a2m,UPI0000E59EF5.npy,gnomAD
+UPI000023731A,UPI000023731A.csv,MNPREEKVKIITEEFIENDEDADMGRQNKNSKVRRQPRKKQPPTAVPKEMVSEKSHLGNPQEPVQEEPKTRLLSMTVRRGPRSLPPIPSTSRTGFAEFSMRGRMREKLQAARSKAESALLQEIPTPRPRRLRSPSKKELETEFGTEPGKEVERTQQEVDSQSYSRVKFHDSARKIKPKPQVPPGFPSAEEAYNFFTFNFDPEPEGSEEKPKARHRAGTNQEEEEGEEEEPPAQGGGKEMDEEELLNGDDAEDFLLGLDHVADDFVAVRPADYESIHDRLQMEREMLFIPSRQTVPTYKKLPENVQPRFLEDEGLYTGVRPEVARTNQNIMENRLLMQDPERRWFGDDGRILALPNPIKPFPSRPPVLTQEQSIKAELETLYKKAVKYVHSSQHVIRSGDPPGNFQLDIDISGLIFTHHPCFSREHVLAAKLAQLYDQYLARHQRNKAKFLTDKLQALRNAVQTGLDPEKPHQSLDTIQKTINEYKSEIRQTRKFRDAEQEKDRTLLKTIIKVWKEMKSLREFQRFTNTPLKLVLRKEKADQKADEEAYEAEIQAEISELLEEHTEEYAQKMEEYRTSLQQWKAWRKVQRAKKKKRKQAAEEHPGDEIAEPYPEEDLVKPSPPEPTDRAVIEQEVRERAAQSRRRPWEPTLVPELSLAGSVTPNDQCPRAEVSRREDVKKRSVYLKVLFNNKEVSRTVSRPLGADFRVHFGQIFNLQIVNWPESLTLQVYETVGHSSPTLLAEVFLPIPETTVVTGRAPTEEVEFSSNQHVTLDHEGVGSGVPFSFEADGSNQLTLMTSGKVSHSVAWAIGENGIPLIPPLSQQNIGFRSALKKADAISSIGTSGLTDMKKLAKWAAESKLDPNDPNNAPLMQLISVATSGESYVPDFFRLEQLQQEFNFVSDQELNRSKRFRLLHLRSQEVPEFRNYKQVPVYDREIMEKVFQDYEKRLRDRNVIETKEHIDTHRAIVAKYLQQVRESVINRFLIAKQYFLLADMIVEEEVPNISILGLSLFKLAEQKRPLRPRRKGRKKVTAQNLSDGDIKLLVNIVRAYDIPVRKPAVSKFQQPSRSSRMFSEKHAASPSTYSPTHNADYPLGQVLVRPFVEVSFQRTVCHTTTAEGPNPSWNEELELPFRAPNGDYSTASLQSVKDVVFINIFDEVLHDVLEDDRERGSGIHTRIERHWLGCVKMPFSTIYFQARIDGTFKIDIPPVLLGYSKERNMILERGFDSVRSLSEGSYITLFITIEPQLVPGESIREKFESQEDEKLLQATEKFQAECALKFPNRQCLTTVIDISGKTVFITRYLKPLNPPQELLNVYPNNLQATAELVARYVSLIPFLPDTVSFGGICDLWSTSDQFLDLLAGDEEEHAVLLCNYFLSLGKKAWLLMGNAIPEGPTAYVLTWEQGRYLIWNPCSGHFYGQFDTFCPLKNVGCLIGPDNIWFNIQRYESPLRINFDVTRPKLWKSFFSRSLPYPGLSSVQPEELIYQRSDKAAAAELQDRIEKILKEKIMDWRPRHLTRWNRYCTSTLRHFLPLLEKSQGEDVEDDHRAELLKQLGDYRFSGFPLHMPYSEVKPLIDAVYSTGVHNIDVPNVEFALAVYIHPYPKNVLSVWIYVASLIRNR,mutated_sequence,1.0,1620.0,UPI000023731A.a2m,UPI000023731A.npy,gnomAD
+UPI0000000B17,UPI0000000B17.csv,MLVMAPRTVLLLLSAALALTETWAGSHSMRYFYTSVSRPGRGEPRFISVGYVDDTQFVRFDSDAASPREEPRAPWIEQEGPEYWDRNTQIYKAQAQTDRESLRNLRGYYNQSEAGSHTLQSMYGCDVGPDGRLLRGHDQYAYDGKDYIALNEDLRSWTAADTAAQITQRKWEAAREAEQRRAYLEGECVEWLRRYLENGKDKLERADPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEPSSQSTVPIVGIVAGLAVLAVVVIGAVVAAVMCRRKSSGGKGGSYSQAACSDSAQGSDVSLTA,mutated_sequence,1.0,362.0,UPI0000000B17.a2m,UPI0000000B17.npy,gnomAD
+UPI000013D56A,UPI000013D56A.csv,MSSCRYNGGVMRPLSNLSASRRNLHEMDSEAQPLQPPASVGGGGGASSPSAAAAAAAAVSSSAPEIVVSKPEHNNSNNLALYGTGGGGSTGGGGGGGGSGHGSSSGTKSSKKKNQNIGYKLGHRRALFEKRKRLSDYALIFGMFGIVVMVIETELSWGAYDKASLYSLALKCLISLSTIILLGLIIVYHAREIQLFMVDNGADDWRIAMTYERIFFICLEILVCAIHPIPGNYTFTWTARLAFSYAPSTTTADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFVMKTLMTICPGTVLLVFSISLWIIAAWTVRACERYHDQQDVTSNFLGAMWLISITFLSIGYGDMVPNTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVHNFMMDTQLTKRVKNAAANVLRETWLIYKNTKLVKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLNDQANTLVDLAKTQNIMYDMISDLNERSEDFEKRIVTLETKLETLIGSIHALPGLISQTIRQQQRDFIEAQMESYDKHVTYNAERSRSSSRRRRSSSTAPPTSSESS,mutated_sequence,1.0,579.0,UPI000013D56A.a2m,UPI000013D56A.npy,gnomAD
+UPI00005D3C95,UPI00005D3C95.csv,MREPEELMPDSGAVFTFGKSKFAENNPGKFWFKNDVPVHLSCGDEHSAVVTGNNKLYMFGSNNWGQLGLGSKSAISKPTCVKALKPEKVKLAACGRNHTLVSTEGGNVYATGGNNEGQLGLGDTEERNTFHVISFFTSEHKIKQLSAGSNTSAALTEDGRLFMWGDNSEGQIGLKNVSNVCVPQQVTIGKPVSWISCGYYHSAFVTTDGELYVFGEPENGKLGLPNQLLGNHRTPQLVSEIPEKVIQVACGGEHTVVLTENAVYTFGLGQFGQLGLGTFLFETSEPKVIENIRDQTISYISCGENHTALITDIGLMYTFGDGRHGKLGLGLENFTNHFIPTLCSNFLRFIVKLVACGGCHMVVFAAPHRGVAKEIEFDEINDTCLSVATFLPYSSLTSGNVLQRTLSARMRRRERERSPDSFSMRRTLPPIEGTLGLSACFLPNSVFPRCSERNLQESVLSEQDLMQPEEPDYLLDEMTKEAEIDNSSTVESLGETTDILNMTHIMSLNSNEKSLKLSPVQKQKKQQTIGELTQDTALTENDDSDEYEEMSEMKEGKACKQHVSQGIFMTQPATTIEAFSDEEVEIPEEKEGAEDSKGNGIEEQEVEANEENVKVHGGRKEKTEILSDDLTDKAEVSEGKAKSVGEAEDGPEGRGDGTCEEGSSGAEHWQDEEREKGEKDKGRGEMERPGEGEKELAEKEEWKKRDGEEQEQKEREQGHQKERNQEMEEGGEEEHGEGEEEEGDREEEEEKEGEGKEEGEGEEVEGEREKEEGERKKEERAGKEEKGEEEGDQGEGEEEETEGRGEEKEEGGEVEGGEVEEGKGEREEEEEEGEGEEEEGEGEEEEGEGEEEEGEGKGEEEGEEGEGEEEGEEGEGEGEEEEGEGEGEEEGEGEGEEEEGEGEGEEEGEGEGEEEEGEGKGEEEGEEGEGEGEEEEGEGEGEDGEGEGEEEEGEWEGEEEEGEGEGEEEGEGEGEEGEGEGEEEEGEGEGEEEEGEEEGEEEGEGEEEGEGEGEEEEEGEVEGEVEGEEGEGEGEEEEGEEEGEEREKEGEGEENRRNREEEEEEEGKYQETGEEENERQDGEEYKKVSKIKGSVKYGKHKTYQKKSVTNTQGNGKEQRSKMPVQSKRLLKNGPSGSKKFWNNVLPHYLELK,mutated_sequence,1.0,1152.0,UPI00005D3C95.a2m,UPI00005D3C95.npy,gnomAD
+UPI00000467E6,UPI00000467E6.csv,MSLKLQASNVTNKNDPKSINSRVFIGNLNTALVKKSDVETIFSKYGRVAGCSVHKGYAFVQYSNERHARAAVLGENGRVLAGQTLDINMAGEPKPDRPKGLKRAASAIYSGYIFDYDYYRDDFYDRLFDYRGRLSPVPVPRAVPVKRPRVTVPLVRRVKTNVPVKLFARSTAVTTSSAKIKLKSSELQAIKTELTQIKSNIDALLSRLEQIAAEQKANPDGKKKGDGGGAGGGGGGGGSGGGGSGGGGGGGSSRPPAPQENTTSEAGLPQGEARTRDDGDEEGLLTHSEEELEHSQDTDADDGALQ,mutated_sequence,1.0,306.0,UPI00000467E6.a2m,UPI00000467E6.npy,gnomAD
+UPI00015386B3,UPI00015386B3.csv,MGDLKSGFEEVDGVRLGYLIIKGKQMFALSQVFTDLLKNIPRTTVHKRMDHLKVKKHHCDLEELRKLKAINSIAFHAAKCTLISREDVEALYTSCKTERVLKTKRRRVGRALATKAPPPERAAAASPRPGFWKDKHQLWRGLSGAARPLPISAQSQRPGAAAARPAAHLPQIFSKYPGSHYPEIVRSPCKPPLNYETAPLQGNYVAFPSDPAYFRSLLCSKHPAAAAAAAAAAAAAAAAAAAAAYYQVSAAGPQPKAAAGAGGPGSLSYRCKRKRGGAKDCLLAPHAGARRLLLLPRSYKAKAAAAAAAAAAAAAAAAGATCLERFHLVNGFCPPPHHHHHHHHHHHHHHHRAQPPQQSHHPPHHHRPQPHLGSFPESCSSDSESSSYSDHAANDSDFGSSLSSSSNSVSSEEEEEEGEEEEEEEEEEGGSGASDSSEVSSEEEDSSTESDSSSGSSQVSVQSIRFRRTSFCKPPSVQAQANFLYHLASAAAATKPAAFEDAGRLPDLKSSVKAESPAEWNLQSWAPKASPVYCPASLGSCFAEIRNDRVSEITFPHSEISNAVKRTDLTINCLAEGASSPSPKTNNAFPQQRILREARKCLQTTPTTHCADNNTIAARFLNNDSSGAEANSEKYSKILHCPEFATDLPSSQTDPEVNAAGAAATKAENPCTDTGDKTLPFLHNIKIKVEDSSANEEYEPHLFTNKLKCECNDTKGEFYSVTESKEEDALLTTAKEGFACPEKETPSLNPLAQSQGLSCTLGSPKPEDGEYKFGARVRKNYRTLVLGKRPVLQTPPVKPNLKSARSPRPTGKTETNEGTLDDFTVINRRKKVASNVASAVKRPFHFMANFPCPPSLIIGRDGDLWPAYSLNTTKDSQTPHKAHPIWKWQLGGSAIPLPPSHKFRKFNS,mutated_sequence,1.0,908.0,UPI00015386B3.a2m,UPI00015386B3.npy,gnomAD
+UPI000006EA15,UPI000006EA15.csv,MTASAQPRGRRPGVGVGVVVTSCKHPRCVLLGKRKGSVGAGSFQLPGGHLEFGETWEECAQRETWEEAALHLKNVHFASVVNSFIEKENYHYVTILMKGEVDVTHDSEPKNVEPEKNESWEWVPWEELPPLDQLFWGLRCLKEQGYDPFKEDLNHLVGYKGNHL,mutated_sequence,1.0,164.0,UPI000006EA15.a2m,UPI000006EA15.npy,gnomAD
+UPI00000715E9,UPI00000715E9.csv,MAAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAKYLSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTLWIGSLGAHTPCHLDSYGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQRHAVTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTAENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEELNVCNHMEVGQTGSQNLTTGTDKPEAASPFGPDLVPVAQRSEEPPSERGGIFGSDGKDFVDKDGEHFGKLHCAKRQQIMSNSENAIEEQIASNTTTTPQTFISTDDLLDCLVNPQVTRIVAQLLIQGRSL,mutated_sequence,1.0,488.0,UPI00000715E9.a2m,UPI00000715E9.npy,gnomAD
+UPI000013D4C2,UPI000013D4C2.csv,MEENLISMREDHSFHVRYRMEASCLELALEGERLCKSGDCRAGVSFFEAAVQVGTEDLKTLSAIYSQLGNAYFYLHDYAKALEYHHHDLTLARTIGDQLGEAKASGNLGNTLKVLGNFDEAIVCCQRHLDISRELNDKVGEARALYNLGNVYHAKGKSFGCPGPQDVGEFPEEVRDALQAAVDFYEENLSLVTALGDRAAQGRAFGNLGNTHYLLGNFRDAVIAHEQRLLIAKEFGDKAAERRAYSNLGNAYIFLGEFETASEYYKKTLLLARQLKDRAVEAQSCYSLGNTYTLLQDYEKAIDYHLKHLAIAQELNDRIGEGRACWSLGNAYTALGNHDQAMHFAEKHLEISREVGDKSGELTARLNLSDLQMVLGLSYSTNNSIMSENTEIDSSLNGVRPKLGRRHSMENMELMKLTPEKVQNWNSEILAKQKPLIAKPSAKLLFVNRLKGKKYKTNSSTKVLQDASNSIDHRIPNSQRKISADTIGDEGFFDLLSRFQSNRMDDQRCCLQEKNCHTASTTTSSTPPKMMLKTSSVPVVSPNTDEFLDLLASSQSRRLDDQRASFSNLPGLRLTQNSQSVLSHLMTNDNKEADEDFFDILVKCQGSRLDDQRCAPPPATTKGPTVPDEDFFSLILRSQGKRMDEQRVLLQRDQNRDTDFGLKDFLQNNALLEFKNSGKKSADH,mutated_sequence,1.0,684.0,UPI000013D4C2.a2m,UPI000013D4C2.npy,gnomAD
+UPI0000160D96,UPI0000160D96.csv,MEEKEILRRQIRLLQGLIDDYKTLHGNAPAPGTPAASGWQPPTYHSGRAFSARYPRPSRRGYSSHHGPSWRKKYSLVNRPPGPSDPPADHAVRPLHGARGGQPPVPQQHVLERQVQLSQGQNVVIKVKPPSKSGSASASGAQRGSLEEFEETPWSDQRPREGEGEPPRGQLQPSRPTRARGTCSVEDPLLVCQKEPGKPRMVKSVGSVGDSPREPRRTVSESVIAVKASFPSSALPPRTGVALGRKLGSHSVASCAPQLLGDRRVDAGHTDQPVPSGSVGGPARPASGPRQAREASLVVTCRTNKFRKNNYKWVAASSKSPRVARRALSPRVAAENVCKASAGMANKVEKPQLIADPEPKPRKPATSSKPGSAPSKYKWKASSPSASSSSSFRWQSEASSKDHASQLSPVLSRSPSGDRPAVGHSGLKPLSGETPLSAYKVKSRTKIIRRRSSTSLPGDKKSGTSPAATAKSHLSLRRRQALRGKSSPVLKKTPNKGLVQVTTHRLCRLPPSRAHLPTKEASSLHAVRTAPTSKVIKTRYRIVKKTPASPLSAPPFPLSLPSWRARRLSLSRSLVLNRLRPVASGGGKAQPGSPWWRSKGYRCIGGVLYKVSANKLSKTSGQPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQRSLAIIRQARQRREKRKEYCMYYNRFGRCNRGERCPYIHDPEKVAVCTRFVRGTCKKTDGTCPFSHHVSKEKMPVCSYFLKGICSNSNCPYSHVYVSRKAEVCSDFLKGYCPLGAKCKKKHTLLCPDFARRGACPRGAQCQLLHRTQKRHSRRAATSPAPGPSDATARSRVSASHGPRKPSASQRPTRQTPSSAALTAAAVAAPPHCPGGSASPSSSKASSSSSSSSSPPASLDHEAPSLQEAALAAACSNRLCKLPSFISLQSSPSPGAQPRVRAPRAPLTKDSGKPLHIKPRL,mutated_sequence,1.0,948.0,UPI0000160D96.a2m,UPI0000160D96.npy,gnomAD
+UPI000006CF77,UPI000006CF77.csv,MSLPESPHSPATLDYALEDPHQGQRSREKSKATEVMADMFDGRLEPIVFPPPRLPEEGVAPQDPADGGHTFHILVDAGRSHGAIKAGQEVTPPPAEGLEAASASLTTDGSLKNGFPGEETHGLGGEKALETCGAGRSESEVIAEGKAEDVKPEECAMFSAPVDEKPGGEEMDVAEENRAIDEVNREAGPGPGPGPLNVGLHLNPLESIQLELDSVNAEADRALLQVERRFGQIHEYYLEQRNDIIRNIPGFWVTAFRHHPQLSAMIRGQDAEMLSYLTNLEVKELRHPRTGCKFKFFFQRNPYFRNKLIVKVYEVRSFGQVVSFSTLIMWRRGHGPQSFIHRNRHVICSFFTWFSDHSLPESDRIAQIIKEDLWSNPLQYYLLGEDAHRARRRLVREPVEIPRPFGFQCG,mutated_sequence,1.0,410.0,UPI000006CF77.a2m,UPI000006CF77.npy,gnomAD
+UPI000013DCB6,UPI000013DCB6.csv,MKMEEAVGKVEELIESEAPPKASEQETAKEEDGSVELESQVQKDGVADSTVISSMPCLLMELRRDSSESQLASTESDKPTTGRVYESDSSNHCMLSPSSSGHLADSDTLSSAEENEPSQAETAVEGDPSGVSGATVGRKSRRSRSESETSTMAAKKNRQSSDKQNGRVAKVKGHRSQKHKERIRLLRQKREAAARKKYNLLQDSSTSDSDLTCDSSTSSSDDDEEVSGSSKTITAEIPDGPPVVAHYDMSDTNSDPEVVNVDNLLAAAVVQEHSNSVGGQDTGATWRTSGLLEELNAEAGHLDPGFLASDKTSGNAPLNEEINIASSDSEVEIVGVQEHARCVHPRGGVIQSVSSWKHGSGTQYVSTRQTQSWTAVTPQQTWASPAEVVDLTLDEDSRRKYLL,mutated_sequence,1.0,403.0,UPI000013DCB6.a2m,UPI000013DCB6.npy,gnomAD
+UPI000015FD15,UPI000015FD15.csv,MGTGDFICISMTGGAPWGFRLQGGKEQKQPLQVAKIRNQSKASGSGLCEGDEVVSINGNPCADLTYPEVIKLMESITDSLQMLIKRPSSGISEALISENENKNLEHLTHGGYVESTTLQIRPATKTQCTEFFLAPVKTEVPLAENQRSGPDCAGSLKEETGPSYQRAPQMPDSQRGRVAEELILREKVEAVQPGPVVELQLSLSQERHKGASGPLVALPGAEKSKSPDPDPNLSHDRIVHINSIPTNEKADPFLRSSKIIQISSGRELRVIQESEAGDAGLPRVEVILDCSDRQKTEGCRLQAGKECVDSPVEGGQSEAPPSLVSFAVSSEGTEQGEDPRSEKDHSRPHKHRARHARLRRSESLSEKQVKEAKSKCKSIALLLTDAPNPNSKGVLMFKKRRRRARKYTLVSYGTGELEREADEEEEGDKEDTCEVAFLGASESEVDEELLSDVDDNTQVVNFDWDSGLVDIEKKLNRGDKMEMLPDTTGKGALMFAKRRERMDQITAQKEEDKVGGTPSREQDAAQTDGLRTTTSYQRKEEESVRTQSSVSKSYIEVSHGLGHVPQQNGFSGTSETANIQRMVPMNRTAKPFPGSVNQPATPFSPTRNMTSPIADFPAPPPYSAVTPPPDAFSRGVSSPIAGPAQPPPWPQPAPWSQPAFYDSSERIASRDERISVPAKRTGILQEAKRRSTTKPMFTFKEPKVSPNPELLSLLQNSEGKRGTGAGGDSGPEEDYLSLGAEACNFMQSSSAKQKTPPPVAPKPAVKSSSSQPVTPVSPVWSPGVAPTQPPAFPTSNPSKGTVVSSIKIAQPSYPPARPASTLNVAGPFKGPQAAVASQNYTPKPTVSTPTVNAVQPGAVGPSNELPGMSGRGAQLFAKRQSRMEKYVVDSDTVQAHAARAQSPTPSLPASWKYSSNVRAPPPVAYNPIHSPSYPLAALKSQPSAAQPSKMGKKKGKKPLNALDVMKHQPYQLNASLFTFQPPDAKDGLPQKSSVKVNSALAMKQALPPRPVNAASPTNVQASSVYSVPAYTSPPSFFAEASSPVSASPVPVGIPTSPKQESASSSYFVAPRPKFSAKKSGVTIQVWKPSVVEE,mutated_sequence,1.0,1093.0,UPI000015FD15.a2m,UPI000015FD15.npy,gnomAD
+UPI0001823FDC,UPI0001823FDC.csv,MAGSDVDSEGPARRGGAARRPGAPGGPGSEAAAGCPEPLSTAEAPAESATLPAWMRLYFYGMHGITLDVLVSSARRFARSPDLRMLGFSSPYRCLLHSLTHFALEKVYLQQRRCPNAFVFNFLLYPSAHVGLQTLAGQALLLSLGGGAGVAVAPGALDLALQYVLALYHCQVFLKRFLRLRYGRQRRRQQQQQQQQQQQQRRGALPVPPGARVPTAAGARRRRPRGPRGAGGAPSQGLPDLPRFLFFGMHGFLDEIFFTFFFNVLGQGDGTTSGHTSLWSFFMYGSCSFVVEKLYFHLHYSRGWGTWKRVPIYVIFIYVWELSWGLGLRTCGACSWDYSHYPLNFMGLITLMYLPGWIFLSVYQDLISNVLWRVQYVPAN,mutated_sequence,1.0,380.0,UPI0001823FDC.a2m,UPI0001823FDC.npy,gnomAD
+UPI0000071834,UPI0000071834.csv,MLERRCRGPLAMGLAQPRLLSGPSQESPQTLGKESRGLRQQGTSVAQSGAQAPGRAHRCAHCRRHFPGWVALWLHTRRCQARLPLPCPECGRRFRHAPFLALHRQVHAAATPDLGFACHLCGQSFRGWVALVLHLRAHSAAKRPIACPKCERRFWRRKQLRAHLRRCHPPAPEARPFICGNCGRSFAQWDQLVAHKRVHVAEALEEAAAKALGPRPRGRPAVTAPRPGGDAVDRPFQCACCGKRFRHKPNLIAHRRVHTGERPHQCPECGKRFTNKPYLTSHRRIHTGEKPYPCKECGRRFRHKPNLLSHSKIHKRSEGSAQAAPGPGSPQLPAGPQESAAEPTPAVPLKPAQEPPPGAPPEHPQDPIEAPPSLYSCDDCGRSFRLERFLRAHQRQHTGERPFTCAECGKNFGKKTHLVAHSRVHSGERPFACEECGRRFSQGSHLAAHRRDHAPDRPFVCPDCGKAFRHKPYLAAHRRIHTGEKPYVCPDCGKAFSQKSNLVSHRRIHTGERPYACPDCDRSFSQKSNLITHRKSHIRDGAFCCAICGQTFDDEERLLAHQKKHDV,mutated_sequence,1.0,567.0,UPI0000071834.a2m,UPI0000071834.npy,gnomAD
+UPI000006FF0C,UPI000006FF0C.csv,MVRETRHLWVGNLPENVREEKIIEHFKRYGRVESVKILPKRGSEGGVAAFVDFVDIKSAQKAHNSVNKMGDRDLRTDYNEPGTIPSAARGLDDTVSIASRSREVSGFRGGGGGPAYGPPPSLHAREGRYERRLDGASDNRERAYEHSAYGHHERGTGGFDRTRHYDQDYYRDPRERTLQHGLYYASRSRSPNRFDAHDPRYEPRAREQFTLPSVVHRDIYRDDITREVRGRRPERNYQHSRSRSPHSSQSRNQSPQRLASQASRPTRSPSGSGSRSRSSSSDSISSSSSTSSDSSDSSSSSSDDSPARSVQSAAVPAPTSQLLSSLEKDEPRKSFGIKVQNLPVRSTDTSLKDGLFHEFKKFGKVTSVQIHGTSEERYGLVFFRQQEDQEKALTASKGKLFFGMQIEVTAWIGPETESENEFRPLDERIDEFHPKATRTLFIGNLEKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAFLQYCDIASVCKAIKKMDGEYLGNNRLKLGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRYGPVVKVVFDRLKGMALVLYNEIEYAQAAVKETKGRKIGGNKIKVDFANRESQLAFYHCMEKSGQDIRDFYEMLAERREERRASYDYNQDRTYYESVRTPGTYPEDSRRDYPARGREFYSEWETYQGDYYESRYYDDPREYRDYRNDPYEQDIREYSYRQRERERERERFESDRDRDHERRPIERSQSPVHLRRPQSPGASPSQAERLPSDSERRLYSRSSDRSGSCSSLSPPRYEKLDKSRLERYTKNEKTDKERTFDPERVERERRLIRKEKVEKDKTDKQKRKGKVHSPSSQSSETDQENEREQSPEKPRSCNKLSREKADKEGIAKNRLELMPCVVLTRVKEKEGKVIDHTPVEKLKAKLDNDTVKSSALDQKLQVSQTEPAKSDLSKLESVRMKVPKEKGLSSHVEVVEKEGRLKARKHLKPEQPADGVSAVDLEKLEARKRRFADSNLKAEKQKPEVKKSSPEMEDARVLSKKQPDVSSREVILLREGEAERKPVRKEILKRESKKIKLDRLNTVASPKDCQELASISVGSGSRPSSDLQARLGELAGESVENQEVQSKKPIPSKPQLKQLQVLDDQGPEREDVRKNYCSLRDETPERKSGQEKSHSVNTEEKIGIDIDHTQSYRKQMEQSRRKQQMEMEIAKSEKFGSPKKDVDEYERRSLVHEVGKPPQDVTDDSPPSKKKRMDHVDFDICTKRERNYRSSRQISEDSERTGGSPSVRHGSFHEDEDPIGSPRLLSVKGSPKVDEKVLPYSNITVREESLKFNPYDSSRREQMADMAKIKLSVLNSEDELNRWDSQMKQDAGRFDVSFPNSIIKRDSLRKRSVRDLEPGEVPSDSDEDGEHKSHSPRASALYESSRLSFLLRDREDKLRERDERLSSSLERNKFYSFALDKTITPDTKALLERAKSLSSSREENWSFLDWDSRFANFRNNKDKEKVDSAPRPIPSWYMKKKKIRTDSEGKMDDKKEDHKEEEQERQELFASRFLHSSIFEQDSKRLQHLERKEEDSDFISGRIYGKQTSEGANSTTDSIQEPVVLFHSRFMELTRMQQKEKEKDQKPKEVEKQEDTENHPKTPESAPENKDSELKTPPSVGPPSVTVVTLESAPSALEKTTGDKTVEAPLVTEEKTVEPATVSEEAKPASEPAPAPVEQLEQVDLPPGADPDKEAAMMPAGVEEGSSGDQPPYLDAKPPTPGASFSQAESNVDPEPDSTQPLSKPAQKSEEANEPKAEKPDATADAEPDANQKAEAAPESQPPASEDLEVDPPVAAKDKKPNKSKRSKTPVQAAAVSIVEKPVTRKSERIDREKLKRSNSPRGEAQKLLELKMEAEKITRTASKNSAADLEHPEPSLPLSRTRRRNVRSVYATMGDHENRSPVKEPVEQPRVTRKRLERELQEAAAVPTTPRRGRPPKTRRRADEEEENEAKEPAETLKPPEGWRSPRSQKTAAGGGPQGKKGKNEPKVDATRPEATTEVGPQIGVKESSMEPKAAEEEAGSEQKRDRKDAGTDKNPPETAPVEVVEKKPAPEKNSKSKRGRSRNSRLAVDKSASLKNVDAAVSPRGAAAQAGERESGVVAVSPEKSESPQKEDGLSSQLKSDPVDPDKEPEKEDVSASGPSPEATQLAKQMELEQAVEHIAKLAEASASAAYKADAPEGLAPEDRDKPAHQASETELAAAIGSIINDISGEPENFPAPPPYPGESQTDLQPPAGAQALQPSEEGMETDEAVSGILETEAATESSRPPVNAPDPSAGPTDTKEARGNSSETSHSVPEAKGSKEVEVTLVRKDKGRQKTTRSRRKRNTNKKVVAPVESHVPESNQAQGESPAANEGTTVQHPEAPQEEKQSEKPHSTPPQSCTSDLSKIPSTENSSQEISVEERTPTKASVPPDLPPPPQPAPVDEEPQARFRVHSIIESDPVTPPSDPSIPIPTLPSVTAAKLSPPVASGGIPHQSPPTKVTEWITRQEEPRAQSTPSPALPPDTKASDVDTSSSTLRKILMDPKYVSATSVTSTSVTTAIAEPVSAAPCLHEAPPPPVDSKKPLEEKTAPPVTNNSEIQASEVLVAADKEKVAPVIAPKITSVISRMPVSIDLENSQKITLAKPAPQTLTGLVSALTGLVNVSLVPVNALKGPVKGSVTTLKSLVSTPAGPVNVLKGPVNVLTGPVNVLTTPVNATVGTVNAAPGTVNAAASAVNATASAVTVTAGAVTAASGGVTATTGTVTMAGAVIAPSTKCKQRASANENSRFHPGSMPVIDDRPADAGSGAGLRVNTSEGVVLLSYSGQKTEGPQRISAKISQIPPASAMDIEFQQSVSKSQVKPDSVTASQPPSKGPQAPAGYANVATHSTLVLTAQTYNASPVISSVKADRPSLEKPEPIHLSVSTPVTQGGTVKVLTQGINTPPVLVHNQLVLTPSIVTTNKKLADPVTLKIETKVLQPANLGSTLTPHHPPALPSKLPTEVNHVPSGPSIPADRTVSHLAAAKLDAHSPRPSGPGPSSFPRASHPSSTASTALSTNATVMLAAGIPVPQFISSIHPEQSVIMPPHSITQTVSLSHLSQGEVRMNTPTLPSITYSIRPEALHSPRAPLQPQQIEVRAPQRASTPQPAPAGVPALASQHPPEEEVHYHLPVARATAPVQSEVLVMQSEYRLHPYTVPRDVRIMVHPHVTAVSEQPRAADGVVKVPPASKAPQQPGKEAAKTPDAKAAPTPTPAPVPVPVPLPAPAPAPHGEARILTVTPSNQLQGLPLTPPVVVTHGVQIVHSSGELFQEYRYGDIRTYHPPAQLTHTQFPAASSVGLPSRTKTAAQGPPPEGEPLQPPQPVQSTQPAQPAPPCPPSQLGQPGQPPSSKMPQVSQEAKGTQTGVEQPRLPAGPANRPPEPHTQVQRAQAETGPTSFPSPVSVSMKPDLPVSLPTQTAPKQPLFVPTTSGPSTPPGLVLPHTEFQPAPKQDSSPHLTSQRPVDMVQLLKKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHRSLPLSEGGPPLRIAQRMRLEATQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNPGSNQPAYVLQIFPPCEFSESHLSRLAPDLLASISNISPHLMIVIASV,mutated_sequence,1.0,3664.0,UPI000006FF0C.a2m,UPI000006FF0C.npy,gnomAD
+UPI0000505465,UPI0000505465.csv,MALGEEKAEAEASEDTKAQSYGRGSCRERELDIPGPMSGEQPPRLEAEGGLISPVWGAEGIPAPTCWIGTDPGGPSRAHQPQASDANREPVAERSEPALSGLPPATMGSGDLLLSGESQVEKTKLSSSEEFPQTLSLPRTTTICSGHDADTEDDPSLADLPQALDLSQQPHSSGLSCLSQWKSVLSPGSAAQPSSCSISASSTGSSLQGHQERAEPRGGSLAKVSSSLEPVVPQEPSSVVGLGPRPQWSPQPVFSGGDASGLGRRRLSFQAEYWACVLPDSLPPSPDRHSPLWNPNKEYEDLLDYTYPLRPGPQLPKHLDSRVPADPVLQDSGVDLDSFSVSPASTLKSPTNVSPNCPPAEATALPFSGPREPSLKQWPSRVPQKQGGMGLASWSQLASTPRAPGSRDARWERREPALRGAKDRLTIGKHLDMGSPQLRTRDRGWPSPRPEREKRTSQSARRPTCTESRWKSEEEVESDDEYLALPARLTQVSSLVSYLGSISTLVTLPTGDIKGQSPLEVSDSDGPASFPSSSSQSQLPPGAALQGSGDPEGQNPCFLRSFVRAHDSAGEGSLGSSQALGVSSGLLKTRPSLPARLDRWPFSDPDVEGQLPRKGGEQGKESLVQCVKTFCCQLEELICWLYNVADVTDHGTAARSNLTSLKSSLQLYRQFKKDIDEHQSLTESVLQKGEILLQCLLENTPVLEDVLGRIAKQSGELESHADRLYDSILASLDMLAGCTLIPDKKPMAAMEHPCEGV,mutated_sequence,1.0,757.0,UPI0000505465.a2m,UPI0000505465.npy,gnomAD
+UPI0000E5B01A,UPI0000E5B01A.csv,MAEGGEGGEDEIQFLRTEDEVVLQCIATIHKEQRKFCLAAEGLGNRLCFLEPTSEAKYIPPDLCVCNFVLEQSLSVRALQEMLANTGENGGEGAAQGGGHRTLLYGHAVLLRHSFSGMYLTCLTTSRSQTDKLAFDVGLREHATGEACWWTIHPASKQRSEGEKVRIGDDLILVSVSSERYLHLSVSNGNIQVDASFMQTLWNVHPTCSGSSIEEGYLLGGHVVRLFHGHDECLTIPSTDQNDSQHRRIFYEAGGAGTRARSLWRVEPLRISWSGSNIRWGQAFRLRHLTTGHYLALTEDQGLILQDRAKSDTKSTAFSFRASKELKEKLDSSHKRDIEGMGVPEIKYGDSVCFVQHIASGLWVTYKAQDAKTSRLGPLKRKVILHQEGHMDDGLTLQRCQREESQAARIIRNTTALFSQFVSGNNRTAAPITLPIEEVLQTLQDLIAYFQPPEEEMRHEDKQNKLRSLKNRQNLFKEEGMLALVLNCIDRLNVYNSVAHFAGIAREESGMAWKEILNLLYKLLAALIRGNRNNCAQFSNNLDWLISKLDRLESSSGILEVLHCILTESPEALNLIAEGHIKSIISLLDKHGRNHKVLDILCSLCLCNGVAVRANQNLICDNLLPRRNLLLQTRLINDVTSIRPNIFLGVAEGSAQYKKWYFELIIDQVDPFLTAEPTHLRVGWASSSGYAPYPGGGEGWGGNGVGDDLYSYGFDGLHLWSGRIPRAVASINQHLLRSDDVVSCCLDLGVPSISFRINGQPVQGMFENFNTDGLFFPVMSFSAGVKVRFLMGGRHGEFKFLPPSGYAPCYEALLPKEKMRLEPVKEYKRDADGIRDLLGTTQFLSQASFIPCPVDTSQVILPPHLEKIRDRLAENIHELWGMNKIELGWTFGKIRDDNKRQHPCLVEFSKLPETEKNYNLQMSTETLKTLLALGCHIAHVNPAAEEDLKKVKLPKNYMMSNGYKPAPLDLSDVKLLPPQEILVDKLAENAHNVWAKDRIKQGWTYGIQQDLKNKRNPRLVPYALLDERTKKSNRDSLREAVRTFVGYGYNIEPSDQELADSAVEKVSIDKIRFFRVERSYAVRSGKWYFEFEVVTGGDMRVGWARPGCRPDVELGADDQAFVFEGNRGQRWHQGSGYFGRTWQPGDVVGCMINLDDASMIFTLNGELLITNKGSELAFADYEIENGFVPICCLGLSQIGRMNLGTDASTFKFYTMCGLQEGFEPFAVNMNRDVAMWFSKRLPTFVNVPKDHPHIEVMRIDGTMDSPPCLKVTHKTFGTQNSNADMIYCRLSMPVECHSSFSHSPCLDSEAFQKRKQMQEILSHTTTQCYYAIRIFAGQDPSCVWVGWVTPDYHLYSEKFDLNKNCTVTVTLGDERGRVHESVKRSNCYMVWGGDIVASSQRSNRSNVDLEIGCLVDLAMGMLSFSANGKELGTCYQVEPNTKVFPAVFLQPTSTSLFQFELGKLKNAMPLSAAIFRSEEKNPVPQCPPRLDVQTIQPVLWSRMPNSFLKVETERVSERHGWVVQCLEPLQMMALHIPEENRCVDILELCEQEDLMRFHYHTLRLYSAVCALGNSRVAYALCSHVDLSQLFYAIDNKYLPGLLRSGFYDLLISIHLASAKERKLMMKNEYIIPITSTTRNIRLFPDESKRHGLPGVGLRTCLKPGFRFSTPCFVVTGEDHQKQSPEIPLESLRTKALSMLTEAVQCSGAHIRDPVGGSVEFQFVPVLKLIGTLLVMGVFDDDDVRQILLLIDPSVFGEHSAGTEEGAEKEEVTQVEEKAVEAGEKAGKEAPVKGLLQTRLPESVKLQMCELLSYLCDCELQHRVEAIVAFGDIYVSKLQANQKFRYNELMQALNMSAALTARKTKEFRSPPQEQINMLLNFQLGENCPCPEEIREELYDFHEDLLLHCGVPLEEEEEEEEDTSWTGKLCALVYKIKGPPKPEKEQPTEEEERCPTTLKELISQTMICWAQEDQIQDSELVRMMFNLLRRQYDSIGELLQALRKTYTISHTSVSDTINLLAALGQIRSLLSVRMGKEEELLMINGLGDIMNNKVFYQHPNLMRVLGMHETVMEVMVNVLGTEKSQIAFPKMVASCCRFLCYFCRISRQNQKAMFEHLSYLLENSSVGLASPSMRGSTPLDVAASSVMDNNELALSLEEPDLEKVVTYLAGCGLQSCPMLLAKGYPDVGWNPIEGERYLSFLRFAVFVNSESVEENASVVVKLLIRRPECFGPALRGEGGNGLLAAMQGAIKISENPALDLPSQGYKREVSTGDDEEEEEIVHMGNAIMSFYSALIDLLGRCAPEMHLIQTGKGEAIRIRSILRSLVPTEDLVGIISIPLKLPSLNKDGSVSEPDMAANFCPDHKAPMVLFLDRVYGIKDQTFLLHLLEVGFLPDLRASASLDTVSLSTTEAALALNRYICSAVLPLLTRCAPLFAGTEHCTSLIDSTLQTIYRLSKGRSLTKAQRDTIEECLLAICNHLRPSMLQQLLRRLVFDVPQLNEYCKMPLKLLTNHYEQCWKYYCLPSGWGSYGLAVEEELHLTEKLFWGIFDSLSHKKYDPDLFRMALPCLSAIAGALPPDYLDTRITATLEKQISVDADGNFDPKPINTMNFSLPEKLEYIVTKYAEHSHDKWACDKSQSGWKYGISLDENVKTHPLIRPFKTLTEKEKEIYRWPARESLKTMLAVGWTVERTKEGEALVQQRENEKLRSVSQANQGNSYSPAPLDLSNVVLSRELQGMVEVVAENYHNIWAKKKKLELESKGGGSHPLLVPYDTLTAKEKFKDREKAQDLFKFLQVNGIIVSRGMKDMELDASSMEKRFAYKFLKKILKYVDSAQEFIAHLEAIVSSGKTEKSPRDQEIKFFAKVLLPLVDQYFTSHCLYFLSSPLKPLSSSGYASHKEKEMVAGLFCKLAALVRHRISLFGSDSTTMVSCLHILAQTLDTRTVMKSGSELVKAGLRAFFENAAEDLEKTSENLKLGKFTHSRTQIKGVSQNINYTTVALLPILTSIFEHVTQHQFGMDLLLGDVQISCYHILCSLYSLGTGKNIYVERQRPALGECLASLAAAIPVAFLEPTLNRYNPLSVFNTKTPRERSILGMPDTVEDMCPDIPQLEGLMKEINDLAESGARYTEMPHVIEVILPMLCNYLSYWWERGPENLPPSTGPCCTKVTSEHLSLILGNILKIINNNLGIDEASWMKRIAVYAQPIISKARPDLLRSHFIPTLEKLKKKAVKTVQEEEQLKADGKGDTQEAELLILDEFAVLCRDLYAFYPMLIRYVDNNRSNWLKSPDADSDQLFRMVAEVFILWCKSHNFKREEQNFVIQNEINNLAFLTGDSKSKMSKAMQVKSGGQDQERKKTKRRGDLYSIQTSLIVAALKKMLPIGLNMCTPGDQELISLAKSRYSHRDTDEEVREHLRNNLHLQEKSDDPAVKWQLNLYKDVLKSEEPFNPEKTVERVQRISAAVFHLEQVEQPLRSKKAVWHKLLSKQRKRAVVACFRMAPLYNLPRHRSINLFLHGYQRFWIETEEYSFEEKLVQDLAKSPKVEEEEEEETEKQPDPLHQIILYFSRNALTERSKLEDDPLYTSYSSMMAKSCQSGEDEEEDEDKEKTFEEKEMEKQKTLYQQARLHERGAAEMVLQMISASKGEMSPMVVETLKLGIAILNGGNAGVQQKMLDYLKEKKDAGFFQSLSGLMQSCSVLDLNAFERQNKAEGLGMVTEEGTLIVRERGEKVLQNDEFTRDLFRFLQLLCEGHNSDFQNFLRTQMGNTTTVNVIISTVDYLLRLQESISDFYWYYSGKDIIDESGQHNFSKALAVTKQIFNSLTEYIQGPCIGNQQSLAHSRLWDAVVGFLHVFANMQMKLSQDSSQIELLKELLDLLQDMVVMLLSLLEGNVVNGTIGKQMVDTLVESSTNVEMILKFFDMFLKLKDLTSSDTFKEYDPDGKGIISKKEFQKAMEGQKQYTQSEIDFLLSCAEADENDMFNYVDFVDRFHEPAKDIGFNVAVLLTNLSEHMPNDSRLKCLLDPAESVLNYFEPYLGRIEIMGGAKKIERVYFEISESSRTQWEKPQVKESKRQFIFDVVNEGGEQEKMELFVNFCEDTIFEMQLASQISESDSADRPEEEEEDEDSSYVLEIAGEEEEDGSLEPASAFAMACASVKRNVTDFLKRATLKNLRKQYRNVKKMTAKELVKVLFSFFWMLFVGLFQLLFTILGGIFQILWSTVFGGGLVEGAKNIRVTKILGDMPDPTQFGIHDDTMEAERAEVMEPGITTELVHFIKGEKGDTDIMSDLFGLHPKKEGSLKHGPEVGLGDLSEIIGKDEPPTLESTVQKKRKAQAAEMKAANEAEGKVESEKADMEDGEKEDKDKEEEQAEYLWTEVTKKKKRRCGQKVEKPEAFTANFFKGLEIYQTKLLHYLARNFYNLRFLALFVAFAINFILLFYKVTEEPLEEETEDVANLWNSFNDEEEEEAMVFFVLQESTGYMAPTLRALAIIHTIISLVCVVGYYCLKVPLVVFKREKEIARKLEFDGLYITEQPSEDDIKGQWDRLVINTPSFPNNYWDKFVKRKVINKYGDLYGAERIAELLGLDKNALDFSPVEETKAEAASLVSWLSSIDMKYHIWKLGVVFTDNSFLYLAWYTTMSVLGHYNNFFFAAHLLDIAMGFKTLRTILSSVTHNGKQLVLTVGLLAVVVYLYTVVAFNFFRKFYNKSEDDDEPDMKCDDMMTCYLFHMYVGVRAGGGIGDEIEDPAGDPYEMYRIVFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVREDMETKCFICGIGNDYFDTTPHGFETHTLQEHNLANYLFFLMYLINKDETEHTGQESYVWKMYQERCWDFFPAGDCFRKQYEDQLG,mutated_sequence,1.0,4870.0,UPI0000E5B01A.a2m,UPI0000E5B01A.npy,gnomAD
+UPI000045882C,UPI000045882C.csv,MTRGAWMCRQYDDGLKIWLAAPRENEKPFIDSERAQKWRLSLASLLFFTVLLSDHLWFCAEAKLTRARDKEHQQQQRQQQQQQQQQRQRQQQQQQRRQQEPSWPALLASMGESSPAAQAHRLLSASSSPTLPPSPGDGGGGGGKGNRGKDDRGKALFLGNSAKPVWRLETCYPQGASSGQCFTVENADAVCARNWSRGAAGGDGQEVRSKHPTPLWNLSDFYLSFCNSYTLWELFSGLSSPNTLNCSLDVVLKEGGEMTTCRQCVEAYQDYDHHAQEKYEEFESVLHKYLQSEEYSVKSCPEDCKIVYKAWLCSQYFEVTQFNCRKTIPCKQYCLEVQTRCPFILPDNDEVIYGGLSSFICTGLYETFLTNDEPECCDVRREEKSNNPSKGTVEKSGSCHRTSLTVSSATRLCNSRLKLCVLVLILLHTVLTASAAQNTAGLSFGGINTLEENSTNEE,mutated_sequence,1.0,458.0,UPI000045882C.a2m,UPI000045882C.npy,gnomAD
+UPI0000031D01,UPI0000031D01.csv,MRPGPALLLLGVGLSLSVGRLPLPPVPRGAQAAVSGAPGGLLRGAPGLGVRGGRALLSLRPSAVRAGGAVLSGRGSLCFPHGGTGRRWYCLDLRVLLSAQRLPWPAAPALALVDLQLSARGGRLSLTWSVRLPRSPGRLAWAFRLRLLGPGAARPASPAARVSPRSAAPGPRPQQGFVARTECPTDGPARVMLQAVNSSSHRAVESSVSCQINACVIQRVRINTDQKGAPVRLSMQAEATINASVQLDCPAARAIAQYWQVFSVPAVGQAPDWTQPLDLPQLEIRNSPLFIHIPNNSLQWGVYVFNFTVSITTGNPKMPEVKDSDAVYVWIVRSSLQAVMLGDANITANFTEQLILDGSTSSDPDADSPLQGLQFFWYCTTDPRNYGGDRIILGSKEVCHPEQANLKWPWASGPVLTLLPETLKGDHVYFFRMVIRKDSRTAFSDKRVHVLQGPKAIAHITCIENCERNFIVSDRFSLFLNCTNCASRDFYKWSILSSSGGEMLFDWMGETVTGRNGAYLSIKAFAFRHFLEAEFSISLYLACWSGVTSVFRHSFIINHGPQIGECKINPAKGIALITKFVVQCSNFRDKHVPLTYKIIVSDLHSVGEISSVKENTLGTILYLGPQSTVPPSFLPVGMLASQYGLKIYAQVYDSLGAFSQVTLHATAQAPTDKNSSKTVLNQLLSFTVGPSSLLSTLIQKKDFLPAGYLLYIVASVLNNMKTELPLRDDRVNLRKHLIDQSFLLPVSTLVEIGQVVMTITKLTQKPSEFTWDAQKRATMRVWQANQALQEYQQKDKRFRSEQIEIVSTGILMSLSNILKMTSPHQVVKDPFYVIESLSDTILANKVPGNKTTSMRTPNFNMYVKKVEKWGINQLFRNEKHCRNCFYPTLNVSSVPGLSANGPISTMFCDFTNDLFPWLNDQENTSVEVSGFRMTGVADNGSVLEITPDVAEVYLVRKNLTFAAFNLTVGPNSEVDGSLKKTTGGFSFQVDSTVLREVLVHIVTEVMVLFTVLVYTGSQITPTALVATFLVPHDIPPFASQSALFDPACTVKKARVVCLPVSLLQLIAQHSHSPHCTVSIVLQAPRFVMKLNDKLVRISIFSVQCLDMYGIQSEWREGYCILGEKTSWYEVHCICKNVVRARRQLGTIGLTGIHLHTHYVMAKVIVIPNPVDLRLNIIKSLHQNPVTLFTVLFIILLYVGLAFWALYRDEMDQHLRGHVIVLPDNDPYDNLCYLVTIFTGSRWGSGTRANVFVQLRGTVSTSDVHCLSHPHFTTLYRGSINTFLLTTKSDLGDIHSIRVWHNNEGRSPSWYLSRIKVENLFSRHIWLFICQKWLSVDTTLDRTFHVTHPDERLTRKDFFFIDVSSNLRKNHMWFSIFASVVAKTFNRLQRLSCCLAMLLSSLLCNIMFFNLNRQEQTESRERKYMRSMMIGIESVLITIPVQLLITFLFTCSQRKPQADLKEVSPQKHPLMSEASEHWEEYLRKWHAYETAKVHPREVAKPASKGKPRLPKASPKATSKPKHRHRKAQIKTPETLGPNTNSNNNIEDDQDVHSEQHPSQKDLQQLKKKPRIVLPWWCVYVAWFLVFATSSISSFFIVFYGLTYGYDKSIEWLFASFCSFCQSVLLVQPSKIILLSGFRTNKPKYCKNLSWSTKYKYTEIRLDGMRMHPEEMQRIHDQIVRIRGTRMYQPLTEDEIRIFKRKKRIKRRALLFLSYILTHFIFLALLLILIVLLRHTDCFYYNQFIRDRFSMDLATVTKLEDIYRWLNSVLLPLLHNDLNPTFLPESSSKILGLPLMRQVRAKSSEKMCLPAEKFVQNSIRREIHCHPKYGIDPEDTKNYSGFWNEVDKQAIDESTNGFTYKPQGTQWLYYSYGLLHTYGSGGYALYFFPEQQRFNSTLRLKELQESNWLDEKTWAVVLELTTFNPDINLFCSISVIFEVSQLGVVNTSISLHSFSLADFDRKASAEIYLYVAILIFFLAYVVDEGCIIMQERASYVRSVYNLLNFALKCIFTVLIVLFLRKHFLATGIIRFYLSNPEDFIPFHAVSQVDHIMRIILGFLLFLTILKTLRYSRFFYDVRLAQRAIQAALPGICHMAFVVSVYFFVYMAFGYLVFGQHEWNYSNLIHSTQTVFSYCVSAFQNTEFSNNRILGVLFLSSFMLVMICVLINLFQAVILSAYEEMKQPVYEEPSDEVEAMTYLCRKLRTMFSFLTSQSKAKDEPEFFIDMLYGQPEKNSHRYLGLKTRNINGKKMVYLVV,mutated_sequence,1.0,2253.0,UPI0000031D01.a2m,UPI0000031D01.npy,gnomAD
+UPI000020F9BB,UPI000020F9BB.csv,MRILKRFLACIQLLCVCRLDWANGYYRQQRKLVEEIGWSYTGALNQKNWGKKYPTCNSPKQSPINIDEDLTQVNVNLKKLKFQGWDKTSLENTFIHNTGKTVEINLTNDYRVSGGVSEMVFKASKITFHWGKCNMSSDGSEHSLEGQKFPLEMQIYCFDADRFSSFEEAVKGKGKLRALSILFEVGTEENLDFKAIIDGVESVSRFGKQAALDPFILLNLLPNSTDKYYIYNGSLTSPPCTDTVDWIVFKDTVSISESQLAVFCEVLTMQQSGYVMLMDYLQNNFREQQYKFSRQVFSSYTGKEEIHEAVCSSEPENVQADPENYTSLLVTWERPRVVYDTMIEKFAVLYQQLDGEDQTKHEFLTDGYQDLGAILNNLLPNMSYVLQIVAICTNGLYGKYSDQLIVDMPTDNPELDLFPELIGTEEIIKEEEEGKDIEEGAIVNPGRDSATNQIRKKEPQISTTTHYNRIGTKYNEAKTNRSPTRGSEFSGKGDVPNTSLNSTSQPVTKLATEKDISLTSQTVTELPPHTVEGTSASLNDGSKTVLRSPHMNLSGTAESLNTVSITEYEEESLLTSFKLDTGAEDSSGSSPATSAIPFISENISQGYIFSSENPETITYDVLIPESARNASEDSTSSGSEESLKDPSMEGNVWFPSSTDITAQPDVGSGRESFLQTNYTEIRVDESEKTTKSFSAGPVMSQGPSVTDLEMPHYSTFAYFPTEVTPHAFTPSSRQQDLVSTVNVVYSQTTQPVYNGETPLQPSYSSEVFPLVTPLLLDNQILNTTPAASSSDSALHATPVFPSVDVSFESILSSYDGAPLLPFSSASFSSELFRHLHTVSQILPQVTSATESDKVPLHASLPVAGGDLLLEPSLAQYSDVLSTTHAASETLEFGSESGVLYKTLMFSQVEPPSSDAMMHARSSGPEPSYALSDNEGSQHIFTVSYSSAIPVHDSVGVTYQGSLFSGPSHIPIPKSSLITPTASLLQPTHALSGDGEWSGASSDSEFLLPDTDGLTALNISSPVSVAEFTYTTSVFGDDNKALSKSEIIYGNETELQIPSFNEMVYPSESTVMPNMYDNVNKLNASLQETSVSISSTKGMFPGSLAHTTTKVFDHEISQVPENNFSVQPTHTVSQASGDTSLKPVLSANSEPASSDPASSEMLSPSTQLLFYETSASFSTEVLLQPSFQASDVDTLLKTVLPAVPSDPILVETPKVDKISSTMLHLIVSNSASSENMLHSTSVPVFDVSPTSHMHSASLQGLTISYASEKYEPVLLKSESSHQVVPSLYSNDELFQTANLEINQAHPPKGRHVFATPVLSIDEPLNTLINKLIHSDEILTSTKSSVTGKVFAGIPTVASDTFVSTDHSVPIGNGHVAITAVSPHRDGSVTSTKLLFPSKATSELSHSAKSDAGLVGGGEDGDTDDDGDDDDDDRGSDGLSIHKCMSCSSYRESQEKVMNDSDTHENSLMDQNNPISYSLSENSEEDNRVTSVSSDSQTGMDRSPGKSPSANGLSQKHNDGKEENDIQTGSALLPLSPESKAWAVLTSDEESGSGQGTSDSLNENETSTDFSFADTNEKDADGILAAGDSEITPGFPQSPTSSVTSENSEVFHVSEAEASNSSHESRIGLAEGLESEKKAVIPLVIVSALTFICLVVLVGILIYWRKCFQTAHFYLEDSTSPRVISTPPTPIFPISDDVGAIPIKHFPKHVADLHASSGFTEEFETLKEFYQEVQSCTVDLGITADSSNHPDNKHKNRYINIVAYDHSRVKLAQLAEKDGKLTDYINANYVDGYNRPKAYIAAQGPLKSTAEDFWRMIWEHNVEVIVMITNLVEKGRRKCDQYWPADGSEEYGNFLVTQKSVQVLAYYTVRNFTLRNTKIKKGSQKGRPSGRVVTQYHYTQWPDMGVPEYSLPVLTFVRKAAYAKRHAVGPVVVHCSAGVGRTGTYIVLDSMLQQIQHEGTVNIFGFLKHIRSQRNYLVQTEEQYVFIHDTLVEAILSKETEVLDSHIHAYVNALLIPGPAGKTKLEKQFQLLSQSNIQQSDYSAALKQCNREKNRTSSIIPVERSRVGISSLSGEGTDYINASYIMGYYQSNEFIITQHPLLHTIKDFWRMIWDHNAQLVVMIPDGQNMAEDEFVYWPNKDEPINCESFKVTLMAEEHKCLSNEEKLIIQDFILEATQDDYVLEVRHFQCPKWPNPDSPISKTFELISVIKEEAANRDGPMIVHDEHGGVTAGTFCALTTLMHQLEKENSVDVYQVAKMINLMRPGVFADIEQYQFLYKVILSLVSTRQEENPSTSLDSNGAALPDGNIAESLESLV,mutated_sequence,1.0,2315.0,UPI000020F9BB.a2m,UPI000020F9BB.npy,gnomAD
+UPI00001AAC44,UPI00001AAC44.csv,MEYEVKKGKKGFVSPIRRLVFPKAGRRAACRSSVSRRPLHSMPLYPPDYLIDPQILLCDYLEKEVKFLGHLTWVTSSLNPSSRDELLQLLDTARQLKELPLKTTAEQDSILSLSARCLLLTWRDNEELILRIPTHEIAAASYLQDDALHLLVLKTGLGVDPVPAGVDASPGGAGRDPGPPGGAPEKRRVGTAERRHTICSLDWRMGWGGGAAEARAGGGGGGSLERQRAGARASGSWERRQTFSGSWERRHGGGGGGGGAGKPGGSWERRQAGSGGGGSWERRHPGPNPLDPQDPSPDAYCNLVILAVANRDAAEESCALICQVFQIIYGDQSIECVDRAGYHYTSTPERPWLCSRSESCHTDGTYAYDADFSCCSSFNGSQDTFEACYSGTSTPSFHGSHCSGSDHSSLGLEQLQDYMVTLRSKLGPLEIQQFAMLLREYRLGLPIQDYCTGLLKLYGDRRKFLLLGMRPFIPDQDIGYFEGFLEGVGIREGGILTDSFGRIKRSMSSTSASAVRSYDGAAQRPEAQAFHRLLADITHDIEALAPDDDDDDEDEPRGSRGGSDAAEDNYL,mutated_sequence,1.0,571.0,UPI00001AAC44.a2m,UPI00001AAC44.npy,gnomAD
+UPI00024672CE,UPI00024672CE.csv,MWPQPHLPPHPMMSEKTRQNKLAEAKKKFTDYRQWNIAGVGTRATDTKKKKINNGTNPETTTSEGCHSPEDTQQNRAQLKEEKKASHQHQEALRREIEAQDHTIRILTCQKTELETALYYSQDAARKFEDGNLGTPSSFNLALSQAFRGSPLGCVSTSLIPGESKDLAGRLHHSWHFAGELQRALSAVSTWHKKADRYIEELTKERDALSLELYRNTITNEELKKKNAELQEKLRLAESEKSEIQLNVKELKRKLERAKFLLPQVQTNTLQEEMWRQEEELREQEKKIRKQEEKMWRQEERLREQEGKMREQEEKMRRQEKRLREQEKELREQEKELREQKKLREQEEQMQEQEEKMWEQEEKMREQEEKMWRQEERLWEQEKQMREQEQKMRDQEERMWEQDERLREKEERMREQEKMWEQVEKMREEKKMQEQEKKTRDQEEKMQEEERIREREKKMREEEETMREQEEKMQKQEENMWEQEEKEWQQQRLPEQKEKLWEQEKMQEQEEKIWEQEEKIRDQEEMWGQEKKMWRQEKMREQEDVETGGEAAGAGEADVGAGGEDAGSGAEDVGPGGEDVGAGREAAGEGGENAGAEEDVAAGGEDAGGEEDAGAGEEDMGPGGEDARGGEDAGAGEEDAGGGGDDAGAGGEDAGAGREDAGAGGEDVGAGREDAGAGGEDVGAGGEDVGAGRRRCGSSRGCRNRRRSCGNTRRCRSRRSGAEDVGPEGEDVGAGREAAGEGGENAGAEDVAAGGEDAGEEEDAGGEDAGAAREDAGAGGDDVGAGREDAGAGGEDVGAGGEDAGAGGEDAGAGGEDAGPGGEDAGAGGEDAGPGGEDAGAGGEDAGPGGEDVGPGGEDVGAGGEDVGAGGDAREGGEDTRSEREDAGEAARARGAVLRALPPSLQSSL,mutated_sequence,1.0,909.0,UPI00024672CE.a2m,UPI00024672CE.npy,gnomAD
+UPI000000DA9E,UPI000000DA9E.csv,MPAGRRGPAAQSARRPPPLLPLLLLLCVLGAPRAGSGAHTAVISPQDPTLLIGSSLLATCSVHGDPPGATAEGLYWTLNGRRLPPELSRVLNASTLALALANLNGSRQRSGDNLVCHARDGSILAGSCLYVGLPPEKPVNISCWSKNMKDLTCRWTPGAHGETFLHTNYSLKYKLRWYGQDNTCEEYHTVGPHSCHIPKDLALFTPYEIWVEATNRLGSARSDVLTLDILDVVTTDPPPDVHVSRVGGLEDQLSVRWVSPPALKDFLFQAKYQIRYRVEDSVDWKVVDDVSNQTSCRLAGLKPGTVYFVQVRCNPFGIYGSKKAGIWSEWSHPTAASTPRSERPGPGGGACEPRGGEPSSGPVRRELKQFLGWLKKHAYCSNLSFRLYDQWRAWMQKSHKTRNQDEGILPSGRRGTARGPAR,mutated_sequence,1.0,422.0,UPI000000DA9E.a2m,UPI000000DA9E.npy,gnomAD
+UPI00001AE937,UPI00001AE937.csv,MEGCDSPVVSGKDNGCGIPQHQQWTELNSTHLPDKPSSMEQSTGESHGPLDSLRAPFNERLAESTASAGPPSEPASKEVTCNECSASFASLQTYMEHHCPSARPPPPLREESASDTGEEGDEESDVENLAGEIVYQPDGSAYIVESLSQLTQGGGACGSGSGSGPLPSLFLNSLPGAGGKQGDPSCAAPVYPQIINTFHIASSFGKWFEGPDQAFPNTSALAGLSPVLHSFRVFDVRHKSNKDYLNSDGSAKSSCVSKDVPNNVDLSKFDGFVLYGKRKPILMCFLCKLSFGYVRSFVTHAVHDHRMTLSEDERKILSNKNISAIIQGIGKDKEPLVSFLEPKNKNFQHPLVSTANLIGPGHSFYGKFSGIRMEGEEALPAGSAAGPEQPQAGLLTPSTLLNLGGLTSSVLKTPITSVPLGPLASSPTKSSEGKDSGAAEGEKQEVGDGDCFSEKVEPAEEEAEEEEEEEEAEEEEEEEEEEEEEEEDEGCKGLFPSELDEELEDRPHEEPGAAAGSSSKKDLALSNQSISNSPLMPNVLQTLSRGTASTSSNSASSFVVFDGANRRNRLSFNSEGVRANVAEGGRRLDFADESANKDNATAPEPNESTEGDDGGFVPHHQHAGSLCELGVGECPSGSGVECPKCDTVLGSSRSLGGHMTMMHSRNSCKTLKCPKCNWHYKYQQTLEAHMKEKHPEPGGSCVYCKSGQPHPRLARGESYTCGYKPFRCEVCNYSTTTKGNLSIHMQSDKHLNNMQNLQNGGGEQVFSHTAGAAAAAVAAAAAAANISSSCGAPSPTKPKTKPTWRCEVCDYETNVARNLRIHMTSEKHMHNMMLLQQNMTQIQHNRHLGLGSLPSPAEAELYQYYLAQNMNLPNLKMDSAASDAQFMMSGFQLDPAGPMAAMTPALVGGEIPLDMRLGGGQLVSEELMNLGESFIQTNDPSLKLFQCAVCNKFTTDNLDMLGLHMNVERSLSEDEWKAVMGDSYQCKLCRYNTQLKANFQLHCKTDKHVQKYQLVAHIKEGGKANEWRLKCVAIGNPVHLKCNACDYYTNSLEKLRLHTVNSRHEASLKLYKHLQQHESGVEGESCYYHCVLCNYSTKAKLNLIQHVRSMKHQRSESLRKLQRLQKGLPEEDEDLGQIFTIRRCPSTDPEEAIEDVEGPSETAADPEELAKDQEGGASSSQAEKELTDSPATSKRISFPGSSESPLSSKRPKTAEEIKPEQMYQCPYCKYSNADVNRLRVHAMTQHSVQPMLRCPLCQDMLNNKIHLQLHLTHLHSVAPDCVEKLIMTVTTPEMVMPSSMFLPAAVPDRDGNSNLEEAGKQPETSEDLGKNILPSASTEQSGDLKPSPADPGSVREDSGFICWKKGCNQVFKTSAALQTHFNEVHAKRPQLPVSDRHVYKYRCNQCSLAFKTIEKLQLHSQYHVIRAATMCCLCQRSFRTFQALKKHLETSHLELSEADIQQLYGGLLANGDLLAMGDPTLAEDHTIIVEEDKEEESDLEDKQSPTGSDSGSVQEDSGSEPKRALPFRKGPNFTMEKFLDPSRPYKCTVCKESFTQKNILLVHYNSVSHLHKLKRALQESATGQPEPTSSPDNKPFKCNTCNVAYSQSSTLEIHMRSVLHQTKARAAKLEAASGSSNGTGNSSSISLSSSTPSPVSTSGSNTFTTSNPSSAGIAPSSNLLSQVPTESVGMPPLGNPIGANIASPSEPKEANRKKLADMIASRQQQQQQQQQQQQQQQQQQQAQTLAQAQAQVQAHLQQELQQQAALIQSQLFNPTLLPHFPMTTETLLQLQQQQHLLFPFYIPSAEFQLNPEVSLPVTSGALTLTGTGPGLLEDLKAQVQVPQQSHQQILPQQQQNQLSIAQSHSALLQPSQHPEKKNKLVIKEKEKESQRERDSAEGGEGNTGPKETLPDALKAKEKKELAPGGGSEPSMLPPRIASDARGNATKALLENFGFELVIQYNENKQKVQKKNGKTDQGENLEKLECDSCGKLFSNILILKSHQEHVHQNYFPFKQLERFAKQYRDHYDKLYPLRPQTPEPPPPPPPPPPPPLPAAPPQPASTPAIPASAPPITSPTIAPAQPSVPLTQLSMPMELPIFSPLMMQTMPLQTLPAQLPPQLGPVEPLPADLAQLYQHQLNPTLLQQQNKRPRTRITDDQLRVLRQYFDINNSPSEEQIKEMADKSGLPQKVIKHWFRNTLFKERQRNKDSPYNFSNPPITSLEELKIDSRPPSPEPPKQEYWGSKRSSRTRFTDYQLRVLQDFFDANAYPKDDEFEQLSNLLNLPTRVIVVWFQNARQKARKNYENQGEGKDGERRELTNDRYIRTSNLNYQCKKCSLVFQRIFDLIKHQKKLCYKDEDEEGQDDSQNEDSMDAMEILTPTSSSCSTPMPSQAYSAPAPSANNTASSAFLQLTAEAEELATFNSKTEAGDEKPKLAEAPSAQPNQTQEKQGQPKPELQQQEQPEQKTNTPQQKLPQLVSLPSLPQPPPQAPPPQCPLPQSSPSPSQLSHLPLKPLHTSTPQQLANLPPQLIPYQCDQCKLAFPSFEHWQEHQQLHFLSAQNQFIHPQFLDRSLDMPFMLFDPSNPLLASQLLSGAIPQIPASSATSPSTPTSTMNTLKRKLEEKASASPGENDSGTGGEEPQRDKRLRTTITPEQLEILYQKYLLDSNPTRKMLDHIAHEVGLKKRVVQVWFQNTRARERKGQFRAVGPAQAHRRCPFCRALFKAKTALEAHIRSRHWHEAKRAGYNLTLSAMLLDCDGGLQMKGDIFDGTSFSHLPPSSSDGQGVPLSPVSKTMELSPRTLLSPSSIKVEGIEDFESPSMSSVNLNFDQTKLDNDDCSSVNTAITDTTTGDEGNADNDSATGIATETKSSSAPNEGLTKAAMMAMSEYEDRLSSGLVSPAPSFYSKEYDNEGTVDYSETSSLADPCSPSPGASGSAGKSGDSGDRPGQKRFRTQMTNLQLKVLKSCFNDYRTPTMLECEVLGNDIGLPKRVVQVWFQNARAKEKKSKLSMAKHFGINQTSYEGPKTECTLCGIKYSARLSVRDHIFSQQHISKVKDTIGSQLDKEKEYFDPATVRQLMAQQELDRIKKANEVLGLAAQQQGMFDNTPLQALNLPTAYPALQGIPPVLLPGLNSPSLPGFTPSNTALTSPKPNLMGLPSTTVPSPGLPTSGLPNKPSSASLSSPTPAQATMAMGPQQPPQQQQQQQQPQVQQPPPPPAAQPPPTPQLPLQQQQQRKDKDSEKVKEKEKAHKGKGEPLPVPKKEKGEAPTATAATISAPLPTMEYAVDPAQLQALQAALTSDPTALLTSQFLPYFVPGFSPYYAPQIPGALQSGYLQPMYGMEGLFPYSPALSQALMGLSPGSLLQQYQQYQQSLQEAIQQQQQRQLQQQQQQKVQQQQPKASQTPVPPGAPSPDKDPAKESPKPEEQKNTPREVSPLLPKLPEEPEAESKSADSLYDPFIVPKVQYKLVCRKCQAGFSDEEAARSHLKSLCFFGQSVVNLQEMVLHVPTGGGGGGSGGGGGGGGGGGGGGSYHCLACESALCGEEALSQHLESALHKHRTITRAARNAKEHPSLLPHSACFPDPSTASTSQSAAHSNDSPPPPSAAAPSSASPHASRKSWPQVVSRASAAKPPSFPPLSSSSTVTSSSCSTSGVQPSMPTDDYSEESDTDLSQKSDGPASPVEGPKDPSCPKDSGLTSVGTDTFRL,mutated_sequence,1.0,3703.0,UPI00001AE937.a2m,UPI00001AE937.npy,gnomAD
+UPI00001285F3,UPI00001285F3.csv,MQQTRTEAVAGAFSRCLGFCGMRLGLLLLARHWCIAGVFPQKFDGDSAYVGMSDGNPELLSTSQTYNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMVSNMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQMGIRSSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT,mutated_sequence,1.0,488.0,UPI00001285F3.a2m,UPI00001285F3.npy,gnomAD
+UPI00001D822A,UPI00001D822A.csv,MQTLGSFFGSLPGFSSARNLVANAHSSARARPAADPTGAPAAEAAQPQAQVAAHPEQTAPWTEKELQPSEKQMVSGAKDLVCSKMSRAKDAVSSGVASVVDVAKGVVQGGLDTTRSALTGTKEVVSSGVTGAMDMAKGAVQGGLDTSKAVLTGTKDTVSTGLTGAVNVAKGTVQAGVDTTKTVLTGTKDTVTTGVMGAVNLAKGTVQTGVETSKAVLTGTKDAVSTGLTGAVNVARGSIQTGVDTSKTVLTGTKDTVCSGVTGAMNVAKGTIQTGVDTSKTVLTGTKDTVCSGVTGAMNVAKGTIQTGVDTSKTVLTGTKDTVCSGVTGAMNVAKGTIQTGVDTTKTVLTGTKNTVCSGVTGAVNLAKEAIQGGLDTTKSMVMGTKDTMSTGLTGAANVAKGAMQTGLNTTQNIATGTKDTVCSGVTGAMNLARGTIQTGVDTTKIVLTGTKDTVCSGVTGAANVAKGAVQGGLDTTKSVLTGTKDAVSTGLTGAVNVAKGTVQTGVDTTKTVLTGTKDTVCSGVTSAVNVAKGAVQGGLDTTKSVVIGTKDTMSTGLTGAANVAKGAVQTGVDTAKTVLTGTKDTVTTGLVGAVNVAKGTVQTGMDTTKTVLTGTKDTIYSGVTSAVNVAKGAVQTGLKTTQNIATGTKNTFGSGVTSAVNVAKGAAQTGVDTAKTVLTGTKDTVTTGLMGAVNVAKGTVQTSVDTTKTVLTGTKDTVCSGVTGAANVAKGAIQGGLDTTKSVLTGTKDAVSTGLTGAVKLAKGTVQTGMDTTKTVLTGTKDAVCSGVTGAANVAKGAVQMGVDTAKTVLTGTKDTVCSGVTGAANVAKGAVQTGLKTTQNIATGTKNTLGSGVTGAAKVAKGAVQGGLDTTKSVLTGTKDAVSTGLTGAVNLAKGTVQTGVDTSKTVLTGTKDTVCSGVTGAVNVAKGTVQTGVDTAKTVLSGAKDAVTTGVTGAVNVAKGTVQTGVDASKAVLMGTKDTVFSGVTGAMSMAKGAVQGGLDTTKTVLTGTKDAVSAGLMGSGNVATGATHTGLSTFQNWLPSTPATSWGGLTSSRTTDNGGEQTALSPQEAPFSGISTPPDVLSVGPEPAWEAAATTKGLATDVATFTQGAAPGREDTGLLATTHGPEEAPRLAMLQNELEGLGDIFHPMNAEEQAQLAASQPGPKVLSAEQGSYFVRLGDLGPSFRQRAFEHAVSHLQHGQFQARDTLAQLQDCFRLIEKAQQAPEGQPRLDQGSGASAEDAAVQEERDAGVLSRVCGLLRQLHTAYSGLVSSLQGLPAELQQPVGRARHSLCELYGIVASAGSVEELPAERLVQSREGVHQAWQGLEQLLEGLQHNPPLSWLVGPFALPAGGQ,mutated_sequence,1.0,1357.0,UPI00001D822A.a2m,UPI00001D822A.npy,gnomAD
+UPI0000052952,UPI0000052952.csv,MGDQQLYKTNHVAHGSENLFYQQPPLGVHSGLNHNYGNAVTGGGMDAPQASPISPHFPQDTRDGLGLPVGSKNLGQMDTSRQGGWGSHAGPGNHVQLRGNLANSNMMWGAPAQAEPTDGYQYTYSQASEIRTQKLTSGVLHKLDSFTQVFANQNLRIQVNNMAQVLHTQSAVMDGAPDSALRQLLSQKPMEPPAPAIPSRYQQVPQQPHPGFTGGLSKPALQVGQHPTQGHLYYDYQQPLAQVPVQGGQPLQAPQMLSQHMQQMQQHQYYPPQQQQQAGQQRISMQEIQTQPQQIRPSQPQPPPQQQQPQQLQLQQRQGSMQIPQYYQPQPMMQHLQEQQQQQMHLQPPSYHRDPHQYTPEQAHTVQLIPLGSMSQYYYQEPQQPYSHPLYQQSHLSQHQQREDSQLKTYSSDRQAQAMLSSHGDLGPPDTGMGDPASSDLTRVSSTLPHRPLLSPSGIHLNNMGPQHQQLSPSAMWPQMHLPDGRAQPGSPESSGQPKGAFGEQFDAKNKLTCSICLKEFKNLPALNGHMRSHGGMRASPNLKQEEGEKVLPPQPQPPLPPPPPPPPPPQLPPEAESLTPMVMPVSVPVKLLPPKPSSQGFTNSTVAAPSARDKPASSMSDDEMPVLEIPRKHQPSVPKAEEPLKTVQEKKKFRHRPEPLFIPPPPSYNPNPAASYSGATLYQSQLRSPRVLGDHLLLDPTHELPPYTPPPMLSPVRQGSGLFSNVLISGHGPGAHPQLPLTPLTPTPRVLLCRSNSIDGSNVTVTPGPGEQTVDVEPRINIGLRFQAEIPELQDISALAQDTHKATLVWKPWPELENHDLQQRVENLLNLCCSSALPGGGTNSEFALHSLFEAKGDVMVALEMLLLRKPVRLKCHPLANYHYAGSDKWTSLERKLFNKALATYSKDFIFVQKMVKSKTVAQCVEYYYTWKKIMRLGRKHRTRLAEIIDDCVTSEEEEELEEEEEEDPEEDRKSTKEEESEVPKSPEPPPVPVLAPTEGPPLQALGQPSGSFICEMPNCGAVFSSRQALNGHARIHGGTNQVTKARGAIPSGKQKPGGTQSGYCSVKSSPSHSTTSGETDPTTIFPCKECGKVFFKIKSRNAHMKTHRQQEEQQRQKAQKAAFAAEMAATIERTTGPVGAPGLLPLDQLSLIKPIKDVDILDDDVVQQLGGVMEEAEVVDTDLLLDDQDSVLLQGDAEL,mutated_sequence,1.0,1200.0,UPI0000052952.a2m,UPI0000052952.npy,gnomAD
+UPI00004A3B76,UPI00004A3B76.csv,MGNWVVNHWFSVLFLVVWLGLNVFLFVDAFLKYEKADKYYYTRKILGSTLACARASALCLNFNSTLILLPVCRNLLSFLRGTCSFCSRTLRKQLDHNLTFHKLVAYMICLHTAIHIIAHLFNFDCYSRSRQATDGSLASILSSLSHDEKKGGSWLNPIHPHITPTVYMFTVTFDMVLSSVNSNLFLFLLIKK,mutated_sequence,1.0,192.0,UPI00004A3B76.a2m,UPI00004A3B76.npy,gnomAD
+UPI0000164A41,UPI0000164A41.csv,MSEASSEDLVPPLEAGAAPYREEEEAAKKKKEKKKKSKGLANVFCVFTKGKKKKGQPSSAEPEDAAGSRQGLDGPPPTVEELKAALERGQLEAARPLLALERELAAAAAAGGVSEEELVRRQSKVEALYELLRDQVLGVLRRPLEAPPERLRQALAVVAEQEREDRQAAAAGPGTSGLAATRPRRWLQLWRRGVAEAAEERMGQRPAAGAEVPESVFLHLGRTMKEDLEAVVERLKPLFPAEFGVVAAYAESYHQHFAAHLAAVAQFELCERDTYMLLLWVQNLYPNDIINSPKLVGELQGMGLGSLLPPRQIRLLEATFLSSEAANVRELMDRALELEARRWAEDVPPQRLDGHCHSELAIDIIQITSQAQAKAESITLDLGSQIKRVLLVELPAFLRSYQRAFNEFLERGKQLTNYRANVIANINNCLSFRMSMEQNWQVPQDTLSLLLGPLGELKSHGFDTLLQNLHEDLKPLFKRFTHTRWAAPVETLENIIATVDTRLPEFSELQGCFREELMEALHLHLVKEYIIQLSKGRLVLKTAEQQQQLAGYILANADTIQHFCTQHGSPATWLQPALPTLAEIIRLQDPSAIKIEVATYATCYPDFSKGHLSAILAIKGNLSNSEVKRIRSILDVSMGAQEPSRPLFSLIKVG,mutated_sequence,1.0,654.0,UPI0000164A41.a2m,UPI0000164A41.npy,gnomAD
+UPI0000D62427,UPI0000D62427.csv,MEPSRALLGCLASAAAAAPPGEDGAGAGAEEEEEEEEEAAAAVGPGELGCDAPLPYWTAVFEYEAAGEDELTLRLGDVVEVLSKDSQVSGDEGWWTGQLNQRVGIFPSNYVTPRSAFSSRCQPGGEDPSCYPPIQLLEIDFAELTLEEIIGIGGFGKVYRAFWIGDEVAVKAARHDPDEDISQTIENVRQEAKLFAMLKHPNIIALRGVCLKEPNLCLVMEFARGGPLNRVLSGKRIPPDILVNWAVQIARGMNYLHDEAIVPIIHRDLKSSNILILQKVENGDLSNKILKITDFGLAREWHRTTKMSAAGTYAWMAPEVIRASMFSKGSDVWSYGVLLWELLTGEVPFRGIDGLAVAYGVAMNKLALPIPSTCPEPFAKLMEDCWNPDPHSRPSFTNILDQLTTIEESGFFEMPKDSFHCLQDNWKHEIQEMFDQLRAKEKELRTWEEELTRAALQQKNQEELLRRREQELAEREIDILERELNIIIHQLCQEKPRVKKRKGKFRKSRLKLKDGNRISLPSDFQHKFTVQASPTMDKRKSLINSRSSPPASPTIIPRLRAIQLTPGESSKTWGRSSVVPKEEGEEEEKRAPKKKGRTWGPGTLGQKELASGDEGSPQRREKANGLSTPSESPHFHLGLKSLVDGYKQWSSSAPNLVKGPRSSPALPGFTSLMEMEDEDSEGPGSGESRLQHSPSQSYLCIPFPRGEDGDGPSSDGIHEEPTPVNSATSTPQLTPTNSLKRGGAHHRRCEVALLGCGAVLAATGLGFDLLEAGKCQLLPLEEPEPPAREEKKRREGLFQRSSRPRRSTSPPSRKLFKKEEPMLLLGDPSASLTLLSLSSISECNSTRSLLRSDSDEIVVYEMPVSPVEAPPLSPCTHNPLVNVRVERFKRDPNQSLTPTHVTLTTPSQPSSHRRTPSDGALKPETLLASRSPSSNGLSPSPGAGMLKTPSPSRDPGEFPRLPDPNVVFPPTPRRWNTQQDSTLERPKTLEFLPRPRPSANRQRLDPWWFVSPSHARSTSPANSSSTETPSNLDSCFASSSSTVEERPGLPALLPFQAGPLPPTERTLLDLDAEGQSQDSTVPLCRAELNTHRPAPYEIQQEFWS,mutated_sequence,1.0,1104.0,UPI0000D62427.a2m,UPI0000D62427.npy,gnomAD
+UPI000059DA57,UPI000059DA57.csv,XTASSEPAPASAAKQEKPAEKPAETPVATSPTATDSTSGDSSRSNLFEDATSALVTGQSYENMVTEIMSMGYEREQVIAALRASFNNPDRAVEYLLMIIVKTGNKKKKQPLLGK,mutated_sequence,1.0,114.0,UPI000059DA57.a2m,UPI000059DA57.npy,gnomAD
+UPI000012D193,UPI000012D193.csv,MKAVSPVRPSGRKAPSGCGGGELALRCLAEHGHSLGGSAAAAAAAAAARCKAAEAAADEPALCLQCDMNDCYSRLRRLVPTIPPNKKVSKVEILQHVIDYILDLQLALETHPALLRQPPPPAPPHHPAGTCPAAPPRTPLTALNTDPAGAVNKQGDSILCR,mutated_sequence,1.0,161.0,UPI000012D193.a2m,UPI000012D193.npy,gnomAD
+UPI0000DA5AF5,UPI0000DA5AF5.csv,MMAKSALRENGTNSETFRQRFRRFHYQEVAGPREAFSQLWELCCRWLRPEVRTKEQIVELLVLEQFLTVLPGEIQNWVQEQCPENGEEAVTLVEDLEREPGRPRSSVTVSVKGQEVRLEKMTPPKSSQELLSVRQESVEPQPRGVPKKERARSPDLGPQEQMNPKEKLKPFQRSGLPFPKSGVVSRLEQGEPWIPDLLGSKEKELPSGSHIGDRRVHADLLPSKKDRRSWVEQDHWSFEDEKVAGVHWGYEETRTLLAILSQTEFYEALRNCHRNSQVYGAVAERLREYGFLRTLEQCRTKFKGLQKSYRKVKSGHPPETCPFFEEMEALMSAQVIALPSNGLEAAASHSGLVGSDAETEEPGQRGWQHEEGAEEAVAQESDSDDMDLEATPQDPNSAAPVVFRSPGGVHWGYEETKTYLAILSETQFYEALRNCHRNSQLYGAVAERLWEYGFLRTPEQCRTKFKSLQTSYRKVKNGQAPETCPFFEEMDALVSVRVAAPPNDGQEETASCPVQGTSEAEAQKQAEEADEATEEDSDDDEEDTEIPPGAVITRAPVLFQSPRGFEAGFENEDNSKRDISEEVQLHRTLLARSERKIPRYLHQGKGNESDCRSGRQWAKTSGEKRGKLTLPEKSLSEVLSQQRPCLGERPYKYLKYSKSFGPNSLLMHQVSHQVENPYKCADCGKSFSRSARLIRHRRIHTGEKPYKCLDCGKSFRDSSNFITHRRIHTGEKPYQCGECGKCFNQSSSLIIHQRTHTGEKPYQCEECGKSFNNSSHFSAHRRIHTGERPHVCPDCGKSFSKSSDLRAHHRTHTGEKPYGCHDCGKCFSKSSALNKHGEIHAREKLLTQSAPK,mutated_sequence,1.0,852.0,UPI0000DA5AF5.a2m,UPI0000DA5AF5.npy,gnomAD
+UPI000000D72E,UPI000000D72E.csv,MACTGPSLPSAFDILGAAGQDKLLYLKHKLKTPRPGCQGQDLLHAMVLLKLGQETEARISLEALKADAVARLVARQWAGVDSTEDPEEPPDVSWAVARLYHLLAEEKLCPASLRDVAYQEAVRTLSSRDDHRLGELQDEARNRCGWDIAGDPGSIRTLQSNLGCLPPSSALPSGTRSLPRPIDGVSDWSQGCSLRSTGSPASLASNLEISQSPTMPFLSLHRSPHGPSKLCDDPQASLVPEPVPGGCQEPEEMSWPPSGEIASPPELPSSPPPGLPEVAPDATSTGLPDTPAAPETSTNYPVECTEGSAGPQSLPLPILEPVKNPCSVKDQTPLQLSVEDTTSPNTKPCPPTPTTPETSPPPPPPPPSSTPCSAHLTPSSLFPSSLESSSEQKFYNFVILHARADEHIALRVREKLEALGVPDGATFCEDFQVPGRGELSCLQDAIDHSAFIILLLTSNFDCRLSLHQVNQAMMSNLTRQGSPDCVIPFLPLESSPAQLSSDTASLLSGLVRLDEHSQIFARKVANTFKPHRLQARKAMWRKEQDTRALREQSQHLDGERMQAAALNAAYSAYLQSYLSYQAQMEQLQVAFGSHMSFGTGAPYGARMPFGGQVPLGAPPPFPTWPGCPQPPPLHAWQAGTPPPPSPQPAAFPQSLPFPQSPAFPTASPAPPQSPGLQPLIIHHAQMVQLGLNNHMWNQRGSQAPEDKTQEAE,mutated_sequence,1.0,712.0,UPI000000D72E.a2m,UPI000000D72E.npy,gnomAD
+UPI000006E083,UPI000006E083.csv,MSRRKQGNPQHLSQRELITPEADHVEAAILEEDEGLEIEEPSGLGLMVGGPDPDLLTCGQCQMNFPLGDILVFIEHKRKQCGGSLGACYDKALDKDSPPPSSRSELRKVSEPVEIGIQVTPDEDDHLLSPTKGICPKQENIAGPCRPAQLPAVAPIAASSHPHSSVITSPLRALGALPPCLPLPCCSARPVSGDGTQGEGQTEAPFGCQCQLSGKDEPSSYICTTCKQPFNSAWFLLQHAQNTHGFRIYLEPGPASSSLTPRLTIPPPLGPEAVAQSPLMNFLGDSNPFNLLRMTGPILRDHPGFGEGRLPGTPPLFSPPPRHHLDPHRLSAEEMGLVAQHPSAFDRVMRLNPMAIDSPAMDFSRRLRELAGNSSTPPPVSPGRGNPMHRLLNPFQPSPKSPFLSTPPLPPMPPGGTPPPQPPAKSKSCEFCGKTFKFQSNLIVHRRSHTGEKPYKCQLCDHACSQASKLKRHMKTHMHKAGSLAGRSDDGLSAASSPEPGTSELAGEGLKAADGDFRHHESDPSLGHEPEEEDEEEEEEEEELLLENESRPESSFSMDSELSRNRENGGGGVPGVPGAGGGAAKALADEKALVLGKVMENVGLGALPQYGELLADKQKRGAFLKRAAGGGDAGDDDDAGGCGDAGAGGAVNGRGGGFAPGTEPFPGLFPRKPAPLPSPGLNSAAKRIKVEKDLELPPAALIPSENVYSQWLVGYAASRHFMKDPFLGFTDARQSPFATSSEHSSENGSLRFSTPPGDLLDGGLSGRSGTASGGSTPHLGGPGPGRPSSKEGRRSDTCEYCGKVFKNCSNLTVHRRSHTGERPYKCELCNYACAQSSKLTRHMKTHGQIGKEVYRCDICQMPFSVYSTLEKHMKKWHGEHLLTNDVKIEQAERS,mutated_sequence,1.0,894.0,UPI000006E083.a2m,UPI000006E083.npy,gnomAD
+UPI000004980B,UPI000004980B.csv,MADGAPRPQLYRSVSFKLLERWSGGPGLREEDTDTPGLRRRASCRPTTAARGQPSRRVSKLASGPLAAPAQPRPLRSLSPSVRQLSRRFDAPRLDDGSAGTRDGGVLPAAAEEAAEGPARGAWPSVTEMRKLFGGPGSRRPSADSESPGTPSPDGAAWEPPARESRQPPTPPPRTCFPLAGLRSARPLTGPETEGRLRRPQQQQERAQRPADGLHSWHIFSQPQAGARASCSSSSIAASYPVSRSRAASSSEEEEEGPPQLPGAQSPAYHGGHSSGSDDDRDGEGGHRWGGRPGLRPGSSLLDQDCRPDSDGLNLSSMNSAGVSGSPEPPTSPRAPREEGLREWGSGSPPCVPGPQEGLRPMSDSVGGAFRVAKVSFPSYLASPAGSRGSSRYSSTETLKDDDLWSSRGSGGWGVYRSPSFGAGEGLLRSQARTRAKGPGGTSRALRDGGFEPEKSRQRKSLSNPDIASETLTLLSFLRSDLSELRVRKPGGSSGDRGSNPLDGRDSPSAGGPVGQLEPIPIPAPASPGTRPTLKDLTATLRRAKSFTCSEKPMARRLPRTSALKSSSSELLLTGPGAEEDPLPLIVQDQYVQEARQVFEKIQRMGAQQDDGSDAPPGSPDWAGDVTRGQRSQEELSGPESSLTDEGIGADPEPPVAAFCGLGTTGMWRPLSSSSAQTNHHGPGTEDSLGGWALVSPETPPTPGALRRRRKVPPSGSGGSELSNGEAGEAYRSLSDPIPQRHRAATSEEPTGFSVDSNLLGSLSPKTGLPATSAMDEGLTSGHSDWSVGSEESKGYQEVIQSIVQGPGTLGRVVDDRIAGKAPKKKSLSDPSRRGELAGPGFEGPGGEPIREVEPMLPPSSSEPILVEQRAEPEEPGATRSRAQSERALPEALPPPATAHRNFHLDPKLADILSPRLIRRGSKKRPARSSHQELRRDEGSQDQTGSLSRARPSSRHVRHASVPATFMPIVVPEPPTSVGPPVAVPEPIGFPTRAHPTLQAPSLEDVTKQYMLNLHSGEVPAPVPVDMPCLPLAAPPSAEAKPPEAARPADEPTPASKCCSKPQVDMRKHVAMTLLDTEQSYVESLRTLMQGYMQPLKQPENSVLCDPSLVDEIFDQIPELLEHHEQFLEQVRHCMQTWHAQQKVGALLVQSFSKDVLVNIYSAYIDNFLNAKDAVRVAKEARPAFLKFLEQSMRENKEKQALSDLMIKPVQRIPRYELLVKDLLKHTPEDHPDHPLLLEAQRNIKQVAERINKGVRSAEEAERHARVLQEIEAHIEGMEDLQAPLRRFLRQEMVIEVKAIGGKKDRSLFLFTDLIVCTTLKRKSGSLRRSSMSLYTAASVIDTASKYKMLWKLPLEDADIIKGASQATNRENIQKAISRLDEDLTTLGQMSKLSESLGFPHQSLDDALRDLSAAMHRDLSEKQALCYALSFPPTKLELCATRPEGTDSYIFEFPHPDARLGFEQAFDEAKRKLASSKSCLDPEFLKAIPIMKTRSGMQFSCAAPTLNSCPEPSPEVWVCNSDGYVGQVCLLSLRAEPDVEACIAVCSARILCIGAVPGLQPRCHREPPPSLRSPPETAPEPAGPELDVEAAADEEAATLAEPGPQPCLHISIAGSGLEMTPGLGEGDPRPELVPFDSDSDDESSPSPSGTLQSQASRSTISSSFGNEETPSSKEATAETTSSEEEQEPGFLPLSGSFGPGGPCGTSPMDGRALRRSSHGSFTRGSLEDLLSVDPEAYQSSVWLGTEDGCVHVYQSSDSIRDRRNSMKLQHAASVTCILYLNNQVFVSLANGELVVYQREAGHFWDPQNFKSVTLGTQGSPITKMVSVGGRLWCGCQNRVLVLSPDTLQLEHMFYVGQDSSRCVACMVDSSLGVWVTLKGSAHVCLYHPDTFEQLAEVDVTPPVHRMLAGSDAIIRQHKAACLRITALLVCEELLWVGTSAGVVLTMPTSPGTVSCPRAPLSPTGLGQGHTGHVRFLAAVQLPDGFNLLCPTPPPPPDTGPEKLPSLEHRDSPWHRGPAPARPKMLVISGGDGYEDFRLSSGGGSSSETVGRDDSTNHLLLWRV,mutated_sequence,1.0,2063.0,UPI000004980B.a2m,UPI000004980B.npy,gnomAD
+UPI000013C68E,UPI000013C68E.csv,MSALCWGRGAAGLKRALRPCGRPGLPGKEGTAGGVCGPRRSSSASPQEQDQDRRKDWGHVELLEVLQARVRQLQAESVSEVVVNRVDVARLPECGSGDGSLQPPRKVQMGAKDATPVPCGRWAKILEKDKRTQQMRMQRLKAKLQMPFQSGEFKALTRRLQVEPRLLSKQMAGCLEDCTRQAPESPWEEQLARLLQEAPGKLSLDVEQAPSGQHSQAQLSGQQQRLLAFFKCCLLTDQLPLAHHLLVVHHGQRQKRKLLTLDMYNAVMLGWARQGAFKELVYVLFMVKDAGLTPDLLSYAAALQCMGRQDQDAGTIERCLEQMSQEGLKLQALFTAVLLSEEDRATVLKAVHKVKPTFSLPPQLPPPVNTSKLLRDVYAKDGRVSYPKLHLPLKTLQCLFEKQLHMELASRVCVVSVEKPTLPSKEVKHARKTLKTLRDQWEKALCRALRETKNRLEREVYEGRFSLYPFLCLLDEREVVRMLLQVLQALPAQGESFTTLARELSARTFSRHVVQRQRVSGQVQALQNHYRKYLCLLASDAEVPEPCLPRQYWEELGAPEALREQPWPLPVQMELGKLLAEMLVQATQMPCSLDKPHRSSRLVPVLYHVYSFRNVQQIGILKPHPAYVQLLEKAAEPTLTFEAVDVPMLCPPLPWTSPHSGAFLLSPTKLMRTVEGATQHQELLETCPPTALHGALDALTQLGNCAWRVNGRVLDLVLQLFQAKGCPQLGVPAPPSEAPQPPEAHLPHSAAPARKAELRRELAHCQKVAREMHSLRAEALYRLSLAQHLRDRVFWLPHNMDFRGRTYPCPPHFNHLGSDVARALLEFAQGRPLGPHGLDWLKIHLVNLTGLKKREPLRKRLAFAEEVMDDILDSADQPLTGRKWWMGAEEPWQTLACCMEVANAVRASDPAAYVSHLPVHQDGSCNGLQHYAALGRDSVGAASVNLEPSDVPQDVYSGVAAQVEVFRRQDAQRGMRVAQVLEGFITRKVVKQTVMTVVYGVTRYGGRLQIEKRLRELSDFPQEFVWEASHYLVRQVFKSLQEMFSGTRAIQHWLTESARLISHMGSVVEWVTPLGVPVIQPYRLDSKVKQIGGGIQSITYTHNGDISRKPNTRKQKNGFPPNFIHSLDSSHMMLTALHCYRKGLTFVSVHDCYWTHAADVSVMNQVCREQFVRLHSEPILQDLSRFLVKRFCSEPQKILEASQLKETLQAVPKPGAFDLEQVKRSTYFFS,mutated_sequence,1.0,1230.0,UPI000013C68E.a2m,UPI000013C68E.npy,gnomAD
+UPI0000041E80,UPI0000041E80.csv,MERGNQTEVGNFLLLGFAEDSDMQLLLHGLFLSMYLVTIIGNLLIILTISSDSHLHTPMYFFLSNLSFADICFTSTTVPKMLVNIQTQSKMITFAGCLTQIFFFIAFGCLDNLLLTMTAYDRFVAICYPLHYTVIMNPRLCGLLVLGSWCISVMGSLLETLTILRLSFCTNMEIPHFFCDPSEVLKLACSDTFINNIVMYFVTIVLGVFPLCGILFSYSQIFSSVLRVSARGQHKAFSTCGSHLSVVSLFYGTGLGVYLSSAVTPPSRTSLAASVMYTMVTPMLNPFIYSLRNKDMKGSLGRLLLRATSLKEGTIAKLS,mutated_sequence,1.0,319.0,UPI0000041E80.a2m,UPI0000041E80.npy,gnomAD
+UPI0000073D24,UPI0000073D24.csv,MAVVIRLQGLPIVAGTMDIRHFFSGLTIPDGGVHIVGGELGEAFIVFATDEDARLGMMRTGGTIKGSKVTLLLSSKTEMQNMIELSRRRFETANLDIPPANASRSGPPPSSGMSSRVNLPTTVSNFNNPSPSVVTATTSVHESNKNIQTFSTASVGTAPPNMGASFGSPTFSSTVPSTASPMNTVPPPPIPPIPAMPSLPPMPSIPPIPVPPPVPTLPPVPPVPPIPPVPSVPPMTPLPPMSGMPPLNPPPVAPLPAGMNGSGAPMNLNNNLNPMFLGPLNPVNPIQMNSQSSVKPLPINPDDLYVSVHGMPFSAMENDVRDFFHGLRVDAVHLLKDHVGRNNGNGLVKFLSPQDTFEALKRNRMLMIQRYVEVSPATERQWVAAGGHITFKQNMGPSGQTHPPPQTLPRSKSPSGQKRSRSRSPHEAGFCVYLKGLPFEAENKHVIDFFKKLDIVEDSIYIAYGPNGKATGEGFVEFRNEADYKAALCRHKQYMGNRFIQVHPITKKGMLEKIDMIRKRLQNFSYDQREMILNPEGDVNSAKVCAHITNIPFSITKMDVLQFLEGIPVDENAVHVLVDNNGQGLGQALVQFKNEDDARKSERLHRKKLNGREAFVHVVTLEDMREIEKNPPAQGKKGLKMPVPGNPAVPGMPNAGLPGVGLPSAGLPGAGLPSTGLPGSAITSAGLPGAGMPSAGIPSAGGEEHAFLTVGSKEANNGPPFNFPGNFGGSNAFGPPIPPPGLGGGAFGDARPGMPSVGNSGLPGLGLDVPGFGGGPNNLSGPSGFGGGPQNFGNGPGSLGGPPGFGSGPPGLGSAPGHLGGPPAFGPGPGPGPGPGPIHIGGPPGFASSSGKPGPTVIKVQNMPFTVSIDEILDFFYGYQVIPGSVCLKYNEKGMPTGEAMVAFESRDEATAAVIDLNDRPIGSRKVKLVLG,mutated_sequence,1.0,932.0,UPI0000073D24.a2m,UPI0000073D24.npy,gnomAD
+UPI000012FE45,UPI000012FE45.csv,MSGLGENLDPLASDSRKRKLPCDTPGQGLTCSGEKRRREQESKYIEELAELISANLSDIDNFNVKPDKCAILKETVRQIRQIKEQGKTISNDDDVQKADVSSTGQGVIDKDSLGPLLLQALDGFLFVVNRDGNIVFVSENVTQYLQYKQEDLVNTSVYNILHEEDRKDFLKNLPKSTVNGVSWTNETQRQKSHTFNCRMLMKTPHDILEDINASPEMRQRYETMQCFALSQPRAMMEEGEDLQSCMICVARRITTGERTFPSNPESFITRHDLSGKVVNIDTNSLRSSMRPGFEDIIRRCIQRFFSLNDGQSWSQKRHYQEAYLNGHAETPVYRFSLADGTIVTAQTKSKLFRNPVTNDRHGFVSTHFLQREQNGYRPNPNPVGQGIRPPMAGCNSSVGGMSMSPNQGLQMPSSRAYGLADPSTTGQMSGARYGGSSNIASLTPGPGMQSPSSYQNNNYGLNMSSPPHGSPGLAPNQQNIMISPRNRGSPKIASHQFSPVAGVHSPMASSGNTGNHSFSSSSLSALQAISEGVGTSLLSTLSSPGPKLDNSPNMNITQPSKVSNQDSKSPLGFYCDQNPVESSMCQSNSRDHLSDKESKESSVEGAENQRGPLESKGHKKLLQLLTCSSDDRGHSSLTNSPLDSSCKESSVSVTSPSGVSSSTSGGVSSTSNMHGSLLQEKHRILHKLLQNGNSPAEVAKITAEATGKDTSSITSCGDGNVVKQEQLSPKKKENNALLRYLLDRDDPSDALSKELQPQVEGVDNKMSQCTSSTIPSSSQEKDPKIKTETSEEGSGDLDNLDAILGDLTSSDFYNNSISSNGSHLGTKQQVFQGTNSLGLKSSQSVQSIRPPYNRAVSLDSPVSVGSSPPVKNISAFPMLPKQPMLGGNPRMMDSQENYGSSMGGPNRNVTVTQTPSSGDWGLPNSKAGRMEPMNSNSMGRPGGDYNTSLPRPALGGSIPTLPLRSNSIPGARPVLQQQQQMLQMRPGEIPMGMGANPYGQAAASNQLGSWPDGMLSMEQVSHGTQNRPLLRNSLDDLVGPPSNLEGQSDERALLDQLHTLLSNTDATGLEEIDRALGIPELVNQGQALEPKQDAFQGQEAAVMMDQKAGLYGQTYPAQGPPMQGGFHLQGQSPSFNSMMNQMNQQGNFPLQGMHPRANIMRPRTNTPKQLRMQLQQRLQGQQFLNQSRQALELKMENPTAGGAAVMRPMMQPQVSSQQGFLNAQMVAQRSRELLSHHFRQQRVAMMMQQQQQQQQQQQQQQQQQQQQQQQQQQQQQTQAFSPPPNVTASPSMDGLLAGPTMPQAPPQQFPYQPNYGMGQQPDPAFGRVSSPPNAMMSSRMGPSQNPMMQHPQAASIYQSSEMKGWPSGNLARNSSFSQQQFAHQGNPAVYSMVHMNGSSGHMGQMNMNPMPMSGMPMGPDQKYC,mutated_sequence,1.0,1424.0,UPI000012FE45.a2m,UPI000012FE45.npy,gnomAD
+UPI00001A8223,UPI00001A8223.csv,MYLSLWVTINTVNLRNTLSGLRGAVTTVGMIKSDVPGTQEWLDERRRQGDLPLPTNSNPVLSLELCDPGQGPAPFQAVVVLIQPGRGLALRPPPSCLFPPDPTPSPPAGQIRVKPDRTGVVTDGVKHSMNPFCEIAVEEAVRLKEKKLVKEVIAVSCGPAQCQETIRTALAMGADRGIHVEVPPAEAERLGPLQVARVLAKLAEKEKVDLVLLGKQAIDDDCNQTGQMTAGFLDWPQGTFASQVTLEGDKLKVEREIDGGLETLRLKLPAVVTADLRLNEPRYATLPNIMKAKKKKIEVIKPGDLGVDLTSKLSVISVEDPPQRTAGVKVETTEDLVAKLKEIGRI,mutated_sequence,1.0,346.0,UPI00001A8223.a2m,UPI00001A8223.npy,gnomAD
+UPI00001612A6,UPI00001612A6.csv,MPEPGPRMNGFSLGELCWLFCCPPCPSRIAAKLAFLPPEPTYTVLAPEQRGAGASAPAPAQATAAAAAAQPAPQQPEEGAGAGPGACSLHLSERADWQYSQRELDAVEVFFSRTARDNRLGCMFVRCAPSSRYTLLFSHGNAVDLGQMCSFYIGLGSRINCNIFSYDYSGYGVSSGKPSEKNLYADIDAAWQALRTRYGVSPENIILYGQSIGTVPTVDLASRYECAAVILHSPLMSGLRVAFPDTRKTYCFDAFPSIDKISKVTSPVLVIHGTEDEVIDFSHGLAMYERCPRAVEPLWVEGAGHNDIELYAQYLERLKQFISHELPNS,mutated_sequence,1.0,329.0,UPI00001612A6.a2m,UPI00001612A6.npy,gnomAD
+UPI000006D9E4,UPI000006D9E4.csv,MEGSGEQPGPQPQHPGDHRIRDGDFVVLKREDVFKAVQVQRRKKVTFEKQWFYLDNVIGHSYGTAFEVTSGGSLQPKKKREEPTAETKEAGTDNRNIVDDGKSQKLTQDDIKALKDKGIKGEEIVQQLIENSTTFRDKTEFAQDKYIKKKKKKYEAIITVVKPSTRILSIMYYAREPGKINHMRYDTLAQMLTLGNIRAGNKMIVMETCAGLVLGAMMERMGGFGSIIQLYPGGGPVRAATACFGFPKSFLSGLYEFPLNKVDSLLHGTFSAKMLSSEPKDSALVEESNGTLEEKQASEQENEDSMAEAPESNHPEDQETMETISQDPEHKGPKERGSKKDYIQEKQRRQEEQRKRHLEAAALLSERNADGLIVASRFHPTPLLLSLLDFVAPSRPFVVYCQYKEPLLECYTKLRERGGVINLRLSETWLRNYQVLPDRSHPKLLMSGGGGYLLSGFTVAMDNLKADTSLKSNASTLESHETEEPAAKKRKCPESDS,mutated_sequence,1.0,497.0,UPI000006D9E4.a2m,UPI000006D9E4.npy,gnomAD
+UPI0000047F3B,UPI0000047F3B.csv,MEWGYLLEVTSLLAALALLQRSSGAAAASAKELACQEITVPLCKGIGYNYTYMPNQFNHDTQDEAGLEVHQFWPLVEIQCSPDLKFFLCSMYTPICLEDYKKPLPPCRSVCERAKAGCAPLMRQYGFAWPDRMRCDRLPEQGNPDTLCMDYNRTDLTTAAPSPPRRLPPPPPGEQPPSGSGHGRPPGARPPHRGGGRGGGGGDAAAPPARGGGGGGKARPPGGGAAPCEPGCQCRAPMVSVSSERHPLYNRVKTGQIANCALPCHNPFFSQDERAFTVFWIGLWSVLCFVSTFATVSTFLIDMERFKYPERPIIFLSACYLFVSVGYLVRLVAGHEKVACSGGAPGAGGAGGAGGAAAGAGAAGAGAGGPGGRGEYEELGAVEQHVRYETTGPALCTVVFLLVYFFGMASSIWWVILSLTWFLAAGMKWGNEAIAGYSQYFHLAAWLVPSVKSIAVLALSSVDGDPVAGICYVGNQSLDNLRGFVLAPLVIYLFIGTMFLLAGFVSLFRIRSVIKQQDGPTKTHKLEKLMIRLGLFTVLYTVPAAVVVACLFYEQHNRPRWEATHNCPCLRDLQPDQARRPDYAVFMLKYFMCLVVGITSGVWVWSGKTLESWRSLCTRCCWASKGAAVGGGAGATAAGGGGGPGGGGGGGPGGGGGPGGGGGSLYSDVSTGLTWRSGTASSVSYPKQMPLSQV,mutated_sequence,1.0,694.0,UPI0000047F3B.a2m,UPI0000047F3B.npy,gnomAD
+UPI000020B8EF,UPI000020B8EF.csv,MTLQELVHKAASCYMDRVAVCFDECNNQLPVYYTYKTVVNAASELSNFLLLHCDFQGIREIGLYCQPGIDLPSWILGILQVPAAYVPIEPDSPPSLSTHFMKKCNLKYILVEKKQINKFKSFHETLLNYDTFTVEHNDLVLFRLHWKNTEVNLMLNDGKEKYEKEKIKSISSEHVNEEKAEEHMDLRLKHCLAYVLHTSGTTGIPKIVRVPHKCIVPNIQHFRVLFDITQEDVLFLASPLTFDPSVVEIFLALSSGASLLIVPTSVKLLPSKLASVLFSHHRVTVLQATPTLLRRFGSQLIKSTVLSATTSLRVLALGGEAFPSLTVLRSWRGEGNKTQIFNVYGITEVSSWATIYRIPEKTLNSTLKCELPVQLGFPLLGTVVEVRDTNGFTIQEGSGQVFLGGRNRVCFLDDEVTVPLGTMRATGDFVTVKDGEIFFLGRKDSQIKRHGKRLNIELVQQVAEELQQVESCAVTWYNQEKLILFMVSKDASVKEYIFKELQKYLPSHAVPDELVLIDSLPFTSHGKIDVSELNKIYLNYINLKSENKLSGKEDLWEKLQYLWKSTLNLPEDLLRVPDESLFLNSGGDSLKSIRLLSEIEKLVGTSVPGLLEIILSSSILEIYNHILQTVVPDEDVTFRKSCATKRKLSDINQEEASGTSLHQKAIMTFTCHNEINAFVVLSRGSQILSLNSTRFLTKLGHCSSACPSDSVSQTNIQNLKGLNSPVLIGKSKDPSCVAKVSEEGKPAIGTQKMELHVRWRSDTGKCVDASPLVVIPTFDKSSTTVYIGSHSHRMKAVDFYSGKVKWEQILGDRIESSACVSKCGNFIVVGCYNGLVYVLKSNSGEKYWMFTTEDAVKSSATMDPTTGLIYIGSHDQHAYALDIYRKKCVWKSKCGGTVFSSPCLNLIPHHLYFATLGGLLLAVNPATGNVIWKHSCGKPLFSSPQCCSQYICIGCVDGNLLCFTHFGEQVWQFSTSGPIFSSPCTSPSEQKIFFGSHDCFIYCCNMKGHLQWKFETTSRVYATPFAFHNYNGSNEMLLAAASTDGKVWILESQSGQLQSVYELPGEVFSSPVVLESMLIIGCRDNYVYCLDLLGGNQK,mutated_sequence,1.0,1098.0,UPI000020B8EF.a2m,UPI000020B8EF.npy,gnomAD
+UPI0000137078,UPI0000137078.csv,MRGHPSLLLLYMALTTCLDTSPSEETDQEVFLGPPEAQSFLSSHTRIPRANHWDLELLTPGNLERECLEERCSWEEAREYFEDNTLTERFWESYIYNGKGGRGRVDVASLAVGLTGGILLIVLAGLGAFWYLRWRQHRGQQPCPQEAGLISPLSPLNPLGPPTPLPPPPPPPPGLPTYEQALAASGVHDAPPPPYTSLRRPH,mutated_sequence,1.0,202.0,UPI0000137078.a2m,UPI0000137078.npy,gnomAD
+UPI0000246C98,UPI0000246C98.csv,MSHQVKGLKEEARGGVKGRVKSGSPHTGDRLGRRSSSKRALKAEGTPGRRGAQRSQKERAGGSPSPGSPRRKQTGRRRHREELGEQERGEAERTCEGRRKRDERASFQERTAAPKREKEIPRREEKSKRQKKPRSSSLASSASGGESLSEEELAQILEQVEEKKKLIATMRSKPWPMAKKLTELREAQEFVEKYEGALGKGKGKQLYAYKMLMAKKWVKFKRDFDNFKTQCIPWEMKIKDIESHFGSSVASYFIFLRWMYGVNLVLFGLIFGLVIIPEVLMGMPYGSIPRKTVPRAEEEKAMDFSVLWDFEGYIKYSALFYGYYNNQRTIGWLRYRLPMAYFMVGVSVFGYSLIIVIRSMASNTQGSTGEGESDNFTFSFKMFTSWDYLIGNSETADNKYASITTSFKESIVDEQESNKEENIHLTRFLRVLANFLIICCLCGSGYLIYFVVKRSQQFSKMQNVSWYERNEVEIVMSLLGMFCPPLFETIAALENYHPRTGLKWQLGRIFALFLGNLYTFLLALMDDVHLKLANEETIKNITHWTLFNYYNSSGWNESVPRPPLHPADVPRGSCWETAVGIEFMRLTVSDMLVTYITILLGDFLRACFVRFMNYCWCWDLEAGFPSYAEFDISGNVLGLIFNQGMIWMGSFYAPGLVGINVLRLLTSMYFQCWAVMSSNVPHERVFKASRSNNFYMGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFSGKNRMYDVLQETIENDFPTFLGKIFAFLANPGLIIPAILLMFLAIYYLNSVSKSLSRANAQLRKKIQVLREVEKSHKSVKGKATARDSEDTPKSSSKNATQLQLTKEETTPPSASQSQAMDKKAQGPGTSNSASRTTLPASGHLPISRPPGIGPDSGHAPSQTHPWRSASGKSAQRPPH,mutated_sequence,1.0,906.0,UPI0000246C98.a2m,UPI0000246C98.npy,gnomAD
+UPI00001D82A6,UPI00001D82A6.csv,MHQTLCLNPESLKMSACSDFVEHIWKPGSCKNCFCLRSDHQLVAGPPQPRAGSLPPPPRLPPRPENCRLEDEGVNSSPYSKPTIAVKPTMMSSEASDVWTEANLSAEVSQVIWRRAPGKLPLPKQEDAPVVYLGSFRGVQKPAGPSTSPDGNSRCPPAYTMVGLHNLEPRGERNIAFHPVSFPEEKAVHKEKPSFPYQDRPSTQESFRQKLAAFAGTTSGCHQGPGPLRESLPSEDDSDQRCSPSGDSEGGEYCSILDCCPGSPVAKAASQTAGSRGRHGGRDCSPTCWEQGKCSGPAEQEKRGPSFPKECCSQGPTAHPSCLGPKKLSLTSEAAISSDGLSCGSGSGSGASSPFVPHLESDYCSLMKEPAPEKQQDPGCPGVTPSRCLGLTGEPQPPAHPREATQPEPIYAESTKRKKAAPVPSKSQAKIEHAAAAQGQGQVCTGNAWAQKAASGWGRDSPDPTPQVSATITVMAAHPEEDHRTIYLSSPDSAVGVQWPRGPVSQNSEVGEEETSAGQGLSSRESHAHSASESKPKERPAIPPKLSKSSPVGSPVSPSAGGPPVSPLADLSDGSSGGSSIGPQPPSQGPADPAPSCRTNGVAISDPSRCPQPAASSASEQRRPRFQAGTWSRQCRIEEEEEVEQELLSHSWGRETKNGPTDHSNSTTWHRLHPTDGSSGQNSKVGTGMSKSASFAFEFPKDRSGIETFSPPPPPPKSRHLLKMNKSSSDLEKVSQGSAESLSPSFRGVHVSFTTGSTDSLASDSRTCSDGGPSSELAHSPTNSGKKLFAPVPFPSGSTEDVSPSGPQQPPPLPQKKIVSRAASSPDGFFWTQGSPKPGTASPKLNLSHSETNVHDESHFSYSLSPGNRHHPVFSSSDPLEKAFKGSGHWLPAAGLAGNRGGCGSPGLQCKGAPSASSSQLSVSSQASTGSTQLQLHGLLSNISSKEGTYAKLGGLYTQSLARLVAKCEDLFMGGQKKELHFNENNWSLFKLTCNKPCCDSGDAIYYCATCSEDPGSTYAVKICKAPEPKTVSYCSPSVPVHFNIQQDCGHFVASVPSSMLSSPDAPKDPVPALPTHPPAQEQDCVVVITREVPHQTASDFVRDSAASHQAEPEAYERRVCFLLLQLCNGLEHLKEHGIIHRDLCLENLLLVHCTLQAGPGPAPAPAPAPAAAAPPCSSAAPPAGGTLSPAAGPASPEGPREKQLPRLIISNFLKAKQKPGGTPNLQQKKSQARLAPEIVSASQYRKFDEFQTGILIYELLHQPNPFEVRAQLRERDYRQEDLPPLPALSLYSPGLQQLAHLLLEADPIKRIRIGEAKRVLQCLLWGPRRELVQQPGTSEEALCGTLHNWIDMKRALMMMKFAEKAVDRRRGVELEDWLCCQYLASAEPGALLQSLKLLQLL,mutated_sequence,1.0,1402.0,UPI00001D82A6.a2m,UPI00001D82A6.npy,gnomAD
+UPI0001533DAA,UPI0001533DAA.csv,MANCSQEELDEEFEQFMKELSDDSFENSDKTARQSKKEMKKKDTVPWWITEDDFKDDGLLGTNVSYLKTKKTSQPVMEIEEESAEKIQFLKSSGTSLLSTDSLETNELVVSELNHSSLGVGLDTLEEQEEKEQFFARLEKGLTSSIDYSRLNKELDSNDSTHFKALHSNQANAELTDDEHENESKHEELAENYSDDFEDEYVGAPLTTKDEEMPSKENSKSEKISVPKQEEEKTGMLANVVLLDSLDSVAEVNLDEQDKITPKPRCLPEMTENEMTGTGVSYGQSSSDVEALHQAYCHIAHSLGDEDKQKIESNTVEDIKSSVKGHPQENEENSKNISTMESDLPTVEELMKPIRIDSFGISGFDLQPVSSEKVAERKETEFFSSLPLKMNPNILSQDSQHVNLFFDKNDENVILQKTTNESMENSCPQVTEVTATEEHVDKMYLNILRKKITVNSSSLSQDDKINKTYRSQLSSEEEGAVMGKQVPYKKARSAPPLLKRKPQSGLYASVRSSGYGKPSSPLKMFSTLEKKTSEDIIKSKNLRSISTSNQPRKKEILSGTKLIKPAALDKPAHKTESCLSTRKKSENPTETDSCIQFQTDSLGYCGENKEKKLLMFKRVQEAEDKWRGAQALIEQIKATFSEKEKELENKLEELKKQQEKELFKLNQDNYILQAKLSSFEETNKKQRWLHFGEAADPVTGEKLKQIQKEIQEQETLLQGYQQENERLYNQVKDLQEQNKKNEERMFKENQSLFSEVASLKEQMHKSRFLSQVVEDSEPTRNQNFTDLLAELRMAQKEKDSLLEDIKRLKQDKQALEVDFEKMKKERDQAKDQIAYVTGEKLYEIKILEETHKQEISRLQKRLQWYAENQELLDKDALRLREANEEIEKLKLEIEKLKAESGNPSIRQKIRLKDKAADAKKIQDLERQVKEMEGILKRRYPNSLPALILAASAAGDTVDKNTVEFMEKRIKKLEADLEGKDEDAKKSLRTMEQQFQKMKIQYEQRLEQQEQLLACKLNQHDSPRIKALEKELDDIKEAHQITVRNLEAEIDVLKHQNAELDVKKNDKDDEDFQSIEFQVEQAHAKAKLVRLNEELAAKKREIQDLSKTVERLQKDRRMMLSNQNSKGREEMSAKRAKKDVLHSSKGNANSFPGTLDSKLYQPHTFTDSHVSEVLQENYRLKNELEGLISEKNELKMKSEAVMNQFENSMRRVKEDTAAHIASLKASHQREIEKLLCQNAVENSSSKVAELNRKIATQEVLIRHFQSQVNELQSKQESLVVSEVREEILQKEITKLLEELREAKENHTPEMKHFVGLEKKIKQMEMRHAQREQELQQIIQQTHQVVETEQNKEVEKWKRLAQLKNRELEKFRTELDSILDVLRELHRQGVVVPVAFADEMNAPEY,mutated_sequence,1.0,1403.0,UPI0001533DAA.a2m,UPI0001533DAA.npy,gnomAD
+UPI00000398BB,UPI00000398BB.csv,MPIVDKLKEALKPGRKDSADDGELGKLLASSAKKVLLQKIEFEPASKSFSYQLEALKSKYVLLNPKTEGASRHKSGDDPPARRQGSEHTYESCGDGVPAPQKVLFPTERLSLRWERVFRVGAGLHNLGNTCFLNATIQCLTYTPPLANYLLSKEHARSCHQGSFCMLCVMQNHIVQAFANSGNAIKPVSFIRDLKKIARHFRFGNQEDAHEFLRYTIDAMQKACLNGCAKLDRQTQATTLVHQIFGGYLRSRVKCSVCKSVSDTYDPYLDVALEIRQAANIVRALELFVKADVLSGENAYMCAKCKKKVPASKRFTIHRTSNVLTLSLKRFANFSGGKITKDVGYPEFLNIRPYMSQNNGDPVMYGLYAVLVHSGYSCHAGHYYCYVKASNGQWYQMNDSLVHSSNVKVVLNQQAYVLFYLRIPGSKKSPEGLISRTGSSSLPGRPSVIPDHSKKNIGNGIISSPLTGKRQDSGTMKKPHTTEEIGVPISRNGSTLGLKSQNGCIPPKLPSGSPSPKLSQTPTHMPTILDDPGKKVKKPAPPQHFSPRTAQGLPGTSNSNSSRSGSQRQGSWDSRDVVLSTSPKLLATATANGHGLKGNDESAGLDRRGSSSSSPEHSASSDSTKAPQTPRSGAAHLCDSQETNCSTAGHSKTPPSGADSKTVKLKSPVLSNTTTEPASTMSPPPAKKLALSAKKASTLWRATGNDLRPPPPSPSSDLTHPMKTSHPVVASTWPVHRARAVSPAPQSSSRLQPPFSPHPTLLSSTPKPPGTSEPRSCSSISTALPQVNEDLVSLPHQLPEASEPPQSPSEKRKKTFVGEPQRLGSETRLPQHIREATAAPHGKRKRKKKKRPEDTAASALQEGQTQRQPGSPMYRREGQAQLPAVRRQEDGTQPQVNGQQVGCVTDGHHASSRKRRRKGAEGLGEEGGLHQDPLRHSCSPMGDGDPEAMEESPRKKKKKKRKQETQRAVEEDGHLKCPRSAKPQDAVVPESSSCAPSANGWCPGDRMGLSQAPPVSWNGERESDVVQELLKYSSDKAYGRKVLTWDGKMSAVSQDAIEDSRQARTETVVDDWDEEFDRGKEKKIKKFKREKRRNFNAFQKLQTRRNFWSVTHPAKAASLSYRR,mutated_sequence,1.0,1123.0,UPI00000398BB.a2m,UPI00000398BB.npy,gnomAD
+UPI0001E8F37E,UPI0001E8F37E.csv,MLQWGLDHSPRPALQEFAPSATGLAQHTGCLVRRLPTAPFLNSVGIETHLLFLCLSLPDPRPFS,mutated_sequence,,,UPI0001E8F37E.a2m,UPI0001E8F37E.npy,gnomAD
+UPI0000190820,UPI0000190820.csv,MDSTKEKCDSYKDDLLLRMGLNDNKAGMEGLDKEKINKIIMEATKGSRFYGNELKKEKQVNQRIENMMQQKAQITSQQLRKAQLQVDRFAMELEQSRNLSNTIVHIDMDAFYAAVEMRDNPELKDKPIAVGSMSMLSTSNYHARRFGVRAAMPGFIAKRLCPQLIIVPPNFDKYRAVSKEVKEILADYDPNFMAMSLDEAYLNITKHLEERQNWPEDKRRYFIKMGSSVENDNPGKEVNKLSEHERSISPLLFEESPSDVQPPGDPFQVNFEEQNNPQILQNSVVFGTSAQEVVKEIRFRIEQKTTLTASAGIAPNTMLAKVCSDKNKPNGQYQILPNRQAVMDFIKDLPIRKVSGIGKVTEKMLKALGIITCTELYQQRALLSLLFSETSWHYFLHISLGLGSTHLTRDGERKSMSVERTFSEINKAEEQYSLCQELCSELAQDLQKERLKVLYFDMVSLVFKFFNSKMLP,mutated_sequence,1.0,472.0,UPI0000190820.a2m,UPI0000190820.npy,gnomAD
+UPI000013DE67,UPI000013DE67.csv,MENDPSRRRESISLTPVAKGLENMGADFLESLEEGQLPRSDLSPAEIRSSWSEAAPKPFSRWRNLQPALRARSFCREHMQLFRWIGTGLLCTGLSAFLLVACLLDFQRALALFVLTCVVLTFLGHRLLKRLLGPKLRRFLKPQGHPRLLLWFKRGLALAAFLGLVLWLSLDTSQRPEQLVSFAGICVFVALLFACSKHHCAVSWRAVSWGLGLQFVLGLLVIRTEPGFIAFEWLGEQIRIFLSYTKAGSSFVFGEALVKDVFAFQVLPIIVFFSCVISVLYHVGLMQWVILKIAWLMQVTMGTTATETLSVAGNIFVSQTEAPLLIRPYLADMTLSEVHVVMTGGYATIAGSLLGAYISFGIDATSLIAASVMAAPCALALSKLVYPEVEESKFRREEGVKLTYGDAQNLIEAASTGAAISVKVVANIAANLIAFLAVLDFINAALSWLGDMVDIQGLSFQLICSYILRPVAFLMGVAWEDCPVVAELLGIKLFLNEFVAYQDLSKYKQRRLAGAEEWVGDRKQWISVRAEVLTTFALCGFANFSSIGIMLGGLTSMVPQRKSDFSQIVLRALFTGACVSLVNACMAGILYMPRGAEVDCMSLLNTTLSSSSFEIYQCCREAFQSVNPEFSPEALDNCCRFYNHTICAQ,mutated_sequence,1.0,649.0,UPI000013DE67.a2m,UPI000013DE67.npy,gnomAD
+UPI0000073FE5,UPI0000073FE5.csv,MTLRAAVFDLDGVLALPAVFGVLGRTEEALALPRGLLNDAFQKGGPEGATTRLMKGEITLSQWIPLMEENCRKCSETAKVCLPKNFSIKEIFDKAISARKINRPMLQAALMLRKKGFTTAILTNTWLDDRAERDGLAQLMCELKMHFDFLIESCQVGMVKPEPQIYKFLLDTLKASPSEVVFLDDIGANLKPARDLGMVTILVQDTDTALKELEKVTGIQLLNTPAPLPTSCNPSDMSHGYVTVKPRVRLHFVELGSGPAVCLCHGFPESWYSWRYQIPALAQAGYRVLAMDMKGYGESSAPPEIEEYCMEVLCKEMVTFLDKLGLSQAVFIGHDWGGMLVWYMALFYPERVRAVASLNTPFIPANPNMSPLESIKANPVFDYQLYFQEPGVAEAELEQNLSRTFKSLFRASDESVLSMHKVCEAGGLFVNSPEEPSLSRMVTEEEIQFYVQQFKKSGFRGPLNWYRNMERNWKWACKSLGRKILIPALMVTAEKDFVLVPQMSQHMEDWIPHLKRGHIEDCGHWTQMDKPTEVNQILIKWLDSDARNPPVVSKM,mutated_sequence,1.0,555.0,UPI0000073FE5.a2m,UPI0000073FE5.npy,gnomAD
+UPI0000073BBA,UPI0000073BBA.csv,MAAYPESCVDTTVLDFVADLSLASPRRPLLCDFAPGVSLGDPALALREGRPRRMARFEEGDPEEEECEVDQGDGEEEEEEERGRGVSLLGRPKRKRVITYAQRQAANIRERKRMFNLNEAFDQLRRKVPTFAYEKRLSRIETLRLAIVYISFMTELLESCEKKESG,mutated_sequence,1.0,166.0,UPI0000073BBA.a2m,UPI0000073BBA.npy,gnomAD
+UPI000041512B,UPI000041512B.csv,MQCLAAALKDETNMSGGGEQADILPANYVVKDRWKVLKKIGGGGFGEIYEAMDLLTRENVALKVESAQQPKQVLKMEVAVLKKLQGKDHVCRFIGCGRNEKFNYVVMQLQGRNLADLRRSQPRGTFTLSTTLRLGKQILESIEAIHSVGFLHRDIKPSNFAMGRLPSTYRKCYMLDFGLARQYTNTTGDVRPPRNVAGFRGTVRYASVNAHKNREMGRHDDLWSLFYMLVEFAVGQLPWRKIKDKEQVGMIKEKYEHRMLLKHMPSEFHLFLDHIASLDYFTKPDYQLIMSVFENSMKERGIAENEAFDWEKAGTDALLSTSTSTPPQQNTRQTAAMFGVVNVTPVPGDLLRENTEDVLQGEHLSDQENAPPILPGRPSEGLGPSPHLVPHPGGPEAEVWEETDVNRNKLRINIGKSPCVEEEQSRGMGVPSSPVRAPPDSPTTPVRSLRYRRVNSPESERLSTADGRVELPERRSRMDLPGSPSRQACSSQPAQMLSVDTGHADRQASGRMDVSASVEQEALSNAFRSVPLAEEEDFDSKEWVIIDKETELKDFPPGAEPSTSGTTDEEPEELRPLPEEGEERRRLGAEPTVRPRGRSMQALAEEDLQHLPPQPLPPQLSQGDGRSETSQPPTPGSPSHSPLHSGPRPRRRESDPTGPQRQVFSVAPPFEVNGLPRAVPLSLPYQDFKRDLSDYRERARLLNRVRRVGFSHMLLTTPQVPLAPVQPQANGKEEEEEEEEDEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEAAAAVALGEVLGPRSGSSSEGSERSTDRSQEGAPSTLLADDQKESRGRASMADGDLEPEEGSKTLVLVSPGDMKKSPVTAELAPDPDLGTLAALTPQHERPQPTGSQLDVSEPGTLSSVLKSEPKPPGPGAGLGAGTVTTGVGGVAVTSSPFTKVERTFVHIAEKTHLNVMSSGGQALRSEEFSAGGELGLELASDGGAVEEGARAPLENGLALSGLNGAEIEGSALSGAPRETPSEMATNSLPNGPALADGPAPVSPLEPSPEKVATISPRRHAMPGSRPRSRIPVLLSEEDTGSEPSGSLSAKERWSKRARPQQDLARLVMEKRQGRLLLRLASGASSSSSEEQRRASETLSGTGSEEDTPASEPAAALPRKSGRAAATRSRIPRPIGLRMPMPVAAQQPASRSHGAAPALDTAITSRLQLQTPPGSATAADLRPKQPPGRGLGPGRAQAGARPPAPRSPRLPASTSAARNASASPRSQSLSRRESPSPSHQARPGVPPPRGVPPARAQPDGTPSPGGSKKGPRGKLQAQRATTKGRAGGAEGRAGAR,mutated_sequence,1.0,1321.0,UPI000041512B.a2m,UPI000041512B.npy,gnomAD
+UPI000013EA1F,UPI000013EA1F.csv,MSSEQKSQHCKPEEGVEAQEEALGLVGAQAPTTEEQEAAVSSSSPLVPGTLEEVPAAESAGPPQSPQGASALPTTISFTCWRQPNEGSSSQEEEGPSTSPDAESLFREALSNKVDELAHFLLRKYRAKELVTKAEMLERVIKNYKRCFPVIFGKASESLKMIFGIDVKEVDPASNTYTLVTCLGLSYDGLLGNNQIFPKTGLLIIVLGTIAMEGDSASEEEIWEELGVMGVYDGREHTVYGEPRKLLTQDWVQENYLEYRQVPGSNPARYEFLWGPRALAETSYVKVLEHVVRVNARVRIAYPSLREAALLEEEEGV,mutated_sequence,1.0,317.0,UPI000013EA1F.a2m,UPI000013EA1F.npy,gnomAD
+UPI0000072D52,UPI0000072D52.csv,MAGYATTPSPMQTLQEEAVCAICLDYFKDPVSISCGHNFCRGCVTQLWSKEDEEDQNEEEDEWEEEEDEEAVGAMDGWDGSIREVLYRGNADEELFQDQDDDELWLGDSGITNWDNVDYMWDEEEEEEEEDQDYYLGGLRPDLRIDVYREEEILEAYDEDEDEELYPDIHPPPSLPLPGQFTCPQCRKSFTRRSFRPNLQLANMVQIIRQMCPTPYRGNRSNDQGMCFKHQEALKLFCEVDKEAICVVCRESRSHKQHSVLPLEEVVQEYQEIKLETTLVGILQIEQESIHSKAYNQ,mutated_sequence,1.0,297.0,UPI0000072D52.a2m,UPI0000072D52.npy,gnomAD
+UPI00001FEDD8,UPI00001FEDD8.csv,MASPDRSKRKILKAKKTMPLSCRKQVEMLNKSRNVEALKTAIGSNVPSGNQSFSPSVITRTTEITKCSPSENGASSLDSNKNSISEKSKVFSQNCIKPVEEIVHSETKLEQVVCSYQKPSRTTESPSRVFTEEAKDSLNTSENDSEHQTNVTRSLFEHEGACSLKSSCCPPSVLSGVVQMPESTVTSTVGDKKTDQMVFHLETNSNSESHDKRQSDNILCSEDSGFVPVEKTPNLVNSVTSNNCADDILKTDECSRTSISNCESADSTWQSSLDTNNNSHYQKKRMFSENEENVKRMKTSEQINENICVSLERQTAFLEQVRHLIQQEIYSINYELFDKKLKELNQRIGKTECRNKHEGIADKLLAKIAKLQRRIKTVLLFQRNCLKPNMLSSNGASKVANSEAMILDKNLESVNSPIEKSSVNYEPSNPSEKGSKKINLSSDQNKSVSESNNDDVMLISVESPNLTTPITSNPTDTRKITSGNSSNSPNAEVMAVQKKLDSIIDLTKEGLSNCNTESPVSPLESHSKAASNSKETTPLAQNAVQVPESFEHLPPLPEPPAPLPELVDKTRDTLPPQKPELKVKRVFRPNGIALTWNITKINPKCAPVESYHLFLCHENSNNKLIWKKIGEIKALPLPMACTLSQFLASNRYYFTVQSKDIFGRYGPFCDIKSIPGFSENLT,mutated_sequence,1.0,682.0,UPI00001FEDD8.a2m,UPI00001FEDD8.npy,gnomAD
+UPI0000071E9E,UPI0000071E9E.csv,MVGVLAMAAAAAPPPVKDCEIEPCKKRKKDDDTSTCKTITKYLSPLGKTRDRVFAPPKPSNILDYFRKTSPTNEKTQLGKECKIKSPESVPVDSNKDCTTPLEMFSNVEFKKKRKRVNLSHQLNNIKTENEAPIEISSDDSKEDYSLNNDFVESSTSVLRYKKQVEVLAENIQDTKSQPNTMTSLQNSKKVNPKQGTTKNDFKKLRKRKCRDVVDLSESLPLAEELNLLKKDGKDTKQMENTTSHANSRDNVTEAAQLNDSIITVSYEEFLKSHKENKVEEIPDSTMSICVPSETVDEIVKSGYISESENSEISQQVRFKTVTVLAQVHPIPPKKTGKIPRIFLKQKQFEMENSLSDPENEQTVQKRKSNVVIQEEELELAVLEAGSSEAVKPKCTLEERQQFMKAFRQPASDALKNGVKKSSDKQKDLNEKCLYEVGRDDNSKKIMENSGIQMVSKNGNLQLHTDKGSFLKEKNKKLKKKNKKTLDTGAIPGKNREGNTQKKETTFFLKEKQYQNRMSLRQRKTEFFKSSTLFNNESLVYEDIANDDLLKVSSLCNNNKLSRKTSIPVKDIKLTQSKAESEASLLNVSTPKSTRRSGRISSTPTTETIRGIDSDDVQDNSQLKASTQKAANLSEKHSLYTAELITVPFDSESPIRMKFTRISTPKKSKKKSNKRSEKSEATDGGFTSQIRKASNTSKNISKAKQLIEKAKALHISRSKVTEEIAIPLRRSSRHQTLPERKKLSETEDSVIIIDSSPTALKHPEKNQKKLQCLNDVLGKKLNTSTKNVPGKMKVAPLFLVRKAQKAADPVPSFDESSQDTSEKSQDCDVQCKAKRDFLMSGLPDLLKRQIAKKAAALDVYNAVSTSFQRVVHVQQKDDGCCLWHLKPPSCPLLTKFKELNTKVIDLSKCGIALGEFSTLNSKLKSGNSAAVFMRTRKEFTEEVRNLLLEEIRWSNPEFSLKKYFPLLLKKQIEHQVLSSECHSKQELEADVSHKETKRKLVEAENSKSKRKKPNEYSKNLEKTNRKSEELSKRNNSSGIKLDSSKDSGTEDMLWTEKYQPQTASELIGNELAIKKLHSWLKDWKRRAELEERQNLKGKRDEKHEDFSGGIDFKGSSDDEEESRLCNTVLITGPTGVGKTAAVYACAQELGFKIFEVNASSQRSGRQILSQLKEATQSHQVDKQGVNSQKPCFFNSYYIGKSPKKISSPKKVVTSPRKVPPPSPKSSGPKRALPPKTLANYFKVSPKPKNNEEIGMLLENNKGIKNSFEQKQITQTKSTNATNSNVKDVGAEEPSRKNATSLILFEEVDVIFDEDAGFLNAIKTFMATTKRPVILTTSDPTFSLMFDGCFEEIKFSTPSLLNVASYLQMICLTENFRTDVKDFVTLLTANTCDIRKSILYLQFWIRSGGGVLEERPLTLYRGNSRNVQLVCSEHGLDNKIYPKNTKKKRVDLPKCDSGCAETLFGLKNIFSPSEDLFSFLKHKITMKEEWHKFIQLLTEFQMRNVDFLYSNLEFILPLPVDTIPETKNFCGPSVTVDASAATKSMNCLARKHSEREQPLKKSQKKKQKKTLVILDDSDLFDTDLDFPDQSISLSSVSSSSNAEESKTGDEESKARDKGNNPETKKSIPCPPKTTAGKKCSALVSHCLNSLSEFMDNMSFLDALLTDVREQNKYGRNDFSWTNGKVTSGLCDEFSLESNDGWTSQSSGELKAAAEALSFTKCSSAISKALETLNSCKKLGRDPTNDLTFYVSQKRNNVYFSQSAANLDNAWKRISVIKSVFSSRSLLYVGNRQASIIEYLPTLRNICKTEKLKEQGKSKRRFLHYFEGIHLDIPKETVNTLAADFP,mutated_sequence,1.0,1844.0,UPI0000071E9E.a2m,UPI0000071E9E.npy,gnomAD
+UPI0000201AC8,UPI0000201AC8.csv,MALTDGGWCLPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASGAGAPVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA,mutated_sequence,1.0,595.0,UPI0000201AC8.a2m,UPI0000201AC8.npy,gnomAD
+UPI00017A6F19,UPI00017A6F19.csv,MREKEQEREEQLMEDKKRKKEDKKKKEATQKVTEQKTKVPEVTKPSLSQPTAASPIGSSPSPPVNGGNNAKRVAVPNGQPPSAARYMPREVPPRFRCQQDHKVLLKRGQPPPPSCMLLGGGAGPPPCTAPGANPNNAQVTGALLQSESGTAPDSTLGGAAASNYANSTWGSGASSNNGTSPNPIHIWDKVIVDGSDMEEWPCIASKDTESSSENTTDNNSASNPGSEKSTLPGSTTSNKGKGSQCQSASSGNECNLGVWKSDPKAKSVQSSNSTTENNNGLGNWRNVSGQDRIGPGSGFSNFNPNSNPSAWPALVQEGTSRKGALETDNSNSSAQVSTVGQTSREQQSKMENAGVNFVVSGREQAQIHNTDGPKNGNTNSLNLSSPNPMENKGMPFGMGLGNTSRSTDAPSQSTGDRKTGSVGSWGAARGPSGTDTVSGQSNSGNNGNNGKEREDSWKGASVQKSTGSKNDSWDNNNRSTGGSWNFGPQDSNDNKWGEGNKMTSGVSQGEWKQPTGSDELKIGEWSGPNQPNSSTGAWDNQKGHPLPENQGNAQAPCWGRSSSSTGSEVGGQSTGSNHKAGSSDSHNSGRRSYRPTHPDCQAVLQTLLSRTDLDPRVLSNTGWGQTQIKQDTVWDIEEVPRPEGKSDKGTEGWESAATQTKNSGGWGDAPSQSNQMKSGWGELSASTEWKDPKNTGGWNDYKNNNSSNWGGGRPDEKTPSSWNENPSKDQGWGGGRQPNQGWSSGKNGWGEEVDQTKNSNWESSASKPVSGWGEGGQNEIGTWGNGGNASLASKGGWEDCKRSPAWNETGRQPNSWNKQHQQQQPPQQPPPPQPEASGSWGGPPPPPPGNVRPSNSSWSSGPQPATPKDEEPSGWEEPSPQSISRKMDIDDGTSAWGDPNSYNYKNVNLWDKNSQGGPAPREPNLPTPMTSKSASVWSKSTPPAPDNGTSAWGEPNESSPGWGEMDDTGASTTGWGNTPANAPNAMKPNSKSMQDGWGESDGPVTGARHPSWEEEEDGGVWNTTGSQGSASSHNSASWGQGGKKQMKCSLKGGNNDSWMNPLAKQFSNMGLLSQTEDNPSSKMDLSVGSLSDKKFDVDKRAMNLGDFNDIMRKDRSGFRPPNSKDMGTTDSGPYFEKLTLPFSNQDGCLGDEAPCSPFSPSPSYKLSPSGSTLPNVSLGAIGTGLNPQNFAARQGGSHGLFGNSTAQSRGLHTPVQPLNSSPSLRAQVPPQFISPQVSASMLKQFPNSGLSPGLFNVGPQLSPQQIAMLSQLPQIPQFQLACQLLLQQQQQQQLLQNQRKISQAVRQQQEQQLARMVSALQQQQQQQQRQPGMKHSPSHPVGPKPHLDNMVPNALNVGLPDLQTKGPIPGYGSGFSSGGMDYGMVGGKEAGTESRFKQWTSMMEGLPSVATQEANMHKNGAIVAPGKTRGGSPYNQFDIIPGDTLGGHTGPAGDSWLPAKSPPTNKIGSKSSNASWPPEFQPGVPWKGIQNIDPESDPYVTPGSVLGGTATSPIVDTDHQLLRDNTTGSNSSLNTSLPSPGAWPYSASDNSFTNVHSTSAKFPDYKSTWSPDPIGHNPTHLSNKMWKNHISSRNTTPLPRPPPGLTNPKPSSPWSSTAPRSVRGWGTQDSRLASASTWSDGGSVRPSYWLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVLGNTTILAEFATDDEVSRFLAQAQPPTPAATPSAPAAGWQSLETGQNQSDPVGPALNLFGGSTGLGQWSSSAGGSSGADLAGASLWGPPNYSSSLWGVPTVEDPHRMGSPAPLLPGDLLGGGSDSI,mutated_sequence,1.0,1833.0,UPI00017A6F19.a2m,UPI00017A6F19.npy,gnomAD
+UPI0000070494,UPI0000070494.csv,MFNPHALDSPAVIFDNGSGFCKAGLSGEFGPRHMVSSIVGHLKFQAPSAEANQKKYFVGEEALYKQEALQLHSPFERGLITGWDDVERLWKHLFEWELGVKPSDQPLLATEPSLNPRENREKMAEVMFENFGVPAFYLSDQAVLALYASACVTGLVVDSGDAVTCTVPIFEGYSLPHAVTKLHVAGRDITELLMQLLLASGHTFPCQLDKGLVDDIKKKLCYVALEPEKELSRRPEEVLREYKLPDGNIISLGDPLHQAPEALFVPQQLGSQSPGLSNMVSSSITKCDTDIQKILFGEIVLSGGTTLFHGLDDRLLKELEQLASKDTPIKITAPPDRWFSTWIGASIVTSLSSFKQMWVTAADFKEFGTSVVQRRCF,mutated_sequence,1.0,377.0,UPI0000070494.a2m,UPI0000070494.npy,gnomAD
+UPI00006C19FA,UPI00006C19FA.csv,MLTLLDGLPSGIIGLMSRLSPDDGLNPNRCSCCIYAPPEPPHLPFFKWTYSFLLKSINLKKLLYTA,mutated_sequence,,,UPI00006C19FA.a2m,UPI00006C19FA.npy,gnomAD
+UPI000012ADEA,UPI000012ADEA.csv,MAEAPASPAPLSPLEVELDPEFEPQSRPRSCTWPLQRPELQASPAKPSGETAADSMIPEEEDDEDDEDGGGRAGSAMAIGGGGGSGTLGSGLLLEDSARVLAPGGQDPGSGPATAAGGLSGGTQALLQPQQPLPPPQPGAAGGSGQPRKCSSRRNAWGNLSYADLITRAIESSPDKRLTLSQIYEWMVRCVPYFKDKGDSNSSAGWKNSIRHNLSLHSRFMRVQNEGTGKSSWWIINPDGGKSGKAPRRRAVSMDNSNKYTKSRGRAAKKKAALQTAPESADDSPSQLSKWPGSPTSRSSDELDAWTDFRSRTNSNASTVSGRLSPIMASTELDEVQDDDAPLSPMLYSSSASLSPSVSKPCTVELPRLTDMAGTMNLNDGLTENLMDDLLDNITLPPSQPSPTGGLMQRSSSFPYTTKGSGLGSPTSSFNSTVFGPSSLNSLRQSPMQTIQENKPATFSSMSHYGNQTLQDLLTSDSLSHSDVMMTQSDPLMSQASTAVSAQNSRRNVMLRNDPMMSFAAQPNQGSLVNQNLLHHQHQTQGALGGSRALSNSVSNMGLSESSSLGSAKHQQQSPVSQSMQTLSDSLSGSSLYSTSANLPVMGHEKFPSDLDLDMFNGSLECDMESIIRSELMDADGLDFNFDSLISTQNVVGLNVGNFTGAKQASSQSWVPG,mutated_sequence,1.0,673.0,UPI000012ADEA.a2m,UPI000012ADEA.npy,gnomAD
+UPI0000456F39,UPI0000456F39.csv,MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREFLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDIVFLVDGSSALGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFNTHPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEISQPAQELKRSSIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHSNKRDIIFLLDGSANVGKTNFPYVRDFVMNLVNSLDIGNDNIRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIREHVPQLLLLLTAGQSEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFLFDGSANLVGQFPVVRDFLYKIIDELNVKPEGTRIAVAQYSDDVKVESRFDEHQSKPEILNLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPASNLKQSGVVPFIFQAKNADPAELEQIVLSPAFILAAESLPKIGDLHPQIVNLLKSVHNGAPAPVSGEKDVVFLLDGSEGVRSGFPLLKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTADRSGDDVRNPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDPAVELKQFGVAPFTIARNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAADIVFLIDSSEGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILGFDGSRDQNVFVAQKGFESKVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEAFDFDEYQPEMLEKFRNMRSQHPYVLTEDTLKVYLNKFRQSSPDSVKVVIHFTDGADGDLADLHRASENLRQEGVRALILVGLERVVNLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRGDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFPGEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPRGEKGERGDVGIRGDPGNPGQDSQERGPKGETGDLGPMGVPGRDGVPGGPGETGKNGGFGRRGPPGAKGNKGGPGQPGFEGEQGTRGAQGPAGPAGPPGLIGEQGISGPRGSGGAAGAPGERGRTGPLGRKGEPGEPGPKGGIGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIRGRRGNSGPPGIVGQKGDPGYPGPAGPKGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFPTELAFALDTSEGVNQDTFGRMRDVVLSIVNDLTIAESNCPRGARVAVVTYNNEVTTEIRFADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNTPTRASPQLREAVLKLSDAGITPLFLTRQEDRQLINALQINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVRQLDMSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGYFFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTTKPVTTTTKPVTIINQPSVKPAAAKPAPAKPVAAKPVATKMATVRPPVAVKPATAAKPVAAKPAAVRPPAAAAAKPVATKPEVPRPQAAKPAATKPATTKPMVKMSREVQVFEITENSAKLHWERAEPPGPYFYDLTVTSAHDQSLVLKQNLTVTDRVIGGLLAGQTYHVAVVCYLRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPVLAKPGVISVMGT,mutated_sequence,1.0,3177.0,UPI0000456F39.a2m,UPI0000456F39.npy,gnomAD
+UPI00025A2DFB,UPI00025A2DFB.csv,QLKQQRDKLRQYQKRIAQQLERERALARQLLRDGRKERAKLLLKKKRYQEQLLDRTENQISSLEAMVQSIEFTQIEMKVMEGLQFGNECLNKMHQVMSIEEVERILDETQEAVEYQRQIDELLAGSFTQEDEDAILEELSAITQEQIELPEVPSEPLPEKIPGISMFDSFEWLEFAGEKWILEGNKTECSQAFLGRGQALLPGCLAWGRACLALGLGLGFPGVR,mutated_sequence,1.0,224.0,UPI00025A2DFB.a2m,UPI00025A2DFB.npy,gnomAD
+UPI0000E06BD6,UPI0000E06BD6.csv,MAAAPARGGGGGGGGGGGCSGSGSSASRGFYFNTVLSLARSLAVQRPASLEKVQKLLCMCPVDFHGIFQLDERRRDAVIALGIFLIESDLQHKDCVVPYLLRLLKGLPKVYWVEESTARKGRGALPVAESFSFCLVTLLSDVAYRDPSLRDEILEVLLQVLHVLLGMCQALEIQDKEYLCKYAIPCLIGISRAFGRYSNMEESLLSKLFPKIPPHSLRVLEELEGVRRRSFNDFRSILPSNLLTVCQEGTLKRKTSSVSSISQVSPERGMPPPSSPGGSAFHYFEASCLPDGTALEPEYYFSTISSSFSVSPLFNGVTYKEFNIPLEMLRELLNLVKKIVEEAVLKSLDAIVASVMEANPSADLYYTSFSDPLYLTMFKMLRDTLYYMKDLPTSFVKEIHDFVLEQFNTSQGELQKILHDADRIHNELSPLKLRCQANAACVDLMVWAVKDEQGAENLCIKLSEKLQSKTSSKVIIAHLPLLICCLQGLGRLCERFPVVVHSVTPSLRDFLVIPSPVLVKLYKYHSQYHTVAGNDIKISVTNEHSESTLNVMSGKKSQPSMYEQLRDIAIDNICRCLKAGLTVDPVIVEAFLASLSNRLYISQESDKDAHLIPDHTIRALGHIAVALRDTPKVMEPILQILQQKFCQPPSPLDVLIIDQLGCLVITGNQYIYQEVWNLFQQISVKASSVVYSATKDYKDHGYRHCSLAVINALANIAANIQDEHLVDELLMNLLELFVQLGLEGKRASERASEKGPALKASSSAGNLGVLIPVIAVLTRRLPPIKEAKPRLQKLFRDFWLYSVLMGFAVEGSGLWPEEWYEGVCEIATKSPLLTFPSKEPLRSVLQYNSAMKNDTVTPAELSELRSTIINLLDPPPEVSALINKLDFAMSTYLLSVYRLEYMRVLRSTDPDRFQVMFCYFEDKAIQKDKSGMMQCVIAVADKVFDAFLNMMADKAKTKENEEELERHAQFLLVNFNHIHKRIRRVADKYLSGLVDKFPHLLWSGTVLKTMLDILQTLSLSLSADIHKDQPYYDIPDAPYRITVPDTYEARESIVKDFAARCGMILQEAMKWAPTVTKSHLQEYLNKHQNWVSGLSQHTGLAMATESILHFAGYNKQNTTLGATQLSERPACVKKDYSNFMASLNLRNRYAGEVYGMIRFSGTTGQMSDLNKMMVQDLHSALDRSHPQHYTQAMFKLTAMLISSKDCDPQLLHHLCWGPLRMFNEHGMETALACWEWLLAGKDGVEVPFMREMAGAWHMTVEQKFGLFSAEIKEADPLAASEASQPKPCPPEVTPHYIWIDFLVQRFEIAKYCSSDQVEIFSSLLQRSMSLNIGGAKGSMNRHVAAIGPRFKLLTLGLSLLHADVVPNATIRNVLREKIYSTAFDYFSCPPKFPTQGEKRLREDISIMIKFWTAMFSDKKYLTASQLVPPDNQDTRSNLDITVGSRQQATQGWINTYPLSSGMSTISKKSGMSKKTNRGSQLHKYYMKRRTLLLSLLATEIERLITWYNPLSAPELELDQAGENSVANWRSKYISLSEKQWKDNVNLAWSISPYLAVQLPARFKNTEAIGNEVTRLVRLDPGAVSDVPEAIKFLVTWHTIDADAPELSHVLCWAPTDPPTGLSYFSSMYPPHPLTAQYGVKVLRSFPPDAILFYIPQIVQALRYDKMGYVREYILWAASKSQLLAHQFIWNMKTNIYLDEEGHQKDPDIGDLLDQLVEEITGSLSGPAKDFYQREFDFFNKITNVSAIIKPYPKGDERKKACLSALSEVKVQPGCYLPSNPEAIVLDIDYKSGTPMQSAAKAPYLAKFKVKRCGVSELEKEGLRCRSDSEDECSTQEADGQKISWQAAIFKVGDDCRQDMLALQIIDLFKNIFQLVGLDLFVFPYRVVATAPGCGVIECIPDCTSRDQLGRQTDFGMYDYFTRQYGDESTLAFQQARYNFIRSMAAYSLLLFLLQIKDRHNGNIMLDKKGHIIHIDFGFMFESSPGGNLGWEPDIKLTDEMVMIMGGKMEATPFKWFMEMCVRGYLAVRPYMDAVVSLVTLMLDTGLPCFRGQTIKLLKHRFSPNMTEREAANFIMKVIQSCFLSNRSRTYDMIQYYQNDIPY,mutated_sequence,1.0,2102.0,UPI0000E06BD6.a2m,UPI0000E06BD6.npy,gnomAD
+UPI0001573469,UPI0001573469.csv,MGNEASLEGEGLPEGLAAAAAAGGGASGAGSPSHTAIPAGMEADLSQLSEEERRQIAAVMSRAQGLPKGSVPPAAAESPSMHRKQELDSSHPPKQSGRPPDPGRPAQPGLSKSRTTDTFRSEQKLPGRSPSTISLKESKSRTDLKEEHKSSMMPGFLSEVNALSAVSSVVNKFNPFDLISDSEASQEETTKKQKVVQKEQGKPEGIIKPPLQQQPPKPIPKQQGPGRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPSQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIFSQASNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKDSKSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLGDIRKMPPAPSGPKASPMPVPTESSSQKTAVPPQVKLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASALQEKKPLPEEKKLIPEEEKIRSEEKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLPSGTPQSLPKEDDKTTKTIKEQPQPPCTAKPDQVEPGKEKTEKEDDKSDTSSSQQPKSPQGLSDTGYSSDGISSSLGEIPSLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSEKKTQPHEVSPEQPKDQEKTQSLSETLEITISEEEIKESQEERKDTFKKDSQQDIPSSKDHKEKSEFVDDITTRREPYDSVEESSESENSPVPQRKRRTSVGSSSSDEYKQEDSQGSGEEEDFIRKQIIEMSADEDASGSEDDEFIRNQLKEISSSTESQKKEETKGKGKITAGKHRRLTRKSSTSIDEDAGRRHSWHDEDDEAFDESPELKYRETKSQESEELVVTGGGGLRRFKTIELNSTIADKYSAESSQKKTSLYFDEEPELEMESLTDSPEDRSRGEGSSSLHASSFTPGTSPTSVSSLDEDSDSSPSHKKGESKQQRKARHRPHGPLLPTIEDSSEEEELREEEELLKEQEKQREIEQQQRKSSSKKSKKDKDELRAQRRRERPKTPPSNLSPIEDASPTEELRQAAEMEELHRSSCSEYSPSIESDPEGFEISPEKIIEVQKVYKLPTAVSLYSPTDEQSIMQKEGSQKALKSAEEMYEEMMHKTHKYKAFPAANERDEVFEKEPLYGGMLIEDYIYESLVEDTYNGSVDGSLLTRQEEENGFMQQKGREQKIRLSEQIYEDPMQKITDLQKEFYELESLHSVVPQEDIVSSSFIIPESHEIVDLGTMVTSTEEERKLLDADAAYEELMKRQQMQLTPGSSPTQAPIGEDMTESTMDFDRMPDASLTSSVLSGASLTDSTSSATLSIPDVKITQHFSTEEIEDEYVTDYTREIQEIIAHESLILTYSEPSESATSVPPSDTPSLTSSVSSVCTTDSSSPITTLDSITTVYTEPVDMITKFEDSEEISSSTYFPGSIIDYPEEISVSLDRTAPPDGRASADHIVISLSDMASSIIESVVPKPEGPVADTVSTDLLISEKDPVKKAKKETGNGIILEVLEAYRDKKELEAERTKSSLSETVFDHPPSSVIALPMKEQLSTTYFTSGETFGQEKPASQLPSGSPSVSSLPAKPRPFFRSSSLDISAQPPPPPPPPPPPPPPPPPPPPPPLPPPTSPKPTILPKKKLTVASPVTTATPLFDAVTTLETTAVLRSNGLPVTRICTTAPPPVPPKPSSIPSGLVFTHRPEPSKPPIAPKPVIPQLPTTTQKPTDIHPKPTGLSLTSSMTLNLVTSADYKLPSPTSPLSPHSNKSSPRFSKSLTETYVVITLPSEPGTPTDSSASQAITSWPLGSPSKDLVSVEPVFSVVPPVTAVEIPISSEQTFYISGALQTFSATPVTAPSSFQAAPTSVTQFLTTEVSKTEVSATRSTAPSVGLSSISITIPPEPLALDNIHLEKPQYKEDGKLQLVGDVIDLRTVPKVEVKTTDKCIDLSASTMDVKRQITANEVYGKQISAVQPSIINLSVTSSIVTPVSLATETVTFVTCTASASYTTGTESLVGAEHAMTTPLQLTTSKHAEPPYRIPSDQVFPIAREEAPINLSLGTPAHAVTLAITKPVTVPPVGVTNGWTDSTVSQGITDGEVVDLSTTKSHRTVVTMDESTSSVMTKIIEDEKPVDLTAGRRAVCCDVVYKLPFGRSCTAQQPATTLPEDRFGYRDDHYQYDRSGPYGYRGIGGMKPSMSDTNLAEAGHFFYKSKNAFDYSEGTDTAVDLTSGRVTTGEVMDYSSKTTGPYPETRQVISGAGISTPQYSTARMTPPPGPQYCVGSVLRSSNGVVYSSVATPTPSTFAITTQPGSIFSTTVRDLSGIHTADAVTSLPAMHHSQPMPRSYFITTGASETDIAVTGIDISASLQTITMESLTAETIDSVPTLTTASEVFPEVVGDESALLIVPEEDKQQQQLDLERELLELEKIKQQRFAEELEWERQEIQRFREQEKIMVQKKLEELQSMKQHLLFQQEEERQAQFMMRQETLAQQQLQLEQIQQLQQQLHQQLEEQKIRQIYQYNYDPSGTASPQTTTEQAILEGQYAALEGSQFWATEDATTTASAVVAIEIPQSQGWYTVQSDGVTQYIAPPGILSTVSEIPLTDVVVKEEKQPKKRSSGAKVRGQYDDMGENMTDDPRSFKKIVDSGVQTDDEDATDRSYVSRRRRTKKSVDTSVQTDDEDQDEWDMPTRSRRKARVGKYGDSMTEADKTKPLSKVSSIAVQTVAEISVQTEPVGTIRTPSIRARVDAKVEIIKHISAPEKTYKGGSLGCQTEADSDTQSPQYLSATSPPKDKKRPTPLEIGYSSHLRADSTVQLAPSPPKSPKVLYSPISPLSPGKALESAFVPYEKPLPDDISPQKVLHPDMAKVPPASPKTAKMMQRSMSDPKPLSPTADESSRAPFQYTEGYTTKGSQTMTSSGAQKKVKRTLPNPPPEEISTGTQSTFSTMGTVSRRRICRTNTMARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDAKLRYLEMGINRRKEALLKEREKRERAYLQGVAEDRDYMSDSEVSSTRPTRIESQHGIERPRTAPQTEFSQFIPPQTQTESQLVPPTSPYTQYQYSSPALPTQAPTSYTQQSHFEQQTLYHQQVSPYQTQPTFQAVATMSFTPQVQPTPTPQPSYQLPSQMMVIQQKPRQTTLYLEPKITSNYEVIRNQPLMIAPVSTDNTFAVSHLGSKYNSLDLRIGLEERSSMASSPISSISADSFYADIDHHTPRNYVLIDDIGEITKGTAALSTAFSLHEKDLSKTDRLLRTTETRRSQEVTDFLAPLQSSSRLHSYVKAEEDPMEDPYELKLLKHQIKQEFRRGTESLDHLAGLSHYYHADTSYRHFPKSEKYSISRLTLEKQAAKQLPAAILYQKQSKHKKSLIDPKMSKFSPIQESRDLEPDYSSYMTSSTSSIGGISSRARLLQDDITFGLRKNITDQQKFMGSSLGTGLGTLGNTIRSALQDEADKPYSSGSRSRPSSRPSSVYGLDLSIKRDSSSSSLRLKAQEAEALDVSFSHASSSARTKPTSLPISQSRGRIPIVAQNSEEESPLSPVGQPMGMARAAAGPLPPISADTRDQFGSSHSLPEVQQHMREESRTRGYDRDIAFIMDDFQHAMSDSEAYHLRREETDWFDKPRESRLENGHGLDRKLPERLVHSRPLSQHQEQIIQMNGKTMHYIFPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEICVRLDLNMLSDSENSQHLELHEPPKAVDKAKSPGVDPKQLAAELQKVSLQQSPLVLSSVVEKGSHVHSGPTSAGSSSVPSPGQPGSPSVSKKKHGSSKPTDGTKVVSHPITGEIQLQINYDLGNLIIHILQARNLVPRDNNGYSDPFVKVYLLPGRGQVMVVQNASAEYKRRTKHVQKSLNPEWNQTVIYKSISMEQLKKKTLEVTVWDYDRFSSNDFLGEVLIDLSSTSHLDNTPRWYPLKEQTESIDHGKSHSSQSSQQSPKPSVIKSRSHGIFPDPSKDMQVPTIEKSHSSPGSSKSSSEGHLRSHGPSRSQSKTSVTQTHLEDAGAAIAAAEAAVQQLRIQPTKPPNHRPAESSVSTGSSGSSFGSGYSVDSEGSSSTAGETNLFPIPRIGKMGQNGQEPVKQPGVGVGLADTEAKTQVMGEIKIALKKEMKTDGEQLIVEILQCRNITYKFKSPDHLPDLYVKIYVMNISTQKKVIKKKTRVCRHDREPSFNETFRFSLSPAGHSLQILLFSNGGKFMKKTLIGEACIWLDKVDLRKRIVNWHKLLVSPTQTH,mutated_sequence,1.0,5142.0,UPI0001573469.a2m,UPI0001573469.npy,gnomAD
+UPI000002A38D,UPI000002A38D.csv,MEDEERQKKLEAGKAKLAQFRQRKAQSDGQSPSKKQKKKRKTSSSKHDVSAHHDLNIDQSQCNEMYINSSQRVESTVIPESTIMRTLHSGEITSHEQGFSVELESEISTTADDCSSEVNGCSFVMRTGKPTNLLREEEFGVDDSYSEQGAQDSPTHLEMMESELAGKQHEIEELNRELEEMRVTYGTEGLQQLQEFEAAIKQRDGIITQLTANLQQARREKDETMREFLELTEQSQKLQIQFQQLQASETLRNSTHSSTAADLLQAKQQILTHQQQLEEQDHLLEDYQKKKEDFTMQISFLQEKIKVYEMEQDKKVENSNKEEIQEKETIIEELNTKIIEEEKKTLELKDKLTTADKLLGELQEQIVQKNQEIKNMKLELTNSKQKERQSSEEIKQLMGTVEELQKRNHKDSQFETDIVQRMEQETQRKLEQLRAELDEMYGQQIVQMKQELIRQHMAQMEEMKTRHKGEMENALRSYSNITVNEDQIKLMNVAINELNIKLQDTNSQKEKLKEELGLILEEKCALQRQLEDLVEELSFSREQIQRARQTIAEQESKLNEAHKSLSTVEDLKAEIVSASESRKELELKHEAEVTNYKIKLEMLEKEKNAVLDRMAESQEAELERLRTQLLFSHEEELSKLKEDLEIEHRINIEKLKDNLGIHYKQQIDGLQNEMSQKIETMQFEKDNLITKQNQLILEISKLKDLQQSLVNSKSEEMTLQINELQKEIEILRQEEKEKGTLEQEVQELQLKTELLEKQMKEKENDLQEKFAQLEAENSILKDEKKTLEDMLKIHTPVSQEERLIFLDSIKSKSKDSVWEKEIEILIEENEDLKQQCIQLNEEIEKQRNTFSFAEKNFEVNYQELQEEYACLLKVKDDLEDSKNKQELEYKSKLKALNEELHLQRINPTTVKMKSSVFDEDKTFVAETLEMGEVVEKDTTELMEKLEVTKREKLELSQRLSDLSEQLKQKHGEISFLNEEVKSLKQEKEQVSLRCRELEIIINHNRAENVQSCDTQVSSLLDGVVTMTSRGAEGSVSKVNKSFGEESKIMVEDKVSFENMTVGEESKQEQLILDHLPSVTKESSLRATQPSENDKLQKELNVLKSEQNDLRLQMEAQRICLSLVYSTHVDQVREYMENEKDKALCSLKEELIFAQEEKIKELQKIHQLELQTMKTQETGDEGKPLHLLIGKLQKAVSEECSYFLQTLCSVLGEYYTPALKCEVNAEDKENSGDYISENEDPELQDYRYEVQDFQENMHTLLNKVTEEYNKLLVLQTRLSKIWGQQTDGMKLEFGEENLPKEETEFLSIHSQMTNLEDIDVNHKSKLSSLQDLEKTKLEEQVQELESLISSLQQQLKETEQNYEAEIHCLQKRLQAVSESTVPPSLPVDSVVITESDAQRTMYPGSCVKKNIDGTIEFSGEFGVKEETNIVKLLEKQYQEQLEEEVAKVIVSMSIAFAQQTELSRISGGKENTASSKQAHAVCQQEQHYFNEMKLSQDQIGFQTFETVDVKFKEEFKPLSKELGEHGKEILLSNSDPHDIPESKDCVLTISEEMFSKDKTFIVRQSIHDEISVSSMDASRQLMLNEEQLEDMRQELVRQYQEHQQATELLRQAHMRQMERQREDQEQLQEEIKRLNRQLAQRSSIDNENLVSERERVLLEELEALKQLSLAGREKLCCELRNSSTQTQNGNENQGEVEEQTFKEKELDRKPEDVPPEILSNERYALQKANNRLLKILLEVVKTTAAVEETIGRHVLGILDRSSKSQSSASLIWRSEAEASVKSCVHEEHTRVTDESIPSYSGSDMPRNDINMWSKVTEEGTELSQRLVRSGFAGTEIDPENEELMLNISSRLQAAVEKLLEAISETSSQLEHAKVTQTELMRESFRQKQEATESLKCQEELRERLHEESRAREQLAVELSKAEGVIDGYADEKTLFERQIQEKTDIIDRLEQELLCASNRLQELEAEQQQIQEERELLSRQKEAMKAEAGPVEQQLLQETEKLMKEKLEVQCQAEKVRDDLQKQVKALEIDVEEQVSRFIELEQEKNTELMDLRQQNQALEKQLEKMRKFLDEQAIDREHERDVFQQEIQKLEQQLKVVPRFQPISEHQTREVEQLANHLKEKTDKCSELLLSKEQLQRDIQERNEEIEKLEFRVRELEQALLVSADTFQKVEDRKHFGAVEAKPELSLEVQLQAERDAIDRKEKEITNLEEQLEQFREELENKNEEVQQLHMQLEIQKKESTTRLQELEQENKLFKDDMEKLGLAIKESDAMSTQDQHVLFGKFAQIIQEKEVEIDQLNEQVTKLQQQLKITTDNKVIEEKNELIRDLETQIECLMSDQECVKRNREEEIEQLNEVIEKLQQELANIGQKTSMNAHSLSEEADSLKHQLDVVIAEKLALEQQVETANEEMTFMKNVLKETNFKMNQLTQELFSLKRERESVEKIQSIPENSVNVAIDHLSKDKPELEVVLTEDALKSLENQTYFKSFEENGKGSIINLETRLLQLESTVSAKDLELTQCYKQIKDMQEQGQFETEMLQKKIVNLQKIVEEKVAAALVSQIQLEAVQEYAKFCQDNQTISSEPERTNIQNLNQLREDELGSDISALTLRISELESQVVEMHTSLILEKEQVEIAEKNVLEKEKKLLELQKLLEGNEKKQREKEKKRSPQDVEVLKTTTELFHSNEESGFFNELEALRAESVATKAELASYKEKAEKLQEELLVKETNMTSLQKDLSQVRDHLAEAKEKLSILEKEDETEVQESKKACMFEPLPIKLSKSIASQTDGTLKISSSNQTPQILVKNAGIQINLQSECSSEEVTEIISQFTEKIEKMQELHAAEILDMESRHISETETLKREHYVAVQLLKEECGTLKAVIQCLRSKEGSSIPELAHSDAYQTREICSSDSGSDWGQGIYLTHSQGFDIASEGRGEESESATDSFPKKIKGLLRAVHNEGMQVLSLTESPYSDGEDHSIQQVSEPWLEERKAYINTISSLKDLITKMQLQREAEVYDSSQSHESFSDWRGELLLALQQVFLEERSVLLAAFRTELTALGTTDAVGLLNCLEQRIQEQGVEYQAAMECLQKADRRSLLSEIQALHAQMNGRKITLKREQESEKPSQELLEYNIQQKQSQMLEMQVELSSMKDRATELQEQLSSEKMVVAELKSELAQTKLELETTLKAQHKHLKELEAFRLEVKDKTDEVHLLNDTLASEQKKSRELQWALEKEKAKLGRSEERDKEELEDLKFSLESQKQRNLQLNLLLEQQKQLLNESQQKIESQRMLYDAQLSEEQGRNLELQVLLESEKVRIREMSSTLDRERELHAQLQSSDGTGQSRPPLPSEDLLKELQKQLEEKHSRIVELLNETEKYKLDSLQTRQQMEKDRQVHRKTLQTEQEANTEGQKKMHELQSKVEDLQRQLEEKRQQVYKLDLEGQRLQGIMQEFQKQELEREEKRESRRILYQNLNEPTTWSLTSDRTRNWVLQQKIEGETKESNYAKLIEMNGGGTGCNHELEMIRQKLQCVASKLQVLPQKASERLQFETADDEDFIWVQENIDEIILQLQKLTGQQGEEPSLVSPSTSCGSLTERLLRQNAELTGHISQLTEEKNDLRNMVMKLEEQIRWYRQTGAGRDNSSRFSLNGGANIEAIIASEKEVWNREKLTLQKSLKRAEAEVYKLKAELRNDSLLQTLSPDSEHVTLKRIYGKYLRAESFRKALIYQKKYLLLLLGGFQECEDATLALLARMGGQPAFTDLEVITNRPKGFTRFRSAVRVSIAISRMKFLVRRWHRVTGSVSININRDGFGLNQGAEKTDSFYHSSGGLELYGEPRHTTYRSRSDLDYIRSPLPFQNRYPGTPADFNPGSLACSQLQNYDPDRALTDYITRLEALQRRLGTIQSGSTTQFHAGMRR,mutated_sequence,1.0,3907.0,UPI000002A38D.a2m,UPI000002A38D.npy,gnomAD
+UPI000050BBEF,UPI000050BBEF.csv,MDMASESVGGKILFATDDFFAPAENLIKSDSPCFKEHEYTEFGKWMDGWETRRKRIPGHDWCVLRLGIQGVIRGFDVDVSYFTGDYAPRVSIQAANLEEDKLPEIPERGTRTGAAATPEEFEAIAELKSDDWSYLVPMTELKPGNPASGHNYFLVNSQQRWTHIRLNIFPDGGIARLRVFGTGQKDWTATDPKEPADLVAIAFGGVCVGFSNAKFGHPNNIIGVGGAKSMADGWETARRLDRPPILENDENGILLVPGCEWAVFRLAHPGVITRIEIDTKYFEGNAPDSCKVDGCILTTQEEEAVIRQKWILPAHKWKPLLPVTKLSPNQSHLFDSLTLELQDVITHARLTIVPDGGVSRLRLRGFPSSICLLRPREKPMLKFSVSFKANP,mutated_sequence,1.0,391.0,UPI000050BBEF.a2m,UPI000050BBEF.npy,gnomAD
+UPI00001419CC,UPI00001419CC.csv,MTADKDKDKDKEKDRDRDRDREREKRDKARESENSRPRRSCTLEGGAKNYAESDHSEDEDNDNNSATAEESTKKNKKKPPKKKSRYERTDTGEITSYITEDDVVYRPGDCVYIESRRPNTPYFICSIQDFKLVHNSQACCRSPTPALCDPPACSLPVASQPPQHLSEAGRGPVGSKRDHLLMNVKWYYRQSEVPDSVYQHLVQDRHNENDSGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFKARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMPGVNDCDLLMYLRAARSMAAFAGMCDGGSTEDGCVAASRDDTTLNALNTLHESGYDAGKALQRLVKKPVPKLIEKCWTEDEVKRFVKGLRQYGKNFFRIRKELLPNKETGELITFYYYWKKTPEAASSRAHRRHRRQAVFRRIKTRTASTPVNTPSRPPSSEFLDLSSASEDDFDSEDSEQELKGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQNLPPPPASHPPTGLHQVAPQPPFAQHPFVPGGPPPITPPTCPSTSTPPAGPGTSAQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEPTVVDTPSHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPTAGPHPFASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQL,mutated_sequence,1.0,1566.0,UPI00001419CC.a2m,UPI00001419CC.npy,gnomAD
+UPI0000074015,UPI0000074015.csv,MSDEGSRGSRLPLALPPASQGCSSGGGGGGSSAGGSGNSRPPRNLQGLLQMAITAGSEEPDPPPEPMSEERRQWLQEAMSAAFRGQREEVEQMKSCLRVLSQPMPPTAGEAEQAADQQEREGALELLADLCENMDNAADFCQLSGMHLLVGRYLEAGAAGLRWRAAQLIGTCSQNVAAIQEQVLGLGALRKLLRLLDRDACDTVRVKALFAISCLVREQEAGLLQFLRLDGFSVLMRAMQQQVQKLKVKSAFLLQNLLVGHPEHKGTLCSMGMVQQLVALVRTEHSPFHEHVLGALCSLVTDFPQGVRECREPELGLEELLRHRCQLLQQHEEYQEELEFCEKLLQTCFSSPADDSMDR,mutated_sequence,1.0,359.0,UPI0000074015.a2m,UPI0000074015.npy,gnomAD
+UPI00001C2068,UPI00001C2068.csv,MENRPGSFQYVPVQLQGGAPWGFTLKGGLEHCEPLTVSKIEDGGKAALSQKMRTGDELVNINGTPLYGSRQEALILIKGSFRILKLIVRRRNAPVSRPHSWHVAKLLEGCPEAATTMHFPSEAFSLSWHSGCNTSDVCVQWCPLSRHCSTEKSSSIGSMESLEQPGQATYESHLLPIDQNMYPNQRDSAYSSFSASSNASDCALSLRPEEPASTDCIMQGPGPTKAPSGRPNVAETSGGSRRTNGGHLTPSSQMSSRPQEGYQSGPAKAVRGPPQPPVRRDSLQASRAQLLNGEQRRASEPVVPLPQKEKLSLEPVLPARNPNRFCCLSGHDQVTSEGHQNCEFSQPPESSQQGSEHLLMQASTKAVGSPKACDRASSVDSNPLNEASAELAKASFGRPPHLIGPTGHRHSAPEQLLASHLQHVHLDTRGSKGMELPPVQDGHQWTLSPLHSSHKGKKSPCPPTGGTHDQSSKERKTRQVDDRSLVLGHQSQSSPPHGEADGHPSEKGFLDPNRTSRAASELANQQPSASGSLVQQATDCSSTTKAASGTEAGEEGDSEPKECSRMGGRRSGGTRGRSIQNRRKSERFATNLRNEIQRRKAQLQKSKGPLSQLCDTKEPVEETQEPPESPPLTASNTSLLSSCKKPPSPRDKLFNKSMMLRARSSECLSQAPESHESRTGLEGRISPGQRPGQSSLGLNTWWKAPDPSSSDPEKAHAHCGVRGGHWRWSPEHNSQPLVAAAMEGPSNPGDNKELKASTAQAGEDAILLPFADRRKFFEESSKSLSTSHLPGLTTHSNKTFTQRPKPIDQNFQPMSSSCRELRRHPMDQSYHSADQPYHATDQSYHSMSPLQSETPTYSECFASKGLENSMCCKPLHCGDFDYHRTCSYSCSVQGALVHDPCIYCSGEICPALLKRNMMPNCYNCRCHHHQCIRCSVCYHNPQHSALEDSSLAPGNTWKPRKLTVQEFPGDKWNPITGNRKTSQSGREMAHSKTSFSWATPFHPCLENPALDLSSYRAISSLDLLGDFKHALKKSEETSVYEEGSSLASMPHPLRSRAFSESHISLAPQSTRAWGQHRRELFSKGDETQSDLLGARKKAFPPPRPPPPNWEKYRLFRAAQQQKQQQQQQKQQEEEEEEEEEEEEEEEEEEEEAEEEEEELPPQYFSSETSGSCALNPEEVLEQPQPLSFGHLEGSRQGSQSVPAEQESFALHSSDFLPPIRGHLGSQPEQAQPPCYYGIGGLWRTSGQEATESAKQEFQHFSPPSGAPGIPTSYSAYYNISVAKAELLNKLKDQPEMAEIGLGEEEVDHELAQKKIQLIESISRKLSVLREAQRGLLEDINANSALGEEVEANLKAVCKSNEFEKYHLFVGDLDKVVNLLLSLSGRLARVENALNSIDSEANQEKLVLIEKKQQLTGQLADAKELKEHVDRREKLVFGMVSRYLPQDQLQDYQHFVKMKSALIIEQRELEEKIKLGEEQLKCLRESLLLGPSNF,mutated_sequence,1.0,1493.0,UPI00001C2068.a2m,UPI00001C2068.npy,gnomAD
+UPI00005788EA,UPI00005788EA.csv,MTMFKEAVTFKDVAVVFTEEELGLLDVSQRKLYRDVMLENFRNLLSVGHQLSHRDTFHFQREEKFWIMETATQREGNSGGKIQTELESVPETGPHEEWSCQQIWEQTASELTRPQDSISSSQFSTQGDVPSQVDAGLSIIHIGETPSEHGKCKKFFSDVSILDLHQQLHSGKISHTCNEYRKRFCYSSALCLHQKVHMGEKRYKCDVCSKAFSQNSQLQTHQRIHTGEKPFKCEQCGKSFSRRSGMYVHCKLHTGEKPHICEECGKAFIHNSQLREHQRIHTGEKPFKCYICGKSFHSRSNLNRHSMVHMQEKSFRCDTCSNSFGQRSALNSHCMDHTKEKLYKCEECGRSFTCRQDLCKHQMDHTGDKPYNCNVCGKGFRWSSCLSRHQRVHNGETTFKCDGCGKRFYMNSQGHSHQRAYREEELYKCQKCGKGYISKFNLDLHQRVHTGERPYNCKECGKSFRWASGILRHKRLHTGEKPFKCEECGKRFTENSKLRFHQRIHTGEKPYKCEECGKGFRWASTHLTHQRLHSREKLFQCEDCGKSSEHSSCLQDQQSDHSGEKTSKCEDCGKRYERRLNLDMILSLFLNDI,mutated_sequence,1.0,593.0,UPI00005788EA.a2m,UPI00005788EA.npy,gnomAD
+UPI00006BFF57,UPI00006BFF57.csv,MKIITYFCIWAVAWAIPVPQSKPLERHVEKSMNLHLLARSNVSVQDELNASGTIKESGVLVHEGDRGRQENTQDGHKGEGNGSKWAEVGGKSFSTYSTLANEEGNIEGWNGDTGKAETYGHDGIHGKEENITANGIQGQVSIIDNAGATNRSNTNGNTDKNTQNGDVGDAGHNEDVAVVQEDGPQVAGSNNSTDNEDEIIENSCRNEGNTSEITPQINSKRNGTKEAEVTPGTGEDAGLDNSDGSPSGNGADEDEDEGSGDDEDEEAGNGKDSSNNSKGQEGQDHGKEDDHDSSIGQNSDSKEYYDPEGKEDPHNEVDGDKTSKSEENSAGIPEDNGSQRIEDTQKLNHRESKRVENRITKESETHAVGKSQDKGIEIKGPSSGNRNITKEVGKGNEGKEDKGQHGMILGKGNVKTQGEVVNIEGPGQKSEPGNKVGHSNTGSDSNSDGYDSYDFDDKSMQGDDPNSSDESNGNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSKGAEDDDSDSTSDTNNSDSNGNGNNGNDDNDKSDSGKGKSDSSDSDSSDSSNSSDSSDSSDSDSSDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSDSSNSSNSSDSSDSSDSSDSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSDSSDGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSSDSSDSSNSSDSDSSDSSNSSDSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDSSDSSDSSDSTSDSNDESDSQSKSGNGNNNGSDSDSDSEGSDSNHSTSDD,mutated_sequence,1.0,1301.0,UPI00006BFF57.a2m,UPI00006BFF57.npy,gnomAD
+UPI000013D050,UPI000013D050.csv,MGDSRDLCPHLDSIGEVTKEDLLLKSKGTCQSCGVTGPNLWACLQVACPYVGCGESFADHSTIHAQAKKHNLTVNLTTFRLWCYACEKEVFLEQRLAAPLLGSSSKFSEQDSPPPSHPLKAVPIAVADEGESESEDDDLKPRGLTGMKNLGNSCYMNAALQALSNCPPLTQFFLECGGLVRTDKKPALCKSYQKLVSEVWHKKRPSYVVPTSLSHGIKLVNPMFRGYAQQDTQEFLRCLMDQLHEELKEPVVATVALTEARDSDSSDTDEKREGDRSPSEDEFLSCDSSSDRGEGDGQGRGGGSSQAETELLIPDEAGRAISEKERMKDRKFSWGQQRTNSEQVDEDADVDTAMAALDDQPAEAQPPSPRSSSPCRTPEPDNDAHLRSSSRPCSPVHHHEGHAKLSSSPPRASPVRMAPSYVLKKAQVLSAGSRRRKEQRYRSVISDIFDGSILSLVQCLTCDRVSTTVETFQDLSLPIPGKEDLAKLHSAIYQNVPAKPGACGDSYAAQGWLAFIVEYIRRFVVSCTPSWFWGPVVTLEDCLAAFFAADELKGDNMYSCERCKKLRNGVKYCKVLRLPEILCIHLKRFRHEVMYSFKINSHVSFPLEGLDLRPFLAKECTSQITTYDLLSVICHHGTAGSGHYIAYCQNVINGQWYEFDDQYVTEVHETVVQNAEGYVLFYRKSSEEAMRERQQVVSLAAMREPSLLRFYVSREWLNKFNTFAEPGPITNQTFLCSHGGIPPHKYHYIDDLVVILPQNVWEHLYNRFGGGPAVNHLYVCSICQVEIEALAKRRRIEIDTFIKLNKAFQAEESPGVIYCISMQWFREWEAFVKGKDNEPPGPIDNSRIAQVKGSGHVQLKQGADYGQISEETWTYLNSLYGGGPEIAIRQSVAQPLGPENLHGEQKIEAETRAV,mutated_sequence,1.0,914.0,UPI000013D050.a2m,UPI000013D050.npy,gnomAD
+UPI000011F17B,UPI000011F17B.csv,MGNRSTADADGLLAGRGPAAGASAGASAGLAGQGAAALVGGVLLIGAVLAGNSLVCVSVATERALQTPTNSFIVSLAAADLLLALLVLPLFVYSEVQGGAWLLSPRLCDALMAMDVMLCTASIFNLCAISVDRFVAVAVPLRYNRQGGSRRQLLLIGATWLLSAAVAAPVLCGLNDVRGRDPAVCRLEDRDYVVYSSVCSFFLPCPLMLLLYWATFRGLQRWEVARRAKLHGRAPRRPSGPGPPSPTPPAPRLPQDPCGPDCAPPAPGLPRGPCGPDCAPAAPSLPQDPCGPDCAPPAPGLPPDPCGSNCAPPDAVRAAALPPQTPPQTRRRRRAKITGRERKAMRVLPVVVGAFLLCWTPFFVVHITQALCPACSVPPRLVSAVTWLGYVNSALNPVIYTVFNAEFRNVFRKALRACC,mutated_sequence,1.0,419.0,UPI000011F17B.a2m,UPI000011F17B.npy,gnomAD
+UPI000004064B,UPI000004064B.csv,MEFVRALWLGLALALGPGSAGGHPQPCGVLARLGGSVRLGALLPRAPLARARARAALARAALAPRLPHNLSLELVVAAPPARDPASLTRGLCQALVPPGVAALLAFPEARPELLQLHFLAAATETPVLSLLRREARAPLGAPNPFHLQLHWASPLETLLDVLVAVLQAHAWEDVGLALCRTQDPGGLVALWTSRAGRPPQLVLDLSRRDTGDAGLRARLAPMAAPVGGEAPVPAAVLLGCDIARARRVLEAVPPGPHWLLGTPLPPKALPTAGLPPGLLALGEVARPPLEAAIHDIVQLVARALGSAAQVQPKRALLPAPVNCGDLQPAGPESPGRFLARFLANTSFQGRTGPVWVTGSSQVHMSRHFKVWSLRRDPRGAPAWATVGSWRDGQLDLEPGGASARPPPPQGAQVWPKLRVVTLLEHPFVFARDPDEDGQCPAGQLCLDPGTNDSATLDALFAALANGSAPRALRKCCYGYCIDLLERLAEDTPFDFELYLVGDGKYGALRDGRWTGLVGDLLAGRAHMAVTSFSINSARSQVVDFTSPFFSTSLGIMVRARDTASPIGAFMWPLHWSTWLGVFAALHLTALFLTVYEWRSPYGLTPRGRNRSTVFSYSSALNLCYAILFRRTVSSKTPKCPTGRLLMNLWAIFCLLVLSSYTANLAAVMVGDKTFEELSGIHDPKLHHPAQGFRFGTVWESSAEAYIKKSFPDMHAHMRRHSAPTTPRGVAMLTSDPPKLNAFIMDKSLLDYEVSIDADCKLLTVGKPFAIEGYGIGLPQNSPLTSNLSEFISRYKSSGFIDLLHDKWYKMVPCGKRVFAVTETLQMSIYHFAGLFVLLCLGLGSALLSSLGEHAFFRLALPRIRKGSRLQYWLHTSQKIHRALNTEPPEGSKEETAEAEPSGPEVEQQQQQQDQPTAPEGWKRARRAVDKERRVRFLLEPAVVVAPEADAEAEAAPREGPVWLCSYGRPPAARPTGAPQPGELQELERRIEVARERLRQALVRRGQLLAQLGDSARHRPRRLLQARAAPAEAPPHSGRPGSQE,mutated_sequence,1.0,1043.0,UPI000004064B.a2m,UPI000004064B.npy,gnomAD
+UPI0000127E5B,UPI0000127E5B.csv,MPRSCCSRSGALLLALLLQASMEVRGWCLESSQCQDLTTESNLLECIRACKPDLSAETPMFPGNGDEQPLTENPRKYVMGHFRWDRFGRRNSSSSGSSGAGQKREDVSAGEDCGPLPEGGPEPRSDGAKPGPREGKRSYSMEHFRWGKPVGKKRRPVKVYPNGAEDESAEAFPLEFKRELTGQRLREGDGPDGPADDGAGAQADLEHSLLVAAEKKDEGPYRMEHFRWGSPPKDKRYGGFMTSEKSQTPLVTLFKNAIIKNAYKKGE,mutated_sequence,1.0,267.0,UPI0000127E5B.a2m,UPI0000127E5B.npy,gnomAD
+UPI00001BB2B9,UPI00001BB2B9.csv,MSGIKRTIKETDPDYEDVSVALPNKRHKAIENSARDAAVQKIETIIKEQFALEMKNKEHEIEVIDQRLIEARRMMDKLRACIVANYYASAGLLKVSEGSKTCDTMVFNHPAIKKFLESPSRSSSPANQRAETPSANHSESDSLSQHNDFLSDKDNNSNMDIEERLSNNMEQRPSRNTGRDTSRITGSHKTEQRNADLTDETSRLFVKKTIVVGNVSKYIPPDKREENDQSTHKWMVYVRGSRREPSINHFVKKVWFFLHPSYKPNDLVEVREPPFHLTRRGWGEFPVRVQVHFKDSQNKRIDIIHNLKLDRTYTGLQTLGAETVVDVELHRHSLGEDCIYPQSSESDISDAPPSLPLTIPAPVKASSPIKQSHEPVPDTSVEKGFPASTEAERHTPFYALPSSLERTPTKMTTSQKVTFCSHGNSAFQPIASSCKIVPQSQVPNPESPGKSFQPITMSCKIVSGSPISTPSPSPLPRTPTSTPVHVKQGTAGSVINNPYVIMDKQPGQVIGATTPSTGSPTNKISTASQVSQGTGSPVPKIHGSSFVTSTVKQEDSLFASMPPLCPIGSHPKVQSPKPITGGLGAFTKVIIKQEPGEAPHVPATGAASQSPLPQYVTVKGGHMIAVSPQKQVITPGEGIAQSAKVQPSKVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLTKAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGGGGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQTLKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTKTSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQSFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKSEDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQIDSEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALHRNVYASVVEDMILKATEQLVNDILRQALAVGYQTASHNRIPKEITVSNIHQAICNIPFLDFLTNKHMGILNEDQ,mutated_sequence,1.0,1422.0,UPI00001BB2B9.a2m,UPI00001BB2B9.npy,gnomAD
+UPI000008AB1D,UPI000008AB1D.csv,MAVMAPRTLLLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYWDQETRNVKAQSQTDRVDLGTLRGYYNQSEAGSHTIQIMYGCDVGSDGRFLRGYRQDAYDGKDYIALNEDLRSWTAADMAAQITKRKWEAAHEAEQLRAYLDGTCVEWLRRYLENGKETLQRTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWELSSQPTIPIVGIIAGLVLLGAVITGAVVAAVMWRRKSSDRKGGSYTQAASSDSAQGSDVSLTACKV,mutated_sequence,1.0,365.0,UPI000008AB1D.a2m,UPI000008AB1D.npy,gnomAD
+UPI0000456761,UPI0000456761.csv,MSAPPVLRPPSPLLPVAAAAAAAAAALVPGSGPGPAPFLAPVAAPVGGISFHLQIGLSREPVLLLQDSSGDYSLAHVREMACSIVDQKFPECGFYGMYDKILLFRHDPTSENILQLVKAASDIQEGDLIEVVLSASATFEDFQIRPHALFVHSYRAPAFCDHCGEMLWGLVRQGLKCEGCGLNYHKRCAFKIPNNCSGVRRRRLSNVSLTGVSTIRTSSAELSTSAPDEPLLQKSPSESFIGREKRSNSQSYIGRPIHLDKILMSKVKVPHTFVIHSYTRPTVCQYCKKLLKGLFRQGLQCKDCRFNCHKRCAPKVPNNCLGEVTINGDLLSPGAESDVVMEEGSDDNDSERNSGLMDDMEEAMVQDAEMAMAECQNDSGEMQDPDPDHEDANRTISPSTSNNIPLMRVVQSVKHTKRKSSTVMKEGWMVHYTSKDTLRKRHYWRLDSKCITLFQNDTGSRYYKEIPLSEILSLEPVKTSALIPNGANPHCFEITTANVVYYVGENVVNPSSPSPNNSVLTSGVGADVARMWEIAIQHALMPVIPKGSSVGTGTNLHRDISVSISVSNCQIQENVDISTVYQIFPDEVLGSGQFGIVYGGKHRKTGRDVAIKIIDKLRFPTKQESQLRNEVAILQNLHHPGVVNLECMFETPERVFVVMEKLHGDMLEMILSSEKGRLPEHITKFLITQILVALRHLHFKNIVHCDLKPENVLLASADPFPQVKLCDFGFARIIGEKSFRRSVVGTPAYLAPEVLRNKGYNRSLDMWSVGVIIYVSLSGTFPFNEDEDIHDQIQNAAFMYPPNPWKEISHEAIDLINNLLQVKMRKRYSVDKTLSHPWLQDYQTWLDLRELECKIGERYITHESDDLRWEKYAGEQGLQYPTHLINPSASHSDTPETEETEMKALGERVSIL,mutated_sequence,1.0,912.0,UPI0000456761.a2m,UPI0000456761.npy,gnomAD
+UPI000015FEE1,UPI000015FEE1.csv,MVFESVVVDVLNRFLGDYVVDLDTSQLSLGIWKGAVALKNLQIKENALSQLDVPFKVKVGHIGNLKLIIPWKNLYTQPVEAVLEEIYLLIVPSSRIKYDPLKEEKQLMEAKQQELKRIEEAKQKVVDQEQHLPEKQDTFAEKLVTQIIKNLQVKISSIHIRYEDDITNRDKPLSFGISLQNLSMQTTDQYWVPCLHDETEKLVRKLIRLDNLFAYWNVKSQMFYLSDYDNSLDDLKNGIVNENIVPEGYDFVFRPISANAKLVMNRRSDFDFSAPKINLEIELHNIAIEFNKPQYFSIMELLESVDMMAQNLPYRKFKPDVPLHHHAREWWAYAIHGVLEVNVCPRLWMWSWKHIRKHRQKVKQYKELYKKKLTSKKPPGELLVSLEELEKTLDVFNITIARQTAEVEVKKAGYKIYKEGVKDPEDNKGWFSWLWSWSEQNTNEQQPDVQPETLEEMLTPEEKALLYEAIGYSETAVDPTLLKTFEALKFFVHLKSMSIVLRENHQKPELVDIVIEEFSTLIVQRPGAQAIKFETKIDSFHITGLPDNSEKPRLLSSLDDAMSLFQITFEINPLDETVSQRCIIEAEPLEIIYDARTVNSIVEFFRPPKEVHLAQLTAATLTKLEEFRSKTATGLLYIIETQKVLDLKINLKASYIIVPQDGIFSPTSNLLLLDLGHLKVTSKSRSELPDVKQGEANLKEIMDRAYDSFDIQLTSVQLLYSRVGDNWREARKLSVSTQHILVPMHFNLELSKAMVFMDVRMPKFKIYGKLPLISLRISDKKLQGIMELIESIPKPEPVTEVSAPVKSFQIQTSTSLGTSQISQKIIPLLELPSVSEDDSEEEFFDAPCSPLEEPLQFPTGVKSIRTRKLQKQDCSVNMTTFKIRFEVPKVLIEFYHLVGDCELSVVEILVLGLGAEIEIRTYDLKANAFLKEFCLKCPEYLDENKKPVYLVTTLDNTMEDLLTLEYVKAEKNVPDLKSTYNNVLQLIKVNFSSLDIHLHTEALLNTINYLHNILPQSEEKSAPVSTTETEDKGDVIKKLALKLSTNEDIITLQILAELSCLQIFIQDQKCNISEIKIEGLDSEMIMRPSETEINAKLRNIIVLDSDITAIYKKAVYITGKEVFSFKMVSYMDATAGSAYTDMNVVDIQVNLIVGCIEVVFVTKFLYSILAFIDNFQAAKQALAEATVQAAGMAATGVKELAQRSSRMALDINIKAPVVVIPQSPVSENVFVADFGLITMTNTFHMITESQSSPPPVIDLITIKLSEMRLYRSRFINDAYQEVLDLLLPLNLEVVVERNLCWEWYQEVPCFNVNAQLKPMEFILSQEDITTIFKTLHGNIWYEKDGSASPAVTKDQYSATSGVTTNASHHSGGATVVTAAVVEVHSRALLVKTTLNISFKTDDLTMVLYSPGPKQASFTDVRDPSLKLAEFKLENIISTLKMYTDGSTFSSFSLKNCILDDKRPHVKKATPRMIGLTVGFDKKDMMDIKYRKVRDGCVTDAVFQEMYICASVEFLQTVANVFLEAYTTGTAVETSVQTWTAKEEVPTQESVKWEINVIIKNPEIVFVADMTKNDAPALVITTQCEICYKGNLENSTMTAAIKDLQVRACPFLPVKRKGKITTVLQPCDLFYQTTQKGTDPQVIDMSVKSLTLKVSPVIINTMITITSALYTTKETIPEETASSTAHLWEKKDTKTLKMWFLEESNETEKIAPTTELVPKGEMIKMNIDSIFIVLEAGIGHRTVPMLLAKSRFSGEGKNWSSLINLHCQLELEVHYYNEMFGVWEPLLEPLEIDQTEDFRPWNLGIKMKKKAKMAIVESDPEEENYKVPEYKTVISFHSKDQLNITLSKCGLVMLNNLVKAFTEAATGSSADFVKDLAPFMILNSLGLTISVSPSDSFSVLNIPMAKSYVLKNGESLSMDYIRTKDNDHFNAMTSLSSKLFFILLTPVNHSTADKIPLTKVGRRLYTVRHRESGVERSIVCQIDTVEGSKKVTIRSPVQIRNHFSVPLSVYEGDTLLGTASPENEFNIPLGSYRSFIFLKPEDENYQMCEGIDFEEIIKNDGALLKKKCRSKNPSKESFLINIVPEKDNLTSLSVYSEDGWDLPYIMHLWPPILLRNLLPYKIAYYIEGIENSVFTLSEGHSAQICTAQLGKARLHLKLLDYLNHDWKSEYHIKPNQQDISFVSFTCVTEMEKTDLDIAVHMTYNTGQTVVAFHSPYWMVNKTGRMLQYKADGIHRKHPPNYKKPVLFSFQPNHFFNNNKVQLMVTDSELSNQFSIDTVGSHGAVKCKGLKMDYQVGVTIDLSSFNITRIVTFTPFYMIKNKSKYHISVAEEGNDKWLSLDLEQCIPFWPEYASSKLLIQVERSEDPPKRIYFNKQENCILLRLDNELGGIIAEVNLAEHSTVITFLDYHDGAATFLLINHTKNELVQYNQSSLSEIEDSLPPGKAVFYTWADPVGSRRLKWRCRKSHGEVTQKDDMMMPIDLGEKTIYLVSFFEGLQRIILFTEDPRVFKVTYESEKAELAEQEIAVALQDVGISLVNNYTKQEVAYIGITSSDVVWETKPKKKARWKPMSVKHTEKLEREFKEYTESSPSEDKVIQLDTNVPVRLTPTGHNMKILQPHVIALRRNYLPALKVEYNTSAHQSSFRIQIYRIQIQNQIHGAVFPFVFYPVKPPKSVTMDSAPKPFTDVSIVMRSAGHSQISRIKYFKVLIQEMDLRLDLGFIYALTDLMTEAEVTENTEVELFHKDIEAFKEEYKTASLVDQSQVSLYEYFHISPIKLHLSVSLSSGREEAKDSKQNGGLIPVHSLNLLLKSIGATLTDVQDVVFKLAFFELNYQFHTTSDLQSEVIRHYSKQAIKQMYVLILGLDVLGNPFGLIREFSEGVEAFFYEPYQGAIQGPEEFVEGMALGLKALVGGAVGGLAGAASKITGAMAKGVAAMTMDEDYQQKRREAMNKQPAGFREGITRGGKGLVSGFVSGITGIVTKPIKGAQKGGAAGFFKGVGKGLVGAVARPTGGIIDMASSTFQGIKRATETSEVESLRPPRFFNEDGVIRPYRLRDGTGNQMLQKIQFYREWIMTHSSSSDDDDDDDDDDESDLNH,mutated_sequence,1.0,3095.0,UPI000015FEE1.a2m,UPI000015FEE1.npy,gnomAD
+UPI000012CBC3,UPI000012CBC3.csv,MKALIAALLLITLQYSCAVSPTDCSAVEPEAEKALDLINKRRRDGYLFQLLRIADAHLDRVENTTVYYLVLDVQESDCSVLSRKYWNDCEPPDSRRPSEIVIGQCKVIATRHSHESQDLRVIDFNCTTSSVSSALANTKDSPVLIDFFEDTERYRKQANKALEKYKEENDDFASFRVDRIERVARVRGGEGTGYFVDFSVRNCPRHHFPRHPNVFGFCRADLFYDVEALDLESPKNLVINCEVFDPQEHENINGVPPHLGHPFHWGGHERSSTTKPPFKPHGSRDHHHPHKPHEHGPPPPPDERDHSHGPPLPQGPPPLLPMSCSSCQHATFGTNGAQRHSHNNNSSDLHPHKHHSHEQHPHGHHPHAHHPHEHDTHRQHPHGHHPHGHHPHGHHPHGHHPHGHHPHCHDFQDYGPCDPPPHNQGHCCHGHGPPPGHLRRRGPGKGPRPFHCRQIGSVYRLPPLRKGEVLPLPEANFPSFPLPHHKHPLKPDNQPFPQSVSESCPGKFKSGFPQVSMFFTHTFPK,mutated_sequence,1.0,525.0,UPI000012CBC3.a2m,UPI000012CBC3.npy,gnomAD
+UPI000019080B,UPI000019080B.csv,MQSDDVIWDTLGNKQFCSFKIRTKTQSFCRNEYSLTGLCNRSSCPLANSQYATIKEEKGQCYLYMKVIERAAFPRRLWERVRLSKNYEKALEQIDENLIYWPRFIRHKCKQRFTKITQYLIRIRKLTLKRQRKLVPLSKKVERREKRREEKALIAAQLDNAIEKELLERLKQDTYGDIYNFPIHAFDKALEQQEAESDSSDTEEKDDDDDDEEDVGKREFVEDGEVDESDISDFEDMDKLDASSDEDQDGKSSSEEEEEKALSAKHKGKMPLRGPLQRKRAYVEIEYEQETEPVAKAKTT,mutated_sequence,1.0,300.0,UPI000019080B.a2m,UPI000019080B.npy,gnomAD
diff --git a/reference_files/clinical_substitutions.csv b/reference_files/clinical_substitutions.csv
new file mode 100644
index 0000000..480d123
--- /dev/null
+++ b/reference_files/clinical_substitutions.csv
@@ -0,0 +1,2526 @@
+DMS_id,target_seq,file_length,DMS_filename,EVE_model_path,MSA_filename,alignment_source,weight_file_name,MSA_start,MSA_end,MSA_len
+NP_000007.1,MAAGFGRCCRVLRSISRFHWRSQHTKANRQREPGLGFSFEFTEQQKEFQATARKFAREEIIPVAAEYDKTGEYPVPLIRRAWELGLMNTHIPENCGGLGLGTFDACLISEELAYGCTGVQTAIEGNSLGQMPIIIAGNDQQKKKYLGRMTEEPLMCAYCVTEPGAGSDVAGIKTKAEKKGDEYIINGQKMWITNGGKANWYFLLARSDPDPKAPANKAFTGFIVEADTPGIQIGRKELNMGQRCSDTRGIVFEDVKVPKENVLIGDGAGFKVAMGAFDKTRPVVAAGAVGLAQRALDEATKYALERKTFGKLLVEHQAISFMLAEMAMKVELARMSYQRAAWEVDSGRRNTYYASIAKAFAGDIANQLATDAVQILGGNGFNTEYPVEKLMRDAKIYQIYEGTSQIQRLIVAREHIDKYKN,421,NP_000007.1.csv,refseq-ACADM-NM_000016.5_clinical_seed_0_final,refseq-ACADM-NM_000016.5.a2m,Invitae,refseq-ACADM-NM_000016.5.npy,1,421,421
+NP_000008.1,MAAALLARASGPARRALCPRAWRQLHTIYQSVELPETHQMLLQTCRDFAEKELFPIAAQVDKEHLFPAAQVKKMGGLGLLAMDVPEELGGAGLDYLAYAIAMEEISRGCASTGVIMSVNNSLYLGPILKFGSKEQKQAWVTPFTSGDKIGCFALSEPGNGSDAGAASTTARAEGDSWVLNGTKAWITNAWEASAAVVFASTDRALQNKGISAFLVPMPTPGLTLGKKEDKLGIRGSSTANLIFEDCRIPKDSILGEPGMGFKIAMQTLDMGRIGIASQALGIAQTALDCAVNYAENRMAFGAPLTKLQVIQFKLADMALALESARLLTWRAAMLKDNKKPFIKEAAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERHYRDARITEIYEGTSEIQRLVIAGHLLRSYRS,412,NP_000008.1.csv,refseq-ACADS-NM_000017.3_clinical_seed_0_final,refseq-ACADS-NM_000017.3.a2m,Invitae,refseq-ACADS-NM_000017.3.npy,1,412,412
+NP_000009.1,MQAARMAASLGRQLLRLGGGSSRLTALLGQPRPGPARRPYAGGAAQLALDKSDSHPSDALTRKKPAKAESKSFAVGMFKGQLTTDQVFPYPSVLNEEQTQFLKELVEPVSRFFEEVNDPAKNDALEMVEETTWQGLKELGAFGLQVPSELGGVGLCNTQYARLVEIVGMHDLGVGITLGAHQSIGFKGILLFGTKAQKEKYLPKLASGETVAAFCLTEPSSGSDAASIRTSAVPSPCGKYYTLNGSKLWISNGGLADIFTVFAKTPVTDPATGAVKEKITAFVVERGFGGITHGPPEKKMGIKASNTAEVFFDGVRVPSENVLGEVGSGFKVAMHILNNGRFGMAAALAGTMRGIIAKAVDHATNRTQFGEKIHNFGLIQEKLARMVMLQYVTESMAYMVSANMDQGATDFQIEAAISKIFGSEAAWKVTDECIQIMGGMGFMKEPGVERVLRDLRIFRIFEGTNDILRLFVALQGCMDKGKELSGLGSALKNPFGNAGLLLGEAGKQLRRRAGLGSGLSLSGLVHPELSRSGELAVRALEQFATVVEAKLIKHKKGIVNEQFLLQRLADGAIDLYAMVVVLSRASRSLSEGHPTAQHEKMLCDTWCIEAAARIREGMAALQSDPWQQELYRNFKSISKALVERGGVVTSNPLGF,655,NP_000009.1.csv,refseq-ACADVL-NM_000018.3_clinical_seed_0_final,refseq-ACADVL-NM_000018.3.a2m,Invitae,refseq-ACADVL-NM_000018.3.npy,1,655,655
+NP_000010.1,MAVLAALLRSGARSRSPLLRRLVQEIRYVERSYVSKPTLKEVVIVSATRTPIGSFLGSLSLLPATKLGSIAIQGAIEKAGIPKEEVKEAYMGNVLQGGEGQAPTRQAVLGAGLPISTPCTTINKVCASGMKAIMMASQSLMCGHQDVMVAGGMESMSNVPYVMNRGSTPYGGVKLEDLIVKDGLTDVYNKIHMGSCAENTAKKLNIARNEQDAYAINSYTRSKAAWEAGKFGNEVIPVTVTVKGQPDVVVKEDEEYKRVDFSKVPKLKTVFQKENGTVTAANASTLNDGAAALVLMTADAAKRLNVTPLARIVAFADAAVEPIDFPIAPVYAASMVLKDVGLKKEDIAMWEVNEAFSLVVLANIKMLEIDPQKVNINGGAVSLGHPIGMSGARIVGHLTHALKQGEYGLASICNGGGGASAMLIQKL,427,NP_000010.1.csv,refseq-ACAT1-NM_000019.3_clinical_seed_0_final,refseq-ACAT1-NM_000019.3.a2m,Invitae,refseq-ACAT1-NM_000019.3.npy,1,427,427
+NP_000011.2,MTLGSPRKGLLMLLMALVTQGDPVKPSRGPLVTCTCESPHCKGPTCRGAWCTVVLVREEGRHPQEHRGCGNLHRELCRGRPTEFVNHYCCDSHLCNHNVSLVLEATQPPSEQPGTDGQLALILGPVLALLALVALGVLGLWHVRRRQEKQRGLHSELGESSLILKASEQGDSMLGDLLDSDCTTGSGSGLPFLVQRTVARQVALVECVGKGRYGEVWRGLWHGESVAVKIFSSRDEQSWFRETEIYNTVLLRHDNILGFIASDMTSRNSSTQLWLITHYHEHGSLYDFLQRQTLEPHLALRLAVSAACGLAHLHVEIFGTQGKPAIAHRDFKSRNVLVKSNLQCCIADLGLAVMHSQGSDYLDIGNNPRVGTKRYMAPEVLDEQIRTDCFESYKWTDIWAFGLVLWEIARRTIVNGIVEDYRPPFYDVVPNDPSFEDMKKVVCVDQQTPTIPNRLAADPVLSGLAQMMRECWYPNPSARLTALRIKKTLQKISNSPEKPKVIQ,503,NP_000011.2.csv,refseq-ACVRL1-NM_000020.2_clinical_seed_0_final,refseq-ACVRL1-NM_000020.2.a2m,Invitae,refseq-ACVRL1-NM_000020.2.npy,1,503,503
+NP_000012.1,MTELPAPLSYFQNAQMSEDNHLSNTVRSQNDNRERQEHNDRRSLGHPEPLSNGRPQGNSRQVVEQDEEEDEELTLKYGAKHVIMLFVPVTLCMVVVVATIKSVSFYTRKDGQLIYTPFTEDTETVGQRALHSILNAAIMISVIVVMTILLVVLYKYRCYKVIHAWLIISSLLLLFFFSFIYLGEVFKTYNVAVDYITVALLIWNFGVVGMISIHWKGPLRLQQAYLIMISALMALVFIKYLPEWTAWLILAVISVYDLVAVLCPKGPLRMLVETAQERNETLFPALIYSSTMVWLVNMAEGDPEAQRRVSKNSKYNAESTERESQDTVAENDDGGFSEEWEAQRDSHLGPHRSTPESRAAVQELSSSILAGEDPEERGVKLGLGDFIFYSVLVGKASATASGDWNTTIACFVAILIGLCLTLLLLAIFKKALPALPISITFGLVFYFATDYLVQPFMDQLAFHQFYI,467,NP_000012.1.csv,refseq-PSEN1-NM_000021.3_clinical_seed_0_final,refseq-PSEN1-NM_000021.3.a2m,Invitae,refseq-PSEN1-NM_000021.3.npy,1,467,467
+NP_000013.2,MAQTPAFDKPKVELHVHLDGSIKPETILYYGRRRGIALPANTAEGLLNVIGMDKPLTLPDFLAKFDYYMPAIAGCREAIKRIAYEFVEMKAKEGVVYVEVRYSPHLLANSKVEPIPWNQAEGDLTPDEVVALVGQGLQEGERDFGVKARSILCCMRHQPNWSPKVVELCKKYQQQTVVAIDLAGDETIPGSSLLPGHVQAYQEAVKSGIHRTVHAGEVGSAEVVKEAVDILKTERLGHGYHTLEDQALYNRLRQENMHFEICPWSSYLTGAWKPDTEHAVIRLKNDQANYSLNTDDPLIFKSTLDTDYQMTKRDMGFTEEEFKRLNINAAKSSFLPEDEKRELLDLLYKAYGMPPSASAGQNL,363,NP_000013.2.csv,refseq-ADA-NM_000022.4_clinical_seed_0_final,refseq-ADA-NM_000022.4.a2m,Invitae,refseq-ADA-NM_000022.4.npy,1,363,363
+NP_000014.1,MAETLFWTPLLVVLLAGLGDTEAQQTTLHPLVGRVFVHTLDHETFLSLPEHVAVPPAVHITYHAHLQGHPDLPRWLRYTQRSPHHPGFLYGSATPEDRGLQVIEVTAYNRDSFDTTRQRLVLEIGDPEGPLLPYQAEFLVRSHDAEEVLPSTPASRFLSALGGLWEPGELQLLNVTSALDRGGRVPLPIEGRKEGVYIKVGSASPFSTCLKMVASPDSHARCAQGQPPLLSCYDTLAPHFRVDWCNVTLVDKSVPEPADEVPTPGDGILEHDPFFCPPTEAPDRDFLVDALVTLLVPLLVALLLTLLLAYVMCCRREGRLKRDLATSDIQMVHHCTIHGNTEELRQMAASREVPRPLSTLPMFNVHTGERLPPRVDSAQVPLILDQH,387,NP_000014.1.csv,SGCA_HUMAN_b03_clinical_seed_0_final,SGCA_HUMAN_b03.a2m,EVE,SGCA_HUMAN_b03_theta_0.2.npy,1,387,387
+NP_000017.1,MAAGGDHGSPDSYRSPLASRYASPEMCFVFSDRYKFRTWRQLWLWLAEAEQTLGLPITDEQIQEMKSNLENIDFKMAAEEEKRLRHDVMAHVHTFGHCCPKAAGIIHLGATSCYVGDNTDLIILRNALDLLLPKLARVISRLADFAKERASLPTLGFTHFQPAQLTTVGKRCCLWIQDLCMDLQNLKRVRDDLRFRGVKGTTGTQASFLQLFEGDDHKVEQLDKMVTEKAGFKRAFIITGQTYTRKVDIEVLSVLASLGASVHKICTDIRLLANLKEMEEPFEKQQIGSSAMPYKRNPMRSERCCSLARHLMTLVMDPLQTASVQWFERTLDDSANRRICLAEAFLTADTILNTLQNISEGLVVYPKVIERRIRQELPFMATENIIMAMVKAGGSRQDCHEKIRVLSQQAASVVKQEGGDNDLIERIQVDAYFSPIHSQLDHLLDPSSFTGRASQQVQRFLEEEVYPLLKPYESVMKVKAELCL,484,NP_000017.1.csv,refseq-ADSL-NM_000026.2_clinical_seed_0_final,refseq-ADSL-NM_000026.2.a2m,Invitae,refseq-ADSL-NM_000026.2.npy,1,484,484
+NP_000018.2,MARKSNLPVLLVPFLLCQALVRCSSPLPLVVNTWPFKNATEAAWRALASGGSALDAVESGCAMCEREQCDGSVGFGGSPDELGETTLDAMIMDGTTMDVGAVGDLRRIKNAIGVARKVLEHTTHTLLVGESATTFAQSMGFINEDLSTTASQALHSDWLARNCQPNYWRNVIPDPSKYCGPYKPPGILKQDIPIHKETEDDRGHDTIGMVVIHKTGHIAAGTSTNGIKFKIHGRVGDSPIPGAGAYADDTAGAAAATGNGDILMRFLPSYQAVEYMRRGEDPTIACQKVISRIQKHFPEFFGAVICANVTGSYGAACNKLSTFTQFSFMVYNSEKNQPTEEKVDCI,346,NP_000018.2.csv,refseq-AGA-NM_000027.3_clinical_seed_0_final,refseq-AGA-NM_000027.3.a2m,Invitae,refseq-AGA-NM_000027.3.npy,1,346,346
+NP_000021.1,MASHKLLVTPPKALLKPLSIPNQLLLGPGPSNLPPRIMAAGGLQMIGSMSKDMYQIMDEIKEGIQYVFQTRNPLTLVISGSGHCALEAALVNVLEPGDSFLVGANGIWGQRAVDIGERIGARVHPMTKDPGGHYTLQEVEEGLAQHKPVLLFLTHGESSTGVLQPLDGFGELCHRYKCLLLVDSVASLGGTPLYMDRQGIDILYSGSQKALNAPPGTSLISFSDKAKKKMYSRKTKPFSFYLDIKWLANFWGCDDQPRMYHHTIPVISLYSLRESLALIAEQGLENSWRQHREAAAYLHGRLQALGLQLFVKDPALRLPTVTTVAVPAGYDWRDIVSYVIDHFDIEIMGGLGPSTGKVLRIGLLGCNATRENVDRVTEALRAALQHCPKKKL,392,NP_000021.1.csv,refseq-AGXT-NM_000030.2_clinical_seed_0_final,refseq-AGXT-NM_000030.2.a2m,Invitae,refseq-AGXT-NM_000030.2.npy,1,392,392
+NP_000022.3,MQPQSVLHSGYFHPLLRAWQTATTTLNASNLIYPIFVTDVPDDIQPITSLPGVARYGVKRLEEMLRPLVEEGLRCVLIFGVPSRVPKDERGSAADSEESPAIEAIHLLRKTFPNLLVACDVCLCPYTSHGHCGLLSENGAFRAEESRQRLAEVALAYAKAGCQVVAPSDMMDGRVEAIKEALMAHGLGNRVSVMSYSAKFASCFYGPFRDAAKSSPAFGDRRCYQLPPGARGLALRAVDRDVREGADMLMVKPGMPYLDIVREVKDKHPDLPLAVYHVSGEFAMLWHGAQAGAFDLKAAVLEAMTAFRRAGADIIITYYTPQLLQWLKEE,330,NP_000022.3.csv,refseq-ALAD-NM_000031.5_clinical_seed_0_final,refseq-ALAD-NM_000031.5.a2m,Invitae,refseq-ALAD-NM_000031.5.npy,1,330,330
+NP_000023.2,MVTAAMLLQCCPVLARGPTSLLGKVVKTHQFLFGIGRCPILATQGPNCSQIHLKATKAGGDSPSWAKGHCPFMLSELQDGKSKIVQKAAPEVQEDVKAFKTDLPSSLVSVSLRKPFSGPQEQEQISGKVTHLIQNNMPGNYVFSYDQFFRDKIMEKKQDHTYRVFKTVNRWADAYPFAQHFSEASVASKDVSVWCSNDYLGMSRHPQVLQATQETLQRHGAGAGGTRNISGTSKFHVELEQELAELHQKDSALLFSSCFVANDSTLFTLAKILPGCEIYSDAGNHASMIQGIRNSGAAKFVFRHNDPDHLKKLLEKSNPKIPKIVAFETVHSMDGAICPLEELCDVSHQYGALTFVDEVHAVGLYGSRGAGIGERDGIMHKIDIISGTLGKAFGCVGGYIASTRDLVDMVRSYAAGFIFTTSLPPMVLSGALESVRLLKGEEGQALRRAHQRNVKHMRQLLMDRGLPVIPCPSHIIPIRVGNAALNSKLCDLLLSKHGIYVQAINYPTVPRGEELLRLAPSPHHSPQMMEDFVEKLLLAWTAVGLPLQDVSVAACNFCRRPVHFELMSEWERSYFGNMGPQYVTTYA,587,NP_000023.2.csv,refseq-ALAS2-NM_000032.4_clinical_seed_0_final,refseq-ALAS2-NM_000032.4.a2m,Invitae,refseq-ALAS2-NM_000032.4.npy,1,587,587
+NP_000024.2,MPVLSRPRPWRGNTLKRTAVLLALAAYGAHKVYPLVRQCLAPARGLQAPAGEPTQEASGVAAAKAGMNRVFLQRLLWLLRLLFPRVLCRETGLLALHSAALVSRTFLSVYVARLDGRLARCIVRKDPRAFGWQLLQWLLIALPATFVNSAIRYLEGQLALSFRSRLVAHAYRLYFSQQTYYRVSNMDGRLRNPDQSLTEDVVAFAASVAHLYSNLTKPLLDVAVTSYTLLRAARSRGAGTAWPSAIAGLVVFLTANVLRAFSPKFGELVAEEARRKGELRYMHSRVVANSEEIAFYGGHEVELALLQRSYQDLASQINLILLERLWYVMLEQFLMKYVWSASGLLMVAVPIITATGYSESDAEAVKKAALEKKEEELVSERTEAFTIARNLLTAAADAIERIMSSYKEVTELAGYTARVHEMFQVFEDVQRCHFKRPRELEDAQAGSGTIGRSGVRVEGPLKIRGQVVDVEQGIICENIPIVTPSGEVVVASLNIRVEEGMHLLITGPNGCGKSSLFRILGGLWPTYGGVLYKPPPQRMFYIPQRPYMSVGSLRDQVIYPDSVEDMQRKGYSEQDLEAILDVVHLHHILQREGGWEAMCDWKDVLSGGEKQRIGMARMFYHRPKYALLDECTSAVSIDVEGKIFQAAKDAGIALLSITHRPSLWKYHTHLLQFDGEGGWKFEKLDSAARLSLTEEKQRLEQQLAGIPKMQRRLQELCQILGEAVAPAHVPAPSPQGPGGLQGAST,745,NP_000024.2.csv,refseq-ABCD1-NM_000033.3_clinical_seed_0_final,refseq-ABCD1-NM_000033.3.a2m,Invitae,refseq-ABCD1-NM_000033.3.npy,1,745,745
+NP_000026.2,MAHRFPALTQEQKKELSEIAQSIVANGKGILAADESVGTMGNRLQRIKVENTEENRRQFREILFSVDSSINQSIGGVILFHETLYQKDSQGKLFRNILKEKGIVVGIKLDQGGAPLAGTNKETTIQGLDGLSERCAQYKKDGVDFGKWRAVLRIADQCPSSLAIQENANALARYASICQQNGLVPIVEPEVIPDGDHDLEHCQYVTEKVLAAVYKALNDHHVYLEGTLLKPNMVTAGHACTKKYTPEQVAMATVTALHRTVPAAVPGICFLSGGMSEEDATLNLNAINLCPLPKPWKLSFSYGRALQASALAAWGGKAANKEATQEAFMKRAMANCQAAKGQYVHTGSSGAASTQSLFTACYTY,364,NP_000026.2.csv,refseq-ALDOB-NM_000035.3_clinical_seed_0_final,refseq-ALDOB-NM_000035.3.a2m,Invitae,refseq-ALDOB-NM_000035.3.npy,1,364,364
+NP_000028.3,MPYSVGFREADAATSFLRAARSGNLDKALDHLRNGVDINTCNQNGLNGLHLASKEGHVKMVVELLHKEIILETTTKKGNTALHIAALAGQDEVVRELVNYGANVNAQSQKGFTPLYMAAQENHLEVVKFLLENGANQNVATEDGFTPLAVALQQGHENVVAHLINYGTKGKVRLPALHIAARNDDTRTAAVLLQNDPNPDVLSKTGFTPLHIAAHYENLNVAQLLLNRGASVNFTPQNGITPLHIASRRGNVIMVRLLLDRGAQIETKTKDELTPLHCAARNGHVRISEILLDHGAPIQAKTKNGLSPIHMAAQGDHLDCVRLLLQYDAEIDDITLDHLTPLHVAAHCGHHRVAKVLLDKGAKPNSRALNGFTPLHIACKKNHVRVMELLLKTGASIDAVTESGLTPLHVASFMGHLPIVKNLLQRGASPNVSNVKVETPLHMAARAGHTEVAKYLLQNKAKVNAKAKDDQTPLHCAARIGHTNMVKLLLENNANPNLATTAGHTPLHIAAREGHVETVLALLEKEASQACMTKKGFTPLHVAAKYGKVRVAELLLERDAHPNAAGKNGLTPLHVAVHHNNLDIVKLLLPRGGSPHSPAWNGYTPLHIAAKQNQVEVARSLLQYGGSANAESVQGVTPLHLAAQEGHAEMVALLLSKQANGNLGNKSGLTPLHLVAQEGHVPVADVLIKHGVMVDATTRMGYTPLHVASHYGNIKLVKFLLQHQADVNAKTKLGYSPLHQAAQQGHTDIVTLLLKNGASPNEVSSDGTTPLAIAKRLGYISVTDVLKVVTDETSFVLVSDKHRMSFPETVDEILDVSEDEGEELISFKAERRDSRDVDEEKELLDFVPKLDQVVESPAIPRIPCAMPETVVIRSEEQEQASKEYDEDSLIPSSPATETSDNISPVASPVHTGFLVSFMVDARGGSMRGSRHNGLRVVIPPRTCAAPTRITCRLVKPQKLSTPPPLAEEEGLASRIIALGPTGAQFLSPVIVEIPHFASHGRGDRELVVLRSENGSVWKEHRSRYGESYLDQILNGMDEELGSLEELEKKRVCRIITTDFPLYFVIMSRLCQDYDTIGPEGGSLKSKLVPLVQATFPENAVTKRVKLALQAQPVPDELVTKLLGNQATFSPIVTVEPRRRKFHRPIGLRIPLPPSWTDNPRDSGEGDTTSLRLLCSVIGGTDQAQWEDITGTTKLVYANECANFTTNVSARFWLSDCPRTAEAVNFATLLYKELTAVPYMAKFVIFAKMNDPREGRLRCYCMTDDKVDKTLEQHENFVEVARSRDIEVLEGMSLFAELSGNLVPVKKAAQQRSFHFQSFRENRLAMPVKVRDSSREPGGSLSFLRKAMKYEDTQHILCHLNITMPPCAKGSGAEDRRRTPTPLALRYSILSESTPGSLSGTEQAEMKMAVISEHLGLSWAELARELQFSVEDINRIRVENPNSLLEQSVALLNLWVIREGQNANMENLYTALQSIDRGEIVNMLEGSGRQSRNLKPDRRHTDRDYSLSPSQMNGYSSLQDELLSPASLGCALSSPLRADQYWNEVAVLDAIPLAATEHDTMLEMSDMQVWSAGLTPSLVTAEDSSLECSKAEDSDATGHEWKLEGALSEEPRGPELGSLELVEDDTVDSDATNGLIDLLEQEEGQRSEEKLPGSKRQDDATGAGQDSENEVSLVSGHQRGQARITHSPTVSQVTERSQDRLQDWDADGSIVSYLQDAAQGSWQEEVTQGPHSFQGTSTMTEGLEPGGSQEYEKVLVSVSEHTWTEQPEAESSQADRDRRQQGQEEQVQEAKNTFTQVVQGNEFQNIPGEQVTEEQFTDEQGNIVTKKIIRKVVRQIDLSSADAAQEHEEVELRGSGLQPDLIEGRKGAQIVKRASLKRGKQ,1880,NP_000028.3.csv,refseq-ANK1-NM_000037.3_clinical_seed_0_final,refseq-ANK1-NM_000037.3.a2m,Invitae,refseq-ANK1-NM_000037.3.npy,1,1880,1880
+NP_000030.1,MKAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVDVLKDSGRDYVSQFEGSALGKQLNLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKETEGLRQEMSKDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHELQEKLSPLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ,267,NP_000030.1.csv,refseq-APOA1-NM_000039.2_clinical_seed_0_final,refseq-APOA1-NM_000039.2.a2m,Invitae,refseq-APOA1-NM_000039.2.npy,1,267,267
+NP_000032.1,MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRWELALGRFWDYLRWVQTLSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTPVAEETRARLSKELQAAQARLGADMEDVCGRLVQYRGEVQAMLGQSTEELRVRLASHLRKLRKRLLRDADDLQKRLAVYQAGAREGAERGLSAIRERLGPLVEQGRVRAATVGSLAGQPLQERAQAWGERLRARMEEMGSRTRDRLDEVKEQVAEVRAKLEEQAQQIRLQAEAFQARLKSWFEPLVEDMQRQWAGLVEKVQAAVGTSAAPVPSDNH,317,NP_000032.1.csv,refseq-APOE-NM_000041.3_clinical_seed_0_final,refseq-APOE-NM_000041.3.a2m,Invitae,refseq-APOE-NM_000041.3.npy,1,317,317
+NP_000034.1,MLGIWTLLPLVLTSVARLSSKSVNAQVTDINSKGLELRKTVTTVETQNLEGLHHDGQFCHKPCPPGERKARDCTVNGDEPDCVPCQEGKEYTDKAHFSSKCRRCRLCDEGHGLEVEINCTRTQNTKCRCKPNFFCNSTVCEHCDPCTKCEHGIIKECTLTSNTKCKEEGSRSNLGWLCLLLLPIPLIVWVKRKEVQKTCRKHRKENQGSHESPTLNPETVAINLSDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQLLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSLV,335,NP_000034.1.csv,refseq-FAS-NM_000043.5_clinical_seed_0_final,refseq-FAS-NM_000043.5.a2m,Invitae,refseq-FAS-NM_000043.5.npy,1,335,335
+NP_000035.2,MEVQLGLGRVYPRPPSKTYRGAFQNLFQSVREVIQNPGPRHPEAASAAPPGASLLLLQQQQQQQQQQQQQQQQQQQQQQQETSPRQQQQQQGEDGSPQAHRRGPTGYLVLDEEQQPSQPQSALECHPERGCVPEPGAAVAASKGLPQQLPAPPDEDDSAAPSTLSLLGPTFPGLSSCSADLKDILSEASTMQLLQQQQQEAVSEGSSSGRAREASGAPTSSKDNYLGGTSTISDNAKELCKAVSVSMGLGVEALEHLSPGEQLRGDCMYAPLLGVPPAVRPTPCAPLAECKGSLLDDSAGKSTEDTAEYSPFKGGYTKGLEGESLGCSGSAAAGSSGTLELPSTLSLYKSGALDEAAAYQSRDYYNFPLALAGPPPPPPPPHPHARIKLENPLDYGSAWAAAAAQCRYGDLASLHGAGAAGPGSGSPSAAASSSWHTLFTAEEGQLYGPCGGGGGGGGGGGGGGGGGGGGGGGEAGAVAPYGYTRPPQGLAGQESDFTAPDVWYPGGMVSRVPYPSPTCVKSEMGPWMDSYSGPYGDMRLETARDHVLPIDYYFPPQKTCLICGDEASGCHYGALTCGSCKVFFKRAAEGKQKYLCASRNDCTIDKFRRKNCPSCRLRKCYEAGMTLGARKLKKLGNLKLQEEGEASSTTSPTEETTQKLTVSHIEGYECQPIFLNVLEAIEPGVVCAGHDNNQPDSFAALLSSLNELGERQLVHVVKWAKALPGFRNLHVDDQMAVIQYSWMGLMVFAMGWRSFTNVNSRMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRFYQLTKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSGKVKPIYFHTQ,920,NP_000035.2.csv,refseq-AR-NM_000044.3_clinical_seed_0_final,refseq-AR-NM_000044.3.a2m,Invitae,refseq-AR-NM_000044.3.npy,1,920,920
+NP_000036.2,MSAKSRTIGIIGAPFSKGQPRGGVEEGPTVLRKAGLLEKLKEQECDVKDYGDLPFADIPNDSPFQIVKNPRSVGKASEQLAGKVAEVKKNGRISLVLGGDHSLAIGSISGHARVHPDLGVIWVDAHTDINTPLTTTSGNLHGQPVSFLLKELKGKIPDVPGFSWVTPCISAKDIVYIGLRDVDPGEHYILKTLGIKYFSMTEVDRLGIGKVMEETLSYLLGRKKRPIHLSFDVDGLDPSFTPATGTPVVGGLTYREGLYITEEIYKTGLLSGLDIMEVNPSLGKTPEEVTRTVNTAVAITLACFGLAREGNHKPIDYLNPPK,322,NP_000036.2.csv,refseq-ARG1-NM_000045.3_clinical_seed_0_final,refseq-ARG1-NM_000045.3.a2m,Invitae,refseq-ARG1-NM_000045.3.npy,1,322,322
+NP_000037.2,MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRPPHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNVSEIPSSDPPTKTLWLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCDPKATGVWGPWM,533,NP_000037.2.csv,refseq-ARSB-NM_000046.3_clinical_seed_0_final,refseq-ARSB-NM_000046.3.a2m,Invitae,refseq-ARSB-NM_000046.3.npy,1,533,533
+NP_000038.2,MLHLHHSCLCFRSWLPAMLAVLLSLAPSASSDISASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFDHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVLALVALTLVAGKLTHLIPVSWMPVIWSALSAVLLLASSYFVGALIVHADCFLMRNHTITEQPMCFQRTTPLILQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGWEGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRVIDGQDLLPLLLGTAQHSDHEFLMHYCERFLHAARWHQRDRGTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKVVHHDPPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPVPLQLDRLGNIWRPWLQPCCGPFPLCWCLREDDPQ,589,NP_000038.2.csv,refseq-ARSE-NM_000047.2_clinical_seed_0_final,refseq-ARSE-NM_000047.2.a2m,Invitae,refseq-ARSE-NM_000047.2.npy,1,589,589
+NP_000039.2,MASESGKLWGGRFVGAVDPIMEKFNASIAYDRHLWEVDVQGSKAYSRGLEKAGLLTKAEMDQILHGLDKVAEEWAQGTFKLNSNDEDIHTANERRLKELIGATAGKLHTGRSRNDQVVTDLRLWMRQTCSTLSGLLWELIRTMVDRAEAERDVLFPGYTHLQRAQPIRWSHWILSHAVALTRDSERLLEVRKRINVLPLGSGAIAGNPLGVDRELLRAELNFGAITLNSMDATSERDFVAEFLFWASLCMTHLSRMAEDLILYCTKEFSFVQLSDAYSTGSSLMPQKKNPDSLELIRSKAGRVFGRCAGLLMTLKGLPSTYNKDLQEDKEAVFEVSDTMSAVLQVATGVISTLQIHQENMGQALSPDMLATDLAYYLVRKGMPFRQAHEASGKAVFMAETKGVALNQLSLQELQTISPLFSGDVICVWDYGHSVEQYGALGGTARSSVDWQIRQVRALLQAQQA,464,NP_000039.2.csv,refseq-ASL-NM_000048.3_clinical_seed_0_final,refseq-ASL-NM_000048.3.a2m,Invitae,refseq-ASL-NM_000048.3.npy,1,464,464
+NP_000040.1,MTSCHIAEEHIQKVAIFGGTHGNELTGVFLVKHWLENGAEIQRTGLEVKPFITNPRAVKKCTRYIDCDLNRIFDLENLGKKMSEDLPYEVRRAQEINHLFGPKDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPLPCYVYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIKHALDFIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQDWKPLHPGDPMFLTLDGKTIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTKLTLNAKSIRCCLH,313,NP_000040.1.csv,refseq-ASPA-NM_000049.4_clinical_seed_0_final,refseq-ASPA-NM_000049.4.a2m,Invitae,refseq-ASPA-NM_000049.4.npy,1,313,313
+NP_000041.2,MSSKGSVVLAYSGGLDTSCILVWLKEQGYDVIAYLANIGQKEDFEEARKKALKLGAKKVFIEDVSREFVEEFIWPAIQSSALYEDRYLLGTSLARPCIARKQVEIAQREGAKYVSHGATGKGNDQVRFELSCYSLAPQIKVIAPWRMPEFYNRFKGRNDLMEYAKQHGIPIPVTPKNPWSMDENLMHISYEAGILENPKNQAPPGLYTKTQDPAKAPNTPDILEIEFKKGVPVKVTNVKDGTTHQTSLELFMYLNEVAGKHGVGRIDIVENRFIGMKSRGIYETPAGTILYHAHLDIEAFTMDREVRKIKQGLGLKFAELVYTGFWHSPECEFVRHCIAKSQERVEGKVQVSVLKGQVYILGRESPLSLYNEELVSMNVQGDYEPTDATGFININSLRLKEYHRLQSKVTAK,412,NP_000041.2.csv,refseq-ASS1-NM_000050.4_clinical_seed_0_final,refseq-ASS1-NM_000050.4.a2m,Invitae,refseq-ASS1-NM_000050.4.npy,1,412,412
+NP_000043.4,MDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVHHIKVSLEEKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLTLPWDHIQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKSGACEDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQPHLISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTNDSTATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKAIEAVSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCVQSIEGVISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSDTNEPLVVIAQPSSEMPLLTSTNEFYTKGMTPVQDKEEGKNSSKCYIQVTGMTCASCVANIERNLRREEGIYSILVALMAGKAEVRYNPAVIQPPMIAEFIRELGFGATVIENADEGDGVLELVVRGMTCASCVHKIESSLTKHRGILYCSVALATNKAHIKYDPEIIGPRDIIHTIESLGFEASLVKKDRSASHLDHKREIRQWRRSFLVSLFFCIPVMGLMIYMMVMDHHFATLHHNQNMSKEEMINLHSSMFLERQILPGLSVMNLLSFLLCVPVQFFGGWYFYIQAYKALKHKTANMDVLIVLATTIAFAYSLIILLVAMYERAKVNPITFFDTPPMLFVFIALGRWLEHIAKGKTSEALAKLISLQATEATIVTLDSDNILLSEEQVDVELVQRGDIIKVVPGGKFPVDGRVIEGHSMVDESLITGEAMPVAKKPGSTVIAGSINQNGSLLICATHVGADTTLSQIVKLVEEAQTSKAPIQQFADKLSGYFVPFIVFVSIATLLVWIVIGFLNFEIVETYFPGYNRSISRTETIIRFAFQASITVLCIACPCSLGLATPTAVMVGTGVGAQNGILIKGGEPLEMAHKVKVVVFDKTGTITHGTPVVNQVKVLTESNRISHHKILAIVGTAESNSEHPLGTAITKYCKQELDTETLGTCIDFQVVPGCGISCKVTNIEGLLHKNNWNIEDNNIKNASLVQIDASNEQSSTSSSMIIDAQISNALNAQQYKVLIGNREWMIRNGLVINNDVNDFMTEHERKGRTAVLVAVDDELCGLIAIADTVKPEAELAIHILKSMGLEVVLMTGDNSKTARSIASQVGITKVFAEVLPSHKVAKVKQLQEEGKRVAMVGDGINDSPALAMANVGIAIGTGTDVAIEAADVVLIRNDLLDVVASIDLSRKTVKRIRINFVFALIYNLVGIPIAAGVFMPIGLVLQPWMGSAAMAASSVSVVLSSLFLKLYRKPTYESYELPARSQIGQKSPSEISVHVGIDDTSRNSPKLGLLDRIVNYSRASINSLLSDKRSLNSVVTSEPDKHSLLVGDFREDDDTAL,1500,NP_000043.4.csv,ATP7A_HUMAN_b07_clinical_seed_0_final,ATP7A_HUMAN_b07.a2m,EVE,ATP7A_HUMAN_b07_theta_0.2.npy,1,1500,1500
+NP_000044.2,MPEQERQITAREGASRKILSKLSLPTRAWEPAMKKSFAFDNVGYEGGLDGLGPSSQVATSTVRILGMTCQSCVKSIEDRISNLKGIISMKVSLEQGSATVKYVPSVVCLQQVCHQIGDMGFEASIAEGKAASWPSRSLPAQEAVVKLRVEGMTCQSCVSSIEGKVRKLQGVVRVKVSLSNQEAVITYQPYLIQPEDLRDHVNDMGFEAAIKSKVAPLSLGPIDIERLQSTNPKRPLSSANQNFNNSETLGHQGSHVVTLQLRIDGMHCKSCVLNIEENIGQLLGVQSIQVSLENKTAQVKYDPSCTSPVALQRAIEALPPGNFKVSLPDGAEGSGTDHRSSSSHSPGSPPRNQVQGTCSTTLIAIAGMTCASCVHSIEGMISQLEGVQQISVSLAEGTATVLYNPSVISPEELRAAIEDMGFEASVVSESCSTNPLGNHSAGNSMVQTTDGTPTSVQEVAPHTGRLPANHAPDILAKSPQSTRAVAPQKCFLQIKGMTCASCVSNIERNLQKEAGVLSVLVALMAGKAEIKYDPEVIQPLEIAQFIQDLGFEAAVMEDYAGSDGNIELTITGMTCASCVHNIESKLTRTNGITYASVALATSKALVKFDPEIIGPRDIIKIIEEIGFHASLAQRNPNAHHLDHKMEIKQWKKSFLCSLVFGIPVMALMIYMLIPSNEPHQSMVLDHNIIPGLSILNLIFFILCTFVQLLGGWYFYVQAYKSLRHRSANMDVLIVLATSIAYVYSLVILVVAVAEKAERSPVTFFDTPPMLFVFIALGRWLEHLAKSKTSEALAKLMSLQATEATVVTLGEDNLIIREEQVPMELVQRGDIVKVVPGGKFPVDGKVLEGNTMADESLITGEAMPVTKKPGSTVIAGSINAHGSVLIKATHVGNDTTLAQIVKLVEEAQMSKAPIQQLADRFSGYFVPFIIIMSTLTLVVWIVIGFIDFGVVQRYFPNPNKHISQTEVIIRFAFQTSITVLCIACPCSLGLATPTAVMVGTGVAAQNGILIKGGKPLEMAHKIKTVMFDKTGTITHGVPRVMRVLLLGDVATLPLRKVLAVVGTAEASSEHPLGVAVTKYCKEELGTETLGYCTDFQAVPGCGIGCKVSNVEGILAHSERPLSAPASHLNEAGSLPAEKDAVPQTFSVLIGNREWLRRNGLTISSDVSDAMTDHEMKGQTAILVAIDGVLCGMIAIADAVKQEAALAVHTLQSMGVDVVLITGDNRKTARAIATQVGINKVFAEVLPSHKVAKVQELQNKGKKVAMVGDGVNDSPALAQADMGVAIGTGTDVAIEAADVVLIRNDLLDVVASIHLSKRTVRRIRINLVLALIYNLVGIPIAAGVFMPIGIVLQPWMGSAAMAASSVSVVLSSLQLKCYKKPDLERYEAQAHGHMKPLTASQVSVHIGMDDRWRDSPRATPWDQVSYVSQVSLSSLTSDKPSRHSAAADDDGDKWSLLLNGRDEEQYI,1465,NP_000044.2.csv,refseq-ATP7B-NM_000053.3_clinical_seed_0_final,refseq-ATP7B-NM_000053.3.a2m,Invitae,refseq-ATP7B-NM_000053.3.npy,1,1465,1465
+NP_000045.1,MLMASTTSAVPGHPSLPSLPSNSSQERPLDTRDPLLARAELALLSIVFVAVALSNGLVLAALARRGRRGHWAPIHVFIGHLCLADLAVALFQVLPQLAWKATDRFRGPDALCRAVKYLQMVGMYASSYMILAMTLDRHRAICRPMLAYRHGSGAHWNRPVLVAWAFSLLLSLPQLFIFAQRNVEGGSGVTDCWACFAEPWGRRTYVTWIALMVFVAPTLGIAACQVLIFREIHASLVPGPSERPGGRRRGRRTGSPGEGAHVSAAVAKTVRMTLVIVVVYVLCWAPFFLVQLWAAWDPEAPLEGAPFVLLMLLASLNSCTNPWIYASFSSSVSSELRSLLCCARGRTPPSLGPQDESCTTASSSLAKDTSS,371,NP_000045.1.csv,refseq-AVPR2-NM_000054.4_clinical_seed_0_final,refseq-AVPR2-NM_000054.4.a2m,Invitae,refseq-AVPR2-NM_000054.4.npy,1,371,371
+NP_000046.1,MHSKVTIICIRFLFWFLLLCMLIGKSHTEDDIIIATKNGKVRGMNLTVFGGTVTAFLGIPYAQPPLGRLRFKKPQSLTKWSDIWNATKYANSCCQNIDQSFPGFHGSEMWNPNTDLSEDCLYLNVWIPAPKPKNATVLIWIYGGGFQTGTSSLHVYDGKFLARVERVIVVSMNYRVGALGFLALPGNPEAPGNMGLFDQQLALQWVQKNIAAFGGNPKSVTLFGESAGAASVSLHLLSPGSHSLFTRAILQSGSFNAPWAVTSLYEARNRTLNLAKLTGCSRENETEIIKCLRNKDPQEILLNEAFVVPYGTPLSVNFGPTVDGDFLTDMPDILLELGQFKKTQILVGVNKDEGTAFLVYGAPGFSKDNNSIITRKEFQEGLKIFFPGVSEFGKESILFHYTDWVDDQRPENYREALGDVVGDYNFICPALEFTKKFSEWGNNAFFYYFEHRSSKLPWPEWMGVMHGYEIEFVFGLPLERRDNYTKAEEILSRSIVKRWANFAKYGNPNETQNNSTSWPVFKSTEQKYLTLNTESTRIMTKLRAQQCRFWTSFFPKVLEMTGNIDEAEWEWKAGFHRWNNYMMDWKNQFNDYTSKKESCVGL,602,NP_000046.1.csv,refseq-BCHE-NM_000055.3_clinical_seed_0_final,refseq-BCHE-NM_000055.3.a2m,Invitae,refseq-BCHE-NM_000055.3.npy,1,602,602
+NP_000048.1,MAAVPQNNLQEQLERHSARTLNNKLSLSKPKFSGFTFKKKTSSDNNVSVTNVSVAKTPVLRNKDVNVTEDFSFSEPLPNTTNQQRVKDFFKNAPAGQETQRGGSKSLLPDFLQTPKEVVCTTQNTPTVKKSRDTALKKLEFSSSPDSLSTINDWDDMDDFDTSETSKSFVTPPQSHFVRVSTAQKSKKGKRNFFKAQLYTTNTVKTDLPPPSSESEQIDLTEEQKDDSEWLSSDVICIDDGPIAEVHINEDAQESDSLKTHLEDERDNSEKKKNLEEAELHSTEKVPCIEFDDDDYDTDFVPPSPEEIISASSSSSKCLSTLKDLDTSDRKEDVLSTSKDLLSKPEKMSMQELNPETSTDCDARQISLQQQLIHVMEHICKLIDTIPDDKLKLLDCGNELLQQRNIRRKLLTEVDFNKSDASLLGSLWRYRPDSLDGPMEGDSCPTGNSMKELNFSHLPSNSVSPGDCLLTTTLGKTGFSATRKNLFERPLFNTHLQKSFVSSNWAETPRLGKKNESSYFPGNVLTSTAVKDQNKHTASINDLERETQPSYDIDNFDIDDFDDDDDWEDIMHNLAASKSSTAAYQPIKEGRPIKSVSERLSSAKTDCLPVSSTAQNINFSESIQNYTDKSAQNLASRNLKHERFQSLSFPHTKEMMKIFHKKFGLHNFRTNQLEAINAALLGEDCFILMPTGGGKSLCYQLPACVSPGVTVVISPLRSLIVDQVQKLTSLDIPATYLTGDKTDSEATNIYLQLSKKDPIIKLLYVTPEKICASNRLISTLENLYERKLLARFVIDEAHCVSQWGHDFRQDYKRMNMLRQKFPSVPVMALTATANPRVQKDILTQLKILRPQVFSMSFNRHNLKYYVLPKKPKKVAFDCLEWIRKHHPYDSGIIYCLSRRECDTMADTLQRDGLAALAYHAGLSDSARDEVQQKWINQDGCQVICATIAFGMGIDKPDVRFVIHASLPKSVEGYYQESGRAGRDGEISHCLLFYTYHDVTRLKRLIMMEKDGNHHTRETHFNNLYSMVHYCENITECRRIQLLAYFGENGFNPDFCKKHPDVSCDNCCKTKDYKTRDVTDDVKSIVRFVQEHSSSQGMRNIKHVGPSGRFTMNMLVDIFLGSKSAKIQSGIFGKGSAYSRHNAERLFKKLILDKILDEDLYINANDQAIAYVMLGNKAQTVLNGNLKVDFMETENSSSVKKQKALVAKVSQREEMVKKCLGELTEVCKSLGKVFGVHYFNIFNTVTLKKLAESLSSDPEVLLQIDGVTEDKLEKYGAEVISVLQKYSEWTSPAEDSSPGISLSSSRGPGRSAAEELDEEIPVSSHYFASKTRNERKRKKMPASQRSKRRKTASSGSKAKGGSATCRKISSKTKSSSIIGSSSASHTSQATSGANSKLGIMAPPKPINRPFLKPSYAFS,1417,NP_000048.1.csv,refseq-BLM-NM_000057.3_clinical_seed_0_final,refseq-BLM-NM_000057.3.a2m,Invitae,refseq-BLM-NM_000057.3.npy,1,1417,1417
+NP_000052.1,MAAVILESIFLKRSQQKKKTSPLNFKKRLFLLTVHKLSYYEYDFERGRRGSKKGSIDVEKITCVETVVPEKNPPPERQIPRRGEESSEMEQISIIERFPYPFQVVYDEGPLYVFSPTEELRKRWIHQLKNVIRYNSDLVQKYHPCFWIDGQYLCCSQTAKNAMGCQILENRNGSLKPGSSHRKTKKPLPPTPEEDQILKKPLPPEPAAAPVSTSELKKVVALYDYMPMNANDLQLRKGDEYFILEESNLPWWRARDKNGQEGYIPSNYVTEAEDSIEMYEWYSKHMTRSQAEQLLKQEGKEGGFIVRDSSKAGKYTVSVFAKSTGDPQGVIRHYVVCSTPQSQYYLAEKHLFSTIPELINYHQHNSAGLISRLKYPVSQQNKNAPSTAGLGYGSWEIDPKDLTFLKELGTGQFGVVKYGKWRGQYDVAIKMIKEGSMSEDEFIEEAKVMMNLSHEKLVQLYGVCTKQRPIFIITEYMANGCLLNYLREMRHRFQTQQLLEMCKDVCEAMEYLESKQFLHRDLAARNCLVNDQGVVKVSDFGLSRYVLDDEYTSSVGSKFPVRWSPPEVLMYSKFSSKSDIWAFGVLMWEIYSLGKMPYERFTNSETAEHIAQGLRLYRPHLASEKVYTIMYSCWHEKADERPTFKILLSNILDVMDEES,659,NP_000052.1.csv,refseq-BTK-NM_000061.2_clinical_seed_0_final,refseq-BTK-NM_000061.2.a2m,Invitae,refseq-BTK-NM_000061.2.npy,1,659,659
+NP_000053.2,MASRLTLLTLLLLLLAGDRASSNPNATSSSSQDPESLQDRGEGKVATTVISKMLFVEPILEVSSLPTTNSTTNSATKITANTTDEPTTQPTTEPTTQPTIQPTQPTTQLPTDSPTQPTTGSFCPGPVTLCSDLESHSTEAVLGDALVDFSLKLYHAFSAMKKVETNMAFSPFSIASLLTQVLLGAGENTKTNLESILSYPKDFTCVHQALKGFTTKGVTSVSQIFHSPDLAIRDTFVNASRTLYSSSPRVLSNNSDANLELINTWVAKNTNNKISRLLDSLPSDTRLVLLNAIYLSAKWKTTFDPKKTRMEPFHFKNSVIKVPMMNSKKYPVAHFIDQTLKAKVGQLQLSHNLSLVILVPQNLKHRLEDMEQALSPSVFKAIMEKLEMSKFQPTLLTLPRIKVTTSQDMLSIMEKLEFFDFSYDLNLCGLTEDPDLQVSAMQHQTVLELTETGVEAAAASAISVARTLLVFEVQQPFLFVLWDQQHKFPVFMGRVYDPRA,500,NP_000053.2.csv,refseq-SERPING1-NM_000062.2_clinical_seed_0_final,refseq-SERPING1-NM_000062.2.a2m,Invitae,refseq-SERPING1-NM_000062.2.npy,1,500,500
+NP_000055.2,MGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNHMGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEGIPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYGKKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSPYQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLVAYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEKADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTRFISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTWEILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKSSLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILLQGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGYTQQLAFRQPSSAFAAFVKRAPSTWLTAYVVKVFSLAVNLIAIDSQVLCGAVKWLILEKQKPDGVFQEDAPVIHQEMIGGLRNNNEKDMALTAFVLISLQEAKDICEEQVNSLPGSITKAGDFLEANYMNLQRSYTVAIAGYALAQMGRLKGPLLNKFLTTAKDKNRWEDPGKQLYNVEATSYALLALLQLKDFDFVPPVVRWLNEQRYYGGGYGSTQATFMVFQALAQYQKDAPDHQELNLDVSLQLPSRSSKITHRIHWESASLLRSEETKENEGFTVTAEGKGQGTLSVVTMYHAKAKDQLTCNKFDLKVTIKPAPETEKRPQDAKNTMILEICTRYRGDQDATMSILDISMMTGFAPDTDDLKQLANGVDRYISKYELDKAFSDRNTLIIYLDKVSHSEDDCLAFKVHQYFNVELIQPGAVKVYAYYNLEESCTRFYHPEKEDGKLNKLCRDELCRCAEENCFIQKSDDKVTLEERLDKACEPGVDYVYKTRLVKVQLSNDFDEYIMAIEQTIKSGSDEVQVGQQRTFISPIKCREALKLEEKKHYLMWGLSSDFWGEKPNLSYIIGKDTWVEHWPEEDECQDEENQKQCQDLGAFTESMVVFGCPN,1663,NP_000055.2.csv,refseq-C3-NM_000064.3_clinical_seed_0_final,refseq-C3-NM_000064.3.a2m,Invitae,refseq-C3-NM_000064.3.npy,1,1663,1663
+NP_000058.1,MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTHTAKYDPSLKPLSVSYDQATSLRILNNGHAFNVEFDDSQDKAVLKGGPLDGTYRLIQFHFHWGSLDGQGSEHTVDKKKYAAELHLVHWNTKYGDFGKAVQQPDGLAVLGIFLKVGSAKPGLQKVVDVLDSIKTKGKSADFTNFDPRGLLPESLDYWTYPGSLTTPPLLECVTWIVLKEPISVSSEQVLKFRKLNFNGEGEPEELMVDNWRPAQPLKNRQIKASFK,260,NP_000058.1.csv,refseq-CA2-NM_000067.2_clinical_seed_0_final,refseq-CA2-NM_000067.2.a2m,Invitae,refseq-CA2-NM_000067.2.npy,1,260,260
+NP_000060.2,MEPSSPQDEGLRKKQPKKPVPEILPRPPRALFCLTLENPLRKACISIVEWKPFETIILLTIFANCVALAVYLPMPEDDNNSLNLGLEKLEYFFLIVFSIEAAMKIIAYGFLFHQDAYLRSGWNVLDFTIVFLGVFTVILEQVNVIQSHTAPMSSKGAGLDVKALRAFRVLRPLRLVSGVPSLQVVLNSIFKAMLPLFHIALLVLFMVIIYAIIGLELFKGKMHKTCYFIGTDIVATVENEEPSPCARTGSGRRCTINGSECRGGWPGPNHGITHFDNFGFSMLTVYQCITMEGWTDVLYWVNDAIGNEWPWIYFVTLILLGSFFILNLVLGVLSGEFTKEREKAKSRGTFQKLREKQQLDEDLRGYMSWITQGEVMDVEDFREGKLSLDEGGSDTESLYEIAGLNKIIQFIRHWRQWNRIFRWKCHDIVKSKVFYWLVILIVALNTLSIASEHHNQPLWLTRLQDIANRVLLSLFTTEMLMKMYGLGLRQYFMSIFNRFDCFVVCSGILEILLVESGAMTPLGISVLRCIRLLRIFKITKYWTSLSNLVASLLNSIRSIASLLLLLFLFIVIFALLGMQLFGGRYDFEDTEVRRSNFDNFPQALISVFQVLTGEDWTSMMYNGIMAYGGPSYPGMLVCIYFIILFVCGNYILLNVFLAIAVDNLAEAESLTSAQKAKAEEKKRRKMSKGLPDKSEEEKSTMAKKLEQKPKGEGIPTTAKLKIDEFESNVNEVKDPYPSADFPGDDEEDEPEIPLSPRPRPLAELQLKEKAVPIPEASSFFIFSPTNKIRVLCHRIVNATWFTNFILLFILLSSAALAAEDPIRADSMRNQILKHFDIGFTSVFTVEIVLKMTTYGAFLHKGSFCRNYFNMLDLLVVAVSLISMGLESSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCMFVAISTIGNIVLVTTLLQFMFACIGVQLFKGKFFRCTDLSKMTEEECRGYYYVYKDGDPMQIELRHREWVHSDFHFDNVLSAMMSLFTVSTFEGWPQLLYKAIDSNAEDVGPIYNNRVEMAIFFIIYIILIAFFMMNIFVGFVIVTFQEQGETEYKNCELDKNQRQCVQYALKARPLRCYIPKNPYQYQVWYIVTSSYFEYLMFALIMLNTICLGMQHYNQSEQMNHISDILNVAFTIIFTLEMILKLMAFKARGYFGDPWNVFDFLIVIGSIIDVILSEIDTFLASSGGLYCLGGGCGNVDPDESARISSAFFRLFRVMRLIKLLSRAEGVRTLLWTFIKSFQALPYVALLIVMLFFIYAVIGMQMFGKIALVDGTQINRNNNFQTFPQAVLLLFRCATGEAWQEILLACSYGKLCDPESDYAPGEEYTCGTNFAYYYFISFYMLCAFLVINLFVAVIMDNFDYLTRDWSILGPHHLDEFKAIWAEYDPEAKGRIKHLDVVTLLRRIQPPLGFGKFCPHRVACKRLVGMNMPLNSDGTVTFNATLFALVRTALKIKTEGNFEQANEELRAIIKKIWKRTSMKLLDQVIPPIGDDEVTVGKFYATFLIQEHFRKFMKRQEEYYGYRPKKDIVQIQAGLRTIEEEAAPEICRTVSGDLAAEEELERAMVEAAMEEGIFRRTGGLFGQVDNFLERTNSLPPVMANQRPLQFAEIEMEEMESPVFLEDFPQDPRTNPLARANTNNANANVAYGNSNHSNSHVFSSVHYEREFPEETETPATRGRALGQPCRVLGPHSKPCVEMLKGLLTQRAMPRGQAPPAPCQCPRVESSMPEDRKSSTPGSLHEETPHSRSTRENTSRCSAPATALLIQKALVRGGLGTLAADANFIMATGQALADACQMEPEEVEIMATELLKGREAPEGMASSLGCLNLGSSLGSLDQHQGSQETLIPPRL,1873,NP_000060.2.csv,refseq-CACNA1S-NM_000069.2_clinical_seed_0_final,refseq-CACNA1S-NM_000069.2.a2m,Invitae,refseq-CACNA1S-NM_000069.2.npy,1,1873,1873
+NP_000061.1,MPTVISASVAPRTAAEPRSPGPVPHPAQSKATEAGGGNPSGIYSAIISRNFPIIGVKEKTFEQLHKKCLEKKVLYVDPEFPPDETSLFYSQKFPIQFVWKRPPEICENPRFIIDGANRTDICQGELGDCWFLAAIACLTLNQHLLFRVIPHDQSFIENYAGIFHFQFWRYGEWVDVVIDDCLPTYNNQLVFTKSNHRNEFWSALLEKAYAKLHGSYEALKGGNTTEAMEDFTGGVAEFFEIRDAPSDMYKIMKKAIERGSLMGCSIDDGTNMTYGTSPSGLNMGELIARMVRNMDNSLLQDSDLDPRGSDERPTRTIIPVQYETRMACGLVRGHAYSVTGLDEVPFKGEKVKLVRLRNPWGQVEWNGSWSDRWKDWSFVDKDEKARLQHQVTEDGEFWMSYEDFIYHFTKLEICNLTADALQSDKLQTWTVSVNEGRWVRGCSAGGCRNFPDTFWTNPQYRLKLLEEDDDPDDSEVICSFLVALMQKNRRKDRKLGASLFTIGFAIYEVPKEMHGNKQHLQKDFFLYNASKARSKTYINMREVSQRFRLPPSEYVIVPSTYEPHQEGEFILRVFSEKRNLSEEVENTISVDRPVKKKKTKPIIFVSDRANSNKELGVDQESEEGKGKTSPDKQKQSPQPQPGSSDQESEEQQQFRNIFKQIAGDDMEICADELKKVLNTVVNKHKDLKTHGFTLESCRSMIALMDTDGSGKLNLQEFHHLWNKIKAWQKIFKHYDTDQSGTINSYEMRNAVNDAGFHLNNQLYDIITMRYADKHMNIDFDSFICCFVRLEGMFRAFHAFDKDGDGIIKLNVLEWLQLTMYA,821,NP_000061.1.csv,refseq-CAPN3-NM_000070.2_clinical_seed_0_final,refseq-CAPN3-NM_000070.2.a2m,Invitae,refseq-CAPN3-NM_000070.2.npy,1,821,821
+NP_000062.1,MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWIRPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLKCELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAVRGYRCIIVMPEKMSSEKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSHILDQYRNASNPLAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCRIIGVDPEGSILAEPEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTFARMLIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWMLQKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTITCGHTIEILREKGFDQAPVVDEAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQIRLTDTLGRLSHILEMDHFALVVHEQIQYHSTGKSSQRQMVFGVVTAIDLLNFVAAQERDQK,551,NP_000062.1.csv,refseq-CBS-NM_000071.2_clinical_seed_0_final,refseq-CBS-NM_000071.2.a2m,Invitae,refseq-CBS-NM_000071.2.npy,1,551,551
+NP_000065.1,MIETYNQTSPRSAATGLPISMKIFMYLLTVFLITQMIGSALFAVYLHRRLDKIEDERNLHEDFVFMKTIQRCNTGERSLSLLNCEEIKSQFEGFVKDIMLNKEETKKENSFEMQKGDQNPQIAAHVISEASSKTTSVLQWAEKGYYTMSNNLVTLENGKQLTVKRQGLYYIYAQVTFCSNREASSQAPFIASLCLKSPGRFERILLRAANTHSSAKPCGQQSIHLGGVFELQPGASVFVNVTDPSQVSHGTGFTSFGLLKL,261,NP_000065.1.csv,refseq-CD40LG-NM_000074.2_clinical_seed_0_final,refseq-CD40LG-NM_000074.2.a2m,Invitae,refseq-CD40LG-NM_000074.2.npy,1,261,261
+NP_000066.1,MATSRYEPVAEIGVGAYGTVYKARDPHSGHFVALKSVRVPNGGGGGGGLPISTVREVALLRRLEAFEHPNVVRLMDVCATSRTDREIKVTLVFEHVDQDLRTYLDKAPPPGLPAETIKDLMRQFLRGLDFLHANCIVHRDLKPENILVTSGGTVKLADFGLARIYSYQMALTPVVVTLWYRAPEVLLQSTYATPVDMWSVGCIFAEMFRRKPLFCGNSEADQLGKIFDLIGLPPEDDWPRDVSLPRGAFPPRGPRPVQSVVPEMEESGAQLLLEMLTFNPHKRISAFRALQHSYLHKDEGNPE,303,NP_000066.1.csv,NP_000066.1_clinical_seed_0_final,NP_000066.1.a2m,popEVE,NP_000066.1_theta_0.2.npy,1,303,303
+NP_000067.1,MSDASLRSTSTMERLVARGTFPVLVRTSACRSLFGPVDHEELSRELQARLAELNAEDQNRWDYDFQQDMPLRGPGRLQWTEVDSDSVPAFYRETVQVGRCRLLLAPRPVAVAVAVSPPLEPAAESLDGLEEAPEQLPSVPVPAPASTPPPVPVLAPAPAPAPAPVAAPVAAPVAVAVLAPAPAPAPAPAPAPAPVAAPAPAPAPAPAPAPAPAPAPDAAPQESAEQGANQGQRGQEPLADQLHSGISGRPAAGTAAASANGAAIKKLSGPLISDFFAKRKRSAPEKSSGDVPAPCPSPSAAPGVGSVEQTPRKRLR,316,NP_000067.1.csv,refseq-CDKN1C-NM_000076.2_clinical_seed_0_final,refseq-CDKN1C-NM_000076.2.a2m,Invitae,refseq-CDKN1C-NM_000076.2.npy,1,316,316
+NP_000068.1,MEPAAGSSMEPSADWLATAAARGRVEEVRALLEAGALPNAPNSYGRRPIQVMMMGSARVAELLLLHGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRAGARLDVRDAWGRLPVDLAEELGHRDVARYLRAAAGGTRGSNHARIDAAEGPSDIPD,156,NP_000068.1.csv,refseq-CDKN2A-NM_000077.4_clinical_seed_0_final,refseq-CDKN2A-NM_000077.4.a2m,Invitae,refseq-CDKN2A-NM_000077.4_theta_0.2.npy,1,156,156
+NP_000070.1,MEPWPLLLLFSLCSAGLVLGSEHETRLVAKLFKDYSSVVRPVEDHRQVVEVTVGLQLIQLINVDEVNQIVTTNVRLKQQWVDYNLKWNPDDYGGVKKIHIPSEKIWRPDLVLYNNADGDFAIVKFTKVLLQYTGHITWTPPAIFKSYCEIIVTHFPFDEQNCSMKLGTWTYDGSVVAINPESDQPDLSNFMESGEWVIKESRGWKHSVTYSCCPDTPYLDITYHFVMQRLPLYFIVNVIIPCLLFSFLTGLVFYLPTDSGEKMTLSISVLLSLTVFLLVIVELIPSTSSAVPLIGKYMLFTMVFVIASIIITVIVINTHHRSPSTHVMPNWVRKVFIDTIPNIMFFSTMKRPSREKQDKKIFTEDIDISDISGKPGPPPMGFHSPLIKHPEVKSAIEGIKYIAETMKSDQESNNAAAEWKYVAMVMDHILLGVFMLVCIIGTLAVFAGRLIELNQQG,457,NP_000070.1.csv,refseq-CHRNA1-NM_000079.3_clinical_seed_0_final,refseq-CHRNA1-NM_000079.3.a2m,Invitae,refseq-CHRNA1-NM_000079.3.npy,1,457,457
+NP_000071.1,MARAPLGVLLLLGLLGRGVGKNEELRLYHHLFNNYDPGSRPVREPEDTVTISLKVTLTNLISLNEKEETLTTSVWIGIDWQDYRLNYSKDDFGGIETLRVPSELVWLPEIVLENNIDGQFGVAYDANVLVYEGGSVTWLPPAIYRSVCAVEVTYFPFDWQNCSLIFRSQTYNAEEVEFTFAVDNDGKTINKIDIDTEAYTENGEWAIDFCPGVIRRHHGGATDGPGETDVIYSLIIRRKPLFYVINIIVPCVLISGLVLLAYFLPAQAGGQKCTVSINVLLAQTVFLFLIAQKIPETSLSVPLLGRFLIFVMVVATLIVMNCVIVLNVSQRTPTTHAMSPRLRHVLLELLPRLLGSPPPPEAPRAASPPRRASSVGLLLRAEELILKKPRSELVFEGQRHRQGTWTAAFCQSLGAAAPEVRCCVDAVNFVAESTRDQEATGEEVSDWVRMGNALDNICFWAALVLFSVGSSLIFLGAYFNRVPDLPYAPCIQP,493,NP_000071.1.csv,refseq-CHRNE-NM_000080.3_clinical_seed_0_final,refseq-CHRNE-NM_000080.3.a2m,Invitae,refseq-CHRNE-NM_000080.3.npy,1,493,493
+NP_000073.1,MLGFLSARQTGLEDPLRLRRAESTRRVLGLELNKDRDVERIHGGGINTLDIEPVEGRYMLSGGSDGVIVLYDLENSSRQSYYTCKAVCSIGRDHPDVHRYSVETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQTADVFNFEETVYSHHMSPVSTKHCLVAVGTRGPKVQLCDLKSGSCSHILQGHRQEILAVSWSPRYDYILATASADSRVKLWDVRRASGCLITLDQHNGKKSQAVESANTAHNGKVNGLCFTSDGLHLLTVGTDNRMRLWNSSNGENTLVNYGKVCNNSKKGLKFTVSCGCSSEFVFVPYGSTIAVYTVYSGEQITMLKGHYKTVDCCVFQSNFQELYSGSRDCNILAWVPSLYEPVPDDDETTTKSQLNPAFEDAWSSSDEEG,396,NP_000073.1.csv,refseq-ERCC8-NM_000082.3_clinical_seed_0_final,refseq-ERCC8-NM_000082.3.a2m,Invitae,refseq-ERCC8-NM_000082.3.npy,1,396,396
+NP_000074.3,MEQSRSQQRGGEQSWWGSDPQYQYMPFEHCTSYGLPSENGGLQHRLRKDAGPRHNVHPTQIYGHHKEQFSDREQDIGMPKKTGSSSTVDSKDEDHYSKCQDCIHRLGQVVRRKLGEDGIFLVLLGLLMALVSWSMDYVSAKSLQAYKWSYAQMQPSLPLQFLVWVTFPLVLILFSALFCHLISPQAVGSGIPEMKTILRGVVLKEYLTMKAFVAKVVALTAGLGSGIPVGKEGPFVHIASICAAVLSKFMSVFCGVYEQPYYYSDILTVGCAVGVGCCFGTPLGGVLFSIEVTSTYFAVRNYWRGFFAATFSAFVFRVLAVWNKDAVTITALFRTNFRMDFPFDLKELPAFAAIGICCGLLGAVFVYLHRQVMLGVRKHKALSQFLAKHRLLYPGIVTFVIASFTFPPGMGQFMAGELMPREAISTLFDNNTWVKHAGDPESLGQSAVWIHPRVNVVIIIFLFFVMKFWMSIVATTMPIPCGGFMPVFVLGAAFGRLVGEIMAMLFPDGILFDDIIYKILPGGYAVIGAAALTGAVSHTVSTAVICFELTGQIAHILPMMVAVILANMVAQSLQPSLYDSIIQVKKLPYLPDLGWNQLSKYTIFVEDIMVRDVKFVSASYTYGELRTLLQTTTVKTLPLVDSKDSMILLGSVERSELQALLQRHLCPERRLRAAQEMARKLSELPYDGKARLAGEGLPGAPPGRPESFAFVDEDEDEDLSGKSELPPSLALHPSTTAPLSPEEPNGPLPGHKQQPEAPEPAGQRPSIFQSLLHCLLGRARPTKKKTTQDSTDLVDNMSPEEIEAWEQEQLSQPVCFDSCCIDQSPFQLVEQTTLHKTHTLFSLLGLHLAYVTSMGKLRGVLALEELQKAIEGHTKSGVQLRPPLASFRNTTSTRKSTGAPPSSAENWNLPEDRPGATGTGDVIAASPETPVPSPSPEPPLSLAPGKVEGELEELELVESPGLEEELADILQGPSLRSTDEEDEDELIL,988,NP_000074.3.csv,refseq-CLCN1-NM_000083.3_clinical_seed_0_final,refseq-CLCN1-NM_000083.3.a2m,Invitae,refseq-CLCN1-NM_000083.3.npy,1,988,988
+NP_000076.2,MEEFVGLREGSSGNPVTLQELWGPCPRIRRGIRGGLEWLKQKLFRLGEDWYFLMTLGVLMALVSCAMDLAVESVVRAHQWLYREIGDSHLLRYLSWTVYPVALVSFSSGFSQSITPSSGGSGIPEVKTMLAGVVLEDYLDIKNFGAKVVGLSCTLACGSTLFLGKVGPFVHLSVMMAAYLGRVRTTTIGEPENKSKQNEMLVAAAAVGVATVFAAPFSGVLFSIEVMSSHFSVWDYWRGFFAATCGAFMFRLLAVFNSEQETITSLYKTSFRVDVPFDLPEIFFFVALGGLCGILGSAYLFCQRIFFGFIRNNRFSSKLLATSKPVYSALATLVLASITYPPSAGRFLASRLSMKQHLDSLFDNHSWALMTQNSSPPWPEELDPQHLWWEWYHPRFTIFGTLAFFLVMKFWMLILATTIPMPAGYFMPIFVYGAAIGRLFGETLSFIFPEGIVAGGITNPIMPGGYALAGAAAFSGAVTHTISTALLAFEVTGQIVHALPVLMAVLAANAIAQSCQPSFYDGTVIVKKLPYLPRILGRNIGSHRVRVEHFMNHSITTLAKDMPLEEVVKVVTSTDVAKYPLVESTESQILVGIVRRAQLVQALKAEPPSWAPGHQQCLQDILAAGCPTEPVTLKLSPETSLHEAHNLFELLNLHSLFVTSRGRAVGCVSWVEMKKAISNLTNPPAPK,687,NP_000076.2.csv,refseq-CLCNKB-NM_000085.5_clinical_seed_0_final,refseq-CLCNKB-NM_000085.5.a2m,Invitae,refseq-CLCNKB-NM_000085.5.npy,1,687,687
+NP_000078.3,MKNNIINTQQSFVTMPNVIVPDIEKEIRRMENGACSSFSEDDDSASTSEESENENPHARGSFSYKSLRKGGPSQREQYLPGAIALFNVNNSSNKDQEPEEKKKKKKEKKSKSDDKNENKNDPEKKKKKKDKEKKKKEEKSKDKKEEEKKEVVVIDPSGNTYYNWLFCITLPVMYNWTMVIARACFDELQSDYLEYWLILDYVSDIVYLIDMFVRTRTGYLEQGLLVKEELKLINKYKSNLQFKLDVLSLIPTDLLYFKLGWNYPEIRLNRLLRFSRMFEFFQRTETRTNYPNIFRISNLVMYIVIIIHWNACVFYSISKAIGFGNDTWVYPDINDPEFGRLARKYVYSLYWSTLTLTTIGETPPPVRDSEYVFVVVDFLIGVLIFATIVGNIGSMISNMNAARAEFQARIDAIKQYMHFRNVSKDMEKRVIKWFDYLWTNKKTVDEKEVLKYLPDKLRAEIAINVHLDTLKKVRIFADCEAGLLVELVLKLQPQVYSPGDYICKKGDIGREMYIIKEGKLAVVADDGVTQFVVLSDGSYFGEISILNIKGSKAGNRRTANIKSIGYSDLFCLSKDDLMEALTEYPDAKTMLEEKGKQILMKDGLLDLNIANAGSDPKDLEEKVTRMEGSVDLLQTRFARILAEYESMQQKLKQRLTKVEKFLKPLIDTEFSSIEGPGAESGPIDST,686,NP_000078.3.csv,NP_000078.3_colabfold_clinical_seed_0_final,NP_000078.3_colabfold.a2m,colabfold,NP_000078.3_colabfold_theta_0.2.npy,1,686,686
+NP_000079.2,MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDKRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL,1464,NP_000079.2.csv,NP_000079.2_colabfold_clinical_seed_0_final,NP_000079.2_colabfold.a2m,colabfold,NP_000079.2_colabfold_theta_0.2.npy,1,1464,1464
+NP_000080.2,MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK,1366,NP_000080.2.csv,refseq-COL1A2-NM_000089.3_clinical_seed_0_final,refseq-COL1A2-NM_000089.3.a2m,Invitae,refseq-COL1A2-NM_000089.3.npy,1,1366,1366
+NP_000081.2,MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDVWKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVHLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL,1466,NP_000081.2.csv,refseq-COL3A1-NM_000090.4_clinical_seed_0_final,refseq-COL3A1-NM_000090.4.a2m,Invitae,refseq-COL3A1-NM_000090.4.npy,1,1466,1466
+NP_000082.2,MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSPGQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLVGVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPGPQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGPPGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPGKDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPGPPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSPGCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCPYIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEKGETLQPEGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGDPGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGPPGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSLGCPGKMGEPGLPGKPGLPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPGTPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDPGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAIGPPGPPGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLKGFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNMGMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMGKKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGLLGPPGIRGPPGLPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFPGLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPGSHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGEKGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGPPGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPATWTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVHSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH,1670,NP_000082.2.csv,refseq-COL4A3-NM_000091.4_clinical_seed_0_final,refseq-COL4A3-NM_000091.4.a2m,Invitae,refseq-COL4A3-NM_000091.4.npy,1,1670,1670
+NP_000083.3,MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPEKGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGLDGIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSVFILGAVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKGQMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGEKGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVTPPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPGEAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVIYCSVGNPGPQGIKGKVGPPGGRGPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGLPGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGPPGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKGQKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGPPGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPGLPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPGPPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPGDEGEMAIISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEKGQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGLRGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKDIPDPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVCVKYS,1690,NP_000083.3.csv,refseq-COL4A4-NM_000092.4_clinical_seed_0_final,refseq-COL4A4-NM_000092.4.a2m,Invitae,refseq-COL4A4-NM_000092.4_theta_0.2.npy,1,1690,1690
+NP_000084.3,MDVHTRWKARSALRPGAPLLPPLLLLLLWAPPPSRAAQPADLLKVLDFHNLPDGITKTTGFCATRRSSKGPDVAYRVTKDAQLSAPTKQLYPASAFPEDFSILTTVKAKKGSQAFLVSIYNEQGIQQIGLELGRSPVFLYEDHTGKPGPEDYPLFRGINLSDGKWHRIALSVHKKNVTLILDCKKKTTKFLDRSDHPMIDINGIIVFGTRILDEEVFEGDIQQLLFVSDHRAAYDYCEHYSPDCDTAVPDTPQSQDPNPDEYYTEGDGEGETYYYEYPYYEDPEDLGKEPTPSKKPVEAAKETTEVPEELTPTPTEAAPMPETSEGAGKEEDVGIGDYDYVPSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPAPPPGEGADDLEGEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRGEKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPGADGLPGPPGTMLMLPFRFGGGGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGLTGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQTGPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGEPGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGITGPSGPIGPPGPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPARTCKDLQLCHPDFPDGEYWVDPNQGCSRDSFKVYCNFTAGGSTCVFPDKKSEGARITSWPKENPGSWFSEFKRGKLLSYVDAEGNPVGVVQMTFLRLLSASAHQNVTYHCYQSVAWQDAATGSYDKALRFLGSNDEEMSYDNNPYIRALVDGCATKKGYQKTVLEIDTPKVEQVPIVDIMFNDFGEASQKFGFEVGPACFMG,1838,NP_000084.3.csv,refseq-COL5A1-NM_000093.4_clinical_seed_0_final,refseq-COL5A1-NM_000093.4.a2m,Invitae,refseq-COL5A1-NM_000093.4.npy,1,1838,1838
+NP_000086.2,MVPDTACVLLLTLAALGASGQGQSPLGSDLGPQMLRELQETNAALQDVRELLRQQVREITFLKNTVMECDACGMQQSVRTGLPSVRPLLHCAPGFCFPGVACIQTESGARCGPCPAGFTGNGSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSGPTHQGVGLAFAKANKQVCTDINECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQASGCQRRAQRFCPDGSPSECHEHADCVLERDGSRSCVCAVGWAGNGILCGRDTDLDGFPDEKLRCPERQCRKDNCVTVPNSGQEDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNPDQRNTDEDKWGDACDNCRSQKNDDQKDTDQDGRGDACDDDIDGDRIRNQADNCPRVPNSDQKDSDGDGIGDACDNCPQKSNPDQADVDHDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQEDSDHDGQGDACDDDDDNDGVPDSRDNCRLVPNPGQEDADRDGVGDVCQDDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLDPEGDAQIDPNWVVLNQGREIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIFGYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTESQVRLLWKDPRNVGWKDKKSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGRLGVFCFSQENIIWANLRYRCNDTIPEDYETHQLRQA,757,NP_000086.2.csv,refseq-COMP-NM_000095.2_clinical_seed_0_final,refseq-COMP-NM_000095.2.a2m,Invitae,refseq-COMP-NM_000095.2.npy,1,757,757
+NP_000087.2,MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTEHSNIYLQNGPDRIGRLYKKALYLQYTDETFRTTIEKPVWLGFLGPIIKAETGDKVYVHLKNLASRPYTFHSHGITYYKEHEGAIYPDNTTDFQRADDKVYPGEQYTYMLLATEEQSPGEGDGNCVTRIYHSHIDAPKDIASGLIGPLIICKKDSLDKEKEKHIDREFVVMFSVVDENFSWYLEDNIKTYCSEPEKVDKDNEDFQESNRMYSVNGYTFGSLPGLSMCAEDRVKWYLFGMGNEVDVHAAFFHGQALTNKNYRIDTINLFPATLFDAYMVAQNPGEWMLSCQNLNHLKAGLQAFFQVQECNKSSSKDNIRGKHVRHYYIAAEEIIWNYAPSGIDIFTKENLTAPGSDSAVFFEQGTTRIGGSYKKLVYREYTDASFTNRKERGPEEEHLGILGPVIWAEVGDTIRVTFHNKGAYPLSIEPIGVRFNKNNEGTYYSPNYNPQSRSVPPSASHVAPTETFTYEWTVPKEVGPTNADPVCLAKMYYSAVEPTKDIFTGLIGPMKICKKGSLHANGRQKDVDKEFYLFPTVFDENESLLLEDNIRMFTTAPDQVDKEDEDFQESNKMHSMNGFMYGNQPGLTMCKGDSVVWYLFSAGNEADVHGIYFSGNTYLWRGERRDTANLFPQTSLTLHMWPDTEGTFNVECLTTDHYTGGMKQKYTVNQCRRQSEDSTFYLGERTYYIAAVEVEWDYSPQREWEKELHHLQEQNVSNAFLDKGEFYIGSKYKKVVYRQYTDSTFRVPVERKAEEEHLGILGPQLHADVGDKVKIIFKNMATRPYSIHAHGVQTESSTVTPTLPGETLTYVWKIPERSGAGTEDSACIPWAYYSTVDQVKDLYSGLIGPLIVCRRPYLKVFNPRRKLEFALLFLVFDENESWYLDDNIKTYSDHPEKVNKDDEEFIESNKMHAINGRMFGNLQGLTMHVGDEVNWYLMGMGNEIDLHTVHFHGHSFQYKHRGVYSSDVFDIFPGTYQTLEMFPRTPGIWLLHCHVTDHIHAGMETTYTVLQNEDTKSG,1065,NP_000087.2.csv,refseq-CP-NM_000096.4_clinical_seed_0_final,refseq-CP-NM_000096.4.a2m,Invitae,refseq-CP-NM_000096.4.npy,1,1065,1065
+NP_000088.3,MALQLGRLSSGPCWLVARGGCGGPRAWSQCGGGGLRAWSQRSAAGRVCRPPGPAGTEQSRGLGHGSTSRGGPWVGTGLAAALAGLVGLATAAFGHVQRAEMLPKTSGTRATSLGRPEEEEDELAHRCSSFMAPPVTDLGELRRRPGDMKTKMELLILETQAQVCQALAQVDGGANFSVDRWERKEGGGGISCVLQDGCVFEKAGVSISVVHGNLSEEAAKQMRSRGKVLKTKDGKLPFCAMGVSSVIHPKNPHAPTIHFNYRYFEVEEADGNKQWWFGGGCDLTPTYLNQEDAVHFHRTLKEACDQHGPDLYPKFKKWCDDYFFIAHRGERRGIGGIFFDDLDSPSKEEVFRFVQSCARAVVPSYIPLVKKHCDDSFTPQEKLWQQLRRGRYVEFNLLYDRGTKFGLFTPGSRIESILMSLPLTARWEYMHSPSENSKEAEILEVLRHPRDWVR,454,NP_000088.3.csv,refseq-CPOX-NM_000097.5_clinical_seed_0_final,refseq-CPOX-NM_000097.5.a2m,Invitae,refseq-CPOX-NM_000097.5.npy,1,454,454
+NP_000089.1,MVPRLLLRAWPRGPAVGPGAPSRPLSAGSGPGQYLQRSIVPTMHYQDSLPRLPIPKLEDTIRRYLSAQKPLLNDGQFRKTEQFCKSFENGIGKELHEQLVALDKQNKHTSYISGPWFDMYLSARDSVVLNFNPFMAFNPDPKSEYNDQLTRATNMTVSAIRFLKTLRAGLLEPEVFHLNPAKSDTITFKRLIRFVPSSLSWYGAYLVNAYPLDMSQYFRLFNSTRLPKPSRDELFTDDKARHLLVLRKGNFYIFDVLDQDGNIVSPSEIQAHLKYILSDSSPAPEFPLAYLTSENRDIWAELRQKLMSSGNEESLRKVDSAVFCLCLDDFPIKDLVHLSHNMLHGDGTNRWFDKSFNLIIAKDGSTAVHFEHSWGDGVAVLRFFNEVFKDSTQTPAVTPQSQPATTDSTVTVQKLNFELTDALKTGITAAKEKFDATMKTLTIDCVQFQRGGKEFLKKQKLSPDAVAQLAFQMAFLRQYGQTVATYESCSTAAFKHGRTETIRPASVYTKRCSEAFVREPSRHSAGELQQMMVECSKYHGQLTKEAAMGQGFDRHLFALRHLAAAKGIILPELYLDPAYGQINHNVLSTSTLSSPAVNLGGFAPVVSDGFGVGYAVHDNWIGCNVSSYPGRNAREFLQCVEKALEDMFDALEGKSIKS,658,NP_000089.1.csv,refseq-CPT2-NM_000098.2_clinical_seed_0_final,refseq-CPT2-NM_000098.2.a2m,Invitae,refseq-CPT2-NM_000098.2.npy,1,658,658
+NP_000092.2,MGQIEWAMWANEQALASGLILITGGIVATAGRFTQWYFGAYSIVAGVFVCLLEYPRGKRKKGSTMERWGQKYMTAVVKLFGPFTRNYYVRAVLHLLLSVPAGFLLATILGTACLAIASGIYLLAAVRGEQWTPIEPKPRERPQIGGTIKQPPSNPPPRPPAEARKKPSEEEAAVAAGGPPGGPQVNPIPVTDEVV,195,NP_000092.2.csv,refseq-CYBA-NM_000101.4_clinical_seed_0_final,refseq-CYBA-NM_000101.4.a2m,Invitae,refseq-CYBA-NM_000101.4.npy,1,195,195
+NP_000093.1,MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLPRHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATLDIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATHNGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLVPWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNGNAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQNVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEVIINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST,508,NP_000093.1.csv,refseq-CYP17A1-NM_000102.3_clinical_seed_0_final,refseq-CYP17A1-NM_000102.3.a2m,Invitae,refseq-CYP17A1-NM_000102.3.npy,1,508,508
+NP_000095.2,MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPAFASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQPRSRQVLEGHVLSEARELVALLVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDMMDAFILSAEKKAAGDSHGGGARLDLENVPATITDIFGASQDTLSTALQWLLLLFTRYPDVQTRVQAELDQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTVVFVNQWSVNHDPLKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRANPNEPAKMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ,543,NP_000095.2.csv,NP_000095.2_clinical_seed_0_final,NP_000095.2.a2m,popEVE,NP_000095.2_theta_0.2.npy,1,543,543
+NP_000099.2,MQSWSRVYCSLAKRGHFNRISHGLQGLSAVPLRTYADQPIDADVTVIGSGPGGYVAAIKAAQLGFKTVCIEKNETLGGTCLNVGCIPSKALLNNSHYYHMAHGKDFASRGIEMSEVRLNLDKMMEQKSTAVKALTGGIAHLFKQNKVVHVNGYGKITGKNQVTATKADGGTQVIDTKNILIATGSEVTPFPGITIDEDTIVSSTGALSLKKVPEKMVVIGAGVIGVELGSVWQRLGADVTAVEFLGHVGGVGIDMEISKNFQRILQKQGFKFKLNTKVTGATKKSDGKIDVSIEAASGGKAEVITCDVLLVCIGRRPFTKNLGLEELGIELDPRGRIPVNTRFQTKIPNIYAIGDVVAGPMLAHKAEDEGIICVEGMAGGAVHIDYNCVPSVIYTHPEVAWVGKSEEQLKEEGIEYKVGKFPFAANSRAKTNADTDGMVKILGQKSTDRVLGAHILGPGAGEMVNEAALALEYGASCEDIARVCHAHPTLSEAFREANLAASFGKSINF,509,NP_000099.2.csv,refseq-DLD-NM_000108.4_clinical_seed_0_final,refseq-DLD-NM_000108.4.a2m,Invitae,refseq-DLD-NM_000108.4.npy,1,509,509
+NP_000101.2,MAPVLSKDSADIESILALNPRTQTHATLCSTSAKKLDKKHWKRNPDKNCFNCEKLENNFDDIKHTTLGERGALREAMRCLKCADAPCQKSCPTNLDIKSFITSIANKNYYGAAKMIFSDNPLGLTCGMVCPTSDLCVGGCNLYATEEGPINIGGLQQFATEVFKAMSIPQIRNPSLPPPEKMSEAYSAKIALFGAGPASISCASFLARLGYSDITIFEKQEYVGGLSTSEIPQFRLPYDVVNFEIELMKDLGVKIICGKSLSVNEMTLSTLKEKGYKAAFIGIGLPEPNKDAIFQGLTQDQGFYTSKDFLPLVAKGSKAGMCACHSPLPSIRGVVIVLGAGDTAFDCATSALRCGARRVFIVFRKGFVNIRAVPEEMELAKEEKCEFLPFLSPRKVIVKGGRIVAMQFVRTEQDETGKWNEDEDQMVHLKADVVISAFGSVLSDPKVKEALSPIKFNRWGLPEVDPETMQTSEAWVFAGGDVVGLANTTVESVNDGKQASWYIHKYVQSQYGASVSAKPELPLFYTPIDLVDISVEMAGLKFINPFGLASATPATSTSMIRRAFEAGWGFALTKTFSLDKDIVTNVSPRIIRGTTSGPMYGPGQSSFLNIELISEKTAAYWCQSVTELKADFPDNIVIASIMCSYNKNDWTELAKKSEDSGADALELNLSCPHGMGERGMGLACGQDPELVRNICRWVRQAVQIPFFAKLTPNVTDIVSIARAAKEGGANGVTATNTVSGLMGLKSDGTPWPAVGIAKRTTYGGVSGTAIRPIALRAVTSIARALPGFPILATGGIDSAESGLQFLHSGASVLQVCSAIQNQDFTVIEDYCTGLKALLYLKSIEELQDWDGQSPATVSHQKGKPVPRIAELMDKKLPSFGPYLEQRKKIIAENKIRLKEQNVAFSPLKRNCFIPKRPIPTIKDVIGKALQYLGTFGELSNVEQVVAMIDEEMCINCGKCYMTCNDSGYQAIQFDPETHLPTITDTCTGCTLCLSVCPIVDCIKMVSRTTPYEPKRGVPLSVNPVC,1025,NP_000101.2.csv,DPYD_HUMAN_b01_clinical_seed_0_final,DPYD_HUMAN_b01.a2m,EVE,DPYD_HUMAN_b01_theta_0.2.npy,1,1025,1025
+NP_000102.1,MIEPFGNQYIVARPVYSTNAFEENHKKTGRHHKTFLDHLKVCCSCSPQKAKRIVLSLFPIASWLPAYRLKEWLLSDIVSGISTGIVAVLQGLAFALLVDIPPVYGLYASFFPAIIYLFFGTSRHISVGPFPILSMMVGLAVSGAVSKAVPDRNATTLGLPNNSNNSSLLDDERVRVAAAASVTVLSGIIQLAFGILRIGFVVIYLSESLISGFTTAAAVHVLVSQLKFIFQLTVPSHTDPVSIFKVLYSVFSQIEKTNIADLVTALIVLLVVSIVKEINQRFKDKLPVPIPIEFIMTVIAAGVSYGCDFKNRFKVAVVGDMNPGFQPPITPDVETFQNTVGDCFGIAMVAFAVAFSVASVYSLKYDYPLDGNQELIALGLGNIVCGVFRGFAGSTALSRSAVQESTGGKTQIAGLIGAIIVLIVVLAIGFLLAPLQKSVLAALALGNLKGMLMQFAEIGRLWRKDKYDCLIWIMTFIFTIVLGLGLGLAASVAFQLLTIVFRTQFPKCSTLANIGRTNIYKNKKDYYDMYEPEGVKIFRCPSPIYFANIGFFRRKLIDAVGFSPLRILRKRNKALRKIRKLQKQGLLQVTPKGFICTVDTIKDSDEELDNNQIEVLDQPINTTDLPFHIDWNDDLPLNIEVPKISLHSLILDFSAVSFLDVSSVRGLKSILQEFIRIKVDVYIVGTDDDFIEKLNRYEFFDGEVKSSIFFLTIHDAVLHILMKKDYSTSKFNPSQEKDGKIDFTINTNGGLRNRVYEVPVETKF,764,NP_000102.1.csv,refseq-SLC26A3-NM_000111.2_clinical_seed_0_final,refseq-SLC26A3-NM_000111.2.a2m,Invitae,refseq-SLC26A3-NM_000111.2.npy,1,764,764
+NP_000103.2,MSSESKEQHNVSPRDSAEGNDSYPSGIHLELQRESSTDFKQFETNDQCRPYHRILIERQEKSDTNFKEFVIKKLQKNCQCSPAKAKNMILGFLPVLQWLPKYDLKKNILGDVMSGLIVGILLVPQSIAYSLLAGQEPVYGLYTSFFASIIYFLLGTSRHISVGIFGVLCLMIGETVDRELQKAGYDNAHSAPSLGMVSNGSTLLNHTSDRICDKSCYAIMVGSTVTFIAGVYQVAMGFFQVGFVSVYLSDALLSGFVTGASFTILTSQAKYLLGLNLPRTNGVGSLITTWIHVFRNIHKTNLCDLITSLLCLLVLLPTKELNEHFKSKLKAPIPIELVVVVAATLASHFGKLHENYNSSIAGHIPTGFMPPKVPEWNLIPSVAVDAIAISIIGFAITVSLSEMFAKKHGYTVKANQEMYAIGFCNIIPSFFHCFTTSAALAKTLVKESTGCHTQLSGVVTALVLLLVLLVIAPLFYSLQKSVLGVITIVNLRGALRKFRDLPKMWSISRMDTVIWFVTMLSSALLSTEIGLLVGVCFSIFCVILRTQKPKSSLLGLVEESEVFESVSAYKNLQIKPGIKIFRFVAPLYYINKECFKSALYKQTVNPILIKVAWKKAAKRKIKEKVVTLGGIQDEMSVQLSHDPLELHTIVIDCSAIQFLDTAGIHTLKEVRRDYEAIGIQVLLAQCNPTVRDSLTNGEYCKKEEENLLFYSVYEAMAFAEVSKNQKGVCVPNGLSLSSD,739,NP_000103.2.csv,refseq-SLC26A2-NM_000112.3_clinical_seed_0_final,refseq-SLC26A2-NM_000112.3.a2m,Invitae,refseq-SLC26A2-NM_000112.3.npy,1,739,739
+NP_000104.1,MKLGRAVLGLLLLAPSVVQAVEPISLGLALAGVLTGYIYPRLYCLFAECCGQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSLHGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQLQLWIRGNVSACARSIFIFDEMDKMHAGLIDAIKPFLDYYDLVDGVSYQKAMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNSGFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRVAEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD,332,NP_000104.1.csv,refseq-TOR1A-NM_000113.2_clinical_seed_0_final,refseq-TOR1A-NM_000113.2.a2m,Invitae,refseq-TOR1A-NM_000113.2.npy,1,332,332
+NP_000107.1,MPLHVKWPFPAVPPLTWTLASSVVMGLVGTYSCFWTKYMNHLTVHNREVLYELIEKRGPATPLITVSNHQSCMDDPHLWGILKLRHIWNLKLMRWTPAAADICFTKELHSHFFSLGKCVPVCRGAEFFQAENEGKGVLDTGRHMPGAGKRREKGDGVYQKGMDFILEKLNHGDWVHIFPEGKVNMSSEFLRFKWGIGRLIAECHLNPIILPLWHVGMNDVLPNSPPYFPRFGQKITVLIGKPFSALPVLERLRAENKSAVEMRKALTDFIQEEFQHLKTQAEQLHNHLQPGR,292,NP_000107.1.csv,refseq-TAZ-NM_000116.3_clinical_seed_0_final,refseq-TAZ-NM_000116.3.a2m,Invitae,refseq-TAZ-NM_000116.3.npy,1,292,292
+NP_000108.1,MDNYADLSDTELTTLLRRYNIPHGPVVGSTRRLYEKKIFEYETQRRRLSPPSSSAASSYSFSDLNSTRGDADMYDLPKKEDALLYQSKGYNDDYYEESYFTTRTYGEPESAGPSRAVRQSVTSFPDADAFHHQVHDDDLLSSSEEECKDRERPMYGRDSAYQSITHYRPVSASRSSLDLSYYPTSSSTSFMSSSSSSSSWLTRRAIRPENRAPGAGLGQDRQVPLWGQLLLFLVFVIVLFFIYHFMQAEEGNPF,254,NP_000108.1.csv,refseq-EMD-NM_000117.2_clinical_seed_0_final,refseq-EMD-NM_000117.2.a2m,Invitae,refseq-EMD-NM_000117.2.npy,1,254,254
+NP_000110.2,MGQGEPSQRSTGLAGLYAAPAASPVFIKGSGMDALGIKSCDFQAARNNEEHHTKALSSRRLFVRRGQPFTIILYFRAPVRAFLPALKKVALTAQTGEQPSKINRTQATFPISSLGDRKWWSAVVEERDAQSWTISVTTPADAVIGHYSLLLQVSGRKQLLLGQFTLLFNPWNREDAVFLKNEAQRMEYLLNQNGLIYLGTADCIQAESWDFGQFEGDVIDLSLRLLSKDKQVEKWSQPVHVARVLGALLHFLKEQRVLPTPQTQATQEGALLNKRRGSVPILRQWLTGRGRPVYDGQAWVLAAVACTVLRCLGIPARVVTTFASAQGTGGRLLIDEYYNEEGLQNGEGQRGRIWIFQTSTECWMTRPALPQGYDGWQILHPSAPNGGGVLGSCDLVPVRAVKEGTLGLTPAVSDLFAAINASCVVWKCCEDGTLELTDSNTKYVGNNISTKGVGSDRCEDITQNYKYPEGSLQEKEVLERVEKEKMEREKDNGIRPPSLETASPLYLLLKAPSSLPLRGDAQISVTLVNHSEQEKAVQLAIGVQAVHYNGVLAAKLWRKKLHLTLSANLEKIITIGLFFSNFERNPPENTFLRLTAMATHSESNLSCFAQEDIAICRPHLAIKMPEKAEQYQPLTASVSLQNSLDAPMEDCVISILGRGLIHRERSYRFRSVWPENTMCAKFQFTPTHVGLQRLTVEVDCNMFQNLTNYKSVTVVAPELSA,721,NP_000110.2.csv,refseq-EPB42-NM_000119.2_clinical_seed_0_final,refseq-EPB42-NM_000119.2.a2m,Invitae,refseq-EPB42-NM_000119.2.npy,1,721,721
+NP_000113.1,MGKRDRADRDKKKSRKRHYEDEEDDEEDAPGNDPQEAVPSAAGKQVDESGTKVDEYGAKDYRLQMPLKDDHTSRPLWVAPDGHIFLEAFSPVYKYAQDFLVAIAEPVCRPTHVHEYKLTAYSLYAAVSVGLQTSDITEYLRKLSKTGVPDGIMQFIKLCTVSYGKVKLVLKHNRYFVESCHPDVIQHLLQDPVIRECRLRNSEGEATELITETFTSKSAISKTAESSGGPSTSRVTDPQGKSDIPMDLFDFYEQMDKDEEEEEETQTVSFEVKQEMIEELQKRCIHLEYPLLAEYDFRNDSVNPDINIDLKPTAVLRPYQEKSLRKMFGNGRARSGVIVLPCGAGKSLVGVTAACTVRKRCLVLGNSAVSVEQWKAQFKMWSTIDDSQICRFTSDAKDKPIGCSVAISTYSMLGHTTKRSWEAERVMEWLKTQEWGLMILDEVHTIPAKMFRRVLTIVQAHCKLGLTATLVREDDKIVDLNFLIGPKLYEANWMELQNNGYIAKVQCAEVWCPMSPEFYREYVAIKTKKRILLYTMNPNKFRACQFLIKFHERRNDKIIVFADNVFALKEYAIRLNKPYIYGPTSQGERMQILQNFKHNPKINTIFISKVGDTSFDLPEANVLIQISSHGGSRRQEAQRLGRVLRAKKGMVAEEYNAFFYSLVSQDTQEMAYSTKRQRFLVDQGYSFKVITKLAGMEEEDLAFSTKEEQQQLLQKVLAATDLDAEEEVVAGEFGSRSSQASRRFGTMSSMSGADDTVYMEYHSSRSKAPSKHVHPLFKRFRK,782,NP_000113.1.csv,refseq-ERCC3-NM_000122.1_clinical_seed_0_final,refseq-ERCC3-NM_000122.1.a2m,Invitae,refseq-ERCC3-NM_000122.1.npy,1,782,782
+NP_000114.3,MGVQGLWKLLECSGRQVSPEALEGKILAVDISIWLNQALKGVRDRHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDSRKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPLQEEEKHSSEEEDEKEWQERMNQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTLFEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIRRQYEDEGGFLKEVESRRVVSEDTSHYILIKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEKLKTEKEPDATPPSPRTLLAMQAALLGSSSEEELESENRRQARGRNAPAAVDEGSISPRTLSAIKRALDDDEDVKVCAGDDVQTGGPGAEEMRINSSTENSDEGLKVRDGKGIPFTATLASSSVNSAEEHVASTNEGREPTDSVPKEQMSLVHVGTEAFPISDESMIKDRKDRLPLESAVVRHSDAPGLPNGRELTPASPTCTNSVSKNETHAEVLEQQNELCPYESKFDSSLLSSDDETKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTTESAGQDLISIPKAVEPMEIDSEESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGEEELVGTREGEAPAESESLLRDNSERDDVDGEPQEAEKDAEDSLHEWQDINLEELETLESNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILDLTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHEAQKNPKIRPNPHDTKVKKKLRTLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTKTDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAAASEIEAVSVAMEKEFELLDKAKGKTQKRGITNTLEESSSLKRKRLSDSKGKNTCGGFLGETCLSESSDGSSSEDAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTSSSSDSDDDGGKEKMVLVTARSVFGKKRRKLRRARGRKRKT,1186,NP_000114.3.csv,refseq-ERCC5-NM_000123.4_clinical_seed_0_final,refseq-ERCC5-NM_000123.4.a2m,Invitae,refseq-ERCC5-NM_000123.4.npy,1,1186,1186
+NP_000115.1,MPNEGIPHSSQTQEQDCLQSQPVSNNEEMAIKQESGGDGEVEEYLSFRSVGDGLSTSAVGCASAAPRRGPALLHIDRHQIQAVEPSAQALELQGLGVDVYDQDVLEQGVLQQVDNAIHEASRASQLVDVEKEYRSVLDDLTSCTTSLRQINKIIEQLSPQAATSRDINRKLDSVKRQKYNKEQQLKKITAKQKHLQAILGGAEVKIELDHASLEEDAEPGPSSLGSMLMPVQETAWEELIRTGQMTPFGTQIPQKQEKKPRKIMLNEASGFEKYLADQAKLSFERKKQGCNKRAARKAPAPVTPPAPVQNKNKPNKKARVLSKKEERLKKHIKKLQKRALQFQGKVGLPKARRPWESDMRPEAEGDSEGEESEYFPTEEEEEEEDDEVEGAEADLSGDGTDYELKPLPKGGKRQKKVPVQEIDDDFFPSSGEEAEAASVGEGGGGGRKVGRYRDDGDEDYYKQRLRRWNKLRLQDKEKRLKLEDDSEESDAEFDEGFKVPGFLFKKLFKYQQTGVRWLWELHCQQAGGILGDEMGLGKTIQIIAFLAGLSYSKIRTRGSNYRFEGLGPTVIVCPTTVMHQWVKEFHTWWPPFRVAILHETGSYTHKKEKLIRDVAHCHGILITSYSYIRLMQDDISRYDWHYVILDEGHKIRNPNAAVTLACKQFRTPHRIILSGSPMQNNLRELWSLFDFIFPGKLGTLPVFMEQFSVPITMGGYSNASPVQVKTAYKCACVLRDTINPYLLRRMKSDVKMSLSLPDKNEQVLFCRLTDEQHKVYQNFVDSKEVYRILNGEMQIFSGLIALRKICNHPDLFSGGPKNLKGLPDDELEEDQFGYWKRSGKMIVVESLLKIWHKQGQRVLLFSQSRQMLDILEVFLRAQKYTYLKMDGTTTIASRQPLITRYNEDTSIFVFLLTTRVGGLGVNLTGANRVVIYDPDWNPSTDTQARERAWRIGQKKQVTVYRLLTAGTIEEKIYHRQIFKQFLTNRVLKDPKQRRFFKSNDLYELFTLTSPDASQSTETSAIFAGTGSDVQTPKCHLKRRIQPAFGADHDVPKRKKFPASNISVNDATSSEEKSEAKGAEVNAVTSNRSDPLKDDPHMSSNVTSNDRLGEETNAVSGPEELSVISGNGECSNSSGTGKTSMPSGDESIDEKLGLSYKRERPSQAQTEAFWENKQMENNFYKHKSKTKHHSVAEEETLEKHLRPKQKPKNSKHCRDAKFEGTRIPHLVKKRRYQKQDSENKSEAKEQSNDDYVLEKLFKKSVGVHSVMKHDAIMDGASPDYVLVEAEANRVAQDALKALRLSRQRCLGAVSGVPTWTGHRGISGAPAGKKSRFGKKRNSNFSVQHPSSTSPTEKCQDGIMKKEGKDNVPEHFSGRAEDADSSSGPLASSSLLAKMRARNHLILPERLESESGHLQEASALLPTTEHDDLLVEMRNFIAFQAHTDGQASTREILQEFESKLSASQSCVFRELLRNLCTFHRTSGGEGIWKLKPEYC,1493,NP_000115.1.csv,refseq-ERCC6-NM_000124.3_clinical_seed_0_final,refseq-ERCC6-NM_000124.3.a2m,Invitae,refseq-ERCC6-NM_000124.3.npy,1,1493,1493
+NP_000117.1,MFRAAAPGQLRRAASLLRFQSTLVIAEHANDSLAPITLNTITAATRLGGEVSCLVAGTKCDKVAQDLCKVAGIAKVLVAQHDVYKGLLPEELTPLILATQKQFNYTHICAGASAFGKNLLPRVAAKLEVAPISDIIAIKSPDTFVRTIYAGNALCTVKCDEKVKVFSVRGTSFDAAATSGGSASSEKASSTSPVEISEWLDQKLTKSDRPELTGAKVVVSGGRGLKSGENFKLLYDLADQLHAAVGASRAAVDAGFVPNDMQVGQTGKIVAPELYIAVGISGAIQHLAGMKDSKTIVAINKDPEAPIFQVADYGIVADLFKVVPEMTEILKKK,333,NP_000117.1.csv,refseq-ETFA-NM_000126.3_clinical_seed_0_final,refseq-ETFA-NM_000126.3.a2m,Invitae,refseq-ETFA-NM_000126.3.npy,1,333,333
+NP_000118.2,MQAKKRYFILLSAGSCLALLFYFGGLQFRASRSHSRREEHSGRNGLHHPSPDHFWPRFPDALRPFVPWDQLENEDSSVHISPRQKRDANSSIYKGKKCRMESCFDFTLCKKNGFKVYVYPQQKGEKIAESYQNILAAIEGSRFYTSDPSQACLFVLSLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEIIQDRIFKHISRNSLIWNKHPGGLFVLPQYSSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQIIVLWNCDKPLPAKHRWPATAVPVVVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDNSKERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYSHYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWADPDHFAQRQSCMNTFASWFGYMPLIHSQMRLDPVLFKDQVSILRKKYRDIERL,746,NP_000118.2.csv,refseq-EXT1-NM_000127.2_clinical_seed_0_final,refseq-EXT1-NM_000127.2.a2m,Invitae,refseq-EXT1-NM_000127.2.npy,1,746,746
+NP_000119.1,MIFLYQVVHFILFTSVSGECVTQLLKDTCFEGGDITTVFTPSAKYCQVVCTYHPRCLLFTFTAESPSEDPTRWFTCVLKDSVTETLPRVNRTAAISGYSFKQCSHQISACNKDIYVDLDMKGINYNSSVAKSAQECQERCTDDVHCHFFTYATRQFPSLEHRNICLLKHTQTGTPTRITKLDKVVSGFSLKSCALSNLACIRDIFPNTVFADSNIDSVMAPDAFVCGRICTHHPGCLFFTFFSQEWPKESQRNLCLLKTSESGLPSTRIKKSKALSGFSLQSCRHSIPVFCHSSFYHDTDFLGEELDIVAAKSHEACQKLCTNAVRCQFFTYTPAQASCNEGKGKCYLKLSSNGSPTKILHGRGGISGYTLRLCKMDNECTTKIKPRIVGGTASVRGEWPWQVTLHTTSPTQRHLCGGSIIGNQWILTAAHCFYGVESPKILRVYSGILNQSEIKEDTSFFGVQEIIIHDQYKMAESGYDIALLKLETTVNYTDSQRPICLPSKGDRNVIYTDCWVTGWGYRKLRDKIQNTLQKAKIPLVTNEECQKRYRGHKITHKMICAGYREGGKDACKGDSGGPLSCKHNEVWHLVGITSWGEGCAQRERPGVYTNVVEYVDWILEKTQAV,625,NP_000119.1.csv,refseq-F11-NM_000128.3_clinical_seed_0_final,refseq-F11-NM_000128.3.a2m,Invitae,refseq-F11-NM_000128.3.npy,1,625,625
+NP_000120.2,MSETSRTAFGGRRAVPPNNSNAAEDDLPTVELQGVVPRGVNLQEFLNVTSVHLFKERWDTNKVDHHTDKYENNKLIVRRGQSFYVQIDFSRPYDPRRDLFRVEYVIGRYPQENKGTYIPVPIVSELQSGKWGAKIVMREDRSVRLSIQSSPKCIVGKFRMYVAVWTPYGVLRTSRNPETDTYILFNPWCEDDAVYLDNEKEREEYVLNDIGVIFYGEVNDIKTRSWSYGQFEDGILDTCLYVMDRAQMDLSGRGNPIKVSRVGSAMVNAKDDEGVLVGSWDNIYAYGVPPSAWTGSVDILLEYRSSENPVRYGQCWVFAGVFNTFLRCLGIPARIVTNYFSAHDNDANLQMDIFLEEDGNVNSKLTKDSVWNYHCWNEAWMTRPDLPVGFGGWQAVDSTPQENSDGMYRCGPASVQAIKHGHVCFQFDAPFVFAEVNSDLIYITAKKDGTHVVENVDATHIGKLIVTKQIGGDGMMDITDTYKFQEGQEEERLALETALMYGAKKPLNTEGVMKSRSNVDMDFEVENAVLGKDFKLSITFRNNSHNRYTITAYLSANITFYTGVPKAEFKKETFDVTLEPLSFKKEAVLIQAGEYMGQLLEQASLHFFVTARINETRDVLAKQKSTVLTIPEIIIKVRGTQVVGSDMTVTVEFTNPLKETLRNVWVHLDGPGVTRPMKKMFREIRPNSTVQWEEVCRPWVSGHRKLIASMSSDSLRHVYGELDVQIQRRPSM,732,NP_000120.2.csv,refseq-F13A1-NM_000129.3_clinical_seed_0_final,refseq-F13A1-NM_000129.3.a2m,Invitae,refseq-F13A1-NM_000129.3.npy,1,732,732
+NP_000121.2,MFPGCPRLWVLVVLGTSWVGWGSQGTEAAQLRQFYVAAQGISWSYRPEPTNSSLNLSVTSFKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHFKNKADKPLSIHPQGIRYSKLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGPTHDDPPCLTHIYYSHENLIEDFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVFDESKSWSQSSSLMYTVNGYVNGTMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVLEQNHHKVSAITLVSATSTTANMTVGPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTRNLKKITREQRRHMKRWEYFIAAEEVIWDYAPVIPANMDKKYRSQHLDNFSNQIGKHYKKVMYTQYEDESFTKHTVNPNMKEDGILGPIIRAQVRDTLKIVFKNMASRPYSIYPHGVTFSPYEDEVNSSFTSGRNNTMIRAVQPGETYTYKWNILEFDEPTENDAQCLTRPYYSDVDIMRDIASGLIGLLLICKSRSLDRRGIQRAADIEQQAVFAVFDENKSWYLEDNINKFCENPDEVKRDDPKFYESNIMSTINGYVPESITTLGFCFDDTVQWHFCSVGTQNEILTIHFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWMLTSMNSSPRSKKLRLKFRDVKCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIRSFRNSSLNQEEEEFNLTALALENGTEFVSSNTDIIVGSNYSSPSNISKFTVNNLAEPQKAPSHQQATTAGSPLRHLIGKNSVLNSSTAEHSSPYSEDPIEDPLQPDVTGIRLLSLGAGEFKSQEHAKHKGPKVERDQAAKHRFSWMKLLAHKVGRHLSQDTGSPSGMRPWEDLPSQDTGSPSRMRPWKDPPSDLLLLKQSNSSKILVGRWHLASEKGSYEIIQDTDEDTAVNNWLISPQNASRAWGESTPLANKPGKQSGHPKFPRVRHKSLQVRQDGGKSRLKKSQFLIKTRKKKKEKHTHHAPLSPRTFHPLRSEAYNTFSERRLKHSLVLHKSNETSLPTDLNQTLPSMDFGWIASLPDHNQNSSNDTGQASCPPGLYQTVPPEEHYQTFPIQDPDQMHSTSDPSHRSSSPELSEMLEYDRSHKSFPTDISQMSPSSEHEVWQTVISPDLSQVTLSPELSQTNLSPDLSHTTLSPELIQRNLSPALGQMPISPDLSHTTLSPDLSHTTLSLDLSQTNLSPELSQTNLSPALGQMPLSPDLSHTTLSLDFSQTNLSPELSHMTLSPELSQTNLSPALGQMPISPDLSHTTLSLDFSQTNLSPELSQTNLSPALGQMPLSPDPSHTTLSLDLSQTNLSPELSQTNLSPDLSEMPLFADLSQIPLTPDLDQMTLSPDLGETDLSPNFGQMSLSPDLSQVTLSPDISDTTLLPDLSQISPPPDLDQIFYPSESSQSLLLQEFNESFPYPDLGQMPSPSSPTLNDTFLSKEFNPLVIVGLSKDGTDYIEIIPKEEVQSSEDDYAEIDYVPYDDPYKTDVRTNINSSRDPDNIAAWYLRSNNGNRRNYYIAAEEISWDYSEFVQRETDIEDSDDIPEDTTYKKVVFRKYLDSTFTKRDPRGEYEEHLGILGPIIRAEVDDVIQVRFKNLASRPYSLHAHGLSYEKSSEGKTYEDDSPEWFKEDNAVQPNSSYTYVWHATERSGPESPGSACRAWAYYSAVNPEKDIHSGLIGPLLICQKGILHKDSNMPMDMREFVLLFMTFDEKKSWYYEKKSRSSWRLTSSEMKKSHEFHAINGMIYSLPGLKMYEQEWVRLHLLNIGGSQDIHVVHFHGQTLLENGNKQHQLGVWPLLPGSFKTLEMKASKPGWWLLNTEVGENQRAGMQTPFLIMDRDCRMPMGLSTGIISDSQIKASEFLGYWEPRLARLNNGGSYNAWSVEKLAAEFASKPWIQVDMQKEVIITGIQTQGAKHYLKSCYTTEFYVAYSSNQINWQIFKGNSTRNVMYFNGNSDASTIKENQFDPPIVARYIRISPTRAYNRPTLRLELQGCEVNGCSTPLGMENGKIENKQITASSFKKSWWGDYWEPFRARLNAQGRVNAWQAKANNNKQWLEIDLLKIKKITAIITQGCKSLSSEMYVKSYTIHYSEQGVEWKPYRLKSSMVDKIFEGNTNTKGHVKNFFNPPIISRFIRVIPKTWNQSIALRLELFGCDIY,2224,NP_000121.2.csv,FA5_HUMAN_b01_clinical_seed_0_final,FA5_HUMAN_b01.a2m,EVE,FA5_HUMAN_b01_theta_0.2.npy,1,2224,2224
+NP_000122.1,MVSQALRLLCLLLGLQGCLAAGGVAKASGGETRDMPWKPGPHRVFVTQEEAHGVLHRRRRANAFLEELRPGSLERECKEEQCSFEEAREIFKDAERTKLFWISYSDGDQCASSPCQNGGSCKDQLQSYICFCLPAFEGRNCETHKDDQLICVNENGGCEQYCSDHTGTKRSCRCHEGYSLLADGVSCTPTVEYPCGKIPILEKRNASKPQGRIVGGKVCPKGECPWQVLLLVNGAQLCGGTLINTIWVVSAAHCFDKIKNWRNLIAVLGEHDLSEHDGDEQSRRVAQVIIPSTYVPGTTNHDIALLRLHQPVVLTDHVVPLCLPERTFSERTLAFVRFSLVSGWGQLLDRGATALELMVLNVPRLMTQDCLQQSRKVGDSPNITEYMFCAGYSDGSKDSCKGDSGGPHATHYRGTWYLTGIVSWGQGCATVGHFGVYTRVSQYIEWLQKLMRSEPRPGVLLRAPFP,466,NP_000122.1.csv,refseq-F7-NM_000131.4_clinical_seed_0_final,refseq-F7-NM_000131.4.a2m,Invitae,refseq-F7-NM_000131.4.npy,1,466,466
+NP_000124.1,MQRVNMIMAESPGLITICLLGYLLSAECTVFLDHENANKILNRPKRYNSGKLEEFVQGNLERECMEEKCSFEEAREVFENTERTTEFWKQYVDGDQCESNPCLNGGSCKDDINSYECWCPFGFEGKNCELDVTCNIKNGRCEQFCKNSADNKVVCSCTEGYRLAENQKSCEPAVPFPCGRVSVSQTSKLTRAETVFPDVDYVNSTEAETILDNITQSTQSFNDFTRVVGGEDAKPGQFPWQVVLNGKVDAFCGGSIVNEKWIVTAAHCVETGVKITVVAGEHNIEETEHTEQKRNVIRIIPHHNYNAAINKYNHDIALLELDEPLVLNSYVTPICIADKEYTNIFLKFGSGYVSGWGRVFHKGRSALVLQYLRVPLVDRATCLRSTKFTIYNNMFCAGFHEGGRDSCQGDSGGPHVTEVEGTSFLTGIISWGEECAMKGKYGIYTKVSRYVNWIKEKTKLT,461,NP_000124.1.csv,refseq-F9-NM_000133.3_clinical_seed_0_final,refseq-F9-NM_000133.3.a2m,Invitae,refseq-F9-NM_000133.3.npy,1,461,461
+NP_000126.2,MSDSWVPNSASGQDPGGRRRAWAELLAGRVKREKYNPERAQKLKESAVRLLRSHQDLNALLLEVEGPLCKKLSLSKVIDCDSSEAYANHSSSFIGSALQDQASRLGVPVGILSAGMVASSVGQICTAPAETSHPVLLTVEQRKKLSSLLEFAQYLLAHSMFSRLSFCQELWKIQSSLLLEAVWHLHVQGIVSLQELLESHPDMHAVGSWLFRNLCCLCEQMEASCQHADVARAMLSDFVQMFVLRGFQKNSDLRRTVEPEKMPQVTVDVLQRMLIFALDALAAGVQEESSTHKIVRCWFGVFSGHTLGSVISTDPLKRFFSHTLTQILTHSPVLKASDAVQMQREWSFARTHPLLTSLYRRLFVMLSAEELVGHLQEVLETQEVHWQRVLSFVSALVVCFPEAQQLLEDWVARLMAQAFESCQLDSMVTAFLVVRQAALEGPSAFLSYADWFKASFGSTRGYHGCSKKALVFLFTFLSELVPFESPRYLQVHILHPPLVPGKYRSLLTDYISLAKTRLADLKVSIENMGLYEDLSSAGDITEPHSQALQDVEKAIMVFEHTGNIPVTVMEASIFRRPYYVSHFLPALLTPRVLPKVPDSRVAFIESLKRADKIPPSLYSTYCQACSAAEEKPEDAALGVRAEPNSAEEPLGQLTAALGELRASMTDPSQRDVISAQVAVISERLRAVLGHNEDDSSVEISKIQLSINTPRLEPREHMAVDLLLTSFCQNLMAASSVAPPERQGPWAALFVRTMCGRVLPAVLTRLCQLLRHQGPSLSAPHVLGLAALAVHLGESRSALPEVDVGPPAPGAGLPVPALFDSLLTCRTRDSLFFCLKFCTAAISYSLCKFSSQSRDTLCSCLSPGLIKKFQFLMFRLFSEARQPLSEEDVASLSWRPLHLPSADWQRAALSLWTHRTFREVLKEEDVHLTYQDWLHLELEIQPEADALSDTERQDFHQWAIHEHFLPESSASGGCDGDLQAACTILVNALMDFHQSSRSYDHSENSDLVFGGRTGNEDIISRLQEMVADLELQQDLIVPLGHTPSQEHFLFEIFRRRLQALTSGWSVAASLQRQRELLMYKRILLRLPSSVLCGSSFQAEQPITARCEQFFHLVNSEMRNFCSHGGALTQDITAHFFRGLLNACLRSRDPSLMVDFILAKCQTKCPLILTSALVWWPSLEPVLLCRWRRHCQSPLPRELQKLQEGRQFASDFLSPEAASPAPNPDWLSAAALHFAIQQVREENIRKQLKKLDCEREELLVFLFFFSLMGLLSSHLTSNSTTDLPKAFHVCAAILECLEKRKISWLALFQLTESDLRLGRLLLRVAPDQHTRLLPFAFYSLLSYFHEDAAIREEAFLHVAVDMYLKLVQLFVAGDTSTVSPPAGRSLELKGQGNPVELITKARLFLLQLIPRCPKKSFSHVAELLADRGDCDPEVSAALQSRQQAAPDADLSQEPHLF,1455,NP_000126.2.csv,refseq-FANCA-NM_000135.2_clinical_seed_0_final,refseq-FANCA-NM_000135.2.a2m,Invitae,refseq-FANCA-NM_000135.2.npy,1,1455,1455
+NP_000127.2,MAQDSVDLSCDYQFWMQKLSVWDQASTLETQQDTCLHVAQFQEFLRKMYEALKEMDSNTVIERFPTIGQLLAKACWNPFILAYDESQKILIWCLCCLINKEPQNSGQSKLNSWIQGVLSHILSALRFDKEVALFTQGLGYAPIDYYPGLLKNMVLSLASELRENHLNGFNTQRRMAPERVASLSRVCVPLITLTDVDPLVEALLICHGREPQEILQPEFFEAVNEAILLKKISLPMSAVVCLWLRHLPSLEKAMLHLFEKLISSERNCLRRIECFIKDSSLPQAACHPAIFRVVDEMFRCALLETDGALEIIATIQVFTQCFVEALEKASKQLRFALKTYFPYTSPSLAMVLLQDPQDIPRGHWLQTLKHISELLREAVEDQTHGSCGGPFESWFLFIHFGGWAEMVAEQLLMSAAEPPTALLWLLAFYYGPRDGRQQRAQTMVQVKAVLGHLLAMSRSSSLSAQDLQTVAGQGTDTDLRAPAQQLIRHLLLNFLLWAPGGHTIAWDVITLMAHTAEITHEIIGFLDQTLYRWNRLGIESPRSEKLARELLKELRTQV,558,NP_000127.2.csv,refseq-FANCC-NM_000136.2_clinical_seed_0_final,refseq-FANCC-NM_000136.2.a2m,Invitae,refseq-FANCC-NM_000136.2.npy,1,558,558
+NP_000128.1,MSFIPVAEDSDFPIHNLPYGVFSTRGDPRPRIGVAIGDQILDLSIIKHLFTGPVLSKHQDVFNQPTLNSFMGLGQAAWKEARVFLQNLLSVSQARLRDDTELRKCAFISQASATMHLPATIGDYTDFYSSRQHATNVGIMFRDKENALMPNWLHLPVGYHGRASSVVVSGTPIRRPMGQMKPDDSKPPVYGACKLLDMELEMAFFVGPGNRLGEPIPISKAHEHIFGMVLMNDWSARDIQKWEYVPLGPFLGKSFGTTVSPWVVPMDALMPFAVPNPKQDPRPLPYLCHDEPYTFDINLSVNLKGEGMSQAATICKSNFKYMYWTMLQQLTHHSVNGCNLRPGDLLASGTISGPEPENFGSMLELSWKGTKPIDLGNGQTRKFLLDGDEVIITGYCQGDGYRIGFGQCAGKVLPALLPS,419,NP_000128.1.csv,refseq-FAH-NM_000137.2_clinical_seed_0_final,refseq-FAH-NM_000137.2.a2m,Invitae,refseq-FAH-NM_000137.2.npy,1,419,419
+NP_000131.2,MRSLGANMAAALRAAGVLLRDPLASSSWRVCQPWRWKSGAAAAAVTTETAQHAQGAKPQVQPQKRKPKTGILMLNMGGPETLGDVHDFLLRLFLDRDLMTLPIQNKLAPFIAKRRTPKIQEQYRRIGGGSPIKIWTSKQGEGMVKLLDELSPNTAPHKYYIGFRYVHPLTEEAIEEMERDGLERAIAFTQYPQYSCSTTGSSLNAIYRYYNQVGRKPTMKWSTIDRWPTHHLLIQCFADHILKELDHFPLEKRSEVVILFSAHSLPMSVVNRGDPYPQEVSATVQKVMERLEYCNPYRLVWQSKVGPMPWLGPQTDESIKGLCERGRKNILLVPIAFTSDHIETLYELDIEYSQVLAKECGVENIRRAESLNGNPLFSKALADLVHSHIQSNELCSKQLTLSCPLCVNPVCRETKSFFTSQQL,423,NP_000131.2.csv,refseq-FECH-NM_000140.3_clinical_seed_0_final,refseq-FECH-NM_000140.3.a2m,Invitae,refseq-FECH-NM_000140.3.npy,1,423,423
+NP_000132.3,MVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEEPPTKYQISQPEVYVAAPGESLEVRCLLKDAAVISWTKDGVHLGPNNRTVLIGEYLQIKGATPRDSGLYACTASRTVDSETWYFMVNVTDAISSGDDEDDTDGAEDFVSENSNNKRAPYWTNTEKMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVVGGDVEFVCKVYSDAQPHIQWIKHVEKNGSKYGPDGLPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAGNSIGISFHSAWLTVLPAPGREKEITASPDYLEIAIYCIGVFLIACMVVTVILCRMKNTTKKPDFSSQPAVHKLTKRIPLRRQVTVSAESSSSMNSNTPLVRITTRLSSTADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIDKDKPKEAVTVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIHRDLAARNVLVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVLMWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRILTLTTNEEYLDLSQPLEQYSPSYPDTRSSCSSGDDSVFSPDPMPYEPCLPQYPHINGSVKT,821,NP_000132.3.csv,refseq-FGFR2-NM_000141.4_clinical_seed_0_final,refseq-FGFR2-NM_000141.4.a2m,Invitae,refseq-FGFR2-NM_000141.4.npy,1,821,821
+NP_000134.2,MYRALRLLARSRPLVRAPAAALASAPGLGGAAVPSFWPPNAARMASQNSFRIEYDTFGELKVPNDKYYGAQTVRSTMNFKIGGVTERMPTPVIKAFGILKRAAAEVNQDYGLDPKIANAIMKAADEVAEGKLNDHFPLVVWQTGSGTQTNMNVNEVISNRAIEMLGGELGSKIPVHPNDHVNKSQSSNDTFPTAMHIAAAIEVHEVLLPGLQKLHDALDAKSKEFAQIIKIGRTHTQDAVPLTLGQEFSGYVQQVKYAMTRIKAAMPRIYELAAGGTAVGTGLNTRIGFAEKVAAKVAALTGLPFVTAPNKFEALAAHDALVELSGAMNTTACSLMKIANDIRFLGSGPRSGLGELILPENEPGSSIMPGKVNPTQCEAMTMVAAQVMGNHVAVTVGGSNGHFELNVFKPMMIKNVLHSARLLGDASVSFTENCVVGIQANTERINKLMNESLMLVTALNPHIGYDKAAKIAKTAHKNGSTLKETAIELGYLTAEQFDEWVKPKDMLGPK,510,NP_000134.2.csv,refseq-FH-NM_000143.3_clinical_seed_0_final,refseq-FH-NM_000143.3.a2m,Invitae,refseq-FH-NM_000143.3.npy,1,510,510
+NP_000135.2,MWTLGRRAVAGLLASPSPAQAQTLTRVPRPAELAPLCGRRGLRTDIDATCTPRRASSNQRGLNQIWNVKKQSVYLMNLRKSGTLGHPGSLDETTYERLAEETLDSLAEFFEDLADKPYTFEDYDVSFGSGVLTVKLGGDLGTYVINKQTPNKQIWLSSPSSGPKRYDWTGKNWVYSHDGVSLHELLAAELTKALKTKLDLSSLAYSGKDA,210,NP_000135.2.csv,refseq-FXN-NM_000144.4_clinical_seed_0_final,refseq-FXN-NM_000144.4.a2m,Invitae,refseq-FXN-NM_000144.4.npy,1,210,210
+NP_000136.2,MALLLVSLLAFLSLGSGCHHRICHCSNRVFLCQESKVTEIPSDLPRNAIELRFVLTKLRVIQKGAFSGFGDLEKIEISQNDVLEVIEADVFSNLPKLHEIRIEKANNLLYINPEAFQNLPNLQYLLISNTGIKHLPDVHKIHSLQKVLLDIQDNINIHTIERNSFVGLSFESVILWLNKNGIQEIHNCAFNGTQLDELNLSDNNNLEELPNDVFHGASGPVILDISRTRIHSLPSYGLENLKKLRARSTYNLKKLPTLEKLVALMEASLTYPSHCCAFANWRRQISELHPICNKSILRQEVDYMTQARGQRSSLAEDNESSYSRGFDMTYTEFDYDLCNEVVDVTCSPKPDAFNPCEDIMGYNILRVLIWFISILAITGNIIVLVILTTSQYKLTVPRFLMCNLAFADLCIGIYLLLIASVDIHTKSQYHNYAIDWQTGAGCDAAGFFTVFASELSVYTLTAITLERWHTITHAMQLDCKVQLRHAASVMVMGWIFAFAAALFPIFGISSYMKVSICLPMDIDSPLSQLYVMSLLVLNVLAFVVICGCYIHIYLTVRNPNIVSSSSDTRIAKRMAMLIFTDFLCMAPISFFAISASLKVPLITVSKAKILLVLFHPINSCANPFLYAIFTKNFRRDFFILLSKCGCYEMQAQIYRTETSSTVHNTHPRNGHCSSAPRVTSGSTYILVPLSHLAQN,695,NP_000136.2.csv,refseq-FSHR-NM_000145.3_clinical_seed_0_final,refseq-FSHR-NM_000145.3.a2m,Invitae,refseq-FSHR-NM_000145.3.npy,1,695,695
+NP_000137.2,MSSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDRDDVALEGVSHFFRELAEEKREGYERLLKMQNQRGGRALFQDIKKPAEDEWGKTPDAMKAAMALEKKLNQALLDLHALGSARTDPHLCDFLETHFLDEEVKLIKKMGDHLTNLHRLGGPEAGLGEYLFERLTLKHD,175,NP_000137.2.csv,refseq-FTL-NM_000146.3_clinical_seed_0_final,refseq-FTL-NM_000146.3.a2m,Invitae,refseq-FTL-NM_000146.3.npy,1,175,175
+NP_000138.2,MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGVFIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK,466,NP_000138.2.csv,refseq-FUCA1-NM_000147.4_clinical_seed_0_final,refseq-FUCA1-NM_000147.4.a2m,Invitae,refseq-FUCA1-NM_000147.4.npy,1,466,466
+NP_000142.2,MEEGMNVLHDFGIQSTHYLQVNYQDSQDWFILVSVIADLRNAFYVLFPIWFHLQEAVGIKLLWVAVIGDWLNLVFKWILFGQRPYWWVLDTDYYSNTSVPLIKQFPVTCETGPGSPSGHAMGTAGVYYVMVTSTLSIFQGKIKPTYRFRCLNVILWLGFWAVQLNVCLSRIYLAAHFPHQVVAGVLSGIAVAETFSHIHSIYNASLKKYFLITFFLFSFAIGFYLLLKGLGVDLLWTLEKAQRWCEQPEWVHIDTTPFASLLKNLGTLFGLGLALNSSMYRESCKGKLSKWLPFRLSSIVASLVLLHVFDSLKPPSQVELVFYVLSFCKSAVVPLASVSVIPYCLAQVLGQPHKKSL,357,NP_000142.2.csv,refseq-G6PC-NM_000151.3_clinical_seed_0_final,refseq-G6PC-NM_000151.3.a2m,Invitae,refseq-G6PC-NM_000151.3.npy,1,357,357
+NP_000143.2,MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSGSSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQCEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFFPKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPHVHSRAPSPLYSVEFSEEPFGVIVRRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRITLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTGGILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMTRAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSGPAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHDQVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLSTHYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQLASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQEPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTVDHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPVEALGSLPPPPAAPREPAIHSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGELFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQQVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC,952,NP_000143.2.csv,refseq-GAA-NM_000152.3_clinical_seed_0_final,refseq-GAA-NM_000152.3.a2m,Invitae,refseq-GAA-NM_000152.3.npy,1,952,952
+NP_000144.2,MAEWLLSASWQRRAKAMTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGAVSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFTSGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR,685,NP_000144.2.csv,refseq-GALC-NM_000153.4_clinical_seed_0_final,refseq-GALC-NM_000153.4.a2m,Invitae,refseq-GALC-NM_000153.4.npy,1,685,685
+NP_000147.1,MSAPSATPIFAPGENCSPAWGAAPAAYDAADTHLRILGKPVMERWETPYMHALAAAASSKGGRVLEVGFGMAIAASKVQEAPIDEHWIIECNDGVFQRLRDWAPRQTHKVIPLKGLWEDVAPTLPDGHFDGILYDTYPLSEETWHTHQFNFIKNHAFRLLKPGGVLTYCNLTSWGELMKSKYSDITIMFEETQVPALLEAGFRRENIRTEVMALVPPADCRYYAFPQMITPLVTKG,236,NP_000147.1.csv,refseq-GAMT-NM_000156.5_clinical_seed_0_final,refseq-GAMT-NM_000156.5.a2m,Invitae,refseq-GAMT-NM_000156.5.npy,1,236,236
+NP_000149.4,MAAPMTPAARPEDYEAALNAALADVPELARLLEIDPYLKPYAVDFQRRYKQFSQILKNIGENEGGIDKFSRGYESFGVHRCADGGLYCKEWAPGAEGVFLTGDFNGWNPFSYPYKKLDYGKWELYIPPKQNKSVLVPHGSKLKVVITSKSGEILYRISPWAKYVVREGDNVNYDWIHWDPEHSYEFKHSRPKKPRSLRIYESHVGISSHEGKVASYKHFTCNVLPRIKGLGYNCIQLMAIMEHAYYASFGYQITSFFAASSRYGTPEELQELVDTAHSMGIIVLLDVVHSHASKNSADGLNMFDGTDSCYFHSGPRGTHDLWDSRLFAYSSWEILRFLLSNIRWWLEEYRFDGFRFDGVTSMLYHHHGVGQGFSGDYSEYFGLQVDEDALTYLMLANHLVHTLCPDSITIAEDVSGMPALCSPISQGGGGFDYRLAMAIPDKWIQLLKEFKDEDWNMGDIVYTLTNRRYLEKCIAYAESHDQALVGDKSLAFWLMDAEMYTNMSVLTPFTPVIDRGIQLHKMIRLITHGLGGEGYLNFMGNEFGHPEWLDFPRKGNNESYHYARRQFHLTDDDLLRYKFLNNFDRDMNRLEERYGWLAAPQAYVSEKHEGNKIIAFERAGLLFIFNFHPSKSYTDYRVGTALPGKFKIVLDSDAAEYGGHQRLDHSTDFFSEAFEHNGRPYSLLVYIPSRVALILQNVDLPN,702,NP_000149.4.csv,refseq-GBE1-NM_000158.4_clinical_seed_0_final,refseq-GBE1-NM_000158.4.a2m,Invitae,refseq-GBE1-NM_000158.4_theta_0.2.npy,1,702,702
+NP_000150.1,MALRGVSVRLLSRGPGLHVLRTWVSSAAQTEKGGRTQSQLAKSSRPEFDWQDPLVLEEQLTTDEILIRDTFRTYCQERLMPRILLANRNEVFHREIISEMGELGVLGPTIKGYGCAGVSSVAYGLLARELERVDSGYRSAMSVQSSLVMHPIYAYGSEEQRQKYLPQLAKGELLGCFGLTEPNSGSDPSSMETRAHYNSSNKSYTLNGTKTWITNSPMADLFVVWARCEDGCIRGFLLEKGMRGLSAPRIQGKFSLRASATGMIIMDGVEVPEENVLPGASSLGGPFGCLNNARYGIAWGVLGASEFCLHTARQYALDRMQFGVPLARNQLIQKKLADMLTEITLGLHACLQLGRLKDQDKAAPEMVSLLKRNNCGKALDIARQARDMLGGNGISDEYHVIRHAMNLEAVNTYEGTHDIHALILGRAITGIQAFTASK,438,NP_000150.1.csv,refseq-GCDH-NM_000159.3_clinical_seed_0_final,refseq-GCDH-NM_000159.3.a2m,Invitae,refseq-GCDH-NM_000159.3.npy,1,438,438
+NP_000153.1,MLDDRARMEAAKKEKVEQILAEFQLQEEDLKKVMRRMQKEMDRGLRLETHEEASVKMLPTYVRSTPEGSEVGDFLSLDLGGTNFRVMLVKVGEGEEGQWSVKTKHQMYSIPEDAMTGTAEMLFDYISECISDFLDKHQMKHKKLPLGFTFSFPVRHEDIDKGILLNWTKGFKASGAEGNNVVGLLRDAIKRRGDFEMDVVAMVNDTVATMISCYYEDHQCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCVNTEWGAFGDSGELDEFLLEYDRLVDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDENLLFHGEASEQLRTRGAFETRFVSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACESVSTRAAHMCSAGLAGVINRMRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPSCEITFIESEEGSGRGAALVSAVACKKACMLGQ,465,NP_000153.1.csv,refseq-GCK-NM_000162.3_clinical_seed_0_final,refseq-GCK-NM_000162.3.a2m,Invitae,refseq-GCK-NM_000162.3.npy,1,465,465
+NP_000154.1,MDLWQLLLTLALAGSSDAFSGSEATAAILSRAPWSLQSVNPGLKTNSSKEPKFTKCRSPERETFSCHWTDEVHHGTKNLGPIQLFYTRRNTQEWTQEWKECPDYVSAGENSCYFNSSFTSIWIPYCIKLTSNGGTVDEKCFSVDEIVQPDPPIALNWTLLNVSLTGIHADIQVRWEAPRNADIQKGWMVLEYELQYKEVNETKWKMMDPILTTSVPVYSLKVDKEYEVRVRSKQRNSGNYGEFSEVLYVTLPQMSQFTCEEDFYFPWLLIIIFGIFGLTVMLFVFLFSKQQRIKMLILPPVPVPKIKGIDPDLLKEGKLEEVNTILAIHDSYKPEFHSDDSWVEFIELDIDEPDEKTEESDTDRLLSSDHEKSHSNLGVKDGDSGRTSCCEPDILETDFNANDIHEGTSEVAQPQRLKGEADLLCLDQKNQNNSPYHDACPATQQPSVIQAEKNKPQPLPTEGAESTHQAAHIQLSNPSSLSNIDFYAQVSDITPAGSVVLSPGQKNKAGMSQCDMHPEMVSLCQENFLMDNAYFCEADAKKCIPVAPHIKVESHIQPSLNQEDIYITTESLTTAAGRPGTGEHVPGSEMPVPDYTSIHIVQSPQGLILNATALPLPDKEFLSSCGYVSTDQLNKIMP,638,NP_000154.1.csv,refseq-GHR-NM_000163.4_clinical_seed_0_final,refseq-GHR-NM_000163.4.a2m,Invitae,refseq-GHR-NM_000163.4.npy,1,638,638
+NP_000156.1,MGDWSALGKLLDKVQAYSTAGGKVWLSVLFIFRILLLGTAVESAWGDEQSAFRCNTQQPGCENVCYDKSFPISHVRFWVLQIIFVSVPTLLYLAHVFYVMRKEEKLNKKEEELKVAQTDGVNVDMHLKQIEIKKFKYGIEEHGKVKMRGGLLRTYIISILFKSIFEVAFLLIQWYIYGFSLSAVYTCKRDPCPHQVDCFLSRPTEKTIFIIFMLVVSLVSLALNIIELFYVFFKGVKDRVKGKSDPYHATSGALSPAKDCGSQKYAYFNGCSSPTAPLSPMSPPGYKLVTGDRNNSSCRNYNKQASEQNWANYSAEQNRMGQAGSTISNSHAQPFDFPDDNQNSKKLAAGHELQPLAIVDQRPSSRASSRASSRPRPDDLEI,382,NP_000156.1.csv,refseq-GJA1-NM_000165.4_clinical_seed_0_final,refseq-GJA1-NM_000165.4.a2m,Invitae,refseq-GJA1-NM_000165.4.npy,1,382,382
+NP_000157.1,MNWTGLYTLLSGVNRHSTAIGRVWLSVIFIFRIMVLVVAAESVWGDEKSSFICNTLQPGCNSVCYDQFFPISHVRLWSLQLILVSTPALLVAMHVAHQQHIEKKMLRLEGHGDPLHLEEVKRHKVHISGTLWWTYVISVVFRLLFEAVFMYVFYLLYPGYAMVRLVKCDVYPCPNTVDCFVSRPTEKTVFTVFMLAASGICIILNVAEVVYLIIRACARRAQRRSNPPSRKGSGFGHRLSPEYKQNEINKLLSEQDGSLKDILRRSPGTGAGLAEKSDRCSAC,283,NP_000157.1.csv,refseq-GJB1-NM_000166.5_clinical_seed_0_final,refseq-GJB1-NM_000166.5.a2m,Invitae,refseq-GJB1-NM_000166.5.npy,1,283,283
+NP_000159.3,MEAQSHSSTTTEKKKVENSIVKCSTRTDVSEKAVASSTTSNEDESPGQTYHRERRNAITMQPQNVQGLSKVSEEPSTSSDERASLIKKEIHGSLPHVAEPSVPYRGTVFAMDPRNGYMEPHYHPPHLFPAFHPPVPIDARHHEGRYHYDPSPIPPLHMTSALSSSPTYPDLPFIRISPHRNPTAASESPFSPPHPYINPYMDYIRSLHSSPSLSMISATRGLSPTDAPHAGVSPAEYYHQMALLTGQRSPYADIIPSAATAGTGAIHMEYLHAMDSTRFSSPRLSARPSRKRTLSISPLSDHSFDLQTMIRTSPNSLVTILNNSRSSSSASGSYGHLSASAISPALSFTYSSAPVSLHMHQQILSRQQSLGSAFGHSPPLIHPAPTFPTQRPIPGIPTVLNPVQVSSGPSESSQNKPTSESAVSSTGDPMHNKRSKIKPDEDLPSPGARGQQEQPEGTTLVKEEGDKDESKQEPEVIYETNCHWEGCAREFDTQEQLVHHINNDHIHGEKKEFVCRWLDCSREQKPFKAQYMLVVHMRRHTGEKPHKCTFEGCTKAYSRLENLKTHLRSHTGEKPYVCEHEGCNKAFSNASDRAKHQNRTHSNEKPYVCKIPGCTKRYTDPSSLRKHVKTVHGPEAHVTKKQRGDIHPRPPPPRDSGSHSQSRSPGRPTQGALGEQQDLSNTTSKREECLQVKTVKAEKPMTSQPSPGGQSSCSSQQSPISNYSNSGLELPLTDGGSIGDLSAIDETPIMDSTISTATTALALQARRNPAGTKWMEHVKLERLKQVNGMFPRLNPILPPKAPAVSPLIGNGTQSNNTCSLGGPMTLLPGRSDLSGVDVTMLNMLNRRDSSASTISSAYLSSRRSSGISPCFSSRRSSEASQAEGRPQNVSVADSYDPISTDASRRSSEASQSDGLPSLLSLTPAQQYRLKAKYAAATGGPPPTPLPNMERMSLKTRLALLGDALEPGVALPPVHAPRRCSDGGAHGYGRRHLQPHDAPGHGVRRASDPVRTGSEGLALPRVPRFSSLSSCNPPAMATSAEKRSLVLQNYTRPEGGQSRNFHSSPCPPSITENVTLESLTMDADANLNDEDFLPDDVVQYLNSQNQAGYEQHFPSALPDDSKVPHGPGDFDAPGLPDSHAGQQFHALEQPCPEGSKTDLPIQWNEVSSGSADLSSSKLKCGPRPAVPQTRAFGFCNGMVVHPQNPLRSGPAGGYQTLGENSNPYGGPEHLMLHNSPGSGTSGNAFHEQPCKAPQYGNCLNRQPVAPGALDGACGAGIQASKLKSTPMQGSGGQLNFGLPVAPNESAGSMVNGMQNQDPVGQGYLAHQLLGDSMQHPGAGRPGQQMLGQISATSHINIYQGPESCLPGAHGMGSQPSSLAVVRGYQPCASFGGSRRQAMPRDSLALQSGQLSDTSQTCRVNGIKMEMKGQPHPLCSNLQNYSGQFYDQTVGFSQQDTKAGSFSISDASCLLQGTSAKNSELLSPGANQVTSTVDSLDSHDLEGVQIDFDAIIDDGDHSSLMSGALSPSIIQNLSHSSSRLTTPRASLPFPALSMSTTNMAIGDMSSLLTSLAEESKFLAVMQ,1580,NP_000159.3.csv,refseq-GLI3-NM_000168.5_clinical_seed_0_final,refseq-GLI3-NM_000168.5.a2m,Invitae,refseq-GLI3-NM_000168.5.npy,1,1580,1580
+NP_000160.1,MQLRNPELHLGCALALRFLALVSWDIPGARALDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCIDDCWMAPQRDSEGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYKHMSLALNRTGRSIVYSCEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQEIGGPRSYTIAVASLGKGVACNPACFITQLLPVKRKLGFYEWTSRLRSHINPTGTVLLQLENTMQMSLKDLL,429,NP_000160.1.csv,refseq-GLA-NM_000169.2_clinical_seed_0_final,refseq-GLA-NM_000169.2.a2m,Invitae,refseq-GLA-NM_000169.2.npy,1,429,429
+NP_000161.2,MQSCARAWGLRLGRGVGGGRRLAGGSGPCWAPRSRDSSSGGGDSAAAGASRLLERLLPRHDDFARRHIGPGDKDQREMLQTLGLASIDELIEKTVPANIRLKRPLKMEDPVCENEILATLHAISSKNQIWRSYIGMGYYNCSVPQTILRNLLENSGWITQYTPYQPEVSQGRLESLLNYQTMVCDITGLDMANASLLDEGTAAAEALQLCYRHNKRRKFLVDPRCHPQTIAVVQTRAKYTGVLTELKLPCEMDFSGKDVSGVLFQYPDTEGKVEDFTELVERAHQSGSLACCATDLLALCILRPPGEFGVDIALGSSQRFGVPLGYGGPHAAFFAVRESLVRMMPGRMVGVTRDATGKEVYRLALQTREQHIRRDKATSNICTAQALLANMAAMFAIYHGSHGLEHIARRVHNATLILSEGLKRAGHQLQHDLFFDTLKIQCGCSVKEVLGRAAQRQINFRLFEDGTLGISLDETVNEKDLDDLLWIFGCESSAELVAESMGEECRGIPGSVFKRTSPFLTHQVFNSYHSETNIVRYMKKLENKDISLVHSMIPLGSCTMKLNSSSELAPITWKEFANIHPFVPLDQAQGYQQLFRELEKDLCELTGYDQVCFQPNSGAQGEYAGLATIRAYLNQKGEGHRTVCLIPKSAHGTNPASAHMAGMKIQPVEVDKYGNIDAVHLKAMVDKHKENLAAIMITYPSTNGVFEENISDVCDLIHQHGGQVYLDGANMNAQVGICRPGDFGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKKHLAPFLPNHPVISLKRNEDACPVGTVSAAPWGSSSILPISWAYIKMMGGKGLKQATETAILNANYMAKRLETHYRILFRGARGYVGHEFILDTRPFKKSANIEAVDVAKRLQDYGFHAPTMSWPVAGTLMVEPTESEDKAELDRFCDAMISIRQEIADIEEGRIDPRVNPLKMSPHSLTCVTSSHWDRPYSREVAAFPLPFVKPENKFWPTIARIDDIYGDQHLVCTCPPMEVYESPFSEQKRASS,1020,NP_000161.2.csv,refseq-GLDC-NM_000170.2_clinical_seed_0_final,refseq-GLDC-NM_000170.2.a2m,Invitae,refseq-GLDC-NM_000170.2.npy,1,1020,1020
+NP_000162.2,MYSFNTLRLYLWETIVFFSLAASKEAEAARSAPKPMSPSDFLDKLMGRTSGYDARIRPNFKGPPVNVSCNIFINSFGSIAETTMDYRVNIFLRQQWNDPRLAYNEYPDDSLDLDPSMLDSIWKPDLFFANEKGAHFHEITTDNKLLRISRNGNVLYSIRITLTLACPMDLKNFPMDVQTCIMQLESFGYTMNDLIFEWQEQGAVQVADGLTLPQFILKEEKDLRYCTKHYNTGKFTCIEARFHLERQMGYYLIQMYIPSLLIVILSWISFWINMDAAPARVGLGITTVLTMTTQSSGSRASLPKVSYVKAIDIWMAVCLLFVFSALLEYAAVNFVSRQHKELLRFRRKRRHHKEDEAGEGRFNFSAYGMGPACLQAKDGISVKGANNSNTTNPPPAPSKSPEEMRKLFIQRAKKIDKISRIGFPMAFLIFNMFYWIIYKIVRREDVHNQ,449,NP_000162.2.csv,refseq-GLRA1-NM_000171.3_clinical_seed_0_final,refseq-GLRA1-NM_000171.3.a2m,Invitae,refseq-GLRA1-NM_000171.3.npy,1,449,449
+NP_000164.5,MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPDLPKDTTILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPTLGDEGDTDLYDYYPEEDTEGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASLDSQMPSSLHPTQESTKEQTTFPPRWTPNFTLHMESITFSKTPKSTTEPTPSPTTSEPVPEPAPNMTTLEPTPSPTTPEPTSEPAPSPTTPEPTSEPAPSPTTPEPTSEPAPSPTTPEPTPIPTIATSPTILVSATSLITPKSTFLTTTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFYVLGLFWLLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPRAWLLFLRGSLPTFRSSLFLWVRPNGRVGPLVAGRRPSALSQGRGQDLLSTVSIRYSGHSL,652,NP_000164.5.csv,refseq-GP1BA-NM_000173.6_clinical_seed_0_final,refseq-GP1BA-NM_000173.6.a2m,Invitae,refseq-GP1BA-NM_000173.6.npy,1,652,652
+NP_000165.1,MPAWGALFLLWATAEATKDCPSPCTCRALETMGLWVDCRGHGLTALPALPARTRHLLLANNSLQSVPPGAFDHLPQLQTLDVTQNPWHCDCSLTYLRLWLEDRTPEALLQVRCASPSLAAHGPLGRLTGYQLGSCGWQLQASWVRPGVLWDVALVAVAALGLALLAGLLCATTEALD,177,NP_000165.1.csv,refseq-GP9-NM_000174.4_clinical_seed_0_final,refseq-GP9-NM_000174.4.a2m,Invitae,refseq-GP9-NM_000174.4.npy,1,177,177
+NP_000166.2,MAALTRDPQFQKLQQWYREHRSELNLRRLFDANKDRFNHFSLTLNTNHGHILVDYSKNLVTEDVMRMLVDLAKSRGVEAARERMFNGEKINYTEGRAVLHVALRNRSNTPILVDGKDVMPEVNKVLDKMKSFCQRVRSGDWKGYTGKTITDVINIGIGGSDLGPLMVTEALKPYSSGGPRVWYVSNIDGTHIAKTLAQLNPESSLFIIASKTFTTQETITNAETAKEWFLQAAKDPSAVAKHFVALSTNTTKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDNFEQLLSGAHWMDQHFRTTPLEKNAPVLLALLGIWYINCFGCETHAMLPYDQYLHRFAAYFQQGDMESNGKYITKSGTRVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGLHHKILLANFLAQTEALMRGKSTEEARKELQAAGKSPEDLERLLPHKVFEGNRPTNSIVFTKLTPFMLGALVAMYEHKIFVQGIIWDINSFDQWGVELGKQLAKKIEPELDGSAQVTSHDASTNGLINFIKQQREARVQ,558,NP_000166.2.csv,refseq-GPI-NM_000175.3_clinical_seed_0_final,refseq-GPI-NM_000175.3.a2m,Invitae,refseq-GPI-NM_000175.3.npy,1,558,558
+NP_000168.1,MAPHRPAPALLCALSLALCALSLPVRAATASRGASQAGAPQGRVPEARPNSMVVEHPEFLKAGKEPGLQIWRVEKFDLVPVPTNLYGDFFTGDAYVILKTVQLRNGNLQYDLHYWLGNECSQDESGAAAIFTVQLDDYLNGRAVQHREVQGFESATFLGYFKSGLKYKKGGVASGFKHVVPNEVVVQRLFQVKGRRVVRATEVPVSWESFNNGDCFILDLGNNIHQWCGSNSNRYERLKATQVSKGIRDNERSGRARVHVSEEGTEPEAMLQVLGPKPALPAGTEDTAKEDAANRKLAKLYKVSNGAGTMSVSLVADENPFAQGALKSEDCFILDHGKDGKIFVWKGKQANTEERKAALKTASDFITKMDYPKQTQVSVLPEGGETPLFKQFFKNWRDPDQTDGLGLSYLSSHIANVERVPFDAATLHTSTAMAAQHGMDDDGTGQKQIWRIEGSNKVPVDPATYGQFYGGDSYIILYNYRHGGRQGQIIYNWQGAQSTQDEVAASAILTAQLDEELGGTPVQSRVVQGKEPAHLMSLFGGKPMIIYKGGTSREGGQTAPASTRLFQVRANSAGATRAVEVLPKAGALNSNDAFVLKTPSAAYLWVGTGASEAEKTGAQELLRVLRAQPVQVAEGSEPDGFWEALGGKAAYRTSPRLKDKKMDAHPPRLFACSNKIGRFVIEEVPGELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEKTEALTSAKRYIETDPANRDRRTPITVVKQGFEPPSFVGWFLGWDDDYWSVDPLDRAMAELAA,782,NP_000168.1.csv,refseq-GSN-NM_000177.4_clinical_seed_0_final,refseq-GSN-NM_000177.4.a2m,Invitae,refseq-GSN-NM_000177.4.npy,1,782,782
+NP_000169.1,MATNWGSLLQDKQQLEELARQAVDRALAEGVLLRTSQEPTSSEVVSYAPFTLFPSLVPSALLEQAYAVQMDFNLLVDAVSQNAAFLEQTLSSTIKQDDFTARLFDIHKQVLKEGIAQTVFLGLNRSDYMFQRSADGSPALKQIEINTISASFGGLASRTPAVHRHVLSVLSKTKEAGKILSNNPSKGLALGIAKAWELYGSPNALVLLIAQEKERNIFDQRAIENELLARNIHVIRRTFEDISEKGSLDQDRRLFVDGQEIAVVYFRDGYMPRQYSLQNWEARLLLERSHAAKCPDIATQLAGTKKVQQELSRPGMLEMLLPGQPEAVARLRATFAGLYSLDVGEEGDQAIAEALAAPSRFVLKPQREGGGNNLYGEEMVQALKQLKDSEERASYILMEKIEPEPFENCLLRPGSPARVVQCISELGIFGVYVRQEKTLVMNKHVGHLLRTKAIEHADGGVAAGVAVLDNPYPV,474,NP_000169.1.csv,refseq-GSS-NM_000178.2_clinical_seed_0_final,refseq-GSS-NM_000178.2.a2m,Invitae,refseq-GSS-NM_000178.2.npy,1,474,474
+NP_000170.1,MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSPGGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDKSEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSDEISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETKNTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFDASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRVVRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLGKFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPGSQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALSALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTLNNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLMVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKIIDFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDTAFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGIGRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPFLELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL,1360,NP_000170.1.csv,refseq-MSH6-NM_000179.2_clinical_seed_0_final,refseq-MSH6-NM_000179.2.a2m,Invitae,refseq-MSH6-NM_000179.2.npy,1,1360,1360
+NP_000171.1,MTACARRAGGLPDPGLCGPAWWAPSLPRLPRALPRLPLLLLLLLLQPPALSAVFTVGVLGPWACDPIFSRARPDLAARLAAARLNRDPGLAGGPRFEVALLPEPCRTPGSLGAVSSALARVSGLVGPVNPAACRPAELLAEEAGIALVPWGCPWTQAEGTTAPAVTPAADALYALLRAFGWARVALVTAPQDLWVEAGRSLSTALRARGLPVASVTSMEPLDLSGAREALRKVRDGPRVTAVIMVMHSVLLGGEEQRYLLEAAEELGLTDGSLVFLPFDTIHYALSPGPEALAALANSSQLRRAHDAVLTLTRHCPSEGSVLDSLRRAQERRELPSDLNLQQVSPLFGTIYDAVFLLARGVAEARAAAGGRWVSGAAVARHIRDAQVPGFCGDLGGDEEPPFVLLDTDAAGDRLFATYMLDPARGSFLSAGTRMHFPRGGSAPGPDPSCWFDPNNICGGGLEPGLVFLGFLLVVGMGLAGAFLAHYVRHRLLHMQMVSGPNKIILTVDDITFLHPHGGTSRKVAQGSRSSLGARSMSDIRSGPSQHLDSPNIGVYEGDRVWLKKFPGDQHIAIRPATKTAFSKLQELRHENVALYLGLFLARGAEGPAALWEGNLAVVSEHCTRGSLQDLLAQREIKLDWMFKSSLLLDLIKGIRYLHHRGVAHGRLKSRNCIVDGRFVLKITDHGHGRLLEAQKVLPEPPRAEDQLWTAPELLRDPALERRGTLAGDVFSLAIIMQEVVCRSAPYAMLELTPEEVVQRVRSPPPLCRPLVSMDQAPVECILLMKQCWAEQPELRPSMDHTFDLFKNINKGRKTNIIDSMLRMLEQYSSNLEDLIRERTEELELEKQKTDRLLTQMLPPSVAEALKTGTPVEPEYFEQVTLYFSDIVGFTTISAMSEPIEVVDLLNDLYTLFDAIIGSHDVYKVETIGDAYMVASGLPQRNGQRHAAEIANMSLDILSAVGTFRMRHMPEVPVRIRIGLHSGPCVAGVVGLTMPRYCLFGDTVNTASRMESTGLPYRIHVNLSTVGILRALDSGYQVELRGRTELKGKGAEDTFWLVGRRGFNKPIPKPPDLQPGSSNHGISLQEIPPERRRKLEKARPGQFS,1103,NP_000171.1.csv,refseq-GUCY2D-NM_000180.3_clinical_seed_0_final,refseq-GUCY2D-NM_000180.3.a2m,Invitae,refseq-GUCY2D-NM_000180.3.npy,1,1103,1103
+NP_000172.2,MARGSAVAWAALGPLLWGCALGLQGGMLYPQESPSRECKELDGLWSFRADFSDNRRRGFEEQWYRRPLWESGPTVDMPVPSSFNDISQDWRLRHFVGWVWYEREVILPERWTQDLRTRVVLRIGSAHSYAIVWVNGVDTLEHEGGYLPFEADISNLVQVGPLPSRLRITIAINNTLTPTTLPPGTIQYLTDTSKYPKGYFVQNTYFDFFNYAGLQRSVLLYTTPTTYIDDITVTTSVEQDSGLVNYQISVKGSNLFKLEVRLLDAENKVVANGTGTQGQLKVPGVSLWWPYLMHERPAYLYSLEVQLTAQTSLGPVSDFYTLPVGIRTVAVTKSQFLINGKPFYFHGVNKHEDADIRGKGFDWPLLVKDFNLLRWLGANAFRTSHYPYAEEVMQMCDRYGIVVIDECPGVGLALPQFFNNVSLHHHMQVMEEVVRRDKNHPAVVMWSVANEPASHLESAGYYLKMVIAHTKSLDPSRPVTFVSNSNYAADKGAPYVDVICLNSYYSWYHDYGHLELIQLQLATQFENWYKKYQKPIIQSEYGAETIAGFHQDPPLMFTEEYQKSLLEQYHLGLDQKRRKYVVGELIWNFADFMTEQSPTRVLGNKKGIFTRQRQPKSAAFLLRERYWKIANETRYPHSVAKSQCLENSLFT,651,NP_000172.2.csv,refseq-GUSB-NM_000181.3_clinical_seed_0_final,refseq-GUSB-NM_000181.3.a2m,Invitae,refseq-GUSB-NM_000181.3.npy,1,651,651
+NP_000173.2,MVACRAIGILSRFSAFRILRSRGYICRNFTGSSALLTRTHINYGVKGDVAVVRINSPNSKVNTLSKELHSEFSEVMNEIWASDQIRSAVLISSKPGCFIAGADINMLAACKTLQEVTQLSQEAQRIVEKLEKSTKPIVAAINGSCLGGGLEVAISCQYRIATKDRKTVLGTPEVLLGALPGAGGTQRLPKMVGVPAALDMMLTGRSIRADRAKKMGLVDQLVEPLGPGLKPPEERTIEYLEEVAITFAKGLADKKISPKRDKGLVEKLTAYAMTIPFVRQQVYKKVEEKVRKQTKGLYPAPLKIIDVVKTGIEQGSDAGYLCESQKFGELVMTKESKALMGLYHGQVLCKKNKFGAPQKDVKHLAILGAGLMGAGIAQVSVDKGLKTILKDATLTALDRGQQQVFKGLNDKVKKKALTSFERDSIFSNLTGQLDYQGFEKADMVIEAVFEDLSLKHRVLKEVEAVIPDHCIFASNTSALPISEIAAVSKRPEKVIGMHYFSPVDKMQLLEIITTEKTSKDTSASAVAVGLKQGKVIIVVKDGPGFYTTRCLAPMMSEVIRILQEGVDPKKLDSLTTSFGFPVGAATLVDEVGVDVAKHVAEDLGKVFGERFGGGNPELLTQMVSKGFLGRKSGKGFYIYQEGVKRKDLNSDMDSILASLKLPPKSEVSSDEDIQFRLVTRFVNEAVMCLQEGILATPAEGDIGAVFGLGFPPCLGGPFRFVDLYGAQKIVDRLKKYEAAYGKQFTPCQLLADHANSPNKKFYQ,763,NP_000173.2.csv,refseq-HADHA-NM_000182.4_clinical_seed_0_final,refseq-HADHA-NM_000182.4.a2m,Invitae,refseq-HADHA-NM_000182.4.npy,1,763,763
+NP_000174.1,MTILTYPFKNLPTASKWALRFSIRPLSCSSQLRAAPAVQTKTKKTLAKPNIRNVVVVDGVRTPFLLSGTSYKDLMPHDLARAALTGLLHRTSVPKEVVDYIIFGTVIQEVKTSNVAREAALGAGFSDKTPAHTVTMACISANQAMTTGVGLIASGQCDVIVAGGVELMSDVPIRHSRKMRKLMLDLNKAKSMGQRLSLISKFRFNFLAPELPAVSEFSTSETMGHSADRLAAAFAVSRLEQDEYALRSHSLAKKAQDEGLLSDVVPFKVPGKDTVTKDNGIRPSSLEQMAKLKPAFIKPYGTVTAANSSFLTDGASAMLIMAEEKALAMGYKPKAYLRDFMYVSQDPKDQLLLGPTYATPKVLEKAGLTMNDIDAFEFHEAFSGQILANFKAMDSDWFAENYMGRKTKVGLPPLEKFNNWGGSLSLGHPFGATGCRLVMAAANRLRKEGGQYGLVAACAAGGQGHAMIVEAYPK,474,NP_000174.1.csv,refseq-HADHB-NM_000183.2_clinical_seed_0_final,refseq-HADHB-NM_000183.2.a2m,Invitae,refseq-HADHB-NM_000183.2_theta_0.2.npy,1,474,474
+NP_000175.1,MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH,147,NP_000175.1.csv,refseq-HBG2-NM_000184.2_clinical_seed_0_final,refseq-HBG2-NM_000184.2.a2m,Invitae,refseq-HBG2-NM_000184.2.npy,1,147,147
+NP_000177.2,MRLLAKIICLMLWAICVAEDCNELPPRRNTEILTGSWSDQTYPEGTQAIYKCRPGYRSLGNVIMVCRKGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGNVFEYGVKAVYTCNEGYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSSAMEPDREYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPDVINGSPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCEEKSCDNPYIPNGDYSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTSTGWIPAPRCTLKPCDYPDIKHGGLYHENMRRPYFPVAVGKYYSYYCDEHFETPSGSYWDHIHCTQDGWSPAVPCLRKCYFPYLENGYNQNHGRKFVQGKSIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVKTCSKSSIDIENGFISESQYTYALKEKAKYQCKLGYVTADGETSGSITCGKDGWSAQPTCIKSCDIPVFMNARTKNDFTWFKLNDTLDYECHDGYESNTGSTTGSIVCGYNGWSDLPICYERECELPKIDVHLVPDRKKDQYKVGEVLKFSCKPGFTIVGPNSVQCYHFGLSPDLPICKEQVQSCGPPPELLNGNVKEKTKEEYGHSEVVEYYCNPRFLMKGPNKIQCVDGEWTTLPVCIVEESTCGDIPELEHGWAQLSSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKCKSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVNCSMAQIQLCPPPPQIPNSHNMTTTLNYRDGEKVSVLCQENYLIQEGEEITCKDGRWQSIPLCVEKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYTCEGGFRISEENETTCYMGKWSSPPQCEGLPCKSPPEISHGVVAHMSDSYQYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCIKTDCLSLPSFENAIPMGEKKDVYKAGEQVTYTCATYYKMDGASNVTCINSRWTGRPTCRDTSCVNPPTVQNAYIVSRQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNWTEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLEGNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYSRTGESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR,1231,NP_000177.2.csv,refseq-CFH-NM_000186.3_clinical_seed_0_final,refseq-CFH-NM_000186.3.a2m,Invitae,refseq-CFH-NM_000186.3.npy,1,1231,1231
+NP_000178.2,MAELKYISGFGNECSSEDPRCPGSLPEGQNNPQVCPYNLYAEQLSGSAFTCPRSTNKRSWLYRILPSVSHKPFESIDEGQVTHNWDEVDPDPNQLRWKPFEIPKASQKKVDFVSGLHTLCGAGDIKSNNGLAIHIFLCNTSMENRCFYNSDGDFLIVPQKGNLLIYTEFGKMLVQPNEICVIQRGMRFSIDVFEETRGYILEVYGVHFELPDLGPIGANGLANPRDFLIPIAWYEDRQVPGGYTVINKYQGKLFAAKQDVSPFNVVAWHGNYTPYKYNLKNFMVINSVAFDHADPSIFTVLTAKSVRPGVAIADFVIFPPRWGVADKTFRPPYYHRNCMSEFMGLIRGHYEAKQGGFLPGGGSLHSTMTPHGPDADCFEKASKVKLAPERIADGTMAFMFESSLSLAVTKWGLKASRCLDENYHKCWEPLKSHFTPNSRNPAEPN,445,NP_000178.2.csv,refseq-HGD-NM_000187.3_clinical_seed_0_final,refseq-HGD-NM_000187.3.a2m,Invitae,refseq-HGD-NM_000187.3.npy,1,445,445
+NP_000179.2,MIAAQLLAYYFTELKDDQVKKIDKYLYAMRLSDETLIDIMTRFRKEMKNGLSRDFNPTATVKMLPTFVRSIPDGSEKGDFIALDLGGSSFRILRVQVNHEKNQNVHMESEVYDTPENIVHGSGSQLFDHVAECLGDFMEKRKIKDKKLPVGFTFSFPCQQSKIDEAILITWTKRFKASGVEGADVVKLLNKAIKKRGDYDANIVAVVNDTVGTMMTCGYDDQHCEVGLIIGTGTNACYMEELRHIDLVEGDEGRMCINTEWGAFGDDGSLEDIRTEFDREIDRGSLNPGKQLFEKMVSGMYLGELVRLILVKMAKEGLLFEGRITPELLTRGKFNTSDVSAIEKNKEGLHNAKEILTRLGVEPSDDDCVSVQHVCTIVSFRSANLVAATLGAILNRLRDNKGTPRLRTTVGVDGSLYKTHPQYSRRFHKTLRRLVPDSDVRFLLSESGSGKGAAMVTAVAYRLAEQHRQIEETLAHFHLTKDMLLEVKKRMRAEMELGLRKQTHNNAVVKMLPSFVRRTPDGTENGDFLALDLGGTNFRVLLVKIRSGKKRTVEMHNKIYAIPIEIMQGTGEELFDHIVSCISDFLDYMGIKGPRMPLGFTFSFPCQQTSLDAGILITWTKGFKATDCVGHDVVTLLRDAIKRREEFDLDVVAVVNDTVGTMMTCAYEEPTCEVGLIVGTGSNACYMEEMKNVEMVEGDQGQMCINMEWGAFGDNGCLDDIRTHYDRLVDEYSLNAGKQRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETLKTRGIFETKFLSQIESDRLALLQVRAILQQLGLNSTCDDSILVKTVCGVVSRRAAQLCGAGMAAVVDKIRENRGLDRLNVTVGVDGTLYKLHPHFSRIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTEASS,917,NP_000179.2.csv,refseq-HK1-NM_000188.2_clinical_seed_0_final,refseq-HK1-NM_000188.2.a2m,Invitae,refseq-HK1-NM_000188.2.npy,1,917,917
+NP_000181.2,MSGNGNAAATAEENSPKMRVIRVGTRKSQLARIQTDSVVATLKASYPGLQFEIIAMSTTGDKILDTALSKIGEKSLFTKELEHALEKNEVDLVVHSLKDLPTVLPPGFTIGAICKRENPHDAVVFHPKFVGKTLETLPEKSVVGTSSLRRAAQLQRKFPHLEFRSIRGNLNTRLRKLDEQQEFSAIILATAGLQRMGWHNRVGQILHPEECMYAVGQGALGVEVRAKDQDILDLVGVLHDPETLLRCIAERAFLRHLEGGCSVPVAVHTAMKDGQLYLTGGVWSLDGSDSIQETMQATIHVPAQHEDGPEDDPQLVGITARNIPRGPQLAAQNLGISLANLLLSKGAKNILDVARQLNDAH,361,NP_000181.2.csv,refseq-HMBS-NM_000190.3_clinical_seed_0_final,refseq-HMBS-NM_000190.3.a2m,Invitae,refseq-HMBS-NM_000190.3.npy,1,361,361
+NP_000182.2,MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPNLKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVSCALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLGIHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL,325,NP_000182.2.csv,refseq-HMGCL-NM_000191.2_clinical_seed_0_final,refseq-HMGCL-NM_000191.2.a2m,Invitae,refseq-HMGCL-NM_000191.2.npy,1,325,325
+NP_000183.2,MADADEGFGLAHTPLEPDAKDLPCDSKPESALGAPSKSPSSPQAAFTQQGMEGIKVFLHERELWLKFHEVGTEMIITKAGRRMFPSYKVKVTGLNPKTKYILLMDIVPADDHRYKFADNKWSVTGKAEPAMPGRLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQPRLHIVKADENNGFGSKNTAFCTHVFPETAFIAVTSYQNHKITQLKIENNPFAKGFRGSDDMELHRMSRMQSKEYPVVPRSTVRQKVASNHSPFSSESRALSTSSNLGSQYQCENGVSGPSQDLLPPPNPYPLPQEHSQIYHCTKRKEEECSTTDHPYKKPYMETSPSEEDSFYRSSYPQQQGLGASYRTESAQRQACMYASSAPPSEPVPSLEDISCNTWPSMPSYSSCTVTTVQPMDRLPYQHFSAHFTSGPLVPRLAGMANHGSPQLGEGMFQHQTSVAHQPVVRQCGPQTGLQSPGTLQPPEFLYSHGVPRTLSPHQYHSVHGVGMVPEWSDNS,518,NP_000183.2.csv,refseq-TBX5-NM_000192.3_clinical_seed_0_final,refseq-TBX5-NM_000192.3.a2m,Invitae,refseq-TBX5-NM_000192.3.npy,1,518,518
+NP_000184.1,MLLLARCLLLVLVSSLLVCSGLACGPGRGFGKRRHPKKLTPLAYKQFIPNVAEKTLGASGRYEGKISRNSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDKLNALAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRSKYGMLARLAVEAGFDWVYYESKAHIHCSVKAENSVAAKSGGCFPGSATVHLEQGGTKLVKDLSPGDRVLAADDQGRLLYSDFLTFLDRDDGAKKVFYVIETREPRERLLLTAAHLLFVAPHNDSATGEPEASSGSGPPSGGALGPRALFASRVRPGQRVYVVAERDGDRRLLPAAVHSVTLSEEAAGAYAPLTAQGTILINRVLASCYAVIEEHSWAHRAFAPFRLAHALLAALAPARTDRGGDSGGGDRGGGGGRVALTAPGAADAPGAGATAGIHWYSQLLYQIGTWLLDSEALHPLGMAVKSS,462,NP_000184.1.csv,refseq-SHH-NM_000193.2_clinical_seed_0_final,refseq-SHH-NM_000193.2.a2m,Invitae,refseq-SHH-NM_000193.2.npy,1,462,462
+NP_000185.1,MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGHHIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGDDLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVGFEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA,218,NP_000185.1.csv,refseq-HPRT1-NM_000194.2_clinical_seed_0_final,refseq-HPRT1-NM_000194.2.a2m,Invitae,refseq-HPRT1-NM_000194.2.npy,1,218,218
+NP_000186.2,MKCVLVATEGAEVLFYWTDQEFEESLRLKFGQSENEEEELPALEDQLSTLLAPVIISSMTMLEKLSDTYTCFSTENGNFLYVLHLFGECLFIAINGDHTESEGDLRRKLYVLKYLFEVHFGLVTVDGHLIRKELRPPDLAQRVQLWEHFQSLLWTYSRLREQEQCFAVEALERLIHPQLCELCIEALERHVIQAVNTSPERGGEEALHAFLLVHSKLLAFYSSHSASSLRPADLLALILLVQDLYPSESTAEDDIQPSPRRARSSQNIPVQQAWSPHSTGPTGGSSAETETDSFSLPEEYFTPAPSPGDQSSGSTIWLEGGTPPMDALQIAEDTLQTLVPHCPVPSGPRRIFLDANVKESYCPLVPHTMYCLPLWQGINLVLLTRSPSAPLALVLSQLMDGFSMLEKKLKEGPEPGASLRSQPLVGDLRQRMDKFVKNRGAQEIQSTWLEFKAKAFSKSEPGSSWELLQACGKLKRQLCAIYRLNFLTTAPSRGGPHLPQHLQDQVQRLMREKLTDWKDFLLVKSRRNITMVSYLEDFPGLVHFIYVDRTTGQMVAPSLNCSQKTSSELGKGPLAAFVKTKVWSLIQLARRYLQKGYTTLLFQEGDFYCSYFLWFENDMGYKLQMIEVPVLSDDSVPIGMLGGDYYRKLLRYYSKNRPTEAVRCYELLALHLSVIPTDLLVQQAGQLARRLWEASRIPLL,700,NP_000186.2.csv,refseq-HPS1-NM_000195.4_clinical_seed_0_final,refseq-HPS1-NM_000195.4.a2m,Invitae,refseq-HPS1-NM_000195.4.npy,1,700,700
+NP_000187.3,MERWPWPSGGAWLLVAARALLQLLRSDLRLGRPLLAALALLAALDWLCQRLLPPPAALAVLAAAGWIALSRLARPQRLPVATRAVLITGCDSGFGKETAKKLDSMGFTVLATVLELNSPGAIELRTCCSPRLRLLQMDLTKPGDISRVLEFTKAHTTSTGLWGLVNNAGHNEVVADAELSPVATFRSCMEVNFFGALELTKGLLPLLRSSRGRIVTVGSPAGDMPYPCLGAYGTSKAAVALLMDTFSCELLPWGVKVSIIQPGCFKTESVRNVGQWEKRKQLLLANLPQELLQAYGKDYIEHLHGQFLHSLRLAMSDLTPVVDAITDALLAARPRRRYYPGQGLGLMYFIHYYLPEGLRRRFLQAFFISHCLPRALQPGQPGTTPPQDAAQDPNLSPGPSPAVAR,405,NP_000187.3.csv,refseq-HSD11B2-NM_000196.3_clinical_seed_0_final,refseq-HSD11B2-NM_000196.3.a2m,Invitae,refseq-HSD11B2-NM_000196.3.npy,1,405,405
+NP_000188.1,MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGIGKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEHIKEKLAGLEIGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITSVVKMTQLILKHMESRQKGLILNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKAKEVIIQVLTPYAVSTAMTKYLNTNVITKTADEFVKESLNYVTIGGETCGCLAHEILAGFLSLIPAWAFYSGAFQRLLLTHYVAYLKLNTKVR,310,NP_000188.1.csv,DHB3_HUMAN_b07_clinical_seed_0_final,DHB3_HUMAN_b07.a2m,EVE,DHB3_HUMAN_b07_theta_0.2.npy,1,310,310
+NP_000189.1,MGWSCLVTGAGGLLGQRIVRLLVEEKELKEIRALDKAFRPELREEFSKLQNRTKLTVLEGDILDEPFLKRACQDVSVVIHTACIIDVFGVTHRESIMNVNVKGTQLLLEACVQASVPVFIYTSSIEVAGPNSYKEIIQNGHEEEPLENTWPTPYPYSKKLAEKAVLAANGWNLKNGDTLYTCALRPTYIYGEGGPFLSASINEALNNNGILSSVGKFSTVNPVYVGNVAWAHILALRALRDPKKAPSVRGQFYYISDDTPHQSYDNLNYILSKEFGLRLDSRWSLPLTLMYWIGFLLEVVSFLLSPIYSYQPPFNRHTVTLSNSVFTFSYKKAQRDLAYKPLYSWEEAKQKTVEWVGSLVDRHKETLKSKTQ,372,NP_000189.1.csv,refseq-HSD3B2-NM_000198.3_clinical_seed_0_final,refseq-HSD3B2-NM_000198.3.a2m,Invitae,refseq-HSD3B2-NM_000198.3.npy,1,372,372
+NP_000190.1,MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQNLATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL,502,NP_000190.1.csv,refseq-SGSH-NM_000199.3_clinical_seed_0_final,refseq-SGSH-NM_000199.3.a2m,Invitae,refseq-SGSH-NM_000199.3_theta_0.2.npy,1,502,502
+NP_000194.2,MRPLRPRAALLALLASLLAAPPVAPAEAPHLVHVDAARALWPLRRFWRSTGFCPPLPHSQADQYVLSWDQQLNLAYVGAVPHRGIKQVRTHWLLELVTTRGSTGRGLSYNFTHLDGYLDLLRENQLLPGFELMGSASGHFTDFEDKQQVFEWKDLVSSLARRYIGRYGLAHVSKWNFETWNEPDHHDFDNVSMTMQGFLNYYDACSEGLRAASPALRLGGPGDSFHTPPRSPLSWGLLRHCHDGTNFFTGEAGVRLDYISLHRKGARSSISILEQEKVVAQQIRQLFPKFADTPIYNDEADPLVGWSLPQPWRADVTYAAMVVKVIAQHQNLLLANTTSAFPYALLSNDNAFLSYHPHPFAQRTLTARFQVNNTRPPHVQLLRKPVLTAMGLLALLDEEQLWAEVSQAGTVLDSNHTVGVLASAHRPQGPADAWRAAVLIYASDDTRAHPNRSVAVTLRLRGVPPGPGLVYVTRYLDNGLCSPDGEWRRLGRPVFPTAEQFRRMRAAEDPVAAAPRPLPAGGRLTLRPALRLPSLLLVHVCARPEKPPGQVTRLRALPLTQGQLVLVWSDEHVGSKCLWTYEIQFSQDGKAYTPVSRKPSTFNLFVFSPDTGAVSGSYRVRALDYWARPGPFSDPVPYLEVPVPRGPPSPGNP,653,NP_000194.2.csv,refseq-IDUA-NM_000203.4_clinical_seed_0_final,refseq-IDUA-NM_000203.4.a2m,Invitae,refseq-IDUA-NM_000203.4.npy,1,653,653
+NP_000197.1,MLKPSLPFTSLLFLQLPLLGVGLNTTILTPNGNEDTTADFFLTTMPTDSLSVSTLPLPEVQCFVFNVEYMNCTWNSSSEPQPTNLTLHYWYKNSDNDKVQKCSHYLFSEEITSGCQLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENLTLHKLSESQLELNWNNRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKRYTFRVRSRFNPLCGSAQHWSEWSHPIHWGSNTSKENPFLFALEAVVISVGSMGLIISLLCVYFWLERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSEIPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET,369,NP_000197.1.csv,refseq-IL2RG-NM_000206.2_clinical_seed_0_final,refseq-IL2RG-NM_000206.2.a2m,Invitae,refseq-IL2RG-NM_000206.2.npy,1,369,369
+NP_000198.1,MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN,110,NP_000198.1.csv,refseq-INS-NM_000207.2_clinical_seed_0_final,refseq-INS-NM_000207.2.a2m,Invitae,refseq-INS-NM_000207.2.npy,1,110,110
+NP_000199.2,MATGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLKGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRKTSSGTGAEDPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPKMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPS,1382,NP_000199.2.csv,refseq-INSR-NM_000208.3_clinical_seed_0_final,refseq-INSR-NM_000208.3.a2m,Invitae,refseq-INSR-NM_000208.3.npy,1,1382,1382
+NP_000200.1,MNGEEQYYAATQLYKDPCAFQRGPAPEFSASPPACLYMGRQPPPPPPHPFPGALGALEQGSPPDISPYEVPPLADDPAVAHLHHHLPAQLALPHPPAGPFPEGAEPGVLEEPNRVQLPFPWMKSTKAHAWKGQWAGGAYAAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELAVMLNLTERHIKIWFQNRRMKWKKEEDKKRGGGTAVGGGGVAEPEQDCAVTSGEELLALPPPPPPGGAVPPAAPVAAREGRLPPGLSASPQPSSVAPRRPQEPR,283,NP_000200.1.csv,refseq-PDX1-NM_000209.3_clinical_seed_0_final,refseq-PDX1-NM_000209.3.a2m,Invitae,refseq-PDX1-NM_000209.3.npy,1,283,283
+NP_000202.3,MLGLRPPLLALVGLLSLGCVLSQECTKFKVSSCRECIESGPGCTWCQKLNFTGPGDPDSIRCDTRPQLLMRGCAADDIMDPTSLAETQEDHNGGQKQLSPQKVTLYLRPGQAAAFNVTFRRAKGYPIDLYYLMDLSYSMLDDLRNVKKLGGDLLRALNEITESGRIGFGSFVDKTVLPFVNTHPDKLRNPCPNKEKECQPPFAFRHVLKLTNNSNQFQTEVGKQLISGNLDAPEGGLDAMMQVAACPEEIGWRNVTRLLVFATDDGFHFAGDGKLGAILTPNDGRCHLEDNLYKRSNEFDYPSVGQLAHKLAENNIQPIFAVTSRMVKTYEKLTEIIPKSAVGELSEDSSNVVHLIKNAYNKLSSRVFLDHNALPDTLKVTYDSFCSNGVTHRNQPRGDCDGVQINVPITFQVKVTATECIQEQSFVIRALGFTDIVTVQVLPQCECRCRDQSRDRSLCHGKGFLECGICRCDTGYIGKNCECQTQGRSSQELEGSCRKDNNSIICSGLGDCVCGQCLCHTSDVPGKLIYGQYCECDTINCERYNGQVCGGPGRGLCFCGKCRCHPGFEGSACQCERTTEGCLNPRRVECSGRGRCRCNVCECHSGYQLPLCQECPGCPSPCGKYISCAECLKFEKGPFGKNCSAACPGLQLSNNPVKGRTCKERDSEGCWVAYTLEQQDGMDRYLIYVDESRECVAGPNIAAIVGGTVAGIVLIGILLLVIWKALIHLSDLREYRRFEKEKLKSQWNNDNPLFKSATTTVMNPKFAES,769,NP_000202.3.csv,NP_000202.3_colabfold_clinical_seed_0_final,NP_000202.3_colabfold.a2m,colabfold,NP_000202.3_colabfold_theta_0.2.npy,1,769,769
+NP_000203.2,MRARPRPRPLWATVLALGALAGVGVGGPNICTTRGVSSCQQCLAVSPMCAWCSDEALPLGSPRCDLKENLLKDNCAPESIEFPVSEARVLEDRPLSDKGSGDSSQVTQVSPQRIALRLRPDDSKNFSIQVRQVEDYPVDIYYLMDLSYSMKDDLWSIQNLGTKLATQMRKLTSNLRIGFGAFVDKPVSPYMYISPPEALENPCYDMKTTCLPMFGYKHVLTLTDQVTRFNEEVKKQSVSRNRDAPEGGFDAIMQATVCDEKIGWRNDASHLLVFTTDAKTHIALDGRLAGIVQPNDGQCHVGSDNHYSASTTMDYPSLGLMTEKLSQKNINLIFAVTENVVNLYQNYSELIPGTTVGVLSMDSSNVLQLIVDAYGKIRSKVELEVRDLPEELSLSFNATCLNNEVIPGLKSCMGLKIGDTVSFSIEAKVRGCPQEKEKSFTIKPVGFKDSLIVQVTFDCDCACQAQAEPNSHRCNNGNGTFECGVCRCGPGWLGSQCECSEEDYRPSQQDECSPREGQPVCSQRGECLCGQCVCHSSDFGKITGKYCECDDFSCVRYKGEMCSGHGQCSCGDCLCDSDWTGYYCNCTTRTDTCMSSNGLLCSGRGKCECGSCVCIQPGSYGDTCEKCPTCPDACTFKKECVECKKFDRGALHDENTCNRYCRDEIESVKELKDTGKDAVNCTYKNEDDCVVRFQYYEDSSGKSILYVVEEPECPKGPDILVVLLSVMGAILLIGLAALLIWKLLITIHDRKEFAKFEEERARAKWDTANNPLYKEATSTFTNITYRGT,788,NP_000203.2.csv,refseq-ITGB3-NM_000212.2_clinical_seed_0_final,refseq-ITGB3-NM_000212.2.a2m,Invitae,refseq-ITGB3-NM_000212.2.npy,1,788,788
+NP_000205.1,MRSPRTRGRSGRPLSLLLALLCALRAKVCGASGQFELEILSMQNVNGELQNGNCCGGARNPGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSTPVIGGNTFNLKASRGNDRNRIVLPFSFAWPRSYTLLVEAWDSSNDTVQPDSIIEKASHSGMINPSRQWQTLKQNTGVAHFEYQIRVTCDDYYYGFGCNKFCRPRDDFFGHYACDQNGNKTCMEGWMGPECNRAICRQGCSPKHGSCKLPGDCRCQYGWQGLYCDKCIPHPGCVHGICNEPWQCLCETNWGGQLCDKDLNYCGTHQPCLNGGTCSNTGPDKYQCSCPEGYSGPNCEIAEHACLSDPCHNRGSCKETSLGFECECSPGWTGPTCSTNIDDCSPNNCSHGGTCQDLVNGFKCVCPPQWTGKTCQLDANECEAKPCVNAKSCKNLIASYYCDCLPGWMGQNCDININDCLGQCQNDASCRDLVNGYRCICPPGYAGDHCERDIDECASNPCLNGGHCQNEINRFQCLCPTGFSGNLCQLDIDYCEPNPCQNGAQCYNRASDYFCKCPEDYEGKNCSHLKDHCRTTPCEVIDSCTVAMASNDTPEGVRYISSNVCGPHGKCKSQSGGKFTCDCNKGFTGTYCHENINDCESNPCRNGGTCIDGVNSYKCICSDGWEGAYCETNINDCSQNPCHNGGTCRDLVNDFYCDCKNGWKGKTCHSRDSQCDEATCNNGGTCYDEGDAFKCMCPGGWEGTTCNIARNSSCLPNPCHNGGTCVVNGESFTCVCKEGWEGPICAQNTNDCSPHPCYNSGTCVDGDNWYRCECAPGFAGPDCRININECQSSPCAFGATCVDEINGYRCVCPPGHSGAKCQEVSGRPCITMGSVIPDGAKWDDDCNTCQCLNGRIACSKVWCGPRPCLLHKGHSECPSGQSCIPILDDQCFVHPCTGVGECRSSSLQPVKTKCTSDSYYQDNCANITFTFNKEMMSPGLTTEHICSELRNLNILKNVSAEYSIYIACEPSPSANNEIHVAISAEDIRDDGNPIKEITDKIIDLVSKRDGNSSLIAAVAEVRVQRRPLKNRTDFLVPLLSSVLTVAWICCLVTAFYWCLRKRRKPGSHTHSASEDNTTNNVREQLNQIKNPIEKHGANTVPIKDYENKNSKMSKIRTHNSEVEEDDMDKHQQKARFAKQPAYTLVDREEKPPNGTPTKHPNWTNKQDNRDLESAQSLNRMEYIV,1218,NP_000205.1.csv,refseq-JAG1-NM_000214.2_clinical_seed_0_final,refseq-JAG1-NM_000214.2.a2m,Invitae,refseq-JAG1-NM_000214.2.npy,1,1218,1218
+NP_000206.2,MAPPSEETPLIPQRSCSLLSTEAGALHVLLPARGPGPPQRLSFSFGDHLAEDLCVQAAKASGILPVYHSLFALATEDLSCWFPPSHIFSVEDASTQVLLYRIRFYFPNWFGLEKCHRFGLRKDLASAILDLPVLEHLFAQHRSDLVSGRLPVGLSLKEQGECLSLAVLDLARMAREQAQRPGELLKTVSYKACLPPSLRDLIQGLSFVTRRRIRRTVRRALRRVAACQADRHSLMAKYIMDLERLDPAGAAETFHVGLPGALGGHDGLGLLRVAGDGGIAWTQGEQEVLQPFCDFPEIVDISIKQAPRVGPAGEHRLVTVTRTDNQILEAEFPGLPEALSFVALVDGYFRLTTDSQHFFCKEVAPPRLLEEVAEQCHGPITLDFAINKLKTGGSRPGSYVLRRSPQDFDSFLLTVCVQNPLGPDYKGCLIRRSPTGTFLLVGLSRPHSSLRELLATCWDGGLHVDGVAVTLTSCCIPRPKEKSNLIVVQRGHSPPTSSLVQPQSQYQLSQMTFHKIPADSLEWHENLGHGSFTKIYRGCRHEVVDGEARKTEVLLKVMDAKHKNCMESFLEAASLMSQVSYRHLVLLHGVCMAGDSTMVQEFVHLGAIDMYLRKRGHLVPASWKLQVVKQLAYALNYLEDKGLPHGNVSARKVLLAREGADGSPPFIKLSDPGVSPAVLSLEMLTDRIPWVAPECLREAQTLSLEADKWGFGATVWEVFSGVTMPISALDPAKKLQFYEDRQQLPAPKWTELALLIQQCMAYEPVQRPSFRAVIRDLNSLISSDYELLSDPTPGALAPRDGLWNGAQLYACQDPTIFEERHLKYISQLGKGNFGSVELCRYDPLGDNTGALVAVKQLQHSGPDQQRDFQREIQILKALHSDFIVKYRGVSYGPGRQSLRLVMEYLPSGCLRDFLQRHRARLDASRLLLYSSQICKGMEYLGSRRCVHRDLAARNILVESEAHVKIADFGLAKLLPLDKDYYVVREPGQSPIFWYAPESLSDNIFSRQSDVWSFGVVLYELFTYCDKSCSPSAEFLRMMGCERDVPALCRLLELLEEGQRLPAPPACPAEVHELMKLCWAPSPQDRPSFSALGPQLDMLWSGSRGCETHAFTAHPEGKHHSLSFS,1124,NP_000206.2.csv,refseq-JAK3-NM_000215.3_clinical_seed_0_final,refseq-JAK3-NM_000215.3.a2m,Invitae,refseq-JAK3-NM_000215.3.npy,1,1124,1124
+NP_000207.2,MVPGVPGAVLTLCLWLAASSGCLAAGPGAAAARRLDESLSAGSVQRARCASRCLSLQITRISAFFQHFQNNGSLVWCQNHKQCSKCLEPCKESGDLRKHQCQSFCEPLFPKKSYECLTSCEFLKYILLVKQGDCPAPEKASGFAAACVESCEVDNECSGVKKCCSNGCGHTCQVPKTLYKGVPLKPRKELRFTELQSGQLEVKWSSKFNISIEPVIYVVQRRWNYGIHPSEDDATHWQTVAQTTDERVQLTDIRPSRWYQFRVAAVNVHGTRGFTAPSKHFRSSKDPSAPPAPANLRLANSTVNSDGSVTVTIVWDLPEEPDIPVHHYKVFWSWMVSSKSLVPTKKKRRKTTDGFQNSVILEKLQPDCDYVVELQAITYWGQTRLKSAKVSLHFTSTHATNNKEQLVKTRKGGIQTQLPFQRRRPTRPLEVGAPFYQDGQLQVKVYWKKTEDPTVNRYHVRWFPEACAHNRTTGSEASSGMTHENYIILQDLSFSCKYKVTVQPIRPKSHSKAEAVFFTTPPCSALKGKSHKPVGCLGEAGHVLSKVLAKPENLSASFIVQDVNITGHFSWKMAKANLYQPMTGFQVTWAEVTTESRQNSLPNSIISQSQILPSDHYVLTVPNLRPSTLYRLEVQVLTPGGEGPATIKTFRTPELPPSSAHRSHLKHRHPHHYKPSPERY,680,NP_000207.2.csv,refseq-KAL1-NM_000216.2_clinical_seed_0_final,refseq-KAL1-NM_000216.2.a2m,Invitae,refseq-KAL1-NM_000216.2.npy,1,680,680
+NP_000208.2,MTVMSGENVDEASAAPGHPQDGSYPRQADHDDHECCERVVINISGLRFETQLKTLAQFPNTLLGNPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDMFSEEIKFYELGEEAMEKFREDEGFIKEEERPLPEKEYQRQVWLLFEYPESSGPARVIAIVSVMVILISIVIFCLETLPELKDDKDFTGTVHRIDNTTVIYNSNIFTDPFFIVETLCIIWFSFELVVRFFACPSKTDFFKNIMNFIDIVAIIPYFITLGTEIAEQEGNQKGEQATSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEAEEAESHFSSIPDAFWWAVVSMTTVGYGDMYPVTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQLLHVSSPNLASDSDLSRRSSSTMSKSEYMEIEEDMNNSIAHYRQVNIRTANCTTANQNCVNKSKLLTDV,495,NP_000208.2.csv,refseq-KCNA1-NM_000217.2_clinical_seed_0_final,refseq-KCNA1-NM_000217.2.a2m,Invitae,refseq-KCNA1-NM_000217.2.npy,1,495,495
+NP_000209.2,MAAASSPPRAERKRWGWGRLPGARRGSAGLAKKCPFSLELAEGGPAGGALYAPIAPGAPGPAPPASPAAPAAPPVASDLGPRPPVSLDPRVSIYSTRRPVLARTHVQGRVYNFLERPTGWKCFVYHFAVFLIVLVCLIFSVLSTIEQYAALATGTLFWMEIVLVVFFGTEYVVRLWSAGCRSKYVGLWGRLRFARKPISIIDLIVVVASMVVLCVGSKGQVFATSAIRGIRFLQILRMLHVDRQGGTWRLLGSVVFIHRQELITTLYIGFLGLIFSSYFVYLAEKDAVNESGRVEFGSYADALWWGVVTVTTIGYGDKVPQTWVGKTIASCFSVFAISFFALPAGILGSGFALKVQQKQRQKHFNRQIPAAASLIQTAWRCYAAENPDSSTWKIYIRKAPRSHTLLSPSPKPKKSVVVKKKKFKLDKDNGVTPGEKMLTVPHITCDPPEERRLDHFSVDGYDSSVRKSPTLLEVSMPHFMRTNSFAEDLDLEGETLLTPITHISQLREHHRATIKVIRRMQYFVAKKKFQQARKPYDVRDVIEQYSQGHLNLMVRIKELQRRLDQSIGKPSLFISVSEKSKDRGSNTIGARLNRVEDKVTQLDQRLALITDMLHQLLSLHGGSTPGSGGPPREGGAHITQPCGSGGSVDPELFLPSNTLPTYEQLTVPRRGPDEGS,676,NP_000209.2.csv,refseq-KCNQ1-NM_000218.2_clinical_seed_0_final,refseq-KCNQ1-NM_000218.2.a2m,Invitae,refseq-KCNQ1-NM_000218.2.npy,1,676,676
+NP_000210.2,MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSSDGKLEALYVLMVLGFFGFFTLGIMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNTHLPETKPSP,129,NP_000210.2.csv,refseq-KCNE1-NM_000219.4_clinical_seed_0_final,refseq-KCNE1-NM_000219.4.a2m,Invitae,refseq-KCNE1-NM_000219.4.npy,1,129,129
+NP_000211.1,MNASSRNVFDTLIRVLTESMFKHLRKWVVTRFFGHSRQRARLVSKDGRCNIEFGNVEAQSRFIFFVDIWTTVLDLKWRYKMTIFITAFLGSWFFFGLLWYAVAYIHKDLPEFHPSANHTPCVENINGLTSAFLFSLETQVTIGYGFRCVTEQCATAIFLLIFQSILGVIINSFMCGAILAKISRPKKRAKTITFSKNAVISKRGGKLCLLIRVANLRKSLLIGSHIYGKLLKTTVTPEGETIILDQININFVVDAGNENLFFISPLTIYHVIDHNSPFFHMAAETLLQQDFELVVFLDGTVESTSATCQVRTSYVPEEVLWGYRFAPIVSKTKEGKYRVDFHNFSKTVEVETPHCAMCLYNEKDVRARMKRGYDNPNFILSEVNETDDTKM,391,NP_000211.1.csv,refseq-KCNJ1-NM_000220.4_clinical_seed_0_final,refseq-KCNJ1-NM_000220.4.a2m,Invitae,refseq-KCNJ1-NM_000220.4.npy,1,391,391
+NP_000213.1,MRGARGAWDFLCVLLLLLRVQTGSSQPSVSPGEPSPPSIHPGKSDLIVRVGDEIRLLCTDPGFVKWTFEILDETNENKQNEWITEKAEATNTGKYTCTNKHGLSNSIYVFVRDPAKLFLVDRSLYGKEDNDTLVRCPLTDPEVTNYSLKGCQGKPLPKDLRFIPDPKAGIMIKSVKRAYHRLCLHCSVDQEGKSVLSEKFILKVRPAFKAVPVVSVSKASYLLREGEEFTVTCTIKDVSSSVYSTWKRENSQTKLQEKYNSWHHGDFNYERQATLTISSARVNDSGVFMCYANNTFGSANVTTTLEVVDKGFINIFPMINTTVFVNDGENVDLIVEYEAFPKPEHQQWIYMNRTFTDKWEDYPKSENESNIRYVSELHLTRLKGTEGGTYTFLVSNSDVNAAIAFNVYVNTKPEILTYDRLVNGMLQCVAAGFPEPTIDWYFCPGTEQRCSASVLPVDVQTLNSSGPPFGKLVVQSSIDSSAFKHNGTVECKAYNDVGKTSAYFNFAFKGNNKEQIHPHTLFTPLLIGFVIVAGMMCIIVMILTYKYLQKPMYEVQWKVVEEINGNNYVYIDPTQLPYDHKWEFPRNRLSFGKTLGAGAFGKVVEATAYGLIKSDAAMTVAVKMLKPSAHLTEREALMSELKVLSYLGNHMNIVNLLGACTIGGPTLVITEYCCYGDLLNFLRRKRDSFICSKQEDHAEAALYKNLLHSKESSCSDSTNEYMDMKPGVSYVVPTKADKRRSVRIGSYIERDVTPAIMEDDELALDLEDLLSFSYQVAKGMAFLASKNCIHRDLAARNILLTHGRITKICDFGLARDIKNDSNYVVKGNARLPVKWMAPESIFNCVYTFESDVWSYGIFLWELFSLGSSPYPGMPVDSKFYKMIKEGFRMLSPEHAPAEMYDIMKTCWDADPLKRPTFKQIVQLIEKQISESTNHIYSNLANCSPNRQKPVVDHSVRINSVGSTASSSQPLLVHDDV,976,NP_000213.1.csv,refseq-KIT-NM_000222.2_clinical_seed_0_final,refseq-KIT-NM_000222.2.a2m,Invitae,refseq-KIT-NM_000222.2.npy,1,976,976
+NP_000214.1,MDLSNNTMSLSVRTPGLSRRLSSQSVIGRPRGMSASSVGSGYGGSAFGFGASCGGGFSAASMFGSSSGFGGGSGSSMAGGLGAGYGRALGGGSFGGLGMGFGGSPGGGSLGILSGNDGGLLSGSEKETMQNLNDRLASYLDKVRALEEANTELENKIREWYETRGTGTADASQSDYSKYYPLIEDLRNKIISASIGNAQLLLQIDNARLAAEDFRMKYENELALRQGVEADINGLRRVLDELTLTRTDLEMQIESLNEELAYMKKNHEDELQSFRVGGPGEVSVEMDAAPGVDLTRLLNDMRAQYETIAEQNRKDAEAWFIEKSGELRKEISTNTEQLQSSKSEVTDLRRAFQNLEIELQSQLAMKKSLEDSLAEAEGDYCAQLSQVQQLISNLEAQLLQVRADAERQNVDHQRLLNVKARLELEIETYRRLLDGEAQGDGLEESLFVTDSKSQAQSTDSSKDPTKTRKIKTVVQEMVNGEVVSSQVQEIEELM,494,NP_000214.1.csv,refseq-KRT12-NM_000223.3_clinical_seed_0_final,refseq-KRT12-NM_000223.3.a2m,Invitae,refseq-KRT12-NM_000223.3.npy,1,494,494
+NP_000217.2,MSCRQFSSSYLSRSGGGGGGGLGSGGSIRSSYSRFSSSGGGGGGGRFSSSSGYGGGSSRVCGRGGGGSFGYSYGGGSGGGFSASSLGGGFGGGSRGFGGASGGGYSSSGGFGGGFGGGSGGGFGGGYGSGFGGFGGFGGGAGGGDGGILTANEKSTMQELNSRLASYLDKVQALEEANNDLENKIQDWYDKKGPAAIQKNYSPYYNTIDDLKDQIVDLTVGNNKTLLDIDNTRMTLDDFRIKFEMEQNLRQGVDADINGLRQVLDNLTMEKSDLEMQYETLQEELMALKKNHKEEMSQLTGQNSGDVNVEINVAPGKDLTKTLNDMRQEYEQLIAKNRKDIENQYETQITQIEHEVSSSGQEVQSSAKEVTQLRHGVQELEIELQSQLSKKAALEKSLEDTKNRYCGQLQMIQEQISNLEAQITDVRQEIECQNQEYSLLLSIKMRLEKEIETYHNLLEGGQEDFESSGAGKIGLGGRGGSGGSYGRGSRGGSGGSYGGGGSGGGYGGGSGSRGGSGGSYGGGSGSGGGSGGGYGGGSGGGHSGGSGGGHSGGSGGNYGGGSGSGGGSGGGYGGGSGSRGGSGGSHGGGSGFGGESGGSYGGGEEASGSGGGYGGGSGKSSHS,623,NP_000217.2.csv,refseq-KRT9-NM_000226.3_clinical_seed_0_final,refseq-KRT9-NM_000226.3.a2m,Invitae,refseq-KRT9-NM_000226.3.npy,1,623,623
+NP_000219.2,MRPFFLLCFALPGLLHAQQACSRGACYPPVGDLLVGRTRFLRASSTCGLTKPETYCTQYGEWQMKCCKCDSRQPHNYYSHRVENVASSSGPMRWWQSQNDVNPVSLQLDLDRRFQLQEVMMEFQGPMPAGMLIERSSDFGKTWRVYQYLAADCTSTFPRVRQGRPQSWQDVRCQSLPQRPNARLNGGKVQLNLMDLVSGIPATQSQKIQEVGEITNLRVNFTRLAPVPQRGYHPPSAYYAVSQLRLQGSCFCHGHADRCAPKPGASAGPSTAVQVHDVCVCQHNTAGPNCERCAPFYNNRPWRPAEGQDAHECQRCDCNGHSETCHFDPAVFAASQGAYGGVCDNCRDHTEGKNCERCQLHYFRNRRPGASIQETCISCECDPDGAVPGAPCDPVTGQCVCKEHVQGERCDLCKPGFTGLTYANPQGCHRCDCNILGSRRDMPCDEESGRCLCLPNVVGPKCDQCAPYHWKLASGQGCEPCACDPHNSLSPQCNQFTGQCPCREGFGGLMCSAAAIRQCPDRTYGDVATGCRACDCDFRGTEGPGCDKASGRCLCRPGLTGPRCDQCQRGYCNRYPVCVACHPCFQTYDADLREQALRFGRLRNATASLWSGPGLEDRGLASRILDAKSKIEQIRAVLSSPAVTEQEVAQVASAILSLRRTLQGLQLDLPLEEETLSLPRDLESLDRSFNGLLTMYQRKREQFEKISSADPSGAFRMLSTAYEQSAQAAQQVSDSSRLLDQLRDSRREAERLVRQAGGGGGTGSPKLVALRLEMSSLPDLTPTFNKLCGNSRQMACTPISCPGELCPQDNGTACGSRCRGVLPRAGGAFLMAGQVAEQLRGFNAQLQRTRQMIRAAEESASQIQSSAQRLETQVSASRSQMEEDVRRTRLLIQQVRDFLTDPDTDAATIQEVSEAVLALWLPTDSATVLQKMNEIQAIAARLPNVDLVLSQTKQDIARARRLQAEAEEARSRAHAVEGQVEDVVGNLRQGTVALQEAQDTMQGTSRSLRLIQDRVAEVQQVLRPAEKLVTSMTKQLGDFWTRMEELRHQARQQGAEAVQAQQLAEGASEQALSAQEGFERIKQKYAELKDRLGQSSMLGEQGARIQSVKTEAEELFGETMEMMDRMKDMELELLRGSQAIMLRSADLTGLEKRVEQIRDHINGRVLYYATCK,1172,NP_000219.2.csv,refseq-LAMB3-NM_000228.2_clinical_seed_0_final,refseq-LAMB3-NM_000228.2.a2m,Invitae,refseq-LAMB3-NM_000228.2.npy,1,1172,1172
+NP_000220.1,MGPPGSPWQWVTLLLGLLLPPAAPFWLLNVLFPPHTTPKAELSNHTRPVILVPGCLGNQLEAKLDKPDVVNWMCYRKTEDFFTIWLDLNMFLPLGVDCWIDNTRVVYNRSSGLVSNAPGVQIRVPGFGKTYSVEYLDSSKLAGYLHTLVQNLVNNGYVRDETVRAAPYDWRLEPGQQEEYYRKLAGLVEEMHAAYGKPVFLIGHSLGCLHLLYFLLRQPQAWKDRFIDGFISLGAPWGGSIKPMLVLASGDNQGIPIMSSIKLKEEQRITTTSPWMFPSRMAWPEDHVFISTPSFNYTGRDFQRFFADLHFEEGWYMWLQSRDLLAGLPAPGVEVYCLYGVGLPTPRTYIYDHGFPYTDPVGVLYEDGDDTVATRSTELCGLWQGRQPQPVHLLPLHGIQHLNMVFSNLTLEHINAILLGAYRQGPPASPTASPEPPPPE,440,NP_000220.1.csv,refseq-LCAT-NM_000229.1_clinical_seed_0_final,refseq-LCAT-NM_000229.1.a2m,Invitae,refseq-LCAT-NM_000229.1.npy,1,440,440
+NP_000221.1,MHWGTLCGFLWLWPYLFYVQAVPIQKVQDDTKTLIKTIVTRINDISHTQSVSSKQKVTGLDFIPGLHPILTLSKMDQTLAVYQQILTSMPSRNVIQISNDLENLRDLLHVLAFSKSCHLPWASGLETLDSLGGVLEASGYSTEVVALSRLQGSLQDMLWQLDLSPGC,167,NP_000221.1.csv,refseq-LEP-NM_000230.2_clinical_seed_0_final,refseq-LEP-NM_000230.2.a2m,Invitae,refseq-LEP-NM_000230.2.npy,1,167,167
+NP_000222.2,MVREQYTTATEGICIERPENQYVYKIGIYGWRKRCLYLFVLLLLIILVVNLALTIWILKVMWFSPAGMGHLCVTKDGLRLEGESEFLFPLYAKEIHSRVDSSLLLQSTQNVTVNARNSEGEVTGRLKVGPKMVEVQNQQFQINSNDGKPLFTVDEKEVVVGTDKLRVTGPEGALFEHSVETPLVRADPFQDLRLESPTRSLSMDAPRGVHIQAHAGKIEALSQMDILFHSSDGMLVLDAETVCLPKLVQGTWGPSGSSQSLYEICVCPDGKLYLSVAGVSTTCQEHNHICL,291,NP_000222.2.csv,refseq-SGCG-NM_000231.3_clinical_seed_0_final,refseq-SGCG-NM_000231.3.a2m,Invitae,refseq-SGCG-NM_000231.3.npy,1,291,291
+NP_000224.2,MKQRFSALQLLKLLLLLQPPLPRALREALCPEPCNCVPDGALRCPGPTAGLTRLSLAYLPVKVIPSQAFRGLNEVIKIEISQIDSLERIEANAFDNLLNLSEILIQNTKNLRYIEPGAFINLPRLKYLSICNTGIRKFPDVTKVFSSESNFILEICDNLHITTIPGNAFQGMNNESVTLKLYGNGFEEVQSHAFNGTTLTSLELKENVHLEKMHNGAFRGATGPKTLDISSTKLQALPSYGLESIQRLIATSSYSLKKLPSRETFVNLLEATLTYPSHCCAFRNLPTKEQNFSHSISENFSKQCESTVRKVNNKTLYSSMLAESELSGWDYEYGFCLPKTPRCAPEPDAFNPCEDIMGYDFLRVLIWLINILAIMGNMTVLFVLLTSRYKLTVPRFLMCNLSFADFCMGLYLLLIASVDSQTKGQYYNHAIDWQTGSGCSTAGFFTVFASELSVYTLTVITLERWHTITYAIHLDQKLRLRHAILIMLGGWLFSSLIAMLPLVGVSNYMKVSICFPMDVETTLSQVYILTILILNVVAFFIICACYIKIYFAVRNPELMATNKDTKIAKKMAILIFTDFTCMAPISFFAISAAFKVPLITVTNSKVLLVLFYPINSCANPFLYAIFTKTFQRDFFLLLSKFGCCKRRAELYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQGTALLDKTRYTEC,699,NP_000224.2.csv,refseq-LHCGR-NM_000233.3_clinical_seed_0_final,refseq-LHCGR-NM_000233.3.a2m,Invitae,refseq-LHCGR-NM_000233.3.npy,1,699,699
+NP_000226.2,MKMRFLGLVVCLVLWTLHSEGSGGKLTAVDPETNMNVSEIISYWGFPSEEYLVETEDGYILCLNRIPHGRKNHSDKGPKPVVFLQHGLLADSSNWVTNLANSSLGFILADAGFDVWMGNSRGNTWSRKHKTLSVSQDEFWAFSYDEMAKYDLPASINFILNKTGQEQVYYVGHSQGTTIGFIAFSQIPELAKRIKMFFALGPVASVAFCTSPMAKLGRLPDHLIKDLFGDKEFLPQSAFLKWLGTHVCTHVILKELCGNLCFLLCGFNERNLNMSRVDVYTTHSPAGTSVQNMLHWSQAVKFQKFQAFDWGSSAKNYFHYNQSYPPTYNVKDMLVPTAVWSGGHDWLADVYDVNILLTQITNLVFHESIPEWEHLDFIWGLDAPWRLYNKIINLMRKYQ,399,NP_000226.2.csv,refseq-LIPA-NM_000235.3_clinical_seed_0_final,refseq-LIPA-NM_000235.3.a2m,Invitae,refseq-LIPA-NM_000235.3.npy,1,399,399
+NP_000228.1,MESKALLVLTLAVWLQSLTASRGGVAAADQRRDFIDIESKFALRTPEDTAEDTCHLIPGVAESVATCHFNHSSKTFMVIHGWTVTGMYESWVPKLVAALYKREPDSNVIVVDWLSRAQEHYPVSAGYTKLVGQDVARFINWMEEEFNYPLDNVHLLGYSLGAHAAGIAGSLTNKKVNRITGLDPAGPNFEYAEAPSRLSPDDADFVDVLHTFTRGSPGRSIGIQKPVGHVDIYPNGGTFQPGCNIGEAIRVIAERGLGDVDQLVKCSHERSIHLFIDSLLNEENPSKAYRCSSKEAFEKGLCLSCRKNRCNNLGYEINKVRAKRSSKMYLKTRSQMPYKVFHYQVKIHFSGTESETHTNQAFEISLYGTVAESENIPFTLPEVSTNKTYSFLIYTEVDIGELLMLKLKWKSDSYFSWSDWWSSPGFAIQKIRVKAGETQKKVIFCSREKVSHLQKGKAPAVFVKCHDKSLNKKSG,475,NP_000228.1.csv,refseq-LPL-NM_000237.2_clinical_seed_0_final,refseq-LPL-NM_000237.2.a2m,Invitae,refseq-LPL-NM_000237.2.npy,1,475,475
+NP_000229.1,MPVRRGHVAPQNTFLDTIIRKFEGQSRKFIIANARVENCAVIYCNDGFCELCGYSRAEVMQRPCTCDFLHGPRTQRRAAAQIAQALLGAEERKVEIAFYRKDGSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEKDMVGSPAHDTNHRGPPTSWLAPGRAKTFRLKLPALLALTARESSVRSGGAGGAGAPGAVVVDVDLTPAAPSSESLALDEVTAMDNHVAGLGPAEERRALVGPGSPPRSAPGQLPSPRAHSLNPDASGSSCSLARTRSRESCASVRRASSADDIEAMRAGVLPPPPRHASTGAMHPLRSGLLNSTSDSDLVRYRTISKIPQITLNFVDLKGDPFLASPTSDREIIAPKIKERTHNVTEKVTQVLSLGADVLPEYKLQAPRIHRWTILHYSPFKAVWDWLILLLVIYTAVFTPYSAAFLLKETEEGPPATECGYACQPLAVVDLIVDIMFIVDILINFRTTYVNANEEVVSHPGRIAVHYFKGWFLIDMVAAIPFDLLIFGSGSEELIGLLKTARLLRLVRVARKLDRYSEYGAAVLFLLMCTFALIAHWLACIWYAIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKYVTALYFTFSSLTSVGFGNVSPNTNSEKIFSICVMLIGSLMYASIFGNVSAIIQRLYSGTARYHTQMLRVREFIRFHQIPNPLRQRLEEYFQHAWSYTNGIDMNAVLKGFPECLQADICLHLNRSLLQHCKPFRGATKGCLRALAMKFKTTHAPPGDTLVHAGDLLTALYFISRGSIEILRGDVVVAILGKNDIFGEPLNLYARPGKSNGDVRALTYCDLHKIHRDDLLEVLDMYPEFSDHFWSSLEITFNLRDTNMIPGSPGSTELEGGFSRQRKRKLSFRRRTDKDTEQPGEVSALGPGRAGAGPSSRGRPGGPWGESPSSGPSSPESSEDEGPGRSSSPLRLVPFSSPRPPGEPPGGEPLMEDCEKSSDTCNPLSGAFSGVSNIFSFWGDSRGRQYQELPRCPAPTPSLLNIPLSSPGRRPRGDVESRLDALQRQLNRLETRLSADMATVLQLLQRQMTLVPPAYSAVTTPGPGPTSTSPLLPVSPLPTLTLDSLSQVSQFMACEELPPGAPELPQEGPTRRLSLPGQLGALTSQPLHRHGSDPGS,1159,NP_000229.1.csv,refseq-KCNH2-NM_000238.3_clinical_seed_0_final,refseq-KCNH2-NM_000238.3.a2m,Invitae,refseq-KCNH2-NM_000238.3.npy,1,1159,1159
+NP_000230.1,MKALIVLGLVLLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRATNYNAGDRSTDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRDPQGIRAWVAWRNRCQNRDVRQYVQGCGV,148,NP_000230.1.csv,refseq-LYZ-NM_000239.2_clinical_seed_0_final,refseq-LYZ-NM_000239.2.a2m,Invitae,refseq-LYZ-NM_000239.2_theta_0.2.npy,1,148,148
+NP_000231.1,MENQEKASIAGHMFDVVVIGGGISGLSAAKLLTEYGVSVLVLEARDRVGGRTYTIRNEHVDYVDVGGAYVGPTQNRILRLSKELGIETYKVNVSERLVQYVKGKTYPFRGAFPPVWNPIAYLDYNNLWRTIDNMGKEIPTDAPWEAQHADKWDKMTMKELIDKICWTKTARRFAYLFVNINVTSEPHEVSALWFLWYVKQCGGTTRIFSVTNGGQERKFVGGSGQVSERIMDLLGDQVKLNHPVTHVDQSSDNIIIETLNHEHYECKYVINAIPPTLTAKIHFRPELPAERNQLIQRLPMGAVIKCMMYYKEAFWKKKDYCGCMIIEDEDAPISITLDDTKPDGSLPAIMGFILARKADRLAKLHKEIRKKKICELYAKVLGSQEALHPVHYEEKNWCEEQYSGGCYTAYFPPGIMTQYGRVIRQPVGRIFFAGTETATKWSGYMEGAVEAGERAAREVLNGLGKVTEKDIWVQEPESKDVPAVEITHTFWERNLPSVSGLLKIIGFSTSVTALGFVLYKYKLLPRS,527,NP_000231.1.csv,refseq-MAOA-NM_000240.3_clinical_seed_0_final,refseq-MAOA-NM_000240.3.a2m,Invitae,refseq-MAOA-NM_000240.3.npy,1,527,527
+NP_000239.1,MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQVKQYLSTTLANKHANQVLSLPCPNQPGDHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLTISNSCPANLPNIKRELTACIFPTESEARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQEPVLENCSQDLLQHHADLTCTTTLDLTDGTITFNNNLGTGTEANQAYSVPTKMGSKLEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC,419,NP_000239.1.csv,refseq-MITF-NM_000248.3_clinical_seed_0_final,refseq-MITF-NM_000248.3.a2m,Invitae,refseq-MITF-NM_000248.3.npy,1,419,419
+NP_000240.1,MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIATRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNAVSRELIEIGCEDKTLAFKMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPLSKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGDTTKGTSEMSEKRGPTSSNPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLTSVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELFYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLADYFSLEIDEEGNLIGLPLLIDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRKQYISEESTLSGQQSEVPGSIPNSWKWTVEHIVYKALRSHILPPKHFTEDGNILQLANLPDLYKVFERC,756,NP_000240.1.csv,refseq-MLH1-NM_000249.3_clinical_seed_0_final,refseq-MLH1-NM_000249.3.a2m,Invitae,refseq-MLH1-NM_000249.3.npy,1,756,756
+NP_000241.1,MGVPFFSSLRCMVDLGPCWAGGLTAEMKLLLALAGLLAILATPQPSEGAAPAVLGEVDTSLVLSSMEEAKQLVDKAYKERRESIKQRLRSGSASPMELLSYFKQPVAATRTAVRAADYLHVALDLLERKLRSLWRRPFNVTDVLTPAQLNVLSKSSGCAYQDVGVTCPEQDKYRTITGMCNNRRSPTLGASNRAFVRWLPAEYEDGFSLPYGWTPGVKRNGFPVALARAVSNEIVRFPTDQLTPDQERSLMFMQWGQLLDHDLDFTPEPAARASFVTGVNCETSCVQQPPCFPLKIPPNDPRIKNQADCIPFFRSCPACPGSNITIRNQINALTSFVDASMVYGSEEPLARNLRNMSNQLGLLAVNQRFQDNGRALLPFDNLHDDPCLLTNRSARIPCFLAGDTRSSEMPELTSMHTLLLREHNRLATELKSLNPRWDGERLYQEARKIVGAMVQIITYRDYLPLVLGPTAMRKYLPTYRSYNDSVDPRIANVFTNAFRYGHTLIQPFMFRLDNRYQPMEPNPRVPLSRVFFASWRVVLEGGIDPILRGLMATPAKLNRQNQIAVDEIRERLFEQVMRIGLDLPALNMQRSRDHGLPGYNAWRRFCGLPQPETVGQLGTVLRNLKLARKLMEQYGTPNNIDIWMGGVSEPLKRKGRVGPLLACIIGTQFRKLRDGDRFWWENEGVFSMQQRQALAQISLPRIICDNTGITTVSKNNIFMSNSYPRDFVNCSTLPALNLASWREAS,745,NP_000241.1.csv,refseq-MPO-NM_000250.1_clinical_seed_0_final,refseq-MPO-NM_000250.1.a2m,Invitae,refseq-MPO-NM_000250.1.npy,1,745,745
+NP_000242.1,MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYTAHGEDALLAAREVFKTQGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKNRAGNKASKENDWYLAYKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGLCEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQRGGILITERKKADFSTKDIYQDLNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELLSDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPLMDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQALEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLKGVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHVIECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT,934,NP_000242.1.csv,refseq-MSH2-NM_000251.2_clinical_seed_0_final,refseq-MSH2-NM_000251.2.a2m,Invitae,refseq-MSH2-NM_000251.2_theta_0.2.npy,1,934,934
+NP_000243.1,MASASTSKYNSHSLENESIKRTSRDGVNRDLTEAVPRLPGETLITDKEVIYICPFNGPIKGRVYITNYRLYLRSLETDSSLILDVPLGVISRIEKMGGATSRGENSYGLDITCKDMRNLRFALKQEGHSRRDMFEILTRYAFPLAHSLPLFAFLNEEKFNVDGWTVYNPVEEYRRQGLPNHHWRITFINKCYELCDTYPALLVVPYRASDDDLRRVATFRSRNRIPVLSWIHPENKTVIVRCSQPLVGMSGKRNKDDEKYLDVIRETNKQISKLTIYDARPSVNAVANKATGGGYESDDAYHNAELFFLDIHNIHVMRESLKKVKDIVYPNVEESHWLSSLESTHWLEHIKLVLTGAIQVADKVSSGKSSVLVHCSDGWDRTAQLTSLAMLMLDSFYRSIEGFEILVQKEWISFGHKFASRIGHGDKNHTDADRSPIFLQFIDCVWQMSKQFPTAFEFNEQFLIIILDHLYSCRFGTFLFNCESARERQKVTERTVSLWSLINSNKEKFKNPFYTKEINRVLYPVASMRHLELWVNYYIRWNPRIKQQQPNPVEQRYMELLALRDEYIKRLEELQLANSAKLSDPPTSPSSPSQMMPHVQTHF,603,NP_000243.1.csv,refseq-MTM1-NM_000252.2_clinical_seed_0_final,refseq-MTM1-NM_000252.2.a2m,Invitae,refseq-MTM1-NM_000252.2.npy,1,603,603
+NP_000244.2,MILLAVLFLCFISSYSASVKGHTTGLSLNNDRLYKLTYSTEVLLDRGKGKLQDSVGYRISSNVDVALLWRNPDGDDDQLIQITMKDVNVENVNQQRGEKSIFKGKSPSKIMGKENLEALQRPTLLHLIHGKVKEFYSYQNEAVAIENIKRGLASLFQTQLSSGTTNEVDISGNCKVTYQAHQDKVIKIKALDSCKIARSGFTTPNQVLGVSSKATSVTTYKIEDSFVIAVLAEETHNFGLNFLQTIKGKIVSKQKLELKTTEAGPRLMSGKQAAAIIKAVDSKYTAIPIVGQVFQSHCKGCPSLSELWRSTRKYLQPDNLSKAEAVRNFLAFIQHLRTAKKEEILQILKMENKEVLPQLVDAVTSAQTSDSLEAILDFLDFKSDSSIILQERFLYACGFASHPNEELLRALISKFKGSIGSSDIRETVMIITGTLVRKLCQNEGCKLKAVVEAKKLILGGLEKAEKKEDTRMYLLALKNALLPEGIPSLLKYAEAGEGPISHLATTALQRYDLPFITDEVKKTLNRIYHQNRKVHEKTVRTAAAAIILNNNPSYMDVKNILLSIGELPQEMNKYMLAIVQDILRFEMPASKIVRRVLKEMVAHNYDRFSRSGSSSAYTGYIERSPRSASTYSLDILYSGSGILRRSNLNIFQYIGKAGLHGSQVVIEAQGLEALIAATPDEGEENLDSYAGMSAILFDVQLRPVTFFNGYSDLMSKMLSASGDPISVVKGLILLIDHSQELQLQSGLKANIEVQGGLAIDISGAMEFSLWYRESKTRVKNRVTVVITTDITVDSSFVKAGLETSTETEAGLEFISTVQFSQYPFLVCMQMDKDEAPFRQFEKKYERLSTGRGYVSQKRKESVLAGCEFPLHQENSEMCKVVFAPQPDSTSSGWF,894,NP_000244.2.csv,refseq-MTTP-NM_000253.3_clinical_seed_0_final,refseq-MTTP-NM_000253.3.a2m,Invitae,refseq-MTTP-NM_000253.3.npy,1,894,894
+NP_000245.2,MSPALQDLSQPEGLKKTLRDEINAILQKRIMVLDGGMGTMIQREKLNEEHFRGQEFKDHARPLKGNNDILSITQPDVIYQIHKEYLLAGADIIETNTFSSTSIAQADYGLEHLAYRMNMCSAGVARKAAEEVTLQTGIKRFVAGALGPTNKTLSVSPSVERPDYRNITFDELVEAYQEQAKGLLDGGVDILLIETIFDTANAKAALFALQNLFEEKYAPRPIFISGTIVDKSGRTLSGQTGEGFVISVSHGEPLCIGLNCALGAAEMRPFIEIIGKCTTAYVLCYPNAGLPNTFGDYDETPSMMAKHLKDFAMDGLVNIVGGCCGSTPDHIREIAEAVKNCKPRVPPATAFEGHMLLSGLEPFRIGPYTNFVNIGERCNVAGSRKFAKLIMAGNYEEALCVAKVQVEMGAQVLDVNMDDGMLDGPSAMTRFCNLIASEPDIAKVPLCIDSSNFAVIEAGLKCCQGKCIVNSISLKEGEDDFLEKARKIKKYGAAMVVMAFDEEGQATETDTKIRVCTRAYHLLVKKLGFNPNDIIFDPNILTIGTGMEEHNLYAINFIHATKVIKETLPGARISGGLSNLSFSFRGMEAIREAMHGVFLYHAIKSGMDMGIVNAGNLPVYDDIHKELLQLCEDLIWNKDPEATEKLLRYAQTQGTGGKKVIQTDEWRNGPVEERLEYALVKGIEKHIIEDTEEARLNQKKYPRPLNIIEGPLMNGMKIVGDLFGAGKMFLPQVIKSARVMKKAVGHLIPFMEKEREETRVLNGTVEEEDPYQGTIVLATVKGDVHDIGKNIVGVVLGCNNFRVIDLGVMTPCDKILKAALDHKADIIGLSGLITPSLDEMIFVAKEMERLAIRIPLLIGGATTSKTHTAVKIAPRYSAPVIHVLDASKSVVVCSQLLDENLKDEYFEEIMEEYEDIRQDHYESLKERRYLPLSQARKSGFQMDWLSEPHPVKPTFIGTQVFEDYDLQKLVDYIDWKPFFDVWQLRGKYPNRGFPKIFNDKTVGGEARKVYDDAHNMLNTLISQKKLRARGVVGFWPAQSIQDDIHLYAEAAVPQAAEPIATFYGLRQQAEKDSASTEPYYCLSDFIAPLHSGIRDYLGLFAVACFGVEELSKAYEDDGDDYSSIMVKALGDRLAEAFAEELHERVRRELWAYCGSEQLDVADLRRLRYKGIRPAPGYPSQPDHTEKLTMWRLADIEQSTGIRLTESLAMAPASAVSGLYFSNLKSKYFAVGKISKDQVEDYALRKNISVAEVEKWLGPILGYDTD,1265,NP_000245.2.csv,refseq-MTR-NM_000254.2_clinical_seed_0_final,refseq-MTR-NM_000254.2.a2m,Invitae,refseq-MTR-NM_000254.2.npy,1,1265,1265
+NP_000246.2,MLRAKNQLFLLSPHYLRQVKESSGSRLIQQRLLHQQQPLHPEWAALAKKQLKGKNPEDLIWHTPEGISIKPLYSKRDTMDLPEELPGVKPFTRGPYPTMYTFRPWTIRQYAGFSTVEESNKFYKDNIKAGQQGLSVAFDLATHRGYDSDNPRVRGDVGMAGVAIDTVEDTKILFDGIPLEKMSVSMTMNGAVIPVLANFIVTGEEQGVPKEKLTGTIQNDILKEFMVRNTYIFPPEPSMKIIADIFEYTAKHMPKFNSISISGYHMQEAGADAILELAYTLADGLEYSRTGLQAGLTIDEFAPRLSFFWGIGMNFYMEIAKMRAGRRLWAHLIEKMFQPKNSKSLLLRAHCQTSGWSLTEQDPYNNIVRTAIEAMAAVFGGTQSLHTNSFDEALGLPTVKSARIARNTQIIIQEESGIPKVADPWGGSYMMECLTNDVYDAALKLINEIEEMGGMAKAVAEGIPKLRIEECAARRQARIDSGSEVIVGVNKYQLEKEDAVEVLAIDNTSVRNRQIEKLKKIKSSRDQALAERCLAALTECAASGDGNILALAVDASRARCTVGEITDALKKVFGEHKANDRMVSGAYRQEFGESKEITSAIKRVHKFMEREGRRPRLLVAKMGQDGHDRGAKVIATGFADLGFDVDIGPLFQTPREVAQQAVDADVHAVGISTLAAGHKTLVPELIKELNSLGRPDILVMCGGVIPPQDYEFLFEVGVSNVFGPGTRIPKAAVQVLDDIEKCLEKKQQSV,750,NP_000246.2.csv,refseq-MUT-NM_000255.3_clinical_seed_0_final,refseq-MUT-NM_000255.3.a2m,Invitae,refseq-MUT-NM_000255.3.npy,1,750,750
+NP_000247.2,MPEPGKKPVSAFSKKPRSVEVAAGSPAVFEAETERAGVKVRWQRGGSDISASNKYGLATEGTRHTLTVREVGPADQGSYAVIAGSSKVKFDLKVIEAEKAEPMLAPAPAPAEATGAPGEAPAPAAELGESAPSPKGSSSAALNGPTPGAPDDPIGLFVMRPQDGEVTVGGSITFSARVAGASLLKPPVVKWFKGKWVDLSSKVGQHLQLHDSYDRASKVYLFELHITDAQPAFTGSYRCEVSTKDKFDCSNFNLTVHEAMGTGDLDLLSAFRRTSLAGGGRRISDSHEDTGILDFSSLLKKRDSFRTPRDSKLEAPAEEDVWEILRQAPPSEYERIAFQYGVTDLRGMLKRLKGMRRDEKKSTAFQKKLEPAYQVSKGHKIRLTVELADHDAEVKWLKNGQEIQMSGSKYIFESIGAKRTLTISQCSLADDAAYQCVVGGEKCSTELFVKEPPVLITRPLEDQLVMVGQRVEFECEVSEEGAQVKWLKDGVELTREETFKYRFKKDGQRHHLIINEAMLEDAGHYALCTSGGQALAELIVQEKKLEVYQSIADLMVGAKDQAVFKCEVSDENVRGVWLKNGKELVPDSRIKVSHIGRVHKLTIDDVTPADEADYSFVPEGFACNLSAKLHFMEVKIDFVPRQEPPKIHLDCPGRIPDTIVVVAGNKLRLDVPISGDPAPTVIWQKAITQGNKAPARPAPDAPEDTGDSDEWVFDKKLLCETEGRVRVETTKDRSIFTVEGAEKEDEGVYTVTVKNPVGEDQVNLTVKVIDVPDAPAAPKISNVGEDSCTVQWEPPAYDGGQPILGYILERKKKKSYRWMRLNFDLIQELSHEARRMIEGVVYEMRVYAVNAIGMSRPSPASQPFMPIGPPSEPTHLAVEDVSDTTVSLKWRPPERVGAGGLDGYSVEYCPEGCSEWVAALQGLTEHTSILVKDLPTGARLLFRVRAHNMAGPGAPVTTTEPVTVQEILQRPRLQLPRHLRQTIQKKVGEPVNLLIPFQGKPRPQVTWTKEGQPLAGEEVSIRNSPTDTILFIRAARRVHSGTYQVTVRIENMEDKATLVLQVVDKPSPPQDLRVTDAWGLNVALEWKPPQDVGNTELWGYTVQKADKKTMEWFTVLEHYRRTHCVVPELIIGNGYYFRVFSQNMVGFSDRAATTKEPVFIPRPGITYEPPNYKALDFSEAPSFTQPLVNRSVIAGYTAMLCCAVRGSPKPKISWFKNGLDLGEDARFRMFSKQGVLTLEIRKPCPFDGGIYVCRATNLQGEARCECRLEVRVPQ,1274,NP_000247.2.csv,refseq-MYBPC3-NM_000256.3_clinical_seed_0_final,refseq-MYBPC3-NM_000256.3.a2m,Invitae,refseq-MYBPC3-NM_000256.3.npy,1,1274,1274
+NP_000248.2,MGDSEMAVFGAAAPYLRKSEKERLEAQTRPFDLKKDVFVPDDKQEFVKAKIVSREGGKVTAETEYGKTVTVKEDQVMQQNPPKFDKIEDMAMLTFLHEPAVLYNLKDRYGSWMIYTYSGLFCVTVNPYKWLPVYTPEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGESGAGKTVNTKRVIQYFAVIAAIGDRSKKDQSPGKGTLEDQIIQANPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKLASADIETYLLEKSRVIFQLKAERDYHIFYQILSNKKPELLDMLLITNNPYDYAFISQGETTVASIDDAEELMATDNAFDVLGFTSEEKNSMYKLTGAIMHFGNMKFKLKQREEQAEPDGTEEADKSAYLMGLNSADLLKGLCHPRVKVGNEYVTKGQNVQQVIYATGALAKAVYERMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFNSFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLQACIDLIEKPMGIMSILEEECMFPKATDMTFKAKLFDNHLGKSANFQKPRNIKGKPEAHFSLIHYAGIVDYNIIGWLQKNKDPLNETVVGLYQKSSLKLLSTLFANYAGADAPIEKGKGKAKKGSSFQTVSALHRENLNKLMTNLRSTHPHFVRCIIPNETKSPGVMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRILNPAAIPEGQFIDSRKGAEKLLSSLDIDHNQYKFGHTKVFFKAGLLGLLEEMRDERLSRIITRIQAQSRGVLARMEYKKLLERRDSLLVIQWNIRAFMGVKNWPWMKLYFKIKPLLKSAEREKEMASMKEEFTRLKEALEKSEARRKELEEKMVSLLQEKNDLQLQVQAEQDNLADAEERCDQLIKNKIQLEAKVKEMNERLEDEEEMNAELTAKKRKLEDECSELKRDIDDLELTLAKVEKEKHATENKVKNLTEEMAGLDEIIAKLTKEKKALQEAHQQALDDLQAEEDKVNTLTKAKVKLEQQVDDLEGSLEQEKKVRMDLERAKRKLEGDLKLTQESIMDLENDKQQLDERLKKKDFELNALNARIEDEQALGSQLQKKLKELQARIEELEEELEAERTARAKVEKLRSDLSRELEEISERLEEAGGATSVQIEMNKKREAEFQKMRRDLEEATLQHEATAAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSEFKLELDDVTSNMEQIIKAKANLEKMCRTLEDQMNEHRSKAEETQRSVNDLTSQRAKLQTENGELSRQLDEKEALISQLTRGKLTYTQQLEDLKRQLEEEVKAKNALAHALQSARHDCDLLREQYEEETEAKAELQRVLSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQEAEEAVEAVNAKCSSLEKTKHRLQNEIEDLMVDVERSNAAAAALDKKQRNFDKILAEWKQKYEESQSELESSQKEARSLSTELFKLKNAYEESLEHLETFKRENKNLQEEISDLTEQLGSSGKTIHELEKVRKQLEAEKMELQSALEEAEASLEHEEGKILRAQLEFNQIKAEIERKLAEKDEEMEQAKRNHLRVVDSLQTSLDAETRSRNEALRVKKKMEGDLNEMEIQLSHANRMAAEAQKQVKSLQSLLKDTQIQLDDAVRANDDLKENIAIVERRNNLLQAELEELRAVVEQTERSRKLAEQELIETSERVQLLHSQNTSLINQKKKMDADLSQLQTEVEEAVQECRNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNMEQTIKDLQHRLDEAEQIALKGGKKQLQKLEARVRELENELEAEQKRNAESVKGMRKSERRIKELTYQTEEDRKNLLRLQDLVDKLQLKVKAYKRQAEEAEEQANTNLSKFRKVQHELDEAEERADIAESQVNKLRAKSRDIGTKGLNEE,1935,NP_000248.2.csv,refseq-MYH7-NM_000257.3_clinical_seed_0_final,refseq-MYH7-NM_000257.3.a2m,Invitae,refseq-MYH7-NM_000257.3.npy,1,1935,1935
+NP_000249.1,MAPKKPEPKKDDAKAAPKAAPAPAPPPEPERPKEVEFDASKIKIEFTPEQIEEFKEAFMLFDRTPKCEMKITYGQCGDVLRALGQNPTQAEVLRVLGKPRQEELNTKMMDFETFLPMLQHISKNKDTGTYEDFVEGLRVFDKEGNGTVMGAELRHVLATLGERLTEDEVEKLMAGQEDSNGCINYEAFVKHIMSS,195,NP_000249.1.csv,refseq-MYL3-NM_000258.2_clinical_seed_0_final,refseq-MYL3-NM_000258.2.a2m,Invitae,refseq-MYL3-NM_000258.2.npy,1,195,195
+NP_000251.3,MVILQQGDHVWMDLRLGQEFDVPIGAVVKLCDSGQVQVVDDEDNEHWISPQNATHIKPMHPTSVHGVEDMIRLGDLNEAGILRNLLIRYRDHLIYTYTGSILVAVNPYQLLSIYSPEHIRQYTNKKIGEMPPHIFAIADNCYFNMKRNSRDQCCIISGESGAGKTESTKLILQFLAAISGQHSWIEQQVLEATPILEAFGNAKTIRNDNSSRFGKYIDIHFNKRGAIEGAKIEQYLLEKSRVCRQALDERNYHVFYCMLEGMSEDQKKKLGLGQASDYNYLAMGNCITCEGRVDSQEYANIRSAMKVLMFTDTENWEISKLLAAILHLGNLQYEARTFENLDACEVLFSPSLATAASLLEVNPPDLMSCLTSRTLITRGETVSTPLSREQALDVRDAFVKGIYGRLFVWIVDKINAAIYKPPSQDVKNSRRSIGLLDIFGFENFAVNSFEQLCINFANEHLQQFFVRHVFKLEQEEYDLESIDWLHIEFTDNQDALDMIANKPMNIISLIDEESKFPKGTDTTMLHKLNSQHKLNANYIPPKNNHETQFGINHFAGIVYYETQGFLEKNRDTLHGDIIQLVHSSRNKFIKQIFQADVAMGAETRKRSPTLSSQFKRSLELLMRTLGACQPFFVRCIKPNEFKKPMLFDRHLCVRQLRYSGMMETIRIRRAGYPIRYSFVEFVERYRVLLPGVKPAYKQGDLRGTCQRMAEAVLGTHDDWQIGKTKIFLKDHHDMLLEVERDKAITDRVILLQKVIRGFKDRSNFLKLKNAATLIQRHWRGHNCRKNYGLMRLGFLRLQALHRSRKLHQQYRLARQRIIQFQARCRAYLVRKAFRHRLWAVLTVQAYARGMIARRLHQRLRAEYLWRLEAEKMRLAEEEKLRKEMSAKKAKEEAERKHQERLAQLAREDAERELKEKEAARRKKELLEQMERARHEPVNHSDMVDKMFGFLGTSGGLPGQEGQAPSGFEDLERGRREMVEEDLDAALPLPDEDEEDLSEYKFAKFAATYFQGTTTHSYTRRPLKQPLLYHDDEGDQLAALAVWITILRFMGDLPEPKYHTAMSDGSEKIPVMTKIYETLGKKTYKRELQALQGEGEAQLPEGQKKSSVRHKLVHLTLKKKSKLTEEVTKRLHDGESTVQGNSMLEDRPTSNLEKLHFIIGNGILRPALRDEIYCQISKQLTHNPSKSSYARGWILVSLCVGCFAPSEKFVKYLRNFIHGGPPGYAPYCEERLRRTFVNGTRTQPPSWLELQATKSKKPIMLPVTFMDGTTKTLLTDSATTAKELCNALADKISLKDRFGFSLYIALFDKVSSLGSGSDHVMDAISQCEQYAKEQGAQERNAPWRLFFRKEVFTPWHSPSEDNVATNLIYQQVVRGVKFGEYRCEKEDDLAELASQQYFVDYGSEMILERLLNLVPTYIPDREITPLKTLEKWAQLAIAAHKKGIYAQRRTDAQKVKEDVVSYARFKWPLLFSRFYEAYKFSGPSLPKNDVIVAVNWTGVYFVDEQEQVLLELSFPEIMAVSSSRECRVWLSLGCSDLGCAAPHSGWAGLTPAGPCSPCWSCRGAKTTAPSFTLATIKGDEYTFTSSNAEDIRDLVVTFLEGLRKRSKYVVALQDNPNPAGEESGFLSFAKGDLIILDHDTGEQVMNSGWANGINERTKQRGDFPTDSVYVMPTVTMPPREIVALVTMTPDQRQDVVRLLQLRTAEPEVRAKPYTLEEFSYDYFRPPPKHTLSRVMVSKARGKDRLWSHTREPLKQALLKKLLGSEELSQEACLAFIAVLKYMGDYPSKRTRSVNELTDQIFEGPLKAEPLKDEAYVQILKQLTDNHIRYSEERGWELLWLCTGLFPPSNILLPHVQRFLQSRKHCPLAIDCLQRLQKALRNGSRKYPPHLVEVEAIQHKTTQIFHKVYFPDDTDEAFEVESSTKAKDFCQNIATRLLLKSSEGFSLFVKIADKVLSVPENDFFFDFVRHLTDWIKKARPIKDGIVPSLTYQVFFMKKLWTTTVPGKDPMADSIFHYYQELPKYLRGYHKCTREEVLQLGALIYRVKFEEDKSYFPSIPKLLRELVPQDLIRQVSPDDWKRSIVAYFNKHAGKSKEEAKLAFLKLIFKWPTFGSAFFEVKQTTEPNFPEILLIAINKYGVSLIDPKTKDILTTHPFTKISNWSSGNTYFHITIGNLVRGSKLLCETSLGYKMDDLLTSYISQMLTAMSKQRGSRSGK,2215,NP_000251.3.csv,refseq-MYO7A-NM_000260.3_clinical_seed_0_final,refseq-MYO7A-NM_000260.3.a2m,Invitae,refseq-MYO7A-NM_000260.3_theta_0.2.npy,1,2215,2215
+NP_000252.1,MRFFCARCCSFGPEMPAVQLLLLACLVWDVGARTAQLRKANDQSGRCQYTFSVASPNESSCPEQSQAMSVIHNLQRDSSTQRLDLEATKARLSSLESLLHQLTLDQAARPQETQEGLQRELGTLRRERDQLETQTRELETAYSNLLRDKSVLEEEKKRLRQENENLARRLESSSQEVARLRRGQCPQTRDTARAVPPGSREVSTWNLDTLAFQELKSELTEVPASRILKESPSGYLRSGEGDTGCGELVWVGEPLTLRTAETITGKYGVWMRDPKPTYPYTQETTWRIDTVGTDVRQVFEYDLISQFMQGYPSKVHILPRPLESTGAVVYSGSLYFQGAESRTVIRYELNTETVKAEKEIPGAGYHGQFPYSWGGYTDIDLAVDEAGLWVIYSTDEAKGAIVLSKLNPENLELEQTWETNIRKQSVANAFIICGTLYTVSSYTSADATVNFAYDTGTGISKTLTIPFKNRYKYSSMIDYNPLEKKLFAWDNLNMVTYDIKLSKM,504,NP_000252.1.csv,refseq-MYOC-NM_000261.1_clinical_seed_0_final,refseq-MYOC-NM_000261.1.a2m,Invitae,refseq-MYOC-NM_000261.1.npy,1,504,504
+NP_000254.2,MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLLGPGPAADFSVSVERALAAKPGLDTYSLGGGGAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWSGSQLRLPRPLPAVPGELTEATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCSYSCSFLLAPEDPIFPIIGSLFLRELIKEFGTDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPSLQMNTSIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLWGPEGNILDYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPRWVAGSW,743,NP_000254.2.csv,refseq-NAGLU-NM_000263.3_clinical_seed_0_final,refseq-NAGLU-NM_000263.3.a2m,Invitae,refseq-NAGLU-NM_000263.3.npy,1,743,743
+NP_000255.2,MASAGNAAEPQDRGGGGSGCIGAPGRPAGGGRRRRTGGLRRAAAPDRDYLHRPSYCDAAFALEQISKGKATGRKAPLWLRAKFQRLLFKLGCYIQKNCGKFLVVGLLIFGAFAVGLKAANLETNVEELWVEVGGRVSRELNYTRQKIGEEAMFNPQLMIQTPKEEGANVLTTEALLQHLDSALQASRVHVYMYNRQWKLEHLCYKSGELITETGYMDQIIEYLYPCLIITPLDCFWEGAKLQSGTAYLLGKPPLRWTNFDPLEFLEELKKINYQVDSWEEMLNKAEVGHGYMDRPCLNPADPDCPATAPNKNSTKPLDMALVLNGGCHGLSRKYMHWQEELIVGGTVKNSTGKLVSAHALQTMFQLMTPKQMYEHFKGYEYVSHINWNEDKAAAILEAWQRTYVEVVHQSVAQNSTQKVLSFTTTTLDDILKSFSDVSVIRVASGYLLMLAYACLTMLRWDCSKSQGAVGLAGVLLVALSVAAGLGLCSLIGISFNAATTQVLPFLALGVGVDDVFLLAHAFSETGQNKRIPFEDRTGECLKRTGASVALTSISNVTAFFMAALIPIPALRAFSLQAAVVVVFNFAMVLLIFPAILSMDLYRREDRRLDIFCCFTSPCVSRVIQVEPQAYTDTHDNTRYSPPPPYSSHSFAHETQITMQSTVQLRTEYDPHTHVYYTTAEPRSEISVQPVTVTQDTLSCQSPESTSSTRDLLSQFSDSSLHCLEPPCTKWTLSSFAEKHYAPFLLKPKAKVVVIFLFLGLLGVSLYGTTRVRDGLDLTDIVPRETREYDFIAAQFKYFSFYNMYIVTQKADYPNIQHLLYDLHRSFSNVKYVMLEENKQLPKMWLHYFRDWLQGLQDAFDSDWETGKIMPNNYKNGSDDGVLAYKLLVQTGSRDKPIDISQLTKQRLVDADGIINPSAFYIYLTAWVSNDPVAYAASQANIRPHRPEWVHDKADYMPETRLRIPAAEPIEYAQFPFYLNGLRDTSDFVEAIEKVRTICSNYTSLGLSSYPNGYPFLFWEQYIGLRHWLLLFISVVLACTFLVCAVFLLNPWTAGIIVMVLALMTVELFGMMGLIGIKLSAVPVVILIASVGIGVEFTVHVALAFLTAIGDKNRRAVLALEHMFAPVLDGAVSTLLGVLMLAGSEFDFIVRYFFAVLAILTILGVLNGLVLLPVLLSFFGPYPEVSPANGLNRLPTPSPEPPPSVVRFAMPPGHTHSGSDSSDSEYSSQTTVSGLSEELRHYEAQQGAGGPAHQVIVEATENPVFAHSTVVHPESRHHPPSNPRQQPHLDSGSLPPGRQGQQPRRDPPREGLWPPPYRPRRDAFEISTEGHSGPSNRARWGPRGARSHNPRNPASTAMGSSVPGYCQPITTVTASASVTVAVHPPPVPGPGRNPRGGLCPGYPETDHGLFEDPHVPFHVRCERRDSKVEVIELQDVECEERPRGSSSN,1447,NP_000255.2.csv,refseq-PTCH1-NM_000264.3_clinical_seed_0_final,refseq-PTCH1-NM_000264.3.a2m,Invitae,refseq-PTCH1-NM_000264.3_theta_0.2.npy,1,1447,1447
+NP_000257.1,MRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS,133,NP_000257.1.csv,refseq-NDP-NM_000266.3_clinical_seed_0_final,refseq-NDP-NM_000266.3.a2m,Invitae,refseq-NDP-NM_000266.3.npy,1,133,133
+NP_000259.1,MAGAIASRMSFSSLKRKQPKTFTVRIVTMDAEMEFNCEMKWKGKDLFDLVCRTLGLRETWFFGLQYTIKDTVAWLKMDKKVLDHDVSKEEPVTFHFLAKFYPENAEEELVQEITQHLFFLQVKKQILDEKIYCPPEASVLLASYAVQAKYGDYDPSVHKRGFLAQEELLPKRVINLYQMTPEMWEERITAWYAEHRGRARDEAEMEYLKIAQDLEMYGVNYFAIRNKKGTELLLGVDALGLHIYDPENRLTPKISFPWNEIRNISYSDKEFTIKPLDKKIDVFKFNSSKLRVNKLILQLCIGNHDLFMRRRKADSLEVQQMKAQAREEKARKQMERQRLAREKQMREEAERTRDELERRLLQMKEEATMANEALMRSEETADLLAEKAQITEEEAKLLAQKAAEAEQEMQRIKATAIRTEEEKRLMEQKVLEAEVLALKMAEESERRAKEADQLKQDLQEAREAERRAKQKLLEIATKPTYPPMNPIPAPLPPDIPSFNLIGDSLSFDFKDTDMKRLSMEIEKEKVEYMEKSKHLQEQLNELKTEIEALKLKERETALDILHNENSDRGGSSKHNTIKKLTLQSAKSRVAFFEEL,595,NP_000259.1.csv,refseq-NF2-NM_000268.3_clinical_seed_0_final,refseq-NF2-NM_000268.3.a2m,Invitae,refseq-NF2-NM_000268.3.npy,1,595,595
+NP_000261.2,MENGYTYEDYKNTAEWLLSHTKHRPQVAIICGSGLGGLTDKLTQAQIFDYGEIPNFPRSTVPGHAGRLVFGFLNGRACVMMQGRFHMYEGYPLWKVTFPVRVFHLLGVDTLVVTNAAGGLNPKFEVGDIMLIRDHINLPGFSGQNPLRGPNDERFGDRFPAMSDAYDRTMRQRALSTWKQMGEQRELQEGTYVMVAGPSFETVAECRVLQKLGADAVGMSTVPEVIVARHCGLRVFGFSLITNKVIMDYESLEKANHEEVLAAGKQAAQKLEQFVSILMASIPLPDKAS,289,NP_000261.2.csv,refseq-PNP-NM_000270.3_clinical_seed_0_final,refseq-PNP-NM_000270.3.a2m,Invitae,refseq-PNP-NM_000270.3.npy,1,289,289
+NP_000262.2,MTARGLALGLLLLLLCPAQVFSQSCVWYGECGIAYGDKRYNCEYSGPPKPLPKDGYDLVQELCPGFFFGNVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFYNLLNLFCELTCSPRQSQFLNVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNACRDVEAPSSNDKALGLLCGKDADACNATNWIEYMFNKDNGQAPFTITPVFSDFPVHGMEPMNNATKGCDESVDEVTAPCSCQDCSIVCGPKPQPPPPPAPWTILGLDAMYVIMWITYMAFLLVFFGAFFAVWCYRKRYFVSEYTPIDSNIAFSVNASDKGEASCCDPVSAAFEGCLRRLFTRWGSFCVRNPGCVIFFSLVFITACSSGLVFVRVTTNPVDLWSAPSSQARLEKEYFDQHFGPFFRTEQLIIRAPLTDKHIYQPYPSGADVPFGPPLDIQILHQVLDLQIAIENITASYDNETVTLQDICLAPLSPYNTNCTILSVLNYFQNSHSVLDHKKGDDFFVYADYHTHFLYCVRAPASLNDTSLLHDPCLGTFGGPVFPWLVLGGYDDQNYNNATALVITFPVNNYYNDTEKLQRAQAWEKEFINFVKNYKNPNLTISFTAERSIEDELNRESDSDVFTVVISYAIMFLYISLALGHMKSCRRLLVDSKVSLGIAGILIVLSSVACSLGVFSYIGLPLTLIVIEVIPFLVLAVGVDNIFILVQAYQRDERLQGETLDQQLGRVLGEVAPSMFLSSFSETVAFFLGALSVMPAVHTFSLFAGLAVFIDFLLQITCFVSLLGLDIKRQEKNRLDIFCCVRGAEDGTSVQASESCLFRFFKNSYSPLLLKDWMRPIVIAIFVGVLSFSIAVLNKVDIGLDQSLSMPDDSYMVDYFKSISQYLHAGPPVYFVLEEGHDYTSSKGQNMVCGGMGCNNDSLVQQIFNAAQLDNYTRIGFAPSSWIDDYFDWVKPQSSCCRVDNITDQFCNASVVDPACVRCRPLTPEGKQRPQGGDFMRFLPMFLSDNPNPKCGKGGHAAYSSAVNILLGHGTRVGATYFMTYHTVLQTSADFIDALKKARLIASNVTETMGINGSAYRVFPYSVFYVFYEQYLTIIDDTIFNLGVSLGAIFLVTMVLLGCELWSAVIMCATIAMVLVNMFGVMWLWGISLNAVSLVNLVMSCGISVEFCSHITRAFTVSMKGSRVERAEEALAHMGSSVFSGITLTKFGGIVVLAFAKSQIFQIFYFRMYLAMVLLGATHGLIFLPVLLSYIGPSVNKAKSCATEERYKGTERERLLNF,1278,NP_000262.2.csv,refseq-NPC1-NM_000271.4_clinical_seed_0_final,refseq-NPC1-NM_000271.4.a2m,Invitae,refseq-NPC1-NM_000271.4.npy,1,1278,1278
+NP_000263.2,MLARRQRDPLQALRRRNQELKQQVDSLLSESQLKEALEPNKRQHIYQRCIQLKQAIDENKNALQKLSKADESAPVANYNQRKEEEHTLLDKLTQQLQGLAVTISRENITEVGAPTEEEEESESEDSEDSGGEEEDAEEEEEEKEENESHKWSTGEEYIAVGDFTAQQVGDLTFKKGEILLVIEKKPDGWWIAKDAKGNEGLVPRTYLEPYSEEEEGQESSEEGSEEDVEAVDETADGAEVKQRTDPHWSAVQKAISEAGIFCLVNHVSFCYLIVLMRNRMETVEDTNGSETGFRAWNVQSRGRIFLVSKPVLQQINTVDVLTTMGAIPAGFRPSTLSQLLEEGNQFRANYFLQPELMPSQLAFRDLMWDATEGTIRSRPSRISLILTLWSCKMIPLPGMSIQVLSRHVRLCLFDGNKVLSNIHTVRATWQPKKPKTWTFSPQVTRILPCLLDGDCFIRSNSASPDLGILFELGISYIRNSTGERGELSCGWVFLKLFDASGVPIPAKTYELFLNGGTPYEKGIEVDPSISRRAHGSVFYQIMTMRRQPQLLVKLRSLNRRSRNVLSLLPETLIGNMCSIHLLIFYRQILGDVLLKDRMSLQSTDLISHPMLATFPMLLEQPDVMDALRSSWAGKESTLKRSEKRDKEFLKSTFLLVYHDCVLPLLHSTRLPPFRWAEEETETARWKVITDFLKQNQENQGALQALLSPDGVHEPFDLSEQTYDFLGEMRKNAV,733,NP_000263.2.csv,refseq-NPHP1-NM_000272.3_clinical_seed_0_final,refseq-NPHP1-NM_000272.3.a2m,Invitae,refseq-NPHP1-NM_000272.3.npy,1,733,733
+NP_000264.2,MASPRLGTFCCPTRDAATQLVLSFQPRAFHALCLGSGGLRLALGLLQLLPGRRPAGPGSPATSPPASVRILRAAAACDLLGCLGMVIRSTVWLGFPNFVDSVSDMNHTEIWPAAFCVGSAMWIQLLYSACFWWLFCYAVDAYLVIRRSAGLSTILLYHIMAWGLATLLCVEGAAMLYYPSVSRCERGLDHAIPHYVTMYLPLLLVLVANPILFQKTVTAVASLLKGRQGIYTENERRMGAVIKIRFFKIMLVLIICWLSNIINESLLFYLEMQTDINGGSLKPVRTAAKTTWFIMGILNPAQGFLLSLAFYGWTGCSLGFQSPRKEIQWESLTTSAAEGAHPSPLMPHENPASGKVSQVGGQTSDEALSMLSEGSDASTIEIHTASESCNKNEGDPALPTHGDL,404,NP_000264.2.csv,refseq-GPR143-NM_000273.2_clinical_seed_0_final,refseq-GPR143-NM_000273.2.a2m,Invitae,refseq-GPR143-NM_000273.2.npy,1,404,404
+NP_000265.1,MFSKLAHLQRFAVLSRGVHSSVASATSVATKKTVQGPPTSDDIFEREYKYGAHNYHPLPVALERGKGIYLWDVEGRKYFDFLSSYSAVNQGHCHPKIVNALKSQVDKLTLTSRAFYNNVLGEYEEYITKLFNYHKVLPMNTGVEAGETACKLARKWGYTVKGIQKYKAKIVFAAGNFWGRTLSAISSSTDPTSYDGFGPFMPGFDIIPYNDLPALERALQDPNVAAFMVEPIQGEAGVVVPDPGYLMGVRELCTRHQVLFIADEIQTGLARTGRWLAVDYENVRPDIVLLGKALSGGLYPVSAVLCDDDIMLTIKPGEHGSTYGGNPLGCRVAIAALEVLEEENLAENADKLGIILRNELMKLPSDVVTAVRGKGLLNAIVIKETKDWDAWKVCLRLRDNGLLAKPTHGDIIRFAPPLVIKEDELRESIEIINKTILSF,439,NP_000265.1.csv,refseq-OAT-NM_000274.3_clinical_seed_0_final,refseq-OAT-NM_000274.3.a2m,Invitae,refseq-OAT-NM_000274.3.npy,1,439,439
+NP_000266.2,MHLEGRDGRRYPGAPAVELLQTSVPSGLAELVAGKRRLPRGAGGADPSHSCPRGAAGQSSWAPAGQEFASFLTKGRSHSSLPQMSSSRSKDSCFTENTPLLRNSLQEKGSRCIPVYHPEFITAEESWEDSSADWERRYLLSREVSGLSASASSEKGDLLDSPHIRLRLSKLRRCVQWLKVMGLFAFVVLCSILFSLYPDQGKLWQLLALSPLENYSVNLSSHVDSTLLQVDLAGALVASGPSRPGREEHIVVELTQADALGSRWRRPQQVTHNWTVYLNPRRSEHSVMSRTFEVLTRETVSISIRASLQQTQAVPLLMAHQYLRGSVETQVTIATAILAGVYALIIFEIVHRTLAAMLGSLAALAALAVIGDRPSLTHVVEWIDFETLALLFGMMILVAIFSETGFFDYCAVKAYRLSRGRVWAMIIMLCLIAAVLSAFLDNVTTMLLFTPVTIRLCEVLNLDPRQVLIAEVIFTNIGGAATAIGDPPNVIIVSNQELRKMGLDFAGFTAHMFIGICLVLLVCFPLLRLLYWNRKLYNKEPSEIVELKHEIHVWRLTAQRISPASREETAVRRLLLGKVLALEHLLARRLHTFHRQISQEDKNWETNIQELQKKHRISDGILLAKCLTVLGFVIFMFFLNSFVPGIHLDLGWIAILGAIWLLILADIHDFEIILHRVEWATLLFFAALFVLMEALAHLHLIEYVGEQTALLIKMVPEEQRLIAAIVLVVWVSALASSLIDNIPFTATMIPVLLNLSHDPEVGLPAPPLMYALAFGACLGGNGTLIGASANVVCAGIAEQHGYGFSFMEFFRLGFPMMVVSCTVGMCYLLVAHVVVGWN,838,NP_000266.2.csv,refseq-OCA2-NM_000275.2_clinical_seed_0_final,refseq-OCA2-NM_000275.2.a2m,Invitae,refseq-OCA2-NM_000275.2.npy,1,838,838
+NP_000267.2,MEPPLPVGAQPLATVEGMEMKGPLREPCALTLAQRNGQYELIIQLHEKEQHVQDIIPINSHFRCVQEAEETLLIDIASNSGCKIRVQGDWIRERRFEIPDEEHCLKFLSAVLAAQKAQSQLLVPEQKDSSSWYQKLDTKDKPSVFSGLLGFEDNFSSMNLDKKINSQNQPTGIHREPPPPPFSVNKMLPREKEASNKEQPKVTNTMRKLFVPNTQSGQREGLIKHILAKREKEYVNIQTFRFFVGTWNVNGQSPDSGLEPWLNCDPNPPDIYCIGFQELDLSTEAFFYFESVKEQEWSMAVERGLHSKAKYKKVQLVRLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHNTTFCIVNSHLAAHVEDFERRNQDYKDICARMSFVVPNQTLPQLNIMKHEVVIWLGDLNYRLCMPDANEVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRWDSSGKCRVPAWCDRILWRGTNVNQLNYRSHMELKTSDHKPVSALFHIGVKVVDERRYRKVFEDSVRIMDRMENDFLPSLELSRREFVFENVKFRQLQKEKFQISNNGQVPCHFSFIPKLNDSQYCKPWLRAEPFEGYLEPNETVDISLDVYVSKDSVTILNSGEDKIEDILVLHLDRGKDYFLTISGNYLPSCFGTSLEALCRMKRPIREVPVTKLIDLEEDSFLEKEKSLLQMVPLDEGASERPLQVPKEIWLLVDHLFKYACHQEDLFQTPGMQEELQQIIDCLDTSIPETIPGSNHSVAEALLIFLEALPEPVICYELYQRCLDSAYDPRICRQVISQLPRCHRNVFRYLMAFLRELLKFSEYNSVNANMIATLFTSLLLRPPPNLMARQTPSDRQRAIQFLLGFLLGSEED,901,NP_000267.2.csv,refseq-OCRL-NM_000276.3_clinical_seed_0_final,refseq-OCRL-NM_000276.3.a2m,Invitae,refseq-OCRL-NM_000276.3.npy,1,901,901
+NP_000268.1,MSTAVLENPGLGRKLSDFGQETSYIEDNCNQNGAISLIFSLKEEVGALAKVLRLFEENDVNLTHIESRPSRLKKDEYEFFTHLDKRSLPALTNIIKILRHDIGATVHELSRDKKKDTVPWFPRTIQELDRFANQILSYGAELDADHPGFKDPVYRARRKQFADIAYNYRHGQPIPRVEYMEEEKKTWGTVFKTLKSLYKTHACYEYNHIFPLLEKYCGFHEDNIPQLEDVSQFLQTCTGFRLRPVAGLLSSRDFLGGLAFRVFHCTQYIRHGSKPMYTPEPDICHELLGHVPLFSDRSFAQFSQEIGLASLGAPDEYIEKLATIYWFTVEFGLCKQGDSIKAYGAGLLSSFGELQYCLSEKPKLLPLELEKTAIQNYTVTEFQPLYYVAESFNDAKEKVRNFAATIPRPFSVRYDPYTQRIEVLDNTQQLKILADSINSEIGILCSALQKIK,452,NP_000268.1.csv,refseq-PAH-NM_000277.3_clinical_seed_0_final,refseq-PAH-NM_000277.3.a2m,Invitae,refseq-PAH-NM_000277.3.npy,1,452,452
+NP_000269.3,MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEYKRQNPTMFAWEIRDRLLAEGICDNDTVPSVSSINRIIRTKVQQPFHPTPDGAGTGVTAPGHTIVPSTASPPVSSASNDPVGSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQSGVDSLRKHLRADTFTQQQLEALDRVFERPSYPDVFQASEHIKSEQGNEYSLPALTPGLDEVKSSLSASTNPELGSNVSGTQTYPVVTGRDMASTTLPGYPPHVPPTGQGSYPTSTLAGMVPGSEFSGNPYSHPQYTAYNEAWRFSNPALLSSPYYYSAAPRGSAPAAAAAAYDRH,394,NP_000269.3.csv,refseq-PAX2-NM_000278.5_clinical_seed_0_final,refseq-PAX2-NM_000278.5.a2m,Invitae,refseq-PAX2-NM_000278.5.npy,1,394,394
+NP_000272.1,MAGKAHRLSAEERDQLLPNLRAVGWNELEGRDAIFKQFHFKDFNRAFGFMTRVALQAEKLDHHPEWFNVYNKVHITLSTHECAGLSERDINLASFIEQVAVSMT,104,NP_000272.1.csv,refseq-PCBD1-NM_000281.3_clinical_seed_0_final,refseq-PCBD1-NM_000281.3.a2m,Invitae,refseq-PCBD1-NM_000281.3.npy,1,104,104
+NP_000273.2,MAGFWVGTAPLVAAGRRGRWPPQQLMLSAALRTLKHVLYYSRQCLMVSRNLGSVGYDPNEKTFDKILVANRGEIACRVIRTCKKMGIKTVAIHSDVDASSVHVKMADEAVCVGPAPTSKSYLNMDAIMEAIKKTRAQAVHPGYGFLSENKEFARCLAAEDVVFIGPDTHAIQAMGDKIESKLLAKKAEVNTIPGFDGVVKDAEEAVRIAREIGYPVMIKASAGGGGKGMRIAWDDEETRDGFRLSSQEAASSFGDDRLLIEKFIDNPRHIEIQVLGDKHGNALWLNERECSIQRRNQKVVEEAPSIFLDAETRRAMGEQAVALARAVKYSSAGTVEFLVDSKKNFYFLEMNTRLQVEHPVTECITGLDLVQEMIRVAKGYPLRHKQADIRINGWAVECRVYAEDPYKSFGLPSIGRLSQYQEPLHLPGVRVDSGIQPGSDISIYYDPMISKLITYGSDRTEALKRMADALDNYVIRGVTHNIALLREVIINSRFVKGDISTKFLSDVYPDGFKGHMLTKSEKNQLLAIASSLFVAFQLRAQHFQENSRMPVIKPDIANWELSVKLHDKVHTVVASNNGSVFSVEVDGSKLNVTSTWNLASPLLSVSVDGTQRTVQCLSREAGGNMSIQFLGTVYKVNILTRLAAELNKFMLEKVTEDTSSVLRSPMPGVVVAVSVKPGDAVAEGQEICVIEAMKMQNSMTAGKTGTVKSVHCQAGDTVGEGDLLVELE,728,NP_000273.2.csv,refseq-PCCA-NM_000282.3_clinical_seed_0_final,refseq-PCCA-NM_000282.3.a2m,Invitae,refseq-PCCA-NM_000282.3.npy,1,728,728
+NP_000274.3,MSLSEEQARSFLDQNPDFARQYFGKKLSPENVAAACEDGCPPDCDSLRDLCQVEESTALLELVQDMQESINMERVVFKVLRRLCTLLQADRCSLFMYRQRNGVAELATRLFSVQPDSVLEDCLVPPDSEIVFPLDIGVVGHVAQTKKMVNVEDVAECPHFSSFADELTDYKTKNMLATPIMNGKDVVAVIMAVNKLNGPFFTSEDEDVFLKYLNFATLYLKIYHLSYLHNCETRRGQVLLWSANKVFEELTDIERQFHKAFYTVRAYLNCERYSVGLLDMTKEKEFFDVWSVLMGESQPYSGPRTPDGREIVFYKVIDYVLHGKEEIKVIPTPSADHWALASGLPSYVAESGFICNIMNASADEMFKFQEGALDDSGWLIKNVLSMPIVNKKEEIVGVATFYNRKDGKPFDEQDEVLMESLTQFLGWSVMNTDTYDKMNKLENRKDIAQDMVLYHVKCDRDEIQLILPTRARLGKEPADCDEDELGEILKEELPGPTTFDIYEFHFSDLECTELDLVKCGIQMYYELGVVRKFQIPQEVLVRFLFSISKGYRRITYHNWRHGFNVAQTMFTLLMTGKLKSYYTDLEAFAMVTAGLCHDIDHRGTNNLYQMKSQNPLAKLHGSSILERHHLEFGKFLLSEETLNIYQNLNRRQHEHVIHLMDIAIIATDLALYFKKRAMFQKIVDESKNYQDKKSWVEYLSLETTRKEIVMAMMMTACDLSAITKPWEVQSKVALLVAAEFWEQGDLERTVLDQQPIPMMDRNKAAELPKLQVGFIDFVCTFVYKEFSRFHEEILPMFDRLQNNRKEWKALADEYEAKVKALEEKEEEERVAAKKVGTEICNGGPAPKSSTCCIL,854,NP_000274.3.csv,refseq-PDE6B-NM_000283.4_clinical_seed_0_final,refseq-PDE6B-NM_000283.4.a2m,Invitae,refseq-PDE6B-NM_000283.4.npy,1,854,854
+NP_000275.1,MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS,390,NP_000275.1.csv,refseq-PDHA1-NM_000284.3_clinical_seed_0_final,refseq-PDHA1-NM_000284.3.a2m,Invitae,refseq-PDHA1-NM_000284.3.npy,1,390,390
+NP_000276.2,MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK,493,NP_000276.2.csv,refseq-PEPD-NM_000285.3_clinical_seed_0_final,refseq-PEPD-NM_000285.3.a2m,Invitae,refseq-PEPD-NM_000285.3.npy,1,493,493
+NP_000277.1,MAEHGAHFTAASVADDQPSIFEVVAQDSLMTAVRPALQHVVKVLAESNPTHYGFLWRWFDEIFTLLDLLLQQHYLSRTSASFSENFYGLKRIVMGDTHKSQRLASAGLPKQQLWKSIMFLVLLPYLKVKLEKLVSSLREEDEYSIHPPSSRWKRFYRAFLAAYPFVNMAWEGWFLVQQLRYILGKAQHHSPLLRLAGVQLGRLTVQDIQALEHKPAKASMMQQPARSVSEKINSALKKAVGGVALSLSTGLSVGVFFLQFLDWWYSSENQETIKSLTALPTPPPPVHLDYNSDSPLLPKMKTVCPLCRKTRVNDTVLATSGYVFCYRCVFHYVRSHQACPITGYPTEVQHLIKLYSPEN,359,NP_000277.1.csv,refseq-PEX12-NM_000286.2_clinical_seed_0_final,refseq-PEX12-NM_000286.2.a2m,Invitae,refseq-PEX12-NM_000286.2.npy,1,359,359
+NP_000278.3,MALAVLRVLEPFPTETPPLAVLLPPGGPWPAAELGLVLALRPAGESPAGPALLVAALEGPDAGTEEQGPGPPQLLVSRALLRLLALGSGAWVRARAVRRPPALGWALLGTSLGPGLGPRVGPLLVRRGETLPVPGPRVLETRPALQGLLGPGTRLAVTELRGRARLCPESGDSSRPPPPPVVSSFAVSGTVRRLQGVLGGTGDSLGVSRSCLRGLGLFQGEWVWVAQARESSNTSQPHLARVQVLEPRWDLSDRLGPGSGPLGEPLADGLALVPATLAFNLGCDPLEMGELRIQRYLEGSIAPEDKGSCSLLPGPPFARELHIEIVSSPHYSTNGNYDGVLYRHFQIPRVVQEGDVLCVPTIGQVEILEGSPEKLPRWREMFFKVKKTVGEAPDGPASAYLADTTHTSLYMVGSTLSPVPWLPSEESTLWSSLSPPGLEALVSELCAVLKPRLQPGGALLTGTSSVLLRGPPGCGKTTVVAAACSHLGLHLLKVPCSSLCAESSGAVETKLQAIFSRARRCRPAVLLLTAVDLLGRDRDGLGEDARVMAVLRHLLLNEDPLNSCPPLMVVATTSRAQDLPADVQTAFPHELEVPALSEGQRLSILRALTAHLPLGQEVNLAQLARRCAGFVVGDLYALLTHSSRAACTRIKNSGLAGGLTEEDEGELCAAGFPLLAEDFGQALEQLQTAHSQAVGAPKIPSVSWHDVGGLQEVKKEILETIQLPLEHPELLSLGLRRSGLLLHGPPGTGKTLLAKAVATECSLTFLSVKGPELINMYVGQSEENVREVFARARAAAPCIIFFDELDSLAPSRGRSGDSGGVMDRVVSQLLAELDGLHSTQDVFVIGATNRPDLLDPALLRPGRFDKLVFVGANEDRASQLRVLSAITRKFKLEPSVSLVNVLDCCPPQLTGADLYSLCSDAMTAALKRRVHDLEEGLEPGSSALMLTMEDLLQAAARLQPSVSEQELLRYKRIQRKFAAC,980,NP_000278.3.csv,refseq-PEX6-NM_000287.3_clinical_seed_0_final,refseq-PEX6-NM_000287.3.a2m,Invitae,refseq-PEX6-NM_000287.3.npy,1,980,980
+NP_000279.1,MSAVCGGAARMLRTPGRHGYAAEFSPYLPGRLACATAQHYGIAGCGTLLILDPDEAGLRLFRSFDWNDGLFDVTWSENNEHVLITCSGDGSLQLWDTAKAAGPLQVYKEHAQEVYSVDWSQTRGEQLVVSGSWDQTVKLWDPTVGKSLCTFRGHESIIYSTIWSPHIPGCFASASGDQTLRIWDVKAAGVRIVIPAHQAEILSCDWCKYNENLLVTGAVDCSLRGWDLRNVRQPVFELLGHTYAIRRVKFSPFHASVLASCSYDFTVRFWNFSKPDSLLETVEHHTEFTCGLDFSLQSPTQVADCSWDETIKIYDPACLTIPA,323,NP_000279.1.csv,refseq-PEX7-NM_000288.3_clinical_seed_0_final,refseq-PEX7-NM_000288.3.a2m,Invitae,refseq-PEX7-NM_000288.3.npy,1,323,323
+NP_000280.1,MTHEEHHAAKTLGIGKAIAVLTSGGDAQGMNAAVRAVVRVGIFTGARVFFVHEGYQGLVDGGDHIKEATWESVSMMLQLGGTVIGSARCKDFREREGRLRAAYNLVKRGITNLCVIGGDGSLTGADTFRSEWSDLLSDLQKAGKITDEEATKSSYLNIVGLVGSIDNDFCGTDMTIGTDSALHRIMEIVDAITTTAQSHQRTFVLEVMGRHCGYLALVTSLSCGADWVFIPECPPDDDWEEHLCRRLSETRTRGSRLNIIIVAEGAIDKNGKPITSEDIKNLVVKRLGYDTRVTVLGHVQRGGTPSAFDRILGSRMGVEAVMALLEGTPDTPACVVSLSGNQAVRLPLMECVQVTKDVTKAMDEKKFDEALKLRGRSFMNNWEVYKLLAHVRPPVSKSGSHTVAVMNVGAPAAGMNAAVRSTVRIGLIQGNRVLVVHDGFEGLAKGQIEEAGWSYVGGWTGQGGSKLGTKRTLPKKSFEQISANITKFNIQGLVIIGGFEAYTGGLELMEGRKQFDELCIPFVVIPATVSNNVPGSDFSVGADTALNTICTTCDRIKQSAAGTKRRVFIIETMGGYCGYLATMAGLAAGADAAYIFEEPFTIRDLQANVEHLVQKMKTTVKRGLVLRNEKCNENYTTDFIFNLYSEEGKGIFDSRKNVLGHMQQGGSPTPFDRNFATKMGAKAMNWMSGKIKESYRNGRIFANTPDSGCVLGMRKRALVFQPVAELKDQTDFEHRIPKEQWWLKLRPILKILAKYEIDLDTSDHAHLEHITRKRSGEAAV,780,NP_000280.1.csv,refseq-PFKM-NM_000289.5_clinical_seed_0_final,refseq-PFKM-NM_000289.5.a2m,Invitae,refseq-PFKM-NM_000289.5.npy,1,780,780
+NP_000282.1,MSLSNKLTLDKLDVKGKRVVMRVDFNVPMKNNQITNNQRIKAAVPSIKFCLDNGAKSVVLMSHLGRPDGVPMPDKYSLEPVAVELKSLLGKDVLFLKDCVGPEVEKACANPAAGSVILLENLRFHVEEEGKGKDASGNKVKAEPAKIEAFRASLSKLGDVYVNDAFGTAHRAHSSMVGVNLPQKAGGFLMKKELNYFAKALESPERPFLAILGGAKVADKIQLINNMLDKVNEMIIGGGMAFTFLKVLNNMEIGTSLFDEEGAKIVKDLMSKAEKNGVKITLPVDFVTADKFDENAKTGQATVASGIPAGWMGLDCGPESSKKYAEAVTRAKQIVWNGPVGVFEWEAFARGTKALMDEVVKATSRGCITIIGGGDTATCCAKWNTEDKVSHVSTGGGASLELLEGKVLPGVDALSNI,417,NP_000282.1.csv,refseq-PGK1-NM_000291.3_clinical_seed_0_final,refseq-PGK1-NM_000291.3.a2m,Invitae,refseq-PGK1-NM_000291.3_theta_0.2.npy,1,417,417
+NP_000283.1,MRSRSNSGVRLDGYARLVQQTILCYQNPVTGLLSASHEQKDAWVRDNIYSILAVWGLGMAYRKNADRDEDKAKAYELEQNVVKLMRGLLQCMMRQVAKVEKFKHTQSTKDSLHAKYNTATCGTVVGDDQWGHLQVDATSLFLLFLAQMTASGLRIIFTLDEVAFIQNLVFYIEAAYKVADYGMWERGDKTNQGIPELNASSVGMAKAALEAIDELDLFGAHGGRKSVIHVLPDEVEHCQSILFSMLPRASTSKEIDAGLLSIISFPAFAVEDVNLVNVTKNEIISKLQGRYGCCRFLRDGYKTPREDPNRLHYDPAELKLFENIECEWPVFWTYFIIDGVFSGDAVQVQEYREALEGILIRGKNGIRLVPELYAVPPNKVDEEYKNPHTVDRVPMGKVPHLWGQSLYILSSLLAEGFLAAGEIDPLNRRFSTSVKPDVVVQVTVLAENNHIKDLLRKHGVNVQSIADIHPIQVQPGRILSHIYAKLGRNKNMNLSGRPYRHIGVLGTSKLYVIRNQIFTFTPQFTDQHHFYLALDNEMIVEMLRIELAYLCTCWRMTGRPTLTFPISRTMLTNDGSDIHSAVLSTIRKLEDGYFGGARVKLGNLSEFLTTSFYTYLTFLDPDCDEKLFDNASEGTFSPDSDSDLVGYLEDTCNQESQDELDHYINHLLQSTSLRSYLPPLCKNTEDRHVFSAIHSTRDILSVMAKAKGLEVPFVPMTLPTKVLSAHRKSLNLVDSPQPLLEKVPESDFQWPRDDHGDVDCEKLVEQLKDCSNLQDQADILYILYVIKGPSWDTNLSGQHGVTVQNLLGELYGKAGLNQEWGLIRYISGLLRKKVEVLAEACTDLLSHQKQLTVGLPPEPREKIISAPLPPEELTKLIYEASGQDISIAVLTQEIVVYLAMYVRAQPSLFVEMLRLRIGLIIQVMATELARSLNCSGEEASESLMNLSPFDMKNLLHHILSGKEFGVERSVRPIHSSTSSPTISIHEVGHTGVTKTERSGINRLRSEMKQMTRRFSADEQFFSVGQAASSSAHSSKSARSSTPSSPTGTSSSDSGGHHIGWGERQGQWLRRRRLDGAINRVPVGFYQRVWKILQKCHGLSIDGYVLPSSTTREMTPHEIKFAVHVESVLNRVPQPEYRQLLVEAIMVLTLLSDTEMTSIGGIIHVDQIVQMASQLFLQDQVSIGAMDTLEKDQATGICHFFYDSAPSGAYGTMTYLTRAVASYLQELLPNSGCQMQ,1235,NP_000283.1.csv,refseq-PHKA2-NM_000292.2_clinical_seed_0_final,refseq-PHKA2-NM_000292.2.a2m,Invitae,refseq-PHKA2-NM_000292.2.npy,1,1235,1235
+NP_000286.3,MPSSVSWGILLLAGLCCLVPVSLAEDPQGDAAQKTDTSHHDQDHPTFNKITPNLAEFAFSLYRQLAHQSNSTNIFFSPVSIATAFAMLSLGTKADTHDEILEGLNFNLTEIPEAQIHEGFQELLRTLNQPDSQLQLTTGNGLFLSEGLKLVDKFLEDVKKLYHSEAFTVNFGDTEEAKKQINDYVEKGTQGKIVDLVKELDRDTVFALVNYIFFKGKWERPFEVKDTEEEDFHVDQVTTVKVPMMKRLGMFNIQHCKKLSSWVLLMKYLGNATAIFFLPDEGKLQHLENELTHDIITKFLENEDRRSASLHLPKLSITGTYDLKSVLGQLGITKVFSNGADLSGVTEEAPLKLSKAVHKAVLTIDEKGTEAAGAMFLEAIPMSIPPEVKFNKPFVFLMIEQNTKSPLFMGKVVNPTQK,418,NP_000286.3.csv,refseq-SERPINA1-NM_000295.4_clinical_seed_0_final,refseq-SERPINA1-NM_000295.4.a2m,Invitae,refseq-SERPINA1-NM_000295.4.npy,1,418,418
+NP_000288.1,MVNSSRVQPQQPGDAKRPPAPRAPDPGRLMAGCAAVGASLAAPGGLCEQRGLEIEMQRIRQAAARDPPAGAAASPSPPLSSCSRQAWSRDNPGFEAEEEEEEVEGEEGGMVVEMDVEWRPGSRRSAASSAVSSVGARSRGLGGYHGAGHPSGRRRRREDQGPPCPSPVGGGDPLHRHLPLEGQPPRVAWAERLVRGLRGLWGTRLMEESSTNREKYLKSVLRELVTYLLFLIVLCILTYGMMSSNVYYYTRMMSQLFLDTPVSKTEKTNFKTLSSMEDFWKFTEGSLLDGLYWKMQPSNQTEADNRSFIFYENLLLGVPRIRQLRVRNGSCSIPQDLRDEIKECYDVYSVSSEDRAPFGPRNGTAWIYTSEKDLNGSSHWGIIATYSGAGYYLDLSRTREETAAQVASLKKNVWLDRGTRATFIDFSVYNANINLFCVVRLLVEFPATGGVIPSWQFQPLKLIRYVTTFDFFLAACEIIFCFFIFYYVVEEILEIRIHKLHYFRSFWNCLDVVIVVLSVVAIGINIYRTSNVEVLLQFLEDQNTFPNFEHLAYWQIQFNNIAAVTVFFVWIKLFKFINFNRTMSQLSTTMSRCAKDLFGFAIMFFIIFLAYAQLAYLVFGTQVDDFSTFQECIFTQFRIILGDINFAEIEEANRVLGPIYFTTFVFFMFFILLNMFLAIINDTYSEVKSDLAQQKAEMELSDLIRKGYHKALVKLKLKKNTVDDISESLRQGGGKLNFDELRQDLKGKGHTDAEIEAIFTKYDQDGDQELTEHEHQQMRDDLEKEREDLDLDHSSLPRPMSSRSFPRSLDDSEEDDDEDSGHSSRRRGSISSGVSYEEFQVLVRRVDRMEHSIGSIVSKIDAVIVKLEIMERAKLKRREVLGRLLDGVAEDERLGRDSEIHREQMERLVREELERWESDDAASQISHGLGTPVGLNGQPRPRSSRPSSSQSTEGMEGAGGNGSSNVHV,968,NP_000288.1.csv,refseq-PKD2-NM_000297.3_clinical_seed_0_final,refseq-PKD2-NM_000297.3.a2m,Invitae,refseq-PKD2-NM_000297.3.npy,1,968,968
+NP_000292.1,MEHKEVVLLLLLFLKSGQGEPLDDYVNTQGASLFSVTKKQLGAGSIEECAAKCEEDEEFTCRAFQYHSKEQQCVIMAENRKSSIIIRMRDVVLFEKKVYLSECKTGNGKNYRGTMSKTKNGITCQKWSSTSPHRPRFSPATHPSEGLEENYCRNPDNDPQGPWCYTTDPEKRYDYCDILECEEECMHCSGENYDGKISKTMSGLECQAWDSQSPHAHGYIPSKFPNKNLKKNYCRNPDRELRPWCFTTDPNKRWELCDIPRCTTPPPSSGPTYQCLKGTGENYRGNVAVTVSGHTCQHWSAQTPHTHNRTPENFPCKNLDENYCRNPDGKRAPWCHTTNSQVRWEYCKIPSCDSSPVSTEQLAPTAPPELTPVVQDCYHGDGQSYRGTSSTTTTGKKCQSWSSMTPHRHQKTPENYPNAGLTMNYCRNPDADKGPWCFTTDPSVRWEYCNLKKCSGTEASVVAPPPVVLLPDVETPSEEDCMFGNGKGYRGKRATTVTGTPCQDWAAQEPHRHSIFTPETNPRAGLEKNYCRNPDGDVGGPWCYTTNPRKLYDYCDVPQCAAPSFDCGKPQVEPKKCPGRVVGGCVAHPHSWPWQVSLRTRFGMHFCGGTLISPEWVLTAAHCLEKSPRPSSYKVILGAHQEVNLEPHVQEIEVSRLFLEPTRKDIALLKLSSPAVITDKVIPACLPSPNYVVADRTECFITGWGETQGTFGAGLLKEAQLPVIENKVCNRYEFLNGRVQSTELCAGHLAGGTDSCQGDSGGPLVCFEKDKYILQGVTSWGLGCARPNKPGVYVRVSRFVTWIEGVMRNN,810,NP_000292.1.csv,refseq-PLG-NM_000301.3_clinical_seed_0_final,refseq-PLG-NM_000301.3.a2m,Invitae,refseq-PLG-NM_000301.3_theta_0.2.npy,1,810,810
+NP_000293.2,MRPLLLLALLGWLLLAEAKGDAKPEDNLLVLTVATKETEGFRRFKRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKALEKHADKEDLVILFADSYDVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVVSDGKRFLGSGGFIGYAPNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIFQNLDGALDEVVLKFEMGHVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETGCTVCDEGLRSLKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNHEQHHKAQVEEFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNSLRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRRVGVWNVPYISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTLGHLLSLDSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVETPCPDVYWFPIFTEVACDELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMTEKLYPGYYTRAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRFLRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP,727,NP_000293.2.csv,refseq-PLOD1-NM_000302.3_clinical_seed_0_final,refseq-PLOD1-NM_000302.3.a2m,Invitae,refseq-PLOD1-NM_000302.3.npy,1,727,727
+NP_000294.1,MAAPGPALCLFDVDGTLTAPRQKITKEMDDFLQKLRQKIKIGVVGGSDFEKVQEQLGNDVVEKYDYVFPENGLVAYKDGKLLCRQNIQSHLGEALIQDLINYCLSYIAKIKLPKKRGTFIEFRNGMLNVSPIGRSCSQEERIEFYELDKKENIRQKFVADLRKEFAGKGLTFSIGGQISFDVFPDGWDKRYCLRHVENDGYKTIYFFGDKTMPGGNDHEIFTDPRTMGYSVTAPEDTRRICELLFS,246,NP_000294.1.csv,refseq-PMM2-NM_000303.2_clinical_seed_0_final,refseq-PMM2-NM_000303.2.a2m,Invitae,refseq-PMM2-NM_000303.2.npy,1,246,246
+NP_000295.1,MLLLLLSIIVLHVAVLVLLFVSTIVSQWIVGNGHATDLWQNCSTSSSGNVHHCFSSSPNEWLQSVQATMILSIIFSILSLFLFFCQLFTLTKGGRFYITGIFQILAGLCVMSAAAIYTVRHPEWHLNSDYSYGFAYILAWVAFPLALLSGVIYVILRKRE,160,NP_000295.1.csv,refseq-PMP22-NM_000304.2_clinical_seed_0_final,refseq-PMP22-NM_000304.2.a2m,Invitae,refseq-PMP22-NM_000304.2.npy,1,160,160
+NP_000298.3,MATAASNPYSILSSTSLVHADSAGMQQGSPFRNPQKLLQSDYLQGVPSNGHPLGHHWVTSLSDGGPWSSTLATSPLDQQDVKPGREDLQLGAIIHHRSPHVAHHSPHTNHPNAWGASPAPNPSITSSGQPLNVYSQPGFTVSGMLEHGGLTPPPAAASAQSLHPVLREPPDHGELGSHHCQDHSDEETPTSDELEQFAKQFKQRRIKLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKPLLNKWLEEADSSTGSPTSIDKIAAQGRKRKKRTSIEVSVKGVLETHFLKCPKPAAQEISSLADSLQLEKEVVRVWFCNRRQKEKRMTPPGDQQPHEVYSHTVKTDTSCHDL,361,NP_000298.3.csv,NP_000298.3_colabfold_clinical_seed_0_final,NP_000298.3_colabfold.a2m,colabfold,NP_000298.3_colabfold_theta_0.2.npy,1,361,361
+NP_000299.3,MIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLAKQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFRLFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFAYYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPY,480,NP_000299.3.csv,PPGB_HUMAN_b03_clinical_seed_0_final,PPGB_HUMAN_b03.a2m,EVE,PPGB_HUMAN_b03_theta_0.2.npy,1,480,480
+NP_000300.1,MGRTVVVLGGGISGLAASYHLSRAPCPPKVVLVESSERLGGWIRSVRGPNGAIFELGPRGIRPAGALGARTLLLVSELGLDSEVLPVRGDHPAAQNRFLYVGGALHALPTGLRGLLRPSPPFSKPLFWAGLRELTKPRGKEPDETVHSFAQRRLGPEVASLAMDSLCRGVFAGNSRELSIRSCFPSLFQAEQTHRSILLGLLLGAGRTPQPDSALIRQALAERWSQWSLRGGLEMLPQALETHLTSRGVSVLRGQPVCGLSLQAEGRWKVSLRDSSLEADHVISAIPASVLSELLPAEAAPLARALSAITAVSVAVVNLQYQGAHLPVQGFGHLVPSSEDPGVLGIVYDSVAFPEQDGSPPGLRVTVMLGGSWLQTLEASGCVLSQELFQQRAQEAAATQLGLKEMPSHCLVHLHKNCIPQYTLGHWQKLESARQFLTAHRLPLTLAGASYEGVAVNDCIESGRQAAVSVLGTEPNS,477,NP_000300.1.csv,refseq-PPOX-NM_000309.3_clinical_seed_0_final,refseq-PPOX-NM_000309.3.a2m,Invitae,refseq-PPOX-NM_000309.3.npy,1,477,477
+NP_000301.1,MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEKKIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAHIIPFLG,306,NP_000301.1.csv,refseq-PPT1-NM_000310.3_clinical_seed_0_final,refseq-PPT1-NM_000310.3.a2m,Invitae,refseq-PPT1-NM_000310.3.npy,1,306,306
+NP_000302.1,MANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYPGQGSPGGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQWNKPSKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPNQVYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYERESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG,253,NP_000302.1.csv,refseq-PRNP-NM_000311.3_clinical_seed_0_final,refseq-PRNP-NM_000311.3.a2m,Invitae,refseq-PRNP-NM_000311.3.npy,1,253,253
+NP_000303.1,MWQLTSLLLFVATWGISGTPAPLDSVFSSSERAHQVLRIRKRANSFLEELRHSSLERECIEEICDFEEAKEIFQNVDDTLAFWSKHVDGDQCLVLPLEHPCASLCCGHGTCIDGIGSFSCDCRSGWEGRFCQREVSFLNCSLDNGGCTHYCLEEVGWRRCSCAPGYKLGDDLLQCHPAVKFPCGRPWKRMEKKRSHLKRDTEDQEDQVDPRLIDGKMTRRGDSPWQVVLLDSKKKLACGAVLIHPSWVLTAAHCMDESKKLLVRLGEYDLRRWEKWELDLDIKEVFVHPNYSKSTTDNDIALLHLAQPATLSQTIVPICLPDSGLAERELNQAGQETLVTGWGYHSSREKEAKRNRTFVLNFIKIPVVPHNECSEVMSNMVSENMLCAGILGDRQDACEGDSGGPMVASFHGTWFLVGLVSWGEGCGLLHNYGVYTKVSRYLDWIHGHIRDKEAPQKSWAP,461,NP_000303.1.csv,refseq-PROC-NM_000312.3_clinical_seed_0_final,refseq-PROC-NM_000312.3.a2m,Invitae,refseq-PROC-NM_000312.3.npy,1,461,461
+NP_000306.1,MIPAKDMAKVMIVMLAICFLTKSDGKSVKKRSVSEIQLMHNLGKHLNSMERVEWLRKKLQDVHNFVALGAPLAPRDAGSQRPRKKEDNVLVESHEKSLGEADKADVNVLTKAKSQ,115,NP_000306.1.csv,refseq-PTH-NM_000315.3_clinical_seed_0_final,refseq-PTH-NM_000315.3.a2m,Invitae,refseq-PTH-NM_000315.3.npy,1,115,115
+NP_000307.1,MGTARIAPGLALLLCCPVLSSAYALVDADDVMTKEEQIFLLHRAQAQCEKRLKEVLQRPASIMESDKGWTSASTSGKPRKDKASGKLYPESEEDKEAPTGSRYRGRPCLPEWDHILCWPLGAPGEVVAVPCPDYIYDFNHKGHAYRRCDRNGSWELVPGHNRTWANYSECVKFLTNETREREVFDRLGMIYTVGYSVSLASLTVAVLILAYFRRLHCTRNYIHMHLFLSFMLRAVSIFVKDAVLYSGATLDEAERLTEEELRAIAQAPPPPATAAAGYAGCRVAVTFFLYFLATNYYWILVEGLYLHSLIFMAFFSEKKYLWGFTVFGWGLPAVFVAVWVSVRATLANTGCWDLSSGNKKWIIQVPILASIVLNFILFINIVRVLATKLRETNAGRCDTRQQYRKLLKSTLVLMPLFGVHYIVFMATPYTEVSGTLWQVQMHYEMLFNSFQGFFVAIIYCFCNGEVQAEIKKSWSRWTLALDFKRKARSGSSSYSYGPMVSHTSVTNVGPRVGLGLPLSPRLLPTATTNGHPQLPGHAKPGTPALETLETTPPAMAAPKDDGFLNGSCSGLDEEASGPERPPALLQEEWETVM,593,NP_000307.1.csv,refseq-PTH1R-NM_000316.2_clinical_seed_0_final,refseq-PTH1R-NM_000316.2.a2m,Invitae,refseq-PTH1R-NM_000316.2.npy,1,593,593
+NP_000308.1,MSTEGGGRRCQAQVSRRISFSASHRLYSKFLSDEENLKLFGKCNNPNGHGHNYKVVVTVHGEIDPATGMVMNLADLKKYMEEAIMQPLDHKNLDMDVPYFADVVSTTENVAVYIWDNLQKVLPVGVLYKVKVYETDNNIVVYKGE,145,NP_000308.1.csv,refseq-PTS-NM_000317.2_clinical_seed_0_final,refseq-PTS-NM_000317.2.a2m,Invitae,refseq-PTS-NM_000317.2.npy,1,145,145
+NP_000312.2,MPPKTPRKTAATAAAAAAEPPAPPPPPPPEEDPEQDSGPEDLPLVRLEFEETEEPDFTALCQKLKIPDHVRERAWLTWEKVSSVDGVLGGYIQKKKELWGICIFIAAVDLDEMSFTFTELQKNIEISVHKFFNLLKEIDTSTKVDNAMSRLLKKYDVLFALFSKLERTCELIYLTQPSSSISTEINSALVLKVSWITFLLAKGEVLQMEDDLVISFQLMLCVLDYFIKLSPPMLLKEPYKTAVIPINGSPRTPRRGQNRSARIAKQLENDTRIIEVLCKEHECNIDEVKNVYFKNFIPFMNSLGLVTSNGLPEVENLSKRYEEIYLKNKDLDARLFLDHDKTLQTDSIDSFETQRTPRKSNLDEEVNVIPPHTPVRTVMNTIQQLMMILNSASDQPSENLISYFNNCTVNPKESILKRVKDIGYIFKEKFAKAVGQGCVEIGSQRYKLGVRLYYRVMESMLKSEEERLSIQNFSKLLNDNIFHMSLLACALEVVMATYSRSTSQNLDSGTDLSFPWILNVLNLKAFDFYKVIESFIKAEGNLTREMIKHLERCEHRIMESLAWLSDSPLFDLIKQSKDREGPTDHLESACPLNLPLQNNHTAADMYLSPVRSPKKKGSTTRVNSTANAETQATSAFQTQKPLKSTSLSLFYKKVYRLAYLRLNTLCERLLSEHPELEHIIWTLFQHTLQNEYELMRDRHLDQIMMCSMYGICKVKNIDLKFKIIVTAYKDLPHAVQETFKRVLIKEEEYDSIIVFYNSVFMQRLKTNILQYASTRPPTLSPIPHIPRSPYKFPSSPLRIPGGNIYISPLKSPYKISEGLPTPTKMTPRSRILVSIGESFGTSEKFQKINQMVCNSDRVLKRSAEGSNPPKPLKKLRFDIEGSDEADGSKHLPGESKFQQKLAEMTSTRTRMQKQKMNDSMDTSNKEEK,928,NP_000312.2.csv,refseq-RB1-NM_000321.2_clinical_seed_0_final,refseq-RB1-NM_000321.2.a2m,Invitae,refseq-RB1-NM_000321.2.npy,1,928,928
+NP_000313.2,MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHFVPNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVALCCFLLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFEIQWISNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSYDHQTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNPEESESESQGWLLERSVPETWKAFLESVKKLGKGNQVEAEGADAGQAPEAG,346,NP_000313.2.csv,refseq-PRPH2-NM_000322.4_clinical_seed_0_final,refseq-PRPH2-NM_000322.4.a2m,Invitae,refseq-PRPH2-NM_000322.4.npy,1,346,346
+NP_000315.2,MRFTFPLMAIVLEIAMIVLFGLFVEYETDQTVLEQLNITKPTDMGIFFELYPLFQDVHVMIFVGFGFLMTFLKKYGFSSVGINLLVAALGLQWGTIVQGILQSQGQKFNIGIKNMINADFSAATVLISFGAVLGKTSPTQMLIMTILEIVFFAHNEYLVSEIFKASDIGASMTIHAFGAYFGLAVAGILYRSGLRKGHENEESAYYSDLFAMIGTLFLWMFWPSFNSAIAEPGDKQCRAIVNTYFSLAACVLTAFAFSSLVEHRGKLNMVHIQNATLAGGVAVGTCADMAIHPFGSMIIGSIAGMVSVLGYKFLTPLFTTKLRIHDTCGVHNLHGLPGVVGGLAGIVAVAMGASNTSMAMQAAALGSSIGTAVVGGLMTGLILKLPLWGQPSDQNCYDDSVYWKVPKTR,409,NP_000315.2.csv,refseq-RHAG-NM_000324.2_clinical_seed_0_final,refseq-RHAG-NM_000324.2.a2m,Invitae,refseq-RHAG-NM_000324.2.npy,1,409,409
+NP_000316.2,MNCMKGPLHLEHRAAGTKLSAVSSSSCHHPQPLAMASVLAPGQPRSLDSSKHRLEVHTISDTSSPEAAEKDKSQQGKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPYVYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV,324,NP_000316.2.csv,NP_000316.2_colabfold_clinical_seed_0_final,NP_000316.2_colabfold.a2m,colabfold,NP_000316.2_colabfold_theta_0.2.npy,1,324,324
+NP_000317.1,MSEGVGTFRMVPEEEQELRAQLEQLTTKDHGPVFGPCSQLPRHTLQKAKDELNEREETREEAVRELQEMVQAQAASGEELAVAVAERVQEKDSGFFLRFIRARKFNVGRAYELLRGYVNFRLQYPELFDSLSPEAVRCTIEAGYPGVLSSRDKYGRVVMLFNIENWQSQEITFDEILQAYCFILEKLLENEETQINGFCIIENFKGFTMQQAASLRTSDLRKMVDMLQDSFPARFKAIHFIHQPWYFTTTYNVVKPFLKSKLLERVFVHGDDLSGFYQEIDENILPSDFGGTLPKYDGKAVAEQLFGPQAQAENTAF,317,NP_000317.1.csv,refseq-RLBP1-NM_000326.4_clinical_seed_0_final,refseq-RLBP1-NM_000326.4.a2m,Invitae,refseq-RLBP1-NM_000326.4.npy,1,317,317
+NP_000320.1,MSIQVEHPAGGYKKLFETVEELSSPLTAHVTGRIPLWLTGSLLRCGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRRFIRTDAYVRAMTEKRIVITEFGTCAFPDPCKNIFSRFFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETLETIKQVDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQADKEDPISKSEIVVQFPCSDRFKPSYVHSFGLTPNYIVFVETPVKINLFKFLSSWSLWGANYMDCFESNETMGVWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKGFEFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDKADTGKNLVTLPNTTATAILCSDETIWLEPEVLFSGPRQAFEFPQINYQKYCGKPYTYAYGLGLNHFVPDRLCKLNVKTKETWVWQEPDSYPSEPIFVSHPDALEEDDGVVLSVVVSPGAGQKPAYLLILNAKDLSEVARAEVEINIPVTFHGLFKKS,533,NP_000320.1.csv,refseq-RPE65-NM_000329.2_clinical_seed_0_final,refseq-RPE65-NM_000329.2.a2m,Invitae,refseq-RPE65-NM_000329.2.npy,1,533,533
+NP_000321.1,MSRKIEGFLLLLLFGYEATLGLSSTEDEGEDPWYQKACKCDCQGGPNALWSAGATSLDCIPECPYHKPLGFESGEVTPDQITCSNPEQYVGWYSSWTANKARLNSQGFGCAWLSKFQDSSQWLQIDLKEIKVISGILTQGRCDIDEWMTKYSVQYRTDERLNWIYYKDQTGNNRVFYGNSDRTSTVQNLLRPPIISRFIRLIPLGWHVRIAIRMELLECVSKCA,224,NP_000321.1.csv,refseq-RS1-NM_000330.3_clinical_seed_0_final,refseq-RS1-NM_000330.3.a2m,Invitae,refseq-RS1-NM_000330.3.npy,1,224,224
+NP_000325.4,MARPSLCTLVPLGPECLRPFTRESLAAIEQRAVEEEARLQRNKQMEIEEPERKPRSDLEAGKNLPMIYGDPPPEVIGIPLEDLDPYYSNKKTFIVLNKGKAIFRFSATPALYLLSPFSVVRRGAIKVLIHALFSMFIMITILTNCVFMTMSDPPPWSKNVEYTFTGIYTFESLIKILARGFCVDDFTFLRDPWNWLDFSVIMMAYLTEFVDLGNISALRTFRVLRALKTITVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALVGLQLFMGNLRQKCVRWPPPFNDTNTTWYSNDTWYGNDTWYGNEMWYGNDSWYANDTWNSHASWATNDTFDWDAYISDEGNFYFLEGSNDALLCGNSSDAGHCPEGYECIKTGRNPNYGYTSYDTFSWAFLALFRLMTQDYWENLFQLTLRAAGKTYMIFFVVIIFLGSFYLINLILAVVAMAYAEQNEATLAEDKEKEEEFQQMLEKFKKHQEELEKAKAAQALEGGEADGDPAHGKDCNGSLDTSQGEKGAPRQSSSGDSGISDAMEELEEAHQKCPPWWYKCAHKVLIWNCCAPWLKFKNIIHLIVMDPFVDLGITICIVLNTLFMAMEHYPMTEHFDNVLTVGNLVFTGIFTAEMVLKLIAMDPYEYFQQGWNIFDSIIVTLSLVELGLANVQGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKIALDCNLPRWHMHDFFHSFLIVFRILCGEWIETMWDCMEVAGQAMCLTVFLMVMVIGNLVVLNLFLALLLSSFSADSLAASDEDGEMNNLQIAIGRIKLGIGFAKAFLLGLLHGKILSPKDIMLSLGEADGAGEAGEAGETAPEDEKKEPPEEDLKKDNHILNHMGLADGPPSSLELDHLNFINNPYLTIQVPIASEESDLEMPTEEETDTFSEPEDSKKPPQPLYDGNSSVCSTADYKPPEEDPEEQAEENPEGEQPEECFTEACVQRWPCLYVDISQGRGKKWWTLRRACFKIVEHNWFETFIVFMILLSSGALAFEDIYIEQRRVIRTILEYADKVFTYIFIMEMLLKWVAYGFKVYFTNAWCWLDFLIVDVSIISLVANWLGYSELGPIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYYCINTTTSERFDISEVNNKSECESLMHTGQVRWLNVKVNYDNVGLGYLSLLQVATFKGWMDIMYAAVDSREKEEQPQYEVNLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGKDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPQNKIQGMVYDLVTKQAFDITIMILICLNMVTMMVETDNQSQLKVDILYNINMIFIIIFTGECVLKMLALRQYYFTVGWNIFDFVVVILSIVGLALSDLIQKYFVSPTLFRVIRLARIGRVLRLIRGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMSNFAYVKKESGIDDMFNFETFGNSIICLFEITTSAGWDGLLNPILNSGPPDCDPNLENPGTSVKGDCGNPSIGICFFCSYIIISFLIVVNMYIAIILENFNVATEESSEPLGEDDFEMFYETWEKFDPDATQFIAYSRLSDFVDTLQEPLRIAKPNKIKLITLDLPMVPGDKIHCLDILFALTKEVLGDSGEMDALKQTMEEKFMAANPSKVSYEPITTTLKRKHEEVCAIKIQRAYRRHLLQRSMKQASYMYRHSHDGSGDDAPEKEGLLANTMSKMYGHENGNSSSPSPEEKGEAGDAGPTMGLMPISPSDTAWPPAPPPGQTVRPGVKESLV,1836,NP_000325.4.csv,refseq-SCN4A-NM_000334.4_clinical_seed_0_final,refseq-SCN4A-NM_000334.4.a2m,Invitae,refseq-SCN4A-NM_000334.4.npy,1,1836,1836
+NP_000327.2,MHVKKYLLKGLHRLQKGPGYTYKELLVWYCDNTNTHGPKRIICEGPKKKAMWFLLTLLFAALVCWQWGIFIRTYLSWEVSVSLSVGFKTMDFPAVTICNASPFKYSKIKHLLKDLDELMEAVLERILAPELSHANATRNLNFSIWNHTPLVLIDERNPHHPMVLDLFGDNHNGLTSSSASEKICNAHGCKMAMRLCSLNRTQCTFRNFTSATQALTEWYILQATNIFAQVPQQELVEMSYPGEQMILACLFGAEPCNYRNFTSIFYPHYGNCYIFNWGMTEKALPSANPGTEFGLKLILDIGQEDYVPFLASTAGVRLMLHEQRSYPFIRDEGIYAMSGTETSIGVLVDKLQRMGEPYSPCTVNGSEVPVQNFYSDYNTTYSIQACLRSCFQDHMIRNCNCGHYLYPLPRGEKYCNNRDFPDWAHCYSDLQMSVAQRETCIGMCKESCNDTQYKMTISMADWPSEASEDWIFHVLSQERDQSTNITLSRKGIVKLNIYFQEFNYRTIEESAANNIVWLLSNLGGQFGFWMGGSVLCLIEFGEIIIDFVWITIIKLVALAKSLRQRRAQASYAGPPPTVAELVEAHTNFGFQPDTAPRSPNTGPYPSEQALPIPGTPPPNYDSLRLQPLDVIESDSEGDAI,640,NP_000327.2.csv,refseq-SCNN1B-NM_000336.2_clinical_seed_0_final,refseq-SCNN1B-NM_000336.2.a2m,Invitae,refseq-SCNN1B-NM_000336.2.npy,1,640,640
+NP_000329.2,MSLNNSSNVFLDSVPSNTNRFQVSVINENHESSAAADDNTDPPHYEETSFGDEAQKRLRISFRPGNQECYDNFLQSGETAKTDASFHAYDSHTNTYYLQTFGHNTMDAVPKIEYYRNTGSISGPKVNRPSLLEIHEQLAKNVAVTPSSADRVANGDGIPGDEQAENKEDDQAGVVKFGWVKGVLVRCMLNIWGVMLFIRLSWIVGEAGIGLGVLIILLSTMVTSITGLSTSAIATNGFVRGGGAYYLISRSLGPEFGGSIGLIFAFANAVAVAMYVVGFAETVVDLLKESDSMMVDPTNDIRIIGSITVVILLGISVAGMEWEAKAQVILLVILLIAIANFFIGTVIPSNNEKKSRGFFNYQASIFAENFGPRFTKGEGFFSVFAIFFPAATGILAGANISGDLEDPQDAIPRGTMLAIFITTVAYLGVAICVGACVVRDATGNMNDTIISGMNCNGSAACGLGYDFSRCRHEPCQYGLMNNFQVMSMVSGFGPLITAGIFSATLSSALASLVSAPKVFQALCKDNIYKALQFFAKGYGKNNEPLRGYILTFLIAMAFILIAELNTIAPIISNFFLASYALINFSCFHASYAKSPGWRPAYGIYNMWVSLFGAVLCCAVMFVINWWAAVITYVIEFFLYVYVTCKKPDVNWGSSTQALSYVSALDNALELTTVEDHVKNFRPQCIVLTGGPMTRPALLDITHAFTKNSGLCICCEVFVGPRKLCVKEMNSGMAKKQAWLIKNKIKAFYAAVAADCFRDGVRSLLQASGLGRMKPNTLVIGYKKNWRKAPLTEIENYVGIIHDAFDFEIGVVIVRISQGFDISQVLQVQEELERLEQERLALEATIKDNECEEESGGIRGLFKKAGKLNITKTTPKKDGSINTSQSMHVGEFNQKLVEASTQFKKKQEKGTIDVWWLFDDGGLTLLIPYILTLRKKWKDCKLRIYVGGKINRIEEEKIVMASLLSKFRIKFADIHIIGDINIRPNKESWKVFEEMIEPYRLHESCKDLTTAEKLKRETPWKITDAELEAVKEKSYRQVRLNELLQEHSRAANLIVLSLPVARKGSISDLLYMAWLEILTKNLPPVLLVRGNHKNVLTFYS,1099,NP_000329.2.csv,refseq-SLC12A1-NM_000338.2_clinical_seed_0_final,refseq-SLC12A1-NM_000338.2.a2m,Invitae,refseq-SLC12A1-NM_000338.2.npy,1,1099,1099
+NP_000331.1,MTEDKVTGTLVFTVITAVLGSFQFGYDIGVINAPQQVIISHYRHVLGVPLDDRKAINNYVINSTDELPTISYSMNPKPTPWAEEETVAAAQLITMLWSLSVSSFAVGGMTASFFGGWLGDTLGRIKAMLVANILSLVGALLMGFSKLGPSHILIIAGRSISGLYCGLISGLVPMYIGEIAPTALRGALGTFHQLAIVTGILISQIIGLEFILGNYDLWHILLGLSGVRAILQSLLLFFCPESPRYLYIKLDEEVKAKQSLKRLRGYDDVTKDINEMRKEREEASSEQKVSIIQLFTNSSYRQPILVALMLHVAQQFSGINGIFYYSTSIFQTAGISKPVYATIGVGAVNMVFTAVSVFLVEKAGRRSLFLIGMSGMFVCAIFMSVGLVLLNKFSWMSYVSMIAIFLFVSFFEIGPGPIPWFMVAEFFSQGPRPAALAIAAFSNWTCNFIVALCFQYIADFCGPYVFFLFAGVLLAFTLFTFFKVPETKGKSFEEIAAEFQKKSGSAHRPKAAVEMKFLGATETV,524,NP_000331.1.csv,refseq-SLC2A2-NM_000340.1_clinical_seed_0_final,refseq-SLC2A2-NM_000340.1.a2m,Invitae,refseq-SLC2A2-NM_000340.1.npy,1,524,524
+NP_000332.2,MAEDKSKRDSIEMSMKGCQTNNGFVHNEDILEQTPDPGSSTDNLKHSTRGILGSQEPDFKGVQPYAGMPKEVLFQFSGQARYRIPREILFWLTVASVLVLIAATIAIIALSPKCLDWWQEGPMYQIYPRSFKDSNKDGNGDLKGIQDKLDYITALNIKTVWITSFYKSSLKDFRYGVEDFREVDPIFGTMEDFENLVAAIHDKGLKLIIDFIPNHTSDKHIWFQLSRTRTGKYTDYYIWHDCTHENGKTIPPNNWLSVYGNSSWHFDEVRNQCYFHQFMKEQPDLNFRNPDVQEEIKEILRFWLTKGVDGFSLDAVKFLLEAKHLRDEIQVNKTQIPDTVTQYSELYHDFTTTQVGMHDIVRSFRQTMDQYSTEPGRYRFMGTEAYAESIDRTVMYYGLPFIQEADFPFNNYLSMLDTVSGNSVYEVITSWMENMPEGKWPNWMIGGPDSSRLTSRLGNQYVNVMNMLLFTLPGTPITYYGEEIGMGNIVAANLNESYDINTLRSKSPMQWDNSSNAGFSEASNTWLPTNSDYHTVNVDVQKTQPRSALKLYQDLSLLHANELLLNRGWFCHLRNDSHYVVYTRELDGIDRIFIVVLNFGESTLLNLHNMISGLPAKMRIRLSTNSADKGSKVDTSGIFLDKGEGLIFEHNTKNLLHRQTAFRDRCFVSNRACYSSVLNILYTSC,685,NP_000332.2.csv,refseq-SLC3A1-NM_000341.3_clinical_seed_0_final,refseq-SLC3A1-NM_000341.3.a2m,Invitae,refseq-SLC3A1-NM_000341.3.npy,1,685,685
+NP_000333.1,MEELQDDYEDMMEENLEQEEYEDPDIPESQMEEPAAHDTEATATDYHTTSHPGTHKVYVELQELVMDEKNQELRWMEAARWVQLEENLGENGAWGRPHLSHLTFWSLLELRRVFTKGTVLLDLQETSLAGVANQLLDRFIFEDQIRPQDREELLRALLLKHSHAGELEALGGVKPAVLTRSGDPSQPLLPQHSSLETQLFCEQGDGGTEGHSPSGILEKIPPDSEATLVLVGRADFLEQPVLGFVRLQEAAELEAVELPVPIRFLFVLLGPEAPHIDYTQLGRAAATLMSERVFRIDAYMAQSRGELLHSLEGFLDCSLVLPPTDAPSEQALLSLVPVQRELLRRRYQSSPAKPDSSFYKGLDLNGGPDDPLQQTGQLFGGLVRDIRRRYPYYLSDITDAFSPQVLAAVIFIYFAALSPAITFGGLLGEKTRNQMGVSELLISTAVQGILFALLGAQPLLVVGFSGPLLVFEEAFFSFCETNGLEYIVGRVWIGFWLILLVVLVVAFEGSFLVRFISRYTQEIFSFLISLIFIYETFSKLIKIFQDHPLQKTYNYNVLMVPKPQGPLPNTALLSLVLMAGTFFFAMMLRKFKNSSYFPGKLRRVIGDFGVPISILIMVLVDFFIQDTYTQKLSVPDGFKVSNSSARGWVIHPLGLRSEFPIWMMFASALPALLVFILIFLESQITTLIVSKPERKMVKGSGFHLDLLLVVGMGGVAALFGMPWLSATTVRSVTHANALTVMGKASTPGAAAQIQEVKEQRISGLLVAVLVGLSILMEPILSRIPLAVLFGIFLYMGVTSLSGIQLFDRILLLFKPPKYHPDVPYVKRVKTWRMHLFTGIQIICLAVLWVVKSTPASLALPFVLILTVPLRRVLLPLIFRNVELQCLDADDAKATFDEEEGRDEYDEVAMPV,911,NP_000333.1.csv,refseq-SLC4A1-NM_000342.3_clinical_seed_0_final,refseq-SLC4A1-NM_000342.3.a2m,Invitae,refseq-SLC4A1-NM_000342.3.npy,1,911,911
+NP_000334.1,MDSSTWSPKTTAVTRPVETHELIRNAADISIIVIYFVVVMAVGLWAMFSTNRGTVGGFFLAGRSMVWWPIGASLFASNIGSGHFVGLAGTGAASGIAIGGFEWNALVLVVVLGWLFVPIYIKAGVVTMPEYLRKRFGGQRIQVYLSLLSLLLYIFTKISADIFSGAIFINLALGLNLYLAIFLLLAITALYTITGGLAAVIYTDTLQTVIMLVGSLILTGFAFHEVGGYDAFMEKYMKAIPTIVSDGNTTFQEKCYTPRADSFHIFRDPLTGDLPWPGFIFGMSILTLWYWCTDQVIVQRCLSAKNMSHVKGGCILCGYLKLMPMFIMVMPGMISRILYTEKIACVVPSECEKYCGTKVGCTNIAYPTLVVELMPNGLRGLMLSVMLASLMSSLTSIFNSASTLFTMDIYAKVRKRASEKELMIAGRLFILVLIGISIAWVPIVQSAQSGQLFDYIQSITSYLGPPIAAVFLLAIFWKRVNEPGAFWGLILGLLIGISRMITEFAYGTGSCMEPSNCPTIICGVHYLYFAIILFAISFITIVVISLLTKPIPDVHLYRLCWSLRNSKEERIDLDAEEENIQEGPKETIEIETQVPEKKKGIFRRAYDLFCGLEQHGAPKMTEEEEKAMKMKMTDTSEKPLWRTVLNVNGIILVTVAVFCHAYFA,664,NP_000334.1.csv,refseq-SLC5A1-NM_000343.3_clinical_seed_0_final,refseq-SLC5A1-NM_000343.3.a2m,Invitae,refseq-SLC5A1-NM_000343.3.npy,1,664,664
+NP_000336.1,MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKTKEGVVHGVATVAEKTKEQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGKNEEGAPQEGILEDMPVDPDNEAYEMPSEEGYQDYEPEA,140,NP_000336.1.csv,refseq-SNCA-NM_000345.3_clinical_seed_0_final,refseq-SNCA-NM_000345.3.a2m,Invitae,refseq-SNCA-NM_000345.3.npy,1,140,140
+NP_000337.1,MNLLDPFMKMTDEQEKGLSGAPSPTMSEDSAGSPCPSGSGSDTENTRPQENTFPKGEPDLKKESEEDKFPVCIREAVSQVLKGYDWTLVPMPVRVNGSSKNKPHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSVKNGQAEAEEATEQTHISPNAIFKALQADSPHSSSGMSEVHSPGEHSGQSQGPPTPPTTPKTDVQPGKADLKREGRPLPEGGRQPPIDFRDVDIGELSSDVISNIETFDVNEFDQYLPPNGHPGVPATHGQVTYTGSYGISSTAATPASAGHVWMSKQQAPPPPPQQPPQAPPAPQAPPQPQAAPPQQPAAPPQQPQAHTLTTLSSEPGQSQRTHIKTEQLSPSHYSEQQQHSPQQIAYSPFNLPHYSPSYPPITRSQYDYTDHQNSSSYYSHAAGQGTGLYSTFTYMNPAQRPMYTPIADTSGVPSIPQTHSPQHWEQPVYTQLTRP,509,NP_000337.1.csv,refseq-SOX9-NM_000346.3_clinical_seed_0_final,refseq-SOX9-NM_000346.3.a2m,Invitae,refseq-SOX9-NM_000346.3.npy,1,509,509
+NP_000340.2,MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC,285,NP_000340.2.csv,STAR_HUMAN_b03_clinical_seed_0_final,STAR_HUMAN_b03.a2m,EVE,STAR_HUMAN_b03_theta_0.2.npy,1,285,285
+NP_000341.2,MGFVRQIQLLLWKNWTLRKRQKIRFVVELVWPLSLFLVLIWLRNANPLYSHHECHFPNKAMPSAGMLPWLQGIFCNVNNPCFQSPTPGESPGIVSNYNNSILARVYRDFQELLMNAPESQHLGRIWTELHILSQFMDTLRTHPERIAGRGIRIRDILKDEETLTLFLIKNIGLSDSVVYLLINSQVRPEQFAHGVPDLALKDIACSEALLERFIIFSQRRGAKTVRYALCSLSQGTLQWIEDTLYANVDFFKLFRVLPTLLDSRSQGINLRSWGGILSDMSPRIQEFIHRPSMQDLLWVTRPLMQNGGPETFTKLMGILSDLLCGYPEGGGSRVLSFNWYEDNNYKAFLGIDSTRKDPIYSYDRRTTSFCNALIQSLESNPLTKIAWRAAKPLLMGKILYTPDSPAARRILKNANSTFEELEHVRKLVKAWEEVGPQIWYFFDNSTQMNMIRDTLGNPTVKDFLNRQLGEEGITAEAILNFLYKGPRESQADDMANFDWRDIFNITDRTLRLVNQYLECLVLDKFESYNDETQLTQRALSLLEENMFWAGVVFPDMYPWTSSLPPHVKYKIRMDIDVVEKTNKIKDRYWDSGPRADPVEDFRYIWGGFAYLQDMVEQGITRSQVQAEAPVGIYLQQMPYPCFVDDSFMIILNRCFPIFMVLAWIYSVSMTVKSIVLEKELRLKETLKNQGVSNAVIWCTWFLDSFSIMSMSIFLLTIFIMHGRILHYSDPFILFLFLLAFSTATIMLCFLLSTFFSKASLAAACSGVIYFTLYLPHILCFAWQDRMTAELKKAVSLLSPVAFGFGTEYLVRFEEQGLGLQWSNIGNSPTEGDEFSFLLSMQMMLLDAAVYGLLAWYLDQVFPGDYGTPLPWYFLLQESYWLGGEGCSTREERALEKTEPLTEETEDPEHPEGIHDSFFEREHPGWVPGVCVKNLVKIFEPCGRPAVDRLNITFYENQITAFLGHNGAGKTTTLSILTGLLPPTSGTVLVGGRDIETSLDAVRQSLGMCPQHNILFHHLTVAEHMLFYAQLKGKSQEEAQLEMEAMLEDTGLHHKRNEEAQDLSGGMQRKLSVAIAFVGDAKVVILDEPTSGVDPYSRRSIWDLLLKYRSGRTIIMSTHHMDEADLLGDRIAIIAQGRLYCSGTPLFLKNCFGTGLYLTLVRKMKNIQSQRKGSEGTCSCSSKGFSTTCPAHVDDLTPEQVLDGDVNELMDVVLHHVPEAKLVECIGQELIFLLPNKNFKHRAYASLFRELEETLADLGLSSFGISDTPLEEIFLKVTEDSDSGPLFAGGAQQKRENVNPRHPCLGPREKAGQTPQDSNVCSPGAPAAHPEGQPPPEPECPGPQLNTGTQLVLQHVQALLVKRFQHTIRSHKDFLAQIVLPATFVFLALMLSIVIPPFGEYPALTLHPWIYGQQYTFFSMDEPGSEQFTVLADVLLNKPGFGNRCLKEGWLPEYPCGNSTPWKTPSVSPNITQLFQKQKWTQVNPSPSCRCSTREKLTMLPECPEGAGGLPPPQRTQRSTEILQDLTDRNISDFLVKTYPALIRSSLKSKFWVNEQRYGGISIGGKLPVVPITGEALVGFLSDLGRIMNVSGGPITREASKEIPDFLKHLETEDNIKVWFNNKGWHALVSFLNVAHNAILRASLPKDRSPEEYGITVISQPLNLTKEQLSEITVLTTSVDAVVAICVIFSMSFVPASFVLYLIQERVNKSKHLQFISGVSPTTYWVTNFLWDIMNYSVSAGLVVGIFIGFQKKAYTSPENLPALVALLLLYGWAVIPMMYPASFLFDVPSTAYVALSCANLFIGINSSAITFILELFENNRTLLRFNAVLRKLLIVFPHFCLGRGLIDLALSQAVTDVYARFGEEHSANPFHWDLIGKNLFAMVVEGVVYFLLTLLVQRHFFLSQWIAEPTKEPIVDEDDDVAEERQRIITGGNKTDILRLHELTKIYPGTSSPAVDRLCVGVRPGECFGLLGVNGAGKTTTFKMLTGDTTVTSGDATVAGKSILTNISEVHQNMGYCPQFDAIDELLTGREHLYLYARLRGVPAEEIEKVANWSIKSLGLTVYADCLAGTYSGGNKRKLSTAIALIGCPPLVLLDEPTTGMDPQARRMLWNVIVSIIREGRAVVLTSHSMEECEALCTRLAIMVKGAFRCMGTIQHLKSKFGDGYIVTMKIKSPKDDLLPDLNPVEQFFQGNFPGSVQRERHYNMLQFQVSSSSLARIFQLLLSHKDSLLIEEYSVTQTTLDQVFVNFAKQQTESHDLPLHPRAAGASRQAQD,2273,NP_000341.2.csv,refseq-ABCA4-NM_000350.3_clinical_seed_0_final,refseq-ABCA4-NM_000350.3.a2m,Invitae,refseq-ABCA4-NM_000350.3_theta_0.2.npy,1,2273,2273
+NP_000343.2,MPLAFCGSENHSAAYRVDQGVLNNGCFVDALNVVPHVFLLFITFPILFIGWGSQSSKVHIHHSTWLHFPGHNLRWILTFMLLFVLVCEIAEGILSDGVTESHHLHLYMPAGMAFMAAVTSVVYYHNIETSNFPKLLIALLVYWTLAFITKTIKFVKFLDHAIGFSQLRFCLTGLLVILYGMLLLVEVNVIRVRRYIFFKTPREVKPPEDLQDLGVRFLQPFVNLLSKGTYWWMNAFIKTAHKKPIDLRAIGKLPIAMRALTNYQRLCEAFDAQVRKDIQGTQGARAIWQALSHAFGRRLVLSSTFRILADLLGFAGPLCIFGIVDHLGKENDVFQPKTQFLGVYFVSSQEFLANAYVLAVLLFLALLLQRTFLQASYYVAIETGINLRGAIQTKIYNKIMHLSTSNLSMGEMTAGQICNLVAIDTNQLMWFFFLCPNLWAMPVQIIVGVILLYYILGVSALIGAAVIILLAPVQYFVATKLSQAQRSTLEYSNERLKQTNEMLRGIKLLKLYAWENIFRTRVETTRRKEMTSLRAFAIYTSISIFMNTAIPIAAVLITFVGHVSFFKEADFSPSVAFASLSLFHILVTPLFLLSSVVRSTVKALVSVQKLSEFLSSAEIREEQCAPHEPTPQGPASKYQAVPLRVVNRKRPAREDCRGLTGPLQSLVPSADGDADNCCVQIMGGYFTWTPDGIPTLSNITIRIPRGQLTMIVGQVGCGKSSLLLAALGEMQKVSGAVFWSSLPDSEIGEDPSPERETATDLDIRKRGPVAYASQKPWLLNATVEENIIFESPFNKQRYKMVIEACSLQPDIDILPHGDQTQIGERGINLSGGQRQRISVARALYQHANVVFLDDPFSALDIHLSDHLMQAGILELLRDDKRTVVLVTHKLQYLPHADWIIAMKDGTIQREGTLKDFQRSECQLFEHWKTLMNRQDQELEKETVTERKATEPPQGLSRAMSSRDGLLQDEEEEEEEAAESEEDDNLSSMLHQRAEIPWRACAKYLSSAGILLLSLLVFSQLLKHMVLVAIDYWLAKWTDSALTLTPAARNCSLSQECTLDQTVYAMVFTVLCSLGIVLCLVTSVTVEWTGLKVAKRLHRSLLNRIILAPMRFFETTPLGSILNRFSSDCNTIDQHIPSTLECLSRSTLLCVSALAVISYVTPVFLVALLPLAIVCYFIQKYFRVASRDLQQLDDTTQLPLLSHFAETVEGLTTIRAFRYEARFQQKLLEYTDSNNIASLFLTAANRWLEVRMEYIGACVVLIAAVTSISNSLHRELSAGLVGLGLTYALMVSNYLNWMVRNLADMELQLGAVKRIHGLLKTEAESYEGLLAPSLIPKNWPDQGKIQIQNLSVRYDSSLKPVLKHVNALIAPGQKIGICGRTGSGKSSFSLAFFRMVDTFEGHIIIDGIDIAKLPLHTLRSRLSIILQDPVLFSGTIRFNLDPERKCSDSTLWEALEIAQLKLVVKALPGGLDAIITEGGENFSQGQRQLFCLARAFVRKTSIFIMDEATASIDMATENILQKVVMTAFADRTVVTIAHRVHTILSADLVIVLKRGAILEFDKPEKLLSRKDSVFASFVRADK,1581,NP_000343.2.csv,refseq-ABCC8-NM_000352.4_clinical_seed_0_final,refseq-ABCC8-NM_000352.4.a2m,Invitae,refseq-ABCC8-NM_000352.4.npy,1,1581,1581
+NP_000344.1,MDPYMIQMSSKGNLPSILDVHVNVGGRSSVPGKMKGRKARWSVRPSDMAKKTFNPIRAIVDNMKVKPNPNKTMISLSIGDPTVFGNLPTDPEVTQAMKDALDSGKYNGYAPSIGFLSSREEIASYYHCPEAPLEAKDVILTSGCSQAIDLCLAVLANPGQNILVPRPGFSLYKTLAESMGIEVKLYNLLPEKSWEIDLKQLEYLIDEKTACLIVNNPSNPCGSVFSKRHLQKILAVAARQCVPILADEIYGDMVFSDCKYEPLATLSTDVPILSCGGLAKRWLVPGWRLGWILIHDRRDIFGNEIRDGLVKLSQRILGPCTIVQGALKSILCRTPGEFYHNTLSFLKSNADLCYGALAAIPGLRPVRPSGAMYLMVGIEMEHFPEFENDVEFTERLVAEQSVHCLPATCFEYPNFIRVVITVPEVMMLEACSRIQEFCEQHYHCAEGSQEECDK,454,NP_000344.1.csv,refseq-TAT-NM_000353.2_clinical_seed_0_final,refseq-TAT-NM_000353.2.a2m,Invitae,refseq-TAT-NM_000353.2_theta_0.2.npy,1,454,454
+NP_000349.1,MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQHGPNVCAVQKVIGTNRKYFTNCKQWYQRKICGKSTVISYECCPGYEKVPGEKGCPAALPLSNLYETLGVVGSTTTQLYTDRTEKLRPEMEGPGSFTIFAPSNEAWASLPAEVLDSLVSNVNIELLNALRYHMVGRRVLTDELKHGMTLTSMYQNSNIQIHHYPNGIVTVNCARLLKADHHATNGVVHLIDKVISTITNNIQQIIEIEDTFETLRAAVAASGLNTMLEGNGQYTLLAPTNEAFEKIPSETLNRILGDPEALRDLLNNHILKSAMCAEAIVAGLSVETLEGTTLEVGCSGDMLTINGKAIISNKDILATNGVIHYIDELLIPDSAKTLFELAAESDVSTAIDLFRQAGLGNHLSGSERLTLLAPLNSVFKDGTPPIDAHTRNLLRNHIIKDQLASKYLYHGQTLETLGGKKLRVFVYRNSLCIENSCIAAHDKRGRYGTLFTMDRVLTPPMGTVMDVLKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRERSRLLGDAKELANILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVAEPDIMATNGVVHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRLAPVYQKLLERMKH,683,NP_000349.1.csv,refseq-TGFBI-NM_000358.2_clinical_seed_0_final,refseq-TGFBI-NM_000358.2.a2m,Invitae,refseq-TGFBI-NM_000358.2.npy,1,683,683
+NP_000350.1,MMDGPRSDVGRWGGNPLQPPTTPSPEPEPEPDGRSRRGGGRSFWARCCGCCSCRNAADDDWGPEPSDSRGRGSSSGTRRPGSRGSDSRRPVSRGSGVNAAGDGTIREGMLVVNGVDLLSSRSDQNRREHHTDEYEYDELIVRRGQPFHMLLLLSRTYESSDRITLELLIGNNPEVGKGTHVIIPVGKGGSGGWKAQVVKASGQNLNLRVHTSPNAIIGKFQFTVRTQSDAGEFQLPFDPRNEIYILFNPWCPEDIVYVDHEDWRQEYVLNESGRIYYGTEAQIGERTWNYGQFDHGVLDACLYILDRRGMPYGGRGDPVNVSRVISAMVNSLDDNGVLIGNWSGDYSRGTNPSAWVGSVEILLSYLRTGYSVPYGQCWVFAGVTTTVLRCLGLATRTVTNFNSAHDTDTSLTMDIYFDENMKPLEHLNHDSVWNFHVWNDCWMKRPDLPSGFDGWQVVDATPQETSSGIFCCGPCSVESIKNGLVYMKYDTPFIFAEVNSDKVYWQRQDDGSFKIVYVEEKAIGTLIVTKAISSNMREDITYLYKHPEGSDAERKAVETAAAHGSKPNVYANRGSAEDVAMQVEAQDAVMGQDLMVSVMLINHSSSRRTVKLHLYLSVTFYTGVSGTIFKETKKEVELAPGASDRVTMPVAYKEYRPHLVDQGAMLLNVSGHVKESGQVLAKQHTFRLRTPDLSLTLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSGLQRPKILNVGDIGGNETVTLRQSFVPVRPGPRQLIASLDSPQLSQVHGVIQVDVAPAPGDGGFFSDAGGDSHLGETIPMASRGGA,817,NP_000350.1.csv,refseq-TGM1-NM_000359.3_clinical_seed_0_final,refseq-TGM1-NM_000359.3.a2m,Invitae,refseq-TGM1-NM_000359.3.npy,1,817,817
+NP_000353.1,MTPWLGLIVLLGSWSLGDWGAEACTCSPSHPQDAFCNSDIVIRAKVVGKKLVKEGPFGTLVYTIKQMKMYRGFTKMPHVQYIHTEASESLCGLKLEVNKYQYLLTGRVYDGKMYTGLCNFVERWDQLTLSQRKGLNYRYHLGCNCKIKSCYYLPCFVTSKNECLWTDMLSNFGYPGYQSKHYACIRQKGGYCSWYRGWAPPDKSIINATDP,211,NP_000353.1.csv,refseq-TIMP3-NM_000362.4_clinical_seed_0_final,refseq-TIMP3-NM_000362.4.a2m,Invitae,refseq-TIMP3-NM_000362.4.npy,1,211,211
+NP_000354.4,MADGSSDAAREPRPAPAPIRRRSSNYRAYATEPHAKKKSKISASRKLQLKTLLLQIAKQELEREAEERRGEKGRALSTRCQPLELAGLGFAELQDLCRQLHARVDKVDEERYDIEAKVTKNITEIADLTQKIFDLRGKFKRPTLRRVRISADAMMQALLGARAKESLDLRAHLKQVKKEDTEKENREVGDWRKNIDALSGMEGRKKKFES,210,NP_000354.4.csv,refseq-TNNI3-NM_000363.4_clinical_seed_0_final,refseq-TNNI3-NM_000363.4.a2m,Invitae,refseq-TNNI3-NM_000363.4.npy,1,210,210
+NP_000359.1,MAQQANVGELLAMLDSPMLGVRDDVTAVFKENLNSDRGPMLVNTLVDYYLETSSQPALHILTTLQEPHDKHLLDRINEYVGKAATRLSILSLLGHVIRLQPSWKHKLSQAPLLPSLLKCLKMDTDVVVLTTGVLVLITMLPMIPQSGKQHLLDFFDIFGRLSSWCLKKPGHVAEVYLVHLHASVYALFHRLYGMYPCNFVSFLRSHYSMKENLETFEEVVKPMMEHVRIHPELVTGSKDHELDPRRWKRLETHDVVIECAKISLDPTEASYEDGYSVSHQISARFPHRSADVTTSPYADTQNSYGCATSTPYSTSRLMLLNMPGQLPQTLSSPSTRLITEPPQATLWSPSMVCGMTTPPTSPGNVPPDLSHPYSKVFGTTAGGKGTPLGTPATSPPPAPLCHSDDYVHISLPQATVTPPRKEERMDSARPCLHRQHHLLNDRGSEEPPGSKGSVTLSDLPGFLGDLASEEDSIEKDKEEAAISRELSEITTAEAEPVVPRGGFDSPFYRDSLPGSQRKTHSAASSSQGASVNPEPLHSSLDKLGPDTPKQAFTPIDLPCGSADESPAGDRECQTSLETSIFTPSPCKIPPPTRVGFGSGQPPPYDHLFEVALPKTAHHFVIRKTEELLKKAKGNTEEDGVPSTSPMEVLDRLIQQGADAHSKELNKLPLPSKSVDWTHFGGSPPSDEIRTLRDQLLLLHNQLLYERFKRQQHALRNRRLLRKVIKAAALEEHNAAMKDQLKLQEKDIQMWKVSLQKEQARYNQLQEQRDTMVTKLHSQIRQLQHDREEFYNQSQELQTKLEDCRNMIAELRIELKKANNKVCHTELLLSQVSQKLSNSESVQQQMEFLNRQLLVLGEVNELYLEQLQNKHSDTTKEVEMMKAAYRKELEKNRSHVLQQTQRLDTSQKRILELESHLAKKDHLLLEQKKYLEDVKLQARGQLQAAESRYEAQKRITQVFELEILDLYGRLEKDGLLKKLEEEKAEAAEAAEERLDCCNDGCSDSMVGHNEEASGHNGETKTPRPSSARGSSGSRGGGGSSSSSSELSTPEKPPHQRAGPFSSRWETTMGEASASIPTTVGSLPSSKSFLGMKARELFRNKSESQCDEDGMTSSLSESLKTELGKDLGVEAKIPLNLDGPHPSPPTPDSVGQLHIMDYNETHHEHS,1164,NP_000359.1.csv,refseq-TSC1-NM_000368.4_clinical_seed_0_final,refseq-TSC1-NM_000368.4.a2m,Invitae,refseq-TSC1-NM_000368.4.npy,1,1164,1164
+NP_000360.2,MRPADLLQLVLLLDLPRDLGGMGCSSPPCECHQEEDFRVTCKDIQRIPSLPPSTQTLKLIETHLRTIPSHAFSNLPNISRIYVSIDVTLQQLESHSFYNLSKVTHIEIRNTRNLTYIDPDALKELPLLKFLGIFNTGLKMFPDLTKVYSTDIFFILEITDNPYMTSIPVNAFQGLCNETLTLKLYNNGFTSVQGYAFNGTKLDAVYLNKNKYLTVIDKDAFGGVYSGPSLLDVSQTSVTALPSKGLEHLKELIARNTWTLKKLPLSLSFLHLTRADLSYPSHCCAFKNQKKIRGILESLMCNESSMQSLRQRKSVNALNSPLHQEYEENLGDSIVGYKEKSKFQDTHNNAHYYVFFEEQEDEIIGFGQELKNPQEETLQAFDSHYDYTICGDSEDMVCTPKSDEFNPCEDIMGYKFLRIVVWFVSLLALLGNVFVLLILLTSHYKLNVPRFLMCNLAFADFCMGMYLLLIASVDLYTHSEYYNHAIDWQTGPGCNTAGFFTVFASELSVYTLTVITLERWYAITFAMRLDRKIRLRHACAIMVGGWVCCFLLALLPLVGISSYAKVSICLPMDTETPLALAYIVFVLTLNIVAFVIVCCCYVKIYITVRNPQYNPGDKDTKIAKRMAVLIFTDFICMAPISFYALSAILNKPLITVSNSKILLVLFYPLNSCANPFLYAIFTKAFQRDVFILLSKFGICKRQAQAYRGQRVPPKNSTDIQVQKVTHEMRQGLHNMEDVYELIENSHLTPKKQGQISEEYMQTVL,764,NP_000360.2.csv,refseq-TSHR-NM_000369.2_clinical_seed_0_final,refseq-TSHR-NM_000369.2.a2m,Invitae,refseq-TSHR-NM_000369.2.npy,1,764,764
+NP_000361.1,MAEARSQPSAGPQLNALPDHSPLLQPGLAALRRRAREAGVPLAPLPLTDSFLLRFLRARDFDLDLAWRLLKNYYKWRAECPEISADLHPRSIIGLLKAGYHGVLRSRDPTGSKVLIYRIAHWDPKVFTAYDVFRVSLITSELIVQEVETQRNGIKAIFDLEGWQFSHAFQITPSVAKKIAAVLTDSFPLKVRGIHLINEPVIFHAVFSMIKPFLTEKIKERIHMHGNNYKQSLLQHFPDILPLEYGGEEFSMEDICQEWTNFIMKSEDYLSSISESIQ,278,NP_000361.1.csv,refseq-TTPA-NM_000370.3_clinical_seed_0_final,refseq-TTPA-NM_000370.3.a2m,Invitae,refseq-TTPA-NM_000370.3.npy,1,278,278
+NP_000362.1,MASHRLLLLCLAGLVFVSEAGPTGTGESKCPLMVKVLDAVRGSPAINVAVHVFRKAADDTWEPFASGKTSESGELHGLTTEEEFVEGIYKVEIDTKSYWKALGISPFHEHAEVVFTANDSGPRRYTIAALLSPYSYSTTAVVTNPKE,147,NP_000362.1.csv,refseq-TTR-NM_000371.3_clinical_seed_0_final,refseq-TTR-NM_000371.3.a2m,Invitae,refseq-TTR-NM_000371.3.npy,1,147,147
+NP_000363.1,MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRSPCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGNCKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMKNGSTPMFNDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRWEQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVCSRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDKAANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAFVDSIFEQWLRRHRPLQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYSYLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLPEEKQPLLMEKEDYHSLYQSHL,529,NP_000363.1.csv,refseq-TYR-NM_000372.4_clinical_seed_0_final,refseq-TYR-NM_000372.4.a2m,Invitae,refseq-TYR-NM_000372.4.npy,1,529,529
+NP_000364.1,MAVARAALGPLVTGLYDVQAFKFGDFVLKSGLSSPIYIDLRGIVSRPRLLSQVADILFQTAQNAGISFDTVCGVPYTALPLATVICSTNQIPMLIRRKETKDYGTKRLVEGTINPGETCLIIEDVVTSGSSVLETVEVLQKEGLKVTDAIVLLDREQGGKDKLQAHGIRLHSVCTLSKMLEILEQQKKVDAETVGRVKRFIQENVFVAANHNGSPLSIKEAPKELSFGARAELPRIHPVASKLLRLMQKKETNLCLSADVSLARELLQLADALGPSICMLKTHVDILNDFTLDVMKELITLAKCHEFLIFEDRKFADIGNTVKKQYEGGIFKIASWADLVNAHVVPGSGVVKGLQEVGLPLHRGCLLIAEMSSTGSLATGDYTRAAVRMAEEHSEFVVGFISGSRVSMKPEFLHLTPGVQLEAGGDNLGQQYNSPQEVIGKRGSDIIIVGRGIISAADRLEAAEMYRKAAWEAYLSRLGV,480,NP_000364.1.csv,refseq-UMPS-NM_000373.3_clinical_seed_0_final,refseq-UMPS-NM_000373.3.a2m,Invitae,refseq-UMPS-NM_000373.3.npy,1,480,480
+NP_000365.3,MEANGLGPQGFPELKNDTFLRAAWGEETDYTPVWCMRQAGRYLPEFRETRAAQDFFSTCRSPEACCELTLQPLRRFPLDAAIIFSDILVVPQALGMEVTMVPGKGPSFPEPLREEQDLERLRDPEVVASELGYVFQAITLTRQRLAGRVPLIGFAGAPWTLMTYMVEGGGSSTMAQAKRWLYQRPQASHQLLRILTDALVPYLVGQVVAGAQALQLFESHAGHLGPQLFNKFALPYIRDVAKQVKARLREAGLAPVPMIIFAKDGHFALEELAQAGYEVVGLDWTVAPKKARECVGKTVTLQGNLDPCALYASEEEIGQLVKQMLDDFGPHRYIANLGHGLYPDMDPEHVGAFVDAVHKHSRLLRQN,367,NP_000365.3.csv,refseq-UROD-NM_000374.4_clinical_seed_0_final,refseq-UROD-NM_000374.4.a2m,Invitae,refseq-UROD-NM_000374.4.npy,1,367,367
+NP_000366.1,MKVLLLKDAKEDDCGQDPYIRELGLYGLEATLIPVLSFEFLSLPSFSEKLSHPEDYGGLIFTSPRAVEAAELCLEQNNKTEVWERSLKEKWNAKSVYVVGNATASLVSKIGLDTEGETCGNAEKLAEYICSRESSALPLLFPCGNLKREILPKALKDKGIAMESITVYQTVAHPGIQGNLNSYYSQQGVPASITFFSPSGLTYSLKHIQELSGDNIDQIKFAAIGPTTARALAAQGLPVSCTAESPTPQALATGIRKALQPHGCC,265,NP_000366.1.csv,refseq-UROS-NM_000375.2_clinical_seed_0_final,refseq-UROS-NM_000375.2.a2m,Invitae,refseq-UROS-NM_000375.2.npy,1,265,265
+NP_000368.1,MSGGPMGGRPGGRGAPAVQQNIPSTLLQDHENQRLFEMLGRKCLTLATAVVQLYLALPPGAEHWTKEHCGAVCFVKDNPQKSYFIRLYGLQAGRLLWEQELYSQLVYSTPTPFFHTFAGDDCQAGLNFADEDEAQAFRALVQEKIQKRNQRQSGDRRQLPPPPTPANEERRGGLPPLPLHPGGDQGGPPVGPLSLGLATVDIQNPDITSSRYRGLPAPGPSPADKKRSGKKKISKADIGAPSGFKHVSHVGWDPQNGFDVNNLDPDLRSLFSRAGISEAQLTDAETSKLIYDFIEDQGGLEAVRQEMRRQEPLPPPPPPSRGGNQLPRPPIVGGNKGRSGPLPPVPLGIAPPPPTPRGPPPPGRGGPPPPPPPATGRSGPLPPPPPGAGGPPMPPPPPPPPPPPSSGNGPAPPPLPPALVPAGGLAPGGGRGALLDQIRQGIQLNKTPGAPESSALQPPPQSSEGLVGALMHVMQKRSRAIHSSDEGEDQAGDEDEDDEWDD,502,NP_000368.1.csv,refseq-WAS-NM_000377.2_clinical_seed_0_final,refseq-WAS-NM_000377.2.a2m,Invitae,refseq-WAS-NM_000377.2.npy,1,502,502
+NP_000371.1,MAAADGALPEAAALEQPAELPASVRASIERKRQRALMLRQARLAARPYSATAAAATGGMANVKAAPKIIDTGGGFILEEEEEEEQKIGKVVHQPGPVMEFDYVICEECGKEFMDSYLMNHFDLPTCDNCRDADDKHKLITKTEAKQEYLLKDCDLEKREPPLKFIVKKNPHHSQWGDMKLYLKLQIVKRSLEVWGSQEALEEAKEVRQENREKMKQKKFDKKVKELRRAVRSSVWKRETIVHQHEYGPEENLEDDMYRKTCTMCGHELTYEKM,273,NP_000371.1.csv,refseq-XPA-NM_000380.3_clinical_seed_0_final,refseq-XPA-NM_000380.3.a2m,Invitae,refseq-XPA-NM_000380.3.npy,1,273,273
+NP_000372.1,METLESELTCPICLELFEDPLLLPCAHSLCFNCAHRILVSHCATNESVESITAFQCPTCRHVITLSQRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSETRRERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIEPIPDSHIRGLMCLEHEDEKVNMYCVTDDQLICALCKLVGRHRDHQVAALSERYDKLKQNLESNLTNLIKRNTELETLLAKLIQTCQHVEVNASRQEAKLTEECDLLIEIIQQRRQIIGTKIKEGKVMRLRKLAQQIANCKQCIERSASLISQAEHSLKENDHARFLQTAKNITERVSMATASSQVLIPEINLNDTFDTFALDFSREKKLLECLDYLTAPNPPTIREELCTASYDTITVHWTSDDEFSVVSYELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTVHGLQSGTKYIFMVKAINQAGSRSSEPGKLKTNSQPFKLDPKSAHRKLKVSHDNLTVERDESSSKKSHTPERFTSQGSYGVAGNVFIDSGRHYWEVVISGSTWYAIGLAYKSAPKHEWIGKNSASWALCRCNNNWVVRHNSKEIPIEPAPHLRRVGILLDYDNGSIAFYDALNSIHLYTFDVAFAQPVCPTFTVWNKCLTIITGLPIPDHLDCTEQLP,667,NP_000372.1.csv,refseq-MID1-NM_000381.3_clinical_seed_0_final,refseq-MID1-NM_000381.3.a2m,Invitae,refseq-MID1-NM_000381.3.npy,1,667,667
+NP_000373.1,MELEVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVYSQEVITVLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVLTIQPLIGAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQRFDHIFYTGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNCGQTCIAPDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEGQKIAFGGETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREKPLALYVFSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVGSSGMGAYHGKHSFDTFSHQRPCLLKSLKREGANKLRYPPNSQSKVDWGKFFLLKRFNKEKLGLLLLTFLGIVAAVLVKAEYY,485,NP_000373.1.csv,refseq-ALDH3A2-NM_000382.2_clinical_seed_0_final,refseq-ALDH3A2-NM_000382.2.a2m,Invitae,refseq-ALDH3A2-NM_000382.2_theta_0.2.npy,1,485,485
+NP_000374.1,MATDAALRRLLRLHRTEIAVAVDSAFPLLHALADHDVVPEDKFQETLHLKEKEGCPQAFHALLSWLLTQDSTAILDFWRVLFKDYNLERYGRLQPILDSFPKDVDLSQPRKGRKPPAVPKALVPPPRLPTKRKASEEARAAAPAALTPRGTASPGSQLKAKPPKKPESSAEQQRLPLGNGIQTMSASVQRAVAMSSGDVPGARGAVEGILIQQVFESGGSKKCIQVGGEFYTPSKFEDSGSGKNKARSSSGPKPLVRAKGAQGAAPGGGEARLGQQGSVPAPLALPSDPQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSAGEEVRGPPGEPLAGMDTTLVYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGARCGVCGDGTDVLRCTHCAAAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPARLAPGPAKDDTASHEPALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS,545,NP_000374.1.csv,refseq-AIRE-NM_000383.3_clinical_seed_0_final,refseq-AIRE-NM_000383.3.a2m,Invitae,refseq-AIRE-NM_000383.3.npy,1,545,545
+NP_000378.1,MADQPKPISPLKNLLAGGFGGVCLVFVGHPLDTVKVRLQTQPPSLPGQPPMYSGTFDCFRKTLFREGITGLYRGMAAPIIGVTPMFAVCFFGFGLGKKLQQKHPEDVLSYPQLFAAGMLSGVFTTGIMTPGERIKCLLQIQASSGESKYTGTLDCAKKLYQEFGIRGIYKGTVLTLMRDVPASGMYFMTYEWLKNIFTPEGKRVSELSAPRILVAGGIAGIFNWAVAIPPDVLKSRFQTAPPGKYPNGFRDVLRELIRDEGVTSLYKGFNAVMIRAFPANAACFLGFEVAMKFLNWATPNL,301,NP_000378.1.csv,refseq-SLC25A20-NM_000387.5_clinical_seed_0_final,refseq-SLC25A20-NM_000387.5.a2m,Invitae,refseq-SLC25A20-NM_000387.5.npy,1,301,301
+NP_000381.1,MADTLPSEFDVIVIGTGLPESIIAAACSRSGRRVLHVDSRSYYGGNWASFSFSGLLSWLKEYQENSDIVSDSPVWQDQILENEEAIALSRKDKTIQHVEVFCYASQDLHEDVEEAGALQKNHALVTSANSTEAADSAFLPTEDESLSTMSCEMLTEQTPSSDPENALEVNGAEVTGEKENHCDDKTCVPSTSAEDMSENVPIAEDTTEQPKKNRITYSQIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGRVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCMEYEKYPDEYKGYEEITFYEYLKTQKLTPNLQYIVMHSIAMTSETASSTIDGLKATKNFLHCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAIIDQFGQRIISEHFLVEDSYFPENMCSRVQYRQISRAVLITDRSVLKTDSDQQISILTVPAEEPGTFAVRVIELCSSTMTCMKGTYLVHLTCTSSKTAREDLESVVQKLFVPYTEMEIENEQVEKPRILWALYFNMRDSSDISRSCYNDLPSNVYVCSGPDCGLGNDNAVKQAETLFQEICPNEDFCPPPPNPEDIILDGDSLQPEASESSAIPEANSETFKESTNLGNLEESSE,653,NP_000381.1.csv,refseq-CHM-NM_000390.2_clinical_seed_0_final,refseq-CHM-NM_000390.2.a2m,Invitae,refseq-CHM-NM_000390.2.npy,1,653,653
+NP_000382.3,MGLQACLLGLFALILSGKCSYSPEPDQRRTLPPGWVSLGRADPEEELSLTFALRQQNVERLSELVQAVSDPSSPQYGKYLTLENVADLVRPSPLTLHTVQKWLLAAGAQKCHSVITQDFLTCWLSIRQAELLLPGAEFHHYVGGPTETHVVRSPHPYQLPQALAPHVDFVGGLHRFPPTSSLRQRPEPQVTGTVGLHLGVTPSVIRKRYNLTSQDVGSGTSNNSQACAQFLEQYFHDSDLAQFMRLFGGNFAHQASVARVVGQQGRGRAGIEASLDVQYLMSAGANISTWVYSSPGRHEGQEPFLQWLMLLSNESALPHVHTVSYGDDEDSLSSAYIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGRHQFRPTFPASSPYVTTVGGTSFQEPFLITNEIVDYISGGGFSNVFPRPSYQEEAVTKFLSSSPHLPPSSYFNASGRAYPDVAALSDGYWVVSNRVPIPWVSGTSASTPVFGGILSLINEHRILSGRPPLGFLNPRLYQQHGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTLLNP,563,NP_000382.3.csv,refseq-TPP1-NM_000391.3_clinical_seed_0_final,refseq-TPP1-NM_000391.3.a2m,Invitae,refseq-TPP1-NM_000391.3.npy,1,563,563
+NP_000383.2,MLEKFCNSTFWNSSFLDSPEADLPLCFEQTVLVWIPLGYLWLLAPWQLLHVYKSRTKRSSTTKLYLAKQVFVGFLLILAAIELALVLTEDSGQATVPAVRYTNPSLYLGTWLLVLLIQYSRQWCVQKNSWFLSLFWILSILCGTFQFQTLIRTLLQGDNSNLAYSCLFFISYGFQILILIFSAFSENNESSNNPSSIASFLSSITYSWYDSIILKGYKRPLTLEDVWEVDEEMKTKTLVSKFETHMKRELQKARRALQRRQEKSSQQNSGARLPGLNKNQSQSQDALVLEDVEKKKKKSGTKKDVPKSWLMKALFKTFYMVLLKSFLLKLVNDIFTFVSPQLLKLLISFASDRDTYLWIGYLCAILLFTAALIQSFCLQCYFQLCFKLGVKVRTAIMASVYKKALTLSNLARKEYTVGETVNLMSVDAQKLMDVTNFMHMLWSSVLQIVLSIFFLWRELGPSVLAGVGVMVLVIPINAILSTKSKTIQVKNMKNKDKRLKIMNEILSGIKILKYFAWEPSFRDQVQNLRKKELKNLLAFSQLQCVVIFVFQLTPVLVSVVTFSVYVLVDSNNILDAQKAFTSITLFNILRFPLSMLPMMISSMLQASVSTERLEKYLGGDDLDTSAIRHDCNFDKAMQFSEASFTWEHDSEATVRDVNLDIMAGQLVAVIGPVGSGKSSLISAMLGEMENVHGHITIKGTTAYVPQQSWIQNGTIKDNILFGTEFNEKRYQQVLEACALLPDLEMLPGGDLAEIGEKGINLSGGQKQRISLARATYQNLDIYLLDDPLSAVDAHVGKHIFNKVLGPNGLLKGKTRLLVTHSMHFLPQVDEIVVLGNGTIVEKGSYSALLAKKGEFAKNLKTFLRHTGPEEEATVHDGSEEEDDDYGLISSVEEIPEDAASITMRRENSFRRTLSRSSRSNGRHLKSLRNSLKTRNVNSLKEDEELVKGQKLIKKEFIETGKVKFSIYLEYLQAIGLFSIFFIILAFVMNSVAFIGSNLWLSAWTSDSKIFNSTDYPASQRDMRVGVYGALGLAQGIFVFIAHFWSAFGFVHASNILHKQLLNNILRAPMRFFDTTPTGRIVNRFAGDISTVDDTLPQSLRSWITCFLGIISTLVMICMATPVFTIIVIPLGIIYVSVQMFYVSTSRQLRRLDSVTRSPIYSHFSETVSGLPVIRAFEHQQRFLKHNEVRIDTNQKCVFSWITSNRWLAIRLELVGNLTVFFSALMMVIYRDTLSGDTVGFVLSNALNITQTLNWLVRMTSEIETNIVAVERITEYTKVENEAPWVTDKRPPPDWPSKGKIQFNNYQVRYRPELDLVLRGITCDIGSMEKIGVVGRTGAGKSSLTNCLFRILEAAGGQIIIDGVDIASIGLHDLREKLTIIPQDPILFSGSLRMNLDPFNNYSDEEIWKALELAHLKSFVASLQLGLSHEVTEAGGNLSIGQRQLLCLGRALLRKSKILVLDEATAAVDLETDNLIQTTIQNEFAHCTVITIAHRLHTIMDSDKVMVLDNGKIIECGSPEELLQIPGPFYFMAKEAGIENVNSTKF,1545,NP_000383.2.csv,MRP2_HUMAN_b07_clinical_seed_0_final,MRP2_HUMAN_b07.a2m,EVE,MRP2_HUMAN_b07_theta_0.2.npy,1,1545,1545
+NP_000384.2,MMANWAEARPLLILIVLLGQFVSIKAQEEDEDEGYGEEIACTQNGQMYLNRDIWKPAPCQICVCDNGAILCDKIECQDVLDCADPVTPPGECCPVCSQTPGGGNTNFGRGRKGQKGEPGLVPVVTGIRGRPGPAGPPGSQGPRGERGPKGRPGPRGPQGIDGEPGVPGQPGAPGPPGHPSHPGPDGLSRPFSAQMAGLDEKSGLGSQVGLMPGSVGPVGPRGPQGLQGQQGGAGPTGPPGEPGDPGPMGPIGSRGPEGPPGKPGEDGEPGRNGNPGEVGFAGSPGARGFPGAPGLPGLKGHRGHKGLEGPKGEVGAPGSKGEAGPTGPMGAMGPLGPRGMPGERGRLGPQGAPGQRGAHGMPGKPGPMGPLGIPGSSGFPGNPGMKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGAKGPTGSPGTSGPPGSAGPPGSPGPQGSTGPQGIRGQPGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRGPRGDPGTVGPPGPVGERGAPGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGSQGDPGRPGEPGLPGARGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSGDPGKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQGPPGPTGFQGLPGPPGPPGEGGKPGDQGVPGDPGAVGPLGPRGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKGSPGPSGTPGDTGPPGLQGMPGERGIAGTPGPKGDRGGIGEKGAEGTAGNDGARGLPGPLGPPGPAGPTGEKGEPGPRGLVGPPGSRGNPGSRGENGPTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGPHGPNGVPGLKGGRGTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGEPGKEGPPGLRGDPGSHGRVGDRGPAGPPGGPGDKGDPGEDGQPGPDGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGKVGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGERGDRGDPGPAGLPGSQGAPGTPGPVGAPGDAGQRGDPGSRGPIGPPGRAGKRGLPGPQGPRGDKGDHGDRGDRGQKGHRGFTGLQGLPGPPGPNGEQGSAGIPGPFGPRGPPGPVGPSGKEGNPGPLGPIGPPGVRGSVGEAGPEGPPGEPGPPGPPGPPGHLTAALGDIMGHYDESMPDPLPEFTEDQAAPDDKNKTDPGVHATLKSLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVEDAIKVYCNMETGETCISANPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPNTAITQMTFLRLLSKEASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDIKAEGNIRFRYIVLQDTCSKRNGNVGKTVFEYRTQNVARLPIIDLAPVDVGGTDQEFGVEIGPVCFV,1499,NP_000384.2.csv,refseq-COL5A2-NM_000393.4_clinical_seed_0_final,refseq-COL5A2-NM_000393.4.a2m,Invitae,refseq-COL5A2-NM_000393.4.npy,1,1499,1499
+NP_000387.1,MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM,329,NP_000387.1.csv,refseq-CTSK-NM_000396.4_clinical_seed_0_final,refseq-CTSK-NM_000396.4.a2m,Invitae,refseq-CTSK-NM_000396.4.npy,1,329,329
+NP_000388.2,MGNWAVNEGLSIFVILVWLGLNVFLFVWYYRVYDIPPKFFYTRKLLGSALALARAPAACLNFNCMLILLPVCRNLLSFLRGSSACCSTRVRRQLDRNLTFHKMVAWMIALHSAIHTIAHLFNVEWCVNARVNNSDPYSVALSELGDRQNESYLNFARKRIKNPEGGLYLAVTLLAGITGVVITLCLILIITSSTKTIRRSYFEVFWYTHHLFVIFFIGLAIHGAERIVRGQTAESLAVHNITVCEQKISEWGKIKECPIPQFAGNPPMTWKWIVGPMFLYLCERLVRFWRSQQKVVITKVVTHPFKTIELQMKKKGFKMEVGQYIFVKCPKVSKLEWHPFTLTSAPEEDFFSIHIRIVGDWTEGLFNACGCDKQEFQDAWKLPKIAVDGPFGTASEDVFSYEVVMLVGAGIGVTPFASILKSVWYKYCNNATNLKLKKIYFYWLCRDTHAFEWFADLLQLLESQMQERNNAGFLSYNIYLTGWDESQANHFAVHHDEEKDVITGLKQKTLYGRPNWDNEFKTIASQHPNTRIGVFLCGPEALAETLSKQSISNSESGPRGVHFIFNKENF,570,NP_000388.2.csv,refseq-CYBB-NM_000397.3_clinical_seed_0_final,refseq-CYBB-NM_000397.3.a2m,Invitae,refseq-CYBB-NM_000397.3.npy,1,570,570
+NP_000389.1,MGAQLSTLGHMVLFPVWFLYSLLMKLFQRSTPAITLESPDIKYPLRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARIDGNLVVRPYTPISSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIRPDKKSNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQTEKDILLRPELEELRNKHSARFKLWYTLDRAPEAWDYGQGFVNEEMIRDHLPPPEEEPLVLMCGPPPMIQYACLPNLDHVGHPTERCFVF,301,NP_000389.1.csv,refseq-CYB5R3-NM_000398.6_clinical_seed_0_final,refseq-CYB5R3-NM_000398.6.a2m,Invitae,refseq-CYB5R3-NM_000398.6.npy,1,301,301
+NP_000391.1,MKLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPSGTGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKLPFLGLALSSRKNLCIHPEVTPLRFGKDVDGKCHSLTASYVRAQYQHDTSLPHCRFYEEFDAHGREVPLPAGIYNLDDLKALGRRQGWCPYFLARYSILHANVVVYSYHYLLDPKIADLVSKELARKAVVVFDEAHNIDNVCIDSMSVNLTRRTLDRCQGNLETLQKTVLRIKETDEQRLRDEYRRLVEGLREASAARETDAHLANPVLPDEVLQEAVPGSIRTAEHFLGFLRRLLEYVKWRLRVQHVVQESPPAFLSGLAQRVCIQRKPLRFCAERLRSLLHTLEITDLADFSPLTLLANFATLVSTYAKGFTIIIEPFDDRTPTIANPILHFSCMDASLAIKPVFERFQSVIITSGTLSPLDIYPKILDFHPVTMATFTMTLARVCLCPMIIGRGNDQVAISSKFETREDIAVIRNYGNLLLEMSAVVPDGIVAFFTSYQYMESTVASWYEQGILENIQRNKLLFIETQDGAETSVALEKYQEACENGRGAILLSVARGKVSEGIDFVHHYGRAVIMFGVPYVYTQSRILKARLEYLRDQFQIRENDFLTFDAMRHAAQCVGRAIRGKTDYGLMVFADKRFARGDKRGKLPRWIQEHLTDANLNLTVDEGVQVAKYFLRQMAQPFHREDQLGLSLLSLEQLESEETLKRIEQIAQQL,760,NP_000391.1.csv,refseq-ERCC2-NM_000400.3_clinical_seed_0_final,refseq-ERCC2-NM_000400.3.a2m,Invitae,refseq-ERCC2-NM_000400.3.npy,1,760,760
+NP_000393.4,MGRRGSAPGNGRTLRGCERGGRRRRSADSVMAEQVALSRTQVCGILREELFQGDAFHQSDTHIFIIMGASGDLAKKKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFFARNSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTVYEAVTKNIHESCMSQIGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQNLMVLRFANRIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCLVAMEKPASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPRGSTTATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNELVIRVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFCGSQMHFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGTYKWVNPHKL,545,NP_000393.4.csv,refseq-G6PD-NM_000402.3_clinical_seed_0_final,refseq-G6PD-NM_000402.3.a2m,Invitae,refseq-G6PD-NM_000402.3.npy,1,545,545
+NP_000396.2,MQSLMQAPLLIALGLLLAAPAQAHLKKPSQLSSFSWDNCDEGKDPAVIRSLTLEPDPIIVPGNVTLSVMGSTSVPLSSPLKVDLVLEKEVAGLWIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFVVPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI,193,NP_000396.2.csv,refseq-GM2A-NM_000405.4_clinical_seed_0_final,refseq-GM2A-NM_000405.4.a2m,Invitae,refseq-GM2A-NM_000405.4.npy,1,193,193
+NP_000398.1,MGSGPRGALSLLLLLLAPPSRPAAGCPAPCSCAGTLVDCGRRGLTWASLPTAFPVDTTELVLTGNNLTALPPGLLDALPALRTAHLGANPWRCDCRLVPLRAWLAGRPERAPYRDLRCVAPPALRGRLLPYLAEDELRAACAPGPLCWGALAAQLALLGLGLLHALLLVLLLCRLRRLRARARARAAARLSLTDPLVAERAGTDES,206,NP_000398.1.csv,refseq-GP1BB-NM_000407.4_clinical_seed_0_final,refseq-GP1BB-NM_000407.4.a2m,Invitae,refseq-GP1BB-NM_000407.4.npy,1,206,206
+NP_000400.2,MGNVMEGKSVEELSSTECHQWYKKFMTECPSGQLTLYEFRQFFGLKNLSPSASQYVEQMFETFDFNKDGYIDFMEYVAALSLVLKGKVEQKLRWYFKLYDVDGNGCIDRDELLTIIQAIRAINPCSDTTMTAEEFTDTVFSKIDVNGDGELSLEEFIEGVQKDQMLLDTLTRSLDLTRIVRRLQNGEQDEEGADEAAEAAG,201,NP_000400.2.csv,refseq-GUCA1A-NM_000409.4_clinical_seed_0_final,refseq-GUCA1A-NM_000409.4.a2m,Invitae,refseq-GUCA1A-NM_000409.4.npy,1,201,201
+NP_000401.1,MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE,348,NP_000401.1.csv,refseq-HFE-NM_000410.3_clinical_seed_0_final,refseq-HFE-NM_000410.3.a2m,Invitae,refseq-HFE-NM_000410.3.npy,1,348,348
+NP_000402.3,MEDRLHMDNGLVPQKIVSVHLQDSTLKEVKDQVSNKQAQILEPKPEPSLEIKPEQDGMEHVGRDDPKALGEEPKQRRGSASGSEPAGDSDRGGGPVEHYHLHLSSCHECLELENSTIESVKFASAENIPDLPYDYSSSLESVADETSPEREGRRVNLTGKAPNILLYVGSDSQEALGRFHEVRSVLADCVDIDSYILYHLLEDSALRDPWTDNCLLLVIATRESIPEDLYQKFMAYLSQGGKVLGLSSSFTFGGFQVTSKGALHKTVQNLVFSKADQSEVKLSVLSSGCRYQEGPVRLSPGRLQGHLENEDKDRMIVHVPFGTRGGEAVLCQVHLELPPSSNIVQTPEDFNLLKSSNFRRYEVLREILTTLGLSCDMKQVPALTPLYLLSAAEEIRDPLMQWLGKHVDSEGEIKSGQLSLRFVSSYVSEVEITPSCIPVVTNMEAFSSEHFNLEIYRQNLQTKQLGKVILFAEVTPTTMRLLDGLMFQTPQEMGLIVIAARQTEGKGRGGNVWLSPVGCALSTLLISIPLRSQLGQRIPFVQHLMSVAVVEAVRSIPEYQDINLRVKWPNDIYYSDLMKIGGVLVNSTLMGETFYILIGCGFNVTNSNPTICINDLITEYNKQHKAELKPLRADYLIARVVTVLEKLIKEFQDKGPNSVLPLYYRYWVHSGQQVHLGSAEGPKVSIVGLDDSGFLQVHQEGGEVVTVHPDGNSFDMLRNLILPKRR,726,NP_000402.3.csv,refseq-HLCS-NM_000411.6_clinical_seed_0_final,refseq-HLCS-NM_000411.6.a2m,Invitae,refseq-HLCS-NM_000411.6.npy,1,726,726
+NP_000405.1,MGSPLRFDGRVVLVTGAGAGLGRAYALAFAERGALVVVNDLGGDFKGVGKGSLAADKVVEEIRRRGGKAVANYDSVEEGEKVVKTALDAFGRIDVVVNNAGILRDRSFARISDEDWDIIHRVHLRGSFQVTRAAWEHMKKQKYGRIIMTSSASGIYGNFGQANYSAAKLGLLGLANSLAIEGRKSNIHCNTIAPNAGSRMTQTVMPEDLVEALKPEYVAPLVLWLCHESCEENGGLFEVGAGWIGKLRWERTLGAIVRQKNHPMTPEAVKANWKKICDFENASKPQSIQESTGSIIEVLSKIDSEGGVSANHTSRATSTATSGFAGAIGQKLPPFSYAYTELEAIMYALGVGASIKDPKDLKFIYEGSSDFSCLPTFGVIIGQKSMMGGGLAEIPGLSINFAKVLHGEQYLELYKPLPRAGKLKCEAVVADVLDKGSGVVIIMDVYSYSEKELICHNQFSLFLVGSGGFGGKRTSDKVKVAVAIPNRPPDAVLTDTTSLNQAALYRLSGDWNPLHIDPNFASLAGFDKPILHGLCTFGFSARRVLQQFADNDVSRFKAIKARFAKPVYPGQTLQTEMWKEGNRIHFQTKVQETGDIVISNAYVDLAPTSGTSAKTPSEGGKLQSTFVFEEIGRRLKDIGPEVVKKVNAVFEWHITKGGNIGAKWTIDLKSGSGKVYQGPAKGAADTTIILSDEDFMEVVLGKLDPQKAFFSGRLKARGNIMLSQKLQMILKDYAKL,736,NP_000405.1.csv,refseq-HSD17B4-NM_000414.3_clinical_seed_0_final,refseq-HSD17B4-NM_000414.3.a2m,Invitae,refseq-HSD17B4-NM_000414.3.npy,1,736,736
+NP_000407.1,MALLFLLPLVMQGVSRAEMGTADLGPSSVPTPTNVTIESYNMNPIVYWEYQIMPQVPVFTVEVKNYGVKNSEWIDACINISHHYCNISDHVGDPSNSLWVRVKARVGQKESAYAKSEEFAVCRDGKIGPPKLDIRKEEKQIMIDIFHPSVFVNGDEQEVDYDPETTCYIRVYNVYVRMNGSEIQYKILTQKEDDCDEIQCQLAIPVSSLNSQYCVSAEGVLHVWGVTTEKSKEVCITIFNSSIKGSLWIPVVAALLLFLVLSLVFICFYIKKINPLKEKSIILPKSLISVVRSATLETKPESKYVSLITSYQPFSLEKEVVCEEPLSPATVPGMHTEDNPGKVEHTEELSSITEVVTTEENIPDVVPGSHLTPIERESSSPLSSNQSEPGSIALNSYHSRNCSESDHSRNGFDTDSSCLESHSSLSDSEFPPNNKGEIKTEGQELITVIKAPTSFGYDKPHVLVDLLVDDSGKESLIGYRPTEDSKEFS,489,NP_000407.1.csv,refseq-IFNGR1-NM_000416.2_clinical_seed_0_final,refseq-IFNGR1-NM_000416.2.a2m,Invitae,refseq-IFNGR1-NM_000416.2.npy,1,489,489
+NP_000410.2,MARALCPLQALWLLEWVLLLLGPCAAPPAWALNLDPVQLTFYAGPNGSQFGFSLDFHKDSHGRVAIVVGAPRTLGPSQEETGGVFLCPWRAEGGQCPSLLFDLRDETRNVGSQTLQTFKARQGLGASVVSWSDVIVACAPWQHWNVLEKTEEAEKTPVGSCFLAQPESGRRAEYSPCRGNTLSRIYVENDFSWDKRYCEAGFSSVVTQAGELVLGAPGGYYFLGLLAQAPVADIFSSYRPGILLWHVSSQSLSFDSSNPEYFDGYWGYSVAVGEFDGDLNTTEYVVGAPTWSWTLGAVEILDSYYQRLHRLRGEQMASYFGHSVAVTDVNGDGRHDLLVGAPLYMESRADRKLAEVGRVYLFLQPRGPHALGAPSLLLTGTQLYGRFGSAIAPLGDLDRDGYNDIAVAAPYGGPSGRGQVLVFLGQSEGLRSRPSQVLDSPFPTGSAFGFSLRGAVDIDDNGYPDLIVGAYGANQVAVYRAQPVVKASVQLLVQDSLNPAVKSCVLPQTKTPVSCFNIQMCVGATGHNIPQKLSLNAELQLDRQKPRQGRRVLLLGSQQAGTTLNLDLGGKHSPICHTTMAFLRDEADFRDKLSPIVLSLNVSLPPTEAGMAPAVVLHGDTHVQEQTRIVLDCGEDDVCVPQLQLTASVTGSPLLVGADNVLELQMDAANEGEGAYEAELAVHLPQGAHYMRALSNVEGFERLICNQKKENETRVVLCELGNPMKKNAQIGIAMLVSVGNLEEAGESVSFQLQIRSKNSQNPNSKIVLLDVPVRAEAQVELRGNSFPASLVVAAEEGEREQNSLDSWGPKVEHTYELHNNGPGTVNGLHLSIHLPGQSQPSDLLYILDIQPQGGLQCFPQPPVNPLKVDWGLPIPSPSPIHPAHHKRDRRQIFLPEPEQPSRLQDPVLVSCDSAPCTVVQCDLQEMARGQRAMVTVLAFLWLPSLYQRPLDQFVLQSHAWFNVSSLPYAVPPLSLPRGEAQVWTQLLRALEERAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNRPPLEEDDEEGE,1039,NP_000410.2.csv,refseq-ITGA2B-NM_000419.3_clinical_seed_0_final,refseq-ITGA2B-NM_000419.3.a2m,Invitae,refseq-ITGA2B-NM_000419.3.npy,1,1039,1039
+NP_000412.4,MSVRYSSSKHYSSSRSGGGGGGGGCGGGGGVSSLRISSSKGSLGGGFSSGGFSGGSFSRGSSGGGCFGGSSGGYGGLGGFGGGSFRGSYGSSSFGGSYGGIFGGGSFGGGSFGGGSFGGGGFGGGGFGGGFGGGFGGDGGLLSGNEKVTMQNLNDRLASYLDKVRALEESNYELEGKIKEWYEKHGNSHQGEPRDYSKYYKTIDDLKNQILNLTTDNANILLQIDNARLAADDFRLKYENEVALRQSVEADINGLRRVLDELTLTKADLEMQIESLTEELAYLKKNHEEEMKDLRNVSTGDVNVEMNAAPGVDLTQLLNNMRSQYEQLAEQNRKDAEAWFNEKSKELTTEIDNNIEQISSYKSEITELRRNVQALEIELQSQLALKQSLEASLAETEGRYCVQLSQIQAQISALEEQLQQIRAETECQNTEYQQLLDIKIRLENEIQTYRSLLEGEGSSGGGGRGGGSFGGGYGGGSSGGGSSGGGHGGGHGGSSGGGYGGGSSGGGSSGGGYGGGSSSGGHGGSSSGGYGGGSSGGGGGGYGGGSSGGGSSSGGGYGGGSSSGGHKSSSSGSVGESSSKGPRY,584,NP_000412.4.csv,refseq-KRT10-NM_000421.5_clinical_seed_0_final,refseq-KRT10-NM_000421.5.a2m,Invitae,refseq-KRT10-NM_000421.5_theta_0.2.npy,1,584,584
+NP_000413.1,MTTSIRQFTSSSSIKGSSGLGGGSSRTSCRLSGGLGAGSCRLGSAGGLGSTLGGSSYSSCYSFGSGGGYGSSFGGVDGLLAGGEKATMQNLNDRLASYLDKVRALEEANTELEVKIRDWYQRQAPGPARDYSQYYRTIEELQNKILTATVDNANILLQIDNARLAADDFRTKFETEQALRLSVEADINGLRRVLDELTLARADLEMQIENLKEELAYLKKNHEEEMNALRGQVGGEINVEMDAAPGVDLSRILNEMRDQYEKMAEKNRKDAEDWFFSKTEELNREVATNSELVQSGKSEISELRRTMQALEIELQSQLSMKASLEGNLAETENRYCVQLSQIQGLIGSVEEQLAQLRCEMEQQNQEYKILLDVKTRLEQEIATYRRLLEGEDAHLTQYKKEPVTTRQVRTIVEEVQDGKVISSREQVHQTTR,432,NP_000413.1.csv,refseq-KRT17-NM_000422.2_clinical_seed_0_final,refseq-KRT17-NM_000422.2.a2m,Invitae,refseq-KRT17-NM_000422.2.npy,1,432,432
+NP_000414.2,MSCQISCKSRGRGGGGGGFRGFSSGSAVVSGGSRRSTSSFSCLSRHGGGGGGFGGGGFGSRSLVGLGGTKSISISVAGGGGGFGAAGGFGGRGGGFGGGSSFGGGSGFSGGGFGGGGFGGGRFGGFGGPGGVGGLGGPGGFGPGGYPGGIHEVSVNQSLLQPLNVKVDPEIQNVKAQEREQIKTLNNKFASFIDKVRFLEQQNQVLQTKWELLQQMNVGTRPINLEPIFQGYIDSLKRYLDGLTAERTSQNSELNNMQDLVEDYKKKYEDEINKRTAAENDFVTLKKDVDNAYMIKVELQSKVDLLNQEIEFLKVLYDAEISQIHQSVTDTNVILSMDNSRNLDLDSIIAEVKAQYEEIAQRSKEEAEALYHSKYEELQVTVGRHGDSLKEIKIEISELNRVIQRLQGEIAHVKKQCKNVQDAIADAEQRGEHALKDARNKLNDLEEALQQAKEDLARLLRDYQELMNVKLALDVEIATYRKLLEGEECRMSGDLSSNVTVSVTSSTISSNVASKAAFGGSGGRGSSSGGGYSSGSSSYGSGGRQSGSRGGSGGGGSISGGGYGSGGGSGGRYGSGGGSKGGSISGGGYGSGGGKHSSGGGSRGGSSSGGGYGSGGGGSSSVKGSSGEAFGSSVTFSFR,639,NP_000414.2.csv,refseq-KRT2-NM_000423.2_clinical_seed_0_final,refseq-KRT2-NM_000423.2.a2m,Invitae,refseq-KRT2-NM_000423.2.npy,1,639,639
+NP_000415.2,MSRQSSVSFRSGGSRSFSTASAITPSVSRTSFTSVSRSGGGGGGGFGRVSLAGACGVGGYGSRSLYNLGGSKRISISTSGGSFRNRFGAGAGGGYGFGGGAGSGFGFGGGAGGGFGLGGGAGFGGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPSIQRVRTEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDFKNKYEDEINKRTTAENEFVMLKKDVDAAYMNKVELEAKVDALMDEINFMKMFFDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIANRSRTEAESWYQTKYEELQQTAGRHGDDLRNTKHEISEMNRMIQRLRAEIDNVKKQCANLQNAIADAEQRGELALKDARNKLAELEEALQKAKQDMARLLREYQELMNTKLALDVEIATYRKLLEGEECRLSGEGVGPVNISVVTSSVSSGYGSGSGYGGGLGGGLGGGLGGGLAGGSSGSYYSSSSGGVGLGGGLSVGGSGFSASSGRGLGVGFGSGGGSSSSVKFVSTTSSSRKSFKS,590,NP_000415.2.csv,refseq-KRT5-NM_000424.3_clinical_seed_0_final,refseq-KRT5-NM_000424.3.a2m,Invitae,refseq-KRT5-NM_000424.3.npy,1,590,590
+NP_000416.1,MVVALRYVWPLLLCSPCLLIQIPEEYEGHHVMEPPVITEQSPRRLVVFPTDDISLKCEASGKPEVQFRWTRDGVHFKPKEELGVTVYQSPHSGSFTITGNNSNFAQRFQGIYRCFASNKLGTAMSHEIRLMAEGAPKWPKETVKPVEVEEGESVVLPCNPPPSAEPLRIYWMNSKILHIKQDERVTMGQNGNLYFANVLTSDNHSDYICHAHFPGTRTIIQKEPIDLRVKATNSMIDRKPRLLFPTNSSSHLVALQGQPLVLECIAEGFPTPTIKWLRPSGPMPADRVTYQNHNKTLQLLKVGEEDDGEYRCLAENSLGSARHAYYVTVEAAPYWLHKPQSHLYGPGETARLDCQVQGRPQPEVTWRINGIPVEELAKDQKYRIQRGALILSNVQPSDTMVTQCEARNRHGLLLANAYIYVVQLPAKILTADNQTYMAVQGSTAYLLCKAFGAPVPSVQWLDEDGTTVLQDERFFPYANGTLGIRDLQANDTGRYFCLAANDQNNVTIMANLKVKDATQITQGPRSTIEKKGSRVTFTCQASFDPSLQPSITWRGDGRDLQELGDSDKYFIEDGRLVIHSLDYSDQGNYSCVASTELDVVESRAQLLVVGSPGPVPRLVLSDLHLLTQSQVRVSWSPAEDHNAPIEKYDIEFEDKEMAPEKWYSLGKVPGNQTSTTLKLSPYVHYTFRVTAINKYGPGEPSPVSETVVTPEAAPEKNPVDVKGEGNETTNMVITWKPLRWMDWNAPQVQYRVQWRPQGTRGPWQEQIVSDPFLVVSNTSTFVPYEIKVQAVNSQGKGPEPQVTIGYSGEDYPQAIPELEGIEILNSSAVLVKWRPVDLAQVKGHLRGYNVTYWREGSQRKHSKRHIHKDHVVVPANTTSVILSGLRPYSSYHLEVQAFNGRGSGPASEFTFSTPEGVPGHPEALHLECQSNTSLLLRWQPPLSHNGVLTGYVLSYHPLDEGGKGQLSFNLRDPELRTHNLTDLSPHLRYRFQLQATTKEGPGEAIVREGGTMALSGISDFGNISATAGENYSVVSWVPKEGQCNFRFHILFKALGEEKGGASLSPQYVSYNQSSYTQWDLQPDTDYEIHLFKERMFRHQMAVKTNGTGRVRLPPAGFATEGWFIGFVSAIILLLLVLLILCFIKRSKGGKYSVKDKEDTQVDSEARPMKDETFGEYRSLESDNEEKAFGSSQPSLNGDIKPLGSDDSLADYGGSVDVQFNEDGSFIGQYSGKKEKEAAGGNDSSGATSPINPAVALE,1257,NP_000416.1.csv,refseq-L1CAM-NM_000425.4_clinical_seed_0_final,refseq-L1CAM-NM_000425.4.a2m,Invitae,refseq-L1CAM-NM_000425.4.npy,1,1257,1257
+NP_000421.1,MVLSQRQRDELNRAIADYLRSNGYEEAYSVFKKEAELDVNEELDKKYAGLLEKKWTSVIRLQKKVMELESKLNEAKEEFTSGGPLGQKRDPKEWIPRPPEKYALSGHRSPVTRVIFHPVFSVMVSASEDATIKVWDYETGDFERTLKGHTDSVQDISFDHSGKLLASCSADMTIKLWDFQGFECIRTMHGHDHNVSSVAIMPNGDHIVSASRDKTIKMWEVQTGYCVKTFTGHREWVRMVRPNQDGTLIASCSNDQTVRVWVVATKECKAELREHEHVVECISWAPESSYSSISEATGSETKKSGKPGPFLLSGSRDKTIKMWDVSTGMCLMTLVGHDNWVRGVLFHSGGKFILSCADDKTLRVWDYKNKRCMKTLNAHEHFVTSLDFHKTAPYVVTGSVDQTVKVWECR,410,NP_000421.1.csv,refseq-PAFAH1B1-NM_000430.3_clinical_seed_0_final,refseq-PAFAH1B1-NM_000430.3.a2m,Invitae,refseq-PAFAH1B1-NM_000430.3.npy,1,410,410
+NP_000422.1,MLSEVLLVSAPGKVILHGEHAVVHGKVALAVSLNLRTFLRLQPHSNGKVDLSLPNIGIKRAWDVARLQSLDTSFLEQGDVTTPTSEQVEKLKEVAGLPDDCAVTERLAVLAFLYLYLSICRKQRALPSLDIVVWSELPPGAGLGSSAAYSVCLAAALLTVCEEIPNPLKDGDCVNRWTKEDLELINKWAFQGERMIHGNPSGVDNAVSTWGGALRYHQGKISSLKRSPALQILLTNTKVPRNTRALVAGVRNRLLKFPEIVAPLLTSIDAISLECERVLGEMGEAPAPEQYLVLEELIDMNQHHLNALGVGHASLDQLCQVTRARGLHSKLTGAGGGGCGITLLKPGLEQPEVEATKQALTSCGFDCLETSIGAPGVSIHSATSLDSRVQQALDGL,396,NP_000422.1.csv,refseq-MVK-NM_000431.3_clinical_seed_0_final,refseq-MVK-NM_000431.3.a2m,Invitae,refseq-MVK-NM_000431.3.npy,1,396,396
+NP_000424.2,MSLVEAISLWNEGVLAADKKDWKGALDAFSAVQDPHSRICFNIGCMYTILKNMTEAEKAFTRSINRDKHLAVAYFQRGMLYYQTEKYDLAIKDLKEALIQLRGNQLIDYKILGLQFKLFACEVLYNIAFMYAKKEEWKKAEEQLALATSMKSEPRHSKIDKAMECVWKQKLYEPVVIPVGKLFRPNERQVAQLAKKDYLGKATVVASVVDQDSFSGFAPLQPQAAEPPPRPKTPEIFRALEGEAHRVLFGFVPETKEELQVMPGNIVFVLKKGNDNWATVMFNGQKGLVPCNYLEPVELRIHPQQQPQEESSPQSDIPAPPSSKAPGRPQLSPGQKQKEEPKEVKLSVPMPYTLKVHYKYTVVMKTQPGLPYSQVRDMVSKKLELRLEHTKLSYRPRDSNELVPLSEDSMKDAWGQVKNYCLTLWCENTVGDQGFPDEPKESEKADANNQTTEPQLKKGSQVEALFSYEATQPEDLEFQEGDIILVLSKVNEEWLEGECKGKVGIFPKVFVEDCATTDLESTRREV,526,NP_000424.2.csv,refseq-NCF2-NM_000433.3_clinical_seed_0_final,refseq-NCF2-NM_000433.3.a2m,Invitae,refseq-NCF2-NM_000433.3.npy,1,526,526
+NP_000425.1,MTGERPSTALPDRRWGPRILGFWGGCRVWVFAAIFLLLSLAASWSKAENDFGLVQPLVTMEQLLWVSGRQIGSVDTFRIPLITATPRGTLLAFAEARKMSSSDEGAKFIALRRSMDQGSTWSPTAFIVNDGDVPDGLNLGAVVSDVETGVVFLFYSLCAHKAGCQVASTMLVWSKDDGVSWSTPRNLSLDIGTEVFAPGPGSGIQKQREPRKGRLIVCGHGTLERDGVFCLLSDDHGASWRYGSGVSGIPYGQPKQENDFNPDECQPYELPDGSVVINARNQNNYHCHCRIVLRSYDACDTLRPRDVTFDPELVDPVVAAGAVVTSSGIVFFSNPAHPEFRVNLTLRWSFSNGTSWRKETVQLWPGPSGYSSLATLEGSMDGEEQAPQLYVLYEKGRNHYTESISVAKISVYGTL,415,NP_000425.1.csv,refseq-NEU1-NM_000434.3_clinical_seed_0_final,refseq-NEU1-NM_000434.3.a2m,Invitae,refseq-NEU1-NM_000434.3.npy,1,415,415
+NP_000427.1,MAALKLLSSGLRLCASARGSGATWYKGCVCSFSTSAHRHTKFYTDPVEAVKDIPDGATVLVGGFGLCGIPENLIDALLKTGVKGLTAVSNNAGVDNFGLGLLLRSKQIKRMVSSYVGENAEFERQYLSGELEVELTPQGTLAERIRAGGAGVPAFYTPTGYGTLVQEGGSPIKYNKDGSVAIASKPREVREFNGQHFILEEAITGDFALVKAWKADRAGNVIFRKSARNFNLPMCKAAETTVVEVEEIVDIGAFAPEDIHIPQIYVHRLIKGEKYEKRIERLSIRKEGDGEAKSAKPGDDVRERIIKRAALEFEDGMYANLGIGIPLLASNFISPNITVHLQSENGVLGLGPYPRQHEADADLINAGKETVTILPGASFFSSDESFAMIRGGHVDLTMLGAMQVSKYGDLANWMIPGKMVKGMGGAMDLVSSAKTKVVVTMEHSAKGNAHKIMEKCTLPLTGKQCVNRIITEKAVFDVDKKKGLTLIELWEGLTVDDVQKSTGCDFAVSPKLMPMQQIAN,520,NP_000427.1.csv,refseq-OXCT1-NM_000436.4_clinical_seed_0_final,refseq-OXCT1-NM_000436.4.a2m,Invitae,refseq-OXCT1-NM_000436.4_theta_0.2.npy,1,520,520
+NP_000430.3,MERRAWSLQCTAFVLFCAWCALNSAKAKRQFVNEWAAEIPGGPEAASAIAEELGYDLLGQIGSLENHYLFKHKNHPRRSRRSAFHITKRLSDDDRVIWAEQQYEKERSKRSALRDSALNLFNDPMWNQQWYLQDTRMTAALPKLDLHVIPVWQKGITGKGVVITVLDDGLEWNHTDIYANYDPEASYDFNDNDHDPFPRYDPTNENKHGTRCAGEIAMQANNHKCGVGVAYNSKVGGIRMLDGIVTDAIEASSIGFNPGHVDIYSASWGPNDDGKTVEGPGRLAQKAFEYGVKQGRQGKGSIFVWASGNGGRQGDNCDCDGYTDSIYTISISSASQQGLSPWYAEKCSSTLATSYSSGDYTDQRITSADLHNDCTETHTGTSASAPLAAGIFALALEANPNLTWRDMQHLVVWTSEYDPLANNPGWKKNGAGLMVNSRFGFGLLNAKALVDLADPRTWRSVPEKKECVVKDNDFEPRALKANGEVIIEIPTRACEGQENAIKSLEHVQFEATIEYSRRGDLHVTLTSAAGTSTVLLAERERDTSPNGFKNWDFMSVHTWGENPIGTWTLRITDMSGRIQNEGRIVNWKLILHGTSSQPEHMKQPRVYTSYNTVQNDRRGVEKMVDPGEEQPTQENPKENTLVSKSPSSSSVGGRRDELEEGAPSQAMLRLLQSAFSKNSPPKQSPKKSPSAKLNIPYENFYEALEKLNKPSQLKDSEDSLYNDYVDVFYNTKPYKHRDDRLLQALVDILNEEN,753,NP_000430.3.csv,refseq-PCSK1-NM_000439.4_clinical_seed_0_final,refseq-PCSK1-NM_000439.4.a2m,Invitae,refseq-PCSK1-NM_000439.4.npy,1,753,753
+NP_000431.2,MGEVTAEEVEKFLDSNIGFAKQYYNLHYRAKLISDLLGAKEAAVDFSNYHSPSSMEESEIIFDLLRDFQENLQTEKCIFNVMKKLCFLLQADRMSLFMYRTRNGIAELATRLFNVHKDAVLEDCLVMPDQEIVFPLDMGIVGHVAHSKKIANVPNTEEDEHFCDFVDILTEYKTKNILASPIMNGKDVVAIIMAVNKVDGSHFTKRDEEILLKYLNFANLIMKVYHLSYLHNCETRRGQILLWSGSKVFEELTDIERQFHKALYTVRAFLNCDRYSVGLLDMTKQKEFFDVWPVLMGEVPPYSGPRTPDGREINFYKVIDYILHGKEDIKVIPNPPPDHWALVSGLPAYVAQNGLICNIMNAPAEDFFAFQKEPLDESGWMIKNVLSMPIVNKKEEIVGVATFYNRKDGKPFDEMDETLMESLTQFLGWSVLNPDTYESMNKLENRKDIFQDIVKYHVKCDNEEIQKILKTREVYGKEPWECEEEELAEILQAELPDADKYEINKFHFSDLPLTELELVKCGIQMYYELKVVDKFHIPQEALVRFMYSLSKGYRKITYHNWRHGFNVGQTMFSLLVTGKLKRYFTDLEALAMVTAAFCHDIDHRGTNNLYQMKSQNPLAKLHGSSILERHHLEFGKTLLRDESLNIFQNLNRRQHEHAIHMMDIAIIATDLALYFKKRTMFQKIVDQSKTYESEQEWTQYMMLEQTRKEIVMAMMMTACDLSAITKPWEVQSQVALLVAAEFWEQGDLERTVLQQNPIPMMDRNKADELPKLQVGFIDFVCTFVYKEFSRFHEEITPMLDGITNNRKEWKALADEYDAKMKVQEEKKQKQQSAKSAAAGNQPGGNPSPGGATTSKSCCIQ,860,NP_000431.2.csv,refseq-PDE6A-NM_000440.2_clinical_seed_0_final,refseq-PDE6A-NM_000440.2.a2m,Invitae,refseq-PDE6A-NM_000440.2.npy,1,860,860
+NP_000432.1,MAAPGGRSEPPQLPEYSCSYMVSRPVYSELAFQQQHERRLQERKTLRESLAKCCSCSRKRAFGVLKTLVPILEWLPKYRVKEWLLSDVISGVSTGLVATLQGMAYALLAAVPVGYGLYSAFFPILTYFIFGTSRHISVGPFPVVSLMVGSVVLSMAPDEHFLVSSSNGTVLNTTMIDTAARDTARVLIASALTLLVGIIQLIFGGLQIGFIVRYLADPLVGGFTTAAAFQVLVSQLKIVLNVSTKNYNGVLSIIYTLVEIFQNIGDTNLADFTAGLLTIVVCMAVKELNDRFRHKIPVPIPIEVIVTIIATAISYGANLEKNYNAGIVKSIPRGFLPPELPPVSLFSEMLAASFSIAVVAYAIAVSVGKVYATKYDYTIDGNQEFIAFGISNIFSGFFSCFVATTALSRTAVQESTGGKTQVAGIISAAIVMIAILALGKLLEPLQKSVLAAVVIANLKGMFMQLCDIPRLWRQNKIDAVIWVFTCIVSIILGLDLGLLAGLIFGLLTVVLRVQFPSWNGLGSIPSTDIYKSTKNYKNIEEPQGVKILRFSSPIFYGNVDGFKKCIKSTVGFDAIRVYNKRLKALRKIQKLIKSGQLRATKNGIISDAVSTNNAFEPDEDIEDLEELDIPTKEIEIQVDWNSELPVKVNVPKVPIHSLVLDCGAISFLDVVGVRSLRVIVKEFQRIDVNVYFASLQDYVIEKLEQCGFFDDNIRKDTFFLTVHDAILYLQNQVKSQEGQGSILETITLIQDCKDTLELIETELTEEELDVQDEAMRTLAS,780,NP_000432.1.csv,refseq-SLC26A4-NM_000441.1_clinical_seed_0_final,refseq-SLC26A4-NM_000441.1.a2m,Invitae,refseq-SLC26A4-NM_000441.1.npy,1,780,780
+NP_000434.1,MDLEAAKNGTAWRPTSAEGDFELGISSKQKRKKTKTVKMIGVLTLFRYSDWQDKLFMSLGTIMAIAHGSGLPLMMIVFGEMTDKFVDTAGNFSFPVNFSLSLLNPGKILEEEMTRYAYYYSGLGAGVLVAAYIQVSFWTLAAGRQIRKIRQKFFHAILRQEIGWFDINDTTELNTRLTDDISKISEGIGDKVGMFFQAVATFFAGFIVGFIRGWKLTLVIMAISPILGLSAAVWAKILSAFSDKELAAYAKAGAVAEEALGAIRTVIAFGGQNKELERYQKHLENAKEIGIKKAISANISMGIAFLLIYASYALAFWYGSTLVISKEYTIGNAMTVFFSILIGAFSVGQAAPCIDAFANARGAAYVIFDIIDNNPKIDSFSERGHKPDSIKGNLEFNDVHFSYPSRANVKILKGLNLKVQSGQTVALVGSSGCGKSTTVQLIQRLYDPDEGTINIDGQDIRNFNVNYLREIIGVVSQEPVLFSTTIAENICYGRGNVTMDEIKKAVKEANAYEFIMKLPQKFDTLVGERGAQLSGGQKQRIAIARALVRNPKILLLDEATSALDTESEAEVQAALDKAREGRTTIVIAHRLSTVRNADVIAGFEDGVIVEQGSHSELMKKEGVYFKLVNMQTSGSQIQSEEFELNDEKAATRMAPNGWKSRLFRHSTQKNLKNSQMCQKSLDVETDGLEANVPPVSFLKVLKLNKTEWPYFVVGTVCAIANGGLQPAFSVIFSEIIAIFGPGDDAVKQQKCNIFSLIFLFLGIISFFTFFLQGFTFGKAGEILTRRLRSMAFKAMLRQDMSWFDDHKNSTGALSTRLATDAAQVQGATGTRLALIAQNIANLGTGIIISFIYGWQLTLLLLAVVPIIAVSGIVEMKLLAGNAKRDKKELEAAGKIATEAIENIRTVVSLTQERKFESMYVEKLYGPYRNSVQKAHIYGITFSISQAFMYFSYAGCFRFGAYLIVNGHMRFRDVILVFSAIVFGAVALGHASSFAPDYAKAKLSAAHLFMLFERQPLIDSYSEEGLKPDKFEGNITFNEVVFNYPTRANVPVLQGLSLEVKKGQTLALVGSSGCGKSTVVQLLERFYDPLAGTVLLDGQEAKKLNVQWLRAQLGIVSQEPILFDCSIAENIAYGDNSRVVSQDEIVSAAKAANIHPFIETLPHKYETRVGDKGTQLSGGQKQRIAIARALIRQPQILLLDEATSALDTESEKVVQEALDKAREGRTCIVIAHRLSTIQNADLIVVFQNGRVKEHGTHQQLLAQKGIYFSMVSVQAGTQNL,1279,NP_000434.1.csv,refseq-ABCB4-NM_000443.3_clinical_seed_0_final,refseq-ABCB4-NM_000443.3.a2m,Invitae,refseq-ABCB4-NM_000443.3.npy,1,1279,1279
+NP_000435.3,MEAETGSSVETGKKANRGTRIALVVFVGGTLVLGTILFLVSQGLLSLQAKQEYCLKPECIEAAAAILSKVNLSVDPCDNFFRFACDGWISNNPIPEDMPSYGVYPWLRHNVDLKLKELLEKSISRRRDTEAIQKAKILYSSCMNEKAIEKADAKPLLHILRHSPFRWPVLESNIGPEGVWSERKFSLLQTLATFRGQYSNSVFIRLYVSPDDKASNEHILKLDQATLSLAVREDYLDNSTEAKSYRDALYKFMVDTAVLLGANSSRAEHDMKSVLRLEIKIAEIMIPHENRTSEAMYNKMNISELSAMIPQFDWLGYIKKVIDTRLYPHLKDISPSENVVVRVPQYFKDLFRILGSERKKTIANYLVWRMVYSRIPNLSRRFQYRWLEFSRVIQGTTTLLPQWDKCVNFIESALPYVVGKMFVDVYFQEDKKEMMEELVEGVRWAFIDMLEKENEWMDAGTKRKAKEKARAVLAKVGYPEFIMNDTHVNEDLKAIKFSEADYFGNVLQTRKYLAQSDFFWLRKAVPKTEWFTNPTTVNAFYSASTNQIRFPAGELQKPFFWGTEYPRSLSYGAIGVIVGHEFTHGFDNNGRKYDKNGNLDPWWSTESEEKFKEKTKCMINQYSNYYWKKAGLNVKGKRTLGENIADNGGLREAFRAYRKWINDRRQGLEEPLLPGITFTNNQLFFLSYAHVRCNSYRPEAAREQVQIGAHSPPQFRVNGAISNFEEFQKAFNCPPNSTMNRGMDSCRLW,749,NP_000435.3.csv,refseq-PHEX-NM_000444.5_clinical_seed_0_final,refseq-PHEX-NM_000444.5.a2m,Invitae,refseq-PHEX-NM_000444.5.npy,1,749,749
+NP_000438.2,MLTFMASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGENTAQWRSQENEEDGEEDPDRYVCSGVPGRPPGLEEELTLKYGAKHVIMLFVPVTLCMIVVVATIKSVRFYTEKNGQLIYTPFTEDTPSVGQRLLNSVLNTLIMISVIVVMTIFLVVLYKYRCYKFIHGWLIMSSLMLLFLFTYIYLGEVLKTYNVAMDYPTLLLTVWNFGAVGMVCIHWKGPLVLQQAYLIMISALMALVFIKYLPEWSAWVILGAISVYDLVAVLCPKGPLRMLVETAQERNEPIFPALIYSSAMVWTVGMAKLDPSSQGALQLPYDPEMEEDSYDSFGEPSYPEVFEPPLTGYPGEELEEEEERGVKLGLGDFIFYSVLVGKAAATGSGDWNTTLACFVAILIGLCLTLLLLAVFKKALPALPISITFGLIFYFSTDNLVRPFMDTLASHQLYI,448,NP_000438.2.csv,refseq-PSEN2-NM_000447.2_clinical_seed_0_final,refseq-PSEN2-NM_000447.2.a2m,Invitae,refseq-PSEN2-NM_000447.2.npy,1,448,448
+NP_000439.2,MAASFPPTLGLSSAPDEIQHPHIKFSEWKFKLFRVRSFEKTPEEAQKEKKDSFEGKPSLEQSPAVLDKADGQKPVPTQPLLKAHPKFSKKFHDNEKARGKAIHQANLRHLCRICGNSFRADEHNRRYPVHGPVDGKTLGLLRKKEKRATSWPDLIAKVFRIDVKADVDSIHPTEFCHNCWSIMHRKFSSAPCEVYFPRNVTMEWHPHTPSCDICNTARRGLKRKSLQPNLQLSKKLKTVLDQARQARQHKRRAQARISSKDVMKKIANCSKIHLSTKLLAVDFPEHFVKSISCQICEHILADPVETNCKHVFCRVCILRCLKVMGSYCPSCRYPCFPTDLESPVKSFLSVLNSLMVKCPAKECNEEVSLEKYNHHISSHKESKEIFVHINKGGRPRQHLLSLTRRAQKHRLRELKLQVKAFADKEEGGDVKSVCMTLFLLALRARNEHRQADELEAIMQGKGSGLQPAVCLAIRVNTFLSCSQYHKMYRTVKAITGRQIFQPLHALRNAEKVLLPGYHHFEWQPPLKNVSSSTDVGIIDGLSGLSSSVDDYPVDTIAKRFRYDSALVSALMDMEEDILEGMRSQDLDDYLNGPFTVVVKESCDGMGDVSEKHGSGPVVPEKAVRFSFTIMKITIAHSSQNVKVFEEAKPNSELCCKPLCLMLADESDHETLTAILSPLIAEREAMKSSELMLELGGILRTFKFIFRGTGYDEKLVREVEGLEASGSVYICTLCDATRLEASQNLVFHSITRSHAENLERYEVWRSNPYHESVEELRDRVKGVSAKPFIETVPSIDALHCDIGNAAEFYKIFQLEIGEVYKNPNASKEERKRWQATLDKHLRKKMNLKPIMRMNGNFARKLMTKETVDAVCELIPSEERHEALRELMDLYLKMKPVWRSSCPAKECPESLCQYSFNSQRFAELLSTKFKYRYEGKITNYFHKTLAHVPEIIERDGSIGAWASEGNESGNKLFRRFRKMNARQSKCYEMEDVLKHHWLYTSKYLQKFMNAHNALKTSGFTMNPQASLGDPLGIEDSLESQDSMEF,1043,NP_000439.2.csv,refseq-RAG1-NM_000448.3_clinical_seed_0_final,refseq-RAG1-NM_000448.3.a2m,Invitae,refseq-RAG1-NM_000448.3.npy,1,1043,1043
+NP_000444.1,MEAVETGERPTFGAWDYGVFALMLLVSTGIGLWVGLARGGQRSAEDFFTGGRRLAALPVGLSLSASFMSAVQVLGVPSEAYRYGLKFLWMCLGQLLNSVLTALLFMPVFYRLGLTSTYEYLEMRFSRAVRLCGTLQYIVATMLYTGIVIYAPALILNQVTGLDIWASLLSTGIICTFYTAVGGMKAVVWTDVFQVVVMLSGFWVVLARGVMLVGGPRQVLTLAQNHSRINLMDFNPDPRSRYTFWTFVVGGTLVWLSMYGVNQAQVQRYVACRTEKQAKLALLINQVGLFLIVSSAACCGIVMFVFYTDCDPLLLGRISAPDQYMPLLVLDIFEDLPGVPGLFLACAYSGTLSTASTSINAMAAVTVEDLIKPRLRSLAPRKLVIISKGLSLIYGSACLTVAALSSLLGGGVLQGSFTVMGVISGPLLGAFILGMFLPACNTPGVLAGLGAGLALSLWVALGATLYPPSEQTMRVLPSSAARCVALSVNASGLLDPALLPANDSSRAPSSGMDASRPALADSFYAISYLYYGALGTLTTVLCGALISCLTGPTKRSTLAPGLLWWDLARQTASVAPKEEVAILDDNLVKGPEELPTGNKKPPGFLPTNEDRLFFLGQKELEGAGSWTPCVGHDGGRDQQETNL,643,NP_000444.1.csv,refseq-SLC5A5-NM_000453.2_clinical_seed_0_final,refseq-SLC5A5-NM_000453.2.a2m,Invitae,refseq-SLC5A5-NM_000453.2.npy,1,643,643
+NP_000446.1,MEVVDPQQLGMFTEGELMSVGMDTFIHRIDSTEVIYQPRRKRAKLIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAVKILKKKKLRRIPNGEANVKKEIQLLRRLRHKNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFCQLIDGLEYLHSQGIVHKDIKPGNLLLTTGGTLKISDLGVAEALHPFAADDTCRTSQGSPAFQPPEIANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGKGSYAIPGDCGPPLSDLLKGMLEYEPAKRFSIRQIRQHSWFRKKHPPAEAPVPIPPSPDTKDRWRSMTVVPYLEDLHGADEDEDLFDIEDDIIYTQDFTVPGQVPEEEASHNGQRRGLPKAVCMNGTEAAQLSTKSRAEGRAPNPARKACSASSKIRRLSACKQQ,433,NP_000446.1.csv,refseq-STK11-NM_000455.4_clinical_seed_0_final,refseq-STK11-NM_000455.4.a2m,Invitae,refseq-STK11-NM_000455.4.npy,1,433,433
+NP_000447.2,MLLLHRAVVLRLQQACRLKSIPSRICIQACSTNDSFQPQRPSLTFSGDNSSTQGWRVMGTLLGLGAVLAYQDHRCRAAQESTHIYTKEEVSSHTSPETGIWVTLGSEVFDVTEFVDLHPGGPSKLMLAAGGPLEPFWALYAVHNQSHVRELLAQYKIGELNPEDKVAPTVETSDPYADDPVRHPALKVNSQRPFNAEPPPELLTENYITPNPIFFTRNHLPVPNLDPDTYRLHVVGAPGGQSLSLSLDDLHNFPRYEITVTLQCAGNRRSEMTQVKEVKGLEWRTGAISTARWAGARLCDVLAQAGHQLCETEAHVCFEGLDSDPTGTAYGASIPLARAMDPEAEVLLAYEMNGQPLPRDHGFPVRVVVPGVVGARHVKWLGRVSVQPEESYSHWQRRDYKGFSPSVDWETVDFDSAPSIQELPVQSAITEPRDGETVESGEVTIKGYAWSGGGRAVIRVDVSLDGGLTWQVAKLDGEEQRPRKAWAWRLWQLKAPVPAGQKELNIVCKAVDDGYNVQPDTVAPIWNLRGVLSNAWHRVHVYVSP,545,NP_000447.2.csv,refseq-SUOX-NM_000456.2_clinical_seed_0_final,refseq-SUOX-NM_000456.2.a2m,Invitae,refseq-SUOX-NM_000456.2.npy,1,545,545
+NP_000449.1,MVSKLTSLQQELLSALLSSGVTKEVLVQALEELLPSPNFGVKLETLPLSPGSGAEPDTKPVFHTLTNGHAKGRLSGDEGSEDGDDYDTPPILKELQALNTEEAAEQRAEVDRMLSEDPWRAAKMIKGYMQQHNIPQREVVDVTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREILRQFNQTVQSSGNMTDKSSQDQLLFLFPEFSQQSHGPGQSDDACSEPTNKKMRRNRFKWGPASQQILYQAYDRQKNPSKEEREALVEECNRAECLQRGVSPSKAHGLGSNLVTEVRVYNWFANRRKEEAFRQKLAMDAYSSNQTHSLNPLLSHGSPHHQPSSSPPNKLSGVRYSQQGNNEITSSSTISHHGNSAMVTSQSVLQQVSPASLDPGHNLLSPDGKMISVSGGGLPPVSTLTNIHSLSHHNPQQSQNLIMTPLSGVMAIAQSLNTSQAQSVPVINSVAGSLAALQPVQFSQQLHSPHQQPLMQQSPGSHMAQQPFMAAVTQLQNSHMYAHKQEPPQYSHTSRFPSAMVVTDTSSISTLTNMSSSKQCPLQAW,557,NP_000449.1.csv,refseq-HNF1B-NM_000458.2_clinical_seed_0_final,refseq-HNF1B-NM_000458.2.a2m,Invitae,refseq-HNF1B-NM_000458.2.npy,1,557,557
+NP_000450.3,MDSLASLVLCGVSLLLSGTVEGAMDLILINSLPLVSDAETSLTCIASGWRPHEPITIGRDFEALMNQHQDPLEVTQDVTREWAKKVVWKREKASKINGAYFCEGRVRGEAIRIRTMKMRQQASFLPATLTMTVDKGDNVNISFKKVLIKEEDAVIYKNGSFIHSVPRHEVPDILEVHLPHAQPQDAGVYSARYIGGNLFTSAFTRLIVRRCEAQKWGPECNHLCTACMNNGVCHEDTGECICPPGFMGRTCEKACELHTFGRTCKERCSGQEGCKSYVFCLPDPYGCSCATGWKGLQCNEACHPGFYGPDCKLRCSCNNGEMCDRFQGCLCSPGWQGLQCEREGIQRMTPKIVDLPDHIEVNSGKFNPICKASGWPLPTNEEMTLVKPDGTVLHPKDFNHTDHFSVAIFTIHRILPPDSGVWVCSVNTVAGMVEKPFNISVKVLPKPLNAPNVIDTGHNFAVINISSEPYFGDGPIKSKKLLYKPVNHYEAWQHIQVTNEIVTLNYLEPRTEYELCVQLVRRGEGGEGHPGPVRRFTTASIGLPPPRGLNLLPKSQTTLNLTWQPIFPSSEDDFYVEVERRSVQKSDQQNIKVPGNLTSVLLNNLHPREQYVVRARVNTKAQGEWSEDLTAWTLSDILPPQPENIKISNITHSSAVISWTILDGYSISSITIRYKVQGKNEDQHVDVKIKNATITQYQLKGLEPETAYQVDIFAENNIGSSNPAFSHELVTLPESQAPADLGGGKMLLIAILGSAGMTCLTVLLAFLIILQLKRANVQRRMAQAFQNVREEPAVQFNSGTLALNRKVKNNPDPTIYPVLDWNDIKFQDVIGEGNFGQVLKARIKKDGLRMDAAIKRMKEYASKDDHRDFAGELEVLCKLGHHPNIINLLGACEHRGYLYLAIEYAPHGNLLDFLRKSRVLETDPAFAIANSTASTLSSQQLLHFAADVARGMDYLSQKQFIHRDLAARNILVGENYVAKIADFGLSRGQEVYVKKTMGRLPVRWMAIESLNYSVYTTNSDVWSYGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRLEKPLNCDDEVYDLMRQCWREKPYERPSFAQILVSLNRMLEERKTYVNTTLYEKFTYAGIDCSAEEAA,1124,NP_000450.3.csv,refseq-TEK-NM_000459.5_clinical_seed_0_final,refseq-TEK-NM_000459.5.a2m,Invitae,refseq-TEK-NM_000459.5.npy,1,1124,1124
+NP_000451.1,MELTELLLVVMLLLTARLTLSSPAPPACDLRVLSKLLRDSHVLHSRLSQCPEVHPLPTPVLLPAVDFSLGEWKTQMEETKAQDILGAVTLLLEGVMAARGQLGPTCLSSLLGQLSGQVRLLLGALQSLLGTQLPPQGRTTAHKDPNAIFLSFQHLLRGKVRFLMLVGGSTLCVRRAPPTTAVPSRTSLVLTLNELPNRTSGLLETNFTASARTTGSGLLKWQQGFRAKIPGLLNQTSRSLDQIPGYLNRIHELLNGTRGLFPGPSRRTLGAPDISSGTSDTGSLPPNLQPGYSPSPTHPPTGQYTLFPLPPTLPTPVVQLHPLLPDPSAPTPTPTSPLLNTSYTHSQNLSQEG,353,NP_000451.1.csv,refseq-THPO-NM_000460.3_clinical_seed_0_final,refseq-THPO-NM_000460.3.a2m,Invitae,refseq-THPO-NM_000460.3.npy,1,353,353
+NP_000452.2,MTPNSMTENGLTAWDKPKHCPDREHDWKLVGMSEACLHRKSHSERRSTLKNEQSSPHLIQTTWTSSIFHLDHDDVNDQSVSSAQTFQTEEKKCKGYIPSYLDKDELCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTRNQCQECRFKKCIYVGMATDLVLDDSKRLAKRKLIEENREKRRREELQKSIGHKPEPTDEEWELIKTVTEAHVATNAQGSHWKQKRKFLPEDIGQAPIVNAPEGGKVDLEAFSHFTKIITPAITRVVDFAKKLPMFCELPCEDQIILLKGCCMEIMSLRAAVRYDPESETLTLNGEMAVTRGQLKNGGLGVVSDAIFDLGMSLSSFNLDDTEVALLQAVLLMSSDRPGLACVERIEKYQDSFLLAFEHYINYRKHHVTHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTELFPPLFLEVFED,461,NP_000452.2.csv,refseq-THRB-NM_000461.4_clinical_seed_0_final,refseq-THRB-NM_000461.4.a2m,Invitae,refseq-THRB-NM_000461.4.npy,1,461,461
+NP_000456.2,MPDNRQPRNRQPRIRSGNEPRSAPAMEPDGRGAWAHSRAALDRLEKLLRCSRCTNILREPVCLGGCEHIFCSNCVSDCIGTGCPVCYTPAWIQDLKINRQLDSMIQLCSKLRNLLHDNELSDLKEDKPRKSLFNDAGNKKNSIKMWFSPRSKKVRYVVSKASVQTQPAIKKDASAQQDSYEFVSPSPPADVSERAKKASARSGKKQKKKTLAEINQKWNLEAEKEDGEFDSKEESKQKLVSFCSQPSVISSPQINGEIDLLASGSLTESECFGSLTEVSLPLAEQIESPDTKSRNEVVTPEKVCKNYLTSKKSLPLENNGKRGHHNRLSSPISKRCRTSILSTSGDFVKQTVPSENIPLPECSSPPSCKRKVGGTSGRKNSNMSDEFISLSPGTPPSTLSSSSYRRVMSSPSAMKLLPNMAVKRNHRGETLLHIASIKGDIPSVEYLLQNGSDPNVKDHAGWTPLHEACNHGHLKVVELLLQHKALVNTTGYQNDSPLHDAAKNGHVDIVKLLLSYGASRNAVNIFGLRPVDYTDDESMKSLLLLPEKNESSSASHCSVMNTGQRRDGPLVLIGSGLSSEQQKMLSELAVILKAKKYTEFDSTVTHVVVPGDAVQSTLKCMLGILNGCWILKFEWVKACLRRKVCEQEEKYEIPEGPRRSRLNREQLLPKLFDGCYFYLWGTFKHHPKDNLIKLVTAGGGQILSRKPKPDSDVTQTINTVAYHARPDSDQRFCTQYIIYEDLCNYHPERVRQGKVWKAPSSWFIDCVMSFELLPLDS,777,NP_000456.2.csv,refseq-BARD1-NM_000465.3_clinical_seed_0_final,refseq-BARD1-NM_000465.3.a2m,Invitae,refseq-BARD1-NM_000465.3.npy,1,777,777
+NP_000457.1,MWGSDRLAGAGGGGAAVTVAFTNARDCFLHLPRRLVAQLHLLQNQAIEVVWSHQPAFLSWVEGRHFSDQGENVAEINRQVGQKLGLSNGGQVFLKPCSHVVSCQQVEVEPLSADDWEILELHAVSLEQHLLDQIRIVFPKAIFPVWVDQQTYIFIQIVALIPAASYGRLETDTKLLIQPKTRRAKENTFSKADAEYKKLHSYGRDQKGMMKELQTKQLQSNTVGITESNENESEIPVDSSSVASLWTMIGSIFSFQSEKKQETSWGLTEINAFKNMQSKVVPLDNIFRVCKSQPPSIYNASATSVFHKHCAIHVFPWDQEYFDVEPSFTVTYGKLVKLLSPKQQQSKTKQNVLSPEKEKQMSEPLDQKKIRSDHNEEDEKACVLQVVWNGLEELNNAIKYTKNVEVLHLGKVWIPDDLRKRLNIEMHAVVRITPVEVTPKIPRSLKLQPRENLPKDISEEDIKTVFYSWLQQSTTTMLPLVISEEEFIKLETKDGLKEFSLSIVHSWEKEKDKNIFLLSPNLLQKTTIQVLLDPMVKEENSEEIDFILPFLKLSSLGGVNSLGVSSLEHITHSLLGRPLSRQLMSLVAGLRNGALLLTGGKGSGKSTLAKAICKEAFDKLDAHVERVDCKALRGKRLENIQKTLEVAFSEAVWMQPSVVLLDDLDLIAGLPAVPEHEHSPDAVQSQRLAHALNDMIKEFISMGSLVALIATSQSQQSLHPLLVSAQGVHIFQCVQHIQPPNQEQRCEILCNVIKNKLDCDINKFTDLDLQHVAKETGGFVARDFTVLVDRAIHSRLSRQSISTREKLVLTTLDFQKALRGFLPASLRSVNLHKPRDLGWDKIGGLHEVRQILMDTIQLPAKYPELFANLPIRQRTGILLYGPPGTGKTLLAGVIARESRMNFISVKGPELLSKYIGASEQAVRDIFIRAQAAKPCILFFDEFESIAPRRGHDNTGVTDRVVNQLLTQLDGVEGLQGVYVLAATSRPDLIDPALLRPGRLDKCVYCPPPDQVSRLEILNVLSDSLPLADDVDLQHVASVTDSFTGADLKALLYNAQLEALHGMLLSSGLQDGSSSSDSDLSLSSMVFLNHSSGSDDSAGDGECGLDQSLVSLEMSEILPDESKFNMYRLYFGSSYESELGNGTSSDLSSQCLSAPSSMTQDLPGVPGKDQLFSQPPVLRTASQEGCQELTQEQRDQLRADISIIKGRYRSQSGEDESMNQPGPIKTRLAISQSHLMTALGHTRPSISEDDWKNFAELYESFQNPKRRKNQSGTMFRPGQKVTLA,1283,NP_000457.1.csv,refseq-PEX1-NM_000466.2_clinical_seed_0_final,refseq-PEX1-NM_000466.2.a2m,Invitae,refseq-PEX1-NM_000466.2.npy,1,1283,1283
+NP_000465.1,MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH,202,NP_000465.1.csv,refseq-TWIST1-NM_000474.3_clinical_seed_0_final,refseq-TWIST1-NM_000474.3.a2m,Invitae,refseq-TWIST1-NM_000474.3.npy,1,202,202
+NP_000466.2,MAGENHQWQGSILYNMLMSAKQTRAAPEAPETRLVDQCWGCSCGDEPGVGREGLLGGRNVALLYRCCFCGKDHPRQGSILYSMLTSAKQTYAAPKAPEATLGPCWGCSCGSDPGVGRAGLPGGRPVALLYRCCFCGEDHPRQGSILYSLLTSSKQTHVAPAAPEARPGGAWWDRSYFAQRPGGKEALPGGRATALLYRCCFCGEDHPQQGSTLYCVPTSTNQAQAAPEERPRAPWWDTSSGALRPVALKSPQVVCEAASAGLLKTLRFVKYLPCFQVLPLDQQLVLVRNCWASLLMLELAQDRLQFETVEVSEPSMLQKILTTRRRETGGNEPLPVPTLQHHLAPPAEARKVPSASQVQAIKCFLSKCWSLNISTKEYAYLKGTVLFNPDVPGLQCVKYIQGLQWGTQQILSEHTRMTHQGPHDRFIELNSTLFLLRFINANVIAELFFRPIIGTVSMDDMMLEMLCTKI,470,NP_000466.2.csv,refseq-NR0B1-NM_000475.4_clinical_seed_0_final,refseq-NR0B1-NM_000475.4.a2m,Invitae,refseq-NR0B1-NM_000475.4.npy,1,470,470
+NP_000467.1,MEEKLKKTKIIFVVGGPGSGKGTQCEKIVQKYGYTHLSTGDLLRSEVSSGSARGKKLSEIMEKGQLVPLETVLDMLRDAMVAKVNTSKGFLIDGYPREVQQGEEFERRIGQPTLLLYVDAGPETMTQRLLKRGETSGRVDDNEETIKKRLETYYKATEPVIAFYEKRGIVRKVNAEGSVDSVFSQVCTHLDALK,194,NP_000467.1.csv,refseq-AK1-NM_000476.3_clinical_seed_0_final,refseq-AK1-NM_000476.3.a2m,Invitae,refseq-AK1-NM_000476.3.npy,1,194,194
+NP_000470.3,MRDLPLTSLALVLSALGALLGTEALRAEEPAVGTSGLIFREDLDWPPGSPQEPLCLVALGGDSNGSSSPLRVVGALSAYEQAFLGAVQRARWGPRDLATFGVCNTGDRQAALPSLRRLGAWLRDPGGQRLVVLHLEEVTWEPTPSLRFQEPPPGGAGPPELALLVLYPGPGPEVTVTRAGLPGAQSLCPSRDTRYLVLAVDRPAGAWRGSGLALTLQPRGEDSRLSTARLQALLFGDDHRCFTRMTPALLLLPRSEPAPLPAHGQLDTVPFPPPRPSAELEESPPSADPFLETLTRLVRALRVPPARASAPRLALDPDALAGFPQGLVNLSDPAALERLLDGEEPLLLLLRPTAATTGDPAPLHDPTSAPWATALARRVAAELQAAAAELRSLPGLPPATAPLLARLLALCPGGPGGLGDPLRALLLLKALQGLRVEWRGRDPRGPGRAQRSAGATAADGPCALRELSVDLRAERSVLIPETYQANNCQGVCGWPQSDRNPRYGNHVVLLLKMQVRGAALARPPCCVPTAYAGKLLISLSEERISAHHVPNMVATECGCR,560,NP_000470.3.csv,refseq-AMH-NM_000479.5_clinical_seed_0_final,refseq-AMH-NM_000479.5.a2m,Invitae,refseq-AMH-NM_000479.5_theta_0.2.npy,1,560,560
+NP_000472.2,MQRAVSVVARLGFRLQAFPPALCRPLSCAQEVLRRTPLYDFHLAHGGKMVAFAGWSLPVQYRDSHTDSHLHTRQHCSLFDVSHMLQTKILGSDRVKLMESLVVGDIAELRPNQGTLSLFTNEAGGILDDLIVTNTSEGHLYVVSNAGCWEKDLALMQDKVRELQNQGRDVGLEVLDNALLALQGPTAAQVLQAGVADDLRKLPFMTSAVMEVFGVSGCRVTRCGYTGEDGVEISVPVAGAVHLATAILKNPEVKLAGLAARDSLRLEAGLCLYGNDIDEHTTPVEGSLSWTLGKRRRAAMDFPGAKVIVPQLKGRVQRRRVGLMCEGAPMRAHSPILNMEGTKIGTVTSGCPSPSLKKNVAMGYVPCEYSRPGTMLLVEVRRKQQMAVVSKMPFVPTNYYTLK,403,NP_000472.2.csv,refseq-AMT-NM_000481.3_clinical_seed_0_final,refseq-AMT-NM_000481.3.a2m,Invitae,refseq-AMT-NM_000481.3.npy,1,403,403
+NP_000475.1,MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMHMNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN,770,NP_000475.1.csv,refseq-APP-NM_000484.3_clinical_seed_0_final,refseq-APP-NM_000484.3.a2m,Invitae,refseq-APP-NM_000484.3.npy,1,770,770
+NP_000476.1,MADSELQLVEQRIRSFPDFPTPGVVFRDISPVLKDPASFRAAIGLLARHLKATHGGRIDYIAGLDSRGFLFGPSLAQELGLGCVLIRKRGKLPGPTLWASYSLEYGKAELEIQKDALEPGQRVVVVDDLLATGGTMNAACELLGRLQAEVLECVSLVELTSLKGREKLAPVPFFSLLQYE,180,NP_000476.1.csv,refseq-APRT-NM_000485.2_clinical_seed_0_final,refseq-APRT-NM_000485.2.a2m,Invitae,refseq-APRT-NM_000485.2.npy,1,180,180
+NP_000477.1,MWELRSIAFSRAVFAEFLATLLFVFFGLGSALNWPQALPSVLQIAMAFGLGIGTLVQALGHISGAHINPAVTVACLVGCHVSVLRAAFYVAAQLLGAVAGAALLHEITPADIRGDLAVNALSNSTTAGQAVTVELFLTLQLVLCIFASTDERRGENPGTPALSIGFSVALGHLLGIHYTGCSMNPARSLAPAVVTGKFDDHWVFWIGPLVGAILGSLLYNYVLFPPAKSLSERLAVLKGLEPDTDWEEREVRRRQSVELHSPQSLPRGTKA,271,NP_000477.1.csv,refseq-AQP2-NM_000486.6_clinical_seed_0_final,refseq-AQP2-NM_000486.6.a2m,Invitae,refseq-AQP2-NM_000486.6_theta_0.2.npy,1,271,271
+NP_000478.3,MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA,509,NP_000478.3.csv,refseq-ARSA-NM_000487.5_clinical_seed_0_final,refseq-ARSA-NM_000487.5.a2m,Invitae,refseq-ARSA-NM_000487.5.npy,1,509,509
+NP_000479.1,MYSNVIGTVTSGKRKVYLLSLLLIGFWDCVTCHGSPVDICTAKPRDIPMNPMCIYRSPEKKATEDEGSEQKIPEATNRRVWELSKANSRFATTFYQHLADSKNDNDNIFLSPLSISTAFAMTKLGACNDTLQQLMEVFKFDTISEKTSDQIHFFFAKLNCRLYRKANKSSKLVSANRLFGDKSLTFNETYQDISELVYGAKLQPLDFKENAEQSRAAINKWVSNKTEGRITDVIPSEAINELTVLVLVNTIYFKGLWKSKFSPENTRKELFYKADGESCSASMMYQEGKFRYRRVAEGTQVLELPFKGDDITMVLILPKPEKSLAKVEKELTPEVLQEWLDELEEMMLVVHMPRFRIEDGFSLKEQLQDMGLVDLFSPEKSKLPGIVAEGRDDLYVSDAFHKAFLEVNEEGSEAAASTAVVIAGRSLNPNRVTFKANRPFLVFIREVPLNTIIFMGRVANPCVK,464,NP_000479.1.csv,refseq-SERPINC1-NM_000488.3_clinical_seed_0_final,refseq-SERPINC1-NM_000488.3.a2m,Invitae,refseq-SERPINC1-NM_000488.3.npy,1,464,464
+NP_000481.2,MPDTMLPACFLGLLAFSSACYFQNCPRGGKRAMSDLELRQCLPCGPGGKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKACGSGGRCAAFGVCCNDESCVTEPECREGFHRRARASDRSNATQLDGPAGALLLRLVQLAGAPEPFEPAQPDAY,164,NP_000481.2.csv,refseq-AVP-NM_000490.4_clinical_seed_0_final,refseq-AVP-NM_000490.4.a2m,Invitae,refseq-AVP-NM_000490.4.npy,1,164,164
+NP_000483.3,MQRSPLEKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLEREWDRELASKKNPKLINALRRCFFWRFMFYGIFLYLGEVTKAVQPLLLGRIIASYDPDNKEERSIAIYLGIGLCLLFIVRTLLLHPAIFGLHHIGMQMRIAMFSLIYKKTLKLSSRVLDKISIGQLVSLLSNNLNKFDEGLALAHFVWIAPLQVALLMGLIWELLQASAFCGLGFLIVLALFQAGLGRMMMKYRDQRAGKISERLVITSEMIENIQSVKAYCWEEAMEKMIENLRQTELKLTRKAAYVRYFNSSAFFFSGFFVVFLSVLPYALIKGIILRKIFTTISFCIVLRMAVTRQFPWAVQTWYDSLGAINKIQDFLQKQEYKTLEYNLTTTEVVMENVTAFWEEGFGELFEKAKQNNNNRKTSNGDDSLFFSNFSLLGTPVLKDINFKIERGQLLAVAGSTGAGKTSLLMVIMGELEPSEGKIKHSGRISFCSQFSWIMPGTIKENIIFGVSYDEYRYRSVIKACQLEEDISKFAEKDNIVLGEGGITLSGGQRARISLARAVYKDADLYLLDSPFGYLDVLTEKEIFESCVCKLMANKTRILVTSKMEHLKKADKILILHEGSSYFYGTFSELQNLQPDFSSKLMGCDSFDQFSAERRNSILTETLHRFSLEGDAPVSWTETKKQSFKQTGEFGEKRKNSILNPINSIRKFSIVQKTPLQMNGIEEDSDEPLERRLSLVPDSEQGEAILPRISVISTGPTLQARRRQSVLNLMTHSVNQGQNIHRKTTASTRKVSLAPQANLTELDIYSRRLSQETGLEISEEINEEDLKECFFDDMESIPAVTTWNTYLRYITVHKSLIFVLIWCLVIFLAEVAASLVVLWLLGNTPLQDKGNSTHSRNNSYAVIITSTSSYYVFYIYVGVADTLLAMGFFRGLPLVHTLITVSKILHHKMLHSVLQAPMSTLNTLKAGGILNRFSKDIAILDDLLPLTIFDFIQLLLIVIGAIAVVAVLQPYIFVATVPVIVAFIMLRAYFLQTSQQLKQLESEGRSPIFTHLVTSLKGLWTLRAFGRQPYFETLFHKALNLHTANWFLYLSTLRWFQMRIEMIFVIFFIAVTFISILTTGEGEGRVGIILTLAMNIMSTLQWAVNSSIDVDSLMRSVSRVFKFIDMPTEGKPTKSTKPYKNGQLSKVMIIENSHVKKDDIWPSGGQMTVKDLTAKYTEGGNAILENISFSISPGQRVGLLGRTGSGKSTLLSAFLRLLNTEGEIQIDGVSWDSITLQQWRKAFGVIPQKVFIFSGTFRKNLDPYEQWSDQEIWKVADEVGLRSVIEQFPGKLDFVLVDGGCVLSHGHKQLMCLARSVLSKAKILLLDEPSAHLDPVTYQIIRRTLKQAFADCTVILCEHRIEAMLECQQFLVIEENKVRQYDSIQKLLNERSLFRQAISPSDRVKLFPHRNSSKCKSKPQIAALKEETEEEVQDTRL,1480,NP_000483.3.csv,refseq-CFTR-NM_000492.3_clinical_seed_0_final,refseq-CFTR-NM_000492.3.a2m,Invitae,refseq-CFTR-NM_000492.3_theta_0.2.npy,1,1480,1480
+NP_000484.2,MLPQIPFLLLVSLNLVHGVFYAERYQMPTGIKGPLPNTKTQFFIPYTIKSKGIAVRGEQGTPGPPGPAGPRGHPGPSGPPGKPGYGSPGLQGEPGLPGPPGPSAVGKPGVPGLPGKPGERGPYGPKGDVGPAGLPGPRGPPGPPGIPGPAGISVPGKPGQQGPTGAPGPRGFPGEKGAPGVPGMNGQKGEMGYGAPGRPGERGLPGPQGPTGPSGPPGVGKRGENGVPGQPGIKGDRGFPGEMGPIGPPGPQGPPGERGPEGIGKPGAAGAPGQPGIPGTKGLPGAPGIAGPPGPPGFGKPGLPGLKGERGPAGLPGGPGAKGEQGPAGLPGKPGLTGPPGNMGPQGPKGIPGSHGLPGPKGETGPAGPAGYPGAKGERGSPGSDGKPGYPGKPGLDGPKGNPGLPGPKGDPGVGGPPGLPGPVGPAGAKGMPGHNGEAGPRGAPGIPGTRGPIGPPGIPGFPGSKGDPGSPGPPGPAGIATKGLNGPTGPPGPPGPRGHSGEPGLPGPPGPPGPPGQAVMPEGFIKAGQRPSLSGTPLVSANQGVTGMPVSAFTVILSKAYPAIGTPIPFDKILYNRQQHYDPRTGIFTCQIPGIYYFSYHVHVKGTHVWVGLYKNGTPVMYTYDEYTKGYLDQASGSAIIDLTENDQVWLQLPNAESNGLYSSEYVHSSFSGFLVAPM,680,NP_000484.2.csv,refseq-COL10A1-NM_000493.3_clinical_seed_0_final,refseq-COL10A1-NM_000493.3.a2m,Invitae,refseq-COL10A1-NM_000493.3.npy,1,680,680
+NP_000485.3,MDVTKKNKRDGTEVTERIVTETVTTRLTSLPPKGGTSNGYAKTASLGGGSRLEKQSLTHGSSGYINSTGSTRGHASTSSYRRAHSPASTLPNSPGSTFERKTHVTRHAYEGSSSGNSSPEYPRKEFASSSTRGRSQTRESEIRVRLQSASPSTRWTELDDVKRLLKGSRSASVSPTRNSSNTLPIPKKGTVETKIVTASSQSVSGTYDATILDANLPSHVWSSTLPAGSSMGTYHNNMTTQSSSLLNTNAYSAGSVFGVPNNMASCSPTLHPGLSTSSSVFGMQNNLAPSLTTLSHGTTTTSTAYGVKKNMPQSPAAVNTGVSTSAACTTSVQSDDLLHKDCKFLILEKDNTPAKKEMELLIMTKDSGKVFTASPASIAATSFSEDTLKKEKQAAYNADSGLKAEANGDLKTVSTKGKTTTADIHSYGSSGGGGSGGGGGVGGAGGGPWGPAPAWCPCGSCCSWWKWLLGLLLTWLLLLGLLFGLIALAEEVRKLKARVDELERIRRSILPYGDSMDRIEKDRLQGMAPAAGADLDKIGLHSDSQEELWMFVRKKLMMEQENGNLRGSPGPKGDMGSPGPKGDRGFPGTPGIPGPLGHPGPQGPKGQKGSVGDPGMEGPMGQRGREGPMGPRGEAGPPGSGEKGERGAAGEPGPHGPPGVPGSVGPKGSSGSPGPQGPPGPVGLQGLRGEVGLPGVKGDKGPMGPPGPKGDQGEKGPRGLTGEPGMRGLPGAVGEPGAKGAMGPAGPDGHQGPRGEQGLTGMPGIRGPPGPSGDPGKPGLTGPQGPQGLPGTPGRPGIKGEPGAPGKIVTSEGSSMLTVPGPPGPPGAMGPPGPPGAPGPAGPAGLPGHQEVLNLQGPPGPPGPRGPPGPSIPGPPGPRGPPGEGLPGPPGPPGSFLSNSETFLSGPPGPPGPPGPKGDQGPPGPRGHQGEQGLPGFSTSGSSSFGLNLQGPPGPPGPQGPKGDKGDPGVPGALGIPSGPSEGGSSSTMYVSGPPGPPGPPGPPGSISSSGQEIQQYISEYMQSDSIRSYLSGVQGPPGPPGPPGPVTTITGETFDYSELASHVVSYLRTSGYGVSLFSSSISSEDILAVLQRDDVRQYLRQYLMGPRGPPGPPGASGDGSLLSLDYAELSSRILSYMSSSGISIGLPGPPGPPGLPGTSYEELLSLLRGSEFRGIVGPPGPPGPPGIPGNVWSSISVEDLSSYLHTAGLSFIPGPPGPPGPPGPRGPPGVSGALATYAAENSDSFRSELISYLTSPDVRSFIVGPPGPPGPQGPPGDSRLLSTDASHSRGSSSSSHSSSVRRGSSYSSSMSTGGGGAGSLGAGGAFGEAAGDRGPYGTDIGPGGGYGAAAEGGMYAGNGGLLGADFAGDLDYNELAVRVSESMQRQGLLQGMAYTVQGPPGQPGPQGPPGISKVFSAYSNVTADLMDFFQTYGAIQGPPGQKGEMGTPGPKGDRGPAGPPGHPGPPGPRGHKGEKGDKGDQVYAGRRRRRSIAVKP,1497,NP_000485.3.csv,refseq-COL17A1-NM_000494.3_clinical_seed_0_final,refseq-COL17A1-NM_000494.3.a2m,Invitae,refseq-COL17A1-NM_000494.3.npy,1,1497,1497
+NP_000486.1,MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPGLPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQGIPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPGIQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGPPGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGNIGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPGPPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDTCFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGFPGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKGERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTITQPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGIPGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFPGPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHGIPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGMMGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPNLLGSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLIGPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPGPKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPGNPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGIPGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGPPGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPFISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQVCMKRT,1685,NP_000486.1.csv,refseq-COL4A5-NM_000495.4_clinical_seed_0_final,refseq-COL4A5-NM_000495.4.a2m,Invitae,refseq-COL4A5-NM_000495.4.npy,1,1685,1685
+NP_000487.1,MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWVGYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNFTGKKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGAPHPQVQSVRRIRDMQWHQRGAFHPSN,205,NP_000487.1.csv,refseq-CRYBB2-NM_000496.2_clinical_seed_0_final,refseq-CRYBB2-NM_000496.2.a2m,Invitae,refseq-CRYBB2-NM_000496.2.npy,1,205,205
+NP_000488.3,MALRAKAEVCMAVPWLSLQRAQALGTRAARVPRTVLPFEAMPRRPGNRWLRLLQIWREQGYEDLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSLHPHRMSLEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPEVLSPNAVQRFLPMVDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLNFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELAFSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVASSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQEDIKMVYSFILRPSMFPLLTFRAIN,503,NP_000488.3.csv,refseq-CYP11B1-NM_000497.3_clinical_seed_0_final,refseq-CYP11B1-NM_000497.3.a2m,Invitae,refseq-CYP11B1-NM_000497.3.npy,1,503,503
+NP_000489.3,MALRAKAEVCVAAPWLSLQRARALGTRAARAPRTVLPFEAMPQHPGNRWLRLLQIWREQGYEHLHLEMHQTFQELGPIFRYNLGGPRMVCVMLPEDVEKLQQVDSLHPCRMILEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPKAVQRFLPMVDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLNFLHALEVMFKSTVQLMFMPRSLSRWISPKVWKEHFEAWDCIFQYGDNCIQKIYQELAFNRPQHYTGIVAELLLKAELSLEAIKANSMELTAGSVDTTAFPLLMTLFELARNPDVQQILRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVVSSDLVLQNYHIPAGTLVQVFLYSLGRNAALFPRPERYNPQRWLDIRGSGRNFHHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHFLVETLTQEDIKMVYSFILRPGTSPLLTFRAIN,503,NP_000489.3.csv,refseq-CYP11B2-NM_000498.3_clinical_seed_0_final,refseq-CYP11B2-NM_000498.3.a2m,Invitae,refseq-CYP11B2-NM_000498.3_theta_0.2.npy,1,503,503
+NP_000491.4,MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVSRNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVAIEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQSQ,495,NP_000491.4.csv,refseq-CYP21A2-NM_000500.7_clinical_seed_0_final,refseq-CYP21A2-NM_000500.7.a2m,Invitae,refseq-CYP21A2-NM_000500.7_theta_0.2.npy,1,495,495
+NP_000495.1,MGRPLHLVLLSASLAGLLLLGESLFIRREQANNILARVTRANSFLEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYKDGDQCETSPCQNQGKCKDGLGEYTCTCLEGFEGKNCELFTRKLCSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKACIPTGPYPCGKQTLERRKRSVAQATSSSGEAPDSITWKPYDAADLDPTENPFDLLDFNQTQPERGDNNLTRIVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLYQAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHNRFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDWAESTLMTQKTGIVSGFGRTHEKGRQSTRLKMLEVPYVDRNSCKLSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDRSMKTRGLPKAKSHAPEVITSSPLK,488,NP_000495.1.csv,refseq-F10-NM_000504.3_clinical_seed_0_final,refseq-F10-NM_000504.3.a2m,Invitae,refseq-F10-NM_000504.3.npy,1,488,488
+NP_000496.2,MRALLLLGFLLVSLESTLSIPPWEAPKEHKYKAEEHTVVLTVTGEPCHFPFQYHRQLYHKCTHKGRPGPQPWCATTPNFDQDQRWGYCLEPKKVKDHCSKHSPCQKGGTCVNMPSGPHCLCPQHLTGNHCQKEKCFEPQLLRFFHKNEIWYRTEQAAVARCQCKGPDAHCQRLASQACRTNPCLHGGRCLEVEGHRLCHCPVGYTGAFCDVDTKASCYDGRGLSYRGLARTTLSGAPCQPWASEATYRNVTAEQARNWGLGGHAFCRNPDNDIRPWCFVLNRDRLSWEYCDLAQCQTPTQAAPPTPVSPRLHVPLMPAQPAPPKPQPTTRTPPQSQTPGALPAKREQPPSLTRNGPLSCGQRLRKSLSSMTRVVGGLVALRGAHPYIAALYWGHSFCAGSLIAPCWVLTAAHCLQDRPAPEDLTVVLGQERRNHSCEPCQTLAVRSYRLHEAFSPVSYQHDLALLRLQEDADGSCALLSPYVQPVCLPSGAARPSETTLCQVAGWGHQFEGAEEYASFLQEAQVPFLSLERCSAPDVHGSSILPGMLCAGFLEGGTDACQGDSGGPLVCEDQAAERRLTLQGIISWGSGCGDRNKPGVYTDVAYYLAWIREHTVS,615,NP_000496.2.csv,refseq-F12-NM_000505.3_clinical_seed_0_final,refseq-F12-NM_000505.3.a2m,Invitae,refseq-F12-NM_000505.3.npy,1,615,615
+NP_000497.1,MAHVRGLQLPGCLALAALCSLVHSQHVFLAPQQARSLLQRVRRANTFLEEVRKGNLERECVEETCSYEEAFEALESSTATDVFWAKYTACETARTPRDKLAACLEGNCAEGLGTNYRGHVNITRSGIECQLWRSRYPHKPEINSTTHPGADLQENFCRNPDSSTTGPWCYTTDPTVRRQECSIPVCGQDQVTVAMTPRSEGSSVNLSPPLEQCVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGDEEGVWCYVAGKPGDFGYCDLNYCEEAVEEETGDGLDEDSDRAIEGRTATSEYQTFFNPRTFGSGEADCGLRPLFEKKSLEDKTERELLESYIDGRIVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTENDLLVRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLKKPVAFSDYIHPVCLPDRETAASLLQAGYKGRVTGWGNLKETWTANVGKGQPSVLQVVNLPIVERPVCKDSTRIRITDNMFCAGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFYTHVFRLKKWIQKVIDQFGE,622,NP_000497.1.csv,refseq-F2-NM_000506.3_clinical_seed_0_final,refseq-F2-NM_000506.3.a2m,Invitae,refseq-F2-NM_000506.3.npy,1,622,622
+NP_000498.2,MADQAPFDTDVNTLTRFVMEEGRKARGTGELTQLLNSLCTAVKAISSAVRKAGIAHLYGIAGSTNVTGDQVKKLDVLSNDLVMNMLKSSFATCVLVSEEDKHAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSVGTIFGIYRKKSTDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMDCGVNCFMLDPAIGEFILVDKDVKIKKKGKIYSLNEGYARDFDPAVTEYIQRKKFPPDNSAPYGARYVGSMVADVHRTLVYGGIFLYPANKKSPNGKLRLLYECNPMAYVMEKAGGMATTGKEAVLDVIPTDIHQRAPVILGSPDDVLEFLKVYEKHSAQ,338,NP_000498.2.csv,refseq-FBP1-NM_000507.3_clinical_seed_0_final,refseq-FBP1-NM_000507.3.a2m,Invitae,refseq-FBP1-NM_000507.3.npy,1,338,338
+NP_000500.2,MSWSLHPRNLILYFYALLFLSSTCVAYVATRDNCCILDERFGSYCPTTCGIADFLSTYQTKVDKDLQSLEDILHQVENKTSEVKQLIKAIQLTYNPDESSKPNMIDAATLKSRKMLEEIMKYEASILTHDSSIRYLQEIYNSNNQKIVNLKEKVAQLEAQCQEPCKDTVQIHDITGKDCQDIANKGAKQSGLYFIKPLKANQQFLVYCEIDGSGNGWTVFQKRLDGSVDFKKNWIQYKEGFGHLSPTGTTEFWLGNEKIHLISTQSAIPYALRVELEDWNGRTSTADYAMFKVGPEADKYRLTYAYFAGGDAGDAFDGFDFGDDPSDKFFTSHNGMQFSTWDNDNDKFEGNCAEQDGSGWWMNKCHAGHLNGVYYQGGTYSKASTPNGYDNGIIWATWKTRWYSMKKTTMKIIPFNRLTIGEGQQHHLGGAKQAGDV,437,NP_000500.2.csv,refseq-FGG-NM_000509.5_clinical_seed_0_final,refseq-FGG-NM_000509.5.a2m,Invitae,refseq-FGG-NM_000509.5.npy,1,437,437
+NP_000503.1,MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH,522,NP_000503.1.csv,refseq-GALNS-NM_000512.4_clinical_seed_0_final,refseq-GALNS-NM_000512.4.a2m,Invitae,refseq-GALNS-NM_000512.4.npy,1,522,522
+NP_000506.2,MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF,217,NP_000506.2.csv,refseq-GH1-NM_000515.4_clinical_seed_0_final,refseq-GH1-NM_000515.4.a2m,Invitae,refseq-GH1-NM_000515.4.npy,1,217,217
+NP_000508.1,MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR,142,NP_000508.1.csv,refseq-HBA2-NM_000517.4_clinical_seed_0_final,refseq-HBA2-NM_000517.4.a2m,Invitae,refseq-HBA2-NM_000517.4.npy,1,142,142
+NP_000509.1,MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH,147,NP_000509.1.csv,refseq-HBB-NM_000518.4_clinical_seed_0_final,refseq-HBB-NM_000518.4.a2m,Invitae,refseq-HBB-NM_000518.4.npy,1,147,147
+NP_000511.2,MTSSRLWFSLLLAAAFAGRATALWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSVLDEAFQRYRDLLFGSGSWPRPYLTGKRHTLEKNVLVVSVVTPGCNQLPTLESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDFPRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELMRKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYIVEPLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT,529,NP_000511.2.csv,refseq-HEXA-NM_000520.4_clinical_seed_0_final,refseq-HEXA-NM_000520.4.a2m,Invitae,refseq-HEXA-NM_000520.4.npy,1,529,529
+NP_000514.2,MSRAGSWDMDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAPVFAGTHSGRAAAAAAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPPSAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVEKYMDVSGLASSSVPANEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFGSGEPRHEAYISMEGYQSWTLANGWNSQVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS,343,NP_000514.2.csv,refseq-HOXD13-NM_000523.3_clinical_seed_0_final,refseq-HOXD13-NM_000523.3.a2m,Invitae,refseq-HOXD13-NM_000523.3.npy,1,343,343
+NP_000517.3,MTTCSRQFTSSSSMKGSCGIGGGIGGGSSRISSVLAGGSCRAPSTYGGGLSVSSSRFSSGGACGLGGGYGGGFSSSSSSFGSGFGGGYGGGLGAGLGGGFGGGFAGGDGLLVGSEKVTMQNLNDRLASYLDKVRALEEANADLEVKIRDWYQRQRPAEIKDYSPYFKTIEDLRNKILTATVDNANVLLQIDNARLAADDFRTKYETELNLRMSVEADINGLRRVLDELTLARADLEMQIESLKEELAYLKKNHEEEMNALRGQVGGDVNVEMDAAPGVDLSRILNEMRDQYEKMAEKNRKDAEEWFFTKTEELNREVATNSELVQSGKSEISELRRTMQNLEIELQSQLSMKASLENSLEETKGRYCMQLAQIQEMIGSVEEQLAQLRCEMEQQNQEYKILLDVKTRLEQEIATYRRLLEGEDAHLSSSQFSSGSQSSRDVTSSSRQIRTKVMDVHDGKVVSTHEQVLRTKN,472,NP_000517.3.csv,refseq-KRT14-NM_000526.5_clinical_seed_0_final,refseq-KRT14-NM_000526.5.a2m,Invitae,refseq-KRT14-NM_000526.5.npy,1,472,472
+NP_000518.1,MGPWGWKLRWTVALLLAAAGTAVGDRCERNEFQCQDGKCISYKWVCDGSAECQDGSDESQETCLSVTCKSGDFSCGGRVNRCIPQFWRCDGQVDCDNGSDEQGCPPKTCSQDEFRCHDGKCISRQFVCDSDRDCLDGSDEASCPVLTCGPASFQCNSSTCIPQLWACDNDPDCEDGSDEWPQRCRGLYVFQGDSSPCSAFEFHCLSGECIHSSWRCDGGPDCKDKSDEENCAVATCRPDEFQCSDGNCIHGSRQCDREYDCKDMSDEVGCVNVTLCEGPNKFKCHSGECITLDKVCNMARDCRDWSDEPIKECGTNECLDNNGGCSHVCNDLKIGYECLCPDGFQLVAQRRCEDIDECQDPDTCSQLCVNLEGGYKCQCEEGFQLDPHTKACKAVGSIAYLFFTNRHEVRKMTLDRSEYTSLIPNLRNVVALDTEVASNRIYWSDLSQRMICSTQLDRAHGVSSYDTVISRDIQAPDGLAVDWIHSNIYWTDSVLGTVSVADTKGVKRKTLFRENGSKPRAIVVDPVHGFMYWTDWGTPAKIKKGGLNGVDIYSLVTENIQWPNGITLDLLSGRLYWVDSKLHSISSIDVNGGNRKTILEDEKRLAHPFSLAVFEDKVFWTDIINEAIFSANRLTGSDVNLLAENLLSPEDMVLFHNLTQPRGVNWCERTTLSNGGCQYLCLPAPQINPHSPKFTCACPDGMLLARDMRSCLTEAEAAVATQETSTVRLKVSSTAVRTQHTTTRPVPDTSRLPGATPGLTTVEIVTMSHQALGDVAGRGNEKKPSSVRALSIVLPIVLLVFLCLGVFLLWKNWRLKNINSINFDNPVYQKTTEDEVHICHNQDGYSYPSRQMVSLEDDVA,860,NP_000518.1.csv,refseq-LDLR-NM_000527.4_clinical_seed_0_final,refseq-LDLR-NM_000527.4.a2m,Invitae,refseq-LDLR-NM_000527.4.npy,1,860,860
+NP_000519.2,MGAYARASGVCARGCLDSAGPWTMSRALRPPLPPLCFFLLLLAAAGARAGGYETCPTVQPNMLNVHLLPHTHDDVGWLKTVDQYFYGIKNDIQHAGVQYILDSVISALLADPTRRFIYVEIAFFSRWWHQQTNATQEVVRDLVRQGRLEFANGGWVMNDEAATHYGAIVDQMTLGLRFLEDTFGNDGRPRVAWHIDPFGHSREQASLFAQMGFDGFFFGRLDYQDKWVRMQKLEMEQVWRASTSLKPPTADLFTGVLPNGYNPPRNLCWDVLCVDQPLVEDPRSPEYNAKELVDYFLNVATAQGRYYRTNHTVMTMGSDFQYENANMWFKNLDKLIRLVNAQQAKGSSVHVLYSTPACYLWELNKANLTWSVKHDDFFPYADGPHQFWTGYFSSRPALKRYERLSYNFLQVCNQLEALVGLAANVGPYGSGDSAPLNEAMAVLQHHDAVSGTSRQHVANDYARQLAAGWGPCEVLLSNALARLRGFKDHFTFCQQLNISICPLSQTAARFQVIVYNPLGRKVNWMVRLPVSEGVFVVKDPNGRTVPSDVVIFPSSDSQAHPPELLFSASLPALGFSTYSVAQVPRWKPQARAPQPIPRRSWSPALTIENEHIRATFDPDTGLLMEIMNMNQQLLLPVRQTFFWYNASIGDNESDQASGAYIFRPNQQKPLPVSRWAQIHLVKTPLVQEVHQNFSAWCSQVVRLYPGQRHLELEWSVGPIPVGDTWGKEVISRFDTPLETKGRFYTDSNGREILERRRDYRPTWKLNQTEPVAGNYYPVNTRIYITDGNMQLTVLTDRSQGGSSLRDGSLELMVHRRLLKDDGRGVSEPLMENGSGAWVRGRHLVLLDTAQAAAAGHRLLAEQEVLAPQVVLAPGGGAAYNLGAPPRTQFSGLRRDLPPSVHLLTLASWGPEMVLLRLEHQFAVGEDSGRNLSAPVTLNLRDLFSTFTITRLQETTLVANQLREAASRLKWTTNTGPTPHQTPYQLDPANITLEPMEIRTFLASVQWKEVDG,1011,NP_000519.2.csv,refseq-MAN2B1-NM_000528.3_clinical_seed_0_final,refseq-MAN2B1-NM_000528.3.a2m,Invitae,refseq-MAN2B1-NM_000528.3.npy,1,1011,1011
+NP_000520.1,MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVLENLIVLLAVFKNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFETTADDIIDSLFVLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTGTGITMVIFSHHVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLPRANMKGAITLTILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDPFIYAFRSPELRDAFKKMIFCSRYW,297,NP_000520.1.csv,refseq-MC2R-NM_000529.2_clinical_seed_0_final,refseq-MC2R-NM_000529.2.a2m,Invitae,refseq-MC2R-NM_000529.2.npy,1,297,297
+NP_000522.3,MLFNLRILLNNAAFRNGHNFMVRNFRCGQPLQNKVQLKGRDLLTLKNFTGEEIKYMLWLSADLKFRIKQKGEYLPLLQGKSLGMIFEKRSTRTRLSTETGFALLGGHPCFLTTQDIHLGVNESLTDTARVLSSMADAVLARVYKQSDLDTLAKEASIPIINGLSDLYHPIQILADYLTLQEHYSSLKGLTLSWIGDGNNILHSIMMSAAKFGMHLQAATPKGYEPDASVTKLAEQYAKENGTKLLLTNDPLEAAHGGNVLITDTWISMGQEEEKKKRLQAFQGYQVTMKTAKVAASDWTFLHCLPRKPEEVDDEVFYSPRSLVFPEAENRKWTIMAVMVSLLTDYSPQLQKPKF,354,NP_000522.3.csv,refseq-OTC-NM_000531.5_clinical_seed_0_final,refseq-OTC-NM_000531.5.a2m,Invitae,refseq-OTC-NM_000531.5.npy,1,354,354
+NP_000523.2,MAAALRVAAVGARLSVLASGLRAAVRSLCSQATSVNERIENKRRTALLGGGQRRIDAQHKRGKLTARERISLLLDPGSFVESDMFVEHRCADFGMAADKNKFPGDSVVTGRGRINGRLVYVFSQDFTVFGGSLSGAHAQKICKIMDQAITVGAPVIGLNDSGGARIQEGVESLAGYADIFLRNVTASGVIPQISLIMGPCAGGAVYSPALTDFTFMVKDTSYLFITGPDVVKSVTNEDVTQEELGGAKTHTTMSGVAHRAFENDVDALCNLRDFFNYLPLSSQDPAPVRECHDPSDRLVPELDTIVPLESTKAYNMVDIIHSVVDEREFFEIMPNYAKNIIVGFARMNGRTVGIVGNQPKVASGCLDINSSVKGARFVRFCDAFNIPLITFVDVPGFLPGTAQEYGGIIRHGAKLLYAFAEATVPKVTVITRKAYGGAYDVMSSKHLCGDTNYAWPTAEIAVMGAKGAVEIIFKGHENVEAAQAEYIEKFANPFPAAVRGFVDDIIQPSSTRARICCDLDVLASKKVQRPWRKHANIPL,539,NP_000523.2.csv,refseq-PCCB-NM_000532.4_clinical_seed_0_final,refseq-PCCB-NM_000532.4.a2m,Invitae,refseq-PCCB-NM_000532.4.npy,1,539,539
+NP_000524.3,MGLLECCARCLVGAPFASLVATGLCFFGVALFCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGFYTTGAVRQIFGDYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLERVCHCLGKWLGHPDKFVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQSIAFPSKTSASIGSLCADARMYGVLPWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFVGAAATLVSLLTFMIAATYNFAVLKLMGRGTKF,277,NP_000524.3.csv,refseq-PLP1-NM_000533.3_clinical_seed_0_final,refseq-PLP1-NM_000533.3.a2m,Invitae,refseq-PLP1-NM_000533.3.npy,1,277,277
+NP_000525.1,MKQLPAATVRLLSSSQIITSVVSVVKELIENSLDAGATSVDVKLENYGFDKIEVRDNGEGIKAVDAPVMAMKYYTSKINSHEDLENLTTYGFRGEALGSICCIAEVLITTRTAADNFSTQYVLDGSGHILSQKPSHLGQGTTVTALRLFKNLPVRKQFYSTAKKCKDEIKKIQDLLMSFGILKPDLRIVFVHNKAVIWQKSRVSDHKMALMSVLGTAVMNNMESFQYHSEESQIYLSGFLPKCDADHSFTSLSTPERSFIFINSRPVHQKDILKLIRHHYNLKCLKESTRLYPVFFLKIDVPTADVDVNLTPDKSQVLLQNKESVLIALENLMTTCYGPLPSTNSYENNKTDVSAADIVLSKTAETDVLFNKVESSGKNYSNVDTSVIPFQNDMHNDESGKNTDDCLNHQISIGDFGYGHCSSEISNIDKNTKNAFQDISMSNVSWENSQTEYSKTCFISSVKHTQSENGNKDHIDESGENEEEAGLENSSEISADEWSRGNILKNSVGENIEPVKILVPEKSLPCKVSNNNYPIPEQMNLNEDSCNKKSNVIDNKSGKVTAYDLLSNRVIKKPMSASALFVQDHRPQFLIENPKTSLEDATLQIEELWKTLSEEEKLKYEEKATKDLERYNSQMKRAIEQESQMSLKDGRKKIKPTSAWNLAQKHKLKTSLSNQPKLDELLQSQIEKRRSQNIKMVQIPFSMKNLKINFKKQNKVDLEEKDEPCLIHNLRFPDAWLMTSKTEVMLLNPYRVEEALLFKRLLENHKLPAEPLEKPIMLTESLFNGSHYLDVLYKMTADDQRYSGSTYLSDPRLTANGFKIKLIPGVSITENYLEIEGMANCLPFYGVADLKEILNAILNRNAKEVYECRPRKVISYLEGEAVRLSRQLPMYLSKEDIQDIIYRMKHQFGNEIKECVHGRPFFHHLTYLPETT,932,NP_000525.1.csv,refseq-PMS1-NM_000534.4_clinical_seed_0_final,refseq-PMS1-NM_000534.4.a2m,Invitae,refseq-PMS1-NM_000534.4.npy,1,932,932
+NP_000527.2,MSLQMVTVSNNIALIQPGFSLMNFDGQVFFFGQKGWPKRSCPTGVFHLDVKHNHVKLKPTIFSKDSCYLPPLRYPATCTFKGSLESEKHQYIIHGGKTPNNEVSDKIYVMSIVCKNNKKVTFRCTEKDLVGDVPEARYGHSINVVYSRGKSMGVLFGGRSYMPSTHRTTEKWNSVADCLPCVFLVDFEFGCATSYILPELQDGLSFHVSIAKNDTIYILGGHSLANNIRPANLYRIRVDLPLGSPAVNCTVLPGGISVSSAILTQTNNDEFVIVGGYQLENQKRMICNIISLEDNKIEIREMETPDWTPDIKHSKIWFGSNMGNGTVFLGIPGDNKQVVSEGFYFYMLKCAEDDTNEEQTTFTNSQTSTEDPGDSTPFEDSEEFCFSAEANSFDGDDEFDTYNEDDEEDESETGYWITCCPTCDVDINTWVPFYSTELNKPAMIYCSHGDGHWVHAQCMDLAERTLIHLSAGSNKYYCNEHVEIARALHTPQRVLPLKKPPMKSLRKKGSGKILTPAKKSFLRRLFD,527,NP_000527.2.csv,refseq-RAG2-NM_000536.3_clinical_seed_0_final,refseq-RAG2-NM_000536.3.a2m,Invitae,refseq-RAG2-NM_000536.3_theta_0.2.npy,1,527,527
+NP_000528.1,MDGWRRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKRMPSIRESLKERGVDMARLGPEWSQPMKRLTLGNTTSSVILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVTQMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALAR,406,NP_000528.1.csv,refseq-REN-NM_000537.3_clinical_seed_0_final,refseq-REN-NM_000537.3.a2m,Invitae,refseq-REN-NM_000537.3.npy,1,406,406
+NP_000530.1,MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA,348,NP_000530.1.csv,refseq-RHO-NM_000539.3_clinical_seed_0_final,refseq-RHO-NM_000539.3.a2m,Invitae,refseq-RHO-NM_000539.3.npy,1,348,348
+NP_000532.2,MAASGKTSKSEPNHVIFKKISRDKSVTIYLGNRDYIDHVSQVQPVDGVVLVDPDLVKGKKVYVTLTCAFRYGQEDIDVIGLTFRRDLYFSRVQVYPPVGAASTPTKLQESLLKKLGSNTYPFLLTFPDYLPCSVMLQPAPQDSGKSCGVDFEVKAFATDSTDAEEDKIPKKSSVRLLIRKVQHAPLEMGPQPRAEAAWQFFMSDKPLHLAVSLNKEIYFHGEPIPVTVTVTNNTEKTVKKIKAFVEQVANVVLYSSDYYVKPVAMEEAQEKVPPNSTLTKTLTLLPLLANNRERRGIALDGKIKHEDTNLASSTIIKEGIDRTVLGILVSYQIKVKLTVSGFLGELTSSEVATEVPFRLMHPQPEDPAKESYQDANLVFEEFARHNLKDAGEAEEGKRDKNDVDE,405,NP_000532.2.csv,refseq-SAG-NM_000541.4_clinical_seed_0_final,refseq-SAG-NM_000541.4.a2m,Invitae,refseq-SAG-NM_000541.4.npy,1,405,405
+NP_000533.4,MAESHLLQWLLLLLPTLCGPGTAAWTTSSLACAQGPEFWCQSLEQALQCRALGHCLQEVWGHVGADDLCQECEDIVHILNKMAKEAIFQDTMRKFLEQECNVLPLKLLMPQCNQVLDDYFPLVIDYFQNQTDSNGICMHLGLCKSRQPEPEQEPGMSDPLPKPLRDPLPDPLLDKLVLPVLPGALQARPGPHTQDLSEQQFPIPLPYCWLCRALIKRIQAMIPKGALAVAVAQVCRVVPLVAGGICQCLAERYSVILLDTLLGRMLPQLVCRLVLRCSMDDSAGPRSPTGEWLPRDSECHLCMSVTTQAGNSSEQAIPQAMLQACVGSWLDREKCKQFVEQHTPQLLTLVPRGWDAHTTCQALGVCGTMSSPLQCIHSPDL,381,NP_000533.4.csv,PSPB_HUMAN_b01_clinical_seed_0_final,PSPB_HUMAN_b01.a2m,EVE,PSPB_HUMAN_b01_theta_0.2.npy,1,381,381
+NP_000534.3,MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLVLALALALALALALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPICKGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFEDDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGSGLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWTGDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHESTPVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTHVDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGSSHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAWHNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQLSARADSPALCRHLMPDGSLPEAQSLWPRPLFC,631,NP_000534.3.csv,refseq-SMPD1-NM_000543.4_clinical_seed_0_final,refseq-SMPD1-NM_000543.4.a2m,Invitae,refseq-SMPD1-NM_000543.4.npy,1,631,631
+NP_000536.6,MVSKLSQLQTELLAALLESGLSKEALIQALGEPGPYLLAGEGPLDKGESCGGGRGELAELPNGLGETRGSEDETDDDGEDFTPPILKELENLSPEEAAHQKAVVETLLQEDPWRVAKMVKSYLQQHNIPQREVVDTTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHAGQGGLIEEPTGDELPTKKGRRNRFKWGPASQQILFQAYERQKNPSKEERETLVEECNRAECIQRGVSPSQAQGLGSNLVTEVRVYNWFANRRKEEAFRHKLAMDTYSGPPPGPGPGPALPAHSSPGLPPPALSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVSPTGLEPSHSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMASLPGVMTIGPGEPASLGPTFTNTGASTLVIGLASTQAQSVPVINSMGSSLTTLQPVQFSQPLHPSYQQPLMPPVQSHVTQSPFMATMAQLQSPHALYSHKPEVAQYTHTGLLPQTMLITDTTNLSALASLTPTKQVFTSDTEASSESGLHTPASQATTLHVPSQDPASIQHLQPAHRLSASPTVSSSSLVLYQSSDSSNGQSHLLPSNHSVIETFISTQMASSSQ,631,NP_000536.6.csv,refseq-HNF1A-NM_000545.8_clinical_seed_0_final,refseq-HNF1A-NM_000545.8.a2m,Invitae,refseq-HNF1A-NM_000545.8.npy,1,631,631
+NP_000537.3,MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLMLSPDDIEQWFTEDPGPDEAPRMPEAAPPVAPAPAAPTPAAPAPAPSWPLSSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRGRERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDSD,393,NP_000537.3.csv,refseq-TP53-NM_000546.5_clinical_seed_0_final,refseq-TP53-NM_000546.5.a2m,Invitae,refseq-TP53-NM_000546.5.npy,1,393,393
+NP_000538.3,MRALAVLSVTLVMACTEAFFPFISRGKELLWGKPEESRVSSVLEESKRLVDTAMYATMQRNLKKRGILSPAQLLSFSKLPEPTSGVIARAAEIMETSIQAMKRKVNLKTQQSQHPTDALSEDLLSIIANMSGCLPYMLPPKCPNTCLANKYRPITGACNNRDHPRWGASNTALARWLPPVYEDGFSQPRGWNPGFLYNGFPLPPVREVTRHVIQVSNEVVTDDDRYSDLLMAWGQYIDHDIAFTPQSTSKAAFGGGADCQMTCENQNPCFPIQLPEEARPAAGTACLPFYRSSAACGTGDQGALFGNLSTANPRQQMNGLTSFLDASTVYGSSPALERQLRNWTSAEGLLRVHARLRDSGRAYLPFVPPRAPAACAPEPGIPGETRGPCFLAGDGRASEVPSLTALHTLWLREHNRLAAALKALNAHWSADAVYQEARKVVGALHQIITLRDYIPRILGPEAFQQYVGPYEGYDSTANPTVSNVFSTAAFRFGHATIHPLVRRLDASFQEHPDLPGLWLHQAFFSPWTLLRGGGLDPLIRGLLARPAKLQVQDQLMNEELTERLFVLSNSSTLDLASINLQRGRDHGLPGYNEWREFCGLPRLETPADLSTAIASRSVADKILDLYKHPDNIDVWLGGLAENFLPRARTGPLFACLIGKQMKALRDGDWFWWENSHVFTDAQRRELEKHSLSRVICDNTGLTRVPMDAFQVGKFPEDFESCDSITGMNLEAWRETFPQDDKCGFPESVENGDFVHCEESGRRVLVYSCRHGYELQGREQLTCTQEGWDFQPPLCKDVNECADGAHPPCHASARCRNTKGGFQCLCADPYELGDDGRTCVDSGRLPRVTWISMSLAALLIGGFAGLTSTVICRWTRTGTKSTLPISETGGGTPELRCGKHQAVGTSPQRAAAQDSEQESAGMEGRDTHRLPRAL,933,NP_000538.3.csv,refseq-TPO-NM_000547.5_clinical_seed_0_final,refseq-TPO-NM_000547.5.a2m,Invitae,refseq-TPO-NM_000547.5.npy,1,933,933
+NP_000539.2,MAKPTSKDSGLKEKFKILLGLGTPRPNPRSAEGKQTEFIITAEILRELSMECGLNNRIRMIGQICEVAKTKKFEEHAVEALWKAVADLLQPERPLEARHAVLALLKAIVQGQGERLGVLRALFFKVIKDYPSNEDLHERLEVFKALTDNGRHITYLEEELADFVLQWMDVGLSSEFLLVLVNLVKFNSCYLDEYIARMVQMICLLCVRTASSVDIEVSLQVLDAVVCYNCLPAESLPLFIVTLCRTINVKELCEPCWKLMRNLLGTHLGHSAIYNMCHLMEDRAYMEDAPLLRGAVFFVGMALWGAHRLYSLRNSPTSVLPSFYQAMACPNEVVSYEIVLSITRLIKKYRKELQVVAWDILLNIIERLLQQLQTLDSPELRTIVHDLLTTVEELCDQNEFHGSQERYFELVERCADQRPESSLLNLISYRAQSIHPAKDGWIQNLQALMERFFRSESRGAVRIKVLDVLSFVLLINRQFYEEELINSVVISQLSHIPEDKDHQVRKLATQLLVDLAEGCHTHHFNSLLDIIEKVMARSLSPPPELEERDVAAYSASLEDVKTAVLGLLVILQTKLYTLPASHATRVYEMLVSHIQLHYKHSYTLPIASSIRLQAFDFLLLLRADSLHRLGLPNKDGVVRFSPYCVCDYMEPERGSEKKTSGPLSPPTGPPGPAPAGPAVRLGSVPYSLLFRVLLQCLKQESDWKVLKLVLGRLPESLRYKVLIFTSPCSVDQLCSALCSMLSGPKTLERLRGAPEGFSRTDLHLAVVPVLTALISYHNYLDKTKQREMVYCLEQGLIHRCASQCVVALSICSVEMPDIIIKALPVLVVKLTHISATASMAVPLLEFLSTLARLPHLYRNFAAEQYASVFAISLPYTNPSKFNQYIVCLAHHVIAMWFIRCRLPFRKDFVPFITKGLRSNVLLSFDDTPEKDSFRARSTSLNERPKSLRIARPPKQGLNNSPPVKEFKESSAAEAFRCRSISVSEHVVRSRIQTSLTSASLGSADENSVAQADDSLKNLHLELTETCLDMMARYVFSNFTAVPKRSPVGEFLLAGGRTKTWLVGNKLVTVTTSVGTGTRSLLGLDSGELQSGPESSSSPGVHVRQTKEAPAKLESQAGQQVSRGARDRVRSMSGGHGLRVGALDVPASQFLGSATSPGPRTAPAAKPEKASAGTRVPVQEKTNLAAYVPLLTQGWAEILVRRPTGNTSWLMSLENPLSPFSSDINNMPLQELSNALMAAERFKEHRDTALYKSLSVPAASTAKPPPLPRSNTVASFSSLYQSSCQGQLHRSVSWADSAVVMEEGSPGEVPVLVEPPGLEDVEAALGMDRRTDAYSRSSSVSSQEEKSLHAEELVGRGIPIERVVSSEGGRPSVDLSFQPSQPLSKSSSSPELQTLQDILGDPGDKADVGRLSPEVKARSQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPRSPSGLRPRGYTISDSAPSRRGKRVERDALKSRATASNAEKVPGINPSFVFLQLYHSPFFGDESNKPILLPNESQSFERSVQLLDQIPSYDTHKIAVLYVGEGQSNSELAILSNEHGSYRYTEFLTGLGRLIELKDCQPDKVYLGGLDVCGEDGQFTYCWHDDIMQAVFHIATLMPTKDVDKHRCDKKRHLGNDFVSIVYNDSGEDFKLGTIKGQFNFVHVIVTPLDYECNLVSLQCRKDMEGLVDTSVAKIVSDRNLPFVARQMALHANMASQVHHSRSNPTDIYPSKWIARLRHIKRLRQRICEEAAYSNPSLPLVHPPSHSKAPAQTPAEPTPGYEVGQRKRLISSVEDFTEFV,1807,NP_000539.2.csv,refseq-TSC2-NM_000548.3_clinical_seed_0_final,refseq-TSC2-NM_000548.3.a2m,Invitae,refseq-TSC2-NM_000548.3.npy,1,1807,1807
+NP_000541.1,MSAPKLLSLGCIFFPLLLFQQARAQFPRQCATVEALRSGMCCPDLSPVSGPGTDRCGSSSGRGRCEAVTADSRPHSPQYPHDGRDDREVWPLRFFNRTCHCNGNFSGHNCGTCRPGWRGAACDQRVLIVRRNLLDLSKEEKNHFVRALDMAKRTTHPLFVIATRRSEEILGPDGNTPQFENISIYNYFVWTHYYSVKKTFLGVGQESFGEVDFSHEGPAFLTWHRYHLLRLEKDMQEMLQEPSFSLPYWNFATGKNVCDICTDDLMGSRSNFDSTLISPNSVFSQWRVVCDSLEDYDTLGTLCNSTEDGPIRRNPAGNVARPMVQRLPEPQDVAQCLEVGLFDTPPFYSNSTNSFRNTVEGYSDPTGKYDPAVRSLHNLAHLFLNGTGGQTHLSPNDPIFVLLHTFTDAVFDEWLRRYNADISTFPLENAPIGHNRQYNMVPFWPPVTNTEMFVTAPDNLGYTYEIQWPSREFSVPEIIAIAVVGALLLVALIFGTASYLIRARRSMDEANQPLLTDQYQCYAEEYEKLQNPNQSVV,537,NP_000541.1.csv,refseq-TYRP1-NM_000550.2_clinical_seed_0_final,refseq-TYRP1-NM_000550.2.a2m,Invitae,refseq-TYRP1-NM_000550.2.npy,1,537,537
+NP_000544.2,MSEKKLETTAQQRKCPEWMNVQNKRCAVEERKACVRKSVFEDDLPFLEFTGSIVYSYDASDCSFLSEDISMSLSDGDVVGFDMEWPPLYNRGKLGKVALIQLCVSESKCYLFHVSSMSVFPQGLKMLLENKAVKKAGVGIEGDQWKLLRDFDIKLKNFVELTDVANKKLKCTETWSLNSLVKHLLGKQLLKDKSIRCSNWSKFPLTEDQKLYAATDAYAGFIIYRNLEILDDTVQRFAINKEEEILLSDMNKQLTSISEEVMDLAKHLPHAFSKLENPRRVSILLKDISENLYSLRRMIIGSTNIETELRPSNNLNLLSFEDSTTGGVQQKQIREHEVLIHVEDETWDPTLDHLAKHDGEDVLGNKVERKEDGFEDGVEDNKLKENMERACLMSLDITEHELQILEQQSQEEYLSDIAYKSTEHLSPNDNENDTSYVIESDEDLEMEMLKHLSPNDNENDTSYVIESDEDLEMEMLKSLENLNSGTVEPTHSKCLKMERNLGLPTKEEEEDDENEANEGEEDDDKDFLWPAPNEEQVTCLKMYFGHSSFKPVQWKVIHSVLEERRDNVAVMATGYGKSLCFQYPPVYVGKIGLVISPLISLMEDQVLQLKMSNIPACFLGSAQSENVLTDIKLGKYRIVYVTPEYCSGNMGLLQQLEADIGITLIAVDEAHCISEWGHDFRDSFRKLGSLKTALPMVPIVALTATASSSIREDIVRCLNLRNPQITCTGFDRPNLYLEVRRKTGNILQDLQPFLVKTSSHWEFEGPTIIYCPSRKMTQQVTGELRKLNLSCGTYHAGMSFSTRKDIHHRFVRDEIQCVIATIAFGMGINKADIRQVIHYGAPKDMESYYQEIGRAGRDGLQSSCHVLWAPADINLNRHLLTEIRNEKFRLYKLKMMAKMEKYLHSSRCRRQIILSHFEDKQVQKASLGIMGTEKCCDNCRSRLDHCYSMDDSEDTSWDFGPQAFKLLSAVDILGEKFGIGLPILFLRGSNSQRLADQYRRHSLFGTGKDQTESWWKAFSRQLITEGFLVEVSRYNKFMKICALTKKGRNWLHKANTESQSLILQANEELCPKKLLLPSSKTVSSGTKEHCYNQVPVELSTEKKSNLEKLYSYKPCDKISSGSNISKKSIMVQSPEKAYSSSQPVISAQEQETQIVLYGKLVEARQKHANKMDVPPAILATNKILVDMAKMRPTTVENVKRIDGVSEGKAAMLAPLLEVIKHFCQTNSVQTDLFSSTKPQEEQKTSLVAKNKICTLSQSMAITYSLFQEKKMPLKSIAESRILPLMTIGMHLSQAVKAGCPLDLERAGLTPEVQKIIADVIRNPPVNSDMSKISLIRMLVPENIDTYLIHMAIEILKHGPDSGLQPSCDVNKRRCFPGSEEICSSSKRSKEEVGINTETSSAERKRRLPVWFAKGSDTSKKLMDKTKRGGLFS,1432,NP_000544.2.csv,refseq-WRN-NM_000553.4_clinical_seed_0_final,refseq-WRN-NM_000553.4.a2m,Invitae,refseq-WRN-NM_000553.4.npy,1,1432,1432
+NP_000545.1,MMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRKQRRERTTFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQRQQQKQQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVATVSIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGLDPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTWKFTYNPMDPLDYKDQSAWKFQIL,299,NP_000545.1.csv,refseq-CRX-NM_000554.4_clinical_seed_0_final,refseq-CRX-NM_000554.4.a2m,Invitae,refseq-CRX-NM_000554.4.npy,1,299,299
+NP_000548.2,MRLPKLLTFLLWYLAWLDLEFICTVLGAPDLGQRPQGTRPGLAKAEAKERPPLARNVFRPGGHSYGGGATNANARAKGGTGQTGGLTQPKKDEPKKLPPRPGGPEPKPGHPPQTRQATARTVTPKGQLPGGKAPPKAGSVPSSFLLKKAREPGPPREPKEPFRPPPITPHEYMLSLYRTLSDADRKGGNSSVKLEAGLANTITSFIDKGQDDRGPVVRKQRYVFDISALEKDGLLGAELRILRKKPSDTAKPAAPGGGRAAQLKLSSCPSGRQPAALLDVRSVPGLDGSGWEVFDIWKLFRNFKNSAQLCLELEAWERGRAVDLRGLGFDRAARQVHEKALFLVFGRTKKRDLFFNEIKARSGQDDKTVYEYLFSQRRKRRAPLATRQGKRPSKNLKARCSRKALHVNFKDMGWDDWIIAPLEYEAFHCEGLCEFPLRSHLEPTNHAVIQTLMNSMDPESTPPTCCVPTRLSPISILFIDSANNVVYKQYEDMVVESCGCR,501,NP_000548.2.csv,NP_000548.2_colabfold_clinical_seed_0_final,NP_000548.2_colabfold.a2m,colabfold,NP_000548.2_colabfold_theta_0.2.npy,1,501,501
+NP_000578.2,MKVISLFILVGFIGEFQSFSSASSPVNCQWDFYAPWSECNGCTKTQTRRRSVAVYGQYGGQPCVGNAFETQSCEPTRGCPTEEGCGERFRCFSGQCISKSLVCNGDSDCDEDSADEDRCEDSERRPSCDIDKPPPNIELTGNGYNELTGQFRNRVINTKSFGGQCRKVFSGDGKDFYRLSGNVLSYTFQVKINNDFNYEFYNSTWSYVKHTSTEHTSSSRKRSFFRSSSSSSRSYTSHTNEIHKGKSYQLLVVENTVEVAQFINNNPEFLQLAEPFWKELSHLPSLYDYSAYRRLIDQYGTHYLQSGSLGGEYRVLFYVDSEKLKQNDFNSVEEKKCKSSGWHFVVKFSSHGCKELENALKAASGTQNNVLRGEPFIRGGGAGFISGLSYLELDNPAGNKRRYSAWAESVTNLPQVIKQKLTPLYELVKEVPCASVKKLYLKWALEEYLDEFDPCHCRPCQNGGLATVEGTHCLCHCKPYTFGAACEQGVLVGNQAGGVDGGWSCWSSWSPCVQGKKTRSRECNNPPPSGGGRSCVGETTESTQCEDEELEHLRLLEPHCFPLSLVPTEFCPSPPALKDGFVQDEGTMFPVGKNVVYTCNEGYSLIGNPVARCGEDLRWLVGEMHCQKIACVLPVLMDGIQSHPQKPFYTVGEKVTVSCSGGMSLEGPSAFLCGSSLKWSPEMKNARCVQKENPLTQAVPKCQRWEKLQNSRCVCKMPYECGPSLDVCAQDERSKRILPLTVCKMHVLHCQGRNYTLTGRDSCTLPASAEKACGACPLWGKCDAESSKCVCREASECEEEGFSICVEVNGKEQTMSECEAGALRCRGQSISVTSIRPCAAETQ,843,NP_000578.2.csv,refseq-C7-NM_000587.2_clinical_seed_0_final,refseq-C7-NM_000587.2.a2m,Invitae,refseq-C7-NM_000587.2.npy,1,843,843
+NP_000584.3,MASSRCPAPRGCRCLPGASLAWLGTVLLLLADWVLLRTALPRIFSLLVPTALPLLRVWAVGLSRWAVLWLGACGVLRATVGSKSENAGAQGWLAALKPLAAALGLALPGLALFRELISWGAPGSADSTRLLHWGSHPTAFVVSYAAALPAAALWHKLGSLWVPGGQGGSGNPVRRLLGCLGSETRRLSLFLVLVVLSSLGEMAIPFFTGRLTDWILQDGSADTFTRNLTLMSILTIASAVLEFVGDGIYNNTMGHVHSHLQGEVFGAVLRQETEFFQQNQTGNIMSRVTEDTSTLSDSLSENLSLFLWYLVRGLCLLGIMLWGSVSLTMVTLITLPLLFLLPKKVGKWYQLLEVQVRESLAKSSQVAIEALSAMPTVRSFANEEGEAQKFREKLQEIKTLNQKEAVAYAVNSWTTSISGMLLKVGILYIGGQLVTSGAVSSGNLVTFVLYQMQFTQAVEVLLSIYPRVQKAVGSSEKIFEYLDRTPRCPPSGLLTPLHLEGLVQFQDVSFAYPNRPDVLVLQGLTFTLRPGEVTALVGPNGSGKSTVAALLQNLYQPTGGQLLLDGKPLPQYEHRYLHRQVAAVGQEPQVFGRSLQENIAYGLTQKPTMEEITAAAVKSGAHSFISGLPQGYDTEVDEAGSQLSGGQRQAVALARALIRKPCVLILDDATSALDANSQLQVEQLLYESPERYSRSVLLITQHLSLVEQADHILFLEGGAIREGGTHQQLMEKKGCYWAMVQAPADAPE,748,NP_000584.3.csv,NP_000584.3_clinical_seed_0_final,NP_000584.3.a2m,popEVE,NP_000584.3_theta_0.2.npy,1,748,748
+NP_000594.2,MGNLKSVAQEPGPPCGLGLGLGLGLCGKQGPATPAPEPSRAPASLLPPAPEHSPPSSPLTQPPEGPKFPRVKNWEVGSITYDTLSAQAQQDGPCTPRRCLGSLVFPRKLQGRPSPGPPAPEQLLSQARDFINQYYSSIKRSGSQAHEQRLQEVEAEVAATGTYQLRESELVFGAKQAWRNAPRCVGRIQWGKLQVFDARDCRSAQEMFTYICNHIKYATNRGNLRSAITVFPQRCPGRGDFRIWNSQLVRYAGYRQQDGSVRGDPANVEITELCIQHGWTPGNGRFDVLPLLLQAPDDPPELFLLPPELVLEVPLEHPTLEWFAALGLRWYALPAVSNMLLEIGGLEFPAAPFSGWYMSTEIGTRNLCDPHRYNILEDVAVCMDLDTRTTSSLWKDKAAVEINVAVLHSYQLAKVTIVDHHAATASFMKHLENEQKARGGCPADWAWIVPPISGSLTPVFHQEMVNYFLSPAFRYQPDPWKGSAAKGTGITRKKTFKEVANAVKISASLMGTVMAKRVKATILYGSETGRAQSYAQQLGRLFRKAFDPRVLCMDEYDVVSLEHETLVLVVTSTFGNGDPPENGESFAAALMEMSGPYNSSPRPEQHKSYKIRFNSISCSDPLVSSWRRKRKESSNTDSAGALGTLRFCVFGLGSRAYPHFCAFARAVDTRLEELGGERLLQLGQGDELCGQEEAFRGWAQAAFQAACETFCVGEDAKAAARDIFSPKRSWKRQRYRLSAQAEGLQLLPGLIHVHRRKMFQATIRSVENLQSSKSTRATILVRLDTGGQEGLQYQPGDHIGVCPPNRPGLVEALLSRVEDPPAPTEPVAVEQLEKGSPGGPPPGWVRDPRLPPCTLRQALTFFLDITSPPSPQLLRLLSTLAEEPREQQELEALSQDPRRYEEWKWFRCPTLLEVLEQFPSVALPAPLLLTQLPLLQPRYYSVSSAPSTHPGEIHLTVAVLAYRTQDGLGPLHYGVCSTWLSQLKPGDPVPCFIRGAPSFRLPPDPSLPCILVGPGTGIAPFRGFWQERLHDIESKGLQPTPMTLVFGCRCSQLDHLYRDEVQNAQQRGVFGRVLTAFSREPDNPKTYVQDILRTELAAEVHRVLCLERGHMFVCGDVTMATNVLQTVQRILATEGDMELDEAGDVIGVLRDQQRYHEDIFGLTLRTQEVTSRIRTQSFSLQERQLRGAVPWAFDPPGSDTNSP,1203,NP_000594.2.csv,refseq-NOS3-NM_000603.4_clinical_seed_0_final,refseq-NOS3-NM_000603.4.a2m,Invitae,refseq-NOS3-NM_000603.4.npy,1,1203,1203
+NP_000603.1,MGIPMGKSMLVLLTFLAFASCCIAAYRPSETLCGGELVDTLQFVCGDRGFYFSRPASRVSRRSRGIVEECCFRSCDLALLETYCATPAKSERDVSTPPTVLPDNFPRYPVGKFFQYDTWKQSTQRLRRGLPALLRARRGHVLAKELEAFREAKRHRPLIALPTQDPAHGGAPPEMASNRK,180,NP_000603.1.csv,refseq-IGF2-NM_000612.5_clinical_seed_0_final,refseq-IGF2-NM_000612.5.a2m,Invitae,refseq-IGF2-NM_000612.5.npy,1,180,180
+NP_000608.1,MVLGPEQKMSDDSVSGDHGESASLGNINPAYSNPSLSQSPGDSEEYFATYFNEKISIPEEEYSCFSFRKLWAFTGPGFLMSIAYLDPGNIESDLQSGAVAGFKLLWILLLATLVGLLLQRLAARLGVVTGLHLAEVCHRQYPKVPRVILWLMVELAIIGSDMQEVIGSAIAINLLSVGRIPLWGGVLITIADTFVFLFLDKYGLRKLEAFFGFLITIMALTFGYEYVTVKPSQSQVLKGMFVPSCSGCRTPQIEQAVGIVGAVIMPHNMYLHSALVKSRQVNRNNKQEVREANKYFFIESCIALFVSFIINVFVVSVFAEAFFGKTNEQVVEVCTNTSSPHAGLFPKDNSTLAVDIYKGGVVLGCYFGPAALYIWAVGILAAGQSSTMTGTYSGQFVMEGFLNLKWSRFARVVLTRSIAIIPTLLVAVFQDVEHLTGMNDFLNVLQSLQLPFALIPILTFTSLRPVMSDFANGLGWRIAGGILVLIICSINMYFVVVYVRDLGHVALYVVAAVVSVAYLGFVFYLGWQCLIALGMSFLDCGHTVSISKGLLTEEATRGYVK,561,NP_000608.1.csv,refseq-SLC11A2-NM_000617.2_clinical_seed_0_final,refseq-SLC11A2-NM_000617.2.a2m,Invitae,refseq-SLC11A2-NM_000617.2.npy,1,561,561
+NP_000633.2,MGHSKQIRILLLNEMEKLEKTLFRLEQGYELQFRLGPTLQGKAVTVYTNYPFPGETFNREKFRSLDWENPTEREDDSDKYCKLNLQQSGSFQYYFLQGNEKSGGGYIVVDPILRVGADNHVLPLDCVTLQTFLAKCLGPFDEWESRLRVAKESGYNMIHFTPLQTLGLSRSCYSLANQLELNPDFSRPNRKYTWNDVGQLVEKLKKEWNVICITDVVYNHTAANSKWIQEHPECAYNLVNSPHLKPAWVLDRALWRFSCDVAEGKYKEKGIPALIENDHHMNSIRKIIWEDIFPKLKLWEFFQVDVNKAVEQFRRLLTQENRRVTKSDPNQHLTIIQDPEYRRFGCTVDMNIALTTFIPHDKGPAAIEECCNWFHKRMEELNSEKHRLINYHQEQAVNCLLGNVFYERLAGHGPKLGPVTRKHPLVTRYFTFPFEEIDFSMEESMIHLPNKACFLMAHNGWVMGDDPLRNFAEPGSEVYLRRELICWGDSVKLRYGNKPEDCPYLWAHMKKYTEITATYFQGVRLDNCHSTPLHVAEYMLDAARNLQPNLYVVAELFTGSEDLDNVFVTRLGISSLIREAMSAYNSHEEGRLVYRYGGEPVGSFVQPCLRPLMPAIAHALFMDITHDNECPIVHRSAYDALPSTTIVSMACCASGSTRGYDELVPHQISVVSEERFYTKWNPEALPSNTGEVNFQSGIIAARCAISKLHQELGAKGFIQVYVDQVDEDIVAVTRHSPSIHQSVVAVSRTAFRNPKTSFYSKEVPQMCIPGKIEEVVLEARTIERNTKPYRKDENSINGTPDITVEIREHIQLNESKIVKQAGVATKGPNEYIQEIEFENLSPGSVIIFRVSLDPHAQVAVGILRNHLTQFSPHFKSGSLAVDNADPILKIPFASLASRLTLAELNQILYRCESEEKEDGGGCYDIPNWSALKYAGLQGLMSVLAEIRPKNDLGHPFCNNLRSGDWMIDYVSNRLISRSGTIAEVGKWLQAMFFYLKQIPRYLIPCYFDAILIGAYTTLLDTAWKQMSSFVQNGSTFVKHLSLGSVQLCGVGKFPSLPILSPALMDVPYRLNEITKEKEQCCVSLAAGLPHFSSGIFRCWGRDTFIALRGILLITGRYVEARNIILAFAGTLRHGLIPNLLGEGIYARYNCRDAVWWWLQCIQDYCKMVPNGLDILKCPVSRMYPTDDSAPLPAGTLDQPLFEVIQEAMQKHMQGIQFRERNAGPQIDRNMKDEGFNITAGVDEETGFVYGGNRFNCGTWMDKMGESDRARNRGIPATPRDGSAVEIVGLSKSAVRWLLELSKKNIFPYHEVTVKRHGKAIKVSYDEWNRKIQDNFEKLFHVSEDPSDLNEKHPNLVHKRGIYKDSYGASSPWCDYQLRPNFTIAMVVAPELFTTEKAWKALEIAEKKLLGPLGMKTLDPDDMVYCGIYDNALDNDNYNLAKGFNYHQGPEWLWPIGYFLRAKLYFSRLMGPETTAKTIVLVKNVLSRHYVHLERSPWKGLPELTNENAQYCPFSCETQAWSIATILETLYDL,1532,NP_000633.2.csv,refseq-AGL-NM_000642.2_clinical_seed_0_final,refseq-AGL-NM_000642.2.a2m,Invitae,refseq-AGL-NM_000642.2.npy,1,1532,1532
+NP_000651.3,MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS,390,NP_000651.3.csv,refseq-TGFB1-NM_000660.5_clinical_seed_0_final,refseq-TGFB1-NM_000660.5.a2m,Invitae,refseq-TGFB1-NM_000660.5.npy,1,390,390
+NP_000657.1,MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANPTDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSNPHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEGVTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPALGFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS,408,NP_000657.1.csv,refseq-ACY1-NM_000666.2_clinical_seed_0_final,refseq-ACY1-NM_000666.2.a2m,Invitae,refseq-ACY1-NM_000666.2.npy,1,408,408
+NP_000676.1,MILNSSTEDGIKRIQDDCPKAGRHNYIFVMIPTLYSIIFVVGIFGNSLVVIVIYFYMKLKTVASVFLLNLALADLCFLLTLPLWAVYTAMEYRWPFGNYLCKIASASVSFNLYASVFLLTCLSIDRYLAIVHPMKSRLRRTMLVAKVTCIIIWLLAGLASLPAIIHRNVFFIENTNITVCAFHYESQNSTLPIGLGLTKNILGFLFPFLIILTSYTLIWKALKKAYEIQKNKPRNDDIFKIIMAIVLFFFFSWIPHQIFTFLDVLIQLGIIRDCRIADIVDTAMPITICIAYFNNCLNPLFYGFLGKKFKRYFLQLLKYIPPKAKSHSNLSTKMSTLSYRPSDNVSSSTKKPAPCFEVE,359,NP_000676.1.csv,refseq-AGTR1-NM_000685.5_clinical_seed_0_final,refseq-AGTR1-NM_000685.5.a2m,Invitae,refseq-AGTR1-NM_000685.5_theta_0.2.npy,1,359,359
+NP_000678.1,MSDKLPYKVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKPLKGARIAGCLHMTVETAVLIETLVTLGAEVQWSSCNIFSTQDHAAAAIAKAGIPVYAWKGETDEEYLWCIEQTLYFKDGPLNMILDDGGDLTNLIHTKYPQLLPGIRGISEETTTGVHNLYKMMANGILKVPAINVNDSVTKSKFDNLYGCRESLIDGIKRATDVMIAGKVAVVAGYGDVGKGCAQALRGFGARVIITEIDPINALQAAMEGYEVTTMDEACQEGNIFVTTTGCIDIILGRHFEQMKDDAIVCNIGHFDVEIDVKWLNENAVEKVNIKPQVDRYRLKNGRRIILLAEGRLVNLGCAMGHPSFVMSNSFTNQVMAQIELWTHPDKYPVGVHFLPKKLDEAVAEAHLGKLNVKLTKLTEKQAQYLGMSCDGPFKPDHYRY,432,NP_000678.1.csv,refseq-AHCY-NM_000687.3_clinical_seed_0_final,refseq-AHCY-NM_000687.3.a2m,Invitae,refseq-AHCY-NM_000687.3.npy,1,432,432
+NP_000683.3,MLRFLAPRLLSLQGRTARYSSAAALPSPILNPDIPYNQLFINNEWQDAVSKKTFPTVNPTTGEVIGHVAEGDRADVDRAVKAAREAFRLGSPWRRMDASERGRLLNRLADLVERDRVYLASLETLDNGKPFQESYALDLDEVIKVYRYFAGWADKWHGKTIPMDGQHFCFTRHEPVGVCGQIIPWNFPLVMQGWKLAPALATGNTVVMKVAEQTPLSALYLASLIKEAGFPPGVVNIITGYGPTAGAAIAQHVDVDKVAFTGSTEVGHLIQKAAGDSNLKRVTLELGGKSPSIVLADADMEHAVEQCHEALFFNMGQCCCAGSRTFVEESIYNEFLERTVEKAKQRKVGNPFELDTQQGPQVDKEQFERVLGYIQLGQKEGAKLLCGGERFGERGFFIKPTVFGGVQDDMRIAKEEIFGPVQPLFKFKKIEEVVERANNTRYGLAAAVFTRDLDKAMYFTQALQAGTVWVNTYNIVTCHTPFGGFKESGNGRELGEDGLKAYTEVKTVTIKVPQKNS,517,NP_000683.3.csv,refseq-ALDH1B1-NM_000692.4_clinical_seed_0_final,refseq-ALDH1B1-NM_000692.4.a2m,Invitae,refseq-ALDH1B1-NM_000692.4_theta_0.2.npy,1,517,517
+NP_000684.2,MATANGAVENGQPDRKPPALPRPIRNLEVKFTKIFINNEWHESKSGKKFATCNPSTREQICEVEEGDKPDVDKAVEAAQVAFQRGSPWRRLDALSRGRLLHQLADLVERDRATLAALETMDTGKPFLHAFFIDLEGCIRTLRYFAGWADKIQGKTIPTDDNVVCFTRHEPIGVCGAITPWNFPLLMLVWKLAPALCCGNTMVLKPAEQTPLTALYLGSLIKEAGFPPGVVNIVPGFGPTVGAAISSHPQINKIAFTGSTEVGKLVKEAASRSNLKRVTLELGGKNPCIVCADADLDLAVECAHQGVFFNQGQCCTAASRVFVEEQVYSEFVRRSVEYAKKRPVGDPFDVKTEQGPQIDQKQFDKILELIESGKKEGAKLECGGSAMEDKGLFIKPTVFSEVTDNMRIAKEEIFGPVQPILKFKSIEEVIKRANSTDYGLTAAVFTKNLDKALKLASALESGTVWINCYNALYAQAPFGGFKMSGNGRELGEYALAEYTEVKTVTIKLGDKNP,512,NP_000684.2.csv,refseq-ALDH1A3-NM_000693.3_clinical_seed_0_final,refseq-ALDH1A3-NM_000693.3.a2m,Invitae,refseq-ALDH1A3-NM_000693.3_theta_0.2.npy,1,512,512
+NP_000693.1,MGRGAGREYSPAATTAENGGGKKKQKEKELDELKKEVAMDDHKLSLDELGRKYQVDLSKGLTNQRAQDVLARDGPNALTPPPTTPEWVKFCRQLFGGFSILLWIGAILCFLAYGIQAAMEDEPSNDNLYLGVVLAAVVIVTGCFSYYQEAKSSKIMDSFKNMVPQQALVIREGEKMQINAEEVVVGDLVEVKGGDRVPADLRIISSHGCKVDNSSLTGESEPQTRSPEFTHENPLETRNICFFSTNCVEGTARGIVIATGDRTVMGRIATLASGLEVGRTPIAMEIEHFIQLITGVAVFLGVSFFVLSLILGYSWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTEDQSGATFDKRSPTWTALSRIAGLCNRAVFKAGQENISVSKRDTAGDASESALLKCIELSCGSVRKMRDRNPKVAEIPFNSTNKYQLSIHEREDSPQSHVLVMKGAPERILDRCSTILVQGKEIPLDKEMQDAFQNAYMELGGLGERVLGFCQLNLPSGKFPRGFKFDTDELNFPTEKLCFVGLMSMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPMSQVNPREAKACVVHGSDLKDMTSEQLDEILKNHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGIAMGISGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLLFIIANIPLPLGTVTILCIDLGTDMVPAISLAYEAAESDIMKRQPRNSQTDKLVNERLISMAYGQIGMIQALGGFFTYFVILAENGFLPSRLLGIRLDWDDRTMNDLEDSYGQEWTYEQRKVVEFTCHTAFFASIVVVQWADLIICKTRRNSVFQQGMKNKILIFGLLEETALAAFLSYCPGMGVALRMYPLKVTWWFCAFPYSLLIFIYDEVRKLILRRYPGGWVEKETYY,1020,NP_000693.1.csv,refseq-ATP1A2-NM_000702.3_clinical_seed_0_final,refseq-ATP1A2-NM_000702.3.a2m,Invitae,refseq-ATP1A2-NM_000702.3.npy,1,1020,1020
+NP_000700.1,MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK,445,NP_000700.1.csv,refseq-BCKDHA-NM_000709.3_clinical_seed_0_final,refseq-BCKDHA-NM_000709.3.a2m,Invitae,refseq-BCKDHA-NM_000709.3.npy,1,445,445
+NP_000708.1,MRMLLALLALSAARPSASAESHWCYEVQAESSNYPCLVPVKWGGNCQKDRQSPINIVTTKAKVDKKLGRFFFSGYDKKQTWTVQNNGHSVMMLLENKASISGGGLPAPYQAKQLHLHWSDLPYKGSEHSLDGEHFAMEMHIVHEKEKGTSRNVKEAQDPEDEIAVLAFLVEAGTQVNEGFQPLVEALSNIPKPEMSTTMAESSLLDLLPKEEKLRHYFRYLGSLTTPTCDEKVVWTVFREPIQLHREQILAFSQKLYYDKEQTVSMKDNVRPLQQLGQRTVIKSGAPGRPLPWALPALLGPMLACLLAGFLR,312,NP_000708.1.csv,refseq-CA4-NM_000717.4_clinical_seed_0_final,refseq-CA4-NM_000717.4.a2m,Invitae,refseq-CA4-NM_000717.4.npy,1,312,312
+NP_000710.5,MVNENTRMYIPEENHQGSNYGSPRPAHANMNANAAAGLAPEHIPTPGAALSWQAAIDAARQAKLMGSAGNATISTVSSTQRKRQQYGKPKKQGSTTATRPPRALLCLTLKNPIRRACISIVEWKPFEIIILLTIFANCVALAIYIPFPEDDSNATNSNLERVEYLFLIIFTVEAFLKVIAYGLLFHPNAYLRNGWNLLDFIIVVVGLFSAILEQATKADGANALGGKGAGFDVKALRAFRVLRPLRLVSGVPSLQVVLNSIIKAMVPLLHIALLVLFVIIIYAIIGLELFMGKMHKTCYNQEGIADVPAEDDPSPCALETGHGRQCQNGTVCKPGWDGPKHGITNFDNFAFAMLTVFQCITMEGWTDVLYWVNDAVGRDWPWIYFVTLIIIGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKLREKQQLEEDLKGYLDWITQAEDIDPENEDEGMDEEKPRNMSMPTSETESVNTENVAGGDIEGENCGARLAHRISKSKFSRYWRRWNRFCRRKCRAAVKSNVFYWLVIFLVFLNTLTIASEHYNQPNWLTEVQDTANKALLALFTAEMLLKMYSLGLQAYFVSLFNRFDCFVVCGGILETILVETKIMSPLGISVLRCVRLLRIFKITRYWNSLSNLVASLLNSVRSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDEMQTRRSTFDNFPQSLLTVFQILTGEDWNSVMYDGIMAYGGPSFPGMLVCIYFIILFICGNYILLNVFLAIAVDNLADAESLTSAQKEEEEEKERKKLARTASPEKKQELVEKPAVGESKEEKIELKSITADGESPPATKINMDDLQPNENEDKSPYPNPETTGEEDEEEPEMPVGPRPRPLSELHLKEKAVPMPEASAFFIFSSNNRFRLQCHRIVNDTIFTNLILFFILLSSISLAAEDPVQHTSFRNHILFYFDIVFTTIFTIEIALKMTAYGAFLHKGSFCRNYFNILDLLVVSVSLISFGIQSSAINVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIVIVTTLLQFMFACIGVQLFKGKLYTCSDSSKQTEAECKGNYITYKDGEVDHPIIQPRSWENSKFDFDNVLAAMMALFTVSTFEGWPELLYRSIDSHTEDKGPIYNYRVEISIFFIIYIIIIAFFMMNIFVGFVIVTFQEQGEQEYKNCELDKNQRQCVEYALKARPLRRYIPKNQHQYKVWYVVNSTYFEYLMFVLILLNTICLAMQHYGQSCLFKIAMNILNMLFTGLFTVEMILKLIAFKPKHYFCDAWNTFDALIVVGSIVDIAITEVNPAEHTQCSPSMNAEENSRISITFFRLFRVMRLVKLLSRGEGIRTLLWTFIKSFQALPYVALLIVMLFFIYAVIGMQVFGKIALNDTTEINRNNNFQTFPQAVLLLFRCATGEAWQDIMLACMPGKKCAPESEPSNSTEGETPCGSSFAVFYFISFYMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWAEYDPEAKGRIKHLDVVTLLRRIQPPLGFGKLCPHRVACKRLVSMNMPLNSDGTVMFNATLFALVRTALRIKTEGNLEQANEELRAIIKKIWKRTSMKLLDQVVPPAGDDEVTVGKFYATFLIQEYFRKFKKRKEQGLVGKPSQRNALSLQAGLRTLHDIGPEIRRAISGDLTAEEELDKAMKEAVSAASEDDIFRRAGGLFGNHVSYYQSDGRSAFPQTFTTQRPLHINKAGSSQGDTESPSHEKLVDSTFTPSSYSSTGSNANINNANNTALGRLPRPAGYPSTVSTVEGHGPPLSPAIRVQEVAWKLSSNRCHSRESQAAMAGQEETSQDETYEVKMNHDTEACSEPSLLSTEMLSYQDDENRQLTLPEEDKRDIRQSPKRGFLRSASLGRRASFHLECLKRQKDRGGDISQKTVLPLHLVHHQALAVAGLSPLLQRSHSPASFPRPFATPPATPGSRGWPPQPVPTLRLEGVESSEKLNSSFPSIHCGSWAETTPGGGGSSAARRVRPVSLMVPSQAGAPGRQFHGSASSLVEAVLISEGLGQFAQDPKFIEVTTQELADACDMTIEEMESAADNILSGGAPQSPNGALLPFVNCRDAGQDRAGGEEDAGCVRARGRPSEEELQDSRVYVSSL,2138,NP_000710.5.csv,refseq-CACNA1C-NM_000719.6_clinical_seed_0_final,refseq-CACNA1C-NM_000719.6.a2m,Invitae,refseq-CACNA1C-NM_000719.6_theta_0.2.npy,1,2138,2138
+NP_000711.1,MMMMMMMKKMQHQRQQQADHANEANYARGTRLPLSGEGPTSQPNSSKQTVLSWQAAIDAARQAKAAQTMSTSAPPPVGSLSQRKRQQYAKSKKQGNSSNSRPARALFCLSLNNPIRRACISIVEWKPFDIFILLAIFANCVALAIYIPFPEDDSNSTNHNLEKVEYAFLIIFTVETFLKIIAYGLLLHPNAYVRNGWNLLDFVIVIVGLFSVILEQLTKETEGGNHSSGKSGGFDVKALRAFRVLRPLRLVSGVPSLQVVLNSIIKAMVPLLHIALLVLFVIIIYAIIGLELFIGKMHKTCFFADSDIVAEEDPAPCAFSGNGRQCTANGTECRSGWVGPNGGITNFDNFAFAMLTVFQCITMEGWTDVLYWVNDAIGWEWPWVYFVSLIILGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKLREKQQLEEDLKGYLDWITQAEDIDPENEEEGGEEGKRNTSMPTSETESVNTENVSGEGENRGCCGSLWCWWRRRGAAKAGPSGCRRWGQAISKSKLSRRWRRWNRFNRRRCRAAVKSVTFYWLVIVLVFLNTLTISSEHYNQPDWLTQIQDIANKVLLALFTCEMLVKMYSLGLQAYFVSLFNRFDCFVVCGGITETILVELEIMSPLGISVFRCVRLLRIFKVTRHWTSLSNLVASLLNSMKSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDETQTKRSTFDNFPQALLTVFQILTGEDWNAVMYDGIMAYGGPSSSGMIVCIYFIILFICGNYILLNVFLAIAVDNLADAESLNTAQKEEAEEKERKKIARKESLENKKNNKPEVNQIANSDNKVTIDDYREEDEDKDPYPPCDVPVGEEEEEEEEDEPEVPAGPRPRRISELNMKEKIAPIPEGSAFFILSKTNPIRVGCHKLINHHIFTNLILVFIMLSSAALAAEDPIRSHSFRNTILGYFDYAFTAIFTVEILLKMTTFGAFLHKGAFCRNYFNLLDMLVVGVSLVSFGIQSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYRCTDEAKSNPEECRGLFILYKDGDVDSPVVRERIWQNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDSNGENIGPIYNHRVEISIFFIIYIIIVAFFMMNIFVGFVIVTFQEQGEKEYKNCELDKNQRQCVEYALKARPLRRYIPKNPYQYKFWYVVNSSPFEYMMFVLIMLNTLCLAMQHYEQSKMFNDAMDILNMVFTGVFTVEMVLKVIAFKPKGYFSDAWNTFDSLIVIGSIIDVALSEADPTESENVPVPTATPGNSEESNRISITFFRLFRVMRLVKLLSRGEGIRTLLWTFIKSFQALPYVALLIAMLFFIYAVIGMQMFGKVAMRDNNQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLACLPGKLCDPESDYNPGEEYTCGSNFAIVYFISFYMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWSEYDPEAKGRIKHLDVVTLLRRIQPPLGFGKLCPHRVACKRLVAMNMPLNSDGTVMFNATLFALVRTALKIKTEGNLEQANEELRAVIKKIWKKTSMKLLDQVVPPAGDDEVTVGKFYATFLIQDYFRKFKKRKEQGLVGKYPAKNTTIALQAGLRTLHDIGPEIRRAISCDLQDDEPEETKREEEDDVFKRNGALLGNHVNHVNSDRRDSLQQTNTTHRPLHVQRPSIPPASDTEKPLFPPAGNSVCHNHHNHNSIGKQVPTSTNANLNNANMSKAAHGKRPSIGNLEHVSENGHHSSHKHDREPQRRSSVKRTRYYETYIRSDSGDEQLPTICREDPEIHGYFRDPHCLGEQEYFSSEECYEDDSSPTWSRQNYGYYSRYPGRNIDSERPRGYHHPQGFLEDDDSPVCYDSRRSPRRRLLPPTPASHRRSSFNFECLRRQSSQEEVPSSPIFPHRTALPLHLMQQQIMAVAGLDSSKAQKYSPSHSTRSWATPPATPPYRDWTPCYTPLIQVEQSEALDQVNGSLPSLHRSSWYTDEPDISYRTFTPASLTVPSSFRNKNSDKQRSADSLVEAVLISEGLGRYARDPKFVSATKHEIADACDLTIDEMESAASTLLNGNVRPRANGDVGPLSHRQDYELQDFGPGYSDEEPDPGRDEEDLADEMICITTL,2181,NP_000711.1.csv,refseq-CACNA1D-NM_000720.3_clinical_seed_0_final,refseq-CACNA1D-NM_000720.3.a2m,Invitae,refseq-CACNA1D-NM_000720.3.npy,1,2181,2181
+NP_000712.2,MARFGEAVVARPGSGDGDSDQSRNRQGTPVPASGQAAAYKQTKAQRARTMALYNPIPVRQNCFTVNRSLFIFGEDNIVRKYAKKLIDWPPFEYMILATIIANCIVLALEQHLPEDDKTPMSRRLEKTEPYFIGIFCFEAGIKIVALGFIFHKGSYLRNGWNVMDFIVVLSGILATAGTHFNTHVDLRTLRAVRVLRPLKLVSGIPSLQIVLKSIMKAMVPLLQIGLLLFFAILMFAIIGLEFYSGKLHRACFMNNSGILEGFDPPHPCGVQGCPAGYECKDWIGPNDGITQFDNILFAVLTVFQCITMEGWTTVLYNTNDALGATWNWLYFIPLIIIGSFFVLNLVLGVLSGEFAKERERVENRRAFMKLRRQQQIERELNGYRAWIDKAEEVMLAEENKNAGTSALEVLRRATIKRSRTEAMTRDSSDEHCVDISSVGTPLARASIKSAKVDGVSYFRHKERLLRISIRHMVKSQVFYWIVLSLVALNTACVAIVHHNQPQWLTHLLYYAEFLFLGLFLLEMSLKMYGMGPRLYFHSSFNCFDFGVTVGSIFEVVWAIFRPGTSFGISVLRALRLLRIFKITKYWASLRNLVVSLMSSMKSIISLLFLLFLFIVVFALLGMQLFGGRFNFNDGTPSANFDTFPAAIMTVFQILTGEDWNEVMYNGIRSQGGVSSGMWSAIYFIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEQEEEEAFNQKHALQKAKEVSPMSAPNMPSIERDRRRRHHMSMWEPRSSHLRERRRRHHMSVWEQRTSQLRKHMQMSSQEALNREEAPTMNPLNPLNPLSSLNPLNAHPSLYRRPRAIEGLALGLALEKFEEERISRGGSLKGDGGDRSSALDNQRTPLSLGQREPPWLARPCHGNCDPTQQEAGGGEAVVTFEDRARHRQSQRRSRHRRVRTEGKESSSASRSRSASQERSLDEAMPTEGEKDHELRGNHGAKEPTIQEERAQDLRRTNSLMVSRGSGLAGGLDEADTPLVLPHPELEVGKHVVLTEQEPEGSSEQALLGNVQLDMGRVISQSEPDLSCITANTDKATTESTSVTVAIPDVDPLVDSTVVHISNKTDGEASPLKEAEIREDEEEVEKKKQKKEKRETGKAMVPHSSMFIFSTTNPIRRACHYIVNLRYFEMCILLVIAASSIALAAEDPVLTNSERNKVLRYFDYVFTGVFTFEMVIKMIDQGLILQDGSYFRDLWNILDFVVVVGALVAFALANALGTNKGRDIKTIKSLRVLRVLRPLKTIKRLPKLKAVFDCVVTSLKNVFNILIVYKLFMFIFAVIAVQLFKGKFFYCTDSSKDTEKECIGNYVDHEKNKMEVKGREWKRHEFHYDNIIWALLTLFTVSTGEGWPQVLQHSVDVTEEDRGPSRSNRMEMSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKMMEECSLEKNERACIDFAISAKPLTRYMPQNRHTFQYRVWHFVVSPSFEYTIMAMIALNTVVLMMKYYSAPCTYELALKYLNIAFTMVFSLECVLKVIAFGFLNYFRDTWNIFDFITVIGSITEIILTDSKLVNTSGFNMSFLKLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVFGNIKLDEESHINRHNNFRSFFGSLMLLFRSATGEAWQEIMLSCLGEKGCEPDTTAPSGQNENERCGTDLAYVYFVSFIFFCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHLDEFVRVWAEYDRAACGRIHYTEMYEMLTLMSPPLGLGKRCPSKVAYKRLVLMNMPVAEDMTVHFTSTLMALIRTALDIKIAKGGADRQQLDSELQKETLAIWPHLSQKMLDLLVPMPKASDLTVGKIYAAMMIMDYYKQSKVKKQRQQLEEQKNAPMFQRMEPSSLPQEIIANAKALPYLQQDPVSGLSGRSGYPSMSPLSPQDIFQLACMDPADDGQFQERQSLVVTDPSSMRRSFSTIRDKRSNSSWLEEFSMERSSENTYKSRRRSYHSSLRLSAHRLNSDSGHKSDTHRSGGRERGRSKERKHLLSPDVSRCNSEERGTQADWESPERRQSRSPSEGRSQTPNRQGTGSLSESSIPSVSDTSTPRRSRRQLPPVPPKPRPLLSYSSLIRHAGSISPPADGSEEGSPLTSQALESNNACLTESSNSPHPQQSQHASPQRYISEPYLALHEDSHASDCGEEETLTFEAAVATSLGRSNTIGSAPPLRHSWQMPNGHYRRRRRGGPGPGMMCGAVNNLLSDTEEDDKC,2270,NP_000712.2.csv,refseq-CACNA1E-NM_000721.3_clinical_seed_0_final,refseq-CACNA1E-NM_000721.3.a2m,Invitae,refseq-CACNA1E-NM_000721.3.npy,1,2270,2270
+NP_000717.2,MSSSSYAKNGTADGPHSPTSQVARGTTTRRSRLKRSDGSTTSTSFILRQGSADSYTSRPSDSDVSLEEDREAIRQEREQQAAIQLERAKSKPVAFAVKTNVSYCGALDEDVPVPSTAISFDAKDFLHIKEKYNNDWWIGRLVKEGCEIGFIPSPLRLENIRIQQEQKRGRFHGGKSSGNSSSSLGEMVSGTFRATPTSTAKQKQKVTEHIPPYDVVPSMRPVVLVGPSLKGYEVTDMMQKALFDFLKHRFDGRISITRVTADISLAKRSVLNNPSKRAIIERSNTRSSLAEVQSEIERIFELARSLQLVVLDADTINHPAQLIKTSLAPIIVHVKVSSPKVLQRLIKSRGKSQSKHLNVQLVAADKLAQCPPEMFDVILDENQLEDACEHLGEYLEAYWRATHTTSSTPMTPLLGRNLGSTALSPYPTAISGLQSQRMRHSNHSTENSPIERRSLMTSDENYHNERARKSRNRLSSSSQHSRDHYPLVEEDYPDSYQDTYKPHRNRGSPGGYSHDSRHRL,520,NP_000717.2.csv,refseq-CACNB4-NM_000726.3_clinical_seed_0_final,refseq-CACNB4-NM_000726.3.a2m,Invitae,refseq-CACNB4-NM_000726.3.npy,1,520,520
+NP_000735.1,MELGGPGAPRLLPPLLLLLGTGLLRASSHVETRAHAEERLLKKLFSGYNKWSRPVANISDVVLVRFGLSIAQLIDVDEKNQMMTTNVWVKQEWHDYKLRWDPADYENVTSIRIPSELIWRPDIVLYNNADGDFAVTHLTKAHLFHDGRVQWTPPAIYKSSCSIDVTFFPFDQQNCTMKFGSWTYDKAKIDLVNMHSRVDQLDFWESGEWVIVDAVGTYNTRKYECCAEIYPDITYAFVIRRLPLFYTINLIIPCLLISCLTVLVFYLPSECGEKITLCISVLLSLTVFLLLITEIIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHHRSPRTHTMPTWVRRVFLDIVPRLLLMKRPSVVKDNCRRLIESMHKMASAPRFWPEPEGEPPATSGTQSLHPPSPSFCVPLDVPAEPGPSCKSPSDQLPPQQPLEAEKASPHPSPGPCRPPHGTQAPGLAKARSLSVQHMSSPGEAVEGGVRCRSRSIQYCVPRDDAAPEADGQAAGALASRNTHSAELPPPDQPSPCKCTCKKEPSSVSPSATVKTRSTKAPPPHLPLSPALTRAVEGVQYIADHLKAEDTDFSVKEDWKYVAMVIDRIFLWMFIIVCLLGTVGLFLPPWLAGMI,627,NP_000735.1.csv,refseq-CHRNA4-NM_000744.6_clinical_seed_0_final,refseq-CHRNA4-NM_000744.6.a2m,Invitae,refseq-CHRNA4-NM_000744.6.npy,1,627,627
+NP_000738.2,MTPGALLMLLGALGAPLAPGVRGSEAEGRLREKLFSGYDSSVRPAREVGDRVRVSVGLILAQLISLNEKDEEMSTKVYLDLEWTDYRLSWDPAEHDGIDSLRITAESVWLPDVVLLNNNDGNFDVALDISVVVSSDGSVRWQPPGIYRSSCSIQVTYFPFDWQNCTMVFSSYSYDSSEVSLQTGLGPDGQGHQEIHIHEGTFIENGQWEIIHKPSRLIQPPGDPRGGREGQRQEVIFYLIIRRKPLFYLVNVIAPCILITLLAIFVFYLPPDAGEKMGLSIFALLTLTVFLLLLADKVPETSLSVPIIIKYLMFTMVLVTFSVILSVVVLNLHHRSPHTHQMPLWVRQIFIHKLPLYLRLKRPKPERDLMPEPPHCSSPGSGWGRGTDEYFIRKPPSDFLFPKPNRFQPELSAPDLRRFIDGPNRAVALLPELREVVSSISYIARQLQEQEDHDALKEDWQFVAMVVDRLFLWTFIIFTSVGTLVIFLDATYHLPPPDPFP,501,NP_000738.2.csv,refseq-CHRNB1-NM_000747.2_clinical_seed_0_final,refseq-CHRNB1-NM_000747.2.a2m,Invitae,refseq-CHRNB1-NM_000747.2.npy,1,501,501
+NP_000739.1,MARRCGPVALLLGFGLLRLCSGVWGTDTEERLVEHLLDPSRYNKLIRPATNGSELVTVQLMVSLAQLISVHEREQIMTTNVWLTQEWEDYRLTWKPEEFDNMKKVRLPSKHIWLPDVVLYNNADGMYEVSFYSNAVVSYDGSIFWLPPAIYKSACKIEVKHFPFDQQNCTMKFRSWTYDRTEIDLVLKSEVASLDDFTPSGEWDIVALPGRRNENPDDSTYVDITYDFIIRRKPLFYTINLIIPCVLITSLAILVFYLPSDCGEKMTLCISVLLALTVFLLLISKIVPPTSLDVPLVGKYLMFTMVLVTFSIVTSVCVLNVHHRSPTTHTMAPWVKVVFLEKLPALLFMQQPRHHCARQRLRLRRRQREREGAGALFFREAPGADSCTCFVNRASVQGLAGAFGAEPAPVAGPGRSGEPCGCGLREAVDGVRFIADHMRSEDDDQSVSEDWKYVAMVIDRLFLWIFVFVCVFGTIGMFLQPLFQNYTTTTFLHSDHSAPSSK,502,NP_000739.1.csv,refseq-CHRNB2-NM_000748.2_clinical_seed_0_final,refseq-CHRNB2-NM_000748.2.a2m,Invitae,refseq-CHRNB2-NM_000748.2.npy,1,502,502
+NP_000742.1,MEGPVLTLGLLAALAVCGSWGLNEEERLIRHLFQEKGYNKELRPVAHKEESVDVALALTLSNLISLKEVEETLTTNVWIEHGWTDNRLKWNAEEFGNISVLRLPPDMVWLPEIVLENNNDGSFQISYSCNVLVYHYGFVYWLPPAIFRSSCPISVTYFPFDWQNCSLKFSSLKYTAKEITLSLKQDAKENRTYPVEWIIIDPEGFTENGEWEIVHRPARVNVDPRAPLDSPSRQDITFYLIIRRKPLFYIINILVPCVLISFMVNLVFYLPADSGEKTSVAISVLLAQSVFLLLISKRLPATSMAIPLIGKFLLFGMVLVTMVVVICVIVLNIHFRTPSTHVLSEGVKKLFLETLPELLHMSRPAEDGPSPGALVRRSSSLGYISKAEEYFLLKSRSDLMFEKQSERHGLARRLTTARRPPASSEQAQQELFNELKPAVDGANFIVNHMRDQNNYNEEKDSWNRVARTVDRLCLFVVTPVMVVGTAWIFLQGVYNQPPPQPFPGDPYSYNVQDKRFI,517,NP_000742.1.csv,refseq-CHRND-NM_000751.2_clinical_seed_0_final,refseq-CHRND-NM_000751.2.a2m,Invitae,refseq-CHRND-NM_000751.2.npy,1,517,517
+NP_000751.1,MARLGNCSLTWAALIILLLPGSLEECGHISVSAPIVHLGDPITASCIIKQNCSHLDPEPQILWRLGAELQPGGRQQRLSDGTQESIITLPHLNHTQAFLSCCLNWGNSLQILDQVELRAGYPPAIPHNLSCLMNLTTSSLICQWEPGPETHLPTSFTLKSFKSRGNCQTQGDSILDCVPKDGQSHCCIPRKHLLLYQNMGIWVQAENALGTSMSPQLCLDPMDVVKLEPPMLRTMDPSPEAAPPQAGCLQLCWEPWQPGLHINQKCELRHKPQRGEASWALVGPLPLEALQYELCGLLPATAYTLQIRCIRWPLPGHWSDWSPSLELRTTERAPTVRLDTWWRQRQLDPRTVQLFWKPVPLEEDSGRIQGYVVSWRPSGQAGAILPLCNTTELSCTFHLPSEAQEVALVAYNSAGTSRPTPVVFSESRGPALTRLHAMARDPHSLWVGWEPPNPWPQGYVIEWGLGPPSASNSNKTWRMEQNGRATGFLLKENIRPFQLYEIIVTPLYQDTMGPSQHVYAYSQEMAPSHAPELHLKHIGKTWAQLEWVPEPPELGKSPLTHYTIFWTNAQNQSFSAILNASSRGFVLHGLEPASLYHIHLMAASQAGATNSTVLTLMTLTPEGSELHIILGLFGLLLLLTCLCGTAWLCCSPNRKNPLWPSVPDPAHSSLGSWVPTIMEEDAFQLPGLGTPPITKLTVLEEDEKKPVPWESHNSSETCGLPTLVQTYVLQGDPRAVSTQPQSQSGTSDQVLYGQLLGSPTSPGPGHYLRCDSTQPLLAGLTPSPKSYENLWFQASPLGTLVTPAPSQEDDCVFGPLLNFPLLQGIRVHGMEALGSF,836,NP_000751.1.csv,refseq-CSF3R-NM_000760.3_clinical_seed_0_final,refseq-CSF3R-NM_000760.3.a2m,Invitae,refseq-CSF3R-NM_000760.3.npy,1,836,836
+NP_000772.2,MLAKGLPPRSVLVKGCQTFLSAPREGLGRLRVPTGEGAGISTRSPRPFNEIPSPGDNGWLNLYHFWRETGTHKVHLHHVQNFQKYGPIYREKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKKDRVALNQEVMAPEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFESITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSKADIYTQNFYWELRQKGSVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDTTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHPISVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQQ,521,NP_000772.2.csv,refseq-CYP11A1-NM_000781.2_clinical_seed_0_final,refseq-CYP11A1-NM_000781.2.a2m,Invitae,refseq-CYP11A1-NM_000781.2.npy,1,521,521
+NP_000773.2,MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVCPLTAGGETQNAAALPGPTSWPLLGSLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYRTESAYPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSAFQKKLMKPGEVMKLDNKINEVLADFMGRIDELCDERGHVEDLYSELNKWSFESICLVLYEKRFGLLQKNAGDEAVNFIMAIKTMMSTFGRMMVTPVELHKSLNTKVWQDHTLAWDTIFKSVKACIDNRLEKYSQQPSADFLCDIYHQNRLSKKELYAAVTELQLAAVETTANSLMWILYNLSRNPQVQQKLLKEIQSVLPENQVPRAEDLRNMPYLKACLKESMRLTPSVPFTTRTLDKATVLGEYALPKGTVLMLNTQVLGSSEDNFEDSSQFRPERWLQEKEKINPFAHLPFGVGKRMCIGRRLAELQLHLALCWIVRKYDIQATDNEPVEMLHSGTLVPSRELPIAFCQR,514,NP_000773.2.csv,refseq-CYP24A1-NM_000782.4_clinical_seed_0_final,refseq-CYP24A1-NM_000782.4.a2m,Invitae,refseq-CYP24A1-NM_000782.4.npy,1,514,514
+NP_000775.1,MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAPGAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC,531,NP_000775.1.csv,refseq-CYP27A1-NM_000784.3_clinical_seed_0_final,refseq-CYP27A1-NM_000784.3.a2m,Invitae,refseq-CYP27A1-NM_000784.3.npy,1,531,531
+NP_000776.1,MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPSTPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNNVVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR,508,NP_000776.1.csv,refseq-CYP27B1-NM_000785.3_clinical_seed_0_final,refseq-CYP27B1-NM_000785.3.a2m,Invitae,refseq-CYP27B1-NM_000785.3.npy,1,508,508
+NP_000780.1,MGAASGRRGPGLLLPLPLLLLLPPQPALALDPGLQPGNFSADEAGAQLFAQSYNSSAEQVLFQSVAASWAHDTNITAENARRQEEAALLSQEFAEAWGQKAKELYEPIWQNFTDPQLRRIIGAVRTLGSANLPLAKRQQYNALLSNMSRIYSTAKVCLPNKTATCWSLDPDLTNILASSRSYAMLLFAWEGWHNAAGIPLKPLYEDFTALSNEAYKQDGFTDTGAYWRSWYNSPTFEDDLEHLYQQLEPLYLNLHAFVRRALHRRYGDRYINLRGPIPAHLLGDMWAQSWENIYDMVVPFPDKPNLDVTSTMLQQGWNATHMFRVAEEFFTSLELSPMPPEFWEGSMLEKPADGREVVCHASAWDFYNRKDFRIKQCTRVTMDQLSTVHHEMGHIQYYLQYKDLPVSLRRGANPGFHEAIGDVLALSVSTPEHLHKIGLLDRVTNDTESDINYLLKMALEKIAFLPFGYLVDQWRWGVFSGRTPPSRYNFDWWYLRTKYQGICPPVTRNETHFDAGAKFHVPNVTPYIRYFVSFVLQFQFHEALCKEAGYEGPLHQCDIYRSTKAGAKLRKVLQAGSSRPWQEVLKDMVGLDALDAQPLLKYFQPVTQWLQEQNQQNGEVLGWPEYQWHPPLPDNYPEGIDLVTDEAEASKFVEEYDRTSQVVWNEYAEANWNYNTNITTETSKILLQKNMQIANHTLKYGTQARKFDVNQLQNTTIKRIIKKVQDLERAALPAQELEEYNKILLDMETTYSVATVCHPNGSCLQLEPDLTNVMATSRKYEDLLWAWEGWRDKAGRAILQFYPKYVELINQAARLNGYVDAGDSWRSMYETPSLEQDLERLFQELQPLYLNLHAYVRRALHRHYGAQHINLEGPIPAHLLGNMWAQTWSNIYDLVVPFPSAPSMDTTEAMLKQGWTPRRMFKEADDFFTSLGLLPVPPEFWNKSMLEKPTDGREVVCHASAWDFYNGKDFRIKQCTTVNLEDLVVAHHEMGHIQYFMQYKDLPVALREGANPGFHEAIGDVLALSVSTPKHLHSLNLLSSEGGSDEHDINFLMKMALDKIAFIPFSYLVDQWRWRVFDGSITKENYNQEWWSLRLKYQGLCPPVPRTQGDFDPGAKFHIPSSVPYIRYFVSFIIQFQFHEALCQAAGHTGPLHKCDIYQSKEAGQRLATAMKLGFSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLRTENELHGEKLGWPQYNWTPNSARSEGPLPDSGRVSFLGLDLDAQQARVGQWLLLFLGIALLVATLGLSQRLFSIRHRSLHRHSHGPQFGSEVELRHS,1306,NP_000780.1.csv,refseq-ACE-NM_000789.3_clinical_seed_0_final,refseq-ACE-NM_000789.3.a2m,Invitae,refseq-ACE-NM_000789.3.npy,1,1306,1306
+NP_000797.2,MRKSPGLSDCLWAWILLLSTLTGRSYGQPSLQDELKDNTTVFTRILDRLLDGYDNRLRPGLGERVTEVKTDIFVTSFGPVSDHDMEYTIDVFFRQSWKDERLKFKGPMTVLRLNNLMASKIWTPDTFFHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRLTVRAECPMHLEDFPMDAHACPLKFGSYAYTRAEVVYEWTREPARSVVVAEDGSRLNQYDLLGQTVDSGIVQSSTGEYVVMTTHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGYAWDGKSVVPEKPKKVKDPLIKKNNTYAPTATSYTPNLARGDPGLATIAKSATIEPKEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLLFGIFNLVYWATYLNREPQLKAPTPHQ,456,NP_000797.2.csv,refseq-GABRA1-NM_000806.5_clinical_seed_0_final,refseq-GABRA1-NM_000806.5.a2m,Invitae,refseq-GABRA1-NM_000806.5.npy,1,456,456
+NP_000799.1,MIITQTSHCYMTSLGILFLINILPGTTGQGESRRQEPGDFVKQDIGGLSPKHAPDIPDDSTDNITIFTRILDRLLDGYDNRLRPGLGDAVTEVKTDIYVTSFGPVSDTDMEYTIDVFFRQTWHDERLKFDGPMKILPLNNLLASKIWTPDTFFHNGKKSVAHNMTTPNKLLRLVDNGTLLYTMRLTIHAECPMHLEDFPMDVHACPLKFGSYAYTTAEVVYSWTLGKNKSVEVAQDGSRLNQYDLLGHVVGTEIIRSSTGEYVVMTTHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRSWAWEGKKVPEALEMKKKTPAAPAKKTSTTFNIVGTTYPINLAKDTEFSTISKGAAPSASSTPTIIASPKATYVQDSPTETKTYNSVSKVDKISRIIFPVLFAIFNLVYWATYVNRESAIKGMIRKQ,492,NP_000799.1.csv,refseq-GABRA3-NM_000808.3_clinical_seed_0_final,refseq-GABRA3-NM_000808.3.a2m,Invitae,refseq-GABRA3-NM_000808.3_theta_0.2.npy,1,492,492
+NP_000801.1,MDNGMFSGFIMIKNLLLFCISMNLSSHFGFSQMPTSSVKDETNDNITIFTRILDGLLDGYDNRLRPGLGERITQVRTDIYVTSFGPVSDTEMEYTIDVFFRQSWKDERLRFKGPMQRLPLNNLLASKIWTPDTFFHNGKKSIAHNMTTPNKLLRLEDDGTLLYTMRLTISAECPMQLEDFPMDAHACPLKFGSYAYPNSEVVYVWTNGSTKSVVVAEDGSRLNQYHLMGQTVGTENISTSTGEYTIMTAHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGWAWDGKKALEAAKIKKKREVILNKSTNAFTTGKMSHPPNIPKEQTPAGTSNTTSVSVKPSEEKTSESKKTYNSISKIDKMSRIVFPVLFGTFNLVYWATYLNREPVIKGAASPK,462,NP_000801.1.csv,refseq-GABRA5-NM_000810.3_clinical_seed_0_final,refseq-GABRA5-NM_000810.3.a2m,Invitae,refseq-GABRA5-NM_000810.3.npy,1,462,462
+NP_000803.2,MWTVQNRESLGLLSFPVMITMVCCAHSTNEPSNMSYVKETVDRLLKGYDIRLRPDFGGPPVDVGMRIDVASIDMVSEVNMDYTLTMYFQQSWKDKRLSYSGIPLNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAACMMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWNGGEGAVTGVNKIELPQFSIVDYKMVSKKVEFTTGAYPRLSLSFRLKRNIGYFILQTYMPSTLITILSWVSFWINYDASAARVALGITTVLTMTTISTHLRETLPKIPYVKAIDIYLMGCFVFVFLALLEYAFVNYIFFGKGPQKKGASKQDQSANEKNKLEMNKVQVDAHGNILLSTLEIRNETSGSEVLTSVSDPKATMYSYDSASIQYRKPLSSREAYGRALDRHGVPSKGRIRRRASQLKVKIPDLTDVNSIDKWSRMFFPITFSLFNVVYWLYYVH,474,NP_000803.2.csv,refseq-GABRB1-NM_000812.4_clinical_seed_0_final,refseq-GABRB1-NM_000812.4.a2m,Invitae,refseq-GABRB1-NM_000812.4_theta_0.2.npy,1,474,474
+NP_000806.2,MDAPARLLAPLLLLCAQQLRGTRAMNDIGDYVGSNLEISWLPNLDGLIAGYARNFRPGIGGPPVNVALALEVASIDHISEANMEYTMTVFLHQSWRDSRLSYNHTNETLGLDSRFVDKLWLPDTFIVNAKSAWFHDVTVENKLIRLQPDGVILYSIRITSTVACDMDLAKYPMDEQECMLDLESYGYSSEDIVYYWSESQEHIHGLDKLQLAQFTITSYRFTTELMNFKSAGQFPRLSLHFHLRRNRGVYIIQSYMPSVLLVAMSWVSFWISQAAVPARVSLGITTVLTMTTLMVSARSSLPRASAIKALDVYFWICYVFVFAALVEYAFAHFNADYRKKQKAKVKVSRPRAEMDVRNAIVLFSLSAAGVTQELAISRRQRRVPGNLMGSYRSVGVETGETKKEGAARSGGQGGIRARLRPIDADTIDIYARAVFPAAFAAVNVIYWAAYAM,452,NP_000806.2.csv,refseq-GABRD-NM_000815.4_clinical_seed_0_final,refseq-GABRD-NM_000815.4.a2m,Invitae,refseq-GABRD-NM_000815.4.npy,1,452,452
+NP_000807.2,MSSPNIWSTGSSVYSTPVFSQKMTVWILLLLSLYPGFTSQKSDDDYEDYASNKTWVLTPKVPEGDVTVILNNLLEGYDNKLRPDIGVKPTLIHTDMYVNSIGPVNAINMEYTIDIFFAQTWYDRRLKFNSTIKVLRLNSNMVGKIWIPDTFFRNSKKADAHWITTPNRMLRIWNDGRVLYTLRLTIDAECQLQLHNFPMDEHSCPLEFSSYGYPREEIVYQWKRSSVEVGDTRSWRLYQFSFVGLRNTTEVVKTTSGDYVVMSVYFDLSRRMGYFTIQTYIPCTLIVVLSWVSFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVSYVTAMDLFVSVCFIFVFSALVEYGTLHYFVSNRKPSKDKDKKKKNPAPTIDIRPRSATIQMNNATHLQERDEEYGYECLDGKDCASFFCCFEDCRTGAWRHGRIHIRIAKMDSYARIFFPTAFCLFNLVYWVSYLYL,467,NP_000807.2.csv,refseq-GABRG2-NM_000816.3_clinical_seed_0_final,refseq-GABRG2-NM_000816.3.a2m,Invitae,refseq-GABRG2-NM_000816.3.npy,1,467,467
+NP_000812.2,MAVSAGSARTSPSSDKVQKDKAELISGPRQDSRIGKLLGFEWTDLSSWRRLVTLLNRPTDPASLAVFRFLFGFLMVLDIPQERGLSSLDRKYLDGLDVCRFPLLDALRPLPLDWMYLVYTIMFLGALGMMLGLCYRISCVLFLLPYWYVFLLDKTSWNNHSYLYGLLAFQLTFMDANHYWSVDGLLNAHRRNAHVPLWNYAVLRGQIFIVYFIAGVKKLDADWVEGYSMEYLSRHWLFSPFKLLLSEELTSLLVVHWGGLLLDLSAGFLLFFDVSRSIGLFFVSYFHCMNSQLFSIGMFSYVMLASSPLFCSPEWPRKLVSYCPRRLQQLLPLKAAPQPSVSCVYKRSRGKSGQKPGLRHQLGAAFTLLYLLEQLFLPYSHFLTQGYNNWTNGLYGYSWDMMVHSRSHQHVKITYRDGRTGELGYLNPGVFTQSRRWKDHADMLKQYATCLSRLLPKYNVTEPQIYFDIWVSINDRFQQRIFDPRVDIVQAAWSPFQRTSWVQPLLMDLSPWRAKLQEIKSSLDNHTEVVFIADFPGLHLENFVSEDLGNTSIQLLQGEVTVELVAEQKNQTLREGEKMQLPAGEYHKVYTTSPSPSCYMYVYVNTTELALEQDLAYLQELKEKVENGSETGPLPPELQPLLEGEVKGGPEPTPLVQTFLRRQQRLQEIERRRNTPFHERFFRFLLRKLYVFRRSFLMTCISLRNLILGRPSLEQLAQEVTYANLRPFEAVGELNPSNTDSSHSNPPESNPDPVHSEF,758,NP_000812.2.csv,VKGC_HUMAN_b01_clinical_seed_0_final,VKGC_HUMAN_b01.a2m,EVE,VKGC_HUMAN_b01_theta_0.2.npy,1,758,758
+NP_000814.2,MDRRMWGAHVFCVLSPLPTVLGHMHPECDFITQLREDESACLQAAEEMPNTTLGCPATWDGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWSEPFPPYPVACPVPLELLAEEESYFSTVKIIYTVGHSISIVALFVAITILVALRRLHCPRNYVHTQLFTTFILKAGAVFLKDAALFHSDDTDHCSFSTVLCKVSVAASHFATMTNFSWLLAEAVYLNCLLASTSPSSRRAFWWLVLAGWGLPVLFTGTWVSCKLAFEDIACWDLDDTSPYWWIIKGPIVLSVGVNFGLFLNIIRILVRKLEPAQGSLHTQSQYWRLSKSTLFLIPLFGIHYIIFNFLPDNAGLGIRLPLELGLGSFQGFIVAILYCFLNQEVRTEISRKWHGHDPELLPAWRTRAKWTTPSRSAAKVLTSMC,423,NP_000814.2.csv,refseq-GHRHR-NM_000823.3_clinical_seed_0_final,refseq-GHRHR-NM_000823.3.a2m,Invitae,refseq-GHRHR-NM_000823.3.npy,1,423,423
+NP_000824.1,MGRVGYWTLLVLPALLVWRGPAPSAAAEKGPPALNIAVMLGHSHDVTERELRTLWGPEQAAGLPLDVNVVALLMNRTDPKSLITHVCDLMSGARIHGLVFGDDTDQEAVAQMLDFISSHTFVPILGIHGGASMIMADKDPTSTFFQFGASIQQQATVMLKIMQDYDWHVFSLVTTIFPGYREFISFVKTTVDNSFVGWDMQNVITLDTSFEDAKTQVQLKKIHSSVILLYCSKDEAVLILSEARSLGLTGYDFFWIVPSLVSGNTELIPKEFPSGLISVSYDDWDYSLEARVRDGIGILTTAASSMLEKFSYIPEAKASCYGQMERPEVPMHTLHPFMVNVTWDGKDLSFTEEGYQVHPRLVVIVLNKDREWEKVGKWENHTLSLRHAVWPRYKSFSDCEPDDNHLSIVTLEEAPFVIVEDIDPLTETCVRNTVPCRKFVKINNSTNEGMNVKKCCKGFCIDILKKLSRTVKFTYDLYLVTNGKHGKKVNNVWNGMIGEVVYQRAVMAVGSLTINEERSEVVDFSVPFVETGISVMVSRSNGTVSPSAFLEPFSASVWVMMFVMLLIVSAIAVFVFEYFSPVGYNRNLAKGKAPHGPSFTIGKAIWLLWGLVFNNSVPVQNPKGTTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEFVDQVTGLSDKKFQRPHDYSPPFRFGTVPNGSTERNIRNNYPYMHQYMTKFNQKGVEDALVSLKTGKLDAFIYDAAVLNYKAGRDEGCKLVTIGSGYIFATTGYGIALQKGSPWKRQIDLALLQFVGDGEMEELETLWLTGICHNEKNEVMSSQLDIDNMAGVFYMLAAAMALSLITFIWEHLFYWKLRFCFTGVCSDRPGLLFSISRGIYSCIHGVHIEEKKKSPDFNLTGSQSNMLKLLRSAKNISSMSNMNSSRMDSPKRAADFIQRGSLIMDMVSDKGNLMYSDNRSFQGKESIFGDNMNELQTFVANRQKDNLNNYVFQGQHPLTLNESNPNTVEVAVSTESKANSRPRQLWKKSVDSIRQDSLSQNPVSQRDEATAENRTHSLKSPRYLPEEMAHSDISETSNRATCHREPDNSKNHKTKDNFKRSVASKYPKDCSEVERTYLKTKSSSPRDKIYTIDGEKEPGFHLDPPQFVENVTLPENVDFPDPYQDPSENFRKGDSTLPMNRNPLHNEEGLSNNDQYKLYSKHFTLKDKGSPHSETSERYRQNSTHCRSCLSNMPTYSGHFTMRSPFKCDACLRMGNLYDIDEDQMLQETGNPATGEQVYQQDWAQNNALQLQKNKLRISRQHSYDNIVDKPRELDLSRPSRSISLKDRERLLEGNFYGSLFSVPSSKLSGKKSSLFPQGLEDSKRSKSLLPDHTSDNPFLHSHRDDQRLVIGRCPSDPYKHSLPSQAVNDSYLRSSLRSTASYCSRDSRGHNDVYISEHVMPYAANKNNMYSTPRVLNSCSNRRVYKKMPSIESDV,1464,NP_000824.1.csv,refseq-GRIN2A-NM_000833.4_clinical_seed_0_final,refseq-GRIN2A-NM_000833.4.a2m,Invitae,refseq-GRIN2A-NM_000833.4.npy,1,1464,1464
+NP_000825.2,MKPRAECCSPKFWLVLAVLAVSGSRARSQKSPPSIGIAVILVGTSDEVAIKDAHEKDDFHHLSVVPRVELVAMNETDPKSIITRICDLMSDRKIQGVVFADDTDQEAIAQILDFISAQTLTPILGIHGGSSMIMADKDESSMFFQFGPSIEQQASVMLNIMEEYDWYIFSIVTTYFPGYQDFVNKIRSTIENSFVGWELEEVLLLDMSLDDGDSKIQNQLKKLQSPIILLYCTKEEATYIFEVANSVGLTGYGYTWIVPSLVAGDTDTVPAEFPTGLISVSYDEWDYGLPARVRDGIAIITTAASDMLSEHSFIPEPKSSCYNTHEKRIYQSNMLNRYLINVTFEGRNLSFSEDGYQMHPKLVIILLNKERKWERVGKWKDKSLQMKYYVWPRMCPETEEQEDDHLSIVTLEEAPFVIVESVDPLSGTCMRNTVPCQKRIVTENKTDEEPGYIKKCCKGFCIDILKKISKSVKFTYDLYLVTNGKHGKKINGTWNGMIGEVVMKRAYMAVGSLTINEERSEVVDFSVPFIETGISVMVSRSNGTVSPSAFLEPFSADVWVMMFVMLLIVSAVAVFVFEYFSPVGYNRCLADGREPGGPSFTIGKAIWLLWGLVFNNSVPVQNPKGTTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEYVDQVSGLSDKKFQRPNDFSPPFRFGTVPNGSTERNIRNNYAEMHAYMGKFNQRGVDDALLSLKTGKLDAFIYDAAVLNYMAGRDEGCKLVTIGSGKVFASTGYGIAIQKDSGWKRQVDLAILQLFGDGEMEELEALWLTGICHNEKNEVMSSQLDIDNMAGVFYMLGAAMALSLITFICEHLFYWQFRHCFMGVCSGKPGMVFSISRGIYSCIHGVAIEERQSVMNSPTATMNNTHSNILRLLRTAKNMANLSGVNGSPQSALDFIRRESSVYDISEHRRSFTHSDCKSYNNPPCEENLFSDYISEVERTFGNLQLKDSNVYQDHYHHHHRPHSIGSASSIDGLYDCDNPPFTTQSRSISKKPLDIGLPSSKHSQLSDLYGKFSFKSDRYSGHDDLIRSDVSDISTHTVTYGNIEGNAAKRRKQQYKDSLKKRPASAKSRREFDEIELAYRRRPPRSPDHKRYFRDKEGLRDFYLDQFRTKENSPHWEHVDLTDIYKERSDDFKRDSVSGGGPCTNRSHIKHGTGDKHGVVSGVPAPWEKNLTNVEWEDRSGGNFCRSCPSKLHNYSTTVTGQNSGRQACIRCEACKKAGNLYDISEDNSLQELDQPAAPVAVTSNASTTKYPQSPTNSKAQKKNRNKLRRQHSYDTFVDLQKEEAALAPRSVSLKDKGRFMDGSPYAHMFEMSAGESTFANNKSSVPTAGHHHHNNPGGGYMLSKSLYPDRVTQNPFIPTFGDDQCLLHGSKSYFFRQPTVAGASKARPDFRALVTNKPVVSALHGAVPARFQKDICIGNQSNPCVPNNKNPRAFNGSSNGHVYEKLSSIESDV,1484,NP_000825.2.csv,refseq-GRIN2B-NM_000834.3_clinical_seed_0_final,refseq-GRIN2B-NM_000834.3.a2m,Invitae,refseq-GRIN2B-NM_000834.3.npy,1,1484,1484
+NP_000827.2,MRGAGGPRGPRGPAKMLLLLALACASPFPEEAPGPGGAGGPGGGLGGARPLNVALVFSGPAYAAEAARLGPAVAAAVRSPGLDVRPVALVLNGSDPRSLVLQLCDLLSGLRVHGVVFEDDSRAPAVAPILDFLSAQTSLPIVAVHGGAALVLTPKEKGSTFLQLGSSTEQQLQVIFEVLEEYDWTSFVAVTTRAPGHRAFLSYIEVLTDGSLVGWEHRGALTLDPGAGEAVLSAQLRSVSAQIRLLFCAREEAEPVFRAAEEAGLTGSGYVWFMVGPQLAGGGGSGAPGEPPLLPGGAPLPAGLFAVRSAGWRDDLARRVAAGVAVVARGAQALLRDYGFLPELGHDCRAQNRTHRGESLHRYFMNITWDNRDYSFNEDGFLVNPSLVVISLTRDRTWEVVGSWEQQTLRLKYPLWSRYGRFLQPVDDTQHLTVATLEERPFVIVEPADPISGTCIRDSVPCRSQLNRTHSPPPDAPRPEKRCCKGFCIDILKRLAHTIGFSYDLYLVTNGKHGKKIDGVWNGMIGEVFYQRADMAIGSLTINEERSEIVDFSVPFVETGISVMVARSNGTVSPSAFLEPYSPAVWVMMFVMCLTVVAVTVFIFEYLSPVGYNRSLATGKRPGGSTFTIGKSIWLLWALVFNNSVPVENPRGTTSKIMVLVWAFFAVIFLASYTANLAAFMIQEEYVDTVSGLSDRKFQRPQEQYPPLKFGTVPNGSTEKNIRSNYPDMHSYMVRYNQPRVEEALTQLKAGKLDAFIYDAAVLNYMARKDEGCKLVTIGSGKVFATTGYGIALHKGSRWKRPIDLALLQFLGDDEIEMLERLWLSGICHNDKIEVMSSKLDIDNMAGVFYMLLVAMGLSLLVFAWEHLVYWRLRHCLGPTHRMDFLLAFSRGMYSCCSAEAAPPPAKPPPPPQPLPSPAYPAPRPAPGPAPFVPRERASVDRWRRTKGAGPPGGAGLADGFHRYYGPIEPQGLGLGLGEARAAPRGAAGRPLSPPAAQPPQKPPPSYFAIVRDKEPAEPPAGAFPGFPSPPAPPAAAATAVGPPLCRLAFEDESPPAPARWPRSDPESQPLLGPGAGGAGGTGGAGGGAPAAPPPCRAAPPPCPYLDLEPSPSDSEDSESLGGASLGGLEPWWFADFPYPYAERLGPPPGRYWSVDKLGGWRAGSWDYLPPRSGPAAWHCRHCASLELLPPPRHLSCSHDGLDGGWWAPPPPPWAAGPLPRRRARCGCPRSHPHRPRASHRTPAAAAPHHHRHRRAAGGWDLPPPAPTSRSLEDLSSCPRAAPARRLTGPSRHARRCPHAAHWGPPLPTASHRRHRGGDLGTRRGSAHFSSLESEV,1336,NP_000827.2.csv,refseq-GRIN2D-NM_000836.2_clinical_seed_0_final,refseq-GRIN2D-NM_000836.2.a2m,Invitae,refseq-GRIN2D-NM_000836.2.npy,1,1336,1336
+NP_000834.2,MARPRRAREPLLVALLPLAWLAQAGLARAAGSVRLAGGLTLGGLFPVHARGAAGRACGQLKKEQGVHRLEAMLYALDRVNADPELLPGVRLGARLLDTCSRDTYALEQALSFVQALIRGRGDGDEVGVRCPGGVPPLRPAPPERVVAVVGASASSVSIMVANVLRLFAIPQISYASTAPELSDSTRYDFFSRVVPPDSYQAQAMVDIVRALGWNYVSTLASEGNYGESGVEAFVQISREAGGVCIAQSIKIPREPKPGEFSKVIRRLMETPNARGIIIFANEDDIRRVLEAARQANLTGHFLWVGSDSWGAKTSPILSLEDVAVGAITILPKRASIDGFDQYFMTRSLENNRRNIWFAEFWEENFNCKLTSSGTQSDDSTRKCTGEERIGRDSTYEQEGKVQFVIDAVYAIAHALHSMHQALCPGHTGLCPAMEPTDGRMLLQYIRAVRFNGSAGTPVMFNENGDAPGRYDIFQYQATNGSASSGGYQAVGQWAETLRLDVEALQWSGDPHEVPSSLCSLPCGPGERKKMVKGVPCCWHCEACDGYRFQVDEFTCEACPGDMRPTPNHTGCRPTPVVRLSWSSPWAAPPLLLAVLGIVATTTVVATFVRYNNTPIVRASGRELSYVLLTGIFLIYAITFLMVAEPGAAVCAARRLFLGLGTTLSYSALLTKTNRIYRIFEQGKRSVTPPPFISPTSQLVITFSLTSLQVVGMIAWLGARPPHSVIDYEEQRTVDPEQARGVLKCDMSDLSLIGCLGYSLLLMVTCTVYAIKARGVPETFNEAKPIGFTMYTTCIIWLAFVPIFFGTAQSAEKIYIQTTTLTVSLSLSASVSLGMLYVPKTYVILFHPEQNVQKRKRSLKATSTVAAPPKGEDAEAHK,877,NP_000834.2.csv,refseq-GRM6-NM_000843.3_clinical_seed_0_final,refseq-GRM6-NM_000843.3.a2m,Invitae,refseq-GRM6-NM_000843.3.npy,1,877,877
+NP_000835.1,MVQLRKLLRVLTLMKFPCCVLEVLLCALAAAARGQEMYAPHSIRIEGDVTLGGLFPVHAKGPSGVPCGDIKRENGIHRLEAMLYALDQINSDPNLLPNVTLGARILDTCSRDTYALEQSLTFVQALIQKDTSDVRCTNGEPPVFVKPEKVVGVIGASGSSVSIMVANILRLFQIPQISYASTAPELSDDRRYDFFSRVVPPDSFQAQAMVDIVKALGWNYVSTLASEGSYGEKGVESFTQISKEAGGLCIAQSVRIPQERKDRTIDFDRIIKQLLDTPNSRAVVIFANDEDIKQILAAAKRADQVGHFLWVGSDSWGSKINPLHQHEDIAEGAITIQPKRATVEGFDAYFTSRTLENNRRNVWFAEYWEENFNCKLTISGSKKEDTDRKCTGQERIGKDSNYEQEGKVQFVIDAVYAMAHALHHMNKDLCADYRGVCPEMEQAGGKKLLKYIRNVNFNGSAGTPVMFNKNGDAPGRYDIFQYQTTNTSNPGYRLIGQWTDELQLNIEDMQWGKGVREIPASVCTLPCKPGQRKKTQKGTPCCWTCEPCDGYQYQFDEMTCQHCPYDQRPNENRTGCQDIPIIKLEWHSPWAVIPVFLAMLGIIATIFVMATFIRYNDTPIVRASGRELSYVLLTGIFLCYIITFLMIAKPDVAVCSFRRVFLGLGMCISYAALLTKTNRIYRIFEQGKKSVTAPRLISPTSQLAITSSLISVQLLGVFIWFGVDPPNIIIDYDEHKTMNPEQARGVLKCDITDLQIICSLGYSILLMVTCTVYAIKTRGVPENFNEAKPIGFTMYTTCIVWLAFIPIFFGTAQSAEKLYIQTTTLTISMNLSASVALGMLYMPKVYIIIFHPELNVQKRKRSFKAVVTAATMSSRLSHKPSDRPNGEAKTELCENVDPNSPAAKKKYVSYNNLVI,915,NP_000835.1.csv,refseq-GRM7-NM_000844.3_clinical_seed_0_final,refseq-GRM7-NM_000844.3.a2m,Invitae,refseq-GRM7-NM_000844.3.npy,1,915,915
+NP_000851.2,MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALDEQFEPQKTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLNAICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALNGAIMKITTSKGIHFQDYDTTPFQAKTQ,266,NP_000851.2.csv,NP_000851.2_clinical_seed_0_final,NP_000851.2.a2m,popEVE,NP_000851.2_theta_0.2.npy,1,266,266
+NP_000866.1,MKSGSGGGSPTSLWGLLFLSAALSLWPTSGEICGPGIDIRNDYQQLKRLENCTVIEGYLHILLISKAEDYRSYRFPKLTVITEYLLLFRVAGLESLGDLFPNLTVIRGWKLFYNYALVIFEMTNLKDIGLYNLRNITRGAIRIEKNADLCYLSTVDWSLILDAVSNNYIVGNKPPKECGDLCPGTMEEKPMCEKTTINNEYNYRCWTTNRCQKMCPSTCGKRACTENNECCHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFCANILSAESSDSEGFVIHDGECMQECPSGFIRNGSQSMYCIPCEGPCPKVCEEEKKTKTIDSVTSAQMLQGCTIFKGNLLINIRRGNNIASELENFMGLIEVVTGYVKIRHSHALVSLSFLKNLRLILGEEQLEGNYSFYVLDNQNLQQLWDWDHRNLTIKAGKMYFAFNPKLCVSEIYRMEEVTGTKGRQSKGDINTRNNGERASCESDVLHFTSTTTSKNRIIITWHRYRPPDYRDLISFTVYYKEAPFKNVTEYDGQDACGSNSWNMVDVDLPPNKDVEPGILLHGLKPWTQYAVYVKAVTLTMVENDHIRGAKSEILYIRTNASVPSIPLDVLSASNSSSQLIVKWNPPSLPNGNLSYYIVRWQRQPQDGYLYRHNYCSKDKIPIRKYADGTIDIEEVTENPKTEVCGGEKGPCCACPKTEAEKQAEKEEAEYRKVFENFLHNSIFVPRPERKRRDVMQVANTTMSSRSRNTTAADTYNITDPEELETEYPFFESRVDNKERTVISNLRPFTLYRIDIHSCNHEAEKLGCSASNFVFARTMPAEGADDIPGPVTWEPRPENSIFLKWPEPENPNGLILMYEIKYGSQVEDQRECVSRQEYRKYGGAKLNRLNPGNYTARIQATSLSGNGSWTDPVFFYVQAKTGYENFIHLIIALPVAVLLIVGGLVIMLYVFHRKRNNSRLGNGVLYASVNPEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKGVVKDEPETRVAIKTVNEAASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIMELMTRGDLKSYLRSLRPEMENNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARNCMVAEDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGVVLWEIATLAEQPYQGLSNEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFLEIISSIKEEMEPGFREVSFYYSEENKLPEPEELDLEPENMESVPLDPSASSSSLPLPDRHSGHKAENGPGPGVLVLRASFDERQPYAHMNGGRKNERALPLPQSSTC,1367,NP_000866.1.csv,refseq-IGF1R-NM_000875.4_clinical_seed_0_final,refseq-IGF1R-NM_000875.4.a2m,Invitae,refseq-IGF1R-NM_000875.4.npy,1,1367,1367
+NP_000874.2,MEGPLTPPPLQGGGAAAVPEPGARQHPGHETAAQRYSARLLQAGYEPESPRLDLATHPTTPRSELSSVVLLAGVGVQMDRLRRASMADYLISGGTGYVPEDGLTAQQLFASADGLTYNDFLILPGFIDFIADEVDLTSALTRKITLKTPLISSPMDTVTEADMAIAMALMGGIGFIHHNCTPEFQANEVRKVKKFEQGFITDPVVLSPSHTVGDVLEAKMRHGFSGIPITETGTMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELVVAPAGVTLKEANEILQRSKKGKLPIVNDCDELVAIIARTDLKKNRDYPLASKDSQKQLLCGAAVGTREDDKYRLDLLTQAGVDVIVLDSSQGNSVYQIAMVHYIKQKYPHLQVIGGNVVTAAQAKNLIDAGVDGLRVGMGCGSICITQEVMACGRPQGTAVYKVAEYARRFGVPIIADGGIQTVGHVVKALALGASTVMMGSLLAATTEAPGEYFFSDGVRLKKYRGMGSLDAMEKSSSSQKRYFSEGDKVKIAQGVSGSIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTMSAQIEGGVHGLHSYEKRLY,599,NP_000874.2.csv,refseq-IMPDH1-NM_000883.3_clinical_seed_0_final,refseq-IMPDH1-NM_000883.3.a2m,Invitae,refseq-IMPDH1-NM_000883.3.npy,1,599,599
+NP_000879.2,MGIELLCLFFLFLGRNDHVQGGCALGGAETCEDCLLIGPQCAWCAQENFTHPSGVGERCDTPANLLAKGCQLNFIENPVSQVEILKNKPLSVGRQKNSSDIVQIAPQSLILKLRPGGAQTLQVHVRQTEDYPVDLYYLMDLSASMDDDLNTIKELGSRLSKEMSKLTSNFRLGFGSFVEKPVSPFVKTTPEEIANPCSSIPYFCLPTFGFKHILPLTNDAERFNEIVKNQKISANIDTPEGGFDAIMQAAVCKEKIGWRNDSLHLLVFVSDADSHFGMDSKLAGIVIPNDGLCHLDSKNEYSMSTVLEYPTIGQLIDKLVQNNVLLIFAVTQEQVHLYENYAKLIPGATVGLLQKDSGNILQLIISAYEELRSEVELEVLGDTEGLNLSFTAICNNGTLFQHQKKCSHMKVGDTASFSVTVNIPHCERRSRHIIIKPVGLGDALELLVSPECNCDCQKEVEVNSSKCHHGNGSFQCGVCACHPGHMGPRCECGEDMLSTDSCKEAPDHPSCSGRGDCYCGQCICHLSPYGNIYGPYCQCDNFSCVRHKGLLCGGNGDCDCGECVCRSGWTGEYCNCTTSTDSCVSEDGVLCSGRGDCVCGKCVCTNPGASGPTCERCPTCGDPCNSKRSCIECHLSAAGQAREECVDKCKLAGATISEEEDFSKDGSVSCSLQGENECLITFLITTDNEGKTIIHSINEKDCPKPPNIPMIMLGVSLAILLIGVVLLCIWKLLVSFHDRKEVAKFEAERSKAKWQTGTNPLYRGSTSTFKNVTYKHREKQKVDLSTDC,788,NP_000879.2.csv,refseq-ITGB6-NM_000888.4_clinical_seed_0_final,refseq-ITGB6-NM_000888.4.a2m,Invitae,refseq-ITGB6-NM_000888.4.npy,1,788,788
+NP_000881.3,MAGDSRNAMNQDMEIGVTPWDPKKIPKQARDYVPIATDRTRLLAEGKKPRQRYMEKSGKCNVHHGNVQETYRYLSDLFTTLVDLKWRFNLLVFTMVYTVTWLFFGFIWWLIAYIRGDLDHVGDQEWIPCVENLSGFVSAFLFSIETETTIGYGFRVITEKCPEGIILLLVQAILGSIVNAFMVGCMFVKISQPKKRAETLMFSNNAVISMRDEKLCLMFRVGDLRNSHIVEASIRAKLIKSRQTKEGEFIPLNQTDINVGFDTGDDRLFLVSPLIISHEINQKSPFWEMSQAQLHQEEFEVVVILEGMVEATGMTCQARSSYMDTEVLWGHRFTPVLTLEKGFYEVDYNTFHDTYETNTPSCCAKELAEMKREGRLLQYLPSPPLLGGCAEAGLDAEAEQNEEDEPKGLGGSREARGSV,419,NP_000881.3.csv,refseq-KCNJ5-NM_000890.3_clinical_seed_0_final,refseq-KCNJ5-NM_000890.3.a2m,Invitae,refseq-KCNJ5-NM_000890.3.npy,1,419,419
+NP_000882.1,MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDASKEGKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEDDSENGVPESTSTDTPPDIDLHNQASVPLEPRPLRRESEI,427,NP_000882.1.csv,refseq-KCNJ2-NM_000891.2_clinical_seed_0_final,refseq-KCNJ2-NM_000891.2.a2m,Invitae,refseq-KCNJ2-NM_000891.2.npy,1,427,427
+NP_000883.2,MILFKQATYFISLFATVSCGCLTQLYENAFFRGGDVASMYTPNAQYCQMRCTFHPRCLLFSFLPASSINDMEKRFGCFLKDSVTGTLPKVHRTGAVSGHSLKQCGHQISACHRDIYKGVDMRGVNFNVSKVSSVEECQKRCTSNIRCQFFSYATQTFHKAEYRNNCLLKYSPGGTPTAIKVLSNVESGFSLKPCALSEIGCHMNIFQHLAFSDVDVARVLTPDAFVCRTICTYHPNCLFFTFYTNVWKIESQRNVCLLKTSESGTPSSSTPQENTISGYSLLTCKRTLPEPCHSKIYPGVDFGGEELNVTFVKGVNVCQETCTKMIRCQFFTYSLLPEDCKEEKCKCFLRLSMDGSPTRIAYGTQGSSGYSLRLCNTGDNSVCTTKTSTRIVGGTNSSWGEWPWQVSLQVKLTAQRHLCGGSLIGHQWVLTAAHCFDGLPLQDVWRIYSGILNLSDITKDTPFSQIKEIIIHQNYKVSEGNHDIALIKLQAPLNYTEFQKPICLPSKGDTSTIYTNCWVTGWGFSKEKGEIQNILQKVNIPLVTNEECQKRYQDYKITQRMVCAGYKEGGKDACKGDSGGPLVCKHNGMWRLVGITSWGEGCARREQPGVYTKVAEYMDWILEKTQSSDGKAQMQSPA,638,NP_000883.2.csv,refseq-KLKB1-NM_000892.4_clinical_seed_0_final,refseq-KLKB1-NM_000892.4.a2m,Invitae,refseq-KLKB1-NM_000892.4.npy,1,638,638
+NP_000890.1,MKKTQTWILTCIYLQLLLFNPLVKTEGICRNRVTNNVKDVTKLVANLPKDYMITLKYVPGMDVLPSHCWISEMVVQLSDSLTDLLDKFSNISEGLSNYSIIDKLVNIVDDLVECVKENSSKDLKKSFKSPEPRLFTPEEFFRIFNRSIDAFKDFVVASETSDCVVSSTLSPEKDSRVSVTKPFMLPPVAASSLRNDSSSSNRKAKNPPGDSSLHWAAMALPALFSLIIGFAFGALYWKKRQPSLTRAVENIQINEEDNEISMLQEKEREFQEV,273,NP_000890.1.csv,refseq-KITLG-NM_000899.4_clinical_seed_0_final,refseq-KITLG-NM_000899.4.a2m,Invitae,refseq-KITLG-NM_000899.4.npy,1,273,273
+NP_000892.2,METKGYHSLPEGLDMERRWGQVSQAVERSSLGPTERTDENNYMEIVNVSCVSGAIPNNSTQGSSKEKQELLPCLQQDNNRPGILTSDIKTELESKELSATVAESMGLYMDSVRDADYSYEQQNQQGSMSPAKIYQNVEQLVKFYKGNGHRPSTLSCVNTPLRSFMSDSGSSVNGGVMRAVVKSPIMCHEKSPSVCSPLNMTSSVCSPAGINSVSSTTASFGSFPVHSPITQGTPLTCSPNVENRGSRSHSPAHASNVGSPLSSPLSSMKSSISSPPSHCSVKSPVSSPNNVTLRSSVSSPANINNSRCSVSSPSNTNNRSTLSSPAASTVGSICSPVNNAFSYTASGTSAGSSTLRDVVPSPDTQEKGAQEVPFPKTEEVESAISNGVTGQLNIVQYIKPEPDGAFSSSCLGGNSKINSDSSFSVPIKQESTKHSCSGTSFKGNPTVNPFPFMDGSYFSFMDDKDYYSLSGILGPPVPGFDGNCEGSGFPVGIKQEPDDGSYYPEASIPSSAIVGVNSGGQSFHYRIGAQGTISLSRSARDQSFQHLSSFPPVNTLVESWKSHGDLSSRRSDGYPVLEYIPENVSSSTLRSVSTGSSRPSKICLVCGDEASGCHYGVVTCGSCKVFFKRAVEGQHNYLCAGRNDCIIDKIRRKNCPACRLQKCLQAGMNLGARKSKKLGKLKGIHEEQPQQQQPPPPPPPPQSPEEGTTYIAPAKEPSVNTALVPQLSTISRALTPSPVMVLENIEPEIVYAGYDSSKPDTAENLLSTLNRLAGKQMIQVVKWAKVLPGFKNLPLEDQITLIQYSWMCLSSFALSWRSYKHTNSQFLYFAPDLVFNEEKMHQSAMYELCQGMHQISLQFVRLQLTFEEYTIMKVLLLLSTIPKDGLKSQAAFEEMRTNYIKELRKMVTKCPNNSGQSWQRFYQLTKLLDSMHDLVSDLLEFCFYTFRESHALKVEFPAMLVEIISDQLPKVESGNAKPLYFHRK,984,NP_000892.2.csv,refseq-NR3C2-NM_000901.4_clinical_seed_0_final,refseq-NR3C2-NM_000901.4.a2m,Invitae,refseq-NR3C2-NM_000901.4.npy,1,984,984
+NP_000894.1,MVGRRALIVLAHSERTSFNYAMKEAAAAALKKKGWEVVESDLYAMNFNPIISRKDITGKLKDPANFQYPAESVLAYKEGHLSPDIVAEQKKLEAADLVIFQFPLQWFGVPAILKGWFERVFIGEFAYTYAAMYDKGPFRSKKAVLSITTGGSGSMYSLQGIHGDMNVILWPIQSGILHFCGFQVLEPQLTYSIGHTPADARIQILEGWKKRLENIWDETPLYFAPSSLFDLNFQAGFLMKKEVQDEEKNKKFGLSVGHHLGKSIPTDNQIKARK,274,NP_000894.1.csv,refseq-NQO1-NM_000903.2_clinical_seed_0_final,refseq-NQO1-NM_000903.2.a2m,Invitae,refseq-NQO1-NM_000903.2.npy,1,274,274
+NP_000899.1,MPSLLVLTFSPCVLLGWALLAGGTGGGGVGGGGGGAGIGGGRQEREALPPQKIEVLVLLPQDDSYLFSLTRVRPAIEYALRSVEGNGTGRRLLPPGTRFQVAYEDSDCGNRALFSLVDRVAAARGAKPDLILGPVCEYAAAPVARLASHWDLPMLSAGALAAGFQHKDSEYSHLTRVAPAYAKMGEMMLALFRHHHWSRAALVYSDDKLERNCYFTLEGVHEVFQEEGLHTSIYSFDETKDLDLEDIVRNIQASERVVIMCASSDTIRSIMLVAHRHGMTSGDYAFFNIELFNSSSYGDGSWKRGDKHDFEAKQAYSSLQTVTLLRTVKPEFEKFSMEVKSSVEKQGLNMEDYVNMFVEGFHDAILLYVLALHEVLRAGYSKKDGGKIIQQTWNRTFEGIAGQVSIDANGDRYGDFSVIAMTDVEAGTQEVIGDYFGKEGRFEMRPNVKYPWGPLKLRIDENRIVEHTNSSPCKSCGLEESAVTGIVVGALLGAGLLMAFYFFRKKYRITIERRTQQEESNLGKHRELREDSIRSHFSVA,540,NP_000899.1.csv,refseq-NPR3-NM_000908.3_clinical_seed_0_final,refseq-NPR3-NM_000908.3.a2m,Invitae,refseq-NPR3-NM_000908.3.npy,1,540,540
+NP_000909.2,MLRRALLCLAVAALVRADAPEEEDHVLVLRKSNFAEALAAHKYLLVEFYAPWCGHCKALAPEYAKAAGKLKAEGSEIRLAKVDATEESDLAQQYGVRGYPTIKFFRNGDTASPKEYTAGREADDIVNWLKKRTGPAATTLPDGAAAESLVESSEVAVIGFFKDVESDSAKQFLQAAEAIDDIPFGITSNSDVFSKYQLDKDGVVLFKKFDEGRNNFEGEVTKENLLDFIKHNQLPLVIEFTEQTAPKIFGGEIKTHILLFLPKSVSDYDGKLSNFKTAAESFKGKILFIFIDSDHTDNQRILEFFGLKKEECPAVRLITLEEEMTKYKPESEELTAERITEFCHRFLEGKIKPHLMSQELPEDWDKQPVKVLVGKNFEDVAFDEKKNVFVEFYAPWCGHCKQLAPIWDKLGETYKDHENIVIAKMDSTANEVEAVKVHSFPTLKFFPASADRTVIDYNGERTLDGFKKFLESGGQDGAGDDDDLEDLEEAEEPDMEEDDDQKAVKDEL,508,NP_000909.2.csv,refseq-P4HB-NM_000918.3_clinical_seed_0_final,refseq-P4HB-NM_000918.3.a2m,Invitae,refseq-P4HB-NM_000918.3.npy,1,508,508
+NP_000911.2,MLKFRTVHGGLRLLGIRRTSTAPAASPNVRRLEYKPIKKVMVANRGEIAIRVFRACTELGIRTVAIYSEQDTGQMHRQKADEAYLIGRGLAPVQAYLHIPDIIKVAKENNVDAVHPGYGFLSERADFAQACQDAGVRFIGPSPEVVRKMGDKVEARAIAIAAGVPVVPGTDAPITSLHEAHEFSNTYGFPIIFKAAYGGGGRGMRVVHSYEELEENYTRAYSEALAAFGNGALFVEKFIEKPRHIEVQILGDQYGNILHLYERDCSIQRRHQKVVEIAPAAHLDPQLRTRLTSDSVKLAKQVGYENAGTVEFLVDRHGKHYFIEVNSRLQVEHTVTEEITDVDLVHAQIHVAEGRSLPDLGLRQENIRINGCAIQCRVTTEDPARSFQPDTGRIEVFRSGEGMGIRLDNASAFQGAVISPHYDSLLVKVIAHGKDHPTAATKMSRALAEFRVRGVKTNIAFLQNVLNNQQFLAGTVDTQFIDENPELFQLRPAQNRAQKLLHYLGHVMVNGPTTPIPVKASPSPTDPVVPAVPIGPPPAGFRDILLREGPEGFARAVRNHPGLLLMDTTFRDAHQSLLATRVRTHDLKKIAPYVAHNFSKLFSMENWGGATFDVAMRFLYECPWRRLQELRELIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEVAKENGMDVFRVFDSLNYLPNMLLGMEAAGSAGGVVEAAISYTGDVADPSRTKYSLQYYMGLAEELVRAGTHILCIKDMAGLLKPTACTMLVSSLRDRFPDLPLHIHTHDTSGAGVAAMLACAQAGADVVDVAADSMSGMTSQPSMGALVACTRGTPLDTEVPMERVFDYSEYWEGARGLYAAFDCTATMKSGNSDVYENEIPGGQYTNLHFQAHSMGLGSKFKEVKKAYVEANQMLGDLIKVTPSSKIVGDLAQFMVQNGLSRAEAEAQAEELSFPRSVVEFLQGYIGVPHGGFPEPFRSKVLKDLPRVEGRPGASLPPLDLQALEKELVDRHGEEVTPEDVLSAAMYPDVFAHFKDFTATFGPLDSLNTRLFLQGPKIAEEFEVELERGKTLHIKALAVSDLNRAGQRQVFFELNGQLRSILVKDTQAMKEMHFHPKALKDVKGQIGAPMPGKVIDIKVVAGAKVAKGQPLCVLSAMKMETVVTSPMEGTVRKVHVTKDMTLEGDDLILEIE,1178,NP_000911.2.csv,refseq-PC-NM_000920.3_clinical_seed_0_final,refseq-PC-NM_000920.3.a2m,Invitae,refseq-PC-NM_000920.3.npy,1,1178,1178
+NP_000912.3,MAVPGDAARVRDKPVHSGVSQAPTAGRDCHHRADPASPRDSGCRGCWGDLVLQPLRSSRKLSSALCAGSLSFLLALLVRLVRGEVGCDLEQCKEAAAAEEEEAAPGAEGGVFPGPRGGAPGGGARLSPWLQPSALLFSLLCAFFWMGLYLLRAGVRLPLAVALLAACCGGEALVQIGLGVGEDHLLSLPAAGVVLSCLAAATWLVLRLRLGVLMIALTSAVRTVSLISLERFKVAWRPYLAYLAGVLGILLARYVEQILPQSAEAAPREHLGSQLIAGTKEDIPVFKRRRRSSSVVSAEMSGCSSKSHRRTSLPCIPREQLMGHSEWDHKRGPRGSQSSGTSITVDIAVMGEAHGLITDLLADPSLPPNVCTSLRAVSNLLSTQLTFQAIHKPRVNPVTSLSENYTCSDSEESSEKDKLAIPKRLRRSLPPGLLRRVSSTWTTTTSATGLPTLEPAPVRRDRSTSIKLQEAPSSSPDSWNNPVMMTLTKSRSFTSSYAISAANHVKAKKQSRPGALAKISPLSSPCSSPLQGTPASSLVSKISAVQFPESADTTAKQSLGSHRALTYTQSAPDLSPQILTPPVICSSCGRPYSQGNPADEPLERSGVATRTPSRTDDTAQVTSDYETNNNSDSSDIVQNEDETECLREPLRKASACSTYAPETMMFLDKPILAPEPLVMDNLDSIMEQLNTWNFPIFDLVENIGRKCGRILSQVSYRLFEDMGLFEAFKIPIREFMNYFHALEIGYRDIPYHNRIHATDVLHAVWYLTTQPIPGLSTVINDHGSTSDSDSDSGFTHGHMGYVFSKTYNVTDDKYGCLSGNIPALELMALYVAAAMHDYDHPGRTNAFLVATSAPQAVLYNDRSVLENHHAAAAWNLFMSRPEYNFLINLDHVEFKHFRFLVIEAILATDLKKHFDFVAKFNGKVNDDVGIDWTNENDRLLVCQMCIKLADINGPAKCKELHLQWTDGIVNEFYEQGDEEASLGLPISPFMDRSAPQLANLQESFISHIVGPLCNSYDSAGLMPGKWVEDSDESGDTDDPEEEEEEAPAPNEEETCENNESPKKKTFKRRKIYCQITQHLLQNHKMWKKVIEEEQRLAGIENQSLDQTPQSHSSEQIQAIKEEEEEKGKPRGEEIPTQKPDQ,1141,NP_000912.3.csv,refseq-PDE3A-NM_000921.4_clinical_seed_0_final,refseq-PDE3A-NM_000921.4.a2m,Invitae,refseq-PDE3A-NM_000921.4.npy,1,1141,1141
+NP_000924.3,MAKPYEFNWQKEVPSFLQEGAVFDRYEEESFVFEPNCLFKVDEFGFFLTWRSEGKEGQVLECSLINSIRSGAIPKDPKILAALEAVGKSENDLEGRIVCVCSGTDLVNISFTYMVAENPEVTKQWVEGLRSIIHNFRANNVSPMTCLKKHWMKLAFMTNTNGKIPVRSITRTFASGKTEKVIFQALKELGLPSGKNDEIEPTAFSYEKFYELTQKICPRTDIEDLFKKINGDKTDYLTVDQLVSFLNEHQRDPRLNEILFPFYDAKRAMQIIEMYEPDEDLKKKGLISSDGFCRYLMSDENAPVFLDRLELYQEMDHPLAHYFISSSHNTYLTGRQFGGKSSVEMYRQVLLAGCRCVELDCWDGKGEDQEPIITHGKAMCTDILFKDVIQAIKETAFVTSEYPVILSFENHCSKYQQYKMSKYCEDLFGDLLLKQALESHPLEPGRALPSPNDLKRKILIKNKRLKPEVEKKQLEALRSMMEAGESASPANILEDDNEEEIESADQEEEAHPEFKFGNELSADDLGHKEAVANSVKKGLVTVEDEQAWMASYKYVGATTNIHPYLSTMINYAQPVKFQGFHVAEERNIHYNMSSFNESVGLGYLKTHAIEFVNYNKRQMSRIYPKGGRVDSSNYMPQIFWNAGCQMVSLNYQTPDLAMQLNQGKFEYNGSCGYLLKPDFMRRPDRTFDPFSETPVDGVIAATCSVQVISGQFLSDKKIGTYVEVDMYGLPTDTIRKEFRTRMVMNNGLNPVYNEESFVFRKVILPDLAVLRIAVYDDNNKLIGQRILPLDGLQAGYRHISLRNEGNKPLSLPTIFCNIVLKTYVPDGFGDIVDALSDPKKFLSITEKRADQMRAMGIETSDIADVPSDTSKNDKKGKANTAKANVTPQSSSELRPTTTAALASGVEAKKGIELIPQVRIEDLKQMKAYLKHLKKQQKELNSLKKKHAKEHSTMQKLHCTQVDKIVAQYDKEKSTHEKILEKAMKKKGGSNCLEMKKETEIKIQTLTSDHKSKVKEIVAQHTKEWSEMINTHSAEEQEIRDLHLSQQCELLKKLLINAHEQQTQQLKLSHDRESKEMRAHQAKISMENSKAISQDKSIKNKAERERRVRELNSSNTKKFLEERKRLAMKQSKEMDQLKKVQLEHLEFLEKQNEQLLKSCHAVSQTQGEGDAADGEIGSRDGPQTSNSSMKLQNAN,1194,NP_000924.3.csv,refseq-PLCB4-NM_000933.3_clinical_seed_0_final,refseq-PLCB4-NM_000933.3.a2m,Invitae,refseq-PLCB4-NM_000933.3.npy,1,1194,1194
+NP_000928.1,MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDPRQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVDSNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKGHGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEPRYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAAHVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVDFSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYIIRDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWSTFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHINCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEMGHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKAHNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKINISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHAMGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGESVEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMREDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKLVIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAIAHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLTVFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMPDFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKIIITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALERELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLMEAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYTPTSPSYSPSSPEYTPTSPKYSPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTSPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN,1970,NP_000928.1.csv,NP_000928.1_clinical_seed_0_final,NP_000928.1.a2m,popEVE,NP_000928.1_theta_0.2.npy,1,1970,1970
+NP_000933.1,MLRLSERNMKVLLAAALIAGSVFFLLLPGPSAADEKKKGPKVTVKVYFDLRIGDEDVGRVIFGLFGKTVPKTVDNFVALATGEKGFGYKNSKFHRVIKDFMIQGGDFTRGDGTGGKSIYGERFPDENFKLKHYGPGWVSMANAGKDTNGSQFFITTVKTAWLDGKHVVFGKVLEGMEVVRKVESTKTDSRDKPLKDVIIADCGKIEVEKPFAIAKE,216,NP_000933.1.csv,refseq-PPIB-NM_000942.4_clinical_seed_0_final,refseq-PPIB-NM_000942.4.a2m,Invitae,refseq-PPIB-NM_000942.4.npy,1,216,216
+NP_000935.1,MSEPKAIDPKLSTTDRVVKAVPFPPSHRLTAKEVFDNDGKPRVDILKAHLMKEGRLEESVALRIITEGASILRQEKNLLDIDAPVTVCGDIHGQFFDLMKLFEVGGSPANTRYLFLGDYVDRGYFSIECVLYLWALKILYPKTLFLLRGNHECRHLTEYFTFKQECKIKYSERVYDACMDAFDCLPLAALMNQQFLCVHGGLSPEINTLDDIRKLDRFKEPPAYGPMCDILWSDPLEDFGNEKTQEHFTHNTVRGCSYFYSYPAVCEFLQHNNLLSILRAHEAQDAGYRMYRKSQTTGFPSLITIFSAPNYLDVYNNKAAVLKYENNVMNIRQFNCSPHPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLNICSDDELGSEEDGFDGATAAARKEVIRNKIRAIGKMARVFSVLREESESVLTLKGLTPTGMLPSGVLSGGKQTLQSATVEAIEADEAIKGFSPQHKITSFEEAKGLDRINERMPPRRDAMPSDANLNSINKALTSETNGTDSNGSNSSNIQ,521,NP_000935.1.csv,refseq-PPP3CA-NM_000944.4_clinical_seed_0_final,refseq-PPP3CA-NM_000944.4.a2m,Invitae,refseq-PPP3CA-NM_000944.4.npy,1,521,521
+NP_000952.1,MAWAALLGLLAALLLLLLLSRRRTRRPGEPPLDLGSIPWLGYALDFGKDAASFLTRMKEKHGDIFTILVGGRYVTVLLDPHSYDAVVWEPRTRLDFHAYAIFLMERIFDVQLPHYSPSDEKARMKLTLLHRELQALTEAMYTNLHAVLLGDATEAGSGWHEMGLLDFSYSFLLRAGYLTLYGIEALPRTHESQAQDRVHSADVFHTFRQLDRLLPKLARGSLSVGDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLLKNPEALAAVRGELESILWQAEQPVSQTTTLPQKVLDSTPVLDSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNSIKQFVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRP,500,NP_000952.1.csv,refseq-PTGIS-NM_000961.3_clinical_seed_0_final,refseq-PTGIS-NM_000961.3.a2m,Invitae,refseq-PTGIS-NM_000961.3_theta_0.2.npy,1,500,500
+NP_000956.2,MFDCMDVLSVSPGQILDFYTASPSSCMLQEKALKACFSGLTQTEWQHRHTAQSIETQSTSSEELVPSPPSPLPPPRVYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMIYTCHRDKNCVINKVTRNRCQYCRLQKCFEVGMSKESVRNDRNKKKKETSKQECTESYEMTAELDDLTEKIRKAHQETFPSLCQLGKYTTNSSADHRVRLDLGLWDKFSELATKCIIKIVEFAKRLPGFTGLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGPLTDLVFTFANQLLPLEMDDTETGLLSAICLICGDRQDLEEPTKVDKLQEPLLEALKIYIRKRRPSKPHMFPKILMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLENSEGHEPLTPSSSGNTAEHSPSISPSSVENSGVSQSPLVQ,448,NP_000956.2.csv,refseq-RARB-NM_000965.4_clinical_seed_0_final,refseq-RARB-NM_000965.4.a2m,Invitae,refseq-RARB-NM_000965.4.npy,1,448,448
+NP_000960.2,MGFVKVVKNKAYFKRYQVKFRRRREGKTDYYARKRLVIQDKNKYNTPKYRMIVRVTNRDIICQIAYARIEGDMIVCAAYAHELPKYGVKVGLTNYAAAYCTGLLLARRLLNRFGMDKIYEGQVEVTGDEYNVESIDGQPGAFTCYLDAGLARTTTGNKVFGALKGAVDGGLSIPHSTKRFPGYDSESKEFNAEVHRKHIMGQNVADYMRYLMEEDEDAYKKQFSQYIKNSVTPDMMEEMYKKAHAAIRENPVYEKKPKKEVKKKRWNRPKMSLAQKKDRVAQKKASFLRAQERAAES,297,NP_000960.2.csv,refseq-RPL5-NM_000969.3_clinical_seed_0_final,refseq-RPL5-NM_000969.3.a2m,Invitae,refseq-RPL5-NM_000969.3.npy,1,297,297
+NP_000968.2,MAPSRNGMVLKPHFHKDWQRRVATWFNQPARKIRRRKARQAKARRIAPRPASGPIRPIVRCPTVRYHTKVRAGRGFSLEELRVAGIHKKVARTIGISVDPRRRNKSTESLQANVQRLKEYRSKLILFPRKPSAPKKGDSSAEELKLATQLTGPVMPVRNVYKKEKARVITEEEKNFKAFASLRMARANARLFGIRAKRAKEAAEQDVEKKK,211,NP_000968.2.csv,refseq-RPL13-NM_000977.3_clinical_seed_0_final,refseq-RPL13-NM_000977.3.a2m,Invitae,refseq-RPL13-NM_000977.3.npy,1,211,211
+NP_000987.2,MSGRLWSKAIFAGYKRGLRNQREHTALLKIEGVYARDETEFYLGKRCAYVYKAKNNTVTPGGKPNKTRVIWGKVTRAHGNSGMVRAKFRSNLPAKAIGHRIRVMLYPSRI,110,NP_000987.2.csv,refseq-RPL35A-NM_000996.2_clinical_seed_0_final,refseq-RPL35A-NM_000996.2.a2m,Invitae,refseq-RPL35A-NM_000996.2.npy,1,110,110
+NP_001001331.1,MGDMTNSDFYSKNQRNESSHGGEFGCTMEELRSLMELRGTEAVVKIKETYGDTEAICRRLKTSPVEGLPGTAPDLEKRKQIFGQNFIPPKKPKTFLQLVWEALQDVTLIILEIAAIISLGLSFYHPPGEGNEGCATAQGGAEDEGEAEAGWIEGAAILLSVICVVLVTAFNDWSKEKQFRGLQSRIEQEQKFTVVRAGQVVQIPVAEIVVGDIAQVKYGDLLPADGLFIQGNDLKIDESSLTGESDQVRKSVDKDPMLLSGTHVMEGSGRMLVTAVGVNSQTGIIFTLLGAGGEEEEKKDKKGVKKGDGLQLPAADGAAASNAADSANASLVNGKMQDGNVDASQSKAKQQDGAAAMEMQPLKSAEGGDADDRKKASMHKKEKSVLQGKLTKLAVQIGKAGLVMSAITVIILVLYFTVDTFVVNKKPWLPECTPVYVQYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATAICSDKTGTLTTNRMTVVQAYVGDVHYKEIPDPSSINTKTMELLINAIAINSAYTTKILPPEKEGALPRQVGNKTECGLLGFVLDLKQDYEPVRSQMPEEKLYKVYTFNSVRKSMSTVIKLPDESFRMYSKGASEIVLKKCCKILNGAGEPRVFRPRDRDEMVKKVIEPMACDGLRTICVAYRDFPSSPEPDWDNENDILNELTCICVVGIEDPVRPEVPEAIRKCQRAGITVRMVTGDNINTARAIAIKCGIIHPGEDFLCLEGKEFNRRIRNEKGEIEQERIDKIWPKLRVLARSSPTDKHTLVKGIIDSTHTEQRQVVAVTGDGTNDGPALKKADVGFAMGIAGTDVAKEASDIILTDDNFSSIVKAVMWGRNVYDSISKFLQFQLTVNVVAVIVAFTGACITQDSPLKAVQMLWVNLIMDTFASLALATEPPTETLLLRKPYGRNKPLISRTMMKNILGHAVYQLALIFTLLFVGEKMFQIDSGRNAPLHSPPSEHYTIIFNTFVMMQLFNEINARKIHGERNVFDGIFRNPIFCTIVLGTFAIQIVIVQFGGKPFSCSPLQLDQWMWCIFIGLGELVWGQVIATIPTSRLKFLKEAGRLTQKEEIPEEELNEDVEEIDHAERELRRGQILWFRGLNRIQTQIRVVKAFRSSLYEGLEKPESRTSIHNFMAHPEFRIEDSQPHIPLIDDTDLEEDAALKQNSSPPSSLNKNNSAIDSGINLTTDTSKSATSSSPGSPIHSLETSL,1243,NP_001001331.1.csv,refseq-ATP2B2-NM_001001331.3_clinical_seed_0_final,refseq-ATP2B2-NM_001001331.3.a2m,Invitae,refseq-ATP2B2-NM_001001331.3.npy,1,1243,1243
+NP_001001344.1,MGDMANSSIEFHPKPQQQRDVPQAGGFGCTLAELRTLMELRGAEALQKIEEAYGDVSGLCRRLKTSPTEGLADNTNDLEKRRQIYGQNFIPPKQPKTFLQLVWEALQDVTLIILEVAAIVSLGLSFYAPPGEESEACGNVSGGAEDEGEAEAGWIEGAAILLSVICVVLVTAFNDWSKEKQFRGLQSRIEQEQKFTVIRNGQLLQVPVAALVVGDIAQVKYGDLLPADGVLIQANDLKIDESSLTGESDHVRKSADKDPMLLSGTHVMEGSGRMVVTAVGVNSQTGIIFTLLGAGGEEEEKKDKKGKQQDGAMESSQTKAKKQDGAVAMEMQPLKSAEGGEMEEREKKKANAPKKEKSVLQGKLTKLAVQIGKAGLVMSAITVIILVLYFVIETFVVEGRTWLAECTPVYVQYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATAICSDKTGTLTTNRMTVVQSYLGDTHYKEIPAPSALTPKILDLLVHAISINSAYTTKILPPEKEGALPRQVGNKTECALLGFVLDLKRDFQPVREQIPEDKLYKVYTFNSVRKSMSTVIRMPDGGFRLFSKGASEILLKKCTNILNSNGELRGFRPRDRDDMVRKIIEPMACDGLRTICIAYRDFSAGQEPDWDNENEVVGDLTCIAVVGIEDPVRPEVPEAIRKCQRAGITVRMVTGDNINTARAIAAKCGIIQPGEDFLCLEGKEFNRRIRNEKGEIEQERLDKVWPKLRVLARSSPTDKHTLVKGIIDSTTGEQRQVVAVTGDGTNDGPALKKADVGFAMGIAGTDVAKEASDIILTDDNFTSIVKAVMWGRNVYDSISKFLQFQLTVNVVAVIVAFTGACITQDSPLKAVQMLWVNLIMDTFASLALATEPPTESLLLRKPYGRDKPLISRTMMKNILGHAVYQLAIIFTLLFVGELFFDIDSGRNAPLHSPPSEHYTIIFNTFVMMQLFNEINARKIHGERNVFDGIFSNPIFCTIVLGTFGIQIVIVQFGGKPFSCSPLSTEQWLWCLFVGVGELVWGQVIATIPTSQLKCLKEAGHGPGKDEMTDEELAEGEEEIDHAERELRRGQILWFRGLNRIQTQIRVVKAFRSSLYEGLEKPESKTSIHNFMATPEFLINDYTHNIPLIDDTDVDENEERLRAPPPPSPNQNNNAIDSGIYLTTHVTKSATSSVFSSSPGSPLHSVETSL,1220,NP_001001344.1.csv,refseq-ATP2B3-NM_001001344.2_clinical_seed_0_final,refseq-ATP2B3-NM_001001344.2.a2m,Invitae,refseq-ATP2B3-NM_001001344.2.npy,1,1220,1220
+NP_001001412.3,MMDKFRMIFQFLQSNQESFMNGICGIMALASAQMYSAFDFNCPCLPGYNAAYSAGILLAPPLVLFLLGLVMNNNVSMLAEEWKRPLGRRAKDPAVLRYMFCSMAQRALIAPVVWVAVTLLDGKCFLCAFCTAVPVSALGNGSLAPGLPAPELARLLARVPCPEIYDGDWLLAREVAVRYLRCISQALGWSFVLLTTLLAFVVRSVRPCFTQAAFLKSKYWSHYIDIERKLFDETCTEHAKAFAKVCIQQFFEAMNHDLELGHTHGTLATAPASAAAPTTPDGAEEEREKLRGITDQGTMNRLLTSWHKCKPPLRLGQEEPPLMGNGWAGGGPRPPRKEVATYFSKV,346,NP_001001412.3.csv,refseq-CALHM1-NM_001001412.3_clinical_seed_0_final,refseq-CALHM1-NM_001001412.3.a2m,Invitae,refseq-CALHM1-NM_001001412.3.npy,1,346,346
+NP_001001430.1,MSDIEEVVEEYEEEEQEEAAVEEQEEAAEEDAEAEAETEETRAEEDEEEEEAKEAEDGPMEESKPKPRSFMPNLVPPKIPDGERVDFDDIHRKRMEKDLNELQALIEAHFENRKKEEEELVSLKDRIERRRAERAEQQRIRNEREKERQNRLAEERARREEEENRRKAEDEARKKKALSNMMHFGGYIQKQAQTERKSGKRQTEREKKKKILAERRKVLAIDHLNEDQLREKAKELWQSIYNLEAEKFDLQEKFKQQKYEINVLRNRINDNQKVSKTRGKAKVTGRWK,288,NP_001001430.1.csv,refseq-TNNT2-NM_001001430.2_clinical_seed_0_final,refseq-TNNT2-NM_001001430.2.a2m,Invitae,refseq-TNNT2-NM_001001430.2.npy,1,288,288
+NP_001001438.1,MTEGTCLRRRGGPYKTEPATDLGRWRLNCERGRQTWTYLQDERAGREQTGLEAYALGLDTKNYFKDLPKAHTAFEGALNGMTFYVGLQAEDGHWTGDYGGPLFLLPGLLITCHVARIPLPAGYREEIVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARNILHKKGGAVAIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPDWAPAHPSTLWCHCRQVYLPMSYCYAVRLSAAEDPLVQSLRQELYVEDFASIDWLAQRNNVAPDELYTPHSWLLRVVYALLNLYEHHHSAHLRQRAVQKLYEHIVADDRFTKSISIGPISKTINMLVRWYVDGPASTAFQEHVSRIPDYLWMGLDGMKMQGTNGSQIWDTAFAIQALLEAGGHHRPEFSSCLQKAHEFLRLSQVPDNPPDYQKYYRQMRKGGFSFSTLDCGWIVSDCTAEALKAVLLLQEKCPHVTEHIPRERLCDAVAVLLNMRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVMQALKYFHKRFPEHRAAEIRETLTQGLEFCRRQQRADGSWEGSWGVCFTYGTWFGLEAFACMGQTYRDGTACAEVSRACDFLLSRQMADGGWGEDFESCEERRYLQSAQSQIHNTCWAMMGLMAVRHPDIEAQERGVRCLLEKQLPNGDWPQENIAGVFNKSCAISYTSYRNIFPIWALGRFSQLYPERALAGHP,732,NP_001001438.1.csv,refseq-LSS-NM_001001438.2_clinical_seed_0_final,refseq-LSS-NM_001001438.2.a2m,Invitae,refseq-LSS-NM_001001438.2.npy,1,732,732
+NP_001001547.1,MGCDRNCGLIAGAVIGAVLAVFGGILMPVGDLLIQKTIKKQVVLEEGTIAFKNWVKTGTEVYRQFWIFDVQNPQEVMMNSSNIQVKQRGPYTYRVRFLAKENVTQDAEDNTVSFLQPNGAIFEPSLSVGTEADNFTVLNLAVAAASHIYQNQFVQMILNSLINKSKSSMFQVRTLRELLWGYRDPFLSLVPYPVTTTVGLFYPYNNTADGVYKVFNGKDNISKVAIIDTYKGKRNLSYWESHCDMINGTDAASFPPFVEKSQVLQFFSSDICRSIYAVFESDVNLKGIPVYRFVLPSKAFASPVENPDNYCFCTEKIISKNCTSYGVLDISKCKEGRPVYISLPHFLYASPDVSEPIDGLNPNEEEHRTYLDIEPITGFTLQFAKRLQVNLLVKPSEKIQVLKNLKRNYIVPILWLNETGTIGDEKANMFRSQVTGKINLLGLIEMILLSVGVVMFVAFMISYCACRSKTIK,472,NP_001001547.1.csv,refseq-CD36-NM_001001547.2_clinical_seed_0_final,refseq-CD36-NM_001001547.2.a2m,Invitae,refseq-CD36-NM_001001547.2.npy,1,472,472
+NP_001001557.1,MDTPRVLLSAVFLISFLWDLPGFQQASISSSSSSAELGSTKGMRSRKEGKMQRAPRDSDAGREGQEPQPRPQDEPRAQQPRAQEPPGRGPRVVPHEYMLSIYRTYSIAEKLGINASFFQSSKSANTITSFVDRGLDDLSHTPLRRQKYLFDVSMLSDKEELVGAELRLFRQAPSAPWGPPAGPLHVQLFPCLSPLLLDARTLDPQGAPPAGWEVFDVWQGLRHQPWKQLCLELRAAWGELDAGEAEARARGPQQPPPPDLRSLGFGRRVRPPQERALLVVFTRSQRKNLFAEMREQLGSAEAAGPGAGAEGSWPPPSGAPDARPWLPSPGRRRRRTAFASRHGKRHGKKSRLRCSKKPLHVNFKELGWDDWIIAPLEYEAYHCEGVCDFPLRSHLEPTNHAIIQTLMNSMDPGSTPPSCCVPTKLTPISILYIDAGNNVVYKQYEDMVVESCGCR,455,NP_001001557.1.csv,refseq-GDF6-NM_001001557.2_clinical_seed_0_final,refseq-GDF6-NM_001001557.2.a2m,Invitae,refseq-GDF6-NM_001001557.2.npy,1,455,455
+NP_001001563.2,MAASAAVFSRLRSGLRLGSRGLCTRLATPPRRAPDQAAEIGSRGSTKAQGPQQQPGSEGPSYAKKVALWLAGLLGAGGTVSVVYIFGNNPVDENGAKIPDEFDNDPILVQQLRRTYKYFKDYRQMIIEPTSPCLLPDPLQEPYYQPPYTLVLELTGVLLHPEWSLATGWRFKKRPGIETLFQQLAPLYEIVIFTSETGMTAFPLIDSVDPHGFISYRLFRDATRYMDGHHVKDISCLNRDPARVVVVDCKKEAFRLQPYNGVALRPWDGNSDDRVLLDLSAFLKTIALNGVEDVRTVLEHYALEDDPLAAFKQRQSRLEQEEQQRLAELSKSNKQNLFLGSLTSRLWPRSKQP,353,NP_001001563.2.csv,TIM50_HUMAN_b01_clinical_seed_0_final,TIM50_HUMAN_b01.a2m,EVE,TIM50_HUMAN_b01_theta_0.2.npy,1,353,353
+NP_001001937.1,MLSVRVAAAVVRALPRRAGLVSRNALGSSFIAARNFHASNTHLQKTGTAEMSSILEERILGADTSVDLEETGRVLSIGDGIARVHGLRNVQAEEMVEFSSGLKGMSLNLEPDNVGVVVFGNDKLIKEGDIVKRTGAIVDVPVGEELLGRVVDALGNAIDGKGPIGSKTRRRVGLKAPGIIPRISVREPMQTGIKAVDSLVPIGRGQRELIIGDRQTGKTSIAIDTIINQKRFNDGSDEKKKLYCIYVAIGQKRSTVAQLVKRLTDADAMKYTIVVSATASDAAPLQYLAPYSGCSMGEYFRDNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKMNDAFGGGSLTALPVIETQAGDVSAYIPTNVISITDGQIFLETELFYKGIRPAINVGLSVSRVGSAAQTRAMKQVAGTMKLELAQYREVAAFAQFGSDLDAATQQLLSRGVRLTELLKQGQYSPMAIEEQVAVIYAGVRGYLDKLEPSKITKFENAFLSHVVSQHQALLGTIRADGKISEQSDAKLKEIVTNFLAGFEA,553,NP_001001937.1.csv,refseq-ATP5F1A-NM_001001937.1_clinical_seed_0_final,refseq-ATP5F1A-NM_001001937.1.a2m,Invitae,refseq-ATP5F1A-NM_001001937.1.npy,1,553,553
+NP_001002295.1,MEVTADQPRWVSHHHPAVLNGQHPDTHHPGLSHSYMDAAQYPLPEEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRPPLLHGSLPWLDGGKALGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHASPHLFTFPPTPPKDVSPDPSLSTPGSAGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTALGGASSSTHHPITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTEGRECVNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMSSKSKKCKKVHDSLEDFPKNSSFNPAALSRHMSSLSHISPFSHSSHMLTTPTPMHPPSSLSFGPHHPSSMVTAMG,444,NP_001002295.1.csv,refseq-GATA3-NM_001002295.1_clinical_seed_0_final,refseq-GATA3-NM_001002295.1.a2m,Invitae,refseq-GATA3-NM_001002295.1.npy,1,444,444
+NP_001002755.1,MAATARRGWGAAAVAAGLRRRFCHMLKNPYTIKKQPLHQFVQRPLFPLPAAFYHPVRYMFIQTQDTPNPNSLKFIPGKPVLETRTMDFPTPAAAFRSPLARQLFRIEGVKSVFFGPDFITVTKENEELDWNLLKPDIYATIMDFFASGLPLVTEETPSGEAGSEEDDEVVAMIKELLDTRIRPTVQEDGGDVIYKGFEDGIVQLKLQGSCTSCPSSIITLKNGIQNMLQFYIPEVEGVEQVMDDESDEKEANSP,254,NP_001002755.1.csv,refseq-NFU1-NM_001002755.2_clinical_seed_0_final,refseq-NFU1-NM_001002755.2.a2m,Invitae,refseq-NFU1-NM_001002755.2.npy,1,254,254
+NP_001002841.1,MAPKKPEPKKEAAKPAPAPAPAPAPAPAPAPEAPKEPAFDPKSVKIDFTADQIEEFKEAFSLFDRTPTGEMKITYGQCGDVLRALGQNPTNAEVLRVLGKPKPEEMNVKMLDFETFLPILQHISRNKEQGTYEDFVEGLRVFDKESNGTVMGAELRHVLATLGEKMTEAEVEQLLAGQEDANGCINYEAFVKHIMSG,197,NP_001002841.1.csv,refseq-MYL4-NM_001002841.1_clinical_seed_0_final,refseq-MYL4-NM_001002841.1.a2m,Invitae,refseq-MYL4-NM_001002841.1.npy,1,197,197
+NP_001003694.1,MGVDFDVKTFCHNLRATKPPYECPVETCRKVYKSYSGIEYHLYHYDHDNPPPPQQTPLRKHKKKGRQSRPANKQSPSPSEVSQSPGREVMSYAQAQRMVEVDLHGRVHRISIFDNLDVVSEDEEAPEEAPENGSNKENTETPAATPKSGKHKNKEKRKDSNHHHHHNVSASTTPKLPEVVYRELEQDTPDAPPRPTSYYRYIEKSAEELDEEVEYDMDEEDYIWLDIMNERRKTEGVSPIPQEIFEYLMDRLEKESYFESHNKGDPNALVDEDAVCCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQWLCRRCLQSPSRAVDCALCPNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRGSGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFSVRKTAYCDIHTPPGSARRLPALSHSEGEEDEDEEEDEGKGWSSEKVKKAKAKSRIKMKKARKILAEKRAAAPVVSVPCIPPHRLSKITNRLTIQRKSQFMQRLHSYWTLKRQSRNGVPLLRRLQTHLQSQRNCDQVGRDSEDKNWALKEQLKSWQRLRHDLERARLLVELIRKREKLKRETIKVQQIAMEMQLTPFLILLRKTLEQLQEKDTGNIFSEPVPLSEVTELDEVPDYLDHIKKPMDFFTMKQNLEAYRYLNFDDFEEDFNLIVSNCLKYNAKDTIFYRAAVRLREQGGAVLRQARRQAEKMGIDFETGMHIPHSLAGDEATHHTEDAAEEERLVLLENQKHLPVEEQLKLLLERLDEVNASKQSVGRSRRAKMIKKEMTALRRKLAHQRETGRDGPERHGPSSRGSLTPHPAACDKDGQTDSAAEESSSQETSKGLGPNMSSTPAHEVGRRTSVLFSKKNPKTAGPPKRPGRPPKNRESQMTPSHGGSPVGPPQLPIMSSLRQRKRGRSPRPSSSSDSDSDKSTEDPPMDLPANGFSGGNQPVKKSFLVYRNDCSLPRSSSDSESSSSSSSSAASDRTSTTPSKQGRGKPSFSRGTFPEDSSEDTSGTENEAYSVGTGRGVGHSMVRKSLGRGAGWLSEDEDSPLDALDLVWAKCRGYPSYPALIIDPKMPREGMFHHGVPIPVPPLEVLKLGEQMTQEAREHLYLVLFFDNKRTWQWLPRTKLVPLGVNQDLDKEKMLEGRKSNIRKSVQIAYHRALQHRSKVQGEQSSETSDSD,1220,NP_001003694.1.csv,refseq-BRPF1-NM_001003694.1_clinical_seed_0_final,refseq-BRPF1-NM_001003694.1.a2m,Invitae,refseq-BRPF1-NM_001003694.1_theta_0.2.npy,1,1220,1220
+NP_001003722.1,MPSEGRCWETLKALRSSDKGRLCYYRDWLLRREDVLEECMSLPKLSSYSGWVVEHVLPHMQENQPLSETSPSSTSASALDQPSFVPKSPDASSAFSPASPATPNGTKGKDESQHTESMVLQSSRGIKVEGCVRMYELVHRMKGTEGLRLWQEEQERKVQALSEMASEQLKRFDEWKELKQHKEFQDLREVMEKSSREALGHQEKLKAEHRHRAKILNLKLREAEQQRVKQAEQERLRKEEGQIRLRALYALQEEMLQLSQQLDASEQHKALLKVDLAAFQTRGNQLCSLISGIIRASSESSYPTAESQAEAERALREMRDLLMNLGQEITRACEDKRRQDEEEAQVKLQEAQMQQGPEAHKEPPAPSQGPGGKQNEDLQVKVQDITMQWYQQLQDASMQCVLTFEGLTNSKDSQAKKIKMDLQKAATIPVSQISTIAGSKLKEIFDKIHSLLSGKPVQSGGRSVSVTLNPQGLDFVQYKLAEKFVKQGEEEVASHHEAAFPIAVVASGIWELHPRVGDLILAHLHKKCPYSVPFYPTFKEGMALEDYQRMLGYQVKDSKVEQQDNFLKRMSGMIRLYAAIIQLRWPYGNRQEIHPHGLNHGWRWLAQILNMEPLSDVTATLLFDFLEVCGNALMKQYQVQFWKMLILIKEDYFPRIEAITSSGQMGSFIRLKQFLEKCLQHKDIPVPKGFLTSSFWRS,698,NP_001003722.1.csv,refseq-GLE1-NM_001003722.1_clinical_seed_0_final,refseq-GLE1-NM_001003722.1.a2m,Invitae,refseq-GLE1-NM_001003722.1.npy,1,698,698
+NP_001003800.1,MSAPSEEEEYARLVMEAQPEWLRAEVKRLSHELAETTREKIQAAEYGLAVLEEKHQLKLQFEELEVDYEAIRSEMEQLKEAFGQAHTNHKKVAADGESREESLIQESASKEQYYVRKVLELQTELKQLRNVLTNTQSENERLASVAQELKEINQNVEIQRGRLRDDIKEYKFREARLLQDYSELEEENISLQKQVSVLRQNQVEFEGLKHEIKRLEEETEYLNSQLEDAIRLKEISERQLEEALETLKTEREQKNSLRKELSHYMSINDSFYTSHLHVSLDGLKFSDDAAEPNNDAEALVNGFEHGGLAKLPLDNKTSTPKKEGLAPPSPSLVSDLLSELNISEIQKLKQQLMQMEREKAGLLATLQDTQKQLEHTRGSLSEQQEKVTRLTENLSALRRLQASKERQTALDNEKDRDSHEDGDYYEVDINGPEILACKYHVAVAEAGELREQLKALRSTHEAREAQHAEEKGRYEAEGQALTEKVSLLEKASRQDRELLARLEKELKKVSDVAGETQGSLSVAQDELVTFSEELANLYHHVCMCNNETPNRVMLDYYREGQGGAGRTSPGGRTSPEARGRRSPILLPKGLLAPEAGRADGGTGDSSPSPGSSLPSPLSDPRREPMNIYNLIAIIRDQIKHLQAAVDRTTELSRQRIASQELGPAVDKDKEALMEEILKLKSLLSTKREQITTLRTVLKANKQTAEVALANLKSKYENEKAMVTETMMKLRNELKALKEDAATFSSLRAMFATRCDEYITQLDEMQRQLAAAEDEKKTLNSLLRMAIQQKLALTQRLELLELDHEQTRRGRAKAAPKTKPATPSVSHTCACASDRAEGTGLANQVFCSEKHSIYCD,855,NP_001003800.1.csv,refseq-BICD2-NM_001003800.1_clinical_seed_0_final,refseq-BICD2-NM_001003800.1.a2m,Invitae,refseq-BICD2-NM_001003800.1.npy,1,855,855
+NP_001003811.1,MISAHCNLRLLCSSDSSASASQVAGTTEVVENLVTNDNSPNIPEAIDRLFSDIANINRESMAEITDIQIEEMAVNLWNWALTIGGGWLVNEEQKIRLHYVACKLLSMCEASFASEQSIQRLIMMNMRIGKEWLDAGNFLIADECFQAAVASLEQLYVKLIQRSSPEADLTMEKITVESDHFRVLSYQAESAVAQGDFQRASMCVLQCKDMLMRLPQMTSSLHHLCYNFGVETQKNNKYEESSFWLSQSYDIGKMDKKSTGPEMLAKVLRLLATNYLDWDDTKYYDKALNAVNLANKEHLSSPGLFLKMKILLKGETSNEELLEAVMEILHLDMPLDFCLNIAKLLMDHERESVGFHFLTIIHERFKSSENIGKVLILHTDMLLQRKEELLAKEKIEEIFLAHQTGRQLTAESMNWLHNILWRQAASSFEVQNYTDALQWYYYSLRFYSTDEMDLDFTKLQRNMACCYLNLQQLDKAKEAVAEAERHDPRNVFTQFYIFKIAVIEGNSERALQAIITLENILTDEESEDNDLVAERGSPTMLLSLAAQFALENGQQIVAEKALEYLAQHSEDQEQVLTAVKCLLRFLLPKIAEMPESEDKKKEMDRLLTCLNRAFVKLSQPFGEEALSLESRANEAQWFRKTAWNLAVQCDKDPVMMREFFILSYKMSQFCPSDQVILIARKTCLLMAVAVDLEQGRKASTAFEQTMFLSRALEEIQTCNDIHNFLKQTGTFSNDSCEKLLLLYEFEVRAKLNDPLLESFLESVWELPHLETKTFETIAIIAMEKPAHYPLIALKALKKALLLYKKEEPIDISQYSKCMHNLVNLSVPDGASNVELCPLEEVWGYFEDALSHISRTKDYPEMEILWLMVKSWNTGVLMFSRSKYASAEKWCGLALRFLNHLTSFKESYETQMNMLYSQLVEALSNNKGPVFHEHGYWSKSD,940,NP_001003811.1.csv,refseq-TEX11-NM_001003811.1_clinical_seed_0_final,refseq-TEX11-NM_001003811.1.a2m,Invitae,refseq-TEX11-NM_001003811.1.npy,1,940,940
+NP_001003841.1,MVRLVLPNPGLDARIPSLAELETIEQEEASSRPKWDNKAQYMLTCLGFCVGLGNVWRFPYLCQSHGGGAFMIPFLILLVLEGIPLLYLEFAIGQRLRRGSLGVWSSIHPALKGLGLASMLTSFMVGLYYNTIISWIMWYLFNSFQEPLPWSDCPLNENQTGYVDECARSSPVDYFWYRETLNISTSISDSGSIQWWMLLCLACAWSVLYMCTIRGIETTGKAVYITSTLPYVVLTIFLIRGLTLKGATNGIVFLFTPNVTELAQPDTWLDAGAQVFFSFSLAFGGLISFSSYNSVHNNCEKDSVIVSIINGFTSVYVAIVVYSVIGFRATQRYDDCFSTNILTLINGFDLPEGNVTQENFVDMQQRCNASDPAAYAQLVFQTCDINAFLSEAVEGTGLAFIVFTEAITKMPLSPLWSVLFFIMLFCLGLSSMFGNMEGVVVPLQDLRVIPPKWPKEVLTGLICLGTFLIGFIFTLNSGQYWLSLLDSYAGSIPLLIIAFCEMFSVVYVYGVDRFNKDIEFMIGHKPNIFWQVTWRVVSPLLMLIIFLFFFVVEVSQELTYSIWDPGYEEFPKSQKISYPNWVYVVVVIVAGVPSLTIPGYAIYKLIRNHCQKPGDHQGLVSTLSTASMNGDLKY,634,NP_001003841.1.csv,refseq-SLC6A19-NM_001003841.2_clinical_seed_0_final,refseq-SLC6A19-NM_001003841.2.a2m,Invitae,refseq-SLC6A19-NM_001003841.2.npy,1,634,634
+NP_001004051.1,MTGAEIEPSAQAKPEKKAGEEVIAGPERENDVPLVVRPKVRTQATTGARPKTETKSVPAARPKTEAQAMSGARPKTEVQVMGGARPKTEAQGITGARPKTDARAVGGARSKTDAKAIPGARPKDEAQAWAQSEFGTEAVSQAEGVSQTNAVAWPLATAESGSVTKSKGLSMDRELVNVDAETFPGTQGQKGIQPWFGPGEETNMGSWCYSRPRAREEASNESGFWSADETSTASSFWTGEETSVRSWPREESNTRSRHRAKHQTNPRSRPRSKQEAYVDSWSGSEDEASNPFSFWVGENTNNLFRPRVREEANIRSKLRTNREDCFESESEDEFYKQSWVLPGEEANSRFRHRDKEDPNTALKLRAQKDVDSDRVKQEPRFEEEVIIGSWFWAEKEASLEGGASAICESEPGTEEGAIGGSAYWAEEKSSLGAVAREEAKPESEEEAIFGSWFWDRDEACFDLNPCPVYKVSDRFRDAAEELNASSRPQTWDEVTVEFKPGLFHGVGFRSTSPFGIPEEASEMLEAKPKNLELSPEGEEQESLLQPDQPSPEFTFQYDPSYRSVREIREHLRARESAESESWSCSCIQCELKIGSEEFEEFLLLMDKIRDPFIHEISKIAMGMRSASQFTRDFIRDSGVVSLIETLLNYPSSRVRTSFLENMIHMAPPYPNLNMIETFICQVCEETLAHSVDSLEQLTGIRMLRHLTMTIDYHTLIANYMSGFLSLLTTANARTKFHVLKMLLNLSENPAVAKKLFSAKALSIFVGLFNIEETNDNIQIVIKMFQNISNIIKSGKMSLIDDDFSLEPLISAFREFEELAKQLQAQIDNQNDPEVGQQS,838,NP_001004051.1.csv,refseq-GPRASP2-NM_001004051.3_clinical_seed_0_final,refseq-GPRASP2-NM_001004051.3.a2m,Invitae,refseq-GPRASP2-NM_001004051.3.npy,1,838,838
+NP_001004127.2,MAAGERSWCLCKLLRFFYSLFFPGLIVCGTLCVCLVIVLWGIRLLLQRKKKLVSTSKNGKNQMVIAFFHPYCNAGGGGERVLWCALRALQKKYPEAVYVVYTGDVNVNGQQILEGAFRRFNIRLIHPVQFVFLRKRYLVEDSLYPHFTLLGQSLGSIFLGWEALMQCVPDVYIDSMGYAFTLPLFKYIGGCQVGSYVHYPTISTDMLSVVKNQNIGFNNAAFITRNPFLSKVKLIYYYLFAFIYGLVGSCSDVVMVNSSWTLNHILSLWKVGNCTNIVYPPCDVQTFLDIPLHEKKMTPGHLLVSVGQFRPEKNHPLQIRAFAKLLNKKMVESPPSLKLVLIGGCRNKDDELRVNQLRRLSEDLGVQEYVEFKINIPFDELKNYLSEATIGLHTMWNEHFGIGVVECMAAGTIILAHNSGGPKLDIVVPHEGDITGFLAESEEDYAETIAHILSMSAEKRLQIRKSARASVSRFSDQEFEVTFLSSVEKLFK,492,NP_001004127.2.csv,refseq-ALG11-NM_001004127.2_clinical_seed_0_final,refseq-ALG11-NM_001004127.2.a2m,Invitae,refseq-ALG11-NM_001004127.2.npy,1,492,492
+NP_001005271.2,MASPLRDEEEEEEEMVVSEEEEEEEEEGDEEEEEEVEAADEDDEEDDDEGVLGRGPGHDRGRDRHSPPGCHLFPPPPPPPPPLPPPPPPPPPDKDDIRLLPSALGVKKRKRGPKKQKENKPGKPRKRKKRDSEEEFGSERDEYREKSESGGSEYGTGPGRKRRRKHREKKEKKTKRRKKGEGDGGQKQVEQKSSATLLLTWGLEDVEHVFSEEDYHTLTNYKAFSQFMRPLIAKKNPKIPMSKMMTILGAKWREFSANNPFKGSAAAVAAAAAAAAAAVAEQVSAAVSSATPIAPSGPPALPPPPAADIQPPPIRRAKTKEGKGPGHKRRSKSPRVPDGRKKLRGKKMAPLKIKLGLLGGKRKKGGSYVFQSDEGPEPEAEESDLDSGSVHSASGRPDGPVRTKKLKRGRPGRKKKKVLGCPAVAGEEEVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAVPAPQQADGNPDVPPPRPLQGRSEREFFVKWVGLSYWHCSWAKELQLEIFHLVMYRNYQRKNDMDEPPPLDYGSGEDDGKSDKRKVKDPHYAEMEEKYYRFGIKPEWMTVHRIINHSVDKKGNYHYLVKWRDLPYDQSTWEEDEMNIPEYEEHKQSYWRHRELIMGEDPAQPRKYKKKKKELQGDGPPSSPTNDPTVKYETQPRFITATGGTLHMYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGHTKGPFLVSAPLSTIINWEREFQMWAPKFYVVTYTGDKDSRAIIRENEFSFEDNAIKGGKKAFKMKREAQVKFHVLLTSYELITIDQAALGSIRWACLVVDEAHRLKNNQSKFFRVLNGYKIDHKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRRLKADVFKNMPAKTELIVRVELSPMQKKYYKYILTRNFEALNSRGGGNQVSLLNIMMDLKKCCNHPYLFPVAAMESPKLPSGAYEGGALIKSSGKLMLLQKMLRKLKEQGHRVLIFSQMTKMLDLLEDFLDYEGYKYERIDGGITGALRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIFDSDWNPHNDIQAFSRAHRIGQANKVMIYRFVTRASVEERITQVAKRKMMLTHLVVRPGLGSKAGSMSKQELDDILKFGTEELFKDENEGENKEEDSSVIHYDNEAIARLLDRNQDATEDTDVQNMNEYLSSFKVAQYVVREEDKIEEIEREIIKQEENVDPDYWEKLLRHHYEQQQEDLARNLGKGKRVRKQVNYNDAAQEDQDNQSEYSVGSEEEDEDFDERPEGRRQSKRQLRNEKDKPLPPLLARVGGNIEVLGFNTRQRKAFLNAVMRWGMPPQDAFTTQWLVRDLRGKTEKEFKAYVSLFMRHLCEPGADGSETFADGVPREGLSRQQVLTRIGVMSLVKKKVQEFEHINGRWSMPELMPDPSADSKRSSRASSPTKTSPTTPEASATNSPCTSKPATPAPSEKGEGIRTPLEKEEAENQEEKPEKNSRIGEKMETEADAPSPAPSLGERLEPRKIPLEDEVPGVPGEMEPEPGYRGDREKSATESTPGERGEEKPLDGQEHRERPEGETGDLGKREDVKGDRELRPGPRDEPRSNGRREEKTEKPRFMFNIADGGFTELHTLWQNEERAAISSGKLNEIWHRRHDYWLLAGIVLHGYARWQDIQNDAQFAIINEPFKTEANKGNFLEMKNKFLARRFKLLEQALVIEEQLRRAAYLNLSQEPAHPAMALHARFAEAECLAESHQHLSKESLAGNKPANAVLHKVLNQLEELLSDMKADVTRLPATLSRIPPIAARLQMSERSILSRLASKGTEPHPTPAYPPGPYATPPGYGAAFSAAPVGALAAAGANYSQMPAGSFITAATNGPPVLVKKEKEMVGALVSDGLDRKEPRAGEVICIDD,2059,NP_001005271.2.csv,refseq-CHD3-NM_001005271.2_clinical_seed_0_final,refseq-CHD3-NM_001005271.2.a2m,Invitae,refseq-CHD3-NM_001005271.2.npy,1,2059,2059
+NP_001005369.1,MNQKLLKLENLLRFHTIYRQLHSLCQRRALRQWRHGFSSAYPVWTAQLCAWPWPTDVLTGAALSQYRLLVTKKEEGPWKSQLSSTKSKKVVEVWIGMTIEELARAMEKNTDYVYEALLNTDIDIDSLEADSHLDEVWIKEVITKAGMKLKWSKLKQDKVRKNKDAVRRPQADPALLTPRSPVVTIMGHVDHGKTTLLDKFRKTQVAAVETGGITQHIGAFLVSLPSGEKITFLDTPGHAAFSAMRARGAQVTDIVVLVVAADDGVMKQTVESIQHAKDAQVPIILAVNKCDKAEADPEKVKKELLAYDVVCEDYGGDVQAVPVSALTGDNLMALAEATVALAEMLELKADPNGPVEGTVIESFTDKGRGLVTTAIIQRGTLRKGSVLVAGKCWAKVRLMFDENGKTIDEAYPSMPVGITGWRDLPSAGEEILEVESEPRAREVVDWRKYEQEQEKGQEDLKIIEEKRKEHKEAHQKAREKYGHLLWKKRSILRFLERKEQIPLKPKEKRERDSNVLSVIIKGDVDGSVEAILNIIDTYDASHECELELVHFGVGDVSANDVNLAETFDGVIYGFNVNAGNVIQQSAAKKGVKIKLHKIIYRLVEDLQEELSSRLPCAVEEHPVGEASILATFSVTEGKKKVPVAGCRVQKGQLEKQKKFKLTRNGHVIWKGSLTSLKHHKDDISIVKTGMDCGLSLDEDNMEFQVGDRIVCYEEKQIQAKTSWDPGF,727,NP_001005369.1.csv,refseq-MTIF2-NM_001005369.1_clinical_seed_0_final,refseq-MTIF2-NM_001005369.1.a2m,Invitae,refseq-MTIF2-NM_001005369.1.npy,1,727,727
+NP_001005463.1,MFGIQENIPRGGTTMKEEPLGSGMNPVRSWMHTAGVVDANTAAQSGVGLARAHFEKQPPSNLRKSNFFHFVLALYDRQGQPVEIERTAFVDFVEKEKEPNNEKTNNGIHYKLQLLYSNGVRTEQDLYVRLIDSMTKQAIVYEGQDKNPEMCRVLLTHEIMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMRRFQVVVSTTVNVDGHVLAVSDNMFVHNNSKHGRRARRLDPSEATPCIKAISPSEGWTTGGATVIIIGDNFFDGLQVVFGTMLVWSELITPHAIRVQTPPRHIPGVVEVTLSYKSKQFCKGAPGRFVYTALNEPTIDYGFQRLQKVIPRHPGDPERLPKEVLLKRAADLVEALYGMPHNNQEIILKRAADIAEALYSVPRNHNQIPTLGNNPAHTGMMGVNSFSSQLAVNVSETSQANDQVGYSRNTSSVSPRGYVPSSTPQQSNYNTVSTSMNGYGSGAMASLGVPGSPGFLNGSSANSPYGMKQKSAFAPVVRPQASPPPSCTSANGNGLQAMSGLVVPPM,551,NP_001005463.1.csv,refseq-EBF3-NM_001005463.2_clinical_seed_0_final,refseq-EBF3-NM_001005463.2.a2m,Invitae,refseq-EBF3-NM_001005463.2.npy,1,551,551
+NP_001005731.1,MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFGKFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGGICNGRGHCECGRCHCHQQSLYTDTICEINYSAIHPGLCEDLRSCVQCQAWGTGEKKGRTCEECNFKVKMVDELKRAEEVVVRCSFRDEDDDCTYSYTMEGDGAPGPNSTVLVHKKKDCPPGSFWWLIPLLLLLLPLLALLLLLCWKYCACCKACLALLPCCNRGHMVGFKEDHYMLRENLMASDHLDTPMLRSGNLKGRDVVRWKVTNNMQRPGFATHAASINPTELVPYGLSLRLARLCTENLLKPDTRECAQLRQEVEENLNEVYRQISGVHKLQQTKFRQQPNAGKKQDHTIVDTVLMAPRSAKPALLKLTEKQVEQRAFHDLKVAPGYYTLTADQDARGMVEFQEGVELVDVRVPLFIRPEDDDEKQLLVEAIDVPAGTATLGRRLVNITIIKEQARDVVSFEQPEFSVSRGDQVARIPVIRRVLDGGKSQVSYRTQDGTAQGNRDYIPVEGELLFQPGEAWKELQVKLLELQEVDSLLRGRQVRRFHVQLSNPKFGAHLGQPHSTTIIIRDPDELDRSFTSQMLSSQPPPHGDLGAPQNPNAKAAGSRKIHFNWLPPSGKPMGYRVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGEGPYSSLVSCRTHQEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVNDDNRPIGPMKKVLVDNPKNRMLLIENLRESQPYRYTVKARNGAGWGPEREAIINLATQPKRPMSIPIIPDIPIVDAQSGEDYDSFLMYSDDVLRSPSGSQRPSVSDDTEHLVNGRMDFAFPGSTNSLHRMTTTSAAAYGTHLSPHVPHRVLSTSSTLTRDYNSLTRSEHSHSTTLPRDYSTLTSVSSHDSRLTAGVPDTPTRLVFSALGPTSLRVSWQEPRCERPLQGYSVEYQLLNGGELHRLNIPNPAQTSVVVEDLLPNHSYVFRVRAQSQEGWGREREGVITIESQVHPQSPLCPLPGSAFTLSTPSAPGPLVFTALSPDSLQLSWERPRRPNGDIVGYLVTCEMAQGGGPATAFRVDGDSPESRLTVPGLSENVPYKFKVQARTTEGFGPEREGIITIESQDGGPFPQLGSRAGLFQHPLQSEYSSITTTHTSATEPFLVDGLTLGAQHLEAGGSLTRHVTQEFVSRTLTTSGTLSTHMDQQFFQT,1752,NP_001005731.1.csv,refseq-ITGB4-NM_001005731.2_clinical_seed_0_final,refseq-ITGB4-NM_001005731.2.a2m,Invitae,refseq-ITGB4-NM_001005731.2.npy,1,1752,1752
+NP_001005741.1,MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFLETISPGYSIHTYLWRRQ,536,NP_001005741.1.csv,refseq-GBA-NM_001005741.2_clinical_seed_0_final,refseq-GBA-NM_001005741.2.a2m,Invitae,refseq-GBA-NM_001005741.2.npy,1,536,536
+NP_001006658.1,MFFYLSKKISIPNNVKLQCVSWNKEQGFIACGGEDGLLKVLKLETQTDDAKLRGLAAPSNLSMNQTLEGHSGSVQVVTWNEQYQKLTTSDENGLIIVWMLYKGSWIEEMINNRNKSVVRSMSWNADGQKICIVYEDGAVIVGSVDGNRIWGKDLKGIQLSHVTWSADSKVLLFGMANGEIHIYDNQGNFMIKMKLSCLVNVTGAISIAGIHWYHGTEGYVEPDCPCLAVCFDNGRCQIMRHENDQNPVLIDTGMYVVGIQWNHMGSVLAVAGFQKAAMQDKDVNIVQFYTPFGEHLGTLKVPGKEISALSWEGGGLKIALAVDSFIYFANIRPNYKWGYCSNTVVYAYTRPDRPEYCVVFWDTKNNEKYVKYVKGLISITTCGDFCILATKADENHPQEENEMETFGATFVLVLCNSIGTPLDPKYIDIVPLFVAMTKTHVIAASKEAFYTWQYRVAKKLTALEINQITRSRKEGRERIYHVDDTPSGSMDGVLDYSKTIQGTRDPICAITASDKILIVGRESGTIQRYSLPNVGLIQKYSLNCRAYQLSLNCNSSRLAIIDISGVLTFFDLDARVTDSTGQQVVGELLKLERRDVWDMKWAKDNPDLFAMMEKTRMYVFRNLDPEEPIQTSGYICNFEDLEIKSVLLDEILKDPEHPNKDYLINFEIRSLRDSRALIEKVGIKDASQFIEDNPHPRLWRLLAEAALQKLDLYTAEQAFVRCKDYQGIKFVKRLGKLLSESMKQAEVVGYFGRFEEAERTYLEMDRRDLAIGLRLKLGDWFRVLQLLKTGSGDADDSLLEQANNAIGDYFADRQKWLNAVQYYVQGRNQERLAECYYMLEDYEGLENLAISLPENHKLLPEIAQMFVRVGMCEQAVTAFLKCSQPKAAVDTCVHLNQWNKAVELAKNHSMKEIGSLLARYASHLLEKNKTLDAIELYRKANYFFDAAKLMFKIADEEAKKGSKPLRVKKLYVLSALLIEQYHEQMKNAQRGKVKGKSSEATSALAGLLEEEVLSTTDRFTDNAWRGAEAYHFFILAQRQLYEGCVDTALKTALHLKDYEDIIPPVEIYSLLALCACASRAFGTCSKAFIKLKSLETLSSEQKQQYEDLALEIFTKHTSKDNRKPELDSLMEGGEGKLPTCVATGSPITEYQFWMCSVCKHGVLAQEISHYSFCPLCHSPVG,1181,NP_001006658.1.csv,refseq-WDR35-NM_001006657.1_clinical_seed_0_final,refseq-WDR35-NM_001006657.1.a2m,Invitae,refseq-WDR35-NM_001006657.1.npy,1,1181,1181
+NP_001006659.1,MGAAGLLGVFLALVAPGVLGISCGSPPPILNGRISYYSTPIAVGTVIRYSCSGTFRLIGEKSLLCITKDKVDGTWDKPAPKCEYFNKYSSCPEPIVPGGYKIRGSTPYRHGDSVTFACKTNFSMNGNKSVWCQANNMWGPTRLPTCVSVFPLECPALPMIHNGHHTSENVGSIAPGLSVTYSCESGYLLVGEKIINCLSSGKWSAVPPTCEEARCKSLGRFPNGKVKEPPILRVGVTANFFCDEGYRLQGPPSSRCVIAGQGVAWTKMPVCEEIFCPSPPPILNGRHIGNSLANVSYGSIVTYTCDPDPEEGVNFILIGESTLRCTVDSQKTGTWSGPAPRCELSTSAVQCPHPQILRGRMVSGQKDRYTYNDTVIFACMFGFTLKGSKQIRCNAQGTWEPSAPVCEKECQAPPNILNGQKEDRHMVRFDPGTSIKYSCNPGYVLVGEESIQCTSEGVWTPPVPQCKVAACEATGRQLLTKPQHQFVRPDVNSSCGEGYKLSGSVYQECQGTIPWFMEIRLCKEITCPPPPVIYNGAHTGSSLEDFPYGTTVTYTCNPGPERGVEFSLIGESTIRCTSNDQERGTWSGPAPLCKLSLLAVQCSHVHIANGYKISGKEAPYFYNDTVTFKCYSGFTLKGSSQIRCKADNTWDPEIPVCEKGCQSPPGLHHGRHTGGNTVFFVSGMTVDYTCDPGYLLVGNKSIHCMPSGNWSPSAPRCEETCQHVRQSLQELPAGSRVELVNTSCQDGYQLTGHAYQMCQDAENGIWFKKIPLCKVIHCHPPPVIVNGKHTGMMAENFLYGNEVSYECDQGFYLLGEKKLQCRSDSKGHGSWSGPSPQCLRSPPVTRCPNPEVKHGYKLNKTHSAYSHNDIVYVDCNPGFIMNGSRVIRCHTDNTWVPGVPTCIKKAFIGCPPPPKTPNGNHTGGNIARFSPGMSILYSCDQGYLLVGEALLLCTHEGTWSQPAPHCKEVNCSSPADMDGIQKGLEPRKMYQYGAVVTLECEDGYMLEGSPQSQCQSDHQWNPPLAVCRSRSLAPVLCGIAAGLILLTFLIVITLYVISKHRARNYYTDTSQKEAFHLEAREVYSVDPYNPAS,1092,NP_001006659.1.csv,refseq-CR2-NM_001006658.2_clinical_seed_0_final,refseq-CR2-NM_001006658.2.a2m,Invitae,refseq-CR2-NM_001006658.2.npy,1,1092,1092
+NP_001007027.1,MKTRQNKDSMSMRSGRKKEAPGPREELRSRGRASPGGVSTSSSDGKAEKSRQTAKKARVEEASTPKVNKQGRSEEISESESEETNAPKKTKTEQELPRPQSPSDLDSLDGRSLNDDGSSDPRDIDQDNRSTSPSIYSPGSVENDSDSSSGLSQGPARPYHPPPLFPPSPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGAPPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAPPTKPPTTPVGGGNLPSAPPPANFPHVTPNLPPPPALRPLNNASASPPGLGAQPLPGHLPSPHAMGQGMGGLPPGPEKGPTLAPSPHSLPPASSSAPAPPMRFPYSSSSSSSAAASSSSSSSSSSASPFPASQALPSYPHSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGRLLANSNAHPGPFPPSTGAQSTAHPPVSTHHHHHQQQQQQQQQQQQQQQQQQQHHGNSGPPPPGAFPHPLEGGSSHHAHPYAMSPSLGSLRPYPPGPAHLPPPHSQVSYSQAGPNGPPVSSSSNSSSSTSQGSYPCSHPSPSQGPQGAPYPFPPVPTVTTSSATLSTVIATVASSPAGYKTASPPGPPPYGKRAPSPGAYKTATPPGYKPGSPPSFRTGTPPGYRGTSPPAGPGTFKPGSPTVGPGPLPPAGPSGLPSLPPPPAAPASGPPLSATQIKQEPAEEYETPESPVPPARSPSPPPKVVDVPSHASQSARFNKHLDRGFNSCARSDLYFVPLEGSKLAKKRADLVEKVRREAEQRAREEKEREREREREKEREREKERELERSVKLAQEGRAPVECPSLGPVPHRPPFEPGSAVATVPPYLGPDTPALRTLSEYARPHVMSPGNRNHPFYVPLGAVDPGLLGYNVPALYSSDPAAREREREARERDLRDRLKPGFEVKPSELEPLHGVPGPGLDPFPRHGGLALQPGPPGLHPFPFHPSLGPLERERLALAAGPALRPDMSYAERLAAERQHAERVAALGNDPLARLQMLNVTPHHHQHSHIHSHLHLHQQDAIHAASASVHPLIDPLASGSHLTRIPYPAGTLPNPLLPHPLHENEVLRHQLFAAPYRDLPASLSAPMSAAHQLQAMHAQSAELQRLALEQQQWLHAHHPLHSVPLPAQEDYYSHLKKESDKPL,1190,NP_001007027.1.csv,refseq-ATN1-NM_001007026.1_clinical_seed_0_final,refseq-ATN1-NM_001007026.1.a2m,Invitae,refseq-ATN1-NM_001007026.1_theta_0.2.npy,1,1190,1190
+NP_001007227.1,MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS,374,NP_001007227.1.csv,refseq-SPOP-NM_001007226.1_clinical_seed_0_final,refseq-SPOP-NM_001007226.1.a2m,Invitae,refseq-SPOP-NM_001007226.1.npy,1,374,374
+NP_001007528.1,MSGAALGLEIVFVFFLALFLLHRYGDFKKQHRLVIIGTLLAWYLCFLIVFILPLDVSTTIYNRCKHAAANSSPPENSNITGLYATANPVPSQHPCFKPWSYIPDGIMPIFWRVVYWTSQFLTWILLPFMQSYARSGGFSITGKIKTALIENAIYYGTYLLIFGAFLIYVAVNPHLHLEWNQLQTIGIAAANTWGLFLLVLLLGYGLVEIPRSYWNGAKRGYLLMKTYFKAAKLMTEKADAEENLEDAMEEVRKVNESIKYNHPLRKCVDTILKKCPTEYQEKMGRNMDDYEDFDEKHSIYPSEKSLVKLHKQVIYSVQRHRRTQVQWQILLEQAFYLEDVAKNETSATHQFVHTFQSPEPENRFIQYFYNPTFEWYWECLLRPWFYKILAVVLSIFSVIVVWSECTFFSTTPVLSLFAVFIQLAEKTYNYIYIEIACFLSIFFLSICVYSTVFRIRVFNYYYLASHHQTDAYSLLFSGMLFCRLTPPLCLNFLGLTHMDSSISHKNTQPTAYTSIMGSMKVLSFIADGFYIYYPMLVVILCIATYFSLGTRCLNLLGFQQFMGDDDMTSDLVNEGKELIRKEKRKRQRQEEGENRRREWKERYGHNREDSTRNRNIHTDPKESNFSDVNTNRSAFKYTRANNRTERDRIELLQDAEPLDFNAETFTDDPLESESGRYQPGGRYLSMSRSDIFNDV,695,NP_001007528.1.csv,refseq-LMBRD2-NM_001007527.1_clinical_seed_0_final,refseq-LMBRD2-NM_001007527.1.a2m,Invitae,refseq-LMBRD2-NM_001007527.1.npy,1,695,695
+NP_001008214.1,MSHQPLSCLTEKEDSPSESTGNGPPHLAHPNLDTFTPEELLQQMKELLTENHQLKEAMKLNNQAMKGRFEELSAWTEKQKEERQFFEIQSKEAKERLMALSHENEKLKEELGKLKGKSERSSEDPTDDSRLPRAEAEQEKDQLRTQVVRLQAEKADLLGIVSELQLKLNSSGSSEDSFVEIRMAEGEAEGSVKEIKHSPGPTRTVSTGTALSKYRSRSADGAKNYFEHEELTVSQLLLCLREGNQKVERLEVALKEAKERVSDFEKKTSNRSEIETQTEGSTEKENDEEKGPETVGSEVEALNLQVTSLFKELQEAHTKLSEAELMKKRLQEKCQALERKNSAIPSELNEKQELVYTNKKLELQVESMLSEIKMEQAKTEDEKSKLTVLQMTHNKLLQEHNNALKTIEELTRKESEKVDRAVLKELSEKLELAEKALASKQLQMDEMKQTIAKQEEDLETMTILRAQMEVYCSDFHAERAAREKIHEEKEQLALQLAVLLKENDAFEDGGRQSLMEMQSRHGARTSDSDQQAYLVQRGAEDRDWRQQRNIPIHSCPKCGEVLPDIDTLQIHVMDCII,577,NP_001008214.1.csv,OPTN_HUMAN_b03_clinical_seed_0_final,OPTN_HUMAN_b03.a2m,EVE,OPTN_HUMAN_b03_theta_0.2.npy,1,577,577
+NP_001008537.1,MDNQQDKAIVASANGENTLINGVKENDSEDQDVAMKSFAALEAAAPIQPTPVAQKETLMYPRGLLPLPSKKPCMQSPPSPLGLIEAPEHAANSASVNAISLTSGIAKGLNTWSLPNECEKAPFAIMEPAGMSALNGDCLMQPSRTCLGCFMESKDAVDPEPGISLKVGDLNRDYETCAVSDIGIQCINAGENMKYGEQLLSDQLLGFPLHKSRAGDRRETEKPDIDLEDPAQKSYYEALLLDKCNTEEALLANSNQDWGYFETFISESKIELLDLCSKNELSVNLFSEEDVDNYMFDDDESTLGSDVCSLKIRYESFQDNVRDKTTLLMQEDAQFNFFPSVFTTCPKRESKSGALKQSSDFSQFKVPDVSIIWGEEDKNLDKKKGKEEGQEDKGVEKKDGKDNGEKPALNKPCSGTEVEQLKNPKQGHLANSLETSGSFSDDSSFIEISYDAMGEIKDCSRYMARDTNSGSSSSQQNYGLRAKRKVRYSEDYLYDVDSLEGEKVNERKEWLPVGSKEEDDDEWCPKKRRKVTRKEPPVIIKYIIINRFKGEKNMLVKLGKVDASETTVNLSENQLNKYAKLAPLKGFWQKKKKQRNTNTDSIKTPFSQKQSFEPGSFEVSFLPPARKRKSKLGNRHRIQRIPSIEISASSKQISLCNDQRHASNHKEDGGLKGTLKSAPLGAPSCANGSHLNDITGPDSVKVKAQDTEFKGPERKVLNKIKFKSEARLKSKKVKAAGQESKPIVQMSPLLENQSSKANLKNEVIPGTSNSSRLSEFHEAKAAKSSTFLPTTCSSEMPLSSANVTTNIPVIPGGYLQTLLDASDLSNNTSISYFSHHSPEQNEGSLTQTEKSFVPLQPTQDCVLTSSSDSELQQSSHNFKMESSNYRNVWPNKATSGTQEFMAEVSREIAPTQSSEFGASQVVSMENNLTPTTYNPICLNSGGSNCNKVLYDSMQDTQLPSDDSYQLCHFNNGEICFPFQQGPVNMDDGRLFSFDSMAPLSVSSSNYCSLSLKSCEKDGDDDITDDFLAHCSPKLVIQQSIDEIAPLKESTDLLDISNFTPDKFRHSSLSEMSPPDTPSLSPQITRCESMKTLGTLKGFQEGVPGPLDSVEKIKWDCSTLSRQVQMEDGFTLNNHQFQFHMFNDEDSVSLLQKNPCLSTFNDPSGQISTNNKVSKSRKKSSPSKSGAMNQSSSQKNTRKKSLKGNNKGIEKPPGKNSRQVPKSTKKGKYMAAINGEKMQIGIGRGGSQTNTISSTGKTLAECIQHGGPMASMKMPSQKGLSGDWALGKESSPGWSDMSMGTNTNSLLDDDQREFQEPSYILSNIASGMADVQRFMMASIEPLWEPMEHHGDPNIFYSPESNSLKLKTLKILAGTPQESKKKINSGSQGATKNHRSIKGVSKSNGKTAIGDPGRANMPGYNEDSRSTFFDKKYSNMSTLGNNGPTHKKLYRHKSSSKALRDEKCKGKHMEREQVHKDESGTASFEKLRDSDYNLLKAETTFWVLPVFEEETRIFQKDI,1516,NP_001008537.1.csv,refseq-NEXMIF-NM_001008537.2_clinical_seed_0_final,refseq-NEXMIF-NM_001008537.2.a2m,Invitae,refseq-NEXMIF-NM_001008537.2.npy,1,1516,1516
+NP_001008701.1,MARLAAVLWNLCVTAVLVTSATQGLSRAGLPFGLMRRELACEGYPIELRCPGSDVIMVENANYGRTDDKICDADPFQMENVQCYLPDAFKIMSQRCNNRTQCVVVAGSDAFPDPCPGTYKYLEVQYDCVPYKVEQKVFVCPGTLQKVLEPTSTHESEHQSGAWCKDPLQAGDRIYVMPWIPYRTDTLTEYASWEDYVAARHTTTYRLPNRVDGTGFVVYDGAVFYNKERTRNIVKYDLRTRIKSGETVINTANYHDTSPYRWGGKTDIDLAVDENGLWVIYATEGNNGRLVVSQLNPYTLRFEGTWETGYDKRSASNAFMVCGVLYVLRSVYVDDDSEAAGNRVDYAFNTNANREEPVSLTFPNPYQFISSVDYNPRDNQLYVWNNYFVVRYSLEFGPPDPSAGPATSPPLSTTTTARPTPLTSTASPAATTPLRRAPLTTHPVGAINQLGPDLPPATAPVPSTRRPPAPNLHVSPELFCEPREVRRVQWPATQQGMLVERPCPKGTRGIASFQCLPALGLWNPRGPDLSNCTSPWVNQVAQKIKSGENAANIASELARHTRGSIYAGDVSSSVKLMEQLLDILDAQLQALRPIERESAGKNYNKMHKRERTCKDYIKAVVETVDNLLRPEALESWKDMNATEQVHTATMLLDVLEEGAFLLADNVREPARFLAAKENVVLEVTVLNTEGQVQELVFPQEEYPRKNSIQLSAKTIKQNSRNGVVKVVFILYNNLGLFLSTENATVKLAGEAGPGGPGGASLVVNSQVIAASINKESSRVFLMDPVIFTVAHLEDKNHFNANCSFWNYSERSMLGYWSTQGCRLVESNKTHTTCACSHLTNFAVLMAHREIYQGRINELLLSVITWVGIVISLVCLAICISTFCFLRGLQTDRNTIHKNLCINLFLAELLFLVGIDKTQYEIACPIFAGLLHYFFLAAFSWLCLEGVHLYLLLVEVFESEYSRTKYYYLGGYCFPALVVGIAAAIDYRSYGTEKACWLRVDNYFIWSFIGPVSFVIVVNLVFLMVTLHKMIRSSSVLKPDSSRLDNIKSWALGAIALLFLLGLTWAFGLLFINKESVVMAYLFTTFNAFQGVFIFVFHCALQKKVHKEYSKCLRHSYCCIRSPPGGTHGSLKTSAMRSNTRYYTGTQSRIRRMWNDTVRKQTESSFMAGDINSTPTLNRGTMGNHLLTNPVLQPRGGTSPYNTLIAESVGFNPSSPPVFNSPGSYREPKHPLGGREACGMDTLPLNGNFNNSYSLRSGDFPPGDGGPEPPRGRNLADAAAFEKMIISELVHNNLRGSSSAAKGPPPPEPPVPPVPGGGGEEEAGGPGGADRAEIELLYKALEEPLLLPRAQSVLYQSDLDESESCTAEDGATSRPLSSPPGRDSLYASGANLRDSPSYPDSSPEGPSEALPPPPPAPPGPPEIYYTSRPPALVARNPLQGYYQVRRPSHEGYLAAPGLEGPGPDGDGQMQLVTSL,1474,NP_001008701.1.csv,refseq-ADGRL1-NM_001008701.2_clinical_seed_0_final,refseq-ADGRL1-NM_001008701.2.a2m,Invitae,refseq-ADGRL1-NM_001008701.2.npy,1,1474,1474
+NP_001009999.1,MLSGKKAAAAAAAAAAAATGTEAGPGTAGGSENGSEVAAQPAGLSGPAEVGPGAVGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTVVPGSATPMETGIAETPEGRRTSRRKRAKVEYREMDESLANLSEDEYYSEEERNAKAEKEKKLPPPPPQAPPEEENESEPEEPSGQAGGLQDDSSGGYGDGQASGVEGAAFQSRLPHDRMTSQEAACFPDIISGPQQTQKVFLFIRNRTLQLWLDNPKIQLTFEATLQQLEAPYNSDTVLVHRVHSYLERHGLINFGIYKRIKPLPTKKTGKVIIIGSGVSGLAAARQLQSFGMDVTLLEARDRVGGRVATFRKGNYVADLGAMVVTGLGGNPMAVVSKQVNMELAKIKQKCPLYEANGQADTVKVPKEKDEMVEQEFNRLLEATSYLSHQLDFNVLNNKPVSLGQALEVVIQLQEKHVKDEQIEHWKKIVKTQEELKELLNKMVNLKEKIKELHQQYKEASEVKPPRDITAEFLVKSKHRDLTALCKEYDELAETQGKLEEKLQELEANPPSDVYLSSRDRQILDWHFANLEFANATPLSTLSLKHWDQDDDFEFTGSHLTVRNGYSCVPVALAEGLDIKLNTAVRQVRYTASGCEVIAVNTRSTSQTFIYKCDAVLCTLPLGVLKQQPPAVQFVPPLPEWKTSAVQRMGFGNLNKVVLCFDRVFWDPSVNLFGHVGSTTASRGELFLFWNLYKAPILLALVAGEAAGIMENISDDVIVGRCLAILKGIFGSSAVPQPKETVVSRWRADPWARGSYSYVAAGSSGNDYDLMAQPITPGPSIPGAPQPIPRLFFAGEHTIRNYPATVHGALLSGLREAGRIADQFLGAMYTLPRQATPGVPAQQSPSM,876,NP_001009999.1.csv,refseq-KDM1A-NM_001009999.2_clinical_seed_0_final,refseq-KDM1A-NM_001009999.2.a2m,Invitae,refseq-KDM1A-NM_001009999.2.npy,1,876,876
+NP_001010867.1,MATAALLRGATPGRGGPVWRWRLRAAPRCRLAHSSCSPGGDPTAGAAWACFRLDGRTLLRVRGPDAAPFLLGLLTNELPLPSPAAAGAPPAARAGYAHFLNVQGRTLYDVILYGLQEHSEVSGFLLECDSSVQGALQKHLALYRIRRKVTVEPHPELRVWAVLPSSPEACGAASLQERAGAAAILIRDPRTARMGWRLLTQDEGPALVPGGRLGDLWDYHQHRYLQGVPEGVRDLPPGVALPLESNLAFMNGVSFTKGCYIGQELTARTHHMGVIRKRLFPVRFLDPLPTSGITPGATVLTASGQTVGKFRAGQGNVGLALLWSEKIKGPLHIRASEGAQVALAASVPDWWPTVSK,356,NP_001010867.1.csv,refseq-IBA57-NM_001010867.3_clinical_seed_0_final,refseq-IBA57-NM_001010867.3.a2m,Invitae,refseq-IBA57-NM_001010867.3.npy,1,356,356
+NP_001010874.2,MFKRHKSLASERKRALLSQRATRFILKDDMRNFHFLSKLVLSAGPLRPTPAVKHSKTTHFEIEIFDAQTRKQICILDKVTQSSTIHDVKQKFHKACPKWYPSRVGLQLECGGPFLKDYITIQSIAASSIVTLYATDLGQQVSWTTVFLAEYTGPLLIYLLFYLRIPCIYDGKESARRLRHPVVHLACFCHCIHYIRYLLETLFVHKVSAGHTPLKNLIMSCAFYWGFTSWIAYYINHPLYTPPSFGNRQITVSAINFLICEAGNHFINVMLSHPNHTGNNACFPSPNYNPFTWMFFLVSCPNYTYEIGSWISFTVMTQTLPVGIFTLLMSIQMSLWAQKKHKIYLRKFNSYIHRKSAMIPFIL,363,NP_001010874.2.csv,refseq-TECRL-NM_001010874.4_clinical_seed_0_final,refseq-TECRL-NM_001010874.4.a2m,Invitae,refseq-TECRL-NM_001010874.4.npy,1,363,363
+NP_001010892.1,MEDSTSPKQEKENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRPWSPQSRAKTPLGGPAGPETSSPAPVSPREPSSSPSPLAPARQDLAAPPQSDRTTSVIPEAGTPYPDPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKELRFDVFQEEDSNSDYDLQQPAPGGSEVAPSMLEITIQNAKAYLLKTSSNSGFNLYDHLSNMLTKILNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQKALFLQGHLEGVDQELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLTDTHPIQRCRFWGKILGLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGESEAHEDEEDELPKSFYKAPQAIPKEESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIARKIKKFFTGRLDAPIISYPPFPGNESNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEEAEGGRNSFEENPDFEGIQVIDLVESLSNWVHHVQHILSQGRCNWFNSIQKNEEEEEEEDEEKDDSDYIEQEVGLPLLTPISEDLEIQNIPPWTTRLSSNLIPQYAIAVLQSNLWPGAYAFSNGKKFENFYIGWGHKYSPDNYTPPVPPPVYQEYPSGPEITEMDDPSVEEEQAFRAAQEAVLLAAENEESEEDEDEEDDYD,716,NP_001010892.1.csv,refseq-RSPH4A-NM_001010892.2_clinical_seed_0_final,refseq-RSPH4A-NM_001010892.2.a2m,Invitae,refseq-RSPH4A-NM_001010892.2.npy,1,716,716
+NP_001011551.1,MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDERMELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEYVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGHIFNDALVFLPPNGSDND,318,NP_001011551.1.csv,refseq-C1GALT1C1-NM_001011551.2_clinical_seed_0_final,refseq-C1GALT1C1-NM_001011551.2.a2m,Invitae,refseq-C1GALT1C1-NM_001011551.2.npy,1,318,318
+NP_001011658.1,MSGSFYFVIVGHHDNPVFEMEFLPAGKAESKDDHRHLNQFIAHAALDLVDENMWLSNNMYLKTVDKFNEWFVSAFVTAGHMRFIMLHDIRQEDGIKNFFTDVYDLYIKFSMNPFYEPNSPIRSSAFDRKVQFLGKKHLLS,140,NP_001011658.1.csv,refseq-TRAPPC2-NM_001011658.3_clinical_seed_0_final,refseq-TRAPPC2-NM_001011658.3.a2m,Invitae,refseq-TRAPPC2-NM_001011658.3.npy,1,140,140
+NP_001012426.1,MMVESASETIRSAPSGQNGVGSLSGQADGSSGGATGTTASGTGREVTTGADSNGEMSPAELLHFQQQQALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSASGPPEDRDLEEELPGEELS,680,NP_001012426.1.csv,refseq-FOXP4-NM_001012426.1_clinical_seed_0_final,refseq-FOXP4-NM_001012426.1.a2m,Invitae,refseq-FOXP4-NM_001012426.1.npy,1,680,680
+NP_001013860.1,MAQTPDGISCELRGEITRFLWPKEVELLLKTWLPGEGAVQNHVLALLRWRAYLLHTTCLPLRVDCTFSYLEVQAMALQETPPQVTFELESLRELVLEFPGVAALEQLAQHVAAAIKKVFPRSTLGKLFRRPTPASMLARLERSSPSESTDPCSPCGGFLETYEALCDYNGFPFREEIQWDVDTIYHRQGCRHFSLGDFSHLGSRDLALSVAALSYNLWFRCLSCVDMKLSLEVSEQILHMMSQSSHLEELVLETCSLRGDFVRRLAQALAGHSSSGLRELSLAGNLLDDRGMTALSRHLERCPGALRRLSLAQTGLTPRGMRALGRALATNAAFDSTLTHLDLSGNPGALGASEDSGGLYSFLSRPNVLSFLNLAGTDTALDTVRGCSVGGWMTGRADWRAGRGGLGPPAGVANSLPPQLFAAVSRGCCTSLTHLDASRNVFSRTKSRAAPAALQLFLSRARTLRHLGLAGCKLPPDALRALLDGLALNTHLRDLHLDLSACELRSAGAQVIQDLVCDAGAVSSLDLADNGFGSDMVTLVLAIGRSRSLRHVALGRNFNVRCKETLDDVLHRIVQLMQDDDCPLQSLSVAESRLKLGASVLLRALATNPNLTALDISGNAMGDAGAKLLAKALRVNSRLRSVVWDRNHTSALGLLDVAQALEQNHSLKAMPLPLNDVAQAQRSRPELTARAVHQIQACLLRNNRADPASSDHTTRLQPLGLVSDPSEQEVNELCQSVQEHVELLGCGAGPQGEAAVRQAEDAIQNANFSLSILPILYEAGSSPSHHWQLGQKLEGLLRQVGEVCRQDIQDFTQATLDTARSLCPQMLQGSSWREQLEGVLAGSRGLPELLPEQLLQDAFTRLRDMRLSITGTLAESIVAQALAGLSAARDQLVESLAQQATVTMPPALPAPDGGEPSLLEPGELEGLFFPEEKEEEKEKDDSPPQKWPELSHGLHLVPFIHSAAEEAEPEPELAAPGEDAEPQAGPSARGSPSPAAPGPPAGPLPRMDLPLAGQPLRHPTRARPRPRRQHHHRPPPGGPQVPPALPQEGNGLSARVDEGVEEFFSKRLIQQDRLWAPEEDPATEGGATPVPRTLRKKLGTLFAFKKPRSTRGPRTDLETSPGAAPRTRKTTFGDLLRPPTRPSRGEELGGAEGDTSSPDPAGRSRPRYTRDSKAYSMILLPAEEEATLGARPDKRRPLERGETELAPSFEQRVQVMLQRIGVSRGSGGAEGKRKQSKDGEIKKAGSDGDIMDSSTEAPPISIKSRTHSVSADPSCRPGPGSQGPESATWKTLGQQLNAELRSRGWGQQDGPGPPSPGQSPSPCRTSPSPDSLGLPEDPCLGPRNEDGQLRPRPLSAGRRAVSVHEDQLQAPAERPLRLQRSPVLKRRPKLEAPPSPSLGSGLGTEPLPPQPTEPSSPERSPPSPATDQRGGGPNP,1435,NP_001013860.1.csv,refseq-RLTPR-NM_001013838.1_clinical_seed_0_final,refseq-RLTPR-NM_001013838.1.a2m,Invitae,refseq-RLTPR-NM_001013838.1.npy,1,1435,1435
+NP_001014437.1,MADSSGQQAPDYRSILSISDEAARAQALNEHLSTRSYVQGYSLSQADVDAFRQLSAPPADPQLFHVARWFRHIEALLGSPCGKGQPCRLQASKGRRVQPQWSPPAGTQPCRLHLYNSLTRNKEVFIPQDGKKVTWYCCGPTVYDASHMGHARSYISFDILRRVLKDYFKFDVFYCMNITDIDDKIIKRARQNHLFEQYREKRPEAAQLLEDVQAALKPFSVKLNETTDPDKKQMLERIQHAVQLATEPLEKAVQSRLTGEEVNSCVEVLLEEAKDLLSDWLDSTLGCDVTDNSIFSKLPKFWEGDFHRDMEALNVLPPDVLTRVSEYVPEIVNFVQKIVDNGYGYVSNGSVYFDTAKFASSEKHSYGKLVPEAVGDQKALQEGEGDLSISADRLSEKRSPNDFALWKASKPGEPSWPCPWGKGRPGWHIECSAMAGTLLGASMDIHGGGFDLRFPHHDNELAQSEAYFENDCWVRYFLHTGHLTIAGCKMSKSLKNFITIKDALKKHSARQLRLAFLMHSWKDTLDYSSNTMESALQYEKFLNEFFLNVKDILRAPVDITGQFEKWGEEEAELNKNFYDKKTAIHKALCDNVDTRTVMEEMRALVSQCNLYMAARKAVRKRPNQALLENIALYLTHMLKIFGAVEEDSSLGFPVGGPGTSLSLEATVMPYLQVLSEFREGVRKIAREQKVPEILQLSDALRDNILPELGVRFEDHEGLPTVVKLVDRNTLLKEREEKRRVEEEKRKKKEEAARRKQEQEAAKLAKMKIPPSEMFLSETDKYSKFDENGLPTHDMEGKELSKGQAKKLKKLFEAQEKLYKEYLQMAQNGSFQ,831,NP_001014437.1.csv,refseq-CARS-NM_001014437.2_clinical_seed_0_final,refseq-CARS-NM_001014437.2.a2m,Invitae,refseq-CARS-NM_001014437.2.npy,1,831,831
+NP_001014839.1,MSCCDLAAAGQLGKASIMASDCEPALNQAEGRNPTLERYLGALREAKNDSEQFAALLLVTKAVKAGDIDAKTRRRIFDAVGFTFPNRLLTTKEAPDGCPDHVLRALGVALLACFCSDPELAAHPQVLNKIPILSTFLTARGDPDDAARRSMIDDTYQCLTAVAGTPRGPRHLIAGGTVSALCQAYLGHGYGFDQALALLVGLLAAAETQCWKEAEPDLLAVLRGLSEDFQKAEDASKFELCQLLPLFLPPTTVPPECYRDLQAGLARILGSKLSSWQRNPALKLAARLAHACGSDWIPAGSSGSKFLALLVNLACVEVRLALEETGTEVKEDVVTACYALMELGIQECTRCEQSLLKEPQKVQLVSVMKEAIGAVIHYLLQVGSEKQKEPFVFASVRILGAWLAEETSSLRKEVCQLLPFLVRYAKTLYEEAEEANDLSQQVANLAISPTTPGPTWPGDALRLLLPGWCHLTVEDGPREILIKEGAPSLLCKYFLQQWELTSPGHDTSVLPDSVEIGLQTCCHIFLNLVVTAPGLIKRDACFTSLMNTLMTSLPALVQQQGRLLLAANVATLGLLMARLLSTSPALQGTPASRGFFAAAILFLSQSHVARATPGSDQAVLALSPEYEGIWADLQELWFLGMQAFTGCVPLLPWLAPAALRSRWPQELLQLLGSVSPNSVKPEMVAAYQGVLVELARANRLCREAMRLQAGEETASHYRMAALEQCLSEP,729,NP_001014839.1.csv,refseq-NCDN-NM_001014839.1_clinical_seed_0_final,refseq-NCDN-NM_001014839.1.a2m,Invitae,refseq-NCDN-NM_001014839.1.npy,1,729,729
+NP_001015880.1,MSGIKKQKTENQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDSDYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPYTIIKDIHELFVPENKLDHVRAEAETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKEYLQVMHFDTLLDGMALPDGVINMSIPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFYEHRKEERCSRVWGTTCTKHPHIKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLELKQKCKEMNADAVFAFQLRNPVHNGHALLMQDTRRRLLERGYKHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGVLDPKSTIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFYIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAYNKAKKAMDFYDPARHNEFDFISGTRMRKLAREGENPPDGFMAPKAWKVLTDYYRSLEKN,619,NP_001015880.1.csv,refseq-PAPSS2-NM_001015880.1_clinical_seed_0_final,refseq-PAPSS2-NM_001015880.1.a2m,Invitae,refseq-PAPSS2-NM_001015880.1.npy,1,619,619
+NP_001017420.1,MAALTPRKRKQDSLKCDSLLHFTENLFPSPNKKHCFYQNSDKNEENLHCSQQEHFVLSALKTTEINRLPSANQGSPFKSALSTVSFYNQNKWYLNPLERKLIKESRSTCLKTNDEDKSFPIVTEKMQGKPVCSKKNNKKPQKSLTAKYQPKYRHIKPVSRNSRNSKQNRVIYKPIVEKENNCHSAENNSNAPRVLSQKIKPQVTLQGGAAFFVRKKSSLRKSSLENEPSLGRTQKSKSEVIEDSDVETVSEKKTFATRQVPKCLVLEEKLKIGLLSASSKNKEKLIKDSSDDRVSSKEHKVDKNEAFSSEDSLGENKTISPKSTVYPIFSASSVNSKRSLGEEQFSVGSVNFMKQTNIQKNTNTRDTSKKTKDQLIIDAGQKHFGATVCKSCGMIYTASNPEDEMQHVQHHHRFLEGIKYVGWKKERVVAEFWDGKIVLVLPHDPSFAIKKVEDVQELVDNELGFQQVVPKCPNKIKTFLFISDEKRVVGCLIAEPIKQAFRVLSEPIGPESPSSTECPRAWQCSDVPEPAVCGISRIWVFRLKRRKRIARRLVDTLRNCFMFGCFLSTDEIAFSDPTPDGKLFATKYCNTPNFLVYNFNS,601,NP_001017420.1.csv,refseq-ESCO2-NM_001017420.2_clinical_seed_0_final,refseq-ESCO2-NM_001017420.2.a2m,Invitae,refseq-ESCO2-NM_001017420.2.npy,1,601,601
+NP_001017535.1,MEAMAASTSLPDPGDFDRNVPRICGVCGDRATGFHFNAMTCEGCKGFFRRSMKRKALFTCPFNGDCRITKDNRRHCQACRLKRCVDIGMMKEFILTDEEVQRKREMILKRKEEEALKDSLRPKLSEEQQRIIAILLDAHHKTYDPTYSDFCQFRPPVRVNDGGGSHPSRPNSRHTPSFSGDSSSSCSDHCITSSDMMDSSSFSNLDLSEEDSDDPSVTLELSQLSMLPHLADLVSYSIQKVIGFAKMIPGFRDLTSEDQIVLLKSSAIEVIMLRSNESFTMDDMSWTCGNQDYKYRVSDVTKAGHSLELIEPLIKFQVGLKKLNLHEEEHVLLMAICIVSPDRPGVQDAALIEAIQDRLSNTLQTYIRCRHPPPGSHLLYAKMIQKLADLRSLNEEHSKQYRCLSFQPECSMKLTPLVLEVFGNEIS,427,NP_001017535.1.csv,refseq-VDR-NM_001017535.1_clinical_seed_0_final,refseq-VDR-NM_001017535.1.a2m,Invitae,refseq-VDR-NM_001017535.1.npy,1,427,427
+NP_001017995.1,MPPRRSIVEVKVLDVQKRRVPNKHYVYIIRVTWSSGSTEAIYRRYSKFFDLQMQMLDKFPMEGGQKDPKQRIIPFLPGKILFRRSHIRDVAVKRLIPIDEYCKALIQLPPYISQCDEVLQFFETRPEDLNPPKEEHIGKKKSGGDQTSVDPMVLEQYVVVANYQKQESSEISLSVGQVVDIIEKNESGWWFVSTAEEQGWVPATCLEGQDGVQDEFSLQPEEEEKYTVIYPYTARDQDEMNLERGAVVEVIQKNLEGWWKIRYQGKEGWAPASYLKKNSGEPLPPKPGPGSPSHPGALDLDGVSRQQNAVGREKELLSSQRDGRFEGRPVPDGDAKQRSPKMRQRPPPRRDMTIPRGLNLPKPPIPPQVEEEYYTIAEFQTTIPDGISFQAGLKVEVIEKNLSGWWYIQIEDKEGWAPATFIDKYKKTSNASRPNFLAPLPHEVTQLRLGEAAALENNTGSEATGPSRPLPDAPHGVMDSGLPWSKDWKGSKDVLRKASSDMSASAGYEEISDPDMEEKPSLPPRKESIIKSEGELLERERERQRTEQLRGPTPKPPGVILPMMPAKHIPPARDSRRPEPKPDKSRLFQLKNDMGLECGHKVLAKEVKKPNLRPISKSKTDLPEEKPDATPQNPFLKSRPQVRPKPAPSPKTEPPQGEDQVDICNLRSKLRPAKSQDKSLLDGEGPQAVGGQDVAFSRSFLPGEGPGRAQDRTGKQDGLSPKEISCRAPPRPAKTTDPVSKSVPVPLQEAPQQRPVVPPRRPPPPKKTSSSSRPLPEVRGPQCEGHESRAAPTPGRALLVPPKAKPFLSNSLGGQDDTRGKGSLGPWGTGKIGENREKAAAASVPNADGLKDSLYVAVADFEGDKDTSSFQEGTVFEVREKNSSGWWFCQVLSGAPSWEGWIPSNYLRKKP,911,NP_001017995.1.csv,refseq-SH3PXD2B-NM_001017995.2_clinical_seed_0_final,refseq-SH3PXD2B-NM_001017995.2.a2m,Invitae,refseq-SH3PXD2B-NM_001017995.2.npy,1,911,911
+NP_001018005.1,MDAIKKKMQMLKLDKENALDRAEQAEADKKAAEDRSKQLEDELVSLQKKLKGTEDELDKYSEALKDAQEKLELAEKKATDAEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIESRAQKDEEKMEIQEIQLKEAKHIAEDADRKYEEVARKLVIIESDLERAEERAELSEGKCAELEEELKTVTNNLKSLEAQAEKYSQKEDRYEEEIKVLSDKLKEAETRAEFAERSVTKLEKSIDDLEDELYAQKLKYKAISEELDHALNDMTSI,284,NP_001018005.1.csv,refseq-TPM1-NM_001018005.1_clinical_seed_0_final,refseq-TPM1-NM_001018005.1.a2m,Invitae,refseq-TPM1-NM_001018005.1.npy,1,284,284
+NP_001018123.1,MTSKQAMSSNEQERLLCYNGEVLVFQLSKGNFADKEPTKTPILHVRRMVFDRGTKVFVQKSTGFFTIKEENSHLKIMCCNCVSDFRTGINLPYIVIEKNKKNNVFEYFLLILHSTNKFEMRLSFKLGYEMKDGLRVLNGPLILWRHVKAFFFISSQTGKVVSVSGNFSSIQWAGEIENLGMVLLGLKECCLSEEECTQEPSKSDYAIWNTKFCVYSLESQEVLSDIYIIPPAYSSVVTYVHICATEIIKNQLRISLIALTRKNQLISFQNGTPKNVCQLPFGDPCAVQLMDSGGGNLFFVVSFISNNACAVWKESFQVAAKWEKLSLVLIDDFIGSGTEQVLLLFKDSLNSDCLTSFKITDLGKINYSSEPSDCNEDDLFEDKQENRYLVVPPLETGLKVCFSSFRELRQHLLLKEKIISKSYKALINLVQGKDDNTSSAEEKECLVPLCGEEENSVHILDEKLSDNFQDSEQLVEKIWYRVIDDSLVVGVKTTSSLKLSLNDVTLSLLMDQAHDSRFRLLKCQNRVIKLSTNPFPAPYLMPCEIGLEAKRVTLTPDSKKEESFVCEHPSKKECVQIITAVTSLSPLLTFSKFCCTVLLQIMERESGNCPKDRYVVCGRVFLSLEDLSTGKYLLTFPKKKPIEHMEDLFALLAAFHKSCFQITSPGYALNSMKVWLLEHMKCEIIKEFPEVYFCERPGSFYGTLFTWKQRTPFEGILIIYSRNQTVMFQCLHNLIRILPINCFLKNLKSGSENFLIDNMAFTLEKELVTLSSLSSAIAKHESNFMQRCEVSKGKSSVVAAALSDRRENIHPYRKELQREKKKMLQTNLKVSGALYREITLKVAEVQLKSDFAAQKLSNL,859,NP_001018123.1.csv,refseq-FANCB-NM_001018113.1_clinical_seed_0_final,refseq-FANCB-NM_001018113.1.a2m,Invitae,refseq-FANCB-NM_001018113.1.npy,1,859,859
+NP_001018494.1,MSVACVLKRKAVLWQDSFSPHLKHHPQEPANPNMPVVLTSGTGSQAQPQPAANQALAAGTHSSPVPGSIGVAGRSQDDAMVDYFFQRQHGEQLGGGGSGGGGYNNSKHRWPTGDNIHAEHQVRSMDELNHDFQALALEGRAMGEQLLPGKKFWETDESSKDGPKGIFLGDQWRDSAWGTSDHSVSQPIMVQRRPGQSFHVNSEVNSVLSPRSESGGLGVSMVEYVLSSSPGDSCLRKGGFGPRDADSDENDKGEKKNKGTFDGDKLGDLKEEGDVMDKTNGLPVQNGIDADVKDFSRTPGNCQNSANEVDLLGPNQNGSEGLAQLTSTNGAKPVEDFSNMESQSVPLDPMEHVGMEPLQFDYSGTQVPVDSAAATVGLFDYNSQQQLFQRPNALAVQQLTAAQQQQYALAAAHQPHIGLAPAAFVPNPYIISAAPPGTDPYTAGLAAAATLGPAVVPHQYYGVTPWGVYPASLFQQQAAAAAAATNSANQQTTPQAQQGQQQVLRGGASQRPLTPNQNQQGQQTDPLVAAAAVNSALAFGQGLAAGMPGYPVLAPAAYYDQTGALVVNAGARNGLGAPVRLVAPAPVIISSSAAQAAVAAAAASANGAAGGLAGTTNGPFRPLGTQQPQPQPQQQPNNNLASSSFYGNNSLNSNSQSSSLFSQGSAQPANTSLGFGSSSSLGATLGSALGGFGTAVANSNTGSGSRRDSLTGSSDLYKRTSSSLTPIGHSFYNGLSFSSSPGPVGMPLPSQGPGHSQTPPPSLSSHGSSSSLNLGGLTNGSGRYISAAPGAEAKYRSASSASSLFSPSSTLFSSSRLRYGMSDVMPSGRSRLLEDFRNNRYPNLQLREIAGHIMEFSQDQHGSRFIQLKLERATPAERQLVFNEILQAAYQLMVDVFGNYVIQKFFEFGSLEQKLALAERIRGHVLSLALQMYGCRVIQKALEFIPSDQQVINEMVRELDGHVLKCVKDQNGNHVVQKCIECVQPQSLQFIIDAFKGQVFALSTHPYGCRVIQRILEHCLPDQTLPILEELHQHTEQLVQDQYGNYVIQHVLEHGRPEDKSKIVAEIRGNVLVLSQHKFASNVVEKCVTHASRTERAVLIDEVCTMNDGPHSALYTMMKDQYANYVVQKMIDVAEPGQRKIVMHKIRPHIATLRKYTYGKHILAKLEKYYMKNGVDLGPICGPPNGII,1188,NP_001018494.1.csv,refseq-PUM1-NM_001020658.1_clinical_seed_0_final,refseq-PUM1-NM_001020658.1.a2m,Invitae,refseq-PUM1-NM_001020658.1.npy,1,1188,1188
+NP_001019242.1,MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRSEEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDAIFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQVQERLTKQIAVAITEALRPAGVGVVVEATSAEP,213,NP_001019242.1.csv,refseq-GCH1-NM_001024071.2_clinical_seed_0_final,refseq-GCH1-NM_001024071.2.a2m,Invitae,refseq-GCH1-NM_001024071.2.npy,1,213,213
+NP_001019801.3,MASNSLFSTVTPCQQNFFWDPSTSRRFSPPSSSLQPGKMSDVSPVVAAQQQQQQQQQQQQQQQQQQQQQQQEAAAAAAAAAAAAAAAAAVPRLRPPHDNRTMVEIIADHPAELVRTDSPNFLCSVLPSHWRCNKTLPVAFKVVALGEVPDGTVVTVMAGNDENYSAELRNASAVMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGPREPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPSLNSAPSPFNPQGQSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLPAITDVPRRISDDDTATSDFCLWPSTLSKKSQAGASELGPFSDPRQFPSISSLTESRFSNPRMHYPATFTYTPPVTSGMSLGMSATTHYHTYLPPPYPGSSQSQSGPFQTSSTPYLYYGTSSGSYQFPMVPGGDRSPSRMLPPCTTTSNGSTLLNPNLPNQNDGVDADGSHSSSPTVLNSSGRMDESVWRPY,521,NP_001019801.3.csv,refseq-RUNX2-NM_001024630.3_clinical_seed_0_final,refseq-RUNX2-NM_001024630.3.a2m,Invitae,refseq-RUNX2-NM_001024630.3.npy,1,521,521
+NP_001020105.1,MPLVKRNIDPRHLCHTALPRGIKNELECVTNISLANIIRQLSSLSKYAEDIFGELFNEAHSFSFRVNSLQERVDRLSVSVTQLDPKEEELSLQDITMRKAFRSSTIQDQQLFDRKTLPIPLQETYDVCEQPPPLNILTPYRDDGKEGLKFYTNPSYFFDLWKEKMLQDTEDKRKEKRKQKQKNLDRPHEPEKVPRAPHDRRREWQKLAQGPELAEDDANLLHKHIEVANGPASHFETRPQTYVDHMDGSYSLSALPFSQMSELLTRAEERVLVRPHEPPPPPPMHGAGDAKPIPTCISSATGLIENRPQSPATGRTPVFVSPTPPPPPPPLPSALSTSSLRASMTSTPPPPVPPPPPPPATALQAPAVPPPPAPLQIAPGVLHPAPPPIAPPLVQPSPPVARAAPVCETVPVHPLPQGEVQGLPPPPPPPPLPPPGIRPSSPVTVTALAHPPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRHPSTLPVISDARSVLLEAIRKGIQLRKVEEQREQEAKHERIENDVATILSRRIAVEYSDSEDDSEFDEVDWLE,559,NP_001020105.1.csv,refseq-WASF1-NM_001024934.1_clinical_seed_0_final,refseq-WASF1-NM_001024934.1.a2m,Invitae,refseq-WASF1-NM_001024934.1.npy,1,559,559
+NP_001020279.1,MDSFDLALLQEWDLESLWGEDILNQRNDSLVVEFQSSASRCRSVYEPDRNALRRKERERRNQETQQDDGTFNSSYSLFSEPYKTNKGDELSNRIQNTLGNYDEMKDFLTDRSNQSHLVGVPKPGVPQTPVNKIDEHFVADSRAQNQPSSICSTTTSTPAAVPVQQSKRGTMGWQKAGHPPSDGQQRATQQGSLRTLLGDGVGRQQPRAKQVCNVEVGLQTQERPPAMAAKHSSSGHCVQNFPPSLASKPSLVQQKPTAYVRPMDGQDQAPDESPKLKSSSETSVHCTSYRGVPASKPEPARAKAKLSKFSIPKQGEESRSGETNSCVEEIIREMTWLPPLSAIQAPGKVEPTKFPFPNKDSQLVSSGHNNPKKGDAEPESPDNGTSNTSMLEDDLKLSSDEEENEQQAAQRTALRALSDSAVVQQPNCRTSVPSSKGSSSSSSSGSSSSSSDSESSSGSDSETESSSSESEGSKPPHFSSPEAEPASSNKWQLDKWLNKVNPHKPPILIQNESHGSESNQYYNPVKEDVQDCGKVPDVCQPSLREKEIKSTCKEEQRPRTANKAPGSKGVKQKSPPAAVAVAVSAAAPPPAVPCAPAENAPAPARRSAGKKPTRRTERTSAGDGANCHRPEEPAAADALGTSVVVPPEPTKTRPCGNNRASHRKELRSSVTCEKRRTRGLSRIVPKSKEFIETESSSSSSSSDSDLESEQEEYPLSKAQTVAASASSGNDQRLKEAAANGGSGPRAPVGSINARTTSDIAKELEEQFYTLVPFGRNELLSPLKDSDEIRSLWVKIDLTLLSRIPEHLPQEPGVLSAPATKDSESAPPSHTSDTPAEKALPKSKRKRKCDNEDDYREIKKSQGEKDSSSRLATSTSNTLSANHCNMNINSVAIPINKNEKMLRSPISPLSDASKHKYTSEDLTSSSRPNGNSLFTSASSSKKPKADSQLQPHGGDLTKAAHNNSENIPLHKSRPQTKPWSPGSNGHRDCKRQKLVFDDMPRSADYFMQEAKRMKHKADAMVEKFGKALNYAEAALSFIECGNAMEQGPMESKSPYTMYSETVELIRYAMRLKTHSGPNATPEDKQLAALCYRCLALLYWRMFRLKRDHAVKYSKALIDYFKNSSKAAQAPSPWGASGKSTGTPSPMSPNPSPASSVGSQGSLSNASALSPSTIVSIPQRIHQMAANHVSITNSILHSYDYWEMADNLAKENREFFNDLDLLMGPVTLHSSMEHLVQYSQQGLHWLRNSAHLS,1251,NP_001020279.1.csv,NP_001020279.1_clinical_seed_0_final,NP_001020279.1.a2m,popEVE,NP_001020279.1_theta_0.2.npy,1,1251,1251
+NP_001020466.1,MDTAYPREDTRAPTPSKAGAHTALTLGAPHPPPRDHLIWSVFSTLYLNLCCLGFLALAYSIKARDQKVVGDLEAARRFGSKAKCYNILAAMWTLVPPLLLLGLVVTGALHLARLAKDSAAFFSTKFDDADYD,132,NP_001020466.1.csv,refseq-IFITM5-NM_001025295.2_clinical_seed_0_final,refseq-IFITM5-NM_001025295.2.a2m,Invitae,refseq-IFITM5-NM_001025295.2.npy,1,132,132
+NP_001020560.1,MPRQFPKLNISEVDEQVRLLAEKVFAKVLREEDSKDALSLFTVPEDCPIGQKEAKERELQKELAEQKSVETAKRKKSFKMIRSQSLSLQMPPQQDWKGPPAASPAMSPTTPVVTGATSLPTPAPYAMPEFQRVTISGDYCAGITLEDYEQAAKSLAKALMIREKYARLAYHRFPRITSQYLGHPRADTAPPEEGLPDFHPPPLPQEDPYCLDDAPPNLDYLVHMQGGILFVYDNKKMLEHQEPHSLPYPDLETYTVDMSHILALITDGPTKTYCHRRLNFLESKFSLHEMLNEMSEFKELKSNPHRDFYNVRKVDTHIHAAACMNQKHLLRFIKHTYQTEPDRTVAEKRGRKITLRQVFDGLHMDPYDLTVDSLDVHAGRQTFHRFDKFNSKYNPVGASELRDLYLKTENYLGGEYFARMVKEVARELEESKYQYSEPRLSIYGRSPEEWPNLAYWFIQHKVYSPNMRWIIQVPRIYDIFRSKKLLPNFGKMLENIFLPLFKATINPQDHRELHLFLKYVTGFDSVDDESKHSDHMFSDKSPNPDVWTSEQNPPYSYYLYYMYANIMVLNNLRRERGLSTFLFRPHCGEAGSITHLVSAFLTADNISHGLLLKKSPVLQYLYYLAQIPIAMSPLSNNSLFLEYSKNPLREFLHKGLHVSLSTDDPMQFHYTKEALMEEYAIAAQVWKLSTCDLCEIARNSVLQSGLSHQEKQKFLGQNYYKEGPEGNDIRKTNVAQIRMAFRYETLCNELSFLSDAMKSEEITALTN,767,NP_001020560.1.csv,refseq-AMPD3-NM_001025389.1_clinical_seed_0_final,refseq-AMPD3-NM_001025389.1.a2m,Invitae,refseq-AMPD3-NM_001025389.1.npy,1,767,767
+NP_001020774.1,MAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQNKVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEEHTDTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYCYSGIRRKTLVSMPPLPGLDLKGSESPEMGPEVTPAPRDELVEAACALTCDWAERILKRSFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGLENPEGGAHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLPLLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPALPGPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKAAKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGEGNSAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVSVIKGSRSQKEAFPLAKGEVDTAPQGNKDLKEHVLQSSLSQEHKDPKATPP,616,NP_001020774.1.csv,refseq-RFX5-NM_001025603.2_clinical_seed_0_final,refseq-RFX5-NM_001025603.2.a2m,Invitae,refseq-RFX5-NM_001025603.2.npy,1,616,616
+NP_001025023.1,MGCAPSIHVSQSGVIYCRDSDESSSPRQTTSVSQGPAAPLPGLFVQTDAADAIPPSRASGPPSVARVRRARTELGSGSSAGSAAPAATTSRGRRRHCCSSAEAETQTCYTSVKQVSSAEVRIGPMRLTQDPIQVLLIFAKEDSQSDGFWWACDRAGYRCNIARTPESALECFLDKHHEIIVIDHRQTQNFDAEAVCRSIRATNPSEHTVILAVVSRVSDDHEEASVLPLLHAGFNRRFMENSSIIACYNELIQIEHGEVRSQFKLRACNSVFTALDHCHEAIEITSDDHVIQYVNPAFERMMGYHKGELLGKELADLPKSDKNRADLLDTINTCIKKGKEWQGVYYARRKSGDSIQQHVKITPVIGQGGKIRHFVSLKKLCCTTDNNKQIHKIHRDSGDNSQTEPHSFRYKNRRKESIDVKSISSRGSDAPSLQNRRYPSMARIHSMTIEAPITKDGLRRLSGNEYVFTKNVHQSHSHLAMPITINDVPPCISQLLDNEESWDFNIFELEAITHKRPLVYLGLKVFSRFGVCEFLNCSETTLRAWFQVIEANYHSSNAYHNSTHAADVLHATAFFLGKERVKGSLDQLDEVAALIAATVHDVDHPGRTNSFLCNAGSELAVLYNDTAVLESHHTALAFQLTVKDTKCNIFKNIDRNHYRTLRQAIIDMVLATEMTKHFEHVNKFVNSINKPMAAEIEGSDCECNPAGKNFPENQILIKRMMIKCADVANPCRPLDLCIEWAGRISEEYFAQTDEEKRQGLPVVMPVFDRNTCSIPKSQISFIDYFITDMFDAWDAFAHLPALMQHLADNYKHWKTLDDLKCKSLRLPSDS,830,NP_001025023.1.csv,refseq-PDE8B-NM_001029852.4_clinical_seed_0_final,refseq-PDE8B-NM_001029852.4.a2m,Invitae,refseq-PDE8B-NM_001029852.4_theta_0.2.npy,1,830,830
+NP_001025053.1,MRVKPQGLVVTSSAVCSSPDYLREPKYYPGGPPTPRPLLPTRPPASPPDKAFSTHAFSENPRPPPRRDPSTRRPPVLAKGDDPLPPRAARPVSQARCPTPVGDGSSSRRCWDNGRVNLRPVVQLIDIMKDLTRLSQDLQHSGVHLDCGGLRLSRPPAPPPGDLQYSFFSSPSLANSIRSPEERATPHAKSERPSHPLYEPEPEPRDSPQPGQGHSPGATAAATGLPPEPEPDSTDYSELADADILSELASLTCPEAQLLEAQALEPPSPEPEPQLLDPQPRFLDPQALEPLGEALELPPLQPLADPLGLPGLALQALDTLPDSLESQLLDPQALDPLPKLLDVPGRRLEPQQPLGHCPLAEPLRLDLCSPHGPPGPEGHPKYALRRTDRPKILCRRRKAGRGRKADAGPEGRLLPLPMPTGLVAALAEPPPPPPPPPPALPGPGPVSVPELKPESSQTPVVSTRKGKCRGVRRMVVKMAKIPVSLGRRNKTTYKVSSLSSSLSVEGKELGLRVSAEPTPLLKMKNNGRNVVVVFPPGEMPIILKRKRGRPPKNLLLGPGKPKEPAVVAAEAATVAAATMAMPEVKKRRRRKQKLASPQPSYAADANDSKAEYSDVLAKLAFLNRQSQCAGRCSPPRCWTPSEPESVHQAPDTQSISHFLHRVQGFRRRGGKAGGFGGRGGGHAAKSARCSFSDFFEGIGKKKKVVAVAAAGVGGPGLTELGHPRKRGRGEVDAVTGKPKRKRRSRKNGTLFPEQVPSGPGFGEAGAEWAGDKGGGWAPHHGHPGGQAGRNCGFQGTEARAFASTGLESGASGRGSYYSTGAPSGQTELSQERQNLFTGYFRSLLDSDDSSDLLDFALSASRPESRKASGTYAGPPTSALPAQRGLATFPSRGAKASPVAVGSSGAGADPSFQPVLSARQTFPPGRAASYGLTPAASDCRAAETFPKLVPPPSAMARSPTTHPPANTYLPQYGGYGAGQSVFAPTKPFTGQDCANSKDCSFAYGSGNSLPASPSSAHSAGYAPPPTGGPCLPPSKASFFSSSEGAPFSGSAPTPLRCDSRASTVSPGGYMVPKGTTASATSAASAASSSSSSFQPSPENCRQFAGASQWPFRQGYGGLDWASEAFSQLYNPSFDCHVSEPNVILDISNYTPQKVKQQTAVSETFSESSSDSTQFNQPVGGGGFRRANSEASSSEGQSSLSSLEKLMMDWNEASSAPGYNWNQSVLFQSSSKPGRGRRKKVDLFEASHLGFPTSASAAASGYPSKRSTGPRQPRGGRGGGACSAKKERGGAAAKAKFIPKPQPVNPLFQDSPDLGLDYYSGDSSMSPLPSQSRAFGVGERDPCDFIGPYSMNPSTPSDGTFGQGFHCDSPSLGAPELDGKHFPPLAHPPTVFDAGLQKAYSPTCSPTLGFKEELRPPPTKLAACEPLKHGLQGASLGHAAAAQAHLSCRDLPLGQPHYDSPSCKGTAYWYPPGSAARSPPYEGKVGTGLLADFLGRTEAACLSAPHLASPPATPKADKEPLEMARPPGPPRGPAAAAAGYGCPLLSDLTLSPVPRDSLLPLQDTAYRYPGFMPQAHPGLGGGPKSGFLGPMAEPHPEDTFTVTSL,1603,NP_001025053.1.csv,refseq-AHDC1-NM_001029882.3_clinical_seed_0_final,refseq-AHDC1-NM_001029882.3.a2m,Invitae,refseq-AHDC1-NM_001029882.3.npy,1,1603,1603
+NP_001025054.1,MGCTPSHSDLVNSVAKSGIQFLKKPKAIRPGCQGGSERGSIPLLVKNSTCYDAGEGLAEEQPSPRRNQTTAKGLCQLMGDPASGKRKDMEGLIPGTKTSSSQLNKSQSHMAKDIPFKTQGSHGSQGADFSGDESEESSTQDTSKWKRTAKCHTSSTQSHCYQTIHPAHEPEGKVDFPEPLVKAHQQAYTYLHSSLSKYEAILCIIHQATQTRELLQPMVSFLLLCFEEISQLLGEISKDGEVLLQEVREDLAWPLKKREPQEQPNLLQQLLQYTVSKLQVLNGTVASLTGSFLEGSSSYLHSTATHLENKLSTKRNVDERLLRALRQLESLASGCGDPGVQGLPLCSEDSGIGADNESVQSVDKLGKQTSWDLAPEPEEWKSVTSPHTEARQSGHTWQQSPFCLGSGRPQDCLLSGAPMAKVQPRAQDEARSPCLSSTSPENITSPPLKLGTSTPCDSFGIGVSVEPHLSKTSRPMDASSLSDSEDSSPEEEEEDKMSSMSLCAWQEKTPHSRPQSSPADRESPFQARTRRLRSLQAQEMILKMKESISERIKFVPVPCGHQDWSEEEEGRTVVPPRPSTVSGSRRAPERQTRSQSESCLQSHVEDPTFQELRRVQRDLSQKLEAFYALGAKGQGQSQEQILQPRAAAVWPNGTCRVSPSNTTSRLKASLTKNFSILPSQDKSILQKCNPHPEDEQGKAGKLPNAIPSGEVSEAAKATDWNVRGCPTRTSVKKLIETFSPTESLRMLGDSKDAGASPCLRNCIMPPRFPKYTGLAPLYPKPQISPASGRESLKMGIGWKPLAPIFPPLPKAEAAKSEELSCEMEGNLEHLPPPPMEVLMDKSFASLESPESSKSTENSPKETQEPGPGEAGPTRRTWASPKLRASVSPLDLLPSKSTASLTKPHSTGPGSGRSSCQPRKPALDLSSPPATSQSPEVKGGTWSQAEKATSLYRQPRKAIAWHHSGPPSGQNRTSESSLARPRQSRERSPPVGRKASPTRTHWVPQADKRRRSLPSSYRPAQPSPSAVQTPPSPPVSPRVLSPPTTKRRTSPPHQPKLPNPPPESAPAQCKVPSPPTQHPEASPPFSIPSPSPPMSPSQEHKETRDSEDSQAVIAKVSGNTHSIFCPATSSLFEAKPPLSTAHPLTPPSLPPEAGGPLGNPAECWKNSSGPWLRADSQRRAALCALNPLPFLRRTASDRQPGGRPQPPTLDPTSTSYESQLGQNSSSEESPKKDTEPGSSPCSPELQGGTRRASPPEFCVLGHGLQPEPRTGHIQDKSQPEAQPQQEEVS,1288,NP_001025054.1.csv,refseq-PCARE-NM_001029883.2_clinical_seed_0_final,refseq-PCARE-NM_001029883.2.a2m,Invitae,refseq-PCARE-NM_001029883.2.npy,1,1288,1288
+NP_001025178.1,MPAPIRLRELIRTIRTARTQAEEREMIQKECAAIRSSFREEDNTYRCRNVAKLLYMHMLGYPAHFGQLECLKLIASQKFTDKRIGYLGAMLLLDERQDVHLLMTNCIKNDLNHSTQFVQGLALCTLGCMGSSEMCRDLAGEVEKLLKTSNSYLRKKAALCAVHVIRKVPELMEMFLPATKNLLNEKNHGVLHTSVVLLTEMCERSPDMLAHFRKNEKLVPQLVRILKNLIMSGYSPEHDVSGISDPFLQVRILRLLRILGRNDDDSSEAMNDILAQVATNTETSKNVGNAILYETVLTIMDIKSESGLRVLAINILGRFLLNNDKNIRYVALTSLLKTVQTDHNAVQRHRSTIVDCLKDLDVSIKRRAMELSFALVNGNNIRGMMKELLYFLDSCEPEFKADCASGIFLAAEKYAPSKRWHIDTIMRVLTTAGSYVRDDAVPNLIQLITNSVEMHAYTVQRLYKAILGDYSQQPLVQVAAWCIGEYGDLLVSGQCEEEEPIQVTEDEVLDILESVLISNMSTSVTRGYALTAIMKLSTRFTCTVNRIKKVVSIYGSSIDVELQQRAVEYNALFKKYDHMRSALLERMPVMEKVTTNGPTEIVQTNGETEPAPLETKPPPSGPQPTSQANDLLDLLGGNDITPVIPTAPTSKPSSAGGELLDLLGDINLTGAPAAAPAPASVPQISQPPFLLDGLSSQPLFNDIAAGIPSITAYSKNGLKIEFTFERSNTNPSVTVITIQASNSTELDMTDFVFQAAVPKTFQLQLLSPSSSIVPAFNTGTITQVIKVLNPQKQQLRMRIKLTYNHKGSAMQDLAEVNNFPPQSWQ,825,NP_001025178.1.csv,refseq-AP1G1-NM_001030007.1_clinical_seed_0_final,refseq-AP1G1-NM_001030007.1.a2m,Invitae,refseq-AP1G1-NM_001030007.1_theta_0.2.npy,1,825,825
+NP_001025482.1,MPWRRRRNRVSALEGGREEEAPPEAAAVPPALLTSPQQTEAAAERILLRGIFEIGRDSCDVVLSERALRWRPIQPERPAGDSKYDLLCKEEFIELKDIFSVKLKRRCSVKQQRSGTLLGITLFICLKKEQNKLKNSTLDLINLSEDHCDIWFRQFKKILAGFPNRPKSLKILLNPQSHKKEATQVYYEKVEPLLKLAGIKTDVTIMEYEGHALSLLKECELQGFDGGHRKPLFAIHWSVQRLFTGMQTLEPSVVCVGGDGSASEVAHALLLRAQKNAGMETDRILTPVRAQLPLGLIPAGSTNVLAHSLHGVPHVITATLHIIMGHVQLVDVCTFSTAGKLLRFGFSAMFGFGGRTLALAEKYRWMSPNQRRDFAVVKALAKLKAEDCEISFLPFNSSDDVQERRAQGSPKSDCNDQWQMIQGQFLNVSIMAIPCLCSVAPRGLAPNTRLNNGSMALIIARNTSRPEFIKHLKRYASVKNQFNFPFVETYTVEEVKVHPRNNTGGYNPEEEEDETASENCFPWNVDGDLMEVASEVHIRLHPRLISLYGGSMEEMIPK,558,NP_001025482.1.csv,refseq-CERKL-NM_001030311.2_clinical_seed_0_final,refseq-CERKL-NM_001030311.2.a2m,Invitae,refseq-CERKL-NM_001030311.2.npy,1,558,558
+NP_001026859.1,MTSGATRYRLSCSLRGHELDVRGLVCCAYPPGAFVSVSRDRTTRLWAPDSPNRSFTEMHCMSGHSNFVSCVCIIPSSDIYPHGLIATGGNDHNICIFSLDSPMPLYILKGHKNTVCSLSSGKFGTLLSGSWDTTAKVWLNDKCMMTLQGHTAAVWAVKILPEQGLMLTGSADKTVKLWKAGRCERTFSGHEDCVRGLAILSETEFLSCANDASIRRWQITGECLEVYYGHTNYIYSISVFPNCRDFVTTAEDRSLRIWKHGECAQTIRLPAQSIWCCCVLDNGDIVVGASDGIIRVFTESEDRTASAEEIKAFEKELSHATIDSKTGDLGDINAEQLPGREHLNEPGTREGQTRLIRDGEKVEAYQWSVSEGRWIKIGDVVGSSGANQQTSGKVLYEGKEFDYVFSIDVNEGGPSYKLPYNTSDDPWLTAYNFLQKNDLNPMFLDQVAKFIIDNTKGQMLGLGNPSFSDPFTGGGRYVPGSSGSSNTLPTADPFTGAGRYVPGSASMGTTMAGVDPFTGNSAYRSAASKTMNIYFPKKEAVTFDQANPTQILGKLKELNGTAPEEKKLTEDDLILLEKILSLICNSSSEKPTVQQLQILWKAINCPEDIVFPALDILRLSIKHPSVNENFCNEKEGAQFSSHLINLLNPKGKPANQLLALRTFCNCFVGQAGQKLMMSQRESLMSHAIELKSGSNKNIHIALATLALNYSVCFHKDHNIEGKAQCLSLISTILEVVQDLEATFRLLVALGTLISDDSNAVQLAKSLGVDSQIKKYSSVSEPAKVSECCRFILNLL,795,NP_001026859.1.csv,refseq-PLAA-NM_001031689.2_clinical_seed_0_final,refseq-PLAA-NM_001031689.2.a2m,Invitae,refseq-PLAA-NM_001031689.2_theta_0.2.npy,1,795,795
+NP_001026895.2,MFVPRSLKIKRNANDDGKSCVAKIIKPDPEDLQLDKSRDVPVDAVATEAATIDRHISESCPFPSPGGQLAEVHSVSPEQGAKDSHPSEEPVKSFSKTQRWAEPGEPICVVCGRYGEYICDKTDEDVCSLECKAKHLLQVKEKEEKSKLSNPQKADSEPESPLNASYVYKEHPFILNLQEDQIENLKQQLGILVQGQEVTRPIIDFEHCSLPEVLNHNLKKSGYEVPTPIQMQMIPVGLLGRDILASADTGSGKTAAFLLPVIMRALFESKTPSALILTPTRELAIQIERQAKELMSGLPRMKTVLLVGGLPLPPQLYRLQQHVKVIIATPGRLLDIIKQSSVELCGVKIVVVDEADTMLKMGFQQQVLDILENIPNDCQTILVSATIPTSIEQLASQLLHNPVRIITGEKNLPCANVRQIILWVEDPAKKKKLFEILNDKKLFKPPVLVFVDCKLGADLLSEAVQKITGLKSISIHSEKSQIERKNILKGLLEGDYEVVVSTGVLGRGLDLISVRLVVNFDMPSSMDEYVHQIGRVGRLGQNGTAITFINNNSKRLFWDIAKRVKPTGSILPPQLLNSPYLHDQKRKEQQKDKQTQNDLVTGANLMDIIRKHDKSNSQK,619,NP_001026895.2.csv,refseq-DDX59-NM_001031725.4_clinical_seed_0_final,refseq-DDX59-NM_001031725.4.a2m,Invitae,refseq-DDX59-NM_001031725.4.npy,1,619,619
+NP_001027565.1,MMLSTEGREGFVVKVRGLPWSCSADEVMRFFSDCKIQNGTSGIRFIYTREGRPSGEAFVELESEEEVKLALKKDRETMGHRYVEVFKSNSVEMDWVLKHTGPNSPDTANDGFVRLRGLPFGCSKEEIVQFFSGLEIVPNGMTLPVDFQGRSTGEAFVQFASQEIAEKALKKHKERIGHRYIEIFKSSRAEVRTHYDPPRKLMAMQRPGPYDRPGAGRGYNSIGRGAGFERMRRGAYGGGYGGYDDYGGYNDGYGFGSDRFGRDLNYCFSGMSDHRYGDGGSSFQSTTGHCVHMRGLPYRATENDIYNFFSPLNPMRVHIEIGPDGRVTGEADVEFATHEDAVAAMAKDKANMQHRYVELFLNSTAGTSGGAYDHSYVELFLNSTAGASGGAYGSQMMGGMGLSNQSSYGGPASQQLSGGYGGGYGGQSSMSGYDQVLQENSSDYQSNLA,449,NP_001027565.1.csv,refseq-HNRNPH2-NM_001032393.2_clinical_seed_0_final,refseq-HNRNPH2-NM_001032393.2.a2m,Invitae,refseq-HNRNPH2-NM_001032393.2_theta_0.2.npy,1,449,449
+NP_001027567.2,MMFRSDRMWSCHWKWKPSPLLFLFALYIMCVPHSVWGCANCRVVLSNPSGTFTSPCYPNDYPNSQACMWTLRAPTGYIIQITFNDFDIEEAPNCIYDSLSLDNGESQTKFCGATAKGLSFNSSANEMHVSFSSDFSIQKKGFNASYIRVAVSLRNQKVILPQTSDAYQVSVAKSISIPELSAFTLCFEATKVGHEDSDWTAFSYSNASFTQLLSFGKAKSGYFLSISDSKCLLNNALPVKEKEDIFAESFEQLCLVWNNSLGSIGVNFKRNYETVPCDSTISKVIPGNGKLLLGSNQNEIVSLKGDIYNFRLWNFTMNAKILSNLSCNVKGNVVDWQNDFWNIPNLALKAESNLSCGSYLIPLPAAELASCADLGTLCQDGIIYRISVVIQNILRHPEVKVQSKVAEWLNSTFQNWNYTVYVVNISFHLSAGEDKIKVKRSLEDEPRLVLWALLVYNATNNTNLEGKIIQQKLLKNNESLDEGLRLHTVNVRQLGHCLAMEEPKGYYWPSIQPSEYVLPCPDKPGFSASRICFYNATNPLVTYWGPVDISNCLKEANEVANQILNLTADGQNLTSANITNIVEQVKRIVNKEENIDITLGSTLMNIFSNILSSSDSDLLESSSEALKTIDELAFKIDLNSTSHVNITTRNLALSVSSLLPGTNAISNFSIGLPSNNESYFQMDFESGQVDPLASVILPPNLLENLSPEDSVLVRRAQFTFFNKTGLFQDVGPQRKTLVSYVMACSIGNITIQNLKDPVQIKIKHTRTQEVHHPICAFWDLNKNKSFGGWNTSGCVAHRDSDASETVCLCNHFTHFGVLMDLPRSASQLDARNTKVLTFISYIGCGISAIFSAATLLTYVAFEKLRRDYPSKILMNLSTALLFLNLLFLLDGWITSFNVDGLCIAVAVLLHFFLLATFTWMGLEAIHMYIALVKVFNTYIRRYILKFCIIGWGLPALVVSVVLASRNNNEVYGKESYGKEKGDEFCWIQDPVIFYVTCAGYFGVMFFLNIAMFIVVMVQICGRNGKRSNRTLREEVLRNLRSVVSLTFLLGMTWGFAFFAWGPLNIPFMYLFSIFNSLQGLFIFIFHCAMKENVQKQWRQHLCCGRFRLADNSDWSKTATNIIKKSSDNLGKSLSSSSIGSNSTYLTSKSKSSSTTYFKRNSHTDSASMDKSLSKLAHADGDQTSIIPVHQVIDKVKGYCNAHSDNFYKNIIMSDTFSHSTKF,1222,NP_001027567.2.csv,refseq-ADGRG6-NM_001032395.3_clinical_seed_0_final,refseq-ADGRG6-NM_001032395.3.a2m,Invitae,refseq-ADGRG6-NM_001032395.3.npy,1,1222,1222
+NP_001028.1,MGRLLALVVGAALVSSACGGCVEVDSETEAVYGMTFKILCISCKRRSETNAETFTEWTFRQKGTEEFVKILRYENEVLQLEEDERFEGRVVWNGSRGTKDLQDLSIFITNVTYNHSGDYECHVYRLLFFENYEHNTSVVKKIHIEVVDKANRDMASIVSEIMMYVLIVVLTIWLVAEMIYCYKKIAAATETAAQENASEYLAITSESKENCTGVQVAE,218,NP_001028.1.csv,refseq-SCN1B-NM_001037.4_clinical_seed_0_final,refseq-SCN1B-NM_001037.4.a2m,Invitae,refseq-SCN1B-NM_001037.4.npy,1,218,218
+NP_001029.1,MEGNKLEEQDSSPPQSTPGLMKGNKREEQGLGPEPAAPQQPTAEEEALIEFHRSYRELFEFFCNNTTIHGAIRLVCSQHNRMKTAFWAVLWLCTFGMMYWQFGLLFGEYFSYPVSLNINLNSDKLVFPAVTICTLNPYRYPEIKEELEELDRITEQTLFDLYKYSSFTTLVAGSRSRRDLRGTLPHPLQRLRVPPPPHGARRARSVASSLRDNNPQVDWKDWKIGFQLCNQNKSDCFYQTYSSGVDAVREWYRFHYINILSRLPETLPSLEEDTLGNFIFACRFNQVSCNQANYSHFHHPMYGNCYTFNDKNNSNLWMSSMPGINNGLSLMLRAEQNDFIPLLSTVTGARVMVHGQDEPAFMDDGGFNLRPGVETSISMRKETLDRLGGDYGDCTKNGSDVPVENLYPSKYTQQVCIHSCFQESMIKECGCAYIFYPRPQNVEYCDYRKHSSWGYCYYKLQVDFSSDHLGCFTKCRKPCSVTSYQLSAGYSRWPSVTSQEWVFQMLSRQNNYTVNNKRNGVAKVNIFFKELNYKTNSESPSVTMVTLLSNLGSQWSLWFGSSVLSVVEMAELVFDLLVIMFLMLLRRFRSRYWSPGRGGRGAQEVASTLASSPPSHFCPHPMSLSLSQPGPAPSPALTAPPPAYATLGPRPSPGGSAGASSSTCPLGGP,669,NP_001029.1.csv,refseq-SCNN1A-NM_001038.5_clinical_seed_0_final,refseq-SCNN1A-NM_001038.5.a2m,Invitae,refseq-SCNN1A-NM_001038.5.npy,1,669,669
+NP_001029027.1,MSSFEGQMAEYPTISIDRFDRENLRARAYFLSHCHKDHMKGLRAPTLKRRLECSLKVYLYCSPVTKELLLTSPKYRFWKKRIISIEIETPTQISLVDEASGEKEEIVVTLLPAGHCPGSVMFLFQGNNGTVLYTGDFRLAQGEAARMELLHSGGRVKDIQSVYLDTTFCDPRFYQIPSREECLSGVLELVRSWITRSPYHVVWLNCKAAYGYEYLFTNLSEELGVQVHVNKLDMFRNMPEILHHLTTDRNTQIHACRHPKAEEYFQWSKLPCGITSRNRIPLHIISIKPSTMWFGERSRKTNVIVRTGESSYRACFSFHSSYSEIKDFLSYLCPVNAYPNVIPVGTTMDKVVEILKPLCRSSQSTEPKYKPLGKLKRARTVHRDSEEEDDYLFDDPLPIPLRHKVPYPETFHPEVFSMTAVSEKQPEKLRQTPGCCRAECMQSSRFTNFVDCEESNSESEEEVGIPASLQGDLGSVLHLQKADGDVPQWEVFFKRNDEITDESLENFPSSTVAGGSQSPKLFSDSDGESTHISSQNSSQSTHITEQGSQGWDSQSDTVLLSSQERNSGDITSLDKADYRPTIKENIPASLMEQNVICPKDTYSDLKSRDKDVTIVPSTGEPTTLSSETHIPEEKSLLNLSTNADSQSSSDFEVPSTPEAELPKREHLQYLYEKLATGESIAVKKRKCSLLDT,692,NP_001029027.1.csv,refseq-DCLRE1C-NM_001033855.2_clinical_seed_0_final,refseq-DCLRE1C-NM_001033855.2.a2m,Invitae,refseq-DCLRE1C-NM_001033855.2.npy,1,692,692
+NP_001030025.1,MREPEELMPDSGAVFTFGKSKFAENNPGKFWFKNDVPVHLSCGDEHSAVVTGNNKLYMFGSNNWGQLGLGSKSAISKPTCVKALKPEKVKLAACGRNHTLVSTEGGNVYATGGNNEGQLGLGDTEERNTFHVISFFTSEHKIKQLSAGSNTSAALTEDGRLFMWGDNSEGQIGLKNVSNVCVPQQVTIGKPVSWISCGYYHSAFVTTDGELYVFGEPENGKLGLPNQLLGNHRTPQLVSEIPEKVIQVACGGEHTVVLTENAVYTFGLGQFGQLGLGTFLFETSEPKVIENIRDQTISYISCGENHTALITDIGLMYTFGDGRHGKLGLGLENFTNHFIPTLCSNFLRFIVKLVACGGCHMVVFAAPHRGVAKEIEFDEINDTCLSVATFLPYSSLTSGNVLQRTLSARMRRRERERSPDSFSMRRTLPPIEGTLGLSACFLPNSVFPRCSERNLQESVLSEQDLMQPEEPDYLLDEMTKEAEIDNSSTVESLGETTDILNMTHIMSLNSNEKSLKLSPVQKQKKQQTIGELTQDTALTENDDSDEYEEMSEMKEGKACKQHVSQGIFMTQPATTIEAFSDEEVEIPEEKEGAEDSKGNGIEEQEVEANEENVKVHGGRKEKTEILSDDLTDKAEVSEGKAKSVGEAEDGPEGRGDGTCEEGSSGAEHWQDEEREKGEKDKGRGEMERPGEGEKELAEKEEWKKRDGEEQEQKEREQGHQKERNQEMEEGGEEEHGEGEEEEGDREEEEEKEGEGKEEGEGEEVEGEREKEEGERKKEERAGKEEKGEEEGDQGEGEEEETEGRGEEKEEGGEVEGGEVEEGKGEREEEEEEGEGEEEEGEGEEEEGEGEEEEGEGKGEEEGEEGEGEEEGEEGEGEGEEEEGEGEGEEEGEGEGEEEEGEGEGEEEGEGEGEEEEGEGKGEEEGEEGEGEGEEEEGEGEGEDGEGEGEEEEGEWEGEEEEGEGEGEEEGEGEGEEGEGEGEEEEGEGEGEEEEGEEEGEEEGEGEEEGEGEGEEEEEGEVEGEVEGEEGEGEGEEEEGEEEGEEREKEGEGEENRRNREEEEEEEGKYQETGEEENERQDGEEYKKVSKIKGSVKYGKHKTYQKKSVTNTQGNGKEQRSKMPVQSKRLLKNGPSGSKKFWNNVLPHYLELK,1152,NP_001030025.1.csv,refseq-RPGR-NM_001034853.1_clinical_seed_0_final,refseq-RPGR-NM_001034853.1.a2m,Invitae,refseq-RPGR-NM_001034853.1.npy,1,1152,1152
+NP_001032.2,MARKKFSGLEISLIVLFVIVTIIAIALIVVLATKTPAVDEISDSTSTPATTRVTTNPSDSGKCPNVLNDPVNVRINCIPEQFPTEGICAQRGCCWRPWNDSLIPWCFFVDNHGYNVQDMTTTSIGVEAKLNRIPSPTLFGNDINSVLFTTQNQTPNRFRFKITDPNNRRYEVPHQYVKEFTGPTVSDTLYDVKVAQNPFSIQVIRKSNGKTLFDTSIGPLVYSDQYLQISTRLPSDYIYGIGEQVHKRFRHDLSWKTWPIFTRDQLPGDNNNNLYGHQTFFMCIEDTSGKSFGVFLMNSNAMEIFIQPTPIVTYRVTGGILDFYILLGDTPEQVVQQYQQLVGLPAMPAYWNLGFQLSRWNYKSLDVVKEVVRRNREAGIPFDTQVTDIDYMEDKKDFTYDQVAFNGLPQFVQDLHDHGQKYVIILDPAISIGRRANGTTYATYERGNTQHVWINESDGSTPIIGEVWPGLTVYPDFTNPNCIDWWANECSIFHQEVQYDGLWIDMNEVSSFIQGSTKGCNVNKLNYPPFTPDILDKLMYSKTICMDAVQNWGKQYDVHSLYGYSMAIATEQAVQKVFPNKRSFILTRSTFAGSGRHAAHWLGDNTASWEQMEWSITGMLEFSLFGIPLVGADICGFVAETTEELCRRWMQLGAFYPFSRNHNSDGYEHQDPAFFGQNSLLVKSSRQYLTIRYTLLPFLYTLFYKAHVFGETVARPVLHEFYEDTNSWIEDTEFLWGPALLITPVLKQGADTVSAYIPDAIWYDYESGAKRPWRKQRVDMYLPADKIGLHLRGGYIIPIQEPDVTTTASRKNPLGLIVALGENNTAKGDFFWDDGETKDTIQNGNYILYTFSVSNNTLDIVCTHSSYQEGTTLAFQTVKILGLTDSVTEVRVAENNQPMNAHSNFTYDASNQVLLIADLKLNLGRNFSVQWNQIFSENERFNCYPDADLATEQKCTQRGCVWRTGSSLSKAPECYFPRQDNSYSVNSARYSSMGITADLQLNTANARIKLPSDPISTLRVEVKYHKNDMLQFKIYDPQKKRYEVPVPLNIPTTPISTYEDRLYDVEIKENPFGIQIRRRSSGRVIWDSWLPGFAFNDQFIQISTRLPSEYIYGFGEVEHTAFKRDLNWNTWGMFTRDQPPGYKLNSYGFHPYYMALEEEGNAHGVFLLNSNAMDVTFQPTPALTYRTVGGILDFYMFLGPTPEVATKQYHEVIGHPVMPAYWALGFQLCRYGYANTSEVRELYDAMVAANIPYDVQYTDIDYMERQLDFTIGEAFQDLPQFVDKIRGEGMRYIIILDPAISGNETKTYPAFERGQQNDVFVKWPNTNDICWAKVWPDLPNITIDKTLTEDEAVNASRAHVAFPDFFRTSTAEWWAREIVDFYNEKMKFDGLWIDMNEPSSFVNGTTTNQCRNDELNYPPYFPELTKRTDGLHFRTICMEAEQILSDGTSVLHYDVHNLYGWSQMKPTHDALQKTTGKRGIVISRSTYPTSGRWGGHWLGDNYARWDNMDKSIIGMMEFSLFGMSYTGADICGFFNNSEYHLCTRWMQLGAFYPYSRNHNIANTRRQDPASWNETFAEMSRNILNIRYTLLPYFYTQMHEIHANGGTVIRPLLHEFFDEKPTWDIFKQFLWGPAFMVTPVLEPYVQTVNAYVPNARWFDYHTGKDIGVRGQFQTFNASYDTINLHVRGGHILPCQEPAQNTFYSRQKHMKLIVAADDNQMAQGSLFWDDGESIDTYERDLYLSVQFNLNQTTLTSTILKRGYINKSETRLGSLHVWGKGTTPVNAVTLTYNGNKNSLPFNEDTTNMILRIDLTTHNVTLEEPIEINWS,1827,NP_001032.2.csv,refseq-SI-NM_001041.3_clinical_seed_0_final,refseq-SI-NM_001041.3.a2m,Invitae,refseq-SI-NM_001041.3.npy,1,1827,1827
+NP_001034302.2,MALTGYSWLLLSATFLNVGAEISITLEPAQPSEGDNVTLVVHGLSGELLAYSWYAGPTLSVSYLVASYIVSTGDETPGPAHTGREAVRPDGSLDIQGILPRHSGTYILQTFNRQLQTEVGYGHVQVHEILAQPTVLANSTALVERRDTLRLMCSSPSPTAEVRWFFNGGALPVALRLGLSPDGRVLARHGIRREEAGAYQCEVWNPVSVSRSEPINLTVYFGPERVAILQDSTTRTGCTIKVDFNTSLTLWCVSRSCPEPEYVWTFNGQALKNGQDHLNISSMTAAQEGTYTCIAKNTKTLLSGSASVVVKLSAAAVATMIVPVPTKPTEGQDVTLTVQGYPKDLLVYAWYRGPASEPNRLLSQLPSGTWIAGPAHTGREVGFPNCSLLVQKLNLTDTGRYTLKTVTVQGKTETLEVELQVAPLG,425,NP_001034302.2.csv,refseq-CEACAM16-NM_001039213.2_clinical_seed_0_final,refseq-CEACAM16-NM_001039213.2.a2m,Invitae,refseq-CEACAM16-NM_001039213.2.npy,1,425,425
+NP_001034437.1,MLKALFLTMLTLALVKSQDTEETITYTQCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGYLCLPKTAQIIVNNEQPQQETQPAEGTSGATTGVVAASSMATSGVLPGGGFVASAAAVAGPEMQTGRNNFVIRRNPADPQRIPSNPSHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQCPPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDASNQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYQVVRSRTCQDINECETTNECREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVCPVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYLRQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF,493,NP_001034437.1.csv,refseq-EFEMP1-NM_001039348.2_clinical_seed_0_final,refseq-EFEMP1-NM_001039348.2.a2m,Invitae,refseq-EFEMP1-NM_001039348.2.npy,1,493,493
+NP_001034639.1,MASYYEILDVPRSASADDIKKAYRRKALQWHPDKNPDNKEFAEKKFKEVAEAYEVLSDKHKREIYDRYGREGLTGTGTGPSRAEAGSGGPGFTFTFRSPEEVFREFFGSGDPFAELFDDLGPFSELQNRGSRHSGPFFTFSSSFPGHSDFSSSSFSFSPGAGAFRSVSTSTTFVQGRRITTRRIMENGQERVEVEEDGQLKSVTINGVPDDLALGLELSRREQQPSVTSRSGGTQVQQTPASCPLDSDLSEDEDLQLAMAYSLSEMEAAGKKPADVF,277,NP_001034639.1.csv,refseq-DNAJB2-NM_001039550.1_clinical_seed_0_final,refseq-DNAJB2-NM_001039550.1.a2m,Invitae,refseq-DNAJB2-NM_001039550.1.npy,1,277,277
+NP_001035.1,MSKSKCSVGLMSSVVAPAKEPNAVGPKEVELILVKEQNGVQLTSSTLTNPRQSPVEAQDRETWGKKIDFLLSVIGFAVDLANVWRFPYLCYKNGGGAFLVPYLLFMVIAGMPLFYMELALGQFNREGAAGVWKICPILKGVGFTVILISLYVGFFYNVIIAWALHYLFSSFTTELPWIHCNNSWNSPNCSDAHPGDSSGDSSGLNDTFGTTPAAEYFERGVLHLHQSHGIDDLGPPRWQLTACLVLVIVLLYFSLWKGVKTSGKVVWITATMPYVVLTALLLRGVTLPGAIDGIRAYLSVDFYRLCEASVWIDAATQVCFSLGVGFGVLIAFSSYNKFTNNCYRDAIVTTSINSLTSFSSGFVVFSFLGYMAQKHSVPIGDVAKDGPGLIFIIYPEAIATLPLSSAWAVVFFIMLLTLGIDSAMGGMESVITGLIDEFQLLHRHRELFTLFIVLATFLLSLFCVTNGGIYVFTLLDHFAAGTSILFGVLIEAIGVAWFYGVGQFSDDIQQMTGQRPSLYWRLCWKLVSPCFLLFVVVVSIVTFRPPHYGAYIFPDWANALGWVIATSSMAMVPIYAAYKFCSLPGSFREKLAYAIAPEKDRELVDRGEVRQFTLRHWLKV,620,NP_001035.1.csv,refseq-SLC6A3-NM_001044.4_clinical_seed_0_final,refseq-SLC6A3-NM_001044.4.a2m,Invitae,refseq-SLC6A3-NM_001044.4.npy,1,620,620
+NP_001035197.1,MIKCLSVEVQAKLRSGLAISSLGQCVEELALNSIDAEAKCVAVRVNMETFQVQVIDNGFGMGSDDVEKVGNRYFTSKCHSVQDLENPRFYGFRGEALANIADMASAVEISSKKNRTMKTFVKLFQSGKALKACEADVTRASAGTTVTVYNLFYQLPVRRKCMDPRLEFEKVRQRIEALSLMHPSISFSLRNDVSGSMVLQLPKTKDVCSRFCQIYGLGKSQKLREISFKYKEFELSGYISSEAHYNKNMQFLFVNKRLVLRTKLHKLIDFLLRKESIICKPKNGPTSRQMNSSLRHRSTPELYGIYVINVQCQFCEYDVCMEPAKTLIEFQNWDTLLFCIQEGVKMFLKQEKLFVELSGEDIKEFSEDNGFSLFDATLQKRVTSDERSNFQEACNNILDSYEMFNLQSKAVKRKTTAENVNTQSSRDSEATRKNTNDAFLYIYESGGPGHSKMTEPSLQNKDSSCSESKMLEQETIVASEAGENEKHKKSFLEHSSLENPCGTSLEMFLSPFQTPCHFEESGQDLEIWKESTTVNGMAANILKNNRIQNQPKRFKDATEVGCQPLPFATTLWGVHSAQTEKEKKKESSNCGRRNVFSYGRVKLCSTGFITHVVQNEKTKSTETEHSFKNYVRPGPTRAQETFGNRTRHSVETPDIKDLASTLSKESGQLPNKKNCRTNISYGLENEPTATYTMFSAFQEGSKKSQTDCILSDTSPSFPWYRHVSNDSRKTDKLIGFSKPIVRKKLSLSSQLGSLEKFKRQYGKVENPLDTEVEESNGVTTNLSLQVEPDILLKDKNRLENSDVCKITTMEHSDSDSSCQPASHILNSEKFPFSKDEDCLEQQMPSLRESPMTLKELSLFNRKPLDLEKSSESLASKLSRLKGSERETQTMGMMSRFNELPNSDSSRKDSKLCSVLTQDFCMLFNNKHEKTENGVIPTSDSATQDNSFNKNSKTHSNSNTTENCVISETPLVLPYNNSKVTGKDSDVLIRASEQQIGSLDSPSGMLMNPVEDATGDQNGICFQSEESKARACSETEESNTCCSDWQRHFDVALGRMVYVNKMTGLSTFIAPTEDIQAACTKDLTTVAVDVVLENGSQYRCQPFRSDLVLPFLPRARAERTVMRQDNRDTVDDTVSSESLQSLFSEWDNPVFARYPEVAVDVSSGQAESLAVKIHNILYPYRFTKGMIHSMQVLQQVDNKFIACLMSTKTEENGEAGGNLLVLVDQHAAHERIRLEQLIIDSYEKQQAQGSGRKKLLSSTLIPPLEITVTEEQRRLLWCYHKNLEDLGLEFVFPDTSDSLVLVGKVPLCFVEREANELRRGRSTVTKSIVEEFIREQLELLQTTGGIQGTLPLTVQKVLASQACHGAIKFNDGLSLQESCRLIEALSSCQLPFQCAHGRPSMLPLADIDHLEQEKQIKPNLTKLRKMAQAWRLFGKAECDTRQSLQQSMPPCEPP,1453,NP_001035197.1.csv,refseq-MLH3-NM_001040108.1_clinical_seed_0_final,refseq-MLH3-NM_001040108.1.a2m,Invitae,refseq-MLH3-NM_001040108.1.npy,1,1453,1453
+NP_001035202.1,MAQKGQLSDDEKFLFVDKNFINSPVAQADWAAKRLVWVPSEKQGFEAASIKEEKGDEVVVELVENGKKVTVGKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLRERYFSGLIYTYSGLFCVVVNPYKHLPIYSEKIVDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAVVASSHKGKKDTSITQGPSFAYGELEKQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVTGYIVGANIETYLLEKSRAIRQARDERTFHIFYYMIAGAKEKMRSDLLLEGFNNYTFLSNGFVPIPAAQDDEMFQETVEAMAIMGFSEEEQLSILKVVSSVLQLGNIVFKKERNTDQASMPDNTAAQKVCHLMGINVTDFTRSILTPRIKVGRDVVQKAQTKEQADFAVEALAKATYERLFRWILTRVNKALDKTHRQGASFLGILDIAGFEIFEVNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIELIERPNNPPGVLALLDEECWFPKATDKSFVEKLCTEQGSHPKFQKPKQLKDKTEFSIIHYAGKVDYNASAWLTKNMDPLNDNVTSLLNASSDKFVADLWKDVDRIVGLDQMAKMTESSLPSASKTKKGMFRTVGQLYKEQLGKLMTTLRNTTPNFVRCIIPNHEKRSGKLDAFLVLEQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILAANAIPKGFMDGKQACILMIKALELDPNLYRIGQSKIFFRTGVLAHLEEERDLKITDVIMAFQAMCRGYLARKAFAKRQQQLTAMKVIQRNCAAYLKLRNWQWWRLFTKVKPLLQVTRQEEEMQAKEDELQKTKERQQKAENELKELEQKHSQLTEEKNLLQEQLQAETELYAEAEEMRVRLAAKKQELEEILHEMEARLEEEEDRGQQLQAERKKMAQQMLDLEEQLEEEEAARQKLQLEKVTAEAKIKKLEDEILVMDDQNNKLSKERKLLEERISDLTTNLAEEEEKAKNLTKLKNKHESMISELEVRLKKEEKSRQELEKLKRKLEGDASDFHEQIADLQAQIAELKMQLAKKEEELQAALARLDDEIAQKNNALKKIRELEGHISDLQEDLDSERAARNKAEKQKRDLGEELEALKTELEDTLDSTATQQELRAKREQEVTVLKKALDEETRSHEAQVQEMRQKHAQAVEELTEQLEQFKRAKANLDKNKQTLEKENADLAGELRVLGQAKQEVEHKKKKLEAQVQELQSKCSDGERARAELNDKVHKLQNEVESVTGMLNEAEGKAIKLAKDVASLSSQLQDTQELLQEETRQKLNVSTKLRQLEEERNSLQDQLDEEMEAKQNLERHISTLNIQLSDSKKKLQDFASTVEALEEGKKRFQKEIENLTQQYEEKAAAYDKLEKTKNRLQQELDDLVVDLDNQRQLVSNLEKKQRKFDQLLAEEKNISSKYADERDRAEAEAREKETKALSLARALEEALEAKEELERTNKMLKAEMEDLVSSKDDVGKNVHELEKSKRALETQMEEMKTQLEELEDELQATEDAKLRLEVNMQALKGQFERDLQARDEQNEEKRRQLQRQLHEYETELEDERKQRALAAAAKKKLEGDLKDLELQADSAIKGREEAIKQLRKLQAQMKDFQRELEDARASRDEIFATAKENEKKAKSLEADLMQLQEDLAAAERARKQADLEKEELAEELASSLSGRNALQDEKRRLEARIAQLEEELEEEQGNMEAMSDRVRKATQQAEQLSNELATERSTAQKNESARQQLERQNKELRSKLHEMEGAVKSKFKSTIAALEAKIAQLEEQVEQEAREKQAATKSLKQKDKKLKEILLQVEDERKMAEQYKEQAEKGNARVKQLKRQLEEAEEESQRINANRRKLQRELDEATESNEAMGREVNALKSKLRGPPPQETSQ,1945,NP_001035202.1.csv,refseq-MYH11-NM_001040113.1_clinical_seed_0_final,refseq-MYH11-NM_001040113.1.a2m,Invitae,refseq-MYH11-NM_001040113.1.npy,1,1945,1945
+NP_001035526.1,MAAPILRSFSWGRWSGTLNLSVLLPLGLRKAHSGAQGLLAAQKARGLFKDFFPETGTKIELPELFDRGTASFPQTIYCGFDPTADSLHVGHLLALLGLFHLQRAGHNVIALVGGATARLGDPSGRTKEREALETERVRANARALRLGLEALAANHQQLFTDGRSWGSFTVLDNSAWYQKQHLVDFLAAVGGHFRMGTLLSRQSVQLRLKSPEGMSLAEFFYQVLQAYDFYYLFQRYGCRVQLGGSDQLGNIMSGYEFINKLTGEDVFGITVPLITSTTGAKLGKSAGNAVWLNRDKTSPFELYQFFVRQPDDSVERYLKLFTFLPLPEIDHIMQLHVKEPERRGPQKRLAAEVTKLVHGREGLDSAKRCTQALYHSSIDALEVMSDQELKELFKEAPFSEFFLDPGTSVLDTCRKANAIPDGPRGYRMITEGGVSINHQQVTNPESVLIVGQHILKNGLSLLKIGKRNFYIIKWLQL,477,NP_001035526.1.csv,refseq-YARS2-NM_001040436.2_clinical_seed_0_final,refseq-YARS2-NM_001040436.2.a2m,Invitae,refseq-YARS2-NM_001040436.2.npy,1,477,477
+NP_001035835.1,MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTPKTRREAEDLQASALSLSSSTSTWPEGLDATARAPPALVVTANIGQAGGSSSRQFRQRALGTSDSPVLFIHCPGAAGTAQGLEYRGRRVTTELVWEEVDSSPQPQGSESLPAQPPAQPAPQPEPQQAREPSPEVSCCGLWPRRPQRSQN,200,NP_001035835.1.csv,refseq-INS-IGF2-NM_001042376.3_clinical_seed_0_final,refseq-INS-IGF2-NM_001042376.3.a2m,Invitae,refseq-INS-IGF2-NM_001042376.3_theta_0.2.npy,1,200,200
+NP_001035897.1,MGGCAGSRRRFSDSEGEETVPEPRLPLLDHQGAHWKNAVGFWLLGLCNNFSYVVMLSAAHDILSHKRTSGNQSHVDPGPTPIPHNSSSRFDCNSVSTAAVLLADILPTLVIKLLAPLGLHLLPYSPRVLVSGICAAGSFVLVAFSHSVGTSLCGVVFASISSGLGEVTFLSLTAFYPRAVISWWSSGTGGAGLLGALSYLGLTQAGLSPQQTLLSMLGIPALLLASYFLLLTSPEAQDPGGEEEAESAARQPLIRTEAPESKPGSSSSLSLRERWTVFKGLLWYIVPLVVVYFAEYFINQGLFELLFFWNTSLSHAQQYRWYQMLYQAGVFASRSSLRCCRIRFTWALALLQCLNLVFLLADVWFGFLPSIYLVFLIILYEGLLGGAAYVNTFHNIALETSDEHREFAMAATCISDTLGISLSGLLALPLHDFLCQLS,438,NP_001035897.1.csv,refseq-CLN3-NM_001042432.1_clinical_seed_0_final,refseq-CLN3-NM_001042432.1.a2m,Invitae,refseq-CLN3-NM_001042432.1.npy,1,438,438
+NP_001035937.1,MRKRTEPVALEHERCAAAGSSSSGSAAAALDADCRLKQNLRLTGPAAAEPRCAADAGMKRALGRRKGVWLRLRKILFCVLGLYIAIPFLIKLCPGIQAKLIFLNFVRVPYFIDLKKPQDQGLNHTCNYYLQPEEDVTIGVWHTVPAVWWKNAQGKDQMWYEDALASSHPIILYLHGNAGTRGGDHRVELYKVLSSLGYHVVTFDYRGWGDSVGTPSERGMTYDALHVFDWIKARSGDNPVYIWGHSLGTGVATNLVRRLCERETPPDALILESPFTNIREEAKSHPFSVIYRYFPGFDWFFLDPITSSGIKFANDENVKHISCPLLILHAEDDPVVPFQLGRKLYSIAAPARSFRDFKVQFVPFHSDLGYRHKYIYKSPELPRILREFLGKSEPEHQH,398,NP_001035937.1.csv,refseq-ABHD12-NM_001042472.2_clinical_seed_0_final,refseq-ABHD12-NM_001042472.2.a2m,Invitae,refseq-ABHD12-NM_001042472.2.npy,1,398,398
+NP_001035940.1,MWGRFLAPEASGRDSPGGARSFPAGPDYSSAWLPANESLWQATTVPSNHRNNHIRRHSIASDSGDTGIGTSCSDSVEDHSTSSGTLSFKPSQSLITLPTAHVMPSNSSASISKLRESLTPDGSKWSTSLMQTLGNHSRGEQDSSLDMKDFRPLRKWSSLSKLTAPDNCGQGGTVCREESRNGLEKIGKAKALTSQLRTIGPSCLHDSMEMLRLEDKEINKKRSSTLDCKYKFESCSKEDFRASSSTLRRQPVDMTYSALPESKPIMTSSEAFEPPKYLMLGQQAVGGVPIQPSVRTQMWLTEQLRTNPLEGRNTEDSYSLAPWQQQQIEDFRQGSETPMQVLTGSSRQSYSPGYQDFSKWESMLKIKEGLLRQKEIVIDRQKQQITHLHERIRDNELRAQHAMLGHYVNCEDSYVASLQPQYENTSLQTPFSEESVSHSQQGEFEQKLASTEKEVLQLNEFLKQRLSLFSEEKKKLEEKLKTRDRYISSLKKKCQKESEQNKEKQRRIETLEKYLADLPTLDDVQSQSLQLQILEEKNKNLQEALIDTEKKLEEIKKQCQDKETQLICQKKKEKELVTTVQSLQQKVERCLEDGIRLPMLDAKQLQNENDNLRQQNETASKIIDSQQDEIDRMILEIQSMQGKLSKEKLTTQKMMEELEKKERNVQRLTKALLENQRQTDETCSLLDQGQEPDQSRQQTVLSKRPLFDLTVIDQLFKEMSCCLFDLKALCSILNQRAQGKEPNLSLLLGIRSMNCSAEETENDHSTETLTKKLSDVCQLRRDIDELRTTISDRYAQDMGDNCITQ,805,NP_001035940.1.csv,refseq-CEP85L-NM_001042475.2_clinical_seed_0_final,refseq-CEP85L-NM_001042475.2.a2m,Invitae,refseq-CEP85L-NM_001042475.2_theta_0.2.npy,1,805,805
+NP_001035963.1,MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVDPLFALGAGLVIGAVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDLITEPFLPKSVLVK,393,NP_001035963.1.csv,refseq-SLC35A2-NM_001042498.2_clinical_seed_0_final,refseq-SLC35A2-NM_001042498.2.a2m,Invitae,refseq-SLC35A2-NM_001042498.2.npy,1,393,393
+NP_001036068.1,MAGVGPGGYAAEFVPPPECPVFEPSWEEFTDPLSFIGRIRPLAEKTGICKIRPPKDWQPPFACEVKSFRFTPRVQRLNELEAMTRVRLDFLDQLAKFWELQGSTLKIPVVERKILDLYALSKIVASKGGFEMVTKEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGVSLMGVQMPNLDLKEKVEPEVLSTDTQTSPEPGTRMNILPKRTRRVKTQSESGDVSRNTELKKLQIFGAGPKVVGLAMGTKDKEDEVTRRRKVTNRSDAFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKGDWRCPKCVAEECSKPREAFGFEQAVREYTLQSFGEMADNFKSDYFNMPVHMVPTELVEKEFWRLVSSIEEDVIVEYGADISSKDFGSGFPVKDGRRKILPEEEEYALSGWNLNNMPVLEQSVLAHINVDISGMKVPWLYVGMCFSSFCWHIEDHWSYSINYLHWGEPKTWYGVPSHAAEQLEEVMRELAPELFESQPDLLHQLVTIMNPNVLMEHGVPVYRTNQCAGEFVVTFPRAYHSGFNQGYNFAEAVNFCTADWLPIGRQCVNHYRRLRRHCVFSHEELIFKMAADPECLDVGLAAMVCKELTLMTEEETRLRESVVQMGVLMSEEEVFELVPDDERQCSACRTTCFLSALTCSCNPERLVCLYHPTDLCPCPMQKKCLRYRYPLEDLPSLLYGVKVRAQSYDTWVSRVTEALSANFNHKKDLIELRVMLEDAEDRKYPENDLFRKLRDAVKEAETCASVAQLLLSKKQKHRQSPDSGRTRTKLTVEELKAFVQQLFSLPCVISQARQVKNLLDDVEEFHERAQEAMMDETPDSSKLQMLIDMGSSLYVELPELPRLKQELQQARWLDEVRLTLSDPQQVTLDVMKKLIDSGVGLAPHHAVEKAMAELQELLTVSERWEEKAKVCLQARPRHSVASLESIVNEAKNIPAFLPNVLSLKEALQKAREWTAKVEAIQSGSNYAYLEQLESLSAKGRPIPVRLEALPQVESQVAAARAWRERTGRTFLKKNSSHTLLQVLSPRTDIGVYGSGKNRRKKVKELIEKEKEKDLDLEPLSDLEEGLEETRDTAMVVAVFKEREQKEIEAMHSLRAANLAKMTMVDRIEEVKFCICRKTASGFMLQCELCKDWFHNSCVPLPKSSSQKKGSSWQAKEVKFLCPLCMRSRRPRLETILSLLVSLQKLPVRLPEGEALQCLTERAMSWQDRARQALATDELSSALAKLSVLSQRMVEQAAREKTEKIISAELQKAAANPDLQGHLPSFQQSAFNRVVSSVSSSPRQTMDYDDEETDSDEDIRETYGYDMKDTASVKSSSSLEPNLFCDEEIPIKSEEVVTHMWTAPSFCAEHAYSSASKSCSQGSSTPRKQPRKSPLVPRSLEPPVLELSPGAKAQLEELMMVGDLLEVSLDETQHIWRILQATHPPSEDRFLHIMEDDSMEEKPLKVKGKDSSEKKRKRKLEKVEQLFGEGKQKSKELKKMDKPRKKKLKLGADKSKELNKLAKKLAKEEERKKKKEKAAAAKVELVKESTEKKREKKVLDIPSKYDWSGAEESDDENAVCAAQNCQRPCKDKVDWVQCDGGCDEWFHQVCVGVSPEMAENEDYICINCAKKQGPVSPGPAPPPSFIMSYKLPMEDLKETS,1690,NP_001036068.1.csv,refseq-KDM5A-NM_001042603.2_clinical_seed_0_final,refseq-KDM5A-NM_001042603.2.a2m,Invitae,refseq-KDM5A-NM_001042603.2_theta_0.2.npy,1,1690,1690
+NP_001036167.1,MFAAATKSFVKQVGDGGRLVPVPSLSEADKYQPLSLVVKKKRCFLFPRYKFTSTPFTLKDILLGDREISAGISSYQLLNYEDESDVSLYGRRGNHIVNDVGINVAGSDSIAVKASFGIVTKHEVEVSTLLKEITTRKINFDHSLIRQSRSSRKAVLCVVMESIRTTRQCSLSVHAGIRGEAMRFHFMDEQNPKGRDKAIVFPAHTTIAFSVFELFIYLDGAFDLCVTSVSKGGFEREETATFALLYRLRNILFERNRRVMDVISRSQLYLDDLFSDYYDKPLSMTDISLKEGTHIRVNLLNHNIPKGPCILCGMGNFKRETVYGCFQCSVDGQKYVRLHAVPCFDIWHKRMK,352,NP_001036167.1.csv,refseq-DFNB59-NM_001042702.3_clinical_seed_0_final,refseq-DFNB59-NM_001042702.3.a2m,Invitae,refseq-DFNB59-NM_001042702.3.npy,1,352,352
+NP_001036214.1,MIAAPEIPTDFNLLQESETHFSSDTDFEDIEGKNQKQGKGKTCKKGKKGPAEKGKGGNGGGKPPSGPNRMNGHHQQNGVENMMLFEVVKMGKSAMQSVVDDWIESYKHDRDIALLDLINFFIQCSGCKGVVTAEMFRHMQNSEIIRKMTEEFDEDSGDYPLTMAGPQWKKFKSSFCEFIGVLVRQCQYSIIYDEYMMDTVISLLTGLSDSQVRAFRHTSTLAAMKLMTALVNVALNLSINMDNTQRQYEAERNKMIGKRANERLELLLQKRKELQENQDEIENMMNAIFKGVFVHRYRDAIAEIRAICIEEIGIWMKMYSDAFLNDSYLKYVGWTMHDKQGEVRLKCLTALQGLYYNKELNSKLELFTSRFKDRIVSMTLDKEYDVAVQAIKLLTLVLQSSEEVLTAEDCENVYHLVYSAHRPVAVAAGEFLYKKLFSRRDPEEDGMMKRRGRQGPNANLVKTLVFFFLESELHEHAAYLVDSMWDCATELLKDWECMNSLLLEEPLSGEEALTDRQESALIEIMLCTIRQAAECHPPVGRGTGKRVLTAKEKKTQLDDRTKITELFAVALPQLLAKYSVDAEKVTNLLQLPQYFDLEIYTTGRLEKHLDALLRQIRNIVEKHTDTDVLEACSKTYHALCNEEFTIFNRVDISRSQLIDELADKFNRLLEDFLQEGEEPDEDDAYQVLSTLKRITAFHNAHDLSKWDLFACNYKLLKTGIENGDMPEQIVIHALQCTHYVILWQLAKITESSSTKEDLLRLKKQMRVFCQICQHYLTNVNTTVKEQAFTILCDILMIFSHQIMSGGRDMLEPLVYTPDSSLQSELLSFILDHVFIEQDDDNNSADGQQEDEASKIEALHKRRNLLAAFCKLIVYTVVEMNTAADIFKQYMKYYNDYGDIIKETMSKTRQIDKIQCAKTLILSLQQLFNEMIQENGYNFDRSSSTFSGIKELARRFALTFGLDQLKTREAIAMLHKDGIEFAFKEPNPQGESHPPLNLAFLDILSEFSSKLLRQDKRTVYVYLEKFMTFQMSLRREDVWLPLMSYRNSLLAGGDDDTMSVISGISSRGSTVRSKKSKPSTGKRKVVEGMQLSLTEESSSSDSMWLSREQTLHTPVMMQTPQLTSTIMREPKRLRPEDSFMSVYPMQTEHHQTPLDYNTQVTWMLAQRQQEEARQQQERAAMSYVKLRTNLQHAIRRGTSLMEDDEEPIVEDVMMSSEGRIEDLNEGMDFDTMDIDLPPSKNRRERTELKPDFFDPASIMDESVLGVSMF,1268,NP_001036214.1.csv,refseq-STAG2-NM_001042749.2_clinical_seed_0_final,refseq-STAG2-NM_001042749.2.a2m,Invitae,refseq-STAG2-NM_001042749.2.npy,1,1268,1268
+NP_001037.1,MEPRPTAPSSGAPGLAGVGETPSAAALAAARVELPGTAVPSVPEDAAPASRDGGGVRDEGPAAAGDGLGRPLGPTPSQSRFQVDLVSENAGRAAAAAAAAAAAAAAAGAGAGAKQTPADGEASGESEPAKGSEEAKGRFRVNFVDPAASSSAEDSLSDAAGVGVDGPNVSFQNGGDTVLSEGSSLHSGGGGGSGHHQHYYYDTHTNTYYLRTFGHNTMDAVPRIDHYRHTAAQLGEKLLRPSLAELHDELEKEPFEDGFANGEESTPTRDAVVTYTAESKGVVKFGWIKGVLVRCMLNIWGVMLFIRLSWIVGQAGIGLSVLVIMMATVVTTITGLSTSAIATNGFVRGGGAYYLISRSLGPEFGGAIGLIFAFANAVAVAMYVVGFAETVVELLKEHSILMIDEINDIRIIGAITVVILLGISVAGMEWEAKAQIVLLVILLLAIGDFVIGTFIPLESKKPKGFFGYKSEIFNENFGPDFREEETFFSVFAIFFPAATGILAGANISGDLADPQSAIPKGTLLAILITTLVYVGIAVSVGSCVVRDATGNVNDTIVTELTNCTSAACKLNFDFSSCESSPCSYGLMNNFQVMSMVSGFTPLISAGIFSATLSSALASLVSAPKIFQALCKDNIYPAFQMFAKGYGKNNEPLRGYILTFLIALGFILIAELNVIAPIISNFFLASYALINFSVFHASLAKSPGWRPAFKYYNMWISLLGAILCCIVMFVINWWAALLTYVIVLGLYIYVTYKKPDVNWGSSTQALTYLNALQHSIRLSGVEDHVKNFRPQCLVMTGAPNSRPALLHLVHDFTKNVGLMICGHVHMGPRRQAMKEMSIDQAKYQRWLIKNKMKAFYAPVHADDLREGAQYLMQAAGLGRMKPNTLVLGFKKDWLQADMRDVDMYINLFHDAFDIQYGVVVIRLKEGLDISHLQGQEELLSSQEKSPGTKDVVVSVEYSKKSDLDTSKPLSEKPITHKVEEEDGKTATQPLLKKESKGPIVPLNVADQKLLEASTQFQKKQGKNTIDVWWLFDDGGLTLLIPYLLTTKKKWKDCKIRVFIGGKINRIDHDRRAMATLLSKFRIDFSDIMVLGDINTKPKKENIIAFEEIIEPYRLHEDDKEQDIADKMKEDEPWRITDNELELYKTKTYRQIRLNELLKEHSSTANIIVMSLPVARKGAVSSALYMAWLEALSKDLPPILLVRGNHQSVLTFYS,1212,NP_001037.1.csv,refseq-SLC12A2-NM_001046.2_clinical_seed_0_final,refseq-SLC12A2-NM_001046.2.a2m,Invitae,refseq-SLC12A2-NM_001046.2_theta_0.2.npy,1,1212,1212
+NP_001037850.1,MRTDSGARLEEGHLRPPRALPPVPSQDDIPLSRPKKKKPRTKNTPASASLEGLAQTAGRRPSEGNEPSTKELKEHPEAPVQRRQKKTRLPLELETSSTQKKSSSSSLLRNENGIDAEPAEEAVIQKPRRKTKKTQPAELQYANELGVEDEDIITDEQTTVEQQSVFTAPTGISQPVGKVFVEKSRRFQAADRSELIKTTENIDVSMDVKPSWTTRDVALTVHRAFRMIGLFSHGFLAGCAVWNIVVIYVLAGDQLSNLSNLLQQYKTLAYPFQSLLYLLLALSTISAFDRIDFAKISVAIRNFLALDPTALASFLYFTALILSLSQQMTSDRIHLYTPSSVNGSLWEAGIEEQILQPWIVVNLVVALLVGLSWLFLSYRPGMDLSEELMFSSEVEEYPDKEKEIKASS,408,NP_001037850.1.csv,refseq-TMEM237-NM_001044385.2_clinical_seed_0_final,refseq-TMEM237-NM_001044385.2.a2m,Invitae,refseq-TMEM237-NM_001044385.2.npy,1,408,408
+NP_001041629.1,MPWQAFRRFGQKLVRRRTLESGMAETRLARCLSTLDLVALGVGSTLGAGVYVLAGEVAKDKAGPSIVICFLVAALSSVLAGLCYAEFGARVPRSGSAYLYSYVTVGELWAFTTGWNLILSYVIGTASVARAWSSAFDNLIGNHISKTLQGSIALHVPHVLAEYPDFFALGLVLLLTGLLALGASESALVTKVFTGVNLLVLGFVMISGFVKGDVHNWKLTEEDYELAMAELNDTYSLGPLGSGGFVPFGFEGILRGAATCFYAFVGFDCIATTGEEAQNPQRSIPMGIVISLSVCFLAYFAVSSALTLMMPYYQLQPESPLPEAFLYIGWAPARYVVAVGSLCALSTSLLGSMFPMPRVIYAMAEDGLLFRVLARIHTGTRTPIIATVVSGIIAAFMAFLFKLTDLVDLMSIGTLLAYSLVSICVLILRYQPDQETKTGEEVELQEEAITTESEKLTLWGLFFPLNSIPTPLSGQIVYVCSSLLAVLLTALCLVLAQWSVPLLSGDLLWTAVVVLLLLLIIGIIVVIWRQPQSSTPLHFKVPALPLLPLMSIFVNIYLMMQMTAGTWARFGVWMLIGFAIYFGYGIQHSLEEIKSNQPSRKSRAKTVDLDPGTLYVHSV,619,NP_001041629.1.csv,refseq-SLC7A3-NM_001048164.2_clinical_seed_0_final,refseq-SLC7A3-NM_001048164.2.a2m,Invitae,refseq-SLC7A3-NM_001048164.2.npy,1,619,619
+NP_001041675.1,MLCSLLLCECLLLVAGYAHDDDWIDPTDMLNYDAASGTMRKSQAKYGISGEKDVSPDLSCADEISECYHKLDSLTYKIDECEKKKREDYESQSNPVFRRYLNKILIEAGKLGLPDENKGDMHYDAEIILKRETLLEIQKFLNGEDWKPGALDDALSDILINFKFHDFETWKWRFEDSFGVDPYNVLMVLLCLLCIVVLVATELWTYVRWYTQLRRVLIISFLFSLGWNWMYLYKLAFAQHQAEVAKMEPLNNVCAKKMDWTGSIWEWFRSSWTYKDDPCQKYYELLLVNPIWLVPPTKALAVTFTTFVTEPLKHIGKGTGEFIKALMKEIPALLHLPVLIIMALAILSFCYGAGKSVHVLRHIGGPESEPPQALRPRDRRRQEEIDYRPDGGAGDADFHYRGQMGPTEQGPYAKTYEGRREILRERDVDLRFQTGNKSPEVLRAFDVPDAEAREHPTVVPSHKSPVLDTKPKETGGILGEGTPKESSTESSQSAKPVSGQDTSGNTEGSPAAEKAQLKSEAAGSPDQGSTYSPARGVAGPRGQDPVSSPCG,551,NP_001041675.1.csv,refseq-CLCC1-NM_001048210.2_clinical_seed_0_final,refseq-CLCC1-NM_001048210.2.a2m,Invitae,refseq-CLCC1-NM_001048210.2.npy,1,551,551
+NP_001050.1,MATLPAAETWIDGGGGVGADAVNLTASLAAGAATGAVETGWLQLLDQAGNLSSSPSALGLPVASPAPSQPWANLTNQFVQPSWRIALWSLAYGVVVAVAVLGNLIVIWIILAHKRMRTVTNYFLVNLAFSDASMAAFNTLVNFIYALHSEWYFGANYCRFQNFFPITAVFASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQCLYSKTKVMPGRTLCFVQWPEGPKQHFTYHIIVIILVYCFPLLIMGITYTIVGITLWGGEIPGDTCDKYHEQLKAKRKVVKMMIIVVMTFAICWLPYHIYFILTAIYQQLNRWKYIQQVYLASFWLAMSSTMYNPIIYCCLNKRFRAGFKRAFRWCPFIKVSSYDELELKTTRFHPNRQSSMYTVTRMESMTVVFDPNDADTTRSSRKKRATPRDPSFNGCSRRNSKSASATSSFISSPYTSVDEYS,465,NP_001050.1.csv,refseq-TACR3-NM_001059.2_clinical_seed_0_final,refseq-TACR3-NM_001059.2.a2m,Invitae,refseq-TACR3-NM_001059.2.npy,1,465,465
+NP_001051.1,MWPNGSSLGPCFRPTNITLEERRLIASPWFAASFCVVGLASNLLALSVLAGARQGGSHTRSSFLTFLCGLVLTDFLGLLVTGTIVVSQHAALFEWHAVDPGCRLCRFMGVVMIFFGLSPLLLGAAMASERYLGITRPFSRPAVASQRRAWATVGLVWAAALALGLLPLLGVGRYTVQYPGSWCFLTLGAESGDVAFGLLFSMLGGLSVGLSFLLNTVSVATLCHVYHGQEAAQQRPRDSEVEMMAQLLGIMVVASVCWLPLLVFIAQTVLRNPPAMSPAGQLSRTTEKELLIYLRVATWNQILDPWVYILFRRAVLRRLQPRLSTRPRSLSLQPQLTQRSGLQ,343,NP_001051.1.csv,refseq-TBXA2R-NM_001060.5_clinical_seed_0_final,refseq-TBXA2R-NM_001060.5.a2m,Invitae,refseq-TBXA2R-NM_001060.5.npy,1,343,343
+NP_001056.1,MGLSTVPDLLLPLVLLELLVGIYPSGVIGLVPHLGDREKRDSVCPQGKYIHPQNNSICCTKCHKGTYLYNDCPGPGQDTDCRECESGSFTASENHLRHCLSCSKCRKEMGQVEISSCTVDRDTVCGCRKNQYRHYWSENLFQCFNCSLCLNGTVHLSCQEKQNTVCTCHAGFFLRENECVSCSNCKKSLECTKLCLPQIENVKGTEDSGTTVLLPLVIFFGLCLLSLLFIGLMYRYQRWKSKLYSIVCGKSTPEKEGELEGTTTKPLAPNPSFSPTPGFTPTLGFSPVPSSTFTSSSTYTPGDCPNFAAPRREVAPPYQGADPILATALASDPIPNPLQKWEDSAHKPQSLDTDDPATLYAVVENVPPLRWKEFVRRLGLSDHEIDRLELQNGRCLREAQYSMLATWRRRTPRREATLELLGRVLRDMDLLGCLEDIEEALCGPAALPPAPSLLR,455,NP_001056.1.csv,refseq-TNFRSF1A-NM_001065.3_clinical_seed_0_final,refseq-TNFRSF1A-NM_001065.3.a2m,Invitae,refseq-TNFRSF1A-NM_001065.3.npy,1,455,455
+NP_001060.1,MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGSYHGDSDLQLERINVYYNEAAGNKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKESESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVMPSPKVSDTVVEPYNATLSVHQLVENTDETYSIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDSKNMMAACDPRHGRYLTVAAIFRGRMSMKEVDEQMLNVQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMSATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATADEQGEFEEEEGEDEA,445,NP_001060.1.csv,refseq-TUBB2A-NM_001069.2_clinical_seed_0_final,refseq-TUBB2A-NM_001069.2.a2m,Invitae,refseq-TUBB2A-NM_001069.2.npy,1,445,445
+NP_001061.2,MPREIITLQLGQCGNQIGFEFWKQLCAEHGISPEGIVEEFATEGTDRKDVFFYQADDEHYIPRAVLLDLEPRVIHSILNSPYAKLYNPENIYLSEHGGGAGNNWASGFSQGEKIHEDIFDIIDREADGSDSLEGFVLCHSIAGGTGSGLGSYLLERLNDRYPKKLVQTYSVFPNQDEMSDVVVQPYNSLLTLKRLTQNADCVVVLDNTALNRIATDRLHIQNPSFSQINQLVSTIMSASTTTLRYPGYMNNDLIGLIASLIPTPRLHFLMTGYTPLTTDQSVASVRKTTVLDVMRRLLQPKNVMVSTGRDRQTNHCYIAILNIIQGEVDPTQVHKSLQRIRERKLANFIPWGPASIQVALSRKSPYLPSAHRVSGLMMANHTSISSLFERTCRQYDKLRKREAFLEQFRKEDMFKDNFDEMDTSREIVQQLIDEYHAATRPDYISWGTQEQ,451,NP_001061.2.csv,refseq-TUBG1-NM_001070.4_clinical_seed_0_final,refseq-TUBG1-NM_001070.4.a2m,Invitae,refseq-TUBG1-NM_001070.4.npy,1,451,451
+NP_001063.2,MACLLRSFQRISAGVFFLALWGMVVGDKLLVVPQDGSHWLSMKDIVEVLSDRGHEIVVVVPEVNLLLKESKYYTRKIYPVPYDQEELKNRYQSFGNNHFAERSFLTAPQTEYRNNMIVIGLYFINCQSLLQDRDTLNFFKESKFDALFTDPALPCGVILAEYLGLPSVYLFRGFPCSLEHTFSRSPDPVSYIPRCYTKFSDHMTFSQRVANFLVNLLEPYLFYCLFSKYEELASAVLKRDVDIITLYQKVSVWLLRYDFVLEYPRPVMPNMVFIGGINCKKRKDLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,532,NP_001063.2.csv,refseq-UGT1A6-NM_001072.3_clinical_seed_0_final,refseq-UGT1A6-NM_001072.3.a2m,Invitae,refseq-UGT1A6-NM_001072.3.npy,1,532,532
+NP_001070.2,MPDPAAHLPFFYGSISRAEAEEHLKLAGMADGLFLLRQCLRSLGGYVLSLVHDVRFHHFPIERQLNGTYAIAGGKAHCGPAELCEFYSRDPDGLPCNLRKPCNRPSGLEPQPGVFDCLRDAMVRDYVRQTWKLEGEALEQAIISQAPQVEKLIATTAHERMPWYHSSLTREEAERKLYSGAQTDGKFLLRPRKEQGTYALSLIYGKTVYHYLISQDKAGKYCIPEGTKFDTLWQLVEYLKLKADGLIYCLKEACPNSSASNASGAAAPTLPAHPSTLTHPQRRIDTLNSDGYTPEPARITSPDKPRPMPMDTSVYESPYSDPEELKDKKLFLKRDNLLIADIELGCGNFGSVRQGVYRMRKKQIDVAIKVLKQGTEKADTEEMMREAQIMHQLDNPYIVRLIGVCQAEALMLVMEMAGGGPLHKFLVGKREEIPVSNVAELLHQVSMGMKYLEEKNFVHRDLAARNVLLVNRHYAKISDFGLSKALGADDSYYTARSAGKWPLKWYAPECINFRKFSSRSDVWSYGVTMWEALSYGQKPYKKMKGPEVMAFIEQGKRMECPPECPPELYALMSDCWIYKWEDRPDFLTVEQRMRACYYSLASKVEGPPGSTQKAEAACA,619,NP_001070.2.csv,refseq-ZAP70-NM_001079.3_clinical_seed_0_final,refseq-ZAP70-NM_001079.3.a2m,Invitae,refseq-ZAP70-NM_001079.3.npy,1,619,619
+NP_001070712.1,MRIISRQIVLLFSGFWGLAMGAFPSSVQIGGLFIRNTDQEYTAFRLAIFLHNTSPNASEAPFNLVPHVDNIETANSFAVTNAFCSQYSRGVFAIFGLYDKRSVHTLTSFCSALHISLITPSFPTEGESQFVLQLRPSLRGALLSLLDHYEWNCFVFLYDTDRGYSILQAIMEKAGQNGWHVSAICVENFNDVSYRQLLEELDRRQEKKFVIDCEIERLQNILEQIVSVGKHVKGYHYIIANLGFKDISLERFIHGGANVTGFQLVDFNTPMVIKLMDRWKKLDQREYPGSETPPKYTSALTYDGVLVMAETFRSLRRQKIDISRRGNAGDCLANPAAPWGQGIDMERTLKQVRIQGLTGNVQFDHYGRRVNYTMDVFELKSTGPRKVGYWNDMDKLVLIQDVPTLGNDTAAIENRTVVVTTIMPLMKNPILRN,433,NP_001070712.1.csv,refseq-GRIA4-NM_001077244.2_clinical_seed_0_final,refseq-GRIA4-NM_001077244.2.a2m,Invitae,refseq-GRIA4-NM_001077244.2.npy,1,433,433
+NP_001070834.1,MKQIFFLDDSGPPFGHMVLALGGYLGGFDGNFLWNRIGAEYSSNVPVWSLRLLPALAGALSVPMAYQIVLELHFSHCAAMGAALLMLIENALITQSRLMLLESVLIFFNLLAVLSYLKFFNCQKHSPFSLSWWFWLTLTGVACSCAVGIKYMGVFTYVLVLGVAAVHAWHLLGDQTLSNVCVFCHLLARAVALLVIPVVLYLLFFYVHLILVFRSGPHDQIMSSAFQASLEGGLARITQGQPLEVAFGSQVTLRNVFGKPVPCWLHSHQDTYPMIYENGRGSSHQQQVTCYPFKDVNNWWIVKDPRRHQLVVSSPPRPVRHGDMVQLVHGMTTRSLNTHDVAAPLSPHSQEVSCYIDYNISMPAQNLWRLEIVNRGSDTDVWKTILSEVRFVHVNTSAVLKLSGAHLPDWGYRQLEIVGEKLSRGYHGSTVWNVEEHRYGASQEQRERERELHSPAQVDVSRNLSFMARFSELQWRMLALRSDDSEHKYSSSPLEWVTLDTNIAYWLHPRTSAQIHLLGNIVIWVSGSLALAIYALLSLWYLLRRRRNVHDLPQDAWLRWVLAGALCAGGWAVNYLPFFLMEKTLFLYHYLPALTFQILLLPVVLQHISDHLCRSQLQRSIFSALVVAWYSSACHVSNTLRPLTYGDKSLSPHELKALRWKDSWDILIRKH,671,NP_001070834.1.csv,refseq-POMT1-NM_001077366.2_clinical_seed_0_final,refseq-POMT1-NM_001077366.2.a2m,Invitae,refseq-POMT1-NM_001077366.2.npy,1,671,671
+NP_001070884.2,MATRRSQTWSPGSRSACERCSWRSMSSSLTRSSAVTARGSAPKPRCSCCWPLRSRTSRRCWWPSGATVSLPRPLCHEAPRARSARAGLPNRLPTALFNSGFWLKRSSYEEQPTVRFQHQVLLVALLGPESDGFLAWSTFPAFNRLQGDRLRVPLVSTREEDRNQDGKTDMLHFKLELPLQSTEHVLGVQLILTFSYRLHRMATLVMQSMAFLQSSFPVPGSQLYVNGDLRLQQKQPLSCGGLDARYNISVINGTSPFAYDYDLTHIVAAYQERNVTTVLNDPNPIWLVGRAADAPFVINAIIRYPVEVISYQPGFWEMVKFAWVQYVSILLIFLWVFERIKIFVFQNQVVTTIPVTVTPRGDLCKEHLS,369,NP_001070884.2.csv,refseq-TMEM231-NM_001077416.2_clinical_seed_0_final,refseq-TMEM231-NM_001077416.2.a2m,Invitae,refseq-TMEM231-NM_001077416.2.npy,1,369,369
+NP_001070984.1,MARGLGAPHWVAVGLLTWATLGLLVAGLGGHDDLHDDLQEDFHGHSHRHSHEDFHHGHSHAHGHGHTHESIWHGHTHDHDHGHSHEDLHHGHSHGYSHESLYHRGHGHDHEHSHGGYGESGAPGIKQDLDAVTLWAYALGATVLISAAPFFVLFLIPVESNSPRHRSLLQILLSFASGGLLGDAFLHLIPHALEPHSHHTLEQPGHGHSHSGQGPILSVGLWVLSGIVAFLVVEKFVRHVKGGHGHSHGHGHAHSHTRGSHGHGRQERSTKEKQSSEEEEKETRGVQKRRGGSTVPKDGPVRPQNAEEEKRGLDLRVSGYLNLAADLAHNFTDGLAIGASFRGGRGLGILTTMTVLLHEVPHEVGDFAILVQSGCSKKQAMRLQLLTAVGALAGTACALLTEGGAVGSEIAGGAGPGWVLPFTAGGFIYVATVSVLPELLREASPLQSLLEVLGLLGGVIMMVLIAHLE,469,NP_001070984.1.csv,refseq-SLC39A7-NM_001077516.1_clinical_seed_0_final,refseq-SLC39A7-NM_001077516.1.a2m,Invitae,refseq-SLC39A7-NM_001077516.1.npy,1,469,469
+NP_001071.1,MATCIWLRSCGARRLGSTFPGCRLRPRAGGLVPASGPAPGPAQLRCYAGRLAGLSAALLRTDSFVGGRWLPAAATFPVQDPASGAALGMVADCGVREARAAVRAAYEAFCRWREVSAKERSSLLRKWYNLMIQNKDDLARIITAESGKPLKEAHGEILYSAFFLEWFSEEARRVYGDIIHTPAKDRRALVLKQPIGVAAVITPWNFPSAMITRKVGAALAAGCTVVVKPAEDTPFSALALAELASQAGIPSGVYNVIPCSRKNAKEVGEAICTDPLVSKISFTGSTTTGKILLHHAANSVKRVSMELGGLAPFIVFDSANVDQAVAGAMASKFRNTGQTCVCSNQFLVQRGIHDAFVKAFAEAMKKNLRVGNGFEEGTTQGPLINEKAVEKVEKQVNDAVSKGATVVTGGKRHQLGKNFFEPTLLCNVTQDMLCTHEETFGPLAPVIKFDTEEEAIAIANAADVGLAGYFYSQDPAQIWRVAEQLEVGMVGVNEGLISSVECPFGGVKQSGLGREGSKYGIDEYLELKYVCYGGL,535,NP_001071.1.csv,refseq-ALDH5A1-NM_001080.3_clinical_seed_0_final,refseq-ALDH5A1-NM_001080.3.a2m,Invitae,refseq-ALDH5A1-NM_001080.3.npy,1,535,535
+NP_001072982.1,MSEPHRVQFTSLPGSLNPAFLKKSRKEEAGAGEQHQDCEPAAAAVRITLTLFEPDHKRCPEFFYPELVKNIRGKVKGLQPGDKKKDLSDPFNDEEKERHKVEALARKFEEKYGGKKRRKDRIQDLIDMGYGYDESDSFIDNSEAYDELVPASLTTKYGGFYINSGTLQFRQASESEDDFIKEKKKKSPKKRKLKEGGEKIKKKKKDDTYDKEKKSKKSKFSKAGFTALNASKEKKKKKYSGALSVKEMLKKFQKEKEAQKKREEEHKPVAVPSAEAQGLRELEGASDPLLSLFGSTSDNDLLQAATAMDSLTDLDLEHLLSESPEGSPFRDMDDGSDSLGVGLDQEFRQPSSLPEGLPAPLEKRVKELAQAARAAEGESRQKFFTQDINGILLDIEAQTRELSSQVRSGVYAYLASFLPCSKDALLKRARKLHLYEQGGRLKEPLQKLKEAIGRAMPEQMAKYQDECQAHTQAKVAKMLEEEKDKEQRDRICSDEEEDEEKGGRRIMGPRKKFQWNDEIRELLCQVVKIKLESQDLERNNKAQAWEDCVKGFLDAEVKPLWPKGWMQARTLFKESRRGHGHLTSILAKKKVMAPSKIKVKESSTKPDKKVSVPSGQIGGPIALPSDHQTGGLSIGASSRELPSQASGGLANPPPVNLEDSLDEDLIRNPASSVEAVSKELAALNSRAAGNSEFTLPAPSKAPAEKVGGVLCTEEKRNFAKPSPSAPPPASSLQSPLNFLAEQALALGQSSQEKKPESSGYKELSCQAPLNKGLPEVHQSKAKHHSLPRTSHGPQVAVPVPGPQVKVFHAGTQQQKNFTPPSPFANKLQGPKASPTQCHRSLLQLVKTAAKGQGFHPSAPATSGGLSASSSSSHKTPASSSSALSHPAKPHSVSSAGSSYKNNPFASSISKHGVSSGSSSSGGTPVQSSVSGSLVPGIQPPSVGQATSRPVPSSAGKKMPVSQKLTLVAPPGGPNGDSSGGTQGVAKLLTSPSLKPSAVSSVTSSTSLSKGASGTVLLAGSSLMASPYKSSSPKLSGAMSSNSLGIITPVPIPVHVLSFSADSSAKAGVSKDAIVTGPAPGSFHHGLGHSLLAGLHSSPPHAAPLPHAAVPTHIPQSLPGASQLHGKGPAVPRKL,1134,NP_001072982.1.csv,refseq-UBN1-NM_001079514.3_clinical_seed_0_final,refseq-UBN1-NM_001079514.3.a2m,Invitae,refseq-UBN1-NM_001079514.3_theta_0.2.npy,1,1134,1134
+NP_001073136.1,MWSGGSGKARGWEAAAGGRSSPGRLSRRRIMSMSPKHTTPFSVSDILSPLEESYKKVGMEGGGLGAPLAAYRQGQAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNLGNMSELPPYQDTMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGDVSKNMAPLPSAPRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKRQAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVLVKDGKPCQAGAPAPGAASLQGHAQQQAQHQAQAAQAAAAAISVGSGGAGLGAHPGHQPGSAGQSPDLAHHAASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW,401,NP_001073136.1.csv,refseq-NKX2-1-NM_001079668.2_clinical_seed_0_final,refseq-NKX2-1-NM_001079668.2.a2m,Invitae,refseq-NKX2-1-NM_001079668.2.npy,1,401,401
+NP_001073270.1,MSRINKNVVLALLTLTSSAFLLFQLYYYKHYLSTKNGAGLSKSKGSRIGFDSTQWRAVKKFIMLTSNQNVPVFLIDPLILELINKNFEQVKNTSHGSTSQCKFFCVPRDFTAFALQYHLWKNEEGWFRIAENMGFQCLKIESKDPRLDGIDSLSGTEIPLHYICKLATHAIHLVVFHERSGNYLWHGHLRLKEHIDRKFVPFRKLQFGRYPGAFDRPELQQVTVDGLEVLIPKDPMHFVEEVPHSRFIECRYKEARAFFQQYLDDNTVEAVAFRKSAKELLQLAAKTLNKLGVPFWLSSGTCLGWYRQCNIIPYSKDVDLGIFIQDYKSDIILAFQDAGLPLKHKFGKVEDSLELSFQGKDDVKLDVFFFYEETDHMWNGGTQAKTGKKFKYLFPKFTLCWTEFVDMKVHVPCETLEYIEANYGKTWKIPVKTWDWKRSPPNVQPNGIWPISEWDEVIQLY,461,NP_001073270.1.csv,refseq-FKTN-NM_001079802.1_clinical_seed_0_final,refseq-FKTN-NM_001079802.1.a2m,Invitae,refseq-FKTN-NM_001079802.1.npy,1,461,461
+NP_001073279.2,MFEIDYSRDSFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDILCPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGIPQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTDWTIFPLDTEDAVCSHLGGWGHRDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLPQDTFIQFPGWTKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWAPCSSDDPELCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV,647,NP_001073279.2.csv,refseq-GLB1-NM_001079811.3_clinical_seed_0_final,refseq-GLB1-NM_001079811.3.a2m,Invitae,refseq-GLB1-NM_001079811.3.npy,1,647,647
+NP_001073882.3,MALLLTLTSPDLEGTWDTRDKDGFKAQEGPPLAVPEFPVCGLYRIYGVCGSFSSFFIIRCSLCALETLKSPQHDPLEIPEQSLKLIPLVSGKRELTRGQKAGEKPLAAGPGEEELLRGSAPHAQDTQSEELPPSCTISGEKKPPAVSGEATGADAGRLCPPPRSRAPHKDRTLARSRPQTQGEDCSLPVGEVKIGKRSYSPAPGKQKKPNAMGLAPTSSPGAPNSARATHNPVPCGSGRGPCHLANLLSTLAQSNQNRDHKQGPPEVTCQIRKKTRTLYRSDQLEELEKIFQEDHYPDSDKRREIAQTVGVTPQRIMVKGAGSLVAGWSGGGPTIETLELQSERSAVAWVWFQNRRAKWRKMEKLNGKESKDNPAAPGPASSQCSSAAEILPAVPMEPKPDPFPQESPLDTFPEPPMLLTSDQTLAPTQPSEGAQRVVTPPLFSPPPVRRADLPFPLGPVHTPQLMPLLMDVAGSDSSHKDGPCGSWGTSITLPPPCSYLEELEPQDYQQSNQPGPFQFSQAPQPPLFQSPQPKLPYLPTFPFSMPSSLTLPPPEDSLFMFPCGPSGGTSQGYCPGASSGQILMQPPAGNIGTASWSDPCLPELPFPGPFCPQALGHPPGGDGYFPDLFPTPCPQALGRQPSSALSWMPEGARPGTGPLLSKAKEEPPAASLDQPSALEEARGDDKNSHVP,691,NP_001073882.3.csv,refseq-NOBOX-NM_001080413.3_clinical_seed_0_final,refseq-NOBOX-NM_001080413.3.a2m,Invitae,refseq-NOBOX-NM_001080413.3.npy,1,691,691
+NP_001073883.2,MDVTVSELLELFLQSPLVTWVKTFGPFGSGSQDNLTMYMDLVDGIFLNQIMLQIDPRPTNQRINKHVNNDVNLRIQNLTILVRNIKTYYQEVLQQLIVMNLPNVLMIGRDPLSGKSMEEIKKVLLLVLGCAVQCERKEEFIERIKQLDIETQAGIVAHIQEVTHNQENVFDLQWLELPDVAPEELEALSRSMVLHLRRLIDQRDECTELIVDLTQERDYLQAQHPPSPIKSSSADSTPSPTSSLSSEDKQHLAVELADTKARLRRVRQELEDKTEQLVDTRHEVDQLVLELQKVKQENIQLAADARSARAYRDELDSLREKANRVERLELELTRCKEKLHDVDFYKARMEELREDNIILIETKAMLEEQLTAARARGDKVHELEKENLQLKSKLHDLELDRDTDKKRIEELLEENMVLEIAQKQSMNESAHLGWELEQLSKNADLSDASRKSFVFELNECASSRILKLEKENQSLQSTIQGLRDASLVLEESGLKCGELEKENHQLSKKIEKLQTQLEREKQSNQDLETLSEELIREKEQLQSDMETLKADKARQIKDLEQEKDHLNRAMWSLRERSQVSSEARMKDVEKENKALHQTVTEANGKLSQLEFEKRQLHRDLEQAKEKGERAEKLERELQRLQEENGRLARKVTSLETATEKVEALEHESQGLQLENRTLRKSLDTLQNVSLQLEGLERDNKQLDAENLELRRLVETMRFTSTKLAQMERENQQLEREKEELRKNVDLLKALGKKSERLELSYQSVSAENLRLQQSLESSSHKTQTLESELGELEAERQALRRDLEALRLANAQLEGAEKDRKALEQEVAQLEKDKKLLEKEAKRLWQQVELKDAVLDDSTAKLSAVEKESRALDKELARCRDAAGKLKELEKDNRDLTKQVTVHARTLTTLREDLVLEKLKSQQLSSELDKLSQELEKVGLNRELLLQEDDSGSDTKYKILEGRNESALKTTLAMKEEKIVLLEAQMEEKASLNRQLESELQMLKKECETLRQNQGEGQHLQNSFKHPAGKTAASHQGKEAWGPGHKEATMELLRVKDRAIELERNNAALQAEKQLLKEQLQHLETQNVTFSSQILTLQKQSAFLQEHNTTLQTQTAKLQVENSTLSSQSAALTAQYTLLQNHHTAKETENESLQRQQEQLTAAYEALLQDHEHLGTLHERQSAEYEALIRQHSCLKTLHRNLELEHKELGERHGDMLKRKAELEEREKVLTTEREALQQEQRTNALAMGENQRLRGELDRVNFLHHQLKGEYEELHAHTKELKTSLNNAQLELNRWQARFDELKEQHQTMDISLTKLDNHCELLSRLKGNLEEENHHLLSQIQLLSQQNQMLLEQNMENKEQYHEEQKQYIDKLNALRRHKEKLEEKIMDQYKFYDPPPKKKNHWIGAKALVKLIKPKKEGSRERLKSTVDSPPWQLESSDPASPAASQPLRSQAENPDTPALGSNCAEERDAHNGSVGKGPGDLKPKRGSPHRGSLDRTDASTDLAMRSWPSELGSRTCSTSATTTAPSNSTPIARHPGRTKGYNSDDNLCEPSLEFEVPNHRQYVSRPSSLESSRNTSSNSSPLNLKGSSEQLHGRSESFSSEDLIPSRDLATLPREASTPGRNALGRHEYPLPRNGPLPQEGAQKRGTAPPYVGVRPCSASPSSEMVTLEEFLEESNRSSPTHDTPSCRDDLLSDYFRKASDPPAIGGQPGPPAKKEGAKMPTNFVAPTVKMAAPTSEGRPLKPGQYVKPNFRLTEAEAPPSVAPRQAQPPQSLSLGRPRQAPVPPASHAPASRSASLSRAFSLASADLLRASGPEACKQESPQKLGAPEALGGRETGSHTLQSPAPPSSHSLARERTPLVGKAGSSCQGPGPRSRPLDTRRFSLAPPKEERLAPLHQSATAPAIATAGAGAAAAGSGSNSQLLHFSPAAAPAARTKPKAPPRSGEVATITPVRAGLSLSEGDGVPGQGCSEGLPAKSPGRSPDLAPHLGRALEDCSRGSVSKSSPASPEPGGDPQTVWYEYGCV,2028,NP_001073883.2.csv,refseq-CCDC88C-NM_001080414.3_clinical_seed_0_final,refseq-CCDC88C-NM_001080414.3.a2m,Invitae,refseq-CCDC88C-NM_001080414.3.npy,1,2028,2028
+NP_001073890.2,MSLLCVGVKKAKFDGAQEKFNTYVTLKVQNVKSTTIAVRGSQPSWEQDFMFEINRLDLGLTVEVWNKGLIWDTMVGTVWIPLRTIRQSNEEGPGEWLTLDSQVIMADSEICGTKDPTFHRILLDTRFELPLDIPEEEARYWAKKLEQLNAMRDQDEYSFQDEQDKPLPVPSNQCCNWNYFGWGEQHNDDPDSAVDDRDSDYRSETSNSIPPPYYTTSQPNASVHQYSVRPPPLGSRESYSDSMHSYEEFSEPQALSPTGSSRYASSGELSQGSSQLSEDFDPDEHSLQGSDMEDERDRDSYHSCHSSVSYHKDSPRWDQDEEELEEDLEDFLEEEELPEDEEELEEEEEEVPDDLGSYAQREDVAVAEPKDFKRISLPPAAPGKEDKAPVAPTEAPDMAKVAPKPATPDKVPAAEQIPEAEPPKDEESFRPREDEEGQEGQDSMSRAKANWLRAFNKVRMQLQEARGEGEMSKSLWFKGGPGGGLIIIDSMPDIRKRKPIPLVSDLAMSLVQSRKAGITSALASSTLNNEELKNHVYKKTLQALIYPISCTTPHNFEVWTATTPTYCYECEGLLWGIARQGMRCTECGVKCHEKCQDLLNADCLQRAAEKSSKHGAEDRTQNIIMVLKDRMKIRERNKPEIFELIQEIFAVTKTAHTQQMKAVKQSVLDGTSKWSAKISITVVCAQGLQAKDKTGSSDPYVTVQVGKTKKRTKTIYGNLNPVWEENFHFECHNSSDRIKVRVWDEDDDIKSRVKQRFKRESDDFLGQTIIEVRTLSGEMDVWYNLDKRTDKSAVSGAIRLHISVEIKGEEKVAPYHVQYTCLHENLFHFVTDVQNNGVVKIPDAKGDDAWKVYYDETAQEIVDEFAMRYGVESIYQAMTHFACLSSKYMCPGVPAVMSTLLANINAYYAHTTASTNVSASDRFAASNFGKERFVKLLDQLHNSLRIDLSMYRNNFPASSPERLQDLKSTVDLLTSITFFRMKVQELQSPPRASQVVKDCVKACLNSTYEYIFNNCHELYSREYQTDPAKKGEVLPEEQGPSIKNLDFWSKLITLIVSIIEEDKNSYTPCLNQFPQELNVGKISAEVMWNLFAQDMKYAMEEHDKHRLCKSADYMNLHFKVKWLYNEYVTELPAFKDRVPEYPAWFEPFVIQWLDENEEVSRDFLHGALERDKKDGFQQTSEHALFSCSVVDVFSQLNQSFEIIKKLECPDPQIVGHYMRRFAKTISNVLLQYADIISKDFASYCSKEKEKVPCILMNNTQQLRVQLEKMFEAMGGKELDAEASDILKELQVKLNNVLDELSRVFATSFQPHIEECVKQMGDILSQVKGTGNVPASACSSVAQDADNVLQPIMDLLDSNLTLFAKICEKTVLKRVLKELWKLVMNTMEKTIVLPPLTDQTMIGNLLRKHGKGLEKGRVKLPSHSDGTQMIFNAAKELGQLSKLKDHMVREEAKSLTPKQCAVVELALDTIKQYFHAGGVGLKKTFLEKSPDLQSLRYALSLYTQATDLLIKTFVQTQSAQGLGVEDPVGEVSVHVELFTHPGTGEHKVTVKVVAANDLKWQTSGIFRPFIEVNIIGPQLSDKKRKFATKSKNNSWAPKYNESFQFTLSADAGPECYELQVCVKDYCFAREDRTVGLAVLQLRELAQRGSAACWLPLGRRIHMDDTGLTVLRILSQRSNDEVAKEFVKLKSDTRSAEEGGAAPAP,1703,NP_001073890.2.csv,refseq-UNC13A-NM_001080421.2_clinical_seed_0_final,refseq-UNC13A-NM_001080421.2.a2m,Invitae,refseq-UNC13A-NM_001080421.2.npy,1,1703,1703
+NP_001073893.1,MHRAVDPPGARAAREAFALGGLSCAGAWSSCPPHPPPRSAWLPGGRCSASIGQPPLPAPLPPSHGSSSGHPSKPYYAPGAPTPRPLHGKLESLHGCVQALLREPAQPGLWEQLGQLYESEHDSEEATRCYHSALRYGGSFAELGPRIGRLQQAQLWNFHTGSCQHRAKVLPPLEQVWNLLHLEHKRNYGAKRGGPPVKRAAEPPVVQPVPPAALSGPSGEEGLSPGGKRRRGCNSEQTGLPPGLPLPPPPLPPPPPPPPPPPPPLPGLATSPPFQLTKPGLWSTLHGDAWGPERKGSAPPERQEQRHSLPHPYPYPAPAYTAHPPGHRLVPAAPPGPGPRPPGAESHGCLPATRPPGSDLRESRVQRSRMDSSVSPAATTACVPYAPSRPPGLPGTTTSSSSSSSSNTGLRGVEPNPGIPGADHYQTPALEVSHHGRLGPSAHSSRKPFLGAPAATPHLSLPPGPSSPPPPPCPRLLRPPPPPAWLKGPACRAAREDGEILEELFFGTEGPPRPAPPPLPHREGFLGPPASRFSVGTQDSHTPPTPPTPTTSSSNSNSGSHSSSPAGPVSFPPPPYLARSIDPLPRPPSPAQNPQDPPLVPLTLALPPAPPSSCHQNTSGSFRRPESPRPRVSFPKTPEVGPGPPPGPLSKAPQPVPPGVGELPARGPRLFDFPPTPLEDQFEEPAEFKILPDGLANIMKMLDESIRKEEEQQQHEAGVAPQPPLKEPFASLQSPFPTDTAPTTTAPAVAVTTTTTTTTTTTATQEEEKKPPPALPPPPPLAKFPPPSQPQPPPPPPPSPASLLKSLASVLEGQKYCYRGTGAAVSTRPGPLPTTQYSPGPPSGATALPPTSAAPSAQGSPQPSASSSSQFSTSGGPWARERRAGEEPVPGPMTPTQPPPPLSLPPARSESEVLEEISRACETLVERVGRSATDPADPVDTAEPADSGTERLLPPAQAKEEAGGVAAVSGSCKRRQKEHQKEHRRHRRACKDSVGRRPREGRAKAKAKVPKEKSRRVLGNLDLQSEEIQGREKSRPDLGGASKAKPPTAPAPPSAPAPSAQPTPPSASVPGKKAREEAPGPPGVSRADMLKLRSLSEGPPKELKIRLIKVESGDKETFIASEVEERRLRMADLTISHCAADVVRASRNAKVKGKFRESYLSPAQSVKPKINTEEKLPREKLNPPTPSIYLESKRDAFSPVLLQFCTDPRNPITVIRGLAGSLRLNLGLFSTKTLVEASGEHTVEVRTQVQQPSDENWDLTGTRQIWPCESSRSHTTIAKYAQYQASSFQESLQEEKESEDEESEEPDSTTGTPPSSAPDPKNHHIIKFGTNIDLSDAKRWKPQLQELLKLPAFMRVTSTGNMLSHVGHTILGMNTVQLYMKVPGSRTPGHQENNNFCSVNINIGPGDCEWFAVHEHYWETISAFCDRHGVDYLTGSWWPILDDLYASNIPVYRFVQRPGDLVWINAGTVHWVQATGWCNNIAWNVGPLTAYQYQLALERYEWNEVKNVKSIVPMIHVSWNVARTVKISDPDLFKMIKFCLLQSMKHCQVQRESLVRAGKKIAYQGRVKDEPAYYCNECDVEVFNILFVTSENGSRNTYLVHCEGCARRRSAGLQGVVVLEQYRTEELAQAYDAFTLVRARRARGQRRRALGQAAGTGFGSPAAPFPEPPPAFSPQAPASTSR,1682,NP_001073893.1.csv,refseq-KDM6B-NM_001080424.1_clinical_seed_0_final,refseq-KDM6B-NM_001080424.1.a2m,Invitae,refseq-KDM6B-NM_001080424.1.npy,1,1682,1682
+NP_001073911.1,MEGQTPGSRGLPEKPHPATAAATLSSMGAVFILMKSALGAGLLNFPWAFSKAGGVVPAFLVELVSLVFLISGLVILGYAAAVSGQATYQGVVRGLCGPAIGKLCEACFLLNLLMISVAFLRVIGDQLEKLCDSLLSGTPPAPQPWYADQRFTLPLLSVLVILPLSAPREIAFQKYTSILGTLAACYLALVITVQYYLWPQGLVRESHPSLSPASWTSVFSVFPTICFGFQCHEAAVSIYCSMRKRSLSHWALVSVLSLLACCLIYSLTGVYGFLTFGTEVSADVLMSYPGNDMVIIVARVLFAVSIVTVYPIVLFLGRSVMQDFWRRSCLGGWGPSALADPSGLWVRMPLTILWVTVTLAMALFMPDLSEIVSIIGGISSFFIFIFPGLCLICAMGVEPIGPRVKCCLEVWGVVSVLVGTFIFGQSTAAAVWEMF,435,NP_001073911.1.csv,refseq-SLC38A8-NM_001080442.2_clinical_seed_0_final,refseq-SLC38A8-NM_001080442.2.a2m,Invitae,refseq-SLC38A8-NM_001080442.2.npy,1,435,435
+NP_001073918.2,MEQLNELELLMEKSFWEEAELPAELFQKKVVASFPRTVLSTGMDNRYLVLAVNTVQNKEGNCEKRLVITASQSLENKELCILRNDWCSVPVEPGDIIHLEGDCTSDTWIIDKDFGYLILYPDMLISGTSIASSIRCMRRAVLSETFRSSDPATRQMLIGTVLHEVFQKAINNSFAPEKLQELAFQTIQEIRHLKEMYRLNLSQDEIKQEVEDYLPSFCKWAGDFMHKNTSTDFPQMQLSLPSDNSKDNSTCNIEVVKPMDIEESIWSPRFGLKGKIDVTVGVKIHRGYKTKYKIMPLELKTGKESNSIEHRSQVVLYTLLSQERRADPEAGLLLYLKTGQMYPVPANHLDKRELLKLRNQMAFSLFHRISKSATRQKTQLASLPQIIEEEKTCKYCSQIGNCALYSRAVEQQMDCSSVPIVMLPKIEEETQHLKQTHLEYFSLWCLMLTLESQSKDNKKNHQNIWLMPASEMEKSGSCIGNLIRMEHVKIVCDGQYLHNFQCKHGAIPVTNLMAGDRVIVSGEERSLFALSRGYVKEINMTTVTCLLDRNLSVLPESTLFRLDQEEKNCDIDTPLGNLSKLMENTFVSKKLRDLIIDFREPQFISYLSSVLPHDAKDTVACILKGLNKPQRQAMKKVLLSKDYTLIVGMPGTGKTTTICTLVRILYACGFSVLLTSYTHSAVDNILLKLAKFKIGFLRLGQIQKVHPAIQQFTEQEICRSKSIKSLALLEELYNSQLIVATTCMGINHPIFSRKIFDFCIVDEASQISQPICLGPLFFSRRFVLVGDHQQLPPLVLNREARALGMSESLFKRLEQNKSAVVQLTVQYRMNSKIMSLSNKLTYEGKLECGSDKVANAVINLRHFKDVKLELEFYADYSDNPWLMGVFEPNNPVCFLNTDKVPAPEQVEKGGVSNVTEAKLIVFLTSIFVKAGCSPSDIGIIAPYRQQLKIINDLLARSIGMVEVNTVDKYQGRDKSIVLVSFVRSNKDGTVGELLKDWRRLNVAITRAKHKLILLGCVPSLNCYPPLEKLLNHLNSEKLIIDLPSREHESLCHILGDFQRE,1060,NP_001073918.2.csv,refseq-DNA2-NM_001080449.2_clinical_seed_0_final,refseq-DNA2-NM_001080449.2.a2m,Invitae,refseq-DNA2-NM_001080449.2.npy,1,1060,1060
+NP_001073936.1,MSVGELYSQCTRVWIPDPDEVWRSAELTKDYKEGDKSLQLRLEDETILEYPIDVQRNQLPFLRNPDILVGENDLTALSYLHEPAVLHNLKVRFLESNHIYTYCGIVLVAINPYEQLPIYGQDVIYTYSGQNMGDMDPHIFAVAEEAYKQMARDEKNQSIIVSGESGAGKTVSAKYAMRYFATVGGSASETNIEEKVLASSPIMEAIGNAKTTRNDNSSRFGKYIQIGFDKRYHIIGANMRTYLLEKSRVVFQADDERNYHIFYQLCAAAGLPEFKELALTSAEDFFYTSQGGDTSIEGVDDAEDFEKTRQAFTLLGVKESHQMSIFKIIASILHLGSVAIQAERDGDSCSISPQDVYLSNFCRLLGVEHSQMEHWLCHRKLVTTSETYVKTMSLQQVINARNALAKHIYAQLFGWIVEHINKALHTSLKQHSFIGVLDIYGFETFEVNSFEQFCINYANEKLQQQFNSHVFKLEQEEYMKEQIPWTLIDFYDNQPCIDLIEAKLGILDLLDEECKVPKGTDQNWAQKLYDRHSSSQHFQKPRMSNTAFIIVHFADKVEYLSDGFLEKNRDTVYEEQINILKASKFPLVADLFHDDKDPVPATTPGKGSSSKISVRSARPPMKVSNKEHKKTVGHQFRTSLHLLMETLNATTPHYVRCIKPNDEKLPFHFDPKRAVQQLRACGVLETIRISAAGYPSRWAYHDFFNRYRVLVKKRELANTDKKAICRSVLENLIKDPDKFQFGRTKIFFRAGQVAYLEKLRADKFRTATIMIQKTVRGWLQKVKYHRLKGATLTLQRYCRGHLARRLAEHLRRIRAAVVLQKHYRMQRARQAYQRVRRAAVVIQAFTRAMFVRRTYRQVLMEHKATTIQKHVRGWMARRHFQRLRDAAIVIQCAFRMLKARRELKALRIEARSAEHLKRLNVGMENKVVQLQRKIDEQNKEFKTLSEQLSVTTSTYTMEVERLKKELVHYQQSPGEDTSLRLQEEVESLRTELQRAHSERKILEDAHSREKDELRKRVADLEQENALLKDEKEQLNNQILCQSKDEFAQNSVKENLMKKELEEERSRYQNLVKEYSQLEQRYDNLRDEMTIIKQTPGHRRNPSNQSSLESDSNYPSISTSEIGDTEDALQQVEEIGLEKAAMDMTVFLKLQKRVRELEQERKKLQVQLEKREQQDSKKVQAEPPQTDIDLDPNADLAYNSLKRQELESENKKLKNDLNELRKAVADQATQNNSSHGSPDSYSLLLNQLKLAHEELEVRKEEVLILRTQIVSADQRRLAGRNAEPNINARSSWPNSEKHVDQEDAIEAYHGVCQTNSKTEDWGYLNEDGELGLAYQGLKQVARLLEAQLQAQSLEHEEEVEHLKAQLEALKEEMDKQQQTFCQTLLLSPEAQVEFGVQQEISRLTNENLDLKELVEKLEKNERKLKKQLKIYMKKAQDLEAAQALAQSERKRHELNRQVTVQRKEKDFQGMLEYHKEDEALLIRNLVTDLKPQMLSGTVPCLPAYILYMCIRHADYTNDDLKVHSLLTSTINGIKKVLKKHNDDFEMTSFWLSNTCRLLHCLKQYSGDEGFMTQNTAKQNEHCLKNFDLTEYRQVLSDLSIQIYQQLIKIAEGVLQPMIVSAMLENESIQGLSGVKPTGYRKRSSSMADGDNSYCLEAIIRQMNAFHTVMCDQGLDPEIILQVFKQLFYMINAVTLNNLLLRKDVCSWSTGMQLRYNISQLEEWLRGRNLHQSGAVQTMEPLIQAAQLLQLKKKTQEDAEAICSLCTSLSTQQIVKILNLYTPLNEFEERVTVAFIRTIQAQLQERNDPQQLLLDAKHMFPVLFPFNPSSLTMDSIHIPACLNLEFLNEV,1848,NP_001073936.1.csv,refseq-MYO5B-NM_001080467.2_clinical_seed_0_final,refseq-MYO5B-NM_001080467.2.a2m,Invitae,refseq-MYO5B-NM_001080467.2_theta_0.2.npy,1,1848,1848
+NP_001073945.1,MLKREMKPESDRPRKVRFRIASSHSGRVLKEVYEDGQPSGSLDSECASICGIDGLGDSDGQQNGHIESEGDENENDQDSLLVLARAASEKGFGTRRVNILSKNGTVRGVKYKVSAGQALFNNLTKVLQQPSTDLEFDRVVIYTTCLRVVRTTFERCELVRKIFQNHRVKFEEKNIALNGEYGKELDERCRRVSEAPSLPVVFIDGHYLGGAEKILSMNESGELQDILTKIERVQHPHECPSCGGFGFLPCSVCHGSKMSMFRNCFTDSFKALKCTACNENGLQRCKNCAG,290,NP_001073945.1.csv,refseq-GRXCR1-NM_001080476.2_clinical_seed_0_final,refseq-GRXCR1-NM_001080476.2.a2m,Invitae,refseq-GRXCR1-NM_001080476.2.npy,1,290,290
+NP_001073986.1,MSIAIPLGVTTSDTSYSDMAAGSDPESVEASPAVNEKSVYSTHNYGTTQRHGCRGLPYATIIPRSDLNGLPSPVEERCGDSPNSEGETVPTWCPCGLSQDGFLLNCDKCRGMSRGKVIRLHRRKQDNISGGDSSATESWDEELSPSTVLYTATQHTPTSITLTVRRTKPKKRKKSPEKGRAAPKTKKIKNSPSEAQNLDENTTEGWENRIRLWTDQYEEAFTNQYSADVQNALEQHLHSSKEFVGKPTILDTINKTELACNNTVIGSQMQLQLGRVTRVQKHRKILRAARDLALDTLIIEYRGKVMLRQQFEVNGHFFKKPYPFVLFYSKFNGVEMCVDARTFGNDARFIRRSCTPNAEVRHMIADGMIHLCIYAVSAITKDAEVTIAFDYEYSNCNYKVDCACHKGNRNCPIQKRNPNATELPLLPPPPSLPTIGAETRRRKARRKELEMEQQNEASEENNDQQSQEVPEKVTVSSDHEEVDNPEEKPEEEKEEVIDDQENLAHSRRTREDRKVEAIMHAFENLEKRKKRRDQPLEQSNSDVEITTTTSETPVGEETKTEAPESEVSNSVSNVTIPSTPQSVGVNTRRSSQAGDIAAEKLVPKPPPAKPSRPRPKSRISRYRTSSAQRLKRQKQANAQQAELSQAALEEGGSNSLVTPTEAGSLDSSGENRPLTGSDPTVVSITGSHVNRAASKYPKTKKYLVTEWLNDKAEKQECPVECPLRITTDPTVLATTLNMLPGLIHSPLICTTPKHYIRFGSPFIPERRRRPLLPDGTFSSCKKRWIKQALEEGMTQTSSVPQETRTQHLYQSNENSSSSSICKDNADLLSPLKKWKSRYLMEQNVTKLLRPLSPVTPPPPNSGSKSPQLATPGSSHPGEEECRNGYSLMFSPVTSLTTASRCNTPLQFELCHRKDLDLAKVGYLDSNTNSCADRPSLLNSGHSDLAPHPSLGPTSETGFPSRSGDGHQTLVRNSDQAFRTEFNLMYAYSPLNAMPRADGLYRGSPLVGDRKPLHLDGGYCSPAEGFSSRYEHGLMKDLSRGSLSPGGERACEGVPSAPQNPPQRKKVSLLEYRKRKQEAKENSAGGGGDSAQSKSKSAGAGQGSSNSVSDTGAHGVQGSSARTPSSPHKKFSPSHSSMSHLEAVSPSDSRGTSSSHCRPQENISSRWMVPTSVERLREGGSIPKVLRSSVRVAQKGEPSPTWESNITEKDSDPADGEGPETLSSALSKGATVYSPSRYSYQLLQCDSPRTESQSLLQQSSSPFRGHPTQSPGYSYRTTALRPGNPPSHGSSESSLSSTSYSSPAHPVSTDSLAPFTGTPGYFSSQPHSGNSTGSNLPRRSCPSSAASPTLQGPSDSPTSDSVSQSSTGTLSSTSFPQNSRSSLPSDLRTISLPSAGQSAVYQASRVSAVSNSQHYPHRGSGGVHQYRLQPLQGSGVKTQTGLS,1442,NP_001073986.1.csv,refseq-SETD5-NM_001080517.2_clinical_seed_0_final,refseq-SETD5-NM_001080517.2.a2m,Invitae,refseq-SETD5-NM_001080517.2.npy,1,1442,1442
+NP_001073991.2,MNPREEKVKIITEEFIENDEDADMGRQNKNSKVRRQPRKKQPPTAVPKEMVSEKSHLGNPQEPVQEEPKTRLLSMTVRRGPRSLPPIPSTSRTGFAEFSMRGRMREKLQAARSKAESALLQEIPTPRPRRLRSPSKKELETEFGTEPGKEVERTQQEVDSQSYSRVKFHDSARKIKPKPQVPPGFPSAEEAYNFFTFNFDPEPEGSEEKPKARHRAGTNQEEEEGEEEEPPAQGGGKEMDEEELLNGDDAEDFLLGLDHVADDFVAVRPADYESIHDRLQMEREMLFIPSRQTVPTYKKLPENVQPRFLEDEGLYTGVRPEVARTNQNIMENRLLMQDPERRWFGDDGRILALPNPIKPFPSRPPVLTQEQSIKAELETLYKKAVKYVHSSQHVIRSGDPPGNFQLDIDISGLIFTHHPCFSREHVLAAKLAQLYDQYLARHQRNKAKFLTDKLQALRNAVQTGLDPEKPHQSLDTIQKTINEYKSEIRQTRKFRDAEQEKDRTLLKTIIKVWKEMKSLREFQRFTNTPLKLVLRKEKADQKADEEAYEAEIQAEISELLEEHTEEYAQKMEEYRTSLQQWKAWRKVQRAKKKKRKQAAEEHPGDEIAEPYPEEDLVKPSPPEPTDRAVIEQEVRERAAQSRRRPWEPTLVPELSLAGSVTPNDQCPRAEVSRREDVKKRSVYLKVLFNNKEVSRTVSRPLGADFRVHFGQIFNLQIVNWPESLTLQVYETVGHSSPTLLAEVFLPIPETTVVTGRAPTEEVEFSSNQHVTLDHEGVGSGVPFSFEADGSNQLTLMTSGKVSHSVAWAIGENGIPLIPPLSQQNIGFRSALKKADAISSIGTSGLTDMKKLAKWAAESKLDPNDPNNAPLMQLISVATSGESYVPDFFRLEQLQQEFNFVSDQELNRSKRFRLLHLRSQEVPEFRNYKQVPVYDREIMEKVFQDYEKRLRDRNVIETKEHIDTHRAIVAKYLQQVRESVINRFLIAKQYFLLADMIVEEEVPNISILGLSLFKLAEQKRPLRPRRKGRKKVTAQNLSDGDIKLLVNIVRAYDIPVRKPAVSKFQQPSRSSRMFSEKHAASPSTYSPTHNADYPLGQVLVRPFVEVSFQRTVCHTTTAEGPNPSWNEELELPFRAPNGDYSTASLQSVKDVVFINIFDEVLHDVLEDDRERGSGIHTRIERHWLGCVKMPFSTIYFQARIDGTFKIDIPPVLLGYSKERNMILERGFDSVRSLSEGSYITLFITIEPQLVPGESIREKFESQEDEKLLQATEKFQAECALKFPNRQCLTTVIDISGKTVFITRYLKPLNPPQELLNVYPNNLQATAELVARYVSLIPFLPDTVSFGGICDLWSTSDQFLDLLAGDEEEHAVLLCNYFLSLGKKAWLLMGNAIPEGPTAYVLTWEQGRYLIWNPCSGHFYGQFDTFCPLKNVGCLIGPDNIWFNIQRYESPLRINFDVTRPKLWKSFFSRSLPYPGLSSVQPEELIYQRSDKAAAAELQDRIEKILKEKIMDWRPRHLTRWNRYCTSTLRHFLPLLEKSQGEDVEDDHRAELLKQLGDYRFSGFPLHMPYSEVKPLIDAVYSTGVHNIDVPNVEFALAVYIHPYPKNVLSVWIYVASLIRNR,1620,NP_001073991.2.csv,refseq-CC2D2A-NM_001080522.2_clinical_seed_0_final,refseq-CC2D2A-NM_001080522.2.a2m,Invitae,refseq-CC2D2A-NM_001080522.2_theta_0.2.npy,1,1620,1620
+NP_001075.1,MTSSGPGPRFLLLLPLLLPPAASASDRPRGRDPVNPEKLLVITVATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYADREDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTGKRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKSRIFQNLNGALDEVVLKFDRNRVRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPEGGCGFCNQDRRTLPGGQPPPRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLHNNEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFSLDADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQRKRVGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWLQLLRTYVGPMTESLFPGYHTKARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHKGLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP,738,NP_001075.1.csv,refseq-PLOD3-NM_001084.4_clinical_seed_0_final,refseq-PLOD3-NM_001084.4.a2m,Invitae,refseq-PLOD3-NM_001084.4.npy,1,738,738
+NP_001075019.1,MAAAAVVVPAEWIKNWEKSGRGEFLHLCRILSENKSHDSSTYRDFQQALYELSYHVIKGNLKHEQASNVLSDISEFREDMPSILADVFCILDIETNCLEEKSKRDYFTQLVLACLYLVSDTVLKERLDPETLESLGLIKQSQQFNQKSVKIKTKLFYKQQKFNLLREENEGYAKLIAELGQDLSGSITSDLILENIKSLIGCFNLDPNRVLDVILEVFECRPEHDDFFISLLESYMSMCEPQTLCHILGFKFKFYQEPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNCIMDEHKREIAEAKQIVRKLTMVVLSSEKMDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQHAQNIMDQMPPYYAASHKLIALAICKLIHITIEPLYRRVGVPKGAKGSPVNALQNKRAPKQAESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEFQSDGSKQEDKEKTEVILSCLLSITDQVLLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGRQIGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSLASFCGAVFRKYPIDLAGLLQYVANQLKAGKSFDLLILKEVVQKMAGIEITEEMTMEQLEAMTGGEQLKAEGGYFGQIRNTKKSSQRLKDALLDHDLALPLCLLMAQQRNGVIFQEGGEKHLKLVGKLYDQCHDTLVQFGGFLASNLSTEDYIKRVPSIDVLCNEFHTPHDAAFFLSRPMYAHHISSKYDELKKSEKGSKQQHKVHKYITSCEMVMAPVHEAVVSLHVSKVWDDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAIDDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHVQRVLQRLKLEKDNWLLAKSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSDIIYTVASCTENEASRYGRFLCCMLETVTRWHSDRATYEKECGNYPGFLTILRATGFDGGNKADQLDYENFRHVVHKWHYKLTKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICQEEKEKRPDLYALAMGYSGQLKSRKSYMIPENEFHHKDPPPRNAVASVQNGPGGGPSSSSIGSASKSDESSTEETDKSRERSQCGVKAVNKASSTTPKGNSSNGNSGSNSNKAVKENDKEKGKEKEKEKKEKTPATTPEARVLGKDGKEKPKEERPNKDEKARETKERTPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRERDIAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIPEPEREQKRRKIDTHPSPSHSSTVKDSLIELKESSAKLYINHTPPPLSKSKEREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEKDKEKNKSKSSGKEKGSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR,1593,NP_001075019.1.csv,refseq-THOC2-NM_001081550.1_clinical_seed_0_final,refseq-THOC2-NM_001081550.1.a2m,Invitae,refseq-THOC2-NM_001081550.1.npy,1,1593,1593
+NP_001076007.1,MRPRGLPPLLVVLLGCWASVSAQTDATPAVTTEGLNSTEAALATFGTFPSTRPPGTPRAPGPSSGPRPTPVTDVAVLCVCDLSPAQCDINCCCDPDCSSVDFSVFSACSVPVVTGDSQFCSQKAVIYSLNFTANPPQRVFELVDQINPSIFCIHITNYKPALSFINPEVPDENNFDTLMKTSDGFTLNAESYVSFTTKLDIPTAAKYEYGVPLQTSDSFLRFPSSLTSSLCTDNNPAAFLVNQAVKCTRKINLEQCEEIEALSMAFYSSPEILRVPDSRKKVPITVQSIVIQSLNKTLTRREDTDVLQPTLVNAGHFSLCVNVVLEVKYSLTYTDAGEVTKADLSFVLGTVSSVVVPLQQKFEIHFLQENTQPVPLSGNPGYVVGLPLAAGFQPHKGSGIIQTTNRYGQLTILHSTTEQDCLALEGVRTPVLFGYTMQSGCKLRLTGALPCQLVAQKVKSLLWGQGFPDYVAPFGNSQAQDMLDWVPIHFITQSFNRKHFVLQDSCQLPGALVIEVKWTKYGSLLNPQAKIVNVTANLISSSFPEANSGNERTILISTAVTFVDVSAPAEAGFRAPPAINARLPFNFFFPFV,592,NP_001076007.1.csv,refseq-TCTN1-NM_001082538.2_clinical_seed_0_final,refseq-TCTN1-NM_001082538.2.a2m,Invitae,refseq-TCTN1-NM_001082538.2.npy,1,592,592
+NP_001076580.1,MKPIQKLLAGLILLTWCVEGCSSQHWSYGLRPGGKRDAENLIDSFQEIVKEVGQLAETQRFECTTHQPRSPLRDLKGALESLIEEETGQKKI,92,NP_001076580.1.csv,refseq-GNRH1-NM_001083111.2_clinical_seed_0_final,refseq-GNRH1-NM_001083111.2.a2m,Invitae,refseq-GNRH1-NM_001083111.2.npy,1,92,92
+NP_001076585.1,MAARLLLLGILLLLLPLPVPAPCHTAARSECKRSHKFVPGAWLAGEGVDVTSLRRSGSFPVDTQRFLRPDGTCTLCENALQEGTLQRLPLALTNWRAQGSGCQRHVTRAKVSSTEAVARDAARSIRNDWKVGLDVTPKPTSNVHVSVAGSHSQAANFAAQKTHQDQYSFSTDTVECRFYSFHVVHTPPLHPDFKRALGDLPHHFNASTQPAYLRLISNYGTHFIRAVELGGRISALTALRTCELALEGLTDNEVEDCLTVEAQVNIGIHGSISAEAKACEEKKKKHKMTASFHQTYRERHSEVVGGHHTSINDLLFGIQAGPEQYSAWVNSLPGSPGLVDYTLEPLHVLLDSQDPRREALRRALSQYLTDRARWRDCSRPCPPGRQKSPRDPCQCVCHGSAVTTQDCCPRQRGLAQLEVTFIQAWGLWGDWFTATDAYVKLFFGGQELRTSTVWDNNNPIWSVRLDFGDVLLATGGPLRLQVWDQDSGRDDDLLGTCDQAPKSGSHEVRCNLNHGHLKFRYHARCLPHLGGGTCLDYVPQMLLGEPPGNRSGAVW,555,NP_001076585.1.csv,refseq-PRF1-NM_001083116.1_clinical_seed_0_final,refseq-PRF1-NM_001083116.1.a2m,Invitae,refseq-PRF1-NM_001083116.1.npy,1,555,555
+NP_001077083.1,MAALLRRLLQRERPSAASGRPVGRREANLGTDAGVAVRVRFAPSPTGFLHLGGLRTALYNYIFAKKYQGSFILRLEDTDQTRVVPGAAENIEDMLEWAGIPPDESPRRGGPAGPYQQSQRLELYAQATEALLKTGAAYPCFCSPQRLELLKKEALRNHQTPRYDNRCRNMSQEQVAQKLAKDPKPAIRFRLEQVVPAFQDLVYGWNRHEVASVEGDPVIMKSDGFPTYHLACVVDDHHMGISHVLRGSEWLVSTAKHLLLYQALGWQPPHFAHLPLLLNRDGSKLSKRQGDVFLEHFAADGFLPDSLLDIITNCGSGFAENQMGRTLPELITQFNLTQVTCHSALLDLEKLPEFNRLHLQRLVSNESQRRQLVGKLQVLVEEAFGCQLQNRDVLNPVYVERILLLRQGHICRLQDLVSPVYSYLWTRPAVGRAQLDAISEKVDVIAKRVLGLLERSSMSLTQDMLNGELKKLSEGLEGTKYSNVMKLLRMALSGQQQGPPVAEMMLALGPKEVRERIQKVVSS,523,NP_001077083.1.csv,refseq-EARS2-NM_001083614.1_clinical_seed_0_final,refseq-EARS2-NM_001083614.1.a2m,Invitae,refseq-EARS2-NM_001083614.1.npy,1,523,523
+NP_001077088.2,MQKIMHISVLLSPVLWGLIFGVSSNSIQIGGLFPRGADQEYSAFRVGMVQFSTSEFRLTPHIDNLEVANSFAVTNAFCSQFSRGVYAIFGFYDKKSVNTITSFCGTLHVSFITPSFPTDGTHPFVIQMRPDLKGALLSLIEYYQWDKFAYLYDSDRGLSTLQAVLDSAAEKKWQVTAINVGNINNDKKDEMYRSLFQDLELKKERRVILDCERDKVNDIVDQVITIGKHVKGYHYIIANLGFTDGDLLKIQFGGANVSGFQIVDYDDSLVSKFIERWSTLEEKEYPGAHTTTIKYTSALTYDAVQVMTEAFRNLRKQRIEISRRGNAGDCLANPAVPWGQGVEIERALKQVQVEGLSGNIKFDQNGKRINYTINIMELKTNGPRKIGYWSEVDKMVVTLTELPSGNDTSGLENKTVVVTTILESPYVMMKKNHEMLEGNERYEGYCVDLAAEIAKHCGFKYKLTIVGDGKYGARDADTKIWNGMVGELVYGKADIAIAPLTITLVREEVIDFSKPFMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWHTEEFEDGRETQSSESTNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLSKQTEIAYGTLDSGSTKEFFRRSKIAVFDKMWTYMRSAEPSVFVRTTAEGVARVRKSKGKYAYLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGIATPKGSSLRNAVNLAVLKLNEQGLLDKLKNKWWYDKGECGSGGGDSKEKTSALSLSNVAGVFYILVGGLGLAMLVALIEFCYKSRAEAKRMKVAKNAQNINPSSSQNSQNFATYKEGYNVYGIESVKI,883,NP_001077088.2.csv,refseq-GRIA2-NM_001083619.3_clinical_seed_0_final,refseq-GRIA2-NM_001083619.3.a2m,Invitae,refseq-GRIA2-NM_001083619.3.npy,1,883,883
+NP_001077430.1,MAAVGSGGYARNDAGEKLPSVMAGVPARRGQSSPPPAPPICLRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHDMVLNVWDWKKDIVVASNKVSCRVIALSFSEDSSYFVTVGNRHVRFWFLEVSTETKVTSTVPLVGRSGILGELHNNIFCGVACGRGRMAGSTFCVSYSGLLCQFNEKRVLEKWINLKVSLSSCLCVSQELIFCGCTDGIVRIFQAHSLHYLANLPKPHYLGVDVAQGLEPSFLFHRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSELFHSSYVWNVEVYPEFEDQRACLPSGSFLTCSSDNTIRFWNLDSSPDSHWQKNIFSNTLLKVVYVENDIQHLQDMSHFPDRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKPETGLTLLASASRDRLIHVLNVEKNYNLEQTLDDHSSSITAIKFAGNRDIQMISCGADKSIYFRSAQQGSDGLHFVRTHHVAEKTTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWHLGPEITNCMKQHLLEIDHRQQQQHTNDKKRSGHPRQDTYVSTPSEIHSLSPGEQTEDDLEEECEPEEMLKTPSKDSLDPDPRCLLTNGKLPLWAKRLLGDDDVADGLAFHAKRSYQPHGRWAERAGQEPLKTILDAQDLDCYFTPMKPESLENSILDSLEPQSLASLLSESESPQEAGRGHPSFLPQQKESSEASELILYSLEAEVTVTGTDSQYCRKEVEAGPGDQQGDSYLRVSSDSPKDQSPPEDSGESEADLECSFAAIHSPAPPPDPAPRFATSLPHFPGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETLTESPCRELFPAALGDVEASEAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAWRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAQPLRRPSSVGELASLGQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSACDGLLQPPVDTQPGVTVPAVSFPAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEALPPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQALPSPGPPSPPTLYPLASPDLQALLEHYSELLVQAVRRKARGH,1523,NP_001077430.1.csv,refseq-WDR62-NM_001083961.1_clinical_seed_0_final,refseq-WDR62-NM_001083961.1.a2m,Invitae,refseq-WDR62-NM_001083961.1.npy,1,1523,1523
+NP_001077431.1,MHHQQRMAALGTDKELSDLLDFSAMFSPPVSSGKNGPTSLASGHFTGSNVEDRSSSGSWGNGGHPSPSRNYGDGTPYDHMTSRDLGSHDNLSPPFVNSRIQSKTERGSYSSYGRESNLQGCHQQSLLGGDMDMGNPGTLSPTKPGSQYYQYSSNNPRRRPLHSSAMEVQTKKVRKVPPGLPSSVYAPSASTADYNRDSPGYPSSKPATSTFPSSFFMQDGHHSSDPWSSSSGMNQPGYAGMLGNSSHIPQSSSYCSLHPHERLSYPSHSSADINSSLPPMSTFHRSGTNHYSTSSCTPPANGTDSIMANRGSGAAGSSQTGDALGKALASIYSPDHTNNSFSSNPSTPVGSPPSLSAGTAVWSRNGGQASSSPNYEGPLHSLQSRIEDRLERLDDAIHVLRNHAVGPSTAMPGGHGDMHGIIGPSHNGAMGGLGSGYGTGLLSANRHSLMVGTHREDGVALRGSHSLLPNQVPVPQLPVQSATSPDLNPPQDPYRGMPPGLQGQSVSSGSSEIKSDDEGDENLQDTKSSEDKKLDDDKKDIKSITRSRSSNNDDEDLTPEQKAEREKERRMANNARERLRVRDINEAFKELGRMVQLHLKSDKPQTKLLILHQAVAVILSLEQQVRERNLNPKAACLKRREEEKVSSEPPPLSLAGPHPGMGDASNHMGQM,671,NP_001077431.1.csv,refseq-TCF4-NM_001083962.1_clinical_seed_0_final,refseq-TCF4-NM_001083962.1.a2m,Invitae,refseq-TCF4-NM_001083962.1.npy,1,671,671
+NP_001080.2,MAVLRQLALLLWKNYTLQKRKVLVTVLELFLPLLFSGILIWLRLKIQSENVPNATIYPGQSIQELPLFFTFPPPGDTWELAYIPSHSDAAKTVTETVRRALVINMRVRGFPSEKDFEDYIRYDNCSSSVLAAVVFEHPFNHSKEPLPLAVKYHLRFSYTRRNYMWTQTGSFFLKETEGWHTTSLFPLFPNPGPREPTSPDGGEPGYIREGFLAVQHAVDRAIMEYHADAATRQLFQRLTVTIKRFPYPPFIADPFLVAIQYQLPLLLLLSFTYTALTIARAVVQEKERRLKEYMRMMGLSSWLHWSAWFLLFFLFLLIAASFMTLLFCVKVKPNVAVLSRSDPSLVLAFLLCFAISTISFSFMVSTFFSKANMAAAFGGFLYFFTYIPYFFVAPRYNWMTLSQKLCSCLLSNVAMAMGAQLIGKFEAKGMGIQWRDLLSPVNVDDDFCFGQVLGMLLLDSVLYGLVTWYMEAVFPGQFGVPQPWYFFIMPSYWCGKPRAVAGKEEEDSDPEKALRNEYFEAEPEDLVAGIKIKHLSKVFRVGNKDRAAVRDLNLNLYEGQITVLLGHNGAGKTTTLSMLTGLFPPTSGRAYISGYEISQDMVQIRKSLGLCPQHDILFDNLTVAEHLYFYAQLKGLSRQKCPEEVKQMLHIIGLEDKWNSRSRFLSGGMRRKLSIGIALIAGSKVLILDEPTSGMDAISRRAIWDLLQRQKSDRTIVLTTHFMDEADLLGDRIAIMAKGELQCCGSSLFLKQKYGAGYHMTLVKEPHCNPEDISQLVHHHVPNATLESSAGAELSFILPRESTHRFEGLFAKLEKKQKELGIASFGASITTMEEVFLRVGKLVDSSMDIQAIQLPALQYQHERRASDWAVDSNLCGAMDPSDGIGALIEEERTAVKLNTGLALHCQQFWAMFLKKAAYSWREWKMVAAQVLVPLTCVTLALLAINYSSELFDDPMLRLTLGEYGRTVVPFSVPGTSQLGQQLSEHLKDALQAEGQEPREVLGDLEEFLIFRASVEGGGFNERCLVAASFRDVGERTVVNALFNNQAYHSPATALAVVDNLLFKLLCGPHASIVVSNFPQPRSALQAAKDQFNEGRKGFDIALNLLFAMAFLASTFSILAVSERAVQAKHVQFVSGVHVASFWLSALLWDLISFLIPSLLLLVVFKAFDVRAFTRDGHMADTLLLLLLYGWAIIPLMYLMNFFFLGAATAYTRLTIFNILSGIATFLMVTIMRIPAVKLEELSKTLDHVFLVLPNHCLGMAVSSFYENYETRRYCTSSEVAAHYCKKYNIQYQENFYAWSAPGVGRFVASMAASGCAYLILLFLIETNLLQRLRGILCALRRRRTLTELYTRMPVLPEDQDVADERTRILAPSPDSLLHTPLIIKELSKVYEQRVPLLAVDRLSLAVQKGECFGLLGFNGAGKTTTFKMLTGEESLTSGDAFVGGHRISSDVGKVRQRIGYCPQFDALLDHMTGREMLVMYARLRGIPERHIGACVENTLRGLLLEPHANKLVRTYSGGNKRKLSTGIALIGEPAVIFLDEPSTGMDPVARRLLWDTVARARESGKAIIITSHSMEECEALCTRLAIMVQGQFKCLGSPQHLKSKFGSGYSLRAKVQSEGQQEALEEFKAFVDLTFPGSVLEDEHQGMVHYHLPGRDLSWAKVFGILEKAKEKYGVDDYSVSQISLEQVFLSFAHLQPPTAEEGR,1704,NP_001080.2.csv,ABCA3_HUMAN_b03_clinical_seed_0_final,ABCA3_HUMAN_b03.a2m,EVE,ABCA3_HUMAN_b03_theta_0.2.npy,1,1704,1704
+NP_001089.1,MAPYSLLVTRLQKALGVRQYHVASVLCQRAKVAMSHFEPNEYIHYDLLEKNINIVRKRLNRPLTLSEKIVYGHLDDPASQEIERGKSYLRLRPDRVAMQDATAQMAMLQFISSGLSKVAVPSTIHCDHLIEAQVGGEKDLRRAKDINQEVYNFLATAGAKYGVGFWKPGSGIIHQIILENYAYPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMAGIPWELKCPKVIGVKLTGSLSGWSSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICNMGAEIGATTSVFPYNHRMKKYLSKTGREDIANLADEFKDHLVPDPGCHYDQLIEINLSELKPHINGPFTPDLAHPVAEVGKVAEKEGWPLDIRVGLIGSCTNSSYEDMGRSAAVAKQALAHGLKCKSQFTITPGSEQIRATIERDGYAQILRDLGGIVLANACGPCIGQWDRKDIKKGEKNTIVTSYNRNFTGRNDANPETHAFVTSPEIVTALAIAGTLKFNPETDYLTGTDGKKFRLEAPDADELPKGEFDPGQDTYQHPPKDSSGQHVDVSPTSQRLQLLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAAGPWLKFRGHLDNISNNLLIGAINIENGKANSVRNAVTQEFGPVPDTARYYKKHGIRWVVIGDENYGEGSSREHAALEPRHLGGRAIITKSFARIHETNLKKQGLLPLTFADPADYNKIHPVDKLTIQGLKDFTPGKPLKCIIKHPNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ,780,NP_001089.1.csv,refseq-ACO2-NM_001098.2_clinical_seed_0_final,refseq-ACO2-NM_001098.2.a2m,Invitae,refseq-ACO2-NM_001098.2.npy,1,780,780
+NP_001091738.1,MAASQTSQTVASHVPFADLCSTLERIQKSKGRAEKIRHFREFLDSWRKFHDALHKNHKDVTDSFYPAMRLILPQLERERMAYGIKETMLAKLYIELLNLPRDGKDALKLLNYRTPTGTHGDAGDFAMIAYFVLKPRCLQKGSLTIQQVNDLLDSIASNNSAKRKDLIKKSLLQLITQSSALEQKWLIRMIIKDLKLGVSQQTIFSVFHNDAAELHNVTTDLEKVCRQLHDPSVGLSDISITLFSAFKPMLAAIADIEHIEKDMKHQSFYIETKLDGERMQMHKDGDVYKYFSRNGYNYTDQFGASPTEGSLTPFIHNAFKADIQICILDGEMMAYNPNTQTFMQKGTKFDIKRMVEDSDLQTCYCVFDVLMVNNKKLGHETLRKRYEILSSIFTPIPGRIEIVQKTQAHTKNEVIDALNEAIDKREEGIMVKQPLSIYKPDKRGEGWLKIKPEYVSGLMDELDILIVGGYWGKGSRGGMMSHFLCAVAEKPPPGEKPSVFHTLSRVGSGCTMKELYDLGLKLAKYWKPFHRKAPPSSILCGTEKPEVYIEPCNSVIVQIKAAEIVPSDMYKTGCTLRFPRIEKIRDDKEWHECMTLDDLEQLRGKASGKLASKHLYIGGDDEPQEKKRKAAPKMKKVIGIIEHLKAPNLTNVNKISNIFEDVEFCVMSGTDSQPKPDLENRIAEFGGYIVQNPGPDTYCVIAGSENIRVKNIILSNKHDVVKPAWLLECFKTKSFVPWQPRFMIHMCPSTKEHFAREYDCYGDSYFIDTDLNQLKEVFSGIKNSNEQTPEEMASLIADLEYRYSWDCSPLSMFRRHTVYLDSYAVINDLSTKNEGTRLAIKALELRFHGAKVVSCLAEGVSHVIIGEDHSRVADFKAFRRTFKRKFKILKESWVTDSIDKCELQEENQYLI,911,NP_001091738.1.csv,refseq-LIG4-NM_001098268.1_clinical_seed_0_final,refseq-LIG4-NM_001098268.1.a2m,Invitae,refseq-LIG4-NM_001098268.1.npy,1,911,911
+NP_001091981.1,MATANFGKIQIGIYVEIKRSDGRIHQAMVTSLNEDNESVTVEWIENGDTKGKEIDLESIFSLNPDLVPDEEIEPSPETPPPPASSAKVNKIVKNRRTVASIKNDPPSRDNRVVGSARARPSQFPEQSSSAQQNGSVSDISPVQAAKKEFGPPSRRKSNCVKEVEKLQEKREKRRLQQQELREKRAQDVDATNPNYEIMCMIRDFRGSLDYRPLTTADPIDEHRICVCVRKRPLNKKETQMKDLDVITIPSKDVVMVHEPKQKVDLTRYLENQTFRFDYAFDDSAPNEMVYRFTARPLVETIFERGMATCFAYGQTGSGKTHTMGGDFSGKNQDCSKGIYALAARDVFLMLKKPNYKKLELQVYATFFEIYSGKVFDLLNRKTKLRVLEDGKQQVQVVGLQEREVKCVEDVLKLIDIGNSCRTSGQTSANAHSSRSHAVFQIILRRKGKLHGKFSLIDLAGNERGADTSSADRQTRLEGAEINKSLLALKECIRALGRNKPHTPFRASKLTQVLRDSFIGENSRTCMIATISPGMASCENTLNTLRYANRVKEFGISPSDIPFSQGSGSRPDLSPSYEYDDFSPSVTRVKELTVDPTAAGDVRPIMHHPPNQIDDLETQWGVGSSPQRDDLKLLCEQNEEEVSPQLFTFHEAVSQMVEMEEQVVEDHRAVFQESIRWLEDEKALLEMTEEVDYDVDSYATQLEAILEQKIDILTELRDKVKSFRAALQEEEQASKQINPKRPRAL,744,NP_001091981.1.csv,refseq-KIF2A-NM_001098511.2_clinical_seed_0_final,refseq-KIF2A-NM_001098511.2.a2m,Invitae,refseq-KIF2A-NM_001098511.2.npy,1,744,744
+NP_001092138.1,MWLCPLALTLILMAASGAACEVKDVCVGSPGIPGTPGSHGLPGRDGRDGVKGDPGPPGPMGPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPGLPAHLDEELQATLHDFRHQILQTRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKKYNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLYSRLTICEF,248,NP_001092138.1.csv,refseq-SFTPA2-NM_001098668.3_clinical_seed_0_final,refseq-SFTPA2-NM_001098668.3.a2m,Invitae,refseq-SFTPA2-NM_001098668.3.npy,1,248,248
+NP_001092272.1,MIDSVKLRRDSAADFFSHYEYLCALQNSVPLPAVRACLREGVLDFNADRLRGVDWAPLLSTLKINKDLPLVSIKSFFQPWLGDTGSDMNKFCRSRVPAIRYKDVTFQLCKALKGCLSISSVLKNLELNGLILRERDLTILAKGLNKSASLVHLSLANCPIGDGGLEIICQGIKSSITLKTVNFTGCNLTWQGADHMAKILKYQTMRRHEETWAESLRYRRPDLDCMAGLRRITLNCNTLIGDLGACAFADSLSEDLWLRALDLQQCGLTNEGAKALLEALETNTTLVVLDIRKNPLIDHSMMKAVIKKVLQNGRSAKSEYQWITSPSVKEPSKTAKQKRRTIILGSGHKGKATIRIVGLATKKPVSSGRKHSLGKEYYAPAPLPPGVSGFLPWRTAERAKRHRGFPLIKTRDICNQLQQPGFPVTVTVESPSSSEVEEVDDSSESVHEVPEKTSIEQEALQEKLEECLKQLKEERVIRLKVDKRVSELEHENAQLRNINFSLSEALHAQSLTNMILDDEGVLGSIENSFQKFHAFLDLLKDAGLGQLATMAGIDQSDFQLLGHPQMTSTVSNPPKEEKKALEDEKPEPKQNALGQMQNIQVSICMQSAYNEGTLMKFQKITGDARIPLPLDSFPVPVSTPEGLGTSSNNLGVPATEQRQESFEGFIARMCSPSPDATSGTGSQRKEEELSRNSRSSSEKKTKTGEYTKKHSDKQHPGKDLHS,722,NP_001092272.1.csv,refseq-CEP78-NM_001098802.1_clinical_seed_0_final,refseq-CEP78-NM_001098802.1.a2m,Invitae,refseq-CEP78-NM_001098802.1.npy,1,722,722
+NP_001092690.1,MEPRESGKAPVTFDDITVYLLQEEWVLLSQQQKELCGSNKLVAPLGPTVANPELFRKFGRGPEPWLGSVQGQRSLLEHHPGKKQMGYMGEMEVQGPTRESGQSLPPQKKAYLSHLSTGSGHIEGDWAGRNRKLLKPRSIQKSWFVQFPWLIMNEEQTALFCSACREYPSIRDKRSRLIEGYTGPFKVETLKYHAKSKAHMFCVNALAARDPIWAARFRSIRDPPGDVLASPEPLFTADCPIFYPPGPLGGFDSMAELLPSSRAELEDPGGDGAIPAMYLDCISDLRQKEITDGIHSSSDINILYNDAVESCIQDPSAEGLSEEVPVVFEELPVVFEDVAVYFTREEWGMLDKRQKELYRDVMRMNYELLASLGPAAAKPDLISKLERRAAPWIKDPNGPKWGKGRPPGNKKMVAVREADTQASAADSALLPGSPVEARASCCSSSICEEGDGPRRIKRTYRPRSIQRSWFGQFPWLVIDPKETKLFCSACIERPNLHDKSSRLVRGYTGPFKVETLKYHEVSKAHRLCVNTVEIKEDTPHTALVPEISSDLMANMEHFFNAAYSIAYHSRPLNDFEKILQLLQSTGTVILGKYRNRTACTQFIKYISETLKREILEDVRNSPCVSVLLDSSTDASEQACVGIYIRYFKQMEVKESYITLAPLYSETADGYFETIVSALDELDIPFRKPGWVVGLGTDGSAMLSCRGGLVEKFQEVIPQLLPVHCVAHRLHLAVVDACGSIDLVKKCDRHIRTVFKFYQSSNKRLNELQEGAAPLEQEIIRLKDLNAVRWVASRRRTLHALLVSWPALARHLQRVAEAGGQIGHRAKGMLKLMRGFHFVKFCHFLLDFLSIYRPLSEVCQKEIVLITEVNATLGRAYVALESLRHQAGPKEEEFNASFKDGRLHGICLDKLEVAEQRFQADRERTVLTGIEYLQQRFDADRPPQLKNMEVFDTMAWPSGIELASFGNDDILNLARYFECSLPTGYSEEALLEEWLGLKTIAQHLPFSMLCKNALAQHCRFPLLSKLMAVVVCVPISTSCCERGFKAMNRIRTDERTKLSNEVLNMLMMTAVNGVAVTEYDPQPAIQHWYLTSSGRRFSHVYTCAQVPARSPASARLRKEEMGALYVEEPRTQKPPILPSREAAEVLKDCIMEPPERLLYPHTSQEAPGMS,1169,NP_001092690.1.csv,refseq-ZNF862-NM_001099220.1_clinical_seed_0_final,refseq-ZNF862-NM_001099220.1.a2m,Invitae,refseq-ZNF862-NM_001099220.1.npy,1,1169,1169
+NP_001092692.1,MASNHKSSAARPVSRGGVGLTGRPPSGIRPLSGNIRVATAMPPGTARPGSRGCPIGTGGVLSSQIKVAHRPVTQQGLTGMKTGTKGPQRQILDKSYYLGLLRSKISELTTEVNKLQKGIEMYNQENSVYLSYEKRAETLAVEIKELQGQLADYNMLVDKLNTNTEMEEVMNDYNMLKAQNDRETQSLDVIFTERQAKEKQIRSVEEEIEQEKQATDDIIKNMSFENQVKYLEMKTTNEKLLQELDTLQQQLDSQNMKKESLEAEIAHSQVKQEAVLLHEKLYELESHRDQMIAEDKSIGSPMEEREKLLKQIKDDNQEIASMERQLTDTKEKINQFIEEIRQLDMDLEEHQGEMNQKYKELKKREEHMDTFIETFEETKNQELKRKAQIEANIVALLEHCSRNINRIEQISSITNQELKMMQDDLNFKSTEVQKSQSTAQNLTSDIQRLQLDLQKMELLESKMTEEQHSLKSKIKQMTTDLEIYNDLPALKSSGEEKIKKLHQERMILSTHRNAFKKIMEKQNIEYEALKTQLQENETHSQLTNLERKWQHLEQNNFAMKEFIATKSQESDYQPIKKNVTKQIAEYNKTIVDALHSTSGN,600,NP_001092692.1.csv,refseq-IFT74-NM_001099222.1_clinical_seed_0_final,refseq-IFT74-NM_001099222.1.a2m,Invitae,refseq-IFT74-NM_001099222.1.npy,1,600,600
+NP_001092744.1,MATPLVAGPAALRFAAAASWQVVRGRCVEHFPRVLEFLRSLRAVAPGLVRYRHHERLCMGLKAKVVVELILQGRPWAQVLKALNHHFPESGPIVRDPKATKQDLRKILEAQETFYQQVKQLSEAPVDLASKLQELEQEYGEPFLAAMEKLLFEYLCQLEKALPTPQAQQLQDVLSWMQPGVSITSSLAWRQYGVDMGWLLPECSVTDSVNLAEPMEQNPPQQQRLALHNPLPKAKPGTHLPQGPSSRTHPEPLAGRHFNLAPLGRRRVQSQWASTRGGHKERPTVMLFPFRNLGSPTQVISKPESKEEHAIYTADLAMGTRAASTGKSKSPCQTLGGRALKENPVDLPATEQKENCLDCYMDPLRLSLLPPRARKPVCPPSLCSSVITIGDLVLDSDEEENGQGEGKESLENYQKTKFDTLIPTLCEYLPPSGHGAIPVSSCDCRDSSRPL,451,NP_001092744.1.csv,refseq-TINF2-NM_001099274.1_clinical_seed_0_final,refseq-TINF2-NM_001099274.1.a2m,Invitae,refseq-TINF2-NM_001099274.1.npy,1,451,451
+NP_001093326.2,MALVIQVGKLRPREVRTPQTINPSLFPSLPVKLSSIIEVPSGGERCCSRRTLVYKARAFWKGAPLPCWMNRHLWKSQLCEMVQPSGGPAADQDVLGEESPLGKPAMLHLPSEQGAPETLQRCLEENQELRDAIRQSNQILRERCEELLHFQASQREEKEFLMCKFQEARKLVERLGLEKLDLKRQKEQALREVEHLKRCQQQMAEDKASVKAQVTSLLGELQESQSRLEAATKECQALEGRARAASEQARQLESEREALQQQHSVQVDQLRMQGQSVEAALRMERQAASEEKRKLAQLQVAYHQLFQEYDNHIKSSVVGSERKRGMQLEDLKQQLQQAEEALVAKQEVIDKLKEEAEQHKIVMETVPVLKAQADIYKADFQAERQAREKLAEKKELLQEQLEQLQREYSKLKASCQESARIEDMRKRHVEVSQAPLPPAPAYLSSPLALPSQRRSPPEEPPDFCCPKCQYQAPDMDTLQIHVMECIE,487,NP_001093326.2.csv,refseq-IKBKG-NM_001099856.2_clinical_seed_0_final,refseq-IKBKG-NM_001099856.2.a2m,Invitae,refseq-IKBKG-NM_001099856.2.npy,1,487,487
+NP_001094383.2,MAERGRLGLPGAPGALNTPVPMNLFATWEVDGSSPSCVPRLCSLTLKKLVVFKELEKELISVVIAVKMQGSKRILRSHEIVLPPSGQVETDLALTFSLQYPHFLKREGNKLQIMLQRRKRYKNRTILGYKTLAAGSISMAEVMQHPSEGGQVLSLCSSIKEAPVKAAEIWIASLSSQPIDHEDSTMQAGPKAKSTDNYSEEEYESFSSEQEASDDAVQGQDLDEDDFDVGKPKKQRRSIVRTTSMTRQQNFKQKVVALLRRFKVSDEVLDSEQDPAEHIPEAEEDLDLLYDTLDMEHPSDSGPDMEDDDSVLSTPKPKLRPYFEGLSHSSSQTEIGSIHSARSHKEPPSPADVPEKTRSLGGRQPSDSVSDTVALGVPGPREHPGQPEDSPEAEASTLDVFTERLPPSGRITKTESLVIPSTRSEGKQAGRRGRSTSLKERQAARPQNERANSLDNERCPDARSQLQVQLQIPRKTVYDQLNHILISDDQLPENIILVNTSDWQGQFLSDVLQRHTLPVVCTCSPADVQAAFSTIVSRIQRYCNCNSQPPTPVKIAVAGAQHYLSAILRLFVEQLSHKTPDWLGYMRFLVIPLGSHPVARYLGSVDYRYNNFFQDLAWRDLFNKLEAQSAVQDTPDIVSRITQYIAGANCAHQLPIAEAMLTYKQKRKKHFHFDFTLSPDEESSQKFIPFVGVVKVGIVEPSSATSGDSDDAAPSGSGTLSSTPPSASPAAKEASPTPPSSPSVSGGLSSPSQGVGAELMGLQVDYWTAAQPADRKRDAEKKDLPVTKNTLKCTFRSLQVSRLPSSGEAAATPTMSMTVVTKEKNKKVMFLPKKAKDKDVESKSQCIEGISRLICTARQQQNMLRVLIDGVECSDVKFFQLAAQWSSHVKHFPICIFGHSKATF,904,NP_001094383.2.csv,refseq-PACS2-NM_001100913.2_clinical_seed_0_final,refseq-PACS2-NM_001100913.2.a2m,Invitae,refseq-PACS2-NM_001100913.2.npy,1,904,904
+NP_001094385.1,MEESGMAHESAEDLFHFNVGGWHFSVPRSKLSQFPDSLLWKEASALTSSESQRLFIDRDGSTFRHVHYYLYTSKLSFSSCAELNLLYEQALGLQLMPLLQTLDNLKEGKHHLRVRPADLPVAERASLNYWRTWKCISKPSEFPIKSPAFTGLHDKAPLGLMDTPLLDTEEEVHYCFLPLDLVAKYPSLVTEDNLLWLAETVALIECECSEFRFIVNFLRSQKILLPDNFSNIDVLEAEVEILEIPALTEAVRWYRMNMGGCSPTTCSPLSPGKGARTASLESVKPLYTMALGLLVKYPDSALGQLRIESTLDGSRLYITGNGVLFQHVKNWLGTCRLPLTETISEVYELCAFLDKRDITYEPIKVALKTHLEPRTLAPMDVLNEWTAEITVYSPQQIIKVYVGSHWYATTLQTLLKYPELLSNPQRVYWITYGQTLLIHGDGQMFRHILNFLRLGKLFLPSEFKEWPLFCQEVEEYHIPSLSEALAQCEAYKSWTQEKESENEEAFSIRRLHVVTEGPGSLVEFSRDTKETTAYMPVDFEDCSDRTPWNKAKGNLVRSNQMDEAEQYTRPIQVSLCRNAKRAGNPSTYSHCRGLCTNPGHWGSHPESPPKKKCTTINLTQKSETKDPPATPMQKLISLVREWDMVNCKQWEFQPLTATRSSPLEEATLQLPLGSEAASQPSTSAAWKAHSTASEKDPGPQAGAGAGAKDKGPEPTFKPYLPPKRAGTLKDWSKQRTKERESPAPEQPLPEASEVDSLGVILKVTHPPVVGSDGFCMFFEDSIIYTTEMDNLRHTTPTASPQPQEVTFLSFSLSWEEMFYAQKCHCFLADIIMDSIRQKDPKAITAKVVSLANRLWTLHISPKQFVVDLLAITGFKDDRHTQERLYSWVELTLPFARKYGRCMDLLIQRGLSRSVSYSILGKYLQED,926,NP_001094385.1.csv,refseq-KCTD19-NM_001100915.1_clinical_seed_0_final,refseq-KCTD19-NM_001100915.1.a2m,Invitae,refseq-KCTD19-NM_001100915.1.npy,1,926,926
+NP_001094832.1,MARGPQTLVQVWVGGQLFQADRALLVEHCGFFRGLFRSGMRETRAAEVRLGVLSAGGFRATLQVLRGDRPALAAEDELLQAVECAAFLQAPALARFLEHNLTSDNCALLCDAAAAFGLRDVFHSAALFICDGERELAAELALPEARAYVAALRPSSYAAVSTHTPAPGFLEDASRTLCYLDEEEDAWRTLAALPLEASTLLAGVATLGNKLYIVGGVRGASKEVVELGFCYDPDGGTWHEFPSPHQPRYDTALAGFDGRLYAIGGEFQRTPISSVERYDPAAGCWSFVADLPQPAAGVPCAQACGRLFVCLWRPADTTAVVEYAVRTDAWLPVAELRRPQSYGHCMVAHRDSLYVVRNGPSDDFLHCAIDCLNLATGQWTALPGQFVNSKGALFTAVVRGDTVYTVNRMFTLLYAIEGGTWRLLREKAGFPRPGSLQTFLLRLPPGAPGPVTSTTAEL,458,NP_001094832.1.csv,refseq-KBTBD13-NM_001101362.2_clinical_seed_0_final,refseq-KBTBD13-NM_001101362.2.a2m,Invitae,refseq-KBTBD13-NM_001101362.2.npy,1,458,458
+NP_001094896.1,MEAGPPGSARPAEPGPCLSGQRGADHTASASLQSVAGTEPGRHPQAVAAVLPAGGCGERMGVPTPKQFCPILERPLISYTLQALERVCWIKDIVVAVTGENMEVMKSIIQKYQHKRISLVEAGVTRHRSIFNGLKALAEDQINSKLSKPEVVIIHDAVRPFVEEGVLLKVVTAAKEHGAAGAIRPLVSTVVSPSADGCLDYSLERARHRASEMPQAFLFDVIYEAYQQCSDYDLEFGTECLQLALKYCCTKAKLVEGSPDLWKVTYKRDLYAAESIIKERISQEICVVMDTEEDNKHVGHLLEEVLKSELNHVKVTSEALGHAGRHLQQIILDQCYNFVCVNVTTSDFQETQKLLSMLEESSLCILYPVVVVSVHFLDFKLVPPSQKMENLMQIREFAKEVKERNILLYGLLISYPQDDQKLQESLRQGAIIIASLIKERNSGLIGQLLIA,451,NP_001094896.1.csv,refseq-ISPD-NM_001101426.3_clinical_seed_0_final,refseq-ISPD-NM_001101426.3.a2m,Invitae,refseq-ISPD-NM_001101426.3.npy,1,451,451
+NP_001095876.1,MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRDTGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDRSARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAPEGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWVDPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLCILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKMRKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHLHQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNTKVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLTRPSMEKPCT,1244,NP_001095876.1.csv,refseq-ALPK1-NM_001102406.1_clinical_seed_0_final,refseq-ALPK1-NM_001102406.1.a2m,Invitae,refseq-ALPK1-NM_001102406.1.npy,1,1244,1244
+NP_001096.1,MVDGVMILPVLIMIALPSPSMEDEKPKVNPKLYMCVCEGLSCGNEDHCEGQQCFSSLSINDGFHVYQKGCFQVYEQGKMTCKTPPSPGQAVECCQGDWCNRNITAQLPTKGKSFPGTQNFHLEVGLIILSVVFAVCLLACLLGVALRKFKRRNQERLNPRDVEYGTIEGLITTNVGDSTLADLLDHSCTSGSGSGLPFLVQRTVARQITLLECVGKGRYGEVWRGSWQGENVAVKIFSSRDEKSWFRETELYNTVMLRHENILGFIASDMTSRHSSTQLWLITHYHEMGSLYDYLQLTTLDTVSCLRIVLSIASGLAHLHIEIFGTQGKPAIAHRDLKSKNILVKKNGQCCIADLGLAVMHSQSTNQLDVGNNPRVGTKRYMAPEVLDETIQVDCFDSYKRVDIWAFGLVLWEVARRMVSNGIVEDYKPPFYDVVPNDPSFEDMRKVVCVDQQRPNIPNRWFSDPTLTSLAKLMKECWYQNPSARLTALRIKKTLTKIDNSLDKLKTDC,509,NP_001096.1.csv,refseq-ACVR1-NM_001105.4_clinical_seed_0_final,refseq-ACVR1-NM_001105.4.a2m,Invitae,refseq-ACVR1-NM_001105.4.npy,1,509,509
+NP_001097.2,MTAPWVALALLWGSLCAGSGRGEAETRECIYYNANWELERTNQSGLERCEGEQDKRLHCYASWRNSSGTIELVKKGCWLDDFNCYDRQECVATEENPQVYFCCCEGNFCNERFTHLPEAGGPEVTYEPPPTAPTLLTVLAYSLLPIGGLSLIVLLAFWMYRHRKPPYGHVDIHEDPGPPPPSPLVGLKPLQLLEIKARGRFGCVWKAQLMNDFVAVKIFPLQDKQSWQSEREIFSTPGMKHENLLQFIAAEKRGSNLEVELWLITAFHDKGSLTDYLKGNIITWNELCHVAETMSRGLSYLHEDVPWCRGEGHKPSIAHRDFKSKNVLLKSDLTAVLADFGLAVRFEPGKPPGDTHGQVGTRRYMAPEVLEGAINFQRDAFLRIDMYAMGLVLWELVSRCKAADGPVDEYMLPFEEEIGQHPSLEELQEVVVHKKMRPTIKDHWLKHPGLAQLCVTIEECWDHDAEARLSAGCVEERVSLIRRSVNGTTSDCLVSLVTSVTNVDLPPKESSI,512,NP_001097.2.csv,refseq-ACVR2B-NM_001106.3_clinical_seed_0_final,refseq-ACVR2B-NM_001106.3.a2m,Invitae,refseq-ACVR2B-NM_001106.3.npy,1,512,512
+NP_001098101.1,MEAEGSSAPARAGSGEGSDSAGGATLKAPKHLWRHEQHHQYPLRQPQFRLLHPHHHLPPPPPPSPQPQPQCPLQPPPPPPLPPPPPPPGAARGRYASSGATGRVRHRGYSDTERYLYCRAMDRTSYAVETGHRPGLKKSRMSWPSSFQGLRRFDVDNGTSAGRSPLDPMTSPGSGLILQANFVHSQRRESFLYRSDSDYDLSPKSMSRNSSIASDIHGDDLIVTPFAQVLASLRTVRNNFAALTNLQDRAPSKRSPMCNQPSINKATITEEAYQKLASETLEELDWCLDQLETLQTRHSVSEMASNKFKRMLNRELTHLSEMSRSGNQVSEFISNTFLDKQHEVEIPSPTQKEKEKKKRPMSQISGVKKLMHSSSLTNSSIPRFGVKTEQEDVLAKELEDVNKWGLHVFRIAELSGNRPLTVIMHTIFQERDLLKTFKIPVDTLITYLMTLEDHYHADVAYHNNIHAADVVQSTHVLLSTPALEAVFTDLEILAAIFASAIHDVDHPGVSNQFLINTNSELALMYNDSSVLENHHLAVGFKLLQEENCDIFQNLTKKQRQSLRKMVIDIVLATDMSKHMNLLADLKTMVETKKVTSSGVLLLDNYSDRIQVLQNMVHCADLSNPTKPLQLYRQWTDRIMEEFFRQGDRERERGMEISPMCDKHNASVEKSQVGFIDYIVHPLWETWADLVHPDAQDILDTLEDNREWYQSTIPQSPSPAPDDPEEGRQGQTEKFQFELTLEEDGESDTEKDSGSQVEEDTSCSDSKTLCTQDSESTEIPLDEQVEEEAVGEEEESQPEACVIDDRSPDT,809,NP_001098101.1.csv,refseq-PDE4D-NM_001104631.1_clinical_seed_0_final,refseq-PDE4D-NM_001104631.1.a2m,Invitae,refseq-PDE4D-NM_001104631.1.npy,1,809,809
+NP_001098548.2,MKSEDYPHETMAPDIHEERQYRCEDCDQLFESKAELADHQKFPCSTPHSAFSMVEEDFQQKLESENDLQEIHTIQECKECDQVFPDLQSLEKHMLSHTEEREYKCDQCPKAFNWKSNLIRHQMSHDSGKHYECENCAKVFTDPSNLQRHIRSQHVGARAHACPECGKTFATSSGLKQHKHIHSSVKPFICEVCHKSYTQFSNLCRHKRMHADCRTQIKCKDCGQMFSTTSSLNKHRRFCEGKNHFAAGGFFGQGISLPGTPAMDKTSMVNMSHANPGLADYFGANRHPAGLTFPTAPGFSFSFPGLFPSGLYHRPPLIPASSPVKGLSSTEQTNKSQSPLMTHPQILPATQDILKALSKHPSVGDNKPVELQPERSSEERPFEKISDQSESSDLDDVSTPSGSDLETTSGSDLESDIESDKEKFKENGKMFKDKVSPLQNLASINNKKEYSNHSIFSPSLEEQTAVSGAVNDSIKAIASIAEKYFGSTGLVGLQDKKVGALPYPSMFPLPFFPAFSQSMYPFPDRDLRSLPLKMEPQSPGEVKKLQKGSSESPFDLTTKRKDEKPLTPVPSKPPVTPATSQDQPLDLSMGSRSRASGTKLTEPRKNHVFGGKKGSNVESRPASDGSLQHARPTPFFMDPIYRVEKRKLTDPLEALKEKYLRPSPGFLFHPQFQLPDQRTWMSAIENMAEKLESFSALKPEASELLQSVPSMFNFRAPPNALPENLLRKGKERYTCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNKEKPFKCHLCDRCFGQQTNLDRHLKKHENGNMSGTATSSPHSELESTGAILDDKEDAYFTEIRNFIGNSNHGSQSPRNVEERMNGSHFKDEKALVTSQNSDLLDDEEVEDEVLLDEEDEDNDITGKTGKEPVTSNLHEGNPEDDYEETSALEMSCKTSPVRYKEEEYKSGLSALDHIRHFTDSLKMRKMEDNQYSEAELSSFSTSHVPEELKQPLHRKSKSQAYAMMLSLSDKESLHSTSHSSSNVWHSMARAAAESSAIQSISHV,1051,NP_001098548.2.csv,refseq-MECOM-NM_001105078.3_clinical_seed_0_final,refseq-MECOM-NM_001105078.3.a2m,Invitae,refseq-MECOM-NM_001105078.3.npy,1,1051,1051
+NP_001098717.1,MAAAKPTLTDSLSFCLAQLAAAAGEALGGEKDPATNETPLSRALLALRTRHIKAAGGIERFRARGGLRPLLALLRRAAAAGSAPSQAGPGSAPSSAASGASSPAPASGPAPSAVSSSSPTPPVRLRKTLDLALSILADCCTEGACRTEVRRLGGILPLVTILQCMKTDSIQNRTARALGNLAMEPESCGDIHCAGAVPLLVESLTACQDSQCLQSVVRALRNLADSPQHRLALAQQGAVRPLAELLATAPDAALTLALVRALLELSRGCSRACAEQLSLGGGLGPLVSLASHPKRAVREGTILILANLCAQGLIRPALGNAGGVEVLVDELRQRRDPNGASPTSQQPLVRAVCLLCREAINRARLRDAGGLDLLMGLLRDPRASAWHPRIVAALVGFLYDTGALGRLQALGLVPLLAGQLCGEAGEEEEEGREAASWDFPEERTPERAQGGSFRSLRSWLISEGYATGPDDISPDWSPEQCPPEPMEPASPAPTPTSLRAPRTQRTPGRSPAAAIEEPWGREGPALLLLSRFSQAPDPSGALVTGPALYGLLTYVTGAPGPPSPRALRILSRLTCNPACLEAFVRSYGAALLRAWLVLGVAPDDWPAPRARPTLHSRHRELGERLLQNLTVQAESPFGVGALTHLLLSGSPEDRVACALTLPFICRKPSLWRRLLLEQGGLRLLLAALTRPAPHPLFLFFAADSLSCLQDLVSPTVSPAVPQAVPMDLDSPSPCLYEPLLGPAPVPAPDLHFLLDSGLQLPAQRAASATASPFFRALLSGSFAEAQMDLVPLRGLSPGAAWPVLHHLHGCRGCGAALGPVPPPGQPLLGSEAEEALEAAGRFLLPGLEEELEEAVGRIHLGPQGGPESVGEVFRLGRPRLAAHCARWTLGSEQCPRKRGLALVGLVEAAGEEAGPLTEALLAVVMGIELGARVPA,935,NP_001098717.1.csv,refseq-ARMC5-NM_001105247.1_clinical_seed_0_final,refseq-ARMC5-NM_001105247.1.a2m,Invitae,refseq-ARMC5-NM_001105247.1.npy,1,935,935
+NP_001103279.2,MFEQLKPIEPVQKTLPWVGEVAATLQEAMKRDCWREARVKKKPVTFEDVAVNFTQEEWDCLDASQRVLYQDVMSETFKNLTSVARIFLHKPELITKLEQEEEQWREFVHLPNTEGLSEGKKKELREQHPSLRDEGTSDDKVFLACRGAGQCPLSAPAGTMDRTRVLQASQAGPPFFCYTCGKCFSRRSYLYSHQFVHNPKLTNSCSQCGKLFRSPKSLSYHRRMHLGERPFCCTLCDKTYCDASGLSRHRRVHLGYRPHSCSVCGKSFRDQSELKRHQKIHQNQEPVDGNQECTLRIPGTQAEFQTPIARSQRSIQGLLDVNHAPVARSQEPIFRTEGPMAQNQASVLKNQAPVTRTQAPITGTLCQDARSNSHPVKPSRLNVFCCPHCSLTFSKKSYLSRHQKAHLTEPPNYCFHCSKSFSSFSRLVRHQQTHWKQKSYLCPICDLSFGEKEGLMDHWRGYKGKDLCQSSHHKCRVILGQWLGFSHDVPTMAGEEWKHGGDQSPPRIHTPRRRGLREKACKGDKTKEAVSILKHK,536,NP_001103279.2.csv,refseq-ZFP57-NM_001109809.2_clinical_seed_0_final,refseq-ZFP57-NM_001109809.2.a2m,Invitae,refseq-ZFP57-NM_001109809.2.npy,1,536,536
+NP_001104262.1,MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHEPVQPSAHHSAEPAEAGKAETSEGSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPGSVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVSIEVKEVVKPLLVSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSSASSPPKKEHHHHHHHSESPKAPVPLLPPLPPPPPEPESSEDPTSPPEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVATAATAAEKYKHRGEGERKDIVSSSMPRPNREEPVDSRTPVTERVS,498,NP_001104262.1.csv,refseq-MECP2-NM_001110792.1_clinical_seed_0_final,refseq-MECP2-NM_001110792.1.a2m,Invitae,refseq-MECP2-NM_001110792.1.npy,1,498,498
+NP_001104505.1,MDMWTALLILQALLLPSLADGATPALRFVAVGDWGGVPNAPFHTAREMANAKEIARTVQILGADFILSLGDNFYFTGVQDINDKRFQETFEDVFSDRSLRKVPWYVLAGNHDHLGNVSAQIAYSKISKRWNFPSPFYRLHFKIPQTNVSVAIFMLDTVTLCGNSDDFLSQQPERPRDVKLARTQLSWLKKQLAAAREDYVLVAGHYPVWSIAEHGPTHCLVKQLRPLLATYGVTAYLCGHDHNLQYLQDENGVGYVLSGAGNFMDPSKRHQRKVPNGYLRFHYGTEDSLGGFAYVEISSKEMTVTYIEASGKSLFKTRLPRRARP,325,NP_001104505.1.csv,refseq-ACP5-NM_001111035.2_clinical_seed_0_final,refseq-ACP5-NM_001111035.2.a2m,Invitae,refseq-ACP5-NM_001111035.2.npy,1,325,325
+NP_001104595.1,MEAGSGPPGGPGSESPNRAVEYLLELNNIIESQQQLLETQRRRIEELEGQLDQLTQENRDLREESQLHRGELHRDPHGARDSPGRESQYQNLRETQFHHRELRESQFHQAARDVGYPNREGAYQNREAVYRDKERDASYPLQDTTGYTARERDVAQCHLHHENPALGRERGGREAGPAHPGREKEAGYSAAVGVGPRPPRERGQLSRGASRSSSPGAGGGHSTSTSTSPATTLQRKSDGENSRTVSVEGDAPGSDLSTAVDSPGSQPPYRLSQLPPSSSHMGGPPAGVGLPWAQRARLQPASVALRKQEEEEIKRSKALSDSYELSTDLQDKKVEMLERKYGGSFLSRRAARTIQTAFRQYRMNKNFERLRSSASESRMSRRIILSNMRMQFSFEEYEKAQNPAYFEGKPASLDEGAMAGARSHRLERGLPYGGSCGGGIDGGGSSVTTSGEFSNDITELEDSFSKQVKSLAESIDEALNCHPSGPMSEEPGSAQLEKRESKEQQEDSSATSFSDLPLYLDDTVPQQSPERLPSTEPPPQGRPEFWAPAPLPPVPPPVPSGTREDGSREEGTRRGPGCLECRDFRLRAAHLPLLTIEPPSDSSVDLSDRSDRGSVHRQLVYEADGCSPHGTLKHKGPPGRAPIPHRHYPAPEGPAPAPPGPLPPAPNSGTGPSGVAGGRRLGKCEAAGENSDGGDNESLESSSNSNETINCSSGSSSRDSLREPPATGLCKQTYQRETRHSWDSPAFNNDVVQRRHYRIGLNLFNKKPEKGIQYLIERGFLSDTPVGVAHFILERKGLSRQMIGEFLGNRQKQFNRDVLDCVVDEMDFSSMDLDDALRKFQSHIRVQGEAQKVERLIEAFSQRYCVCNPALVRQFRNPDTIFILAFAIILLNTDMYSPSVKAERKMKLDDFIKNLRGVDNGEDIPRDLLVGIYQRIQGRELRTNDDHVSQVQAVERMIVGKKPVLSLPHRRLVCCCQLYEVPDPNRPQRLGLHQREVFLFNDLLVVTKIFQKKKILVTYSFRQSFPLVEMHMQLFQNSYYQFGIKLLSAVPGGERKVLIIFNAPSLQDRLRFTSDLRESIAEVQEMEKYRVESELEKQKGMMRPNASQPGGAKDSVNGTMARSSLEDTYGAGDGLKRGALSSSLRDLSDAGKRGRRNSVGSLDSTIEGSVISSPRPHQRMPPPPPPPPPEEYKSQRPVSNSSSFLGSLFGSKRGKGPFQMPPPPTGQASASSSSASSTHHHHHHHHHGHSHGGLGVLPDGQSKLQALHAQYCQGPGPAPPPYLPPQQPSLPPPPQQPPPLPQLGSIPPPPASAPPVGPHRHFHAHGPVPGPQHYTLGRPGRAPRRGAGGHPQFAPHGRHPLHQPTSPLPLYSPAPQHPPAHKQGPKHFIFSHHPQMMPAAGAAGGPGSRPPGGSYSHPHHPQSPLSPHSPIPPHPSYPPLPPPSPHTPHSPLPPTSPHGPLHASGPPGTANPPSANPKAKPSRISTVV,1488,NP_001104595.1.csv,refseq-IQSEC2-NM_001111125.2_clinical_seed_0_final,refseq-IQSEC2-NM_001111125.2.a2m,Invitae,refseq-IQSEC2-NM_001111125.2.npy,1,1488,1488
+NP_001106212.1,MGQGDESERIVINVGGTRHQTYRSTLRTLPGTRLAWLAEPDAHSHFDYDPRADEFFFDRHPGVFAHILNYYRTGKLHCPADVCGPLYEEELAFWGIDETDVEPCCWMTYRQHRDAEEALDSFGGAPLDNSADDADADGPGDSGDGEDELEMTKRLALSDSPDGRPGGFWRRWQPRIWALFEDPYSSRYARYVAFASLFFILVSITTFCLETHERFNPIVNKTEIENVRNGTQVRYYREAETEAFLTYIEGVCVVWFTFEFLMRVIFCPNKVEFIKNSLNIIDFVAILPFYLEVGLSGLSSKAAKDVLGFLRVVRFVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALGVLIFATMIYYAERIGAQPNDPSASEHTHFKNIPIGFWWAVVTMTTLGYGDMYPQTWSGMLVGALCALAGVLTIAMPVPVIVNNFGMYYSLAMAKQKLPKKKKKHIPRPPQLGSPNYCKSVVNSPHHSTQSDTCPLAQEEILEINRADSKLNGEVAKAALANEDCPHIDQALTPDEGLPFTRSGTRERYGPCFLLSTGEYACPPGGGMRKDLCKESPVIAKYMPTEAVRVT,585,NP_001106212.1.csv,refseq-KCNC1-NM_001112741.1_clinical_seed_0_final,refseq-KCNC1-NM_001112741.1.a2m,Invitae,refseq-KCNC1-NM_001112741.1.npy,1,585,585
+NP_001106279.3,MAAARDPPEVSLREATQRKLRRFSELRGKLVARGEFWDIVAITAADEKQELAYNQQLSEKLKRKELPLGVQYHVFVDPAGAKIGNGGSTLCALQCLEKLYGDKWNSFTILLIHSDEWKKKVSESYVITIERLEDDLQIKEKELTELRNIFGSDEAFSKVNLNYRTENGLSLLHLCCICGGKKSHIRTLMLKGLRPSRLTRNGFTALHLAVYKDNAELITSLLHSGADIQQVGYGGLTALHIATIAGHLEAADVLLQHGANVNIQDAVFFTPLHIAAYYGHEQVTRLLLKFGADVNVSGEVGDRPLHLASAKGFLNIAKLLMEEGSKADVNAQDNEDHVPLHFCSRFGHHDIVKYLLQSDLEVQPHVVNIYGDTPLHLACYNGKFEVAKEIIQISGTESLTKENIFSETAFHSACTYGKSIDLVKFLLDQNVININHQGRDGHTGLHSACYHGHIRLVQFLLDNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPGGDGSYVSVPSPLGKIKSMTKEKADILLLRAGLPSHFHLQLSEIEFHEIIGSGSFGKVYKGRCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAIVTQYISGGSLFSLLHEQKRILDLQSKLIIAVDVAKGMEYLHNLTQPIIHRDLNSHNILLYEDGHAVVADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILTGEIPFAHLKPAAAAADMAYHHIRPPIGYSIPKPISSLLIRGWNACPEGRPEFSEVVMKLEECLCNIELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAALSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS,936,NP_001106279.3.csv,NP_001106279.3_clinical_seed_0_final,NP_001106279.3.a2m,popEVE,NP_001106279.3_theta_0.2.npy,1,936,936
+NP_001106962.1,MRNSEEQPSGGTTVLQRLLQEQLRYGNPSENRSLLAIHQQATGNGPPFPSGSGNPGPQSDVLSPQDHHQQLVAHAARQEPQGQEIQSENLIMEKQLSPRMQNNEELPTYEEAKVQSQYFRGQQHASVGAAFYVTGVTNQKMRTEGRPSVQRLNPGKMHQDEGLRDLKQGHVRSLSERLMQMSLATSGVKAHPPVTSAPLSPPQPNDLYKNPTSSSEFYKAQGPLPNQHSLKGMEHRGPPPEYPFKGMPPQSVVCKPQEPGHFYSEHRLNQPGRTEGQLMRYQHPPEYGAARPAQDISLPLSARNSQPHSPTSSLTSGGSLPLLQSPPSTRLSPARHPLVPNQGDHSAHLPRPQQHFLPNQAHQGDHYRLSQPGLSQQQQQQQQQHHHHHHHQQQQQQQPQQQPGEAYSAMPRAQPSSASYQPVPADPFAIVSRAQQMVEILSDENRNLRQELEGCYEKVARLQKVETEIQRVSEAYENLVKSSSKREALEKAMRNKLEGEIRRMHDFNRDLRERLETANKQLAEKEYEGSEDTRKTISQLFAKNKESQREKEKLEAELATARSTNEDQRRHIEIRDQALSNAQAKVVKLEEELKKKQVYVDKVEKMQQALVQLQAACEKREQLEHRLRTRLERELESLRIQQRQGNCQPTNVSEYNAAALMELLREKEERILALEADMTKWEQKYLEENVMRHFALDAAATVAAQRDTTVISHSPNTSYDTALEARIQKEEEEILMANKRCLDMEGRIKTLHAQIIEKDAMIKVLQQRSRKEPSKTEQLSCMRPAKSLMSISNAGSGLLSHSSTLTGSPIMEEKRDDKSWKGSLGILLGGDYRAEYVPSTPSPVPPSTPLLSAHSKTGSRDCSTQTERGTESNKTAAVAPISVPAPVAAAATAAAITATAATITTTMVAAAPVAVAAAAAPAAAAAPSPATAAATAAAVSPAAAGQIPAAASVASAAAVAPSAAAAAAVQVAPAAPAPVPAPALVPVPAPAAAQASAPAQTQAPTSAPAVAPTPAPTPTPAVAQAEVPASPATGPGPHRLSIPSLTCNPDKTDGPVFHSNTLERKTPIQILGQEPDAEMVEYLI,1084,NP_001106962.1.csv,refseq-AMOT-NM_001113490.1_clinical_seed_0_final,refseq-AMOT-NM_001113490.1.a2m,Invitae,refseq-AMOT-NM_001113490.1.npy,1,1084,1084
+NP_001107655.1,MQHIFAFFCTGFLGAVVGANFPNNIQIGGLFPNQQSQEHAAFRFALSQLTEPPKLLPQIDIVNISDSFEMTYRFCSQFSKGVYAIFGFYERRTVNMLTSFCGALHVCFITPSFPVDTSNQFVLQLRPELQDALISIIDHYKWQKFVYIYDADRGLSVLQKVLDTAAEKNWQVTAVNILTTTEEGYRMLFQDLEKKKERLVVVDCESERLNAILGQIIKLEKNGIGYHYILANLGFMDIDLNKFKESGANVTGFQLVNYTDTIPAKIMQQWKNSDARDHTRVDWKRPKYTSALTYDGVKVMAEAFQSLRRQRIDISRRGNAGDCLANPAVPWGQGIDIQRALQQVRFEGLTGNVQFNEKGRRTNYTLHVIEMKHDGIRKIGYWNEDDKFVPAATDAQAGGDNSSVQNRTYIVTTILEDPYVMLKKNANQFEGNDRYEGYCVELAAEIAKHVGYSYRLEIVSDGKYGARDPDTKAWNGMVGELVYGRADVAVAPLTITLVREEVIDFSKPFMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWHSEEFEEGRDQTTSDQSNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTLEAGSTKEFFRRSKIAVFEKMWTYMKSAEPSVFVRTTEEGMIRVRKSKGKYAYLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGIATPKGSALRGPVNLAVLKLSEQGVLDKLKSKWWYDKGECGSKDSGSKDKTSALSLSNVAGVFYILIGGLGLAMLVALIEFCYKSRSESKRMKGFCLIPQQSINEAIRTSTLPRNSGAGASSGGSGENGRVVSHDFPKSMQSIPCMSHSSGMPLGATGL,906,NP_001107655.1.csv,refseq-GRIA1-NM_001114183.1_clinical_seed_0_final,refseq-GRIA1-NM_001114183.1.a2m,Invitae,refseq-GRIA1-NM_001114183.1.npy,1,906,906
+NP_001108220.1,MSMSANTMIFMILGASVVMAIACLMDMNALLDRFHNYILPHLRGEDRVCHCNCGRHHIHYVIPYDGDQSVVDASENYFVTDSVTKQEIDLMLGLLLGFCISWFLVWMDGVLHCAVRAWRAGRRYDGSWTWLPKLCSLRELGRRPHRPFEEAAGNMVHVKQKLYHNGHPSPRHL,173,NP_001108220.1.csv,refseq-TMEM240-NM_001114748.1_clinical_seed_0_final,refseq-TMEM240-NM_001114748.1.a2m,Invitae,refseq-TMEM240-NM_001114748.1.npy,1,173,173
+NP_001116229.1,MSCQAFTSADTFIPLNSDASATLPLIMHHSAAECLPVSNHATNVMSTVPSILSLIQTPKCLCTHFSVTTLGNTATGLHYSVPSCHYGNQPSTYGVMAGSLTPCLYKFPDHTLSHGFPPIHQPLLAEDPTAADFKQELRRKSKLVEEPIDMDSPEIRELEKFANEFKVRRIKLGYTQTNVGEALAAVHGSEFSQTTICRFENLQLSFKNACKLKAILSKWLEEAEQVGALYNEKVGANERKRKRRTTISIAAKDALERHFGEQNKPSSQEIMRMAEELNLEKEVVRVWFCNRRQREKRVKTSLNQSLFSISKEHLECR,317,NP_001116229.1.csv,refseq-POU1F1-NM_001122757.1_clinical_seed_0_final,refseq-POU1F1-NM_001122757.1.a2m,Invitae,refseq-POU1F1-NM_001122757.1.npy,1,317,317
+NP_001116293.1,MAPKRQSPLPPQKKKPRPPPALGPEETSASAGLPKKGEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEALHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHLNESGDPSSKSTEIKWKSGKDLTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQYYLVPDMDDEEGEGEEDDDDDEEEEGLEDIDEEGDEDEGEEDEDDDEGEEGEEDEGEDD,290,NP_001116293.1.csv,refseq-SET-NM_001122821.1_clinical_seed_0_final,refseq-SET-NM_001122821.1.a2m,Invitae,refseq-SET-NM_001122821.1.npy,1,290,290
+NP_001119578.1,MVDSTEYEVASQPEVETSPLGDGASPGPEQVKLKKEISLLNGVCLIVGNMIGSGIFVSPKGVLIYSASFGLSLVIWAVGGLFSVFGALCYAELGTTIKKSGASYAYILEAFGGFLAFIRLWTSLLIIEPTSQAIIAITFANYMVQPLFPSCFAPYAASRLLAAACICLLTFINCAYVKWGTLVQDIFTYAKVLALIAVIVAGIVRLGQGASTHFENSFEGSSFAVGDIALALYSALFSYSGWDTLNYVTEEIKNPERNLPLSIGISMPIVTIIYILTNVAYYTVLDMRDILASDAVAVTFADQIFGIFNWIIPLSVALSCFGGLNASIVAASRLFFVGSREGHLPDAICMIHVERFTPVPSLLFNGIMALIYLCVEDIFQLINYYSFSYWFFVGLSIVGQLYLRWKEPDRPRPLKLSVFFPIVFCLCTIFLVAVPLYSDTINSLIGIAIALSGLPFYFLIIRVPEHKRPLYLRRIVGSATRYLQVLCMSVAAEMDLEDGGEMPKQRDPKSN,511,NP_001119578.1.csv,refseq-SLC7A7-NM_001126106.2_clinical_seed_0_final,refseq-SLC7A7-NM_001126106.2.a2m,Invitae,refseq-SLC7A7-NM_001126106.2.npy,1,511,511
+NP_001119580.2,MAELPTTETPGDATLCSGRFTISTLLSSDEPSPPAAYDSSHPSHLTHSSTFCMRTFGYNTIDVVPTYEHYANSTQPGEPRKVRPTLADLHSFLKQEGRHLHALAFDSRPSHEMTDGLVEGEAGTSSEKNPEEPVRFGWVKGVMIRCMLNIWGVILYLRLPWITAQAGIVLTWIIILLSVTVTSITGLSISAISTNGKVKSGGTYFLISRSLGPELGGSIGLIFAFANAVGVAMHTVGFAETVRDLLQEYGAPIVDPINDIRIIAVVSVTVLLAISLAGMEWESKAQVLFFLVIMVSFANYLVGTLIPPSEDKASKGFFSYRADIFVQNLVPDWRGPDGTFFGMFSIFFPSATGILAGANISGDLKDPAIAIPKGTLMAIFWTTISYLAISATIGSCVVRDASGVLNDTVTPGWGACEGLACSYGWNFTECTQQHSCHYGLINYYQTMSMVSGFAPLITAGIFGATLSSALACLVSAAKVFQCLCEDQLYPLIGFFGKGYGKNKEPVRGYLLAYAIAVAFIIIAELNTIAPIISNFFLCSYALINFSCFHASITNSPGWRPSFQYYNKWAALFGAIISVVIMFLLTWWAALIAIGVVLFLLLYVIYKKPEVNWGSSVQAGSYNLALSYSVGLNEVEDHIKNYRPQCLVLTGPPNFRPALVDFVGTFTRNLSLMICGHVLIGPHKQRMPELQLIANGHTKWLNKRKIKAFYSDVIAEDLRRGVQILMQAAGLGRMKPNILVVGFKKNWQSAHPATVEDYIGILHDAFDFNYGVCVMRMREGLNVSKMMQAHINPVFDPAEDGKEASARVDPKALVKEEQATTIFQSEQGKKTIDIYWLFDDGGLTLLIPYLLGRKRRWSKCKIRVFVGGQINRMDQERKAIISLLSKFRLGFHEVHILPDINQNPRAEHTKRFEDMIAPFRLNDGFKDEATVNEMRRDCPWKISDEEITKNRVKSLRQVRLNEIVLDYSRDAALIVITLPIGRKGKCPSSLYMAWLETLSQDLRPPVILIRGNQENVLTFYCQ,1021,NP_001119580.2.csv,refseq-SLC12A3-NM_001126108.2_clinical_seed_0_final,refseq-SLC12A3-NM_001126108.2.a2m,Invitae,refseq-SLC12A3-NM_001126108.2_theta_0.2.npy,1,1021,1021
+NP_001120.3,MAAVRGAPLLSCLLALLALCPGGRPQTVLTDDEIEEFLEGFLSELEPEPREDDVEAPPPPEPTPRVRKAQAGGKPGKRPGTAAEVPPEKTKDKGKKGKKDKGPKVPKESLEGSPRPPKKGKEKPPKATKKPKEKPPKATKKPKEKPPKATKKPKEKPPKATKKPPSGKRPPILAPSETLEWPLPPPPSPGPEELPQEGGAPLSNNWQNPGEETHVEAREHQPEPEEETEQPTLDYNDQIEREDYEDFEYIRRQKQPRPPPSRRRRPERVWPEPPEEKAPAPAPEERIEPPVKPLLPPLPPDYGDGYVIPNYDDMDYYFGPPPPQKPDAERQTDEEKEELKKPKKEDSSPKEETDKWAVEKGKDHKEPRKGEELEEEWTPTEKVKCPPIGMESHRIEDNQIRASSMLRHGLGAQRGRLNMQTGATEDDYYDGAWCAEDDARTQWIEVDTRRTTRFTGVITQGRDSSIHDDFVTTFFVGFSNDSQTWVMYTNGYEEMTFHGNVDKDTPVLSELPEPVVARFIRIYPLTWNGSLCMRLEVLGCSVAPVYSYYAQNEVVATDDLDFRHHSYKDMRQLMKVVNEECPTITRTYSLGKSSRGLKIYAMEISDNPGEHELGEPEFRYTAGIHGNEVLGRELLLLLMQYLCREYRDGNPRVRSLVQDTRIHLVPSLNPDGYEVAAQMGSEFGNWALGLWTEEGFDIFEDFPDLNSVLWGAEERKWVPYRVPNNNLPIPERYLSPDATVSTEVRAIIAWMEKNPFVLGANLNGGERLVSYPYDMARTPTQEQLLAAAMAAARGEDEDEVSEAQETPDHAIFRWLAISFASAHLTLTEPYRGGCQAQDYTGGMGIVNGAKWNPRTGTINDFSYLHTNCLELSFYLGCDKFPHESELPREWENNKEALLTFMEQVHRGIKGVVTDEQGIPIANATISVSGINHGVKTASGGDYWRILNPGEYRVTAHAEGYTPSAKTCNVDYDIGATQCNFILARSNWKRIREIMAMNGNRPIPHIDPSRPMTPQQRRLQQRRLQHRLRLRAQMRLRRLNATTTLGPHTVPPTLPPAPATTLSTTIEPWGLIPPTTAGWEESETETYTEVVTEFGTEVEPEFGTKVEPEFETQLEPEFETQLEPEFEEEEEEEKEEEIATGQAFPFTTVETYTVNFGDF,1158,NP_001120.3.csv,refseq-AEBP1-NM_001129.4_clinical_seed_0_final,refseq-AEBP1-NM_001129.4.a2m,Invitae,refseq-AEBP1-NM_001129.4.npy,1,1158,1158
+NP_001120680.1,MEQDRTNHVEGNRLSPFLIPSPPICQTEPLATKLQNGSPLPERAHPEVNGDTKWHSFKSYYGIPCMKGSQNSRVSPDFTQESRGYSKCLQNGGIKRTVSEPSLSGLLQIKKLKQDQKANGERRNFGVSQERNPGESSQPNVSDLSDKKESVSSVAQENAVKDFTSFSTHNCSGPENPELQILNEQEGKSANYHDKNIVLLKNKAVLMPNGATVSASSVEHTHGELLEKTLSQYYPDCVSIAVQKTTSHINAINSQATNELSCEITHPSHTSGQINSAQTSNSELPPKPAAVVSEACDADDADNASKLAAMLNTCSFQKPEQLQQQKSVFEICPSPAENNIQGTTKLASGEEFCSGSSSNLQAPGGSSERYLKQNEMNGAYFKQSSVFTKDSFSATTTPPPPSQLLLSPPPPLPQVPQLPSEGKSTLNGGVLEEHHHYPNQSNTTLLREVKIEGKPEAPPSQSPNPSTHVCSPSPMLSERPQNNCVNRNDIQTAGTMTVPLCSEKTRPMSEHLKHNPPIFGSSGELQDNCQQLMRNKEQEILKGRDKEQTRDLVPPTQHYLKPGWIELKAPRFHQAESHLKRNEASLPSILQYQPNLSNQMTSKQYTGNSNMPGGLPRQAYTQKTTQLEHKSQMYQVEMNQGQSQGTVDQHLQFQKPSHQVHFSKTDHLPKAHVQSLCGTRFHFQQRADSQTEKLMSPVLKQHLNQQASETEPFSNSHLLQHKPHKQAAQTQPSQSSHLPQNQQQQQKLQIKNKEEILQTFPHPQSNNDQQREGSFFGQTKVEECFHGENQYSKSSEFETHNVQMGLEEVQNINRRNSPYSQTMKSSACKIQVSCSNNTHLVSENKEQTTHPELFAGNKTQNLHHMQYFPNNVIPKQDLLHRCFQEQEQKSQQASVLQGYKNRNQDMSGQQAAQLAQQRYLIHNHANVFPVPDQGGSHTQTPPQKDTQKHAALRWHLLQKQEQQQTQQPQTESCHSQMHRPIKVEPGCKPHACMHTAPPENKTWKKVTKQENPPASCDNVQQKSIIETMEQHLKQFHAKSLFDHKALTLKSQKQVKVEMSGPVTVLTRQTTAAELDSHTPALEQQTTSSEKTPTKRTAASVLNNFIESPSKLLDTPIKNLLDTPVKTQYDFPSCRCVEQIIEKDEGPFYTHLGAGPNVAAIREIMEERFGQKGKAIRIERVIYTGKEGKSSQGCPIAKWVVRRSSSEEKLLCLVRERAGHTCEAAVIVILILVWEGIPLSLADKLYSELTETLRKYGTLTNRRCALNEERTCACQGLDPETCGASFSFGCSWSMYYNGCKFARSKIPRKFKLLGDDPKEEEKLESHLQNLSTLMAPTYKKLAPDAYNNQIEYEHRAPECRLGLKEGRPFSGVTACLDFCAHAHRDLHNMQNGSTLVCTLTREDNREFGGKPEDEQLHVLPLYKVSDVDEFGSVEAQEEKKRSGAIQVLSSFRRKVRMLAEPVKTCRQRKLEAKKAAAEKLSSLENSSNKNEKEKSAPSRTKQTENASQAKQLAELLRLSGPVMQQSQQPQPLQKQPPQPQQQQRPQQQQPHHPQTESVNSYSASGSTNPYMRRPNPVSPYPNSSHTSDIYGSTSPMNFYSTSSQAAGSYLNSSNPMNPYPGLLNQNTQYPSYQCNGNLSVDNCSPYLGSYSPQSQPMDLYRYPSQDPLSKLSLPPIHTLYQPRFGNSQSFTSKYLGYGNQNMQGDGFSSCTIRPNVHHVGKLPPYPTHEMDGHFMGATSRLPPNLSNPNMDYKNGEHHSPSHIIHNYSAAPGMFNSSLHALHLQNKENDMLSHTANGLSKMLPALNHDRTACVQGGLHKLSDANGQEKQPLALVQGVASGAEDNDEVWSDSEQSFLDPDIGGVAVAPTHGSILIECAKRELHATTPLKNPNRNHPTRISLVFYQHKSMNEPKHGLALWEAKMAEKAREKEEECEKYGPDYVPQKSHGKKVKREPAEPHETSEPTYLRFIKSLAERTMSVTTDSTVTTSPYAFTRVTGPYNRYI,2002,NP_001120680.1.csv,refseq-TET2-NM_001127208.2_clinical_seed_0_final,refseq-TET2-NM_001127208.2.a2m,Invitae,refseq-TET2-NM_001127208.2_theta_0.2.npy,1,2002,2002
+NP_001120693.1,MARFGDEMPARYGGGGSGAAAGVVVGSGGGRGAGGSRQGGQPGAQRMYKQSMAQRARTMALYNPIPVRQNCLTVNRSLFLFSEDNVVRKYAKKITEWPPFEYMILATIIANCIVLALEQHLPDDDKTPMSERLDDTEPYFIGIFCFEAGIKIIALGFAFHKGSYLRNGWNVMDFVVVLTGILATVGTEFDLRTLRAVRVLRPLKLVSGIPSLQVVLKSIMKAMIPLLQIGLLLFFAILIFAIIGLEFYMGKFHTTCFEEGTDDIQGESPAPCGTEEPARTCPNGTKCQPYWEGPNNGITQFDNILFAVLTVFQCITMEGWTDLLYNSNDASGNTWNWLYFIPLIIIGSFFMLNLVLGVLSGEFAKERERVENRRAFLKLRRQQQIERELNGYMEWISKAEEVILAEDETDGEQRHPFDGALRRTTIKKSKTDLLNPEEAEDQLADIASVGSPFARASIKSAKLENSTFFHKKERRMRFYIRRMVKTQAFYWTVLSLVALNTLCVAIVHYNQPEWLSDFLYYAEFIFLGLFMSEMFIKMYGLGTRPYFHSSFNCFDCGVIIGSIFEVIWAVIKPGTSFGISVLRALRLLRIFKVTKYWASLRNLVVSLLNSMKSIISLLFLLFLFIVVFALLGMQLFGGQFNFDEGTPPTNFDTFPAAIMTVFQILTGEDWNEVMYDGIKSQGGVQGGMVFSIYFIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEQEEEEAANQKLALQKAKEVAEVSPLSAANMSIAVKEQQKNQKPAKSVWEQRTSEMRKQNLLASREALYNEMDPDERWKAAYTRHLRPDMKTHLDRPLVVDPQENRNNNTNKSRAAEPTVDQRLGQQRAEDFLRKQARYHDRARDPSGSAGLDARRPWAGSQEAELSREGPYGRESDHHAREGSLEQPGFWEGEAERGKAGDPHRRHVHRQGGSRESRSGSPRTGADGEHRRHRAHRRPGEEGPEDKAERRARHREGSRPARGGEGEGEGPDGGERRRRHRHGAPATYEGDARREDKERRHRRRKENQGSGVPVSGPNLSTTRPIQQDLGRQDPPLAEDIDNMKNNKLATAESAAPHGSLGHAGLPQSPAKMGNSTDPGPMLAIPAMATNPQNAASRRTPNNPGNPSNPGPPKTPENSLIVTNPSGTQTNSAKTARKPDHTTVDIPPACPPPLNHTVVQVNKNANPDPLPKKEEEKKEEEEDDRGEDGPKPMPPYSSMFILSTTNPLRRLCHYILNLRYFEMCILMVIAMSSIALAAEDPVQPNAPRNNVLRYFDYVFTGVFTFEMVIKMIDLGLVLHQGAYFRDLWNILDFIVVSGALVAFAFTGNSKGKDINTIKSLRVLRVLRPLKTIKRLPKLKAVFDCVVNSLKNVFNILIVYMLFMFIFAVVAVQLFKGKFFHCTDESKEFEKDCRGKYLLYEKNEVKARDREWKKYEFHYDNVLWALLTLFTVSTGEGWPQVLKHSVDATFENQGPSPGYRMEMSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKMMEEYSLEKNERACIDFAISAKPLTRHMPQNKQSFQYRMWQFVVSPPFEYTIMAMIALNTIVLMMKFYGASVAYENALRVFNIVFTSLFSLECVLKVMAFGILNYFRDAWNIFDFVTVLGSITDILVTEFGNNFINLSFLRLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVFGNIGIDVEDEDSDEDEFQITEHNNFRTFFQALMLLFRSATGEAWHNIMLSCLSGKPCDKNSGILTRECGNEFAYFYFVSFIFLCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHLDEYVRVWAEYDPAACGRIHYKDMYSLLRVISPPLGLGKKCPHRVACKRLLRMDLPVADDNTVHFNSTLMALIRTALDIKIAKGGADKQQMDAELRKEMMAIWPNLSQKTLDLLVTPHKSTDLTVGKIYAAMMIMEYYRQSKAKKLQAMREEQDRTPLMFQRMEPPSPTQEGGPGQNALPSTQLDPGGALMAHESGLKESPSWVTQRAQEMFQKTGTWSPEQGPPTDMPNSQPNSQSVEMREMGRDGYSDSEHYLPMEGQGRAASMPRLPAENQRRRGRPRGNNLSTISDTSPMKRSASVLGPKARRLDDYSLERVPPEENQRHHQRRRDRSHRASERSLGRYTDVDTGLGTDLSMTTQSGDLPSKERDQERGRPKDRKHRQHHHHHHHHHHPPPPDKDRYAQERPDHGRARARDQRWSRSPSEGREHMAHRQ,2261,NP_001120693.1.csv,refseq-CACNA1A-NM_001127221.1_clinical_seed_0_final,refseq-CACNA1A-NM_001127221.1.a2m,Invitae,refseq-CACNA1A-NM_001127221.1.npy,1,2261,2261
+NP_001120864.1,MEVVDETEALQRFFEGHDINGALEPSNIDTSILEEYISKEDASDLCFPDISAPASSASYSHGQPAMPGSSGVHHLSPPGGGPSPGRHGPLPPPGYGTPLNCNNNNGMGAAPKPFPGGTGPPIKAEPKAPYAPGTLPDSPPDSGSEAYSPQQVNEPHLLRTITPETLCHVGVPSRLEHPPPPPAHLPGPPPPPPPPPHYPVLQRDLYMKAEPPIPHYAAMGQGLVPTDLHHTQQSQMLHQLLQQHGAELPTHPSKKRKHSESPPSTLNAQMLNGMIKQEPGTVTALPLHPTRAPSPPWPPQGPLSPGPGSLPLSIARVQTPPWHPPGAPSPGLLQDSDSLSGSYLDPNYQSIKWQPHQQNKWATLYDANYKELPMLTYRVDADKGFNFSVGDDAFVCQKKNHFQVTVYIGMLGEPKYVKTPEGLKPLDCFYLKLHGVKLEALNQSINIEQSQSDRSKRPFNPVTVNLPPEQVTKVTVGRLHFSETTANNMRKKGKPNPDQRYFMLVVALQAHAQNQNYTLAAQISERIIVRASNPGQFESDSDVLWQRAQVPDTVFHHGRVGINTDRPDEALVVHGNVKVMGSLMHPSDLRAKEHVQEVDTTEQLKRISRMRLVHYRYKPEFAASAGIEATAPETGVIAQEVKEILPEAVKDTGDMVFANGKTIENFLVVNKERIFMENVGAVKELCKLTDNLETRIDELERWSHKLAKLRRLDSLKSTGSSGAFSHAGSQFSRAGSVPHKKRPPKVASKSSSVVPDQACISQRFLQGTIIALVVVMAFSVVSMSTLYVLSLRTEEDLVDTDGSFAVSTSCLLALLRPQPPGGSEALCPWSSQSFGTTQLRQSPLTTGLPGIQPSLLLVTTSLTSSAPGSAVRTLDMCSSHPCPVICCSSPTTNPTTGPSLGPSFNPGHVLSPSPSPSTNRSGPSQMALLPVTNIRAKSWGLSVNGIGHSKHHKSLEPLASPAVPFPGGQGKAKNSPSLGFHGRARRGALQSSVGPAEPTWAQGQSASLLAEPVPSLTSIQVLENSMSITSQYCAPGDACRPGNFTYHIPVSSGTPLHLSLTLQMNSSSPVSVVLCSLRSKEEPCEEGSLPQSLHTHQDTQGTSHRWPITILSFREFTYHFRVALLGQANCSSEALAQPATDYHFHFYRLCD,1151,NP_001120864.1.csv,refseq-MYRF-NM_001127392.2_clinical_seed_0_final,refseq-MYRF-NM_001127392.2.a2m,Invitae,refseq-MYRF-NM_001127392.2.npy,1,1151,1151
+NP_001120972.1,MKAPAVLAPGILVLLFTLVQRSNGECKEALAKSEMNVNMKYQLPNFTAETPIQNVILHEHHIFLGATNYIYVLNEEDLQKVAEYKTGPVLEHPDCFPCQDCSSKANLSGGVWKDNINMALVVDTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVHCIFSPQIEEPSQCPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRRLKETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETLDAQTFHTRIIRFCSINSGLHSYMEMPLECILTEKRKKRSTKKEVFNILQAAYVSKPGAQLARQIGASLNDDILFGVFAQSKPDSAEPMDRSAMCAFPIKYVNDFFNKIVNKNNVRCLQHFYGPNHEHCFNRTLLRNSSGCEARRDEYRTEFTTALQRVDLFMGQFSEVLLTSISTFIKGDLTIANLGTSEGRFMQVVVSRSGPSTPHVNFLLDSHPVSPEVIVEHTLNQNGYTLVITGKKITKIPLNGLGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICLPAIYKVFPNSAPLEGGTRLTICGWDFGFRRNNKFDLKKTRVLLGNESCTLTLSESTMNTLKCTVGPAMNKHFNMSIIISNGHGTTQYSTFSYVDPVITSISPKYGPMAGGTLLTLTGNYLNSGNSRHISIGGKTCTLKSVSNSILECYTPAQTISTEFAVKLKIDLANRETSIFSYREDPIVYEIHPTKSFISTWWKEPLNIVSFLFCFASGGSTITGVGKNLNSVSVPRMVINVHEAGRNFTVACQHRSNSEIICCTTPSLQQLNLQLPLKTKAFFMLDGILSKYFDLIYVHNPVFKPFEKPVMISMGNENVLEIKGNDIDPEAVKGEVLKVGNKSCENIHLHSEAVLCTVPNDLLKLNSELNIEWKQAISSTVLGKVIVQPDQNFTGLIAGVVSISTALLLLLGFFLWLKKRKQIKDLGSELVRYDARVHTPHLDRLVSARSVSPTTEMVSNESVDYRATFPEDQFPNSSQNGSCRQVQYPLTDMSPILTSGDSDISSPLLQNTVHIDLSALNPELVQAVQHVVIGPSSLIVHFNEVIGRGHFGCVYHGTLLDNDGKKIHCAVKSLNRITDIGEVSQFLTEGIIMKDFSHPNVLSLLGICLRSEGSPLVVLPYMKHGDLRNFIRNETHNPTVKDLIGFGLQVAKGMKYLASKKFVHRDLAARNCMLDEKFTVKVADFGLARDMYDKEYYSVHNKTGAKLPVKWMALESLQTQKFTTKSDVWSFGVLLWELMTRGAPPYPDVNTFDITVYLLQGRRLLQPEYCPDPLYEVMLKCWHPKAEMRPSFSELVSRISAIFSTFIGEHYVHVNATYVNVKCVAPYPSLLSSEDNADDEVDTRPASFWETS,1408,NP_001120972.1.csv,refseq-MET-NM_001127500.1_clinical_seed_0_final,refseq-MET-NM_001127500.1.a2m,Invitae,refseq-MET-NM_001127500.1.npy,1,1408,1408
+NP_001121370.1,MAMWQGAMDNRGFQQGSFSSFQNSSSDEDLMDIPATAMDFSMRDDVPPLDREVGEDKSYNGGGIGSSNRIMDFLEEPIPGVGTYDDFNTIDWVREKSRDRDRHREITNKSKESTWALIHSVSDAFSGWLLMLLIGLLSGSLAGLIDISAHWMTDLKEGICTGGFWFNHEHCCWNSEHVTFEERDKCPEWNSWSQLIISTDEGAFAYIVNYFMYVLWALLFAFLAVSLVKVFAPYACGSGIPEIKTILSGFIIRGYLGKWTLVIKTITLVLAVSSGLSLGKEGPLVHVACCCGNILCHCFNKYRKNEAKRREVLSAAAAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSFFAALVAAFTLRSINPFGNSRLVLFYVEFHTPWHLFELVPFILLGIFGGLWGALFIRTNIAWCRKRKTTQLGKYPVIEVLVVTAITAILAFPNEYTRMSTSELISELFNDCGLLDSSKLCDYENRFNTSKGGELPDRPAGVGVYSAMWQLALTLILKIVITIFTFGMKIPSGLFIPSMAVGAIAGRLLGVGMEQLAYYHQEWTVFNSWCSQGADCITPGLYAMVGAAACLGGVTRMTVSLVVIMFELTGGLEYIVPLMAAAMTSKWVADALGREGIYDAHIRLNGYPFLEAKEEFAHKTLAMDVMKPRRNDPLLTVLTQDSMTVEDVETIISETTYSGFPVVVSRESQRLVGFVLRRDLIISIENARKKQDGVVSTSIIYFTEHSPPLPPYTPPTLKLRNILDLSPFTVTDLTPMEIVVDIFRKLGLRQCLVTHNGRLLGIITKKDVLKHIAQMANQDPDSILFN,816,NP_001121370.1.csv,refseq-CLCN5-NM_001127898.1_clinical_seed_0_final,refseq-CLCN5-NM_001127898.1.a2m,Invitae,refseq-CLCN5-NM_001127898.1_theta_0.2.npy,1,816,816
+NP_001121631.1,MMEEEELEFVEELEAVLQLTPEVQLAIEQVFPSQDPLDRADFNAVEYINTLFPTEQSLANIDEVVNKIRLKIRRLDDNIRTVVRGQTNVGQDGRQALEEAQKAIQQLFGKIKDIKDKAEKSEQMVKEITRDIKQLDHAKRHLTTSITTLNHLHMLAGGVDSLEAMTRRRQYGEVANLLQGVMNVLEHFHKYMGIPQIRQLSERVKAAQTELGQQILADFEEAFPSQGTKRPGGPSNVLRDACLVANILDPRIKQEIIKKFIKQHLSEYLVLFQENQDVAWLDKIDRRYAWIKRQLVDYEEKYGRMFPREWCMAERIAVEFCHVTRAELAKIMRTRAKEIEVKLLLFAIQRTTNFEGFLAKRFSGCTLTDGTLKKLESPPPSTNPFLEDEPTPEMEELATEKGDLDQPKKPKAPDNPFHGIVSKCFEPHLYVYIESQDKNLGELIDRFVADFKAQGPPKPNTDEGGAVLPSCADLFVYYKKCMVQCSQLSTGEPMIALTTIFQKYLREYAWKILSGNLPKTTTSSGGLTISSLLKEKEGSEVAKFTLEELCLICNILSTAEYCLATTQQLEEKLKEKVDVSLIERINLTGEMDTFSTVISSSIQLLVQDLDAACDPALTAMSKMQWQNVEHVGDQSPYVTSVILHIKQNVPIIRDNLASTRKYFTQFCVKFANSFIPKFITHLFKCKPISMVGAEQLLLDTHSLKMVLLDLPSISSQVVRKAPASYTKIVVKGMTRAEMILKVVMAPHEPLVVFVDNYIKLLTDCNTETFQKILDMKGLKRSEQSSMLELLRQRLPAPPSGAESSGSLSLTAPTPEQESSRIRKLEKLIKKRL,832,NP_001121631.1.csv,refseq-VPS53-NM_001128159.2_clinical_seed_0_final,refseq-VPS53-NM_001128159.2.a2m,Invitae,refseq-VPS53-NM_001128159.2.npy,1,832,832
+NP_001121699.1,METYGYLQRESCFQGPHELYFKNLSKRNKQIMEKNGNNRKLRVCVATCNRADYSKLAPIMFGIKTEPEFFELDVVVLGSHLIDDYGNTYRMIEQDDFDINTRLHTIVRGEDEAAMVESVGLALVKLPDVLNRLKPDIMIVHGDRFDALALATSAALMNIRILHIEGGEVSGTIDDSIRHAITKLAHYHVCCTRSAEQHLISMCEDHDRILLAGCPSYDKLLSAKNKDYMSIIRMWLGDDVKSKDYIVALQHPVTTDIKHSIKMFELTLDALISFNKRTLVLFPNIDAGSKEMVRVMRKKGIEHHPNFRAVKHVPFDQFIQLVAHAGCMIGNSSCGVREVGAFGTPVINLGTRQIGRETGENVLHVRDADTQDKILQALHLQFGKQYPCSKIYGDGNAVPRILKFLKSIDLQEPLQKKFCFPPVKENISQDIDHILETLSALAVDLGGTNLRVAIVSMKGEIVKKYTQFNPKTYEERINLILQMCVEAAAEAVKLNCRILGVGISTGGRVNPREGIVLHSTKLIQEWNSVDLRTPLSDTLHLPVWVDNDGNCAALAERKFGQGKGLENFVTLITGTGIGGGIIHQHELIHGSSFCAAELGHLVVSLDGPDCSCGSHGCIEAYASGMALQREAKKLHDEDLLLVEGMSVPKDEAVGALHLIQAAKLGNAKAQSILRTAGTALGLGVVNILHTMNPSLVILSGVLASHYIHIVKDVIRQQALSSVQDVDVVVSDLVDPALLGAASMVLDYTTRRIY,753,NP_001121699.1.csv,refseq-GNE-NM_001128227.2_clinical_seed_0_final,refseq-GNE-NM_001128227.2.a2m,Invitae,refseq-GNE-NM_001128227.2.npy,1,753,753
+NP_001121775.1,MAEAVLIDLFGLKLNSQKNCHQTLLKTLNAVQYHHAAKAKFLCIMCCSNISYERDGEQDNCEIETSNGLSALLEEFEIVSCPSMAATLYTIKQKIDEKNLSSIKVIVPRHRKTLMKAFIDQLFTDVYNFEFEDLQVTFRGGLFKQSIEINVITAQELRGIQNEIETFLRSLPALRGKLTIITSSLIPDIFIHGFTTRTGGISYIPTLSSFNLFSSSKRRDPKVVVQENLRRLANAAGFNVEKFYRIKTHHSNDIWIMGRKEPDSYDGITTNQRGVTIAALGADCIPIVFADPVKKACGVAHAGWKGTLLGVAMATVNAMIAEYGCSLEDIVVVLGPSVGPCCFTLPRESAEAFHNLHPACVQLFDSPNPCIDIRKATRILLEQGGILPQNIQDQNQDLNLCTSCHPDKFFSHVRDGLNFGTQIGFISIKE,430,NP_001121775.1.csv,refseq-LACC1-NM_001128303.1_clinical_seed_0_final,refseq-LACC1-NM_001128303.1.a2m,Invitae,refseq-LACC1-NM_001128303.1.npy,1,430,430
+NP_001121897.1,MTPLVSRLSRLWAIMRKPRAAVGSGHRKQAASQEGRQKHAKNNSQAKPSACDACAGMIAECPGAPAGLARQPEEVVLQASVSSYHLFRDVAEVTAFRGSLLSWYDQEKRDLPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYTGWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFPSVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQTPVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQPGTCMGSKRSQVSSPCSRKKPRMGQQVLDNFFRSHISTDAHSLNSAAQ,549,NP_001121897.1.csv,refseq-MUTYH-NM_001128425.1_clinical_seed_0_final,refseq-MUTYH-NM_001128425.1.a2m,Invitae,refseq-MUTYH-NM_001128425.1.npy,1,549,549
+NP_001121901.1,MNLFNLDRFRFEKRNKIEEAPEATPQPSQPGPSSPISLSAEEENAEGEVSRANTPDSDITEKTEDSSVPETPDNERKASISYFKNQRGIQYIDLSSDSEDVVSPNCSNTVQEKTFNKDTVIIVSEPSEDEESQGLPTMARRNDDISELEDLSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMFGDAGGGPRKRKLSSSSEPYEEDEFNDDQSIKKTRLDHGEESNESAESSSNWEKQESIVLKLQKEFPNFDKQELREVLKEHEWMYTEALESLKVFAEDQDMQYVSQSEVPNGKEVSSRSQNYPKNATKTKLKQKFSMKAQNGFNKKRKKNVFNPKRVVEDSEYDSGSDVGSSLDEDYSSGEEVMEDGYKGKILHFLQDASIGELTLIPQCSQKKAQKITELRPFNSWEALFTKMSKTNGLSEDLIWHCKTLIQERDVVIRLMNKCEDISNKLTKQVTMLTGNGGGWNIEQPSILNQSLSLKPYQKVGLNWLALVHKHGLNGILADEMGLGKTIQAIAFLAYLYQEGNNGPHLIVVPASTIDNWLREVNLWCPTLKVLCYYGSQEERKQIRFNIHSRYEDYNVIVTTYNCAISSSDDRSLFRRLKLNYAIFDEGHMLKNMGSIRYQHLMTINANNRLLLTGTPVQNNLLELMSLLNFVMPHMFSSSTSEIRRMFSSKTKSADEQSIYEKERIAHAKQIIKPFILRRVKEEVLKQLPPKKDRIELCAMSEKQEQLYLGLFNRLKKSINNLVTEKNTEMCNVMMQLRKMANHPLLHRQYYTAEKLKEMSQLMLKEPTHCEANPDLIFEDMEVMTDFELHVLCKQYRHINNFQLDMDLILDSGKFRVLGCILSELKQKGDRVVLFSQFTMMLDILEVLLKHHQHRYLRLDGKTQISERIHLIDEFNTDMDIFVFLLSTKAGGLGINLTSANVVILHDIDCNPYNDKQAEDRCHRVGQTKEVLVIKLISQGTIEESMLKINQQKLKLEQDMTTVDEGDEGSMPADIATLLKTSMGL,1028,NP_001121901.1.csv,refseq-SMARCAD1-NM_001128429.2_clinical_seed_0_final,refseq-SMARCAD1-NM_001128429.2.a2m,Invitae,refseq-SMARCAD1-NM_001128429.2.npy,1,1028,1028
+NP_001121903.1,MKLLLLHPAFQSCLLLTLLGLWRTTPEAHASSLGAPAISAASFLQDLIHRYGEGDSLTLQQLKALLNHLDVGVGRGNVTQHVQGHRNLSTCFSSGDLFTAHNFSEQSRIGSSELQEFCPTILQQLDSRACTSENQENEENEQTEEGRPSAVEVWGYGLLCVTVISLCSLLGASVVPFMKKTFYKRLLLYFIALAIGTLYSNALFQLIPEAFGFNPLEDYYVSKSAVVFGGFYLFFFTEKILKILLKQKNEHHHGHSHYASESLPSKKDQEEGVMEKLQNGDLDHMIPQHCSSELDGKAPMVDEKVIVGSLSVQDLQASQSACYWLKGVRYSDIGTLAWMITLSDGLHNFIDGLAIGASFTVSVFQGISTSVAILCEEFPHELGDFVILLNAGMSIQQALFFNFLSACCCYLGLAFGILAGSHFSANWIFALAGGMFLYISLADMFPEMNEVCQEDERKGSILIPFIIQNLGLLTGFTIMVVLTMYSGQIQIG,492,NP_001121903.1.csv,refseq-SLC39A14-NM_001128431.2_clinical_seed_0_final,refseq-SLC39A14-NM_001128431.2.a2m,Invitae,refseq-SLC39A14-NM_001128431.2.npy,1,492,492
+NP_001122092.1,MSNNGLDIQDKPPAPPMRNTSTMIGAGSKDAGTLNHGSKPLPPNPEEKKKKDRFYRSILPGDKTNKKKEKERPEISLPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKSEQKKNPQAVLDVLEFYNSKKTSNSQKYMSFTDKSAEDYNSSNALNVKAVSETPAVPPVSEDEDDDDDDATPPPVIAPRPEHTKSVYTRSVIEPLPVTPTRDVATSPISPTENNTTPPDALTRNTEKQKKKPKMSDEEILEKLRSIVSVGDPKKKYTRFEKIGQGASGTVYTAMDVATGQEVAIKQMNLQQQPKKELIINEILVMRENKNPNIVNYLDSYLVGDELWVVMEYLAGGSLTDVVTETCMDEGQIAAVCRECLQALEFLHSNQVIHRDIKSDNILLGMDGSVKLTDFGFCAQITPEQSKRSTMVGTPYWMAPEVVTRKAYGPKVDIWSLGIMAIEMIEGEPPYLNENPLRALYLIATNGTPELQNPEKLSAIFRDFLNRCLEMDVEKRGSAKELLQVRKLRFQVFSNFSMIAASIPEDCQAPLQPHSTDCCS,553,NP_001122092.1.csv,refseq-PAK1-NM_001128620.1_clinical_seed_0_final,refseq-PAK1-NM_001128620.1.a2m,Invitae,refseq-PAK1-NM_001128620.1.npy,1,553,553
+NP_001122317.1,MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQGPGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGYPSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQLRAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQKLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIGPDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEESGSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAARRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEEDEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVIKYKDSSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESRSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED,1617,NP_001122317.1.csv,refseq-SMARCA4-NM_001128845.2_clinical_seed_0_final,refseq-SMARCA4-NM_001128845.2.a2m,Invitae,refseq-SMARCA4-NM_001128845.2.npy,1,1617,1617
+NP_001123292.1,MESLKTDTEMPYPEVIVDVGRVIFGEENRKKMTNSCLKRSENSRIIRAICALLNSGGGVIKAEIDDKTYSYQCHGLGQDLETSFQKLLPSGSQKYLDYMQQGHNLLIFVKSWSPDVFSLPLRICSLRSNLYRRDVTSAINLSASSALELLREKGFRAQRGRPRVKKLHPQQVLNRCIQEEEDMRILASEFFKKDKLMYKEKLNFTESTHVEFKRFTTKKVIPRIKEMLPHYVSAFANTQGGYVLIGVDDKSKEVVGCKWEKVNPDLLKKEIENCIEKLPTFHFCCEKPKVNFTTKILNVYQKDVLDGYVCVIQVEPFCCVVFAEAPDSWIMKDNSVTRLTAEQWVVMMLDTQSAPPSLVTDYNSCLISSASSARKSPGYPIKVHKFKEALQRHLFPVTQEEVQFKPESLCKKLFSDHKELEGLMKTLIHPCSQGIVIFSRSWAGDVGFRKEQNVLCDALLIAVNSPVVLYTILIDPNWPGGLEYARNTAHQLKQKLQTVGGYTGKVCIIPRLIHLSSTQSRPGEIPLRYPRSYRLADEEEMEDLLQALVVVSLSSRSLLSDQMGCEFFNLLIMEQSQLLSESLQKTRELFIYCFPGVRKTALAIKIMEKIKDLFHCKPKEILYVCESDSLKDFVTQQTTCQAVTRKTFMQGEFLKIKHIVMDETENFCSKYGNWYMKAKNITHPKAKGTGSENLHHGILWLFLDPFQIHHADVNGLPPPSAQFPRKTITSGIHCALEIAKVMKEEMKRIKENPPSNMSPDTLALFSETAYEEATCAQALPGVCETKTNLTTEQIANYVARKCHSLFQCGYLPKDIAILCRRGEDRGRYRLALLKAMELIETHRPSEVVFSPATGVWGSHIVLDSIQQFSGLERTVVFGLSPECDQSEEFHKLCFASRAIKHLYLLYEKRAAY,912,NP_001123292.1.csv,refseq-SLFN14-NM_001129820.1_clinical_seed_0_final,refseq-SLFN14-NM_001129820.1.a2m,Invitae,refseq-SLFN14-NM_001129820.1.npy,1,912,912
+NP_001123476.1,MDHYDSQQTNDYMQPEEDWDRDLLLDPAWEKQQRKTFTAWCNSHLRKAGTQIENIEEDFRDGLKLMLLLEVISGERLAKPERGKMRVHKISNVNKALDFIASKGVKLVSIGAEEIVDGNVKMTLGMIWTIILRFAIQDISVEETSAKEGLLLWCQRKTAPYKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNLNTAFDVAEKYLDIPKMLDAEDIVGTARPDEKAIMTYVSSFYHAFSGAQKAETAANRICKVLAVNQENEQLMEDYEKLASDLLEWIRRTIPWLENRVPENTMHAMQQKLEDFRDYRRLHKPPKVQEKCQLEINFNTLQTKLRLSNRPAFMPSEGRMVSDINNAWGCLEQVEKGYEEWLLNEIRRLERLDHLAEKFRQKASIHEAWTDGKEAMLRQKDYETATLSEIKALLKKHEAFESDLAAHQDRVEQIAAIAQELNELDYYDSPSVNARCQKICDQWDNLGALTQKRREALERTEKLLETIDQLYLEYAKRAAPFNNWMEGAMEDLQDTFIVHTIEEIQGLTTAHEQFKATLPDADKERLAILGIHNEVSKIVQTYHVNMAGTNPYTTITPQEINGKWDHVRQLVPRRDQALTEEHARQQHNERLRKQFGAQANVIGPWIQTKMEEIGRISIEMHGTLEDQLSHLRQYEKSIVNYKPKIDQLEGDHQLIQEALIFDNKHTNYTMEHIRVGWEQLLTTIARTINEVENQILTRDAKGISQEQMNEFRASFNHFDRDHSGTLGPEEFKACLISLGYDIGNDPQKKTGMMDTDDFRACLISMGYNMGEAEFARIMSIVDPNRLGVVTFQAFIDFMSRETADTDTADQVMASFKILAGDKNYITMDELRRELPPDQAEYCIARMAPYTGPDSVPGALDYMSFSTALYGESDL,914,NP_001123476.1.csv,refseq-ACTN1-NM_001130004.1_clinical_seed_0_final,refseq-ACTN1-NM_001130004.1.a2m,Invitae,refseq-ACTN1-NM_001130004.1.npy,1,914,914
+NP_001123482.1,MILTKAQYDEIAQCLVSVPPTRQSLRKLKQRFPSQSQATLLSIFSQEYQKHIKRTHAKHHTSEAIESYYQRYLNGVVKNGAAPVLLDLANEVDYAPSLMARLILERFLQEHEETPPSKSIINSMLRDPSQIPDGVLANQVYQCIVNDCCYGPLVDCIKHAIGHEHEVLLRDLLLEKNLSFLDEDQLRAKGYDKTPDFILQVPVAVEGHIIHWIESKASFGDECSHHAYLHDQFWSYWNRFGPGLVIYWYGFIQELDCNRERGILLKACFPTNIVTLCHSIA,281,NP_001123482.1.csv,refseq-C15orf41-NM_001130010.2_clinical_seed_0_final,refseq-C15orf41-NM_001130010.2.a2m,Invitae,refseq-C15orf41-NM_001130010.2.npy,1,281,281
+NP_001123561.1,MLTQAAVRLVRGSLRKTSWAEWGHRELRLGQLAPFTAPHKDKSFSDQRSELKRRLKAEKKVAEKEAKQKELSEKQLSQATAAATNHTTDNGVGPEEESVDPNQYYKIRSQAIHQLKVNGEDPYPHKFHVDISLTDFIQKYSHLQPGDHLTDITLKVAGRIHAKRASGGKLIFYDLRGEGVKLQVMANSRNYKSEEEFIHINNKLRRGDIIGVQGNPGKTKKGELSIIPYEITLLSPCLHMLPHLHFGLKDKETRYRQRYLDLILNDFVRQKFIIRSKIITYIRSFLDELGFLEIETPMMNIIPGGAVAKPFITYHNELDMNLYMRIAPELYHKMLVVGGIDRVYEIGRQFRNEGIDLTHNPEFTTCEFYMAYADYHDLMEITEKMVSGMVKHITGSYKVTYHPDGPEGQAYDVDFTPPFRRINMVEELEKALGMKLPETNLFETEETRKILDDICVAKAVECPPPRTTARLLDKLVGEFLEVTCINPTFICDHPQIMSPLAKWHRSKEGLTERFELFVMKKEICNAYTELNDPMRQRQLFEEQAKAKAAGDDEAMFIDENFCTALEYGLPPTAGWGMGIDRVAMFLTDSNNIKEVLLFPAMKPEDKKENVATTDTLESTTVGTSV,625,NP_001123561.1.csv,refseq-KARS1-NM_001130089.1_clinical_seed_0_final,refseq-KARS1-NM_001130089.1.a2m,Invitae,refseq-KARS1-NM_001130089.1_theta_0.2.npy,1,625,625
+NP_001123575.1,MVAERTHKAAATGARGPGELGAPGTVALVAARAERGARLPSPGSCGLLTLALCSLALSLLAHFRTAELQARVLRLEAERGEQQMETAILGRVNQLLDEKWKLHSRRRREAPKTSPGCNCPPGPPGPTGRPGLPGDKGAIGMPGRVGSPGDAGLSIIGPRGPPGQPGTRGFPGFPGPIGLDGKPGHPGPKGDMGLTGPPGQPGPQGQKGEKGQCGEYPHRECLSSMPAALRSSQIIALKLLPLLNSVRLAPPPVIKRRTFQGEQSQASIQGPPGPPGPPGPSGPLGHPGLPGPMGPPGLPGPPGPKGDPGIQGYHGRKGERGMPGMPGKHGAKGAPGIAVAGMKGEPGIPGTKGEKGAEGSPGLPGLLGQKGEKGDAGNSIGGGRGEPGPPGLPGPPGPKGEAGVDGQVGPPGQPGDKGERGAAGEQGPDGPKGSKGEPGKGEMVDYNGNINEALQEIRTLALMGPPGLPGQIGPPGAPGIPGQKGEIGLPGPPGHDGEKGPRGKPGDMGPPGPQGPPGKDGPPGVKGENGHPGSPGEKGEKGETGQAGSPGEKGEAGEKGNPGAEVPGLPGPEGPPGPPGLQGVPGPKGEAGLDGAKGEKGFQGEKGDRGPLGLPGASGLDGRPGPPGTPGPIGVPGPAGPKGERGSKGDPGMTGPTGAAGLPGLHGPPGDKGNRGERGKKGSRGPKGDKGDQGAPGLDAPCPLGEDGLPVQGCWNK,717,NP_001123575.1.csv,refseq-COL13A1-NM_001130103.1_clinical_seed_0_final,refseq-COL13A1-NM_001130103.1.a2m,Invitae,refseq-COL13A1-NM_001130103.1.npy,1,717,717
+NP_001123577.1,MQHSCIPTPPSPFSAPPAFLPVVTRESRRGLSSGGSAGRNAGVTATAAAADGWKGRLPSPLVLLPRSARCQARRRRGGRTSSLLLLPPTPERALFASPSPDPSPRGLGASSGAAEGAGAGLLLGCRASMSDNQSWNSSGSEEDPETESGPPVERCGVLSKWTNYIHGWQDRWVVLKNNALSYYKSEDETEYGCRGSICLSKAVITPHDFDECRFDISVNDSVWYLRAQDPDHRQQWIDAIEQHKTESGYGSESSLRRHGSMVSLVSGASGYSATSTSSFKKGHSLREKLAEMETFRDILCRQVDTLQKYFDACADAVSKDELQRDKVVEDDEDDFPTTRSDGDFLHSTNGNKEKLFPHVTPKGINGIDFKGEAITFKATTAGILATLSHCIELMVKREDSWQKRLDKETEKKRRTEEAYKNAMTELKKKSHFGGPDYEEGPNSLINEEEFFDAVEAALDRQDKIEEQSQSEKVRLHWPTSLPSGDAFSSVGTHRFVQKPYSRSSSMSSIDLVSASDDVHRFSSQVEEMVQNHMTYSLQDVGGDANWQLVVEEGEMKVYRREVEENGIVLDPLKATHAVKGVTGHEVCNYFWNVDVRNDWETTIENFHVVETLADNAIIIYQTHKRVWPASQRDVLYLSVIRKIPALTENDPETWIVCNFSVDHDSAPLNNRCVRAKINVAMICQTLVSPPEGNQEISRDNILCKITYVANVNPGGWAPASVLRAVAKREYPKFLKRFTSYVQEKTAGKPILF,752,NP_001123577.1.csv,refseq-CERT1-NM_001130105.1_clinical_seed_0_final,refseq-CERT1-NM_001130105.1.a2m,Invitae,refseq-CERT1-NM_001130105.1.npy,1,752,752
+NP_001123616.1,MPGPRGAAGGLAPEMRGAGAAGLLALLLLLLLLLLGLGGRVEGGPAGERGAGGGGALARERFKVVFAPVICKRTCLKGQCRDSCQQGSNMTLIGENGHSTDTLTGSGFRVVVCPLPCMNGGQCSSRNQCLCPPDFTGRFCQVPAGGAGGGTGGSGPGLSRTGALSTGALPPLAPEGDSVASKHAIYAVQVIADPPGPGEGPPAQHAAFLVPLGPGQISAEVQAPPPVVNVRVHHPPEASVQVHRIESSNAESAAPSQHLLPHPKPSHPRPPTQKPLGRCFQDTLPKQPCGSNPLPGLTKQEDCCGSIGTAWGQSKCHKCPQLQYTGVQKPGPVRGEVGADCPQGYKRLNSTHCQDINECAMPGVCRHGDCLNNPGSYRCVCPPGHSLGPSRTQCIADKPEEKSLCFRLVSPEHQCQHPLTTRLTRQLCCCSVGKAWGARCQRCPTDGTAAFKEICPAGKGYHILTSHQTLTIQGESDFSLFLHPDGPPKPQQLPESPSQAPPPEDTEEERGVTTDSPVSEERSVQQSHPTATTTPARPYPELISRPSPPTMRWFLPDLPPSRSAVEIAPTQVTETDECRLNQNICGHGECVPGPPDYSCHCNPGYRSHPQHRYCVDVNECEAEPCGPGRGICMNTGGSYNCHCNRGYRLHVGAGGRSCVDLNECAKPHLCGDGGFCINFPGHYKCNCYPGYRLKASRPPVCEDIDECRDPSSCPDGKCENKPGSFKCIACQPGYRSQGGGACRDVNECAEGSPCSPGWCENLPGSFRCTCAQGYAPAPDGRSCLDVDECEAGDVCDNGICSNTPGSFQCQCLSGYHLSRDRSHCEDIDECDFPAACIGGDCINTNGSYRCLCPQGHRLVGGRKCQDIDECSQDPSLCLPHGACKNLQGSYVCVCDEGFTPTQDQHGCEEVEQPHHKKECYLNFDDTVFCDSVLATNVTQQECCCSLGAGWGDHCEIYPCPVYSSAEFHSLCPDGKGYTQDNNIVNYGIPAHRDIDECMLFGSEICKEGKCVNTQPGYECYCKQGFYYDGNLLECVDVDECLDESNCRNGVCENTRGGYRCACTPPAEYSPAQRQCLSPEEMDVDECQDPAACRPGRCVNLPGSYRCECRPPWVPGPSGRDCQLPESPAERAPERRDVCWSQRGEDGMCAGPLAGPALTFDDCCCRQGRGWGAQCRPCPPRGAGSHCPTSQSESNSFWDTSPLLLGKPPRDEDSSEEDSDECRCVSGRCVPRPGGAVCECPGGFQLDASRARCVDIDECRELNQRGLLCKSERCVNTSGSFRCVCKAGFARSRPHGACVPQRRR,1303,NP_001123616.1.csv,refseq-LTBP3-NM_001130144.2_clinical_seed_0_final,refseq-LTBP3-NM_001130144.2.a2m,Invitae,refseq-LTBP3-NM_001130144.2.npy,1,1303,1303
+NP_001123617.1,MDPGQQPPPQPAPQGQGQPPSQPPQGQGPPSGPGQPAPAATQAAPQAPPAGHQIVHVRGDSETDLEALFNAVMNPKTANVPQTVPMRLRKLPDSFFKPPEPKSHSRQASTDAGTAGALTPQHVRAHSSPASLQLGAVSPGTLTPTGVVSGPAATPTAQHLRQSSFEIPDDVPLPAGWEMAKTSSGQRYFLNHIDQTTTWQDPRKAMLSQMNVTAPTSPPVQQNMMNSASGPLPDGWEQAMTQDGEIYYINHKNKTTSWLDPRLDPRFAMNQRISQSAPVKQPPPLAPQSPQGGVMGGSNSNQQQQMRLQQLQMEKERLRLKQQELLRQAMRNINPSTANSPKCQELALRSQLPTLEQDGGTQNPVSSPGMSQELRTMTTNSSDPFLNSGTYHSRDESTDSGLSMSSYSVPRTPDDFLNSVDEMDTGDTINQSTLPSQQNRFPDYLEAIPGTNVDLGTLEGDGMNIEGEELMPSLQEALSSDILNDMESVLAATKLDKESFLTWL,504,NP_001123617.1.csv,refseq-YAP1-NM_001130145.2_clinical_seed_0_final,refseq-YAP1-NM_001130145.2.a2m,Invitae,refseq-YAP1-NM_001130145.2.npy,1,504,504
+NP_001123990.1,MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNSPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFGPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSKKT,532,NP_001123990.1.csv,refseq-CSGALNACT1-NM_001130518.1_clinical_seed_0_final,refseq-CSGALNACT1-NM_001130518.1.a2m,Invitae,refseq-CSGALNACT1-NM_001130518.1.npy,1,532,532
+NP_001124162.1,MEDGPSNNASCFRRLTECFLSPSLTDEKVKAYLSLHPQVLDEFVSESVSAETVEKWLKRKNNKSEDESAPKEVSRYQDTNMQGVVYELNSYIEQRLDTGGDNQLLLYELSSIIKIATKADGFALYFLGECNNSLCIFTPPGIKEGKPRLIPAGPITQGTTVSAYVAKSRKTLLVEDILGDERFPRGTGLESGTRIQSVLCLPIVTAIGDLIGILELYRHWGKEAFCLSHQEVATANLAWASVAIHQVQVCRGLAKQTELNDFLLDVSKTYFDNIVAIDSLLEHIMIYAKNLVNADRCALFQVDHKNKELYSDLFDIGEEKEGKPVFKKTKEIRFSIEKGIAGQVARTGEVLNIPDAYADPRFNREVDLYTGYTTRNILCMPIVSRGSVIGVVQMVNKISGSAFSKTDENNFKMFAVFCALALHCANMYHRIRHSECIYRVTMEKLSYHSICTSEEWQGLMQFTLPVRLCKEIELFHFDIGPFENMWPGIFVYMVHRSCGTSCFELEKLCRFIMSVKKNYRRVPYHNWKHAVTVAHCMYAILQNNHTLFTDLERKGLLIACLCHDLDHRGFSNSYLQKFDHPLAALYSTSTMEQHHFSQTVSILQLEGHNIFSTLSSSEYEQVLEIIRKAIIATDLALYFGNRKQLEEMYQTGSLNLNNQSHRDRVIGLMMTACDLCSVTKLWPVTKLTANDIYAEFWAEGDEMKKLGIQPIPMMDRDKKDEVPQGQLGFYNAVAIPCYTTLTQILPPTEPLLKACRDNLSQWEKVIRGEETATWISSPSVAQKAAASED,789,NP_001124162.1.csv,refseq-PDE10A-NM_001130690.2_clinical_seed_0_final,refseq-PDE10A-NM_001130690.2.a2m,Invitae,refseq-PDE10A-NM_001130690.2.npy,1,789,789
+NP_001124295.1,MPARTAPARVPTLAVPAISLPDDVRRRLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEELSEEGYLAKVKSLLNKDLSLENGAHAYNREVNGRLENGNQARSEARRVGMADANSPPKPLSKPRTPRRSKSDGEAKRSRDPPASASQVTGIRAEPSPSPRITRKSTRQTTITSHFAKGPAKRKPQEESERAKSDESIKEEDKDQDEKRRRVTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPTPKQKLKEEPDREARAGVQADEDEDGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISDEKDEDEKEEKRRKTTPKEPTEKKMARAKTVMNSKTHPPKCIQCGQYLDDPDLKYGQHPPDAVDEPQMLTNEKLSIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIELFFSGSAKPIYDDDPSLEGGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSPEYAPIFGLMQEKIYISKIVVEFLQSNSDSTYEDLINKIETTVPPSGLNLNRFTEDSLLRHAQFVVEQVESYDEAGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDRGPTKATTTKLVYQIFDTFFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDMVKFGGSGRSKQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQGKKKKQNKNRISWVGEAVKTDGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFHAHWFCAGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPESLLEGDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLAEMRQKEIPRVLEQLEDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHYRKYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKSTPASYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGPNRFYFLEAYNAKSKSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSEGFHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETTNSRGQRLPQKGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRPRFFLLENVRNFVSFKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHVFAPRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPEVRNGASALEISYNGEPQSWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDWRDLPNIEVRLSDGTMARKLRYTHHDRKNGRSSSGALRGVCSCVEAGKACDPAARQFNTLIPWCLPHTGNRHNHWAGLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRECARSQGFPDTYRLFGNILDKHRQVGNAVPPPLAKAIGLEIKLCMLAKARESASAKIKEEEAAKD,1632,NP_001124295.1.csv,refseq-DNMT1-NM_001130823.1_clinical_seed_0_final,refseq-DNMT1-NM_001130823.1.a2m,Invitae,refseq-DNMT1-NM_001130823.1.npy,1,1632,1632
+NP_001124497.1,MAMRELVEAECGGANPLMKLAGHFTQDKALRQEGLRPGPWPPGAPASEAASKPLGVASEDELVAEFLQDQNAPLVSRAPQTFKMDDLLAEMQQIEQSNFRQAPQRAPGVADLALSENWAQEFLAAGDAVDVTQDYNETDWSQEFISEVTDPLSVSPARWAEEYLEQSEEKLWLGEPEGTATDRWYDEYHPEEDLQHTASDFVAKVDDPKLANSEFLKFVRQIGEGQVSLESGAGSGRAQAEQWAAEFIQQQGTSDAWVDQFTRPVNTSALDMEFERAKSAIESDVDFWDKLQAELEEMAKRDAEAHPWLSDYDDLTSATYDKGYQFEEENPLRDHPQPFEEGLRRLQEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISALRRCLELKPDNQTALMALAVSFTNESLQRQACETLRDWLRYTPAYAHLVTPAEEGAGGAGLGPSKRILGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAVDCFTAALSVRPNDYLLWNKLGATLANGNQSEEAVAAYRRALELQPGYIRSRYNLGISCINLGAHREAVEHFLEALNMQRKSRGPRGEGGAMSENIWSTLRLALSMLGQSDAYGAADARDLSTLLTMFGLPQ,639,NP_001124497.1.csv,refseq-PEX5-NM_001131025.1_clinical_seed_0_final,refseq-PEX5-NM_001131025.1.a2m,Invitae,refseq-PEX5-NM_001131025.1.npy,1,639,639
+NP_001125.1,MKWVESIFLIFLLNFTESRTLHRNEYGIASILDSYQCTAEISLADLATIFFAQFVQEATYKEVSKMVKDALTAIEKPTGDEQSSGCLENQLPAFLEELCHEKEILEKYGHSDCCSQSEEGRHNCFLAHKKPTPASIPLFQVPEPVTSCEAYEEDRETFMNKFIYEIARRHPFLYAPTILLWAARYDKIIPSCCKAENAVECFQTKAATVTKELRESSLLNQHACAVMKNFGTRTFQAITVTKLSQKFTKVNFTEIQKLVLDVAHVHEHCCRGDVLDCLQDGEKIMSYICSQQDTLSNKITECCKLTTLERGQCIIHAENDEKPEGLSPNLNRFLGDRDFNQFSSGEKNIFLASFVHEYSRRHPQLAVSVILRVAKGYQELLEKCFQTENPLECQDKGEEELQKYIQESQALAKRSCGLFQKLGEYYLQNAFLVAYTKKAPQLTSSELMAITRKMAATAATCCQLSEDKLLACGEGAADIIIGHLCIRHEMTPVNPGVGQCCTSSYANRRPCFSSLVVDETYVPPAFSDDKFIFHKDLCQAQGVALQTMKQEFLINLVKQKPQITEEQLEAVIADFSGLLEKCCQGQEQEVCFAEEGQKLISKTRAALGV,609,NP_001125.1.csv,refseq-AFP-NM_001134.3_clinical_seed_0_final,refseq-AFP-NM_001134.3.a2m,Invitae,refseq-AFP-NM_001134.3.npy,1,609,609
+NP_001127835.2,MVLAAAMSQDADPSGPEQPDRVACSVPGARASPAPSGPRGMQQPPPPPQPPPPPQAGLPQIIQNAAKLLDKNPFSVSNPNPLLPSPASLQLAQLQAQLTLHRLKLAQTAVTNNTAAATVLNQVLSKVAMSQPLFNQLRHPSVITGPHGHAGVPQHAAAIPSTRFPSNAIAFSPPSQTRGPGPSMNLPNQPPSAMVMHPFTGVMPQTPGQPAVILGIGKTGPAPATAGFYEYGKASSGQTYGPETDGQPGFLPSSASTSGSVTYEGHYSHTGQDGQAAFSKDFYGPNSQGSHVASGFPAEQAGGLKSEVGPLLQGTNSQWESPHGFSGQSKPDLTAGPMWPPPHNQPYELYDPEEPTSDRTPPSFGGRLNNSKQGFIGAGRRAKEDQALLSVRPLQAHELNDFHGVAPLHLPHICSICDKKVFDLKDWELHVKGKLHAQKCLVFSENAGIRCILGSAEGTLCASPNSTAVYNPAGNEDYASNLGTSYVPIPARSFTQSSPTFPLASVGTTFAQRKGAGRVVHICNLPEGSCTENDVINLGLPFGKVTNYILMKSTNQAFLEMAYTEAAQAMVQYYQEKSAVINGEKLLIRMSKRYKELQLKKPGKAVAAIIQDIHSQRERDMFREADRYGPERPRSRSPVSRSLSPRSHTPSFTSCSSSHSPPGPSRADWGNGRDSWEHSPYARREEERDPAPWRDNGDDKRDRMDPWAHDRKHHPRQLDKAELDERPEGGRPHREKYPRSGSPNLPHSVSSYKSREDGYYRKEPKAKSDKYLKQQQDAPGRSRRKDEARLRESRHPHPDDSGKEDGLGPKVTRAPEGAKAKQNEKNKTKRTDRDQEGADDRKENTMAENEAGKEEQEGMEESPQSVGRQEKEAEFSDPENTRTKKEQDWESESEAEGESWYPTNMEELVTVDEVGEEEDFIVEPDIPELEEIVPIDQKDKICPETCLCVTTTLDLDLAQDFPKEGVKAVGNGAAEISLKSPRELPSASTSCPSDMDVEMPGLNLDAERKPAESETGLSLEDSDCYEKEAKGVESSDVHPAPTVQQMSSPKPAEERARQPSPFVDDCKTRGTPEDGACEGSPLEEKASPPIETDLQNQACQEVLTPENSRYVEMKSLEVRSPEYTEVELKQPLSLPSWEPEDVFSELSIPLGVEFVVPRTGFYCKLCGLFYTSEETAKMSHCRSAVHYRNLQKYLSQLAEEGLKETEGADSPRPEDSGIVPRFERKKL,1227,NP_001127835.2.csv,NP_001127835.2_colabfold_clinical_seed_0_final,NP_001127835.2_colabfold.a2m,colabfold,NP_001127835.2_colabfold_theta_0.2.npy,1,1227,1227
+NP_001128243.1,MSRRFTVTSLPPAGPARSPDPESRRHSVADPRHLPGEDVKGDGNPKESSPFINSTDTEKGKEYDGKNMALFEEEMDTSPMVSSLLSGLANYTNLPQGSREHEEAENNEGGKKKPVQAPRMGTFMGVYLPCLQNIFGVILFLRLTWVVGIAGIMESFCMVFICCSCTMLTAISMSAIATNGVVPAGGSYYMISRSLGPEFGGAVGLCFYLGTTFAGAMYILGTIEILLAYLFPAMAIFKAEDASGEAAAMLNNMRVYGTCVLTCMATVVFVGVKYVNKFALVFLGCVILSILAIYAGVIKSAFDPPNFPICLLGNRTLSRHGFDVCAKLAWEGNETVTTRLWGLFCSSRFLNATCDEYFTRNNVTEIQGIPGAASGLIKENLWSSYLTKGVIVERSGMTSVGLADGTPIDMDHPYVFSDMTSYFTLLVGIYFPSVTGIMAGSNRSGDLRDAQKSIPTGTILAIATTSAVYISSVVLFGACIEGVVLRDKFGEAVNGNLVVGTLAWPSPWVIVIGSFFSTCGAGLQSLTGAPRLLQAISRDGIVPFLQVFGHGKANGEPTWALLLTACICEIGILIASLDEVAPILSMFFLMCYMFVNLACAVQTLLRTPNWRPRFRYYHWTLSFLGMSLCLALMFICSWYYALVAMLIAGLIYKYIEYRGAEKEWGDGIRGLSLSAARYALLRLEEGPPHTKNWRPQLLVLVRVDQDQNVVHPQLLSLTSQLKAGKGLTIVGSVLEGTFLENHPQAQRAEESIRRLMEAEKVKGFCQVVISSNLRDGVSHLIQSGGLGGLQHNTVLVGWPRNWRQKEDHQTWRNFIELVRETTAGHLALLVTKNVSMFPGNPERFSEGSIDVWWIVHDGGMLMLLPFLLRHHKVWRKCKMRIFTVAQMDDNSIQMKKDLTTFLYHLRITAEVEVVEMHESDISAYTYEKTLVMEQRSQILKQMHLTKNEREREIQSITDESRGSIRRKNPANTRLRLNVPEETAGDSEEKPEEEVQLIHDQSAPSCPSSSPSPGEEPEGEGETDPEKVHLTWTKDKSVAEKNKGPSPVSSEGIKDFFSMKPEWENLNQSNVRRMHTAVRLNEVIVKKSRDAKLVLLNMPGPPRNRNGDENYMEFLEVLTEHLDRVMLVRGGGREVITIYS,1139,NP_001128243.1.csv,S12A5_HUMAN_b07_clinical_seed_0_final,S12A5_HUMAN_b07.a2m,EVE,S12A5_HUMAN_b07_theta_0.2.npy,1,1139,1139
+NP_001128493.1,MNEKSCSFHSKEELRDGQGERLSAGYSPSYDKDKSVLAFRGIPISELKNHGILQALTTEAYEWEPRVVSTEVVRAQEEWEAVDTIQPETGSQASSEQPGQLISFSEALQHFQTVDLSPFKKRIQPTIRRTGLAALRHYLFGPPKLHQRLREERDLVLTIAQCGLDSQDPVHGRVLQTIYKKLTGSKFDCALHGNHWEDLGFQGANPATDLRGAGFLALLHLLYLVMDSKTLPMAQEIFRLSRHHIQQFPFCLMSVNITHIAIQALREECLSRECNRQQKVIPVVNSFYAATFLHLAHVWRTQRKTISDSGFVLKELEVLAKKSPRRLLKTLELYLARVSKGQASLLGAQKCYGPEAPPFKDLTFTGESDLQSHSSEGVWLI,381,NP_001128493.1.csv,refseq-ELMOD3-NM_001135021.1_clinical_seed_0_final,refseq-ELMOD3-NM_001135021.1.a2m,Invitae,refseq-ELMOD3-NM_001135021.1.npy,1,381,381
+NP_001128715.1,MAEARKRRELLPLIYHHLLRAGYVRAAREVKEQSGQKCFLAQPVTLLDIYTHWQQTSELGRKRKAEEDAALQAKKTRVSDPISTSESSEEEEEAEAETAKATPRLASTNSSVLGADLPSSMKEKAKAETEKAGKTGNSMPHPATGKTVANLLSGKSPRKSAEPSANTTLVSETEEEGSVPAFGAAAKPGMVSAGQADSSSEDTSSSSDETDVEGKPSVKPAQVKASSVSTKESPARKAAPAPGKVGDVTPQVKGGALPPAKRAKKPEEESESSEEGSESEEEAPAGTRSQVKASEKILQVRAASAPAKGTPGKGATPAPPGKAGAVASQTKAGKPEEDSESSSEESSDSEEETPAAKALLQAKASGKTSQVGAASAPAKESPRKGAAPAPPGKTGPAVAKAQAGKREEDSQSSSEESDSEEEAPAQAKPSGKAPQVRAASAPAKESPRKGAAPAPPRKTGPAAAQVQVGKQEEDSRSSSEESDSDREALAAMNAAQVKPLGKSPQVKPASTMGMGPLGKGAGPVPPGKVGPATPSAQVGKWEEDSESSSEESSDSSDGEVPTAVAPAQEKSLGNILQAKPTSSPAKGPPQKAGPVAVQVKAEKPMDNSESSEESSDSADSEEAPAAMTAAQAKPALKIPQTKACPKKTNTTASAKVAPVRVGTQAPRKAGTATSPAGSSPAVAGGTQRPAEDSSSSEESDSEEEKTGLAVTVGQAKSVGKGLQVKAASVPVKGSLGQGTAPVLPGKTGPTVTQVKAEKQEDSESSEEESDSEEAAASPAQVKTSVKKTQAKANPAAARAPSAKGTISAPGKVVTAAAQAKQRSPSKVKPPVRNPQNSTVLARGPASVPSVGKAVATAAQAQTGPEEDSGSSEEESDSEEEAETLAQVKPSGKTHQIRAALAPAKESPRKGAAPTPPGKTGPSAAQAGKQDDSGSSSEESDSDGEAPAAVTSAQVIKPPLIFVDPNRSPAGPAATPAQAQAASTPRKARASESTARSSSSESEDEDVIPATQCLTPGIRTNVVTMPTAHPRIAPKASMAGASSSKESSRISDGKKQEGPATQVSKKNPASLPLTQAALKVLAQKASEAQPPVARTQPSSGVDSAVGTLPATSPQSTSVQAKGTNKLRKPKLPEVQQATKAPESSDDSEDSSDSSSGSEEDGEGPQGAKSAHTLGPTPSRTETLVEETAAESSEDDVVAPSQSLLSGYMTPGLTPANSQASKATPKLDSSPSVSSTLAAKDDPDGKQEAKPQQAAGMLSPKTGGKEAASGTTPQKSRKPKKGAGNPQASTLALQSNITQCLLGQPWPLNEAQVQASVVKVLTELLEQERKKVVDTTKESSRKGWESRKRKLSGDQPAARTPRSKKKKKLGAGEGGEASVSPEKTSTTSKGKAKRDKASGDVKEKKGKGSLGSQGAKDEPEEELQKGMGTVEGGDQSNPKSKKEKKKSDKRKKDKEKKEKKKKAKKASTKDSESPSQKKKKKKKKTAEQTV,1488,NP_001128715.1.csv,refseq-TCOF1-NM_001135243.1_clinical_seed_0_final,refseq-TCOF1-NM_001135243.1.a2m,Invitae,refseq-TCOF1-NM_001135243.1.npy,1,1488,1488
+NP_001129123.1,MAGDLSAGFFMEELNTYRQKQGVVLKYQELPNSGPPHDRRFTFQVIIDGREFPEGEGRSKKEAKNAAAKLAVEILNKEKKAVSPLLLTTTNSSEGLSMGNYIGLINRIAQKKRLTVNYEQCASGVHGPEGFHYKCKMGQKEYSIGTGSTKQEAKQLAAKLAYLQILSEETSVKSDYLSSGSFATTCESQSNSLVTSTLASESSSEGDFSADTSEINSNSDSLNSSSLLMNGLRNNQRKAKRSLAPRFDLPDMKETKYTVDKRFGMDFKEIELIGSGGFGQVFKAKHRIDGKTYVIKRVKYNNEKAEREVKALAKLDHVNIVHYNGCWDGFDYDPETSDDSLESSDYDPENSKNSSRSKTKCLFIQMEFCDKGTLEQWIEKRRGEKLDKVLALELFEQITKGVDYIHSKKLIHRDLKPSNIFLVDTKQVKIGDFGLVTSLKNDGKRTRSKGTLRYMSPEQISSQDYGKEVDLYALGLILAELLHVCDTAFETSKFFTDLRDGIISDIFDKKEKTLLQKLLSKKPEDRPNTSEILRTLTVWKKSPEKNERHTC,551,NP_001129123.1.csv,refseq-EIF2AK2-NM_001135651.2_clinical_seed_0_final,refseq-EIF2AK2-NM_001135651.2.a2m,Invitae,refseq-EIF2AK2-NM_001135651.2.npy,1,551,551
+NP_001129277.1,MVSESHHEALAAPPVTTVATVLPSNATEPASPGEGKEDAFSKLKEKFMNELHKIPLPPWALIAIAIVAVLLVLTCCFCICKKCLFKKKNKKKGKEKGGKNAINMKDVKDLGKTMKDQALKDDDAETGLTDGEEKEEPKEEEKLGKLQYSLDYDFQNNQLLVGIIQAAELPALDMGGTSDPYVKVFLLPDKKKKFETKVHRKTLNPVFNEQFTFKVPYSELGGKTLVMAVYDFDRFSKHDIIGEFKVPMNTVDFGHVTEEWRDLQSAEKEEQEKLGDICFSLRYVPTAGKLTVVILEAKNLKKMDVGGLSDPYVKIHLMQNGKRLKKKKTTIKKNTLNPYYNESFSFEVPFEQIQKVQVVVTVLDYDKIGKNDAIGKVFVGYNSTGAELRHWSDMLANPRRPIAQWHTLQVEEEVDAMLAVKK,422,NP_001129277.1.csv,refseq-SYT1-NM_001135805.2_clinical_seed_0_final,refseq-SYT1-NM_001135805.2.a2m,Invitae,refseq-SYT1-NM_001135805.2.npy,1,422,422
+NP_001129392.2,MYFLSGWPKRLLCPLGSPAEAPFHVQSDPQRAFFAVLAAARLSIWYSRPSVLIVTYKEPAKSSTQFGSYKQAEWRPDSTMIAVSTANGYILFFHITSTRGDKYLYEPVYPKGSPQMKGTPHFKEEQCAPALNLEMRKILDLQAPIMSLQSVLEDLLVATSDGLLHLIHWEGMTNGRKAINLCTVPFSVDLQSSRVGSFLGFTDVHIRDMEYCATLDGFAVVFNDGKVGFITPVSSRFTAEQLHGVWPQDVVDGTCVAVNNKYRLMAFGCVSGSVQVYTIDNSTGAMLLSHKLELTAKQYPDIWNKTGAVKLMRWSPDNSVVIVTWEYGGLSLWSVFGAQLICTLGGDFAYRSDGTKKDPLKINSMSWGAEGYHLWVISGFGSQNTEIESDLRSVVKQPSILLFQFIKSVLTVNPCMSNQEQVLLQGEDRLYLNCGEASQTQNPRSSSTHSEHKPSREKSPFADGGLESQGLSTLLGHRHWHVVQISSTYLESNWPIRFSAIDKLGQNIAVVGKFGFAHYSLLTKKWKLFGNITQEQNMIVTGGLAWWNDFMVLACYNINDRQEELRVYLRTSNLDNAFAHVTKAQAETLLLSVFQDMVIVFRADCSICLYSIERKSDGPNTTAGIQVLQEVSMSRYIPHPFLVVSVTLTSVSTENGITLKMPQQARGAESIMLNLAGQLIMMQRDRSGPQIREKDSNPNNQRKLLPFCPPVVLAQSVENVWTTCRANKQKRHLLEALWLSCGGAGMKVWLPLFPRDHRKPHSFLSQRIMLPFHINIYPLAVLFEDALVLGAVNDTLLYDSLYTRNNAREQLEVLFPFCVVERTSQIYLHHILRQLLVRNLGEQALLLAQSCATLPYFPHVLELMLHEVLEEEATSREPIPDPLLPTVAKFITEFPLFLQTVVHCARKTEYALWNYLFAAVGNPKDLFEECLMAQDLDTAASYLIILQNMEVPAVSRQHATLLFNTALEQGKWDLCRHMIRFLKAIGSGESETPPSTPTAQEPSSSGGFEFFRNRSISLSQSAENVPASKFSLQKTLSMPSGPSGKRWSKDSDCAENMYIDMMLWRHARRLLEDVRLKDLGCFAAQLGFELISWLCKERTRAARVDNFVIALKRLHKDFLWPLPIIPASSISSPFKNGKYRTGNVDFMSLVQGELYFTPCIYTFCY,1165,NP_001129392.2.csv,refseq-RIC1-NM_001135920.2_clinical_seed_0_final,refseq-RIC1-NM_001135920.2.a2m,Invitae,refseq-RIC1-NM_001135920.2_theta_0.2.npy,1,1165,1165
+NP_001129507.1,MQGSSLWLSLTFRSARVLSRARFFEWQSPGLPNTAAMENGTGPYGEERPREVQETTVTEGAAKIAFPSANEVFYNPVQEFNRDLTCAVITEFARIQLGAKGIQIKVPGEKDTQKVVVDLSEQEEEKVELKESENLASGDQPRTAAVGEICEEGLHVLEGLAASGLRSIRFALEVPGLRSVVANDASTRAVDLIRRNVQLNDVAHLVQPSQADARMLMYQHQRVSERFDVIDLDPYGSPATFLDAAVQAVSEGGLLCVTCTDMAVLAGNSGETCYSKYGAMALKSRACHEMALRIVLHSLDLRANCYQRFVVPLLSISADFYVRVFVRVFTGQAKVKASASKQALVFQCVGCGAFHLQRLGKASGVPSGRAKFSAACGPPVTPECEHCGQRHQLGGPMWAEPIHDLDFVGRVLEAVSANPGRFHTSERIRGVLSVITEELPDVPLYYTLDQLSSTIHCNTPSLLQLRSALLHADFRVSLSHACKNAVKTDAPASALWDIMRCWEKECPVKRERLSETSPAFRILSVEPRLQANFTIREDANPSSRQRGLKRFQANPEANWGPRPRARPGGKAADEAMEERRRLLQNKRKEPPEDVAQRAARLKTFPCKRFKEGTCQRGDQCCYSHSPPTPRVSADAAPDCPETSNQTPPGPGAAAGPGID,659,NP_001129507.1.csv,refseq-TRMT1-NM_001136035.2_clinical_seed_0_final,refseq-TRMT1-NM_001136035.2.a2m,Invitae,refseq-TRMT1-NM_001136035.2.npy,1,659,659
+NP_001129524.1,MAEAHQAVGFRPSLTSDGAEVELSAPVLQEIYLSGLRSWKRHLSRFWNDFLTGVFPASPLSWLFLFSAIQLAWFLQLDPSLGLMEKIKELLPDWGGQHHGLRGVLAAALFASCLWGALIFTLHVALRLLLSYHGWLLEPHGAMSSPTKTWLALVRIFSGRHPMLFSYQRSLPRQPVPSVQDTVRKYLESVRPILSDEDFDWTAVLAQEFLRLQASLLQWYLRLKSWWASNYVSDWWEEFVYLRSRNPLMVNSNYYMMAARAGNAVHALLLYRHRLNRQEIPPTLLMGMRPLCSAQYEKIFNTTRIPGVQKDYIRHLHDSQHVAVFHRGRFFRMGTHSRNSLLSPRALEQQFQRILDDPSPACPHEEHLAALTAAPRGTWAQVRTSLKTQAAEALEAVEGAAFFVSLDAEPAGLTREDPAASLDAYAHALLAGRGHDRWFDKSFTLIVFSNGKLGLSVEHSWADCPISGHMWEFTLATECFQLGYSTDGHCKGHPDPTLPQPQRLQWDLPDQIHSSISLALRGAKILSENVDCHVVPFSLFGKSFIRRCHLSSDSFIQIALQLAHFRDRGQFCLTYESAMTRLFLEGRTETVRSCTREACNFVRAMEDKEKTDPQCLALFRVAVDKHQALLKAAMSGQGVDRHLFALYIVSRFLHLQSPFLTQVHSEQWQLSTSQIPVQQMHLFDVHNYPDYVSSGGGFGPADDHGYGVSYIFMGDGMITFHISSKKSSTKTDSHRLGQHIEDALLDVASLFQAGQHFKRRFRGSGKENSRHRCGFLSRQTGASKASMTSTDF,792,NP_001129524.1.csv,refseq-CPT1C-NM_001136052.2_clinical_seed_0_final,refseq-CPT1C-NM_001136052.2.a2m,Invitae,refseq-CPT1C-NM_001136052.2.npy,1,792,792
+NP_001129649.1,MMTAKAVDKIPVTLSGFVHQLSDNIYPVEDLAATSVTIFPNAELGGPFDQMNGVAGDGMINIDMTGEKRSLDLPYPSSFAPVSAPRNQTFTYMGKFSIDPQYPGASCYPEGIINIVSAGILQGVTSPASTTASSSVTSASPNPLATGPLGVCTMSQTQPDLDHLYSPPPPPPPYSGCAGDLYQDPSAFLSAATTSTSSSLAYPPPPSYPSPKPATDPGLFPMIPDYPGFFPSQCQRDLHGTAGPDRKPFPCPLDTLRVPPPLTPLSTIRNFTLGGPSAGVTGPGASGGSEGPRLPGSSSAAAAAAAAAAYNPHHLPLRPILRPRKYPNRPSKTPVHERPYPCPAEGCDRRFSRSDELTRHIRIHTGHKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDYCGRKFARSDERKRHTKIHLRQKERKSSAPSASVPAPSTASCSGGVQPGGTLCSSNSSSLGGGPLAPCSSRTRTP,476,NP_001129649.1.csv,EGR2_HUMAN_b03_clinical_seed_0_final,EGR2_HUMAN_b03.a2m,EVE,EGR2_HUMAN_b03_theta_0.2.npy,1,476,476
+NP_001129668.1,MDWQPDEQGLQQVLQLLKDSQSPNTATQRIVQDKLKQLNQFPDFNNYLIFVLTRLKSEDEPTRSLSGLILKNNVKAHYQSFPPPVADFIKQECLNNIGDASSLIRATIGILITTIASKGELQMWPELLPQLCNLLNSEDYNTCEGAFGALQKICEDSSELLDSDALNRPLNIMIPKFLQFFKHCSPKIRSHAIACVNQFIMDRAQALMDNIDTFIEHLFALAVDDDPEVRKNVCRALVMLLEVRIDRLIPHMHSIIQYMLQRTQDHDENVALEACEFWLTLAEQPICKEVLASHLVQLIPILVNGMKYSEIDIILLKGDVEEDEAVPDSEQDIKPRFHKSRTVTLPHEAERPDGSEDAEDDDDDDALSDWNLRKCSAAALDVLANVFREELLPHLLPLLKGLLFHPEWVVKESGILVLGAIAEGCMQGMVPYLPELIPHLIQCLSDKKALVRSIACWTLSRYAHWVVSQPPDMHLKPLMTELLKRILDGNKRVQEAACSAFATLEEEACTELVPYLSYILDTLVFAFGKYQHKNLLILYDAIGTLADSVGHHLNQPEYIQKLMPPLIQKWNELKDEDKDLFPLLECLSSVATALQSGFLPYCEPVYQRCVTLVQKTLAQAMMYTQHPEQYEAPDKDFMIVALDLLSGLAEGLGGHVEQLVARSNIMTLLFQCMQDSMPEVRQSSFALLGDLTKACFIHVKPCIAEFMPILGTNLNPEFISVCNNATWAIGEICMQMGAEMQPYVQMVLNNLVEIINRPNTPKTLLENTGRLTSPSAIPAITIGRLGYVCPQEVAPMLQQFIRPWCTSLRNIRDNEEKDSAFRGICMMIGVNPGGVVQDFIFFCDAVASWVSPKDDLRDMFYKILHGFKDQVGEDNWQQFSEQFPPLLKERLAAFYGV,897,NP_001129668.1.csv,refseq-TNPO2-NM_001136196.1_clinical_seed_0_final,refseq-TNPO2-NM_001136196.1.a2m,Invitae,refseq-TNPO2-NM_001136196.1.npy,1,897,897
+NP_001130.1,MATYKVRVATGTDLLSGTRDSISLTIVGTQGESHKQLLNHFGRDFATGAVGQYTVQCPQDLGELIIIRLHKERYAFFPKDPWYCNYVQICAPNGRIYHFPAYQWMDGYETLALREATGKTTADDSLPVLLEHRKEEIRAKQDFYHWRVFLPGLPSYVHIPSYRPPVRRHRNPNRPEWNGYIPGFPILINFKATKFLNLNLRYSFLKTASFFVRLGPMALAFKVRGLLDCKHSWKRLKDIRKIFPGKKSVVSEYVAEHWAEDTFFGYQYLNGVNPGLIRRCTRIPDKFPVTDDMVAPFLGEGTCLQAELEKGNIYLADYRIMEGIPTVELSGRKQHHCAPLCLLHFGPEGKMMPIAIQLSQTPGPDCPIFLPSDSEWDWLLAKTWVRYAEFYSHEAIAHLLETHLIAEAFCLALLRNLPMCHPLYKLLIPHTRYTVQINSIGRAVLLNEGGLSAKGMSLGVEGFAGVMVRALSELTYDSLYLPNDFVERGVQDLPGYYYRDDSLAVWNALEKYVTEIITYYYPSDAAVEGDPELQSWVQEIFKECLLGRESSGFPRCLRTVPELIRYVTIVIYTCSAKHAAVNTGQMEFTAWMPNFPASMRNPPIQTKGLTTLETFMDTLPDVKTTCITLLVLWTLSREPDDRRPLGHFPDIHFVEEAPRRSIEAFRQRLNQISHDIRQRNKCLPIPYYYLDPVLIENSISI,701,NP_001130.1.csv,refseq-ALOX12B-NM_001139.2_clinical_seed_0_final,refseq-ALOX12B-NM_001139.2.a2m,Invitae,refseq-ALOX12B-NM_001139.2.npy,1,701,701
+NP_001136.1,MVMGLGVLLLVFVLGLGLTPPTLAQDNSRYTHFLTQHYDAKPQGRDDRYCESIMRRRGLTSPCKDINTFIHGNKRSIKAICENKNGNPHRENLRISKSSFQVTTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQSIFRRP,147,NP_001136.1.csv,refseq-ANG-NM_001145.4_clinical_seed_0_final,refseq-ANG-NM_001145.4.a2m,Invitae,refseq-ANG-NM_001145.4.npy,1,147,147
+NP_001136029.1,MTLLIRVLLQERGAQDRRIQDLETELEKMEARLNAALREKTSLSANNATLEKQLIELTRTNELLKSKFSENGNQKNLRILSLELMKLRNKRETKMRGMMAKQEGMEMKLQVTQRSLEESQGKIAQLEGKLVSIEKEKIDEKSETEKLLEYIEEISCASDQVEKYKLDIAQLEENLKEKNDEILSLKQSLEENIVILSKQVEDLNVKCQLLEKEKEDHVNRNREHNENLNAEMQNLKQKFILEQQEREKLQQKELQIDSLLQQEKELSSSLHQKLCSFQEEMVKEKNLFEEELKQTLDELDKLQQKEEQAERLVKQLEEEAKSRAEELKLLEEKLKGKEAELEKSSAAHTQATLLLQEKYDSMVQSLEDVTAQFESYKALTASEIEDLKLENSSLQEKAAKAGKNAEDVQHQILATESSNQEYVRMLLDLQTKSALKETEIKEITVSFLQKITDLQNQLKQQEEDFRKQLEDEEGRKAEKENTTAELTEEINKWRLLYEELYNKTKPFQLQLDAFEVEKQALLNEHGAAQEQLNKIRDSYAKLLGHQNLKQKIKHVVKLKDENSQLKSEVSKLRCQLAKKKQSETKLQEELNKVLGIKHFDPSKAFHHESKENFALKTPLKEGNTNCYRAPMECQESWK,638,NP_001136029.1.csv,refseq-HMMR-NM_001142557.2_clinical_seed_0_final,refseq-HMMR-NM_001142557.2.a2m,Invitae,refseq-HMMR-NM_001142557.2.npy,1,638,638
+NP_001136082.1,MEVVGDFEYSKRDLVGHGAFAVVFRGRHRQKTDWEVAIKSINKKNLSKSQILLGKEIKILKELQHENIVALYDVQELPNSVFLVMEYCNGGDLADYLQAKGTLSEDTIRVFLHQIAAAMRILHSKGIIHRDLKPQNILLSYANRRKSSVSGIRIKIADFGFARYLHSNMMAATLCGSPMYMAPEVIMSQHYDAKADLWSIGTVIYQCLVGKPPFQANSPQDLRMFYEKNRSLMPSIPRETSPYLANLLLGLLQRNQKDRMDFEAFFSHPFLEQGPVKKSCPVPVPMYSGSVSGSSCGSSPSCRFASPPSLPDMQHIQEENLSSPPLGPPNYLQVSKDSASTSSKNSSCDTDDFVLVPHNISSDHSCDMPVGTAGRRASNEFLVCGGQCQPTVSPHSETAPIPVPTQIRNYQRIEQNLTSTASSGTNVHGSPRSAVVRRSNTSPMGFLRPGSCSPVPADTAQTVGRRLSTGSSRPYSPSPLVGTIPEQFSQCCCGHPQGHDSRSRNSSGSPVPQAQSPQSLLSGARLQSAPTLTDIYQNKQKLRKQHSDPVCPSHTGAGYSYSPQPSRPGSLGTSPTKHLGSSPRSSDWFFKTPLPTIIGSPTKTTAPFKIPKTQASSNLLALVTRHGPAEEQSKDGNEPRECAHCLLVQGSERQRAEQQSKAVFGRSVSTGKLSDQQGKTPICRHQGSTDSLNTERPMDIAPAGACGGVLAPPAGTAASSKAVLFTVGSPPHSAAAPTCTHMFLRTRTTSVGPSNSGGSLCAMSGRVCVGSPPGPGFGSSPPGAEAAPSLRYVPYGASPPSLEGLITFEAPELPEETLMEREHTDTLRHLNVMLMFTECVLDLTAMRGGNPELCTSAVSLYQIQESVVVDQISQLSKDWGRVEQLVLYMKAAQLLAASLHLAKAQIKSGKLSPSTAVKQVVKNLNERYKFCITMCKKLTEKLNRFFSDKQRFIDEINSVTAEKLIYNCAVEMVQSAALDEMFQQTEDIVYRYHKAALLLEGLSRILQDPADIENVHKYKCSIERRLSALCHSTATV,1036,NP_001136082.1.csv,refseq-ULK2-NM_001142610.1_clinical_seed_0_final,refseq-ULK2-NM_001142610.1.a2m,Invitae,refseq-ULK2-NM_001142610.1.npy,1,1036,1036
+NP_001136256.1,MSSSCSGLSRVLVAVATALVSASSPCPQAWGPPGVQYGQPGRSVKLCCPGVTAGDPVSWFRDGEPKLLQGPDSGLGHELVLAQADSTDEGTYICQTLDGALGGTVTLQLGYPPARPVVSCQAADYENFSCTWSPSQISGLPTRYLTSYRKKTVLGADSQRRSPSTGPWPCPQDPLGAARCVVHGAEFWSQYRINVTEVNPLGASTRLLDVSLQSILRPDPPQGLRVESVPGYPRRLRASWTYPASWPCQPHFLLKFRLQYRPAQHPAWSTVEPAGLEEVITDAVAGLPHAVRVSARDFLDAGTWSTWSPEAWGTPSTGTIPKEIPAWGQLHTQPEVEPQVDSPAPPRPSLQPHPRLLDHRDSVEQVAVLASLGILSFLGLVAGALALGLWLRLRRGGKDGSPKPGFLASVIPVDRRPGAPNL,422,NP_001136256.1.csv,refseq-IL11RA-NM_001142784.2_clinical_seed_0_final,refseq-IL11RA-NM_001142784.2.a2m,Invitae,refseq-IL11RA-NM_001142784.2.npy,1,422,422
+NP_001136438.1,MGNSYAGQLKSARFEEALHNSIEASLRCSSVVPRPIFSQLYLDPDQHPFSSADVKPKVEDLDKDLVNRYTQNGSLDFSNNLTVNEMEDDEDDEEMSDSNSPPIPYSQKPAPEGSCTTDGFCQAGKDLRLVSLCMEQIDIPAGFLLVGAKSPNLPEHILVCAVDKRFLPDDHGKNALLGFSGNCIGCGERGFRYFTEFSNHINLKLTTQPKKQKHLKYYLVRSSQGVLSKGPLICWKECRSRQSSASCHSIKPSSSVSSTVTPENGTTNGYKSGFTQTDAANGNSSHGGKGSASSSTPAHTGNYSLSPRPSYASGDQATMFISGPPKKRHRGWYPGSPLPQPGLVVPVPTVRPLSRTEPLLSAPVPQTPLTGILQPRPIPAGETVIVPENLLSNSGVRPVILIGYGTLPYFYGNVGDIVVSPLLVNCYKIPQLENKDLEKLGLTGSQFLSVENMILLTIQYLVRLGPDQVPLREEFEQIMLKAMQEFTLRERALQIGAQCVPVSPGQLPWLARLIASVSQDLVHVVVTQNSLAEGISETLRTLSEMRHYQRLPDYVVVICASKIRGNEFCVVVLGQHQSRALAESMLTTSEFLKEISYELITGKVSFLASHFKTTSLGDDLDKLLEKMQQRRGDSVVTPFDGDLNECVSPQEAAAMIPTQNLDLDNETFHIYQPQLTVARKLLSQVCAIADSGSQSLDLGHFSKVDFIIIVPRSEVLVQQTLQRIRQSGVLVDLGLEENGTAHQRAEKYVVRLDNEIQTKFEVFMRRVKQNPYTLFVLVHDNSHVELTSVISGSLSHSEPSHGLADRVINCREVLEAFNLLVLQVSSFPYTLQTQQSRISSSNEVHWIQLDTGEDVGCEEKLYFGLSEYSKSLQWGITSPLLRCDETFEKMVNTLLERYPRLHSMVVRCYLLIQQYSEALMALTTMASLRDHSTPETLSIMDDLISSPGKNKSGRGHMLIIRVPSVQLAMLAKERLQEVRDKLGLQYRFEIILGNPATELSVATHFVARLKSWRGNEPEEWIPRTYQDLDGLPCIVILTGKDPLGETFPRSLKYCDLRLIDSSYLTRTALEQEVGLACCYVSKEVIRGPTVALDLSGKEQERAAVSENDSDELLIDLERPQSNSSAVTGTSGSIMENGVSSSSTADKSQKQSLTPSFQSPATSLGLDEGVSASSAGAGAGETLKQECDSLGPQMASSTTSKPSSSSSGPRTLPWPGQPIRGCRGPQAALPPVVILSKAAYSLLGSQKSGKLPSSSSLLPHADVAWVSSLRPLLNKDMSSEEQSLYYRQWTLARQHHADYSNQLDPASGTRNFHPRRLLLTGPPQVGKTGSYLQFLRILFRMLIRLLEVDVYDEEEINTDHNESSEVSQSEGEPWPDIESFSKMPFDVSVHDPKYSLMSLVYTEKLAGVKQEVIKESKVEEPRKRETVSIMLTKYAAYNTFHHCEQCRQYMDFTSASQMSDSTLHAFTFSSSMLGEEVQLYFIIPKSKESHFVFSKQGKHLESMRLPLVSDKNLNAVKSPIFTPSSGRHEHGLLNLFHAMEGISHLHLLVVKEYEMPLYRKYWPNHIMLVLPGMFNNAGVGAARFLIKELSYHNLELERNRLEELGIKRQCVWPFIVMMDDSCVLWNIHSVQEPSSQPMEVGVSSKNVSLKTVLQHIEATPKIVHYAILGIQKWSSKLTSQSLKAPFSRCHVHDFILLNTDLTQNVQYDFNRYFCEDADFNLRTNSSGLLICRFNNFSLMKKHVQVGGQRDFIIKPKIMVSESLAPILPLQYICAPDSEHTLLAAPAQFLLEKFLQHASYKLFPKAIHNFRSPVLAIDCYLNIGPEVAICYISSRPHSSNVNCEGVFFSGLLLYLCDSFVGADLKKFKFLKGATLCVICQDRSSLRQTIVRLELEDEWQFRLRDEFQTANSSDDKPLYFLTGRHV,1923,NP_001136438.1.csv,refseq-GREB1L-NM_001142966.1_clinical_seed_0_final,refseq-GREB1L-NM_001142966.1.a2m,Invitae,refseq-GREB1L-NM_001142966.1.npy,1,1923,1923
+NP_001137282.1,MCGATSFLHECTRLILVTTQNAEFLQKGLQVHTCFGVYPHASVWHDCASQKKGCAVYLHVSVEFNKLIPENGFIKFHQVRRVMTILFLTMVISYFGCMKAAPMKEANIRGQGGLAYPGVRTHGTLESVNGPKAGSRGLTSLADTFEHVIEELLDEDQKVRPNEENNKDADLYTSRVMLSSQVPLEPPLLFLLEEYKNYLDAANMSMRVRRHSDPARRGELSVCDSISEWVTAADKKTAVDMSGGTVTVLEKVPVSKGQLKQYFYETKCNPMGYTKEGCRGIDKRHWNSQCRTTQSYVRALTMDSKKRIGWRFIRIDTSCVCTLTIKRGR,329,NP_001137282.1.csv,NP_001137282.1_clinical_seed_0_final,NP_001137282.1.a2m,popEVE,NP_001137282.1_theta_0.2.npy,1,329,329
+NP_001137326.1,MTDTVFSNSSNRWMYPSDRPLQSNDKEQLQAGWSVHPGGQPDRQRKQEELTDEEKEIINRVIARAEKMEEMEQERIGRLVDRLENMRKNVAGDGVNRCILCGEQLGMLGSACVVCEDCKKNVCTKCGVETNNRLHSVWLCKICIEQREVWKRSGAWFFKGFPKQVLPQPMPIKKTKPQQPVSEPAAPEQPAPEPKHPARAPARGDSEDRRGPGQKTGPDPASAPGRGNYGPPVRRASEARMSSSSRDSESWDHSGGAGDSSRSPAGLRRANSVQASRPAPGSVQSPAPPQPGQPGTPGGSRPGPGPAGRFPDQKPEVAPSDPGTTAPPREERTGGVGGYPAVGAREDRMSHPSGPYSQASAAAPQPAAARQPPPPEEEEEEANSYDSDEATTLGALEFSLLYDQDNSSLQCTIIKAKGLKPMDSNGLADPYVKLHLLPGASKSNKLRTKTLRNTRNPIWNETLVYHGITDEDMQRKTLRISVCDEDKFGHNEFIGETRFSLKKLKPNQRKNFNICLERVIPMKRAGTTGSARGMALYEEEQVERVGDIEERGKILVSLMYSTQQGGLIVGIIRCVHLAAMDANGYSDPFVKLWLKPDMGKKAKHKTQIKKKTLNPEFNEEFFYDIKHSDLAKKSLDISVWDYDIGKSNDYIGGCQLGISAKGERLKHWYECLKNKDKKIERWHQLQNENHVSSD,694,NP_001137326.1.csv,refseq-RPH3A-NM_001143854.1_clinical_seed_0_final,refseq-RPH3A-NM_001143854.1.a2m,Invitae,refseq-RPH3A-NM_001143854.1.npy,1,694,694
+NP_001137451.1,MEDSGKTFSSEEEEANYWKDLAMTYKQRAENTQEELREFQEGSREYEAELETQLQQIETRNRDLLSENNRLRMELETIKEKFEVQHSEGYRQISALEDDLAQTKAIKDQLQKYIRELEQANDDLERAKRATIMSLEDFEQRLNQAIERNAFLESELDEKENLLESVQRLKDEARDLRQELAVQQKQEKPRTPMPSSVEAERTDTAVQATGSVPSTPIAHRGPSSSLNTPGSFRRGLDDSTGGTPLTPAARISALNIVGDLLRKVGALESKLASCRNLVYDQSPNRTGGPASGRSSKNRDGGERRPSSTSVPLGDKGLDTSCRWLSKSTTRSSSSC,335,NP_001137451.1.csv,refseq-NDE1-NM_001143979.1_clinical_seed_0_final,refseq-NDE1-NM_001143979.1.a2m,Invitae,refseq-NDE1-NM_001143979.1.npy,1,335,335
+NP_001137453.1,MRKKWKMGGMKYIFSLLFFLLLEGGKTEQVKHSETYCMFQDKKYRVGERWHPYLEPYGLVYCVNCICSENGNVLCSRVRCPNVHCLSPVHIPHLCCPRCPEDSLPPVNNKVTSKSCEYNGTTYQHGELFVAEGLFQNRQPNQCTQCSCSEGNVYCGLKTCPKLTCAFPVSVPDSCCRVCRGDGELSWEHSDGDIFRQPANREARHSYHRSHYDPPPSRQAGGLSRFPGARSHRGALMDSQQASGTIVQIVINNKHKHGQVCVSNGKTYSHGESWHPNLRAFGIVECVLCTCNVTKQECKKIHCPNRYPCKYPQKIDGKCCKVCPGKKAKEELPGQSFDNKGYFCGEETMPVYESVFMEDGETTRKIALETERPPQVEVHVWTIRKGILQHFHIEKISKRMFEELPHFKLVTRTTLSQWKIFTEGEAQISQMCSSRVCRTELEDLVKVLYLERSEKGHC,458,NP_001137453.1.csv,refseq-CHRDL1-NM_001143981.1_clinical_seed_0_final,refseq-CHRDL1-NM_001143981.1.a2m,Invitae,refseq-CHRDL1-NM_001143981.1.npy,1,458,458
+NP_001137458.1,MTSRDQPRPKGPPKSTSPCPGISNSESSPTLNYQGILNRLKQFPRFSPHFAAELESIYYSLHKIQQDVAEHHKQIGNVLQIVESCSQLQGFQSEEVSPAEPASPGTPQQVKDKTLQESSFEDIMATRSSDWLRRPLGEDNQPETQLFWDKEPWFWHDTLTEQLWRIFAGVHDEKAKPRDRQQAPGLGQESKAPGSCDPGTDPCPEDASTPRPPEASSSPPEGSQDRNTSWGVVQEPPGRASRFLQSISWDPEDFEDAWKRPDALPGQSKRLAVPCKLEKMRILAHGELVLATAISSFTRHVFTCGRRGIKVWSLTGQVAEDRFPESHLPIQTPGAFLRTCLLSSNSRSLLTGGYNLASVSVWDLAAPSLHVKEQLPCAGLNCQALDANLDANLAFASFTSGVVRIWDLRDQSVVRDLKGYPDGVKSIVVKGYNIWTGGPDACLRCWDQRTIMKPLEYQFKSQIMSLSHSPQEDWVLLGMANGQQWLQSTSGSQRHMVGQKDSVILSVKFSPFGQWWASVGMDDFLGVYSMPAGTKVFEVPEMSPVTCCDVSSNNRLVVTGSGEHASVYQITY,572,NP_001137458.1.csv,refseq-TLE6-NM_001143986.1_clinical_seed_0_final,refseq-TLE6-NM_001143986.1.a2m,Invitae,refseq-TLE6-NM_001143986.1.npy,1,572,572
+NP_001138341.1,MRQPAVRLVRLGRVPYAELLGLQDRWLRRLQAEPGIEAPSGTEAGALLLCEPAGPVYTAGLRGGLTPEETARLRALGAEVRVTGRGGLATFHGPGQLLCHPVLDLRRLGLRLRMHVASLEACAVRLCELQGLQDARARPPPYTGVWLDDRKICAIGVRCGRHITSHGLALNCSTDLTWFEHIVPCGLVGTGVTSLSKELQRHVTVEEVMPPFLVAFKEIYKCTLISEDSPN,231,NP_001138341.1.csv,refseq-LIPT2-NM_001144869.2_clinical_seed_0_final,refseq-LIPT2-NM_001144869.2.a2m,Invitae,refseq-LIPT2-NM_001144869.2.npy,1,231,231
+NP_001138545.1,MCKDYVYDKDIEQIAKEEQGEALKLQASTSTEVSHQQCSVPGLGEKFPTWETTKPELELLGHNPRRRRITSSFTIGLRGLINLGNTCFMNCIVQALTHTPILRDFFLSDRHRCEMPSPELCLVCEMSSLFRELYSGNPSPHVPYKLLHLVWIHARHLAGYRQQDAHEFLIAALDVLHRHCKGDDVGKAANNPNHCNCIIDQIFTGGLQSDVTCQACHGVSTTIDPCWDISLDLPGSCTSFWPMSPGRESSVNGESHIPGITTLTDCLRRFTRPEHLGSSAKIKCGSCQSYQESTKQLTMNKLPVVACFHFKRFEHSAKQRRKITTYISFPLELDMTPFMASSKESRMNGQLQLPTNSGNNENKYSLFAVVNHQGTLESGHYTSFIRHHKDQWFKCDDAVITKASIKDVLDSEGYLLFYHKQVLEHESEKVKEMNTQAY,438,NP_001138545.1.csv,refseq-USP27X-NM_001145073.1_clinical_seed_0_final,refseq-USP27X-NM_001145073.1.a2m,Invitae,refseq-USP27X-NM_001145073.1.npy,1,438,438
+NP_001138584.1,MNCLEGPGKTCGPLASEEELVSACQLEKEEENEGEEEEEEEDEEDLDPDLDPDLEEEENDLGDPAVLGAVHNTQRALLSSPGVKAPGMLGMSLASLHFLWQTLDYLSPIPFWPTFPSTSSPAQHFGPRLPSPDPTLFCSLLTSWPPRFSHLTQLHPRHQRILQQQQHSQTPSPPAKKPWSQQPDPYANLMTRKEKDWVIKVQMVQLQSAKPRLDDYYYQEYYQKLEKKQADEELLGRRNRVESLKLVTPYIPKAEAYESVVRIEGSLGQVAVSTCFSPRRAIDAVPHGTQEQDIEAASSQRLRVLYRIEKMFLQLLEIEEGWKYRPPPPCFSEQQSNQVEKLFQTLKTQEQNNLEEAADGFLQVLSVRKGKALVARLLPFLPQDQAVTILLAITHHLPLLVRRDVADQALQMLFKPLGKCISHLTLHELLQGLQGLTLLPPGSSERPVTVVLQNQFGISLLYALLSHGEQLVSLHSSLEEPNSDHTAWTDMVVLIAWEIAQMPTASLAEPLAFPSNLLPLFCHHVDKQLVQQLEARMEFAWIY,543,NP_001138584.1.csv,refseq-PATL2-NM_001145112.1_clinical_seed_0_final,refseq-PATL2-NM_001145112.1.a2m,Invitae,refseq-PATL2-NM_001145112.1.npy,1,543,543
+NP_001138780.1,MGTPWRKRKGIAGPGLPDLSCALVLQPRAQVGTMSPAIALAFLPLVVTLLVRYRHYFRLLVRTVLLRSLRDCLSGLRIEERAFSYVLTHALPGDPGHILTTLDHWSSRCEYLSHMGPVKGQILMRLVEEKAPACVLELGTYCGYSTLLIARALPPGGRLLTVERDPRTAAVAEKLIRLAGFDEHMVELIVGSSEDVIPCLRTQYQLSRADLVLLAHRPRCYLRDLQLLEAHALLPAGATVLADHVLFPGAPRFLQYAKSCGRYRCRLHHTGLPDFPAIKDGIAQLTYAGPG,291,NP_001138780.1.csv,refseq-LRTOMT-NM_001145308.4_clinical_seed_0_final,refseq-LRTOMT-NM_001145308.4.a2m,Invitae,refseq-LRTOMT-NM_001145308.4.npy,1,291,291
+NP_001138830.1,MKRRLDDQESPVYAAQQRRIPGSTEAFPHQHRVLAPAPPVYEAVSETMQSATGIQYSVTPSYQVSAMPQSSGSHGPAIAAVHSSHHHPTAVQPHGGQVVQSHAHPAPPVAPVQGQQQFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIMGFNTFLPPGYKIEVQTNDMVNVTTPGQVHQIPTHGIQPQPQPPPQHPSQPSAQSAPAPAQPAPQPPPAKVSKPSQLQAHTPASQQTPPLPPYASPRSPPVQPHTPVTISLGTAPSLQNNQPVEFNHAINYVNKIKNRFQGQPDIYKAFLEILHTYQKEQRNAKEAGGNYTPALTEQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSVRNDHGGTVKKPQLNNKPQRPSQNGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESLFFDKVRKALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHLETYPKERATEGIAMEIDYASCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSWSEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAIQKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIYADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVPVGPHLSLAYEDKQILEDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPDLLFAQRGDLSDVEEEEEEEMDVDEATGAVKKHNGVGGSPPKSKLLFSNTAAQKLRGMDEVYNLFYVNNNWYIFMRLHQILCLRLLRICSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLKEPMDVDVEDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTQNSRSLLESTYQRKAEQLMSDENCFKLMFIQSQGQVQLTIELLDTEEENSDDPVEAERWSDYVERYMNSDTTSPELREHLAQKPVFLPRNLRRIRKCQRGREQQEKEGKEGNSKKTMENVDSLDKLECRFKLNSYKMVYVIKSEDYMYRRTALLRAHQSHERVSKRLHQRFQAWVDKWTKEHVPREMAAETSKWLMGEGLEGLVPCTTTCDTETLHFVSINKYRVKYGTVFKAP,1273,NP_001138830.1.csv,refseq-SIN3A-NM_001145358.1_clinical_seed_0_final,refseq-SIN3A-NM_001145358.1.a2m,Invitae,refseq-SIN3A-NM_001145358.1.npy,1,1273,1273
+NP_001138940.4,MKKASRSVGSVPKVSAISKTQTAEKIKPENSSSASTGGKLVKPGTAASLSKTKSSDDLLAGMAGGVTVTNGVKGKKSTCPSAAPSASAPAMTTVENKSKISTGTASSTKRSTSTGNKESSSTRERLRERTRLNQSKKLPSAGQGANDMALAKRSRSRTATECDVRMSKSKSDNQISDRAALEAKVKDLLTLAKTKDVEILHLRNELRDMRAQLGINEDHSEGDEKSEKETIMAHQPTDVESTLLQLQEQNTAIREELNQLKNENRMLKDRLNALGFSLEQRLDNSEKLFGYQSLSPEITPGNQSDGGGTLTSSVEGSAPGSVEDLLSQDENTLMDHQHSNSMDNLDSECSEVYQPLTSSDDALDAPSSSESEGIPSIERSRKGSSGNASEVSVACLTERIHQMEENQHSTSEELQATLQELADLQQITQELNSENERLGEEKVILMESLCQQSDKLEHFSRQIEYFRSLLDEHHISYVIDEDVKSGRYMELEQRYMDLAENARFEREQLLGVQQHLSNTLKMAEQDNKEAQEMIGALKERSHHMERIIESEQKGKAALAATLEEYKATVASDQIEMNRLKAQLENEKQKVAELYSIHNSGDKSDIQDLLESVRLDKEKAETLASSLQEDLAHTRNDANRLQDAIAKVEDEYRAFQEEAKKQIEDLNMTLEKLRSDLDEKETERSDMKETIFELEDEVEQHRAVKLHDNLIISDLENTVKKLQDQKHDMEREIKTLHRRLREESAEWRQFQADLQTAVVIANDIKSEAQEEIGDLKRRLHEAQEKNEKLTKELEEIKSRKQEEERGRVYNYMNAVERDLAALRQGMGLSRRSSTSSEPTPTVKTLIKSFDSASQVPNPAAAAIPRTPLSPSPMKTPPAAAVSPMQRHSISGPISTSKPLTALSDKRPNYGEIPVQEHLLRTSSASRPASLPRVPAMESAKTLSVSRRSSEEVKRDISAQEGASPASLMAMGTTSPQLSLSSSPTASVTPTTRSRIREERKDPLSALAREYGGSKRNALLKWCQKKTEGYQNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFMLAFQAAESVGIKSTLDINEMVRTERPDWQNVMLYVTAIYKYFET,1117,NP_001138940.4.csv,NP_001138940.4_colabfold_clinical_seed_0_final,NP_001138940.4_colabfold.a2m,colabfold,NP_001138940.4_colabfold_theta_0.2.npy,1,1117,1117
+NP_001139189.2,MEEQVFKGDPDTPHSISFSGSGFLSFYQAGAVDALRDLAPRMLETAHRFAGTSAGAVIAALAICGIEMDEYLRVLNVGVAEVKKSFLGPLSPSCKMVQMMRQFLYRVLPEDSYKVTTGKLHVSLTRLTDGENVVVSEFTSKEELIEALYCSCFVPVYCGLIPPTYRGVRYIDGGFTGMQPCAFWTDAITISTFSGQQDICPRDCPAIFHDFRMFNCSFQFSLENIARMTHALFPPDLVILHDYYYRGYEDAVLYLRRLNAVYLNSSSKRVIFPRVEVYCQIELALGNECPERSQPSLRARQASLEGATQPHKEWVPKGDGRGSHGPPVSQPVQTLEFTCESPVSAPVSPLEQPPAQPLASSTPLSLSGMPPVSFPAVHKPPSSTPGSSLPTPPPGLSPLSPQQQVQPSGSPARSLHSQAPTSPRPSLGPSTVGAPQTLPRSSLSAFPAQPPVEELGQEQPQAVALLVSSKPKSAVPLVHVKETVSKPYVTESPAEDSNWVNKVFKKNKQKTSGTRKGFPRHSGSKKPSSKVQ,532,NP_001139189.2.csv,refseq-PNPLA1-NM_001145717.1_clinical_seed_0_final,refseq-PNPLA1-NM_001145717.1.a2m,Invitae,refseq-PNPLA1-NM_001145717.1.npy,1,532,532
+NP_001139267.1,MNGAPSPEDGASPSSPPLPPPPPPSWREFCESHARAAALDFARRFRLYLASHPQYAGPGAEAAFSRRFAELFLQHFEAEVARASGSLSPPILAPLSPGAEISPHDLSLESCRVGGPLAVLGPSRSSEDLAGPLPSSVSSSSTTSSKPKLKKRFSLRSVGRSVRGSVRGILQWRGTVDPPSSAGPLETSSGPPVLGGNSNSNSSGGAGTVGRGLVSDGTSPGERWTHRFERLRLSRGGGALKDGAGMVQREELLSFMGAEEAAPDPAGVGRGGGVAGPPSGGGGQPQWQKCRLLLRSEGEGGGGSRLEFFVPPKASRPRLSIPCSSITDVRTTTALEMPDRENTFVVKVEGPSEYIMETVDAQHVKAWVSDIQECLSPGPCPATSPRPMTLPLAPGTSFLTRENTDSLELSCLNHSESLPSQDLLLGPSESNDRLSQGAYGGLSDRPSASISPSSASIAASHFDSMELLPPELPPRIPIEEGPPTGTVHPLSAPYPPLDTPETATGSFLFQGEPEGGEGDQPLSGYPWFHGMLSRLKAAQLVLTGGTGSHGVFLVRQSETRRGEYVLTFNFQGKAKHLRLSLNEEGQCRVQHLWFQSIFDMLEHFRVHPIPLESGGSSDVVLVSYVPSSQRQQEPTTSHDPPQPPEPPSWTDPPQPGAEEASRAPEVAAAAAAAAKERQEKEKAGGGGVPEELVPVVELVPVVELEEAIAPGSEAQGAGSGGDAGVPPMVQLQQSPLGGDGEEGGHPRAINNQYSFV,756,NP_001139267.1.csv,refseq-SH2B1-NM_001145795.1_clinical_seed_0_final,refseq-SH2B1-NM_001145795.1.a2m,Invitae,refseq-SH2B1-NM_001145795.1.npy,1,756,756
+NP_001142.2,MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQVQHASKQISAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVDRHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGLGDCIIKIFKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMMQSGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYV,298,NP_001142.2.csv,refseq-SLC25A4-NM_001151.3_clinical_seed_0_final,refseq-SLC25A4-NM_001151.3.a2m,Invitae,refseq-SLC25A4-NM_001151.3.npy,1,298,298
+NP_001148.1,MSYPGYPPPPGGYPPAAPGGGPWGGAAYPPPPSMPPIGLDNVATYAGQFNQDYLSGMAANMSGTFGGANMPNLYPGAPGAGYPPVPPGGFGQPPSAQQPVPPYGMYPPPGGNPPSRMPSYPPYPGAPVPGQPMPPPGQQPPGAYPGQPPVTYPGQPPVPLPGQQQPVPSYPGYPGSGTVTPAVPPTQFGSRGTITDAPGFDPLRDAEVLRKAMKGFGTDEQAIIDCLGSRSNKQRQQILLSFKTAYGKDLIKDLKSELSGNFEKTILALMKTPVLFDIYEIKEAIKGVGTDEACLIEILASRSNEHIRELNRAYKAEFKKTLEEAIRSDTSGHFQRLLISLSQGNRDESTNVDMSLAQRDAQELYAAGENRLGTDESKFNAVLCSRSRAHLVAVFNEYQRMTGRDIEKSICREMSGDLEEGMLAVVKCLKNTPAFFAERLNKAMRGAGTKDRTLIRIMVSRSETDLLDIRSEYKRMYGKSLYHDISGDTSGDYRKILLKICGGND,505,NP_001148.1.csv,refseq-ANXA11-NM_001157.2_clinical_seed_0_final,refseq-ANXA11-NM_001157.2.a2m,Invitae,refseq-ANXA11-NM_001157.2.npy,1,505,505
+NP_001153508.1,MQAWRKGPDGPQKTSSDSMSRLMDSDMDYERPNVETIKCVVVGDNAVGKTRLICARACNATLTQYQLLATHVPTVWAIDQYRVCQEVLERSRDVVDDVSVSLRLWDTFGDHHKDRRFAYGRSDVVVLCFSIANPNSLHHVKTMWYPEIKHFCPRAPVILVGCQLDLRYADLEAVNRARRPLARPIKPNEILPPEKGREVAKELGIPYYETSVVAQFGIKDVFDNAIRAALISRRHLQFWKSHLRNVQRPLLQAPFLPPKPPPPIIVVPDPPSSSEECPAHLLEDPLCADVILVLQERVRIFAHKIYLSTSSSKFYDLFLMDLSEGELGGPSEPGGTHPEDHQGHSDQHHHHHHHHHGRDFLLRAASFDVCESVDEAGGSGPAGLRASTSDGILRGNGTGYLPGRGRVLSSWSRAFVSIQEEMAEDPLTYKSRLMVVVKMDSSIQPGPFRAVLKYLYTGELDENERDLMHIAHIAELLEVFDLRMMVANILNNEAFMNQEITKAFHVRRTNRVKECLAKGTFSDVTFILDDGTISAHKPLLISSCDWMAAMFGGPFVESSTREVVFPYTSKSCMRAVLEYLYTGMFTSSPDLDDMKLIILANRLCLPHLVALTEQYTVTGLMEATQMMVDIDGDVLVFLELAQFHCAYQLADWCLHHICTNYNNVCRKFPRDMKAMSPENQEYFEKHRWPPVWYLKEEDHYQRARKEREKEDYLHLKRQPKRRWLFWNSPSSPSSSAASSSSPSSSSAVV,749,NP_001153508.1.csv,refseq-RHOBTB2-NM_001160036.1_clinical_seed_0_final,refseq-RHOBTB2-NM_001160036.1.a2m,Invitae,refseq-RHOBTB2-NM_001160036.1.npy,1,749,749
+NP_001153605.1,MPRHHAGGEEGGAAGLWVKSGAAAAAAGGGRLGSGMKDVESGRGRVLLNSAAARGDGLLLLGTRAATLGGGGGGLRESRRGKQGARMSLLGKPLSYTSSQSCRRNVKYRRVQNYLYNVLERPRGWAFIYHAFVFLLVFGCLILSVFSTIPEHTKLASSCLLILEFVMIVVFGLEFIIRIWSAGCCCRYRGWQGRLRFARKPFCVIDTIVLIASIAVVSAKTQGNIFATSALRSLRFLQILRMVRMDRRGGTWKLLGSVVYAHSKELITAWYIGFLVLIFSSFLVYLVEKDANKEFSTYADALWWGTITLTTIGYGDKTPLTWLGRLLSAGFALLGISFFALPAGILGSGFALKVQEQHRQKHFEKRRNPAANLIQCVWRSYAADEKSVSIATWKPHLKALHTCSPTKKEQGEASSSKFCSNKQKLFRMYTSRKQSQKLSFKERVRMASPRGQSIKSRQASVGDRRSPSTDITAEGSPTKVQKSWSFNDRTRFRPSLRLKSSQPKPVIDADTALGTDDVYDEKGCQCDVSVEDLTPPLKTVIRAIRIMKFHVAKRKFKETLRPYDVKDVIEQYSAGHLDMLCRIKSLQTRVDQILGKGQITSDKKSREKITAEHETTDDLSMLGRVVKVEKQVQSIESKLDCLLDIYQQVLRKGSASALALASFQIPPFECEQTSDYQSPVDSKDLSGSAQNSGCLSRSTSANISRGLQFILTPNEFSAQTFYALSPTMHSQATQVPISQSDGSAVAATNTIANQINTAPKPAAPTTLQIPPPLPAIKHLPRPETLHPNPAGLQESISDVTTCLVASKENVQVAQSNLTKDRSMRKSFDMGGETLLSVCPMVPKDLGKSLSVQNLIRSTEELNIQLSGSESSGSRGSQDFYPKWRESKLFITDEEVGPEETETDTFDAAPQPAREAAFASDSLRTGRSRSSQSICKAGESTDALSLPHVKLK,951,NP_001153605.1.csv,refseq-KCNQ5-NM_001160133.1_clinical_seed_0_final,refseq-KCNQ5-NM_001160133.1.a2m,Invitae,refseq-KCNQ5-NM_001160133.1.npy,1,951,951
+NP_001153695.1,MAKYQGEVQSLKLDDDSVIEGVSDQVLVAVVVSFALIATLVYALFRNVHQNIHPENQELVRVLREQLQTEQDAPAATRQQFYTDMYCPICLHQASFPVETNCGHLFCGACIIAYWRYGSWLGAISCPICRQTVTLLLTVFGEDDQSQDVLRLHQDINDYNRRFSGQPRSIMERIMDLPTLLRHAFREMFSVGGLFWMFRIRIILCLMGAFFYLISPLDFVPEALFGILGFLDDFFVIFLLLIYISIMYREVITQRLTR,258,NP_001153695.1.csv,refseq-RNF170-NM_001160223.1_clinical_seed_0_final,refseq-RNF170-NM_001160223.1.a2m,Invitae,refseq-RNF170-NM_001160223.1.npy,1,258,258
+NP_001153705.1,MAFKVGRDKYEPAAVSEQGDKKGKKGKKDRDMDELKKEVSMDDHKLSLDELHRKYGTDLSRGLTSARAAEILARDGPNALTPPPTTPEWIKFCRQLFGGFSMLLWIGAILCFLAYSIQAATEEEPQNDNLYLGVVLSAVVIITGCFSYYQEAKSSKIMESFKNMVPQQALVIRNGEKMSINAEEVVVGDLVEVKGGDRIPADLRIISANGCKVDNSSLTGESEPQTRSPDFTNENPLETRNIAFFSTNCVEGTARGIVVYTGDRTVMGRIATLASGLEGGQTPIAAEIEHFIHIITGVAVFLGVSFFILSLILEYTWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTENQSGVSFDKTSATWLALSRIAGLCNRAVFQANQENLPILKRAVAGDASESALLKCIELCCGSVKEMRERYAKIVEIPFNSTNKYQLSIHKNPNTSEPQHLLVMKGAPERILDRCSSILLHGKEQPLDEELKDAFQNAYLELGGLGERVLGFCHLFLPDEQFPEGFQFDTDDVNFPIDNLCFVGLISMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPVSQVNPRDAKACVVHGSDLKDMTSEQLDDILKYHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGVAMGIAGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLIFIIANIPLPLGTVTILCIDLGTDMVPAISLAYEQAESDIMKRQPRNPKTDKLVNERLISMAYGQIGMIQALGGFFTYFVILAENGFLPIHLLGLRVDWDDRWINDVEDSYGQQWTYEQRKIVEFTCHTAFFVSIVVVQWADLVICKTRRNSVFQQGMKNKILIFGLFEETALAAFLSYCPGMGVALRMYPLKPTWWFCAFPYSLLIFVYDEVRKLIIRRRPGGWVEKETYY,1023,NP_001153705.1.csv,NP_001153705.1_clinical_seed_0_final,NP_001153705.1.a2m,popEVE,NP_001153705.1_theta_0.2.npy,1,1023,1023
+NP_001153839.1,MDKEKDGIPISSLREITLLLRLRHPNIVELKEVVVGNHLESIFLVMGYCEQDLASLLENMPTPFSEAQVKCIVLQVLRGLQYLHRNFIIHRDLKVSNLLMTDKGCVKTADFGLARAYGVPVKPMTPKVVTLWYRAPELLLGTTTQTTSIDMWAVGCILAELLAHRPLLPGTSEIHQIDLIVQLLGTPSENIWPGFSKLPLVGQYSLRKQPYNNLKHKFPWLSEAGLRLLHFLFMYDPKKRATAGDCLESSYFKEKPLPCEPELMPTFPHHRNKRAAPATSEGQSKRCKP,289,NP_001153839.1.csv,refseq-CDK10-NM_001160367.1_clinical_seed_0_final,refseq-CDK10-NM_001160367.1.a2m,Invitae,refseq-CDK10-NM_001160367.1_theta_0.2.npy,1,289,289
+NP_001153844.1,MSVPDYMQCAEDHQTLLVVVQPVGIVSEENFFRIYKRICSVSQISVRDSQRVLYIRYRHHYPPENNEWGDFQTHRKVVGLITITDCFSAKDWPQTFEKFHVQKEIYGSTLYDSRLFVFGLQGEIVEQPRTDVAFYPNYEDCQTVEKRIEDFIESLFIVLESKRLDRATDKSGDKIPLLCVPFEKKDFVGLDTDSRHYKKRCQGRMRKHVGDLCLQAGMLQDSLVHYHMSVELLRSVNDFLWLGAALEGLCSASVIYHYPGGTGGKSGARRFQGSTLPAEAANRHRPGAQEVLIDPGALTTNGINPDTSTEIGRAKNCLSPEDIIDKYKEAISYYSKYKNAGVIELEACIKAVRVLAIQKRSMEASEFLQNAVYINLRQLSEEEKIQRYSILSELYELIGFHRKSAFFKRVAAMQCVAPSIAEPGWRACYKLLLETLPGYSLSLDPKDFSRGTHRGWAAVQMRLLHELVYASRRMGNPALSVRHLSFLLQTMLDFLSDQEKKDVAQSLENYTSKCPGTMEPIALPGGLTLPPVPFTKLPIVRHVKLLNLPASLRPHKMKSLLGQNVSTKSPFIYSPIIAHNRGEERNKKIDFQWVQGDVCEVQLMVYNPMPFELRVENMGLLTSGVEFESLPAALSLPAESGLYPVTLVGVPQTTGTITVNGYHTTVFGVFSDCLLDNLPGIKTSGSTVEVIPALPRLQISTSLPRSAHSLQPSSGDEISTNVSVQLYNGESQQLIIKLENIGMEPLEKLEVTSKVLTTKEKLYGDFLSWKLEETLAQFPLQPGKVATFTINIKVKLDFSCQENLLQDLSDDGISVSGFPLSSPFRQVVRPRVEGKPVNPPESNKAGDYSHVKTLEAVLNFKYSGGPGHTEGYYRNLSLGLHVEVEPSVFFTRVSTLPATSTRQCHLLLDVFNSTEHELTVSTRSSEALILHAGECQRMAIQVDKFNFESFPESPGEKGQFANPKQLEEERREARGLEIHSKLGICWRIPSLKRSGEASVEGLLNQLVLEHLQLAPLQWDVLVDGQPCDREAVAACQVGDPVRLEVRLTNRSPRSVGPFALTVVPFQDHQNGVHNYDLHDTVSFVGSSTFYLDAVQPSGQSACLGALLFLYTGDFFLHIRFHEDSTSKELPPSWFCLPSVHVCALEAQA,1148,NP_001153844.1.csv,refseq-TRAPPC9-NM_001160372.1_clinical_seed_0_final,refseq-TRAPPC9-NM_001160372.1.a2m,Invitae,refseq-TRAPPC9-NM_001160372.1_theta_0.2.npy,1,1148,1148
+NP_001155197.1,MPNKNKKEKESPKAGKSGKSSKEGQDTVESEGTSPEEPSSPKVPPPLLPELLVLIFGGLQGRDVPPADQEKLFIQKLRQCCVLFDFVSDPLSDLKWKEVKRAALSEMVEYITHNRNVITEPIYPEVVHMFAVNMFRTLPPSSNPTGAEFDPEEDEPTLEAAWPHLQLVYEFFLRFLESPDFQPNIAKKYIDQKFVLQLLELFDSEDPRERDFLKTTLHRIYGKFLGLRAYIRKQINNIFYRFIYETEHHNGIAELLEILGSIINGFALPLKEEHKIFLLKVLLPLHKVKSLSVYHPQLAYCVVQFLEKDSTLTEPVVMALLKYWPKTHSPKEVMFLNELEEILDVIEPSEFVKIMEPLFRQLAKCVSSPHFQVAERALYYWNNEYIMSLISDNAAKILPIMFPSLYRNSKTHWNKTIHGLIYNALKLFMEMNQKLFDDCTQQFKAEKLKEKLKMKEREEAWVKIENLAKANPQYTVYSQASTMSIPVAMETDGPLFEDVQMLRKTVKDEAHQAQKDPKKDRPLARRKSELPQDPHTKKALEAHCRADELASQDGR,555,NP_001155197.1.csv,refseq-PPP2R5C-NM_001161725.1_clinical_seed_0_final,refseq-PPP2R5C-NM_001161725.1.a2m,Invitae,refseq-PPP2R5C-NM_001161725.1.npy,1,555,555
+NP_001156685.1,MGAPACALALCVAVAIVAGASSESLGTEQRVVGRAAEVPGPEPGQQEQLVFGSGDAVELSCPPPGGGPMGPTVWVKDGTGLVPSERVLVGPQRLQVLNASHEDSGAYSCRQRLTQRVLCHFSVRVTDAPSSGDDEDGEDEAEDTGVDTGAPYWTRPERMDKKLLAVPAANTVRFRCPAAGNPTPSISWLKNGREFRGEHRIGGIKLRHQQWSLVMESVVPSDRGNYTCVVENKFGSIRQTYTLDVLERSPHRPILQAGLPANQTAVLGSDVEFHCKVYSDAQPHIQWLKHVEVNGSKVGPDGTPYVTVLKSWISESVEADVRLRLANVSERDGGEYLCRATNFIGVAEKAFWLSVHGPRAAEEELVEADEAGSVYAGILSYGVGFFLFILVVAAVTLCRLRSPPKKGLGSPTVHKISRFPLKRQVSLESNASMSSNTPLVRIARLSSGEGPTLANVSELELPADPKWELSRARLTLGKPLGEGCFGQVVMAEAIGIDKDRAAKPVTVAVKMLKDDATDKDLSDLVSEMEMMKMIGKHKNIINLLGACTQGGPLYVLVEYAAKGNLREFLRARRPPGLDYSFDTCKPPEEQLTFKDLVSCAYQVARGMEYLASQKCIHRDLAARNVLVTEDNVMKIADFGLARDVHNLDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVLLWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTHDLYMIMRECWHAAPSQRPTFKQLVEDLDRVLTVTSTDEYLDLSAPFEQYSPGGQDTPSSSSSGDDSVFAHDLLPPAPPSSGGSRT,808,NP_001156685.1.csv,refseq-FGFR3-NM_001163213.2_clinical_seed_0_final,refseq-FGFR3-NM_001163213.2.a2m,Invitae,refseq-FGFR3-NM_001163213.2.npy,1,808,808
+NP_001156908.2,MFPLKDAEMGAFTFFASALPHDVCGSNGLPLTPNSIKILGRFQILKTITHPRLCQYVDISRGKHERLVVVAEHCERSLEDLLRERKPVSCSTVLCIAFEVLQGLQYMNKHGIVHRALSPHNILLDRKGHIKLAKFGLYHMTAHGDDVDFPIGYPSYLAPEVIAQGIFKTTDHMPSKKPLPSGPKSDVWSLGIILFELCVGRKLFQSLDISERLKFLLTLDCVDDTLIVLAEEHGCLDIIKELPETVIDLLNKCLTFHPSKRPTPDQLMKDKVFSEVSPLYTPFTKPASLFSSSLRCADLTLPEDISQLCKDINNDYLAERSIEEVYYLWCLAGGDLEKELVNKEIIRSKPPICTLPNFLFEDGESFGQGRDRSSLLDDTTVTLSLCQLRNRLKDVGGEAFYPLLEDDQSNLPHSNSNNELSAAATLPLIIREKDTEYQLNRIILFDRLLKAYPYKKNQIWKEARVDIPPLMRGLTWAALLGVEGAIHAKYDAIDKDTPIPTDRQIEVDIPRCHQYDELLSSPEGHAKFRRVLKAWVVSHPDLVYWQGLDSLCAPFLYLNFNNEALAYACMSAFIPKYLYNFFLKDNSHVIQEYLTVFSQMIAFHDPELSNHLNEIGFIPDLYAIPWFLTMFTHVFPLHKIFHLWDTLLLGNSSFPFCIGVAILQQLRDRLLANGFNECILLFSDLPEIDIERCVRESINLFCWTPKSATYRQHAQPPKPSSDSSGGRSSAPYFSAECPDPPKTDLSRESIPLNDLKSEVSPRISAEDLIDLCELTVTGHFKTPSKKTKSSKPKLLVVDIRNSEDFIRGHISGSINIPFSAAFTAEGELTQGPYTAMLQNFKGKVIVIVGHVAKHTAEFAAHLVKMKYPRICILDGGINKIKPTGLLTIPSPQI,893,NP_001156908.2.csv,refseq-TBCK-NM_001163436.4_clinical_seed_0_final,refseq-TBCK-NM_001163436.4.a2m,Invitae,refseq-TBCK-NM_001163436.4_theta_0.2.npy,1,893,893
+NP_001157281.1,MAQGSGGREGALRTPAGGWHSPPSPDMQELLRSVERDLSIDPRQLAPAPGGTHVVALVPARWLASLRDRRLPLGPCPRAEGLGEAEVRTLLQRSVQRLPAGWTRVEVHGLRKRRLSYPLGGGLPFEDGSCGPETLTRFMQEVAAQNYRNLWRHAYHTYGQPYSHSPAPSAVPALDSVRQALQRVYGCSFLPVGETTQCPSYAREGPCPPRGSPACPSLLRAEALLESPEMLYVVHPYVQFSLHDVVTFSPAKLTNSQAKVLFILFRVLRAMDACHRQGLACGALSLYHIAVDEKLCSELRLDLSAYERPEEDENEEAPVARDEAGIVSQEEQGGQPGQPTGQEELRSLVLDWVHGRISNFHYLMQLNRLAGRRQGDPNYHPVLPWVVDFTTPHGRFRDLRKSKFRLNKGDKQLDFTYEMTRQAFVAGGAGGGEPPHVPHHISDVLSDITYYVYKARRTPRSVLCGHVRAQWEPHEYPASMERMQNWTPDECIPEFYTDPSIFRSIHPDMPDLDVPAWCSSSQEFVAAHRALLESREVSRDLHHWIDLTFGYKLQGKEAVKEKNVCLHLVDAHTHLASYGVVQLFDQPHPQRLAGAPALAPEPPLIPKLLVQTIQETTGREDFTENPGQLPNGVGRPVLEATPCEASWTRDRPVAGEDDLEQATEALDSISLAGKAGDQLGSSSQASPGLLSFSVASASRPGRRNKAAGADPGEGEEGRILLPEGFNPMQALEELEKTGNFLAKGLGGLLEVPEQPRVQPAVPLQCLLHRDMQALGVLLAEMVFATRVRTLQPDAPLWVRFQAVRGLCTRHPKEVPVSLQPVLDTLLQMSGPEVPMGAERGKLDQLFEYRPVSQGLPPPCPSQLLSPFSSVVPFPPYFPALHRFILLYQARRVEDEAQGRELVFALWQQLGAVLKDITPEGLEILLPFVLSLMSEEHTAVYTAWYLFEPVAKALGPKNANKYLLKPLIGAYESPCQLHGRFYLYTDCFVAQLMVRLGLQAFLTHLLPHVLQVLAGAEASQEESKDLAGAAEEEESGLPGAGPGSCAFGEEIPMDGEPPASSGLGLPDYTSGVSFHDQADLPETEDFQAGLYVTESPQPQEAEAVSLGRLSDKSSTSETSLGEERAPDEGGAPVDKSSLRSGDSSQDLKQSEGSEEEEEEEDSCVVLEEEEGEQEEVTGASELTLSDTVLSMETVVAGGSGGDGEEEEEALPEQSEGKEQKILLDTACKMVRWLSAKLGPTVASRHVARNLLRLLTSCYVGPTRQQFTVSSGESPPLSAGNIYQKRPVLGDIVSGPVLSCLLHIARLYGEPVLTYQYLPYISYLVAPGSASGPSRLNSRKEAGLLAAVTLTQKIIVYLSDTTLMDILPRISHEVLLPVLSFLTSLVTGFPSGAQARTILCVKTISLIALICLRIGQEMVQQHLSEPVATFFQVFSQLHELRQQDLKLDPAGRGEGQLPQVVFSDGQQRPVDPALLDELQKVFTLEMAYTIYVPFSCLLGDIIRKIIPNHELVGELAALYLESISPSSRNPASVEPTMPGTGPEWDPHGGGCPQDDGHSGTFGSVLVGNRIQIPNDSRPENPGPLGPISGVGGGGLGSGSDDNALKQELPRSVHGLSGNWLAYWQYEIGVSQQDAHFHFHQIRLQSFPGHSGAVKCVAPLSSEDFFLSGSKDRTVRLWPLYNYGDGTSETAPRLVYTQHRKSVFFVGQLEAPQHVVSCDGAVHVWDPFTGKTLRTVEPLDSRVPLTAVAVMPAPHTSITMASSDSTLRFVDCRKPGLQHEFRLGGGLNPGLVRALAISPSGRSVVAGFSSGFMVLLDTRTGLVLRGWPAHEGDILQIKAVEGSVLVSSSSDHSLTVWKELEQKPTHHYKSASDPIHTFDLYGSEVVTGTVSNKIGVCSLLEPPSQATTKLSSENFRGTLTSLALLPTKRHLLLGSDNGVIRLLA,1941,NP_001157281.1.csv,refseq-WDR81-NM_001163809.1_clinical_seed_0_final,refseq-WDR81-NM_001163809.1.a2m,Invitae,refseq-WDR81-NM_001163809.1.npy,1,1941,1941
+NP_001157750.1,MAAQGYGYYRTVIFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDDLGFITSSQSAAYAISKFVSGVLSDQMSARWLFSSGLLLVGLVNIFFAWSSTVPVFAALWFLNGLAQGLGWPPCGKVLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLCLLLIHNEPADVGLRNLDPMPSEGKKGSLKEESTLQELLLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGLVGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKDVAFWTLALHPLAELTGFTEHELWILVLGAVFGFSSYGPIALFGVIANESAPPNLCGTSHAIVGLMANVGGFLAGLPFSTIAKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE,451,NP_001157750.1.csv,NP_001157750.1_colabfold_clinical_seed_0_final,NP_001157750.1_colabfold.a2m,colabfold,NP_001157750.1_colabfold_theta_0.2.npy,1,451,451
+NP_001157814.1,MLERKKPKTAENQKASEENEITQPGGSSAKPGLPCLNFEAVLSPDPALIHSTHSLTNSHAHTGSSDCDISCKGMTERIHSINLHNFSNSVLETLNEQRNRGHFCDVTVRIHGSMLRAHRCVLAAGSPFFQDKLLLGYSDIEIPSVVSVQSVQKLIDFMYSGVLRVSQSEALQILTAASILQIKTVIDECTRIVSQNVGDVFPGIQDSGQDTPRGTPESGTSGQSSDTESGYLQSHPQHSVDRIYSALYACSMQNGSGERSFYSGAVVSHHETALGLPRDHHMEDPSWITRIHERSQQMERYLSTTPETTHCRKQPRPVRIQTLVGNIHIKQEMEDDYDYYGQQRVQILERNESEECTEDTDQAEGTESEPKGESFDSGVSSSIGTEPDSVEQQFGPGAARDSQAEPTQPEQAAEAPAEGGPQTNQLETGASSPERSNEVEMDSTVITVSNSSDKSVLQQPSVNTSIGQPLPSTQLYLRQTETLTSNLRMPLTLTSNTQVIGTAGNTYLPALFTTQPAGSGPKPFLFSLPQPLAGQQTQFVTVSQPGLSTFTAQLPAPQPLASSAGHSTASGQGEKKPYECTLCNKTFTAKQNYVKHMFVHTGEKPHQCSICWRSFSLKDYLIKHMVTHTGVRAYQCSICNKRFTQKSSLNVHMRLHRGEKSYECYICKKKFSHKTLLERHVALHSASNGTPPAGTPPGARAGPPGVVACTEGTTYVCSVCPAKFDQIEQFNDHMRMHVSDG,741,NP_001157814.1.csv,refseq-ZBTB20-NM_001164342.2_clinical_seed_0_final,refseq-ZBTB20-NM_001164342.2.a2m,Invitae,refseq-ZBTB20-NM_001164342.2.npy,1,741,741
+NP_001157877.1,MLRGAPGLGLTARKGAEDSAEDLGGPCPEPGGDSGVLGANGASCSRGEAEEPAGRRRARPVRSKARRMAANVRERKRILDYNEAFNALRRALRHDLGGKRLSKIATLRRAIHRIAALSLVLRASPAPRGPCGHLECHGPAARGDTGDTGASPPPPAGPSLARPDAARPSVPSAPRCASCPPHAPLARPSAVAEGPGLAQASGGSWRRCPGASSAGPPPWPRGYLRSAPGMGHPRS,235,NP_001157877.1.csv,refseq-BHLHA9-NM_001164405.1_clinical_seed_0_final,refseq-BHLHA9-NM_001164405.1.a2m,Invitae,refseq-BHLHA9-NM_001164405.1.npy,1,235,235
+NP_001158233.1,MASPPACPSEEDESLKGCELYVQLHGIQQVLKDCIVHLCISKPERPMKFLREHFEKLEKEENRQILARQKSNSQSDSHDEEVSPTPPNPVVKARRRRGGVSAEVYTEEDAVSYVRKVIPKDYKTMTALAKAISKNVLFAHLDDNERSDIFDAMFPVTHIAGETVIQQGNEGDNFYVVDQGEVDVYVNGEWVTNISEGGSFGELALIYGTPRAATVKAKTDLKLWGIDRDSYRRILMGSTLRKRKMYEEFLSKVSILESLEKWERLTVADALEPVQFEDGEKIVVQGEPGDDFYIITEGTASVLQRRSPNEEYVEVGRLGPSDYFGEIALLLNRPRAATVVARGPLKCVKLDRPRFERVLGPCSEILKRNIQRYNSFISLTV,381,NP_001158233.1.csv,refseq-PRKAR1B-NM_001164761.1_clinical_seed_0_final,refseq-PRKAR1B-NM_001164761.1.a2m,Invitae,refseq-PRKAR1B-NM_001164761.1.npy,1,381,381
+NP_001159435.1,MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDDDENGPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRFSATSALYILTPFNPLRKIAIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKIIARGFCLEDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCIQWPPTNASLEEHSIEKNITVNYNGTLINETVFEFDWKSYIQDSRYHYFLEGFLDALLCGNSSDAGQCPEGYMCVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILAVVAMAYEEQNQATLEEAEQKEAEFQQMIEQLKKQQEAAQQAATATASEHSREPSAAGRLSDSSSEASKLSSKSAKERRNRRKKRKQKEQSGGEEKDEDEFQKSESEDSIRRKGFRFSIEGNRLTYEKRYSSPHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDVGSENDFADDEHSTFEDNESRRDSLFVPRRHGERRNSNLSQTSRSSRMLAVFPANGKMHSTVDCNGVVSLVGGPSVPTSPVGQLLPEVIIDKPATDDNGTTTETEMRKRRSSSFHVSMDFLEDPSQRQRAMSIASILTNTVEELEESRQKCPPCWYKFSNIFLIWDCSPYWLKVKHVVNLVVMDPFVDLAITICIVLNTLFMAMEHYPMTDHFNNVLTVGNLVFTGIFTAEMFLKIIAMDPYYYFQEGWNIFDGFIVTLSLVELGLANVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKDCVCKIASDCQLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLTVFMMVMVIGNLVVLNLFLALLLSSFSADNLAATDDDNEMNNLQIAVDRMHKGVAYVKRKIYEFIQQSFIRKQKILDEIKPLDDLNNKKDSCMSNHTAEIGKDLDYLKDVNGTTSGIGTGSSVEKYIIDESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLEESKEKLNESSSSSEGSTVDIGAPVEEQPVVEPEETLEPEACFTEGCVQRFKCCQINVEEGRGKQWWNLRRTCFRIVEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGYQTYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCINTTTGDRFDIEDVNNHTDCLKLIERNETARWKNVKVNFDNVGFGYLSLLQVATFKGWMDIMYAAVDSRNVELQPKYEESLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPGNKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQSEYVTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVVVILSIVGMFLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPPDCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTLKRKQEEVSAVIIQRAYRRHLLKRTVKQASFTYNKNKIKGGANLLIKEDMIIDRINENSITEKTDLTMSTAACPPSYDRVTKPIVEKHEQEGKDEKAKGK,2009,NP_001159435.1.csv,refseq-SCN1A-NM_001165963.1_clinical_seed_0_final,refseq-SCN1A-NM_001165963.1.a2m,Invitae,refseq-SCN1A-NM_001165963.1.npy,1,2009,2009
+NP_001159725.2,MEALGFLKLEVNGPMVTVALSVALLALLKWYSTSAFSRLEKLGLRHPKPSPFIGNLTFFRQGFWESQMELRKLYGPLCGYYLGRRMFIVISEPDMIKQVLVENFSNFTNRMASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNELGLLIMQERIKGHMGGQQAPQRIPPTRLSKPSGIYVNLHYATLPFCMVPLISQACDLLLAHLKRYAESGDAFDIQRCYCNYTTDVVASVAFGTPVDSWQAPEDPFVKHCKRFFEFCIPRPILVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASPMGVQDFDIVRDVFSSTGCKPNPSRQHQPSPMARPLTVDEIVGQAFIFLIAGYEIITNTLSFATYLLATNPDCQEKLLREVDVFKEKHMAPEFCSLEEGLPYLDMVIAETLRMYPPAFRFTREAAQDCEVLGQRIPAGAVLEMAVGALHHDPEHWPSPETFNPERFTAEARQQHRPFTYLPFGAGPRSCLGVRLGLLEVKLTLLHVLHKFRFQACPETQVPLQLESKSALGPKNGVYIKIVSR,579,NP_001159725.2.csv,NP_001159725.2_clinical_seed_0_final,NP_001159725.2.a2m,popEVE,NP_001159725.2_theta_0.2.npy,1,579,579
+NP_001159762.1,MAWWLIAFAHGDLAPSEGTAEPCVTSIHSFSSAFLFSIEVQVTIGFGGRMVTEECPLAILILIVQNIVGLMINAIMLGCIFMKTAQAHRRAETLIFSKHAVIALRHGRLCFMLRVGDLRKSMIISATIHMQVVRKTTSPEGEVVPLHQVDIPMENGVGGNSIFLVAPLIIYHVIDANSPLYDLAPSDLHHHQDLEIIVILEGVVETTGITTQARTSYLADEILWGQRFVPIVAEEDGRYSVDYSKFGNTVKVPTPLCTARQLDEDHSLLEALTLASARGPLRKRSVPMAKAKPKFSISPDSLS,303,NP_001159762.1.csv,refseq-KCNJ11-NM_001166290.2_clinical_seed_0_final,refseq-KCNJ11-NM_001166290.2.a2m,Invitae,refseq-KCNJ11-NM_001166290.2.npy,1,303,303
+NP_001161206.1,MGGKAWPRRAVGTAGGPCAEQISAPFQTLLMPHLPLASFRPPFWGLRHSRGLPRFHSVSTQSEPHGSPISRRNREAKQKRLREKQATLEAEIAGESKSPAESIKAWRPKELVLYEIPTKPGEKKDVSGPLPPAYSPRYVEAAWYPWWVREGFFKPEYQARLPQATGETFSMCIPPPNVTGSLHIGHALTVAIQDALVRWHRMRGDQVLWVPGSDHAGIATQAVVEKQLWKERGVRRHELSREAFLREVWQWKEAKGGEICEQLRALGASLDWDRECFTMDVGSSVAVTEAFVRLYKAGLLYRNHQLVNWSCALRSAISDIEVENRPLPGHTQLRLPGCPTPVSFGLLFSVAFPVDGEPDAEVVVGTTRPETLPGDVAVAVHPDDSRYTHLHGRQLRHPLMGQPLPLITDYAVQPHVGTGAVKVTPAHSPADAEMGARHGLSPLNVIAEDGTMTSLCGDWLQGLHRFVAREKIMSVLSEWGLFRGLQNHPMVLPICSRSGDVIEYLLKNQWFVRCQEMGARAAKAVESGALELSPSFHQKNWQHWFSHIGDWCVSRQLWWGHQIPAYLVVEDHAQGEEDCWVVGRSEAEAREVAAELTGRPGAELTLERDPDVLDTWFSSALFPFSALGWPQETPDLARFYPLSLLETGSDLLLFWVGRMVMLGTQLTGQLPFSKVLLHPMVRDRQGRKMSKSLGNVLDPRDIISGVEMQVLQEKLRSGNLDPAELAIVAAAQKKDFPHGIPECGTDALRFTLCSHGVQAGDLHLSVSEVQSCRHFCNKIWNALRFILNALGEKFVPQPAEELSPSSPMDAWILSRLALAAQECERGFLTRELSLVTHALHHFWLHNLCDVYLEAVKPVLWHSPRPLGPPQVLFSCADLGLRLLAPLMPFLAEELWQRLPPRPGCPPAPSISVAPYPSACSLEHWRQPELERRFSRVQEVVQVLRALRATYQLTKARPRVLLQSSEPGDQGLFEAFLEPLGTLGYCGAVGLLPPGAAAPSGWAQAPLSDTAQVYMELQGLVDPQIQLPLLAARRYKLQKQLDSLTARTPSEGEAGTQRQQKLSSLQLELSKLDKAASHLRQLMDEPPAPGSPEL,1093,NP_001161206.1.csv,refseq-VARS2-NM_001167734.1_clinical_seed_0_final,refseq-VARS2-NM_001167734.1.a2m,Invitae,refseq-VARS2-NM_001167734.1.npy,1,1093,1093
+NP_001162625.1,MESENMDSENMKTENMESQNVDFESVSSVTALEALSKLLNPEEEDDSDYGQTNGLSTIGAMGPGNIGPPQIEELKVIPETSEENNEDIWNSEEIPEGAEYDDMWDVREIPEYEIIFRQQVGTEDIFLGLSKKDSSTGCCSELVAKIKLPNTNPSDIQIDIQETILDLRTPQKKLLITLPELVECTSAKAFYIPETETLEITMTMKRELDIANFF,214,NP_001162625.1.csv,refseq-PIH1D3-NM_001169154.1_clinical_seed_0_final,refseq-PIH1D3-NM_001169154.1.a2m,Invitae,refseq-PIH1D3-NM_001169154.1.npy,1,214,214
+NP_001164672.1,MSGAVRGRRREVGLQHGGCGTGGGYGDGLARSGQSWRWWRSLSLGAVGSSAGTEPGRPAGASTFRLLRRRQQRHSGTFSYLDDVPFKTGDKFKTPAKVGLPIGFSLPDCLQVVREVQYDFSLEKKTIEWAEEIKKIEEAEREAECKIAEAEAKVNSKSGPEGDSKMSFSKTHSTATMPPPINPILASLQHNSILTPTRVSSSATKQKVLSPPHIKADFNLADFECEEDPFDNLELKTIDEKEELRNILVGTTGPIMAQLLDNNLPRGGSGSVLQDEEVLASLERATLDFKPLHKPNGFITLPQLGNCEKMSLSSKVSLPPIPAVSNIKSLSFPKLDSDDSNQKTAKLASTFHSTSCLRNGTFQNSLKPSTQSSASELNGHHTLGLSALNLDSGTEMPALTSSQMPSLSVLSVCTEESSPPNTGPTVTPPNFSVSQVPNMPSCPQAYSELQMLSPSERQCVETVVNMGYSYECVLRAMKKKGENIEQILDYLFAHGQLCEKGFDPLLVEEALEMHQCSEEKMMEFLQLMSKFKEMGFELKDIKEVLLLHNNDQDNALEDLMARAGAS,566,NP_001164672.1.csv,refseq-UBAP1-NM_001171201.1_clinical_seed_0_final,refseq-UBAP1-NM_001171201.1.a2m,Invitae,refseq-UBAP1-NM_001171201.1.npy,1,566,566
+NP_001165558.2,MASRKENAKSANRVLRISQLDALELNKALEQLVWSQFTQCFHGFKPGLLARFEPEVKACLWVFLWRFTIYSKNATVGQSVLNIKYKNDFSPNLRYQPPSKNQKIWYAVCTIGGRWLEERCYDLFRNHHLASFGKVKQCVNFVIGLLKLGGLINFLIFLQRGKFATLTERLLGIHSVFCKPQNICEVGFEYMNRELLWHGFAEFLIFLLPLINVQKLKAKLSSWCIPLTGAPNSDNTLATSGKECALCGEWPTMPHTIGCEHIFCYFCAKSSFLFDVYFTCPKCGTEVHSLQPLKSGIEMSEVNAL,305,NP_001165558.2.csv,refseq-PEX2-NM_001172087.2_clinical_seed_0_final,refseq-PEX2-NM_001172087.2.a2m,Invitae,refseq-PEX2-NM_001172087.2.npy,1,305,305
+NP_001165806.1,MDEMATTQISKDELDELKEAFAKVDLNSNGFICDYELHELFKEANMPLPGYKIFQEVKSSDIAKTFRKAINRKEGICALGGTSELSSEGTQHSYSEEEKYAFVNWINKALENDPDCRHVIPMNPNTDDLFKAVGDGIVLCKMINLSVPDTIDERAINKKKLTPFIIQENLNLALNSASAIGCHVVNIGAEDLRAGKPHLVLGLLWQIIKIGLFADIELSRNEALAALLRDGETLEELMKLSPEELLLRWANFHLENSGWQKINNFSADIKDSKAYFHLLNQIAPKGQKEGEPRIDINMSGFNETDDLKRAESMLQQADKLGCRQFVTPADVVSGNPKLNLAFVANLFNKYPALTKPENQDIDWTLLEGETREERTFRNWMNSLGVNPHVNHLYADLQDALVILQLYERIKVPVDWSKVNKPPYPKLGANMKKLENCNYAVELGKHPAKFSLVGIGGQDLNDGNQTLTLALVWQLMRRYTLNVLEDLGDGQKANDDIIVNWVNRTLSEAGKSTSIQSFKDKTISSSLAVVDLIDAIQPGCINYDLVKSGNLTEDDKHNNAKYAVSMARRIGARVYALPEDLVEVKPKMVMTVFACLMGRGMKRV,603,NP_001165806.1.csv,refseq-PLS3-NM_001172335.2_clinical_seed_0_final,refseq-PLS3-NM_001172335.2.a2m,Invitae,refseq-PLS3-NM_001172335.2.npy,1,603,603
+NP_001165948.1,MLLLRLPPHRSHASPLDCKLQDRCRKCYSPRSGQACPPALAAAWLRRCERRGGRPRGGRRKELTLGLRPARCSAPGPAKDDAWRPQAGRSSSDTNESEIKSNEEPLLRKSSRRFVIFPIQYPDIWKMYKQAQASFWTAEEVDLSKDLPHWNKLKADEKYFISHILAFFAASDGIVNENLVERFSQEVQVPEARCFYGFQILIENVHSEMYSLLIDTYIRDPKKREFLFNAIETMPYVKKKADWALRWIADRKSTFGERVVAFAAVEGVFFSGSFAAIFWLKKRGLMPGLTFSNELISRDEGLHCDFACLMFQYLVNKPSEERVREIIVDAVKIEQEFLTEALPVGLIGMNCILMKQYIEFVADRLLVELGFSKVFQAENPFDFMENISLEGKTNFFEKRVSEYQRFAVMAETTDNVFTLDADF,423,NP_001165948.1.csv,NP_001165948.1_colabfold_clinical_seed_0_final,NP_001165948.1_colabfold.a2m,colabfold,NP_001165948.1_colabfold_theta_0.2.npy,1,423,423
+NP_001166167.1,MSLLRSLRVFLVARTGSYPAGSLLRQSPQPRHTFYAGPRLSASASSKELLMKLRRKTGYSFVNCKKALETCGGDLKQAEIWLHKEAQKEGWSKAAKLQGRKTKEGLIGLLQEGNTTVLVEVNCETDFVSRNLKFQLLVQQVALGTMMHCQTLKDQPSAYSKVQWLTPVNLALWEAEAGGSLEGFLNSSELSGLPAGPDREGSLKDQLALAIGKLGENMILKRAAWVKVPSGFYVGSYVHGAMQSPSLHKLVLGKYGALVICETSEQKTNLEDVGRRLGQHVVGMAPLSVGSLDDEPGGEAETKMLSQPYLLDPSITLGQYVQPQGVSVVDFVRFECGEGEEAAETE,346,NP_001166167.1.csv,refseq-TSFM-NM_001172696.1_clinical_seed_0_final,refseq-TSFM-NM_001172696.1.a2m,Invitae,refseq-TSFM-NM_001172696.1.npy,1,346,346
+NP_001166939.1,MAAVSGLVRRPLREVSGLLKRRFHWTAPAALQVTVRDAINQGMDEELERDEKVFLLGEEVAQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMAGLRPICEFMTFNFSMQAIDQVINSAAKTYYMSGVAAQHSQCFAAWYGHCPGLKVVSPWNSEDAKGLIKSAIRDNNPVVVLENELMYGVPFEFPPEAQSKDFLIPIGKAKIERQGTHITVVSHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDMETIEASVMKTNHLVTVEGGWPQFGVGAEICARIMEGPAFNFLDAPAVRVTGADVPMPYAKILEDNSIPQVKDIIFAIKKTLNI,341,NP_001166939.1.csv,refseq-PDHB-NM_001173468.1_clinical_seed_0_final,refseq-PDHB-NM_001173468.1.a2m,Invitae,refseq-PDHB-NM_001173468.1_theta_0.2.npy,1,341,341
+NP_001167621.1,MFSLMASCCGWFKRWREPVRKVTLLMVGLDNAGKTATAKGIQGEYPEDVAPTVGFSKINLRQGKFEVTIFDLGGGIRIRGIWKNYYAESYGVIFVVDSSDEERMEETKEAMSEMLRHPRISGKPILVLANKQDKEGALGEADVIECLSLEKLVNEHKCLCQIEPCSAISGYGKKIDKSIKKGLYWLLHVIARDFDALNERIQKETTEQRALEEQEKQERAERVRKLREERKQNEQEQAELDGTSGLAELDPEPTNPFQPIASVIIENEGKLEREKKNQKMEKDSDGCHLKHKMEHEQIETQGQVNHNGQKNNEFGLVENYKEALTQQLKNEDETDRPSLESANGKKKTKKLRMKRNHRVEPLNIDDCAPESPTPPPPPPPVGWGTPKVTRLPKLEPLGETHHNDFYRKPLPPLAVPQRPNSDAHDVIS,428,NP_001167621.1.csv,refseq-ARL13B-NM_001174150.1_clinical_seed_0_final,refseq-ARL13B-NM_001174150.1.a2m,Invitae,refseq-ARL13B-NM_001174150.1.npy,1,428,428
+NP_001170787.2,MPSSLPGSQVPHPTLDAVDLVEKTLRNEGTSSSAPVLEEGDTDPWTLPQLKDTSQPWKELRVAGRLRRVAGSVLKACGLLGSLYFFICSLDVLSSAFQLLGSKVAGDIFKDNVVLSNPVAGLVIGVLVTALVQSSSTSSSIVVSMVAAKLLTVRVSVPIIMGVNVGTSITSTLVSMAQSGDRDEFQRAFSGSAVHGIFNWLTVLVLLPLESATALLERLSELALGAASLTPRAQAPDILKVLTKPLTHLIVQLDSDMIMSSATGNATNSSLIKHWCGTTGQPTQENSSCGAFGPCTEKNSTAPADRLPCRHLFAGTELTDLAVGCILLAGSLLVLCGCLVLIVKLLNSVLRGRVAQVVRTVINADFPFPLGWLGGYLAVLAGAGLTFALQSSSVFTAAVVPLMGVGVISLDRAYPLLLGSNIGTTTTALLAALASPADRMLSALQVALIHFFFNLAGILLWYLVPALRLPIPLARHFGVVTARYRWVAGVYLLLGFLLLPLAAFGLSLAGGMELAAVGGPLVGLVLLVILVTVLQRRRPAWLPVRLRSWAWLPVWLHSLEPWDRLVTRCCPCNVCSPPKATTKEAYCYENPEILASQQL,599,NP_001170787.2.csv,refseq-SLC34A3-NM_001177316.2_clinical_seed_0_final,refseq-SLC34A3-NM_001177316.2.a2m,Invitae,refseq-SLC34A3-NM_001177316.2.npy,1,599,599
+NP_001170991.1,MPWSFRSSTPTWLRMSSCSWEMTYNTNAQVPDSAGTATAYLCGVKANEGTVGVSAATERSRCNTTQGNEVTSILRWAKDAGKSVGIVTTTRVNHATPSAAYAHSADRDWYSDNEMPPEALSQGCKDIAYQLMHNIRDIDVIMGGGRKYMYPKNKTDVEYESDEKARGTRLDGLDLVDTWKSFKPRYKHSHFIWNRTELLTLDPHNVDYLLGLFEPGDMQYELNRNNVTDPSLSEMVVVAIQILRKNPKGFFLLVEGGRIDHGHHEGKAKQALHEAVEMDRAIGQAGSLTSSEDTLTVVTADHSHVFTFGGYTPRGNSIFGLAPMLSDTDKKPFTAILYGNGPGYKVVGGERENVSMVDYAHNNYQAQSAVPLRHETHGGEDVAVFSKGPMAHLLHGVHEQNYVPHVMAYAACIGANLGHCAPASSAGSLAAGPLLLALALYPLSVLF,447,NP_001170991.1.csv,refseq-ALPL-NM_001177520.3_clinical_seed_0_final,refseq-ALPL-NM_001177520.3.a2m,Invitae,refseq-ALPL-NM_001177520.3_theta_0.2.npy,1,447,447
+NP_001171481.1,MFVSDFRKEFYEVVQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAFLEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKLLIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARSGSGSEPVAAALEKSSRLFAGPMSDRTAPRSPRRDILFDYEQYEYHGTSSAMVMFELAWMLSKDLNDMLWWAIVGLTDQWVQDKITQMKYVTDVGVLQRHVSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFLDALISLLS,598,NP_001171481.1.csv,refseq-CDC45-NM_001178010.2_clinical_seed_0_final,refseq-CDC45-NM_001178010.2.a2m,Invitae,refseq-CDC45-NM_001178010.2.npy,1,598,598
+NP_001171536.2,MAFYSCCWVLLALTWHTSAYGPDQRAQKKGDIILGGLFPIHFGVAAKDQDLKSRPESVECIRYNFRGFRWLQAMIFAIEEINSSPALLPNLTLGYRIFDTCNTVSKALEATLSFVAQNKIDSLNLDEFCNCSEHIPSTIAVVGATGSGVSTAVANLLGLFYIPQVSYASSSRLLSNKNQFKSFLRTIPNDEHQATAMADIIEYFRWNWVGTIAADDDYGRPGIEKFREEAEERDICIDFSELISQYSDEEEIQHVVEVIQNSTAKVIVVFSSGPDLEPLIKEIVRRNITGKIWLASEAWASSSLIAMPQYFHVVGGTIGFALKAGQIPGFREFLKKVHPRKSVHNGFAKEFWEETFNCHLQEGAKGPLPVDTFLRGHEESGDRFSNSSTAFRPLCTGDENISSVETPYIDYTHLRISYNVYLAVYSIAHALQDIYTCLPGRGLFTNGSCADIKKVEAWQVLKHLRHLNFTNNMGEQVTFDECGDLVGNYSIINWHLSPEDGSIVFKEVGYYNVYAKKGERLFINEEKILWSGFSREPLTFVLSVLQVPFSNCSRDCLAGTRKGIIEGEPTCCFECVECPDGEYSDETDASACNKCPDDFWSNENHTSCIAKEIEFLSWTEPFGIALTLFAVLGIFLTAFVLGVFIKFRNTPIVKATNRELSYLLLFSLLCCFSSSLFFIGEPQDWTCRLRQPAFGISFVLCISCILVKTNRVLLVFEAKIPTSFHRKWWGLNLQFLLVFLCTFMQIVICVIWLYTAPPSSYRNQELEDEIIFITCHEGSLMALGFLIGYTCLLAAICFFFAFKSRKLPENFNEAKFITFSMLIFFIVWISFIPAYASTYGKFVSAVEVIAILAASFGLLACIFFNKIYIILFKPSRNTIEEVRCSTAAHAFKVAARATLRRSNVSRKRSSSLGGSTGSTPSSSISSKSNSEDPFPQPERQKQQQPLALTQQEQQQQPLTLPQQQRSQQQPRCKQKVIFGSGTVTFSLSFDEPQKNAMAHRNSTHQNSLEAQKSSDTLTRHEPLLPLQCGETDLDLTVQETGLQGPVGGDQRPEVEDPEELSPALVVSSSQSFVISGGGSTVTENVVNS,1088,NP_001171536.2.csv,refseq-CASR-NM_001178065.2_clinical_seed_0_final,refseq-CASR-NM_001178065.2.a2m,Invitae,refseq-CASR-NM_001178065.2.npy,1,1088,1088
+NP_001171582.1,MAEKVNNFPPLPKFIPLKPCFYQDFEADIPPQHVSMTKRLYYLWMLNSVTLAVNLVGCLAWLIGGGGATNFGLAFLWLILFTPCSYVCWFRPIYKAFKTDSSFSFMAFFFTFMAQLVISIIQAVGIPGWGVCGWIATISFFGTNIGSAVVMLIPTVMFTVMAVFSFIALSMVHKFYRGSGGSFSKAQEEWTTGAWKNPHVQQAAQNAAMGAAQGAMNQPQTQYSATPNYTYSNEM,235,NP_001171582.1.csv,refseq-SCAMP5-NM_001178111.1_clinical_seed_0_final,refseq-SCAMP5-NM_001178111.1.a2m,Invitae,refseq-SCAMP5-NM_001178111.1.npy,1,235,235
+NP_001171634.3,MAFVTRQFMRSVSSSSTASASAKKIIVKHVTVIGGGLMGAGIAQVAAATGHTVVLVDQTEDILAKSKKGIEESLRKVAKKKFAENLKAGDEFVEKTLSTIATSTDAASVVHSTDLVVEAIVENLKVKNELFKRLDKFAAEHTIFASNTSSLQITSIANATTRQDRFAGLHFFNPVPVMKLVEVIKTPMTSQKTFESLVDFSKALGKHPVSCKDTPGFIVNRLLVPYLMEAIRLYERDFQTCGDSNSGLGFSLKGDASKEDIDTAMKLGAGYPMGPFELLDYVGLDTTKFIVDGWHEMDAENPLHQPSPSLNKLVAENKFGKKTGEGFYKYK,331,NP_001171634.3.csv,refseq-HADH-NM_001184705.4_clinical_seed_0_final,refseq-HADH-NM_001184705.4.a2m,Invitae,refseq-HADH-NM_001184705.4_theta_0.2.npy,1,331,331
+NP_001171809.1,MESLLLPVLLLLAILWTQAAALINLKYSVEEEQRAGTVIANVAKDAREAGFALDPRQASAFRVVSNSAPHLVDINPSSGLLVTKQKIDRDLLCRQSPKCIISLEVMSSSMEICVIKVEIKDLNDNAPSFPAAQIELEISEAASPGTRIPLDSAYDPDSGSFGVQTYELTPNELFGLEIKTRGDGSRFAELVVEKSLDRETQSHYSFRITALDGGDPPRLGTVGLSIKVTDSNDNNPVFSESTYAVSVPENSPPNTPVIRLNASDPDEGTNGQVVYSFYGYVNDRTRELFQIDPHSGLVTVTGALDYEEGHVYELDVQAKDLGPNSIPAHCKVTVSVLDTNDNPPVINLLSVNSELVEVSESAPPGYVIALVRVSDRDSGLNGRVQCRLLGNVPFRLQEYESFSTILVDGRLDREQHDQYNLTIQARDGGVPMLQSAKSFTVLITDENDNHPHFSKPYYQVIVQENNTPGAYLLSVSARDPDLGLNGSVSYQIVPSQVRDMPVFTYVSINPNSGDIYALRSFNHEQTKAFEFKVLAKDGGLPSLQSNATVRVIILDVNDNTPVITAPPLINGTAEVYIPRNSGIGYLVTVVKAEDYDEGENGRVTYDMTEGDRGFFEIDQVNGEVRTTRTFGESSKSSYELIVVAHDHGKTSLSASALVLIYLSPALDAQESMGSVNLSLIFIIALGSIAGILFVTMIFVAIKCKRDNKEIRTYNCSNCLTITCLLGCFIKGQNSKCLHCISVSPISEEQDKKTEEKVSLRGKRIAEYSYGHQKKSSKKKKISKNDIRLVPRDVEETDKMNVVSCSSLTSSLNYFDYHQQTLPLGCRRSESTFLNVENQNTRNTSANHIYHHSFNSQGPQQPDLIINGVPLPETENYSFDSNYVNSRAHLIKSSSTFKDLEGNSLKDSGHEESDQTDSEHDVQRSLYCDTAVNDVLNTSVTSMGSQMPDHDQNEGFHCREECRILGHSDRCWMPRNPMPIRSKSPEHVRNIIALSIEATAADVEAYDDCGPTKRTFATFGKDVSDHPAEERPTLKGKRTVDVTICSPKVNSVIREAGNGCEAISPVTSPLHLKSSLPTKPSVSYTIALAPPARDLEQYVNNVNNGPTRPSEAEPRGADSEKVMHEVSPILKEGRNKESPGVKRLKDIVL,1148,NP_001171809.1.csv,refseq-PCDH19-NM_001184880.1_clinical_seed_0_final,refseq-PCDH19-NM_001184880.1.a2m,Invitae,refseq-PCDH19-NM_001184880.1.npy,1,1148,1148
+NP_001171818.1,MSQGRGKYDFYIGLGLAMSSSIFIGGSFILKKKGLLRLARKGSMRAGQGGHAYLKEWLWWAGLLSMGAGEVANFAAYAFAPATLVTPLGALSVLVSAILSSYFLNERLNLHGKIGCLLSILGSTVMVIHAPKEEEIETLNEMSHKLGDPGFVVFATLVVIVALILIFVVGPRHGQTNILVYITICSVIGAFSVSCVKGLGIAIKELFAGKPVLRHPLAWILLLSLIVCVSTQINYLNRALDIFNTSIVTPIYYVFFTTSVLTCSAILFKEWQDMPVDDVIGTLSGFFTIIVGIFLLHAFKDVSFSLASLPVSFRKDEKAMNGNLSNMYEVLNNNEESLTCGIEQHTGENVSRRNGNLTAF,360,NP_001171818.1.csv,refseq-NIPA2-NM_001184889.1_clinical_seed_0_final,refseq-NIPA2-NM_001184889.1.a2m,Invitae,refseq-NIPA2-NM_001184889.1.npy,1,360,360
+NP_001174.2,MMAAMATARVRMGPRCAQALWRMPWLPVFLSLAAAAAAAAAEQQVPLVLWSSDRDLWAPAADTHEGHITSDLQLSTYLDPALELGPRNVLLFLQDKLSIEDFTAYGGVFGNKQDSAFSNLENALDLAPSSLVLPAVDWYAVSTLTTYLQEKLGASPLHVDLATLRELKLNASLPALLLIRLPYTASSGLMAPREVLTGNDEVIGQVLSTLKSEDVPYTAALTAVRPSRVARDVAVVAGGLGRQLLQKQPVSPVIHPPVSYNDTAPRILFWAQNFSVAYKDQWEDLTPLTFGVQELNLTGSFWNDSFARLSLTYERLFGTTVTFKFILANRLYPVSARHWFTMERLEVHSNGSVAYFNASQVTGPSIYSFHCEYVSSLSKKGSLLVARTQPSPWQMMLQDFQIQAFNVMGEQFSYASDCASFFSPGIWMGLLTSLFMLFIFTYGLHMILSLKTMDRFDDHKGPTISLTQIV,470,NP_001174.2.csv,refseq-ATP6AP1-NM_001183.4_clinical_seed_0_final,refseq-ATP6AP1-NM_001183.4.a2m,Invitae,refseq-ATP6AP1-NM_001183.4_theta_0.2.npy,1,470,470
+NP_001177203.1,MNSVRAANRRPRRVSRPRPVQQQQQQPPQQPPPQPPQQQPPQQQPPPPPQQQQQQQPPPPPPPPPPLPQERNNVGERDDDVPADMVAEESGPGAQNSPYQLRRKTLLPKRTACPTKNSMEGASTSTTENFGHRAKRARVSGKSQDLSAAPAEQYLQEKLPDEVVLKIFSYLLEQDLCRAACVCKRFSELANDPILWKRLYMEVFEYTRPMMHPEPGKFYQINPEEYEHPNPWKESFQQLYKGAHVKPGFAEHFYSNPARYKGRENMLYYDTIEDALGGVQEAHFDGLIFVHSGIYTDEWIYIESPITMIGAAPGKVADKVIIENTRDSTFVFMEGSEDAYVGYMTIRFNPDDKSAQHHNAHHCLEITVNCSPIIDHCIIRSTCTVGSAVCVSGQGACPTIKHCNISDCENVGLYITDHAQGIYEDNEISNNALAGIWVKNHGNPIIRRNHIHHGRDVGVFTFDHGMGYFESCNIHRNRIAGFEVKAYANPTVVRCEIHHGQTGGIYVHEKGRGQFIENKIYANNFAGVWITSNSDPTIRGNSIFNGNQGGVYIFGDGRGLIEGNDIYGNALAGIQIRTNSCPIVRHNKIHDGQHGGIYVHEKGQGVIEENEVYSNTLAGVWVTTGSTPVLRRNRIHSGKQVGVYFYDNGHGVLEDNDIYNHMYSGVQIRTGSNPKIRRNKIWGGQNGGILVYNSGLGCIEDNEIFDNAMAGVWIKTDSNPTLRRNKIHDGRDGGICIFNGGRGLLEENDIFRNAQAGVLISTNSHPILRKNRIFDGFAAGIEITNHATATLEGNQIFNNRFGGLFLASGVNVTMKDNKIMNNQDAIEKAVSRGQCLYKISSYTSYPMHDFYRCHTCNTTDRNAICVNCIKKCHQGHDVEFIRHDRFFCDCGAGTLSNPCTLAGEPTHDTDTLYDSAPPIESNTLQHN,927,NP_001177203.1.csv,refseq-FBXO11-NM_001190274.1_clinical_seed_0_final,refseq-FBXO11-NM_001190274.1.a2m,Invitae,refseq-FBXO11-NM_001190274.1.npy,1,927,927
+NP_001177666.1,MMYSPICLTQDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEKRMSKDEERAVKDELLSEKPEIKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHPCCVLSNPDQKGKIRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLMKSPHCTNPALCVQPHHITVSVKELDLFLAYYVQEQDSGQSGSPSHNDPAKNPPGYLEDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSQPYYHDMNSGVNLQRSLSSPPSSKRPKTISIDENMEPSPTGDFYPSPSSPAAGSRTWHERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIPGVAHSVISTRTPPPPSPLPFPTQAILPPAPSSYFSHPTIRYPPHLNPQDTLKNYVPSYDPSSPQTSQPNGSGQVVGKVPGHFTPVLAPSPHPSAVRPVTLSMTDTKPITTSTEAYTASGTSQANRYVGLSPRDPSFLHQQQSWYLG,494,NP_001177666.1.csv,refseq-NFIB-NM_001190737.1_clinical_seed_0_final,refseq-NFIB-NM_001190737.1.a2m,Invitae,refseq-NFIB-NM_001190737.1.npy,1,494,494
+NP_001177716.1,MQACGGGAAGRRAFDSICPNRMLALPGRALLCKPGKPERKFAPPRKFFPGCTGGSPVSVYEDPPDAEPTALPALTTIDLQDLADCSSLLGSDAPPGGDLAASQNHSHQTEADFNLQDFRDTVDDLISDSSSMMSPTLASGDFPFSPCDISPFGPCLSPPLDPRALQSPPLRPPDVPPPEQYWKEVADQNQRALGDALVENNQLHVTLTQKQEEIASLKERNVQLKELASRTRHLASVLDKLMITQSRDCGAAAEPFLLKAKAKRSLEELVSAAGQDCAEVDAILREISERCDEALQSRDPKRPRLLPEPANTDTRPGNLHGAFRGLRTDCSRSALNLSHSELEEGGSFSTRIRSHSTIRTLAFPQGNAFTIRTANGGYKFRWVPS,385,NP_001177716.1.csv,refseq-MCIDAS-NM_001190787.1_clinical_seed_0_final,refseq-MCIDAS-NM_001190787.1.a2m,Invitae,refseq-MCIDAS-NM_001190787.1.npy,1,385,385
+NP_001180345.1,MSHVAVENALGLDQQFAGLDLNSSDNQSGGSTASKGRYIPPHLRNREATKGFYDKDSSGWSSSKDKDAYSSFGSRSDSRGKSSFFSDRGSGSRGRFDDRGRSDYDGIGSRGDRSGFGKFERGGNSRWCDKSDEDDWSKPLPPSERLEQELFSGGNTGINFEKYDDIPVEATGNNCPPHIESFSDVEMGEIIMGNIELTRYTRPTPVQKHAIPIIKEKRDLMACAQTGSGKTAAFLLPILSQIYSDGPGEALRAMKENGRYGRRKQYPISLVLAPTRELAVQIYEEARKFSYRSRVRPCVVYGGADIGQQIRDLERGCHLLVATPGRLVDMMERGKIGLDFCKYLVLDEADRMLDMGFEPQIRRIVEQDTMPPKGVRHTMMFSATFPKEIQMLARDFLDEYIFLAVGRVGSTSENITQKVVWVEESDKRSFLLDLLNATGKDSLTLVFVETKKGADSLEDFLYHEGYACTSIHGDRSQRDREEALHQFRSGKSPILVATAVAARGLDISNVKHVINFDLPSDIEEYVHRIGRTGRVGNLGLATSFFNERNINITKDLLDLLVEAKQEVPSWLENMAYEHHYKGSSRGRSKSRFSGGFGARDYRQSSGASSSSFSSSRASSSRSGGGGHGSSRGFGGGGYGGFYNSDGYGGNYNSQGVDWWGN,661,NP_001180345.1.csv,refseq-DDX3X-NM_001193416.2_clinical_seed_0_final,refseq-DDX3X-NM_001193416.2.a2m,Invitae,refseq-DDX3X-NM_001193416.2.npy,1,661,661
+NP_001181939.1,MMRVNGDDDSVAALSFLYDYYMGPKEKRILSSSTGGRNDQGKRYYHGMEYETDLTPLESPTHLMKFLTENVSGTPEYPDLLKKNNLMSLEGALPTPGKAAPLPAGPSKLEAGSVDSYLLPTTDMYDNGSLNSLFESIHGVPPTQRWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFEYTLGSPKAIHIKSGESPMAYLNKGQFYPVTLRTPAGGKGLALSSNKVKSVVMVVFDNEKVPVEQLRFWKHWHSRQPTAKQRVIDVADCKENFNTVEHIEEVAYNALSFVWNVNEEAKVFIGVNCLSTDFSSQKGVKGVPLNLQIDTYDCGLGTERLVHRAVCQIKIFCDKGAERKMRDDERKQFRRKVKCPDSSNSGVKGCLLSGFRGNETTYLRPETDLETPPVLFIPNVHFSSLQRSGGAAPSAGPSSSNRLPLKRTCSPFTEEFEPLPSKQAKEGDLQRVLLYVRRETEEVFDALMLKTPDLKGLRNAISEKYGFPEENIYKVYKKCKRGILVNMDNNIIQHYSNHVAFLLDMGELDGKIQIILKEL,556,NP_001181939.1.csv,refseq-GRHL3-NM_001195010.1_clinical_seed_0_final,refseq-GRHL3-NM_001195010.1.a2m,Invitae,refseq-GRHL3-NM_001195010.1.npy,1,556,556
+NP_001182058.1,MLLAVLLLLPLPSSWFAHGHPLYTRLPPSALQVLSAQGTQALQAAQRSAQWAINRVAMEIQHRSHECRGSGRPRPQALLQDPPEPGPCGERRPSTANVTRAHGRIVGGSAAPPGAWPWLVRLQLGGQPLCGGVLVAASWVLTAAHCFVGAPNELLWTVTLAEGSRGEQAEEVPVNRILPHPKFDPRTFHNDLALVQLWTPVSPGGSARPVCLPQEPQEPPAGTACAIAGWGALFEDGPEAEAVREARVPLLSTDTCRRALGPGLRPSTMLCAGYLAGGVDSCQGDSGGPLTCSEPGPRPREVLFGVTSWGDGCGEPGKPGVYTRVAVFKDWLQEQMSASSSREPSCRELLAWDPPQELQADAARLCAFYARLCPGSQGACARLAHQQCLQRRRRCELRSLAHTLLGLLRNAQELLGPRPGLRRLAPALALPAPALRESPLHPARELRLHSGSRAAGTRFPKRRPEPRGEANGCPGLEPLRQKLAALQGAHAWILQVPSEHLAMNFHEVLADLGSKTLTGLFRAWVRAGLGGRHVAFSGLVGLEPATLARSLPRLLVQALQAFRVAALAEGEPEGPWMDVGQGPGLERKGHHPLNPQVPPARQP,603,NP_001182058.1.csv,refseq-PRSS56-NM_001195129.1_clinical_seed_0_final,refseq-PRSS56-NM_001195129.1.a2m,Invitae,refseq-PRSS56-NM_001195129.1.npy,1,603,603
+NP_001182192.1,MAQGFAVGFDPLGLGDLSSGSLSSLSSRGHLGSDSGSTATRYLLRKQQRLLNGPPRGIRASSPMGRVILINSPIEANSDESDIIHSVRVEKSPAGRLGFSVRGGSEHGLGIFVSKVEEGSSAERAGLCVGDKITEVNGLSLESTTMGSAVKVLTSSSRLHMMVRRMGRVPGIKFSKEKTTWVDVVNRRLVVEKCGSTPSDTSSEDGVRRIVHLYTTSDDFCLGFNIRGGKEFGLGIYVSKVDHGGLAEENGIKVGDQVLAANGVRFDDISHSQAVEVLKGQTHIMLTIKETGRYPAYKEMVSEYCWLDRLSNGVLQQLSPASESSSSVSSCASSAPYSSGSLPSDRMDICLGQEEPGSRGPGWGRADTAMQTEPDAGGRVETWCSVRPTVILRDTAIRSDGPHPGRRLDSALSESPKTALLLALSRPRPPITRSQSYLTLWEEKQQRKKEKSGSPGEKGALQRSKTLMNLFFKGGRQGRLARDGRREAWTLDSGSLAKTYPRLDIEKAGGVGPVQKFVTWRLRRDQERGRALLSARSGSPSSQLPNVDEQVQAWESRRPLIQDLAQRLLTDDEVLAVTRHCSRYVHEGGIEDLVRPLLAILDRPEKLLLLQDIRSVVAPTDLGRFDSMVMLVELEAFEALKSRAVRPPALRPARQDTPPKRHLITPVPDSRGGFYLLPVNGFPEEEDNGELRERLGALKVSPSASAPRHPHKGIPPLQDVPVDAFTPLRIACTPPPQLPPVAPRPLRPNWLLTEPLSREHPPQSQIRGRAQSRSRSRSRSRSRSSRGQGKSPGRRSPSPVPTPAPSMTNGRYHKPRKARPPLPRPLDGEAAKVGAKQGPSESGTEGTAKEAAMKNPSGELKTVTLSKMKQSLGISISGGIESKVQPMVKIEKIFPGGAAFLSGALQAGFELVAVDGENLEQVTHQRAVDTIRRAYRNKAREPMELVVRVPGPSPRPSPSDSSALTDGGLPADHLPAHQPLDAAPVPAHWLPEPPTNPQTPPTDARLLQPTPSPAPSPALQTPDSKPAPSPRIP,1033,NP_001182192.1.csv,refseq-PDZD7-NM_001195263.1_clinical_seed_0_final,refseq-PDZD7-NM_001195263.1.a2m,Invitae,refseq-PDZD7-NM_001195263.1.npy,1,1033,1033
+NP_001182399.1,MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGAKMQGVPLKHSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLEDLPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWYKHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPLPNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPPSRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQSPSPTTLGKGESRGVFLPGLPTPAPWLGAAPQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRLPPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTLFSVKLEEELSVEGNTDINTDLKD,795,NP_001182399.1.csv,refseq-SATB1-NM_001195470.2_clinical_seed_0_final,refseq-SATB1-NM_001195470.2.a2m,Invitae,refseq-SATB1-NM_001195470.2.npy,1,795,795
+NP_001185.3,MDARGGGGRPGESPGATPAPGPPPPPPPAPPQQQPPPPPPPAPPPGPGPAPPQHPPRAEALPPEAADEGGPRGRLRSRDSSCGRPGTPGAASTAKGSPNGECGRGEPQCSPAGPEGPARGPKVSFSCRGAASGPAPGPGPAEEAGSEEAGPAGEPRGSQASFMQRQFGALLQPGVNKFSLRMFGSQKAVEREQERVKSAGAWIIHPYSDFRFYWDFTMLLFMVGNLIIIPVGITFFKDETTAPWIVFNVVSDTFFLMDLVLNFRTGIVIEDNTEIILDPEKIKKKYLRTWFVVDFVSSIPVDYIFLIVEKGIDSEVYKTARALRIVRFTKILSLLRLLRLSRLIRYIHQWEEIFHMTYDLASAVMRICNLISMMLLLCHWDGCLQFLVPMLQDFPRNCWVSINGMVNHSWSELYSFALFKAMSHMLCIGYGRQAPESMTDIWLTMLSMIVGATCYAMFIGHATALIQSLDSSRRQYQEKYKQVEQYMSFHKLPADFRQKIHDYYEHRYQGKMFDEDSILGELNGPLREEIVNFNCRKLVASMPLFANADPNFVTAMLTKLKFEVFQPGDYIIREGTIGKKMYFIQHGVVSVLTKGNKEMKLSDGSYFGEICLLTRGRRTASVRADTYCRLYSLSVDNFNEVLEEYPMMRRAFETVAIDRLDRIGKKNSILLHKVQHDLNSGVFNNQENAIIQEIVKYDREMVQQAELGQRVGLFPPPPPPPQVTSAIATLQQAAAMSFCPQVARPLVGPLALGSPRLVRRPPPGPAPAAASPGPPPPASPPGAPASPRAPRTSPYGGLPAAPLAGPALPARRLSRASRPLSASQPSLPHGAPGPAASTRPASSSTPRLGPTPAARAAAPSPDRRDSASPGAAGGLDPQDSARSRLSSNL,889,NP_001185.3.csv,refseq-HCN2-NM_001194.3_clinical_seed_0_final,refseq-HCN2-NM_001194.3.a2m,Invitae,refseq-HCN2-NM_001194.3.npy,1,889,889
+NP_001185797.1,MSEEIITPVYCTGVSAQVQKQRARELGLGRHENAIKYLGQDYEQLRVRCLQSGTLFRDEAFPPVPQSLGYKDLGPNSSKTYGIKWKRPTELLSNPQFIVDGATRTDICQGALGDCWLLAAIASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQLWQFGEWVDVVVDDLLPIKDGKLVFVHSAEGNEFWSALLEKAYAKVNGSYEALSGGSTSEGFEDFTGGVTEWYELRKAPSDLYQIILKALERGSLLGCSIDISSVLDMEAITFKKLVKGHAYSVTGAKQVNYRGQVVSLIRMRNPWGEVEWTGAWSDSSSEWNNVDPYERDQLRVKMEDGEFWMSFRDFMREFTRLEICNLTPDALKSRTIRKWNTTLYEGTWRRGSTAGGCRNYPATFWVNPQFKIRLDETDDPDDYGDRESGCSFVLALMQKHRRRERRFGRDMETIGFAVYEVPPELVGQPAVHLKRDFFLANASRARSEQFINLREVSTRFRLPPGEYVVVPSTFEPNKEGDFVLRFFSEKSAGTVELDDQIQANLPDEQVLSEEEIDENFKALFRQLAGEDMEISVKELRTILNRIISKHKDLRTKGFSLESCRSMVNLMDRDGNGKLGLVEFNILWNRIRNYLSIFRKFDLDKSGSMSAYEMRMAIESAGFKLNKKLYELIITRYSEPDLAVDFDNFVCCLVRLETMFRFFKTLDTDLDGVVTFDLFKWLQLTMFA,714,NP_001185797.1.csv,refseq-CAPN1-NM_001198868.1_clinical_seed_0_final,refseq-CAPN1-NM_001198868.1.a2m,Invitae,refseq-CAPN1-NM_001198868.1.npy,1,714,714
+NP_001185885.1,MSRGGSYPHLLWDVRKRSLGLEDPSRLRSRYLGRREFIQRLKLEATLNVHDGCVNTICWNDTGEYILSGSDDTKLVISNPYSRKVLTTIRSGHRANIFSAKFLPCTNDKQIVSCSGDGVIFYTNVEQDAETNRQCQFTCHYGTTYEIMTVPNDPYTFLSCGEDGTVRWFDTRIKTSCTKEDCKDDILINCRRAATSVAICPPIPYYLAVGCSDSSVRIYDRRMLGTRATGNYAGRGTTGMVARFIPSHLNNKSCRVTSLCYSEDGQEILVSYSSDYIYLFDPKDDTARELKTPSAEERREELRQPPVKRLRLRGDWSDTGPRARPESERERDGEQSPNVSLMQRMSDMLSRWFEEASEVAQSNRGRGRSRPRGGTSQSDISTLPTVPSSPDLEVSETAMEVDTPAEQFLQPSTSSTMSAQAHSTSSPTESPHSTPLLSSPDSEQRQSVEASGHHTHHQSEFLRGPEIALLRKRLQQLRLKKAEQQRQQELAAHTQQQPSTSDQSSHEGSSQDPHASDSPSSVVNKQLGSMSLDEQQDNNNEKLSPKPGTGEPVLSLHYSTEGTTTSTIKLNFTDEWSSIASSSRGIGSHCKSEGQEESFVPQSSVQPPEGDSETKAPEESSEDVTKYQEGVSAENPVENHINITQSDKFTAKPLDSNSGERNDLNLDRSCGVPEESASSEKAKEPETSDQTSTESATNENNTNPEPQFQTEATGPSAHEETSTRDSALQDTDDSDDDPVLIPGARYRAGPGDRFNIRGTTIGDRIMRRSAVARIQEFFRRRKERKEMEELDTLNIRRPLVKMVYKGHRNSRTMIKEANFWGANFVMSGSDCGHIFIWDRHTAEHLMLLEADNHVVNCLQPHPFDPILASSGIDYDIKIWSPLEESRIFNRKLADEVITRNELMLEETRNTITVPASFMLRMLASLNHIRADRLEGDRSEGSGQENENEDEE,951,NP_001185885.1.csv,NP_001185885.1_colabfold_clinical_seed_0_final,NP_001185885.1_colabfold.a2m,colabfold,NP_001185885.1_colabfold_theta_0.2.npy,1,951,951
+NP_001186.1,MYRRSYVFQTRKEQYEHADEASRAAEPERPADEGWAGATSLAALQGLGERVAAHVQRARALEQRHAGLRRQLDAFQRLGELAGPEDALARQVESNRQRVRDLEAERARLERQGTEAQRALDEFRSKYENECECQLLLKEMLERLNKEADEALLHNLRLQLEAQFLQDDISAAKDRHKKNLLEVQTYISILQQIIHTTPPASIVTSGMREEKLLTEREVAALRSQLEEGREVLSHLQAQRVELQAQTTTLEQAIKSAHECYDDEIQLYNEQIETLRKEIEETERVLEKSSYDCRQLAVAQQTLKNELDRYHRIIEIEGNRLTSAFIETPIPLFTQSHGVSLSTGSGGKDLTRALQDITAAKPRQKALPKNVPRRKEIITKDKTNGALEDAPLKGLEDTKLVQVVLKEESESKFESESKEVSPLTQEGAPEDVPDGGQISKGFGKLYRKVKEKVRSPKEPETPTELYTKERHVLVTGDANYVDPRFYVSSITAKGGVAVSVAEDSVLYDGQVEPSPESPKPPLENGQVGLQEKEDGQPIDQQPIDKEIEPDGAELEGPEEKREGEERDEESRRPCAMVTPGAEEPSIPEPPKPAADQDGAEVLGTRSRSLPEKGPPKALAYKTVEVVESIEKISTESIQTYEETAVIVETMIGKTKSDKKKSGEKSS,665,NP_001186.1.csv,refseq-BFSP1-NM_001195.4_clinical_seed_0_final,refseq-BFSP1-NM_001195.4.a2m,Invitae,refseq-BFSP1-NM_001195.4.npy,1,665,665
+NP_001186036.1,MDSPGYNCFVDKDKMDAAIQDLGPKELSCTELQELKQLARQGYWAQSHALRGKVYQRLIRDIPCRTVTPDASVYSDIVGKIVGKHSSSCLPLPEFVDNTQVPSYCLNARGEGAVRKILLCLANQFPDISFCPALPAVVALLLHYSIDEAECFEKACRILACNDPGRRLIDQSFLAFESSCMTFGDLVNKYCQAAHKLMVAVSEDVLQVYADWQRWLFGELPLCYFARVFDVFLVEGYKVLYRVALAILKFFHKVRAGQPLESDSVKQDIRTFVRDIAKTVSPEKLLEKAFAIRLFSRKEIQLLQMANEKALKQKGITVKQKSVSLSKRQFVHLAVHAENFRSEIVSVREMRDIWSWVPERFALCQPLLLFSSLQHGYSLARFYFQCEGHEPTLLLIKTTQKEVCGAYLSTDWSERNKFGGKLGFFGTGECFVFRLQPEVQRYEWVVIKHPELTKPPPLMAAEPTAPLSHSASSDPADRLSPFLAARHFNLPSKTESMFMAGGSDCLIVGGGGGQALYIDGDLNRGRTSHCDTFNNQPLCSENFLIAAVEAWGFQDPDTQ,559,NP_001186036.1.csv,refseq-TBC1D24-NM_001199107.1_clinical_seed_0_final,refseq-TBC1D24-NM_001199107.1.a2m,Invitae,refseq-TBC1D24-NM_001199107.1.npy,1,559,559
+NP_001186136.1,MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCLSAAQHRAATSQLEGRVVRRVLTVASRALCPTGGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSEKWDAFIKETEDINTLRECVQILFNSRYAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPFKRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYAKAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESLKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQLPGPLNY,976,NP_001186136.1.csv,NP_001186136.1_colabfold_clinical_seed_0_final,NP_001186136.1_colabfold.a2m,colabfold,NP_001186136.1_colabfold_theta_0.2.npy,1,976,976
+NP_001186181.1,MAKERCLKKSFQDSLEDIKKRMKEKRNKNLAEIGKRRSFIAAPCQIITNTSTLLKNYQDNNKMLVLALENEKSKVKEAQDIILQLRKECYYLTCQLYALKGKLTSQQTVEPAQNQEICSSGMDPNSDDSSRNLFVKDLPQIPLEETELPGQGESFQIEDQIPTIPQDTLGVDFDSGEAKSTDNVLPRTVSVRSSLKKHCNSICQFDSLDDFETSHLAGKSFEFERVGFLDPLVNMHIPENVQHNACQWSKDQVNLSPKLIQPGTFTKTKEDILESKSEQTKSKQRDTQERKREEKRKANRRKSKRMSKYKENKSENKKTVPQKKMHKSVSSNDAYNFNLEEGVHLTPFRQKVSNDSNREENNESEVSLCESSGSGDDSDDLYLPTCKYIQNPTSNSDRPVTRPLAKRALKYTDEKETEGSKPTKTPTTTPPETQQSPHLSLKDITNVSLYPVVKIRRLSLSPKKNKASPAVALPKRRCTASVNYKEPTLASKLRRGDPFTDLCFLNSPIFKQKKDLRRSKKRALEVSPAKEAIFILYYVREFVSRFPDCRKCKLETHICLR,561,NP_001186181.1.csv,refseq-SGO1-NM_001199252.2_clinical_seed_0_final,refseq-SGO1-NM_001199252.2.a2m,Invitae,refseq-SGO1-NM_001199252.2.npy,1,561,561
+NP_001186728.1,MAWPKLPAPWLLLCTWLPAGCLSLLVTVQHTERYVTLFASIILKCDYTTSAQLQDVVVTWRFKSFCKDPIFDYYSASYQAALSLGQDPSNDCNDNQREVRIVAQRRGQNEPVLGVDYRQRKITIQNRADLVINEVMWWDHGVYYCTIEAPGDTSGDPDKEVKLIVLHWLTVIFIILGALLLLLLIGVCWCQCCPQYCCCYIRCPCCPAHCCCPEEALARHRYMKQAQALGPQMMGKPLYWGADRSSQVSSYPMHPLLQRDLSLPSSLPQMPMTQTTNQPPIANGVLEYLEKELRNLNLAQPLPPDLKGRFGHPCSMLSSLGSEVVERRIIHLPPLIRDLSSSRRTSDSLHQQWLTPIPSRPWDLREGRSHHHYPDFHQELQDRGPKSWALERRELDPSWSGRHRSSRLNGSPIHWSDRDSLSDVPSSSEARWRPSHPPFRSRCQERPRRPSPRESTQRHGRRRRHRSYSPPLPSGLSSWSSEEDKERQPQSWRAHRRGSHSPHWPEEKPPSYRSLDITPGKNSRKKGSVERRSEKDSSHSGRSVVI,546,NP_001186728.1.csv,refseq-ILDR1-NM_001199799.1_clinical_seed_0_final,refseq-ILDR1-NM_001199799.1.a2m,Invitae,refseq-ILDR1-NM_001199799.1.npy,1,546,546
+NP_001186846.1,MGGEWGQSAICPESAQEWTYQVGQHLVDMDLGAITKYSALHAKPNGLILQYGTAGFRTKAEHLDHVMFRMGLLAVLRSKQTKSTIGVMVTASHNPEEDNGVKLVDPLGEMLAPSWEEHATCLANAEEQDMQRVLIDISEKEAVNLQQDAFVVIGRDTRPSSEKLSQSVIDGVTVLGGQFHDYGLLTTPQLHYMVYCRNTGGRYGKATIEGYYQKLSKAFVELTKQASCSGDEYRSLKVDCANGIGALKLREMEHYFSQGLSVQLFNDGSKGKLNHLCGADFVKSHQKPPQGMEIKSNERCCSFDGDADRIVYYYHDADGHFHLIDGDKIATLISSFLKELLVEIGESLNIGVVQTAYANGSSTRYLEEVMKVPVYCTKTGVKHLHHKAQEFDIGVYFEANGHGTALFSTAVEMKIKQSAEQLEDKKRKAAKMLENIIDLFNQAAGDAISDMLVIEAILALKGLTVQQWDALYTDLPNRQLKVQVADRRVISTTDAERQAVTPPGLQEAINDLVKKYKLSRAFVRPSGTEDVVRVYAEADSQESADHLAHEVSLAVFQLAGGIGERPQPGF,570,NP_001186846.1.csv,refseq-PGM3-NM_001199917.1_clinical_seed_0_final,refseq-PGM3-NM_001199917.1.a2m,Invitae,refseq-PGM3-NM_001199917.1.npy,1,570,570
+NP_001188356.1,MAPRKRSHHGLGFLCCFGGSDIPEINLRDNHPLQFMEFSSPIPNAEELNIRFAELVDELDLTDKNREAMFALPPEKKWQIYCSKKKEQEDPNKLATSWPDYYIDRINSMAAMQSLYAFDEEETEMRNQVVEDLKTALRTQPMRFVTRFIELEGLTCLLNFLRSMDHATCESRIHTSLIGCIKALMNNSQGRAHVLAQPEAISTIAQSLRTENSKTKVAVLEILGAVCLVPGGHKKVLQAMLHYQVYAAERTRFQTLLNELDRSLGRYRDEVNLKTAIMSFINAVLNAGAGEDNLEFRLHLRYEFLMLGIQPVIDKLRQHENAILDKHLDFFEMVRNEDDLELARRFDMVHIDTKSASQMFELIHKKLKYTEAYPCLLSVLHHCLQMPYKRNGGYFQQWQLLDRILQQIVLQDERGVDPDLAPLENFNVKNIVNMLINENEVKQWRDQAEKFRKEHMELVSRLERKERECETKTLEKEEMMRTLNKMKDKLARESQELRQARGQVAELVAQLSELSTGPVSSPPPPGGPLTLSSSMTTNDLPPPPPPLPFACCPPPPPPPLPPGGPPTPPGAPPCLGMGLPLPQDPYPSSDVPLRKKRVPQPSHPLKSFNWVKLNEERVPGTVWNEIDDMQVFRILDLEDFEKMFSAYQRHQKELGSTEDIYLASRKVKELSVIDGRRAQNCIILLSKLKLSNEEIRQAILKMDEQEDLAKDMLEQLLKFIPEKSDIDLLEEHKHEIERMARADRFLYEMSRIDHYQQRLQALFFKKKFQERLAEAKPKVEAILLASRELVRSKRLRQMLEVILAIGNFMNKGQRGGAYGFRVASLNKIADTKSSIDRNISLLHYLIMILEKHFPDILNMPSELQHLPEAAKVNLAELEKEVGNLRRGLRAVEVELEYQRRQVREPSDKFVPVMSDFITVSSFSFSELEDQLNEARDKFAKALMHFGEHDSKMQPDEFFGIFDTFLQAFSEARQDLEAMRRRKEEEERRARMEAMLKEQRERERWQRQRKVLAAGSSLEEGGEFDDLVSALRSGEVFDKDLCKLKRSRKRSGSQALEVTRERAINRLNY,1068,NP_001188356.1.csv,refseq-DAAM2-NM_001201427.1_clinical_seed_0_final,refseq-DAAM2-NM_001201427.1.a2m,Invitae,refseq-DAAM2-NM_001201427.1_theta_0.2.npy,1,1068,1068
+NP_001189333.2,MWRLPRALCVHAAKTSKLSGPWSRPAAFMSTLLINQPQYAWLKELGLREENEGVYNGSWGGRGEVITTYCPANNEPIARVRQASVADYEETVKKAREAWKIWADIPAPKRGEIVRQIGDALREKIQVLGSLVSLEMGKILVEGVGEVQEYVDICDYAVGLSRMIGGPILPSERSGHALIEQWNPVGLVGIITAFNFPVAVYGWNNAIAMICGNVCLWKGAPTTSLISVAVTKIIAKVLEDNKLPGAICSLTCGGADIGTAMAKDERVNLLSFTGSTQVGKQVGLMVQERFGRSLLELGGNNAIIAFEDADLSLVVPSALFAAVGTAGQRCTTARRLVMDRPGNYVEPTIVTGLGHDASIAHTETFAPILYVFKFKNEEEVFAWNNEVKQGLSSSIFTKDLGRIFRWLGPKGSDCGIVNVNIPTSGAEIGGAFGGEKHTGGGRESGSDAWKQYMRRSTCTINYSKDLPLAQGIKFQ,475,NP_001189333.2.csv,refseq-ALDH7A1-NM_001202404.2_clinical_seed_0_final,refseq-ALDH7A1-NM_001202404.2.a2m,Invitae,refseq-ALDH7A1-NM_001202404.2.npy,1,475,475
+NP_001189472.1,MAANVGSMFQYWKRFDLQQLQRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLNVYKRLIDVPDPVPALDLGQQLQLKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKNQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKVQSLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLSAKNSTLKQLEEKLKGQADYEEVKKELNILKSMEFAPSEGAGTQDAAKPLEVLLLEKNRSLQSENAALRISNSDLSGSARRKGKDQPESRRPGSLPAPPPSQLPRNPGEQASNTNGTHQFSPAGLSQDFFSSSLASPSLPLASTGKFALNSLLQRQLMQSFYSKAMQEAGSTSMIFSTGPYSTNSISSQSPLQQSPDVNGMAPSPSQSESAGSVSEGEEMDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQFLSDEQNILALRSIQGRQRENPGQSLNRLFQEVPKRRNGSEGNITTRIRASETGSDEAIKSILEQAKRELQVQKTAEPAQPSSASGSGNSDDAIRSILQQARREMEAQQAALDPALKQAPLSQSDITILTPKLLSTSPMPTVSSYPPLAISLKKPSAAPEAGASALPNPPALKKEAQDAPGLDPQGAADCAQGVLRQVKNEVGRSGAWKDHWWSAVQPERRNAASSEEAKAEETGGGKEKGSGGSGGGSQPRAERSQLQGPSSSEYWKEWPSAESPYSQSSELSLTGASRSETPQNSPLPSSPIVPMSKPTKPSVPPLTPEQYEVYMYQEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWLNGELGQGVLPVQGQQQGPVLHSVTSLQDPLQQGCVSSESTPKTSASCSPAPESPMSSSESVKSLTELVQQPCPPIEASKDSKPPEPSDPPASDSQPTTPLPLSGHSALSIQELVAMSPELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVRMQLWLNDPNNVEKLMDMKRMEKKAYMKRRHSSVSDSQPCEPPSVGTEYSQGASPQPQHQLKKPRVVLAPEEKEALKRAYQQKPYPSPKTIEDLATQLNLKTSTVINWFHNYRSRIRRELFIEEIQAGSQGQAGASDSPSARSGRAAPSSEGDSCDGVEATEGPGSADTEEPKSQGEAEREEVPRPAEQTEPPPSGTPGPDDARDDDHEGGPVEGPGPLPSPASATATAAPAAPEDAATSAAAAPGEGPAAPSSAPPPSNSSSSSAPRRPSSLQSLFGLPEAAGARDSRDNPLRKKKAANLNSIIHRLEKAASREEPIEWEF,1516,NP_001189472.1.csv,refseq-CUX1-NM_001202543.1_clinical_seed_0_final,refseq-CUX1-NM_001202543.1.a2m,Invitae,refseq-CUX1-NM_001202543.1.npy,1,1516,1516
+NP_001191197.1,MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCKCSFNQKPRNRAPESRELKCSHQVEGGSPKGDVDPFYYDYETVRNGGLIFAGLAFIVGLLILLSRRFRCGGNKKRRQINEDEP,144,NP_001191197.1.csv,NP_001191197.1_clinical_seed_0_final,NP_001191197.1.a2m,popEVE,NP_001191197.1_theta_0.2.npy,1,144,144
+NP_001191244.1,MKENVASATVFTLLLFLNTCLLNGQLPPGKPEIFKCRSPNKETFTCWWRPGTDGGLPTNYSLTYHREGETLMHECPDYITGGPNSCHFGKQYTSMWRTYIMMVNATNQMGSSFSDELYVDVTYIVQPDPPLELAVEVKQPEDRKPYLWIKWSPPTLIDLKTGWFTLLYEIRLKPEKAAEWEIHFAGQQTEFKILSLHPGQKYLVQVRCKPDHGYWSAWSPATFIQIPSDFTMNDTTVWISVAVLSAVICLIIVWAVALKGYSMVTCIFPPVPGPKIKGFDAHLLEKGKSEELLSALGCQDFPPTSDYEDLLVEYLEVDDSEDQHLMSVHSKEHPSQEREQRQAQEARDS,349,NP_001191244.1.csv,refseq-PRLR-NM_001204315.1_clinical_seed_0_final,refseq-PRLR-NM_001204315.1.a2m,Invitae,refseq-PRLR-NM_001204315.1_theta_0.2.npy,1,349,349
+NP_001191354.2,MPNVAETERSNDSGNGEHKSERKSPEENLQGAVKSFCTSASGAPLGPKGDGHYPWSCPVTHTREKIYAICSDYAFLNQATSIYKTPNPSRSPCLPDSTSLSAGNNSSRYIGIPTSTSEIIYNEENSLENLSNSLGKLPLAWEIDKSEFDGVTTNSKHKSGNAKKQVSKRKTSDKKGRYQKECPQHSPLEDIKQRKVLDLRRWYCISRPQYKTSCGISSLISCWNFLYSTMGAGNLPPITQEEALHILGFQPPFEDIRFGPFTGNTTLMRWFRQINDHFHVKGCSYVLYKPHGKNKTAGETASGALSKLTRGLKDESLAYIYHCQNHYFCPIGFEATPVKANKAFSRGPLSPQEVEYWILIGESSRKHPAIHCKKWADIVTDLNTQNPEYLDIRHLERGLQYRKTKKVGGNLHCIIAFQRLNWQRFGLWNFPFGTIRQESQPPTHAQGIAKSESEDNISKKQHGRLGRSFSASFHQDSAWKKMSNISIWLNQALKGVRDRHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDSRKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPLQEEEKHSSEEEDEKEWQERMNQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTLFEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIRRQYEDEGGFLKEVESRRVVSEDTSHYILIKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEKLKTEKEPDATPPSPRTLLAMQAALLGSSSEEELESENRRQARGRNAPAAVDEGSISPRTLSAIKRALDDDEDVKVCAGDDVQTGGPGAEEMRINSSTENSDEGLKVRDGKGIPFTATLASSSVNSAEEHVASTNEGREPTDSVPKEQMSLVHVGTEAFPISDESMIKDRKDRLPLESAVVRHSDAPGLPNGRELTPASPTCTNSVSKNETHAEVLEQQNELCPYESKFDSSLLSSDDETKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTTESAGQDLISIPKAVEPMEIDSEESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGEEELVGTREGEAPAESESLLRDNSERDDVDGEPQEAEKDAEDSLHEWQDINLEELETLESNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILDLTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHEAQKNPKIRPNPHDTKVKKKLRTLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTKTDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAAASEIEAVSVAMEKEFELLDKAKGKTQKRGITNTLEESSSLKRKRLSDSKGKNTCGGFLGETCLSESSDGSSSEDAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTSSSSDSDDDGGKEKMVLVTARSVFGKKRRKLRRARGRKRKT,1640,NP_001191354.2.csv,NP_001191354.2_clinical_seed_0_final,NP_001191354.2.a2m,popEVE,NP_001191354.2_theta_0.2.npy,1,1640,1640
+NP_001193.2,MIPGNRMLMVVLLCQVLLGGASHASLIPETGKKKVAEIQGHAGGRRSGQSHELLRDFEATLLQMFGLRRRPQPSKSAVIPDYMRDLYRLQSGEEEEEQIHSTGLEYPERPASRANTVRSFHHEEHLENIPGTSENSAFRFLFNLSSIPENEVISSAELRLFREQVDQGPDWERGFHRINIYEVMKPPAEVVPGHLITRLLDTRLVHHNVTRWETFDVSPAVLRWTREKQPNYGLAIEVTHLHQTRTHQGQHVRISRSLPQGSGNWAQLRPLLVTFGHDGRGHALTRRRRAKRSPKHHSQRARKKNKNCRRHSLYVDFSDVGWNDWIVAPPGYQAFYCHGDCPFPLADHLNSTNHAIVQTLVNSVNSSIPKACCVPTELSAISMLYLDEYDKVVLKNYQEMVVEGCGCR,408,NP_001193.2.csv,refseq-BMP4-NM_001202.3_clinical_seed_0_final,refseq-BMP4-NM_001202.3.a2m,Invitae,refseq-BMP4-NM_001202.3.npy,1,408,408
+NP_001193928.1,MLKFKYGARNPLDAGAAEPIASRASRLNLFFQGKPPFMTQQQMSPLSREGILDALFVLFEECSQPALMKIKHVSNFVRKYSDTIAELQELQPSAKDFEVRSLVGCGHFAEVQVVREKATGDIYAMKVMKKKALLAQEQVSFFEEERNILSRSTSPWIPQLQYAFQDKNHLYLVMEYQPGGDLLSLLNRYEDQLDENLIQFYLAELILAVHSVHLMGYVHRDIKPENILVDRTGHIKLVDFGSAAKMNSNKMVNAKLPIGTPDYMAPEVLTVMNGDGKGTYGLDCDWWSVGVIAYEMIYGRSPFAEGTSARTFNNIMNFQRFLKFPDDPKVSSDFLDLIQSLLCGQKERLKFEGLCCHPFFSKIDWNNIRNSPPPFVPTLKSDDDTSNFDEPEKNSWVSSSPCQLSPSGFSGEELPFVGFSYSKALGILGRSESVVSGLDSPAKTSSMEKKLLIKSKELQDSQDKCHKMEQEMTRLHRRVSEVEAVLSQKEVELKASETQRSLLEQDLATYITECSSLKRSLEQARMEVSQEDDKALQLLHDIREQSRKLQEIKEQEYQAQVEEMRLMMNQLEEDLVSARRRSDLYESELRESRLAAEEFKRKATECQHKLLKAKDQGKPEVGEYAKLEKINAEQQLKIQELQEKLEKAVKASTEATELLQNIRQAKERAERELEKLQNREDSSEGIRKKLVEAEERRHSLENKVKRLETMERRENRLKDDIQTKSQQIQQMADKILELEEKHREAQVSAQHLEVHLKQKEQHYEEKIKVLDNQIKKDLADKETLENMMQRHEEEAHEKGKILSEQKAMINAMDSKIRSLEQRIVELSEANKLAANSSLFTQRNMKAQEEMISELRQQKFYLETQAGKLEAQNRKLEEQLEKISHQDHSDKNRLLELETRLREVSLEHEEQKLELKRQLTELQLSLQERESQLTALQAARAALESQLRQAKTELEETTAEAEEEIQALTAHRDEIQRKFDALRNSCTVITDLEEQLNQLTEDNAELNNQNFYLSKQLDEASGANDEIVQLRSEVDHLRREITEREMQLTSQKQTMEALKTTCTMLEEQVMDLEALNDELLEKERQWEAWRSVLGDEKSQFECRVRELQRMLDTEKQSRARADQRITESRQVVELAVKEHKAEILALQQALKEQKLKAESLSDKLNDLEKKHAMLEMNARSLQQKLETERELKQRLLEEQAKLQQQMDLQKNHIFRLTQGLQEALDRADLLKTERSDLEYQLENIQVLYSHEKVKMEGTISQQTKLIDFLQAKMDQPAKKKKGLFSRRKEDPALPTQVPLQYNELKLALEKEKARCAELEEALQKTRIELRSAREEAAHRKATDHPHPSTPATARQQIAMSAIVRSPEHQPSAMSLLAPPSSRRKESSTPEEFSRRLKERMHHNIPHRFNVGLNMRATKCAVCLDTVHFGRQASKCLECQVMCHPKCSTCLPATCGLPAEYATHFTEAFCRDKMNSPGLQTKEPSSSLHLEGWMKVPRNNKRGQQGWDRKYIVLEGSKVLIYDNEAREAGQRPVEEFELCLPDGDVSIHGAVGASELANTAKADVPYILKMESHPHTTCWPGRTLYLLAPSFPDKQRWVTALESVVAGGRVSREKAEADAKLLGNSLLKLEGDDRLDMNCTLPFSDQVVLVGTEEGLYALNVLKNSLTHVPGIGAVFQIYIIKDLEKLLMIAGEERALCLVDVKKVKQSLAQSHLPAQPDISPNIFEAVKGCHLFGAGKIENGLCICAAMPSKVVILRYNENLSKYCIRKEIETSEPCSCIHFTNYSILIGTNKFYEIDMKQYTLEEFLDKNDHSLAPAVFAASSNSFPVSIVQVNSAGQREEYLLCFHEFGVFVDSYGRRSRTDDLKWSRLPLAFAYREPYLFVTHFNSLEVIEIQARSSAGTPARAYLDIPNPRYLGPAISSGAIYLASSYQDKLRVICCKGNLVKESGTEHHRGPSTSRSSPNKRGPPTYNEHITKRVASSPAPPEGPSHPREPSTPHRYREGRTELRRDKSPGRPLEREKSPGRMLSTRRERSPGRLFEDSSRGRLPAGAVRTPLSQVNKVWDQSSV,2069,NP_001193928.1.csv,refseq-CIT-NM_001206999.1_clinical_seed_0_final,refseq-CIT-NM_001206999.1.a2m,Invitae,refseq-CIT-NM_001206999.1.npy,1,2069,2069
+NP_001194.1,MLLRSAGKLNVGTKKEDGESTAPTPRPKVLRCKCHHHCPEDSVNNICSTDGYCFTMIEEDDSGLPVVTSGCLGLEGSDFQCRDTPIPHQRRSIECCTERNECNKDLHPTLPPLKNRDFVDGPIHHRALLISVTVCSLLLVLIILFCYFRYKRQETRPRYSIGLEQDETYIPPGESLRDLIEQSQSSGSGSGLPLLVQRTIAKQIQMVKQIGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETEIYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDYLKSTTLDAKSMLKLAYSSVSGLCHLHTEIFSTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVKFISDTNEVDIPPNTRVGTKRYMPPEVLDESLNRNHFQSYIMADMYSFGLILWEVARRCVSGGIVEEYQLPYHDLVPSDPSYEDMREIVCIKKLRPSFPNRWSSDECLRQMGKLMTECWAHNPASRLTALRVKKTLAKMSESQDIKL,502,NP_001194.1.csv,refseq-BMPR1B-NM_001203.2_clinical_seed_0_final,refseq-BMPR1B-NM_001203.2.a2m,Invitae,refseq-BMPR1B-NM_001203.2.npy,1,502,502
+NP_001195.2,MTSSLQRPWRVPWLPWTILLVSTAAASQNQERLCAFKDPYQQDLGIGESRISHENGTILCSKGSTCYGLWEKSKGDINLVKQGCWSHIGDPQECHYEECVVTTTPPSIQNGTYRFCCCSTDLCNVNFTENFPPPDTTPLSPPHSFNRDETIIIALASVSVLAVLIVALCFGYRMLTGDRKQGLHSMNMMEAAASEPSLDLDNLKLLELIGRGRYGAVYKGSLDERPVAVKVFSFANRQNFINEKNIYRVPLMEHDNIARFIVGDERVTADGRMEYLLVMEYYPNGSLCKYLSLHTSDWVSSCRLAHSVTRGLAYLHTELPRGDHYKPAISHRDLNSRNVLVKNDGTCVISDFGLSMRLTGNRLVRPGEEDNAAISEVGTIRYMAPEVLEGAVNLRDCESALKQVDMYALGLIYWEIFMRCTDLFPGESVPEYQMAFQTEVGNHPTFEDMQVLVSREKQRPKFPEAWKENSLAVRSLKETIEDCWDQDAEARLTAQCAEERMAELMMIWERNKSVSPTVNPMSTAMQNERNLSHNRRVPKIGPYPDYSSSSYIEDSIHHTDSIVKNISSEHSMSSTPLTIGEKNRNSINYERQQAQARIPSPETSVTSLSTNTTTTNTTGLTPSTGMTTISEMPYPDETNLHTTNVAQSIGPTPVCLQLTEEDLETNKLDPKEVDKNLKESSDENLMEHSLKQFSGPDPLSSTSSSLLYPLIKLAVEATGQQDFTQTANGQACLIPDVLPTQIYPLPKQQNLPKRPTSLPLNTKNSTKEPRLKFGSKHKSNLKQVETGVAKMNTINAAEPHVVTVTMNGVAGRNHSVNSHAATTQYANGTVLSGQTTNIVTHRAQEMLQNQFIGEDTRLNINSSPDEHEPLLRREQQAGHDEGVLDRLVDRRERPLEGGRTNSNNNNSNPCSEQDVLAQGVPSTAADPGPSKPRRAQRPNSLDLSATNVLDGSSIQIGESTQDGKSGSGEKIKKRVKTPYSLKRWRPSTWVISTESLDCEVNNNGSNRAVHSKSSTAVYLAEGGTATTMVSKDIGMNCL,1038,NP_001195.2.csv,refseq-BMPR2-NM_001204.6_clinical_seed_0_final,refseq-BMPR2-NM_001204.6.a2m,Invitae,refseq-BMPR2-NM_001204.6.npy,1,1038,1038
+NP_001202.5,MAAVKKEGGALSEAMSLEGDEWELSKENVQPLRQGRIMSTLQGALAQESACNNTLQQQKRAFEYEIRFYTGNDPLDVWDRYISWTEQNYPQGGKESNMSTLLERAVEALQGEKRYYSDPRFLNLWLKLGRLCNEPLDMYSYLHNQGIGVSLAQFYISWAEEYEARENFRKADAIFQEGIQQKAEPLERLQSQHRQFQARVSRQTLLALEKEEEEEVFESSVPQRSTLAELKSKGKKTARAPIIRVGGALKAPSQNRGLQNPFPQQMQNNSRITVFDENADEASTAELSKPTVQPWIAPPMPRAKENELQAGPWNTGRSLEHRPRGNTASLIAVPAVLPSFTPYVEETARQPVMTPCKIEPSINHILSTRKPGKEEGDPLQRVQSHQQASEEKKEKMMYCKEKIYAGVGEFSFEEIRAEVFRKKLKEQREAELLTSAEKRAEMQKQIEEMEKKLKEIQTTQQERTGDQQEETMPTKETTKLQIASESQKIPGMTLSSSVCQVNCCARETSLAENIWQEQPHSKGPSVPFSIFDEFLLSEKKNKSPPADPPRVLAQRRPLAVLKTSESITSNEDVSPDVCDEFTGIEPLSEDAIITGFRNVTICPNPEDTCDFARAARFVSTPFHEIMSLKDLPSDPERLLPEEDLDVKTSEDQQTACGTIYSQTLSIKKLSPIIEDSREATHSSGFSGSSASVASTSSIKCLQIPEKLELTNETSENPTQSPWCSQYRRQLLKSLPELSASAELCIEDRPMPKLEIEKEIELGNEDYCIKREYLICEDYKLFWVAPRNSAELTVIKVSSQPVPWDFYINLKLKERLNEDFDHFCSCYQYQDGCIVWHQYINCFTLQDLLQHSEYITHEITVLIIYNLLTIVEMLHKAEIVHGDLSPRCLILRNRIHDPYDCNKNNQALKIVDFSYSVDLRVQLDVFTLSGFRTVQILEGQKILANCSSPYQVDLFGIADLAHLLLFKEHLQVFWDGSFWKLSQNISELKDGELWNKFFVRILNANDEATVSVLGELAAEMNGVFDTTFQSHLNKALWKVGKLTSPGALLFQ,1050,NP_001202.5.csv,refseq-BUB1B-NM_001211.6_clinical_seed_0_final,refseq-BUB1B-NM_001211.6.a2m,Invitae,refseq-BUB1B-NM_001211.6.npy,1,1050,1050
+NP_001209.1,MPRRSLHAAAVLLLVILKEQPSSPAPVNGSKWTYFGPDGENSWSKKYPSCGGLLQSPIDLHSDILQYDASLTPLEFQGYNLSANKQFLLTNNGHSVKLNLPSDMHIQGLQSRYSATQLHLHWGNPNDPHGSEHTVSGQHFAAELHIVHYNSDLYPDASTASNKSEGLAVLAVLIEMGSFNPSYDKIFSHLQHVKYKGQEAFVPGFNIEELLPERTAEYYRYRGSLTTPPCNPTVLWTVFRNPVQISQEQLLALETALYCTHMDDPSPREMINNFRQVQKFDERLVYTSFSQVQVCTAAGLSLGIILSLALAGILGICIVVVVSIWLFRRKSIKKGDNKGVIYKPATKMETEAHA,354,NP_001209.1.csv,refseq-CA12-NM_001218.4_clinical_seed_0_final,refseq-CA12-NM_001218.4.a2m,Invitae,refseq-CA12-NM_001218.4.npy,1,354,354
+NP_001211.3,MATTVTCTRFTDEYQLYEDIGKGAFSVVRRCVKLCTGHEYAAKIINTKKLSARDHQKLEREARICRLLKHSNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQILEAVLHCHQMGVVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGYLSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDTVTPEAKNLINQMLTINPAKRITAHEALKHPWVCQRSTVASMMHRQETVECLKKFNARRKLKGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTKNSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKAPRVPDILSSVRRGSGAPEAEGPLPCPSPAPFSPLPAPSPRISDILNSVRRGSGTPEAEGPLSAGPPPCLSPALLGPLSSPSPRISDILNSVRRGSGTPEAEGPSPVGPPPCPSPTIPGPLPTPSRKQEIIKTTEQLIEAVNNGDFEAYAKICDPGLTSFEPEALGNLVEGMDFHRFYFENLLAKNSKPIHTTILNPHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAPLQ,666,NP_001211.3.csv,refseq-CAMK2B-NM_001220.4_clinical_seed_0_final,refseq-CAMK2B-NM_001220.4.a2m,Invitae,refseq-CAMK2B-NM_001220.4.npy,1,666,666
+NP_001219.2,MDFSRNLYDIGEQLDSEDLASLKFLSLDYIPQRKQEPIKDALMLFQRLQEKRMLEESNLSFLKELLFRINRLDLLITYLNTRKEEMERELQTPGRAQISAYRFHFCRMSWAEANSQCQTQSVPFWRRVDHLLIRVMLYQISEEVSRSELRSFKFLLQEEISKCKLDDDMNLLDIFIEMEKRVILGEGKLDILKRVCAQINKSLLKIINDYEEFSKGEELCGVMTISDSPREQDSESQTLDKVYQMKSKPRGYCLIINNHNFAKAREKVPKLHSIRDRNGTHLDAGALTTTFEELHFEIKPHDDCTVEQIYEILKIYQLMDHSNMDCFICCILSHGDKGIIYGTDGQEAPIYELTSQFTGLKCPSLAGKPKVFFIQACQGDNYQKGIPVETDSEEQPYLEMDLSSPQTRYIPDEADFLLGMATVNNCVSYRNPAEGTWYIQSLCQSLRERCPRGDDILTILTEVNYEVSNKDDKKNMGKQMPQPTFTLRKKLVFPSD,496,NP_001219.2.csv,refseq-CASP8-NM_001228.4_clinical_seed_0_final,refseq-CASP8-NM_001228.4.a2m,Invitae,refseq-CASP8-NM_001228.4.npy,1,496,496
+NP_001222.3,MSATDRMGPRAVPGLRLALLLLLVLGTPKSGVQGQEGLDFPEYDGVDRVINVNAKNYKNVFKKYEVLALLYHEPPEDDKASQRQFEMEELILELAAQVLEDKGVGFGLVDSEKDAAVAKKLGLTEVDSMYVFKGDEVIEYDGEFSADTIVEFLLDVLEDPVELIEGERELQAFENIEDEIKLIGYFKSKDSEHYKAFEDAAEEFHPYIPFFATFDSKVAKKLTLKLNEIDFYEAFMEEPVTIPDKPNSEEEIVNFVEEHRRSTLRKLKPESMYETWEDDMDGIHIVAFAEEADPDGFEFLETLKAVAQDNTENPDLSIIWIDPDDFPLLVPYWEKTFDIDLSAPQIGVVNVTDADSVWMEMDDEEDLPSAEELEDWLEDVLEGEINTEDDDDDDDD,396,NP_001222.3.csv,refseq-CASQ1-NM_001231.4_clinical_seed_0_final,refseq-CASQ1-NM_001231.4.a2m,Invitae,refseq-CASQ1-NM_001231.4.npy,1,396,396
+NP_001223.2,MKRTHLFIVGIYFLSSCRAEEGLNFPTYDGKDRVVSLSEKNFKQVLKKYDLLCLYYHEPVSSDKVTQKQFQLKEIVLELVAQVLEHKAIGFVMVDAKKEAKLAKKLGFDEEGSLYILKGDRTIEFDGEFAADVLVEFLLDLIEDPVEIISSKLEVQAFERIEDYIKLIGFFKSEDSEYYKAFEEAAEHFQPYIKFFATFDKGVAKKLSLKMNEVDFYEPFMDEPIAIPNKPYTEEELVEFVKEHQRPTLRRLRPEEMFETWEDDLNGIHIVAFAEKSDPDGYEFLEILKQVARDNTDNPDLSILWIDPDDFPLLVAYWEKTFKIDLFRPQIGVVNVTDADSVWMEIPDDDDLPTAEELEDWIEDVLSGKINTEDDDEDDDDDDNSDEEDNDDSDDDDDE,399,NP_001223.2.csv,refseq-CASQ2-NM_001232.4_clinical_seed_0_final,refseq-CASQ2-NM_001232.4.a2m,Invitae,refseq-CASQ2-NM_001232.4_theta_0.2.npy,1,399,399
+NP_001229804.1,MVAPVLETSHVFCCPNRVRGVLNWSSGPRGLLAFGTSCSVVLYDPLKRVVVTNLNGHTARVNCIQWICKQDGSPSTELVSGGSDNQVIHWEIEDNQLLKAVHLQGHEGPVYAVHAVYQRRTSDPALCTLIVSAAADSAVRLWSKKGPEVMCLQTLNFGNGFALALCLSFLPNTDVTWKTGQVERGRAWKPPASLALCSRSCDSMVSCYASILCKALWKEKLHTFWHHNRISFLPSAFRPIPILACGNDDCRIHIFAQQNDQFQKVLSLCGHEDWIRGVEWAAFGRDLFLASCSQDCLIRIWKLYIKSTSLETQDDDNIRLKENTFTIENESVKIAFAVTLETVLAGHENWVNAVHWQPVFYKDGVLQQPVRLLSASMDKTMILWAPDEESGVWLEQVRVGEVGGNTLGFYDCQFNEDGSMIIAHAFHGALHLWKQNTVNPREWTPEIVISGHFDGVQDLVWDPEGEFIITVGTDQTTRLFAPWKRKDQSQVTWHEIARPQIHGYDLKCLAMINRFQFVSGADEKVLRVFSAPRNFVENFCAITGQSLNHVLCNQDSDLPEGATVPALGLSNKAVFQGDIASQPSDEEELLTSTGFEYQQVAFQPSILTEPPTEDHLLQNTLWPEVQKLYGHGYEIFCVTCNSSKTLLASACKAAKKEHAAIILWNTTSWKQVQNLVFHSLTVTQMAFSPNEKFLLAVSRDRTWSLWKKQDTISPEFEPVFSLFAFTNKITSVHSRIIWSCDWSPDSKYFFTGSRDKKVVVWGECDSTDDCIEHNIGPCSSVLDVGGAVTAVSVCPVLHPSQRYVVAVGLECGKICLYTWKKTDQVPEINDWTHCVETSQSQSHTLAIRKLCWKNCSGKTEQKEAEGAEWLHFASCGEDHTVKIHRVNKCAL,891,NP_001229804.1.csv,refseq-ELP2-NM_001242875.2_clinical_seed_0_final,refseq-ELP2-NM_001242875.2.a2m,Invitae,refseq-ELP2-NM_001242875.2.npy,1,891,891
+NP_001229817.2,MNASEFRRRGKEMVDYMANYMEGIEGRQVYPDVEPGYLRPLIPAAAPQEPDTFEDIINDVEKIIMPGGSASEATLVALLAARTKVIHRLQAASPELTQAAIMEKLVAYSSDQAHSSVERAGLIGGVKLKAIPSDGNFAMRASALQEALERDKAAGLIPFFMVATLGTTTCCSFDNLLEVGPICNKEDIWLHVDAAYAGSAFICPEFRHLLNGVEFADSFNFNPHKWLLVNFDCSAMWVKKRTDLTGAFRLDPTYLKHSHQDSGLITDYRHWQIPLGRRFRSLKMWFVFRMYGVKGLQAYIRKHVQLSHEFESLVRQDPRFEICVEVILGLVCFRLKGSNKVNEALLQRINSAKKIHLVPCHLRDKFVLRFAICSRTVESAHVQRAWEHIKELAADVLRAERE,402,NP_001229817.2.csv,refseq-DDC-NM_001242888.2_clinical_seed_0_final,refseq-DDC-NM_001242888.2.a2m,Invitae,refseq-DDC-NM_001242888.2.npy,1,402,402
+NP_001229825.1,MRTTKVYKLVIHKKGFGGSDDELVVNPKVFPHIKLGDIVEIAHPNDEYSPLLLQVKSLKEDLQKETISVDQTVTQVFRLRPYQDVYVNVVDPKDVTLDLVELTFKDQYIGRGDMWRLKKSLVSTCAYITQKVEFAGIRAQAGELWVKNEKVMCGYISEDTRVVFRSTSAMVYIFIQMSCEMWDFDIYGDLYFEKAVNGFLADLFTKWKEKNCSHEVTVVLFSRTFYDAKSVDEFPEINRASIRQDHKGRFYEDFYKVVVQNERREEWTSLLVTIKKLFIQYPVLVRLEQAEGFPQGDNSTSAQGNYLEAINLSFNVFDKHYINRNFDRTGQMSVVITPGVGVFEVDRLLMILTKQRMIDNGIGVDLVCMGEQPLHAVPLFKLHNRSAPRDSRLGDDYNIPHWINHSFYTSKSQLFCNSFTPRIKLAGKKPASEKAKNGRDTSLGSPKESENALPIQVDYDAYDAQVFRLPGPSRAQCLTTCRSVRERESHSRKSASSCDVSSSPSLPSRTLPTEEVRSQASDDSSLGKSANILMIPHPHLHQYEVSSSLGYTSTRDVLENMMEPPQRDSSAPGRFHVGSAESMLHVRPGGYTPQRALINPFAPSRMPMKLTSNRRRWMHTFPVGPSGEAIQIHHQTRQNMAELQGSGQRDPTHSSAELLELAYHEAAGRHSNSRQPGDGMSFLNFSGTEELSVGLLSNSGAGMNPRTQNKDSLEDSVSTSPDPILTLSAPPVVPGFCCTVGVDWKSLTTPACLPLTTDYFPDRQGLQNDYTEGCYDLLPEADIDRRDEDGVQMTAQQVFEEFICQRLMQGYQIIVQPKTQKPNPAVPPPLSSSPLYSRGLVSRNRPEEEDQYWLSMGRTFHKVTLKDKMITVTRYLPKYPYESAQIHYTYSLCPSHSDSEFVSCWVEFSHERLEEYKWNYLDQYICSAGSEDFSLIESLKFWRTRFLLLPACVTATKRITEGEAHCDIYGDRPRADEDEWQLLDGFVRFVEGLNRIRRRHRSDRMMRKGTAMKGLQMTGPISTHSLESTAPPVGKKGTSALSALLEMEASQKCLGEQQAAVHGGKSSAQSAESSSVAMTPTYMDSPRKDGAFFMEFVRSPRTASSAFYPQVSVDQTATPMLDGTSLGICTGQSMDRGNSQTFGNSQNIGEQGYSSTNSSDSSSQQLVASSLTSSSTLTEILEAMKHPSTGVQLLSEQKGLSPYCFISAEVVHWLVNHVEGIQTQAMAIDIMQKMLEEQLITHASGEAWRTFIYGFYFYKIVTDKEPDRVAMQQPATTWHTAGVDDFASFQRKWFEVAFVAEELVHSEIPAFLLPWLPSRPASYASRHSSFSRSFGGRSQAAALLAATVPEQRTVTLDVDVNNRTDRLEWCSCYYHGNFSLNAAFEIKLHWMAVTAAVLFEMVQGWHRKATSCGFLLVPVLEGPFALPSYLYGDPLRAQLFIPLNISCLLKEGSEHLFDSFEPETYWDRMHLFQEAIAHRFGFVQDKYSASAFNFPAENKPQYIHVTGTVFLQLPYSKRKFSGQQRRRRNSTSSTNQNMFCEERVGYNWAYNTMLTKTWRSSATGDEKFADRLLKDFTDFCINRDNRLVTFWTSCLEKMHASAP,1603,NP_001229825.1.csv,refseq-DEPDC5-NM_001242896.1_clinical_seed_0_final,refseq-DEPDC5-NM_001242896.1.a2m,Invitae,refseq-DEPDC5-NM_001242896.1.npy,1,1603,1603
+NP_001229886.1,MNRYTTMRQLGDGTYGSVLMGKSNESGELVAIKRMKRKFYSWDECMNLREVKSLKKLNHANVIKLKEVIRENDHLYFIFEYMKENLYQLMKDRNKLFPESVIRNIMYQILQGLAFIHKHGFFHRDMKPENLLCMGPELVKIADFGLARELRSQPPYTDYVSTRWYRAPEVLLRSSVYSSPIDVWAVGSIMAELYMLRPLFPGTSEVDEIFKICQVLGTPKKSDWPEGYQLASSMNFRFPQCVPINLKTLIPNASNEAIQLMTEMLNWDPKKRPTASQALKHPYFQVGQVLGPSSNHLESKQSLNKQLQPLESKPSLVEVEPKPLPDIIDQVVGQPQPKTSQQPLQPIQPPQNLSVQQPPKQQSQEKPPQTLFPSIVKNMPTKPNGTLSHKSGRRRWGQTIFKSGDSWEELEDYDFGASHSKKPSMGVFKEKRKKDSPFRLPEPVPSGSNHSTGENKSLPAVTSLKSDSELSTAPTSKQYYLKQSRYLPGVNPKKVSLIASGKEINPHTWSNQLFPKSLGPVGAELAFKRSNAEESIIKPIEKLSCNETFPEKLEDPQGNLGSYATYNQSGYIPSFLKKEVQSAGQRIHLAPLNATASEYTWNTKTGRGQFSGRTYNPTAKNLNIVNRAQPIPSVHGRTDWVAKYGGHR,648,NP_001229886.1.csv,refseq-MAK-NM_001242957.2_clinical_seed_0_final,refseq-MAK-NM_001242957.2.a2m,Invitae,refseq-MAK-NM_001242957.2.npy,1,648,648
+NP_001230106.1,MARRKPEGSSFNMTHLSMAMAFSFPPVASGQLHPQLGNTQHQTELGKELATTSTMPYQYPALTPEQKKELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTADDRVNPCIGGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQGLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQNGIVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKFSHEEIAMATVTALRRTVPPAVTGITFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRALQASALKAWGGKKENLKAAQEEYVKRALANSLACQGKYTPSGQAGAAASESLFVSNHAY,418,NP_001230106.1.csv,NP_001230106.1_clinical_seed_0_final,NP_001230106.1.a2m,popEVE,NP_001230106.1_theta_0.2.npy,1,418,418
+NP_001230695.2,MDDWKPSPLIKPFGARKKRSWYLTWKYKLTNQRALRRFCQTGAVLFLLVTVIVNIKLILDTRRAISEANEDPEPEQDYDEALGRLEPPRRRGSGPRRVLDVEVYSSRSKVYVAVDGTTVLEDEAREQGRGIHVIVLNQATGHVMAKRVFDTYSPHEDEAMVLFLNMVAPGRVLICTVKDEGSFHLKDTAKALLRSLGSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSWGDPVLLKTDVPLSSAEEAECHWADTELNRRRRRFCSKVEGYGSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGNRPNYLYRMLRSLLSAQGVSPQMITVFIDGYYEEPMDVVALFGLRGIQHTPISIKNARVSQHYKASLTATFNLFPEAKFAVVLEEDLDIAVDFFSFLSQSIHLLEEDDSLYCISAWNDQGYEHTAEDPALLYRVETMPGLGWVLRRSLYKEELEPKWPTPEKLWDWDMWMRMPEQRRGRECIIPDVSRSYHFGIVGLNMNGYFHEAYFKKHKFNTVPGVQLRNVDSLKKEAYEVEVHRLLSEAEVLDHSKNPCEDSFLPDTEGHTYVAFIRMEKDDDFTTWTQLAKCLHIWDLDVRGNHRGLWRLFRKKNHFLMSEEATLSHPNFPGATPKGGGSPRSPRTDMRPPPGPCGAGPGSESNLFIDCPEGLENRPNLEGLDFFLGWNAALRVGLALTQETAVPNPWTGPAGAHMLTQTHSETLRHWTRPPLSLLFVQISKAG,748,NP_001230695.2.csv,NP_001230695.2_colabfold_clinical_seed_0_final,NP_001230695.2_colabfold.a2m,colabfold,NP_001230695.2_colabfold_theta_0.2.npy,1,748,748
+NP_001231118.1,MFWCGTCFVTNNMKGSEVSLEKKKKIKMPVKRLREVVSQNHGDHLVLLKDELPCVPPALSANKRLPVGTGTSLNGTSRGSSDLTSARNCYQPLLENPMVSESDFSKDVAVQVLPLDKIEENNKQKANDIFISQYTMGQKDALRTVLKQNVSLCLTGWSDHSGVITTHCSLYLLRLMRSSHLSLPSSWDYRAQSMPVFKEVKVHLLEDAGIEKDAVTQETRISPSGIDSATTVAAATAAAIATAAPLIKVQSDLEAKVNSVTELLSKLQETDKHLQRVTEQQTSIQRKQEKLHCHDHEKQMNVFMEQHIRHLEKLQQQQIDIQTHFISAALKTSSFQPVSMPSSRAVEKYSVKPEHPNLGSCNPSLYNTFASKQAPLKEVEDTSFDKQKSPLETPAPRRFAPVPVSRDDELSKRENLLEEKENMEVSCHRGNVRLLEQILNNNDSLTRKSESSNTTSLTRSKIGWTPEKTNRFPSCEELETTKVTMQKSDDVLHDLGQKEKETNSMVQPKESLSMLKLPDLPQNSVKLQTTNTTRSVLKDAEKILRGVQNNKKVLEENLEAIIRAKDGAAMYSLINALSTNREMSEKIRIRKTVDEWIKTISAEIQDELSRTDYEQKRFDQKNQRTKKGQNMTKDIRTNTQDKTVNKSVIPRKHSQKQIEEHFRNLPMRGMPASSLQKERKEGLLKATTVIQDEDYMLQVYGKPVYQGHRSTLKKGPYLRFNSPSPKSRPQRPKVIERVKGTKVKSIRTQTDFYATKPKKMDSKMKHSVPVLPHGDQQYLFSPSREMPTFSGTLEGHLIPMAILLGQTQSNSDTMPPAGVIVSKPHPVTVTTSIPPSSRKVETGVKKPNIAIVEMKSEKKDPPQLTVQVLPSVDIDSISNSSADVLSPLSSPKEASLPPVQTWIKTPEIMKVDEEEVKFPGTNFDEIIDVIQEEEKCDEIPDSEPILEFNRSVKADSTKYNGPPFPPVASTFQPTADILDKVIERKETLENSLIQWVEQEIMSRIISGLFPVQQQIAPSISVSVSETSEPLTSDIVEGTSSGALQLFVDAGVPVNSNVIKHFVNEALAETIAVMLGDREAKKQGPVATGVSGDASTNETYLPARVCTPLPTPQPTPPCSPSSPAKECVLVKTPDSSPCDSDHDMAFPVKEICAEKGDDMPAIMLVNTPTVTPTTTPPPAAAVFTPTLSDISIDKLKVSSPELPKPWGDGDLPLEEENPNSPQEELHPRAIVMSVAKDEEPESMDFPAQPPPPEPVPFMPFPAGTKAPSPSQMPGSDSSTLESTLSVTVTETETLDKPISEGEILFSCGQKLAPKILEDIGLYLTNLNDSLSSTLHDAVEMEDDPPSEGQVIRMSHKKFHADAILSFAKQNQESAVSQQAVYHSEDLENSVGELSEGQRPQLTAAAENILMGHSLYMQPPVTNTQSLDQQCDPKPLSRQFDTVSGSIYEDSCASHGPMSLGELELEPNSKLVLPTTLLTAQENDVNLPVAAEDFSQYQLKQNQDVKQVEHKPSQSYLRVRNKSDIAPSQQQVSPGDMDRTQIELNPYLTCVFSGLGVHVKKVSCIGKLGLWRFVIQIISSPRWESSATLRFTDAPCQDVSDAAVSEPRGLLSVSESQHNAGGHGVFGGRYLLNGKRQPAQCLCHWF,1644,NP_001231118.1.csv,refseq-KIAA0586-NM_001244189.1_clinical_seed_0_final,refseq-KIAA0586-NM_001244189.1.a2m,Invitae,refseq-KIAA0586-NM_001244189.1.npy,1,1644,1644
+NP_001233.2,MARPHPWWLCVLGTLVGLSATPAPKSCPERHYWAQGKLCCQMCEPGTFLVKDCDQHRKAAQCDPCIPGVSFSPDHHTRPHCESCRHCNSGLLVRNCTITANAECACRNGWQCRDKECTECDPLPNPSLTARSSQALSPHPQPTHLPYVSEMLEARTAGHMQTLADFRQLPARTLSTHWPPQRSLCSSDFIRILVIFSGMFLVFTLAGALFLHQRRKYRSNKGESPVEPAEPCHYSCPREEEGSTIPIQEDYRKPEPACSP,260,NP_001233.2.csv,refseq-CD27-NM_001242.5_clinical_seed_0_final,refseq-CD27-NM_001242.5.a2m,Invitae,refseq-CD27-NM_001242.5_theta_0.2.npy,1,260,260
+NP_001242976.1,MTIMVEDIMKLLCSLSGERKMKAAVKHSGKGALVTGAMAFVGGLVGGPPGLAVGGAVGGLLGAWMTSGQFKPVPQILMELPPAEQQRLFNEAAAIIRHLEWTDAVQLTALVMGSEALQQQLLAMLVNYVTKELRAEIQYDD,141,NP_001242976.1.csv,NP_001242976.1_colabfold_clinical_seed_0_final,NP_001242976.1_colabfold.a2m,colabfold,NP_001242976.1_colabfold_theta_0.2.npy,1,141,141
+NP_001243169.1,MYQVPLPLDRDGTLVRLRFTMVALVTVCCPLVAFLFCILWSLLFHFKETTATHCGVPNYLPSVSSAIGGEVPQRYVWRFCIGLHSAPRFLVAFAYWNHYLSCTSPCSCYRPLCRLNFGLNVVENLALLVLTYVSSSEDFTIHENAFIVFIASSLGHMLLTCILWRLTKKHTVSQEDRKSYSWKQRLFIINFISFFSALAVYFRHNMYCEAGVYTIFAILEYTVVLTNMAFHMTAWWDFGNKELLITSQPEEKRF,254,NP_001243169.1.csv,refseq-PGAP2-NM_001256240.1_clinical_seed_0_final,refseq-PGAP2-NM_001256240.1.a2m,Invitae,refseq-PGAP2-NM_001256240.1.npy,1,254,254
+NP_001243328.1,MEAVVNLYQEVMKHADPRIQGYPLMGSPLLMTSILLTYVYFVLSLGPRIMANRKPFQLRGFMIVYNFSLVALSLYIVYEFLMSGWLSTYTWRCDPVDYSNSPEALRMVRVAWLFLFSKFIELMDTVIFILRKKDGQVTFLHVFHHSVLPWSWWWGVKIAPGGMGSFHAMINSSVHVIMYLYYGLSAFGPVAQPYLWWKKHMTAIQLIQFVLVSLHISQYYFMSSCNYQYPVIIHLIWMYGTIFFMLFSNFWYHSYTKGKRLPRALQQNGAPGIAKVKAN,279,NP_001243328.1.csv,refseq-ELOVL1-NM_001256399.1_clinical_seed_0_final,refseq-ELOVL1-NM_001256399.1.a2m,Invitae,refseq-ELOVL1-NM_001256399.1.npy,1,279,279
+NP_001243371.1,MAASSSEISEMKGVEESPKVPGEGPGHSEAETGPPQVLAGVPDQPEAPQPGPNTTAAPVDSGPKAGLAPETTETPAGASETAQATDLSLSPGGESKANCSPEDPCQETVSKPEVSKEATADQGSRLESAAPPEPAPEPAPQPDPRPDSQPTPKPALQPELPTQEDPTPEILSESVGEKQENGAVVPLQAGDGEEGPAPEPHSPPSKKSPPANGAPPRVLQQLVEEDRMRRAHSGHPGSPRGSLSRHPSSQLAGPGVEGGEGTQKPRDYIILAILSCFCPMWPVNIVAFAYAVMSRNSLQQGDVDGAQRLGRVAKLLSIVALVGGVLIIIASCVINLGGEWGLGTGRGGMEGLARAALLTPAPALSCLSSLPLLCLSLSPPPPVCPSLSSPTVYK,394,NP_001243371.1.csv,NP_001243371.1_colabfold_clinical_seed_0_final,NP_001243371.1_colabfold.a2m,colabfold,NP_001243371.1_colabfold_theta_0.2.npy,1,394,394
+NP_001243423.1,MAAAPALKHWRTTLERVEKFVSPLYFTDCNLRGRLFGASCPVAVLSSFLTPERLPYQEAVQRDFRPAQVGDSFGPTWWTCWFRVELTIPEAWVGQEVHLCWESDGEGLVWRDGEPVQGLTKEGEKTSYVLTDRLGERDPRSLTLYVEVACNGLLGAGKGSMIAAPDPEKMFQLSRAELAVFHRDVHMLLVDLELLLGIAKGLGKDNQRSFQALYTANQMVNVCDPAQPETFPVAQALASRFFGQHGGESQHTIHATGHCHIDTAWLWPFKETVRKCARSWVTALQLMERNPEFIFACSQAQQLEWVKSRYPGLYSRIQEFACRGQFVPVGGTWVEMDGNLPSGEAMVRQFLQGQNFFLQEFGKMCSEFWLPDTFGYSAQLPQIMHGCGIRRFLTQKLSWNLVNSFPHHTFFWEGLDGSRVLVHFPPGDSYGMQGSVEEVLKTVANNRDKGRANHSAFLFGFGDGGGGPTQTMLDRLKRLSNTDGLPRVQLSSPRQLFSALESDSEQLCTWVGELFLELHNGTYTTHAQIKKGNRECERILHDVELLSSLALARSAQFLYPAAQLQHLWRLLLLNQFHDVVTGSCIQMVAEEAMCHYEDIRSHGNTLLSAAAAALCAGEPGPEGLLIVNTLPWKRIEVMALPKPGGAHSLGLTPSPGDSAQHGLCSCSSPHLTAAPAAPAACVRSARAPTDSASRPPPTKTDGSVTLDNGIIRVKLDPTGRLTSLVLVASGREAIAEGAVGNQFVLFDDVPLYWDAWDVMDYHLETRKPVLGQAGTLAVGTEGGLRGSAWFLLQISPNSRLSQEVVLDVGCPYVRFHTEVHWHEAHKFLKVEFPARVRSSQATYEIQFGHLQRPTHYNTSWDWARFEVWAHRWMDLSEHGFGLALLNDCKYGASVRGSILSLSLLRAPKAPDATADTGRHEFTYALMPHKGSFQDAGVIQAAYSLNFPLLALPAPSPAPATSWSAFSVSSPAVVLETVKQAESSPQRRSLVLRLYEAHGSHVDCWLHLSLPVQEAILCDLLERPDPAGHLTLRDNRLKLTFSPFQVLSLLLVLQPPPH,1057,NP_001243423.1.csv,refseq-MAN2C1-NM_001256494.1_clinical_seed_0_final,refseq-MAN2C1-NM_001256494.1.a2m,Invitae,refseq-MAN2C1-NM_001256494.1.npy,1,1057,1057
+NP_001243439.1,MFRRPVLQVLRQFVRHESETTTSLVLERSLNRVHLLGRVGQDPVLRQVEGKNPVTIFSLATNEMWRSGDSEVYQLGDVSQKTTWHRISVFRPGLRDVAYQYVKKGSRIYLEGKIDYGEYMDKNNVRRQATTIIADNIIFLSDQTKEKE,148,NP_001243439.1.csv,refseq-SSBP1-NM_001256510.1_clinical_seed_0_final,refseq-SSBP1-NM_001256510.1.a2m,Invitae,refseq-SSBP1-NM_001256510.1.npy,1,148,148
+NP_001243506.2,MGRRPARCYRYCKNKPYPKSRFCRGVPDAKIRIFDLGRKKAKVDEFPLCGHMVSDEYEQLSSEALEAARICANKYMVKSCGKDGFHIRVRLHPFHVIRINKMLSCAGADRSTSQRSGASPSSMLMNLKTWWLKSGSSQMAVGSSTSPVVALWTSGGPCTHEGFQCAAPLLILTNKFYFLSTYVFVSTFLTGKELPLGTFGSLPFHFRNRLTTQPCS,216,NP_001243506.2.csv,NP_001243506.2_colabfold_clinical_seed_0_final,NP_001243506.2_colabfold.a2m,colabfold,NP_001243506.2_colabfold_theta_0.2.npy,1,216,216
+NP_001243556.1,MTSTGKDGGAQHAQYVGPYRLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLMKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKFFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHYACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPPDCQSLLRGMIEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDPDVLDSMHSLGCFRDRNKLLQDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDPPRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGLSTSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLNSIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKDKPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGEAQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSDTTNCMEMMTGRLSKCGSPLSNFFDVIKQLFSDEKNGQAAQAPSTPAKRSAHGPLGDSAAAGPGPGGDAEYPTGKDTAKMGPPTARREQP,736,NP_001243556.1.csv,refseq-BRSK2-NM_001256627.1_clinical_seed_0_final,refseq-BRSK2-NM_001256627.1.a2m,Invitae,refseq-BRSK2-NM_001256627.1.npy,1,736,736
+NP_001243643.1,MLPLLDSSKRAGTLGSGCGVPRVHSAALSREEGASRDIWRIKVWARVMTTPAGSGSGFGSVSWWGLSPALDLQAERDATVDALPTTMVPQPAVILPGPPVDPDSQADTVHSNPELDVLLLGSVDGRHLLRTLSRAKFWPRRRFNFFVLENNLEAVARHMLIFSLALEEPEKMGLQERSETFLEVWGNALLRPPVAAFVRAQADLLAHLVPEPDRLEEQLPWLSLRALKFRERDALEAVFRFWAGGEKGPQAFPMSRLWDSRLRHYLGSRYDARRGVSDWDLRMKLHDRGAQVIHPQEFRRWRDTGVAFELRDSSAYHVPNRTLASGRLLSYRGERVAARGYWGDIATGPFVAFGIEADDESLLRTSNGQPVKTAGEITQHNVTELLRDVAAWGRARATGGDLEEQQHAEGSPEPGTPAPTPESFTVHFLPLNSAQTLHHKSCYNGRFQLLYVACGMVHLLIPELGACVAPGGNLIVELARYLVDVRQEQLQGFNTRVRELAQAAGFAPQTGARPSETFARFCKSQESALGNTVPAVEPGTPPLDILAQPLEASNPALEGLTQPLQGGTPHCEPCQLPSESPGSLSEVLAQPQGALAPPNCESDSKTGV,608,NP_001243643.1.csv,NP_001243643.1_colabfold_clinical_seed_0_final,NP_001243643.1_colabfold.a2m,colabfold,NP_001243643.1_colabfold_theta_0.2.npy,1,608,608
+NP_001243805.1,MAHTTKVNGSASGKAGDILSGDQDKEQKDPYFVETPYGYQLDLDFLKYVDDIQKGNTIKRLNIQKRRKPSVPCPEPRTTSGQQGIWTSTESLSSSNSDDNKQCPNFLIARSQVTSTPISKPPPPLETSLPFLTIPENRQLPPPSPQLPKHNLHVTKTLMETRRRLEQERATMQMTPGEFRRPRLASFGGMGTTSSLPSFVGSGNHNPAKHQLQNGYQGNGDYGSYAPAAPTTSSMGSSIRHSPLSSGISTPVTNVSPMHLQHIREQMAIALKRLKELEEQVRTIPVLQVKISVLQEEKRQLVSQLKNQRAASQINVCGVRKRSYSAGNASQLEQLSRARRSGGELYIDYEEEEMETVEQSTQRIKEFRQLTADMQALEQKIQDSSCEASSELRENGECRSVAVGAEENMNDIVVYHRGSRSCKDAAVGTLVEMRNCGVSVTEAMLGVMTEADKEIELQQQTIESLKEKIYRLEVQLRETTHDREMTKLKQELQAAGSRKKVDKATMAQPLVFSKVVEAVVQTRDQMVGSHMDLVDTCVGTSVETNSVGISCQPECKNKVVGPELPMNWWIVKERVEMHDRCAGRSVEMCDKSVSVEVSVCETGSNTEESVNDLTLLKTNLNLKEVRSIGCGDCSVDVTVCSPKECASRGVNTEAVSQVEAAVMAVPRTADQDTSTDLEQVHQFTNTETATLIESCTNTCLSTLDKQTSTQTVETRTVAVGEGRVKDINSSTKTRSIGVGTLLSGHSGFDRPSAVKTKESGVGQININDNYLVGLKMRTIACGPPQLTVGLTASRRSVGVGDDPVGESLENPQPQAPLGMMTGLDHYIERIQKLLAEQQTLLAENYSELAEAFGEPHSQMGSLNSQLISTLSSINSVMKSASTEELRNPDFQKTSLGKITGNYLGYTCKCGGLQSGSPLSSQTSQPEQEVGTSEGKPISSLDAFPTQEGTLSPVNLTDDQIAAGLYACTNNESTLKSIMKKKDGNKDSNGAKKNLQFVGINGGYETTSSDDSSSDESSSSESDDECDVIEYPLEEEEEEEDEDTRGMAEGHHAVNIEGLKSARVEDEMQVQECEPEKVEIRERYELSEKMLSACNLLKNTINDPKALTSKDMRFCLNTLQHEWFRVSSQKSAIPAMVGDYIAAFEAISPDVLRYVINLADGNGNTALHYSVSHSNFEIVKLLLDADVCNVDHQNKAGYTPIMLAALAAVEAEKDMRIVEELFGCGDVNAKASQAGQTALMLAVSHGRIDMVKGLLACGADVNIQDDEGSTALMCASEHGHVEIVKLLLAQPGCNGHLEDNDGSTALSIALEAGHKDIAVLLYAHVNFAKAQSPGTPRLGRKTSPGPTHRGSFD,1352,NP_001243805.1.csv,refseq-KANK1-NM_001256876.3_clinical_seed_0_final,refseq-KANK1-NM_001256876.3.a2m,Invitae,refseq-KANK1-NM_001256876.3_theta_0.2.npy,1,1352,1352
+NP_001244074.1,MAELYRVLEAGKIGIFESPTGTGKSLSLICGALSWLRDFEQKKREEEARLLETGTGPLHDEKDESLCLSSSCEGAAGTPRPAGEPAWVTQFVQKKEERDLVDRLKAEQARRKQREERLQQLQHRVQLKYAAKRLRQEEEERENLLRLSREMLETGPEAERLEQLESGEEELVLAEYESDEEKKVASRVDEDEDDLEEEHITKIYYCSRTHSQLAQFVHEVKKSPFGKDVRLVSLGSRQNLCVNEDVKSLGSVQLINDRCVDMQRSRHEKKKGAEEEKPKRRRQEKQAACPFYNHEQMGLLRDEALAEVKDMEQLLALGKEARACPYYGSRLAIPAAQLVVLPYQMLLHAATRQAAGIRLQDQVVIIDEAHNLIDTITGMHSVEVSGSQLCQAHSQLLQYVERYGKRLKAKNLMYLKQILYLLEKFVAVLGGNIKQNPNTQSLSQTGTELKTINDFLFQSQIDNINLFKVQRYCEKSMISRKLFGFTERYGAVFSSREQPKLAGFQQFLQSLQPRTTEALAAPADESQASTLRPASPLMHIQGFLAALTTANQDGRVILSRQGSLSQSTLKFLLLNPAVHFAQVVKECRAVVIAGGTMQPVSDFRQQLLACAGVEAERVVEFSCGHVIPPDNILPLVICSGISNQPLEFTFQKRELPQMMDEVGRILCNLCGVVPGGVVCFFPSYEYLRQVHAHWEKGGLLGRLAARKKIFQEPKSAHQVEQVLLAYSRCIQACGQERGQVTGALLLSVVGGKMSEGINFSDNLGRCVVMVGMPFPNIRSAELQEKMAYLDQTLPRAPGQAPPGKALVENLCMKAVNQSIGRAIRHQKDFASVVLLDQRYARPPVLAKLPAWIRARVEVKATFGPAIAAVQKFHREKSASS,880,NP_001244074.1.csv,refseq-DDX11-NM_001257145.1_clinical_seed_0_final,refseq-DDX11-NM_001257145.1.a2m,Invitae,refseq-DDX11-NM_001257145.1_theta_0.2.npy,1,880,880
+NP_001244159.1,MNNHQLELAKQLHKEGHLFYCTCRVLTCPGQAKSIASAPGKCQDSAALTSTAFSGLDFGLLSGYLHKQALVTATHPTCTLLFPSCHAFFPLPLTPTLYKMHKGWKNYCSQKSLNEASMDEYLGSLGLFRKLTAKDASCLFRAISEQLFCSQVHHLEIRKACVSYMRENQQTFESYVEGSFEKYLERLGDPKESAGQLEIRALSLIYNRDFILYRFPGKPPTYVTDNGYEDKILLCYSSSGHYDSVYSKQFQSSAAVCQAVLYEILYKDVFVVDEEELKTAIKLFRSGSKKNRNNAVTGSEDAHTDYKSSNQNRMEEWGACYNAENIPEGYNKGTEETKSPENPSKMPFPYKVLKALDPEIYRNVEFDVWLDSRKELQKSDYMEYAGRQYYLGDKCQVCLESEGRYYNAHIQEVGNENNSVTVFIEELAEKHVVPLANLKPVTQVMSVPAWNAMPSRKGRGYQKMPGGYVPEIVISEMDIKQQKKMFKKIRGKEVYMTMAYGKGDPLLPPRLQHSMHYGHDPPMHYSQTAGNVMSNEHFHPQHPSPRQGRGYGMPRNSSRFINRHNMPGPKVDFYPGPGKRCCQSYDNFSYRSRSFRRSHRQMSCVNKESQYGFTPGNGQMPRGLEETITFYEVEEGDETAYPTLPNHGGPSTMVPATSGYCVGRRGHSSGKQTLNLEEGNGQSENGRYHEEYLYRAEPDYETSGVYSTTASTANLSLQDRKSCSMSPQDTVTSYNYPQKMMGNIAAVAASCANNVPAPVLSNGAAANQAISTTSVSSQNAIQPLFVSPPTHGRPDTKVLQYYFNLGLQCYYHSYWHSMVYVPQMQQQLHVENYPVYTEPPLVDQTVPQCYSEVRREDGIQAEASANDTFPNADSSSVPHGAVYYPVMSDPYGQPPLPGFDSCLPVVPDYSCVPPWHPVGTAYGGSSQIHGAINPGPIGCIAPSPPASHYVPQGM,954,NP_001244159.1.csv,refseq-ALG13-NM_001257230.1_clinical_seed_0_final,refseq-ALG13-NM_001257230.1.a2m,Invitae,refseq-ALG13-NM_001257230.1.npy,1,954,954
+NP_001244941.2,MASRCWRWWGWSAWPRTRLPPAGSTPSFCHHFSTQEKTPQICVVGSGPAGFYTAQHLLKRVEALCSQPRVLNSPALSGEGEDLGASQPLSLGPTSCHPVPQQHPQAHVDIYEKQPVPFGLVRFGVAPDHPEVKNVINTFTQTAHSGRCAFWGNVEVGRDVTVPELQEAYHAVVLSYGAEDHRALEIPGEELPGVCSARAFVGWYNGLPENQELEPDLSCDTAVILGQGNVALDVARILLTPPEHLERTDITKAALGVLRQSRVKTVWLVGRRGPLQVAFTIKELREMIQLPGARPILDPVDFLGLQDKIKEVPRPRKRLTELLLRTATEKPGPAEAARQASASRAWGLRFFRSPQQVLPSPDGRRAAGVRLAVTRLEGVDEATRAVPTGDMEDLPCGLVLSSIGYKSRPVDPSVPFDSKLGVIPNVEGRVMDVPGLYCSGWVKRGPTGVIATTMTDSFLTGQMLLQDLKAGLLPSGPRPGYAAIQALLSSRGVRPVSFSDWEKLDAEEVARGQGTGKPREKLVDPQEMLRLLGH,534,NP_001244941.2.csv,NP_001244941.2_colabfold_clinical_seed_0_final,NP_001244941.2_colabfold.a2m,colabfold,NP_001244941.2_colabfold_theta_0.2.npy,1,534,534
+NP_001245150.1,MSRPLITRSPASPLNNQGIPTPAQLTKSNAPVHIDVGGHMYTSSLATLTKYPESRIGRLFDGTEPIVLDSLKQHYFIDRDGQMFRYILNFLRTSKLLIPDDFKDYTLLYEEAKYFQLQPMLLEMERWKQDRETGRFSRPCECLVVRVAPDLGERITLSGDKSLIEEVFPEIGDVMCNSVNAGWNHDSTHVIRFPLNGYCHLNSVQVLERLQQRGFEIVGSCGGGVDSSQFSEYVLRRELRRTPRVPSVIRIKQEPLD,257,NP_001245150.1.csv,refseq-KCTD1-NM_001258221.2_clinical_seed_0_final,refseq-KCTD1-NM_001258221.2.a2m,Invitae,refseq-KCTD1-NM_001258221.2.npy,1,257,257
+NP_001245218.1,MRLEWGPRPAALPWPAGMCAAERAEGAFTLQSVAQPMRPIASTATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWNTCSSCFGDSTKSRTKLVLPSLISSRIYVVDVGSEPRAPKLHKVIEPKDIHAKCELAFLHTSHCLASGEVMISSLGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELRYPGGDCSSDIWI,514,NP_001245218.1.csv,refseq-SELENBP1-NM_001258289.1_clinical_seed_0_final,refseq-SELENBP1-NM_001258289.1.a2m,Invitae,refseq-SELENBP1-NM_001258289.1.npy,1,514,514
+NP_001251.1,MDYDFKVKLSSERERVEDLFEYEGCKVGRGTYGHVYKAKRKDGKDDKDYALKQIEGTGISMSACREIALLRELKHPNVISLQKVFLSHADRKVWLLFDYAEHDLWHIIKFHRASKANKKPVQLPRGMVKSLLYQILDGIHYLHANWVLHRDLKPANILVMGEGPERGRVKIADMGFARLFNSPLKPLADLDPVVVTFWYRAPELLLGARHYTKAIDIWAIGCIFAELLTSEPIFHCRQEDIKTSNPYHHDQLDRIFNVMGFPADKDWEDIKKMPEHSTLMKDFRRNTYTNCSLIKYMEKHKVKPDSKAFHLLQKLLTMDPIKRITSEQAMQDPYFLEDPLPTSDVFAGCQIPYPKREFLTEEEPDDKGDKKNQQQQQGNNHTNGTGHPGNQDSSHTQGPPLKKVRVVPPTTTSGGLIMTSDYQRSNPHAAYPNPGPSTSQPQSSMGYSATSQQPPQYSHQTHRY,464,NP_001251.1.csv,refseq-CDK8-NM_001260.2_clinical_seed_0_final,refseq-CDK8-NM_001260.2.a2m,Invitae,refseq-CDK8-NM_001260.2.npy,1,464,464
+NP_001254507.1,MAGDGRRAEAVREGWGVYVTPRAPIREGRGRLAPQNGGSSDAPAYRTPPSRQGRREVRFSDEPPEVYGDFEPLVAKERSPVGKRTRLEEFRSDSAKEEVRESAYYLRSRQRRQPRPQETEEMKTRRTTRLQQQHSEQPPLQPSPVMTRRGLRDSHSSEEDEASSQTDLSQTISKKTVRSIQEAPAVSEDLVIRLRRPPLRYPRYEATSVQQKVNFSEEGETEEDDQDSSHSSVTTVKARSRDSDESGDKTTRSSSQYIESFWQSSQSQNFTAHDKQPSVLSSGYQKTPQEWAPQTARIRTRMQNDSILKSELGNQSPSTSSRQVTGQPQNASFVKRNRWWLLPLIAALASGSFWFFSTPEVETTAVQEFQNQMNQLKNKYQGQDEKLWKRSQTFLEKHLNSSHPRSQPAILLLTAARDAEEALRCLSEQIADAYSSFRSVRAIRIDGTDKATQDSDTVKLEVDQELSNGFKNGQNAAVVHRFESFPAGSTLIFYKYCDHENAAFKDVALVLTVLLEEETLGTSLGLKEVEEKVRDFLKVKFTNSNTPNSYNHMDPDKLNGLWSRISHLVLPVQPENALKRGICL,584,NP_001254507.1.csv,refseq-TOR1AIP1-NM_001267578.1_clinical_seed_0_final,refseq-TOR1AIP1-NM_001267578.1.a2m,Invitae,refseq-TOR1AIP1-NM_001267578.1.npy,1,584,584
+NP_001254538.1,MSARGVVQHALPTPRRGALTMSLNSSLSCRKELSNLTEEEGGEGGVIITQFIAIIVITIFVCLGNLVIVVTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFVVTSSIRREWIFGVVWCNFSALLYLLISSASMLTLGVIAIDRYYAVLYPMVYPMKITGNRAVMALVYIWLHSLIGCLPPLFGWSSVEFDEFKWMCVAAWHREPGYTAFWQIWCALFPFLVMLVCYGFIFRVARVKARKVHCGTVVIVEEDAQRTGRKNSSTSTSSSGSRRNAFQGVVYSANQCKALITILVVLGAFMVTWGPYMVVIASEALWGKSSVSPSLETWATWLSFASAVCHPLIYGLWNKTVRKELLGMCFGDRYYREPFVQRQRTSRLFSISNRITDLGLSPHLTALMAGGQPLGHSSSTGDTGFSCSQDSGTDMMLLEDYTSDDNPPSHCTCPPKRRSSVTFEDEVEQIKEAAKNSILHVKAEVHKSLDSYAASLAKAIEAEAKINLFGEEALPGVLVTARTVPGGGFGGRRGSRTLVSQRLQLQSIEEGDVLAAEQR,549,NP_001254538.1.csv,refseq-GPR161-NM_001267609.1_clinical_seed_0_final,refseq-GPR161-NM_001267609.1.a2m,Invitae,refseq-GPR161-NM_001267609.1.npy,1,549,549
+NP_001255.4,MGSSRAPWMGRVGGHGMMALLLAGLLLPGTLAKSIGTFSDPCKDPTRITSPNDPCLTGKGDSSGFSSYSGSSSSGSSISSARSSGGGSSGSSSGSSIAQGGSAGSFKPGTGYSQVSYSSGSGSSLQGASGSSQLGSSSSHSGNSGSHSGSSSSHSSSSSSFQFSSSSFQVGNGSALPTNDNSYRGILNPSQPGQSSSSSQTFGVSSSGQSVSSNQRPCSSDIPDSPCSGGPIVSHSGPYIPSSHSVSGGQRPVVVVVDQHGSGAPGVVQGPPCSNGGLPGKPCPPITSVDKSYGGYEVVGGSSDSYLVPGMTYSKGKIYPVGYFTKENPVKGSPGVPSFAAGPPISEGKYFSSNPIIPSQSAASSAIAFQPVGTGGVQLCGGGSTGSKGPCSPSSSRVPSSSSISSSSGLPYHPCGSASQSPCSPPGTGSFSSSSSSQSSGKIILQPCGSKSSSSGHPCMSVSSLTLTGGPDGSPHPDPSAGAKPCGSSSAGKIPCRSIRDILAQVKPLGPQLADPEVFLPQGELLNSP,529,NP_001255.4.csv,refseq-CDSN-NM_001264.5_clinical_seed_0_final,refseq-CDSN-NM_001264.5.a2m,Invitae,refseq-CDSN-NM_001264.5_theta_0.2.npy,1,529,529
+NP_001258750.1,MEPLRLLILLFVTELSGAHNTTVFQGVAGQSLQVSCPYDSMKHWGRRKAWCRQLGEKGPCQRVVSTHNLWLLSFLRRWNGSTAITDDTLGGTLTITLRNLQPHDAGLYQCQSLHGSEADTLRKVLVEVLADPLDHRDAGDLWFPGESESFEDAHVEHSISRAERHVKEDDGRKSPGEVPPGTSPACILATWPPGLLVLLWQETTLPEHCFSWTLEAGTG,219,NP_001258750.1.csv,refseq-TREM2-NM_001271821.1_clinical_seed_0_final,refseq-TREM2-NM_001271821.1.a2m,Invitae,refseq-TREM2-NM_001271821.1.npy,1,219,219
+NP_001258928.1,MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVGNISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFPPDDIEVVYSPRDCRTLVSAVPEESEMDPHVRDCIRSYTEDWAIVIRKYHKLGTGFNPNTLDKQKERQKGLPKQVFESDEAPDGNSYQDDQDDLKRRSMSIDDTPRGSWACSIFDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSNRHKELFALHPSPDEEEPIERLSVPDIPKEHFGQRLLVKCLSLKFEIEIEPIFASLALYDVKEKKKISENFYFDLNSEQMKGLLRPHVPPAAITTLARSAIFSITYPSQDVFLVIKLEKVLQQGDIGECAEPYMIFKEADATKNKEKLEKLKSQADQFCQRLGKYRMPFAWTAIHLMNIVSSAGSLERDSTEVEISTGERKGSWSERRNSSIVGRRSLERTTSGDDACNLTSFRPATLTVTNFFKQEGDRLSDEDLYKFLADMRRPSSVLRRLRPITAQLKIDISPAPENPHYCLTPELLQVKLYPDSRVRPTREILEFPARDVYVPNTTYRNLLYIYPQSLNFANRQGSARNITVKVQFMYGEDPSNAMPVIFGKSSCSEFSKEAYTAVVYHNRSPDFHEEIKVKLPATLTDHHHLLFTFYHVSCQQKQNTPLETPVGYTWIPMLQNGRLKTGQFCLPVSLEKPPQAYSVLSPEVPLPGMKWVDNHKGVFNVEVVAVSSIHTQDPYLDKFFALVNALDEHLFPVRIGDMRIMENNLENELKSSISALNSSQLEPVVRFLHLLLDKLILLVIRPPVIAGQIVNLGQASFEAMASIINRLHKNLEGNHDQHGRNSLLASYIHYVFRLPNTYPNSSSPGPGGLGGSVHYATMARSAVRPASLNLNRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSCNRMSSHTETSSFLQTLTGRLPTKKLFHEELALQWVVCSGSVRESALQQAWFFFELMVKSMVHHLYFNDKLEAPRKSRFPERFMDDIAALVSTIASDIVSRFQKDTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFLRIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGLVLTELAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESESGSMISQTVAMAIAGTSVPQLTRPGSFLLTSTSGRQHTTFSAESSRSLLICLLWVLKNADETVLQKWFTDLSVLQLNRLLDLLYLCVSCFEYKGKKVFERMNSLTFKKSKDMRAKLEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQENLRWRKDMTHWRQNTEKLDKSRAEIEHEALIDGNLATEANLIILDTLEIVVQTVSVTESKESILGGVLKVLLHSMACNQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCLRLLRHCSSSIGTIRSHASASLYLLMRQNFEIGNNFARVKMQVTMSLSSLVGTSQNFNEEFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILSDTVKMKEHQEDPEMLIDLMYRIAKGYQTSPDLRLTWLQNMAGKHSERSNHAEAAQCLVHSAALVAEYLSMLEDRKYLPVGCVTFQNISSNVLEESAVSDDVVSPDEEGICSGKYFTESGLVGLLEQAAASFSMAGMYEAVNEVYKVLIPIHEANRDAKKLSTIHGKLQEAFSKIVHQDGKRMFGTYFRVGFYGTKFGDLDEQEFVYKEPAITKLAEISHRLEGFYGERFGEDVVEVIKDSNPVDKCKLDPNKAYIQITYVEPYFDTYEMKDRITYFDKNYNLRRFMYCTPFTLDGRAHGELHEQFKRKTILTTSHAFPYIKTRVNVTHKEEIILTPIEVAIEDMQKKTQELAFATHQDPADPKMLQMVLQGSVGTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLRLCFKDFTKRCEDALRKNKSLIGPDQKEYQRELERNYHRLKEALQPLINRKIPQLYKAVLPVTCHRDSFSRMSLRKMDL,2129,NP_001258928.1.csv,refseq-DOCK7-NM_001271999.1_clinical_seed_0_final,refseq-DOCK7-NM_001271999.1.a2m,Invitae,refseq-DOCK7-NM_001271999.1.npy,1,2129,2129
+NP_001261.2,MNGHSDEESVRNSSGESSQSDDDSGSASGSGSGSSSGSSSDGSSSQSGSSDSDSGSESGSQSESESDTSRENKVQAKPPKVDGAEFWKSSPSILAVQRSAILKKQQQQQQQQQHQASSNSGSEEDSSSSEDSDDSSSEVKRKKHKDEDWQMSGSGSPSQSGSDSESEEEREKSSCDETESDYEPKNKVKSRKPQNRSKSKNGKKILGQKKRQIDSSEEDDDEEDYDNDKRSSRRQATVNVSYKEDEEMKTDSDDLLEVCGEDVPQPEEEEFETIERFMDCRIGRKGATGATTTIYAVEADGDPNAGFEKNKEPGEIQYLIKWKGWSHIHNTWETEETLKQQNVRGMKKLDNYKKKDQETKRWLKNASPEDVEYYNCQQELTDDLHKQYQIVERIIAHSNQKSAAGYPDYYCKWQGLPYSECSWEDGALISKKFQACIDEYFSRNQSKTTPFKDCKVLKQRPRFVALKKQPSYIGGHEGLELRDYQLNGLNWLAHSWCKGNSCILADEMGLGKTIQTISFLNYLFHEHQLYGPFLLVVPLSTLTSWQREIQTWASQMNAVVYLGDINSRNMIRTHEWTHHQTKRLKFNILLTTYEILLKDKAFLGGLNWAFIGVDEAHRLKNDDSLLYKTLIDFKSNHRLLITGTPLQNSLKELWSLLHFIMPEKFSSWEDFEEEHGKGREYGYASLHKELEPFLLRRVKKDVEKSLPAKVEQILRMEMSALQKQYYKWILTRNYKALSKGSKGSTSGFLNIMMELKKCCNHCYLIKPPDNNEFYNKQEALQHLIRSSGKLILLDKLLIRLRERGNRVLIFSQMVRMLDILAEYLKYRQFPFQRLDGSIKGELRKQALDHFNAEGSEDFCFLLSTRAGGLGINLASADTVVIFDSDWNPQNDLQAQARAHRIGQKKQVNIYRLVTKGSVEEDILERAKKKMVLDHLVIQRMDTTGKTVLHTGSAPSSSTPFNKEELSAILKFGAEELFKEPEGEEQEPQEMDIDEILKRAETHENEPGPLTVGDELLSQFKVANFSNMDEDDIELEPERNSKNWEEIIPEDQRRRLEEEERQKELEEIYMLPRMRNCAKQISFNGSEGRRSRSRRYSGSDSDSISEGKRPKKRGRPRTIPRENIKGFSDAEIRRFIKSYKKFGGPLERLDAIARDAELVDKSETDLRRLGELVHNGCIKALKDSSSGTERTGGRLGKVKGPTFRISGVQVNAKLVISHEEELIPLHKSIPSDPEERKQYTIPCHTKAAHFDIDWGKEDDSNLLIGIYEYGYGSWEMIKMDPDLSLTHKILPDDPDKKPQAKQLQTRADYLIKLLSRDLAKKEALSGAGSSKRRKARAKKNKAMKSIKVKEEIKSDSSPLPSEKSDEDDDKLSESKSDGRERSKKSSVSDAPVHITASGEPVPISEESEELDQKTFSICKERMRPVKAALKQLDRPEKGLSEREQLEHTRQCLIKIGDHITECLKEYTNPEQIKQWRKNLWIFVSKFTEFDARKLHKLYKHAIKKRQESQQNSDQNSNLNPHVIRNPDVERLKENTNHDDSSRDSYSSDRHLTQYHDHHKDRHQGDSYKKSDSRKRPYSSFSNGKDHRDWDHYKQDSRYYSDREKHRKLDDHRSRDHRSNLEGSLKDRSHSDHRSHSDHRLHSDHRSSSEYTHHKSSRDYRYHSDWQMDHRASSSGPRSPLDQRSPYGSRSPFEHSVEHKSTPEHTWSSRKT,1710,NP_001261.2.csv,refseq-CHD1-NM_001270.2_clinical_seed_0_final,refseq-CHD1-NM_001270.2.a2m,Invitae,refseq-CHD1-NM_001270.2_theta_0.2.npy,1,1710,1710
+NP_001262.3,MMRNKDKSQEEDSSLHSNASSHSASEEASGSDSGSQSESEQGSDPGSGHGSESNSSSESSESQSESESESAGSKSQPVLPEAKEKPASKKERIADVKKMWEEYPDVYGVRRSNRSRQEPSRFNIKEEASSGSESGSPKRRGQRQLKKQEKWKQEPSEDEQEQGTSAESEPEQKKVKARRPVPRRTVPKPRVKKQPKTQRGKRKKQDSSDEDDDDDEAPKRQTRRRAAKNVSYKEDDDFETDSDDLIEMTGEGVDEQQDNSETIEKVLDSRLGKKGATGASTTVYAIEANGDPSGDFDTEKDEGEIQYLIKWKGWSYIHSTWESEESLQQQKVKGLKKLENFKKKEDEIKQWLGKVSPEDVEYFNCQQELASELNKQYQIVERVIAVKTSKSTLGQTDFPAHSRKPAPSNEPEYLCKWMGLPYSECSWEDEALIGKKFQNCIDSFHSRNNSKTIPTRECKALKQRPRFVALKKQPAYLGGENLELRDYQLEGLNWLAHSWCKNNSVILADEMGLGKTIQTISFLSYLFHQHQLYGPFLIVVPLSTLTSWQREFEIWAPEINVVVYIGDLMSRNTIREYEWIHSQTKRLKFNALITTYEILLKDKTVLGSINWAFLGVDEAHRLKNDDSLLYKTLIDFKSNHRLLITGTPLQNSLKELWSLLHFIMPEKFEFWEDFEEDHGKGRENGYQSLHKVLEPFLLRRVKKDVEKSLPAKVEQILRVEMSALQKQYYKWILTRNYKALAKGTRGSTSGFLNIVMELKKCCNHCYLIKPPEENERENGQEILLSLIRSSGKLILLDKLLTRLRERGNRVLIFSQMVRMLDILAEYLTIKHYPFQRLDGSIKGEIRKQALDHFNADGSEDFCFLLSTRAGGLGINLASADTVVIFDSDWNPQNDLQAQARAHRIGQKKQVNIYRLVTKGTVEEEIIERAKKKMVLDHLVIQRMDTTGRTILENNSGRSNSNPFNKEELTAILKFGAEDLFKELEGEESEPQEMDIDEILRLAETRENEVSTSATDELLSQFKVANFATMEDEEELEERPHKDWDEIIPEEQRKKVEEEERQKELEEIYMLPRIRSSTKKAQTNDSDSDTESKRQAQRSSASESETEDSDDDKKPKRRGRPRSVRKDLVEGFTDAEIRRFIKAYKKFGLPLERLECIARDAELVDKSVADLKRLGELIHNSCVSAMQEYEEQLKENASEGKGPGKRRGPTIKISGVQVNVKSIIQHEEEFEMLHKSIPVDPEEKKKYCLTCRVKAAHFDVEWGVEDDSRLLLGIYEHGYGNWELIKTDPELKLTDKILPVETDKKPQGKQLQTRADYLLKLLRKGLEKKGAVTGGEEAKLKKRKPRVKKENKVPRLKEEHGIELSSPRHSDNPSEEGEVKDDGLEKSPMKKKQKKKENKENKEKQMSSRKDKEGDKERKKSKDKKEKPKSGDAKSSSKSKRSQGPVHITAGSEPVPIGEDEDDDLDQETFSICKERMRPVKKALKQLDKPDKGLNVQEQLEHTRNCLLKIGDRIAECLKAYSDQEHIKLWRRNLWIFVSKFTEFDARKLHKLYKMAHKKRSQEEEEQKKKDDVTGGKKPFRPEASGSSRDSLISQSHTSHNLHPQKPHLPASHGPQMHGHPRDNYNHPNKRHFSNADRGDWQRERKFNYGGGNNNPPWGSDRHHQYEQHWYKDHHYGDRRHMDAHRSGSYRPNNMSRKRPYDQYSSDRDHRGHRDYYDRHHHDSKRRRSDEFRPQNYHQQDFRRMSDHRPAMGYHGQGPSDHYRSFHTDKLGEYKQPLPPLHPAVSDPRSPPSQKSPHDSKSPLDHRSPLERSLEQKNNPDYNWNVRKT,1828,NP_001262.3.csv,refseq-CHD2-NM_001271.3_clinical_seed_0_final,refseq-CHD2-NM_001271.3.a2m,Invitae,refseq-CHD2-NM_001271.3.npy,1,1828,1828
+NP_001264.2,MASGLGSPSPCSAGSEEEDMDALLNNSLPPPHPENEEDPEEDLSETETPKLKKKKKPKKPRDPKIPKSKRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALRSDSEGSDYTPGKKKKKKLGPKKEKKSKSKRKEEEEEEDDDDDSKEPKSSAQLLEDWGMEDIDHVFSEEDYRTLTNYKAFSQFVRPLIAAKNPKIAVSKMMMVLGAKWREFSTNNPFKGSSGASVAAAAAAAVAVVESMVTATEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKGSPRVPDAKKPKPKKVAPLKIKLGGFGSKRKRSSSEDDDLDVESDFDDASINSYSVSDGSTSRSSRSRKKLRTTKKKKKGEEEVTAVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTCPALKGKVQKILIWKWGQPPSPTPVPRPPDADPNTPSPKPLEGRPERQFFVKWQGMSYWHCSWVSELQLELHCQVMFRNYQRKNDMDEPPSGDFGGDEEKSRKRKNKDPKFAEMEERFYRYGIKPEWMMIHRILNHSVDKKGHVHYLIKWRDLPYDQASWESEDVEIQDYDLFKQSYWNHRELMRGEEGRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLDATGGTLHPYQMEGLNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLVSAPLSTIINWEREFEMWAPDMYVVTYVGDKDSRAIIRENEFSFEDNAIRGGKKASRMKKEASVKFHVLLTSYELITIDMAILGSIDWACLIVDEAHRLKNNQSKFFRVLNGYSLQHKLLLTGTPLQNNLEELFHLLNFLTPERFHNLEGFLEEFADIAKEDQIKKLHDMLGPHMLRRLKADVFKNMPSKTELIVRVELSPMQKKYYKYILTRNFEALNARGGGNQVSLLNVVMDLKKCCNHPYLFPVAAMEAPKMPNGMYDGSALIRASGKLLLLQKMLKNLKEGGHRVLIFSQMTKMLDLLEDFLEHEGYKYERIDGGITGNMRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIYDSDWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKKKMMLTHLVVRPGLGSKTGSMSKQELDDILKFGTEELFKDEATDGGGDNKEGEDSSVIHYDDKAIERLLDRNQDETEDTELQGMNEYLSSFKVAQYVVREEEMGEEEEVEREIIKQEESVDPDYWEKLLRHHYEQQQEDLARNLGKGKRIRKQVNYNDGSQEDRDWQDDQSDNQSDYSVASEEGDEDFDERSEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNARQRKAFLNAIMRYGMPPQDAFTTQWLVRDLRGKSEKEFKAYVSLFMRHLCEPGADGAETFADGVPREGLSRQHVLTRIGVMSLIRKKVQEFEHVNGRWSMPELAEVEENKKMSQPGSPSPKTPTPSTPGDTQPNTPAPVPPAEDGIKIEENSLKEEESIEGEKEVKSTAPETAIECTQAPAPASEDEKVVVEPPEGEEKVEKAEVKERTEEPMETEPKGAADVEKVEEKSAIDLTPIVVEDKEEKKEEEEKKEVMLQNGETPKDLNDEKQKKNIKQRFMFNIADGGFTELHSLWQNEERAATVTKKTYEIWHRRHDYWLLAGIINHGYARWQDIQNDPRYAILNEPFKGEMNRGNFLEIKNKFLARRFKLLEQALVIEEQLRRAAYLNMSEDPSHPSMALNTRFAEVECLAESHQHLSKESMAGNKPANAVLHKVLKQLEELLSDMKADVTRLPATIARIPPVAVRLQMSERNILSRLANRAPEPTPQQVAQQQ,1912,NP_001264.2.csv,refseq-CHD4-NM_001273.3_clinical_seed_0_final,refseq-CHD4-NM_001273.3.a2m,Invitae,refseq-CHD4-NM_001273.3_theta_0.2.npy,1,1912,1912
+NP_001265067.1,MLEASQDMGRTLEWRPRTPALVRGCHLEGVAGHKEAHILRVLPGHSAGPRTVTVKVELSCAPGDLDAVLILQGPPYVSWLIDANHNMQIWTTGEYSFKIFPEKNIRGFKLPDTPQGLLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDPSCEAEDRGDKFVLRSAYSSCGMQVSASMISNEAVVNILSSSSPQRKKVHCLNMDSLSFQLGLYLSPHFLQASNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQGRAAKGNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHRTVFMRLNIISPDLSGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTRSPSKREPVVAVAAPASSESSSTNHSIGSTQSTPCSTSSMA,476,NP_001265067.1.csv,refseq-ENG-NM_001278138.2_clinical_seed_0_final,refseq-ENG-NM_001278138.2.a2m,Invitae,refseq-ENG-NM_001278138.2_theta_0.2.npy,1,476,476
+NP_001265273.1,MTYVSCFYHAFAGAEQVRQSLKAHSALWKDPPPESSTCSYQEMRRSSVNSSAMAETAANRICKVLAVNQENERLMEEYERLASELLEWIRRTIPWLENRTPEKTMQAMQKKLEDFRDYRRKHKPPKVQEKCQLEINFNTLQTKLRISNRPAFMPSEGKMVSDIAGAWQRLEQAEKGYEEWLLNEIRRLERLEHLAEKFRQKASTHETWAYGKEQILLQKDYESASLTEVRALLRKHEAFESDLAAHQDRVEQIAAIAQELNELDYHDAVNVNDRCQKICDQWDRLGTLTQKRREALERMEKLLETIDQLHLEFAKRAAPFNNWMEGAMEDLQDMFIVHSIEEIQSLITAHEQFKATLPEADGERQSIMAIQNEVEKVIQSYNIRISSSNPYSTVTMDELRTKWDKVKQLVPIRDQSLQEELARQHANERLRRQFAAQANAIGPWIQNKMEEIARSSIQITGALEDQMNQLKQYEHNIINYKNNIDKLEGDHQLIQEALVFDNKHTNYTMEHIRVGWELLLTTIARTINEVETQILTRDAKGITQEQMNEFRASFNHFDRRKNGLMDHEDFRACLISMGYDLGEAEFARIMTLVDPNGQGTVTFQSFIDFMTRETADTDTAEQVIASFRILASDKPYILAEELRRELPPDQAQYCIKRMPAYSGPGSVPGALDYAAFSSALYGESDL,686,NP_001265273.1.csv,refseq-ACTN2-NM_001278344.2_clinical_seed_0_final,refseq-ACTN2-NM_001278344.2.a2m,Invitae,refseq-ACTN2-NM_001278344.2_theta_0.2.npy,1,686,686
+NP_001265868.1,MAGLTAAAPRPGVLLLLLSILHPSRPGGVPGAIPGGVPGGVFYPGAGLGALGGGALGPGGKPLKPVPGGLAGAGLGAGLGAFPAVTFPGALVPGGVADAAAAYKAAKAGAGLGGVPGVGGLGVSAGAVVPQPGAGVKPGKVPGVGLPGVYPGGVLPGARFPGVGVLPGVPTGAGVKPKAPGVGGAFAGIPGVGPFGGPQPGVPLGYPIKAPKLPGGYGLPYTTGKLPYGYGPGGVAGAAGKAGYPTGTGVGPQAAAAAAAKAAAKFGAGAAGVLPGVGGAGVPGVPGAIPGIGGIAGVGTPAAAAAAAAAAKAAKYGAAAGLVPGGPGFGPGVVGVPGAGVPGVGVPGAGIPVVPGAGIPGAAVPGVVSPEAAAKAAAKAAKYGARPGVGVGGIPTYGVGAGGFPGFGVGVGGIPGVAGVPGVGGVPGVGGVPGVGISPEAQAAAAAKAAKYGAAGAGVLGGLVPGAPGAVPGVPGTGGVPGVGTPAAAAAKAAAKAAQFGLVPGVGVAPGVGVAPGVGVAPGVGLAPGVGVAPGVGVAPGVGVAPGIGPGGVAAAAKSAAKVAAKAQLRAAAGLGAGIPGLGVGVGVPGLGVGAGVPGLGVGAGVPGFGAGADEGVRRSLSPELREGDPSSSQHLPSTPSSPRVPGALAAAKAAKYGAAVPGVLGGLGALGGVGIPGGVVGAGPAAAAAAAKAAAKAAQFGLVGAAGLGGLGVGGLGVPGVGGLGGIPPAAAAKAAKYGAAGLGGVLGGAGQFPLGGVAARPGFGLSPIFPGGACLGKACGRKRK,786,NP_001265868.1.csv,refseq-ELN-NM_001278939.1_clinical_seed_0_final,refseq-ELN-NM_001278939.1.a2m,Invitae,refseq-ELN-NM_001278939.1.npy,1,786,786
+NP_001269154.1,MLVDGPSERPALCFLLLAVAMSFFGSALSIDETRAHLLLKEKMMRLGGRLVLNTKEELANERLMTLKIAEMKEAMRTLIFPPSMHFFQAKHLIERSQVFNILRMMPKGAALHLHDIGIVTMDWLVRNVTYRPHCHICFTPRGIMQFRFAHPTPRPSEKCSKWILLEDYRKRVQNVTEFDDSLLRNFTLVTQHPEVIYTNQNVVWSKFETIFFTISGLIHYAPVFRDYVFRSMQEFYEDNVLYMEIRARLLPVYELSGEHHDEEWSVKTYQEVAQKFVETHPEFIGIKIIYSDHRSKDVAVIAESIRMAMGLRIKFPTVVAGFDLVGHEDTGHSLHDYKEALMIPAKDGVKLPYFFHAGETDWQGTSIDRNILDALMLNTTRIGHGFALSKHPAVRTYSWKKDIPIEVCPISNQVLKLVSDLRNHPVATLMATGHPMVISSDDPAMFGAKGLSYDFYEVFMGIGGMKADLRTLKQLAMNSIKYSTLLESEKNTFMEIWKKRWDKFIADVATK,511,NP_001269154.1.csv,refseq-ADA2-NM_001282225.1_clinical_seed_0_final,refseq-ADA2-NM_001282225.1.a2m,Invitae,refseq-ADA2-NM_001282225.1.npy,1,511,511
+NP_001269210.1,MVGGGRRVGRDEPVPSVGALGQGSPDSMSVGFIGAGQLAFALAKGFTAAGVLAAHKIMASSPDMDLATVSALRKMGVKLTPHNKETVQHSDVLFLAVKPHIIPFILDEIGADIEDRHIVVSCAAGVTISSIEKKLSAFRPAPRVIRCMTNTPVVVREGATVYATGTHAQVEDGRLMEQLLSSVGFCTEVEEDLIDAVTGLSGSGPAYAFTALDALADGGVKMGLPRRLAVRLGAQALLGAAKMLLHSEQHPGQLKDNVSSPGGATIHALHVLESGGFRSLLINAVEASCIRTRELQSMADQEQVSPAAIKKTILDKVKLDSPAGTALSPSGHTKLLPRSLAPAGKD,346,NP_001269210.1.csv,refseq-PYCR1-NM_001282281.2_clinical_seed_0_final,refseq-PYCR1-NM_001282281.2.a2m,Invitae,refseq-PYCR1-NM_001282281.2_theta_0.2.npy,1,346,346
+NP_001269333.1,MTRDDLFNTNATIVATLTAACAQHCPEAMICVIANPVNSTIPITAEVFKKHGVYNPNKIFGVTTLDIVRANTFVAELKGLDPARVNVPVIGGHAGKTIIPLISQCTPKVDFPQDQLTALTGRIQEAGTEVVKAKAGAGSATLSMAYAGARFVFSLVDAMNGKEGVVECSFVKSQETECTYFSTPLLLGKKGIEKNLGIGKVSSFEEKMISDAIPELKASIKKGEDFVKTLK,231,NP_001269333.1.csv,refseq-MDH2-NM_001282404.2_clinical_seed_0_final,refseq-MDH2-NM_001282404.2.a2m,Invitae,refseq-MDH2-NM_001282404.2.npy,1,231,231
+NP_001269463.1,MKRQNVRTLSLIVCTFTYLLVGAAVFDALESDHEMREEEKLKAEEIRIKGKYNISSEDYRQLELVILQSEPHRAGVQWKFAGSFYFAITVITTIGYGHAAPGTDAGKAFCMFYAVLGIPLTLVMFQSLGERMNTFVRYLLKRIKKCCGMRNTDVSMENMVTVGFFSCMGTLCIGAAAFSQCEEWSFFHAYYYCFITLTTIGFGDYVALQTKGALQKKPLYVAFSFMYILVGLTVIGAFLNLVVLRFLTMNSEDERRDAEERASLAGNRNSMVIHIPEEPRPSRPRYKADVPDLQSVCSCTCYRSQDYGGRSVAPQNSFSAKLAPHYFHSISYKIEEISPSTLKNSLFPSPISSISPGLHSFTDHQRLMKRRKSV,374,NP_001269463.1.csv,refseq-KCNK9-NM_001282534.1_clinical_seed_0_final,refseq-KCNK9-NM_001282534.1.a2m,Invitae,refseq-KCNK9-NM_001282534.1.npy,1,374,374
+NP_001269613.2,MRMEAGEAAPPAGAGGRAAGGWGKWVRLNVGGTVFLTTRQTLCREQKSFLSRLCQGEELQSDRDETGAYLIDRDPTYFGPILNFLRHGKLVLDKDMAEEGVLEEAEFYNIGPLIRIIKDRMEEKDYTVTQVPPKHVYRVLQCQEEELTQMVSTMSDGWRFEQLVNIGSSYNYGSEDQAEFLCVVSKELHSTPNGLSSESSRKTKSTEEQLEEQQQQEEEVEEVEVEQVQVEADAQEKAQSSQDPANLFSLPPLPPPPLPAGGSRPHPLRPEAELAVRASPRPLARPQSCHPCCYKPEAPGCEAPDHLQGLGVPI,314,NP_001269613.2.csv,NP_001269613.2_colabfold_clinical_seed_0_final,NP_001269613.2_colabfold.a2m,colabfold,NP_001269613.2_colabfold_theta_0.2.npy,1,314,314
+NP_001269862.1,MAQAIFEALEGMDNQTVLAVQSLLDGQGAVPDPTGQSVNAPPAIQPLDDEDVFLCGKCKKQFNSLPAFMTHKREQCQGNAPALATVSLATNSIYPPSAAPTAVQQAPTPANRQISTYITVPPSPLIQTLVQGNILVSDDVLMSAMSAFTSLDQPMPQGPPPVQSSLNMHSVPSYLTQPPPPPPPPPPLPPPPPPQPPPPPPQSLGPPGRPNPGGNGVVEVYSAAAPLAGSGTVEIQALGMQPYPPLEVPNQCVEPPVYPTPTVYSPGKQGFKPKGPNPAAPMTSATGGTVATFDSPATLKTRRAKGARGLPEAAGKPKAQKLKCSYCDKSFTKNFDLQQHIRSHTGEKPFQCIACGRAFAQKSNVKKHMQTHKVWPPGHSGGTVSRNSVTVQVMALNPSRQEDEESTGLGQPLPGAPQPQALSTAGEEEGDKPESKQVVLIDSSYLCQFCPSKFSTYFQLKSHMTQHKNEQVYKCVVKSCAQTFPKLDTFLEHIKSHQEELSYRCHLCGKDFPSLYDLGVHQYSHSLLPQHSPKKDNAVYKCVKCVNKYSTPEALEHHLQTATHNFPCPHCQKVFPCERYLRRHLPTHGSGGRFKCQVCKKFFRREHYLKLHAHIHSGEKPYKCSVCESAFNRKDKLKRHMLIHEPFKKYKCPFSTHTGCSKEFNRPDKLKAHILSHSGMKLHKCALCSKSFSRRAHLAEHQRAHTGNYKFRCAGCAKGFSRHKYLKDHRCRLGPQKDKDLQTRRPPQRRAAPRSCGSGGRKVLTPLPDPLGLEELKDTGAGLVPEAVPGKPPFAEPDAVLSIVVGGAVGAETELVVPGHAEGLGSNLALAELQAGAEGPCAMLAVPVYIQASE,854,NP_001269862.1.csv,refseq-ZNF341-NM_001282933.1_clinical_seed_0_final,refseq-ZNF341-NM_001282933.1.a2m,Invitae,refseq-ZNF341-NM_001282933.1.npy,1,854,854
+NP_001271165.2,MDSYFKAAVSDLDKLLDDFEQNPDEQDYLQDVQNAYDSNHCSVSSELASSQRTSLLPKDQECVNSCASSETSYGTNESSLNEKTLKGLTSIQNEKNVTGLDLLSSVDGGTSDEIQPLYMGRCSKPICDLISDMGNLVHATNSEEDIKKLLPDDFKSNADSLIGLDLSSVSDTPCVSSTDHDSDTVREQQNDISSELQNREIGGIKELGIKVDTTLSDSYNYSGTENLKDKKIFNQLESIVDFNMSSALTRQSSKMFHAKDKLQHKSQPCGLLKDVGLVKEEVDVAVITAAECLKEEGKTSALTCSLPKNEDLCLNDSNSRDENFKLPDFSFQEDKTVIKQSAQEDSKSLDLKDNDVIQDSSSALHVSSKDVPSSLSCLPASGSMCGSLIESKARGDFLPQHEHKDNIQDAVTIHEEIQNSVVLGGEPFKENDLLKQEKCKSILLQSLIEGMEDRKIDPDQTVIRAESLDGGDTSSTVVESQEGLSGTHVPESSDCCEGFINTFSSNDMDGQDLDYFNIDEGAKSGPLISDAELDAFLTEQYLQTTNIKSFEENVNDSKSQMNQIDMKGLDDGNINNIYFNAEAGAIGESHGINIICEIVDKQNTIENGLSLGEKSTIPVQQGLPTSKSEITNQLSVSDINSQSVGGARPKQLFSLPSRTRSSKDLNKPDVPDTIESEPSTADTVVPITCAIDSTADPQVSFNSNYIDIESNSEGGSSFVTANEDSVPENTCKEGLVLGQKQPTWVPDSEAPNCMNCQVKFTFTKRRHHCRACGKVFCGVCCNRKCKLQYLEKEARVCVVCYETISKAQAFERMMSPTGSNLKSNHSDECTTVQPPQENQTSSIPSPATLPVSALKQPGVEGLCSKEQKRVWFADGILPNGEVADTTKLSSGSKRCSEDFSPLSPDVPMTVNTVDHSHSTTVEKPNNETGDITRNEIIQSPISQVPSVEKLSMNTGNEGLPTSGSFTLDDDVFAETEEPSSPTGVLVNSNLPIASISDYRLLCDINKYVCNKISLLPNDEDSLPPLLVASGEKGSVPVVEEHPSHEQIILLLEGESFHPVTFVLNANLLVNVKFIFYSSDKYWYFSTNGLHGLGQAEIIILLLCLPNEDTIPKDIFRLFITIYKDALKGKYIENLDNITFTESFLSSKDHGGFLFITPTFQKLDDLSLPSNPFLCGILIQKLEIPWAKVFPMRLMLRLGAEYKAYPAPLTSIRGRKPLFGEIGHTIMNLLVDLRNYQYTLHNIDQLLIHMEMGKSCIKIPRKKYSDVMKVLNSSNEHVISIGASFSTEADSHLVCIQNDGIYETQANSATGHPRKVTGASFVVFNGALKTSSGFLAKSSIVEDGLMVQITPETMNGLRLALREQKDFKITCGKVDAVDLREYVDICWVDAEEKGNKGVISSVDGISLQGFPSEKIKLEADFETDEKIVKCTEVFYFLKDQDLSILSTSYQFAKEIAMACSAALCPHLKTLKSNGMNKIGLRVSIDTDMVEFQAGSEGQLLPQHYLNDLDSALIPVIHGGTSNSSLPLEIELVFFIIEHLF,1539,NP_001271165.2.csv,ZFY16_HUMAN_b01_clinical_seed_0_final,ZFY16_HUMAN_b01.a2m,EVE,ZFY16_HUMAN_b01_theta_0.2.npy,1,1539,1539
+NP_001273003.2,MSDTDSDEDSAGGGPFSLAGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTANEELTGTDGALVNDEGWVRSTEDAVDYSDINEVAEDESRRYQQTMGSLQPLCHSDYDEDDYDADCEDIDCKLMPPPPPPPGPMKKDKDQDSITGVSENGEGIILPSIIAPSSLASEKVDFSSSSDSESEMGPQEATQAESEDGKLTLPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRELIQEEQIQEVECSVESEVSQKSLWNYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDIDKVTDTKPRVAEWRYGPARLWYDMLGVPEDGSGFDYGFKLRKTEHEPVIKSRMIEEFRKLEENNGTDLLADENFLMVTQLHWEDDIIWDGEDVKHKGTKPQRASLAGWLPSSMTRNAMAYNVQQGFAATLDDDKPWYSIFPIDNEDLVYGRWEDNIIWDAQAMPRLLEPPVLTLDPNDENLILEIPDEKEEATSNSPSKESKKESSLKKSRILLGKTGVIKEEPQQNMSQPEVKDPWNLSNDEYYYPKQQGLRGTFGGNIIQHSIPAVELRQPFFPTHMGPIKLRQFHRPPLKKYSFGALSQPGPHSVQPLLKHIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGPLMMQVGMATKIKNYYKRKPGKDPGAPDCKYGETVYCHTSPFLGSLHPGQLLQAFENNLFRAPIYLHKMPETDFLIIRTRQGYYIRELVDIFVVGQQCPLFEVPGPNSKRANTHIRDFLQVFIYRLFWKSKDRPRRIRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMDSNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFAPEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKCLLEVTGVADPTGCGEGFSYVKIPNKPTQQKDDKEPQPVKKTVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMSTEQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAEDSDFEEMGKNIENMLQNKKTSSQLSREREEQERKELQRMLLAAGSAASGNNHRDDDTASVTSLNSSATGRCLKIYRTFRDEEGKEYVRCETVRKPAVIDAYVRIRTTKDEEFIRKFALFDEQHREEMRKERRRIQEQLRRLKRNQEKEKLKGPPEKKPKKMKERPDLKLKCGACGAIGHMRTNKFCPLYYQTNAPPSNPVAMTEEQEEELEKTVIHNDNEELIKVEGTKIVLGKQLIESADEVRRKSLVLKFPKQQLPPKKKRRVGTTVHCDYLNRPHKSIHRRRTDPMVTLSSILESIINDMRDLPNTYPFHTPVNAKVVKDYYKIITRPMDLQTLRENVRKRLYPSREEFREHLELIVKNSATYNGPKHSLTQISQSMLDLCDEKLKEKEDKLARLEKAINPLLDDDDQVAFSFILDNIVTQKMMAVPDSWPFHHPVNKKFVPDYYKVIVNPMDLETIRKNISKHKYQSRESFLDDVNLILANSVKYNGPESQYTKTAQEIVNVCYQTLTEYDEHLTQLEKDICTAKEAALEEAELESLDPMTPGPYTPQAKPPDLYDTNTSLSMSRDASVFQDESNMSVLDIPSATPEKQVTQEGEDGDGDLADEEEGTVQQPQASVLYEDLLMSEGEDDEEDAGSDEEGDNPFSAIQLSESGSDSDVGSGGIRPKQPRMLQENTRMDMENEESMMSYEGDGGEASHGLEDSNISYGSYEEPDPKSNTQDTSFSSIGGYEVSEEEEDEEEEEQRSGPSVLSQVHLSEDEEDSEDFHSIAGDSDLDSDE,1875,NP_001273003.2.csv,NP_001273003.2_colabfold_clinical_seed_0_final,NP_001273003.2_colabfold.a2m,colabfold,NP_001273003.2_colabfold_theta_0.2.npy,1,1875,1875
+NP_001273540.1,MEGLTLSDAEQKYYSDLFSYCDIESTKKVVVNGRVLELFRAAQLPNDVVLQIMELCGATRLGYFGRSQFYIALKLVAVAQSGFPLRVESINTVKDLPLPRFVASKNEQESRHAASYSSDSENQGSYSGVIPPPPGRGQVKKGSVSHDTVQPRTSADAQEPASPVVSPQQSPPTSPHTWRKHSRHPSGGNSERPLAGPGPFWSPFGEAQSGSSAGDAVWSGHSPPPPQENWVSFADTPPTSTLLTMHPASVQDQTTVRTVASATTAIEIRRQSSSYDDPWKITDEQRQYYVNQFKTIQPDLNGFIPGSAAKEFFTKSKLPILELSHIWELSDFDKDGALTLDEFCAAFHLVVARKNGYDLPEKLPESLMPKLIDLEDSADVGDQPGEVGYSGSPAEAPPSKSPSMPSLNQTWPELNQSSEQWETFSERSSSSQTLTQFDSNIAPADPDTAIVHPVPIRMTPSKIHMQEMELKRTGSDHTNPTSPLLVKPSDLLEENKINSSVKFASGNTVADGYSSSDSFTSDPEQIGSNVTRQRSHSGTSPDNTAPPPPPPRPQPSHSRSSSLDMNRTFTVTTGQQQAGVVAHPPAVPPRPQPSQAPGPAVHRPVDADGLITHTSTSPQQIPEQPNFADFSQFEVFAASNVNDEQDDEAEKHPEVLPAEKASDPASSLRVAKTDSKTEEKTAASAPANVSKGTTPLAPPPKPVRRRLKSEDELRPEVDEHTQKTGVLAAVLASQPSIPRSVGKDKKAIQASIRRNKETNTVLARLNSELQQQLKDVLEERISLEVQLEQLRPFSHL,796,NP_001273540.1.csv,refseq-REPS1-NM_001286611.1_clinical_seed_0_final,refseq-REPS1-NM_001286611.1.a2m,Invitae,refseq-REPS1-NM_001286611.1.npy,1,796,796
+NP_001273644.1,MRTGSAASSAAAAAAAAAASASPATGVCMKTPGGGRRGIRRDPGAEPGAAALRGPRQRPILSRLTNNSRIVGLLAQLEKINAEPSESDTARYVTSKILHLAQSQEKTRREMTAKGSTGMEILLSTLENTKDLQTTLNILSILVELVSAGGGRRVSFLVTKGGSQILLQLLMNASKESPPHEDLMVQIHSILAKIGPKDKKFGVKARINGALNITLNLVKQNLQNHRLVLPCLQLLRVYSANSVNSVSLGKNGVVELMFKIIGPFSKKNSSLIKVALDTLAALLKSKTNARRAVDRGYVQVLLTIYVDWHRHDNRHRNMLIRKGILQSLKSVTNIKLGRKAFIDANGMKILYNTSQECLAVRTLDPLVNTSSLIMRKCFPKNRLPLPTIKSSFHFQLPVIPVTGPVAQLYSLPPEVDDVVDESDDNDDIDVEAENETENEDDLDQNFKNDDIETDINKLKPQQEPGRTIEDLKMYEHLFPELVDDFQDYDLISKEPKPFVFEGKVRGPIVVPTAGEETSGNSGNLRKVVMKENISSKGDEGEKKSTFMDLAKEDIKDNDRTLQQQPGDQNRTISSVHGLNNDIVKALDRITLQNIPSQTAPGFTAEMKKDCSLPLTVLTCAKACPHMATCGNVLFEGRTVQLGKLCCTGVETEDDEDTESNSSVEQASVEVPDGPTLHDPDLYIEIVKNTKSVPEYSEVAYPDYFGHIPPPFKEPILERPYGVQRTKIAQDIERLIHQSDIIDRVVYDLDNPNYTIPEEGDILKFNSKFESGNLRKVIQIRKNEYDLILNSDINSNHYHQWFYFEVSGMRPGVAYRFNIINCEKSNSQFNYGMQPLMYSVQEALNARPWWIRMGTDICYYKNHFSRSSVAAGGQKGKSYYTITFTVNFPHKDDVCYFAYHYPYTYSTLQMHLQKLESAHNPQQIYFRKDVLCETLSGNSCPLVTITAMPESNYYEHICHFRNRPYVFLSARVHPGETNASWVMKGTLEYLMSNNPTAQSLRESYIFKIVPMLNPDGVINGNHRCSLSGEDLNRQWQSPSPDLHPTIYHAKGLLQYLAAVKRLPLVYCDYHGHSRKKNVFMYGCSIKETVWHTNDNATSCDVVEDTGYRTLPKILSHIAPAFCMSSCSFVVEKSKESTARVVVWREIGVQRSYTMESTLCGCDQGKYKGLQIGTRELEEMGAKFCVGLLRLKRLTSPLEYNLPSSLLDFENDLIESSCKVTSPTTYVLDEDEPRFLEEVDYSAESNDELDIELAENVGDYEPSAQEEVLSDSELSRTYLP,1278,NP_001273644.1.csv,refseq-AGTPBP1-NM_001286715.1_clinical_seed_0_final,refseq-AGTPBP1-NM_001286715.1.a2m,Invitae,refseq-AGTPBP1-NM_001286715.1.npy,1,1278,1278
+NP_001274420.1,MSQFQVPLAVQPDLPGLYDFPQRQVMVGSFPGSGLSMAGSESQLRGGGDGRKKRKRCGTCEPCRRLENCGACTSCTNRRTHQICKLRKCEVLKKKVGLLKEVEIKAGEGAGPWGQGAAVKTGSELSPVDGPVPGQMDSGPVYHGDSRQLSASGVPVNGAREPAGPSLLGTGGPWRVDQKPDWEAAPGPAHTARLEDAHDLVAFSAVAEAVSSYGALSTRLYETFNREMSREAGNNSRGPRPGPEGCSAGSEDLDTLQTALALARHGMKPPNCNCDGPECPDYLEWLEGKIKSVVMEGGEERPRLPGPLPPGEAGLPAPSTRPLLSSEVPQISPQEGLPLSQSALSIAKEKNISLQTAIAIEALTQLSSALPQPSHSTPQASCPLPEALSPPAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSVFKRPEALPTKPKVKVEAPSSSPAPAPSPVLQREAPTPSSEPDTHQKAQTALQQHLHHKRSLFLEQVHDTSFPAPSEPSAPGWWPPPSSPVPRLPDRPPKEKKKKLPTPAGGPVGTEKAAPGIKPSVRKPIQIKKSRPREAQPLFPPVRQIVLEGLRSPASQEVQAHPPAPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPIQDPENQQTCLPAPESPFATRSPKQIKIESSGAVTVLSTTCFHSEEGGQEATPTKAENPLTPTLSGFLESPLKYLDTPTKSLLDTPAKRAQAEFPTCDCVEQIVEKDEGPYYTHLGSGPTVASIRELMEERYGEKGKAIRIEKVIYTGKEGKSSRGCPIAKWVIRRHTLEEKLLCLVRHRAGHHCQNAVIVILILAWEGIPRSLGDTLYQELTDTLRKYGNPTSRRCGLNDDRTCACQGKDPNTCGASFSFGCSWSMYFNGCKYARSKTPRKFRLAGDNPKEEEVLRKSFQDLATEVAPLYKRLAPQAYQNQVTNEEIAIDCRLGLKEGRPFAGVTACMDFCAHAHKDQHNLYNGCTVVCTLTKEDNRCVGKIPEDEQLHVLPLYKMANTDEFGSEENQNAKVGSGAIQVLTAFPREVRRLPEPAKSCRQRQLEARKAAAEKKKIQKEKLSTPEKIKQEALELAGITSDPGLSLKGGLSQQGLKPSLKVEPQNHFSSFKYSGNAVVESYSVLGNCRPSDPYSMNSVYSYHSYYAQPSLTSVNGFHSKYALPSFSYYGFPSSNPVFPSQFLGPGAWGHSGSSGSFEKKPDLHALHNSLSPAYGGAEFAELPSQAVPTDAHHPTPHHQQPAYPGPKEYLLPKAPLLHSVSRDPSPFAQSSNCYNRSIKQEPVDPLTQAEPVPRDAGKMGKTPLSEVSQNGGPSHLWGQYSGGPSMSPKRTNGVGGSWGVFSSGESPAIVPDKLSSFGASCLAPSHFTDGQWGLFPGEGQQAASHSGGRLRGKPWSPCKFGNSTSALAGPSLTEKPWALGAGDFNSALKGSPGFQDKLWNPMKGEEGRIPAAGASQLDRAWQSFGLPLGSSEKLFGALKSEEKLWDPFSLEEGPAEEPPSKGAVKEEKGGGGAEEEEEELWSDSEHNFLDENIGGVAVAPAHGSILIECARRELHATTPLKKPNRCHPTRISLVFYQHKNLNQPNHGLALWEAKMKQLAERARARQEEAARLGLGQQEAKLYGKKRKWGGTVVAEPQQKEKKGVVPTRQALAVPTDSAVTVSSYAYTKVTGPYSRWI,1795,NP_001274420.1.csv,refseq-TET3-NM_001287491.1_clinical_seed_0_final,refseq-TET3-NM_001287491.1.a2m,Invitae,refseq-TET3-NM_001287491.1.npy,1,1795,1795
+NP_001275582.1,MAQILPIRFQEHLQLQNLGINPANIGFSTLTMESDKFICIREKVGEQAQVVIIDMNDPSNPIRRPISADSAIMNPASKVIALKGIKESGKTLQIFNIEMKSKMKAHTMTDDVTFWKWISLNTVALVTDNAVYHWSMEGESQPVKMFDRHSSLAGCQIINYRTDAKQKWLLLTGISAQQNRVVGAMQLYSVDRKVSQPIEGHAASFAQFKMEGNAEESTLFCFAVRGQAGGKLHIIEVGTPPTGNQPFPKKAVDVFFPPEAQNDFPVAMQISEKHDVVFLITKYGYIHLYDLETGTCIYMNRISGETIFVTAPHEATAGIIGVNRKGQVLSVCVEEENIIPYITNVLQNPDLALRMAVRNNLAGAEELFARKFNALFAQGNYSEAAKVAANAPKGILRTPDTIRRFQSVPAQPGQTSPLLQYFGILLDQGQLNKYESLELCRPVLQQGRKQLLEKWLKEDKLECSEELGDLVKSVDPTLALSVYLRANVPNKVIQCFAETGQVQKIVLYAKKVGYTPDWIFLLRNVMRISPDQGQQFAQMLVQDEEPLADITQIVDVFMEYNLIQQCTAFLLDALKNNRPSEGPLQTRLLEMNLMHAPQVADAILGNQMFTHYDRAHIAQLCEKAGLLQRALEHFTDLYDIKRAVVHTHLLNPEWLVNYFGSLSVEDSLECLRAMLSANIRQNLQICVQVASKYHEQLSTQSLIELFESFKSFEGLFYFLGSIVNFSQDPDVHFKYIQAACKTGQIKEVERICRESNCYDPERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQKYIEIYVQKVNPSRLPVVIGGLLDVDCSEDVIKNLILVVRGQFSTDELVAEVEKRNRLKLLLPWLEARIHEGCEEPATHNALAKIYIDSNNNPERFLRENPYYDSRVVGKYCEKRDPHLACVAYERGQCDLELINVCNENSLFKSLSRYLVRRKDPELWGSVLLESNPYRRPLIDQVVQTALSETQDPEEVSVTVKAFMTADLPNELIELLEKIVLDNSVFSEHRNLQNLLILTAIKADRTRVMEYINRLDNYDAPDIANIAISNELFEEAFAIFRKFDVNTSAVQVLIEHIGNLDRAYEFAERCNEPAVWSQLAKAQLQKGMVKEAIDSYIKADDPSSYMEVVQAANTSGNWEELVKYLQMARKKARESYVETELIFALAKTNRLAELEEFINGPNNAHIQQVGDRCYDEKMYDAAKLLYNNVSNFGRLASTLVHLGEYQAAVDGARKANSTRTWKEVCFACVDGKEFRLAQMCGLHIVVHADELEELINYYQDRGYFEELITMLEAALGLERAHMGMFTELAILYSKFKPQKMREHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAIITMMNHPTDAWKEGQFKDIITKVANVELYYRAIQFYLEFKPLLLNDLLMVLSPRLDHTRAVNYFSKVKQLPLVKPYLRSVQNHNNKSVNESLNNLFITEEDYQALRTSIDAYDNFDNISLAQRLEKHELIEFRRIAAYLFKGNNRWKQSVELCKKDSLYKDAMQYASESKDTELAEELLQWFLQEEKRECFGACLFTCYDLLRPDVVLETAWRHNIMDFAMPYFIQVMKEYLTKVDKLDASESLRKEEEQATETQPIVYGQPQLMLTAGPSVAVPPQAPFGYGYTAPPYGQPQPGFGYSM,1679,NP_001275582.1.csv,refseq-CLTC-NM_001288653.1_clinical_seed_0_final,refseq-CLTC-NM_001288653.1.a2m,Invitae,refseq-CLTC-NM_001288653.1.npy,1,1679,1679
+NP_001275676.1,MADAEVIILPKKHKKKKERKSLPEEDVAEIQHAEEFLIKPESKVAKLDTSQWPLLLKNFDKLNVRTTHYTPLACGSNPLKREIGDYIRTGFINLDKPSNPSSHEVVAWIRRILRVEKTGHSGTLDPKVTGCLIVCIERATRLVKSQQSAGKEYVGIVRLHNAIEGGTQLSRALETLTGALFQRPPLIAAVKRQLRVRTIYESKMIEYDPERRLGIFWVSCEAGTYIRTLCVHLGLLLGVGGQMQELRRVRSGVMSEKDHMVTMHDVLDAQWLYDNHKDESYLRRVVYPLEKLLTSHKRLVMKDSAVNAICYGAKIMLPGVLRYEDGIEVNQEIVVITTKGEAICMAIALMTTAVISTCDHGIVAKIKRVIMERDTYPRKWGLGPKASQKKLMIKQGLLDKHGKPTDSTPATWKQEYVDYR,420,NP_001275676.1.csv,refseq-DKC1-NM_001288747.2_clinical_seed_0_final,refseq-DKC1-NM_001288747.2.a2m,Invitae,refseq-DKC1-NM_001288747.2_theta_0.2.npy,1,420,420
+NP_001275880.1,MAAKGAHGSYLKVESELERCRAEGHWDRMPELVRQLQTLSMPGGGGNRRGSPSAAFTFPDTDDFGKLLLAEALLEQCLKENHAKIKDSMPLLEKNEPKMSEAKNYLSSILNHGRLSPQYMCEAMLILGKLHYVEGSYRDAISMYARAGIDDMSMENKPLYQMRLLSEAFVIKGLSLERLPNSIASRFRLTEREEEVITCFERASWIAQVFLQELEKTTNNSTSRHLKGCHPLDYELTYFLEAALQSAYVKNLKKGNIVKGMRELREVLRTVETKATQNFKVMAAKHLAGVLLHSLSEECYWSPLSHPLPEFMGKEESSFATQALRKPHLYEGDNLYCPKDNIEEALLLLLISESMATRDVVLSRVPEQEEDRTVSLQNAAAIYDLLSITLGRRGQYVMLSECLERAMKFAFGEFHLWYQVALSMVACGKSAYAVSLLRECVKLRPSDPTVPLMAAKVCIGSLRWLEEAEHFAMMVISLGEEAGEFLPKGYLALGLTYSLQATDATLKSKQDELHRKALQTLERAQQLAPSDPQVILYVSLQLALVRQISSAMEQLQEALKVRKDDAHALHLLALLFSAQKHHQHALDVVNMAITEHPENFNLMFTKVKLEQVLKGPEEALVTCRQVLRLWQTLYSFSQLGDFRSPEGFQTPQRNICNSEIYRGGGLEKDGSFGEGLTMKKQSGMHLTLPDAHDADSGSRRASSIAASRLEEAMSELTMPSSVLKQGPMQLWTTLEQIWLQAAELFMEQQHLKEAGFCIQEAAGLFPTSHSVLYMRGRLAEVKGNLEEAKQLYKEALTVNPDGVRIMHSLGLMLSRLGHKSLAQKVLRDAVERQSTCHEAWQGLGEVLQAQGQNEAAVDCFLTALELEASSPVLPFSIIPREL,882,NP_001275880.1.csv,NP_001275880.1_colabfold_clinical_seed_0_final,NP_001275880.1_colabfold.a2m,colabfold,NP_001275880.1_colabfold_theta_0.2.npy,1,882,882
+NP_001275929.1,MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLLALLVLASAGVLLWYFLGYKAEVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTRLGTYYNSSSVYSFGEGPLTCFFWFILQIPEHRRLMLSPEVVQALLVEELLSTVNSSAAVPYRAEYEVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLMLKLRLEWTLAECRDRLAMYDVAGPLEKRLITSVYGCSRQEPVVEVLASGAIMAVVWKKGLHSYYDPFVLSVQPVVFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGLALWFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPVVATAGITINFTSQISLTGPGVRVHYGLYNQSDPCPGEFLCSVNGLCVPACDGVKDCPNGLDERNCVCRATFQCKEDSTCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCDCGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVLWTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPVVRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGALRADAVALFYGWRNQGSETCCCPISNALQKVDVQLIPQDLCSEVYRYQVTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQVVT,824,NP_001275929.1.csv,NP_001275929.1_colabfold_clinical_seed_0_final,NP_001275929.1_colabfold.a2m,colabfold,NP_001275929.1_colabfold_theta_0.2.npy,1,824,824
+NP_001278.1,MANVSKKVSWSGRDRDDEEAAPLLRRTARPGGGTPLLNGAGPGAARQSPRSALFRVGHMSSVELDDELLDPDMDPPHPFPKEIPHNEKLLSLKYESLDYDNSENQLFLEEERRINHTAFRTVEIKRWVICALIGILTGLVACFIDIVVENLAGLKYRVIKGNIDKFTEKGGLSFSLLLWATLNAAFVLVGSVIVAFIEPVAAGSGIPQIKCFLNGVKIPHVVRLKTLVIKVSGVILSVVGGLAVGKEGPMIHSGSVIAAGISQGRSTSLKRDFKIFEYFRRDTEKRDFVSAGAAAGVSAAFGAPVGGVLFSLEEGASFWNQFLTWRIFFASMISTFTLNFVLSIYHGNMWDLSSPGLINFGRFDSEKMAYTIHEIPVFIAMGVVGGVLGAVFNALNYWLTMFRIRYIHRPCLQVIEAVLVAAVTATVAFVLIYSSRDCQPLQGGSMSYPLQLFCADGEYNSMAAAFFNTPEKSVVSLFHDPPGSYNPLTLGLFTLVYFFLACWTYGLTVSAGVFIPSLLIGAAWGRLFGISLSYLTGAAIWADPGKYALMGAAAQLGGIVRMTLSLTVIMMEATSNVTYGFPIMLVLMTAKIVGDVFIEGLYDMHIQLQSVPFLHWEAPVTSHSLTAREVMSTPVTCLRRREKVGVIVDVLSDTASNHNGFPVVEHADDTQPARLQGLILRSQLIVLLKHKVFVERSNLGLVQRRLRLKDFRDAYPRFPPIQSIHVSQDERECTMDLSEFMNPSPYTVPQEASLPRVFKLFRALGLRHLVVVDNRNQVVGLVTRKDLARYRLGKRGLEELSLAQT,805,NP_001278.1.csv,refseq-CLCN7-NM_001287.5_clinical_seed_0_final,refseq-CLCN7-NM_001287.5.a2m,Invitae,refseq-CLCN7-NM_001287.5.npy,1,805,805
+NP_001278585.2,MGDILAHESELLGLVKEYLDFAEFEDTLKTFSKECKIKGKPLCKTVGGSFRDSKSLTIQKDLVAAFDNGDQKVFFDLWEEHISSSIRDGDSFAQKLEFYLHIHFAIYLLKYSVGRPDKEELDEKISYFKTYLETKGAALSQTTEFLPFYALPFVPNPMVHPSFKELFQDSWTPELKLKLIKFLALISKASNTPKLLTIYKENGQSNKEILQQLHQQLVEAERRSVTYLKRYNKIQADYHNLIGVTAELVDSLEATVSGKMITPEYLQSVCVRLFSNQMRQSLAHSVDFTRPGTASTMLRASLAPVKLKDVPLLPSLDYEKLKKDLILGSDRLKAFLLQALRWRLTTSHPGEQRETVLQAYISNDLLDCYSHNQRSVLQLLHSTSDVVRQYMARLINAFASLAEGRLYLAQNTKVLQMLEGRLKEEDKDIITRENVLGALQKFSLRRPLQTAMIQDGLIFWLVDVLKDPDCLSDYTLEYSVALLMNLCLRSTGKNMCAKVAGLVLKVLSDLLGHENHEIQPYVNGALYSILSVPSIREEARAMGMEDILRCFIKEGNAEMIRQIEFIIKQLNSEELPDGVLESDDDEDEDDEEDHDIMEADLDKDELIQPQLGELSGEKLLTTEYLGIMTNTGKTRRKGLANVQWSGDEPLQRPVTPGGHRNGYPV,665,NP_001278585.2.csv,refseq-ARMC9-NM_001291656.2_clinical_seed_0_final,refseq-ARMC9-NM_001291656.2.a2m,Invitae,refseq-ARMC9-NM_001291656.2_theta_0.2.npy,1,665,665
+NP_001278650.1,MTTHVTLEDALSNVDLLEELPLPDQQPCIEPPPSSIMYQANFDTNFEDRNAFVTGIARYIEQATVHSSMVKCNEQPNRVEIYEKTVEVLEPEVTKLMKFMYFQRKAIERFCSEVKRLCHAERRKDFVSEAYLLTLGKFINMFAVLDELKNMKCSVKNDHSAYKRAAQFLRKMADPQSIQESQNLSMFLANHNRITQCLHQQLEVIPGYEELLADIVNICVDYYENKMYLTPSEKHMLLKVMGFGLYLMDGNVSNIYKLDAKKRINLSKIDKFFKQLQVVPLFGDMQIELARYIKTSAHYEENKSKWTCTQSSISPQYNICEQMVQIRDDHIRFISELARYSNSEVVTGSGLDSQKSDEEYRELFDLALRGLQLLSKWSAHVMEVYSWKLVHPTDKFCNKDCPGTAEEYERATRYNYTSEEKFAFVEVIAMIKGLQVLMGRMESVFNQAIRNTIYAALQDFAQVTLREPLRQAVRKKKNVLISVLQAIRKTICDWEGGREPPNDPCLRGEKDPKGGFDIKVPRRAVGPSSTQLYMVRTMLESLIADKSGSKKTLRSSLDGPIVLAIEDFHKQSFFFTHLLNISEALQQCCDLSQLWFREFFLELTMGRRIQFPIEMSMPWILTDHILETKEPSMMEYVLYPLDLYNDSAYYALTKFKKQFLYDEIEAEVNLCFDQFVYKLADQIFAYYKAMAGSVLLDKRFRAECKNYGVIIPYPPSNRYETLLKQRHVQLLGRSIDLNRLITQRISAAMYKSLDQAISRFESEDLTSIVELEWLLEINRLTHRLLCKHMTLDSFDAMFREANHNVSAPYGRITLHVFWELNFDFLPNYCYNGSTNRFVRTAIPFTQEPQRDKPANVQPYYLYGSKPLNIAYSHIYSSYRNFVGPPHFKTICRLLGYQGIAVVMEELLKIVKSLLQGTILQYVKTLIEVMPKICRLPRHEYGSPGILEFFHHQLKDIIEYAELKTDVFQSLREVGNAILFCLLIEQALSQEEVCDLLHAAPFQNILPRVYIKEGERLEVRMKRLEAKYAPLHLVPLIERLGTPQQIAIAREGDLLTKERLCCGLSMFEVILTRIRSYLQDPIWRGPPPTNGVMHVDECVEFHRLWSAMQFVYCIPVGTNEFTAEQCFGDGLNWAGCSIIVLLGQQRRFDLFDFCYHLLKVQRQDGKDEIIKNVPLKKMADRIRKYQILNNEVFAILNKYMKSVETDSSTVEHVRCFQPPIHQSLATTC,1227,NP_001278650.1.csv,refseq-CYFIP2-NM_001291721.2_clinical_seed_0_final,refseq-CYFIP2-NM_001291721.2.a2m,Invitae,refseq-CYFIP2-NM_001291721.2.npy,1,1227,1227
+NP_001278933.1,MAFNKFNVLHWHIVDDQSFPYQSITFPELSNKGSYSLSHVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTPCYSRQNKLDSFGPINPTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMRQKGFGTDFKKLESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSAYPEELSRVTASGFPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGEYVDATNLTPRLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYCNHENM,331,NP_001278933.1.csv,refseq-HEXB-NM_001292004.1_clinical_seed_0_final,refseq-HEXB-NM_001292004.1.a2m,Invitae,refseq-HEXB-NM_001292004.1.npy,1,331,331
+NP_001284478.1,MSVEAYGPSSQTLTFLDTEEAELLGADTQGSEFEFTDFTLPSQTQTPPGGPGGPGGGGAGGPGGAGAGAAAGQLDAQVGPEGILQNGAVDDSVAKTSQLLAELNFEEDEEDTYYTKDLPIHACSYCGIHDPACVVYCNTSKKWFCNGRGNTSGSHIVNHLVRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLLGFIPAKADSVVVLLCRQPCASQSSLKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRARQITAQQINKLEELWKENPSATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEADYDKKLKESQTQDNITVRWDLGLNKKRIAYFTLPKTDSGNEDLVIIWLRDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVWKSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVLQRPLSLIQGPPGTGKTVTSATIVYHLARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSREAIDSPVSFLALHNQIRNMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVGAGDPRLAKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQNGVTAADRVKKGFDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFSKPRKLVNTINPGARFMTTAMYDAREAIIPGSVYDRSSQGRPSSMYFQTHDQIGMISAGPSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGPSQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFKSQIDVALSQDSTYQGERAYQHGGVTGLSQY,1129,NP_001284478.1.csv,refseq-UPF1-NM_001297549.1_clinical_seed_0_final,refseq-UPF1-NM_001297549.1.a2m,Invitae,refseq-UPF1-NM_001297549.1.npy,1,1129,1129
+NP_001287771.1,MRLLERMRKDWFMVGIVLAIAGAKLEPSIGVNGGPLKPEITVSYIAVATIFFNSGLSLKTEELTSALVHLKLHLFIQIFTLAFFPATIWLFLQLLSITPINEWLLKGLQTVGCMPPPVSSAVILTKAVGGNEAAAIFNSAFGSFLGIVITPLLLLLFLGSSSSVPFTSIFSQLFMTVVVPLIIGQIVRRYIKDWLERKKPPFGAISSSVLLMIIYTTFCDTFSNPNIDLDKFSLVLILFIIFSIQLSFMLLTFIFSTRNNSGFTPADTVAIIFCSTHKSLTLGIPMLKIVFAGHEHLSLISVPLLIYHPAQILLGSVLVPTIKSWMVSRQKKLLQTRGPLANLNNPEGLEYLSIKFGH,358,NP_001287771.1.csv,refseq-SLC10A7-NM_001300842.2_clinical_seed_0_final,refseq-SLC10A7-NM_001300842.2.a2m,Invitae,refseq-SLC10A7-NM_001300842.2.npy,1,358,358
+NP_001287829.1,MARKKVRPRLIAELARRVRALREQLNRPRDSQLYAVDYETLTRPFSGRRLPVRAWADVRRESRLLQLLGRLPLFGLGRLVTRKSWLWQHDEPCYWRLTRVRPDYTAQNLDHGKAWGILTFKDASFSSSGKTESEAREIEHVMYHDWRLVPKHEEEAFTAFTPAPEDSLASVPYPPLLRAMIIAERQKNGDTSTEEPMLNVQRIRMEPWDYPAKQEDKGRAKGTPV,225,NP_001287829.1.csv,refseq-MRPS34-NM_001300900.1_clinical_seed_0_final,refseq-MRPS34-NM_001300900.1.a2m,Invitae,refseq-MRPS34-NM_001300900.1.npy,1,225,225
+NP_001288.3,MLGWVQRVLPQPPGTPRKTKMQEEEEVEPEPEMEAEVEPEPNPEEAETESESMPPEESFKEEEVAVADPSPQETKEAALTSTISLRAQGAEISEMNSPSRRVLTWLMKGVEKVIPQPVHSITEDPAQILGHGSTGDTGCTDEPNEALEAQDTRPGLRLLLWLEQNLERVLPQPPKSSEVWRDEPAVATGAASDPAPPGRPQEMGPKLQARETPSLPTPIPLQPKEEPKEAPAPEPQPGSQAQTSSLPPTRDPARLVAWVLHRLEMALPQPVLHGKIGEQEPDSPGICDVQTISILPGGQVEPDLVLEEVEPPWEDAHQDVSTSPQGTEVVPAYEEENKAVEKMPRELSRIEEEKEDEEEEEEEEEEEEEEEVTEVLLDSCVVSQVGVGQSEEDGTRPQSTSDQKLWEEVGEEAKKEAEEKAKEEAEEVAEEEAEKEPQDWAETKEEPEAEAEAASSGVPATKQHPEVQVEDTDADSCPLMAEENPPSTVLPPPSPAKSDTLIVPSSASGTHRKKLPSEDDEAEELKALSPAESPVVAWSDPTTPKDTDGQDRAASTASTNSAIINDRLQELVKLFKERTEKVKEKLIDPDVTSDEESPKPSPAKKAPEPAPDTKPAEAEPVEEEHYCDMLCCKFKHRPWKKYQFPQSIDPLTNLMYVLWLFFVVMAWNWNCWLIPVRWAFPYQTPDNIHHWLLMDYLCDLIYFLDITVFQTRLQFVRGGDIITDKKDMRNNYLKSRRFKMDLLSLLPLDFLYLKVGVNPLLRLPRCLKYMAFFEFNSRLESILSKAYVYRVIRTTAYLLYSLHLNSCLYYWASAYQGLGSTHWVYDGVGNSYIRCYYFAVKTLITIGGLPDPKTLFEIVFQLLNYFTGVFAFSVMIGQMRDVVGAATAGQTYYRSCMDSTVKYMNFYKIPKSVQNRVKTWYEYTWHSQGMLDESELMVQLPDKMRLDLAIDVNYNIVSKVALFQGCDRQMIFDMLKRLRSVVYLPNDYVCKKGEIGREMYIIQAGQVQVLGGPDGKSVLVTLKAGSVFGEISLLAVGGGNRRTANVVAHGFTNLFILDKKDLNEILVHYPESQKLLRKKARRMLRSNNKPKEEKSVLILPPRAGTPKLFNAALAMTGKMGGKGAKGGKLAHLRARLKELAALEAAAKQQELVEQAKSSQDVKGEEGSAAPDQHTHPKEAATDPPAPRTPPEPPGSPPSSPPPASLGRPEGEEEGPAEPEEHSVRICMSPGPEPGEQILSVKMPEEREEKAE,1251,NP_001288.3.csv,refseq-CNGB1-NM_001297.4_clinical_seed_0_final,refseq-CNGB1-NM_001297.4.a2m,Invitae,refseq-CNGB1-NM_001297.4.npy,1,1251,1251
+NP_001288153.1,MGNKQTIFTEEQLDNYQDCTFFNKKDILKWGNRGSEGFGHSSRVTQAVHSLAPGPSCLSCTHSDCCVCFLQENPFKERIVAAFSEDGEGNLTFNDFVDMFSVLCESAPRELKANYAFKIYDFNTDNFICKEDLELTLARLTKSELDEEEVVLVCDKVIEEADLDGDGKLGFADFEDMIAKAPDFLSTFHIRI,192,NP_001288153.1.csv,refseq-CIB2-NM_001301224.2_clinical_seed_0_final,refseq-CIB2-NM_001301224.2.a2m,Invitae,refseq-CIB2-NM_001301224.2_theta_0.2.npy,1,192,192
+NP_001289.1,MAKINTQYSHPSRTHLKVKTSDRDLNRAENGLSRAHSSSEETSSVLQPGIAMETRGLADSGQGSFTGQGIARLSRLIFLLRRWAARHVHHQDQGPDSFPDRFRGAELKEVSSQESNAQANVGSQEPADRGRSAWPLAKCNTNTSNNTEEEKKTKKKDAIVVDPSSNLYYRWLTAIALPVFYNWYLLICRACFDELQSEYLMLWLVLDYSADVLYVLDVLVRARTGFLEQGLMVSDTNRLWQHYKTTTQFKLDVLSLVPTDLAYLKVGTNYPEVRFNRLLKFSRLFEFFDRTETRTNYPNMFRIGNLVLYILIIIHWNACIYFAISKFIGFGTDSWVYPNISIPEHGRLSRKYIYSLYWSTLTLTTIGETPPPVKDEEYLFVVVDFLVGVLIFATIVGNVGSMISNMNASRAEFQAKIDSIKQYMQFRKVTKDLETRVIRWFDYLWANKKTVDEKEVLKSLPDKLKAEIAINVHLDTLKKVRIFQDCEAGLLVELVLKLRPTVFSPGDYICKKGDIGKEMYIINEGKLAVVADDGVTQFVVLSDGSYFGEISILNIKGSKSGNRRTANIRSIGYSDLFCLSKDDLMEALTEYPEAKKALEEKGRQILMKDNLIDEELARAGADPKDLEEKVEQLGSSLDTLQTRFARLLAEYNATQMKMKQRLSQLESQVKGGGDKPLADGEVPGDATKTEDKQQ,694,NP_001289.1.csv,refseq-CNGA3-NM_001298.2_clinical_seed_0_final,refseq-CNGA3-NM_001298.2.a2m,Invitae,refseq-CNGA3-NM_001298.2.npy,1,694,694
+NP_001290039.1,MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVIGFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGIPGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPPGVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGSPGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGPKGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPGPPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPGPQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPLLFQIHK,519,NP_001290039.1.csv,refseq-COL4A1-NM_001303110.2_clinical_seed_0_final,refseq-COL4A1-NM_001303110.2.a2m,Invitae,refseq-COL4A1-NM_001303110.2.npy,1,519,519
+NP_001290185.1,MAFTNYSSLNRAQLTFEYLHTNSTTHEFLFGALAELVDNARDADATRIDIYAERREDLRGGFMLCFLDDGAGMDPSDAASVIQFGKSAKRTPESTQIGQYGNGLKSGSMRIGKDFILFTKKEDTMTCLFLSRTFHEEEGIDEVIVPLPTWNARTREPVTDNVEKFAIETELIYKYSPFRTEEEVMTQFMKIPGDSGTLVIIFNLKLMDNGEPELDIISNPRDIQMAETSPEGTKPERRSFRAYAAVLYIDPRMRIFIHGHKVQTKRLSCCLYKPRMYKYTSSRFKTRAEQEVKKAEHVARIAEEKAREAESKARTLEVRLGGDLTRDSRVMLRQVQNRAITLRREADVKKRIKEAKQRALKEPKELNFVFGVNIEHRDLDGMFIYNCSRLIKMYEKVGPQLEGGMACGGVVGVVDVPYLVLEPTHNKQDFADAKEYRHLLRAMGEHLAQYWKDIAIAQRGIIKFWDEFGYLSANWNQPPSSELRYKRRRAMEIPTTIQCDLCLKWRTLPFQLSSVEKDYPDTWVCSMNPDPEQDRCEASEQKQKVPLGTFRKDMKTQEEKQKQLTEKIRQQQEKLEALQKTTPIRSQADLKKLPLEVTTRPSTEEPVRRPQRPRSPPLPAVIRNAPSRPPSLPTPRPASQPRKAPVISSTPKLPALAAREEASTSRLLQPPEAPRKPANTLVKTASRPAPLVQQLSPSLLPNSKSPREVPSPKVIKTPVVKKTESPIKLSPATPSRKRSVAVSDEEEVEEEAERRKERCKRGRFVVKEEKKDSNELSDSAGEEDSADLKRAQKDKGLHVEVRVNREWYTGRVTAVEVGKHVVRWKVKFDYVPTDTTPRDRWVEKGSEDVRLMKPPSPEHQSLDTQQEGGEEEVGPVAQQAIAVAEPSTSECLRIEPDTTALSTNHETIDLLVQILRNCLRYFLPPSFPISKKQLSAMNSDELISFPLKEYFKQYEVGLQNLCNSYQSRADSRAKASEESLRTSERKLRETEEKLQKLRTNIVALLQKVQEDIDINTDDELDAYIEDLITKGD,1032,NP_001290185.1.csv,refseq-MORC2-NM_001303256.2_clinical_seed_0_final,refseq-MORC2-NM_001303256.2.a2m,Invitae,refseq-MORC2-NM_001303256.2.npy,1,1032,1032
+NP_001290425.1,MSKQVSLPEMIKDWTKEHVKKWVNEDLKINEQYGQILLSEEVTGLVLQELTEKDLVEMGLPWGPALLIKRSYNKLNSKSPESDNHDPGQLDNSKPSKTEHQKNPKHTKKEEENSMSSNIDYDPREIRDIKQEESILMKENVLDEVANAKHKKKGKLKPEQLTCMPYPFDQFHDSHRYIEHYTLQPETGALNLIDPIHEFKALTNTETATEVDIKMKFSNEVFRFASACMNSRTNGTIHFGVKDKPHGEIVGVKITSKAAFIDHFNVMIKKYFEESEINEAKKCIREPRFVEVLLQNNTPSDRFVIEVDTIPKHSICNDKYFYIQMQICKDKIWKQNQNLSLFVREGASSRDILANSKQRDVDFKAFLQNLKSLVASRKEAEEEYGMKAMKKESEGLKLVKLLIGNRDSLDNSYYDWYILVTNKCHPNQIKHLDFLKEIKWFAVLEFDPESMINGVVKAYKESRVANLHFPNQYEDKTTNMWEKISTLNLYQQPSWIFCNGRSDLKSETYKPLEPHLWQRERASEVRKLILFLTDENIMTRGKFLVVFLLLSSVESPGDPLIETFWAFYQALKGMENMLCISVNSHIYQRWKDLLQTRMKMEDELTNHSISTLNIELVNSTILKLKSVTRSSRRFLPARGSSSVILEKKKEDVLTALEILCENECTETDIEKDKSKFLEFKKSKEEHFYRGGKVSWWNFYFSSENYSSDFVKRDSYEKLKDLIHCWAESPKPIFAKIINLYHHPGCGGTTLAMHVLWDLKKNFRCAVLKNKTTDFAEIAEQVINLVTYRAKSHQDYIPVLLLVDDFEEQENVYFLQNAIHSVLAEKDLRYEKTLVIILNCMRSRNPDESAKLADSIALNYQLSSKEQRAFGAKLKEIEKQHKNCENFYSFMIMKSNFDETYIENVVRNILKGQDVDSKEAQLISFLALLSSYVTDSTISVSQCEIFLGIIYTSTPWEPESLEDKMGTYSTLLIKTEVAEYGRYTGVRIIHPLIALYCLKELERSYHLDKCQIALNILEENLFYDSGIGRDKFQHDVQTLLLTRQRKVYGDETDTLFSPLMEALQNKDIEKVLSAGSRRFPQNAFICQALARHFYIKEKDFNTALDWARQAKMKAPKNSYISDTLGQVYKSEIKWWLDGNKNCRSITVNDLTHLLEAAEKASRAFKESQRQTDSKNYETENWSPQKSQRRYDMYNTACFLGEIEVGLYTIQILQLTPFFHKENELSKKHMVQFLSGKWTIPPDPRNECYLALSKFTSHLKNLQSDLKRCFDFFIDYMVLLKMRYTQKEIAEIMLSKKVSRCFRKYTELFCHLDPCLLQSKESQLLQEENCRKKLEALRADRFAGLLEYLNPNYKDATTMESIVNEYAFLLQQNSKKPMTNEKQNSILANIILSCLKPNSKLIQPLTTLKKQLREVLQFVGLSHQYPGPYFLACLLFWPENQELDQDSKLIEKYVSSLNRSFRGQYKRMCRSKQASTLFYLGKRKGLNSIVHKAKIEQYFDKAQNTNSLWHSGDVWKKNEVKDLLRRLTGQAEGKLISVEYGTEEKIKIPVISVYSGPLRSGRNIERVSFYLGFSIEGPLAYDIEVI,1584,NP_001290425.1.csv,refseq-SAMD9L-NM_001303496.1_clinical_seed_0_final,refseq-SAMD9L-NM_001303496.1.a2m,Invitae,refseq-SAMD9L-NM_001303496.1.npy,1,1584,1584
+NP_001291646.4,LERGGEAAAAAAAAAAAPGRGSESPVTISRAGNAGELVSPLLLPPTRRRRRRHIQGPGPVLNLPSAAAAPPVARAPEAAGGGSRSEDYSSSPHSAAAAARPLAAEEKQAQSLQPSSSRRSSHYPAAVQSQAAAERGASATAKSRAISILQKKPRHQQLLPSLSSFFFSHRLPDMTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV,576,NP_001291646.4.csv,NP_001291646.4_colabfold_clinical_seed_0_final,NP_001291646.4_colabfold.a2m,colabfold,NP_001291646.4_colabfold_theta_0.2.npy,1,576,576
+NP_001293061.2,MLRGPGPGLLLLAVQCLGTAVPSTGASKSKRQAQQMVQPQSPVAVSQSKPGCYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESKPEAEETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCKPIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSYRIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVTDSGVVYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAVTQTYGGNSNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRIGDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQPLQTYPSSSGPVEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIPGHLNSYTIKGLKPGVVYEGQLISIQQYGHQEVTRFDFTTTSTSTPVTSNTVTGETTPFSPLVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISEDGEQSLILSTSQTTAPDAPPDTTVDQVDDTSIVVRWSRPQAPITGYRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVVIQQETTGTPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPISRNTFAEVTGLSPGVTYYFKVFAVSHGRESKPLTAQQTTKLDAPTNLQFVNETDSTVLVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVSKYPLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVVSGLTPGVEYVYTIQVLRDGQERDAPIVNKVVTPLSPPTNLHLEANPDTGVLTVSWERSTTPDITGYRITTTPTNGQQGNSLEEVVHADQSSCTFDNLSPGLEYNVSVYTVKDDKESVPISDTIIPAVPPPTDLRFTNIGPDTMRVTWAPPPSIDLTNFLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHESTPLRGRQKTGLDSPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHSRNSITLTNLTPGTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAATPTSLLISWDAPAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTGRGDSPASSKPISINYRTEIDKPSQMQVTDVQDNSISVKWLPSSSPVTGYRVTTTPKNGPGPTKTKTAGPDQTEMTIEGLQPTVEYVVSVYAQNPSGESQPLVQTAVTTIPAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETTITISWRTKTETITGFQVDAVPANGQTPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDASTAIDAPSNLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREVVPRPRPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTVQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRRTTPPTTATPIRHRPRPYPPNVGQEALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVPGTSTSATLTGLTRGATYNVIVEALKDQQRHKVREEVVTVGNSVNEGLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCDSSRWCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPIECFMPLDVQADREDSRE,2240,NP_001293061.2.csv,refseq-FN1-NM_001306132.2_clinical_seed_0_final,refseq-FN1-NM_001306132.2.a2m,Invitae,refseq-FN1-NM_001306132.2.npy,1,2240,2240
+NP_001294.2,MAASPHTLSSRLLTGCVGGSVWYLERRTIQDSPHKFLHLLRNVNKQWITFQHFSFLKRMYVTQLNRSHNQQVRPKPEPVASPFLEKTSSGQAKAEIYEMRPLSPPSLSLSRKPNEKELIELEPDSVIEDSIDVGKETKEEKRWKEMKLQVYDLPGILARLSKIKLTALVVSTTAAGFALAPGPFDWPCFLLTSVGTGLASCAANSINQFFEVPFDSNMNRTKNRPLVRGQISPLLAVSFATCCAVPGVAILTLGVNPLTGALGLFNIFLYTCCYTPLKRISIANTWVGAVVGAIPPVMGWTAATGSLDAGAFLLGGILYSWQFPHFNALSWGLREDYSRGGYCMMSVTHPGLCRRVALRHCLALLVLSAAAPVLDITTWTFPIMALPINAYISYLGFRFYVDADRRSSRRLFFCSLWHLPLLLLLMLTCKRPSGGGDAGPPPS,443,NP_001294.2.csv,refseq-COX10-NM_001303.3_clinical_seed_0_final,refseq-COX10-NM_001303.3.a2m,Invitae,refseq-COX10-NM_001303.3.npy,1,443,443
+NP_001295205.1,MESERSKRMGNACIPLKRIAYFLCLLSALLLTEGKKPAKPKCPAVCTCTKDNALCENARSIPRTVPPDVISLSFVRSGFTEISEGSFLFTPSLQLLSLANNNLQTLPKDIFKGLDSLTNVDLRGNSFNCDCKLKWLVEWLGHTNATVEDIYCEGPPEYKKRKINSLSSKDFDCIITEFAKSQDLPYQSLSIDTFSYLNDEYVVIAQPFTGKCIFLEWDHVEKTFRNYDNITGTSTVVCKPIVIETQLYVIVAQLFGGSHIYKRDSFANKFIKIQDIEILKIRKPNDIETFKIENNWYFVVADSSKAGFTTIYKWNGNGFYSHQSLHAWYRDTDVEYLEIVRTPQTLRTPHLILSSSSQRPVIYQWNKATQLFTNQTDIPNMEDVYAVKHFSVKGDVYICLTRFIGDSKVMKWGGSSFQDIQRMPSRGSMVFQPLQINNYQYAILGSDYSFTQVYNWDAEKAKFVKFQELNVQAPRSFTHVSINKRNFLFASSFKGNTQIYKHVIVDLSA,509,NP_001295205.1.csv,refseq-LGI1-NM_001308276.2_clinical_seed_0_final,refseq-LGI1-NM_001308276.2.a2m,Invitae,refseq-LGI1-NM_001308276.2.npy,1,509,509
+NP_001298127.1,MIGGLFIYNHKGEVLISRVYRDDIGSRQAADSAVFSSSGPFPGEWLEANRRNAVDAFRVNVIHARQQVRSPVTNIARTSFFHVKRSNIWLAAVTKQNVNAAMVFEFLYKMCDVMAAYFGKISEENIKNNFVLIYELLDEILDFGYPQNSETGALKTFITQQGIKSQHQTKEEQSQITSQVTGQIGWRREGIKYRRNELFLDVLESVNLLMSPQGQVLSAHVSGRVVMKSYLSGMPECKFGMNDKIVIEKQGKGTADETSKSGKQSIAIDDCTFHQCVRLSKFDSERSISFIPPDGEFELMRYRTTKDIILPFRVIPLVREVGRTKLEVKVVIKSNFKPSLLAQKIEVRIPTPLNTSGVQVICMKGKAKYKASENAIVWKIKRMAGMKESQISAEIELLPTNDKKKWARPPISMNFEVPFAPSGLKVRYLKVFEPKLNYSDHDVIKWVRYIGRSGIYETRC,460,NP_001298127.1.csv,refseq-AP2M1-NM_001311198.1_clinical_seed_0_final,refseq-AP2M1-NM_001311198.1.a2m,Invitae,refseq-AP2M1-NM_001311198.1.npy,1,460,460
+NP_001303260.1,MWGCQAPLSRLPASGETPMSQSWLVSPLATRPSFTARCSHSTRHTGWRWCCFMERPLTLTRGSSWAHCSYCHRGATGPWPLTFQLTTRSAMTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANPTDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSNPHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEGVTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPALGFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS,498,NP_001303260.1.csv,NP_001303260.1_clinical_seed_0_final,NP_001303260.1.a2m,popEVE,NP_001303260.1_theta_0.2.npy,1,498,498
+NP_001304019.1,MRSTTLLALLALVLLYLVSGALVFRALEQPHEQQAQRELGEVREKFLRAHPCVSDQELGLLIKEVADALGGGADPETNSTSNSSHSAWDLGSAFFFSGTIITTIGYGNVALRTDAGRLFCIFYALVGIPLFGILLAGVGDRLGSSLRHGIGHIEAIFLKWHVPPELVRVLSAMLFLLIGCLLFVLTPTFVFCYMEDWSKLEAIYFVIVTLTTVGFGDYVAGADPRQDSPAYQPLVWFWILLGLAYFASVLTTIGNWLRVVSRRTRAEMGGLTAQAASWTGTVTARVTQRAGPAAPPPEKEQPLLPPPPCPAQPLGRPRSPSPPEKAQPPSPPTASALDYPSENLAFIDESSDTQSERGCPLPRAPRGRRRPNPPRKPVRPRGPGRPRDKGVPV,393,NP_001304019.1.csv,refseq-KCNK4-NM_001317090.1_clinical_seed_0_final,refseq-KCNK4-NM_001317090.1.a2m,Invitae,refseq-KCNK4-NM_001317090.1.npy,1,393,393
+NP_001304919.1,MAGGVDGPIGIPFPDHSSDILSGLNEQRTQGLLCDVVILVEGREFPTHRSVLAACSQYFKKLFTSGAVVDQQNVYEIDFVSAEALTALMDFAYTATLTVSTANVGDILSAARLLEIPAVSHVCADLLDRQILAADAGADAGQLDLVDQIDQRNLLRAKEYLEFFQSNPMNSLPPAAAAAAASFPWSAFGASDDDLDATKEAVAAAVAAVAAGDCNGLDFYGPGPPAERPPTGDGDEGDSNPGLWPERDEDAPTGGLFPPPVAPPAATQNGHYGRGGEEEAASLSEAAPEPGDSPGFLSGAAEGEDGDGPDVDGLAASTLLQQMMSSVGRAGAAAGDSDEESRADDKGVMDYYLKYFSGAHDGDVYPAWSQKVEKKIRAKAFQKCPICEKVIQGAGKLPRHIRTHTGEKPYECNICKVRFTRQDKLKVHMRKHTGEKPYLCQQCGAAFAHNYDLKNHMRVHTGLRPYQCDSCCKTFVRSDHLHRHLKKDGCNGVPSRRGRKPRVRGGAPDPSPGATATPGAPAQPSSPDARRNGQEKHFKDEDEDEDVASPDGLGRLNVAGAGGGGDSGGGPGAATDGNFTAGLA,584,NP_001304919.1.csv,refseq-ZBTB7A-NM_001317990.1_clinical_seed_0_final,refseq-ZBTB7A-NM_001317990.1.a2m,Invitae,refseq-ZBTB7A-NM_001317990.1.npy,1,584,584
+NP_001305092.1,MEMTEMTGVSLKRGALVVEDNDSGVPVEETKKQKLSECSLTKGQDGLQNDFLSISEDVPRPPDTVSTGKGGKNSEAQLEDEEEEEEDGLSEECEEEESESFADMMKHGLTEADVGITKFVSSHQGFSGILKERYSDFVVHEIGKDGRISHLNDLSIPVDEEDPSEDIFTVLTAEEKQRLEELQLFKNKETSVAIEVIEDTKEKRTIIHQAIKSLFPGLETKTEDREGKKYIVAYHAAGKKALAKVRTAADPRKHSWPKSRGSYCHFVLYKENKDTMDAINVLSKYLRVKPNIFSYMGTKDKRAITVQEIAVLKITAQRLAHLNKCLMNFKLGNFSYQKNPLKLGELQGNHFTVVLRNITGTDDQVQQAMNSLKEIGFINYYGMQRFGTTAVPTYQVGRAILQNSWTEVMDLILKPRSGAEKGYLVKCREEWAKTKDPTAALRKLPVKRCVEGQLLRGLSKYGMKNIVSAFGIIPRNNRLMYIHSYQSYVWNNMVSKRIEDYGLKPVPGDLVLKGATATYIEEDDVNNYSIHDVVMPLPGFDVIYPKHKIQEAYREMLTADNLDIDNMRHKIRDYSLSGAYRKIIIRPQNVSWEVVAYDDPKIPLFNTDVDNLEGKTPPVFASEGKYRALKMDFSLPPSTYATMAIREVLKMDTSIKNQTQLNTTWLR,667,NP_001305092.1.csv,refseq-PUS7-NM_001318163.1_clinical_seed_0_final,refseq-PUS7-NM_001318163.1.a2m,Invitae,refseq-PUS7-NM_001318163.1.npy,1,667,667
+NP_001305781.1,MMEIQMDEGGGVVVYQDDYCSGSVMSERVSGLAGSIYREFERLIHCYDEEVVKELMPLVVNVLENLDSVLSENQEHEVELELLREDNEQLLTQYEREKALRRQAEEKFIEFEDALEQEKKELQIQVEHYEFQTRQLELKAKNYADQISRLEERESEMKKEYNALHQRHTEMIQTYVEHIERSKMQQVGGNSQTESSLPGRSRKERPTSLNVFPLADGTVRAQIGGKLVPAGDHWHLSDLGQLQSSSSYQCPQDEMSESGQSSAAATPSTTGTKSNTPTSSVPSAAVTPLNESLQPLGDYGVGSKNSKRAREKRDSRNMEVQVTQEMRNVSIGMGSSDEWSDVQDIIDSTPELDMCPETRLDRTGSSPTQGIVNKAFGINTDSLYHELSTAGSEVIGDVDEGADLLGEFSVRDDFFGMGKEVGNLLLENSQLLETKNALNVVKNDLIAKVDQLSGEQEVLRGELEAAKQAKVKLENRIKELEEELKRVKSEAIIARREPKEEAEDVSSYLCTESDKIPMAQRRRFTRVEMARVLMERNQYKERLMELQEAVRWTEMIRASREHPSVQEKKKSTIWQFFSRLFSSSSSPPPAKRPYPSVNIHYKSPTTAGFSQRRNHAMCPISAGSRPLEFFPDDDCTSSARREQKREQYRQVREHVRNDDGRLQACGWSLPAKYKQLSPNGGQEDTRMKNVPVPVYCRPLVEKDPTMKLWCAAGVNLSGWRPNEDDAGNGVKPAPGRDPLTCDREGDGEPKSAHTSPEKKKAKELPEMDATSSRVWILTSTLTTSKVVIIDANQPGTVVDQFTVCNAHVLCISSIPAASDSDYPPGEMFLDSDVNPEDPGADGVLAGITLVGCATRCNVPRSNCSSRGDTPVLDKGQGEVATIANGKVNPSQSTEEATEATEVPDPGPSEPETATLRPGPLTEHVFTDPAPTPSSGPQPGSENGPEPDSSSTRPEPEPSGDPTGAGSSAAPTMWLGAQNGWLYVHSAVANWKKCLHSIKLKDSVLSLVHVKGRVLVALADGTLAIFHRGEDGQWDLSNYHLMDLGHPHHSIRCMAVVYDRVWCGYKNKVHVIQPKTMQIEKSFDAHPRRESQVRQLAWIGDGVWVSIRLDSTLRLYHAHTHQHLQDVDIEPYVSKMLGTGKLGFSFVRITALLVAGSRLWVGTGNGVVISIPLTETVVLHRGQLLGLRANKTSPTSGEGARPGGIIHVYGDDSSDRAASSFIPYCSMAQAQLCFHGHRDAVKFFVSVPGNVLATLNGSVLDSPAEGPGPAAPASEVEGQKLRNVLVLSGGEGYIDFRIGDGEDDETEEGAGDMSQVKPVLSKAERSHIIVWQVSYTPE,1337,NP_001305781.1.csv,refseq-MAPK8IP3-NM_001318852.1_clinical_seed_0_final,refseq-MAPK8IP3-NM_001318852.1.a2m,Invitae,refseq-MAPK8IP3-NM_001318852.1.npy,1,1337,1337
+NP_001307127.1,MTCGSYCGGRAFSCISACGPRPGRCCITAAPYRGISCYRGLTGGFGSHSVCGGFRAGSCGRSFGYRSGGVCGPSPPCITTVSVNESLLTPLNLEIDPNAQCVKQEEKEQIKSLNSRFAAFIDKVRFLEQQNKLLETKLQFYQNRECCQSNLEPLFEGYIETLRREAECVEADSGRLASELNHVQEVLEGYKKKYEEEVSLRATAENEFVALKKDVDCAYLRKSDLEANVEALIQEIDFLRRLYEEEIRVLQSHISDTSVVVKLDNSRDLNMDCIIAEIKAQYDDIVTRSRAEAESWYRSKCEEMKATVIRHGETLRRTKEEINELNRMIQRLTAEVENAKCQNSKLEAAVAQSEQQGEAALSDARCKLAELEGALQKAKQDMACLIREYQEVMNSKLGLDIEIATYRRLLEGEEQRLCEGVGSVNVCVSSSRGGVVCGDLCASTTAPVVSTRVSSVPSNSNVVVGTTNACAPSARVGVCGGSCKRC,486,NP_001307127.1.csv,refseq-KRT86-NM_001320198.1_clinical_seed_0_final,refseq-KRT86-NM_001320198.1.a2m,Invitae,refseq-KRT86-NM_001320198.1.npy,1,486,486
+NP_001307509.1,MNNSLENTISFEEYIRVKARSVPQHRMKEFLDSLASKGPEALQEFQQTATTTMVYQQGGNCIYTDSTEVAGSLLELACPVTTSVQPQTQQEQQIQVQQPQQVQVQVQVQQSPQQVSAQLSPQLTVHQPTEQPIQVQVQIQGQAPQSAAPSIQTPSLQSPSPSQLQAAQIQVQHVQAAQQIQAAEIPEEHIPHQQIQAQLVAGQSLAGGQQIQIQTVGALSPPPSQQGSPREGERRVGTASVLQPVKKRKVDMPITVSYAISGQPVATVLAIPQGQQQSYVSLRPDLLTVDSAHLYSATGTITSPTGETWTIPVYSAQPRGDPQQQSITHIAIPQEAYNAVHVSGSPTALAAVKLEDDKEKMVGTTSVVKNSHEEVVQTLANSLFPAQFMNGNIHIPVAVQAVAGTYQNTAQTVHIWDPQQQPQQQTPQEQTPPPQQQQQQLQVTCSAQTVQVAEVEPQSQPQPSPELLLPNSLKPEEGLEVWKNWAQTKNAELEKDAQNRLAPIGRRQLLRFQEDLISSAVAELNYGLCLMTREARNGEGEPYDPDVLYYIFLCIQKYLFENGRVDDIFSDLYYVRFTEWLHEVLKDVQPRVTPLGYVLPSHVTEEMLWECKQLGAHSPSTLLTTLMFFNTKYFLLKTVDQHMKLAFSKVLRQTKKNPSNPKDKSTSIRYLKALGIHQTGQKVTDDMYAEQTENPENPLRCPIKLYDFYLFKCPQSVKGRNDTFYLTPEPVVAPNSPIWYSVQPISREQMGQMLTRILVIREIQEAIAVANASTMH,776,NP_001307509.1.csv,refseq-QRICH1-NM_001320580.1_clinical_seed_0_final,refseq-QRICH1-NM_001320580.1.a2m,Invitae,refseq-QRICH1-NM_001320580.1.npy,1,776,776
+NP_001307594.1,MAEERVATRTQFPVSTESQKPRQKKAPEFPILEKQNWLIHLHYIRKDYEACKAVIKEQLQETQGLCEYAIYVQALIFRLEGNIQESLELFQTCAVLSPQSADNLKQVARSLFLLGKHKAAIEVYNEAAKLNQKDWEISHNLGVCYIYLKQFNKAQDQLHNALNLNRHDLTYIMLGKIHLLEGDLDKAIEVYKKAVEFSPENTELLTTLGLLYLQAILAAGSMMQTHGDFDVALTKYRVVACAVPESPPLWNNIGMCFFGKKKYVAAISCLKRANYLAPFDWKILYNLGLVHLTMQQYASAFHFLSAAINFQPKMGELYMLLAVALTNLEDIENAKRAYAEAVHLDKCNPLVNLNYAVLLYNQGEKKNALAQYQEMEKKVSLLKDNSSLEFDSEMVEMAQKLGAALQVGEALVWTKPVKDPKSKHQTTSTSKPASFQQPLGSNQALGQAMSSAAAYRTLPSGAGGTSQFTKPPSLPLEPEPAVESSPTETSEQIREK,496,NP_001307594.1.csv,refseq-BBS4-NM_001320665.2_clinical_seed_0_final,refseq-BBS4-NM_001320665.2.a2m,Invitae,refseq-BBS4-NM_001320665.2.npy,1,496,496
+NP_001307679.1,MAQDRLQLFIAKMKIPFLLLFFLWEAESHAASRPNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTDFCHHPLHHGFNYFYGISLTNLRDCKPGEGSVFTTGFKRLVFLPLQIVGVTLLTLAALNCLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPMSYDNLTQRLTVEAAQFIQRNTETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAGAPLPEDRIIDGRDLMPLLEGKSQRSDHEFLFHYCNAYLNAVRWHPQNSTSIWKAFFFTPNFNPVGSNGCFATHVCFCFGSYVTHHDPPLLFDISKDPRERNPLTPASEPRFYEILKVMQEAADRHTQTLPEVPDQFSWNNFLWKPWLQLCCPSTGLSCQCDREKQDKRLSR,590,NP_001307679.1.csv,NP_001307679.1_colabfold_clinical_seed_0_final,NP_001307679.1_colabfold.a2m,colabfold,NP_001307679.1_colabfold_theta_0.2.npy,1,590,590
+NP_001307852.1,MQYLNIKEDCNAMAFCAKMRSSKKTEVNLEAPEPGVEVIFYLSDREPLRLGSGEYTAEELCIRAAQACRISPLCHNLFALYDENTKLWYAPNRTITVDDKMSLRLHYRMRFYFTNWHGTNDNEQSVWRHSPKKQKNGYEKKKIPDATPLLDASSLEYLFAQGQYDLVKCLAPIRDPKTEQDGHDIENECLGMAVLAISHYAMMKKMQLPELPKDISYKRYIPETLNKSIRQRNLLTRMRINNVFKDFLKEFNNKTICDSSVSTHDLKVKYLATLETLTKHYGAEIFETSMLLISSENEMNWFHSNDGGNVLYYEVMVTGNLGIQWRHKPNVVSVEKEKNKLKRKKLENKHKKDEEKNKIREEWNNFSYFPEITHIVIKESVVSINKQDNKKMELKLSSHEEALSFVSLVDGYFRLTADAHHYLCTDVAPPLIVHNIQNGCHGPICTEYAINKLRQEGSEEGMYVLRWSCTDFDNILMTVTCFEKSEQVQGAQKQFKNFQIEVQKGRYSLHGSDRSFPSLGDLMSHLKKQILRTDNISFMLKRCCQPKPREISNLLVATKKAQEWQPVYPMSQLSFDRILKKDLVQGEHLGRGTRTHIYSGTLMDYKDDEGTSEEKKIKVILKVLDPSHRDISLAFFEAASMMRQVSHKHIVYLYGVCVRDVENIMVEEFVEGGPLDLFMHRKSDVLTTPWKFKVAKQLASALSYLEDKDLVHGNVCTKNLLLAREGIDSECGPFIKLSDPGIPITVLSRQECIERIPWIAPECVEDSKNLSVAADKWSFGTTLWEICYNGEIPLKDKTLIEKERFYESRCRPVTPSCKELADLMTRCMNYDPNQRPFFRAIMRDINKLEEQNPDIVSEKKPATEVDPTHFEKRFLKRIRDLGEGHFGKVELCRYDPEGDNTGEQVAVKSLKPESGGNHIADLKKEIEILRNLYHENIVKYKGICTEDGGNGIKLIMEFLPSGSLKEYLPKNKNKINLKQQLKYAVQICKGMDYLGSRQYVHRDLAARNVLVESEHQVKIGDFGLTKAIETDKEYYTVKDDRDSPVFWYAPECLMQSKFYIASDVWSFGVTLHELLTYCDSDSSPMALFLKMIGPTHGQMTVTRLVNTLKEGKRLPCPPNCPDEVYQLMRKCWEFQPSNRTSFQNLIEGFEALLK,1154,NP_001307852.1.csv,refseq-JAK1-NM_002227.3_clinical_seed_0_final,refseq-JAK1-NM_002227.3.a2m,Invitae,refseq-JAK1-NM_002227.3.npy,1,1154,1154
+NP_001308031.1,MEDAGGGEETPAPEAPHPPQLAPPEEQGLLFQEETIDLGGDEFGSEENETASEGSSPLADKLNEHMMESVLISDSPNSEGDAGDLGRVRDEAEPGGEGDPGPEPAGTPSPSGEADGDCAPEDAAPSSGGAPRQDAAREVPGSEAARPEQEPPVAEPVPVCTIFSQRAPPASGDGFEPQMVKSPSFGGASEASARTPPQVVQPSPSLSTFFGDTAASHSLASDFFDSFTTSAFISVSNPGAGSPAPASPPPLAVPGTEGRPEPVAMRGPQAAAPPASPEPFAHIQAVFAGSDDPFATALSMSEMDRRNDAWLPGEATRGVLRAVATQQRGAVFVDKENLTMPGLRFDNIQGDAVKDLMLRFLGEKAAAKRQVLNADSVEQSFVGLKQLISCRNWRAAVDLCGRLLTAHGQGYGKSGLLTSHTTDSLQLWFVRLALLVKLGLFQNAEMEFEPFGNLDQPDLYYEYYPHVYPGRRGSMVPFSMRILHAELQQYLGNPQESLDRLHKVKTVCSKILANLEQGLAEDGGMSSVTQEGRQASIRLWRSRLGRVMYSMANCLLLMKDYVLAVEAYHSVIKYYPEQEPQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYLHESVLFNLTTMYELESSRSMQKKQALLEAVAGKEGDSFNTQCLKLA,735,NP_001308031.1.csv,refseq-TRAPPC12-NM_001321102.1_clinical_seed_0_final,refseq-TRAPPC12-NM_001321102.1.a2m,Invitae,refseq-TRAPPC12-NM_001321102.1.npy,1,735,735
+NP_001308943.1,MERAESSSTEPAKAIKPIDRKSVHQICSGQVVLSLSTAVKELVENSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLKHHTSKIQEFADLTQVETFGFRGEALSSLCALSDVTISTCHASAKVGTRLMFDHNGKIIQKTPYPRPRGTTVSVQQLFSTLPVRHKEFQRNIKKEYAKMVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTGGSPSIKENIGSVFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQCTHGVGRSSTDRQFFFINRRPCDPAKVCRLVNEVYHMYNRHQYPFVVLNISVDSECVDINVTPDKRQILLQEEKLLLAVLKTSLIGMFDSDVNKLNVSQQPLLDVEGNLIKMHAADLEKPMVEKQDQSPSLRTGEEKKDVSISRLREAFSLRHTTENKPHSPKTPEPRRSPLGQKRGMLSSSTSGAISDKGVLRPQKEAVSSSHGPSDPTDRAEVEKDSGHGSTSVDSEGFSIPDTGSHCSSEYAASSPGDRGSQEHVDSQEKAPKTDDSFSDVDCHSNQEDTGCKFRVLPQPTNLATPNTKRFKKEEILSSSDICQKLVNTQDMSASQVDVAVKINKKVVPLDFSMSSLAKRIKQLHHEAQQSEGEQNYRKFRAKICPGENQAAEDELRKEISKTMFAEMEIIGQFNLGFIITKLNEDIFIVDQHATDEKYNFEMLQQHTVLQGQRLIAPQTLNLTAVNEAVLIENLEIFRKNGFDFVIDENVMDFSQNCILLAPVTERAKLISLPTSKNWTFGPQDVDELIFMLSDSPGVMCRPSRVKQMFASRACRKSVMIGTALNTSEMKKLITHMGEMDHPWNCPHGRPTMRHIANLGVISQN,873,NP_001308943.1.csv,NP_001308943.1_colabfold_clinical_seed_0_final,NP_001308943.1_colabfold.a2m,colabfold,NP_001308943.1_colabfold_theta_0.2.npy,1,873,873
+NP_001308971.2,MAAPILKDVVAYVEVWSSNGTENYSKTFTTQLVDMGAKVSKTFNKQVTHVIFKDGYQSTWDKAQKRGVKLVSVLWVEKCRTAGAHIDESLFPAANMNEHLSSLIKKKRKCMQPKDFNFKTPENDKRFQKKFEKMAKELQRQKTNLDDDVPILLFESNGSLIYTPTIEINSRHHSAMEKRLQEMKEKRENLSPTSSQMIQQSHDNPSNSLCEAPLNISRDTLCSDEYFAGGLHSSFDDLCGNSGCGNQERKLEGSINDIKSDVCISSLVLKANNIHSSPSFTHLDKSSPQKFLSNLSKEEINLQRNIAGKVVTPDQKQAAGMSQETFEEKYRLSPTLSSTKGHLLIHSRPRSSSVKRKRVSHGSHSPPKEKCKRKRSTRRSIMPRLQLCRSEDRLQHVAGPALEALSCGESSYDDYFSPDNLKERYSENLPPESQLPSSPAQLSCRSLSKKERTSIFEMSDFSCVGKKTRTVDITNFTAKTISSPRKTGNGEGRATSSCVTSAPEEALRCCRQAGKEDACPEGNGFSYTIEDPALPKGHDDDLTPLEGSLEEMKEAVGLKSTQNKGTTSKISNSSEGEAQSEHEPCFIVDCNMETSTEEKENLPGGYSGSVKNRPTRHDVLDDSCDGFKDLIKPHEELKKSGRGKKPTRTLVMTSMPSEKQNVVIQVVDKLKGFSIAPDVCETTTHVLSGKPLRTLNVLLGIARGCWVLSYDWVLWSLELGHWISEEPFELSHHFPAAPLCRSECHLSAGPYRGTLFADQPAMFVSPASSPPVAKLCELVHLCGGRVSQVPRQASIVIGPYSGKKKATVKYLSEKWVLDGVSLCHQAGVCSDEISAHCNLHLLGSSDSSASAFPVAGITGPATMPRFHHPAQGLCP,875,NP_001308971.2.csv,NP_001308971.2_colabfold_clinical_seed_0_final,NP_001308971.2_colabfold.a2m,colabfold,NP_001308971.2_colabfold_theta_0.2.npy,1,875,875
+NP_001309774.1,MVLNSLDKMIQLQKNTANIRNICVLAHVDHGKTTLADCLISSNGIISSRLAGKLRYMDSREDEQIRGITMKSSAISLHYATGNEEYLINLIDSPGHVDFSSEVSTAVRICDGCIIVVDAVEGVCPQTQAVLRQAWLENIRPVLVINKIDRLIVELKFTPQEAYSHLKNILEQINALTGTLFTSKVLEERAERETESQVNPNSEQGEQVYDWSTGLEDTDDSHLYFSPEQGNVVFTSAIDGWGFGIEHFARIYSQKIGIKKEVLMKTLWGDYYINMKAKKIMKGDQAKGKKPLFVQLILENIWSLYDAVLKKDKDKIDKIVTSLGLKIGAREARHSDPKVQINAICSQWLPISHAVLAMVCQKLPSPLDITAERVERLMCTGSQTFDSFPPETQALKAAFMKCGSEDTAPVIIFVSKMFAVDAKALPQNKPRPLTQEEIAQRRERARQRHAEKLAAAQGQAPLEPTQDGSAIETCPKGEEPRGDEQQVESMTPKPVLQEENNQESFIAFARVFSGVARRGKKIFVLGPKYSPLEFLRRVPLGFSAPPDGLPQVPHMAYCALENLYLLMGRELEYLEEVPPGNVLGIGGLQDFVLKSATLCSLPSCPPFIPLNFEATPIVRVAVEPKHPSEMPQLVKGMKLLNQADPCVQILIQETGEHVLVTAGEVHLQRCLDDLKERFAKIHISVSEPIIPFRETITKPPKVDMVNEEIGKQQKVAVIHQMKEDQSKIPEGIQVDSDGLITITTPNKLATLSVRAMPLPEEVTQILEENSDLIRSMEQLTSSLNEGENTHMIHQKTQEKIWEFKGKLEQHLTGRRWRNIVDQIWSFGPRKCGPNILVNKSEDFQNSVWTGPADKASKEASRYRDLGNSIVSGFQLATLSGPMCEEPLMGVCFVLEKWDLSKFEEQGASDLAKEGQEENETCSGGNENQELQDGCSEAFEKRTSQKGESPLTDCYGPFSGQLIATMKEACRYALQVKPQRLMAAMYTCDIMATGDVLGRVYAVLSKREGRVLQEEMKEGTDMFIIKAVLPVAESFGFADEIRKRTSGLASPQLVFSHWEIIPSDPFWVPTTEEEYLHFGEKADSENQARKYMNAVRKRKGLYVEEKIVEHAEKQRTLSKNK,1120,NP_001309774.1.csv,refseq-EFL1-NM_001322845.1_clinical_seed_0_final,refseq-EFL1-NM_001322845.1.a2m,Invitae,refseq-EFL1-NM_001322845.1.npy,1,1120,1120
+NP_001309863.1,MESCYNPGLDGIIEYDDFKLNSSIVEPKEPAPETADGPYLVIVEQPKQRGFRFRYGCEGPSHGGLPGASSEKGRKTYPTVKICNYEGPAKIEVDLVTHSDPPRAHAHSLVGKQCSELGICAVSVGPKDMTAQFNNLGVLHVTKKNMMGTMIQKLQRQRLRSRPQGLTEAEQRELEQEAKELKKVMDLSIVRLRFSAFLRASDGSFSLPLKPVISQPIHDSKSPGASNLKISRMDKTAGSVRGGDEVYLLCDKVQKDDIEVRFYEDDENGWQAFGDFSPTDVHKQYAIVFRTPPYHKMKIERPVTVFLQLKRKRGGDVSDSKQFTYYPLVEDKEEVQRKRRKALPTFSQPFGGGSHMGGGSGGAAGGYGGAGGGGSLGFFPSSLAYSPYQSGAGPMGCYPGGGGGAQMAATVPSRDSGEEAAEPSAPSRTPQCEPQAPEMLQRAREYNARLFGLAQRSARALLDYGVTADARALLAGQRHLLTAQDENGDTPLHLAIIHGQTSVIEQIVYVIHHAQDLGVVNLTNHLHQTPLHLAVITGQTSVVSFLLRVGADPALLDRHGDSAMHLALRAGAGAPELLRALLQSGAPAVPQLLHMPDFEGLYPVHLAVRARSPECLDLLVDSGAEVEATERQGGRTALHLATEMEELGLVTHLVTKLRANVNARTFAGNTPLHLAAGLGYPTLTRLLLKAGADIHAENEEPLCPLPSPPTSDSDSDSEGPEKDTRSSFRGHTPLDLTCSTKVKTLLLNAAQNTMEPPLTPPSPAGPGLSLGDTALQNLEQLLDGPEAQGSWAELAERLGLRSLVDTYRQTTSPSGSLLRSYELAGGDLAGLLEALSDMGLEEGVRLLRGPETRDKLPSTAEVKEDSAYGSQSVEQEAEKLGPPPEPPGGLCHGHPQPQVH,900,NP_001309863.1.csv,NFKB2_HUMAN_b05_clinical_seed_0_final,NFKB2_HUMAN_b05.a2m,EVE,NFKB2_HUMAN_b05_theta_0.2.npy,1,900,900
+NP_001313340.1,MATSVGHRCLGLLHGVAPWRSSLHPCEITALSQSLQPLRKLPFRAFRTDARKIHTAPARTMFLLRPLPILLVTGGGYAGYRQYEKYRERELEKLGLEIPPKLAGHWEVALYKSVPTRLLSRAWGRLNQVELPHWLRRPVYSLYIWTFGVNMKEAAVEDLHHYRNLSEFFRRKLKPQARPVCGLHSVISPSDGRILNFGQVKNCEVEQVKGVTYSLESFLGPRMCTEDLPFPPAASCDSFKNQLVTREGNELYHCVIYLAPGDYHCFHSPTDWTVSHRRHFPGSLMSVNPGMARWIKELFCHNERVVLTGDWKHGFFSLTAVGATNVGSIRIYFDRDLHTNSPRHSKGSYNDFSFVTHTNREGVPMRKGEHLGEFNLGSTIVLIFEAPKDFNFQLKTGQKIRFGEALGSL,409,NP_001313340.1.csv,refseq-PISD-NM_001326411.1_clinical_seed_0_final,refseq-PISD-NM_001326411.1.a2m,Invitae,refseq-PISD-NM_001326411.1.npy,1,409,409
+NP_001316485.1,MDGLRQRVEHFLEQRNLVTEVLGALEAKTGVEKRYLAAGAVTLLSLYLLFGYGASLLCNLIGFVYPAYASIKAIESPSKDDDTVWLTYWVVYALFGLAEFFSDLLLSWFPFYYVGKCAFLLFCMAPRPWNGALMLYQRVVRPLFLRHHGAVDRIMNDLSGRALDAAAGITRNVLQVLARSRAGITPVAVAGPSTPLEADLKPSQTPQPKDK,211,NP_001316485.1.csv,refseq-REEP6-NM_001329556.1_clinical_seed_0_final,refseq-REEP6-NM_001329556.1.a2m,Invitae,refseq-REEP6-NM_001329556.1.npy,1,211,211
+NP_001316843.1,MVAACRSVAGLLPRRRRCFPARAPLLRVALCLLCWTPAAVRAVPELGLWLETVNDKSGPLIFRKTMFNSTDIKLSVKSFHCSGPVKFTIVWHLKYHTCHNEHSNLEELFQKHKLSVDEDFCHYLKNDNCWTTKNENLDCNSDSQVFPSLNNKELINIRNVSNQERSMDVVARTQKDGFHIFIVSIKTENTDASWNLNVSLSMIGPHGYISASDWPLMIFYMVMCIVYILYGILWLTWSACYWKDILRIQFWIAAVIFLGMLEKAVFYSEYQNISNTGLSTQGLLIFAELISAIKRTLARLLVIIVSLGYGIVKPRLGTVMHRVIGLGLLYLIFAAVEGVMRVIGVSDSDLVLLASLPLSLLDSGLCWWIFISLAQTMKTLRLRKNTVKFSLYRHFKNTLIFAVLASIVFMGWTTKTFRIAKCQSDWMERWVDDAFWSFLFSLILIVIMFLWRPSANNQRYAFMPLIDDSDDEIEEFMVTSENLTEGIKLRASKSVSNGTAKPATSENFDEDLKWVEENIPSSFTDVALPVLVDSDEEIMTRSEMAEKMFSSEKIM,555,NP_001316843.1.csv,NP_001316843.1_colabfold_clinical_seed_0_final,NP_001316843.1_colabfold.a2m,colabfold,NP_001316843.1_colabfold_theta_0.2.npy,1,555,555
+NP_001317619.1,MKTKLNIYNMQFLLFVFLVWDPARLVLANIQEDEAKNNITIFTRILDRLLDGYDNRLRPGLGDSITEVFTNIYVTSFGPVSDTDMEYTIDVFFRQKWKDERLKFKGPMNILRLNNLMASKIWTPDTFFHNGKKSVAHNMTMPNKLLRIQDDGTLLYTMRLTVQAECPMHLEDFPMDAHSCPLKFGSYAYTTSEVTYIWTYNASDSVQVAPDGSRLNQYDLLGQSIGKETIKSSTGEYTVMTAHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGWAWDGKSVVNDKSPSIKAEGITLTYNSVKAILQGAKLIWSKYIAFSWPSLFQEKTLEYLEKWMDCLTFKQSSKKEKASVMIQNNAYAVAVANYAPNLSKDPVLSTISKSATTPEPNKKPENKPAEAKKTFNSVSKIDRMSRIVFPVLFGTFNLVYWATYLNREPVLGVSP,511,NP_001317619.1.csv,refseq-GABRA2-NM_001330690.1_clinical_seed_0_final,refseq-GABRA2-NM_001330690.1.a2m,Invitae,refseq-GABRA2-NM_001330690.1.npy,1,511,511
+NP_001317964.1,MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCVCKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGNTDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTECLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKACDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNERVFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV,576,NP_001317964.1.csv,refseq-CFI-NM_001331035.1_clinical_seed_0_final,refseq-CFI-NM_001331035.1.a2m,Invitae,refseq-CFI-NM_001331035.1_theta_0.2.npy,1,576,576
+NP_001319.1,MGSSHLLNKGLPLGVRPPIMNGPLHPRPLVALLDGRDCTVEMPILKDVATVAFCDAQSTQEIHEKVLNEAVGALMYHTITLTREDLEKFKALRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASVEETADSTLCHILNLYRRATWLHQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAVALRAKAFGFNVLFYDPYLSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLINDFTVKQMRQGAFLVNTARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFSQGPLKDAPNLICTPHAAWYSEQASIEMREEAAREIRRAITGRIPDSLKNCVNKDHLTAATHWASMDPAVVHPELNGAAYRYPPGVVGVAPTGIPAAVEGIVPSAMSLSHGLPPVAHPPHAPSPGQTVKPEADRDHASDQL,440,NP_001319.1.csv,refseq-CTBP1-NM_001328.2_clinical_seed_0_final,refseq-CTBP1-NM_001328.2.a2m,Invitae,refseq-CTBP1-NM_001328.2.npy,1,440,440
+NP_001323.1,MFARKPPGAAPLGAMPVPDQPSSASEKTSSLSPGLNTSNGDGSETETTSAILASVKEQELQFERLTRELEAERQIVASQLERCKLGSETGSMSSMSSAEEQFQWQSQDGQKDIEDELTTGLELVDSCIRSLQESGILDPQDYSTGERPSLLSQSALQLNSKPEGSFQYPASYHSNQTLALGETTPSQLPARGTQARATGQSFSQGTTSRAGHLAGPEPAPPPPPPPREPFAPSLGSAFHLPDAPPAAAAAALYYSSSTLPAPPRGGSPLAAPQGGSPTKLQRGGSAPEGATYAAPRGSSPKQSPSRLAKSYSTSSPINIVVSSAGLSPIRVTSPPTVQSTISSSPIHQLSSTIGTYATLSPTKRLVHASEQYSKHSQELYATATLQRPGSLAAGSRASYSSQHGHLGPELRALQSPEHHIDPIYEDRVYQKPPMRSLSQSQGDPLPPAHTGTYRTSTAPSSPGVDSVPLQRTGSQHGPQNAAAATFQRASYAAGPASNYADPYRQLQYCPSVESPYSKSGPALPPEGTLARSPSIDSIQKDPREFGWRDPELPEVIQMLQHQFPSVQSNAAAYLQHLCFGDNKIKAEIRRQGGIQLLVDLLDHRMTEVHRSACGALRNLVYGKANDDNKIALKNCGGIPALVRLLRKTTDLEIRELVTGVLWNLSSCDALKMPIIQDALAVLTNAVIIPHSGWENSPLQDDRKIQLHSSQVLRNATGCLRNVSSAGEEARRRMRECDGLTDALLYVIQSALGSSEIDSKTVENCVCILRNLSYRLAAETSQGQHMGTDELDGLLCGEANGKDAESSGCWGKKKKKKKSQDQWDGVGPLPDCAEPPKGIQMLWHPSIVKPYLTLLSECSNPDTLEGAAGALQNLAAGSWKWSVYIRAAVRKEKGLPILVELLRIDNDRVVCAVATALRNMALDVRNKELIGKYAMRDLVHRLPGGNNSNNTASKAMSDDTVTAVCCTLHEVITKNMENAKALRDAGGIEKLVGISKSKGDKHSPKVVKAASQVLNSMWQYRDLRSLYKKDGWSQYHFVASSSTIERDRQRPYSSSRTPSISPVRVSPNNRSASAPASPREMISLKERKTDYECTGSNATYHGAKGEHTSRKDAMTAQNTGISTLYRNSYGAPAEDIKHNQVSAQPVPQEPSRKDYETYQPFQNSTRNYDESFFEDQVHHRPPASEYTMHLGLKSTGNYVDFYSAARPYSELNYETSHYPASPDSWV,1225,NP_001323.1.csv,refseq-CTNND2-NM_001332.3_clinical_seed_0_final,refseq-CTNND2-NM_001332.3.a2m,Invitae,refseq-CTNND2-NM_001332.3.npy,1,1225,1225
+NP_001334748.1,MSQLRLLPSRLGVQAARLLAAHDVPVFGWRSRSSGPPATFPSSKGGGGSSYMEEMYFAWLENPQSVHKSWDSFFREASEEAFSGSAQPRPPSVVHESRSAVSSRTKTSKLVEDHLAVQSLIRAYQIRGHHVAQLDPLGILDADLDSFVPSDLITTIDKLAFYDLQEADLDKEFQLPTTTFIGGSENTLSLREIIRRLENTYCQHIGLEFMFINDVEQCQWIRQKFETPGVMQFSSEEKRTLLARLVRSMRFEDFLARKWSSEKRFGLEGCEVMIPALKTIIDKSSEMGIENVILGMPHRGRLNVLANVIRKDLEQIFCQFDPKLEAADEGSGDVKYHLGMYHERINRVTNRNITLSLVANPSHLEAVDPVVQGKTKAEQFYRGDAQGKKVMSILVHGDAAFAGQGVVYETFHLSDLPSYTTNGTVHVVVNNQIGFTTDPRMARSSPYPTDVARVVNAPIFHVNADDPEAVIYVCSVAAEWRNTFNKDVVVDLVCYRRRGHNEMDEPMFTQPLMYKQIHRQVPVLKKYADKLIAEGTVTLQEFEEEIAKYDRICEEAYGRSKDKKILHIKHWLDSPWPGFFNVDGEPKSMTCPATGIPEDMLTHIGSVASSVPLEDFKIHTGLSRILRGRADMTKNRTVDWALAEYMAFGSLLKEGIHVRLSGQDVERGTFSHRHHVLHDQEVDRRTCVPMNHLWPDQAPYTVCNSSLSEYGVLGFELGYAMASPNALVLWEAQFGDFHNTAQCIIDQFISTGQAKWVRHNGIVLLLPHGMEGMGPEHSSARPERFLQMSNDDSDAYPAFTKDFEVSQLYDCNWIVVNCSTPANYFHVLRRQILLPFRKPLIIFTPKSLLRHPEAKSSFDQMVSGTSFQRVIPEDGAAARAPEQVQRLIFCTGKVYYDLVKERSSQDLEEKVAITRLEQISPFPFDLIKQEAEKYPGAELAWCQEEHKNMGYYDYISPRFMTILRRARPIWYVGRDPAAAPATGNRNTHLVSLKKFLDTAFNLQAFEGKTF,1010,NP_001334748.1.csv,OGDHL_HUMAN_b10_clinical_seed_0_final,OGDHL_HUMAN_b10.a2m,EVE,OGDHL_HUMAN_b10_theta_0.2.npy,1,1010,1010
+NP_001336584.1,MSRKISKESKKVNISSSLESEDISLETTVPTDDISSSEEREGKVRITRQLIERKELLHNIQLLKIELSQKTMMIDNLKVDYLTKIEELEEKLNDALHQKQLLTLRLDNQLAFQQKDASKYQELMKQEMETILLRQKQLEETNLQLREKAGDVRRNLRDFELTEEQYIKLKAFPEDQLSIPEYVSVRFYELVNPLRKEICELQVKKNILAEELSTNKNQLKQLTETYEEDRKNYSEVQIRCQRLALELADTKQLIQQGDYRQENYDKVKSERDALEQEVIELRRKHEILEASHMIQTKERSELSKEVVTLEQTVTLLQKDKEYLNRQNMELSVRCAHEEDRLERLQAQLEESKKAREEMYEKYVASRDHYKTEYENKLHDELEQIRLKTNQEIDQLRNASREMYERENRSGGWEVQYLRTGIWQEPSHYIIIWHKARRNLREARDNAVAEKERAVMAEKDALEKHDQLLDRYRELQLSTESKVTEFLHQSKLKSFESERVQLLQEETARNLTQCQLECEKYQKKLEVLTKEFYSLQASSEKRITELQAQNSEHQARLDIYEKLEKELDEIIMQTAEIENEDEAERVLFSYGYGANVPTTAKRRLKQSVHLARRVLQLEKQNSLILKDLEHRKDQVTQLSQELDRANSLLNQTQQPYRYLIESVRQRDSKIDSLTESIAQLEKDVSNLNKEKSALLQTKNQMALDLEQLLNHREELAAMKQILVKMHSKHSENSLLLTKTEPKHVTENQKSKTLNVPKEHEDNIFTPKPTLFTKKEAPEWSKKQKMKT,786,NP_001336584.1.csv,NP_001336584.1_colabfold_clinical_seed_0_final,NP_001336584.1_colabfold.a2m,colabfold,NP_001336584.1_colabfold_theta_0.2.npy,1,786,786
+NP_001336644.1,MWVCSTLWRVRTPARQWRGLLPASGCHGPAASSYSASAEPARVRALVYGHHGDPAKVVDDSVILFRGITRELFQRFPWIFLQLITAVISSASTVLKNLELAAVRGSDVRVKMLAAPINPSDINMIQGNYGFLPELPAVGGNEGVAQVVAVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFFKDMPQPRLALNCVGGKSSTELLRQLARGGTMVTYGGMAKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALEASMKPFISSKQILTM,408,NP_001336644.1.csv,NP_001336644.1_colabfold_clinical_seed_0_final,NP_001336644.1_colabfold.a2m,colabfold,NP_001336644.1_colabfold_theta_0.2.npy,1,408,408
+NP_001340.2,MPSASASRKSQEKPREIMDAAEDYAKERYGISSMIQSQEKPDRVLVRVRDLTIQKADEVVWVRARVHTSRAKGKQCFLVLRQQQFNVQALVAVGDHASKQMVKFAANINKESIVDVEGVVRKVNQKIGSCTQQDVELHVQKIYVISLAEPRLPLQLDDAVRPEAEGEEEGRATVNQDTRLDNRVIDLRTSTSQAVFRLQSGICHLFRETLINKGFVEIQTPKIISAASEGGANVFTVSYFKNNAYLAQSPQLYKQMCICADFEKVFSIGPVFRAEDSNTHRHLTEFVGLDIEMAFNYHYHEVMEEIADTMVQIFKGLQERFQTEIQTVNKQFPCEPFKFLEPTLRLEYCEALAMLREAGVEMGDEDDLSTPNEKLLGHLVKEKYDTDFYILDKYPLAVRPFYTMPDPRNPKQSNSYDMFMRGEEILSGAQRIHDPQLLTERALHHGIDLEKIKAYIDSFRFGAPPHAGGGIGLERVTMLFLGLHNVRQTSMFPRDPKRLTP,501,NP_001340.2.csv,refseq-DARS-NM_001349.3_clinical_seed_0_final,refseq-DARS-NM_001349.3.a2m,Invitae,refseq-DARS-NM_001349.3.npy,1,501,501
+NP_001340274.1,MENSHPPHHHHQQPPPQPGPSGERRNHHWRSYKLMIDPALKKGHHKLYRYDGQHFSLAMSSNRPVEIVEDPRVVGIWTKNKELELSVPKFKIDEFYVGPVPPKQVTFAKLNDNIRENFLRDMCKKYGEVEEVEILYNPKTKKHLGIAKVVFATVRGAKDAVQHLHSTSVMGNIIHVELDTKGETRMRFYELLVTGRYTPQTLPVGELDAVSPIVNETLQLSDALKRLKDGGLSAGCGSGSSSVTPNSGGTPFSQDTAYSSCRLDTPNSYGQGTPLTPRLGTPFSQDSSYSSRQPTPSYLFSQDPAVTFKARRHESKFTDAYNRRHEHHYVHNSPAVTAVAGATAAFRGSSDLPFGAVGGTGGSSGPPFKAQPQDSATFAHTPPPAQATPAPGFKSAFSPYQTPVAHFPPPPEEPTATAAFGARDSGEFRRAPAPPPLPPAEPLAKEKPGTPPGPPPPDTNSMELGGRPTFGWSPEPCDSPGTPTLESSPAGPEKPHDSLDSRIEMLLKEQRTKLLFLREPDSDTELQMEGSPISSSSSQLSPLAPFGTNSQPGFRGPTPPSSRPSSTGLEDISPTPLPDSDEDEELDLGLGPRPPPEPGPPDPAGLLSQTAEVALDLVGDRTPTSEKMDEGQQSSGEDMEISDDEMPSAPITSADCPKPMVVTPGAAAVAAPSVLAPTLPLPPPPGFPPLPPPPPPPPPQPGFPMPPPLPPPPPPPPPAHPAVTVPPPPLPAPPGVPPPPILPPLPPFPPGLFPVMQVDMSHVLGGQWGGMPMSFQMQTQVLSRLMTGQGACPYPPFMAAAAAAASAGLQFVNLPPYRGPFSLSNSGPGRGQHWPPLPKFDPSVPPPGYMPRQEDPHKATVDGVLLVVLKELKAIMKRDLNRKMVEVVAFRAFDEWWDKKERMAKASLTPVKSGEHKDEDRPKPKDRIASCLLESWGKGEGLGYEGLGLGIGLRGAIRLPSFKVKRKEPPDTTSSGDQKRLRPSTSVDEEDEESERERDRDMADTPCELAKRDPKGVGVRRRPARPLELDSGGEEDEKESLSASSSSSASSSSGSSTTSPSSSASDKEEEQESTEEEEEAEEEEEEEVPRSQLSSSSTSSTSDKDDDDDDSDDRDESENDDEDTALSEASEKDEGDSDEEETVSIVTSKAEATSSSESSESSEFESSSESSPSSSEDEEEVVAREEEEEEEEEEMVAEESMASAGPEDFEQDGEEAALAPGAPAVDSLGMEEEVDIETEAVAPEERPSMLDEPPLPVGVEEPADSREPPEEPGLSQEGAMLLSPEPPAKEVEARPPLSPERAPEHDLEVEPEPPMMLPLPLQPPLPPPRPPRPPSPPPEPETTDASHPSVPPEPLAEDHPPHTPGLCGSLAKSQSTETVPATPGGEPPLSGGSSGLSLSSPQVPGSPFSYPAPSPSLSSGGLPRTPGRDFSFTPTFSEPSGPLLLPVCPLPTGRRDERSGPLASPVLLETGLPLPLPLPLPLPLALPAVLRAQARAPTPLPPLLPAPLASCPPPMKRKPGRPRRSPPSMLSLDGPLVRPPAGAALGRELLLLPGQPQTPVFPSTHDPRTVTLDFRNAGIPAPPPPLPPQPPPPPPPPPVEPTKLPFKELDNQWPSEAIPPGPRGRDEVTEEYMELAKSRGPWRRPPKKRHEDLVPPAGSPELSPPQPLFRPRSEFEEMTILYDIWNGGIDEEDIRFLCVTYERLLQQDNGMDWLNDTLWVYHPSTSLSSAKKKKRDDGIREHVTGCARSEGFYTIDKKDKLRYLNSSRASTDEPPADTQGMSIPAQPHASTRAGSERRSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN,1966,NP_001340274.1.csv,NP_001340274.1_clinical_seed_0_final,NP_001340274.1.a2m,popEVE,NP_001340274.1_theta_0.2.npy,1,1966,1966
+NP_001340741.2,MFRRSLNRFCAGEEKRVGTRTVFVGNHPVSETEAYIAQRFCDNRIVSSKYTLWNFLPKNLFEQFRRIANFYFLIIFLVQVTVDTPTSPVTSGLPLFFVITVTAIKQGYEDCLRHRADNEVNKSTVYIIENAKRVRKESEKIKVGDVVEVQADETFPCDLILLSSCTTDGTCYVTTASLDGESNCKTHYAVRDTIALCTAESIDTLRAAIECEQPQPDLYKFVGRINIYSNSLEAVARSLGPENLLLKGATLKNTEKIYGVAVYTGMETKMALNYQGKSQKRSAVEKSINAFLIVYLFILLTKAAVCTTLKYVWQSTPYNDEPWYNQKTQKERETLKVLKMFTDFLSFMVLFNFIIPVSMYVTVEMQKFLGSFFISWDKDFYDEEINEGALVNTSDLNEELGQVDYVFTDKTGTLTENSMEFIECCIDGHKYKGVTQEVDGLSQTDGTLTYFDKVDKNREELFLRALCLCHTVEIKTNDAVDGATESAELTYISSSPDEIALVKGAKRYGFTFLGNRNGYMRVENQRKEIEEYELLHTLNFDAVRRRMSVIVKTQEGDILLFCKGADSAVFPRVQNHEIELTKVHVERNAMDGYRTLCVAFKEIAPDDYERINRQLIEAKMALQDREEKMEKVFDDIETNMNLIGATAVEDKLQDQAAETIEALHAAGLKVWVLTGDKMETAKSTCYACRLFQTNTELLELTTKTIEESERKEDRLHELLIEYRKKLLHEFPKSTRSFKKAWTEHQEYGLIIDGSTLSLILNSSQDSSSNNYKSIFLQICMKCTAVLCCRMAPLQKAQIVRMVKNLKGSPITLSIGDGANDVSMILESHVGIGIKGKEGRQAARNSDYSVPKFKHLKKLLLAHGHLYYVRIAHLVQYFFYKNLCFILPQFLYQFFCGFSQQPLYDAAYLTMYNICFTSLPILAYSLLEQHINIDTLTSDPRLYMKISGNAMLQLGPFLYWTFLAAFEGTVFFFGTYFLFQTASLEENGKVYGNWTFGTIVFTVLVFTVTLKLALDTRFWTWINHFVIWGSLAFYVFFSFFWGGIIWPFLKQQRMYFVFAQMLSSVSTWLAIILLIFISLFPEILLIVLKNVRRRSARRNLSCRRASDSLSARPSVRPLLLRTFSDESNVL,1129,NP_001340741.2.csv,NP_001340741.2_colabfold_clinical_seed_0_final,NP_001340741.2_colabfold.a2m,colabfold,NP_001340741.2_colabfold_theta_0.2.npy,1,1129,1129
+NP_001341530.2,MATATRLLGWRVASWRLRPPLAGFVSQRAHSLLPVDDAINGLSEEQRQLRQTMAKFLQEHLAPKAQEIDRSNEFKNLREFWKQLGNLGVLGITAPVQYGGSGLGYLEHVLVMEEISRASGAVGLSYGAHSNLCINQLVRNGNEAQKEKYLPKLISGEYIGALAMSEPNAGSDVVSMKLKAEKKGNHYILNGNKFWITNGPDADVLIVYAKTDLAAVPASRGITAFIVEKGMPGFSTSKKLDKLGMRGSNTCELIFEDCKIPAANILGHENKGVYVLMSGLDLERLVLAGGPLGLMQAVLDHTIPYLHVREAFGQKIGHFQLMQGKMADMYTRLMACRQYVYNVAKACDEGHCTAKDCAGVILYSAECATQVALDGIQCFGQTFSAQHPPGREGETRGQTSLEQVSTARELRCKILLPELTPRVLLPQLASWISDQKLPFLKILLCSPTSLGQSSLAGFTKLRWPYPLLQPHLLPHPYTTALHLC,484,NP_001341530.2.csv,NP_001341530.2_colabfold_clinical_seed_0_final,NP_001341530.2_colabfold.a2m,colabfold,NP_001341530.2_colabfold_theta_0.2.npy,1,484,484
+NP_001349700.1,MSGPVPSRARVYTDVNTHRPREYWDYESHVVEWGNQDDYQLVRKLGRGKYSEVFEAINITNNEKVVVKILKPVKKKKIKREIKILENLRGGPNIITLADIVKDPVSRTPALVFEHVNNTDFKQLYQTLTDYDIRFYMYEILKALDYCHSMGIMHRDVKPHNVMIDHEHRKLRLIDWGLAEFYHPGQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSLGCMLASMIFRKEPFFHGHDNYDQLVRIAKVLGTEDLYDYIDKYNIELDPRFNDILGRHSRKRWERFVHSENQHLVSPEALDFLDKLLRYDHQSRLTAREAMEHPYFYTVVKDQARMGSSSMPGGSTPVSSANMMSGISSVPTPSPLGPLAGSPVIAAANPLGMPVPAAAGAQQ,391,NP_001349700.1.csv,refseq-CSNK2A1-NM_001895.3_clinical_seed_0_final,refseq-CSNK2A1-NM_001895.3.a2m,Invitae,refseq-CSNK2A1-NM_001895.3.npy,1,391,391
+NP_001351.2,MAAKSQPNIPKAKSLDGVTNDRTASQGQWGRAWEVDWFSLASVIFLLLFAPFIVYYFIMACDQYSCALTGPVVDIVTGHARLSDIWAKTPPITRKAAQLYTLWVTFQVLLYTSLPDFCHKFLPGYVGGIQEGAVTPAGVVNKYQINGLQAWLLTHLLWFANAHLLSWFSPTIIFDNWIPLLWCANILGYAVSTFAMVKGYFFPTSARDCKFTGNFFYNYMMGIEFNPRIGKWFDFKLFFNGRPGIVAWTLINLSFAAKQRELHSHVTNAMVLVNVLQAIYVIDFFWNETWYLKTIDICHDHFGWYLGWGDCVWLPYLYTLQGLYLVYHPVQLSTPHAVGVLLLGLVGYYIFRVANHQKDLFRRTDGRCLIWGRKPKVIECSYTSADGQRHHSKLLVSGFWGVARHFNYVGDLMGSLAYCLACGGGHLLPYFYIIYMAILLTHRCLRDEHRCASKYGRDWERYTAAVPYRLLPGIF,475,NP_001351.2.csv,refseq-DHCR7-NM_001360.2_clinical_seed_0_final,refseq-DHCR7-NM_001360.2.a2m,Invitae,refseq-DHCR7-NM_001360.2.npy,1,475,475
+NP_001351952.1,MCENQLKTKADATAQIEVIPCKICGDKSSGIHYGVITCEGCKGFFRRSQQNNASYSCPRQRNCLIDRTNRNRCQHCRLQKCLALGMSRDAVKFGRMSKKQRDSLYAEVQKHQQRLQEQRQQQSGEAEALARVYSSSISNGLSNLNNETSGTYANGHVIDLPKSEGYYNVDSGQPSPDQSGLDMTGIKQIKQEPIYDLTSVPNLFTYSSFNNGQLAPGITMTEIDRIAQNIIKSHLETCQYTMEELHQLAWQTHTYEEIKAYQSKSREALWQQCAIQITHAIQYVVEFAKRITGFMELCQNDQILLLKSGCLEVVLVRMCRAFNPLNNTVLFEGKYGGMQMFKALGSDDLVNEAFDFAKNLCSLQLTEEEIALFSSAVLISPDRAWLIEPRKVQKLQEKIYFALQHVIQKNHLDDETLAKLIAKIPTITAVCNLHGEKLQVFKQSHPEIVNTLFPPLYKELFNPDCATGCK,470,NP_001351952.1.csv,NP_001351952.1_clinical_seed_0_final,NP_001351952.1.a2m,popEVE,NP_001351952.1_theta_0.2.npy,1,470,470
+NP_001352.2,MAWRHLKKRAQDAVIILGGGGLLFASYLMATGDERFYAEHLMPTLQGLLDPESAHRLAVRFTSLGLLPRARFQDSDMLEVRVLGHKFRNPVGIAAGFDKHGEAVDGLYKMGFGFVEIGSVTPKPQEGNPRPRVFRLPEDQAVINRYGFNSHGLSVVEHRLRARQQKQAKLTEDGLPLGVNLGKNKTSVDAAEDYAEGVRVLGPLADYLVVNVSSPNTAGLRSLQGKAELRRLLTKVLQERDGLRRVHRPAVLVKIAPDLTSQDKEDIASVVKELGIDGLIVTNTTVSRPAGLQGALRSETGGLSGKPLRDLSTQTIREMYALTQGRVPIIGVGGVSSGQDALEKIRAGASLVQLYTALTFWGPPVVGKVKRELEALLKEQGFGGVTDAIGADHRR,395,NP_001352.2.csv,refseq-DHODH-NM_001361.4_clinical_seed_0_final,refseq-DHODH-NM_001361.4.a2m,Invitae,refseq-DHODH-NM_001361.4.npy,1,395,395
+NP_001352465.1,MAMLPPPGPQSFVHFTKQSLALIEQRIAERKSKEPKEEKKDDDEEAPKPSSDLEAGKQLPFIYGDIPPGMVSEPLEDLDPYYADKKTFIVLNKGKTIFRFNATPALYMLSPFSPLRRISIKILVHSLFSMLIMCTILTNCIFMTMNNPPDWTKNVEYTFTGIYTFESLVKILARGFCVGEFTFLRDPWNWLDFVVIVFAYLTEFVNLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLKHKCFRNSLENNETLESIMNTLESEEDFRKYFYYLEGSKDALLCGFSTDSGQCPEGYTCVKIGRNPDYGYTSFDTFSWAFLALFRLMTQDYWENLYQQTLRAAGKTYMIFFVVVIFLGSFYLINLILAVVAMAYEEQNQANIEEAKQKELEFQQMLDRLKKEQEEAEAIAAAAAEYTSIRRSRIMGLSESSSETSKLSSKSAKERRNRRKKKNQKKLSSGEEKGDAEKLSKSESEDSIRRKSFHLGVEGHRRAHEKRLSTPNQSPLSIRGSLFSARRSSRTSLFSFKGRGRDIGSETEFADDEHSIFGDNESRRGSLFVPHRPQERRSSNISQASRSPPMLPVNGKMHSAVDCNGVVSLVDGRSALMLPNGQLLPEVIIDKATSDDSGTTNQIHKKRRCSSYLLSEDMLNDPNLRQRAMSRASILTNTVEELEESRQKCPPWWYRFAHKFLIWNCSPYWIKFKKCIYFIVMDPFVDLAITICIVLNTLFMAMEHHPMTEEFKNVLAIGNLVFTGIFAAEMVLKLIAMDPYEYFQVGWNIFDSLIVTLSLVELFLADVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLIVYMMVMVIGNLVVLNLFLALLLSSFSSDNLTAIEEDPDANNLQIAVTRIKKGINYVKQTLREFILKAFSKKPKISREIRQAEDLNTKKENYISNHTLAEMSKGHNFLKEKDKISGFGSSVDKHLMEDSDGQSFIHNPSLTVTVPIAPGESDLENMNAEELSSDSDSEYSKVRLNRSSSSECSTVDNPLPGEGEEAEAEPMNSDEPEACFTDGCVWRFSCCQVNIESGKGKIWWNIRKTCYKIVEHSWFESFIVLMILLSSGALAFEDIYIERKKTIKIILEYADKIFTYIFILEMLLKWIAYGYKTYFTNAWCWLDFLIVDVSLVTLVANTLGYSDLGPIKSLRTLRALRPLRALSRFEGMRVVVNALIGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYECINTTDGSRFPASQVPNRSECFALMNVSQNVRWKNLKVNFDNVGLGYLSLLQVATFKGWTIIMYAAVDSVNVDKQPKYEYSLYMYIYFVVFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPGNKIQGCIFDLVTNQAFDISIMVLICLNMVTMMVEKEGQSQHMTEVLYWINVVFIILFTGECVLKLISLRHYYFTVGWNIFDFVVVIISIVGMFLADLIETYFVSPTLFRVIRLARIGRILRLVKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEDGINDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPPDCDPKKVHPGSSVEGDCGNPSVGIFYFVSYIIISFLVVVNMYIAVILENFSVATEESTEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDSLRSQMEERFMSANPSKVSYEPITTTLKRKQEDVSATVIQRAYRRYRLRQNVKNISSIYIKDGDRDDDLLNKKDMAFDNVNENSSPEKTDATSSTTSPPSYDSVTKPDKEKYEQDRTEKEDKGKDSKESKK,1988,NP_001352465.1.csv,NP_001352465.1_clinical_seed_0_final,NP_001352465.1.a2m,popEVE,NP_001352465.1_theta_0.2.npy,1,1988,1988
+NP_001353076.1,MPEPWGTVYFLGIAQVFSFLFSWWNLEGVMNQADAPRPLNWTIRKLCHAAFLPSVRLLKAQKSWIERAFYKRECVHIIPSTKDPHRCCCGRLIGQHVGLTPSISVLQNEKNESRLSRNDIQSEKWSISKHTQLSPTDAFGTIEFQGGGHSNKAMYVRVSFDTKPDLLLHLMTKEWQLELPKLLISVHGGLQNFELQPKLKQVFGKGLIKAAMTTGAWIFTGGVNTGVIRHVGDALKDHASKSRGKICTIGIAPWGIVENQEDLIGRDVVRPYQTMSNPMSKLTVLNSMHSHFILADNGTTGKYGAEVKLRRQLEKHISLQKINTRCLPFFSLDSRLFYSFWGSCQLDSVGIGQGVPVVALIVEGGPNVISIVLEYLRDTPPVPVVVCDGSGRASDILAFGHKYSEEGGLINESLRDQLLVTIQKTFTYTRTQAQHLFIILMECMKKKELITVFRMGSEGHQDIDLAILTALLKGANASAPDQLSLALAWNRVDIARSQIFIYGQQWPVGSLEQAMLDALVLDRVDFVKLLIENGVSMHRFLTISRLEELYNTRHGPSNTLYHLVRDVKKREYPGFGWIYFKGNLPPDYRISLIDIGLVIEYLMGGAYRCNYTRKRFRTLYHNLFGPKRPKALKLLGMEDDIPLRRGRKTTKKREEEVDIDLDDPEINHFPFPFHELMVWAVLMKRQKMALFFWQHGEEAMAKALVACKLCKAMAHEASENDMVDDISQELNHNSRDFGQLAVELLDQSYKQDEQLAMKLLTYELKNWSNATCLQLAVAAKHRDFIAHTCSQMLLTDMWMGRLRMRKNSGLKVILGILLPPSILSLEFKNKDDMPYMSQAQEIHLQEKEAEEPEKPTKEKEEEDMELTAMLGRNNGESSRKKDEEEVQSKHRLIPLGRKIYEFYNAPIVKFWFYTLAYIGYLMLFNYIVLVKMERWPSTQEWIVISYIFTLGIEKMREILMSEPGKLLQKVKVWLQEYWNVTDLIAILLFSVGMILRLQDQPFRSDGRVIYCVNIIYWYIRLLDIFGVNKYLGPYVMMIGKMMIDMMYFVIIMLVVLMSFGVARQAILFPNEEPSWKLAKNIFYMPYWMIYGEVFADQIDPPCGQNETREDGKIIQLPPCKTGAWIVPAIMACYLLVANILLVNLLIAVFNNTFFEVKSISNQVWKFQRYQLIMTFHERPVLPPPLIIFSHMTMIFQHLCCRWRKHESDPDERDYGLKLFITDDELKKVHDFEEQCIEEYFREKDDRFNSSNDERIRVTSERVENMSMRLEEVNEREHSMKASLQTVDIRLAQLEDLIGRMATALERLTGLERAESNKIRSRTSSDCTDAAYIVRQSSFNSQEGNTFKLQESIDPAGEETMSPTSPTLMPRMRSHSFYSVNMKDKGGIEKLESIFKERSLSLHRATSSHSVAKEPKAPAAPANTLAIVPDSRRPSSCIDIYVSAMDELHCDIDPLDNSVNILGLGEPSFSTPVPSTAPSSSAYATLAPTDRPPSRSIDFEDITSMDTRSFSSDYTHLPECQNPWDSEPPMYHTIERSKSSRYLATTPFLLEEAPIVKSHSFMFSPSRSYYANFGVPVKTAEYTSITDCIDTRCVNAPQAIADRAAFPGGLGDKVEDLTCCHPEREAELSHPSSDSEENEAKGRRATIAISSQEGDNSERTLSNNITVPKIERANSYSAEEPSAPYAHTRKSFSISDKLDRQRNTASLRNPFQRSKSSKPEGRGDSLSMRRLSRTSAFQSFESKHN,1744,NP_001353076.1.csv,NP_001353076.1_clinical_seed_0_final,NP_001353076.1.a2m,popEVE,NP_001353076.1_theta_0.2.npy,1,1744,1744
+NP_001354752.1,MGDDQEDDFPRRLSESMEDLSLDLGALQGSEYLQDLGLGAPSHSQPGETPDSRPTGEEPGRDSLFSSLAGSQDLSRRRSWERSRSCSESWRRLSLDASAVDEEPCLPRTLASLALNLPGGGLKTWTQGCLSGGGTPAESPGKECDSPKKRGRSRSVPVSFYEIRSPEISPGLEVPTPPVQGLEPPVLECMEKDHVEPDHVLIVQQVLQELRQYHGARQRACMSASPGGAHSNLTWFEFLSESEDGAGKNEKSDKSTSVKRRLSCLRSRVTRQKEKGKSPAHLKDKGQDARERRECVNGHQLLQGTFSGPSSCPLCGKPFLSSASLKEHPRGTLLSDGSPALSRNVGMTVSQKGGPQPTPSPAGPGTQLGPITGEMDEADSAFLKFKQTADDSLSLTSPNTESIFVEDPYTASLRSEIESDGHEFEAESWSLAVDAAYAKKQKREVVKRQDVLYELMQTEVHHVRTLKIMLKVYSRALQEELQFSSKAIGRLFPCADDLLETHSHFLARLKERRQESLEEGSDRNYVIQKIGDLLVQQFSGENGERMKEKYGVFCSGHNEAVSHYKLLLQQNKKFQNLIKKIGNFSIVRRLGVQECILLVTQRITKYPVLVERIIQNTEAGTEDYEDLTQALNLIKDIISQVDAKVSECEKGQRLREIAGKMDLKSSSKLKNGLTFRKEDMLQRQLHLEGMLCWKTTSGRLKDILAILLTDVLLLLQEKDQKYVFASVDSKPPVISLQKLIVREVANEEKAMFLISASLQGPEMYEIYTSSKEDRNAWMAHIQRAVESCPDEEEGPFSLPEEERKVVEARATRLRDFQERLSMKDQLIAQSLLEKQQIYLEMAEMGGLEDLPQPRGLFRGGDPSETLQGELILKSAMSEIEGIQSLICRQLGSANGQAEDGGSSTGPPRRAETFAGYDCTNSPTKNGSFKKKVSSTDPRPRDWRGPPNSPDLKLSDSDIPGSSEESPQVVEAPGTESDPRLPTVLESELVQRIQTLSQLLLNLQAVIAHQDSYVETQRAAIQEREKQFRLQSTRGNLLLEQERQRNFEKQREERAALEKLQSQLRHEQQRWERERQWQHQELERAGARLQEREGEARQLRERLEQERAELERQRQAYQHDLERLREAQRAVERERERLELLRRLKKQNTAPGALPPDTLAEAQPPSHPPSFNGEGLEGPRVSMLPSGVGPEYAERPEVARRDSAPTENRLAKSDVPIQLLSATNQFQRQAAVQQQIPTKLAASTKGGKDKGGKSRGSQRWESSASFDLKQQLLLNKLMGKDESTSRNRRSLSPILPGRHSPAPPPDPGFPAPSPPPADSPSEGFSLKAGGTALLPGPPAPSPLPATPLSAKEDASKEDVIFF,1361,NP_001354752.1.csv,NP_001354752.1_clinical_seed_0_final,NP_001354752.1.a2m,popEVE,NP_001354752.1_theta_0.2.npy,1,1361,1361
+NP_001354903.1,MRDEIATTVFFVTRLVKKHDKLSKQQIEDFAEKLMTILFETYRSHWHSDCPSKGQAFRCIRINNNQNKDPILERACVESNVDFSHLGLPKEMTIWVDPFEVCCRYGEKNHPFTVASFKGRWEEWELYQQISYAVSRASSDVSSGTSCDEESCSKEPRVIPKVSNPKSIYQVENLKQPFQSWLQIPRKKNVVDGRVGLLGNTYHGSQKHPKCYRPAMHRLDRYHWVNTHR,229,NP_001354903.1.csv,NP_001354903.1_clinical_seed_0_final,NP_001354903.1.a2m,popEVE,NP_001354903.1_theta_0.2.npy,1,229,229
+NP_001354906.1,MGVAGRNRPGAAWAVLLLLLLLPPLLLLAGAVPPGRGRAAGPQEDVDECAQGLDDCHADALCQNTPTSYKCSCKPGYQGEGRQCEDIDECGNELNGGCVHDCLNIPGNYRCTCFDGFMLAHDGHNCLDVDECLENNGGCQHTCVNVMGSYECCCKEGFFLSDNQHTCIHRSEEGLSCMNKDHGCSHICKEAPRGSVACECRPGFELAKNQRDCILTCNHGNGGCQHSCDDTADGPECSCHPQYKMHTDGRSCLEREDTVLEVTESNTTSVVDGDKRVKRRLLMETCAVNNGGCDRTCKDTSTGVHCSCPVGFTLQLDGKTCKDIDECQTRNGGCDHFCKNIVGSFDCGCKKGFKLLTDEKSCQDVDECSLDRTCDHSCINHPGTFACACNRGYTLYGFTHCGDTNECSINNGGCQQVCVNTVGSYECQCHPGYKLHWNKKDCVEVKGLLPTSVSPRVSLHCGKSGGGDGCFLRCHSGIHLSSGLQGAYSVTCGSSSPLRNKQQKSNDSAFGDVTTIRTSVTFKLNEGKCSLKNAELFPEGLRPALPEKHSSVKESFRYVNLTCSSGKQVPGAPGRPSTPKEMFITVEFELETNQKEVTASCDLSCIVKRTEKRLRKAIRTLRKAVHREQFHLQLSGMNLDVAKKPPRTSERQAESCGVGQGHAENQCVSCRAGTYYDGARERCILCPNGTFQNEEGQMTCEPCPRPGNSGALKTPEAWNMSECGGLCQPGEYSADGFAPCQLCALGTFQPEAGRTSCFPCGGGLATKHQGATSFQDCETRVQCSPGHFYNTTTHRCIRCPVGTYQPEFGKNNCVSCPGNTTTDFDGSTNITQCKNRRCGGELGDFTGYIESPNYPGNYPANTECTWTINPPPKRRILIVVPEIFLPIEDDCGDYLVMRKTSSSNSVTTYETCQTYERPIAFTSRSKKLWIQFKSNEGNSARGFQVPYVTYDEDYQELIEDIVRDGRLYASENHQEILKDKKLIKALFDVLAHPQNYFKYTAQESREMFPRSFIRLLRSKVSRFLRPYK,1028,NP_001354906.1.csv,NP_001354906.1_colabfold_clinical_seed_0_final,NP_001354906.1_colabfold.a2m,colabfold,NP_001354906.1_colabfold_theta_0.2.npy,1,1028,1028
+NP_001355738.1,MASYPSGSGKPKAKYPFKKRASLQASTAAPEARGGLGAPPLQSARSLPGPAPCLKHFPLDLRTSMDGKCKEIAEELFTRSLAESELRSAPYEFPEESPIEQLEERRQRLERQISQDVKLEPDILLRAKQDFLKTDSDSDLQLYKEQGEGQGDRSLRERDVLEREFQRVTISGEEKCGVPFTDLLDAAKSVVRALFIREKYMALSLQSFCPTTRRYLQQLAEKPLETRTYEQGPDTPVSADAPVHPPALEQHPYEHCEPSTMPGDLGLGLRMVRGVVHVYTRREPDEHCSEVELPYPDLQEFVADVNVLMALIINGPIKSFCYRRLQYLSSKFQMHVLLNEMKELAAQKKVPHRDFYNIRKVDTHIHASSCMNQKHLLRFIKRAMKRHLEEIVHVEQGREQTLREVFESMNLTAYDLSVDTLDVHADRNTFHRFDKFNAKYNPIGESVLREIFIKTDNRVSGKYFAHIIKEVMSDLEESKYQNAELRLSIYGRSRDEWDKLARWAVMHRVHSPNVRWLVQVPRLFDVYRTKGQLANFQEMLENIFLPLFEATVHPASHPELHLFLEHVDGFDSVDDESKPENHVFNLESPLPEAWVEEDNPPYAYYLYYTFANMAMLNHLRRQRGFHTFVLRPHCGEAGPIHHLVSAFMLAENISHGLLLRKAPVLQYLYYLAQIGIAMSPLSNNSLFLSYHRNPLPEYLSRGLMVSLSTDDPLQFHFTKEPLMEEYSIATQVWKLSSCDMCELARNSVLMSGFSHKVKSHWLGPNYTKEGPEGNDIRRTNVPDIRVGYRYETLCQELALITQAVQSEMLETIPEEAGITMSPGPQ,825,NP_001355738.1.csv,NP_001355738.1_colabfold_clinical_seed_0_final,NP_001355738.1_colabfold.a2m,colabfold,NP_001355738.1_colabfold_theta_0.2.npy,1,825,825
+NP_001355847.1,MQNSHSGVNQLGGVFVNGRPLPDSTRQKIVELAHSGARPCDISRILQVILPAPPHSPPPRPSTLNALSSFLTTHADAKVQVLDNQNVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATPEVVSKIAQYKRECPSIFAWEIRDRLLSEGVCTNDNIPSVSSINRVLRNLASEKQQMGADGMYDKLRMLNGQTGSWGTRPGWYPGTSVPGQPTQDGCQQQEGGGENTNSISSNGEDSDEAQMRLQLKRKLQRNRTSFTQEQIEALEKEFERTHYPDVFARERLAAKIDLPEARIQVWFSNRRAKWRREEKLRNQRRQASNTPSHIPISSSFSTSVYQPIPQPTTPVSSFTSGSMLGRTDTALTNTYSALPPMPSFTMANNLPMQPPVPSQTSSYSCMLPTSPSVNGRSYDTYTPPHMQTHMNSQPMGTSGTTSTGLISPGVSVPVQVPGSEPDMSQYWPRLQ,461,NP_001355847.1.csv,NP_001355847.1_colabfold_clinical_seed_0_final,NP_001355847.1_colabfold.a2m,colabfold,NP_001355847.1_colabfold_theta_0.2.npy,1,461,461
+NP_001356.1,MSQRPRAPRSALWLLAPPLLRWAPPLLTVLHSDLFQALLDILDYYEASLSESQKYRYQDEDTPPLEHSPAHLPNQANSPPVIVNTDTLEAPGYELQVNGTEGEMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVNDSILFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAEKVMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILAVNSVGLEDVMHEDAVAALKNTYDVVYLKVAKPSNAYLSDSYAPPDITTSYSQHLDNEISHSSYLGTDYPTAMTPTSPRRYSPVAKDLLGEEDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDQILSVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEEYSRFEAKIHDLREQLMNSSLGSGTASLRSNPKRGFYIRALFDYDKTKDCGFLSQALSFRFGDVLHVIDASDEEWWQARRVHSDSETDDIGFIPSKRRVERREWSRLKAKDWGSSSGSQGREDSVLSYETVTQMEVHYARPIIILGPTKDRANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDYHFVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILDVSANAVRRLQAAHLHPIAIFIRPRSLENVLEINKRITEEQARKAFDRATKLEQEFTECFSAIVEGDSFEEIYHKVKRVIEDLSGPYIWVPARERL,767,NP_001356.1.csv,refseq-DLG4-NM_001365.4_clinical_seed_0_final,refseq-DLG4-NM_001365.4.a2m,Invitae,refseq-DLG4-NM_001365.4.npy,1,767,767
+NP_001356355.1,MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFSHNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPASSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEPKVVLYKNLPIKKDKEPEGPAQAAVASGCLTRHAAREHRQNPVRGAHSQGESSPCTYITRRSVRTRTNLKEASDIKLEPNTLNGYKSSVTEPCPDSGEQLQPAPVLQEEELAHETAQKGEAKCHKSDTGMSKKKSRQGKLVKQFAKIEESTPVHDSPGKDDAVPDLMGPHSDQGEHSGTVGVPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTAKSKKKRRITRYDAQLILENNSGIPKLTLRRRHDSSSKTNDQENDGMNSSKISIKLSKDHDNDNNLYVAKLNNGFNSGSGSSSTKLKIQLKRDEENRGSYTEGLHENGVCCSDPLSLLESRMEVDDYSQYEEESTDDSSSSEGDEEEDDYDDDFEDDFIPLPPAKRLRLIVGKDSIDIDISSRRREDQSLRLNA,885,NP_001356355.1.csv,NP_001356355.1_clinical_seed_0_final,NP_001356355.1.a2m,popEVE,NP_001356355.1_theta_0.2.npy,1,885,885
+NP_001357188.2,MGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFVEHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVDLSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFAVVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAERSWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERYPMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNVREALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGEQSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQVRQKVRIVSREAEAAEAEEPWGEEAREGRRRGPRRESKPEEPPPPKKPALDKGLGTGQGAVSGPPRKPPGTVAGTARGPEGGSTAQVPAPTASPPPEGPVLTFQSEKMKGMKELLVATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL,610,NP_001357188.2.csv,NP_001357188.2_colabfold_clinical_seed_0_final,NP_001357188.2_colabfold.a2m,colabfold,NP_001357188.2_colabfold_theta_0.2.npy,1,610,610
+NP_001357524.1,MVVLRAGKKTFLPPLCRAFACRGCQLAPERGAERRDTAPSGVSRFCPPRKSCHDWIGPPDKYSNLRPVHFYIPENESPLEQKLRKLRQETQEWNQQFWANQNLTFSKEKEEFIHSRLKTKGLGLRTESGQKATLNAEEMADFYKEFLSKNFQKHMYYNRDWYKRNFAITFFMGKVALERIWNKLKQKQKKRSN,193,NP_001357524.1.csv,NP_001357524.1_colabfold_clinical_seed_0_final,NP_001357524.1_colabfold.a2m,colabfold,NP_001357524.1_colabfold_theta_0.2.npy,1,193,193
+NP_001357587.1,MSGARSKLALFLCGCYVVALGAHTGEESVADHHEAEYYVAAVYEHPSILSLNPLALISRQEALELMNQNLDIYEQQVMTAAQKDVQIIVFPEDGIHGFNFTRTSIYPFLDFMPSPQVVRWNPCLEPHRFNDTEVLQRLSCMAIRGDMFLVANLGTKEPCHSSDPRCPKDGRYQFNTNVVFSNNGTLVDRYRKHNLYFEAAFDVPLKVDLITFDTPFAGRFGIFTCFDILFFDPAIRVLRDYKVKHVVYPTAWMNQLPLLAAIEIQKAFAVAFGINVLAANVHHPVLGMTGSGIHTPLESFWYHDMENPKSHLIIAQVAKNPVGLIGAENATGETDPSHSKFLKILSGDPYCEKDAQEVHCDEATKWNVNAPPTFHSEMMYDNFTLVPVWGKEGYLHVCSNGLCCYLLYERPTLSKELYALGVFDGLHTVHGTYYIQVCALVRCGGLGFDTCGQEITEATGIFEFHLWGNFSTSYIFPLFLTSGMTLEVPDQLGWENDHYFLRKSRLSSGLVTAALYGRLYERD,523,NP_001357587.1.csv,NP_001357587.1_colabfold_clinical_seed_0_final,NP_001357587.1_colabfold.a2m,colabfold,NP_001357587.1_colabfold_theta_0.2.npy,1,523,523
+NP_001358001.1,MATVAANPAAAAAAVAAAAAVTEDREPQHEELPGLDSQWRQIENGESGRERPLRAGESWFLVEKHWYKQWEAYVQGGDQDSSTFPGCINNATLFQDEINWRLKEGLVEGEDYVLLPAAAWHYLVSWYGLEHGQPPIERKVIELPNIQKVEVYPVELLLVRHNDLGKSHTVQFSHTDSIGLVLRTARERFLVEPQEDTRLWAKNSEGSLDRLYDTHITVLDAALETGQLIIMETRKKDGTWPSAQLHVMNNNMSEEDEDFKGQPGICGLTNLGNTCFMNSALQCLSNVPQLTEYFLNNCYLEELNFRNPLGMKGEIAEAYADLVKQAWSGHHRSIVPHVFKNKVGHFASQFLGYQQHDSQELLSFLLDGLHEDLNRVKKKEYVELCDAAGRPDQEVAQEAWQNHKRRNDSVIVDTFHGLFKSTLVCPDCGNVSVTFDPFCYLSVPLPISHKRVLEVFFIPMDPRRKPEQHRLVVPKKGKISDLCVALSKHTGISPERMMVADVFSHRFYKLYQLEEPLSSILDRDDIFVYEVSGRIEAIEGSREDIVVPVYLRERTPARDYNNSYYGLMLFGHPLLVSVPRDRFTWEGLYNVLMYRLSRYVTKPNSDDEDDGDEKEDDEEDKDDVPGPSTGGSLRDPEPEQAGPSSGVTNRCPFLLDNCLGTSQWPPRRRRKQLFTLQTVNSNGTSDRTTSPEEVHAQPYIAIDWEPEMKKRYYDEVEAEGYVKHDCVGYVMKKAPVRLQECIELFTTVETLEKENPWYCPSCKQHQLATKKLDLWMLPEILIIHLKRFSYTKFSREKLDTLVEFPIRDLDFSEFVIQPQNESNPELYKYDLIAVSNHYGGMRDGHYTTFACNKDSGQWHYFDDNSVSPVNENQIESKAAYVLFYQRQDVARRLLSPAGSSGAPASPACSSPPSSEFMDVN,920,NP_001358001.1.csv,NP_001358001.1_colabfold_clinical_seed_0_final,NP_001358001.1_colabfold.a2m,colabfold,NP_001358001.1_colabfold_theta_0.2.npy,1,920,920
+NP_001358995.1,MKMLWKLTDNIKYEDCEDRHDGTSNGTARLPQLGTVGQSPYTSAPPLSHTPNADFQPPYFPPPYQPIYPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQSQESGLLHTHRGLPHQLSGLDPRRDYRRHEDLLHGPHALSSGLGDLSIHSLPHAIEEVPHVEDPGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLFGGVVNPNEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQVTRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVCAAVTALQNYLTEALKAMDKMYLSNNPNSHTDNNAKSSDKEEKHRK,439,NP_001358995.1.csv,NP_001358995.1_colabfold_clinical_seed_0_final,NP_001358995.1_colabfold.a2m,colabfold,NP_001358995.1_colabfold_theta_0.2.npy,1,439,439
+NP_001359162.1,MFWRRRKCDVMPIVLVRPTNRTRRLDSTGAGMGPSSHQQQESPLPTITHCAGCTTAWSPCSFNSPDMETPLQFQRGFFPEQPPPPPRSSHLHCQQQQQSQDKPCPPFAPLPHPHHHPHLAHQQPASGGSSPCLRCNSCASSGAPAAGAGDNLSLLLRTSSPGGAFRTRTSSPLSGSSCCCCCCSSRRGSQLNVSELTPSSHASALRQQYAQQSAQQSASASQYHQCHSLQPAASPTGSLGSLGSGPPLSHHHHHPHPAHHQHHQPQARRESNPFTEIAMSSCRYNGGVMRPLSNLSASRRNLHEMDSEAQPLQPPASVGGGGGASSPSAAAAAAAAVSSSAPEIVVSKPEHNNSNNLALYGTGGGGSTGGGGGGGGSGHGSSSGTKSSKKKNQNIGYKLGHRRALFEKRKRLSDYALIFGMFGIVVMVIETELSWGAYDKASLYSLALKCLISLSTIILLGLIIVYHAREIQLFMVDNGADDWRIAMTYERIFFICLEILVCAIHPIPGNYTFTWTARLAFSYAPSTTTADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFVMKTLMTICPGTVLLVFSISLWIIAAWTVRACERYHDQQDVTSNFLGAMWLISITFLSIGYGDMVPNTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVHNFMMDTQLTKRVKNAAANVLRETWLIYKNTKLVKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLNDQANTLVDLAKTQNIMYDMISDLNERSEDFEKRIVTLETKLETLIGSIHALPGLISQTIRQQQRDFIEAQMESYDKHVTYNAERSRSSSRRRRSSSTAPPTSSESS,857,NP_001359162.1.csv,NP_001359162.1_clinical_seed_0_final,NP_001359162.1.a2m,popEVE,NP_001359162.1_theta_0.2.npy,1,857,857
+NP_001365406.2,MLVLLLHAVVLGLPSAWAVGACARACPAACACSTVERGCSVRCDRAGLLRVPAELPCEAVSIDLDRNGLRFLGERAFGTLPSLRRLSLRHNNLSFITPGAFKGLPRLAELRLAHNGDLRYLHARTFAALSRLRRLDLAACRLFSVPERLLAELPALRELAAFDNLFRRVPGALRGLANLTHAHLERGRIEAVASSSLQGLRRLRSLSLQANRVRAVHAGAFGDCGVLEHLLLNDNLLAELPADAFRGLRRLRTLNLGGNALDRVARAWFADLAELELLYLDRNSIAFVEEGAFQNLSGLLALHLNGNRLTVLAWVAFQPGFFLGRLFLFRNPWCCDCRLEWLRDWMEGSGRVTDVPCASPGSVAGLDLSQVTFGRSSDGLCVDPEELNLTTSSPGPSPEPAATTVSRFSSLLSKLLAPRVPVEEAANTTGGLANASLSDSLSSRGVGGAGRQPWFLLASCLLPSVAQHVVFGLQMD,476,NP_001365406.2.csv,NP_001365406.2_colabfold_clinical_seed_0_final,NP_001365406.2_colabfold.a2m,colabfold,NP_001365406.2_colabfold_theta_0.2.npy,1,476,476
+NP_001365421.1,MRDLLQYIACFFAFFSAGFLIVATWTDCWMVNADDSLEVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAGFGFLTLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVERSTLVLHNIFLGIQYKFGWSCWLGMAGSLGCFLAGAVLTCCLYLFKDVGPERNYPYSLRKAYSAAGVSMAKSYSAPRTETAKMYAVDTRV,235,NP_001365421.1.csv,NP_001365421.1_colabfold_clinical_seed_0_final,NP_001365421.1_colabfold.a2m,colabfold,NP_001365421.1_colabfold_theta_0.2.npy,1,235,235
+NP_001366127.1,MAAMAPALTDAAAEAHHIRFKLAPPSSTLSPGSAENNGNANILIAANGTKRKAIAAEDPSLDFRNNPTKEDLGKLQPLVASYLCSDVTSVPSKESLKLQGVFSKQTVLKSHPLLSQSYELRAELLGRQPVLEFSLENLRTMNTSGQTALPQAPVNGLAKKLTKSSTHSDHDNSTSLNGGKRALTSSALHGGEMGGSESGDLKGGMTNCTLPHRSLDVEHTTLYSNNSTANKSSVNSMEQPALQGSSRLSPGTDSSSNLGGVKLEGKKSPLSSILFSALDSDTRITALLRRQADIESRARRLQKRLQVVQAKQVERHIQHQLGGFLEKTLSKLPNLESLRPRSQLMLTRKAEAALRKAASETTTSEGLSNFLKSNSISEELERFTASGIANLRCSEQAFDSDVTDSSSGGESDIEEEELTRADPEQRHVPLRRRSEWKWAADRAAIVSRWNWLQAHVSDLEYRIRQQTDIYKQIRANKGLIVLGEVPPPEHTTDLFLPLSSEVKTDHGTDKLIESVSQPLENHGAPIIGHISESLSTKSCGALRPVNGVINTLQPVLADHIPGDSSDAEEQLHKKQRLNLVSSSSDGTCVAARTRPVLSCKKRRLVRPNSIVPLSKKVHRNSTIRPGCDVNPSCALCGSGSINTMPPEIHYEAPLLERLSQLDSCVHPVLAFPDDVPTSLHFQSMLKSQWQNKPFDKIKPPKKLSLKHRAPMPGSLPDSARKDRHKLVSSFLTTAKLSHHQTRPDRTHRQHLDDVGAVPMVERVTAPKAERLLNPPPPVHDPNHSKMRLRDHSSERSEVLKHHTDMSSSSYLAATHHPPHSPLVRQLSTSSDSPAPASSSSQVTASTSQQPVRRRRGESSFDINNIVIPMSVAATTRVEKLQYKEILTPSWREVDLQSLKGSPDEENEEIEDLSDAAFAALHAKCEEMERARWLWTTSVPPQRRGSRSYRSSDGRTTPQLGSANPSTPQPASPDVSSSHSLSEYSHGQSPRSPISPELHSAPLTPVARDTPRHLASEDTRCSTPELGLDEQSVQPWERRTFPLAHSPQAECEDQLDAQERAARCTRRTSGSKTGRETEAAPTSPPIVPLKSRHLVAAATAQRPTHR,1105,NP_001366127.1.csv,NP_001366127.1_colabfold_clinical_seed_0_final,NP_001366127.1_colabfold.a2m,colabfold,NP_001366127.1_colabfold_theta_0.2.npy,1,1105,1105
+NP_001366380.1,MISTAPLYSGVHNWTSSDRIRMCGINEERRAPLSDEESTTGDCQHFGSQEFCVSSSFSKVELTAVGSGSNARGADPDGSATEKLGHKSEDKPDDPQPKMDYAGNVAEAEGLLVPLSSPGDGLKLPASDSAEASNSRADCSWTPLNTQMSKQVDCSPAGVKALDSRQGVGEKNTFILATLGTGVPVEGTLPLVTTNFSPLPAPICPPAPGSASVPHSVPDAFQVPLSVPAPVPHSGLVPVQVATSVPAPSPPLAPVPALAPAPPSVPTLISDSNPLSVSASVLVPVPASAPPSGPVPLSAPAPAPLSVPVSAPPLALIQAPVPPSAPTLVLAPVPTPVLAPMPASTPPAAPAPPSVPMPTPTPSSGPPSTPTLIPAFAPTPVPAPTPAPIFTPAPTPMPAATPAAIPTSAPIPASFSLSRVCFPAAQAPAMQKVPLSFQPGTVLTPSQPLVYIPPPSCGQPLSVATLPTTLGVSSTLTLPVLPSYLQDRCLPGVLASPELRSYPYAFSVARPLTSDSKLVSLEVNRLPCTSPSGSTTTQPAPDGVPGPLADTSLVTASAKVLPTPQPLLPAPSGSSAPPHPAKMPSGTEQQTEGTSVTFSPLKSPPQLEREMASPPECSEMPLDLSSKSNRQKLPLPNQRKTPPMPVLTPVHTSSKALLSTVLSRSQRTTQAAGGNVTSCLGSTSSPFVIFPEIVRNGDPSTWVKNSTALISTIPGTYVGVANPVPASLLLNKDPNLGLNRDPRHLPKQEPISIIDQGEPKGTGATCGKKGSQAGAEGQPSTVKRYTPARIAPGLPGCQTKELSLWKPTGPANIYPRCSVNGKPTSTQVLPVGWSPYHQASLLSIGISSAGQLTPSQGAPIRPTSVVSEFSGVPSLSSSEAVHGLPEGQPRPGGSFVPEQDPVTKNKTCRIAAKPYEEQVNPVLLTLSPQTGTLALSVQPSGGDIRMNQGPEESESHLCSDSTPKMEGPQGACGLKLAGDTKPKNQVLATYMSHELVLATPQNLPKMPELPLLPHDSHPKELILDVVPSSRRGSSTERPQLGSQVDLGRVKMEKVDGDVVFNLATCFRADGLPVAPQRGQAEVRAKAGQARVKQESVGVFACKNKWQPDDVTESLPPKKMKCGKEKDSEEQQLQPQAKAVVRSSHRPKCRKLPSDPQESTKKSPRGASDSGKEHNGVRGKHKHRKPTKPESQSPGKRADSHEEGSLEKKAKSSFRDFIPVVLSTRTRSQSGSICSSFAGMADSDMGSQEVFPTEEEEEVTPTPAKRRKVRKTQRDTQYRSHHAQDKSLLSQGRRHLWRAREMPWRTEAARQMWDTNEEEEEEEEEGLLKRKKRRRQKSRKYQTGEYLTEQEDEQRRKGRADLKARKQKTSSSQSLEHRLRNRNLLLPNKVQGISDSPNGFLPNNLEEPACLENSEKPSGKRKCKTKHMATVSEEAKGKGRWSQQKTRSPKSPTPVKPTEPCTPSKSRSASSEEASESPTARQIPPEARRLIVNKNAGETLLQRAARLGYKDVVLYCLQKDSEDVNHRDNAGYTALHEACSRGWTDILNILLEHGANVNCSAQDGTRPVHDAVVNDNLETIWLLLSYGADPTLATYSGQTAMKLASSDTMKRFLSDHLSDLQGRAEGDPGVSWDFYSSSVLEEKDGFACDLLHNPPGSSDQEGDDPMEEDDFMFELSDKPLLPCYNLQVSVSRGPCNWFLFSDVLKRLKLSSRIFQARFPHFEITTMPKAEFYRQVASSQLLTPAERPGGLDDRSPPGSSETVELVRYEPDLLRLLGSEVEFQSCNS,1785,NP_001366380.1.csv,NP_001366380.1_clinical_seed_0_final,NP_001366380.1.a2m,popEVE,NP_001366380.1_theta_0.2.npy,1,1785,1785
+NP_001366429.1,MAPRCPWPWPRRRRLLDVLAPLVLLLGVRAASAEPERISEEVGLLQLLGDPPPQQVTQTDDPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPPGPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGPPGPRGYPGIPGPKGESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAYIVLCIENSFMTASK,1339,NP_001366429.1.csv,NP_001366429.1_colabfold_clinical_seed_0_final,NP_001366429.1_colabfold.a2m,colabfold,NP_001366429.1_colabfold_theta_0.2.npy,1,1339,1339
+NP_001366543.1,MIPAASSTPPGDALFPSVAPQDFWRSQVTGYSGSVTRHLSHRANNFKRHPKRRKCIRPSPPPPPNTPCPLELVDFGDLHPQRSFRELLFNGCILFGIEFSYAMETAYVTPVLLQMGLPDQLYSLVWFISPILGFLLQPLLGAWSDRCTSRFGRRRPFILVLAIGALLGLSLLLNGRDIGIALADVTGNHKWGLLLTVCGVVLMDFSADSADNPSHAYMMDVCSPADQDRGLNIHALLAGLGGGFGYVVGGIHWDKTGFGRALGGQLRVIYLFTAVTLSVTTVLTLVSIPERPLRPPSEKRAAMKSPSLPLPPSPPVLPEEGPGDSLPSHTATNFSSPISPPSPLTPKYGSFISRDSSLTGISEFASSFGTANIDSVLIDCFTGGHDSYLAIPGSVPRPPISVSFPRAPDGFYRQDRGLLEGREGALTSGCDGDILRVGSLDTSKPRSSGILKRPQTLAIPDAAGGGGPETSRRRNVTFSQQVANILLNGVKYESELTGSSERAEQPLSVGRLCSTICNMPKALRTLCVNHFLGELPAKPPRWLSFEGMLLFYTDFMGEVVFQGDPKAPHTSEAYQKYNSGVTMGCWGMCIYAFSAAFYSAILEKLEEFLSVRTLYFIAYLAFGLGTGLATLSRNLYVVLSLCITYGILFSTLCTLPYSLLCDYYQSKKFAGSSADGTRRGMGVDISLLSCQYFLAQILVSLVLGPLTSAVGSANGVMYFSSLVSFLGCLYSSLFVIYEIPPSDAADEEHRPLLLNV,756,NP_001366543.1.csv,NP_001366543.1_clinical_seed_0_final,NP_001366543.1.a2m,popEVE,NP_001366543.1_theta_0.2.npy,1,756,756
+NP_001368914.1,MAALRQPQVAELLAEARRAFREEFGAEPELAVSAPGRVNLIGEHTDYNQGLVLPMALELMTVLVGSPRKDGLVSLLTTSEGADEPQRLQFPLPTAQRSLEPGTPRWANYVKGVIQYYPAAPLPGFSAVVVSSVPLGGGLSSSASLEVATYTFLQQLCPDSGTIAARAQVCQQAEHSFAGMPCGIMDQFISLMGQKGHALLIDCRSLETSLVPLSDPKLAVLITNSNVRHSLASSEYPVRRRQCEEVARALGKESLREVQLEELEAARDLVSKEGFRRARHVVGEIRRTAQAAAALRRGDYRAFGRLMVESHRSLRDDYEVSCPELDQLVEAALAVPGVYGSRMTGGGFGGCTVTLLEASAAPHAMRHIQEHYGGTATFYLSQAADGAKVLCL,392,NP_001368914.1.csv,refseq-GALK1-NM_000154.1_clinical_seed_0_final,refseq-GALK1-NM_000154.1.a2m,Invitae,refseq-GALK1-NM_000154.1.npy,1,392,392
+NP_001369746.2,MAPAGVSLRATILCLLAWAGLAAGDRVYIHPFHLVIHNESTCEQLAKANAGKPKDPTFIPAPIQAKTSPVDEKALQDQLVLVAAKLDTEDKLRAAMVGMLANFLGFRIYGMHSELWGVVHGATVLSPTAVFGTLASLYLGALDHTADRLQAILGVPWKDKNCTSRLDAHKVLSALQAVQGLLVAQGRADSQAQLLLSTVVGVFTAPGLHLKQPFVQGLALYTPVVLPRSLDFTELDVAAEKIDRFMQAVTGWKTGCSLMGASVDSTLAFNTYVHFQGKMKGFSLLAEPQEFWVDNSTSVSVPMLSGMGTFQHWSDIQDNFSVTQVPFTESACLLLIQPHYASDLDKVEGLTFQQNSLNWMKKLSPRTIHLTMPQLVLQGSYDLQDLLAQAELPAILHTELNLQKLSNDRIRVGEVLNSIFFELEADEREPTESTQQLNKPEVLEVTLNRPFLFAVYDQSATALHFLGRVANPLSTA,476,NP_001369746.2.csv,NP_001369746.2_colabfold_clinical_seed_0_final,NP_001369746.2_colabfold.a2m,colabfold,NP_001369746.2_colabfold_theta_0.2.npy,1,476,476
+NP_001373.2,MWAFSELPMPLLINLIVSLLGFVATVTLIPAFRGHFIAARLCGQDLNKTSRQQIPESQGVISGAVFLIILFCFIPFPFLNCFVKEQCKAFPHHEFVALIGALLAICCMIFLGFADDVLNLRWRHKLLLPTAASLPLLMVYFTNFGNTTIVVPKPFRPILGLHLDLGILYYVYMGLLAVFCTNAINILAGINGLEAGQSLVISASIIVFNLVELEGDCRDDHVFSLYFMIPFFFTTLGLLYHNWYPSRVFVGDTFCYFAGMTFAVVGILGHFSKTMLLFFMPQVFNFLYSLPQLLHIIPCPRHRIPRLNIKTGKLEMSYSKFKTKSLSFLGTFILKVAESLQLVTVHQSETEDGEFTECNNMTLINLLLKVLGPIHERNLTLLLLLLQILGSAITFSIRYQLVRLFYDV,408,NP_001373.2.csv,refseq-DPAGT1-NM_001382.3_clinical_seed_0_final,refseq-DPAGT1-NM_001382.3.a2m,Invitae,refseq-DPAGT1-NM_001382.3.npy,1,408,408
+NP_001374.4,MAALVVSGAAEQGGRDGPGRGRAPRGRVANQIPPEILKNPQLQAAIRVLPSNYNFEIPKTIWRIQQAQAKKVALQMPEGLLLFACTIVDILERFTEAEVMVMGDVTYGACCVDDFTARALGADFLVHYGHSCLIPMDTSAQDFRVLYVFVDIRIDTTHLLDSLRLTFPPATALALVSTIQFVSTLQAAAQELKAEYRVSVPQCKPLSPGEILGCTSPRLSKEVEAVVYLGDGRFHLESVMIANPNVPAYRYDPYSKVLSREHYDHQRMQAARQEAIATARSAKSWGLILGTLGRQGSPKILEHLESRLRALGLSFVRLLLSEIFPSKLSLLPEVDVWVQVACPRLSIDWGTAFPKPLLTPYEAAVALRDISWQQPYPMDFYAGSSLGPWTVNHGQDRRPHAPGRPARGKVQEGSARPPSAVACEDCSCRDEKVAPLAP,438,NP_001374.4.csv,NP_001374.4_colabfold_clinical_seed_0_final,NP_001374.4_colabfold.a2m,colabfold,NP_001374.4_colabfold_theta_0.2.npy,1,438,438
+NP_001374178.1,MAGTVLGVGAGVFILALLWVAVLLLCVLLSRASGAARNWELPRINGNAPSWYLSRKKDFGMKRFSVIFLFFGAVIITSVLLLFPRAGEFPAPEVEVKIVDDFFIGRYVLLAFLSAIFLGGLFLVLIHYVLEPIYAKPLHSY,141,NP_001374178.1.csv,NP_001374178.1_clinical_seed_0_final,NP_001374178.1.a2m,popEVE,NP_001374178.1_theta_0.2.npy,1,141,141
+NP_001376.1,MAAPSRLLIRGGRVVNDDFSEVADVLVEDGVVRALGHDLLPPGGAPAGLRVLDAAGKLVLPGGIDTHTHMQFPFMGSRSIDDFHQGTKAALSGGTTMIIDFAIPQKGGSLIEAFETWRSWADPKVCCDYSLHVAVTWWSDQVKEEMKILVQDKGVNSFKMFMAYKDLYMVTDLELYEAFSRCKEIGAIAQVHAENGDLIAEGAKKMLALGITGPEGHELCRPEAVEAEATLRAITIASAVNCPLYIVHVMSKSAAKVIADARRDGKVVYGEPIAASLGTDGTHYWNKEWHHAAHHVMGPPLRPDPSTPDFLMNLLANDDLTTTGTDNCTFNTCQKALGKDDFTKIPNGVNGVEDRMSVIWEKGVHSGKMDENRFVAVTSTNAAKIFNLYPRKGRIAVGSDADIVIWDPKGTRTISAKTHHQAVNFNIFEGMVCHGVPLVTISRGKVVYEAGVFSVTAGDGKFIPRKPFAEYIYKRIKQRDRTCTPTPVERAPYKGEVATLKSRVTKEDATAGTRKQAHP,519,NP_001376.1.csv,refseq-DPYS-NM_001385.2_clinical_seed_0_final,refseq-DPYS-NM_001385.2.a2m,Invitae,refseq-DPYS-NM_001385.2.npy,1,519,519
+NP_001377.1,MSYQGKKNIPRITSDRLLIKGGKIVNDDQSFYADIYMEDGLIKQIGENLIVPGGVKTIEAHSRMVIPGGIDVHTRFQMPDQGMTSADDFFQGTKAALAGGTTMIIDHVVPEPGTSLLAAFDQWREWADSKSCCDYSLHVDISEWHKGIQEEMEALVKDHGVNSFLVYMAFKDRFQLTDCQIYEVLSVIRDIGAIAQVHAENGDIIAEEQQRILDLGITGPEGHVLSRPEEVEAEAVNRAITIANQTNCPLYITKVMSKSSAEVIAQARKKGTVVYGEPITASLGTDGSHYWSKNWAKAAAFVTSPPLSPDPTTPDFLNSLLSCGDLQVTGSAHCTFNTAQKAVGKDNFTLIPEGTNGTEERMSVIWDKAVVTGKMDENQFVAVTSTNAAKVFNLYPRKGRIAVGSDADLVIWDPDSVKTISAKTHNSSLEYNIFEGMECRGSPLVVISQGKIVLEDGTLHVTEGSGRYIPRKPFPDFVYKRIKARSRLAELRGVPRGLYDGPVCEVSVTPKTVTPASSAKTSPAKQQAPPVRNLHQSGFSLSGAQIDDNIPRRTTQRIVAPPGGRANITSLG,572,NP_001377.1.csv,refseq-DPYSL2-NM_001386.5_clinical_seed_0_final,refseq-DPYSL2-NM_001386.5.a2m,Invitae,refseq-DPYSL2-NM_001386.5.npy,1,572,572
+NP_001380.2,MWILALSLFQSFANVFSEDLHSSLYFVNASLQEVVFASTTGTLVPCPAAGIPPVTLRWYLATGEEIYDVPGIRHVHPNGTLQIFPFPPSSFSTLIHDNTYYCTAENPSGKIRSQDVHIKAVLREPYTVRVEDQKTMRGNVAVFKCIIPSSVEAYITVVSWEKDTVSLVSGSRFLITSTGALYIKDVQNEDGLYNYRCITRHRYTGETRQSNSARLFVSDPANSAPSILDGFDHRKAMAGQRVELPCKALGHPEPDYRWLKDNMPLELSGRFQKTVTGLLIENIRPSDSGSYVCEVSNRYGTAKVIGRLYVKQPLKATISPRKVKSSVGSQVSLSCSVTGTEDQELSWYRNGEILNPGKNVRITGINHENLIMDHMVKSDGGAYQCFVRKDKLSAQDYVQVVLEDGTPKIISAFSEKVVSPAEPVSLMCNVKGTPLPTITWTLDDDPILKGGSHRISQMITSEGNVVSYLNISSSQVRDGGVYRCTANNSAGVVLYQARINVRGPASIRPMKNITAIAGRDTYIHCRVIGYPYYSIKWYKNSNLLPFNHRQVAFENNGTLKLSDVQKEVDEGEYTCNVLVQPQLSTSQSVHVTVKVPPFIQPFEFPRFSIGQRVFIPCVVVSGDLPITITWQKDGRPIPGSLGVTIDNIDFTSSLRISNLSLMHNGNYTCIARNEAAAVEHQSQLIVRVPPKFVVQPRDQDGIYGKAVILNCSAEGYPVPTIVWKFSKGAGVPQFQPIALNGRIQVLSNGSLLIKHVVEEDSGYYLCKVSNDVGADVSKSMYLTVKIPAMITSYPNTTLATQGQKKEMSCTAHGEKPIIVRWEKEDRIINPEMARYLVSTKEVGEEVISTLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPPEIEIKDVKARTITLRWTMGFDGNSPITGYDIECKNKSDSWDSAQRTKDVSPQLNSATIIDIHPSSTYSIRMYAKNRIGKSEPSNELTITADEAAPDGPPQEVHLEPISSQSIRVTWKAPKKHLQNGIIRGYQIGYREYSTGGNFQFNIISVDTSGDSEVYTLDNLNKFTQYGLVVQACNRAGTGPSSQEIITTTLEDVPSYPPENVQAIATSPESISISWSTLSKEALNGILQGFRVIYWANLMDGELGEIKNITTTQPSLELDGLEKYTNYSIQVLAFTRAGDGVRSEQIFTRTKEDVPGPPAGVKAAAASASMVFVSWLPPLKLNGIIRKYTVFCSHPYPTVISEFEASPDSFSYRIPNLSRNRQYSVWVVAVTSAGRGNSSEIITVEPLAKAPARILTFSGTVTTPWMKDIVLPCKAVGDPSPAVKWMKDSNGTPSLVTIDGRRSIFSNGSFIIRTVKAEDSGYYSCIANNNWGSDEIILNLQVQVPPDQPRLTVSKTTSSSITLSWLPGDNGGSSIRGYILQYSEDNSEQWGSFPISPSERSYRLENLKCGTWYKFTLTAQNGVGPGRISEIIEAKTLGKEPQFSKEQELFASINTTRVRLNLIGWNDGGCPITSFTLEYRPFGTTVWTTAQRTSLSKSYILYDLQEATWYELQMRVCNSAGCAEKQANFATLNYDGSTIPPLIKSVVQNEEGLTTNEGLKMLVTISCILVGVLLLFVLLLVVRRRRREQRLKRLRDAKSLAEMLMSKNTRTSDTLSKQQQTLRMHIDIPRAQLLIEERDTMETIDDRSTVLLTDADFGEAAKQKSLTVTHTVHYQSVSQATGPLVDVSDARPGTNPTTRRNAKAGPTARNRYASQWTLNRPHPTISAHTLTTDWRLPTPRAAGSVDKESDSYSVSPSQDTDRARSSMVSTESASSTYEELARAYEHAKMEEQLRHAKFTITECFISDTSSEQLTAGTNEYTDSLTSSTPSESGICRFTASPPKPQDGGRVMNMAVPKAHRPGDLIHLPPYLRMDFLLNRGGPGTSRDLSLGQACLEPQKSRTLKRPTVLEPIPMEAASSASSTREGQSWQPGAVATLPQREGAELGQAAKMSSSQESLLDSRGHLKGNNPYAKSYTLV,2012,NP_001380.2.csv,refseq-DSCAM-NM_001389.3_clinical_seed_0_final,refseq-DSCAM-NM_001389.3.a2m,Invitae,refseq-DSCAM-NM_001389.3.npy,1,2012,2012
+NP_001380429.1,MSPAIALAFLPLVVTLLVRYRHYFRLLVRTVLLRSLRDCLSGLRIEERAFSYVLTHALPGDPGHILTTLDHWSSRCEYLSHMGPVKGQILMRLVEEKAPACVLELGTYCGYSTLLIARALPPGGRLLTVERDPRTAAVAEKLIRLAGFDEHMVELIVGSSEDVIPCLRTQYQLSRADLVLLAHRPRCYLRDLQLLEAHALLPAGATVLADHVLFPGAPRFLQYAKSCGRYRCRLHHTGLPDFPAIKDGIAQLTYAGPG,258,NP_001380429.1.csv,NP_001380429.1_clinical_seed_0_final,NP_001380429.1.a2m,popEVE,NP_001380429.1_theta_0.2.npy,1,258,258
+NP_001387.2,MHTGGETSACKPSSVRLAPSFSFHAAGLQMAGQMPHSHQYSDRRQPNISDQQVSALSYSDQIQQPLTNQVMPDIVMLQRRMPQTFRDPATAPLRKLSVDLIKTYKHINEVYYAKKKRRHQQGQGDDSSHKKERKVYNDGYDDDNYDYIVKNGEKWMDRYEIDSLIGKGSFGQVVKAYDRVEQEWVAIKIIKNKKAFLNQAQIEVRLLELMNKHDTEMKYYIVHLKRHFMFRNHLCLVFEMLSYNLYDLLRNTNFRGVSLNLTRKFAQQMCTALLFLATPELSIIHCDLKPENILLCNPKRSAIKIVDFGSSCQLGQRIYQYIQSRFYRSPEVLLGMPYDLAIDMWSLGCILVEMHTGEPLFSGANEVDQMNKIVEVLGIPPAHILDQAPKARKFFEKLPDGTWNLKKTKDGKREYKPPGTRKLHNILGVETGGPGGRRAGESGHTVADYLKFKDLILRMLDYDPKTRIQPYYALQHSFFKKTADEGTNTSNSVSTSPAMEQSQSSGTTSSTSSSSGGSSGTSNSGRARSDPTHQHRHSGGHFTAAVQAMDCETHSPQVRQQFPAPLGWSGTEAPTQVTVETHPVQETTFHVAPQQNALHHHHGNSSHHHHHHHHHHHHHGQQALGNRTRPRVYNSPTNSSSTQDSMEVGHSHHSMTSLSSSTTSSSTSSSSTGNQGNQAYQNRPVAANTLDFGQNGAMDVNLTVYSNPRQETGIAGHPTYQFSANTGPAHYMTEGHLTMRQGADREESPMTGVCVQQSPVASS,763,NP_001387.2.csv,refseq-DYRK1A-NM_001396.3_clinical_seed_0_final,refseq-DYRK1A-NM_001396.3.a2m,Invitae,refseq-DYRK1A-NM_001396.3.npy,1,763,763
+NP_001388.1,MRGVWPPPVSALLSALGMSTYKRATLDEEDLVDSLSEGDAYPNGLQVNFHSPRSGQRCWAARTQVEKRLVVLVVLLAAGLVACLAALGIQYQTRSPSVCLSEACVSVTSSILSSMDPTVDPCHDFFSYACGGWIKANPVPDGHSRWGTFSNLWEHNQAIIKHLLENSTASVSEAERKAQVYYRACMNETRIEELRAKPLMELIERLGGWNITGPWAKDNFQDTLQVVTAHYRTSPFFSVYVSADSKNSNSNVIQVDQSGLGLPSRDYYLNKTENEKVLTGYLNYMVQLGKLLGGGDEEAIRPQMQQILDFETALANITIPQEKRRDEELIYHKVTAAELQTLAPAINWLPFLNTIFYPVEINESEPIVVYDKEYLEQISTLINTTDRCLLNNYMIWNLVRKTSSFLDQRFQDADEKFMEVMYGTKKTCLPRWKFCVSDTENNLGFALGPMFVKATFAEDSKSIATEIILEIKKAFEESLSTLKWMDEETRKSAKEKADAIYNMIGYPNFIMDPKELDKVFNDYTAVPDLYFENAMRFFNFSWRVTADQLRKAPNRDQWSMTPPMVNAYYSPTKNEIVFPAGILQAPFYTRSSPKALNFGGIGVVVGHELTHAFDDQGREYDKDGNLRPWWKNSSVEAFKRQTECMVEQYSNYSVNGEPVNGRHTLGENIADNGGLKAAYRAYQNWVKKNGAEHSLPTLGLTNNQLFFLGFAQVWCSVRTPESSHEGLITDPHSPSRFRVIGSLSNSKEFSEHFRCPPGSPMNPPHKCEVW,770,NP_001388.1.csv,refseq-ECE1-NM_001397.2_clinical_seed_0_final,refseq-ECE1-NM_001397.2.a2m,Invitae,refseq-ECE1-NM_001397.2.npy,1,770,770
+NP_001390.1,MGYPEVERRELLPAAAPRERGSQGCGCGGAPARAGEGNSCLLFLGFFGLSLALHLLTLCCYLELRSELRRERGAESRLGGSGTPGTSGTLSSLGGLDPDSPITSHLGQPSPKQQPLEPGEAALHSDSQDGHQMALLNFFFPDEKPYSEEESRRVRRNKRSKSNEGADGPVKNKKKGKKAGPPGPNGPPGPPGPPGPQGPPGIPGIPGIPGTTVMGPPGPPGPPGPQGPPGLQGPSGAADKAGTRENQPAVVHLQGQGSAIQVKNDLSGGVLNDWSRITMNPKVFKLHPRSGELEVLVDGTYFIYSQVEVYYINFTDFASYEVVVDEKPFLQCTRSIETGKTNYNTCYTAGVCLLKARQKIAVKMVHADISINMSKHTTFFGAIRLGEAPAS,391,NP_001390.1.csv,refseq-EDA-NM_001399.4_clinical_seed_0_final,refseq-EDA-NM_001399.4.a2m,Invitae,refseq-EDA-NM_001399.4.npy,1,391,391
+NP_001412.1,MAITLQPSDLIFEFASNGMDDDIHQLEDPSVFPAVIVEQVPYPDLLHLYSGLELDDVHNGIITDGTLCMTQDQILEGSFLLTDDNEATSHTMSTAEVLLNMESPSDILDEKQIFSTSEMLPDSDPAPAVTLPNYLFPASEPDALNRAGDTSDQEGHSLEEKASREESAKKTGKSKKRIRKTKGNRSTSPVTDPSIPIRKKSKDGKGSTIYLWEFLLALLQDRNTCPKYIKWTQREKGIFKLVDSKAVSKLWGKQKNKPDMNYETMGRALRYYYQRGILAKVEGQRLVYQFKEMPKDLVVIEDEDESSEATAAPPQASTASVASASTTRRTSSRVSSRSAPQGKGSSSWEKPKIQHVGLQPSASLELGPSLDEEIPTTSTMLVSPAEGQVKLTKAVSASSVPSNIHLGVAPVGSGSALTLQTIPLTTVLTNGPPASTTAPTQLVLQSVPAASTFKDTFTLQASFPLNASFQDSQVAAPGAPLILSGLPQLLAGANRPTNPAPPTVTGAGPAGPSSQPPGTVIAAFIRTSGTTAAPRVKEGPLRSSSYVQGMVTGAPMEGLLVPEETLRELLRDQAHLQPLPTQVVSRGSHNPSLLGNQTLSPPSRPTVGLTPVAELELSSGSGSLLMAEPSVTTSGSLLTRSPTPAPFSPFNPTSLIKMEPHDI,663,NP_001412.1.csv,refseq-ELF4-NM_001421.3_clinical_seed_0_final,refseq-ELF4-NM_001421.3.a2m,Invitae,refseq-ELF4-NM_001421.3.npy,1,663,663
+NP_001421.2,MTADKEKKRSSSERRKEKSRDAARCRRSKETEVFYELAHELPLPHSVSSHLDKASIMRLAISFLRTHKLLSSVCSENESEAEADQQMDNLYLKALEGFIAVVTQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLSLKNGSGFGKKSKDMSTERDFFMRMKCTVTNRGRTVNLKSATWKVLHCTGQVKVYNNCPPHNSLCGYKEPLLSCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMDMKFTYCDDRITELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLCTKGQVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKNDVVFSMDQTESLFKPHLMAMNSIFDSSGKGAVSEKSNFLFTKLKEEPEELAQLAPTPGDAIISLDFGNQNFEESSAYGKAILPPSQPWATELRSHSTQSEAGSLPAFTVPQAAAPGSTTPSATSSSSSCSTPNSPEDYYTSLDNDLKIEVIEKLFAMDTEAKDQCSTQTDFNELDLETLAPYIPMDGEDFQLSPICPEERLLAENPQSTPQHCFSAMTNIFQPLAPVAPHSPFLLDKFQQQLESKKTEPEHRPMSSIFFDAGSKASLPPCCGQASTPLSSMGGRSNTQWPPDPPLHFGPTKWAVGDQRTEFLGAAPLGPPVSPPHVSTFKTRSAKGFGARGPDVLSPAMVALSNKLKLKRQLEYEEQAFQDLSGGDPPGGSTSHLMWKRMKNLRGGSCPLMPDKPLSANVPNDKFTQNPMRGLGHPLRHLPLPQPPSAISPGENSKSRFPPQCYATQYQDYSLSSAHKVSGMASRLLGPSFESYLLPELTRYDCEVNVPVLGSSTLLQGGDLLRALDQAT,870,NP_001421.2.csv,refseq-EPAS1-NM_001430.4_clinical_seed_0_final,refseq-EPAS1-NM_001430.4.a2m,Invitae,refseq-EPAS1-NM_001430.4.npy,1,870,870
+NP_001431.1,MTGYTMLRNGGAGNGGQTCMLRWSNRIRLTWLSFTLFVILVFFPLIAHYYLTTLDEADEAGKRIFGPRVGNELCEVKHVLDLCRIRESVSEELLQLEAKRQELNSEIAKLNLKIEACKKSIENAKQDLLQLKNVISQTEHSYKELMAQNQPKLSLPIRLLPEKDDAGLPPPKATRGCRLHNCFDYSRCPLTSGFPVYVYDSDQFVFGSYLDPLVKQAFQATARANVYVTENADIACLYVILVGEMQEPVVLRPAELEKQLYSLPHWRTDGHNHVIINLSRKSDTQNLLYNVSTGRAMVAQSTFYTVQYRPGFDLVVSPLVHAMSEPNFMEIPPQVPVKRKYLFTFQGEKIESLRSSLQEARSFEEEMEGDPPADYDDRIIATLKAVQDSKLDQVLVEFTCKNQPKPSLPTEWALCGEREDRLELLKLSTFALIITPGDPRLVISSGCATRLFEALEVGAVPVVLGEQVQLPYQDMLQWNEAALVVPKPRVTEVHFLLRSLSDSDLLAMRRQGRFLWETYFSTADSIFNTVLAMIRTRIQIPAAPIREEAAAEIPHRSGKAAGTDPNMADNGDLDLGPVETEPPYASPRYLRNFTLTVTDFYRSWNCAPGPFHLFPHTPFDPVLPSEAKFLGSGTGFRPIGGGAGGSGKEFQAALGGNVPREQFTVVMLTYEREEVLMNSLERLNGLPYLNKVVVVWNSPKLPSEDLLWPDIGVPIMVVRTEKNSLNNRFLPWNEIETEAILSIDDDAHLRHDEIMFGFRVWREARDRIVGFPGRYHAWDIPHQSWLYNSNYSCELSMVLTGAAFFHKYYAYLYSYVMPQAIRDMVDEYINCEDIAMNFLVSHITRKPPIKVTSRWTFRCPGCPQALSHDDSHFHERHKCINFFVKVYGYMPLLYTQFRVDSVLFKTRLPHDKTKCFKFI,919,NP_001431.1.csv,refseq-EXTL3-NM_001440.3_clinical_seed_0_final,refseq-EXTL3-NM_001440.3.a2m,Invitae,refseq-EXTL3-NM_001440.3.npy,1,919,919
+NP_001439.2,MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGSTCCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGHLYMQNSELFKDLFVELKRYYVVGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDVVSKVSVVNPTAQCTHALLKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTSKPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR,556,NP_001439.2.csv,refseq-GPC4-NM_001448.2_clinical_seed_0_final,refseq-GPC4-NM_001448.2.a2m,Invitae,refseq-GPC4-NM_001448.2.npy,1,556,556
+NP_001440.2,MAEKFDCHYCRDPLQGKKYVQKDGHHCCLKCFDKFCANTCVECRKPIGADSKEVHYKNRFWHDTCFRCAKCLHPLANETFVAKDNKILCNKCTTREDSPKCKGCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTGSFFPKGEDFYCVTCHETKFAKHCVKCNKAITSGGITYQDQPWHADCFVCVTCSKKLAGQRFTAVEDQYYCVDCYKNFVAKKCAGCKNPITGFGKGSSVVAYEGQSWHDYCFHCKKCSVNLANKRFVFHQEQVYCPDCAKKL,280,NP_001440.2.csv,refseq-FHL1-NM_001449.4_clinical_seed_0_final,refseq-FHL1-NM_001449.4.a2m,Invitae,refseq-FHL1-NM_001449.4.npy,1,280,280
+NP_001442.2,MSSAPEKQQPPHGGGGGGGGGGGAAMDPASSGPSKAKKTNAGIRRPEKPPYSYIALIVMAIQSSPTKRLTLSEIYQFLQSRFPFFRGSYQGWKNSVRHNLSLNECFIKLPKGLGRPGKGHYWTIDPASEFMFEEGSFRRRPRGFRRKCQALKPMYSMMNGLGFNHLPDTYGFQGSAGGLSCPPNSLALEGGLGMMNGHLPGNVDGMALPSHSVPHLPSNGGHSYMGGCGGAAAGEYPHHDSSVPASPLLPTGAGGVMEPHAVYSGSAAAWPPSASAALNSGASYIKQQPLSPCNPAANPLSGSLSTHSLEQPYLHQNSHNAPAELQGIPRYHSQSPSMCDRKEFVFSFNAMASSSMHSAGGGSYYHQQVTYQDIKPCVM,379,NP_001442.2.csv,refseq-FOXF1-NM_001451.2_clinical_seed_0_final,refseq-FOXF1-NM_001451.2.a2m,Invitae,refseq-FOXF1-NM_001451.2.npy,1,379,379
+NP_001444.2,MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGMARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAVKDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPLSPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAPPPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSLYSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAGGSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGGDLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLYRTSGAFVYDCSKF,553,NP_001444.2.csv,refseq-FOXC1-NM_001453.2_clinical_seed_0_final,refseq-FOXC1-NM_001453.2.a2m,Invitae,refseq-FOXC1-NM_001453.2.npy,1,553,553
+NP_001457.1,MRPRSALPRLLLPLLLLPAAGPAQFHGEKGISIPDHGFCQPISIPLCTDIAYNQTIMPNLLGHTNQEDAGLEVHQFYPLVKVQCSPELRFFLCSMYAPVCTVLEQAIPPCRSICERARQGCEALMNKFGFQWPERLRCEHFPRHGAEQICVGQNHSEDGAPALLTTAPPPGLQPGAGGTPGGPGGGGAPPRYATLEHPFHCPRVLKVPSYLSYKFLGERDCAAPCEPARPDGSMFFSQEETRFARLWILTWSVLCCASTFFTVTTYLVDMQRFRYPERPIIFLSGCYTMVSVAYIAGFVLQERVVCNERFSEDGYRTVVQGTKKEGCTILFMMLYFFSMASSIWWVILSLTWFLAAGMKWGHEAIEANSQYFHLAAWAVPAVKTITILAMGQIDGDLLSGVCFVGLNSLDPLRGFVLAPLFVYLFIGTSFLLAGFVSLFRIRTIMKHDGTKTEKLERLMVRIGVFSVLYTVPATIVIACYFYEQAFREHWERSWVSQHCKSLAIPCPAHYTPRMSPDFTVYMIKYLMTLIVGITSGFWIWSGKTLHSWRKFYTRLTNSRHGETTV,565,NP_001457.1.csv,refseq-FZD2-NM_001466.3_clinical_seed_0_final,refseq-FZD2-NM_001466.3.a2m,Invitae,refseq-FZD2-NM_001466.3.npy,1,565,565
+NP_001461.1,MLLLLLLAPLFLRPPGAGGAQTPNATSEGCQIIHPPWEGGIRYRGLTRDQVKAINFLPVDYEIEYVCRGEREVVGPKVRKCLANGSWTDMDTPSRCVRICSKSYLTLENGKVFLTGGDLPALDGARVDFRCDPDFHLVGSSRSICSQGQWSTPKPHCQVNRTPHSERRAVYIGALFPMSGGWPGGQACQPAVEMALEDVNSRRDILPDYELKLIHHDSKCDPGQATKYLYELLYNDPIKIILMPGCSSVSTLVAEAARMWNLIVLSYGSSSPALSNRQRFPTFFRTHPSATLHNPTRVKLFEKWGWKKIATIQQTTEVFTSTLDDLEERVKEAGIEITFRQSFFSDPAVPVKNLKRQDARIIVGLFYETEARKVFCEVYKERLFGKKYVWFLIGWYADNWFKIYDPSINCTVDEMTEAVEGHITTEIVMLNPANTRSISNMTSQEFVEKLTKRLKRHPEETGGFQEAPLAYDAIWALALALNKTSGGGGRSGVRLEDFNYNNQTITDQIYRAMNSSSFEGVSGHVVFDASGSRMAWTLIEQLQGGSYKKIGYYDSTKDDLSWSKTDKWIGGSPPADQTLVIKTFRFLSQKLFISVSVLSSLGIVLAVVCLSFNIYNSHVRYIQNSQPNLNNLTAVGCSLALAAVFPLGLDGYHIGRNQFPFVCQARLWLLGLGFSLGYGSMFTKIWWVHTVFTKKEEKKEWRKTLEPWKLYATVGLLVGMDVLTLAIWQIVDPLHRTIETFAKEEPKEDIDVSILPQLEHCSSRKMNTWLGIFYGYKGLLLLLGIFLAYETKSVSTEKINDHRAVGMAIYNVAVLCLITAPVTMILSSQQDAAFAFASLAIVFSSYITLVVLFVPKMRRLITRGEWQSEAQDTMKTGSSTNNNEEEKSRLLEKENRELEKIIAEKEERVSELRHQLQSRQQLRSRRHPPTPPEPSGGLPRGPPEPPDRLSCDGSRVHLLYK,961,NP_001461.1.csv,refseq-GABBR1-NM_001470.3_clinical_seed_0_final,refseq-GABBR1-NM_001470.3.a2m,Invitae,refseq-GABBR1-NM_001470.3.npy,1,961,961
+NP_001469.1,MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQSPRRPELPDLAPEPRYAHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDLTKAFDPAELRAASATREQEFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSILVPGLSLQAASGQEVYQVNLTASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNRQLQLVTYSSRSYQTNTADTVRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNISALVTIATKTFLRYDRLRALITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGKGWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLERTPLDLVGGAVREISGFATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVGFDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPGSLDESQMAKHRLLFFKHRLQCMTSQ,533,NP_001469.1.csv,refseq-B4GALNT1-NM_001478.4_clinical_seed_0_final,refseq-B4GALNT1-NM_001478.4.a2m,Invitae,refseq-B4GALNT1-NM_001478.4.npy,1,533,533
+NP_001473.1,MLRVRCLRGGSRGAEAVHYIGSRLGRTLTGWVQRTFQSTQAATASSRNSCAADDKATEPLPKDCPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYWPFYQKQGGHYFPKDHLKKAVAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGLYSAMPRDILIVVGNEIIEAPMAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADELYNQDYPIHSVEDRHKLAAQGKFVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWMRRHLAPDYRVHIISFKDPNPMHIDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITPPTPIIPDDHPLWMSSKWLSMNVLMLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNANSLGGGFHCWTCDVRRRGTLQSYLD,423,NP_001473.1.csv,refseq-GATM-NM_001482.2_clinical_seed_0_final,refseq-GATM-NM_001482.2.a2m,Invitae,refseq-GATM-NM_001482.2.npy,1,423,423
+NP_001482.1,MPLSMRYLFIISVSSVIIFIVFSVFNFGGDPSFQRLNISDPLRLTQVCTSFINGKTRFLWKNKLMIHEKSSCKEYLTQSHYITAPLSKEEADFPLAYIMVIHHHFDTFARLFRAIYMPQNIYCVHVDEKATTEFKDAVEQLLSCFPNAFLASKMEPVVYGGISRLQADLNCIRDLSAFEVSWKYVINTCGQDFPLKTNKEIVQYLKGFKGKNITPGVLPPAHAIGRTKYVHQEHLGKELSYVIRTTALKPPPPHNLTIYFGSAYVALSREFANFVLHDPRAVDLLQWSKDTFSPDEHFWVTLNRIPGVPGSMPNASWTGNLRAIKWSDMEDRHGGCHGHYVHGICIYGNGDLKWLVNSPSLFANKFELNTYPLTVECLELRHRERTLNQSETAIQPSWYF,400,NP_001482.1.csv,refseq-GCNT2-NM_001491.2_clinical_seed_0_final,refseq-GCNT2-NM_001491.2.a2m,Invitae,refseq-GCNT2-NM_001491.2.npy,1,400,400
+NP_001483.3,MPPPQQGPCGHHLLLLLALLLPSLPLTRAPVPPGPAAALLQALGLRDEPQGAPRLRPVPPVMWRLFRRRDPQETRSGSRRTSPGVTLQPCHVEELGVAGNIVRHIPDRGAPTRASEPASAAGHCPEWTVVFDLSAVEPAERPSRARLELRFAAAAAAAPEGGWELSVAQAGQGAGADPGPVLLRQLVPALGPPVRAELLGAAWARNASWPRSLRLALALRPRAPAACARLAEASLLLVTLDPRLCHPLARPRRDAEPVLGGGPGGACRARRLYVSFREVGWHRWVIAPRGFLANYCQGQCALPVALSGSGGPPALNHAVLRALMHAAAPGAADLPCCVPARLSPISVLFFDNSDNVVLRQYEDMVVDECGCR,372,NP_001483.3.csv,refseq-GDF1-NM_001492.4_clinical_seed_0_final,refseq-GDF1-NM_001492.4.a2m,Invitae,refseq-GDF1-NM_001492.4.npy,1,372,372
+NP_001484.1,MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESSSITPLEELYKRFQLLEGPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVVEGSFVYKGGKIYKVPSTETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEGVDPQTTSMRDVYRKFDLGQDVIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLARYGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVARCKQLICDPSYIPDRVRKAGQVIRIICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISYAHNVAAQGKYIAIASTTVETTDPEKEVEPALELLEPIDQKFVAISDLYEPIDDGCESQVFCSCSYDATTHFETTCNDIKDIYKRMAGTAFDFENMKRKQNDVFGEAEQ,447,NP_001484.1.csv,refseq-GDI1-NM_001493.2_clinical_seed_0_final,refseq-GDI1-NM_001493.2.a2m,Invitae,refseq-GDI1-NM_001493.2.npy,1,447,447
+NP_001501.2,MEVFPFLLVLSVWWSRTWDSANADSIIHIGAIFDESAKKDDEVFRTAVGDLNQNEEILQTEKITFSVTFVDGNNPFQAVQEACELMNQGILALVSSIGCTSAGSLQSLADAMHIPHLFIQRSTAGTPRSGCGLTRSNRNDDYTLSVRPPVYLHDVILRVVTEYAWQKFIIFYDSEYDIRGIQEFLDKVSQQGMDVALQKVENNINKMITTLFDTMRIEELNRYRDTLRRAILVMNPATAKSFITEVVETNLVAFDCHWIIINEEINDVDVQELVRRSIGRLTIIRQTFPVPQNISQRCFRGNHRISSTLCDPKDPFAQNMEISNLYIYDTVLLLANAFHKKLEDRKWHSMASLSCIRKNSKPWQGGRSMLETIKKGGVSGLTGELEFGENGGNPNVHFEILGTNYGEELGRGVRKLGCWNPVTGLNGSLTDKKLENNMRGVVLRVVTVLEEPFVMVSENVLGKPKKYQGFSIDVLDALSNYLGFNYEIYVAPDHKYGSPQEDGTWNGLVGELVFKRADIGISALTITPDRENVVDFTTRYMDYSVGVLLRRAEKTVDMFACLAPFDLSLWACIAGTVLLVGLLVYLLNWLNPPRLQMGSMTSTTLYNSMWFVYGSFVQQGGEVPYTTLATRMMMGAWWLFALIVISSYTANLAAFLTITRIESSIQSLQDLSKQTEIPYGTVLDSAVYEHVRMKGLNPFERDSMYSQMWRMINRSNGSENNVLESQAGIQKVKYGNYAFVWDAAVLEYVAINDPDCSFYTIGNTVADRGYGIALQHGSPYRDVFSQRILELQQNGDMDILKHKWWPKNGQCDLYSSVDTKQKGGALDIKSFAGVFCILAAGIVLSCFIAMLETWWNKRKGSRVPSKEDDKEIDLEHLHRRVNSLCTDDDSPHKQFSTSSIDLTPLDIDTLPTRQALEQISDFRNTHITTTTFIPEQIQTLSRTLSAKAASGFTFGNVPEHRTGPFRHRAPNGGFFRSPIKTMSSIPYQPTPTLGLNLGNDPDRGTSI,1007,NP_001501.2.csv,refseq-GRID2-NM_001510.3_clinical_seed_0_final,refseq-GRID2-NM_001510.3.a2m,Invitae,refseq-GRID2-NM_001510.3.npy,1,1007,1007
+NP_001510.2,MTGRVCRGCGGTDIELDAARGDAVCTACGSVLEDNIIVSEVQFVESSGGGSSAVGQFVSLDGAGKTPTLGGGFHVNLGKESRAQTLQNGRRHIHHLGNQLQLNQHCLDTAFNFFKMAVSRHLTRGRKMAHVIAACLYLVCRTEGTPHMLLDLSDLLQVNVYVLGKTFLLLARELCINAPAIDPCLYIPRFAHLLEFGEKNHEVSMTALRLLQRMKRDWMHTGRRPSGLCGAALLVAARMHDFRRTVKEVISVVKVCESTLRKRLTEFEDTPTSQLTIDEFMKIDLEEECDPPSYTAGQRKLRMKQLEQVLSKKLEEVEGEISSYQDAIEIELENSRPKAKGGLASLAKDGSTEDTASSLCGEEDTEDEELEAAASHLNKDLYRELLGGAPGSSEAAGSPEWGGRPPALGSLLDPLPTAASLGISDSIRECISSQSSDPKDASGDGELDLSGIDDLEIDRYILNESEARVKAELWMRENAEYLREQREKEARIAKEKELGIYKEHKPKKSCKRREPIQASTAREAIEKMLEQKKISSKINYSVLRGLSSAGGGSPHREDAQPEHSASARKLSRRRTPASRSGADPVTSVGKRLRPLVSTQPAKKVATGEALLPSSPTLGAEPARPQAVLVESGPVSYHADEEADEEEPDEEDGEPCVSALQMMGSNDYGCDGDEDDGY,677,NP_001510.2.csv,refseq-BRF1-NM_001519.3_clinical_seed_0_final,refseq-BRF1-NM_001519.3.a2m,Invitae,refseq-BRF1-NM_001519.3.npy,1,677,677
+NP_001511.2,MDALESLLDEVALEGLDGLCLPALWSRLETRVPPFPLPLEPCTQEFLWRALATHPGISFYEEPRERPDLQLQDRYEEIDLETGILESRRDPVALEDVYPIHMILENKDGIQGSCRYFKERKNITNDIRTKSLQPRCTMVEAFDRWGKKLIIVASQAMRYRALIGQEGDPDLKLPDFSYCILERLGRSRWQGELQRDLHTTAFKVDAGKLHYHRKILNKNGLITMQSHVIRLPTGAQQHSILLLLNRFHVDRRSKYDILMEKLSVMLSTRTNHIETLGKLREELGLCERTFKRLYQYMLNAGLAKVVSLRLQEIHPECGPCKTKKGTDVMVRCLKLLKEFKRNDHDDDEDEEVISKTVPPVDIVFERDMLTQTYDLIERRGTKGISQAEIRVAMNVGKLEARMLCRLLQRFKVVKGFMEDEGRQRTTKYISCVFAEESDLSRQYQREKARSELLTTVSLASMQEESLLPEGEDTFLSESDSEEERSSSKRRGRGSQKDTRASANLRPKTQPHHSTPTKGGWKVVNLHPLKKQPPSFPGAAEERACQSLASRDSLLDTSSVSEPNVSFVSHCADSNSGDIAVIEEVRMENPKESSSSLKTGRHSSGQDKPHETYRLLKRRNLIIEAVTNLRLIESLFTIQKMIMDQEKQEGVSTKCCKKSIVRLVRNLSEEGLLRLYRTTVIQDGIKKKVDLVVHPSMDQNDPLVRSAIEQVRFRISNSSTANRVKTSQPPVPQGEAEEDSQGKEGPSGSGDSQLSASSRSESGRMKKSDNKMGITPLRNYHPIVVPGLGRSLGFLPKMPRLRVVHMFLWYLIYGHPASNTVEKPSFISERRTIKQESGRAGVRPSSSGSAWEACSEAPSKGSQDGVTWEAEVELATETVYVDDASWMRYIPPIPVHRDFGFGWALVSDILLCLPLSIFIQIVQVSYKVDNLEEFLNDPLKKHTLIRFLPRPIRQQLLYKRRYIFSVVENLQRLCYMGLLQFGPTEKFQDKDQVFIFLKKNAVIVDTTICDPHYNLARSSRPFERRLYVLNSMQDVENYWFDLQCVCLNTPLGVVRCPRVRKNSSTDQGSDEEGSLQKEQESAMDKHNLERKCAMLEYTTGSREVVDEGLIPGDGLGAAGLDSSFYGHLKRNWIWTSYIINQAKKENTAAENGLTVRLQTFLSKRPMPLSARGNSRLNIWGEARVGSELCAGWEEQFEVDREPSLDRNRRVRGGKSQKRKRLKKDPGKKIKRKKKGEFPGEKSKRLRYHDEADQSALQRMTRLRVTWSMQEDGLLVLCRIASNVLNTKVKGPFVTWQVVRDILHATFEESLDKTSHSVGRRARYIVKNPQAYLNYKVCLAEVYQDKALVGDFMNRRGDYDDPKVCANEFKEFVEKLKEKFSSALRNSNLEIPDTLQELFARYRVLAIGDEKDQTRKEDELNSVDDIHFLVLQNLIQSTLALSDSQMKSYQSFQTFRLYREYKDHVLVKAFMECQKRSLVNRRRVNHTLGPKKNRALPFVPMSYQLSQTYYRIFTWRFPSTICTESFQFLDRMRAAGKLDQPDRFSFKDQDNNEPTNDMVAFSLDGPGGNCVAVLTLFSLGLISVDVRIPEQIIVVDSSMVENEVIKSLGKDGSLEDDEDEEDDLDEGVGGKRRSMEVKPAQASHTNYLLMRGYYSPGIVSTRNLNPNDSIVVNSCQMKFQLRCTPVPARLRPAAAPLEELTMGTSCLPDTFTKLINPQENTCSLEEFVLQLELSGYSPEDLTAALEILEAIIATGCFGIDKEELRRRFSALEKAGGGRTRTFADCIQALLEQHQVLEVGGNTARLVAMGSAWPWLLHSVRLKDREDADIQREDPQARPLEGSSSEDSPPEGQAPPSHSPRGTKRRASWASENGETDAEGTQMTPAKRPALQDSNLAPSLGPGAEDGAEAQAPSPPPALEDTAAAGAAQEDQEGVGEFSSPGQEQLSGQAQPPEGSEDPRGFTESFGAANISQAARERDCESVCFIGRPWRVVDGHLNLPVCKGMMEAMLYHIMTRPGIPESSLLRHYQGVLQPVAVLELLQGLESLGCIRKRWLRKPRPVSLFSTPVVEEVEVPSSLDESPMAFYEPTLDCTLRLGRVFPHEVNWNKWIHL,2109,NP_001511.2.csv,refseq-GTF3C1-NM_001520.3_clinical_seed_0_final,refseq-GTF3C1-NM_001520.3.a2m,Invitae,refseq-GTF3C1-NM_001520.3_theta_0.2.npy,1,2109,2109
+NP_001521.1,MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVMRLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYMGLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGRTMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSKTFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQVTTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYVVSGIIQHDLIFSLQQTECVLKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTETDDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEPNPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVTVFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYRDTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKRKMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLACRLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN,826,NP_001521.1.csv,refseq-HIF1A-NM_001530.3_clinical_seed_0_final,refseq-HIF1A-NM_001530.3.a2m,Invitae,refseq-HIF1A-NM_001530.3.npy,1,826,826
+NP_001529.2,MQEAPAALPTEPGPSPVPAFLGKLWALVGDPGTDHLIRWSPSGTSFLVSDQSRFAKEVLPQYFKHSNMASFVRQLNMYGFRKVVSIEQGGLLRPERDHVEFQHPSFVRGREQLLERVRRKVPALRGDDGRWRPEDLGRLLGEVQALRGVQESTEARLRELRQQNEILWREVVTLRQSHGQQHRVIGKLIQCLFGPLQAGPSNAGGKRKLSLMLDEGSSCPTPAKFNTCPLPGALLQDPYFIQSPSTYSLSQRQIWALALTGPGAPSSLTSQKTLHPLRGPGFLPPVMAGAPPPLPVAVVQAILEGKGSFSPEGPRNAQQPEPGDPREIPDRGPLGLESGDRSPESLLPPMLLQPPQESVEPAGPLDVLGPSLQGREWTLMDLDMELSLMQPLVPERGEPELAVKGLNSPSPGKDPTLGAPLLLDVQAALGGPALGLPGALTIYSTPESRTASYLGPEASPSP,462,NP_001529.2.csv,refseq-HSF4-NM_001538.3_clinical_seed_0_final,refseq-HSF4-NM_001538.3.a2m,Invitae,refseq-HSF4-NM_001538.3.npy,1,462,462
+NP_001531.1,MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQWLGGSSWPGYVRPLPPAAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVNHFAPDELTVKTKDGVVEITGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTLTVEAPMPKLATQSNEITIPVTFESRAQLGGPEAAKSDETAAK,205,NP_001531.1.csv,refseq-HSPB1-NM_001540.3_clinical_seed_0_final,refseq-HSPB1-NM_001540.3.a2m,Invitae,refseq-HSPB1-NM_001540.3.npy,1,205,205
+NP_001534.1,MPALACLRRLCRHVSPQAVLFLLFIFCLFSVFISAYYLYGWKRGLEPSADAPEPDCGDPPPVAPSRLLPLKPVQAATPSRTDPLVLVFVESLYSQLGQEVVAILESSRFKYRTEIAPGKGDMPTLTDKGRGRFALIIYENILKYVNLDAWNRELLDKYCVAYGVGIIGFFKANENSLLSAQLKGFPLFLHSNLGLKDCSINPKSPLLYVTRPSEVEKGVLPGEDWTVFQSNHSTYEPVLLAKTRSSESIPHLGADAGLHAALHATVVQDLGLHDGIQRVLFGNNLNFWLHKLVFVDAVAFLTGKRLSLPLDRYILVDIDDIFVGKEGTRMKVEDVKALFDTQNELRAHIPNFTFNLGYSGKFFHTGTNAEDAGDDLLLSYVKEFWWFPHMWSHMQPHLFHNQSVLAEQMALNKKFAVEHGIPTDMGYAVAPHHSGVYPVHVQLYEAWKQVWSIRVTSTEEYPHLKPARYRRGFIHNGIMVLPRQTCGLFTHTIFYNEYPGGSSELDKIINGGELFLTVLLNPISIFMTHLSNYGNDRLGLYTFKHLVRFLHSWTNLRLQTLPPVQLAQKYFQIFSEEKDPLWQDPCEDKRHKDIWSKEKTCDRFPKLLIIGPQKTGTTALYLFLGMHPDLSSNYPSSETFEEIQFFNGHNYHKGIDWYMEFFPIPSNTTSDFYFEKSANYFDSEVAPRRAAALLPKAKVLTILINPADRAYSWYQHQRAHDDPVALKYTFHEVITAGSDASSKLRALQNRCLVPGWYATHIERWLSAYHANQILVLDGKLLRTEPAKVMDMVQKFLGVTNTIDYHKTLAFDPKKGFWCQLLEGGKTKCLGKSKGRKYPEMDLDSRAFLKDYYRDHNIELSKLLYKMGQTLPTWLREDLQNTR,882,NP_001534.1.csv,refseq-NDST1-NM_001543.4_clinical_seed_0_final,refseq-NDST1-NM_001543.4.a2m,Invitae,refseq-NDST1-NM_001543.4.npy,1,882,882
+NP_001547.1,MSWSPSLTTQTCGAWEMKERLGTGGFGNVIRWHNQETGEQIAIKQCRQELSPRNRERWCLEIQIMRRLTHPNVVAARDVPEGMQNLAPNDLPLLAMEYCQGGDLRKYLNQFENCCGLREGAILTLLSDIASALRYLHENRIIHRDLKPENIVLQQGEQRLIHKIIDLGYAKELDQGSLCTSFVGTLQYLAPELLEQQKYTVTVDYWSFGTLAFECITGFRPFLPNWQPVQWHSKVRQKSEVDIVVSEDLNGTVKFSSSLPYPNNLNSVLAERLEKWLQLMLMWHPRQRGTDPTYGPNGCFKALDDILNLKLVHILNMVTGTIHTYPVTEDESLQSLKARIQQDTGIPEEDQELLQEAGLALIPDKPATQCISDGKLNEGHTLDMDLVFLFDNSKITYETQISPRPQPESVSCILQEPKRNLAFFQLRKVWGQVWHSIQTLKEDCNRLQQGQRAAMMNLLRNNSCLSKMKNSMASMSQQLKAKLDFFKTSIQIDLEKYSEQTEFGITSDKLLLAWREMEQAVELCGRENEVKLLVERMMALQTDIVDLQRSPMGRKQGGTLDDLEEQARELYRRLREKPRDQRTEGDSQEMVRLLLQAIQSFEKKVRVIYTQLSKTVVCKQKALELLPKVEEVVSLMNEDEKTVVRLQEKRQKELWNLLKIACSKVRGPVSGSPDSMNASRLSQPGQLMSQPSTASNSLPEPAKKSEELVAEAHNLCTLLENAIQDTVREQDQSFTALDWSWLQTEEEEHSCLEQAS,756,NP_001547.1.csv,refseq-IKBKB-NM_001556.2_clinical_seed_0_final,refseq-IKBKB-NM_001556.2.a2m,Invitae,refseq-IKBKB-NM_001556.2.npy,1,756,756
+NP_001548.1,MEDFNMESDSFEDFWKGEDLSNYSYSSTLPPFLLDAAPCEPESLEINKYFVVIIYALVFLLSLLGNSLVMLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNGWIFGTFLCKVVSLLKEVNFYSGILLLACISVDRYLAIVHATRTLTQKRYLVKFICLSIWGLSLLLALPVLLFRRTVYSSNVSPACYEDMGNNTANWRMLLRILPQSFGFIVPLLIMLFCYGFTLRTLFKAHMGQKHRAMRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQETCERRNHIDRALDATEILGILHSCLNPLIYAFIGQKFRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTSTTL,360,NP_001548.1.csv,refseq-CXCR2-NM_001557.3_clinical_seed_0_final,refseq-CXCR2-NM_001557.3.a2m,Invitae,refseq-CXCR2-NM_001557.3_theta_0.2.npy,1,360,360
+NP_001549.2,MLPCLVVLLAALLSLRLGSDAHGTELPSPPSVWFEAEFFHHILHWTPIPNQSESTCYEVALLRYGIESWNSISNCSQTLSYDLTAVTLDLYHSNGYRARVRAVDGSRHSNWTVTNTRFSVDEVTLTVGSVNLEIHNGFILGKIQLPRPKMAPANDTYESIFSHFREYEIAIRKVPGNFTFTHKKVKHENFSLLTSGEVGEFCVQVKPSVASRSNKGMWSKEECISLTRQYFTVTNVIIFFAFVLLLSGALAYCLALQLYVRRRKKLPSVLLFKKPSPFIFISQRPSPETQDTIHPLDEEAFLKVSPELKNLDLHGSTDSGFGSTKPSLQTEEPQFLLPDPHPQADRTLGNREPPVLGDSCSSGSSNSTDSGICLQEPSLSPSTGPTWEQQVGSNSRGQDDSGIDLVQNSEGRAGDTQGGSALGHHSPPEPEVPGEEDPAAVAFQGYLRQTRCAEEKATKTGCLEEESPLTDGLGPKFGRCLVDEAGLHPPALAKGYLKQDPLEMTLASSGAPTGQWNQPTEEWSLLALSSCSDLGISDWSFAHDLAPLGCVAAPGGLLGSFNSDLVTLPLISSLQSSE,578,NP_001549.2.csv,refseq-IL10RA-NM_001558.3_clinical_seed_0_final,refseq-IL10RA-NM_001558.3.a2m,Invitae,refseq-IL10RA-NM_001558.3.npy,1,578,578
+NP_001554.2,MYLETRRAIFVFWIFLQVQGTKDISINIYHSETKDIDNPPRNETTESTEKMYKMSTMRRIFDLAKHRTKRSAFFPTGVKVCPQESMKQILDSLQAYYRLRVCQEAVWEAYRIFLDRIPDTGEYQDWVSICQQETFCLFDIGKNFSNSQEHLDLLQQRIKQRSFPDRKDEISAEKTLGEPGETIVISTDVANVSLGPFPLTPDDTLLNEILDNTLNDTKMPTTERETEFAVLEEQRVELSVSLVNQKFKAELADSQSPYYQELAGKSQLQMQKIFKKLPGFKKIHVLGFRPKKEKDGSSSTEMQLTAIFKRHSAEAKSPASDLLSFDSNKIESEEVYHGTMEEDKQPEIYLTATDLKRLISKALEEEQSLDVGTIQFTDEIAGSLPAFGPDTQSELPTSFAVITEDATLSPELPPVEPQLETVDGAEHGLPDTSWSPPAMASTSLSEAPPFFMASSIFSLTDQGTTDTMATDQTMLVPGLTIPTSDYSAISQLALGISHPPASSDDSRSSAGGEDMVRHLDEMDLSDTPAPSEVPELSEYVSVPDHFLEDTTPVSALQYITTSSMTIAPKGRELVVFFSLRVANMAFSNDLFNKSSLEYRALEQQFTQLLVPYLRSNLTGFKQLEILNFRNGSVIVNSKMKFAKSVPYNLTKAVHGVLEDFRSAAAQQLHLEIDSYSLNIEPADQADPCKFLACGEFAQCVKNERTEEAECRCKPGYDSQGSLDGLEPGLCGPGTKECEVLQGKGAPCRLPDHSENQAYKTSVKKFQNQQNNKVISKRNSELLTVEYEEFNHQDWEGN,797,NP_001554.2.csv,refseq-IMPG1-NM_001563.3_clinical_seed_0_final,refseq-IMPG1-NM_001563.3.a2m,Invitae,refseq-IMPG1-NM_001563.3.npy,1,797,797
+NP_001596.2,MDSTLTASEIRQRFIDFFKRNEHTYVHSSATIPLDDPTLLFANAGMNQFKPIFLNTIDPSHPMAKLSRAANTQKCIRAGGKHNDLDDVGKDVYHHTFFEMLGSWSFGDYFKELACKMALELLTQEFGIPIERLYVTYFGGDEAAGLEADLECKQIWQNLGLDDTKILPGNMKDNFWEMGDTGPCGPCSEIHYDRIGGRDAAHLVNQDDPNVLEIWNLVFIQYNREADGILKPLPKKSIDTGMGLERLVSVLQNKMSNYDTDLFVPYFEAIQKGTGARPYTGKVGAEDADGIDMAYRVLADHARTITVALADGGRPDNTGRGYVLRRILRRAVRYAHEKLNASRGFFATLVDVVVQSLGDAFPELKKDPDMVKDIINEEEVQFLKTLSRGRRILDRKIQSLGDSKTIPGDTAWLLYDTYGFPVDLTGLIAEEKGLVVDMDGFEEERKLAQLKSQGKGAGGEDLIMLDIYAIEELRARGLEVTDDSPKYNYHLDSSGSYVFENTVATVMALRREKMFVEEVSTGQECGVVLDKTCFYAEQGGQIYDEGYLVKVDDSSEDKTEFTVKNAQVRGGYVLHIGTIYGDLKVGDQVWLFIDEPRRRPIMSNHTATHILNFALRSVLGEADQKGSLVAPDRLRFDFTAKGAMSTQQIKKAEEIANEMIEAAKAVYTQDCPLAAAKAIQGLRAVFDETYPDPVRVVSIGVPVSELLDDPSGPAGSLTSVEFCGGTHLRNSSHAGAFVIVTEEAIAKGIRRIVAVTGAEAQKALRKAESLKKCLSVMEAKVKAQTAPNKDVQREIADLGEALATAVIPQWQKDELRETLKSLKKVMDDLDRASKADVQKRVLEKTKQFIDSNPNQPLVILEMESGASAKALNEALKLFKMHSPQTSAMLFTVDNEAGKITCLCQVPQNAANRGLKASEWVQQVSGLMDGKGGGKDVSAQATGKNVGCLQEALQLATSFAQLRLGDVKN,968,NP_001596.2.csv,refseq-AARS1-NM_001605.2_clinical_seed_0_final,refseq-AARS1-NM_001605.2.a2m,Invitae,refseq-AARS1-NM_001605.2.npy,1,968,968
+NP_001600.1,MEGLAVRLLRGSRLLRRNFLTCLSSWKIPPHVSKSSQSEALLNITNNGIHFAPLQTFTDEEMMIKSSVKKFAQEQIAPLVSTMDENSKMEKSVIQGLFQQGLMGIEVDPEYGGTGASFLSTVLVIEELAKVDASVAVFCEIQNTLINTLIRKHGTEEQKATYLPQLTTEKVGSFCLSEAGAGSDSFALKTRADKEGDYYVLNGSKMWISSAEHAGLFLVMANVDPTIGYKGITSFLVDRDTPGLHIGKPENKLGLRASSTCPLTFENVKVPEANILGQIGHGYKYAIGSLNEGRIGIAAQMLGLAQGCFDYTIPYIKERIQFGKRLFDFQGLQHQVAHVATQLEAARLLTYNAARLLEAGKPFIKEASMAKYYASEIAGQTTSKCIEWMGGVGYTKDYPVEKYFRDAKIGTIYEGASNIQLNTIAKHIDAEY,432,NP_001600.1.csv,refseq-ACADSB-NM_001609.3_clinical_seed_0_final,refseq-ACADSB-NM_001609.3.a2m,Invitae,refseq-ACADSB-NM_001609.3.npy,1,432,432
+NP_001605.1,MEEEIAALVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVMDSGDGVTHTVPIYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMESCGIHETTFNSIMKCDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF,375,NP_001605.1.csv,refseq-ACTG1-NM_001614.3_clinical_seed_0_final,refseq-ACTG1-NM_001614.3.a2m,Invitae,refseq-ACTG1-NM_001614.3.npy,1,375,375
+NP_001616.1,MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSELGKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKEKLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNEKALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI,239,NP_001616.1.csv,refseq-AK2-NM_001625.3_clinical_seed_0_final,refseq-AK2-NM_001625.3.a2m,Invitae,refseq-AK2-NM_001625.3.npy,1,239,239
+NP_001617.1,MNEVSVIKEGWLHKRGEYIKTWRPRYFLLKSDGSFIGYKERPEAPDQTLPPLNNFSVAECQLMKTERPRPNTFVIRCLQWTTVIERTFHVDSPDEREEWMRAIQMVANSLKQRAPGEDPMDYKCGSPSDSSTTEEMEVAVSKARAKVTMNDFDYLKLLGKGTFGKVILVREKATGRYYAMKILRKEVIIAKDEVAHTVTESRVLQNTRHPFLTALKYAFQTHDRLCFVMEYANGGELFFHLSRERVFTEERARFYGAEIVSALEYLHSRDVVYRDIKLENLMLDKDGHIKITDFGLCKEGISDGATMKTFCGTPEYLAPEVLEDNDYGRAVDWWGLGVVMYEMMCGRLPFYNQDHERLFELILMEEIRFPRTLSPEAKSLLAGLLKKDPKQRLGGGPSDAKEVMEHRFFLSINWQDVVQKKLLPPFKPQVTSEVDTRYFDDEFTAQSITITPPDRYDSLGLLELDQRTHFPQFSYSASIRE,481,NP_001617.1.csv,refseq-AKT2-NM_001626.4_clinical_seed_0_final,refseq-AKT2-NM_001626.4.a2m,Invitae,refseq-AKT2-NM_001626.4.npy,1,481,481
+NP_001640.1,MEGAEPRARPERLAEAETRAADGGRLVEVQLSGGAPWGFTLKGGREHGEPLVITKIEEGSKAAAVDKLLAGDEIVGINDIGLSGFRQEAICLVKGSHKTLKLVVKRRSELGWRPHSWHATKFSDSHPELAASPFTSTSGCPSWSGRHHASSSSHDLSSSWEQTNLQRTLDHFSSLGSVDSLDHPSSRLSVAKSNSSIDHLGSHSKRDSAYGSFSTSSSTPDHTLSKADTSSAENILYTVGLWEAPRQGGRQAQAAGDPQGSEEKLSCFPPRVPGDSGKGPRPEYNAEPKLAAPGRSNFGPVWYVPDKKKAPSSPPPPPPPLRSDSFAATKSHEKAQGPVFSEAAAAQHFTALAQAQPRGDRRPELTDRPWRSAHPGSLGKGSGGPGCPQEAHADGSWPPSKDGASSRLQASLSSSDVRFPQSPHSGRHPPLYSDHSPLCADSLGQEPGAASFQNDSPPQVRGLSSCDQKLGSGWQGPRPCVQGDLQAAQLWAGCWPSDTALGALESLPPPTVGQSPRHHLPQPEGPPDARETGRCYPLDKGAEGCSAGAQEPPRASRAEKASQRLAASITWADGESSRICPQETPLLHSLTQEGKRRPESSPEDSATRPPPFDAHVGKPTRRSDRFATTLRNEIQMHRAKLQKSRSTVALTAAGEAEDGTGRWRAGLGGGTQEGPLAGTYKDHLKEAQARVLRATSFKRRDLDPNPGDLYPESLEHRMGDPDTVPHFWEAGLAQPPSSTSGGPHPPRIGGRRRFTAEQKLKSYSEPEKMNEVGLTRGYSPHQHPRTSEDTVGTFADRWKFFEETSKPVPQRPAQKQALHGIPRDKPERPRTAGRTCEGTEPWSRTTSLGDSLNAHSAAEKAGTSDLPRRLGTFAEYQASWKEQRKPLEARSSGRCHSADDILDVSLDPQERPQHVHGRSRSSPSTDHYKQEASVELRRQAGDPGEPREELPSAVRAEEGQSTPRQADAQCREGSPGSQQHPPSQKAPNPPTFSELSHCRGAPELPREGRGRAGTLPRDYRYSEESTPADLGPRAQSPGSPLHARGQDSWPVSSALLSKRPAPQRPPPPKREPRRYRATDGAPADAPVGVLGRPFPTPSPASLDVYVARLSLSHSPSVFSSAQPQDTPKATVCERGSQHVSGDASRPLPEALLPPKQQHLRLQTATMETSRSPSPQFAPQKLTDKPPLLIQDEDSTRIERVMDNNTTVKMVPIKIVHSESQPEKESRQSLACPAEPPALPHGLEKDQIKTLSTSEQFYSRFCLYTRQGAEPEAPHRAQPAEPQPLGTQVPPEKDRCTSPPGLSYMKAKEKTVEDLKSEELAREIVGKDKSLADILDPSVKIKTTMDLMEGIFPKDEHLLEEAQQRRKLLPKIPSPRSTEERKEEPSVPAAVSLATNSTYYSTSAPKAELLIKMKDLQEQQEHEEDSGSDLDHDLSVKKQELIESISRKLQVLREARESLLEDVQANTVLGAEVEAIVKGVCKPSEFDKFRMFIGDLDKVVNLLLSLSGRLARVENALNNLDDGASPGDRQSLLEKQRVLIQQHEDAKELKENLDRRERIVFDILANYLSEESLADYEHFVKMKSALIIEQRELEDKIHLGEEQLKCLLDSLQPERGK,1616,NP_001640.1.csv,refseq-SHROOM2-NM_001649.3_clinical_seed_0_final,refseq-SHROOM2-NM_001649.3.a2m,Invitae,refseq-SHROOM2-NM_001649.3.npy,1,1616,1616
+NP_001642.1,MKKEVCSVAFLKAVFAEFLATLIFVFFGLGSALKWPSALPTILQIALAFGLAIGTLAQALGPVSGGHINPAITLALLVGNQISLLRAFFYVAAQLVGAIAGAGILYGVAPLNARGNLAVNALNNNTTQGQAMVVELILTFQLALCIFASTDSRRTSPVGSPALSIGLSVTLGHLVGIYFTGCSMNPARSFGPAVVMNRFSPAHWVFWVGPIVGAVLAAILYFYLLFPNSLSLSERVAIIKGTYEPDEDWEEQREERKKTMELTTR,265,NP_001642.1.csv,refseq-AQP5-NM_001651.3_clinical_seed_0_final,refseq-AQP5-NM_001651.3.a2m,Invitae,refseq-AQP5-NM_001651.3.npy,1,265,265
+NP_001671.2,MTGLSMDGGGSPKGDVDPFYYDYETVRNGGLIFAGLAFIVGLLILLSRRFRCGGNKKRRQINEDEP,66,NP_001671.2.csv,refseq-FXYD2-NM_001680.4_clinical_seed_0_final,refseq-FXYD2-NM_001680.4.a2m,Invitae,refseq-FXYD2-NM_001680.4.npy,1,66,66
+NP_001672.1,MENAHTKTVEEVLGHFGVNESTGLSLEQVKKLKERWGSNELPAEEGKTLLELVIEQFEDLLVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILVANAIVGVWQERNAENAIEALKEYEPEMGKVYRQDRKSVQRIKAKDIVPGDIVEIAVGDKVPADIRLTSIKSTTLRVDQSILTGESVSVIKHTDPVPDPRAVNQDKKNMLFSGTNIAAGKAMGVVVATGVNTEIGKIRDEMVATEQERTPLQQKLDEFGEQLSKVISLICIAVWIINIGHFNDPVHGGSWIRGAIYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFILDRVEGDTCSLNEFTITGSTYAPIGEVHKDDKPVNCHQYDGLVELATICALCNDSALDYNEAKGVYEKVGEATETALTCLVEKMNVFDTELKGLSKIERANACNSVIKQLMKKEFTLEFSRDRKSMSVYCTPNKPSRTSMSKMFVKGAPEGVIDRCTHIRVGSTKVPMTSGVKQKIMSVIREWGSGSDTLRCLALATHDNPLRREEMHLEDSANFIKYETNLTFVGCVGMLDPPRIEVASSVKLCRQAGIRVIMITGDNKGTAVAICRRIGIFGQDEDVTSKAFTGREFDELNPSAQRDACLNARCFARVEPSHKSKIVEFLQSFDEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNFSTIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAALGFPEALIPVQLLWVNLVTDGLPATALGFNPPDLDIMNKPPRNPKEPLISGWLFFRYLAIGCYVGAATVGAAAWWFIAADGGPRVSFYQLSHFLQCKEDNPDFEGVDCAIFESPYPMTMALSVLVTIEMCNALNSLSENQSLLRMPPWENIWLVGSICLSMSLHFLILYVEPLPLIFQITPLNVTQWLMVLKISLPVILMDETLKFVARNYLEPAILE,997,NP_001672.1.csv,refseq-ATP2A2-NM_001681.3_clinical_seed_0_final,refseq-ATP2A2-NM_001681.3.a2m,Invitae,refseq-ATP2A2-NM_001681.3_theta_0.2.npy,1,997,997
+NP_001681.2,MDFSKLPKILDEDKESTFGYVHGVSGPVVTACDMAGAAMYELVRVGHSELVGEIIRLEGDMATIQVYEETSGVSVGDPVLRTGKPLSVELGPGIMGAIFDGIQRPLSDISSQTQSIYIPRGVNVSALSRDIKWDFTPCKNLRVGSHITGGDIYGIVSENSLIKHKIMLPPRNRGTVTYIAPPGNYDTSDVVLELEFEGVKEKFTMVQVWPVRQVRPVTEKLPANHPLLTGQRVLDALFPCVQGGTTAIPGAFGCGKTVISQSLSKYSNSDVIIYVGCGERGNEMSEVLRDFPELTMEVDGKVESIMKRTALVANTSNMPVAAREASIYTGITLSEYFRDMGYHVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLASFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSVNWLISYSKYMRALDEYYDKHFTEFVPLRTKAKEILQEEEDLAEIVQLVGKASLAETDKITLEVAKLIKDDFLQQNGYTPYDRFCPFYKTVGMLSNMIAFYDMARRAVETTAQSDNKITWSIIREHMGDILYKLSSMKFKDPLKDGEAKIKSDYAQLLEDMQNAFRSLED,617,NP_001681.2.csv,refseq-ATP6V1A-NM_001690.3_clinical_seed_0_final,refseq-ATP6V1A-NM_001690.3.a2m,Invitae,refseq-ATP6V1A-NM_001690.3.npy,1,617,617
+NP_001683.2,MAMEIDSRPGGLPGSSCNLGAAREHMQAVTRNYITHPRVTYRTVCSVNGPLVVLDRVKFAQYAEIVHFTLPDGTQRSGQVLEVAGTKAIVQVFEGTSGIDARKTTCEFTGDILRTPVSEDMLGRVFNGSGKPIDKGPVVMAEDFLDINGQPINPHSRIYPEEMIQTGISPIDVMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKAVLDYHDDNFAIVFAAMGVNMETARFFKSDFEQNGTMGNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRGGSITQIPILTMPNDDITHPIPDLTGFITEGQIYVDRQLHNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHGDVSNQLYACYAIGKDVQAMKAVVGEEALTSEDLLYLEFLQKFEKNFINQGPYENRSVFESLDLGWKLLRIFPKEMLKRIPQAVIDEFYSREGALQDLAPDTAL,513,NP_001683.2.csv,refseq-ATP6V1B1-NM_001692.3_clinical_seed_0_final,refseq-ATP6V1B1-NM_001692.3.a2m,Invitae,refseq-ATP6V1B1-NM_001692.3.npy,1,513,513
+NP_001684.2,MALRAMRGIVNGAAPELPVPTGGPAVGAREQALAVSRNYLSQPRLTYKTVSGVNGPLVILDHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGTSGIDAKKTSCEFTGDILRTPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINPQCRIYPEEMIQTGISAIDGMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKDVVDYSEENFAIVFAAMGVNMETARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDDITHPIPDLTGYITEGQIYVDRQLHNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHADVSNQLYACYAIGKDVQAMKAVVGEEALTSDDLLYLEFLQKFERNFIAQGPYENRTVFETLDIGWQLLRIFPKEMLKRIPQSTLSEFYPRDSAKH,511,NP_001684.2.csv,refseq-ATP6V1B2-NM_001693.3_clinical_seed_0_final,refseq-ATP6V1B2-NM_001693.3.a2m,Invitae,refseq-ATP6V1B2-NM_001693.3.npy,1,511,511
+NP_001687.1,MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKKEKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRLSKVVKDTTRYQVLLDGLVLQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATKNDVDVQIDQESYLPEDIAGGVEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGANANRKFLD,226,NP_001687.1.csv,refseq-ATP6V1E1-NM_001696.3_clinical_seed_0_final,refseq-ATP6V1E1-NM_001696.3.a2m,Invitae,refseq-ATP6V1E1-NM_001696.3.npy,1,226,226
+NP_001689.1,MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGGPAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKSDKKVRTIIIRSEVPGIFCAGADLKERAKMSSSEVGPFVSKIRAVINDIANLPVPTIAAIDGLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE,339,NP_001689.1.csv,refseq-AUH-NM_001698.2_clinical_seed_0_final,refseq-AUH-NM_001698.2.a2m,Invitae,refseq-AUH-NM_001698.2.npy,1,339,339
+NP_001702.1,MWPLWRLVSLLALSQALPFEQRGFWDFTLDDGPFMMNDEEASGADTSGVLDPDSVTPTYSAMCPFGCHCHLRVVQCSDLGLKSVPKEISPDTTLLDLQNNDISELRKDDFKGLQHLYALVLVNNKISKIHEKAFSPLRKLQKLYISKNHLVEIPPNLPSSLVELRIHDNRIRKVPKGVFSGLRNMNCIEMGGNPLENSGFEPGAFDGLKLNYLRISEAKLTGIPKDLPETLNELHLDHNKIQAIELEDLLRYSKLYRLGLGHNQIRMIENGSLSFLPTLRELHLDNNKLARVPSGLPDLKLLQVVYLHSNNITKVGVNDFCPMGFGVKRAYYNGISLFNNPVPYWEVQPATFRCVTDRLAIQFGNYKK,368,NP_001702.1.csv,refseq-BGN-NM_001711.5_clinical_seed_0_final,refseq-BGN-NM_001711.5.a2m,Invitae,refseq-BGN-NM_001711.5.npy,1,368,368
+NP_001708.3,MRRRPPSRGGRGAARARETRRQPRHRSGRRMAEAISCTLNCSCQSFKPGKINHRQCDQCKHGWVAHALSKLRIPPMYPTSQVEIVQSNVVFDISSLMLYGTQAIPVRLKILLDRLFSVLKQDEVLQILHALDWTLQDYIRGYVLQDASGKVLDHWSIMTSEEEVATLQQFLRFGETKSIVELMAIQEKEEQSIIIPPSTANVDIRAFIESCSHRSSSLPTPVDKGNPSSIHPFENLISNMTFMLPFQFFNPLPPALIGSLPEQYMLEQGHDQSQDPKQEVHGPFPDSSFLTSSSTPFQVEKDQCLNCPDAITKKEDSTHLSDSSSYNIVTKFERTQLSPEAKVKPERNSLGTKKGRVFCTACEKTFYDKGTLKIHYNAVHLKIKHKCTIEGCNMVFSSLRSRNRHSANPNPRLHMPMNRNNRDKDLRNSLNLASSENYKCPGFTVTSPDCRPPPSYPGSGEDSKGQPAFPNIGQNGVLFPNLKTVQPVLPFYRSPATPAEVANTPGILPSLPLLSSSIPEQLISNEMPFDALPKKKSRKSSMPIKIEKEAVEIANEKRHNLSSDEDMPLQVVSEDEQEACSPQSHRVSEEQHVQSGGLGKPFPEGERPCHRESVIESSGAISQTPEQATHNSERETEQTPALIMVPREVEDGGHEHYFTPGMEPQVPFSDYMELQQRLLAGGLFSALSNRGMAFPCLEDSKELEHVGQHALARQIEENRFQCDICKKTFKNACSVKIHHKNMHVKEMHTCTVEGCNATFPSRRSRDRHSSNLNLHQKALSQEALESSEDHFRAAYLLKDVAKEAYQDVAFTQQASQTSVIFKGTSRMGSLVYPITQVHSASLESYNSGPLSEGTILDLSTTSSMKSESSSHSSWDSDGVSEEGTVLMEDSDGNCEGSSLVPGEDEYPICVLMEKADQSLASLPSGLPITCHLCQKTYSNKGTFRAHYKTVHLRQLHKCKVPGCNTMFSSVRSRNRHSQNPNLHKSLASSPSHLQ,994,NP_001708.3.csv,refseq-BNC1-NM_001717.3_clinical_seed_0_final,refseq-BNC1-NM_001717.3.a2m,Invitae,refseq-BNC1-NM_001717.3.npy,1,994,994
+NP_001709.1,MPGLGRRAQWLCWWWGLLCSCCGPPPLRPPLPAAAAAAAGGQLLGDGGSPGRTEQPPPSPQSSSGFLYRRLKTQEKREMQKEILSVLGLPHRPRPLHGLQQPQPPALRQQEEQQQQQQLPRGEPPPGRLKSAPLFMLDLYNALSADNDEDGASEGERQQSWPHEAASSSQRRQPPPGAAHPLNRKSLLAPGSGSGGASPLTSAQDSAFLNDADMVMSFVNLVEYDKEFSPRQRHHKEFKFNLSQIPEGEVVTAAEFRIYKDCVMGSFKNQTFLISIYQVLQEHQHRDSDLFLLDTRVVWASEEGWLEFDITATSNLWVVTPQHNMGLQLSVVTRDGVHVHPRAAGLVGRDGPYDKQPFMVAFFKVSEVHVRTTRSASSRRRQQSRNRSTQSQDVARVSSASDYNSSELKTACRKHELYVSFQDLGWQDWIIAPKGYAANYCDGECSFPLNAHMNATNHAIVQTLVHLMNPEYVPKPCCAPTKLNAISVLYFDDNSNVILKKYRNMVVRACGCH,513,NP_001709.1.csv,refseq-BMP6-NM_001718.5_clinical_seed_0_final,refseq-BMP6-NM_001718.5.a2m,Invitae,refseq-BMP6-NM_001718.5.npy,1,513,513
+NP_001730.1,MLGRNTWKTSAFSFLVEQMWAPLWSRSMRPGRWCSQRSCAWQTSNNTLHPLWTVPVSVPGGTRQSPINIQWRDSVYDPQLKPLRVSYEAASCLYIWNTGYLFQVEFDDATEASGISGGPLENHYRLKQFHFHWGAVNEGGSEHTVDGHAYPAELHLVHWNSVKYQNYKEAVVGENGLAVIGVFLKLGAHHQTLQRLVDILPEIKHKDARAAMRPFDPSTLLPTCWDYWTYAGSLTTPPLTESVTWIIQKEPVEVAPSQLSAFRTLLFSALGEEEKMMVNNYRPLQPLMNRKVWASFQATNEGTRS,305,NP_001730.1.csv,refseq-CA5A-NM_001739.1_clinical_seed_0_final,refseq-CA5A-NM_001739.1.a2m,Invitae,refseq-CA5A-NM_001739.1.npy,1,305,305
+NP_001745.2,MASDSIFESFPSYPQCFMRECILGMNPSRDVHDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPGELVRTDSPNFLCSVLPTHWRCNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKITVDGPREPRRHRQKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPHHPAPTPNPRASLNHSTAFNPQPQSQMQDTRQIQPSPPWSYDQSYQYLGSIASPSVHPATPISPGRASGMTTLSAELSSRLSTAPDLTAFSDPRQFPALPSISDPRMHYPGAFTYSPTPVTSGIGIGMSAMGSATRYHTYLPPPYPGSSQAQGGPFQASSPSYHLYYGASAGSYQFSMVGGERSPPRILPPCTNASTGSALLNPSLPNQSDVVEAEGSHSNSPTNMAPSARLEEAVWRPY,480,NP_001745.2.csv,refseq-RUNX1-NM_001754.4_clinical_seed_0_final,refseq-RUNX1-NM_001754.4.a2m,Invitae,refseq-RUNX1-NM_001754.4.npy,1,480,480
+NP_001750.1,MELLCHEVDPVRRAVRDRNLLRDDRVLQNLLTIEERYLPQCSYFKCVQKDIQPYMRRMVATWMLEVCEEQKCEEEVFPLAMNYLDRFLAGVPTPKSHLQLLGAVCMFLASKLKETSPLTAEKLCIYTDNSIKPQELLEWELVVLGKLKWNLAAVTPHDFIEHILRKLPQQREKLSLIRKHAQTFIALCATDFKFAMYPPSMIATGSVGAAICGLQQDEEVSSLTCDALTELLAKITNTDVDCLKACQEQIEAVLLNSLQQYRQDQRDGSKSEDELDQASTPTDVRDIDL,289,NP_001750.1.csv,refseq-CCND2-NM_001759.4_clinical_seed_0_final,refseq-CCND2-NM_001759.4.a2m,Invitae,refseq-CCND2-NM_001759.4.npy,1,289,289
+NP_001759.3,MALPVTALLLPLALLLHAARPSQFRVSPLDRTWNLGETVELKCQVLLSNPTSGCSWLFQPRGAAASPTFLLYLSQNKPKAAEGLDTQRFSGKRLGDTFVLTLSDFRRENEGYYFCSALSNSIMYFSHFVPVFLPAKPTTTPAPRPPTPAPTIASQPLSLRPEACRPAAGGAVHTRGLDFACDIYIWAPLAGTCGVLLLSLVITLYCNHRNRRRVCKCPRPVVKSGDKPSLSARYV,235,NP_001759.3.csv,refseq-CD8A-NM_001768.6_clinical_seed_0_final,refseq-CD8A-NM_001768.6.a2m,Invitae,refseq-CD8A-NM_001768.6.npy,1,235,235
+NP_001761.3,MPPPRLLFFLLFLTPMEVRPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLAIWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCLPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR,556,NP_001761.3.csv,refseq-CD19-NM_001770.5_clinical_seed_0_final,refseq-CD19-NM_001770.5.a2m,Invitae,refseq-CD19-NM_001770.5.npy,1,556,556
+NP_001774.1,MPGGPGVLQALPATIFLLFLLSAVYLGPGCQALWMHKVPASLMVSLGEDAHFQCPHNSSNNANVTWWRVLHGNYTWPPEFLGPGEDPNGTLIIQNVNKSHGGIYVCRVQEGNESYQQSCGTYLRVRQPPPRPFLDMGEGTKNRIITAEGIILLFCAVVPGTLLLFRKRWQNEKLGLDAGDEYEDENLYEGLNLDDCSMYEDISRGLQGTYQDVGSLNIGDVQLEKP,226,NP_001774.1.csv,refseq-CD79A-NM_001783.3_clinical_seed_0_final,refseq-CD79A-NM_001783.3.a2m,Invitae,refseq-CD79A-NM_001783.3.npy,1,226,226
+NP_001783.2,MCRIAGALRTLLPLLAALLQASVEASGEIALCKTGFPEDVYSAVLSKDVHEGQPLLNVKFSNCNGKRKVQYESSEPADFKVDEDGMVYAVRSFPLSSEHAKFLIYAQDKETQEKWQVAVKLSLKPTLTEESVKESAEVEEIVFPRQFSKHSGHLQRQKRDWVIPPINLPENSRGPFPQELVRIRSDRDKNLSLRYSVTGPGADQPPTGIFIINPISGQLSVTKPLDREQIARFHLRAHAVDINGNQVENPIDIVINVIDMNDNRPEFLHQVWNGTVPEGSKPGTYVMTVTAIDADDPNALNGMLRYRIVSQAPSTPSPNMFTINNETGDIITVAAGLDREKVQQYTLIIQATDMEGNPTYGLSNTATAVITVTDVNDNPPEFTAMTFYGEVPENRVDIIVANLTVTDKDQPHTPAWNAVYRISGGDPTGRFAIQTDPNSNDGLVTVVKPIDFETNRMFVLTVAAENQVPLAKGIQHPPQSTATVSVTVIDVNENPYFAPNPKIIRQEEGLHAGTMLTTFTAQDPDRYMQQNIRYTKLSDPANWLKIDPVNGQITTIAVLDRESPNVKNNIYNATFLASDNGIPPMSGTGTLQIYLLDINDNAPQVLPQEAETCETPDPNSINITALDYDIDPNAGPFAFDLPLSPVTIKRNWTITRLNGDFAQLNLKIKFLEAGIYEVPIIITDSGNPPKSNISILRVKVCQCDSNGDCTDVDRIVGAGLGTGAIIAILLCIIILLILVLMFVVWMKRRDKERQAKQLLIDPEDDVRDNILKYDEEGGGEEDQDYDLSQLQQPDTVEPDAIKPVGIRRMDERPIHAEPQYPVRSAAPHPGDIGDFINEGLKAADNDPTAPPYDSLLVFDYEGSGSTAGSLSSLNSSSSGGEQDYDYLNDWGPRFKKLADMYGGGDD,906,NP_001783.2.csv,refseq-CDH2-NM_001792.5_clinical_seed_0_final,refseq-CDH2-NM_001792.5.a2m,Invitae,refseq-CDH2-NM_001792.5_theta_0.2.npy,1,906,906
+NP_001784.2,MGLPRGPLASLLLLQVCWLQCAASEPCRAVFREAEVTLEAGGAEQEPGQALGKVFMGCPGQEPALFSTDNDDFTVRNGETVQERRSLKERNPLKIFPSKRILRRHKRDWVVAPISVPENGKGPFPQRLNQLKSNKDRDTKIFYSITGPGADSPPEGVFAVEKETGWLLLNKPLDREEIAKYELFGHAVSENGASVEDPMNISIIVTDQNDHKPKFTQDTFRGSVLEGVLPGTSVMQVTATDEDDAIYTYNGVVAYSIHSQEPKDPHDLMFTIHRSTGTISVISSGLDREKVPEYTLTIQATDMDGDGSTTTAVAVVEILDANDNAPMFDPQKYEAHVPENAVGHEVQRLTVTDLDAPNSPAWRATYLIMGGDDGDHFTITTHPESNQGILTTRKGLDFEAKNQHTLYVEVTNEAPFVLKLPTSTATIVVHVEDVNEAPVFVPPSKVVEVQEGIPTGEPVCVYTAEDPDKENQKISYRILRDPAGWLAMDPDSGQVTAVGTLDREDEQFVRNNIYEVMVLAMDNGSPPTTGTGTLLLTLIDVNDHGPVPEPRQITICNQSPVRQVLNITDKDLSPHTSPFQAQLTDDSDIYWTAEVNEEGDTVVLSLKKFLKQDTYDVHLSLSDHGNKEQLTVIRATVCDCHGHVETCPGPWKGGFILPVLGAVLALLFLLLVLLLLVRKKRKIKEPLLLPEDDTRDNVFYYGEEGGGEEDQDYDITQLHRGLEARPEVVLRNDVAPTIIPTPMYRPRPANPDEIGNFIIENLKAANTDPTAPPYDTLLVFDYEGSGSDAASLSSLTSSASDQDQDYDYLNEWGSRFKKLADMYGGGEDD,829,NP_001784.2.csv,refseq-CDH3-NM_001793.5_clinical_seed_0_final,refseq-CDH3-NM_001793.5.a2m,Invitae,refseq-CDH3-NM_001793.5.npy,1,829,829
+NP_001785.2,MTAGAGVLLLLLSLSGALRAHNEDLTTRETCKAGFSEDDYTALISQNILEGEKLLQVKFSSCVGTKGTQYETNSMDFKVGADGTVFATRELQVPSEQVAFTVTAWDSQTAEKWDAVVRLLVAQTSSPHSGHKPQKGKKVVALDPSPPPKDTLLPWPQHQNANGLRRRKRDWVIPPINVPENSRGPFPQQLVRIRSDKDNDIPIRYSITGVGADQPPMEVFSIDSMSGRMYVTRPMDREEHASYHLRAHAVDMNGNKVENPIDLYIYVIDMNDNRPEFINQVYNGSVDEGSKPGTYVMTVTANDADDSTTANGMVRYRIVTQTPQSPSQNMFTINSETGDIVTVAAGLDREKVQQYTVIVQATDMEGNLNYGLSNTATAIITVTDVNDNPPEFTASTFAGEVPENRVETVVANLTVMDRDQPHSPNWNAVYRIISGDPSGHFSVRTDPVTNEGMVTVVKAVDYELNRAFMLTVMVSNQAPLASGIQMSFQSTAGVTISIMDINEAPYFPSNHKLIRLEEGVPPGTVLTTFSAVDPDRFMQQAVRYSKLSDPASWLHINATNGQITTAAVLDRESLYTKNNVYEATFLAADNGIPPASGTGTLQIYLIDINDNAPELLPKEAQICEKPNLNAINITAADADVDPNIGPYVFELPFVPAAVRKNWTITRLNGDYAQLSLRILYLEAGMYDVPIIVTDSGNPPLSNTSIIKVKVCPCDDNGDCTTIGAVAAAGLGTGAIVAILICILILLTMVLLFVMWMKRREKERHTKQLLIDPEDDVRDNILKYDEEGGGEEDQDYDLSQLQQPEAMGHVPSKAPGVRRVDERPVGAEPQYPIRPMVPHPGDIGDFINEGLRAADNDPTAPPYDSLLVFDYEGSGSTAGSVSSLNSSSSGDQDYDYLNDWGPRFKKLADMYGGGEED,916,NP_001785.2.csv,refseq-CDH4-NM_001794.3_clinical_seed_0_final,refseq-CDH4-NM_001794.3.a2m,Invitae,refseq-CDH4-NM_001794.3.npy,1,916,916
+NP_001796.2,MSHGTYYECEPRGGQQPLEFSGGRAGPGELGDMCEHEASIDLSAYIESGEEQLLSDLFAVKPAPEARGLKGPGTPAFPHYLPPDPRPFAYPPHTFGPDRKALGPGIYSSPGSYDPRAVAVKEEPRGPEGSRAASRGSYNPLQYQVAHCGQTAMHLPPTLAAPGQPLRVLKAPLATAAPPCSPLLKAPSPAGPLHKGKKAVNKDSLEYRLRRERNNIAVRKSRDKAKRRILETQQKVLEYMAENERLRSRVEQLTQELDTLRNLFRQIPEAANLIKGVGGCS,281,NP_001796.2.csv,refseq-CEBPE-NM_001805.3_clinical_seed_0_final,refseq-CEBPE-NM_001805.3.a2m,Invitae,refseq-CEBPE-NM_001805.3.npy,1,281,281
+NP_001813.1,MALTLFDTDEYRPPVWKSYLYQLQQEAPHPRRITCTCEVENRPKYYGREFHGMISREAADQLLIVAEGSYLIRESQRQPGTYTLALRFGSQTRNFRLYYDGKHFVGEKRFESIHDLVTDGLITLYIETKAAEYIAKMTINPIYEHVGYTTLNREPAYKKHMPVLKETHDERDSTGQDGVSEKRLTSLVRRATLKENEQIPKYEKIHNFKVHTFRGPHWCEYCANFMWGLIAQGVKCADCGLNVHKQCSKMVPNDCKPDLKHVKKVYSCDLTTLVKAHTTKRPMVVDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFDRDGEKADISVNMYEDINIITGALKLYFRDLPIPLITYDAYPKFIESAKIMDPDEQLETLHEALKLLPPAHCETLRYLMAHLKRVTLHEKENLMNAENLGIVFGPTLMRSPELDAMAALNDIRYQRLVVELLIKNEDILF,459,NP_001813.1.csv,refseq-CHN1-NM_001822.5_clinical_seed_0_final,refseq-CHN1-NM_001822.5.a2m,Invitae,refseq-CHN1-NM_001822.5.npy,1,459,459
+NP_001821.2,MVNAGAMSGSGNLMDFLDEPFPDVGTYEDFHTIDWLREKSRDTDRHRKITSKSKESIWEFIKSLLDAWSGWVVMLLIGLLAGTLAGVIDLAVDWMTDLKEGVCLSAFWYSHEQCCWTSNETTFEDRDKCPLWQKWSELLVNQSEGASAYILNYLMYILWALLFAFLAVSLVRVFAPYACGSGIPEIKTILSGFIIRGYLGKWTLLIKTVTLVLVVSSGLSLGKEGPLVHVACCCGNFFSSLFSKYSKNEGKRREVLSAAAAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSFFAALVAAFTLRSINPFGNSRLVLFYVEYHTPWYMAELFPFILLGVFGGLWGTLFIRCNIAWCRRRKTTRLGKYPVLEVIVVTAITAIIAYPNPYTRQSTSELISELFNDCGALESSQLCDYINDPNMTRPVDDIPDRPAGVGVYTAMWQLALALIFKIVVTIFTFGMKIPSGLFIPSMAVGAIAGRMVGIGVEQLAYHHHDWIIFRNWCRPGADCVTPGLYAMVGAAACLGGVTRMTVSLVVIMFELTGGLEYIVPLMAAAVTSKWVADAFGKEGIYEAHIHLNGYPFLDVKDEFTHRTLATDVMRPRRGEPPLSVLTQDSMTVEDVETLIKETDYNGFPVVVSRDSERLIGFAQRRELILAIKNARQRQEGIVSNSIMYFTEEPPELPANSPHPLKLRRILNLSPFTVTDHTPMETVVDIFRKLGLRQCLVTRSGRLLGIITKKDVLRHMAQMANQDPESIMFN,760,NP_001821.2.csv,refseq-CLCN4-NM_001830.3_clinical_seed_0_final,refseq-CLCN4-NM_001830.3.a2m,Invitae,refseq-CLCN4-NM_001830.3.npy,1,760,760
+NP_001835.3,MIRLGAPQTLVLLTLLVAAVLRCQGQDVQEAGSCVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVKDCLSPEIPFGECCPICPTDLATASGQPGPKGQKGEPGDIKDIVGPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAGPPGNPGPPGPPGPPGPGIDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL,1487,NP_001835.3.csv,refseq-COL2A1-NM_001844.4_clinical_seed_0_final,refseq-COL2A1-NM_001844.4.a2m,Invitae,refseq-COL2A1-NM_001844.4.npy,1,1487,1487
+NP_001837.2,MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGRGQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIPGHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERDRYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKGEKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGFPGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKGARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPALYGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDAGECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGAPGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPPGDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQIDCDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRGLPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQPGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGLHGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGHPGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGLAGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLSGIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGLYGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTGVQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHGLPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPGSPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPGTPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPGAPGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQGRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASRNDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIPEQSFQGSPSADTLKAGLIRTHISRCQVCMKNL,1712,NP_001837.2.csv,refseq-COL4A2-NM_001846.4_clinical_seed_0_final,refseq-COL4A2-NM_001846.4.a2m,Invitae,refseq-COL4A2-NM_001846.4_theta_0.2.npy,1,1712,1712
+NP_001839.2,MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVKGEMGYPGLPGCKGSPGFDGIQGPPGPKGDPGAFGLKGEKGEPGADGEAGRPGSSGPSGDEGQPGEPGPPGEKGEAGDEGNPGPDGAPGERGGPGERGPRGTPGTRGPRGDPGEAGPQGDQGREGPVGVPGDPGEAGPIGPKGYRGDEGPPGSEGARGAPGPAGPPGDPGLMGERGEDGPAGNGTEGFPGFPGYPGNRGAPGINGTKGYPGLKGDEGEAGDPGDDNNDIAPRGVKGAKGYRGPEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSESIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTPLNVLCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLEDAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG,1028,NP_001839.2.csv,refseq-COL6A1-NM_001848.2_clinical_seed_0_final,refseq-COL6A1-NM_001848.2.a2m,Invitae,refseq-COL6A1-NM_001848.2.npy,1,1028,1028
+NP_001840.3,MLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERAREEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKVSCLEIPGPSGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEAGSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDPGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPGEKGEPGPRGPEGGRGDFGLKGEPGRKGEKGEPADPGPPGEPGPRGPRGVPGPEGEPGPPGDPGLTECDVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHEKHESENLYSIACDKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTELSVAQCTQRPVDIVFLLDGSERLGEQNFHKARRFVEQVARRLTLARRDDDPLNARVALLQFGGPGEQQVAFPLSHNLTAIHEALETTQYLNSFSHVGAGVVHAINAIVRSPRGGARRHAELSFVFLTDGVTGNDSLHESAHSMRKQNVVPTVLALGSDVDMDVLTTLSLGDRAAVFHEKDYDSLAQPGFFDRFIRWIC,1019,NP_001840.3.csv,refseq-COL6A2-NM_001849.3_clinical_seed_0_final,refseq-COL6A2-NM_001849.3.a2m,Invitae,refseq-COL6A2-NM_001849.3.npy,1,1019,1019
+NP_001843.1,MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGASAYASARLTEPGSIKGP,689,NP_001843.1.csv,refseq-COL9A2-NM_001852.3_clinical_seed_0_final,refseq-COL9A2-NM_001852.3.a2m,Invitae,refseq-COL9A2-NM_001852.3.npy,1,689,689
+NP_001845.3,MEPWSSRWKTKRWLWDFTVTTLALTFLFQAREVRGAAPVDVLKALDFHNSPEGISKTTGFCTNRKNSKGSDTAYRVSKQAQLSAPTKQLFPGGTFPEDFSILFTVKPKKGIQSFLLSIYNEHGIQQIGVEVGRSPVFLFEDHTGKPAPEDYPLFRTVNIADGKWHRVAISVEKKTVTMIVDCKKKTTKPLDRSERAIVDTNGITVFGTRILDEEVFEGDIQQFLITGDPKAAYDYCEHYSPDCDSSAPKAAQAQEPQIDEYAPEDIIEYDYEYGEAEYKEAESVTEGPTVTEETIAQTEANIVDDFQEYNYGTMESYQTEAPRHVSGTNEPNPVEEIFTEEYLTGEDYDSQRKNSEDTLYENKEIDGRDSDLLVDGDLGEYDFYEYKEYEDKPTSPPNEEFGPGVPAETDITETSINGHGAYGEKGQKGEPAVVEPGMLVEGPPGPAGPAGIMGPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPTISAQEAQAQAILQQARIALRGPPGPMGLTGRPGPVGGPGSSGAKGESGDPGPQGPRGVQGPPGPTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPPGPPGDDGMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMGPQGEPGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGALGPPGPQGPIGYPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGPRGEDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTGFPGFPGANGEKGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPGGVVGPQGPTGETGPIGERGHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGERGLPGAQGAPGLKGGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLPGPAGPAGSPGEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAPGIAGGDGEPGPRGQQGMFGQKGDEGARGFPGPPGPIGLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEAGVGGPKGERGEKGEAGPPGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPPGEPGPAGQDGVGGDKGEDGDPGQPGPPGPSGEAGPPGPPGKRGPPGAAGAEGRQGEKGAKGEAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLPGLKGDPGSKGEKGHPGLIGLIGPPGEQGEKGDRGLPGTQGSPGAKGDGGIPGPAGPLGPPGPPGLPGPQGPKGNKGSTGPAGQKGDSGLPGPPGSPGPPGEVIQPLPILSSKKTRRHTEGMQADADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTNPARTCKDLQLSHPDFPDGEYWIDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRISSWPKEKPGSWFSEFKRGKLLSYLDVEGNSINMVQMTFLKLLTASARQNFTYHCHQSAAWYDVSSGSYDKALRFLGSNDEEMSYDNNPFIKTLYDGCASRKGYEKTVIEINTPKIDQVPIVDVMINDFGDQNQKFGFEVGPVCFLG,1806,NP_001845.3.csv,refseq-COL11A1-NM_001854.3_clinical_seed_0_final,refseq-COL11A1-NM_001854.3.a2m,Invitae,refseq-COL11A1-NM_001854.3_theta_0.2.npy,1,1806,1806
+NP_001850.1,MDHSHHMGMSYMDSNSTMQPSHHHPTTSASHSHGGGDSSMMMMPMTFYFGFKNVELLFSGLVINTAGEMAGAFVAVFLLAMFYEGLKIARESLLRKSQVSIRYNSMPVPGPNGTILMETHKTVGQQMLSFPHLLQTVLHIIQVVISYFLMLIFMTYNGYLCIAVAAGAGTGYFLFSWKKAVVVDITEHCH,190,NP_001850.1.csv,refseq-SLC31A1-NM_001859.3_clinical_seed_0_final,refseq-SLC31A1-NM_001859.3.a2m,Invitae,refseq-SLC31A1-NM_001859.3.npy,1,190,190
+NP_001859.1,MRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKVKELEDLEHLQLDFWRGPAHPGSPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYATYHTLEEIYDFLDLLVAENPHLVSKIQIGNTYEGRPIYVLKFSTGGSKRPAIWIDTGIHSREWVTQASGVWFAKKITQDYGQDAAFTAILDTLDIFLEIVTNPDGFAFTHSTNRMWRKTRSHTAGSLCIGVDPNRNWDAGFGLSGASSNPCSETYHGKFANSEVEVKSIVDFVKDHGNIKAFISIHSYSQLLMYPYGYKTEPVPDQDELDQLSKAAVTALASLYGTKFNYGSIIKAIYQASGSTIDWTYSQGIKYSFTFELRDTGRYGFLLPASQIIPTAKETWLALLTIMEHTLNHPY,419,NP_001859.1.csv,refseq-CPA1-NM_001868.3_clinical_seed_0_final,refseq-CPA1-NM_001868.3.a2m,Invitae,refseq-CPA1-NM_001868.3.npy,1,419,419
+NP_001866.2,MTRILTAFKVVRTLKTGFGFTNVTAHQKWKFSRPGIRLLSVKAQTAHIVLEDGTKMKGYSFGHPSSVAGEVVFNTGLGGYPEAITDPAYKGQILTMANPIIGNGGAPDTTALDELGLSKYLESNGIKVSGLLVLDYSKDYNHWLATKSLGQWLQEEKVPAIYGVDTRMLTKIIRDKGTMLGKIEFEGQPVDFVDPNKQNLIAEVSTKDVKVYGKGNPTKVVAVDCGIKNNVIRLLVKRGAEVHLVPWNHDFTKMEYDGILIAGGPGNPALAEPLIQNVRKILESDRKEPLFGISTGNLITGLAAGAKTYKMSMANRGQNQPVLNITNKQAFITAQNHGYALDNTLPAGWKPLFVNVNDQTNEGIMHESKPFFAVQFHPEVTPGPIDTEYLFDSFFSLIKKGKATTITSVLPKPALVASRVEVSKVLILGSGGLSIGQAGEFDYSGSQAVKAMKEENVKTVLMNPNIASVQTNEVGLKQADTVYFLPITPQFVTEVIKAEQPDGLILGMGGQTALNCGVELFKRGVLKEYGVKVLGTSVESIMATEDRQLFSDKLNEINEKIAPSFAVESIEDALKAADTIGYPVMIRSAYALGGLGSGICPNRETLMDLSTKAFAMTNQILVEKSVTGWKEIEYEVVRDADDNCVTVCNMENVDAMGVHTGDSVVVAPAQTLSNAEFQMLRRTSINVVRHLGIVGECNIQFALHPTSMEYCIIEVNARLSRSSALASKATGYPLAFIAAKIALGIPLPEIKNVVSGKTSACFEPSLDYMVTKIPRWDLDRFHGTSSRIGSSMKSVGEVMAIGRTFEESFQKALRMCHPSIEGFTPRLPMNKEWPSNLDLRKELSEPSSTRIYAIAKAIDDNMSLDEIEKLTYIDKWFLYKMRDILNMEKTLKGLNSESMTEETLKRAKEIGFSDKQISKCLGLTEAQTRELRLKKNIHPWVKQIDTLAAEYPSVTNYLYVTYNGQEHDVNFDDHGMMVLGCGPYHIGSSVEFDWCAVSSIRTLRQLGKKTVVVNCNPETVSTDFDECDKLYFEELSLERILDIYHQEACGGCIISVGGQIPNNLAVPLYKNGVKIMGTSPLQIDRAEDRSIFSAVLDELKVAQAPWKAVNTLNEALEFAKSVDYPCLLRPSYVLSGSAMNVVFSEDEMKKFLEEATRVSQEHPVVLTKFVEGAREVEMDAVGKDGRVISHAISEHVEDAGVHSGDATLMLPTQTISQGAIEKVKDATRKIAKAFAISGPFNVQFLVKGNDVLVIECNLRASRSFPFVSKTLGVDFIDVATKVMIGENVDEKHLPTLDHPIIPADYVAIKAPMFSWPRLRDADPILRCEMASTGEVACFGEGIHTAFLKAMLSTGFKIPQKGILIGIQQSFRPRFLGVAEQLHNEGFKLFATEATSDWLNANNVPATPVAWPSQEGQNPSLSSIRKLIRDGSIDLVINLPNNNTKFVHDNYVIRRTAVDSGIPLLTNFQVTKLFAEAVQKSRKVDSKSLFHYRQYSAGKAA,1500,NP_001866.2.csv,refseq-CPS1-NM_001875.4_clinical_seed_0_final,refseq-CPS1-NM_001875.4.a2m,Invitae,refseq-CPS1-NM_001875.4.npy,1,1500,1500
+NP_001867.2,MAEAHQAVAFQFTVTPDGIDLRLSHEALRQIYLSGLHSWKKKFIRFKNGIITGVYPASPSSWLIVVVGVMTTMYAKIDPSLGIIAKINRTLETANCMSSQTKNVVSGVLFGTGLWVALIVTMRYSLKVLLSYHGWMFTEHGKMSRATKIWMGMVKIFSGRKPMLYSFQTSLPRLPVPAVKDTVNRYLQSVRPLMKEEDFKRMTALAQDFAVGLGPRLQWYLKLKSWWATNYVSDWWEEYIYLRGRGPLMVNSNYYAMDLLYILPTHIQAARAGNAIHAILLYRRKLDREEIKPIRLLGSTIPLCSAQWERMFNTSRIPGEETDTIQHMRDSKHIVVYHRGRYFKVWLYHDGRLLKPREMEQQMQRILDNTSEPQPGEARLAALTAGDRVPWARCRQAYFGRGKNKQSLDAVEKAAFFVTLDETEEGYRSEDPDTSMDSYAKSLLHGRCYDRWFDKSFTFVVFKNGKMGLNAEHSWADAPIVAHLWEYVMSIDSLQLGYAEDGHCKGDINPNIPYPTRLQWDIPGECQEVIETSLNTANLLANDVDFHSFPFVAFGKGIIKKCRTSPDAFVQLALQLAHYKDMGKFCLTYEASMTRLFREGRTETVRSCTTESCDFVRAMVDPAQTVEQRLKLFKLASEKHQHMYRLAMTGSGIDRHLFCLYVVSKYLAVESPFLKEVLSEPWRLSTSQTPQQQVELFDLENNPEYVSSGGGFGPVADDGYGVSYILVGENLINFHISSKFSCPETDSHRFGRHLKEAMTDIITLFGLSSNSKK,773,NP_001867.2.csv,refseq-CPT1A-NM_001876.4_clinical_seed_0_final,refseq-CPT1A-NM_001876.4.a2m,Invitae,refseq-CPT1A-NM_001876.4_theta_0.2.npy,1,773,773
+NP_001876.1,MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHGKHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIPITREEKPAVTAAPKK,175,NP_001876.1.csv,refseq-CRYAB-NM_001885.2_clinical_seed_0_final,refseq-CRYAB-NM_001885.2.a2m,Invitae,refseq-CRYAB-NM_001885.2.npy,1,175,175
+NP_001877.1,MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ,196,NP_001877.1.csv,refseq-CRYBA4-NM_001886.2_clinical_seed_0_final,refseq-CRYBA4-NM_001886.2.a2m,Invitae,refseq-CRYBA4-NM_001886.2.npy,1,196,196
+NP_001878.1,MSQAAKASASATVAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLLEPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK,252,NP_001878.1.csv,refseq-CRYBB1-NM_001887.3_clinical_seed_0_final,refseq-CRYBB1-NM_001887.3.a2m,Invitae,refseq-CRYBB1-NM_001887.3.npy,1,252,252
+NP_001895.1,MATQADLMELDMAMEPDRKAAVSHWQQQSYLDSGIHSGATTTAPSLSGKGNPEEEDVDTSQVLYEWEQGFSQSFTQEQVADIDGQYAMTRAQRVRAAMFPETLDEGMQIPSTQFDAAHPTNVQRLAEPSQMLKHAVVNLINYQDDAELATRAIPELTKLLNDEDQVVVNKAAVMVHQLSKKEASRHAIMRSPQMVSAIVRTMQNTNDVETARCTAGTLHNLSHHREGLLAIFKSGGIPALVKMLGSPVDSVLFYAITTLHNLLLHQEGAKMAVRLAGGLQKMVALLNKTNVKFLAITTDCLQILAYGNQESKLIILASGGPQALVNIMRTYTYEKLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALGLHLTDPSQRLVQNCLWTLRNLSDAATKQEGMEGLLGTLVQLLGSDDINVVTCAAGILSNLTCNNYKNKMMVCQVGGIEALVRTVLRAGDREDITEPAICALRHLTSRHQEAEMAQNAVRLHYGLPVVVKLLHPPSHWPLIKATVGLIRNLALCPANHAPLREQGAIPRLVQLLVRAHQDTQRRTSMGGTQQQFVEGVRMEEIVEGCTGALHILARDVHNRIVIRGLNTIPLFVQLLYSPIENIQRVAAGVLCELAQDKEAAEAIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSEDKPQDYKKRLSVELTSSLFRTEPMAWNETADLGLDIGAQGEPLGYRQDDPSYRSFHSGGYGQDALGMDPMMEHEMGGHHPGADYPVDGLPDLGHAQDLMDGLPPGDSNQLAWFDTDL,781,NP_001895.1.csv,refseq-CTNNB1-NM_001904.3_clinical_seed_0_final,refseq-CTNNB1-NM_001904.3.a2m,Invitae,refseq-CTNNB1-NM_001904.3.npy,1,781,781
+NP_001905.1,MAEQSDEAVKYYTLEEIQKHNHSKSTWLILHHKVYDLTKFLEEHPGGEEVLREQAGGDATENFEDVGHSTDAREMSKTFIIGELHPDDRPKLNKPPEP,98,NP_001905.1.csv,refseq-CYB5A-NM_001914.3_clinical_seed_0_final,refseq-CYB5A-NM_001914.3.a2m,Invitae,refseq-CYB5A-NM_001914.3.npy,1,98,98
+NP_001907.3,MAAAAASLRGVVLGPRGAGLPGARARGLLCSARPGQLPLRTPQAVALSSKSGLSRGRKVMLSALGMLAAGGAGLAMALHSAVSASDLELHPPSYPWSHRGLLSSLDHTSIRRGFQVYKQVCASCHSMDFVAYRHLVGVCYTEDEAKELAAEVEVQDGPNEDGEMFMRPGKLFDYFPKPYPNSEAARAANNGALPPDLSYIVRARHGGEDYVFSLLTGYCEPPTGVSLREGLYFNPYFPGQAIAMAPPIYTDVLEFDDGTPATMSQIAKDVCTFLRWASEPEHDHRKRMGLKMLMMMALLVPLVYTIKRHKWSVLKSRKLAYRPPK,325,NP_001907.3.csv,refseq-CYC1-NM_001916.5_clinical_seed_0_final,refseq-CYC1-NM_001916.5.a2m,Invitae,refseq-CYC1-NM_001916.5.npy,1,325,325
+NP_001908.3,MRVVVIGAGVIGLSTALCIHERYHSVLQPLDIKVYADRFTPLTTTDVAAGLWQPYLSDPNNPQEADWSQQTFDYLLSHVHSPNAENLGLFLISGYNLFHEAIPDPSWKDTVLGFRKLTPRELDMFPDYGYGWFHTSLILEGKNYLQWLTERLTERGVKFFQRKVESFEEVAREGADVIVNCTGVWAGALQRDPLLQPGRGQIMKVDAPWMKHFILTHDPERGIYNSPYIIPGTQTVTLGGIFQLGNWSELNNIQDHNTIWEGCCRLEPTLKNARIIGERTGFRPVRPQIRLEREQLRTGPSNTEVIHNYGHGGYGLTIHWGCALEAAKLFGRILEEKKLSRMPPSHL,347,NP_001908.3.csv,refseq-DAO-NM_001917.4_clinical_seed_0_final,refseq-DAO-NM_001917.4.a2m,Invitae,refseq-DAO-NM_001917.4.npy,1,347,347
+NP_001909.4,MAAVRMLRTWSRNAGKLICVRYFQTCGNVHVLKPNYVCFFGYPSFKYSHPHHFLKTTAALRGQVVQFKLSDIGEGIREVTVKEWYVKEGDTVSQFDSICEVQSDKASVTITSRYDGVIKKLYYNLDDIAYVGKPLVDIETEALKDSEEDVVETPAVSHDEHTHQEIKGRKTLATPAVRRLAMENNIKLSEVVGSGKDGRILKEDILNYLEKQTGAILPPSPKVEIMPPPPKPKDMTVPILVSKPPVFTGKDKTEPIKGFQKAMVKTMSAALKIPHFGYCDEIDLTELVKLREELKPIAFARGIKLSFMPFFLKAASLGLLQFPILNASVDENCQNITYKASHNIGIAMDTEQGLIVPNVKNVQICSIFDIATELNRLQKLGSVSQLSTTDLTGGTFTLSNIGSIGGTFAKPVIMPPEVAIGALGSIKAIPRFNQKGEVYKAQIMNVSWSADHRVIDGATMSRFSNLWKSYLENPAFMLLDLK,482,NP_001909.4.csv,refseq-DBT-NM_001918.5_clinical_seed_0_final,refseq-DBT-NM_001918.5.a2m,Invitae,refseq-DBT-NM_001918.5_theta_0.2.npy,1,482,482
+NP_001914.3,MSYNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRALYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH,1140,NP_001914.3.csv,refseq-DDB1-NM_001923.4_clinical_seed_0_final,refseq-DDB1-NM_001923.4.a2m,Invitae,refseq-DDB1-NM_001923.4.npy,1,1140,1140
+NP_001918.3,MSQAYSSSQRVSSYRRTFGGAPGFPLGSPLSSPVFPRAGFGSKGSSSSVTSRVYQVSRTSGGAGGLGSLRASRLGTTRTPSSYGAGELLDFSLADAVNQEFLTTRTNEKVELQELNDRFANYIEKVRFLEQQNAALAAEVNRLKGREPTRVAELYEEELRELRRQVEVLTNQRARVDVERDNLLDDLQRLKAKLQEEIQLKEEAENNLAAFRADVDAATLARIDLERRIESLNEEIAFLKKVHEEEIRELQAQLQEQQVQVEMDMSKPDLTAALRDIRAQYETIAAKNISEAEEWYKSKVSDLTQAANKNNDALRQAKQEMMEYRHQIQSYTCEIDALKGTNDSLMRQMRELEDRFASEASGYQDNIARLEEEIRHLKDEMARHLREYQDLLNVKMALDVEIATYRKLLEGEESRINLPIQTYSALNFRETSPEQRGSEVHTKKTVMIKTIETRDGEVVSEATQQQHEVL,470,NP_001918.3.csv,refseq-DES-NM_001927.3_clinical_seed_0_final,refseq-DES-NM_001927.3.a2m,Invitae,refseq-DES-NM_001927.3.npy,1,470,470
+NP_001922.2,MWRVCARRAQNVAPWAGLEARWTALQEVPGTPRVTSRSGPAPARRNSVTTGYGGVRALCGWTPSSGATPRNRLLLQLLGSPGRRYYSLPPHQKVPLPSLSPTMQAGTIARWEKKEGDKINEGDLIAEVETDKATVGFESLEECYMAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYTLDSSAAPTPQAAPAPTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEKKVGEKLSEGDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADISAFADYRPTEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCPATPAGPKGRVFVSPLAKKLAVEKGIDLTQVKGTGPDGRITKKDIDSFVPSKVAPAPAAVVPPTGPGMAPVPTGVFTDIPISNIRRVIAQRLMQSKQTIPHYYLSIDVNMGEVLLVRKELNKILEGRSKISVNDFIIKASALACLKVPEANSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIANDVVSLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPADNEKGFDVASMMSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL,647,NP_001922.2.csv,refseq-DLAT-NM_001931.4_clinical_seed_0_final,refseq-DLAT-NM_001931.4.a2m,Invitae,refseq-DLAT-NM_001931.4.npy,1,647,647
+NP_001928.2,MDLSLLWVLLPLVTMAWGQYGDYGYPYQQYHDYSDDGWVNLNRQGFSYQCPQGQVIVAVRSIFSKKEGSDRQWNYACMPTPQSLGEPTECWWEEINRAGMEWYQTCSNNGLVAGFQSRYFESVLDREWQFYCCRYSKRCPYSCWLTTEYPGHYGEEMDMISYNYDYYIRGATTTFSAVERDRQWKFIMCRMTEYDCEFANV,201,NP_001928.2.csv,refseq-DPT-NM_001937.4_clinical_seed_0_final,refseq-DPT-NM_001937.4.a2m,Invitae,refseq-DPT-NM_001937.4.npy,1,201,201
+NP_001934.2,MARSPGRAYALLLLLICFNVGSGLHLQVLSTRNENKLLPKHPHLVRQKRAWITAPVALREGEDLSKKNPIAKIHSDLAEERGLKITYKYTGKGITEPPFGIFVFNKDTGELNVTSILDREETPFFLLTGYALDARGNNVEKPLELRIKVLDINDNEPVFTQDVFVGSVEELSAAHTLVMKINATDADEPNTLNSKISYRIVSLEPAYPPVFYLNKDTGEIYTTSVTLDREEHSSYTLTVEARDGNGEVTDKPVKQAQVQIRILDVNDNIPVVENKVLEGMVEENQVNVEVTRIKVFDADEIGSDNWLANFTFASGNEGGYFHIETDAQTNEGIVTLIKEVDYEEMKNLDFSVIVANKAAFHKSIRSKYKPTPIPIKVKVKNVKEGIHFKSSVISIYVSESMDRSSKGQIIGNFQAFDEDTGLPAHARYVKLEDRDNWISVDSVTSEIKLAKLPDFESRYVQNGTYTVKIVAISEDYPRKTITGTVLINVEDINDNCPTLIEPVQTICHDAEYVNVTAEDLDGHPNSGPFSFSVIDKPPGMAEKWKIARQESTSVLLQQSEKKLGRSEIQFLISDNQGFSCPEKQVLTLTVCECLHGSGCREAQHDSYVGLGPAAIALMILAFLLLLLVPLLLLMCHCGKGAKGFTPIPGTIEMLHPWNNEGAPPEDKVVPSFLPVDQGGSLVGRNGVGGMAKEATMKGSSSASIVKGQHEMSEMDGRWEEHRSLLSGRATQFTGATGAIMTTETTKTARATGASRDMAGAQAAAVALNEEFLRNYFTDKAASYTEEDENHTAKDCLLVYSQEETESLNASIGCCSFIEGELDDRFLDDLGLKFKTLAEVCLGQKIDINKEIEQRQKPATETSMNTASHSLCEQTMVNSENTYSSGSSFPVPKSLQEANAEKVTQEIVTERSVSSRQAQKVATPLPDPMASRNVIATETSYVTGSTMPPTTVILGPSQPQSLIVTERVYAPASTLVDQPYANEGTVVVTERVIQPHGGGSNPLEGTQHLQDVPYVMVRERESFLAPSSGVQPTLAMPNIAVGQNVTVTERVLAPASTLQSSYQIPTENSMTARNTTVSGAGVPGPLPDFGLEESGHSNSTITTSSTRVTKHSTVQHSYS,1118,NP_001934.2.csv,refseq-DSG2-NM_001943.3_clinical_seed_0_final,refseq-DSG2-NM_001943.3.a2m,Invitae,refseq-DSG2-NM_001943.3.npy,1,1118,1118
+NP_001944.1,MAALMTPGTGAPPAPGDFSGEGSQGLPDPSPEPKQLPELIRMKRDGGRLSEADIRGFVAAVVNGSAQGAQIGAMLMAIRLRGMDLEETSVLTQALAQSGQQLEWPEAWRQQLVDKHSTGGVGDKVSLVLAPALAACGCKVPMISGRGLGHTGGTLDKLESIPGFNVIQSPEQMQVLLDQAGCCIVGQSEQLVPADGILYAARDVTATVDSLPLITASILSKKLVEGLSALVVDVKFGGAAVFPNQEQARELAKTLVGVGASLGLRVAAALTAMDKPLGRCVGHALEVEEALLCMDGAGPPDLRDLVTTLGGALLWLSGHAGTQAQGAARVAAALDDGSALGRFERMLAAQGVDPGLARALCSGSPAERRQLLPRAREQEELLAPADGTVELVRALPLALVLHELGAGRSRAGEPLRLGVGAELLVDVGQRLRRGTPWLRVHRDGPALSGPQSRALQEALVLSDRAPFAAPSPFAELVLPPQQ,482,NP_001944.1.csv,refseq-TYMP-NM_001953.4_clinical_seed_0_final,refseq-TYMP-NM_001953.4.a2m,Invitae,refseq-TYMP-NM_001953.4.npy,1,482,482
+NP_001949.1,MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAAEMGKGSFKYAWVLDKLKAERERGITIDISLWKFETTKYYITIIDAPGHRDFIKNMITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPAYSEKRYDEIVKEVSAYIKKIGYNPATVPFVPISGWHGDNMLEPSPNMPWFKGWKVERKEGNASGVSLLEALDTILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGILRPGMVVTFAPVNITTEVKSVEMHHEALSEALPGDNVGFNVKNVSVKDIRRGNVCGDSKSDPPQEAAQFTSQVIILNHPGQISAGYSPVIDCHTAHIACKFAELKEKIDRRSGKKLEDNPKSLKSGDAAIVEMVPGKPMCVESFSQYPPLGRFAVRDMRQTVAVGVIKNVEKKSGGAGKVTKSAQKAQKAGK,463,NP_001949.1.csv,refseq-EEF1A2-NM_001958.5_clinical_seed_0_final,refseq-EEF1A2-NM_001958.5.a2m,Invitae,refseq-EEF1A2-NM_001958.5_theta_0.2.npy,1,463,463
+NP_001952.1,MVNFTVDQIRAIMDKKANIRNMSVIAHVDHGKSTLTDSLVCKAGIIASARAGETRFTDTRKDEQERCITIKSTAISLFYELSENDLNFIKQSKDGAGFLINLIDSPGHVDFSSEVTAALRVTDGALVVVDCVSGVCVQTETVLRQAIAERIKPVLMMNKMDRALLELQLEPEELYQTFQRIVENVNVIISTYGEGESGPMGNIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLWGDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDIKLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEAAMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKEDLYLKPIQRTILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITTFEHAHNMRVMKFSVSPVVRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHACIPIKKSDPVVSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDIDKGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQYLNEIKDSVVAGFQWATKEGALCEENMRGVRFDVHDVTLHADAIHRGGGQIIPTARRCLYASVLTAQPRLMEPIYLVEIQCPEQVVGGIYGVLNRKRGHVFEESQVAGTPMFVVKAYLPVNESFGFTADLRSNTGGQAFPQCVFDHWQILPGDPFDNSSRPSQVVAETRKRKGLKEGIPALDNFLDKL,858,NP_001952.1.csv,refseq-EEF2-NM_001961.3_clinical_seed_0_final,refseq-EEF2-NM_001961.3.a2m,Invitae,refseq-EEF2-NM_001961.3.npy,1,858,858
+NP_001963.1,MTLGRRLACLFLACVLPALLLGGTALASEIVGGRRARPHAWPFMVSLQLRGGHFCGATLIAPNFVMSAAHCVANVNVRAVRVVLGAHNLSRREPTRQVFAVQRIFENGYDPVNLLNDIVILQLNGSATINANVQVAQLPAQGRRLGNGVQCLAMGWGLLGRNRGIASVLQELNVTVVTSLCRRSNVCTLVRGRQAGVCFGDSGSPLVCNGLIHGIASFVRGGCASGLYPDAFAPVAQFVNWIDSIIQRSEDNPCPHPRDPDPASRTH,267,NP_001963.1.csv,refseq-ELANE-NM_001972.2_clinical_seed_0_final,refseq-ELANE-NM_001972.2.a2m,Invitae,refseq-ELANE-NM_001972.2.npy,1,267,267
+NP_001973.2,MRANDALQVLGLLFSLARGSEVGNSQAVCPGTLNGLSVTGDAENQYQTLYKLYERCEVVMGNLEIVLTGHNADLSFLQWIREVTGYVLVAMNEFSTLPLPNLRVVRGTQVYDGKFAIFVMLNYNTNSSHALRQLRLTQLTEILSGGVYIEKNDKLCHMDTIDWRDIVRDRDAEIVVKDNGRSCPPCHEVCKGRCWGPGSEDCQTLTKTICAPQCNGHCFGPNPNQCCHDECAGGCSGPQDTDCFACRHFNDSGACVPRCPQPLVYNKLTFQLEPNPHTKYQYGGVCVASCPHNFVVDQTSCVRACPPDKMEVDKNGLKMCEPCGGLCPKACEGTGSGSRFQTVDSSNIDGFVNCTKILGNLDFLITGLNGDPWHKIPALDPEKLNVFRTVREITGYLNIQSWPPHMHNFSVFSNLTTIGGRSLYNRGFSLLIMKNLNVTSLGFRSLKEISAGRIYISANRQLCYHHSLNWTKVLRGPTEERLDIKHNRPRRDCVAEGKVCDPLCSSGGCWGPGPGQCLSCRNYSRGGVCVTHCNFLNGEPREFAHEAECFSCHPECQPMEGTATCNGSGSDTCAQCAHFRDGPHCVSSCPHGVLGAKGPIYKYPDVQNECRPCHENCTQGCKGPELQDCLGQTLVLIGKTHLTMALTVIAGLVVIFMMLGGTFLYWRGRRIQNKRAMRRYLERGESIEPLDPSEKANKVLARIFKETELRKLKVLGSGVFGTVHKGVWIPEGESIKIPVCIKVIEDKSGRQSFQAVTDHMLAIGSLDHAHIVRLLGLCPGSSLQLVTQYLPLGSLLDHVRQHRGALGPQLLLNWGVQIAKGMYYLEEHGMVHRNLAARNVLLKSPSQVQVADFGVADLLPPDDKQLLYSEAKTPIKWMALESIHFGKYTHQSDVWSYGVTVWELMTFGAEPYAGLRLAEVPDLLEKGERLAQPQICTIDVYMVMVKCWMIDENIRPTFKELANEFTRMARDPPRYLVIKRESGPGIAPGPEPHGLTNKKLEEVELEPELDLDLDLEAEEDNLATTTLGSALSLPVGTLNRPRGSQSLLSPSSGYMPMNQGNLGESCQESAVSGSSERCPRPVSLHPMPRGCLASESSEGHVTGSEAELQEKVSMCRSRSRSRSPRPRGDSAYHSQRHSLLTPVTPLSPPGLEEEDVNGYVMPDTHLKGTPSSREGTLSSVGLSSVLGTEEEDEDEEYEYMNRRRRHSPPHPPRPSSLEELGYEYMDVGSDLSASLGSTQSCPLHPVPIMPTAGTTPDEDYEYMNRQRDGGGPGGDYAAMGACPASEQGYEEMRAFQGPGHQAPHVHYARLKTLRSLEATDSAFDNPDYWHSRLFPKANAQRT,1342,NP_001973.2.csv,refseq-ERBB3-NM_001982.3_clinical_seed_0_final,refseq-ERBB3-NM_001982.3.a2m,Invitae,refseq-ERBB3-NM_001982.3.npy,1,1342,1342
+NP_001976.1,MAELRVLVAVKRVIDYAVKIRVKPDRTGVVTDGVKHSMNPFCEIAVEEAVRLKEKKLVKEVIAVSCGPAQCQETIRTALAMGADRGIHVEVPPAEAERLGPLQVARVLAKLAEKEKVDLVLLGKQAIDDDCNQTGQMTAGFLDWPQGTFASQVTLEGDKLKVEREIDGGLETLRLKLPAVVTADLRLNEPRYATLPNIMKAKKKKIEVIKPGDLGVDLTSKLSVISVEDPPQRTAGVKVETTEDLVAKLKEIGRI,255,NP_001976.1.csv,refseq-ETFB-NM_001985.2_clinical_seed_0_final,refseq-ETFB-NM_001985.2.a2m,Invitae,refseq-ETFB-NM_001985.2.npy,1,255,255
+NP_001978.1,MSETPAQCSIKQERISYTPPESPVPSYASSTPLHVPVPRALRMEEDSIRLPAHLRLQPIYWSRDDVAQWLKWAENEFSLRPIDSNTFEMNGKALLLLTKEDFRYRSPHSGDVLYELLQHILKQRKPRILFSPFFHPGNSIHTQPEVILHQNHEEDNCVQRTPRPSVDNVHHNPPTIELLHRSRSPITTNHRPSPDPEQRPLRSPLDNMIRRLSPAERAQGPRPHQENNHQESYPLSVSPMENNHCPASSESHPKPSSPRQESTRVIQLMPSPIMHPLILNPRHSVDFKQSRLSEDGLHREGKPINLSHREDLAYMNHIMVSVSPPEEHAMPIGRIADCRLLWDYVYQLLSDSRYENFIRWEDKESKIFRIVDPNGLARLWGNHKNRTNMTYEKMSRALRHYYKLNIIRKEPGQRLLFRFMKTPDEIMSGRTDRLEHLESQELDEQIYQEDEC,452,NP_001978.1.csv,refseq-ETV6-NM_001987.4_clinical_seed_0_final,refseq-ETV6-NM_001987.4.a2m,Invitae,refseq-ETV6-NM_001987.4.npy,1,452,452
+NP_002001.1,MAPLGEVGNYFGVQDAVPFGNVPVLPVDSPVLLSDHLGQSEAGGLPRGPAVTDLDHLKGILRRRQLYCRTGFHLEIFPNGTIQGTRKDHSRFGILEFISIAVGLVSIRGVDSGLYLGMNEKGELYGSEKLTQECVFREQFEENWYNTYSSNLYKHVDTGRRYYVALNKDGTPREGTRTKRHQKFTHFLPRPVDPDKVPELYKDILSQS,208,NP_002001.1.csv,refseq-FGF9-NM_002010.2_clinical_seed_0_final,refseq-FGF9-NM_002010.2.a2m,Invitae,refseq-FGF9-NM_002010.2.npy,1,208,208
+NP_002008.2,MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKINPLPPQQEWINQPVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKELCKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKEDPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY,452,NP_002008.2.csv,refseq-FLI1-NM_002017.4_clinical_seed_0_final,refseq-FLI1-NM_002017.4.a2m,Invitae,refseq-FLI1-NM_002017.4.npy,1,452,452
+NP_002009.1,MEATGVLPFVRGVDLSGNDFKGGYFPENVKAMTSLRWLKLNRTGLCYLPEELAALQKLEHLSVSHNNLTTLHGELSSLPSLRAIVARANSLKNSGVPDDIFKLDDLSVLDLSHNQLTECPRELENAKNMLVLNLSHNSIDTIPNQLFINLTDLLYLDLSENRLESLPPQMRRLVHLQTLVLNGNPLLHAQLRQLPAMTALQTLHLRSTQRTQSNLPTSLEGLSNLADVDLSCNDLTRVPECLYTLPSLRRLNLSSNQITELSLCIDQWVHVETLNLSRNQLTSLPSAICKLSKLKKLYLNSNKLDFDGLPSGIGKLTNLEEFMAANNNLELVPESLCRCPKLRKLVLNKNHLVTLPEAIHFLTEIEVLDVRENPNLVMPPKPADRAAEWYNIDFSLQNQLRLAGASPATVAAAAAAGSGPKDPMARKMRLRRRKDSAQDDQAKQVLKGMSDVAQEKNKKQEESADARAPSGKVRRWDQGLEKPRLDYSEFFTEDVGQLPGLTIWQIENFVPVLVEEAFHGKFYEADCYIVLKTFLDDSGSLNWEIYYWIGGEATLDKKACSAIHAVNLRNYLGAECRTVREEMGDESEEFLQVFDNDISYIEGGTASGFYTVEDTHYVTRMYRVYGKKNIKLEPVPLKGTSLDPRFVFLLDRGLDIYVWRGAQATLSSTTKARLFAEKINKNERKGKAEITLLVQGQELPEFWEALGGEPSEIKKHVPEDFWPPQPKLYKVGLGLGYLELPQINYKLSVEHKQRPKVELMPRMRLLQSLLDTRCVYILDCWSDVFIWLGRKSPRLVRAAALKLGQELCGMLHRPRHATVSRSLEGTEAQVFKAKFKNWDDVLTVDYTRNAEAVLQSPGLSGKVKRDAEKKDQMKADLTALFLPRQPPMSLAEAEQLMEEWNEDLDGMEGFVLEGKKFARLPEEEFGHFYTQDCYVFLCRYWVPVEYEEEEKKEDKEEKAEGKEGEEATAEAEEKQPEEDFQCIVYFWQGREASNMGWLTFTFSLQKKFESLFPGKLEVVRMTQQQENPKFLSHFKRKFIIHRGKRKAVQGAQQPSLYQIRTNGSALCTRCIQINTDSSLLNSEFCFILKVPFESEDNQGIVYAWVGRASDPDEAKLAEDILNTMFDTSYSKQVINEGEEPENFFWVGIGAQKPYDDDAEYMKHTRLFRCSNEKGYFAVTEKCSDFCQDDLADDDIMLLDNGQEVYMWVGTQTSQVEIKLSLKACQVYIQHMRSKEHERPRRLRLVRKGNEQHAFTRCFHAWSAFCKALA,1269,NP_002009.1.csv,refseq-FLII-NM_002018.3_clinical_seed_0_final,refseq-FLII-NM_002018.3.a2m,Invitae,refseq-FLII-NM_002018.3.npy,1,1269,1269
+NP_002011.2,MQRGAALCLRLWLCLGLLDGLVSGYSMTPPTLNITEESHVIDTGDSLSISCRGQHPLEWAWPGAQEAPATGDKDSEDTGVVRDCEGTDARPYCKVLLLHEVHANDTGSYVCYYKYIKARIEGTTAASSYVFVRDFEQPFINKPDTLLVNRKDAMWVPCLVSIPGLNVTLRSQSSVLWPDGQEVVWDDRRGMLVSTPLLHDALYLQCETTWGDQDFLSNPFLVHITGNELYDIQLLPRKSLELLVGEKLVLNCTVWAEFNSGVTFDWDYPGKQAERGKWVPERRSQQTHTELSSILTIHNVSQHDLGSYVCKANNGIQRFRESTEVIVHENPFISVEWLKGPILEATAGDELVKLPVKLAAYPPPEFQWYKDGKALSGRHSPHALVLKEVTEASTGTYTLALWNSAAGLRRNISLELVVNVPPQIHEKEASSPSIYSRHSRQALTCTAYGVPLPLSIQWHWRPWTPCKMFAQRSLRRRQQQDLMPQCRDWRAVTTQDAVNPIESLDTWTEFVEGKNKTVSKLVIQNANVSAMYKCVVSNKVGQDERLIYFYVTTIPDGFTIESKPSEELLEGQPVLLSCQADSYKYEHLRWYRLNLSTLHDAHGNPLLLDCKNVHLFATPLAASLEEVAPGARHATLSLSIPRVAPEHEGHYVCEVQDRRSHDKHCHKKYLSVQALEAPRLTQNLTDLLVNVSDSLEMQCLVAGAHAPSIVWYKDERLLEEKSGVDLADSNQKLSIQRVREEDAGRYLCSVCNAKGCVNSSASVAVEGSEDKGSMEIVILVGTGVIAVFFWVLLLLIFCNMRRPAHADIKTGYLSIIMDPGEVPLEEQCEYLSYDASQWEFPRERLHLGRVLGYGAFGKVVEASAFGIHKGSSCDTVAVKMLKEGATASEHRALMSELKILIHIGNHLNVVNLLGACTKPQGPLMVIVEFCKYGNLSNFLRAKRDAFSPCAEKSPEQRGRFRAMVELARLDRRRPGSSDRVLFARFSKTEGGARRASPDQEAEDLWLSPLTMEDLVCYSFQVARGMEFLASRKCIHRDLAARNILLSESDVVKICDFGLARDIYKDPDYVRKGSARLPLKWMAPESIFDKVYTTQSDVWSFGVLLWEIFSLGASPYPGVQINEEFCQRLRDGTRMRAPELATPAIRRIMLNCWSGDPKARPAFSELVEILGDLLQGRGLQEEEEVCMAPRSSQSSEEGSFSQVSTMALHIAQADAEDSPPSLQRHSLAARYYNWVSFPGCLARGAETRGSSRMKTFEEFPMTPTTYKGSVDNQTDSGMVLASEEFEQIESRHRQESGFR,1298,NP_002011.2.csv,refseq-FLT4-NM_002020.4_clinical_seed_0_final,refseq-FLT4-NM_002020.4.a2m,Invitae,refseq-FLT4-NM_002020.4.npy,1,1298,1298
+NP_002015.1,MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDINESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKPATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKRAHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGANIQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKNGKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKENSTHFSQPNSTKVQRVLVASSVVAGESQKPELKAWQGMVPFVFVGTKDSIANATVLLDYHLNYLKEVDQLRLERLQIDEQLRQIGASSRPPPNRTDKEKSYVTDDGQGMGRGSRPYRNRGHGRRGPGYTSGTNSEASNASETESDHRDELSDWSLAPTEEERESFLRRGDGRRRGGGGRGQGGRGRGGGFKGNDDHSRTDNRPRNPREAKGRTTDGSLQIRVDCNNERSVHTKTLQNTSSEGSRLRTGKDRNQKKEKPDSVDGQQPLVNGVP,632,NP_002015.1.csv,refseq-FMR1-NM_002024.5_clinical_seed_0_final,refseq-FMR1-NM_002024.5.a2m,Invitae,refseq-FMR1-NM_002024.5.npy,1,632,632
+NP_002016.2,MDLFDFFRDWDLEQQCHYEQDRSALKKREWERRNQEVQQEDDLFSSGFDLFGEPYKVAEYTNKGDALANRVQNTLGNYDEMKNLLTNHSNQNHLVGIPKNSVPQNPNNKNEPSFFPEQKNRIIPPHQDNTHPSAPMPPPSVVILNSTLIHSNRKSKPEWSRDSHNPSTVLASQASGQPNKMQTLTQDQSQAKLEDFFVYPAEQPQIGEVEESNPSAKEDSNPNSSGEDAFKEIFQSNSPEESEFAVQAPGSPLVASSLLAPSSGLSVQNFPPGLYCKTSMGQQKPTAYVRPMDGQDQAPDISPTLKPSIEFENSFGNLSFGTLLDGKPSAASSKTKLPKFTILQTSEVSLPSDPSCVEEILREMTHSWPTPLTSMHTAGHSEQSTFSIPGQESQHLTPGFTLQKWNDPTTRASTKSVSFKSMLEDDLKLSSDEDDLEPVKTLTTQCTATELYQAVEKAKPRNNPVNPPLATPQPPPAVQASGGSGSSSESESSSESDSDTESSTTDSESNEAPRVATPEPEPPSTNKWQLDKWLNKVTSQNKSFICGQNETPMETISLPPPIIQPMEVQMKVKTNASQVPAEPKERPLLSLIREKARPRPTQKIPETKALKHKLSTTSETVSQRTIGKKQPKKVEKNTSTDEFTWPKPNITSSTPKEKESVELHDPPRGRNKATAHKPAPRKEPRPNIPLAPEKKKYRGPGKIVPKSREFIETDSSTSDSNTDQEETLQIKVLPPCIISGGNTAKSKEICGASLTLSTLMSSSGSNNNLSISNEEPTFSPIPVMQTEILSPLRDHENLKNLWVKIDLDLLSRVPGHSSLHAAPAKPDHKETATKPKRQTAVTAVEKPAPKGKRKHKPIEVAEKIPEKKQRLEEATTICLLPPCISPAPPHKPPNTRENNSSRRANRRKEEKLFPPPLSPLPEDPPRRRNVSGNNGPFGQDKNIAMTGQITSTKPKRTEGKFCATFKGISVNEGDTPKKASSATITVTNTAIATATVTATAIVTTTVTATATATATTTTTTTTISTITSTITTGLMDSSHLEMTSWAALPLLSSSSTNVRRPKLTFDDSVHNADYYMQEAKKLKHKADALFEKFGKAVNYADAALSFTECGNAMERDPLEAKSPYTMYSETVELLRYAMRLKNFASPLASDGDKKLAVLCYRCLSLLYLRMFKLKKDHAMKYSRSLMEYFKQNASKVAQIPSPWVSNGKNTPSPVSLNNVSPINAMGNCNNGPVTIPQRIHHMAASHVNITSNVLRGYEHWDMADKLTRENKEFFGDLDTLMGPLTQHSSMTNLVRYVRQGLCWLRIDAHLL,1311,NP_002016.2.csv,refseq-AFF2-NM_002025.3_clinical_seed_0_final,refseq-AFF2-NM_002025.3.a2m,Invitae,refseq-AFF2-NM_002025.3.npy,1,1311,1311
+NP_002038.2,MPSPRPVLLRGARAALLLLLPPRLLARPSLLLRRSLSAASCPPISLPAAASRSSMDGAGAEEVLAPLRLAVRQQGDLVRKLKEDKAPQVDVDKAVAELKARKRVLEAKELALQPKDDIVDRAKMEDTLKRRFFYDQAFAIYGGVSGLYDFGPVGCALKNNIIQTWRQHFIQEEQILEIDCTMLTPEPVLKTSGHVDKFADFMVKDVKNGECFRADHLLKAHLQKLMSDKKCSVEKKSEMESVLAQLDNYGQQELADLFVNYNVKSPITGNDLSPPVSFNLMFKTFIGPGGNMPGYLRPETAQGIFLNFKRLLEFNQGKLPFAAAQIGNSFRNEISPRSGLIRVREFTMAEIEHFVDPSEKDHPKFQNVADLHLYLYSAKAQVSGQSARKMRLGDAVEQGVINNTVLGYFIGRIYLYLTKVGISPDKLRFRQHMENEMAHYACDCWDAESKTSYGWIEIVGCADRSCYDLSCHARATKVPLVAEKPLKEPKTVNVVQFEPSKGAIGKAYKKDAKLVMEYLAICDECYITEMEMLLNEKGEFTIETEGKTFQLTKDMINVKRFQKTLYVEEVVPNVIEPSFGLGRIMYTVFEHTFHVREGDEQRTFFSFPAVVAPFKCSVLPLSQNQEFMPFVKELSEALTRHGVSHKVDDSSGSIGRRYARTDEIGVAFGVTIDFDTVNKTPHTATLRDRDSMRQIRAEISELPSIVQDLANGNITWADVEARYPLFEGQETGKKETIEE,739,NP_002038.2.csv,refseq-GARS-NM_002047.2_clinical_seed_0_final,refseq-GARS-NM_002047.2.a2m,Invitae,refseq-GARS-NM_002047.2.npy,1,739,739
+NP_002040.1,MEFPGLGSLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAAASSTAPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYAGWAYGKTGLYPASTVCPTREDSPPQAVEDLDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPVPNSAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMRKDGIQTRNRKASGKGKKKRGSSLGGTGAAEGPAGGFMVVAGGSGSGNCGEVASGLTLGPPGTAHLYQGLGPVVLSGPVSHLMPFPGPLLGSPTGSFPTGPMPPTTSTTVVAPLSS,413,NP_002040.1.csv,refseq-GATA1-NM_002049.3_clinical_seed_0_final,refseq-GATA1-NM_002049.3.a2m,Invitae,refseq-GATA1-NM_002049.3.npy,1,413,413
+NP_002043.2,MYQSLAMAANHGPPPGAYEAGGPGAFMHGAGAASSPVYVPTPRVPSSVLGLSYLQGGGAGSASGGASGGSSGGAASGAGPGTQQGSPGWSQAGADGAAYTPPPVSPRFSFPGTTGSLAAAAAAAAAREAAAYSSGGGAAGAGLAGREQYGRAGFAGSYSSPYPAYMADVGASWAAAAAASAGPFDSPVLHSLPGRANPAARHPNLDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMRKEGIQTRKRKPKNLNKSKTPAAPSGSESLPPASGASSNSSNATTSSSEEMRPIKTEPGLSSHYGHSSSVSQTFSVSAMSGHGPSIHPVLSALKLSPQGYASPVSQSPQTSSKQDSWNSLVLADSHGDIITA,442,NP_002043.2.csv,refseq-GATA4-NM_002052.3_clinical_seed_0_final,refseq-GATA4-NM_002052.3.a2m,Invitae,refseq-GATA4-NM_002052.3.npy,1,442,442
+NP_002046.1,MERRRITSAARRSYVSSGEMMVGGLAPGRRLGPGTRLSLARMPPPLPTRVDFSLAGALNAGFKETRASERAEMMELNDRFASYIEKVRFLEQQNKALAAELNQLRAKEPTKLADVYQAELRELRLRLDQLTANSARLEVERDNLAQDLATVRQKLQDETNLRLEAENNLAAYRQEADEATLARLDLERKIESLEEEIRFLRKIHEEEVRELQEQLARQQVHVELDVAKPDLTAALKEIRTQYEAMASSNMHEAEEWYRSKFADLTDAAARNAELLRQAKHEANDYRRQLQSLTCDLESLRGTNESLERQMREQEERHVREAASYQEALARLEEEGQSLKDEMARHLQEYQDLLNVKLALDIEIATYRKLLEGEENRITIPVQTFSNLQIRETSLDTKSVSEGHLKRNIVVKTVEMRDGEVIKESKQEHKDVM,432,NP_002046.1.csv,refseq-GFAP-NM_002055.4_clinical_seed_0_final,refseq-GFAP-NM_002055.4.a2m,Invitae,refseq-GFAP-NM_002055.4.npy,1,432,432
+NP_002056.2,MTTSASSHLNKGIKQVYMSLPQGEKVQAMYIWIDGTGEGLRCKTRTLDSEPKCVEELPEWNFDGSSTLQSEGSNSDMYLVPAAMFRDPFRKDPNKLVLCEVFKYNRRPAETNLRHTCKRIMDMVSNQHPWFGMEQEYTLMGTDGHPFGWPSNGFPGPQGPYYCGVGADRAYGRDIVEAHYRACLYAGVKIAGTNAEVMPAQWEFQIGPCEGISMGDHLWVARFILHRVCEDFGVIATFDPKPIPGNWNGAGCHTNFSTKAMREENGLKYIEEAIEKLSKRHQYHIRAYDPKGGLDNARRLTGFHETSNINDFSAGVANRSASIRIPRTVGQEKKGYFEDRRPSANCDPFSVTEALIRTCLLNETGDEPFQYKN,373,NP_002056.2.csv,refseq-GLUL-NM_002065.6_clinical_seed_0_final,refseq-GLUL-NM_002065.6.a2m,Invitae,refseq-GLUL-NM_002065.6.npy,1,373,373
+NP_002058.2,MTLESMMACCLSDEVKESKRINAEIEKQLRRDKRDARRELKLLLLGTGESGKSTFIKQMRIIHGAGYSEEDKRGFTKLVYQNIFTAMQAMIRAMETLKILYKYEQNKANALLIREVDVEKVTTFEHQYVSAIKTLWEDPGIQECYDRRREYQLSDSAKYYLTDVDRIATLGYLPTQQDVLRVRVPTTGIIEYPFDLENIIFRMVDVGGQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKDLLEDKILYSHLVDYFPEFDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCATDTENIRFVFAAVKDTILQLNLKEYNLV,359,NP_002058.2.csv,refseq-GNA11-NM_002067.2_clinical_seed_0_final,refseq-GNA11-NM_002067.2.a2m,Invitae,refseq-GNA11-NM_002067.2_theta_0.2.npy,1,359,359
+NP_002060.4,MGCTLSAEDKAAVERSKMIDRNLREDGEKAAREVKLLLLGAGESGKSTIVKQMKIIHEAGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRLKIDFGDSARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDSAAYYLNDLDRIAQPNYIPTQQDVLRTRVKTTGIVETHFTFKDLHFKMFDVGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHFTCATDTKNVQFVFDAVTDVIIKNNLKDCGLF,354,NP_002060.4.csv,refseq-GNAI1-NM_002069.5_clinical_seed_0_final,refseq-GNAI1-NM_002069.5.a2m,Invitae,refseq-GNAI1-NM_002069.5.npy,1,354,354
+NP_002063.2,MTLESIMACCLSEEAKEARRINDEIERQLRRDKRDARRELKLLLLGTGESGKSTFIKQMRIIHGSGYSDEDKRGFTKLVYQNIFTAMQAMIRAMDTLKIPYKYEHNKAHAQLVREVDVEKVSAFENPYVDAIKSLWNDPGIQECYDRRREYQLSDSTKYYLNDLDRVADPAYLPTQQDVLRVRVPTTGIIEYPFDLQSVIFRMVDVGGQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKDLLEEKIMYSHLVDYFPEYDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCATDTENIRFVFAAVKDTILQLNLKEYNLV,359,NP_002063.2.csv,refseq-GNAQ-NM_002072.4_clinical_seed_0_final,refseq-GNAQ-NM_002072.4.a2m,Invitae,refseq-GNAQ-NM_002072.4.npy,1,359,359
+NP_002065.1,MSELDQLRQEAEQLKNQIRDARKACADATLSQITNNIDPVGRIQMRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSYTTNKVHAIPLRSSWVMTCAYAPSGNYVACGGLDNICSIYNLKTREGNVRVSRELAGHTGYLSCCRFLDDNQIVTSSGDTTCALWDIETGQQTTTFTGHTGDVMSLSLAPDTRLFVSGACDASAKLWDVREGMCRQTFTGHESDINAICFFPNGNAFATGSDDATCRLFDLRADQELMTYSHDNIICGITSVSFSKSGRLLLAGYDDFNCNVWDALKADRAGVLAGHDNRVSCLGVTDDGMAVATGSWDSFLKIWN,340,NP_002065.1.csv,refseq-GNB1-NM_002074.4_clinical_seed_0_final,refseq-GNB1-NM_002074.4.a2m,Invitae,refseq-GNB1-NM_002074.4.npy,1,340,340
+NP_002067.1,MRLLPLAPGRLRRGSPRHLPSCSPALLLLVLGGCLGVFGVAAGTRRPNVVLLLTDDQDEVLGGMTPLKKTKALIGEMGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGKYLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIHGTNKHWLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANIDLGPTILDIAGYDLNKTQMDGMSLLPILRGASNLTWRSDVLVEYQGEGRNVTDPTCPSLSPGVSQCFPDCVCEDAYNNTYACVRTMSALWNLQYCEFDDQEVFVEVYNLTADPDQITNIAKTIDPELLGKMNYRLMMLQSCSGPTCRTPGVFDPGYRFDPRLMFSNRGSVRTRRFSKHLL,552,NP_002067.1.csv,refseq-GNS-NM_002076.3_clinical_seed_0_final,refseq-GNS-NM_002076.3.a2m,Invitae,refseq-GNS-NM_002076.3.npy,1,552,552
+NP_002071.2,MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKMNLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEVLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGYLAHAIHQVTK,430,NP_002071.2.csv,refseq-GOT2-NM_002080.3_clinical_seed_0_final,refseq-GOT2-NM_002080.3.a2m,Invitae,refseq-GOT2-NM_002080.3.npy,1,430,430
+NP_002078.1,MWTLVSWVALTAGLVAGTRCPDGQFCPVACCLDPGGASYSCCRPLLDKWPTTLSRHLGGPCQVDAHCSAGHSCIFTVSGTSSCCPFPEAVACGDGHHCCPRGFHCSADGRSCFQRSGNNSVGAIQCPDSQFECPDFSTCCVMVDGSWGCCPMPQASCCEDRVHCCPHGAFCDLVHTRCITPTGTHPLAKKLPAQRTNRAVALSSSVMCPDARSRCPDGSTCCELPSGKYGCCPMPNATCCSDHLHCCPQDTVCDLIQSKCLSKENATTDLLTKLPAHTVGDVKCDMEVSCPDGYTCCRLQSGAWGCCPFTQAVCCEDHIHCCPAGFTCDTQKGTCEQGPHQVPWMEKAPAHLSLPDPQALKRDVPCDNVSSCPSSDTCCQLTSGEWGCCPIPEAVCCSDHQHCCPQGYTCVAEGQCQRGSEIVAGLEKMPARRASLSHPRDIGCDQHTSCPVGQTCCPSLGGSWACCQLPHAVCCEDRQHCCPAGYTCNVKARSCEKEVVSAQPATFLARSPHVGVKDVECGEGHFCHDNQTCCRDNRQGWACCPYRQGVCCADRRHCCPAGFRCAARGTKCLRREAPRWDAPLRDPALRQLL,593,NP_002078.1.csv,refseq-GRN-NM_002087.3_clinical_seed_0_final,refseq-GRN-NM_002087.3.a2m,Invitae,refseq-GRN-NM_002087.3.npy,1,593,593
+NP_002100.2,MAERAALEELVKLQGERVRGLKQQKASAELIEEEVAKLLKLKAQLGPDESKQKFVLKTPKGTRDYSPRQMAVREKVFDVIIRCFKRHGAEVIDTPVFELKETLMGKYGEDSKLIYDLKDQGGELLSLRYDLTVPFARYLAMNKLTNIKRYHIAKVYRRDNPAMTRGRYREFYQCDFDIAGNFDPMIPDAECLKIMCEILSSLQIGDFLVKVNDRRILDGMFAICGVSDSKFRTICSSVDKLDKVSWEEVKNEMVGEKGLAPEVADRIGDYVQQHGGVSLVEQLLQDPKLSQNKQALEGLGDLKLLFEYLTLFGIDDKISFDLSLARGLDYYTGVIYEAVLLQTPAQAGEEPLGVGSVAAGGRYDGLVGMFDPKGRKVPCVGLSIGVERIFSIVEQRLEALEEKIRTTETQVLVASAQKKLLEERLKLVSELWDAGIKAELLYKKNPKLLNQLQYCEEAGIPLVAIIGEQELKDGVIKLRSVTSREEVDVRREDLVEEIKRRTGQPLCIC,509,NP_002100.2.csv,refseq-HARS-NM_002109.5_clinical_seed_0_final,refseq-HARS-NM_002109.5.a2m,Invitae,refseq-HARS-NM_002109.5.npy,1,509,509
+NP_002104.2,MWLLVSVILISRISSVGGEATFCDFPKINHGILYDEEKYKPFSQVPTGEVFYYSCEYNFVSPSKSFWTRITCTEEGWSPTPKCLRLCFFPFVENGHSESSGQTHLEGDTVQIICNTGYRLQNNENNISCVERGWSTPPKCRSTDTSCVNPPTVQNAHILSRQMSKYPSGERVRYECRSPYEMFGDEEVMCLNGNWTEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLEGNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYLRTGESAEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR,330,NP_002104.2.csv,refseq-CFHR1-NM_002113.2_clinical_seed_0_final,refseq-CFHR1-NM_002113.2.a2m,Invitae,refseq-CFHR1-NM_002113.2_theta_0.2.npy,1,330,330
+NP_002131.2,METEQPEETFPNTETNGEFGKRPAEDMEEEQAFKRSRNTDEMVELRILLQSKNAGAVIGKGGKNIKALRTDYNASVSVPDSSGPERILSISADIETIGEILKKIIPTLEEGLQLPSPTATSQLPLESDAVECLNYQHYKGSDFDCELRLLIHQSLAGGIIGVKGAKIKELRENTQTTIKLFQECCPHSTDRVVLIGGKPDRVVECIKIILDLISESPIKGRAQPYDPNFYDETYDYGGFTMMFDDRRGRPVGFPMRGRGGFDRMPPGRGGRPMPPSRRDYDDMSPRRGPPPPPPGRGGRGGSRARNLPLPPPPPPRGGDLMAYDRRGRPGDRYDGMVGFSADETWDSAIDTWSPSEWQMAYEPQGGSGYDYSYAGGRGSYGDLGGPIITTQVTIPKDLAGSIIGKGGQRIKQIRHESGASIKIDEPLEGSEDRIITITGTQDQIQNAQYLLQNSVKQYADVEGF,464,NP_002131.2.csv,refseq-HNRNPK-NM_002140.4_clinical_seed_0_final,refseq-HNRNPK-NM_002140.4.a2m,Invitae,refseq-HNRNPK-NM_002140.4.npy,1,464,464
+NP_002147.2,MLRLPTVFRQMRPVSRVLAPHLTRAYAKDVKFGADARALMLQGVDLLADAVAVTMGPKGRTVIIEQSWGSPKVTKDGVTVAKSIDLKDKYKNIGAKLVQDVANNTNEEAGDGTTTATVLARSIAKEGFEKISKGANPVEIRRGVMLAVDAVIAELKKQSKPVTTPEEIAQVATISANGDKEIGNIISDAMKKVGRKGVITVKDGKTLNDELEIIEGMKFDRGYISPYFINTSKGQKCEFQDAYVLLSEKKISSIQSIVPALEIANAHRKPLVIIAEDVDGEALSTLVLNRLKVGLQVVAVKAPGFGDNRKNQLKDMAIATGGAVFGEEGLTLNLEDVQPHDLGKVGEVIVTKDDAMLLKGKGDKAQIEKRIQEIIEQLDVTTSEYEKEKLNERLAKLSDGVAVLKVGGTSDVEVNEKKDRVTDALNATRAAVEEGIVLGGGCALLRCIPALDSLTPANEDQKIGIEIIKRTLKIPAMTIAKNAGVEGSLIVEKIMQSSSEVGYDAMAGDFVNMVEKGIIDPTKVVRTALLDAAGVASLLTTAEVVVTEIPKEEKDPGMGAMGGMGGGMGGGMF,573,NP_002147.2.csv,refseq-HSPD1-NM_002156.4_clinical_seed_0_final,refseq-HSPD1-NM_002156.4.a2m,Invitae,refseq-HSPD1-NM_002156.4.npy,1,573,573
+NP_002151.2,MGAMTQLLAGVFLAFLALATEGGVLKKVIRHKRQSGVNATLPEENQPVVFNHVYNIKLPVGSQCSVDLESASGEKDLAPPSEPSESFQEHTVDGENQIVFTHRINIPRRACGCAAAPDVKELLSRLEELENLVSSLREQCTAGAGCCLQPATGRLDTRPFCSGRGNFSTEGCGCVCEPGWKGPNCSEPECPGNCHLRGRCIDGQCICDDGFTGEDCSQLACPSDCNDQGKCVNGVCICFEGYAGADCSREICPVPCSEEHGTCVDGLCVCHDGFAGDDCNKPLCLNNCYNRGRCVENECVCDEGFTGEDCSELICPNDCFDRGRCINGTCYCEEGFTGEDCGKPTCPHACHTQGRCEEGQCVCDEGFAGVDCSEKRCPADCHNRGRCVDGRCECDDGFTGADCGELKCPNGCSGHGRCVNGQCVCDEGYTGEDCSQLRCPNDCHSRGRCVEGKCVCEQGFKGYDCSDMSCPNDCHQHGRCVNGMCVCDDGYTGEDCRDRQCPRDCSNRGLCVDGQCVCEDGFTGPDCAELSCPNDCHGQGRCVNGQCVCHEGFMGKDCKEQRCPSDCHGQGRCVDGQCICHEGFTGLDCGQHSCPSDCNNLGQCVSGRCICNEGYSGEDCSEVSPPKDLVVTEVTEETVNLAWDNEMRVTEYLVVYTPTHEGGLEMQFRVPGDQTSTIIQELEPGVEYFIRVFAILENKKSIPVSARVATYLPAPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNKEDEGEITKSLRRPETSYRQTGLAPGQEYEISLHIVKNNTRGPGLKRVTTTRLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTEDENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTTGLDAPRNLRRVSQTDNSITLEWRNGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKEDKESNPATINAATELDTPKDLQVSETAETSLTLLWKTPLAKFDRYRLNYSLPTGQWVGVQLPRNTTSYVLRGLEPGQEYNVLLTAEKGRHKSKPARVKASTEQAPELENLTVTEVGWDGLRLNWTAADQAYEHFIIQVQEANKVEAARNLTVPGSLRAVDIPGLKAATPYTVSIYGVIQGYRTPVLSAEASTGETPNLGEVVVAEVGWDALKLNWTAPEGAYEYFFIQVQEADTVEAAQNLTVPGGLRSTDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLTEEVPDMGNLTVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLRSMEIPGLRAGTPYTVTLHGEVRGHSTRPLAVEVVTEDLPQLGDLAVSEVGWDGLRLNWTAADNAYEHFVIQVQEVNKVEAAQNLTLPGSLRAVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEASTAKEPEIGNLNVSDITPESFNLSWMATDGIFETFTIEIIDSNRLLETVEYNISGAERTAHISGLPPSTDFIVYLSGLAPSIRTKTISATATTEALPLLENLTISDINPYGFTVSWMASENAFDSFLVTVVDSGKLLDPQEFTLSGTQRKLELRGLITGIGYEVMVSGFTQGHQTKPLRAEIVTEAEPEVDNLLVSDATPDGFRLSWTADEGVFDNFVLKIRDTKKQSEPLEITLLAPERTRDITGLREATEYEIELYGISKGRRSQTVSAIATTAMGSPKEVIFSDITENSATVSWRAPTAQVESFRITYVPITGGTPSMVTVDGTKTQTRLVKLIPGVEYLVSIIAMKGFEESEPVSGSFTTALDGPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEITRTVSGNTVEYALTDLEPATEYTLRIFAEKGPQKSSTITAKFTTDLDSPRDLTATEVQSETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDTTSYSLADLSPSTHYTAKIQALNGPLRSNMIQTIFTTIGLLYPFPKDCSQAMLNGDTTSGLYTIYLNGDKAEALEVFCDMTSDGGGWIVFLRRKNGRENFYQNWKAYAAGFGDRREEFWLGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSVGDAKTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYRNCHRVNLMGRYGDNNHSQGVNWFHWKGHEHSIQFAEMKLRPSNFRNLEGRRKRA,2201,NP_002151.2.csv,refseq-TNC-NM_002160.3_clinical_seed_0_final,refseq-TNC-NM_002160.3.a2m,Invitae,refseq-TNC-NM_002160.3.npy,1,2201,2201
+NP_002159.2,MAGYLRVVRSLCRASGSRPAWAPAALTAPTSQEQPRRHYADKRIKVAKPVVEMDGDEMTRIIWQFIKEKLILPHVDIQLKYFDLGLPNRDQTDDQVTIDSALATQKYSVAVKCATITPDEARVEEFKLKKMWKSPNGTIRNILGGTVFREPIICKNIPRLVPGWTKPITIGRHAHGDQYKATDFVADRAGTFKMVFTPKDGSGVKEWEVYNFPAGGVGMGMYNTDESISGFAHSCFQYAIQKKWPLYMSTKNTILKAYDGRFKDIFQEIFDKHYKTDFDKNKIWYEHRLIDDMVAQVLKSSGGFVWACKNYDGDVQSDILAQGFGSLGLMTSVLVCPDGKTIEAEAAHGTVTRHYREHQKGRPTSTNPIASIFAWTRGLEHRGKLDGNQDLIRFAQMLEKVCVETVESGAMTKDLAGCIHGLSNVKLNEHFLNTTDFLDTIKSNLDRALGRQ,452,NP_002159.2.csv,refseq-IDH2-NM_002168.3_clinical_seed_0_final,refseq-IDH2-NM_002168.3.a2m,Invitae,refseq-IDH2-NM_002168.3.npy,1,452,452
+NP_002171.2,MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRGVCLLKLQVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLATGILTRVTQKSVTVAFDESHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHSGPASSLIEVLFGRSAPSPASEIHPLTFFNTCLDTSQKEAVLFALSQKELAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQQHSLDAVLARSDSAQIVADIRKDIDQVFVKNKKTQDKREKSNFRNEIKLLRKELKEREEAAMLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDECAQALEASCWIPLLKARKCILAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVRTLTVQYRMHQAIMRWASDTMYLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEEDEQSKGNPGEVRLVSLHIQALVDAGVPARDIAVVSPYNLQVDLLRQSLVHRHPELEIKSVDGFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARRHVAVICDSRTVNNHAFLKTLVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSSHAATKPQGPATSTRTGSQRQEGGQEAAAPARQGRKKPAGKSLASEAPSQPSLNGGSPEGVESQDGVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITVSKRAPRPRAALGPPAGTGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQRVRSAQGQPASKEQQASGQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFAKCTAGVTTLGQFCQLCSRRYCLSHHLPEIHGCGERARAHARQRISREGVLYAGSGTKNGSLDPAKRAQLQRRLDKKLSELSNQRTSRRKERGT,993,NP_002171.2.csv,refseq-IGHMBP2-NM_002180.2_clinical_seed_0_final,refseq-IGHMBP2-NM_002180.2.a2m,Invitae,refseq-IGHMBP2-NM_002180.2.npy,1,993,993
+NP_002172.2,MSPARLRPRLHFCLVLLLLLVVPAAWGCGPGRVVGSRRRPPRKLVPLAYKQFSPNVPEKTLGASGRYEGKIARSSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDRLNSLAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRNKYGLLARLAVEAGFDWVYYESKAHVHCSVKSEHSAAAKTGGCFPAGAQVRLESGARVALSAVRPGDRVLAMGEDGSPTFSDVLIFLDREPHRLRAFQVIETQDPPRRLALTPAHLLFTADNHTEPAARFRATFASHVQPGQYVLVAGVPGLQPARVAAVSTHVALGAYAPLTKHGTLVVEDVVASCFAAVADHHLAQLAFWPLRLFHSLAWGSWTPGEGVHWYPQLLYRLGRLLLEEGSFHPLGMSGAGS,411,NP_002172.2.csv,refseq-IHH-NM_002181.4_clinical_seed_0_final,refseq-IHH-NM_002181.4.a2m,Invitae,refseq-IHH-NM_002181.4.npy,1,411,411
+NP_002176.2,MTILGTTFGMVFSLLQVVSGESGYAQNGDLEDAELDDYSFSCYSQLEVNGSQHSLTCAFEDPDVNITNLEFEICGALVEVKCLNFRKLQEIYFIETKKFLLIGKSNICVKVGEKSLTCKKIDLTTIVKPEAPFDLSVVYREGANDFVVTFNTSHLQKKYVKVLMHDVAYRQEKDENKWTHVNLSSTKLTLLQRKLQPAAMYEIKVRSIPDHYFKGFWSEWSPSYYFRTPEINNSSGEMDPILLTISILSFFSVALLVILACVLWKKRIKPIVWPSLPDHKKTLEHLCKKPRKNLNVSFNPESFLDCQIHRVDDIQARDEVEGFLQDTFPQQLEESEKQRLGGDVQSPNCPSEDVVITPESFGRDSSLTCLAGNVSACDAPILSSSRSLDCRESGKNGPHVYQDLLLSLGTTNSTLPPPFSLQSGILTLNPVAQGQPILTSLGSNQEEAYVTMSSFYQNQ,459,NP_002176.2.csv,refseq-IL7R-NM_002185.3_clinical_seed_0_final,refseq-IL7R-NM_002185.3.a2m,Invitae,refseq-IL7R-NM_002185.3.npy,1,459,459
+NP_002195.1,MGPGPSRAPRAPRLMLCALALMVAAGGCVVSAFNLDTRFLVVKEAGNPGSLFGYSVALHRQTERQQRYLLLAGAPRELAVPDGYTNRTGAVYLCPLTAHKDDCERMNITVKNDPGHHIIEDMWLGVTVASQGPAGRVLVCAHRYTQVLWSGSEDQRRMVGKCYVRGNDLELDSSDDWQTYHNEMCNSNTDYLETGMCQLGTSGGFTQNTVYFGAPGAYNWKGNSYMIQRKEWDLSEYSYKDPEDQGNLYIGYTMQVGSFILHPKNITIVTGAPRHRHMGAVFLLSQEAGGDLRRRQVLEGSQVGAYFGSAIALADLNNDGWQDLLVGAPYYFERKEEVGGAIYVFMNQAGTSFPAHPSLLLHGPSGSAFGLSVASIGDINQDGFQDIAVGAPFEGLGKVYIYHSSSKGLLRQPQQVIHGEKLGLPGLATFGYSLSGQMDVDENFYPDLLVGSLSDHIVLLRARPVINIVHKTLVPRPAVLDPALCTATSCVQVELCFAYNQSAGNPNYRRNITLAYTLEADRDRRPPRLRFAGSESAVFHGFFSMPEMRCQKLELLLMDNLRDKLRPIIISMNYSLPLRMPDRPRLGLRSLDAYPILNQAQALENHTEVQFQKECGPDNKCESNLQMRAAFVSEQQQKLSRLQYSRDVRKLLLSINVTNTRTSERSGEDAHEALLTLVVPPALLLSSVRPPGACQANETIFCELGNPFKRNQRMELLIAFEVIGVTLHTRDLQVQLQLSTSSHQDNLWPMILTLLVDYTLQTSLSMVNHRLQSFFGGTVMGESGMKTVEDVGSPLKYEFQVGPMGEGLVGLGTLVLGLEWPYEVSNGKWLLYPTEITVHGNGSWPCRPPGDLINPLNLTLSDPGDRPSSPQRRRRQLDPGGGQGPPPVTLAAAKKAKSETVLTCATGRAHCVWLECPIPDAPVVTNVTVKARVWNSTFIEDYRDFDRVRVNGWATLFLRTSIPTINMENKTTWFSVDIDSELVEELPAEIELWLVLVAVGAGLLLLGLIILLLWKCGFFKRARTRALYEAKRQKAEMKSQPSETERLTDDY,1051,NP_002195.1.csv,refseq-ITGA3-NM_002204.3_clinical_seed_0_final,refseq-ITGA3-NM_002204.3.a2m,Invitae,refseq-ITGA3-NM_002204.3.npy,1,1051,1051
+NP_002212.3,MAVYCYALNSLVIMNSANEMKSGGGPGPSGSETPPPPRRAVLSPGSVFSPGRGASFLFPPAESLSPEEPRSPGGWRSGRRRLNSSSGSGSGSSGSSVSSPSWAGRLRGDRQQVVAAGTLSPPGPEEAKRKLRILQRELQNVQVNQKVGMFEAHIQAQSSAIQAPRSPRLGRARSPSPCPFRSSSQPPGRVLVQGARSEERRTKSWGEQCPETSGTDSGRKGGPSLCSSQVKKGMPPLPGRAAPTGSEAQGPSAFVRMEKGIPASPRCGSPTAMEIDKRGSPTPGTRSCLAPSLGLFGASLTMATEVAARVTSTGPHRPQDLALTEPSGRARELEDLQPPEALVERQGQFLGSETSPAPERGGPRDGEPPGKMGKGYLPCGMPGSGEPEVGKRPEETTVSVQSAESSDSLSWSRLPRALASVGPEEARSGAPVGGGRWQLSDRVEGGSPTLGLLGGSPSAQPGTGNVEAGIPSGRMLEPLPCWDAAKDLKEPQCPPGDRVGVQPGNSRVWQGTMEKAGLAWTRGTGVQSEGTWESQRQDSDALPSPELLPQDPDKPFLRKACSPSNIPAVIITDMGTQEDGALEETQGSPRGNLPLRKLSSSSASSTGFSSSYEDSEEDISSDPERTLDPNSAFLHTLDQQKPRVSKSWRKIKNMVHWSPFVMSFKKKYPWIQLAGHAGSFKAAANGRILKKHCESEQRCLDRLMVDVLRPFVPAYHGDVVKDGERYNQMDDLLADFDSPCVMDCKMGIRTYLEEELTKARKKPSLRKDMYQKMIEVDPEAPTEEEKAQRAVTKPRYMQWRETISSTATLGFRIEGIKKEDGTVNRDFKKTKTREQVTEAFREFTKGNHNILIAYRDRLKAIRTTLEVSPFFKCHEVIGSSLLFIHDKKEQAKVWMIDFGKTTPLPEGQTLQHDVPWQEGNREDGYLSGLNNLVDILTEMSQDAPLA,946,NP_002212.3.csv,refseq-ITPKB-NM_002221.3_clinical_seed_0_final,refseq-ITPKB-NM_002221.3.a2m,Invitae,refseq-ITPKB-NM_002221.3.npy,1,946,946
+NP_002217.3,MRAQGRGRLPRRLLLLLALWVQAARPMGYFELQLSALRNVNGELLSGACCDGDGRTTRAGGCGHDECDTYVRVCLKEYQAKVTPTGPCSYGHGATPVLGGNSFYLPPAGAAGDRARARARAGGDQDPGLVVIPFQFAWPRSFTLIVEAWDWDNDTTPNEELLIERVSHAGMINPEDRWKSLHFSGHVAHLELQIRVRCDENYYSATCNKFCRPRNDFFGHYTCDQYGNKACMDGWMGKECKEAVCKQGCNLLHGGCTVPGECRCSYGWQGRFCDECVPYPGCVHGSCVEPWQCNCETNWGGLLCDKDLNYCGSHHPCTNGGTCINAEPDQYRCTCPDGYSGRNCEKAEHACTSNPCANGGSCHEVPSGFECHCPSGWSGPTCALDIDECASNPCAAGGTCVDQVDGFECICPEQWVGATCQLDANECEGKPCLNAFSCKNLIGGYYCDCIPGWKGINCHINVNDCRGQCQHGGTCKDLVNGYQCVCPRGFGGRHCELERDECASSPCHSGGLCEDLADGFHCHCPQGFSGPLCEVDVDLCEPSPCRNGARCYNLEGDYYCACPDDFGGKNCSVPREPCPGGACRVIDGCGSDAGPGMPGTAASGVCGPHGRCVSQPGGNFSCICDSGFTGTYCHENIDDCLGQPCRNGGTCIDEVDAFRCFCPSGWEGELCDTNPNDCLPDPCHSRGRCYDLVNDFYCACDDGWKGKTCHSREFQCDAYTCSNGGTCYDSGDTFRCACPPGWKGSTCAVAKNSSCLPNPCVNGGTCVGSGASFSCICRDGWEGRTCTHNTNDCNPLPCYNGGICVDGVNWFRCECAPGFAGPDCRINIDECQSSPCAYGATCVDEINGYRCSCPPGRAGPRCQEVIGFGRSCWSRGTPFPHGSSWVEDCNSCRCLDGRRDCSKVWCGWKPCLLAGQPEALSAQCPLGQRCLEKAPGQCLRPPCEAWGECGAEEPPSTPCLPRSGHLDNNCARLTLHFNRDHVPQGTTVGAICSGIRSLPATRAVARDRLLVLLCDRASSGASAVEVAVSFSPARDLPDSSLIQGAAHAIVAAITQRGNSSLLLAVTEVKVETVVTGGSSTGLLVPVLCGAFSVLWLACVVLCVWWTRKRRKERERSRLPREESANNQWAPLNPIRNPIERPGGHKDVLYQCKNFTPPPRRADEALPGPAGHAAVREDEEDEDLGRGEEDSLEAEKFLSHKFTKDPGRSPGRPAHWASGPKVDNRAVRSINEARYAGKE,1238,NP_002217.3.csv,refseq-JAG2-NM_002226.4_clinical_seed_0_final,refseq-JAG2-NM_002226.4.a2m,Invitae,refseq-JAG2-NM_002226.4.npy,1,1238,1238
+NP_002226.1,MRSEKSLTLAAPGEVRGPEGEQQDAGDFPEAGGGGGCCSSERLVINISGLRFETQLRTLSLFPDTLLGDPGRRVRFFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDIFLEEIRFYQLGDEALAAFREDEGCLPEGGEDEKPLPSQPFQRQVWLLFEYPESSGPARGIAIVSVLVILISIVIFCLETLPQFRVDGRGGNNGGVSRVSPVSRGSQEEEEDEDDSYTFHHGITPGEMGTGGSSSLSTLGGSFFTDPFFLVETLCIVWFTFELLVRFSACPSKPAFFRNIMNIIDLVAIFPYFITLGTELVQQQEQQPASGGGGQNGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGKTLQASMRELGLLIFFLFIGVILFSSAVYFAEADDDDSLFPSIPDAFWWAVVTMTTVGYGDMYPMTVGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEQEEQGQYTHVTCGQPAPDLRATDNGLGKPDFPEANRERRPSYLPTPHRAYAEKRMLTEV,529,NP_002226.1.csv,refseq-KCNA6-NM_002235.4_clinical_seed_0_final,refseq-KCNA6-NM_002235.4.a2m,Invitae,refseq-KCNA6-NM_002235.4.npy,1,529,529
+NP_002231.1,MAKLTESMTNVLEGDSMDQDVESPVAIHQPKLPKQARDDLPRHISRDRTKRKIQRYVRKDGKCNVHHGNVRETYRYLTDIFTTLVDLKWRFNLLIFVMVYTVTWLFFGMIWWLIAYIRGDMDHIEDPSWTPCVTNLNGFVSAFLFSIETETTIGYGYRVITDKCPEGIILLLIQSVLGSIVNAFMVGCMFVKISQPKKRAETLVFSTHAVISMRDGKLCLMFRVGDLRNSHIVEASIRAKLIKSKQTSEGEFIPLNQTDINVGYYTGDDRLFLVSPLIISHEINQQSPFWEISKAQLPKEELEIVVILEGMVEATGMTCQARSSYITSEILWGYRFTPVLTLEDGFYEVDYNSFHETYETSTPSLSAKELAELASRAELPLSWSVSSKLNQHAELETEEEEKNLEEQTERNGDVANLENESKV,423,NP_002231.1.csv,refseq-KCNJ6-NM_002240.4_clinical_seed_0_final,refseq-KCNJ6-NM_002240.4.a2m,Invitae,refseq-KCNJ6-NM_002240.4.npy,1,423,423
+NP_002232.2,MTSVAKVYYSQTTQTESRPLMGPGIRRRRVLTKDGRSNVRMEHIADKRFLYLKDLWTTFIDMQWRYKLLLFSATFAGTWFLFGVVWYLVAVAHGDLLELDPPANHTPCVVQVHTLTGAFLFSLESQTTIGYGFRYISEECPLAIVLLIAQLVLTTILEIFITGTFLAKIARPKKRAETIRFSQHAVVASHNGKPCLMIRVANMRKSLLIGCQVTGKLLQTHQTKEGENIRLNQVNVTFQVDTASDSPFLILPLTFYHVVDETSPLKDLPLRSGEGDFELVLILSGTVESTSATCQVRTSYLPEEILWGYEFTPAISLSASGKYIADFSLFDQVVKVASPSGLRDSTVRYGDPEKLKLEESLREQAEKEGSALSVRISNV,379,NP_002232.2.csv,refseq-KCNJ10-NM_002241.4_clinical_seed_0_final,refseq-KCNJ10-NM_002241.4.a2m,Invitae,refseq-KCNJ10-NM_002241.4.npy,1,379,379
+NP_002233.2,MDSSNCKVIAPLLSQRYRRMVTKDGHSTLQMDGAQRGLAYLRDAWGILMDMRWRWMMLVFSASFVVHWLVFAVLWYVLAEMNGDLELDHDAPPENHTICVKYITSFTAAFSFSLETQLTIGYGTMFPSGDCPSAIALLAIQMLLGLMLEAFITGAFVAKIARPKNRAFSIRFTDTAVVAHMDGKPNLIFQVANTRPSPLTSVRVSAVLYQERENGKLYQTSVDFHLDGISSDECPFFIFPLTYYHSITPSSPLATLLQHENPSHFELVVFLSAMQEGTGEICQRRTSYLPSEIMLHHCFASLLTRGSKGEYQIKMENFDKTVPEFPTPLVSKSPNRTDLDIHINGQSIDNFQISETGLTE,360,NP_002233.2.csv,refseq-KCNJ13-NM_002242.4_clinical_seed_0_final,refseq-KCNJ13-NM_002242.4.a2m,Invitae,refseq-KCNJ13-NM_002242.4.npy,1,360,360
+NP_002237.1,MKRQNVRTLALIVCTFTYLLVGAAVFDALESEPELIERQRLELRQQELRARYNLSQGGYEELERVVLRLKPHKAGVQWRFAGSFYFAITVITTIGYGHAAPSTDGGKVFCMFYALLGIPLTLVMFQSLGERINTLVRYLLHRAKKGLGMRRADVSMANMVLIGFFSCISTLCIGAAAFSHYEHWTFFQAYYYCFITLTTIGFGDYVALQKDQALQTQPQYVAFSFVYILTGLTVIGAFLNLVVLRFMTMNAEDEKRDAEHRALLTRNGQAGGGGGGGSAHTTDTASSTAAAGGGGFRNVYAEVLHFQSMCSCLWYKSREKLQYSIPMIIPRDLSTSDTCVEQSHSSPGGGGRYSDTPSRRCLCSGAPRSAISSVSTGLHSLSTFRGLMKRRSSV,394,NP_002237.1.csv,refseq-KCNK3-NM_002246.2_clinical_seed_0_final,refseq-KCNK3-NM_002246.2.a2m,Invitae,refseq-KCNK3-NM_002246.2.npy,1,394,394
+NP_002238.2,MANGGGGGGGSSGGGGGGGGSSLRMSSNIHANHLSLDASSSSSSSSSSSSSSSSSSSSSSVHEPKMDALIIPVTMEVPCDSRGQRMWWAFLASSMVTFFGGLFIILLWRTLKYLWTVCCHCGGKTKEAQKINNGSSQADGTLKPVDEKEEAVAAEVGWMTSVKDWAGVMISAQTLTGRVLVVLVFALSIGALVIYFIDSSNPIESCQNFYKDFTLQIDMAFNVFFLLYFGLRFIAANDKLWFWLEVNSVVDFFTVPPVFVSVYLNRSWLGLRFLRALRLIQFSEILQFLNILKTSNSIKLVNLLSIFISTWLTAAGFIHLVENSGDPWENFQNNQALTYWECVYLLMVTMSTVGYGDVYAKTTLGRLFMVFFILGGLAMFASYVPEIIELIGNRKKYGGSYSAVSGRKHIVVCGHITLESVSNFLKDFLHKDRDDVNVEIVFLHNISPNLELEALFKRHFTQVEFYQGSVLNPHDLARVKIESADACLILANKYCADPDAEDASNIMRVISIKNYHPKIRIITQMLQYHNKAHLLNIPSWNWKEGDDAICLAELKLGFIAQSCLAQGLSTMLANLFSMRSFIKIEEDTWQKYYLEGVSNEMYTEYLSSAFVGLSFPTVCELCFVKLKLLMIAIEYKSANRESRILINPGNHLKIQEGTLGFFIASDAKEVKRAFFYCKACHDDITDPKRIKKCGCKRLEDEQPSTLSPKKKQRNGGMRNSPNTSPKLMRHDPLLIPGNDQIDNMDSNVKKYDSTGMFHWCAPKEIEKVILTRSEAAMTVLSGHVVVCIFGDVSSALIGLRNLVMPLRASNFHYHELKHIVFVGSIEYLKREWETLHNFPKVSILPGTPLSRADLRAVNINLCDMCVILSANQNNIDDTSLQDKECILASLNIKSMQFDDSIGVLQANSQGFTPPGMDRSSPDNSPVHGMLRQPSITTGVNIPIITELVNDTNVQFLDQDDDDDPDTELYLTQPFACGTAFAVSVLDSLMSATYFNDNILTLIRTLVTGGATPELEALIAEENALRGGYSTPQTLANRDRCRVAQLALLDGPFADLGDGGCYGDLFCKALKTYNMLCFGIYRLRDAHLSTPSQCTKRYVITNPPYEFELVPTDLIFCLMQFDHNAGQSRASLSHSSHSSQSSSKKSSSVHSIPSTANRQNRPKSRESRDKQKYVQEERL,1178,NP_002238.2.csv,refseq-KCNMA1-NM_002247.3_clinical_seed_0_final,refseq-KCNMA1-NM_002247.3.a2m,Invitae,refseq-KCNMA1-NM_002247.3.npy,1,1178,1178
+NP_002240.3,MDTSGHFHDSGVGDLDEDPKCPCPSSGDEQQQQQQQQQQQQPPPPAPPAAPQQPLGPSLQPQPPQLQQQQQQQQQQQQQQPPHPLSQLAQLQSQPVHPGLLHSSPTAFRAPPSSNSTAILHPSSRQGSQLNLNDHLLGHSPSSTATSGPGGGSRHRQASPLVHRRDSNPFTEIAMSSCKYSGGVMKPLSRLSASRRNLIEAETEGQPLQLFSPSNPPEIVISSREDNHAHQTLLHHPNATHNHQHAGTTASSTTFPKANKRKNQNIGYKLGHRRALFEKRKRLSDYALIFGMFGIVVMVIETELSWGLYSKDSMFSLALKCLISLSTIILLGLIIAYHTREVQLFVIDNGADDWRIAMTYERILYISLEMLVCAIHPIPGEYKFFWTARLAFSYTPSRAEADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFVMKTLMTICPGTVLLVFSISLWIIAAWTVRVCERYHDQQDVTSNFLGAMWLISITFLSIGYGDMVPHTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVHNFMMDTQLTKRIKNAAANVLRETWLIYKHTKLLKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLSDQANTLVDLSKMQNVMYDLITELNDRSEDLEKQIGSLESKLEHLTASFNSLPLLIADTLRQQQQQLLSAIIEARGVSVAVGTTHTPISDSPIGVSSTSFPTPYTSSSSC,731,NP_002240.3.csv,refseq-KCNN3-NM_002249.5_clinical_seed_0_final,refseq-KCNN3-NM_002249.5.a2m,Invitae,refseq-KCNN3-NM_002249.5.npy,1,731,731
+NP_002241.1,MGGDLVLGLGALRRRKRLLEQEKSLAGWALVLAGTGIGLMVLHAEMLWFGGCSWALYLFLVKCTISISTFLLLCLIVAFHAKEVQLFMTDNGLRDWRVALTGRQAAQIVLELVVCGLHPAPVRGPPCVQDLGAPLTSPQPWPGFLGQGEALLSLAMLLRLYLVPRAVLLRSGVLLNASYRSIGALNQVRFRHWFVAKLYMNTHPGRLLLGLTLGLWLTTAWVLSVAERQAVNATGHLSDTLWLIPITFLTIGYGDVVPGTMWGKIVCLCTGVMGVCCTALLVAVVARKLEFNKAEKHVHNFMMDIQYTKEMKESAARVLQEAWMFYKHTRRKESHAARRHQRKLLAAINAFRQVRLKHRKLREQVNSMVDISKMHMILYDLQQNLSSSHRALEKQIDTLAGKLDALTELLSTALGPRQLPEPSQQSK,427,NP_002241.1.csv,refseq-KCNN4-NM_002250.2_clinical_seed_0_final,refseq-KCNN4-NM_002250.2.a2m,Invitae,refseq-KCNN4-NM_002250.2.npy,1,427,427
+NP_002244.1,MQSKVLLAVALWLCVETRAASVGLPSVSLDLPRLSIQKDILTIKANTTLQITCRGQRDLDWLWPNNQSGSEQRVEVTECSDGLFCKTLTIPKVIGNDTGAYKCFYRETDLASVIYVYVQDYRSPFIASVSDQHGVVYITENKNKTVVIPCLGSISNLNVSLCARYPEKRFVPDGNRISWDSKKGFTIPSYMISYAGMVFCEAKINDESYQSIMYIVVVVGYRIYDVVLSPSHGIELSVGEKLVLNCTARTELNVGIDFNWEYPSSKHQHKKLVNRDLKTQSGSEMKKFLSTLTIDGVTRSDQGLYTCAASSGLMTKKNSTFVRVHEKPFVAFGSGMESLVEATVGERVRIPAKYLGYPPPEIKWYKNGIPLESNHTIKAGHVLTIMEVSERDTGNYTVILTNPISKEKQSHVVSLVVYVPPQIGEKSLISPVDSYQYGTTQTLTCTVYAIPPPHHIHWYWQLEEECANEPSQAVSVTNPYPCEEWRSVEDFQGGNKIEVNKNQFALIEGKNKTVSTLVIQAANVSALYKCEAVNKVGRGERVISFHVTRGPEITLQPDMQPTEQESVSLWCTADRSTFENLTWYKLGPQPLPIHVGELPTPVCKNLDTLWKLNATMFSNSTNDILIMELKNASLQDQGDYVCLAQDRKTKKRHCVVRQLTVLERVAPTITGNLENQTTSIGESIEVSCTASGNPPPQIMWFKDNETLVEDSGIVLKDGNRNLTIRRVRKEDEGLYTCQACSVLGCAKVEAFFIIEGAQEKTNLEIIILVGTAVIAMFFWLLLVIILRTVKRANGGELKTGYLSIVMDPDELPLDEHCERLPYDASKWEFPRDRLKLGKPLGRGAFGQVIEADAFGIDKTATCRTVAVKMLKEGATHSEHRALMSELKILIHIGHHLNVVNLLGACTKPGGPLMVIVEFCKFGNLSTYLRSKRNEFVPYKTKGARFRQGKDYVGAIPVDLKRRLDSITSSQSSASSGFVEEKSLSDVEEEEAPEDLYKDFLTLEHLICYSFQVAKGMEFLASRKCIHRDLAARNILLSEKNVVKICDFGLARDIYKDPDYVRKGDARLPLKWMAPETIFDRVYTIQSDVWSFGVLLWEIFSLGASPYPGVKIDEEFCRRLKEGTRMRAPDYTTPEMYQTMLDCWHGEPSQRPTFSELVEHLGNLLQANAQQDGKDYIVLPISETLSMEEDSGLSLPTSPVSCMEEEEVCDPKFHYDNTAGISQYLQNSKRKSRPVSVKTFEDIPLEEPEVKVIPDDNQTDSGMVLASEELKTLEDRTKLSPSFGGMVPSKSRESVASEGSNQTSGYQSGYHSDDTDTTVYSSEEAELLKLIEIGVQTGSTAQILQPDSGTTLSSPPV,1356,NP_002244.1.csv,refseq-KDR-NM_002253.2_clinical_seed_0_final,refseq-KDR-NM_002253.2.a2m,Invitae,refseq-KDR-NM_002253.2.npy,1,1356,1356
+NP_002258.2,MAENPSLENHRIKSFKNKGRDVETMRRHRNEVTVELRKNKRDEHLLKKRNVPQEESLEDSDVDADFKAQNVTLEAILQNATSDNPVVQLSAVQAARKLLSSDRNPPIDDLIKSGILPILVKCLERDDNPSLQFEAAWALTNIASGTSAQTQAVVQSNAVPLFLRLLRSPHQNVCEQAVWALGNIIGDGPQCRDYVISLGVVKPLLSFISPSIPITFLRNVTWVIVNLCRNKDPPPPMETVQEILPALCVLIYHTDINILVDTVWALSYLTDGGNEQIQMVIDSGVVPFLVPLLSHQEVKVQTAALRAVGNIVTGTDEQTQVVLNCDVLSHFPNLLSHPKEKINKEAVWFLSNITAGNQQQVQAVIDAGLIPMIIHQLAKGDFGTQKEAAWAISNLTISGRKDQVEYLVQQNVIPPFCNLLSVKDSQVVQVVLDGLKNILIMAGDEASTIAEIIEECGGLEKIEVLQQHENEDIYKLAFEIIDQYFSGDDIDEDPCLIPEATQGGTYNFDPTANLQTKEFNF,521,NP_002258.2.csv,refseq-KPNA3-NM_002267.3_clinical_seed_0_final,refseq-KPNA3-NM_002267.3.a2m,Invitae,refseq-KPNA3-NM_002267.3.npy,1,521,521
+NP_002283.3,MELTSRERGRGQPLPWELRLGLLLSVLAATLAQAPAPDVPGCSRGSCYPATGDLLVGRADRLTASSTCGLNGPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPHSHRIQNVVTSFAPQRRAAWWQSENGIPAVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWHVYRYFSYDCGADFPGVPLAPPRHWDDVVCESRYSEIEPSTEGEVIYRVLDPAIPIPDPYSSRIQNLLKITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVVRGNCFCYGHASECAPAPGAPAHAEGMVHGACICKHNTRGLNCEQCQDFYRDLPWRPAEDGHSHACRKCECHGHTHSCHFDMAVYLASGNVSGGVCDGCQHNTAGRHCELCRPFFYRDPTKDLRDPAVCRSCDCDPMGSQDGGRCDSHDDPALGLVSGQCRCKEHVVGTRCQQCRDGFFGLSISDRLGCRRCQCNARGTVPGSTPCDPNSGSCYCKRLVTGRGCDRCLPGHWGLSHDLLGCRPCDCDVGGALDPQCDEGTGQCHCRQHMVGRRCEQVQPGYFRPFLDHLIWEAEDTRGQVLDVVERLVTPGETPSWTGSGFVRLQEGQTLEFLVASVPKAMDYDLLLRLEPQVPEQWAELELIVQRPGPVPAHSLCGHLVPKDDRIQGTLQPHARYLIFPNPVCLEPGISYKLHLKLVRTGGSAQPETPYSGPGLLIDSLVLLPRVLVLEMFSGGDAAALERQATFERYQCHEEGLVPSKTSPSEACAPLLISLSTLIYNGALPCQCNPQGSLSSECNPHGGQCLCKPGVVGRRCDLCAPGYYGFGPTGCQACQCSHEGALSSLCEKTSGQCLCRTGAFGLRCDRCQRGQWGFPSCRPCVCNGHADECNTHTGACLGCRDHTGGEHCERCIAGFHGDPRLPYGGQCRPCPCPEGPGSQRHFATSCHQDEYSQQIVCHCRAGYTGLRCEACAPGHFGDPSRPGGRCQLCECSGNIDPMDPDACDPHTGQCLRCLHHTEGPHCAHCKPGFHGQAARQSCHRCTCNLLGTNPQQCPSPDQCHCDPSSGQCPCLPNVQGPSCDRCAPNFWNLTSGHGCQPCACHPSRARGPTCNEFTGQCHCRAGFGGRTCSECQELHWGDPGLQCHACDCDSRGIDTPQCHRFTGHCSCRPGVSGVRCDQCARGFSGIFPACHPCHACFGDWDRVVQDLAARTQRLEQRAQELQQTGVLGAFESSFWHMQEKLGIVQGIVGARNTSAASTAQLVEATEELRREIGEATEHLTQLEADLTDVQDENFNANHALSGLERDRLALNLTLRQLDQHLDLLKHSNFLGAYDSIRHAHSQSAEAERRANTSALAVPSPVSNSASARHRTEALMDAQKEDFNSKHMANQRALGKLSAHTHTLSLTDINELVCGAPGDAPCATSPCGGAGCRDEDGQPRCGGLSCNGAAATADLALGRARHTQAELQRALAEGGSILSRVAETRRQASEAQQRAQAALDKANASRGQVEQANQELQELIQSVKDFLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIAERVRSLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEKQKAETVQAALEEAQRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGAERALSSAGERARQLDALLEALKLKRAGNSLAASTAEETAGSAQGRAQEAEQLLRGPLGDQYQTVKALAERKAQGVLAAQARAEQLRDEARDLLQAAQDKLQRLQELEGTYEENERALESKAAQLDGLEARMRSVLQAINLQVQIYNTCQ,1798,NP_002283.3.csv,refseq-LAMB2-NM_002292.3_clinical_seed_0_final,refseq-LAMB2-NM_002292.3.a2m,Invitae,refseq-LAMB2-NM_002292.3.npy,1,1798,1798
+NP_002287.2,MPSRKFADGEVVRGRWPGSSLYYEVEILSHDSTSQLYTVKYKDGTELELKENDIKPLTSFRQRKGGSTSSSPSRRRGSRSRSRSRSPGRPPKSARRSASASHQADIKEARREVEVKLTPLILKPFGNSISRYNGEPEHIERNDAPHKNTQEKFSLSQESSYIATQYSLRPRREEVKLKEIDSKEEKYVAKELAVRTFEVTPIRAKDLEFGGVPGVFLIMFGLPVFLFLLLLMCKQKDPSLLNFPPPLPALYELWETRVFGVYLLWFLIQVLFYLLPIGKVVEGTPLIDGRRLKYRLNGFYAFILTSAVIGTSLFQGVEFHYVYSHFLQFALAATVFCVVLSVYLYMRSLKAPRNDLSPASSGNAVYDFFIGRELNPRIGTFDLKYFCELRPGLIGWVVINLVMLLAEMKIQDRAVPSLAMILVNSFQLLYVVDALWNEEALLTTMDIIHDGFGFMLAFGDLVWVPFIYSFQAFYLVSHPNEVSWPMASLIIVLKLCGYVIFRGANSQKNAFRKNPSDPKLAHLKTIHTSTGKNLLVSGWWGFVRHPNYLGDLIMALAWSLPCGFNHILPYFYIIYFTMLLVHREARDEYHCKKKYGVAWEKYCQRVPYRIFPYIY,615,NP_002287.2.csv,refseq-LBR-NM_002296.3_clinical_seed_0_final,refseq-LBR-NM_002296.3.a2m,Invitae,refseq-LBR-NM_002296.3.npy,1,615,615
+NP_002290.2,MELSWHVVFIALLSFSCWGSDWESDRNFISTAGPLTNDLLHNLSGLLGDQSSNFVAGDKDMYVCHQPLPTFLPEYFSSLHASQITHYKVFLSWAQLLPAGSTQNPDEKTVQCYRRLLKALKTARLQPMVILHHQTLPASTLRRTEAFADLFADYATFAFHSFGDLVGIWFTFSDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESYAFQGGKLSVVLRAEDIPELLLEPPISALAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKVFIFNLKLPDCPSTMKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSLTGSLALQPDQQQDHETTDSSPASAYQRIWEAFANQSRAERDAFLQDTFPEGFLWGASTGAFNVEGGWAEGGRGVSIWDPRRPLNTTEGQATLEVASDSYHKVASDVALLCGLRAQVYKFSISWSRIFPMGHGSSPSLPGVAYYNKLIDRLQDAGIEPMATLFHWDLPQALQDHGGWQNESVVDAFLDYAAFCFSTFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGISDPGVASFKVAHLVLKAHARTWHHYNSHHRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFLHFMLGWFAHPVFVDGDYPATLRTQIQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLGLSHYTSRLISNAPQNTCIPSYDTIGGFSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVSLEYTRGKVPIYLAGNGMPIGESENLFDDSLRVDYFNQYINEVLKAIKEDSVDVRSYIARSLIDGFEGPSGYSQRFGLHHVNFSDSSKSRTPRKSAYFFTSIIEKNGFLTKGAKRLLPPNTVNLPSKVRAFTFPSEVPSKAKVVWEKFSSQPKFERDLFYHGTFRDDFLWGVSSSAYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNATGDIACDSYHQLDADLNMLRALKVKAYRFSISWSRIFPTGRNSSINSHGVDYYNRLINGLVASNIFPMVTLFHWDLPQALQDIGGWENPALIDLFDSYADFCFQTFGDRVKFWMTFNEPMYLAWLGYGSGEFPPGVKDPGWAPYRIAHAVIKAHARVYHTYDEKYRQEQKGVISLSLSTHWAEPKSPGVPRDVEAADRMLQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLPSFTEEEKRFIRATADVFCLNTYYSRIVQHKTPRLNPPSYEDDQEMAEEEDPSWPSTAMNRAAPWGTRRLLNWIKEEYGDIPIYITENGVGLTNPNTEDTDRIFYHKTYINEALKAYRLDGIDLRGYVAWSLMDNFEWLNGYTVKFGLYHVDFNNTNRPRTARASARYYTEVITNNGMPLAREDEFLYGRFPEGFIWSAASAAYQIEGAWRADGKGLSIWDTFSHTPLRVENDAIGDVACDSYHKIAEDLVTLQNLGVSHYRFSISWSRILPDGTTRYINEAGLNYYVRLIDTLLAASIQPQVTIYHWDLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPFVIAYQGYGYGTAAPGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWAEPRDPSNQEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEFTESEKRRINGTYDFFGFNHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLKMTPFGFRRILNWLKEEYNDPPIYVTENGVSQREETDLNDTARIYYLRTYINEALKAVQDKVDLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSLPRIPKASAKFYASVVRCNGFPDPATGPHACLHQPDAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSLVLLGVCGLAFLSYKYCKRSKQGKTQRSQQELSPVSSF,1927,NP_002290.2.csv,refseq-LCT-NM_002299.3_clinical_seed_0_final,refseq-LCT-NM_002299.3.a2m,Invitae,refseq-LCT-NM_002299.3_theta_0.2.npy,1,1927,1927
+NP_002294.2,MICQKFCVVLLHWEFIYVITAFNLSYPITPWRFKLSCMPPNSTYDYFLLPAGLSKNTSNSNGHYETAVEPKFNSSGTHFSNLSKTTFHCCFRSEQDRNCSLCADNIEGKTFVSTVNSLVFQQIDANWNIQCWLKGDLKLFICYVESLFKNLFRNYNYKVHLLYVLPEVLEDSPLVPQKGSFQMVHCNCSVHECCECLVPVPTAKLNDTLLMCLKITSGGVIFQSPLMSVQPINMVKPDPPLGLHMEITDDGNLKISWSSPPLVPFPLQYQVKYSENSTTVIREADKIVSATSLLVDSILPGSSYEVQVRGKRLDGPGIWSDWSTPRVFTTQDVIYFPPKILTSVGSNVSFHCIYKKENKIVPSKEIVWWMNLAEKIPQSQYDVVSDHVSKVTFFNLNETKPRGKFTYDAVYCCNEHECHHRYAELYVIDVNINISCETDGYLTKMTCRWSTSTIQSLAESTLQLRYHRSSLYCSDIPSIHPISEPKDCYLQSDGFYECIFQPIFLLSGYTMWIRINHSLGSLDSPPTCVLPDSVVKPLPPSSVKAEITINIGLLKISWEKPVFPENNLQFQIRYGLSGKEVQWKMYEVYDAKSKSVSLPVPDLCAVYAVQVRCKRLDGLGYWSNWSNPAYTVVMDIKVPMRGPEFWRIINGDTMKKEKNVTLLWKPLMKNDSLCSVQRYVINHHTSCNGTWSEDVGNHTKFTFLWTEQAHTVTVLAINSIGASVANFNLTFSWPMSKVNIVQSLSAYPLNSSCVIVSWILSPSDYKLMYFIIEWKNLNEDGEIKWLRISSSVKKYYIHDHFIPIEKYQFSLYPIFMEGVGKPKIINSFTQDDIEKHQSDAGLYVIVPVIISSSILLLGTLLISHQRMKKLFWEDVPNPKNCSWAQGLNFQKPETFEHLFIKHTASVTCGPLLLEPETISEDISVDTSWKNKDEMMPTTVVSLLSTTDLEKGSVCISDQFNSVNFSEAEGTEVTYEDESQRQPFVKYATLISNSKPSETGEEQGLINSSVTKCFSSKNSPLKDSFSNSSWEIEAQAFFILSDQHPNIISPHLTFSEGLDELLKLEGNFPEENNDKKSIYYLGVTSIKKRESGVLLTDKSRVSCPFPAPCLFTDIRVLQDSCSHFVENNINLGTSSKKTFASYMPQFQTCSTQTHKIMENKMCDLTV,1165,NP_002294.2.csv,refseq-LEPR-NM_002303.5_clinical_seed_0_final,refseq-LEPR-NM_002303.5.a2m,Invitae,refseq-LEPR-NM_002303.5.npy,1,1165,1165
+NP_002307.2,MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS,395,NP_002307.2.csv,refseq-LMX1B-NM_002316.3_clinical_seed_0_final,refseq-LMX1B-NM_002316.3.a2m,Invitae,refseq-LMX1B-NM_002316.3.npy,1,395,395
+NP_002308.2,MRFAWTVLLLGPLQLCALVHCAPPAAGQQQPPREPPAAPGAWRQQIQWENNGQVFSLLSLGSQYQPQRRRDPGAAVPGAANASAQQPRTPILLIRDNRTAAARTRTAGSSGVTAGRPRPTARHWFQAGYSTSRAREAGASRAENQTAPGEVPALSNLRPPSRVDGMVGDDPYNPYKYSDDNPYYNYYDTYERPRPGGRYRPGYGTGYFQYGLPDLVADPYYIQASTYVQKMSMYNLRCAAEENCLASTAYRADVRDYDHRVLLRFPQRVKNQGTSDFLPSRPRYSWEWHSCHQHYHSMDEFSHYDLLDANTQRRVAEGHKASFCLEDTSCDYGYHRRFACTAHTQGLSPGCYDTYGADIDCQWIDITDVKPGNYILKVSVNPSYLVPESDYTNNVVRCDIRYTGHHAYASGCTISPY,417,NP_002308.2.csv,refseq-LOX-NM_002317.6_clinical_seed_0_final,refseq-LOX-NM_002317.6.a2m,Invitae,refseq-LOX-NM_002317.6.npy,1,417,417
+NP_002325.2,MRRQWGALLLGALLCAHGLASSPECACGRSHFTCAVSALGECTCIPAQWQCDGDNDCGDHSDEDGCILPTCSPLDFHCDNGKCIRRSWVCDGDNDCEDDSDEQDCPPRECEEDEFPCQNGYCIRSLWHCDGDNDCGDNSDEQCDMRKCSDKEFRCSDGSCIAEHWYCDGDTDCKDGSDEENCPSAVPAPPCNLEEFQCAYGRCILDIYHCDGDDDCGDWSDESDCSSHQPCRSGEFMCDSGLCINAGWRCDGDADCDDQSDERNCTTSMCTAEQFRCHSGRCVRLSWRCDGEDDCADNSDEENCENTGSPQCALDQFLCWNGRCIGQRKLCNGVNDCGDNSDESPQQNCRPRTGEENCNVNNGGCAQKCQMVRGAVQCTCHTGYRLTEDGHTCQDVNECAEEGYCSQGCTNSEGAFQCWCETGYELRPDRRSCKALGPEPVLLFANRIDIRQVLPHRSEYTLLLNNLENAIALDFHHRRELVFWSDVTLDRILRANLNGSNVEEVVSTGLESPGGLAVDWVHDKLYWTDSGTSRIEVANLDGAHRKVLLWQNLEKPRAIALHPMEGTIYWTDWGNTPRIEASSMDGSGRRIIADTHLFWPNGLTIDYAGRRMYWVDAKHHVIERANLDGSHRKAVISQGLPHPFAITVFEDSLYWTDWHTKSINSANKFTGKNQEIIRNKLHFPMDIHTLHPQRQPAGKNRCGDNNGGCTHLCLPSGQNYTCACPTGFRKISSHACAQSLDKFLLFARRMDIRRISFDTEDLSDDVIPLADVRSAVALDWDSRDDHVYWTDVSTDTISRAKWDGTGQEVVVDTSLESPAGLAIDWVTNKLYWTDAGTDRIEVANTDGSMRTVLIWENLDRPRDIVVEPMGGYMYWTDWGASPKIERAGMDASGRQVIISSNLTWPNGLAIDYGSQRLYWADAGMKTIEFAGLDGSKRKVLIGSQLPHPFGLTLYGERIYWTDWQTKSIQSADRLTGLDRETLQENLENLMDIHVFHRRRPPVSTPCAMENGGCSHLCLRSPNPSGFSCTCPTGINLLSDGKTCSPGMNSFLIFARRIDIRMVSLDIPYFADVVVPINITMKNTIAIGVDPQEGKVYWSDSTLHRISRANLDGSQHEDIITTGLQTTDGLAVDAIGRKVYWTDTGTNRIEVGNLDGSMRKVLVWQNLDSPRAIVLYHEMGFMYWTDWGENAKLERSGMDGSDRAVLINNNLGWPNGLTVDKASSQLLWADAHTERIEAADLNGANRHTLVSPVQHPYGLTLLDSYIYWTDWQTRSIHRADKGTGSNVILVRSNLPGLMDMQAVDRAQPLGFNKCGSRNGGCSHLCLPRPSGFSCACPTGIQLKGDGKTCDPSPETYLLFSSRGSIRRISLDTSDHTDVHVPVPELNNVISLDYDSVDGKVYYTDVFLDVIRRADLNGSNMETVIGRGLKTTDGLAVDWVARNLYWTDTGRNTIEASRLDGSCRKVLINNSLDEPRAIAVFPRKGYLFWTDWGHIAKIERANLDGSERKVLINTDLGWPNGLTLDYDTRRIYWVDAHLDRIESADLNGKLRQVLVSHVSHPFALTQQDRWIYWTDWQTKSIQRVDKYSGRNKETVLANVEGLMDIIVVSPQRQTGTNACGVNNGGCTHLCFARASDFVCACPDEPDSRPCSLVPGLVPPAPRATGMSEKSPVLPNTPPTTLYSSTTRTRTSLEEVEGRCSERDARLGLCARSNDAVPAAPGEGLHISYAIGGLLSILLILVVIAALMLYRHKKSKFTDPGMGNLTYSNPSYRTSTQEVKIEAIPKPAMYNQLCYKKEGGPDHNYTKEKIKIVEGICLLSGDDAEWDDLKQLRSSRGGLLRDHVCMKTDTVSIQASSGSLDDTETEQLLQEEQSECSSVHTAATPERRGSLPDTGWKHERKLSSESQV,1905,NP_002325.2.csv,refseq-LRP4-NM_002334.4_clinical_seed_0_final,refseq-LRP4-NM_002334.4.a2m,Invitae,refseq-LRP4-NM_002334.4_theta_0.2.npy,1,1905,1905
+NP_002326.2,MEAAPPGPPWPLLLLLLLLLALCGCPAPAAASPLLLFANRRDVRLVDAGGVKLESTIVVSGLEDAAAVDFQFSKGAVYWTDVSEEAIKQTYLNQTGAAVQNVVISGLVSPDGLACDWVGKKLYWTDSETNRIEVANLNGTSRKVLFWQDLDQPRAIALDPAHGYMYWTDWGETPRIERAGMDGSTRKIIVDSDIYWPNGLTIDLEEQKLYWADAKLSFIHRANLDGSFRQKVVEGSLTHPFALTLSGDTLYWTDWQTRSIHACNKRTGGKRKEILSALYSPMDIQVLSQERQPFFHTRCEEDNGGCSHLCLLSPSEPFYTCACPTGVQLQDNGRTCKAGAEEVLLLARRTDLRRISLDTPDFTDIVLQVDDIRHAIAIDYDPLEGYVYWTDDEVRAIRRAYLDGSGAQTLVNTEINDPDGIAVDWVARNLYWTDTGTDRIEVTRLNGTSRKILVSEDLDEPRAIALHPVMGLMYWTDWGENPKIECANLDGQERRVLVNASLGWPNGLALDLQEGKLYWGDAKTDKIEVINVDGTKRRTLLEDKLPHIFGFTLLGDFIYWTDWQRRSIERVHKVKASRDVIIDQLPDLMGLKAVNVAKVVGTNPCADRNGGCSHLCFFTPHATRCGCPIGLELLSDMKTCIVPEAFLVFTSRAAIHRISLETNNNDVAIPLTGVKEASALDFDVSNNHIYWTDVSLKTISRAFMNGSSVEHVVEFGLDYPEGMAVDWMGKNLYWADTGTNRIEVARLDGQFRQVLVWRDLDNPRSLALDPTKGYIYWTEWGGKPRIVRAFMDGTNCMTLVDKVGRANDLTIDYADQRLYWTDLDTNMIESSNMLGQERVVIADDLPHPFGLTQYSDYIYWTDWNLHSIERADKTSGRNRTLIQGHLDFVMDILVFHSSRQDGLNDCMHNNGQCGQLCLAIPGGHRCGCASHYTLDPSSRNCSPPTTFLLFSQKSAISRMIPDDQHSPDLILPLHGLRNVKAIDYDPLDKFIYWVDGRQNIKRAKDDGTQPFVLTSLSQGQNPDRQPHDLSIDIYSRTLFWTCEATNTINVHRLSGEAMGVVLRGDRDKPRAIVVNAERGYLYFTNMQDRAAKIERAALDGTEREVLFTTGLIRPVALVVDNTLGKLFWVDADLKRIESCDLSGANRLTLEDANIVQPLGLTILGKHLYWIDRQQQMIERVEKTTGDKRTRIQGRVAHLTGIHAVEEVSLEEFSAHPCARDNGGCSHICIAKGDGTPRCSCPVHLVLLQNLLTCGEPPTCSPDQFACATGEIDCIPGAWRCDGFPECDDQSDEEGCPVCSAAQFPCARGQCVDLRLRCDGEADCQDRSDEADCDAICLPNQFRCASGQCVLIKQQCDSFPDCIDGSDELMCEITKPPSDDSPAHSSAIGPVIGIILSLFVMGGVYFVCQRVVCQRYAGANGPFPHEYVSGTPHVPLNFIAPGGSQHGPFTGIACGKSMMSSVSLMGGRGGVPLYDRNHVTGASSSSSSSTKATLYPPILNPPPSPATDPSLYNMDMFYSSNIPATARPYRPYIIRGMAPPTTPCSTDVCDSDYSASRWKASKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCPPSPATERSYFHLFPPPPSPCTDSS,1615,NP_002326.2.csv,refseq-LRP5-NM_002335.3_clinical_seed_0_final,refseq-LRP5-NM_002335.3.a2m,Invitae,refseq-LRP5-NM_002335.3.npy,1,1615,1615
+NP_002327.2,MGAVLRSLLACSFCVLLRAAPLLLYANRRDLRLVDATNGKENATIVVGGLEDAAAVDFVFSHGLIYWSDVSEEAIKRTEFNKTESVQNVVVSGLLSPDGLACDWLGEKLYWTDSETNRIEVSNLDGSLRKVLFWQELDQPRAIALDPSSGFMYWTDWGEVPKIERAGMDGSSRFIIINSEIYWPNGLTLDYEEQKLYWADAKLNFIHKSNLDGTNRQAVVKGSLPHPFALTLFEDILYWTDWSTHSILACNKYTGEGLREIHSDIFSPMDIHAFSQQRQPNATNPCGIDNGGCSHLCLMSPVKPFYQCACPTGVKLLENGKTCKDGATELLLLARRTDLRRISLDTPDFTDIVLQLEDIRHAIAIDYDPVEGYIYWTDDEVRAIRRSFIDGSGSQFVVTAQIAHPDGIAVDWVARNLYWTDTGTDRIEVTRLNGTMRKILISEDLEEPRAIVLDPMVGYMYWTDWGEIPKIERAALDGSDRVVLVNTSLGWPNGLALDYDEGKIYWGDAKTDKIEVMNTDGTGRRVLVEDKIPHIFGFTLLGDYVYWTDWQRRSIERVHKRSAEREVIIDQLPDLMGLKATNVHRVIGSNPCAEENGGCSHLCLYRPQGLRCACPIGFELISDMKTCIVPEAFLLFSRRADIRRISLETNNNNVAIPLTGVKEASALDFDVTDNRIYWTDISLKTISRAFMNGSALEHVVEFGLDYPEGMAVDWLGKNLYWADTGTNRIEVSKLDGQHRQVLVWKDLDSPRALALDPAEGFMYWTEWGGKPKIDRAAMDGSERTTLVPNVGRANGLTIDYAKRRLYWTDLDTNLIESSNMLGLNREVIADDLPHPFGLTQYQDYIYWTDWSRRSIERANKTSGQNRTIIQGHLDYVMDILVFHSSRQSGWNECASSNGHCSHLCLAVPVGGFVCGCPAHYSLNADNRTCSAPTTFLLFSQKSAINRMVIDEQQSPDIILPIHSLRNVRAIDYDPLDKQLYWIDSRQNMIRKAQEDGSQGFTVVVSSVPSQNLEIQPYDLSIDIYSRYIYWTCEATNVINVTRLDGRSVGVVLKGEQDRPRAVVVNPEKGYMYFTNLQERSPKIERAALDGTEREVLFFSGLSKPIALALDSRLGKLFWADSDLRRIESSDLSGANRIVLEDSNILQPVGLTVFENWLYWIDKQQQMIEKIDMTGREGRTKVQARIAQLSDIHAVKELNLQEYRQHPCAQDNGGCSHICLVKGDGTTRCSCPMHLVLLQDELSCGEPPTCSPQQFTCFTGEIDCIPVAWRCDGFTECEDHSDELNCPVCSESQFQCASGQCIDGALRCNGDANCQDKSDEKNCEVLCLIDQFRCANGQCIGKHKKCDHNVDCSDKSDELDCYPTEEPAPQATNTVGSVIGVIVTIFVSGTVYFICQRMLCPRMKGDGETMTNDYVVHGPASVPLGYVPHPSSLSGSLPGMSRGKSMISSLSIMGGSSGPPYDRAHVTGASSSSSSSTKGTYFPAILNPPPSPATERSHYTMEFGYSSNSPSTHRSYSYRPYSYRHFAPPTTPCSTDVCDSDYAPSRRMTSVATAKGYTSDLNYDSEPVPPPPTPRSQYLSAEENYESCPPSPYTERSYSHHLYPPPPSPCTDSS,1613,NP_002327.2.csv,refseq-LRP6-NM_002336.2_clinical_seed_0_final,refseq-LRP6-NM_002336.2.a2m,Invitae,refseq-LRP6-NM_002336.2.npy,1,1613,1613
+NP_002372.1,MPRPAPARRLPGLLLLLWPLLLLPSAAPDPVARPGFRRLETRGPGGSPGRRPSPAAPDGAPASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSRIIDTLDIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVEAGAREPSSNIPKVAIIVTDGRPQDQVNEVAARAQASGIELYAVGVDRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQETFCALDPCVLGTHQCQHVCISDGEGKHHCECSQGYTLNADKKTCSALDRCALNTHGCEHICVNDRSGSYHCECYEGYTLNEDRKTCSAQDKCALGTHGCQHICVNDRTGSHHCECYEGYTLNADKKTCSVRDKCALGSHGCQHICVSDGAASYHCDCYPGYTLNEDKKTCSATEEARRLVSTEDACGCEATLAFQDKVSSYLQRLNTKLDDILEKLKINEYGQIHR,486,NP_002372.1.csv,refseq-MATN3-NM_002381.4_clinical_seed_0_final,refseq-MATN3-NM_002381.4.a2m,Invitae,refseq-MATN3-NM_002381.4.npy,1,486,486
+NP_002380.3,MEPPGRRECPFPSWRFPGLLLAAMVLLLYSFSDACEEPPTFEAMELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRNHTWLPVSDDACYRETCPYIRDPLNGQAVPANGTYEFGYQMHFICNEGYYLIGEEILYCELKGSVAIWSGKPPICEKVLCTPPPKIKNGKHTFSEVEVFEYLDAVTYSCDPAPGPDPFSLIGESTIYCGDNSVWSRAAPECKVVKCRFPVVENGKQISGFGKKFYYKATVMFECDKGFYLDGSDTIVCDSNSTWDPPVPKCLKVLPPSSTKPPALSHSVSTSSTTKSPASSASGPRPTYKPPVSNYPGYPKPEEGILDSLDVWVIAVIVIAIVVGVAVICVVPYRYLQRRKKKGTYLTDETHREVKFTSL,392,NP_002380.3.csv,refseq-CD46-NM_002389.4_clinical_seed_0_final,refseq-CD46-NM_002389.4.a2m,Invitae,refseq-CD46-NM_002389.4.npy,1,392,392
+NP_002388.2,MGRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIALIIFNSTNKLFQYASTDMDKVLLKYTEYNEPHESRTNSDIVETLRKKGLNGCDSPDPDADDSVGHSPESEDKYRKINEDIDLMISRQRLCAVPPPNFEMPVSIPVSSHNSLVYSNPVSSLGNPNLLPLAHPSLQRNSMSPGVTHRPPSAGNTGGLMGGDLTSGAGTSAGNGYGNPRNSPGLLVSPGNLNKNMQAKSPPPMNLGMNNRKPDLRVLIPPGSKNTMPSVSEDVDLLLNQRINNSQSAQSLATPVVSVATPTLPGQGMGGYPSAISTTYGTEYSLSSADLSSLSGFNTASALHLGSVTGWQQQHLHNMPPSALSQLGACTSTHLSQSSNLSLPSTQSLNIKSEPVSPPRDRTTTPSRYPQHTRHEAGRSPVDSLSSCSSSYDGSDREDHRNEFHSPIGLTRPSPDERESPSVKRMRLSEGWAT,473,NP_002388.2.csv,refseq-MEF2C-NM_002397.4_clinical_seed_0_final,refseq-MEF2C-NM_002397.4.a2m,Invitae,refseq-MEF2C-NM_002397.4.npy,1,473,473
+NP_002392.2,MDEQEALNSIMNDLVALQMNRRHRMPGYETMKNKDTGHSNRQSDVRIKFEHNGERRIIAFSRPVKYEDVEHKVTTVFGQPLDLHYMNNELSILLKNQDDLDKAIDILDRSSSMKSLRILLLSQDRNHNSSSPHSGVSRQVRIKASQSAGDINTIYQPPEPRSRHLSVSSQNPGRSSPPPGYVPERQQHIARQGSYTSINSEGEFIPETSEQCMLDPLSSAENSLSGSCQSLDRSADSPSFRKSRMSRAQSFPDNRQEYSDRETQLYDKGVKGGTYPRRYHVSVHHKDYSDGRRTFPRIRRHQGNLFTLVPSSRSLSTNGENMGLAVQYLDPRGRLRSADSENALSVQERNVPTKSPSAPINWRRGKLLGQGAFGRVYLCYDVDTGRELASKQVQFDPDSPETSKEVSALECEIQLLKNLQHERIVQYYGCLRDRAEKTLTIFMEYMPGGSVKDQLKAYGALTESVTRKYTRQILEGMSYLHSNMIVHRDIKGANILRDSAGNVKLGDFGASKRLQTICMSGTGMRSVTGTPYWMSPEVISGEGYGRKADVWSLGCTVVEMLTEKPPWAEYEAMAAIFKIATQPTNPQLPSHISEHGRDFLRRIFVEARQRPSAEELLTHHFAQLMY,626,NP_002392.2.csv,refseq-MAP3K3-NM_002401.3_clinical_seed_0_final,refseq-MAP3K3-NM_002401.3.a2m,Invitae,refseq-MAP3K3-NM_002401.3.npy,1,626,626
+NP_002393.2,MVRRDRLRRMREWWVQVGLLAVPLLAAYLHIPPPQLSPALHSWKSSGKFFTYKGLRIFYQDSVGVVGSPEIVVLLHGFPTSSYDWYKIWEGLTLRFHRVIALDFLGFGFSDKPRPHHYSIFEQASIVEALLRHLGLQNRRINLLSHDYGDIVAQELLYRYKQNRSGRLTIKSLCLSNGGIFPETHRPLLLQKLLKDGGVLSPILTRLMNFFVFSRGLTPVFGPYTRPSESELWDMWAGIRNNDGNLVIDSLLQYINQRKKFRRRWVGALASVTIPIHFIYGPLDPVNPYPEFLELYRKTLPRSTVSILDDHISHYPQLEDPMGFLNAYMGFINSF,335,NP_002393.2.csv,refseq-MEST-NM_002402.3_clinical_seed_0_final,refseq-MEST-NM_002402.3.a2m,Invitae,refseq-MEST-NM_002402.3.npy,1,335,335
+NP_002411.3,MKDSNRCCCGQFTNQHIPPLPSATPSKNEEESKQVETQPEKWSVAKHTQSYPTDSYGVLEFQGGGYSNKAMYIRVSYDTKPDSLLHLMVKDWQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTGGVSTGVISHVGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDVTRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEVKLRRLLEKHISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPPIPVVICDGSGRASDILSFAHKYCEEGGIINESLREQLLVTIQKTFNYNKAQSHQLFAIIMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLALAWNRVDIARSQIFVFGPHWPPLGSLAPPTDSKATEKEKKPPMATTKGGRGKGKGKKKGKVKEEVEEETDPRKIELLNWVNALEQAMLDALVLDRVDFVKLLIENGVNMQHFLTIPRLEELYNTRLGPPNTLHLLVRDVKKSNLPPDYHISLIDIGLVLEYLMGGAYRCNYTRKNFRTLYNNLFGPKRPKALKLLGMEDDEPPAKGKKKKKKKKEEEIDIDVDDPAVSRFQYPFHELMVWAVLMKRQKMAVFLWQRGEESMAKALVACKLYKAMAHESSESDLVDDISQDLDNNSKDFGQLALELLDQSYKHDEQIAMKLLTYELKNWSNSTCLKLAVAAKHRDFIAHTCSQMLLTDMWMGRLRMRKNPGLKVIMGILLPPTILFLEFRTYDDFSYQTSKENEDGKEKEEENTDANADAGSRKGDEENEHKKQRSIPIGTKICEFYNAPIVKFWFYTISYLGYLLLFNYVILVRMDGWPSLQEWIVISYIVSLALEKIREILMSEPGKLSQKIKVWLQEYWNITDLVAISTFMIGAILRLQNQPYMGYGRVIYCVDIIFWYIRVLDIFGVNKYLGPYVMMIGKMMIDMLYFVVIMLVVLMSFGVARQAILHPEEKPSWKLARNIFYMPYWMIYGEVFADQIDLYAMEINPPCGENLYDEEGKRLPPCIPGAWLTPALMACYLLVANILLVNLLIAVFNNTFFEVKSISNQVWKFQRYQLIMTFHDRPVLPPPMIILSHIYIIIMRLSGRCRKKREGDQEERDRGLKLFLSDEELKRLHEFEEQCVQEHFREKEDEQQSSSDERIRVTSERVENMSMRLEEINERETFMKTSLQTVDLRLAQLEELSNRMVNALENLAGIDRSDLIQARSRASSECEATYLLRQSSINSADGYSLYRYHFNGEELLFEDTSLSTSPGTGVRKKTCSFRIKEEKDVKTHLVPECQNSLHLSLGTSTSATPDGSHLAVDDLKNAEESKLGPDIGISKEDDERQTDSKKEETISPSLNKTDVIHGQDKSDVQNTQLTVETTNIEGTISYPLEETKITRYFPDETINACKTMKSRSFVYSRGRKLVGGVNQDVEYSSITDQQLTTEWQCQVQKITRSHSTDIPYIVSEAAVQAEHKEQFADMQDEHHVAEAIPRIPRLSLTITDRNGMENLLSVKPDQTLGFPSLRSKSLHGHPRNVKSIQGKLDRSGHASSVSSLVIVSGMTAEEKKVKKEKASTETEC,1603,NP_002411.3.csv,refseq-TRPM1-NM_002420.5_clinical_seed_0_final,refseq-TRPM1-NM_002420.5.a2m,Invitae,refseq-TRPM1-NM_002420.5.npy,1,1603,1603
+NP_002418.1,MHPGVLAAFLFLSWTHCRALPLPSGGDEDDLSEEDLQFAERYLRSYYHPTNLAGILKENAASSMTERLREMQSFFGLEVTGKLDDNTLDVMKKPRCGVPDVGEYNVFPRTLKWSKMNLTYRIVNYTPDMTHSEVEKAFKKAFKVWSDVTPLNFTRLHDGIADIMISFGIKEHGDFYPFDGPSGLLAHAFPPGPNYGGDAHFDDDETWTSSSKGYNLFLVAAHEFGHSLGLDHSKDPGALMFPIYTYTGKSHFMLPDDDVQGIQSLYGPGDEDPNPKHPKTPDKCDPSLSLDAITSLRGETMIFKDRFFWRLHPQQVDAELFLTKSFWPELPNRIDAAYEHPSHDLIFIFRGRKFWALNGYDILEGYPKKISELGLPKEVKKISAAVHFEDTGKTLLFSGNQVWRYDDTNHIMDKDYPRLIEEDFPGIGDKVDAVYEKNGYIYFFNGPIQFEYSIWSNRIVRVMPANSILWC,471,NP_002418.1.csv,refseq-MMP13-NM_002427.3_clinical_seed_0_final,refseq-MMP13-NM_002427.3.a2m,Invitae,refseq-MMP13-NM_002427.3.npy,1,471,471
+NP_002421.3,MFGLDQFEPQVNSRNAGQGERNFNETGLSMNTHFKAPAFHTGGPPGPVDPAMSALGEPPILGMNMEPYGFHARGHSELHAGGLQAQPVHGFFGGQQPHHGHPGSHHPHQHHPHFGGNFGGPDPGASCLHGGRLLGYGGAAGGLGSQPPFAEGYEHMAESQGPESFGPQRPGNLPDFHSSGASSHAVPAPCLPLDQSPNRAASFHGLPSSSGSDSHSLEPRRVTNQGAVDSLEYNYPGEAPSGHFDMFSPSDSEGQLPHYAAGRQVPGGAFPGASAMPRAAGMVGLSKMHAQPPQQQPQQQQQPQQQQQQHGVFFERFSGARKMPVGLEPSVGSRHPLMQPPQQAPPPPQQQPPQQPPQQQPPPPPGLLVRQNSCPPALPRPQQGEAGTPSGGLQDGGPMLPSQHAQFEYPIHRLENRSMHPYSEPVFSMQHPPPQQAPNQRLQHFDAPPYMNVAKRPRFDFPGSAGVDRCASWNGSMHNGALDNHLSPSAYPGLPGEFTPPVPDSFPSGPPLQHPAPDHQSLQQQQQQQQQQQQQQQQQQQQQQQQQQQQRQNAALMIKQMASRNQQQRLRQPNLAQLGHPGDVGQGGLVHGGPVGGLAQPNFEREGGSTGAGRLGTFEQQAPHLAQESAWFSGPHPPPGDLLPRRMGGSGLPADCGPHDPSLAPPPPPGGSGVLFRGPLQEPMRMPGEGHVPALPSPGLQFGGSLGGLGQLQSPGAGVGLPSAASERRPPPPDFATSALGGQPGFPFGAAGRQSTPHSGPGVNSPPSAGGGGGSSGGGGGGGAYPPQPDFQPSQRTSASKLGALSLGSFNKPSSKDNLFGQSCLAALSTACQNMIASLGAPNLNVTFNKKNPPEGKRKLSQNETDGAAVAGNPGSDYFPGGTAPGAPGPGGPSGTSSSGSKASGPPNPPAQGDGTSLSPNYTLESTSGNDGKPVSGGGGRGRGRRKRDSGHVSPGTFFDKYSAAPDSGGAPGVSPGQQQASGAAVGGSSAGETRGAPTPHEKALTSPSWGKGAELLLGDQPDLIGSLDGGAKSDSSSPNVGEFASDEVSTSYANEDEVSSSSDNPQALVKASRSPLVTGSPKLPPRGVGAGEHGPKAPPPALGLGIMSNSTSTPDSYGGGGGPGHPGTPGLEQVRTPTSSSGAPPPDEIHPLEILQAQIQLQRQQFSISEDQPLGLKGGKKGECAVGASGAQNGDSELGSCCSEAVKSAMSTIDLDSLMAEHSAAWYMPADKALVDSADDDKTLAPWEKAKPQNPNSKEAHDLPANKASASQPGSHLQCLSVHCTDDVGDAKARASVPTWRSLHSDISNRFGTFVAALT,1320,NP_002421.3.csv,refseq-MN1-NM_002430.2_clinical_seed_0_final,refseq-MN1-NM_002430.2.a2m,Invitae,refseq-MN1-NM_002430.2.npy,1,1320,1320
+NP_002426.1,MAAPRVFPLSCAVQQYAWGKMGSNSEVARLLASSDPLAQIAEDKPYAELWMGTHPRGDAKILDNRISQKTLSQWIAENQDSLGSKVKDTFNGNLPFLFKVLSVETPLSIQAHPNKELAEKLHLQAPQHYPDANHKPEMAIALTPFQGLCGFRPVEEIVTFLKKVPEFQFLIGDEAATHLKQTMSHDSQAVASSLQSCFSHLMKSEKKVVVEQLNLLVKRISQQAAAGNNMEDIFGELLLQLHQQYPGDIGCFAIYFLNLLTLKPGEAMFLEANVPHAYLKGDCVECMACSDNTVRAGLTPKFIDVPTLCEMLSYTPSSSKDRLFLPTRSQEDPYLSIYDPPVPDFTIMKTEVPGSVTEYKVLALDSASILLMVQGTVIASTPTTQTPIPLQRGGVLFIGANESVSLKLTEPKDLLIFRACCLL,423,NP_002426.1.csv,refseq-MPI-NM_002435.2_clinical_seed_0_final,refseq-MPI-NM_002435.2.a2m,Invitae,refseq-MPI-NM_002435.2.npy,1,423,423
+NP_002428.1,MALWRAYQRALAAHPWKVQVLTAGSLMGLGDIISQQLVERRGLQEHQRGRTLTMVSLGCGFVGPVVGGWYKVLDRFIPGTTKVDALKKMLLDQGGFAPCFLGCFLPLVGALNGLSAQDNWAKLQRDYPDALITNYYLWPAVQLANFYLVPLHYRLAVVQCVAVIWNSYLSWKAHRL,176,NP_002428.1.csv,refseq-MPV17-NM_002437.4_clinical_seed_0_final,refseq-MPV17-NM_002437.4.a2m,Invitae,refseq-MPV17-NM_002437.4.npy,1,176,176
+NP_002430.3,MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAADQVDPGAAAAAAAAAAAAPPAPPAPAFPPQLPPHIATEIDRRKKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDFDDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASIPTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKCSTQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVALITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKELEGLINTKRKRLKYFAKLWTMHNAQDLQKWTEEFNMEETQTSLLH,1137,NP_002430.3.csv,refseq-MSH3-NM_002439.4_clinical_seed_0_final,refseq-MSH3-NM_002439.4.a2m,Invitae,refseq-MSH3-NM_002439.4.npy,1,1137,1137
+NP_002431.2,MLRPEISSTSPSAPAVSPSSGETRSPQGPRYNFGLQETPQSRPSVQVVSASTCPGTSGAAGDRSSSSSSLPCPAPNSRPAQGSYFGNKRAYAENTVASNFTFGASSSSARDTNYPQTLKTPLSTGNPQRSGYKSWTPQVGYSASSSSAISAHSPSVIVAVVEGRGLARGEIGMASIDLKNPQIILSQFADNTTYAKVITKLKILSPLEIIMSNTACAVGNSTKLFTLITENFKNVNFTTIQRKYFNETKGLEYIEQLCIAEFSTVLMEVQSKYYCLAAVAALLKYVEFIQNSVYAPKSLKICFQGSEQTAMIDSSSAQNLELLINNQDYRNNHTLFGVLNYTKTPGGSRRLRSNILEPLVDIETINMRLDCVQELLQDEELFFGLQSVISRFLDTEQLLSVLVQIPKQDTVNAAESKITNLIYLKHTLELVDPLKIAMKNCNTPLLRAYYGSLEDKRFGIILEKIKTVINDDARYMKGCLNMRTQKCYAVRSNINEFLDIARRTYTEIVDDIAGMISQLGEKYSLPLRTSFSSARGFFIQMTTDCIALPSDQLPSEFIKISKVKNSYSFTSADLIKMNERCQESLREIYHMTYMIVCKLLSEIYEHIHCLYKLSDTVSMLDMLLSFAHACTLSDYVRPEFTDTLAIKQGWHPILEKISAEKPIANNTYVTEGSNFLIITGPNMSGKSTYLKQIALCQIMAQIGSYVPAEYSSFRIAKQIFTRISTDDDIETNSSTFMKEMKEIAYILHNANDKSLILIDELGRGTNTEEGIGICYAVCEYLLSLKAFTLFATHFLELCHIDALYPNVENMHFEVQHVKNTSRNKEAILYTYKLSKGLTEEKNYGLKAAEVSSLPPSIVLDAKEITTQITRQILQNQRSTPEMERQRAVYHLATRLVQTARNSQLDPDSLRIYLSNLKKKYKEDFPRTEQVPEKTEE,936,NP_002431.2.csv,refseq-MSH4-NM_002440.3_clinical_seed_0_final,refseq-MSH4-NM_002440.3.a2m,Invitae,refseq-MSH4-NM_002440.3.npy,1,936,936
+NP_002435.1,MPKTISVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWFFGLQYQDTKGFSTWLKLNKKVTAQDVRKESPLLFKFRAKFYPEDVSEELIQDITQRLFFLQVKEGILNDDIYCPPETAVLLASYAVQSKYGDFNKEVHKSGYLAGDKLLPQRVLEQHKLNKDQWEERIQVWHEEHRGMLREDAVLEYLKIAQDLEMYGVNYFSIKNKKGSELWLGVDALGLNIYEQNDRLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRILALCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQMERAMLENEKKKREMAEKEKEKIEREKEELMERLKQIEEQTKKAQQELEEQTRRALELEQERKRAQSEAEKLAKERQEAEEAKEALLQASRDQKKTQEQLALEMAELTARISQLEMARQKKESEAVEWQQKAQMVQEDLEKTRAELKTAMSTPHVAEPAENEQDEQDENGAEASADLRADAMAKDRSEEERTTEAEKNERVQKHLKALTSELANARDESKKTANDMIHAENMRLGRDKYKTLRQIRQGNTKQRIDEFESM,577,NP_002435.1.csv,refseq-MSN-NM_002444.2_clinical_seed_0_final,refseq-MSN-NM_002444.2.a2m,Invitae,refseq-MSN-NM_002444.2.npy,1,577,577
+NP_002440.2,MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSSLPFSVEALMSDKKPPKEASPLPAESASAGATLRPLLLSGHGAREAHSPGPLVKPFETASVKSENSEDGAAWMQEPGRYSPPPRHMSPTTCTLRKHKTNRKPRTPFTTSQLLALERKFRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKPMLPSSFSLPFPISSPLQAASIYGASYPFHRPVLPIPPVGLYATPVGYGMYHLS,267,NP_002440.2.csv,refseq-MSX2-NM_002449.4_clinical_seed_0_final,refseq-MSX2-NM_002449.4.a2m,Invitae,refseq-MSX2-NM_002449.4.npy,1,267,267
+NP_002445.2,MRRFLLLYATQQGQAKAIAEEICEQAVVHGFSADLHCISESDKYDLKTETAPLVVVVSTTGTGDPPDTARKFVKEIQNQTLPVDFFAHLRYGLLGLGDSEYTYFCNGGKIIDKRLQELGARHFYDTGHADDCVGLELVVEPWIAGLWPALRKHFRSSRGQEEISGALPVASPASSRTDLVKSELLHIESQVELLRFDDSGRKDSEVLKQNAVNSNQSNVVIEDFESSLTRSVPPLSQASLNIPGLPPEYLQVHLQESLGQEESQVSVTSADPVFQVPISKAVQLTTNDAIKTTLLVELDISNTDFSYQPGDAFSVICPNSDSEVQSLLQRLQLEDKREHCVLLKIKADTKKKGATLPQHIPAGCSLQFIFTWCLEIRAIPKKAFLRALVDYTSDSAEKRRLQELCSKQGAADYSRFVRDACACLLDLLLAFPSCQPPLSLLLEHLPKLQPRPYSCASSSLFHPGKLHFVFNIVEFLSTATTEVLRKGVCTGWLALLVASVLQPNIHASHEDSGKALAPKISISPRTTNSFHLPDDPSIPIIMVGPGTGIAPFIGFLQHREKLQEQHPDGNFGAMWLFFGCRHKDRDYLFRKELRHFLKHGILTHLKVSFSRDAPVGEEEAPAKYVQDNIQLHGQQVARILLQENGHIYVCGDAKNMAKDVHDALVQIISKEVGVEKLEAMKTLATLKEEKRYLQDIWS,698,NP_002445.2.csv,refseq-MTRR-NM_002454.2_clinical_seed_0_final,refseq-MTRR-NM_002454.2.a2m,Invitae,refseq-MTRR-NM_002454.2.npy,1,698,698
+NP_002456.2,MPEPTKKEENEVPAPAPPPEEPSKEKEAGTTPAKDEEEVSPPSALPPGLGSRALERKDSDWTLVETPPGEEQAKQNANSQLSILFIEKPQGGTVKVGEDITFIAKVKAEDLLRKPTIKWFKGKWMDLASKAGKHLQLKETFERHSRVYTFEMQIIKAKDNFAGNYRCEVTYKDKFDSCSFDLEVHESTGTTPNIDIRSAFKRSGEGQEDAGELDFSGLLKRREVKQQEEEPQVDVWELLKNAKPSEYEKIAFQYGITDLRGMLKRLKRMRREEKKSAAFAKILDPAYQVDKGGRVRFVVELADPKLEVKWYKNGQEIRPSTKYIFEHKGCQRILFINNCQMTDDSEYYVTAGDEKCSTELFVREPPIMVTKQLEDTTAYCGERVELECEVSEDDANVKWFKNGEEIIPGPKSRYRIRVEGKKHILIIEGATKADAAEYSVMTTGGQSSAKLSVDLKPLKILTPLTDQTVNLGKEICLKCEISENIPGKWTKNGLPVQESDRLKVVHKGRIHKLVIANALTEDEGDYVFAPDAYNVTLPAKVHVIDPPKIILDGLDADNTVTVIAGNKLRLEIPISGEPPPKAMWSRGDKAIMEGSGRIRTESYPDSSTLVIDIAERDDSGVYHINLKNEAGEAHASIKVKVVDFPDPPVAPTVTEVGDDWCIMNWEPPAYDGGSPILGYFIERKKKQSSRWMRLNFDLCKETTFEPKKMIEGVAYEVRIFAVNAIGISKPSMPSRPFVPLAVTSPPTLLTVDSVTDTTVTMRWRPPDHIGAAGLDGYVLEYCFEGTEDWIVANKDLIDKTKFTITGLPTDAKIFVRVKAVNAAGASEPKYYSQPILVKEIIEPPKIRIPRHLKQTYIRRVGEAVNLVIPFQGKPRPELTWKKDGAEIDKNQINIRNSETDTIIFIRKAERSHSGKYDLQVKVDKFVETASIDIQIIDRPGPPQIVKIEDVWGENVALTWTPPKDDGNAAITGYTIQKADKKSMEWFTVIEHYHRTSATITELVIGNEYYFRVFSENMCGLSEDATMTKESAVIARDGKIYKNPVYEDFDFSEAPMFTQPLVNTYAIAGYNATLNCSVRGNPKPKITWMKNKVAIVDDPRYRMFSNQGVCTLEIRKPSPYDGGTYCCKAVNDLGTVEIECKLEVKVIYQGVNTPGQPVFLEGQQQSLHNKDF,1171,NP_002456.2.csv,refseq-MYBPC1-NM_002465.3_clinical_seed_0_final,refseq-MYBPC1-NM_002465.3.a2m,Invitae,refseq-MYBPC1-NM_002465.3_theta_0.2.npy,1,1171,1171
+NP_002460.1,MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEEHVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRCLSSIVDSISSEERKLPCVEEVVEK,242,NP_002460.1.csv,refseq-MYF6-NM_002469.2_clinical_seed_0_final,refseq-MYF6-NM_002469.2.a2m,Invitae,refseq-MYF6-NM_002469.2.npy,1,242,242
+NP_002461.2,MSSDTEMEVFGIAAPFLRKSEKERIEAQNQPFDAKTYCFVVDSKEEYAKGKIKSSQDGKVTVETEDNRTLVVKPEDVYAMNPPKFDRIEDMAMLTHLNEPAVLYNLKDRYTSWMIYTYSGLFCVTVNPYKWLPVYNPEVVEGYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAATGDLAKKKDSKMKGTLEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVTFQLKAERSYHIFYQILSNKKPELIELLLITTNPYDYPFISQGEILVASIDDAEELLATDSAIDILGFTPEEKSGLYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKTAYLMGLNSSDLLKALCFPRVKVGNEYVTKGQTVDQVHHAVNALSKSVYEKLFLWMVTRINQQLDTKLPRQHFIGVLDIAGFEIFEYNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLAACIELIEKPMGIFSILEEECMFPKATDTSFKNKLYDQHLGKSNNFQKPKVVKGRAEAHFSLIHYAGTVDYSVSGWLEKNKDPLNETVVGLYQKSSNRLLAHLYATFATADADSGKKKVAKKKGSSFQTVSALFRENLNKLMSNLRTTHPHFVRCIIPNETKTPGAMEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFKQRYRVLNASAIPEGQFIDSKKACEKLLASIDIDHTQYKFGHTKVFFKAGLLGTLEEMRDDRLAKLITRTQAVCRGFLMRVEFQKMVQRRESIFCIQYNIRSFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMKEEFQKTKDELAKSEAKRKELEEKLVTLVQEKNDLQLQVQAESENLLDAEERCDQLIKAKFQLEAKIKEVTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNLTEELSGLDETIAKLTREKKALQEAHQQALDDLQAEEDKVNSLNKTKSKLEQQVEDLESSLEQEKKLRVDLERNKRKLEGDLKLAQESILDLENDKQQLDERLKKKDFEYCQLQSKVEDEQTLGLQFQKKIKELQARIEELEEEIEAERATRAKTEKQRSDYARELEELSERLEEAGGVTSTQIELNKKREAEFLKLRRDLEEATLQHEAMVAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSEFKLEIDDLSSSMESVSKSKANLEKICRTLEDQLSEARGKNEEIQRSLSELTTQKSRLQTEAGELSRQLEEKESIVSQLSRSKQAFTQQTEELKRQLEEENKAKNALAHALQSSRHDCDLLREQYEEEQEGKAELQRALSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQDSEEQVEAVNAKCASLEKTKQRLQGEVEDLMVDVERANSLAAALDKKQRNFDKVLAEWKTKCEESQAELEASLKESRSLSTELFKLKNAYEEALDQLETVKRENKNLEQEIADLTEQIAENGKTIHELEKSRKQIELEKADIQLALEEAEAALEHEEAKILRIQLELTQVKSEIDRKIAEKDEEIEQLKRNYQRTVETMQSALDAEVRSRNEAIRLKKKMEGDLNEIEIQLSHANRQAAETLKHLRSVQGQLKDTQLHLDDALRGQEDLKEQLAIVERRANLLQAEVEELRATLEQTERARKLAEQELLDSNERVQLLHTQNTSLIHTKKKLETDLMQLQSEVEDASRDARNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNLEQTVKDLQHRLDEAEQLALKGGKKQIQKLETRIRELEFELEGEQKKNTESVKGLRKYERRVKELTYQSEEDRKNVLRLQDLVDKLQVKVKSYKRQAEEADEQANAHLTKFRKAQHELEEAEERADIAESQVNKLRAKTRDFTSSRMVVHESEE,1940,NP_002461.2.csv,refseq-MYH3-NM_002470.3_clinical_seed_0_final,refseq-MYH3-NM_002470.3.a2m,Invitae,refseq-MYH3-NM_002470.3.npy,1,1940,1940
+NP_002462.2,MTDAQMADFGAAAQYLRKSEKERLEAQTRPFDIRTECFVPDDKEEFVKAKILSREGGKVIAETENGKTVTVKEDQVLQQNPPKFDKIEDMAMLTFLHEPAVLFNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNAEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGESGAGKTVNTKRVIQYFASIAAIGDRGKKDNANANKGTLEDQIIQANPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKLASADIETYLLEKSRVIFQLKAERNYHIFYQILSNKKPELLDMLLVTNNPYDYAFVSQGEVSVASIDDSEELMATDSAFDVLGFTSEEKAGVYKLTGAIMHYGNMKFKQKQREEQAEPDGTEDADKSAYLMGLNSADLLKGLCHPRVKVGNEYVTKGQSVQQVYYSIGALAKAVYEKMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFNSFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLQACIDLIEKPMGIMSILEEECMFPKATDMTFKAKLYDNHLGKSNNFQKPRNIKGKQEAHFSLIHYAGTVDYNILGWLEKNKDPLNETVVALYQKSSLKLMATLFSSYATADTGDSGKSKGGKKKGSSFQTVSALHRENLNKLMTNLRTTHPHFVRCIIPNERKAPGVMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRILNPVAIPEGQFIDSRKGTEKLLSSLDIDHNQYKFGHTKVFFKAGLLGLLEEMRDERLSRIITRMQAQARGQLMRIEFKKIVERRDALLVIQWNIRAFMGVKNWPWMKLYFKIKPLLKSAETEKEMATMKEEFGRIKETLEKSEARRKELEEKMVSLLQEKNDLQLQVQAEQDNLNDAEERCDQLIKNKIQLEAKVKEMNERLEDEEEMNAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNLTEEMAGLDEIIAKLTKEKKALQEAHQQALDDLQVEEDKVNSLSKSKVKLEQQVDDLEGSLEQEKKVRMDLERAKRKLEGDLKLTQESIMDLENDKLQLEEKLKKKEFDINQQNSKIEDEQVLALQLQKKLKENQARIEELEEELEAERTARAKVEKLRSDLSRELEEISERLEEAGGATSVQIEMNKKREAEFQKMRRDLEEATLQHEATAAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSEFKLELDDVTSNMEQIIKAKANLEKVSRTLEDQANEYRVKLEEAQRSLNDFTTQRAKLQTENGELARQLEEKEALISQLTRGKLSYTQQMEDLKRQLEEEGKAKNALAHALQSARHDCDLLREQYEEETEAKAELQRVLSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQDAEEAVEAVNAKCSSLEKTKHRLQNEIEDLMVDVERSNAAAAALDKKQRNFDKILAEWKQKYEESQSELESSQKEARSLSTELFKLKNAYEESLEHLETFKRENKNLQEEISDLTEQLGEGGKNVHELEKVRKQLEVEKLELQSALEEAEASLEHEEGKILRAQLEFNQIKAEIERKLAEKDEEMEQAKRNHQRVVDSLQTSLDAETRSRNEVLRVKKKMEGDLNEMEIQLSHANRMAAEAQKQVKSLQSLLKDTQIQLDDAVRANDDLKENIAIVERRNNLLQAELEELRAVVEQTERSRKLAEQELIETSERVQLLHSQNTSLINQKKKMESDLTQLQSEVEEAVQECRNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNMEQTIKDLQHRLDEAEQIALKGGKKQLQKLEARVRELEGELEAEQKRNAESVKGMRKSERRIKELTYQTEEDKKNLLRLQDLVDKLQLKVKAYKRQAEEAEEQANTNLSKFRKVQHELDEAEERADIAESQVNKLRAKSRDIGAKQKMHDEE,1939,NP_002462.2.csv,refseq-MYH6-NM_002471.3_clinical_seed_0_final,refseq-MYH6-NM_002471.3.a2m,Invitae,refseq-MYH6-NM_002471.3.npy,1,1939,1939
+NP_002463.2,MSASSDAEMAVFGEAAPYLRKSEKERIEAQNKPFDAKTSVFVAEPKESYVKSTIQSKEGGKVTVKTEGGATLTVREDQVFPMNPPKYDKIEDMAMMTHLHEPGVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYKPEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAVTGEKKKDESGKMQGTLEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVTFQLKAERSYHIFYQITSNKKPDLIEMLLITTNPYDYAFVSQGEITVPSIDDQEELMATDSAIDILGFTPEEKVSIYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKAAYLQSLNSADLLKALCYPRVKVGNEYVTKGQTVQQVYNAVGALAKAVYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLAACIELIEKPLGIFSILEEECMFPKATDTSFKNKLYDQHLGKSANFQKPKVVKGKAEAHFSLIHYAGTVDYNITGWLDKNKDPLNDTVVGLYQKSAMKTLASLFSTYASAEADSSAKKGAKKKGSSFQTVSALFRENLNKLMTNLRSTHPHFVRCIIPNETKTPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRILYGDFKQRYKVLNASAIPEGQFIDSKKASEKLLASIDIDHTQYKFGHTKVFFKAGLLGLLEEMRDEKLAQIITRTQAVCRGFLMRVEYQKMLQRREALFCIQYNVRAFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMKEEFQKTKDELAKSEAKRKELEEKMVTLLKEKNDLQLQVQSEADSLADAEERCEQLIKNKIQLEAKIKEVTERAEEEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNLTEEMAGLDETIAKLSKEKKALQETHQQTLDDLQAEEDKVNILTKAKTKLEQQVDDLEGSLEQEKKLRMDLERAKRKLEGDLKLAQESTMDMENDKQQLDEKLEKKEFEISNLISKIEDEQAVEIQLQKKIKELQARIEELGEEIEAERASRAKAEKQRSDLSRELEEISERLEEAGGATSAQVELNKKREAEFQKLRRDLEEATLQHEAMVAALRKKHADSMAELGEQIDNLQRVKQKLEKEKSELKMETDDLSSNAEAISKAKGNLEKMCRSLEDQVSELKTKEEEQQRLINDLTAQRARLQTEAGEYSRQLDEKDALVSQLSRSKQASTQQIEELKHQLEEETKAKNALAHALQSSRHDCDLLREQYEEEQEGKAELQRALSKANSEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQEAEEHVEAVNAKCASLEKTKQRLQNEVEDLMLDVERSNAACAALDKKQRNFDKVLSEWKQKYEETQAELEASQKESRSLSTELFKVKNVYEESLDQLETLRRENKNLQQEISDLTEQIAEGGKQIHELEKIKKQVEQEKCEIQAALEEAEASLEHEEGKILRIQLELNQVKSEVDRKIAEKDEEIDQLKRNHTRVVETMQSTLDAEIRSRNDALRVKKKMEGDLNEMEIQLNHANRLAAESLRNYRNTQGILKETQLHLDDALRGQEDLKEQLAIVERRANLLQAEIEELWATLEQTERSRKIAEQELLDASERVQLLHTQNTSLINTKKKLENDVSQLQSEVEEVIQESRNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNLEQTVKDLQHRLDEAEQLALKGGKKQIQKLEARVRELEGEVENEQKRNAEAVKGLRKHERRVKELTYQTEEDRKNVLRLQDLVDKLQAKVKSYKRQAEEAEEQSNANLSKFRKLQHELEEAEERADIAESQVNKLRVKSREVHTKISAE,1937,NP_002463.2.csv,refseq-MYH8-NM_002472.2_clinical_seed_0_final,refseq-MYH8-NM_002472.2.a2m,Invitae,refseq-MYH8-NM_002472.2.npy,1,1937,1937
+NP_002464.1,MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGKTENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKEERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNTDQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGILDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDKSFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSETALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQRYEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMKVLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETADAMNREVSSLKNKLRRGDLPFVVPRRMARKGAGDGSDEEVDGKADGAEAKPAE,1960,NP_002464.1.csv,refseq-MYH9-NM_002473.5_clinical_seed_0_final,refseq-MYH9-NM_002473.5.a2m,Invitae,refseq-MYH9-NM_002473.5.npy,1,1960,1960
+NP_002476.2,MWKLLPAAGPAGGEPYRLLTGVEYVVGRKNCAILIENDQSISRNHAVLTANFSVTNLSQTDEIPVLTLKDNSKYGTFVNEEKMQNGFSRTLKSGDGITFGVFGSKFRIEYEPLVACSSCLDVSGKTALNQAILQLGGFTVNNWTEECTHLVMVSVKVTIKTICALICGRPIVKPEYFTEFLKAVESKKQPPQIESFYPPLDEPSIGSKNVDLSGRQERKQIFKGKTFIFLNAKQHKKLSSAVVFGGGEARLITEENEEEHNFFLAPGTCVVDTGITNSQTLIPDCQKKWIQSIMDMLQRQGLRPIPEAEIGLAVIFMTTKNYCDPQGHPSTGLKTTTPGPSLSQGVSVDEKLMPSAPVNTTTYVADTESEQADTWDLSERPKEIKVSKMEQKFRMLSQDAPTVKESCKTSSNNNSMVSNTLAKMRIPNYQLSPTKLPSINKSKDRASQQQQTNSIRNYFQPSTKKRERDEENQEMSSCKSARIETSCSLLEQTQPATPSLWKNKEQHLSENEPVDTNSDNNLFTDTDLKSIVKNSASKSHAAEKLRSNKKREMDDVAIEDEVLEQLFKDTKPELEIDVKVQKQEEDVNVRKRPRMDIETNDTFSDEAVPESSKISQENEIGKKRELKEDSLWSAKEISNNDKLQDDSEMLPKKLLLTEFRSLVIKNSTSRNPSGINDDYGQLKNFKKFKKVTYPGAGKLPHIIGGSDLIAHHARKNTELEEWLRQEMEVQNQHAKEESLADDLFRYNPYLKRRR,754,NP_002476.2.csv,refseq-NBN-NM_002485.4_clinical_seed_0_final,refseq-NBN-NM_002485.4.a2m,Invitae,refseq-NBN-NM_002485.4_theta_0.2.npy,1,754,754
+NP_002486.1,MAAVSMSVVLRQTLWRRRAVAVAALSVSRVPTRSLRTSTWRLAQDQTQDTQLITVDEKLDITTLTGVPEEHIKTRKVRIFVPARNNMQSGVNNTKKWKMEFDTRERWENPLMGWASTADPLSNMVLTFSTKEDAVSFAEKNGWSYDIEERKVPKPKSKSYGANFSWNKRTRVSTK,175,NP_002486.1.csv,refseq-NDUFS4-NM_002495.3_clinical_seed_0_final,refseq-NDUFS4-NM_002495.3.a2m,Invitae,refseq-NDUFS4-NM_002495.3.npy,1,175,175
+NP_002492.2,MYSPYCLTQDEFHPFIEALLPHVRAFSYTWFNLQARKRKYFKKHEKRMSKDEERAVKDELLGEKPEIKQKWASRLLAKLRKDIRPEFREDFVLTITGKKPPCCVLSNPDQKGKIRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLYKSPQCSNPGLCVQPHHIGVTIKELDLYLAYFVHTPESGQSDSSNQQGDADIKPLPNGHLSFQDCFVTSGVWNVTELVRVSQTPVATASGPNFSLADLESPSYYNINQVTLGRRSITSPPSTSTTKRPKSIDDSEMESPVDDVFYPGTGRSPAAGSSQSSGWPNDVDAGPASLKKSGKLDFCSALSSQGSSPRMAFTHHPLPVLAGVRPGSPRATASALHFPSTSIIQQSSPYFTHPTIRYHHHHGQDSLKEFVQFVCSDGSGQATGQHSQRQAPPLPTGLSASDPGTATF,441,NP_002492.2.csv,refseq-NFIX-NM_002501.2_clinical_seed_0_final,refseq-NFIX-NM_002501.2.a2m,Invitae,refseq-NFIX-NM_002501.2.npy,1,441,441
+NP_002509.2,MDEDEKDRAKRASRNKSEKKRRDQFNVLIKELSSMLPGNTRKMDKTTVLEKVIGFLQKHNEVSAQTEICDIQQDWKPSFLSNEEFTQLMLEALDGFIIAVTTDGSIIYVSDSITPLLGHLPSDVMDQNLLNFLPEQEHSEVYKILSSHMLVTDSPSPEYLKSDSDLEFYCHLLRGSLNPKEFPTYEYIKFVGNFRSYNNVPSPSCNGFDNTLSRPCRVPLGKEVCFIATVRLATPQFLKEMCIVDEPLEEFTSRHSLEWKFLFLDHRAPPIIGYLPFEVLGTSGYDYYHIDDLELLARCHQHLMQFGKGKSCCYRFLTKGQQWIWLQTHYYITYHQWNSKPEFIVCTHSVVSYADVRVERRQELALEDPPSEALHSSALKDKGSSLEPRQHFNTLDVGASGLNTSHSPSASSRSSHKSSHTAMSEPTSTPTKLMAEASTPALPRSATLPQELPVPGLSQAATMPAPLPSPSSCDLTQQLLPQTVLQSTPAPMAQFSAQFSMFQTIKDQLEQRTRILQANIRWQQEELHKIQEQLCLVQDSNVQMFLQQPAVSLSFSSTQRPEAQQQLQQRSAAVTQPQLGAGPQLPGQISSAQVTSQHLLRESSVISTQGPKPMRSSQLMQSSGRSGSSLVSPFSSATAALPPSLNLTTPASTSQDASQCQPSPDFSHDRQLRLLLSQPIQPMMPGSCDARQPSEVSRTGRQVKYAQSQTVFQNPDAHPANSSSAPMPVLLMGQAVLHPSFPASQPSPLQPAQARQQPPQHYLQVQAPTSLHSEQQDSLLLSTYSQQPGTLGYPQPPPAQPQPLRPPRRVSSLSESSGLQQPPR,824,NP_002509.2.csv,refseq-NPAS2-NM_002518.3_clinical_seed_0_final,refseq-NPAS2-NM_002518.3.a2m,Invitae,refseq-NPAS2-NM_002518.3.npy,1,824,824
+NP_002519.2,MTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRKAQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYDSSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVGFWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDTHVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHACLNQALCPAAQGL,304,NP_002519.2.csv,NP_002519.2_colabfold_clinical_seed_0_final,NP_002519.2_colabfold.a2m,colabfold,NP_002519.2_colabfold_theta_0.2.npy,1,304,304
+NP_002520.2,MLRGGRRGQLGWHSWAAGPGSLLAWLILASAGAAPCPDACCPHGSSGLRCTRDGALDSLHHLPGAENLTELYIENQQHLQHLELRDLRGLGELRNLTIVKSGLRFVAPDAFHFTPRLSRLNLSFNALESLSWKTVQGLSLQELVLSGNPLHCSCALRWLQRWEEEGLGGVPEQKLQCHGQGPLAHMPNASCGVPTLKVQVPNASVDVGDDVLLRCQVEGRGLEQAGWILTELEQSATVMKSGGLPSLGLTLANVTSDLNRKNVTCWAENDVGRAEVSVQVNVSFPASVQLHTAVEMHHWCIPFSVDGQPAPSLRWLFNGSVLNETSFIFTEFLEPAANETVRHGCLRLNQPTHVNNGNYTLLAANPFGQASASIMAAFMDNPFEFNPEDPIPVSFSPVDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVLNKCGRRNKFGINRPAVLAPEDGLAMSLHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASESARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAPGPLGLGQLLAVASQVAAGMVYLAGLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVWSFGVVLWEIFTYGKQPWYQLSNTEAIDCITQGRELERPRACPPEVYAIMRGCWQREPQQRHSIKDVHARLQALAQAPPVYLDVLG,796,NP_002520.2.csv,NTRK1_HUMAN_b07_clinical_seed_0_final,NTRK1_HUMAN_b07.a2m,EVE,NTRK1_HUMAN_b07_theta_0.2.npy,1,796,796
+NP_002538.1,MGHPPLEFSDCYLDSPDFRERLKCYEQELERTNKFIKDVIKDGNALISAMRNYSSAVQKFSQTLQSFQFDFIGDTLTDDEINIAESFKEFAELLNEVENERMMMVHNASDLLIKPLENFRKEQIGFTKERKKKFEKDGERFYSLLDRHLHLSSKKKESQLQEADLQVDKERHNFFESSLDYVYQIQEVQESKKFNIVEPVLAFLHSLFISNSLTVELTQDFLPYKQQLQLSLQNTRNHFSSTREEMEELKKRMKEAPQTCKLPGQPTIEGYLYTQEKWALGISWVKYYCQYEKETKTLTMTPMEQKPGAKQGPLDLTLKYCVRRKTESIDKRFCFDIETNERPGTITLQALSEANRRLWMEAMDGKEPIYHSPITKQQEMELNEVGFKFVRKCINIIETKGIKTEGLYRTVGSNIQVQKLLNAFFDPKCPGDVDFHNSDWDIKTITSSLKFYLRNLSEPVMTYRLHKELVSAAKSDNLDYRLGAIHSLVYKLPEKNREMLELLIRHLVNVCEHSKENLMTPSNMGVIFGPTLMRAQEDTVAAMMNIKFQNIVVEILIEHFGKIYLGPPEESAAPPVPPPRVTARRHKPITISKRLLRERTVFYTSSLDESEDEIQHQTPNGTITSSIEPPKPPQHPKLPIQRSGETDPGRKSPSRPILDGKLEPCPEVDVGKLVSRLQDGGTKITPKATNGPMPGSGPTKTPSFHIKRPAPRPLAHHKEGDADSFSKVRPPGEKPTIIRPPVRPPDPPCRAATPQKPEPKPDIVAGNAGEITSSVVASRTRFFETASRKTGSSQGRLPGDES,802,NP_002538.1.csv,refseq-OPHN1-NM_002547.2_clinical_seed_0_final,refseq-OPHN1-NM_002547.2.a2m,Invitae,refseq-OPHN1-NM_002547.2.npy,1,802,802
+NP_002543.2,MSSRKSKSNSLIHTECLSQVQRILRERFCRQSPHSNLFGVQVQYKHLSELLKRTALHGESNSVLIIGPRGSGKTMLINHALKELMEIEEVSENVLQVHLNGLLQINDKIALKEITRQLNLENVVGDKVFGSFAENLSFLLEALKKGDRTSSCPVIFILDEFDLFAHHKNQTLLYNLFDISQSAQTPIAVIGLTCRLDILELLEKRVKSRFSHRQIHLMNSFGFPQYVKIFKEQLSLPAEFPDKVFAEKWNENVQYLSEDRSVQEVLQKHFNISKNLRSLHMLLMLALNRVTASHPFMTAVDLMEASQLCSMDSKANIVHGLSVLEICLIIAMKHLNDIYEEEPFNFQMVYNEFQKFVQRKAHSVYNFEKPVVMKAFEHLQQLELIKPMERTSGNSQREYQLMKLLLDNTQIMNALQKYPNCPTDVRQWATSSLSWL,436,NP_002543.2.csv,refseq-ORC4-NM_002552.4_clinical_seed_0_final,refseq-ORC4-NM_002552.4.a2m,Invitae,refseq-ORC4-NM_002552.4.npy,1,436,436
+NP_002569.1,MSDGLDNEEKPPAPPLRMNSNNRDSSALNHSSKPLPMAPEEKNKKARLRSIFPGGGDKTNKKKEKERPEISLPSDFEHTIHVGFDAVTGEFTGIPEQWARLLQTSNITKLEQKKNPQAVLDVLKFYDSKETVNNQKYMSFTSGDKSAHGYIAAHPSSTKTASEPPLAPPVSEEEDEEEEEEEDENEPPPVIAPRPEHTKSIYTRSVVESIASPAVPNKEVTPPSAENANSSTLYRNTDRQRKKSKMTDEEILEKLRSIVSVGDPKKKYTRFEKIGQGASGTVYTALDIATGQEVAIKQMNLQQQPKKELIINEILVMRENKNPNIVNYLDSYLVGDELWVVMEYLAGGSLTDVVTETCMDEGQIAAVCRECLQALDFLHSNQVIHRDIKSDNILLGMDGSVKLTDFGFCAQITPEQSKRSTMVGTPYWMAPEVVTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGTPELQNPERLSAVFRDFLNRCLEMDVDRRGSAKELLQHPFLKLAKPLSSLTPLIIAAKEAIKNSSR,544,NP_002569.1.csv,refseq-PAK3-NM_002578.4_clinical_seed_0_final,refseq-PAK3-NM_002578.4.a2m,Invitae,refseq-PAK3-NM_002578.4_theta_0.2.npy,1,544,544
+NP_002573.1,MEIIRSNFKSNLHKVYQAIEEADFFAIDGEFSGISDGPSVSALTNGFDTPEERYQKLKKHSMDFLLFQFGLCTFKYDYTDSKYITKSFNFYVFPKPFNRSSPDVKFVCQSSSIDFLASQGFDFNKVFRNGIPYLNQEEERQLREQYDEKRSQANGAGALSYVSPNTSKCPVTIPEDQKKFIDQVVEKIEDLLQSEENKNLDLEPCTGFQRKLIYQTLSWKYPKGIHVETLETEKKERYIVISKVDEEERKRREQQKHAKEQEELNDAVGFSRVIHAIANSGKLVIGHNMLLDVMHTVHQFYCPLPADLSEFKEMTTCVFPRLLDTKLMASTQPFKDIINNTSLAELEKRLKETPFNPPKVESAEGFPSYDTASEQLHEAGYDAYITGLCFISMANYLGSFLSPPKIHVSARSKLIEPFFNKLFLMRVMDIPYLNLEGPDLQPKRDHVLHVTFPKEWKTSDLYQLFSAFGNIQISWIDDTSAFVSLSQPEQVKIAVNTSKYAESYRIQTYAEYMGRKQEEKQIKRKWTEDSWKEADSKRLNPQCIPYTLQNHYYRNNSFTAPSTVGKRNLSPSQEEAGLEDGVSGEISDTELEQTDSCAEPLSEGRKKAKKLKRMKKELSPAGSISKNSPATLFEVPDTW,639,NP_002573.1.csv,refseq-PARN-NM_002582.3_clinical_seed_0_final,refseq-PARN-NM_002582.3.a2m,Invitae,refseq-PARN-NM_002582.3.npy,1,639,639
+NP_002576.1,MDEQPRLMHSHAGVGMAGHPGLSQHLQDGAGGTEGEGGRKQDIGDILQQIMTITDQSLDEAQARKHALNCHRMKPALFNVLCEIKEKTVLSIRGAQEEEPTDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGAGSDNSVEHSDYRAKLSQIRQIYHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVSIIHRKFSSIQMQLKQSTCEAVMILRSRFLDARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYKKNIGKFQEEANIYAAKTAVTATNVSAHGSQANSPSTPNSAGSSSSFNMSNSGDLFMSVQSLNGDSYQGAQVGANVQSQVDTLRHVISQTGGYSDGLAASQMYSPQGISANGGWQDATTPSSVTSPTEGPGSVHSDTSN,430,NP_002576.1.csv,refseq-PBX1-NM_002585.3_clinical_seed_0_final,refseq-PBX1-NM_002585.3.a2m,Invitae,refseq-PBX1-NM_002585.3.npy,1,430,430
+NP_002599.1,MNRCWALFLSLCCYLRLVSAEGDPIPEELYEMLSDHSIRSFDDLQRLLHGDPGEEDGAELDLNMTRSHSGGELESLARGRRSLGSLTIAEPAMIAECKTRTEVFEISRRLIDRTNANFLVWPPCVEVQRCSGCCNNRNVQCRPTQVQLRPVQVRKIEIVRKKPIFKKATVTLEDHLACKCETVAAARPVTRSPGGSQEQRAKTPQTRVTIRTVRVRRPPKGKHRKFKHTHDKTALKETLGA,241,NP_002599.1.csv,refseq-PDGFB-NM_002608.3_clinical_seed_0_final,refseq-PDGFB-NM_002608.3.a2m,Invitae,refseq-PDGFB-NM_002608.3.npy,1,241,241
+NP_002600.1,MRLPGAMPALALKGELLLLSLLLLLEPQISQGLVVTPPGPELVLNVSSTFVLTCSGSAPVVWERMSQEPPQEMAKAQDGTFSSVLTLTNLTGLDTGEYFCTHNDSRGLETDERKRLYIFVPDPTVGFLPNDAEELFIFLTEITEITIPCRVTDPQLVVTLHEKKGDVALPVPYDHQRGFSGIFEDRSYICKTTIGDREVDSDAYYVYRLQVSSINVSVNAVQTVVRQGENITLMCIVIGNEVVNFEWTYPRKESGRLVEPVTDFLLDMPYHIRSILHIPSAELEDSGTYTCNVTESVNDHQDEKAINITVVESGYVRLLGEVGTLQFAELHRSRTLQVVFEAYPPPTVLWFKDNRTLGDSSAGEIALSTRNVSETRYVSELTLVRVKVAEAGHYTMRAFHEDAEVQLSFQLQINVPVRVLELSESHPDSGEQTVRCRGRGMPQPNIIWSACRDLKRCPRELPPTLLGNSSEEESQLETNVTYWEEEQEFEVVSTLRLQHVDRPLSVRCTLRNAVGQDTQEVIVVPHSLPFKVVVISAILALVVLTIISLIILIMLWQKKPRYEIRWKVIESVSSDGHEYIYVDPMQLPYDSTWELPRDQLVLGRTLGSGAFGQVVEATAHGLSHSQATMKVAVKMLKSTARSSEKQALMSELKIMSHLGPHLNVVNLLGACTKGGPIYIITEYCRYGDLVDYLHRNKHTFLQHHSDKRRPPSAELYSNALPVGLPLPSHVSLTGESDGGYMDMSKDESVDYVPMLDMKGDVKYADIESSNYMAPYDNYVPSAPERTCRATLINESPVLSYMDLVGFSYQVANGMEFLASKNCVHRDLAARNVLICEGKLVKICDFGLARDIMRDSNYISKGSTFLPLKWMAPESIFNSLYTTLSDVWSFGILLWEIFTLGGTPYPELPMNEQFYNAIKRGYRMAQPAHASDEIYEIMQKCWEEKFEIRPPFSQLVLLLERLLGEGYKKKYQQVDEEFLRSDHPAILRSQARLPGFHGLRSPLDTSSVLYTAVQPNEGDNDYIIPLPDPKPEVADEGPLEGSPSLASSTLNEVNTSSTISCDSPLEPQDEPEPEPQLELQVEPEPELEQLPDSGCPAPRAEAEDSFL,1106,NP_002600.1.csv,refseq-PDGFRB-NM_002609.4_clinical_seed_0_final,refseq-PDGFRB-NM_002609.4.a2m,Invitae,refseq-PDGFRB-NM_002609.4.npy,1,1106,1106
+NP_002606.3,MQALVLLLCIGALLGHSSCQNPASPPEEGSPDPDSTGALVEEEDPFFKVPVNKLAAAVSNFGYDLYRVRSSTSPTTNVLLSPLSVATALSALSLGAEQRTESIIHRALYYDLISSPDIHGTYKELLDTVTAPQKNLKSASRIVFEKKLRIKSSFVAPLEKSYGTRPRVLTGNPRLDLQEINNWVQAQMKGKLARSTKEIPDEISILLLGVAHFKGQWVTKFDSRKTSLEDFYLDEERTVRVPMMSDPKAVLRYGLDSDLSCKIAQLPLTGSMSIIFFLPLKVTQNLTLIEESLTSEFIHDIDRELKTVQAVLTVPKLKLSYEGEVTKSLQEMKLQSLFDSPDFSKITGKPIKLTQVEHRAGFEWNEDGAGTTPSPGLQPAHLTFPLDYHLNQPFIFVLRDTDTGALLFIGKILDPRGP,418,NP_002606.3.csv,refseq-SERPINF1-NM_002615.6_clinical_seed_0_final,refseq-SERPINF1-NM_002615.6.a2m,Invitae,refseq-SERPINF1-NM_002615.6.npy,1,418,418
+NP_002612.1,MITEGAQAPRLLLPPLLLLLTLPATGSDPVLCFTQYEESSGKCKGLLGGGVSVEDCCLNTAFAYQKRSGGLCQPCRSPRWSLWSTWAPCSVTCSEGSQLRYRRCVGWNGQCSGKVAPGTLEWQLQACEDQQCCPEMGGWSGWGPWEPCSVTCSKGTRTRRRACNHPAPKCGGHCPGQAQESEACDTQQVCPTHGAWATWGPWTPCSASCHGGPHEPKETRSRKCSAPEPSQKPPGKPCPGLAYEQRRCTGLPPCPVAGGWGPWGPVSPCPVTCGLGQTMEQRTCNHPVPQHGGPFCAGDATRTHICNTAVPCPVDGEWDSWGEWSPCIRRNMKSISCQEIPGQQSRGRTCRGRKFDGHRCAGQQQDIRHCYSIQHCPLKGSWSEWSTWGLCMPPCGPNPTRARQRLCTPLLPKYPPTVSMVEGQGEKNVTFWGRPLPRCEELQGQKLVVEEKRPCLHVPACKDPEEEEL,469,NP_002612.1.csv,refseq-CFP-NM_002621.2_clinical_seed_0_final,refseq-CFP-NM_002621.2.a2m,Invitae,refseq-CFP-NM_002621.2.npy,1,469,469
+NP_002624.2,MVKIVTVKTQAYQDQKPGTSGLRKRVKVFQSSANYAENFIQSIISTVEPAQRQEATLVVGGDGRFYMKEAIQLIARIAAANGIGRLVIGQNGILSTPAVSCIIRKIKAIGGIILTASHNPGGPNGDFGIKFNISNGGPAPEAITDKIFQISKTIEEYAVCPDLKVDLGVLGKQQFDLENKFKPFTVEIVDSVEAYATMLRSIFDFSALKELLSGPNRLKIRIDAMHGVVGPYVKKILCEELGAPANSAVNCVPLEDFGGHHPDPNLTYAADLVETMKSGEHDFGAAFDGDGDRNMILGKHGFFVNPSDSVAVIAANIFSIPYFQQTGVRGFARSMPTSGALDRVASATKIALYETPTGWKFFGNLMDASKLSLCGEESFGTGSDHIREKDGLWAVLAWLSILATRKQSVEDILKDHWQKYGRNFFTRYDYEEVEAEGANKMMKDLEALMFDRSFVGKQFSANDKVYTVEKADNFEYSDPVDGSISRNQGLRLIFTDGSRIVFRLSGTGSAGATIRLYIDSYEKDVAKINQDPQVMLAPLISIALKVSQLQERTGRTAPTVIT,562,NP_002624.2.csv,refseq-PGM1-NM_002633.2_clinical_seed_0_final,refseq-PGM1-NM_002633.2.a2m,Invitae,refseq-PGM1-NM_002633.2.npy,1,562,562
+NP_002632.1,MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNEISETR,484,NP_002632.1.csv,refseq-PIGA-NM_002641.3_clinical_seed_0_final,refseq-PIGA-NM_002641.3.a2m,Invitae,refseq-PIGA-NM_002641.3.npy,1,484,484
+NP_002633.1,MYAQPVTNTKEVKWQKVLYERQPFPDNYVDRRFLEELRKNIHARKYQYWAVVFESSVVIQQLCSVCVFVVIWWYMDEGLLAPHWLLGTGLASSLIGYVLFDLIDGGEGRKKSGQTRWADLKSALVFITFTYGFSPVLKTLTESVSTDTIYAMSVFMLLGHLIFFDYGANAAIVSSTLSLNMAIFASVCLASRLPRSLHAFIMVTFAIQIFALWPMLQKKLKACTPRSYVGVTLLFAFSAVGGLLSISAVGAVLFALLLMSISCLCPFYLIRLQLFKENIHGPWDEAEIKEDLSRFLS,297,NP_002633.1.csv,refseq-PIGC-NM_002642.3_clinical_seed_0_final,refseq-PIGC-NM_002642.3.a2m,Invitae,refseq-PIGC-NM_002642.3.npy,1,297,297
+NP_002644.4,MDAFKGGMSLERLPEGLRPPPPPPHDMGPAFHLARPADPREPLENSASESSDTELPEKERGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSMREEIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLVQPYEDVYAAGYSYNNWAAKSLAPAPLSTKSFTFFNSMSPLSSQSMFSAPSSISSMTMPSSMGPGAVPGMPNSGLNNINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSLASLRLKSKQHSSFGYGGLQGPASGLNACQYNS,314,NP_002644.4.csv,refseq-PITX1-NM_002653.4_clinical_seed_0_final,refseq-PITX1-NM_002653.4.a2m,Invitae,refseq-PITX1-NM_002653.4.npy,1,314,314
+NP_002652.2,MSTTVNVDSLAEYEKSQIKRALELGTVMTVFSFRKSTPERRTVQVIMETRQVAWSKTADKIEGFLDIMEIKEIRPGKNSKDFERAKAVRQKEDCCFTILYGTQFVLSTLSLAADSKEDAVNWLSGLKILHQEAMNASTPTIIESWLRKQIYSVDQTRRNSISLRELKTILPLINFKVSSAKFLKDKFVEIGAHKDELSFEQFHLFYKKLMFEQQKSILDEFKKDSSVFILGNTDRPDASAVYLHDFQRFLIHEQQEHWAQDLNKVRERMTKFIDDTMRETAEPFLFVDEFLTYLFSRENSIWDEKYDAVDMQDMNNPLSHYWISSSHNTYLTGDQLRSESSPEAYIRCLRMGCRCIELDCWDGPDGKPVIYHGWTRTTKIKFDDVVQAIKDHAFVTSSFPVILSIEEHCSVEQQRHMAKAFKEVFGDLLLTKPTEASADQLPSPSQLREKIIIKHKKLGPRGDVDVNMEDKKDEHKQQGELYMWDSIDQKWTRHYCAIADAKLSFSDDIEQTMEEEVPQDIPPTELHFGEKWFHKKVEKRTSAEKLLQEYCMETGGKDGTFLVRESETFPNDYTLSFWRSGRVQHCRIRSTMEGGTLKYYLTDNLTFSSIYALIQHYRETHLRCAEFELRLTDPVPNPNPHESKPWYYDSLSRGEAEDMLMRIPRDGAFLIRKREGSDSYAITFRARGKVKHCRINRDGRHFVLGTSAYFESLVELVSYYEKHSLYRKMRLRYPVTPELLERYNMERDINSLYDVSRMYVDPSEINPSMPQRTVKALYDYKAKRSDELSFCRGALIHNVSKEPGGWWKGDYGTRIQQYFPSNYVEDISTADFEELEKQIIEDNPLGSLCRGILDLNTYNVVKAPQGKNQKSFVFILEPKQQGDPPVEFATDRVEELFEWFQSIREITWKIDTKENNMKYWEKNQSIAIELSDLVVYCKPTSKTKDNLENPDFREIRSFVETKADSIIRQKPVDLLKYNQKGLTRVYPKGQRVDSSNYDPFRLWLCGSQMVALNFQTADKYMQMNHALFSLNGRTGYVLQPESMRTEKYDPMPPESQRKILMTLTVKVLGARHLPKLGRSIACPFVEVEICGAEYDNNKFKTTVVNDNGLSPIWAPTQEKVTFEIYDPNLAFLRFVVYEEDMFSDPNFLAHATYPIKAVKSGFRSVPLKNGYSEDIELASLLVFCEMRPVLESEEELYSSCRQLRRRQEELNNQLFLYDTHQNLRNANRDALVKEFSVNENQLQLYQEKCNKRLREKRVSNSKFYS,1265,NP_002652.2.csv,refseq-PLCG2-NM_002661.4_clinical_seed_0_final,refseq-PLCG2-NM_002661.4.a2m,Invitae,refseq-PLCG2-NM_002661.4.npy,1,1265,1265
+NP_002653.1,MSLKNEPRVNTSALQKIAADMSNIIENLDTRELHFEGEEVDYDVSPSDPKIQEVYIPFSAIYNTQGFKEPNIQTYLSGCPIKAQVLEVERFTSTTRVPSINLYTIELTHGEFKWQVKRKFKHFQEFHRELLKYKAFIRIPIPTRRHTFRRQNVREEPREMPSLPRSSENMIREEQFLGRRKQLEDYLTKILKMPMYRNYHATTEFLDISQLSFIHDLGPKGIEGMIMKRSGGHRIPGLNCCGQGRACYRWSKRWLIVKDSFLLYMKPDSGAIAFVLLVDKEFKIKVGKKETETKYGIRIDNLSRTLILKCNSYRHARWWGGAIEEFIQKHGTNFLKDHRFGSYAAIQENALAKWYVNAKGYFEDVANAMEEANEEIFITDWWLSPEIFLKRPVVEGNRWRLDCILKRKAQQGVRIFIMLYKEVELALGINSEYTKRTLMRLHPNIKVMRHPDHVSSTVYLWAHHEKLVIIDQSVAFVGGIDLAYGRWDDNEHRLTDVGSVKRVTSGPSLGSLPPAAMESMESLRLKDKNEPVQNLPIQKSIDDVDSKLKGIGKPRKFSKFSLYKQLHRHHLHDADSISSIDSTSSYFNHYRSHHNLIHGLKPHFKLFHPSSESEQGLTRPHADTGSIRSLQTGVGELHGETRFWHGKDYCNFVFKDWVQLDKPFADFIDRYSTPRMPWHDIASAVHGKAARDVARHFIQRWNFTKIMKSKYRSLSYPFLLPKSQTTAHELRYQVPGSVHANVQLLRSAADWSAGIKYHEESIHAAYVHVIENSRHYIYIENQFFISCADDKVVFNKIGDAIAQRILKAHRENQKYRVYVVIPLLPGFEGDISTGGGNALQAIMHFNYRTMCRGENSILGQLKAELGNQWINYISFCGLRTHAELEGNLVTELIYVHSKLLIADDNTVIIGSANINDRSMLGKRDSEMAVIVQDTETVPSVMDGKEYQAGRFARGLRLQCFRVVLGYLDDPSEDIQDPVSDKFFKEVWVSTAARNATIYDKVFRCLPNDEVHNLIQLRDFINKPVLAKEDPIRAEEELKKIRGFLVQFPFYFLSEESLLPSVGTKEAIVPMEVWT,1074,NP_002653.1.csv,refseq-PLD1-NM_002662.4_clinical_seed_0_final,refseq-PLD1-NM_002662.4.a2m,Invitae,refseq-PLD1-NM_002662.4.npy,1,1074,1074
+NP_002668.1,MSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDIITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKETTIKRKLVNGKMVAECKMKGVVCTRIYEKV,132,NP_002668.1.csv,refseq-PMP2-NM_002677.3_clinical_seed_0_final,refseq-PMP2-NM_002677.3.a2m,Invitae,refseq-PMP2-NM_002677.3.npy,1,132,132
+NP_002682.2,MDGKRRPGPGPGVPPKRARGGLWDDDDAPRPSQFEEDLALMEEMEAEHRLQEQEEEELQSVLEGVADGQVPPSAIDPRWLRPTPPALDPQTEPLIFQQLEIDHYVGPAQPVPGGPPPSRGSVPVLRAFGVTDEGFSVCCHIHGFAPYFYTPAPPGFGPEHMGDLQRELNLAISRDSRGGRELTGPAVLAVELCSRESMFGYHGHGPSPFLRITVALPRLVAPARRLLEQGIRVAGLGTPSFAPYEANVDFEIRFMVDTDIVGCNWLELPAGKYALRLKEKATQCQLEADVLWSDVVSHPPEGPWQRIAPLRVLSFDIECAGRKGIFPEPERDPVIQICSLGLRWGEPEPFLRLALTLRPCAPILGAKVQSYEKEEDLLQAWSTFIRIMDPDVITGYNIQNFDLPYLISRAQTLKVQTFPFLGRVAGLCSNIRDSSFQSKQTGRRDTKVVSMVGRVQMDMLQVLLREYKLRSYTLNAVSFHFLGEQKEDVQHSIITDLQNGNDQTRRRLAVYCLKDAYLPLRLLERLMVLVNAVEMARVTGVPLSYLLSRGQQVKVVSQLLRQAMHEGLLMPVVKSEGGEDYTGATVIEPLKGYYDVPIATLDFSSLYPSIMMAHNLCYTTLLRPGTAQKLGLTEDQFIRTPTGDEFVKTSVRKGLLPQILENLLSARKRAKAELAKETDPLRRQVLDGRQLALKVSANSVYGFTGAQVGKLPCLEISQSVTGFGRQMIEKTKQLVESKYTVENGYSTSAKVVYGDTDSVMCRFGVSSVAEAMALGREAADWVSGHFPSPIRLEFEKVYFPYLLISKKRYAGLLFSSRPDAHDRMDCKGLEAVRRDNCPLVANLVTASLRRLLIDRDPEGAVAHAQDVISDLLCNRIDISQLVITKELTRAASDYAGKQAHVELAERMRKRDPGSAPSLGDRVPYVIISAAKGVAAYMKSEDPLFVLEHSLPIDTQYYLEQQLAKPLLRIFEPILGEGRAEAVLLRGDHTRCKTVLTGKVGGLLAFAKRRNCCIGCRTVLSHQGAVCEFCQPRESELYQKEVSHLNALEERFSRLWTQCQRCQGSLHEDVICTSRDCPIFYMRKKVRKDLEDQEQLLRRFGPPGPEAW,1107,NP_002682.2.csv,refseq-POLD1-NM_002691.3_clinical_seed_0_final,refseq-POLD1-NM_002691.3.a2m,Invitae,refseq-POLD1-NM_002691.3.npy,1,1107,1107
+NP_002684.1,MSRLLWRKVAGATVGPGPVPAPGRWVSSSVPASDPSDGQRRRQQQQQQQQQQQQQPQQPQVLSSEGGQLRHNPLDIQMLSRGLHEQIFGQGGEMPGEAAVRRSVEHLQKHGLWGQPAVPLPDVELRLPPLYGDNLDQHFRLLAQKQSLPYLEAANLLLQAQLPPKPPAWAWAEGWTRYGPEGEAVPVAIPEERALVFDVEVCLAEGTCPTLAVAISPSAWYSWCSQRLVEERYSWTSQLSPADLIPLEVPTGASSPTQRDWQEQLVVGHNVSFDRAHIREQYLIQGSRMRFLDTMSMHMAISGLSSFQRSLWIAAKQGKHKVQPPTKQGQKSQRKARRGPAISSWDWLDISSVNSLAEVHRLYVGGPPLEKEPRELFVKGTMKDIRENFQDLMQYCAQDVWATHEVFQQQLPLFLERCPHPVTLAGMLEMGVSYLPVNQNWERYLAEAQGTYEELQREMKKSLMDLANDACQLLSGERYKEDPWLWDLEWDLQEFKQKKAKKVKKEPATASKLPIEGAGAPGDPMDQEDLGPCSEEEEFQQDVMARACLQKLKGTTELLPKRPQHLPGHPGWYRKLCPRLDDPAWTPGPSLLSLQMRVTPKLMALTWDGFPLHYSERHGWGYLVPGRRDNLAKLPTGTTLESAGVVCPYRAIESLYRKHCLEQGKQQLMPQEAGLAEEFLLTDNSAIWQTVEELDYLEVEAEAKMENLRAAVPGQPLALTARGGPKDTQPSYHHGNGPYNDVDIPGCWFFKLPHKDGNSCNVGSPFAKDFLPKMEDGTLQAGPGGASGPRALEINKMISFWRNAHKRISSQMVVWLPRSALPRAVIRHPDYDEEGLYGAILPQVVTAGTITRRAVEPTWLTASNARPDRVGSELKAMVQAPPGYTLVGADVDSQELWIAAVLGDAHFAGMHGCTAFGWMTLQGRKSRGTDLHSKTATTVGISREHAKIFNYGRIYGAGQPFAERLLMQFNHRLTQQEAAEKAQQMYAATKGLRWYRLSDEGEWLVRELNLPVDRTEGGWISLQDLRKVQRETARKSQWKKWEVVAERAWKGGTESEMFNKLESIATSDIPRTPVLGCCISRALEPSAVQEEFMTSRVNWVVQSSAVDYLHLMLVAMKWLFEEFAIDGRFCISIHDEVRYLVREEDRYRAALALQITNLLTRCMFAYKLGLNDLPQSVAFFSAVDIDRCLRKEVTMDCKTPSNPTGMERRYGIPQGEALDIYQIIELTKGSLEKRSQPGP,1239,NP_002684.1.csv,refseq-POLG-NM_002693.2_clinical_seed_0_final,refseq-POLG-NM_002693.2.a2m,Invitae,refseq-POLG-NM_002693.2.npy,1,1239,1239
+NP_002706.1,MDEKVFTKELDQWIEQLNECKQLSESQVKSLCEKAKEILTKESNVQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHESRQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVDGQIFCLHGGLSPSIDTLDHIRALDRLQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGYTFGQDISETFNHANGLTLVSRAHQLVMEGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDTLKYSFLQFDPAPRRGEPHVTRRTPDYFL,309,NP_002706.1.csv,refseq-PPP2CA-NM_002715.2_clinical_seed_0_final,refseq-PPP2CA-NM_002715.2.a2m,Invitae,refseq-PPP2CA-NM_002715.2.npy,1,309,309
+NP_002725.1,MESGSTAASEEARSLRECELYVQKHNIQALLKDSIVQLCTARPERPMAFLREYFERLEKEEAKQIQNLQKAGTRTDSREDEISPPPPNPVVKGRRRRGAISAEVYTEEDAASYVRKVIPKDYKTMAALAKAIEKNVLFSHLDDNERSDIFDAMFSVSFIAGETVIQQGDEGDNFYVIDQGETDVYVNNEWATSVGEGGSFGELALIYGTPRAATVKAKTNVKLWGIDRDSYRRILMGSTLRKRKMYEEFLSKVSILESLDKWERLTVADALEPVQFEDGQKIVVQGEPGDEFFIILEGSAAVLQRRSENEEFVEVGRLGPSDYFGEIALLMNRPRAATVVARGPLKCVKLDRPRFERVLGPCSDILKRNIQQYNSFVSLSV,381,NP_002725.1.csv,refseq-PRKAR1A-NM_002734.4_clinical_seed_0_final,refseq-PRKAR1A-NM_002734.4.a2m,Invitae,refseq-PRKAR1A-NM_002734.4.npy,1,381,381
+NP_002730.1,MAGLGPGVGDSEGGPRPLFCRKGALRQKVVHEVKSHKFTARFFKQPTFCSHCTDFIWGIGKQGLQCQVCSFVVHRRCHEFVTFECPGAGKGPQTDDPRNKHKFRLHSYSSPTFCDHCGSLLYGLVHQGMKCSCCEMNVHRRCVRSVPSLCGVDHTERRGRLQLEIRAPTADEIHVTVGEARNLIPMDPNGLSDPYVKLKLIPDPRNLTKQKTRTVKATLNPVWNETFVFNLKPGDVERRLSVEVWDWDRTSRNDFMGAMSFGVSELLKAPVDGWYKLLNQEEGEYYNVPVADADNCSLLQKFEACNYPLELYERVRMGPSSSPIPSPSPSPTDPKRCFFGASPGRLHISDFSFLMVLGKGSFGKVMLAERRGSDELYAIKILKKDVIVQDDDVDCTLVEKRVLALGGRGPGGRPHFLTQLHSTFQTPDRLYFVMEYVTGGDLMYHIQQLGKFKEPHAAFYAAEIAIGLFFLHNQGIIYRDLKLDNVMLDAEGHIKITDFGMCKENVFPGTTTRTFCGTPDYIAPEIIAYQPYGKSVDWWSFGVLLYEMLAGQPPFDGEDEEELFQAIMEQTVTYPKSLSREAVAICKGFLTKHPGKRLGSGPDGEPTIRAHGFFRWIDWERLERLEIPPPFRPRPCGRSGENFDKFFTRAAPALTPPDRLVLASIDQADFQGFTYVNPDFVHPDARSPTSPVPVPVM,697,NP_002730.1.csv,refseq-PRKCG-NM_002739.4_clinical_seed_0_final,refseq-PRKCG-NM_002739.4.a2m,Invitae,refseq-PRKCG-NM_002739.4.npy,1,697,697
+NP_002733.2,MSAPPVLRPPSPLLPVAAAAAAAAAALVPGSGPGPAPFLAPVAAPVGGISFHLQIGLSREPVLLLQDSSGDYSLAHVREMACSIVDQKFPECGFYGMYDKILLFRHDPTSENILQLVKAASDIQEGDLIEVVLSASATFEDFQIRPHALFVHSYRAPAFCDHCGEMLWGLVRQGLKCEGCGLNYHKRCAFKIPNNCSGVRRRRLSNVSLTGVSTIRTSSAELSTSAPDEPLLQKSPSESFIGREKRSNSQSYIGRPIHLDKILMSKVKVPHTFVIHSYTRPTVCQYCKKLLKGLFRQGLQCKDCRFNCHKRCAPKVPNNCLGEVTINGDLLSPGAESDVVMEEGSDDNDSERNSGLMDDMEEAMVQDAEMAMAECQNDSGEMQDPDPDHEDANRTISPSTSNNIPLMRVVQSVKHTKRKSSTVMKEGWMVHYTSKDTLRKRHYWRLDSKCITLFQNDTGSRYYKEIPLSEILSLEPVKTSALIPNGANPHCFEITTANVVYYVGENVVNPSSPSPNNSVLTSGVGADVARMWEIAIQHALMPVIPKGSSVGTGTNLHRDISVSISVSNCQIQENVDISTVYQIFPDEVLGSGQFGIVYGGKHRKTGRDVAIKIIDKLRFPTKQESQLRNEVAILQNLHHPGVVNLECMFETPERVFVVMEKLHGDMLEMILSSEKGRLPEHITKFLITQILVALRHLHFKNIVHCDLKPENVLLASADPFPQVKLCDFGFARIIGEKSFRRSVVGTPAYLAPEVLRNKGYNRSLDMWSVGVIIYVSLSGTFPFNEDEDIHDQIQNAAFMYPPNPWKEISHEAIDLINNLLQVKMRKRYSVDKTLSHPWLQDYQTWLDLRELECKIGERYITHESDDLRWEKYAGEQGLQYPTHLINPSASHSDTPETEETEMKALGERVSIL,912,NP_002733.2.csv,refseq-PRKD1-NM_002742.2_clinical_seed_0_final,refseq-PRKD1-NM_002742.2.a2m,Invitae,refseq-PRKD1-NM_002742.2.npy,1,912,912
+NP_002746.1,MPKKKPTPIQLNPAPDGSAVNGTSSAETNLEALQKKLEELELDEQQRKRLEAFLTQKQKVGELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKKAGRIPEQILGKVSIAVIKGLTYLREKHKIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFVGTRSYMSPERLQGTHYSVQSDIWSMGLSLVEMAVGRYPIPPPDAKELELMFGCQVEGDAAETPPRPRTPGRPLSSYGMDSRPPMAIFELLDYIVNEPPPKLPSGVFSLEFQDFVNKCLIKNPAERADLKQLMVHAFIKRSDAEEVDFAGWLCSTIGLNQPSTPTHAAGV,393,NP_002746.1.csv,refseq-MAP2K1-NM_002755.3_clinical_seed_0_final,refseq-MAP2K1-NM_002755.3.a2m,Invitae,refseq-MAP2K1-NM_002755.3.npy,1,393,393
+NP_002755.1,MPNIKIFSGSSHQDLSQKIADRLGLELGKVVTKKFSNQETCVEIGESVRGEDVYIVQSGCGEINDNLMELLIMINACKIASASRVTAVIPCFPYARQDKKDKSRAPISAKLVANMLSVAGADHIITMDLHASQIQGFFDIPVDNLYAEPAVLKWIRENISEWRNCTIVSPDAGGAKRVTSIADRLNVDFALIHKERKKANEVDRMVLVGDVKDRVAILVDDMADTCGTICHAADKLLSAGATRVYAILTHGIFSGPAISRINNACFEAVVVTNTIPQEDKMKHCSKIQVIDISMILAEAIRRTHNGESVSYLFSHVPL,318,NP_002755.1.csv,refseq-PRPS1-NM_002764.3_clinical_seed_0_final,refseq-PRPS1-NM_002764.3.a2m,Invitae,refseq-PRPS1-NM_002764.3.npy,1,318,318
+NP_002759.2,MDDTLFQLKFTAKQLEKLAKKAEKDSKAEQAKVKKALLQKNVECARVYAENAIRKKNEGVNWLRMASRVDAVASKVQTAVTMKGVTKNMAQVTKALDKALSTMDLQKVSSVMDRFEQQVQNLDVHTSVMEDSMSSATTLTTPQEQVDSLIMQIAEENGLEVLDQLSQLPEGASAVGESSVRSQEDQLSRRLAALRN,196,NP_002759.2.csv,refseq-CHMP1A-NM_002768.4_clinical_seed_0_final,refseq-CHMP1A-NM_002768.4.a2m,Invitae,refseq-CHMP1A-NM_002768.4.npy,1,196,196
+NP_002760.1,MNPLLILTFVAAALAAPFDDDDKIVGGYNCEENSVPYQVSLNSGYHFCGGSLINEQWVVSAGHCYKSRIQVRLGEHNIEVLEGNEQFINAAKIIRHPQYDRKTLNNDIMLIKLSSRAVINARVSTISLPTAPPATGTKCLISGWGNTASSGADYPDELQCLDAPVLSQAKCEASYPGKITSNMFCVGFLEGGKDSCQGDSGGPVVCNGQLQGVVSWGDGCAQKNKPGVYTKVYNYVKWIKNTIAANS,247,NP_002760.1.csv,refseq-PRSS1-NM_002769.4_clinical_seed_0_final,refseq-PRSS1-NM_002769.4.a2m,Invitae,refseq-PRSS1-NM_002769.4_theta_0.2.npy,1,247,247
+NP_002766.1,MQIPRAALLPLLLLLLAAPASAQLSRAGRSAPLAAGCPDRCEPARCPPQPEHCEGGRARDACGCCEVCGAPEGAACGLQEGPCGEGLQCVVPFGVPASATVRRRAQAGLCVCASSEPVCGSDANTYANLCQLRAASRRSERLHRPPVIVLQRGACGQGQEDPNSLRHKYNFIADVVEKIAPAVVHIELFRKLPFSKREVPVASGSGFIVSEDGLIVTNAHVVTNKHRVKVELKNGATYEAKIKDVDEKADIALIKIDHQGKLPVLLLGRSSELRPGEFVVAIGSPFSLQNTVTTGIVSTTQRGGKELGLRNSDMDYIQTDAIINYGNSGGPLVNLDGEVIGINTLKVTAGISFAIPSDKIKKFLTESHDRQAKGKAITKKKYIGIRMMSLTSSKAKELKDRHRDFPDVISGAYIIEVIPDTPAEAGGLKENDVIISINGQSVVSANDVSDVIKRESTLNMVVRRGNEDIMITVIPEEIDP,480,NP_002766.1.csv,refseq-HTRA1-NM_002775.4_clinical_seed_0_final,refseq-HTRA1-NM_002775.4.a2m,Invitae,refseq-HTRA1-NM_002775.4.npy,1,480,480
+NP_002769.1,MYALFLLASLLGAALAGPVLGLKECTRGSAVWCQNVKTASDCGAVKHCLQTVWNKPTVKSLPCDICKDVVTAAGDMLKDNATEEEILVYLEKTCDWLPKPNMSASCKEIVDSYLPVILDIIKGEMSRPGEVCSALNLCESLQKHLAELNHQKQLESNKIPELDMTEVVAPFMANIPLLLYPQDGPRSKPQPKDNGDVCQDCIQMVTDIQTAVRTNSTFVQALVEHVKEECDRLGPGMADICKNYISQYSEIAIQMMMHMQPKEICALVGFCDEVKEMPMQTLVPAKVASKNVIPALELVEPIKKHEVPAKSDVYCEVCEFLVKEVTKLIDNNKTEKEILDAFDKMCSKLPKSLSEECQEVVDTYGSSILSILLEEVSPELVCSMLHLCSGTRLPALTVHVTQPKDGGFCEVCKKLVGYLDRNLEKNSTKQEILAALEKGCSFLPDPYQKQCDQFVAEYEPVLIEILVEVMDPSFVCLKIGACPSAHKPLLGTEKCIWGPSYWCQNTETAAQCNAVEHCKRHVWN,524,NP_002769.1.csv,refseq-PSAP-NM_002778.3_clinical_seed_0_final,refseq-PSAP-NM_002778.3.a2m,Invitae,refseq-PSAP-NM_002778.3.npy,1,524,524
+NP_002791.1,MLRAGAPTGDLPRAGEVHTGTTIMAVEFDGGVVMGSDSRVSAGEAVVNRVFDKLSPLHERIYCALSGSAADAQAVADMAAYQLELHGIELEEPPLVLAAANVVRNISYKYREDLSAHLMVAGWDQREGGQVYGTLGGMLTRQPFAIGGSGSTFIYGYVDAAYKPGMSPEECRRFTTDAIALAMSRDGSSGGVIYLVTITAAGVDHRVILGNELPKFYDE,219,NP_002791.1.csv,refseq-PSMB9-NM_002800.4_clinical_seed_0_final,refseq-PSMB9-NM_002800.4.a2m,Invitae,refseq-PSMB9-NM_002800.4.npy,1,219,219
+NP_002825.3,MTSRRWFHPNITGVEAENLLLTRGVDGSFLARPSKSNPGDFTLSVRRNGAVTHIKIQNTGDYYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKYPLNCADPTSERWFHGHLSGKEAEKLLTEKGKHGSFLVRESQSHPGDFVLSVRTGDDKGESNDGKSKVTHVMIRCQELKYDVGGGERFDSLTDLVEHYKKNPMVETLGTVLQLKQPLNTTRINAAEIESRVRELSKLAETTDKVKQGFWEEFETLQQQECKLLYSRKEGQRQENKNKNRYKNILPFDHTRVVLHDGDPNEPVSDYINANIIMPEFETKCNNSKPKKSYIATQGCLQNTVNDFWRMVFQENSRVIVMTTKEVERGKSKCVKYWPDEYALKEYGVMRVRNVKESAAHDYTLRELKLSKVGQGNTERTVWQYHFRTWPDHGVPSDPGGVLDFLEEVHHKQESIMDAGPVVVHCSAGIGRTGTFIVIDILIDIIREKGVDCDIDVPKTIQMVRSQRSGMVQTEAQYRFIYMAVQHYIETLQRRIEEEQKSKRKGHEYTNIKYSLADQTSGDQSPLPPCTPTPPCAEMREDSARVYENVGLMQQQKSFR,593,NP_002825.3.csv,refseq-PTPN11-NM_002834.3_clinical_seed_0_final,refseq-PTPN11-NM_002834.3.a2m,Invitae,refseq-PTPN11-NM_002834.3.npy,1,593,593
+NP_002829.3,MTMYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTGLTTAKMPSVPLSSDPLPTHTTAFSPASTFERENDFSETTTSLSPDNTSTQVSPDSLDNASAFNTTGVSSVQTPHLPTHADSQTPSAGTDTQTFSGSAANAKLNPTPGSNAISDVPGERSTASTFPTDPVSPLTTTLSLAHHSSAALPARTSNTTITANTSDAYLNASETTTLSPSGSAVISTTTIATTPSKPTCDEKYANITVDYLYNKETKLFTAKLNVNENVECGNNTCTNNEVHNLTECKNASVSISHNSCTAPDKTLILDVPPGVEKFQLHDCTQVEKADTTICLKWKNIETFTCDTQNITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNNHKFTNASKIIKTDFGSPGEPQIIFCRSEAAHQGVITWNPPQRSFHNFTLCYIKETEKDCLNLDKNLIKYDLQNLKPYTKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTSDNSMHVKCRPPRDRNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKAYFHNGDYPGEPFILHHSTSYNSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLDEQQELVERDDEKQLMNVEPIHADILLETYKRKIADEGRLFLAEFQSIPRVFSKFPIKEARKPFNQNKNRYVDILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPRDETVDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHKRCPDYIIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFFSGPIVVHCSAGVGRTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYILIHQALVEYNQFGETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQHIGNQEENKSKNRNSNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYINASFIMSYWKPEVMIAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWGEGKQTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPKELISMIQVVKQKLPQKNSSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEEVVDIFQVVKALRKARPGMVSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNEVDKVKQDANCVNPLGAPEKLPEAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS,1306,NP_002829.3.csv,refseq-PTPRC-NM_002838.4_clinical_seed_0_final,refseq-PTPRC-NM_002838.4.a2m,Invitae,refseq-PTPRC-NM_002838.4.npy,1,1306,1306
+NP_002851.2,MLSQVYRCGFQPFNQHLLPWVKCTTVFRSHCIQPSVIRHVRSWSNIPFITVPLSRTHGKSFAHRSELKHAKRIVVKLGSAVVTRGDECGLALGRLASIVEQVSVLQNQGREMMLVTSGAVAFGKQRLRHEILLSQSVRQALHSGQNQLKEMAIPVLEARACAAAGQSGLMALYEAMFTQYSICAAQILVTNLDFHDEQKRRNLNGTLHELLRMNIVPIVNTNDAVVPPAEPNSDLQGVNVISVKDNDSLAARLAVEMKTDLLIVLSDVEGLFDSPPGSDDAKLIDIFYPGDQQSVTFGTKSRVGMGGMEAKVKAALWALQGGTSVVIANGTHPKVSGHVITDIVEGKKVGTFFSEVKPAGPTVEQQGEMARSGGRMLATLEPEQRAEIIHHLADLLTDQRDEILLANKKDLEEAEGRLAAPLLKRLSLSTSKLNSLAIGLRQIAASSQDSVGRVLRRTRIAKNLELEQVTVPIGVLLVIFESRPDCLPQVAALAIASGNGLLLKGGKEAAHSNRILHLLTQEALSIHGVKEAVQLVNTREEVEDLCRLDKMIDLIIPRGSSQLVRDIQKAAKGIPVMGHSEGICHMYVDSEASVDKVTRLVRDSKCEYPAACNALETLLIHRDLLRTPLFDQIIDMLRVEQVKIHAGPKFASYLTFSPSEVKSLRTEYGDLELCIEVVDNVQDAIDHIHKYGSSHTDVIVTEDENTAEFFLQHVDSACVFWNASTRFSDGYRFGLGAEVGISTSRIHARGPVGLEGLLTTKWLLRGKDHVVSDFSEHGSLKYLHENLPIPQRNTN,795,NP_002851.2.csv,refseq-ALDH18A1-NM_002860.3_clinical_seed_0_final,refseq-ALDH18A1-NM_002860.3.a2m,Invitae,refseq-ALDH18A1-NM_002860.3.npy,1,795,795
+NP_002854.3,MAKPLTDQEKRRQISIRGIVGVENVAELKKSFNRHLHFTLVKDRNVATTRDYYFALAHTVRDHLVGRWIRTQQHYYDKCPKRVYYLSLEFYMGRTLQNTMINLGLQNACDEAIYQLGLDIEELEEIEEDAGLGNGGLGRLAACFLDSMATLGLAAYGYGIRYEYGIFNQKIRDGWQVEEADDWLRYGNPWEKSRPEFMLPVHFYGKVEHTNTGTKWIDTQVVLALPYDTPVPGYMNNTVNTMRLWSARAPNDFNLRDFNVGDYIQAVLDRNLAENISRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKASKFGSTRGAGTVFDAFPDQVAIQLNDTHPALAIPELMRIFVDIEKLPWSKAWELTQKTFAYTNHTVLPEALERWPVDLVEKLLPRHLEIIYEINQKHLDRIVALFPKDVDRLRRMSLIEEEGSKRINMAHLCIVGSHAVNGVAKIHSDIVKTKVFKDFSELEPDKFQNKTNGITPRRWLLLCNPGLAELIAEKIGEDYVKDLSQLTKLHSFLGDDVFLRELAKVKQENKLKFSQFLETEYKVKINPSSMFDVQVKRIHEYKRQLLNCLHVITMYNRIKKDPKKLFVPRTVIIGGKAAPGYHMAKMIIKLITSVADVVNNDPMVGSKLKVIFLENYRVSLAEKVIPATDLSEQISTAGTEASGTGNMKFMLNGALTIGTMDGANVEMAEEAGEENLFIFGMRIDDVAALDKKGYEAKEYYEALPELKLVIDQIDNGFFSPKQPDLFKDIINMLFYHDRFKVFADYEAYVKCQDKVSQLYMNPKAWNTMVLKNIAASGKFSSDRTIKEYAQNIWNVEPSDLKISLSNESNKVNGN,847,NP_002854.3.csv,refseq-PYGL-NM_002863.4_clinical_seed_0_final,refseq-PYGL-NM_002863.4.a2m,Invitae,refseq-PYGL-NM_002863.4.npy,1,847,847
+NP_002863.1,MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDSKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASYENVRAKWFPEVRHHCPSTPIILVGTKLDLRDDKDTIEKLKEKKLAPITYPQGLALAKEIDSVKYLECSALTQRGLKTVFDEAIRAVLCPQPTRQQKRACSLL,192,NP_002863.1.csv,refseq-RAC2-NM_002872.4_clinical_seed_0_final,refseq-RAC2-NM_002872.4.a2m,Invitae,refseq-RAC2-NM_002872.4.npy,1,192,192
+NP_002866.2,MAMQMQLEANADTSVEEESFGPQPISRLEQCGINANDVKKLEEAGFHTVEAVAYAPKKELINIKGISEAKADKILAEAAKLVPMGFTTATEFHQRRSEIIQITTGSKELDKLLQGGIETGSITEMFGEFRTGKTQICHTLAVTCQLPIDRGGGEGKAMYIDTEGTFRPERLLAVAERYGLSGSDVLDNVAYARAFNTDHQTQLLYQASAMMVESRYALLIVDSATALYRTDYSGRGELSARQMHLARFLRMLLRLADEFGVAVVITNQVVAQVDGAAMFAADPKKPIGGNIIAHASTTRLYLRKGRGETRICKIYDSPCLPEAEAMFAINADGVGDAKD,339,NP_002866.2.csv,refseq-RAD51-NM_002875.4_clinical_seed_0_final,refseq-RAD51-NM_002875.4.a2m,Invitae,refseq-RAD51-NM_002875.4.npy,1,339,339
+NP_002871.1,MEHIQGAWKTISNGFGFKDAVFDGSSCISPTIVQQFGYQRRASDDGKLTDPSKTSNTIRVFLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAASLIGEELQVDFLDHVPLTTHNFARKTFLKLAFCDICQKFLLNGFRCQTCGYKFHEHCSTKVPTMCVDWSNIRQLLLFPNSTIGDSGVPALPSLTMRRMRESVSRMPVSSQHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVDSRMIEDAIRSHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQEKNKIRPRGQRDSSYYWEIEASEVMLSTRIGSGSFGTVYKGKWHGDVAVKILKVVDPTPEQFQAFRNEVAVLRKTRHVNILLFMGYMTKDNLAIVTQWCEGSSLYKHLHVQETKFQMFQLIDIARQTAQGMDYLHAKNIIHRDMKSNNIFLHEGLTVKIGDFGLATVKSRWSGSQQVEQPTGSVLWMAPEVIRMQDNNPFSFQSDVYSYGIVLYELMTGELPYSHINNRDQIIFMVGRGYASPDLSKLYKNCPKAMKRLVADCVKKVKEERPLFPQILSSIELLQHSLPKINRSASEPSLHRAAHTEDINACTLTTSPRLPVF,648,NP_002871.1.csv,refseq-RAF1-NM_002880.3_clinical_seed_0_final,refseq-RAF1-NM_002880.3.a2m,Invitae,refseq-RAF1-NM_002880.3.npy,1,648,648
+NP_002878.2,MDVLVSECSARLLQQEEEIKSLTAEIDRLKNCGCLGASPNLEQLQEENLKLKYRLNILRKSLQAERNKPTKNMINIISRLQEVFGHAIKAAYPDLENPPLLVTPSQQAKFGDYQCNSAMGISQMLKTKEQKVNPREIAENITKHLPDNECIEKVEIAGPGFINVHLRKDFVSEQLTSLLVNGVQLPALGENKKVIVDFSSPNIAKEMHVGHLRSTIIGESISRLFEFAGYDVLRLNHVGDWGTQFGMLIAHLQDKFPDYLTVSPPIGDLQVFYKESKKRFDTEEEFKKRAYQCVVLLQGKNPDITKAWKLICDVSRQELNKIYDALDVSLIERGESFYQDRMNDIVKEFEDRGFVQVDDGRKIVFVPGCSIPLTIVKSDGGYTYDTSDLAAIKQRLFEEKADMIIYVVDNGQSVHFQTIFAAAQMIGWYDPKVTRVFHAGFGVVLGEDKKKFKTRSGETVRLMDLLGEGLKRSMDKLKEKERDKVLTAEELNAAQTSVAYGCIKYADLSHNRLNDYIFSFDKMLDDRGNTAAYLLYAFTRIRSIARLANIDEEMLQKAARETKILLDHEKEWKLGRCILRFPEILQKILDDLFLHTLCDYIYELATAFTEFYDSCYCVEKDRQTGKILKVNMWRMLLCEAVAAVMAKGFDILGIKPVQRM,660,NP_002878.2.csv,refseq-RARS-NM_002887.3_clinical_seed_0_final,refseq-RARS-NM_002887.3.a2m,Invitae,refseq-RARS-NM_002887.3.npy,1,660,660
+NP_002885.1,MNISGSSCGSPNSADTSSDFKDLWTKLKECHDREVQGLQVKVTKLKQERILDAQRLEEFFTKNQQLREQQKVLHETIKVLEDRLRAGLCDRCAVTEEHMRKKQQEFENIRQQNLKLITELMNERNTLQEENKKLSEQLQQKIENDQQHQAAELECEEDVIPDSPITAFSFSGVNRLRRKENPHVRYIEQTHTKLEHSVCANEMRKVSKSSTHPQHNPNENEILVADTYDQSQSPMAKAHGTSSYTPDKSSFNLATVVAETLGLGVQEESETQGPMSPLGDELYHCLEGNHKKQPFEESTRNTEDSLRFSDSTSKTPPQEELPTRVSSPVFGATSSIKSGLDLNTSLSPSLLQPGKKKHLKTLPFSNTCISRLEKTRSKSEDSALFTHHSLGSEVNKIIIQSSNKQILINKNISESLGEQNRTEYGKDSNTDKHLEPLKSLGGRTSKRKKTEEESEHEVSCPQASFDKENAFPFPMDNQFSMNGDCVMDKPLDLSDRFSAIQRQEKSQGSETSKNKFRQVTLYEALKTIPKGFSSSRKASDGNCTLPKDSPGEPCSQECIILQPLNKCSPDNKPSLQIKEENAVFKIPLRPRESLETENVLDDIKSAGSHEPIKIQTRSDHGGCELASVLQLNPCRTGKIKSLQNNQDVSFENIQWSIDPGADLSQYKMDVTVIDTKDGSQSKLGGETVDMDCTLVSETVLLKMKKQEQKGEKSSNEERKMNDSLEDMFDRTTHEEYESCLADSFSQAADEEEELSTATKKLHTHGDKQDKVKQKAFVEPYFKGDERETSLQNFPHIEVVRKKEERRKLLGHTCKECEIYYADMPAEEREKKLASCSRHRFRYIPPNTPENFWEVGFPSTQTCMERGYIKEDLDPCPRPKRRQPYNAIFSPKGKEQKT,897,NP_002885.1.csv,refseq-RBBP8-NM_002894.2_clinical_seed_0_final,refseq-RBBP8-NM_002894.2.a2m,Invitae,refseq-RBBP8-NM_002894.2.npy,1,897,897
+NP_002891.1,MMREWVLLMSVLLCGLAGPTHLFQPSLVLDMAKVLLDNYCFPENLLGMQEAIQQAIKSHEILSISDPQTLASVLTAGVQSSLNDPRLVISYEPSTPEPPPQVPALTSLSEEELLAWLQRGLRHEVLEGNVGYLRVDSVPGQEVLSMMGEFLVAHVWGNLMGTSALVLDLRHCTGGQVSGIPYIISYLHPGNTILHVDTIYNRPSNTTTEIWTLPQVLGERYGADKDVVVLTSSQTRGVAEDIAHILKQMRRAIVVGERTGGGALDLRKLRIGESDFFFTVPVSRSLGPLGGGSQTWEGSGVLPCVGTPAEQALEKALAILTLRSALPGVVHCLQEVLKDYYTLVDRVPTLLQHLASMDFSTVVSEEDLVTKLNAGLQAASEDPRLLVRAIGPTETPSWPAPDAAAEDSPGVAPELPEDEAIRQALVDSVFQVSVLPGNVGYLRFDSFADASVLGVLAPYVLRQVWEPLQDTEHLIMDLRHNPGGPSSAVPLLLSYFQGPEAGPVHLFTTYDRRTNITQEHFSHMELPGPRYSTQRGVYLLTSHRTATAAEEFAFLMQSLGWATLVGEITAGNLLHTRTVPLLDTPEGSLALTVPVLTFIDNHGEAWLGGGVVPDAIVLAEEALDKAQEVLEFHQSLGALVEGTGHLLEAHYARPEVVGQTSALLRAKLAQGAYRTAVDLESLASQLTADLQEVSGDHRLLVFHSPGELVVEEAPPPPPAVPSPEELTYLIEALFKTEVLPGQLGYLRFDAMAELETVKAVGPQLVRLVWQQLVDTAALVIDLRYNPGSYSTAIPLLCSYFFEAEPRQHLYSVFDRATSKVTEVWTLPQVAGQRYGSHKDLYILMSHTSGSAAEAFAHTMQDLQRATVIGEPTAGGALSVGIYQVGSSPLYASMPTQMAMSATTGKAWDLAGVEPDITVPMSEALSIAQDIVALRAKVPTVLQTAGKLVADNYASAELGAKMATKLSGLQSRYSRVTSEVALAEILGADLQMLSGDPHLKAAHIPENAKDRIPGIVPMQIPSPEVFEELIKFSFHTNVLEDNIGYLRFDMFGDGELLTQVSRLLVEHIWKKIMHTDAMIIDMRFNIGGPTSSIPILCSYFFDEGPPVLLDKIYSRPDDSVSELWTHAQVVGERYGSKKSMVILTSSVTAGTAEEFTYIMKRLGRALVIGEVTSGGCQPPQTYHVDDTNLYLTIPTARSVGASDGSSWEGVGVTPHVVVPAEEALARAKEMLQHNQLRVKRSPGLQDHL,1247,NP_002891.1.csv,refseq-RBP3-NM_002900.2_clinical_seed_0_final,refseq-RBP3-NM_002900.2.a2m,Invitae,refseq-RBP3-NM_002900.2.npy,1,1247,1247
+NP_002896.2,MWLPLLLGALLWAVLWLLRDRQSLPASNAFVFITGCDSGFGRLLALQLDQRGFRVLASCLTPSGAEDLQRVASSRLHTTLLDITDPQSVQQAAKWVEMHVKEAGLFGLVNNAGVAGIIGPTPWLTRDDFQRVLNVNTMGPIGVTLALLPLLQQARGRVINITSVLGRLAANGGGYCVSKFGLEAFSDSLRRDVAHFGIRVSIVEPGFFRTPVTNLESLEKTLQACWARLPPATQAHYGGAFLTKYLKMQQRIMNLICDPDLTKVSRCLEHALTARHPRTRYSPGWDAKLLWLPASYLPASLVDAVLTWVLPKPAQAVY,318,NP_002896.2.csv,RDH5_HUMAN_b07_clinical_seed_0_final,RDH5_HUMAN_b07.a2m,EVE,RDH5_HUMAN_b07_theta_0.2.npy,1,318,318
+NP_002920.1,MDFGSLETVVANSAFIAARGSFDGSSSQPSRDKKYLAKLKLPPLSKCESLRDSLSLEFESVCLEQPIGKKLFQQFLQSAEKHLPALELWKDIEDYDTADNDLQPQKAQTILAQYLDPQAKLFCSFLDEGIVAKFKEGPVEIQDGLFQPLLQATLAHLGQAPFQEYLGSLYFLRFLQWKWLEAQPMGEDWFLDFRVLGKGGFGEVSACQMKATGKLYACKKLNKKRLKKRKGYQGAMVEKKILMKVHSRFIVSLAYAFETKADLCLVMTIMNGGDIRYHIYNVNEENPGFPEPRALFYTAQIICGLEHLHQRRIVYRDLKPENVLLDNDGNVRISDLGLAVELLDGQSKTKGYAGTPGFMAPELLQGEEYDFSVDYFALGVTLYEMIAARGPFRARGEKVENKELKHRIISEPVKYPDKFSQASKDFCEALLEKDPEKRLGFRDETCDKLRAHPLFKDLNWRQLEAGMLMPPFIPDSKTVYAKDIQDVGAFSTVKGVAFDKTDTEFFQEFATGNCPIPWQEEMIETGIFGELNVWRSDGQMPDDMKGISGGSSSSSKSGMCLVS,563,NP_002920.1.csv,refseq-GRK1-NM_002929.3_clinical_seed_0_final,refseq-GRK1-NM_002929.3.a2m,Invitae,refseq-GRK1-NM_002929.3.npy,1,563,563
+NP_002927.2,MSWLLFLAHRVALAALPCRRGSRGFGMFYAVRRGRKTGVFLTWNECRAQVDRFPAARFKKFATEDEAWAFVRKSASPEVSEGHENQHGQESEAKASKRLREPLDGDGHESAEPYAKHMKPSVEPAPPVSRDTFSYMGDFVVVYTDGCCSSNGRRRPRAGIGVYWGPGHPLNVGIRLPGRQTNQRAEIHAACKAIEQAKTQNINKLVLYTDSMFTINGITNWVQGWKKNGWKTSAGKEVINKEDFVALERLTQGMDIQWMHVPGHSGFIGNEEADRLAREGAKQSED,286,NP_002927.2.csv,refseq-RNASEH1-NM_002936.4_clinical_seed_0_final,refseq-RNASEH1-NM_002936.4.a2m,Invitae,refseq-RNASEH1-NM_002936.4.npy,1,286,286
+NP_002932.1,MKWKHVPFLVMISLLSLSPNHLFLAQLIPDPEDVERGNDHGTPIPTSDNDDNSLGYTGSRLRQEDFPPRIVEHPSDLIVSKGEPATLNCKAEGRPTPTIEWYKGGERVETDKDDPRSHRMLLPSGSLFFLRIVHGRKSRPDEGVYVCVARNYLGEAVSHNASLEVAILRDDFRQNPSDVMVAVGEPAVMECQPPRGHPEPTISWKKDGSPLDDKDERITIRGGKLMITYTRKSDAGKYVCVGTNMVGERESEVAELTVLERPSFVKRPSNLAVTVDDSAEFKCEARGDPVPTVRWRKDDGELPKSRYEIRDDHTLKIRKVTAGDMGSYTCVAENMVGKAEASATLTVQEPPHFVVKPRDQVVALGRTVTFQCEATGNPQPAIFWRREGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDVGYYICQTLNVAGSIITKAYLEVTDVIADRPPPVIRQGPVNQTVAVDGTFVLSCVATGSPVPTILWRKDGVLVSTQDSRIKQLENGVLQIRYAKLGDTGRYTCIASTPSGEATWSAYIEVQEFGVPVQPPRPTDPNLIPSAPSKPEVTDVSRNTVTLSWQPNLNSGATPTSYIIEAFSHASGSSWQTVAENVKTETSAIKGLKPNAIYLFLVRAANAYGISDPSQISDPVKTQDVLPTSQGVDHKQVQRELGNAVLHLHNPTVLSSSSIEVHWTVDQQSQYIQGYKILYRPSGANHGESDWLVFEVRTPAKNSVVIPDLRKGVNYEIKARPFFNEFQGADSEIKFAKTLEEAPSAPPQGVTVSKNDGNGTAILVSWQPPPEDTQNGMVQEYKVWCLGNETRYHINKTVDGSTFSVVIPFLVPGIRYSVEVAASTGAGSGVKSEPQFIQLDAHGNPVSPEDQVSLAQQISDVVKQPAFIAGIGAACWIILMVFSIWLYRHRKKRNGLTSTYAGIRKVPSFTFTPTVTYQRGGEAVSSGGRPGLLNISEPAAQPWLADTWPNTGNNHNDCSISCCTAGNGNSDSNLTTYSRPADCIANYNNQLDNKQTNLMLPESTVYGDVDLSNKINEMKTFNSPNLKDGRFVNPSGQPTPYATTQLIQSNLSNNMNNGSGDSGEKHWKPLGQQKQEVAPVQYNIVEQNKLNKDYRANDTVPPTIPYNQSYDQNTGGSYNSSDRGSSTSGSQGHKKGARTPKVPKQGGMNWADLLPPPPAHPPPHSNSEEYNISVDESYDQEMPCPVPPARMYLQQDELEEEEDERGPTPPVRGAASSPAAVSYSHQSTATLTPSPQEELQPMLQDCPEETGHMQHQPDRRRQPVSPPPPPRPISPPHTYGYISGPLVSDMDTDAPEEEEDEADMEVAKMQTRRLLLRGLEQTPASSVGDLESSVTGSMINGWGSASEEDNISSGRSSVSSSDGSFFTDADFAQAVAAAAEYAGLKVARRQMQDAAGRRHFHASQCPRPTSPVSTDSNMSAAVMQKTRPAKKLKHQPGHLRRETYTDDLPPPPVPPPAIKSPTAQSKTQLEVRPVVVPKLPSMDARTDRSSDRKGSSYKGREVLDGRQVVDMRTNPGDPREAQEQQNDGKGRGNKAAKRDLPPAKTHLIQEDILPYCRPTFPTSNNPRDPSSSSSMSSRGSGSRQREQANVGRRNIAEMQVLGGYERGEDNNEELEETES,1651,NP_002932.1.csv,refseq-ROBO1-NM_002941.3_clinical_seed_0_final,refseq-ROBO1-NM_002941.3.a2m,Invitae,refseq-ROBO1-NM_002941.3.npy,1,1651,1651
+NP_002959.2,MSRRKQAKPQHFQSDPEVASLPRRDGDTEKGQPSRPTKSKDAHVCGRCCAEFFELSDLLLHKKNCTKNQLVLIVNENPASPPETFSPSPPPDNPDEQMNDTVNKTDQVDCSDLSEHNGLDREESMEVEAPVANKSGSGTSSGSHSSTAPSSSSSSSSSSGGGGSSSTGTSAITTSLPQLGDLTTLGNFSVINSNVIIENLQSTKVAVAQFSQEARCGGASGGKLAVPALMEQLLALQQQQIHQLQLIEQIRHQILLLASQNADLPTSSSPSQGTLRTSANPLSTLSSHLSQQLAAAAGLAQSLASQSASISGVKQLPPIQLPQSSSGNTIIPSNSGSSPNMNILAAAVTTPSSEKVASSAGASHVSNPAVSSSSSPAFAISSLLSPASNPLLPQQASANSVFPSPLPNIGTTAEDLNSLSALAQQRKSKPPNVTAFEAKSTSDEAFFKHKCRFCAKVFGSDSALQIHLRSHTGERPFKCNICGNRFSTKGNLKVHFQRHKEKYPHIQMNPYPVPEHLDNIPTSTGIPYGMSIPPEKPVTSWLDTKPVLPTLTTSVGLPLPPTLPSLIPFIKTEEPAPIPISHSATSPPGSVKSDSGGPESATRNLGGLPEEAEGSTLPPSGGKSEESGMVTNSVPTASSSVLSSPAADCGPAGSATTFTNPLLPLMSEQFKAKFPFGGLLDSAQASETSKLQQLVENIDKKATDPNECIICHRVLSCQSALKMHYRTHTGERPFKCKICGRAFTTKGNLKTHYSVHRAMPPLRVQHSCPICQKKFTNAVVLQQHIRMHMGGQIPNTPVPDSYSESMESDTGSFDEKNFDDLDNFSDENMEDCPEGSIPDTPKSADASQDSLSSSPLPLEMSSIAALENQMKMINAGLAEQLQASLKSVENGSIEGDVLTNDSSSVGGDMESQSAGSPAISESTSSMQALSPSNSTQEFHKSPSIEEKPQRAVPSEFANGLSPTPVNGGALDLTSSHAEKIIKEDSLGILFPFRDRGKFKNTACDICGKTFACQSALDIHYRSHTKERPFICTVCNRGFSTKGNLKQHMLTHQMRDLPSQLFEPSSNLGPNQNSAVIPANSLSSLIKTEVNGFVHVSPQDSKDTPTSHVPSGPLSSSATSPVLLPALPRRTPKQHYCNTCGKTFSSSSALQIHERTHTGEKPFACTICGRAFTTKGNLKVHMGTHMWNSTPARRGRRLSVDGPMTFLGGNPVKFPEMFQKDLAARSGSGDPSSFWNQYAAALSNGLAMKANEISVIQNGGIPPIPGSLGSGNSSPVSGLTGNLERLQNSEPNAPLAGLEKMASSENGTNFRFTRFVEDSKEIVTS,1324,NP_002959.2.csv,refseq-SALL1-NM_002968.2_clinical_seed_0_final,refseq-SALL1-NM_002968.2.a2m,Invitae,refseq-SALL1-NM_002968.2.npy,1,1324,1324
+NP_002963.2,MARLADYFVLVAFGPHPRGSGEGQGQILQRFPEKDWEDNPFPQGIELFCQPSGWQLCPERNPPTFFVAVLTDINSERHYCACLTFWEPAEPSQETTRVEDATEREEEGDEGGQTHLSPTAPAPSAQLFAPKTLVLVSRLDHTEVFRNSLGLIYAIHVEGLNVCLENVIGNLLTCTVPLAGGSQRTISLGAGDRQVIQTPLADSLPVSRCSVALLFRQLGITNVLSLFCAALTEHKVLFLSRSYQRLADACRGLLALLFPLRYSFTYVPILPAQLLEVLSTPTPFIIGVNAAFQAETQELLDVIVADLDGGTVTIPECVHIPPLPEPLQSQTHSVLSMVLDPELELADLAFPPPTTSTSSLKMQDKELRAVFLRLFAQLLQGYRWCLHVVRIHPEPVIRFHKAAFLGQRGLVEDDFLMKVLEGMAFAGFVSERGVPYRPTDLFDELVAHEVARMRADENHPQRVLRHVQELAEQLYKNENPYPAVAMHKVQRPGESSHLRRVPRPFPRLDEGTVQWIVDQAAAKMQGAPPAVKAERRTTVPSGPPMTAILERCSGLHVNSARRLEVVRNCISYVFEGKMLEAKKLLPAVLRALKGRAARRCLAQELHLHVQQNRAVLDHQQFDFVVRMMNCCLQDCTSLDEHGIAAALLPLVTAFCRKLSPGVTQFAYSCVQEHVVWSTPQFWEAMFYGDVQTHIRALYLEPTEDLAPAQEVGEAPSQEDERSALDVASEQRRLWPTLSREKQQELVQKEESTVFSQAIHYANRMSYLLLPLDSSKSRLLRERAGLGDLESASNSLVTNSMAGSVAESYDTESGFEDAETCDVAGAVVRFINRFVDKVCTESGVTSDHLKGLHVMVPDIVQMHIETLEAVQRESRRLPPIQKPKLLRPRLLPGEECVLDGLRVYLLPDGREEGAGGSAGGPALLPAEGAVFLTTYRVIFTGMPTDPLVGEQVVVRSFPVAALTKEKRISVQTPVDQLLQDGLQLRSCTFQLLKMAFDEEVGSDSAELFRKQLHKLRYPPDIRATFAFTLGSAHTPGRPPRVTKDKGPSLRTLSRNLVKNAKKTIGRQHVTRKKYNPPSWEHRGQPPPEDQEDEISVSEELEPSTLTPSSALKPSDRMTMSSLVERACCRDYQRLGLGTLSSSLSRAKSEPFRISPVNRMYAICRSYPGLLIVPQSVQDNALQRVSRCYRQNRFPVVCWRSGRSKAVLLRSGGLHGKGVVGLFKAQNAPSPGQSQADSSSLEQEKYLQAVVSSMPRYADASGRNTLSGFSSAHMGSHVPSPRARVTTLSNPMAASASRRTAPRGKWGSVRTSGRSSGLGTDVGSRLAGRDALAPPQANGGPPDPGFLRPQRAALYILGDKAQLKGVRSDPLQQWELVPIEVFEARQVKASFKKLLKACVPGCPAAEPSPASFLRSLEDSEWLIQIHKLLQVSVLVVELLDSGSSVLVGLEDGWDITTQVVSLVQLLSDPFYRTLEGFRLLVEKEWLSFGHRFSHRGAHTLAGQSSGFTPVFLQFLDCVHQVHLQFPMEFEFSQFYLKFLGYHHVSRRFRTFLLDSDYERIELGLLYEEKGERRGQVPCRSVWEYVDRLSKRTPVFHNYMYAPEDAEVLRPYSNVSNLKVWDFYTEETLAEGPPYDWELAQGPPEPPEEERSDGGAPQSRRRVVWPCYDSCPRAQPDAISRLLEELQRLETELGQPAERWKDTWDRVKAAQRLEGRPDGRGTPSSLLVSTAPHHRRSLGVYLQEGPVGSTLSLSLDSDQSSGSTTSGSRQAARRSTSTLYSQFQTAESENRSYEGTLYKKGAFMKPWKARWFVLDKTKHQLRYYDHRVDTECKGVIDLAEVEAVAPGTPTMGAPKTVDEKAFFDVKTTRRVYNFCAQDVPSAQQWVDRIQSCLSDA,1893,NP_002963.2.csv,refseq-SBF1-NM_002972.3_clinical_seed_0_final,refseq-SBF1-NM_002972.3.a2m,Invitae,refseq-SBF1-NM_002972.3.npy,1,1893,1893
+NP_002991.2,MAAVVALSLRRRLPATTLGGACLQASRGAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQTYEVDLNKCGPMVLDALIKIKNEVDSTLTFRRSCREGICGSCAMNINGGNTLACTRRIDTNLNKVSKIYPLPHMYVIKDLVPDLSNFYAQYKSIEPYLKKKDESQEGKQQYLQSIEEREKLDGLYECILCACCSTSCPSYWWNGDKYLGPAVLMQAYRWMIDSRDDFTEERLAKLQDPFSLYRCHTIMNCTRTCPKGLNPGKAIAEIKKMMATYKEKKASV,280,NP_002991.2.csv,refseq-SDHB-NM_003000.2_clinical_seed_0_final,refseq-SDHB-NM_003000.2.a2m,Invitae,refseq-SDHB-NM_003000.2.npy,1,280,280
+NP_002992.1,MAALLLRHVGRHCLRAHFSPQLCIRNAVPLGTTAKEEMERFWNKNIGSNRPLSPHITIYSWSLPMAMSICHRGTGIALSAGVSLFGMSALLLPGNFESYLELVKSLCLGPALIHTAKFALVFPLMYHTWNGIRHLMWDLGKGLKIPQLYQSGVVVLVLTVLSSMGLAAM,169,NP_002992.1.csv,refseq-SDHC-NM_003001.3_clinical_seed_0_final,refseq-SDHC-NM_003001.3.a2m,Invitae,refseq-SDHC-NM_003001.3.npy,1,169,169
+NP_002993.1,MAVLWRLSAVCGALGGRALLLRTPVVRPAHISAFLQDRPIPEWCGVQHIHLSPSHHSGSKAASLHWTSERVVSVLLLGLLPAAYLNPCSAMDYSLAAALTLHGHWGLGQVVTDYVHGDALQKAAKAGLLALSALTFAGLCYFNYHDVGICKAVAMLWKL,159,NP_002993.1.csv,refseq-SDHD-NM_003002.3_clinical_seed_0_final,refseq-SDHD-NM_003002.3.a2m,Invitae,refseq-SDHD-NM_003002.3.npy,1,159,159
+NP_003005.2,MFLSILVALCLWLHLALGVRGAPCEAVRIPMCRHMPWNITRMPNHLHHSTQENAILAIEQYEELVDVNCSAVLRFFLCAMYAPICTLEFLHDPIKPCKSVCQRARDDCEPLMKMYNHSWPESLACDELPVYDRGVCISPEAIVTDLPEDVKWIDITPDMMVQERPLDVDCKRLSPDRCKCKKVKPTLATYLSKNYSYVIHAKIKAVQRSGCNEVTTVVDVKEIFKSSSPIPRTQVPLITNSSCQCPHILPHQDVLIMCYEWRSRMMLLENCLVEKWRDQLSKRSIQWEERLQEQRRTVQDKKKTAGRTSRSNPPKPKGKPPAPKPASPKKNIKTRSAQKRTNPKRV,346,NP_003005.2.csv,refseq-SFRP4-NM_003014.3_clinical_seed_0_final,refseq-SFRP4-NM_003014.3.a2m,Invitae,refseq-SFRP4-NM_003014.3.npy,1,346,346
+NP_003009.2,MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHMSQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAFLGMAVSTLCGEVPLYYI,197,NP_003009.2.csv,refseq-SFTPC-NM_003018.3_clinical_seed_0_final,refseq-SFTPC-NM_003018.3.a2m,Invitae,refseq-SFTPC-NM_003018.3.npy,1,197,197
+NP_003014.3,MAAEEMHWPVPMKAIGAQNLLTMPGGVAKAGYLHKKGGTQLQLLKWPLRFVIIHKRCVYYFKSSTSASPQGAFSLSGYNRVMRAAEETTSNNVFPFKIIHISKKHRTWFFSASSEEERKSWMALLRREIGHFHEKKDLPLDTSDSSSDTDSFYGAVERPVDISLSPYPTDNEDYEHDDEDDSYLEPDSPEPGRLEDALMHPPAYPPPPVPTPRKPAFSDMPRAHSFTSKGPGPLLPPPPPKHGLPDVGLAAEDSKRDPLCPRRAEPCPRVPATPRRMSDPPLSTMPTAPGLRKPPCFRESASPSPEPWTPGHGACSTSSAAIMATATSRNCDKLKSFHLSPRGPPTSEPPPVPANKPKFLKIAEEDPPREAAMPGLFVPPVAPRPPALKLPVPEAMARPAVLPRPEKPQLPHLQRSPPDGQSFRSFSFEKPRQPSQADTGGDDSDEDYEKVPLPNSVFVNTTESCEVERLFKATSPRGEPQDGLYCIRNSSTKSGKVLVVWDETSNKVRNYRIFEKDSKFYLEGEVLFVSVGSMVEHYHTHVLPSHQSLLLRHPYGYTGPR,561,NP_003014.3.csv,refseq-SH3BP2-NM_003023.4_clinical_seed_0_final,refseq-SH3BP2-NM_003023.4.a2m,Invitae,refseq-SH3BP2-NM_003023.4.npy,1,561,561
+NP_003026.2,MEPIYPFARPQMNTRFPSSRMVPFHFPPSKCALWNPTPTGDFIYLHLSYYRNPKLVVTEKTIRLAYRHAKQNKKNSSCFLLGSLTADEDEEGVTLTVDRFDPGREVPECLEITPTASLPGDFLIPCKVHTQELCSREMIVHSVDDFSSALKALQCHICSKDSLDCGKLLSLRVHITSRESLDSVEFDLHWAAVTLANNFKCTPVKPIPIIPTALARNLSSNLNISQVQGTYKYGYLTMDETRKLLLLLESDPKVYSLPLVGIWLSGITHIYSPQVWACCLRYIFNSSVQERVFSESGNFIIVLYSMTHKEPEFYECFPCDGKIPDFRFQLLTSKETLHLFKNVEPPDKNPIRCELSAESQNAETEFFSKASKNFSIKRSSQKLSSGKMPIHDHDSGVEDEDFSPRPIPSPHPVSQKISKIQPSVPELSLVLDGNFIESNPLPTPLEMVNNENPPLINHLEHLKPLQPQLYDEKHSPEVEAGEPSLRGIPNQLNQDKPALLRHCKVRQPPAYKKGNPHTRNSIKPSSHNGPSHDIFEKLQTVSAGNVQNEEYPIRPSTLNSRQSSLAPQSQPHDFVFSPHNSGRPMELQIPTPPLPSYCSTNVCRCCQHHSHIQYSPLNSWQGANTVGSIQDVQSEALQKHSLFHPSGCPALYCNAFCSSSSPIALRPQGDMGSCSPHSNIEPSPVARPPSHMDLCNPQPCTVCMHTPKTESDNGMMGLSPDAYRFLTEQDRQLRLLQAQIQRLLEAQSLMPCSPKTTAVEDTVQAGRQMELVSVEAQSSPGLHMRKGVSIAVSTGASLFWNAAGEDQEPDSQMKQDDTKISSEDMNFSVDINNEVTSLPGSASSLKAVDIPSFEESNIAVEEEFNQPLSVSNSSLVVRKEPDVPVFFPSGQLAESVSMCLQTGPTGGASNNSETSEEPKIEHVMQPLLHQPSDNQKIYQDLLGQVNHLLNSSSKETEQPSTKAVIISHECTRTQNVYHTKKKTHHSRLVDKDCVLNATLKQLRSLGVKIDSPTKVKKNAHNVDHASVLACISPEAVISGLNCMSFANVGMSGLSPNGVDLSMEANAIALKYLNENQLSQLSVTRSNQNNCDPFSLLHINTDRSTVGLSLISPNNMSFATKKYMKRYGLLQSSDNSEDEEEPPDNADSKSEYLLNQNLRSIPEQLGGQKEPSKNDHEIINCSNCESVGTNADTPVLRNITNEVLQTKAKQQLTEKPAFLVKNLKPSPAVNLRTGKAEFTQHPEKENEGDITIFPESLQPSETLKQMNSMNSVGTFLDVKRLRQLPKLF,1287,NP_003026.2.csv,refseq-STIL-NM_003035.2_clinical_seed_0_final,refseq-STIL-NM_003035.2.a2m,Invitae,refseq-STIL-NM_003035.2.npy,1,1287,1287
+NP_003027.1,MEAAAGGRGCFQPHPGLQKTLEQFHLSSMSSLGGPAAFSARWAQEAYKKESAKEAGAAAVPAPVPAATEPPPVLHLPAIQPPPPVLPGPFFMPSDRSTERCETVLEGETISCFVVGGEKRLCLPQILNSVLRDFSLQQINAVCDELHIYCSRCTADQLEILKVMGILPFSAPSCGLITKTDAERLCNALLYGGAYPPPCKKELAASLALGLELSERSVRVYHECFGKCKGLLVPELYSSPSAACIQCLDCRLMYPPHKFVVHSHKALENRTCHWGFDSANWRAYILLSQDYTGKEEQARLGRCLDDVKEKFDYGNKYKRRVPRVSSEPPASIRPKTDDTSSQSPAPSEKDKPSSWLRTLAGSSNKSLGCVHPRQRLSAFRPWSPAVSASEKELSPHLPALIRDSFYSYKSFETAVAPNVALAPPAQQKVVSSPPCAAAVSRAPEPLATCTQPRKRKLTVDTPGAPETLAPVAAPEEDKDSEAEVEVESREEFTSSLSSLSSPSFTSSSSAKDLGSPGARALPSAVPDAAAPADAPSGLEAELEHLRQALEGGLDTKEAKEKFLHEVVKMRVKQEEKLSAALQAKRSLHQELEFLRVAKKEKLREATEAKRNLRKEIERLRAENEKKMKEANESRLRLKRELEQARQARVCDKGCEAGRLRAKYSAQIEDLQVKLQHAEADREQLRADLLREREAREHLEKVVKELQEQLWPRARPEAAGSEGAAELEP,728,NP_003027.1.csv,refseq-SKI-NM_003036.3_clinical_seed_0_final,refseq-SKI-NM_003036.3.a2m,Invitae,refseq-SKI-NM_003036.3.npy,1,728,728
+NP_003029.2,MEKSNETNGYLDSAQAGPAAGPGAPGTAAGRARRCAGFLRRQALVLLTVSGVLAGAGLGAALRGLSLSRTQVTYLAFPGEMLLRMLRMIILPLVVCSLVSGAASLDASCLGRLGGIAVAYFGLTTLSASALAVALAFIIKPGSGAQTLQSSDLGLEDSGPPPVPKETVDSFLDLARNLFPSNLVVAAFRTYATDYKVVTQNSSSGNVTHEKIPIGTEIEGMNILGLVLFALVLGVALKKLGSEGEDLIRFFNSLNEATMVLVSWIMWYVPVGIMFLVGSKIVEMKDIIVLVTSLGKYIFASILGHVIHGGIVLPLIYFVFTRKNPFRFLLGLLAPFATAFATCSSSATLPSMMKCIEENNGVDKRISRFILPIGATVNMDGAAIFQCVAAVFIAQLNNVELNAGQIFTILVTATASSVGAAGVPAGGVLTIAIILEAIGLPTHDLPLILAVDWIVDRTTTVVNVEGDALGAGILHHLNQKATKKGEQELAEVKVEAIPNCKSEEETSPLVTHQNPAGPVASAPELESKESVL,532,NP_003029.2.csv,refseq-SLC1A4-NM_003038.4_clinical_seed_0_final,refseq-SLC1A4-NM_003038.4.a2m,Invitae,refseq-SLC1A4-NM_003038.4.npy,1,532,532
+NP_003032.1,MEEHTEAGSAPEMGAQKALIDNPADILVIAAYFLLVIGVGLWSMCRTNRGTVGGYFLAGRSMVWWPVGASLFASNIGSGHFVGLAGTGAASGLAVAGFEWNALFVVLLLGWLFAPVYLTAGVITMPQYLRKRFGGRRIRLYLSVLSLFLYIFTKISVDMFSGAVFIQQALGWNIYASVIALLGITMIYTVTGGLAALMYTDTVQTFVILGGACILMGYAFHEVGGYSGLFDKYLGAATSLTVSEDPAVGNISSFCYRPRPDSYHLLRHPVTGDLPWPALLLGLTIVSGWYWCSDQVIVQRCLAGKSLTHIKAGCILCGYLKLTPMFLMVMPGMISRILYPDEVACVVPEVCRRVCGTEVGCSNIAYPRLVVKLMPNGLRGLMLAVMLAALMSSLASIFNSSSTLFTMDIYTRLRPRAGDRELLLVGRLWVVFIVVVSVAWLPVVQAAQGGQLFDYIQAVSSYLAPPVSAVFVLALFVPRVNEQGAFWGLIGGLLMGLARLIPEFSFGSGSCVQPSACPAFLCGVHYLYFAIVLFFCSGLLTLTVSLCTAPIPRKHLHRLVFSLRHSKEEREDLDADEQQGSSLPVQNGCPESAMEMNEPQAPAPSLFRQCLLWFCGMSRGGVGSPPPLTQEEAAAAARRLEDISEDPSWARVVNLNALLMMAVAVFLWGFYA,672,NP_003032.1.csv,refseq-SLC5A2-NM_003041.3_clinical_seed_0_final,refseq-SLC5A2-NM_003041.3.a2m,Invitae,refseq-SLC5A2-NM_003041.3.npy,1,672,672
+NP_003033.3,MATNGSKVADGQISTEVSEAPVANDKPKTLVVKVQKKAADLPDRDTWKGRFDFLMSCVGYAIGLGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLFLLECSLGQYTSIGGLGVWKLAPMFKGVGLAAAVLSFWLNIYYIVIISWAIYYLYNSFTTTLPWKQCDNPWNTDRCFSNYSMVNTTNMTSAVVEFWERNMHQMTDGLDKPGQIRWPLAITLAIAWILVYFCIWKGVGWTGKVVYFSATYPYIMLIILFFRGVTLPGAKEGILFYITPNFRKLSDSEVWLDAATQIFFSYGLGLGSLIALGSYNSFHNNVYRDSIIVCCINSCTSMFAGFVIFSIVGFMAHVTKRSIADVAASGPGLAFLAYPEAVTQLPISPLWAILFFSMLLMLGIDSQFCTVEGFITALVDEYPRLLRNRRELFIAAVCIISYLIGLSNITQGGIYVFKLFDYYSASGMSLLFLVFFECVSISWFYGVNRFYDNIQEMVGSRPCIWWKLCWSFFTPIIVAGVFIFSAVQMTPLTMGNYVFPKWGQGVGWLMALSSMVLIPGYMAYMFLTLKGSLKQRIQVMVQPSEDIVRPENGPEQPQAGSSTSKEAYI,599,NP_003033.3.csv,refseq-SLC6A1-NM_003042.3_clinical_seed_0_final,refseq-SLC6A1-NM_003042.3.a2m,Invitae,refseq-SLC6A1-NM_003042.3.npy,1,599,599
+NP_003038.2,MVLRSGICGLSPHRIFPSLLVVVALVGLLPVLRSHGLQLSPTASTIRSSEPPRERSIGDVTTAPPEVTPESRPVNHSVTDHGMKPRKAFPVLGIDYTHVRTPFEISLWILLACLMKIGFHVIPTISSIVPESCLLIVVGLLVGGLIKGVGETPPFLQSDVFFLFLLPPIILDAGYFLPLRQFTENLGTILIFAVVGTLWNAFFLGGLMYAVCLVGGEQINNIGLLDNLLFGSIISAVDPVAVLAVFEEIHINELLHILVFGESLLNDAVTVVLYHLFEEFANYEHVGIVDIFLGFLSFFVVALGGVLVGVVYGVIAAFTSRFTSHIRVIEPLFVFLYSYMAYLSAELFHLSGIMALIASGVVMRPYVEANISHKSHTTIKYFLKMWSSVSETLIFIFLGVSTVAGSHHWNWTFVISTLLFCLIARVLGVLGLTWFINKFRIVKLTPKDQFIIAYGGLRGAIAFSLGYLLDKKHFPMCDLFLTAIITVIFFTVFVQGMTIRPLVDLLAVKKKQETKRSINEEIHTQFLDHLLTGIEDICGHYGHHHWKDKLNRFNKKYVKKCLIAGERSKEPQLIAFYHKMEMKQAIELVESGGMGKIPSAVSTVSMQNIHPKSLPSERILPALSKDKEEEIRKILRNNLQKTRQRLRSYNRHTLVADPYEEAWNQMLLRRQKARQLEQKINNYLTVPAHKLDSPTMSRARIGSDPLAYEPKEDLPVITIDPASPQSPESVDLVNEELKGKVLGLSRDPAKVAEEDEDDDGGIMMRSKETSSPGTDDVFTPAPSDSPSSQRIQRCLSDPGPHPEPGEGEPFFPKGQ,815,NP_003038.2.csv,refseq-SLC9A1-NM_003047.4_clinical_seed_0_final,refseq-SLC9A1-NM_003047.4.a2m,Invitae,refseq-SLC9A1-NM_003047.4.npy,1,815,815
+NP_003043.3,MLSYGERLGSPAVSPLPVRGGHVMRGTAFAYVPSPQVLHRIPGTSAYAFPSLGPVALAEHTCPCGEVLERHEPLPAKLALEEEQKPESRLVPKLRQAGAMLLKVPLMLTFLYLFVCSLDMLSSAFQLAGGKVAGDIFKDNAILSNPVAGLVVGILVTVLVQSSSTSTSIIVSMVSSGLLEVSSAIPIIMGSNIGTSVTNTIVALMQAGDRTDFRRAFAGATVHDCFNWLSVLVLLPLEAATGYLHHITRLVVASFNIHGGRDAPDLLKIITEPFTKLIIQLDESVITSIATGDESLRNHSLIQIWCHPDSLQAPTSMSRAEANSSQTLGNATMEKCNHIFVDTGLPDLAVGLILLAGSLVLLCTCLILLVKMLNSLLKGQVAKVIQKVINTDFPAPFTWVTGYFAMVVGASMTFVVQSSSVFTSAITPLIGLGVISIERAYPLTLGSNIGTTTTAILAALASPREKLSSAFQIALCHFFFNISGILLWYPVPCTRLPIRMAKALGKRTAKYRWFAVLYLLVCFLLLPSLVFGISMAGWQVMVGVGTPFGALLAFVVLINVLQSRSPGHLPKWLQTWDFLPRWMHSLKPLDHLITRATLCCARPEPRSPPLPPRVFLEELPPATPSPRLALPAHHNATRL,639,NP_003043.3.csv,refseq-SLC34A1-NM_003052.4_clinical_seed_0_final,refseq-SLC34A1-NM_003052.4.a2m,Invitae,refseq-SLC34A1-NM_003052.4.npy,1,639,639
+NP_003045.2,MALSELALVRWLQESRRSRKLILFIVFLALLLDNMLLTVVVPIIPSYLYSIKHEKNATEIQTARPVHTASISDSFQSIFSYYDNSTMVTGNATRDLTLHQTATQHMVTNASAVPSDCPSEDKDLLNENVQVGLLFASKATVQLITNPFIGLLTNRIGYPIPIFAGFCIMFVSTIMFAFSSSYAFLLIARSLQGIGSSCSSVAGMGMLASVYTDDEERGNVMGIALGGLAMGVLVGPPFGSVLYEFVGKTAPFLVLAALVLLDGAIQLFVLQPSRVQPESQKGTPLTTLLKDPYILIAAGSICFANMGIAMLEPALPIWMMETMCSRKWQLGVAFLPASISYLIGTNIFGILAHKMGRWLCALLGMIIVGVSILCIPFAKNIYGLIAPNFGVGFAIGMVDSSMMPIMGYLVDLRHVSVYGSVYAIADVAFCMGYAIGPSAGGAIAKAIGFPWLMTIIGIIDILFAPLCFFLRSPPAKEEKMAILMDHNCPIKTKMYTQNNIQSYPIGEDEESESD,514,NP_003045.2.csv,refseq-SLC18A2-NM_003054.4_clinical_seed_0_final,refseq-SLC18A2-NM_003054.4.a2m,Invitae,refseq-SLC18A2-NM_003054.4.npy,1,514,514
+NP_003046.2,MESAEPAGQARAAATKLSEAVGAALQEPRRQRRLVLVIVCVALLLDNMLYMVIVPIVPDYIAHMRGGGEGPTRTPEVWEPTLPLPTPANASAYTANTSASPTAAWPAGSALRPRYPTESEDVKIGVLFASKAILQLLVNPLSGPFIDRMSYDVPLLIGLGVMFASTVLFAFAEDYATLFAARSLQGLGSAFADTSGIAMIADKYPEEPERSRALGVALAFISFGSLVAPPFGGILYEFAGKRVPFLVLAAVSLFDALLLLAVAKPFSAAARARANLPVGTPIHRLMLDPYIAVVAGALTTCNIPLAFLEPTIATWMKHTMAASEWEMGMAWLPAFVPHVLGVYLTVRLAARYPHLQWLYGALGLAVIGASSCIVPACRSFAPLVVSLCGLCFGIALVDTALLPTLAFLVDVRHVSVYGSVYAIADISYSVAYALGPIVAGHIVHSLGFEQLSLGMGLANLLYAPVLLLLRNVGLLTRSRSERDVLLDEPPQGLYDAVRLRERPVSGQDGEPRSPPGPFDACEDDYNYYYTRS,532,NP_003046.2.csv,refseq-SLC18A3-NM_003055.2_clinical_seed_0_final,refseq-SLC18A3-NM_003055.2.a2m,Invitae,refseq-SLC18A3-NM_003055.2.npy,1,532,532
+NP_003051.1,MRDYDEVTAFLGEWGPFQRLIFFLLSASIIPNGFTGLSSVFLIATPEHRCRVPDAANLSSAWRNHTVPLRLRDGREVPHSCRRYRLATIANFSALGLEPGRDVDLGQLEQESCLDGWEFSQDVYLSTIVTEWNLVCEDDWKAPLTISLFFVGVLLGSFISGQLSDRFGRKNVLFVTMGMQTGFSFLQIFSKNFEMFVVLFVLVGMGQISNYVAAFVLGTEILGKSVRIIFSTLGVCIFYAFGYMVLPLFAYFIRDWRMLLVALTMPGVLCVALWWFIPESPRWLISQGRFEEAEVIIRKAAKANGIVVPSTIFDPSELQDLSSKKQQSHNILDLLRTWNIRMVTIMSIMLWMTISVGYFGLSLDTPNLHGDIFVNCFLSAMVEVPAYVLAWLLLQYLPRRYSMATALFLGGSVLLFMQLVPPDLYYLATVLVMVGKFGVTAAFSMVYVYTAELYPTVVRNMGVGVSSTASRLGSILSPYFVYLGAYDRFLPYILMGSLTILTAILTLFLPESFGTPLPDTIDQMLRVKGMKHRKTPSHTRMLKDGQERPTILKSTAF,557,NP_003051.1.csv,refseq-SLC22A5-NM_003060.3_clinical_seed_0_final,refseq-SLC22A5-NM_003060.3.a2m,Invitae,refseq-SLC22A5-NM_003060.3.npy,1,557,557
+NP_003061.3,MSTPTDPGAMPHPGPSPGPGPSPGPILGPSPGPGPSPGSVHSMMGPSPGPPSVSHPMPTMGSTDFPQEGMHQMHKPIDGIHDKGIVEDIHCGSMKGTGMRPPHPGMGPPQSPMDQHSQGYMSPHPSPLGAPEHVSSPMSGGGPTPPQMPPSQPGALIPGDPQAMSQPNRGPSPFSPVQLHQLRAQILAYKMLARGQPLPETLQLAVQGKRTLPGLQQQQQQQQQQQQQQQQQQQQQQQPQQQPPQPQTQQQQQPALVNYNRPSGPGPELSGPSTPQKLPVPAPGGRPSPAPPAAAQPPAAAVPGPSVPQPAPGQPSPVLQLQQKQSRISPIQKPQGLDPVEILQEREYRLQARIAHRIQELENLPGSLPPDLRTKATVELKALRLLNFQRQLRQEVVACMRRDTTLETALNSKAYKRSKRQTLREARMTEKLEKQQKIEQERKRRQKHQEYLNSILQHAKDFKEYHRSVAGKIQKLSKAVATWHANTEREQKKETERIEKERMRRLMAEDEEGYRKLIDQKKDRRLAYLLQQTDEYVANLTNLVWEHKQAQAAKEKKKRRRRKKKAEENAEGGESALGPDGEPIDESSQMSDLPVKVTHTETGKVLFGPEAPKASQLDAWLEMNPGYEVAPRSDSEESDSDYEEEDEEEESSRQETEEKILLDPNSEEVSEKDAKQIIETAKQDVDDEYSMQYSARGSQSYYTVAHAISERVEKQSALLINGTLKHYQLQGLEWMVSLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVVKISYKGTPAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTHYVAPRRILLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGERVDLNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEYVIKCDMSALQKILYRHMQAKGILLTDGSEKDKKGKGGAKTLMNTIMQLRKICNHPYMFQHIEESFAEHLGYSNGVINGAELYRASGKFELLDRILPKLRATNHRVLLFCQMTSLMTIMEDYFAFRNFLYLRLDGTTKSEDRAALLKKFNEPGSQYFIFLLSTRAGGLGLNLQAADTVVIFDSDWNPHQDLQAQDRAHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEENEEEDEVPDDETLNQMIARREEEFDLFMRMDMDRRREDARNPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKIFGRGSRQRRDVDYSDALTEKQWLRAIEDGNLEEMEEEVRLKKRKRRRNVDKDPAKEDVEKAKKRRGRPPAEKLSPNPPKLTKQMNAIIDTVINYKDRCNVEKVPSNSQLEIEGNSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSVFKSARQKIAKEEESEDESNEEEEEEDEEESESEAKSVKVKIKLNKKDDKGRDKGKGKKRPNRGKAKPVVSDFDSDEEQDEREQSEGSGTDDE,1590,NP_003061.3.csv,refseq-SMARCA2-NM_003070.4_clinical_seed_0_final,refseq-SMARCA2-NM_003070.4.a2m,Invitae,refseq-SMARCA2-NM_003070.4.npy,1,1590,1590
+NP_003064.2,MMMMALSKTFGQKPVKFQLEDDGEFYMIGSEVGNYLRMFRGSLYKRYPSLWRRLATVEERKKIVASSHGKKTKPNTKDHGYTTLATSVTLLKASEVEEILDGNDEKYKAVSISTEPPTYLREQKAKRNSQWVPTLPNSSHHLDAVPCSTTINRNRMGRDKKRTFPLCFDDHDPAVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDQRVIIKLNIHVGNISLVDQFEWDMSEKENSPEKFALKLCSELGLGGEFVTTIAYSIRGQLSWHQKTYAFSENPLPTVEIAIRNTGDADQWCPLLETLTDAEMEKKIRDQDRNTRRMRRLANTAPAW,385,NP_003064.2.csv,refseq-SMARCB1-NM_003073.3_clinical_seed_0_final,refseq-SMARCB1-NM_003073.3.a2m,Invitae,refseq-SMARCB1-NM_003073.3.npy,1,385,385
+NP_003066.2,MAVRKKDGGPNVKYYEAADTVTQFDNVRLWLGKNYKKYIQAEPPTNKSLSSLVVQLLQFQEEVFGKHVSNAPLTKLPIKCFLDFKAGGSLCHILAAAYKFKSDQGWRRYDFQNPSRMDRNVEMFMTIEKSLVQNNCLSRPNIFLCPEIEPKLLGKLKDIIKRHQGTVTEDKNNASHVVYPVPGNLEEEEWVRPVMKRDKQVLLHWGYYPDSYDTWIPASEIEASVEDAPTPEKPRKVHAKWILDTDTFNEWMNEEDYEVNDDKNPVSRRKKISAKTLTDEVNSPDSDRRDKKGGNYKKRKRSPSPSPTPEAKKKNAKKGPSTPYTKSKRGHREEEQEDLTKDMDEPSPVPNVEEVTLPKTVNTKKDSESAPVKGGTMTDLDEQEDESMETTGKDEDENSTGNKGEQTKNPDLHEDNVTEQTHHIIIPSYAAWFDYNSVHAIERRALPEFFNGKNKSKTPEIYLAYRNFMIDTYRLNPQEYLTSTACRRNLAGDVCAIMRVHAFLEQWGLINYQVDAESRPTPMGPPPTSHFHVLADTPSGLVPLQPKTPQQTSASQQMLNFPDKGKEKPTDMQNFGLRTDMYTKKNVPSKSKAAASATREWTEQETLLLLEALEMYKDDWNKVSEHVGSRTQDECILHFLRLPIEDPYLEDSEASLGPLAYQPIPFSQSGNPVMSTVAFLASVVDPRVASAAAKSALEEFSKMKEEVPTALVEAHVRKVEEAAKVTGKADPAFGLESSGIAGTTSDEPERIEESGNDEARVEGQATDEKKEPKEPREGGGAIEEEAKEKTSEAPKKDEEKGKEGDSEKESEKSDGDPIVDPEKEKEPKEGQEEVLKEVVESEGERKTKVERDIGEGNLSTAAAAALAAAAVKAKHLAAVEERKIKSLVALLVETQMKKLEIKLRHFEELETIMDREREALEYQRQQLLADRQAFHMEQLKYAEMRARQQHFQQMHQQQQQPPPALPPGSQPIPPTGAAGPPAVHGLAVAPASVVPAPAGSGAPPGSLGPSEQIGQAGSTAGPQQQQPAGAPQPGAVPPGVPPPGPHGPSPFPNQQTPPSMMPGAVPGSGHPGVAGNAPLGLPFGMPPPPPPPAPSIIPFGSLADSISINLPAPPNLHGHHHHLPFAPGTLPPPNLPVSMANPLHPNLPATTTMPSSLPLGPGLGSAAAQSPAIVAAVQGNLLPSASPLPDPGTPLPPDPTAPSPGTVTPVPPPQ,1214,NP_003066.2.csv,refseq-SMARCC2-NM_003075.4_clinical_seed_0_final,refseq-SMARCC2-NM_003075.4.a2m,Invitae,refseq-SMARCC2-NM_003075.4.npy,1,1214,1214
+NP_003070.3,MSKRPSYAPPPTPAPATQMPSTPGFVGYNPYSHLAYNNYRLGGNPGTNSRVTASSGITIPKPPKPPDKPLMPYMRYSRKVWDQVKASNPDLKLWEIGKIIGGMWRDLTDEEKQEYLNEYEAEKIEYNESMKAYHNSPAYLAYINAKSRAEAALEEESRQRQSRMEKGEPYMSIQPAEDPDDYDDGFSMKHTATARFQRNHRLISEILSESVVPDVRSVVTTARMQVLKRQVQSLMVHQRKLEAELLQIEERHQEKKRKFLESTDSFNNELKRLCGLKVEVDMEKIAAEIAQAEEQARKRQEEREKEAAEQAERSQSSIVPEEEQAANKGEEKKDDENIPMETEETHLEETTESQQNGEEGTSTPEDKESGQEGVDSMAEEGTSDSNTGSESNSATVEEPPTDPIPEDEKKE,411,NP_003070.3.csv,refseq-SMARCE1-NM_003079.4_clinical_seed_0_final,refseq-SMARCE1-NM_003079.4.a2m,Invitae,refseq-SMARCE1-NM_003079.4.npy,1,411,411
+NP_003089.1,MASGRRAPRTGLLELRAGAGSGAGGERWQRVLLSLAEDVLTVSPADGDPGPEPGAPREQEPAQLNGAAEPGAGPPQLPEALLLQRRRVTVRKADAGGLGISIKGGRENKMPILISKIFKGLAADQTEALFVGDAILSVNGEDLSSATHDEAVQVLKKTGKEVVLEVKYMKDVSPYFKNSTGGTSVGWDSPPASPLQRQPSSPGPTPRNFSEAKHMSLKMAYVSKRCTPNDPEPRYLEICSADGQDTLFLRAKDEASARSWATAIQAQVNTLTPRVKDELQALLAATSTAGSQDIKQIGWLTEQLPSGGTAPTLALLTEKELLLYLSLPETREALSRPARTAPLIATRLVHSGPSKGSVPYDAELSFALRTGTRHGVDTHLFSVESPQELAAWTRQLVDGCHRAAEGVQEVSTACTWNGRPCSLSVHIDKGFTLWAAEPGAARAVLLRQPFEKLQMSSDDGASLLFLDFGGAEGEIQLDLHSCPKTIVFIIHSFLSAKVTRLGLLA,505,NP_003089.1.csv,refseq-SNTA1-NM_003098.2_clinical_seed_0_final,refseq-SNTA1-NM_003098.2.a2m,Invitae,refseq-SNTA1-NM_003098.2.npy,1,505,505
+NP_003095.2,MAAAAKPNNLSLVVHGPGDLRLENYPIPEPGPNEVLLRMHSVGICGSDVHYWEYGRIGNFIVKKPMVLGHEASGTVEKVGSSVKHLKPGDRVAIEPGAPRENDEFCKMGRYNLSPSIFFCATPPDDGNLCRFYKHNAAFCYKLPDNVTFEEGALIEPLSVGIHACRRGGVTLGHKVLVCGAGPIGMVTLLVAKAMGAAQVVVTDLSATRLSKAKEIGADLVLQISKESPQEIARKVEGQLGCKPEVTIECTGAEASIQAGIYATRSGGNLVLVGLGSEMTTVPLLHAAIREVDIKGVFRYCNTWPVAISMLASKSVNVKPLVTHRFPLEKALEAFETFKKGLGLKIMLKCDPSDQNP,357,NP_003095.2.csv,refseq-SORD-NM_003104.5_clinical_seed_0_final,refseq-SORD-NM_003104.5.a2m,Invitae,refseq-SORD-NM_003104.5_theta_0.2.npy,1,357,357
+NP_003097.1,MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM,317,NP_003097.1.csv,refseq-SOX2-NM_003106.3_clinical_seed_0_final,refseq-SOX2-NM_003106.3.a2m,Invitae,refseq-SOX2-NM_003106.3.npy,1,317,317
+NP_003098.1,MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRGYASLRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGSFSSSSALDRDLDFNFEPGSGSHFEFPDYCTPEVSEMISGDWLESSISNLVFTY,474,NP_003098.1.csv,refseq-SOX4-NM_003107.2_clinical_seed_0_final,refseq-SOX4-NM_003107.2.a2m,Invitae,refseq-SOX4-NM_003107.2.npy,1,474,474
+NP_003099.1,MVQQAESLEAESNLPREALDTEEGEFMACSPVALDESDPDWCKTASGHIKRPMNAFMVWSKIERRKIMEQSPDMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYKYRPRKKPKMDPSAKPSASQSPEKSAAGGGGGSAGGGAGGAKTSKGSSKKCGKLKAPAAAGAKAGAGKAAQSGDYGGAGDDYVLGSLRVSGSGGGGAGKTVKCVFLDEDDDDDDDDDELQLQIKQEPDEEDEEPPHQQLLQPPGQQPSQLLRRYNVAKVPASPTLSSSAESPEGASLYDEVRAGATSGAGGGSRLYYSFKNITKQHPPPLAQPALSPASSRSVSTSSSSSSGSSSGSSGEDADDLMFDLSLNFSQSAHSASEQQLGGGAAAGNLSLSLVDKDLDSFSEGSLGSHFEFPDYCTPELSEMIAGDWLEANFSDLVFTY,441,NP_003099.1.csv,refseq-SOX11-NM_003108.3_clinical_seed_0_final,refseq-SOX11-NM_003108.3.a2m,Invitae,refseq-SOX11-NM_003108.3.npy,1,441,441
+NP_003110.1,MAVLLLLLRALRRGPGPGPRPLWGPGPAWSPGFPARPGRGRPYMASRPPGDLAEAGGRALQSLQLRLLTPTFEGINGLLLKQHLVQNPVRLWQLLGGTFYFNTSRLKQKNKEKDKSKGKAPEEDEEERRRRERDDQMYRERLRTLLVIAVVMSLLNALSTSGGSISWNDFVHEMLAKGEVQRVQVVPESDVVEVYLHPGAVVFGRPRLALMYRMQVANIDKFEEKLRAAEDELNIEAKDRIPVSYKRTGFFGNALYSVGMTAVGLAILWYVFRLAGMTGREGGFSAFNQLKMARFTIVDGKMGKGVSFKDVAGMHEAKLEVREFVDYLKSPERFLQLGAKVPKGALLLGPPGCGKTLLAKAVATEAQVPFLAMAGPEFVEVIGGLGAARVRSLFKEARARAPCIVYIDEIDAVGKKRSTTMSGFSNTEEEQTLNQLLVEMDGMGTTDHVIVLASTNRADILDGALMRPGRLDRHVFIDLPTLQERREIFEQHLKSLKLTQSSTFYSQRLAELTPGFSGADIANICNEAALHAAREGHTSVHTLNFEYAVERVLAGTAKKSKILSKEEQKVVAFHESGHALVGWMLEHTEAVMKVSITPRTNAALGFAQMLPRDQHLFTKEQLFERMCMALGGRASEALSFNEVTSGAQDDLRKVTRIAYSMVKQFGMAPGIGPISFPEAQEGLMGIGRRPFSQGLQQMMDHEARLLVAKAYRHTEKVLQDNLDKLQALANALLEKEVINYEDIEALIGPPPHGPKKMIAPQRWIDAQREKQDLGEEETEETQQPPLGGEEPTWPK,795,NP_003110.1.csv,refseq-SPG7-NM_003119.2_clinical_seed_0_final,refseq-SPG7-NM_003119.2.a2m,Invitae,refseq-SPG7-NM_003119.2.npy,1,795,795
+NP_003127.1,MVLADLGRKITSALRSLSNATIINEEVLNAMLKEVCTALLEADVNIKLVKQLRENVKSAIDLEEMASGLNKRKMIQHAVFKELVKLVDPGVKAWTPTKGKQNVIMFVGLQGSGKTTTCSKLAYYYQRKGWKTCLICADTFRAGAFDQLKQNATKARIPFYGSYTEMDPVIIASEGVEKFKNENFEIIIVDTSGRHKQEDSLFEEMLQVANAIQPDNIVYVMDASIGQACEAQAKAFKDKVDVASVIVTKLDGHAKGGGALSAVAATKSPIIFIGTGEHIDDFEPFKTQPFISKLLGMGDIEGLIDKVNELKLDDNEALIEKLKHGQFTLRDMYEQFQNIMKMGPFSQILGMIPGFGTDFMSKGNEQESMARLKKLMTIMDSMNDQELDSTDGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKMGGIKGLFKGGDMSKNVSQSQMAKLNQQMAKMMDPRVLHHMGGMAGLQSMMRQFQQGAAGNMKGMMGFNNM,504,NP_003127.1.csv,refseq-SRP54-NM_003136.3_clinical_seed_0_final,refseq-SRP54-NM_003136.3.a2m,Invitae,refseq-SRP54-NM_003136.3.npy,1,504,504
+NP_003147.2,MDVCVRLALWLLWGLLLHQGQSLSHSHSEKATGTSSGANSEESTAAEFCRIDKPLCHSEDEKLSFEAVRNIHKLMDDDANGDVDVEESDEFLREDLNYHDPTVKHSTFHGEDKLISVEDLWKAWKSSEVYNWTVDEVVQWLITYVELPQYEETFRKLQLSGHAMPRLAVTNTTMTGTVLKMTDRSHRQKLQLKALDTVLFGPPLLTRHNHLKDFMLVVSIVIGVGGCWFAYIQNRYSKEHMKKMMKDLEGLHRAEQSLHDLQERLHKAQEEHRTVEVEKVHLEKKLRDEINLAKQEAQRLKELREGTENERSRQKYAEEELEQVREALRKAEKELESHSSWYAPEALQKWLQLTHEVEVQYYNIKKQNAEKQLLVAKEGAEKIKKKRNTLFGTFHVAHSSSLDDVDHKILTAKQALSEVTAALRERLHRWQQIEILCGFQIVNNPGIHSLVAALNIDPSWMGSTRPNPAHFIMTDDVDDMDEEIVSPLSMQSPSLQSSVRQRLTEPQHGLGSQRDLTHSDSESSLHMSDRQRVAPKPPQMSRAADEALNAMTSNGSHRLIEGVHPGSLVEKLPDSPALAKKALLALNHGLDKAHSLMELSPSAPPGGSPHLDSSRSHSPSSPDPDTPSPVGDSRALQASRNTRIPHLAGKKAVAEEDNGSIGEETDSSPGRKKFPLKIFKKPLKK,685,NP_003147.2.csv,refseq-STIM1-NM_003156.3_clinical_seed_0_final,refseq-STIM1-NM_003156.3.a2m,Invitae,refseq-STIM1-NM_003156.3.npy,1,685,685
+NP_003150.1,MKIPNIGNVMNKFEILGVVGEGAYGVVLKCRHKETHEIVAIKKFKDSEENEEVKETTLRELKMLRTLKQENIVELKEAFRRRGKLYLVFEYVEKNMLELLEEMPNGVPPEKVKSYIYQLIKAIHWCHKNDIVHRDIKPENLLISHNDVLKLCDFGFARNLSEGNNANYTEYVATRWYRSPELLLGAPYGKSVDMWSVGCILGELSDGQPLFPGESEIDQLFTIQKVLGPLPSEQMKLFYSNPRFHGLRFPAVNHPQSLERRYLGILNSVLLDLMKNLLKLDPADRYLTEQCLNHPTFQTQRLLDRSPSRSAKRKPYHVESSTLSNRNQAGKSTALQSHHRSNSKDIQNLSVGLPRADEGLPANESFLNGNLAGASLSPLHTKTYQASSQPGSTSKDLTNNNIPHLLSPKEAKSKTEFDFNIDPKPSEGPGTKYLKSNSRSQQNRHSFMESSQSKAGTLQPNEKQSRHSYIDTIPQSSRSPSYRTKAKSHGALSDSKSVSNLSEARAQIAEPSTSRYFPSSCLDLNSPTSPTPTRHSDTRTLLSPSGRNNRNEGTLDSRRTTTRHSKTMEELKLPEHMDSSHSHSLSAPHESFSYGLGYTSPFSSQQRPHRHSMYVTRDKVRAKGLDGSLSIGQGMAARANSLQLLSPQPGEQLPPEMTVARSSVKETSREGTSSFHTRQKSEGGVYHDPHSDDGTAPKENRHLYNDPVPRRVGSFYRVPSPRPDNSFHENNVSTRVSSLPSESSSGTNHSKRQPAFDPWKSPENISHSEQLKEKEKQGFFRSMKKKKKKSQTVPNSDSPDLLTLQKSIHSASTPSSRPKEWRPEKISDLQTQSQPLKSLRKLLHLSSASNHPASSDPRFQPLTAQQTKNSFSEIRIHPLSQASGGSSNIRQEPAPKGRPALQLPDGGCDGRRQRHHSGPQDRRFMLRTTEQQGEYFCCGDPKKPHTPCVPNRALHRPISSPAPYPVLQVRGTSMCPTLQVRGTDAFSCPTQQSGFSFFVRHVMREALIHRAQVNQAALLTYHENAALTGK,1030,NP_003150.1.csv,refseq-CDKL5-NM_003159.2_clinical_seed_0_final,refseq-CDKL5-NM_003159.2.a2m,Invitae,refseq-CDKL5-NM_003159.2.npy,1,1030,1030
+NP_003156.1,MAPIGLKAVVGEKIMHDVIKKVKKKGEWKVLVVDQLSMRMLSSCCKMTDIMTEGITIVEDINKRREPLPSLEAVYLITPSEKSVHSLISDFKDPPTAKYRAAHVFFTDSCPDALFNELVKSRAAKVIKTLTEINIAFLPYESQVYSLDSADSFQSFYSPHKAQMKNPILERLAEQIATLCATLKEYPAVRYRGEYKDNALLAQLIQDKLDAYKADDPTMGEGPDKARSQLLILDRGFDPSSPVLHELTFQAMSYDLLPIENDVYKYETSGIGEARVKEVLLDEDDDLWIALRHKHIAEVSQEVTRSLKDFSSSKRMNTGEKTTMRDLSQMLKKMPQYQKELSKYSTHLHLAEDCMKHYQGTVDKLCRVEQDLAMGTDAEGEKIKDPMRAIVPILLDANVSTYDKIRIILLYIFLKNGITEENLNKLIQHAQIPPEDSEIITNMAHLGVPIVTDSTLRRRSKPERKERISEQTYQLSRWTPIIKDIMEDTIEDKLDTKHYPYISTRSSASFSTTAVSARYGHWHKNKAPGEYRSGPRLIIFILGGVSLNEMRCAYEVTQANGKWEVLIGSTHILTPTKFLMDLRHPDFRESSRVSFEDQAPTME,603,NP_003156.1.csv,refseq-STXBP1-NM_003165.3_clinical_seed_0_final,refseq-STXBP1-NM_003165.3.a2m,Invitae,refseq-STXBP1-NM_003165.3.npy,1,603,603
+NP_003163.1,MAAVAALQLGLRAAGLGRAPASAAWRSVLRVSPRPGVAWRPSRCGSSAAEASATKAEDDSFLQWVLLLIPVTAFGLGTWQVQRRKWKLNLIAELESRVLAEPVPLPADPMELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLISSSTQSGAYVVTPFHCTDLGVTILVNRGFVPRKKVNPETRQKGQIEGEVDLIGMVRLTETRQPFVPENNPERNHWHYRDLEAMARITGAEPIFIDANFQSTVPGGPIGGQTRVTLRNEHLQYIVTWYGLSAATSYLWFKKFLRGTPGV,300,NP_003163.1.csv,refseq-SURF1-NM_003172.3_clinical_seed_0_final,refseq-SURF1-NM_003172.3.a2m,Invitae,refseq-SURF1-NM_003172.3.npy,1,300,300
+NP_003165.2,MKRKERIARRLEGIENDTQPILLQSCTGLVTHRLLEEDTPRYMRASDPASPHIGRSNEEEETSDSSLEKQTRSKYCTETSGVHGDSPYGSGTMDTHSLESKAERIARYKAERRRQLAEKYGLTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYPGTETMGLRTCAGESKDYALHVGDGSSDPEVLLNIENQRRGQELSATRQAHDLSPAAESSSTFSFSGRDSSFTEVPRSPKHAHSSSLQQAASRSPSFGDPQLSPEARPRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRNTAVEQRLRRLQDRSLTQPITTEEVVIAATLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLSQPVSKAISTRNRIDTRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSPAVNTSVSTVASTVAPMYAGDLRTKPPLDHNASATDYKFSSSIENSDSPVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRDSGMEKYGSFEEAEASYPILNRAREGDSHKESKYAVPRRGSLERANPPITHLGDEPKEFSMAKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAAEFGEPTSEQTGTAAGKTIAQTTAPVSWKPQDSSEQPQEKLCKNPCAMFAAGEIKTPTGEGLLDSPSKTMSIKERLALLKKSGEEDWRNRLSRRQEGGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKTRGRGAANDSTQFTVAGRMVKKGLASPTAITPVASPICGKTRGTTPVSKPLEDIEARPDMQLESDLKLDRLETFLRRLNNKVGGMHETVLTVTGKSVKEVMKPDDDETFAKFYRSVDYNMPRSPVEMDEDFDVIFDPYAPKLTSSVAEHKRAVRPKRRVQASKNPLKMLAAREDLLQEYTEQRLNVAFMESKRMKVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAVPYKRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWVGEFANVIEKAKASELATLIQTKRELGCRATYIQTIEEGINTHTHAAKDFWKLLGGQTSYQSAGDPKEDELYEAAIIETNCIYRLMDDKLVPDDDYWGKIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAFQLAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETILFKEKFLDWTELKRSNEKNPGELAQHKEDPRTDVKAYDVTRMVSMPQTTAGTILDGVNVGRGYGLVEGHDRRQFEITSVSVDVWHILEFDYSRLPKQSIGQFHEGDAYVVKWKFMVSTAVGSRQKGEHSVRAAGKEKCVYFFWQGRHSTVSEKGTSALMTVELDEERGAQVQVLQGKEPPCFLQCFQGGMVVHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLRSRTSMVVLNVNKALIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGLHSSSKVTIHECDEGSEPLGFWDALGRRDRKAYDCMLQDPGSFNFAPRLFILSSSSGDFAATEFVYPARAPSVVSSMPFLQEDLYSAPQPALFLVDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVLQYCKGKNLKKPAPKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLVEDVLAKLCKTIYPLADLLARPLPEGVDPLKLEIYLTDEDFEFALDMTRDEYNALPAWKQVNLKKAKGLF,1788,NP_003165.2.csv,refseq-SVIL-NM_003174.3_clinical_seed_0_final,refseq-SVIL-NM_003174.3.a2m,Invitae,refseq-SVIL-NM_003174.3.npy,1,1788,1788
+NP_003168.2,MASSGMADSANHLPFFFGNITREEAEDYLVQGGMSDGLYLLRQSRNYLGGFALSVAHGRKAHHYTIERELNGTYAIAGGRTHASPADLCHYHSQESDGLVCLLKKPFNRPQGVQPKTGPFEDLKENLIREYVKQTWNLQGQALEQAIISQKPQLEKLIATTAHEKMPWFHGKISREESEQIVLIGSKTNGKFLIRARDNNGSYALCLLHEGKVLHYRIDKDKTGKLSIPEGKKFDTLWQLVEHYSYKADGLLRVLTVPCQKIGTQGNVNFGGRPQLPGSHPATWSAGGIISRIKSYSFPKPGHRKSSPAQGNRQESTVSFNPYEPELAPWAADKGPQREALPMDTEVYESPYADPEEIRPKEVYLDRKLLTLEDKELGSGNFGTVKKGYYQMKKVVKTVAVKILKNEANDPALKDELLAEANVMQQLDNPYIVRMIGICEAESWMLVMEMAELGPLNKYLQQNRHVKDKNIIELVHQVSMGMKYLEESNFVHRDLAARNVLLVTQHYAKISDFGLSKALRADENYYKAQTHGKWPVKWYAPECINYYKFSSKSDVWSFGVLMWEAFSYGQKPYRGMKGSEVTAMLEKGERMGCPAGCPREMYDLMNLCWTYDVENRPGFAAVELRLRNYYYDVVN,635,NP_003168.2.csv,refseq-SYK-NM_003177.6_clinical_seed_0_final,refseq-SYK-NM_003177.6.a2m,Invitae,refseq-SYK-NM_003177.6.npy,1,635,635
+NP_003184.1,MSDTLTADVIGRRVEVNGEHATVRFAGVVPPVAGPWLGVEWDNPERGKHDGSHEGTVYFKCRHPTGGSFIRPNKVNFGTDFLTAIKNRYVLEDGPEEDRKEQIVTIGNKPVETIGFDSIMKQQSQLSKLQEVSLRNCAVSCAGEKGGVAEACPNIRKVDLSKNLLSSWDEVIHIADQLRHLEVLNVSENKLKFPSGSVLTGTLSVLKVLVLNQTGITWAEVLRCVAGCPGLEELYLESNNIFISERPTDVLQTVKLLDLSSNQLIDENQLYLIAHLPRLEQLILSDTGISSLHFPDAGIGCKTSMFPSLKYLVVNDNQISQWSFFNELEKLPSLRALSCLRNPLTKEDKEAETARLLIIASIGQLKTLNKCEILPEERRRAELDYRKAFGNEWKQAGGHKDPEKNRLSEEFLTAHPRYQFLCLKYGAPEDWELKTQQPLMLKNQLLTLKIKYPHQLDQKVLEKQLPGSMTIQKVKGLLSRLLKVPVSDLLLSYESPKKPGREIELENDLKSLQFYSVENGDCLLVRW,527,NP_003184.1.csv,refseq-TBCE-NM_003193.4_clinical_seed_0_final,refseq-TBCE-NM_003193.4.a2m,Invitae,refseq-TBCE-NM_003193.4.npy,1,527,527
+NP_003192.1,MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSSVLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRSAYNVYVAERFQEAKGDSPQEKLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWEEQMIEVGRKDLLRRTIKKQRKYGAEEC,246,NP_003192.1.csv,refseq-TFAM-NM_003201.2_clinical_seed_0_final,refseq-TFAM-NM_003201.2.a2m,Invitae,refseq-TFAM-NM_003201.2.npy,1,246,246
+NP_003212.2,MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHTPSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLHQPQQHPWGQRQRQEVGSEAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGMEDVQSVEDANNSGMNLLDQSVIKKVPVPPKSVTSLMMNKDGFLGGMSVNTGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK,460,NP_003212.2.csv,refseq-TFAP2B-NM_003221.3_clinical_seed_0_final,refseq-TFAP2B-NM_003221.3.a2m,Invitae,refseq-TFAP2B-NM_003221.3.npy,1,460,460
+NP_003218.2,MERLWGLFQRAQQLSPRSSQTVYQRVEGPRKGHLEEEEEDGEEGAETLAHFCPMELRGPEPLGSRPRQPNLIPWAAAGRRAAPYLVLTALLIFTGAFLLGYVAFRGSCQACGDSVLVVSEDVNYEPDLDFHQGRLYWSDLQAMFLQFLGEGRLEDTIRQTSLRERVAGSAGMAALTQDIRAALSRQKLDHVWTDTHYVGLQFPDPAHPNTLHWVDEAGKVGEQLPLEDPDVYCPYSAIGNVTGELVYAHYGRPEDLQDLRARGVDPVGRLLLVRVGVISFAQKVTNAQDFGAQGVLIYPEPADFSQDPPKPSLSSQQAVYGHVHLGTGDPYTPGFPSFNQTQFPPVASSGLPSIPAQPISADIASRLLRKLKGPVAPQEWQGSLLGSPYHLGPGPRLRLVVNNHRTSTPINNIFGCIEGRSEPDHYVVIGAQRDAWGPGAAKSAVGTAILLELVRTFSSMVSNGFRPRRSLLFISWDGGDFGSVGSTEWLEGYLSVLHLKAVVYVSLDNAVLGDDKFHAKTSPLLTSLIESVLKQVDSPNHSGQTLYEQVVFTNPSWDAEVIRPLPMDSSAYSFTAFVGVPAVEFSFMEDDQAYPFLHTKEDTYENLHKVLQGRLPAVAQAVAQLAGQLLIRLSHDRLLPLDFGRYGDVVLRHIGNLNEFSGDLKARGLTLQWVYSARGDYIRAAEKLRQEIYSSEERDERLTRMYNVRIMRVEFYFLSQYVSPADSPFRHIFMGRGDHTLGALLDHLRLLRSNSSGTPGATSSTGFQESRFRRQLALLTWTLQGAANALSGDVWNIDNNF,801,NP_003218.2.csv,refseq-TFR2-NM_003227.3_clinical_seed_0_final,refseq-TFR2-NM_003227.3.a2m,Invitae,refseq-TFR2-NM_003227.3.npy,1,801,801
+NP_003229.1,MHYCVLSAFLILHLVTVALSLSTCSTLDMDQFMRKRIEAIRGQILSKLKLTSPPEDYPEPEEVPPEVISIYNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPPFFPSENAIPPTFYRPYFRIVRFDVSAMEKNASNLVKAEFRVFRLQNPKARVPEQRIELYQILKSKDLTSPTQRYIDSKVVKTRAEGEWLSFDVTDAVHEWLHHKDRNLGFKISLHCPCCTFVPSNNYIIPNKSEELEARFAGIDGTSTYTSGDQKTIKSTRKKNSGKTPHLLLMLLPSYRLESQQTNRRKKRALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLWSSDTQHSRVLSLYNTINPEASASPCCVSQDLEPLTILYYIGKTPKIEQLSNMIVKSCKCS,414,NP_003229.1.csv,refseq-TGFB2-NM_003238.3_clinical_seed_0_final,refseq-TGFB2-NM_003238.3.a2m,Invitae,refseq-TGFB2-NM_003238.3.npy,1,414,414
+NP_003230.1,MKMHLQRALVVLALLNFATVSLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPTVMTHVPYQVLALYNSTRELLEEMHGEREEGCTQENTESEYYAKEIHKFDMIQGLAEHNELAVCPKGITSKVFRFNVSSVEKNRTNLFRAEFRVLRVPNPSSKRNEQRIELFQILRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENIHEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDNPGQGGQRKKRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS,412,NP_003230.1.csv,refseq-TGFB3-NM_003239.3_clinical_seed_0_final,refseq-TGFB3-NM_003239.3.a2m,Invitae,refseq-TGFB3-NM_003239.3.npy,1,412,412
+NP_003233.4,MGRGLLRGLWPLHIVLWTRIASTIPPHVQKSVNNDMIVTDNNGAVKFPQLCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAVWRKNDENITLETVCHDPKLPYHDFILEDAASPKCIMKEKKKPGETFFMCSCSSDECNDNIIFSEEYNTSNPDLLLVIFQVTGISLLPPLGVAISVIIIFYCYRVNRQQKLSSTWETGKTRKLMEFSEHCAIILEDDRSDISSTCANNINHNTELLPIELDTLVGKGRFAEVYKAKLKQNTSEQFETVAVKIFPYEEYASWKTEKDIFSDINLKHENILQFLTAEERKTELGKQYWLITAFHAKGNLQEYLTRHVISWEDLRKLGSSLARGIAHLHSDHTPCGRPKMPIVHRDLKSSNILVKNDLTCCLCDFGLSLRLDPTLSVDDLANSGQVGTARYMAPEVLESRMNLENVESFKQTDVYSMALVLWEMTSRCNAVGEVKDYEPPFGSKVREHPCVESMKDNVLRDRGRPEIPSFWLNHQGIQMVCETLTECWDHDPEARLTAQCVAERFSELEHLDRLSGRSCSEEKIPEDGSLNTTK,567,NP_003233.4.csv,refseq-TGFBR2-NM_003242.5_clinical_seed_0_final,refseq-TGFBR2-NM_003242.5.a2m,Invitae,refseq-TGFBR2-NM_003242.5.npy,1,567,567
+NP_003237.2,MGLAWGLGVLFLMHVCGTNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRKVTEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQNSVTICKKVSCPIMPCSNATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYNHNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP,1170,NP_003237.2.csv,refseq-THBS1-NM_003246.3_clinical_seed_0_final,refseq-THBS1-NM_003246.3.a2m,Invitae,refseq-THBS1-NM_003246.3.npy,1,1170,1170
+NP_003276.3,MGADGETVVLKNMLIGINLILLGSMIKPSECQLEVTTERVQRQSVEEEGGIANYNTSSKEQPVVFNHVYNINVPLDNLCSSGLEASAEQEVSAEDETLAEYMGQTSDHESQVTFTHRINFPKKACPCASSAQVLQELLSRIEMLEREVSVLRDQCNANCCQESAATGQLDYIPHCSGHGNFSFESCGCICNEGWFGKNCSEPYCPLGCSSRGVCVDGQCICDSEYSGDDCSELRCPTDCSSRGLCVDGECVCEEPYTGEDCRELRCPGDCSGKGRCANGTCLCEEGYVGEDCGQRQCLNACSGRGQCEEGLCVCEEGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQPTALGGLQLQQRVPGDWSGVTITELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQGLQFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQVPSDVTSFNQTGLKPGEEYIVNVVALKEQARSPPTSASVSTVIDGPTQILVRDVSDTVAFVEWIPPRAKVDFILLKYGLVGGEGGRTTFRLQPPLSQYSVQALRPGSRYEVSVSAVRGTNESDSATTQFTTEIDAPKNLRVGSRTATSLDLEWDNSEAEVQEYKVVYSTLAGEQYHEVLVPRGIGPTTRATLTDLVPGTEYGVGISAVMNSQQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGIASEVTVPKDRTSYTLTDLEPGAEYIISVTAERGRQQSLESTVDAFTGFRPISHLHFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVSLDATKRHAVLMGLQPATEYIVNLVAVHGTVTSEPIVGSITTGIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRPTQVGRLDSSVVPNTVTEFTITRLNPATEYEISLNSVRGREESERICTLVHTAMDNPVDLIATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGVSEEFRLVDLLPSTHYTATMYATNGPLTSGTISTNFSTLLDPPANLTASEVTRQSALISWQPPRAEIENYVLTYKSTDGSRKELIVDAEDTWIRLEGLLENTDYTVLLQAAQDTTWSSITSTAFTTGGRVFPHPQDCAQHLMNGDTLSGVYPIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVGFGNVEDEFWLGLDNIHRITSQGRYELRVDMRDGQEAAFASYDRFSVEDSRNLYKLRIGSYNGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYNHRLMAGRKRQSLQF,1358,NP_003276.3.csv,refseq-TNR-NM_003285.2_clinical_seed_0_final,refseq-TNR-NM_003285.2.a2m,Invitae,refseq-TNR-NM_003285.2.npy,1,1358,1358
+NP_003280.2,MDAIKKKMQMLKLDKENAIDRAEQAEADKKQAEDRCKQLEEEQQALQKKLKGTEDEVEKYSESVKEAQEKLEQAEKKATDAEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRAMKDEEKMELQEMQLKEAKHIAEDSDRKYEEVARKLVILEGELERSEERAEVAESKCGDLEEELKIVTNNLKSLEAQADKYSTKEDKYEEEIKLLEEKLKEAETRAEFAERSVAKLEKTIDDLEDEVYAQKMKYKAISEELDNALNDITSL,284,NP_003280.2.csv,refseq-TPM2-NM_003289.3_clinical_seed_0_final,refseq-TPM2-NM_003289.3.a2m,Invitae,refseq-TPM2-NM_003289.3.npy,1,284,284
+NP_003292.1,MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENWFLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKESDHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS,398,NP_003292.1.csv,refseq-TRHR-NM_003301.5_clinical_seed_0_final,refseq-TRHR-NM_003301.5.a2m,Invitae,refseq-TRHR-NM_003301.5.npy,1,398,398
+NP_003312.3,MTTMAAATLLRATPHFSGLAAGRTFLLQGLLRLLKAPALPLLCRGLAVEAKKTYVRDKPHVNVGTIGHVDHGKTTLTAAITKILAEGGGAKFKKYEEIDNAPEERARGITINAAHVEYSTAARHYAHTDCPGHADYVKNMITGTAPLDGCILVVAANDGPMPQTREHLLLARQIGVEHVVVYVNKADAVQDSEMVELVELEIRELLTEFGYKGEETPVIVGSALCALEGRDPELGLKSVQKLLDAVDTYIPVPARDLEKPFLLPVEAVYSVPGRGTVVTGTLERGILKKGDECELLGHSKNIRTVVTGIEMFHKSLERAEAGDNLGALVRGLKREDLRRGLVMVKPGSIKPHQKVEAQVYILSKEEGGRHKPFVSHFMPVMFSLTWDMACRIILPPEKELAMPGEDLKFNLILRQPMILEKGQRFTLRDGNRTIGTGLVTNTLAMTEEEKNIKWG,455,NP_003312.3.csv,refseq-TUFM-NM_003321.4_clinical_seed_0_final,refseq-TUFM-NM_003321.4.a2m,Invitae,refseq-TUFM-NM_003321.4_theta_0.2.npy,1,455,455
+NP_003313.3,MPLRDETLREVWASDSGHEEESLSPEAPRRPKQRPAPAQRLRKKRTEAPESPCPTGSKPRKPGAGRTGRPREEPSPDPAQARAPQTVYARFLRDPEAKKRDPRETFLVARAPDAEDEEEEEEEDEEDEEEEAEEKKEKILLPPKKPLREKSSADLKERRAKAQGPRGDLGSPDPPPKPLRVRNKEAPAGEGTKMRKTKKKGSGEADKDPSGSPASARKSPAAMFLVGEGSPDKKALKKKGTPKGARKEEEEEEEAATVIKKSNQKGKAKGKGKKKAKEERAPSPPVEVDEPREFVLRPAPQGRTVRCRLTRDKKGMDRGMYPSYFLHLDTEKKVFLLAGRKRKRSKTANYLISIDPTNLSRGGENFIGKLRSNLLGNRFTVFDNGQNPQRGYSTNVASLRQELAAVIYETNVLGFRGPRRMTVIIPGMSAENERVPIRPRNASDGLLVRWQNKTLESLIELHNKPPVWNDDSGSYTLNFQGRVTQASVKNFQIVHADDPDYIVLQFGRVAEDAFTLDYRYPLCALQAFAIALSSFDGKLACE,542,NP_003313.3.csv,refseq-TULP1-NM_003322.4_clinical_seed_0_final,refseq-TULP1-NM_003322.4.a2m,Invitae,refseq-TULP1-NM_003322.4.npy,1,542,542
+NP_003325.2,MSSSPLSKKRRVSGPDPKPGSNCSPAQSVLSEVPSVPTNGMAKNGSEADIDEGLYSRQLYVLGHEAMKRLQTSSVLVSGLRGLGVEIAKNIILGGVKAVTLHDQGTAQWADLSSQFYLREEDIGKNRAEVSQPRLAELNSYVPVTAYTGPLVEDFLSGFQVVVLTNTPLEDQLRVGEFCHNRGIKLVVADTRGLFGQLFCDFGEEMILTDSNGEQPLSAMVSMVTKDNPGVVTCLDEARHGFESGDFVSFSEVQGMVELNGNQPMEIKVLGPYTFSICDTSNFSDYIRGGIVSQVKVPKKISFKSLVASLAEPDFVVTDFAKFSRPAQLHIGFQALHQFCAQHGRPPRPRNEEDAAELVALAQAVNARALPAVQQNNLDEDLIRKLAYVAAGDLAPINAFIGGLAAQEVMKACSGKFMPIMQWLYFDALECLPEDKEVLTEDKCLQRQNRYDGQVAVFGSDLQEKLGKQKYFLVGAGAIGCELLKNFAMIGLGCGEGGEIIVTDMDTIEKSNLNRQFLFRPWDVTKLKSDTAAAAVRQMNPHIRVTSHQNRVGPDTERIYDDDFFQNLDGVANALDNVDARMYMDRRCVYYRKPLLESGTLGTKGNVQVVIPFLTESYSSSQDPPEKSIPICTLKNFPNAIEHTLQWARDEFEGLFKQPAENVNQYLTDPKFVERTLRLAGTQPLEVLEAVQRSLVLQRPQTWADCVTWACHHWHTQYSNNIRQLLHNFPPDQLTSSGAPFWSGPKRCPHPLTFDVNNPLHLDYVMAAANLFAQTYGLTGSQDRAAVATFLQSVQVPEFTPKSGVKIHVSDQELQSANASVDDSRLEELKATLPSPDKLPGFKMYPIDFEKDDDSNFHMDFIVAASNLRAENYDIPSADRHKSKLIAGKIIPAIATTTAAVVGLVCLELYKVVQGHRQLDSYKNGFLNLALPFFGFSEPLAAPRHQYYNQEWTLWDRFEVQGLQPNGEEMTLKQFLDYFKTEHKLEITMLSQGVSMLYSFFMPAAKLKERLDQPMTEIVSRVSKRKLGRHVRALVLELCCNDESGEDVEVPYVRYTIR,1058,NP_003325.2.csv,refseq-UBA1-NM_003334.3_clinical_seed_0_final,refseq-UBA1-NM_003334.3.a2m,Invitae,refseq-UBA1-NM_003334.3.npy,1,1058,1058
+NP_003350.1,MFEIKKICCIGAGYVGGPTCSVIAHMCPEIRVTVVDVNESRINAWNSPTLPIYEPGLKEVVESCRGKNLFFSTNIDDAIKEADLVFISVNTPTKTYGMGKGRAADLKYIEACARRIVQNSNGYKIVTEKSTVPVRAAESIRRIFDANTKPNLNLQVLSNPEFLAEGTAIKDLKNPDRVLIGGDETPEGQRAVQALCAVYEHWVPREKILTTNTWSSELSKLAANAFLAQRISSINSISALCEATGADVEEVATAIGMDQRIGNKFLKASVGFGGSCFQKDVLNLVYLCEALNLPEVARYWQQVIDMNDYQRRRFASRIIDSLFNTVTDKKIAILGFAFKKDTGDTRESSSIYISKYLMDEGAHLHIYDPKVPREQIVVDLSHPGVSEDDQVSRLVTISKDPYEACDGAHAVVICTEWDMFKELDYERIHKKMLKPAFIFDGRRVLDGLHNELQTIGFQIETIGKKVSSKRIPYAPSGEIPKFSLQDPPNKKPKV,494,NP_003350.1.csv,refseq-UGDH-NM_003359.3_clinical_seed_0_final,refseq-UGDH-NM_003359.3.a2m,Invitae,refseq-UGDH-NM_003359.3.npy,1,494,494
+NP_003352.2,MGQPSLTWMLMVVVASWFITTAATDTSEARWCSECHSNATCTEDEAVTTCTCQEGFTGDGLTCVDLDECAIPGAHNCSANSSCVNTPGSFSCVCPEGFRLSPGLGCTDVDECAEPGLSHCHALATCVNVVGSYLCVCPAGYRGDGWHCECSPGSCGPGLDCVPEGDALVCADPCQAHRTLDEYWRSTEYGEGYACDTDLRGWYRFVGQGGARMAETCVPVLRCNTAAPMWLNGTHPSSDEGIVSRKACAHWSGHCCLWDASVQVKACAGGYYVYNLTAPPECHLAYCTDPSSVEGTCEECSIDEDCKSNNGRWHCQCKQDFNITDISLLEHRLECGANDMKVSLGKCQLKSLGFDKVFMYLSDSRCSGFNDRDNRDWVSVVTPARDGPCGTVLTRNETHATYSNTLYLADEIIIRDLNIKINFACSYPLDMKVSLKTALQPMVSALNIRVGGTGMFTVRMALFQTPSYTQPYQGSSVTLSTEAFLYVGTMLDGGDLSRFALLMTNCYATPSSNATDPLKYFIIQDRCPHTRDSTIQVVENGESSQGRFSVQMFRFAGNYDLVYLHCEVYLCDTMNEKCKPTCSGTRFRSGSVIDQSRVLNLGPITRKGVQATVSRAFSSLGLLKVWLPLLLSATLTLTFQ,640,NP_003352.2.csv,refseq-UMOD-NM_003361.3_clinical_seed_0_final,refseq-UMOD-NM_003361.3.a2m,Invitae,refseq-UMOD-NM_003361.3.npy,1,640,640
+NP_003357.2,MKLLTRAGSFSRFYSLKVAPKVKATAAPAGAPPQPQDLEFTKLPNGLVIASLENYSPVSRIGLFIKAGSRYEDFSNLGTTHLLRLTSSLTTKGASSFKITRGIEAVGGKLSVTATRENMAYTVECLRGDVDILMEFLLNVTTAPEFRRWEVADLQPQLKIDKAVAFQNPQTHVIENLHAAAYRNALANPLYCPDYRIGKVTSEELHYFVQNHFTSARMALIGLGVSHPVLKQVAEQFLNMRGGLGLSGAKANYRGGEIREQNGDSLVHAAFVAESAVAGSAEANAFSVLQHVLGAGPHVKRGSNTTSHLHQAVAKATQQPFDVSAFNASYSDSGLFGIYTISQATAAGDVIKAAYNQVKTIAQGNLSNTDVQAAKNKLKAGYLMSVESSECFLEEVGSQALVAGSYMPPSTVLQQIDSVANADIINAAKKFVSGQKSMAASGNLGHTPFVDEL,453,NP_003357.2.csv,refseq-UQCRC2-NM_003366.3_clinical_seed_0_final,refseq-UQCRC2-NM_003366.3.a2m,Invitae,refseq-UQCRC2-NM_003366.3_theta_0.2.npy,1,453,453
+NP_003374.3,MGTSALWALWLLLALCWAPRESGATGTGRKAKCEPSQFQCTNGRCITLLWKCDGDEDCVDGSDEKNCVKKTCAESDFVCNNGQCVPSRWKCDGDPDCEDGSDESPEQCHMRTCRIHEISCGAHSTQCIPVSWRCDGENDCDSGEDEENCGNITCSPDEFTCSSGRCISRNFVCNGQDDCSDGSDELDCAPPTCGAHEFQCSTSSCIPISWVCDDDADCSDQSDESLEQCGRQPVIHTKCPASEIQCGSGECIHKKWRCDGDPDCKDGSDEVNCPSRTCRPDQFECEDGSCIHGSRQCNGIRDCVDGSDEVNCKNVNQCLGPGKFKCRSGECIDISKVCNQEQDCRDWSDEPLKECHINECLVNNGGCSHICKDLVIGYECDCAAGFELIDRKTCGDIDECQNPGICSQICINLKGGYKCECSRGYQMDLATGVCKAVGKEPSLIFTNRRDIRKIGLERKEYIQLVEQLRNTVALDADIAAQKLFWADLSQKAIFSASIDDKVGRHVKMIDNVYNPAAIAVDWVYKTIYWTDAASKTISVATLDGTKRKFLFNSDLREPASIAVDPLSGFVYWSDWGEPAKIEKAGMNGFDRRPLVTADIQWPNGITLDLIKSRLYWLDSKLHMLSSVDLNGQDRRIVLKSLEFLAHPLALTIFEDRVYWIDGENEAVYGANKFTGSELATLVNNLNDAQDIIVYHELVQPSGKNWCEEDMENGGCEYLCLPAPQINDHSPKYTCSCPSGYNVEENGRDCQSTATTVTYSETKDTNTTEISATSGLVPGGINVTTAVSEVSVPPKGTSAAWAILPLLLLVMAAVGGYLMWRNWQHKNMKSMNFDNPVYLKTTEEDLSIDIGRHSASVGHTYPAISVVSTDDDLA,873,NP_003374.3.csv,refseq-VLDLR-NM_003383.4_clinical_seed_0_final,refseq-VLDLR-NM_003383.4.a2m,Invitae,refseq-VLDLR-NM_003383.4.npy,1,873,873
+NP_003383.4,MKKSIGILSPGVALGMAGSAMSSKFFLVALAIFFSFAQVVIEANSWWSLGMNNPVQMSEVYIIGAQPLCSQLAGLSQGQKKLCHLYQDHMQYIGEGAKTGIKECQYQFRHRRWNCSTVDNTSVFGRVMQIGSRETAFTYAVSAAGVVNAMSRACREGELSTCGCSRAARPKDLPRDWLWGGCGDNIDYGYRFAKEFVDARERERIHAKGSYESARILMNLHNNEAGRRTVYNLADVACKCHGVSGSCSLKTCWLQLADFRKVGDALKEKYDSAAAMRLNSRGKLVQVNSRFNSPTTQDLVYIDPSPDYCVRNESTGSLGTQGRLCNKTSEGMDGCELMCCGRGYDQFKTVQTERCHCKFHWCCYVKCKKCTEIVDQFVCK,380,NP_003383.4.csv,refseq-WNT5A-NM_003392.4_clinical_seed_0_final,refseq-WNT5A-NM_003392.4.a2m,Invitae,refseq-WNT5A-NM_003392.4.npy,1,380,380
+NP_003385.2,MLEEPRPRPPPSGLAGLLFLALCSRALSNEILGLKLPGEPPLTANTVCLTLSGLSKRQLGLCLRNPDVTASALQGLHIAVHECQHQLRDQRWNCSALEGGGRLPHHSAILKRGFRESAFSFSMLAAGVMHAVATACSLGKLVSCGCGWKGSGEQDRLRAKLLQLQALSRGKSFPHSLPSPGPGSSPSPGPQDTWEWGGCNHDMDFGEKFSRDFLDSREAPRDIQARMRIHNNRVGRQVVTENLKRKCKCHGTSGSCQFKTCWRAAPEFRAVGAALRERLGRAIFIDTHNRNSGAFQPRLRPRRLSGELVYFEKSPDFCERDPTMGSPGTRGRACNKTSRLLDGCGSLCCGRGHNVLRQTRVERCHCRFHWCCYVLCDECKVTEWVNVCK,389,NP_003385.2.csv,refseq-WNT10B-NM_003394.3_clinical_seed_0_final,refseq-WNT10B-NM_003394.3.a2m,Invitae,refseq-WNT10B-NM_003394.3.npy,1,389,389
+NP_003394.1,MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEEEEDDDDEDGGGGDHGGGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGKSGGGGSSSSGGGRVKKGGGKKSGKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIKTLEGEFSVTMWSSDEKKDIDHETVVEEQIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIACPHKGCTKMFRDNSAMRKHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHAKAKNNQ,414,NP_003394.1.csv,refseq-YY1-NM_003403.4_clinical_seed_0_final,refseq-YY1-NM_003403.4.a2m,Invitae,refseq-YY1-NM_003403.4.npy,1,414,414
+NP_003403.2,MLLDAGPQYPAIGVTTFGASRHHSAGDVAERDVGLGINPFADGMGAFKLNPSSHELASAGQTAFTSQAPGYAAAAALGHHHHPGHVGSYSSAAFNSTRDFLFRNRGFGDAAAAASAQHSLFAASAGGFGGPHGHTDAAGHLLFPGLHEQAAGHASPNVVNGQMRLGFSGDMYPRPEQYGQVTSPRSEHYAAPQLHGYGPMNVNMAAHHGAGAFFRYMRQPIKQELICKWIEPEQLANPKKSCNKTFSTMHELVTHVTVEHVGGPEQSNHICFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFKCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSSQGSQPSPAASSGYESSTPPTIVSPSTDNPTTSSLSPSSSAVHHTAGHSALSSNFNEWYV,447,NP_003403.2.csv,refseq-ZIC1-NM_003412.3_clinical_seed_0_final,refseq-ZIC1-NM_003412.3.a2m,Invitae,refseq-ZIC1-NM_003412.3.npy,1,447,447
+NP_003404.1,MTMLLDGGPQFPGLGVGSFGAPRHHEMPNREPAGMGLNPFGDSTHAAAAAAAAAAFKLSPAAAHDLSSGQSSAFTPQGSGYANALGHHHHHHHHHHHTSQVPSYGGAASAAFNSTREFLFRQRSSGLSEAASGGGQHGLFAGSASSLHAPAGIPEPPSYLLFPGLHEQGAGHPSPTGHVDNNQVHLGLRGELFGRADPYRPVASPRTDPYAAGAQFPNYSPMNMNMGVNVAAHHGPGAFFRYMRQPIKQELSCKWIDEAQLSRPKKSCDRTFSTMHELVTHVTMEHVGGPEQNNHVCYWEECPREGKSFKAKYKLVNHIRVHTGEKPFPCPFPGCGKIFARSENLKIHKRTHTGEKPFKCEFEGCDRRFANSSDRKKHMHVHTSDKPYICKVCDKSYTHPSSLRKHMKVHESQGSDSSPAASSGYESSTPPAIASANSKDTTKTPSAVQTSTSHNPGLPPNFNEWYV,467,NP_003404.1.csv,refseq-ZIC3-NM_003413.3_clinical_seed_0_final,refseq-ZIC3-NM_003413.3.a2m,Invitae,refseq-ZIC3-NM_003413.3.npy,1,467,467
+NP_003457.1,MPHNSIRSGHGGLNQLGGAFVNGRPLPEVVRQRIVDLAHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIRPGVIGGSKPKVATPKVVEKIGDYKRQNPTMFAWEIRDRLLAEGVCDNDTVPSVSSINRIIRTKVQQPFNLPMDSCVATKSLSPGHTLIPSSAVTPPESPQSDSLGSTYSINGLLGIAQPGSDKRKMDDSDQDSCRLSIDSQSSSSGPRKHLRTDAFSQHHLEPLECPFERQHYPEAYASPSHTKGEQGLYPLPLLNSTLDDGKATLTPSNTPLGRNLSTHQTYPVVADPHSPFAIKQETPEVSSSSSTPSSLSSSAFLDLQQVGSGVPPFNAFPHAASVYGQFTGQALLSGREMVGPTLPGYPPHIPTSGQGSYASSAIAGMVAGSEYSGNAYGHTPYSSYSEAWRFPNSSLLSSPYYYSSTSRPSAPPTTATAFDHL,450,NP_003457.1.csv,PAX8_HUMAN_b03_clinical_seed_0_final,PAX8_HUMAN_b03.a2m,EVE,PAX8_HUMAN_b03_theta_0.2.npy,1,450,450
+NP_003461.2,MNHQQQQQQQKAGEQQLSEPEDMEMEAGDTDDPPRITQNPVINGNVALSDGHNTAEEDMEDDTSWRSEATFQFTVERFSRLSESVLSPPCFVRNLPWKIMVMPRFYPDRPHQKSVGFFLQCNAESDSTSWSCHAQAVLKIINYRDDEKSFSRRISHLFFHKENDWGFSNFMAWSEVTDPEKGFIDDDKVTFEVFVQADAPHGVAWDSKKHTGYVGLKNQGATCYMNSLLQTLFFTNQLRKAVYMMPTEGDDSSKSVPLALQRVFYELQHSDKPVGTKKLTKSFGWETLDSFMQHDVQELCRVLLDNVENKMKGTCVEGTIPKLFRGKMVSYIQCKEVDYRSDRREDYYDIQLSIKGKKNIFESFVDYVAVEQLDGDNKYDAGEHGLQEAEKGVKFLTLPPVLHLQLMRFMYDPQTDQNIKINDRFEFPEQLPLDEFLQKTDPKDPANYILHAVLVHSGDNHGGHYVVYLNPKGDGKWCKFDDDVVSRCTKEEAIEHNYGGHDDDLSVRHCTNAYMLVYIRESKLSEVLQAVTDHDIPQQLVERLQEEKRIEAQKRKERQEAHLYMQVQIVAEDQFCGHQGNDMYDEEKVKYTVFKVLKNSSLAEFVQSLSQTMGFPQDQIRLWPMQARSNGTKRPAMLDNEADGNKTMIELSDNENPWTIFLETVDPELAASGATLPKFDKDHDVMLFLKMYDPKTRSLNYCGHIYTPISCKIRDLLPVMCDRAGFIQDTSLILYEEVKPNLTERIQDYDVSLDKALDELMDGDIIVFQKDDPENDNSELPTAKEYFRDLYHRVDVIFCDKTIPNDPGFVVTLSNRMNYFQVAKTVAQRLNTDPMLLQFFKSQGYRDGPGNPLRHNYEGTLRDLLQFFKPRQPKKLYYQQLKMKITDFENRRSFKCIWLNSQFREEEITLYPDKHGCVRDLLEECKKAVELGEKASGKLRLLEIVSYKIIGVHQEDELLECLSPATSRTFRIEEIPLDQVDIDKENEMLVTVAHFHKEVFGTFGIPFLLRIHQGEHFREVMKRIQSLLDIQEKEFEKFKFAIVMMGRHQYINEDEYEVNLKDFEPQPGNMSHPRPWLGLDHFNKAPKRSRYTYLEKAIKIHN,1102,NP_003461.2.csv,refseq-USP7-NM_003470.2_clinical_seed_0_final,refseq-USP7-NM_003470.2.a2m,Invitae,refseq-USP7-NM_003470.2.npy,1,1102,1102
+NP_003467.1,MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKVCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTGIGFGGLTQQVEKKE,194,NP_003467.1.csv,refseq-CSRP3-NM_003476.4_clinical_seed_0_final,refseq-CSRP3-NM_003476.4.a2m,Invitae,refseq-CSRP3-NM_003476.4.npy,1,194,194
+NP_003468.2,MAASWRLGCDPRLLRYLVGFPGRRSVGLVKGALGWSVSRGANWRWFHSTQWLRGDPIKILMPSLSPTMEEGNIVKWLKKEGEAVSAGDALCEIETDKAVVTLDASDDGILAKIVVEEGSKNIRLGSLIGLIVEEGEDWKHVEIPKDVGPPPPVSKPSEPRPSPEPQISIPVKKEHIPGTLRFRLSPAARNILEKHSLDASQGTATGPRGIFTKEDALKLVQLKQTGKITESRPTPAPTATPTAPSPLQATAGPSYPRPVIPPVSTPGQPNAVGTFTEIPASNIRRVIAKRLTESKSTVPHAYATADCDLGAVLKVRQDLVKDDIKVSVNDFIIKAAAVTLKQMPDVNVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAAKGIQEIADSVKALSKKARDGKLLPEEYQGGSFSISNLGMFGIDEFTAVINPPQACILAVGRFRPVLKLTEDEEGNAKLQQRQLITVTMSSDSRVVDDELATRFLKSFKANLENPIRLA,501,NP_003468.2.csv,refseq-PDHX-NM_003477.2_clinical_seed_0_final,refseq-PDHX-NM_003477.2.a2m,Invitae,refseq-PDHX-NM_003477.2.npy,1,501,501
+NP_003482.1,MNIRNARPEDLMNMQHCNLLCLPENYQMKYYFYHGLSWPQLSYIAEDENGKIVGYVLAKMEEDPDDVPHGHITSLAVKRSHRRLGLAQKLMDQASRAMIENFNAKYVSLHVRKSNRAALHLYSNTLNFQISEVEPKYYADGEDAYAMKRDLTQMADELRRHLELKEKGRHVVLGAIENKVESKGNSPPSSGEACREEKGLAAEDSGGDSKDLSEVSETTESTDVKDSSEASDSAS,235,NP_003482.1.csv,refseq-NAA10-NM_003491.3_clinical_seed_0_final,refseq-NAA10-NM_003491.3.a2m,Invitae,refseq-NAA10-NM_003491.3.npy,1,235,235
+NP_003485.1,MLRVFILYAENVHTPDTDISDAYCSAVFAGVKKRTKVIKNSVNPVWNEGFEWDLKGIPLDQGSELHVVVKDHETMGRNRFLGEAKVPLREVLATPSLSASFNAPLLDTKKQPTGASLVLQVSYTPLPGAVPLFPPPTPLEPSPTLPDLDVVADTGGEEDTEDQGLTGDEAEPFLDQSGGPGAPTTPRKLPSRPPPHYPGIKRKRSAPTSRKLLSDKPQDFQIRVQVIEGRQLPGVNIKPVVKVTAAGQTKRTRIHKGNSPLFNETLFFNLFDSPGELFDEPIFITVVDSRSLRTDALLGEFRMDVGTIYREPRHAYLRKWLLLSDPDDFSAGARGYLKTSLCVLGPGDEAPLERKDPSEDKEDIESNLLRPTGVALRGAHFCLKVFRAEDLPQMDDAVMDNVKQIFGFESNKKNLVDPFVEVSFAGKMLCSKILEKTANPQWNQNITLPAMFPSMCEKMRIRIIDWDRLTHNDIVATTYLSMSKISAPGGEIEEEPAGAVKPSKASDLDDYLGFLPTFGPCYINLYGSPREFTGFPDPYTELNTGKGEGVAYRGRLLLSLETKLVEHSEQKVEDLPADDILRVEKYLRRRKYSLFAAFYSATMLQDVDDAIQFEVSIGNYGNKFDMTCLPLASTTQYSRAVFDGCHYYYLPWGNVKPVVVLSSYWEDISHRIETQNQLLGIADRLEAGLEQVHLALKAQCSTEDVDSLVAQLTDELIAGCSQPLGDIHETPSATHLDQYLYQLRTHHLSQITEAALALKLGHSELPAALEQAEDWLLRLRALAEEPQNSLPDIVIWMLQGDKRVAYQRVPAHQVLFSRRGANYCGKNCGKLQTIFLKYPMEKVPGARMPVQIRVKLWFGLSVDEKEFNQFAEGKLSVFAETYENETKLALVGNWGTTGLTYPKFSDVTGKIKLPKDSFRPSAGWTWAGDWFVCPEKTLLHDMDAGHLSFVEEVFENQTRLPGGQWIYMSDNYTDVNGEKVLPKDDIECPLGWKWEDEEWSTDLNRAVDEQGWEYSITIPPERKPKHWVPAEKMYYTHRRRRWVRLRRRDLSQMEALKRHRQAEAEGEGWEYASLFGWKFHLEYRKTDAFRRRRWRRRMEPLEKTGPAAVFALEGALGGVMDDKSEDSMSVSTLSFGVNRPTISCIFDYGNRYHLRCYMYQARDLAAMDKDSFSDPYAIVSFLHQSQKTVVVKNTLNPTWDQTLIFYEIEIFGEPATVAEQPPSIVVELYDHDTYGADEFMGRCICQPSLERMPRLAWFPLTRGSQPSGELLASFELIQREKPAIHHIPGFEVQETSRILDESEDTDLPYPPPQREANIYMVPQNIKPALQRTAIEILAWGLRNMKSYQLANISSPSLVVECGGQTVQSCVIRNLRKNPNFDICTLFMEVMLPREELYCPPITVKVIDNRQFGRRPVVGQCTIRSLESFLCDPYSAESPSPQGGPDDVSLLSPGEDVLIDIDDKEPLIPIQEEEFIDWWSKFFASIGEREKCGSYLEKDFDTLKVYDTQLENVEAFEGLSDFCNTFKLYRGKTQEETEDPSVIGEFKGLFKIYPLPEDPAIPMPPRQFHQLAAQGPQECLVRIYIVRAFGLQPKDPNGKCDPYIKISIGKKSVSDQDNYIPCTLEPVFGKMFELTCTLPLEKDLKITLYDYDLLSKDEKIGETVVDLENRLLSKFGARCGLPQTYCVSGPNQWRDQLRPSQLLHLFCQQHRVKAPVYRTDRVMFQDKEYSIEEIEAGRIPNPHLGPVEERLALHVLQQQGLVPEHVESRPLYSPLQPDIEQGKLQMWVDLFPKALGRPGPPFNITPRRARRFFLRCIIWNTRDVILDDLSLTGEKMSDIYVKGWMIGFEEHKQKTDVHYRSLGGEGNFNWRFIFPFDYLPAEQVCTIAKKDAFWRLDKTESKIPARVVFQIWDNDKFSFDDFLGSLQLDLNRMPKPAKTAKKCSLDQLDDAFHPEWFVSLFEQKTVKGWWPCVAEEGEKKILAGKLEMTLEIVAESEHEERPAGQGRDEPNMNPKLEDPRRPDTSFLWFTSPYKTMKFILWRRFRWAIILFIILFILLLFLAIFIYAFPNYAAMKLVKPFS,2080,NP_003485.1.csv,refseq-DYSF-NM_003494.3_clinical_seed_0_final,refseq-DYSF-NM_003494.3.a2m,Invitae,refseq-DYSF-NM_003494.3.npy,1,2080,2080
+NP_003551.2,MQFFGRLVNTFSGVTNLFSNPFRVKEVAVADYTSSDRVREEGQLILFQNTPNRTWDCVLVNPRNSQSGFRLFQLELEADALVNFHQYSSQLLPFYESSPQVLHTEVLQHLTDLIRNHPSWSVAHLAVELGIRECFHHSRIISCANCAENEEGCTPLHLACRKGDGEILVELVQYCHTQMDVTDYKGETVFHYAVQGDNSQVLQLLGRNAVAGLNQVNNQGLTPLHLACQLGKQEMVRVLLLCNARCNIMGPNGYPIHSAMKFSQKGCAEMIISMDSSQIHSKDPRYGASPLHWAKNAEMARMLLKRGCNVNSTSSAGNTALHVAVMRNRFDCAIVLLTHGANADARGEHGNTPLHLAMSKDNVEMIKALIVFGAEVDTPNDFGETPTFLASKIGRLVTRKAILTLLRTVGAEYCFPPIHGVPAEQGSAAPHHPFSLERAQPPPISLNNLELQDLMHISRARKPAFILGSMRDEKRTHDHLLCLDGGGVKGLIIIQLLIAIEKASGVATKDLFDWVAGTSTGGILALAILHSKSMAYMRGMYFRMKDEVFRGSRPYESGPLEEFLKREFGEHTKMTDVRKPKVMLTGTLSDRQPAELHLFRNYDAPETVREPRFNQNVNLRPPAQPSDQLVWRAARSSGAAPTYFRPNGRFLDGGLLANNPTLDAMTEIHEYNQDLIRKGQANKVKKLSIVVSLGTGRSPQVPVTCVDVFRPSNPWELAKTVFGAKELGKMVVDCCTDPDGRAVDRARAWCEMVGIQYFRLNPQLGTDIMLDEVSDTVLVNALWETEVYIYEHREEFQKLIQLLLSP,806,NP_003551.2.csv,refseq-PLA2G6-NM_003560.2_clinical_seed_0_final,refseq-PLA2G6-NM_003560.2.a2m,Invitae,refseq-PLA2G6-NM_003560.2.npy,1,806,806
+NP_003570.2,MRRSLAPSQLAKRKPEGRSCDDEDWQPGLVTPRKRKSSSETQIQECFLSPFRKPLSQLTNQPPCLDSSQHEAFIRSILSKPFKVPIPNYQGPLGSRALGLKRAGVRRALHDPLEKDALVLYEPPPLSAHDQLKLDKEKLPVHVVVDPILSKVLRPHQREGVKFLWECVTSRRIPGSHGCIMADEMGLGKTLQCITLMWTLLRQSPECKPEIDKAVVVSPSSLVKNWYNEVGKWLGGRIQPLAIDGGSKDEIDQKLEGFMNQRGARVSSPILIISYETFRLHVGVLQKGSVGLVICDEGHRLKNSENQTYQALDSLNTSRRVLISGTPIQNDLLEYFSLVHFVNSGILGTAHEFKKHFELPILKGRDAAASEADRQLGEERLRELTSIVNRCLIRRTSDILSKYLPVKIEQVVCCRLTPLQTELYKRFLRQAKPAEELLEGKMSVSSLSSITSLKKLCNHPALIYDKCVEEEDGFVGALDLFPPGYSSKALEPQLSGKMLVLDYILAVTRSRSSDKVVLVSNYTQTLDLFEKLCRARRYLYVRLDGTMSIKKRAKVVERFNSPSSPDFVFMLSSKAGGCGLNLIGANRLVMFDPDWNPANDEQAMARVWRDGQKKTCYIYRLLSAGTIEEKIFQRQSHKKALSSCVVDEEQDVERHFSLGELKELFILDEASLSDTHDRLHCRRCVNSRQIRPPPDGSDCTSDLAGWNHCTDKWGLRDEVLQAAWDAASTAITFVFHQRSHEEQRGLR,747,NP_003570.2.csv,refseq-RAD54L-NM_003579.3_clinical_seed_0_final,refseq-RAD54L-NM_003579.3.a2m,Invitae,refseq-RAD54L-NM_003579.3.npy,1,747,747
+NP_003578.2,MATPAGLERWVQDELHSVLGLSERHVAQFLIGTAQRCTSAEEFVQRLRDTDTLDLSGPARDFALRLWNKVPRKAVVEKPARAAEREARALLEKNRSYRLLEDSEESSEETVSRAGSSLQKKRKKRKHLRKKREEEEEEEASEKGKKKTGGSKQQTEKPESEDEWERTERERLQDLEERDAFAERVRQRDKDRTRNVLERSDKKAYEEAQKRLKMAEEDRKAMVPELRKKSRREYLAKREREKLEDLEAELADEEFLFGDVELSRHERQELKYKRRVRDLAREYRAAGEQEKLEATNRYHMPKETRGQPARAVDLVEEESGAPGEEQRRWEEARLGAASLKFGARDAASQEPKYQLVLEEEETIEFVRATQLQGDEEPSAPPTSTQAQQKESIQAVRRSLPVFPFREELLAAIANHQVLIIEGETGSGKTTQIPQYLFEEGYTNKGMKIACTQPRRVAAMSVAARVAREMGVKLGNEVGYSIRFEDCTSERTVLRYMTDGMLLREFLSEPDLASYSVVMVDEAHERTLHTDILFGLIKDVARFRPELKVLVASATMDTARFSTFFDDAPVFRIPGRRFPVDIFYTKAPEADYLEACVVSVLQIHVTQPPGDILVFLTGQEEIEAACEMLQDRCRRLGSKIRELLVLPIYANLPSDMQARIFQPTPPGARKVVVATNIAETSLTIEGIIYVLDPGFCKQKSYNPRTGMESLTVTPCSKASANQRAGRAGRVAAGKCFRLYTAWAYQHELEETTVPEIQRTSLGNVVLLLKSLGIHDLMHFDFLDPPPYETLLLALEQLYALGALNHLGELTTSGRKMAELPVDPMLSKMILASEKYSCSEEILTVAAMLSVNNSIFYRPKDKVVHADNARVNFFLPGGDHLVLLNVYTQWAESGYSSQWCYENFVQFRSMRRARDVREQLEGLLERVEVGLSSCQGDYIRVRKAITAGYFYHTARLTRSGYRTVKQQQTVFIHPNSSLFEQQPRWLLYHELVLTTKEFMRQVLEIESSWLLEVAPHYYKAKELEDPHAKKMPKKIGKTREELG,1041,NP_003578.2.csv,refseq-DHX16-NM_003587.4_clinical_seed_0_final,refseq-DHX16-NM_003587.4.a2m,Invitae,refseq-DHX16-NM_003587.4_theta_0.2.npy,1,1041,1041
+NP_003579.3,MMSQSSGSGDGNDDEATTSKDGGFSSPSPSAAAAAQEVRSATDGNTSTTPPTSAKKRKLNSSSSSSSNSSNEREDFDSTSSSSSTPPLQPRDSASPSTSSFCLGVSVAASSHVPIQKKLRFEDTLEFVGFDAKMAEESSSSSSSSSPTAATSQQQQLKNKSILISSVASVHHANGLAKSSTTVSSFANSKPGSAKKLVIKNFKDKPKLPENYTDETWQKLKEAVEAIQNSTSIKYNLEELYQAVENLCSYKISANLYKQLRQICEDHIKAQIHQFREDSLDSVLFLKKIDRCWQNHCRQMIMIRSIFLFLDRTYVLQNSMLPSIWDMGLELFRAHIISDQKVQNKTIDGILLLIERERNGEAIDRSLLRSLLSMLSDLQIYQDSFEQRFLEETNRLYAAEGQKLMQEREVPEYLHHVNKRLEEEADRLITYLDQTTQKSLIATVEKQLLGEHLTAILQKGLNNLLDENRIQDLSLLYQLFSRVRGGVQVLLQQWIEYIKAFGSTIVINPEKDKTMVQELLDFKDKVDHIIDICFLKNEKFINAMKEAFETFINKRPNKPAELIAKYVDSKLRAGNKEATDEELEKMLDKIMIIFRFIYGKDVFEAFYKKDLAKRLLVGKSASVDAEKSMLSKLKHECGAAFTSKLEGMFKDMELSKDIMIQFKQYMQNQNVPGNIELTVNILTMGYWPTYVPMEVHLPPEMVKLQEIFKTFYLGKHSGRKLQWQSTLGHCVLKAEFKEGKKELQVSLFQTLVLLMFNEGEEFSLEEIKQATGIEDGELRRTLQSLACGKARVLAKNPKGKDIEDGDKFICNDDFKHKLFRIKINQIQMKETVEEQASTTERVFQDRQYQIDAAIVRIMKMRKTLSHNLLVSEVYNQLKFPVKPADLKKRIESLIDRDYMERDKENPNQYNYIA,913,NP_003579.3.csv,refseq-CUL4B-NM_003588.3_clinical_seed_0_final,refseq-CUL4B-NM_003588.3.a2m,Invitae,refseq-CUL4B-NM_003588.3.npy,1,913,913
+NP_003581.1,MSNLSKGTGSRKDTKMRIRAFPMTMDEKYVNSIWDLLKNAIQEIQRKNNSGLSFEELYRNAYTMVLHKHGEKLYTGLREVVTEHLINKVREDVLNSLNNNFLQTLNQAWNDHQTAMVMIRDILMYMDRVYVQQNNVENVYNLGLIIFRDQVVRYGCIRDHLRQTLLDMIARERKGEVVDRGAIRNACQMLMILGLEGRSVYEEDFEAPFLEMSAEFFQMESQKFLAENSASVYIKKVEARINEEIERVMHCLDKSTEEPIVKVVERELISKHMKTIVEMENSGLVHMLKNGKTEDLGCMYKLFSRVPNGLKTMCECMSSYLREQGKALVSEEGEGKNPVDYIQGLLDLKSRFDRFLLESFNNDRLFKQTIAGDFEYFLNLNSRSPEYLSLFIDDKLKKGVKGLTEQEVETILDKAMVLFRFMQEKDVFERYYKQHLARRLLTNKSVSDDSEKNMISKLKTECGCQFTSKLEGMFRDMSISNTTMDEFRQHLQATGVSLGGVDLTVRVLTTGYWPTQSATPKCNIPPAPRHAFEIFRRFYLAKHSGRQLTLQHHMGSADLNATFYGPVKKEDGSEVGVGGAQVTGSNTRKHILQVSTFQMTILMLFNNREKYTFEEIQQETDIPERELVRALQSLACGKPTQRVLTKEPKSKEIENGHIFTVNDQFTSKLHRVKIQTVAAKQGESDPERKETRQKVDDDRKHEIEAAIVRIMKSRKKMQHNVLVAEVTQQLKARFLPSPVVIKKRIEGLIEREYLARTPEDRKVYTYVA,768,NP_003581.1.csv,refseq-CUL3-NM_003590.4_clinical_seed_0_final,refseq-CUL3-NM_003590.4.a2m,Invitae,refseq-CUL3-NM_003590.4.npy,1,768,768
+NP_003592.3,MSSAAEPPPPPPPESAPSKPAASIASGGSNSSNKGGPEGVAAQAVASAASAGPADAEMEEIFDDASPGKQKEIQEPDPTYEEKMQTDRANRFEYLLKQTELFAHFIQPAAQKTPTSPLKMKPGRPRIKKDEKQNLLSVGDYRHRRTEQEEDEELLTESSKATNVCTRFEDSPSYVKWGKLRDYQVRGLNWLISLYENGINGILADEMGLGKTLQTISLLGYMKHYRNIPGPHMVLVPKSTLHNWMSEFKRWVPTLRSVCLIGDKEQRAAFVRDVLLPGEWDVCVTSYEMLIKEKSVFKKFNWRYLVIDEAHRIKNEKSKLSEIVREFKTTNRLLLTGTPLQNNLHELWSLLNFLLPDVFNSADDFDSWFDTNNCLGDQKLVERLHMVLRPFLLRRIKADVEKSLPPKKEVKIYVGLSKMQREWYTRILMKDIDILNSAGKMDKMRLLNILMQLRKCCNHPYLFDGAEPGPPYTTDMHLVTNSGKMVVLDKLLPKLKEQGSRVLIFSQMTRVLDILEDYCMWRNYEYCRLDGQTPHDERQDSINAYNEPNSTKFVFMLSTRAGGLGINLATADVVILYDSDWNPQVDLQAMDRAHRIGQTKTVRVFRFITDNTVEERIVERAEMKLRLDSIVIQQGRLVDQNLNKIGKDEMLQMIRHGATHVFASKESEITDEDIDGILERGAKKTAEMNEKLSKMGESSLRNFTMDTESSVYNFEGEDYREKQKIAFTEWIEPPKRERKANYAVDAYFREALRVSEPKAPKAPRPPKQPNVQDFQFFPPRLFELLEKEILFYRKTIGYKVPRNPELPNAAQAQKEEQLKIDEAESLNDEELEEKEKLLTQGFTNWNKRDFNQFIKANEKWGRDDIENIAREVEGKTPEEVIEYSAVFWERCNELQDIEKIMAQIERGEARIQRRISIKKALDTKIGRYKAPFHQLRISYGTNKGKNYTEEEDRFLICMLHKLGFDKENVYDELRQCIRNSPQFRFDWFLKSRTAMELQRRCNTLITLIERENMELEEKEKAEKKKRGPKPSTQKRKMDGAPDGRGRKKKLKL,1052,NP_003592.3.csv,refseq-SMARCA5-NM_003601.4_clinical_seed_0_final,refseq-SMARCA5-NM_003601.4.a2m,Invitae,refseq-SMARCA5-NM_003601.4.npy,1,1052,1052
+NP_003602.1,MMAQSNMFTVADVLSQDELRKKLYQTFKDRGILDTLKTQLRNQLIHELMHPVLSGELQPRSISVEGSSLLIGASNSLVADHLQRCGYEYSLSVFFPESGLAKEKVFTMQDLLQLIKINPTSSLYKSLVSGSDKENQKGFLMHFLKELAEYHQAKESCNMETQTSSTFNRDSLAEKLQLIDDQFADAYPQRIKFESLEIKLNEYKREIEEQLRAEMCQKLKFFKDTEIAKIKMEAKKKYEKELTMFQNDFEKACQAKSEALVLREKSTLERIHKHQEIETKEIYAQRQLLLKDMDLLRGREAELKQRVEAFELNQKLQEEKHKSITEALRRQEQNIKSFEETYDRKLKNELLKYQLELKDDYIIRTNRLIEDERKNKEKAVHLQEELIAINSKKEELNQSVNRVKELELELESVKAQSLAITKQNHMLNEKVKEMSDYSLLKEEKLELLAQNKLLKQQLEESRNENLRLLNRLAQPAPELAVFQKELRKAEKAIVVEHEEFESCRQALHKQLQDEIEHSAQLKAQILGYKASVKSLTTQVADLKLQLKQTQTALENEVYCNPKQSVIDRSVNGLINGNVVPCNGEISGDFLNNPFKQENVLARMVASRITNYPTAWVEGSSPDSDLEFVANTKARVKELQQEAERLEKAFRSYHRRVIKNSAKSPLAAKSPPSLHLLEAFKNITSSSPERHIFGEDRVVSEQPQVGTLEERNDVVEALTGSAASRLRGGTSSRRLSSTPLPKAKRSLESEMYLEGLGRSHIASPSPCPDRMPLPSPTESRHSLSIPPVSSPPEQKVGLYRRQTELQDKSEFSDVDKLAFKDNEEFESSFESAGNMPRQLEMGGLSPAGDMSHVDAAAAAVPLSYQHPSVDQKQIEEQKEEEKIREQQVKERRQREERRQSNLQEVLERERRELEKLYQERKMIEESLKIKIKKELEMENELEMSNQEIKDKSAHSENPLEKYMKIIQQEQDQESADKSSKKMVQEGSLVDTLQSSDKVESLTGFSHEELDDSW,1012,NP_003602.1.csv,refseq-OFD1-NM_003611.2_clinical_seed_0_final,refseq-OFD1-NM_003611.2.a2m,Invitae,refseq-OFD1-NM_003611.2.npy,1,1012,1012
+NP_003611.1,MAGLYSLGVSVFSDQGGRKYMEDVTQIVVEPEPTAEEKPSPRRSLSQPLPPRPSPAALPGGEVSGKGPAVAAREARDPLPDAGASPAPSRCCRRRSSVAFFAVCDGHGGREAAQFAREHLWGFIKKQKGFTSSEPAKVCAAIRKGFLACHLAMWKKLAEWPKTMTGLPSTSGTTASVVIIRGMKMYVAHVGDSGVVLGIQDDPKDDFVRAVEVTQDHKPELPKERERIEGLGGSVMNKSGVNRVVWKRPRLTHNGPVRRSTVIDQIPFLAVARALGDLWSYDFFSGEFVVSPEPDTSVHTLDPQKHKYIILGSDGLWNMIPPQDAISMCQDQEEKKYLMGEHGQSCAKMLVNRALGRWRQRMLRADNTSAIVICISPEVDNQGNFTNEDELYLNLTDSPSYNSQETCVMTPSPCSTPPVKSLEEDPWPRVNSKDHIPALVRSNAFSENFLEVSAEIARENVQGVVIPSKDPEPLEENCAKALTLRIHDSLNNSLPIGLVPTNSTNTVMDQKNLKMSTPGQMKAQEIERTPPTNFKRTLEESNSGPLMKKHRRNGLSRSSGAQPASLPTTSQRKNSVKLTMRRRLRGQKKIGNPLLHQHRKTVCVC,605,NP_003611.1.csv,refseq-PPM1D-NM_003620.3_clinical_seed_0_final,refseq-PPM1D-NM_003620.3.a2m,Invitae,refseq-PPM1D-NM_003620.3.npy,1,605,605
+NP_003621.1,MLRSVWNFLKRHKKKCIFLGTVLGGVYILGKYGQKKIREIQEREAAEYIAQARRQYHFESNQRTCNMTVLSMLPTLREALMQQLNSESLTALLKNRPSNKLEIWEDLKIISFTRSTVAVYSTCMLVVLLRVQLNIIGGYIYLDNAAVGKNGTTILAPPDVQQQYLSSIQHLLGDGLTELITVIKQAVQKVLGSVSLKHSLSLLDLEQKLKEIRNLVEQHKSSSWINKDGSKPLLCHYMMPDEETPLAVQACGLSPRDITTIKLLNETRDMLESPDFSTVLNTCLNRGFSRLLDNMAEFFRPTEQDLQHGNSMNSLSSVSLPLAKIIPIVNGQIHSVCSETPSHFVQDLLTMEQVKDFAANVYEAFSTPQQLEK,373,NP_003621.1.csv,refseq-PEX3-NM_003630.2_clinical_seed_0_final,refseq-PEX3-NM_003630.2.a2m,Invitae,refseq-PEX3-NM_003630.2.npy,1,373,373
+NP_003623.1,MMHLRLFCILLAAVSGAEGWGYYGCDEELVGPLYARSLGASSYYSLLTAPRFARLHGISGWSPRIGDPNPWLQIDLMKKHRIRAVATQGSFNSWDWVTRYMLLYGDRVDSWTPFYQRGHNSTFFGNVNESAVVRHDLHFHFTARYIRIVPLAWNPRGKIGLRLGLYGCPYKADILYFDGDDAISYRFPRGVSRSLWDVFAFSFKTEEKDGLLLHAEGAQGDYVTLELEGAHLLLHMSLGSSPIQPRPGHTTVSAGGVLNDQHWHYVRVDRFGRDVNFTLDGYVQRFILNGDFERLNLDTEMFIGGLVGAARKNLAYRHNFRGCIENVIFNRVNIADLAVRRHSRITFEGKVAFRCLDPVPHPINFGGPHNFVQVPGFPRRGRLAVSFRFRTWDLTGLLLFSRLGDGLGHVELTLSEGQVNVSIAQSGRKKLQFAAGYRLNDGFWHEVNFVAQENHAVISIDDVEGAEVRVSYPLLIRTGTSYFFGGCPKPASRWDCHSNQTAFHGCMELLKVDGQLVNLTLVEGRRLGFYAEVLFDTCGITDRCSPNMCEHDGRCYQSWDDFICYCELTGYKGETCHTPLYKESCEAYRLSGKTSGNFTIDPDGSGPLKPFVVYCDIRENRAWTVVRHDRLWTTRVTGSSMERPFLGAIQYWNASWEEVSALANASQHCEQWIEFSCYNSRLLNTAGGYPYSFWIGRNEEQHFYWGGSQPGIQRCACGLDRSCVDPALYCNCDADQPQWRTDKGLLTFVDHLPVTQVVIGDTNRSTSEAQFFLRPLRCYGDRNSWNTISFHTGAALRFPPIRANHSLDVSFYFRTSAPSGVFLENMGGPYCQWRRPYVRVELNTSRDVVFAFDVGNGDENLTVHSDDFEFNDDEWHLVRAEINVKQARLRVDHRPWVLRPMPLQTYIWMEYDQPLYVGSAELKRRPFVGCLRAMRLNGVTLNLEGRANASEGTSPNCTGHCAHPRLPCFHGGRCVERYSYYTCDCDLTAFDGPYCNHDIGGFFEPGTWMRYNLQSALRSAAREFSHMLSRPVPGYEPGYIPGYDTPGYVPGYHGPGYRLPDYPRPGRPVPGYRGPVYNVTGEEVSFSFSTSSAPAVLLYVSSFVRDYMAVLIKDDGTLQLRYQLGTSPYVYQLTTRPVTDGQPHSINITRVYRNLFIQVDYFPLTEQKFSLLVDSQLDSPKALYLGRVMETGVIDPEIQRYNTPGFSGCLSGVRFNNVAPLKTHFRTPRPMTAELAEALRVQGELSESNCGAMPRLVSEVPPELDPWYLPPDFPYYHDEGWVAILLGFLVAFLLLGLVGMLVLFYLQNHRYKGSYHTNEPKAAHEYHPGSKPPLPTSGPAQVPTPTAAPNQAPASAPAPAPTPAPAPGPRDQNLPQILEESRSE,1384,NP_003623.1.csv,refseq-CNTNAP1-NM_003632.2_clinical_seed_0_final,refseq-CNTNAP1-NM_003632.2.a2m,Invitae,refseq-CNTNAP1-NM_003632.2.npy,1,1384,1384
+NP_003631.2,MRNLKLFRTLEFRDIQGPGNPQCFSLRTEQGTVLIGSEHGLIEVDPVSREVKNEVSLVAEGFLPEDGSGRIVGVQDLLDQESVCVATASGDVILCSLSTQQLECVGSVASGISVMSWSPDQELVLLATGQQTLIMMTKDFEPILEQQIHQDDFGESKFITVGWGRKETQFHGSEGRQAAFQMQMHESALPWDDHRPQVTWRGDGQFFAVSVVCPETGARKVRVWNREFALQSTSEPVAGLGPALAWKPSGSLIASTQDKPNQQDIVFFEKNGLLHGHFTLPFLKDEVKVNDLLWNADSSVLAVWLEDLQREESSIPKTCVQLWTVGNYHWYLKQSLSFSTCGKSKIVSLMWDPVTPYRLHVLCQGWHYLAYDWHWTTDRSVGDNSSDLSNVAVIDGNRVLVTVFRQTVVPPPMCTYQLLFPHPVNQVTFLAHPQKSNDLAVLDASNQISVYKCGDCPSADPTVKLGAVGGSGFKVCLRTPHLEKRYKIQFENNEDQDVNPLKLGLLTWIEEDVFLAVSHSEFSPRSVIHHLTAASSEMDEEHGQLNVSSSAAVDGVIISLCCNSKTKSVVLQLADGQIFKYLWESPSLAIKPWKNSGGFPVRFPYPCTQTELAMIGEEECVLGLTDRCRFFINDIEVASNITSFAVYDEFLLLTTHSHTCQCFCLRDASFKTLQAGLSSNHVSHGEVLRKVERGSRIVTVVPQDTKLVLQMPRGNLEVVHHRALVLAQIRKWLDKLMFKEAFECMRKLRINLNLIYDHNPKVFLGNVETFIKQIDSVNHINLFFTELKEEDVTKTMYPAPVTSSVYLSRDPDGNKIDLVCDAMRAVMESINPHKYCLSILTSHVKKTTPELEIVLQKVHELQGNAPSDPDAVSAEEALKYLLHLVDVNELYDHSLGTYDFDLVLMVAEKSQKDPKEYLPFLNTLKKMETNYQRFTIDKYLKRYEKAIGHLSKCGPEYFPECLNLIKDKNLYNEALKLYSPSSQQYQDISIAYGEHLMQEHMYEPAGLMFARCGAHEKALSAFLTCGNWKQALCVAAQLNFTKDQLVGLGRTLAGKLVEQRKHIDAAMVLEECAQDYEEAVLLLLEGAAWEEALRLVYKYNRLDIIETNVKPSILEAQKNYMAFLDSQTATFSRHKKRLLVVRELKEQAQQAGLDDEVPHGQESDLFSETSSVVSGSEMSGKYSHSNSRISARSSKNRRKAERKKHSLKEGSPLEDLALLEALSEVVQNTENLKDEVYHILKVLFLFEFDEQGRELQKAFEDTLQLMERSLPEIWTLTYQQNSATPVLGPNSTANSIMASYQQQKTSVPVLDAELFIPPKINRRTQWKLSLLD,1332,NP_003631.2.csv,refseq-IKBKAP-NM_003640.3_clinical_seed_0_final,refseq-IKBKAP-NM_003640.3.a2m,Invitae,refseq-IKBKAP-NM_003640.3.npy,1,1332,1332
+NP_003638.1,MEAERRPAPGSPSEGLFADGHLILWTLCSVLLPVFITFWCSLQRSRRQLHRRDIFRKSKHGWRDTDLFSQPTYCCVCAQHILQGAFCDCCGLRVDEGCLRKADKRFQCKEIMLKNDTKVLDAMPHHWIRGNVPLCSYCMVCKQQCGCQPKLCDYRCIWCQKTVHDECMKNSLKNEKCDFGEFKNLIIPPSYLTSINQMRKDKKTDYEVLASKLGKQWTPLIILANSRSGTNMGEGLLGEFRILLNPVQVFDVTKTPPIKALQLCTLLPYYSARVLVCGGDGTVGWVLDAVDDMKIKGQEKYIPQVAVLPLGTGNDLSNTLGWGTGYAGEIPVAQVLRNVMEADGIKLDRWKVQVTNKGYYNLRKPKEFTMNNYFSVGPDALMALNFHAHREKAPSLFSSRILNKAVYLFYGTKDCLVQECKDLNKKVELELDGERVALPSLEGIIVLNIGYWGGGCRLWEGMGDETYPLARHDDGLLEVVGVYGSFHCAQIQVKLANPFRIGQAHTVRLILKCSMMPMQVDGEPWAQGPCTVTITHKTHAMMLYFSGEQTDDDISSTSDQEDIKATE,567,NP_003638.1.csv,refseq-DGKE-NM_003647.2_clinical_seed_0_final,refseq-DGKE-NM_003647.2.a2m,Invitae,refseq-DGKE-NM_003647.2.npy,1,567,567
+NP_003651.1,MMCEVMPTISEDGRRGSALGPDEAGGELERLMVTMLTERERLLETLREAQDGLATAQLRLRELGHEKDSLQRQLSIALPQEFAALTKELNLCREQLLEREEEIAELKAERNNTRLLLEHLECLVSRHERSLRMTVVKRQAQSPGGVSSEVEVLKALKSLFEHHKALDEKVRERLRMALERVAVLEEELELSNQETLNLREQLSRRRSGLEEPGKDGDGQTLANGLGPGGDSNRRTAELEEALERQRAEVCQLRERLAVLCRQMSQLEEELGTAHRELGKAEEANSKLQRDLKEALAQREDMEERITTLEKRYLSAQREATSLHDANDKLENELASKESLYRQSEEKSRQLAEWLDDAKQKLQQTLQKAETLPEIEAQLAQRVAALNKAEERHGNFEERLRQLEAQLEEKNQELQRARQREKMNDDHNKRLSETVDKLLSESNERLQLHLKERMGALEEKNSLSEEIANMKKLQDELLLNKEQLLAEMERMQMEIDQLRGRPPSSYSRSLPGSALELRYSQAPTLPSGAHLDPYVAGSGRAGKRGRWSGVKEEPSKDWERSAPAGSIPPPFPGELDGSDEEEAEGMFGAELLSPSGQADVQTLAIMLQEQLEAINKEIKLIQEEKETTEQRAEELESRVSSSGLDSLGRYRSSCSLPPSLTTSTLASPSPPSSGHSTPRLAPPSPAREGTDKANHVPKEEAGAPRGEGPAIPGDTPPPTPRSARLERMTQALALQAGSLEDGGPPRGSEGTPDSLHKAPKKKSIKSSIGRLFGKKEKGRMGPPGRDSSSLAGTPSDETLATDPLGLAKLTGPGDKDRRNKRKHELLEEACRQGLPFAAWDGPTVVSWLELWVGMPAWYVAACRANVKSGAIMANLSDTEIQREIGISNPLHRLKLRLAIQEMVSLTSPSAPASSRTSTGNVWMTHEEMESLTATTKPETKEISWEQILAYGDMNHEWVGNDWLPSLGLPQYRSYFMESLVDARMLDHLNKKELRGQLKMVDSFHRVSLHYGIMCLKRLNYDRKDLERRREESQTQIRDVMVWSNERVMGWVSGLGLKEFATNLTESGVHGALLALDETFDYSDLALLLQIPTQNAQARQLLEKEFSNLISLGTDRRLDEDSAKSFSRSPSWRKMFREKDLRGVTPDSAEMLPPNFRSAAAGALGSPGLPLRKLQPEGQTSGSSRADGVSVRTYSC,1194,NP_003651.1.csv,refseq-PPFIA3-NM_003660.3_clinical_seed_0_final,refseq-PPFIA3-NM_003660.3.a2m,Invitae,refseq-PPFIA3-NM_003660.3.npy,1,1194,1194
+NP_003667.1,MGSRVSREDFEWVYTDQPHADRRREILAKYPEIKSLMKPDPNLIWIIIMMVLTQLGAFYIVKDLDWKWVIFGAYAFGSCINHSMTLAIHEIAHNAAFGNCKAMWNRWFGMFANLPIGIPYSISFKRYHMDHHRYLGADGVDVDIPTDFEGWFFCTAFRKFIWVILQPLFYAFRPLFINPKPITYLEVINTVAQVTFDILIYYFLGIKSLVYMLAASLLGLGLHPISGHFIAEHYMFLKGHETYSYYGPLNLLTFNVGYHNEHHDFPNIPGKSLPLVRKIAAEYYDNLPHYNSWIKVLYDFVMDDTISPYSRMKRHQKGEMVLE,323,NP_003667.1.csv,refseq-DEGS1-NM_003676.3_clinical_seed_0_final,refseq-DEGS1-NM_003676.3.a2m,Invitae,refseq-DEGS1-NM_003676.3_theta_0.2.npy,1,323,323
+NP_003671.1,MGDAPSPEEKLHLITRNLQEVLGEEKLKEILKERELKIYWGTATTGKPHVAYFVPMSKIADFLKAGCEVTILFADLHAYLDNMKAPWELLELRVSYYENVIKAMLESIGVPLEKLKFIKGTDYQLSKEYTLDVYRLSSVVTQHDSKKAGAEVVKQVEHPLLSGLLYPGLQALDEEYLKVDAQFGGIDQRKIFTFAEKYLPALGYSKRVHLMNPMVPGLTGSKMSSSEEESKIDLLDRKEDVKKKLKKAFCEPGNVENNGVLSFIKHVLFPLKSEFVILRDEKWGGNKTYTAYVDLEKDFAAEVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEPEEVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLVVVLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEELKPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS,528,NP_003671.1.csv,refseq-YARS1-NM_003680.3_clinical_seed_0_final,refseq-YARS1-NM_003680.3.a2m,Invitae,refseq-YARS1-NM_003680.3.npy,1,528,528
+NP_003672.1,MEEECRVLSIQSHVIRGYVGNRAATFPLQVLGFEIDAVNSVQFSNHTGYAHWKGQVLNSDELQELYEGLRLNNMNKYDYVLTGYTRDKSFLAMVVDIVQELKQQNPRLVYVCDPVLGDKWDGEGSMYVPEDLLPVYKEKVVPLADIITPNQFEAELLSGRKIHSQEEALRVMDMLHSMGPDTVVITSSDLPSPQGSNYLIVLGSQRRRNPAGSVVMERIRMDIRKVDAVFVGTGDLFAAMLLAWTHKHPNNLKVACEKTVSTLHHVLQRTIQCAKAQAGEGVRPSPMQLELRMVQSKRDIEDPEIVVQATVL,312,NP_003672.1.csv,refseq-PDXK-NM_003681.5_clinical_seed_0_final,refseq-PDXK-NM_003681.5.a2m,Invitae,refseq-PDXK-NM_003681.5_theta_0.2.npy,1,312,312
+NP_003679.2,MADDDVLFEDVYELCEVIGKGPFSVVRRCINRETGQQFAVKIVDVAKFTSSPGLSTEDLKREASICHMLKHPHIVELLETYSSDGMLYMVFEFMDGADLCFEIVKRADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPHCVLLASKENSAPVKLGGFGVAIQLGESGLVAGGRVGTPHFMAPEVVKREPYGKPVDVWGCGVILFILLSGCLPFYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRMLMLDPAERITVYEALNHPWLKERDRYAYKIHLPETVEQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSEDPTSSGLLAAERAVSQVLDSLEEIHALTDCSEKDLDFLHSVFQDQHLHTLLDLYDKINTKSSPQIRNPPSDAVQRAKEVLEEISCYPENNDAKELKRILTQPHFMALLQTHDVVAHEVYSDEALRVTPPPTSPYLNGDSPESANGDMDMENVTRVRLVQFQKNTDEPMGITLKMNELNHCIVARIMHGGMIHRQGTLHVGDEIREINGISVANQTVEQLQKMLREMRGSITFKIVPSYRTQSSSCERDSPSTSRQSPANGHSSTNNSVSDLPSTTQPKGRQIYVRAQFEYDPAKDDLIPCKEAGIRFRVGDIIQIISKDDHNWWQGKLENSKNGTAGLIPSPELQEWRVACIAMEKTKQEQQASCTWFGKKKKQYKDKYLAKHNADLVTYEEVVKLPAFKRKTLVLLGAHGVGRRHIKNTLITKHPDRFAYPIPHTTRPPKKDEENGKNYYFVSHDQMMQDISNNEYLEYGSHEDAMYGTKLETIRKIHEQGLIAILDVEPQALKVLRTAEFAPFVVFIAAPTITPGLNEDESLQRLQKESDILQRTYAHYFDLTIINNEIDETIRHLEEAVELVCTAPQWVPVSWVY,921,NP_003679.2.csv,refseq-CASK-NM_003688.3_clinical_seed_0_final,refseq-CASK-NM_003688.3.a2m,Invitae,refseq-CASK-NM_003688.3.npy,1,921,921
+NP_003681.1,MSQSRHRAEAPPLEREDSGTFSLGKMITAKPGKTPIQVLHEYGMKTKNIPVYECERSDVQIHVPTFTFRVTVGDITCTGEGTSKKLAKHRAAEAAINILKANASICFAVPDPLMPDPSKQPKNQLNPIGSLQELAIHHGWRLPEYTLSQEGGPAHKREYTTICRLESFMETGKGASKKQAKRNAAEKFLAKFSNISPENHISLTNVVGHSLGCTWHSLRNSPGEKINLLKRSLLSIPNTDYIQLLSEIAKEQGFNITYLDIDELSANGQYQCLAELSTSPITVCHGSGISCGNAQSDAAHNALQYLKIIAERK,313,NP_003681.1.csv,refseq-PRKRA-NM_003690.4_clinical_seed_0_final,refseq-PRKRA-NM_003690.4.a2m,Invitae,refseq-PRKRA-NM_003690.4.npy,1,313,313
+NP_003696.2,MAVKVQTTKRGDPHELRNIFLQYASTEVDGERYMTPEDFVQRYLGLYNDPNSNPKIVQLLAGVADQTKDGLISYQEFLAFESVLCAPDSMFIVAFQLFDKSGNGEVTFENVKEIFGQTIIHHHIPFNWDCEFIRLHFGHNRKKHLNYTEFTQFLQELQLEHARQAFALKDKSKSGMISGLDFSDIMVTIRSHMLTPFVEENLVSAAGGSISHQVSFSYFNAFNSLLNNMELVRKIYSTLAGTRKDVEVTKEEFAQSAIRYGQVTPLEIDILYQLADLYNASGRLTLADIERIAPLAEGALPYNLAELQRQQSPGLGRPIWLQIAESAYRFTLGSVAGAVGATAVYPIDLVKTRMQNQRGSGSVVGELMYKNSFDCFKKVLRYEGFFGLYRGLIPQLIGVAPEKAIKLTVNDFVRDKFTRRDGSVPLPAEVLAGGCAGGSQVIFTNPLEIVKIRLQVAGEITTGPRVSALNVLRDLGIFGLYKGAKACFLRDIPFSAIYFPVYAHCKLLLADENGHVGGLNLLAAGAMAGVPAASLVTPADVIKTRLQVAARAGQTTYSGVIDCFRKILREEGPSAFWKGTAARVFRSSPQFGVTLVTYELLQRWFYIDFGGLKPAGSEPTPKSRIADLPPANPDHIGGYRLATATFAGIENKFGLYLPKFKSPSVAVVQPKAAVAATQ,678,NP_003696.2.csv,refseq-SLC25A12-NM_003705.4_clinical_seed_0_final,refseq-SLC25A12-NM_003705.4.a2m,Invitae,refseq-SLC25A12-NM_003705.4.npy,1,678,678
+NP_003709.3,MPSSSDTALGGGGGLSWAEKKLEERRKRRRFLSPQQPPLLLPLLQPQLLQPPPPPPPLLFLAAPGTAAAAAAAAAASSSCFSPGPPLEVKRLARGKRRAGGRQKRRRGPRAGQEAEKRRVFSLPQPQQDGGGGASSGGGVTPLVEYEDVSSQSEQGLLLGGASAATAATAAGGTGGSGGSPASSSGTQRRGEGSERRPRRDRRSSSGRSKERHREHRRRDGQRGGSEASKSRSRHSHSGEERAEVAKSGSSSSSGGRRKSASATSSSSSSRKDRDSKAHRSRTKSSKEPPSAYKEPPKAYREDKTEPKAYRRRRSLSPLGGRDDSPVSHRASQSLRSRKSPSPAGGGSSPYSRRLPRSPSPYSRRRSPSYSRHSSYERGGDVSPSPYSSSSWRRSRSPYSPVLRRSGKSRSRSPYSSRHSRSRSRHRLSRSRSRHSSISPSTLTLKSSLAAELNKNKKARAAEAARAAEAAKAAEATKAAEAAAKAAKASNTSTPTKGNTETSASASQTNHVKDVKKIKIEHAPSPSSGGTLKNDKAKTKPPLQVTKVENNLIVDKATKKAVIVGKESKSAATKEESVSLKEKTKPLTPSIGAKEKEQHVALVTSTLPPLPLPPMLPEDKEADSLRGNISVKAVKKEVEKKLRCLLADLPLPPELPGGDDLSKSPEEKKTATQLHSKRRPKICGPRYGETKEKDIDWGKRCVDKFDIIGIIGEGTYGQVYKARDKDTGEMVALKKVRLDNEKEGFPITAIREIKILRQLTHQSIINMKEIVTDKEDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFNENHIKSFMRQLMEGLDYCHKKNFLHRDIKCSNILLNNRGQIKLADFGLARLYSSEESRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGCILGELFTKKPIFQANQELAQLELISRICGSPCPAVWPDVIKLPYFNTMKPKKQYRRKLREEFVFIPAAALDLFDYMLALDPSKRCTAEQALQCEFLRDVEPSKMPPPDLPLWQDCHELWSKKRRRQKQMGMTDDVSTIKAPRKDLSLGLDDSRTNTPQGVLPSSQLKSQGSSNVAPVKTGPGQHLNHSELAILLNLLQSKTSVNMADFVQVLNIKVNSETQQQLNKINLPAGILATGEKQTDPSTPQQESSKPLGGIQPSSQTIQPKVETDAAQAAVQSAFAVLLTQLIKAQQSKQKDVLLEERENGSGHEASLQLRPPPEPSTPVSGQDDLIQHQDMRILELTPEPDRPRILPPDQRPPEPPEPPPVTEEDLDYRTENQHVPTTSSSLTDPHAGVKAALLQLLAQHQPQDDPKREGGIDYQAGDTYVSTSDYKDNFGSSSFSSAPYVSNDGLGSSSAPPLERRSFIGNSDIQSLDNYSTASSHSGGPPQPSAFSESFPSSVAGYGDIYLNAGPMLFSGDKDHRFEYSHGPIAVLANSSDPSTGPESTHPLPAKMHNYNYGGNLQENPSGPSLMHGQTWTSPAQGPGYSQGYRGHISTSTGRGRGRGLPY,1512,NP_003709.3.csv,refseq-CDK13-NM_003718.4_clinical_seed_0_final,refseq-CDK13-NM_003718.4.a2m,Invitae,refseq-CDK13-NM_003718.4.npy,1,1512,1512
+NP_003712.1,MELTQPAEDLIQTQQTPASELGDPEDPGEEAADGSDTVVLSLFPCTPEPVNPEPDASVSSPQAGSSLKHSTTLTNRQRGNEVSALPATLDSLSIHQLAAQGELDQLKEHLRKGDNLVNKPDERGFTPLIWASAFGEIETVRFLLEWGADPHILAKERESALSLASTGGYTDIVGLLLERDVDINIYDWNGGTPLLYAVRGNHVKCVEALLARGADLTTEADSGYTPMDLAVALGYRKVQQVIENHILKLFQSNLVPADPE,260,NP_003712.1.csv,refseq-RFXANK-NM_003721.3_clinical_seed_0_final,refseq-RFXANK-NM_003721.3.a2m,Invitae,refseq-RFXANK-NM_003721.3.npy,1,260,260
+NP_003713.3,MNFETSRCATLQYCPDPYIQRFVETPAHFSWKESYYRSTMSQSTQTNEFLSPEVFQHIWDFLEQPICSVQPIDLNFVDEPSEDGATNKIEISMDCIRMQDSDLSDPMWPQYTNLGLLNSMDQQIQNGSSSTSPYNTDHAQNSVTAPSPYAQPSSTFDALSPSPAIPSNTDYPGPHSFDVSFQQSSTAKSATWTYSTELKKLYCQIAKTCPIQIKVMTPPPQGAVIRAMPVYKKAEHVTEVVKRCPNHELSREFNEGQIAPPSHLIRVEGNSHAQYVEDPITGRQSVLVPYEPPQVGTEFTTVLYNFMCNSSCVGGMNRRPILIIVTLETRDGQVLGRRCFEARICACPGRDRKADEDSIRKQQVSDSTKNGDGTKRPFRQNTHGIQMTSIKKRRSPDDELLYLPVRGRETYEMLLKIKESLELMQYLPQHTIETYRQQQQQQHQHLLQKQTSIQSPSSYGNSSPPLNKMNSMNKLPSVSQLINPQQRNALTPTTIPDGMGANIPMMGTHMPMAGDMNGLSPTQALPPPLSMPSTSHCTPPPPYPTDCSIVSFLARLGCSSCLDYFTTQGLTTIYQIEHYSMDDLASLKIPEQFRHAIWKGILDHRQLHEFSSPSHLLRTPSSASTVSVGSSETRGERVIDAVRFTLRQTISFPPRDEWNDFNFDMDARRNKQQRIKEEGE,680,NP_003713.3.csv,refseq-TP63-NM_003722.4_clinical_seed_0_final,refseq-TP63-NM_003722.4.a2m,Invitae,refseq-TP63-NM_003722.4.npy,1,680,680
+NP_003733.2,MSDSVILRSIKKFGEENDGFESDKSYNNDKKSRLQDEKKGDGVRVGFFQLFRFSSSTDIWLMFVGSLCAFLHGIAQPGVLLIFGTMTDVFIDYDVELQELQIPGKACVNNTIVWTNSSLNQNMTNGTRCGLLNIESEMIKFASYYAGIAVAVLITGYIQICFWVIAAARQIQKMRKFYFRRIMRMEIGWFDCNSVGELNTRFSDDINKINDAIADQMALFIQRMTSTICGFLLGFFRGWKLTLVIISVSPLIGIGAATIGLSVSKFTDYELKAYAKAGVVADEVISSMRTVAAFGGEKREVERYEKNLVFAQRWGIRKGIVMGFFTGFVWCLIFLCYALAFWYGSTLVLDEGEYTPGTLVQIFLSVIVGALNLGNASPCLEAFATGRAAATSIFETIDRKPIIDCMSEDGYKLDRIKGEIEFHNVTFHYPSRPEVKILNDLNMVIKPGEMTALVGPSGAGKSTALQLIQRFYDPCEGMVTVDGHDIRSLNIQWLRDQIGIVEQEPVLFSTTIAENIRYGREDATMEDIVQAAKEANAYNFIMDLPQQFDTLVGEGGGQMSGGQKQRVAIARALIRNPKILLLDMATSALDNESEAMVQEVLSKIQHGHTIISVAHRLSTVRAADTIIGFEHGTAVERGTHEELLERKGVYFTLVTLQSQGNQALNEEDIKDATEDDMLARTFSRGSYQDSLRASIRQRSKSQLSYLVHEPPLAVVDHKSTYEEDRKDKDIPVQEEVEPAPVRRILKFSAPEWPYMLVGSVGAAVNGTVTPLYAFLFSQILGTFSIPDKEEQRSQINGVCLLFVAMGCVSLFTQFLQGYAFAKSGELLTKRLRKFGFRAMLGQDIAWFDDLRNSPGALTTRLATDASQVQGAAGSQIGMIVNSFTNVTVAMIIAFSFSWKLSLVILCFFPFLALSGATQTRMLTGFASRDKQALEMVGQITNEALSNIRTVAGIGKERRFIEALETELEKPFKTAIQKANIYGFCFAFAQCIMFIANSASYRYGGYLISNEGLHFSYVFRVISAVVLSATALGRAFSYTPSYAKAKISAARFFQLLDRQPPISVYNTAGEKWDNFQGKIDFVDCKFTYPSRPDSQVLNGLSVSISPGQTLAFVGSSGCGKSTSIQLLERFYDPDQGKVMIDGHDSKKVNVQFLRSNIGIVSQEPVLFACSIMDNIKYGDNTKEIPMERVIAAAKQAQLHDFVMSLPEKYETNVGSQGSQLSRGEKQRIAIARAIVRDPKILLLDEATSALDTESEKTVQVALDKAREGRTCIVIAHRLSTIQNADIIAVMAQGVVIEKGTHEELMAQKGAYYKLVTTGSPIS,1321,NP_003733.2.csv,refseq-ABCB11-NM_003742.2_clinical_seed_0_final,refseq-ABCB11-NM_003742.2.a2m,Invitae,refseq-ABCB11-NM_003742.2.npy,1,1321,1321
+NP_003745.1,MATPAVPVSAPPATPTPVPAAAPASVPAPTPAPAAAPVPAAAPASSSDPAAAAAATAAPGQTPASAQAPAQTPAPALPGPALPGPFPGGRVVRLHPVILASIVDSYERRNEGAARVIGTLLGTVDKHSVEVTNCFSVPHNESEDEVAVDMEFAKNMYELHKKVSPNELILGWYATGHDITEHSVLIHEYYSREAPNPIHLTVDTSLQNGRMSIKAYVSTLMGVPGRTMGVMFTPLTVKYAYYDTERIGVDLIMKTCFSPNRVIGLSSDLQQVGGASARIQDALSTVLQYAEDVLSGKVSADNTVGRFLMSLVNQVPKIVPDDFETMLNSNINDLLMVTYLANLTQSQIALNEKLVNL,357,NP_003745.1.csv,refseq-EIF3F-NM_003754.2_clinical_seed_0_final,refseq-EIF3F-NM_003754.2.a2m,Invitae,refseq-EIF3F-NM_003754.2.npy,1,357,357
+NP_003755.2,MKDRLAELLDLSKQYDQQFPDGDDEFDSPHEDIVFETDHILESLYRDIRDIQDENQLLVADVKRLGKQNARFLTSMRRLSSIKRDTNSIAKAIKARGEVIHCKLRAMKELSEAAEAQHGPHSAVARISRAQYNALTLTFQRAMHDYNQAEMKQRDNCKIRIQRQLEIMGKEVSGDQIEDMFEQGKWDVFSENLLADVKGARAALNEIESRHRELLRLESRIRDVHELFLQMAVLVEKQADTLNVIELNVQKTVDYTGQAKAQVRKAVQYEEKNPCRTLCCFCCPCLK,287,NP_003755.2.csv,refseq-STX11-NM_003764.3_clinical_seed_0_final,refseq-STX11-NM_003764.3.a2m,Invitae,refseq-STX11-NM_003764.3.npy,1,287,287
+NP_003782.1,MKLVNIWLLLLVVLLCGKKHLGDRLEKKSFEKAPCPGCSHLTLKVEFSSTVVEYEYIVAFNGYFTAKARNSFISSALKSSEVDNWRIIPRNNPSSDYPSDFEVIQIKEKQKAGLLTLEDHPNIKRVTPQRKVFRSLKYAESDPTVPCNETRWSQKWQSSRPLRRASLSLGSGFWHATGRHSSRRLLRAIPRQVAQTLQADVLWQMGYTGANVRVAVFDTGLSEKHPHFKNVKERTNWTNERTLDDGLGHGTFVAGVIASMRECQGFAPDAELHIFRVFTNNQVSYTSWFLDAFNYAILKKIDVLNLSIGGPDFMDHPFVDKVWELTANNVIMVSAIGNDGPLYGTLNNPADQMDVIGVGGIDFEDNIARFSSRGMTTWELPGGYGRMKPDIVTYGAGVRGSGVKGGCRALSGTSVASPVVAGAVTLLVSTVQKRELVNPASMKQALIASARRLPGVNMFEQGHGKLDLLRAYQILNSYKPQASLSPSYIDLTECPYMWPYCSQPIYYGGMPTVVNVTILNGMGVTGRIVDKPDWQPYLPQNGDNIEVAFSYSSVLWPWSGYLAISISVTKKAASWEGIAQGHVMITVASPAETESKNGAEQTSTVKLPIKVKIIPTPPRSKRVLWDQYHNLRYPPGYFPRDNLRMKNDPLDWNGDHIHTNFRDMYQHLRSMGYFVEVLGAPFTCFDASQYGTLLMVDSEEEYFPEEIAKLRRDVDNGLSLVIFSDWYNTSVMRKVKFYDENTRQWWMPDTGGANIPALNELLSVWNMGFSDGLYEGEFTLANHDMYYASGCSIAKFPEDGVVITQTFKDQGLEVLKQETAVVENVPILGLYQIPAEGGGRIVLYGDSNCLDDSHRQKDCFWLLDALLQYTSYGVTPPSLSHSGNRQRPPSGAGSVTPERMEGNHLHRYSKVLEAHLGDPKPRPLPACPRLSWAKPQPLNETAPSNLWKHQKLLSIDLDKVVLPNFRSNRPQVRPLSPGESGAWDIPGGIMPGRYNQEVGQTIPVFAFLGAMVVLAFFVVQINKAKSRPKRRKPRVKRPQLMQQVHPPKTPSV,1052,NP_003782.1.csv,refseq-MBTPS1-NM_003791.3_clinical_seed_0_final,refseq-MBTPS1-NM_003791.3.a2m,Invitae,refseq-MBTPS1-NM_003791.3_theta_0.2.npy,1,1052,1052
+NP_003784.2,MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRAVLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD,484,NP_003784.2.csv,refseq-CTSF-NM_003793.3_clinical_seed_0_final,refseq-CTSF-NM_003793.3.a2m,Invitae,refseq-CTSF-NM_003793.3.npy,1,484,484
+NP_003788.2,MSEREVSTAPAGTDMPAAKKQKLSSDENSNPDLSGDENDDAVSIESGTNTERPDTPTNTPNAPGRKSWGKGKWKSKKCKYSFKCVNSLKEDHNQPLFGVQFNWHSKEGDPLVFATVGSNRVTLYECHSQGEIRLLQSYVDADADENFYTCAWTYDSNTSHPLLAVAGSRGIIRIINPITMQCIKHYVGHGNAINELKFHPRDPNLLLSVSKDHALRLWNIQTDTLVAIFGGVEGHRDEVLSADYDLLGEKIMSCGMDHSLKLWRINSKRMMNAIKESYDYNPNKTNRPFISQKIHFPDFSTRDIHRNYVDCVRWLGDLILSKSCENAIVCWKPGKMEDDIDKIKPSESNVTILGRFDYSQCDIWYMRFSMDFWQKMLALGNQVGKLYVWDLEVEDPHKAKCTTLTHHKCGAAIRQTSFSRDSSILIAVCDDASIWRWDRLR,441,NP_003788.2.csv,refseq-EED-NM_003797.4_clinical_seed_0_final,refseq-EED-NM_003797.4.a2m,Invitae,refseq-EED-NM_003797.4.npy,1,441,441
+NP_003792.1,MGLLSDPVRRRALARLVLRLNAPLCVLSYVAGIAWFLALVFPPLTQRTYMSENAMGSTMVEEQFAGGDRARAFARDFAAHRKKSGALPVAWLERTMRSVGLEVYTQSFSRKLPFPDETHERYMVSGTNVYGILRAPRAASTESLVLTVPCGSDSTNSQAVGLLLALAAHFRGQIYWAKDIVFLVTEHDLLGTEAWLEAYHDVNVTGMQSSPLQGRAGAIQAAVALELSSDVVTSLDVAVEGLNGQLPNLDLLNLFQTFCQKGGLLCTLQGKLQPEDWTSLDGPLQGLQTLLLMVLRQASGRPHGSHGLFLRYRVEALTLRGINSFRQYKYDLVAVGKALEGMFRKLNHLLERLHQSFFLYLLPGLSRFVSIGLYMPAVGFLLLVLGLKALELWMQLHEAGMGLEEPGGAPGPSVPLPPSQGVGLASLVAPLLISQAMGLALYVLPVLGQHVATQHFPVAEAEAVVLTLLAIYAAGLALPHNTHRVVSTQAPDRGWMALKLVALIYLALQLGCIALTNFSLGFLLATTMVPTAALAKPHGPRTLYAALLVLTSPAATLLGSLFLWRELQEAPLSLAEGWQLFLAALAQGVLEHHTYGALLFPLLSLGLYPCWLLFWNVLFWK,621,NP_003792.1.csv,refseq-GPAA1-NM_003801.3_clinical_seed_0_final,refseq-GPAA1-NM_003801.3.a2m,Invitae,refseq-GPAA1-NM_003801.3.npy,1,621,621
+NP_003796.1,MEARDKQVLRSLRLELGAEVLVEGLVLQYLYQEGILTENHIQEINAQTTGLRKTMLLLDILPSRGPKAFDTFLDSLQEFPWVREKLKKAREEAMTDLPAGDRLTGIPSHILNSSPSDRQINQLAQRLGPEWEPMVLSLGLSQTDIYRCKANHPHNVQSQVVEAFIRWRQRFGKQATFQSLHNGLRAVEVDPSLLLHMLE,199,NP_003796.1.csv,refseq-CRADD-NM_003805.4_clinical_seed_0_final,refseq-CRADD-NM_003805.4.a2m,Invitae,refseq-CRADD-NM_003805.4.npy,1,199,199
+NP_003826.2,MTIRHQGQQYRPRMAFLQKIEALVKDMQNPETGVRMQNQRVLVTSVPHAMTGSDVLQWIVQRLWISSLEAQNLGNFIVRYGYIYPLQDPKNLILKPDGSLYRFQTPYFWPTQQWPAEDTDYAIYLAKRNIKKKGILEEYEKENYNFLNQKMNYKWDFVIMQAKEQYRAGKERNKADRYALDCQEKAYWLVHRCPPGMDNVLDYGLDRVTNPNEVKVNQKQTVVAVKKEIMYYQQALMRSTVKSSVSLGGIVKYSEQFSSNDAIMSGCLPSNPWITDDTQFWDLNAKLVEIPTKMRVERWAFNFSELIRDPKGRQSFQYFLKKEFSGENLGFWEACEDLKYGDQSKVKEKAEEIYKLFLAPGARRWINIDGKTMDITVKGLKHPHRYVLDAAQTHIYMLMKKDSYARYLKSPIYKDMLAKAIEPQETTKKSSTLPFMRRHLRSSPSPVILRQLEEEAKAREAANTVDITQPGQHMAPSPHLTVYTGTCMPPSPSSPFSSSCRSPRKPFASPSRFIRRPSTTICPSPIRVALESSSGLEQKGECSGSMAPRGPSVTESSEASLDTSWPRSRPRAPPKARMALSFSRFLRRGCLASPVFARLSPKCPAVSHGRVQPLGDVGQQLPRLKSKRVANFFQIKMDVPTGSGTCLMDSEDAGTGESGDRATEKEVICPWESL,674,NP_003826.2.csv,refseq-RGS9-NM_003835.3_clinical_seed_0_final,refseq-RGS9-NM_003835.3.a2m,Invitae,refseq-RGS9-NM_003835.3.npy,1,674,674
+NP_003830.1,MAPRARRRRPLFALLLLCALLARLQVALQIAPPCTSEKHYEHLGRCCNKCEPGKYMSSKCTTTSDSVCLPCGPDEYLDSWNEEDKCLLHKVCDTGKALVAVVAGNSTTPRRCACTAGYHWSQDCECCRRNTECAPGLGAQHPLQLNKDTVCKPCLAGYFSDAFSSTDKCRPWTNCTFLGKRVEHHGTEKSDAVCSSSLPARKPPNEPHVYLPGLIILLLFASVALVAAIIFGVCYRKKGKALTANLWHWINEACGRLSGDKESSGDSCVSTHTANFGQQGACEGVLLLTLEEKTFPEDMCYPDQGGVCQGTCVGGGPYAQGEDARMLSLVSKTEIEEDSFRQMPTEDEYMDRPSQPTDQLLFLTEPGSKSTPPFSEPLEVGENDSLSQCFTGTQSTVGSESCNCTEPLCRTDWTPMSSENYLQKEVDSGHCPHWAASPSPNWADVCTGCRNPPGEDCEPLVGSPKRGPLPQCAYGMGLPPEEEASRTEARDQPEDGADGRLPSSARAGAGSGSSPGGQSPASGNVTGNSNSTFISSGQVMNFKGDIIVVYVSQTSQEGAAAAAEPMGRPVQEETLARRDSFAGNGPRFPDPCGGPEGLREPEKASRPVQEQGGAKA,616,NP_003830.1.csv,refseq-TNFRSF11A-NM_003839.3_clinical_seed_0_final,refseq-TNFRSF11A-NM_003839.3.a2m,Invitae,refseq-TNFRSF11A-NM_003839.3.npy,1,616,616
+NP_003840.2,MTATLAAAADIATMVSGSSGLAAARLLSRSFLLPQNGIRHCSYTASRQHLYVDKNTKIICQGFTGKQGTFHSQQALEYGTKLVGGTTPGKGGQTHLGLPVFNTVKEAKEQTGATASVIYVPPPFAAAAINEAIEAEIPLVVCITEGIPQQDMVRVKHKLLRQEKTRLIGPNCPGVINPGECKIGIMPGHIHKKGRIGIVSRSGTLTYEAVHQTTQVGLGQSLCVGIGGDPFNGTDFIDCLEIFLNDSATEGIILIGEIGGNAEENAAEFLKQHNSGPNSKPVVSFIAGLTAPPGRRMGHAGAIIAGGKGGAKEKISALQSAGVVVSMSPAQLGTTIYKEFEKRKML,346,NP_003840.2.csv,refseq-SUCLG1-NM_003849.3_clinical_seed_0_final,refseq-SUCLG1-NM_003849.3.a2m,Invitae,refseq-SUCLG1-NM_003849.3.npy,1,346,346
+NP_003841.1,MAASMFYGRLVAVATLRNHRPRTAQRAAAQVLGSSGLFNNHGLQVQQQQQRNLSLHEYMSMELLQEAGVSVPKGYVAKSPDEAYAIAKKLGSKDVVIKAQVLAGGRGKGTFESGLKGGVKIVFSPEEAKAVSSQMIGKKLFTKQTGEKGRICNQVLVCERKYPRREYYFAITMERSFQGPVLIGSSHGGVNIEDVAAESPEAIIKEPIDIEEGIKKEQALQLAQKMGFPPNIVESAAENMVKLYSLFLKYDATMIEINPMVEDSDGAVLCMDAKINFDSNSAYRQKKIFDLQDWTQEDERDKDAAKANLNYIGLDGNIGCLVNGAGLAMATMDIIKLHGGTPANFLDVGGGATVHQVTEAFKLITSDKKVLAILVNIFGGIMRCDVIAQGIVMAVKDLEIKIPVVVRLQGTRVDDAKALIADSGLKILACDDLDEAARMVVKLSEIVTLAKQAHVDVKFQLPI,463,NP_003841.1.csv,refseq-SUCLA2-NM_003850.2_clinical_seed_0_final,refseq-SUCLA2-NM_003850.2.a2m,Invitae,refseq-SUCLA2-NM_003850.2.npy,1,463,463
+NP_003854.1,MATGTDQVVGLGLVAVSLIIFTYYTAWVILLPFIDSQHVIHKYFLPRAYAVAIPLAAGLLLLLFVGLFISYVMLKTKRVTKKAQ,84,NP_003854.1.csv,refseq-DPM2-NM_003863.3_clinical_seed_0_final,refseq-DPM2-NM_003863.3.a2m,Invitae,refseq-DPM2-NM_003863.3.npy,1,84,84
+NP_003856.1,MSPSLQEGAQLGENKPSTCSFSIERILGLDQKKDCVPLMKPHRPWADTCSSSGKDGNLCLHVPNPPSGISFPSVVDHPMPEERASKYENYFSASERLSLKRELSWYRGRRPRTAFTQNQIEVLENVFRVNCYPGIDIREDLAQKLNLEEDRIQIWFQNRRAKLKRSHRESQFLMAKKNFNTNLLE,185,NP_003856.1.csv,refseq-HESX1-NM_003865.2_clinical_seed_0_final,refseq-HESX1-NM_003865.2.a2m,Invitae,refseq-HESX1-NM_003865.2.npy,1,185,185
+NP_003871.1,MQGLLFSTLLLAGLAQFCCRVQGTGPLDTTPEGRPGEVSDAPQRKQFCHWPCKCPQQKPRCPPGVSLVRDGCGCCKICAKQPGEICNEADLCDPHKGLYCDYSVDRPRYETGVCAYLVAVGCEFNQVHYHNGQVFQPNPLFSCLCVSGAIGCTPLFIPKLAGSHCSGAKGGKKSDQSNCSLEPLLQQLSTSYKTMPAYRNLPLIWKKKCLVQATKWTPCSRTCGMGISNRVTNENSNCEMRKEKRLCYIQPCDSNILKTIKIPKGKTCQPTFQLSKAEKFVFSGCSSTQSYKPTFCGICLDKRCCIPNKSKMITIQFDCPNEGSFKWKMLWITSCVCQRNCREPGDIFSELKIL,354,NP_003871.1.csv,refseq-CCN6-NM_003880.3_clinical_seed_0_final,refseq-CCN6-NM_003880.3.a2m,Invitae,refseq-CCN6-NM_003880.3.npy,1,354,354
+NP_003874.2,MAKTVAYFYDPDVGNFHYGAGHPMKPHRLALTHSLVLHYGLYKKMIVFKPYQASQHDMCRFHSEDYIDFLQRVSPTNMQGFTKSLNAFNVGDDCPVFPGLFEFCSRYTGASLQGATQLNNKICDIAINWAGGLHHAKKFEASGFCYVNDIVIGILELLKYHPRVLYIDIDIHHGDGVQEAFYLTDRVMTVSFHKYGNYFFPGTGDMYEVGAESGRYYCLNVPLRDGIDDQSYKHLFQPVINQVVDFYQPTCIVLQCGADSLGCDRLGCFNLSIRGHGECVEYVKSFNIPLLVLGGGGYTVRNVARCWTYETSLLVEEAISEELPYSEYFEYFAPDFTLHPDVSTRIENQNSRQYLDQIRQTIFENLKMLNHAPSVQIHDVPADLLTYDRTDEADAEERGPEENYSRPEAPNEFYDGDHDNDKESDVEI,428,NP_003874.2.csv,refseq-HDAC3-NM_003883.3_clinical_seed_0_final,refseq-HDAC3-NM_003883.3.a2m,Invitae,refseq-HDAC3-NM_003883.3.npy,1,428,428
+NP_003886.3,MRKRWACWSGSDAPGGCGGGCGRRRRRSRRKRAASEERRMAFSKGFRIYHKLDPPPFSLIVETRHKEECLMFESGAVAVLSSAEKEAIKGTYSKVLDAYGLLGVLRLNLGDTMLHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRIDSSDEDRISEVRKVLNSGNFYFAWSASGISLDLSLNAHRSMQEQTTDNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQAKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVVYLDDSVSSFIQIRGSVPLFWEQPGLQVGSHRVRMSRGFEANAPAFDRHFRTLKNLYGKQIIVNLLGSKEGEHMLSKAFQSHLKASEHAADIQMVNFDYHQMVKGGKAEKLHSVLKPQVQKFLDYGFFYFNGSEVQRCQSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKPQLVTRFQEVFRSMWSVNGDSISKIYAGTGALEGKAKLKDGARSVTRTIQNNFFDSSKQEAIDVLLLGNTLNSDLADKARALLTTGSLRVSEQTLQSASSKVLKSMCENFYKYSKPKKIRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVSASTTNQKLWAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRGFLEGKVTFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRAELKTSDHRPVVALIDIDIFEVEAEERQNIYKEVIAVQGPPDGTVLVSIKSSLPENNFFDDALIDELLQQFASFGEVILIRFVEDKMWVTFLEGSSALNVLSLNGKELLNRTITIALKSPDWIKNLEEEMSLEKISIALPSSTSSTLLGEDAEVAADFDMEGDVDDYSAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTISEGPVPSLPIRPSRAPSRTPGPPSAQSSPIDAQPATPLPQKDPAQPLEPKRPPPPRPVAPPTRPAPPQRPPPPSGARSPAPTRKEFGGIGAPPSPGVARREMEAPKSPGTTRKDNIGRSQPSPQAGLAGPGPAGYSTARPTIPPRAGVISAPQSHARASAGRLTPESQSKTSETSKGSTFLPEPLKPQAAFPPQSSLPPPAQRLQEPLVPVAAPMPQSGPQPNLETPPQPPPRSRSSHSLPSEASSQPQVKTNGISDGKRESPLKIDPFEDLSFNLLAVSKAQLSVQTSPVPTPDPKRLIQLPSATQSNVLSSVSCMPTMPPIPARSQSQENMRSSPNPFITGLTRTNPFSDRTAAPGNPFRAKSEESEATSWFSKEEPVTISPFPSLQPLGHNKSRASSSLDGFKDSFDLQGQSTLKISNPKGWVTFEEEEDFGVKGKSKSACSDLLGNQPSSFSGSNLTLNDDWNKGTNVSFCVLPSRRPPPPPVPLLPPGTSPPVDPFTTLASKASPTLDFTER,1612,NP_003886.3.csv,refseq-SYNJ1-NM_003895.3_clinical_seed_0_final,refseq-SYNJ1-NM_003895.3.a2m,Invitae,refseq-SYNJ1-NM_003895.3.npy,1,1612,1612
+NP_003887.3,MRTKAAGCAERRPLQPRTEAAAAPAGRAMPSEYTYVKLRSDCSRPSLQWYTRAQSKMRRPSLLLKDILKCTLLVFGVWILYILKLNYTTEECDMKKMHYVDPDHVKRAQKYAQQVLQKECRPKFAKTSMALLFEHRYSVDLLPFVQKAPKDSEAESKYDPPFGFRKFSSKVQTLLELLPEHDLPEHLKAKTCRRCVVIGSGGILHGLELGHTLNQFDVVIRLNSAPVEGYSEHVGNKTTIRMTYPEGAPLSDLEYYSNDLFVAVLFKSVDFNWLQAMVKKETLPFWVRLFFWKQVAEKIPLQPKHFRILNPVIIKETAFDILQYSEPQSRFWGRDKNVPTIGVIAVVLATHLCDEVSLAGFGYDLNQPRTPLHYFDSQCMAAMNFQTMHNVTTETKFLLKLVKEGVVKDLSGGIDREF,418,NP_003887.3.csv,refseq-ST3GAL5-NM_003896.3_clinical_seed_0_final,refseq-ST3GAL5-NM_003896.3.a2m,Invitae,refseq-ST3GAL5-NM_003896.3.npy,1,418,418
+NP_003891.1,MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPCERLLSRVAALFPALRPGGFQAHYRDEDGDLVAFSSDEELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLHRGHTKLAFPSPFGHLSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSVNFLKNVGESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPSKPGGNVEGATQSLAEQMRKIALESEGRPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQSLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQYSKHPPPL,440,NP_003891.1.csv,refseq-SQSTM1-NM_003900.4_clinical_seed_0_final,refseq-SQSTM1-NM_003900.4.a2m,Invitae,refseq-SQSTM1-NM_003900.4.npy,1,440,440
+NP_003892.2,MPSTDLLMLKAFEPYLEILEVYSTKAKNYVNGHCTKYEPWQLIAWSVVWTLLIVWGYEFVFQPESLWSRFKKKCFKLTRKMPIIGRKIQDKLNKTKDDISKNMSFLKVDKEYVKALPSQGLSSSAVLEKLKEYSSMDAFWQEGRASGTVYSGEEKLTELLVKAYGDFAWSNPLHPDIFPGLRKIEAEIVRIACSLFNGGPDSCGCVTSGGTESILMACKAYRDLAFEKGIKTPEIVAPQSAHAAFNKAASYFGMKIVRVPLTKMMEVDVRAMRRAISRNTAMLVCSTPQFPHGVIDPVPEVAKLAVKYKIPLHVDACLGGFLIVFMEKAGYPLEHPFDFRVKGVTSISADTHKYGYAPKGSSLVLYSDKKYRNYQFFVDTDWQGGIYASPTIAGSRPGGISAACWAALMHFGENGYVEATKQIIKTARFLKSELENIKGIFVFGNPQLSVIALGSRDFDIYRLSNLMTAKGWNLNQLQFPPSIHFCITLLHARKRVAIQFLKDIRESVTQIMKNPKAKTTGMGAIYGMAQTTVDRNMVAELSSVFLDSLYSTDTVTQGSQMNGSPKPH,568,NP_003892.2.csv,refseq-SGPL1-NM_003901.3_clinical_seed_0_final,refseq-SGPL1-NM_003901.3.a2m,Invitae,refseq-SGPL1-NM_003901.3.npy,1,568,568
+NP_003898.2,MAAPVVAPPGVVVSRANKRSGAGPGGSGGGGARGAEEEPPPPLQAVLVADSFDRRFFPISKDQPRVLLPLANVALIDYTLEFLTATGVQETFVFCCWKAAQIKEHLLKSKWCRPTSLNVVRIITSELYRSLGDVLRDVDAKALVRSDFLLVYGDVISNINITRALEEHRLRRKLEKNVSVMTMIFKESSPSHPTRCHEDNVVVAVDSTTNRVLHFQKTQGLRRFAFPLSLFQGSSDGVEVRYDLLDCHISICSPQVAQLFTDNFDYQTRDDFVRGLLVNEEILGNQIHMHVTAKEYGARVSNLHMYSAVCADVIRRWVYPLTPEANFTDSTTQSCTHSRHNIYRGPEVSLGHGSILEENVLLGSGTVIGSNCFITNSVIGPGCHIGDNVVLDQTYLWQGVRVAAGAQIHQSLLCDNAEVKERVTLKPRSVLTSQVVVGPNITLPEGSVISLHPPDAEEDEDDGEFSDDSGADQEKDKVKMKGYNPAEVGAAGKGYLWKAAGMNMEEEEELQQNLWGLKINMEEESESESEQSMDSEEPDSRGGSPQMDDIKVFQNEVLGTLQRGKEENISCDNLVLEINSLKYAYNISLKEVMQVLSHVVLEFPLQQMDSPLDSSRYCALLLPLLKAWSPVFRNYIKRAADHLEALAAIEDFFLEHEALGISMAKVLMAFYQLEILAEETILSWFSQRDTTDKGQQLRKNQQLQRFIQWLKEAEEESSEDD,721,NP_003898.2.csv,refseq-EIF2B5-NM_003907.2_clinical_seed_0_final,refseq-EIF2B5-NM_003907.2.a2m,Invitae,refseq-EIF2B5-NM_003907.2.npy,1,721,721
+NP_003910.1,MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVHVLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTAENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVLGDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVENPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGEYKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVHHSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNLPHQTQIPQQQTTGKWYP,437,NP_003910.1.csv,refseq-SGCE-NM_003919.2_clinical_seed_0_final,refseq-SGCE-NM_003919.2.a2m,Invitae,refseq-SGCE-NM_003919.2.npy,1,437,437
+NP_003914.1,MGPCSGSRLGPPEAESPSQPPKRRKKRYLRHDKPPYTYLAMIALVIQAAPSRRLKLAQIIRQVQAVFPFFREDYEGWKDSIRHNLSSNRCFRKVPKDPAKPQAKGNFWAVDVSLIPAEALRLQNTALCRRWQNGGARGAFAKDLGPYVLHGRPYRPPSPPPPPSEGFSIKSLLGGSGEGAPWPGLAPQSSPVPAGTGNSGEEAVPTPPLPSSERPLWPLCPLPGPTRVEGETVQGGAIGPSTLSPEPRAWPLHLLQGTAVPGGRSSGGHRASLWGQLPTSYLPIYTPNVVMPLAPPPTSCPQCPSTSPAYWGVAPETRGPPGLLCDLDALFQGVPPNKSIYDVWVSHPRDLAAPGPGWLLSWCSL,365,NP_003914.1.csv,refseq-FOXH1-NM_003923.2_clinical_seed_0_final,refseq-FOXH1-NM_003923.2.a2m,Invitae,refseq-FOXH1-NM_003923.2.npy,1,365,365
+NP_003915.2,MYKMEYSYLNSSAYESCMAGMDTSSLASAYADFSSCSQASGFQYNPIRTTFGATSGCPSLTPGSCSLGTLRDHQSSPYAAVPYKLFTDHGGLNEKRKQRRIRTTFTSAQLKELERVFAETHYPDIYTREELALKIDLTEARVQVWFQNRRAKFRKQERAAAAAAAAAKNGSSGKKSDSSRDDESKEAKSTDPDSTGGPGPNPNPTPSCGANGGGGGGPSPAGAPGAAGPGGPGGEPGKGGAAAAAAAAAAAAAAAAAAAAGGLAAAGGPGQGWAPGPGPITSIPDSLGGPFASVLSSLQRPNGAKAALVKSSMF,314,NP_003915.2.csv,refseq-PHOX2B-NM_003924.3_clinical_seed_0_final,refseq-PHOX2B-NM_003924.3.a2m,Invitae,refseq-PHOX2B-NM_003924.3.npy,1,314,314
+NP_003916.1,MGTTGLESLSLGDRGAAPTVTSSERLVPDPPNDLRKEDVAMELERVGEDEEQMMIKRSSECNPLLQEPIASAQFGATAGTECRKSVPCGWERVVKQRLFGKTAGRFDVYFISPQGLKFRSKSSLANYLHKNGETSLKPEDFDFTVLSKRGIKSRYKDCSMAALTSHLQNQSNNSNWNLRTRSKCKKDVFMPPSSSSELQESRGLSNFTSTHLLLKEDEGVDDVNFRKVRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSDSKRESVCNKADAESEPVAQKSQLDRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGSNFCSEQKTSGIINKFCSAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKDFTGEKIFQEDTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDPWKLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWENHEKLSLS,580,NP_003916.1.csv,refseq-MBD4-NM_003925.2_clinical_seed_0_final,refseq-MBD4-NM_003925.2.a2m,Invitae,refseq-MBD4-NM_003925.2.npy,1,580,580
+NP_003928.1,MEPSSLELPADTVQRIAAELKCHPTDERVALHLDEEDKLRHFRECFYIPKIQDLPPVDLSLVNKDENAIYFLGNSLGLQPKMVKTYLEEELDKWAKIAAYGHEVGKRPWITGDESIVGLMKDIVGANEKEIALMNALTVNLHLLMLSFFKPTPKRYKILLEAKAFPSDHYAIESQLQLHGLNIEESMRMIKPREGEETLRIEDILEVIEKEGDSIAVILFSGVHFYTGQHFNIPAITKAGQAKGCYVGFDLAHAVGNVELYLHDWGVDFACWCSYKYLNAGAGGIAGAFIHEKHAHTIKPALVGWFGHELSTRFKMDNKLQLIPGVCGFRISNPPILLVCSLHASLEIFKQATMKALRKKSVLLTGYLEYLIKHNYGKDKAATKKPVVNIITPSHVEERGCQLTITFSVPNKDVFQELEKRGVVCDKRNPNGIRVAPVPLYNSFHDVYKFTNLLTSILDSAETKN,465,NP_003928.1.csv,refseq-KYNU-NM_003937.2_clinical_seed_0_final,refseq-KYNU-NM_003937.2.a2m,Invitae,refseq-KYNU-NM_003937.2.npy,1,465,465
+NP_003968.3,MADIIARLREDGIQKRVIQEGRGELPDFQDGTKATFHYRTLHSDDEGTVLDDSRARGKPMELIIGKKFKLPVWETIVCTMREGEIAQFLCDIKHVVLYPLVAKSLRNIAVGKDPLEGQRHCCGVAQMREHSSLGHADLDALQQNPQPLIFHMEMLKVESPGTYQQDPWAMTDEEKAKAVPLIHQEGNRLYREGHVKEAAAKYYDAIACLKNLQMKEQPGSPEWIQLDQQITPLLLNYCQCKLVVEEYYEVLDHCSSILNKYDDNVKAYFKRGKAHAAVWNAQEAQADFAKVLELDPALAPVVSRELRALEARIRQKDEEDKARFRGIFSH,330,NP_003968.3.csv,NP_003968.3_colabfold_clinical_seed_0_final,NP_003968.3_colabfold.a2m,colabfold,NP_003968.3_colabfold_theta_0.2.npy,1,330,330
+NP_003969.2,MMPQLQFKDAFWCRDFTAHTGYEVLLQRLLDGRKMCKDMEELLRQRAQAEERYGKELVQIARKAGGQTEINSLRASFDSLKQQMENVGSSHIQLALTLREELRSLEEFRERQKEQRKKYEAVMDRVQKSKLSLYKKAMESKKTYEQKCRDADDAEQAFERISANGHQKQVEKSQNKARQCKDSATEAERVYRQSIAQLEKVRAEWEQEHRTTCEAFQLQEFDRLTILRNALWVHSNQLSMQCVKDDELYEEVRLTLEGCSIDADIDSFIQAKSTGTEPPAPVPYQNYYDREVTPLTSSPGIQPSCGMIKRFSGLLHGSPKTTSLAASAASTETLTPTPERNEGVYTAIAVQEIQGNPASPAQEYRALYDYTAQNPDELDLSAGDILEVILEGEDGWWTVERNGQRGFVPGSYLEKL,416,NP_003969.2.csv,refseq-PSTPIP1-NM_003978.3_clinical_seed_0_final,refseq-PSTPIP1-NM_003978.3.a2m,Invitae,refseq-PSTPIP1-NM_003978.3_theta_0.2.npy,1,416,416
+NP_003982.1,MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAEIMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKYINTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVYKLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWTAVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWWLFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFALCWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKRFKNCFKAGPHVGNKLVMLFSVNIECDGTVNQNPTMWPERKSNNN,436,NP_003982.1.csv,refseq-EDNRB-NM_003991.4_clinical_seed_0_final,refseq-EDNRB-NM_003991.4.a2m,Invitae,refseq-EDNRB-NM_003991.4.npy,1,436,436
+NP_003986.2,MALPSLLLLVAALAGGVRPPGARNLTLAVVLPEHNLSYAWAWPRVGPAVALAVEALGRALPVDLRFVSSELEGACSEYLAPLSAVDLKLYHDPDLLLGPGCVYPAASVARFASHWRLPLLTAGAVASGFSAKNDHYRTLVRTGPSAPKLGEFVVTLHGHFNWTARAALLYLDARTDDRPHYFTIEGVFEALQGSNLSVQHQVYAREPGGPEQATHFIRANGRIVYICGPLEMLHEILLQAQRENLTNGDYVFFYLDVFGESLRAGPTRATGRPWQDNRTREQAQALREAFQTVLVITYREPPNPEYQEFQNRLLIRAREDFGVELGPSLMNLIAGCFYDGILLYAEVLNETIQEGGTREDGLRIVEKMQGRRYHGVTGLVVMDKNNDRETDFVLWAMGDLDSGDFQPAAHYSGAEKQIWWTGRPIPWVKGAPPSDNPPCAFDLDDPSCDKTPLSTLAIVALGTGITFIMFGVSSFLIFRKLMLEKELASMLWRIRWEELQFGNSERYHKGAGSRLTLSLRGSSYGSLMTAHGKYQIFANTGHFKGNVVAIKHVNKKRIELTRQVLFELKHMRDVQFNHLTRFIGACIDPPNICIVTEYCPRGSLQDILENDSINLDWMFRYSLINDLVKGMAFLHNSIISSHGSLKSSNCVVDSRFVLKITDYGLASFRSTAEPDDSHALYAKKLWTAPELLSGNPLPTTGMQKADVYSFGIILQEIALRSGPFYLEGLDLSPKEIVQKVRNGQRPYFRPSIDRTQLNEELVLLMERCWAQDPAERPDFGQIKGFIRRFNKEGGTSILDNLLLRMEQYANNLEKLVEERTQAYLEEKRKAEALLYQILPHSVAEQLKRGETVQAEAFDSVTIYFSDIVGFTALSAESTPMQVVTLLNDLYTCFDAIIDNFDVYKVETIGDAYMVVSGLPGRNGQRHAPEIARMALALLDAVSSFRIRHRPHDQLRLRIGVHTGPVCAGVVGLKMPRYCLFGDTVNTASRMESNGQALKIHVSSTTKDALDELGCFQLELRGDVEMKGKGKMRTYWLLGERKGPPGLL,1047,NP_003986.2.csv,refseq-NPR2-NM_003995.3_clinical_seed_0_final,refseq-NPR2-NM_003995.3.a2m,Invitae,refseq-NPR2-NM_003995.3.npy,1,1047,1047
+NP_003989.2,MAEDDPYLGRPEQMFHLDPSLTHTIFNPEVFQPQMALPTADGPYLQILEQPKQRGFRFRYVCEGPSHGGLPGASSEKNKKSYPQVKICNYVGPAKVIVQLVTNGKNIHLHAHSLVGKHCEDGICTVTAGPKDMVVGFANLGILHVTKKKVFETLEARMTEACIRGYNPGLLVHPDLAYLQAEGGGDRQLGDREKELIRQAALQQTKEMDLSVVRLMFTAFLPDSTGSFTRRLEPVVSDAIYDSKAPNASNLKIVRMDRTAGCVTGGEEIYLLCDKVQKDDIQIRFYEEEENGGVWEGFGDFSPTDVHRQFAIVFKTPKYKDINITKPASVFVQLRRKSDLETSEPKPFLYYPEIKDKEEVQRKRQKLMPNFSDSFGGGSGAGAGGGGMFGSGGGGGGTGSTGPGYSFPHYGFPTYGGITFHPGTTKSNAGMKHGTMDTESKKDPEGCDKSDDKNTVNLFGKVIETTEQDQEPSEATVGNGEVTLTYATGTKEESAGVQDNLFLEKAMQLAKRHANALFDYAVTGDVKMLLAVQRHLTAVQDENGDSVLHLAIIHLHSQLVRDLLEVTSGLISDDIINMRNDLYQTPLHLAVITKQEDVVEDLLRAGADLSLLDRLGNSVLHLAAKEGHDKVLSILLKHKKAALLLDHPNGDGLNAIHLAMMSNSLPCLLLLVAAGADVNAQEQKSGRTALHLAVEHDNISLAGCLLLEGDAHVDSTTYDGTTPLHIAAGRGSTRLAALLKAAGADPLVENFEPLYDLDDSWENAGEDEGVVPGTTPLDMATSWQVFDILNGKPYEPEFTSDDLLAQGDMKQLAEDVKLQLYKLLEIPDPDKNWATLAQKLGLGILNNAFRLSPAPSKTLMDNYEVSGGTVRELVEALRQMGYTEAIEVIQAASSPVKTTSQAHSLPLSPASTRQQIDELRDSDSVCDSGVETSFRKLSFTESLTSGASLLTLNKMPHDYGQEGPLEGKI,969,NP_003989.2.csv,refseq-NFKB1-NM_003998.3_clinical_seed_0_final,refseq-NFKB1-NM_003998.3.a2m,Invitae,refseq-NFKB1-NM_003998.3.npy,1,969,969
+NP_003990.1,MALFAVFQTTFFLTLLSLRTYQSEVLAERLPLTPVSLKVSTNSTRQSLHLQWTVHNLPYHQELKMVFQIQISRIETSNVIWVGNYSTTVKWNQVLHWSWESELPLECATHFVRIKSLVDDAKFPEPNFWSNWSSWEEVSVQDSTGQDILFVFPKDKLVEEGTNVTICYVSRNIQNNVSCYLEGKQIHGEQLDPHVTAFNLNSVPFIRNKGTNIYCEASQGNVSEGMKGIVLFVSKVLEEPKDFSCETEDFKTLHCTWDPGTDTALGWSKQPSQSYTLFESFSGEKKLCTHKNWCNWQITQDSQETYNFTLIAENYLRKRSVNILFNLTHRVYLMNPFSVNFENVNATNAIMTWKVHSIRNNFTYLCQIELHGEGKMMQYNVSIKVNGEYFLSELEPATEYMARVRCADASHFWKWSEWSGQNFTTLEAAPSEAPDVWRIVSLEPGNHTVTLFWKPLSKLHANGKILFYNVVVENLDKPSSSELHSIPAPANSTKLILDRCSYQICVIANNSVGASPASVIVISADPENKEVEEERIAGTEGGFSLSWKPQPGDVIGYVVDWCDHTQDVLGDFQWKNVGPNTTSTVISTDAFRPGVRYDFRIYGLSTKRIACLLEKKTGYSQELAPSDNPHVLVDTLTSHSFTLSWKDYSTESQPGFIQGYHVYLKSKARQCHPRFEKAVLSDGSECCKYKIDNPEEKALIVDNLKPESFYEFFITPFTSAGEGPSATFTKVTTPDEHSSMLIHILLPMVFCVLLIMVMCYLKSQWIKETCYPDIPDPYKSSILSLIKFKENPHLIIMNVSDCIPDAIEVVSKPEGTKIQFLGTRKSLTETELTKPNYLYLLPTEKNHSGPGPCICFENLTYNQAASDSGSCGHVPVSPKAPSMLGLMTSPENVLKALEKNYMNSLGEIPAGETSLNYVSQLASPMFGDKDSLPTNPVEAPHCSEYKMQMAVSLRLALPPPTENSSLSSITLLDPGEHYC,979,NP_003990.1.csv,refseq-OSMR-NM_003999.2_clinical_seed_0_final,refseq-OSMR-NM_003999.2.a2m,Invitae,refseq-OSMR-NM_003999.2.npy,1,979,979
+NP_003995.2,MDWGTLQTILGGVNKHSTSIGKIWLTVLFIFRIMILVVAAKEVWGDEQADFVCNTLQPGCKNVCYDHYFPISHIRLWALQLIFVSTPALLVAMHVAYRRHEKKRKFIKGEIKSEFKDIEEIKTQKVRIEGSLWWTYTSSIFFRVIFEAAFMYVFYVMYDGFSMQRLVKCNAWPCPNTVDCFVSRPTEKTVFTVFMIAVSGICILLNVTELCYLLIRYCSGKSKKPV,226,NP_003995.2.csv,refseq-GJB2-NM_004004.5_clinical_seed_0_final,refseq-GJB2-NM_004004.5.a2m,Invitae,refseq-GJB2-NM_004004.5.npy,1,226,226
+NP_004010.1,MREQLKGHETQTTCWDHPKMTELYQSLADLNNVRFSAYRTAMKLRRLQKALCLDLLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDTGRTGRIRVLSFKTGIISLCKAHLEDKYRYLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMRLEPQSMVWLPVLHRVAAAETAKHQAKCNICKECPIIGFRYRSLKHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTPTTSGEDVRDFAKVLKNKFRTKRYFAKHPRMGYLPVQTVLEGDNMET,340,NP_004010.1.csv,refseq-DMD-NM_004019.2_clinical_seed_0_final,refseq-DMD-NM_004019.2.a2m,Invitae,refseq-DMD-NM_004019.2.npy,1,340,340
+NP_004026.2,MNPDLRRERDSASFNPELLTHILDGSPEKTRRRREIENMILNDPDFQHEDLNFLTRSQRYEVAVRKSAIMVKKMREFGIADPDEIMWFKNFVHRGRPEPLDLHLGMFLPTLLHQATAEQQERFFMPAWNLEIIGTYAQTEMGHGTHLRGLETTATYDPETQEFILNSPTVTSIKWWPGGLGKTSNHAIVLAQLITKGKCYGLHAFIVPIREIGTHKPLPGITVGDIGPKFGYDEIDNGYLKMDNHRIPRENMLMKYAQVKPDGTYVKPLSNKLTYGTMVFVRSFLVGEAARALSKACTIAIRYSAVRHQSEIKPGEPEPQILDFQTQQYKLFPLLATAYAFQFVGAYMKETYHRINEGIGQGDLSELPELHALTAGLKAFTSWTANTGIEACRMACGGHGYSHCSGLPNIYVNFTPSCTFEGENTVMMLQTARFLMKSYDQVHSGKLVCGMVSYLNDLPSQRIQPQQVAVWPTMVDINSPESLTEAYKLRAARLVEIAAKNLQKEVIHRKSKEVAWNLTSVDLVRASEAHCHYVVVKLFSEKLLKIQDKAIQAVLRSLCLLYSLYGISQNAGDFLQGSIMTEPQITQVNQRVKELLTLIRSDAVALVDAFDFQDVTLGSVLGRYDGNVYENLFEWAKNSPLNKAEVHESYKHLKSLQSKL,660,NP_004026.2.csv,refseq-ACOX1-NM_004035.6_clinical_seed_0_final,refseq-ACOX1-NM_004035.6.a2m,Invitae,refseq-ACOX1-NM_004035.6.npy,1,660,660
+NP_004035.2,MAPGQLALFSVSDKTGLVEFARNLTALGLNLVASGGTAKALRDAGLAVRDVSELTGFPEMLGGRVKTLHPAVHAGILARNIPEDNADMARLDFNLIRVVACNLYPFVKTVASPGVTVEEAVEQIDIGGVTLLRAAAKNHARVTVVCEPEDYVVVSTEMQSSESKDTSLETRRQLALKAFTHTAQYDEAISDYFRKQYSKGVSQMPLRYGMNPHQTPAQLYTLQPKLPITVLNGAPGFINLCDALNAWQLVKELKEALGIPAAASFKHVSPAGAAVGIPLSEDEAKVCMVYDLYKTLTPISAAYARARGADRMSSFGDFVALSDVCDVPTAKIISREVSDGIIAPGYEEEALTILSKKKNGNYCVLQMDQSYKPDENEVRTLFGLHLSQKRNNGVVDKSLFSNVVTKNKDLPESALRDLIVATIAVKYTQSNSVCYAKNGQVIGIGAGQQSRIHCTRLAGDKANYWWLRHHPQVLSMKFKTGVKRAEISNAIDQYVTGTIGEDEDLIKWKALFEEVPELLTEAEKKEWVEKLTEVSISSDAFFPFRDNVDRAKRSGVAYIAAPSGSAADKVVIEACDELGIILAHTNLRLFHH,592,NP_004035.2.csv,refseq-ATIC-NM_004044.6_clinical_seed_0_final,refseq-ATIC-NM_004044.6.a2m,Invitae,refseq-ATIC-NM_004044.6.npy,1,592,592
+NP_004046.2,MFSCVKPYEDQNYSALRRDCRRRKVLFEDPLFPATDDSLYYKGTPGPAVRWKRPKGICEDPRLFVDGISSHDLHQGQVGNCWFVAACSSLASRESLWQKVIPDWKEQEWDPEKPNAYAGIFHFHFWRFGEWVDVVIDDRLPTVNNQLIYCHSNSRNEFWCALVEKAYAKLAGCYQALDGGNTADALVDFTGGVSEPIDLTEGDFANDETKRNQLFERMLKVHSRGGLISASIKAVTAADMEARLACGLVKGHAYAVTDVRKVRLGHGLLAFFKSEKLDMIRLRNPWGEREWNGPWSDTSEEWQKVSKSEREKMGVTVQDDGEFWMTFEDVCRYFTDIIKCRVINTSHLSIHKTWEEARLHGAWTLHEDPRQNRGGGCINHKDTFFQNPQYIFEVKKPEDEVLICIQQRPKRSTRREGKGENLAIGFDIYKVEENRQYRMHSLQHKAASSIYINSRSVFLRTDQPEGRYVIIPTTFEPGHTGEFLLRVFTDVPSNCRELRLDEPPHTCWSSLCGYPQLVTQVHVLGAAGLKDSPTGANSYVIIKCEGDKVRSAVQKGTSTPEYNVKGIFYRKKLSQPITVQVWNHRVLKDEFLGQVHLKADPDNLQALHTLHLRDRNSRQPSNLPGTVAVHILSSTSLMAV,640,NP_004046.2.csv,refseq-CAPN5-NM_004055.4_clinical_seed_0_final,refseq-CAPN5-NM_004055.4.a2m,Invitae,refseq-CAPN5-NM_004055.4.npy,1,640,640
+NP_004067.1,MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQVESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPHHKLHLFENPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYRHWNEWDASQPQLQSVRRIRDQKWHKRGRFPSS,211,NP_004067.1.csv,refseq-CRYBB3-NM_004076.4_clinical_seed_0_final,refseq-CRYBB3-NM_004076.4.a2m,Invitae,refseq-CRYBB3-NM_004076.4.npy,1,211,211
+NP_004073.2,MAQSKRHVYSRTPSGSRMSAEASARPLRVGSRVEVIGKGHRGTVAYVGATLFATGKWVGVILDEAKGKNDGTVQGRKYFTCDEGHGIFVRQSQIQVFEDGADTTSPETPDSSASKVLKREGTDTTAKTSKLRGLKPKKAPTARKTTTRRPKPTRPASTGVAGASSSLGPSGSASAGELSSSEPSTPAQTPLAAPIIPTPVLTSPGAVPPLPSPSKEEEGLRAQVRDLEEKLETLRLKRAEDKAKLKELEKHKIQLEQVQEWKSKMQEQQADLQRRLKEARKEAKEALEAKERYMEEMADTADAIEMATLDKEMAEERAESLQQEVEALKERVDELTTDLEILKAEIEEKGSDGAASSYQLKQLEEQNARLKDALVRMRDLSSSEKQEHVKLQKLMEKKNQELEVVRQQRERLQEELSQAESTIDELKEQVDAALGAEEMVEMLTDRNLNLEEKVRELRETVGDLEAMNEMNDELQENARETELELREQLDMAGARVREAQKRVEAAQETVADYQQTIKKYRQLTAHLQDVNRELTNQQEASVERQQQPPPETFDFKIKFAETKAHAKAIEMELRQMEVAQANRHMSLLTAFMPDSFLRPGGDHDCVLVLLLMPRLICKAELIRKQAQEKFELSENCSERPGLRGAAGEQLSFAAGLVYSLSLLQATLHRYEHALSQCSVDVYKKVGSLYPEMSAHERSLDFLIELLHKDQLDETVNVEPLTKAIKYYQHLYSIHLAEQPEDCTMQLADHIKFTQSALDCMSVEVGRLRAFLQGGQEATDIALLLRDLETSCSDIRQFCKKIRRRMPGTDAPGIPAALAFGPQVSDTLLDCRKHLTWVVAVLQEVAAAAAQLIAPLAENEGLLVAALEELAFKASEQIYGTPSSSPYECLRQSCNILISTMNKLATAMQEGEYDAERPPSKPPPVELRAAALRAEITDAEGLGLKLEDRETVIKELKKSLKIKGEELSEANVRLSLLEKKLDSAAKDADERIEKVQTRLEETQALLRKKEKEFEETMDALQADIDQLEAEKAELKQRLNSQSKRTIEGLRGPPPSGIATLVSGIAGEEQQRGAIPGQAPGSVPGPGLVKDSPLLLQQISAMRLHISQLQHENSILKGAQMKASLASLPPLHVAKLSHEGPGSELPAGALYRKTSQLLETLNQLSTHTHVVDITRTSPAAKSPSAQLMEQVAQLKSLSDTVEKLKDEVLKETVSQRPGATVPTDFATFPSSAFLRAKEEQQDDTVYMGKVTFSCAAGFGQRHRLVLTQEQLHQLHSRLIS,1278,NP_004073.2.csv,refseq-DCTN1-NM_004082.4_clinical_seed_0_final,refseq-DCTN1-NM_004082.4.a2m,Invitae,refseq-DCTN1-NM_004082.4.npy,1,1278,1278
+NP_004076.1,MDSSSSSSAAGLGAVDPQLQHFIEVETQKQRFQQLVHQMTELCWEKCMDKPGPKLDSRAEACFVNCVERFIDTSQFILNRLEQTQKSKPVFSESLSD,97,NP_004076.1.csv,refseq-TIMM8A-NM_004085.3_clinical_seed_0_final,refseq-TIMM8A-NM_004085.3.a2m,Invitae,refseq-TIMM8A-NM_004085.3.npy,1,97,97
+NP_004077.1,MSAAWIPALGLGVCLLLLPGPAGSEGAAPIAITCFTRGLDIRKEKADVLCPGGCPLEEFSVYGNIVYASVSSICGAAVHRGVISNSGGPVRVYSLPGRENYSSVDANGIQSQMLSRWSASFTVTKGKSSTQEATGQAVSTAHPPTGKRLKKTPEKKTGNKDCKADIAFLIDGSFNIGQRRFNLQKNFVGKVALMLGIGTEGPHVGLVQASEHPKIEFYLKNFTSAKDVLFAIKEVGFRGGNSNTGKALKHTAQKFFTVDAGVRKGIPKVVVVFIDGWPSDDIEEAGIVAREFGVNVFIVSVAKPIPEELGMVQDVTFVDKAVCRNNGFFSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRESPNKNFLVIVTDGQSYDDVQGPAAAAHDAGITIFSVGVAWAPLDDLKDMASKPKESHAFFTREFTGLEPIVSDVIRGICRDFLESQQ,550,NP_004077.1.csv,refseq-COCH-NM_004086.2_clinical_seed_0_final,refseq-COCH-NM_004086.2.a2m,Invitae,refseq-COCH-NM_004086.2.npy,1,550,550
+NP_004083.3,MAALRVLLSCVRGPLRPPVRCPAWRPFASGANFEYIIAEKRGKNNTVGLIQLNRPKALNALCDGLIDELNQALKTFEEDPAVGAIVLTGGDKAFAAGADIKEMQNLSFQDCYSSKFLKHWDHLTQVKKPVIAAVNGYAFGGGCELAMMCDIIYAGEKAQFAQPEILIGTIPGAGGTQRLTRAVGKSLAMEMVLTGDRISAQDAKQAGLVSKICPVETLVEEAIQCAEKIASNSKIVVAMAKESVNAAFEMTLTEGSKLEKKLFYSTFATDDRKEGMTAFVEKRKANFKDQ,290,NP_004083.3.csv,refseq-ECHS1-NM_004092.3_clinical_seed_0_final,refseq-ECHS1-NM_004092.3.a2m,Invitae,refseq-ECHS1-NM_004092.3.npy,1,290,290
+NP_004091.3,MEDSQDLNEQSVKKTCTESDVSQSQNSRSMEMQDLASPHTLVGGGDTPGSSKLEKSNLSSTSVTTNGTGGENMTVLNTADWLLSCNTPSSATMSLLAVKTEPLNSSETTATTGDGALDTFTGSVITSSGYSPRSAHQYSPQLYPSKPYPHILSTPAAQTMSAYAGQTQYSGMQQPAVYTAYSQTGQPYSLPTYDLGVMLPAIKTESGLSQTQSPLQSGCLSYSPGFSTPQPGQTPYSYQMPGSSFAPSSTIYANNSVSNSTNFSGSQQDYPSYTAFGQNQYAQYYSASTYGAYMTSNNTADGTPSSTSTYQLQESLPGLTNQPGEFDTMQSPSTPIKDLDERTCRSSGSKSRGRGRKNNPSPPPDSDLERVFVWDLDETIIVFHSLLTGSYAQKYGKDPPMAVTLGLRMEEMIFNLADTHLFFNDLEECDQVHIDDVSSDDNGQDLSTYSFATDGFHAAASSANLCLPTGVRGGVDWMRKLAFRYRRVKELYNTYKNNVGGLLGPAKRDAWLQLRAEIEGLTDSWLTNALKSLSIISTRSNCINVLVTTTQLIPALAKVLLYSLGGAFPIENIYSATKIGKESCFERIMQRFGRKVVYVVIGDGVEEEQAAKKHNMPFWRISSHSDLLALHQALELEYL,639,NP_004091.3.csv,refseq-EYA4-NM_004100.4_clinical_seed_0_final,refseq-EYA4-NM_004100.4.a2m,Invitae,refseq-EYA4-NM_004100.4.npy,1,639,639
+NP_004105.1,MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLNVFSRVKLFGSKKRRRRRPEPQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLFNLIPVGLRVVAIQGVQTKLYLAMNSEGYLYTSELFTPECKFKESVFENYYVTYSSMIYRQQQSGRGWYLGLNKEGEIMKGNHVKKNKPAAHFLPKPLKVAMYKEPSLHDLTEFSRSGSGTPTKSRSVSGVLNGGKSMSHNEST,245,NP_004105.1.csv,refseq-FGF13-NM_004114.3_clinical_seed_0_final,refseq-FGF13-NM_004114.3.a2m,Invitae,refseq-FGF13-NM_004114.3.npy,1,245,245
+NP_004125.3,MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAVVGIDLGTTNSCVAVMEGKQAKVLENAEGARTTPSVVAFTADGERLVGMPAKRQAVTNPNNTFYATKRLIGRRYDDPEVQKDIKNVPFKIVRASNGDAWVEAHGKLYSPSQIGAFVLMKMKETAENYLGHTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLRVINEPTAAALAYGLDKSEDKVIAVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLTKDNMALQRVREAAEKAKCELSSSVQTDINLPYLTMDSSGPKHLNMKLTRAQFEGIVTDLIRRTIAPCQKAMQDAEVSKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIGAAIQGGVLAGDVTDVLLLDVTPLSLGIETLGGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGEREMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDIDANGIVHVSAKDKGTGREQQIVIQSSGGLSKDDIENMVKNAEKYAEEDRRKKERVEAVNMAEGIIHDTETKMEEFKDQLPADECNKLKEEISKMRELLARKDSETGENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQKEEKQ,679,NP_004125.3.csv,refseq-HSPA9-NM_004134.6_clinical_seed_0_final,refseq-HSPA9-NM_004134.6.a2m,Invitae,refseq-HSPA9-NM_004134.6_theta_0.2.npy,1,679,679
+NP_004144.2,MAHYPTRLKTRKTYSWVGRPLLDRKLHYQTYREMCVKTEGCSTEIHIQIGQFVLIEGDDDENPYVAKLLELFEDDSDPPPKKRARVQWFVRFCEVPACKRHLLGRKPGAQEIFWYDYPACDSNINAETIIGLVRVIPLAPKDVVPTNLKNEKTLFVKLSWNEKKFRPLSSELFAELNKPQESAAKCQKPVRAKSKSAESPSWTPAEHVAKRIESRHSASKSRQTPTHPLTPRARKRLELGNLGNPQMSQQTSCASLDSPGRIKRKVAFSEITSPSKRSQPDKLQTLSPALKAPEKTRETGLSYTEDDKKASPEHRIILRTRIAASKTIDIREERTLTPISGGQRSSVVPSVILKPENIKKRDAKEAKAQNEATSTPHRIRRKSSVLTMNRIRQQLRFLGNSKSDQEEKEILPAAEISDSSSDEEEASTPPLPRRAPRTVSRNLRSSLKSSLHTLTKVPKKSLKPRTPRCAAPQIRSRSLAAQEPASVLEEARLRLHVSAVPESLPCREQEFQDIYNFVESKLLDHTGGCMYISGVPGTGKTATVHEVIRCLQQAAQANDVPPFQYIEVNGMKLTEPHQVYVQILQKLTGQKATANHAAELLAKQFCTRGSPQETTVLLVDELDLLWTHKQDIMYNLFDWPTHKEARLVVLAIANTMDLPERIMMNRVSSRLGLTRMCFQPYTYSQLQQILRSRLKHLKAFEDDAIQLVARKVAALSGDARRCLDICRRATEICEFSQQKPDSPGLVTIAHSMEAVDEMFSSSYITAIKNSSVLEQSFLRAILAEFRRSGLEEATFQQIYSQHVALCRMEGLPYPTMSETMAVCSHLGSCRLLLVEPSRNDLLLRVRLNVSQDDVLYALKDE,861,NP_004144.2.csv,refseq-ORC1-NM_004153.3_clinical_seed_0_final,refseq-ORC1-NM_004153.3.a2m,Invitae,refseq-ORC1-NM_004153.3.npy,1,861,861
+NP_004159.2,MSGVRGLSRLLSARRLALAKAWPTVLQTGTRGFHFTVDGNKRASAKVSDSISAQYPVVDHEFDAVVVGAGGAGLRAAFGLSEAGFNTACVTKLFPTRSHTVAAQGGINAALGNMEEDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPAAVVELENYGMPFSRTEDGKIYQRAFGGQSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRYDTSYFVEYFALDLLMENGECRGVIALCIEDGSIHRIRAKNTVVATGGYGRTYFSCTSAHTSTGDGTAMITRAGLPCQDLEFVQFHPTGIYGAGCLITEGCRGEGGILINSQGERFMERYAPVAKDLASRDVVSRSMTLEIREGRGCGPEKDHVYLQLHHLPPEQLATRLPGISETAMIFAGVDVTKEPIPVLPTVHYNMGGIPTNYKGQVLRHVNGQDQIVPGLYACGEAACASVHGANRLGANSLLDLVVFGRACALSIEESCRPGDKVPPIKPNAGEESVMNLDKLRFADGSIRTSELRLSMQKSMQNHAAVFRVGSVLQEGCGKISKLYGDLKHLKTFDRGMVWNTDLVETLELQNLMLCALQTIYGAEARKESRGAHAREDYKVRIDEYDYSKPIQGQQKKPFEEHWRKHTLSYVDVGTGKVTLEYRPVIDKTLNEADCATVPPAIRSY,664,NP_004159.2.csv,refseq-SDHA-NM_004168.2_clinical_seed_0_final,refseq-SDHA-NM_004168.2.a2m,Invitae,refseq-SDHA-NM_004168.2.npy,1,664,664
+NP_004161.4,MGKPARKGCEWKRFLKNNWVLLSTVAAVVLGITTGVLVREHSNLSTLEKFYFAFPGEILMRMLKLIILPLIISSMITGVAALDSNVSGKIGLRAVVYYFCTTLIAVILGIVLVVSIKPGVTQKVGEIARTGSTPEVSTVDAMLDLIRNMFPENLVQACFQQYKTKREEVKPPSDPEMNMTEESFTAVMTTAISKNKTKEYKIVGMYSDGINVLGLIVFCLVFGLVIGKMGEKGQILVDFFNALSDATMKIVQIIMCYMPLGILFLIAGKIIEVEDWEIFRKLGLYMATVLTGLAIHSIVILPLIYFIVVRKNPFRFAMGMAQALLTALMISSSSATLPVTFRCAEENNQVDKRITRFVLPVGATINMDGTALYEAVAAVFIAQLNDLDLGIGQIITISITATSASIGAAGVPQAGLVTMVIVLSAVGLPAEDVTLIIAVDWLLDRFRTMVNVLGDAFGTGIVEKLSKKELEQMDVSSEVNIVNPFALESTILDNEDSDTKKSYVNGGFAVDKSDTISFTQTSQF,524,NP_004161.4.csv,refseq-SLC1A1-NM_004170.5_clinical_seed_0_final,refseq-SLC1A1-NM_004170.5.a2m,Invitae,refseq-SLC1A1-NM_004170.5.npy,1,524,524
+NP_004162.2,MASTEGANNMPKQVEVRMHDSHLGSEEPKHRHLGLRLCDKLGKNLLLTLTVFGVILGAVCGGLLRLASPIHPDVVMLIAFPGDILMRMLKMLILPLIISSLITGLSGLDAKASGRLGTRAMVYYMSTTIIAAVLGVILVLAIHPGNPKLKKQLGPGKKNDEVSSLDAFLDLIRNLFPENLVQACFQQIQTVTKKVLVAPPPDEEANATSAVVSLLNETVTEVPEETKMVIKKGLEFKDGMNVLGLIGFFIAFGIAMGKMGDQAKLMVDFFNILNEIVMKLVIMIMWYSPLGIACLICGKIIAIKDLEVVARQLGMYMVTVIIGLIIHGGIFLPLIYFVVTRKNPFSFFAGIFQAWITALGTASSAGTLPVTFRCLEENLGIDKRVTRFVLPVGATINMDGTALYEAVAAIFIAQMNGVVLDGGQIVTVSLTATLASVGAASIPSAGLVTMLLILTAVGLPTEDISLLVAVDWLLDRMRTSVNVVGDSFGAGIVYHLSKSELDTIDSQHRVHEDIEMTKTQSIYDDMKNHRESNSNQCVYAAHNSVIVDECKVTLAANGKSADCSVEEEPWKREK,574,NP_004162.2.csv,refseq-SLC1A2-NM_004171.3_clinical_seed_0_final,refseq-SLC1A2-NM_004171.3.a2m,Invitae,refseq-SLC1A2-NM_004171.3.npy,1,574,574
+NP_004165.2,MWGLGARGPDRGLLLALALGGLARAGGVEVEPGGAHGESGGFQVVTFEWAHVQDPYVIALWILVASLAKIGFHLSHKVTSVVPESALLIVLGLVLGGIVWAADHIASFTLTPTVFFFYLLPPIVLDAGYFMPNRLFFGNLGTILLYAVVGTVWNAATTGLSLYGVFLSGLMGDLQIGLLDFLLFGSLMAAVDPVAVLAVFEEVHVNEVLFIIVFGESLLNDAVTVVLYNVFESFVALGGDNVTGVDCVKGIVSFFVVSLGGTLVGVVFAFLLSLVTRFTKHVRIIEPGFVFIISYLSYLTSEMLSLSAILAITFCGICCQKYVKANISEQSATTVRYTMKMLASSAETIIFMFLGISAVNPFIWTWNTAFVLLTLVFISVYRAIGVVLQTWLLNRYRMVQLEPIDQVVLSYGGLRGAVAFALVVLLDGDKVKEKNLFVSTTIIVVFFTVIFQGLTIKPLVQWLKVKRSEHREPRLNEKLHGRAFDHILSAIEDISGQIGHNYLRDKWSHFDRKFLSRVLMRRSAQKSRDRILNVFHELNLKDAISYVAEGERRGSLAFIRSPSTDNVVNVDFTPRSSTVEASVSYLLRENVSAVCLDMQSLEQRRRSIRDAEDMVTHHTLQQYLYKPRQEYKHLYSRHELTPTEDEKQDREIFHRTMRKRLESFKSTKLGLNQNKKAAKLYKRERAQKRRNSSIPNGKLPMESPAQNFTIKEKDLELSDTEEPPNYDEEMSGGIEFLASVTKDTASDSPAGIDNPVFSPDEALDRSLLARLPPWLSPGETVVPSQRARTQIPYSPGTFCRLMPFRLSSKSVDSFLQADGPEERPPAALPESTHM,834,NP_004165.2.csv,refseq-SLC9A3-NM_004174.3_clinical_seed_0_final,refseq-SLC9A3-NM_004174.3.a2m,Invitae,refseq-SLC9A3-NM_004174.3.npy,1,834,834
+NP_004167.3,MDEPPFSEAALEQALGEPCDLDAALLTDIEDMLQLINNQDSDFPGLFDPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVPVLLQPHFIKADSLLLTAMKTDGATVKAAGLSPLVSGTTVQTGPLPTLVSGGTILATVPLVVDAEKLPINRLAAGSKAPASAQSRGEKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENLSLRTAVHKSKSLKDLVSACGSGGNTDVLMEGVKTEVEDTLTPPPSDAGSPFQSSPLSLGSRGSGSGGSGSDSEPDSPVFEDSKAKPEQRPSLHSRGMLDRSRLALCTLVFLCLSCNPLASLLGARGLPSPSDTTSVYHSPGRNVLGTESRDGPGWAQWLLPPVVWLLNGLLVLVSLVLLFVYGEPVTRPHSGPAVYFWRHRKQADLDLARGDFAQAAQQLWLALRALGRPLPTSHLDLACSLLWNLIRHLLQRLWVGRWLAGRAGGLQQDCALRVDASASARDAALVYHKLHQLHTMGKHTGGHLTATNLALSALNLAECAGDAVSVATLAEIYVAAALRVKTSLPRALHFLTRFFLSSARQACLAQSGSVPPAMQWLCHPVGHRFFVDGDWSVLSTPWESLYSLAGNPVDPLAQVTQLFREHLLERALNCVTQPNPSPGSADGDKEFSDALGYLQLLNSCSDAAGAPAYSFSISSSMATTTGVDPVAKWWASLTAVVIHWLRRDEEAAERLCPLVEHLPRVLQESERPLPRAALHSFKAARALLGCAKAESGPASLTICEKASGYLQDSLATTPASSSIDKAVQLFLCDLLLVVRTSLWRQQQPPAPAPAAQGTSSRPQASALELRGFQRDLSSLRRLAQSFRPAMRRVFLHEATARLMAGASPTRTHQLLDRSLRRRAGPGGKGGAVAELEPRPTRREHAEALLLASCYLPPGFLSAPGQRVGMLAEAARTLEKLGDRRLLHDCQQMLMRLGGGTTVTSS,1147,NP_004167.3.csv,refseq-SREBF1-NM_004176.4_clinical_seed_0_final,refseq-SREBF1-NM_004176.4.a2m,Invitae,refseq-SREBF1-NM_004176.4_theta_0.2.npy,1,1147,1147
+NP_004172.2,MQLKPMEINPEMLNKVLSRLGVAGQWRFVDVLGLEEESLGSVPAPACALLLLFPLTAQHENFRKKQIEELKGQEVSPKVYFMKQTIGNSCGTIGLIHAVANNQDKLGFEDGSVLKQFLSETEKMSPEDRAKCFEKNEAIQAAHDAVAQEGQCRVDDKVNFHFILFNNVDGHLYELDGRMPFPVNHGASSEDTLLKDAAKVCREFTEREQGEVRFSAVALCKAA,223,NP_004172.2.csv,refseq-UCHL1-NM_004181.4_clinical_seed_0_final,refseq-UCHL1-NM_004181.4.a2m,Invitae,refseq-UCHL1-NM_004181.4.npy,1,223,223
+NP_004174.1,MTITYTSQVANARLGSFSRLLLCWRGSIYKLLYGEFLIFLLCYYIIRFIYRLALTEEQQLMFEKLTLYCDSYIQLIPISFVLGFYVTLVVTRWWNQYENLPWPDRLMSLVSGFVEGKDEQGRLLRRTLIRYANLGNVLILRSVSTAVYKRFPSAQHLVQAGFMTPAEHKQLEKLSLPHNMFWVPWVWFANLSMKAWLGGRIRDPILLQSLLNEMNTLRTQCGHLYAYDWISIPLVYTQVVTVAVYSFFLTCLVGRQFLNPAKAYPGHELDLVVPVFTFLQFFFYVGWLKVAEQLINPFGEDDDDFETNWIVDRNLQVSLLAVDEMHQDLPRMEPDMYWNKPEPQPPYTAASAQFRRASFMGSTFNISLNKEEMEFQPNQEDEEDAHAGIIGRFLGLQSHDHHPPRANSRTKLLWPKRESLLHEGLPKNHKAAKQNVRGQEDNKAWKLKAVDAFKSAPLYQRPGYYSAPQTPLSPTPMFFPLEPSAPSKLHSVTGIDTKDKSLKTVSSGAKKSFELLSESDGALMEHPEVSQVRRKTVEFNLTDMPEIPENHLKEPLEQSPTNIHTTLKDHMDPYWALENRDEAHS,585,NP_004174.1.csv,refseq-BEST1-NM_004183.3_clinical_seed_0_final,refseq-BEST1-NM_004183.3.a2m,Invitae,refseq-BEST1-NM_004183.3.npy,1,585,585
+NP_004175.2,MPNSEPASLLELFNSIATQGELVRSLKAGNASKDEIDSAVKMLVSLKMSYKAAAGEDYKADCPPGNPAPTSNHGPDATEAEEDFVDPWTVQTSSAKGIDYDKLIVRFGSSKIDKELINRIERATGQRPHHFLRRGIFFSHRDMNQVLDAYENKKPFYLYTGRGPSSEAMHVGHLIPFIFTKWLQDVFNVPLVIQMTDDEKYLWKDLTLDQAYSYAVENAKDIIACGFDINKTFIFSDLDYMGMSSGFYKNVVKIQKHVTFNQVKGIFGFTDSDCIGKISFPAIQAAPSFSNSFPQIFRDRTDIQCLIPCAIDQDPYFRMTRDVAPRIGYPKPALLHSTFFPALQGAQTKMSASDPNSSIFLTDTAKQIKTKVNKHAFSGGRDTIEEHRQFGGNCDVDVSFMYLTFFLEDDDKLEQIRKDYTSGAMLTGELKKALIEVLQPLIAEHQARRKEVTDEIVKEFMTPRKLSFDFQ,471,NP_004175.2.csv,refseq-WARS-NM_004184.3_clinical_seed_0_final,refseq-WARS-NM_004184.3.a2m,Invitae,refseq-WARS-NM_004184.3.npy,1,471,471
+NP_004178.2,MEPGSDDFLPPPECPVFEPSWAEFRDPLGYIAKIRPIAEKSGICKIRPPADWQPPFAVEVDNFRFTPRIQRLNELEAQTRVKLNYLDQIAKFWEIQGSSLKIPNVERRILDLYSLSKIVVEEGGYEAICKDRRWARVAQRLNYPPGKNIGSLLRSHYERIVYPYEMYQSGANLVQCNTRPFDNEEKDKEYKPHSIPLRQSVQPSKFNSYGRRAKRLQPDPEPTEEDIEKNPELKKLQIYGAGPKMMGLGLMAKDKTLRKKDKEGPECPPTVVVKEELGGDVKVESTSPKTFLESKEELSHSPEPCTKMTMRLRRNHSNAQFIESYVCRMCSRGDEDDKLLLCDGCDDNYHIFCLLPPLPEIPKGVWRCPKCVMAECKRPPEAFGFEQATREYTLQSFGEMADSFKADYFNMPVHMVPTELVEKEFWRLVNSIEEDVTVEYGADIHSKEFGSGFPVSDSKRHLTPEEEEYATSGWNLNVMPVLEQSVLCHINADISGMKVPWLYVGMVFSAFCWHIEDHWSYSINYLHWGEPKTWYGVPSLAAEHLEEVMKKLTPELFDSQPDLLHQLVTLMNPNTLMSHGVPVVRTNQCAGEFVITFPRAYHSGFNQGYNFAEAVNFCTADWLPAGRQCIEHYRRLRRYCVFSHEELICKMAACPEKLDLNLAAAVHKEMFIMVQEERRLRKALLEKGITEAEREAFELLPDDERQCIKCKTTCFLSALACYDCPDGLVCLSHINDLCKCSSSRQYLRYRYTLDELPAMLHKLKVRAESFDTWANKVRVALEVEDGRKRSLEELRALESEARERRFPNSELLQQLKNCLSEAEACVSRALGLVSGQEAGPHRVAGLQMTLTELRAFLDQMNNLPCAMHQIGDVKGVLEQVEAYQAEAREALASLPSSPGLLQSLLERGRQLGVEVPEAQQLQRQVEQARWLDEVKRTLAPSARRGTLAVMRGLLVAGASVAPSPAVDKAQAELQELLTIAERWEEKAHLCLEARQKHPPATLEAIIREAENIPVHLPNIQALKEALAKARAWIADVDEIQNGDHYPCLDDLEGLVAVGRDLPVGLEELRQLELQVLTAHSWREKASKTFLKKNSCYTLLEVLCPCADAGSDSTKRSRWMEKELGLYKSDTELLGLSAQDLRDPGSVIVAFKEGEQKEKEGILQLRRTNSAKPSPLASSSTASSTTSICVCGQVLAGAGALQCDLCQDWFHGRCVSVPRLLSSPRPNPTSSPLLAWWEWDTKFLCPLCMRSRRPRLETILALLVALQRLPVRLPEGEALQCLTERAISWQGRARQALASEDVTALLGRLAELRQRLQAEPRPEEPPNYPAAPASDPLREGSGKDMPKVQGLLENGDSVTSPEKVAPEEGSGKRDLELLSSLLPQLTGPVLELPEATRAPLEELMMEGDLLEVTLDENHSIWQLLQAGQPPDLERIRTLLELEKAERHGSRARGRALERRRRRKVDRGGEGDDPAREELEPKRVRSSGPEAEEVQEEEELEEETGGEGPPAPIPTTGSPSTQENQNGLEPAEGTTSGPSAPFSTLTPRLHLPCPQQPPQQQL,1560,NP_004178.2.csv,refseq-KDM5C-NM_004187.3_clinical_seed_0_final,refseq-KDM5C-NM_004187.3.a2m,Invitae,refseq-KDM5C-NM_004187.3.npy,1,1560,1560
+NP_004179.3,MPRSFLVKSKKAHTYHQPRVQEDEPLWPPALTPVPRDQAPSNSPVLSTLFPNQCLDWTNLKREPELEQDQNLARMAPAPEGPIVLSRPQDGDSPLSDSPPFYKPSFSWDTLATTYGHSYRQAPSTMQSAFLEHSVSLYGSPLVPSTEPALDFSLRYSPGMDAYHCVKCNKVFSTPHGLEVHVRRSHSGTRPFACDICGKTFGHAVSLEQHTHVHSQERSFECRMCGKAFKRSSTLSTHLLIHSDTRPYPCQFCGKRFHQKSDMKKHTYIHTGEKPHKCQVCGKAFSQSSNLITHSRKHTGFKPFSCELCTKGFQRKVDLRRHRESQHNLK,330,NP_004179.3.csv,refseq-GFI1B-NM_004188.6_clinical_seed_0_final,refseq-GFI1B-NM_004188.6.a2m,Invitae,refseq-GFI1B-NM_004188.6.npy,1,330,330
+NP_004195.2,MVLKAFFPTCCVSTDSGLLVGRWVPEQSSAVVLAVLHFPFIPIQVKQLLAQVRQASQVGVAVLGTWCHCRQEPEESLGRFLESLGAVFPHEPWLRLCRERGGTFWSCEATHRQAPTAPGAPGEDQVMLIFYDQRQVLLSQLHLPTVLPDRQAGATTASTGGLAAVFDTVARSEVLFRSDRFDEGPVRLSHWQSEGVEASILAELARRASGPICLLLASLLSLVSAVSACRVFKLWPLSFLGSKLSTCEQLRHRLEHLTLIFSTRKAENPAQLMRKANTVASVLLDVALGLMLLSWLHGRSRIGHLADALVPVADHVAEELQHLLQWLMGAPAGLKMNRALDQVLGRFFLYHIHLWISYIHLMSPFVEHILWHVGLSACLGLTVALSLLSDIIALLTFHIYCFYVYGARLYCLKIHGLSSLWRLFRGKKWNVLRQRVDSCSYDLDQLFIGTLLFTILLFLLPTTALYYLVFTLLRLLVVAVQGLIHLLVDLINSLPLYSLGLRLCRPYRLAAGVKFRVLRHEAGRPLRLLMQINPLPYSRVVHTYRLPSCGCHPKHSWGALCRKLFLGELIYPWRQRGDKQD,581,NP_004195.2.csv,refseq-PIGQ-NM_004204.3_clinical_seed_0_final,refseq-PIGQ-NM_004204.3.a2m,Invitae,refseq-PIGQ-NM_004204.3.npy,1,581,581
+NP_004199.1,MFRCGGLAAGALKQKLVPLVRTVCVRSPRQRNRLPGNLFQRWHVPLELQMTRQMASSGASGGKIDNSVLVLIVGLSTVGAGAYAYKTMKEDEKRYNERISGLGLTPEQKQKKAALSASEGEEVPQDKAPSHVPFLLIGGGTAAFAAARSIRARDPGARVLIVSEDPELPYMRPPLSKELWFSDDPNVTKTLRFKQWNGKERSIYFQPPSFYVSAQDLPHIENGGVAVLTGKKVVQLDVRDNMVKLNDGSQITYEKCLIATGGTPRSLSAIDRAGAEVKSRTTLFRKIGDFRSLEKISREVKSITIIGGGFLGSELACALGRKARALGTEVIQLFPEKGNMGKILPEYLSNWTMEKVRREGVKVMPNAIVQSVGVSSGKLLIKLKDGRKVETDHIVAAVGLEPNVELAKTGGLEIDSDFGGFRVNAELQARSNIWVAGDAACFYDIKLGRRRVEHHDHAVVSGRLAGENMTGAAKPYWHQSMFWSDLGPDVGYEAIGLVDSSLPTVGVFAKATAQDNPKSATEQSGTGIRSESETESEASEITIPPSTPAVPQAPVQGEDYGKGVIFYLRDKVVVGIVLWNIFNRMPIARKIIKDGEQHEDLNEVAKLFNIHED,613,NP_004199.1.csv,refseq-AIFM1-NM_004208.3_clinical_seed_0_final,refseq-AIFM1-NM_004208.3.a2m,Invitae,refseq-AIFM1-NM_004208.3.npy,1,613,613
+NP_004202.4,MDCSAPKEMNKLPANSPEAAAAQGHPDGPCAPRTSPEQELPAAAAPPPPRVPRSASTGAQTFQSADARACEAERPGVGSCKLSSPRAQAASAALRDLREAQGAQASPPPGSSGPGNALHCKIPFLRGPEGDANVSVGKGTLERNNTPVVGWVNMSQSTVVLATDGITSVLPGSVATVATQEDEQGDENKARGNWSSKLDFILSMVGYAVGLGNVWRFPYLAFQNGGGAFLIPYLMMLALAGLPIFFLEVSLGQFASQGPVSVWKAIPALQGCGIAMLIISVLIAIYYNVIICYTLFYLFASFVSVLPWGSCNNPWNTPECKDKTKLLLDSCVISDHPKIQIKNSTFCMTAYPNVTMVNFTSQANKTFVSGSEEYFKYFVLKISAGIEYPGEIRWPLALCLFLAWVIVYASLAKGIKTSGKVVYFTATFPYVVLVILLIRGVTLPGAGAGIWYFITPKWEKLTDATVWKDAATQIFFSLSAAWGGLITLSSYNKFHNNCYRDTLIVTCTNSATSIFAGFVIFSVIGFMANERKVNIENVADQGPGIAFVVYPEALTRLPLSPFWAIIFFLMLLTLGLDTMFATIETIVTSISDEFPKYLRTHKPVFTLGCCICFFIMGFPMITQGGIYMFQLVDTYAASYALVIIAIFELVGISYVYGLQRFCEDIEMMIGFQPNIFWKVCWAFVTPTILTFILCFSFYQWEPMTYGSYRYPNWSMVLGWLMLACSVIWIPIMFVIKMHLAPGRFIERLKLVCSPQPDWGPFLAQHRGERYKNMIDPLGTSSLGLKLPVKDLELGTQC,797,NP_004202.4.csv,refseq-SLC6A5-NM_004211.3_clinical_seed_0_final,refseq-SLC6A5-NM_004211.3.a2m,Invitae,refseq-SLC6A5-NM_004211.3.npy,1,797,797
+NP_004209.2,MGTRDDEYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLESKSTIGVEFATRSIQVDGKTIKAQIWDTAGQERYRAITSAYYRGAVGALLVYDIAKHLTYENVERWLKELRDHADSNIVIMLVGNKSDLRHLRAVPTDEARAFAEKNNLSFIETSALDSTNVEEAFKNILTEIYRIVSQKQIADRAAHDESPGNNVVDISVPPTTDGQKPNKLQCCQNL,218,NP_004209.2.csv,refseq-RAB11B-NM_004218.4_clinical_seed_0_final,refseq-RAB11B-NM_004218.4.a2m,Invitae,refseq-RAB11B-NM_004218.4_theta_0.2.npy,1,218,218
+NP_004229.1,MSNRPNNNPGGSLRRSQRNTAGAQPQDDSIGGRSCSSSSAVIVPQPEDPDRANTSERQKTGQVPKKDNSRGVKRSASPDYNRTNSPSSAKKPKALQHTESPSETNKPHSKSKKRHLDQEQQLKSAQSPSTSKAHTRKSGATGGSRSQKRKRTESSCVKSGSGSESTGAEERSAKPTKLASKSATSAKAGCSTITDSSSAASTSSSSSAVASASSTVPPGARVKQGKDQNKARRSRSASSPSPRRSSREKEQSKTGGSSKFDWAARFSPKVSLPKTKLSLPGSSKSETSKPGPSGLQAKLASLRKSTKKRSESPPAELPSLRRSTRQKTTGSCASTSRRGSGLGKRGAAEARRQEKMADPESNQEAVNSSAARTDEAPQGAAGAVGMTTSGESESDDSEMGRLQALLEARGLPPHLFGPLGPRMSQLFHRTIGSGASSKAQQLLQGLQASDESQQLQAVIEMCQLLVMGNEETLGGFPVKSVVPALITLLQMEHNFDIMNHACRALTYMMEALPRSSAVVVDAIPVFLEKLQVIQCIDVAEQALTALEMLSRRHSKAILQAGGLADCLLYLEFFSINAQRNALAIAANCCQSITPDEFHFVADSLPLLTQRLTHQDKKSVESTCLCFARLVDNFQHEENLLQQVASKDLLTNVQQLLVVTPPILSSGMFIMVVRMFSLMCSNCPTLAVQLMKQNIAETLHFLLCGASNGSCQEQIDLVPRSPQELYELTSLICELMPCLPKEGIFAVDTMLKKGNAQNTDGAIWQWRDDRGLWHPYNRIDSRIIEQINEDTGTARAIQRKPNPLANSNTSGYSESKKDDARAQLMKEDPELAKSFIKTLFGVLYEVYSSSAGPAVRHKCLRAILRIIYFADAELLKDVLKNHAVSSHIASMLSSQDLKIVVGALQMAEILMQKLPDIFSVYFRREGVMHQVKHLAESESLLTSPPKACTNGSGSMGSTTSVSSGTATAATHAAADLGSPSLQHSRDDSLDLSPQGRLSDVLKRKRLPKRGPRRPKYSPPRDDDKVDNQAKSPTTTQSPKSSFLASLNPKTWGRLSTQSNSNNIEPARTAGGSGLARAASKDTISNNREKIKGWIKEQAHKFVERYFSSENMDGSNPALNVLQRLCAATEQLNLQVDGGAECLVEIRSIVSESDVSSFEIQHSGFVKQLLLYLTSKSEKDAVSREIRLKRFLHVFFSSPLPGEEPIGRVEPVGNAPLLALVHKMNNCLSQMEQFPVKVHDFPSGNGTGGSFSLNRGSQALKFFNTHQLKCQLQRHPDCANVKQWKGGPVKIDPLALVQAIERYLVVRGYGRVREDDEDSDDDGSDEEIDESLAAQFLNSGNVRHRLQFYIGEHLLPYNMTVYQAVRQFSIQAEDERESTDDESNPLGRAGIWTKTHTIWYKPVREDEESNKDCVGGKRGRAQTAPTKTSPRNAKKHDELWHDGVCPSVSNPLEVYLIPTPPENITFEDPSLDVILLLRVLHAISRYWYYLYDNAMCKEIIPTSEFINSKLTAKANRQLQDPLVIMTGNIPTWLTELGKTCPFFFPFDTRQMLFYVTAFDRDRAMQRLLDTNPEINQSDSQDSRVAPRLDRKKRTVNREELLKQAESVMQDLGSSRAMLEIQYENEVGTGLGPTLEFYALVSQELQRADLGLWRGEEVTLSNPKGSQEGTKYIQNLQGLFALPFGRTAKPAHIAKVKMKFRFLGKLMAKAIMDFRLVDLPLGLPFYKWMLRQETSLTSHDLFDIDPVVARSVYHLEDIVRQKKRLEQDKSQTKESLQYALETLTMNGCSVEDLGLDFTLPGFPNIELKKGGKDIPVTIHNLEEYLRLVIFWALNEGVSRQFDSFRDGFESVFPLSHLQYFYPEELDQLLCGSKADTWDAKTLMECCRPDHGYTHDSRAVKFLFEILSSFDNEQQRLFLQFVTGSPRLPVGGFRSLNPPLTIVRKTFESTENPDDFLPSVMTCVNYLKLPDYSSIEIMREKLLIAAREGQQSFHLS,1992,NP_004229.1.csv,refseq-TRIP12-NM_004238.2_clinical_seed_0_final,refseq-TRIP12-NM_004238.2.a2m,Invitae,refseq-TRIP12-NM_004238.2.npy,1,1992,1992
+NP_004230.2,MSSWLGGLGSGLGQSLGQVGGSLASLTGQISNFTKDMLMEGTEEVEAELPDSRTKEIEAIHAILRSENERLKKLCTDLEEKHEASEIQIKQQSTSYRNQLQQKEVEISHLKARQIALQDQLLKLQSAAQSVPSGAGVPATTASSSFAYGISHHPSAFHDDDMDFGDIISSQQEINRLSNEVSRLESEVGHWRHIAQTSKAQGTDNSDQSEICKLQNIIKELKQNRSQEIDDHQHEMSVLQNAHQQKLTEISRRHREELSDYEERIEELENLLQQGGSGVIETDLSKIYEMQKTIQVLQIEKVESTKKMEQLEDKIKDINKKLSSAENDRDILRREQEQLNVEKRQIMEECENLKLECSKLQPSAVKQSDTMTEKERILAQSASVEEVFRLQQALSDAENEIMRLSSLNQDNSLAEDNLKLKMRIEVLEKEKSLLSQEKEELQMSLLKLNNEYEVIKSTATRDISLDSELHDLRLNLEAKEQELNQSISEKETLIAEIEELDRQNQEATKHMILIKDQLSKQQNEGDSIISKLKQDLNDEKKRVHQLEDDKMDITKELDVQKEKLIQSEVALNDLHLTKQKLEDKVENLVDQLNKSQESNVSIQKENLELKEHIRQNEEELSRIRNELMQSLNQDSNSNFKDTLLKEREAEVRNLKQNLSELEQLNENLKKVAFDVKMENEKLVLACEDVRHQLEECLAGNNQLSLEKNTIVETLKMEKGEIEAELCWAKKRLLEEANKYEKTIEELSNARNLNTSALQLEHEHLIKLNQKKDMEIAELKKNIEQMDTDHKETKDVLSSSLEEQKQLTQLINKKEIFIEKLKERSSKLQEELDKYSQALRKNEILRQTIEEKDRSLGSMKEENNHLQEELERLREEQSRTAPVADPKTLDSVTELASEVSQLNTIKEHLEEEIKHHQKIIEDQNQSKMQLLQSLQEQKKEMDEFRYQHEQMNATHTQLFLEKDEEIKSLQKTIEQIKTQLHEERQDIQTDNSDIFQETKVQSLNIENGSEKHDLSKAETERLVKGIKERELEIKLLNEKNISLTKQIDQLSKDEVGKLTQIIQQKDLEIQALHARISSTSHTQDVVYLQQQLQAYAMEREKVFAVLNEKTRENSHLKTEYHKMMDIVAAKEAALIKLQDENKKLSTRFESSGQDMFRETIQNLSRIIREKDIEIDALSQKCQTLLAVLQTSSTGNEAGGVNSNQFEELLQERDKLKQQVKKMEEWKQQVMTTVQNMQHESAQLQEELHQLQAQVLVDSDNNSKLQVDYTGLIQSYEQNETKLKNFGQELAQVQHSIGQLCNTKDLLLGKLDIISPQLSSASLLTPQSAECLRASKSEVLSESSELLQQELEELRKSLQEKDATIRTLQENNHRLSDSIAATSELERKEHEQTDSEIKQLKEKQDVLQKLLKEKDLLIKAKSDQLLSSNENFTNKVNENELLRQAVTNLKERILILEMDIGKLKGENEKIVETYRGKETEYQALQETNMKFSMMLREKEFECHSMKEKALAFEQLLKEKEQGKTGELNQLLNAVKSMQEKTVVFQQERDQVMLALKQKQMENTALQNEVQRLRDKEFRSNQELERLRNHLLESEDSYTREALAAEDREAKLRKKVTVLEEKLVSSSNAMENASHQASVQVESLQEQLNVVSKQRDETALQLSVSQEQVKQYALSLANLQMVLEHFQQEEKAMYSAELEKQKQLIAEWKKNAENLEGKVISLQECLDEANAALDSASRLTEQLDVKEEQIEELKRQNELRQEMLDDVQKKLMSLANSSEGKVDKVLMRNLFIGHFHTPKNQRHEVLRLMGSILGVRREEMEQLFHDDQGGVTRWMTGWLGGGSKSVPNTPLRPNQQSVVNSSFSELFVKFLETESHPSIPPPKLSVHDMKPLDSPGRRKRDTNAPESFKDTAESRSGRRTDVNPFLAPRSAAVPLINPAGLGPGGPGHLLLKPISDVLPTFTPLPALPDNSAGVVLKDLLKQ,1979,NP_004230.2.csv,refseq-TRIP11-NM_004239.4_clinical_seed_0_final,refseq-TRIP11-NM_004239.4.a2m,Invitae,refseq-TRIP11-NM_004239.4_theta_0.2.npy,1,1979,1979
+NP_004238.3,MDTDLYDEFGNYIGPELDSDEDDDELGRETKDLDEMDDDDDDDDVGDHDDDHPGMEVVLHEDKKYYPTAEEVYGPEVETIVQEEDTQPLTEPIIKPVKTKKFTLMEQTLPVTVYEMDFLADLMDNSELIRNVTLCGHLHHGKTCFVDCLIEQTHPEIRKRYDQDLCYTDILFTEQERGVGIKSTPVTVVLPDTKGKSYLFNIMDTPGHVNFSDEVTAGLRISDGVVLFIDAAEGVMLNTERLIKHAVQERLAVTVCINKIDRLILELKLPPTDAYYKLRHIVDEVNGLISMYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQVVGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQGFQWGTREGPLCDELIRNVKFKILDAVVAQEPLHRGGGQIIPTARRVVYSAFLMATPRLMEPYYFVEVQAPADCVSAVYTVLARRRGHVTQDAPIPGSPLYTIKAFIPAIDSFGFETDLRTHTQGQAFSLSVFHHWQIVPGDPLDKSIVIRPLEPQPAPHLAREFMIKTRRRKGLSEDVSISKFFDDPMLLELAKQDVVLNYPM,972,NP_004238.3.csv,refseq-EFTUD2-NM_004247.3_clinical_seed_0_final,refseq-EFTUD2-NM_004247.3.a2m,Invitae,refseq-EFTUD2-NM_004247.3.npy,1,972,972
+NP_004251.4,MERLRDVRERLQAWERAFRRQRGRRPSQDDVEAAPEETRALYREYRTLKRTTGQAGGGLRSSESLPAAAEEAPEPRCWGPHLNRAATKSPQSTPGRSRQGSVPDYGQRLKANLKGTLQAGPALGRRPWPLGRASSKASTPKPPGTGPVPSFAEKVSDEPPQLPEPQPRPGRLQHLQASLSQRLGSLDPGWLQRCHSEVPDFLGAPKACRPDLGSEESQLLIPGESAVLGPGAGSQGPEASAFQEVSIRVGSPQPSSSGGEKRRWNEEPWESPAQVQQESSQAGPPSEGAGAVAVEEDPPGEPVQAQPPQPCSSPSNPRYHGLSPSSQARAGKAEGTAPLHIFPRLARHDRGNYVRLNMKQKHYVRGRALRSRLLRKQAWKQKWRKKGECFGGGGATVTTKESCFLNEQFDHWAAQCPRPASEEDTDAVGPEPLVPSPQPVPEVPSLDPTVLPLYSLGPSGQLAETPAEVFQALEQLGHQAFRPGQERAVMRILSGISTLLVLPTGAGKSLCYQLPALLYSRRSPCLTLVVSPLLSLMDDQVSGLPPCLKAACIHSGMTRKQRESVLQKIRAAQVHVLMLTPEALVGAGGLPPAAQLPPVAFACIDEAHCLSQWSHNFRPCYLRVCKVLRERMGVHCFLGLTATATRRTASDVAQHLAVAEEPDLHGPAPVPTNLHLSVSMDRDTDQALLTLLQGKRFQNLDSIIIYCNRREDTERIAALLRTCLHAAWVPGSGGRAPKTTAEAYHAGMCSRERRRVQRAFMQGQLRVVVATVAFGMGLDRPDVRAVLHLGLPPSFESYVQAVGRAGRDGQPAHCHLFLQPQGEDLRELRRHVHADSTDFLAVKRLVQRVFPACTCTCTRPPSEQEGAVGGERPVPKYPPQEAEQLSHQAAPGPRRVCMGHERALPIQLTVQALDMPEEAIETLLCYLELHPHHWLELLATTYTHCRLNCPGGPAQLQALAHRCPPLAVCLAQQLPEDPGQGSSSVEFDMVKLVDSMGWELASVRRALCQLQWDHEPRTGVRRGTGVLVEFSELAFHLRSPGDLTAEEKDQICDFLYGRVQARERQALARLRRTFQAFHSVAFPSCGPCLEQQDEERSTRLKDLLGRYFEEEEGQEPGGMEDAQGPEPGQARLQDWEDQVRCDIRQFLSLRPEEKFSSRAVARIFHGIGSPCYPAQVYGQDRRFWRKYLHLSFHALVGLATEELLQVAR,1208,NP_004251.4.csv,RECQ4_HUMAN_b07_clinical_seed_0_final,RECQ4_HUMAN_b07.a2m,EVE,RECQ4_HUMAN_b07_theta_0.2.npy,1,1208,1208
+NP_004259.3,MSGVRAVRISIESACEKQVHEVGLDGTETYLPPLSMSQNLARLAQRIDFSQGSGSEEEEAAGTEGDAQEWPGAGSSADQDDEEGVVKFQPSLWPWDSVRNNLRSALTEMCVLYDVLSIVRDKKFMTLDPVSQDALPPKQNPQTLQLISKKKSLAGAAQILLKGAERLTKSVTENQENKLQRDFNSELLRLRQHWKLRKVGDKILGDLSYRSAGSLFPHHGTFEVIKNTDLDLDKKIPEDYCPLDVQIPSDLEGSAYIKVSIQKQAPDIGDLGTVNLFKRPLPKSKPGSPHWQTKLEAAQNVLLCKEIFAQLSREAVQIKSQVPHIVVKNQIISQPFPSLQLSISLCHSSNDKKSQKFATEKQCPEDHLYVLEHNLHLLIREFHKQTLSSIMMPHPASAPFGHKRMRLSGPQAFDKNEINSLQSSEGLLEKIIKQAKHIFLRSRAAATIDSLASRIEDPQIQAHWSNINDVYESSVKVLITSQGYEQICKSIQLQLNIGVEQIRVVHRDGRVITLSYQEQELQDFLLSQMSQHQVHAVQQLAKVMGWQVLSFSNHVGLGPIESIGNASAITVASPSGDYAISVRNGPESGSKIMVQFPRNQCKDLPKSDVLQDNKWSHLRGPFKEVQWNKMEGRNFVYKMELLMSALSPCLL,651,NP_004259.3.csv,refseq-MED17-NM_004268.4_clinical_seed_0_final,refseq-MED17-NM_004268.4.a2m,Invitae,refseq-MED17-NM_004268.4.npy,1,651,651
+NP_004264.2,MEKGLTLPQDCRDFVHSLKMRSKYALFLVFVVIVFVFIEKENKIISRVSDKLKQIPQALADANSTDPALILAENASLLSLSELDSAFSQLQSRLRNLSLQLGVEPAMEAAGEEEEEQRKEEEPPRPAVAGPRRHVLLMATTRTGSSFVGEFFNQQGNIFYLFEPLWHIERTVSFEPGGANAAGSALVYRDVLKQLFLCDLYVLEHFITPLPEDHLTQFMFRRGSSRSLCEDPVCTPFVKKVFEKYHCKNRRCGPLNVTLAAEACRRKEHMALKAVRIRQLEFLQPLAEDPRLDLRVIQLVRDPRAVLASRMVAFAGKYKTWKKWLDDEGQDGLREEEVQRLRGNCESIRLSAELGLRQPAWLRGRYMLVRYEDVARGPLQKAREMYRFAGIPLTPQVEDWIQKNTQAAHDGSGIYSTQKNSSEQFEKWRFSMPFKLAQVVQAACGPAMRLFGYKLARDAAALTNRSVSLLEERGTFWVT,479,NP_004264.2.csv,refseq-CHST3-NM_004273.4_clinical_seed_0_final,refseq-CHST3-NM_004273.4.a2m,Invitae,refseq-CHST3-NM_004273.4.npy,1,479,479
+NP_004269.1,MEAMWLLCVALAVLAWGFLWVWDSSERMKSREQGGRLGAESRTLLVIAHPDDEAMFFAPTVLGLARLRHWVYLLCFSAGNYYNQGETRKKELLQSCDVLGIPLSSVMIIDNRDFPDDPGMQWDTEHVARVLLQHIEVNGINLVVTFDAGGVSGHSNHIALYAAVRALHSEGKLPKGCSVLTLQSVNVLRKYISLLDLPLSLLHTQDVLFVLNSKEVAQAKKAMSCHRSQLLWFRRLYIIFSRYMRINSLSFL,252,NP_004269.1.csv,refseq-PIGL-NM_004278.3_clinical_seed_0_final,refseq-PIGL-NM_004278.3.a2m,Invitae,refseq-PIGL-NM_004278.3.npy,1,252,252
+NP_004270.2,MAAAAARVVLSSAARRRLWGFSESLLIRGAAGRSLYFGENRLRSTQAATQVVLNVPETRVTCLESGLRVASEDSGLSTCTVGLWIDAGSRYENEKNNGTAHFLEHMAFKGTKKRSQLDLELEIENMGAHLNAYTSREQTVYYAKAFSKDLPRAVEILADIIQNSTLGEAEIERERGVILREMQEVETNLQEVVFDYLHATAYQNTALGRTILGPTENIKSISRKDLVDYITTHYKGPRIVLAAAGGVSHDELLDLAKFHFGDSLCTHKGEIPALPPCKFTGSEIRVRDDKMPLAHLAIAVEAVGWAHPDTICLMVANTLIGNWDRSFGGGMNLSSKLAQLTCHGNLCHSFQSFNTSYTDTGLWGLYMVCESSTVADMLHVVQKEWMRLCTSVTESEVARARNLLKTNMLLQLDGSTPICEDIGRQMLCYNRRIPIPELEARIDAVNAETIREVCTKYIYNRSPAIAAVGPIKQLPDFKQIRSNMCWLRD,489,NP_004270.2.csv,refseq-PMPCB-NM_004279.2_clinical_seed_0_final,refseq-PMPCB-NM_004279.2.a2m,Invitae,refseq-PMPCB-NM_004279.2.npy,1,489,489
+NP_004272.2,MSAATHSPMMQVASGNGDRDPLPPGWEIKIDPQTGWPFFVDHNSRTTTWNDPRVPSEGPKETPSSANGPSREGSRLPPAREGHPVYPQLRPGYIPIPVLHEGAENRQVHPFHVYPQPGMQRFRTEAAAAAPQRSQSPLRGMPETTQPDKQCGQVAAAAAAQPPASHGPERSQSPAASDCSSSSSSASLPSSGRSSLGSHQLPRGYISIPVIHEQNVTRPAAQPSFHQAQKTHYPAQQGEYQTHQPVYHKIQGDDWEPRPLRAASPFRSSVQGASSREGSPARSSTPLHSPSPIRVHTVVDRPQQPMTHRETAPVSQPENKPESKPGPVGPELPPGHIPIQVIRKEVDSKPVSQKPPPPSEKVEVKVPPAPVPCPPPSPGPSAVPSSPKSVATEERAAPSTAPAEATPPKPGEAEAPPKHPGVLKVEAILEKVQGLEQAVDNFEGKKTDKKYLMIEEYLTKELLALDSVDPEGRADVRQARRDGVRKVQTILEKLEQKAIDVPGQVQVYELQPSNLEADQPLQAIMEMGAVAADKGKKNAGNAEDPHTETQQPEATAAATSNPSSMTDTPGNPAAP,575,NP_004272.2.csv,refseq-BAG3-NM_004281.3_clinical_seed_0_final,refseq-BAG3-NM_004281.3.a2m,Invitae,refseq-BAG3-NM_004281.3.npy,1,575,575
+NP_004278.2,MDPLFQQTHKQVHEIQSCMGRLETADKQSVHIVENEIQASIDQIFSRLERLEILSSKEPPNKRQNARLRVDQLKYDVQHLQTALRNFQHRRHAREQQERQREELLSRTFTTNDSDTTIPMDESLQFNSSLQKVHNGMDDLILDGHNILDGLRTQRLTLKGTQKKILDIANMLGLSNTVMRLIEKRAFQDKYFMIGGMLLTCVVMFLVVQYLT,212,NP_004278.2.csv,refseq-GOSR2-NM_004287.3_clinical_seed_0_final,refseq-GOSR2-NM_004287.3.a2m,Invitae,refseq-GOSR2-NM_004287.3.npy,1,212,212
+NP_004288.1,MAGCCCLSAEEKESQRISAEIERQLRRDKKDARRELKLLLLGTGESGKSTFIKQMRIIHGSGYSDEDRKGFTKLVYQNIFTAMQAMIRAMDTLRIQYVCEQNKENAQIIREVEVDKVSMLSREQVEAIKQLWQDPGIQECYDRRREYQLSDSAKYYLTDIDRIATPSFVPTQQDVLRVRVPTTGIIEYPFDLENIIFRMVDVGGQRSERRKWIHCFESVTSIIFLVALSEYDQVLAECDNENRMEESKALFKTIITYPWFLNSSVILFLNKKDLLEEKIMYSHLISYFPEYTGPKQDVRAARDFILKLYQDQNPDKEKVIYSHFTCATDTDNIRFVFAAVKDTILQLNLREFNLV,355,NP_004288.1.csv,refseq-GNA14-NM_004297.3_clinical_seed_0_final,refseq-GNA14-NM_004297.3.a2m,Invitae,refseq-GNA14-NM_004297.3.npy,1,355,355
+NP_004290.2,MALLAMHSWRWAAAAAAFEKRRHSAILIRPLVSVSGSGPQWRPHQLGALGTARAYQQIPESLKSITWQRLGKGNSGQFLDAAKALQVWPLIEKRTCWHGHAGGGLHTDPKEGLKDVDTRKIIKAMLSYVWPKDRPDLRARVAISLGFLGGAKAMNIVVPFMFKYAVDSLNQMSGNMLNLSDAPNTVATMATAVLIGYGVSRAGAAFFNEVRNAVFGKVAQNSIRRIAKNVFLHLHNLDLGFHLSRQTGALSKAIDRGTRGISFVLSALVFNLLPIMFEVMLVSGVLYYKCGAQFALVTLGTLGTYTAFTVAVTRWRTRFRIEMNKADNDAGNAAIDSLLNYETVKYFNNERYEAQRYDGFLKTYETASLKSTSTLAMLNFGQSAIFSVGLTAIMVLASQGIVAGTLTVGDLVMVNGLLFQLSLPLNFLGTVYRETRQALIDMNTLFTLLKVDTQIKDKVMASPLQITPQTATVAFDNVHFEYIEGQKVLSGISFEVPAGKKVAIVGGSGSGKSTIVRLLFRFYEPQKGSIYLAGQNIQDVSLESLRRAVGVVPQDAVLFHNTIYYNLLYGNISASPEEVYAVAKLAGLHDAILRMPHGYDTQVGERGLKLSGGEKQRVAIARAILKDPPVILYDEATSSLDSITEETILGAMKDVVKHRTSIFIAHRLSTVVDADEIIVLDQGKVAERGTHHGLLANPHSIYSEMWHTQSSRVQNHDNPKWEAKKENISKEEERKKLQEEIVNSVKGCGNCSC,753,NP_004290.2.csv,refseq-ABCB7-NM_004299.4_clinical_seed_0_final,refseq-ABCB7-NM_004299.4.a2m,Invitae,refseq-ABCB7-NM_004299.4_theta_0.2.npy,1,753,753
+NP_004295.2,MGAIGLLWLLPLLLSTAAVGSGMGTGQRAGSPAAGPPLQPREPLSYSRLQRKSLAVDFVVPSLFRVYARDLLLPPSSSELKAGRPEARGSLALDCAPLLRLLGPAPGVSWTAGSPAPAEARTLSRVLKGGSVRKLRRAKQLVLELGEEAILEGCVGPPGEAAVGLLQFNLSELFSWWIRQGEGRLRIRLMPEKKASEVGREGRLSAAIRASQPRLLFQIFGTGHSSLESPTNMPSPSPDYFTWNLTWIMKDSFPFLSHRSRYGLECSFDFPCELEYSPPLHDLRNQSWSWRRIPSEEASQMDLLDGPGAERSKEMPRGSFLLLNTSADSKHTILSPWMRSSSEHCTLAVSVHRHLQPSGRYIAQLLPHNEAAREILLMPTPGKHGWTVLQGRIGRPDNPFRVALEYISSGNRSLSAVDFFALKNCSEGTSPGSKMALQSSFTCWNGTVLQLGQACDFHQDCAQGEDESQMCRKLPVGFYCNFEDGFCGWTQGTLSPHTPQWQVRTLKDARFQDHQDHALLLSTTDVPASESATVTSATFPAPIKSSPCELRMSWLIRGVLRGNVSLVLVENKTGKEQGRMVWHVAAYEGLSLWQWMVLPLLDVSDRFWLQMVAWWGQGSRAIVAFDNISISLDCYLTISGEDKILQNTAPKSRNLFERNPNKELKPGENSPRQTPIFDPTVHWLFTTCGASGPHGPTQAQCNNAYQNSNLSVEVGSEGPLKGIQIWKVPATDTYSISGYGAAGGKGGKNTMMRSHGVSVLGIFNLEKDDMLYILVGQQGEDACPSTNQLIQKVCIGENNVIEEEIRVNRSVHEWAGGGGGGGGATYVFKMKDGVPVPLIIAAGGGGRAYGAKTDTFHPERLENNSSVLGLNGNSGAAGGGGGWNDNTSLLWAGKSLQEGATGGHSCPQAMKKWGWETRGGFGGGGGGCSSGGGGGGYIGGNAASNNDPEMDGEDGVSFISPLGILYTPALKVMEGHGEVNIKHYLNCSHCEVDECHMDPESHKVICFCDHGTVLAEDGVSCIVSPTPEPHLPLSLILSVVTSALVAALVLAFSGIMIVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSMNQPGP,1620,NP_004295.2.csv,refseq-ALK-NM_004304.4_clinical_seed_0_final,refseq-ALK-NM_004304.4.a2m,Invitae,refseq-ALK-NM_004304.4.npy,1,1620,1620
+NP_004302.1,MGLLSILRKLKSAPDQEVRILLLGLDNAGKTTLLKQLASEDISHITPTQGFNIKSVQSQGFKLNVWDIGGQRKIRPYWKNYFENTDILIYVIDSADRKRFEETGQELAELLEEEKLSCVPVLIFANKQDLLTAAPASEIAEGLNLHTIRDRVWQIQSCSALTGEGVQDGMNWVCKNVNAKKK,182,NP_004302.1.csv,refseq-ARL3-NM_004311.4_clinical_seed_0_final,refseq-ARL3-NM_004311.4.a2m,Invitae,refseq-ARL3-NM_004311.4_theta_0.2.npy,1,182,182
+NP_004312.2,MAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPKETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLCEDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATNMNETSSRSHAVFNIIFTQKRHDAETNITTEKVSKISLVDLAGSERADSTGAKGTRLKEGANINKSLTTLGKVISALAEMDSGPNKNKKKKKTDFIPYRDSVLTWLLRENLGGNSRTAMVAALSPADINYDETLSTLRYADRAKQIRCNAVINEDPNNKLIRELKDEVTRLRDLLYAQGLGDITDMTNALVGMSPSSSLSALSSRAASVSSLHERILFAPGSEEAIERLKETEKIIAELNETWEEKLRRTEAIRMEREALLAEMGVAMREDGGTLGVFSPKKTPHLVNLNEDPLMSECLLYYIKDGITRVGREDGERRQDIVLSGHFIKEEHCVFRSDSRGGSEAVVTLEPCEGADTYVNGKKVTEPSILRSGNRIIMGKSHVFRFNHPEQARQERERTPCAETPAEPVDWAFAQRELLEKQGIDMKQEMEQRLQELEDQYRREREEATYLLEQQRLDYESKLEALQKQMDSRYYPEVNEEEEEPEDEVQWTERECELALWAFRKWKWYQFTSLRDLLWGNAIFLKEANAISVELKKKVQFQFVLLTDTLYSPLPPDLLPPEAAKDRETRPFPRTIVAVEVQDQKNGATHYWTLEKLRQRLDLMREMYDRAAEVPSSVIEDCDNVVTGGDPFYDRFPWFRLVGRAFVYLSNLLYPVPLVHRVAIVSEKGEVKGFLRVAVQAISADEEAPDYGSGVRQSGTAKISFDDQHFEKFQSESCPVVGMSRSGTSQEELRIVEGQGQGADVGPSADEVNNNTCSAVPPEGLLLDSSEKAALDGPLDAALDHLRLGNTFTFRVTVLQASSISAEYADIFCQFNFIHRHDEAFSTEPLKNTGRGPPLGFYHVQNIAVEVTKSFIEYIKSQPIVFEVFGHYQQHPFPPLCKDVLSPLRPSRRHFPRVMPLSKPVPATKLSTLTRPCPGPCHCKYDLLVYFEICELEANGDYIPAVVDHRGGMPCMGTFLLHQGIQRRITVTLLHETGSHIRWKEVRELVVGRIRNTPETDESLIDPNILSLNILSSGYIHPAQDDRTFYQFEAAWDSSMHNSLLLNRVTPYREKIYMTLSAYIEMENCTQPAVVTKDFCMVFYSRDAKLPASRSIRNLFGSGSLRASESNRVTGVYELSLCHVADAGSPGMQRRRRRVLDTSVAYVRGEENLAGWRPRSDSLILDHQWELEKLSLLQEVEKTRHYLLLREKLETAQRPVPEALSPAFSEDSESHGSSSASSPLSAEGRPSPLEAPNERQRELAVKCLRLLTHTFNREYTHSHVCVSASESKLSEMSVTLLRDPSMSPLGVATLTPSSTCPSLVEGRYGATDLRTPQPCSRPASPEPELLPEADSKKLPSPARATETDKEPQRLLVPDIQEIRVSPIVSKKGYLHFLEPHTSGWARRFVVVRRPYAYMYNSDKDTVERFVLNLATAQVEYSEDQQAMLKTPNTFAVCTEHRGILLQAASDKDMHDWLYAFNPLLAGTIRSKLSRRRSAQMRV,1690,NP_004312.2.csv,refseq-KIF1A-NM_004321.6_clinical_seed_0_final,refseq-KIF1A-NM_004321.6.a2m,Invitae,refseq-KIF1A-NM_004321.6.npy,1,1690,1690
+NP_004319.1,MPLSDFILALKDNPYFGAGFGLVGVGTALALARKGVQLGLVAFRRHYMITLEVPARDRSYAWLLSWLTRHSTRTQHLSVETSYLQHESGRISTKFEFVPSPGNHFIWYRGKWIRVERSREMQMIDLQTGTPWESVTFTALGTDRKVFFNILEEARELALQQEEGKTVMYTAVGSEWRPFGYPRRRRPLNSVVLQQGLADRIVRDVQEFIDNPKWYTDRGIPYRRGYLLYGPPGCGKSSFITALAGELEHSICLLSLTDSSLSDDRLNHLLSVAPQQSLVLLEDVDAAFLSRDLAVENPVKYQGLGRLTFSGLLNALDGVASTEARIVFMTTNHVDRLDPALIRPGRVDLKEYVGYCSHWQLTQMFQRFYPGQAPSLAENFAEHVLRATNQISPAQVQGYFMLYKNDPVGAIHNAESLRR,419,NP_004319.1.csv,refseq-BCS1L-NM_004328.4_clinical_seed_0_final,refseq-BCS1L-NM_004328.4.a2m,Invitae,refseq-BCS1L-NM_004328.4.npy,1,419,419
+NP_004320.2,MPQLYIYIRLLGAYLFIISRVQGQNLDSMLHGTGMKSDSDQKKSENGVTLAPEDTLPFLKCYCSGHCPDDAINNTCITNGHCFAIIEEDDQGETTLASGCMKYEGSDFQCKDSPKAQLRRTIECCRTNLCNQYLQPTLPPVVIGPFFDGSIRWLVLLISMAVCIIAMIIFSSCFCYKHYCKSISSRRRYNRDLEQDEAFIPVGESLKDLIDQSQSSGSGSGLPLLVQRTIAKQIQMVRQVGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETEIYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDFLKCATLDTRALLKLAYSAACGLCHLHTEIYGTQGKPAIAHRDLKSKNILIKKNGSCCIADLGLAVKFNSDTNEVDVPLNTRVGTKRYMAPEVLDESLNKNHFQPYIMADIYSFGLIIWEMARRCITGGIVEEYQLPYYNMVPSDPSYEDMREVVCVKRLRPIVSNRWNSDECLRAVLKLMSECWAHNPASRLTALRIKKTLAKMVESQDVKI,532,NP_004320.2.csv,refseq-BMPR1A-NM_004329.3_clinical_seed_0_final,refseq-BMPR1A-NM_004329.3.a2m,Invitae,refseq-BMPR1A-NM_004329.3.npy,1,532,532
+NP_004324.2,MAALSGGGGGGAEPGQALFNGDMEPEAGAGAGAAASSAADPAIPEEVWNIKQMIKLTQEHIEALLDKFGGEHNPPSIYLEAYEEYTSKLDALQQREQQLLESLGNGTDFSVSSSASMDTVTSSSSSSLSVLPSSLSVFQNPTDVARSNPKSPQKPIVRVFLPNKQRTVVPARCGVTVRDSLKKALMMRGLIPECCAVYRIQDGEKKPIGWDTDISWLTGEELHVEVLENVPLTTHNFVRKTFFTLAFCDFCRKLLFQGFRCQTCGYKFHQRCSTEVPLMCVNYDQLDLLFVSKFFEHHPIPQEEASLAETALTSGSSPSAPASDSIGPQILTSPSPSKSIPIPQPFRPADEDHRNQFGQRDRSSSAPNVHINTIEPVNIDDLIRDQGFRGDGGSTTGLSATPPASLPGSLTNVKALQKSPGPQRERKSSSSSEDRNRMKTLGRRDSSDDWEIPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGVLRKTRHVNILLFMGYSTKPQLAIVTQWCEGSSLYHHLHIIETKFEMIKLIDIARQTAQGMDYLHAKSIIHRDLKSNNIFLHEDLTVKIGDFGLATVKSRWSGSHQFEQLSGSILWMAPEVIRMQDKNPYSFQSDVYAFGIVLYELMTGQLPYSNINNRDQIIFMVGRGYLSPDLSKVRSNCPKAMKRLMAECLKKKRDERPLFPQILASIELLARSLPKIHRSASEPSLNRAGFQTEDFSLYACASPKTPIQAGGYGAFPVH,766,NP_004324.2.csv,refseq-BRAF-NM_004333.4_clinical_seed_0_final,refseq-BRAF-NM_004333.4.a2m,Invitae,refseq-BRAF-NM_004333.4.npy,1,766,766
+NP_004332.2,MAALVLEDGSVLRGQPFGAAVSTAGEVVFQTGMVGYPEALTDPSYKAQILVLTYPLIGNYGIPPDEMDEFGLCKWFESSGIHVAALVVGECCPTPSHWSATRTLHEWLQQHGIPGLQGVDTRELTKKLREQGSLLGKLVQNGTEPSSLPFLDPNARPLVPEVSIKTPRVFNTGGAPRILALDCGLKYNQIRCLCQRGAEVTVVPWDHALDSQEYEGLFLSNGPGDPASYPSVVSTLSRVLSEPNPRPVFGICLGHQLLALAIGAKTYKMRYGNRGHNQPCLLVGSGRCFLTSQNHGFAVETDSLPADWAPLFTNANDGSNEGIVHNSLPFFSVQFHPEHQAGPSDMELLFDIFLETVKEATAGNPGGQTVRERLTERLCPPGIPTPGSGLPPPRKVLILGSGGLSIGQAGEFDYSGSQAIKALKEENIQTLLINPNIATVQTSQGLADKVYFLPITPHYVTQVIRNERPDGVLLTFGGQTALNCGVELTKAGVLARYGVRVLGTPVETIELTEDRRAFAARMAEIGEHVAPSEAANSLEQAQAAAERLGYPVLVRAAFALGGLGSGFASNREELSALVAPAFAHTSQVLVDKSLKGWKEIEYEVVRDAYGNCVTVCNMENLDPLGIHTGESIVVAPSQTLNDREYQLLRQTAIKVTQHLGIVGECNVQYALNPESEQYYIIEVNARLSRSSALASKATGYPLAYVAAKLALGIPLPELRNSVTGGTAAFEPSVDYCVVKIPRWDLSKFLRVSTKIGSCMKSVGEVMGIGRSFEEAFQKALRMVDENCVGFDHTVKPVSDMELETPTDKRIFVVAAALWAGYSVDRLYELTRIDRWFLHRMKRIIAHAQLLEQHRGQPLPPDLLQQAKCLGFSDKQIALAVLSTELAVRKLRQELGICPAVKQIDTVAAEWPAQTNYLYLTYWGTTHDLTFRTPHVLVLGSGVYRIGSSVEFDWCAVGCIQQLRKMGYKTIMVNYNPETVSTDYDMCDRLYFDEISFEVVMDIYELENPEGVILSMGGQLPNNMAMALHRQQCRVLGTSPEAIDSAENRFKFSRLLDTIGISQPQWRELSDLESARQFCQTVGYPCVVRPSYVLSGAAMNVAYTDGDLERFLSSAAAVSKEHPVVISKFIQEAKEIDVDAVASDGVVAAIAISEHVENAGVHSGDATLVTPPQDITAKTLERIKAIVHAVGQELQVTGPFNLQLIAKDDQLKVIECNVRVSRSFPFVSKTLGVDLVALATRVIMGEEVEPVGLMTGSGVVGVKVPQFSFSRLAGADVVLGVEMTSTGEVAGFGESRCEAYLKAMLSTGFKIPKKNILLTIGSYKNKSELLPTVRLLESLGYSLYASLGTADFYTEHGVKVTAVDWHFEEAVDGECPPQRSILEQLAEKNFELVINLSMRGAGGRRLSSFVTKGYRTRRLAADFSVPLIIDIKCTKLFVEALGQIGPAPPLKVHVDCMTSQKLVRLPGLIDVHVHLREPGGTHKEDFASGTAAALAGGITMVCAMPNTRPPIIDAPALALAQKLAEAGARCDFALFLGASSENAGTLGTVAGSAAGLKLYLNETFSELRLDSVVQWMEHFETWPSHLPIVAHAEQQTVAAVLMVAQLTQRSVHICHVARKEEILLIKAAKARGLPVTCEVAPHHLFLSHDDLERLGPGKGEVRPELGSRQDVEALWENMAVIDCFASDHAPHTLEEKCGSRPPPGFPGLETMLPLLLTAVSEGRLSLDDLLQRLHHNPRRIFHLPPQEDTYVEVDLEHEWTIPSHMPFSKAHWTPFEGQKVKGTVRRVVLRGEVAYIDGQVLVPPGYGQDVRKWPQGAVPQLPPSAPATSEMTTTPERPRRGIPGLPDGRFHLPPRIHRASDPGLPAEEPKEKSSRKVAEPELMGTPDGTCYPPPPVPRQASPQNLGTPGLLHPQTSPLLHSLVGQHILSVQQFTKDQMSHLFNVAHTLRMMVQKERSLDILKGKVMASMFYEVSTRTSSSFAAAMARLGGAVLSFSEATSSVQKGESLADSVQTMSCYADVVVLRHPQPGAVELAAKHCRRPVINAGDGVGEHPTQALLDIFTIREELGTVNGMTITMVGDLKHGRTVHSLACLLTQYRVSLRYVAPPSLRMPPTVRAFVASRGTKQEEFESIEEALPDTDVLYMTRIQKERFGSTQEYEACFGQFILTPHIMTRAKKKMVVMHPMPRVNEISVEVDSDPRAAYFRQAENGMYIRMALLATVLGRF,2225,NP_004332.2.csv,PYR1_HUMAN_b03_clinical_seed_0_final,PYR1_HUMAN_b03.a2m,EVE,PYR1_HUMAN_b03_theta_0.2.npy,1,2225,2225
+NP_004351.1,MGPWSRSLSALLLLLQVSSWLCQEPEPCHPGFDAESYTFTVPRRHLERGRVLGRVNFEDCTGRQRTAYFSLDTRFKVGTDGVITVKRPLRFHNPQIHFLVYAWDSTYRKFSTKVTLNTVGHHHRPPPHQASVSGIQAELLTFPNSSPGLRRQKRDWVIPPISCPENEKGPFPKNLVQIKSNKDKEGKVFYSITGQGADTPPVGVFIIERETGWLKVTEPLDRERIATYTLFSHAVSSNGNAVEDPMEILITVTDQNDNKPEFTQEVFKGSVMEGALPGTSVMEVTATDADDDVNTYNAAIAYTILSQDPELPDKNMFTINRNTGVISVVTTGLDRESFPTYTLVVQAADLQGEGLSTTATAVITVTDTNDNPPIFNPTTYKGQVPENEANVVITTLKVTDADAPNTPAWEAVYTILNDDGGQFVVTTNPVNNDGILKTAKGLDFEAKQQYILHVAVTNVVPFEVSLTTSTATVTVDVLDVNEAPIFVPPEKRVEVSEDFGVGQEITSYTAQEPDTFMEQKITYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNGSPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTSPFTAELTHGASANWTIQYNDPTQESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALLILILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQDFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDENLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD,882,NP_004351.1.csv,refseq-CDH1-NM_004360.3_clinical_seed_0_final,refseq-CDH1-NM_004360.3.a2m,Invitae,refseq-CDH1-NM_004360.3.npy,1,882,882
+NP_004355.2,MESADFYEAEPRPPMSSHLQSPPHAPSSAAFGFPRGAGPAQPPAPPAAPEPLGGICEHETSIDISAYIDPAAFNDEFLADLFQHSRQQEKAKAAVGPTGGGGGGDFDYPGAPAGPGGAVMPGGAHGPPPGYGCAAAGYLDGRLEPLYERVGAPALRPLVIKQEPREEDEAKQLALAGLFPYQPPPPPPPSHPHPHPPPAHLAAPHLQFQIAHCGQTTMHLQPGHPTPPPTPVPSPHPAPALGAAGLPGPGSALKGLGAAHPDLRASGGSGAGKAKKSVDKNSNEYRVRRERNNIAVRKSRDKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRELDTLRGIFRQLPESSLVKAMGNCA,358,NP_004355.2.csv,refseq-CEBPA-NM_004364.4_clinical_seed_0_final,refseq-CEBPA-NM_004364.4.a2m,Invitae,refseq-CEBPA-NM_004364.4.npy,1,358,358
+NP_004357.3,MAAAAAEEGMEPRALQYEQTLMYGRYTQDLGAFAKEEAARIRLGGPEPWKGPPSSRAAPELLEYGRSRCARCRVCSVRCHKFLVSRVGEDWIFLVLLGLLMALVSWVMDYAIAACLQAQQWMSRGLNTSILLQYLAWVTYPVVLITFSAGFTQILAPQAVGSGIPEMKTILRGVVLKEYLTLKTFIAKVIGLTCALGSGMPLGKEGPFVHIASMCAALLSKFLSLFGGIYENESRNTEMLAAACAVGVGCCFAAPIGGVLFSIEVTSTFFAVRNYWRGFFAATFSAFIFRVLAVWNRDEETITALFKTRFRLDFPFDLQELPAFAVIGIASGFGGALFVYLNRKIVQVMRKQKTINRFLMRKRLLFPALVTLLISTLTFPPGFGQFMAGQLSQKETLVTLFDNRTWVRQGLVEELEPPSTSQAWNPPRANVFLTLVIFILMKFWMSALATTIPVPCGAFMPVFVIGAAFGRLVGESMAAWFPDGIHTDSSTYRIVPGGYAVVGAAALAGAVTHTVSTAVIVFELTGQIAHILPVMIAVILANAVAQSLQPSLYDSIIRIKKLPYLPELGWGRHQQYRVRVEDIMVRDVPHVALSCTFRDLRLALHRTKGRMLALVESPESMILLGSIERSQVVALLGAQLSPARRRQHMQERRATQTSPLSDQEGPPTPEASVCFQVNTEDSAFPAARGETHKPLKPALKRGPSVTRNLGESPTGSAESAGIALRSLFCGSPPPEAASEKLESCEKRKLKRVRISLASDADLEGEMSPEEILEWEEQQLDEPVNFSDCKIDPAPFQLVERTSLHKTHTIFSLLGVDHAYVTSIGRLIGIVTLKELRKAIEGSVTAQGVKVRPPLASFRDSATSSSDTETTEVHALWGPHSRHGLPREGSPSDSDDKCQ,898,NP_004357.3.csv,refseq-CLCN2-NM_004366.5_clinical_seed_0_final,refseq-CLCN2-NM_004366.5.a2m,Invitae,refseq-CLCN2-NM_004366.5.npy,1,898,898
+NP_004362.2,MLTKFETKSARVKGLSFHPKRPWILTSLHNGVIQLWDYRMCTLIDKFDEHDGPVRGIDFHKQQPLFVSGGDDYKIKVWNYKLRRCLFTLLGHLDYIRTTFFHHEYPWILSASDDQTIRVWNWQSRTCVCVLTGHNHYVMCAQFHPTEDLVVSASLDQTVRVWDISGLRKKNLSPGAVESDVRGITGVDLFGTTDAVVKHVLEGHDRGVNWAAFHPTMPLIVSGADDRQVKIWRMNESKAWEVDTCRGHYNNVSCAVFHPRQELILSNSEDKSIRVWDMSKRTGVQTFRRDHDRFWVLAAHPNLNLFAAGHDGGMIVFKLERERPAYAVHGNMLHYVKDRFLRQLDFNSSKDVAVMQLRSGSKFPVFNMSYNPAENAVLLCTRASNLENSTYDLYTIPKDADSQNPDAPEGKRSSGLTAVWVARNRFAVLDRMHSLLIKNLKNEITKKVQVPNCDEIFYAGTGNLLLRDADSITLFDVQQKRTLASVKISKVKYVIWSADMSHVALLAKHAIVICNRKLDALCNIHENIRVKSGAWDESGVFIYTTSNHIKYAVTTGDHGIIRTLDLPIYVTRVKGNNVYCLDRECRPRVLTIDPTEFKFKLALINRKYDEVLHMVRNAKLVGQSIIAYLQKKGYPEVALHFVKDEKTRFSLALECGNIEIALEAAKALDDKNCWEKLGEVALLQGNHQIVEMCYQRTKNFDKLSFLYLITGNLEKLRKMMKIAEIRKDMSGHYQNALYLGDVSERVRILKNCGQKSLAYLTAATHGLDEEAESLKETFDPEKETIPDIDPNAKLLQPPAPIMPLDTNWPLLTVSKGFFEGTIASKGKGGALAADIDIDTVGTEGWGEDAELQLDEDGFVEATEGLGDDALGKGQEEGGGWDVEEDLELPPELDISPGAAGGAEDGFFVPPTKGTSPTQIWCNNSQLPVDHILAGSFETAMRLLHDQVGVIQFGPYKQLFLQTYARGRTTYQALPCLPSMYGYPNRNWKDAGLKNGVPAVGLKLNDLIQRLQLCYQLTTVGKFEEAVEKFRSILLSVPLLVVDNKQEIAEAQQLITICREYIVGLSVETERKKLPKETLEQQKRICEMAAYFTHSNLQPVHMILVLRTALNLFFKLKNFKTAATFARRLLELGPKPEVAQQTRKILSACEKNPTDAYQLNYDMHNPFDICAASYRPIYRGKPVEKCPLSGACYSPEFKGQICRVTTVTEIGKDVIGLRISPLQFR,1224,NP_004362.2.csv,refseq-COPA-NM_004371.3_clinical_seed_0_final,refseq-COPA-NM_004371.3.a2m,Invitae,refseq-COPA-NM_004371.3.npy,1,1224,1224
+NP_004367.2,MQRLLFPPLRALKGRQYLPLLAPRAAPRAQCDCIRRPLRPGQYSTISEVALQSGRGTVSLPSKAAERVVGRWLLVCSGTVAGAVILGGVTRLTESGLSMVDWHLIKEMKPPTSQEEWEAEFQRYQQFPEFKILNHDMTLTEFKFIWYMEYSHRMWGRLVGLVYILPAAYFWRKGWLSRGMKGRVLALCGLVCFQGLLGWYMVKSGLEEKSDSHDIPRVSQYRLAAHLGSALVLYCASLWTSLSLLLPPHKLPETHQLLQLRRFAHGTAGLVFLTALSGAFVAGLDAGLVYNSFPKMGESWIPEDLFTFSPILRNVFENPTMVQFDHRILGITSVTAITVLYFLSRRIPLPRRTKMAAVTLLALAYTQGPVLFNFTFKISDLDEGIRNI,388,NP_004367.2.csv,refseq-COX15-NM_004376.6_clinical_seed_0_final,refseq-COX15-NM_004376.6.a2m,Invitae,refseq-COX15-NM_004376.6.npy,1,388,388
+NP_004378.1,MFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPSSCMLAAFKPEAYAGPEAAAPGLPELRAELGRAPSPAKCASAFPAAPAFYPRAYSDPDPAKDPRAEKKELCALQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSAPERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVRDGKPCLGDSAPYAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPATAAANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHGIRAW,324,NP_004378.1.csv,refseq-NKX2-5-NM_004387.3_clinical_seed_0_final,refseq-NKX2-5-NM_004387.3.a2m,Invitae,refseq-NKX2-5-NM_004387.3.npy,1,324,324
+NP_004388.2,MSTARTENPVIMGLSSQNGQLRGPVKPTGGPGGGGTQTQQQMNQLKNTNTINNGTQQQAQSMTTTIKPGDDWKKTLKLPPKDLRIKTSDVTSTKGNEFEDYCLKRELLMGIFEMGWEKPSPIQEESIPIALSGRDILARAKNGTGKSGAYLIPLLERLDLKKDNIQAMVIVPTRELALQVSQICIQVSKHMGGAKVMATTGGTNLRDDIMRLDDTVHVVIATPGRILDLIKKGVAKVDHVQMIVLDEADKLLSQDFVQIMEDIILTLPKNRQILLYSATFPLSVQKFMNSHLQKPYEINLMEELTLKGVTQYYAYVTERQKVHCLNTLFSRLQINQSIIFCNSSQRVELLAKKISQLGYSCFYIHAKMRQEHRNRVFHDFRNGLCRNLVCTDLFTRGIDIQAVNVVINFDFPKLAETYLHRIGRSGRFGHLGLAINLITYDDRFNLKSIEEQLGTEIKPIPSNIDKSLYVAEYHSEPVEDEKP,483,NP_004388.2.csv,refseq-DDX6-NM_004397.4_clinical_seed_0_final,refseq-DDX6-NM_004397.4.a2m,Invitae,refseq-DDX6-NM_004397.4.npy,1,483,483
+NP_004399.2,MGNRGMEDLIPLVNRLQDAFSAIGQNADLDLPQIAVVGGQSAGKSSVLENFVGRDFLPRGSGIVTRRPLVLQLVNATTEYAEFLHCKGKKFTDFEEVRLEIEAETDRVTGTNKGISPVPINLRVYSPHVLNLTLVDLPGMTKVPVGDQPPDIEFQIRDMLMQFVTKENCLILAVSPANSDLANSDALKVAKEVDPQGQRTIGVITKLDLMDEGTDARDVLENKLLPLRRGYIGVVNRSQKDIDGKKDITAALAAERKFFLSHPSYRHLADRMGTPYLQKVLNQQLTNHIRDTLPGLRNKLQSQLLSIEKEVEEYKNFRPDDPARKTKALLQMVQQFAVDFEKRIEGSGDQIDTYELSGGARINRIFHERFPFELVKMEFDEKELRREISYAIKNIHGIRTGLFTPDMAFETIVKKQVKKIREPCLKCVDMVISELISTVRQCTKKLQQYPRLREEMERIVTTHIREREGRTKEQVMLLIDIELAYMNTNHEDFIGFANAQQRSNQMNKKKTSGNQDEILVIRKGWLTINNIGIMKGGSKEYWFVLTAENLSWYKDDEEKEKKYMLSVDNLKLRDVEKGFMSSKHIFALFNTEQRNVYKDYRQLELACETQEEVDSWKASFLRAGVYPERVGDKEKASETEENGSDSFMHSMDPQLERQVETIRNLVDSYMAIVNKTVRDLMPKTIMHLMINNTKEFIFSELLANLYSCGDQNTLMEESAEQAQRRDEMLRMYHALKEALSIIGDINTTTVSTPMPPPVDDSWLQVQSVPAGRRSPTSSPTPQRRAPAVPPARPGSRGPAPGPPPAGSALGGAPPVPSRPGASPDPFGPPPQVPSRPNRAPPGVPSRSGQASPSRPESPRPPFDL,864,NP_004399.2.csv,refseq-DNM1-NM_004408.3_clinical_seed_0_final,refseq-DNM1-NM_004408.3.a2m,Invitae,refseq-DNM1-NM_004408.3.npy,1,864,864
+NP_004416.2,MGTTARAALVLTYLAVASAASEGGFTATGQRQLRPEHFQEVGYAAPPSPPLSRSLPMDHPDSSQHGPPFEGQSQVQPPPSQEATPLQQEKLLPAQLPAEKEVGPPLPQEAVPLQKELPSLQHPNEQKEGTPAPFGDQSHPEPESWNAAQHCQQDRSQGGWGHRLDGFPPGRPSPDNLNQICLPNRQHVVYGPWNLPQSSYSHLTRQGETLNFLEIGYSRCCHCRSHTNRLECAKLVWEEAMSRFCEAEFSVKTRPHWCCTRQGEARFSCFQEEAPQPHYQLRACPSHQPDISSGLELPFPPGVPTLDNIKNICHLRRFRSVPRNLPATDPLQRELLALIQLEREFQRCCRQGNNHTCTWKAWEDTLDKYCDREYAVKTHHHLCCRHPPSPTRDECFARRAPYPNYDRDILTIDIGRVTPNLMGHLCGNQRVLTKHKHIPGLIHNMTARCCDLPFPEQACCAEEEKLTFINDLCGPRRNIWRDPALCCYLSPGDEQVNCFNINYLRNVALVSGDTENAKGQGEQGSTGGTNISSTSEPKEE,540,NP_004416.2.csv,refseq-ECM1-NM_004425.3_clinical_seed_0_final,refseq-ECM1-NM_004425.3.a2m,Invitae,refseq-ECM1-NM_004425.3.npy,1,540,540
+NP_004420.1,MARPGQRWLGKWLVAMVVWALCRLATPLAKNLEPVSWSSLNPKFLSGKGLVIYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQAAACSTVLDPNVLVTCNRPEQEIRFTIKFQEFSPNYMGLEFKKHHDYYITSTSNGSLEGLENREGGVCRTRTMKIIMKVGQDPNAVTPEQLTTSRPSKEADNTVKMATQAPGSRGSLGDSDGKHETVNQEEKSGPGASGGSSGDPDGFFNSKVALFAAVGAGCVIFLLIIIFLTVLLLKLRKRHRKHTQQRAAALSLSTLASPKGGSGTAGTEPSDIIIPLRTTENNYCPHYEKVSGDYGHPVYIVQEMPPQSPANIYYKV,346,NP_004420.1.csv,refseq-EFNB1-NM_004429.4_clinical_seed_0_final,refseq-EFNB1-NM_004429.4.a2m,Invitae,refseq-EFNB1-NM_004429.4.npy,1,346,346
+NP_004422.2,MELQAARACFALLWGCALAAAAAAQGKEVVLLDFAAAGGELGWLTHPYGKGWDLMQNIMNDMPIYMYSVCNVMSGDQDNWLRTNWVYRGEAERIFIELKFTVRDCNSFPGGASSCKETFNLYYAESDLDYGTNFQKRLFTKIDTIAPDEITVSSDFEARHVKLNVEERSVGPLTRKGFYLAFQDIGACVALLSVRVYYKKCPELLQGLAHFPETIAGSDAPSLATVAGTCVDHAVVPPGGEEPRMHCAVDGEWLVPIGQCLCQAGYEKVEDACQACSPGFFKFEASESPCLECPEHTLPSPEGATSCECEEGFFRAPQDPASMPCTRPPSAPHYLTAVGMGAKVELRWTPPQDSGGREDIVYSVTCEQCWPESGECGPCEASVRYSEPPHGLTRTSVTVSDLEPHMNYTFTVEARNGVSGLVTSRSFRTASVSINQTEPPKVRLEGRSTTSLSVSWSIPPPQQSRVWKYEVTYRKKGDSNSYNVRRTEGFSVTLDDLAPDTTYLVQVQALTQEGQGAGSKVHEFQTLSPEGSGNLAVIGGVAVGVVLLLVLAGVGFFIHRRRKNQRARQSPEDVYFSKSEQLKPLKTYVDPHTYEDPNQAVLKFTTEIHPSCVTRQKVIGAGEFGEVYKGMLKTSSGKKEVPVAIKTLKAGYTEKQRVDFLGEAGIMGQFSHHNIIRLEGVISKYKPMMIITEYMENGALDKFLREKDGEFSVLQLVGMLRGIAAGMKYLANMNYVHRDLAARNILVNSNLVCKVSDFGLSRVLEDDPEATYTTSGGKIPIRWTAPEAISYRKFTSASDVWSFGIVMWEVMTYGERPYWELSNHEVMKAINDGFRLPTPMDCPSAIYQLMMQCWQQERARRPKFADIVSILDKLIRAPDSLKTLADFDPRVSIRLPSTSGSEGVPFRTVSEWLESIKMQQYTEHFMAAGYTAIEKVVQMTNDDIKRIGVRLPGHQKRIAYSLLGLKDQVNTVGIPI,976,NP_004422.2.csv,refseq-EPHA2-NM_004431.3_clinical_seed_0_final,refseq-EPHA2-NM_004431.3.a2m,Invitae,refseq-EPHA2-NM_004431.3.npy,1,976,976
+NP_004429.1,MAGIFYFALFSCLFGICDAVTGSRVYPANEVTLLDSRSVQGELGWIASPLEGGWEEVSIMDEKNTPIRTYQVCNVMEPSQNNWLRTDWITREGAQRVYIEIKFTLRDCNSLPGVMGTCKETFNLYYYESDNDKERFIRENQFVKIDTIAADESFTQVDIGDRIMKLNTEIRDVGPLSKKGFYLAFQDVGACIALVSVRVFYKKCPLTVRNLAQFPDTITGADTSSLVEVRGSCVNNSEEKDVPKMYCGADGEWLVPIGNCLCNAGHEERSGECQACKIGYYKALSTDATCAKCPPHSYSVWEGATSCTCDRGFFRADNDAASMPCTRPPSAPLNLISNVNETSVNLEWSSPQNTGGRQDISYNVVCKKCGAGDPSKCRPCGSGVHYTPQQNGLKTTKVSITDLLAHTNYTFEIWAVNGVSKYNPNPDQSVSVTVTTNQAAPSSIALVQAKEVTRYSVALAWLEPDRPNGVILEYEVKYYEKDQNERSYRIVRTAARNTDIKGLNPLTSYVFHVRARTAAGYGDFSEPLEVTTNTVPSRIIGDGANSTVLLVSVSGSVVLVVILIAAFVISRRRSKYSKAKQEADEEKHLNQGVRTYVDPFTYEDPNQAVREFAKEIDASCIKIEKVIGVGEFGEVCSGRLKVPGKREICVAIKTLKAGYTDKQRRDFLSEASIMGQFDHPNIIHLEGVVTKCKPVMIITEYMENGSLDAFLRKNDGRFTVIQLVGMLRGIGSGMKYLSDMSYVHRDLAARNILVNSNLVCKVSDFGMSRVLEDDPEAAYTTRGGKIPIRWTAPEAIAYRKFTSASDVWSYGIVMWEVMSYGERPYWDMSNQDVIKAIEEGYRLPPPMDCPIALHQLMLDCWQKERSDRPKFGQIVNMLDKLIRNPNSLKRTGTESSRPNTALLDPSSPEFSAVVSVGDWLQAIKMDRYKDNFTAAGYTTLEAVVHVNQEDLARIGITAITHQNKILSSVQAMRTQMQQMHGRMVPV,986,NP_004429.1.csv,refseq-EPHA4-NM_004438.4_clinical_seed_0_final,refseq-EPHA4-NM_004438.4.a2m,Invitae,refseq-EPHA4-NM_004438.4.npy,1,986,986
+NP_004435.3,MELRVLLCWASLAAALEETLLNTKLETADLKWVTFPQVDGQWEELSGLDEEQHSVRTYEVCDVQRAPGQAHWLRTGWVPRRGAVHVYATLRFTMLECLSLPRAGRSCKETFTVFYYESDADTATALTPAWMENPYIKVDTVAAEHLTRKRPGAEATGKVNVKTLRLGPLSKAGFYLAFQDQGACMALLSLHLFYKKCAQLTVNLTRFPETVPRELVVPVAGSCVVDAVPAPGPSPSLYCREDGQWAEQPVTGCSCAPGFEAAEGNTKCRACAQGTFKPLSGEGSCQPCPANSHSNTIGSAVCQCRVGYFRARTDPRGAPCTTPPSAPRSVVSRLNGSSLHLEWSAPLESGGREDLTYALRCRECRPGGSCAPCGGDLTFDPGPRDLVEPWVVVRGLRPDFTYTFEVTALNGVSSLATGPVPFEPVNVTTDREVPPAVSDIRVTRSSPSSLSLAWAVPRAPSGAVLDYEVKYHEKGAEGPSSVRFLKTSENRAELRGLKRGASYLVQVRARSEAGYGPFGQEHHSQTQLDESEGWREQLALIAGTAVVGVVLVLVVIVVAVLCLRKQSNGREAEYSDKHGQYLIGHGTKVYIDPFTYEDPNEAVREFAKEIDVSYVKIEEVIGAGEFGEVCRGRLKAPGKKESCVAIKTLKGGYTERQRREFLSEASIMGQFEHPNIIRLEGVVTNSMPVMILTEFMENGALDSFLRLNDGQFTVIQLVGMLRGIASGMRYLAEMSYVHRDLAARNILVNSNLVCKVSDFGLSRFLEENSSDPTYTSSLGGKIPIRWTAPEAIAFRKFTSASDAWSYGIVMWEVMSFGERPYWDMSNQDVINAIEQDYRLPPPPDCPTSLHQLMLDCWQKDRNARPRFPQVVSALDKMIRNPASLKIVARENGGASHPLLDQRQPHYSAFGSVGEWLRAIKMGRYEESFAAAGFGSFELVSQISAEDLLRIGVTLAGHQKKILASVQHMKSQAKPGTPGGTGGPAPQY,987,NP_004435.3.csv,refseq-EPHB4-NM_004444.4_clinical_seed_0_final,refseq-EPHB4-NM_004444.4.a2m,Invitae,refseq-EPHB4-NM_004444.4.npy,1,987,987
+NP_004437.2,MATLSLTVNSGDPPLGALLAVEHVKDDVSISVEEGKENILHVSENVIFTDVNSILRYLARVATTAGLYGSNLMEHTEIDHWLEFSATKLSSCDSFTSTINELNHCLSLRTYLVGNSLSLADLCVWATLKGNAAWQEQLKQKKAPVHVKRWFGFLEAQQAFQSVGTKWDVSTTKARVAPEKKQDVGKFVELPGAEMGKVTVRFPPEASGYLHIGHAKAALLNQHYQVNFKGKLIMRFDDTNPEKEKEDFEKVILEDVAMLHIKPDQFTYTSDHFETIMKYAEKLIQEGKAYVDDTPAEQMKAEREQRIDSKHRKNPIEKNLQMWEEMKKGSQFGQSCCLRAKIDMSSNNGCMRDPTLYRCKIQPHPRTGNKYNVYPTYDFACPIVDSIEGVTHALRTTEYHDRDEQFYWIIEALGIRKPYIWEYSRLNLNNTVLSKRKLTWFVNEGLVDGWDDPRFPTVRGVLRRGMTVEGLKQFIAAQGSSRSVVNMEWDKIWAFNKKVIDPVAPRYVALLKKEVIPVNVPEAQEEMKEVAKHPKNPEVGLKPVWYSPKVFIEGADAETFSEGEMVTFINWGNLNITKIHKNADGKIISLDAKLNLENKDYKKTTKVTWLAETTHALPIPVICVTYEHLITKPVLGKDEDFKQYVNKNSKHEELMLGDPCLKDLKKGDIIQLQRRGFFICDQPYEPVSPYSCKEAPCVLIYIPDGHTKEMPTSGSKEKTKVEATKNETSAPFKERPTPSLNNNCTTSEDSLVLYNRVAVQGDVVRELKAKKAPKEDVDAAVKQLLSLKAEYKEKTGQEYKPGNPPAEIGQNISSNSSASILESKSLYDEVAAQGEVVRKLKAEKSPKAKINEAVECLLSLKAQYKEKTGKEYIPGQPPLSQSSDSSPTRNSEPAGLETPEAKVLFDKVASQGEVVRKLKTEKAPKDQVDIAVQELLQLKAQYKSLIGVEYKPVSATGAEDKDKKKKEKENKSEKQNKPQKQNDGQRKDPSKNQGGGLSSSGAGEGQGPKKQTRLGLEAKKEENLADWYSQVITKSEMIEYHDISGCYILRPWAYAIWEAIKDFFDAEIKKLGVENCYFPMFVSQSALEKEKTHVADFAPEVAWVTRSGKTELAEPIAIRPTSETVMYPAYAKWVQSHRDLPIKLNQWCNVVRWEFKHPQPFLRTREFLWQEGHSAFATMEEAAEEVLQILDLYAQVYEELLAIPVVKGRKTEKEKFAGGDYTTTIEAFISASGRAIQGGTSHHLGQNFSKMFEIVFEDPKIPGEKQFAYQNSWGLTTRTIGVMTMVHGDNMGLVLPPRVACVQVVIIPCGITNALSEEDKEALIAKCNDYRRRLLSVNIRVRADLRDNYSPGWKFNHWELKGVPIRLEVGPRDMKSCQFVAVRRDTGEKLTVAENEAETKLQAILEDIQVTLFTRASEDLKTHMVVANTMEDFQKILDSGKIVQIPFCGEIDCEDWIKKTTARDQDLEPGAPSMGAKSLCIPFKPLCELQPGAKCVCGKNPAKYYTLFGRSY,1512,NP_004437.2.csv,refseq-EPRS-NM_004446.2_clinical_seed_0_final,refseq-EPRS-NM_004446.2.a2m,Invitae,refseq-EPRS-NM_004446.2.npy,1,1512,1512
+NP_004439.2,MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIVRGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQLCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRTVCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNTDTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKCSKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPLQPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGISWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEGLACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPECQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQPCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKIRKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWIPDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVTQLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSPNHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWELMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFRELVSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGFFCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDGDLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVRPQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAAPQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV,1255,NP_004439.2.csv,refseq-ERBB2-NM_004448.3_clinical_seed_0_final,refseq-ERBB2-NM_004448.3.a2m,Invitae,refseq-ERBB2-NM_004448.3.npy,1,1255,1255
+NP_004443.3,MSSDDRHLGSSCGSFIKTEPSSPSSGIDALSHHSPSGSSDASGGFGLALGTHANGLDSPPMFAGAGLGGTPCRKSYEDCASGIMEDSAIKCEYMLNAIPKRLCLVCGDIASGYHYGVASCEACKAFFKRTIQGNIEYSCPATNECEITKRRRKSCQACRFMKCLKVGMLKEGVRLDRVRGGRQKYKRRLDSESSPYLSLQISPPAKKPLTKIVSYLLVAEPDKLYAMPPPGMPEGDIKALTTLCDLADRELVVIIGWAKHIPGFSSLSLGDQMSLLQSAWMEILILGIVYRSLPYDDKLVYAEDYIMDEEHSRLAGLLELYRAILQLVRRYKKLKVEKEEFVTLKALALANSDSMYIEDLEAVQKLQDLLHEALQDYELSQRHEEPWRTGKLLLTLPLLRQTAAKAVQHFYSVKLQGKVPMHKLFLEMLEAKVGQEQLRGSPKDERMSSHDGKCPFQSAAFTSRDQSNSPGIPNPRPSSPTPLNERGRQISPSTRTPGGQGKHLWLTM,508,NP_004443.3.csv,refseq-ESRRB-NM_004452.3_clinical_seed_0_final,refseq-ESRRB-NM_004452.3.a2m,Invitae,refseq-ESRRB-NM_004452.3.npy,1,508,508
+NP_004444.2,MLVPLAKLSCLAYQCFHALKIKKNYLPLCATRWSSTSTVPRITTHYTIYPRDKDKRWEGVNMERFAEEADVVIVGAGPAGLSAAVRLKQLAVAHEKDIRVCLVEKAAQIGAHTLSGACLDPGAFKELFPDWKEKGAPLNTPVTEDRFGILTEKYRIPVPILPGLPMNNHGNYIVRLGHLVSWMGEQAEALGVEVYPGYAAAEVLFHDDGSVKGIATNDVGIQKDGAPKATFERGLELHAKVTIFAEGCHGHLAKQLYKKFDLRANCEPQTYGIGLKELWVIDEKNWKPGRVDHTVGWPLDRHTYGGSFLYHLNEGEPLVALGLVVGLDYQNPYLSPFREFQRWKHHPSIRPTLEGGKRIAYGARALNEGGFQSIPKLTFPGGLLIGCSPGFMNVPKIKGTHTAMKSGILAAESIFNQLTSENLQSKTIGLHVTEYEDNLKNSWVWKELYSVRNIRPSCHGVLGVYGGMIYTGIFYWILRGMEPWTLKHKGSDFERLKPAKDCTPIEYPKPDGQISFDLLSSVALSGTNHEHDQPAHLTLRDDSIPVNRNLSIYDGPEQRFCPAGVYEFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQNINWVVPEGGGGPAYNGM,617,NP_004444.2.csv,refseq-ETFDH-NM_004453.3_clinical_seed_0_final,refseq-ETFDH-NM_004453.3.a2m,Invitae,refseq-ETFDH-NM_004453.3.npy,1,617,617
+NP_004447.2,MGQTGKKSEKGPVCWRKRVKSEYMRLRQLKRFRRADEVKSMFSSNRQKILERTEILNQEWKQRRIQPVHILTSVSSLRGTRECSVTSDLDFPTQVIPLKTLNAVASVPIMYSWSPLQQNFMVEDETVLHNIPYMGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNALGQYNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKGTAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLHSFHTLFCRRCFKYDCFLHRKCNYSFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPGGRRRGRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSEANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIGTKTCRQVYEFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDHPRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPDLCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRYSQADALKYVGIEREMEIP,751,NP_004447.2.csv,refseq-EZH2-NM_004456.4_clinical_seed_0_final,refseq-EZH2-NM_004456.4.a2m,Invitae,refseq-EZH2-NM_004456.4.npy,1,751,751
+NP_004449.1,MAKRIKAKPTSDKPGSPYRSVTHFDSLAVIDIPGADTLDKLFDHAVSKFGKKDSLGTREILSEENEMQPNGKVFKKLILGNYKWMNYLEVNRRVNNFGSGLTALGLKPKNTIAIFCETRAEWMIAAQTCFKYNFPLVTLYATLGKEAVVHGLNESEASYLITSVELLESKLKTALLDISCVKHIIYVDNKAINKAEYPEGFEIHSMQSVEELGSNPENLGIPPSRPTPSDMAIVMYTSGSTGRPKGVMMHHSNLIAGMTGQCERIPGLGPKDTYIGYLPLAHVLELTAEISCFTYGCRIGYSSPLTLSDQSSKIKKGSKGDCTVLKPTLMAAVPEIMDRIYKNVMSKVQEMNYIQKTLFKIGYDYKLEQIKKGYDAPLCNLLLFKKVKALLGGNVRMMLSGGAPLSPQTHRFMNVCFCCPIGQGYGLTESCGAGTVTEVTDYTTGRVGAPLICCEIKLKDWQEGGYTINDKPNPRGEIVIGGQNISMGYFKNEEKTAEDYSVDENGQRWFCTGDIGEFHPDGCLQIIDRKKDLVKLQAGEYVSLGKVEAALKNCPLIDNICAFAKSDQSYVISFVVPNQKRLTLLAQQKGVEGTWVDICNNPAMEAEILKEIREAANAMKLERFEIPIKVRLSPEPWTPETGLVTDAFKLKRKELRNHYLKDIERMYGGK,670,NP_004449.1.csv,refseq-ACSL4-NM_004458.2_clinical_seed_0_final,refseq-ACSL4-NM_004458.2.a2m,Invitae,refseq-ACSL4-NM_004458.2.npy,1,670,670
+NP_004454.2,MHGHRAPGGAGPSEPEHPATNPPGAAPPACADSDPGASEPGLLARRGSGSALGGPLDPQFVGPSDTSLGAAPGHRVLPCGPSPQHHRALRFSYHLEGSQPRPGLHQGNRILVKSLSLDPGQSLEPHPEGPQRLRSDPGPPTETPSQRPSPLKRAPGPKPQVPPKPSYLQMPRMPPPLEPIPPPPSRPLPADPRVAKGLAPRAEASPSSAAVSSLIEKFEREPVIVASDRPVPGPSPGPPEPVMLPQPTSQPPVPQLPEGEASRCLFLLAPGPRDGEKVPNRDSGIDSISSPSNSEETCFVSDDGPPSHSLCPGPPALASVPVALADPHRPGSQEVDSDLEEEDDEEEEEEKDREIPVPLMERQESVELTVQQKVFHIANELLQTEKAYVSRLHLLDQVFCARLLEEARNRSSFPADVVHGIFSNICSIYCFHQQFLLPELEKRMEEWDRYPRIGDILQKLAPFLKMYGEYVKNFDRAVELVNTWTERSTQFKVIIHEVQKEEACGNLTLQHHMLEPVQRIPRYELLLKDYLLKLPHGSPDSKDAQKSLELIATAAEHSNAAIRKMERMHKLLKVYELLGGEEDIVSPTKELIKEGHILKLSAKNGTTQDRYLILFNDRLLYCVPRLRLLGQKFSVRARIDVDGMELKESSNLNLPRTFLVSGKQRSLELQARTEEEKKDWVQAINSTLLKHEQTLETFKLLNSTNREDEDTPPNSPNVDLGKRAPTPIREKEVTMCMRCQEPFNSITKRRHHCKACGHVVCGKCSEFRARLVYDNNRSNRVCTDCYVALHGVPGSSPACSQHTPQRRRSILEKQASVAAENSVICSFLHYMEKGGKGWHKAWFVVPENEPLVLYIYGAPQDVKAQRSLPLIGFEVGPPEAGERPDRRHVFKITQSHLSWYFSPETEELQRRWMAVLGRAGRGDTFCPGPTLSEDREMEEAPVAALGATAEPPESPQTRDKT,961,NP_004454.2.csv,refseq-FGD1-NM_004463.2_clinical_seed_0_final,refseq-FGD1-NM_004463.2.a2m,Invitae,refseq-FGD1-NM_004463.2.npy,1,961,961
+NP_004456.1,MWKWILTHCASAFPHLPGCCCCCFLLLFLVSSVPVTCQALGQDMVSPEATNSSSSSFSSPSSAGRHVRSYNHLQGDVRWRKLFSFTKYFLKIEKNGKVSGTKKENCPYSILEITSVEIGVVAVKAINSNYYLAMNKKGKLYGSKEFNNDCKLKERIEENGYNTYASFNWQHNGRQMYVALNGKGAPRRGQKTRRKNTSAHFLPMVVHS,208,NP_004456.1.csv,refseq-FGF10-NM_004465.1_clinical_seed_0_final,refseq-FGF10-NM_004465.1.a2m,Invitae,refseq-FGF10-NM_004465.1.npy,1,208,208
+NP_004472.1,MRRRSRMLLCFAFLWVLGIAYYMYSGGGSALAGGAGGGAGRKEDWNEIDPIKKKDLHHSNGEEKAQSMETLPPGKVRWPDFNQEAYVGGTMVRSGQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLIKLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKFTLNLQQ,571,NP_004472.1.csv,refseq-GALNT2-NM_004481.4_clinical_seed_0_final,refseq-GALNT2-NM_004481.4.a2m,Invitae,refseq-GALNT2-NM_004481.4.npy,1,571,571
+NP_004473.2,MAHLKRLVKLHIKRHYHKKFWKLGAVIFFFIIVLVLMQREVSVQYSKEESRMERNMKNKNKMLDLMLEAVNNIKDAMPKMQIGAPVRQNIDAGERPCLQGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSVEEQKEKERGEAKHCFNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSIVKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQLKACTYKGHKTVVTGEQIWEIQKDQLLYNPFLKMCLSANGEHPSLVSCNPSDPLQKWILSQND,633,NP_004473.2.csv,refseq-GALNT3-NM_004482.3_clinical_seed_0_final,refseq-GALNT3-NM_004482.3.a2m,Invitae,refseq-GALNT3-NM_004482.3.npy,1,633,633
+NP_004475.1,MAGTVRTACLVVAMLLSLDFPGQAQPPPPPPDATCHQVRSFFQRLQPGLKWVPETPVPGSDLQVCLPKGPTCCSRKMEEKYQLTARLNMEQLLQSASMELKFLIIQNAAVFQEAFEIVVRHAKNYTNAMFKNNYPSLTPQAFEFVGEFFTDVSLYILGSDINVDDMVNELFDSLFPVIYTQLMNPGLPDSALDINECLRGARRDLKVFGNFPKLIMTQVSKSLQVTRIFLQALNLGIEVINTTDHLKFSKDCGRMLTRMWYCSYCQGLMMVKPCGGYCNVVMQGCMAGVVEIDKYWREYILSLEELVNGMYRIYDMENVLLGLFSTIHDSIQYVQKNAGKLTTTIGKLCAHSQQRQYRSAYYPEDLFIDKKVLKVAHVEHEETLSSRRRELIQKLKSFISFYSALPGYICSHSPVAENDTLCWNGQELVERYSQKAARNGMKNQFNLHELKMKGPEPVVSQIIDKLKHINQLLRTMSMPKGRVLDKNLDEEGFESGDCGDDEDECIGGSGDGMIKVKNQLRFLAELAYDLDVDDAPGNSQQATPKDNEISTFHNLGNVHSPLKLLTSMAISVVCFFFLVH,580,NP_004475.1.csv,refseq-GPC3-NM_004484.3_clinical_seed_0_final,refseq-GPC3-NM_004484.3.a2m,Invitae,refseq-GPC3-NM_004484.3.npy,1,580,580
+NP_004482.4,MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECKMHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKFVSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQIATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDHLSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGKPWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAERDALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDGKMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQGQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCKSSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDLSREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIEPSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQDGFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFASYRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKPIPIFIERCIEYIEATGLSTEGIYRVSGNKSEMESLQRQFDQDHNLDLAEKDFTVNTVAGAMKSFFSELPDPLVPYNMQIDLVEAHKINDREQKLHALKEVLKKFPKENHEVFKYVISHLNKVSHNNKVNLMTSENLSICFWPTLMRPDFSTMDALTATRTYQTIIELFIQQCPFFFYNRPITEPPGARPSSPSAVASTVPFLTSTPVTSQPSPPQSPPPTPQSPMQPLLPSQLQAEHTL,1499,NP_004482.4.csv,refseq-ARHGAP35-NM_004491.4_clinical_seed_0_final,refseq-ARHGAP35-NM_004491.4.a2m,Invitae,refseq-ARHGAP35-NM_004491.4.npy,1,1499,1499
+NP_004484.1,MAAACRSVKGLVAVITGGASGLGLATAERLVGQGASAVLLDLPNSGGEAQAKKLGNNCVFAPADVTSEKDVQTALALAKGKFGRVDVAVNCAGIAVASKTYNLKKGQTHTLEDFQRVLDVNLMGTFNVIRLVAGEMGQNEPDQGGQRGVIINTASVAAFEGQVGQAAYSASKGGIVGMTLPIARDLAPIGIRVMTIAPGLFGTPLLTSLPEKVCNFLASQVPFPSRLGDPAEYAHLVQAIIENPFLNGEVIRLDGAIRMQP,261,NP_004484.1.csv,HCD2_HUMAN_b07_clinical_seed_0_final,HCD2_HUMAN_b07.a2m,EVE,HCD2_HUMAN_b07_theta_0.2.npy,1,261,261
+NP_004510.1,MGLKARRAAGAAGGGGDGGGGGGGAANPAGGDAAAAGDEERKVGLAPGDVEQVTLALGAGADKDGTLLLEGGGRDEGQRRTPQGIGLLAKTPLSRPVKRNNAKYRRIQTLIYDALERPRGWALLYHALVFLIVLGCLILAVLTTFKEYETVSGDWLLLLETFAIFIFGAEFALRIWAAGCCCRYKGWRGRLKFARKPLCMLDIFVLIASVPVVAVGNQGNVLATSLRSLRFLQILRMLRMDRRGGTWKLLGSAICAHSKELITAWYIGFLTLILSSFLVYLVEKDVPEVDAQGEEMKEEFETYADALWWGLITLATIGYGDKTPKTWEGRLIAATFSLIGVSFFALPAGILGSGLALKVQEQHRQKHFEKRRKPAAELIQAAWRYYATNPNRIDLVATWRFYESVVSFPFFRKEQLEAASSQKLGLLDRVRLSNPRGSNTKGKLFTPLNVDAIEESPSKEPKPVGLNNKERFRTAFRMKAYAFWQSSEDAGTGDPMAEDRGYGNDFPIEDMIPTLKAAIRAVRILQFRLYKKKFKETLRPYDVKDVIEQYSAGHLDMLSRIKYLQTRIDMIFTPGPPSTPKHKKSQKGSAFTFPSQQSPRNEPYVARPSTSEIEDQSMMGKFVKVERQVQDMGKKLDFLVDMHMQHMERLQVQVTEYYPTKGTSSPAEAEKKEDNRYSDLKTIICNYSETGPPEPPYSFHQVTIDKVSPYGFFAHDPVNLPRGGPSSGKVQATPPSSATTYVERPTVLPILTLLDSRVSCHSQADLQGPYSDRISPRQRRSITRDSDTPLSLMSVNHEELERSPSGFSISQDRDDYVFGPNGGSSWMREKRYLAEGETDTDTDPFTPSGSMPLSSTGDGISDSVWTPSNKPI,872,NP_004510.1.csv,refseq-KCNQ3-NM_004519.3_clinical_seed_0_final,refseq-KCNQ3-NM_004519.3.a2m,Invitae,refseq-KCNQ3-NM_004519.3.npy,1,872,872
+NP_004512.1,MADLAECNIKVMCRFRPLNESEVNRGDKYIAKFQGEDTVVIASKPYAFDRVFQSSTSQEQVYNDCAKKIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDPEGMGIIPRIVQDIFNYIYSMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLSVHEDKNRVPYVKGCTERFVCSPDEVMDTIDEGKSNRHVAVTNMNEHSSRSHSIFLINVKQENTQTEQKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGSTYVPYRDSKMTRILQDSLGGNCRTTIVICCSPSSYNESETKSTLLFGQRAKTIKNTVCVNVELTAEQWKKKYEKEKEKNKILRNTIQWLENELNRWRNGETVPIDEQFDKEKANLEAFTVDKDITLTNDKPATAIGVIGNFTDAERRKCEEEIAKLYKQLDDKDEEINQQSQLVEKLKTQMLDQEELLASTRRDQDNMQAELNRLQAENDASKEEVKEVLQALEELAVNYDQKSQEVEDKTKEYELLSDELNQKSATLASIDAELQKLKEMTNHQKKRAAEMMASLLKDLAEIGIAVGNNDVKQPEGTGMIDEEFTVARLYISKMKSEVKTMVKRCKQLESTQTESNKKMEENEKELAACQLRISQHEAKIKSLTEYLQNVEQKKRQLEESVDALSEELVQLRAQEKVHEMEKEHLNKVQTANEVKQAVEQQIQSHRETHQKQISSLRDEVEAKAKLITDLQDQNQKMMLEQERLRVEHEKLKATDQEKSRKLHELTVMQDRREQARQDLKGLEETVAKELQTLHNLRKLFVQDLATRVKKSAEIDSDDTGGSAAQKQKISFLENNLEQLTKVHKQLVRDNADLRCELPKLEKRLRATAERVKALESALKEAKENASRDRKRYQQEVDRIKEAVRSKNMARRGHSAQIAKPIRPGQHPAASPTHPSAIRGGGAFVQNSQPVAVRGGGGKQV,963,NP_004512.1.csv,refseq-KIF5B-NM_004521.2_clinical_seed_0_final,refseq-KIF5B-NM_004521.2.a2m,Invitae,refseq-KIF5B-NM_004521.2.npy,1,963,963
+NP_004513.1,MADPAECSIKVMCRFRPLNEAEILRGDKFIPKFKGDETVVIGQGKPYVFDRVLPPNTTQEQVYNACAKQIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDPQLMGIIPRIAHDIFDHIYSMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLAVHEDKNRVPYVKGCTERFVSSPEEVMDVIDEGKANRHVAVTNMNEHSSRSHSIFLINIKQENVETEKKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGTKTHVPYRDSKMTRILQDSLGGNCRTTIVICCSPSVFNEAETKSTLMFGQRAKTIKNTVSVNLELTAEEWKKKYEKEKEKNKTLKNVIQHLEMELNRWRNGEAVPEDEQISAKDQKNLEPCDNTPIIDNIAPVVAGISTEEKEKYDEEISSLYRQLDDKDDEINQQSQLAEKLKQQMLDQDELLASTRRDYEKIQEELTRLQIENEAAKDEVKEVLQALEELAVNYDQKSQEVEDKTRANEQLTDELAQKTTTLTTTQRELSQLQELSNHQKKRATEILNLLLKDLGEIGGIIGTNDVKTLADVNGVIEEEFTMARLYISKMKSEVKSLVNRSKQLESAQMDSNRKMNASERELAACQLLISQHEAKIKSLTDYMQNMEQKRRQLEESQDSLSEELAKLRAQEKMHEVSFQDKEKEHLTRLQDAEEMKKALEQQMESHREAHQKQLSRLRDEIEEKQKIIDEIRDLNQKLQLEQEKLSSDYNKLKIEDQEREMKLEKLLLLNDKREQAREDLKGLEETVSRELQTLHNLRKLFVQDLTTRVKKSVELDNDDGGGSAAQKQKISFLENNLEQLTKVHKQLVRDNADLRCELPKLEKRLRATAERVKALESALKEAKENAMRDRKRYQQEVDRIKEAVRAKNMARRAHSAQIAKPIRPGHYPASSPTAVHAIRGGGGSSSNSTHYQK,957,NP_004513.1.csv,refseq-KIF5C-NM_004522.3_clinical_seed_0_final,refseq-KIF5C-NM_004522.3.a2m,Invitae,refseq-KIF5C-NM_004522.3.npy,1,957,957
+NP_004514.2,MASQPNSSAKKKEEKGKNIQVVVRCRPFNLAERKASAHSIVECDPVRKEVSVRTGGLADKSSRKTYTFDMVFGASTKQIDVYRSVVCPILDEVIMGYNCTIFAYGQTGTGKTFTMEGERSPNEEYTWEEDPLAGIIPRTLHQIFEKLTDNGTEFSVKVSLLEIYNEELFDLLNPSSDVSERLQMFDDPRNKRGVIIKGLEEITVHNKDEVYQILEKGAAKRTTAATLMNAYSSRSHSVFSVTIHMKETTIDGEELVKIGKLNLVDLAGSENIGRSGAVDKRAREAGNINQSLLTLGRVITALVERTPHVPYRESKLTRILQDSLGGRTRTSIIATISPASLNLEETLSTLEYAHRAKNILNKPEVNQKLTKKALIKEYTEEIERLKRDLAAAREKNGVYISEENFRVMSGKLTVQEEQIVELIEKIGAVEEELNRVTELFMDNKNELDQCKSDLQNKTQELETTQKHLQETKLQLVKEEYITSALESTEEKLHDAASKLLNTVEETTKDVSGLHSKLDRKKAVDQHNAEAQDIFGKNLNSLFNNMEELIKDGSSKQKAMLEVHKTLFGNLLSSSVSALDTITTVALGSLTSIPENVSTHVSQIFNMILKEQSLAAESKTVLQELINVLKTDLLSSLEMILSPTVVSILKINSQLKHIFKTSLTVADKIEDQKKELDGFLSILCNNLHELQENTICSLVESQKQCGNLTEDLKTIKQTHSQELCKLMNLWTERFCALEEKCENIQKPLSSVQENIQQKSKDIVNKMTFHSQKFCADSDGFSQELRNFNQEGTKLVEESVKHSDKLNGNLEKISQETEQRCESLNTRTVYFSEQWVSSLNEREQELHNLLEVVSQCCEASSSDITEKSDGRKAAHEKQHNIFLDQMTIDEDKLIAQNLELNETIKIGLTKLNCFLEQDLKLDIPTGTTPQRKSYLYPSTLVRTEPREHLLDQLKRKQPELLMMLNCSENNKEETIPDVDVEEAVLGQYTEEPLSQEPSVDAGVDCSSIGGVPFFQHKKSHGKDKENRGINTLERSKVEETTEHLVTKSRLPLRAQINL,1056,NP_004514.2.csv,refseq-KIF11-NM_004523.3_clinical_seed_0_final,refseq-KIF11-NM_004523.3.a2m,Invitae,refseq-KIF11-NM_004523.3.npy,1,1056,1056
+NP_004517.2,MAESSESFTMASSPAQRRRGNDPLTSSPGRSSRRTDALTSSPGRDLPPFEDESEGLLGTEGPLEEEEDGEELIGDGMERDYRAIPELDAYEAEGLALDDEDVEELTASQREAAERAMRQRDREAGRGLGRMRRGLLYDSDEEDEERPARKRRQVERATEDGEEDEEMIESIENLEDLKGHSVREWVSMAGPRLEIHHRFKNFLRTHVDSHGHNVFKERISDMCKENRESLVVNYEDLAAREHVLAYFLPEAPAELLQIFDEAALEVVLAMYPKYDRITNHIHVRISHLPLVEELRSLRQLHLNQLIRTSGVVTSCTGVLPQLSMVKYNCNKCNFVLGPFCQSQNQEVKPGSCPECQSAGPFEVNMEETIYQNYQRIRIQESPGKVAAGRLPRSKDAILLADLVDSCKPGDEIELTGIYHNNYDGSLNTANGFPVFATVILANHVAKKDNKVAVGELTDEDVKMITSLSKDQQIGEKIFASIAPSIYGHEDIKRGLALALFGGEPKNPGGKHKVRGDINVLLCGDPGTAKSQFLKYIEKVSSRAIFTTGQGASAVGLTAYVQRHPVSREWTLEAGALVLADRGVCLIDEFDKMNDQDRTSIHEAMEQQSISISKAGIVTSLQARCTVIAAANPIGGRYDPSLTFSENVDLTEPIISRFDILCVVRDTVDPVQDEMLARFVVGSHVRHHPSNKEEEGLANGSAAEPAMPNTYGVEPLPQEVLKKYIIYAKERVHPKLNQMDQDKVAKMYSDLRKESMATGSIPITVRHIESMIRMAEAHARIHLRDYVIEDDVNMAIRVMLESFIDTQKFSVMRSMRKTFARYLSFRRDNNELLLFILKQLVAEQVTYQRNRFGAQQDTIEVPEKDLVDKARQINIHNLSAFYDSELFRMNKFSHDLKRKMILQQF,904,NP_004517.2.csv,refseq-MCM2-NM_004526.3_clinical_seed_0_final,refseq-MCM2-NM_004526.3.a2m,Invitae,refseq-MCM2-NM_004526.3.npy,1,904,904
+NP_004521.1,MEALMARGALTGPLRALCLLGCLLSHAAAAPSPIIKFPGDVAPKTDKELAVQYLNTFYGCPKESCNLFVLKDTLKKMQKFFGLPQTGDLDQNTIETMRKPRCGNPDVANYNFFPRKPKWDKNQITYRIIGYTPDLDPETVDDAFARAFQVWSDVTPLRFSRIHDGEADIMINFGRWEHGDGYPFDGKDGLLAHAFAPGTGVGGDSHFDDDELWTLGEGQVVRVKYGNADGEYCKFPFLFNGKEYNSCTDTGRSDGFLWCSTTYNFEKDGKYGFCPHEALFTMGGNAEGQPCKFPFRFQGTSYDSCTTEGRTDGYRWCGTTEDYDRDKKYGFCPETAMSTVGGNSEGAPCVFPFTFLGNKYESCTSAGRSDGKMWCATTANYDDDRKWGFCPDQGYSLFLVAAHEFGHAMGLEHSQDPGALMAPIYTYTKNFRLSQDDIKGIQELYGASPDIDLGTGPTPTLGPVTPEICKQDIVFDGIAQIRGEIFFFKDRFIWRTVTPRDKPMGPLLVATFWPELPEKIDAVYEAPQEEKAVFFAGNEYWIYSASTLERGYPKPLTSLGLPPDVQRVDAAFNWSKNKKTYIFAGDKFWRYNEVKKKMDPGFPKLIADAWNAIPDNLDAVVDLQGGGHSYFFKGAYYLKLENQSLKSVKFGSIKSDWLGC,660,NP_004521.1.csv,refseq-MMP2-NM_004530.5_clinical_seed_0_final,refseq-MMP2-NM_004530.5.a2m,Invitae,refseq-MMP2-NM_004530.5.npy,1,660,660
+NP_004526.1,MSLENEDKRARTRSKALRGPPETTAADLSCPTPGCTGSGHVRGKYSRHRSLQSCPLAKKRKLEGAEAEHLVSKRKSHPLKLALDEGYGVDSDGSEDTEVKDASVSDESEGTLEGAEAETSGQDEIHRPETAEGRSPVKSHFGSNPIGSATASSKGSYSSYQGIIATSLLNLGQIAEETLVEEDLGQAAKPGPGIVHLLQEAAEGAASEEGEKGLFIQPEDAEEVVEVTTERSQDLCPQSLEDAASEESSKQKGILSHEEEDEEEEEEEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAPDVIFQEDTSHTSAQKAPELRGPESPSPKPEYSVIVEVRSDDDKDEDTHSRKSTVTDESEMQDMMTRGNLGLLEQAIALKAEQVRTVCEPGCPPAEQSQLGLGEPGKAAKPLDTVRKSYYSKDPSRAEKREIKCPTPGCDGTGHVTGLYPHHRSLSGCPHKDRIPPEILAMHENVLKCPTPGCTGQGHVNSNRNTHRSLSGCPIAAAEKLAKSHEKQQPQTGDPSKSSSNSDRILRPMCFVKQLEVPPYGSYRPNVAPATPRANLAKELEKFSKVTFDYASFDAQVFGKRMLAPKIQTSETSPKAFQCFDYSQDAEAAHMAATAILNLSTRCWEMPENLSTKPQDLPSKSVDIEVDENGTLDLSMHKHRKRENAFPSSSSCSSSPGVKSPDASQRHSSTSAPSSSMTSPQSSQASRQDEWDRPLDYTKPSRLREEEPEESEPAAHSFASSEADDQEVSEENFEERKYPGEVTLTNFKLKFLSKDIKKELLTCPTPGCDGSGHITGNYASHRSLSGCPLADKSLRNLMAAHSADLKCPTPGCDGSGHITGNYASHRSLSGCPRAKKSGVKVAPTKDDKEDPELMKCPVPGCVGLGHISGKYASHRSASGCPLAARRQKEGSLNGSSFSWKSLKNEGPTCPTPGCDGSGHANGSFLTHRSLSGCPRATFAGKKGKLSGDEVLSPKFKTSDVLENDEEIKQLNQEIRDLNESNSEMEAAMVQLQSQISSMEKNLKNIEEENKLIEEQNEALFLELSGLSQALIQSLANIRLPHMEPICEQNFDAYVSTLTDMYSNQDPENKDLLESIKQAVRGIQV,1121,NP_004526.1.csv,refseq-MYT1-NM_004535.2_clinical_seed_0_final,refseq-MYT1-NM_004535.2.a2m,Invitae,refseq-MYT1-NM_004535.2.npy,1,1121,1121
+NP_004532.1,MWFEILPGLSVMGVCLLIPGLATAYIHRFTNGGKEKRVAHFGYHWSLMERDRRISGVDRYYVSKGLENID,70,NP_004532.1.csv,refseq-NDUFA1-NM_004541.3_clinical_seed_0_final,refseq-NDUFA1-NM_004541.3.a2m,Invitae,refseq-NDUFA1-NM_004541.3.npy,1,70,70
+NP_004535.1,MALRLLKLAATSASARVVAAGAQRVRGIHSSVQCKLRYGMWHFLLGDKASKRLTERSRVITVDGNICTGKGKLAKEIAEKLGFKHFPEAGIHYPDSTTGDGKPLATDYNGNCSLEKFYDDPRSNDGNSYRLQSWLYSSRLLQYSDALEHLLTTGQGVVLERSIFSDFVFLEAMYNQGFIRKQCVDHYNEVKSVTICDYLPPHLVIYIDVPVPEVQRRIQKKGDPHEMKITSAYLQDIENAYKKTFLPEMSEKCEVLQYSAREAQDSKKVVEDIEYLKFDKGPWLKQDNRTLYHLRLLVQDKFEVLNYTSIPIFLPEVTIGAHQTDRVLHQFRELPGRKYSPGYNTEVGDKWIWLK,355,NP_004535.1.csv,refseq-NDUFA10-NM_004544.3_clinical_seed_0_final,refseq-NDUFA10-NM_004544.3.a2m,Invitae,refseq-NDUFA10-NM_004544.3.npy,1,355,355
+NP_004541.1,MAALRALCGFRGVAAQVLRPGAGVRLPIQPSRGVRQWQPDVEWAQQFGGAVMYPSKETAHWKPPPWNDVDPPKDTIVKNITLNFGPQHPAAHGVLRLVMELSGEMVRKCDPHIGLLHRGTEKLIEYKTYLQALPYFDRLDYVSMMCNEQAYSLAVEKLLNIRPPPRAQWIRVLFGEITRLLNHIMAVTTHALDLGAMTPFFWLFEEREKMFEFYERVSGARMHAAYIRPGGVHQDLPLGLMDDIYQFSKNFSLRLDELEELLTNNRIWRNRTIDIGVVTAEEALNYGFSGVMLRGSGIQWDLRKTQPYDVYDQVEFDVPVGSRGDCYDRYLCRVEEMRQSLRIIAQCLNKMPPGEIKVDDAKVSPPKRAEMKTSMESLIHHFKLYTEGYQVPPGATYTAIEAPKGEFGVYLVSDGSSRPYRCKIKAPGFAHLAGLDKMSKGHMLADVVAIIGTQDIVFGEVDR,463,NP_004541.1.csv,refseq-NDUFS2-NM_004550.4_clinical_seed_0_final,refseq-NDUFS2-NM_004550.4.a2m,Invitae,refseq-NDUFS2-NM_004550.4.npy,1,463,463
+NP_004542.1,MAAAAVARLWWRGILGASALTRGTGRPSVLLLPVRRESAGADTRPTVRPRNDVAHKQLSAFGEYVAEILPKYVQQVQVSCFNELEVCIHPDGVIPVLTFLRDHTNAQFKSLVDLTAVDVPTRQNRFEIVYNLLSLRFNSRIRVKTYTDELTPIESAVSVFKAANWYEREIWDMFGVFFANHPDLRRILTDYGFEGHPFRKDFPLSGYVELRYDDEVKRVVAEPVELAQEFRKFDLNSPWEAFPVYRQPPESLKLEAGDKKPDAK,264,NP_004542.1.csv,refseq-NDUFS3-NM_004551.2_clinical_seed_0_final,refseq-NDUFS3-NM_004551.2.a2m,Invitae,refseq-NDUFS3-NM_004551.2.npy,1,264,264
+NP_004551.2,MARGSALPRRPLLCIPAVWAAAALLLSVSRTSGEVEVLDPNDPLGPLDGQDGPIPTLKGYFLNFLEPVNNITIVQGQTAILHCKVAGNPPPNVRWLKNDAPVVQEPRRIIIRKTEYGSRLRIQDLDTTDTGYYQCVATNGMKTITATGVLFVRLGPTHSPNHNFQDDYHEDGFCQPYRGIACARFIGNRTIYVDSLQMQGEIENRITAAFTMIGTSTHLSDQCSQFAIPSFCHFVFPLCDARSRTPKPRELCRDECEVLESDLCRQEYTIARSNPLILMRLQLPKCEALPMPESPDAANCMRIGIPAERLGRYHQCYNGSGMDYRGTASTTKSGHQCQPWALQHPHSHHLSSTDFPELGGGHAYCRNPGGQMEGPWCFTQNKNVRMELCDVPSCSPRDSSKMGILYILVPSIAIPLVIACLFFLVCMCRNKQKASASTPQRRQLMASPSQDMEMPLINQHKQAKLKEISLSAVRFMEELGEDRFGKVYKGHLFGPAPGEQTQAVAIKTLKDKAEGPLREEFRHEAMLRARLQHPNVVCLLGVVTKDQPLSMIFSYCSHGDLHEFLVMRSPHSDVGSTDDDRTVKSALEPPDFVHLVAQIAAGMEYLSSHHVVHKDLATRNVLVYDKLNVKISDLGLFREVYAADYYKLLGNSLLPIRWMAPEAIMYGKFSIDSDIWSYGVVLWEVFSYGLQPYCGYSNQDVVEMIRNRQVLPCPDDCPAWVYALMIECWNEFPSRRPRFKDIHSRLRAWGNLSNYNSSAQTSGASNTTQTSSLSTSPVSNVSNARYVGPKQKAPPFPQPQFIPMKGQIRPMVPPPQLYVPVNGYQPVPAYGAYLPNFYPVQIPMQMAPQQVPPQMVPKPSSHHSGSGSTSTGYVTTAPSNTSMADRAALLSEGADDTQNAPEDGAQSTVQEAEEEEEGSVPETELLGDCDTLQVDEAQVQLEA,943,NP_004551.2.csv,refseq-ROR2-NM_004560.3_clinical_seed_0_final,refseq-ROR2-NM_004560.3.a2m,Invitae,refseq-ROR2-NM_004560.3.npy,1,943,943
+NP_004553.2,MIVFVRFNSSHGFPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELRNDWTVQNCDLDQQSIVHIVQRPWRKGQEMNATGGDDPRNAAGGCEREPQSLTRVDLSSSVLPGDSVGLAVILHTDSRKDSPPAGSPAGRSIYNSFYVYCKGPCQRVQPGKLRVQCSTCRQATLTLTQGPSCWDDVLIPNRMSGECQSPHCPGTSAEFFFKCGAHPTSDKETSVALHLIATNSRNITCITCTDVRSPVLVFQCNSRHVICLDCFHLYCVTRLNDRQFVHDPQLGYSLPCVAGCPNSLIKELHHFRILGEEQYNRYQQYGAEECVLQMGGVLCPRPGCGAGLLPEPDQRKVTCEGGNGLGCGFAFCRECKEAYHEGECSAVFEASGTTTQAYRVDERAAEQARWEAASKETIKKTTKPCPRCHVPVEKNGGCMHMKCPQPQCRLEWCWNCGCEWNRVCMGDHWFDV,465,NP_004553.2.csv,refseq-PRKN-NM_004562.2_clinical_seed_0_final,refseq-PRKN-NM_004562.2.a2m,Invitae,refseq-PRKN-NM_004562.2.npy,1,465,465
+NP_004555.1,MAAPMLRWGCRGRRWAFARVDGGSCHRRGAPTGSTSNQIRGESSVAQQPLHTAQKTRKGEHKWAAVVGLEIHAQISSNSKLFSGSQVRFSAPPNSLVSFFDASLPGTLPVLNRRCVEAAVMTGLALNCHINKKSLFDRKHYFYADLPAGYQITQQRLPIAVNGSLIYGVCAGKKQSQVIPKTVRIKQIQLEQDSGKSLHDNLRSQTLIDLNRAGVGLLEVVLEPDMSCGEEAATAVRELQLILQALGTSQANMAEGQLRVDANISVHHPGEPLGVRTEVKNLNSIRFLAKAIDYEIQRQINELENGGEILNETRSFHHKLGCTMSMRDKEGKQDYRFMPEPNLPPLVLYDATSLPAGADPQQVINIDQIRETLPELPSVTREKLVQQYGMLLEHSFTLLNEVGLLEFFQNVIKETRAEPKKVTSWVLNTFLGYLKQQNLAVSESPVTPSALAELLDLLDSRTISSSAAKQVFEELWKREGKTPGQIVSEKQLELMQDQGALEQLCHSVMEAHPQVVMDVKNRNPRAINKLIGLVRKATQSRADPVMIKEILEKKLSL,557,NP_004555.1.csv,refseq-GATB-NM_004564.2_clinical_seed_0_final,refseq-GATB-NM_004564.2.a2m,Invitae,refseq-GATB-NM_004564.2.npy,1,557,557
+NP_004563.2,MAAPGAPAEYGYIRTVLGQQILGQLDSSSLALPSEAKLKLAGSSGRGGQTVKSLRIQEQVQQTLARKGRSSVGNGNLHRTSSVPEYVYNLHLVENDFVGGRSPVPKTYDMLKAGTTATYEGRWGRGTAQYSSQKSVEERSLRHPLRRLEISPDSSPERAHYTHSDYQYSQRSQAGHTLHHQESRRAALLVPPRYARSEIVGVSRAGTTSRQRHFDTYHRQYQHGSVSDTVFDSIPANPALLTYPRPGTSRSMGNLLEKENYLTAGLTVGQVRPLVPLQPVTQNRASRSSWHQSSFHSTRTLREAGPSVAVDSSGRRAHLTVGQAAAGGSGNLLTERSTFTDSQLGNADMEMTLERAVSMLEADHMLPSRISAAATFIQHECFQKSEARKRVNQLRGILKLLQLLKVQNEDVQRAVCGALRNLVFEDNDNKLEVAELNGVPRLLQVLKQTRDLETKKQITDHTVNLRSRNGWPGAVAHACNPSTLGGQGGRITRSGVRDQPDQHGLLWNLSSNDKLKNLMITEALLTLTENIIIPFSGWPEGDYPKANGLLDFDIFYNVTGCLRNMSSAGADGRKAMRRCDGLIDSLVHYVRGTIADYQPDDKATENCVCILHNLSYQLEAELPEKYSQNIYIQNRNIQTDNNKSIGCFGSRSRKVKEQYQDVPMPEEKSNPKGVEWLWHSIVIRMYLSLIAKSVRNYTQEASLGALQNLTAGSGPMPTSVAQTVVQKESGLQHTRKMLHVGDPSVKKTAISLLRNLSRNLSLQNEIAKETLPDLVSIIPDTVPSTDLLIETTASACYTLNNIIQNSYQNARDLLNTGGIQKIMAISAGDAYASNKASKAASVLLYSLWAHTELHHAYKKAQFKKTDFVNSRTAKAYHSLKD,881,NP_004563.2.csv,refseq-PKP2-NM_004572.3_clinical_seed_0_final,refseq-PKP2-NM_004572.3.a2m,Invitae,refseq-PKP2-NM_004572.3.npy,1,881,881
+NP_004571.2,MSDGDYDYLIKFLALGDSGVGKTSVLYQYTDGKFNSKFITTVGIDFREKRVVYRASGPDGATGRGQRIHLQLWDTAGQERFRSLTTAFFRDAMGFLLLFDLTNEQSFLNVRNWISQLQMHAYCENPDIVLCGNKSDLEDQRVVKEEEAIALAEKYGIPYFETSAANGTNISQAIEMLLDLIMKRMERCVDKSWIPEGVVRSNGHASTDQLSEEKEKGACGC,221,NP_004571.2.csv,refseq-RAB27A-NM_004580.4_clinical_seed_0_final,refseq-RAB27A-NM_004580.4.a2m,Invitae,refseq-RAB27A-NM_004580.4.npy,1,221,221
+NP_004577.1,MPLAQLADPWQKMAVESPSDSAENGQQIMDEPMGEEEINPQTEEVSIKEIAITHHVKEGHEKADPSQFELLKVLGQGSFGKVFLVKKISGSDARQLYAMKVLKKATLKVRDRVRTKMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFTEEDVKFYLAELALALDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKESIDHEKKAYSFCGTVEYMAPEVVNRRGHTQSADWWSFGVLMFEMLTGTLPFQGKDRKETMTMILKAKLGMPQFLSPEAQSLLRMLFKRNPANRLGAGPDGVEEIKRHSFFSTIDWNKLYRREIHPPFKPATGRPEDTFYFDPEFTAKTPKDSPGIPPSANAHQLFRGFSFVAITSDDESQAMQTVGVHSIVQQLHRNSIQFTDGYEVKEDIGVGSYSVCKRCIHKATNMEFAVKIIDKSKRDPTEEIEILLRYGQHPNIITLKDVYDDGKYVYVVTELMKGGELLDKILRQKFFSEREASAVLFTITKTVEYLHAQGVVHRDLKPSNILYVDESGNPESIRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKRQGYDAACDIWSLGVLLYTMLTGYTPFANGPDDTPEEILARIGSGKFSLSGGYWNSVSDTAKDLVSKMLHVDPHQRLTAALVLRHPWIVHWDQLPQYQLNRQDAPHLVKGAMAATYSALNRNQSPVLEPVGRSTLAQRRGIKKITSTAL,740,NP_004577.1.csv,refseq-RPS6KA3-NM_004586.2_clinical_seed_0_final,refseq-RPS6KA3-NM_004586.2.a2m,Invitae,refseq-RPS6KA3-NM_004586.2.npy,1,740,740
+NP_004580.1,MAMLVLVPGRVMRPLGGQLWRFLPRGLEFWGPAEGTARVLLRQFCARQAEAWRASGRPGYCLGTRPLSTARPPPPWSQKGPGDSTRPSKPGPVSWKSLAITFAIGGALLAGMKHVKKEKAEKLEKERQRHIGKPLLGGPFSLTTHTGERKTDKDYLGQWLLIYFGFTHCPDVCPEELEKMIQVVDEIDSITTLPDLTPLFISIDPERDTKEAIANYVKEFSPKLVGLTGTREEVDQVARAYRVYYSPGPKDEDEDYIVDHTIIMYLIGPDGEFLDYFGQNKRKGEIAASIATHMRPYRKKS,301,NP_004580.1.csv,refseq-SCO1-NM_004589.2_clinical_seed_0_final,refseq-SCO1-NM_004589.2.a2m,Invitae,refseq-SCO1-NM_004589.2_theta_0.2.npy,1,301,301
+NP_004586.2,MAAARHSTLDFMLGAKADGETILKGLQSIFQEQGMAESVHTWQDHGYLATYTNKNGSFANLRIYPHGLVLLDLQSYDGDAQGKEEIDSILNKVEERMKELSQDSTGRVKRLPPIVRGGAIDRYWPTADGRLVEYDIDEVVYDEDSPYQNIKILHSKQFGNILILSGDVNLAESDLAYTRAIMGSGKEDYTGKDVLILGGGDGGILCEIVKLKPKMVTMVEIDQMVIDGCKKYMRKTCGDVLDNLKGDCYQVLIEDCIPVLKRYAKEGREFDYVINDLTAVPISTSPEEDSTWEFLRLILDLSMKVLKQDGKYFTQGNCVNLTEALSLYEEQLGRLYCPVEFSKEIVCVPSYLELWVFYTVWKKAKP,366,NP_004586.2.csv,refseq-SMS-NM_004595.4_clinical_seed_0_final,refseq-SMS-NM_004595.4.a2m,Invitae,refseq-SMS-NM_004595.4.npy,1,366,366
+NP_004587.1,MAVPETRPNHTIYINNLNEKIKKDELKKSLYAIFSQFGQILDILVSRSLKMRGQAFVIFKEVSSATNALRSMQGFPFYDKPMRIQYAKTDSDIIAKMKGTFVERDRKREKRKPKSQETPATKKAVQGGGATPVVGAVQGPVPGMPPMTQAPRIMHHMPGQPPYMPPPGMIPPPGLAPGQIPPGAMPPQQLMPGQMPPAQPLSENPPNHILFLTNLPEETNELMLSMLFNQFPGFKEVRLVPGRHDIAFVEFDNEVQAGAARDALQGFKITQNNAMKISFAKK,282,NP_004587.1.csv,refseq-SNRPA-NM_004596.4_clinical_seed_0_final,refseq-SNRPA-NM_004596.4.a2m,Invitae,refseq-SNRPA-NM_004596.4.npy,1,282,282
+NP_004599.2,MYHPRELYPSLGAGYRLGPAQPGADSSFPPALAEGYRYPELDTPKLDCFLSGMEAAPRTLAAHPPLPLLPPAMGTEPAPSAPEALHSLPGVSLSLENRELWKEFSSVGTEMIITKAGRRMFPACRVSVTGLDPEARYLFLLDVIPVDGARYRWQGRRWEPSGKAEPRLPDRVYIHPDSPATGAHWMRQPVSFHRVKLTNSTLDPHGHLILHSMHKYQPRIHLVRAAQLCSQHWGGMASFRFPETTFISVTAYQNPQITQLKIAANPFAKGFRENGRNCKRERDARVKRKLRGPEPAATEAYGSGDTPGGPCDSTLGGDIRESDPEQAPAPGEATAAPAPLCGGPSAEAYLLHPAAFHGAPSHLPTRSPSFPEAPDSGRSAPYSAAFLELPHGSGGSGYPAAPPAVPFAPHFLQGGPFPLPYTAPGGYLDVGSKPMY,436,NP_004599.2.csv,refseq-TBX6-NM_004608.3_clinical_seed_0_final,refseq-TBX6-NM_004608.3.a2m,Invitae,refseq-TBX6-NM_004608.3.npy,1,436,436
+NP_004603.1,MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSSKTGSVTTTYCCNQDHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTTLKDLIYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKLRPNIPNRWQSCEALRVMAKIMRECWYANGAARLTALRIKKTLSQLSQQEGIKM,503,NP_004603.1.csv,refseq-TGFBR1-NM_004612.2_clinical_seed_0_final,refseq-TGFBR1-NM_004612.2.a2m,Invitae,refseq-TGFBR1-NM_004612.2.npy,1,503,503
+NP_004605.4,MLLWPLRGWAARALRCFGPGSRGSPASGPGPRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP,265,NP_004605.4.csv,refseq-TK2-NM_004614.4_clinical_seed_0_final,refseq-TK2-NM_004614.4.a2m,Invitae,refseq-TK2-NM_004614.4.npy,1,265,265
+NP_004612.2,MSQSPAFGPRRGSSPRGAAGAAARRNESQDYLLMDSELGEDGCPQAPLPCYGYYPCFRGSDNRLAHRRQTVLREKGRRLANRGPAYMFSDRSTSLSIEEERFLDAAEYGNIPVVRKMLEECHSLNVNCVDYMGQNALQLAVANEHLEITELLLKKENLSRVGDALLLAISKGYVRIVEAILSHPAFAEGKRLATSPSQSELQQDDFYAYDEDGTRFSHDVTPIILAAHCQEYEIVHTLLRKGARIERPHDYFCKCNDCNQKQKHDSFSHSRSRINAYKGLASPAYLSLSSEDPVMTALELSNELAVLANIEKEFKNDYKKLSMQCKDFVVGLLDLCRNTEEVEAILNGDVETLQSGDHGRPNLSRLKLAIKYEVKKFVAHPNCQQQLLSIWYENLSGLRQQTMAVKFLVVLAVAIGLPFLALIYWFAPCSKMGKIMRGPFMKFVAHAASFTIFLGLLVMNAADRFEGTKLLPNETSTDNAKQLFRMKTSCFSWMEMLIISWVIGMIWAECKEIWTQGPKEYLFELWNMLDFGMLAIFAASFIARFMAFWHASKAQSIIDANDTLKDLTKVTLGDNVKYYNLARIKWDPSDPQIISEGLYAIAVVLSFSRIAYILPANESFGPLQISLGRTVKDIFKFMVIFIMVFVAFMIGMFNLYSYYIGAKQNEAFTTVEESFKTLFWAIFGLSEVKSVVINYNHKFIENIGYVLYGVYNVTMVIVLLNMLIAMINSSFQEIEDDADVEWKFARAKLWFSYFEEGRTLPVPFNLVPSPKSLFYLLLKLKKWISELFQGHKKGFQEDAEMNKINEEKKLGILGSHEDLSKLSLDKKQVGHNKQPSIRSSEDFHLNSFNNPPRQYQKIMKRLIKRYVLQAQIDKESDEVNEGELKEIKQDISSLRYELLEEKSQNTEDLAELIRELGEKLSMEPNQEETNR,931,NP_004612.2.csv,refseq-TRPC6-NM_004621.5_clinical_seed_0_final,refseq-TRPC6-NM_004621.5.a2m,Invitae,refseq-TRPC6-NM_004621.5.npy,1,931,931
+NP_004616.2,MNRKARRCLGHLFLSLGMVYLRIGGFSSVVALGASIICNKIPGLAPRQRAICQSRPDAIIVIGEGSQMGLDECQFQFRNGRWNCSALGERTVFGKELKVGSREAAFTYAIIAAGVAHAITAACTQGNLSDCGCDKEKQGQYHRDEGWKWGGCSADIRYGIGFAKVFVDAREIKQNARTLMNLHNNEAGRKILEENMKLECKCHGVSGSCTTKTCWTTLPQFRELGYVLKDKYNEAVHVEPVRASRNKRPTFLKIKKPLSYRKPMDTDLVYIEKSPNYCEEDPVTGSVGTQGRACNKTAPQASGCDLMCCGRGYNTHQYARVWQCNCKFHWCCYVKCNTCSERTEMYTCK,349,NP_004616.2.csv,refseq-WNT7A-NM_004625.3_clinical_seed_0_final,refseq-WNT7A-NM_004625.3.a2m,Invitae,refseq-WNT7A-NM_004625.3.npy,1,349,349
+NP_004619.3,MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPVKPVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQDNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSGEASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPMTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLEFQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQKLKGGPKKTKREKKAAASHLFPFEQL,940,NP_004619.3.csv,refseq-XPC-NM_004628.5_clinical_seed_0_final,refseq-XPC-NM_004628.5.a2m,Invitae,refseq-XPC-NM_004628.5.npy,1,940,940
+NP_004620.1,MSRQTTSVGSSCLDLWREKNDRLVRQAKVAQNSGLTLRRQQLAQDALEGLRGLLHSLQGLPAAVPVLPLELTVTCNFIILRASLAQGFTEDQAQDIQRSLERVLETQEQQGPRLEQGLRELWDSVLRASCLLPELLSALHRLVGLQAALWLSADRLGDLALLLETLNGSQSGASKDLLLLLKTWSPPAEELDAPLTLQDAQGLKDVLLTAFAYRQGLQELITGNPDKALSSLHEAASGLCPRPVLVQVYTALGSCHRKMGNPQRALLYLVAALKEGSAWGPPLLEASRLYQQLGDTTAELESLELLVEALNVPCSSKAPQFLIEVELLLPPPDLASPLHCGTQSQTKHILASRCLQTGRAGDAAEHYLDLLALLLDSSEPRFSPPPSPPGPCMPEVFLEAAVALIQAGRAQDALTLCEELLSRTSSLLPKMSRLWEDARKGTKELPYCPLWVSATHLLQGQAWVQLGAQKVAISEFSRCLELLFRATPEEKEQGAAFNCEQGCKSDAALQQLRAAALISRGLEWVASGQDTKALQDFLLSVQMCPGNRDTYFHLLQTLKRLDRRDEATALWWRLEAQTKGSHEDALWSLPLYLESYLSWIRPSDRDAFLEEFRTSLPKSCDL,622,NP_004620.1.csv,refseq-FANCG-NM_004629.1_clinical_seed_0_final,refseq-FANCG-NM_004629.1.a2m,Invitae,refseq-FANCG-NM_004629.1.npy,1,622,622
+NP_004628.4,MTSRKKVLLKVIILGDSGVGKTSLMNQYVNKKFSNQYKATIGADFLTKEVMVDDRLVTMQIWDTAGQERFQSLGVAFYRGADCCVLVFDVTAPNTFKTLDSWRDEFLIQASPRDPENFPFVVLGNKIDLENRQVATKRAQAWCYSKNNIPYFETSAKEAINVEQAFQTIARNALKQETEVELYNEFPEPIKLDKNDRAKASAESCSC,207,NP_004628.4.csv,refseq-RAB7A-NM_004637.5_clinical_seed_0_final,refseq-RAB7A-NM_004637.5.a2m,Invitae,refseq-RAB7A-NM_004637.5.npy,1,207,207
+NP_004637.1,MALGTTLRASLLLLGLLTEGLAQLAIPASVPRGFWALPENLTVVEGASVELRCGVSTPGSAVQWAKDGLLLGPDPRIPGFPRYRLEGDPARGEFHLHIEACDLSDDAEYECQVGRSEMGPELVSPRVILSILVPPKLLLLTPEAGTMVTWVAGQEYVVNCVSGDAKPAPDITILLSGQTISDISANVNEGSQQKLFTVEATARVTPRSSDNRQLLVCEASSPALEAPIKASFTVNVLFPPGPPVIEWPGLDEGHVRAGQSLELPCVARGGNPLATLQWLKNGQPVSTAWGTEHTQAVARSVLVMTVRPEDHGAQLSCEAHNSVSAGTQEHGITLQVTFPPSAIIILGSASQTENKNVTLSCVSKSSRPRVLLRWWLGWRQLLPMEETVMDGLHGGHISMSNLTFLARREDNGLTLTCEAFSEAFTKETFKKSLILNVKYPAQKLWIEGPPEGQKLRAGTRVRLVCLAIGGNPEPSLMWYKDSRTVTESRLPQESRRVHLGSVEKSGSTFSRELVLVTGPSDNQAKFTCKAGQLSASTQLAVQFPPTNVTILANASALRPGDALNLTCVSVSSNPPVNLSWDKEGERLEGVAAPPRRAPFKGSAAARSVLLQVSSRDHGQRVTCRAHSAELRETVSSFYRLNVLYRPEFLGEQVLVVTAVEQGEALLPVSVSANPAPEAFNWTFRGYRLSPAGGPRHRILSSGALHLWNVTRADDGLYQLHCQNSEGTAEARLRLDVHYAPTIRALQDPTEVNVGGSVDIVCTVDANPILPGMFNWERLGEDEEDQSLDDMEKISRGPTGRLRIHHAKLAQAGAYQCIVDNGVAPPARRLLRLVVRFAPQVEHPTPLTKVAAAGDSTSSATLHCRARGVPNIVFTWTKNGVPLDLQDPRYTEHTYHQGGVHSSLLTIANVSAAQDYALFTCTATNALGSDQTNIQLVSISRPDPPSGLKVVSLTPHSVGLEWKPGFDGGLPQRFCIRYEALGTPGFHYVDVVPPQATTFTLTGLQPSTRYRVWLLASNALGDSGLADKGTQLPITTPGLHQPSGEPEDQLPTEPPSGPSGLPLLPVLFALGGLLLLSNASCVGGVLWQRRLRRLAEGISEKTEAGSEEDRVRNEYEESQWTGERDTQSSTVSTTEAEPYYRSLRDFSPQLPPTQEEVSYSRGFTGEDEDMAFPGHLYDEVERTYPPSGAWGPLYDEVQMGPWDLHWPEDTYQDPRGIYDQVAGDLDTLEPDSLPFELRGHLV,1241,NP_004637.1.csv,refseq-NPHS1-NM_004646.3_clinical_seed_0_final,refseq-NPHS1-NM_004646.3.a2m,Invitae,refseq-NPHS1-NM_004646.3.npy,1,1241,1241
+NP_004647.1,MNKGWLELESDPGLFTLLVEDFGVKGVQVEEIYDLQSKCQGPVYGFIFLFKWIEERRSRRKVSTLVDDTSVIDDDIVNNMFFAHQLIPNSCATHALLSVLLNCSSVDLGPTLSRMKDFTKGFSPESKGYAIGNAPELAKAHNSHARPEPRHLPEKQNGLSAVRTMEAFHFVSYVPITGRLFELDGLKVYPIDHGPWGEDEEWTDKARRVIMERIGLATAGEPYHDIRFNLMAVVPDRRIKYEARLHVLKVNRQTVLEALQQLIRVTQPELIQTHKSQESQLPEESKSASNKSPLVLEANRAPAASEGNHTDGAEEAAGSCAQAPSHSPPNKPKLVVKPPGSSLNGVHPNPTPIVQRLPAFLDNHNYAKSPMQEEEDLAAGVGRSRVPVRPPQQYSDDEDDYEDDEEDDVQNTNSALRYKGKGTGKPGALSGSADGQLSVLQPNTINVLAEKLKESQKDLSIPLSIKTSSGAGSPAVAVPTHSQPSPTPSNESTDTASEIGSAFNSPLRSPIRSANPTRPSSPVTSHISKVLFGEDDSLLRVDCIRYNRAVRDLGPVISTGLLHLAEDGVLSPLALTEGGKGSSPSIRPIQGSQGSSSPVEKEVVEATDSREKTGMVRPGEPLSGEKYSPKELLALLKCVEAEIANYEACLKEEVEKRKKFKIDDQRRTHNYDEFICTFISMLAQEGMLANLVEQNISVRRRQGVSIGRLHKQRKPDRRKRSRPYKAKRQ,729,NP_004647.1.csv,refseq-BAP1-NM_004656.3_clinical_seed_0_final,refseq-BAP1-NM_004656.3.a2m,Invitae,refseq-BAP1-NM_004656.3.npy,1,729,729
+NP_004654.1,MGTRDDEYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLESKSTIGVEFATRSIQVDGKTIKAQIWDTAGQERYRAITSAYYRGAVGALLVYDIAKHLTYENVERWLKELRDHADSNIVIMLVGNKSDLRHLRAVPTDEARAFAEKNGLSFIETSALDSTNVEAAFQTILTEIYRIVSQKQMSDRRENDMSPSNNVVPIHVPPTTENKPKVQCCQNI,216,NP_004654.1.csv,refseq-RAB11A-NM_004663.4_clinical_seed_0_final,refseq-RAB11A-NM_004663.4.a2m,Invitae,refseq-RAB11A-NM_004663.4.npy,1,216,216
+NP_004665.2,MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEAEGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGICTKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQSRTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP,628,NP_004665.2.csv,refseq-ASH2L-NM_004674.4_clinical_seed_0_final,refseq-ASH2L-NM_004674.4.a2m,Invitae,refseq-ASH2L-NM_004674.4.npy,1,628,628
+NP_004688.2,MASSRASSTQATKTKAPDDLVAPVVKKPHIYYGSLEEKERERLAKGESGILGKDGLKAGIEAGNINITSGEVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERRERLRNILSVVGTDALKKTKKDDEKSKKSKEEYQQTWYHEGPNSLKVARLWIANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFSPNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDPKDVNLASCAADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILHQEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGYHIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHPGWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLWMAE,522,NP_004688.2.csv,refseq-PRPF4-NM_004697.4_clinical_seed_0_final,refseq-PRPF4-NM_004697.4.a2m,Invitae,refseq-PRPF4-NM_004697.4.npy,1,522,522
+NP_004689.1,MALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDKKKAADHLKPFLDDSTLRFVDKLFEAVEEGRSSRHSKSSSDRSRKRELKEVFGDDSEISKESSGVKKRRIPRFEEVEEEPEVIPGPPSESPGMLTKLQIKQMMEAATRQIEERKKQLSFISPPTPQPKTPSSSQPERLPIGNTIQPSQAATFMNDAIEKARKAAELQARIQAQLALKPGLIGNANMVGLANLHAMGIAPPKVELKDQTKPTPLILDEQGRTVDATGKEIELTHRMPTLKANIRAVKREQFKQQLKEKPSEDMESNTFFDPRVSIAPSQRQRRTFKFHDKGKFEKIAQRLRTKAQLEKLQAEISQAARKTGIHTSTRLALIAPKKELKEGDIPEIEWWDSYIIPNGFDLTEENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLTAEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGPKAQKKFKRLMLHRIKWDEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALSESVLESTD,683,NP_004689.1.csv,refseq-PRPF3-NM_004698.2_clinical_seed_0_final,refseq-PRPF3-NM_004698.2.a2m,Invitae,refseq-PRPF3-NM_004698.2.npy,1,683,683
+NP_004691.2,MAEAPPRRLGLGPPPGDAPRAELVALTAVQSEQGEAGGGGSPRRLGLLGSPLPPGAPLPGPGSGSGSACGQRSSAAHKRYRRLQNWVYNVLERPRGWAFVYHVFIFLLVFSCLVLSVLSTIQEHQELANECLLILEFVMIVVFGLEYIVRVWSAGCCCRYRGWQGRFRFARKPFCVIDFIVFVASVAVIAAGTQGNIFATSALRSMRFLQILRMVRMDRRGGTWKLLGSVVYAHSKELITAWYIGFLVLIFASFLVYLAEKDANSDFSSYADSLWWGTITLTTIGYGDKTPHTWLGRVLAAGFALLGISFFALPAGILGSGFALKVQEQHRQKHFEKRRMPAANLIQAAWRLYSTDMSRAYLTATWYYYDSILPSFRELALLFEHVQRARNGGLRPLEVRRAPVPDGAPSRYPPVATCHRPGSTSFCPGESSRMGIKDRIRMGSSQRRTGPSKQHLAPPTMPTSPSSEQVGEATSPTKVQKSWSFNDRTRFRASLRLKPRTSAEDAPSEEVAEEKSYQCELTVDDIMPAVKTVIRSIRILKFLVAKRKFKETLRPYDVKDVIEQYSAGHLDMLGRIKSLQTRVDQIVGRGPGDRKAREKGDKGPSDAEVVDEISMMGRVVKVEKQVQSIEHKLDLLLGFYSRCLRSGTSASLGAVQVPLFDPDITSDYHSPVDHEDISVSAQTLSISRSVSTNMD,695,NP_004691.2.csv,refseq-KCNQ4-NM_004700.3_clinical_seed_0_final,refseq-KCNQ4-NM_004700.3.a2m,Invitae,refseq-KCNQ4-NM_004700.3.npy,1,695,695
+NP_004704.3,MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDFKATLLLESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAEPLLTLERLTEIVASAPKGELLKRVLNPLLPYGPALIEHCLLENGFSGNVKVDEKLETKDIEKVLVSLQKAEDYMKTTSNFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDMETLASCTSELISEEMEQLDGGDTSSDEDKEEHETPVEVELMTQVDQEDITLQSGRDELNEELIQEESSEDEGEYEEVRKDQDSVGEMKDEGEETLNYPDTTIDLSHLQPQRSIQKLASKEESSNSSDSKSQSRRHLSAKERREMKKKKLPSDSGDLEALEGKDKEKESTVHIETHQNTSKNVAAVQPMKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGSNKEEKGKKGKKGKTKDEPVKKQPQKPRGGQRVSDNIKKETPFLEVITHELQDFAVDDPHDDKEEQDLDQQGNEENLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKAAKTALNSFMHSKEATAREKDLFRSVKDTDLSRNIPGKVKVSAPNLLNVKRK,1076,NP_004704.3.csv,NP_004704.3_clinical_seed_0_final,NP_004704.3.a2m,popEVE,NP_004704.3_theta_0.2.npy,1,1076,1076
+NP_004705.1,MAVPPGHGPFSGFPGPQEHTQVLPDVRLLPRRLPLAFRDATSAPLRKLSVDLIKTYKHINEVYYAKKKRRAQQAPPQDSSNKKEKKVLNHGYDDDNHDYIVRSGERWLERYEIDSLIGKGSFGQVVKAYDHQTQELVAIKIIKNKKAFLNQAQIELRLLELMNQHDTEMKYYIVHLKRHFMFRNHLCLVFELLSYNLYDLLRNTHFRGVSLNLTRKLAQQLCTALLFLATPELSIIHCDLKPENILLCNPKRSAIKIVDFGSSCQLGQRIYQYIQSRFYRSPEVLLGTPYDLAIDMWSLGCILVEMHTGEPLFSGSNEVDQMNRIVEVLGIPPAAMLDQAPKARKYFERLPGGGWTLRRTKELRKDYQGPGTRRLQEVLGVQTGGPGGRRAGEPGHSPADYLRFQDLVLRMLEYEPAARISPLGALQHGFFRRTADEATNTGPAGSSASTSPAPLDTCPSSSTASSISSSGGSSGSSSDNRTYRYSNRYCGGPGPPITDCEMNSPQVPPSQPLRPWAGGDVPHKTHQAPASASSLPGTGAQLPPQPRYLGRPPSPTSPPPPELMDVSLVGGPADCSPPHPAPAPQHPAASALRTRMTGGRPPLPPPDDPATLGPHLGLRGVPQSTAASS,629,NP_004705.1.csv,refseq-DYRK1B-NM_004714.2_clinical_seed_0_final,refseq-DYRK1B-NM_004714.2.a2m,Invitae,refseq-DYRK1B-NM_004714.2.npy,1,629,629
+NP_004713.2,MISQFFILSSKGDPLIYKDFRGDSGGRDVAELFYRKLTGLPGDESPVVMHHHGRHFIHIRHSGLYLVVTTSENVSPFSLLELLSRLATLLGDYCGSLGEGTISRNVALVYELLDEVLDYGYVQTTSTEMLRNFIQTEAVVSKPFSLFDLSSVGLFGAETQQSKVAPSSAASRPVLSSRSDQSQKNEVFLDVVERLSVLIASNGSLLKVDVQGEIRLKSFLPSGSEMRIGLTEEFCVGKSELRGYGPGIRVDEVSFHSSVNLDEFESHRILRLQPPQGELTVMRYQLSDDLPSPLPFRLFPSVQWDRGSGRLQVYLKLRCDLLSKSQALNVRLHLPLPRGVVSLSQELSSPEQKAELAEGALRWDLPRVQGGSQLSGLFQMDVPGPPGPPSHGLSTSASPLGLGPASLSFELPRHTCSGLQVRFLRLAFRPCGNANPHKWVRHLSHSDAYVIRI,453,NP_004713.2.csv,refseq-AP4M1-NM_004722.3_clinical_seed_0_final,refseq-AP4M1-NM_004722.3.a2m,Invitae,refseq-AP4M1-NM_004722.3.npy,1,453,453
+NP_004728.1,MLGICRGRRKFLAASLSLLCIPAITWIYLFSGSFEDGKPVSLSPLESQAHSPRYTASSQRERESLEVRMREVEEENRALRRQLSLAQGRAPSHRRGNHSKTYSMEEGTGDSENLRAGIVAGNSSECGQQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQCYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQKQLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDLANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNKVAHIMELDVQEYEFIVLPNAYMIHMPHAPSFDITKFRSNKQYRICLKTLKEEFQQDMSRRYGFAALKYLTAENNS,756,NP_004728.1.csv,refseq-LARGE-NM_004737.4_clinical_seed_0_final,refseq-LARGE-NM_004737.4.a2m,Invitae,refseq-LARGE-NM_004737.4.npy,1,756,756
+NP_004729.1,MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGASINVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVWKEAKPEDLMDSKLRCVFELPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL,243,NP_004729.1.csv,refseq-VAPB-NM_004738.4_clinical_seed_0_final,refseq-VAPB-NM_004738.4.a2m,Invitae,refseq-VAPB-NM_004738.4.npy,1,243,243
+NP_004735.2,MKNPMLEVVSLLLEKLLLISNFTLFSSGAAGEDKGRNSFYETSSFHRGDVLEVPRTHLTHYGIYLGDNRVAHMMPDILLALTDDMGRTQKVVSNKRLILGVIVKVASIRVDTVEDFAYGANILVNHLDESLQKKALLNEEVARRAEKLLGFTPYSLLWNNCEHFVTYCRYGTPISPQSDKFCETVKIIIRDQRSVLASAVLGLASIVCTGLVSYTTLPAIFIPFFLWMAG,230,NP_004735.2.csv,refseq-LRAT-NM_004744.4_clinical_seed_0_final,refseq-LRAT-NM_004744.4.a2m,Invitae,refseq-LRAT-NM_004744.4.npy,1,230,230
+NP_004741.1,MPAGRRGPAAQSARRPPPLLPLLLLLCVLGAPRAGSGAHTAVISPQDPTLLIGSSLLATCSVHGDPPGATAEGLYWTLNGRRLPPELSRVLNASTLALALANLNGSRQRSGDNLVCHARDGSILAGSCLYVGLPPEKPVNISCWSKNMKDLTCRWTPGAHGETFLHTNYSLKYKLRWYGQDNTCEEYHTVGPHSCHIPKDLALFTPYEIWVEATNRLGSARSDVLTLDILDVVTTDPPPDVHVSRVGGLEDQLSVRWVSPPALKDFLFQAKYQIRYRVEDSVDWKVVDDVSNQTSCRLAGLKPGTVYFVQVRCNPFGIYGSKKAGIWSEWSHPTAASTPRSERPGPGGGACEPRGGEPSSGPVRRELKQFLGWLKKHAYCSNLSFRLYDQWRAWMQKSHKTRNQDEGILPSGRRGTARGPAR,422,NP_004741.1.csv,refseq-CRLF1-NM_004750.4_clinical_seed_0_final,refseq-CRLF1-NM_004750.4.a2m,Invitae,refseq-CRLF1-NM_004750.4.npy,1,422,422
+NP_004743.1,MPAAAVQEAVGVCSYGMQLSWDINDPQMPQELALFDQFREWPDGYVRFIYSSDEKKAQRHLSGWAMRNTNNHNGHILKKSCLGVVVCTQACTLPDGSRLQLRPAICDKARLKQQKKACPNCHSALELIPCRGHSGYPVTNFWRLDGNAIFFQAKGVHDHPRPESKSETEARRSAIKRQMASFYQPQKKRIRESEAEENQDSSGHFSNIPPLENPEDFDIVTETSFPIPGQPCPSFPKSDVYKATCDLATFQGDKMPPFQKYSSPRIYLPRPPCSYELANPGYTNSSPYPTLYKDSTSIPNDTDWVHLNTLQCNVNSYSSYERSFDFTNKQHGWKPALGKPSLVERTNHGQFQAMATRPYYNPELPCRYLTTPPPGAPALQTVITTTTKVSYQAYQPPAMKYSDSVREVKSLSSCNYAPEDTGMSVYPEPWGPPVTVTRAASPSGPPPMKIAGDCRAIRPTVAIPHEPVSSRTDEAETWDVCLSGLGSAVSYSDRVGPFFTYNNEDF,506,NP_004743.1.csv,refseq-GCM2-NM_004752.3_clinical_seed_0_final,refseq-GCM2-NM_004752.3.a2m,Invitae,refseq-GCM2-NM_004752.3_theta_0.2.npy,1,506,506
+NP_004762.2,MKVLPASGLAVFLIMALKFSTAAPSLVAASPRTWRNNYRLAQAYLDKYYTNKEGHQIGEMVARGSNSMIRKIKELQAFFGLQVTGKLDQTTMNVIKKPRCGVPDVANYRLFPGEPKWKKNTLTYRISKYTPSMSSVEVDKAVEMALQAWSSAVPLSFVRINSGEADIMISFENGDHGDSYPFDGPRGTLAHAFAPGEGLGGDTHFDNAEKWTMGTNGFNLFTVAAHEFGHALGLAHSTDPSALMYPTYKYKNPYGFHLPKDDVKGIQALYGPRKVFLGKPTLPHAPHHKPSIPDLCDSSSSFDAVTMLGKELLLFKDRIFWRRQVHLRTGIRPSTITSSFPQLMSNVDAAYEVAERGTAYFFKGPHYWITRGFQMQGPPRTIYDFGFPRHVQQIDAAVYLREPQKTLFFVGDEYYSYDERKRKMEKDYPKNTEEEFSGVNGQIDAAVELNGYIYFFSGPKTYKYDTEKEDVVSVVKSSSWIGC,483,NP_004762.2.csv,refseq-MMP20-NM_004771.3_clinical_seed_0_final,refseq-MMP20-NM_004771.3.a2m,Invitae,refseq-MMP20-NM_004771.3.npy,1,483,483
+NP_004764.1,MASLKCSTVVCVICLEKPKYRCPACRVPYCSVVCFRKHKEQCNPETRPVEKKIRSALPTKTVKPVENKDDDDSIADFLNSDEEEDRVSLQNLKNLGESATLRSLLLNPHLRQLMVNLDQGEDKAKLMRAYMQEPLFVEFADCCLGIVEPSQNEES,155,NP_004764.1.csv,refseq-ZNHIT3-NM_004773.4_clinical_seed_0_final,refseq-ZNHIT3-NM_004773.4.a2m,Invitae,refseq-ZNHIT3-NM_004773.4_theta_0.2.npy,1,155,155
+NP_004773.1,MSAYPKSYNPFDDDGEDEGARPAPWRDARDLPDGPDAPADRQQYLRQEVLRRAEATAASTSRSLALMYESEKVGVASSEELARQRGVLERTEKMVDKMDQDLKISQKHINSIKSVFGGLVNYFKSKPVETPPEQNGTLTSQPNNRLKEAISTSKEQEAKYQASHPNLRKLDDTDPVPRGAGSAMSTDAYPKNPHLRAYHQKIDSNLDELSMGLGRLKDIALGMQTEIEEQDDILDRLTTKVDKLDVNIKSTERKVRQL,258,NP_004773.1.csv,refseq-SNAP29-NM_004782.3_clinical_seed_0_final,refseq-SNAP29-NM_004782.3.a2m,Invitae,refseq-SNAP29-NM_004782.3.npy,1,258,258
+NP_004784.2,MAASTGYVRLWGAARCWVLRRPMLAAAGGRVPTAAGAWLLRGQRTCDASPPWALWGRGPAIGGQWRGFWEASSRGGGAFSGGEDASEGGAEEGAGGAGGSAGAGEGPVITALTPMTIPDVFPHLPLIAITRNPVFPRFIKIIEVKNKKLVELLRRKVRLAQPYVGVFLKRDDSNESDVVESLDEIYHTGTFAQIHEMQDLGDKLRMIVMGHRRVHISRQLEVEPEEPEAENKHKPRRKSKRGKKEAEDELSARHPAELAMEPTPELPAEVLMVEVENVVHEDFQVTEEVKALTAEIVKTIRDIIALNPLYRESVLQMMQAGQRVVDNPIYLSDMGAALTGAESHELQDVLEETNIPKRLYKALSLLKKEFELSKLQQRLGREVEEKIKQTHRKYLLQEQLKIIKKELGLEKDDKDAIEEKFRERLKELVVPKHVMDVVDEELSKLGLLDNHSSEFNVTRNYLDWLTSIPWGKYSNENLDLARAQAVLEEDHYGMEDVKKRILEFIAVSQLRGSTQGKILCFYGPPGVGKTSIARSIARALNREYFRFSVGGMTDVAEIKGHRRTYVGAMPGKIIQCLKKTKTENPLILIDEVDKIGRGYQGDPSSALLELLDPEQNANFLDHYLDVPVDLSKVLFICTANVTDTIPEPLRDRMEMINVSGYVAQEKLAIAERYLVPQARALCGLDESKAKLSSDVLTLLIKQYCRESGVRNLQKQVEKVLRKSAYKIVSGEAESVEVTPENLQDFVGKPVFTVERMYDVTPPGVVMGLAWTAMGGSTLFVETSLRRPQDKDAKGDKDGSLEVTGQLGEVMKESARIAYTFARAFLMQHAPANDYLVTSHIHLHVPEGATPKDGPSAGCTIVTALLSLAMGRPVRQNLAMTGEVSLTGKILPVGGIKEKTIAAKRAGVTCIVLPAENKKDFYDLAAFITEGLEVHFVEHYREIFDIAFPDEQAEALAVER,959,NP_004784.2.csv,refseq-LONP1-NM_004793.3_clinical_seed_0_final,refseq-LONP1-NM_004793.3.a2m,Invitae,refseq-LONP1-NM_004793.3.npy,1,959,959
+NP_004786.2,MPASAPPRRPRPPPPSLSLLLVLLGLGGRRLRAEPGDGAQTWARFSRPPAPEAAGLFQGTFPDGFLWAVGSAAYQTEGGWQQHGKGASIWDTFTHHPLAPPGDSRNASLPLGAPSPLQPATGDVASDSYNNVFRDTEALRELGVTHYRFSISWARVLPNGSAGVPNREGLRYYRRLLERLRELGVQPVVTLYHWDLPQRLQDAYGGWANRALADHFRDYAELCFRHFGGQVKYWITIDNPYVVAWHGYATGRLAPGIRGSPRLGYLVAHNLLLAHAKVWHLYNTSFRPTQGGQVSIALSSHWINPRRMTDHSIKECQKSLDFVLGWFAKPVFIDGDYPESMKNNLSSILPDFTESEKKFIKGTADFFALCFGPTLSFQLLDPHMKFRQLESPNLRQLLSWIDLEFNHPQIFIVENGWFVSGTTKRDDAKYMYYLKKFIMETLKAIKLDGVDVIGYTAWSLMDGFEWHRGYSIRRGLFYVDFLSQDKMLLPKSSALFYQKLIEKNGFPPLPENQPLEGTFPCDFAWGVVDNYIQVDTTLSQFTDLNVYLWDVHHSKRLIKVDGVVTKKRKSYCVDFAAIQPQIALLQEMHVTHFRFSLDWALILPLGNQSQVNHTILQYYRCMASELVRVNITPVVALWQPMAPNQGLPRLLARQGAWENPYTALAFAEYARLCFQELGHHVKLWITMNEPYTRNMTYSAGHNLLKAHALAWHVYNEKFRHAQNGKISIALQADWIEPACPFSQKDKEVAERVLEFDIGWLAEPIFGSGDYPWVMRDWLNQRNNFLLPYFTEDEKKLIQGTFDFLALSHYTTILVDSEKEDPIKYNDYLEVQEMTDITWLNSPSQVAVVPWGLRKVLNWLKFKYGDLPMYIISNGIDDGLHAEDDQLRVYYMQNYINEALKAHILDGINLCGYFAYSFNDRTAPRFGLYRYAADQFEPKASMKHYRKIIDSNGFPGPETLERFCPEEFTVCTECSFFHTRKSLLAFIAFLFFASIISLSLIFYYSKKGRRSYK,1012,NP_004786.2.csv,refseq-KL-NM_004795.3_clinical_seed_0_final,refseq-KL-NM_004795.3.a2m,Invitae,refseq-KL-NM_004795.3.npy,1,1012,1012
+NP_004804.2,MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLLNDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALVQLAKAVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQNTPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQRSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARILFLLQLLADHVPGVGLVTRPLMDYLPTWQKIYFYSWG,336,NP_004804.2.csv,refseq-PEX16-NM_004813.4_clinical_seed_0_final,refseq-PEX16-NM_004813.4.a2m,Invitae,refseq-PEX16-NM_004813.4_theta_0.2.npy,1,336,336
+NP_004808.2,MPVRGDRGFPPRRELSGWLRAPGMEELIWEQYTVTLQKDSKRGFGIAVSGGRDNPHFENGETSIVISDVLPGGPADGLLQENDRVVMVNGTPMEDVLHSFAVQQLRKSGKVAAIVVKRPRKVQVAALQASPPLDQDDRAFEVMDEFDGRSFRSGYSERSRLNSHGGRSRSWEDSPERGRPHERARSRERDLSRDRSRGRSLERGLDQDHARTRDRSRGRSLERGLDHDFGPSRDRDRDRSRGRSIDQDYERAYHRAYDPDYERAYSPEYRRGARHDARSRGPRSRSREHPHSRSPSPEPRGRPGPIGVLLMKSRANEEYGLRLGSQIFVKEMTRTGLATKDGNLHEGDIILKINGTVTENMSLTDARKLIEKSRGKLQLVVLRDSQQTLINIPSLNDSDSEIEDISEIESNRSFSPEERRHQYSDYDYHSSSEKLKERPSSREDTPSRLSRMGATPTPFKSTGDIAGTVVPETNKEPRYQEDPPAPQPKAAPRTFLRPSPEDEAIYGPNTKMVRFKKGDSVGLRLAGGNDVGIFVAGIQEGTSAEQEGLQEGDQILKVNTQDFRGLVREDAVLYLLEIPKGEMVTILAQSRADVYRDILACGRGDSFFIRSHFECEKETPQSLAFTRGEVFRVVDTLYDGKLGNWLAVRIGNELEKGLIPNKSRAEQMASVQNAQRDNAGDRADFWRMRGQRSGVKKNLRKSREDLTAVVSVSTKFPAYERVLLREAGFKRPVVLFGPIADIAMEKLANELPDWFQTAKTEPKDAGSEKSTGVVRLNTVRQIIEQDKHALLDVTPKAVDLLNYTQWFPIVIFFNPDSRQGVKTMRQRLNPTSNKSSRKLFDQANKLKKTCAHLFTATINLNSANDSWFGSLKDTIQHQQGEAVWVSEGKMEGMDDDPEDRMSYLTAMGADYLSCDSRLISDFEDTDGEGGAYTDNELDEPAEEPLVSSITRSSEPVQHEESIRKPSPEPRAQMRRAASSDQLRDNSPPPAFKPEPPKAKTQNKEESYDFSKSYEYKSNPSAVAGNETPGASTKGYPPPVAAKPTFGRSILKPSTPIPPQEGEEVGESSEEQDNAPKSVLGKVKIFEKMDHKARLQRMQELQEAQNARIEIAQKHPDIYAVPIKTHKPDPGTPQHTSSRPPEPQKAPSRPYQDTRGSYGSDAEEEEYRQQLSEHSKRGYYGQSARYRDTEL,1190,NP_004808.2.csv,refseq-TJP2-NM_004817.3_clinical_seed_0_final,refseq-TJP2-NM_004817.3.a2m,Invitae,refseq-TJP2-NM_004817.3.npy,1,1190,1190
+NP_004809.2,MAGELADKKDRDASPSKEERKRSRTPDRERDRDRDRKSSPSKDRKRHRSRDRRRGGSRSRSRSRSKSAERERRHKERERDKERDRNKKDRDRDKDGHRRDKDRKRSSLSPGRGKDFKSRKDRDSKKDEEDEHGDKKPKAQPLSLEELLAKKKAEEEAEAKPKFLSKAEREAEALKRRQQEVEERQRMLEEERKKRKQFQDLGRKMLEDPQERERRERRERMERETNGNEDEEGRQKIREEKDKSKELHAIKERYLGGIKKRRRTRHLNDRKFVFEWDASEDTSIDYNPLYKERHQVQLLGRGFIAGIDLKQQKREQSRFYGDLMEKRRTLEEKEQEEARLRKLRKKEAKQRWDDRHWSQKKLDEMTDRDWRIFREDYSITTKGGKIPNPIRSWKDSSLPPHILEVIDKCGYKEPTPIQRQAIPIGLQNRDIIGVAETGSGKTAAFLIPLLVWITTLPKIDRIEESDQGPYAIILAPTRELAQQIEEETIKFGKPLGIRTVAVIGGISREDQGFRLRMGCEIVIATPGRLIDVLENRYLVLSRCTYVVLDEADRMIDMGFEPDVQKILEHMPVSNQKPDTDEAEDPEKMLANFESGKHKYRQTVMFTATMPPAVERLARSYLRRPAVVYIGSAGKPHERVEQKVFLMSESEKRKKLLAILEQGFDPPIIIFVNQKKGCDVLAKSLEKMGYNACTLHGGKGQEQREFALSNLKAGAKDILVATDVAGRGIDIQDVSMVVNYDMAKNIEDYIHRIGRTGRAGKSGVAITFLTKEDSAVFYELKQAILESPVSSCPPELANHPDAQHKPGTILTKKRREETIFA,820,NP_004809.2.csv,refseq-DDX23-NM_004818.2_clinical_seed_0_final,refseq-DDX23-NM_004818.2.a2m,Invitae,refseq-DDX23-NM_004818.2.npy,1,820,820
+NP_004811.1,MAGEVSAATGRFSLERLGLPGLALAAALLLLALCLLVRRTRRPGEPPLIKGWLPYLGVVLNLRKDPLRFMKTLQKQHGDTFTVLLGGKYITFILDPFQYQLVIKNHKQLSFRVFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQNLKQVFEPQLLKTTSWDTAELYPFCSSIIFEITFTTIYGKVIVCDNNKFISELRDDFLKFDDKFAYLVSNIPIELLGNVKSIREKIIKCFSSEKLAKMQGWSEVFQSRQDVLEKYYVHEDLEIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHLTREQLDSLICLESSIFEALRLSSYSTTIRFVEEDLTLSSETGDYCVRKGDLVAIFPPVLHGDPEIFEAPEEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALMEIKQLLVILLTYFDLEIIDDKPIGLNYSRLLFGIQYPDSDVLFRYKVKS,506,NP_004811.1.csv,refseq-CYP7B1-NM_004820.3_clinical_seed_0_final,refseq-CYP7B1-NM_004820.3.a2m,Invitae,refseq-CYP7B1-NM_004820.3.npy,1,506,506
+NP_004817.2,MEPPYSLTAHYDEFQEVKYVSRCGAGGARGASLPPGFPLGAARSATGARSGLPRWNRREVCLLSGLVFAAGLCAILAAMLALKYLGPVAAGGGACPEGCPERKAFARAARFLAANLDASIDPCQDFYSFACGGWLRRHAIPDDKLTYGTIAAIGEQNEERLRRLLARPGGGPGGAAQRKVRAFFRSCLDMREIERLGPRPMLEVIEDCGGWDLGGAEERPGVAARWDLNRLLYKAQGVYSAAALFSLTVSLDDRNSSRYVIRIDQDGLTLPERTLYLAQDEDSEKILAAYRVFMERVLSLLGADAVEQKAQEILQVEQQLANITVSEHDDLRRDVSSMYNKVTLGQLQKITPHLRWKWLLDQIFQEDFSEEEEVVLLATDYMQQVSQLIRSTPHRVLHNYLVWRVVVVLSEHLSPPFREALHELAQEMEGSDKPQELARVCLGQANRHFGMALGALFVHEHFSAASKAKVQQLVEDIKYILGQRLEELDWMDAETRAAARAKLQYMMVMVGYPDFLLKPDAVDKEYEFEVHEKTYFKNILNSIRFSIQLSVKKIRQEVDKSTWLLPPQALNAYYLPNKNQMVFPAGILQPTLYDPDFPQSLNYGGIGTIIGHELTHGYDDWGGQYDRSGNLLHWWTEASYSRFLRKAECIVRLYDNFTVYNQRVNGKHTLGENIADMGGLKLAYHAYQKWVREHGPEHPLPRLKYTHDQLFFIAFAQNWCIKRRSQSIYLQVLTDKHAPEHYRVLGSVSQFEEFGRAFHCPKDSPMNPAHKCSVW,775,NP_004817.2.csv,refseq-ECEL1-NM_004826.3_clinical_seed_0_final,refseq-ECEL1-NM_004826.3.a2m,Invitae,refseq-ECEL1-NM_004826.3.npy,1,775,775
+NP_004827.4,MERAISPGLLVRALLLLLLLLGLAARTVAAGRARGLPAPTAEAAFGLGAAAAPTSATRVPAAGAVAAAEVTVEDAEALPAAAGEQEPRGPEPDDETELRPRGRSLVIISTLDGRIAALDPENHGKKQWDLDVGSGSLVSSSLSKPEVFGNKMIIPSLDGALFQWDQDRESMETVPFTVESLLESSYKFGDDVVLVGGKSLTTYGLSAYSGKVRYICSALGCRQWDSDEMEQEEDILLLQRTQKTVRAVGPRSGNEKWNFSVGHFELRYIPDMETRAGFIESTFKPNENTEESKIISDVEEQEAAIMDIVIKVSVADWKVMAFSKKGGHLEWEYQFCTPIASAWLLKDGKVIPISLFDDTSYTSNDDVLEDEEDIVEAARGATENSVYLGMYRGQLYLQSSVRISEKFPSSPKALESVTNENAIIPLPTIKWKPLIHSPSRTPVLVGSDEFDKCLSNDKFSHEEYSNGALSILQYPYDNGYYLPYYKRERNKRSTQITVRFLDNPHYNKNIRKKDPVLLLHWWKEIVATILFCIIATTFIVRRLFHPHPHRQRKESETQCQTENKYDSVSGEANDSSWNDIKNSGYISRYLTDFEPIQCLGRGGFGVVFEAKNKVDDCNYAIKRIRLPNRELAREKVMREVKALAKLEHPGIVRYFNAWLEAPPEKWQEKMDEIWLKDESTDWPLSSPSPMDAPSVKIRRMDPFATKEHIEIIAPSPQRSRSFSVGISCDQTSSSESQFSPLEFSGMDHEDISESVDAAYNLQDSCLTDCDVEDGTMDGNDEGHSFELCPSEASPYVRSRERTSSSIVFEDSGCDNASSKEEPKTNRLHIGNHCANKLTAFKPTSSKSSSEATLSISPPRPTTLSLDLTKNTTEKLQPSSPKVYLYIQMQLCRKENLKDWMNGRCTIEERERSVCLHIFLQIAEAVEFLHSKGLMHRDLKPSNIFFTMDDVVKVGDFGLVTAMDQDEEEQTVLTPMPAYARHTGQVGTKLYMSPEQIHGNSYSHKVDIFSLGLILFELLYPFSTQMERVRTLTDVRNLKFPPLFTQKYPCEYVMVQDMLSPSPMERPEAINIIENAVFEDLDFPGKTVLRQRSRSLSSSGTKHSRQSNNSHSPLPSN,1116,NP_004827.4.csv,refseq-EIF2AK3-NM_004836.6_clinical_seed_0_final,refseq-EIF2AK3-NM_004836.6.a2m,Invitae,refseq-EIF2AK3-NM_004836.6.npy,1,1116,1116
+NP_004846.4,MRRPLSKCGMEPGGGDASLTLHGLQNRSHGKIKLRKRKSTLYFNTQEKSARRRGDLLGENIYLLLFTIALRILNCFLVQTSFVPDEYWQSLEVSHHMVFNYGYLTWEWTERLRSYTYPLIFASIYKILHLLGKDSVQLLIWIPRLAQALLSAVADVRLYSLMKQLENQEVARWVFFCQLCSWFTWYCCTRTLTNTMETVLTIIALFYYPLEGSKSMNSVKYSSLVALAFIIRPTAVILWTPLLFRHFCQEPRKLDLILHHFLPVGFVTLSLSLMIDRIFFGQWTLVQFNFLKFNVLQNWGTFYGSHPWHWYFSQGFPVILGTHLPFFIHGCYLAPKRYRILLVTVLWTLLVYSMLSHKEFRFIYPVLPFCMVFCGYSLTHLKTWKKPALSFLFLSNLFLALYTGLVHQRGTLDVMSHIQKVCYNNPNKSSASIFIMMPCHSTPYYSHVHCPLPMRFLQCPPDLTGKSHYLDEADVFYLNPLNWLHREFHDDASLPTHLITFSILEEEISAFLISSNYKRTAVFFHTHLPEGRIGSHIYVYERKLKGKFNMKMKF,554,NP_004846.4.csv,refseq-PIGB-NM_004855.4_clinical_seed_0_final,refseq-PIGB-NM_004855.4.a2m,Invitae,refseq-PIGB-NM_004855.4.npy,1,554,554
+NP_004853.2,MSVPGPYQAATGPSSAPSAPPSYEETVAVNSYYPTPPAPMPGPTTGLVTGPDGKGMNPPSYYTQPAPIPNNNPITVQTVYVQHPITFLDRPIQMCCPSCNKMIVSQLSYNAGALTWLSCGSLCLLGCIAGCCFIPFCVDALQDVDHYCPNCRALLGTYKRL,161,NP_004853.2.csv,refseq-LITAF-NM_004862.3_clinical_seed_0_final,refseq-LITAF-NM_004862.3.a2m,Invitae,refseq-LITAF-NM_004862.3.npy,1,161,161
+NP_004854.1,MRPEPGGCCCRRTVRANGCVANGEVRNGYVRSSAAAAAAAAAGQIHHVTQNGGLYKRPFNEAFEETPMLVAVLTYVGYGVLTLFGYLRDFLRYWRIEKCHHATEREEQKDFVSLYQDFENFYTRNLYMRIRDNWNRPICSVPGARVDIMERQSHDYNWSFKYTGNIIKGVINMGSYNYLGFARNTGSCQEAAAKVLEEYGAGVCSTRQEIGNLDKHEELEELVARFLGVEAAMAYGMGFATNSMNIPALVGKGCLILSDELNHASLVLGARLSGATIRIFKHNNMQSLEKLLKDAIVYGQPRTRRPWKKILILVEGIYSMEGSIVRLPEVIALKKKYKAYLYLDEAHSIGALGPTGRGVVEYFGLDPEDVDVMMGTFTKSFGASGGYIGGKKELIDYLRTHSHSAVYATSLSPPVVEQIITSMKCIMGQDGTSLGKECVQQLAENTRYFRRRLKEMGFIIYGNEDSPVVPLMLYMPAKIGAFGREMLKRNIGVVVVGFPATPIIESRARFCLSAAHTKEILDTALKEIDEVGDLLQLKYSRHRLVPLLDRPFDETTYEETED,562,NP_004854.1.csv,refseq-SPTLC2-NM_004863.3_clinical_seed_0_final,refseq-SPTLC2-NM_004863.3.a2m,Invitae,refseq-SPTLC2-NM_004863.3.npy,1,562,562
+NP_004861.2,MAAEADGPLKRLLVPILLPEKCYDQLFVQWDLLHVPCLKILLSKGLGLGIVAGSLLVKLPQVFKILGAKSAEGLSLQSVMLELVALTGTMVYSITNNFPFSSWGEALFLMLQTITICFLVMHYRGQTVKGVAFLACYGLVLLVLLSPLTPLTVVTLLQASNVPAVVVGRLLQAATNYHNGHTGQLSAITVFLLFGGSLARIFTSIQETGDPLMAGTFVVSSLCNGLIAAQLLFYWNAKPPHKQKKAQ,247,NP_004861.2.csv,refseq-MPDU1-NM_004870.3_clinical_seed_0_final,refseq-MPDU1-NM_004870.3.a2m,Invitae,refseq-MPDU1-NM_004870.3.npy,1,247,247
+NP_004886.3,MKMASTRCKLARYLEDLEDVDLKKFKMHLEDYPPQKGCIPLPRGQTEKADHVDLATLMIDFNGEEKAWAMAVWIFAAINRRDLYEKAKRDEPKWGSDNARVSNPTVICQEDSIEEEWMGLLEYLSRISICKMKKDYRKKYRKYVRSRFQCIEDRNARLGESVSLNKRYTRLRLIKEHRSQQEREQELLAIGKTKTCESPVSPIKMELLFDPDDEHSEPVHTVVFQGAAGIGKTILARKMMLDWASGTLYQDRFDYLFYIHCREVSLVTQRSLGDLIMSCCPDPNPPIHKIVRKPSRILFLMDGFDELQGAFDEHIGPLCTDWQKAERGDILLSSLIRKKLLPEASLLITTRPVALEKLQHLLDHPRHVEILGFSEAKRKEYFFKYFSDEAQARAAFSLIQENEVLFTMCFIPLVCWIVCTGLKQQMESGKSLAQTSKTTTAVYVFFLSSLLQPRGGSQEHGLCAHLWGLCSLAADGIWNQKILFEESDLRNHGLQKADVSAFLRMNLFQKEVDCEKFYSFIHMTFQEFFAAMYYLLEEEKEGRTNVPGSRLKLPSRDVTVLLENYGKFEKGYLIFVVRFLFGLVNQERTSYLEKKLSCKISQQIRLELLKWIEVKAKAKKLQIQPSQLELFYCLYEMQEEDFVQRAMDYFPKIEINLSTRMDHMVSSFCIENCHRVESLSLGFLHNMPKEEEEEEKEGRHLDMVQCVLPSSSHAACSHGLVNSHLTSSFCRGLFSVLSTSQSLTELDLSDNSLGDPGMRVLCETLQHPGCNIRRLWLGRCGLSHECCFDISLVLSSNQKLVELDLSDNALGDFGIRLLCVGLKHLLCNLKKLWLVSCCLTSACCQDLASVLSTSHSLTRLYVGENALGDSGVAILCEKAKNPQCNLQKLGLVNSGLTSVCCSALSSVLSTNQNLTHLYLRGNTLGDKGIKLLCEGLLHPDCKLQVLELDNCNLTSHCCWDLSTLLTSSQSLRKLSLGNNDLGDLGVMMFCEVLKQQSCLLQNLGLSEMYFNYETKSALETLQEEKPELTVVFEPSW,1036,NP_004886.3.csv,NLRP3_HUMAN_b07_clinical_seed_0_final,NLRP3_HUMAN_b07.a2m,EVE,NLRP3_HUMAN_b07_theta_0.2.npy,1,1036,1036
+NP_004888.2,MLRAPGCLLRTSVAPAAALAAALLSSLARCSLLEPRDPVASSLSPYFGTKTRYEDVNPVLLSGPEAPWRDPELLEGTCTPVQLVALIRHGTRYPTVKQIRKLRQLHGLLQARGSRDGGASSTGSRDLGAALADWPLWYADWMDGQLVEKGRQDMRQLALRLASLFPALFSRENYGRLRLITSSKHRCMDSSAAFLQGLWQHYHPGLPPPDVADMEFGPPTVNDKLMRFFDHCEKFLTEVEKNATALYHVEAFKTGPEMQNILKKVAATLQVPVNDLNADLIQVAFFTCSFDLAIKGVKSPWCDVFDIDDAKVLEYLNDLKQYWKRGYGYTINSRSSCTLFQDIFQHLDKAVEQKQRSQPISSPVILQFGHAETLLPLLSLMGYFKDKEPLTAYNYKKQMHRKFRSGLIVPYASNLIFVLYHCENAKTPKEQFRVQMLLNEKVLPLAYSQETVSFYEDLKNHYKDILQSCQTSEECELARANSTSDEL,487,NP_004888.2.csv,refseq-MINPP1-NM_004897.4_clinical_seed_0_final,refseq-MINPP1-NM_004897.4.a2m,Invitae,refseq-MINPP1-NM_004897.4.npy,1,487,487
+NP_004915.2,MVDYHAANQSYQYGPSSAGNGAGGGGSMGDYMAQEDDWDRDLLLDPAWEKQQRKTFTAWCNSHLRKAGTQIENIDEDFRDGLKLMLLLEVISGERLPKPERGKMRVHKINNVNKALDFIASKGVKLVSIGAEEIVDGNAKMTLGMIWTIILRFAIQDISVEETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAFEVAEKYLDIPKMLDAEDIVNTARPDEKAIMTYVSSFYHAFSGAQKAETAANRICKVLAVNQENEHLMEDYEKLASDLLEWIRRTIPWLEDRVPQKTIQEMQQKLEDFRDYRRVHKPPKVQEKCQLEINFNTLQTKLRLSNRPAFMPSEGKMVSDINNGWQHLEQAEKGYEEWLLNEIRRLERLDHLAEKFRQKASIHEAWTDGKEAMLKHRDYETATLSDIKALIRKHEAFESDLAAHQDRVEQIAAIAQELNELDYYDSHNVNTRCQKICDQWDALGSLTHSRREALEKTEKQLEAIDQLHLEYAKRAAPFNNWMESAMEDLQDMFIVHTIEEIEGLISAHDQFKSTLPDADREREAILAIHKEAQRIAESNHIKLSGSNPYTTVTPQIINSKWEKVQQLVPKRDHALLEEQSKQQSNEHLRRQFASQANVVGPWIQTKMEEIGRISIEMNGTLEDQLSHLKQYERSIVDYKPNLDLLEQQHQLIQEALIFDNKHTNYTMEHIRVGWEQLLTTIARTINEVENQILTRDAKGISQEQMQEFRASFNHFDKDHGGALGPEEFKACLISLGYDVENDRQGEAEFNRIMSLVDPNHSGLVTFQAFIDFMSRETTDTDTADQVIASFKVLAGDKNFITAEELRRELPPDQAEYCIARMAPYQGPDAVPGALDYKSFSTALYGESDL,911,NP_004915.2.csv,refseq-ACTN4-NM_004924.5_clinical_seed_0_final,refseq-ACTN4-NM_004924.5.a2m,Invitae,refseq-ACTN4-NM_004924.5.npy,1,911,911
+NP_004919.1,MKLTRKMVLTRAKASELHSVRKLNCWGSRLTDISICQEMPSLEVITLSVNSISTLEPVSRCQRLSELYLRRNRIPSLAELFYLKGLPRLRVLWLAENPCCGTSPHRYRMTVLRTLPRLQKLDNQAVTEEELSRALSEGEEITAAPEREGTGHGGPKLCCTLSSLSSAAETGRDPLDSEEEATSGAQDERGLKPPSRGQFPSLSARDASSSHRGRNVLTAILLLLRELDAEGLEAVQQTVGSRLQALRGEEVQEHAE,256,NP_004919.1.csv,refseq-CFAP410-NM_004928.2_clinical_seed_0_final,refseq-CFAP410-NM_004928.2.a2m,Invitae,refseq-CFAP410-NM_004928.2.npy,1,256,256
+NP_004928.2,MIRNWLTIFILFPLKLVEKCESSVSLTVPPVVKLENGSSTNVSLTLRPPLNATLVITFEITFRSKNITILELPDEVVVPPGVTNSSFQVTSQNVGQLTVYLHGNHSNQTGPRIRFLVIRSSAISIINQVIGWIYFVAWSISFYPQVIMNWRRKSVIGLSFDFVALNLTGFVAYSVFNIGLLWVPYIKEQFLLKYPNGVNPVNSNDVFFSLHAVVLTLIIIVQCCLYERGGQRVSWPAIGFLVLAWLFAFVTMIVAAVGVTTWLQFLFCFSYIKLAVTLVKYFPQAYMNFYYKSTEGWSIGNVLLDFTGGSFSLLQMFLQSYNNDQWTLIFGDPTKFGLGVFSIVFDVVFFIQHFCLYRKRPGYDQLN,367,NP_004928.2.csv,refseq-CTNS-NM_004937.2_clinical_seed_0_final,refseq-CTNS-NM_004937.2.a2m,Invitae,refseq-CTNS-NM_004937.2.npy,1,367,367
+NP_004936.2,MGNRGMEELIPLVNKLQDAFSSIGQSCHLDLPQIAVVGGQSAGKSSVLENFVGRDFLPRGSGIVTRRPLILQLIFSKTEHAEFLHCKSKKFTDFDEVRQEIEAETDRVTGTNKGISPVPINLRVYSPHVLNLTLIDLPGITKVPVGDQPPDIEYQIKDMILQFISRESSLILAVTPANMDLANSDALKLAKEVDPQGLRTIGVITKLDLMDEGTDARDVLENKLLPLRRGYIGVVNRSQKDIEGKKDIRAALAAERKFFLSHPAYRHMADRMGTPHLQKTLNQQLTNHIRESLPALRSKLQSQLLSLEKEVEEYKNFRPDDPTRKTKALLQMVQQFGVDFEKRIEGSGDQVDTLELSGGARINRIFHERFPFELVKMEFDEKDLRREISYAIKNIHGVRTGLFTPDLAFEAIVKKQVVKLKEPCLKCVDLVIQELINTVRQCTSKLSSYPRLREETERIVTTYIREREGRTKDQILLLIDIEQSYINTNHEDFIGFANAQQRSTQLNKKRAIPNQVIRRGWLTINNISLMKGGSKEYWFVLTAESLSWYKDEEEKEKKYMLPLDNLKIRDVEKGFMSNKHVFAIFNTEQRNVYKDLRQIELACDSQEDVDSWKASFLRAGVYPEKDQAENEDGAQENTFSMDPQLERQVETIRNLVDSYVAIINKSIRDLMPKTIMHLMINNTKAFIHHELLAYLYSSADQSSLMEESADQAQRRDDMLRMYHALKEALNIIGDISTSTVSTPVPPPVDDTWLQSASSHSPTPQRRPVSSIHPPGRPPAVRGPTPGPPLIPVPVGAAASFSAPPIPSRPGPQSVFANSDLFPAPPQIPSRPVRIPPGIPPGVPSRRPPAAPSRPTIIRPAEPSLLD,866,NP_004936.2.csv,refseq-DNM2-NM_004945.3_clinical_seed_0_final,refseq-DNM2-NM_004945.3.a2m,Invitae,refseq-DNM2-NM_004945.3.npy,1,866,866
+NP_004937.1,MAPWRKADKERHGVAIYNFQGSGAPQLSLQIGDVVRIQETCGDWYRGYLIKHKMLQGIFPKSFIHIKEVTVEKRRNTENIIPAEIPLAQEVTTTLWEWGSIWKQLYVASKKERFLQVQSMMYDLMEWRSQLLSGTLPKDELKELKQKVTSKIDYGNKILELDLIVRDEDGNILDPDNTSVISLFHAHEEATDKITERIKEEMSKDQPDYAMYSRISSSPTHSLYVFVRNFVCRIGEDAELFMSLYDPNKQTVISENYLVRWGSRGFPKEIEMLNNLKVVFTDLGNKDLNRDKIYLICQIVRVGKMDLKDTGAKKCTQGLRRPFGVAVMDITDIIKGKAESDEEKQHFIPFHPVTAENDFLHSLLGKVIASKGDSGGQGLWVTMKMLVGDIIQIRKDYPHLVDRTTVVARKLGFPEIIMPGDVRNDIYITLLQGDFDKYNKTTQRNVEVIMCVCAEDGKTLPNAICVGAGDKPMNEYRSVVYYQVKQPRWMETVKVAVPIEDMQRIHLRFMFRHRSSLESKDKGEKNFAMSYVKLMKEDGTTLHDGFHDLVVLKGDSKKMEDASAYLTLPSYRHHVENKGATLSRSSSSVGGLSVSSRDVFSISTLVCSTKLTQNVGLLGLLKWRMKPQLLQENLEKLKIVDGEEVVKFLQDTLDALFNIMMEHSQSDEYDILVFDALIYIIGLIADRKFQHFNTVLEAYIQQHFSATLAYKKLMTVLKTYLDTSSRGEQCEPILRTLKALEYVFKFIVRSRTLFSQLYEGKEQMEFEESMRRLFESINNLMKSQYKTTILLQVAALKYIPSVLHDVEMVFDAKLLSQLLYEFYTCIPPVKLQKQKVQSMNEIVQSNLFKKQECRDILLPVITKELKELLEQKDDMQHQVLERKYCVELLNSILEVLSYQDAAFTYHHIQEIMVQLLRTVNRTVITMGRDHILISHFVACMTAILNQMGDQHYSFYIETFQTSSELVDFLMETFIMFKDLIGKNVYPGDWMAMSMVQNRVFLRAINKFAETMNQKFLEHTNFEFQLWNNYFHLAVAFITQDSLQLEQFSHAKYNKILNKYGDMRRLIGFSIRDMWYKLGQNKICFIPGMVGPILEMTLIPEAELRKATIPIFFDMMLCEYQRSGDFKKFENEIILKLDHEVEGGRGDEQYMQLLESILMECAAEHPTIAKSVENFVNLVKGLLEKLLDYRGVMTDESKDNRMSCTVNLLNFYKDNNREEMYIRYLYKLRDLHLDCDNYTEAAYTLLLHTWLLKWSDEQCASQVMQTGQQHPQTHRQLKETLYETIIGYFDKGKMWEEAISLCKELAEQYEMEIFDYELLSQNLIQQAKFYESIMKILRPKPDYFAVGYYGQGFPSFLRNKVFIYRGKEYERREDFQMQLMTQFPNAEKMNTTSAPGDDVKNAPGQYIQCFTVQPVLDEHPRFKNKPVPDQIINFYKSNYVQRFHYSRPVRRGTVDPENEFASMWIERTSFVTAYKLPGILRWFEVVHMSQTTISPLENAIETMSTANEKILMMINQYQSDETLPINPLSMLLNGIVDPAVMGGFAKYEKAFFTEEYVRDHPEDQDKLTHLKDLIAWQIPFLGAGIKIHEKRVSDNLRPFHDRMEECFKNLKMKVEKEYGVREMPDFDDRRVGRPRSMLRSYRQMSIISLASMNSDCSTPSKPTSESFDLELASPKTPRVEQEEPISPGSTLPEVKLRRSKKRTKRSSVVFADEKAAAESDLKRLSRKHEFMSDTNLSEHAAIPLKASVLSQMSFASQSMPTIPALALSVAGIPGLDEANTSPRLSQTFLQLSDGDKKTLTRKKVNQFFKTMLASKSAEEGKQIPDSLSTDL,1830,NP_004937.1.csv,refseq-DOCK2-NM_004946.2_clinical_seed_0_final,refseq-DOCK2-NM_004946.2.a2m,Invitae,refseq-DOCK2-NM_004946.2.npy,1,1830,1830
+NP_004950.2,MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKAQIRANGFKLETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGLAAGPPAGPLGDFGAPALPMAVPGAHGPLAGYLYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPFSGGPNVPELILQLLQLEPDEDQVRARILGCLQEPTKSRPDQPAAFGLLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLALQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQT,461,NP_004950.2.csv,refseq-NR5A1-NM_004959.4_clinical_seed_0_final,refseq-NR5A1-NM_004959.4.a2m,Invitae,refseq-NR5A1-NM_004959.4.npy,1,461,461
+NP_004951.1,MASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGYDRGGYRGRGGDRGGFRGGRGGGDRGGFGPGKMDSRGEHRQDRRERPY,526,NP_004951.1.csv,refseq-FUS-NM_004960.3_clinical_seed_0_final,refseq-FUS-NM_004960.3.a2m,Invitae,refseq-FUS-NM_004960.3.npy,1,526,526
+NP_004954.2,MKTLLLDLALWSLLFQPGWLSFSSQVSQNCHNGSYEISVLMMGNSAFAEPLKNLEDAVNEGLEIVRGRLQNAGLNVTVNATFMYSDGLIHNSGDCRSSTCEGLDLLRKISNAQRMGCVLIGPSCTYSTFQMYLDTELSYPMISAGSFGLSCDYKETLTRLMSPARKLMYFLVNFWKTNDLPFKTYSWSTSYVYKNGTETEDCFWYLNALEASVSYFSHELGFKVVLRQDKEFQDILMDHNRKSNVIIMCGGPEFLYKLKGDRAVAEDIVIILVDLFNDQYFEDNVTAPDYMKNVLVLTLSPGNSLLNSSFSRNLSPTKRDFALAYLNGILLFGHMLKIFLENGENITTPKFAHAFRNLTFEGYDGPVTLDDWGDVDSTMVLLYTSVDTKKYKVLLTYDTHVNKTYPVDMSPTFTWKNSKLPNDITGRGPQILMIAVFTLTGAVVLLLLVALLMLRKYRKDYELRQKKWSHIPPENIFPLETNETNHVSLKIDDDKRRDTIQRLRQCKYDKKRVILKDLKHNDGNFTEKQKIELNKLLQIDYYNLTKFYGTVKLDTMIFGVIEYCERGSLREVLNDTISYPDGTFMDWEFKISVLYDIAKGMSYLHSSKTEVHGRLKSTNCVVDSRMVVKITDFGCNSILPPKKDLWTAPEHLRQANISQKGDVYSYGIIAQEIILRKETFYTLSCRDRNEKIFRVENSNGMKPFRPDLFLETAEEKELEVYLLVKNCWEEDPEKRPDFKKIETTLAKIFGLFHDQKNESYMDTLIRRLQLYSRNLEHLVEERTQLYKAERDRADRLNFMLLPRLVVKSLKEKGFVEPELYEEVTIYFSDIVGFTTICKYSTPMEVVDMLNDIYKSFDHIVDHHDVYKVETIGDAYMVASGLPKRNGNRHAIDIAKMALEILSFMGTFELEHLPGLPIWIRIGVHSGPCAAGVVGIKMPRYCLFGDTVNTASRMESTGLPLRIHVSGSTIAILKRTECQFLYEVRGETYLKGRGNETTYWLTGMKDQKFNLPTPPTVENQQRLQAEFSDMIANSLQKRQAAGIRSQKPRRVASYKKGTLEYLQLNTTDKESTYF,1073,NP_004954.2.csv,refseq-GUCY2C-NM_004963.3_clinical_seed_0_final,refseq-GUCY2C-NM_004963.3.a2m,Invitae,refseq-GUCY2C-NM_004963.3.npy,1,1073,1073
+NP_004961.1,MALRKGGLALALLLLSWVALGPRSLEGADPGTPGEAEGPACPAACVCSYDDDADELSVFCSSRNLTRLPDGVPGGTQALWLDGNNLSSVPPAAFQNLSSLGFLNLQGGQLGSLEPQALLGLENLCHLHLERNQLRSLALGTFAHTPALASLGLSNNRLSRLEDGLFEGLGSLWDLNLGWNSLAVLPDAAFRGLGSLRELVLAGNRLAYLQPALFSGLAELRELDLSRNALRAIKANVFVQLPRLQKLYLDRNLIAAVAPGAFLGLKALRWLDLSHNRVAGLLEDTFPGLLGLRVLRLSHNAIASLRPRTFKDLHFLEELQLGHNRIRQLAERSFEGLGQLEVLTLDHNQLQEVKAGAFLGLTNVAVMNLSGNCLRNLPEQVFRGLGKLHSLHLEGSCLGRIRPHTFTGLSGLRRLFLKDNGLVGIEEQSLWGLAELLELDLTSNQLTHLPHRLFQGLGKLEYLLLSRNRLAELPADALGPLQRAFWLDVSHNRLEALPNSLLAPLGRLRYLSLRNNSLRTFTPQPPGLERLWLEGNPWDCGCPLKALRDFALQNPSAVPRFVQAICEGDDCQPPAYTYNNITCASPPEVVGLDLRDLSEAHFAPC,605,NP_004961.1.csv,refseq-IGFALS-NM_004970.2_clinical_seed_0_final,refseq-IGFALS-NM_004970.2.a2m,Invitae,refseq-IGFALS-NM_004970.2.npy,1,605,605
+NP_004963.1,MGMACLTMTEMEGTSTSSIYQNGDISGNANSMKQIDPVLQVYLYHSLGKSEADYLTFPSGEYVAEEICIAASKACGITPVYHNMFALMSETERIWYPPNHVFHIDESTRHNVLYRIRFYFPRWYCSGSNRAYRHGISRGAEAPLLDDFVMSYLFAQWRHDFVHGWIKVPVTHETQEECLGMAVLDMMRIAKENDQTPLAIYNSISYKTFLPKCIRAKIQDYHILTRKRIRYRFRRFIQQFSQCKATARNLKLKYLINLETLQSAFYTEKFEVKEPGSGPSGEEIFATIIITGNGGIQWSRGKHKESETLTEQDLQLYCDFPNIIDVSIKQANQEGSNESRVVTIHKQDGKNLEIELSSLREALSFVSLIDGYYRLTADAHHYLCKEVAPPAVLENIQSNCHGPISMDFAISKLKKAGNQTGLYVLRCSPKDFNKYFLTFAVERENVIEYKHCLITKNENEEYNLSGTKKNFSSLKDLLNCYQMETVRSDNIIFQFTKCCPPKPKDKSNLLVFRTNGVSDVPTSPTLQRPTHMNQMVFHKIRNEDLIFNESLGQGTFTKIFKGVRREVGDYGQLHETEVLLKVLDKAHRNYSESFFEAASMMSKLSHKHLVLNYGVCVCGDENILVQEFVKFGSLDTYLKKNKNCINILWKLEVAKQLAWAMHFLEENTLIHGNVCAKNILLIREEDRKTGNPPFIKLSDPGISITVLPKDILQERIPWVPPECIENPKNLNLATDKWSFGTTLWEICSGGDKPLSALDSQRKLQFYEDRHQLPAPKWAELANLINNCMDYEPDFRPSFRAIIRDLNSLFTPDYELLTENDMLPNMRIGALGFSGAFEDRDPTQFEERHLKFLQQLGKGNFGSVEMCRYDPLQDNTGEVVAVKKLQHSTEEHLRDFEREIEILKSLQHDNIVKYKGVCYSAGRRNLKLIMEYLPYGSLRDYLQKHKERIDHIKLLQYTSQICKGMEYLGTKRYIHRDLATRNILVENENRVKIGDFGLTKVLPQDKEYYKVKEPGESPIFWYAPESLTESKFSVASDVWSFGVVLYELFTYIEKSKSPPAEFMRMIGNDKQGQMIVFHLIELLKNNGRLPRPDGCPDEIYMIMTECWNNNVNQRPSFRDLALRVDQIRDNMAG,1132,NP_004963.1.csv,refseq-JAK2-NM_004972.3_clinical_seed_0_final,refseq-JAK2-NM_004972.3.a2m,Invitae,refseq-JAK2-NM_004972.3.npy,1,1132,1132
+NP_004964.2,MSKERPKRNIIQKKYDDSDGIPWSEERVVRKVLYLSLKEFKNSQKRQHAEGIAGSLKTVNGLLGNDQSKGLGPASEQSENEKDDASQVSSTSNDVSSSDFEEGPSRKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTFLCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKHVHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPSTGSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSKTVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLSLGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQAEKPQSPPKKMKGAAGPAEGPGKKAPAERGLLNGHVKKEVPERSLERNRPKRATAGKSTPGRQAHGKADSASCENRSTSQPESVHKPQDSGKAEKGGGKAGWAAMDEIPVLRPSAKEFHDPLIYIESVRAQVEKFGMCRVIPPPDWRPECKLNDEMRFVTQIQHIHKLGRRWGPNVQRLACIKKHLKSQGITMDELPLIGGCELDLACFFRLINEMGGMQQVTDLKKWNKLADMLRIPRTAQDRLAKLQEAYCQYLLSYDSLSPEEHRRLEKEVLMEKEILEKRKGPLEGHTENDHHKFHPLPRFEPKNGLIHGVAPRNGFRSKLKEVGQAQLKTGRRRLFAQEKEVVKEEEEDKGVLNDFHKCIYKGRSVSLTTFYRTARNIMSMCFSKEPAPAEIEQEYWRLVEEKDCHVAVHCGKVDTNTHGSGFPVGKSEPFSRHGWNLTVLPNNTGSILRHLGAVPGVTIPWLNIGMVFSTSCWSRDQNHLPYIDYLHTGADCIWYCIPAEEENKLEDVVHTLLQANGTPGLQMLESNVMISPEVLCKEGIKVHRTVQQSGQFVVCFPGSFVSKVCCGYSVSETVHFATTQWTSMGFETAKEMKRRHIAKPFSMEKLLYQIAQAEAKKENGPTLSTISALLDELRDTELRQRRQLFEAGLHSSARYGSHDGSSTVADGKKKPRKWLQLETSERRCQICQHLCYLSMVVQENENVVFCLECALRHVEKQKSCRGLKLMYRYDEEQIISLVNQICGKVSGKNGSIENCLSKPTPKRGPRKRATVDVPPSRLSASSSSKSASSSS,1246,NP_004964.2.csv,refseq-JARID2-NM_004973.3_clinical_seed_0_final,refseq-JARID2-NM_004973.3.a2m,Invitae,refseq-JARID2-NM_004973.3.npy,1,1246,1246
+NP_004965.1,MTVATGDPADEAAALPGHPQDTYDPEADHECCERVVINISGLRFETQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDIFSEEIRFYELGEEAMEMFREDEGYIKEEERPLPENEFQRQVWLLFEYPESSGPARIIAIVSVMVILISIVSFCLETLPIFRDENEDMHGSGVTFHTYSNSTIGYQQSTSFTDPFFIVETLCIIWFSFEFLVRFFACPSKAGFFTNIMNIIDIVAIIPYFITLGTELAEKPEDAQQGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEADERESQFPSIPDAFWWAVVSMTTVGYGDMVPTTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQYLQVTSCPKIPSSPDLKKSRSASTISKSDYMEIQEGVNNSNEDFREENLKTANCTLANTNYVNITKMLTDV,499,NP_004965.1.csv,refseq-KCNA2-NM_004974.3_clinical_seed_0_final,refseq-KCNA2-NM_004974.3.a2m,Invitae,refseq-KCNA2-NM_004974.3.npy,1,499,499
+NP_004966.1,MPAGMTKHGSRSTSSLPPEPMEIVRSKACSRRVRLNVGGLAHEVLWRTLDRLPRTRLGKLRDCNTHDSLLEVCDDYSLDDNEYFFDRHPGAFTSILNFYRTGRLHMMEEMCALSFSQELDYWGIDEIYLESCCQARYHQKKEQMNEELKREAETLREREGEEFDNTCCAEKRKKLWDLLEKPNSSVAAKILAIISIMFIVLSTIALSLNTLPELQSLDEFGQSTDNPQLAHVEAVCIAWFTMEYLLRFLSSPKKWKFFKGPLNAIDLLAILPYYVTIFLTESNKSVLQFQNVRRVVQIFRIMRILRILKLARHSTGLQSLGFTLRRSYNELGLLILFLAMGIMIFSSLVFFAEKDEDDTKFKSIPASFWWATITMTTVGYGDIYPKTLLGKIVGGLCCIAGVLVIALPIPIIVNNFSEFYKEQKRQEKAIKRREALERAKRNGSIVSMNMKDAFARSIEMMDIVVEKNGENMGKKDKVQDNHLSPNKWKWTKRTLSETSSSKSFETKEQGSPEKARSSSSPQHLNVQQLEDMYNKMAKTQSQPILNTKESAAQSKPKEELEMESIPSPVAPLPTRTEGVIDMRSMSSIDSFISCATDFPEATRFSHSPLTSLPSKTGGSTAPEVGWRGALGASGGRFVEANPSPDASQHSSFFIESPKSSMKTNNPLKLRALKVNFMEGDPSPLLPVLGMYHDPLRNRGSAAAAVAGLECATLLDKAVLSPESSIYTTASAKTPPRSPEKHTAIAFNFEAGVHQYIDADTDDEGQLLYSVDSSPPKSLPGSTSPKFSTGTRSEKNHFESSPLPTSPKFLRQNCIYSTEALTGKGPSGQEKCKLENHISPDVRVLPGGGAHGSTRDQSI,858,NP_004966.1.csv,refseq-KCNB1-NM_004975.2_clinical_seed_0_final,refseq-KCNB1-NM_004975.2.a2m,Invitae,refseq-KCNB1-NM_004975.2.npy,1,858,858
+NP_004968.2,MLSSVCVSSFRGRQGASKQQPAPPPQPPESPPPPPLPPQQQQPAQPGPAASPAGPPAPRGPGDRRAEPCPGLPAAAMGRHGGGGGDSGKIVINVGGVRHETYRSTLRTLPGTRLAGLTEPEAAARFDYDPGADEFFFDRHPGVFAYVLNYYRTGKLHCPADVCGPLFEEELGFWGIDETDVEACCWMTYRQHRDAEEALDSFEAPDPAGAANAANAAGAHDGGLDDEAGAGGGGLDGAGGELKRLCFQDAGGGAGGPPGGAGGAGGTWWRRWQPRVWALFEDPYSSRAARYVAFASLFFILISITTFCLETHEGFIHISNKTVTQASPIPGAPPENITNVEVETEPFLTYVEGVCVVWFTFEFLMRITFCPDKVEFLKSSLNIIDCVAILPFYLEVGLSGLSSKAAKDVLGFLRVVRFVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALGVLIFATMIYYAERIGADPDDILGSNHTYFKNIPIGFWWAVVTMTTLGYGDMYPKTWSGMLVGALCALAGVLTIAMPVPVIVNNFGMYYSLAMAKQKLPKKKNKHIPRPPQPGSPNYCKPDPPPPPPPHPHHGSGGISPPPPITPPSMGVTVAGAYPAGPHTHPGLLRGGAGGLGIMGLPPLPAPGEPCPLAQEEVIEINRADPRPNGDPAAAALAHEDCPAIDQPAMSPEDKSPITPGSRGRYSRDRACFLLTDYAPSPDGSIRKATGAPPLPPQDWRKPGPPSFLPDLNANAAAWISP,757,NP_004968.2.csv,refseq-KCNC3-NM_004977.2_clinical_seed_0_final,refseq-KCNC3-NM_004977.2.a2m,Invitae,refseq-KCNC3-NM_004977.2.npy,1,757,757
+NP_004971.2,MAAGVAAWLPFARAAAIGWMPVANCPMPLAPADKNKRQDELIVLNVSGRRFQTWRTTLERYPDTLLGSTEKEFFFNEDTKEYFFDRDPEVFRCVLNFYRTGKLHYPRYECISAYDDELAFYGILPEIIGDCCYEEYKDRKRENAERLMDDNDSENNQESMPSLSFRQTMWRAFENPHTSTLALVFYYVTGFFIAVSVITNVVETVPCGTVPGSKELPCGERYSVAFFCLDTACVMIFTVEYLLRLFAAPSRYRFIRSVMSIIDVVAIMPYYIGLVMTNNEDVSGAFVTLRVFRVFRIFKFSRHSQGLRILGYTLKSCASELGFLLFSLTMAIIIFATVMFYAEKGSSASKFTSIPASFWYTIVTMTTLGYGDMVPKTIAGKIFGSICSLSGVLVIALPVPVIVSNFSRIYHQNQRADKRRAQKKARLARIRVAKTGSSNAYLHSKRNGLLNEALELTGTPEEEHMGKTTSLIESQHHHLLHCLEKTTGLSYLVDDPLLSVRTSTIKNHEFIDEQMFEQNCMESSMQNYPSTRSPSLSSHPGLTTTCCSRRSKKTTHLPNSNLPATRLRSMQELSTIHIQGSEQPSLTTSRSSLNLKADDGLRPNCKTSQITTAIISIPTPPALTPEGESRPPPASPGPNTNIPSIASNVVKVSAL,655,NP_004971.2.csv,refseq-KCND3-NM_004980.4_clinical_seed_0_final,refseq-KCND3-NM_004980.4.a2m,Invitae,refseq-KCND3-NM_004980.4.npy,1,655,655
+NP_004973.1,MLARKSIIPEEYVLARIAAENLRKPRIRDRLPKARFIAKSGACNLAHKNIREQGRFLQDIFTTLVDLKWRHTLVIFTMSFLCSWLLFAIMWWLVAFAHGDIYAYMEKSGMEKSGLESTVCVTNVRSFTSAFLFSIEVQVTIGFGGRMMTEECPLAITVLILQNIVGLIINAVMLGCIFMKTAQAHRRAETLIFSRHAVIAVRNGKLCFMFRVGDLRKSMIISASVRIQVVKKTTTPEGEVVPIHQLDIPVDNPIESNNIFLVAPLIICHVIDKRSPLYDISATDLANQDLEVIVILEGVVETTGITTQARTSYIAEEIQWGHRFVSIVTEEEGVYSVDYSKFGNTVKVAAPRCSARELDEKPSILIQTLQKSELSHQNSLRKRNSMRRNNSMRRNNSIRRNNSSLMVPKVQFMTPEGNQNTSES,424,NP_004973.1.csv,refseq-KCNJ8-NM_004982.3_clinical_seed_0_final,refseq-KCNJ8-NM_004982.3.a2m,Invitae,refseq-KCNJ8-NM_004982.3.npy,1,424,424
+NP_004975.2,MAETNNECSIKVLCRFRPLNQAEILRGDKFIPIFQGDDSVVIGGKPYVFDRVFPPNTTQEQVYHACAMQIVKDVLAGYNGTIFAYGQTSSGKTHTMEGKLHDPQLMGIIPRIARDIFNHIYSMDENLEFHIKVSYFEIYLDKIRDLLDVTKTNLSVHEDKNRVPFVKGCTERFVSSPEEILDVIDEGKSNRHVAVTNMNEHSSRSHSIFLINIKQENMETEQKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGTKSYVPYRDSKMTRILQDSLGGNCRTTMFICCSPSSYNDAETKSTLMFGQRAKTIKNTASVNLELTAEQWKKKYEKEKEKTKAQKETIAKLEAELSRWRNGENVPETERLAGEEAALGAELCEETPVNDNSSIVVRIAPEERQKYEEEIRRLYKQLDDKDDEINQQSQLIEKLKQQMLDQEELLVSTRGDNEKVQRELSHLQSENDAAKDEVKEVLQALEELAVNYDQKSQEVEEKSQQNQLLVDELSQKVATMLSLESELQRLQEVSGHQRKRIAEVLNGLMKDLSEFSVIVGNGEIKLPVEISGAIEEEFTVARLYISKIKSEVKSVVKRCRQLENLQVECHRKMEVTGRELSSCQLLISQHEAKIRSLTEYMQSVELKKRHLEESYDSLSDELAKLQAQETVHEVALKDKEPDTQDADEVKKALELQMESHREAHHRQLARLRDEINEKQKTIDELKDLNQKLQLELEKLQADYEKLKSEEHEKSTKLQELTFLYERHEQSKQDLKGLEETVARELQTLHNLRKLFVQDVTTRVKKSAEMEPEDSGGIHSQKQKISFLENNLEQLTKVHKQLVRDNADLRCELPKLEKRLRATAERVKALEGALKEAKEGAMKDKRRYQQEVDRIKEAVRYKSSGKRGHSAQIAKPVRPGHYPASSPTNPYGTRSPECISYTNSLFQNYQNLYLQATPSSTSDMYFANSCTSSGATSSGGPLASYQKANMDNGNATDINDNRSDLPCGYEAEDQAKLFPLHQETAAS,1032,NP_004975.2.csv,refseq-KIF5A-NM_004984.2_clinical_seed_0_final,refseq-KIF5A-NM_004984.2.a2m,Invitae,refseq-KIF5A-NM_004984.2.npy,1,1032,1032
+NP_004981.2,MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCVVPFLTRPKVPVLQLDSGNYLFSTSAICRYFFLLSGWEQDDLTNQWLEWEATELQPALSAALYYLVVQGKKGEDVLGSVRRALTHIDHSLSRQNCPFLAGETESLADIVLWGALYPLLQDPAYLPEELSALHSWFQTLSTQEPCQRAAETVLKQQGVLALRPYLQKQPQPSPAEGRAVTNEPEEEELATLSEEEIAMAVTAWEKGLESLPPLRPQQNPVLPVAGERNVLITSALPYVNNVPHLGNIIGCVLSADVFARYSRLRQWNTLYLCGTDEYGTATETKALEEGLTPQEICDKYHIIHADIYRWFNISFDIFGRTTTPQQTKITQDIFQQLLKRGFVLQDTVEQLRCEHCARFLADRFVEGVCPFCGYEEARGDQCDKCGKLINAVELKKPQCKVCRSCPVVQSSQHLFLDLPKLEKRLEEWLGRTLPGSDWTPNAQFITRSWLRDGLKPRCITRDLKWGTPVPLEGFEDKVFYVWFDATIGYLSITANYTDQWERWWKNPEQVDLYQFMAKDNVPFHSLVFPCSALGAEDNYTLVSHLIATEYLNYEDGKFSKSRGVGVFGDMAQDTGIPADIWRFYLLYIRPEGQDSAFSWTDLLLKNNSELLNNLGNFINRAGMFVSKFFGGYVPEMVLTPDDQRLLAHVTLELQHYHQLLEKVRIRDALRSILTISRHGNQYIQVNEPWKRIKGSEADRQRAGTVTGLAVNIAALLSVMLQPYMPTVSATIQAQLQLPPPACSILLTNFLCTLPAGHQIGTVSPLFQKLENDQIESLRQRFGGGQAKTSPKPAVVETVTTAKPQQIQALMDEVTKQGNIVRELKAQKADKNEVAAEVAKLLDLKKQLAVAEGKPPEAPKGKKKK,900,NP_004981.2.csv,refseq-MARS-NM_004990.3_clinical_seed_0_final,refseq-MARS-NM_004990.3.a2m,Invitae,refseq-MARS-NM_004990.3.npy,1,900,900
+NP_004985.2,MSLWQPLVLVLLVLGCCFAAPRQRQSTLVLFPGDLRTNLTDRQLAEEYLYRYGYTRVAEMRGESKSLGPALLLLQKQLSLPETGELDSATLKAMRTPRCGVPDLGRFQTFEGDLKWHHHNITYWIQNYSEDLPRAVIDDAFARAFALWSAVTPLTFTRVYSRDADIVIQFGVAEHGDGYPFDGKDGLLAHAFPPGPGIQGDAHFDDDELWSLGKGVVVPTRFGNADGAACHFPFIFEGRSYSACTTDGRSDGLPWCSTTANYDTDDRFGFCPSERLYTQDGNADGKPCQFPFIFQGQSYSACTTDGRSDGYRWCATTANYDRDKLFGFCPTRADSTVMGGNSAGELCVFPFTFLGKEYSTCTSEGRGDGRLWCATTSNFDSDKKWGFCPDQGYSLFLVAAHEFGHALGLDHSSVPEALMYPMYRFTEGPPLHKDDVNGIRHLYGPRPEPEPRPPTTTTPQPTAPPTVCPTGPPTVHPSERPTAGPTGPPSAGPTGPPTAGPSTATTVPLSPVDDACNVNIFDAIAEIGNQLYLFKDGKYWRFSEGRGSRPQGPFLIADKWPALPRKLDSVFEERLSKKLFFFSGRQVWVYTGASVLGPRRLDKLGLGADVAQVTGALRSGRGKMLLFSGRRLWRFDVKAQMVDPRSASEVDRMFPGVPLDTHDVFQYREKAYFCQDRFYWRVSSRSELNQVDQVGYVTYDILQCPED,707,NP_004985.2.csv,refseq-MMP9-NM_004994.2_clinical_seed_0_final,refseq-MMP9-NM_004994.2.a2m,Invitae,refseq-MMP9-NM_004994.2.npy,1,707,707
+NP_004989.2,MGSKGVYQYHWQSHNVKHSGVDDMVLLSKITENSIVENLKKRYMDDYIFTYIGSVLISVNPFKQMPYFGEKEIEMYQGAAQYENPPHIYALADNMYRNMIIDRENQCVIISGESGAGKTVAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAKTVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEKSRVVMRNPGERSFHIFYQLIEGASAEQKHSLGITSMDYYYYLSLSGSYKVDDIDDRREFQETLHAMNVIGIFAEEQTLVLQIVAGILHLGNISFKEVGNYAAVESEEFLAFPAYLLGINQDRLKEKLTSRQMDSKWGGKSESIHVTLNVEQACYTRDALAKALHARVFDFLVDSINKAMEKDHEEYNIGVLDIYGFEIFQKNGFEQFCINFVNEKLQQIFIELTLKAEQEEYVQEGIRWTPIEYFNNKIVCDLIENKVNPPGIMSILDDVCATMHAVGEGADQTLLQKLQMQIGSHEHFNSWNQGFIIHHYAGKVSYDMDGFCERNRDVLFMDLIELMQSSELPFIKSLFPENLQADKKGRPTTAGSKIKKQANDLVSTLMKCTPHYIRCIKPNETKKPRDWEESRVKHQVEYLGLKENIRVRRAGYAYRRIFQKFLQRYAILTKATWPSWQGEEKQGVLHLLQSVNMDSDQFQLGRSKVFIKAPESLFLLEEMRERKYDGYARVIQKSWRKFVARKKYVQMREEASDLLLNKKERRRNSINRNFIGDYIGMEEHPELQQFVGKREKIDFADTVTKYDRRFKGVKRDLLLTPKCLYLIGREKVKQGPDKGLVKEVLKRKIEIERILSVSLSTMQDDIFILHEQEYDSLLESVFKTEFLSLLAKRYEEKTQKQLPLKFSNTLELKLKKENWGPWSAGGSRQVQFHQGFGDLAVLKPSNKVLQVSIGPGLPKNSRPTRRNTTQNTGYSSGTQNANYPVRAAPPPPGYHQNGVIRNQYVPYPHAPGSQRSNQKSLYTSMARPPLPRQQSTSSDRVSQTPESLDFLKVPDQGAAGVRRQTTSRPPPAGGRPKPQPKPKPQVPQCKALYAYDAQDTDELSFNANDIIDIIKEDPSGWWTGRLRGKQGLFPNNYVTKI,1108,NP_004989.2.csv,refseq-MYO1E-NM_004998.3_clinical_seed_0_final,refseq-MYO1E-NM_004998.3.a2m,Invitae,refseq-MYO1E-NM_004998.3.npy,1,1108,1108
+NP_004990.3,MEDGKPVWAPHPTDGFQMGNIVDIGPDSLTIEPLNQKGKTFLALINQVFPAEEDSKKDVEDNCSLMYLNEATLLHNIKVRYSKDRIYTYVANILIAVNPYFDIPKIYSSEAIKSYQGKSLGTRPPHVFAIADKAFRDMKVLKMSQSIIVSGESGAGKTENTKFVLRYLTESYGTGQDIDDRIVEANPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKSSVVGGFVSHYLLEKSRICVQGKEERNYHIFYRLCAGASEDIREKLHLSSPDNFRYLNRGCTRYFANKETDKQILQNRKSPEYLKAGSMKDPLLDDHGDFIRMCTAMKKIGLDDEEKLDLFRVVAGVLHLGNIDFEEAGSTSGGCNLKNKSAQSLEYCAELLGLDQDDLRVSLTTRVMLTTAGGTKGTVIKVPLKVEQANNARDALAKTVYSHLFDHVVNRVNQCFPFETSSYFIGVLDIAGFEYFEHNSFEQFCINYCNEKLQQFFNERILKEEQELYQKEGLGVNEVHYVDNQDCIDLIEAKLVGILDILDEENRLPQPSDQHFTSAVHQKHKDHFRLTIPRKSKLAVHRNIRDDEGFIIRHFAGAVCYETTQFVEKNNDALHMSLESLICESRDKFIRELFESSTNNNKDTKQKAGKLSFISVGNKFKTQLNLLLDKLRSTGASFIRCIKPNLKMTSHHFEGAQILSQLQCSGMVSVLDLMQGGYPSRASFHELYNMYKKYMPDKLARLDPRLFCKALFKALGLNENDYKFGLTKVFFRPGKFAEFDQIMKSDPDHLAELVKRVNHWLTCSRWKKVQWCSLSVIKLKNKIKYRAEACIKMQKTIRMWLCKRRHKPRIDGLVKVGTLKKRLDKFNEVVSVLKDGKPEMNKQIKNLEISIDTLMAKIKSTMMTQEQIQKEYDALVKSSEELLSALQKKKQQEEEAERLRRIQEEMEKERKRREEDEKRRRKEEEERRMKLEMEAKRKQEEEERKKREDDEKRIQAEVEAQLARQKEEESQQQAVLEQERRDRELALRIAQSEAELISDEAQADLALRRNDGTRPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDTINTSCDIELLAACREEFHRRLKVYHAWKSKNKKRNTETEQRAPKSVTDYDFAPFLNNSPQQNPAAQIPARQREIEMNRQQRFFRIPFIRPADQYKDPQSKKKGWWYAHFDGPWIARQMELHPDKPPILLVAGKDDMEMCELNLEETGLTRKRGAEILPRQFEEIWERCGGIQYLQNAIESRQARPTYATAMLQSLLK,1285,NP_004990.3.csv,refseq-MYO6-NM_004999.3_clinical_seed_0_final,refseq-MYO6-NM_004999.3.a2m,Invitae,refseq-MYO6-NM_004999.3.npy,1,1285,1285
+NP_004993.1,MAAAAQSRVVRVLSMSRSAITAIATSVCHGPPCRQLHHALMPHGKGGRSSVSGIVATVFGATGFLGRYVVNHLGRMGSQVIIPYRCDKYDIMHLRPMGDLGQLLFLEWDARDKDSIRRVVQHSNVVINLIGRDWETKNFDFEDVFVKIPQAIAQLSKEAGVEKFIHVSHLNANIKSSSRYLRNKAVGEKVVRDAFPEAIIVKPSDIFGREDRFLNSFASMHRFGPIPLGSLGWKTVKQPVYVVDVSKGIVNAVKDPDANGKSFAFVGPSRYLLFHLVKYIFAVAHRLFLPFPLPLFAYRWVARVFEISPFEPWITRDKVERMHITDMKLPHLPGLEDLGIQATPLELKAIEVLRRHRTYRWLSAEIEDVKPAKTVNI,377,NP_004993.1.csv,refseq-NDUFA9-NM_005002.4_clinical_seed_0_final,refseq-NDUFA9-NM_005002.4.a2m,Invitae,refseq-NDUFA9-NM_005002.4.npy,1,377,377
+NP_004997.4,MLRIPVRKALVGLSKSPKGCVRTTATAASNLIEVFVDGQSVMVEPGTTVLQACEKVGMQIPRFCYHERLSVAGNCRMCLVEIEKAPKVVAACAMPVMKGWNILTNSEKSKKAREGVMEFLLANHPLDCPICDQGGECDLQDQSMMFGNDRSRFLEGKRAVEDKNIGPLVKTIMTRCIQCTRCIRFASEIAGVDDLGTTGRGNDMQVGTYIEKMFMSELSGNIIDICPVGALTSKPYAFTARPWETRKTESIDVMDAVGSNIVVSTRTGEVMRILPRMHEDINEEWISDKTRFAYDGLKRQRLTEPMVRNEKGLLTYTSWEDALSRVAGMLQSFQGKDVAAIAGGLVDAEALVALKDLLNRVDSDTLCTEEVFPTAGAGTDLRSNYLLNTTIAGVEEADVVLLVGTNPRFEAPLFNARIRKSWLHNDLKVALIGSPVDLTYTYDHLGDSPKILQDIASGSHPFSQVLKEAKKPMVVLGSSALQRNDGAAILAAVSSIAQKIRMTSGVTGDWKVMNILHRIASQVAALDLGYKPGVEAIRKNPPKVLFLLGADGGCITRQDLPKDCFIIYQGHHGDVGAPIADVILPGAAYTEKSATYVNTEGRAQQTKVAVTPPGLAREDWKIIRALSEIAGMTLPYDTLDQVRNRLEEVSPNLVRYDDIEGANYFQQANELSKLVNQQLLADPLVPPQLTIKDFYMTDSISRASQTMAKCVKAVTEGAQAVEEPSIC,727,NP_004997.4.csv,refseq-NDUFS1-NM_005006.6_clinical_seed_0_final,refseq-NDUFS1-NM_005006.6.a2m,Invitae,refseq-NDUFS1-NM_005006.6.npy,1,727,727
+NP_005008.2,MDAQCSAKVNARKRRKEAPGPNGATEEDGVPSKVQRCAVGLRQPAPFSDEIEVDFSKPYVRVTMEEASRGTPCERPVRVYADGIFDLFHSGHARALMQAKNLFPNTYLIVGVCSDELTHNFKGFTVMNENERYDAVQHCRYVDEVVRNAPWTLTPEFLAEHRIDFVAHDDIPYSSAGSDDVYKHIKEAGMFAPTQRTEGISTSDIITRIVRDYDVYARRNLQRGYTAKELNVSFINEKKYHLQERVDKVKKKVKDVEEKSKEFVQKVEEKSIDLIQKWEEKSREFIGSFLEMFGPEGALKHMLKEGKGRMLQAISPKQSPSSSPTRERSPSPSFRWPFSGKTSPPCSPANLSRHKAAAYDISEDEED,367,NP_005008.2.csv,refseq-PCYT1A-NM_005017.3_clinical_seed_0_final,refseq-PCYT1A-NM_005017.3.a2m,Invitae,refseq-PCYT1A-NM_005017.3.npy,1,367,367
+NP_005013.1,MAGWNAYIDNLMADGTCQDAAIVGYKDSPSVWAAVPGKTFVNITPAEVGVLVGKDRSSFYVNGLTLGGQKCSVIRDSLLQDGEFSMDLRTKSTGGAPTFNVTVTKTDKTLVLLMGKEGVHGGLINKKCYEMASHLRRSQY,140,NP_005013.1.csv,refseq-PFN1-NM_005022.3_clinical_seed_0_final,refseq-PFN1-NM_005022.3.a2m,Invitae,refseq-PFN1-NM_005022.3.npy,1,140,140
+NP_005016.1,MAFLGLFSLLVLQSMATGATFPEEAIADLSVNMYNRLRATGEDENILFSPLSIALAMGMMELGAQGSTQKEIRHSMGYDSLKNGEEFSFLKEFSNMVTAKESQYVMKIANSLFVQNGFHVNEEFLQMMKKYFNAAVNHVDFSQNVAVANYINKWVENNTNNLVKDLVSPRDFDAATYLALINAVYFKGNWKSQFRPENTRTFSFTKDDESEVQIPMMYQQGEFYYGEFSDGSNEAGGIYQVLEIPYEGDEISMMLVLSRQEVPLATLEPLVKAQLVEEWANSVKKQKVEVYLPRFTVEQEIDLKDVLKALGITEIFIKDANLTGLSDNKEIFLSKAIHKSFLEVNEEGSEAAAVSGMIAISRMAVLYPQVIVDHPFFFLIRNRRTGTILFMGRVMHPETMNTSGHDFEEL,410,NP_005016.1.csv,refseq-SERPINI1-NM_005025.4_clinical_seed_0_final,refseq-SERPINI1-NM_005025.4.a2m,Invitae,refseq-SERPINI1-NM_005025.4.npy,1,410,410
+NP_005017.3,MPPGVDCPMEFWTKEENQSVVVDFLLPTGVYLNFPVSRNANLSTIKQLLWHRAQYEPLFHMLSGPEAYVFTCINQTAEQQELEDEQRRLCDVQPFLPVLRLVAREGDRVKKLINSQISLLIGKGLHEFDSLCDPEVNDFRAKMCQFCEEAAARRQQLGWEAWLQYSFPLQLEPSAQTWGPGTLRLPNRALLVNVKFEGSEESFTFQVSTKDVPLALMACALRKKATVFRQPLVEQPEDYTLQVNGRHEYLYGSYPLCQFQYICSCLHSGLTPHLTMVHSSSILAMRDEQSNPAPQVQKPRAKPPPIPAKKPSSVSLWSLEQPFRIELIQGSKVNADERMKLVVQAGLFHGNEMLCKTVSSSEVSVCSEPVWKQRLEFDINICDLPRMARLCFALYAVIEKAKKARSTKKKSKKADCPIAWANLMLFDYKDQLKTGERCLYMWPSVPDEKGELLNPTGTVRSNPNTDSAAALLICLPEVAPHPVYYPALEKILELGRHSECVHVTEEEQLQLREILERRGSGELYEHEKDLVWKLRHEVQEHFPEALARLLLVTKWNKHEDVAQMLYLLCSWPELPVLSALELLDFSFPDCHVGSFAIKSLRKLTDDELFQYLLQLVQVLKYESYLDCELTKFLLDRALANRKIGHFLFWHLRSEMHVPSVALRFGLILEAYCRGSTHHMKVLMKQGEALSKLKALNDFVKLSSQKTPKPQTKELMHLCMRQEAYLEALSHLQSPLDPSTLLAEVCVEQCTFMDSKMKPLWIMYSNEEAGSGGSVGIIFKNGDDLRQDMLTLQMIQLMDVLWKQEGLDLRMTPYGCLPTGDRTGLIEVVLRSDTIANIQLNKSNMAATAAFNKDALLNWLKSKNPGEALDRAIEEFTLSCAGYCVATYVLGIGDRHSDNIMIRESGQLFHIDFGHFLGNFKTKFGINRERVPFILTYDFVHVIQQGKTNNSEKFERFRGYCERAYTILRRHGLLFLHLFALMRAAGLPELSCSKDIQYLKDSLALGKTEEEALKHFRVKFNEALRESWKTKVNWLAHNVSKDNRQ,1044,NP_005017.3.csv,refseq-PIK3CD-NM_005026.3_clinical_seed_0_final,refseq-PIK3CD-NM_005026.3.a2m,Invitae,refseq-PIK3CD-NM_005026.3.npy,1,1044,1044
+NP_005018.2,MAGPEGFQYRALYPFRRERPEDLELLPGDVLVVSRAALQALGVAEGGERCPQSVGWMPGLNERTRQRGDFPGTYVEFLGPVALARPGPRPRGPRPLPARPRDGAPEPGLTLPDLPEQFSPPDVAPPLLVKLVEAIERTGLDSESHYRPELPAPRTDWSLSDVDQWDTAALADGIKSFLLALPAPLVTPEASAEARRALREAAGPVGPALEPPTLPLHRALTLRFLLQHLGRVASRAPALGPAVRALGATFGPLLLRAPPPPSSPPPGGAPDGSEPSPDFPALLVEKLLQEHLEEQEVAPPALPPKPPKAKPASTVLANGGSPPSLQDAEWYWGDISREEVNEKLRDTPDGTFLVRDASSKIQGEYTLTLRKGGNNKLIKVFHRDGHYGFSEPLTFCSVVDLINHYRHESLAQYNAKLDTRLLYPVSKYQQDQIVKEDSVEAVGAQLKVYHQQYQDKSREYDQLYEEYTRTSQELQMKRTAIEAFNETIKIFEEQGQTQEKCSKEYLERFRREGNEKEMQRILLNSERLKSRIAEIHESRTKLEQQLRAQASDNREIDKRMNSLKPDLMQLRKIRDQYLVWLTQKGARQKKINEWLGIKNETEDQYALMEDEDDLPHHEERTWYVGKINRTQAEEMLSGKRDGTFLIRESSQRGCYACSVVVDGDTKHCVIYRTATGFGFAEPYNLYGSLKELVLHYQHASLVQHNDALTVTLAHPVRAPGPGPPPAAR,728,NP_005018.2.csv,refseq-PIK3R2-NM_005027.4_clinical_seed_0_final,refseq-PIK3R2-NM_005027.4.a2m,Invitae,refseq-PIK3R2-NM_005027.4.npy,1,728,728
+NP_005041.1,MAVAGPAPGAGARPRLDLQFLQRFLQILKVLFPSWSSQNALMFLTLLCLTLLEQFVIYQVGLIPSQYYGVLGNKDLEGFKTLTFLAVMLIVLNSTLKSFDQFTCNLLYVSWRKDLTEHLHRLYFRGRAYYTLNVLRDDIDNPDQRISQDVERFCRQLSSMASKLIISPFTLVYYTYQCFQSTGWLGPVSIFGYFILGTVVNKTLMGPIVMKLVHQEKLEGDFRFKHMQIRVNAEPAAFYRAGHVEHMRTDRRLQRLLQTQRELMSKELWLYIGINTFDYLGSILSYVVIAIPIFSGVYGDLSPAELSTLVSKNAFVCIYLISCFTQLIDLSTTLSDVAGYTHRIGQLRETLLDMSLKSQDCEILGESEWGLDTPPGWPAAEPADTAFLLERVSISAPSSDKPLIKDLSLKISEGQSLLITGNTGTGKTSLLRVLGGLWTSTRGSVQMLTDFGPHGVLFLPQKPFFTDGTLREQVIYPLKEVYPDSGSADDERILRFLELAGLSNLVARTEGLDQQVDWNWYDVLSPGEMQRLSFARLFYLQPKYAVLDEATSALTEEVESELYRIGQQLGMTFISVGHRQSLEKFHSLVLKLCGGGRWELMRIKVE,606,NP_005041.1.csv,refseq-ABCD4-NM_005050.3_clinical_seed_0_final,refseq-ABCD4-NM_005050.3.a2m,Invitae,refseq-ABCD4-NM_005050.3.npy,1,606,606
+NP_005042.1,MAALDSLSLFTSLGLSEQKARETLKNSALSAQLREAATQAQQTLGSTIDKATGILLYGLASRLRDTRRLSFLVSYIASKKIHTEPQLSAALEYVRSHPLDPIDTVDFERECGVGVIVTPEQIEEAVEAAINRHRPQLLVERYHFNMGLLMGEARAVLKWADGKMIKNEVDMQVLHLLGPKLEADLEKKFKVAKARLEETDRRTAKDVVENGETADQTLSLMEQLRGEALKFHKPGENYKTPGYVVTPHTMNLLKQHLEITGGQVRTRFPPEPNGILHIGHAKAINFNFGYAKANNGICFLRFDDTNPEKEEAKFFTAICDMVAWLGYTPYKVTYASDYFDQLYAWAVELIRRGLAYVCHQRGEELKGHNTLPSPWRDRPMEESLLLFEAMRKGKFSEGEATLRMKLVMEDGKMDPVAYRVKYTPHHRTGDKWCIYPTYDYTHCLCDSIEHITHSLCTKEFQARRSSYFWLCNALDVYCPVQWEYGRLNLHYAVVSKRKILQLVATGAVRDWDDPRLFTLTALRRRGFPPEAINNFCARVGVTVAQTTMEPHLLEACVRDVLNDTAPRAMAVLESLRVIITNFPAAKSLDIQVPNFPADETKGFHQVPFAPIVFIERTDFKEEPEPGFKRLAWGQPVGLRHTGYVIELQHVVKGPSGCVESLEVTCRRADAGEKPKAFIHWVSQPLMCEVRLYERLFQHKNPEDPTEVPGGFLSDLNLASLHVVDAALVDCSVALAKPFDKFQFERLGYFSVDPDSHQGKLVFNRTVTLKEDPGKV,775,NP_005042.1.csv,refseq-QARS-NM_005051.2_clinical_seed_0_final,refseq-QARS-NM_005051.2.a2m,Invitae,refseq-QARS-NM_005051.2.npy,1,775,775
+NP_005046.2,MGQDQTKQQIEKGLQLYQSNQTEKALQVWTKVLEKSSDLMGRFRVLGCLVTAHSEMGRYKEMLKFAVVQIDTARELEDADFLLESYLNLARSNEKLCEFHKTISYCKTCLGLPGTRAGAQLGGQVSLSMGNAFLGLSVFQKALESFEKALRYAHNNDDAMLECRVCCSLGSFYAQVKDYEKALFFPCKAAELVNNYGKGWSLKYRAMSQYHMAVAYRLLGRLGSAMECCEESMKIALQHGDRPLQALCLLCFADIHRSRGDLETAFPRYDSAMSIMTEIGNRLGQVQALLGVAKCWVARKALDKALDAIERAQDLAEEVGNKLSQLKLHCLSESIYRSKGLQRELRAHVVRFHECVEETELYCGLCGESIGEKNSRLQALPCSHIFHLRCLQNNGTRSCPNCRRSSMKPGFV,412,NP_005046.2.csv,refseq-RAPSN-NM_005055.4_clinical_seed_0_final,refseq-RAPSN-NM_005055.4.a2m,Invitae,refseq-RAPSN-NM_005055.4.npy,1,412,412
+NP_005052.1,MSHRKFSAPRHGHLGFLPHKRSHRHRGKVKTWPRDDPSQPVHLTAFLGYKAGMTHTLREVHRPGLKISKREEVEAVTIVETPPLVVVGVVGYVATPRGLRSFKTIFAEHLSDECRRRFYKDWHKSKKKAFTKACKRWRDTDGKKQLQKDFAAMKKYCKVIRVIVHTQMKLLPFRQKKAHIMEIQLNGGTVAEKVAWAQARLEKQVPVHSVFSQSEVIDVIAVTKGRGVKGVTSRWHTKKLPRKTHKGLRKVACIGAWHPARVGCSIARAGQKGYHHRTELNKKIFRIGRGPHMEDGKLVKNNASTSYDVTAKSITPLGGFPHYGEVNNDFVMLKGCIAGTKKRVITLRKSLLVHHSRQAVENIELKFIDTTSKFGHGRFQTAQEKRAFMGPQKKHLEKETPETSGDL,407,NP_005052.1.csv,refseq-RPL3L-NM_005061.2_clinical_seed_0_final,refseq-RPL3L-NM_005061.2.a2m,Invitae,refseq-RPL3L-NM_005061.2.npy,1,407,407
+NP_005059.2,MKEKSKNAARTRREKENSEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRVVFPEGLGEAWGHSSRTSPLDNVGRELGSHLLQTLDGFIFVVAPDGKIMYISETASVHLGLSQVELTGNSIYEYIHPADHDEMTAVLTAHQPYHSHFVQEYEIERSFFLRMKCVLAKRNAGLTCGGYKVIHCSGYLKIRQYSLDMSPFDGCYQNVGLVAVGHSLPPSAVTEIKLHSNMFMFRASLDMKLIFLDSRVAELTGYEPQDLIEKTLYHHVHGCDTFHLRCAHHLLLVKGQVTTKYYRFLAKHGGWVWVQSYATIVHNSRSSRPHCIVSVNYVLTDTEYKGLQLSLDQISASKPAFSYTSSSTPTMTDNRKGAKSRLSSSKSKSRTSPYPQYSGFHTERSESDHDSQWGGSPLTDTASPQLLDPADRPGSQHDASCAYRQFSDRSSLCYGFALDHSRLVEERHFHTQACEGGRCEAGRYFLGTPQAGREPWWGSRAALPLTKASPESREAYENSMPHIASVHRIHGRGHWDEDSVVSSPDPGSASESGDRYRTEQYQSSPHEPSKIETLIRATQQMIKEEENRLQLRKAPSDQLASINGAGKKHSLCFANYQQPPPTGEVCHGSALANTSPCDHIQQREGKMLSPHENDYDNSPTALSRISSPNSDRISKSSLILAKDYLHSDISPHQTAGDHPTVSPNCFGSHRQYFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSHFDVTSHLRMQPDPAQGHKGTSVIITNGS,766,NP_005059.2.csv,refseq-SIM1-NM_005068.2_clinical_seed_0_final,refseq-SIM1-NM_005068.2.a2m,Invitae,refseq-SIM1-NM_005068.2.npy,1,766,766
+NP_005076.3,MGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAVSNKYGLVFAGGASGLQIFPTKNLLIQNKPGDDPNKIVDKVQGLLVPMKFPIHHLALSCDNLTLSACMMSSEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYESDHPVRVLDVLWIGTYVFAIVYAAADGTLETSPDVVMALLPKKEEKHPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVSILARQSDQINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTNQVEITISDEKTLPPAPVLMLLSTDGVLCPFYMINQNPGVKSLIKTPERLSLEGERQPKSPGSTPTTPTSSQAPQKLDASAAAAPASLPPSSPAAPIATFSLLPAGGAPTVFSFGSSSLKSSATVTGEPPSYSSGSDSSKAAPGPGPSTFSFVPPSKASLAPTPAASPVAPSAASFSFGSSGFKPTLESTPVPSVSAPNIAMKPSFPPSTSAVKVNLSEKFTAAATSTPVSSSQSAPPMSPFSSASKPAASGPLSHPTPLSAPPSSVPLKSSVLPSPSGRSAQGSSSPVPSMVQKSPRITPPAAKPGSPQAKSLQPAVAEKQGHQWKDSDPVMAGIGEEIAHFQKELEELKARTSKACFQVGTSEEMKMLRTESDDLHTFLLEIKETTESLHGDISSLKTTLLEGFAGVEEAREQNERNRDSGYLHLLYKRPLDPKSEAQLQEIRRLHQYVKFAVQDVNDVLDLEWDQHLEQKKKQRHLLVPERETLFNTLANNREIINQQRKRLNHLVDSLQQLRLYKQTSLWSLSSAVPSQSSIHSFDSDLESLCNALLKTTIESHTKSLPKVPAKLSPMKQAQLRNFLAKRKTPPVRSTAPASLSRSAFLSQRYYEDLDEVSSTSSVSQSLESEDARTSCKDDEAVVQAPRHAPVVRTPSIQPSLLPHAAPFAKSHLVHGSSPGVMGTSVATSASKIIPQGADSTMLATKTVKHGAPSPSHPISAPQAAAAAALRRQMASQAPAVNTLTESTLKNVPQVVNVQELKNNPATPSTAMGSSVPYSTAKTPHPVLTPVAANQAKQGSLINSLKPSGPTPASGQLSSGDKASGTAKIETAVTSTPSASGQFSKPFSFSPSGTGFNFGIITPTPSSNFTAAQGATPSTKESSQPDAFSSGGGSKPSYEAIPESSPPSGITSASNTTPGEPAASSSRPVAPSGTALSTTSSKLETPPSKLGELLFPSSLAGETLGSFSGLRVGQADDSTKPTNKASSTSLTSTQPTKTSGVPSGFNFTAPPVLGKHTEPPVTSSATTTSVAPPAATSTSSTAVFGSLPVTSAGSSGVISFGGTSLSAGKTSFSFGSQQTNSTVPPSAPPPTTAATPLPTSFPTLSFGSLLSSATTPSLPMSAGRSTEEATSSALPEKPGDSEVSASAASLLEEQQSAQLPQAPPQTSDSVKKEPVLAQPAVSNSGTAASSTSLVALSAEATPATTGVPDARTEAVPPASSFSVPGQTAVTAAAISSAGPVAVETSSTPIASSTTSIVAPGPSAEAAAFGTVTSGSSVFAQPPAASSSSAFNQLTNNTATAPSATPVFGQVAASTAPSLFGQQTGSTASTAAATPQVSSSGFSSPAFGTTAPGVFGQTTFGQASVFGQSASSAASVFSFSQPGFSSVPAFGQPASSTPTSTSGSVFGAASSTSSSSSFSFGQSSPNTGGGLFGQSNAPAFGQSPGFGQGGSVFGGTSAATTTAATSGFSFCQASGFGSSNTGSVFGQAASTGGIVFGQQSSSSSGSVFGSGNTGRGGGFFSGLGGKPSQDAANKNPFSSASGGFGSTATSNTSNLFGNSGAKTFGGFASSSFGEQKPTGTFSSGGGSVASQGFGFSSPNKTGGFGAAPVFGSPPTFGGSPGFGGVPAFGSAPAFTSPLGSTGGKVFGEGTAAASAGGFGFGSSSNTTSFGTLASQNAPTFGSLSQQTSGFGTQSSGFSGFGSGTGGFSFGSNNSSVQGFGGWRS,2090,NP_005076.3.csv,refseq-NUP214-NM_005085.3_clinical_seed_0_final,refseq-NUP214-NM_005085.3.a2m,Invitae,refseq-NUP214-NM_005085.3.npy,1,2090,2090
+NP_005085.2,MLLGASLVGVLLFSKLVLKLPWTQVGFSLLFLYLGSGGWRFIRVFIKTIRRDIFGGLVLLKVKAKVRQCLQERRTVPILFASTVRRHPDKTALIFEGTDTHWTFRQLDEYSSSVANFLQARGLASGDVAAIFMENRNEFVGLWLGMAKLGVEAALINTNLRRDALLHCLTTSRARALVFGSEMASAICEVHASLDPSLSLFCSGSWEPGAVPPSTEHLDPLLKDAPKHLPSCPDKGFTDKLFYIYTSGTTGLPKAAIVVHSRYYRMAALVYYGFRMRPNDIVYDCLPLYHSAGNIVGIGQCLLHGMTVVIRKKFSASRFWDDCIKYNCTIVQYIGELCRYLLNQPPREAENQHQVRMALGNGLRQSIWTNFSSRFHIPQVAEFYGATECNCSLGNFDSQVGACGFNSRILSFVYPIRLVRVNEDTMELIRGPDGVCIPCQPGEPGQLVGRIIQKDPLRRFDGYLNQGANNKKIAKDVFKKGDQAYLTGDVLVMDELGYLYFRDRTGDTFRWKGENVSTTEVEGTLSRLLDMADVAVYGVEVPGTEGRAGMAAVASPTGNCDLERFAQVLEKELPLYARPIFLRLLPELHKTGTYKFQKTELRKEGFDPAIVKDPLFYLDAQKGRYVPLDQEAYSRIQAGEEKL,643,NP_005085.2.csv,refseq-SLC27A4-NM_005094.3_clinical_seed_0_final,refseq-SLC27A4-NM_005094.3.a2m,Invitae,refseq-SLC27A4-NM_005094.3_theta_0.2.npy,1,643,643
+NP_005111.2,MAAFGILSYEHRPLKRPRLGPPDVYPQDPKQKEDELTALNVKQGFNNQPAVSGDEHGSAKNVSFNPAKISSNFSSIIAEKLRCNTLPDTGRRKPQVNQKDNFWLVTARSQSAINTWFTDLAGTKPLTQLAKKVPIFSKKEEVFGYLAKYTVPVMRAAWLIKMTCAYYAAISETKVKKRHVDPFMEWTQIITKYLWEQLQKMAEYYRPGPAGSGGCGSTIGPLPHDVEVAIRQWDYTEKLAMFMFQDGMLDRHEFLTWVLECFEKIRPGEDELLKLLLPLLLRYSGEFVQSAYLSRRLAYFCTRRLALQLDGVSSHSSHVISAQSTSTLPTTPAPQPPTSSTPSTPFSDLLMCPQHRPLVFGLSCILQTILLCCPSALVWHYSLTDSRIKTGSPLDHLPIAPSNLPMPEGNSAFTQQVRAKLREIEQQIKERGQAVEVRWSFDKCQEATAGFTIGRVLHTLEVLDSHSFERSDFSNSLDSLCNRIFGLGPSKDGHEISSDDDAVVSLLCEWAVSCKRSGRHRAMVVAKLLEKRQAEIEAERCGESEAADEKGSIASGSLSAPSAPIFQDVLLQFLDTQAPMLTDPRSESERVEFFNLVLLFCELIRHDVFSHNMYTCTLISRGDLAFGAPGPRPPSPFDDPADDPEHKEAEGSSSSKLEDPGLSESMDIDPSSSVLFEDMEKPDFSLFSPTMPCEGKGSPSPEKPDVEKEVKPPPKEKIEGTLGVLYDQPRHVQYATHFPIPQEESCSHECNQRLVVLFGVGKQRDDARHAIKKITKDILKVLNRKGTAETDQLAPIVPLNPGDLTFLGGEDGQKRRRNRPEAFPTAEDIFAKFQHLSHYDQHQVTAQVSRNVLEQITSFALGMSYHLPLVQHVQFIFDLMEYSLSISGLIDFAIQLLNELSVVEAELLLKSSDLVGSYTTSLCLCIVAVLRHYHACLILNQDQMAQVFEGLCGVVKHGMNRSDGSSAERCILAYLYDLYTSCSHLKNKFGELFSDFCSKVKNTIYCNVEPSESNMRWAPEFMIDTLENPAAHTFTYTGLGKSLSENPANRYSFVCNALMHVCVGHHDPDRVNDIAILCAELTGYCKSLSAEWLGVLKALCCSSNNGTCGFNDLLCNVDVSDLSFHDSLATFVAILIARQCLLLEDLIRCAAIPSLLNAACSEQDSEPGARLTCRILLHLFKTPQLNPCQSDGNKPTVGIRSSCDRHLLAASQNRIVDGAVFAVLKAVFVLGDAELKGSGFTVTGGTEELPEEEGGGGSGGRRQGGRNISVETASLDVYAKYVLRSICQQEWVGERCLKSLCEDSNDLQDPVLSSAQAQRLMQLICYPHRLLDNEDGENPQRQRIKRILQNLDQWTMRQSSLELQLMIKQTPNNEMNSLLENIAKATIEVFQQSAETGSSSGSTASNMPSSSKTKPVLSSLERSGVWLVAPLIAKLPTSVQGHVLKAAGEELEKGQHLGSSSRKERDRQKQKSMSLLSQQPFLSLVLTCLKGQDEQREGLLTSLYSQVHQIVNNWRDDQYLDDCKPKQLMHEALKLRLNLVGGMFDTVQRSTQQTTEWAMLLLEIIISGTVDMQSNNELFTTVLDMLSVLINGTLAADMSSISQGSMEENKRAYMNLAKKLQKELGERQSDSLEKVRQLLPLPKQTRDVITCEPQGSLIDTKGNKIAGFDSIFKKEGLQVSTKQKISPWDLFEGLKPSAPLSWGWFGTVRVDRRVARGEEQQRLLLYHTHLRPRPRAYYLEPLPLPPEDEEPPAPTLLEPEKKAPEPPKTDKPGAAPPSTEERKKKSTKGKKRSQPATKTEDYGMGPGRSGPYGVTVPPDLLHHPNPGSITHLNYRQGSIGLYTQNQPLPAGGPRVDPYRPVRLPMQKLPTRPTYPGVLPTTMTGVMGLEPSSYKTSVYRQQQPAVPQGQRLRQQLQQSQGMLGQSSVHQMTPSSSYGLQTSQGYTPYVSHVGLQQHTGPAGTMVPPSYSSQPYQSTHPSTNPTLVDPTRHLQQRPSGYVHQQAPTYGHGLTSTQRFSHQTLQQTPMISTMTPMSAQGVQAGVRSTAILPEQQQQQQQQQQQQQQQQQQQQQQQQQQYHIRQQQQQQILRQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQAAPPQPQPQSQPQFQRQGLQQTQQQQQTAALVRQLQQQLSNTQPQPSTNIFGRY,2177,NP_005111.2.csv,refseq-MED12-NM_005120.2_clinical_seed_0_final,refseq-MED12-NM_005120.2.a2m,Invitae,refseq-MED12-NM_005120.2.npy,1,2177,2177
+NP_005112.2,MSASFVPNGASLEDCHCNLFCLADLTGIKWKKYVWQGPTSAPILFPVTEEDPILSSFSRCLKADVLGVWRRDQRPGRRELWIFWWGEDPSFADLIHHDLSEEEDGVWENGLSYECRTLLFKAVHNLLERCLMNRNFVRIGKWFVKPYEKDEKPINKSEHLSCSFTFFLHGDSNVCTSVEINQHQPVYLLSEEHITLAQQSNSPFQVILCPFGLNGTLTGQAFKMSDSATKKLIGEWKQFYPISCCLKEMSEEKQEDMDWEDDSLAAVEVLVAGVRMIYPACFVLVPQSDIPTPSPVGSTHCSSSCLGVHQVPASTRDPAMSSVTLTPPTSPEEVQTVDPQSVQKWVKFSSVSDGFNSDSTSHHGGKIPRKLANHVVDRVWQECNMNRAQNKRKYSASSGGLCEEATAAKVASWDFVEATQRTNCSCLRHKNLKSRNAGQQGQAPSLGQQQQILPKHKTNEKQEKSEKPQKRPLTPFHHRVSVSDDVGMDADSASQRLVISAPDSQVRFSNIRTNDVAKTPQMHGTEMANSPQPPPLSPHPCDVVDEGVTKTPSTPQSQHFYQMPTPDPLVPSKPMEDRIDSLSQSFPPQYQEAVEPTVYVGTAVNLEEDEANIAWKYYKFPKKKDVEFLPPQLPSDKFKDDPVGPFGQESVTSVTELMVQCKKPLKVSDELVQQYQIKNQCLSAIASDAEQEPKIDPYAFVEGDEEFLFPDKKDRQNSEREAGKKHKVEDGTSSVTVLSHEEDAMSLFSPSIKQDAPRPTSHARPPSTSLIYDSDLAVSYTDLDNLFNSDEDELTPGSKKSANGSDDKASCKESKTGNLDPLSCISTADLHKMYPTPPSLEQHIMGFSPMNMNNKEYGSMDTTPGGTVLEGNSSSIGAQFKIEVDEGFCSPKPSEIKDFSYVYKPENCQILVGCSMFAPLKTLPSQYLPPIKLPEECIYRQSWTVGKLELLSSGPSMPFIKEGDGSNMDQEYGTAYTPQTHTSFGMPPSSAPPSNSGAGILPSPSTPRFPTPRTPRTPRTPRGAGGPASAQGSVKYENSDLYSPASTPSTCRPLNSVEPATVPSIPEAHSLYVNLILSESVMNLFKDCNFDSCCICVCNMNIKGADVGVYIPDPTQEAQYRCTCGFSAVMNRKFGNNSGLFLEDELDIIGRNTDCGKEAEKRFEALRATSAEHVNGGLKESEKLSDDLILLLQDQCTNLFSPFGAADQDPFPKSGVISNWVRVEERDCCNDCYLALEHGRQFMDNMSGGKVDEALVKSSCLHPWSKRNDVSMQCSQDILRMLLSLQPVLQDAIQKKRTVRPWGVQGPLTWQQFHKMAGRGSYGTDESPEPLPIPTFLLGYDYDYLVLSPFALPYWERLMLEPYGSQRDIAYVVLCPENEALLNGAKSFFRDLTAIYESCRLGQHRPVSRLLTDGIMRVGSTASKKLSEKLVAEWFSQAADGNNEAFSKLKLYAQVCRYDLGPYLASLPLDSSLLSQPNLVAPTSQSLITPPQMTNTGNANTPSATLASAASSTMTVTSGVAISTSVATANSTLTTASTSSSSSSNLNSGVSSNKLPSFPPFGSMNSNAAGSMSTQANTVQSGQLGGQQTSALQTAGISGESSSLPTQPHPDVSESTMDRDKVGIPTDGDSHAVTYPPAIVVYIIDPFTYENTDESTNSSSVWTLGLLRCFLEMVQTLPPHIKSTVSVQIIPCQYLLQPVKHEDREIYPQHLKSLAFSAFTQCRRPLPTSTNVKTLTGFGPGLAMETALRSPDRPECIRLYAPPFILAPVKDKQTELGETFGEAGQKYNVLFVGYCLSHDQRWILASCTDLYGELLETCIINIDVPNRARRKKSSARKFGLQKLWEWCLGLVQMSSLPWRVVIGRLGRIGHGELKDWSCLLSRRNLQSLSKRLKDMCRMCGISAADSPSILSACLVAMEPQGSFVIMPDSVSTGSVFGRSTTLNMQTSQLNTPQDTSCTHILVFPTSASVQVASATYTTENLDLAFNPNNDGADGMGIFDLLDTGDDLDPDIINILPASPTGSPVHSPGSHYPHGGDAGKGQSTDRLLSTEPHEEVPNILQQPLALGYFVSTAKAGPLPDWFWSACPQAQYQCPLFLKASLHLHVPSVQSDELLHSKHSHPLDSNQTSDVLRFVLEQYNALSWLTCDPATQDRRSCLPIHFVVLNQLYNFIMNML,2174,NP_005112.2.csv,refseq-MED13-NM_005121.2_clinical_seed_0_final,refseq-MED13-NM_005121.2.a2m,Invitae,refseq-MED13-NM_005121.2.npy,1,2174,2174
+NP_005114.1,MGSKMNLIEHSHLPTTDEFSFSENLFGVLTEQVAGPLGQNLEVEPYSQYSNVQFPQVQPQISSSSYYSNLGFYPQQPEEWYSPGIYELRRMPAETLYQGETEVAEMPVTKKPRMGASAGRIKGDELCVVCGDRASGYHYNALTCEGCKGFFRRSITKNAVYKCKNGGNCVMDMYMRRKCQECRLRKCKEMGMLAECLLTEIQCKSKRLRKNVKQHADQTVNEDSEGRDLRQVTSTTKSCREKTELTPDQQTLLHFIMDSYNKQRMPQEITNKILKEEFSAEENFLILTEMATNHVQVLVEFTKKLPGFQTLDHEDQIALLKGSAVEAMFLRSAEIFNKKLPSGHSDLLEERIRNSGISDEYITPMFSFYKSIGELKMTQEEYALLTAIVILSPDRQYIKDREAVEKLQEPLLDVLQKLCKIHQPENPQHFACLLGRLTELRTFNHHHAEMLMSWRVNDHKFTPLLCEIWDVQ,472,NP_005114.1.csv,refseq-NR1H4-NM_005123.3_clinical_seed_0_final,refseq-NR1H4-NM_005123.3.a2m,Invitae,refseq-NR1H4-NM_005123.3.npy,1,472,472
+NP_005129.2,MLLLTRSPTAWHRLSQLKPRVLPGTLGGQALHLRSWLLSRQGPAETGGQGQPQGPGLRTRLLITGLFGAGLGGAWLALRAEKERLQQQKRTEALRQAAVGQGDFHLLDHRGRARCKADFRGQWVLMYFGFTHCPDICPDELEKLVQVVRQLEAEPGLPPVQPVFITVDPERDDVEAMARYVQDFHPRLLGLTGSTKQVAQASHSYRVYYNAGPKDEDQDYIVDHSIAIYLLNPDGLFTDYYGRSRSAEQISDSVRRHMAAFRSVLS,266,NP_005129.2.csv,refseq-SCO2-NM_005138.2_clinical_seed_0_final,refseq-SCO2-NM_005138.2.a2m,Invitae,refseq-SCO2-NM_005138.2.npy,1,266,266
+NP_005132.2,MKRMVSWSFHKLKTMKHLLLLLLCVFLVKSQGVNDNEEGFFSARGHRPLDKKREEAPSLRPAPPPISGGGYRARPAKAAATQKKVERKAPDAGGCLHADPDLGVLCPTGCQLQEALLQQERPIRNSVDELNNNVEAVSQTSSSSFQYMYLLKDLWQKRQKQVKDNENVVNEYSSELEKHQLYIDETVNSNIPTNLRVLRSILENLRSKIQKLESDVSAQMEYCRTPCTVSCNIPVVSGKECEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTENGGWTVIQNRQDGSVDFGRKWDPYKQGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTRMGPTELLIEMEDWKGDKVKAHYGGFTVQNEANKYQISVNKYRGTAGNALMDGASQLMGENRTMTIHNGMFFSTYDRDNDGWLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGGQYTWDMAKHGTDDGVVWMNWKGSWYSMRKMSMKIRPFFPQQ,491,NP_005132.2.csv,refseq-FGB-NM_005141.4_clinical_seed_0_final,refseq-FGB-NM_005141.4.a2m,Invitae,refseq-FGB-NM_005141.4.npy,1,491,491
+NP_005133.2,MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIAMNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSILQRQMENWAPSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKTLLANSSPFNVDTGAMATLALTCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDIYSTGLAMQALSVTPEPSKKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLKGKTYLDVPQVTCSPDHEVQPTLPSNPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFETTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNEGVADYIPFNHEHITANFTQY,417,NP_005133.2.csv,refseq-GIF-NM_005142.2_clinical_seed_0_final,refseq-GIF-NM_005142.2.a2m,Invitae,refseq-GIF-NM_005142.2.npy,1,417,417
+NP_005135.2,MESTPSFLKGTPTWEKTAPENGIVRQEPGSPPRDGLHHGPLCLGEPAPFWRGVLSTPDSWLPPGFPQGPKDMLPLVEGEGPQNGERKVNWLGSKEGLRWKEAMLTHPLAFCGPACPPRCGPLMPEHSGGHLKSDPVAFRPWHCPFLLETKILERAPFWVPTCLPPYLVSGLPPEHPCDWPLTPHPWVYSGGQPKVPSAFSLGSKGFYYKDPSIPRLAKEPLAAAEPGLFGLNSGGHLQRAGEAERPSLHQRDGEMGAGRQQNPCPLFLGQPDTVPWTSWPACPPGLVHTLGNVWAGPGDGNLGYQLGPPATPRCPSPEPPVTQRGCCSSYPPTKGGGLGPCGKCQEGLEGGASGASEPSEEVNKASGPRACPPSHHTKLKKTWLTRHSEQFECPRGCPEVEERPVARLRALKRAGSPEVQGAMGSPAPKRPPDPFPGTAEQGAGGWQEVRDTSIGNKDVDSGQHDEQKGPQDGQASLQDPGLQDIPCLALPAKLAQCQSCAQAAGEGGGHACHSQQVRRSPLGGELQQEEDTATNSSSEEGPGSGPDSRLSTGLAKHLLSGLGDRLCRLLRREREALAWAQREGQGPAVTEDSPGIPRCCSRCHHGLFNTHWRCPRCSHRLCVACGRVAGTGRAREKAGFQEQSAEECTQEAGHAACSLMLTQFVSSQALAELSTAMHQVWVKFDIRGHCPCQADARVWAPGDAGQQKESTQKTPPTPQPSCNGDTHRTKSIKEETPDSAETPAEDRAGRGPLPCPSLCELLASTAVKLCLGHERIHMAFAPVTPALPSDDRITNILDSIIAQVVERKIQEKALGPGLRAGPGLRKGLGLPLSPVRPRLPPPGALLWLQEPQPCPRRGFHLFQEHWRQGQPVLVSGIQRTLQGNLWGTEALGALGGQVQALSPLGPPQPSSLGSTTFWEGFSWPELRPKSDEGSVLLLHRALGDEDTSRVENLAASLPLPEYCALHGKLNLASYLPPGLALRPLEPQLWAAYGVSPHRGHLGTKNLCVEVADLVSILVHADTPLPAWHRAQKDFLSGLDGEGLWSPGSQVSTVWHVFRAQDAQRIRRFLQMVCPAGAGALEPGAPGSCYLDAGLRRRLREEWGVSCWTLLQAPGEAVLVPAGAPHQVQGLVSTVSVTQHFLSPETSALSAQLCHQGPSLPPDCHLLYAQMDWAVFQAVKVAVGTLQEAK,1189,NP_005135.2.csv,refseq-HR-NM_005144.4_clinical_seed_0_final,refseq-HR-NM_005144.4.a2m,Invitae,refseq-HR-NM_005144.4_theta_0.2.npy,1,1189,1189
+NP_005140.1,MAMSELGTRKPSDGTVSHLLNVVESELQAGREKGDPTEKQLQIILEDAPLWQRFKEVTNEMIVTKNGRRMFPVLKISVTGLDPNAMYSLLLDFVPTDSHRWKYVNGEWVPAGKPEVSSHSCVYIHPDSPNFGAHWMKAPISFSKVKLTNKLNGGGQIMLNSLHKYEPQVHIVRVGSAHRMVTNCSFPETQFIAVTAYQNEEITALKIKYNPFAKAFLDAKERNHLRDVPEAISESQHVTYSHLGGWIFSNPDGVCTAGNSNYQYAAPLPLPAPHTHHGCEHYSGLRGHRQAPYPSAYMHRNHSPSVNLIESSSNNLQVFSGPDSWTSLSSTPHASILSVPHTNGPINPGPSPYPCLWTISNGAGGPSGPGPEVHASTPGAFLLGNPAVTSPPSVLSTQAPTSAGVEVLGEPSLTSIAVSTWTAVASHPFAGWGGPGAGGHHSPSSLDG,448,NP_005140.1.csv,refseq-TBX19-NM_005149.2_clinical_seed_0_final,refseq-TBX19-NM_005149.2.a2m,Invitae,refseq-TBX19-NM_005149.2.npy,1,448,448
+NP_005174.2,MSESEGGKDTTPEPSPANGAGPGPEWGLCPGPPAVEGESSGASGLGTPKRRNQHSKHKTVAVASAQRSPRALFCLTLANPLRRSCISIVEWKPFDILILLTIFANCVALGVYIPFPEDDSNTANHNLEQVEYVFLVIFTVETVLKIVAYGLVLHPSAYIRNGWNLLDFIIVVVGLFSVLLEQGPGRPGDAPHTGGKPGGFDVKALRAFRVLRPLRLVSGVPSLHIVLNSIMKALVPLLHIALLVLFVIIIYAIIGLELFLGRMHKTCYFLGSDMEAEEDPSPCASSGSGRACTLNQTECRGRWPGPNGGITNFDNFFFAMLTVFQCVTMEGWTDVLYWMQDAMGYELPWVYFVSLVIFGSFFVLNLVLGVLSGEFSKEREKAKARGDFQKQREKQQMEEDLRGYLDWITQAEELDMEDPSADDNLGSMAEEGRAGHRPQLAELTNRRRGRLRWFSHSTRSTHSTSSHASLPASDTGSMTETQGDEDEEEGALASCTRCLNKIMKTRVCRRLRRANRVLRARCRRAVKSNACYWAVLLLVFLNTLTIASEHHGQPVWLTQIQEYANKVLLCLFTVEMLLKLYGLGPSAYVSSFFNRFDCFVVCGGILETTLVEVGAMQPLGISVLRCVRLLRIFKVTRHWASLSNLVASLLNSMKSIASLLLLLFLFIIIFSLLGMQLFGGKFNFDQTHTKRSTFDTFPQALLTVFQILTGEDWNVVMYDGIMAYGGPFFPGMLVCIYFIILFICGNYILLNVFLAIAVDNLASGDAGTAKDKGGEKSNEKDLPQENEGLVPGVEKEEEEGARREGADMEEEEEEEEEEEEEEEEEGAGGVELLQEVVPKEKVVPIPEGSAFFCLSQTNPLRKGCHTLIHHHVFTNLILVFIILSSVSLAAEDPIRAHSFRNHILGYFDYAFTSIFTVEILLKMTVFGAFLHRGSFCRSWFNMLDLLVVSVSLISFGIHSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFVAIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYTCTDEAKHTPQECKGSFLVYPDGDVSRPLVRERLWVNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDAYAEDHGPIYNYRVEISVFFIVYIIIIAFFMMNIFVGFVIITFRAQGEQEYQNCELDKNQRQCVEYALKAQPLRRYIPKNPHQYRVWATVNSAAFEYLMFLLILLNTVALAMQHYEQTAPFNYAMDILNMVFTGLFTIEMVLKIIAFKPKHYFTDAWNTFDALIVVGSIVDIAVTEVNNGGHLGESSEDSSRISITFFRLFRVMRLVKLLSKGEGIRTLLWTFIKSFQALPYVALLIAMIFFIYAVIGMQMFGKVALQDGTQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLASLPGNRCDPESDFGPGEEFTCGSNFAIAYFISFFMLCAFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFKRIWSEYDPGAKGRIKHLDVVALLRRIQPPLGFGKLCPHRVACKRLVAMNMPLNSDGTVTFNATLFALVRTSLKIKTEGNLEQANQELRIVIKKIWKRMKQKLLDEVIPPPDEEEVTVGKFYATFLIQDYFRKFRRRKEKGLLGNDAAPSTSSALQAGLRSLQDLGPEMRQALTCDTEEEEEEGQEGVEEEDEKDLETNKATMVSQPSARRGSGISVSLPVGDRLPDSLSFGPSDDDRGTPTSSQPSVPQAGSNTHRRGSGALIFTIPEEGNSQPKGTKGQNKQDEDEEVPDRLSYLDEQAGTPPCSVLLPPHRAQRYMDGHLVPRRRLLPPTPAGRKPSFTIQCLQRQGSCEDLPIPGTYHRGRNSGPNRAQGSWATPPQRGRLLYAPLLLVEEGAAGEGYLGRSSGPLRTFTCLHVPGTHSDPSHGKRGSADSLVEAVLISEGLGLFARDPRFVALAKQEIADACRLTLDEMDNAASDLLAQGTSSLYSDEESILSRFDEEDLGDEMACVHAL,1977,NP_005174.2.csv,CAC1F_HUMAN_b07_clinical_seed_0_final,CAC1F_HUMAN_b07.a2m,EVE,CAC1F_HUMAN_b07_theta_0.2.npy,1,1977,1977
+NP_005179.2,MAGNVKKSSGAGGGSGSGGSGSGGLIGLMKDAFQPHHHHHHHLSPHPPGTVDKKMVEKCWKLMDKVVRLCQNPKLALKNSPPYILDLLPDTYQHLRTILSRYEGKMETLGENEYFRVFMENLMKKTKQTISLFKEGKERMYEENSQPRRNLTKLSLIFSHMLAELKGIFPSGLFQGDTFRITKADAAEFWRKAFGEKTIVPWKSFRQALHEVHPISSGLEAMALKSTIDLTCNDYISVFEFDIFTRLFQPWSSLLRNWNSLAVTHPGYMAFLTYDEVKARLQKFIHKPGSYIFRLSCTRLGQWAIGYVTADGNILQTIPHNKPLFQALIDGFREGFYLFPDGRNQNPDLTGLCEPTPQDHIKVTQEQYELYCEMGSTFQLCKICAENDKDVKIEPCGHLMCTSCLTSWQESEGQGCPFCRCEIKGTEPIVVDPFDPRGSGSLLRQGAEGAPSPNYDDDDDERADDTLFMMKELAGAKVERPPSPFSMAPQASLPPVPPRLDLLPQRVCVPSSASALGTASKAASGSLHKDKPLPVPPTLRDLPPPPPPDRPYSVGAESRPQRRPLPCTPGDCPSRDKLPPVPSSRLGDSWLPRPIPKVPVSAPSSSDPWTGRELTNRHSLPFSLPSQMEPRPDVPRLGSTFSLDTSMSMNSSPLVGPECDHPKIKPSSSANAIYSLAARPLPVPKLPPGEQCEGEEDTEYMTPSSRPLRPLDTSQSSRACDCDQQIDSCTYEAMYNIQSQAPSITESSTFGEGNLAAAHANTGPEESENEDDGYDVPKPPVPAVLARRTLSDISNASSSFGWLSLDGDPTTNVTEGSQVPERPPKPFPRRINSERKAGSCQQGSGPAASAATASPQLSSEIENLMSQGYSYQDIQKALVIAQNNIEMAKNILREFVSISSPAHVAT,906,NP_005179.2.csv,refseq-CBL-NM_005188.3_clinical_seed_0_final,refseq-CBL-NM_005188.3.a2m,Invitae,refseq-CBL-NM_005188.3.npy,1,906,906
+NP_005190.4,MHGGQGPLLLLLLLAVCLGAQGRNQEERLLADLMQNYDPNLRPAERDSDVVNVSLKLTLTNLISLNEREEALTTNVWIEMQWCDYRLRWDPRDYEGLWVLRVPSTMVWRPDIVLENNVDGVFEVALYCNVLVSPDGCIYWLPPAIFRSACSISVTYFPFDWQNCSLIFQSQTYSTNEIDLQLSQEDGQTIEWIFIDPEAFTENGEWAIQHRPAKMLLDPAAPAQEAGHQKVVFYLLIQRKPLFYVINIIAPCVLISSVAILIHFLPAKAGGQKCTVAINVLLAQTVFLFLVAKKVPETSQAVPLISKYLTFLLVVTILIVVNAVVVLNVSLRSPHTHSMARGVRKVFLRLLPQLLRMHVRPLAPAAVQDTQSRLQNGSSGWSITTGEEVALCLPRSELLFQQWQRQGLVAAALEKLEKGPELGLSQFCGSLKQAAPAIQACVEACNLIACARHQQSHFDNGNEEWFLVGRVLDRVCFLAMLSLFICGTAGIFLMAHYNRVPALPFPGDPRPYLPSPD,517,NP_005190.4.csv,refseq-CHRNG-NM_005199.4_clinical_seed_0_final,refseq-CHRNG-NM_005199.4.a2m,Invitae,refseq-CHRNG-NM_005199.4.npy,1,517,517
+NP_005202.2,MGPGVLLLLLVATAWHGQGIPVIEPSVPELVVKPGATVTLRCVGNGSVEWDGPPSPHWTLYSDGSSSILSTNNATFQNTGTYRCTEPGDPLGGSAAIHLYVKDPARPWNVLAQEVVVFEDQDALLPCLLTDPVLEAGVSLVRVRGRPLMRHTNYSFSPWHGFTIHRAKFIQSQDYQCSALMGGRKVMSISIRLKVQKVIPGPPALTLVPAELVRIRGEAAQIVCSASSVDVNFDVFLQHNNTKLAIPQQSDFHNNRYQKVLTLNLDQVDFQHAGNYSCVASNVQGKHSTSMFFRVVESAYLNLSSEQNLIQEVTVGEGLNLKVMVEAYPGLQGFNWTYLGPFSDHQPEPKLANATTKDTYRHTFTLSLPRLKPSEAGRYSFLARNPGGWRALTFELTLRYPPEVSVIWTFINGSGTLLCAASGYPQPNVTWLQCSGHTDRCDEAQVLQVWDDPYPEVLSQEPFHKVTVQSLLTVETLEHNQTYECRAHNSVGSGSWAFIPISAGAHTHPPDEFLFTPVVVACMSIMALLLLLLLLLLYKYKQKPKYQVRWKIIESYEGNSYTFIDPTQLPYNEKWEFPRNNLQFGKTLGAGAFGKVVEATAFGLGKEDAVLKVAVKMLKSTAHADEKEALMSELKIMSHLGQHENIVNLLGACTHGGPVLVITEYCCYGDLLNFLRRKAEAMLGPSLSPGQDPEGGVDYKNIHLEKKYVRRDSGFSSQGVDTYVEMRPVSTSSNDSFSEQDLDKEDGRPLELRDLLHFSSQVAQGMAFLASKNCIHRDVAARNVLLTNGHVAKIGDFGLARDIMNDSNYIVKGNARLPVKWMAPESIFDCVYTVQSDVWSYGILLWEIFSLGLNPYPGILVNSKFYKLVKDGYQMAQPAFAPKNIYSIMQACWALEPTHRPTFQQICSFLQEQAQEDRRERDYTNLPSSSRSGGSGSSSSELEEESSSEHLTCCEQGDIAQPLLQPNNYQFC,972,NP_005202.2.csv,refseq-CSF1R-NM_005211.3_clinical_seed_0_final,refseq-CSF1R-NM_005211.3.a2m,Invitae,refseq-CSF1R-NM_005211.3.npy,1,972,972
+NP_005205.2,MACLGFQRHKAQLNLATRTWPCTLLFFLLFIPVFCKAMHVAQPAVVLASSRGIASFVCEYASPGKATEVRVTVLRQADSQVTEVCAATYMMGNELTFLDDSICTGTSSGNQVNLTIQGLRAMDTGLYICKVELMYPPPYYLGIGNGTQIYVIDPEPCPDSDFLLWILAAVSSGLFFYSFLLTAVSLSKMLKKRSPLTTGVYVKMPPTEPECEKQFQPYFIPIN,223,NP_005205.2.csv,refseq-CTLA4-NM_005214.4_clinical_seed_0_final,refseq-CTLA4-NM_005214.4.a2m,Invitae,refseq-CTLA4-NM_005214.4.npy,1,223,223
+NP_005206.2,MENSLRCVWVPKLAFVLFGASLFSAHLQVTGFQIKAFTALRFLSEPSDAVTMRGGNVLLDCSAESDRGVPVIKWKKDGIHLALGMDERKQQLSNGSLLIQNILHSRHHKPDEGLYQCEASLGDSGSIISRTAKVAVAGPLRFLSQTESVTAFMGDTVLLKCEVIGEPMPTIHWQKNQQDLTPIPGDSRVVVLPSGALQISRLQPGDIGIYRCSARNPASSRTGNEAEVRILSDPGLHRQLYFLQRPSNVVAIEGKDAVLECCVSGYPPPSFTWLRGEEVIQLRSKKYSLLGGSNLLISNVTDDDSGMYTCVVTYKNENISASAELTVLVPPWFLNHPSNLYAYESMDIEFECTVSGKPVPTVNWMKNGDVVIPSDYFQIVGGSNLRILGVVKSDEGFYQCVAENEAGNAQTSAQLIVPKPAIPSSSVLPSAPRDVVPVLVSSRFVRLSWRPPAEAKGNIQTFTVFFSREGDNRERALNTTQPGSLQLTVGNLKPEAMYTFRVVAYNEWGPGESSQPIKVATQPELQVPGPVENLQAVSTSPTSILITWEPPAYANGPVQGYRLFCTEVSTGKEQNIEVDGLSYKLEGLKKFTEYSLRFLAYNRYGPGVSTDDITVVTLSDVPSAPPQNVSLEVVNSRSIKVSWLPPPSGTQNGFITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGTGPPSNWYTAETPENDLDESQVPDQPSSLHVRPQTNCIIMSWTPPLNPNIVVRGYIIGYGVGSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPLYESATTRSITDPTDPVDYYPLLDDFPTSVPDLSTPMLPPVGVQAVALTHDAVRVSWADNSVPKNQKTSEVRLYTVRWRTSFSASAKYKSEDTTSLSYTATGLKPNTMYEFSVMVTKNRRSSTWSMTAHATTYEAAPTSAPKDLTVITREGKPRAVIVSWQPPLEANGKITAYILFYTLDKNIPIDDWIMETISGDRLTHQIMDLNLDTMYYFRIQARNSKGVGPLSDPILFRTLKVEHPDKMANDQGRHGDGGYWPVDTNLIDRSTLNEPPIGQMHPPHGSVTPQKNSNLLVIIVVTVGVITVLVVVIVAVICTRRSSAQQRKKRATHSAGKRKGSQKDLRPPDLWIHHEEMEMKNIEKPSGTDPAGRDSPIQSCQDLTPVSHSQSETQLGSKSTSHSGQDTEEAGSSMSTLERSLAARRAPRAKLMIPMDAQSNNPAVVSAIPVPTLESAQYPGILPSPTCGYPHPQFTLRPVPFPTLSVDRGFGAGRSQSVSEGPTTQQPPMLPPSQPEHSSSEEAPSRTIPTACVRPTHPLRSFANPLLPPPMSAIEPKVPYTPLLSQPGPTLPKTHVKTASLGLAGKARSPLLPVSVPTAPEVSEESHKPTEDSANVYEQDDLSEQMASLEGLMKQLNAITGSAF,1447,NP_005206.2.csv,refseq-DCC-NM_005215.3_clinical_seed_0_final,refseq-DCC-NM_005215.3.a2m,Invitae,refseq-DCC-NM_005215.3_theta_0.2.npy,1,1447,1447
+NP_005207.3,MEPSTAARAWALFWLLLPLLGAVCASGPRTLVLLDNLNVRETHSLFFRSLKDRGFELTFKTADDPSLSLIKYGEFLYDNLIIFSPSVEDFGGNINVETISAFIDGGGSVLVAASSDIGDPLRELGSECGIEFDEEKTAVIDHHNYDISDLGQHTLIVADTENLLKAPTIVGKSSLNPILFRGVGMVADPDNPLVLDILTGSSTSYSFFPDKPITQYPHAVGKNTLLIAGLQARNNARVIFSGSLDFFSDSFFNSAVQKAAPGSQRYSQTGNYELAVALSRWVFKEEGVLRVGPVSHHRVGETAPPNAYTVTDLVEYSIVIQQLSNGKWVPFDGDDIQLEFVRIDPFVRTFLKKKGGKYSVQFKLPDVYGVFQFKVDYNRLGYTHLYSSTQVSVRPLQHTQYERFIPSAYPYYASAFSMMLGLFIFSIVFLHMKEKEKSD,439,NP_005207.3.csv,refseq-DDOST-NM_005216.5_clinical_seed_0_final,refseq-DDOST-NM_005216.5.a2m,Invitae,refseq-DDOST-NM_005216.5_theta_0.2.npy,1,439,439
+NP_005211.1,MSGSFDRKLSSILTDISSSLSCHAGSKDSPTLPESSVTDLGYYSAPQHDYYSGQPYGQTVNPYTYHHQFNLNGLAGTGAYSPKSEYTYGASYRQYGAYREQPLPAQDPVSVKEEPEAEVRMVNGKPKKVRKPRTIYSSYQLAALQRRFQKAQYLALPERAELAAQLGLTQTQVKIWFQNRRSKFKKLYKNGEVPLEHSPNNSDSMACNSPPSPALWDTSSHSTPAPARSQLPPPLPYSASPSYLDDPTNSWYHAQNLSGPHLQQQPPQPATLHHASPGPPPNPGAVY,287,NP_005211.1.csv,refseq-DLX3-NM_005220.3_clinical_seed_0_final,refseq-DLX3-NM_005220.3.a2m,Invitae,refseq-DLX3-NM_005220.3.npy,1,287,287
+NP_005212.1,MTGVFDRRVPSIRSGDFQAPFQTSAAMHHPSQESPTLPESSATDSDYYSPTGGAPHGYCSPTSASYGKALNPYQYQYHGVNGSAGSYPAKAYADYSYASSYHQYGGAYNRVPSATNQPEKEVTEPEVRMVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQVKIWFQNKRSKIKKIMKNGEMPPEHSPSSSDPMACNSPQSPAVWEPQGSSRSLSHHPHAHPPTSNQSPASSYLENSASWYTSAASSINSHLPPPGSLQHPLALASGTLY,289,NP_005212.1.csv,refseq-DLX5-NM_005221.5_clinical_seed_0_final,refseq-DLX5-NM_005221.5.a2m,Invitae,refseq-DLX5-NM_005221.5.npy,1,289,289
+NP_005219.2,MRPSGTAGAALLALLAALCPASRALEEKKVCQGTSNKLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYDANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICAQQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGPDNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDYVREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVIQGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSSPSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSSDPTGALTEDSIDDTFLPVPEYINQSVPKRPAGSVQNPVYHNQPLNPAPSRDPHYQDPHSTAVGNPEYLNTVQPTCVNSTFDSPAHWAQKGSHQISLDNPDYQQDFFPKEAKPNGIFKGSTAENAEYLRVAPQSSEFIGA,1210,NP_005219.2.csv,refseq-EGFR-NM_005228.5_clinical_seed_0_final,refseq-EGFR-NM_005228.5.a2m,Invitae,refseq-EGFR-NM_005228.5.npy,1,1210,1210
+NP_005226.1,MKPATGLWVWVSLLVAAGTVQPSDSQSVCAGTENKLSSLSDLEQQYRALRKYYENCEVVMGNLEITSIEHNRDLSFLRSVREVTGYVLVALNQFRYLPLENLRIIRGTKLYEDRYALAIFLNYRKDGNFGLQELGLKNLTEILNGGVYVDQNKFLCYADTIHWQDIVRNPWPSNLTLVSTNGSSGCGRCHKSCTGRCWGPTENHCQTLTRTVCAEQCDGRCYGPYVSDCCHRECAGGCSGPKDTDCFACMNFNDSGACVTQCPQTFVYNPTTFQLEHNFNAKYTYGAFCVKKCPHNFVVDSSSCVRACPSSKMEVEENGIKMCKPCTDICPKACDGIGTGSLMSAQTVDSSNIDKFINCTKINGNLIFLVTGIHGDPYNAIEAIDPEKLNVFRTVREITGFLNIQSWPPNMTDFSVFSNLVTIGGRVLYSGLSLLILKQQGITSLQFQSLKEISAGNIYITDNSNLCYYHTINWTTLFSTINQRIVIRDNRKAENCTAEGMVCNHLCSSDGCWGPGPDQCLSCRRFSRGRICIESCNLYDGEFREFENGSICVECDPQCEKMEDGLLTCHGPGPDNCTKCSHFKDGPNCVEKCPDGLQGANSFIFKYADPDRECHPCHPNCTQGCNGPTSHDCIYYPWTGHSTLPQHARTPLIAAGVIGGLFILVIVGLTFAVYVRRKSIKKKRALRRFLETELVEPLTPSGTAPNQAQLRILKETELKRVKVLGSGAFGTVYKGIWVPEGETVKIPVAIKILNETTGPKANVEFMDEALIMASMDHPHLVRLLGVCLSPTIQLVTQLMPHGCLLEYVHEHKDNIGSQLLLNWCVQIAKGMMYLEERRLVHRDLAARNVLVKSPNHVKITDFGLARLLEGDEKEYNADGGKMPIKWMALECIHYRKFTHQSDVWSYGVTIWELMTFGGKPYDGIPTREIPDLLEKGERLPQPPICTIDVYMVMVKCWMIDADSRPKFKELAAEFSRMARDPQRYLVIQGDDRMKLPSPNDSKFFQNLLDEEDLEDMMDAEEYLVPQAFNIPPPIYTSRARIDSNRSEIGHSPPPAYTPMSGNQFVYRDGGFAAEQGVSVPYRAPTSTIPEAPVAQGATAEIFDDSCCNGTLRKPVAPHVQEDSSTQRYSADPTVFAPERSPRGELDEEGYMTPMRDKPKQEYLNPVEENPFVSRRKNGDLQALDNPEYHNASNGPPKAEDEYVNEPLYLNTFANTLGKAEYLKNNILSMPEKAKKAFDNPDYWNHSLPPRSTLQHPDYLQEYSTKYFYKQNGRIRPIVAENPEYLSEFSLKPGTVLPPPPYRHRNTVV,1308,NP_005226.1.csv,refseq-ERBB4-NM_005235.2_clinical_seed_0_final,refseq-ERBB4-NM_005235.2.a2m,Invitae,refseq-ERBB4-NM_005235.2.npy,1,1308,1308
+NP_005227.1,MESGQPARRIAMAPLLEYERQLVLELLDTDGLVVCARGLGADRLLYHFLQLHCHPACLVLVLNTQPAEEEYFINQLKIEGVEHLPRRVTNEITSNSRYEVYTQGGVIFATSRILVVDFLTDRIPSDLITGILVYRAHRIIESCQEAFILRLFRQKNKRGFIKAFTDNAVAFDTGFCHVERVMRNLFVRKLYLWPRFHVAVNSFLEQHKPEVVEIHVSMTPTMLAIQTAILDILNACLKELKCHNPSLEVEDLSLENAIGKPFDKTIRHYLDPLWHQLGAKTKSLVQDLKILRTLLQYLSQYDCVTFLNLLESLRATEKAFGQNSGWLFLDSSTSMFINARARVYHLPDAKMSKKEKISEKMEIKEGEETKKELVLESNPKWEALTEVLKEIEAENKESEALGGPGQVLICASDDRTCSQLRDYITLGAEAFLLRLYRKTFEKDSKAEEVWMKFRKEDSSKRIRKSHKRPKDPQNKERASTKERTLKKKKRKLTLTQMVGKPEELEEEGDVEEGYRREISSSPESCPEEIKHEEFDVNLSSDAAFGILKEPLTIIHPLLGCSDPYALTRVLHEVEPRYVVLYDAELTFVRQLEIYRASRPGKPLRVYFLIYGGSTEEQRYLTALRKEKEAFEKLIREKASMVVPEEREGRDETNLDLVRGTASADVSTDTRKAGGQEQNGTQQSIVVDMREFRSELPSLIHRRGIDIEPVTLEVGDYILTPEMCVERKSISDLIGSLNNGRLYSQCISMSRYYKRPVLLIEFDPSKPFSLTSRGALFQEISSNDISSKLTLLTLHFPRLRILWCPSPHATAELFEELKQSKPQPDAATALAITADSETLPESEKYNPGPQDFLLKMPGVNAKNCRSLMHHVKNIAELAALSQDELTSILGNAANAKQLYDFIHTSFAEVVSKGKGKK,916,NP_005227.1.csv,refseq-ERCC4-NM_005236.2_clinical_seed_0_final,refseq-ERCC4-NM_005236.2.a2m,Invitae,refseq-ERCC4-NM_005236.2.npy,1,916,916
+NP_005238.1,MGLIWLLLLSLLEPGWPAAGPGARLRRDAGGRGGVYEHLGGAPRRRKLYCATKYHLQLHPSGRVNGSLENSAYSILEITAVEVGIVAIRGLFSGRYLAMNKRGRLYASEHYSAECEFVERIHELGYNTYASRLYRTVSSTPGARRQPSAERLWYVSVNGKGRPRRGFKTRRTQKSSLFLPRVLDHRDHEMVRQLQSGLPRPPGKGVQPRRRRQKQSPDNLEPSHVQASRLGSQLEASAH,239,NP_005238.1.csv,refseq-FGF3-NM_005247.2_clinical_seed_0_final,refseq-FGF3-NM_005247.2.a2m,Invitae,refseq-FGF3-NM_005247.2.npy,1,239,239
+NP_005240.3,MLDMGDRKEVKMIPKSSFSINSLVPEAVQNDNHHASHGHHNSHHPQHHHHHHHHHHHPPPPAPQPPPPPQQQQPPPPPPPAPQPPQTRGAPAADDDKGPQQLLLPPPPPPPPAAALDGAKADGLGGKGEPGGGPGELAPVGPDEKEKGAGAGGEEKKGAGEGGKDGEGGKEGEKKNGKYEKPPFSYNALIMMAIRQSPEKRLTLNGIYEFIMKNFPYYRENKQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFIGGTTGKLRRRSTTSRAKLAFKRGARLTSTGLTFMDRAGSLYWPMSPFLSLHHPRASSTLSYNGTTSAYPSHPMPYSSVLTQNSLGNNHSFSTANGLSVDRLVNGEIPYATHHLTAAALAASVPCGLSVPCSGTYSLNPCSVNLLAGQTSYFFPHVPHPSMTSQSSTSMSARAASSSTSPQAPSTLPCESLRPSLPSFTTGLSGGLSDYFTHQNQGSSSNPLIH,489,NP_005240.3.csv,refseq-FOXG1-NM_005249.4_clinical_seed_0_final,refseq-FOXG1-NM_005249.4.a2m,Invitae,refseq-FOXG1-NM_005249.4.npy,1,489,489
+NP_005242.1,MQARYSVSDPNALGVVPYLSEQNYYRAAGSYGGMASPMGVYSGHPEQYSAGMGRSYAPYHHHQPAAPKDLVKPPYSYIALITMAIQNAPEKKITLNGIYQFIMDRFPFYRENKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDVSKEKEERAHLKEPPPAASKGAPATPHLADAPKEAEKKVVIKSEAASPALPVITKVETLSPESALQGSPRSAASTPAGSPDGSLPEHHAAAPNGLPGFSVENIMTLRTSPPGGELSPGAGRAGLVVPPLALPYAAAPPAAYGQPCAQGLEAGAAGGYQCSMRAMSLYTGAERPAHMCVPPALDEALSDHPSGPTSPLSALNLAAGQEGALAATGHHHQHHGHHHPQAPPPPPAPQPQPTPQPGAAAAQAASWYLNHSGDLNHLPGHTFAAQQQTFPNVREMFNSHRLGIENSTLGESQVSGNASCQLPYRSTPPLYRHAAPYSYDCTKY,501,NP_005242.1.csv,refseq-FOXC2-NM_005251.2_clinical_seed_0_final,refseq-FOXC2-NM_005251.2.a2m,Invitae,refseq-FOXC2-NM_005251.2.npy,1,501,501
+NP_005248.2,MALTDGGWCLPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASGAGAPVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA,595,NP_005248.2.csv,refseq-GATA6-NM_005257.5_clinical_seed_0_final,refseq-GATA6-NM_005257.5.a2m,Invitae,refseq-GATA6-NM_005257.5.npy,1,595,595
+NP_005253.3,MAAPGERGRFHGGNLFFLPGGARSEMMDDLATDARGRGAGRRDAAASASTPAQAPTSDSPVAEDASRRRPCRACVDFKTWMRTQQKRDTKFREDCPPDREELGRHSWAVLHTLAAYYPDLPTPEQQQDMAQFIHLFSKFYPCEECAEDLRKRLCRNHPDTRTRACFTQWLCHLHNEVNRKLGKPDFDCSKVDERWRDGWKDGSCD,205,NP_005253.3.csv,refseq-GFER-NM_005262.2_clinical_seed_0_final,refseq-GFER-NM_005262.2.a2m,Invitae,refseq-GFER-NM_005262.2.npy,1,205,205
+NP_005254.2,MPRSFLVKSKKAHSYHQPRSPGPDYSLRLENVPAPSRADSTSNAGGAKAEPRDRLSPESQLTEAPDRASASPDSCEGSVCERSSEFEDFWRPPSPSASPASEKSMCPSLDEAQPFPLPFKPYSWSGLAGSDLRHLVQSYRPCGALERGAGLGLFCEPAPEPGHPAALYGPKRAAGGAGAGAPGSCSAGAGATAGPGLGLYGDFGSAAAGLYERPTAAAGLLYPERGHGLHADKGAGVKVESELLCTRLLLGGGSYKCIKCSKVFSTPHGLEVHVRRSHSGTRPFACEMCGKTFGHAVSLEQHKAVHSQERSFDCKICGKSFKRSSTLSTHLLIHSDTRPYPCQYCGKRFHQKSDMKKHTFIHTGEKPHKCQVCGKAFSQSSNLITHSRKHTGFKPFGCDLCGKGFQRKVDLRRHRETQHGLK,422,NP_005254.2.csv,refseq-GFI1-NM_005263.3_clinical_seed_0_final,refseq-GFI1-NM_005263.3.a2m,Invitae,refseq-GFI1-NM_005263.3.npy,1,422,422
+NP_005258.2,MGDWSFLGNILEEVNEHSTVIGRVWLTVLFIFRILILGTAAEFVWGDEQSDFVCNTQQPGCENVCYDEAFPISHIRLWVLQIIFVSTPSLMYVGHAVHYVRMEEKRKSREAEELGQQAGTNGGPDQGSVKKSSGSKGTKKFRLEGTLLRTYICHIIFKTLFEVGFIVGHYFLYGFRILPLYRCSRWPCPNVVDCFVSRPTEKTIFILFMLSVASVSLFLNVMELGHLGLKGIRSALKRPVEQPLGEIPEKSLHSIAVSSIQKAKGYQLLEEEKIVSHYFPLTEVGMVETSPLPAKPFNQFEEKISTGPLGDLSRGYQETLPSYAQVGAQEVEGEGPPAEEGAEPEVGEKKEEAERLTTEEQEKVAVPEGEKVETPGVDKEGEKEEPQSEKVSKQGLPAEKTPSLCPELTTDDARPLSRLSKASSRARSDDLTV,433,NP_005258.2.csv,refseq-GJA8-NM_005267.4_clinical_seed_0_final,refseq-GJA8-NM_005267.4.a2m,Invitae,refseq-GJA8-NM_005267.4.npy,1,433,433
+NP_005262.1,MYRYLGEALLLSRAGPAALGSASADSAALLGWARGQPAAAPQPGLALAARRHYSEAVADREDDPNFFKMVEGFFDRGASIVEDKLVEDLRTRESEEQKRNRVRGILRIIKPCNHVLSLSFPIRRDDGSWEVIEGYRAQHSQHRTPCKGGIRYSTDVSVDEVKALASLMTYKCAVVDVPFGGAKAGVKINPKNYTDNELEKITRRFTMELAKKGFIGPGIDVPAPDMSTGEREMSWIADTYASTIGHYDINAHACVTGKPISQGGIHGRISATGRGVFHGIENFINEASYMSILGMTPGFGDKTFVVQGFGNVGLHSMRYLHRFGAKCIAVGESDGSIWNPDGIDPKELEDFKLQHGSILGFPKAKPYEGSILEADCDILIPAASEKQLTKSNAPRVKAKIIAEGANGPTTPEADKIFLERNIMVIPDLYLNAGGVTVSYFEWLKNLNHVSYGRLTFKYERDSNYHLLMSVQESLERKFGKHGGTIPIVPTAEFQDRISGASEKDIVHSGLAYTMERSARQIMRTAMKYNLGLDLRTAAYVNAIEKVFKVYNEAGVTFT,558,NP_005262.1.csv,refseq-GLUD1-NM_005271.4_clinical_seed_0_final,refseq-GLUD1-NM_005271.4.a2m,Invitae,refseq-GLUD1-NM_005271.4.npy,1,558,558
+NP_005263.1,MGSGASAEDKELAKRSKELEKKLQEDADKEAKTVKLLLLGAGESGKSTIVKQMKIIHQDGYSPEECLEFKAIIYGNVLQSILAIIRAMTTLGIDYAEPSCADDGRQLNNLADSIEEGTMPPELVEVIRRLWKDGGVQACFERAAEYQLNDSASYYLNQLERITDPEYLPSEQDVLRSRVKTTGIIETKFSVKDLNFRMFDVGGQRSERKKWIHCFEGVTCIIFCAALSAYDMVLVEDDEVNRMHESLHLFNSICNHKFFAATSIVLFLNKKDLFEEKIKKVHLSICFPEYDGNNSYDDAGNYIKSQFLDLNMRKDVKEIYSHMTCATDTQNVKFVFDAVTDIIIKENLKDCGLF,354,NP_005263.1.csv,refseq-GNAT2-NM_005272.3_clinical_seed_0_final,refseq-GNAT2-NM_005272.3.a2m,Invitae,refseq-GNAT2-NM_005272.3.npy,1,354,354
+NP_005264.2,MSELEQLRQEAEQLRNQIRDARKACGDSTLTQITAGLDPVGRIQMRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSYTTNKVHAIPLRSSWVMTCAYAPSGNFVACGGLDNICSIYSLKTREGNVRVSRELPGHTGYLSCCRFLDDNQIITSSGDTTCALWDIETGQQTVGFAGHSGDVMSLSLAPDGRTFVSGACDASIKLWDVRDSMCRQTFIGHESDINAVAFFPNGYAFTTGSDDATCRLFDLRADQELLMYSHDNIICGITSVAFSRSGRLLLAGYDDFNCNIWDAMKGDRAGVLAGHDNRVSCLGVTDDGMAVATGSWDSFLKIWN,340,NP_005264.2.csv,refseq-GNB2-NM_005273.3_clinical_seed_0_final,refseq-GNB2-NM_005273.3.a2m,Invitae,refseq-GNB2-NM_005273.3.npy,1,340,340
+NP_005324.3,MGLSPSAPAVAVQASNASASPPSGCPMHEGKMKGCPVNTEPSGPTCEKKTYSVPAHQERAYEYVECPIRGTAAENKENLDPSNLMPPPNQTPAPDQPFALSTVREESSIPRADSEKKWVYPSEQMFWNAMLKKGWKWKDEDISQKDMYNIIRIHNQNNEQAWKEILKWEALHAAECPCGPSLIRFGGKAKEYSPRARIRSWMGYELPFDRHDWIINRCGTEVRYVIDYYDGGEVNKDYQFTILDVRPALDSLSAVWDRMKVAWWRWTS,268,NP_005324.3.csv,refseq-HCCS-NM_005333.4_clinical_seed_0_final,refseq-HCCS-NM_005333.4.a2m,Invitae,refseq-HCCS-NM_005333.4.npy,1,268,268
+NP_005325.2,MASAVSPANLPAVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGGNEGIVDELHVYNTATNQWFIPAVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKYSNDLYELQASRWEWKRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRPGSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLETEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSVPANPPKSPAPAAAAPAVQPLTQVGITLLPQAAPAPPTTTTIQVLPTVPGSSISVPTAARTQGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVMVSNPATRMLKTAAAQVGTSVSSATNTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITLVKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPGQPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGHSTSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQPVSQPTQVTLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTLVCSNPPCETHETGTTNTATTTVVANLGGHPQPTQVQFVCDRQEAAASLVTSTVGQQNGSVVRVCSNPPCETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTNTATTAMSSVGANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLGPSMAREPGGRSPAFVQLAPLSSKVRLSSPSIKDLPAGRHSHAVSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTALEALLCPSATVTQVCSNPPCETHETGTTNTATTSNAGSAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQPPAGRPCETHQTTSTGTTMSVSVGALLPDATSSHRTVESGLEVAAAPSVTPQAGTALLAPFPTQRVCSNPPCETHETGTTHTATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPPPEELQVSPGPRQQLPPRQLLQSASTALMGESAEVLSASQTPELPAAVDLSSTGEPSSGQESAGSAVVATVVVQPPPPTQSEVDQLSLPQELMAEAQAGTTTLMVTGLTPEELAVTAAAEAAAQAAATEEAQALAIQAVLQAAQQAVMGTGEPMDTSEAAATVTQAELGHLSAEGQEGQATTIPIVLTQQELAALVQQQQLQEAQAQQQHHHLPTEALAPADSLNDPAIESNCLNELAGTVPSTVALLPSTATESLAPSNTFVAPQPVVVASPAKLQAAATLTEVANGIESLGVKPDLPPPPSKAPMKKENQWFDVGVIKGTNVMVTHYFLPPDDAVPSDDDLGTVPDYNQLKKQELQPGTAYKFRVAGINACGRGPFSEISAFKTCLPGFPGAPCAIKISKSPDGAHLTWEPPSVTSGKIIEYSVYLAIQSSQAGGELKSSTPAQLAFMRVYCGPSPSCLVQSSSLSNAHIDYTTKPAIIFRIAARNEKGYGPATQVRWLQETSKDSSGTKPANKRPMSSPEMKSAPKKSKADGQ,2035,NP_005325.2.csv,refseq-HCFC1-NM_005334.2_clinical_seed_0_final,refseq-HCFC1-NM_005334.2.a2m,Invitae,refseq-HCFC1-NM_005334.2.npy,1,2035,2035
+NP_005328.2,MSLTSAYQHKLAEKLTILNDRGQGVLIRMYNIKKTCSDPKSKPPFLLEKSMEPSLKYINKKFPNIDVRNSTQHLGPVHREKAEIIRFLTNYYQSFVDVMEFRDHVYELLNTIDACQCHFDINLNFDFTRSYLDLIVTYTSVILLLSRIEDRRILIGMYNCAHEMLHGHGDPSFARLGQMVLEYDHPLKKLTEEFGPHTKAVSGALLSLHFLFVRRNQGAEQWRSAQLLSLISNPPAMINPANSDTMACEYLSVEVMERWIIIGFLLCHGCLNSNSQCQKLWKLCLQGSLYITLIREDVLQVHKVTEDLFSSLKGYGKRVADIKESKEHVIANSGQFHCQRRQFLRMAVKELETVLADEPGLLGPKALFAFMALSFIRDEVTWLVRHTENVTKTKTPEDYADSSIAELLFLLEGIRSLVRRHIKVIQQYHLQYLARFDALVLSDIIQNLSVCPEEESIIMSSFVSILSSLNLKQVDNGEKFEFSGLRLDWFRLQAYTSVAKAPLHLHENPDLAKVMNLIVFHSRMLDSVEKLLVETSDLSTFCFHLRIFEKMFAMTLEESAMLRYAIAFPLICAHFVHCTHEMCPEEYPHLKNHGLHHCNSFLEELAKQTSNCVLEICAEQRNLSEQLLPKHCATTISKAKNKKTRKQRQTPRKGEPERDKPGAESHRKNRSIVTNMDKLHLNLTELALTMNHVYSFSVFEHTIFPSEYLSSHLEARLNRAIVWLAGYNATTQEIVRPSELLAGVKAYIGFIQSLAQFLGADASRVIRNALLQQTQPLDSCGEQTITTLYTNWYLESLLRQASSGTIILSPAMQAFVSLPREGEQNFSAEEFSDISEMRALAELLGPYGMKFLSENLMWHVTSQIVELKKLVVENMDILVQIRSNFSKPDLMASLLPQLTGAENVLKRMTIIGVILSFRAMAQEGLREVFSSHCPFLMGPIECLKEFVTPDTDIKVTLSIFELASAAGVGCDIDPALVAAIANLKADTSSPEEEYKVACLLLIFLAVSLPLLATDPSSFYSIEKDGYNNNIHCLTKAIIQVSAALFTLYNKNIETHLKEFLVVASVSLLQLGQETDKLKTRNRESISLLMRLVVEESSFLTLDMLESCFPYVLLRNAYREVSRAFHLN,1127,NP_005328.2.csv,refseq-NCKAP1L-NM_005337.4_clinical_seed_0_final,refseq-NCKAP1L-NM_005337.4.a2m,Invitae,refseq-NCKAP1L-NM_005337.4.npy,1,1127,1127
+NP_005334.1,MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKCDLAARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQHKLRKLNPPDESGPGCMSCKCVLS,189,NP_005334.1.csv,refseq-HRAS-NM_005343.2_clinical_seed_0_final,refseq-HRAS-NM_005343.2.a2m,Invitae,refseq-HRAS-NM_005343.2_theta_0.2.npy,1,189,189
+NP_005340.2,MDHTEGSPAEEPPAHAPSPGKFGERPPPKRLTREAMRNYLKERGDQTVLILHAKVAQKSYGNEKRFFCPPPCVYLMGSGWKKKKEQMERDGCSEQESQPCAFIGIGNSDQEMQQLNLEGKNYCTAKTLYISDSDKRKHFMLSVKMFYGNSDDIGVFLSKRIKVISKPSKKKQSLKNADLCIASGTKVALFNRLRSQTVSTRYLHVEGGNFHASSQQWGAFFIHLLDDDESEGEEFTVRDGYIHYGQTVKLVCSVTGMALPRLIIRKVDKQTALLDADDPVSQLHKCAFYLKDTERMYLCLSQERIIQFQATPCPKEPNKEMINDGASWTIISTDKAEYTFYEGMGPVLAPVTPVPVVESLQLNGGGDVAMLELTGQNFTPNLRVWFGDVEAETMYRCGESMLCVVPDISAFREGWRWVRQPVQVPVTLVRNDGIIYSTSLTFTYTPEPGPRPHCSAAGAILRANSSQVPPNESNTNSEGSYTNASTNSTSVTSSTATVVS,500,NP_005340.2.csv,refseq-RBPJ-NM_005349.3_clinical_seed_0_final,refseq-RBPJ-NM_005349.3.a2m,Invitae,refseq-RBPJ-NM_005349.3.npy,1,500,500
+NP_005350.1,MDNMSITNTPTSNDACLSIVHSLMCHRQGGESETFAKRAIESLVKKLKEKKDELDSLITAITTNGAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARLWRWPDLHKNELKHVKYCQYAFDLKCDSVCVNPYHYERVVSPGIDLSGLTLQSNAPSSMMVKDEYVHDFEGQPSLSTEGHSIQTIQHPPSNRASTETYSTPALLAPSESNATSTANFPNIPVASTSQPASILGGSHSEGLLQIASGPQPGQQQNGFTGQPATYHHNSTTTWTGSRTAPYTPNLPHHQNGHLQHHPPMPPHPGHYWPVHNELAFQPPISNHPAPEYWCSIAYFEMDVQVGETFKVPSSCPIVTVDGYVDPSGGDRFCLGQLSNVHRTEAIERARLHIGKGVQLECKGEGDVWVRCLSDHAVFVQSYYLDREAGRAPGDAVHKIYPSAYIKVFDLRQCHRQMQQQAATAQAAAAAQAAAVAGNIPGPGSVGGIAPAISLSAAAGIGVDDLRRLCILRMSFVKGWGPDYPRQSIKETPCWIEIHLHRALQLLDEVLHTMPIADPQPLD,552,NP_005350.1.csv,refseq-SMAD4-NM_005359.5_clinical_seed_0_final,refseq-SMAD4-NM_005359.5.a2m,Invitae,refseq-SMAD4-NM_005359.5.npy,1,552,552
+NP_005351.2,MASELAMSNSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSLSSTPMSTPCSSVPPSPSFSAPSPGSGSEQKAHLEDYYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAGGLHFDDRFSDEQLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLLQQVDHLKQEISRLVRERDAYKEKYEKLVSSGFRENGSSSDNPSSPEFFITEPTRKLEPSVGYATFWKPQHRVLTSVFTK,403,NP_005351.2.csv,refseq-MAF-NM_005360.4_clinical_seed_0_final,refseq-MAF-NM_005360.4.a2m,Invitae,refseq-MAF-NM_005360.4.npy,1,403,403
+NP_005364.1,MPSWALFMVTSCLLLAPQNLAQVSSQDVSLLASDSEPLKCFSRTFEDLTCFWDEEEAAPSGTYQLLYAYPREKPRACPLSSQSMPHFGTRYVCQFPDQEEVRLFFPLHLWVKNVFLNQTRTQRVLFVDSVGLPAPPSIIKAMGGSQPGELQISWEEPAPEISDFLRYELRYGPRDPKNSTGPTVIQLIATETCCPALQRPHSASALDQSPCAQPTMPWQDGPKQTSPSREASALTAEGGSCLISGLQPGNSYWLQLRSEPDGISLGGSWGSWSLPVTVDLPGDAVALGLQCFTLDLKNVTCQWQQQDHASSQGFFYHSRARCCPRDRYPIWENCEEEEKTNPGLQTPQFSRCHFKSRNDSIIHILVEVTTAPGTVHSYLGSPFWIHQAVRLPTPNLHWREISSGHLELEWQHPSSWAAQETCYQLRYTGEGHQDWKVLEPPLGARGGTLELRPRSRYRLQLRARLNGPTYQGPWSSWSDPTRVETATETAWISLVTALHLVLGLSAVLGLLLLRWQFPAHYRRLRHALWPSLPDLHRVLGQYLRDTAALSPPKATVSDTCEEVEPSLLEILPKSSERTPLPLCSSQAQMDYRRLQPSCLGTMPLSVCPPMAESGSCCTTHIANHSYLPLSYWQQP,635,NP_005364.1.csv,refseq-MPL-NM_005373.2_clinical_seed_0_final,refseq-MPL-NM_005373.2.a2m,Invitae,refseq-MPL-NM_005373.2.npy,1,635,635
+NP_005369.2,MPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENELWGSPAEEDAFGLGGLGGLTPNPVILQDCMWSGFSAREKLERAVSEKLQHGRGPPTAGSTAQSPGAGAASPAGRGHGGAAGAGRAGAALPAELAHPAAECVDPAVVFPFPVNKREPAPVPAAPASAPAAGPAVASGAGIAAPAGAPGVAPPRPGGRQTSGGDHKALSTSGEDTLSDSDDEDDEEEDEEEEIDVVTVEKRRSSSNTKAVTTFTITVRPKNAALGPGRAQSSELILKRCLPIHQQHNYAAPSPYVESEDAPPQKKIKSEASPRPLKSVIPPKAKSLSPRNSDSEDSERRRNHNILERQRRNDLRSSFLTLRDHVPELVKNEKAAKVVILKKATEYVHSLQAEEHQLLLEKEKLQARQQQLLKKIEHARTC,464,NP_005369.2.csv,refseq-MYCN-NM_005378.5_clinical_seed_0_final,refseq-MYCN-NM_005378.5.a2m,Invitae,refseq-MYCN-NM_005378.5_theta_0.2.npy,1,464,464
+NP_005391.1,MVVFNGLLKIKICEAVSLKPTAWSLRHAVGPRPQTFLLDPYIALNVDDSRIGQTATKQKTNSPAWHDEFVTDVCNGRKIELAVFHDAPIGYDDFVANCTIQFEELLQNGSRHFEDWIDLEPEGRVYVIIDLSGSSGEAPKDNEERVFRERMRPRKRQGAVRRRVHQVNGHKFMATYLRQPTYCSHCRDFIWGVIGKQGYQCQVCTCVVHKRCHELIITKCAGLKKQETPDQVGSQRFSVNMPHKFGIHNYKVPTFCDHCGSLLWGLLRQGLQCKVCKMNVHRRCETNVAPNCGVDARGIAKVLADLGVTPDKITNSGQRRKKLIAGAESPQPASGSSPSEEDRSKSAPTSPCDQEIKELENNIRKALSFDNRGEEHRAASSPDGQLMSPGENGEVRQGQAKRLGLDEFNFIKVLGKGSFGKVMLAELKGKDEVYAVKVLKKDVILQDDDVDCTMTEKRILALARKHPYLTQLYCCFQTKDRLFFVMEYVNGGDLMFQIQRSRKFDEPRSRFYAAEVTSALMFLHQHGVIYRDLKLDNILLDAEGHCKLADFGMCKEGILNGVTTTTFCGTPDYIAPEILQELEYGPSVDWWALGVLMYEMMAGQPPFEADNEDDLFESILHDDVLYPVWLSKEAVSILKAFMTKNPHKRLGCVASQNGEDAIKQHPFFKEIDWVLLEQKKIKPPFKPRIKTKRDVNNFDQDFTREEPVLTLVDEAIVKQINQEEFKGFSYFGEDLMP,737,NP_005391.1.csv,refseq-PRKCE-NM_005400.2_clinical_seed_0_final,refseq-PRKCE-NM_005400.2.a2m,Invitae,refseq-PRKCE-NM_005400.2.npy,1,737,737
+NP_005393.2,MAANKPKGQNSLALHKVIMVGSGGVGKSALTLQFMYDEFVEDYEPTKADSYRKKVVLDGEEVQIDILDTAGQEDYAAIRDNYFRSGEGFLCVFSITEMESFAATADFREQILRVKEDENVPFLLVGNKSDLEDKRQVSVEEAKNRAEQWNVNYVETSAKTRANVDKVFFDLMREIRARKMEDSKEKNGKKKRKSLAKRIRERCCIL,206,NP_005393.2.csv,refseq-RALA-NM_005402.3_clinical_seed_0_final,refseq-RALA-NM_005402.3.a2m,Invitae,refseq-RALA-NM_005402.3.npy,1,206,206
+NP_005403.2,MLYFSLFWAARPLQRCGQLVRMAIRAQHSNAAQTQTGEANRGWTGQESLSDSDPEMWELLQREKDRQCRGLELIASENFCSRAALEALGSCLNNKYSEGYPGKRYYGGAEVVDEIELLCQRRALEAFDLDPAQWGVNVQPYSGSPANLAVYTALLQPHDRIMGLDLPDGGHLTHGYMSDVKRISATSIFFESMPYKLNPKTGLIDYNQLALTARLFRPRLIIAGTSAYARLIDYARMREVCDEVKAHLLADMAHISGLVAAKVIPSPFKHADIVTTTTHKTLRGARSGLIFYRKGVKAVDPKTGREIPYTFEDRINFAVFPSLQGGPHNHAIAAVAVALKQACTPMFREYSLQVLKNARAMADALLERGYSLVSGGTDNHLVLVDLRPKGLDGARAERVLELVSITANKNTCPGDRSAITPGGLRLGAPALTSRQFREDDFRRVVDFIDEGVNIGLEVKSKTAKLQDFKSFLLKDSETSQRLANLRQRVEQFARAFPMPGFDEH,504,NP_005403.2.csv,refseq-SHMT2-NM_005412.5_clinical_seed_0_final,refseq-SHMT2-NM_005412.5.a2m,Invitae,refseq-SHMT2-NM_005412.5.npy,1,504,504
+NP_005404.1,MVFRSPLDLYSSHFLLPNFADSHHRSILLASSGGGNGAGGGGGAGGGSGGGNGAGGGGAGGAGGGGGGGSRAPPEELSMFQLPTLNFSPEQVASVCETLEETGDIERLGRFLWSLPVAPGACEAINKHESILRARAVVAFHTGNFRDLYHILENHKFTKESHGKLQAMWLEAHYQEAEKLRGRPLGPVDKYRVRKKFPLPRTIWDGEQKTHCFKERTRSLLREWYLQDPYPNPSKKRELAQATGLTPTQVGNWFKNRRQRDRAAAAKNRLQHQAIGPSGMRSLAEPGCPTHGSAESPSTAASPTTSVSSLTERADTGTSILSVTSSDSECDV,332,NP_005404.1.csv,refseq-SIX3-NM_005413.3_clinical_seed_0_final,refseq-SIX3-NM_005413.3.a2m,Invitae,refseq-SIX3-NM_005413.3.npy,1,332,332
+NP_005410.1,MAQWEMLQNLDSPFQDQLHQLYSHSLLPVDIRQYLAVWIEDQNWQEAALGSDDSKATMLFFHFLDQLNYECGRCSQDPESLLLQHNLRKFCRDIQPFSQDPTQLAEMIFNLLLEEKRILIQAQRAQLEQGEPVLETPVESQQHEIESRILDLRAMMEKLVKSISQLKDQQDVFCFRYKIQAKGKTPSLDPHQTKEQKILQETLNELDKRRKEVLDASKALLGRLTTLIELLLPKLEEWKAQQQKACIRAPIDHGLEQLETWFTAGAKLLFHLRQLLKELKGLSCLVSYQDDPLTKGVDLRNAQVTELLQRLLHRAFVVETQPCMPQTPHRPLILKTGSKFTVRTRLLVRLQEGNESLTVEVSIDRNPPQLQGFRKFNILTSNQKTLTPEKGQSQGLIWDFGYLTLVEQRSGGSGKGSNKGPLGVTEELHIISFTVKYTYQGLKQELKTDTLPVVIISNMNQLSIAWASVLWFNLLSPNLQNQQFFSNPPKAPWSLLGPALSWQFSSYVGRGLNSDQLSMLRNKLFGQNCRTEDPLLSWADFTKRESPPGKLPFWTWLDKILELVHDHLKDLWNDGRIMGFVSRSQERRLLKKTMSGTFLLRFSESSEGGITCSWVEHQDDDKVLIYSVQPYTKEVLQSLPLTEIIRHYQLLTEENIPENPLRFLYPRIPRDEAFGCYYQEKVNLQERRKYLKHRLIVVSNRQVDELQQPLELKPEPELESLELELGLVPEPELSLDLEPLLKAGLDLGPELESVLESTLEPVIEPTLCMVSQTVPEPDQGPVSQPVPEPDLPCDLRHLNTEPMEIFRNCVKIEEIMPNGDPLLAGQNTVDEVYVSRPSHFYTDGPLMPSDF,851,NP_005410.1.csv,refseq-STAT2-NM_005419.3_clinical_seed_0_final,refseq-STAT2-NM_005419.3.a2m,Invitae,refseq-STAT2-NM_005419.3.npy,1,851,851
+NP_005413.2,MNYSSFLRIWVSFIFALVQHQAQPRELMYPFWQNDTKTPKVDDGSSSEIKLAIPVFFFGVPYRTVYVNNNGVVSFNVLVSQFTPESFPLTDGRAFVAPFWADVHNGIRGEIYYRETMEPAILKRATKDIRKYFKDMATFSATWVFIVTWEEVTFYGGSSTTPVNTFQAVLVSDGSYTFTLFNYYEINWTTGTASGGDPLTGLGGVMAQAGFNGGNLTNFFSLPGSRTPEIVNIQETTNVNVPGRWAFKVDGKEIDPANGCTSRGQFLRRGEVFWDDLNCTVKCRCLDFNNEIYCQEASCSPYEVCEPKGKFFYCSAVETSTCVVFGEPHYHTFDGFLFHFQGSCAYLLARQCLQTSSLPFFSVEAKNEHRRGSAVSWVKELSVEVNGYKILIPKGSYGRVKVNDLVTSLPVTLDLGTVKIYQSGISTAVETDFGLLVTFDGQHYASISVPGSYINSTCGLCGNYNKNPLDDFLRPDGRPAMSVLDLGESWRVYHADWKCDSGCVDNCTQCDAATEALYFGSDYCGFLNKTDGPLWECGTVVDPTAFVHSCVYDLCSVRDNGTLLCQAIQAYALVCQALGIPIGDWRTQTGCVSTVQCPSFSHYSVCTSSCPDTCSDLTASRNCATPCTEGCECNQGFVLSTSQCVPLHKCGCDFDGHYYTMGEFFWATANCTVQCLCEEGGDVYCFNKTCGSGEVCAVEDGYQGCFPKRETVCLLSQNQVLHTFDGASYAFPSEFSYTLLKTCPERPEYLEIDINKKKPDAGPAWLRGLRILVADQEVKIGGIGASEVKLNGQEVELPFFHPSGKLEIYRNKNSTTVESKGVVTVQYSDIGLLYIRLSTTYFNCTGGLCGFYNANASDEFCLPNGKCTDNLAVFLESWTTFEEICNGECGDLLKACNNDSELLKFYRSRSRCGIINDPSNSSFLECHGVVNVTAYYRTCLFRLCQSGGNESELCDSVARYASACKNADVEVGPWRTYDFCPLECPENSHFEECITCTETCETLTLGPICVDSCSEGCQCDEGYALLGSQCVTRSECGCNFEGHQLATNETFWVDLDCQIFCYCSGTDNRVHCETIPCKDDEYCMEEGGLYYCQARTDASCIVSGYGHYLTFDGFPFDFQTSCPLILCTTGSRPSSDSFPKFVVTAKNEDRDPSLALWVKQVDVTVFGYSIVIHRAYKHTVLVNSERLYLPLKLGQGKINIFSFGFHVVVETDFGLKVVYDWKTFLSITVPRSMQNSTYGLCGRYNGNPDDDLEMPMGLLASSVNEFGQSWVKRDTFCQVGCGDRCPSCAKVEGFSKVQQLCSLIPNQNAAFSKCHSKVNPTFFYKNCLFDSCIDGGAVQTACSWLQNYASTCQTQGITVTGWRNYTSCTVTCPPNSHYESCVSVCQPRCAAIRLKSDCSHYCVEGCHCDAGYVLNGKSCILPHSCGCYSDGKYYEPKQLFWNSDCTRRCRCFRRNVIQCDPRQCKSDEECALRNGVRGCFSTKTSYCLAAGGGVFRTFDGAFLRFPANCAFVLSTICQKLPDISFQLIINFDKWSAPNLTIISPVYFYINEEQILINDRNTVKVNGTQVNVPFITGLATKIYSSEGFLVIDTSPDIQIYYNGFNVIKISISERLQNKVCGLCGNFNGDLTDDYVTLRGKPVVSSVVLAQSWKTNGMQKRPLAPSCNELQFSQYAAMCDNVHIQKMQGDGYCLKLTDMKGFFQPCYGLLDPLPFYESCYLDGCYSHKKFQLCGSLAAYGEACRSFGILSTEWIEKENCSGVVEDPCVGADCPNRTCELGNGRELCGCIEPPPYGNNSHDIIDAEVTCKAAQMEVSISKCKLFQLGFEREGVRINDRQCTGIEGEDFISFQINNTKGNCGNIVQSNGTHIMYKNTLWIESANNTGNIITRDRTINVEFSCAYELDIKISLDSVVKPMLSVINLTVPTQEGSFITKMALYKNASYKHPYRQGEVVLTTRDVLYVGVFVVGADATHLILTLNKCYATPTRDSNDKLRYFIIEGGCQNLKDNTIGIEENAVSLTCRFHVTVFKFIGDYDEVHLHCAVSLCDSEKYSCKITCPHNSRIATDYTKEPKEQIISVGPIRRKRLDWCEDNGGCEQICTSRVDGPLCSCVTGTLQEDGKSCRASNSSMELQVWTLLLIMIQISLWHFVYKSGTTS,2155,NP_005413.2.csv,refseq-TECTA-NM_005422.2_clinical_seed_0_final,refseq-TECTA-NM_005422.2.a2m,Invitae,refseq-TECTA-NM_005422.2.npy,1,2155,2155
+NP_005421.1,MGLWALLPGWVSATLLLALAALPAALAANSSGRWWGIVNVASSTNLLTDSKSLQLVLEPSLQLLSRKQRRLIRQNPGILHSVSGGLQSAVRECKWQFRNRRWNCPTAPGPHLFGKIVNRGCRETAFIFAITSAGVTHSVARSCSEGSIESCTCDYRRRGPGGPDWHWGGCSDNIDFGRLFGREFVDSGEKGRDLRFLMNLHNNEAGRTTVFSEMRQECKCHGMSGSCTVRTCWMRLPTLRAVGDVLRDRFDGASRVLYGNRGSNRASRAELLRLEPEDPAHKPPSPHDLVYFEKSPNFCTYSGRLGTAGTAGRACNSSSPALDGCELLCCGRGHRTRTQRVTERCNCTFHWCCHVSCRNCTHTRVLHECL,370,NP_005421.1.csv,refseq-WNT1-NM_005430.3_clinical_seed_0_final,refseq-WNT1-NM_005430.3.a2m,Invitae,refseq-WNT1-NM_005430.3.npy,1,370,370
+NP_005436.1,MYIKQVIIQGFRSYRDQTIVDPFSSKHNVIVGRNGSGKSNFFYAIQFVLSDEFSHLRPEQRLALLHEGTGPRVISAFVEIIFDNSDNRLPIDKEEVSLRRVIGAKKDQYFLDKKMVTKNDVMNLLESAGFSRSNPYYIVKQGKINQMATAPDSQRLKLLREVAGTRVYDERKEESISLMKETEGKREKINELLKYIEERLHTLEEEKEELAQYQKWDKMRRALEYTIYNQELNETRAKLDELSAKRETSGEKSRQLRDAQQDARDKMEDIERQVRELKTKISAMKEEKEQLSAERQEQIKQRTKLELKAKDLQDELAGNSEQRKRLLKERQKLLEKIEEKQKELAETEPKFNSVKEKEERGIARLAQATQERTDLYAKQGRGSQFTSKEERDKWIKKELKSLDQAINDKKRQIAAIHKDLEDTEANKEKNLEQYNKLDQDLNEVKARVEELDRKYYEVKNKKDELQSERNYLWREENAEQQALAAKREDLEKKQQLLRAATGKAILNGIDSINKVLDHFRRKGINQHVQNGYHGIVMNNFECEPAFYTCVEVTAGNRLFYHIVDSDEVSTKILMEFNKMNLPGEVTFLPLNKLDVRDTAYPETNDAIPMISKLRYNPRFDKAFKHVFGKTLICRSMEVSTQLARAFTMDCITLEGDQVSHRGALTGGYYDTRKSRLELQKDVRKAEEELGELEAKLNENLRRNIERINNEIDQLMNQMQQIETQQRKFKASRDSILSEMKMLKEKRQQSEKTFMPKQRSLQSLEASLHAMESTRESLKAELGTDLLSQLSLEDQKRVDALNDEIRQLQQENRQLLNERIKLEGIITRVETYLNENLRKRLDQVEQELNELRETEGGTVLTATTSELEAINKRVKDTMARSEDLDNSIDKTEAGIKELQKSMERWKNMEKEHMDAINHDTKELEKMTNRQGMLLKKKEECMKKIRELGSLPQEAFEKYQTLSLKQLFRKLEQCNTELKKYSHVNKKALDQFVNFSEQKEKLIKRQEELDRGYKSIMELMNVLELRKYEAIQLTFKQVSKNFSEVFQKLVPGGKATLVMKKGDVEGSQSQDEGEGSGESERGSGSQSSVPSVDQFTGVGIRVSFTGKQGEMREMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDQALDAQHRKAVSDMIMELAVHAQFITTTFRPELLESADKFYGVKFRNKVSHIDVITAEMAKDFVEDDTTHG,1217,NP_005436.1.csv,refseq-SMC3-NM_005445.3_clinical_seed_0_final,refseq-SMC3-NM_005445.3.a2m,Invitae,refseq-SMC3-NM_005445.3.npy,1,1217,1217
+NP_005439.2,MVLLSILRILFLCELVLFMEHRAQMAEGGQSSIALLAEAPTLPLIEELLEESPGEQPRKPRLLGHSLRYMLELYRRSADSHGHPRENRTIGATMVRLVKPLTNVARPHRGTWHIQILGFPLRPNRGLYQLVRATVVYRHHLQLTRFNLSCHVEPWVQKNPTNHFPSSEGDSSKPSLMSNAWKEMDITQLVQQRFWNNKGHRILRLRFMCQQQKDSGGLELWHGTSSLDIAFLLLYFNDTHKSIRKAKFLPRGMEEFMERESLLRRTRQADGISAEVTASSSKHSGPENNQCSLHPFQISFRQLGWDHWIIAPPFYTPNYCKGTCLRVLRDGLNSPNHAIIQNLINQLVDQSVPRPSCVPYKYVPISVLMIEANGSILYKEYEGMIAESCTCR,392,NP_005439.2.csv,refseq-BMP15-NM_005448.2_clinical_seed_0_final,refseq-BMP15-NM_005448.2.a2m,Invitae,refseq-BMP15-NM_005448.2.npy,1,392,392
+NP_005441.1,MERCPSLGVTLYALVVVLGLRATPAGGQHYLHIRPAPSDNLPLVDLIEHPDPIFDPKEKDLNETLLRSLLGGHYDPGFMATSPPEDRPGGGGGAAGGAEDLAELDQLLRQRPSGAMPSEIKGLEFSEGLAQGKKQRLSKKLRRKLQMWLWSQTFCPVLYAWNDLGSRFWPRYVKVGSCFSKRSCSVPEGMVCKPSKSVHLTVLRWRCQRRGGQRCGWIPIQYPIISECKCSC,232,NP_005441.1.csv,refseq-NOG-NM_005450.4_clinical_seed_0_final,refseq-NOG-NM_005450.4.a2m,Invitae,refseq-NOG-NM_005450.4.npy,1,232,232
+NP_005449.5,MASPRSSGQPGPPPPPPPPPARLLLLLLLPLLLPLAPGAWGWARGAPRPPPSSPPLSIMGLMPLTKEVAKGSIGRGVLPAVELAIEQIRNESLLRPYFLDLRLYDTECDNAKGLKAFYDAIKYGPNHLMVFGGVCPSVTSIIAESLQGWNLVQLSFAATTPVLADKKKYPYFFRTVPSDNAVNPAILKLLKHYQWKRVGTLTQDVQRFSEVRNDLTGVLYGEDIEISDTESFSNDPCTSVKKLKGNDVRIILGQFDQNMAAKVFCCAYEENMYGSKYQWIIPGWYEPSWWEQVHTEANSSRCLRKNLLAAMEGYIGVDFEPLSSKQIKTISGKTPQQYEREYNNKRSGVGPSKFHGYAYDGIWVIAKTLQRAMETLHASSRHQRIQDFNYTDHTLGRIILNAMNETNFFGVTGQVVFRNGERMGTIKFTQFQDSREVKVGEYNAVADTLEIINDTIRFQGSEPPKDKTIILEQLRKISLPLYSILSALTILGMIMASAFLFFNIKNRNQKLIKMSSPYMNNLIILGGMLSYASIFLFGLDGSFVSEKTFETLCTVRTWILTVGYTTAFGAMFAKTWRVHAIFKNVKMKKKIIKDQKLLVIVGGMLLIDLCILICWQAVDPLRRTVEKYSMEPDPAGRDISIRPLLEHCENTHMTIWLGIVYAYKGLLMLFGCFLAWETRNVSIPALNDSKYIGMSVYNVGIMCIIGAAVSFLTRDQPNVQFCIVALVIIFCSTITLCLVFVPKLITLRTNPDAATQNRRFQFTQNQKKEDSKTSTSVTSVNQASTSRLEGLQSENHRLRMKITELDKDLEEVTMQLQDTPEKTTYIKQNHYQELNDILNLGNFTESTDGGKAILKNHLDQNPQLQWNTTEPSRTCKDPIEDINSPEHIQRRLSLQLPILHHAYLPSIGGVDASCVSPCVSPTASPRHRHVPPSFRVMVSGL,941,NP_005449.5.csv,refseq-GABBR2-NM_005458.7_clinical_seed_0_final,refseq-GABBR2-NM_005458.7.a2m,Invitae,refseq-GABBR2-NM_005458.7.npy,1,941,941
+NP_005452.2,MAAELSMGPELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTPLSTPCSSVPSSPSFSPTEQKTHLEDLYWMASNYQQMNPEALNLTPEDAVEALIGSHPVPQPLQSFDSFRGAHHHHHHHHPHPHHAYPGAGVAHDELGPHAHPHHHHHHQASPPPSSAASPAQQLPTSHPGPGPHATASATAAGGNGSVEDRFSDDQLVSMSVRELNRHLRGFTKDEVIRLKQKRRTLKNRGYAQSCRYKRVQQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLANSGFREAGSTSDSPSSPEFFL,323,NP_005452.2.csv,refseq-MAFB-NM_005461.4_clinical_seed_0_final,refseq-MAFB-NM_005461.4.a2m,Invitae,refseq-MAFB-NM_005461.4.npy,1,323,323
+NP_005466.1,MNGPALQPSSPSSAPSASPAAAPRGWSEFCELHAVAAARELARQYWLFAREHPQHAPLRAELVSLQFTDLFQRYFCREVRDGRAPGRDYRDTGRGPPAKAEASPEPGPGPAAPGLPKARSSEELAPPRPPGPCSFQHFRRSLRHIFRRRSAGELPAAHTAAAPGTPGEAAETPARPGLAKKFLPWSLAREPPPEALKEAVLRYSLADEASMDSGARWQRGRLALRRAPGPDGPDRVLELFDPPKSSRPKLQAACSSIQEVRWCTRLEMPDNLYTFVLKVKDRTDIIFEVGDEQQLNSWMAELSECTGRGLESTEAEMHIPSALEPSTSSSPRGSTDSLNQGASPGGLLDPACQKTDHFLSCYPWFHGPISRVKAAQLVQLQGPDAHGVFLVRQSETRRGEYVLTFNFQGIAKHLRLSLTERGQCRVQHLHFPSVVDMLHHFQRSPIPLECGAACDVRLSSYVVVVSQPPGSCNTVLFPFSLPHWDSESLPHWGSELGLPHLSSSGCPRGLSPEGLPGRSSPPEQIFHLVPSPEELANSLQHLEHEPVNRARDSDYEMDSSSRSHLRAIDNQYTPL,575,NP_005466.1.csv,refseq-SH2B3-NM_005475.2_clinical_seed_0_final,refseq-SH2B3-NM_005475.2.a2m,Invitae,refseq-SH2B3-NM_005475.2.npy,1,575,575
+NP_005468.1,MDKLPPSMRKRLYSLPQQVGAKAWIMDEEEDAEEEGAGGRQDPSRRSIRLRPLPSPSPSAAAGGTESRSSALGAADSEGPARGAGKSSTNGDCRRFRGSLASLGSRGGGSGGTGSGSSHGHLHDSAEERRLIAEGDASPGEDRTPPGLAAEPERPGASAQPAASPPPPQQPPQPASASCEQPSVDTAIKVEGGAAAGDQILPEAEVRLGQAGFMQRQFGAMLQPGVNKFSLRMFGSQKAVEREQERVKSAGFWIIHPYSDFRFYWDLTMLLLMVGNLIIIPVGITFFKDENTTPWIVFNVVSDTFFLIDLVLNFRTGIVVEDNTEIILDPQRIKMKYLKSWFMVDFISSIPVDYIFLIVETRIDSEVYKTARALRIVRFTKILSLLRLLRLSRLIRYIHQWEEIFHMTYDLASAVVRIVNLIGMMLLLCHWDGCLQFLVPMLQDFPDDCWVSINNMVNNSWGKQYSYALFKAMSHMLCIGYGRQAPVGMSDVWLTMLSMIVGATCYAMFIGHATALIQSLDSSRRQYQEKYKQVEQYMSFHKLPPDTRQRIHDYYEHRYQGKMFDEESILGELSEPLREEIINFNCRKLVASMPLFANADPNFVTSMLTKLRFEVFQPGDYIIREGTIGKKMYFIQHGVVSVLTKGNKETKLADGSYFGEICLLTRGRRTASVRADTYCRLYSLSVDNFNEVLEEYPMMRRAFETVALDRLDRIGKKNSILLHKVQHDLNSGVFNYQENEIIQQIVQHDREMAHCAHRVQAAASATPTPTPVIWTPLIQAPLQAAAATTSVAIALTHHPRLPAAIFRPPPGSGLGNLGAGQTPRHLKRLQSLIPSALGSASPASSPSQVDTPSSSSFHIQQLAGFSAPAGLSPLLPSSSSSPPPGACGSPSAPTPSAGVAATTIAGFGHFHKALGGSLSSSDSPLLTPLQPGARSPQAAQPSPAPPGARGGLGLPEHFLPPPPSSRSPSSSPGQLGQPPGELSLGLATGPLSTPETPPRQPEPPSLVAGASGGASPVGFTPRGGLSPPGHSPGPPRTFPSAPPRASGSHGSLLLPPASSPPPPQVPQRRGTPPLTPGRLTQDLKLISASQPALPQDGAQTLRRASPHSSGESMAAFPLFPRAGGGSGGSGSSGGLGPPGRPYGAIPGQHVTLPRKTSSGSLPPPLSLFGARATSSGGPPLTAGPQREPGARPEPVRSKLPSNL,1203,NP_005468.1.csv,refseq-HCN4-NM_005477.2_clinical_seed_0_final,refseq-HCN4-NM_005477.2.a2m,Invitae,refseq-HCN4-NM_005477.2.npy,1,1203,1203
+NP_005493.2,MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRLSYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGVVGNFNKSIVARLFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELSPKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGSVYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDVVEQAIIRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLVVILKLGNLLPYSDPSVVFVFLSVFAVVTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFASLLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALPPSQEVNDAIKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTKNNVKVWFNNKGWHAISSFLNVINNAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFVVFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYVVPATLVIIIFICFQQKSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVATFVLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEGVVFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPVVFLDEPTTGMDPKARRFLWNCALSVVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIAGSNPDLKPVQDFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV,2261,NP_005493.2.csv,refseq-ABCA1-NM_005502.3_clinical_seed_0_final,refseq-ABCA1-NM_005502.3.a2m,Invitae,refseq-ABCA1-NM_005502.3.npy,1,2261,2261
+NP_005506.3,MEKSKNFRIDALLAVDPPRAASAQSAPLALVTSLAAAASGTGGGGGGGGASGGTSGSCSPASSEPPAAPADRLRAESPSPPRLLAAHCALLPKPGFLGAGGGGGGTGGGHGGPHHHAHPGAAAAAAAAAAAAAAGGLALGLHPGGAQGGAGLPAQAALYGHPVYGYSAAAAAAALAGQHPALSYSYPQVQGAHPAHPADPIKLGAGTFQLDQWLRASTAGMILPKMPDFNSQAQSNLLGKCRRPRTAFTSQQLLELEHQFKLNKYLSRPKRFEVATSLMLTETQVKIWFQNRRMKWKRSKKAKEQAAQEAEKQKGGGGGAGKGGAEEPGAEELLGPPAPGDKGSGRRLRDLRDSDPEEDEDEDDEDHFPYSNGASVHAASSDCSSEDDSPPPRPSHQPAPQ,401,NP_005506.3.csv,refseq-MNX1-NM_005515.3_clinical_seed_0_final,refseq-MNX1-NM_005515.3.a2m,Invitae,refseq-MNX1-NM_005515.3.npy,1,401,401
+NP_005509.1,MQRLLTPVKRILQLTRAVQETSLTPARLLPVAHQRFSTASAVPLAKTDTWPKDVGILALEVYFPAQYVDQTDLEKYNNVEAGKYTVGLGQTRMGFCSVQEDINSLCLTVVQRLMERIQLPWDSVGRLEVGTETIIDKSKAVKTVLMELFQDSGNTDIEGIDTTNACYGGTASLFNAANWMESSSWDGRYAMVVCGDIAVYPSGNARPTGGAGAVAMLIGPKAPLALERGLRGTHMENVYDFYKPNLASEYPIVDGKLSIQCYLRALDRCYTSYRKKIQNQWKQAGSDRPFTLDDLQYMIFHTPFCKMVQKSLARLMFNDFLSASSDTQTSLYKGLEAFGGLKLEDTYTNKDLDKALLKASQDMFDKKTKASLYLSTHNGNMYTSSLYGCLASLLSHHSAQELAGSRIGAFSYGSGLAASFFSFRVSQDAAPGSPLDKLVSSTSDLPKRLASRKCVSPEEFTEIMNQREQFYHKVNFSPPGDTNSLFPGTWYLERVDEQHRRKYARRPV,508,NP_005509.1.csv,refseq-HMGCS2-NM_005518.3_clinical_seed_0_final,refseq-HMGCS2-NM_005518.3.a2m,Invitae,refseq-HMGCS2-NM_005518.3.npy,1,508,508
+NP_005521.1,MAGPAWISKVSRLLGAFHNPKQVTRGFTGGVQTVTLIPGDGIGPEISAAVMKIFDAAKAPIQWEERNVTAIQGPGGKWMIPSEAKESMDKNKMGLKGPLKTPIAAGHPSMNLLLRKTFDLYANVRPCVSIEGYKTPYTDVNIVTIRENTEGEYSGIEHVIVDGVVQSIKLITEGASKRIAEFAFEYARNNHRSNVTAVHKANIMRMSDGLFLQKCREVAESCKDIKFNEMYLDTVCLNMVQDPSQFDVLVMPNLYGDILSDLCAGLIGGLGVTPSGNIGANGVAIFESVHGTAPDIAGKDMANPTALLLSAVMMLRHMGLFDHAARIEAACFATIKDGKSLTKDLGGNAKCSDFTEEICRRVKDLD,366,NP_005521.1.csv,refseq-IDH3A-NM_005530.2_clinical_seed_0_final,refseq-IDH3A-NM_005530.2.a2m,Invitae,refseq-IDH3A-NM_005530.2.npy,1,366,366
+NP_005526.1,MEPLVTWVVPLLFLFLLSRQGAACRTSECCFQDPPYPDADSGSASGPRDLRCYRISSDRYECSWQYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQAGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSVKYEPPLGDIKVSKLAGQLRMEWETPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQGSSWSKWSSPVCVPPENPPQPQVRFSVEQLGQDGRRRLTLKEQPTQLELPEGCQGLAPGTEVTYRLQLHMLSCPCKAKATRTLHLGKMPYLSGAAYNVAVISSNQFGPGLNQTWHIPADTHTEPVALNISVGTNGTTMYWPARAQSMTYCIEWQPVGQDGGLATCSLTAPQDPDPAGMATYSWSRESGAMGQEKCYYITIFASAHPEKLTLWSTVLSTYHFGGNASAAGTPHHVSVKNHSLDSVSVDWAPSLLSTCPGVLKEYVVRCRDEDSKQVSEHPVQPTETQVTLSGLRAGVAYTVQVRADTAWLRGVWSQPQRFSIEVQVSDWLIFFASLGSFLSILLVGVLGYLGLNRAARHLCPPLPTPCASSAIEFPGGKETWQWINPVDFQEEASLQEALVVEMSWDKGERTEPLEKTELPEGAPELALDTELSLEDGDRCKAKM,662,NP_005526.1.csv,refseq-IL12RB1-NM_005535.2_clinical_seed_0_final,refseq-IL12RB1-NM_005535.2.a2m,Invitae,refseq-IL12RB1-NM_005535.2.npy,1,662,662
+NP_005545.1,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSVSVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPTIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLETKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDSIVGERGRLDSELRGMQDLVEDFKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCANLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNISVVQSTVSSGYGGASGVGSGLGLGGGSSYSYGSGLGVGGGFSSSSGRAIGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,564,NP_005545.1.csv,refseq-KRT6A-NM_005554.3_clinical_seed_0_final,refseq-KRT6A-NM_005554.3.a2m,Invitae,refseq-KRT6A-NM_005554.3.npy,1,564,564
+NP_005546.2,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDNIVGERGRLDSELRNMQDLVEDLKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQITAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCANLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNISVVQSTVSSGYGGASGVGSGLGLGGGSSYSYGSGLGVGGGFSSSSGRATGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,564,NP_005546.2.csv,refseq-KRT6B-NM_005555.3_clinical_seed_0_final,refseq-KRT6B-NM_005555.3.a2m,Invitae,refseq-KRT6B-NM_005555.3.npy,1,564,564
+NP_005548.2,MTTCSRQFTSSSSMKGSCGIGGGIGGGSSRISSVLAGGSCRAPSTYGGGLSVSSRFSSGGACGLGGGYGGGFSSSSSFGSGFGGGYGGGLGAGFGGGLGAGFGGGFAGGDGLLVGSEKVTMQNLNDRLASYLDKVRALEEANADLEVKIRDWYQRQRPSEIKDYSPYFKTIEDLRNKIIAATIENAQPILQIDNARLAADDFRTKYEHELALRQTVEADVNGLRRVLDELTLARTDLEMQIEGLKEELAYLRKNHEEEMLALRGQTGGDVNVEMDAAPGVDLSRILNEMRDQYEQMAEKNRRDAETWFLSKTEELNKEVASNSELVQSSRSEVTELRRVLQGLEIELQSQLSMKASLENSLEETKGRYCMQLSQIQGLIGSVEEQLAQLRCEMEQQSQEYQILLDVKTRLEQEIATYRRLLEGEDAHLSSQQASGQSYSSREVFTSSSSSSSRQTRPILKEQSSSSFSQGQSS,473,NP_005548.2.csv,refseq-KRT16-NM_005557.3_clinical_seed_0_final,refseq-KRT16-NM_005557.3.a2m,Invitae,refseq-KRT16-NM_005557.3.npy,1,473,473
+NP_005553.2,MPALWLGCCLCFSLLLPAARATSRREVCDCNGKSRQCIFDRELHRQTGNGFRCLNCNDNTDGIHCEKCKNGFYRHRERDRCLPCNCNSKGSLSARCDNSGRCSCKPGVTGARCDRCLPGFHMLTDAGCTQDQRLLDSKCDCDPAGIAGPCDAGRCVCKPAVTGERCDRCRSGYYNLDGGNPEGCTQCFCYGHSASCRSSAEYSVHKITSTFHQDVDGWKAVQRNGSPAKLQWSQRHQDVFSSAQRLDPVYFVAPAKFLGNQQVSYGQSLSFDYRVDRGGRHPSAHDVILEGAGLRITAPLMPLGKTLPCGLTKTYTFRLNEHPSNNWSPQLSYFEYRRLLRNLTALRIRATYGEYSTGYIDNVTLISARPVSGAPAPWVEQCICPVGYKGQFCQDCASGYKRDSARLGPFGTCIPCNCQGGGACDPDTGDCYSGDENPDIECADCPIGFYNDPHDPRSCKPCPCHNGFSCSVMPETEEVVCNNCPPGVTGARCELCADGYFGDPFGEHGPVRPCQPCQCNNNVDPSASGNCDRLTGRCLKCIHNTAGIYCDQCKAGYFGDPLAPNPADKCRACNCNPMGSEPVGCRSDGTCVCKPGFGGPNCEHGAFSCPACYNQVKIQMDQFMQQLQRMEALISKAQGGDGVVPDTELEGRMQQAEQALQDILRDAQISEGASRSLGLQLAKVRSQENSYQSRLDDLKMTVERVRALGSQYQNRVRDTHRLITQMQLSLAESEASLGNTNIPASDHYVGPNGFKSLAQEATRLAESHVESASNMEQLTRETEDYSKQALSLVRKALHEGVGSGSGSPDGAVVQGLVEKLEKTKSLAQQLTREATQAEIEADRSYQHSLRLLDSVSRLQGVSDQSFQVEEAKRIKQKADSLSSLVTRHMDEFKRTQKNLGNWKEEAQQLLQNGKSGREKSDQLLSRANLAKSRAQEALSMGNATFYEVESILKNLREFDLQVDNRKAEAEEAMKRLSYISQKVSDASDKTQQAERALGSAAADAQRAKNGAGEALEISSEIEQEIGSLNLEANVTADGALAMEKGLASLKSEMREVEGELERKELEFDTNMDAVQMVITEAQKVDTRAKNAGVTIQDTLNTLDGLLHLMDQPLSVDEEGLVLLEQKLSRAKTQINSQLRPMMSELEERARQQRGHLHLLETSIDGILADVKNLENIRDNLPPGCYNTQALEQQ,1193,NP_005553.2.csv,refseq-LAMC2-NM_005562.2_clinical_seed_0_final,refseq-LAMC2-NM_005562.2.a2m,Invitae,refseq-LAMC2-NM_005562.2.npy,1,1193,1193
+NP_005557.1,MATLKDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDLADELALVDVIEDKLKGEMMDLQHGSLFLRTPKIVSGKDYNVTANSKLVIITAGARQQEGESRLNLVQRNVNIFKFIIPNVVKYSPNCKLLIVSNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGVHPLSCHGWVLGEHGDSSVPVWSGMNVAGVSLKTLHPDLGTDKDKEQWKEVHKQVVESAYEVIKLKGYTSWAIGLSVADLAESIMKNLRRVHPVSTMIKGLYGIKDDVFLSVPCILGQNGISDLVKVTLTSEEEARLKKSADTLWGIQKELQF,332,NP_005557.1.csv,refseq-LDHA-NM_005566.3_clinical_seed_0_final,refseq-LDHA-NM_005566.3.a2m,Invitae,refseq-LDHA-NM_005566.3.npy,1,332,332
+NP_005563.1,METPSQRRATRSGAQASSTPLSPTRITRLQEKEDLQELNDRLAVYIDRVRSLETENAGLRLRITESEEVVSREVSGIKAAYEAELGDARKTLDSVAKERARLQLELSKVREEFKELKARNTKKEGDLIAAQARLKDLEALLNSKEAALSTALSEKRTLEGELHDLRGQVAKLEAALGEAKKQLQDEMLRRVDAENRLQTMKEELDFQKNIYSEELRETKRRHETRLVEIDNGKQREFESRLADALQELRAQHEDQVEQYKKELEKTYSAKLDNARQSAERNSNLVGAAHEELQQSRIRIDSLSAQLSQLQKQLAAKEAKLRDLEDSLARERDTSRRLLAEKEREMAEMRARMQQQLDEYQELLDIKLALDMEIHAYRKLLEGEEERLRLSPSPTSQRSRGRASSHSSQTQGGGSVTKKRKLESTESRSSFSQHARTSGRVAVEEVDEEGKFVRLRNKSNEDQSMGNWQIKRQNGDDPLLTYRFPPKFTLKAGQVVTIWAAGAGATHSPPTDLVWKAQNTWGCGNSLRTALINSTGEEVAMRKLVRSVTVVEDDEDEDGDDLLHHHHVSGSRR,572,NP_005563.1.csv,refseq-LMNA-NM_005572.3_clinical_seed_0_final,refseq-LMNA-NM_005572.3.a2m,Invitae,refseq-LMNA-NM_005572.3.npy,1,572,572
+NP_005564.1,MATATPVPPRMGSRAGGPTTPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKCKAEHDQLLLNYAKKESDLNGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLAAAKKQLADETLLKVDLENRCQSLTEDLEFRKSMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERIQELEDLLAKEKDNSRRMLTDKEREMAEIRDQMQQQLNDYEQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVDVEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSVSYKYTSRYVLKAGQTVTIWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVAQRSTVFKTTIPEEEEEEEEAAGVVVEEELFHQQGTPRASNRSCAIM,586,NP_005564.1.csv,refseq-LMNB1-NM_005573.3_clinical_seed_0_final,refseq-LMNB1-NM_005573.3.a2m,Invitae,refseq-LMNB1-NM_005573.3.npy,1,586,586
+NP_005576.3,MFRSKRSGLVRRLWRSRVVPDREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGRSEVRPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLPESDCETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTYSLLKRLKERSLDTLLEAVESRGGVPGGCVLVPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVELKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSPRDEYKPLDLSDSTLSYTETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDLPQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPGGRALVVRKVPPGYSIKVFDFERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQFITSCPCWLEILLNNPR,496,NP_005576.3.csv,refseq-SMAD6-NM_005585.4_clinical_seed_0_final,refseq-SMAD6-NM_005585.4.a2m,Invitae,refseq-SMAD6-NM_005585.4.npy,1,496,496
+NP_005582.1,MSTADALDDENTFKILVATDIHLGFMEKDAVRGNDTFVTLDEILRLAQENEVDFILLGGDLFHENKPSRKTLHTCLELLRKYCMGDRPVQFEILSDQSVNFGFSKFPWVNYQDGNLNISIPVFSIHGNHDDPTGADALCALDILSCAGFVNHFGRSMSVEKIDISPVLLQKGSTKIALYGLGSIPDERLYRMFVNKKVTMLRPKEDENSWFNLFVIHQNRSKHGSTNFIPEQFLDDFIDLVIWGHEHECKIAPTKNEQQLFYISQPGSSVVTSLSPGEAVKKHVGLLRIKGRKMNMHKIPLHTVRQFFMEDIVLANHPDIFNPDNPKVTQAIQSFCLEKIEEMLENAERERLGNSHQPEKPLVRLRVDYSGGFEPFSVLRFSQKFVDRVANPKDIIHFFRHREQKEKTGEEINFGKLITKPSEGTTLRVEDLVKQYFQTAEKNVQLSLLTERGMGEAVQEFVDKEEKDAIEELVKYQLEKTQRFLKERHIDALEDKIDEEVRRFRETRQKNTNEEDDEVREAMTRARALRSQSEESASAFSADDLMSIDLAEQMANDSDDSISAATNKGRGRGRGRRGGRGQNSASRGGSQRGRADTGLETSTRSRNSKTAVSASRNMSIIDAFKSTRQQPSRNVTTKNYSEVIEVDESDVEEDIFPTTSKTDQRWSSTSSSKIMSQSQVSKGVDFESSEDDDDDPFMNTSSLRRNRR,708,NP_005582.1.csv,refseq-MRE11-NM_005591.3_clinical_seed_0_final,refseq-MRE11-NM_005591.3.a2m,Invitae,refseq-MRE11-NM_005591.3.npy,1,708,708
+NP_005583.1,MRELVNIPLVHILTLVAFSGTEKLPKAPVITTPLETVDALVEEVATFMCAVESYPQPEISWTRNKILIKLFDTRYSIRENGQLLTILSVEDSDDGIYCCTANNGVGGAVESCGALQVKMKPKITRPPINVKIIEGLKAVLPCTTMGNPKPSVSWIKGDSPLRENSRIAVLESGSLRIHNVQKEDAGQYRCVAKNSLGTAYSKVVKLEVEVFARILRAPESHNVTFGSFVTLHCTATGIPVPTITWIENGNAVSSGSIQESVKDRVIDSRLQLFITKPGLYTCIATNKHGEKFSTAKAAATISIAEWSKPQKDNKGYCAQYRGEVCNAVLAKDALVFLNTSYADPEEAQELLVHTAWNELKVVSPVCRPAAEALLCNHIFQECSPGVVPTPIPICREYCLAVKELFCAKEWLVMEEKTHRGLYRSEMHLLSVPECSKLPSMHWDPTACARLPHLDYNKENLKTFPPMTSSKPSVDIPNLPSSSSSSFSVSPTYSMTVIISIMSSFAIFVLLTITTLYCCRRRKQWKNKKRESAAVTLTTLPSELLLDRLHPNPMYQRMPLLLNPKLLSLEYPRNNIEYVRDIGEGAFGRVFQARAPGLLPYEPFTMVAVKMLKEEASADMQADFQREAALMAEFDNPNIVKLLGVCAVGKPMCLLFEYMAYGDLNEFLRSMSPHTVCSLSHSDLSMRAQVSSPGPPPLSCAEQLCIARQVAAGMAYLSERKFVHRDLATRNCLVGENMVVKIADFGLSRNIYSADYYKANENDAIPIRWMPPESIFYNRYTTESDVWAYGVVLWEIFSYGLQPYYGMAHEEVIYYVRDGNILSCPENCPVELYNLMRLCWSKLPADRPSFTSIHRILERMCERAEGTVSV,869,NP_005583.1.csv,refseq-MUSK-NM_005592.3_clinical_seed_0_final,refseq-MUSK-NM_005592.3.a2m,Invitae,refseq-MUSK-NM_005592.3.npy,1,869,869
+NP_005586.1,MYSPLCLTQDEFHPFIEALLPHVRAFAYTWFNLQARKRKYFKKHEKRMSKEEERAVKDELLSEKPEVKQKWASRLLAKLRKDIRPEYREDFVLTVTGKKPPCCVLSNPDQKGKMRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLVKSPQCSNPGLCVQPHHIGVSVKELDLYLAYFVHAADSSQSESPSQPSDADIKDQPENGHLGFQDSFVTSGVFSVTELVRVSQTPIAAGTGPNFSLSDLESSSYYSMSPGAMRRSLPSTSSTSSTKRLKSVEDEMDSPGEEPFYTGQGRSPGSGSQSSGWHEVEPGMPSPTTLKKSEKSGFSSPSPSQTSSLGTAFTQHHRPVITGPRASPHATPSTLHFPTSPIIQQPGPYFSHPAIRYHPQETLKEFVQLVCPDAGQQAGQVGFLNPNGSSQGKVHNPFLPTPMLPPPPPPPMARPVPLPVPDTKPPTTSTEGGAASPTSPILVPGIKVAASHHPPDRPPDPFSTL,498,NP_005586.1.csv,refseq-NFIA-NM_005595.4_clinical_seed_0_final,refseq-NFIA-NM_005595.4.a2m,Invitae,refseq-NFIA-NM_005595.4.npy,1,498,498
+NP_005594.2,MSTERDSETTFDEDSQPNDEVVPYSDDETEDELDDQGSAVEPEQNRVNREAEENREPFRKECTWQVKANDRKYHEQPHFMNTKFLCIKESKYANNAIKTYKYNAFTFIPMNLFEQFKRAANLYFLALLILQAVPQISTLAWYTTLVPLLVVLGVTAIKDLVDDVARHKMDKEINNRTCEVIKDGRFKVAKWKEIQVGDVIRLKKNDFVPADILLLSSSEPNSLCYVETAELDGETNLKFKMSLEITDQYLQREDTLATFDGFIECEEPNNRLDKFTGTLFWRNTSFPLDADKILLRGCVIRNTDFCHGLVIFAGADTKIMKNSGKTRFKRTKIDYLMNYMVYTIFVVLILLSAGLAIGHAYWEAQVGNSSWYLYDGEDDTPSYRGFLIFWGYIIVLNTMVPISLYVSVEVIRLGQSHFINWDLQMYYAEKDTPAKARTTTLNEQLGQIHYIFSDKTGTLTQNIMTFKKCCINGQIYGDHRDASQHNHNKIEQVDFSWNTYADGKLAFYDHYLIEQIQSGKEPEVRQFFFLLAVCHTVMVDRTDGQLNYQAASPDEGALVNAARNFGFAFLARTQNTITISELGTERTYNVLAILDFNSDRKRMSIIVRTPEGNIKLYCKGADTVIYERLHRMNPTKQETQDALDIFANETLRTLCLCYKEIEEKEFTEWNKKFMAASVASTNRDEALDKVYEEIEKDLILLGATAIEDKLQDGVPETISKLAKADIKIWVLTGDKKETAENIGFACELLTEDTTICYGEDINSLLHARMENQRNRGGVYAKFAPPVQESFFPPGGNRALIITGSWLNEILLEKKTKRNKILKLKFPRTEEERRMRTQSKRRLEAKKEQRQKNFVDLACECSAVICCRVTPKQKAMVVDLVKRYKKAITLAIGDGANDVNMIKTAHIGVGISGQEGMQAVMSSDYSFAQFRYLQRLLLVHGRWSYIRMCKFLRYFFYKNFAFTLVHFWYSFFNGYSAQTAYEDWFITLYNVLYTSLPVLLMGLLDQDVSDKLSLRFPGLYIVGQRDLLFNYKRFFVSLLHGVLTSMILFFIPLGAYLQTVGQDGEAPSDYQSFAVTIASALVITVNFQIGLDTSYWTFVNAFSIFGSIALYFGIMFDFHSAGIHVLFPSAFQFTGTASNALRQPYIWLTIILAVAVCLLPVVAIRFLSMTIWPSESDKIQKHRKRLKAEEQWQRRQQVFRRGVSTRRSAYAFSHQRGYADLISSGRSIRKKRSPLDAIVADGTAEYRRTGDS,1251,NP_005594.2.csv,refseq-ATP8B1-NM_005603.6_clinical_seed_0_final,refseq-ATP8B1-NM_005603.6.a2m,Invitae,refseq-ATP8B1-NM_005603.6.npy,1,1251,1251
+NP_005600.1,MSRPLSDQEKRKQISVRGLAGVENVTELKKNFNRHLHFTLVKDRNVATPRDYYFALAHTVRDHLVGRWIRTQQHYYEKDPKRIYYLSLEFYMGRTLQNTMVNLALENACDEATYQLGLDMEELEEIEEDAGLGNGGLGRLAACFLDSMATLGLAAYGYGIRYEFGIFNQKISGGWQMEEADDWLRYGNPWEKARPEFTLPVHFYGHVEHTSQGAKWVDTQVVLAMPYDTPVPGYRNNVVNTMRLWSAKAPNDFNLKDFNVGGYIQAVLDRNLAENISRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKSSKFGCRDPVRTNFDAFPDKVAIQLNDTHPSLAIPELMRILVDLERMDWDKAWDVTVRTCAYTNHTVLPEALERWPVHLLETLLPRHLQIIYEINQRFLNRVAAAFPGDVDRLRRMSLVEEGAVKRINMAHLCIAGSHAVNGVARIHSEILKKTIFKDFYELEPHKFQNKTNGITPRRWLVLCNPGLAEVIAERIGEDFISDLDQLRKLLSFVDDEAFIRDVAKVKQENKLKFAAYLEREYKVHINPNSLFDIQVKRIHEYKRQLLNCLHVITLYNRIKREPNKFFVPRTVMIGGKAAPGYHMAKMIIRLVTAIGDVVNHDPAVGDRLRVIFLENYRVSLAEKVIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTMDGANVEMAEEAGEENFFIFGMRVEDVDKLDQRGYNAQEYYDRIPELRQVIEQLSSGFFSPKQPDLFKDIVNMLMHHDRFKVFADYEDYIKCQEKVSALYKNPREWTRMVIRNIATSGKFSSDRTIAQYAREIWGVEPSRQRLPAPDEAI,842,NP_005600.1.csv,refseq-PYGM-NM_005609.3_clinical_seed_0_final,refseq-PYGM-NM_005609.3.a2m,Invitae,refseq-PYGM-NM_005609.3.npy,1,842,842
+NP_005609.3,MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGAGPPPCACRTFFRVCLKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADSAFSNPIRFPFGFTWPGTFSLIIEALHTDSPDDLATENPERLISRLATQRHLTVGEEWSQDLHSSGRTDLKYSYRFVCDEHYYGEGCSVFCRPRDDAFGHFTCGERGEKVCNPGWKGPYCTEPICLPGCDEQHGFCDKPGECKCRVGWQGRYCDECIRYPGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHHKPCKNGATCTNTGQGSYTCSCRPGYTGATCELGIDECDPSPCKNGGSCTDLENSYSCTCPPGFYGKICELSAMTCADGPCFNGGRCSDSPDGGYSCRCPVGYSGFNCEKKIDYCSSSPCSNGAKCVDLGDAYLCRCQAGFSGRHCDDNVDDCASSPCANGGTCRDGVNDFSCTCPPGYTGRNCSAPVSRCEHAPCHNGATCHERGHRYVCECARGYGGPNCQFLLPELPPGPAVVDLTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAVVVCVRLRLQKHRPPADPCRGETETMNNLANCQREKDISVSIIGATQIKNTNKKADFHGDHSADKNGFKARYPAVDYNLVQDLKGDDTAVRDAHSKRDTKCQPQGSSGEEKGTPTTLRGGEASERKRPDSGCSTSKDTKYQSVYVISEEKDECVIATEV,723,NP_005609.3.csv,refseq-DLL1-NM_005618.3_clinical_seed_0_final,refseq-DLL1-NM_005618.3.a2m,Invitae,refseq-DLL1-NM_005618.3.npy,1,723,723
+NP_005620.1,MAKKSAENGIYSVSGDEKKGPLIAPGPDGAPAKGDGPVGLGTPGGRLAVPPRETWTRQMDFIMSCVGFAVGLGNVWRFPYLCYKNGGGVFLIPYVLIALVGGIPIFFLEISLGQFMKAGSINVWNICPLFKGLGYASMVIVFYCNTYYIMVLAWGFYYLVKSFTTTLPWATCGHTWNTPDCVEIFRHEDCANASLANLTCDQLADRRSPVIEFWENKVLRLSGGLEVPGALNWEVTLCLLACWVLVYFCVWKGVKSTGKIVYFTATFPYVVLVVLLVRGVLLPGALDGIIYYLKPDWSKLGSPQVWIDAGTQIFFSYAIGLGALTALGSYNRFNNNCYKDAIILALINSGTSFFAGFVVFSILGFMAAEQGVHISKVAESGPGLAFIAYPRAVTLMPVAPLWAALFFFMLLLLGLDSQFVGVEGFITGLLDLLPASYYFRFQREISVALCCALCFVIDLSMVTDGGMYVFQLFDYYSASGTTLLWQAFWECVVVAWVYGADRFMDDIACMIGYRPCPWMKWCWSFFTPLVCMGIFIFNVVYYEPLVYNNTYVYPWWGEAMGWAFALSSMLCVPLHLLGCLLRAKGTMAERWQHLTQPIWGLHHLEYRAQDADVRGLTTLTPVSESSKVVVVESVM,635,NP_005620.1.csv,refseq-SLC6A8-NM_005629.3_clinical_seed_0_final,refseq-SLC6A8-NM_005629.3.a2m,Invitae,refseq-SLC6A8-NM_005629.3.npy,1,635,635
+NP_005621.2,MGLLPKLGASQGSDTSTSRAGRCARSVFGNIKVFVLCQGLLQLCQLLYSAYFKSSLTTIEKRFGLSSSSSGLISSLNEISNAILIIFVSYFGSRVHRPRLIGIGGLFLAAGAFILTLPHFLSEPYQYTLASTGNNSRLQAELCQKHWQDLPPSKCHSTTQNPQKETSSMWGLMVVAQLLAGIGTVPIQPFGISYVDDFSEPSNSPLYISILFAISVFGPAFGYLLGSVMLQIFVDYGRVNTAAVNLVPGDPRWIGAWWLGLLISSALLVLTSFPFFFFPRAMPIGAKRAPATADEARKLEEAKSRGSLVDFIKRFPCIFLRLLMNSLFVLVVLAQCTFSSVIAGLSTFLNKFLEKQYGTSAAYANFLIGAVNLPAAALGMLFGGILMKRFVFSLQAIPRIATTIITISMILCVPLFFMGCSTPTVAEVYPPSTSSSIHPQSPACRRDCSCPDSIFHPVCGDNGIEYLSPCHAGCSNINMSSATSKQLIYLNCSCVTGGSASAKTGSCPVPCAHFLLPAIFLISFVSLIACISHNPLYMMVLRVVNQEEKSFAIGVQFLLMRLLAWLPSPALYGLTIDHSCIRWNSLCLGRRGACAYYDNDALRDRYLGLQMGYKALGMLLLCFISWRVKKNKEYNVQKAAGLI,643,NP_005621.2.csv,refseq-SLCO2A1-NM_005630.2_clinical_seed_0_final,refseq-SLCO2A1-NM_005630.2.a2m,Invitae,refseq-SLCO2A1-NM_005630.2.npy,1,643,643
+NP_005622.1,MAAARPARGPELPLLGLLLLLLLGDPGRGAASSGNATGPGPRSAGGSARRSAAVTGPPPPLSHCGRAAPCEPLRYNVCLGSVLPYGATSTLLAGDSDSQEEAHGKLVLWSGLRNAPRCWAVIQPLLCAVYMPKCENDRVELPSRTLCQATRGPCAIVERERGWPDFLRCTPDRFPEGCTNEVQNIKFNSSGQCEVPLVRTDNPKSWYEDVEGCGIQCQNPLFTEAEHQDMHSYIAAFGAVTGLCTLFTLATFVADWRNSNRYPAVILFYVNACFFVGSIGWLAQFMDGARREIVCRADGTMRLGEPTSNETLSCVIIFVIVYYALMAGVVWFVVLTYAWHTSFKALGTTYQPLSGKTSYFHLLTWSLPFVLTVAILAVAQVDGDSVSGICFVGYKNYRYRAGFVLAPIGLVLIVGGYFLIRGVMTLFSIKSNHPGLLSEKAASKINETMLRLGIFGFLAFGFVLITFSCHFYDFFNQAEWERSFRDYVLCQANVTIGLPTKQPIPDCEIKNRPSLLVEKINLFAMFGTGIAMSTWVWTKATLLIWRRTWCRLTGQSDDEPKRIKKSKMIAKAFSKRHELLQNPGQELSFSMHTVSHDGPVAGLAFDLNEPSADVSSAWAQHVTKMVARRGAILPQDISVTPVATPVPPEEQANLWLVEAEISPELQKRLGRKKKRRKRKKEVCPLAPPPELHPPAPAPSTIPRLPQLPRQKCLVAAGAWGAGDSCRQGAWTLVSNPFCPEPSPPQDPFLPSAPAPVAWAHGRRQGLGPIHSRTNLMDTELMDADSDF,787,NP_005622.1.csv,refseq-SMO-NM_005631.5_clinical_seed_0_final,refseq-SMO-NM_005631.5.a2m,Invitae,refseq-SMO-NM_005631.5_theta_0.2.npy,1,787,787
+NP_005624.2,MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNMLCQAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNPLSLPVEKIHPLLKEVLGYKIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDVEDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRELNLIIKVFREPFVSNSKLFSANDVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPHPLVGSCFEDLAEELAFDPYESYARDILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQLEEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESACRFYSQQMKGKQLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGTLTRVGAKHERHIFLFDGLMICCKSNHGQPRLPGASNAEYRLKEKFFMRKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAALISLQYRSTLERMLDVTMLQEEKEEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPIIKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADRIAIENGDQPLSAELKRFRKEYIQPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGTVRGKAMKKWVESITKIIQRKKIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLHPIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKILEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSKRRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRNPKPLPRFPKKYSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAPNSPRTPLTPPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISLTKGTDEVPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISDRTSISDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQTPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMHRDGPPLLENAHSS,1333,NP_005624.2.csv,refseq-SOS1-NM_005633.3_clinical_seed_0_final,refseq-SOS1-NM_005633.3.a2m,Invitae,refseq-SOS1-NM_005633.3.npy,1,1333,1333
+NP_005632.1,MAEEKKLKLSNTVLPSESMKVVAESMGIAQIQEETCQLLTDEVSYRIKEIAQDALKFMHMGKRQKLTTSDIDYALKLKNVEPLYGFHAQEFIPFRFASGGGRELYFYEEKEVDLSDIINTPLPRVPLDVCLKAHWLSIEGCQPAIPENPPPAPKEQQKAEATEPLKSAKPGQEEDGPLKGKGQGATTADGKGKEKKAPPLLEGAPLRLKPRSIHELSVEQQLYYKEITEACVGSCEAKRAEALQSIATDPGLYQMLPRFSTFISEGVRVNVVQNNLALLIYLMRMVKALMDNPTLYLEKYVHELIPAVMTCIVSRQLCLRPDVDNHWALRDFAARLVAQICKHFSTTTNNIQSRITKTFTKSWVDEKTPWTTRYGSIAGLAELGHDVIKTLILPRLQQEGERIRSVLDGPVLSNIDRIGADHVQSLLLKHCAPVLAKLRPPPDNQDAYRAEFGSLGPLLCSQVVKARAQAALQAQQVNRTTLTITQPRPTLTLSQAPQPGPRTPGLLKVPGSIALPVQTLVSARAAAPPQPSPPPTKFIVMSSSSSAPSTQQVLSLSTSAPGSGSTTTSPVTTTVPSVQPIVKLVSTATTAPPSTAPSGPGSVQKYIVVSLPPTGEGKGGPTSHPSPVPPPASSPSPLSGSALCGGKQEAGDSPPPAPGTPKANGSQPNSGSPQPAP,677,NP_005632.1.csv,refseq-TAF6-NM_005641.3_clinical_seed_0_final,refseq-TAF6-NM_005641.3.a2m,Invitae,refseq-TAF6-NM_005641.3.npy,1,677,677
+NP_005641.1,MQSFREQSSYHGNQQSYPQEVHGSSRLEEFSPRQAQMFQNFGGTGGSSGSSGSGSGGGRRGAAAAAAAMASETSGHQGYQGFRKEAGDFYYMAGNKDPVTTGTPQPPQRRPSGPVQSYGPPQGSSFGNQYGSEGHVGQFQAQHSGLGGVSHYQQDYTGPFSPGSAQYQQQASSQQQQQQVQQLRQQLYQSHQPLPQATGQPASSSSHLQPMQRPSTLPSSAAGYQLRVGQFGQHYQSSASSSSSSSFPSPQRFSQSGQSYDGSYNVNAGSQYEGHNVGSNAQAYGTQSNYSYQPQSMKNFEQAKIPQGTQQGQQQQQPQQQQHPSQHVMQYTNAATKLPLQSQVGQYNQPEVPVRSPMQFHQNFSPISNPSPAASVVQSPSCSSTPSPLMQTGENLQCGQGSVPMGSRNRILQLMPQLSPTPSMMPSPNSHAAGFKGFGLEGVPEKRLTDPGLSSLSALSTQVANLPNTVQHMLLSDALTPQKKTSKRPSSSKKADSCTNSEGSSQPEEQLKSPMAESLDGGCSSSSEDQGERVRQLSGQSTSSDTTYKGGASEKAGSSPAQGAQNEPPRLNASPAAREEATSPGAKDMPLSSDGNPKVNEKTVGVIVSREAMTGRVEKPGGQDKGSQEDDPAATQRPPSNGGAKETSHASLPQPEPPGGGGSKGNKNGDNNSNHNGEGNGQSGHSAAGPGFTSRTEPSKSPGSLRYSYKDSFGSAVPRNVSGFPQYPTGQEKGDFTGHGERKGRNEKFPSLLQEVLQGYHHHPDRRYSRSTQEHQGMAGSLEGTTRPNVLVSQTNELASRGLLNKSIGSLLENPHWGPWERKSSSTAPEMKQINLTDYPIPRKFEIEPQSSAHEPGGSLSERRSVICDISPLRQIVRDPGAHSLGHMSADTRIGRNDRLNPTLSQSVILPGGLVSMETKLKSQSGQIKEEDFEQSKSQASFNNKKSGDHCHPPSIKHESYRGNASPGAATHDSLSDYGPQDSRPTPMRRVPGRVGGREGMRGRSPSQYHDFAEKLKMSPGRSRGPGGDPHHMNPHMTFSERANRSSLHTPFSPNSETLASAYHANTRAHAYGDPNAGLNSQLHYKRQMYQQQPEEYKDWSSGSAQGVIAAAQHRQEGPRKSPRQQQFLDRVRSPLKNDKDGMMYGPPVGTYHDPSAQEAGRCLMSSDGLPNKGMELKHGSQKLQESCWDLSRQTSPAKSSGPPGMSSQKRYGPPHETDGHGLAEATQSSKPGSVMLRLPGQEDHSSQNPLIMRRRVRSFISPIPSKRQSQDVKNSSTEDKGRLLHSSKEGADKAFNSYAHLSHSQDIKSIPKRDSSKDLPSPDSRNCPAVTLTSPAKTKILPPRKGRGLKLEAIVQKITSPNIRRSASSNSAEAGGDTVTLDDILSLKSGPPEGGSVAVQDADIEKRKGEVASDLVSPANQELHVEKPLPRSSEEWRGSVDDKVKTETHAETVTAGKEPPGAMTSTTSQKPGSNQGRPDGSLGGTAPLIFPDSKNVPPVGILAPEANPKAEEKENDTVTISPKQEGFPPKGYFPSGKKKGRPIGSVNKQKKQQQPPPPPPQPPQIPEGSADGEPKPKKQRQRRERRKPGAQPRKRKTKQAVPIVEPQEPEIKLKYATQPLDKTDAKNKSFYPYIHVVNKCELGAVCTIINAEEEEQTKLVRGRKGQRSLTPPPSSTESKALPASSFMLQGPVVTESSVMGHLVCCLCGKWASYRNMGDLFGPFYPQDYAATLPKNPPPKRATEMQSKVKVRHKSASNGSKTDTEEEEEQQQQQKEQRSLAAHPRFKRRHRSEDCGGGPRSLSRGLPCKKAATEGSSEKTVLDSKPSVPTTSEGGPELELQIPELPLDSNEFWVHEGCILWANGIYLVCGRLYGLQEALEIAREMKCSHCQEAGATLGCYNKGCSFRYHYPCAIDADCLLHEENFSVRCPKHKPPLPCPLPPLQNKTAKGSLSTEQSERG,1960,NP_005641.1.csv,refseq-TCF20-NM_005650.3_clinical_seed_0_final,refseq-TCF20-NM_005650.3.a2m,Invitae,refseq-TCF20-NM_005650.3.npy,1,1960,1960
+NP_005645.1,MAMVVSSWRDPQDDVAGGNPGGPNPAAQAARGGGGGAGEQQQQAGSGAPHTPQTPGQPGAPATPGTAGDKGQGPPGSGQSQQHIECVVCGDKSSGKHYGQFTCEGCKSFFKRSVRRNLTYTCRANRNCPIDQHHRNQCQYCRLKKCLKVGMRREAVQRGRMPPTQPNPGQYALTNGDPLNGHCYLSGYISLLLRAEPYPTSRYGSQCMQPNNIMGIENICELAARLLFSAVEWARNIPFFPDLQITDQVSLLRLTWSELFVLNAAQCSMPLHVAPLLAAAGLHASPMSADRVVAFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTSDACGLSDAAHIESLQEKSQCALEEYVRSQYPNQPSRFGKLLLRLPSLRTVSSSVIEQLFFVRLVGKTPIETLIRDMLLSGSSFNWPYMSIQCS,423,NP_005645.1.csv,refseq-NR2F1-NM_005654.5_clinical_seed_0_final,refseq-NR2F1-NM_005654.5.a2m,Invitae,refseq-NR2F1-NM_005654.5.npy,1,423,423
+NP_005655.1,MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPVAPVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHDLSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLPVIGSAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIKKELVVLRLASLLFKRFLSLRDELPFSEDQWDLLHYELEEYFNLIL,507,NP_005655.1.csv,refseq-MKRN3-NM_005664.3_clinical_seed_0_final,refseq-MKRN3-NM_005664.3.a2m,Invitae,refseq-MKRN3-NM_005664.3.npy,1,507,507
+NP_005661.1,MRFRFGVVVPPAVAGARPELLVVGSRPELGRWEPRGAVRLRPAGTAAGDGALALQEPGLWLGEVELAAEEAAQDGAEPGRVDTFWYKFLKREPGGELSWEGNGPHHDRCCTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIAGHQAMHYSRILPNIWLGSCPRQVEHVTIKLKHELGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMIKLYREEGLAYIWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWNLRKVQYFLMAKRPAVYIDEEALARAQEDFFQKFGKVRSSVCSL,331,NP_005661.1.csv,refseq-EPM2A-NM_005670.3_clinical_seed_0_final,refseq-EPM2A-NM_005670.3.a2m,Invitae,refseq-EPM2A-NM_005670.3.npy,1,331,331
+NP_005667.2,MEYERRGGRGDRTGRYGATDRSQDDGGENRSRDHDYRDMDYRSYPREYGSQEGKHDYDDSSEEQSAEDSYEASPGSETQRRRRRRHRHSPTGPPGFPRDGDYRDQDYRTEQGEEEEEEEDEEEEEKASNIVMLRMLPQAATEDDIRGQLQSHGVQAREVRLMRNKSSGQSRGFAFVEFSHLQDATRWMEANQHSLNILGQKVSMHYSDPKPKINEDWLCNKCGVQNFKRREKCFKCGVPKSEAEQKLPLGTRLDQQTLPLGGRELSQGLLPLPQPYQAQGVLASQALSQGSEPSSENANDTIILRNLNPHSTMDSILGALAPYAVLSSSNVRVIKDKQTQLNRGFAFIQLSTIVEAAQLLQILQALHPPLTIDGKTINVEFAKGSKRDMASNEGSRISAASVASTAIAAAQWAISQASQGGEGTWATSEEPPVDYSYYQQDEGYGNSQGTESSLYAHGYLKGTKGPGITGTKGDPTGAGPEASLEPGADSVSMQAFSRAQPGAAPGIYQQSAEASSSQGTAANSQSYTIMSPAVLKSELQSPTHPSSALPPATSPTAQESYSQYPVPDVSTYQYDETSGYYYDPQTGLYYDPNSQYYYNAQSQQYLYWDGERRTYVPALEQSADGHKETGAPSKEGKEKKEKHKTKTAQQIAKDMERWARSLNKQKENFKNSFQPISSLRDDERRESATADAGYAILEKKGALAERQHTSMDLPKLASDDRPSPPRGLVAAYSGESDSEEEQERGGPEREEKLTDWQKLACLLCRRQFPSKEALIRHQQLSGLHKQNLEIHRRAHLSENELEALEKNDMEQMKYRDRAAERREKYGIPEPPEPKRRKYGGISTASVDFEQPTRDGLGSDNIGSRMLQAMGWKEGSGLGRKKQGIVTPIEAQTRVRGSGLGARGSSYGVTSTESYKETLHKTMVTRFNEAQ,930,NP_005667.2.csv,refseq-RBM10-NM_005676.4_clinical_seed_0_final,refseq-RBM10-NM_005676.4.a2m,Invitae,refseq-RBM10-NM_005676.4.npy,1,930,930
+NP_005668.2,MVVLNPMTLGIYLQLFFLSIVSQPTFINSVLPISAALPSLDQKKRGGHKACCLLTPPPPPLFPPPFFRGGRSPLLSPDMKNLMLELETSQSPCMQGSLGSPGPPGPQGPPGLPGKTGPKGEKGELGRPGRKGRPGPPGVPGMPGPIGWPGPEGPRGEKGDLGMMGLPGSRGPMGSKGYPGSRGEKGSRGEKGDLGPKGEKGFPGFPGMLGQKGEMGPKGEPGIAGHRGPTGRPGKRGKQGQKGDSGVMGPPGKPGPSGQPGRPGPPGPPPAGQLIMGPKGERGFPGPPGRCLCGPTMNVNNPSYGESVYGPSSPRVPVIFVVNNQEELERLNTQNAIAFRRDQRSLYFKDSLGWLPIQLTPFYPVDYTADQHGTCGDGLLQPGEECDDGNSDVGDDCIRCHRAYCGDGHRHEGVEDCDGSDFGYLTCETYLPGSYGDLQCTQYCYIDSTPCRYFT,455,NP_005668.2.csv,refseq-COLQ-NM_005677.3_clinical_seed_0_final,refseq-COLQ-NM_005677.3.a2m,Invitae,refseq-COLQ-NM_005677.3.npy,1,455,455
+NP_005673.3,MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSEEALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQHQEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLLSQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQDLHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGEKVLGIVVQNTKVANLTEPVVLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRETQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCVVSALACLVTIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLEGYNLYRLVVEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLGLPWALIFFSFASGTFQLVVLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLPISSGSTSSSRI,693,NP_005673.3.csv,refseq-ADGRG1-NM_005682.6_clinical_seed_0_final,refseq-ADGRG1-NM_005682.6.a2m,Invitae,refseq-ADGRG1-NM_005682.6.npy,1,693,693
+NP_005678.3,MPTVSVKRDLLFQALGRTYTDEEFDELCFEFGLELDEITSEKEIISKEQGNVKAAGASDVVLYKIDVPANRYDLLCLEGLVRGLQVFKERIKAPVYKRVMPDGKIQKLIITEETAKIRPFAVAAVLRNIKFTKDRYDSFIELQEKLHQNICRKRALVAIGTHDLDTLSGPFTYTAKRPSDIKFKPLNKTKEYTACELMNIYKTDNHLKHYLHIIENKPLYPVIYDSNGVVLSMPPIINGDHSRITVNTRNIFIECTGTDFTKAKIVLDIIVTMFSEYCENQFTVEAAEVVFPNGKSHTFPELAYRKEMVRADLINKKVGIRETPENLAKLLTRMYLKSEVIGDGNQIEIEIPPTRADIIHACDIVEDAAIAYGYNNIQMTLPKTYTIANQFPLNKLTELLRHDMAAAGFTEALTFALCSQEDIADKLGVDISATKAVHISNPKTAEFQVARTTLLPGLLKTIAANRKMPLPLKLFEISDIVIKDSNTDVGAKNYRHLCAVYYNKNPGFEIIHGLLDRIMQLLDVPPGEDKGGYVIKASEGPAFFPGRCAEIFARGQSVGKLGVLHPDVITKFELTMPCSSLEINVGPFL,589,NP_005678.3.csv,refseq-FARSB-NM_005687.4_clinical_seed_0_final,refseq-FARSB-NM_005687.4.a2m,Invitae,refseq-FARSB-NM_005687.4.npy,1,589,589
+NP_005680.1,MVTVGNYCEAEGPVGPAWMQDGLSPCFFFTLVPSTRMALGTLALVLALPCRRRERPAGADSLSWGAGPRISPYVLQLLLATLQAALPLAGLAGRVGTARGAPLPSYLLLASVLESLAGACGLWLLVVERSQARQRLAMGIWIKFRHSPGLLLLWTVAFAAENLALVSWNSPQWWWARADLGQQVQFSLWVLRYVVSGGLFVLGLWAPGLRPQSYTLQVHEEDQDVERSQVRSAAQQSTWRDFGRKLRLLSGYLWPRGSPALQLVVLICLGLMGLERALNVLVPIFYRNIVNLLTEKAPWNSLAWTVTSYVFLKFLQGGGTGSTGFVSNLRTFLWIRVQQFTSRRVELLIFSHLHELSLRWHLGRRTGEVLRIADRGTSSVTGLLSYLVFNVIPTLADIIIGIIYFSMFFNAWFGLIVFLCMSLYLTLTIVVTEWRTKFRRAMNTQENATRARAVDSLLNFETVKYYNAESYEVERYREAIIKYQGLEWKSSASLVLLNQTQNLVIGLGLLAGSLLCAYFVTEQKLQVGDYVLFGTYIIQLYMPLNWFGTYYRMIQTNFIDMENMFDLLKEETEVKDLPGAGPLRFQKGRIEFENVHFSYADGRETLQDVSFTVMPGQTLALVGPSGAGKSTILRLLFRFYDISSGCIRIDGQDISQVTQASLRSHIGVVPQDTVLFNDTIADNIRYGRVTAGNDEVEAAAQAAGIHDAIMAFPEGYRTQVGERGLKLSGGEKQRVAIARTILKAPGIILLDEATSALDTSNERAIQASLAKVCANRTTIVVAHRLSTVVNADQILVIKDGCIVERGRHEALLSRGGVYADMWQLQQGQEETSEDTKPQTMER,842,NP_005680.1.csv,refseq-ABCB6-NM_005689.2_clinical_seed_0_final,refseq-ABCB6-NM_005689.2.a2m,Invitae,refseq-ABCB6-NM_005689.2_theta_0.2.npy,1,842,842
+NP_005682.2,MSLSFCGNNISSYNINDGVLQNSCFVDALNLVPHVFLLFITFPILFIGWGSQSSKVQIHHNTWLHFPGHNLRWILTFALLFVHVCEIAEGIVSDSRRESRHLHLFMPAVMGFVATTTSIVYYHNIETSNFPKLLLALFLYWVMAFITKTIKLVKYCQSGLDISNLRFCITGMMVILNGLLMAVEINVIRVRRYVFFMNPQKVKPPEDLQDLGVRFLQPFVNLLSKATYWWMNTLIISAHKKPIDLKAIGKLPIAMRAVTNYVCLKDAYEEQKKKVADHPNRTPSIWLAMYRAFGRPILLSSTFRYLADLLGFAGPLCISGIVQRVNETQNGTNNTTGISETLSSKEFLENAYVLAVLLFLALILQRTFLQASYYVTIETGINLRGALLAMIYNKILRLSTSNLSMGEMTLGQINNLVAIETNQLMWFLFLCPNLWAMPVQIIMGVILLYNLLGSSALVGAAVIVLLAPIQYFIATKLAEAQKSTLDYSTERLKKTNEILKGIKLLKLYAWEHIFCKSVEETRMKELSSLKTFALYTSLSIFMNAAIPIAAVLATFVTHAYASGNNLKPAEAFASLSLFHILVTPLFLLSTVVRFAVKAIISVQKLNEFLLSDEIGDDSWRTGESSLPFESCKKHTGVQPKTINRKQPGRYHLDSYEQSTRRLRPAETEDIAIKVTNGYFSWGSGLATLSNIDIRIPTGQLTMIVGQVGCGKSSLLLAILGEMQTLEGKVHWSNVNESEPSFEATRSRNRYSVAYAAQKPWLLNATVEENITFGSPFNKQRYKAVTDACSLQPDIDLLPFGDQTEIGERGINLSGGQRQRICVARALYQNTNIVFLDDPFSALDIHLSDHLMQEGILKFLQDDKRTLVLVTHKLQYLTHADWIIAMKDGSVLREGTLKDIQTKDVELYEHWKTLMNRQDQELEKDMEADQTTLERKTLRRAMYSREAKAQMEDEDEEEEEEEDEDDNMSTVMRLRTKMPWKTCWRYLTSGGFFLLILMIFSKLLKHSVIVAIDYWLATWTSEYSINNTGKADQTYYVAGFSILCGAGIFLCLVTSLTVEWMGLTAAKNLHHNLLNKIILGPIRFFDTTPLGLILNRFSADTNIIDQHIPPTLESLTRSTLLCLSAIGMISYATPVFLVALLPLGVAFYFIQKYFRVASKDLQELDDSTQLPLLCHFSETAEGLTTIRAFRHETRFKQRMLELTDTNNIAYLFLSAANRWLEVRTDYLGACIVLTASIASISGSSNSGLVGLGLLYALTITNYLNWVVRNLADLEVQMGAVKKVNSFLTMESENYEGTMDPSQVPEHWPQEGEIKIHDLCVRYENNLKPVLKHVKAYIKPGQKVGICGRTGSGKSSLSLAFFRMVDIFDGKIVIDGIDISKLPLHTLRSRLSIILQDPILFSGSIRFNLDPECKCTDDRLWEALEIAQLKNMVKSLPGGLDAVVTEGGENFSVGQRQLFCLARAFVRKSSILIMDEATASIDMATENILQKVVMTAFADRTVVTIAHRVSSIMDAGLVLVFSEGILVECDTVPNLLAHKNGLFSTLVMTNK,1549,NP_005682.2.csv,refseq-ABCC9-NM_005691.2_clinical_seed_0_final,refseq-ABCC9-NM_005691.2.a2m,Invitae,refseq-ABCC9-NM_005691.2_theta_0.2.npy,1,1549,1549
+NP_005699.1,MPSWIGAVILPLLGLLLSLPAGADVKARSCGEVRQAYGAKGFSLADIPYQEIAGEHLRICPQEYTCCTTEMEDKLSQQSKLEFENLVEETSHFVRTTFVSRHKKFDEFFRELLENAEKSLNDMFVRTYGMLYMQNSEVFQDLFTELKRYYTGGNVNLEEMLNDFWARLLERMFQLINPQYHFSEDYLECVSKYTDQLKPFGDVPRKLKIQVTRAFIAARTFVQGLTVGREVANRVSKVSPTPGCIRALMKMLYCPYCRGLPTVRPCNNYCLNVMKGCLANQADLDTEWNLFIDAMLLVAERLEGPFNIESVMDPIDVKISEAIMNMQENSMQVSAKVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEERPTTAAGTSLDRLVTDIKEKLKLSKKVWSALPYTICKDESVTAGTSNEEECWNGHSKARYLPEIMNDGLTNQINNPEVDVDITRPDTFIRQQIMALRVMTNKLKNAYNGNDVNFQDTSDESSGSGSGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRLCR,555,NP_005699.1.csv,refseq-GPC6-NM_005708.3_clinical_seed_0_final,refseq-GPC6-NM_005708.3.a2m,Invitae,refseq-GPC6-NM_005708.3.npy,1,555,555
+NP_005700.2,MDRKVAREFRHKVDFLIENDAEKDYLYDVLRMYHQTMDVAVLVGDLKLVINEPSRLPLFDAIRPLIPLKHQVEYDQLTPRRSRKLKEVRLDRLHPEGLGLSVRGGLEFGCGLFISHLIKGGQADSVGLQVGDEIVRINGYSISSCTHEEVINLIRTKKTVSIKVRHIGLIPVKSSPDEPLTWQYVDQFVSESGGVRGSLGSPGNRENKEKKVFISLVGSRGLGCSISSGPIQKPGIFISHVKPGSLSAEVGLEIGDQIVEVNGVDFSNLDHKEAVNVLKSSRSLTISIVAAAGRELFMTDRERLAEARQRELQRQELLMQKRLAMESNKILQEQQEMERQRRKEIAQKAAEENERYRKEMEQIVEEEEKFKKQWEEDWGSKEQLLLPKTITAEVHPVPLRKPKYDQGVEPELEPADDLDGGTEEQGEQDFRKYEEGFDPYSMFTPEQIMGKDVRLLRIKKEGSLDLALEGGVDSPIGKVVVSAVYERGAAERHGGIVKGDEIMAINGKIVTDYTLAEAEAALQKAWNQGGDWIDLVVAVCPPKEYDDELTFF,552,NP_005700.2.csv,refseq-USH1C-NM_005709.3_clinical_seed_0_final,refseq-USH1C-NM_005709.3.a2m,Invitae,refseq-USH1C-NM_005709.3.npy,1,552,552
+NP_005701.1,MPLPVALQTRLAKRGILKHLEPEPEEEIIAEDYDDDPVDYEATRLEGLPPSWYKVFDPSCGLPYYWNADTDLVSWLSPHDPNSVVTKSAKKLRSSNADAEEKLDRSHDKSDRGHDKSDRSHEKLDRGHDKSDRGHDKSDRDRERGYDKVDRERERDRERDRDRGYDKADREEGKERRHHRREELAPYPKSKKAVSRKDEELDPMDPSSYSDAPRGTWSTGLPKRNEAKTGADTTAAGPLFQQRPYPSPGAVLRANAEASRTKQQD,265,NP_005701.1.csv,refseq-PQBP1-NM_005710.2_clinical_seed_0_final,refseq-PQBP1-NM_005710.2.a2m,Invitae,refseq-PQBP1-NM_005710.2.npy,1,265,265
+NP_005711.1,MAYHSFLVEPISCHAWNKDRTQIAICPNNHEVHIYEKSGAKWTKVHELKEHNGQVTGIDWAPESNRIVTCGTDRNAYVWTLKGRTWKPTLVILRINRAARCVRWAPNENKFAVGSGSRVISICYFEQENDWWVCKHIKKPIRSTVLSLDWHPNNVLLAAGSCDFKCRIFSAYIKEVEERPAPTPWGSKMPFGELMFESSSSCGWVHGVCFSASGSRVAWVSHDSTVCLADADKKMAVATLASETLPLLALTFITDNSLVAAGHDCFPVLFTYDAAAGMLSFGGRLDVPKQSSQRGLTARERFQNLDKKASSEGGTAAGAGLDSLHKNSVSQISVLSGGKAKCSQFCTTGMDGGMSIWDVKSLESALKDLKIK,372,NP_005711.1.csv,refseq-ARPC1B-NM_005720.3_clinical_seed_0_final,refseq-ARPC1B-NM_005720.3.a2m,Invitae,refseq-ARPC1B-NM_005720.3.npy,1,372,372
+NP_005723.2,MSRIEKMSILGVRSFGIEDKDKQIITFFSPLTILVGPNGAGKTTIIECLKYICTGDFPPGTKGNTFVHDPKVAQETDVRAQIRLQFRDVNGELIAVQRSMVCTQKSKKTEFKTLEGVITRTKHGEKVSLSSKCAEIDREMISSLGVSKAVLNNVIFCHQEDSNWPLSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVKEYQMELKYLKQYKEKACEIRDQITSKEAQLTSSKEIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALDSRKKQMEKDNSELEEKMEKVFQGTDEQLNDLYHNHQRTVREKERKLVDCHRELEKLNKESRLLNQEKSELLVEQGRLQLQADRHQEHIRARDSLIQSLATQLELDGFERGPFSERQIKNFHKLVRERQEGEAKTANQLMNDFAEKETLKQKQIDEIRDKKTGLGRIIELKSEILSKKQNELKNVKYELQQLEGSSDRILELDQELIKAERELSKAEKNSNVETLKMEVISLQNEKADLDRTLRKLDQEMEQLNHHTTTRTQMEMLTKDKADKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASSEQNKNHINNELKRKEEQLSSYEDKLFDVCGSQDFESDLDRLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEVISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLVPMRQSIIDLKEKEIPELRNKLQNVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGIDLDRTVQQVNQEKQEKQHKLDTVSSKIELNRKLIQDQQEQIQHLKSTTNELKSEKLQISTNLQRRQQLEEQTVELSTEVQSLYREIKDAKEQVSPLETTLEKFQQEKEELINKKNTSNKIAQDKLNDIKEKVKNIHGYMKDIENYIQDGKDDYKKQKETELNKVIAQLSECEKHKEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRNEELKEVEEERKQHLKEMGQMQVLQMKSEHQKLEENIDNIKRNHNLALGRQKGYEEEIIHFKKELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTLDQAIMKFHSMKMEEINKIIRDLWRSTYRGQDIEYIEIRSDADENVSASDKRRNYNYRVVMLKGDTALDMRGRCSAGQKVLASLIIRLALAETFCLNCGIIALDEPTTNLDRENIESLAHALVEIIKSRSQQRNFQLLVITHDEDFVELLGRSEYVEKFYRIKKNIDQCSEIVKCSVSSLGFNVH,1312,NP_005723.2.csv,refseq-RAD50-NM_005732.4_clinical_seed_0_final,refseq-RAD50-NM_005732.4.a2m,Invitae,refseq-RAD50-NM_005732.4_theta_0.2.npy,1,1312,1312
+NP_005754.2,MLQVHRTGLGRLGVSLSKGLHHKAVLAVRREDVNAWERRAPLAPKHIKGITNLGYKVLIQPSNRRAIHDKDYVKAGGILQEDISEACLILGVKRPPEEKLMSRKTYAFFSHTIKAQEANMGLLDEILKQEIRLIDYEKMVDHRGVRVVAFGQWAGVAGMINILHGMGLRLLALGHHTPFMHIGMAHNYRNSSQAVQAVRDAGYEISLGLMPKSIGPLTFVFTGTGNVSKGAQAIFNELPCEYVEPHELKEVSQTGDLRKVYGTVLSRHHHLVRKTDAVYDPAEYDKHPERYISRFNTDIAPYTTCLINGIYWEQNTPRLLTRQDAQSLLAPGKFSPAGVEGCPALPHKLVAICDISADTGGSIEFMTECTTIEHPFCMYDADQHIIHDSVEGSGILMCSIDNLPAQLPIEATECFGDMLYPYVEEMILSDATQPLESQNFSPVVRDAVITSNGTLPDKYKYIQTLRESRERAQSLSMGTRRKVLVLGSGYISEPVLEYLSRDGNIEITVGSDMKNQIEQLGKKYNINPVSMDICKQEEKLGFLVAKQDLVISLLPYVLHPLVAKACITNKVNMVTASYITPALKELEKSVEDAGITIIGELGLDPGLDHMLAMETIDKAKEVGATIESYISYCGGLPAPEHSNNPLRYKFSWSPVGVLMNVMQSATYLLDGKVVNVAGGISFLDAVTSMDFFPGLNLEGYPNRDSTKYAEIYGISSAHTLLRGTLRYKGYMKALNGFVKLGLINREALPAFRPEANPLTWKQLLCDLVGISPSSEHDVLKEAVLKKLGGDNTQLEAAEWLGLLGDEQVPQAESILDALSKHLVMKLSYGPEEKDMIVMRDSFGIRHPSGHLEHKTIDLVAYGDINGFSAMAKTVGLPTAMAAKMLLDGEIGAKGLMGPFSKEIYGPILERIKAEGIIYTTQSTIKP,926,NP_005754.2.csv,refseq-AASS-NM_005763.3_clinical_seed_0_final,refseq-AASS-NM_005763.3.a2m,Invitae,refseq-AASS-NM_005763.3.npy,1,926,926
+NP_005756.2,MAVFVVLLALVAGVLGNEFSILKSPGSVVFRNGNWPIPGERIPDVAALSMGFSVKEDLSWPGLAVGNLFHRPRATVMVMVKGVNKLALPPGSVISYPLENAVPFSLDSVANSIHSLFSEETPVVLQLAPSEERVYMVGKANSVFEDLSVTLRQLRNRLFQENSVLSSLPLNSLSRNNEVDLLFLSELQVLHDISSLLSRHKHLAKDHSPDLYSLELAGLDEIGKRYGEDSEQFRDASKILVDALQKFADDMYSLYGGNAVVELVTVKSFDTSLIRKTRTILEAKQAKNPASPYNLAYKYNFEYSVVFNMVLWIMIALALAVIITSYNIWNMDPGYDSIIYRMTNQKIRMD,350,NP_005756.2.csv,refseq-ATP6AP2-NM_005765.2_clinical_seed_0_final,refseq-ATP6AP2-NM_005765.2.a2m,Invitae,refseq-ATP6AP2-NM_005765.2.npy,1,350,350
+NP_005778.1,MAAGLRKRGRSGSAAQAEGLCKQWLQRAWQERRLLLREPRYTLLVAACLCLAEVGITFWVIHRVAYTEIDWKAYMAEVEGVINGTYDYTQLQGDTGPLVYPAGFVYIFMGLYYATSRGTDIRMAQNIFAVLYLATLLLVFLIYHQTCKVPPFVFFFMCCASYRVHSIFVLRLFNDPVAMVLLFLSINLLLAQRWGWGCCFFSLAVSVKMNVLLFAPGLLFLLLTQFGFRGALPKLGICAGLQVVLGLPFLLENPSGYLSRSFDLGRQFLFHWTVNWRFLPEALFLHRAFHLALLTAHLTLLLLFALCRWHRTGESILSLLRDPSKRKVPPQPLTPNQIVSTLFTSNFIGICFSRSLHYQFYVWYFHTLPYLLWAMPARWLTHLLRLLVLGLIELSWNTYPSTSCSSAALHICHAVILLQLWLGPQPFPKSTQHSKKAH,438,NP_005778.1.csv,refseq-ALG3-NM_005787.5_clinical_seed_0_final,refseq-ALG3-NM_005787.5.a2m,Invitae,refseq-ALG3-NM_005787.5.npy,1,438,438
+NP_005848.2,MGMWASLDALWEMPAEKRIFGAVLLFSWTVYLWETFLAQRQRRIYKTTTHVPPELGQIMDSETFEKSRLYQLDKSTFSFWSGLYSETEGTLILLFGGIPYLWRLSGRFCGYAGFGPEYEITQSLVFLLLATLFSALTGLPWSLYNTFVIEEKHGFNQQTLGFFMKDAIKKFVVTQCILLPVSSLLLYIIKIGGDYFFIYAWLFTLVVSLVLVTIYADYIAPLFDKFTPLPEGKLKEEIEVMAKSIDFPLTKVYVVEGSKRSSHSNAYFYGFFKNKRIVLFDTLLEEYSVLNKDIQEDSGMEPRNEEEGNSEEIKAKVKNKKQGCKNEEVLAVLGHELGHWKLGHTVKNIIISQMNSFLCFFLFAVLIGRKELFAAFGFYDSQPTLIGLLIIFQFIFSPYNEVLSFCLTVLSRRFEFQADAFAKKLGKAKDLYSALIKLNKDNLGFPVSDWLFSMWHYSHPPLLERLQALKTMKQH,475,NP_005848.2.csv,refseq-ZMPSTE24-NM_005857.4_clinical_seed_0_final,refseq-ZMPSTE24-NM_005857.4.a2m,Invitae,refseq-ZMPSTE24-NM_005857.4.npy,1,475,475
+NP_005850.1,MADRDSGSEQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGGGAPGGLQHETQELASKRVDIQNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAVEFRDYLGDFIEHYAQLGPSQPPDLAQAQDEPRRALKSEFLVRENRKYYMDLKENQRGRFLRIRQTVNRGPGLGSTQGQTIALPAQGLIEFRDALAKLIDDYGVEEEPAELPEGTSLTVDNKRFFFDVGSNKYGVFMRVSEVKPTYRNSITVPYKVWAKFGHTFCKYSEEMKKIQEKQREKRAACEQLHQQQQQQQEETAAATLLLQGEEEGEED,322,NP_005850.1.csv,refseq-PURA-NM_005859.4_clinical_seed_0_final,refseq-PURA-NM_005859.4.a2m,Invitae,refseq-PURA-NM_005859.4.npy,1,322,322
+NP_005852.2,MKGKEEKEGGARLGAGGGSPEKSPSAQELKEQGNRLFVGRKYPEAAACYGRAITRNPLVAVYYTNRALCYLKMQQHEQALADCRRALELDGQSVKAHFFLGQCQLEMESYDEAIANLQRAYSLAKEQRLNFGDDIPSALRIAKKKRWNSIEERRIHQESELHSYLSRLIAAERERELEECQRNHEGDEDDSHVRAQQACIEAKHDKYMADMDELFSQVDEKRKKRDIPDYLCGKISFELMREPCITPSGITYDRKDIEEHLQRVGHFDPVTRSPLTQEQLIPNLAMKEVIDAFISENGWVEDY,303,NP_005852.2.csv,refseq-STUB1-NM_005861.3_clinical_seed_0_final,refseq-STUB1-NM_005861.3.a2m,Invitae,refseq-STUB1-NM_005861.3.npy,1,303,303
+NP_005853.2,MITSELPVLQDSTNETTAHSDAGSELEETEVKGKRKRGRPGRPPSTNKKPRKSPGEKSRIEAGIRGAGRGRANGHPQQNGEGEPVTLFEVVKLGKSAMQSVVDDWIESYKQDRDIALLDLINFFIQCSGCRGTVRIEMFRNMQNAEIIRKMTEEFDEDSGDYPLTMPGPQWKKFRSNFCEFIGVLIRQCQYSIIYDEYMMDTVISLLTGLSDSQVRAFRHTSTLAAMKLMTALVNVALNLSIHQDNTQRQYEAERNKMIGKRANERLELLLQKRKELQENQDEIENMMNSIFKGIFVHRYRDAIAEIRAICIEEIGVWMKMYSDAFLNDSYLKYVGWTLHDRQGEVRLKCLKALQSLYTNRELFPKLELFTNRFKDRIVSMTLDKEYDVAVEAIRLVTLILHGSEEALSNEDCENVYHLVYSAHRPVAVAAGEFLHKKLFSRHDPQAEEALAKRRGRNSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLEEPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNKLTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVVEKHVESDVLEACSKTYSILCSEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEEADDDDIYNVLSTLKRLTSFHNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQCSHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDLLMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEANKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQIDKIQCAKTLILSLQQLFNELVQEQGPNLDRTSAHVSGIKELARRFALTFGLDQIKTREAVATLHKDGIEFAFKYQNQKGQEYPPPNLAFLEVLSEFSSKLLRQDKKTVHSYLEKFLTEQMMERREDVWLPLISYRNSLVTGGEDDRMSVNSGSSSSKTSSVRNKKGRPPLHKKRVEDESLDNTWLNRTDTMIQTPGPLPAPQLTSTVLRENSRPMGDQIQEPESEHGSEPDFLHNPQMQISWLGQPKLEDLNRKDRTGMNYMKVRTGVRHAVRGLMEEDAEPIFEDVMMSSRSQLEDMNEEFEDTMVIDLPPSRNRRERAELRPDFFDSAAIIEDDSGFGMPMF,1258,NP_005853.2.csv,refseq-STAG1-NM_005862.2_clinical_seed_0_final,refseq-STAG1-NM_005862.2.a2m,Invitae,refseq-STAG1-NM_005862.2.npy,1,1258,1258
+NP_005857.1,MQWAVGRRWAWAALLLAVAAVLTQVVWLWLGTQSFVFQREEIAQLARQYAGLDHELAFSRLIVELRRLHPGHVLPDEELQWVFVNAGGWMGAMCLLHASLSEYVLLFGTALGSRGHSGRYWAEISDTIISGTFHQWREGTTKSEVFYPGETVVHGPGEATAVEWGPNTWMVEYGRGVIPSTLAFALADTVFSTQDFLTLFYTLRSYARGLRLELTTYLFGQDP,223,NP_005857.1.csv,refseq-SIGMAR1-NM_005866.3_clinical_seed_0_final,refseq-SIGMAR1-NM_005866.3.a2m,Invitae,refseq-SIGMAR1-NM_005866.3.npy,1,223,223
+NP_005887.2,MSKKISGGSVVEMQGDEMTRIIWELIKEKLIFPYVELDLHSYDLGIENRDATNDQVTKDAAEAIKKHNVGVKCATITPDEKRVEEFKLKQMWKSPNGTIRNILGGTVFREAIICKNIPRLVSGWVKPIIIGRHAYGDQYRATDFVVPGPGKVEITYTPSDGTQKVTYLVHNFEEGGGVAMGMYNQDKSIEDFAHSSFQMALSKGWPLYLSTKNTILKKYDGRFKDIFQEIYDKQYKSQFEAQKIWYEHRLIDDMVAQAMKSEGGFIWACKNYDGDVQSDSVAQGYGSLGMMTSVLVCPDGKTVEAEAAHGTVTRHYRMYQKGQETSTNPIASIFAWTRGLAHRAKLDNNKELAFFANALEEVSIETIEAGFMTKDLAACIKGLPNVQRSDYLNTFEFMDKLGENLKIKLAQAKL,414,NP_005887.2.csv,refseq-IDH1-NM_005896.4_clinical_seed_0_final,refseq-IDH1-NM_005896.4.a2m,Invitae,refseq-IDH1-NM_005896.4.npy,1,414,414
+NP_005893.1,MSSILPFTPPIVKRLLGWKKGEQNGQEEKWCEKAVKSLVKKLKKTGQLDELEKAITTQNVNTKCITIPRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHELRAMELCEFAFNMKKDEVCVNPYHYQRVETPVLPPVLVPRHTEIPAEFPPLDDYSHSIPENTNFPAGIEPQSNIPETPPPGYLSEDGETSDHQMNHSMDAGSPNLSPNPMSPAHNNLDLQPVTYCEPAFWCSISYYELNQRVGETFHASQPSMTVDGFTDPSNSERFCLGLLSNVNRNAAVELTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKIPPGCNLKIFNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKGWGAEYRRQTVTSTPCWIELHLNGPLQWLDKVLTQMGSPSIRCSSVS,425,NP_005893.1.csv,SMAD3_HUMAN_b03_clinical_seed_0_final,SMAD3_HUMAN_b03.a2m,EVE,SMAD3_HUMAN_b03_theta_0.2.npy,1,425,425
+NP_005899.3,MRLHLLLLLALCGAGTTAAELSYSLRGNWSICNGNGSLELPGAVPGCVHSALFQQGLIQDSYYRFNDLNYRWVSLDNWTYSKEFKIPFEISKWQKVNLILEGVDTVSKILFNEVTIGETDNMFNRYSFDITNVVRDVNSIELRFQSAVLYAAQQSKAHTRYQVPPDCPPLVQKGECHVNFVRKEQCSFSWDWGPSFPTQGIWKDVRIEAYNICHLNYFTFSPIYDKSAQEWNLEIESTFDVVSSKPVGGQVIVAIPKLQTQQTYSIELQPGKRIVELFVNISKNITVETWWPHGHGNQTGYNMTVLFELDGGLNIEKSAKVYFRTVELIEEPIKGSPGLSFYFKINGFPIFLKGSNWIPADSFQDRVTSELLRLLLQSVVDANMNTLRVWGGGIYEQDEFYELCDELGIMVWQDFMFACALYPTDQGFLDSVTAEVAYQIKRLKSHPSIIIWSGNNENEEALMMNWYHISFTDRPIYIKDYVTLYVKNIRELVLAGDKSRPFITSSPTNGAETVAEAWVSQNPNSNYFGDVHFYDYISDCWNWKVFPKARFASEYGYQSWPSFSTLEKVSSTEDWSFNSKFSLHRQHHEGGNKQMLYQAGLHFKLPQSTDPLRTFKDTIYLTQVMQAQCVKTETEFYRRSRSEIVDQQGHTMGALYWQLNDIWQAPSWASLEYGGKWKMLHYFAQNFFAPLLPVGFENENTFYIYGVSDLHSDYSMTLSVRVHTWSSLEPVCSRVTERFVMKGGEAVCLYEEPVSELLRRCGNCTRESCVVSFYLSADHELLSPTNYHFLSSPKEAVGLCKAQITAIISQQGDIFVFDLETSAVAPFVWLDVGSIPGRFSDNGFLMTEKTRTILFYPWEPTSKNELEQSFHVTSLTDIY,879,NP_005899.3.csv,refseq-MANBA-NM_005908.3_clinical_seed_0_final,refseq-MANBA-NM_005908.3.a2m,Invitae,refseq-MANBA-NM_005908.3.npy,1,879,879
+NP_005901.2,MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPGSETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAGHVTQARMVSKSKDGTGSDDKKAKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL,441,NP_005901.2.csv,refseq-MAPT-NM_005910.5_clinical_seed_0_final,refseq-MAPT-NM_005910.5.a2m,Invitae,refseq-MAPT-NM_005910.5.npy,1,441,441
+NP_005903.2,MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY,332,NP_005903.2.csv,refseq-MC4R-NM_005912.2_clinical_seed_0_final,refseq-MC4R-NM_005912.2.a2m,Invitae,refseq-MC4R-NM_005912.2.npy,1,332,332
+NP_005907.3,MALKDYALEKEKVKKFLQEFYQDDELGKKQFKYGNQLVRLAHREQVALYVDLDDVAEDDPELVDSICENARRYAKLFADAVQELLPQYKEREVVNKDVLDVYIEHRLMMEQRSRDPGMVRSPQNQYPAELMRRFELYFQGPSSNKPRVIREVRADSVGKLVTVRGIVTRVSEVKPKMVVATYTCDQCGAETYQPIQSPTFMPLIMCPSQECQTNRSGGRLYLQTRGSRFIKFQEMKMQEHSDQVPVGNIPRSITVLVEGENTRIAQPGDHVSVTGIFLPILRTGFRQVVQGLLSETYLEAHRIVKMNKSEDDESGAGELTREELRQIAEEDFYEKLAASIAPEIYGHEDVKKALLLLLVGGVDQSPRGMKIRGNINICLMGDPGVAKSQLLSYIDRLAPRSQYTTGRGSSGVGLTAAVLRDSVSGELTLEGGALVLADQGVCCIDEFDKMAEADRTAIHEVMEQQTISIAKAGILTTLNARCSILAAANPAYGRYNPRRSLEQNIQLPAALLSRFDLLWLIQDRPDRDNDLRLAQHITYVHQHSRQPPSQFEPLDMKLMRRYIAMCREKQPMVPESLADYITAAYVEMRREAWASKDATYTSARTLLAILRLSTALARLRMVDVVEKEDVNEAIRLMEMSKDSLLGDKGQTARTQRPADVIFATVRELVSGGRSVRFSEAEQRCVSRGFTPAQFQAALDEYEELNVWQVNASRTRITFV,719,NP_005907.3.csv,refseq-MCM7-NM_005916.4_clinical_seed_0_final,refseq-MCM7-NM_005916.4.a2m,Invitae,refseq-MCM7-NM_005916.4.npy,1,719,719
+NP_005912.1,MAAAAGNRASSSGFPGARATSPEAGGGGGALKASSAPAAAAGLLREAGSGGRERADWRRRQLRKVRSVELDQLPEQPLFLAASPPASSTSPSPEPADAAGSGTGFQPVAVPPPHGAASRGGAHLTESVAAPDSGASSPAAAEPGEKRAPAAEPSPAAAPAGREMENKETLKGLHKMDDRPEERMIREKLKATCMPAWKHEWLERRNRRGPVVVKPIPVKGDGSEMNHLAAESPGEVQASAASPASKGRRSPSPGNSPSGRTVKSESPGVRRKRVSPVPFQSGRITPPRRAPSPDGFSPYSPEETNRRVNKVMRARLYLLQQIGPNSFLIGGDSPDNKYRVFIGPQNCSCARGTFCIHLLFVMLRVFQLEPSDPMLWRKTLKNFEVESLFQKYHSRRSSRIKAPSRNTIQKFVSRMSNSHTLSSSSTSTSSSENSIKDEEEQMCPICLLGMLDEESLTVCEDGCRNKLHHHCMSIWAEECRRNREPLICPLCRSKWRSHDFYSHELSSPVDSPSSLRAAQQQTVQQQPLAGSRRNQESNFNLTHYGTQQIPPAYKDLAEPWIQVFGMELVGCLFSRNWNVREMALRRLSHDVSGALLLANGESTGNSGGSSGSSPSGGATSGSSQTSISGDVVEACCSVLSMVCADPVYKVYVAALKTLRAMLVYTPCHSLAERIKLQRLLQPVVDTILVKCADANSRTSQLSISTLLELCKGQAGELAVGREILKAGSIGIGGVDYVLNCILGNQTESNNWQELLGRLCLIDRLLLEFPAEFYPHIVSTDVSQAEPVEIRYKKLLSLLTFALQSIDNSHSMVGKLSRRIYLSSARMVTTVPHVFSKLLEMLSVSSSTHFTRMRRRLMAIADEVEIAEAIQLGVEDTLDGQQDSFLQASVPNNYLETTENSSPECTVHLEKTGKGLCATKLSASSEDISERLASISVGPSSSTTTTTTTTEQPKPMVQTKGRPHSQCLNSSPLSHHSQLMFPALSTPSSSTPSVPAGTATDVSKHRLQGFIPCRIPSASPQTQRKFSLQFHRNCPENKDSDKLSPVFTQSRPLPSSNIHRPKPSRPTPGNTSKQGDPSKNSMTLDLNSSSKCDDSFGCSSNSSNAVIPSDETVFTPVEEKCRLDVNTELNSSIEDLLEASMPSSDTTVTFKSEVAVLSPEKAENDDTYKDDVNHNQKCKEKMEAEEEEALAIAMAMSASQDALPIVPQLQVENGEDIIIIQQDTPETLPGHTKAKQPYREDTEWLKGQQIGLGAFSSCYQAQDVGTGTLMAVKQVTYVRNTSSEQEEVVEALREEIRMMSHLNHPNIIRMLGATCEKSNYNLFIEWMAGGSVAHLLSKYGAFKESVVINYTEQLLRGLSYLHENQIIHRDVKGANLLIDSTGQRLRIADFGAAARLASKGTGAGEFQGQLLGTIAFMAPEVLRGQQYGRSCDVWSVGCAIIEMACAKPPWNAEKHSNHLALIFKIASATTAPSIPSHLSPGLRDVALRCLELQPQDRPPSRELLKHPVFRTTW,1512,NP_005912.1.csv,refseq-MAP3K1-NM_005921.1_clinical_seed_0_final,refseq-MAP3K1-NM_005921.1.a2m,Invitae,refseq-MAP3K1-NM_005921.1.npy,1,1512,1512
+NP_005923.3,MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVGAAFNVKPQGSRLDLFGERRGLFGVPELSAPEGFHIAQEKALRKTELLVDRACSTPPGPQTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMVEKLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKEKRKRAVDLNVKILDLSSTFLMGTNFPNKIEKHLLPEHIRRNFTSAGDHIIIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYSTFSHRALQGTIAKNPETVMQFLEKLSDKLSERTLKDFEMIRGMKMKLNPQNSEVMPWDPPYYSGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKLAVVHESEGLLGYIYCDFFQRADKPHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSRSSPTLLTPSMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRVVNQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDILKETQEKFYGLPYVPNTAWQLRFSHLVGYGARYYSYLMSRAVASMVWKECFLQDPFNRAAGERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE,713,NP_005923.3.csv,refseq-MIPEP-NM_005932.4_clinical_seed_0_final,refseq-MIPEP-NM_005932.4.a2m,Invitae,refseq-MIPEP-NM_005932.4_theta_0.2.npy,1,713,713
+NP_005934.2,MWKSWKLRTDVRVREGAGGSPCASSQPGSRGPCFLPGLSSQEVSRRRQFLREHAAPFSAFLTDSFGRQHSYLRISLTEKCNLRCQYCMPEEGVPLTPKANLLTTEEILTLARLFVKEGIDKIRLTGGEPLIRPDVVDIVAQLQRLEGLRTIGVTTNGINLARLLPQLQKAGLSAINISLDTLVPAKFEFIVRRKGFHKVMEGIHKAIELGYNPVKVNCVVMRGLNEDELLDFAALTEGLPLDVRFIEYMPFDGNKWNFKKMVSYKEMLDTVRQQWPELEKVPEEESSTAKAFKIPGFQGQISFITSMSEHFCGTCNRLRITADGNLKVCLFGNSEVSLRDHLRAGASEQELLRIIGAAVGRKKRQHAGMFSISQMKNRPMILIGG,385,NP_005934.2.csv,refseq-MOCS1-NM_005943.5_clinical_seed_0_final,refseq-MOCS1-NM_005943.5.a2m,Invitae,refseq-MOCS1-NM_005943.5.npy,1,385,385
+NP_005947.3,MAPAEILNGKEISAQIRARLKNQVTQLKEQVPGFTPRLAILQVGNRDDSNLYINVKLKAAEEIGIKATHIKLPRTTTESEVMKYITSLNEDSTVHGFLVQLPLDSENSINTEEVINAIAPEKDVDGLTSINAGKLARGDLNDCFIPCTPKGCLELIKETGVPIAGRHAVVVGRSKIVGAPMHDLLLWNNATVTTCHSKTAHLDEEVNKGDILVVATGQPEMVKGEWIKPGAIVIDCGINYVPDDKKPNGRKVVGDVAYDEAKERASFITPVPGGVGPMTVAMLMQSTVESAKRFLEKFKPGKWMIQYNNLNLKTPVPSDIDISRSCKPKPIGKLAREIGLLSEEVELYGETKAKVLLSALERLKHRPDGKYVVVTGITPTPLGEGKSTTTIGLVQALGAHLYQNVFACVRQPSQGPTFGIKGGAAGGGYSQVIPMEEFNLHLTGDIHAITAANNLVAAAIDARIFHELTQTDKALFNRLVPSVNGVRRFSDIQIRRLKRLGIEKTDPTTLTDEEINRFARLDIDPETITWQRVLDTNDRFLRKITIGQAPTEKGHTRTAQFDISVASEIMAVLALTTSLEDMRERLGKMVVASSKKGEPVSAEDLGVSGALTVLMKDAIKPNLMQTLEGTPVFVHAGPFANIAHGNSSIIADRIALKLVGPEGFVVTEAGFGADIGMEKFFNIKCRYSGLCPHVVVLVATVRALKMHGGGPTVTAGLPLPKAYIQENLELVEKGFSNLKKQIENARMFGIPVVVAVNAFKTDTESELDLISRLSREHGAFDAVKCTHWAEGGKGALALAQAVQRAAQAPSSFQLLYDLKLPVEDKIRIIAQKIYGADDIELLPEAQHKAEVYTKQGFGNLPICMAKTHLSLSHNPEQKGVPTGFILPIRDIRASVGAGFLYPLVGTMSTMPGLPTRPCFYDIDLDPETEQVNGLF,935,NP_005947.3.csv,refseq-MTHFD1-NM_005956.3_clinical_seed_0_final,refseq-MTHFD1-NM_005956.3.a2m,Invitae,refseq-MTHFD1-NM_005956.3_theta_0.2.npy,1,935,935
+NP_005948.3,MVNEARGNSSLNPCLEGSASSGSESSKDSSRCSTPGLDPERHERLREKMRRRLESGDKWFSLEFFPPRTAEGAVNLISRFDRMAAGGPLYIDVTWHPAGDPGSDKETSSMMIASTAVNYCGLETILHMTCCRQRLEEITGHLHKAKQLGLKNIMALRGDPIGDQWEEEEGGFNYAVDLVKHIRSEFGDYFDICVAGYPKGHPEAGSFEADLKHLKEKVSAGADFIITQLFFEADTFFRFVKACTDMGITCPIVPGIFPIQGYHSLRQLVKLSKLEVPQEIKDVIEPIKDNDAAIRNYGIELAVSLCQELLASGLVPGLHFYTLNREMATTEVLKRLGMWTEDPRRPLPWALSAHPKRREEDVRPIFWASRPKSYIYRTQEWDEFPNGRWGNSSSPAFGELKDYYLFYLKSKSPKEELLKMWGEELTSEESVFEVFVLYLSGEPNRNGHKVTCLPWNDEPLAAETSLLKEELLRVNRQGILTINSQPNINGKPSSDPIVGWGPSGGYVFQKAYLEFFTSRETAEALLQVLKKYELRVNYHLVNVKGENITNAPELQPNAVTWGIFPGREIIQPTVVDPVSFMFWKDEAFALWIERWGKLYEEESPSRTIIQYIHDNYFLVNLVDNDFPLDNCLWQVVEDTLELLNRPTQNARETEAP,656,NP_005948.3.csv,refseq-MTHFR-NM_005957.4_clinical_seed_0_final,refseq-MTHFR-NM_005957.4.a2m,Invitae,refseq-MTHFR-NM_005957.4.npy,1,656,656
+NP_005955.3,MAQRTGLEDPERYLFVDRAVIYNPATQADWTAKKLVWIPSERHGFEAASIKEERGDEVMVELAENGKKAMVNKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLKDRYYSGLIYTYSGLFCVVINPYKNLPIYSENIIEMYRGKKRHEMPPHIYAISESAYRCMLQDREDQSILCTGESGAGKTENTKKVIQYLAHVASSHKGRKDHNIPGELERQLLQANPILESFGNAKTVKNDNSSRFGKFIRINFDVTGYIVGANIETYLLEKSRAVRQAKDERTFHIFYQLLSGAGEHLKSDLLLEGFNNYRFLSNGYIPIPGQQDKDNFQETMEAMHIMGFSHEEILSMLKVVSSVLQFGNISFKKERNTDQASMPENTVAQKLCHLLGMNVMEFTRAILTPRIKVGRDYVQKAQTKEQADFAVEALAKATYERLFRWLVHRINKALDRTKRQGASFIGILDIAGFEIFELNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIERPANPPGVLALLDEECWFPKATDKTFVEKLVQEQGSHSKFQKPRQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNVATLLHQSSDRFVAELWKDVDRIVGLDQVTGMTETAFGSAYKTKKGMFRTVGQLYKESLTKLMATLRNTNPNFVRCIIPNHEKRAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILTPNAIPKGFMDGKQACERMIRALELDPNLYRIGQSKIFFRAGVLAHLEEERDLKITDIIIFFQAVCRGYLARKAFAKKQQQLSALKVLQRNCAAYLKLRHWQWWRVFTKVKPLLQVTRQEEELQAKDEELLKVKEKQTKVEGELEEMERKHQQLLEEKNILAEQLQAETELFAEAEEMRARLAAKKQELEEILHDLESRVEEEEERNQILQNEKKKMQAHIQDLEEQLDEEEGARQKLQLEKVTAEAKIKKMEEEILLLEDQNSKFIKEKKLMEDRIAECSSQLAEEEEKAKNLAKIRNKQEVMISDLEERLKKEEKTRQELEKAKRKLDGETTDLQDQIAELQAQIDELKLQLAKKEEELQGALARGDDETLHKNNALKVVRELQAQIAELQEDFESEKASRNKAEKQKRDLSEELEALKTELEDTLDTTAAQQELRTKREQEVAELKKALEEETKNHEAQIQDMRQRHATALEELSEQLEQAKRFKANLEKNKQGLETDNKELACEVKVLQQVKAESEHKRKKLDAQVQELHAKVSEGDRLRVELAEKASKLQNELDNVSTLLEEAEKKGIKFAKDAASLESQLQDTQELLQEETRQKLNLSSRIRQLEEEKNSLQEQQEEEEEARKNLEKQVLALQSQLADTKKKVDDDLGTIESLEEAKKKLLKDAEALSQRLEEKALAYDKLEKTKNRLQQELDDLTVDLDHQRQVASNLEKKQKKFDQLLAEEKSISARYAEERDRAEAEAREKETKALSLARALEEALEAKEEFERQNKQLRADMEDLMSSKDDVGKNVHELEKSKRALEQQVEEMRTQLEELEDELQATEDAKLRLEVNMQAMKAQFERDLQTRDEQNEEKKRLLIKQVRELEAELEDERKQRALAVASKKKMEIDLKDLEAQIEAANKARDEVIKQLRKLQAQMKDYQRELEEARASRDEIFAQSKESEKKLKSLEAEILQLQEELASSERARRHAEQERDELADEITNSASGKSALLDEKRRLEARIAQLEEELEEEQSNMELLNDRFRKTTLQVDTLNAELAAERSAAQKSDNARQQLERQNKELKAKLQELEGAVKSKFKATISALEAKIGQLEEQLEQEAKERAAANKLVRRTEKKLKEIFMQVEDERRHADQYKEQMEKANARMKQLKRQLEEAEEEATRANASRRKLQRELDDATEANEGLSREVSTLKNRLRRGGPISFSSSRSGRRQLHLEGASLELSDDDTESKTSDVNETQPPQSE,1976,NP_005955.3.csv,refseq-MYH10-NM_005964.4_clinical_seed_0_final,refseq-MYH10-NM_005964.4.a2m,Invitae,refseq-MYH10-NM_005964.4.npy,1,1976,1976
+NP_005973.1,MSMLPSFGFTQEQVACVCEVLQQGGNLERLGRFLWSLPACDHLHKNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHPKLQQLWLKAHYVEAEKLRGRPLGAVGKYRVRRKFPLPRTIWDGEETSYCFKEKSRGVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDRAAEAKERENTENNNSSSNKQNQLSPLEGGKPLMSSSEEEFSPPQSPDQNSVLLLQGNMGHARSSNYSLPGLTASQPSHGLQTHQHQLQDSLLGPLTSSLVDLGS,284,NP_005973.1.csv,refseq-SIX1-NM_005982.3_clinical_seed_0_final,refseq-SIX1-NM_005982.3.a2m,Invitae,refseq-SIX1-NM_005982.3.npy,1,284,284
+NP_005984.3,MALSDEPAAGGPEEEAEDETLAFGAALEAFGESAETRALLGRLREVHGGGAEREVALERFRVIMDKYQEQPHLLDPHLEWMMNLLLDIVQDQTSPASLVHLAFKFLYIITKVRGYKTFLRLFPHEVADVEPVLDLVTIQNPKDHEAWETRYMLLLWLSVTCLIPFDFSRLDGNLLTQPGQARMSIMDRILQIAESYLIVSDKARDAAAVLVSRFITRPDVKQSKMAEFLDWSLCNLARSSFQTMQGVITMDGTLQALAQIFKHGKREDCLPYAATVLRCLDGCRLPESNQTLLRKLGVKLVQRLGLTFLKPKVAAWRYQRGCRSLAANLQLLTQGQSEQKPLILTEDDDEDDDVPEGVERVIEQLLVGLKDKDTVVRWSAAKGIGRMAGRLPRALADDVVGSVLDCFSFQETDKAWHGGCLALAELGRRGLLLPSRLVDVVAVILKALTYDEKRGACSVGTNVRDAACYVCWAFARAYEPQELKPFVTAISSALVIAAVFDRDINCRRAASAAFQENVGRQGTFPHGIDILTTADYFAVGNRSNCFLVISVFIAGFPEYTQPMIDHLVTMKISHWDGVIRELAARALHNLAQQAPEFSATQVFPRLLSMTLSPDLHMRHGSILACAEVAYALYKLAAQENRPVTDHLDEQAVQGLKQIHQQLYDRQLYRGLGGQLMRQAVCVLIEKLSLSKMPFRGDTVIDGWQWLINDTLRHLHLISSHSRQQMKDAAVSALAALCSEYYMKEPGEADPAIQEELITQYLAELRNPEEMTRCGFSLALGALPGFLLKGRLQQVLTGLRAVTHTSPEDVSFAESRRDGLKAIARICQTVGVKAGAPDEAVCGENVSQIYCALLGCMDDYTTDSRGDVGTWVRKAAMTSLMDLTLLLARSQPELIEAHTCERIMCCVAQQASEKIDRFRAHAASVFLTLLHFDSPPIPHVPHRGELEKLFPRSDVASVNWSAPSQAFPRITQLLGLPTYRYHVLLGLVVSLGGLTESTIRHSTQSLFEYMKGIQSDPQALGSFSGTLLQIFEDNLLNERVSVPLLKTLDHVLTHGCFDIFTTEEDHPFAVKLLALCKKEIKNSKDIQKLLSGIAVFCEMVQFPGDVRRQALLQLCLLLCHRFPLIRKTTASQVYETLLTYSDVVGADVLDEVVTVLSDTAWDAELAVVREQRNRLCDLLGVPRPQLVPQPGAC,1192,NP_005984.3.csv,refseq-TBCD-NM_005993.4_clinical_seed_0_final,refseq-TBCD-NM_005993.4.a2m,Invitae,refseq-TBCD-NM_005993.4.npy,1,1192,1192
+NP_005985.3,MREPALAASAMAYHPFHAPRPADFPMSAFLAAAQPSFFPALALPPGALAKPLPDPGLAGAAAAAAAAAAAAEAGLHVSALGPHPPAAHLRSLKSLEPEDEVEDDPKVTLEAKELWDQFHKLGTEMVITKSGRRMFPPFKVRVSGLDKKAKYILLMDIVAADDCRYKFHNSRWMVAGKADPEMPKRMYIHPDSPATGEQWMAKPVAFHKLKLTNNISDKHGFTILNSMHKYQPRFHIVRANDILKLPYSTFRTYVFPETDFIAVTAYQNDKITQLKIDNNPFAKGFRDTGNGRREKRKQLTLPSLRLYEEHCKPERDGAESDASSCDPPPAREPPTSPGAAPSPLRLHRARAEEKSCAADSDPEPERLSEERAGAPLGRSPAPDSASPTRLTEPERARERRSPERGKEPAESGGDGPFGLRSLEKERAEARRKDEGRKEAAEGKEQGLAPLVVQTDSASPLGAGHLPGLAFSSHLHGQQFFGPLGAGQPLFLHPGQFTMGPGAFSAMGMGHLLASVAGGGNGGGGGPGTAAGLDAGGLGPAASAASTAAPFPFHLSQHMLASQGIPMPTFGGLFPYPYTYMAAAAAAASALPATSAAAAAAAAAGSLSRSPFLGSARPRLRFSPYQIPVTIPPSTSLLTTGLASEGSKAAGGNSREPSPLPELALRKVGAPSRGALSPSGSAKEAANELQSIQRLVSGLESQRALSPGRESPK,712,NP_005985.3.csv,refseq-TBX2-NM_005994.3_clinical_seed_0_final,refseq-TBX2-NM_005994.3.a2m,Invitae,refseq-TBX2-NM_005994.3.npy,1,712,712
+NP_005987.3,MSLSMRDPVIPGTSMAYHPFLPHRAPDFAMSAVLGHQPPFFPALTLPPNGAAALSLPGALAKPIMDQLVGAAETGIPFSSLGPQAHLRPLKTMEPEEEVEDDPKVHLEAKELWDQFHKRGTEMVITKSGRRMFPPFKVRCSGLDKKAKYILLMDIIAADDCRYKFHNSRWMVAGKADPEMPKRMYIHPDSPATGEQWMSKVVTFHKLKLTNNISDKHGFTILNSMHKYQPRFHIVRANDILKLPYSTFRTYLFPETEFIAVTAYQNDKITQLKIDNNPFAKGFRDTGNGRREKRKQLTLQSMRVFDERHKKENGTSDESSSEQAAFNCFAQASSPAASTVGTSNLKDLCPSEGESDAEAESKEEHGPEACDAAKISTTTSEEPCRDKGSPAVKAHLFAAERPRDSGRLDKASPDSRHSPATISSSTRGLGAEERRSPVREGTAPAKVEEARALPGKEAFAPLTVQTDAAAAHLAQGPLPGLGFAPGLAGQQFFNGHPLFLHPSQFAMGGAFSSMAAAGMGPLLATVSGASTGVSGLDSTAMASAAAAQGLSGASAATLPFHLQQHVLASQGLAMSPFGSLFPYPYTYMAAAAAASSAAASSSVHRHPFLNLNTMRPRLRYSPYSIPVPVPDGSSLLTTALPSMAAAAGPLDGKVAALAASPASVAVDSGSELNSRSSTLSSSSMSLSPKLCAEKEAATSELQSIQRLVSGLEAKPDRSRSASP,723,NP_005987.3.csv,refseq-TBX3-NM_005996.3_clinical_seed_0_final,refseq-TBX3-NM_005996.3.a2m,Invitae,refseq-TBX3-NM_005996.3.npy,1,723,723
+NP_005991.1,MRECISVHVGQAGVQMGNACWELYCLEHGIQPDGQMPSDKTIGGGDDSFTTFFCETGAGKHVPRAVFVDLEPTVIDEIRNGPYRQLFHPEQLITGKEDAANNYARGHYTIGKEIIDPVLDRIRKLSDQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNLNRLISQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAEKAYHEQLSVAEITNACFEPANQMVKCDPRHGKYMACCLLYRGDVVPKDVNAAIAAIKTKRSIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGIDSYEDEDEGEE,448,NP_005991.1.csv,refseq-TUBA4A-NM_006000.2_clinical_seed_0_final,refseq-TUBA4A-NM_006000.2.a2m,Invitae,refseq-TUBA4A-NM_006000.2.npy,1,448,448
+NP_005994.2,MLSVASRSGPFAPVLSATSRGVAGALRPLVQATVPATPEQPVLDLKRPFLSRESLSGQAVRRPLVASVGLNVPASVCYSHTDIKVPDFSEYRRLEVLDSTKSSRESSEARKGFSYLVTGVTTVGVAYAAKNAVTQFVSSMSASADVLALAKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTQKEIEQEAAVELSQLRDPQHDLDRVKKPEWVILIGVCTHLGCVPIANAGDFGGYYCPCHGSHYDASGRIRLGPAPLNLEVPTYEFTSDDMVIVG,274,NP_005994.2.csv,refseq-UQCRFS1-NM_006003.2_clinical_seed_0_final,refseq-UQCRFS1-NM_006003.2.a2m,Invitae,refseq-UQCRFS1-NM_006003.2.npy,1,274,274
+NP_005996.2,MDSNTAPLGPSCPQPPPAPQPQARSRLNATASLEQERSERPRAPGPQAGPGPGVRDAAAPAEPQAQHTRSRERADGTGPTKGDMEIPFEEVLERAKAGDPKAQTEVGKHYLQLAGDTDEELNSCTAVDWLVLAAKQGRREAVKLLRRCLADRRGITSENEREVRQLSSETDLERAVRKAALVMYWKLNPKKKKQVAVAELLENVGQVNEHDGGAQPGPVPKSLQKQRRMLERLVSSESKNYIALDDFVEITKKYAKGVIPSSLFLQDDEDDDELAGKSPEDLPLRLKVVKYPLHAIMEIKEYLIDMASRAGMHWLSTIIPTHHINALIFFFIVSNLTIDFFAFFIPLVIFYLSFISMVICTLKVFQDSKAWENFRTLTDLLLRFEPNLDVEQAEVNFGWNHLEPYAHFLLSVFFVIFSFPIASKDCIPCSELAVITGFFTVTSYLSLSTHAEPYTRRALATEVTAGLLSLLPSMPLNWPYLKVLGQTFITVPVGHLVVLNVSVPCLLYVYLLYLFFRMAQLRNFKGTYCYLVPYLVCFMWCELSVVILLESTGLGLLRASIGYFLFLFALPILVAGLALVGVLQFARWFTSLELTKIAVTVAVCSVPLLLRWWTKASFSVVGMVKSLTRSSMVKLILVWLTAIVLFCWFYVYRSEGMKVYNSTLTWQQYGALCGPRAWKETNMARTQILCSHLEGHRVTWTGRFKYVRVTDIDNSAESAINMLPFFIGDWMRCLYGEAYPACSPGNTSTAEEELCRLKLLAKHPCHIKKFDRYKFEITVGMPFSSGADGSRSREEDDVTKDIVLRASSEFKSVLLSLRQGSLIEFSTILEGRLGSKWPVFELKAISCLNCMAQLSPTRRHVKIEHDWRSTVHGAVKFAFDFFFFPFLSAA,890,NP_005996.2.csv,refseq-WFS1-NM_006005.3_clinical_seed_0_final,refseq-WFS1-NM_006005.3.a2m,Invitae,refseq-WFS1-NM_006005.3.npy,1,890,890
+NP_006000.2,MRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTIGGGDDSFNTFFSETGAGKHVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIIDLVLDRIRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAEKAYHEQLSVAEITNACFEPANQMVKCDPRHGKYMACCLLYRGDVVPKDVNAAIATIKTKRTIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVDSVEGEGEEEGEEY,451,NP_006000.2.csv,refseq-TUBA1A-NM_006009.3_clinical_seed_0_final,refseq-TUBA1A-NM_006009.3.a2m,Invitae,refseq-TUBA1A-NM_006009.3.npy,1,451,451
+NP_006006.3,MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPGLAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPGSGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLTSPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHAPMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQQQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQSTTQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPPQELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQIHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQMHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVAMHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAGMAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGSMQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQKRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPYSRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYSPSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPPAKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATATAATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQGRNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAPVPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPVPASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGLLAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPGQRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASENSEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFESKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSSTLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKRCVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQGVSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKNPVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQNPPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLFLIGQS,2285,NP_006006.3.csv,refseq-ARID1A-NM_006015.4_clinical_seed_0_final,refseq-ARID1A-NM_006015.4.a2m,Invitae,refseq-ARID1A-NM_006015.4.npy,1,2285,2285
+NP_006008.1,MALVLGSLLLLGLCGNSFSGGQPSSTDAPKAWNYELPATNYETQDSHKAGPIGILFELVHIFLYVVQPRDFPEDTLRKFLQKAYESKIDYDKPETVILGLKIVYYEAGIILCCVLGLLFIILMPLVGYFFCMCRCCNKCGGEMHQRQKENGPFLRKCFAISLLVICIIISIGIFYGFVANHQVRTRIKRSRKLADSNFKDLRTLLNETPEQIKYILAQYNTTKDKAFTDLNSINSVLGGGILDRLRPNIIPVLDEIKSMATAIKETKEALENMNSTLKSLHQQSTQLSSSLTSVKTSLRSSLNDPLCLVHPSSETCNSIRLSLSQLNSNPELRQLPPVDAELDNVNNVLRTDLDGLVQQGYQSLNDIPDRVQRQTTTVVAGIKRVLNSIGSDIDNVTQRLPIQDILSAFSVYVNNTESYIHRNLPTLEEYDSYWWLGGLVICSLLTLIVIFYYLGLLCGVCGYDRHATPTTRGCVSNTGGVFLMVGVGLSFLFCWILMIIVVLTFVFGANVEKLICEPYTSKELFRVLDTPYLLNEDWEYYLSGKLFNKSKMKLTFEQVYSDCKKNRGTYGTLHLQNSFNISEHLNINEHTGSISSELESLKVNLNIFLLGAAGRKNLQDFAACGIDRMNYDSYLAQTGKSPAGVNLLSFAYDLEAKANSLPPGNLRNSLKRDAQTIKTIHQQRVLPIEQSLSTLYQSVKILQRTGNGLLERVTRILASLDFAQNFITNNTSSVIIEETKKYGRTIIGYFEHYLQWIEFSISEKVASCKPVATALDTAVDVFLCSYIIDPLNLFWFGIGKATVFLLPALIFAVKLAKYYRRMDSEDVYDDVETIPMKNMENGNNGYHKDHVYGIHNPVMTSPSQH,865,NP_006008.1.csv,refseq-PROM1-NM_006017.3_clinical_seed_0_final,refseq-PROM1-NM_006017.3.a2m,Invitae,refseq-PROM1-NM_006017.3_theta_0.2.npy,1,865,865
+NP_006010.2,MGSMFRSEEVALVQLFLPTAAAYTCVSRLGELGLVEFRDLNASVSAFQRRFVVDVRRCEELEKTFTFLQEEVRRAGLVLPPPKGRLPAPPPRDLLRIQEETERLAQELRDVRGNQQALRAQLHQLQLHAAVLRQGHEPQLAAAHTDGASERTPLLQAPGGPHQDLRVNFVAGAVEPHKAPALERLLWRACRGFLIASFRELEQPLEHPVTGEPATWMTFLISYWGEQIGQKIRKITDCFHCHVFPFLQQEEARLGALQQLQQQSQELQEVLGETERFLSQVLGRVLQLLPPGQVQVHKMKAVYLALNQCSVSTTHKCLIAEAWCSVRDLPALQEALRDSSMEEGVSAVAHRIPCRDMPPTLIRTNRFTASFQGIVDAYGVGRYQEVNPAPYTIITFPFLFAVMFGDVGHGLLMFLFALAMVLAENRPAVKAAQNEIWQTFFRGRYLLLLMGLFSIYTGFIYNECFSRATSIFPSGWSVAAMANQSGWSDAFLAQHTMLTLDPNVTGVFLGPYPFGIDPIWSLAANHLSFLNSFKMKMSVILGVVHMAFGVVLGVFNHVHFGQRHRLLLETLPELTFLLGLFGYLVFLVIYKWLCVWAARAASAPSILIHFINMFLFSHSPSNRLLYPRQEVVQATLVVLALAMVPILLLGTPLHLLHRHRRRLRRRPADRQEENKAGLLDLPDASVNGWSSDEEKAGGLDDEEEAELVPSEVLMHQAIHTIEFCLGCVSNTASYLRLWALSLAHAQLSEVLWAMVMRIGLGLGREVGVAAVVLVPIFAAFAVMTVAILLVMEGLSAFLHALRLHWVEFQNKFYSGTGYKLSPFTFAATDD,830,NP_006010.2.csv,VPP3_HUMAN_b01_clinical_seed_0_final,VPP3_HUMAN_b01.a2m,EVE,VPP3_HUMAN_b01_theta_0.2.npy,1,830,830
+NP_006021.2,MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASAYSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGDIESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNVYEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYVFAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLVKSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSFESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQLVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYVFKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASNRTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANLMLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQQLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHAQRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHSDGPEQCELVQRPRYRRGPHICFDYNATEDTSDCGRGASFPPSLGVLVSLQLLLLLGLPPRPQPQVLVHASRRL,1143,NP_006021.2.csv,refseq-CACNA2D2-NM_006030.3_clinical_seed_0_final,refseq-CACNA2D2-NM_006030.3.a2m,Invitae,refseq-CACNA2D2-NM_006030.3.npy,1,1143,1143
+NP_006026.3,MSAKVRLKKLEQLLLDGPWRNESALSVETLLDVLVCLYTECSHSALRRDKYVAEFLEWAKPFTQLVKEMQLHREDFEIIKVIGRGAFGEVAVVKMKNTERIYAMKILNKWEMLKRAETACFREERDVLVNGDCQWITALHYAFQDENHLYLVMDYYVGGDLLTLLSKFEDKLPEDMARFYIGEMVLAIDSIHQLHYVHRDIKPDNVLLDVNGHIRLADFGSCLKMNDDGTVQSSVAVGTPDYISPEILQAMEDGMGKYGPECDWWSLGVCMYEMLYGETPFYAESLVETYGKIMNHEERFQFPSHVTDVSEEAKDLIQRLICSRERRLGQNGIEDFKKHAFFEGLNWENIRNLEAPYIPDVSSPSDTSNFDVDDDVLRNTEILPPGSHTGFSGLHLPFIGFTFTTESCFSDRGSLKSIMQSNTLTKDEDVQRDLEHSLQMEAYERRIRRLEQEKLELSRKLQESTQTVQSLHGSSRALSNSNRDKEIKKLNEEIERLKNKIADSNRLERQLEDTVALRQEREDSTQRLRGLEKQHRVVRQEKEELHKQLVEASERLKSQAKELKDAHQQRKLALQEFSELNERMAELRAQKQKVSRQLRDKEEEMEVATQKVDAMRQEMRRAEKLRKELEAQLDDAVAEASKERKLREHSENFCKQMESELEALKVKQGGRGAGATLEHQQEISKIKSELEKKVLFYEEELVRREASHVLEVKNVKKEVHDSESHQLALQKEILMLKDKLEKSKRERHNEMEEAVGTIKDKYERERAMLFDENKKLTAENEKLCSFVDKLTAQNRQLEDELQDLAAKKESVAHWEAQIAEIIQWVSDEKDARGYLQALASKMTEELEALRSSSLGSRTLDPLWKVRRSQKLDMSARLELQSALEAEIRAKQLVQEELRKVKDANLTLESKLKDSEAKNRELLEEMEILKKKMEEKFRADTGLKLPDFQDSIFEYFNTAPLAHDLTFRTSSASEQETQAPKPEASPSMSVAASEQQEDMARPPQRPSAVPLPTTQALALAGPKPKAHQFSIKSFSSPTQCSHCTSLMVGLIRQGYACEVCSFACHVSCKDGAPQVCPIPPEQSKRPLGVDVQRGIGTAYKGHVKVPKPTGVKKGWQRAYAVVCDCKLFLYDLPEGKSTQPGVIASQVLDLRDDEFSVSSVLASDVIHATRRDIPCIFRVTASLLGAPSKTSSLLILTENENEKRKWVGILEGLQSILHKNRLRNQVVHVPLEAYDSSLPLIKAILTAAIVDADRIAVGLEEGLYVIEVTRDVIVRAADCKKVHQIELAPREKIVILLCGRNHHVHLYPWSSLDGAEGSFDIKLPETKGCQLMATATLKRNSGTCLFVAVKRLILCYEIQRTKPFHRKFNEIVAPGSVQCLAVLRDRLCVGYPSGFCLLSIQGDGQPLNLVNPNDPSLAFLSQQSFDALCAVELESEEYLLCFSHMGLYVDPQGRRARAQELMWPAAPVACSCSPTHVTVYSEYGVDVFDVRTMEWVQTIGLRRIRPLNSEGTLNLLNCEPPRLIYFKSKFSGAVLNVPDTSDNSKKQMLRTRSKRRFVFKVPEEERLQQRREMLRDPELRSKMISNPTNFNHVAHMGPGDGMQVLMDLPLSAVPPSQEERPGPAPTNLARQPPSRNKPYISWPSSGGSEPSVTVPLRSMSDPDQDFDKEPDSDSTKHSTPSNSSNPSGPPSPNSPHRSQLPLEGLEQPACDT,1711,NP_006026.3.csv,refseq-CDC42BPB-NM_006035.3_clinical_seed_0_final,refseq-CDC42BPB-NM_006035.3.a2m,Invitae,refseq-CDC42BPB-NM_006035.3_theta_0.2.npy,1,1711,1711
+NP_006027.2,MQQKTKLFLQALKYSIPHLGKCMQKQHLNHYNFADHCYNRIKLKKYHLTKCLQNKPKISELARNIPSRSFSCKDLQPVKQENEKPLPENMDAFEKVRTKLETQPQEEYEIINVEVKHGGFVYYQEGCCLVRSKDEEADNDNYEVLFNLEELKLDQPFIDCIRVAPDEKYVAAKIRTEDSEASTCVIIKLSDQPVMEASFPNVSSFEWVKDEEDEDVLFYTFQRNLRCHDVYRATFGDNKRNERFYTEKDPSYFVFLYLTKDSRFLTINIMNKTTSEVWLIDGLSPWDPPVLIQKRIHGVLYYVEHRDDELYILTNVGEPTEFKLMRTAADTPAIMNWDLFFTMKRNTKVIDLDMFKDHCVLFLKHSNLLYVNVIGLADDSVRSLKLPPWACGFIMDTNSDPKNCPFQLCSPIRPPKYYTYKFAEGKLFEETGHEDPITKTSRVLRLEAKSKDGKLVPMTVFHKTDSEDLQKKPLLVHVYGAYGMDLKMNFRPERRVLVDDGWILAYCHVRGGGELGLQWHADGRLTKKLNGLADLEACIKTLHGQGFSQPSLTTLTAFSAGGVLAGALCNSNPELVRAVTLEAPFLDVLNTMMDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPYQNIKPQHYPSIHITAYENDERVPLKGIVSYTEKLKEAIAEHAKDTGEGYQTPNIILDIQPGGNHVIEDSHKKITAQIKFLYEELGLDSTSVFEDLKKYLKF,727,NP_006027.2.csv,refseq-PREPL-NM_006036.4_clinical_seed_0_final,refseq-PREPL-NM_006036.4.a2m,Invitae,refseq-PREPL-NM_006036.4_theta_0.2.npy,1,727,727
+NP_006028.2,MSSQSHPDGLSGRDQPVELLNPARVNHMPSTVDVATALPLQVAPSAVPMDLRLDHQFSLPVAEPALREQQLQQELLALKQKQQIQRQILIAEFQRQHEQLSRQHEAQLHEHIKQQQEMLAMKHQQELLEHQRKLERHRQEQELEKQHREQKLQQLKNKEKGKESAVASTEVKMKLQEFVLNKKKALAHRNLNHCISSDPRYWYGKTQHSSLDQSSPPQSGVSTSYNHPVLGMYDAKDDFPLRKTASEPNLKLRSRLKQKVAERRSSPLLRRKDGPVVTALKKRPLDVTDSACSSAPGSGPSSPNNSSGSVSAENGIAPAVPSIPAETSLAHRLVAREGSAAPLPLYTSPSLPNITLGLPATGPSAGTAGQQDAERLTLPALQQRLSLFPGTHLTPYLSTSPLERDGGAAHSPLLQHMVLLEQPPAQAPLVTGLGALPLHAQSLVGADRVSPSIHKLRQHRPLGRTQSAPLPQNAQALQHLVIQQQHQQFLEKHKQQFQQQQLQMNKIIPKPSEPARQPESHPEETEEELREHQALLDEPYLDRLPGQKEAHAQAGVQVKQEPIESDEEEAEPPREVEPGQRQPSEQELLFRQQALLLEQQRIHQLRNYQASMEAAGIPVSFGGHRPLSRAQSSPASATFPVSVQEPPTKPRFTTGLVYDTLMLKHQCTCGSSSSHPEHAGRIQSIWSRLQETGLRGKCECIRGRKATLEELQTVHSEAHTLLYGTNPLNRQKLDSKKLLGSLASVFVRLPCGGVGVDSDTIWNEVHSAGAARLAVGCVVELVFKVATGELKNGFAVVRPPGHHAEESTPMGFCYFNSVAVAAKLLQQRLSVSKILIVDWDVHHGNGTQQAFYSDPSVLYMSLHRYDDGNFFPGSGAPDEVGTGPGVGFNVNMAFTGGLDPPMGDAEYLAAFRTVVMPIASEFAPDVVLVSSGFDAVEGHPTPLGGYNLSARCFGYLTKQLMGLAGGRIVLALEGGHDLTAICDASEACVSALLGNELDPLPEKVLQQRPNANAVRSMEKVMEIHSKYWRCLQRTTSTAGRSLIEAQTCENEEAETVTAMASLSVGVKPAEKRPDEEPMEEEPPL,1084,NP_006028.2.csv,refseq-HDAC4-NM_006037.3_clinical_seed_0_final,refseq-HDAC4-NM_006037.3.a2m,Invitae,refseq-HDAC4-NM_006037.3.npy,1,1084,1084
+NP_006051.1,MDADEGQDMSQVSGKESPPVSDTPDEGDEPMPIPEDLSTTSGGQQSSKSDRVVASNVKVETQSDEENGRACEMNGEECAEDLRMLDASGEKMNGSHRDQGSSALSGVGGIRLPNGKLKCDICGIICIGPNVLMVHKRSHTGERPFQCNQCGASFTQKGNLLRHIKLHSGEKPFKCHLCNYACRRRDALTGHLRTHSVGKPHKCGYCGRSYKQRSSLEEHKERCHNYLESMGLPGTLYPVIKEETNHSEMAEDLCKIGSERSLVLDRLASNVAKRKSSMPQKFLGDKGLSDTPYDSSASYEKENEMMKSHVMDQAINNAINYLGAESLRPLVQTPPGGSEVVPVISPMYQLHKPLAEGTPRSNHSAQDSAVENLLLLSKAKLVPSEREASPSNSCQDSTDTESNNEEQRSGLIYLTNHIAPHARNGLSLKEEHRAYDLLRAASENSQDALRVVSTSGEQMKVYKCEHCRVLFLDHVMYTIHMGCHGFRDPFECNMCGYHSQDRYEFSSHITRGEHRFHMS,519,NP_006051.1.csv,refseq-IKZF1-NM_006060.6_clinical_seed_0_final,refseq-IKZF1-NM_006060.6.a2m,Invitae,refseq-IKZF1-NM_006060.6.npy,1,519,519
+NP_006054.2,MDSQRELAEELRLYQSTLLQDGLKDLLDEKKFIDCTLKAGDKSLPCHRLILSACSPYFREYFLSEIDEAKKKEVVLDNVDPAILDLIIKYLYSASIDLNDGNVQDIFALASRFQIPSVFTVCVSYLQKRLAPGNCLAILRLGLLLDCPRLAISAREFVSDRFVQICKEEDFMQLSPQELISVISNDSLNVEKEEAVFEAVMKWVRTDKENRVKNLSEVFDCIRFRLMTEKYFKDHVEKDDIIKSNPDLQKKIKVLKDAFAGKLPEPSKNAAKTGAGEVNGDVGDEDLLPGYLNDIPRHGMFVKDLILLVNDTAAVAYDPTENECYLTALAEQIPRNHSSIVTQQNQIYVVGGLYVDEENKDQPLQSYFFQLDSIASEWVGLPPLPSARCLFGLGEVDDKIYVVAGKDLQTEASLDSVLCYDPVAAKWNEVKKLPIKVYGHNVISHKGMIYCLGGKTDDKKCTNRVFIFNPKKGDWKDLAPMKIPRSMFGVAVHKGKIVIAGGVTEDGLSASVEAFDLTTNKWDVMTEFPQERSSISLVSLAGSLYAIGGFAMIQLESKEFAPTEVNDIWKYEDDKKEWAGMLKEIRYASGASCLATRLNLFKLSKL,606,NP_006054.2.csv,refseq-KLHL41-NM_006063.2_clinical_seed_0_final,refseq-KLHL41-NM_006063.2.a2m,Invitae,refseq-KLHL41-NM_006063.2.npy,1,606,606
+NP_006061.2,MNGQLDLSGKLIIKAQLGEDIRRIPIHNEDITYDELVLMMQRVFRGKLLSNDEVTIKYKDEDGDLITIFDSSDLSFAIQCSRILKLTLFVNGQPRPLESSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNIPENDTVDGREEKSASDSSGKQSTQVMAASMSAFDPLKNQDEINKNVMSAFGLTDDQVSGPPSAPAEDRSGTPDSIASSSSAAHPPGVQPQQPPYTGAQTQAGQIEGQMYQQYQQQAGYGAQQPQAPPQQPQQYGIQYSASYSQQTGPQQPQQFQGYGQQPTSQAPAPAFSGQPQQLPAQPPQQYQASNYPAQTYTAQTSQPTNYTVAPASQPGMAPSQPGAYQPRPGFTSLPGSTMTPPPSGPNPYARNRPPFGQGYTQPGPGYR,400,NP_006061.2.csv,refseq-TFG-NM_006070.5_clinical_seed_0_final,refseq-TFG-NM_006070.5.a2m,Invitae,refseq-TFG-NM_006070.5.npy,1,400,400
+NP_006068.2,MFRLNSLSALAELAVGSRWYHGGSQPIQIRRRLMMVAFLGASAVTASTGLLWKRAHAESPPCVDNLKSDIGDKGKNKDEGDVCNHEKKTADLAPHPEEKKKKRSGFRDRKVMEYENRIRAYSTPDKIFRYFATLKVISEPGEAEVFMTPEDFVRSITPNEKQPEHLGLDQYIIKRFDGKTEKISQEREKFADEGSIFYTLGECGLISFSDYIFLTTVLSTPQRNFEIAFKMFDLNGDGEVDMEEFEQVQSIIRSQTSMGMRHRDRPTTGNTLKSGLCSALTTYFFGADLKGKLTIKNFLEFQRKLQHDVLKLEFERHDPVDGRITERQFGGMLLAYSGVQSKKLTAMQRQLKKHFKEGKGLTFQEVENFFTFLKNINDVDTALSFYHMAGASLDKVTMQQVARTVAKVELSDHVCDVVFALFDCDGNGELSNKEFVSIMKQRLMRGLEKPKDMGFTRLMQAMWKCAQETAWDFALPKQ,478,NP_006068.2.csv,refseq-MICU1-NM_006077.3_clinical_seed_0_final,refseq-MICU1-NM_006077.3.a2m,Invitae,refseq-MICU1-NM_006077.3.npy,1,478,478
+NP_006070.2,MADHMMAMNHGRFPDGTNGLHHHPAHRMGMGQFPSPHHHQQQQPQHAFNALMGEHIHYGAGNMNATSGIRHAMGPGTVNGGHPPSALAPAARFNNSQFMGPPVASQGGSLPASMQLQKLNNQYFNHHPYPHNHYMPDLHPAAGHQMNGTNQHFRDCNPKHSGGSSTPGGSGGSSTPGGSGSSSGGGAGSSNSGGGSGSGNMPASVAHVPAAMLPPNVIDTDFIDEEVLMSLVIEMGLDRIKELPELWLGQNEFDFMTDFVCKQQPSRVSC,270,NP_006070.2.csv,refseq-CITED2-NM_006079.4_clinical_seed_0_final,refseq-CITED2-NM_006079.4.a2m,Invitae,refseq-CITED2-NM_006079.4.npy,1,270,270
+NP_006077.2,MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPSGNYVGDSDLQLERISVYYNEASSHKYVPRAILVDLEPGTMDSVRSGAFGHLFRPDNFIFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKECENCDCLQGFQLTHSLGGGTGSGMGTLLISKVREEYPDRIMNTFSVVPSPKVSDTVVEPYNATLSIHQLVENTDETYCIDNEALYDICFRTLKLATPTYGDLNHLVSATMSGVTTSLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTARGSQQYRALTVPELTQQMFDAKNMMAACDPRHGRYLTVATVFRGRMSMKEVDEQMLAIQSKNSSYFVEWIPNNVKVAVCDIPPRGLKMSSTFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATAEEEGEMYEDDEEESEAQGPK,450,NP_006077.2.csv,refseq-TUBB3-NM_006086.3_clinical_seed_0_final,refseq-TUBB3-NM_006086.3.a2m,Invitae,refseq-TUBB3-NM_006086.3.npy,1,450,450
+NP_006078.2,MREIVHLQAGQCGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQLERINVYYNEATGGNYVPRAVLVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDAVLDVVRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEFPDRIMNTFSVVPSPKVSDTVVEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLSVQSKNSSYFVEWIPNNVKTAVCDIPPRGLKMAATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATAEEGEFEEEAEEEVA,444,NP_006078.2.csv,refseq-TUBB4A-NM_006087.2_clinical_seed_0_final,refseq-TUBB4A-NM_006087.2.a2m,Invitae,refseq-TUBB4A-NM_006087.2.npy,1,444,444
+NP_006112.3,MSRQFSSRSGYRSGGGFSSGSAGIINYQRRTTSSSTRRSGGGGGRFSSCGGGGGSFGAGGGFGSRSLVNLGGSKSISISVARGGGRGSGFGGGYGGGGFGGGGFGGGGFGGGGIGGGGFGGFGSGGGGFGGGGFGGGGYGGGYGPVCPPGGIQEVTINQSLLQPLNVEIDPEIQKVKSREREQIKSLNNQFASFIDKVRFLEQQNQVLQTKWELLQQVDTSTRTHNLEPYFESFINNLRRRVDQLKSDQSRLDSELKNMQDMVEDYRNKYEDEINKRTNAENEFVTIKKDVDGAYMTKVDLQAKLDNLQQEIDFLTALYQAELSQMQTQISETNVILSMDNNRSLDLDSIIAEVKAQYEDIAQKSKAEAESLYQSKYEELQITAGRHGDSVRNSKIEISELNRVIQRLRSEIDNVKKQISNLQQSISDAEQRGENALKDAKNKLNDLEDALQQAKEDLARLLRDYQELMNTKLALDLEIATYRTLLEGEESRMSGECAPNVSVSVSTSHTTISGGGSRGGGGGGYGSGGSSYGSGGGSYGSGGGGGGGRGSYGSGGSSYGSGGGSYGSGGGGGGHGSYGSGSSSGGYRGGSGGGGGGSSGGRGSGGGSSGGSIGGRGSSSGGVKSSGGSSSVKFVSTTYSGVTR,644,NP_006112.3.csv,refseq-KRT1-NM_006121.4_clinical_seed_0_final,refseq-KRT1-NM_006121.4.a2m,Invitae,refseq-KRT1-NM_006121.4.npy,1,644,644
+NP_006114.1,MPPPRTGRGLLWLGLVLSSVCVALGSETQANSTTDALNVLLIIVDDLRPSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNFSTIPQYFKENGYVTMSVGKVFHPGISSNHTDDSPYSWSFPPYHPSSEKYENTKTCRGPDGELHANLLCPVDVLDVPEGTLPDKQSTEQAIQLLEKMKTSASPFFLAVGYHKPHIPFRYPKEFQKLYPLENITLAPDPEVPDGLPPVAYNPWMDIRQREDVQALNISVPYGPIPVDFQRKIRQSYFASVSYLDTQVGRLLSALDDLQLANSTIIAFTSDHGFLMRTNT,343,NP_006114.1.csv,refseq-IDS-NM_006123.4_clinical_seed_0_final,refseq-IDS-NM_006123.4.a2m,Invitae,refseq-IDS-NM_006123.4_theta_0.2.npy,1,343,343
+NP_006120.1,MPGVARLPLLLGLLLLPRPGRPLDLADYTYDLAEEDDSEPLNYKDPCKAAAFLGDIALDEEDLRAFQVQQAVDLRRHTARKSSIKAAVPGNTSTPSCQSTNGQPQRGACGRWRGRSRSRRAATSRPERVWPDGVIPFVIGGNFTGSQRAVFRQAMRHWEKHTCVTFLERTDEDSYIVFTYRPCGCCSYVGRRGGGPQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDRHVSIVRENIQPGQEYNFLKMEPQEVESLGETYDFDSIMHYARNTFSRGIFLDTIVPKYEVNGVKPPIGQRTRLSKGDIAQARKLYKCPACGETLQDSTGNFSSPEYPNGYSAHMHCVWRISVTPGEKIILNFTSLDLYRSRLCWYDYVEVRDGFWRKAPLRGRFCGSKLPEPIVSTDSRLWVEFRSSSNWVGKGFFAVYEAICGGDVKKDYGHIQSPNYPDDYRPSKVCIWRIQVSEGFHVGLTFQSFEIERHDSCAYDYLEVRDGHSESSTLIGRYCGYEKPDDIKSTSSRLWLKFVSDGSINKAGFAVNFFKEVDECSRPNRGGCEQRCLNTLGSYKCSCDPGYELAPDKRRCEAACGGFLTKLNGSITSPGWPKEYPPNKNCIWQLVAPTQYRISLQFDFFETEGNDVCKYDFVEVRSGLTADSKLHGKFCGSEKPEVITSQYNNMRVEFKSDNTVSKKGFKAHFFSDKDECSKDNGGCQQDCVNTFGSYECQCRSGFVLHDNKHDCKEAGCDHKVTSTSGTITSPNWPDKYPSKKECTWAISSTPGHRVKLTFMEMDIESQPECAYDHLEVFDGRDAKAPVLGRFCGSKKPEPVLATGSRMFLRFYSDNSVQRKGFQASHATECGGQVRADVKTKDLYSHAQFGDNNYPGGVDCEWVIVAEEGYGVELVFQTFEVEEETDCGYDYMELFDGYDSTAPRLGRYCGSGPPEEVYSAGDSVLVKFHSDDTITKKGFHLRYTSTKFQDTLHSRK,986,NP_006120.1.csv,refseq-BMP1-NM_006129.4_clinical_seed_0_final,refseq-BMP1-NM_006129.4.a2m,Invitae,refseq-BMP1-NM_006129.4.npy,1,986,986
+NP_006138.1,MALHPRRVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFKAWAVETGKYQEGVDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCDIPQPQGSIINPGSTGSAPWDEKDNDVDEEDEEDELDQSQHHVPIQDTFPFLNINGSPMAPASVGNCSVGNCSPEAVWPKTEPLEMEVPQAPIQPFYSSPELWISSLPMTDLDIKFQYRGKEYGQTMTVSNPQGCRLFYGDLGPMPDQEELFGPVSLEQVKFPGPEHITNEKQKLFTSKLLDVMDRGLILEVSGHAIYAIRLCQCKVYWSGPCAPSLVAPNLIERQKKVKLFCLETFLSDLIAHQKGQIEKQPPFEIYLCFGEEWPDGKPLERKLILVQVIPVVARMIYEMFSGDFTRSFDSGSVRLQISTPDIKDNIVAQLKQLYRILQTQESWQPMQPTPSMQLPPALPPQ,467,NP_006138.1.csv,refseq-IRF6-NM_006147.3_clinical_seed_0_final,refseq-IRF6-NM_006147.3.a2m,Invitae,refseq-IRF6-NM_006147.3.npy,1,467,467
+NP_006149.2,MSSFSYEPYYSTSYKRRYVETPRVHISSVRSGYSTARSAYSSYSAPVSSSLSVRRSYSSSSGSLMPSLENLDLSQVAAISNDLKSIRTQEKAQLQDLNDRFASFIERVHELEQQNKVLEAELLVLRQKHSEPSRFRALYEQEIRDLRLAAEDATNEKQALQGEREGLEETLRNLQARYEEEVLSREDAEGRLMEARKGADEAALARAELEKRIDSLMDEISFLKKVHEEEIAELQAQIQYAQISVEMDVTKPDLSAALKDIRAQYEKLAAKNMQNAEEWFKSRFTVLTESAAKNTDAVRAAKDEVSESRRLLKAKTLEIEACRGMNEALEKQLQELEDKQNADISAMQDTINKLENELRTTKSEMARYLKEYQDLLNVKMALDIEIAAYRKLLEGEETRLSFTSVGSITSGYSQSSQVFGRSAYGGLQTSSYLMSTRSFPSYYTSHVQEEQIEVEETIEAAKAEEAKDEPPSEGEAEEEEKDKEEAEEEEAAEEEEAAKEESEEAKEEEEGGEGEEGEETKEAEEEEKKVEGAGEEQAAKKKD,543,NP_006149.2.csv,NFL_HUMAN_b03_clinical_seed_0_final,NFL_HUMAN_b03.a2m,EVE,NFL_HUMAN_b03_theta_0.2.npy,1,543,543
+NP_006151.3,MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAPGPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPKKRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTEQGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYETLYAAAGGGGASPDYNSSEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPGSRPTGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAFFHN,382,NP_006151.3.csv,refseq-NEUROD2-NM_006160.3_clinical_seed_0_final,refseq-NEUROD2-NM_006160.3.a2m,Invitae,refseq-NEUROD2-NM_006160.3.npy,1,382,382
+NP_006155.2,MMDLELPPPGLPSQQDMDLIDILWRQDIDLGVSREVFDFSQRRKEYELEKQKKLEKERQEQLQKEQEKAFFAQLQLDEETGEFLPIQPAQHIQSETSGSANYSQVAHIPKSDALYFDDCMQLLAQTFPFVDDNEVSSATFQSLVPDIPGHIESPVFIATNQAQSPETSVAQVAPVDLDGMQQDIEQVWEELLSIPELQCLNIENDKLVETTMVPSPEAKLTEVDNYHFYSSIPSMEKEVGNCSPHFLNAFEDSFSSILSTEDPNQLTVNSLNSDATVNTDFGDEFYSAFIAEPSISNSMPSPATLSHSLSELLNGPIDVSDLSLCKAFNQNHPESTAEFNDSDSGISLNTSPSVASPEHSVESSSYGDTLLGLSDSEVEELDSAPGSVKQNGPKTPVHSSGDMVQPLSPSQGQSTHVHDAQCENTPEKELPVSPGHRKTPFTKDKHSSRLEAHLTRDELRAKALHIPFPVEKIINLPVVDFNEMMSKEQFNEAQLALIRDIRRRGKNKVAAQNCRKRKLENIVELEQDLDHLKDEKEKLLKEKGENDKSLHLLKKQLSTLYLEVFSMLRDEDGKPYSPSEYSLQQTRDGNVFLVPKSKKPDVKKN,605,NP_006155.2.csv,refseq-NFE2L2-NM_006164.4_clinical_seed_0_final,refseq-NFE2L2-NM_006164.4.a2m,Invitae,refseq-NFE2L2-NM_006164.4.npy,1,605,605
+NP_006168.1,MALPPSPLAMEYVNDFDLMKFEVKREPSEGRPGPPTASLGSTPYSSVPPSPTFSEPGMVGATEGTRPGLEELYWLATLQQQLGAGEALGLSPEEAMELLQGQGPVPVDGPHGYYPGSPEETGAQHVQLAERFSDAALVSMSVRELNRQLRGCGRDEALRLKQRRRTLKNRGYAQACRSKRLQQRRGLEAERARLAAQLDALRAEVARLARERDLYKARCDRLTSSGPGSGDPSHLFL,237,NP_006168.1.csv,refseq-NRL-NM_006177.3_clinical_seed_0_final,refseq-NRL-NM_006177.3.a2m,Invitae,refseq-NRL-NM_006177.3.npy,1,237,237
+NP_006170.1,MLPLPSCSLPILLLFLLPSVPIESQPPPSTLPPFLAPEWDLLSPRVVLSRGAPAGPPLLFLLEAGAFRESAGAPANRSRRGVSETAPASRRGELAVCDAVSGWVTDRRTAVDLRGREVEVLGEVPAAGGSPLRQYFFETRCKADNAEEGGPGAGGGGCRGVDRRHWVSECKAKQSYVRALTADAQGRVGWRWIRIDTACVCTLLSRTGRA,210,NP_006170.1.csv,refseq-NTF4-NM_006179.4_clinical_seed_0_final,refseq-NTF4-NM_006179.4.a2m,Invitae,refseq-NTF4-NM_006179.4.npy,1,210,210
+NP_006171.2,MSSWIRWHGPAMARLWGFCWLVVGFWRAAFACPTSCKCSASRIWCSDPSPGIVAFPRLEPNSVDPENITEIFIANQKRLEIINEDDVEAYVGLRNLTIVDSGLKFVAHKAFLKNSNLQHINFTRNKLTSLSRKHFRHLDLSELILVGNPFTCSCDIMWIKTLQEAKSSPDTQDLYCLNESSKNIPLANLQIPNCGLPSANLAAPNLTVEEGKSITLSCSVAGDPVPNMYWDVGNLVSKHMNETSHTQGSLRITNISSDDSGKQISCVAENLVGEDQDSVNLTVHFAPTITFLESPTSDHHWCIPFTVKGNPKPALQWFYNGAILNESKYICTKIHVTNHTEYHGCLQLDNPTHMNNGDYTLIAKNEYGKDEKQISAHFMGWPGIDDGANPNYPDVIYEDYGTAANDIGDTTNRSNEIPSTDVTDKTGREHLSVYAVVVIASVVGFCLLVMLFLLKLARHSKFGMKDFSWFGFGKVKSRQGVGPASVISNDDDSASPLHHISNGSNTPSSSEGGPDAVIIGMTKIPVIENPQYFGITNSQLKPDTFVQHIKRHNIVLKRELGEGAFGKVFLAECYNLCPEQDKILVAVKTLKDASDNARKDFHREAELLTNLQHEHIVKFYGVCVEGDPLIMVFEYMKHGDLNKFLRAHGPDAVLMAEGNPPTELTQSQMLHIAQQIAAGMVYLASQHFVHRDLATRNCLVGENLLVKIGDFGMSRDVYSTDYYRVGGHTMLPIRWMPPESIMYRKFTTESDVWSLGVVLWEIFTYGKQPWYQLSNNEVIECITQGRVLQRPRTCPQEVYELMLGCWQREPHMRKNIKGIHTLLQNLAKASPVYLDILG,838,NP_006171.2.csv,refseq-NTRK2-NM_006180.4_clinical_seed_0_final,refseq-NTRK2-NM_006180.4.a2m,Invitae,refseq-NTRK2-NM_006180.4.npy,1,838,838
+NP_006173.2,MILIPRMLLVLFLLLPILSSAKAQVNPAICRYPLGMSGGQIPDEDITASSQWSESTAAKYGRLDSEEGDGAWCPEIPVEPDDLKEFLQIDLHTLHFITLVGTQGRHAGGHGIEFAPMYKINYSRDGTRWISWRNRHGKQVLDGNSNPYDIFLKDLEPPIVARFVRFIPVTDHSMNVCMRVELYGCVWLDGLVSYNAPAGQQFVLPGGSIIYLNDSVYDGAVGYSMTEGLGQLTDGVSGLDDFTQTHEYHVWPGYDYVGWRNESATNGYIEIMFEFDRIRNFTTMKVHCNNMFAKGVKIFKEVQCYFRSEASEWEPNAISFPLVLDDVNPSARFVTVPLHHRMASAIKCQYHFADTWMMFSEITFQSDAAMYNNSEALPTSPMAPTTYDPMLKVDDSNTRILIGCLVAIIFILLAIIVIILWRQFWQKMLEKASRRMLDDEMTVSLSLPSDSSMFNNNRSSSPSEQGSNSTYDRIFPLRPDYQEPSRLIRKLPEFAPGEEESGCSGVVKPVQPSGPEGVPHYAEADIVNLQGVTGGNTYSVPAVTMDLLSGKDVAVEEFPRKLLTFKEKLGEGQFGEVHLCEVEGMEKFKDKDFALDVSANQPVLVAVKMLRADANKNARNDFLKEIKIMSRLKDPNIIHLLAVCITDDPLCMITEYMENGDLNQFLSRHEPPNSSSSDVRTVSYTNLKFMATQIASGMKYLSSLNFVHRDLATRNCLVGKNYTIKIADFGMSRNLYSGDYYRIQGRAVLPIRWMSWESILLGKFTTASDVWAFGVTLWETFTFCQEQPYSQLSDEQVIENTGEFFRDQGRQTYLPQPAICPDSVYKLMLSCWRRDTKNRPSFQEIHLLLLQQGDE,855,NP_006173.2.csv,refseq-DDR2-NM_006182.4_clinical_seed_0_final,refseq-DDR2-NM_006182.4.a2m,Invitae,refseq-DDR2-NM_006182.4_theta_0.2.npy,1,855,855
+NP_006177.1,MPCVQAQYGSSPQGASPASQSYSYHSSGEYSSDFLTPEFVKFSMDLTNTEITATTSLPSFSTFMDNYSTGYDVKPPCLYQMPLSGQQSSIKVEDIQMHNYQQHSHLPPQSEEMMPHSGSVYYKPSSPPTPTTPGFQVQHSPMWDDPGSLHNFHQNYVATTHMIEQRKTPVSRLSLFSFKQSPPGTPVSSCQMRFDGPLHVPMNPEPAGSHHVVDGQTFAVPNPIRKPASMGFPGLQIGHASQLLDTQVPSPPSRGSPSNEGLCAVCGDNAACQHYGVRTCEGCKGFFKRTVQKNAKYVCLANKNCPVDKRRRNRCQYCRFQKCLAVGMVKEVVRTDSLKGRRGRLPSKPKSPQEPSPPSPPVSLISALVRAHVDSNPAMTSLDYSRFQANPDYQMSGDDTQHIQQFYDLLTGSMEIIRGWAEKIPGFADLPKADQDLLFESAFLELFVLRLAYRSNPVEGKLIFCNGVVLHRLQCVRGFGEWIDSIVEFSSNLQNMNIDISAFSCIAALAMVTERHGLKEPKRVEELQNKIVNCLKDHVTFNNGGLNRPNYLSKLLGKLPELRTLCTQGLQRIFYLKLEDLVPPPAIIDKLFLDTLPF,598,NP_006177.1.csv,refseq-NR4A2-NM_006186.4_clinical_seed_0_final,refseq-NR4A2-NM_006186.4.a2m,Invitae,refseq-NR4A2-NM_006186.4.npy,1,598,598
+NP_006183.2,MKFTLGLGSRAWRVSWEGAAAAAAGPGAGGSALRCRAQRVSSPRLGRRGSRLSGALPLCLSRGGGGAQALPDCAGPSPGHPGHPGARQLAGPLAMEQTYGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPNVVKHIRDYKQGDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGSLAQPGPYEASKQPPSQPTLPYNHIYQYPYPSPVSPTGAKMGSHPGVPGTAGHVSIPRSWPSAHSVSNILGIRTFMEQTGALAGSEGTAYSPKMEDWAGVNRTAFPATPAVNGLEKPALEADIKYTQSASTLSAVGGFLPACAYPASNQHGVYSAPGGGYLAPGPPWPPAQGPPLAPPGAGVAVHGGELAAAMTFKHPSREGSLPAPAARPRTPSVAYTDCPSRPRPPRGSSPRTRARRERQADPGAQVCAAAPAIGTGRIGGLAEEEASAGPRGARPASPQAQPCLWPDPPHFLYWSGFLGFSELGF,534,NP_006183.2.csv,refseq-PAX1-NM_006192.4_clinical_seed_0_final,refseq-PAX1-NM_006192.4.a2m,Invitae,refseq-PAX1-NM_006192.4.npy,1,534,534
+NP_006185.1,MEPAFGEVNQLGGVFVNGRPLPNAIRLRIVELAQLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPTVVKHIRTYKQRDPGIFAWEIRDRLLADGVCDKYNVPSVSSISRILRNKIGNLAQQGHYDSYKQHQPTPQPALPYNHIYSYPSPITAAAAKVPTPPGVPAIPGSVAMPRTWPSSHSVTDILGIRSITDQVSDSSPYHSPKVEEWSSLGRNNFPAAAPHAVNGLEKGALEQEAKYGQAPNGLPAVGSFVSASSMAPYPTPAQVSPYMTYSAAPSGYVAGHGWQHAGGTSLSPHNCDIPASLAFKGMQAAREGSHSVTASAL,341,NP_006185.1.csv,refseq-PAX9-NM_006194.3_clinical_seed_0_final,refseq-PAX9-NM_006194.3.a2m,Invitae,refseq-PAX9-NM_006194.3.npy,1,341,341
+NP_006195.3,MGEINQVAVEKYLEENPQFAKEYFDRKLRVEVLGEIFKNSQVPVQSSMSFSELTQVEESALCLELLWTVQEEGGTPEQGVHRALQRLAHLLQADRCSMFLCRSRNGIPEVASRLLDVTPTSKFEDNLVGPDKEVVFPLDIGIVGWAAHTKKTHNVPDVKKNSHFSDFMDKQTGYVTKNLLATPIVVGKEVLAVIMAVNKVNASEFSKQDEEVFSKYLNFVSIILRLHHTSYMYNIESRRSQILMWSANKVFEELTDVERQFHKALYTVRSYLNCERYSIGLLDMTKEKEFYDEWPIKLGEVEPYKGPKTPDGREVNFYKIIDYILHGKEEIKVIPTPPADHWTLISGLPTYVAENGFICNMMNAPADEYFTFQKGPVDETGWVIKNVLSLPIVNKKEDIVGVATFYNRKDGKPFDEHDEYITETLTQFLGWSLLNTDTYDKMNKLENRKDIAQEMLMNQTKATPEEIKSILKFQEKLNVDVIDDCEEKQLVAILKEDLPDPRSAELYEFRFSDFPLTEHGLIKCGIRLFFEINVVEKFKVPVEVLTRWMYTVRKGYRAVTYHNWRHGFNVGQTMFTLLMTGRLKKYYTDLEAFAMLAAAFCHDIDHRGTNNLYQMKSTSPLARLHGSSILERHHLEYSKTLLQDESLNIFQNLNKRQFETVIHLFEVAIIATDLALYFKKRTMFQKIVDACEQMQTEEEAIKYVTVDPTKKEIIMAMMMTACDLSAITKPWEVQSQVALMVANEFWEQGDLERTVLQQQPIPMMDRNKRDELPKLQVGFIDFVCTFVYKEFSRFHKEITPMLSGLQNNRVEWKSLADEYDAKMKVIEEEAKKQEGGAEKAAEDSGGGDDKKSKTCLML,858,NP_006195.3.csv,refseq-PDE6C-NM_006204.3_clinical_seed_0_final,refseq-PDE6C-NM_006204.3.a2m,Invitae,refseq-PDE6C-NM_006204.3.npy,1,858,858
+NP_006197.1,MGTSHPAFLVLGCLLTGLSLILCQLSLPSILPNENEKVVQLNSSFSLRCFGESEVSWQYPMSEEESSDVEIRNEENNSGLFVTVLEVSSASAAHTGLYTCYYNHTQTEENELEGRHIYIYVPDPDVAFVPLGMTDYLVIVEDDDSAIIPCRTTDPETPVTLHNSEGVVPASYDSRQGFNGTFTVGPYICEATVKGKKFQTIPFNVYALKATSELDLEMEALKTVYKSGETIVVTCAVFNNEVVDLQWTYPGEVKGKGITMLEEIKVPSIKLVYTLTVPEATVKDSGDYECAARQATREVKEMKKVTISVHEKGFIEIKPTFSQLEAVNLHEVKHFVVEVRAYPPPRISWLKNNLTLIENLTEITTDVEKIQEIRYRSKLKLIRAKEEDSGHYTIVAQNEDAVKSYTFELLTQVPSSILDLVDDHHGSTGGQTVRCTAEGTPLPDIEWMICKDIKKCNNETSWTILANNVSNIITEIHSRDRSTVEGRVTFAKVEETIAVRCLAKNLLGAENRELKLVAPTLRSELTVAAAVLVLLVIVIISLIVLVVIWKQKPRYEIRWRVIESISPDGHEYIYVDPMQLPYDSRWEFPRDGLVLGRVLGSGAFGKVVEGTAYGLSRSQPVMKVAVKMLKPTARSSEKQALMSELKIMTHLGPHLNIVNLLGACTKSGPIYIITEYCFYGDLVNYLHKNRDSFLSHHPEKPKKELDIFGLNPADESTRSYVILSFENNGDYMDMKQADTTQYVPMLERKEVSKYSDIQRSLYDRPASYKKKSMLDSEVKNLLSDDNSEGLTLLDLLSFTYQVARGMEFLASKNCVHRDLAARNVLLAQGKIVKICDFGLARDIMHDSNYVSKGSTFLPVKWMAPESIFDNLYTTLSDVWSYGILLWEIFSLGGTPYPGMMVDSTFYNKIKSGYRMAKPDHATSEVYEIMVKCWNSEPEKRPSFYHLSEIVENLLPGQYKKSYEKIHLDFLKSDHPAVARMRVDSDNAYIGVTYKNEEDKLKDWEGGLDEQRLSADSGYIIPLPDIDPVPEEEDLGKRNRHSSQTSEESAIETGSSSSTFIKREDETIEDIDMMDDIGIDSSDLVEDSFL,1089,NP_006197.1.csv,refseq-PDGFRA-NM_006206.4_clinical_seed_0_final,refseq-PDGFRA-NM_006206.4.a2m,Invitae,refseq-PDGFRA-NM_006206.4.npy,1,1089,1089
+NP_006199.2,MERDGCAGGGSRGGEGGRAPREGPAGNGRDRGRSHAAEAPGDPQAAASLLAPMDVGEEPLEKAARARTAKDPNTYKVLSLVLSVCVLTTILGCIFGLKPSCAKEVKSCKGRCFERTFGNCRCDAACVELGNCCLDYQETCIEPEHIWTCNKFRCGEKRLTRSLCACSDDCKDKGDCCINYSSVCQGEKSWVEEPCESINEPQCPAGFETPPTLLFSLDGFRAEYLHTWGGLLPVISKLKKCGTYTKNMRPVYPTKTFPNHYSIVTGLYPESHGIIDNKMYDPKMNASFSLKSKEKFNPEWYKGEPIWVTAKYQGLKSGTFFWPGSDVEINGIFPDIYKMYNGSVPFEERILAVLQWLQLPKDERPHFYTLYLEEPDSSGHSYGPVSSEVIKALQRVDGMVGMLMDGLKELNLHRCLNLILISDHGMEQGSCKKYIYLNKYLGDVKNIKVIYGPAARLRPSDVPDKYYSFNYEGIARNLSCREPNQHFKPYLKHFLPKRLHFAKSDRIEPLTFYLDPQWQLALNPSERKYCGSGFHGSDNVFSNMQALFVGYGPGFKHGIEADTFENIEVYNLMCDLLNLTPAPNNGTHGSLNHLLKNPVYTPKHPKEVHPLVQCPFTRNPRDNLGCSCNPSILPIEDFQTQFNLTVAEEKIIKHETLPYGRPRVLQKENTICLLSQHQFMSGYSQDILMPLWTSYTVDRNDSFSTEDFSNCLYQDFRIPLSPVHKCSFYKNNTKVSYGFLSPPQLNKNSSGIYSEALLTTNIVPMYQSFQVIWRYFHDTLLRKYAEERNGVNVVSGPVFDFDYDGRCDSLENLRQKRRVIRNQEILIPTHFFIVLTSCKDTSQTPLHCENLDTLAFILPHRTDNSESCVHGKHDSSWVEELLMLHRARITDVEHITGLSFYQQRKEPVSDILKLKTHLPTFSQED,925,NP_006199.2.csv,refseq-ENPP1-NM_006208.2_clinical_seed_0_final,refseq-ENPP1-NM_006208.2.a2m,Invitae,refseq-ENPP1-NM_006208.2.npy,1,925,925
+NP_006205.1,MEQLRAAARLQIVLGHLGRPSAGAVVAHPTSGTISSASFHPQQFQYTLDNNVLTLEQRKFYEENGFLVIKNLVPDADIQRFRNEFEKICRKEVKPLGLTVMRDVTISKSEYAPSEKMITKVQDFQEDKELFRYCTLPEILKYVECFTGPNIMAMHTMLINKPPDSGKKTSRHPLHQDLHYFPFRPSDLIVCAWTAMEHISRNNGCLVVLPGTHKGSLKPHDYPKWEGGVNKMFHGIQDYEENKARVHLVMEKGDTVFFHPLLIHGSGQNKTQGFRKAISCHFASADCHYIDVKGTSQENIEKEVVGIAHKFFGAENSVNLKDIWMFRARLVKGERTNL,338,NP_006205.1.csv,refseq-PHYH-NM_006214.3_clinical_seed_0_final,refseq-PHYH-NM_006214.3.a2m,Invitae,refseq-PHYH-NM_006214.3.npy,1,338,338
+NP_006209.2,MPPRPSSGELWGIHLMPPRILVECLLPNGMIVTLECLREATLITIKHELFKEARKYPLHQLLQDESSYIFVSVTQEAEREEFFDETRRLCDLRLFQPFLKVIEPVGNREEKILNREIGFAIGMPVCEFDMVKDPEVQDFRRNILNVCKEAVDLRDLNSPHSRAMYVYPPNVESSPELPKHIYNKLDKGQIIVVIWVIVSPNNDKQKYTLKINHDCVPEQVIAEAIRKKTRSMLLSSEQLKLCVLEYQGKYILKVCGCDEYFLEKYPLSQYKYIRSCIMLGRMPNLMLMAKESLYSQLPMDCFTMPSYSRRISTATPYMNGETSTKSLWVINSALRIKILCATYVNVNIRDIDKIYVRTGIYHGGEPLCDNVNTQRVPCSNPRWNEWLNYDIYIPDLPRAARLCLSICSVKGRKGAKEEHCPLAWGNINLFDYTDTLVSGKMALNLWPVPHGLEDLLNPIGVTGSNPNKETPCLELEFDWFSSVVKFPDMSVIEEHANWSVSREAGFSYSHAGLSNRLARDNELRENDKEQLKAISTRDPLSEITEQEKDFLWSHRHYCVTIPEILPKLLLSVKWNSRDEVAQMYCLVKDWPPIKPEQAMELLDCNYPDPMVRGFAVRCLEKYLTDDKLSQYLIQLVQVLKYEQYLDNLLVRFLLKKALTNQRIGHFFFWHLKSEMHNKTVSQRFGLLLESYCRACGMYLKHLNRQVEAMEKLINLTDILKQEKKDETQKVQMKFLVEQMRRPDFMDALQGFLSPLNPAHQLGNLRLEECRIMSSAKRPLWLNWENPDIMSELLFQNNEIIFKNGDDLRQDMLTLQIIRIMENIWQNQGLDLRMLPYGCLSIGDCVGLIEVVRNSHTIMQIQCKGGLKGALQFNSHTLHQWLKDKNKGEIYDAAIDLFTRSCAGYCVATFILGIGDRHNSNIMVKDDGQLFHIDFGHFLDHKKKKFGYKRERVPFVLTQDFLIVISKGAQECTKTREFERFQEMCYKAYLAIRQHANLFINLFSMMLGSGMPELQSFDDIAYIRKTLALDKTEQEALEYFMKQMNDAHHGGWTTKMDWIFHTIKQHALN,1068,NP_006209.2.csv,refseq-PIK3CA-NM_006218.2_clinical_seed_0_final,refseq-PIK3CA-NM_006218.2.a2m,Invitae,refseq-PIK3CA-NM_006218.2.npy,1,1068,1068
+NP_006222.2,MSLRSGGRRRADPGADGEASRDDGATSSVSALKRLERSQWTDKMDLRFGFERLKEPGEKTGWLINMHPTEILDEDKRLGSAVDYYFIQDDGSRFKVALPYKPYFYIATRKGCEREVSSFLSKKFQGKIAKVETVPKEDLDLPNHLVGLKRNYIRLSFHTVEDLVKVRKEISPAVKKNREQDHASDAYTALLSSVLQRGGVITDEEETSKKIADQLDNIVDMREYDVPYHIRLSIDLKIHVAHWYNVRYRGNAFPVEITRRDDLVERPDPVVLAFDIETTKLPLKFPDAETDQIMMISYMIDGQGYLITNREIVSEDIEDFEFTPKPEYEGPFCVFNEPDEAHLIQRWFEHVQETKPTIMVTYNGDFFDWPFVEARAAVHGLSMQQEIGFQKDSQGEYKAPQCIHMDCLRWVKRDSYLPVGSHNLKAAAKAKLGYDPVELDPEDMCRMATEQPQTLATYSVSDAVATYYLYMKYVHPFIFALCTIIPMEPDEVLRKGSGTLCEALLMVQAFHANIIFPNKQEQEFNKLTDDGHVLDSETYVGGHVEALESGVFRSDIPCRFRMNPAAFDFLLQRVEKTLRHALEEEEKVPVEQVTNFEEVCDEIKSKLASLKDVPSRIECPLIYHLDVGAMYPNIILTNRLQPSAMVDEATCAACDFNKPGANCQRKMAWQWRGEFMPASRSEYHRIQHQLESEKFPPLFPEGPARAFHELSREEQAKYEKRRLADYCRKAYKKIHITKVEERLTTICQRENSFYVDTVRAFRDRRYEFKGLHKVWKKKLSAAVEVGDAAEVKRCKNMEVLYDSLQLAHKCILNSFYGYVMRKGARWYSMEMAGIVCFTGANIITQARELIEQIGRPLELDTDGIWCVLPNSFPENFVFKTTNVKKPKVTISYPGAMLNIMVKEGFTNDQYQELAEPSSLTYVTRSENSIFFEVDGPYLAMILPASKEEGKKLKKRYAVFNEDGSLAELKGFEVKRRGELQLIKIFQSSVFEAFLKGSTLEEVYGSVAKVADYWLDVLYSKAANMPDSELFELISENRSMSRKLEDYGEQKSTSISTAKRLAEFLGDQMVKDAGLSCRYIISRKPEGSPVTERAIPLAIFQAEPTVRKHFLRKWLKSSSLQDFDIRAILDWDYYIERLGSAIQKIITIPAALQQVKNPVPRVKHPDWLHKKLLEKNDVYKQKKISELFTLEGRRQVTMAEASEDSPRPSAPDMEDFGLVKLPHPAAPVTVKRKRVLWESQEESQDLTPTVPWQEILGQPPALGTSQEEWLVWLRFHKKKWQLQARQRLARRKRQRLESAEGVLRPGAIRDGPATGLGSFLRRTARSILDLPWQIVQISETSQAGLFRLWALVGSDLHCIRLSIPRVFYVNQRVAKAEEGASYRKVNRVLPRSNMVYNLYEYSVPEDMYQEHINEINAELSAPDIEGVYETQVPLLFRALVHLGCVCVVNKQLVRHLSGWEAETFALEHLEMRSLAQFSYLEPGSIRHIYLYHHAQAHKALFGIFIPSQRRASVFVLDTVRSNQMPSLGALYSAEHGLLLEKVGPELLPPPKHTFEVRAETDLKTICRAIQRFLLAYKEERRGPTLIAVQSSWELKRLASEIPVLEEFPLVPICVADKINYGVLDWQRHGARRMIRHYLNLDTCLSQAFEMSRYFHIPIGNLPEDISTFGSDLFFARHLQRHNHLLWLSPTARPDLGGKEADDNCLVMEFDDQATVEINSSGCYSTVCVELDLQNLAVNTILQSHHVNDMEGADSMGISFDVIQQASLEDMITGGQAASAPASYDETALCSNTFRILKSMVVGWVKEITQYHNIYADNQVMHFYRWLRSPSSLLHDPALHRTLHNMMKKLFLQLIAEFKRLGSSVIYANFNRIILCTKKRRVEDAIAYVEYITSSIHSKETFHSLTISFSRCWEFLLWMDPSNYGGIKGKVSSRIHCGLQDSQKAGGAEDEQENEDDEEERDGEEEEEAEESNVEDLLENNWNILQFLPQAASCQNYFLMIVSAYIVAVYHCMKDGLRRSAPGSTPVRRRGASQLSQEAEGAVGALPGMITFSQDYVANELTQSFFTITQKIQKKVTGSRNSTELSEMFPVLPGSHLLLNNPALEFIKYVCKVLSLDTNITNQVNKLNRDLLRLVDVGEFSEEAQFRDPCRSYVLPEVICRSCNFCRDLDLCKDSSFSEDGAVLPQWLCSNCQAPYDSSAIEMTLVEVLQKKLMAFTLQDLVCLKCRGVKETSMPVYCSCAGDFALTIHTQVFMEQIGIFRNIAQHYGMSYLLETLEWLLQKNPQLGH,2286,NP_006222.2.csv,refseq-POLE-NM_006231.3_clinical_seed_0_final,refseq-POLE-NM_006231.3.a2m,Invitae,refseq-POLE-NM_006231.3.npy,1,2286,2286
+NP_006236.1,MPYKLKKEKEPPKVAKCTAKPSSSGKDGGGENTEEAQPQPQPQPQPQAQSQPPSSNKRPSNSTPPPTQLSKIKYSGGPQIVKKERRQSSSRFNLSKNRELQKLPALKDSPTQEREELFIQKLRQCCVLFDFVSDPLSDLKFKEVKRAGLNEMVEYITHSRDVVTEAIYPEAVTMFSVNLFRTLPPSSNPTGAEFDPEEDEPTLEAAWPHLQLVYEFFLRFLESPDFQPNIAKKYIDQKFVLALLDLFDSEDPRERDFLKTILHRIYGKFLGLRAYIRRQINHIFYRFIYETEHHNGIAELLEILGSIINGFALPLKEEHKMFLIRVLLPLHKVKSLSVYHPQLAYCVVQFLEKESSLTEPVIVGLLKFWPKTHSPKEVMFLNELEEILDVIEPSEFSKVMEPLFRQLAKCVSSPHFQVAERALYYWNNEYIMSLISDNAARVLPIMFPALYRNSKSHWNKTIHGLIYNALKLFMEMNQKLFDDCTQQYKAEKQKGRFRMKEREEMWQKIEELARLNPQYPMFRAPPPLPPVYSMETETPTAEDIQLLKRTVETEAVQMLKDIKKEKVLLRRKSELPQDVYTIKALEAHKRAEEFLTASQEAL,602,NP_006236.1.csv,refseq-PPP2R5D-NM_006245.3_clinical_seed_0_final,refseq-PPP2R5D-NM_006245.3.a2m,Invitae,refseq-PPP2R5D-NM_006245.3.npy,1,602,602
+NP_006249.1,MGTLRDLQYALQEKIEELRQRDALIDELELELDQKDELIQKLQNELDKYRSVIRPATQQAQKQSASTLQGEPRTKRQAISAEPTAFDIQDLSHVTLPFYPKSPQSKDLIKEAILDNDFMKNLELSQIQEIVDCMYPVEYGKDSCIIKEGDVGSLVYVMEDGKVEVTKEGVKLCTMGPGKVFGELAILYNCTRTATVKTLVNVKLWAIDRQCFQTIMMRTGLIKHTEYMEFLKSVPTFQSLPEEILSKLADVLEETHYENGEYIIRQGARGDTFFIISKGTVNVTREDSPSEDPVFLRTLGKGDWFGEKALQGEDVRTANVIAAEAVTCLVIDRDSFKHLIGGLDDVSNKAYEDAEAKAKYEAEAAFFANLKLSDFNIIDTLGVGGFGRVELVQLKSEESKTFAMKILKKRHIVDTRQQEHIRSEKQIMQGAHSDFIVRLYRTFKDSKYLYMLMEACLGGELWTILRDRGSFEDSTTRFYTACVVEAFAYLHSKGIIYRDLKPENLILDHRGYAKLVDFGFAKKIGFGKKTWTFCGTPEYVAPEIILNKGHDISADYWSLGILMYELLTGSPPFSGPDPMKTYNIILRGIDMIEFPKKIAKNAANLIKKLCRDNPSERLGNLKNGVKDIQKHKWFEGFNWEGLRKGTLTPPIIPSVASPTDTSNFDSFPEDNDEPPPDDNSGWDIDF,686,NP_006249.1.csv,refseq-PRKG1-NM_006258.3_clinical_seed_0_final,refseq-PRKG1-NM_006258.3.a2m,Invitae,refseq-PRKG1-NM_006258.3.npy,1,686,686
+NP_006252.4,MEAERRRQAEKPKKGRVGSNLLPERHPATGTPTTTVDSSAPPCRRLPGAGGGRSRFSPQGGQRGRPHSRRRHRTTFSPVQLEQLESAFGRNQYPDIWARESLARDTGLSEARIQVWFQNRRAKQRKQERSLLQPLAHLSPAAFSSFLPESTACPYSYAAPPPPVTCFPHPYSHALPSQPSTGGAFALSHQSEDWYPTLHPAPAGHLPCPPPPPMLPLSLEPSKSWN,226,NP_006252.4.csv,refseq-PROP1-NM_006261.5_clinical_seed_0_final,refseq-PROP1-NM_006261.5.a2m,Invitae,refseq-PROP1-NM_006261.5.npy,1,226,226
+NP_006256.1,MFYAHFVLSKRGPLAKIWLAAHWDKKLTKAHVFECNLESSVESIISPKVKMALRTSGHLLLGVVRIYHRKAKYLLADCNEAFIKIKMAFRPGVVDLPEENREAAYNAITLPEEFHDFDQPLPDLDDIDVAQQFSLNQSRVEEITMREEVGNISILQENDFGDFGMDDREIMREGSAFEDDDMLVSTTTSNLLLESEQSTSNLNEKINHLEYEDQYKDDNFGEGNDGGILDDKLISNNDGGIFDDPPALSEAGVMLPEQPAHDDMDEDDNVSMGGPDSPDSVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQQHQQRDVIDEPIIEEPSRLQESVMEASRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQQVEQMEIPPVELPPEEPPNICQLIPELELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQEERRWNKRTQQMLHGLQRALAKTGAESISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELTQEEPYSDIIATPGPRFHII,631,NP_006256.1.csv,refseq-RAD21-NM_006265.2_clinical_seed_0_final,refseq-RAD21-NM_006265.2.a2m,Invitae,refseq-RAD21-NM_006265.2.npy,1,631,631
+NP_006259.1,MAAVVENVVKLLGEQYYKDAMEQCHNYNARLCAERSVRLPFLDSQTGVAQSNCYIWMEKRHRGPGLASGQLYSYPARRWRKKRRAHPPEDPRLSFPSIKPDTDQTLKKEGLISQDGSSLEALLRTDPLEKRGAPDPRVDDDSLGEFPVTNSRARKRILEPDDFLDDLDDEDYEEDTPKRRGKGKSKGKGVGSARKKLDASILEDRDKPYACDICGKRYKNRPGLSYHYAHSHLAEEEGEDKEDSQPPTPVSQRSEEQKSKKGPDGLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSCHLCLDLLKEKASIYQNQNSS,391,NP_006259.1.csv,refseq-DPF2-NM_006268.4_clinical_seed_0_final,refseq-DPF2-NM_006268.4.a2m,Invitae,refseq-DPF2-NM_006268.4.npy,1,391,391
+NP_006260.1,MSDTPSTGFSIIHPTSSEGQVPPPRHLSLTHPVVAKRISFYKSGDPQFGGVRVVVNPRSFKSFDALLDNLSRKVPLPFGVRNISTPRGRHSITRLEELEDGESYLCSHGRKVQPVDLDKARRRPRPWLSSRAISAHSPPHPVAVAAPGMPRPPRSLVVFRNGDPKTRRAVLLSRRVTQSFEAFLQHLTEVMQRPVVKLYATDGRRVPSLQAVILSSGAVVAAGREPFKPGNYDIQKYLLPARLPGISQRVYPKGNAKSESRKISTHMSSSSRSQIYSVSSEKTHNNDCYLDYSFVPEKYLALEKNDSQNLPIYPSEDDIEKSIIFNQDGTMTVEMKVRFRIKEEETIKWTTTVSKTGPSNNDEKSEMSFPGRTESRSSGLKLAACSFSADVSPMERSSNQEGSLAEEINIQMTDQVAETCSSASWENATVDTDIIQGTQDQAKHRFYRPPTPGLRRVRQKKSVIGSVTLVSETEVQEKMIGQFSYSEERESGENKSEYHMFTHSCSKMSSVSNKPVLVQINNNDQMEESSLERKKENSLLKSSAISAGVIEITSQKMLEMSHNNGLPSTISNNSIVEEDVVDCVVLDNKTGIKNFKTYGNTNDRFSPISADATHFSSNNSGTDKNISEAPASEASSTVTARIDRLINEFAQCGLTKLPKNEKKILSSVASKKKKKSRQQAINSRYQDGQLATKGILNKNERINTKGRITKEMIVQDSDSPLKGGILCEEDLQKSDTVIESNTFCSKSNLNSTISKNFHRNKLNTTQNSKVQGLLTKRKSRSLNKISLGAPKKREIGQRDKVFPHNESKYCKSTFENKSLFHVFNILEQKPKDFYAPQSQAEVASGYLRGMAKKSLVSKVTDSHITLKSQKKRKGDKVKASAILSKQHATTRANSLASLKKPDFPEAIAHHSIQNYIQSWLQNINPYPTLKPIKSAPVCRNETSVVNCSNNSFSGNDPHTNSGKISNFVMESNKHITKIAGLTGDNLCKEGDKSFIANDTGEEDLHETQVGSLNDAYLVPLHEHCTLSQSAINDHNTKSHIAAEKSGPEKKLVYQEINLARKRQSVEAAIQVDPIEEETPKDLLPVLMLHQLQASVPGIHKTQNGVVQMPGSLAGVPFHSAICNSSTNLLLAWLLVLNLKGSMNSFCQVDAHKATNKSSETLALLEILKHIAITEEADDLKAAVANLVESTTSHFGLSEKEQDMVPIDLSANCSTVNIQSVPKCSENERTQGISSLDGGCSASEACAPEVCVLEVTCSPCEMCTVNKAYSPKETCNPSDTFFPSDGYGVDQTSMNKACFLGEVCSLTDTVFSDKACAQKENHTYEGACPIDETYVPVNVCNTIDFLNSKENTYTDNLDSTEELERGDDIQKDLNILTDPEYKNGFNTLVSHQNVSNLSSCGLCLSEKEAELDKKHSSLDDFENCSLRKFQDENAYTSFDMEEPRTSEEPGSITNSMTSSERNISELESFEELENHDTDIFNTVVNGGEQATEELIQEEVEASKTLELIDISSKNIMEEKRMNGIIYEIISKRLATPPSLDFCYDSKQNSEKETNEGETKMVKMMVKTMETGSYSESSPDLKKCIKSPVTSDWSDYRPDSDSEQPYKTSSDDPNDSGELTQEKEYNIGFVKRAIEKLYGKADIIKPSFFPGSTRKSQVCPYNSVEFQCSRKASLYDSEGQSFGSSEQVSSSSSMLQEFQEERQDKCDVSAVRDNYCRGDIVEPGTKQNDDSRILTDIEEGVLIDKGKWLLKENHLLRMSSENPGMCGNADTTSVDTLLDNNSSEVPYSHFGNLAPGPTMDELSSSELEELTQPLELKCNYFNMPHGSDSEPFHEDLLDVRNETCAKERIANHHTEEKGSHQSERVCTSVTHSFISAGNKVYPVSDDAIKNQPLPGSNMIHGTLQEADSLDKLYALCGQHCPILTVIIQPMNEEDRGFAYRKESDIENFLGFYLWMKIHPYLLQTDKNVFREENNKASMRQNLIDNAIGDIFDQFYFSNTFDLMGKRRKQKRINFLGLEEEGNLKKFQPDLKERFCMNFLHTSLLVVGNVDSNTQDLSGQTNEIFKAVDENNNLLNNRFQGSRTNLNQVVRENINCHYFFEMLGQACLLDICQVETSLNISNRNILELCMFEGENLFIWEEEDILNLTDLESSREQEDL,2156,NP_006260.1.csv,refseq-RP1-NM_006269.1_clinical_seed_0_final,refseq-RP1-NM_006269.1.a2m,Invitae,refseq-RP1-NM_006269.1.npy,1,2156,2156
+NP_006281.1,MAEQVLPQALYLSNMRKAVKIRERTPEDIFKPTNGIIHHFKTMHRYTLEMFRTCQFCPQFREIIHKALIDRNIQATLESQKKLNWCREVRKLVALKTNGDGNCLMHATSQYMWGVQDTDLVLRKALFSTLKETDTRNFKFRWQLESLKSQEFVETGLCYDTRNWNDEWDNLIKMASTDTPMARSGLQYNSLEEIHIFVLCNILRRPIIVISDKMLRSLESGSNFAPLKVGGIYLPLHWPAQECYRYPIVLGYDSHHFVPLVTLKDSGPEIRAVPLVNRDRGRFEDLKVHFLTDPENEMKEKLLKEYLMVIEIPVQGWDHGTTHLINAAKLDEANLPKEINLVDDYFELVQHEYKKWQENSEQGRREGHAQNPMEPSVPQLSLMDVKCETPNCPFFMSVNTQPLCHECSERRQKNQNKLPKLNSKPGPEGLPGMALGASRGEAYEPLAWNPEESTGGPHSAPPTAPSPFLFSETTAMKCRSPGCPFTLNVQHNGFCERCHNARQLHASHAPDHTRHLDPGKCQACLQDVTRTFNGICSTCFKRTTAEASSSLSTSLPPSCHQRSKSDPSRLVRSPSPHSCHRAGNDAPAGCLSQAARTPGDRTGTSKCRKAGCVYFGTPENKGFCTLCFIEYRENKHFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQNQRFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPNQRMGPGAHRGEPAPEDPPKQRCRAPACDHFGNAKCNGYCNECFQFKQMYG,790,NP_006281.1.csv,refseq-TNFAIP3-NM_006290.3_clinical_seed_0_final,refseq-TNFAIP3-NM_006290.3.a2m,Invitae,refseq-TNFAIP3-NM_006290.3.npy,1,790,790
+NP_006286.1,MSTLYVSPHPDAFPSLRALIAARYGEAGEGPGWGGAHPRICLQPPPTSRTPFPPPRLPALEQGPGGLWVWGATAVAQLLWPAGLGGPGGSRAAVLVQQWVSYADTELIPAACGATLPALGLRSSAQDPQAVLGALGRALSPLEEWLRLHTYLAGEAPTLADLAAVTALLLPFRYVLDPPARRIWNNVTRWFVTCVRQPEFRAVLGEVVLYSGARPLSHQPGPEAPALPKTAAQLKKEAKKREKLEKFQQKQKIQQQQPPPGEKKPKPEKREKRDPGVITYDLPTPPGEKKDVSGPMPDSYSPRYVEAAWYPWWEQQGFFKPEYGRPNVSAANPRGVFMMCIPPPNVTGSLHLGHALTNAIQDSLTRWHRMRGETTLWNPGCDHAGIATQVVVEKKLWREQGLSRHQLGREAFLQEVWKWKEEKGDRIYHQLKKLGSSLDWDRACFTMDPKLSAAVTEAFVRLHEEGIIYRSTRLVNWSCTLNSAISDIEVDKKELTGRTLLSVPGYKEKVEFGVLVSFAYKVQGSDSDEEVVVATTRIETMLGDVAVAVHPKDTRYQHLKGKNVIHPFLSRSLPIVFDEFVDMDFGTGAVKITPAHDQNDYEVGQRHGLEAISIMDSRGALINVPPPFLGLPRFEARKAVLVALKERGLFRGIEDNPMVVPLCNRSKDVVEPLLRPQWYVRCGEMAQAASAAVTRGDLRILPEAHQRTWHAWMDNIREWCISRQLWWGHRIPAYFVTVSDPAVPPGEDPDGRYWVSGRNEAEAREKAAKEFGVSPDKISLQQDEDVLDTWFSSGLFPLSILGWPNQSEDLSVFYPGTLLETGHDILFFWVARMVMLGLKLTGRLPFREVYLHAIVRDAHGRKMSKSLGNVIDPLDVIYGISLQGLHNQLLNSNLDPSEVEKAKEGQKADFPAGIPECGTDALRFGLCAYMSQGRDINLDVNRILGYRHFCNKLWNATKFALRGLGKGFVPSPTSQPGGHESLVDRWIRSRLTEAVRLSNQGFQAYDFPAVTTAQYSFWLYELCDVYLECLKPVLNGVDQVAAECARQTLYTCLDVGLRLLSPFMPFVTEELFQRLPRRMPQAPPSLCVTPYPEPSECSWKDPEAEAALELALSITRAVRSLRADYNLTRIRPDCFLEVADEATGALASAVSGYVQALASAGVVAVLALGAPAPQGCAVALASDRCSIHLQLQGLVDPARELGKLQAKRVEAQRQAQRLRERRAASGYPVKVPLEVQEADEAKLQQTEAELRKVDEAIALFQKML,1264,NP_006286.1.csv,refseq-VARS-NM_006295.2_clinical_seed_0_final,refseq-VARS-NM_006295.2.a2m,Invitae,refseq-VARS-NM_006295.2.npy,1,1264,1264
+NP_006293.2,MARGERRRRAVPAEGVRTAERAARGGPGRRDGRGGGPRSTAGGVALAVVVLSLALGMSGRWVLAWYRARRAVTLHSAPPVLPADSSSPAVAPDLFWGTYRPHVYFGMKTRSPKPLLTGLMWAQQGTTPGTPKLRHTCEQGDGVGPYGWEFHDGLSFGRQHIQDGALRLTTEFVKRPGGQHGGDWSWRVTVEPQDSGTSALPLVSLFFYVVTDGKEVLLPEVGAKGQLKFISGHTSELGDFRFTLLPPTSPGDTAPKYGSYNVFWTSNPGLPLLTEMVKSRLNSWFQHRPPGAPPERYLGLPGSLKWEDRGPSGQGQGQFLIQQVTLKIPISIEFVFESGSAQAGGNQALPRLAGSLLTQALESHAEGFRERFEKTFQLKEKGLSSGEQVLGQAALSGLLGGIGYFYGQGLVLPDIGVEGSEQKVDPALFPPVPLFTAVPSRSFFPRGFLWDEGFHQLVVQRWDPSLTREALGHWLGLLNADGWIGREQILGDEARARVPPEFLVQRAVHANPPTLLLPVAHMLEVGDPDDLAFLRKALPRLHAWFSWLHQSQAGPLPLSYRWRGRDPALPTLLNPKTLPSGLDDYPRASHPSVTERHLDLRCWVALGARVLTRLAEHLGEAEVAAELGPLAASLEAAESLDELHWAPELGVFADFGNHTKAVQLKPRPPQGLVRVVGRPQPQLQYVDALGYVSLFPLLLRLLDPTSSRLGPLLDILADSRHLWSPFGLRSLAASSSFYGQRNSEHDPPYWRGAVWLNVNYLALGALHHYGHLEGPHQARAAKLHGELRANVVGNVWRQYQATGFLWEQYSDRDGRGMGCRPFHGWTSLVLLAMAEDY,837,NP_006293.2.csv,refseq-MOGS-NM_006302.2_clinical_seed_0_final,refseq-MOGS-NM_006302.2.a2m,Invitae,refseq-MOGS-NM_006302.2.npy,1,837,837
+NP_006294.2,MPMYQVKPYHGGGAPLRVELPTCMYRLPNVHGRSYGPAPGAGHVQEESNLSLQALESRQDDILKRLYELKAAVDGLSKMIQTPDADLDVTNIIQADEPTTLTTNALDLNSVLGKDYGALKDIVINANPASPPLSLLVLHRLLCEHFRVLSTVHTHSSVKSVPENLLKCFGEQNKKQPRQDYQLGFTLIWKNVPKTQMKFSIQTMCPIEGEGNIARFLFSLFGQKHNAVNATLIDSWVDIAIFQLKEGSSKEKAAVFRSMNSALGKSPWLAGNELTVADVVLWSVLQQIGGCSVTVPANVQRWMRSCENLAPFNTALKLLK,320,NP_006294.2.csv,refseq-AIMP2-NM_006303.3_clinical_seed_0_final,refseq-AIMP2-NM_006303.3.a2m,Invitae,refseq-AIMP2-NM_006303.3.npy,1,320,320
+NP_006297.2,MGFLKLIEIENFKSYKGRQIIGPFQRFTAIIGPNGSGKSNLMDAISFVLGEKTSNLRVKTLRDLIHGAPVGKPAANRAFVSMVYSEEGAEDRTFARVIVGGSSEYKINNKVVQLHEYSEELEKLGILIKARNFLVFQGAVESIAMKNPKERTALFEEISRSGELAQEYDKRKKEMVKAEEDTQFNYHRKKNIAAERKEAKQEKEEADRYQRLKDEVVRAQVQLQLFKLYHNEVEIEKLNKELASKNKEIEKDKKRMDKVEDELKEKKKELGKMMREQQQIEKEIKEKDSELNQKRPQYIKAKENTSHKIKKLEAAKKSLQNAQKHYKKRKGDMDELEKEMLSVEKARQEFEERMEEESQSQGRDLTLEENQVKKYHRLKEEASKRAATLAQELEKFNRDQKADQDRLDLEERKKVETEAKIKQKLREIEENQKRIEKLEEYITTSKQSLEEQKKLEGELTEEVEMAKRRIDEINKELNQVMEQLGDARIDRQESSRQQRKAEIMESIKRLYPGSVYGRLIDLCQPTQKKYQIAVTKVLGKNMDAIIVDSEKTGRDCIQYIKEQRGEPETFLPLDYLEVKPTDEKLRELKGAKLVIDVIRYEPPHIKKALQYACGNALVCDNVEDARRIAFGGHQRHKTVALDGTLFQKSGVISGGASDLKAKARRWDEKAVDKLKEKKERLTEELKEQMKAKRKEAELRQVQSQAHGLQMRLKYSQSDLEQTKTRHLALNLQEKSKLESELANFGPRINDIKRIIQSREREMKDLKEKMNQVEDEVFEEFCREIGVRNIREFEEEKVKRQNEIAKKRLEFENQKTRLGIQLDFEKNQLKEDQDKVHMWEQTVKKDENEIEKLKKEEQRHMKIIDETMAQLQDLKNQHLAKKSEVNDKNHEMEEIRKKLGGANKEMTHLQKEVTAIETKLEQKRSDRHNLLQACKMQDIKLPLSKGTMDDISQEEGSSQGEDSVSGSQRISSIYAREALIEIDYGDLCEDLKDAQAEEEIKQEMNTLQQKLNEQQSVLQRIAAPNMKAMEKLESVRDKFQETSDEFEAARKRAKKAKQAFEQIKKERFDRFNACFESVATNIDEIYKALSRNSSAQAFLGPENPEEPYLDGINYNCVAPGKRFRPMDNLSGGEKTVAALALLFAIHSYKPAPFFVLDEIDAALDNTNIGKVANYIKEQSTCNFQAIVISLKEEFYTKAESLIGVYPEQGDCVISKVLTFDLTKYPDANPNPNEQ,1233,NP_006297.2.csv,refseq-SMC1A-NM_006306.2_clinical_seed_0_final,refseq-SMC1A-NM_006306.2.a2m,Invitae,refseq-SMC1A-NM_006306.2_theta_0.2.npy,1,1233,1233
+NP_006334.2,MGPAPLPLLLGLFLPALWRRAITEAREEAKPYPLFPGPFPGSLQTDHTPLLSLPHASGYQPALMFSPTQPGRPHTGNVAIPQVTSVESKPLPPLAFKHTVGHIILSEHKGVKFNCSISVPNIYQDTTISWWKDGKELLGAHHAITQFYPDDEVTAIIASFSITSVQRSDNGSYICKMKINNEEIVSDPIYIEVQGLPHFTKQPESMNVTRNTAFNLTCQAVGPPEPVNIFWVQNSSRVNEQPEKSPSVLTVPGLTEMAVFSCEAHNDKGLTVSKGVQINIKAIPSPPTEVSIRNSTAHSILISWVPGFDGYSPFRNCSIQVKEADPLSNGSVMIFNTSALPHLYQIKQLQALANYSIGVSCMNEIGWSAVSPWILASTTEGAPSVAPLNVTVFLNESSDNVDIRWMKPPTKQQDGELVGYRISHVWQSAGISKELLEEVGQNGSRARISVQVHNATCTVRIAAVTRGGVGPFSDPVKIFIPAHGWVDYAPSSTPAPGNADPVLIIFGCFCGFILIGLILYISLAIRKRVQETKFGNAFTEEDSELVVNYIAKKSFCRRAIELTLHSLGVSEELQNKLEDVVIDRNLLILGKILGEGEFGSVMEGNLKQEDGTSLKVAVKTMKLDNSSQREIEEFLSEAACMKDFSHPNVIRLLGVCIEMSSQGIPKPMVILPFMKYGDLHTYLLYSRLETGPKHIPLQTLLKFMVDIALGMEYLSNRNFLHRDLAARNCMLRDDMTVCVADFGLSKKIYSGDYYRQGRIAKMPVKWIAIESLADRVYTSKSDVWAFGVTMWEIATRGMTPYPGVQNHEMYDYLLHGHRLKQPEDCLDELYEIMYSCWRTDPLDRPTFSVLRLQLEKLLESLPDVRNQADVIYVNTQLLESSEGLAQGSTLAPLDLNIDPDSIIASCTPRAAISVVTAEVHDSKPHEGRYILNGGSEEWEDLTSAPSAAVTAEKNSVLPGERLVRNGVSWSHSSMLPLGSSLPDELLFADDSSEGSEVLM,999,NP_006334.2.csv,refseq-MERTK-NM_006343.2_clinical_seed_0_final,refseq-MERTK-NM_006343.2.a2m,Invitae,refseq-MERTK-NM_006343.2.npy,1,999,999
+NP_006339.4,MEGGGGSVAVAGLGARGSGAAAATVRELLQDGCYSDFLNEDFDVKTYTSQSIHQAVIAEQLAKLAQGISQLDRELHLQVVARHEDLLAQATGIESLEGVLQMMQTRIGALQGAVDRIKAKIVEPYNKIVARTAQLARLQVACDLLRRIIRILNLSKRLQGQLQGGSREITKAAQSLNELDYLSQGIDLSGIEVIENDLLFIARARLEVENQAKRLLEQGLETQNPTQVGTALQVFYNLGTLKDTITSVVDGYCATLEENINSALDIKVLTQPSQSAVRGGPGRSTMPTPGNTAALRASFWTNMEKLMDHIYAVCGQVQHLQKVLAKKRDPVSHICFIEEIVKDGQPEIFYTFWNSVTQALSSQFHMATNSSMFLKQAFEGEYPKLLRLYNDLWKRLQQYSQHIQGNFNASGTTDLYVDLQHMEDDAQDIFIPKKPDYDPEKALKDSLQPYEAAYLSKSLSRLFDPINLVFPPGGRNPPSSDELDGIIKTIASELNVAAVDTNLTLAVSKNVAKTIQLYSVKSEQLLSTQGDASQVIGPLTEGQRRNVAVVNSLYKLHQSVTKVVSSQSSFPLAAEQTIISALKAIHALMENAVQPLLTSVGDAIEAIIITMHQEDFSGSLSSSGKPDVPCSLYMKELQGFIARVMSDYFKHFECLDFVFDNTEAIAQRAVELFIRHASLIRPLGEGGKMRLAADFAQMELAVGPFCRRVSDLGKSYRMLRSFRPLLFQASEHVASSPALGDVIPFSIIIQFLFTRAPAELKSPFQRAEWSHTRFSQWLDDHPSEKDRLLLIRGALEAYVQSVRSREGKEFAPVYPIMVQLLQKAMSALQ,829,NP_006339.4.csv,NP_006339.4_colabfold_clinical_seed_0_final,NP_006339.4_colabfold.a2m,colabfold,NP_006339.4_colabfold_theta_0.2.npy,1,829,829
+NP_006350.1,MARRGWRRAPLRRGVGSSPRARRLMRPLWLLLAVGVFDWAGASDGGGGEARAMDEEIVSEKQAEESHRQDSANLLIFILLLTLTILTIWLFKHRRARFLHETGLAMIYGLLVGLVLRYGIHVPSDVNNVTLSCEVQSSPTTLLVTFDPEVFFNILLPPIIFYAGYSLKRRHFFRNLGSILAYAFLGTAISCFVIGSIMYGCVTLMKVTGQLAGDFYFTDCLLFGAIVSATDPVTVLAIFHELQVDVELYALLFGESVLNDAVAIVLSSSIVAYQPAGDNSHTFDVTAMFKSIGIFLGIFSGSFAMGAATGVVTALVTKFTKLREFQLLETGLFFLMSWSTFLLAEAWGFTGVVAVLFCGITQAHYTYNNLSTESQHRTKQLFELLNFLAENFIFSYMGLTLFTFQNHVFNPTFVVGAFVAIFLGRAANIYPLSLLLNLGRRSKIGSNFQHMMMFAGLRGAMAFALAIRDTATYARQMMFSTTLLIVFFTVWVFGGGTTAMLSCLHIRVGVDSDQEHLGVPENERRTTKAESAWLFRMWYNFDHNYLKPLLTHSGPPLTTTLPACCGPIARCLTSPQAYENQEQLKDDDSDLILNDGDISLTYGDSTVNTEPATSSAPRRFMGNSSEDALDRELAFGDHELVIRGTRLVLPMDDSEPPLNLLDNTRHGPA,669,NP_006350.1.csv,refseq-SLC9A6-NM_006359.2_clinical_seed_0_final,refseq-SLC9A6-NM_006359.2.a2m,Invitae,refseq-SLC9A6-NM_006359.2.npy,1,669,669
+NP_006354.2,MATYLEFIQQNEERDGVRFSWNVWPSSRLEATRMVVPLACLLTPLKERPDLPPVQYEPVLCSRPTCKAVLNPLCQVDYRAKLWACNFCFQRNQFPPAYGGISEVNQPAELMPQFSTIEYVIQRGAQSPLIFLYVVDTCLEEDDLQALKESLQMSLSLLPPDALVGLITFGRMVQVHELSCEGISKSYVFRGTKDLTAKQIQDMLGLTKPAMPMQQARPAQPQEHPFASSRFLQPVHKIDMNLTDLLGELQRDPWPVTQGKRPLRSTGVALSIAVGLLEGTFPNTGARIMLFTGGPPTQGPGMVVGDELKIPIRSWHDIEKDNARFMKKATKHYEMLANRTAANGHCIDIYACALDQTGLLEMKCCANLTGGYMVMGDSFNTSLFKQTFQRIFTKDFNGDFRMAFGATLDVKTSRELKIAGAIGPCVSLNVKGPCVSENELGVGGTSQWKICGLDPTSTLGIYFEVVNQHNTPIPQGGRGAIQFVTHYQHSSTQRRIRVTTIARNWADVQSQLRHIEAAFDQEAAAVLMARLGVFRAESEEGPDVLRWLDRQLIRLCQKFGQYNKEDPTSFRLSDSFSLYPQFMFHLRRSPFLQVFNNSPDESSYYRHHFARQDLTQSLIMIQPILYSYSFHGPPEPVLLDSSSILADRILLMDTFFQIVIYLGETIAQWRKAGYQDMPEYENFKHLLQAPLDDAQEILQARFPMPRYINTEHGGSQARFLLSKVNPSQTHNNLYAWGQETGAPILTDDVSLQVFMDHLKKLAVSSAC,767,NP_006354.2.csv,refseq-SEC23B-NM_006363.4_clinical_seed_0_final,refseq-SEC23B-NM_006363.4.a2m,Invitae,refseq-SEC23B-NM_006363.4.npy,1,767,767
+NP_006355.2,MTTYLEFIQQNEERDGVRFSWNVWPSSRLEATRMVVPVAALFTPLKERPDLPPIQYEPVLCSRTTCRAVLNPLCQVDYRAKLWACNFCYQRNQFPPSYAGISELNQPAELLPQFSSIEYVVLRGPQMPLIFLYVVDTCMEDEDLQALKESMQMSLSLLPPTALVGLITFGRMVQVHELGCEGISKSYVFRGTKDLSAKQLQEMLGLSKVPLTQATRGPQVQQPPPSNRFLQPVQKIDMNLTDLLGELQRDPWPVPQGKRPLRSSGVALSIAVGLLECTFPNTGARIMMFIGGPATQGPGMVVGDELKTPIRSWHDIDKDNAKYVKKGTKHFEALANRAATTGHVIDIYACALDQTGLLEMKCCPNLTGGYMVMGDSFNTSLFKQTFQRVFTKDMHGQFKMGFGGTLEIKTSREIKISGAIGPCVSLNSKGPCVSENEIGTGGTCQWKICGLSPTTTLAIYFEVVNQHNAPIPQGGRGAIQFVTQYQHSSGQRRIRVTTIARNWADAQTQIQNIAASFDQEAAAILMARLAIYRAETEEGPDVLRWLDRQLIRLCQKFGEYHKDDPSSFRFSETFSLYPQFMFHLRRSSFLQVFNNSPDESSYYRHHFMRQDLTQSLIMIQPILYAYSFSGPPEPVLLDSSSILADRILLMDTFFQILIYHGETIAQWRKSGYQDMPEYENFRHLLQAPVDDAQEILHSRFPMPRYIDTEHGGSQARFLLSKVNPSQTHNNMYAWGQESGAPILTDDVSLQVFMDHLKKLAVSSAA,765,NP_006355.2.csv,refseq-SEC23A-NM_006364.3_clinical_seed_0_final,refseq-SEC23A-NM_006364.3.a2m,Invitae,refseq-SEC23A-NM_006364.3.npy,1,765,765
+NP_006362.1,MEPGRRGAAALLALLCVACALRAGRAQYERYSFRSFPRDELMPLESAYRHALDKYSGEHWAESVGYLEISLRLHRLLRDSEAFCHRNCSAAPQPEPAAGLASYPELRLFGGLLRRAHCLKRCKQGLPAFRQSQPSREVLADFQRREPYKFLQFAYFKANNLPKAIAAAHTFLLKHPDDEMMKRNMAYYKSLPGAEDYIKDLETKSYESLFIRAVRAYNGENWRTSITDMELALPDFFKAFYECLAACEGSREIKDFKDFYLSIADHYVEVLECKIQCEENLTPVIGGYPVEKFVATMYHYLQFAYYKLNDLKNAAPCAVSYLLFDQNDKVMQQNLVYYQYHRDTWGLSDEHFQPRPEAVQFFNVTTLQKELYDFAKENIMDDDEGEVVEYVDDLLELEETS,401,NP_006362.1.csv,refseq-CRTAP-NM_006371.4_clinical_seed_0_final,refseq-CRTAP-NM_006371.4.a2m,Invitae,refseq-CRTAP-NM_006371.4.npy,1,401,401
+NP_006381.2,MDLNRIIQALKGTIDPKLRIAAENELNQSYKIINFAPSLLRIIVSDHVEFPVRQAAAIYLKNMVTQYWPDREPPPGEAIFPFNIHENDRQQIRDNIVEGIIRSPDLVRVQLTMCLRAIIKHDFPGHWPGVVDKIDYYLQSQSSASWLGSLLCLYQLVKTYEYKKAEEREPLIIAMQIFLPRIQQQIVQLLPDSSYYSVLLQKQILKIFYALVQYALPLQLVNNQTMTTWMEIFRTIIDRTVPPETLHIDEDDRPELVWWKCKKWALHIVARLFERYGSPGNVTKEYFEFSEFFLKTYAVGIQQVLLKILDQYRQKEYVAPRVLQQAFNYLNQGVVHSITWKQMKPHIQNISEDVIFSVMCYKDEDEELWQEDPYEYIRMKFDIFEDYASPTTAAQTLLYTAAKKRKEVLPKMMAFCYQILTDPNFDPRKKDGALHVIGSLAEILLKKSLFKDQMELFLQNHVFPLLLSNLGYLRARSCWVLHAFSSLKFHNELNLRNAVELAKKSLIEDKEMPVKVEAALALQSLISNQIQAKEYMKPHVRPIMQELLHIVRETENDDVTNVIQKMICEYSQEVASIAVDMTQHLAEIFGKVLQSDEYEEVEDKTVMAMGILHTIDTILTVVEDHKEITQQLENICLRIIDLVLQKHVIEFYEEILSLAYSLTCHSISPQMWQLLGILYEVFQQDCFEYFTDMMPLLHNYVTIDTDTLLSNAKHLEILFTMCRKVLCGDAGEDAECHAAKLLEVIILQCKGRGIDQCIPLFVQLVLERLTRGVKTSELRTMCLQVAIAALYYNPDLLLHTLERIQLPHNPGPITVQFINQWMNDTDCFLGHHDRKMCIIGLSILLELQNRPPAVDAVVGQIVPSILFLFLGLKQVCATRQLVNREDRSKAEKADMEENEEISSDEEETNVTAQAMQSNNGRGEDEEEEDDDWDEEVLEETALEGFSTPLDLDNSVDEYQFFTQALITVQSRDAAWYQLLMAPLSEDQRTALQEVYTLAEHRRTVAEAKKKIEQQGGFTFENKGVLSAFNFGTVPSNN,1037,NP_006381.2.csv,refseq-IPO8-NM_006390.3_clinical_seed_0_final,refseq-IPO8-NM_006390.3.a2m,Invitae,refseq-IPO8-NM_006390.3.npy,1,1037,1037
+NP_006386.1,MAAATGDPGLSKLQFAPFSSALDVGFWHELTQKKLNEYRLDEAPKDIKGYYYNGDSAGLPARLTLEFSAFDMSAPTPARCCPAIGTLYNTNTLESFKTADKKLLLEQAANEIWESIKSGTALENPVLLNKFLLLTFADLKKYHFYYWFCYPALCLPESLPLIQGPVGLDQRFSLKQIEALECAYDNLCQTEGVTALPYFLIKYDENMVLVSLLKHYSDFFQGQRTKITIGVYDPCNLAQYPGWPLRNFLVLAAHRWSSSFQSVEVVCFRDRTMQGARDVAHSIIFEVKLPEMAFSPDCPKAVGWEKNQKGGMGPRMVNLSECMDPKRLAESSVDLNLKLMCWRLVPTLDLDKVVSVKCLLLGAGTLGCNVARTLMGWGVRHITFVDNAKISYSNPVRQPLYEFEDCLGGGKPKALAAADRLQKIFPGVNARGFNMSIPMPGHPVNFSSVTLEQARRDVEQLEQLIESHDVVFLLMDTRESRWLPAVIAASKRKLVINAALGFDTFVVMRHGLKKPKQQGAGDLCPNHPVASADLLGSSLFANIPGYKLGCYFCNDVVAPGDSTRDRTLDQQCTVSRPGLAVIAGALAVELMVSVLQHPEGGYAIASSSDDRMNEPPTSLGLVPHQIRGFLSRFDNVLPVSLAFDKCTACSSKVLDQYEREGFNFLAKVFNSSHSFLEDLTGLTLLHQETQAAEIWDMSDDETI,703,NP_006386.1.csv,refseq-ATG7-NM_006395.2_clinical_seed_0_final,refseq-ATG7-NM_006395.2.a2m,Invitae,refseq-ATG7-NM_006395.2.npy,1,703,703
+NP_006388.2,MDLSELERDNTGRCRLSSPVPAVCRKEPCVLGVDEAGRGPVLGPMVYAICYCPLPRLADLEALKVADSKTLLESERERLFAKMEDTDFVGWALDVLSPNLISTSMLGRVKYNLNSLSHDTATGLIQYALDQGVNVTQVFVDTVGMPETYQARLQQSFPGIEVTVKAKADALYPVVSAASICAKVARDQAVKKWQFVEKLQDLDTDYGSGYPNDPKTKAWLKEHVEPVFGFPQFVRFSWRTAQTILEKEAEDVIWEDSASENQEGLRKITSYFLNEGSQARPRSSHRYFLERGLESATSL,299,NP_006388.2.csv,refseq-RNASEH2A-NM_006397.2_clinical_seed_0_final,refseq-RNASEH2A-NM_006397.2.a2m,Invitae,refseq-RNASEH2A-NM_006397.2.npy,1,299,299
+NP_006403.2,MELWPCLAAALLLLLLLVQLSRAAEFYAKVALYCALCFTVSAVASLVCLLRHGGRTVENMSIIGWFVRSFKYFYGLRFEVRDPRRLQEARPCVIVSNHQSILDMMGLMEVLPERCVQIAKRELLFLGPVGLIMYLGGVFFINRQRSSTAMTVMADLGERMVRENLKVWIYPEGTRNDNGDLLPFKKGAFYLAVQAQVPIVPVVYSSFSSFYNTKKKFFTSGTVTVQVLEAIPTSGLTAADVPALVDTCHRAMRTTFLHISKTPQENGATAGSGVQPAQ,278,NP_006403.2.csv,refseq-AGPAT2-NM_006412.3_clinical_seed_0_final,refseq-AGPAT2-NM_006412.3.a2m,Invitae,refseq-AGPAT2-NM_006412.3.npy,1,278,278
+NP_006406.1,MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTYKLQERSDLTVKEKEELIEEWQPEPLVPPVPKDHPALNYNIVSGPPSHKTVVNGKECINFASFNFLGLLDNPRVKAAALASLKKYGVGTCGPRGFYGTFDVHLDLEDRLAKFMKTEEAIIYSYGFATIASAIPAYSKRGDIVFVDRAACFAIQKGLQASRSDIKLFKHNDMADLERLLKEQEIEDQKNPRKARVTRRFIVVEGLYMNTGTICPLPELVKLKYKYKARIFLEESLSFGVLGEHGRGVTEHYGINIDDIDLISANMENALASIGGFCCGRSFVIDHQRLSGQGYCFSASLPPLLAAAAIEALNIMEENPGIFAVLKEKCGQIHKALQGISGLKVVGESLSPAFHLQLEESTGSREQDVRLLQEIVDQCMNRSIALTQARYLEKEEKCLPPPSIRVVVTVEQTEEELERAASTIKEVAQAVLL,473,NP_006406.1.csv,refseq-SPTLC1-NM_006415.3_clinical_seed_0_final,refseq-SPTLC1-NM_006415.3.a2m,Invitae,refseq-SPTLC1-NM_006415.3.npy,1,473,473
+NP_006412.2,MYEGKKTKNMFLTRALEKILADKEVKKAHHSQLRKACEVALEEIKAETEKQSPPHGEAKAGSSTLPPVKSKTNFIEADKYFLPFELACQSKCPRIVSTSLDCLQKLIAYGHLTGNAPDSTTPGKKLIDRIIETICGCFQGPQTDEGVQLQIIKALLTAVTSQHIEIHEGTVLQAVRTCYNIYLASKNLINQTTAKATLTQMLNVIFARMENQALQEAKQMEKERHRQHHHLLQSPVSHHEPESPQLRYLPPQTVDHISQEHEGDLDLHTNDVDKSLQDDTEPENGSDISSAENEQTEADQATAAETLSKNEVLYDGENHDCEEKPQDIVQNIVEEMVNIVVGDMGEGTTINASADGNIGTIEDGSDSENIQANGIPGTPISVAYTPSLPDDRLSVSSNDTQESGNSSGPSPGAKFSHILQKDAFLVFRSLCKLSMKPLSDGPPDPKSHELRSKILSLQLLLSILQNAGPIFRTNEMFINAIKQYLCVALSKNGVSSVPEVFELSLSIFLTLLSNFKTHLKMQIEVFFKEIFLYILETSTSSFDHKWMVIQTLTRICADAQSVVDIYVNYDCDLNAANIFERLVNDLSKIAQGRGSQELGMSNVQELSLRKKGLECLVSILKCMVEWSKDQYVNPNSQTTLGQEKPSEQEMSEIKHPETINRYGSLNSLESTSSSGIGSYSTQMSGTDNPEQFEVLKQQKEIIEQGIDLFNKKPKRGIQYLQEQGMLGTTPEDIAQFLHQEERLDSTQVGEFLGDNDKFNKEVMYAYVDQHDFSGKDFVSALRMFLEGFRLPGEAQKIDRLMEKFAARYLECNQGQTLFASADTAYVLAYSIIMLTTDLHSPQVKNKMTKEQYIKMNRGINDSKDLPEEYLSAIYNEIAGKKISMKETKELTIPTKSSKQNVASEKQRRLLYNLEMEQMAKTAKALMEAVSHVQAPFTSATHLEHVRPMFKLAWTPFLAAFSVGLQDCDDTEVASLCLEGIRCAIRIACIFSIQLERDAYVQALARFTLLTVSSGITEMKQKNIDTIKTLITVAHTDGNYLGNSWHEILKCISQLELAQLIGTGVKPRYISGTVRGREGSLTGTKDQAPDEFVGLGLVGGNVDWKQIASIQESIGETSSQSVVVAVDRIFTGSTRLDGNAIVDFVRWLCAVSMDELLSTTHPRMFSLQKIVEISYYNMGRIRLQWSRIWEVIGDHFNKVGCNPNEDVAIFAVDSLRQLSMKFLEKGELANFRFQKDFLRPFEHIMKRNRSPTIRDMVVRCIAQMVNSQAANIRSGWKNIFSVFHLAASDQDESIVELAFQTTGHIVTLVFEKHFPATIDSFQDAVKCLSEFACNAAFPDTSMEAIRLIRHCAKYVSDRPQAFKEYTSDDMNVAPEDRVWVRGWFPILFELSCIINRCKLDVRTRGLTVMFEIMKTYGHTYEKHWWQDLFRIVFRIFDNMKLPEQQTEKAEWMTTTCNHALYAICDVFTQYLEVLSDVLLDDIFAQLYWCVQQDNEQLARSGTNCLENVVILNGEKFTLEIWDKTCNCTLDIFKTTIPHALLTWRPNSGETAPPPPSPVSEKPLDTISQKSVDIHDSIQPRSVDNRPQAPLVSASAVNEEVSKIKSTAKFPEQKLFAALLIKCVVQLELIQTIDNIVFFPATSKKEDAENLAAAQRDAVDFDVRVDTQDQGMYRFLTSQQLFKLLDCLLESHRFAKAFNSNNEQRTALWKAGFKGKSKPNLLKQETSSLACGLRILFRMYMDESRVSAWEEVQQRLLNVCSEALSYFLTLTSESHREAWTNLLLLFLTKVLKISDNRFKAHASFYYPLLCEIMQFDLIPELRAVLRRFFLRIGVVFQISQPPEQELGINKQ,1849,NP_006412.2.csv,refseq-ARFGEF1-NM_006421.4_clinical_seed_0_final,refseq-ARFGEF1-NM_006421.4.a2m,Invitae,refseq-ARFGEF1-NM_006421.4.npy,1,1849,1849
+NP_006423.1,MRFLAATFLLLALSTAAQAEPVQFKDCGSVDGVIKEVNVSPCPTQPCQLSKGQSYSVNVTFTSNIQSKSSKAVVHGILMGVPVPFPIPEPDGCKSGINCPIQKDKTYSYLNKLPVKSEYPSIKLVVEWQLQDDKNQSLFCWEIPVQIVSHL,151,NP_006423.1.csv,refseq-NPC2-NM_006432.3_clinical_seed_0_final,refseq-NPC2-NM_006432.3.a2m,Invitae,refseq-NPC2-NM_006432.3.npy,1,151,151
+NP_006450.2,MNMTQARVLVAAVVGLVAVLLYASIHKIEEGHLAVYYRGGALLTSPSGPGYHIMLPFITTFRSVQTTLQTDEVKNVPCGTSGGVMIYIDRIEVVNMLAPYAVFDIVRNYTADYDKTLIFNKIHHELNQFCSAHTLQEVYIELFDQIDENLKQALQKDLNLMAPGLTIQAVRVTKPKIPEAIRRNFELMEAEKTKLLIAAQKQKVVEKEAETERKKAVIEAEKIAQVAKIRFQQKVMEKETEKRISEIEDAAFLAREKAKADAEYYAAHKYATSNKHKLTPEYLELKKYQAIASNSKIYFGSNIPNMFVDSSCALKYSDIRTGRESSLPSKEALEPSGENVIQNKESTG,348,NP_006450.2.csv,refseq-ERLIN1-NM_006459.3_clinical_seed_0_final,refseq-ERLIN1-NM_006459.3.a2m,Invitae,refseq-ERLIN1-NM_006459.3.npy,1,348,348
+NP_006454.1,MSDHGDVSLPPEDRVRALSQLGSAVEVNEDIPPRRYFRSGVEIIRMASIYSEEGNIEHAFILYNKYITLFIEKLPKHRDYKSAVIPEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTEYNEEKKKEAEELARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMIRNQELEKERLKIVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPVVDRSLKPGALSNSESIPTIDGLRHVVVPGRLCPQFLQLASANTARGVETCGILCGKLMRNEFTITHVLIPKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLPESVAIVCSPKFQETGFFKLTDHGLEEISSCRQKGFHPHSKDPPLFCSCSHVTVVDRAVTITDLR,424,NP_006454.1.csv,refseq-STAMBP-NM_006463.4_clinical_seed_0_final,refseq-STAMBP-NM_006463.4.a2m,Invitae,refseq-STAMBP-NM_006463.4.npy,1,424,424
+NP_006484.2,MAQEVDTAQGAEMRRGAGAARGRASWCWALALLWLAVVPGWSRVSGIPSRRHWPVPYKRFDFRPKPDPYCQAKYTFCPTGSPIPVMEGDDDIEVFRLQAPVWEFKYGDLLGHLKIMHDAIGFRSTLTGKNYTMEWYELFQLGNCTFPHLRPEMDAPFWCNQGAACFFEGIDDVHWKENGTLVQVATISGNMFNQMAKWVKQDNETGIYYETWNVKASPEKGAETWFDSYDCSKFVLRTFNKLAEFGAEFKNIETNYTRIFLYSGEPTYLGNETSVFGPTGNKTLGLAIKRFYYPFKPHLPTKEFLLSLLQIFDAVIVHKQFYLFYNFEYWFLPMKFPFIKITYEEIPLPIRNKTLSGL,358,NP_006484.2.csv,CLN5_HUMAN_b01_clinical_seed_0_final,CLN5_HUMAN_b01.a2m,EVE,CLN5_HUMAN_b01_theta_0.2.npy,1,358,358
+NP_006485.2,MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQGVIAWQGDYGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRFTYKFNFNKLVLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPRSPPACSSSSSSLFSAVVARRLGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFRGPPLARLPHDPGVFRVYPRPRGGPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLAYTPSPTLSPMYPSGGGGPSGSGGGSHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHYPGLVVPQPQRPDKCPLPPMAPETPPVPSSASSSSSSSSSPFKFKLQPPPLGRRQRAAGEKAVAGADKSGGSAGGLAEGAGALAPPPPPPQIKVEPISEGESEEVEVTDISDEDEEDGEVFKTPRAPPAPPKPEPGEAPGASQCMPLKLRFKRRWSEDCRLEGGGGPAGGFEDEGEDKKVRGEGPGEAGGPLTPRRVSSDLQHATAQLSLEHRDS,548,NP_006485.2.csv,refseq-ERF-NM_006494.3_clinical_seed_0_final,refseq-ERF-NM_006494.3.a2m,Invitae,refseq-ERF-NM_006494.3.npy,1,548,548
+NP_006487.1,MGCTLSAEDKAAVERSKMIDRNLREDGEKAAKEVKLLLLGAGESGKSTIVKQMKIIHEDGYSEDECKQYKVVVYSNTIQSIIAIIRAMGRLKIDFGEAARADDARQLFVLAGSAEEGVMTPELAGVIKRLWRDGGVQACFSRSREYQLNDSASYYLNDLDRISQSNYIPTQQDVLRTRVKTTGIVETHFTFKDLYFKMFDVGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTETSIILFLNKKDLFEEKIKRSPLTICYPEYTGSNTYEEAAAYIQCQFEDLNRRKDTKEIYTHFTCATDTKNVQFVFDAVTDVIIKNNLKECGLY,354,NP_006487.1.csv,refseq-GNAI3-NM_006496.3_clinical_seed_0_final,refseq-GNAI3-NM_006496.3.a2m,Invitae,refseq-GNAI3-NM_006496.3.npy,1,354,354
+NP_006504.2,MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWRRCRFRADNLNKLKNLCSKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQIKKVRLLIDEAILKCDAERIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWGDCTVRKKYSHVDLVVMVDGFEGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSRGYIPIYTPFFMRKEVMQEVAQLSQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPIAALHRDEWLRPEDLPIKYAGLSTCFRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHDNKSWEMFEEMITTAEEFYQSLGIPYHIVNIVSGSLNHAASKKLDLEAWFPGSGAFRELVSCSNCTDYQARRLRIRYGQTKKMMDKVEFVHMLNATMCATTRTICAILENYQTEKGITVPEKLKEFMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQNMEVTDA,514,NP_006504.2.csv,refseq-SARS-NM_006513.3_clinical_seed_0_final,refseq-SARS-NM_006513.3.a2m,Invitae,refseq-SARS-NM_006513.3.npy,1,514,514
+NP_006507.2,MEPSSKKLTGRLMLAVGGAVLGSLQFGYNTGVINAPQKVIEEFYNQTWVHRYGESILPTTLTTLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRNSMLMMNLLAFVSAVLMGFSKLGKSFEMLILGRFIIGVYCGLTTGFVPMYVGEVSPTALRGALGTLHQLGIVVGILIAQVFGLDSIMGNKDLWPLLLSIIFIPALLQCIVLPFCPESPRFLLINRNEENRAKSVLKKLRGTADVTHDLQEMKEESRQMMREKKVTILELFRSPAYRQPILIAVVLQLSQQLSGINAVFYYSTSIFEKAGVQQPVYATIGSGIVNTAFTVVSLFVVERAGRRTLHLIGLAGMAGCAILMTIALALLEQLPWMSYLSIVAIFGFVAFFEVGPGPIPWFIVAELFSQGPRPAAIAVAGFSNWTSNFIVGMCFQYVEQLCGPYVFIIFTVLLVLFFIFTYFKVPETKGRTFDEIASGFRQGGASQSDKTPEELFHPLGADSQV,492,NP_006507.2.csv,refseq-SLC2A1-NM_006516.2_clinical_seed_0_final,refseq-SLC2A1-NM_006516.2.a2m,Invitae,refseq-SLC2A1-NM_006516.2.npy,1,492,492
+NP_006508.2,MALQSQASEEAKGPWQEADQEQQEPVGSPEPESEPEPEPEPEPVPVPPPEPQPEPQPLPDPAPLPELEFESERVHEPEPTPTVETRGTARGFQPPEGGFGWVVVFAATWCNGSIFGIHNSVGILYSMLLEEEKEKNRQVEFQAAWVGALAMGMIFFCSPIVSIFTDRLGCRITATAGAAVAFIGLHTSSFTSSLSLRYFTYGILFGCGCSFAFQPSLVILGHYFQRRLGLANGVVSAGSSIFSMSFPFLIRMLGDKIKLAQTFQVLSTFMFVLMLLSLTYRPLLPSSQDTPSKRGVRTLHQRFLAQLRKYFNMRVFRQRTYRIWAFGIAAAALGYFVPYVHLMKYVEEEFSEIKETWVLLVCIGATSGLGRLVSGHISDSIPGLKKIYLQVLSFLLLGLMSMMIPLCRDFGGLIVVCLFLGLCDGFFITIMAPIAFELVGPMQASQAIGYLLGMMALPMIAGPPIAGLLRNCFGDYHVAFYFAGVPPIIGAVILFFVPLMHQRMFKKEQRDSSKDKMLAPDPDPNGELLPGSPNPEEPI,539,NP_006508.2.csv,refseq-SLC16A2-NM_006517.4_clinical_seed_0_final,refseq-SLC16A2-NM_006517.4.a2m,Invitae,refseq-SLC16A2-NM_006517.4.npy,1,539,539
+NP_006512.2,MSHAAEPARDGVEASAEGPRAVFVLLEERRPADSAQLLSLNSLLPESGIVADIELENVLDPDSFYELKSQPLPLRSSLPISLQATPATPATLSASSSAGGSRTPAMSSSSSSRVLLRQQLMRAQAQEQERRERREQAAAAPFPSPAPASPAISVVGVSAGGHTLSRPPPAQVPREVLKVQTHLENPTRYHLQQARRQQVKQYLSTTLGPKLASQALTPPPGPASAQPLPAPEAAHTTGPTGSAPNSPMALLTIGSSSEKEIDDVIDEIISLESSYNDEMLSYLPGGTTGLQLPSTLPVSGNLLDVYSSQGVATPAITVSNSCPAELPNIKREISETEAKALLKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSSDPEMRWNKGTILKASVDYIRKLQKEQQRSKDLESRQRSLEQANRSLQLRIQELELQAQIHGLPVPPTPGLLSLATTSASDSLKPEQLDIEEEGRPGAATFHVGGGPAQNAPHQQPPAPPSDALLDLHFPSDHLGDLGDPFHLGLEDILMEEEEGVVGGLSGGALSPLRAASDPLLSSVSPAVSKASSRRSSFSMEEES,575,NP_006512.2.csv,refseq-TFE3-NM_006521.4_clinical_seed_0_final,refseq-TFE3-NM_006521.4.a2m,Invitae,refseq-TFE3-NM_006521.4.npy,1,575,575
+NP_006525.2,MSGLGENLDPLASDSRKRKLPCDTPGQGLTCSGEKRRREQESKYIEELAELISANLSDIDNFNVKPDKCAILKETVRQIRQIKEQGKTISNDDDVQKADVSSTGQGVIDKDSLGPLLLQALDGFLFVVNRDGNIVFVSENVTQYLQYKQEDLVNTSVYNILHEEDRKDFLKNLPKSTVNGVSWTNETQRQKSHTFNCRMLMKTPHDILEDINASPEMRQRYETMQCFALSQPRAMMEEGEDLQSCMICVARRITTGERTFPSNPESFITRHDLSGKVVNIDTNSLRSSMRPGFEDIIRRCIQRFFSLNDGQSWSQKRHYQEAYLNGHAETPVYRFSLADGTIVTAQTKSKLFRNPVTNDRHGFVSTHFLQREQNGYRPNPNPVGQGIRPPMAGCNSSVGGMSMSPNQGLQMPSSRAYGLADPSTTGQMSGARYGGSSNIASLTPGPGMQSPSSYQNNNYGLNMSSPPHGSPGLAPNQQNIMISPRNRGSPKIASHQFSPVAGVHSPMASSGNTGNHSFSSSSLSALQAISEGVGTSLLSTLSSPGPKLDNSPNMNITQPSKVSNQDSKSPLGFYCDQNPVESSMCQSNSRDHLSDKESKESSVEGAENQRGPLESKGHKKLLQLLTCSSDDRGHSSLTNSPLDSSCKESSVSVTSPSGVSSSTSGGVSSTSNMHGSLLQEKHRILHKLLQNGNSPAEVAKITAEATGKDTSSITSCGDGNVVKQEQLSPKKKENNALLRYLLDRDDPSDALSKELQPQVEGVDNKMSQCTSSTIPSSSQEKDPKIKTETSEEGSGDLDNLDAILGDLTSSDFYNNSISSNGSHLGTKQQVFQGTNSLGLKSSQSVQSIRPPYNRAVSLDSPVSVGSSPPVKNISAFPMLPKQPMLGGNPRMMDSQENYGSSMGGPNRNVTVTQTPSSGDWGLPNSKAGRMEPMNSNSMGRPGGDYNTSLPRPALGGSIPTLPLRSNSIPGARPVLQQQQQMLQMRPGEIPMGMGANPYGQAAASNQLGSWPDGMLSMEQVSHGTQNRPLLRNSLDDLVGPPSNLEGQSDERALLDQLHTLLSNTDATGLEEIDRALGIPELVNQGQALEPKQDAFQGQEAAVMMDQKAGLYGQTYPAQGPPMQGGFHLQGQSPSFNSMMNQMNQQGNFPLQGMHPRANIMRPRTNTPKQLRMQLQQRLQGQQFLNQSRQALELKMENPTAGGAAVMRPMMQPQQGFLNAQMVAQRSRELLSHHFRQQRVAMMMQQQQQQQQQQQQQQQQQQQQQQQQQQQQQTQAFSPPPNVTASPSMDGLLAGPTMPQAPPQQFPYQPNYGMGQQPDPAFGRVSSPPNAMMSSRMGPSQNPMMQHPQAASIYQSSEMKGWPSGNLARNSSFSQQQFAHQGNPAVYSMVHMNGSSGHMGQMNMNPMPMSGMPMGPDQKYC,1420,NP_006525.2.csv,refseq-NCOA3-NM_006534.4_clinical_seed_0_final,refseq-NCOA3-NM_006534.4.a2m,Invitae,refseq-NCOA3-NM_006534.4.npy,1,1420,1420
+NP_006536.3,MGSGCRIECIFFSEFHPTLGPKITYQVPEDFISRELFDTVQVYIITKPELQNKLITVTAMEKKLIGCPVCIEHKKYSRNALLFNLGFVCDAQAKTCALEPIVKKLAGYLTTLELESSFVSMEESKQKLVPIMTILLEELNASGRCTLPIDESNTIHLKVIEQRPDPPVAQEYDVPVFTKDKEDFFNSQWDLTTQQILPYIDGFRHIQKISAEADVELNLVRIAIQNLLYYGVVTLVSILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGHKRASLRDVFQLYCSLSPGTTVRDLIGRHPQQLQHVDERKLIQFGLMKNLIRRLQKYPVRVTREEQSHPARLYTGCHSYDEICCKTGMSYHELDERLENDPNIIICWK,380,NP_006536.3.csv,refseq-NPRL2-NM_006545.4_clinical_seed_0_final,refseq-NPRL2-NM_006545.4.a2m,Invitae,refseq-NPRL2-NM_006545.4.npy,1,380,380
+NP_006554.1,MATAETALPSISTLTALGPFPDTQDDFLKWWRSEEAQDMGPGPPDPTEPPLHVKSEDQPGEEEDDERGADATWDLDLLLTNFSGPEPGGAPQTCALAPSEASGAQYPPPPETLGAYAGGPGLVAGLLGSEDHSGWVRPALRARAPDAFVGPALAPAPAPEPKALALQPVYPGPGAGSSGGYFPRTGLSVPAASGAPYGLLSGYPAMYPAPQYQGHFQLFRGLQGPAPGPATSPSFLSCLGPGTVGTGLGGTAEDPGVIAETAPSKRGRRSWARKRQAAHTCAHPGCGKSYTKSSHLKAHLRTHTGEKPYACTWEGCGWRFARSDELTRHYRKHTGQRPFRCQLCPRAFSRSDHLALHMKRHL,362,NP_006554.1.csv,refseq-KLF1-NM_006563.4_clinical_seed_0_final,refseq-KLF1-NM_006563.4.a2m,Invitae,refseq-KLF1-NM_006563.4.npy,1,362,362
+NP_006556.1,MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQTDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVNMEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPLPEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEGKDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCPRRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKSDLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQERHMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNEDEEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIENIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR,727,NP_006556.1.csv,refseq-CTCF-NM_006565.3_clinical_seed_0_final,refseq-CTCF-NM_006565.3.a2m,Invitae,refseq-CTCF-NM_006565.3.npy,1,727,727
+NP_006558.1,MVGSALRRGAHAYVYLVSKASHISRGHQHQAWGSRPPAAECATQRAPGSVVELLGKSYPQDDHSNLTRKVLTRVGRNLHNQQHHPLWLIKERVKEHFYKQYVGRFGTPLFSVYDNLSPVVTTWQNFDSLLIPADHPSRKKGDNYYLNRTHMLRAHTSAHQWDLLHAGLDAFLVVGDVYRRDQIDSQHYPIFHQLEAVRLFSKHELFAGIKDGESLQLFEQSSRSAHKQETHTMEAVKLVEFDLKQTLTRLMAHLFGDELEIRWVDCYFPFTHPSFEMEINFHGEWLEVLGCGVMEQQLVNSAGAQDRIGWAFGLGLERLAMILYDIPDIRLFWCEDERFLKQFCVSNINQKVKFQPLSKYPAVINDISFWLPSENYAENDFYDLVRTIGGDLVEKVDLIDKFVHPKTHKTSHCYRITYRHMERTLSQREVRHIHQALQEAAVQLLGVEGRF,451,NP_006558.1.csv,refseq-FARS2-NM_006567.3_clinical_seed_0_final,refseq-FARS2-NM_006567.3.a2m,Invitae,refseq-FARS2-NM_006567.3.npy,1,451,451
+NP_006570.1,MTTNAGPLHPYWPQHLRLDNFVPNDRPTWHILAGLFSVTGVLVVTTWLLSGRAAVVPLGTWRRLSLCWFAVCGFIHLVIEGWFVLYYEDLLGDQAFLSQLWKEYAKGDSRYILGDNFTVCMETITACLWGPLSLWVVIAFLRQHPLRFILQLVVSVGQIYGDVLYFLTEHRDGFQHGELGHPLYFWFYFVFMNALWLVLPGVLVLDAVKHLTHAQSTLDAKATKAKSKKN,230,NP_006570.1.csv,refseq-EBP-NM_006579.2_clinical_seed_0_final,refseq-EBP-NM_006579.2.a2m,Invitae,refseq-EBP-NM_006579.2.npy,1,230,230
+NP_006584.1,MQLEHCLSPSIMLSKKFLNVSSSYPHSGGSELVLHDHPIISTTDNLERSSPLKKITRGMTNQSDTDNFPDSKDSPGDVQRSKLSPVLDGVSELRHSFDGSAADRYLLSQSSQPQSAATAPSAMFPYPGQHGPAHPAFSIGSPSRYMAHHPVITNGAYNSLLSNSSPQGYPTAGYPYPQQYGHSYQGAPFYQFSSTQPGLVPGKAQVYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNISGLDPTAHYNIFVDVILADPNHWRFQGGKWVPCGKADTNVQGNRVYMHPDSPNTGAHWMRQEISFGKLKLTNNKGASNNNGQMVVLQSLHKYQPRLHVVEVNEDGTEDTSQPGRVQTFTFPETQFIAVTAYQNTDITQLKIDHNPFAKGFRDNYDTIYTGCDMDRLTPSPNDSPRSQIVPGARYAMAGSFLQDQFVSNYAKARFHPGAGAGPGPGTDRSVPHTNGLLSPQQAEDPGAPSPQRWFVTPANNRLDFAASAYDTATDFAGNAATLLSYAAAGVKALPLQAAGCTGRPLGYYADPSGWGARSPPQYCGTKSGSVLPCWPNSAAAAARMAGANPYLGEEAEGLAAERSPLPPGAAEDAKPKDLSDSSWIETPSSIKSIDSSDSGIYEQAKRRRISPADTPVSESSSPLKSEVLAQRDCEKNCAKDISGYYGFYSHS,682,NP_006584.1.csv,refseq-TBR1-NM_006593.3_clinical_seed_0_final,refseq-TBR1-NM_006593.3.a2m,Invitae,refseq-TBR1-NM_006593.3.npy,1,682,682
+NP_006585.2,MPYLGSEDVVKELKKALCNPHIQADRLRYRNVIQRVIRYMTQGLDMSGVFMEMVKASATVDIVQKKLVYLYMCTYAPLKPDLALLAINTLCKDCSDPNPMVRGLALRSMCSLRMPGVQEYIQQPILNGLRDKASYVRRVAVLGCAKMHNLHGDSEVDGALVNELYSLLRDQDPIVVVNCLRSLEEILKQEGGVVINKPIAHHLLNRMSKLDQWGQAEVLNFLLRYQPRSEEELFDILNLLDSFLKSSSPGVVMGATKLFLILAKMFPHVQTDVLVRVKGPLLAACSSESRELCFVALCHVRQILHSLPGHFSSHYKKFFCSYSEPHYIKLQKVEVLCELVNDENVQQVLEELRGYCTDVSADFAQAAIFAIGGIARTYTDQCVQILTELLGLRQEHITTVVVQTFRDLVWLCPQCTEAVCQALPGCEENIQDSEGKQALIWLLGVHGERIPNAPYVLEDFVENVKSETFPAVKMELLTALLRLFLSRPAECQDMLGRLLYYCIEEEKDMAVRDRGLFYYRLLLVGIDEVKRILCSPKSDPTLGLLEDPAERPVNSWASDFNTLVPVYGKAHWATISKCQGAERCDPELPKTSSFAASGPLIPEENKERVQELPDSGALMLVPNRQLTADYFEKTWLSLKVAHQQVLPWRGEFHPDTLQMALQVVNIQTIAMSRAGSRPWKAYLSAQDDTGCLFLTELLLEPGNSEMQISVKQNEARTETLNSFISVLETVIGTIEEIKS,739,NP_006585.2.csv,refseq-AP4B1-NM_006594.3_clinical_seed_0_final,refseq-AP4B1-NM_006594.3.a2m,Invitae,refseq-AP4B1-NM_006594.3.npy,1,739,739
+NP_006603.2,MAGASVKVAVRVRPFNARETSQDAKCVVSMQGNTTSIINPKQSKDAPKSFTFDYSYWSHTSTEDPQFASQQQVYRDIGEEMLLHAFEGYNVCIFAYGQTGAGKSYTMMGRQEPGQQGIVPQLCEDLFSRVSENQSAQLSYSVEVSYMEIYCERVRDLLNPKSRGSLRVREHPILGPYVQDLSKLAVTSYADIADLMDCGNKARTVAATNMNETSSRSHAVFTIVFTQRCHDQLTGLDSEKVSKISLVDLAGSERADSSGARGMRLKEGANINKSLTTLGKVISALADMQSKKRKSDFIPYRDSVLTWLLKENLGGNSRTAMIAALSPADINYEETLSTLRYADRTKQIRCNAIINEDPNARLIRELQEEVARLRELLMAQGLSASALEGLKTEEGSVRGALPAVSSPPAPVSPSSPTTHNGELEPSFSPNTESQIGPEEAMERLQETEKIIAELNETWEEKLRKTEALRMEREALLAEMGVAVREDGGTVGVFSPKKTPHLVNLNEDPLMSECLLYHIKDGVTRVGQVDMDIKLTGQFIREQHCLFRSIPQPDGEVVVTLEPCEGAETYVNGKLVTEPLVLKSGNRIVMGKNHVFRFNHPEQARLERERGVPPPPGPPSEPVDWNFAQKELLEQQGIDIKLEMEKRLQDLENQYRKEKEEADLLLEQQRLYADSDSGDDSDKRSCEESWRLISSLREQLPPTTVQTIVKRCGLPSSGKRRAPRRVYQIPQRRRLQGKDPRWATMADLKMQAVKEICYEVALADFRHGRAEIEALAALKMRELCRTYGKPDGPGDAWRAVARDVWDTVGEEEGGGAGSGGGSEEGARGAEVEDLRAHIDKLTGILQEVKLQNSSKDRELQALRDRMLRMERVIPLAQDHEDENEEGGEVPWAPPEGSEAAEEAAPSDRMPSARPPSPPLSSWERVSRLMEEDPAFRRGRLRWLKQEQLRLQGLQGSGGRGGGLRRPPARFVPPHDCKLRFPFKSNPQHRESWPGMGSGEAPTPLQPPEEVTPHPATPARRPPSPRRSHHPRRNSLDGGGRSRGAGSAQPEPQHFQPKKHNSYPQPPQPYPAQRPPGPRYPPYTTPPRMRRQRSAPDLKESGAAV,1103,NP_006603.2.csv,refseq-KIF1C-NM_006612.5_clinical_seed_0_final,refseq-KIF1C-NM_006612.5.a2m,Invitae,refseq-KIF1C-NM_006612.5.npy,1,1103,1103
+NP_006614.2,MAFANLRKVLISDSLDPCCRKILQDGGLQVVEKQNLSKEELIAELQDCEGLIVRSATKVTADVINAAEKLQVVGRAGTGVDNVDLEAATRKGILVMNTPNGNSLSAAELTCGMIMCLARQIPQATASMKDGKWERKKFMGTELNGKTLGILGLGRIGREVATRMQSFGMKTIGYDPIISPEVSASFGVQQLPLEEIWPLCDFITVHTPLLPSTTGLLNDNTFAQCKKGVRVVNCARGGIVDEGALLRALQSGQCAGAALDVFTEEPPRDRALVDHENVISCPHLGASTKEAQSRCGEEIAVQFVDMVKGKSLTGVVNAQALTSAFSPHTKPWIGLAEALGTLMRAWAGSPKGTIQVITQGTSLKNAGNCLSPAVIVGLLKEASKQADVNLVNAKLLVKEAGLNVTTSHSPAAPGEQGFGECLLAVALAGAPYQAVGLVQGTTPVLQGLNGAVFRPEVPLRRDLPLLLFRTQTSDPAMLPTMIGLLAEAGVRLLSYQTSLVSDGETWHVMGISSLLPSLEAWKQHVTEAFQFHF,533,NP_006614.2.csv,refseq-PHGDH-NM_006623.3_clinical_seed_0_final,refseq-PHGDH-NM_006623.3.a2m,Invitae,refseq-PHGDH-NM_006623.3.npy,1,533,533
+NP_006615.2,MARLTKRRQADTKAIQHLWAAIEIIRNQKQIANIDRITKYMSRVHGMHPKETTRQLSLAVKDGLIVETLTVGCKGSKAGIEQEGYWLPGDEIDWETENHDWYCFECHLPGEVLICDLCFRVYHSKCLSDEFRLRDSSSPWQCPVCRSIKKKNTNKQEMGTYLRFIVSRMKERAIDLNKKGKDNKHPMYRRLVHSAVDVPTIQEKVNEGKYRSYEEFKADAQLLLHNTVIFYGADSEQADIARMLYKDTCHELDELQLCKNCFYLSNARPDNWFCYPCIPNHELVWAKMKGFGFWPAKVMQKEDNQVDVRFFGHHHQRAWIPSENIQDITVNIHRLHVKRSMGWKKACDELELHQRFLREGRFWKSKNEDRGEEEAESSISSTSNEQLKVTQEPRAKKGRRNQSVEPKKEEPEPETEAVSSSQEIPTMPQPIEKVSVSTQTKKLSASSPRMLHRSTQTTNDGVCQSMCHDKYTKIFNDFKDRMKSDHKRETERVVREALEKLRSEMEEEKRQAVNKAVANMQGEMDRKCKQVKEKCKEEFVEEIKKLATQHKQLISQTKKKQWCYNCEEEAMYHCCWNTSYCSIKCQQEHWHAEHKRTCRRKR,602,NP_006615.2.csv,refseq-ZMYND11-NM_006624.5_clinical_seed_0_final,refseq-ZMYND11-NM_006624.5.a2m,Invitae,refseq-ZMYND11-NM_006624.5.npy,1,602,602
+NP_006651.2,MPSCGACTCGAAAVRLITSSLASAQRGISGGRIHMSVLGRLGTFETQILQRAPLRSFTETPAYFASKDGISKDGSGDGNKKSASEGSSKKSGSGNSGKGGNQLRCPKCGDLCTHVETFVSSTRFVKCEKCHHFFVVLSEADSKKSIIKEPESAAEAVKLAFQQKPPPPPKKIYNYLDKYVVGQSFAKKVLSVAVYNHYKRIYNNIPANLRQQAEVEKQTSLTPRELEIRRREDEYRFTKLLQIAGISPHGNALGASMQQQVNQQIPQEKRGGEVLDSSHDDIKLEKSNILLLGPTGSGKTLLAQTLAKCLDVPFAICDCTTLTQAGYVGEDIESVIAKLLQDANYNVEKAQQGIVFLDEVDKIGSVPGIHQLRDVGGEGVQQGLLKLLEGTIVNVPEKNSRKLRGETVQVDTTNILFVASGAFNGLDRIISRRKNEKYLGFGTPSNLGKGRRAAAAADLANRSGESNTHQDIEEKDRLLRHVEARDLIEFGMIPEFVGRLPVVVPLHSLDEKTLVQILTEPRNAVIPQYQALFSMDKCELNVTEDALKAIARLALERKTGARGLRSIMEKLLLEPMFEVPNSDIVCVEVDKEVVEGKKEPGYIRAPTKESSEEEYDSGVEEEGWPRQADAANS,633,NP_006651.2.csv,refseq-CLPX-NM_006660.4_clinical_seed_0_final,refseq-CLPX-NM_006660.4.a2m,Invitae,refseq-CLPX-NM_006660.4.npy,1,633,633
+NP_006693.3,MEAPLQTGMVLGVMIGAGVAVVVTAVLILLVVRRLRVPKTPAPDGPRYRFRKRDKVLFYGRKIMRKVSQSTSSLVDTSVSATSRPRMRKKLKMLNIAKKILRIQKETPTLQRKEPPPAVLEADLTEGDLANSHLPSEVLYMLKNVRVLGHFEKPLFLELCRHMVFQRLGQGDYVFRPGQPDASIYVVQDGLLELCLPGPDGKECVVKEVVPGDSVNSLLSILDVITGHQHPQRTVSARAARDSTVLRLPVEAFSAVFTKYPESLVRVVQIIMVRLQRVTFLALHNYLGLTNELFSHEIQPLRLFPSPGLPTRTSPVRGSKRMVSTSATDEPRETPGRPPDPTGAPLPGPTGDPVKPTSLETPSAPLLSRCVSMPGDISGLQGGPRSDFDMAYERGRISVSLQEEASGGSLAAPARTPTQEPREQPAGACEYSYCEDESATGGCPFGPYQGRQTSSIFEAAKQELAKLMRIEDPSLLNSRVLLHHAKAGTIIARQGDQDVSLHFVLWGCLHVYQRMIDKAEDVCLFVAQPGELVGQLAVLTGEPLIFTLRAQRDCTFLRISKSDFYEIMRAQPSVVLSAAHTVAARMSPFVRQMDFAIDWTAVEAGRALYRQGDRSDCTYIVLNGRLRSVIQRGSGKKELVGEYGRGDLIGVVEALTRQPRATTVHAVRDTELAKLPEGTLGHIKRRYPQVVTRLIHLLSQKILGNLQQLQGPFPAGSGLGVPPHSELTNPASNLATVAILPVCAEVPMVAFTLELQHALQAIGPTLLLNSDIIRARLGASALDSIQEFRLSGWLAQQEDAHRIVLYQTDASLTPWTVRCLRQADCILIVGLGDQEPTLGQLEQMLENTAVRALKQLVLLHREEGAGPTRTVEWLNMRSWCSGHLHLRCPRRLFSRRSPAKLHELYEKVFSRRADRHSDFSRLARVLTGNTIALVLGGGGARGCSHIGVLKALEEAGVPVDLVGGTSIGSFIGALYAEERSASRTKQRAREWAKSMTSVLEPVLDLTYPVTSMFTGSAFNRSIHRVFQDKQIEDLWLPYFNVTTDITASAMRVHKDGSLWRYVRASMTLSGYLPPLCDPKDGHLLMDGGYINNLPADIARSMGAKTVIAIDVGSQDETDLSTYGDSLSGWWLLWKRLNPWADKVKVPDMAEIQSRLAYVSCVRQLEVVKSSSYCEYLRPPIDCFKTMDFGKFDQIYDVGYQYGKAVFGGWSRGNVIEKMLTDRRSTDLNESRRADVLAFPSSGFTDLAEIVSRIEPPTSYVSDGCADGEESDCLTEYEEDAGPDCSRDEGGSPEGASPSTASEMEEEKSILRQRRCLPQEPPGSATDA,1327,NP_006693.3.csv,refseq-PNPLA6-NM_006702.4_clinical_seed_0_final,refseq-PNPLA6-NM_006702.4.a2m,Invitae,refseq-PNPLA6-NM_006702.4.npy,1,1327,1327
+NP_006736.1,MATNESVSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNYTKFQIATWGSLIVHEALYFLFCLPGFLFQFIPYMKKYKIQKDKPETWENQWKCFKVLLFNHFCIQLPLICGTYYFTEYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE,293,NP_006736.1.csv,refseq-MSMO1-NM_006745.4_clinical_seed_0_final,refseq-MSMO1-NM_006745.4.a2m,Invitae,refseq-MSMO1-NM_006745.4.npy,1,293,293
+NP_006740.1,MAMDEYLWMVILGFIIAFILAFSVGANDVANSFGTAVGSGVVTLRQACILASIFETTGSVLLGAKVGETIRKGIIDVNLYNETVETLMAGEVSAMVGSAVWQLIASFLRLPISGTHCIVGSTIGFSLVAIGTKGVQWMELVKIVASWFISPLLSGFMSGLLFVLIRIFILKKEDPVPNGLRALPVFYAATIAINVFSIMYTGAPVLGLVLPMWAIALISFGVALLFAFFVWLFVCPWMRRKITGKLQKEGALSRVSDESLSKVQEAESPVFKELPGAKANDDSTIPLTGAAGETLGTSEGTSAGSHPRAAYGRALSMTHGSVKSPISNGTFGFDGHTRSDGHVYHTVHKDSGLYKDLLHKIHIDRGPEEKPAQESNYRLLRRNNSYTCYTAAICGLPVHATFRAADSSAPEDSEKLVGDTVSYSKKRLRYDSYSSYCNAVAEAEIEAEEGGVEMKLASELADPDQPREDPAEEEKEEKDAPEVHLLFHFLQVLTACFGSFAHGGNDVSNAIGPLVALWLIYKQGGVTQEAATPVWLLFYGGVGICTGLWVWGRRVIQTMGKDLTPITPSSGFTIELASAFTVVIASNIGLPVSTTHCKVGSVVAVGWIRSRKAVDWRLFRNIFVAWFVTVPVAGLFSAAVMALLMYGILPYV,652,NP_006740.1.csv,refseq-SLC20A2-NM_006749.4_clinical_seed_0_final,refseq-SLC20A2-NM_006749.4.a2m,Invitae,refseq-SLC20A2-NM_006749.4.npy,1,652,652
+NP_006746.1,MSSSPVKRQRMESALDQLKQFTTVVADTGDFHAIDEYKPQDATTNPSLILAAAQMPAYQELVEEAIAYGRKLGGSQEDQIKNAIDKLFVLFGAEILKKIPGRVSTEVDARLSFDKDAMVARARRLIELYKEAGISKDRILIKLSSTWEGIQAGKELEEQHGIHCNMTLLFSFAQAVACAEAGVTLISPFVGRILDWHVANTDKKSYEPLEDPGVKSVTKIYNYYKKFSYKTIVMGASFRNTGEIKALAGCDFLTISPKLLGELLQDNAKLVPVLSAKAAQASDLEKIHLDEKSFRWLHNEDQMAVEKLSDGIRKFAADAVKLERMLTERMFNAENGK,337,NP_006746.1.csv,refseq-TALDO1-NM_006755.1_clinical_seed_0_final,refseq-TALDO1-NM_006755.1.a2m,Invitae,refseq-TALDO1-NM_006755.1.npy,1,337,337
+NP_006748.1,MSDEEVEQVEEQYEEEEEAQEEEEVQEDTAEEDAEEEKPRPKLTAPKIPEGEKVDFDDIQKKRQNKDLMELQALIDSHFEARKKEEEELVALKERIEKRRAERAEQQRIRAEKERERQNRLAEEKARREEEDAKRRAEDDLKKKKALSSMGANYSSYLAKADQKRGKKQTAREMKKKILAERRKPLNIDHLGEDKLRDKAKELWETLHQLEIDKFEFGEKLKRQKYDITTLRSRIDQAQKHSKKAGTPAKGKVGGRWK,258,NP_006748.1.csv,refseq-TNNT3-NM_006757.3_clinical_seed_0_final,refseq-TNNT3-NM_006757.3.a2m,Invitae,refseq-TNNT3-NM_006757.3.npy,1,258,258
+NP_006756.2,MGARGAPSRRRQAGRRLRYLPTGSFPFLLLLLLLCIQLGGGQKKKENLLAEKVEQLMEWSSRRSIFRMNGDKFRKFIKAPPRNYSMIVMFTALQPQRQCSVCRQANEEYQILANSWRYSSAFCNKLFFSMVDYDEGTDVFQQLNMNSAPTFMHFPPKGRPKRADTFDLQRIGFAAEQLAKWIADRTDVHIRVFRPPNYSGTIALALLVSLVGGLLYLRRNNLEFIYNKTGWAMVSLCIVFAMTSGQMWNHIRGPPYAHKNPHNGQVSYIHGSSQAQFVAESHIILVLNAAITMGMVLLNEAATSKGDVGKRRIICLVGLGLVVFFFSFLLSIFRSKYHGYPYSDLDFE,348,NP_006756.2.csv,refseq-TUSC3-NM_006765.3_clinical_seed_0_final,refseq-TUSC3-NM_006765.3.a2m,Invitae,refseq-TUSC3-NM_006765.3.npy,1,348,348
+NP_006757.2,MVKLANPLYTEWILEAIKKVKKQKQRPSEERICNAVSSSHGLDRKTVLEQLELSVKDGTILKVSNKGLNSYKDPDNPGRIALPKPRNHGKLDNKQNVDWNKLIKRAVEGLAESGGSTLKSIERFLKGQKDVSALFGGSAASGFHQQLRLAIKRAIGHGRLLKDGPLYRLNTKATNVDGKESCESLSCLPPVSLLPHEKDKPVAEPIPICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSSCRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWICQICRPRKKGRKLLQKKAAQIKRRYTNPIGRPKNRLKKQNTVSKGPFSKVRTGPGRGRKRKITLSSQSASSSSEEGYLERIDGLDFCRDSNVSLKFNKKTKGLIDGLTKFFTPSPDGRKARGEVVDYSEQYRIRKRGNRKSSTSDWPTDNQDGWDGKQENEERLFGSQEIMTEKDMELFRDIQEQALQKVGVTGPPDPQVRCPSVIEFGKYEIHTWYSSPYPQEYSRLPKLYLCEFCLKYMKSRTILQQHMKKCGWFHPPANEIYRKNNISVFEVDGNVSTIYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDVKGCHLVGYFSKEKHCQQKYNVSCIMILPQYQRKGYGRFLIDFSYLLSKREGQAGSPEKPLSDLGRLSYMAYWKSVILECLYHQNDKQISIKKLSKLTGICPQDITSTLHHLRMLDFRSDQFVIIRREKLIQDHMAKLQLNLRPVDVDPECLRWTPVIVSNSVVSEEEEEEAEEGENEEPQCQERELEISVGKSVSHENKEQDSYSVESEKKPEVMAPVSSTRLSKQVLPHDSLPANSQPSRRGRWGRKNRKTQERFGDKDSKLLLEETSSAPQEQYGECGEKSEATQEQYTESEEQLVASEEQPSQDGKPDLPKRRLSEGVEPWRGQLKKSPEALKCRLTEGSERLPRRYSEGDRAVLRGFSESSEEEEEPESPRSSSPPILTKPTLKRKKPFLHRRRRVRKRKHHNSSVVTETISETTEVLDEPFEDSDSERPMPRLEPTFEIDEEEEEEDENELFPREYFRRLSSQDVLRCQSSSKRKSKDEEEDEESDDADDTPILKPVSLLRKRDVKNSPLEPDTSTPLKKKKGWPKGKSRKPIHWKKRPGRKPGFKLSREIMPVSTQACVIEPIVSIPKAGRKPKIQESEETVEPKEDMPLPEERKEEEEMQAEAEEAEEGEEEDAASSEVPAASPADSSNSPETETKEPEVEEEEEKPRVSEEQRQSEEEQQELEEPEPEEEEDAAAETAQNDDHDADDEDDGHLESTKKKELEEQPTREDVKEEPGVQESFLDANMQKSREKIKDKEETELDSEEEQPSHDTSVVSEQMAGSEDDHEEDSHTKEELIELKEEEEIPHSELDLETVQAVQSLTQEESSEHEGAYQDCEETLAACQTLQSYTQADEDPQMSMVEDCHASEHNSPISSVQSHPSQSVRSVSSPNVPALESGYTQISPEQGSLSAPSMQNMETSPMMDVPSVSDHSQQVVDSGFSDLGSIESTTENYENPSSYDSTMGGSICGNSSSQSSCSYGGLSSSSSLTQSSCVVTQQMASMGSSCSMMQQSSVQPAANCSIKSPQSCVVERPPSNQQQQPPPPPPQQPQPPPPQPQPAPQPPPPQQQPQQQPQPQPQQPPPPPPPQQQPPLSQCSMNNSFTPAPMIMEIPESGSTGNISIYERIPGDFGAGSYSQPSATFSLAKLQQLTNTIMDPHAMPYSHSPAVTSYATSVSLSNTGLAQLAPSHPLAGTPQAQATMTPPPNLASTTMNLTSPLLQCNMSATNIGIPHTQRLQGQMPVKGHISIRSKSAPLPSAAAHQQQLYGRSPSAVAMQAGPRALAVQRGMNMGVNLMPTPAYNVNSMNMNTLNAMNSYRMTQPMMNSSYHSNPAYMNQTAQYPMQMQMGMMGSQAYTQQPMQPNPHGNMMYTGPSHHSYMNAAGVPKQSLNGPYMRR,2004,NP_006757.2.csv,refseq-KAT6A-NM_006766.4_clinical_seed_0_final,refseq-KAT6A-NM_006766.4.a2m,Invitae,refseq-KAT6A-NM_006766.4.npy,1,2004,2004
+NP_006758.2,MAGPGSTGGQIGAAALAGGARSKVAPSVDFDHSCSDSVEYLTLNFGPFETVHRWRRLPPCDEFVGARRSKHTVVAYKDAIYVFGGDNGKTMLNDLLRFDVKDCSWCRAFTTGTPPAPRYHHSAVVYGSSMFVFGGYTGDIYSNSNLKNKNDLFEYKFATGQWTEWKIEGRLPVARSAHGATVYSDKLWIFAGYDGNARLNDMWTIGLQDRELTCWEEVAQSGEIPPSCCNFPVAVCRDKMFVFSGQSGAKITNNLFQFEFKDKTWTRIPTEHLLRGSPPPPQRRYGHTMVAFDRHLYVFGGAADNTLPNELHCYDVDFQTWEVVQPSSDSEVGGAEVPERACASEEVPTLTYEERVGFKKSRDVFGLDFGTTSAKQPTQPASELPSGRLFHAAAVISDAMYIFGGTVDNNIRSGEMYRFQFSCYPKCTLHEDYGRLWESRQFCDVEFVLGEKEECVQGHVAIVTARSRWLRRKITQARERLAQKLEQEAAPVPREAPGVAAGGARPPLLHVAIREAEARPFEVLMQFLYTDKIKYPRKGHVEDVLLIMDVYKLALSFQLCRLEQLCRQYIEASVDLQNVLVVCESAARLQLSQLKEHCLNFVVKESHFNQVIMMKEFERLSSPLIVEIVRRKQQPPPRTPLDQPVDIGTSLIQDMKAYLEGAGAEFCDITLLLDGHPRPAHKAILAARSSYFEAMFRSFMPEDGQVNISIGEMVPSRQAFESMLRYIYYGEVNMPPEDSLYLFAAPYYYGFYNNRLQAYCKQNLEMNVTVQNVLQILEAADKTQALDMKRHCLHIIVHQFTKVSKLPTLRSLSQQLLLDIIDSLASHISDKQCAELGADI,840,NP_006758.2.csv,refseq-LZTR1-NM_006767.3_clinical_seed_0_final,refseq-LZTR1-NM_006767.3.a2m,Invitae,refseq-LZTR1-NM_006767.3.npy,1,840,840
+NP_006763.2,MSRSRASIHRGSIPAMSYAPFRDVRGPSMHRTQYVHSPYDRPGWNPRFCIISGNQLLMLDEDEIHPLLIRDRRSESSRNKLLRRTVSVPVEGRPHGEHEYHLGRSRRKSVPGGKQYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKSQPKLDRTSSFRQILPRFRSADHDRARLMQSFKESHSHESLLSPSSAAEALELNLDEDSIIKPVHSSILGQEFCFEVTTSSGTKCFACRSAAERDKWIENLQRAVKPNKDNSRRVDNVLKLWIIEARELPPKKRYYCELCLDDMLYARTTSKPRSASGDTVFWGEHFEFNNLPAVRALRLHLYRDSDKKRKKDKAGYVGLVTVPVATLAGRHFTEQWYPVTLPTGSGGSGGMGSGGGGGSGGGSGGKGKGGCPAVRLKARYQTMSILPMELYKEFAEYVTNHYRMLCAVLEPALNVKGKEEVASALVHILQSTGKAKDFLSDMAMSEVDRFMEREHLIFRENTLATKAIEEYMRLIGQKYLKDAIGEFIRALYESEENCEVDPIKCTASSLAEHQANLRMCCELALCKVVNSHCVFPRELKEVFASWRLRCAERGREDIADRLISASLFLRFLCPAIMSPSLFGLMQEYPDEQTSRTLTLIAKVIQNLANFSKFTSKEDFLGFMNEFLELEWGSMQQFLYEISNLDTLTNSSSFEGYIDLGRELSTLHALLWEVLPQLSKEALLKLGPLPRLLNDISTALRNPNIQRQPSRQSERPRPQPVVLRGPSAEMQGYMMRDLNSSIDLQSFMARGLNSSMDMARLPSPTKEKPPPPPPGGGKDLFYVSRPPLARSSPAYCTSSSDITEPEQKMLSVNKSVSMLDLQGDGPGGRLNSSSVSNLAAVGDLLHSSQASLTAALGLRPAPAGRLSQGSGSSITAAGMRLSQMGVTTDGVPAQQLRIPLSFQNPLFHMAADGPGPPGGHGGGGGHGPPSSHHHHHHHHHHRGGEPPGDTFAPFHGYSKSEDLSSGVPKPPAASILHSHSYSDEFGPSGTDFTRRQLSLQDNLQHMLSPPQITIGPQRPAPSGPGGGSGGGSGGGGGGQPPPLQRGKSQQLTVSAAQKPRPSSGNLLQSPEPSYGPARPRQQSLSKEGSIGGSGGSGGGGGGGLKPSITKQHSQTPSTLNPTMPASERTVAWVSNMPHLSADIESAHIEREEYKLKEYSKSMDESRLDRVKEYEEEIHSLKERLHMSNRKLEEYERRLLSQEEQTSKILMQYQARLEQSEKRLRQQQAEKDSQIKSIIGRLMLVEEELRRDHPAMAEPLPEPKKRLLDAQERQLPPLGPTNPRVTLAPPWNGLAPPAPPPPPRLQITENGEFRNTADH,1343,NP_006763.2.csv,refseq-SYNGAP1-NM_006772.2_clinical_seed_0_final,refseq-SYNGAP1-NM_006772.2.a2m,Invitae,refseq-SYNGAP1-NM_006772.2.npy,1,1343,1343
+NP_006774.2,MDWGTLHTFIGGVNKHSTSIGKVWITVIFIFRVMILVVAAQEVWGDEQEDFVCNTLQPGCKNVCYDHFFPVSHIRLWALQLIFVSTPALLVAMHVAYYRHETTRKFRRGEKRNDFKDIEDIKKQKVRIEGSLWWTYTSSIFFRIIFEAAFMYVFYFLYNGYHLPWVLKCGIDPCPNLVDCFISRPTEKTVFTIFMISASVICMLLNVAELCYLLLKVCFRRSKRAQTQKNHPNHALKESKQNEMNELISDSGQNAITGFPS,261,NP_006774.2.csv,refseq-GJB6-NM_006783.4_clinical_seed_0_final,refseq-GJB6-NM_006783.4.a2m,Invitae,refseq-GJB6-NM_006783.4.npy,1,261,261
+NP_006787.2,MAHRCLRLWGRGGCWPRGLQQLLVPGGVGPGEQPCLRTLYRFVTTQARASRNSLLTDIIAAYQRFCSRPPKGFEKYFPNGKNGKKASEPKEVMGEKKESKPAATTRSSGGGGGGGGKRGGKKDDSHWWSRFQKGDIPWDDKDFRMFFLWTALFWGGVMFYLLLKRSGREITWKDFVNNYLSKGVVDRLEVVNKRFVRVTFTPGKTPVDGQYVWFNIGSVDTFERNLETLQQELGIEGENRVPVVYIAESDGSFLLSMLPTVLIIAFLLYTIRRGPAGIGRTGRGMGGLFSVGETTAKVLKDEIDVKFKDVAGCEEAKLEIMEFVNFLKNPKQYQDLGAKIPKGAILTGPPGTGKTLLAKATAGEANVPFITVSGSEFLEMFVGVGPARVRDLFALARKNAPCILFIDEIDAVGRKRGRGNFGGQSEQENTLNQLLVEMDGFNTTTNVVILAGTNRPDILDPALLRPGRFDRQIFIGPPDIKGRASIFKVHLRPLKLDSTLEKDKLARKLASLTPGFSGADVANVCNEAALIAARHLSDSINQKHFEQAIERVIGGLEKKTQVLQPEEKKTVAYHEAGHAVAGWYLEHADPLLKVSIIPRGKGLGYAQYLPKEQYLYTKEQLLDRMCMTLGGRVSEEIFFGRITTGAQDDLRKVTQSAYAQIVQFGMNEKVGQISFDLPRQGDMVLEKPYSEATARLIDDEVRILINDAYKRTVALLTEKKADVEKVALLLLEKEVLDKNDMVELLGPRPFAEKSTYEEFVEGTGSLDEDTSLPEGLKDWNKEREKEKEEPPGEKVAN,797,NP_006787.2.csv,refseq-AFG3L2-NM_006796.2_clinical_seed_0_final,refseq-AFG3L2-NM_006796.2.a2m,Invitae,refseq-AFG3L2-NM_006796.2.npy,1,797,797
+NP_006822.1,MGEEANDDKKPTTKFELERETELRFEVEASQSVQLELLTGMAEIFGTELTRNKKFTFDAGAKVAVFTWHGCSVQLSGRTEVAYVSKDTPMLLYLNTHTALEQMRRQAEKEEERGPRVMVVGPTDVGKSTVCRLLLNYAVRLGRRPTYVELDVGQGSVSIPGTMGALYIERPADVEEGFSIQAPLVYHFGSTTPGTNIKLYNKITSRLADVFNQRCEVNRRASVSGCVINTCGWVKGSGYQALVHAASAFEVDVVVVLDQERLYNELKRDLPHFVRTVLLPKSGGVVERSKDFRRECRDERIREYFYGFRGCFYPHAFNVKFSDVKIYKVGAPTIPDSCLPLGMSQEDNQLKLVPVTPGRDMVHHLLSVSTAEGTEENLSETSVAGFIVVTSVDLEHQVFTVLSPAPRPLPKNFLLIMDIRFMDLK,425,NP_006822.1.csv,refseq-CLP1-NM_006831.2_clinical_seed_0_final,refseq-CLP1-NM_006831.2.a2m,Invitae,refseq-CLP1-NM_006831.2.npy,1,425,425
+NP_006843.2,MMEELHSLDPRRQELLEARFTGVGVSKGPLNSESSNQSLCSVGSLSDKEVETPEKKQNDQRNRKRKAEPYETSQGKGTPRGHKISDYFEFAGGSAPGTSPGRSVPPVARSSPQHSLSNPLPRRVEQPLYGLDGSAAKEATEEQSALPTLMSVMLAKPRLDTEQLAQRGAGLCFTFVSAQQNSPSSTGSGNTEHSCSSQKQISIQHRQTQSDLTIEKISALENSKNSDLEKKEGRIDDLLRANCDLRRQIDEQQKMLEKYKERLNRCVTMSKKLLIEKSKQEKMACRDKSMQDRLRLGHFTTVRHGASFTEQWTDGYAFQNLIKQQERINSQREEIERQRKMLAKRKPPAMGQAPPATNEQKQRKSKTNGAENETLTLAEYHEQEEIFKLRLGHLKKEEAEIQAELERLERVRNLHIRELKRIHNEDNSQFKDHPTLNDRYLLLHLLGRGGFSEVYKAFDLTEQRYVAVKIHQLNKNWRDEKKENYHKHACREYRIHKELDHPRIVKLYDYFSLDTDSFCTVLEYCEGNDLDFYLKQHKLMSEKEARSIIMQIVNALKYLNEIKPPIIHYDLKPGNILLVNGTACGEIKITDFGLSKIMDDDSYNSVDGMELTSQGAGTYWYLPPECFVVGKEPPKISNKVDVWSVGVIFYQCLYGRKPFGHNQSQQDILQENTILKATEVQFPPKPVVTPEAKAFIRRCLAYRKEDRIDVQQLACDPYLLPHIRKSVSTSSPAGAAIASTSGASNNSSSN,750,NP_006843.2.csv,refseq-TLK2-NM_006852.3_clinical_seed_0_final,refseq-TLK2-NM_006852.3.a2m,Invitae,refseq-TLK2-NM_006852.3.npy,1,750,750
+NP_006850.2,MSLRCGDAARTLGPRVFGRYFCSPVRPLSSLPDKKKELLQNGPDLQDFVSGDLADRSTWDEYKGNLKRQKGERLRLPPWLKTEIPMGKNYNKLKNTLRNLNLHTVCEEARCPNIGECWGGGEYATATATIMLMGDTCTRGCRFCSVKTARNPPPLDASEPYNTAKAIAEWGLDYVVLTSVDRDDMPDGGAEHIAKTVSYLKERNPKILVECLTPDFRGDLKAIEKVALSGLDVYAHNVETVPELQSKVRDPRANFDQSLRVLKHAKKVQPDVISKTSIMLGLGENDEQVYATMKALREADVDCLTLGQYMQPTRRHLKVEEYITPEKFKYWEKVGNELGFHYTASGPLVRSSYKAGEFFLKNLVAKRKTKDL,372,NP_006850.2.csv,refseq-LIAS-NM_006859.3_clinical_seed_0_final,refseq-LIAS-NM_006859.3.a2m,Invitae,refseq-LIAS-NM_006859.3.npy,1,372,372
+NP_006851.1,MVKLAAKCILADPAVGKTALAQIFRSDGAHFQKSYTLTTGMDLVVKTVPVPDTGDSVELFIFDSAGKELFSEMLDKLWESPNVLCLVYDVTNEESFNNCSKWLEKARSQAPGISLPGVLVGNKTDLAGRRAVDSAEARAWALGQGLECFETSVKEMENFEAPFHCLAKQFHQLYREKVEVFRALA,185,NP_006851.1.csv,refseq-IFT27-NM_006860.4_clinical_seed_0_final,refseq-IFT27-NM_006860.4.a2m,Invitae,refseq-IFT27-NM_006860.4.npy,1,185,185
+NP_006867.1,MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVKAQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGLREMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC,415,NP_006867.1.csv,refseq-B4GAT1-NM_006876.2_clinical_seed_0_final,refseq-B4GAT1-NM_006876.2.a2m,Invitae,refseq-B4GAT1-NM_006876.2.npy,1,415,415
+NP_006874.1,MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLARSRELGTSDSSLQDITEGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKREDVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGALRMPFQQMEFCSCRPGWSIMA,225,NP_006874.1.csv,refseq-SHOX-NM_006883.2_clinical_seed_0_final,refseq-SHOX-NM_006883.2.a2m,Invitae,refseq-SHOX-NM_006883.2.npy,1,225,225
+NP_008822.2,MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS,174,NP_008822.2.csv,refseq-CRYGD-NM_006891.3_clinical_seed_0_final,refseq-CRYGD-NM_006891.3.a2m,Invitae,refseq-CRYGD-NM_006891.3.npy,1,174,174
+NP_008823.1,MKGDTRHLNGEEDAGGREDSILVNGACSDQSSDSPPILEAIRTPEIRGRRSSSRLSKREVSSLLSYTQDLTGDGDGEDGDGSDTPVMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPRSTRGRQGRNHVDESPVEFPATRSLRRRATASAGTPWPSPPSSYLTIDLTDDTEDTHGTPQSSSTPYARLAQDSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKIKGFSWWPAMVVSWKATSKRQAMSGMRWVQWFGDGKFSEVSADKLVALGLFSQHFNLATFNKLVSYRKAMYHALEKARVRAGKTFPSSPGDSLEDQLKPMLEWAHGGFKPTGIEGLKPNNTQPVVNKSKVRRAGSRKLESRKYENKTRRRTADDSATSDYCPAPKRLKTNCYNNGKDRGDEDQSREQMASDVANNKSSLEDGCLSCGRKNPVSFHPLFEGGLCQTCRDRFLELFYMYDDDGYQSYCTVCCEGRELLLCSNTSCCRCFCVECLEVLVGTGTAAEAKLQEPWSCYMCLPQRCHGVLRRRKDWNVRLQAFFTSDTGLEYEAPKLYPAIPAARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVKHEGNIKYVNDVRNITKKNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYEGTGRLFFEFYHLLNYSRPKEGDDRPFFWMFENVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRARYFWGNLPGMNRPVIASKNDKLELQDCLEYNRIAKLKKVQTITTKSNSIKQGKNQLFPVVMNGKEDVLWCTELERIFGFPVHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFACE,853,NP_008823.1.csv,refseq-DNMT3B-NM_006892.3_clinical_seed_0_final,refseq-DNMT3B-NM_006892.3.a2m,Invitae,refseq-DNMT3B-NM_006892.3.npy,1,853,853
+NP_008825.4,MGKKVAIIGAGVSGLASIRSCLEEGLEPTCFEKSNDIGGLWKFSDHAEEGRASIYKSVFSNSSKEMMCFPDFPFPDDFPNFMHNSKIQEYIIAFAKEKNLLKYIQFKTFVSSVNKHPDFATTGQWDVTTERDGKKESAVFDAVMVCSGHHVYPNLPKESFPGLNHFKGKCFHSRDYKEPGVFNGKRVLVVGLGNSGCDIATELSRTAEQVMISSRSGSWVMSRVWDNGYPWDMLLVTRFGTFLKNNLPTAISDWLYVKQMNARFKHENYGLMPLNGVLRKEPVFNDELPASILCGIVSVKPNVKEFTETSAIFEDGTIFEGIDCVIFATGYSFAYPFLDESIIKSRNNEIILFKGVFPPLLEKSTIAVIGFVQSLGAAIPTVDLQSRWAAQVIKGTCTLPSMEDMMNDINEKMEKKRKWFGKSETIQTDYIVYMDELSSFIGAKPNIPWLFLTDPKLAMEVYFGPCSPYQFRLVGPGQWPGARNAILTQWDRSLKPMQTRVVGRLQKPCFFFHWLKLFAIPILLIAVFLVLT,532,NP_008825.4.csv,refseq-FMO3-NM_006894.5_clinical_seed_0_final,refseq-FMO3-NM_006894.5.a2m,Invitae,refseq-FMO3-NM_006894.5.npy,1,532,532
+NP_008846.2,MGCFFSKRRKADKESRPENEEERPKQYSWDQREKVDPKDYMFSGLKDETVGRLPGTVAGQQFLIQDCENCNIYIFDHSATVTIDDCTNCIIFLGPVKGSVFFRNCRDCKCTLACQQFRVRDCRKLEVFLCCATQPIIESSSNIKFGCFQWYYPELAFQFKDAGLSIFNNTWSNIHDFTPVSGELNWSLLPEDAVVQDYVPIPTTEELKAVRVSTEANRSIVPISRGQRQKSSDESCLVVLFAGDYTIANARKLIDEMVGKGFFLVQTKEVSMKAEDAQRVFREKAPDFLPLLNKGPVIALEFNGDGAVEVCQLIVNEIFNGTKMFVSESKETASGDVDSFYNFADIQMGI,350,NP_008846.2.csv,refseq-RP2-NM_006915.2_clinical_seed_0_final,refseq-RP2-NM_006915.2.a2m,Invitae,refseq-RP2-NM_006915.2.npy,1,350,350
+NP_008853.3,MAQALLVPPGPESFRLFTRESLAAIEKRAAEEKAKKPKKEQDNDDENKPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVMNKGKAIFRFSATSALYILTPLNPVRKIAIKILVHSLFSMLIMCTILTNCVFMTLSNPPDWTKNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFSVIVMAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPSDSAFETNTTSYFNGTMDSNGTFVNVTMSTFNWKDYIGDDSHFYVLDGQKDPLLCGNGSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDYWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLVNLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAVAAASAASRDFSGIGGLGELLESSSEASKLSSKSAKEWRNRRKKRRQREHLEGNNKGERDSFPKSESEDSVKRSSFLFSMDGNRLTSDKKFCSPHQSLLSIRGSLFSPRRNSKTSIFSFRGRAKDVGSENDFADDEHSTFEDSESRRDSLFVPHRHGERRNSNVSQASMSSRMVPGLPANGKMHSTVDCNGVVSLVGGPSALTSPTGQLPPEGTTTETEVRKRRLSSYQISMEMLEDSSGRQRAVSIASILTNTMEELEESRQKCPPCWYRFANVFLIWDCCDAWLKVKHLVNLIVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLTVGNLVFTGIFTAEMVLKIIAMDPYYYFQEGWNIFDGIIVSLSLMELGLSNVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLIVFMLVMVIGNLVVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDYVKNKMRECFQKAFFRKPKVIEIHEGNKIDSCMSNNTGIEISKELNYLRDGNGTTSGVGTGSSVEKYVIDENDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESELEESKEKLNATSSSEGSTVDVVLPREGEQAETEPEEDLKPEACFTEGCIKKFPFCQVSTEEGKGKIWWNLRKTCYSIVEHNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQTYFTNAWCWLDFLIVDVSLVSLVANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCVNMTTGNMFDISDVNNLSDCQALGKQARWKNVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRDVKLQPVYEENLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPANKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQGKYMTLVLSRINLVFIVLFTGEFVLKLVSLRHYYFTIGWNIFDFVVVILSIVGMFLAEMIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEAGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSAPPDCDPDTIHPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEDRFMASNPSKVSYEPITTTLKRKQEEVSAAIIQRNFRCYLLKQRLKNISSNYNKEAIKGRIDLPIKQDMIIDKLNGNSTPEKTDGSSSTTSPPSYDSVTKPDKEKFEKDKPEKESKGKEVRENQK,2000,NP_008853.3.csv,refseq-SCN3A-NM_006922.3_clinical_seed_0_final,refseq-SCN3A-NM_006922.3.a2m,Invitae,refseq-SCN3A-NM_006922.3.npy,1,2000,2000
+NP_008860.4,MMETERLVLPPPDPLDLPLRAVELGCTGHWELLNLPGAPESSLPHGLPPCAPDLQQEAEQLFLSSPAWLPLHGVEHSARKWQRKTDPWSLLAVLGAPVPSDLQAQRHPTTGQILGYKEVLLENTNLSATTSLSLRRPPGPASQSLWGNPTQYPFWPGGMDEPTITDLNTREEAEEEIDFEKDLLTIPPGFKKGMDFAPKDCPTPAPGLLSLSCMLEPLDLGGGDEDENEAVGQPGGPRGDTVSASPCSAPLARASSLEDLVLKEASTAVSTPEAPEPPSQEQWAIPVDATSPVGDFYRLIPQPAFQWAFEPDVFQKQAILHLERHDSVFVAAHTSAGKTVVAEYAIALAQKHMTRTIYTSPIKALSNQKFRDFRNTFGDVGLLTGDVQLHPEASCLIMTTEILRSMLYSGSDVIRDLEWVIFDEVHYINDVERGVVWEEVLIMLPDHVSIILLSATVPNALEFADWIGRLKRRQIYVISTVTRPVPLEHYLFTGNSSKTQGELFLLLDSRGAFHTKGYYAAVEAKKERMSKHAQTFGAKQPTHQGGPAQDRGVYLSLLASLRTRAQLPVVVFTFSRGRCDEQASGLTSLDLTTSSEKSEIHLFLQRCLARLRGSDRQLPQVLHMSELLNRGLGVHHSGILPILKEIVEMLFSRGLVKVLFATETFAMGVNMPARTVVFDSMRKHDGSTFRDLLPGEYVQMAGRAGRRGLDPTGTVILLCKGRVPEMADLHRMMMGKPSQLQSQFRLTYTMILNLLRVDALRVEDMMKRSFSEFPSRKDSKAHEQALAELTKRLGALEEPDMTGQLVDLPEYYSWGEELTETQHMIQRRIMESVNGLKSLSAGRVVVVKNQEHHNALGVILQVSSNSTSRVFTTLVLCDKPLSQDPQDRGPATAEVPYPDDLVGFKLFLPEGPCDHTVVKLQPGDMAAITTKVLRVNGEKILEDFSKRQQPKFKKDPPLAAVTTAVQELLRLAQAHPAGPPTLDPVNDLQLKDMSVVEGGLRARKLEELIQGAQCVHSPRFPAQYLKLRERMQIQKEMERLRFLLSDQSLLLLPEYHQRVEVLRTLGYVDEAGTVKLAGRVACAMSSHELLLTELMFDNALSTLRPEEIAALLSGLVCQSPGDAGDQLPNTLKQGIERVRAVAKRIGEVQVACGLNQTVEEFVGELNFGLVEVVYEWARGMPFSELAGLSGTPEGLVVRCIQRLAEMCRSLRGAARLVGEPVLGAKMETAATLLRRDIVFAASLYTQ,1246,NP_008860.4.csv,refseq-SKIV2L-NM_006929.4_clinical_seed_0_final,refseq-SKIV2L-NM_006929.4.a2m,Invitae,refseq-SKIV2L-NM_006929.4.npy,1,1246,1246
+NP_008870.2,MQQAPQPYEFFSEENSPKWRGLLVSALRKVQEQVHPTLSANEESLYYIEELIFQLLNKLCMAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNPLLLPVDKIHPSLKEVLGYKVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQDIKVSMCADKVLMDMFDQDDIGLVSLCEDEPSSSGELNYYDLVRTEIAEERQYLRELNMIIKVFREAFLSDRKLFKPSDIEKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPLAGSCFEDLAEEQAFDPYETLSQDILSPEFHEHFNKLMARPAVALHFQSIADGFKEAVRYVLPRLMLVPVYHCWHYFELLKQLKACSEEQEDRECLNQAITALMNLQGSMDRIYKQYSPRRRPGDPVCPFYSHQLRSKHLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGPLTRIGAKHERHIFLFDGLMISCKPNHGQTRLPGYSSAEYRLKEKFVMRKIQICDKEDTCEHKHAFELVSKDENSIIFAAKSAEEKNNWMAALISLHYRSTLDRMLDSVLLKEENEQPLRLPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIKGGTVVKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLAIEKGEQPISADLKRFRKEYVQPVQLRILNVFRHWVEHHFYDFERDLELLERLESFISSVRGKAMKKWVESIAKIIRRKKQAQANGVSHNITFESPPPPIEWHISKPGQFETFDLMTLHPIEIARQLTLLESDLYRKVQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEAENFEERVAVLSRIIEILQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILDEAVELSQDHFKKYLVKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKKKGKDLINFSKRRKVAEITGEIQQYQNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCKQPPRFPRKSTFSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTVSAPTSPNTPSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLSEEPLIPPPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPLHSPPPPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPSTPPSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLYRLPLLENAETPQ,1332,NP_008870.2.csv,refseq-SOS2-NM_006939.2_clinical_seed_0_final,refseq-SOS2-NM_006939.2.a2m,Invitae,refseq-SOS2-NM_006939.2.npy,1,1332,1332
+NP_008871.3,MLTDPDLPQEFERMSSKRPASPYGEADGEVAMVTSRQKVEEEESDGLPAFHLPLHVSFPNKPHSEEFQPVSLLTQETCGHRTPTSQHNTMEVDGNKVMSSFAPHNSSTSPQKAEEGGRQSGESLSSTALGTPERRKGSLADVVDTLKQRKMEELIKNEPEETPSIEKLLSKDWKDKLLAMGSGNFGEIKGTPESLAEKERQLMGMINQLTSLREQLLAAHDEQKKLAASQIEKQRQQMELAKQQQEQIARQQQQLLQQQHKINLLQQQIQVQGQLPPLMIPVFPPDQRTLAAAAQQGFLLPPGFSYKAGCSDPYPVQLIPTTMAAAAAATPGLGPLQLQQLYAAQLAAMQVSPGGKLPGIPQGNLGAAVSPTSIHTDKSTNSPPPKSKDEVAQPLNLSAKPKTSDGKSPTSPTSPHMPALRINSGAGPLKASVPAALASPSARVSTIGYLNDHDAVTKAIQEARQMKEQLRREQQVLDGKVAVVNSLGLNNCRTEKEKTTLESLTQQLAVKQNEEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESRGRGSNEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMTNLEKQPYYEEQARLSKQHLEKYPDYKYKPRPKRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVGQQAQIPIATAGVVYPGAIAMAGMPSPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPHIKEEIQAEDINGEIYDEYDEEEDDPDVDYGSDSENHIAGQAN,763,NP_008871.3.csv,refseq-SOX5-NM_006940.5_clinical_seed_0_final,refseq-SOX5-NM_006940.5.a2m,Invitae,refseq-SOX5-NM_006940.5.npy,1,763,763
+NP_008872.1,MAEEQDLSEVELSPVGSEEPRCLSPGSAPSLGPDGGGGGSGLRASPGPGELGKVKKEQQDGEADDDKFPVCIREAVSQVLSGYDWTLVPMPVRVNGASKSKPHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESDKRPFIEEAERLRMQHKKDHPDYKYQPRRRKNGKAAQGEAECPGGEAEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSGQSHGPPTPPTTPKTELQSGKADPKRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMETFDVAELDQYLPPNGHPGHVSSYSAAGYGLGSALAVASGHSAWISKPPGVALPTVSPPGVDAKAQVKTETAGPQGPPHYTDQPSTSQIAYTSLSLPHYGSAFPSISRPQFDYSDHQPSGPYYGHSGQASGLYSAFSYMGPSQRPLYTAISDPSPSGPQSHSPTHWEQPVYTTLSRP,466,NP_008872.1.csv,refseq-SOX10-NM_006941.3_clinical_seed_0_final,refseq-SOX10-NM_006941.3.a2m,Invitae,refseq-SOX10-NM_006941.3.npy,1,466,466
+NP_008880.2,MAPSGLKAVVGEKILSGVIRSVKKDGEWKVLIMDHPSMRILSSCCKMSDILAEGITIVEDINKRREPIPSLEAIYLLSPTEKSVQALIKDFQGTPTFTYKAAHIFFTDTCPEPLFSELGRSRLAKVVKTLKEIHLAFLPYEAQVFSLDAPHSTYNLYCPFRAEERTRQLEVLAQQIATLCATLQEYPAIRYRKGPEDTAQLAHAVLAKLNAFKADTPSLGEGPEKTRSQLLIMDRAADPVSPLLHELTFQAMAYDLLDIEQDTYRYETTGLSEAREKAVLLDEDDDLWVELRHMHIADVSKKVTELLRTFCESKRLTTDKANIKDLSQILKKMPQYQKELNKYSTHLHLADDCMKHFKGSVEKLCSVEQDLAMGSDAEGEKIKDSMKLIVPVLLDAAVPAYDKIRVLLLYILLRNGVSEENLAKLIQHANVQAHSSLIRNLEQLGGTVTNPGGSGTSSRLEPRERMEPTYQLSRWTPVIKDVMEDAVEDRLDRNLWPFVSDPAPTASSQAAVSARFGHWHKNKAGIEARAGPRLIVYVMGGVAMSEMRAAYEVTRATEGKWEVLIGSSHILTPTRFLDDLKALDKKLEDIALP,593,NP_008880.2.csv,refseq-STXBP2-NM_006949.3_clinical_seed_0_final,refseq-STXBP2-NM_006949.3.a2m,Invitae,refseq-STXBP2-NM_006949.3.npy,1,593,593
+NP_008909.1,MAEQLSPGKAVDQVCTFLFKKPGRKGAAGRRKRPACDPEPGESGSSSDEGCTVVRPEKKRVTHNPMIQKTRDSGKQKAAYGDLSSEEEEENEPESLGVVYKSTRSAKPVGPEDMGATAVYELDTEKERDAQAIFERSQKIQEELRGKEDDKIYRGINNYQKYMKPKDTSMGNASSGMVRKGPIRAPEHLRATVRWDYQPDICKDYKETGFCGFGDSCKFLHDRSDYKHGWQIERELDEGRYGVYEDENYEVGSDDEEIPFKCFICRQSFQNPVVTKCRHYFCESCALQHFRTTPRCYVCDQQTNGVFNPAKELIAKLEKHRATGEGGASDLPEDPDEDAIPIT,343,NP_008909.1.csv,refseq-RNF113A-NM_006978.2_clinical_seed_0_final,refseq-RNF113A-NM_006978.2.a2m,Invitae,refseq-RNF113A-NM_006978.2.npy,1,343,343
+NP_008919.3,MQRAVPEGFGRRKLGSDMGNAERAPGSRSFGPVPTLLLLAAALLAVSDALGRPSEEDEELVVPELERAPGHGTTRLRLHAFDQQLDLELRPDSSFLAPGFTLQNVGRKSGSETPLPETDLAHCFYSGTVNGDPSSAAALSLCEGVRGAFYLLGEAYFIQPLPAASERLATAAPGEKPPAPLQFHLLRRNRQGDVGGTCGVVDDEPRPTGKAETEDEDEGTEGEDEGAQWSPQDPALQGVGQPTGTGSIRKKRFVSSHRYVETMLVADQSMAEFHGSGLKHYLLTLFSVAARLYKHPSIRNSVSLVVVKILVIHDEQKGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQTCDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQCASLNGVNQDSHMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDKPQNPIQLPGDLPGTSYDANRQCQFTFGEDSKHCPDAASTCSTLWCTGTSGGVLVCQTKHFPWADGTSCGEGKWCINGKCVNKTDRKHFDTPFHGSWGMWGPWGDCSRTCGGGVQYTMRECDNPVPKNGGKYCEGKRVRYRSCNLEDCPDNNGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAKGIGYFFVLQPKVVDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGVCGGNGSTCKKISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGSFLAIKAADGTYILNGDYTLSTLEQDIMYKGVVLRYSGSSAALERIRSFSPLKEPLTIQVLTVGNALRPKIKYTYFVKKKKESFNAIPTFSAWVIEEWGECSKSCELGWQRRLVECRDINGQPASECAKEVKPASTRPCADHPCPQWQLGEWSSCSKTCGKGYKKRSLKCLSHDGGVLSHESCDPLKKPKHFIDFCTMAECS,967,NP_008919.3.csv,refseq-ADAMTS1-NM_006988.4_clinical_seed_0_final,refseq-ADAMTS1-NM_006988.4.a2m,Invitae,refseq-ADAMTS1-NM_006988.4.npy,1,967,967
+NP_008927.1,MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGPDKNLTEREVFNEIYPVWTYSYLVLLFPVFLATDYLRYKPVVLLQGLSLIVTWFMLLYAQGLLAIQFLEFFYGIATATEIAYYSYIYSVVDLGMYQKVTSYCRSATLVGFTVGSVLGQILVSVAGWSLFSLNVISLTCVSVAFAVAWFLPMPQKSLFFHHIPSTCQRVNGIKVQNGGIVTDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS,497,NP_008927.1.csv,refseq-SLC19A2-NM_006996.2_clinical_seed_0_final,refseq-SLC19A2-NM_006996.2.a2m,Invitae,refseq-SLC19A2-NM_006996.2.npy,1,497,497
+NP_008986.2,MVKEQFRETDVAKKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNQHAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAFNGTVKKCGLLKIIHEKYKTNKKVVDPIVSNFLQSFETAIEHNKEVEPLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPPLCIRPSVVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQQRHTQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILVGKDEKIKVRLPPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCGKGEDLCANDSYVTIQNSELMSGSMDKGTLGSGSKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKDEPLEFKRVLDNIKAVFPCPSEPALSKNELILTTESIMKKSEFLCCQDSFLQEIKKFIKGVSEKIKKTRDKYGINDNGTTEPRVLYQLDRITPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKDDDADYARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLEVNAETVRYSICTSKLRVKPGDVAVHGEAVVCVTPRENSKSSMYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKEKYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLLHKADRDPNPPKRPLIFDTNEFHIPLVT,1390,NP_008986.2.csv,refseq-POLR3A-NM_007055.3_clinical_seed_0_final,refseq-POLR3A-NM_007055.3.a2m,Invitae,refseq-POLR3A-NM_007055.3.npy,1,1390,1390
+NP_008990.2,MMGEAAVAAGPCPLREDSFTRFSSQSNVYGLAGGAGGRGELLAATLKGKVLGFRYQDLRQKIRPVAKELQFNYIPVDAEIVSIDTFNKSPPKRGLVVGITFIKDSGDKGSPFLNIYCDYEPGSEYNLDSIAQSCLNLELQFTPFQLCHAEVQVGDQLETVFLLSGNDPAIHLYKENEGLHQFEEQPVENLFPELTNLTSSVLWLDVHNFPGTSRRLSALGCQSGYVRVAHVDQRSREVLQMWSVLQDGPISRVIVFSLSAAKETKDRPLQDEYSVLVASMLEPAVVYRDLLNRGLEDQLLLPGSDQFDSVLCSLVTDVDLDGRPEVLVATYGQELLCYKYRGPESGLPEAQHGFHLLWQRSFSSPLLAMAHVDLTGDGLQELAVVSLKGVHILQHSLIQASELVLTRLRHQVEQRRRRLQGLEDGAGAGPAENAAS,436,NP_008990.2.csv,refseq-KPTN-NM_007059.3_clinical_seed_0_final,refseq-KPTN-NM_007059.3.a2m,Invitae,refseq-KPTN-NM_007059.3.npy,1,436,436
+NP_009004.2,MNYTESSPLRESTAIGFTPELESIIPVPSNKTTCENWREIHHLVFHVANICFAVGLVIPTTLHLHMIFLRGMLTLGCTLYIVWATLYRCALDIMIWNSVFLGVNILHLSYLLYKKRPVKIEKELSGMYRRLFEPLRVPPDLFRRLTGQFCMIQTLKKGQTYAAEDKTSVDDRLSILLKGKMKVSYRGHFLHNIYPCAFIDSPEFRSTQMHKGEKFQVTIIADDNCRFLCWSRERLTYFLESEPFLYEIFRYLIGKDITNKLYSLNDPTLNDKKAKKLEHQLSLCTQISMLEMRNSIASSSDSDDGLHQFLRGTSSMSSLHVSSPHQRASAKMKPIEEGAEDDDDVFEPASPNTLKVHQLP,360,NP_009004.2.csv,refseq-BVES-NM_007073.4_clinical_seed_0_final,refseq-BVES-NM_007073.4.a2m,Invitae,refseq-BVES-NM_007073.4.npy,1,360,360
+NP_009005.1,MSRQVVRSSKFRHVFGQPAKADQCYEDVRVSQTTWDSGFCAVNPKFVALICEASGGGAFLVLPLGKTGRVDKNAPTVCGHTAPVLDIAWCPHNDNVIASGSEDCTVMVWEIPDGGLMLPLREPVVTLEGHTKRVGIVAWHTTAQNVLLSAGCDNVIMVWDVGTGAAMLTLGPEVHPDTIYSVDWSRDGGLICTSCRDKRVRIIEPRKGTVVAEKDRPHEGTRPVRAVFVSEGKILTTGFSRMSERQVALWDTKHLEEPLSLQELDTSSGVLLPFFDPDTNIVYLCGKGDSSIRYFEITSEAPFLHYLSMFSSKESQRGMGYMPKRGLEVNKCEIARFYKLHERRCEPIAMTVPRKSDLFQEDLYPPTAGPDPALTAEEWLGGRDAGPLLISLKDGYVPPKSRELRVNRGLDTGRRRAAPEASGTPSSDAVSRLEEEMRKLQATVQELQKRLDRLEETVQAK,461,NP_009005.1.csv,refseq-CORO1A-NM_007074.3_clinical_seed_0_final,refseq-CORO1A-NM_007074.3.a2m,Invitae,refseq-CORO1A-NM_007074.3.npy,1,461,461
+NP_009006.2,MTQQPLRGVTSLRFNQDQSCFCCAMETGVRIYNVEPLMEKGHLDHEQVGSMGLVEMLHRSNLLALVGGGSSPKFSEISAVLIWDDAREGKDSKEKLVLEFTFTKPVLSVRMRHDKIVIVLKNRIYVYSFPDNPRKLFEFDTRDNPKGLCDLCPSLEKQLLVFPGHKCGSLQLVDLASTKPGTSSAPFTINAHQSDIACVSLNQPGTVVASASQKGTLIRLFDTQSKEKLVELRRGTDPATLYCINFSHDSSFLCASSDKGTVHIFALKDTRLNRRSALARVGKVGPMIGQYVDSQWSLASFTVPAESACICAFGRNTSKNVNSVIAICVDGTFHKYVFTPDGNCNREAFDVYLDICDDDDF,361,NP_009006.2.csv,refseq-WDR45-NM_007075.3_clinical_seed_0_final,refseq-WDR45-NM_007075.3.a2m,Invitae,refseq-WDR45-NM_007075.3.npy,1,361,361
+NP_009008.2,MIKFFLMVNKQGQTRLSKYYEHVDINKRTLLETEVIKSCLSRSNEQCSFIEYKDFKLIYRQYAALFIVVGVNDTENEMAIYEFIHNFVEVLDEYFSRVSELDVSFFNTVFHSTWQMHSGPYQEPIDELPKICSALEPQQTCFSPDSSSFKGAASTTPIY,159,NP_009008.2.csv,refseq-AP4S1-NM_007077.4_clinical_seed_0_final,refseq-AP4S1-NM_007077.4.a2m,Invitae,refseq-AP4S1-NM_007077.4.npy,1,159,159
+NP_009009.1,MSYSVTLTGPGPWGFRLQGGKDFNMPLTISRITPGSKAAQSQLSQGDLVVAIDGVNTDTMTHLEAQNKIKSASYNLSLTLQKSKRPIPISTTAPPVQTPLPVIPHQKDPALDTNGSLVAPSPSPEARASPGTPGTPELRPTFSPAFSRPSAFSSLAEASDPGPPRASLRAKTSPEGARDLLGPKALPGSSQPRQYNNPIGLYSAETLREMAQMYQMSLRGKASGVGLPGGSLPIKDLAVDSASPVYQAVIKSQNKPEDEADEWARRSSNLQSRSFRILAQMTGTEFMQDPDEEALRRSSTPIEHAPVCTSQATTPLLPASAQPPAAASPSAASPPLATAAAHTAIASASTTAPASSPADSPRPQASSYSPAVAASSAPATHTSYSEGPAAPAPKPRVVTTASIRPSVYQPVPASTYSPSPGANYSPTPYTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTPSPAPNYNPAPSVAYSGGPAEPASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYTPAGPQVPPLARGTVQRAERFPASSRTPLCGHCNNVIRGPFLVAMGRSWHPEEFTCAYCKTSLADVCFVEEQNNVYCERCYEQFFAPLCAKCNTKIMGEVMHALRQTWHTTCFVCAACKKPFGNSLFHMEDGEPYCEKDYINLFSTKCHGCDFPVEAGDKFIEALGHTWHDTCFICAVCHVNLEGQPFYSKKDRPLCKKHAHTINL,727,NP_009009.1.csv,refseq-LDB3-NM_007078.3_clinical_seed_0_final,refseq-LDB3-NM_007078.3.a2m,Invitae,refseq-LDB3-NM_007078.3.npy,1,727,727
+NP_009029.3,MAQILPVRFQEHFQLQNLGINPANIGFSTLTMESDKFICIREKVGEQAQVTIIDMSDPMAPIRRPISAESAIMNPASKVIALKAGKTLQIFNIEMKSKMKAHTMAEEVIFWKWVSVNTVALVTETAVYHWSMEGDSQPMKMFDRHTSLVGCQVIHYRTDEYQKWLLLVGISAQQNRVVGAMQLYSVDRKVSQPIEGHAAAFAEFKMEGNAKPATLFCFAVRNPTGGKLHIIEVGQPAAGNQPFVKKAVDVFFPPEAQNDFPVAMQIGAKHGVIYLITKYGYLHLYDLESGVCICMNRISADTIFVTAPHKPTSGIIGVNKKGQVLSVCVEEDNIVNYATNVLQNPDLGLRLAVRSNLAGAEKLFVRKFNTLFAQGSYAEAAKVAASAPKGILRTRETVQKFQSIPAQSGQASPLLQYFGILLDQGQLNKLESLELCHLVLQQGRKQLLEKWLKEDKLECSEELGDLVKTTDPMLALSVYLRANVPSKVIQCFAETGQFQKIVLYAKKVGYTPDWIFLLRGVMKISPEQGLQFSRMLVQDEEPLANISQIVDIFMENSLIQQCTSFLLDALKNNRPAEGLLQTWLLEMNLVHAPQVADAILGNKMFTHYDRAHIAQLCEKAGLLQQALEHYTDLYDIKRAVVHTHLLNPEWLVNFFGSLSVEDSVECLHAMLSANIRQNLQLCVQVASKYHEQLGTQALVELFESFKSYKGLFYFLGSIVNFSQDPDVHLKYIQAACKTGQIKEVERICRESSCYNPERVKNFLKEAKLTDQLPLIIVCDRFGFVHDLVLYLYRNNLQRYIEIYVQKVNPSRTPAVIGGLLDVDCSEEVIKHLIMAVRGQFSTDELVAEVEKRNRLKLLLPWLESQIQEGCEEPATHNALAKIYIDSNNSPECFLRENAYYDSSVVGRYCEKRDPHLACVAYERGQCDLELIKVCNENSLFKSEARYLVCRKDPELWAHVLEETNPSRRQLIDQVVQTALSETRDPEEISVTVKAFMTADLPNELIELLEKIVLDNSVFSEHRNLQNLLILTAIKADRTRVMEYISRLDNYDALDIASIAVSSALYEEAFTVFHKFDMNASAIQVLIEHIGNLDRAYEFAERCNEPAVWSQLAQAQLQKDLVKEAINSYIRGDDPSSYLEVVQSASRSNNWEDLVKFLQMARKKGRESYIETELIFALAKTSRVSELEDFINGPNNAHIQQVGDRCYEEGMYEAAKLLYSNVSNFARLASTLVHLGEYQAAVDNSRKASSTRTWKEVCFACMDGQEFRFAQLCGLHIVIHADELEELMCYYQDRGYFEELILLLEAALGLERAHMGMFTELAILYSKFKPQKMLEHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAVLTMMSHPTEAWKEGQFKDIITKVANVELCYRALQFYLDYKPLLINDLLLVLSPRLDHTWTVSFFSKAGQLPLVKPYLRSVQSHNNKSVNEALNHLLTEEEDYQGLRASIDAYDNFDNISLAQQLEKHQLMEFRCIAAYLYKGNNWWAQSVELCKKDHLYKDAMQHAAESRDAELAQKLLQWFLEEGKRECFAACLFTCYDLLRPDMVLELAWRHNLVDLAMPYFIQVMREYLSKVDKLDALESLRKQEEHVTEPAPLVFDFDGHE,1640,NP_009029.3.csv,refseq-CLTCL1-NM_007098.3_clinical_seed_0_final,refseq-CLTCL1-NM_007098.3.a2m,Invitae,refseq-CLTCL1-NM_007098.3.npy,1,1640,1640
+NP_009034.2,MLATRRLLGWSLPARVSVRFSGDTTAPKKTSFGSLKDEDRIFTNLYGRHDWRLKGSLSRGDWYKTKEILLKGPDWILGEIKTSGLRGRGGAGFPTGLKWSFMNKPSDGRPKYLVVNADEGEPGTCKDREILRHDPHKLLEGCLVGGRAMGARAAYIYIRGEFYNEASNLQVAIREAYEAGLIGKNACGSGYDFDVFVVRGAGAYICGEETALIESIEGKQGKPRLKPPFPADVGVFGCPTTVANVETVAVSPTICRRGGTWFAGFGRERNSGTKLFNISGHVNHPCTVEEEMSVPLKELIEKHAGGVTGGWDNLLAVIPGGSSTPLIPKSVCETVLMDFDALVQAQTGLGTAAVIVMDRSTDIVKAIARLIEFYKHESCGQCTPCREGVDWMNKVMARFVRGDARPAEIDSLWEISKQIEGHTICALGDGAAWPVQGLIRHFRPELEERMQRFAQQHQARQAAS,464,NP_009034.2.csv,refseq-NDUFV1-NM_007103.3_clinical_seed_0_final,refseq-NDUFV1-NM_007103.3.a2m,Invitae,refseq-NDUFV1-NM_007103.3.npy,1,464,464
+NP_009051.1,MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVVLTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNVSLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDLDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,534,NP_009051.1.csv,refseq-UGT1A4-NM_007120.2_clinical_seed_0_final,refseq-UGT1A4-NM_007120.2.a2m,Invitae,refseq-UGT1A4-NM_007120.2.npy,1,534,534
+NP_009057.1,MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPKMDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRVRLGDVISIQPCPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEFKVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLAQIKEMVELPLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGESESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAHVIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQVANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRWALSQSNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFYGPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFFDELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILRPGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRACKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIRKYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG,806,NP_009057.1.csv,refseq-VCP-NM_007126.3_clinical_seed_0_final,refseq-VCP-NM_007126.3.a2m,Invitae,refseq-VCP-NM_007126.3.npy,1,806,806
+NP_009060.2,MLLDAGPQFPAIGVGSFARHHHHSAAAAAAAAAEMQDRELSLAAAQNGFVDSAAAHMGAFKLNPGAHELSPGQSSAFTSQGPGAYPGSAAAAAAAAALGPHAAHVGSYSGPPFNSTRDFLFRSRGFGDSAPGGGQHGLFGPGAGGLHHAHSDAQGHLLFPGLPEQHGPHGSQNVLNGQMRLGLPGEVFGRSEQYRQVASPRTDPYSAAQLHNQYGPMNMNMGMNMAAAAAHHHHHHHHHPGAFFRYMRQQCIKQELICKWIDPEQLSNPKKSCNKTFSTMHELVTHVSVEHVGGPEQSNHVCFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFQCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSPQGSESSPAASSGYESSTPPGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSAVHRGGGSGSGGAGGGSGGGSGSGGGGGGAGGGGGGSSGGGSGTAGGHSGLSSNFNEWYV,532,NP_009060.2.csv,refseq-ZIC2-NM_007129.3_clinical_seed_0_final,refseq-ZIC2-NM_007129.3.a2m,Invitae,refseq-ZIC2-NM_007129.3.npy,1,532,532
+NP_009075.1,MHRTTRIKITELNPHLMCALCGGYFIDATTIVECLHSFCKTCIVRYLETNKYCPMCDVQVHKTRPLLSIRSDKTLQDIVYKLVPGLFKDEMKRRRDFYAAYPLTEVPNGSNEDRGEVLEQEKGALSDDEIVSLSIEFYEGARDRDEKKGPLENGDGDKEKTGVRFLRCPAAMTVMHLAKFLRNKMDVPSKYKVEVLYEDEPLKEYYTLMDIAYIYPWRRNGPLPLKYRVQPACKRLTLATVPTPSEGTNTSGASECESVSDKAPSPATLPATSSSLPSPATPSHGSPSSHGPPATHPTSPTPPSTASGATTAANGGSLNCLQTPSSTSRGRKMTVNGAPVPPLT,344,NP_009075.1.csv,refseq-PCGF2-NM_007144.2_clinical_seed_0_final,refseq-PCGF2-NM_007144.2.a2m,Invitae,refseq-PCGF2-NM_007144.2.npy,1,344,344
+NP_009125.1,MSRESDVEAQQSHGSSACSQPHGSVTQSQGSSSQSQGISSSSTSTMPNSSQSSHSSSGTLSSLETVSTQELYSIPEDQEPEDQEPEEPTPAPWARLWALQDGFANLECVNDNYWFGRDKSCEYCFDEPLLKRTDKYRTYSKKHFRIFREVGPKNSYIAYIEDHSGNGTFVNTELVGKGKRRPLNNNSEIALSLSRNKVFVFFDLTVDDQSVYPKALRDEYIMSKTLGSGACGEVKLAFERKTCKKVAIKIISKRKFAIGSAREADPALNVETEIEILKKLNHPCIIKIKNFFDAEDYYIVLELMEGGELFDKVVGNKRLKEATCKLYFYQMLLAVQYLHENGIIHRDLKPENVLLSSQEEDCLIKITDFGHSKILGETSLMRTLCGTPTYLAPEVLVSVGTAGYNRAVDCWSLGVILFICLSGYPPFSEHRTQVSLKDQITSGKYNFIPEVWAEVSEKALDLVKKLLVVDPKARFTTEEALRHPWLQDEDMKRKFQDLLSEENESTALPQVLAQPSTSRKRPREGEAEGAETTKRPAVCAAVL,543,NP_009125.1.csv,refseq-CHEK2-NM_007194.3_clinical_seed_0_final,refseq-CHEK2-NM_007194.3.a2m,Invitae,refseq-CHEK2-NM_007194.3.npy,1,543,543
+NP_009129.1,MWRAGSMSAELGVGCALRAVNERVQQAVARRPRDLPAIQPRLVAVSKTKPADMVIEAYGHGQRTFGENYVQELLEKASNPKILSLCPEIKWHFIGHLQKQNVNKLMAVPNLFMLETVDSVKLADKVNSSWQRKGSPERLKVMVQINTSGEESKHGLPPSETIAIVEHINAKCPNLEFVGLMTIGSFGHDLSQGPNPDFQLLLSLREELCKKLNIPADQVELSMGMSADFQHAVEVGSTNVRIGSTIFGERDYSKKPTPDKCAADVKAPLEVAQEH,275,NP_009129.1.csv,refseq-PLPBP-NM_007198.3_clinical_seed_0_final,refseq-PLPBP-NM_007198.3.a2m,Invitae,refseq-PLPBP-NM_007198.3.npy,1,275,275
+NP_009139.1,MPGWRLLTQVGAQVLGRLGDGLGAALGPGNRTHIWLFVRGLHGKSGTWWDEHLSEENVPFIKQLVSDEDKAQLASKLCPLKDEPWPIHPWEPGSFRVGLIALKLGMMPLWTKDGQKHVVTLLQVQDCHVLKYTSKENCNGKMATLSVGGKTVSRFRKATSILEFYRELGLPPKQTVKIFNITDNAAIKPGTPLYAAHFRPGQYVDVTAKTIGKGFQGVMKRWGFKGQPATHGQTKTHRRPGAVATGDIGRVWPGTKMPGKMGNIYRTEYGLKVWRINTKHNIIYVNGSVPGHKNCLVKVKDSKLPAYKDLGKNLPFPTYFPDGDEEELPEDLYDENVCQPGAPSITFA,348,NP_009139.1.csv,refseq-MRPL3-NM_007208.3_clinical_seed_0_final,refseq-MRPL3-NM_007208.3.a2m,Invitae,refseq-MRPL3-NM_007208.3.npy,1,348,348
+NP_009146.2,MRSRVAVRACHKVCRCLLSGFGGRVDAGQPELLTERSSPKGGHVKSHAELEGNGEHPEAPGSGEGSEALLEICQRRHFLSGSKQQLSRDSLLSGCHPGFGPLGVELRKNLAAEWWTSVVVFREQVFPVDALHHKPGPLLPGDSAFRLVSAETLREILQDKELSKEQLVAFLENVLKTSGKLRENLLHGALEHYVNCLDLVNKRLPYGLAQIGVCFHPVFDTKQIRNGVKSIGEKTEASLVWFTPPRTSNQWLDFWLRHRLQWWRKFAMSPSNFSSSDCQDEEGRKGNKLYYNFPWGKELIETLWNLGDHELLHMYPGNVSKLHGRDGRKNVVPCVLSVNGDLDRGMLAYLYDSFQLTENSFTRKKNLHRKVLKLHPCLAPIKVALDVGRGPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETTLENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV,485,NP_009146.2.csv,refseq-POLG2-NM_007215.3_clinical_seed_0_final,refseq-POLG2-NM_007215.3.a2m,Invitae,refseq-POLG2-NM_007215.3.npy,1,485,485
+NP_009185.2,MGEVEAPGRLWLESPPGGAPPIFLPSDGQALVLGRGPLTQVTDRKCSRTQVELVADPETRTVAVKQLGVNPSTTGTQELKPGLEGSLGVGDTLYLVNGLHPLTLRWEETRTPESQPDTPPGTPLVSQDEKRDAELPKKRMRKSNPGWENLEKLLVFTAAGVKPQGKVAGFDLDGTLITTRSGKVFPTGPSDWRILYPEIPRKLRELEAEGYKLVIFTNQMSIGRGKLPAEEFKAKVEAVVEKLGVPFQVLVATHAGLYRKPVTGMWDHLQEQANDGTPISIGDSIFVGDAAGRPANWAPGRKKKDFSCADRLFALNLGLPFATPEEFFLKWPAAGFELPAFDPRTVSRSGPLCLPESRALLSASPEVVVAVGFPGAGKSTFLKKHLVSAGYVHVNRDTLGSWQRCVTTCETALKQGKRVAIDNTNPDAASRARYVQCARAAGVPCRCFLFTATLEQARHNNRFREMTDSSHIPVSDMVMYGYRKQFEAPTLAEGFSAILEIPFRLWVEPRLGRLYCQFSEG,521,NP_009185.2.csv,refseq-PNKP-NM_007254.3_clinical_seed_0_final,refseq-PNKP-NM_007254.3.a2m,Invitae,refseq-PNKP-NM_007254.3.npy,1,521,521
+NP_009186.1,MFPSRRKAAQLPWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNSTDYIAMHDVDLLPLNEELDYGFPEAGPFHVASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGLQLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTALSVGGAPCTVLNIMLDCDKTATPWCTFS,327,NP_009186.1.csv,refseq-B4GALT7-NM_007255.2_clinical_seed_0_final,refseq-B4GALT7-NM_007255.2.a2m,Invitae,refseq-B4GALT7-NM_007255.2.npy,1,327,327
+NP_009190.2,MNVVFAVKQYISKMIEDSGPGMKVLLMDKETTGIVSMVYTQSEILQKEVYLFERIDSQNREIMKHLKAICFLRPTKENVDYIIQELRRPKYTIYFIYFSNVISKSDVKSLAEADEQEVVAEVQEFYGDYIAVNPHLFSLNILGCCQGRNWDPAQLSRTTQGLTALLLSLKKCPMIRYQLSSEAAKRLAECVKQVITKEYELFEFRRTEVPPLLLILDRCDDAITPLLNQWTYQAMVHELLGINNNRIDLSRVPGISKDLREVVLSAENDEFYANNMYLNFAEIGSNIKNLMEDFQKKKPKEQQKLESIADMKAFVENYPQFKKMSGTVSKHVTVVGELSRLVSERNLLEVSEVEQELACQNDHSSALQNIKRLLQNPKVTEFDAARLVMLYALHYERHSSNSLPGLMMDLRNKGVSEKYRKLVSAVVEYGGKRVRGSDLFSPKDAVAITKQFLKGLKGVENVYTQHQPFLHETLDHLIKGRLKENLYPYLGPSTLRDRPQDIIVFVIGGATYEEALTVYNLNRTTPGVRIVLGGTTVHNTKSFLEEVLASGLHSRSKESSQVTSRSASRR,570,NP_009190.2.csv,VPS45_HUMAN_b01_clinical_seed_0_final,VPS45_HUMAN_b01.a2m,EVE,VPS45_HUMAN_b01_theta_0.2.npy,1,570,570
+NP_009193.2,MASKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLEDAKKEGPYDVVVLPGGNLGAQNLSESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSKVTTHPLAKDKMMNGGHYTYSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQVKAPLVLKD,189,NP_009193.2.csv,refseq-PARK7-NM_007262.4_clinical_seed_0_final,refseq-PARK7-NM_007262.4.a2m,Invitae,refseq-PARK7-NM_007262.4.npy,1,189,189
+NP_009203.2,MLGITVLAALLACASSCGVPSFPPNLSARVVGGEDARPHSWPWQISLQYLKNDTWRHTCGGTLIASNFVLTAAHCISNTRTYRVAVGKNNLEVEDEEGSLFVGVDTIHVHKRWNALLLRNDIALIKLAEHVELSDTIQVACLPEKDSLLPKDYPCYVTGWGRLWTNGPIADKLQQGLQPVVDHATCSRIDWWGFRVKKTMVCAGGDGVISACNGDSGGPLNCQLENGSWEVFGIVSFGSRRGCNTRKKPVVYTRVSAYIDWINEKMQL,268,NP_009203.2.csv,refseq-CTRC-NM_007272.2_clinical_seed_0_final,refseq-CTRC-NM_007272.2.a2m,Invitae,refseq-CTRC-NM_007272.2.npy,1,268,268
+NP_009220.2,MGKSESQMDITDINTPKPKKKQRWTPLEISLSVLVLLLTIIAVTMIALYATYDDGICKSSDCIKSAARLIQNMDATTEPCTDFFKYACGGWLKRNVIPETSSRYGNFDILRDELEVVLKDVLQEPKTEDIVAVQKAKALYRSCINESAIDSRGGEPLLKLLPDIYGWPVATENWEQKYGASWTAEKAIAQLNSKYGKKVLINLFVGTDDKNSVNHVIHIDQPRLGLPSRDYYECTGIYKEACTAYVDFMISVARLIRQEERLPIDENQLALEMNKVMELEKEIANATAKPEDRNDPMLLYNKMTLAQIQNNFSLEINGKPFSWLNFTNEIMSTVNISITNEEDVVVYAPEYLTKLKPILTKYSARDLQNLMSWRFIMDLVSSLSRTYKESRNAFRKALYGTTSETATWRRCANYVNGNMENAVGRLYVEAAFAGESKHVVEDLIAQIREVFIQTLDDLTWMDAETKKRAEEKALAIKERIGYPDDIVSNDNKLNNEYLELNYKEDEYFENIIQNLKFSQSKQLKKLREKVDKDEWISGAAVVNAFYSSGRNQIVFPAGILQPPFFSAQQSNSLNYGGIGMVIGHEITHGFDDNGRNFNKDGDLVDWWTQQSASNFKEQSQCMVYQYGNFSWDLAGGQHLNGINTLGENIADNGGLGQAYRAYQNYIKKNGEEKLLPGLDLNHKQLFFLNFAQVWCGTYRPEYAVNSIKTDVHSPGNFRIIGTLQNSAEFSEAFHCRKNSYMNPEKKCRVW,750,NP_009220.2.csv,refseq-MME-NM_007289.2_clinical_seed_0_final,refseq-MME-NM_007289.2.a2m,Invitae,refseq-MME-NM_007289.2.npy,1,750,750
+NP_009225.1,MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQCPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYANSYNFAKKENNSPEHLKDEVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLGTVRTLRTKQRIQPQKTSVYIELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISLDSAKKAACEFSETDVTNTEHHQPSNNDLNTTEKRAAERHPEKYQGSSVSNLHVEPCGTNTHASSLQHENSSLLLTKDRMNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTPSTEKKVDLNADPLCERKEWNKQKLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDELLGSDDSHDGESESNAKVADVLDVLNEVDEYSGSSEKIDLLASDPHEALICKSERVHSKSVESNIEDKIFGKTYRKKASLPNLSHVTENLIIGAFVTEPQIIQERPLTNKLKRKRRPTSGLHPEDFIKKADLAVQKTPEMINQGTNQTEQNGQVMNITNSGHENKTKGDSIQNEKNPNPIESLEKESAFKTKAEPISSSISNMELELNIHNSKAPKKNRLRRKSSTRHIHALELVVSRNLSPPNCTELQIDSCSSSEEIKKKKYNQMPVRHSRNLQLMEGKEPATGAKKSNKPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTSELKEFVNPSLPREEKEEKLETVKVSNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTDYGTQESISLLEVSTLGKAKTEPNKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHEVNHSRETSIEMEESELDAQYLQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQSPKVTFECEQKEENQGKNESNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSSQFRGNETGLITPNKHGLLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGNENIPSTVSTISRNNIRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNRGPKLNAMLRLGVLQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQPMGSSHASQVCSETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHTHLAQGYRRGAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNTEENLLSLKNSLNDCSNQVILAKASQEHHLSEETKCSASLFSSQCSELEDLTANTNTQDPFLIGSSKQMRHQSESQGVGLSDKELVSDDEERGTGLEENNQEEQSMDSNLGEAASGCESETSVSEDCSGLSSQSDILTTQQRDTMQHNLIKLQQEMAELEAVLEQHGSQPSNSYPSIISDSSALEDLRNPEQSTSEKAVLTSQKSSEYPISQNPEGLSADKFEVSADSSTSKNKEPGVERSSPSKCPSLDDRWYMHSCSGSLQNRNYPSQEELIKVVDVEEQQLEESGPHDLTETSYLPRQDLEGTPYLESGISLFSDDPESDPSEDRAPESARVGNIPSSTSALKVPQLKVAESAQSPAAAHTTDTAGYNAMEESVSREKPELTASTERVNKRMSMVVSGLTPEEFMLVYKFARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYFWVTQSIKERKMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMVQLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSVALYQCQELDTYLIPQIPHSHY,1863,NP_009225.1.csv,refseq-BRCA1-NM_007294.3_clinical_seed_0_final,refseq-BRCA1-NM_007294.3.a2m,Invitae,refseq-BRCA1-NM_007294.3.npy,1,1863,1863
+NP_009297.2,MGQQPGKVLGDQRRPSLPALHFIKGAGKKESSRHGGPHCNVFVEHEALQRPVASDFEPQGLSEAARWNSKENLLAGPSENDPNLFVALYDFVASGDNTLSITKGEKLRVLGYNHNGEWCEAQTKNGQGWVPSNYITPVNSLEKHSWYHGPVSRNAAEYLLSSGINGSFLVRESESSPGQRSISLRYEGRVYHYRINTASDGKLYVSSESRFNTLAELVHHHSTVADGLITTLHYPAPKRNKPTVYGVSPNYDKWEMERTDITMKHKLGGGQYGEVYEGVWKKYSLTVAVKTLKEDTMEVEEFLKEAAVMKEIKHPNLVQLLGVCTREPPFYIITEFMTYGNLLDYLRECNRQEVNAVVLLYMATQISSAMEYLEKKNFIHRDLAARNCLVGENHLVKVADFGLSRLMTGDTYTAHAGAKFPIKWTAPESLAYNKFSIKSDVWAFGVLLWEIATYGMSPYPGIDLSQVYELLEKDYRMERPEGCPEKVYELMRACWQWNPSDRPSFAEIHQAFETMFQESSISDEVEKELGKQGVRGAVSTLLQAPELPTKTRTSRRAAEHRDTTDVPEMPHSKGQGESDPLDHEPAVSPLLPRKERGPPEGGLNEDERLLPKDKKTNLFSALIKKKKKTAPTPPKRSSSFREMDGQPERRGAGEEEGRDISNGALAFTPLDTADPAKSPKPSNGAGVPNGALRESGGSGFRSPHLWKKSSTLTSSRLATGEEEGGGSSSKRFLRSCSASCVPHGAKDTEWRSVTLPRDLQSTGRQFDSSTFGGHKSEKPALPRKRAGENRSDQVTRGTVTPPPRLVKKNEEAADEVFKDIMESSPGSSPPNLTPKPLRRQVTVAPASGLPHKEEAGKGSALGTPAAAEPVTPTSKAGSGAPGGTSKGPAEESRVRRHKHSSESPGRDKGKLSRLKPAPPPPPAASAGKAGGKPSQSPSQEAAGEAVLGAKTKATSLVDAVNSDAAKPSQPGEGLKKPVLPATPKPQSAKPSGTPISPAPVPSTLPSASSALAGDQPSSTAFIPLISTRVSLRKTRQPPERIASGAITKGVVLDSTEALCLAISRNSEQMASHSAVLEAGKNLYTFCVSYVDSIQQMRNKFAFREAINKLENNLRELQICPATAGSGPAATQDFSKLLSSVKEISDIVQR,1149,NP_009297.2.csv,refseq-ABL1-NM_007313.2_clinical_seed_0_final,refseq-ABL1-NM_007313.2.a2m,Invitae,refseq-ABL1-NM_007313.2.npy,1,1149,1149
+NP_009330.1,MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEHAANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMIIYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSLEDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLNVTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLEELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQFTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNMEESTNGSLAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVISNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSVTKRGLNVDQLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHLLPLWNDGCIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYTKKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELDGPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV,750,NP_009330.1.csv,refseq-STAT1-NM_007315.3_clinical_seed_0_final,refseq-STAT1-NM_007315.3.a2m,Invitae,refseq-STAT1-NM_007315.3.npy,1,750,750
+NP_015556.1,MAAGGSTQQRRREMAAASAAAISGAGRCRLSKIGATRRPPPARVRVAVRLRPFVDGTAGASDPPCVRGMDSCSLEIANWRNHQETLKYQFDAFYGERSTQQDIYAGSVQPILRHLLEGQNASVLAYGPTGAGKTHTMLGSPEQPGVIPRALMDLLQLTREEGAEGRPWALSVTMSYLEIYQEKVLDLLDPASGDLVIREDCRGNILIPGLSQKPISSFADFERHFLPASRNRTVGATRLNQRSSRSHAVLLVKVDQRERLAPFRQREGKLYLIDLAGSEDNRRTGNKGLRLKESGAINTSLFVLGKVVDALNQGLPRVPYRDSKLTRLLQDSLGGSAHSILIANIAPERRFYLDTVSALNFAARSKEVINRPFTNESLQPHALGPVKLSQKELLGPPEAKRARGPEEEEIGSPEPMAAPASASQKLSPLQKLSSMDPAMLERLLSLDRLLASQGSQGAPLLSTPKRERMVLMKTVEEKDLEIERLKTKQKELEAKMLAQKAEEKENHCPTMLRPLSHRTVTGAKPLKKAVVMPLQLIQEQAASPNAEIHILKNKGRKRKLESLDALEPEEKAEDCWELQISPELLAHGRQKILDLLNEGSARDLRSLQRIGPKKAQLIVGWRELHGPFSQVEDLERVEGITGKQMESFLKANILGLAAGQRCGAS,665,NP_015556.1.csv,refseq-KIF22-NM_007317.2_clinical_seed_0_final,refseq-KIF22-NM_007317.2.a2m,Invitae,refseq-KIF22-NM_007317.2.npy,1,665,665
+NP_015564.5,MARQKKMGQSVLRAVFFLVLGLLGHSHGGFPNTISIGGLFMRNTVQEHSAFRFAVQLYNTNQNTTEKPFHLNYHVDHLDSSNSFSVTNAFCSQFSRGVYAIFGFYDQMSMNTLTSFCGALHTSFVTPSFPTDADVQFVIQMRPALKGAILSLLGHYKWEKFVYLYDTERGFSILQAIMEAAVQNNWQVTARSVGNIKDVQEFRRIIEEMDRRQEKRYLIDCEVERINTILEQVVILGKHSRGYHYMLANLGFTDILLERVMHGGANITGFQIVNNENPMVQQFIQRWVRLDEREFPEAKNAPLKYTSALTHDAILVIAEAFRYLRRQRVDVSRRGSAGDCLANPAVPWSQGIDIERALKMVQVQGMTGNIQFDTYGRRTNYTIDVYEMKVSGSRKAGYWNEYERFVPFSDQQISNDSASSENRTIVVTTILESPYVMYKKNHEQLEGNERYEGYCVDLAYEIAKHVRIKYKLSIVGDGKYGARDPETKIWNGMVGELVYGRADIAVAPLTITLVREEVIDFSKPFMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWHLEDNNEEPRDPQSPPDPPNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTLDSGSTKEFFRRSKIAVYEKMWSYMKSAEPSVFTKTTADGVARVRKSKGKFAFLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGVATPKGSALRTPVNLAVLKLSEQGILDKLKNKWWYDKGECGAKDSGSKDKTSALSLSNVAGVFYILVGGLGLAMMVALIEFCYKSRAESKRMKLTKNTQNFKPAPATNTQNYATYREGYNVYGTESVKI,894,NP_015564.5.csv,NP_015564.5_colabfold_clinical_seed_0_final,NP_015564.5_colabfold.a2m,colabfold,NP_015564.5_colabfold_theta_0.2.npy,1,894,894
+NP_015566.1,MSTMRLLTLALLFSCSVARAACDPKIVNIGAVLSTRKHEQMFREAVNQANKRHGSWKIQLNATSVTHKPNAIQMALSVCEDLISSQVYAILVSHPPTPNDHFTPTPVSYTAGFYRIPVLGLTTRMSIYSDKSIHLSFLRTVPPYSHQSSVWFEMMRVYSWNHIILLVSDDHEGRAAQKRLETLLEERESKAEKVLQFDPGTKNVTALLMEAKELEARVIILSASEDDAATVYRAAAMLNMTGSGYVWLVGEREISGNALRYAPDGILGLQLINGKNESAHISDAVGVVAQAVHELLEKENITDPPRGCVGNTNIWKTGPLFKRVLMSSKYADGVTGRVEFNEDGDRKFANYSIMNLQNRKLVQVGIYNGTHVIPNDRKIIWPGGETEKPRGYQMSTRLKIVTIHQEPFVYVKPTLSDGTCKEEFTVNGDPVKKVICTGPNDTSPGSPRHTVPQCCYGFCIDLLIKLARTMNFTYEVHLVADGKFGTQERVNNSNKKEWNGMMGELLSGQADMIVAPLTINNERAQYIEFSKPFKYQGLTILVKKEIPRSTLDSFMQPFQSTLWLLVGLSVHVVAVMLYLLDRFSPFGRFKVNSEEEEEDALTLSSAMWFSWGVLLNSGIGEGAPRSFSARILGMVWAGFAMIIVASYTANLAAFLVLDRPEERITGINDPRLRNPSDKFIYATVKQSSVDIYFRRQVELSTMYRHMEKHNYESAAEAIQAVRDNKLHAFIWDSAVLEFEASQKCDLVTTGELFFRSGFGIGMRKDSPWKQNVSLSILKSHENGFMEDLDKTWVRYQECDSRSNAPATLTFENMAGVFMLVAGGIVAGIFLIFIEIAYKRHKDARRKQMQLAFAAVNVWRKNLQDRKSGRAEPDPKKKATFRAITSTLASSFKRRRSSKDTSTGGGRGALQNQKDTVLPRRAIEREEGQLQLCSRHRES,938,NP_015566.1.csv,refseq-GRIN1-NM_007327.3_clinical_seed_0_final,refseq-GRIN1-NM_007327.3.a2m,Invitae,refseq-GRIN1-NM_007327.3.npy,1,938,938
+NP_031374.2,MGEPAGVAGTMESPFSPGLFHRLDEDWDSALFAELGYFTDTDELQLEAANETYENNFDNLDFDLDLMPWESDIWDINNQICTVKDIKAEPQPLSPASSSYSVSSPRSVDSYSSTQHVPEELDLSSSSQMSPLSLYGENSNSLSSAEPLKEDKPVTGPRNKTENGLTPKKKIQVNSKPSIQPKPLLLPAAPKTQTNSSVPAKTIIIQTVPTLMPLAKQQPIISLQPAPTKGQTVLLSQPTVVQLQAPGVLPSAQPVLAVAGGVTQLPNHVVNVVPAPSANSPVNGKLSVTKPVLQSTMRNVGSDIAVLRRQQRMIKNRESACQSRKKKKEYMLGLEARLKAALSENEQLKKENGTLKRQLDEVVSENQRLKVPSPKRRVVCVMIVLAFIILNYGPMSMLEQDSRRMNPSVSPANQRRHLLGFSAKEAQDTSDGIIQKNSYRYDHSVSNDKALMVLTEEPLLYIPPPPCQPLINTTESLRLNHELRGWVHRHEVERTKSRRMTNNQQKTRILQGALEQGSNSQLMAVQYTETTSSISRNSGSELQVYYASPRSYQDFFEAIRRRGDTFYVVSFRRDHLLLPATTHNKTTRPKMSIVLPAININENVINGQDYEVMMQIDCQVMDTRILHIKSSSVPPYLRDQQRNQTNTFFGSPPAATEATHVVSTIPESLQ,670,NP_031374.2.csv,refseq-ATF6-NM_007348.3_clinical_seed_0_final,refseq-ATF6-NM_007348.3.a2m,Invitae,refseq-ATF6-NM_007348.3.npy,1,670,670
+NP_031399.2,MSSSLGKEKDSKEKDPKVPSAKEREKEAKASGGFGKESKEKEPKTKGKDAKDGKKDSSAAQPGVAFSVDNTIKRPNPAPGTRKKSSNAEVIKELNKCREENSMRLDLSKRSIHILPSSIKELTQLTELYLYSNKLQSLPAEVGCLVNLMTLALSENSLTSLPDSLDNLKKLRMLDLRHNKLREIPSVVYRLDSLTTLYLRFNRITTVEKDIKNLSKLSMLSIRENKIKQLPAEIGELCNLITLDVAHNQLEHLPKEIGNCTQITNLDLQHNELLDLPDTIGNLSSLSRLGLRYNRLSAIPRSLAKCSALEELNLENNNISTLPESLLSSLVKLNSLTLARNCFQLYPVGGPSQFSTIYSLNMEHNRINKIPFGIFSRAKVLSKLNMKDNQLTSLPLDFGTWTSMVELNLATNQLTKIPEDVSGLVSLEVLILSNNLLKKLPHGLGNLRKLRELDLEENKLESLPNEIAYLKDLQKLVLTNNQLTTLPRGIGHLTNLTHLGLGENLLTHLPEEIGTLENLEELYLNDNPNLHSLPFELALCSKLSIMSIENCPLSHLPPQIVAGGPSFIIQFLKMQGPYRAMV,582,NP_031399.2.csv,refseq-SHOC2-NM_007373.3_clinical_seed_0_final,refseq-SHOC2-NM_007373.3.a2m,Invitae,refseq-SHOC2-NM_007373.3.npy,1,582,582
+NP_031400.2,MFQLPILNFSPQQVAGVCETLEESGDVERLGRFLWSLPVAPAACEALNKNESVLRARAIVAFHGGNYRELYHILENHKFTKESHAKLQALWLEAHYQEAEKLRGRPLGPVDKYRVRKKFPLPRTIWDGEQKTHCFKERTRHLLREWYLQDPYPNPSKKRELAQATGLTPTQVGNWFKNRRQRDRAAAAKNRLQQQVLSQGSGRALRAEGDGTPEVLGVATSPAASLSSKAATSAISITSSDSECDI,246,NP_031400.2.csv,refseq-SIX6-NM_007374.2_clinical_seed_0_final,refseq-SIX6-NM_007374.2.a2m,Invitae,refseq-SIX6-NM_007374.2.npy,1,246,246
+NP_031401.1,MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYRNPVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQKTSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVKVMSQRHMIDGRWCDCKLPNSKQSQDEPLRSRKVFVGRCTEDMTEDELREFFSQYGDVMDVFIPKPFRAFAFVTFADDQIAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRFGGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSWGMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAGSGSGFNGGFGSSMDSKSSGWGM,414,NP_031401.1.csv,refseq-TARDBP-NM_007375.3_clinical_seed_0_final,refseq-TARDBP-NM_007375.3.a2m,Invitae,refseq-TARDBP-NM_007375.3.npy,1,414,414
+NP_036192.2,MEALIPVINKLQDVFNTVGADIIQLPQIVVVGTQSSGKSSVLESLVGRDLLPRGTGIVTRRPLILQLVHVSQEDKRKTTGEENGVEAEEWGKFLHTKNKLYTDFDEIRQEIENETERISGNNKGVSPEPIHLKIFSPNVVNLTLVDLPGMTKVPVGDQPKDIELQIRELILRFISNPNSIILAVTAANTDMATSEALKISREVDPDGRRTLAVITKLDLMDAGTDAMDVLMGRVIPVKLGIIGVVNRSQLDINNKKSVTDSIRDEYAFLQKKYPSLANRNGTKYLARTLNRLLMHHIRDCLPELKTRINVLAAQYQSLLNSYGEPVDDKSATLLQLITKFATEYCNTIEGTAKYIETSELCGGARICYIFHETFGRTLESVDPLGGLNTIDILTAIRNATGPRPALFVPEVSFELLVKRQIKRLEEPSLRCVELVHEEMQRIIQHCSNYSTQELLRFPKLHDAIVEVVTCLLRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLMNNNIEEQRRNRLARELPSAVSRDKSSKVPSALAPASQEPSPAASAEADGKLIQDSRRETKNVASGGGGVGDGVQEPTTGNWRGMLKTSKAEELLAEEKSKPIPIMPASPQKGHAVNLLDVPVPVARKLSAREQRDCEVIERLIKSYFLIVRKNIQDSVPKAVMHFLVNHVKDTLQSELVGQLYKSSLLDDLLTESEDMAQRRKEAADMLKALQGASQIIAEIRETHLW,736,NP_036192.2.csv,refseq-DNM1L-NM_012062.4_clinical_seed_0_final,refseq-DNM1L-NM_012062.4.a2m,Invitae,refseq-DNM1L-NM_012062.4.npy,1,736,736
+NP_036196.1,MWELRSASFWRAIFAEFFATLFYVFFGLGSSLRWAPGPLHVLQVAMAFGLALATLVQSVGHISGAHVNPAVTFAFLVGSQMSLLRAFCYMAAQLLGAVAGAAVLYSVTPPAVRGNLALNTLHPAVSVGQATTVEIFLTLQFVLCIFATYDERRNGQLGSVALAVGFSLALGHLFGMYYTGAGMNPARSFAPAILTGNFTNHWVYWVGPIIGGGLGSLLYDFLLFPRLKSISERLSVLKGAKPDVSNGQPEVTGEPVELNTQAL,263,NP_036196.1.csv,refseq-MIP-NM_012064.3_clinical_seed_0_final,refseq-MIP-NM_012064.3.a2m,Invitae,refseq-MIP-NM_012064.3.npy,1,263,263
+NP_036205.1,MASMGTLAFDEYGRPFLIIKDQDRKSRLMGLEALKSHIMAAKAVANTMRTSLGPNGLDKMMVDKDGDVTVTNDGATILSMMDVDHQIAKLMVELSKSQDDEIGDGTTGVVVLAGALLEEAEQLLDRGIHPIRIADGYEQAARVAIEHLDKISDSVLVDIKDTEPLIQTAKTTLGSKVVNSCHRQMAEIAVNAVLTVADMERRDVDFELIKVEGKVGGRLEDTKLIKGVIVDKDFSHPQMPKKVEDAKIAILTCPFEPPKPKTKHKLDVTSVEDYKALQKYEKEKFEEMIQQIKETGANLAICQWGFDDEANHLLLQNNLPAVRWVGGPEIELIAIATGGRIVPRFSELTAEKLGFAGLVQEISFGTTKDKMLVIEQCKNSRAVTIFIRGGNKMIIEEAKRSLHDALCVIRNLIRDNRVVYGGGAAEISCALAVSQEADKCPTLEQYAMRAFADALEVIPMALSENSGMNPIQTMTEVRARQVKEMNPALGIDCLHKGTNDMKQQHVIETLIGKKQQISLATQMVRMILKIDDIRKPGESEE,541,NP_036205.1.csv,refseq-CCT5-NM_012073.3_clinical_seed_0_final,refseq-CCT5-NM_012073.3.a2m,Invitae,refseq-CCT5-NM_012073.3.npy,1,541,541
+NP_036211.2,MGDRGSSRRRRTGSRPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSLFLKDPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPAAVVLLVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWMVPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKWMARTGVFLASAFFHEYLVSVPLRMFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVLNYEAPAAEA,488,NP_036211.2.csv,refseq-DGAT1-NM_012079.4_clinical_seed_0_final,refseq-DGAT1-NM_012079.4.a2m,Invitae,refseq-DGAT1-NM_012079.4_theta_0.2.npy,1,488,488
+NP_036214.2,MSRRKQSKPRQIKRPLEDAIEDEEEECPSEETDIISKGDFPLEESFSTEFGPENLSCEEVEYFCNKGDDEGIQETAESDGDTQSEKPGQPGVETDDWDGPGELEVFQKDGERKIQSRQQLPVGTTWGPFPGKMDLNNNSLKTKAQVPMVLTAGPKWLLDVTWQGVEDNKNNCIVYSKGGQLWCTTTKAISEGEELIAFVVDFDSRLQAASQMTLTEGMYPARLLDSIQLLPQQAAMASILPTAIVNKDIFPCKSCGIWYRSERNLQAHLMYYCSGRQREAAPVSEENEDSAHQISSLCPFPQCTKSFSNARALEMHLNSHSGVKMEEFLPPGASLKCTVCSYTADSVINFHQHLFSHLTQAAFRCNHCHFGFQTQRELLQHQELHVPSGKLPRESDMEHSPSATEDSLQPATDLLTRSELPQSQKAMQTKDASSDTELDKCEKKTQLFLTNQRPEIQPTTNKQSFSYTKIKSEPSSPRLASSPVQPNIGPSFPVGPFLSQFSFPQDITMVPQASEILAKMSELVHRRLRHGSSSYPPVIYSPLMPKGATCFECNITFNNLDNYLVHKKHYCSSRWQQMAKSPEFPSVSEKMPEALSPNTGQTSINLLNPAAHSADPENPLLQTSCINSSTVLDLIGPNGKGHDKDFSTQTKKLSTSSNNDDKINGKPVDVKNPSVPLVDGESDPNKTTCEACNITFSRHETYMVHKQYYCATRHDPPLKRSASNKVPAMQRTMRTRKRRKMYEMCLPEQEQRPPLVQQRFLDVANLNNPCTSTQEPTEGLGECYHPRCDIFPGIVSKHLETSLTINKCVPVSKCDTTHSSVSCLEMDVPIDLSKKCLSQSERTTTSPKRLLDYHECTVCKISFNKVENYLAHKQNFCPVTAHQRNDLGQLDGKVFPNPESERNSPDVSYERSIIKCEKNGNLKQPSPNGNLFSSHLATLQGLKVFSEAAQLIATKEENRHLFLPQCLYPGAIKKAKGADQLSPYYGIKPSDYISGSLVIHNTDIEQSRNAENESPKGQASSNGCAALKKDSLPLLPKNRGMVIVNGGLKQDERPAANPQQENISQNPQHEDDHKSPSWISENPLAANENVSPGIPSAEEQLSSIAKGVNGSSQAPTSGKYCRLCDIQFNNLSNFITHKKFYCSSHAAEHVK,1151,NP_036214.2.csv,refseq-ZFPM2-NM_012082.3_clinical_seed_0_final,refseq-ZFPM2-NM_012082.3.a2m,Invitae,refseq-ZFPM2-NM_012082.3.npy,1,1151,1151
+NP_036234.3,MTADKDKDKDKEKDRDRDRDREREKRDKARESENSRPRRSCTLEGGAKNYAESDHSEDEDNDNNSATAEESTKKNKKKPPKKKSRYERTDTGEITSYITEDDVVYRPGDCVYIESRRPNTPYFICSIQDFKLVHNSQACCRSPTPALCDPPACSLPVASQPPQHLSEAGRGPVGSKRDHLLMNVKWYYRQSEVPDSVYQHLVQDRHNENDSGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFKARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMPGVNDCDLLMYLRAARSMAAFAGMCDGGSTEDGCVAASRDDTTLNALNTLHESGYDAGKALQRLVKKPVPKLIEKCWTEDEVKRFVKGLRQYGKNFFRIRKELLPNKETGELITFYYYWKKTPEAASSRAHRRHRRQAVFRRIKTRTASTPVNTPSRPPSSEFLDLSSASEDDFDSEDSEQELKGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQNLPPPPASHPPTGLHQVAPQPPFAQHPFVPGGPPPITPPTCPSTSTPPAGPGTSAQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEPTVVDTPSHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPTAGPHPFASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQL,1566,NP_036234.3.csv,refseq-RERE-NM_012102.3_clinical_seed_0_final,refseq-RERE-NM_012102.3.a2m,Invitae,refseq-RERE-NM_012102.3.npy,1,1566,1566
+NP_036240.1,MMAKKPPKPAPRRIFQERLKITALPLYFEGFLLIKRSGYREYEHYWTELRGTTLFFYTDKKSIIYVDKLDIVDLTCLTEQNSTEKNCAKFTLVLPKEEVQLKTENTESGEEWRGFILTVTELSVPQNVSLLPGQVIKLHEVLEREKKRRIETEQSTSVEKEKEPTEDYVDVLNPMPACFYTVSRKEATEMLQKNPSLGNMILRPGSDSRNYSITIRQEIDIPRIKHYKVMSVGQNYTIELEKPVTLPNLFSVIDYFVKETRGNLRPFICSTDENTGQEPSMEGRSEKLKKNPHIA,295,NP_036240.1.csv,refseq-STAP1-NM_012108.3_clinical_seed_0_final,refseq-STAP1-NM_012108.3.a2m,Invitae,refseq-STAP1-NM_012108.3.npy,1,295,295
+NP_036255.2,MFYFRGCGRWVAVSFTKQQFPLARLSSDSAAPRTPHFDVIVIGGGHAGTEAATAAARCGSRTLLLTHRVDTIGQMSCNPSFGGIGKGHLMREVDALDGLCSRICDQSGVHYKVLNRRKGPAVWGLRAQIDRKLYKQNMQKEILNTPLLTVQEGAVEDLILTEPEPEHTGKCRVSGVVLVDGSTVYAESVILTTGTFLRGMIVIGLETHPAGRLGDQPSIGLAQTLEKLGFVVGRLKTGTPPRIAKESINFSILNKHIPDNPSIPFSFTNETVWIKPEDQLPCYLTHTNPRVDEIVLKNLHLNSHVKETTRGPRYCPSIESKVLRFPNRLHQVWLEPEGMDSDLIYPQGLSMTLPAELQEKMITCIRGLEKAKVIQPGYGVQYDYLDPRQITPSLETHLVQRLFFAGQINGTTGYEEAAAQGVIAGINASLRVSRKPPFVVSRTEGYIGVLIDDLTTLGTSEPYRMFTSRVEFRLSLRPDNADSRLTLRGYKDAGCVSQQRYERACWMKSSLEEGISVLKSIEFLSSKWKKLIPEASISTSRSLPVRALDVLKYEEVDMDSLAKAVPEPLKKYTKCRELAERLKIEATYESVLFHQLQEIKGVQQDEALQLPKDLDYLTIRDVSLSHEVREKLHFSRPQTIGAASRIPGVTPAAIINLLRFVKTTQRRQSAMNESSKTDQYLCDADRLQEREL,692,NP_036255.2.csv,refseq-MTO1-NM_012123.3_clinical_seed_0_final,refseq-MTO1-NM_012123.3.a2m,Invitae,refseq-MTO1-NM_012123.3.npy,1,692,692
+NP_036276.1,MIPASAKAPHKQPHKQSISIGRGTRKRDEDSGTEVGEGTDEWAQSKATVRPPDQLELTDAELKEEFTRILTANNPHAPQNIVRYSFKEGTYKPIGFVNQLAVHYTQVGNLIPKDSDEGRRQHYRDELVAGSQESVKVISETGNLEEDEEPKELETEPGSQTDVPAAGAAEKVTEEELMTPKQPKERKLTNQFNFSERASQTYNNPVRDRECQTEPPPRTNFSATANQWEIYDAYVEELEKQEKTKEKEKAKTPVAKKSGKMAMRKLTSMESQTDDLIKLSQAAKIMERMVNQNTYDDIAQDFKYYDDAADEYRDQVGTLLPLWKFQNDKAKRLSVTALCWNPKYRDLFAVGYGSYDFMKQSRGMLLLYSLKNPSFPEYMFSSNSGVMCLDIHVDHPYLVAVGHYDGNVAIYNLKKPHSQPSFCSSAKSGKHSDPVWQVKWQKDDMDQNLNFFSVSSDGRIVSWTLVKRKLVHIDVIKLKVEGSTTEVPEGLQLHPVGCGTAFDFHKEIDYMFLVGTEEGKIYKCSKSYSSQFLDTYDAHNMSVDTVSWNPYHTKVFMSCSSDWTVKIWDHTIKTPMFIYDLNSAVGDVAWAPYSSTVFAAVTTDGKAHIFDLAINKYEAICNQPVAAKKNRLTHVQFNLIHPIIIVGDDRGHIISLKLSPNLRKMPKEKKGQEVQKGPAVEIAKLDKLLNLVREVKIKT,699,NP_036276.1.csv,refseq-DNAI1-NM_012144.3_clinical_seed_0_final,refseq-DNAI1-NM_012144.3.a2m,Invitae,refseq-DNAI1-NM_012144.3.npy,1,699,699
+NP_036288.2,MTTETGPDSEVKKAQEEAPQQPEAAAAVTTPVTPAGHGHPEANSNEKHPSQQDTRPAEQSLDMEEKDYSEADGLSERTTPSKAQKSPQKIAKKYKSAICRVTLLDASEYECEVEKHGRGQVLFDLVCEHLNLLEKDYFGLTFCDADSQKNWLDPSKEIKKQIRSSPWNFAFTVKFYPPDPAQLTEDITRYYLCLQLRADIITGRLPCSFVTHALLGSYAVQAELGDYDAEEHVGNYVSELRFAPNQTRELEERIMELHKTYRGMTPGEAEIHFLENAKKLSMYGVDLHHAKDSEGIDIMLGVCANGLLIYRDRLRINRFAWPKILKISYKRSNFYIKIRPGEYEQFESTIGFKLPNHRSAKRLWKVCIEHHTFFRLVSPEPPPKGFLVMGSKFRYSGRTQAQTRQASALIDRPAPFFERSSSKRYTMSRSLDGAEFSRPASVSENHDAGPDGDKRDEDGESGGQRSEAEEGEVRTPTKIKELKPEQETTPRHKQEFLDKPEDVLLKHQASINELKRTLKEPNSKLIHRDRDWERERRLPSSPASPSPKGTPEKANERAGLREGSEEKVKPPRPRAPESDTGDEDQDQERDTVFLKDNHLAIERKCSSITVSSTSSLEAEVDFTVIGDYHGSAFEDFSRSLPELDRDKSDSDTEGLLFSRDLNKGAPSQDDESGGIEDSPDRGACSTPDMPQFEPVKTETMTVSSLAIRKKIEPEAVLQTRVSAMDNTQQVDGSASVGREFIATTPSITTETISTTMENSLKSGKGAAAMIPGPQTVATEIRSLSPIIGKDVLTSTYGATAETLSTSTTTHVTKTVKGGFSETRIEKRIIITGDEDVDQDQALALAIKEAKLQHPDMLVTKAVVYRETDPSPEERDKKPQES,881,NP_036288.2.csv,refseq-EPB41L1-NM_012156.2_clinical_seed_0_final,refseq-EPB41L1-NM_012156.2.a2m,Invitae,refseq-EPB41L1-NM_012156.2.npy,1,881,881
+NP_036292.2,MSPVFPMLTVLTMFYYICLRRRARTATRGEMMNTHRAIESNSQTSPLNAEVVQYAKEVVDFSSHYGSENSMSYTMWNLAGVPNVFPSSGDFTQTAVFRTYGTWWDQCPSASLPFKRTPPNFQSQDYVELTFEQQVYPTAVHVLETYHPGAVIRILACSANPYSPNPPAEVRWEILWSERPTKVNASQARQFKPCIKQINFPTNLIRLEVNSSLLEYYTELDAVVLHGVKDKPVLSLKTSLIDMNDIEDDAYAEKDGCGMDSLNKKFSSAVLGEGPNNGYFDKLPYELIQLILNHLTLPDLCRLAQTCKLLSQHCCDPLQYIHLNLQPYWAKLDDTSLEFLQSRCTLVQWLNLSWTGNRGFISVAGFSRFLKVCGSELVRLELSCSHFLNETCLEVISEMCPNLQALNLSSCDKLPPQAFNHIAKLCSLKRLVLYRTKVEQTALLSILNFCSELQHLSLGSCVMIEDYDVIASMIGAKCKKLRTLDLWRCKNITENGIAELASGCPLLEELDLGWCPTLQSSTGCFTRLAHQLPNLQKLFLTANRSVCDTDIDELACNCTRLQQLDILGTRMVSPASLRKLLESCKDLSLLDVSFCSQIDNRAVLELNASFPKVFIKKSFTQ,621,NP_036292.2.csv,refseq-FBXL4-NM_012160.4_clinical_seed_0_final,refseq-FBXL4-NM_012160.4.a2m,Invitae,refseq-FBXL4-NM_012160.4.npy,1,621,621
+NP_036311.3,MRLRVRLLKRTWPLEVPETEPTLGHLRSHLRQSLLCTWGYSSNTRFTITLNYKDPLTGDEETLASYGIVSGDLICLILQDDIPAPNIPSSTDSEHSSLQNNEQPSLATSSNQTSMQDEQPSDSFQGQAAQSGVWNDDSMLGPSQNFEAESIQDNAHMAEGTGFYPSEPMLCSESVEGQVPHSLETLYQSADCSDANDALIVLIHLLMLESGYIPQGTEAKALSMPEKWKLSGVYKLQYMHPLCEGSSATLTCVPLGNLIVVNATLKINNEIRSVKRLQLLPESFICKEKLGENVANIYKDLQKLSRLFKDQLVYPLLAFTRQALNLPDVFGLVVLPLELKLRIFRLLDVRSVLSLSAVCRDLFTASNDPLLWRFLYLRDFRDNTVRVQDTDWKELYRKRHIQRKESPKGRFVMLLPSSTHTIPFYPNPLHPRPFPSSRLPPGIIGGEYDQRPTLPYVGDPISSLIPGPGETPSQFPPLRPRFDPVGPLPGPNPILPGRGGPNDRFPFRPSRGRPTDGRLSFM,522,NP_036311.3.csv,refseq-FBXO7-NM_012179.3_clinical_seed_0_final,refseq-FBXO7-NM_012179.3.a2m,Invitae,refseq-FBXO7-NM_012179.3.npy,1,522,522
+NP_036318.1,MAGRSDMDPPAAFSGFPALPAVAPSGPPPSPLAGAEPGREPEEAAAGRGEAAPTPAPGPGRRRRRPLQRGKPPYSYIALIAMALAHAPGRRLTLAAIYRFITERFAFYRDSPRKWQNSIRHNLTLNDCFVKVPREPGNPGKGNYWTLDPAAADMFDNGSFLRRRKRFKRAELPAHAAAAPGPPLPFPYAPYAPAPGPALLVPPPSAGPGPSPPARLFSVDSLVNLQPELAGLGAPEPPCCAAPDAAAAAFPPCAAAASPPLYSQVPDRLVLPATRPGPGPLPAEPLLALAGPAAALGPLSPGEAYLRQPGFASGLERYL,319,NP_036318.1.csv,refseq-FOXE3-NM_012186.2_clinical_seed_0_final,refseq-FOXE3-NM_012186.2.a2m,Invitae,refseq-FOXE3-NM_012186.2.npy,1,319,319
+NP_036320.2,MSSFDLPAPSPPRCSPQFPSIGQEPPEMNLYYENFFHPQGVPSPQRPSFEGGGEYGATPNPYLWFNGPTMTPPPYLPGPNASPFLPQAYGVQRPLLPSVSGLGGSDLGWLPIPSQEELMKLVRPPYSYSALIAMAIHGAPDKRLTLSQIYQYVADNFPFYNKSKAGWQNSIRHNLSLNDCFKKVPRDEDDPGKGNYWTLDPNCEKMFDNGNFRRKRKRKSDVSSSTASLALEKTESSLPVDSPKTTEPQDILDGASPGGTTSSPEKRPSPPPSGAPCLNSFLSSMTAYVSGGSPTSHPLVTPGLSPEPSDKTGQNSLTFNSFSPLTNLSNHSGGGDWANPMPTNMLSYGGSVLSQFSPHFYNSVNTSGVLYPREGTEV,378,NP_036320.2.csv,refseq-FOXI1-NM_012188.4_clinical_seed_0_final,refseq-FOXI1-NM_012188.4.a2m,Invitae,refseq-FOXI1-NM_012188.4.npy,1,378,378
+NP_036325.2,MAWRGAGPSVPGAPGGVGLSLGLLLQLLLLLGPARGFGDEEERRCDPIRISMCQNLGYNVTKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQLQFFLCSVYVPMCTEKINIPIGPCGGMCLSVKRRCEPVLKEFGFAWPESLNCSKFPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGEECHSVGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTAFTVLTFLIDSSRFSYPERPIIFLSMCYNIYSIAYIVRLTVGRERISCDFEEAAEPVLIQEGLKNTGCAIIFLLMYFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAAWAIPAVKTIVILIMRLVDADELTGLCYVGNQNLDALTGFVVAPLFTYLVIGTLFIAAGLVALFKIRSNLQKDGTKTDKLERLMVKIGVFSVLYTVPATCVIACYFYEISNWALFRYSADDSNMAVEMLKIFMSLLVGITSGMWIWSAKTLHTWQKCSNRLVNSGKVKREKRGNGWVKPGKGSETVV,537,NP_036325.2.csv,refseq-FZD4-NM_012193.4_clinical_seed_0_final,refseq-FZD4-NM_012193.4.a2m,Invitae,refseq-FZD4-NM_012193.4_theta_0.2.npy,1,537,537
+NP_036332.2,MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRRPPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPLVSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKDPPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFHTAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAANCTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV,335,NP_036332.2.csv,refseq-B3GAT3-NM_012200.3_clinical_seed_0_final,refseq-B3GAT3-NM_012200.3.a2m,Invitae,refseq-B3GAT3-NM_012200.3.npy,1,335,335
+NP_036335.1,MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGAHGLLCLLSDHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTAELAVSLLLTTCRRLPEAIEEVKNGGWTSWKPLWLCGYGLTQSTVGIIGLGRIGQAIARRLKPFGVQRFLYTGRQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNKDFFQKMKETAVFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNHPLLTLKNCVILPHIGSATHRTRNTMSLLAANNLLAGLRGEPMPSELKL,328,NP_036335.1.csv,refseq-GRHPR-NM_012203.1_clinical_seed_0_final,refseq-GRHPR-NM_012203.1.a2m,Invitae,refseq-GRHPR-NM_012203.1_theta_0.2.npy,1,328,328
+NP_036337.2,MERRLGVRAWVKENRGSFQPPVCNKLMHQEQLKVMFIGGPNTRKDYHIEEGEEVFYQLEGDMVLRVLEQGKHRDVVIRQGEIFLLPARVPHSPQRFANTVGLVVERRRLETELDGLRYYVGDTMDVLFEKWFYCKDLGTQLAPIIQEFFSSEQYRTGKPIPDQLLKEPPFPLSTRSIMEPMSLDAWLDSHHRELQAGTPLSLFGDTYETQVIAYGQGSSEGLRQNVDVWLWQLEGSSVVTMGGRRLSLAPDDSLLVLAGTSYAWERTQGSVALSVTQDPACKKPLG,286,NP_036337.2.csv,refseq-HAAO-NM_012205.2_clinical_seed_0_final,refseq-HAAO-NM_012205.2.a2m,Invitae,refseq-HAAO-NM_012205.2.npy,1,286,286
+NP_036340.1,MPLLGLLPRRAWASLLSQLLRPPCASCTGAVRCQSQVAEAVLTSQLKAHQEKPNFIIKTPKGTRDLSPQHMVVREKILDLVISCFKRHGAKGMDTPAFELKETLTEKYGEDSGLMYDLKDQGGELLSLRYDLTVPFARYLAMNKVKKMKRYHVGKVWRRESPTIVQGRYREFCQCDFDIAGQFDPMIPDAECLKIMCEILSGLQLGDFLIKVNDRRIVDGMFAVCGVPESKFRAICSSIDKLDKMAWKDVRHEMVVKKGLAPEVADRIGDYVQCHGGVSLVEQMFQDPRLSQNKQALEGLGDLKLLFEYLTLFGIADKISFDLSLARGLDYYTGVIYEAVLLQTPTQAGEEPLNVGSVAAGGRYDGLVGMFDPKGHKVPCVGLSIGVERIFYIVEQRMKTKGEKVRTTETQVFVATPQKNFLQERLKLIAELWDSGIKAEMLYKNNPKLLTQLHYCESTGIPLVVIIGEQELKEGVIKIRSVASREEVAIKRENFVAEIQKRLSES,506,NP_036340.1.csv,refseq-HARS2-NM_012208.3_clinical_seed_0_final,refseq-HARS2-NM_012208.3.a2m,Invitae,refseq-HARS2-NM_012208.3.npy,1,506,506
+NP_036342.2,MAAAAASHLNLDALREVLECPICMESFTEEQLRPKLLHCGHTICRQCLEKLLASSINGVRCPFCSKITRITSLTQLTDNLTVLKIIDTAGLSEAVGLLMCRSCGRRLPRQFCRSCGLVLCEPCREADHQPPGHCTLPVKEAAEERRRDFGEKLTRLRELMGELQRRKAALEGVSKDLQARYKAVLQEYGHEERRVQDELARSRKFFTGSLAEVEKSNSQVVEEQSYLLNIAEVQAVSRCDYFLAKIKQADVALLEETADEEEPELTASLPRELTLQDVELLKVGHVGPLQIGQAVKKPRTVNVEDSWAMEATASAASTSVTFREMDMSPEEVVASPRASPAKQRGPEAASNIQQCLFLKKMGAKGSTPGMFNLPVSLYVTSQGEVLVADRGNYRIQVFTRKGFLKEIRRSPSGIDSFVLSFLGADLPNLTPLSVAMNCQGLIGVTDSYDNSLKVYTLDGHCVACHRSQLSKPWGITALPSGQFVVTDVEGGKLWCFTVDRGSGVVKYSCLCSAVRPKFVTCDAEGTVYFTQGLGLNLENRQNEHHLEGGFSIGSVGPDGQLGRQISHFFSENEDFRCIAGMCVDARGDLIVADSSRKEILHFPKGGGYSVLIREGLTCPVGIALTPKGQLLVLDCWDHCIKIYSYHLRRYSTP,653,NP_036342.2.csv,refseq-TRIM32-NM_012210.3_clinical_seed_0_final,refseq-TRIM32-NM_012210.3.a2m,Invitae,refseq-TRIM32-NM_012210.3.npy,1,653,653
+NP_036345.2,MRGFGPGLTARRLLPLRLPPRPPGPRLASGQAAGALERAMDELLRRAVPPTPAYELREKTPAPAEGQCADFVSFYGGLAETAQRAELLGRLARGFGVDHGQVAEQSAGVLHLRQQQREAAVLLQAEDRLRYALVPRYRGLFHHISKLDGGVRFLVQLRADLLEAQALKLVEGPDVREMNGVLKGMLSEWFSSGFLNLERVTWHSPCEVLQKISEAEAVHPVKNWMDMKRRVGPYRRCYFFSHCSTPGEPLVVLHVALTGDISSNIQAIVKEHPPSETEEKNKITAAIFYSISLTQQGLQGVELGTFLIKRVVKELQREFPHLGVFSSLSPIPGFTKWLLGLLNSQTKEHGRNELFTDSECKEISEITGGPINETLKLLLSSSEWVQSEKLVRALQTPLMRLCAWYLYGEKHRGYALNPVANFHLQNGAVLWRINWMADVSLRGITGSCGLMANYRYFLEETGPNSTSYLGSKIIKASEQVLSLVAQFQKNSKL,493,NP_036345.2.csv,refseq-MLYCD-NM_012213.2_clinical_seed_0_final,refseq-MLYCD-NM_012213.2.a2m,Invitae,refseq-MLYCD-NM_012213.2.npy,1,493,493
+NP_036348.2,MGESPASVVLNASGGLFSLKMETLESELTCPICLELFEDPLLLPCAHSLCFSCAHRILVSSCSSGESIEPITAFQCPTCRYVISLNHRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSESRRERTYRPTTAMSSERIACQFCEQDPPRDAVKTCITCEVSYCDRCLRATHPNKKPFTSHRLVEPVPDTHLRGITCLDHENEKVNMYCVSDDQLICALCKLVGRHRDHQVASLNDRFEKLKQTLEMNLTNLVKRNSELENQMAKLIQICQQVEVNTAMHEAKLMEECDELVEIIQQRKQMIAVKIKETKVMKLRKLAQQVANCRQCLERSTVLINQAEHILKENDQARFLQSAKNIAERVAMATASSQVLIPDINFNDAFENFALDFSREKKLLEGLDYLTAPNPPSIREELCTASHDTITVHWISDDEFSISSYELQYTIFTGQANFISKSWCSWGLWPEIRKCKEAVSCSRLAGAPRGLYNSVDSWMIVPNIKQNHYTVHGLQSGTRYIFIVKAINQAGSRNSEPTRLKTNSQPFKLDPKMTHKKLKISNDGLQMEKDESSLKKSHTPERFSGTGCYGAAGNIFIDSGCHYWEVVMGSSTWYAIGIAYKSAPKNEWIGKNASSWVFSRCNSNFVVRHNNKEMLVDVPPHLKRLGVLLDYDNNMLSFYDPANSLHLHTFDVTFILPVCPTFTIWNKSLMILSGLPAPDFIDYPERQECNCRPQESPYVSGMKTCH,735,NP_036348.2.csv,refseq-MID2-NM_012216.3_clinical_seed_0_final,refseq-MID2-NM_012216.3.a2m,Invitae,refseq-MID2-NM_012216.3.npy,1,735,735
+NP_036356.1,MEKYVRLQKIGEGSFGKAILVKSTEDGRQYVIKEINISRMSSKEREESRREVAVLANMKHPNIVQYRESFEENGSLYIVMDYCEGGDLFKRINAQKGVLFQEDQILDWFVQICLALKHVHDRKILHRDIKSQNIFLTKDGTVQLGDFGIARVLNSTVELARTCIGTPYYLSPEICENKPYNNKSDIWALGCVLYELCTLKHAFEAGSMKNLVLKIISGSFPPVSLHYSYDLRSLVSQLFKRNPRDRPSVNSILEKGFIAKRIEKFLSPQLIAEEFCLKTFSKFGSQPIPAKRPASGQNSISVMPAQKITKPAAKYGIPLAYKKYGDKKLHEKKPLQKHKQAHQTPEKRVNTGEERRKISEEAARKRRLEFIEKEKKQKDQIISLMKAEQMKRQEKERLERINRAREQGWRNVLSAGGSGEVKAPFLGSGGTIAPSSFSSRGQYEHYHAIFDQMQQQRAEDNEAKWKREIYGRGLPERGILPGVRPGFPYGAAGHHHFPDADDIRKTLKRLKAVSKQANANRQKGQLAVERAKQVEEFLQRKREAMQNKARAEGHMVYLARLRQIRLQNFNERQQIKAKLRGEKKEANHSEGQEGSEEADMRRKKIESLKAHANARAAVLKEQLERKRKEAYEREKKVWEEHLVAKGVKSSDVSPPLGQHETGGSPSKQQMRSVISVTSALKEVGVDSSLTDTRETSEEMQKTNNAISSKREILRRLNENLKAQEDEKGKQNLSDTFEINVHEDAKEHEKEKSVSSDRKKWEAGGQLVIPLDELTLDTSFSTTERHTVGEVIKLGPNGSPRRAWGKSPTDSVLKILGEAELQLQTELLENTTIRSEISPEGEKYKPLITGEKKVQCISHEINPSAIVDSPVETKSPEFSEASPQMSLKLEGNLEEPDDLETEILQEPSGTNKDESLPCTITDVWISEEKETKETQSADRITIQENEVSEDGVSSTVDQLSDIHIEPGTNDSQHSKCDVDKSVQPEPFFHKVVHSEHLNLVPQVQSVQCSPEESFAFRSHSHLPPKNKNKNSLLIGLSTGLFDANNPKMLRTCSLPDLSKLFRTLMDVPTVGDVRQDNLEIDEIEDENIKEGPSDSEDIVFEETDTDLQELQASMEQLLREQPGEEYSEEEESVLKNSDVEPTANGTDVADEDDNPSSESALNEEWHSDNSDGEIASECECDSVFNHLEELRLHLEQEMGFEKFFEVYEKIKAIHEDEDENIEICSKIVQNILGNEHQHLYAKILHLVMADGAYQEDNDE,1258,NP_036356.1.csv,refseq-NEK1-NM_012224.2_clinical_seed_0_final,refseq-NEK1-NM_012224.2.a2m,Invitae,refseq-NEK1-NM_012224.2.npy,1,1258,1258
+NP_036361.1,MSTSWSDRLQNAADMPANMDKHALKKYRREAYHRVFVNRSLAMEKIKCFGFDMDYTLAVYKSPEYESLGFELTVERLVSIGYPQELLSFAYDSTFPTRGLVFDTLYGNLLKVDAYGNLLVCAHGFNFIRGPETREQYPNKFIQRDDTERFYILNTLFNLPETYLLACLVDFFTNCPRYTSCETGFKDGDLFMSYRSMFQDVRDAVDWVHYKGSLKEKTVENLEKYVVKDGKLPLLLSRMKEVGKVFLATNSDYKYTDKIMTYLFDFPHGPKPGSSHRPWQSYFDLILVDARKPLFFGEGTVLRQVDTKTGKLKIGTYTGPLQHGIVYSGGSSDTICDLLGAKGKDILYIGDHIFGDILKSKKRQGWRTFLVIPELAQELHVWTDKSSLFEELQSLDIFLAELYKHLDSSSNERPDISSIQRRIKKVTHDMDMCYGMMGSLFRSGSRQTLFASQVMRYADLYAASFINLLYYPFSYLFRAAHVLMPHESTVEHTHVDINEMESPLATRNRTSVDFKDTDYKRHQLTRSISEIKPPNLFPLAPQEITHCHDEDDDEEEEEEEE,561,NP_036361.1.csv,refseq-NT5C2-NM_012229.4_clinical_seed_0_final,refseq-NT5C2-NM_012229.4.a2m,Invitae,refseq-NT5C2-NM_012229.4.npy,1,561,561
+NP_036375.1,MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGFFGSIFGLMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQALGGLVIAAVIKYADNILKGFATSLSIILSTLISYFWLQDFVPTSVFFLGAILVITATFLYGYDPKPAGNPTKA,325,NP_036375.1.csv,refseq-SLC35A3-NM_012243.2_clinical_seed_0_final,refseq-SLC35A3-NM_012243.2.a2m,Invitae,refseq-SLC35A3-NM_012243.2.npy,1,325,325
+NP_036382.2,MAAAGWRDGSGQEKYRLVVVGGGGVGKSALTIQFIQSYFVTDYDPTIEDSYTKQCVIDDRAARLDILDTAGQEEFGAMREQYMRTGEGFLLVFSVTDRGSFEEIYKFQRQILRVKDRDEFPMILIGNKADLDHQRQVTQEEGQQLARQLKVTYMEASAKIRMNVDQAFHELVRVIRKFQEQECPPSPEPTRKEKDKKGCHCVIF,204,NP_036382.2.csv,refseq-RRAS2-NM_012250.5_clinical_seed_0_final,refseq-RRAS2-NM_012250.5.a2m,Invitae,refseq-RRAS2-NM_012250.5.npy,1,204,204
+NP_036407.1,MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVILGVQGGSQCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQPVRLTQLPENGGWNAPITDFYFQQCD,155,NP_036407.1.csv,refseq-IL36RN-NM_012275.2_clinical_seed_0_final,refseq-IL36RN-NM_012275.2.a2m,Invitae,refseq-IL36RN-NM_012275.2.npy,1,155,155
+NP_036412.1,MGRTSKDKRDVYYRLAKENGWRARSAFKLLQLDKEFQLFQGVTRAVDLCAAPGSWSQVLSQKIGGQGSGHVVAVDLQAMAPLPGVVQIQGDITQLSTAKEIIQHFKGCPADLVVCDGAPDVTGLHDVDEYMQAQLLLAALNIATHVLKPGGCFVAKIFRGRDVTLLYSQLQVFFSSVLCAKPRSSRNSSIEAFAVCQGYDPPEGFIPDLSKPLLDHSYDPDFNQLDGPTRIIVPFVTCGDLSSYDSDRSYPLDLEGGSEYKYTPPTQPPISPPYQEACTLKRKGQLAKEIRPQDCPISRVDTFPQPLAAPQCHTLLAPEMEDNEMSCSP,329,NP_036412.1.csv,refseq-FTSJ1-NM_012280.3_clinical_seed_0_final,refseq-FTSJ1-NM_012280.3.a2m,Invitae,refseq-FTSJ1-NM_012280.3.npy,1,329,329
+NP_036413.1,MAAGVAAWLPFARAAAIGWMPVASGPMPAPPRQERKRTQDALIVLNVSGTRFQTWQDTLERYPDTLLGSSERDFFYHPETQQYFFDRDPDIFRHILNFYRTGKLHYPRHECISAYDEELAFFGLIPEIIGDCCYEEYKDRRRENAERLQDDADTDTAGESALPTMTARQRVWRAFENPHTSTMALVFYYVTGFFIAVSVIANVVETVPCGSSPGHIKELPCGERYAVAFFCLDTACVMIFTVEYLLRLAAAPSRYRFVRSVMSIIDVVAILPYYIGLVMTDNEDVSGAFVTLRVFRVFRIFKFSRHSQGLRILGYTLKSCASELGFLLFSLTMAIIIFATVMFYAEKGSSASKFTSIPAAFWYTIVTMTTLGYGDMVPKTIAGKIFGSICSLSGVLVIALPVPVIVSNFSRIYHQNQRADKRRAQKKARLARIRAAKSGSANAYMQSKRNGLLSNQLQSSEDEQAFVSKSGSSFETQHHHLLHCLEKTTNHEFVDEQVFEESCMEVATVNRPSSHSPSLSSQQGVTSTCCSRRHKKTFRIPNANVSGSHQGSIQELSTIQIRCVERTPLSNSRSSLNAKMEECVKLNCEQPYVTTAIISIPTPPVTTPEGDDRPESPEYSGGNIVRVSAL,630,NP_036413.1.csv,refseq-KCND2-NM_012281.2_clinical_seed_0_final,refseq-KCND2-NM_012281.2.a2m,Invitae,refseq-KCND2-NM_012281.2.npy,1,630,630
+NP_036425.1,MAKRSRGPGRRCLLALVLFCAWGTLAVVAQKPGAGCPSRCLCFRTTVRCMHLLLEAVPAVAPQTSILDLRFNRIREIQPGAFRRLRNLNTLLLNNNQIKRIPSGAFEDLENLKYLYLYKNEIQSIDRQAFKGLASLEQLYLHFNQIETLDPDSFQHLPKLERLFLHNNRITHLVPGTFNHLESMKRLRLDSNTLHCDCEILWLADLLKTYAESGNAQAAAICEYPRRIQGRSVATITPEELNCERPRITSEPQDADVTSGNTVYFTCRAEGNPKPEIIWLRNNNELSMKTDSRLNLLDDGTLMIQNTQETDQGIYQCMAKNVAGEVKTQEVTLRYFGSPARPTFVIQPQNTEVLVGESVTLECSATGHPPPRISWTRGDRTPLPVDPRVNITPSGGLYIQNVVQGDSGEYACSATNNIDSVHATAFIIVQALPQFTVTPQDRVVIEGQTVDFQCEAKGNPPPVIAWTKGGSQLSVDRRHLVLSSGTLRISGVALHDQGQYECQAVNIIGSQKVVAHLTVQPRVTPVFASIPSDTTVEVGANVQLPCSSQGEPEPAITWNKDGVQVTESGKFHISPEGFLTINDVGPADAGRYECVARNTIGSASVSMVLSVNVPDVSRNGDPFVATSIVEAIATVDRAINSTRTHLFDSRPRSPNDLLALFRYPRDPYTVEQARAGEIFERTLQLIQEHVQHGLMVDLNGTSYHYNDLVSPQYLNLIANLSGCTAHRRVNNCSDMCFHQKYRTHDGTCNNLQHPMWGASLTAFERLLKSVYENGFNTPRGINPHRLYNGHALPMPRLVSTTLIGTETVTPDEQFTHMLMQWGQFLDHDLDSTVVALSQARFSDGQHCSNVCSNDPPCFSVMIPPNDSRARSGARCMFFVRSSPVCGSGMTSLLMNSVYPREQINQLTSYIDASNVYGSTEHEARSIRDLASHRGLLRQGIVQRSGKPLLPFATGPPTECMRDENESPIPCFLAGDHRANEQLGLTSMHTLWFREHNRIATELLKLNPHWDGDTIYYETRKIVGAEIQHITYQHWLPKILGEVGMRTLGEYHGYDPGINAGIFNAFATAAFRFGHTLVNPLLYRLDENFQPIAQDHLPLHKAFFSPFRIVNEGGIDPLLRGLFGVAGKMRVPSQLLNTELTERLFSMAHTVALDLAAINIQRGRDHGIPPYHDYRVYCNLSAAHTFEDLKNEIKNPEIREKLKRLYGSTLNIDLFPALVVEDLVPGSRLGPTLMCLLSTQFKRLRDGDRLWYENPGVFSPAQLTQIKQTSLARILCDNADNITRVQSDVFRVAEFPHGYGSCDEIPRVDLRVWQDCCEDCRTRGQFNAFSYHFRGRRSLEFSYQEDKPTKKTRPRKIPSVGRQGEHLSNSTSAFSTRSDASGTNDFREFVLEMQKTITDLRTQIKKLESRLSTTECVDAGGESHANNTKWKKDACTICECKDGQVTCFVEACPPATCAVPVNIPGACCPVCLQKRAEEKP,1479,NP_036425.1.csv,refseq-PXDN-NM_012293.2_clinical_seed_0_final,refseq-PXDN-NM_012293.2.a2m,Invitae,refseq-PXDN-NM_012293.2.npy,1,1479,1479
+NP_036442.3,MKEEVKGIPVRVALRCRPLVPKEISEGCQMCLSFVPGEPQVVVGTDKSFTYDFVFDPSTEQEEVFNTAVAPLIKGVFKGYNATVLAYGQTGSGKTYSMGGAYTAEQENEPTVGVIPRVIQLLFKEIDKKSDFEFTLKVSYLEIYNEEILDLLCPSREKAQINIREDPKEGIKIVGLTEKTVLVALDTVSCLEQGNNSRTVASTAMNSQSSRSHAIFTISLEQRKKSDKNSSFRSKLHLVDLAGSERQKKTKAEGDRLKEGININRGLLCLGNVISALGDDKKGGFVPYRDSKLTRLLQDSLGGNSHTLMIACVSPADSNLEETLNTLRYADRARKIKNKPIVNIDPQTAELNHLKQQVQQLQVLLLQAHGGTLPGSITVEPSENLQSLMEKNQSLVEENEKLSRGLSEAAGQTAQMLERIILTEQANEKMNAKLEELRQHAACKLDLQKLVETLEDQELKENVEIICNLQQLITQLSDETVACMAAAIDTAVEQEAQVETSPETSRSSDAFTTQHALRQAQMSKELVELNKALALKEALARKMTQNDSQLQPIQYQYQDNIKELELEVINLQKEKEELVLELQTAKKDANQAKLSERRRKRLQELEGQIADLKKKLNEQSKLLKLKESTERTVSKLNQEIRMMKNQRVQLMRQMKEDAEKFRQWKQKKDKEVIQLKERDRKRQYELLKLERNFQKQSNVLRRKTEEAAAANKRLKDALQKQREVADKRKETQSRGMEGTAARVKNWLGNEIEVMVSTEEAKRHLNDLLEDRKILAQDVAQLKEKKESGENPPPKLRRRTFSLTEVRGQVSESEDSITKQIESLETEMEFRSAQIADLQQKLLDAESEDRPKQRWENIATILEAKCALKYLIGELVSSKIQVSKLESSLKQSKTSCADMQKMLFEERNHFAEIETELQAELVRMEQQHQEKVLYLLSQLQQSQMAEKQLEESVSEKEQQLLSTLKCQDEELEKMREVCEQNQQLLRENEIIKQKLTLLQVASRQKHLPKDTLLSPDSSFEYVPPKPKPSRVKEKFLEQSMDIEDLKYCSEHSVNEHEDGDGDDDEGDDEEWKPTKLVKVSRKNIQGCSCKGWCGNKQCGCRKQKSDCGVDCCCDPTKCRNRQQGKDSLGTVERTQDSEGSFKLEDPTEVTPGLSFFNPVCATPNSKILKEMCDVEQVLSKKTPPAPSPFDLPELKHVATEYQENKAPGKKKKRALASNTSFFSGCSPIEEEAH,1232,NP_036442.3.csv,refseq-KIF4A-NM_012310.5_clinical_seed_0_final,refseq-KIF4A-NM_012310.5.a2m,Invitae,refseq-KIF4A-NM_012310.5.npy,1,1232,1232
+NP_036450.1,MASILLRSCRGRAPARLPPPPRYTVPRGSPGDPAHLSCASTLGLRNCLNVPFGCCTPIHPVYTSSRGDHLGCWALRPECLRIVSRAPWTSTSVGFVAVGPQCLPVRGWHSSRPVRDDSVVEKSLKSLKDKNKKLEEGGPVYSPPAEVVVKKSLGQRVLDELKHYYHGFRLLWIDTKIAARMLWRILNGHSLTRRERRQFLRICADLFRLVPFLVFVVVPFMEFLLPVAVKLFPNMLPSTFETQSLKEERLKKELRVKLELAKFLQDTIEEMALKNKAAKGSATKDFSVFFQKIRETGERPSNEEIMRFSKLFEDELTLDNLTRPQLVALCKLLELQSIGTNNFLRFQLTMRLRSIKADDKLIAEEGVDSLNVKELQAACRARGMRALGVTEDRLRGQLKQWLDLHLHQEIPTSLLILSRAMYLPDTLSPADQLKSTLQTLPEIVAKEAQVKVAEVEGEQVDNKAKLEATLQEEAAIQQEHREKELQKRSEVAKDFEPERVVAAPQRPGTEPQPEMPDTVLQSETLKDTAPVLEGLKEEEITKEEIDILSDACSKLQEQKKSLTKEKEELELLKEDVQDYSEDLQEIKKELSKTGEEKYVEESKASKRLTKRVQQMIGQIDGLISQLEMDQQAGKLAPANGMPTGENVISVAELINAMKQVKHIPESKLTSLAAALDENKDGKVNIDDLVKVIELVDKEDVHISTSQVAEIVATLEKEEKVEEKEKAKEKAEKEVAEVKS,739,NP_036450.1.csv,refseq-LETM1-NM_012318.2_clinical_seed_0_final,refseq-LETM1-NM_012318.2.a2m,Invitae,refseq-LETM1-NM_012318.2.npy,1,739,739
+NP_036459.1,MLLFFTLGLLIHFVFFASIFDIYFTSPLVHGMTPQFTPLPPPARRLVLFVADGLRADALYELDENGNSRAPFIRNIIMHEGSWGISHTRVPTESRPGHVALIAGFYEDVSAVAKGWKENPVEFDSLFNESKYTWSWGSPDILPMFAKGASGDHVYTYSYDAKREDFGAQDATKLDTWVFDNVKDFFHHARNNQSLFSKINEEKIVFFLHLLGIDTNGHAHRPSSRDYKHNIKKVDDGVKEIVSMFNHFYGNDGKTTFIFTSDHGMTDWGSHGAGHPSETLTPLVTWGAGIKYPQRVSAQQFDDAFLKEWRLENWKRLDVNQADIAPLMTSLIGVPFPLNSVGILPVDYLNNTDLFKAESMFTNAVQILEQFKVKMTQKKEVTLPFLFTPFKLLSDSKQFNILRKARSYIKHRKFDEVVSLCKELIHLALKGLSYYHTYDRFFLGVNVVIGFVGWISYASLLIIKSHSNLIKGVSKEVKKPSHLLPCSFVAIGILVAFFLLIQACPWTYYVYGLLPLPIWYAVLREFQVIQDLVVSVLTYPLSHFVGYLLAFTLGIEVLVLSFFYRYMLTAGLTAFAAWPFLTRLWTRAKMTSLSWTFFSLLLAVFPLMPVVGRKPDISLVMGAGLLVLLLSLCVVTSLMKRKDSFIKEELLVHLLQVLSTVLSMYVVYSTQSSLLRKQGLPLMNQIISWATLASSLVVPLLSSPVLFQRLFSILLSLMSTYLLLSTGYEALFPLVLSCLMFVWINIEQETLQQSGVCCKQKLTSIQFSYNTDITQFRQLYLDDIRRAFFLVFFLVTAFFGTGNIASINSFDLASVYCFLTVFSPFMMGALMMWKILIPFVLVMCAFEAVQLTTQLSSKSLFLIVLVISDIMALHFFFLVKDYGSWLDIGTSISHYVIVMSMTIFLVFLNGLAQLLTTKKLRLCGKPKSHFM,931,NP_036459.1.csv,refseq-PIGN-NM_012327.6_clinical_seed_0_final,refseq-PIGN-NM_012327.6.a2m,Invitae,refseq-PIGN-NM_012327.6.npy,1,931,931
+NP_036462.2,MVKLANPLYTEWILEAIQKIKKQKQRPSEERICHAVSTSHGLDKKTVSEQLELSVQDGSVLKVTNKGLASYKDPDNPGRFSSVKPGTFPKSAKGSRGSCNDLRNVDWNKLLRRAIEGLEEPNGSSLKNIEKYLRSQSDLTSTTNNPAFQQRLRLGAKRAVNNGRLLKDGPQYRVNYGSLDGKGAPQYPSAFPSSLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWICQVCRPKKKGRKLLHEKAAQIKRRYAKPIGRPKNKLKQRLLSVTSDEGSMNAFTGRGSPGRGQKTKVCTTPSSGHAASGKDSSSRLAVTDPTRPGATTKITTTSTYISASTLKVNKKTKGLIDGLTKFFTPSPDGRRSRGEIIDFSKHYRPRKKVSQKQSCTSHVLATGTTQKLKPPPSSLPPPTPISGQSPSSQKSSTATSSPSPQSSSSQCSVPSLSSLTTNSQLKALFDGLSHIYTTQGQSRKKGHPSYAPPKRMRRKTELSSTAKSKAHFFGKRDIRSRFISHSSSSSWGMARGSIFKAIAHFKRTTFLKKHRMLGRLKYKVTPQMGTPSPGKGSLTDGRIKPDQDDDTEIKINIKQESADVNVIGNKDVVTEEDLDVFKQAQELSWEKIECESGVEDCGRYPSVIEFGKYEIQTWYSSPYPQEYARLPKLYLCEFCLKYMKSKNILLRHSKKCGWFHPPANEIYRRKDLSVFEVDGNMSKIYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTKNDEKGCHLVGYFSKEKLCQQKYNVSCIMIMPQHQRQGFGRFLIDFSYLLSRREGQAGSPEKPLSDLGRLSYLAYWKSVILEYLYHHHERHISIKAISRATGMCPHDIATTLQHLHMIDKRDGRFVIIRREKLILSHMEKLKTCSRANELDPDSLRWTPILISNAAVSEEEREAEKEAERLMEQASCWEKEEQEILSTRANSRQSPAKVQSKNKYLHSPESRPVTGERGQLLELSKESSEEEEEEEDEEEEEEEEEEEEDEEEEEEEEEEEEEENIQSSPPRLTKPQSVAIKRKRPFVLKKKRGRKRRRINSSVTTETISETTEVLNEPFDNSDEERPMPQLEPTCEIEVEEDGRKPVLRKAFQHQPGKKRQTEEEEGKDNHCFKNADPCRNNMNDDSSNLKEGSKDNPEPLKCKQVWPKGTKRGLSKWRQNKERKTGFKLNLYTPPETPMEPDEQVTVEEQKETSEGKTSPSPIRIEEEVKETGEALLPQEENRREETCAPVSPNTSPGEKPEDDLIKPEEEEEEEEEEEEEEEEEEGEEEEGGGNVEKDPDGAKSQEKEEPEISTEKEDSARLDDHEEEEEEDEEPSHNEDHDADDEDDSHMESAEVEKEELPRESFKEVLENQETFLDLNVQPGHSNPEVLMDCGVDLTASCNSEPKELAGDPEAVPESDEEPPPGEQAQKQDQKNSKEVDTEFKEGNPATMEIDSETVQAVQSLTQESSEQDDTFQDCAETQEACRSLQNYTRADQSPQIATTLDDCQQSDHSSPVSSVHSHPGQSVRSVNSPSVPALENSYAQISPDQSAISVPSLQNMETSPMMDVPSVSDHSQQVVDSGFSDLGSIESTTENYENPSSYDSTMGGSICGNGSSQNSCSYSNLTSSSLTQSSCAVTQQMSNISGSCSMLQQTSISSPPTCSVKSPQGCVVERPPSSSQQLAQCSMAANFTPPMQLAEIPETSNANIGLYERMGQSDFGAGHYPQPSATFSLAKLQQLTNTLIDHSLPYSHSAAVTSYANSASLSTPLSNTGLVQLSQSPHSVPGGPQAQATMTPPPNLTPPPMNLPPPLLQRNMAASNIGISHSQRLQTQIASKGHISMRTKSASLSPAAATHQSQIYGRSQTVAMQGPARTLTMQRGMNMSVNLMPAPAYNVNSVNMNMNTLNAMNGYSMSQPMMNSGYHSNHGYMNQTPQYPMQMQMGMMGTQPYAQQPMQTPPHGNMMYTAPGHHGYMNTGMSKQSLNGSYMRR,2073,NP_036462.2.csv,refseq-KAT6B-NM_012330.3_clinical_seed_0_final,refseq-KAT6B-NM_012330.3.a2m,Invitae,refseq-KAT6B-NM_012330.3.npy,1,2073,2073
+NP_036470.1,MAREDSVKCLRCLLYALNLLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHPVMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGVWTYEQELMVPVQWSDMVTLKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQEDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQMMSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL,305,NP_036470.1.csv,refseq-TSPAN12-NM_012338.3_clinical_seed_0_final,refseq-TSPAN12-NM_012338.3.a2m,Invitae,refseq-TSPAN12-NM_012338.3.npy,1,305,305
+NP_036475.3,MANLLKTVVTGCSCPLLSNLGSCKGLRVKKDFLRTFYTHQELWCKAPVKPGIPYKQLTVGVPKEIFQNEKRVALSPAGVQNLVKQGFNVVVESGAGEASKFSDDHYRVAGAQIQGAKEVLASDLVVKVRAPMVNPTLGVHEADLLKTSGTLISFIYPAQNPELLNKLSQRKTTVLAMDQVPRVTIAQGYDALSSMANIAGYKAVVLAANHFGRFFTGQITAAGKVPPAKILIVGGGVAGLASAGAAKSMGAIVRGFDTRAAALEQFKSLGAEPLEVDLKESGEGQGGYAKEMSKEFIEAEMKLFAQQCKEVDILISTALIPGKKAPVLFNKEMIESMKEGSVVVDLAAEAGGNFETTKPGELYIHKGITHIGYTDLPSRMATQASTLYSNNITKLLKAISPDKDNFYFDVKDDFDFGTMGHVIRGTVVMKDGKVIFPAPTPKNIPQGAPVKQKTVAELEAEKAATITPFRKTMSTASAYTAGLTGILGLGIAAPNLAFSQMVTTFGLAGIVGYHTVWGVTPALHSPLMSVTNAISGLTAVGGLALMGGHLYPSTTSQGLAALAAFISSVNIAGGFLVTQRMLDMFKRPTDPPEYNYLYLLPAGTFVGGYLAALYSGYNIEQIMYLGSGLCCVGALAGLSTQGTARLGNALGMIGVAGGLAATLGVLKPGPELLAQMSGAMALGGTIGLTIAKRIQISDLPQLVAAFHSLVGLAAVLTCIAEYIIEYPHFATDAAANLTKIVAYLGTYIGGVTFSGSLIAYGKLQGLLKSAPLLLPGRHLLNAGLLAASVGGIIPFMVDPSFTTGITCLGSVSALSAVMGVTLTAAIGGADMPVVITVLNSYSGWALCAEGFLLNNNLLTIVGALIGSSGAILSYIMCVAMNRSLANVILGGYGTTSTAGGKPMEISGTHTEINLDNAIDMIREANSIIITPGYGLCAAKAQYPIADLVKMLTEQGKKVRFGIHPVAGRMPGQLNVLLAEAGVPYDIVLEMDEINHDFPDTDLVLVIGANDTVNSAAQEDPNSIIAGMPVLEVWKSKQVIVMKRSLGVGYAAVDNPIFYKPNTAMLLGDAKKTCDALQAKVRESYQK,1086,NP_036475.3.csv,refseq-NNT-NM_012343.3_clinical_seed_0_final,refseq-NNT-NM_012343.3.a2m,Invitae,refseq-NNT-NM_012343.3.npy,1,1086,1086
+NP_036546.2,MACSIVQFCYFQDLQAARDFLFPHLREEILSGALRRDPSKSTDWEDDGWGAWEENEPQEPEEEGNTCKTQKTSWLQDCVLSLSPTNDLMVIAREQKAVFLVPKWKYSDKGKEEMQFAVGWSGSLNVEEGECVTSALCIPLASQKRSSTGRPDWTCIVVGFTSGYVRFYTENGVLLLAQLLNEDPVLQLKCRTYEIPRHPGVTEQNEELSILYPAAIVTIDGFSLFQSLRACRNQVAKAAASGNENIQPPPLAYKKWGLQDIDTIIDHASVGIMTLSPFDQMKTASNIGGFNAAIKNSPPAMSQYITVGSNPFTGFFYALEGSTQPLLSHVALAVASKLTSALFNAASGWLGWKSKHEEEAVQKQKPKVEPATPLAVRFGLPDSRRHGESICLSPCNTLAAVTDDFGRVILLDVARGIAIRMWKGYRDAQIGWIQTVEDLHERVPEKADFSPFGNSQGPSRVAQFLVIYAPRRGILEVWSTQQGPRVGAFNVGKHCRLLYPGYKIMGLNNVTSQSWQPQTYQICLVDPVSGSVKTVNVPFHLALSDKKSERAKDMHLVKKLAALLKTKSPNLDLVETEIKELILDIKYPATKKQALESILASERLPFSCLRNITQTLMDTLKSQELESVDEGLLQFCANKLKLLQLYESVSQLNSLDFHLDTPFSDNDLALLLRLDEKELLKLQALLEKYKQENTRTNVRFSDDKDGVLPVKTFLEYLEYEKDVLNIKKISEEEYVALGSFFFWKCLHGESSTEDMCHTLESAGLSPQLLLSLLLSVWLSKEKDILDKPQSICCLHTMLSLLSKMKVAIDETWDSQSVSPWWQQMRTACIQSENNGAALLSAHVGHSVAAQISNNMTEKKFSQTVLGADSEALTDSWEALSLDTEYWKLLLKQLEDCLILQTLLHSKGNTQTSKVSSLQAEPLPRLSVKKLLEGGKGGIADSVAKWIFKQDFSPEVLKLANEERDAENPDEPKEGVNRSFLEVSEMEMDLGAIPDLLHLAYEQFPCSLELDVLHAHCCWEYVVQWNKDPEEARFFVRSIEHLKQIFNAHVQNGIALMMWNTFLVKRFSAATYLMDKVGKSPKDRLCRRDVGMSDTAMTSFLGSCLDLLQILMEADVSRDEIQVPVLDTEDAWLSVEGPISIVELALEQKHIHYPLVEHHSILCSILYAVMRFSLKTVKPLSLFDSKGKNAFFKDLTSIQLLPSGEMDPNFISVRQQFLLKVVSAAVQAQHSATKVKDPTEEATPTPFGKDQDWPALAVDLAHHLQVSEDVVRRHYVGELYNYGVDHLGEEAILQVHDKEVLASQLLVLTGQRLAHALLHTQTKEGMELLARLPPTLCTWLKAMDPQDLQNTEVPIATTAKLVNKVIELLPEKHGQYGLALHLIEAVEAISLPSL,1393,NP_036546.2.csv,refseq-RAB3GAP2-NM_012414.3_clinical_seed_0_final,refseq-RAB3GAP2-NM_012414.3.a2m,Invitae,refseq-RAB3GAP2-NM_012414.3.npy,1,1393,1393
+NP_036566.1,MRSPVRDLARNDGEESTDRTPLLPGAPRAEAAPVCCSARYNLAILAFFGFFIVYALRVNLSVALVDMVDSNTTLEDNRTSKACPEHSAPIKVHHNQTGKKYQWDAETQGWILGSFFYGYIITQIPGGYVASKIGGKMLLGFGILGTAVLTLFTPIAADLGVGPLIVLRALEGLGEGVTFPAMHAMWSSWAPPLERSKLLSISYAGAQLGTVISLPLSGIICYYMNWTYVFYFFGTIGIFWFLLWIWLVSDTPQKHKRISHYEKEYILSSLRNQLSSQKSVPWVPILKSLPLWAIVVAHFSYNWTFYTLLTLLPTYMKEILRFNVQENGFLSSLPYLGSWLCMILSGQAADNLRAKWNFSTLCVRRIFSLIGMIGPAVFLVAAGFIGCDYSLAVAFLTISTTLGGFCSSGFSINHLDIAPSYAGILLGITNTFATIPGMVGPVIAKSLTPDNTVGEWQTVFYIAAAINVFGAIFFTLFAKGEVQNWALNDHHGHRH,495,NP_036566.1.csv,refseq-SLC17A5-NM_012434.4_clinical_seed_0_final,refseq-SLC17A5-NM_012434.4.a2m,Invitae,refseq-SLC17A5-NM_012434.4.npy,1,495,495
+NP_036579.2,MSSPLQRAVGDTKRALSASSSSSASLPFDDRDSNHTSEGNGDSLLADEDTDFEDSLNRNVKKRAAKRPPKTTPVAKHPKKGSRVVHRHSRKQSEPPANDLFNAVKAAKSDMQSLVDEWLDSYKQDQDAGFLELVNFFIQSCGCKGIVTPEMFKKMSNSEIIQHLTEQFNEDSGDYPLIAPGPSWKKFQGSFCEFVRTLVCQCQYSLLYDGFPMDDLISLLTGLSDSQVRAFRHTSTLAAMKLMTSLVKVALQLSVHQDNNQRQYEAERNKGPGQRAPERLESLLEKRKELQEHQEEIEGMMNALFRGVFVHRYRDVLPEIRAICIEEIGCWMQSYSTSFLTDSYLKYIGWTLHDKHREVRLKCVKALKGLYGNRDLTTRLELFTSRFKDRMVSMVMDREYDVAVEAVRLLILILKNMEGVLTDADCESVYPVVYASHRGLASAAGEFLYWKLFYPECEIRMMGGREQRQSPGAQRTFFQLLLSFFVESELHDHAAYLVDSLWDCAGARLKDWEGLTSLLLEKDQNLGDVQESTLIEILVSSARQASEGHPPVGRVTGRKGLTSKERKTQADDRVKLTEHLIPLLPQLLAKFSADAEKVTPLLQLLSCFDLHIYCTGRLEKHLELFLQQLQEVVVKHAEPAVLEAGAHALYLLCNPEFTFFSRADFARSQLVDLLTDRFQQELEELLQSSFLDEDEVYNLAATLKRLSAFYNTHDLTRWELYEPCCQLLQKAVDTGEVPHQVILPALTLVYFSILWTLTHISKSDASQKQLSSLRDRMVAFCELCQSCLSDVDTEIQEQAFVLLSDLLLIFSPQMIVGGRDFLRPLVFFPEATLQSELASFLMDHVFIQPGDLGSGDSQEDHLQIERLHQRRRLLAGFCKLLLYGVLEMDAASDVFKHYNKFYNDYGDIIKETLTRARQIDRSHCSRILLLSLKQLYTELLQEHGPQGLNELPAFIEMRDLARRFALSFGPQQLQNRDLVVMLHKEGIQFSLSELPPAGSSNQPPNLAFLELLSEFSPRLFHQDKQLLLSYLEKCLQHVSQAPGHPWGPVTTYCHSLSPVENTAETSPQVLPSSKRRRVEGPAKPNREDVSSSQEESLQLNSIPPTPTLTSTAVKSRQPLWGLKEMEEEDGSELDFAQGQPVAGTERSRFLGPQYFQTPHNPSGPGLGNQLMRLSLMEEDEEEELEIQDESNEERQDTDMQASSYSSTSERGLDLLDSTELDIEDF,1225,NP_036579.2.csv,refseq-STAG3-NM_012447.3_clinical_seed_0_final,refseq-STAG3-NM_012447.3.a2m,Invitae,refseq-STAG3-NM_012447.3.npy,1,1225,1225
+NP_036580.2,MAVWIQAQQLQGEALHQMQALYGQHFPIEVRHYLSQWIESQAWDSVDLDNPQENIKATQLLEGLVQELQKKAEHQVGEDGFLLKIKLGHYATQLQNTYDRCPMELVRCIRHILYNEQRLVREANNGSSPAGSLADAMSQKHLQINQTFEELRLVTQDTENELKKLQQTQEYFIIQYQESLRIQAQFGPLAQLSPQERLSRETALQQKQVSLEAWLQREAQTLQQYRVELAEKHQKTLQLLRKQQTIILDDELIQWKRRQQLAGNGGPPEGSLDVLQSWCEKLAEIIWQNRQQIRRAEHLCQQLPIPGPVEEMLAEVNATITDIISALVTSTFIIEKQPPQVLKTQTKFAATVRLLVGGKLNVHMNPPQVKATIISEQQAKSLLKNENTRNDYSGEILNNCCVMEYHQATGTLSAHFRNMSLKRIKRSDRRGAESVTEEKFTILFESQFSVGGNELVFQVKTLSLPVVVIVHGSQDNNATATVLWDNAFAEPGRVPFAVPDKVLWPQLCEALNMKFKAEVQSNRGLTKENLVFLAQKLFNNSSSHLEDYSGLSVSWSQFNRENLPGRNYTFWQWFDGVMEVLKKHLKPHWNDGAILGFVNKQQAHDLLINKPDGTFLLRFSDSEIGGITIAWKFDSQERMFWNLMPFTTRDFSIRSLADRLGDLNYLIYVFPDRPKDEVYSKYYTPVPCESATAKAVDGYVKPQIKQVVPEFVNASADAGGGSATYMDQAPSPAVCPQAHYNMYPQNPDSVLDTDGDFDLEDTMDVARRVEELLGRPMDSQWIPHAQS,787,NP_036580.2.csv,refseq-STAT5B-NM_012448.3_clinical_seed_0_final,refseq-STAT5B-NM_012448.3.a2m,Invitae,refseq-STAT5B-NM_012448.3.npy,1,787,787
+NP_036584.1,MSGLGRSRRGGRSRVDQEERFPQGLWTGVAMRSCPEEQYWDPLLGTCMSCKTICNHQSQRTCAAFCRSLSCRKEQGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSPVNLPPELRRQRSGEVENNSDNSGRYQGLEHRGSEASPALPGLKLSADQVALVYSTLGLCLCAVLCCFLVAVACFLKKRGDPCSCQPRSRPRQSPAKSSQDHAMEAGSPVSTSPEPVETCSFCFPECRAPTQESAVTPGTPDPTCAGRWGCHTRTTVLQPCPHIPDSGLGIVCVPAQEGGPGA,293,NP_036584.1.csv,refseq-TNFRSF13B-NM_012452.2_clinical_seed_0_final,refseq-TNFRSF13B-NM_012452.2.a2m,Invitae,refseq-TNFRSF13B-NM_012452.2.npy,1,293,293
+NP_036595.2,MGSLFRSETMCLAQLFLQSGTAYECLSALGEKGLVQFRDLNQNVSSFQRKFVGEVKRCEELERILVYLVQEINRADIPLPEGEASPPAPPLKQVLEMQEQLQKLEVELREVTKNKEKLRKNLLELIEYTHMLRVTKTFVKRNVEFEPTYEEFPSLESDSLLDYSCMQRLGAKLGFVSGLINQGKVEAFEKMLWRVCKGYTIVSYAELDESLEDPETGEVIKWYVFLISFWGEQIGHKVKKICDCYHCHVYPYPNTAEERREIQEGLNTRIQDLYTVLHKTEDYLRQVLCKAAESVYSRVIQVKKMKAIYHMLNMCSFDVTNKCLIAEVWCPEADLQDLRRALEEGSRESGATIPSFMNIIPTKETPPTRIRTNKFTEGFQNIVDAYGVGSYREVNPALFTIITFPFLFAVMFGDFGHGFVMFLFALLLVLNENHPRLNQSQEIMRMFFNGRYILLLMGLFSVYTGLIYNDCFSKSVNLFGSGWNVSAMYSSSHPPAEHKKMVLWNDSVVRHNSILQLDPSIPGVFRGPYPLGIDPIWNLATNRLTFLNSFKMKMSVILGIIHMTFGVILGIFNHLHFRKKFNIYLVSIPELLFMLCIFGYLIFMIFYKWLVFSAETSRVAPSILIEFINMFLFPASKTSGLYTGQEYVQRVLLVVTALSVPVLFLGKPLFLLWLHNGRSCFGVNRSGYTLIRKDSEEEVSLLGSQDIEEGNHQVEDGCREMACEEFNFGEILMTQVIHSIEYCLGCISNTASYLRLWALSLAHAQLSDVLWAMLMRVGLRVDTTYGVLLLLPVIALFAVLTIFILLIMEGLSAFLHAIRLHWVEFQNKFYVGAGTKFVPFSFSLLSSKFNNDDSVA,856,NP_036595.2.csv,refseq-ATP6V0A2-NM_012463.3_clinical_seed_0_final,refseq-ATP6V0A2-NM_012463.3.a2m,Invitae,refseq-ATP6V0A2-NM_012463.3.npy,1,856,856
+NP_036596.3,MGLGTLSPRMLVWLVASGIVFYGELWVCAGLDYDYTFDGNEEDKTETIDYKDPCKAAVFWGDIALDDEDLNIFQIDRTIDLTQNPFGNLGHTTGGLGDHAMSKKRGALYQLIDRIRRIGFGLEQNNTVKGKVPLQFSGQNEKNRVPRAATSRTERIWPGGVIPYVIGGNFTGSQRAMFKQAMRHWEKHTCVTFIERSDEESYIVFTYRPCGCCSYVGRRGNGPQAISIGKNCDKFGIVVHELGHVIGFWHEHTRPDRDNHVTIIRENIQPGQEYNFLKMEPGEVNSLGERYDFDSIMHYARNTFSRGMFLDTILPSRDDNGIRPAIGQRTRLSKGDIAQARKLYRCPACGETLQESNGNLSSPGFPNGYPSYTHCIWRVSVTPGEKIVLNFTTMDLYKSSLCWYDYIEVRDGYWRKSPLLGRFCGDKLPEVLTSTDSRMWIEFRSSSNWVGKGFAAVYEAICGGEIRKNEGQIQSPNYPDDYRPMKECVWKITVSESYHVGLTFQSFEIERHDNCAYDYLEVRDGTSENSPLIGRFCGYDKPEDIRSTSNTLWMKFVSDGTVNKAGFAANFFKEEDECAKPDRGGCEQRCLNTLGSYQCACEPGYELGPDRRSCEAACGGLLTKLNGTITTPGWPKEYPPNKNCVWQVVAPTQYRISVKFEFFELEGNEVCKYDYVEIWSGLSSESKLHGKFCGAEVPEVITSQFNNMRIEFKSDNTVSKKGFKAHFFSDKDECSKDNGGCQHECVNTMGSYMCQCRNGFVLHDNKHDCKEAECEQKIHSPSGLITSPNWPDKYPSRKECTWEISATPGHRIKLAFSEFEIEQHQECAYDHLEVFDGETEKSPILGRLCGNKIPDPLVATGNKMFVRFVSDASVQRKGFQATHSTECGGRLKAESKPRDLYSHAQFGDNNYPGQVDCEWLLVSERGSRLELSFQTFEVEEEADCGYDYVELFDGLDSTAVGLGRFCGSGPPEEIYSIGDSVLIHFHTDDTINKKGFHIRYKSIRYPDTTHTKK,1013,NP_036596.3.csv,refseq-TLL1-NM_012464.4_clinical_seed_0_final,refseq-TLL1-NM_012464.4.a2m,Invitae,refseq-TLL1-NM_012464.4.npy,1,1013,1013
+NP_036601.2,MNKKKKPFLGMPAPLGYVPGLGRGATGFTTRSDIGPARDANDPVDDRHAPPGKRTVGDQMKKNQAADDDDEDLNDTNYDEFNGYAGSLFSSGPYEKDDEEADAIYAALDKRMDERRKERREQREKEEIEKYRMERPKIQQQFSDLKRKLAEVTEEEWLSIPEVGDARNKRQRNPRYEKLTPVPDSFFAKHLQTGENHTSVDPRQTQFGGLNTPYPGGLNTPYPGGMTPGLMTPGTGELDMRKIGQARNTLMDMRLSQVSDSVSGQTVVDPKGYLTDLNSMIPTHGGDINDIKKARLLLKSVRETNPHHPPAWIASARLEEVTGKLQVARNLIMKGTEMCPKSEDVWLEAARLQPGDTAKAVVAQAVRHLPQSVRIYIRAAELETDIRAKKRVLRKALEHVPNSVRLWKAAVELEEPEDARIMLSRAVECCPTSVELWLALARLETYENARKVLNKARENIPTDRHIWITAAKLEEANGNTQMVEKIIDRAITSLRANGVEINREQWIQDAEECDRAGSVATCQAVMRAVIGIGIEEEDRKHTWMEDADSCVAHNALECARAIYAYALQVFPSKKSVWLRAAYFEKNHGTRESLEALLQRAVAHCPKAEVLWLMGAKSKWLAGDVPAARSILALAFQANPNSEEIWLAAVKLESENDEYERARRLLAKARSSAPTARVFMKSVKLEWVQDNIRAAQDLCEEALRHYEDFPKLWMMKGQIEEQKEMMEKAREAYNQGLKKCPHSTPLWLLLSRLEEKIGQLTRARAILEKSRLKNPKNPGLWLESVRLEYRAGLKNIANTLMAKALQECPNSGILWSEAIFLEARPQRRTKSVDALKKCEHDPHVLLAVAKLFWSQRKITKAREWFHRTVKIDSDLGDAWAFFYKFELQHGTEEQQEEVRKRCESAEPRHGELWCAVSKDIANWQKKIGDILRLVAGRIKNTF,941,NP_036601.2.csv,refseq-PRPF6-NM_012469.3_clinical_seed_0_final,refseq-PRPF6-NM_012469.3.a2m,Invitae,refseq-PRPF6-NM_012469.3.npy,1,941,941
+NP_036602.1,MEGAKPTLQLVYQAVQALYHDPDPSGKERASFWLGELQRSVHAWEISDQLLQIRQDVESCYFAAQTMKMKIQTSFYELPTDSHASLRDSLLTHIQNLKDLSPVIVTQLALAIADLALQMPSWKGCVQTLVEKYSNDVTSLPFLLEILTVLPEEVHSRSLRIGANRRTEIIEDLAFYSSTVVSLLMTCVEKAGTDEKMLMKVFRCLGSWFNLGVLDSNFMANNKLLALLFEVLQQDKTSSNLHEAASDCVCSALYAIENVETNLPLAMQLFQGVLTLETAYHMAVAREDLDKVLNYCRIFTELCETFLEKIVCTPGQGLGDLRTLELLLICAGHPQYEVVEISFNFWYRLGEHLYKTNDEVIHGIFKAYIQRLLHALARHCQLEPDHEGVPEETDDFGEFRMRVSDLVKDLIFLIGSMECFAQLYSTLKEGNPPWEVTEAVLFIMAAIAKSVDPENNPTLVEVLEGVVRLPETVHTAVRYTSIELVGEMSEVVDRNPQFLDPVLGYLMKGLCEKPLASAAAKAIHNICSVCRDHMAQHFNGLLEIARSLDSFLLSPEAAVGLLKGTALVLARLPLDKITECLSELCSVQVMALKKLLSQEPSNGISSDPTVFLDRLAVIFRHTNPIVENGQTHPCQKVIQEIWPVLSETLNKHRADNRIVERCCRCLRFAVRCVGKGSAALLQPLVTQMVNVYHVHQHSCFLYLGSILVDEYGMEEGCRQGLLDMLQALCIPTFQLLEQQNGLQNHPDTVDDLFRLATRFIQRSPVTLLRSQVVIPILQWAIASTTLDHRDANCSVMRFLRDLIHTGVANDHEEDFELRKELIGQVMNQLGQQLVSQLLHTCCFCLPPYTLPDVAEVLWEIMQVDRPTFCRWLENSLKGLPKETTVGAVTVTHKQLTDFHKQVTSAEECKQVCWALRDFTRLFR,923,NP_036602.1.csv,refseq-TNPO3-NM_012470.3_clinical_seed_0_final,refseq-TNPO3-NM_012470.3.a2m,Invitae,refseq-TNPO3-NM_012470.3.npy,1,923,923
+NP_036604.2,MGWITEDLIRRNAEHNDCVIFSLEELSLHQQEIERLEHIDKWCRDLKILYLQNNLIGKIENVSKLKKLEYLNLALNNIEKIENLEGCEELAKLDLTVNFIGELSSIKNLQHNIHLKELFLMGNPCASFDHYREFVVATLPQLKWLDGKEIEPSERIKALQDYSVIEPQIREQEKDHCLKRAKLKEEAQRKHQEEDKNEDKRSNAGFDGRWYTDINATLSSLESKDHLQAPDTEEHNTKKLDNSEDDLEFWNKPCLFTPESRLETLRHMEKQRKKQEKLSEKKKKVKPPRTLITEDGKALNVNEPKIDFSLKDNEKQIILDLAVYRYMDTSLIDVDVQPTYVRVMIKGKPFQLVLPAEVKPDSSSAKRSQTTGHLVICMPKVGEVITGGQRAFKSMKTTSDRSREQTNTRSKHMEKLEVDPSKHSFPDVTNIVQEKKHTPRRRPEPKIIPSEEDPTFEDNPEVPPLI,466,NP_036604.2.csv,refseq-LRRC6-NM_012472.4_clinical_seed_0_final,refseq-LRRC6-NM_012472.4.a2m,Invitae,refseq-LRRC6-NM_012472.4.npy,1,466,466
+NP_036610.2,MALNKNHSEGGGVIVNNTESILMSYDHVELTFNDMKNVPEAFKGTKKGTVYLTPYRVIFLSKGKDAMQSFMMPFYLMKDCEIKQPVFGANYIKGTVKAEAGGGWEGSASYKLTFTAGGAIEFGQRMLQVASQASRGEVPSGAYGYSYMPSGAYVYPPPVANGMYPCPPGYPYPPPPPEFYPGPPMMDGAMGYVQPPPPPYPGPMEPPVSGPDVPSTPAAEAKAAEAAASAYYNPGNPHNVYMPTSQPPPPPYYPPEDKKTQ,261,NP_036610.2.csv,refseq-WBP2-NM_012478.3_clinical_seed_0_final,refseq-WBP2-NM_012478.3.a2m,Invitae,refseq-WBP2-NM_012478.3.npy,1,261,261
+NP_036611.2,MVDREQLVQKARLAEQAERYDDMAAAMKNVTELNEPLSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTSADGNEKKIEMVRAYREKIEKELEAVCQDVLSLLDNYLIKNCSETQYESKVFYLKMKGDYYRYLAEVATGEKRATVVESSEKAYSEAHEISKEHMQPTHPIRLGLALNYSVFYYEIQNAPEQACHLAKTAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDQQDDDGGEGNN,247,NP_036611.2.csv,refseq-YWHAG-NM_012479.3_clinical_seed_0_final,refseq-YWHAG-NM_012479.3.a2m,Invitae,refseq-YWHAG-NM_012479.3.npy,1,247,247
+NP_037377.1,MTTSTLQKAIDLVTKATEEDKAKNYEEALRLYQHAVEYFLHAIKYEAHSDKAKESIRAKCVQYLDRAEKLKDYLRSKEKHGKKPVKENQSEGKGSDSDSEGDNPEKKKLQEQLMGAVVMEKPNIRWNDVAGLEGAKEALKEAVILPIKFPHLFTGKRTPWRGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLMSKWLGESEKLVKNLFELARQHKPSIIFIDEVDSLCGSRNENESEAARRIKTEFLVQMQGVGNNNDGTLVLGATNIPWVLDSAIRRRFEKRIYIPLPEEAARAQMFRLHLGSTPHNLTDANIHELARKTEGYSGADISIIVRDSLMQPVRKVQSATHFKKVCGPSRTNPSMMIDDLLTPCSPGDPGAMEMTWMDVPGDKLLEPVVCMSDMLRSLATTRPTVNADDLLKVKKFSEDFGQES,437,NP_037377.1.csv,refseq-VPS4A-NM_013245.2_clinical_seed_0_final,refseq-VPS4A-NM_013245.2.a2m,Invitae,refseq-VPS4A-NM_013245.2.npy,1,437,437
+NP_037379.1,MAAPRAGRGAGWSLRAWRALGGIRWGRRPRLTPDLRALLTSGTSDPRARVTYGTPSLWARLSVGVTEPRACLTSGTPGPRAQLTAVTPDTRTREASENSGTRSRAWLAVALGAGGAVLLLLWGGGRGPPAVLAAVPSPPPASPRSQYNFIADVVEKTAPAVVYIEILDRHPFLGREVPISNGSGFVVAADGLIVTNAHVVADRRRVRVRLLSGDTYEAVVTAVDPVADIATLRIQTKEPLPTLPLGRSADVRQGEFVVAMGSPFALQNTITSGIVSSAQRPARDLGLPQTNVEYIQTDAAIDFGNSGGPLVNLDGEVIGVNTMKVTAGISFAIPSDRLREFLHRGEKKNSSSGISGSQRRYIGVMMLTLSPSILAELQLREPSFPDVQHGVLIHKVILGSPAHRAGLRPGDVILAIGEQMVQNAEDVYEAVRTQSQLAVQIRRGRETLTLYVTPEVTE,458,NP_037379.1.csv,refseq-HTRA2-NM_013247.4_clinical_seed_0_final,refseq-HTRA2-NM_013247.4.a2m,Invitae,refseq-HTRA2-NM_013247.4.npy,1,458,458
+NP_037383.1,MRIMLLFTAILAFSLAQSFGAVCKEPQEEVVPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLKALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPDSPTDVNQENVPSFGILKYPPRAE,121,NP_037383.1.csv,refseq-TAC3-NM_013251.3_clinical_seed_0_final,refseq-TAC3-NM_013251.3.a2m,Invitae,refseq-TAC3-NM_013251.3.npy,1,121,121
+NP_037386.1,MQSTSNHLWLLSDILGQGATANVFRGRHKKTGDLFAIKVFNNISFLRPVDVQMREFEVLKKLNHKNIVKLFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDVVGGMNHLRENGIVHRDIKPGNIMRVIGEDGQSVYKLTDFGAARELEDDEQFVSLYGTEEYLHPDMYERAVLRKDHQKKYGATVDLWSIGVTFYHAATGSLPFRPFEGPRRNKEVMYKIITGKPSGAISGVQKAENGPIDWSGDMPVSCSLSRGLQVLLTPVLANILEADQEKCWGFDQFFAETSDILHRMVIHVFSLQQMTAHKIYIHSYNTATIFHELVYKQTKIISSNQELIYEGRRLVLEPGRLAQHFPKTTEENPIFVVSREPLNTIGLIYEKISLPKVHPRYDLDGDASMAKAITGVVCYACRIASTLLLYQELMRKGIRWLIELIKDDYNETVHKKTEVVITLDFCIRNIEKTVKVYEKLMKINLEAAELGEISDIHTKLLRLSSSQGTIETSLQDIDSRLSPGGSLADAWAHQEGTHPKDRNVEKLQVLLNCMTEIYYQFKKDKAERRLAYNEEQIHKFDKQKLYYHATKAMTHFTDECVKKYEAFLNKSEEWIRKMLHLRKQLLSLTNQCFDIEEEVSKYQEYTNELQETLPQKMFTASSGIKHTMTPIYPSSNTLVEMTLGMKKLKEEMEGVVKELAENNHILERFGSLTMDGGLRNVDCL,729,NP_037386.1.csv,refseq-TBK1-NM_013254.3_clinical_seed_0_final,refseq-TBK1-NM_013254.3.a2m,Invitae,refseq-TBK1-NM_013254.3.npy,1,729,729
+NP_037451.1,MAASQVLGEKINILSGETVKAGDRDPLGNDCPEQDRLPQRSWRQKCASYVLALRPWSFSASLTPVALGSALAYRSHGVLDPRLLVGCAVAVLAVHGAGNLVNTYYDFSKGIDHKKSDDRTLVDRILEPQDVVRFGVFLYTLGCVCAACLYYLSPLKLEHLALIYFGGLSGSFLYTGGIGFKYVALGDLIILITFGPLAVMFAYAIQVGSLAIFPLVYAIPLALSTEAILHSNNTRDMESDREAGIVTLAILIGPTFSYILYNTLLFLPYLVFSILATHCTISLALPLLTIPMAFSLERQFRSQAFNKLPQRTAKLNLLLGLFYVFGIILAPAGSLPKI,338,NP_037451.1.csv,refseq-UBIAD1-NM_013319.2_clinical_seed_0_final,refseq-UBIAD1-NM_013319.2.a2m,Invitae,refseq-UBIAD1-NM_013319.2.npy,1,338,338
+NP_037460.2,MSVGFIGAGQLAYALARGFTAAGILSAHKIIASSPEMNLPTVSALRKMGVNLTRSNKETVKHSDVLFLAVKPHIIPFILDEIGADVQARHIVVSCAAGVTISSVEKKLMAFQPAPKVIRCMTNTPVVVQEGATVYATGTHALVEDGQLLEQLMSSVGFCTEVEEDLIDAVTGLSGSGPAYAFMALDALADGGVKMGLPRRLAIQLGAQALLGAAKMLLDSEQHPCQLKDNVCSPGGATIHALHFLESGGFRSLLINAVEASCIRTRELQSMADQEKISPAALKKTLLDRVKLESPTVSTLTPSSPGKLLTRSLALGGKKD,320,NP_037460.2.csv,refseq-PYCR2-NM_013328.3_clinical_seed_0_final,refseq-PYCR2-NM_013328.3.a2m,Invitae,refseq-PYCR2-NM_013328.3_theta_0.2.npy,1,320,320
+NP_037466.3,MKALILVGGYGTRLRPLTLSTPKPLVDFCNKPILLHQVEALAAAGVDHVILAVSYMSQVLEKEMKAQEQRLGIRISMSHEEEPLGTAGPLALARDLLSETADPFFVLNSDVICDFPFQAMVQFHRHHGQEGSILVTKVEEPSKYGVVVCEADTGRIHRFVEKPQVFVSNKINAGMYILSPAVLQRIQLQPTSIEKEVFPIMAKEGQLYAMELQGFWMDIGQPKDFLTGMCLFLQSLRQKQPERLCSGPGIVGNVLVDPSARIGQNCSIGPNVSLGPGVVVEDGVCIRRCTVLRDARIRSHSWLESCIVGWRCRVGQWVSLWAGLGGERGGECACLPDKAYPLLEVRMENVTVLGEDVIVNDELYLNGASVLPHKSIGESVPEPRIIM,387,NP_037466.3.csv,refseq-GMPPB-NM_013334.4_clinical_seed_0_final,refseq-GMPPB-NM_013334.4.a2m,Invitae,refseq-GMPPB-NM_013334.4_theta_0.2.npy,1,387,387
+NP_037468.1,MAIKFLEVIKPFCVILPEIQKPERKIQFKEKVLWTAITLFIFLVCCQIPLFGIMSSDSADPFYWMRVILASNRGTLMELGISPIVTSGLIMQLLAGAKIIEVGDTPKDRALFNGAQKLFGMIITIGQSIVYVMTGMYGDPSEMGAGICLLITIQLFVAGLIVLLLDELLQKGYGLGSGISLFIATNICETIVWKAFSPTTVNTGRGMEFEGAIIALFHLLATRTDKVRALREAFYRQNLPNLMNLIATIFVFAVVIYFQGFRVDLPIKSARYRGQYNTYPIKLFYTSNIPIILQSALVSNLYVISQMLSARFSGNLLVSLLGTWSDTSSGGPARAYPVGGLCYYLSPPESFGSVLEDPVHAVVYIVFMLGSCAFFSKTWIEVSGSSAKDVAKQLKEQQMVMRGHRETSMVHELNRYIPTAAAFGGLCIGALSVLADFLGAIGSGTGILLAVTIIYQYFEIFVKEQSEVGSMGALLF,476,NP_037468.1.csv,refseq-SEC61A1-NM_013336.3_clinical_seed_0_final,refseq-SEC61A1-NM_013336.3.a2m,Invitae,refseq-SEC61A1-NM_013336.3.npy,1,476,476
+NP_037471.2,MEKWYLMTVVVLIGLTVRWTVSLNSYSGAGKPPMFGDYEAQRHWQEITFNLPVKQWYFNSSDNNLQYWGLDYPPLTAYHSLLCAYVAKFINPDWIALHTSRGYESQAHKLFMRTTVLIADLLIYIPAVVLYCCCLKEISTKKKIANALCILLYPGLILIDYGHFQYNSVSLGFALWGVLGISCDCDLLGSLAFCLAINYKQMELYHALPFFCFLLGKCFKKGLKGKGFVLLVKLACIVVASFVLCWLPFFTEREQTLQVLRRLFPVDRGLFEDKVANIWCSFNVFLKIKDILPRHIQLIMSFCSTFLSLLPACIKLILQPSSKGFKFTLVSCALSFFLFSFQVHEKSILLVSLPVCLVLSEIPFMSTWFLLVSTFSMLPLLLKDELLMPSVVTTMAFFIACVTSFSIFEKTSEEELQLKSFSISVRKYLPCFTFLSRIIQYLFLISVITMVLLTLMTVTLDPPQKLPDLFSVLVCFVSCLNFLFFLVYFNIIIMWDSKSGRNQKKIS,507,NP_037471.2.csv,refseq-ALG6-NM_013339.3_clinical_seed_0_final,refseq-ALG6-NM_013339.3.a2m,Invitae,refseq-ALG6-NM_013339.3.npy,1,507,507
+NP_037514.2,MPPATGGGLAESELRPRRGRCGPQAARAAGRDVAAEAVARSPKRPAWGSRRFEAVGWWALLALVTLLSFATRFHRLDEPPHICWDETHFGKMGSYYINRTFFFDVHPPLGKMLIGLAGYLSGYDGTFLFQKPGDKYEHHSYMGMRGFCAFLGSWLVPFAYLTVLDLSKSLSAALLTAALLTFDTGCLTLSQYILLDPILMFFIMAAMLSMVKYNSCADRPFSAPWWFWLSLTGVSLAGALGVKFVGLFIILQVGLNTIADLWYLFGDLSLSLVTVGKHLTARVLCLIVLPLALYTATFAVHFMVLSKSGPGDGFFSSAFQARLSGNNLHNASIPEHLAYGSVITVKNLRMAIGYLHSHRHLYPEGIGARQQQVTTYLHKDYNNLWIIKKHNTNSDPLDPSFPVEFVRHGDIIRLEHKETSRNLHSHYHEAPMTRKHYQVTGYGINGTGDSNDFWRIEVVNRKFGNRIKVLRSRIRFIHLVTGCVLGSSGKVLPKWGWEQLEVTCTPYLKETLNSIWNVEDHINPKLPNISLDVLQPSFPEILLESHMVMIRGNSGLKPKDNEFTSKPWHWPINYQGLRFSGVNDTDFRVYLLGNPVVWWLNLLSIALYLLSGSIIAVAMQRGARLPAEVAGLSQVLLRGGGQVLLGWTLHYFPFFLMGRVLYFHHYFPAMLFSSMLTGILWDTLLRLCAWGLASWPLARGIHVAGILSLLLGTAYSFYLFHPLAYGMVGPLAQDPQSPMAGLRWLDSWDF,750,NP_037514.2.csv,POMT2_HUMAN_b03_clinical_seed_0_final,POMT2_HUMAN_b03.a2m,EVE,POMT2_HUMAN_b03_theta_0.2.npy,1,750,750
+NP_037518.3,MLRWLRDFVLPTAACQDAEQPTRYETLFQALDRNGDGVVDIGELQEGLRNLGIPLGQDAEEKIFTTGDVNKDGKLDFEEFMKYLKDHEKKMKLAFKSLDKNNDGKIEASEIVQSLQTLGLTISEQQAELILQSIDVDGTMTVDWNEWRDYFLFNPVTDIEEIIRFWKHSTGIDIGDSLTIPDEFTEDEKKSGQWWRQLLAGGIAGAVSRTSTAPLDRLKIMMQVHGSKSDKMNIFGGFRQMVKEGGIRSLWRGNGTNVIKIAPETAVKFWAYEQYKKLLTEEGQKIGTFERFISGSMAGATAQTFIYPMEVMKTRLAVGKTGQYSGIYDCAKKILKHEGLGAFYKGYVPNLLGIIPYAGIDLAVYELLKSYWLDNFAKDSVNPGVMVLLGCGALSSTCGQLASYPLALVRTRMQAQAMLEGSPQLNMVGLFRRIISKEGIPGLYRGITPNFMKVLPAVGISYVVYENMKQTLGVTQK,477,NP_037518.3.csv,refseq-SLC25A24-NM_013386.4_clinical_seed_0_final,refseq-SLC25A24-NM_013386.4.a2m,Invitae,refseq-SLC25A24-NM_013386.4.npy,1,477,477
+NP_037523.2,MLRPGAQLLRGLLLRSCPLQGSPGRPRSVCGREGEEKPPLSAETQWKDRAETVIIGGGCVGVSLAYHLAKAGMKDVVLLEKSELTAGSTWHAAGLTTYFHPGINLKKIHYDSIKLYEKLEEETGQVVGFHQPGSIRLATTPVRVDEFKYQMTRTGWHATEQYLIEPEKIQEMFPLLNMNKVLAGLYNPGDGHIDPYSLTMALAAGARKCGALLKYPAPVTSLKARSDGTWDVETPQGSMRANRIVNAAGFWAREVGKMIGLEHPLIPVQHQYVVTSTISEVKALKRELPVLRDLEGSYYLRQERDGLLFGPYESQEKMKVQDSWVTNGVPPGFGKELFESDLDRIMEHIKAAMEMVPVLKKADIINVVNGPITYSPDILPMVGPHQGVRNYWVAIGFGYGIIHAGGVGKYLSDWILHGEPPFDLIELDPNRYGKWTTTQYTEAKARESYGFNNIVGYPKEERFAGRPTQRVSGLYQRLESKCSMGFHAGWEQPHWFYKPGQDTQYRPSFRRTNWFEPVGSEYKQVMQRVAVTDLSPFGKFNIKGQDSIRLLDHLFANVIPKVGFTNISHMLTPKGRVYAELTVSHQSPGEFLLITGSGSELHDLRWIEEEAVKGGYDVEIKNITDELGVLGVAGPQARKVLQKLTSEDLSDDVFKFLQTKSLKVSNIPVTAIRISYTGELGWELYHRREDSVALYDAIMNAGQEEGIDNFGTYAMNALRLEKAFRAWGLEMNCDTNPLEAGLEYFVKLNKPADFIGKQALKQIKAKGLKRRLVCLTLATDDVDPEGNESIWYNGKVVGNTTSGSYSYSIQKSLAFAYVPVQLSEVGQQVEVELLGKNYPAVIIQEPLVLTEPTRNRLQKKGGKDKT,866,NP_037523.2.csv,refseq-DMGDH-NM_013391.3_clinical_seed_0_final,refseq-DMGDH-NM_013391.3.a2m,Invitae,refseq-DMGDH-NM_013391.3.npy,1,866,866
+NP_038202.2,MAVAQQLRAESDFEQLPDDVAISANIADIEEKRGFTSHFVFVIEVKTKGGSKYLIYRRYRQFHALQSKLEERFGPDSKSSALACTLPTLPAKVYVGVKQEIAEMRIPALNAYMKSLLSLPVWVLMDEDVRIFFYQSPYDSEQVPQALRRLRPRTRKVKSVSPQGNSVDRMAAPRAEALFDFTGNSKLELNFKAGDVIFLLSRINKDWLEGTVRGATGIFPLSFVKILKDFPEEDDPTNWLRCYYYEDTISTIKSVAWEGGACPAFLPSLRPLPLTSPSHGSLSHSKAPSGSQMSHNAVTSHQRPGWPGQPHSPFPHPTPHFQPDASLLQPVTPLGTSRWRKISAALPY,348,NP_038202.2.csv,refseq-NCF4-NM_013416.3_clinical_seed_0_final,refseq-NCF4-NM_013416.3.a2m,Invitae,refseq-NCF4-NM_013416.3.npy,1,348,348
+NP_038460.4,MSLERELRQLSKAKAKAQRAGQRREEAALCHQLGELLAGHGRYAEALEQHWQELQLRERADDPLGCAVAHRKIGERLAEMEDYPAALQHQHQYLELAHSLRNHTELQRAWATIGRTHLDIYDHCQSRDALLQAQAAFEKSLAIVDEELEGTLAQGELNEMRTRLYLNLGLTFESLQQTALCNDYFRKSIFLAEQNHLYEDLFRARYNLGTIHWRAGQHSQAMRCLEGARECAHTMRKRFMESECCVVIAQVLQDLGDFLAAKRALKKAYRLGSQKPVQRAAICQNLQHVLAVVRLQQQLEEAEGRDPQGAMVICEQLGDLFSKAGDFPRAAEAYQKQLRFAELLDRPGAERAIIHVSLATTLGDMKDHHGAVRHYEEELRLRSGNVLEEAKTWLNIALSREEAGDAYELLAPCFQKALSCAQQAQRPQLQRQVLQHLHTVQLRLQPQEAPETETRLRELSVAEDEDEEEEAEEAAATAESEALEAGEVELSEGEDDTDGLTPQLEEDEELQGHLGRRKGSKWNRRNDMGETLLHRACIEGQLRRVQDLVRQGHPLNPRDYCGWTPLHEACNYGHLEIVRFLLDHGAAVDDPGGQGCEGITPLHDALNCGHFEVAELLLERGASVTLRTRKGLSPLETLQQWVKLYRRDLDLETRQKARAMEMLLQAAASGQDPHSSQAFHTPSSLLFDPETSPPLSPCPEPPSNSTRLPEASQAHVRVSPGQAAPAMARPRRSRHGPASSSSSSEGEDSAGPARPSQKRPRCSATAQRVAAWTPGPASNREAATASTSRAAYQAAIRGVGSAQSRLGPGPPRGHSKALAPQAALIPEEECLAGDWLELDMPLTRSRRPRPRGTGDNRRPSSTSGSDSEESRPRARAKQVRLTCMQSCSAPVNAGPSSLASEPPGSPSTPRVSEPSGDSSAAGQPLGPAPPPPIRVRVQVQDHLFLIPVPHSSDTHSVAWLAEQAAQRYYQTCGLLPRLTLRKEGALLAPQDLIPDVLQSNDEVLAEVTSWDLPPLTDRYRRACQSLGQGEHQQVLQAVELQGLGLSFSACSLALDQAQLTPLLRALKLHTALRELRLAGNRLGDKCVAELVAALGTMPSLALLDLSSNHLGPEGLRQLAMGLPGQATLQSLEELDLSMNPLGDGCGQSLASLLHACPLLSTLRLQACGFGPSFFLSHQTALGSAFQDAEHLKTLSLSYNALGAPALARTLQSLPAGTLLHLELSSVAAGKGDSDLMEPVFRYLAKEGCALAHLTLSANHLGDKAVRDLCRCLSLCPSLISLDLSANPEISCASLEELLSTLQKRPQGLSFLGLSGCAVQGPLGLGLWDKIAAQLRELQLCSRRLCAEDRDALRQLQPSRPGPGECTLDHGSKLFFRRL,1378,NP_038460.4.csv,refseq-TONSL-NM_013432.4_clinical_seed_0_final,refseq-TONSL-NM_013432.4.a2m,Invitae,refseq-TONSL-NM_013432.4.npy,1,1378,1378
+NP_038472.2,MAENGESSGPPRPSRGPAAAQGSAAAPAEPKIIKVTVKTPKEKEEFAVPENSSVQQFKEAISKRFKSQTDQLVLIFAGKILKDQDTLIQHGIHDGLTVHLVIKSQNRPQGQSTQPSNAAGTNTTSASTPRSNSTPISTNSNPFGLGSLGGLAGLSSLGLSSTNFSELQSQMQQQLMASPEMMIQIMENPFVQSMLSNPDLMRQLIMANPQMQQLIQRNPEISHLLNNPDIMRQTLEIARNPAMMQEMMRNQDLALSNLESIPGGYNALRRMYTDIQEPMLNAAQEQFGGNPFASVGSSSSSGEGTQPSRTENRDPLPNPWAPPPATQSSATTSTTTSTGSGSGNSSSNATGNTVAAANYVASIFSTPGMQSLLQQITENPQLIQNMLSAPYMRSMMQSLSQNPDLAAQMMLNSPLFTANPQLQEQMRPQLPAFLQQMQNPDTLSAMSNPRAMQALMQIQQGLQTLATEAPGLIPSFTPGVGVGVLGTAIGPVGPVTPIGPIGPIVPFTPIGPIGPIGPTGPAAPPGSTGSGGPTGPTVSSAAPSETTSPTSESGPNQQFIQQMVQALAGANAPQLPNPEVRFQQQLEQLNAMGFLNREANLQALIATGGDINAAIERLLGSQPS,624,NP_038472.2.csv,refseq-UBQLN2-NM_013444.3_clinical_seed_0_final,refseq-UBQLN2-NM_013444.3.a2m,Invitae,refseq-UBQLN2-NM_013444.3.npy,1,624,624
+NP_038475.2,MGGRVFLVFLAFCVWLTLPGAETQDSRGCARWCPQDSSCVNATACRCNPGFSSFSEIITTPMETCDDINECATLSKVSCGKFSDCWNTEGSYDCVCSPGYEPVSGAKTFKNESENTCQDVDECQQNPRLCKSYGTCVNTLGSYTCQCLPGFKLKPEDPKLCTDVNECTSGQNPCHSSTHCLNNVGSYQCRCRPGWQPIPGSPNGPNNTVCEDVDECSSGQHQCDSSTVCFNTVGSYSCRCRPGWKPRHGIPNNQKDTVCEDMTFSTWTPPPGVHSQTLSRFFDKVQDLGRDYKPGLANNTIQSILQALDELLEAPGDLETLPRLQQHCVASHLLDGLEDVLRGLSKNLSNGLLNFSYPAGTELSLEVQKQVDRSVTLRQNQAVMQLDWNQAQKSGDPGPSVVGLVSIPGMGKLLAEAPLVLEPEKQMLLHETHQGLLQDGSPILLSDVISAFLSNNDTQNLSSPVTFTFSHRSVIPRQKVLCVFWEHGQNGCGHWATTGCSTIGTRDTSTICRCTHLSSFAVLMAHYDVQEEDPVLTVITYMGLSVSLLCLLLAALTFLLCKAIQNTSTSLHLQLSLCLFLAHLLFLVAIDQTGHKVLCSIIAGTLHYLYLATLTWMLLEALYLFLTARNLTVVNYSSINRFMKKLMFPVGYGVPAVTVAISAASRPHLYGTPSRCWLQPEKGFIWGFLGPVCAIFSVNLVLFLVTLWILKNRLSSLNSEVSTLRNTRMLAFKATAQLFILGCTWCLGILQVGPAARVMAYLFTIINSLQGVFIFLVYCLLSQQVREQYGKWSKGIRKLKTESEMHTLSSSAKADTSKPSTVN,823,NP_038475.2.csv,refseq-ADGRE2-NM_013447.3_clinical_seed_0_final,refseq-ADGRE2-NM_013447.3.a2m,Invitae,refseq-ADGRE2-NM_013447.3.npy,1,823,823
+NP_054701.1,MVCFRLFPVPGSGLVLVCLVLGAVRSYALELNLTDSENATCLYAKWQMNFTVRYETTNKTYKTVTISDHGTVTYNGSICGDDQNGPKIAVQFGPGFSWIANFTKAASTYSIDSVSFSYNTGDNTTFPDAEDKGILTVDELLAIRIPLNDLFRCNSLSTLEKNDVVQHYWDVLVQAFVQNGTVSTNEFLCDKDKTSTVAPTIHTTVPSPTTTPTPKEKPEAGTYSVNNGNDTCLLATMGLQLNITQDKVASVININPNTTHSTGSCRSHTALLRLNSSTIKYLDFVFAVKNENRFYLKEVNISMYLVNGSVFSIANNNLSYWDAPLGSSYMCNKEQTVSVSGAFQINTFDLRVQPFNVTQGKYSTAQECSLDDDTILIPIIVGAGLSGLIIVIVIAYVIGRRKSYAGYQTL,410,NP_054701.1.csv,refseq-LAMP2-NM_013995.2_clinical_seed_0_final,refseq-LAMP2-NM_013995.2.a2m,Invitae,refseq-LAMP2-NM_013995.2.npy,1,410,410
+NP_054706.1,MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSKLNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQVADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQNWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRRQGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPTVDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGESPQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREEVFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARILLRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVAMANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMVMDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTDELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSSKPGIPAAEVGIGVVAEADAADAAGFPVPPDMEDDYEPELLLMPSNQPVNQPILAAAQSLHREATKWSSKGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNLMQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQ,1134,NP_054706.1.csv,refseq-VCL-NM_014000.2_clinical_seed_0_final,refseq-VCL-NM_014000.2.a2m,Invitae,refseq-VCL-NM_014000.2_theta_0.2.npy,1,1134,1134
+NP_054722.2,MGDTSEDASIHRLEGTDLDCQVGGLICKSKSAASEQHVFKAPAPRPSLLGLDLLASLKRREREEKDDGEDKKKSKVSSYKDWEESKDDQKDAEEEGGDQAGQNIRKDRHYRSARVETPSHPGGVSEEFWERSRQRERERREHGVYASSKEEKDWKKEKSRDRDYDRKRDRDERDRSRHSSRSERDGGSERSSRRNEPESPRHRPKDAATPSRSTWEEEDSGYGSSRRSQWESPSPTPSYRDSERSHRLSTRDRDRSVRGKYSDDTPLPTPSYKYNEWADDRRHLGSTPRLSRGRGRREEGEEGISFDTEEERQQWEDDQRQADRDWYMMDEGYDEFHNPLAYSSEDYVRRREQHLHKQKQKRISAQRRQINEDNERWETNRMLTSGVVHRLEVDEDFEEDNAAKVHLMVHNLVPPFLDGRIVFTKQPEPVIPVKDATSDLAIIARKGSQTVRKHREQKERKKAQHKHWELAGTKLGDIMGVKKEEEPDKAVTEDGKVDYRTEQKFADHMKRKSEASSEFAKKKSILEQRQYLPIFAVQQELLTIIRDNSIVIVVGETGSGKTTQLTQYLHEDGYTDYGMIGCTQPRRVAAMSVAKRVSEEMGGNLGEEVGYAIRFEDCTSENTLIKYMTDGILLRESLREADLDHYSAIIMDEAHERSLNTDVLFGLLREVVARRSDLKLIVTSATMDAEKFAAFFGNVPIFHIPGRTFPVDILFSKTPQEDYVEAAVKQSLQVHLSGAPGDILIFMPGQEDIEVTSDQIVEHLEELENAPALAVLPIYSQLPSDLQAKIFQKAPDGVRKCIVATNIAETSLTVDGIMFVIDSGYCKLKVFNPRIGMDALQIYPISQANANQRSGRAGRTGPGQCFRLYTQSAYKNELLTTTVPEIQRTNLANVVLLLKSLGVQDLLQFHFMDPPPEDNMLNSMYQLWILGALDNTGGLTSTGRLMVEFPLDPALSKMLIVSCDMGCSSEILLIVSMLSVPAIFYRPKGREEESDQIREKFAVPESDHLTYLNVYLQWKNNNYSTIWCNDHFIHAKAMRKVREVRAQLKDIMVQQRMSLASCGTDWDIVRKCICAAYFHQAAKLKGIGEYVNIRTGMPCHLHPTSSLFGMGYTPDYIVYHELVMTTKEYMQCVTAVDGEWLAELGPMFYSVKQAGKSRQENRRRAKEEASAMEEEMALAEEQLRARRQEQEKRSPLGSVRSTKIYTPGRKEQGEPMTPRRTPARFGL,1227,NP_054722.2.csv,refseq-DHX38-NM_014003.3_clinical_seed_0_final,refseq-DHX38-NM_014003.3.a2m,Invitae,refseq-DHX38-NM_014003.3_theta_0.2.npy,1,1227,1227
+NP_054728.2,MPNPRPGKPSAPSLALGPSPGASPSWRAAPKASDLLGARGPGGTFQGRDLRGGAHASSSSLNPMPPSQLQLPTLPLVMVAPSGARLGPLPHLQALLQDRPHFMHQLSTVDAHARTPVLQVHPLESPAMISLTPPTTATGVFSLKARPGLPPGINVASLEWVSREPALLCTFPNPSAPRKDSTLSAVPQSSYPLLANGVCKWPGCEKVFEEPEDFLKHCQADHLLDEKGRAQCLLQREMVQSLEQQLVLEKEKLSAMQAHLAGKMALTKASSVASSDKGSCCIVAAGSQGPVVPAWSGPREAPDSLFAVRRHLWGSHGNSTFPEFLHNMDYFKFHNMRPPFTYATLIRWAILEAPEKQRTLNEIYHWFTRMFAFFRNHPATWKNAIRHNLSLHKCFVRVESEKGAVWTVDELEFRKKRSQRPSRCSNPTPGP,431,NP_054728.2.csv,refseq-FOXP3-NM_014009.3_clinical_seed_0_final,refseq-FOXP3-NM_014009.3.a2m,Invitae,refseq-FOXP3-NM_014009.3.npy,1,431,431
+NP_054733.2,MADVTARSLQYEYKANSNLVLQADRSLIDRTRRDEPTGEVLSLVGKLEGTRMGDKAQRTKPQMQEERRAKRRKRDEDRHDINKMKGYTLLSEGIDEMVGIIYKPKTKETRETYEVLLSFIQAALGDQPRDILCGAADEVLAVLKNEKLRDKERRKEIDLLLGQTDDTRYHVLVNLGKKITDYGGDKEIQNMDDNIDETYGVNVQFESDEEEGDEDVYGEVREEASDDDMEGDEAVVRCTLSANLVASGELMSSKKKDLHPRDIDAFWLQRQLSRFYDDAIVSQKKADEVLEILKTASDDRECENQLVLLLGFNTFDFIKVLRQHRMMILYCTLLASAQSEAEKERIMGKMEADPELSKFLYQLHETEKEDLIREERSRRERVRQSRMDTDLETMDLDQGGEALAPRQVLDLEDLVFTQGSHFMANKRCQLPDGSFRRQRKGYEEVHVPALKPKPFGSEEQLLPVEKLPKYAQAGFEGFKTLNRIQSKLYRAALETDENLLLCAPTGAGKTNVALMCMLREIGKHINMDGTINVDDFKIIYIAPMRSLVQEMVGSFGKRLATYGITVAELTGDHQLCKEEISATQIIVCTPEKWDIITRKGGERTYTQLVRLIILDEIHLLHDDRGPVLEALVARAIRNIEMTQEDVRLIGLSATLPNYEDVATFLRVDPAKGLFYFDNSFRPVPLEQTYVGITEKKAIKRFQIMNEIVYEKIMEHAGKNQVLVFVHSRKETGKTARAIRDMCLEKDTLGLFLREGSASTEVLRTEAEQCKNLELKDLLPYGFAIHHAGMTRVDRTLVEDLFADKHIQVLVSTATLAWGVNLPAHTVIIKGTQVYSPEKGRWTELGALDILQMLGRAGRPQYDTKGEGILITSHGELQYYLSLLNQQLPIESQMVSKLPDMLNAEIVLGNVQNAKDAVNWLGYAYLYIRMLRSPTLYGISHDDLKGDPLLDQRRLDLVHTAALMLDKNNLVKYDKKTGNFQVTELGRIASHYYITNDTVQTYNQLLKPTLSEIELFRVFSLSSEFKNITVREEEKLELQKLLERVPIPVKESIEEPSAKINVLLQAFISQLKLEGFALMADMVYVTQSAGRLMRAIFEIVLNRGWAQLTDKTLNLCKMIDKRMWQSMCPLRQFRKLPEEVVKKIEKKNFPFERLYDLNHNEIGELIRMPKMGKTIHKYVHLFPKLELSVHLQPITRSTLKVELTITPDFQWDEKVHGSSEAFWILVEDVDSEVILHHEYFLLKAKYAQDEHLITFFVPVFEPLPPQYFIRVVSDRWLSCETQLPVSFRHLILPEKYPPPTELLDLQPLPVSALRNSAFESLYQDKFPFFNPIQTQVFNTVYNSDDNVFVGAPTGSGKTICAEFAILRMLLQSSEGRCVYITPMEALAEQVYMDWYEKFQDRLNKKVVLLTGETSTDLKLLGKGNIIISTPEKWDILSRRWKQRKNVQNINLFVVDEVHLIGGENGPVLEVICSRMRYISSQIERPIRIVALSSSLSNAKDVAHWLGCSATSTFNFHPNVRPVPLELHIQGFNISHTQTRLLSMAKPVYHAITKHSPKKPVIVFVPSRKQTRLTAIDILTTCAADIQRQRFLHCTEKDLIPYLEKLSDSTLKETLLNGVGYLHEGLSPMERRLVEQLFSSGAIQVVVASRSLCWGMNVAAHLVIIMDTQYYNGKIHAYVDYPIYDVLQMVGHANRPLQDDEGRCVIMCQGSKKDFFKKFLYEPLPVESHLDHCMHDHFNAEIVTKTIENKQDAVDYLTWTFLYRRMTQNPNYYNLQGISHRHLSDHLSELVEQTLSDLEQSKCISIEDEMDVAPLNLGMIAAYYYINYTTIELFSMSLNAKTKVRGLIEIISNAAEYENIPIRHHEDNLLRQLAQKVPHKLNNPKFNDPHVKTNLLLQAHLSRMQLSAELQSDTEEILSKAIRLIQACVDVLSSNGWLSPALAAMELAQMVTQAMWSKDSYLKQLPHFTSEHIKRCTDKGVESVFDIMEMEDEERNALLQLTDSQIADVARFCNRYPNIELSYEVVDKDSIRSGGPVVVLVQLEREEEVTGPVIAPLFPQKREEGWWVVIGDAKSNSLISIKRLTLQQKAKVKLDFVAPATGAHNYTLYFMSDAYMGCDQEYKFSVDVKEAETDSDSD,2136,NP_054733.2.csv,refseq-SNRNP200-NM_014014.4_clinical_seed_0_final,refseq-SNRNP200-NM_014014.4.a2m,Invitae,refseq-SNRNP200-NM_014014.4.npy,1,2136,2136
+NP_054742.2,MPTESASCSTARQTKQKRKSHSLSIRRTNSSEQERTGLPRDMLEGQDSKLPSSVRSTLLELFGQIEREFENLYIENLELRREIDTLNERLAAEGQAIDGAELSKGQLKTKASHSTSQLSQKLKTTYKASTSKIVSSFKTTTSRAACQLVKEYIGHRDGIWDVSVAKTQPVVLGTASADHTALLWSIETGKCLVKYAGHVGSVNSIKFHPSEQLALTASGDQTAHIWRYAVQLPTPQPVADTSISGEDEVECSDKDEPDLDGDVSSDCPTIRVPLTSLKSHQGVVIASDWLVGGKQAVTASWDRTANLYDVETSELVHSLTGHDQELTHCCTHPTQRLVVTSSRDTTFRLWDFRDPSIHSVNVFQGHTDTVTSAVFTVGDNVVSGSDDRTVKVWDLKNMRSPIATIRTDSAINRINVCVGQKIIALPHDNRQVRLFDMSGVRLARLPRSSRQGHRRMVCCSAWSEDHPVCNLFTCGFDRQAIGWNINIPALLQEK,494,NP_054742.2.csv,refseq-WDR37-NM_014023.3_clinical_seed_0_final,refseq-WDR37-NM_014023.3.a2m,Invitae,refseq-WDR37-NM_014023.3.npy,1,494,494
+NP_054745.1,MADAAPQLGKRKRELDVEEAHAASTEEKEAGVGNGTCAPVRLPFSGFRLQKVLRESARDKIIFLHGKVNEASGDGDGEDAVVILEKTPFQVEQVAQLLTGSPELQLQFSNDIYSTYHLFPPRQLNDVKTTVVYPATEKHLQKYLRQDLRLIRETGDDYRNITLPHLESQSLSIQWVYNILDKKAEADRIVFENPDPSDGFVLIPDLKWNQQQLDDLYLIAICHRRGIRSLRDLTPEHLPLLRNILHQGQEAILQRYRMKGDHLRVYLHYLPSYYHLHVHFTALGFEAPGSGVERAHLLAEVIENLECDPRHYQQRTLTFALRADDPLLKLLQEAQQS,337,NP_054745.1.csv,refseq-DCPS-NM_014026.4_clinical_seed_0_final,refseq-DCPS-NM_014026.4.a2m,Invitae,refseq-DCPS-NM_014026.4.npy,1,337,337
+NP_054768.2,MSGCGLFLRTTAAARACRGLVVSTANRRLLRTSPPVRAFAKELFLGKIKKKEVFPFPEVSQDELNEINQFLGPVEKFFTEEVDSRKIDQEGKIPDETLEKLKSLGLFGLQVPEEYGGLGFSNTMYSRLGEIISMDGSITVTLAAHQAIGLKGIILAGTEEQKAKYLPKLASGEHIAAFCLTEPASGSDAASIRSRATLSEDKKHYILNGSKVWITNGGLANIFTVFAKTEVVDSDGSVKDKITAFIVERDFGGVTNGKPEDKLGIRGSNTCEVHFENTKIPVENILGEVGDGFKVAMNILNSGRFSMGSVVAGLLKRLIEMTAEYACTRKQFNKRLSEFGLIQEKFALMAQKAYVMESMTYLTAGMLDQPGFPDCSIEAAMVKVFSSEAAWQCVSEALQILGGLGYTRDYPYERILRDTRILLIFEGTNEILRMYIALTGLQHAGRILTTRIHELKQAKVSTVMDTVGRRLRDSLGRTVDLGLTGNHGVVHPSLADSANKFEENTYCFGRTVETLLLRFGKTIMEEQLVLKRVANILINLYGMTAVLSRASRSIRIGLRNHDHEVLLANTFCVEAYLQNLFSLSQLDKYAPENLDEQIKKVSQQILEKRAYICAHPLDRTC,621,NP_054768.2.csv,refseq-ACAD9-NM_014049.4_clinical_seed_0_final,refseq-ACAD9-NM_014049.4.a2m,Invitae,refseq-ACAD9-NM_014049.4.npy,1,621,621
+NP_054772.1,MARPDDEEGAAVAPGHPLAKGYLPLPRGAPVGKESVELQNGPKAGTFPVNGAPRDSLAAASGVLGGPQTPLAPEEETQARLLPAGAGAETPGAESSPLPLTALSPRRFVVLLIFSLYSLVNAFQWIQYSIISNVFEGFYGVTLLHIDWLSMVYMLAYVPLIFPATWLLDTRGLRLTALLGSGLNCLGAWIKCGSVQQHLFWVTMLGQCLCSVAQVFILGLPSRIASVWFGPKEVSTACATAVLGNQLGTAVGFLLPPVLVPNTQNDTNLLACNISTMFYGTSAVATLLFILTAIAFKEKPRYPPSQAQAALQDSPPEEYSYKKSIRNLFKNIPFVLLLITYGIMTGAFYSVSTLLNQMILTYYEGEEVNAGRIGLTLVVAGMVGSILCGLWLDYTKTYKQTTLIVYILSFIGMVIFTFTLDLRYIIIVFVTGGVLGFFMTGYLPLGFEFAVEITYPESEGTSSGLLNASAQIFGILFTLAQGKLTSDYGPKAGNIFLCVWMFIGIILTALIKSDLRRHNINIGITNVDVKAIPADSPTDQEPKTVMLSKQSESAI,555,NP_054772.1.csv,refseq-FLVCR1-NM_014053.3_clinical_seed_0_final,refseq-FLVCR1-NM_014053.3.a2m,Invitae,refseq-FLVCR1-NM_014053.3.npy,1,555,555
+NP_054774.2,MSDQIKFIMDSLNKEPFRKNYNLITFDSLEPMQLLQVLSDVLAEIDPKQLVDIREEMPEQTAKRMLSLLGILKYKPSGNATDMSTFRQGLVIGSKPVIYPVLHWLLQRTNELKKRAYLARFLIKLEVPSEFLQDETVADTNKQYEELMEAFKTLHKEYEQLKISGFSTAEIRKDISAMEEEKDQLIKRVEHLKKRVETAQNHQWMLKIARQLRVEKEREEYLAQQKQEQKNQLFHAVQRLQRVQNQLKSMRQAAADAKPESLMKRLEEEIKFNLYMVTEKFPKELENKKKELHFLQKVVSEPAMGHSDLLELESKINEINTEINQLIEKKMMRNEPIEGKLSLYRQQASIISRKKEAKAEELQEAKEKLASLEREASVKRNQTREFDGTEVLKGDEFKRYVNKLRSKSTVFKKKHQIIAELKAEFGLLQRTEELLKQRHENIQQQLQTMEEKKGISGYSYTQEELERVSALKSEVDEMKGRTLDDMSEMVKKLYSLVSEKKSALASVIKELRQLRQKYQELTQECDEKKSQYDSCAAGLESNRSKLEQEVRRLREECLQEESRYHYTNCMIKNLEVQLRRATDEMKAYISSDQQEKRKAIREQYTKNTAEQENLGKKLREKQKVIRESHGPNMKQAKMWRDLEQLMECKKQCFLKQQSQTSIGQVIQEGGEDRLIL,676,NP_054774.2.csv,refseq-IFT81-NM_014055.3_clinical_seed_0_final,refseq-IFT81-NM_014055.3.a2m,Invitae,refseq-IFT81-NM_014055.3.npy,1,676,676
+NP_054799.4,MLRARPEALMLLGALLTGSLGPSGNQDALSLPWEVQRYDGWFNNLRHHERGAVGCRLQRRVPANYADGVYQALEEPQLPNPRRLSNAATRGIAGLPSLHNRTVLGVFFGYHVLSDVVSVETPGCPAEFLNIRIPPGDPVFDPDQRGDVVLPFQRSRWDPETGRSPSNPRDLANQVTGWLDGSAIYGSSHSWSDALRSFSGGQLASGPDPAFPRDSQNPLLMWAAPDPATGQNGPRGLYAFGAERGNREPFLQALGLLWFRYHNLWAQRLARQHPDWEDEELFQHARKRVIATYQNIAVYEWLPSFLQKTLPEYTGYRPFLDPSISPEFVVASEQFFSTMVPPGVYMRNASCHFRKVLNKGFQSSQALRVCNNYWIRENPNLNSTQEVNELLLGMASQISELEDNIVVEDLRDYWPGPGKFSRTDYVASSIQRGRDMGLPSYSQALLAFGLDIPRNWSDLNPNVDPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSAIVLDQFVRLRDGDRYWFENTRNGLFSKKEIEDIRNTTLRDVLVAVINIDPSALQPNVFVWHKGAPCPQPKQLTTDGLPQCAPLTVLDFFEGSSPGFAITIIALCCLPLVSLLLSGVVAYFRGREHKKLQKKLKESVKKEAAKDGVPAMEWPGPKERSSPIIIQLLSDRCLQVLNRHLTVLRVVQLQPLQQVNLILSNNRGCRTLLLKIPKEYDLVLLFSSEEERGAFVQQLWDFCVRWALGLHVAEMSEKELFRKAVTKQQRERILEIFFRHLFAQVLDINQADAGTLPLDSSQKVREALTCELSRAEFAESLGLKPQDMFVESMFSLADKDGNGYLSFREFLDILVVFMKGSPEDKSRLMFTMYDLDENGFLSKDEFFTMMRSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSELRFTQLCVKGGGGGGNGIRDIFKQNISCRVSFITRTPGERSHPQGLGPPAPEAPELGGPGLKKRFGKKAAVPTPRLYTEALQEKMQRGFLAQKLQQYKRFVENYRRHIVCVAIFSAICVGVFADRAYYYGFASPPSDIAQTTLVGIILSRGTAASVSFMFSYILLTMCRNLITFLRETFLNRYVPFDAAVDFHRWIAMAAVVLAILHSAGHAVNVYIFSVSPLSLLACIFPNVFVNDGSKLPQKFYWWFFQTVPGMTGVLLLLVLAIMYVFASHHFRRRSFRGFWLTHHLYILLYALLIIHGSYALIQLPTFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPSGVTYLQFQRPQGFEYKSGQWVRIACLALGTTEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSSPKGNGCAGYPKLYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSLGSQMLCKKIYFIWVTRTQRQFEWLADIIQEVEENDHQDLVSVHIYVTQLAEKFDLRTTMLYICERHFQKVLNRSLFTGLRSITHFGRPPFEPFFNSLQEVHPQVRKIGVFSCGPPGMTKNVEKACQLVNRQDRAHFMHHYENF,1548,NP_054799.4.csv,refseq-DUOX2-NM_014080.4_clinical_seed_0_final,refseq-DUOX2-NM_014080.4.a2m,Invitae,refseq-DUOX2-NM_014080.4.npy,1,1548,1548
+NP_054831.2,MPYEVNAGYDFTNMVRKKNPPLRNVASEGEGQILEPIGTESKVSGKNKEFSADQMSENTDQSDAAELNHKEEHSLHVQDPSSSSKKDLKSAVLSEKAGFNYESPSKGGNFPSFPHDEVTDRNMLAFSSPAAGGVCEPLKSPQRAEADDPQDMACTPSGDSLETKEDQKMSPKATEETGQAQSGQANCQGLSPVSVASKNPQVPSDGGVRLNKSKTDLLVNDNPDPAPLSPELQDFKCNICGYGYYGNDPTDLIKHFRKYHLGLHNRTRQDAELDSKILALHNMVQFSHSKDFQKVNRSVFSGVLQDINSSRPVLLNGTYDVQVTSGGTFIGIGRKTPDCQGNTKYFRCKFCNFTYMGNSSTELEQHFLQTHPNKIKASLPSSEVAKPSEKNSNKSIPALQSSDSGDLGKWQDKITVKAGDDTPVGYSVPIKPLDSSRQNGTEATSYYWCKFCSFSCESSSSLKLLEHYGKQHGAVQSGGLNPELNDKLSRGSVINQNDLAKSSEGETMTKTDKSSSGAKKKDFSSKGAEDNMVTSYNCQFCDFRYSKSHGPDVIVVGPLLRHYQQLHNIHKCTIKHCPFCPRGLCSPEKHLGEITYPFACRKSNCSHCALLLLHLSPGAAGSSRVKHQCHQCSFTTPDVDVLLFHYESVHESQASDVKQEANHLQGSDGQQSVKESKEHSCTKCDFITQVEEEISRHYRRAHSCYKCRQCSFTAADTQSLLEHFNTVHCQEQDITTANGEEDGHAISTIKEEPKIDFRVYNLLTPDSKMGEPVSESVVKREKLEEKDGLKEKVWTESSSDDLRNVTWRGADILRGSPSYTQASLGLLTPVSGTQEQTKTLRDSPNVEAAHLARPIYGLAVETKGFLQGAPAGGEKSGALPQQYPASGENKSKDESQSLLRRRRGSGVFCANCLTTKTSLWRKNANGGYVCNACGLYQKLHSTPRPLNIIKQNNGEQIIRRRTRKRLNPEALQAEQLNKQQRGSNEEQVNGSPLERRSEDHLTESHQREIPLPSLSKYEAQGSLTKSHSAQQPVLVSQTLDIHKRMQPLHIQIKSPQESTGDPGNSSSVSEGKGSSERGSPIEKYMRPAKHPNYSPPGSPIEKYQYPLFGLPFVHNDFQSEADWLRFWSKYKLSVPGNPHYLSHVPGLPNPCQNYVPYPTFNLPPHFSAVGSDNDIPLDLAIKHSRPGPTANGASKEKTKAPPNVKNEGPLNVVKTEKVDRSTQDELSTKCVHCGIVFLDEVMYALHMSCHGDSGPFQCSICQHLCTDKYDFTTHIQRGLHRNNAQVEKNGKPKE,1294,NP_054831.2.csv,NP_054831.2_colabfold_clinical_seed_0_final,NP_054831.2_colabfold.a2m,colabfold,NP_054831.2_colabfold_theta_0.2.npy,1,1294,1294
+NP_054858.2,MDDRCYPVIFPDERNFRPFTSDSLAAIEKRIAIQKEKKKSKDQTGEVPQPRPQLDLKASRKLPKLYGDIPRELIGKPLEDLDPFYRNHKTFMVLNRKRTIYRFSAKHALFIFGPFNSIRSLAIRVSVHSLFSMFIIGTVIINCVFMATGPAKNSNSNNTDIAECVFTGIYIFEALIKILARGFILDEFSFLRDPWNWLDSIVIGIAIVSYIPGITIKLLPLRTFRVFRALKAISVVSRLKVIVGALLRSVKKLVNVIILTFFCLSIFALVGQQLFMGSLNLKCISRDCKNISNPEAYDHCFEKKENSPEFKMCGIWMGNSACSIQYECKHTKINPDYNYTNFDNFGWSFLAMFRLMTQDSWEKLYQQTLRTTGLYSVFFFIVVIFLGSFYLINLTLAVVTMAYEEQNKNVAAEIEAKEKMFQEAQQLLKEEKEALVAMGIDRSSLTSLETSYFTPKKRKLFGNKKRKSFFLRESGKDQPPGSDSDEDCQKKPQLLEQTKRLSQNLSLDHFDEHGDPLQRQRALSAVSILTITMKEQEKSQEPCLPCGENLASKYLVWNCCPQWLCVKKVLRTVMTDPFTELAITICIIINTVFLAMEHHKMEASFEKMLNIGNLVFTSIFIAEMCLKIIALDPYHYFRRGWNIFDSIVALLSFADVMNCVLQKRSWPFLRSFRVLRVFKLAKSWPTLNTLIKIIGNSVGALGSLTVVLVIVIFIFSVVGMQLFGRSFNSQKSPKLCNPTGPTVSCLRHWHMGDFWHSFLVVFRILCGEWIENMWECMQEANASSSLCVIVFILITVIGKLVVLNLFIALLLNSFSNEERNGNLEGEARKTKVQLALDRFRRAFCFVRHTLEHFCHKWCRKQNLPQQKEVAGGCAAQSKDIIPLVMEMKRGSETQEELGILTSVPKTLGVRHDWTWLAPLAEEEDDVEFSGEDNAQRITQPEPEQQAYELHQENKKPTSQRVQSVEIDMFSEDEPHLTIQDPRKKSDVTSILSECSTIDLQDGFGWLPEMVPKKQPERCLPKGFGCCFPCCSVDKRKPPWVIWWNLRKTCYQIVKHSWFESFIIFVILLSSGALIFEDVHLENQPKIQELLNCTDIIFTHIFILEMVLKWVAFGFGKYFTSAWCCLDFIIVIVSVTTLINLMELKSFRTLRALRPLRALSQFEGMKVVVNALIGAIPAILNVLLVCLIFWLVFCILGVYFFSGKFGKCINGTDSVINYTIITNKSQCESGNFSWINQKVNFDNVGNAYLALLQVATFKGWMDIIYAAVDSTEKEQQPEFESNSLGYIYFVVFIIFGSFFTLNLFIGVIIDNFNQQQKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKCQGLVFDIVTSQIFDIIIISLIILNMISMMAESYNQPKAMKSILDHLNWVFVVIFTLECLIKIFALRQYYFTNGWNLFDCVVVLLSIVSTMISTLENQEHIPFPPTLFRIVRLARIGRILRLVRAARGIRTLLFALMMSLPSLFNIGLLLFLIMFIYAILGMNWFSKVNPESGIDDIFNFKTFASSMLCLFQISTSAGWDSLLSPMLRSKESCNSSSENCHLPGIATSYFVSYIIISFLIVVNMYIAVILENFNTATEESEDPLGEDDFDIFYEVWEKFDPEATQFIKYSALSDFADALPEPLRVAKPNKYQFLVMDLPMVSEDRLHCMDILFAFTARVLGGSDGLDSMKAMMEEKFMEANPLKKLYEPIVTTTKRKEEERGAAIIQKAFRKYMMKVTKGDQGDQNDLENGPHSPLQTLCNGDLSSFGVAKGKVHCD,1791,NP_054858.2.csv,refseq-SCN11A-NM_014139.2_clinical_seed_0_final,refseq-SCN11A-NM_014139.2.a2m,Invitae,refseq-SCN11A-NM_014139.2.npy,1,1791,1791
+NP_054859.2,MSLPLTEEQRKKIEENRQKALARRAEKLLAEQHQRTSSGTSIAGNPFQAKQGPSQNFPRESCKPVSHGVIFKQQNLSSSSNADQRPHDSHSFQAKGIWKKPEEMPTACPGHSPRSQMALTGISPPLAQSPPEVPKQQLLSYELGQGHAQASPEIRFTPFANPTHKPLAKPKSSQETPAHSSGQPPRDAKLEAKTAKASPSGQNISYIHSSSESVTPRTEGRLQQKSGSSVQKGVNSQKGKCVRNGDRFQVLIGYNAELIAVFKTLPSKNYDPDTKTWNFSMNDYSALMKAAQSLPTVNLQPLEWAYGSSESPSTSSEGQAGLPSAPSLSFVKGRCMLISRAYFEADISYSQDLIALFKQMDSRRYDVKTRKWSFLLEEHSKLIAKVRCLPQVQLDPLPTTLTLAFASQLKKTSLSLTPDVPEADLSEVDPKLVSNLMPFQRAGVNFAIAKGGRLLLADDMGLGKTIQAICIAAFYRKEWPLLVVVPSSVRFTWEQAFLRWLPSLSPDCINVVVTGKDRLTAGLINIVSFDLLSKLEKQLKTPFKVVIIDESHFLKNSRTARCRAAMPVLKVAKRVILLSGTPAMSRPAELYTQIIAVKPTFFPQFHAFGLRYCDAKRMPWGWDYSGSSNLGELKLLLEEAVMLRRLKSDVLSQLPAKQRKIVVIAPGRINARTRAALDAAAKEMTTKDKTKQQQKDALILFFNRTAEAKIPSVIEYILDLLESGREKFLVFAHHKVVLDAITQELERKHVQHIRIDGSTSSAEREDLCQQFQLSERHAVAVLSITAANMGLTFSSADLVVFAELFWNPGVLIQAEDRVHRIGQTSSVGIHYLVAKGTADDYLWPLIQEKIKVLAEAGLSETNFSEMTESTDYLYKDPKQQKIYDLFQKSFEKEGSDMELLEAAESFDPGSASGTSGSSSQNMGDTLDESSLTASPQKKRRFEFFDNWDSFTSPL,954,NP_054859.2.csv,refseq-SMARCAL1-NM_014140.3_clinical_seed_0_final,refseq-SMARCAL1-NM_014140.3.a2m,Invitae,refseq-SMARCAL1-NM_014140.3.npy,1,954,954
+NP_055006.1,MAARLLAPPGPDSFKPFTPESLANIERRIAESKLKKPPKADGSHREDDEDSKPKPNSDLEAGKSLPFIYGDIPQGLVAVPLEDFDPYYLTQKTFVVLNRGKTLFRFSATPALYILSPFNLIRRIAIKILIHSVFSMIIMCTILTNCVFMTFSNPPDWSKNVEYTFTGIYTFESLVKIIARGFCIDGFTFLRDPWNWLDFSVIMMAYITEFVNLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCVVWPINFNESYLENGTKGFDWEEYINNKTNFYTVPGMLEPLLCGNSSDAGQCPEGYQCMKAGRNPNYGYTSFDTFSWAFLALFRLMTQDYWENLYQLTLRAAGKTYMIFFVLVIFVGSFYLVNLILAVVAMAYEEQNQATLEEAEQKEAEFKAMLEQLKKQQEEAQAAAMATSAGTVSEDAIEEEGEEGGGSPRSSSEISKLSSKSAKERRNRRKKRKQKELSEGEEKGDPEKVFKSESEDGMRRKAFRLPDNRIGRKFSIMNQSLLSIPGSPFLSRHNSKSSIFSFRGPGRFRDPGSENEFADDEHSTVEESEGRRDSLFIPIRARERRSSYSGYSGYSQGSRSSRIFPSLRRSVKRNSTVDCNGVVSLIGGPGSHIGGRLLPEATTEVEIKKKGPGSLLVSMDQLASYGRKDRINSIMSVVTNTLVEELEESQRKCPPCWYKFANTFLIWECHPYWIKLKEIVNLIVMDPFVDLAITICIVLNTLFMAMEHHPMTPQFEHVLAVGNLVFTGIFTAEMFLKLIAMDPYYYFQEGWNIFDGFIVSLSLMELSLADVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINQDCELPRWHMHDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLIVFMMVMVIGNLVVLNLFLALLLSSFSADNLAATDDDGEMNNLQISVIRIKKGVAWTKLKVHAFMQAHFKQREADEVKPLDELYEKKANCIANHTGADIHRNGDFQKNGNGTTSGIGSSVEKYIIDEDHMSFINNPNLTVRVPIAVGESDFENLNTEDVSSESDPEGSKDKLDDTSSSEGSTIDIKPEVEEVPVEQPEEYLDPDACFTEGCVQRFKCCQVNIEEGLGKSWWILRKTCFLIVEHNWFETFIIFMILLSSGALAFEDIYIEQRKTIRTILEYADKVFTYIFILEMLLKWTAYGFVKFFTNAWCWLDFLIVAVSLVSLIANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKYHYCFNETSEIRFEIEDVNNKTECEKLMEGNNTEIRWKNVKINFDNVGAGYLALLQVATFKGWMDIMYAAVDSRKPDEQPKYEDNIYMYIYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKIQGIVFDFVTQQAFDIVIMMLICLNMVTMMVETDTQSKQMENILYWINLVFVIFFTCECVLKMFALRHYYFTIGWNIFDFVVVILSIVGMFLADIIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIFSIFGMSNFAYVKHEAGIDDMFNFETFGNSMICLFQITTSAGWDGLLLPILNRPPDCSLDKEHPGSGFKGDCGNPSVGIFFFVSYIIISFLIVVNMYIAIILENFSVATEESADPLSEDDFETFYEIWEKFDPDATQFIEYCKLADFADALEHPLRVPKPNTIELIAMDLPMVSGDRIHCLDILFAFTKRVLGDSGELDILRQQMEERFVASNPSKVSYEPITTTLRRKQEEVSAVVLQRAYRGHLARRGFICKKTTSNKLENGGTHREKKESTPSTASLPSYDSVTKPEKEKQQRAEEGRRERAKRQKEVRESKC,1980,NP_055006.1.csv,refseq-SCN8A-NM_014191.3_clinical_seed_0_final,refseq-SCN8A-NM_014191.3.a2m,Invitae,refseq-SCN8A-NM_014191.3.npy,1,1980,1980
+NP_055023.2,MKIITYFCIWAVAWAIPVPQSKPLERHVEKSMNLHLLARSNVSVQDELNASGTIKESGVLVHEGDRGRQENTQDGHKGEGNGSKWAEVGGKSFSTYSTLANEEGNIEGWNGDTGKAETYGHDGIHGKEENITANGIQGQVSIIDNAGATNRSNTNGNTDKNTQNGDVGDAGHNEDVAVVQEDGPQVAGSNNSTDNEDEIIENSCRNEGNTSEITPQINSKRNGTKEAEVTPGTGEDAGLDNSDGSPSGNGADEDEDEGSGDDEDEEAGNGKDSSNNSKGQEGQDHGKEDDHDSSIGQNSDSKEYYDPEGKEDPHNEVDGDKTSKSEENSAGIPEDNGSQRIEDTQKLNHRESKRVENRITKESETHAVGKSQDKGIEIKGPSSGNRNITKEVGKGNEGKEDKGQHGMILGKGNVKTQGEVVNIEGPGQKSEPGNKVGHSNTGSDSNSDGYDSYDFDDKSMQGDDPNSSDESNGNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSKGAEDDDSDSTSDTNNSDSNGNGNNGNDDNDKSDSGKGKSDSSDSDSSDSSNSSDSSDSSDSDSSDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSDSSNSSNSSDSSDSSDSSDSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSDSSDGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSSDSSDSSNSSDSDSSDSSNSSDSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDSSDSSDSSDSTSDSNDESDSQSKSGNGNNNGSDSDSDSEGSDSNHSTSDD,1301,NP_055023.2.csv,refseq-DSPP-NM_014208.3_clinical_seed_0_final,refseq-DSPP-NM_014208.3.a2m,Invitae,refseq-DSPP-NM_014208.3.npy,1,1301,1301
+NP_055040.2,MAAADGDDSLYPIAVLIDELRNEDVQLRLNSIKKLSTIALALGVERTRSELLPFLTDTIYDEDEVLLALAEQLGTFTTLVGGPEYVHCLLPPLESLATVEETVVRDKAVESLRAISHEHSPSDLEAHFVPLVKRLAGGDWFTSRTSACGLFSVCYPRVSSAVKAELRQYFRNLCSDDTPMVRRAAASKLGEFAKVLELDNVKSEIIPMFSNLASDEQDSVRLLAVEACVNIAQLLPQEDLEALVMPTLRQAAEDKSWRVRYMVADKFTELQKAVGPEITKTDLVPAFQNLMKDCEAEVRAAASHKVKEFCENLSADCRENVIMSQILPCIKELVSDANQHVKSALASVIMGLSPILGKDNTIEHLLPLFLAQLKDECPEVRLNIISNLDCVNEVIGIRQLSQSLLPAIVELAEDAKWRVRLAIIEYMPLLAGQLGVEFFDEKLNSLCMAWLVDHVYAIREAATSNLKKLVEKFGKEWAHATIIPKVLAMSGDPNYLHRMTTLFCINVLSEVCGQDITTKHMLPTVLRMAGDPVANVRFNVAKSLQKIGPILDNSTLQSEVKPILEKLTQDQDVDVKYFAQEALTVLSLA,589,NP_055040.2.csv,refseq-PPP2R1A-NM_014225.5_clinical_seed_0_final,refseq-PPP2R1A-NM_014225.5.a2m,Invitae,refseq-PPP2R1A-NM_014225.5.npy,1,589,589
+NP_055051.1,MESSSSSNSYFSVGPTSPSAVVLLYSKELKKWDEFEDILEERRHVSDLKFAMKCYTPLVYKGITPCKPIDIKCSVLNSEEIHYVIKQLSKESLQSVDVLREEVSEILDEMSHKLRLGAIRFCAFTLSKVFKQIFSKVCVNEEGIQKLQRAIQEHPVVLLPSHRSYIDFLMLSFLLYNYDLPVPVIAAGMDFLGMKMVGELLRMSGAFFMRRTFGGNKLYWAVFSEYVKTMLRNGYAPVEFFLEGTRSRSAKTLTPKFGLLNIVMEPFFKREVFDTYLVPISISYDKILEETLYVYELLGVPKPKESTTGLLKARKILSENFGSIHVYFGDPVSLRSLAAGRMSRSSYNLVPRYIPQKQSEDMHAFVTEVAYKMELLQIENMVLSPWTLIVAVLLQNRPSMDFDALVEKTLWLKGLTQAFGGFLIWPDNKPAEEVVPASILLHSNIASLVKDQVILKVDSGDSEVVDGLMLQHITLLMCSAYRNQLLNIFVRPSLVAVALQMTPGFRKEDVYSCFRFLRDVFADEFIFLPGNTLKDFEEGCYLLCKSEAIQVTTKDILVTEKGNTVLEFLVGLFKPFVESYQIICKYLLSEEEDHFSEEQYLAAVRKFTSQLLDQGTSQCYDVLSSDVQKNALAACVRLGVVEKKKINNNCIFNVNEPATTKLEEMLGCKTPIGKPATAKL,680,NP_055051.1.csv,refseq-GNPAT-NM_014236.3_clinical_seed_0_final,refseq-GNPAT-NM_014236.3.a2m,Invitae,refseq-GNPAT-NM_014236.3.npy,1,680,680
+NP_055054.1,MPGSAAKGSELSERIESFVETLKRGGGPRSSEEMARETLGLLRQIITDHRWSNAGELMELIRREGRRMTAAQPSETTVGNMVRRVLKIIREEYGRLHGRSDESDQQESLHKLLTSGGLNEDFSFHYAQLQSNIIEAINELLVELEGTMENIAAQALEHIHSNEVIMTIGFSRTVEAFLKEAARKRKFHVIVAECAPFCQGHEMAVNLSKAGIETTVMTDAAIFAVMSRVNKVIIGTKTILANGALRAVTGTHTLALAAKHHSTPLIVCAPMFKLSPQFPNEEDSFHKFVAPEEVLPFTEGDILEKVSVHCPVFDYVPPELITLFISNIGGNAPSYIYRLMSELYHPDDHVL,351,NP_055054.1.csv,refseq-EIF2B2-NM_014239.3_clinical_seed_0_final,refseq-EIF2B2-NM_014239.3.a2m,Invitae,refseq-EIF2B2-NM_014239.3.npy,1,351,351
+NP_055059.2,MDPPAGAARRLLCPALLLLLLLLPPPLLPPPPPPANARLAAAADPPGGPLGHGAERILAVPVRTDAQGRLVSHVVSAATSRAGVRARRAAPVRTPSFPGGNEEEPGSHLFYNVTVFGRDLHLRLRPNARLVAPGATMEWQGEKGTTRVEPLLGSCLYVGDVAGLAEASSVALSNCDGLAGLIRMEEEEFFIEPLEKGLAAQEAEQGRVHVVYRRPPTSPPLGGPQALDTGASLDSLDSLSRALGVLEEHANSSRRRARRHAADDDYNIEVLLGVDDSVVQFHGKEHVQKYLLTLMNIVNEIYHDESLGAHINVVLVRIILLSYGKSMSLIEIGNPSQSLENVCRWAYLQQKPDTGHDEYHDHAIFLTRQDFGPSGMQGYAPVTGMCHPVRSCTLNHEDGFSSAFVVAHETGHVLGMEHDGQGNRCGDEVRLGSIMAPLVQAAFHRFHWSRCSQQELSRYLHSYDCLLDDPFAHDWPALPQLPGLHYSMNEQCRFDFGLGYMMCTAFRTFDPCKQLWCSHPDNPYFCKTKKGPPLDGTMCAPGKHCFKGHCIWLTPDILKRDGSWGAWSPFGSCSRTCGTGVKFRTRQCDNPHPANGGRTCSGLAYDFQLCSRQDCPDSLADFREEQCRQWDLYFEHGDAQHHWLPHEHRDAKERCHLYCESRETGEVVSMKRMVHDGTRCSYKDAFSLCVRGDCRKVGCDGVIGSSKQEDKCGVCGGDNSHCKVVKGTFTRSPKKHGYIKMFEIPAGARHLLIQEVDATSHHLAVKNLETGKFILNEENDVDASSKTFIAMGVEWEYRDEDGRETLQTMGPLHGTITVLVIPVGDTRVSLTYKYMIHEDSLNVDDNNVLEEDSVVYEWALKKWSPCSKPCGGGSQFTKYGCRRRLDHKMVHRGFCAALSKPKAIRRACNPQECSQPVWVTGEWEPCSQTCGRTGMQVRSVRCIQPLHDNTTRSVHAKHCNDARPESRRACSRELCPGRWRAGPWSQCSVTCGNGTQERPVLCRTADDSFGICQEERPETARTCRLGPCPRNISDPSKKSYVVQWLSRPDPDSPIRKISSKGHCQGDKSIFCRMEVLSRYCSIPGYNKLCCKSCNLYNNLTNVEGRIEPPPGKHNDIDVFMPTLPVPTVAMEVRPSPSTPLEVPLNASSTNATEDHPETNAVDEPYKIHGLEDEVQPPNLIPRRPSPYEKTRNQRIQELIDEMRKKEMLGKF,1211,NP_055059.2.csv,refseq-ADAMTS2-NM_014244.4_clinical_seed_0_final,refseq-ADAMTS2-NM_014244.4.a2m,Invitae,refseq-ADAMTS2-NM_014244.4.npy,1,1211,1211
+NP_055064.1,METRPTALMSSTVAAAAPAAGAASRKESPGRWGLGEDPTGVSPSLQCRVCGDSSSGKHYGIYACNGCSGFFKRSVRRRLIYRCQVGAGMCPVDKAHRNQCQACRLKKCLQAGMNQDAVQNERQPRSTAQVHLDSMESNTESRPESLVAPPAPAGRSPRGPTPMSAARALGHHFMASLITAETCAKLEPEDADENIDVTSNDPEFPSSPYSSSSPCGLDSIHETSARLLFMAVKWAKNLPVFSSLPFRDQVILLEEAWSELFLLGAIQWSLPLDSCPLLAPPEASAAGGAQGRLTLASMETRVLQETISRFRALAVDPTEFACMKALVLFKPETRGLKDPEHVEALQDQSQVMLSQHSKAHHPSQPVRFGKLLLLLPSLRFITAERIELLFFRKTIGNTPMEKLLCDMFKN,410,NP_055064.1.csv,NR2E3_HUMAN_b01_clinical_seed_0_final,NR2E3_HUMAN_b01.a2m,EVE,NR2E3_HUMAN_b01_theta_0.2.npy,1,410,410
+NP_055066.1,MAAAKVALTKRADPAELRTIFLKYASIEKNGEFFMSPNDFVTRYLNIFGESQPNPKTVELLSGVVDQTKDGLISFQEFVAFESVLCAPDALFMVAFQLFDKAGKGEVTFEDVKQVFGQTTIHQHIPFNWDSEFVQLHFGKERKRHLTYAEFTQFLLEIQLEHAKQAFVQRDNARTGRVTAIDFRDIMVTIRPHVLTPFVEECLVAAAGGTTSHQVSFSYFNGFNSLLNNMELIRKIYSTLAGTRKDVEVTKEEFVLAAQKFGQVTPMEVDILFQLADLYEPRGRMTLADIERIAPLEEGTLPFNLAEAQRQKASGDSARPVLLQVAESAYRFGLGSVAGAVGATAVYPIDLVKTRMQNQRSTGSFVGELMYKNSFDCFKKVLRYEGFFGLYRGLLPQLLGVAPEKAIKLTVNDFVRDKFMHKDGSVPLAAEILAGGCAGGSQVIFTNPLEIVKIRLQVAGEITTGPRVSALSVVRDLGFFGIYKGAKACFLRDIPFSAIYFPCYAHVKASFANEDGQVSPGSLLLAGAIAGMPAASLVTPADVIKTRLQVAARAGQTTYSGVIDCFRKILREEGPKALWKGAGARVFRSSPQFGVTLLTYELLQRWFYIDFGGVKPMGSEPVPKSRINLPAPNPDHVGGYKLAVATFAGIENKFGLYLPLFKPSVSTSKAIGGGP,675,NP_055066.1.csv,refseq-SLC25A13-NM_014251.2_clinical_seed_0_final,refseq-SLC25A13-NM_014251.2.a2m,Invitae,refseq-SLC25A13-NM_014251.2.npy,1,675,675
+NP_055067.1,MKSNPAIQAAIDLTAGAAGGTACVLTGQPFDTMKVKMQTFPDLYRGLTDCCLKTYSQVGFRGFYKGTSPALIANIAENSVLFMCYGFCQQVVRKVAGLDKQAKLSDLQNAAAGSFASAFAALVLCPTELVKCRLQTMYEMETSGKIAKSQNTVWSVIKSILRKDGPLGFYHGLSSTLLREVPGYFFFFGGYELSRSFFASGRSKDELGPVPLMLSGGVGGICLWLAVYPVDCIKSRIQVLSMSGKQAGFIRTFINVVKNEGITALYSGLKPTMIRAFPANGALFLAYEYSRKLMMNQLEAY,301,NP_055067.1.csv,refseq-SLC25A15-NM_014252.3_clinical_seed_0_final,refseq-SLC25A15-NM_014252.3.a2m,Invitae,refseq-SLC25A15-NM_014252.3.npy,1,301,301
+NP_055069.1,MRLTRKRLCSFLIALYCLFSLYAAYHVFFGRRRQAPAGSPRGLRKGAAPARERRGREQSTLESEEWNPWEGDEKNEQQHRFKTSLQILDKSTKGKTDLSVQIWGKAAIGLYLWEHIFEGLLDPSDVTAQWREGKSIVGRTQYSFITGPAVIPGYFSVDVNNVVLILNGREKAKIFYATQWLLYAQNLVQIQKLQHLAVVLLGNEHCDNEWINPFLKRNGGFVELLFIIYDSPWINDVDVFQWPLGVATYRNFPVVEASWSMLHDERPYLCNFLGTIYENSSRQALMNILKKDGNDKLCWVSAREHWQPQETNESLKNYQDALLQSDLTLCPVGVNTECYRIYEACSYGSIPVVEDVMTAGNCGNTSVHHGAPLQLLKSMGAPFIFIKNWKELPAVLEKEKTIILQEKIERRKMLLQWYQHFKTELKMKFTNILESSFLMNNKS,443,NP_055069.1.csv,refseq-RXYLT1-NM_014254.2_clinical_seed_0_final,refseq-RXYLT1-NM_014254.2.a2m,Invitae,refseq-RXYLT1-NM_014254.2.npy,1,443,443
+NP_055085.1,MGDTGLRKRREDEKSIQSQEPKTTSLQKELGLISGISIIVGTIIGSGIFVSPKSVLSNTEAVGPCLIIWAACGVLATLGALCFAELGTMITKSGGEYPYLMEAYGPIPAYLFSWASLIVIKPTSFAIICLSFSEYVCAPFYVGCKPPQIVVKCLAAAAILFISTVNSLSVRLGSYVQNIFTAAKLVIVAIIIISGLVLLAQGNTKNFDNSFEGAQLSVGAISLAFYNGLWAYDGWNQLNYITEELRNPYRNLPLAIIIGIPLVTACYILMNVSYFTVMTATELLQSQAVAVTFGDRVLYPASWIVPLFVAFSTIGAANGTCFTAGRLIYVAGREGHMLKVLSYISVRRLTPAPAIIFYGIIATIYIIPGDINSLVNYFSFAAWLFYGLTILGLIVMRFTRKELERPIKVPVVIPVLMTLISVFLVLAPIISKPTWEYLYCVLFILSGLLFYFLFVHYKFGWAQKISKPITMHLQMLMEVVPPEEDPE,487,NP_055085.1.csv,refseq-SLC7A9-NM_014270.4_clinical_seed_0_final,refseq-SLC7A9-NM_014270.4.a2m,Invitae,refseq-SLC7A9-NM_014270.4_theta_0.2.npy,1,487,487
+NP_055086.1,MKAPIPHLILLYATFTQSLKVVTKRGSADGCTDWSIDIKKYQVLVGEPVRIKCALFYGYIRTNYSLAQSAGLSLMWYKSSGPGDFEEPIAFDGSRMSKEEDSIWFRPTLLQDSGLYACVIRNSTYCMKVSISLTVGENDTGLCYNSKMKYFEKAELSKSKEISCRDIEDFLLPTREPEILWYKECRTKTWRPSIVFKRDTLLIREVREDDIGNYTCELKYGGFVVRRTTELTVTAPLTDKPPKLLYPMESKLTIQETQLGDSANLTCRAFFGYSGDVSPLIYWMKGEKFIEDLDENRVWESDIRILKEHLGEQEVSISLIVDSVEEGDLGNYSCYVENGNGRRHASVLLHKRELMYTVELAGGLGAILLLLVCLVTIYKCYKIEIMLFYRNHFGAEELDGDNKDYDAYLSYTKVDPDQWNQETGEEERFALEILPDMLEKHYGYKLFIPDRDLIPTGTYIEDVARCVDQSKRLIIVMTPNYVVRRGWSIFELETRLRNMLVTGEIKVILIECSELRGIMNYQEVEALKHTIKLLTVIKWHGPKCNKLNSKFWKRLQYEMPFKRIEPITHEQALDVSEQGPFGELQTVSAISMAAATSTALATAHPDLRSTFHNTYHSQMRQKHYYRSYEYDVPPTGTLPLTSIGNQHTYCNIPMTLINGQRPQTKSSREQNPDEAHTNSAILPLLPRETSISSVIW,696,NP_055086.1.csv,refseq-IL1RAPL1-NM_014271.3_clinical_seed_0_final,refseq-IL1RAPL1-NM_014271.3.a2m,Invitae,refseq-IL1RAPL1-NM_014271.3.npy,1,696,696
+NP_055112.2,MAEAVLRVARRQLSQRGGSGAPILLRQMFEPVSCTFTYLLGDRESREAVLIDPVLETAPRDAQLIKELGLRLLYAVNTHCHADHITGSGLLRSLLPGCQSVISRLSGAQADLHIEDGDSIRFGRFALETRASPGHTPGCVTFVLNDHSMAFTGDALLIRGCGRTDFQQGCAKTLYHSVHEKIFTLPGDCLIYPAHDYHGFTVSTVEEERTLNPRLTLSCEEFVKIMGNLNLPKPQQIDFAVPANMRCGVQTPTA,254,NP_055112.2.csv,refseq-ETHE1-NM_014297.3_clinical_seed_0_final,refseq-ETHE1-NM_014297.3.a2m,Invitae,refseq-ETHE1-NM_014297.3.npy,1,254,254
+NP_055120.1,MSAACWEEPWGLPGGFAKRVLVTGGAGFIASHMIVSLVEDYPNYMIINLDKLDYCASLKNLETISNKQNYKFIQGDICDSHFVKLLFETEKIDIVLHFAAQTHVDLSFVRAFEFTYVNVYGTHVLVSAAHEARVEKFIYVSTDEVYGGSLDKEFDESSPKQPTNPYASSKAAAECFVQSYWEQYKFPVVITRSSNVYGPHQYPEKVIPKFISLLQHNRKCCIHGSGLQTRNFLYATDVVEAFLTVLKKGKPGEIYNIGTNFEMSVVQLAKELIQLIKETNSESEMENWVDYVNDRPTNDMRYPMKSEKIHGLGWRPKVPWKEGIKKTIEWYRENFHNWKNVEKALEPFPV,350,NP_055120.1.csv,refseq-TGDS-NM_014305.3_clinical_seed_0_final,refseq-TGDS-NM_014305.3.a2m,Invitae,refseq-TGDS-NM_014305.3.npy,1,350,350
+NP_055129.2,MTTEQRRSLQAFQDYIRKTLDPTYILSYMAPWFREEEVQYIQAEKNNKGPMEAATLFLKFLLELQEEGWFRGFLDALDHAGYSGLYEAIESWDFKKIEKLEEYRLLLKRLQPEFKTRIIPTDIISDLSECLINQECEEILQICSTKGMMAGAEKLVECLLRSDKENWPKTLKLALEKERNKFSELWIVEKGIKDVETEDLEDKMETSDIQIFYQEDPECQNLSENSCPPSEVSDTNLYSPFKPRNYQLELALPAMKGKNTIICAPTGCGKTFVSLLICEHHLKKFPQGQKGKVVFFANQIPVYEQQKSVFSKYFERHGYRVTGISGATAENVPVEQIVENNDIIILTPQILVNNLKKGTIPSLSIFTLMIFDECHNTSKQHPYNMIMFNYLDQKLGGSSGPLPQVIGLTASVGVGDAKNTDEALDYICKLCASLDASVIATVKHNLEELEQVVYKPQKFFRKVESRISDKFKYIIAQLMRDTESLAKRICKDLENLSQIQNREFGTQKYEQWIVTVQKACMVFQMPDKDEESRICKALFLYTSHLRKYNDALIISEHARMKDALDYLKDFFSNVRAAGFDEIEQDLTQRFEEKLQELESVSRDPSNENPKLEDLCFILQEEYHLNPETITILFVKTRALVDALKNWIEGNPKLSFLKPGILTGRGKTNQNTGMTLPAQKCILDAFKASGDHNILIATSVADEGIDIAQCNLVILYEYVGNVIKMIQTRGRGRARGSKCFLLTSNAGVIEKEQINMYKEKMMNDSILRLQTWDEAVFREKILHIQTHEKFIRDSQEKPKPVPDKENKKLLCRKCKALACYTADVRVIEECHYTVLGDAFKECFVSRPHPKPKQFSSFEKRAKIFCARQNCSHDWGIHVKYKTFEIPVIKIESFVVEDIATGVQTLYSKWKDFHFEKIPFDPAEMSK,925,NP_055129.2.csv,refseq-DDX58-NM_014314.3_clinical_seed_0_final,refseq-DDX58-NM_014314.3.a2m,Invitae,refseq-DDX58-NM_014314.3.npy,1,925,925
+NP_055134.2,MAAAAASAPQQLSDEELFSQLRRYGLSPGPVTESTRPVYLKKLKKLREEEQQQHRSGGRGNKTRNSNNNNTAAATVAAAGPAAAAAAGMGVRPVSGDLSYLRTPGGLCRISASGPESLLGGPGGASAAPAAGSKVLLGFSSDESDVEASPRDQAGGGGRKDRASLQYRGLKAPPAPLAASEVTNSNSAERRKPHSWWGARRPAGPELQTPPGKDGAVEDEEGEGEDGEERDPETEEPLWASRTVNGSRLVPYSCRENYSDSEEEDDDDVASSRQVLKDDSLSRHRPRRTHSKPLPPLTAKSAGGRLETSVQGGGGLAMNDRAAAAGSLDRSRNLEEAAAAEQGGGCDQVDSSPVPRYRVNAKKLTPLLPPPLTDMDSTLDSSTGSLLKTNNHIGGGAFSVDSPRIYSNSLPPSAAVAASSSLRINHANHTGSNHTYLKNTYNKPKLSEPEEELLQQFKREEVSPTGSFSAHYLSMFLLTAACLFFLILGLTYLGMRGTGVSEDGELSIENPFGETFGKIQESEKTLMMNTLYKLHDRLAQLAGDHECGSSSQRTLSVQEAAAYLKDLGPEYEGIFNTSLQWILENGKDVGIRCVGFGPEEELTNITDVQFLQSTRPLMSFWCRFRRAFVTVTHRLLLLCLGVVMVCVVLRYMKYRWTKEEEETRQMYDMVVKIIDVLRSHNEACQENKDLQPYMPIPHVRDSLIQPHDRKKMKKVWDRAVDFLAANESRVRTETRRIGGADFLVWRWIQPSASCDKILVIPSKVWQGQAFHLDRRNSPPNSLTPCLKIRNMFDPVMEIGDQWHLAIQEAILEKCSDNDGIVHIAVDKNSREGCVYVKCLSPEYAGKAFKALHGSWFDGKLVTVKYLRLDRYHHRFPQALTSNTPLKPSNKHMNSMSHLRLRTGLTNSQGSS,911,NP_055134.2.csv,refseq-LEMD3-NM_014319.4_clinical_seed_0_final,refseq-LEMD3-NM_014319.4.a2m,Invitae,refseq-LEMD3-NM_014319.4.npy,1,911,911
+NP_055139.4,MALQGISVVELSGLAPGPFCAMVLADFGARVVRVDRPGSRYDVSRLGRGKRSLVLDLKQPRGAAVLRRLCKRSDVLLEPFRRGVMEKLQLGPEILQRENPRLIYARLSGFGQSGSFCRLAGHDINYLALSGVLSKIGRSGENPYAPLNLLADFAGGGLMCALGIIMALFDRTRTGKGQVIDANMVEGTAYLSSFLWKTQKLSLWEAPRGQNMLDGGAPFYTTYRTADGEFMAVGAIEPQFYELLIKGLGLKSDELPNQMSMDDWPEMKKKFADVFAEKTKAEWCQIFDGTDACVTPVLTFEEVVHHDHNKERGSFITSEEQDVSPRPAPLLLNTPAIPSFKRDPFIGEHTEEILEEFGFSREEIYQLNSDKIIESNKVKASL,382,NP_055139.4.csv,refseq-AMACR-NM_014324.5_clinical_seed_0_final,refseq-AMACR-NM_014324.5.a2m,Invitae,refseq-AMACR-NM_014324.5.npy,1,382,382
+NP_055147.1,MNMSKQPVSNVRAIQANINIPMGAFRPGAGQPPRRKECTPEVEEGVPPTSDEEKKPIPGAKKLPGPAVNLSEIQNIKSELKYVPKAEQ,88,NP_055147.1.csv,refseq-SMPX-NM_014332.2_clinical_seed_0_final,refseq-SMPX-NM_014332.2.a2m,Invitae,refseq-SMPX-NM_014332.2.npy,1,88,88
+NP_055151.3,MDAALLLNVEGVKKTILHGGTGELPNFITGSRVIFHFRTMKCDEERTVIDDSRQVGQPMHIIIGNMFKLEVWEILLTSMRVHEVAEFWCDTIHTGVYPILSRSLRQMAQGKDPTEWHVHTCGLANMFAYHTLGYEDLDELQKEPQPLVFVIELLQVDAPSDYQRETWNLSNHEKMKAVPVLHGEGNRLFKLGRYEEASSKYQEAIICLRNLQTKEKPWEVQWLKLEKMINTLILNYCQCLLKKEEYYEVLEHTSDILRHHPGIVKAYYVRARAHAEVWNEAEAKADLQKVLELEPSMQKAVRRELRLLENRMAEKQEEERLRCRNMLSQGATQPPAEPPTEPPAQSSTEPPAEPPTAPSAELSAGPPAEPATEPPPSPGHSLQH,384,NP_055151.3.csv,refseq-AIPL1-NM_014336.4_clinical_seed_0_final,refseq-AIPL1-NM_014336.4.a2m,Invitae,refseq-AIPL1-NM_014336.4.npy,1,384,384
+NP_055177.2,MGQREMWRLMSRFNAFKRTNTILHHLRMSKHTDAAEEVLLEKKGCTGVITLNRPKFLNALTLNMIRQIYPQLKKWEQDPETFLIIIKGAGGKAFCAGGDIRVISEAEKAKQKIAPVFFREEYMLNNAVGSCQKPYVALIHGITMGGGVGLSVHGQFRVATEKCLFAMPETAIGLFPDVGGGYFLPRLQGKLGYFLALTGFRLKGRDVYRAGIATHFVDSEKLAMLEEDLLALKSPSKENIASVLENYHTESKIDRDKSFILEEHMDKINSCFSANTVEEIIENLQQDGSSFALEQLKVINKMSPTSLKITLRQLMEGSSKTLQEVLTMEYRLSQACMRGHDFHEGVRAVLIDKDQSPKWKPADLKEVTEEDLNNHFKSLGSSDLKF,386,NP_055177.2.csv,refseq-HIBCH-NM_014362.3_clinical_seed_0_final,refseq-HIBCH-NM_014362.3.a2m,Invitae,refseq-HIBCH-NM_014362.3.npy,1,386,386
+NP_055180.1,MADGQMPFSCHYPSRLRRDPFRDSPLSSRLLDDGFGMDPFPDDLTASWPDWALPRLSSAWPGTLRSGMVPRGPTATARFGVPAEGRTPPPFPGEPWKVCVNVHSFKPEELMVKTKDGYVEVSGKHEEKQQEGGIVSKNFTKKIQLPAEVDPVTVFASLSPEGLLIIEAPQVPPYSTFGESSFNNELPQDSQEVTCT,196,NP_055180.1.csv,refseq-HSPB8-NM_014365.2_clinical_seed_0_final,refseq-HSPB8-NM_014365.2.a2m,Invitae,refseq-HSPB8-NM_014365.2.npy,1,196,196
+NP_055197.2,MKVARFQKIPNGENETMIPVLTSKKASELPVSEVASILQADLQNGLNKCEVSHRRAFHGWNEFDISEDEPLWKKYISQFKNPLIMLLLASAVISVLMHQFDDAVSITVAILIVVTVAFVQEYRSEKSLEELSKLVPPECHCVREGKLEHTLARDLVPGDTVCLSVGDRVPADLRLFEAVDLSIDESSLTGETTPCSKVTAPQPAATNGDLASRSNIAFMGTLVRCGKAKGVVIGTGENSEFGEVFKMMQAEEAPKTPLQKSMDLLGKQLSFYSFGIIGIIMLVGWLLGKDILEMFTISVSLAVAAIPEGLPIVVTVTLALGVMRMVKKRAIVKKLPIVETLGCCNVICSDKTGTLTKNEMTVTHIFTSDGLHAEVTGVGYNQFGEVIVDGDVVHGFYNPAVSRIVEAGCVCNDAVIRNNTLMGKPTEGALIALAMKMGLDGLQQDYIRKAEYPFSSEQKWMAVKCVHRTQQDRPEICFMKGAYEQVIKYCTTYQSKGQTLTLTQQQRDVYQQEKARMGSAGLRVLALASGPELGQLTFLGLVGIIDPPRTGVKEAVTTLIASGVSIKMITGDSQETAVAIASRLGLYSKTSQSVSGEEIDAMDVQQLSQIVPKVAVFYRASPRHKMKIIKSLQKNGSVVAMTGDGVNDAVALKAADIGVAMGQTGTDVCKEAADMILVDDDFQTIMSAIEEGKGIYNNIKNFVRFQLSTSIAALTLISLATLMNFPNPLNAMQILWINIIMDGPPAQSLGVEPVDKDVIRKPPRNWKDSILTKNLILKILVSSIIIVCGTLFVFWRELRDNVITPRDTTMTFTCFVFFDMFNALSSRSQTKSVFEIGLCSNRMFCYAVLGSIMGQLLVIYFPPLQKVFQTESLSILDLLFLLGLTSSVCIVAEIIKKVERSREKIQKHVSSTSSSFLEV,919,NP_055197.2.csv,refseq-ATP2C1-NM_014382.3_clinical_seed_0_final,refseq-ATP2C1-NM_014382.3.a2m,Invitae,refseq-ATP2C1-NM_014382.3.npy,1,919,919
+NP_055199.1,MLWSGCRRFGARLGCLPGGLRVLVQTGHRSLTSCIDPSMGLNEEQKEFQKVAFDFAAREMAPNMAEWDQKELFPVDVMRKAAQLGFGGVYIQTDVGGSGLSRLDTSVIFEALATGCTSTTAYISIHNMCAWMIDSFGNEEQRHKFCPPLCTMEKFASYCLTEPGSGSDAASLLTSAKKQGDHYILNGSKAFISGAGESDIYVVMCRTGGPGPKGISCIVVEKGTPGLSFGKKEKKVGWNSQPTRAVIFEDCAVPVANRIGSEGQGFLIAVRGLNGGRINIASCSLGAAHASVILTRDHLNVRKQFGEPLASNQYLQFTLADMATRLVAARLMVRNAAVALQEERKDAVALCSMAKLFATDECFAICNQALQMHGGYGYLKDYAVQQYVRDSRVHQILEGSNEVMRILISRSLLQE,415,NP_055199.1.csv,refseq-ACAD8-NM_014384.2_clinical_seed_0_final,refseq-ACAD8-NM_014384.2.a2m,Invitae,refseq-ACAD8-NM_014384.2.npy,1,415,415
+NP_055238.1,MNREDRNVLRMKERERRNQEIQQGEDAFPPSSPLFAEPYKVTSKEDKLSSRIQSMLGNYDEMKDFIGDRSIPKLVAIPKPTVPPSADEKSNPNFFEQRHGGSHQSSKWTPVGPAPSTSQSQKRSSGLQSGHSSQRTSAGSSSGTNSSGQRHDRESYNNSGSSSRKKGQHGSEHSKSRSSSPGKPQAVSSLNSSHSRSHGNDHHSKEHQRSKSPRDPDANWDSPSRVPFSSGQHSTQSFPPSLMSKSNSMLQKPTAYVRPMDGQESMEPKLSSEHYSSQSHGNSMTELKPSSKAHLTKLKIPSQPLDASASGDVSCVDEILKEMTHSWPPPLTAIHTPCKTEPSKFPFPTKESQQSNFGTGEQKRYNPSKTSNGHQSKSMLKDDLKLSSSEDSDGEQDCDKTMPRSTPGSNSEPSHHNSEGADNSRDDSSSHSGSESSSGSDSESESSSSDSEANEPSQSASPEPEPPPTNKWQLDNWLNKVNPHKVSPASSVDSNIPSSQGYKKEGREQGTGNSYTDTSGPKETSSATPGRDSKTIQKGSESGRGRQKSPAQSDSTTQRRTVGKKQPKKAEKAAAEEPRGGLKIESETPVDLASSMPSSRHKAATKGSRKPNIKKESKSSPRPTAEKKKYKSTSKSSQKSREIIETDTSSSDSDESESLPPSSQTPKYPESNRTPVKPSSVEEEDSFFRQRMFSPMEEKELLSPLSEPDDRYPLIVKIDLNLLTRIPGKPYKETEPPKGEKKNVPEKHTREAQKQASEKVSNKGKRKHKNEDDNRASESKKPKTEDKNSAGHKPSSNRESSKQSAAKEKDLLPSPAGPVPSKDPKTEHGSRKRTISQSSSLKSSSNSNKETSGSSKNSSSTSKQKKTEGKTSSSSKEVKEKAPSSSSNCPPSAPTLDSSKPRRTKLVFDDRNYSADHYLQEAKKLKHNADALSDRFEKAVYYLDAVVSFIECGNALEKNAQESKSPFPMYSETVDLIKYTMKLKNYLAPDATAADKRLTVLCLRCESLLYLRLFKLKKENALKYSKTLTEHLKNSYNNSQAPSPGLGSKAVGMPSPVSPKLSPGNSGNYSSGASSASASGSSVTIPQKIHQMAASYVQVTSNFLYATEIWDQAEQLSKEQKEFFAELDKVMGPLIFNASIMTDLVRYTRQGLHWLRQDAKLIS,1163,NP_055238.1.csv,refseq-AFF4-NM_014423.3_clinical_seed_0_final,refseq-AFF4-NM_014423.3.a2m,Invitae,refseq-AFF4-NM_014423.3.npy,1,1163,1163
+NP_055282.1,MASQLTQRGALFLLFFLTPAVTPTWYAGSGYYPDESYNEVYAEEVPQAPALDYRVPRWCYTLNIQDGEATCYSPKGGNYHSSLGTRCELSCDRGFRLIGRRSVQCLPSRRWSGTAYCRQMRCHALPFITSGTYTCTNGVLLDSRCDYSCSSGYHLEGDRSRICMEDGRWSGGEPVCVDIDPPKIRCPHSREKMAEPEKLTARVYWDPPLVKDSADGTITRVTLRGPEPGSHFPEGEHVIRYTAYDRAYNRASCKFIVKVQVRRCPTLKPPQHGYLTCTSAGDNYGATCEYHCDGGYDRQGTPSRVCQSSRQWSGSPPICAPMKINVNVNSAAGLLDQFYEKQRLLIISAPDPSNRYYKMQISMLQQSTCGLDLRHVTIIELVGQPPQEVGRIREQQLSANIIEELRQFQRLTRSYFNMVLIDKQGIDRDRYMEPVTPEEIFTFIDDYLLSNQELTQRREQRDICE,465,NP_055282.1.csv,refseq-SRPX2-NM_014467.3_clinical_seed_0_final,refseq-SRPX2-NM_014467.3.a2m,Invitae,refseq-SRPX2-NM_014467.3_theta_0.2.npy,1,465,465
+NP_055306.1,MMQESATETISNSSMNQNGMSTLSSQLDAGSRDGRSSGDTSSEVSTVELLHLQQQQALQAARQLLLQQQTSGLKSPKSSDKQRPLQVPVSVAMMTPQVITPQQMQQILQQQVLSPQQLQALLQQQQAVMLQQQQLQEFYKKQQEQLHLQLLQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHPGKQAKEQQQQQQQQQQLAAQQLVFQQQLLQMQQLQQQQHLLSLQRQGLISIPPGQAALPVQSLPQAGLSPAEIQQLWKEVTGVHSMEDNGIKHGGLDLTTNNSSSTTSSNTSKASPPITHHSIVNGQSSVLSARRDSSSHEETGASHTLYGHGVCKWPGCESICEDFGQFLKHLNNEHALDDRSTAQCRVQMQVVQQLEIQLSKERERLQAMMTHLHMRPSEPKPSPKPLNLVSSVTMSKNMLETSPQSLPQTPTTPTAPVTPITQGPSVITPASVPNVGAIRRRHSDKYNIPMSSEIAPNYEFYKNADVRPPFTYATLIRQAIMESSDRQLTLNEIYSWFTRTFAYFRRNAATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEVEYQKRRSQKITGSPTLVKNIPTSLGYGAALNASLQAALAESSLPLLSNPGLINNASSGLLQAVHEDLNGSLDHIDSNGNSSPGCSPQPHIHSIHVKEEPVIAEDEDCPMSLVTTANHSPELEDDREIEEEPLSEDLE,715,NP_055306.1.csv,refseq-FOXP2-NM_014491.3_clinical_seed_0_final,refseq-FOXP2-NM_014491.3.a2m,Invitae,refseq-FOXP2-NM_014491.3.npy,1,715,715
+NP_055331.1,MADKRKLQGEIDRCLKKVSEGVEQFEDIWQKLHNAANANQKEKYEADLKKEIKKLQRLRDQIKTWVASNEIKDKRQLIDNRKLIETQMERFKVVERETKTKAYSKEGLGLAQKVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKDKQDRIEGLKRHIEKHRYHVRMLETILRMLDNDSILVDAIRKIKDDVEYYVDSSQDPDFEENEFLYDDLDLEDIPQALVATSPPSHSHMEDEIFNQSSSTPTSTTSSSPIPPSPANCTTENSEDDKKRGRSTDSEVSQSPAKNGSKPVHSNQHPQSPAVPPTYPSGPPPAASALSTTPGNNGVPAPAAPPSALGPKASPAPSHNSGTPAPYAQAVAPPAPSGPSTTQPRPPSVQPSGGGGGGSGGGGSSSSSNSSAGGGAGKQNGATSYSSVVADSPAEVALSSSGGNNASSQALGPPSGPHNPPPSTSKEPSAAAPTGAGGVAPGSGNNSGGPSLLVPLPVNPPSSPTPSFSDAKAAGALLNGPPQFSTAPEIKAPEPLSSLKSMAERAAISSGIEDPVPTLHLTERDIILSSTSAPPASAQPPLQLSEVNIPLSLGVCPLGPVPLTKEQLYQQAMEEAAWHHMPHPSDSERIRQYLPRNPCPTPPYHHQMPPPHSDTVEFYQRLSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKTITDEFEQGTYIYFDYEKWGQRKKEGFTFEYRYLEDRDLQ,753,NP_055331.1.csv,NP_055331.1_clinical_seed_0_final,NP_055331.1.a2m,popEVE,NP_055331.1_theta_0.2.npy,1,753,753
+NP_055400.1,MTRAGDHNRQRGCCGSLADYLTSAKFLLYLGHSLSTWGDRMWHFAVSVFLVELYGNSLLLTAVYGLVVAGSVLVLGAIIGDWVDKNARLKVAQTSLVVQNVSVILCGIILMMVFLHKHELLTMYHGWVLTSCYILIITIANIANLASTATAITIQRDWIVVVAGEDRSKLANMNATIRRIDQLTNILAPMAVGQIMTFGSPVIGCGFISGWNLVSMCVEYVLLWKVYQKTPALAVKAGLKEEETELKQLNLHKDTEPKPLEGTHLMGVKDSNIHELEHEQEPTCASQMAEPFRTFRDGWVSYYNQPVFLAGMGLAFLYMTVLGFDCITTGYAYTQGLSGSILSILMGASAITGIMGTVAFTWLRRKCGLVRTGLISGLAQLSCLILCVISVFMPGSPLDLSVSPFEDIRSRFIQGESITPTKIPEITTEIYMSNGSNSANIVPETSPESVPIISVSLLFAGVIAARIGLWSFDLTVTQLLQENVIESERGIINGVQNSMNYLLDLLHFIMVILAPNPEAFGLLVLISVSFVAMGHIMYFRFAQNTLGNKLFACGPDAKEVRKENQANTSVV,571,NP_055400.1.csv,refseq-SLC40A1-NM_014585.5_clinical_seed_0_final,refseq-SLC40A1-NM_014585.5.a2m,Invitae,refseq-SLC40A1-NM_014585.5.npy,1,571,571
+NP_055414.2,MSDTSESGAGLTRFQAEASEKDSSSMMQTLLTVTQNVEVPETPKASKALEVSEDVKVSKASGVSKATEVSKTPEAREAPATQASSTTQLTDTQVLAAENKSLAADTKKQNADPQAVTMPATETKKVSHVADTKVNTKAQETEAAPSQAPADEPEPESAAAQSQENQDTRPKVKAKKARKVKHLDGEEDGSSDQSQASGTTGGRRVSKALMASMARRASRGPIAFWARRASRTRLAAWARRALLSLRSPKARRGKARRRAAKLQSSQEPEAPPPRDVALLQGRANDLVKYLLAKDQTKIPIKRSDMLKDIIKEYTDVYPEIIERAGYSLEKVFGIQLKEIDKNDHLYILLSTLEPTDAGILGTTKDSPKLGLLMVLLSIIFMNGNRSSEAVIWEVLRKLGLRPGIHHSLFGDVKKLITDEFVKQKYLDYARVPNSNPPEYEFFWGLRSYYETSKMKVLKFACKVQKKDPKEWAAQYREAMEADLKAAAEAAAEAKARAEIRARMGIGLGSENAAGPCNWDEADIGPWAKARIQAGAEAKAKAQESGSASTGASTSTNNSASASASTSGGFSAGASLTATLTFGLFAGLGGAGASTSGSSGACGFSYK,606,NP_055414.2.csv,refseq-MAGED2-NM_177433.2_clinical_seed_0_final,refseq-MAGED2-NM_177433.2.a2m,Invitae,refseq-MAGED2-NM_177433.2.npy,1,606,606
+NP_055440.1,MERRARSSSRESRGRGGRTPHKENKRAKAERSGGGRGRQEAGPEPSGSGRAGTPGEPRAPAATVVDVDEVRGSGEEGTEVVALLESERPEEGTKSSGLGACEWLLVLISLLFIIMTFPFSIWFCVKVVQEYERVIIFRLGHLLPGRAKGPGLFFFLPCLDTYHKVDLRLQTLEIPFHEIVTKDMFIMEIDAICYYRMENASLLLSSLAHVSKAVQFLVQTTMKRLLAHRSLTEILLERKSIAQDAKVALDSVTCIWGIKVERIEIKDVRLPAGLQHSLAVEAEAQRQAKVRMIAAEAEKAASESLRMAAEILSGTPAAVQLRYLHTLQSLSTEKPSTVVLPLPFDLLNCLSSPSNRTQGSLPFPSPSKPVEPLNPKKKDSPML,383,NP_055440.1.csv,refseq-NPHS2-NM_014625.3_clinical_seed_0_final,refseq-NPHS2-NM_014625.3.a2m,Invitae,refseq-NPHS2-NM_014625.3.npy,1,383,383
+NP_055448.1,MSRGSIEIPLRDTDEVIELDFDQLPEGDEVISILKQEHTQLHIWIALALEYYKQGKTEEFVKLLEAARIDGNLDYRDHEKDQMTCLDTLAAYYVQQARKEKNKDNKKDLITQATLLYTMADKIIMYDQNHLLGRACFCLLEGDKMDQADAQFHFVLNQSPNNIPALLGKACISFNKKDYRGALAYYKKALRTNPGCPAEVRLGMGHCFVKLNKLEKARLAFSRALELNSKCVGALVGLAVLELNNKEADSIKNGVQLLSRAYTIDPSNPMVLNHLANHFFFKKDYSKVQHLALHAFHNTEVEAMQAESCYQLARSFHVQEDYDQAFQYYYQATQFASSSFVLPFFGLGQMYIYRGDKENASQCFEKVLKAYPNNYETMKILGSLYAASEDQEKRDIAKGHLKKVTEQYPDDVEAWIELAQILEQTDIQGALSAYGTATRILQEKVQADVPPEILNNVGALHFRLGNLGEAKKYFLASLDRAKAEAEHDEHYYNAISVTTSYNLARLYEAMCEFHEAEKLYKNILREHPNYVDCYLRLGAMARDKGNFYEASDWFKEALQINQDHPDAWSLIGNLHLAKQEWGPGQKKFERILKQPSTQSDTYSMLALGNVWLQTLHQPTRDREKEKRHQDRALAIYKQVLRNDAKNLYAANGIGAVLAHKGYFREARDVFAQVREATADISDVWLNLAHIYVEQKQYISAVQMYENCLRKFYKHQNTEVVLYLARALFKCGKLQECKQTLLKARHVAPSDTVLMFNVALVLQRLATSVLKDEKSNLKEVLNAVKELELAHRYFSYLSKVGDKMRFDLALAATEARQCSDLLSQAQYHVARARKQDEEERELRAKQEQEKELLRQKLLKEQEEKRLREKEEQKKLLEQRAQYVEKTKNILMFTGETEATKEKKRGGGGGRRSKKGGEFDEFVNDDTDDDLPISKKKKRRKGSGSEQEGEDEEGGERKKKKRRRHPKGEEGSDDDETENGPKPKKRRPPKAEKKKAPKPERLPPSMKGKIKSKAIISSSDDSSDEDKLKIADEGHPRNSNSNSDSDEDEQRKKCASSESDSDENQNKSGSEAGSPRRPRRQRSDQDSDSDQPSRKRRPSGSEQSDNESVQSGRSHSGVSENDSRPASPSAESDHESERGSDNEGSGQGSGNESEPEGSNNEASDRGSEHGSDDSD,1173,NP_055448.1.csv,refseq-CTR9-NM_014633.4_clinical_seed_0_final,refseq-CTR9-NM_014633.4.a2m,Invitae,refseq-CTR9-NM_014633.4.npy,1,1173,1173
+NP_055454.1,MSSKEVKTALKSARDAIRNKEYKEALKHCKTVLKQEKNNYNAWVFIGVAAAELEQPDQAQSAYKKAAELEPDQLLAWQGLANLYEKYNHINAKDDLPGVYQKLLDLYESVDKQKWCDVCKKLVDLYYQEKKHLEVARTWHKLIKTRQEQGAENEELHQLWRKLTQFLAESTEDQNNETQQLLFTAFENALGLSDKIPSEDHQVLYRHFIQSLSKFPHESARLKKACEGMINIYPTVQYPLEVLCLHLIESGNLTDEGQQYCCRLVEMDSKSGPGLIGLGIKALQDKKYEDAVRNLTEGLKESPVCTSGWYHLAEAQVKMHRPKEAVLSCSQALKIVDNLGASGNSLYQRNLCLHLKAEALIKLSDYDSSEEAIRTLDQISDADNIPGLLVLKSLAYRNKGSFDEAAKIMEDLLSSYPDLAEVHALEALIHFTKKDYLQAEKCFQRALEKDTEVAEYHYQLGLTYWFMGEETRKDKTKALTHFLKAARLDTYMGKVFCYLGHYYRDVVGDKNRARGCYRKAFELDDTDAESGAAAVDLSVELEDMEMALAILTTVTQKASAGTAKWAWLRRGLYYLKAGQHSQAVADLQAALRADPKDFNCWESLGEAYLSRGGYTTALKSFTKASELNPESIYSVFKVAAIQQILGKYKEAVAQYQMIIKKKEDYVPALKGLGECHLMMAKAALVDYLDGKAVDYIEKALEYFTCALQHRADVSCLWKLAGDACTCLYAVAPSKVNVHVLGVLLGQKEGKQVLKKNELLHLGGRCYGRALKLMSTSNTWCDLGINYYRQAQHLAETGSNMNDLKELLEKSLHCLKKAVRLDSNNHLYWNALGVVACYSGIGNYALAQHCFIKSIQSEQINAVAWTNLGVLYLTNENIEQAHEAFKMAQSLDPSYLMCWIGQALIAEAVGSYDTMDLFRHTTELNMHTEGALGYAYWVCTTLQDKSNRETELYQYNILQMNAIPAAQVILNKYVERIQNYAPAFTMLGYLNEHLQLKKEAANAYQRAILLLQTAEDQDTYNVAIRNYGRLLCSTGEYDKAIQAFKSTPLEVLEDIIGFALALFMKGLYKESSKAYERALSIVESEQDKAHILTALAITEYKQGKTDVAKTLLFKCSILKEPTTESLQALCALGLAMQDATLSKAALNELLKHIKHKDSNYQRCLLTSAIYALQGRSVAVQKQISKAVHSNPGDPALWSLLSRVVAQYAQRNAKGGVVAGNVAHILDSNHGKKALLYTAVNQLAMGSSSAEDEKNTALKTIQKAALLSPGDPAIWAGLMAACHADDKLALVNNTQPKRIDLYLALLSAVSASIKDEKFFENYNQSLEKWSLSQAVTGLIDTGRISEAETLCTKNLKSNPDQPAVILLLRQVQCKPLLESQKPLPDAVLEELQKTVMSNSTSVPAWQWLAHVYQSQGMMRAAEMCYRKSLQLASQRGSWSGKLSSLLRLALLALKVCMANISNDHWPSLVQEATTEALKLCFCPLAVLLQALLQFKRKMGARETRRLLERVVYQPGYPKSIASTARWYLLRHLYAKDDYELIDVLVNNAKTHGDTRALELNQRLSSQ,1564,NP_055454.1.csv,refseq-TTC37-NM_014639.3_clinical_seed_0_final,refseq-TTC37-NM_014639.3.a2m,Invitae,refseq-TTC37-NM_014639.3.npy,1,1564,1564
+NP_055461.1,MNYVGQLAGQVIVTVKELYKGINQATLSGCIDVIVVQQQDGSYQCSPFHVRFGKLGVLRSKEKVIDIEINGSAVDLHMKLGDNGEAFFVEETEEEYEKLPAYLATSPIPTEDQFFKDIDTPLVKSGGDETPSQSSDISHVLETETIFTPSSVKKKKRRRKKYKQDSKKEEQAASAAAEDTCDVGVSSDDDKGAQAARGSSNASLKEEECKEPLLFHSGDHYPLSDGDWSPLETTYPQTACPKSDSELEVKPAESLLRSESHMEWTWGGFPESTKVSKRERSDHHPRTATITPSENTHFRVIPSEDNLISEVEKDASMEDTVCTIVKPKPRALGTQMSDPTSVAELLEPPLESTQISSMLDADHLPNAALAEAPSESKPAAKVDSPSKKKGVHKRSQHQGPDDIYLDDLKGLEPEVAALYFPKSESEPGSRQWPESDTLSGSQSPQSVGSAAADSGTECLSDSAMDLPDVTLSLCGGLSENGEISKEKFMEHIITYHEFAENPGLIDNPNLVIRIYNRYYNWALAAPMILSLQVFQKSLPKATVESWVKDKMPKKSGRWWFWRKRESMTKQLPESKEGKSEAPPASDLPSSSKEPAGARPAENDSSSDEGSQELEESITVDPIPTEPLSHGSTTSYKKSLRLSSDQIAKLKLHDGPNDVVFSITTQYQGTCRCAGTIYLWNWNDKIIISDIDGTITKSDALGQILPQLGKDWTHQGIAKLYHSINENGYKFLYCSARAIGMADMTRGYLHWVNDKGTILPRGPLMLSPSSLFSAFHREVIEKKPEKFKIECLNDIKNLFAPSKQPFYAAFGNRPNDVYAYTQVGVPDCRIFTVNPKGELIQERTKGNKSSYHRLSELVEHVFPLLSKEQNSAFPCPEFSSFCYWRDPIPEVDLDDLS,896,NP_055461.1.csv,refseq-LPIN2-NM_014646.2_clinical_seed_0_final,refseq-LPIN2-NM_014646.2.a2m,Invitae,refseq-LPIN2-NM_014646.2.npy,1,896,896
+NP_055472.1,MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSALQELQQYILFPLRFTLKTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSACLYSPSSQKPAAVSEELKLAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLGLAEQEKSKQIKIAALKCLQVLLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALTRLITGDFKQGHSIVVSSLKIFYKTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYREADWVKKTGDKLTILIKKIIECVSVHPHWKVRLELVELVEDLLLKCSQSLVECAGPLLKALVGLVNDESPEIQAQCNKVLRHFADQKVVVGNKALADILSESLHSLATSLPRLMNSQDDQGKFSTLSLLLGYLKLLGPKINFVLNSVAHLQRLSKALIQVLELDVADIKIVEERRWNSDDLNASPKTSATQPWNRIQRRYFRFFTDERIFMLLRQVCQLLGYYGNLYLLVDHFMELYHQSVVYRKQAAMILNELVTGAAGLEVEDLHEKHIKTNPEELREIVTSILEEYTSQENWYLVTCLETEEMGEELMMEHPGLQAITSGEHTCQVTSFLAFSKPSPTICSMNSNIWQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKAGDQTLLISQVATSTMMDVCRACGYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLEVMLRNSDANLLPLVADVVQDVLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGHLQEQSLGEEGSHLNQRPAALEKSTTTAEDIEQFLLNYLKEKDVADGNVSDFDNEEEEQSVPPKVDENDTRPDVEPPLPLQIQIAMDVMERCIHLLSDKNLQIRLKVLDVLDLCVVVLQSHKNQLLPLAHQAWPSLVHRLTRDAPLAVLRAFKVLRTLGSKCGDFLRSRFCKDVLPKLAGSLVTQAPISARAGPVYSHTLAFKLQLAVLQGLGPLCERLDLGEGDLNKVADACLIYLSVKQPVKLQEAARSVFLHLMKVDPDSTWFLLNELYCPVQFTPPHPSLHPVQLHGASGQQNPYTTNVLQLLKELQ,1089,NP_055472.1.csv,refseq-TTI1-NM_014657.2_clinical_seed_0_final,refseq-TTI1-NM_014657.2.a2m,Invitae,refseq-TTI1-NM_014657.2.npy,1,1089,1089
+NP_055484.3,MDTEGFGELLQQAEQLAAETEGISELPHVERNLQEIQQAGERLRSRTLTRTSQETADVKASVLLGSRGLDISHISQRLESLSAATTFEPLEPVKDTDIQGFLKNEKDNALLSAIEESRKRTFGMAEEYHRESMLVEWEQVKQRILHTLLASGEDALDFTQESEPSYISDVGPPGRSSLDNIEMAYARQIYIYNEKIVNGHLQPNLVDLCASVAELDDKSISDMWTMVKQMTDVLLTPATDALKNRSSVEVRMEFVRQALAYLEQSYKNYTLVTVFGNLHQAQLGGVPGTYQLVRSFLNIKLPAPLPGLQDGEVEGHPVWALIYYCMRCGDLLAASQVVNRAQHQLGEFKTWFQEYMNSKDRRLSPATENKLRLHYRRALRNNTDPYKRAVYCIIGRCDVTDNQSEVADKTEDYLWLKLNQVCFDDDGTSSPQDRLTLSQFQKQLLEDYGESHFTVNQQPFLYFQVLFLTAQFEAAVAFLFRMERLRCHAVHVALVLFELKLLLKSSGQSAQLLSHEPGDPPCLRRLNFVRLLMLYTRKFESTDPREALQYFYFLRDEKDSQGENMFLRCVSELVIESREFDMILGKLENDGSRKPGVIDKFTSDTKPIINKVASVAENKGLFEEAAKLYDLAKNADKVLELMNKLLSPVVPQISAPQSNKERLKNMALSIAERYRAQGISANKFVDSTFYLLLDLITFFDEYHSGHIDRAFDIIERLKLVPLNQESVEERVAAFRNFSDEIRHNLSEVLLATMNILFTQFKRLKGTSPSSSSRPQRVIEDRDSQLRSQARTLITFAGMIPYRTSGDTNARLVQMEVLMN,819,NP_055484.3.csv,refseq-NUP93-NM_014669.4_clinical_seed_0_final,refseq-NUP93-NM_014669.4.a2m,Invitae,refseq-NUP93-NM_014669.4.npy,1,819,819
+NP_055496.2,MPPPRTREGRDRRDHHRAPSEEEALEKWDWNCPETRRLLEDAFFREEDYIRQGSEECQKFWTFFERLQRFQNLKTSRKEEKDPGQPKHSIPALADLPRTYDPRYRINLSVLGPATRGSQGLGRHLPAERVAEFRRALLHYLDFGQKQAFGRLAKLQRERAALPIAQYGNRILQTLKEHQVVVVAGDTGCGKSTQVPQYLLAAGFSHVACTQPRRIACISLAKRVGFESLSQYGSQVGYQIRFESTRSAATKIVFLTVGLLLRQIQREPSLPQYEVLIVDEVHERHLHNDFLLGVLQRLLPTRPDLKVILMSATINISLFSSYFSNAPVVQVPGRLFPITVVYQPQEAEPTTSKSEKLDPRPFLRVLESIDHKYPPEERGDLLVFLSGMAEISAVLEAAQTYASHTQRWVVLPLHSALSVADQDKVFDVAPPGVRKCILSTNIAETSVTIDGIRFVVDSGKVKEMSYDPQAKLQRLQEFWISQASAEQRKGRAGRTGPGVCFRLYAESDYDAFAPYPVPEIRRVALDSLVLQMKSMSVGDPRTFPFIEPPPPASLETAILYLRDQGALDSSEALTPIGSLLAQLPVDVVIGKMLILGSMFSLVEPVLTIAAALSVQSPFTRSAQSSPECAAARRPLESDQGDPFTLFNVFNAWVQVKSERSRNSRKWCRRRGIEEHRLYEMANLRRQFKELLEDHGLLAGAQAAQVGDSYSRLQQRRERRALHQLKRQHEEGAGRRRKVLRLQEEQDGGSSDEDRAGPAPPGASDGVDIQDVKFKLRHDLAQLQAAASSAQDLSREQLALLKLVLGRGLYPQLAVPDAFNSSRKDSDQIFHTQAKQGAVLHPTCVFAGSPEVLHAQELEASNCDGSRDDKDKMSSKHQLLSFVSLLETNKPYLVNCVRIPALQSLLLFSRSLDTNGDCSRLVADGWLELQLADSESAIRLLAASLRLRARWESALDRQLAHQAQQQLEEEEEDTPVSPKEVATLSKELLQFTASKIPYSLRRLTGLEVQNMYVGPQTIPATPHLPGLFGSSTLSPHPTKGGYAVTDFLTYNCLTNDTDLYSDCLRTFWTCPHCGLHAPLTPLERIAHENTCPQAPQDGPPGAEEAALETLQKTSVLQRPYHCEACGKDFLFTPTEVLRHRKQHV,1143,NP_055496.2.csv,refseq-DHX34-NM_014681.5_clinical_seed_0_final,refseq-DHX34-NM_014681.5.a2m,Invitae,refseq-DHX34-NM_014681.5_theta_0.2.npy,1,1143,1143
+NP_055509.2,MDGRWQCSCWAWFLLVLAVVAGDTVSTGSTDNSPTSNSLEGGTDATAFWWGEWTKWTACSRSCGGGVTSQERHCLQQRRKSVPGPGNRTCTGTSKRYQLCRVQECPPDGRSFREEQCVSFNSHVYNGRTHQWKPLYPDDYVHISSKPCDLHCTTVDGQRQLMVPARDGTSCKLTDLRGVCVSGKCEPIGCDGVLFSTHTLDKCGICQGDGSSCTHVTGNYRKGNAHLGYSLVTHIPAGARDIQIVERKKSADVLALADEAGYYFFNGNYKVDSPKNFNIAGTVVKYRRPMDVYETGIEYIVAQGPTNQGLNVMVWNQNGKSPSITFEYTLLQPPHESRPQPIYYGFSESAESQGLDGAGLMGFVPHNGSLYGQASSERLGLDNRLFGHPGLDMELGPSQGQETNEVCEQAGGGACEGPPRGKGFRDRNVTGTPLTGDKDDEEVDTHFASQEFFSANAISDQLLGAGSDLKDFTLNETVNSIFAQGAPRSSLAESFFVDYEENEGAGPYLLNGSYLELSSDRVANSSSEAPFPNVSTSLLTSAGNRTHKARTRPKARKQGVSPADMYRWKLSSHEPCSATCTTGVMSAYAMCVRYDGVEVDDSYCDALTRPEPVHEFCAGRECQPRWETSSWSECSRTCGEGYQFRVVRCWKMLSPGFDSSVYSDLCEAAEAVRPEERKTCRNPACGPQWEMSEWSECTAKCGERSVVTRDIRCSEDEKLCDPNTRPVGEKNCTGPPCDRQWTVSDWGPCSGSCGQGRTIRHVYCKTSDGRVVPESQCQMETKPLAIHPCGDKNCPAHWLAQDWERCNTTCGRGVKKRLVLCMELANGKPQTRSGPECGLAKKPPEESTCFERPCFKWYTSPWSECTKTCGVGVRMRDVKCYQGTDIVRGCDPLVKPVGRQACDLQPCPTEPPDDSCQDQPGTNCALAIKVNLCGHWYYSKACCRSCRPPHS,951,NP_055509.2.csv,refseq-ADAMTSL2-NM_014694.3_clinical_seed_0_final,refseq-ADAMTSL2-NM_014694.3.a2m,Invitae,refseq-ADAMTSL2-NM_014694.3.npy,1,951,951
+NP_055513.2,MMDSPFLELWQSKAVSIREQLGLGDRPNDSYCYNSAKNSTVLQGVTFGGIPTVLLIDVSCFLFLILVFSIIRRRFWDYGRIALVSEADSESRFQRLSSTSSSGQQDFENELGCCPWLTAIFRLHDDQILEWCGEDAIHYLSFQRHIIFLLVVVSFLSLCVILPVNLSGDLLDKDPYSFGRTTIANLQTDNDLLWLHTIFAVIYLFLTVGFMRHHTQSIKYKEENLVRRTLFITGLPRDARKETVESHFRDAYPTCEVVDVQLCYNVAKLIYLCKEKKKTEKSLTYYTNLQVKTGQRTLINPKPCGQFCCCEVLGCEWEDAISYYTRMKDRLLERITEEERHVQDQPLGMAFVTFQEKSMATYILKDFNACKCQSLQCKGEPQPSSHSRELYTSKWTVTFAADPEDICWKNLSIQGLRWWLQWLGINFTLFLGLFFLTTPSIILSTMDKFNVTKPIHALNNPIISQFFPTLLLWSFSALLPSIVYYSTLLESHWTKSGENQIMMTKVYIFLIFMVLILPSLGLTSLDFFFRWLFDKTSSEASIRLECVFLPDQGAFFVNYVIASAFIGNGMELLRLPGLILYTFRMIMAKTAADRRNVKQNQAFQYEFGAMYAWMLCVFTVIVAYSITCPIIAPFGLIYILLKHMVDRHNLYFVYLPAKLEKGIHFAAVNQALAAPILCLFWLYFFSFLRLGMKAPATLFTFLVLLLTILVCLAHTCFGCFKHLSPLNYKTEEPASDKGSEAEAHMPPPFTPYVPRILNGLASERTALSPQQQQQQTYGAIHNISGTIPGQCLAQSATGSVAAAPQEA,807,NP_055513.2.csv,refseq-TMEM63A-NM_014698.2_clinical_seed_0_final,refseq-TMEM63A-NM_014698.2.a2m,Invitae,refseq-TMEM63A-NM_014698.2.npy,1,807,807
+NP_055527.1,MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDLQDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEVEEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYYELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNGTPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRRSENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFRSSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRPTDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESVPFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCTPPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTPPQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFVNSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPPPQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELAEGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNAAKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEPSEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFALDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLYADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPPPREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGPPAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAPRGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLALTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEPVPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSSSSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDMSYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN,1707,NP_055527.1.csv,refseq-SETD1A-NM_014712.2_clinical_seed_0_final,refseq-SETD1A-NM_014712.2.a2m,Invitae,refseq-SETD1A-NM_014712.2.npy,1,1707,1707
+NP_055529.2,MALYYDHQIEAPDAAGSPSFISWHPVHPFLAVAYISTTSTGSVDIYLEQGECVPDTHVERPFRVASLCWHPTRLVLAVGWETGEVTVFNKQDKEQHTMPLTHTADITVLRWSPSGNCLLSGDRLGVLLLWRLDQRGRVQGTPLLKHEYGKHLTHCIFRLPPPGEDLVQLAKAAVSGDEKALDMFNWKKSSSGSLLKMGSHEGLLFFVSLMDGTVHYVDEKGKTTQVVSADSTIQMLFYMEKREALVVVTENLRLSLYTVPPEGKAEEVMKVKLSGKTGRRADIALIEGSLLVMAVGEAALRFWDIERGENYILSPDEKFGFEKGENMNCVCYCKVKGLLAAGTDRGRVAMWRKVPDFLGSPGAEGKDRWALQTPTELQGNITQIQWGSRKNLLAVNSVISVAILSERAMSSHFHQQVAAMQVSPSLLNVCFLSTGVAHSLRTDMHISGVFATKDAVAVWNGRQVAIFELSGAAIRSAGTFLCETPVLAMHEENVYTVESNRVQVRTWQGTVKQLLLFSETEGNPCFLDICGNFLVVGTDLAHFKSFDLSRREAKAHCSCRSLAELVPGVGGIASLRCSSSGSTISILPSKADNSPDSKICFYDVEMDTVTVFDFKTGQIDRRETLSFNEQETNKSHLFVDEGLKNYVPVNHFWDQSEPRLFVCEAVQETPRSQPQSANGQPQDGRAGPAADVLILSFFISEEHGFLLHESFPRPATSHSLLGMEVPYYYFTRKPEEADREDEVEPGCHHIPQMVSRRPLRDFVGLEDCDKATRDAMLHFSFFVTIGDMDEAFKSIKLIKSEAVWENMARMCVKTQRLDVAKVCLGNMGHARGARALREAEQEPELEARVAVLATQLGMLEDAEQLYRKCKRHDLLNKFYQAAGRWQEALQVAEHHDRVHLRSTYHRYAGHLEASADCSRALSYYEKSDTHRFEVPRMLSEDLPSLELYVNKMKDKTLWRWWAQYLESQGEMDAALHYYELARDHFSLVRIHCFQGNVQKAAQIANETGNLAASYHLARQYESQEEVGQAVHFYTRAQAFKNAIRLCKENGLDDQLMNLALLSSPEDMIEAARYYEEKGVQMDRAVMLYHKAGHFSKALELAFATQQFVALQLIAEDLDETSDPALLARCSDFFIEHSQYERAVELLLAARKYQEALQLCLGQNMSITEEMAEKMTVAKDSSDLPEESRRELLEQIADCCMRQGSYHLATKKYTQAGNKLKAMRALLKSGDTEKITFFASVSRQKEIYIMAANYLQSLDWRKEPEIMKNIIGFYTKGRALDLLAGFYDACAQVEIDEYQNYDKAHGALTEAYKCLAKAKAKSPLDQETRLAQLQSRMALVKRFIQARRTYTEDPKESIKQCELLLEEPDLDSTIRIGDVYGFLVEHYVRKEEYQTAYRFLEEMRRRLPLANMSYYVSPQAVDAVHRGLGLPLPRTVPEQVRHNSMEDARELDEEVVEEADDDP,1462,NP_055529.2.csv,refseq-IFT140-NM_014714.3_clinical_seed_0_final,refseq-IFT140-NM_014714.3.a2m,Invitae,refseq-IFT140-NM_014714.3.npy,1,1462,1462
+NP_055543.2,MDVFSFVKIAKLSSHRTKSSGWPPPSGTWGLSQVPPYGWEMTANRDGRDYFINHMTQAIPFDDPRLESCQIIPPAPRKVEMRRDPVLGFGFVAGSEKPVVVRSVTPGGPSEGKLIPGDQIVMINDEPVSAAPRERVIDLVRSCKESILLTVIQPYPSPKSAFISAAKKARLKSNPVKVRFSEEVIINGQVSETVKDNSLLFMPNVLKVYLENGQTKSFRFDCSTSIKDVILTLQEKLSIKGIEHFSLMLEQRTEGAGTKLLLLHEQETLTQVTQRPSSHKMRCLFRISFVPKDPIDLLRRDPVAFEYLYVQSCNDVVQERFGPELKYDIALRLAALQMYIATVTTKQTQKISLKYIEKEWGLETFLPSAVLQSMKEKNIKKALSHLVKANQNLVPPGKKLSALQAKVHYLKFLSDLRLYGGRVFKATLVQAEKRSEVTLLVGPRYGISHVINTKTNLVALLADFSHVNRIEMFSEEESLVRVELHVLDVKPITLLMESSDAMNLACLTAGYYRLLVDSRRSIFNMANKKNTATQETGPENKGKHNLLGPDWNCIPQMTTFIGEGEQEAQITYIDSKQKTVEITDSTMCPKEHRHLYIDNAYSSDGLNQQLSQPGEAPCEADYRSLAQRSLLTLSGPETLKKAQESPRGAKVSFIFGDFALDDGISPPTLGYETLLDEGPEMLEKQRNLYIGSANDMKGLDLTPEAEGIQFVENSVYANIGDVKSFQAAEGIEEPLLHDICYAENTDDAEDEDEVSCEEDLVVGEMNQPAILNLSGSSDDIIDLTSLPPPEGDDNEDDFLLRSLNMAIAAPPPGFRDSSDEEDSQSQAASFPEDKEKGSSLQNDEIPVSLIDAVPTSAEGKCEKGLDNAVVSTLGALEALSVSEEQQTSDNSGVAILRAYSPESSSDSGNETNSSEMTESSELATAQKQSENLSRMFLATHEGYHPLAEEQTEFPASKTPAGGLPPKSSHALAARPATDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFSKLHMGSVAYSCTSKRKSKLADGEGKAPPNGNTTGKKQQGTKTAEMEEEASGKFGTVSSRDSQHLSTFNLERTAFRKDSQRWYVATEGGMAEKSGLEAATGKTFPRASGLGAREAEGKEEGAPDGETSDGSGLGQGDRFLTDVTCASSAKDLDNPEDADSSTCDHPSKLPEADESVARLCDYHLAKRMSSLQSEGHFSLQSSQGSSVDAGCGTGSSGSACATPVESPLCPSLGKHLIPDASGKGVNYIPSEERAPGLPNHGATFKELHPQTEGMCPRMTVPALHTAINTEPLFGTLRDGCHRLPKIKETTV,1322,NP_055543.2.csv,refseq-FRMPD4-NM_014728.3_clinical_seed_0_final,refseq-FRMPD4-NM_014728.3.a2m,Invitae,refseq-FRMPD4-NM_014728.3.npy,1,1322,1322
+NP_055569.1,MASCVGSRTLSKDDVNYKMHFRMINEQQVEDITIDFFYRPHTITLLSFTIVSLMYFAFTRDDSVPEDNIWRGILSVIFFFLIISVLAFPNGPFTRPHPALWRMVFGLSVLYFLFLVFLLFLNFEQVKSLMYWLDPNLRYATREADVMEYAVNCHVITWERIISHFDIFAFGHFWGWAMKALLIRSYGLCWTISITWELTELFFMHLLPNFAECWWDQVILDILLCNGGGIWLGMVVCRFLEMRTYHWASFKDIHTTTGKIKRAVLQFTPASWTYVRWFDPKSSFQRVAGVYLFMIIWQLTELNTFFLKHIFVFQASHPLSWGRILFIGGITAPTVRQYYAYLTDTQCKRVGTQCWVFGVIGFLEAIVCIKFGQDLFSKTQILYVVLWLLCVAFTTFLCLYGMIWYAEHYGHREKTYSECEDGTYSPEISWHHRKGTKGSEDSPPKHAGNNESHSSRRRNRHSKSKVTNGVGKK,473,NP_055569.1.csv,refseq-PTDSS1-NM_014754.2_clinical_seed_0_final,refseq-PTDSS1-NM_014754.2.a2m,Invitae,refseq-PTDSS1-NM_014754.2.npy,1,473,473
+NP_055577.1,MEPAVSLAVCALLFLLWVRLKGLEFVLIHQRWVFVCLFLLPLSLIFDIYYYVRAWVVFKLSSAPRLHEQRVRDIQKQVREWKEQGSKTFMCTGRPGWLTVSLRVGKYKKTHKNIMINLMDILEVDTKKQIVRVEPLVTMGQVTALLTSIGWTLPVLPELDDLTVGGLIMGTGIESSSHKYGLFQHICTAYELVLADGSFVRCTPSENSDLFYAVPWSCGTLGFLVAAEIRIIPAKKYVKLRFEPVRGLEAICAKFTHESQRQENHFVEGLLYSLDEAVIMTGVMTDEAEPSKLNSIGNYYKPWFFKHVENYLKTNREGLEYIPLRHYYHRHTRSIFWELQDIIPFGNNPIFRYLFGWMVPPKISLLKLTQGETLRKLYEQHHVVQDMLVPMKCLQQALHTFQNDIHVYPIWLCPFILPSQPGLVHPKGNEAELYIDIGAYGEPRVKHFEARSCMRQLEKFVRSVHGFQMLYADCYMNREEFWEMFDGSLYHKLREKLGCQDAFPEVYDKICKAARH,516,NP_055577.1.csv,refseq-DHCR24-NM_014762.3_clinical_seed_0_final,refseq-DHCR24-NM_014762.3.a2m,Invitae,refseq-DHCR24-NM_014762.3.npy,1,516,516
+NP_055595.2,MVGELRYREFRVPLGPGLHAYPDELIRQRVGHDGHPEYQIRWLILRRGDEGDGGSGQVDCKAEHILLWMSKDEIYANCHKMLGEDGQVIGPSQESAGEVGALDKSVLEEMETDVKSLIQRALRQLEECVGTIPPAPLLHTVHVLSAYASIEPLTGVFKDPRVLDLLMHMLSSPDYQIRWSAGRMIQALSSHDAGTRTQILLSLSQQEAIEKHLDFDSRCALLALFAQATLSEHPMSFEGIQLPQVPGRVLFSLVKRYLHVTSLLDQLNDSAAEPGAQNTSAPEELSGERGQLELEFSMAMGTLISELVQAMRWDQASDRPRSSARSPGSIFQPQLADVSPGLPAAQAQPSFRRSRRFRPRSEFASGNTYALYVRDTLQPGMRVRMLDDYEEISAGDEGEFRQSNNGVPPVQVFWESTGRTYWVHWHMLEILGFEEDIEDMVEADEYQGAVASRVLGRALPAWRWRPMTELYAVPYVLPEDEDTEECEHLTLAEWWELLFFIKKLDGPDHQEVLQILQENLDGEILDDEILAELAVPIELAQDLLLTLPQRLNDSALRDLINCHVYKKYGPEALAGNQAYPSLLEAQEDVLLLDAQAQAKDSEDAAKVEAKEPPSQSPNTPLQRLVEGYGPAGKILLDLEQALSSEGTQENKVKPLLLQLQRQPQPFLALMQSLDTPETNRTLHLTVLRILKQLVDFPEALLLPWHEAVDACMACLRSPNTDREVLQELIFFLHRLTSVSRDYAVVLNQLGARDAISKALEKHLGKLELAQELRDMVFKCEKHAHLYRKLITNILGGCIQMVLGQIEDHRRTHQPINIPFFDVFLRYLCQGSSVEVKEDKCWEKVEVSSNPHRASKLTDHNPKTYWESNGSAGSHYITLHMRRGILIRQLTLLVASEDSSYMPARVVVCGGDSTSSLHTELNSVNVMPSASRVILLENLTRFWPIIQIRIKRCQQGGIDTRIRGLEILGPKPTFWPVFREQLCRHTRLFYMVRAQAWSQDMAEDRRSLLHLSSRLNGALRQEQNFADRFLPDDEAAQALGKTCWEALVSPVVQNITSPDEDGISPLGWLLDQYLECQEAVFNPQSRGPAFFSRVRRLTHLLVHVEPCEAPPPVVATPRPKGRNRSHDWSSLATRGLPSSIMRNLTRCWRAVVEKQVNNFLTSSWRDDDFVPRYCEHFNILQNSSSELFGPRAAFLLALQNGCAGALLKLPFLKAAHVSEQFARHIDQQIQGSRIGGAQEMERLAQLQQCLQAVLIFSGLEIATTFEHYYQHYMADRLLGVVSSWLEGAVLEQIGPCFPNRLPQQMLQSLSTSKELQRQFHVYQLQQLDQELLKLEDTEKKIQVGLGASGKEHKSEKEEEAGAAAVVDVAEGEEEEEENEDLYYEGAMPEVSVLVLSRHSWPVASICHTLNPRTCLPSYLRGTLNRYSNFYNKSQSHPALERGSQRRLQWTWLGWAELQFGNQTLHVSTVQMWLLLYLNDLKAVSVESLLAFSGLSADMLNQAIGPLTSSRGPLDLHEQKDIPGGVLKIRDGSKEPRSRWDIVRLIPPQTYLQAEGEDGQNLEKRRNLLNCLIVRILKAHGDEGLHIDQLVCLVLEAWQKGPCPPRGLVSSLGKGSACSSTDVLSCILHLLGKGTLRRHDDRPQVLSYAVPVTVMEPHTESLNPGSSGPNPPLTFHTLQIRSRGVPYASCTATQSFSTFR,1698,NP_055595.2.csv,refseq-CUL7-NM_014780.4_clinical_seed_0_final,refseq-CUL7-NM_014780.4.a2m,Invitae,refseq-CUL7-NM_014780.4.npy,1,1698,1698
+NP_055610.1,MKQPIMADGPRCKRRKQANPRRKNVVNYDNVVDTGSETDEEDKLHIAEDDGIANPLDQETSPASVPNHESSPHVSQALLPREEEEDEIREGGVEHPWHNNEILQASVDGPEEMKEDYDTMGPEATIQTAINNGTVKNANCTSDFEEYFAKRKLEERDGHAVSIEEYLQRSDTAIIYPEAPEELSRLGTPEANGQEENDLPPGTPDAFAQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEENFSCPLCSYTFAYRTQLERHMVTHKPGTDQHQMLTQGAGNRKFKCTECGKAFKYKHHLKEHLRIHSGEKPYECPNCKKRFSHSGSYSSHISSKKCIGLISVNGRMRNNIKTGSSPNSVSSSPTNSAITQLRNKLENGKPLSMSEQTGLLKIKTEPLDFNDYKVLMATHGFSGTSPFMNGGLGATSPLGVHPSAQSPMQHLGVGMEAPLLGFPTMNSNLSEVQKVLQIVDNTVSRQKMDCKAEEISKLKGYHMKDPCSQPEEQGVTSPNIPPVGLPVVSHNGATKSIIDYTLEKVNEAKACLQSLTTDSRRQISNIKKEKLRTLIDLVTDDKMIENHNISTPFSCQFCKESFPGPIPLHQHERYLCKMNEEIKAVLQPHENIVPNKAGVFVDNKALLLSSVLSEKGMTSPINPYKDHMSVLKAYYAMNMEPNSDELLKISIAVGLPQEFVKEWFEQRKVYQYSNSRSPSLERSSKPLAPNSNPPTKDSLLPRSPVKPMDSITSPSIAELHNSVTNCDPPLRLTKPSHFTNIKPVEKLDHSRSNTPSPLNLSSTSSKNSHSSSYTPNSFSSEELQAEPLDLSLPKQMKEPKSIIATKNKTKASSISLDHNSVSSSSENSDEPLNLTFIKKEFSNSNNLDNKSTNPVFSMNPFSAKPLYTALPPQSAFPPATFMPPVQTSIPGLRPYPGLDQMSFLPHMAYTYPTGAATFADMQQRRKYQRKQGFQGELLDGAQDYMSGLDDMTDSDSCLSRKKIKKTESGMYACDLCDKTFQKSSSLLRHKYEHTGKRPHQCQICKKAFKHKHHLIEHSRLHSGEKPYQCDKCGKRFSHSGSYSQHMNHRYSYCKREAEEREAAEREAREKGHLEPTELLMNRAYLQSITPQGYSDSEERESMPRDGESEKEHEKEGEDGYGKLGRQDGDEEFEEEEEESENKSMDTDPETIRDEEETGDHSMDDSSEDGKMETKSDHEEDNMEDGM,1214,NP_055610.1.csv,refseq-ZEB2-NM_014795.3_clinical_seed_0_final,refseq-ZEB2-NM_014795.3.a2m,Invitae,refseq-ZEB2-NM_014795.3.npy,1,1214,1214
+NP_055612.2,MAETSPEPSGQLVVHSDAHSDTVLASFEDQRKKGFLCDITLIVENVHFRAHKALLAASSEYFSMMFAEEGEIGQSIYMLEGMVADTFGILLEFIYTGYLHASEKSTEQILATAQFLKVYDLVKAYTDFQNNHSSPKPTTLNTAGAPVVVISNKKNDPPKRKRGRPKKVNTLQEEKSELAAEEEIQLRVNNSVQNRQNFVVKGDSGVLNEQIAAKEKEESEPTCEPSREEEMPVEKDENYDPKTEDGQASQSRYSKRRIWRSVKLKDYKLVGDQEDHGSAKRICGRRKRPGGPEARCKDCGKVFKYNHFLAIHQRSHTGERPFKCNECGKGFAQKHSLQVHTRMHTGERPYTCTVCSKALTTKHSLLEHMSLHSGQKSFTCDQCGKYFSQNRQLKSHYRVHTGHSLPECKDCHRKFMDVSQLKKHLRTHTGEKPFTCEICGKSFTAKSSLQTHIRIHRGEKPYSCGICGKSFSDSSAKRRHCILHTGKKPFSCPECNLQFARLDNLKAHLKIHSKEKHASDASSISGSSNTEEVRNILQLQPYQLSTSGEQEIQLLVTDSVHNINFMPGPSQGISIVTAESSQNMTADQAANLTLLTQQPEQLQNLILSAQQEQTEHIQSLNMIESQMGPSQTEPVHVITLSKETLEHLHAHQEQTEELHLATSTSDPAQHLQLTQEPGPPPPTHHVPQPTPLGQEQS,697,NP_055612.2.csv,refseq-ZBTB24-NM_014797.2_clinical_seed_0_final,refseq-ZBTB24-NM_014797.2.a2m,Invitae,refseq-ZBTB24-NM_014797.2.npy,1,697,697
+NP_055619.2,MGPGQPASTCVHLAPRTQLDGRSDPKVLQTQNQLQFNRNVPTHSSNLAIRYSCPHAIRIEKLKHSYNESYHCKDADCRVGPDLGSSVSFSVISQERLSYAVHLARRDVKRRQFEKHIKEHHLRSQPQSSQKCGHTKYKIPDHRVERKESKSQAACQCSHQPSKVEISSSGAKVYLYSSHPGQSDLTVPNSPPTHDPGLQPHPRIGDHKNISEQKSLLEVQRLQKELSSCIHKIEEVTKKDRLEEALDPDEERRIRIRRQEQAARSARMLYVLQQQVKEIQEELDKLSPHKIKHTKKSWAMSKLAAAHRGAIRALQMFVTQFTDRGEHPLPARCKELGSLIRQLSLCSVKLDADPSVPDVVIDILQQIEALESLLEKKLSPKKVKKCFSEIRSRFPIGSQKALERWPSTSPKGERRPLTAKDTFPQETSRPSVAKQLLADKYQPDTELPETQRLQSELDVLDADIVLEEGPFILDQSASFKDEVLAVAKTKAGKKKPVTENVPFRKKDTLAPARQQGLRKAERGRQSQPHSKSRVQQTTVSSRLKMNRQPVKDRKAPWIPPNPTSPPASPKCAAWLKVKTSPRDATKEPLQQEDPQEESHLTGAVEHEAARLAWLDAETSKRLKELEELKAKEIDSMQKQRLDWLDAETSRRTKELNELKAEEMYRLQQLSVSATHLADKVEEAVLDRLKPLLVKAQRVNSTTEANIHLKDGSSVNTAKAQPAQEVAAVDFESNNIRQLDDFLEDCASELWAVTHAKILGSETLATVEDSKDSPDLEIMMRRMEEMEKYQESVRQRYNKIAYADPRLWMQEENNDQKISAISEKPLSPHPIRITKTVDRKDPAVNIMLERPCNGNSLDESVGTEEGSEKREAPLLSLAEDSQQKEGRAPLFVPPGMQHSIGDYCSRFEQYLRIISHEAVGSFNPWLIAESFSEELVDEALGAVAAELQDMCEDYAEAVFTSEFLEAAT,967,NP_055619.2.csv,refseq-KIAA0753-NM_014804.2_clinical_seed_0_final,refseq-KIAA0753-NM_014804.2.a2m,Invitae,refseq-KIAA0753-NM_014804.2.npy,1,967,967
+NP_055637.2,MSQQGYVATPPYSQPQPGIGLSPPHYGHYGDPSHTASPTGMMKPAGPLGATATRGMLPPGPPPPGPHQFGQNGAHATGHPPQRFPGPPPVNNVASSHAPYQPSAQSSYPGPISTSSVTQLGSQLSAMQINSYGSGMAPPSQGPPGPLSATSLQTPPRPPQPSILQPGSQVLPPPPTTLNGPGASPLPLPMYRPDGLSGPPPPNAQYQPPPLPGQTLGAGYPPQQANSGPQMAGAQLSYPGGFPGGPAQMAGPPQPQKKLDPDSIPSPIQVIENDRASRGGQVYATNTRGQIPPLVTTDCMIQDQGNASPRFIRCTTYCFPCTSDMAKQAQIPLAAVIKPFATIPSNESPLYLVNHGESGPVRCNRCKAYMCPFMQFIEGGRRYQCGFCNCVNDVPPFYFQHLDHIGRRLDHYEKPELSLGSYEYVATLDYCRKSKPPNPPAFIFMIDVSYSNIKNGLVKLICEELKTMLEKIPKEEQEETSAIRVGFITYNKVLHFFNVKSNLAQPQMMVVTDVGEVFVPLLDGFLVNYQESQSVIHNLLDQIPDMFADSNENETVFAPVIQAGMEALKAADCPGKLFIFHSSLPTAEAPGKLKNRDDKKLVNTDKEKILFQPQTNVYDSLAKDCVAHGCSVTLFLFPSQYVDVASLGLVPQLTGGTLYKYNNFQMHLDRQQFLNDLRNDIEKKIGFDAIMRVRTSTGFRATDFFGGILMNNTTDVEMAAIDCDKAVTVEFKHDDKLSEDSGALIQCAVLYTTISGQRRLRIHNLGLNCSSQLADLYKSCETDALINFFAKSAFKAVLHQPLKVIREILVNQTAHMLACYRKNCASPSAASQLILPDSMKVLPVYMNCLLKNCVLLSRPEISTDERAYQRQLVMTMGVADSQLFFYPQLLPIHTLDVKSTMLPAAVRCSESRLSEEGIFLLANGLHMFLWLGVSSPPELIQGIFNVPSFAHINTDMTLLPEVGNPYSQQLRMIMGIIQQKRPYSMKLTIVKQREQPEMVFRQFLVEDKGLYGGSSYVDFLCCVHKEICQLLN,1032,NP_055637.2.csv,refseq-SEC24D-NM_014822.3_clinical_seed_0_final,refseq-SEC24D-NM_014822.3.a2m,Invitae,refseq-SEC24D-NM_014822.3.npy,1,1032,1032
+NP_055660.1,MPTAAAPIISSVQKLVLYETRARYFLVGSNNAETKYRVLKIDRTEPKDLVIIDDRHVYTQQEVRELLGRLDLGNRTKMGQKGSSGLFRAVSAFGVVGFVRFLEGYYIVLITKRRKMADIGGHAIYKVEDTNMIYIPNDSVRVTHPDEARYLRIFQNVDLSSNFYFSYSYDLSHSLQYNLTVLRMPLEMLKSEMTQNRQESFDIFEDEGLITQGGSGVFGICSEPYMKYVWNGELLDIIKSTVHRDWLLYIIHGFCGQSKLLIYGRPVYVTLIARRSSKFAGTRFLKRGANCEGDVANEVETEQILCDASVMSFTAGSYSSYVQVRGSVPLYWSQDISTMMPKPPITLDQADPFAHVAALHFDQMFQRFGSPIIILNLVKEREKRKHERILSEELVAAVTYLNQFLPPEHTIVYIPWDMAKYTKSKLCNVLDRLNVIAESVVKKTGFFVNRPDSYCSILRPDEKWNELGGCVIPTGRLQTGILRTNCVDCLDRTNTAQFMVGKCALAYQLYSLGLIDKPNLQFDTDAVRLFEELYEDHGDTLSLQYGGSQLVHRVKTYRKIAPWTQHSKDIMQTLSRYYSNAFSDADRQDSINLFLGVFHPTEGKPHLWELPTDFYLHHKNTMRLLPTRRSYTYWWTPEVIKHLPLPYDEVICAVNLKKLIVKKFHKYEEEIDIHNEFFRPYELSSFDDTFCLAMTSSARDFMPKTVGIDPSPFTVRKPDETGKSVLGNKSNREEAVLQRKTAASAPPPPSEEAVSSSSEDDSGTDREEEGSVSQRSTPVKMTDAGDSAKVTENVVQPMKELYGINLSDGLSEEDFSIYSRFVQLGQSQHKQDKNSQQPCSRCSDGVIKLTPISAFSQDNIYEVQPPRVDRKSTEIFQAHIQASQGIMQPLGKEDSSMYREYIRNRYL,907,NP_055660.1.csv,refseq-FIG4-NM_014845.5_clinical_seed_0_final,refseq-FIG4-NM_014845.5.a2m,Invitae,refseq-FIG4-NM_014845.5.npy,1,907,907
+NP_055661.3,MLDFLAENNLCGQAILRIVSCGNAIIAELLRLSEFIPAVFRLKDRADQQKYGDIIFDFSYFKGPELWESKLDAKPELQDLDEEFRENNIEIVTRFYLAFQSVHKYIVDLNRYLDDLNEGVYIQQTLETVLLNEDGKQLLCEALYLYGVMLLVIDQKIEGEVRERMLVSYYRYSAARSSADSNMDDICKLLRSTGYSSQPGAKRPSNYPESYFQRVPINESFISMVIGRLRSDDIYNQVSAYPLPEHRSTALANQAAMLYVILYFEPSILHTHQAKMREIVDKYFPDNWVISIYMGITVNLVDAWEPYKAAKTALNNTLDLSNVREQASRYATVSERVHAQVQQFLKEGYLREEMVLDNIPKLLNCLRDCNVAIRWLMLHTADSACDPNNKRLRQIKDQILTDSRYNPRILFQLLLDTAQFEFILKEMFKQMLSEKQTKWEHYKKEGSERMTELADVFSGVKPLTRVEKNENLQAWFREISKQILSLNYDDSTAAGRKTVQLIQALEEVQEFHQLESNLQVCQFLADTRKFLHQMIRTINIKEEVLITMQIVGDLSFAWQLIDSFTSIMQESIRVNPSMVTKLRATFLKLASALDLPLLRINQANSPDLLSVSQYYSGELVSYVRKVLQIIPESMFTSLLKIIKLQTHDIIEVPTRLDKDKLRDYAQLGPRYEVAKLTHAISIFTEGILMMKTTLVGIIKVDPKQLLEDGIRKELVKRVAFALHRGLIFNPRAKPSELMPKLKELGATMDGFHRSFEYIQDYVNIYGLKIWQEEVSRIINYNVEQECNNFLRTKIQDWQSMYQSTHIPIPKFTPVDESVTFIGRLCREILRITDPKMTCHIDQLNTWYDMKTHQEVTSSRLFSEIQTTLGTFGLNGLDRLLCFMIVKELQNFLSMFQKIILRDRTVQDTLKTLMNAVSPLKSIVANSNKIYFSAIAKTQKIWTAYLEAIMKVGQMQILRQQIANELNYSCRFDSKHLAAALENLNKALLADIEAHYQDPSLPYPKEDNTLLYEITAYLEAAGIHNPLNKIYITTKRLPYFPIVNFLFLIAQLPKLQYNKNLGMVCRKPTDPVDWPPLVLGLLTLLKQFHSRYTEQFLALIGQFICSTVEQCTSQKIPEIPADVVGALLFLEDYVRYTKLPRRVAEAHVPNFIFDEFRTVL,1159,NP_055661.3.csv,refseq-WASHC5-NM_014846.3_clinical_seed_0_final,refseq-WASHC5-NM_014846.3.a2m,Invitae,refseq-WASHC5-NM_014846.3.npy,1,1159,1159
+NP_055689.1,MSLLFSRCNSIVTVKKNKRHMAEVNASPLKHFVTAKKKINGIFEQLGAYIQESATFLEDTYRNAELDPVTTEEQVLDVKGYLSKVRGISEVLARRHMKVAFFGRTSNGKSTVINAMLWDKVLPSGIGHTTNCFLRVEGTDGHEAFLLTEGSEEKRSAKTVNQLAHALHQDKQLHAGSLVSVMWPNSKCPLLKDDLVLMDSPGIDVTTELDSWIDKFCLDADVFVLVANSESTLMQTEKHFFHKVSERLSRPNIFILNNRWDASASEPEYMEEVRRQHMERCTSFLVDELGVVDRSQAGDRIFFVSAKEVLNARIQKAQGMPEGGGALAEGFQVRMFEFQNFERRFEECISQSAVKTKFEQHTVRAKQIAEAVRLIMDSLHMAAREQQVYCEEMREERQDRLKFIDKQLELLAQDYKLRIKQITEEVERQVSTAMAEEIRRLSVLVDDYQMDFHPSPVVLKVYKNELHRHIEEGLGRNMSDRCSTAITNSLQTMQQDMIDGLKPLLPVSVRSQIDMLVPRQCFSLNYDLNCDKLCADFQEDIEFHFSLGWTMLVNRFLGPKNSRRALMGYNDQVQRPIPLTPANPSMPPLPQGSLTQEEFMVSMVTGLASLTSRTSMGILVVGGVVWKAVGWRLIALSFGLYGLLYVYERLTWTTKAKERAFKRQFVEHASEKLQLVISYTGSNCSHQVQQELSGTFAHLCQQVDVTRENLEQEIAAMNKKIEVLDSLQSKAKLLRNKAGWLDSELNMFTHQYLQPSR,757,NP_055689.1.csv,refseq-MFN2-NM_014874.3_clinical_seed_0_final,refseq-MFN2-NM_014874.3.a2m,Invitae,refseq-MFN2-NM_014874.3.npy,1,757,757
+NP_055692.3,MEDRRAEKSCEQACESLKRQDYEMALKHCTEALLSLGQYSMADFTGPCPLEIERIKIESLLYRIASFLQLKNYVQADEDCRHVLGEGLAKGEDAFRAVLCCMQLKGKLQPVSTILAKSLTGESLNGMVTKDLTRLKTLLSETETATSNALSGYHVEDLDEGSCNGWHFRPPPRGITSSEEYTLCKRFLEQGICRYGAQCTSAHSQEELAEWQKRYASRLIKLKQQNENKQLSGSYMETLIEKWMNSLSPEKVLSECIEGVKVEHNPDLSVTVSTKKSHQTWTFALTCKPARMLYRVALLYDAHRPHFSIIAISAGDSTTQVSQEVPENCQEWIGGKMAQNGLDHYVYKVGIAFNTEIFGTFRQTIVFDFGLEPVLMQRVMIDAASTEDLEYLMHAKQQLVTTAKRWDSSSKTIIDFEPNETTDLEKSLLIRYQIPLSADQLFTQSVLDKSLTKSNYQSRLHDLLYIEEIAQYKEISKFNLKVQLQILASFMLTGVSGGAKYAQNGQLFGRFKLTETLSEDTLAGRLVMTKVNAVYLLPVPKQKLVQTQGTKEKVYEATIEEKTKEYIFLRLSRECCEELNLRPDCDTQVELQFQLNRLPLCEMHYALDRIKDNGVLFPDISMTPTIPWSPNRQWDEQLDPRLNAKQKEAVLAITTPLAIQLPPVLIIGPYGTGKTFTLAQAVKHILQQQETRILICTHSNSAADLYIKDYLHPYVEAGNPQARPLRVYFRNRWVKTVHPVVHQYCLISSAHSTFQMPQKEDILKHRVVVVTLNTSQYLCQLDLEPGFFTHILLDEAAQAMECETIMPLALATQNTRIVLAGDHMQLSPFVYSEFARERNLHVSLLDRLYEHYPAEFPCRILLCENYRSHEAIINYTSELFYEGKLMASGKQPAHKDFYPLTFFTARGEDVQEKNSTAFYNNAEVFEVVERVEELRRKWPVAWGKLDDGSIGVVTPYADQVFRIRAELRKKRLSDVNVERVLNVQGKQFRVLFLSTVRTRHTCKHKQTPIKKKEQLLEDSTEDLDYGFLSNYKLLNTAITRAQSLVAVVGDPIALCSIGRCRKFWERFIALCHENSSLHGITFEQIKAQLEALELKKTYVLNPLAPEFIPRALRLQHSGSTNKQQQSPPKGKSLHHTQNDHFQNDGIVQPNPSVLIGNPIRAYTPPPPLGPHPNLGKSPSPVQRIDPHTGTSILYVPAVYGGNVVMSVPLPVPWTGYQGRFAVDPRIITHQAAMAYNMNLLQTHGRGSPIPYGLGHHPPVTIGQPQNQHQEKDQHEQNRNGKSDTNNSGPEINKIRTPEKKPTEPKQVDLESNPQNRSPESRPSVVYPSTKFPRKDNLNPRHINLPLPAPHAQYAIPNRHFHPLPQLPRPPFPIPQQHTLLNQQQNNLPEQPNQIPPQPNQVVQQQSQLNQQPQQPPPQLSPAYQAGPNNAFFNSAVAHRPQSPPAEAVIPEQQPPPMLQEGHSPLRAIAQPGPILPSHLNSFIDENPSGLPIGEALDRIHGSVALETLRQQQARFQQWSEHHAFLSQGSAPYPHHHHPHLQHLPQPPLGLHQPPVRADWKLTSSAEDEVETTYSRFQDLIRELSHRDQSETRELAEMPPPQSRLLQYRQVQSRSPPAVPSPPSSTDHSSHFSNFNDNSRDIEVASNPAFPQRLPPQIFNSPFSLPSEHLAPPPLKYLAPDGAWTFANLQQNHLMGPGFPYGLPPLPHRPPQNPFVQIQNHQHAIGQEPFHPLSSRTVSSSSLPSLEEYEPRGPGRPLYQRRISSSSVQPCSEEVSTPQDSLAQCKELQDHSNQSSFNFSSPESWVNTTSSTPYQNIPCNGSSRTAQPRELIAPPKTVKPPEDQLKSENLEVSSSFNYSVLQHLGQFPPLMPNKQIAESANSSSPQSSAGGKPAMSYASALRAPPKPRPPPEQAKKSSDPLSLFQELSLGSSSGSNGFYSYFK,1942,NP_055692.3.csv,refseq-HELZ-NM_014877.4_clinical_seed_0_final,refseq-HELZ-NM_014877.4.a2m,Invitae,refseq-HELZ-NM_014877.4.npy,1,1942,1942
+NP_055713.1,MARDLVMFRDVAVDFSQEEWECLNSYQRNLYRDVILENYSNLVSLAGCSISKPDVITLLEQGKEPWMVVRDEKRRWTLDLESRYDTKKLFQGKDIYEMNLSQWKVMERIKSCGLEEQESPHEVCFRQVTKTTSEKMPTYRKLTSLPLYQKSHNREKPYECGECGKAFRVRQQLTFHQRIHTGEKPYECKECGKAFRQCAHLSRHQRIHTSDKLYECKKCGKIFTCGSDLRVHQRIHIGEKPYECKECGKAFRVRGQLNLHQRIHTGEKPYECKECGKAFRQYAHLTRHQRLNIAEKCYECKECGQAFLCSTGLRLHHKLHTGEKPYECKECGKAFRVRQQLTLHQRIHTGEKPYDCKECGKTFSRGYHLTLHQRIHTGEKPYECKECQKFFRRYSELISHQGIHIGEKPYECKECGKAFRLFSQLTQHQSIHFGEKPFKCKECEKTFRLLSQLTQHQSIHTGEKPYDCKECGKAFRLHSSLIQHQRIHSGEKPYKCKECKKAFRQHSHLTYHQRIHNVT,519,NP_055713.1.csv,refseq-ZFP30-NM_014898.3_clinical_seed_0_final,refseq-ZFP30-NM_014898.3.a2m,Invitae,refseq-ZFP30-NM_014898.3.npy,1,519,519
+NP_055723.1,MTRECPSPAPGPGAPLSGSVLAEAAVVFAVVLSIHATVWDRYSWCAVALAVQAFYVQYKWDRLLQQGSAVFQFRMSANSGLLPASMVMPLLGLVMKERCQTAGNPFFERFGIVVAATGMAVALFSSVLALGITRPVPTNTCVILGLAGGVIIYIMKHSLSVGEVIEVLEVLLIFVYLNMILLYLLPRCFTPGEALLVLGGISFVLNQLIKRSLTLVESQGDPVDFFLLVVVVGMVLMGIFFSTLFVFMDSGTWASSIFFHLMTCVLSLGVVLPWLHRLIRRNPLLWLLQFLFQTDTRIYLLAYWSLLATLACLVVLYQNAKRSSSESKKHQAPTIARKYFHLIVVATYIPGIIFDRPLLYVAATVCLAVFIFLEYVRYFRIKPLGHTLRSFLSLFLDERDSGPLILTHIYLLLGMSLPIWLIPRPCTQKGSLGGARALVPYAGVLAVGVGDTVASIFGSTMGEIRWPGTKKTFEGTMTSIFAQIISVALILIFDSGVDLNYSYAWILGSISTVSLLEAYTTQIDNLLLPLYLLILLMA,538,NP_055723.1.csv,refseq-DOLK-NM_014908.3_clinical_seed_0_final,refseq-DOLK-NM_014908.3.a2m,Invitae,refseq-DOLK-NM_014908.3.npy,1,538,538
+NP_055742.2,MALIMEPVSKWSPSQVVDWMKGLDDCLQQYIKNFEREKISGDQLLRITHQELEDLGVSRIGHQELILEAVDLLCALNYGLETENLKTLSHKLNASAKNLQNFITGRRRSGHYDGRTSRKLPNDFLTSVVDLIGAAKSLLAWLDRSPFAAVTDYSVTRNNVIQLCLELTTIVQQDCTVYETENKILHVCKTLSGVCDHIISLSSDPLVSQSAHLEVIQLANIKPSEGLGMYIKSTYDGLHVITGTTENSPADRCKKIHAGDEVIQVNHQTVVGWQLKNLVNALREDPSGVILTLKKRPQSMLTSAPALLKNMRWKPLALQPLIPRSPTSSVATPSSTISTPTKRDSSALQDLYIPPPPAEPYIPRDEKGNLPCEDLRGHMVGKPVHKGSESPNSFLDQEYRKRFNIVEEDTVLYCYEYEKGRSSSQGRRESTPTYGKLRPISMPVEYNWVGDYEDPNKMKRDSRRENSLLRYMSNEKIAQEEYMFQRNSKKDTGKKSKKKGDKSNSPTHYSLLPSLQMDALRQDIMGTPVPETTLYHTFQQSSLQHKSKKKNKGPIAGKSKRRISCKDLGRGDCEGWLWKKKDAKSYFSQKWKKYWFVLKDASLYWYINEEDEKAEGFISLPEFKIDRASECRKKYAFKACHPKIKSFYFAAEHLDDMNRWLNRINMLTAGYAERERIKQEQDYWSESDKEEADTPSTPKQDSPPPPYDTYPRPPSMSCASPYVEAKHSRLSSTETSQSQSSHEEFRQEVTGSSAVSPIRKTASQRRSWQDLIETPLTSSGLHYLQTLPLEDSVFSDSAAISPEHRRQSTLPTQKCHLQDHYGPYPLAESERMQVLNGNGGKPRSFTLPRDSGFNHCCLNAPVSACDPQDDVQPPEVEEEEEEEEEEGEAAGENIGEKSESREEKLGDSLQDLYRALEQASLSPLGEHRISTKMEYKLSFIKRCNDPVMNEKLHRLRILKSTLKAREGEVAIIDKVLDNPDLTSKEFQQWKQMYLDLFLDICQNTTSNDPLSISSEVDVITSSLAHTHSYIETHV,1034,NP_055742.2.csv,refseq-CNKSR2-NM_014927.4_clinical_seed_0_final,refseq-CNKSR2-NM_014927.4.a2m,Invitae,refseq-CNKSR2-NM_014927.4.npy,1,1034,1034
+NP_055744.2,MLTTLKPFGSVSVESKMNNKAGSFFWNLRQFSTLVSTSRTMRLCCLGLCKPKIVHSNWNILNNFHNRMQSTDIIRYLFQDAFIFKSDVGFQTKGISTLTALRIERLLYAKRLFFDSKQSLVPVDKSDDELKKVNLNHEVSNEDVLTKETKPNRISSRKLSEECNSLSDVLDAFSKAPTFPSSNYFTAMWTIAKRLSDDQKRFEKRLMFSHPAFNQLCEHMMREAKIMQYKYLLFSLHAIVKLGIPQNTILVQTLLRVTQERINECDEICLSVLSTVLEAMEPCKNVHVLRTGFRILVDQQVWKIEDVFTLQVVMKCIGKDAPIALKRKLEMKALRELDRFSVLNSQHMFEVLAAMNHRSLILLDECSKVVLDNIHGCPLRIMINILQSCKDLQYHNLDLFKGLADYVAATFDIWKFRKVLFILILFENLGFRPVGLMDLFMKRIVEDPESLNMKNILSILHTYSSLNHVYKCQNKEQFVEVMASALTGYLHTISSENLLDAVYSFCLMNYFPLAPFNQLLQKDIISELLTSDDMKNAYKLHTLDTCLKLDDTVYLRDIALSLPQLPRELPSSHTNAKVAEVLSSLLGGEGHFSKDVHLPHNYHIDFEIRMDTNRNQVLPLSDVDTTSATDIQRVAVLCVSRSAYCLGSSHPRGFLAMKMRHLNAMGFHVILVNNWEMDKLEMEDAVTFLKTKIYSVEALPVAAVNVQSTQ,710,NP_055744.2.csv,refseq-FASTKD2-NM_014929.3_clinical_seed_0_final,refseq-FASTKD2-NM_014929.3.a2m,Invitae,refseq-FASTKD2-NM_014929.3.npy,1,710,710
+NP_055761.2,MNSPGGRGKKKGSGGASNPVPPRPPPPCLAPAPPAAGPAPPPESPHKRNLYYFSYPLFVGFALLRLVAFHLGLLFVWLCQRFSRALMAAKRSSGAAPAPASASAPAPVPGGEAERVRVFHKQAFEYISIALRIDEDEKAGQKEQAVEWYKKGIEELEKGIAVIVTGQGEQCERARRLQAKMMTNLVMAKDRLQLLEKMQPVLPFSKSQTDVYNDSTNLACRNGHLQSESGAVPKRKDPLTHTSNSLPRSKTVMKTGSAGLSGHHRAPSYSGLSMVSGVKQGSGPAPTTHKGTPKTNRTNKPSTPTTATRKKKDLKNFRNVDSNLANLIMNEIVDNGTAVKFDDIAGQDLAKQALQEIVILPSLRPELFTGLRAPARGLLLFGPPGNGKTMLAKAVAAESNATFFNISAASLTSKYVGEGEKLVRALFAVARELQPSIIFIDEVDSLLCERREGEHDASRRLKTEFLIEFDGVQSAGDDRVLVMGATNRPQELDEAVLRRFIKRVYVSLPNEETRLLLLKNLLCKQGSPLTQKELAQLARMTDGYSGSDLTALAKDAALGPIRELKPEQVKNMSASEMRNIRLSDFTESLKKIKRSVSPQTLEAYIRWNKDFGDTTV,616,NP_055761.2.csv,refseq-SPAST-NM_014946.3_clinical_seed_0_final,refseq-SPAST-NM_014946.3.a2m,Invitae,refseq-SPAST-NM_014946.3.npy,1,616,616
+NP_055771.4,MAGRPLRIGDQLVLEEDYDETYIPSEQEILEFAREIGIDPIKEPELMWLAREGIVAPLPGEWKPCQDITGDIYYFNFANGQSMWDHPCDEHYRSLVIQERAKLSTSGAIKKKKKKKEKKDKKDRDPPKSSLALGSSLAPVHVPLGGLAPLRGLVDTPPSALRGSQSVSLGSSVESGRQLGELMLPSQGLKTSAYTKGLLGSIYEDKTALSLLGLGEETNEEDEEESDNQSVHSSSEPLRNLHLDIGALGGDFEYEESLRTSQPEEKKDVSLDSDAAGPPTPCKPSSPGADSSLSSAVGKGRQGSGARPGLPEKEENEKSEPKICRNLVTPKADPTGSEPAKASEKEAPEDTVDAGEEGSRREEAAKEPKKKASALEEGSSDASQELEISEHMKEPQLSDSIASDPKSFHGLDFGFRSRISEHLLDVDVLSPVLGGACRQAQQPLGIEDKDDSQSSQDELQSKQSKGLEERLSPPLPHEERAQSPPRSLATEEEPPQGPEGQPEWKEAEELGEDSAASLSLQLSLQREQAPSPPAACEKGKEQHSQAEELGPGQEEAEDPEEKVAVSPTPPVSPEVRSTEPVAPPEQLSEAALKAMEEAVAQVLEQDQRHLLESKQEKMQQLREKLCQEEEEEILRLHQQKEQSLSSLRERLQKAIEEEEARMREEESQRLSWLRAQVQSSTQADEDQIRAEQEASLQKLREELESQQKAERASLEQKNRQMLEQLKEEIEASEKSEQAALNAAKEKALQQLREQLEGERKEAVATLEKEHSAELERLCSSLEAKHREVVSSLQKKIQEAQQKEEAQLQKCLGQVEHRVHQKSYHVAGYEHELSSLLREKRQEVEGEHERRLDKMKEEHQQVMAKAREQYEAEERKQRAELLGHLTGELERLQRAHERELETVRQEQHKRLEDLRRRHREQERKLQDLELDLETRAKDVKARLALLEVQEETARREKQQLLDVQRQVALKSEEATATHQQLEEAQKEHTHLLQSNQQLREILDELQARKLKLESQVDLLQAQSQQLQKHFSSLEAEAQKKQHLLREVTVEENNASPHFEPDLHIEDLRKSLGTNQTKEVSSSLSQSKEDLYLDSLSSHNVWHLLSAEGVALRSAKEFLVQQTRSMRRRQTALKAAQQHWRHELASAQEVAKDPPGIKALEDMRKNLEKETRHLDEMKSAMRKGHNLLKKKEEKLNQLESSLWEEASDEGTLGGSPTKKAVTFDLSDMDSLSSESSESFSPPHREWWRQQRIDSTPSLTSRKIHGLSHSLRQISSQLSSVLSILDSLNPQSPPPLLASMPAQLPPRDPKSTPTPTYYGSLARFSALSSATPTSTQWAWDSGQGPRLPSSVAQTVDDFLLEKWRKYFPSGIPLLSNSPTPLESRLGYMSASEQLRLLQHSHSQVPEAGSTTFQGIIEANRRWLERVKNDPRLPLFSSTPKPKATLSLLQLGLDEHNRVKVYRF,1460,NP_055771.4.csv,refseq-CEP164-NM_014956.4_clinical_seed_0_final,refseq-CEP164-NM_014956.4.a2m,Invitae,refseq-CEP164-NM_014956.4.npy,1,1460,1460
+NP_055775.2,MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPRGRSLYGAGLWEMDSLVGQIKDKVDHTVKENTFLWFTGDNGPWAQKCELAGSVGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQASLPQGRRFDGVDVSEVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGGARACDGSTGPELQHKFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVLQDIANDNISSADYTQDPSVTPCCNPYQIACRCQAA,525,NP_055775.2.csv,refseq-ARSG-NM_014960.4_clinical_seed_0_final,refseq-ARSG-NM_014960.4.a2m,Invitae,refseq-ARSG-NM_014960.4.npy,1,525,525
+NP_055782.3,MMSEGKPPDKKRPRRSLSISKNKKKASNSIISCFNNAPPAKLACPVCSKMVPRYDLNRHLDEMCANNDFVQVDPGQVGLINSNVSMVDLTSVTLEDVTPKKSPPPKTNLTPGQSDSAKREVKQKISPYFKSNDVVCKNQDELRNRSVKVICLGSLASKLSRKYVKAKKSIDKDEEFAGSSPQSSKSTVVKSLIDNSSEIEDEDQILENSSQKENVFKCDSLKEECIPEHMVRGSKIMEAESQKATRECEKSALTPGFSDNAIMLFSPDFTLRNTLKSTSEDSLVKQECIKEVVEKREACHCEEVKMTVASEAKIQLSDSEAKSHSSADDASAWSNIQEAPLQDDSCLNNDIPHSIPLEQGSSCNGPGQTTGHPYYLRSFLVVLKTVLENEDDMLLFDEQEKGIVTKFYQLSATGQKLYVRLFQRKLSWIKMTKLEYEEIALDLTPVIEELTNAGFLQTESELQELSEVLELLSAPELKSLAKTFHLVNPNGQKQQLVDAFLKLAKQRSVCTWGKNKPGIGAVILKRAKALAGQSVRICKGPRAVFSRILLLFSLTDSMEDEDAACGGQGQLSTVLLVNLGRMEFPSYTINRKTHIFQDRDDLIRYAAATHMLSDISSAMANGNWEEAKELAQCAKRDWNRLKNHPSLRCHEDLPLFLRCFTVGWIYTRILSRFVEILQRLHMYEEAVRELESLLSQRIYCPDSRGRWWDRLALNLHQHLKRLEPTIKCITEGLADPEVRTGHRLSLYQRAVRLRESPSCKKFKHLFQQLPEMAVQDVKHVTITGRLCPQRGMCKSVFVMEAGEAADPTTVLCSVEELALAHYRRSGFDQGIHGEGSTFSTLYGLLLWDIIFMDGIPDVFRNACQAFPLDLCTDSFFTSRRPALEARLQLIHDAPEESLRAWVAATWHEQEGRVASLVSWDRFTSLQQAQDLVSCLGGPVLSGVCRHLAADFRHCRGGLPDLVVWNSQSRHFKLVEVKGPNDRLSHKQMIWLAELQKLGAEVEVCHVVAVGAKSQSLS,1017,NP_055782.3.csv,refseq-FAN1-NM_014967.4_clinical_seed_0_final,refseq-FAN1-NM_014967.4.a2m,Invitae,refseq-FAN1-NM_014967.4.npy,1,1017,1017
+NP_055790.1,MSDSLWTALSNFSMPSFPGGSMFRRTKSCRTSNRKSLILTSTSPTLPRPHSPLPGHLGSSPLDSPRNFSPNTPAHFSFASSRRADGRRWSLASLPSSGYGTNTPSSTVSSSCSSQERLHQLPYQPTVDELHFLSKHFGSTESITDEDGGRRSPAVRPRSRSLSPGRSPSSYDNEIVMMNHVYKERFPKATAQMEEKLRDFTRAYEPDSVLPLADGVLSFIHHQIIELARDCLTKSRDGLITTVYFYELQENLEKLLQDAYERSESLEVAFVTQLVKKLLIIISRPARLLECLEFNPEEFYHLLEAAEGHAKEGHLVKTDIPRYIIRQLGLTRDPFPDVVHLEEQDSGGSNTPEQDDLSEGRSSKAKKPPGENDFDTIKLISNGAYGAVYLVRHRDTRQRFAMKKINKQNLILRNQIQQAFVERDILTFAENPFVVGMFCSFETRRHLCMVMEYVEGGDCATLLKNIGALPVEMARMYFAETVLALEYLHNYGIVHRDLKPDNLLITSMGHIKLTDFGLSKMGLMSLTTNLYEGHIEKDAREFLDKQVCGTPEYIAPEVILRQGYGKPVDWWAMGIILYEFLVGCVPFFGDTPEELFGQVISDDILWPEGDEALPTEAQLLISSLLQTNPLVRLGAGGAFEVKQHSFFRDLDWTGLLRQKAEFIPHLESEDDTSYFDTRSDRYHHVNSYDEDDTTEEEPVEIRQFSSCSPRFSKVYSSMEQLSQHEPKTPVAAAGSSKREPSTKGPEEKVAGKREGLGGLTLREKTWRGGSPEIKRFSASEASFLEGEASPPLGARRRFSALLEPSRFSAPQEDEDEARLRRPPRPSSDPAGSLDARAPKEETQGEGTSSAGDSEATDRPRPGDLCPPSKDGDASGPRATNDLVLRRARHQQMSGDVAVEKRPSRTGGKVIKSASATALSVMIPAVDPHGSSPLASPMSPRSLSSNPSSRDSSPSRDYSPAVSGLRSPITIQRSGKKYGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLCAGDLITHVNGEPVHGMVHPEVVELILKSGNKVAVTTTPFENTSIRIGPARRSSYKAKMARRNKRPSAKEGQESKKRSSLFRKITKQSNLLHTSRSLSSLNRSLSSSDSLPGSPTHGLPARSPTHSYRSTPDSAYLGASSQSSSPASSTPNSPASSASHHIRPSTLHGLSPKLHRQYRSARCKSAGNIPLSPLAHTPSPTQASPPPLPGHTVGSSHTTQSFPAKLHSSPPVVRPRPKSAEPPRSPLLKRVQSAEKLGASLSADKKGALRKHSLEVGHPDFRKDFHGELALHSLAESDGETPPVEGLGAPRQVAVRRLGRQESPLSLGADPLLPEGASRPPVSSKEKESPGGAEACTPPRATTPGGRTLERDVGCTRHQSVQTEDGTGGMARAVAKAALSPVQEHETGRRSSSGEAGTPLVPIVVEPARPGAKAVVPQPLGADSKGLQEPAPLAPSVPEAPRGRERWVLEVVEERTTLSGPRSKPASPKLSPEPQTPSLAPAKCSAPSSAVTPVPPASLLGSGTKPQVGLTSRCPAEAVPPAGLTKKGVSSPAPPGP,1570,NP_055790.1.csv,refseq-MAST1-NM_014975.2_clinical_seed_0_final,refseq-MAST1-NM_014975.2.a2m,Invitae,refseq-MAST1-NM_014975.2.npy,1,1570,1570
+NP_055800.2,MSLDFGSVALPVQNEDEEYDEEDYEREKELQQLLTDLPHDMLDDDLSSPELQYSDCSEDGTDGQPHHPEQLEMSWNEQMLPKSQSVNGYNEIQSLYAGEKCGNVWEENRSKTEDRHPVYHPEEGGDEGGSGYSPPSKCEQTDLYHLPENFRPYTNGQKQEFNNQATNVIKFSDPQWNHFQGPSCQGLEPYNKVTYKPYQSSAQNNGSPAQEITGSDTFEGLQQQFLGANENSAENMQIIQLQVLNKAKERQLENLIEKLNESERQIRYLNHQLVIIKDEKDGLTLSLRESQKLFQNGKEREIQLEAQIKALETQIQALKVNEEQMIKKSRTTEMALESLKQQLVDLHHSESLQRAREQHESIVMGLTKKYEEQVLSLQKNLDATVTALKEQEDICSRLKDHVKQLERNQEAIKLEKTEIINKLTRSLEESQKQCAHLLQSGSVQEVAQLQFQLQQAQKAHAMSANMNKALQEELTELKDEISLYESAAKLGIHPSDSEGELNIELTESYVDLGIKKVNWKKSKVTSIVQEEDPNEELSKDEFILKLKAEVQRLLGSNSMKRHLVSQLQNDLKDCHKKIEDLHQVKKDEKSIEVETKTDTSEKPKNQLWPESSTSDVVRDDILLLKNEIQVLQQQNQELKETEGKLRNTNQDLCNQMRQMVQDFDHDKQEAVDRCERTYQQHHEAMKTQIRESLLAKHALEKQQLFEAYERTHLQLRSELDKLNKEVTAVQECYLEVCREKDNLELTLRKTTEKEQQTQEKIKEKLIQQLEKEWQSKLDQTIKAMKKKTLDCGSQTDQVTTSDVISKKEMAIMIEEQKCTIQQNLEQEKDIAIKGAMKKLEIELELKHCENITKQVEIAVQNAHQRWLGELPELAEYQALVKAEQKKWEEQHEVSVNKRISFAVSEAKEKWKSELENMRKNILPGKELEEKIHSLQKELELKNEEVPVVIRAELAKARSEWNKEKQEEIHRIQEQNEQDYRQFLDDHRNKINEVLAAAKEDFMKQKTELLLQKETELQTCLDQSRREWTMQEAKRIQLEIYQYEEDILTVLGVLLSDTQKEHISDSEDKQLLEIMSTCSSKWMSVQYFEKLKGCIQKAFQDTLPLLVENADPEWKKRNMAELSKDSASQGTGQGDPGPAAGHHAQPLALQATEAEAEENNKVVEELIEENNDMKNKLEELQTLCKTPPRSLSAGAIENACLPCSGGALEELRGQYIKAVKKIKCDMLRYIQESKERAAEMVKAEVLRERQETARKMRKYYLICLQQILQDDGKEGAEKKIMNAASKLATMAKLLETPISSKSQSKTTQSALPLTSEMLIAVKKSKRNDVNQKIPCCIESKSNSVNTITRTLCEQAPKRRAACNLQRLLENSEHQSIKHVGSKETHLEFQFGDGSCKHLNSLPRNVSPEFVPCEGEGGFGLHKKKDLLSDNGSESLPHSAAYPFLGTLGNKPSPRCTPGPSESGCMHITFRDSNERLGLKVYKCNPLMESENAASEKSQGLDVQEPPVKDGGDLSDCLGWPSSSATLSFDSREASFVHGRPQGTLEIPSESVKSKQFSPSGYLSDTEESNMICQTMKCQRYQTPYLSEETTYLEPGKISVNCGHPSRHKADRLKSDFKKLSSTLPSSVCQQPSRKLIVPLSSQQDSGFDSPFVNLD,1654,NP_055800.2.csv,refseq-CEP152-NM_014985.3_clinical_seed_0_final,refseq-CEP152-NM_014985.3.a2m,Invitae,refseq-CEP152-NM_014985.3.npy,1,1654,1654
+NP_055805.1,MFSKKPHGDVKKSTQKVLDTKKDALTRLKHLRIVIENAESIDLKQFFDQHFSHIYYVFFENFVTIEASLKQKGHKSQREELDAILFIFEKILQLLPERIHQRWQFHSIGLILKKLLHTGNSLKIRREGVRLFLLWLQALQNNCSKEQLWMFSCLIPGFSAPQSEHGPRTLDNLINPPLNLQETQVTIEEITPLVPPQSGDKGQEDLTSYFLEALLKYIVIQVKSLEWKNKENQERGFSFLFSHFKKYYLPYIFPNICKENSLYHPILDIPQMRPKPHYVVIKKDAETNEAIYCTKEPFIKARVIVIRWLVSFWLEPKPHTGPHIPGMEGEVLPKNIQRAAASLVSREESKNDNADKTDRTTEPEQSHSNTSTLTEREPSSSSLCSIDEEHLTDIEIVRRVFSSKRSNVNFVTEIFRQAFLLPICEAAAMRKVVKVYQEWIQQEEKPLFMQEPEEIVITSSDLPCIENVTDHDISMEEGEKREEENGTNTADHVRNSSWAKNGSYQGALHNASEEATEQNIRAGTQAVLQVFIINSSNIFLLEPANEIKNLLDEHTDMCKRILNIYRYMVVQVSMDKKTWEQMLLVLLRVTESVLKMPSQAFLQFQGKKNMTLAGRLAGPLFQTLIVAWIKANLNVYISRELWDDLLSVLSSLTYWEELATEWSLTMETLTKVLARNLYSLDLSDLPLDKLSEQKQKKHKGKGVGHEFQKVSVDKSFSRGWSRDQPGQAPMRQRSATTTGSPGTEKARSIVRQKTVDIDDAQILPRSTRVRHFSQSEETGNEVFGALNEEQPLPRSSSTSDILEPFTVERAKVNKEDMSQKLPPLNSDIGGSSANVPDLMDEFIAERLRSGNASTMTRRGSSPGSLEIPKDLPDILNKQNQMRPIDDPGVPSEWTSPASAGSSDLISSDSHSDSFSAFQYDGRKFDNFGFGTDTGVTSSADVDSGSGHHQSAEEQEVASLTTLHIDSETSSLNQQAFSAEVATITGSESASPVHSPLGSRSQTPSPSTLNIDHMEQKDLQLDEKLHHSVLQTPDDLEISEFPSECCSVMAGGTLTGWHADVATVMWRRMLGILGDVNSIMDPEIHAQVFDYLCELWQNLAKIRDNLGISTDNLTSPSPPVLIPPLRILTPWLFKATMLTDKYKQGKLHAYKLICNTMKRRQDVSPNRDFLTHFYNIMHCGLLHIDQDIVNTIIKHCSPQFFSLGLPGATMLIMDFIVAAGRVASSAFLNAPRVEAQVLLGSLVCFPNLYCELPSLHPNIPDVAVSQFTDVKELIIKTVLSSARDEPSGPARCVALCSLGIWICEELVHESHHPQIKEALNVICVSLKFTNKTVAHVACNMLHMLVHYVPRLQIYQPDSPLKIIQILIATITHLLPSTEASSYEMDKRLVVSLLLCLLDWIMALPLKTLLQPFHATGAESDKTEKSVLNCIYKVLHGCVYGAQCFSNPRYFPMSLSDLASVDYDPFMHLESLKEPEPLHSPDSERSSKLQPVTEVKTQMQHGLISIAARTVITHLVNHLGHYPMSGGPAMLTSQVCENHDNHYSESTELSPELFESPNIQFFVLNNTTLVSCIQIRSEENMPGGGLSAGLASANSNVRIIVRDLSGKYSWDSAILYGPPPVSGLSEPTSFMLSLSHQEKPEEPPTSNECLEDITVKDGLSLQFKRFRETVPTWDTIRDEEDVLDELLQYLGVTSPECLQRTGISLNIPAPQPVCISEKQENDVINAILKQHTEEKEFVEKHFNDLNMKAVEQDEPIPQKPQSAFYYCRLLLSILGMNSWDKRRSFHLLKKNEKLLRELRNLDSRQCRETHKIAVFYVAEGQEDKHSILTNTGGSQAYEDFVAGLGWEVNLTNHCGFMGGLQKNKSTGLTTPYFATSTVEVIFHVSTRMPSDSDDSLTKKLRHLGNDEVHIVWSEHTRDYRRGIIPTEFGDVLIVIYPMKNHMFSIQIMKKPEVPFFGPLFDGAIVNGKVLPIMVRATAINASRALKSLIPLYQNFYEERARYLQTIVQHHLEPTTFEDFAAQVFSPAPYHHLPSDADH,2036,NP_055805.1.csv,refseq-RALGAPA1-NM_014990.1_clinical_seed_0_final,refseq-RALGAPA1-NM_014990.1.a2m,Invitae,refseq-RALGAPA1-NM_014990.1.npy,1,2036,2036
+NP_055824.1,MGFELDRFDGDVDPDLKCALCHKVLEDPLTTPCGHVFCAGCVLPWVVQEGSCPARCRGRLSAKELNHVLPLKRLILKLDIKCAYATRGCGRVVKLQQLPEHLERCDFAPARCRHAGCGQVLLRRDVEAHMRDACDARPVGRCQEGCGLPLTHGEQRAGGHCCARALRAHNGALQARLGALHKALKKEALRAGKREKSLVAQLAAAQLELQMTALRYQKKFTEYSARLDSLSRCVAAPPGGKGEETKSLTLVLHRDSGSLGFNIIGGRPSVDNHDGSSSEGIFVSKIVDSGPAAKEGGLQIHDRIIEVNGRDLSRATHDQAVEAFKTAKEPIVVQVLRRTPRTKMFTPPSESQLVDTGTQTDITFEHIMALTKMSSPSPPVLDPYLLPEEHPSAHEYYDPNDYIGDIHQEMDREELELEEVDLYRMNSQDKLGLTVCYRTDDEDDIGIYISEIDPNSIAAKDGRIREGDRIIQINGIEVQNREEAVALLTSEENKNFSLLIARPELQLDEGWMDDDRNDFLDDLHMDMLEEQHHQAMQFTASVLQQKKHDEDGGTTDTATILSNQHEKDSGVGRTDESTRNDESSEQENNGDDATASSNPLAGQRKLTCSQDTLGSGDLPFSNESFISADCTDADYLGIPVDECERFRELLELKCQVKSATPYGLYYPSGPLDAGKSDPESVDKELELLNEELRSIELECLSIVRAHKMQQLKEQYRESWMLHNSGFRNYNTSIDVRRHELSDITELPEKSDKDSSSAYNTGESCRSTPLTLEISPDNSLRRAAEGISCPSSEGAVGTTEAYGPASKNLLSITEDPEVGTPTYSPSLKELDPNQPLESKERRASDGSRSPTPSQKLGSAYLPSYHHSPYKHAHIPAHAQHYQSYMQLIQQKSAVEYAQSQMSLVSMCKDLSSPTPSEPRMEWKVKIRSDGTRYITKRPVRDRLLRERALKIREERSGMTTDDDAVSEMKMGRYWSKEERKQHLVKAKEQRRRREFMMQSRLDCLKEQQAADDRKEMNILELSHKKMMKKRNKKIFDNWMTIQELLTHGTKSPDGTRVYNSFLSVTTV,1066,NP_055824.1.csv,refseq-PDZRN3-NM_015009.2_clinical_seed_0_final,refseq-PDZRN3-NM_015009.2.a2m,Invitae,refseq-PDZRN3-NM_015009.2.npy,1,1066,1066
+NP_055826.1,MEIDQCLLESLPLGQRQRLVKRMRCEQIKAYYEREKAFQKQEGFLKRLKHAKNPKVHFNLTDMLQDAIIHHNDKEVLRLLKEGADPHTLVSSGGSLLHLCARYDNAFIAEILIDRGVNVNHQDEDFWTPMHIACACDNPDIVLLLVLAGANVLLQDVNGNIPLDYAVEGTESSSILLTYLDENGVDLTSLRQMKLQRPMSMLTDVKHFLSSGGNVNEKNDEGVTLLHMACASGYKEVVSLILEHGGDLNIVDDQYWTPLHLAAKYGQTNLVKLLLMHQANPHLVNCNEEKASDIAASEFIEEMLLKAEIAWEEKMKEPLSASTLAQEEPYEEIIHDLPVLSSKLSPLVLPIAKQDSLLEKDIMFKDATKGLCKQQSQDSIPENPMMSGSTKPEQVKLMPPAPNDDLATLSELNDGSLLYEIQKRFGNNQIYTFIGDILLLVNPYKELPIYSSMVSQLYFSSSGKLCSSLPPHLFSCVERAFHQLFREQRPQCFILSGERGSGKSEASKQIIRHLTCRAGASRATLDSRFKHVVCILEAFGHAKTTLNDLSSCFIKYFELQFCERKQQLTGARIYTYLLEKSRLVSQPLGQSNFLIFYLLMDGLSAEEKYGLHLNNLCAHRYLNQTIQDDASTGERSLNREKLAVLKRALNVVGFSSLEVENLFVILAAILHLGDIRFTALNEGNSAFVSDLQLLEQVAGMLQVSTDELASALTTDIQYFKGDMIIRRHTIQIAEFFRDLLAKSLYSRLFSFLVNTMNSCLHSQDEQKSMQTLDIGILDIFGFEEFQKNEFEQLCVNMTNEKMHHYINEVLFLHEQVECVQEGVTMETAYSPGNQNGVLDFFFQKPSGFLTLLDEESQMIWSVESNFPKKLQSLLESSNTNAVYSPMKDGNGNVALKDHGTAFTIMHYAGRVMYDVVGAIEKNKDSLSQNLLFVMKTSENVVINHLFQSKLSQTGSLVSAYPSFKFRGHKSALLSKKMTASSIIGENKNYLELSKLLKKKGTSTFLQRLERGDPVTIASQLRKSLMDIIGKLQKCTPHFIHCIRPNNSKLPDTFDNFYVSAQLQYIGVLEMVKIFRYGYPVRLSFSDFLSRYKPLADTFLREKKEQSAAERCRLVLQQCKLQGWQMGVRKVFLKYWHADQLNDLCLQLQRKIITCQKVIRGFLARQHLLQRISIRQQEVTSINSFLQNTEDMGLKTYDALVIQNASDIARENDRLRSEMNAPYHKEKLEVRNMQEEGSKRTDDKSGPRHFHPSSMSVCAAVDGLGQCLVGPSIWSPSLHSVFSMDDSSSLPSPRKQPPPKPKRDPNTRLSASYEAVSACLSAAREAANEALARPRPHSDDYSTMKKIPPRKPKRSPNTKLSGSYEEISGSRPGDARPAGAPGAAARVLTPGTPQCALPPAAPPGDEDDSEPVYIEMLGHAARPDSPDPGESVYEEMKCCLPDDGGPGAGSFLLHGASPPLLHRAPEDEAAGPPGDACDIPPPFPNLLPHRPPLLVFPPTPVTCSPASDESPLTPLEVKKLPVLETNLKYPVQPEGSSPLSPQYSKSQKGDGDRPASPGLALFNGSGRASPPSTPPPPPPPPGPPPAPYRPCAHLAFPPEPAPVNAGKAGPSAEAPKVHPKPNSAPVAGPCSSFPKIPYSPVKATRADARKAGSSASPPAPYSPPSSRPLSSPLDELASLFNSGRSVLRKSAAGRKIREAEGFETNMNISSRDDPSTSEITSETQDRNANNHGIQLSNSLSSAITAENGNSISNGLPEEDGYSRLSISGTGTSTFQRHRDSHTTQVIHQLRLSENESVALQELLDWRRKLCEEGQDWQQILHHAEPRVPPPPPCKKPSLLKKPEGASCNRLPSELWDTTI,1858,NP_055826.1.csv,refseq-MYO16-NM_015011.3_clinical_seed_0_final,refseq-MYO16-NM_015011.3.a2m,Invitae,refseq-MYO16-NM_015011.3_theta_0.2.npy,1,1858,1858
+NP_055830.1,MGSEDHGAQNPSCKIMTFRPTMEEFKDFNKYVAYIESQGAHRAGLAKIIPPKEWKPRQTYDDIDDVVIPAPIQQVVTGQSGLFTQYNIQKKAMTVGEYRRLANSEKYCTPRHQDFDDLERKYWKNLTFVSPIYGADISGSLYDDDVAQWNIGSLRTILDMVERECGTIIEGVNTPYLYFGMWKTTFAWHTEDMDLYSINYLHFGEPKSWYAIPPEHGKRLERLAIGFFPGSSQGCDAFLRHKMTLISPIILKKYGIPFSRITQEAGEFMITFPYGYHAGFNHGFNCAESTNFATLRWIDYGKVATQCTCRKDMVKISMDVFVRILQPERYELWKQGKDLTVLDHTRPTALTSPELSSWSASRASLKAKLLRRSHRKRSQPKKPKPEDPKFPGEGTAGAALLEEAGGSVKEEAGPEVDPEEEEEEPQPLPHGREAEGAEEDGRGKLRPTKAKSERKKKSFGLLPPQLPPPPAHFPSEEALWLPSPLEPPVLGPGPAAMEESPLPAPLNVVPPEVPSEELEAKPRPIIPMLYVVPRPGKAAFNQEHVSCQQAFEHFAQKGPTWKEPVSPMELTGPEDGAASSGAGRMETKARAGEGQAPSTFSKLKMEIKKSRRHPLGRPPTRSPLSVVKQEASSDEEASPFSGEEDVSDPDALRPLLSLQWKNRAASFQAERKFNAAAARTEPYCAICTLFYPYCQALQTEKEAPIASLGEGCPATLPSKSRQKTRPLIPEMCFTSGGENTEPLPANSYIGDDGTSPLIACGKCCLQVHASCYGIRPELVNEGWTCSRCAAHAWTAECCLCNLRGGALQMTTDRRWIHVICAIAVPEARFLNVIERHPVDISAIPEQRWKLKCVYCRKRMKKVSGACIQCSYEHCSTSFHVTCAHAAGVLMEPDDWPYVVSITCLKHKSGGHAVQLLRAVSLGQVVITKNRNGLYYRCRVIGAASQTCYEVNFDDGSYSDNLYPESITSRDCVQLGPPSEGELVELRWTDGNLYKAKFISSVTSHIYQVEFEDGSQLTVKRGDIFTLEEELPKRVRSRLSLSTGAPQEPAFSGEEAKAAKRPRVGTPLATEDSGRSQDYVAFVESLLQVQGRPGAPF,1096,NP_055830.1.csv,NP_055830.1_clinical_seed_0_final,NP_055830.1.a2m,popEVE,NP_055830.1_theta_0.2.npy,1,1096,1096
+NP_055840.2,MEVDTEEKRHRTRSKGVRVPVEPAIQELFSCPTPGCDGSGHVSGKYARHRSVYGCPLAKKRKTQDKQPQEPAPKRKPFAVKADSSSVDECDDSDGTEDMDEKEEDEGEEYSEDNDEPGDEDEEDEEGDREEEEEIEEEDEDDDEDGEDVEDEEEEEEEEEEEEEEEENEDHQMNCHNTRIMQDTEKDDNNNDEYDNYDELVAKSLLNLGKIAEDAAYRARTESEMNSNTSNSLEDDSDKNENLGRKSELSLDLDSDVVRETVDSLKLLAQGHGVVLSENMNDRNYADSMSQQDSRNMNYVMLGKPMNNGLMEKMVEESDEEVCLSSLECLRNQCFDLARKLSETNPQERNPQQNMNIRQHVRPEEDFPGRTPDRNYSDMLNLMRLEEQLSPRSRVFASCAKEDGCHERDDDTTSVNSDRSEEVFDMTKGNLTLLEKAIALETERAKAMREKMAMEAGRRDNMRSYEDQSPRQLPGEDRKPKSSDSHVKKPYYDPSRTEKKESKCPTPGCDGTGHVTGLYPHHRSLSGCPHKDRVPPEILAMHESVLKCPTPGCTGRGHVNSNRNSHRSLSGCPIAAAEKLAKAQEKHQSCDVSKSSQASDRVLRPMCFVKQLEIPQYGYRNNVPTTTPRSNLAKELEKYSKTSFEYNSYDNHTYGKRAIAPKVQTRDISPKGYDDAKRYCKDPSPSSSSTSSYAPSSSSNLSCGGGSSASSTCSKSSFDYTHDMEAAHMAATAILNLSTRCREMPQNLSTKPQDLCATRNPDMEVDENGTLDLSMNKQRPRDSCCPILTPLEPMSPQQQAVMNNRCFQLGEGDCWDLPVDYTKMKPRRIDEDESKDITPEDLDPFQEALEERRYPGEVTIPSPKPKYPQCKESKKDLITLSGCPLADKSIRSMLATSSQELKCPTPGCDGSGHITGNYASHRSLSGCPRAKKSGIRIAQSKEDKEDQEPIRCPVPGCDGQGHITGKYASHRSASGCPLAAKRQKDGYLNGSQFSWKSVKTEGMSCPTPGCDGSGHVSGSFLTHRSLSGCPRATSAMKKAKLSGEQMLTIKQRASNGIENDEEIKQLDEEIKELNESNSQMEADMIKLRTQITTMESNLKTIEEENKVIEQQNESLLHELANLSQSLIHSLANIQLPHMDPINEQNFDAYVTTLTEMYTNQDRYQSPENKALLENIKQAVRGIQV,1184,NP_055840.2.csv,refseq-MYT1L-NM_015025.3_clinical_seed_0_final,refseq-MYT1L-NM_015025.3.a2m,Invitae,refseq-MYT1L-NM_015025.3.npy,1,1184,1184
+NP_055844.2,MSNAKERKHAKKMRNQPTNVTLSSGFVADRGVKHHSGGEKPFQAQKQEPHPGTSRQRQTRVNPHSLPDPEVNEQSSSKGMFRKKGGWKAGPEGTSQEIPKYITASTFAQARAAEISAMLKAVTQKSSNSLVFQTLPRHMRRRAMSHNVKRLPRRLQEIAQKEAEKAVHQKKEHSKNKCHKARRCHMNRTLEFNRRQKKNIWLETHIWHAKRFHMVKKWGYCLGERPTVKSHRACYRAMTNRCLLQDLSYYCCLELKGKEEEILKALSGMCNIDTGLTFAAVHCLSGKRQGSLVLYRVNKYPREMLGPVTFIWKSQRTPGDPSESRQLWIWLHPTLKQDILEEIKAACQCVEPIKSAVCIADPLPTPSQEKSQTELPDEKIGKKRKRKDDGENAKPIKKIIGDGTRDPCLPYSWISPTTGIIISDLTMEMNRFRLIGPLSHSILTEAIKAASVHTVGEDTEETPHRWWIETCKKPDSVSLHCRQEAIFELLGGITSPAEIPAGTILGLTVGDPRINLPQKKSKALPNPEKCQDNEKVRQLLLEGVPVECTHSFIWNQDICKSVTENKISDQDLNRMRSELLVPGSQLILGPHESKIPILLIQQPGKVTGEDRLGWGSGWDVLLPKGWGMAFWIPFIYRGVRVGGLKESAVHSQYKRSPNVPGDFPDCPAGMLFAEEQAKNLLEKYKRRPPAKRPNYVKLGTLAPFCCPWEQLTQDWESRVQAYEEPSVASSPNGKESDLRRSEVPCAPMPKKTHQPSDEVGTSIEHPREAEEVMDAGCQESAGPERITDQEASENHVAATGSHLCVLRSRKLLKQLSAWCGPSSEDSRGGRRAPGRGQQGLTREACLSILGHFPRALVWVSLSLLSKGSPEPHTMICVPAKEDFLQLHEDWHYCGPQESKHSDPFRSKILKQKEKKKREKRQKPGRASSDGPAGEEPVAGQEALTLGLWSGPLPRVTLHCSRTLLGFVTQGDFSMAVGCGEALGFVSLTGLLDMLSSQPAAQRGLVLLRPPASLQYRFARIAIEV,1024,NP_055844.2.csv,refseq-POP1-NM_015029.2_clinical_seed_0_final,refseq-POP1-NM_015029.2.a2m,Invitae,refseq-POP1-NM_015029.2.npy,1,1024,1024
+NP_055856.1,MSFRDLRNFTEMMRALGYPRHISMENFRTPNFGLVSEVLLWLVKRYEPQTDIPPDVDTEQDRVFFIKAIAQFMATKAHIKLNTKKLYQADGYAVKELLKITSVLYNAMKTKGMEGSEIVEEDVNKFKFDLGSKIADLKAARQLASEITSKGASLYDLLGMEVELREMRTEAIARPLEINETEKVMRIAIKEILTQVQKTKDLLNNVASDEANLEAKIEKRKLELERNRKRLETLQSVRPCFMDEYEKTEEELQKQYDTYLEKFQNLTYLEQQLEDHHRMEQERFEEAKNTLCLIQNKLKEEEKRLLKSGSNDDSDIDIQEDDESDSELEERRLPKPQTAMEMLMQGRPGKRIVGTMQGGDSDDNEDSEESEIDMEDDDDEDDDLEDESISLSPTKPNRRVRKSEPLDESDNDF,413,NP_055856.1.csv,refseq-CLUAP1-NM_015041.2_clinical_seed_0_final,refseq-CLUAP1-NM_015041.2.a2m,Invitae,refseq-CLUAP1-NM_015041.2.npy,1,413,413
+NP_055862.1,MAAEWASRFWLWATLLIPAAAVYEDQVGKFDWRQQYVGKVKFASLEFSPGSKKLVVATEKNVIAALNSRTGEILWRHVDKGTAEGAVDAMLLHGQDVITVSNGGRIMRSWETNIGGLNWEITLDSGSFQALGLVGLQESVRYIAVLKKTTLALHHLSSGHLKWVEHLPESDSIHYQMVYSYGSGVVWALGVVPFSHVNIVKFNVEDGEIVQQVRVSTPWLQHLSGACGVVDEAVLVCPDPSSRSLQTLALETEWELRQIPLQSLDLEFGSGFQPRVLPTQPNPVDASRAQFFLHLSPSHYALLQYHYGTLSLLKNFPQTALVSFATTGEKTVAAVMACRNEVQKSSSSEDGSMGSFSEKSSSKDSLACFNQTYTINLYLVETGRRLLDTTITFSLEQSGTRPERLYIQVFLKKDDSVGYRALVQTEDHLLLFLQQLAGKVVLWSREESLAEVVCLEMVDLPLTGAQAELEGEFGKKADGLLGMFLKRLSSQLILLQAWTSHLWKMFYDARKPRSQIKNEINIDTLARDEFNLQKMMVMVTASGKLFGIESSSGTILWKQYLPNVKPDSSFKLMVQRTTAHFPHPPQCTLLVKDKESGMSSLYVFNPIFGKWSQVAPPVLKRPILQSLLLPVMDQDYAKVLLLIDDEYKVTAFPATRNVLRQLHELAPSIFFYLVDAEQGRLCGYRLRKDLTTELSWELTIPPEVQRIVKVKGKRSSEHVHSQGRVMGDRSVLYKSLNPNLLAVVTESTDAHHERTFIGIFLIDGVTGRIIHSSVQKKAKGPVHIVHSENWVVYQYWNTKARRNEFTVLELYEGTEQYNATAFSSLDRPQLPQVLQQSYIFPSSISAMEATITERGITSRHLLIGLPSGAILSLPKALLDPRRPEIPTEQSREENLIPYSPDVQIHAERFINYNQTVSRMRGIYTAPSGLESTCLVVAYGLDIYQTRVYPSKQFDVLKDDYDYVLISSVLFGLVFATMITKRLAQVKLLNRAWR,993,NP_055862.1.csv,refseq-EMC1-NM_015047.2_clinical_seed_0_final,refseq-EMC1-NM_015047.2.a2m,Invitae,refseq-EMC1-NM_015047.2.npy,1,993,993
+NP_055884.2,MHKKRVEEGEASDFSLAWDSSVTAAGGLEGEPECDQKTSRALEDRNSVTSQEERNEDDEDMEDESIYTCDHCQQDFESLADLTDHRAHRCPGDGDDDPQLSWVASSPSSKDVASPTQMIGDGCDLGLGEEEGGTGLPYPCQFCDKSFIRLSYLKRHEQIHSDKLPFKCTYCSRLFKHKRSRDRHIKLHTGDKKYHCHECEAAFSRSDHLKIHLKTHSSSKPFKCTVCKRGFSSTSSLQSHMQAHKKNKEHLAKSEKEAKKDDFMCDYCEDTFSQTEELEKHVLTRHPQLSEKADLQCIHCPEVFVDENTLLAHIHQAHANQKHKCPMCPEQFSSVEGVYCHLDSHRQPDSSNHSVSPDPVLGSVASMSSATPDSSASVERGSTPDSTLKPLRGQKKMRDDGQGWTKVVYSCPYCSKRDFNSLAVLEIHLKTIHADKPQQSHTCQICLDSMPTLYNLNEHVRKLHKNHAYPVMQFGNISAFHCNYCPEMFADINSLQEHIRVSHCGPNANPSDGNNAFFCNQCSMGFLTESSLTEHIQQAHCSVGSAKLESPVVQPTQSFMEVYSCPYCTNSPIFGSILKLTKHIKENHKNIPLAHSKKSKAEQSPVSSDVEVSSPKRQRLSASANSISNGEYPCNQCDLKFSNFESFQTHLKLHLELLLRKQACPQCKEDFDSQESLLQHLTVHYMTTSTHYVCESCDKQFSSVDDLQKHLLDMHTFVLYHCTLCQEVFDSKVSIQVHLAVKHSNEKKMYRCTACNWDFRKEADLQVHVKHSHLGNPAKAHKCIFCGETFSTEVELQCHITTHSKKYNCKFCSKAFHAIILLEKHLREKHCVFDAATENGTANGVPPMATKKAEPADLQGMLLKNPEAPNSHEASEDDVDASEPMYGCDICGAAYTMEVLLQNHRLRDHNIRPGEDDGSRKKAEFIKGSHKCNVCSRTFFSENGLREHLQTHRGPAKHYMCPICGERFPSLLTLTEHKVTHSKSLDTGTCRICKMPLQSEEEFIEHCQMHPDLRNSLTGFRCVVCMQTVTSTLELKIHGTFHMQKLAGSSAASSPNGQGLQKLYKCALCLKEFRSKQDLVKLDVNGLPYGLCAGCMARSANGQVGGLAPPEPADRPCAGLRCPECSVKFESAEDLESHMQVDHRDLTPETSGPRKGTQTSPVPRKKTYQCIKCQMTFENEREIQIHVANHMIEEGINHECKLCNQMFDSPAKLLCHLIEHSFEGMGGTFKCPVCFTVFVQANKLQQHIFAVHGQEDKIYDCSQCPQKFFFQTELQNHTMSQHAQ,1284,NP_055884.2.csv,refseq-ZNF423-NM_015069.3_clinical_seed_0_final,refseq-ZNF423-NM_015069.3.a2m,Invitae,refseq-ZNF423-NM_015069.3.npy,1,1284,1284
+NP_055887.3,MPIVMARDLEETASSSEDEEVISQEDHPCIMWTGGCRRIPVLVFHADAILTKDNNIRVIGERYHLSYKIVRTDSRLVRSILTAHGFHEVHPSSTDYNLMWTGSHLKPFLLRTLSEAQKVNHFPRSYELTRKDRLYKNIIRMQHTHGFKAFHILPQTFLLPAEYAEFCNSYSKDRGPWIVKPVASSRGRGVYLINNPNQISLEENILVSRYINNPLLIDDFKFDVRLYVLVTSYDPLVIYLYEEGLARFATVRYDQGAKNIRNQFMHLTNYSVNKKSGDYVSCDDPEVEDYGNKWSMSAMLRYLKQEGRDTTALMAHVEDLIIKTIISAELAIATACKTFVPHRSSCFELYGFDVLIDSTLKPWLLEVNLSPSLACDAPLDLKIKASMISDMFTVVGFVCQDPAQRASTRPIYPTFESSRRNPFQKPQRCRPLSASDAEMKNLVGSAREKGPGKLGGSVLGLSMEEIKVLRRVKEENDRRGGFIRIFPTSETWEIYGSYLEHKTSMNYMLATRLFQDRMTADGAPELKIESLNSKAKLHAALYERKLLSLEVRKRRRRSSRLRAMRPKYPVITQPAEMNVKTETESEEEEEVALDNEDEEQEASQEESAGFLRENQAKYTPSLTALVENTPKENSMKVREWNNKGGHCCKLETQELEPKFNLMQILQDNGNLSKMQARIAFSAYLQHVQIRLMKDSGGQTFSASWAAKEDEQMELVVRFLKRASNNLQHSLRMVLPSRRLALLERRRILAHQLGDFIIVYNKETEQMAEKKSKKKVEEEEEDGVNMENFQEFIRQASEAELEEVLTFYTQKNKSASVFLGTHSKISKNNNNYSDSGAKGDHPETIMEEVKIKPPKQQQTTEIHSDKLSRFTTSAEKEAKLVYSNSSSGPTATLQKIPNTHLSSVTTSDLSPGPCHHSSLSQIPSAIPSMPHQPTILLNTVSASASPCLHPGAQNIPSPTGLPRCRSGSHTIGPFSSFQSAAHIYSQKLSRPSSAKAGSCYLNKHHSGIAKTQKEGEDASLYSKRYNQSMVTAELQRLAEKQAARQYSPSSHINLLTQQVTNLNLATGIINRSSASAPPTLRPIISPSGPTWSTQSDPQAPENHSSSPGSRSLQTGGFAWEGEVENNVYSQATGVVPQHKYHPTAGSYQLQFALQQLEQQKLQSRQLLDQSRARHQAIFGSQTLPNSNLWTMNNGAGCRISSATASGQKPTTLPQKVVPPPSSCASLVPKPPPNHEQVLRRATSQKASKGSSAEGQLNGLQSSLNPAAFVPITSSTDPAHTKI,1281,NP_055887.3.csv,refseq-TTLL5-NM_015072.4_clinical_seed_0_final,refseq-TTLL5-NM_015072.4.a2m,Invitae,refseq-TTLL5-NM_015072.4.npy,1,1281,1281
+NP_055891.1,MDYDFKAKLAAERERVEDLFEYEGCKVGRGTYGHVYKARRKDGKDEKEYALKQIEGTGISMSACREIALLRELKHPNVIALQKVFLSHSDRKVWLLFDYAEHDLWHIIKFHRASKANKKPMQLPRSMVKSLLYQILDGIHYLHANWVLHRDLKPANILVMGEGPERGRVKIADMGFARLFNSPLKPLADLDPVVVTFWYRAPELLLGARHYTKAIDIWAIGCIFAELLTSEPIFHCRQEDIKTSNPFHHDQLDRIFSVMGFPADKDWEDIRKMPEYPTLQKDFRRTTYANSSLIKYMEKHKVKPDSKVFLLLQKLLTMDPTKRITSEQALQDPYFQEDPLPTLDVFAGCQIPYPKREFLNEDDPEEKGDKNQQQQQNQHQQPTAPPQQAAAPPQAPPPQQNSTQTNGTAGGAGAGVGGTGAGLQHSQDSSLNQVPPNKKPRLGPSGANSGGPVMPSDYQHSSSRLNYQSSVQGSSQSQSTLGYSSSSQQSSQYHPSHQAHRY,502,NP_055891.1.csv,refseq-CDK19-NM_015076.4_clinical_seed_0_final,refseq-CDK19-NM_015076.4.a2m,Invitae,refseq-CDK19-NM_015076.4.npy,1,502,502
+NP_055908.1,MAQGSHQIDFQVLHDLRQKFPEVPEVVVSRCMLQNNNNLDACCAVLSQESTRYLYGEGDLNFSDDSGISGLRNHMTSLNLDLQSQNIYHHGREGSRMNGSRTLTHSISDGQLQGGQSNSELFQQEPQTAPAQVPQGFNVFGMSSSSGASNSAPHLGFHLGSKGTSSLSQQTPRFNPIMVTLAPNIQTGRNTPTSLHIHGVPPPVLNSPQGNSIYIRPYITTPGGTTRQTQQHSGWVSQFNPMNPQQVYQPSQPGPWTTCPASNPLSHTSSQQPNQQGHQTSHVYMPISSPTTSQPPTIHSSGSSQSSAHSQYNIQNISTGPRKNQIEIKLEPPQRNNSSKLRSSGPRTSSTSSSVNSQTLNRNQPTVYIAASPPNTDELMSRSQPKVYISANAATGDEQVMRNQPTLFISTNSGASAASRNMSGQVSMGPAFIHHHPPKSRAIGNNSATSPRVVVTQPNTKYTFKITVSPNKPPAVSPGVVSPTFELTNLLNHPDHYVETENIQHLTDPTLAHVDRISETRKLSMGSDDAAYTQALLVHQKARMERLQRELEIQKKKLDKLKSEVNEMENNLTRRRLKRSNSISQIPSLEEMQQLRSCNRQLQIDIDCLTKEIDLFQARGPHFNPSAIHNFYDNIGFVGPVPPKPKDQRSIIKTPKTQDTEDDEGAQWNCTACTFLNHPALIRCEQCEMPRHF,693,NP_055908.1.csv,refseq-TAB2-NM_015093.5_clinical_seed_0_final,refseq-TAB2-NM_015093.5.a2m,Invitae,refseq-TAB2-NM_015093.5.npy,1,693,693
+NP_055915.2,MADTDLFMECEEEELEPWQKISDVIEDSVVEDYNSVDKTTTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGAQNSDSTKKTLVTLIANNNAGNPLVQQGGQPLILTQNPAPGLGTMVTQPVLRPVQVMQNANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVGIVLNVQQGQTVRPITLVPAPGTQFVKPTVGVPQVFSQMTPVRPGSTMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTPTATQPTSLGQLAVQSPGQSNQTTNPKLAPSFPSPPAVSIASFVTVKRPGVTGENSNEVAKLVNTLNTIPSLGQSPGPVVVSNNSSAHGSQRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQFRVTEALRGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVASTPSSTPIPALSPPTKVPEPNENVGDAVQTKLIMLVDDFYYGRDGGKVAQLTNFPKVATSFRCPHCTKRLKNNIRFMNHMKHHVELDQQNGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTTKCKICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVDVHFRMIHEDTRHLLCPYCLKVFKNGNAFQQHYMRHQKRNVYHCNKCRLQFLFAKDKIEHKLQHHKTFRKPKQLEGLKPGTKVTIRASRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFLYPPVQRSIQKRAVRKMSVMGRQTCLECSFEIPDFPNHFPTYVHCSLCRYSTCCSRAYANHMINNHVPRKSPKYLALFKNSVSGIKLACTSCTFVTSVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHGQTRDRVHDRNVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPASTATPPPTPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPELASGGGGSGGVGKKEQLSVKKLRVVLFALCCNTEQAAEHFRNPQRRIRRWLRRFQASQGENLEGKYLSFEAEEKLAEWVLTQREQQLPVNEETLFQKATKIGRSLEGGFKISYEWAVRFMLRHHLTPHARRAVAHTLPKDVAENAGLFIDFVQRQIHNQDLPLSMIVAIDEISLFLDTEVLSSDDRKENALQTVGTGEPWCDVVLAILADGTVLPTLVFYRGQMDQPANMPDSILLEAKESGYSDDEIMELWSTRVWQKHTACQRSKGMLVMDCHRTHLSEEVLAMLSASSTLPAVVPAGCSSKIQPLDVCIKRTVKNFLHKKWKEQAREMADTACDSDVLLQLVLVWLGEVLGVIGDCPELVQRSFLVASVLPGPDGNINSPTRNADMQEELIASLEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETESFYGFEEADLDLMEI,1410,NP_055915.2.csv,refseq-POGZ-NM_015100.3_clinical_seed_0_final,refseq-POGZ-NM_015100.3.a2m,Invitae,refseq-POGZ-NM_015100.3.npy,1,1410,1410
+NP_055917.1,MNDWHRIFTQNVLVPPHPQRARQPWKESTAFQCVLKWLDGPVIRQGVLEVLSEVECHLRVSFFDVTYRHFFGRTWKTTVKPTKRPPSRIVFNEPLYFHTSLNHPHIVAVVEVVAEGKKRDGSLQTLSCGFGILRIFSNQPDSPISASQDKRLRLYHGTPRALLHPLLQDPAEQNRHMTLIENCSLQYTLKPHPALEPAFHLLPENLLVSGLQQIPGLLPAHGESGDALRKPRLQKPITGHLDDLFFTLYPSLEKFEEELLELHVQDHFQEGCGPLDGGALEILERRLRVGVHNGLGFVQRPQVVVLVPEMDVALTRSASFSRKVVSSSKTSSGSQALVLRSRLRLPEMVGHPAFAVIFQLEYVFSSPAGVDGNAASVTSLSNLACMHMVRWAVWNPLLEADSGRVTLPLQGGIQPNPSHCLVYKVPSASMSSEEVKQVESGTLRFQFSLGSEEHLDAPTEPVSGPKVERRPSRKPPTSPSSPPAPVPRVLAAPQNSPVGPGLSISQLAASPRSPTQHCLARPTSQLPHGSQASPAQAQEFPLEAGISHLEADLSQTSLVLETSIAEQLQELPFTPLHAPIVVGTQTRSSAGQPSRASMVLLQSSGFPEILDANKQPAEAVSATEPVTFNPQKEESDCLQSNEMVLQFLAFSRVAQDCRGTSWPKTVYFTFQFYRFPPATTPRLQLVQLDEAGQPSSGALTHILVPVSRDGTFDAGSPGFQLRYMVGPGFLKPGERRCFARYLAVQTLQIDVWDGDSLLLIGSAAVQMKHLLRQGRPAVQASHELEVVATEYEQDNMVVSGDMLGFGRVKPIGVHSVVKGRLHLTLANVGHPCEQKVRGCSTLPPSRSRVISNDGASRFSGGSLLTTGSSRRKHVVQAQKLADVDSELAAMLLTHARQGKGPQDVSRESDATRRRKLERMRSVRLQEAGGDLGRRGTSVLAQQSVRTQHLRDLQVIAAYRERTKAESIASLLSLAITTEHTLHATLGVAEFFEFVLKNPHNTQHTVTVEIDNPELSVIVDSQEWRDFKGAAGLHTPVEEDMFHLRGSLAPQLYLRPHETAHVPFKFQSFSAGQLAMVQASPGLSNEKGMDAVSPWKSSAVPTKHAKVLFRASGGKPIAVLCLTVELQPHVVDQVFRFYHPELSFLKKAIRLPPWHTFPGAPVGMLGEDPPVHVRCSDPNVICETQNVGPGEPRDIFLKVASGPSPEIKDFFVIIYSDRWLATPTQTWQVYLHSLQRVDVSCVAGQLTRLSLVLRGTQTVRKVRAFTSHPQELKTDPKGVFVLPPRGVQDLHVGVRPLRAGSRFVHLNLVDVDCHQLVASWLVCLCCRQPLISKAFEIMLAAGEGKGVNKRITYTNPYPSRRTFHLHSDHPELLRFREDSFQVGGGETYTIGLQFAPSQRVGEEEILIYINDHEDKNEEAFCVKVIYQ,1426,NP_055917.1.csv,refseq-NPHP4-NM_015102.3_clinical_seed_0_final,refseq-NPHP4-NM_015102.3.a2m,Invitae,refseq-NPHP4-NM_015102.3.npy,1,1426,1426
+NP_055929.1,MLWPRLAAAEWAALAWELLGASVLLIAVRWLVRRLGPRPGGLGRSGTPVPPPSAAAAPASGEMTMDALLARLKLLNPDDLREEIVKAGLKCGPITSTTRFIFEKKLAQALLEQGGRLSSFYHHEAGVTALSQDPQRILKPAEGNPTDQAGFSEDRDFGYSVGLNPPEEEAVTSKTCSVPPSDTDTYRAGATASKEPPLYYGVCPVYEDVPARNERIYVYENKKEALQAVKMIKGSRFKAFSTREDAEKFARGICDYFPSPSKTSLPLSPVKTAPLFSNDRLKDGLCLSESETVNKERANSYKNPRTQDLTAKLRKAVEKGEEDTFSDLIWSNPRYLIGSGDNPTIVQEGCRYNVMHVAAKENQASICQLTLDVLENPDFMRLMYPDDDEAMLQKRIRYVVDLYLNTPDKMGYDTPLHFACKFGNADVVNVLSSHHLIVKNSRNKYDKTPEDVICERSKNKSVELKERIREYLKGHYYVPLLRAEETSSPVIGELWSPDQTAEASHVSRYGGSPRDPVLTLRAFAGPLSPAKAEDFRKLWKTPPREKAGFLHHVKKSDPERGFERVGRELAHELGYPWVEYWEFLGCFVDLSSQEGLQRLEEYLTQQEIGKKAQQETGEREASCRDKATTSGSNSISVRAFLDEDDMSLEEIKNRQNAARNNSPPTVGAFGHTRCSAFPLEQEADLIEAAEPGGPHSSRNGLCHPLNHSRTLAGKRPKAPRGEEAHLPPVSDLTVEFDKLNLQNIGRSVSKTPDESTKTKDQILTSRINAVERDLLEPSPADQLGNGHRRTESEMSARIAKMSLSPSSPRHEDQLEVTREPARRLFLFGEEPSKLDQDVLAALECADVDPHQFPAVHRWKSAVLCYSPSDRQSWPSPAVKGRFKSQLPDLSGPHSYSPGRNSVAGSNPAKPGLGSPGRYSPVHGSQLRRMARLAELAAL,938,NP_055929.1.csv,refseq-ANKLE2-NM_015114.2_clinical_seed_0_final,refseq-ANKLE2-NM_015114.2.a2m,Invitae,refseq-ANKLE2-NM_015114.2.npy,1,938,938
+NP_055940.3,MYSAHRPLMPASSAASRGLGMFVWTNVEPRSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAESAAVAHERPPGGTGSADPERPPGATCPESPGPGPPHPLGVVESGKGPPPTTEEEASGPPGEPRLDSETESDHDDAFLSIMSPEIQLPLPPGKRRTQSLSALPKERDSSSEKDGRSPNKREKDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKKSSSEAKPTSLGLAGGHKETRERSMSETGTAAAPGVSSELLSVAAQTLLSSDTKAPGSSSCGAERLHTVGGPGSARPRAFSHSGVHSLDGGEVDSQALQELTQMVSGPASYSGPKPSTQYGAPGPFAAPGEGGALAATGRPPLLPTRASRSQRAASEDMTSDEERMVICEEEGDDDVIADDGFGTTDIDLKCKERVTDSESGDSSGEDPEGNKGFGRKVFSPVIRSSFTHCRPPLDPEPPGPPDPPVAFGKGYGSAPSSSASSPASSSASAATSFSLGSGTFKAQESGQGSTAGPLRPPPPGAGGPATPSKATRFLPMDPATFRRKRPESVGGLEPPGPSVIAAPPSGGGNILQTLVLPPNKEEQEGGGARVPSAPAPSLAYGAPAAPLSRPAATMVTNVVRPVSSTPVPIASKPFPTSGRAEASPNDTAGARTEMGTGSRVPGGSPLGVSLVYSDKKSAAATSPAPHLVAGPLLGTVGKAPATVTNLLVGTPGYGAPAPPAVQFIAQGAPGGGTTAGSGAGAGSGPNGPVPLGILQPGALGKAGGITQVQYILPTLPQQLQVAPAPAPAPGTKAAAPSGPAPTTSIRFTLPPGTSTNGKVLAATAPTPGIPILQSVPSAPPPKAQSVSPVQAPPPGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSPPFSVPVQNGAQPPSKIIQLTPVPVSTPSGLVPPLSPATLPGPTSQPQKVLLPSSTRITYVQSAGGHALPLGTSPASSQAGTVTSYGPTSSVALGFTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACAAPGGPVITAFYSGSPAPTSSAPLAQPSQAPPSLVYTVATSTTPPAATILPKGPPAPATATPAPTSPFPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPVGSFEAGASGRPGPAPRQPLEPGPVREPTAPESELEGQPTPPAPPPLPETWTPTARSSPPLPPPAEERTSAKGPETMASKFPSSSSDWRVPGQGLENRGEPPTPPSPAPAPAVAPGGSSESSSGRAAGDTPERKEAAGTGKKVKVRPPPLKKTFDSVDNRVLSEVDFEERFAELPEFRPEEVLPSPTLQSLATSPRAILGSYRKKRKNSTDLDSAPEDPTSPKRKMRRRSSCSSEPNTPKSAKCEGDIFTFDRTGTEAEDVLGELEYDKVPYSSLRRTLDQRRALVMQLFQDHGFFPSAQATAAFQARYADIFPSKVCLQLKIREVRQKIMQAATPTEQPPGAEAPLPVPPPTGTAAAPAPTPSPAGGPDPTSPSSDSGTAQAAPPLPPPPESGPGQPGWEGAPQPSPPPPGPSTAATGR,1608,NP_055940.3.csv,refseq-CIC-NM_015125.4_clinical_seed_0_final,refseq-CIC-NM_015125.4.a2m,Invitae,refseq-CIC-NM_015125.4.npy,1,1608,1608
+NP_055975.1,MAAVVLAATRLLRGSGSWGCSRLRFGPPAYRRFSSGGAYPNIPLSSPLPGVPKPVFATVDGQEKFETKVTTLDNGLRVASQNKFGQFCTVGILINSGSRYEAKYLSGIAHFLEKLAFSSTARFDSKDEILLTLEKHGGICDCQTSRDTTMYAVSADSKGLDTVVALLADVVLQPRLTDEEVEMTRMAVQFELEDLNLRPDPEPLLTEMIHEAAYRENTVGLHRFCPTENVAKINREVLHSYLRNYYTPDRMVLAGVGVEHEHLVDCARKYLLGVQPAWGSAEAVDIDRSVAQYTGGIAKLERDMSNVSLGPTPIPELTHIMVGLESCSFLEEDFIPFAVLNMMMGGGGSFSAGGPGKGMFSRLYLNVLNRHHWMYNATSYHHSYEDTGLLCIHASADPRQVREMVEIITKEFILMGGTVDTVELERAKTQLTSMLMMNLESRPVIFEDVGRQVLATRSRKLPHELCTLIRNVKPEDVKRVASKMLRGKPAVAALGDLTDLPTYEHIQTALSSKDGRLPRTYRLFR,525,NP_055975.1.csv,refseq-PMPCA-NM_015160.2_clinical_seed_0_final,refseq-PMPCA-NM_015160.2.a2m,Invitae,refseq-PMPCA-NM_015160.2.npy,1,525,525
+NP_055981.1,MTQEPFREELAYDRMPTLERGRQDPASYAPDAKPSDLQLSKRLPPCFSHKTWVFSVLMGSCLLVTSGFSLYLGNVFPAEMDYLRCAAGSCIPSAIVSFTVSRRNANVIPNFQILFVSTFAVTTTCLIWFGCKLVLNPSAININFNLILLLLLELLMAATVIIAARSSEEDCKKKKGSMSDSANILDEVPFPARVLKSYSVVEVIAGISAVLGGIIALNVDDSVSGPHLSVTFFWILVACFPSAIASHVAAECPSKCLVEVLIAISSLTSPLLFTASGYLSFSIMRIVEMFKDYPPAIKPSYDVLLLLLLLVLLLQAGLNTGTAIQCVRFKVSARLQGASWDTQNGPQERLAGEVARSPLKEFDKEKAWRAVVVQMAQ,377,NP_055981.1.csv,refseq-MLC1-NM_015166.3_clinical_seed_0_final,refseq-MLC1-NM_015166.3.a2m,Invitae,refseq-MLC1-NM_015166.3.npy,1,377,377
+NP_056000.1,MTLLITGDSIVSAEAVWDHVTMANRELAFKAGDVIKVLDASNKDWWWGQIDDEEGWFPASFVRLWVNQEDEVEEGPSDVQNGHLDPNSDCLCLGRPLQNRDQMRANVINEIMSTERHYIKHLKDICEGYLKQCRKRRDMFSDEQLKVIFGNIEDIYRFQMGFVRDLEKQYNNDDPHLSEIGPCFLEHQDGFWIYSEYCNNHLDACMELSKLMKDSRYQHFFEACRLLQQMIDIAIDGFLLTPVQKICKYPLQLAELLKYTAQDHSDYRYVAAALAVMRNVTQQINERKRRLENIDKIAQWQASVLDWEGEDILDRSSELIYTGEMAWIYQPYGRNQQRVFFLFDHQMVLCKKDLIRRDILYYKGRIDMDKYEVVDIEDGRDDDFNVSMKNAFKLHNKETEEIHLFFAKKLEEKIRWLRAFREERKMVQEDEKIGFEISENQKRQAAMTVRKVPKQKGVNSARSVPPSYPPPQDPLNHGQYLVPDGIAQSQVFEFTEPKRSQSPFWQNFSRLTPFKK,516,NP_056000.1.csv,refseq-ARHGEF9-NM_015185.2_clinical_seed_0_final,refseq-ARHGEF9-NM_015185.2.a2m,Invitae,refseq-ARHGEF9-NM_015185.2.npy,1,516,516
+NP_056007.1,MAGAQPGVHALQLKPVCVSDSLKKGTKFVKWDDDSTIVTPIILRTDPQGFFFYWTDQNKETELLDLSLVKDARCGRHAKAPKDPKLRELLDVGNIGRLEQRMITVVYGPDLVNISHLNLVAFQEEVAKEWTNEVFSLATNLLAQNMSRDAFLEKAYTKLKLQVTPEGRIPLKNIYRLFSADRKRVETALEACSLPSSRNDSIPQEDFTPEVYRVFLNNLCPRPEIDNIFSEFGAKSKPYLTVDQMMDFINLKQRDPRLNEILYPPLKQEQVQVLIEKYEPNNSLARKGQISVDGFMRYLSGEENGVVSPEKLDLNEDMSQPLSHYFINSSHNTYLTAGQLAGNSSVEMYRQVLLSGCRCVELDCWKGRTAEEEPVITHGFTMTTEISFKEVIEAIAECAFKTSPFPILLSFENHVDSPKQQAKMAEYCRLIFGDALLMEPLEKYPLESGVPLPSPMDLMYKILVKNKKKSHKSSEGSGKKKLSEQASNTYSDSSSMFEPSSPGAGEADTESDDDDDDDDCKKSSMDEGTAGSEAMATEEMSNLVNYIQPVKFESFEISKKRNKSFEMSSFVETKGLEQLTKSPVEFVEYNKMQLSRIYPKGTRVDSSNYMPQLFWNAGCQMVALNFQTMDLAMQINMGMYEYNGKSGYRLKPEFMRRPDKHFDPFTEGIVDGIVANTLSVKIISGQFLSDKKVGTYVEVDMFGLPVDTRRKAFKTKTSQGNAVNPVWEEEPIVFKKVVLPTLACLRIAVYEEGGKFIGHRILPVQAIRPGYHYICLRNERNQPLTLPAVFVYIEVKDYVPDTYADVIEALSNPIRYVNLMEQRAKQLAALTLEDEEEVKKEADPGETPSEAPSEARTTPAENGVNHTTTLTPKPPSQALHSQPAPGSVKAPAKTEDLIQSVLTEVEAQTIEELKQQKSFVKLQKKHYKEMKDLVKRHHKKTTDLIKEHTTKYNEIQNDYLRRRAALEKSAKKDSKKKSEPSSPDHGSSTIEQDLAALDAEMTQKLIDLKDKQQQQLLNLRQEQYYSEKYQKREHIKLLIQKLTDVAEECQNNQLKKLKEICEKEKKELKKKMDKKRQEKITEAKSKDKSQMEEEKTEMIRSYIQEVVQYIKRLEEAQSKRQEKLVEKHKEIRQQILDEKPKLQVELEQEYQDKFKRLPLEILEFVQEAMKGKISEDSNHGSAPLSLSSDPGKVNHKTPSSEELGGDIPGKEFDTPL,1216,NP_056007.1.csv,refseq-PLCB1-NM_015192.3_clinical_seed_0_final,refseq-PLCB1-NM_015192.3.a2m,Invitae,refseq-PLCB1-NM_015192.3.npy,1,1216,1216
+NP_056028.2,MSGGGGGGGSAPSRFADYFVICGLDTETGLEPDELSALCQYIQASKARDGASPFISSTTEGENFEQTPLRRTFKSKVLARYPENVEWNPFDQDAVGMLCMPKGLAFKTQADPREPQFHAFIITREDGSRTFGFALTFYEEVTSKQICSAMQTLYHMHNAEYDVLHAPPADDRDQSSMEDGEDTPVTKLQRFNSYDISRDTLYVSKCICLITPMSFMKACRSVLEQLHQAVTSPQPPPLPLESYIYNVLYEVPLPPPGRSLKFSGVYGPIICQRPSTNELPLFDFPVKEVFELLGVENVFQLFTCALLEFQILLYSQHYQRLMTVAETITALMFPFQWQHVYVPILPASLLHFLDAPVPYLMGLHSNGLDDRSKLELPQEANLCFVDIDNHFIELPEDLPQFPNKLEFVQEVSEILMAFGIPPEGNLHCSESASKLKRLRASELVSDKRNGNIAGSPLHSYELLKENETIARLQALVKRTGVSLEKLEVREDPSSNKDLKVQCDEEELRIYQLNIQIREVFANRFTQMFADYEVFVIQPSQDKESWFTNREQMQNFDKASFLSDQPEPYLPFLSRFLETQMFASFIDNKIMCHDDDDKDPVLRVFDSRVDKIRLLNVRTPTLRTSMYQKCTTVDEAEKAIELRLAKIDHTAIHPHLLDMKIGQGKYEPGFFPKLQSDVLSTGPASNKWTKRNAPAQWRRKDRQKQHTEHLRLDNDQREKYIQEARTMGSTIRQPKLSNLSPSVIAQTNWKFVEGLLKECRNKTKRMLVEKMGREAVELGHGEVNITGVEENTLIASLCDLLERIWSHGLQVKQGKSALWSHLLHYQDNRQRKLTSGSLSTSGILLDSERRKSDASSLMPPLRISLIQDMRHIQNIGEIKTDVGKARAWVRLSMEKKLLSRHLKQLLSDHELTKKLYKRYAFLRCDDEKEQFLYHLLSFNAVDYFCFTNVFTTILIPYHILIVPSKKLGGSMFTANPWICISGELGETQIMQIPRNVLEMTFECQNLGKLTTVQIGHDNSGLYAKWLVEYVMVRNEITGHTYKFPCGRWLGKGMDDGSLERILVGELLTSQPEVDERPCRTPPLQQSPSVIRRLVTISPNNKPKLNTGQIQESIGEAVNGIVKHFHKPEKERGSLTLLLCGECGLVSALEQAFQHGFKSPRLFKNVFIWDFLEKAQTYYETLEKNEVVPEENWHTRARNFCRFVTAINNTPRNIGKDGKFQMLVCLGARDHLLHHWIALLADCPITAHMYEDVALIKDHTLVNSLIRVLQTLQEFNITLETSLVKGIDI,1287,NP_056028.2.csv,refseq-DENND5A-NM_015213.3_clinical_seed_0_final,refseq-DENND5A-NM_015213.3.a2m,Invitae,refseq-DENND5A-NM_015213.3.npy,1,1287,1287
+NP_056029.2,MSSVQSQQEQLSQSDPSPSPNSCSSFELIDMDAGSLYEPVSPHWFYCKIIDSKETWIPFNSEDSQQLEEAYSSGKGCNGRVVPTDGGRYDVHLGERMRYAVYWDELASEVRRCTWFYKGDKDNKYVPYSESFSQVLEETYMLAVTLDEWKKKLESPNREIIILHNPKLMVHYQPVAGSDDWGSTPTEQGRPRTVKRGVENISVDIHCGEPLQIDHLVFVVHGIGPACDLRFRSIVQCVNDFRSVSLNLLQTHFKKAQENQQIGRVEFLPVNWHSPLHSTGVDVDLQRITLPSINRLRHFTNDTILDVFFYNSPTYCQTIVDTVASEMNRIYTLFLQRNPDFKGGVSIAGHSLGSLILFDILTNQKDSLGDIDSEKDSLNIVMDQGDTPTLEEDLKKLQLSEFFDIFEKEKVDKEALALCTDRDLQEIGIPLGPRKKILNYFSTRKNSMGIKRPAPQPASGANIPKESEFCSSSNTRNGDYLDVGIGQVSVKYPRLIYKPEIFFAFGSPIGMFLTVRGLKRIDPNYRFPTCKGFFNIYHPFDPVAYRIEPMVVPGVEFEPMLIPHHKGRKRMHLELREGLTRMSMDLKNNLLGSLRMAWKSFTRAPYPALQASETPEETEAEPESTSEKPSDVNTEETSVAVKEEVLPINVGMLNGGQRIDYVLQEKPIESFNEYLFALQSHLCYWESEDTVLLVLKEIYQTQGIFLDQPLQ,711,NP_056029.2.csv,refseq-DDHD2-NM_015214.2_clinical_seed_0_final,refseq-DDHD2-NM_015214.2.a2m,Invitae,refseq-DDHD2-NM_015214.2.npy,1,711,711
+NP_056062.1,MSSGLWSQEKVTSPYWEERIFYLLLQECSVTDKQTQKLLKVPKGSIGQYIQDRSVGHSRIPSAKGKKNQIGLKILEQPHAVLFVDEKDVVEINEKFTELLLAITNCEERFSLFKNRNRLSKGLQIDVGCPVKVQLRSGEEKFPGVVRFRGPLLAERTVSGIFFGVELLEEGRGQGFTDGVYQGKQLFQCDEDCGVFVALDKLELIEDDDTALESDYAGPGDTMQVELPPLEINSRVSLKVGETIESGTVIFCDVLPGKESLGYFVGVDMDNPIGNWDGRFDGVQLCSFACVESTILLHINDIIPALSESVTQERRPPKLAFMSRGVGDKGSSSHNKPKATGSTSDPGNRNRSELFYTLNGSSVDSQPQSKSKNTWYIDEVAEDPAKSLTEISTDFDRSSPPLQPPPVNSLTTENRFHSLPFSLTKMPNTNGSIGHSPLSLSAQSVMEELNTAPVQESPPLAMPPGNSHGLEVGSLAEVKENPPFYGVIRWIGQPPGLNEVLAGLELEDECAGCTDGTFRGTRYFTCALKKALFVKLKSCRPDSRFASLQPVSNQIERCNSLAFGGYLSEVVEENTPPKMEKEGLEIMIGKKKGIQGHYNSCYLDSTLFCLFAFSSVLDTVLLRPKEKNDVEYYSETQELLRTEIVNPLRIYGYVCATKIMKLRKILEKVEAASGFTSEEKDPEEFLNILFHHILRVEPLLKIRSAGQKVQDCYFYQIFMEKNEKVGVPTIQQLLEWSFINSNLKFAEAPSCLIIQMPRFGKDFKLFKKIFPSLELNITDLLEDTPRQCRICGGLAMYECRECYDDPDISAGKIKQFCKTCNTQVHLHPKRLNHKYNPVSLPKDLPDWDWRHGCIPCQNMELFAVLCIETSHYVAFVKYGKDDSAWLFFDSMADRDGGQNGFNIPQVTPCPEVGEYLKMSLEDLHSLDSRRIQGCARRLLCDAYMCMYQSPTMSLYK,956,NP_056062.1.csv,refseq-CYLD-NM_015247.2_clinical_seed_0_final,refseq-CYLD-NM_015247.2.a2m,Invitae,refseq-CYLD-NM_015247.2.npy,1,956,956
+NP_056080.1,MERRSESPCLRDSPDRRSGSPDVKGPPPVKVARLEQNGSPMGARGRPNGAVAKAVGGLMIPVFCVVEQLDGSLEYDNREEHAEFVLVRKDVLFSQLVETALLALGYSHSSAAQAQGIIKLGRWNPLPLSYVTDAPDATVADMLQDVYHVVTLKIQLQSCSKLEDLPAEQWNHATVRNALKELLKEMNQSTLAKECPLSQSMISSIVNSTYYANVSATKCQEFGRWYKKYKKIKVERVERENLSDYCVLGQRPMHLPNMNQLASLGKTNEQSPHSQIHHSTPIRNQVPALQPIMSPGLLSPQLSPQLVRQQIAMAHLINQQIAVSRLLAHQHPQAINQQFLNHPPIPRAVKPEPTNSSVEVSPDIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEEDPRTASQSLLVNLRAMQNFLNLPEVERDRIYQDERERSMNPNVSMVSSASSSPSSSRTPQAKTSTPTTDLPIKVDGANINITAAIYDEIQQEMKRAKVSQALFAKVAANKSQGWLCELLRWKENPSPENRTLWENLCTIRRFLNLPQHERDVIYEEESRHHHSERMQHVVQLPPEPVQVLHRQQSQPAKESSPPREEAPPPPPPTEDSCAKKPRSRTKISLEALGILQSFIHDVGLYPDQEAIHTLSAQLDLPKHTIIKFFQNQRYHVKHHGKLKEHLGSAVDVAEYKDEELLTESEENDSEEGSEEMYKVEAEEENADKSKAAPAEIDQR,733,NP_056080.1.csv,refseq-SATB2-NM_015265.3_clinical_seed_0_final,refseq-SATB2-NM_015265.3.a2m,Invitae,refseq-SATB2-NM_015265.3.npy,1,733,733
+NP_056082.2,MAANVGSMFQYWKRFDLRRLQKELNSVASELSARQEESEHSHKHLIELRREFKKNVPEEIREMVAPVLKSFQAEVVALSKRSQEAEAAFLSVYKQLIEAPDPVPVFEAARSLDDRLQPPSFDPSGQPRRDLHTSWKRNPELLSPKEQREGTSPAGPTLTEGSRLPGIPGKALLTETLLQRNEAEKQKGLQEVQITLAARLGEAEEKIKVLHSALKATQAELLELRRKYDEEAASKADEVGLIMTNLEKANQRAEAAQREVESLREQLASVNSSIRLACCSPQGPSGDKVNFTLCSGPRLEAALASKDREILRLLKDVQHLQSSLQELEEASANQIADLERQLTAKSEAIEKLEEKLQAQSDYEEIKTELSILKAMKLASSTCSLPQGMAKPEDSLLIAKEAFFPTQKFLLEKPSLLASPEEDPSEDDSIKDSLGTEQSYPSPQQLPPPPGPEDPLSPSPGQPLLGPSLGPDGTRTFSLSPFPSLASGERLMMPPAAFKGEAGGLLVFPPAFYGAKPPTAPATPAPGPEPLGGPEPADGGGGGAAGPGAEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPKPWRKLTVKGKEPFIKMKQFLSDEQNVLALRTIQVRQRGSITPRIRTPETGSDDAIKSILEQAKKEIESQKGGEPKTSVAPLSIANGTTPASTSEDAIKSILEQARREMQAQQQALLEMEVAPRGRSVPPSPPERPSLATASQNGAPALVKQEEGSGGPAQAPLPVLSPAAFVQSIIRKVKSEIGDAGYFDHHWASDRGLLSRPYASVSPSLSSSSSSGYSGQPNGRAWPRGDEAPVPPEDEAAAGAEDEPPRTGELKAEGATAEAGARLPYYPAYVPRTLKPTVPPLTPEQYELYMYREVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWLSDQLGQAVGQQPGASQASPTEPRSSPSPPPSPTEPEKSSQEPLSLSLESSKENQQPEGRSSSSLSGKMYSGSQAPGGIQEIVAMSPELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQGSVSDLLSRPKPWHKLSLKGREPFVRMQLWLNDPHNVEKLRDMKKLEKKAYLKRRYGLISTGSDSESPATRSECPSPCLQPQDLSLLQIKKPRVVLAPEEKEALRKAYQLEPYPSQQTIELLSFQLNLKTNTVINWFHNYRSRMRREMLVEGTQDEPDLDPSGGPGILPPGHSHPDPTPQSPDSETEDQKPTVKELELQEGPEENSTPLTTQDKAQVRIKQEQMEEDAEEEAGSQPQDSGELDKGQGPPKEEHPDPPGNDGLPKVAPGPLLPGGSTPDCPSLHPQQESEAGERLHPDPLSFKSASESSRCSLEVSLNSPSAASSPGLMMSVSPVPSSSAPISPSPPGAPPAKVPSASPTADMAGALHPSAKVNPNLQRRHEKMANLNNIIYRVERAANREEALEWEF,1486,NP_056082.2.csv,refseq-CUX2-NM_015267.3_clinical_seed_0_final,refseq-CUX2-NM_015267.3.a2m,Invitae,refseq-CUX2-NM_015267.3.npy,1,1486,1486
+NP_056085.1,MSWFSGLLVPKVDERKTAWGERNGQKRSRRRGTRAGGFCTPRYMSCLRDAEPPSPTPAGPPRCPWQDDAFIRRGGPGKGKELGLRAVALGFEDTEVTTTAGGTAEVAPDAVPRSGRSCWRRLVQVFQSKQFRSAKLERLYQRYFFQMNQSSLTLLMAVLVLLTAVLLAFHAAPARPQPAYVALLACAAALFVGLMVVCNRHSFRQDSMWVVSYVVLGILAAVQVGGALAADPRSPSAGLWCPVFFVYIAYTLLPIRMRAAVLSGLGLSTLHLILAWQLNRGDAFLWKQLGANVLLFLCTNVIGICTHYPAEVSQRQAFQETRGYIQARLHLQHENRQQERLLLSVLPQHVAMEMKEDINTKKEDMMFHKIYIQKHDNVSILFADIEGFTSLASQCTAQELVMTLNELFARFDKLAAENHCLRIKILGDCYYCVSGLPEARADHAHCCVEMGVDMIEAISLVREVTGVNVNMRVGIHSGRVHCGVLGLRKWQFDVWSNDVTLANHMEAGGRAGRIHITRATLQYLNGDYEVEPGRGGERNAYLKEQHIETFLILGASQKRKEEKAMLAKLQRTRANSMEGLMPRWVPDRAFSRTKDSKAFRQMGIDDSSKDNRGTQDALNPEDEVDEFLSRAIDARSIDQLRKDHVRRFLLTFQREDLEKKYSRKVDPRFGAYVACALLVFCFICFIQLLIFPHSTLMLGIYASIFLLLLITVLICAVYSCGSLFPKALQRLSRSIVRSRAHSTAVGIFSVLLVFTSAIANMFTCNHTPIRSCAARMLNLTPADITACHLQQLNYSLGLDAPLCEGTMPTCSFPEYFIGNMLLSLLASSVFLHISSIGKLAMIFVLGLIYLVLLLLGPPATIFDNYDLLLGVHGLASSNETFDGLDCPAAGRVALKYMTPVILLVFALALYLHAQQVESTARLDFLWKLQATGEKEEMEELQAYNRRLLHNILPKDVAAHFLARERRNDELYYQSCECVAVMFASIANFSEFYVELEANNEGVECLRLLNEIIADFDEIISEERFRQLEKIKTIGSTYMAASGLNASTYDQVGRSHITALADYAMRLMEQMKHINEHSFNNFQMKIGLNMGPVVAGVIGARKPQYDIWGNTVNVSSRMDSTGVPDRIQVTTDLYQVLAAKGYQLECRGVVKVKGKGEMTTYFLNGGPSS,1168,NP_056085.1.csv,refseq-ADCY6-NM_015270.4_clinical_seed_0_final,refseq-ADCY6-NM_015270.4.a2m,Invitae,refseq-ADCY6-NM_015270.4.npy,1,1168,1168
+NP_056087.2,MSGPTDETAGDLPVKDTGLNLFGMGGLQETSTTRTMKSRQAVSRVSREELEDRFLRLHDENILLKQHARKQEDKIKRMATKLIRLVNDKKRYERVGGGPKRLGRDVEMEEMIEQLQEKVHELEKQNETLKNRLISAKQQLQTQGYRQTPYNNVQSRINTGRRKANENAGLQECPRKGIKFQDADVAETPHPMFTKYGNSLLEEARGEIRNLENVIQSQRGQIEELEHLAEILKTQLRRKENEIELSLLQLREQQATDQRSNIRDNVEMIKLHKQLVEKSNALSAMEGKFIQLQEKQRTLRISHDALMANGDELNMQLKEQRLKCCSLEKQLHSMKFSERRIEELQDRINDLEKERELLKENYDKLYDSAFSAAHEEQWKLKEQQLKVQIAQLETALKSDLTDKTEILDRLKTERDQNEKLVQENRELQLQYLEQKQQLDELKKRIKLYNQENDINADELSEALLLIKAQKEQKNGDLSFLVKVDSEINKDLERSMRELQATHAETVQELEKTRNMLIMQHKINKDYQMEVEAVTRKMENLQQDYELKVEQYVHLLDIRAARIHKLEAQLKDIAYGTKQYKFKPEIMPDDSVDEFDETIHLERGENLFEIHINKVTFSSEVLQASGDKEPVTFCTYAFYDFELQTTPVVRGLHPEYNFTSQYLVHVNDLFLQYIQKNTITLEVHQAYSTEYETIAACQLKFHEILEKSGRIFCTASLIGTKGDIPNFGTVEYWFRLRVPMDQAIRLYRERAKALGYITSNFKGPEHMQSLSQQAPKTAQLSSTDSTDGNLNELHITIRCCNHLQSRASHLQPHPYVVYKFFDFADHDTAIIPSSNDPQFDDHMYFPVPMNMDLDRYLKSESLSFYVFDDSDTQENIYIGKVNVPLISLAHDRCISGIFELTDHQKHPAGTIHVILKWKFAYLPPSGSITTEDLGNFIRSEEPEVVQRLPPASSVSTLVLAPRPKPRQRLTPVDKKVSFVDIMPHQSDETSPPPEDRKEISPEVEHIPEIEINMLTVPHVPKVSQEGSVDEVKENTEKMQQGKDDVSLLSEGQLAEQSLASSEDETEITEDLEPEVEEDMSASDSDDCIIPGPISKNIKQSLALSPGLGCSSAISAHCNFRLPGSSDFPASASQVDGITGACHHTQPSEKIRIEIIALSLNDSQVTMDDTIQRLFVECRFYSLPAEETPVSLPKPKSGQWVYYNYSNVIYVDKENNKAKRDILKAILQKQEMPNRSLRFTVVSDPPEDEQDLECEDIGVAHVDLADMFQEGRDLIEQNIDVFDARADGEGIGKLRVTVEALHALQSVYKQYRDDLEA,1315,NP_056087.2.csv,refseq-RPGRIP1L-NM_015272.2_clinical_seed_0_final,refseq-RPGRIP1L-NM_015272.2.a2m,Invitae,refseq-RPGRIP1L-NM_015272.2.npy,1,1315,1315
+NP_056089.1,MGQLCWLPLLAPLLLLRPPGVQSAGPIRAFVVPHSHMDVGWVYTVQESMRAYAANVYTSVVEELARGQQRRFIAVEQEFFRLWWDGVASDQQKYQVRQLLEEGRLEFVIGGQVMHDEAVTHLDDQILQLTEGHGFLYETFGIRPQFSWHVDPFGASATTPTLFALAGFNAHLGSRIDYDLKAAMQEARGLQFVWRGSPSLSERQEIFTHIMDQYSYCTPSHIPFSNRSGFYWNGVAVFPKPPQDGVYPNMSEPVTPANINLYAEALVANVKQRAAWFRTPHVLWPWGCDKQFFNASVQFANMDPLLDHINSHAAELGVSVQYATLGDYFRALHALNVTWRVRDHHDFLPYSTEPFQAWTGFYTSRSSLKGLARRASALLYAGESMFTRYLWPAPRGHLDPTWALQQLQQLRWAVSEVQHHDAITGTESPKVRDMYATHLASGMLGMRKLMASIVLDELQPQAPMAASSDAGPAGHFASVYNPLAWTVTTIVTLTVGFPGVRVTDEAGHPVPSQIQNSTETPSAYDLLILTTIPGLSYRHYNIRPTAGAQEGTQEPAATVASTLQFGRRLRRRTSHAGRYLVPVANDCYIVLLDQDTNLMHSIWERQSNRTVRVTQEFLEYHVNGDVKQGPISDNYLFTPGKAAVPAWEAVEMEIVAGQLVTEIRQYFYRNMTAQNYTYAIRSRLTHVPQGHDGELLCHRIEQEYQAGPLELNREAVLRTSTNLNSQQVIYSDNNGYQMQRRPYVSYVNNSIARNYYPMVQSAFMEDGKSRLVLLSERAHGISSQGNGQVEVMLHRRLWNNFDWDLGYNLTLNDTSVVHPVLWLLLGSWSLTTALRQRSALALQHRPVVLFGDLAGTAPKLPGPQQQEAVTLPPNLHLQILSIPGWRYSSNHTEHSQNLRKGHRGEAQADLRRVLLRLYHLYEVGEDPVLSQPVTVNLEAVLQALGSVVAVEERSLTGTWDLSMLHRWSWRTGPGRHRGDTTSPSRPPGGPIITVHPKEIRTFFIHFQQQ,1009,NP_056089.1.csv,refseq-MAN2B2-NM_015274.3_clinical_seed_0_final,refseq-MAN2B2-NM_015274.3.a2m,Invitae,refseq-MAN2B2-NM_015274.3_theta_0.2.npy,1,1009,1009
+NP_056090.1,MAVETLSPDWEFDRVDDGSQKIHAEVQLKNYGKFLEEYTSQLRRIEDALDDSIGDVWDFNLDPIALKLLPYEQSSLLELIKTENKVLNKVITVYAALCCEIKKLKYEAETKFYNGLLFYGEGATDASMVEGDCQIQMGRFISFLQELSCFVTRCYEVVMNVVHQLAALYISNKIAPKIIETTGVHFQTMYEHLGELLTVLLTLDEIIDNHITLKDHWTMYKRLLKSVHHNPSKFGIQEEKLKPFEKFLLKLEGQLLDGMIFQACIEQQFDSLNGGVSVSKNSTFAEEFAHSIRSIFANVEAKLGEPSEIDQRDKYVGICGLFVLHFQIFRTIDKKFYKSLLDICKKVPAITLTANIIWFPDNFLIQKIPAAAKLLDRKSLQAIKIHRDTFLQQKAQSLTKDVQSYYVFVSSWMMKMESILSKEQRMDKFAEDLTNRCNVFIQGFLYAYSISTIIKTTMNLYMSMQKPMTKTSVKALCRLVELLKAIEHMFYRRSMVVADSVSHITQHLQHQALHSISVAKKRVISDKKYSEQRLDVLSALVLAENTLNGPSTKQRRLIVSLALSVGTQMKTFKDEELFPLQVVMKKLDLISELRERVQTQCDCCFLYWHRAVFPIYLDDVYENAVDAARLHYMFSALRDCVPAMMHARHLESYEILLDCYDKEIMEILNEHLLDKLCKEIEKDLRLSVHTHLKLDDRNPFKVGMKDLALFFSLNPIRFFNRFIDIRAYVTHYLDKTFYNLTTVALHDWATYSEMRNLATQRYGLVMTEAHLPSQTLEQGLDVLEIMRNIHIFVSRYLYNLNNQIFIERTSNNKHLNTINIRHIANSIRTHGTGIMNTTVNFTYQFLKKKFYIFSQFMYDEHIKSRLIKDIRFFREIKDQNDHKYPFDRAEKFNRGIRKLGVTPEGQSYLDQFRQLISQIGNAMGYVRMIRSGGLHCSSNAIRFVPDLEDIVNFEELVKEEGLAEETLKAARHLDSVLSDHTRNSAEGTEYFKMLVDVFAPEFRRPKNIHLRNFYIIVPPLTLNFVEHSISCKEKLNKKNKIGAAFTDDGFAMGVAYILKLLDQYREFDSLHWFQSVREKYLKEIRAVAKQQNVQSASQDEKLLQTMNLTQKRLDVYLQEFELLYFSLSSARIFFRADKTAAEENQEKKEKEEETKTSNGDLSDSTVSADPVVK,1173,NP_056090.1.csv,refseq-WASHC4-NM_015275.2_clinical_seed_0_final,refseq-WASHC4-NM_015275.2.a2m,Invitae,refseq-WASHC4-NM_015275.2.npy,1,1173,1173
+NP_056092.2,MATGLGEPVYGLSEDEGESRILRVKVVSGIDLAKKDIFGASDPYVKLSLYVADENRELALVQTKTIKKTLNPKWNEEFYFRVNPSNHRLLFEVFDENRLTRDDFLGQVDVPLSHLPTEDPTMERPYTFKDFLLRPRSHKSRVKGFLRLKMAYMPKNGGQDEENSDQRDDMEHGWEVVDSNDSASQHQEELPPPPLPPGWEEKVDNLGRTYYVNHNNRTTQWHRPSLMDVSSESDNNIRQINQEAAHRRFRSRRHISEDLEPEPSEGGDVPEPWETISEEVNIAGDSLGLALPPPPASPGSRTSPQELSEELSRRLQITPDSNGEQFSSLIQREPSSRLRSCSVTDAVAEQGHLPPPSVAYVHTTPGLPSGWEERKDAKGRTYYVNHNNRTTTWTRPIMQLAEDGASGSATNSNNHLIEPQIRRPRSLSSPTVTLSAPLEGAKDSPVRRAVKDTLSNPQSPQPSPYNSPKPQHKVTQSFLPPGWEMRIAPNGRPFFIDHNTKTTTWEDPRLKFPVHMRSKTSLNPNDLGPLPPGWEERIHLDGRTFYIDHNSKITQWEDPRLQNPAITGPAVPYSREFKQKYDYFRKKLKKPADIPNRFEMKLHRNNIFEESYRRIMSVKRPDVLKARLWIEFESEKGLDYGGVAREWFFLLSKEMFNPYYGLFEYSATDNYTLQINPNSGLCNEDHLSYFTFIGRVAGLAVFHGKLLDGFFIRPFYKMMLGKQITLNDMESVDSEYYNSLKWILENDPTELDLMFCIDEENFGQTYQVDLKPNGSEIMVTNENKREYIDLVIQWRFVNRVQKQMNAFLEGFTELLPIDLIKIFDENELELLMCGLGDVDVNDWRQHSIYKNGYCPNHPVIQWFWKAVLLMDAEKRIRLLQFVTGTSRVPMNGFAELYGSNGPQLFTIEQWGSPEKLPRAHTCFNRLDLPPYETFEDLREKLLMAVENAQGFEGVD,955,NP_056092.2.csv,refseq-NEDD4L-NM_015277.5_clinical_seed_0_final,refseq-NEDD4L-NM_015277.5.a2m,Invitae,refseq-NEDD4L-NM_015277.5.npy,1,955,955
+NP_056093.3,MEDAGAAGPGPEPEPEPEPEPEPAPEPEPEPKPGAGTSEAFSRLWTDVMGILDGSLGNIDDLAQQYADYYNTCFSDVCERMEELRKRRVSQDLEVEKPDASPTSLQLRSQIEESLGFCSAVSTPEVERKNPLHKSNSEDSSVGKGDWKKKNKYFWQNFRKNQKGIMRQTSKGEDVGYVASEITMSDEERIQLMMMVKEKMITIEEALARLKEYEAQHRQSAALDPADWPDGSYPTFDGSSNCNSREQSDDETEESVKFKRLHKLVNSTRRVRKKLIRVEEMKKPSTEGGEEHVFENSPVLDERSALYSGVHKKPLFFDGSPEKPPEDDSDSLTTSPSSSSLDTWGAGRKLVKTFSKGESRGLIKPPKKMGTFFSYPEEEKAQKVSRSLTEGEMKKGLGSLSHGRTCSFGGFDLTNRSLHVGSNNSDPMGKEGDFVYKEVIKSPTASRISLGKKVKSVKETMRKRMSKKYSSSVSEQDSGLDGMPGSPPPSQPDPEHLDKPKLKAGGSVESLRSSLSGQSSMSGQTVSTTDSSTSNRESVKSEDGDDEEPPYRGPFCGRARVHTDFTPSPYDTDSLKLKKGDIIDIISKPPMGTWMGLLNNKVGTFKFIYVDVLSEDEEKPKRPTRRRRKGRPPQPKSVEDLLDRINLKEHMPTFLFNGYEDLDTFKLLEEEDLDELNIRDPEHRAVLLTAVELLQEYDSNSDQSGSQEKLLVDSQGLSGCSPRDSGCYESSENLENGKTRKASLLSAKSSTEPSLKSFSRNQLGNYPTLPLMKSGDALKQGQEEGRLGGGLAPDTSKSCDPPGVTGLNKNRRSLPVSICRSCETLEGPQTVDTWPRSHSLDDLQVEPGAEQDVPTEVTEPPPQIVPEVPQKTTASSTKAQPLEQDSAVDNALLLTQSKRFSEPQKLTTKKLEGSIAASGRGLSPPQCLPRNYDAQPPGAKHGLARTPLEGHRKGHEFEGTHHPLGTKEGVDAEQRMQPKIPSQPPPVPAKKSRERLANGLHPVPMGPSGALPSPDAPCLPVKRGSPASPTSPSDCPPALAPRPLSGQAPGSPPSTRPPPWLSELPENTSLQEHGVKLGPALTRKVSCARGVDLETLTENKLHAEGIDLTEEPYSDKHGRCGIPEALVQRYAEDLDQPERDVAANMDQIRVKQLRKQHRMAIPSGGLTEICRKPVSPGCISSVSDWLISIGLPMYAGTLSTAGFSTLSQVPSLSHTCLQEAGITEERHIRKLLSAARLFKLPPGPEAM,1247,NP_056093.3.csv,refseq-SASH1-NM_015278.3_clinical_seed_0_final,refseq-SASH1-NM_015278.3.a2m,Invitae,refseq-SASH1-NM_015278.3.npy,1,1247,1247
+NP_056109.1,MDEQSVESIAEVFRCFICMEKLRDARLCPHCSKLCCFSCIRRWLTEQRAQCPHCRAPLQLRELVNCRWAEEVTQQLDTLQLCSLTKHEENEKDKCENHHEKLSVFCWTCKKCICHQCALWGGMHGGHTFKPLAEIYEQHVTKVNEEVAKLRRRLMELISLVQEVERNVEAVRNAKDERVREIRNAVEMMIARLDTQLKNKLITLMGQKTSLTQETELLESLLQEVEHQLRSCSKSELISKSSEILMMFQQVHRKPMASFVTTPVPPDFTSELVPSYDSATFVLENFSTLRQRADPVYSPPLQVSGLCWRLKVYPDGNGVVRGYYLSVFLELSAGLPETSKYEYRVEMVHQSCNDPTKNIIREFASDFEVGECWGYNRFFRLDLLANEGYLNPQNDTVILRFQVRSPTFFQKSRDQHWYITQLEAAQTSYIQQINNLKERLTIELSRTQKSRDLSPPDNHLSPQNDDALETRAKKSACSDMLLEGGPTTASVREAKEDEEDEEKIQNEDYHHELSDGDLDLDLVYEDEVNQLDGSSSSASSTATSNTEENDIDEETMSGENDVEYNNMELEEGELMEDAAAAGPAGSSHGYVGSSSRISRRTHLCSAATSSLLDIDPLILIHLLDLKDRSSIENLWGLQPRPPASLLQPTASYSRKDKDQRKQQAMWRVPSDLKMLKRLKTQMAEVRCMKTDVKNTLSEIKSSSAASGDMQTSLFSADQAALAACGTENSGRLQDLGMELLAKSSVANCYIRNSTNKKSNSPKPARSSVAGSLSLRRAVDPGENSRSKGDCQTLSEGSPGSSQSGSRHSSPRALIHGSIGDILPKTEDRQCKALDSDAVVVAVFSGLPAVEKRRKMVTLGANAKGGHLEGLQMTDLENNSETGELQPVLPEGASAAPEEGMSSDSDIECDTENEEQEEHTSVGGFHDSFMVMTQPPDEDTHSSFPDGEQIGPEDLSFNTDENSGR,964,NP_056109.1.csv,refseq-TRIM37-NM_015294.4_clinical_seed_0_final,refseq-TRIM37-NM_015294.4.a2m,Invitae,refseq-TRIM37-NM_015294.4.npy,1,964,964
+NP_056110.2,MAAADGGGPGGASVGTEEDGGGVGHRTVYLFDRREKESELGDRPLQVGERSDYAGFRACVCQTLGISPEEKFVITTTSRKEITCDNFDETVKDGVTLYLLQSVNQLLLTATKERIDFLPHYDTLVKSGMYEYYASEGQNPLPFALAELIDNSLSATSRNIGVRRIQIKLLFDETQGKPAVAVIDNGRGMTSKQLNNWAVYRLSKFTRQGDFESDHSGYVRPVPVPRSLNSDISYFGVGGKQAVFFVGQSARMISKPADSQDVHELVLSKEDFEKKEKNKEAIYSGYIRNRKPSDSVHITNDDERFLHHLIIEEKEKDSFTAVVITGVQPEHIQYLKNYFHLWTRQLAHIYHYYIHGPKGNEIRTSKEVEPFNNIDIEISMFEKGKVPKIVNLREIQDDMQTLYVNTAADSFEFKAHVEGDGVVEGIIRYHPFLYDRETYPDDPCFPSKLKDEDDEDDCFILEKAARGKRPIFECFWNGRLIPYTSVEDFDWCTPPKKRGLAPIECYNRISGALFTNDKFQVSTNKLTFMDLELKLKDKNTLFTRILNGQEQRMKIDREFALWLKDCHEKYDKQIKFTLFKGVITRPDLPSKKQGPWATYAAIEWDGKIYKAGQLVKTIKTLPLFYGSIVRFFLYGDHDGEVYATGGEVQIAMEPQALYDEVRTVPIAKLDRTVAEKAVKKYVEDEMARLPDRLSVTWPEGDELLPNEVRPAGTPIGALRIEILNKKGEAMQKLPGTSHGGSKKLLVELKVILHSSSGNKEIISHISQHGGKWPYWFKKMENIQKLGNYTLKLQVVLNESNADTYAGRPLPSKAIKFSVKEGKPEKFSFGLLDLPFRVGVPFNIPLEFQDEFGHTSQLVTDIQPVLEASGLSLHYEEITKGPNCVIRGVTAKGPVNSCQGKNYNLKVTLPGLKEDSQILKIRLLPGHPRRLKVKPDSEILVIENGTAFPFQVEVLDESDNITAQPKLIVHCKFSGAPNLPVYVVDCSSSGTSILTGSAIQVQNIKKDQTLKARIEIPSCKDVAPVEKTIKLLPSSHVARLQIFSVEGQKAIQIKHQDEVNWIAGDIMHNLIFQMYDEGEREINITSALAEKIKVNWTPEINKEHLLQGLLPDVQVPTSVKDMRYCQVSFQDDHVSLESAFTVRPLPDEPKHLKCEMKGGKTVQMGQELQGEVVIIITDQYGNQIQAFSPSSLSSLSIAGVGLDSSNLKTTFQENTQSISVRGIKFIPGPPGNKDLCFTWREFSDFIRVQLISGPPAKLLLIDWPELKESIPVINGRDLQNPIIVQLCDQWDNPAPVQHVKISLTKASNLKLMPSNQQHKTDEKGRANLGVFSVFAPRGEHTLQVKAIYNKSIIEGPIIKLMILPDPEKPVRLNVKYDKDASFLAGGLFTDFMISVISEDDSIIKNINPARISMKMWKLSTSGNRPPANAETFSCNKIKDNDKEDGCFYFRDKVIPNKVGTYCIQFGFMMDKTNILNSEQVIVEVLPNQPVKLVPKIKPPTPAVSNVRSVASRTLVRDLHLSITDDYDNHTGIDLVGTIIATIKGSNEEDTDTPLFIGKVRTLEFPFVNGSAEIMSLVLAESSPGRDSTEYFIVFEPRLPLLSRTLEPYILPFMFYNDVKKQQQMAALTKEKDQLSQSIVMYKSLFEASQQLLNEMKCQVEEARLKEAQLRNELKIHNIDIPTTQQVPHIEALLKRKLSEQEELKKKPRRSCTLPNYTKGSGDVLGKIAHLAQIEDDRAAMVISWHLASDMDCVVTLTTDAARRIYDETQGRQQVLPLDSIYKKTLPDWKRSLPHFRNGKLYFKPIGDPVFARDLLTFPDNVEHCETVFGMLLGDTIILDNLDAANHYRKEVVKITHCPTLLTRDGDRIRSNGKFGGLQNKAPPMDKLRGMVFGAPVPKQCLILGEQIDLLQQYRSAVCKLDSVNKDLNSQLEYLRTPDMRKKKQELDEHEKNLKLIEEKLGMTPIRKCNDSLRHSPKVETTDCPVPPKRMRREATRQNRIITKTDV,2005,NP_056110.2.csv,refseq-SMCHD1-NM_015295.2_clinical_seed_0_final,refseq-SMCHD1-NM_015295.2.a2m,Invitae,refseq-SMCHD1-NM_015295.2.npy,1,2005,2005
+NP_056146.1,MATAGGGSGADPGSRGLLRLLSFCVLLAGLCRGNSVERKIYIPLNKTAPCVRLLNATHQIGCQSSISGDTGVIHVVEKEEDLQWVLTDGPNPPYMVLLESKHFTRDLMEKLKGRTSRIAGLAVSLTKPSPASGFSPSVQCPNDGFGVYSNSYGPEFAHCREIQWNSLGNGLAYEDFSFPIFLLEDENETKVIKQCYQDHNLSQNGSAPTFPLCAMQLFSHMHAVISTATCMRRSSIQSTFSINPEIVCDPLSDYNVWSMLKPINTTGTLKPDDRVVVAATRLDSRSFFWNVAPGAESAVASFVTQLAAAEALQKAPDVTTLPRNVMFVFFQGETFDYIGSSRMVYDMEKGKFPVQLENVDSFVELGQVALRTSLELWMHTDPVSQKNESVRNQVEDLLATLEKSGAGVPAVILRRPNQSQPLPPSSLQRFLRARNISGVVLADHSGAFHNKYYQSIYDTAENINVSYPEWLSPEEDLNFVTDTAKALADVATVLGRALYELAGGTNFSDTVQADPQTVTRLLYGFLIKANNSWFQSILRQDLRSYLGDGPLQHYIAVSSPTNTTYVVQYALANLTGTVVNLTREQCQDPSKVPSENKDLYEYSWVQGPLHSNETDRLPRCVRSTARLARALSPAFELSQWSSTEYSTWTESRWKDIRARIFLIASKELELITLTVGFGILIFSLIVTYCINAKADVLFIAPREPGAVSY,709,NP_056146.1.csv,refseq-NCSTN-NM_015331.2_clinical_seed_0_final,refseq-NCSTN-NM_015331.2.a2m,Invitae,refseq-NCSTN-NM_015331.2.npy,1,709,709
+NP_056150.1,MTAAANWVANGASLEDCHSNLFSLAELTGIKWRRYNFGGHGDCGPIISAPAQDDPILLSFIRCLQANLLCVWRRDVKPDCKELWIFWWGDEPNLVGVIHHELQVVEEGLWENGLSYECRTLLFKAIHNLLERCLMDKNFVRIGKWFVRPYEKDEKPVNKSEHLSCAFTFFLHGESNVCTSVEIAQHQPIYLINEEHIHMAQSSPAPFQVLVSPYGLNGTLTGQAYKMSDPATRKLIEEWQYFYPMVLKKKEESKEEDELGYDDDFPVAVEVIVGGVRMVYPSAFVLISQNDIPVPQSVASAGGHIAVGQQGLGSVKDPSNCGMPLTPPTSPEQAILGESGGMQSAASHLVSQDGGMITMHSPKRSGKIPPKLHNHMVHRVWKECILNRTQSKRSQMSTPTLEEEPASNPATWDFVDPTQRVSCSCSRHKLLKRCAVGPNRPPTVSQPGFSAGPSSSSSLPPPASSKHKTAERQEKGDKLQKRPLIPFHHRPSVAEELCMEQDTPGQKLGLAGIDSSLEVSSSRKYDKQMAVPSRNTSKQMNLNPMDSPHSPISPLPPTLSPQPRGQETESLDPPSVPVNPALYGNGLELQQLSTLDDRTVLVGQRLPLMAEVSETALYCGIRPSNPESSEKWWHSYRLPPSDDAEFRPPELQGERCDAKMEVNSESTALQRLLAQPNKRFKIWQDKQPQLQPLHFLDPLPLSQQPGDSLGEVNDPYTFEDGDIKYIFTANKKCKQGTEKDSLKKNKSEDGFGTKDVTTPGHSTPVPDGKNAMSIFSSATKTDVRQDNAAGRAGSSSLTQVTDLAPSLHDLDNIFDNSDDDELGAVSPALRSSKMPAVGTEDRPLGKDGRAAVPYPPTVADLQRMFPTPPSLEQHPAFSPVMNYKDGISSETVTALGMMESPMVSMVSTQLTEFKMEVEDGLGSPKPEEIKDFSYVHKVPSFQPFVGSSMFAPLKMLPSHCLLPLKIPDACLFRPSWAIPPKIEQLPMPPAATFIRDGYNNVPSVGSLADPDYLNTPQMNTPVTLNSAAPASNSGAGVLPSPATPRFSVPTPRTPRTPRTPRGGGTASGQGSVKYDSTDQGSPASTPSTTRPLNSVEPATMQPIPEAHSLYVTLILSDSVMNIFKDRNFDSCCICACNMNIKGADVGLYIPDSSNEDQYRCTCGFSAIMNRKLGYNSGLFLEDELDIFGKNSDIGQAAERRLMMCQSTFLPQVEGTKKPQEPPISLLLLLQNQHTQPFASLNFLDYISSNNRQTLPCVSWSYDRVQADNNDYWTECFNALEQGRQYVDNPTGGKVDEALVRSATVHSWPHSNVLDISMLSSQDVVRMLLSLQPFLQDAIQKKRTGRTWENIQHVQGPLTWQQFHKMAGRGTYGSEESPEPLPIPTLLVGYDKDFLTISPFSLPFWERLLLDPYGGHRDVAYIVVCPENEALLEGAKTFFRDLSAVYEMCRLGQHKPICKVLRDGIMRVGKTVAQKLTDELVSEWFNQPWSGEENDNHSRLKLYAQVCRHHLAPYLATLQLDSSLLIPPKYQTPPAAAQGQATPGNAGPLAPNGSAAPPAGSAFNPTSNSSSTNPAASSSASGSSVPPVSSSASAPGISQISTTSSSGFSGSVGGQNPSTGGISADRTQGNIGCGGDTDPGQSSSQPSQDGQESVTERERIGIPTEPDSADSHAHPPAVVIYMVDPFTYAAEEDSTSGNFWLLSLMRCYTEMLDNLPEHMRNSFILQIVPCQYMLQTMKDEQVFYIQYLKSMAFSVYCQCRRPLPTQIHIKSLTGFGPAASIEMTLKNPERPSPIQLYSPPFILAPIKDKQTELGETFGEASQKYNVLFVGYCLSHDQRWLLASCTDLHGELLETCVVNIALPNRSRRSKVSARKIGLQKLWEWCIGIVQMTSLPWRVVIGRLGRLGHGELKDWSILLGECSLQTISKKLKDVCRMCGISAADSPSILSACLVAMEPQGSFVVMPDAVTMGSVFGRSTALNMQSSQLNTPQDASCTHILVFPTSSTIQVAPANYPNEDGFSPNNDDMFVDLPFPDDMDNDIGILMTGNLHSSPNSSPVPSPGSPSGIGVGSHFQHSRSQGERLLSREAPEELKQQPLALGYFVSTAKAENLPQWFWSSCPQAQNQCPLFLKASLHHHISVAQTDELLPARNSQRVPHPLDSKTTSDVLRFVLEQYNALSWLTCNPATQDRTSCLPVHFVVLTQLYNAIMNIL,2210,NP_056150.1.csv,refseq-MED13L-NM_015335.4_clinical_seed_0_final,refseq-MED13L-NM_015335.4.a2m,Invitae,refseq-MED13L-NM_015335.4_theta_0.2.npy,1,2210,2210
+NP_056155.1,MASVWQRLGFYASLLKRQLNGGPDVIKWERRVIPGCTRSIYSATGKWTKEYTLQTRKDVEKWWHQRIKEQASKISEADKSKPKFYVLSMFPYPSGKLHMGHVRVYTISDTIARFQKMRGMQVINPMGWDAFGLPAENAAVERNLHPQSWTQSNIKHMRKQLDRLGLCFSWDREITTCLPDYYKWTQYLFIKLYEAGLAYQKEALVNWDPVDQTVLANEQVDEHGCSWRSGAKVEQKYLRQWFIKTTAYAKAMQDALADLPEWYGIKGMQAHWIGDCVGCHLDFTLKVHGQATGEKLTAYTATPEAIYGTSHVAISPSHRLLHGHSSLKEALRMALVPGKDCLTPVMAVNMLTQQEVPVVILAKADLEGSLDSKIGIPSTSSEDTILAQTLGLAYSEVIETLPDGTERLSSSAEFTGMTRQDAFLALTQKARGKRVGGDVTSDKLKDWLISRQRYWGTPIPIVHCPVCGPTPVPLEDLPVTLPNIASFTGKGGPPLAMASEWVNCSCPRCKGAAKRETDTMDTFVDSAWYYFRYTDPHNPHSPFNTAVADYWMPVDLYIGGKEHAVMHLFYARFFSHFCHDQKMVKHREPFHKLLAQGLIKGQTFRLPSGQYLQREEVDLTGSVPVHAKTKEKLEVTWEKMSKSKHNGVDPEEVVEQYGIDTIRLYILFAAPPEKDILWDVKTDALPGVLRWQQRLWTLTTRFIEARASGKSPQPQLLSNKEKAEARKLWEYKNSVISQVTTHFTEDFSLNSAISQLMGLSNALSQASQSVILHSPEFEDALCALMVMAAPLAPHVTSEIWAGLALVPRKLCAHYTWDASVLLQAWPAVDPEFLQQPEVVQMAVLINNKACGKIPVPQQVARDQDKVHEFVLQSELGVRLLQGRSIKKSFLSPRTALINFLVQD,903,NP_056155.1.csv,refseq-LARS2-NM_015340.3_clinical_seed_0_final,refseq-LARS2-NM_015340.3.a2m,Invitae,refseq-LARS2-NM_015340.3.npy,1,903,903
+NP_056170.2,MAPQKHGGGGGGGSGPSAGSGGGGFGGSAAVAAATASGGKSGGGSCGGGGSYSASSSSSAAAAAGAAVLPVKKPKMEHVQADHELFLQAFEKPTQIYRFLRTRNLIAPIFLHRTLTYMSHRNSRTNIKRKTFKVDDMLSKVEKMKGEQESHSLSAHLQLTFTGFFHKNDKPSPNSENEQNSVTLEVLLVKVCHKKRKDVSCPIRQVPTGKKQVPLNPDLNQTKPGNFPSLAVSSNEFEPSNSHMVKSYSLLFRVTRPGRREFNGMINGETNENIDVNEELPARRKRNREDGEKTFVAQMTVFDKNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKRLPPFETFSQGPTLQFTLRWTGETNDKSTAPIAKPLATRNSESLHQENKPGSVKPTQTIAVKESLTTDLQTRKEKDTPNENRQKLRIFYQFLYNNNTRQQTEARDDLHCPWCTLNCRKLYSLLKHLKLCHSRFIFNYVYHPKGARIDVSINECYDGSYAGNPQDIHRQPGFAFSRNGPVKRTPITHILVCRPKRTKASMSEFLESEDGEVEQQRTYSSGHNRLYFHSDTCLPLRPQEMEVDSEDEKDPEWLREKTITQIEEFSDVNEGEKEVMKLWNLHVMKHGFIADNQMNHACMLFVENYGQKIIKKNLCRNFMLHLVSMHDFNLISIMSIDKAVTKLREMQQKLEKGESASPANEEITEEQNGTANGFSEINSKEKALETDSVSGVSKQSKKQKL,739,NP_056170.2.csv,refseq-SUZ12-NM_015355.3_clinical_seed_0_final,refseq-SUZ12-NM_015355.3.a2m,Invitae,refseq-SUZ12-NM_015355.3.npy,1,739,739
+NP_056201.2,MGTKMADLDSPPKLSGVQQPSEGVGGGRCSEISAELIRSLTELQELEAVYERLCGEEKVVERELDALLEQQNTIESKMVTLHRMGPNLQLIEGDAKQLAGMITFTCNLAENVSSKVRQLDLAKNRLYQAIQRADDILDLKFCMDGVQTALRSEDYEQAAAHTHRYLCLDKSVIELSRQGKEGSMIDANLKLLQEAEQRLKAIVAEKFAIATKEGDLPQVERFFKIFPLLGLHEEGLRKFSEYLCKQVASKAEENLLMVLGTDMSDRRAAVIFADTLTLLFEGIARIVETHQPIVETYYGPGRLYTLIKYLQVECDRQVEKVVDKFIKQRDYHQQFRHVQNNLMRNSTTEKIEPRELDPILTEVTLMNARSELYLRFLKKRISSDFEVGDSMASEEVKQEHQKCLDKLLNNCLLSCTMQELIGLYVTMEEYFMRETVNKAVALDTYEKGQLTSSMVDDVFYIVKKCIGRALSSSSIDCLCAMINLATTELESDFRDVLCNKLRMGFPATTFQDIQRGVTSAVNIMHSSLQQGKFDTKGIESTDEAKMSFLVTLNNVEVCSENISTLKKTLESDCTKLFSQGIGGEQAQAKFDSCLSDLAAVSNKFRDLLQEGLTELNSTAIKPQVQPWINSFFSVSHNIEEEEFNDYEANDPWVQQFILNLEQQMAEFKASLSPVIYDSLTGLMTSLVAVELEKVVLKSTFNRLGGLQFDKELRSLIAYLTTVTTWTIRDKFARLSQMATILNLERVTEILDYWGPNSGPLTWRLTPAEVRQVLALRIDFRSEDIKRLRL,789,NP_056201.2.csv,refseq-COG4-NM_015386.2_clinical_seed_0_final,refseq-COG4-NM_015386.2.a2m,Invitae,refseq-COG4-NM_015386.2.npy,1,789,789
+NP_056227.2,MMAAVPPGLEPWNRVRIPKAGNRSAVTVQNPGAALDLCIAAVIKECHLVILSLKSQTLDAETDVLCAVLYSNHNRMGRHKPHLALKQVEQCLKRLKNMNLEGSIQDLFELFSSNENQPLTTKVCVVPSQPVVELVLMKVLGACKLLLRLLDCCCKTFLLTVKHLGLQEFIILNLVMVGLVSRLWVLYKGVLKRLILLYEPLFGLLQEVARIQPMPYFKDFTFPSDITEFLGQPYFEAFKKKMPIAFAAKGINKLLNKLFLINEQSPRASEETLLGISKKAKQMKINVQNNVDLGQPVKNKRVFKEESSEFDVRAFCNQLKHKATQETSFDFKCSQSRLKTTKYSSQKVIGTPHAKSFVQRFREAESFTQLSEEIQMAVVWCRSKKLKAQAIFLGNKLLKSNRLKHLEAQGTSLPKKLECIKTSICNHLLRGSGIKTSKHHLRQRRSQNKFLRRQRKPQRKLQSTLLREIQQFSQGTRKSATDTSAKWRLSHCTVHRTDLYPNSKQLLNSGVSMPVIQTKEKMIHENLRGIHENETDSWTVMQINKNSTSGTIKETDDIDDIFALMGV,567,NP_056227.2.csv,refseq-NEPRO-NM_015412.3_clinical_seed_0_final,refseq-NEPRO-NM_015412.3.a2m,Invitae,refseq-NEPRO-NM_015412.3.npy,1,567,567
+NP_056240.2,MLISKNMPWRRLQGISFGMYSAEELKKLSVKSITNPRYLDSLGNPSANGLYDLALGPADSKEVCSTCVQDFSNCSGHLGHIELPLTVYNPLLFDKLYLLLRGSCLNCHMLTCPRAVIHLLLCQLRVLEVGALQAVYELERILNRFLEENPDPSASEIREELEQYTTEIVQNNLLGSQGAHVKNVCESKSKLIALFWKAHMNAKRCPHCKTGRSVVRKEHNSKLTITFPAMVHRTAGQKDSEPLGIEEAQIGKRGYLTPTSAREHLSALWKNEGFFLNYLFSGMDDDGMESRFNPSVFFLDFLVVPPSRYRPVSRLGDQMFTNGQTVNLQAVMKDVVLIRKLLALMAQEQKLPEEVATPTTDEEKDSLIAIDRSFLSTLPGQSLIDKLYNIWIRLQSHVNIVFDSEMDKLMMDKYPGIRQILEKKEGLFRKHMMGKRVDYAARSVICPDMYINTNEIGIPMVFATKLTYPQPVTPWNVQELRQAVINGPNVHPGASMVINEDGSRTALSAVDMTQREAVAKQLLTPATGAPKPQGTKIVCRHVKNGDILLLNRQPTLHRPSIQAHRARILPEEKVLRLHYANCKAYNADFDGDEMNAHFPQSELGRAEAYVLACTDQQYLVPKDGQPLAGLIQDHMVSGASMTTRGCFFTREHYMELVYRGLTDKVGRVKLLSPSILKPFPLWTGKQVVSTLLINIIPEDHIPLNLSGKAKITGKAWVKETPRSVPGFNPDSMCESQVIIREGELLCGVLDKAHYGSSAYGLVHCCYEIYGGETSGKVLTCLARLFTAYLQLYRGFTLGVEDILVKPKADVKRQRIIEESTHCGPQAVRAALNLPEAASYDEVRGKWQDAHLGKDQRDFNMIDLKFKEEVNHYSNEINKACMPFGLHRQFPENSLQMMVQSGAKGSTVNTMQISCLLGQIELEGRRPPLMASGKSLPCFEPYEFTPRAGGFVTGRFLTGIKPPEFFFHCMAGREGLVDTAVKTSRSGYLQRCIIKHLEGLVVQYDLTVRDSDGSVVQFLYGEDGLDIPKTQFLQPKQFPFLASNYEVIMKSQHLHEVLSRADPKKALHHFRAIKKWQSKHPNTLLRRGAFLSYSQKIQEAVKALKLESENRNGRSPGTQEMLRMWYELDEESRRKYQKKAAACPDPSLSVWRPDIYFASVSETFETKVDDYSQEWAAQTEKSYEKSELSLDRLRTLLQLKWQRSLCEPGEAVGLLAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILMVASANIKTPMMSVPVLNTKKALKRVKSLKKQLTRVCLGEVLQKIDVQESFCMEEKQNKFQVYQLRFQFLPHAYYQQEKCLRPEDILRFMETRFFKLLMESIKKKNNKASAFRNVNTRRATQRDLDNAGELGRSRGEQEGDEEEEGHIVDAEAEEGDADASDAKRKEKQEEEVDYESEEEEEREGEENDDEDMQEERNPHREGARKTQEQDEEVGLGTEEDPSLPALLTQPRKPTHSQEPQGPEAMERRVQAVREIHPFIDDYQYDTEESLWCQVTVKLPLMKINFDMSSLVVSLAHGAVIYATKGITRCLLNETTNNKNEKELVLNTEGINLPELFKYAEVLDLRRLYSNDIHAIANTYGIEAALRVIEKEIKDVFAVYGIAVDPRHLSLVADYMCFEGVYKPLNRFGIRSNSSPLQQMTFETSFQFLKQATMLGSHDELRSPSACLVVGKVVRGGTGLFELKQPLR,1720,NP_056240.2.csv,refseq-POLR1A-NM_015425.5_clinical_seed_0_final,refseq-POLR1A-NM_015425.5.a2m,Invitae,refseq-POLR1A-NM_015425.5_theta_0.2.npy,1,1720,1720
+NP_056241.3,MAAPCAEDPSLERHFKGHRDAVTCVDFSINTKQLASGSMDSCLMVWHMKPQSRAYRFTGHKDAVTCVNFSPSGHLLASGSRDKTVRIWVPNVKGESTVFRAHTATVRSVHFCSDGQSFVTASDDKTVKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDDKTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQLHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQGPATTVAFSRTGEYFASGGSDEQVMVWKSNFDIVDHGEVTKVPRPPATLASSMGNLPEVDFPVPPGRGRSVESVQSQPQEPVSVPQTLTSTLEHIVGQLDVLTQTVSILEQRLTLTEDKLKQCLENQQLIMQRATP,407,NP_056241.3.csv,refseq-POC1A-NM_015426.4_clinical_seed_0_final,refseq-POC1A-NM_015426.4.a2m,Invitae,refseq-POC1A-NM_015426.4.npy,1,407,407
+NP_056265.2,MSLVPATNYIYTPLNQLKGGTIVNVYGVVKFFKPPYLSKGTDYCSVVTIVDQTNVKLTCLLFSGNYEALPIIYKNGDIVRFHRLKIQVYKKETQGITSSGFASLTFEGTLGAPIIPRTSSKYFNFTTEDHKMVEALRVWASTHMSPSWTLLKLCDVQPMQYFDLTCQLLGKAEVDGASFLLKVWDGTRTPFPSWRVLIQDLVLEGDLSHIHRLQNLTIDILVYDNHVHVARSLKVGSFLRIYSLHTKLQSMNSENQTMLSLEFHLHGGTSYGRGIRVLPESNSDVDQLKKDLESANLTANQHSDVICQSEPDDSFPSSGSVSLYEVERCQQLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKPRRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQNTSLYDSKIWTTKNQKGRKVAVHFVKNNGILPLSNECLLLIEGGTLSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTIHHYGCKQCSSLRSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFTLDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFCPPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI,634,NP_056265.2.csv,refseq-POT1-NM_015450.2_clinical_seed_0_final,refseq-POT1-NM_015450.2.a2m,Invitae,refseq-POT1-NM_015450.2.npy,1,634,634
+NP_056280.2,MGQEPRTLPPSPNWYCARCSDAVPGGLFGFAARTSVFLVRVGPGAGESPGTPPFRVIGELVGHTERVSGFTFSHHPGQYNLCATSSDDGTVKIWDVETKTVVTEHALHQHTISTLHWSPRVKDLIVSGDEKGVVFCYWFNRNDSQHLFIEPRTIFCLTCSPHHEDLVAIGYKDGIVVIIDISKKGEVIHRLRGHDDEIHSIAWCPLPGEDCLSINQEETSEEAEITNGNAVAQAPVTKGCYLATGSKDQTIRIWSCSRGRGVMILKLPFLKRRGGGIDPTVKERLWLTLHWPSNQPTQLVSSCFGGELLQWDLTQSWRRKYTLFSASSEGQNHSRIVFNLCPLQTEDDKQLLLSTSMDRDVKCWDIATLECSWTLPSLGGFAYSLAFSSVDIGSLAIGVGDGMIRVWNTLSIKNNYDVKNFWQGVKSKVTALCWHPTKEGCLAFGTDDGKVGLYDTYSNKPPQISSTYHKKTVYTLAWGPPVPPMSLGGEGDRPSLALYSCGGEGIVLQHNPWKLSGEAFDINKLIRDTNSIKYKLPVHTEISWKADGKIMALGNEDGSIEIFQIPNLKLICTIQQHHKLVNTISWHHEHGSQPELSYLMASGSNNAVIYVHNLKTVIESSPESPVTITEPYRTLSGHTAKITSVAWSPHHDGRLVSASYDGTAQVWDALREEPLCNFRGHRGRLLCVAWSPLDPDCIYSGADDFCVHKWLTSMQDHSRPPQGKKSIELEKKRLSQPKAKPKKKKKPTLRTPVKLESIDGNEEESMKENSGPVENGVSDQEGEEQAREPELPCGLAPAVSREPVICTPVSSGFEKSKVTINNKVILLKKEPPKEKPETLIKKRKARSLLPLSTSLDHRSKEELHQDCLVLATAKHSRELNEDVSADVEERFHLGLFTDRATLYRMIDIEGKGHLENGHPELFHQLMLWKGDLKGVLQTAAERGELTDNLVAMAPAAGYHVWLWAVEAFAKQLCFQDQYVKAASHLLSIHKVYEAVELLKSNHFYREAIAIAKARLRPEDPVLKDLYLSWGTVLERDGHYAVAAKCYLGATCAYDAAKVLAKKGDAASLRTAAELAAIVGEDELSASLALRCAQELLLANNWVGAQEALQLHESLQGQRLVFCLLELLSRHLEEKQLSEGKSSSSYHTWNTGTEGPFVERVTAVWKSIFSLDTPEQYQEAFQKLQNIKYPSATNNTPAKQLLLHICHDLTLAVLSQQMASWDEAVQALLRAVVRSYDSGSFTIMQEVYSAFLPDGCDHLRDKLGDHQSPATPAFKSLEAFFLYGRLYEFWWSLSRPCPNSSVWVRAGHRTLSVEPSQQLDTASTEETDPETSQPEPNRPSELDLRLTEEGERMLSTFKELFSEKHASLQNSQRTVAEVQETLAEMIRQHQKSQLCKSTANGPDKNEPEVEAEQPLCSSQSQCKEEKNEPLSLPELTKRLTEANQRMAKFPESIKAWPFPDVLECCLVLLLIRSHFPGCLAQEMQQQAQELLQKYGNTKTYRRHCQTFCM,1508,NP_056280.2.csv,refseq-GEMIN5-NM_015465.4_clinical_seed_0_final,refseq-GEMIN5-NM_015465.4.a2m,Invitae,refseq-GEMIN5-NM_015465.4_theta_0.2.npy,1,1508,1508
+NP_056289.2,MQRADSEQPSKRPRCDDSPRTPSNTPSAEADWSPGLELHPDYKTWGPEQVCSFLRRGGFEEPVLLKNIRENEITGALLPCLDESRFENLGVSSLGERKKLLSYIQRLVQIHVDTMKVINDPIHGHIELHPLLVRIIDTPQFQRLRYIKQLGGGYYVFPGASHNRFEHSLGVGYLAGCLVHALGEKQPELQISERDVLCVQIAGLCHDLGHGPFSHMFDGRFIPLARPEVKWTHEQGSVMMFEHLINSNGIKPVMEQYGLIPEEDICFIKEQIVGPLESPVEDSLWPYKGRPENKSFLYEIVSNKRNGIDVDKWDYFARDCHHLGIQNNFDYKRFIKFARVCEVDNELRICARDKEVGNLYDMFHTRNSLHRRAYQHKVGNIIDTMITDAFLKADDYIEITGAGGKKYRISTAIDDMEAYTKLTDNIFLEILYSTDPKLKDAREILKQIEYRNLFKYVGETQPTGQIKIKREDYESLPKEVASAKPKVLLDVKLKAEDFIVDVINMDYGMQEKNPIDHVSFYCKTAPNRAIRITKNQVSQLLPEKFAEQLIRVYCKKVDRKSLYAARQYFVQWCADRNFTKPQDGDVIAPLITPQKKEWNDSTSVQNPTRLREASKSRVQLFKDDPM,626,NP_056289.2.csv,refseq-SAMHD1-NM_015474.3_clinical_seed_0_final,refseq-SAMHD1-NM_015474.3.a2m,Invitae,refseq-SAMHD1-NM_015474.3.npy,1,626,626
+NP_056303.3,MAAVVAATALKGRGARNARVLRGILAGATANKASHNRTRALQSHSSPEGKEEPEPLSPELEYIPRKRGKNPMKAVGLAWYSLYTRTWLGYLFYRQQLRRARNRYPKGHSKTQPRLFNGVKVLPIPVLSDNYSYLIIDTQAQLAVAVDPSDPRAVQASIEKEGVTLVAILCTHKHWDHSGGNRDLSRRHRDCRVYGSPQDGIPYLTHPLCHQDVVSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPSCLFSGDLLFLSGCGRTFEGNAETMLSSLDTVLGLGDDTLLWPGHEYAEENLGFAGVVEPENLARERKMQWVQRQRLERKGTCPSTLGEERSYNPFLRTHCLALQEALGPGPGPTGDDDYSRAQLLEELRRLKDMHKSK,385,NP_056303.3.csv,refseq-PNKD-NM_015488.4_clinical_seed_0_final,refseq-PNKD-NM_015488.4.a2m,Invitae,refseq-PNKD-NM_015488.4.npy,1,385,385
+NP_056321.2,MEPKVAELKQKIEDTLCPFGFEVYPFQVAWYNELLPPAFHLPLPGPTLAFLVLSTPAMFDRALKPFLQSCHLRMLTDPVDQCVAYHLGRVRESLPELQIEIIADYEVHPNRRPKILAQTAAHVAGAAYYYQRQDVEADPWGNQRISGVCIHPRFGGWFAIRGVVLLPGIEVPDLPPRKPHDCVPTRADRIALLEGFNFHWRDWTYRDAVTPQERYSEEQKAYFSTPPAQRLALLGLAQPSEKPSSPSPDLPFTTPAPKKPGNPSRARSWLSPRVSPPASPGP,282,NP_056321.2.csv,refseq-MMACHC-NM_015506.2_clinical_seed_0_final,refseq-MMACHC-NM_015506.2.a2m,Invitae,refseq-MMACHC-NM_015506.2.npy,1,282,282
+NP_056346.3,MKQRKGQGSGGSRGRKKRGLSDISPSTSLPPLVEGQLRCFLKLTVNRVIWKIAKPPTCVLVRVRWWGETSDGTLFCPRDALQTEPKAVRTTTRYAIRCGPKQFTSYLTDMAVLVLEVITKLDGLPIGRVQINGLAQLSPTHQINGFFTIVSSTSKKLGELQVSLALEPLSETYDSYHPLPTTDMTENVLLSKQGFRENTEPSSTQFQVPSRPRDIHTIKIDGKELAANSSRSTTPRGKDHVCFAENPDTIKDSSFGLQHSLNSGQSLESVTLKGRAPRKQMSLLNSSEFQPQIRTVAKSHSDSCILSSNNLPTKDLLSALLEQGNKLRNAMVISAMKSSPETSMLLDQVHPPINEDSLRASTQIRAFSRNRFKDHIEDHLLPSTENTFWRHDTKADTRAIQLLLGSAELSQGNFWDGLGSPPDSPSPGSDVYCISELNDPQYDQSLLENLFYTAPKSDTSISDFLSEEDDIVPSKKISQSTALARSSKVLESSDHKLKKRSAGKRNRNLVEQQMLSETPEDAQTMTLSVDRLALLGRTHSVRIIIETMGVPPDSPQMTPGKKSYAGPPPKVTTAKKRTFFVEYHFPVGFSESGLGKTALITEVVRLASSKITDGKVKFQQRFVFPVQFGGPMIEHWWNSNLTFQIYVKKTPQKKPEVIGSVSLSLRAVIQSELLSFSDQLPVQQENGQSPFGPLKVTMELITDNKDFTGINTKLSGNTHYTPLCAPTSPNKALPELNQDMTCTKNPQNLNQIHEETAKKAQNLVLPNRKSPSPVAPHPSTFVATPASHNLVNQTNGTTKESALLLHVLLMVPDGKDFISGESEKQSPCNVYLNCKLFSTEEVTRSVIAWGTTQPVFNFSQVIPVSLSSKYLERLKNNVMVIETWNKVRSPGQDKLLGLVKLPLHQFYMSFKDAKISRLLLDAQYPVVAVDSYMPVIDVFSGHQNGSLRVFLAMGSSNQIMALQRLKNEEGTLPPFSPRPAHFLDQPTAASVAMAEDRGNGLMEHCFEIHIEMVKGLAPLQATVWGEADCYVQYYFPVQHSQSSVLKGPEFLENGITLKPFRTATTLCVPDPIFNSEHHHSLLLPAEVPVQRLLLSAFSAQGLVPGGGVQFEIWCRYYYPNVRDQKVAKGTLPLSRICAMVTTQHREDVGIQTFNLPLTPRIENRKELRNQSSGLLDVGLRYRRSPRTAEGVLAARTVSISVQIIRACGLQAAAKALAEREPALQFSATVGVNASVTTHLSFLPQGEQRRTHPVACSFCPEFSHHVEFTCNLVTQHCSGEACFLAELLEFAEVIFAVYHENTKSASDIISIESCKEYLLGVVKVPTKELLIKRSGITGWYPIILPEDGGLPHGLELMQKIVGGLELSISFTHRGDRERVLEAAEHLGWSFENSLKDFVRMDEGEPATVTISTPRLWLPIHCVLLAGHNHIHKNTYCYLRYKFYDHEAFWTPLKKPKESVNKKQIMVTFKASKRAEVTRGPSLLWYFREERLEIQVWRAYGNDSVERPHQTDSWIGSAYVDLARLGERSARTLTVSGVYPLFGRNASNLSGAALRVHVVLSSLSSHLEPTHELDSMDCSSHSESEQLPRRNDEVQLSPPEVISCHQKSPASTQVPCSSTTAEVRLTQEGPADLDGTFAVSILVERAMHLSLKGSPLTERKVSIPSCCVSFATADESSPVYTQVVENTDSPIWNFQQQSRLSKELLLDPQQTLVFKVWHKGDEERVIGFASVDLSPLLSGFQFVCGWYNITDFSGECQGQIKVAVSPLESLIHFKEERQARRGVETSKSLIPIYSPFSFPASDTYAAFSSHMARQTLDQLAHASSKELDFSSPGRSDTTRSQASRHEEHVQNIRRFHESLHLQGEAPLPCDDKLTTSPLSSQTSILTSLRKNLSELDQIQRYFRQKLTKPFLPLSPQTQTAISQHQESCRDHLGPGASSLDPGSQCILEKSSNLVLQVSSLITGSY,1963,NP_056346.3.csv,refseq-C2CD3-NM_015531.5_clinical_seed_0_final,refseq-C2CD3-NM_015531.5.a2m,Invitae,refseq-C2CD3-NM_015531.5.npy,1,1963,1963
+NP_056348.2,MTSKKLVNSVAGCADDALAGLVACNPNLQLLQGHRVALRSDLDSLKGRVALLSGGGSGHEPAHAGFIGKGMLTGVIAGAVFTSPAVGSILAAIRAVAQAGTVGTLLIVKNYTGDRLNFGLAREQARAEGIPVEMVVIGDDSAFTVLKKAGRRGLCGTVLIHKVAGALAEAGVGLEEIAKQVNVVAKAMGTLGVSLSSCSVPGSKPTFELSADEVELGLGIHGEAGVRRIKMATADEIVKLMLDHMTNTTNASHVPVQPGSSVVMMVNNLGGLSFLELGIIADATVRSLEGRGVKIARALVGTFMSALEMPGISLTLLLVDEPLLKLIDAETTAAAWPNVAAVSITGRKRSRVAPAEPQEAPDSTAAGGSASKRMALVLERVCSTLLGLEEHLNALDRAAGDGDCGTTHSRAARAIQEWLKEGPPPASPAQLLSKLSVLLLEKMGGSSGALYGLFLTAAAQPLKAKTSLPAWSAAMDAGLEAMQKYGKAAPGDRTMLDSLWAAGQELQAWKSPGADLLQVLTKAVKSAEAAAEATKNMEAGAGRASYISSARLEQPDPGAVAAAAILRAILEVLQS,575,NP_056348.2.csv,refseq-TKFC-NM_015533.3_clinical_seed_0_final,refseq-TKFC-NM_015533.3.a2m,Invitae,refseq-TKFC-NM_015533.3.npy,1,575,575
+NP_056372.1,MRGPVGTEEELPRLFAEEMENEDEMSEEEDGGLEAFDDFFPVEPVSLPKKKKPKKLKENKCKGKRKKKEGSNDELSENEEDLEEKSESEGSDYSPNKKKKKKLKDKKEKKAKRKKKDEDEDDNDDGCLKEPKSSGQLMAEWGLDDVDYLFSEEDYHTLTNYKAFSQFLRPLIAKKNPKIPMSKMMTVLGAKWREFSANNPFKGSSAAAAAAAVAAAVETVTISPPLAVSPPQVPQPVPIRKAKTKEGKGPGVRKKIKGSKDGKKKGKGKKTAGLKFRFGGISNKRKKGSSSEEDEREESDFDSASIHSASVRSECSAALGKKSKRRRKKKRIDDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYRNYQRKNDMDEPPPFDYGSGDEDGKSEKRKNKDPLYAKMEERFYRYGIKPEWMMIHRILNHSFDKKGDVHYLIKWKDLPYDQCTWEIDDIDIPYYDNLKQAYWGHRELMLGEDTRLPKRLLKKGKKLRDDKQEKPPDTPIVDPTVKFDKQPWYIDSTGGTLHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTVQTIVFLYSLYKEGHSKGPYLVSAPLSTIINWEREFEMWAPDFYVVTYTGDKESRSVIRENEFSFEDNAIRSGKKVFRMKKEVQIKFHVLLTSYELITIDQAILGSIEWACLVVDEAHRLKNNQSKFFRVLNSYKIDYKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRRLKADVFKNMPAKTELIVRVELSQMQKKYYKFILTRNFEALNSKGGGNQVSLLNIMMDLKKCCNHPYLFPVAAVEAPVLPNGSYDGSSLVKSSGKLMLLQKMLKKLRDEGHRVLIFSQMTKMLDLLEDFLEYEGYKYERIDGGITGGLRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIYDSDWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKRKMMLTHLVVRPGLGSKSGSMTKQELDDILKFGTEELFKDDVEGMMSQGQRPVTPIPDVQSSKGGNLAASAKKKHGSTPPGDNKDVEDSSVIHYDDAAISKLLDRNQDATDDTELQNMNEYLSSFKVAQYVVREEDGVEEVEREIIKQEENVDPDYWEKLLRHHYEQQQEDLARNLGKGKRIRKQVNYNDASQEDQEWQDELSDNQSEYSIGSEDEDEDFEERPEGQSGRRQSRRQLKSDRDKPLPPLLARVGGNIEVLGFNARQRKAFLNAIMRWGMPPQDAFNSHWLVRDLRGKSEKEFRAYVSLFMRHLCEPGADGAETFADGVPREGLSRQHVLTRIGVMSLVRKKVQEFEHVNGKYSTPDLIPEGPEGKKSGEVISSDPNTPVPASPAHLLPAPLGLPDKMEAQLGYMDEKDPGAQKPRQPLEVQALPAALDRVESEDKHESPASKERAREERPEETEKAPPSPEQLPREEVLPEKEKILDKLELSLIHSRGDSSELRPDDTKAEEKEPIETQQNGDKEEDDEGKKEDKKGKFKFMFNIADGGFTELHTLWQNEERAAVSSGKIYDIWHRRHDYWLLAGIVTHGYARWQDIQNDPRYMILNEPFKSEVHKGNYLEMKNKFLARRFKLLEQALVIEEQLRRAAYLNMTQDPNHPAMALNARLAEVECLAESHQHLSKESLAGNKPANAVLHKVLNQLEELLSDMKADVTRLPSMLSRIPPVAARLQMSERSILSRLTNRAGDPTIQQGAFGSSQMYSNNFGPNFRGPGPGGIVNYNQMPLGPYVTDI,1954,NP_056372.1.csv,refseq-CHD5-NM_015557.2_clinical_seed_0_final,refseq-CHD5-NM_015557.2.a2m,Invitae,refseq-CHD5-NM_015557.2.npy,1,1954,1954
+NP_056374.2,MESRETLSSSRQRGGESDFLPVSSAKPPAAPGCAGEPLLSTPGPGKGIPVGGERMEPEEEDELGSGRDVDSNSNADSEKWVAGDGLEEQEFSIKEANFTEGSLKLKIQTTKRAKKPPKNLENYICPPEIKITIKQSGDQKVSRAGKNSKATKEEERSHSKKKLLTASDLAASDLKGFQPQAYERPQKHSTLHYDTGLPQDFTGDTLKPKHQQKSSSQNHMDWSTNSDSGPVTQNCFISPESGRETASTSKIPALEPVASFAKAQGKKGSAGNTWSQLSNNNKDLLLGGVAPSPSSHSSPAPPSSSAECNGLQPLVDQDGGGTKEPPEPPTVGSKKKSSKKDVISQTIPNPDLDWVKNAQKAFDNTEGKREGYSADSAQEASPARQNVSSASNPENDSSHVRITIPIKAPSLDPTNHKRKKRQSIKAVVEKIMPEKALASGITMSSEVVNRILSNSEGNKKDPRVPKLSKMIENESPSVGLETGGNAEKVIPGGVSKPRKPPMVMTPPTCTDHSPSRKLPEIQHPKFAAKRRWTCSKPKPSTMLREAVMATSDKLMLEPPSAYPITPSSPLYTNTDSLTVITPVKKKRGRPKKQPLLTVETIHEGTSTSPVSPISREFPGTKKRKRRRNLAKLAQLVPGEDKPMSEMKFHKKVGKLGVLDKKTIKTINKMKTLKRKNILNQILSCSSSVALKAKAPPETSPGAAAIESKLGKQINVSKRGTIYIGKKRGRKPRAELPPPSEEPKTAIKHPRPVSSQPDVPAVPSNFQSLVASSPAAMHPLSTQLGGSNGNLSPASTETNFSELKTMPNLQPISALPTKTQKGIHSGTWKLSPPRLMANSPSHLCEIGSLKEITLSPVSESHSEETIPSDSGIGTDNNSTSDQAEKSSESRRRYSFDFCSLDNPEAIPSDTSTKNRHGHRQKHLIVDNFLAHESLKKPKHKRKRKSLQNRDDLQFLADLEELITKFQVFRISHRSYTFYHENPYPSIFRINFDHYYPVPYIQYDPLLYLRRTSDLKSKKKRGRPAKTNDTMTKVPFLQGFSYPIPSGSYYAPYGMPYTSMPMMNLGYYGQYPAPLYLSHTLGAASPFMRPTVPPPQFHTNSHVKMSGAAKHKAKHGVHLQGPVSMGLGDMQPSLNPPKVGSASLSSGRLHKRKHKHKHKHKEDRILGTHDNLSGLFAGKATGFSSHILSERLSSADKELPLVSEKNKHKEKQKHQHSEAGHKASKNNFEVDTLSTLSLSDAQHWTQAKEKGDLSSEPVDSCTKRYSGSGGDGGSTRSENLDVFSEMNPSNDKWDSDVSGSKRRSYEGFGTYREKDIQAFKMNRKERSSYDSSMSPGMPSPHLKVDQTAVHSKNEGSVPTMMTRKKPAAVDSVTIPPAPVLSLLAASAATSDAVGSSLKKRFKRREIEAIQCEVRKMCNYTKILSTKKNLDHVNKILKAKRLQRQSKTGNNFVKKRRGRPRKQPTQFDEDSRDQMPVLEKCIDLPSKRGQKPSLSPLVLEPAASQDTIMATIEAVIHMAREAPPLPPPPPPPLPPPPPPPLPPPPPLPKTPRGGKRKHKPQAPAQPPQQSPPQQPLPQEEEVKAKRQRKSRGSESEVLP,1596,NP_056374.2.csv,refseq-SETBP1-NM_015559.2_clinical_seed_0_final,refseq-SETBP1-NM_015559.2.a2m,Invitae,refseq-SETBP1-NM_015559.2.npy,1,1596,1596
+NP_056375.2,MWRLRRAAVACEVCQSLVKHSSGIKGSLPLQKLHLVSRSIYHSHHPTLKLQRPQLRTSFQQFSSLTNLPLRKLKFSPIKYGYQPRRNFWPARLATRLLKLRYLILGSAVGGGYTAKKTFDQWKDMIPDLSEYKWIVPDIVWEIDEYIDFEKIRKALPSSEDLVKLAPDFDKIVESLSLLKDFFTSGSPEETAFRATDRGSESDKHFRKVSDKEKIDQLQEELLHTQLKYQRILERLEKENKELRKLVLQKDDKGIHHRKLKKSLIDMYSEVLDVLSDYDASYNTQDHLPRVVVVGDQSAGKTSVLEMIAQARIFPRGSGEMMTRSPVKVTLSEGPHHVALFKDSSREFDLTKEEDLAALRHEIELRMRKNVKEGCTVSPETISLNVKGPGLQRMVLVDLPGVINTVTSGMAPDTKETIFSISKAYMQNPNAIILCIQDGSVDAERSIVTDLVSQMDPHGRRTIFVLTKVDLAEKNVASPSRIQQIIEGKLFPMKALGYFAVVTGKGNSSESIEAIREYEEEFFQNSKLLKTSMLKAHQVTTRNLSLAVSDCFWKMVRESVEQQADSFKATRFNLETEWKNNYPRLRELDRNELFEKAKNEILDEVISLSQVTPKHWEEILQQSLWERVSTHVIENIYLPAAQTMNSGTFNTTVDIKLKQWTDKQLPNKAVEVAWETLQEEFSRFMTEPKGKEHDDIFDKLKEAVKEESIKRHKWNDFAEDSLRVIQHNALEDRSISDKQQWDAAIYFMEEALQARLKDTENAIENMVGPDWKKRWLYWKNRTQEQCVHNETKNELEKMLKCNEEHPAYLASDEITTVRKNLESRGVEVDPSLIKDTWHQVYRRHFLKTALNHCNLCRRGFYYYQRHFVDSELECNDVVLFWRIQRMLAITANTLRQQLTNTEVRRLEKNVKEVLEDFAEDGEKKIKLLTGKRVQLAEDLKKVREIQEKLDAFIEALHQEK,960,NP_056375.2.csv,refseq-OPA1-NM_015560.2_clinical_seed_0_final,refseq-OPA1-NM_015560.2.a2m,Invitae,refseq-OPA1-NM_015560.2_theta_0.2.npy,1,960,960
+NP_056385.1,MDGPTRGHGLRKKRRSRSQRDRERRSRGGLGAGAAGGGGAGRTRALSLASSSGSDKEDNGKPPSSAPSRPRPPRRKRRESTSAEEDIIDGFAMTSFVTFEALEKDVALKPQERVEKRQTPLTKKKREALTNGLSFHSKKSRLSHPHHYSSDRENDRNLCQHLGKRKKMPKALRQLKPGQNSCRDSDSESASGESKGFHRSSSRERLSDSSAPSSLGTGYFCDSDSDQEEKASDASSEKLFNTVIVNKDPELGVGTLPEHDSQDAGPIVPKISGLERSQEKSQDCCKEPIFEPVVLKDPCPQVAQPIPQPQTEPQLRAPSPDPDLVQRTEAPPQPPPLSTQPPQGPPEAQLQPAPQPQVQRPPRPQSPTQLLHQNLPPVQAHPSAQSLSQPLSAYNSSSLSLNSLSSSRSSTPAKTQPAPPHISHHPSASPFPLSLPNHSPLHSFTPTLQPPAHSHHPNMFAPPTALPPPPPLTSGSLQVAGHPAGSTYSEQDILRQELNTRFLASQSADRGASLGPPPYLRTEFHQHQHQHQHTHQHTHQHTFTPFPHAIPPTAIMPTPAPPMFDKYPTKVDPFYRHSLFHSYPPAVSGIPPMIPPTGPFGSLQGAFQPKTSNPIDVAARPGTVPHTLLQKDPRLTDPFRPMLRKPGKWCAMHVHIAWQIYHHQQKVKKQMQSDPHKLDFGLKPEFLSRPPGPSLFGAIHHPHDLARPSTLFSAAGAAHPTGTPFGPPPHHSNFLNPAAHLEPFNRPSTFTGLAAVGGNAFGGLGNPSVTPNSMFGHKDGPSVQNFSNPHEPWNRLHRTPPSFPTPPPWLKPGELERSASAAAHDRDRDVDKRDSSVSKDDKERESVEKRHSSHPSPAPVLPVNALGHTRSSTEQIRAHLNTEAREKDKPKERERDHSESRKDLAADEHKAKEGHLPEKDGHGHEGRAAGEEAKQLARVPSPYVRTPVVESARPNSTSSREAEPRKGEPAYENPKKSSEVKVKEERKEDHDLPPEAPQTHRASEPPPPNSSSSVHPGPLASMPMTVGVTGIHPMNSISSLDRTRMMTPFMGISPLPGGERFPYPSFHWDPIRDPLRDPYRELDIHRRDPLGRDFLLRNDPLHRLSTPRLYEADRSFRDREPHDYSHHHHHHHHPLSVDPRREHERGGHLDERERLHMLREDYEHTRLHSVHPASLDGHLPHPSLITPGLPSMHYPRISPTAGNQNGLLNKTPPTAALSAPPPLISTLGGRPVSPRRTTPLSAEIRERPPSHTLKDIEAR,1259,NP_056385.1.csv,refseq-AUTS2-NM_015570.3_clinical_seed_0_final,refseq-AUTS2-NM_015570.3.a2m,Invitae,refseq-AUTS2-NM_015570.3.npy,1,1259,1259
+NP_056451.3,MAAVAVAVREDSGSGMKAELPPGPGAVGREMTKEEKLQLRKEKKQQKKKRKEEKGAEPETGSAVSAAQCQGPTRELPESGIQLGTPREKVPAGRSKAELRAERRAKQEAERALKQARKGEQGGPPPKASPSTAGETPSGVKRLPEYPQVDDLLLRRLVKKPERQQVPTRKDYGSKVSLFSHLPQYSRQNSLTQFMSIPSSVIHPAMVRLGLQYSQGLVSGSNARCIALLRALQQVIQDYTTPPNEELSRDLVNKLKPYMSFLTQCRPLSASMHNAIKFLNKEITSVGSSKREEEAKSELRAAIDRYVQEKIVLAAQAISRFAYQKISNGDVILVYGCSSLVSRILQEAWTEGRRFRVVVVDSRPWLEGRHTLRSLVHAGVPASYLLIPAASYVLPEVSKVLLGAHALLANGSVMSRVGTAQLALVARAHNVPVLVCCETYKFCERVQTDAFVSNELDDPDDLQCKRGEHVALANWQNHASLRLLNLVYDVTPPELVDLVITELGMIPCSSVPVVLRVKSSDQ,522,NP_056451.3.csv,refseq-EIF2B4-NM_015636.3_clinical_seed_0_final,refseq-EIF2B4-NM_015636.3.a2m,Invitae,refseq-EIF2B4-NM_015636.3.npy,1,522,522
+NP_056477.1,MHLKHLRTLLSPQDGAAKVTCMAWSQNNAKFAVCTVDRVVLLYDEHGERRDKFSTKPADMKYGRKSYMVKGMAFSPDSTKIAIGQTDNIIYVYKIGEDWGDKKVICNKFIQTSAVTCLQWPAEYIIVFGLAEGKVRLANTKTNKSSTIYGTESYVVSLTTNCSGKGILSGHADGTIVRYFFDDEGSGESQGKLVNHPCPPYALAWATNSIVAAGCDRKIVAYGKEGHMLQTFDYSRDPQEREFTTAVSSPGGQSVVLGSYDRLRVFNWIPRRSIWEEAKPKEITNLYTITALAWKRDGSRLCVGTLCGGVEQFDCCLRRSIYKNKFELTYVGPSQVIVKNLSSGTRVVLKSHYGYEVEEVKILGKERYLVAHTSETLLLGDLNTNRLSEIAWQGSGGNEKYFFENENVCMIFNAGELTLVEYGNNDTLGSVRTEFMNPHLISVRINERCQRGTEDNKKLAYLIDIKTIAIVDLIGGYNIGTVSHESRVDWLELNETGHKLLFRDRKLRLHLYDIESCSKTMILNFCSYMQWVPGSDVLVAQNRNSLCVWYNIEAPERVTMFTIRGDVIGLERGGGKTEVMVMEGVTTVAYTLDEGLIEFGTAIDDGNYIRATAFLETLEMTPETEAMWKTLSKLALEARQLHIAERCFSALGQVAKARFLHETNEIADQVSREYGGEGTDFYQVRARLAMLEKNYKLAEMIFLEQNAVEEAMGMYQELHRWDECIAVAEAKGHPALEKLRRSYYQWLMDTQQEERAGELQESQGDGLAAISLYLKAGLPAKAARLVLTREELLANTELVEHITAALIKGELYERAGDLFEKIHNPQKALECYRKGNAFMKAVELARLAFPVEVVKLEEAWGDHLVQQKQLDAAINHYIEARCSIKAIEAALGARQWKKAIYILDLQDRNTASKYYPLVAQHYASLQEYEIAEELYTKGDRTKDAIDMYTQAGRWEQAHKLAMKCMRPEDVSVLYITQAQEMEKQGKYREAERLYVTVQEPDLAITMYKKHKLYDDMIRLVGKHHPDLLSDTHLHLGKELEAEGRLQEAEYHYLEAQEWKATVNMYRASGLWEEAYRVARTQGGANAHKHVAYLWAKSLGGEAAVRLLNKLGLLEAAVDHAADNCSFEFAFELSRLALKHKTPEVHLKYAMFLEDEGKFEEAEAEFIRAGKPKEAVLMFVHNQDWEAAQRVAEAHDPDSVAEVLVGQARGALEEKDFQKAEGLLLRAQRPGLALNYYKEAGLWSDALRICKDYVPSQLEALQEEYEREATKKGARGVEGFVEQARHWEQAGEYSRAVDCYLKVRDSGNSGLAEKCWMKAAELSIKFLPPQRNMEVVLAVGPQLIGIGKHSAAAELYLNLDLVKEAIDAFIEGEEWNKAKRVAKELDPRYEDYVDQHYKEFLKNQGKVDSLVGVDVIAALDLYVEQGQWDKCIETATKQNYKILHKYVALYATHLIREGSSAQALALYVQHGAPANPQNFNIYKRIFTDMVSSPGTNCAEAYHSWADLRDVLFNLCENLVKSSEANSPAHEEFKTMLLIAHYYATRSAAQSVKQLETVAARLSVSLLRHTQLLPVDKAFYEAGIAAKAVGWDNMAFIFLNRFLDLTDAIEEGTLDGLDHSDFQDTDIPFEVPLPAKQHVPEAEREEVRDWVLTVSMDQRLEQVLPRDERGAYEASLVAASTGVRALPCLITGYPILRNKIEFKRPGKAANKDNWNKFLMAIKTSHSPVCQDVLKFISQWCGGLPSTSFSFQ,1749,NP_056477.1.csv,refseq-IFT172-NM_015662.2_clinical_seed_0_final,refseq-IFT172-NM_015662.2.a2m,Invitae,refseq-IFT172-NM_015662.2.npy,1,1749,1749
+NP_056480.1,MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGRLDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRWASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNASSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLSHPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGSKILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPERCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPRVQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIAHIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPHSPHSHL,546,NP_056480.1.csv,refseq-AAAS-NM_015665.5_clinical_seed_0_final,refseq-AAAS-NM_015665.5.a2m,Invitae,refseq-AAAS-NM_015665.5.npy,1,546,546
+NP_056496.1,MATASPSVFLLMVNGQVESAQFPEYDDLYCKYCFVYGQDWAPTAGLEEGISQITSKSQDVRQALVWNFPIDVTFKSTNPYGWPQIVLSVYGPDVFGNDVVRGYGAVHVPFSPGRHKRTIPMFVPESTSKLQKFTSWFMGRRPEYTDPKVVAQGEGREVTRVRSQGFVTLLFNVVTKDMRKLGYDTGPSDTQGVLGPSPPQSFPQ,204,NP_056496.1.csv,refseq-B9D1-NM_015681.3_clinical_seed_0_final,refseq-B9D1-NM_015681.3.a2m,Invitae,refseq-B9D1-NM_015681.3.npy,1,204,204
+NP_056508.2,MASVASCDSRPSSDELPGDPSSQEEDEDYDFEDRVSDSGSYSSASSDYDDLEPEWLDSVQKNGELFYLELSEDEEESLLPETPTVNHVRFSENEIIIEDDYKERKKYEPKLKQFTKILRRKRLLPKRCNKKNSNDNGPVSILKHQSNQKTGVIVQQRYKDVNVYVNPKKLTVIKAKEQLKLLEVLVGIIHQTKWSWRRTGKQGDGERLVVHGLLPGGSAMKSGQVLIGDVLVAVNDVDVTTENIERVLSCIPGPMQVKLTFENAYDVKRETSHPRQKKTQSNTSDLVKLLWGEEVEGIQQSGLNTPHIIMYLTLQLDSETSKEEQEILYHYPMSEASQKLKSVRGIFLTLCDMLENVTGTQVTSSSLLLNGKQIHVAYWKESDKLLLIGLPAEEVPLPRLRNMIENVIQTLKFMYGSLDSAFCQIENVPRLDHFFNLFFQRALQPAKLHSSASPSAQQYDASSAVLLDNLPGVRWLTLPLEIKMELDMALSDLEAADFAELSEDYYDMRRLYTILGSSLFYKGYLICSHLPKDDLIDIAVYCRHYCLLPLAAKQRIGQLIIWREVFPQHHLRPLADSSTEVFPEPEGRYFLLVVGLKHYMLCVLLEAGGCASKAIGSPGPDCVYVDQVKTTLHQLDGVDSRIDERLASSPVPCLSCADWFLTGSREKTDSLTTSPILSRLQGTSKVATSPTCRRTLFGDYSLKTRKPSPSCSSGGSDNGCEGGEDDGFSPHTTPDAVRKQRESQGSDGLEESGTLLKVTKKKSTLPNPFHLGNLKKDLPEKELEIYNTVKLTSGPENTLFHYVALETVQGIFITPTLEEVAQLSGSIHPQLIKNFHQCCLSIRAVFQQTLVEEKKKGLNSGDHSDSAKSVSSLNPVKEHGVLFECSPGNWTDQKKAPPVMAYWVVGRLFLHPKPQELYVCFHDSVTEIAIEIAFKLFFGLTL,942,NP_056508.2.csv,refseq-INTU-NM_015693.3_clinical_seed_0_final,refseq-INTU-NM_015693.3.a2m,Invitae,refseq-INTU-NM_015693.3.npy,1,942,942
+NP_056512.5,MTPISQVRMRKGSAHTAAQPGRLGLHPAGATAHACRGMTSIRARPGLTSAMLGSRAAGFARGLRAVALAWLPGWRGRSFALARAAGAPHGGDLQPPACPEPRGRQLSLSAAAVVDSAPRPLQPYLRLMRLDKPIGTWLLYLPCTWSIGLAAEPGCFPDWYMLSLFGTGAILMRGAGCTINDMWDQDYDKKVTRTANRPIAAGDISTFQSFVFLGGQLTLALGVLLCLNYYSIALGAGSLLLVITYPLMKRISYWPQLALGLTFNWGALLGWSAIKGSCDPSVCLPLYFSGVMWTLIYDTIYAHQDKRDDVLIGLKSTALRFGENTKPWLSGFSVAMLGALSLVGVNSGQTAPYYAALGAVGAHLTHQIYTLDIHRPEDCWNKFISNRTLGLIVFLGIVLGNLWKEKKTDKTKKGIENKIEN,421,NP_056512.5.csv,refseq-COQ2-NM_015697.7_clinical_seed_0_final,refseq-COQ2-NM_015697.7.a2m,Invitae,refseq-COQ2-NM_015697.7.npy,1,421,421
+NP_056517.1,MANVLCNRARLVSYLPGFCSLVKRVVNPKAFSTAGSSGSDESHVAAAPPDICSRTVWPDETMGPFGPQDQRFQLPGNIGFDCHLNGTASQKKSLVHKTLPDVLAEPLSSERHEFVMAQYVNEFQGNDAPVEQEINSAETYFESARVECAIQTCPELLRKDFESLFPEVANGKLMILTVTQKTKNDMTVWSEEVEIEREVLLEKFINGAKEICYALRAEGYWADFIDPSSGLAFFGPYTNNTLFETDERYRHLGFSVDDLGCCKVIRHSLWGTHVVVGSIFTNATPDSHIMKKLSGN,296,NP_056517.1.csv,refseq-MMADHC-NM_015702.2_clinical_seed_0_final,refseq-MMADHC-NM_015702.2.a2m,Invitae,refseq-MMADHC-NM_015702.2.npy,1,296,296
+NP_056536.2,MDLGPLNICEEMTILHGGFLLAEQLFHPKALAELTKSDWERVGRPIVEALREISSAAAHSQPFAWKKKALIIIWAKVLQPHPVTPSDTETRWQEDLFFSVGNMIPTINHTILFELLKSLEASGLFIQLLMALPTTICHAELERFLEHVTVDTSAEDVAFFLDVWWEVMKHKGHPQDPLLSQFSAMAHKYLPALDEFPHPPKRLRSDPDACPTMPLLAMLLRGLTQIQSRILGPGRKCCALANLADMLTVFALTEDDPQEVSATVYLDKLATVISVWNSDTQNPYHQQALAEKVKEAERDVSLTSLAKLPSETIFVGCEFLHHLLREWGEELQAVLRSSQGTSYDSYRLCDSLTSFSQNATLYLNRTSLSKEDRQVVSELAECVRDFLRKTSTVLKNRALEDITASIAMAVIQQKMDRHMEVCYIFASEKKWAFSDEWVACLGSNRALFRQPDLVLRLLETVIDVSTADRAIPESQIRQVIHLILECYADLSLPGKNKVLAGILRSWGRKGLSEKLLAYVEGFQEDLNTTFNQLTQSASEQGLAKAVASVARLVIVHPEVTVKKMCSLAVVNLGTHKFLAQILTAFPALRFVEEQGPNSSATFMVSCLKETVWMKFSTPKEEKQFLELLNCLMSPVKPQGIPVAALLEPDEVLKEFVLPFLRLDVEEVDLSLRIFIQTLEANACREEYWLQTCSPFPLLFSLCQLLDRFSKYWQLPKEKRCLSLDRKDLAIHILELLCEIVSANAETFSPDVWIKSLSWLHRKLEQLDWTVGLRLKSFFEGHFKCEVPATLFEICKLSEDEWTSQAHPGYGAGTGLLAWMECCCVSSGISERMLSLLVVDVGNPEEVRLFSKGFLVALVQVMPWCSPQEWQRLHQLTRRLLEKQLLHVPYSLEYIQFVPLLNLKPFAQELQLSVLFLRTFQFLCSHSCRDWLPLEGWNHVVKLLCGSLTRLLDSVRAIQAAGPWVQGPEQDLTQEALFVYTQVFCHALHIMAMLHPEVCEPLYVLALETLTCYETLSKTNPSVSSLLQRAHEQRFLKSIAEGIGPEERRQTLLQKMSSF,1058,NP_056536.2.csv,refseq-GEMIN4-NM_015721.2_clinical_seed_0_final,refseq-GEMIN4-NM_015721.2.a2m,Invitae,refseq-GEMIN4-NM_015721.2.npy,1,1058,1058
+NP_056651.1,MALHSMRKARERWSFIRALHKGSAAAPALQKDSKKRVFSGIQPTGILHLGNYLGAIESWVRLQDEYDSVLYSIVDLHSITVPQDPAVLRQSILDMTAVLLACGINPEKSILFQQSQVSEHTQLSWILSCMVRLPRLQHLHQWKAKTTKQKHDGTVGLLTYPVLQAADILLYKSTHVPVGEDQVQHMELVQDLAQGFNKKYGEFFPVPESILTSMKKVKSLRDPSAKMSKSDPDKLATVRITDSPEEIVQKFRKAVTDFTSEVTYDPAGRAGVSNIVAVHAAVTGLSVEEVVRRSAGMNTARYKLAVADAVIEKFAPIKREIEKLKLDKDHLEKVLQIGSAKAKELAYTVCQEVKKLVGFL,360,NP_056651.1.csv,refseq-WARS2-NM_015836.3_clinical_seed_0_final,refseq-WARS2-NM_015836.3.a2m,Invitae,refseq-WARS2-NM_015836.3.npy,1,360,360
+NP_056655.3,MNPRQGYSLSGYYTHPFQGYEHRQLRYQQPGPGSSPSSFLLKQIEFLKGQLPEAPVIGKQTPSLPPSLPGLRPRFPVLLASSTRGRQVDIRGVPRGVHLRSQGLQRGFQHPSPRGRSLPQRGVDCLSSHFQELSIYQDQEQRILKFLEELGEGKATTAHDLSGKLGTPKKEINRVLYSLAKKGKLQKEAGTPPLWKIAVSTQAWNQHSGVVRPDGHSQGAPNSDPSLEPEDRNSTSVSEDLLEPFIAVSAQAWNQHSGVVRPDSHSQGSPNSDPGLEPEDSNSTSALEDPLEFLDMAEIKEKICDYLFNVSDSSALNLAKNIGLTKARDINAVLIDMERQGDVYRQGTTPPIWHLTDKKRERMQIKRNTNSVPETAPAAIPETKRNAEFLTCNIPTSNASNNMVTTEKVENGQEPVIKLENRQEARPEPARLKPPVHYNGPSKAGYVDFENGQWATDDIPDDLNSIRAAPGEFRAIMEMPSFYSHGLPRCSPYKKLTECQLKNPISGLLEYAQFASQTCEFNMIEQSGPPHEPRFKFQVVINGREFPPAEAGSKKVAKQDAAMKAMTILLEEAKAKDSGKSEESSHYSTEKESEKTAESQTPTPSATSFFSGKSPVTTLLECMHKLGNSCEFRLLSKEGPAHEPKFQYCVAVGAQTFPSVSAPSKKVAKQMAAEEAMKALHGEATNSMASDNQPEGMISESLDNLESMMPNKVRKIGELVRYLNTNPVGGLLEYARSHGFAAEFKLVDQSGPPHEPKFVYQAKVGGRWFPAVCAHSKKQGKQEAADAALRVLIGENEKAERMGFTELPLTGSTFHDQIAMLSHRCFNTLTNSFQPSLLGRKILAAIIMKKDSEDMGVVVSLGTGNRCVKGDSLSLKGETVNDCHAEIISRRGFIRFLYSELMKYNSQTAKDSIFEPAKGGEKLQIKKTVSFHLYISTAPCGDGALFDKSCSDRAMESTESRHYPVFENPKQGKLRTKVENGEGTIPVESSDIVPTWDGIRLGERLRTMSCSDKILRWNVLGLQGALLTHFLQPIYLKSVTLGYLFSQGHLTRAICCRVTRDGSAFEDGLRHPFIVNHPKVGRVSIYDSKRQSGKTKETSVNWCLADGYDLEILDGTRGTVDGPRNELSRVSKKNIFLLFKKLCSFRYRRDLLRLSYGEAKKAARDYETAKNYFKKGLKDMGYGNWISKPQEEKNFYLCPV,1200,NP_056655.3.csv,refseq-ADAR-NM_015840.4_clinical_seed_0_final,refseq-ADAR-NM_015840.4.a2m,Invitae,refseq-ADAR-NM_015840.4.npy,1,1200,1200
+NP_056953.2,MGETLGDSPIDPESDSFTDTLSANISQEMTMVDTEMPFWPTNFGISSVDLSVMEDHSHSFDIKPFTTVDFSSISTPHYEDIPFTRTDPVVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEPSNSLMAIECRVCGDKASGFHYGVHACEGCKGFFRRTIRLKLIYDRCDLNCRIHKKSRNKCQYCRFQKCLAVGMSHNAIRFGRMPQAEKEKLLAEISSDIDQLNPESADLRALAKHLYDSYIKSFPLTKAKARAILTGKTTDKSPFVIYDMNSLMMGEDKIKFKHITPLQEQSKEVAIRIFQGCQFRSVEAVQEITEYAKSIPGFVNLDLNDQVTLLKYGVHEIIYTMLASLMNKDGVLISEGQGFMTREFLKSLRKPFGDFMEPKFEFAVKFNALELDDSDLAIFIAVIILSGDRPGLLNVKPIEDIQDNLLQALELQLKLNHPESSQLFAKLLQKMTDLRQIVTEHVQLLQVIKKTETDMSLHPLLQEIYKDLY,505,NP_056953.2.csv,refseq-PPARG-NM_015869.4_clinical_seed_0_final,refseq-PPARG-NM_015869.4.a2m,Invitae,refseq-PPARG-NM_015869.4.npy,1,505,505
+NP_056968.1,MIPVSLVVVVVGGWTVVYLTDLVLKSSVYFKHSYEDWLENNGLSISPFHIRWQTAVFNRAFYSWGRRKARMLYQWFNFGMVFGVIAMFSSFFLLGKTLMQTLAQMMADSPSSYSSSSSSSSSSSSSSSSSSSSSSSLHNEQVLQVVVPGINLPVNQLTYFFTAVLISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQLRIFCAGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVGDLVTHLQDCPVTNVQDWNECLDTIAYEPQIGYCISASTLQQLSFPVRAYKRLDGSTECCNNHSLTDVCFSYRNNFNKRLHTCLPARKAVEATQVCRTNKDCKKSSSSSFCIIPSLETHTRLIKVKHPPQIDMLYVGHPLHLHYTVSITSFIPRFNFLSIDLPVVVETFVKYLISLSGALAIVNAVPCFALDGQWILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVLLAANVTLGLWMVTAR,519,NP_056968.1.csv,refseq-MBTPS2-NM_015884.3_clinical_seed_0_final,refseq-MBTPS2-NM_015884.3.a2m,Invitae,refseq-MBTPS2-NM_015884.3.npy,1,519,519
+NP_056979.1,MNPSMKQKQEEIKENIKNSSVPRRTLKMIQPSASGSLVGRENELSAGLSKRKHRNDHLTSTTSSPGVIVPESSENKNLGGVTQESFDLMIKENPSSQYWKEVAEKRRKALYEALKENEKLHKEIEQKDNEIARLKKENKELAEVAEHVQYMAELIERLNGEPLDNFESLDNQEFDSEEETVEDSLVEDSEIGTCAEGTVSSSTDAKPCI,209,NP_056979.1.csv,refseq-GMNN-NM_015895.4_clinical_seed_0_final,refseq-GMNN-NM_015895.4.a2m,Invitae,refseq-GMNN-NM_015895.4.npy,1,209,209
+NP_056980.2,MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQQHENLEKLNMQAILDATVSQGEPIQELLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHHEASIINLLETVFFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPMQELQKQAELMEFEIALKALSVLRYITDCVDSLSLSTLSRMLSTHNLPCLLVELLEHSPWSRREGGKLQQFEGSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLRAFLTDTLLDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIAKHQLQHVFSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQNEWYCCRECQVKHWEKHGKTCVLAAQGDRAK,440,NP_056980.2.csv,refseq-ZMYND10-NM_015896.2_clinical_seed_0_final,refseq-ZMYND10-NM_015896.2.a2m,Invitae,refseq-ZMYND10-NM_015896.2.npy,1,440,440
+NP_056989.2,MEVAVEKAVAAAAAASAAASGGPSAAPSGENEAESRQGPDSERGGEAARLNLLDTCAVCHQNIQSRAPKLLPCLHSFCQRCLPAPQRYLMLPAPMLGSAETPPPVPAPGSPVSGSSPFATQVGVIRCPVCSQECAERHIIDNFFVKDTTEVPSSTVEKSNQVCTSCEDNAEANGFCVECVEWLCKTCIRAHQRVKFTKDHTVRQKEEVSPEAVGVTSQRPVFCPFHKKEQLKLYCETCDKLTCRDCQLLEHKEHRYQFIEEAFQNQKVIIDTLITKLMEKTKYIKFTGNQIQNRIIEVNQNQKQVEQDIKVAIFTLMVEINKKGKALLHQLESLAKDHRMKLMQQQQEVAGLSKQLEHVMHFSKWAVSSGSSTALLYSKRLITYRLRHLLRARCDASPVTNNTIQFHCDPSFWAQNIINLGSLVIEDKESQPQMPKQNPVVEQNSQPPSGLSSNQLSKFPTQISLAQLRLQHMQQQVMAQRQQVQRRPAPVGLPNPRMQGPIQQPSISHQQPPPRLINFQNHSPKPNGPVLPPHPQQLRYPPNQNIPRQAIKPNPLQMAFLAQQAIKQWQISSGQGTPSTTNSTSSTPSSPTITSAAGYDGKAFGSPMIDLSSPVGGSYNLPSLPDIDCSSTIMLDNIVRKDTNIDHGQPRPPSNRTVQSPNSSVPSPGLAGPVTMTSVHPPIRSPSASSVGSRGSSGSSSKPAGADSTHKVPVVMLEPIRIKQENSGPPENYDFPVVIVKQESDEESRPQNANYPRSILTSLLLNSSQSSTSEETVLRSDAPDSTGDQPGLHQDNSSNGKSEWLDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLLLFLYCHEMSLAFQDPVPLTVPDYYKIIKNPMDLSTIKKRLQEDYSMYSKPEDFVADFRLIFQNCAEFNEPDSEVANAGIKLENYFEELLKNLYPEKRFPKPEFRNESEDNKFSDDSDDDFVQPRKKRLKSIEERQLLK,1050,NP_056989.2.csv,refseq-TRIM24-NM_015905.2_clinical_seed_0_final,refseq-TRIM24-NM_015905.2.a2m,Invitae,refseq-TRIM24-NM_015905.2.npy,1,1050,1050
+NP_056999.2,MAKNRRDRNSWGGFSEKTYEWSSEEEEPVKKAGPVQVLIVKDDHSFELDETALNRILLSEAVRDKEVVAVSVAGAFRKGKSFLMDFMLRYMYNQESVDWVGDYNEPLTGFSWRGGSERETTGIQIWSEIFLINKPDGKKVAVLLMDTQGTFDSQSTLRDSATVFALSTMISSIQVYNLSQNVQEDDLQHLQLFTEYGRLAMEETFLKPFQSLIFLVRDWSFPYEFSYGADGGAKFLEKRLKVSGNQHEELQNVRKHIHSCFTNISCFLLPHPGLKVATNPNFDGKLKEIDDEFIKNLKILIPWLLSPESLDIKEINGNKITCRGLVEYFKAYIKIYQGEELPHPKSMLQATAEANNLAAVATAKDTYNKKMEEICGGDKPFLAPNDLQTKHLQLKEESVKLFRGVKKMGGEEFSRRYLQQLESEIDELYIQYIKHNDSKNIFHAARTPATLFVVIFITYVIAGVTGFIGLDIIASLCNMIMGLTLITLCTWAYIRYSGEYRELGAVIDQVAAALWDQGSTNEALYKLYSAAATHRHLYHQAFPTPKSESTEQSEKKKM,558,NP_056999.2.csv,refseq-ATL1-NM_015915.4_clinical_seed_0_final,refseq-ATL1-NM_015915.4.a2m,Invitae,refseq-ATL1-NM_015915.4.npy,1,558,558
+NP_057006.1,MEPAVSEPMRDQVARTHLTEDTPKVNADIEKVNQNQAKRCTVIGGSGFLGQHMVEQLLARGYAVNVFDIQQGFDNPQVRFFLGDLCSRQDLYPALKGVNTVFHCASPPPSSNNKELFYRVNYIGTKNVIETCKEAGVQKLILTSSASVIFEGVDIKNGTEDLPYAMKPIDYYTETKILQERAVLGANDPEKNFLTTAIRPHGIFGPRDPQLVPILIEAARNGKMKFVIGNGKNLVDFTFVENVVHGHILAAEQLSRDSTLGGKAFHITNDEPIPFWTFLSRILTGLNYEAPKYHIPYWVAYYLALLLSLLVMVISPVIQLQPTFTPMRVALAGTFHYYSCERAKKAMGYQPLVTMDDAMERTVQSFRHLRRVK,373,NP_057006.1.csv,refseq-NSDHL-NM_015922.2_clinical_seed_0_final,refseq-NSDHL-NM_015922.2.a2m,Invitae,refseq-NSDHL-NM_015922.2.npy,1,373,373
+NP_057021.2,MAAAMPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHYRLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWKELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREVVCTENLTPWKKLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSVVFDAFITGQGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKTYAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTHPYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANSVTKVSIQFERALLKWTEYTPDPNHGFYVSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSDGSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTVVAVCYGSFYNLLTRTFHIEEPRTGGLAKRLANLIRRARGVPPL,578,NP_057021.2.csv,refseq-PIGT-NM_015937.5_clinical_seed_0_final,refseq-PIGT-NM_015937.5.a2m,Invitae,refseq-PIGT-NM_015937.5.npy,1,578,578
+NP_057043.1,MAVLAPLIALVYSVPRLSRWLAQPYYLLSALLSAAFLLVRKLPPLCHGLPTQREDGNPCDFDWREVEILMFLSAIVMMKNRRSITVEQHIGNIFMFSKVANTILFFRLDIRMGLLYITLCIVFLMTCKPPLYMGPEYIKYFNDKTIDEELERDKRVTWIVEFFANWSNDCQSFAPIYADLSLKYNCTGLNFGKVDVGRYTDVSTRYKVSTSPLTKQLPTLILFQGGKEAMRRPQIDKKGRAVSWTFSEENVIREFNLNELYQRAKKLSKAGDNIPEEQPVASTPTTVSDGENKKDK,296,NP_057043.1.csv,refseq-TMX2-NM_015959.3_clinical_seed_0_final,refseq-TMX2-NM_015959.3.a2m,Invitae,refseq-TMX2-NM_015959.3.npy,1,296,296
+NP_057049.5,MAASKVKQDMPPPGGYGPIDYKRNLPRRGLSGYSMLAIGIGTLIYGHWSIMKWNRERRRLQIEDFEARIALLPLLQAETDRRTLQMLRENLEEEAIIMKDVPDWKVGESVFHTTRWVPPLIGELYGLRTTEEALHASHGFMWYT,144,NP_057049.5.csv,refseq-NDUFA13-NM_015965.6_clinical_seed_0_final,refseq-NDUFA13-NM_015965.6.a2m,Invitae,refseq-NDUFA13-NM_015965.6.npy,1,144,144
+NP_057062.1,MGNYKSRPTQTCTDEWKKKVSESYVITIERLEDDLQIKEKELTELRNIFGSDEAFSKVNLNYRTENGLSLLHLCCICGGKKSHIRTLMLKGLRPSRLTRNGFTALHLAVYKDNAELITSLLHSGADIQQVGYGGLTALHIATIAGHLEAADVLLQHGANVNIQDAVFFTPLHIAAYYGHEQVTRLLLKFGADVNVSGEVGDRPLHLASAKGFLNIAKLLMEEGSKADVNAQDNEDHVPLHFCSRFGHHDIVKYLLQSDLEVQPHVVNIYGDTPLHLACYNGKFEVAKEIIQISGTESLTKENIFSETAFHSACTYGKSIDLVKFLLDQNVININHQGRDGHTGLHSACYHGHIRLVQFLLDNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPGGDGSYVSVPSPLGKIKSMTKEKADILLLRAGLPSHFHLQLSEIEFHEIIGSGSFGKVYKGRCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAIVTQYISGGSLFSLLHEQKRILDLQSKLIIAVDVAKGMEYLHNLTQPIIHRDLNSHNILLYEDGHAVVADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILTGEIPFAHLKPAAAAADMAYHHIRPPIGYSIPKPISSLLIRGWNACPEGRPEFSEVVMKLEECLCNIELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAALSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS,835,NP_057062.1.csv,refseq-TNNI3K-NM_015978.2_clinical_seed_0_final,refseq-TNNI3K-NM_015978.2.a2m,Invitae,refseq-TNNI3K-NM_015978.2.npy,1,835,835
+NP_057075.1,MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFLIFPSA,245,NP_057075.1.csv,refseq-C1QA-NM_015991.2_clinical_seed_0_final,refseq-C1QA-NM_015991.2.a2m,Invitae,refseq-C1QA-NM_015991.2.npy,1,245,245
+NP_057090.2,MAAEEEEVDSADTGERSGWLTGWLPTWCPTSISHLKEAEEKMLKCVPCTYKKEPVRISNGNKIWTLKFSHNISNKTPLVLLHGFGGGLGLWALNFGDLCTNRPVYAFDLLGFGRSSRPRFDSDAEEVENQFVESIEEWRCALGLDKMILLGHNLGGFLAAAYSLKYPSRVNHLILVEPWGFPERPDLADQDRPIPVWIRALGAALTPFNPLAGLRIAGPFGLSLVQRLRPDFKRKYSSMFEDDTVTEYIYHCNVQTPSGETAFKNMTIPYGWAKRPMLQRIGKMHPDIPVSVIFGARSCIDGNSGTSIQSLRPHSYVKTIAILGAGHYVYADQPEEFNQKVKEICDTVD,349,NP_057090.2.csv,refseq-ABHD5-NM_016006.4_clinical_seed_0_final,refseq-ABHD5-NM_016006.4.a2m,Invitae,refseq-ABHD5-NM_016006.4.npy,1,349,349
+NP_057092.2,MPSETLWEIAKAEVEKRGINGSEGDGAEIAEKFVFFIGSKNGGKTTIILRCLDRDEPPKPTLALEYTYGRRAKGHNTPKDIAHFWELGGGTSLLDLISIPITGDTLRTFSLVLVLDLSKPNDLWPTMENLLQATKSHVDKVIMKLGKTNAKAVSEMRQKIWNNMPKDHPDHELIDPFPVPLVIIGSKYDVFQDFESEKRKVICKTLRFVAHYYGASLMFTSKSEALLLKIRGVINQLAFGIDKSKSICVDQNKPLFITAGLDSFGQIGSPPVPENDIGKLHAHSPMELWKKVYEKLFPPKSINTLKDIKDPARDPQYAENEVDEMRIQKDLELEQYKRSSSKSWKQIELDS,351,NP_057092.2.csv,refseq-DYNC2LI1-NM_016008.3_clinical_seed_0_final,refseq-DYNC2LI1-NM_016008.3.a2m,Invitae,refseq-DYNC2LI1-NM_016008.3.npy,1,351,351
+NP_057097.2,MALVHKLLRGTYFLRKFSKPTSALYPFLGIRFAEYSSSLQKPVASPGKASSQRKTEGDLQGDHQKEVALDITSSEEKPDVSFDKAIRDEAIYHFRLLKDEIVDHWRGPEGHPLHEVLLEQAKVVWQFRGKEDLDKWTVTSDKTIGGRSEVFLKMGKNNQSALLYGTLSSEAPQDGESTRSGYCAMISRIPRGAFERKMSYDWSQFNTLYLRVRGDGRPWMVNIKEDTDFFQRTNQMYSYFMFTRGGPYWQEVKIPFSKFFFSNRGRIRDVQHELPLDKISSIGFTLADKVDGPFFLEIDFIGVFTDPAHTEEFAYENSPELNPRLFK,327,NP_057097.2.csv,refseq-NDUFAF1-NM_016013.3_clinical_seed_0_final,refseq-NDUFAF1-NM_016013.3.a2m,Invitae,refseq-NDUFAF1-NM_016013.3.npy,1,327,327
+NP_057116.2,MSVMVVRKKVTRKWEKLPGRNTFCCDGRVMMARQKGIFYLTLFLILGTCTLFFAFECRYLAVQLSPAIPVFAAMLFLFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPPRIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYLFILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSVVGLTGFHTFLVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSRPPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK,364,NP_057116.2.csv,refseq-ZDHHC9-NM_016032.3_clinical_seed_0_final,refseq-ZDHHC9-NM_016032.3.a2m,Invitae,refseq-ZDHHC9-NM_016032.3.npy,1,364,364
+NP_057118.1,MATSSAALPRILGAGARAPSRWLGFLGKATPRPARPSRRTLGSATALMIRESEDSTDFNDKILNEPLKHSDFFNVKELFSVRSLFDARVHLGHKAGCRHRFMEPYIFGSRLDHDIIDLEQTATHLQLALNFTAHMAYRKGIILFISRNRQFSYLIENMARDCGEYAHTRYFRGGMLTNARLLFGPTVRLPDLIIFLHTLNNIFEPHVAVRDAAKMNIPTVGIVDTNCNPCLITYPVPGNDDSPLAVHLYCRLFQTAITRAKEKRQQVEALYRLQGQKEPGDQGPAHPPGADMSHSL,296,NP_057118.1.csv,refseq-MRPS2-NM_016034.4_clinical_seed_0_final,refseq-MRPS2-NM_016034.4.a2m,Invitae,refseq-MRPS2-NM_016034.4.npy,1,296,296
+NP_057119.3,MATLLRPVLRRLCGLPGLQRPAAEMPLRARSDGAGPLYSHHLPTSPLQKGLLAAGSAAMALYNPYRHDMVAVLGETTGHRTLKVLRDQMRRDPEGAQILQERPRISTSTLDLGKLQSLPEGSLGREYLRFLDVNRVSPDTRAPTRFVDDEELAYVIQRYREVHDMLHTLLGMPTNILGEIVVKWFEAVQTGLPMCILGAFFGPIRLGAQSLQVLVSELIPWAVQNGRRAPCVLNLYYERRWEQSLRALREELGITAPPMHVQGLA,265,NP_057119.3.csv,COQ4_HUMAN_b03_clinical_seed_0_final,COQ4_HUMAN_b03.a2m,EVE,COQ4_HUMAN_b03_theta_0.2.npy,1,265,265
+NP_057122.2,MSIFTPTNQIRLTNVAVVRMKRAGKRFEIACYKNKVVGWRSGVEKDLDEVLQTHSVFVNVSKGQVAKKEDLISAFGTDDQTEICKQILTKGEVQVSDKERHTQLEQMFRDIATIVADKCVNPETKRPYTVILIERAMKDIHYSVKTNKSTKQQALEVIKQLKEKMKIERAHMRLRFILPVNEGKKLKEKLKPLIKVIESEDYGQQLEIVCLIDPGCFREIDELIKKETKGKGSLEVLNLKDVEEGDEKFE,250,NP_057122.2.csv,refseq-SBDS-NM_016038.3_clinical_seed_0_final,refseq-SBDS-NM_016038.3.a2m,Invitae,refseq-SBDS-NM_016038.3_theta_0.2.npy,1,250,250
+NP_057126.2,MAEPASVAAESLAGSRARAARTVLGQVVLPGEELLLPEQEDAEGPGGAVERPLSLNARACSRVRVVCGPGLRRCGDRLLVTKCGRLRHKEPGSGSGGGVYWVDSQQKRYVPVKGDHVIGIVTAKSGDIFKVDVGGSEPASLSYLSFEGATKRNRPNVQVGDLIYGQFVVANKDMEPEMVCIDSCGRANGMGVIGQDGLLFKVTLGLIRKLLAPDCEIIQEVGKLYPLEIVFGMNGRIWVKAKTIQQTLILANILEACEHMTSDQRKQIFSRLAES,275,NP_057126.2.csv,refseq-EXOSC3-NM_016042.3_clinical_seed_0_final,refseq-EXOSC3-NM_016042.3.a2m,Invitae,refseq-EXOSC3-NM_016042.3.npy,1,275,275
+NP_057161.1,MPSKSLVMEYLAHPSTLGLAVGVACGMCLGWSLRVCFGMLPKSKTSKTHTDTESEASILGDSGEYKMILVVRNDLKMGKGKVAAQCSHAAVSAYKQIQRRNPEMLKQWEYCGQPKVVVKAPDEETLIALLAHAKMLGLTVSLIQDAGRTQIAPGSQTVLGIGPGPADLIDKVTGHLKLY,179,NP_057161.1.csv,refseq-PTRH2-NM_016077.4_clinical_seed_0_final,refseq-PTRH2-NM_016077.4.a2m,Invitae,refseq-PTRH2-NM_016077.4.npy,1,179,179
+NP_057182.1,MAGALVRKAADYVRSKDFRDYLMSTHFWGPVANWGLPIAAINDMKKSPEIISGRMTFALCCYSLTFMRFAYKVQPRNWLLFACHATNEVAQLIQGGRLIKHEMTKTASA,109,NP_057182.1.csv,refseq-MPC1-NM_016098.3_clinical_seed_0_final,refseq-MPC1-NM_016098.3.a2m,Invitae,refseq-MPC1-NM_016098.3.npy,1,109,109
+NP_057195.2,MEPAPSEVRLAVREAIHALSSSEDGGHIFCTLESLKRYLGEMEPPALPREKEEFASAHFSPVLRCLASRLSPAWLELLPHGRLEELWASFFLEGPADQAFLVLMETIEGAAGPSFRLMKMARLLARFLREGRLAVLMEAQCRQQTQPGFILLRETLLGKVVALPDHLGNRLQQENLAEFFPQNYFRLLGEEVVRVLQAVVDSLQGGLDSSVSFVSQVLGKACVHGRQQEILGVLVPRLAALTQGSYLHQRVCWRLVEQVPDRAMEAVLTGLVEAALGPEVLSRLLGNLVVKNKKAQFVMTQKLLFLQSRLTTPMLQSLLGHLAMDSQRRPLLLQVLKELLETWGSSSAIRHTPLPQQRHVSKAVLICLAQLGEPELRDSRDELLASMMAGVKCRLDSSLPPVRRLGMIVAEVVSARIHPEGPPLKFQYEEDELSLELLALASPQPAGDGASEAGTSLVPATAEPPAETPAEIVDGGVPQAQLAGSDSDLDSDDEFVPYDMSGDRELKSSKAPAYVRDCVEALTTSEDIERWEAALRALEGLVYRSPTATREVSVELAKVLLHLEEKTCVVGFAGLRQRALVAVTVTDPAPVADYLTSQFYALNYSLRQRMDILDVLTLAAQELSRPGCLGRTPQPGSPSPNTPCLPEAAVSQPGSAVASDWRVVVEERIRSKTQRLSKGGPRQGPAGSPSRFNSVAGHFFFPLLQRFDRPLVTFDLLGEDQLVLGRLAHTLGALMCLAVNTTVAVAMGKALLEFVWALRFHIDAYVRQGLLSAVSSVLLSLPAARLLEDLMDELLEARSWLADVAEKDPDEDCRTLALRALLLLQRLKNRLLPPASP,837,NP_057195.2.csv,refseq-TELO2-NM_016111.3_clinical_seed_0_final,refseq-TELO2-NM_016111.3.a2m,Invitae,refseq-TELO2-NM_016111.3.npy,1,837,837
+NP_057206.2,MVVSTFTDMDTFPNNFPPGGDSGLTGSQSEFQKMLIDERLRCEHHKANYQTLKAEHTRLQNEHVKLQNELKHLFNEKQTQQEKLQLLLEELRGELVEKTKDLEEMKLQILTPQKLELLRAQIQQELETPMRERFRNLDEEVEKYRAVYNKLRYEHTFLKSEFEHQKEEYARILDEGKIKYESEIARLEEDKEELRNQLLNVDLTKDSKRVEQLAREKVYLCQKLKGLEAEVAELKAEKENSEAQVENAQRIQVRQLAEMQATVRSLEAEKQSANLRAERLEKELQSSSEQNTFLINKLHKAEREINTLSSKVKELKHSNKLEITDIKLETARAKSELERERNKIQSELDGLQSDNEILKAAVEHHKVLLVEKDRELIRKVQAAKEEGYQKLVVLQDEKLELENRLADLEKMKVEHDVWRQSEKDQYEEKLRASQMAEEITRKELQSVRLKLQQQIVTIENAEKEKNENSDLKQQISSLQIQVTSLAQSENDLLNSNQMLKEMVERLKQECRNFRSQAEKAQLEAEKTLEEKQIQWLEEKHKLHERITDREEKYNQAKEKLQRAAIAQKKRKSLHENKLKRLQEKVEVLEAKKEELETENQVLNRQNVPFEDYTRLQKRLKDIQRRHNEFRSLILVPNMPPTASINPVSFQSSAMVPSMELPFPPHMQEEQHQRELSLLRKRLEELETTQRKQLEELGSSGE,701,NP_057206.2.csv,refseq-CEP83-NM_016122.2_clinical_seed_0_final,refseq-CEP83-NM_016122.2.a2m,Invitae,refseq-CEP83-NM_016122.2.npy,1,701,701
+NP_057240.3,MEKSSSCESLGSQPAAARPPSVDSLSSASTSHSENSVHTKSASVVSSDSISTSADNFSPDLRVLRESNKLAEMEEPPLLPGENIKDMAKDVTYICPFTGAVRGTLTVTNYRLYFKSMERDPPFVLDASLGVINRVEKIGGASSRGENSYGLETVCKDIRNLRFAHKPEGRTRRSIFENLMKYAFPVSNNLPLFAFEYKEVFPENGWKLYDPLLEYRRQGIPNESWRITKINERYELCDTYPALLVVPANIPDEELKRVASFRSRGRIPVLSWIHPESQATITRCSQPMVGVSGKRSKEDEKYLQAIMDSNAQSHKIFIFDARPSVNAVANKAKGGGYESEDAYQNAELVFLDIHNIHVMRESLRKLKEIVYPNIEETHWLSNLESTHWLEHIKLILAGALRIADKVESGKTSVVVHCSDGWDRTAQLTSLAMLMLDGYYRTIRGFEVLVEKEWLSFGHRFQLRVGHGDKNHADADRSPVFLQFIDCVWQMTRQFPTAFEFNEYFLITILDHLYSCLFGTFLCNSEQQRGKENLPKRTVSLWSYINSQLEDFTNPLYGSYSNHVLYPVASMRHLELWVGYYIRWNPRMKPQEPIHNRYKELLAKRAELQKKVEELQREISNRSTSSSERASSPAQCVTPVQTVV,643,NP_057240.3.csv,refseq-MTMR2-NM_016156.5_clinical_seed_0_final,refseq-MTMR2-NM_016156.5.a2m,Invitae,refseq-MTMR2-NM_016156.5.npy,1,643,643
+NP_057264.4,MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVLLSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEAPLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGLCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKERQQAPGGDPDNSVRGKGMDCATLTCMVQLAQILVGGGLGFLVNTAGTVVVVVITASAVALIGCCFVALFVRYVD,530,NP_057264.4.csv,S45A2_HUMAN_b03_clinical_seed_0_final,S45A2_HUMAN_b03.a2m,EVE,S45A2_HUMAN_b03_theta_0.2.npy,1,530,530
+NP_057272.1,MSGGVYGGDEVGALVFDIGSFSVRAGYAGEDCPKADFPTTVGLLAAEEGGGLELEGDKEKKGKIFHIDTNALHVPRDGAEVMSPLKNGMIEDWECFRAILDHTYSKHVKSEPNLHPVLMSEAPWNTRAKREKLTELMFEQYNIPAFFLCKTAVLTAFANGRSTGLVLDSGATHTTAIPVHDGYVLQQGIVKSPLAGDFISMQCRELFQEMAIDIIPPYMIAAKEPVREGAPPNWKKKEKLPQVSKSWHNYMCNEVIQDFQASVLQVSDSPYDEQVAAQMPTVHYEMPNGYNTDYGAERLRIPEGLFDPSNVKGLSGNTMLGVGHVVTTSIGMCDIDIRPGLYGSVIVTGGNTLLQGFTDRLNRELSQKTPPSMRLKLIASNSTMERKFSPWIGGSILASLGTFQQMWISKQEYEEGGKQCVERKCP,426,NP_057272.1.csv,refseq-ACTL6B-NM_016188.4_clinical_seed_0_final,refseq-ACTL6B-NM_016188.4.a2m,Invitae,refseq-ACTL6B-NM_016188.4.npy,1,426,426
+NP_057278.2,MCDQTFLVNVFGSCDKCFKQRALRPVFKKSQQLSYCSTCAEIMATEGLHENETLASLKSEAESLKGKLEEERAKLHDVELHQVAERVEALGQFVMKTRRTLKGHGNKVLCMDWCKDKRRIVSSSQDGKVIVWDSFTTNKEHAVTMPCTWVMACAYAPSGCAIACGGLDNKCSVYPLTFDKNENMAAKKKSVAMHTNYLSACSFTNSDMQILTASGDGTCALWDVESGQLLQSFHGHGADVLCLDLAPSETGNTFVSGGCDKKAMVWDMRSGQCVQAFETHESDINSVRYYPSGDAFASGSDDATCRLYDLRADREVAIYSKESIIFGASSVDFSLSGRLLFAGYNDYTINVWDVLKGSRVSILFGHENRVSTLRVSPDGTAFCSGSWDHTLRVWA,395,NP_057278.2.csv,refseq-GNB5-NM_016194.3_clinical_seed_0_final,refseq-GNB5-NM_016194.3.a2m,Invitae,refseq-GNB5-NM_016194.3.npy,1,395,395
+NP_057287.2,MGSAVMDTKKKKDVSSPGGSGGKKNASQKRRSLRVHIPDLSSFAMPLLDGDLEGSGKHSSRKVDSPFGPGSPSKGFFSRGPQPRPSSPMSAPVRPKTSPGSPKTVFPFSYQESPPRSPRRMSFSGIFRSSSKESSPNSNPATSPGGIRFFSRSRKTSGLSSSPSTPTQVTKQHTFPLESYKHEPERLENRIYASSSPPDTGQRFCPSSFQSPTRPPLASPTHYAPSKAAALAAALGPAEAGMLEKLEFEDEAVEDSESGVYMRFMRSHKCYDIVPTSSKLVVFDTTLQVKKAFFALVANGVRAAPLWESKKQSFVGMLTITDFINILHRYYKSPMVQIYELEEHKIETWRELYLQETFKPLVNISPDASLFDAVYSLIKNKIHRLPVIDPISGNALYILTHKRILKFLQLFMSDMPKPAFMKQNLDELGIGTYHNIAFIHPDTPIIKALNIFVERRISALPVVDESGKVVDIYSKFDVINLAAEKTYNNLDITVTQALQHRSQYFEGVVKCNKLEILETIVDRIVRAEVHRLVVVNEADSIVGIISLSDILQALILTPAGAKQKETETE,569,NP_057287.2.csv,refseq-PRKAG2-NM_016203.3_clinical_seed_0_final,refseq-PRKAG2-NM_016203.3.a2m,Invitae,refseq-PRKAG2-NM_016203.3.npy,1,569,569
+NP_057288.1,MCPGALWVALPLLSLLAGSLQGKPLQSWGRGSAGGNAHSPLGVPGGGLPEHTFNLKMFLENVKVDFLRSLNLSGVPSQDKTRVEPPQYMIDLYNRYTSDKSTTPASNIVRSFSMEDAISITATEDFPFQKHILLFNISIPRHEQITRAELRLYVSCQNHVDPSHDLKGSVVIYDVLDGTDAWDSATETKTFLVSQDIQDEGWETLEVSSAVKRWVRSDSTKSKNKLEVTVESHRKGCDTLDISVPPGSRNLPFFVVFSNDHSSGTKETRLELREMISHEQESVLKKLSKDGSTEAGESSHEEDTDGHVAAGSTLARRKRSAGAGSHCQKTSLRVNFEDIGWDSWIIAPKEYEAYECKGGCFFPLADDVTPTKHAIVQTLVHLKFPTKVGKACCVPTKLSPISVLYKDDMGVPTLKYHYEGMSVAECGCR,429,NP_057288.1.csv,refseq-GDF2-NM_016204.2_clinical_seed_0_final,refseq-GDF2-NM_016204.2.a2m,Invitae,refseq-GDF2-NM_016204.2.npy,1,429,429
+NP_057303.2,MAACEGRRSGALGSSQSDFLTPPVGGAPWAVATTVVMYPPPPPPPHRDFISVTLSFGENYDNSKSWRRRSCWRKWKQLSRLQRNMILFLLAFLLFCGLLFYINLADHWKALAFRLEEEQKMRPEIAGLKPANPPVLPAPQKADTDPENLPEISSQKTQRHIQRGPPHLQIRPPSQDLKDGTQEEATKRQEAPVDPRPEGDPQRTVISWRGAVIEPEQGTELPSRRAEVPTKPPLPPARTQGTPVHLNYRQKGVIDVFLHAWKGYRKFAWGHDELKPVSRSFSEWFGLGLTLIDALDTMWILGLRKEFEEARKWVSKKLHFEKDVDVNLFESTIRILGGLLSAYHLSGDSLFLRKAEDFGNRLMPAFRTPSKIPYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSRLTGDKKFQEAVEKVTQHIHGLSGKKDGLVPMFINTHSGLFTHLGVFTLGARADSYYEYLLKQWIQGGKQETQLLEDYVEAIEGVRTHLLRHSEPSKLTFVGELAHGRFSAKMDHLVCFLPGTLALGVYHGLPASHMELAQELMETCYQMNRQMETGLSPEIVHFNLYPQPGRRDVEVKPADRHNLLRPETVESLFYLYRVTGDRKYQDWGWEILQSFSRFTRVPSGGYSSINNVQDPQKPEPRDKMESFFLGETLKYLFLLFSDDPNLLSLDAYVFNTEAHPLPIWTPA,699,NP_057303.2.csv,refseq-MAN1B1-NM_016219.4_clinical_seed_0_final,refseq-MAN1B1-NM_016219.4.a2m,Invitae,refseq-MAN1B1-NM_016219.4.npy,1,699,699
+NP_057331.2,MIMFPLFGKISLGILIFVLIEGDFPSLTAQTYLSIEEIQEPKSAVSFLLPEESTDLSLATKKKQPLDRRETERQWLIRRRRSILFPNGVKICPDESVAEAVANHVKYFKVRVCQEAVWEAFRTFWDRLPGREEYHYWMNLCEDGVTSIFEMGTNFSESVEHRSLIMKKLTYAKETVSSSELSSPVPVGDTSTLGDTTLSVPHPEVDAYEGASESSLERPEESISNEIENVIEEATKPAGEQIAEFSIHLLGKQYREELQDSSSFHHQHLEEEFISEVENAFTGLPGYKEIRVLEFRSPKENDSGVDVYYAVTFNGEAISNTTWDLISLHSNKVENHGLVELDDKPTVVYTISNFRDYIAETLQQNFLLGNSSLNPDPDSLQLINVRGVLRHQTEDLVWNTQSSSLQATPSSILDNTFQAAWPSADESITSSIPPLDFSSGPPSATGRELWSESPLGDLVSTHKLAFPSKMGLSSSPEVLEVSSLTLHSVTPAVLQTGLPVASEERTSGSHLVEDGLANVEESEDFLSIDSLPSSSFTQPVPKETIPSMEDSDVSLTSSPYLTSSIPFGLDSLTSKVKDQLKVSPFLPDASMEKELIFDGGLGSGSGQKVDLITWPWSETSSEKSAEPLSKPWLEDDDSLLPAEIEDKKLVLVDKMDSTDQISKHSKYEHDDRSTHFPEEEPLSGPAVPIFADTAAESASLTLPKHISEVPGVDDYSVTKAPLILTSVAISASTDKSDQADAILREDMEQITESSNYEWFDSEVSMVKPDMQTLWTILPESERVWTRTSSLEKLSRDILASTPQSADRLWLSVTQSTKLPPTTISTLLEDEVIMGVQDISLELDRIGTDYYQPEQVQEQNGKVGSYVEMSTSVHSTEMVSVAWPTEGGDDLSYTQTSGALVVFFSLRVTNMMFSEDLFNKNSLEYKALEQRFLELLVPYLQSNLTGFQNLEILNFRNGSIVVNSRMKFANSVPPNVNNAVYMILEDFCTTAYNTMNLAIDKYSLDVESGDEANPCKFQACNEFSECLVNPWSGEAKCRCFPGYLSVEERPCQSLCDLQPDFCLNDGKCDIMPGHGAICRCRVGENWWYRGKHCEEFVSEPVIIGITIASVVGLLVIFSAIIYFFIRTLQAHHDRSERESPFSGSSRQPDSLSSIENAVKYNPVYESHRAGCEKYEGPYPQHPFYSSASGDVIGGLSREEIRQMYESSELSREEIQERMRVLELYANDPEFAAFVREQQVEEV,1241,NP_057331.2.csv,refseq-IMPG2-NM_016247.3_clinical_seed_0_final,refseq-IMPG2-NM_016247.3.a2m,Invitae,refseq-IMPG2-NM_016247.3.npy,1,1241,1241
+NP_057386.2,MAGEGDQQDAAHNMGNHLPLLPAESEEEDEMEVEDQDSKEAKKPNIINFDTSLPTSHTYLGADMEEFHGRTLHDDDSCQVIPVLPQVMMILIPGQTLPLQLFHPQEVSMVRNLIQKDRTFAVLAYSNVQEREAQFGTTAEIYAYREEQDFGIEIVKVKAIGRQRFKVLELRTQSDGIQQAKVQILPECVLPSTMSAVQLESLNKCQIFPSKPVSREDQCSYKWWQKYQKRKFHCANLTSWPRWLYSLYDAETLMDRIKKQLREWDENLKDDSLPSNPIDFSYRVAACLPIDDVLRIQLLKIGSAIQRLRCELDIMNKCTSLCCKQCQETEITTKNEIFSLSLCGPMAAYVNPHGYVHETLTVYKACNLNLIGRPSTEHSWFPGYAWTVAQCKICASHIGWKFTATKKDMSPQKFWGLTRSALLPTIPDTEDEISPDKVILCL,442,NP_057386.2.csv,CRBN_HUMAN_b01_clinical_seed_0_final,CRBN_HUMAN_b01.a2m,EVE,CRBN_HUMAN_b01_theta_0.2.npy,1,442,442
+NP_057390.1,MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQAQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQDRNIPRGSDIIVDLEVTLEEVYAGNFVEVVRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQMTQEVVCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKVVKHPIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLWKKGEGLPNFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY,358,NP_057390.1.csv,refseq-DNAJB11-NM_016306.5_clinical_seed_0_final,refseq-DNAJB11-NM_016306.5.a2m,Invitae,refseq-DNAJB11-NM_016306.5.npy,1,358,358
+NP_057411.1,MAGAEWKSLEECLEKHLPLPDLQEVKRVLYGKELRKLDLPREAFEAASREDFELQGYAFEAAEEQLRRPRIVHVGLVQNRIPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNIICFQEAWTMPFAFCTREKLPWTEFAESAEDGPTTRFCQKLAKNHDMVVVSPILERDSEHGDVLWNTAVVISNSGAVLGKTRKNHIPRVGDFNESTYYMEGNLGHPVFQTQFGRIAVNICYGRHHPLNWLMYSINGAEIIFNPSATIGALSESLWPIEARNAAIANHCFTCAINRVGTEHFPNEFTSGDGKKAHQDFGYFYGSSYVAAPDSSRTPGLSRSRDGLLVAKLDLNLCQQVNDVWNFKMTGRYEMYARELAEAVKSNYSPTIVKE,384,NP_057411.1.csv,refseq-UPB1-NM_016327.2_clinical_seed_0_final,refseq-UPB1-NM_016327.2.a2m,Invitae,refseq-UPB1-NM_016327.2.npy,1,384,384
+NP_057425.3,MTSEEMTASVLIPVTQRKVVSAQSAADESSEKVSDINISKAHTVRRSGETSHTISQLNKLKEEPSGSNLPKILSIAREKIVSDENSNEKCWEKIMPDSAKNLNINCNNILRNHQHGLPQRQFYEMYNSVAEEDLCLETGIPSPLERKVFPGIQLELDRPSMGISPLGNQSVIIETGRAHPDSRRAVFHFHYEVDRRMSDTFCTLSENLILDDCGNCVPLPGGEEKQKKNYVAYTCKLMELAKNCDNKNEQLQCDHCDTLNDKYFCFEGSCEKVDMVYSGDSFCRKDFTDSQAAKTFLSHFEDFPDNCDDVEEDAFKSKKERSTLLVRRFCKNDREVKKSVYTGTRAIVRTLPSGHIGLTAWSYIDQKRNGPLLPCGRVMEPPSTVEIRQDGSQRLSEAQWYPIYNAVRREETENTVGSLLHFLTKLPASETAHGRISVGPCLKQCVRDTVCEYRATLQRTSISQYITGSLLEATTSLGARSGLLSTFGGSTGRMMLKERQPGPSVANSNALPSSSAGISKELIDLQPLIQFPEEVASILMEQEQTIYRRVLPVDYLCFLTRDLGTPECQSSLPCLKASISASILTTQNGEHNALEDLVMRFNEVSSWVTWLILTAGSMEEKREVFSYLVHVAKCCWNMGNYNAVMEFLAGLRSRKVLKMWQFMDQSDIETMRSLKDAMAQHESSCEYRKVVTRALHIPGCKVVPFCGVFLKELCEVLDGASGLMKLCPRYNSQEETLEFVADYSGQDNFLQRVGQNGLKNSEKESTVNSIFQVIRSCNRSLETDEEDSPSEGNSSRKSSLKDKSRWQFIIGDLLDSDNDIFEQSKEYDSHGSEDSQKAFDHGTELIPWYVLSIQADVHQFLLQGATVIHYDQDTHLSARCFLQLQPDNSTLTWVKPTTASPASSKAKLGVLNNTAEPGKFPLLGNAGLSSLTEGVLDLFAVKAVYMGHPGIDIHTVCVQNKLGSMFLSETGVTLLYGLQTTDNRLLHFVAPKHTAKMLFSGLLELTRAVRKMRKFPDQRQQWLRKQYVSLYQEDGRYEGPTLAHAVELFGGRRWSARNPSPGTSAKNAEKPNMQRNNTLGISTTKKKKKILMRGESGEVTDDEMATRKAKMHKECRSRSGSDPQDINEQEESEVNAIANPPNPLPSRRAHSLTTAGSPNLAAGTSSPIRPVSSPVLSSSNKSPSSAWSSSSWHGRIKGGMKGFQSFMVSDSNMSFVEFVELFKSFSVRSRKDLKDLFDVYAVPCNRSGSESAPLYTNLTIDENTSDLQPDLDLLTRNVSDLGLFIKSKQQLSDNQRQISDAIAAASIVTNGTGIESTSLGIFGVGILQLNDFLVNCQGEHCTYDEILSIIQKFEPSISMCHQGLMSFEGFARFLMDKENFASKNDESQENIKELQLPLSYYYIESSHNTYLTGHQLKGESSVELYSQVLLQGCRSVELDCWDGDDGMPIIYHGHTLTTKIPFKEVVEAIDRSAFINSDLPIIISIENHCSLPQQRKMAEIFKTVFGEKLVTKFLFETDFSDDPMLPSPDQLRKKVLLKNKKLKAHQTPVDILKQKAHQLASMQVQAYNGGNANPRPANNEEEEDEEDEYDYDYESLSDDNILEDRPENKSCNDKLQFEYNEEIPKRIKKADNSACNKGKVYDMELGEEFYLDQNKKESRQIAPELSDLVIYCQAVKFPGLSTLNASGSSRGKERKSRKSIFGNNPGRMSPGETASFNKTSGKSSCEGIRQTWEESSSPLNPTTSLSAIIRTPKCYHISSLNENAAKRLCRRYSQKLTQHTACQLLRTYPAATRIDSSNPNPLMFWLHGIQLVALNYQTDDLPLHLNAAMFEANGGCGYVLKPPVLWDKNCPMYQKFSPLERDLDSMDPAVYSLTIVSGQNVCPSNSMGSPCIEVDVLGMPLDSCHFRTKPIHRNTLNPMWNEQFLFHVHFEDLVFLRFAVVENNSSAVTAQRIIPLKALKRGYRHLQLRNLHNEVLEISSLFINSRRMEENSSGNTMSASSMFNTEERKCLQTHRVTVHGVPGPEPFTVFTINGGTKAKQLLQQILTNEQDIKPVTTDYFLMEEKYFISKEKNECRKQPFQRAIGPEEEIMQILSSWFPEEGYMGRIVLKTQQENLEEKNIVQDDKEVILSSEEESFFVQVHDVSPEQPRTVIKAPRVSTAQDVIQQTLCKAKYSYSILSNPNPSDYVLLEEVVKDTTNKKTTTPKSSQRVLLDQECVFQAQSKWKGAGKFILKLKEQVQASREDKKKGISFASELKKLTKSTKQPRGLTSPSQLLTSESIQTKEEKPVGGLSSSDTMDYRQ,2302,NP_057425.3.csv,refseq-PLCE1-NM_016341.3_clinical_seed_0_final,refseq-PLCE1-NM_016341.3.a2m,Invitae,refseq-PLCE1-NM_016341.3_theta_0.2.npy,1,2302,2302
+NP_057444.2,MSAWAAASLSRAAARCLLARGPGVRAAPPRDPRPSHPEPRGCGAAPGRTLHFTAAVPAGHNKWSKVRHIKGPKDVERSRIFSKLCLNIRLAVKEGGPNPEHNSNLANILEVCRSKHMPKSTIETALKMEKSKDTYLLYEGRGPGGSSLLIEALSNSSHKCQADIRHILNKNGGVMAVGARHSFDKKGVIVVEVEDREKKAVNLERALEMAIEAGAEDVKETEDEEERNVFKFICDASSLHQVRKKLDSLGLCSVSCALEFIPNSKVQLAEPDLEQAAHLIQALSNHEDVIHVYDNIE,297,NP_057444.2.csv,refseq-TACO1-NM_016360.3_clinical_seed_0_final,refseq-TACO1-NM_016360.3.a2m,Invitae,refseq-TACO1-NM_016360.3.npy,1,297,297
+NP_057445.4,MITGVFSMRLWTPVGVLTSLAYCLHQRRVALAELQEADGQCPVDRSLLKLKMVQVVFRHGARSPLKPLPLEEQVEWNPQLLEVPPQTQFDYTVTNLAGGPKPYSPYDSQYHETTLKGGMFAGQLTKVGMQQMFALGERLRKNYVEDIPFLSPTFNPQEVFIRSTNIFRNLESTRCLLAGLFQCQKEGPIIIHTDEADSEVLYPNYQSCWSLRQRTRGRRQTASLQPGISEDLKKVKDRMGIDSSDKVDFFILLDNVAAEQAHNLPSCPMLKRFARMIEQRAVDTSLYILPKEDRESLQMAVGPFLHILESNLLKAMDSATAPDKIRKLYLYAAHDVTFIPLLMTLGIFDHKWPPFAVDLTMELYQHLESKEWFVQLYYHGKEQVPRGCPDGLCPLDMFLNAMSVYTLSPEKYHALCSQTQVMEVGNEE,428,NP_057445.4.csv,PPA6_HUMAN_b03_clinical_seed_0_final,PPA6_HUMAN_b03.a2m,EVE,PPA6_HUMAN_b03_theta_0.2.npy,1,428,428
+NP_057457.1,MAALRYAGLDDTDSEDELPPGWEERTTKDGWVYYANHTEEKTQWEHPKTGKRKRVAGDLPYGWEQETDENGQVFFVDHINKRTTYLDPRLAFTVDDNPTKPTTRQRYDGSTTAMEILQGRDFTGKVVVVTGANSGIGFETAKSFALHGAHVILACRNMARASEAVSRILEEWHKAKVEAMTLDLALLRSVQHFAEAFKAKNVPLHVLVCNAATFALPWSLTKDGLETTFQVNHLGHFYLVQLLQDVLCRSAPARVIVVSSESHRFTDINDSLGKLDFSRLSPTKNDYWAMLAYNRSKLCNILFSNELHRRLSPRGVTSNAVHPGNMMYSNIHRSWWVYTLLFTLARPFTKSMQQGAATTVYCAAVPELEGLGGMYFNNCCRCMPSPEAQSEETARTLWALSERLIQERLGSQSG,414,NP_057457.1.csv,refseq-WWOX-NM_016373.3_clinical_seed_0_final,refseq-WWOX-NM_016373.3.a2m,Invitae,refseq-WWOX-NM_016373.3.npy,1,414,414
+NP_057485.2,MFGCLVAGRLVQTAAQQVAEDKFVFDLPDYESINHVVVFMLGTIPFPEGMGGSVYFSYPDSNGMPVWQLLGFVTNGKPSAIFKISGLKSGEGSQHPFGAMNIVRTPSVAQIGISVELLDSMAQQTPVGNAAVSSVDSFTQFTQKMLDNFYNFASSFAVSQAQMTPSPSEMFIPANVVLKWYENFQRRLAQNPLFWKT,197,NP_057485.2.csv,refseq-HIKESHI-NM_016401.3_clinical_seed_0_final,refseq-HIKESHI-NM_016401.3.a2m,Invitae,refseq-HIKESHI-NM_016401.3.npy,1,197,197
+NP_057613.4,MLNGAGLDKALKMSLPRRSRIRSSVGPVRSSLGYKKAEDEMSRATSVGDQLEAPARTIYLNQPHLNKFRDNQISTAKYSVLTFLPRFLYEQIRRAANAFFLFIALLQQIPDVSPTGRYTTLVPLIIILTIAGIKEIVEDFKRHKADNAVNKKKTIVLRNGMWHTIMWKEVAVGDIVKVVNGQYLPADVVLLSSSEPQAMCYVETANLDGETNLKIRQGLSHTADMQTREVLMKLSGTIECEGPNRHLYDFTGNLNLDGKSLVALGPDQILLRGTQLRNTQWVFGIVVYTGHDTKLMQNSTKAPLKRSNVEKVTNVQILVLFGILLVMALVSSAGALYWNRSHGEKNWYIKKMDTTSDNFGYNLLTFIILYNNLIPISLLVTLEVVKYTQALFINWDTDMYYIGNDTPAMARTSNLNEELGQVKYLFSDKTGTLTCNIMNFKKCSIAGVTYGHFPELAREPSSDDFCRMPPPCSDSCDFDDPRLLKNIEDRHPTAPCIQEFLTLLAVCHTVVPEKDGDNIIYQASSPDEAALVKGAKKLGFVFTARTPFSVIIEAMGQEQTFGILNVLEFSSDRKRMSVIVRTPSGRLRLYCKGADNVIFERLSKDSKYMEETLCHLEYFATEGLRTLCVAYADLSENEYEEWLKVYQEASTILKDRAQRLEECYEIIEKNLLLLGATAIEDRLQAGVPETIATLLKAEIKIWVLTGDKQETAINIGYSCRLVSQNMALILLKEDSLDATRAAITQHCTDLGNLLGKENDVALIIDGHTLKYALSFEVRRSFLDLALSCKAVICCRVSPLQKSEIVDVVKKRVKAITLAIGDGANDVGMIQTAHVGVGISGNEGMQATNNSDYAIAQFSYLEKLLLVHGAWSYNRVTKCILYCFYKNVVLYIIELWFAFVNGFSGQILFERWCIGLYNVIFTALPPFTLGIFERSCTQESMLRFPQLYKITQNGEGFNTKVFWGHCINALVHSLILFWFPMKALEHDTVLTSGHATDYLFVGNIVYTYVVVTVCLKAGLETTAWTKFSHLAVWGSMLTWLVFFGIYSTIWPTIPIAPDMRGQATMVLSSAHFWLGLFLVPTACLIEDVAWRAAKHTCKKTLLEEVQELETKSRVLGKAVLRDSNGKRLNERDRLIKRLGRKTPPTLFRGSSLQQGVPHGYAFSQEEHGAVSQEEVIRAYDTTKKKSRKK,1188,NP_057613.4.csv,refseq-ATP8A2-NM_016529.5_clinical_seed_0_final,refseq-ATP8A2-NM_016529.5.a2m,Invitae,refseq-ATP8A2-NM_016529.5.npy,1,1188,1188
+NP_057616.2,MSSRKLSGPKGRRLSIHVVTWNVASAAPPLDLSDLLQLNNRNLNLDIYVIGLQELNSGIISLLSDAAFNDSWSSFLMDVLSPLSFIKVSHVRMQGILLLVFAKYQHLPYIQILSTKSTPTGLFGYWGNKGGVNICLKLYGYYVSIINCHLPPHISNNYQRLEHFDRILEMQNCEGRDIPNILDHDLIIWFGDMNFRIEDFGLHFVRESIKNRCYGGLWEKDQLSIAKKHDPLLREFQEGRLLFPPTYKFDRNSNDYDTSEKKRKPAWTDRILWRLKRQPCAGPDTPIPPASHFSLSLRGYSSHMTYGISDHKPVSGTFDLELKPLVSAPLIVLMPEDLWTVENDMMVSYSSTSDFPSSPWDWIGLYKVGLRDVNDYVSYAWVGDSKVSCSDNLNQVYIDISNIPTTEDEFLLCYYSNSLRSVVGISRPFQIPPGSLREDPLGEAQPQI,448,NP_057616.2.csv,refseq-INPP5K-NM_016532.3_clinical_seed_0_final,refseq-INPP5K-NM_016532.3.a2m,Invitae,refseq-INPP5K-NM_016532.3.npy,1,448,448
+NP_057623.2,MSVNYAAGLSPYADKGKCGLPEIFDPPEELERKVWELARLVWQSSSVVFHTGAGISTASGIPDFRGPHGVWTMEERGLAPKFDTTFESARPTQTHMALVQLERVGLLRFLVSQNVDGLHVRSGFPRDKLAELHGNMFVEECAKCKTQYVRDTVVGTMGLKATGRLCTVAKARGLRACRGELRDTILDWEDSLPDRDLALADEASRNADLSITLGTSLQIRPSGNLPLATKRRGGRLVIVNLQPTKHDRHADLRIHGYVDEVMTRLMKHLGLEIPAWDGPRVLERALPPLPRPPTPKLEPKEESPTRINGSIPAGPKQEPCAQHNGSEPASPKRERPTSPAPHRPPKRVKAKAVPS,355,NP_057623.2.csv,refseq-SIRT6-NM_016539.3_clinical_seed_0_final,refseq-SIRT6-NM_016539.3.a2m,Invitae,refseq-SIRT6-NM_016539.3.npy,1,355,355
+NP_057664.1,MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQAGAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQVLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIVGPDETKHAELIVVKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESSLALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPLDYEKNPAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSFIALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQDQGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVSYRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDANDNAPEVVQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTTIVARDADSGANGEPLYSIRSGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIVVEDQGSPPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEKKDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEAGWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEPQPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGSRSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPDPAWMARLSLPLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSLLEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSSSSSRCL,1184,NP_057664.1.csv,refseq-PCDH12-NM_016580.3_clinical_seed_0_final,refseq-PCDH12-NM_016580.3.a2m,Invitae,refseq-PCDH12-NM_016580.3.npy,1,1184,1184
+NP_057683.1,MLSHNTMMKQRKQQATAIMKEVHGNDVDGMDLGKKVSIPRDIMLEELSHLSNRGARLFKMRQRRSDKYTFENFQYQSRAQINHSIAMQNGKVDGSNLEGGSQQAPLTPPNTPDPRSPPNPDNIAPGYSGPLKEIPPEKFNTTAVPKYYQSPWEQAISNDPELLEALYPKLFKPEGKAELPDYRSFNRVATPFGGFEKASRMVKFKVPDFELLLLTDPRFMSFVNPLSGRRSFNRTPKGWISENIPIVITTEPTDDTTVPESEDL,264,NP_057683.1.csv,refseq-MYOZ2-NM_016599.4_clinical_seed_0_final,refseq-MYOZ2-NM_016599.4.a2m,Invitae,refseq-MYOZ2-NM_016599.4.npy,1,264,264
+NP_057688.3,MADAAASPVGKRLLLLFADTAASASASAPAAAAASGDPGPALRTRAWRAGTVRAMSGAVPQDLAIFVEFDGCNWKQHSWVKVHAEEVIVLLLEGSLVWAPREDPVLLQGIRVSIAQWPALTFTPLVDKLGLGSVVPVEYLLDRELRFLSDANGLHLFQMGTDSQNQILLEHAALRETVNALISDQKLQEIFSRGPYSVQGHRVKIYQPEGEEGWLYGVVSHQDSITRLMEVSVTESGEIKSVDPRLIHVMLMDNSAPQSEGGTLKAVKSSKGKKKRESIEGKDGRRRKSASDSGCDPASKKLKGDRGEVDSNGSDGGEASRGPWKGGNASGEPGLDQRAKQPPSTFVPQINRNIRFATYTKENGRTLVVQDEPVGGDTPASFTPYSTATGQTPLAPEVGGAENKEAGKTLEQVGQGIVASAAVVTTASSTPNTVRISDTGLAAGTVPEKQKGSRSQASGENSRNSILASSGFGAPLPSSSQPLTFGSGRSQSNGVLATENKPLGFSFGCSSAQEAQKDTDLSKNLFFQCMSQTLPTSNYFTTVSESLADDSSSRDSFKQSLESLSSGLCKGRSVLGTDTKPGSKAGSSVDRKVPAESMPTLTPAFPRSLLNARTPENHENLFLQPPKLSREEPSNPFLAFVEKVEHSPFSSFASQASGSSSSATTVTSKVAPSWPESHSSADSASLAKKKPLFITTDSSKLVSGVLGSALTSGGPSLSAMGNGRSSSPTSSLTQPIEMPTLSSSPTEERPTVGPGQQDNPLLKTFSNVFGRHSGGFLSSPADFSQENKAPFEAVKRFSLDERSLACRQDSDSSTNSDLSDLSDSEEQLQAKTGLKGIPEHLMGKLGPNGERSAELLLGKSKGKQAPKGRPRTAPLKVGQSVLKDVSKVKKLKQSGEPFLQDGSCINVAPHLHKCRECRLERYRKFKEQEQDDSTVACRFFHFRRLIFTRKGVLRVEGFLSPQQSDPDAMNLWIPSSSLAEGIDLETSKYILANVGDQFCQLVMSEKEAMMMVEPHQKVAWKRAVRGVREMCDVCETTLFNIHWVCRKCGFGVCLDCYRLRKSRPRSETEEMGDEEVFSWLKCAKGQSHEPENLMPTQIIPGTALYNIGDMVHAARGKWGIKANCPCISRQNKSVLRPAVTNGMSQLPSINPSASSGNETTFSGGGGPAPVTTPEPDHVPKADSTDIRSEEPLKTDSSASNSNSELKAIRPPCPDTAPPSSALHWLADLATQKAKEETKEAGSLRSVLNKESHSPFGLDSFNSTAKVSPLTPKLFNSLLLGPTASNNKTEGSSLRDLLHSGPGKLPQTPLDTGIPFPPVFSTSSAGVKSKASLPNFLDHIIASVVENKKTSDASKRACNLTDTQKEVKEMVMGLNVLDPHTSHSWLCDGRLLCLHDPSNKNNWKIFRECWKQGQPVLVSGVHKKLKSELWKPEAFSQEFGDQDVDLVNCRNCAIISDVKVRDFWDGFEIICKRLRSEDGQPMVLKLKDWPPGEDFRDMMPTRFEDLMENLPLPEYTKRDGRLNLASRLPSYFVRPDLGPKMYNAYGLITAEDRRVGTTNLHLDVSDAVNVMVYVGIPIGEGAHDEEVLKTIDEGDADEVTKQRIHDGKEKPGALWHIYAAKDAEKIRELLRKVGEEQGQENPPDHDPIHDQSWYLDQTLRKRLYEEYGVQGWAIVQFLGDAVFIPAGAPHQVHNLYSCIKVAEDFVSPEHVKHCFRLTQEFRHLSNTHTNHEDKLQVKNIIYHAVKDAVGTLKAHESKLARS,1761,NP_057688.3.csv,refseq-KDM3B-NM_016604.4_clinical_seed_0_final,refseq-KDM3B-NM_016604.4.a2m,Invitae,refseq-KDM3B-NM_016604.4.npy,1,1761,1761
+NP_057732.2,METESGNQEKVMEEESTEKKKEVEKKKRSRVKQVLADIAKQVDFWFGDANLHKDRFLREQIEKSRDGYVDISLLVSFNKMKKLTTDGKLIARALRSSAVVELDLEGTRIRRKKPLGERPKDEDERTVYVELLPKNVNHSWIERVFGKCGNVVYISIPHYKSTGDPKGFAFVEFETKEQAAKAIEFLNNPPEEAPRKPGIFPKTVKNKPIPALRVVEEKKKKKKKKGRMKKEDNIQAKEENMDTSNTSISKMKRSRPTSEGSDIESTEPQKQCSKKKKKRDRVEASSLPEVRTGKRKRSSSEDAESLAPRSKVKKIIQKDIIKEASEASKENRDIEISTEEEKDTGDLKDSSLLKTKRKHKKKHKERHKMGEEVIPLRVLSKSEWMDLKKEYLALQKASMASLKKTISQIKSESEMETDSGVPQNTGMKNEKTANREECRTQEKVNATGPQFVSGVIVKIISTEPLPGRKQVRDTLAAISEVLYVDLLEGDTECHARFKTPEDAQAVINAYTEINKKHCWKLEILSGDHEQRYWQKILVDRQAKLNQPREKKRGTEKLITKAEKIRLAKTQQASKHIRFSEYD,582,NP_057732.2.csv,refseq-LARP7-NM_016648.3_clinical_seed_0_final,refseq-LARP7-NM_016648.3.a2m,Invitae,refseq-LARP7-NM_016648.3.npy,1,582,582
+NP_057953.1,MDLEKNYPTPRTSRTGHGGVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVEKIAEYKRQNPTMFAWEIRDRLLAERVCDNDTVPSVSSINRIIRTKVQQPPNQPVPASSHSIVSTGSVTQVSSVSTDSAGSSYSISGILGITSPSADTNKRKRDEGIQESPVPNGHSLPGRDFLRKQMRGDLFTQQQLEVLDRVFERQHYSDIFTTTEPIKPEQTTEYSAMASLAGGLDDMKANLASPTPADIGSSVPGPQSYPIVTGRDLASTTLPGYPPHVPPAGQGSYSAPTLTGMVPGSEFSGSPYSHPQYSSYNDSWRFPNPGLLGSPYYYSAAARGAAPPAAATAYDRH,391,NP_057953.1.csv,refseq-PAX5-NM_016734.2_clinical_seed_0_final,refseq-PAX5-NM_016734.2.a2m,Invitae,refseq-PAX5-NM_016734.2.npy,1,391,391
+NP_058432.1,MSSDASQGVITTPPPPSMPHKERYFDRINENDPEYIRERNMSPDLRQDFNMMEQRKRVTQILQSPAFREDLECLIQEQMKKGHNPTGLLALQQIADYIMANSFSGFSSPPLSLGMVTPINDLPGADTSSYVKGEKLTRCKLASLYRLVDLFGWAHLANTYISVRISKEQDHIIIIPRGLSFSEATASNLVKVNIIGEVVDQGSTNLKIDHTGFSPHAAIYSTRPDVKCVIHIHTLATAAVSSMKCGILPISQESLLLGDVAYYDYQGSLEEQEERIQLQKVLGPSCKVLVLRNHGVVALGETLEEAFHYIFNVQLACEIQVQALAGAGGVDNLHVLDFQKYKAFTYTVAASGGGGVNMGSHQKWKVGEIEFEGLMRTLDNLGYRTGYAYRHPLIREKPRHKSDVEIPATVTAFSFEDDTVPLSPLKYMAQRQQREKTRWLNSPNTYMKVNVPEESRNGETSPRTKITWMKAEDSSKVSGGTPIKIEDPNQFVPLNTNPNEVLEKRNKIREQNRYDLKTAGPQSQLLAGIVVDKPPSTMQFEDDDHGPPAPPNPFSHLTEGELEEYKRTIERKQQGLEDAEQELLSDDASSVSQIQSQTQSPQNVPEKLEENHELFSKSFISMEVPVMVVNGKDDMHDVEDELAKRVSRLSTSTTIENIEITIKSPEKIEEVLSPEGSPSKSPSKKKKKFRTPSFLKKNKKKEKVEA,706,NP_058432.1.csv,refseq-ADD3-NM_016824.4_clinical_seed_0_final,refseq-ADD3-NM_016824.4.a2m,Invitae,refseq-ADD3-NM_016824.4.npy,1,706,706
+NP_058515.1,MPRGEAPGPGRRGAKDEALGEESGERWSPEFHLQRKLADSSHSEQQDRNRVSEELIMVVQEMKKYFPSERRNKPSTLDALNYALRCVHSVQANSEFFQILSQNGAPQADVSMYSLEELATIASEHTSKNTDTFVAVFSFLSGRLVHISEQAALILNRKKDVLASSHFVDLLAPQDMRVFYAHTARAQLPFWNNWTQRAARYECAPVKPFFCRIRGGEDRKQEKCHSPFRIIPYLIHVHHPAQPELESEPCCLTVVEKIHSGYEAPRIPVNKRIFTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIRFCTQNGDYIILDSSWSSFVNPWSRKISFIIGRHKVRTSPLNEDVFATKIKKMNDNDKDITELQEQIYKLLLQPVHVSVSSGYGSLGSSGSQEQLVSIASSSEASGHRVEETKAEQMTLQQVYASVNKIKNLGQQLYIESMTKSSFKPVTGTRTEPNGGGECKTFTSFHQTLKNNSVYTEPCEDLRNDEHSPSYQQINCIDSVIRYLKSYNIPALKRKCISCTNTTSSSSEEDKQNHKADDVQALQAGLQIPAIPKSEMPTNGRSIDTGGGAPQILSTAMLSLGSGISQCGYSSTIVHVPPPETARDATLFCEPWTLNMQPAPLTSEEFKHVGLTAAVLSAHTQKEEQNYVDKFREKILSSPYSSYLQQESRSKAKYSYFQGDSTSKQTRSAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSESPSRTGSAASGSSDSSIYLTSSVYSSKISQNGQQSQDVQKKETFPNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQKEELAKVYNWIQSQTVTQEIDIQACVTCENEDSADGAATSCGQVLVEDSC,1201,NP_058515.1.csv,refseq-PER3-NM_016831.2_clinical_seed_0_final,refseq-PER3-NM_016831.2.a2m,Invitae,refseq-PER3-NM_016831.2.npy,1,1201,1201
+NP_058628.3,MSMLPTFGFTQEQVACVCEVLQQGGNIERLGRFLWSLPACEHLHKNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHAKLQQLWLKAHYIEAEKLRGRPLGAVGKYRVRRKFPLPRSIWDGEETSYCFKEKSRSVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDRAAEAKERENNENSNSNSHNPLNGSGKSVLGSSEDEKTPSGTPDHSSSSPALLLSPPPPGLPSLHSLGHPPGPSAVPVPVPGGGGADPLQHHHGLQDSILNPMSANLVDLGS,291,NP_058628.3.csv,refseq-SIX2-NM_016932.4_clinical_seed_0_final,refseq-SIX2-NM_016932.4.a2m,Invitae,refseq-SIX2-NM_016932.4.npy,1,291,291
+NP_058633.2,MAPVHGDDSLSDSGSFVSSRARREKKSKKGRQEALERLKKAKAGEKYKYEVEDFTGVYEEVDEEQYSKLVQARQDDDWIVDDDGIGYVEDGREIFDDDLEDDALDADEKGKDGKARNKDKRNVKKLAVTKPNNIKSMFIACAGKKTADKAVDLSKDGLLGDILQDLNTETPQITPPPVMILKKKRSIGASPNPFSVHTATAVPSGKIASPVSRKEPPLTPVPLKRAEFAGDDVQVESTEEEQESGAMEFEDGDFDEPMEVEEVDLEPMAAKAWDKESEPAEEVKQEADSGKGTVSYLGSFLPDVSCWDIDQEGDSSFSVQEVQVDSSHLPLVKGADEEQVFHFYWLDAYEDQYNQPGVVFLFGKVWIESAETHVSCCVMVKNIERTLYFLPREMKIDLNTGKETGTPISMKDVYEEFDEKIATKYKIMKFKSKPVEKNYAFEIPDVPEKSEYLEVKYSAEMPQLPQDLKGETFSHVFGTNTSSLELFLMNRKIKGPCWLEVKSPQLLNQPVSWCKVEAMALKPDLVNVIKDVSPPPLVVMAFSMKTMQNAKNHQNEIIAMAALVHHSFALDKAAPKPPFQSHFCVVSKPKDCIFPYAFKEVIEKKNVKVEVAATERTLLGFFLAKVHKIDPDIIVGHNIYGFELEVLLQRINVCKAPHWSKIGRLKRSNMPKLGGRSGFGERNATCGRMICDVEISAKELIRCKSYHLSELVQQILKTERVVIPMENIQNMYSESSQLLYLLEHTWKDAKFILQIMCELNVLPLALQITNIAGNIMSRTLMGGRSERNEFLLLHAFYENNYIVPDKQIFRKPQQKLGDEDEEIDGDTNKYKKGRKKAAYAGGLVLDPKVGFYDKFILLLDFNSLYPSIIQEFNICFTTVQRVASEAQKVTEDGEQEQIPELPDPSLEMGILPREIRKLVERRKQVKQLMKQQDLNPDLILQYDIRQKALKLTANSMYGCLGFSYSRFYAKPLAALVTYKGREILMHTKEMVQKMNLEVIYGDTDSIMINTNSTNLEEVFKLGNKVKSEVNKLYKLLEIDIDGVFKSLLLLKKKKYAALVVEPTSDGNYVTKQELKGLDIVRRDWCDLAKDTGNFVIGQILSDQSRDTIVENIQKRLIEIGENVLNGSVPVSQFEINKALTKDPQDYPDKKSLPHVHVALWINSQGGRKVKAGDTVSYVICQDGSNLTASQRAYAPEQLQKQDNLTIDTQYYLAQQIHPVVARICEPIDGIDAVLIATWLGLDPTQFRVHHYHKDEENDALLGGPAQLTDEEKYRDCERFKCPCPTCGTENIYDNVFDGSGTDMEPSLYRCSNIDCKASPLTFTVQLSNKLIMDIRRFIKKYYDGWLICEEPTCRNRTRHLPLQFSRTGPLCPACMKATLQPEYSDKSLYTQLCFYRYIFDAECALEKLTTDHEKDKLKKQFFTPKVLQDYRKLKNTAEQFLSRSGYSEVNLSKLFAGCAVKS,1462,NP_058633.2.csv,refseq-POLA1-NM_016937.3_clinical_seed_0_final,refseq-POLA1-NM_016937.3.a2m,Invitae,refseq-POLA1-NM_016937.3.npy,1,1462,1462
+NP_058634.4,MLPCASCLPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPRSAAVINDLHGEGPPPPVPPAQHPNPCPPGYEPDDQDSCVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCINEPGRFSCHCPQGYQLLATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTITSERSVPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF,443,NP_058634.4.csv,refseq-EFEMP2-NM_016938.4_clinical_seed_0_final,refseq-EFEMP2-NM_016938.4.a2m,Invitae,refseq-EFEMP2-NM_016938.4.npy,1,443,443
+NP_058637.1,MVSPRMSGLLSQTVILALIFLPQTRPAGVFELQIHSFGPGPGPGAPRSPCSARLPCRLFFRVCLKPGLSEEAAESPCALGAALSARGPVYTEQPGAPAPDLPLPDGLLQVPFRDAWPGTFSFIIETWREELGDQIGGPAWSLLARVAGRRRLAAGGPWARDIQRAGAWELRFSYRARCEPPAVGTACTRLCRPRSAPSRCGPGLRPCAPLEDECEAPLVCRAGCSPEHGFCEQPGECRCLEGWTGPLCTVPVSTSSCLSPRGPSSATTGCLVPGPGPCDGNPCANGGSCSETPRSFECTCPRGFYGLRCEVSGVTCADGPCFNGGLCVGGADPDSAYICHCPPGFQGSNCEKRVDRCSLQPCRNGGLCLDLGHALRCRCRAGFAGPRCEHDLDDCAGRACANGGTCVEGGGAHRCSCALGFGGRDCRERADPCAARPCAHGGRCYAHFSGLVCACAPGYMGARCEFPVHPDGASALPAAPPGLRPGDPQRYLLPPALGLLVAAGVAGAALLLVHVRRRGHSQDAGSRLLAGTPEPSVHALPDALNNLRTQEGSGDGPSSSVDWNRPEDVDPQGIYVISAPSIYAREVATPLFPPLHTGRAGQRQHLLFPYPSSILSVK,618,NP_058637.1.csv,refseq-DLL3-NM_016941.3_clinical_seed_0_final,refseq-DLL3-NM_016941.3.a2m,Invitae,refseq-DLL3-NM_016941.3.npy,1,618,618
+NP_058648.4,MHPDLGPLCTLLYVTLTILCSSVSSDLAPYFTSEPLSAVQKLGGPVVLHCSAQPVTTRISWLHNGKTLDGNLEHVKIHQGTLTILSLNSSLLGYYQCLANNSIGAIVSGPATVSVAVLGDFGSSTKHVITAEEKSAGFIGCRVPESNPKAEVRYKIRGKWLEHSTENYLILPSGNLQILNVSLEDKGSYKCAAYNPVTHQLKVEPIGRKLLVSRPSSDDVHILHPTHSQALAVLSRSPVTLECVVSGVPAPQVYWLKDGQDIAPGSNWRRLYSHLATDSVDPADSGNYSCMAGNKSGDVKYVTYMVNVLEHASISKGLQDQIVSLGATVHFTCDVHGNPAPNCTWFHNAQPIHPSARHLTAGNGLKISGVTVEDVGMYQCVADNGIGFMHSTGRLEIENDGGFKPVIITAPVSAKVADGDFVTLSCNASGLPVPVIRWYDSHGLITSHPSQVLRSKSRKSQLSRPEGLNLEPVYFVLSQAGASSLHIQAVTQEHAGKYICEAANEHGTTQAEASLMVVPFETNTKAETVTLPDAAQNDDRSKRDGSETGLLSSFPVKVHPSAVESAPEKNASGISVPDAPIILSPPQTHTPDTYNLVWRAGKDGGLPINAYFVKYRKLDDGVGMLGSWHTVRVPGSENELHLAELEPSSLYEVLMVARSAAGEGQPAMLTFRTSKEKTASSKNTQASSPPVGIPKYPVVSEAANNNFGVVLTDSSRHSGVPEAPDRPTISTASETSVYVTWIPRANGGSPITAFKVEYKRMRTSNWLVAAEDIPPSKLSVEVRSLEPGSTYKFRVIAINHYGESFRSSASRPYQVVGFPNRFSSRPITGPHIAYTEAVSDTQIMLKWTYIPSSNNNTPIQGFYIYYRPTDSDNDSDYKRDVVEGSKQWHMIGHLQPETSYDIKMQCFNEGGESEFSNVMICETKVKRVPGASEYPVKDLSTPPNSLGSGGNVGPATSPARSSDMLYLIVGCVLGVMVLILMVFIAMCLWKNRQQNTIQKYDPPGYLYQGSDMNGQMVDYTTLSGASQINGNVHGGFLTNGGLSSGYSHLHHKVPNAVNGIVNGSLNGGLYSGHSNSLTRTHVDFEHPHHLVNGGGMYTAVPQIDPLECVNCRNCRNNNRCFTKTNSTFSSSPPPVVPVVAPYPQDGLEMKPLSHVKVPVCLTSAVPDCGQLPEESVKDNVEPVPTQRTCCQDIVNDVSSDGSEDPAEFSRGDSCAHSETEINIVSWNALILPPVPEGCAEKTMWSPPGIPLDSPTEVLQQPRET,1264,NP_058648.4.csv,refseq-CDON-NM_016952.4_clinical_seed_0_final,refseq-CDON-NM_016952.4.a2m,Invitae,refseq-CDON-NM_016952.4.npy,1,1264,1264
+NP_058651.3,MNRESFAAGERLVSPAYVRQGCEARRSHEHLIRLLLEKGKCPENGWDESTLELFLHELAIMDSNNFLGNCGVGEREGRVASALVARRHYRFIHGIGRSGDISAVQPKAAGSSLLNKITNSLVLDIIKLAGVHTVANCFVVPMATGMSLTLCFLTLRHKRPKAKYIIWPRIDQKSCFKSMITAGFEPVVIENVLEGDELRTDLKAVEAKVQELGPDCILCIHSTTSCFAPRVPDRLEELAVICANYDIPHIVNNAYGVQSSKCMHLIQQGARVGRIDAFVQSLDKNFMVPVGGAIIAGFNDSFIQEISKMYPGRASASPSLDVLITLLSLGSNGYKKLLKERKEMFSYLSNQIKKLSEAYNERLLHTPHNPISLAMTLKTLDEHRDKAVTQLGSMLFTRQVSGARVVPLGSMQTVSGYTFRGFMSHTNNYPCAYLNAASAIGMKMQDVDLFIKRLDRCLKAVRKERSKESDDNYDKTEDVDIEEMALKLDNVLLDTYQDASS,501,NP_058651.3.csv,refseq-SEPSECS-NM_016955.3_clinical_seed_0_final,refseq-SEPSECS-NM_016955.3.a2m,Invitae,refseq-SEPSECS-NM_016955.3.npy,1,501,501
+NP_059129.3,MFPLIGKTIIFDNFPDPSDTWEITETIGKGTYGKVFKVLNKKNGQKAAVKILDPIHDIDEEIEAEYNILKALSDHPNVVRFYGIYFKKDKVNGDKLWLVLELCSGGSVTDLVKGFLKRGERMSEPLIAYILHEALMGLQHLHNNKTIHRDVKGNNILLTTEGGVKLVDFGVSAQLTSTRHRRNTSVGTPFWMAPEVIACEQQLDTTYDARCDTWSLGITAIELGDGDPPLADLHPMRALFKIPRNPPPKLRQPELWSAEFNDFISKCLTKDYEKRPTVSELLQHKFITQIEGKDVMLQKQLTEFIGIHQCMGGTEKARRERIHTKKGNFNRPLISNLKDVDDLATLEILDENTVSEQLEKCYSRDQIYVYVGDILIALNPFQSLGLYSTKHSKLYIGSKRTASPPHIFAMADLGYQSMITYNSDQCIVISGESGAGKTENAHLLVQQLTVLGKANNRTLQEKILQVNNLVEAFGNACTIINDNSSRFGKYLEMKFTSSGAVVGAQISEYLLEKSRVIHQAIGEKNFHIFYYIYAGLAEKKKLAHYKLPENKPPRYLQNDHLRTVQDIMNNSFYKSQYELIEQCFKVIGFTMEQLGSIYSILAAILNVGNIEFSSVATEHQIDKSHISNHTALENCASLLCIRADELQEALTSHCVVTRGETIIRPNTVEKATDVRDAMAKTLYGRLFSWIVNCINSLLKHDSSPSGNGDELSIGILDIFGFENFKKNSFEQLCINIANEQIQYYYNQHVFAWEQNEYLNEDVDARVIEYEDNWPLLDMFLQKPMGLLSLLDEESRFPKATDQTLVEKFEGNLKSQYFWRPKRMELSFGIHHYAGKVLYNASGFLAKNRDTLPTDIVLLLRSSDNSVIRQLVNHPLTKTGNLPHSKTKNVINYQMRTSEKLINLAKGDTGEATRHARETTNMKTQTVASYFRYSLMDLLSKMVVGQPHFVRCIKPNSERQARKYDKEKVLLQLRYTGILETARIRRLGFSHRILFANFIKRYYLLCYKSSEEPRMSPDTCATILEKAGLDNWALGKTKVFLKYYHVEQLNLMRKEAIDKLILIQACVRAFLCSRRYQKIQEKRKESAIIIQSAARGHLVRKQRKEIVDMKNTAVTTIQTSDQEFDYKKNFENTRESFVKKQAENAISANERFISAPNNKGSVSVVKTSTFKPEEETTNAVESNNRVYQTPKKMNNVYEEEVKQEFYLVGPEVSPKQKSVKDLEENSNLRKVEKEEAMIQSYYQRYTEERNCEESKAAYLERKAISERPSYPVPWLAENETSFKKTLEPTLSQRSIYQNANSMEKEKKTSVVTQRAPICSQEEGRGRLRHETVKERQVEPVTQAQEEEDKAAVFIQSKYRGYKRRQQLRKDKMSSFKHQRIVTTPTEVARNTHNLYSYPTKHEEINNIKKKDNKDSKATSEREACGLAIFSKQISKLSEEYFILQKKLNEMILSQQLKSLYLGVSHHKPINRRVSSQQCLSGVCKGEEPKILRPPRRPRKPKTLNNPEDSTYYYLLHKSIQEEKRRPRKDSQGKLLDLEDFYYKEFLPSRSGPKEHSPSLRERRPQQELQNQCIKANERCWAAESPEKEEEREPAANPYDFRRLLRKTSQRRRLVQQS,1616,NP_059129.3.csv,refseq-MYO3A-NM_017433.4_clinical_seed_0_final,refseq-MYO3A-NM_017433.4.a2m,Invitae,refseq-MYO3A-NM_017433.4.npy,1,1616,1616
+NP_059488.2,MALIPDLAMETWLLLAVSLVLLYLYGTHSHGLFKKLGIPGPTPLPFLGNILSYHKGFCMFDMECHKKYGKVWGFYDGQQPVLAITDPDMIKTVLVKECYSVFTNRRPFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKEMVPIIAQYGDVLVRNLRREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDPFFLSITVFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQKHRVDFLQLMIDSQNSKETESHKALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEEIDAVLPNKAPPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGVVVMIPSYALHRDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQIPLKLSLGGLLQPEKPVVLKVESRDGTVSGA,503,NP_059488.2.csv,refseq-CYP3A4-NM_017460.5_clinical_seed_0_final,refseq-CYP3A4-NM_017460.5.a2m,Invitae,refseq-CYP3A4-NM_017460.5_theta_0.2.npy,1,503,503
+NP_060004.3,MSSDSELAVFGEAAPFLRKSERERIEAQNRPFDAKTSVFVAEPKESFVKGTIQSREGGKVTVKTEGGATLTVKDDQVFPMNPPKYDKIEDMAMMTHLHEPAVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYKPEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAVTGEKKKEEITSGKIQGTLEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVVFQLKAERSYHIFYQITSNKKPELIEMLLITTNPYDYPFVSQGEISVASIDDQEELMATDSAIDILGFTNEEKVSIYKLTGAVMHYGNLKFKQKQREEQAEPDGTEVADKAAYLQSLNSADLLKALCYPRVKVGNEYVTKGQTVEQVSNAVGALAKAVYEKMFLWMVARINQQLDTKQPRQYFIGVLDIAGFEIFDFNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDFGMDLAACIELIEKPMGIFSILEEECMFPKATDTSFKNKLYDQHLGKSANFQKPKVVKGKAEAHFALIHYAGVVDYNITGWLEKNKDPLNETVVGLYQKSAMKTLAQLFSGAQTAEGEGAGGGAKKGGKKKGSSFQTVSALFRENLNKLMTNLRSTHPHFVRCIIPNETKTPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQRYKVLNASAIPEGQFIDSKKASEKLLASIDIDHTQYKFGHTKVFFKAGLLGLLEEMRDDKLAQLITRTQARCRGFLARVEYQRMVERREAIFCIQYNIRSFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMKEEFQKIKDELAKSEAKRKELEEKMVTLLKEKNDLQLQVQAEAEGLADAEERCDQLIKTKIQLEAKIKEVTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNLTEEMAGLDETIAKLTKEKKALQEAHQQTLDDLQAEEDKVNTLTKAKIKLEQQVDDLEGSLEQEKKLRMDLERAKRKLEGDLKLAQESIMDIENEKQQLDEKLKKKEFEISNLQSKIEDEQALGIQLQKKIKELQARIEELEEEIEAERASRAKAEKQRSDLSRELEEISERLEEAGGATSAQIEMNKKREAEFQKMRRDLEEATLQHEATAATLRKKHADSVAELGEQIDNLQRVKQKLEKEKSEMKMEIDDLASNVETVSKAKGNLEKMCRTLEDQLSELKSKEEEQQRLINDLTAQRGRLQTESGEFSRQLDEKEALVSQLSRGKQAFTQQIEELKRQLEEEIKAKNALAHALQSSRHDCDLLREQYEEEQESKAELQRALSKANTEVAQWRTKYETDAIQRTEELEEAKKKLAQRLQAAEEHVEAVNAKCASLEKTKQRLQNEVEDLMLDVERTNAACAALDKKQRNFDKILAEWKQKCEETHAELEASQKEARSLGTELFKIKNAYEESLDQLETLKRENKNLQQEISDLTEQIAEGGKRIHELEKIKKQVEQEKCELQAALEEAEASLEHEEGKILRIQLELNQVKSEVDRKIAEKDEEIDQLKRNHIRIVESMQSTLDAEIRSRNDAIRLKKKMEGDLNEMEIQLNHANRMAAEALRNYRNTQGILKDTQIHLDDALRSQEDLKEQLAMVERRANLLQAEIEELRATLEQTERSRKIAEQELLDASERVQLLHTQNTSLINTKKKLETDISQMQGEMEDILQEARNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKNMEQTVKDLQLRLDEAEQLALKGGKKQIQKLEARVRELEGEVESEQKRNAEAVKGLRKHERRVKELTYQTEEDRKNILRLQDLVDKLQAKVKSYKRQAEEAEEQSNTNLAKFRKLQHELEEAEERADIAESQVNKLRVKSREVHTKVISEE,1941,NP_060004.3.csv,refseq-MYH2-NM_017534.5_clinical_seed_0_final,refseq-MYH2-NM_017534.5.a2m,Invitae,refseq-MYH2-NM_017534.5.npy,1,1941,1941
+NP_060011.1,MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE,178,NP_060011.1.csv,refseq-CRYGS-NM_017541.2_clinical_seed_0_final,refseq-CRYGS-NM_017541.2.a2m,Invitae,refseq-CRYGS-NM_017541.2.npy,1,178,178
+NP_060017.1,MIRRVLPHGMGRGLLTRRPGTRRGGFSLDWDGKVSEIKKKIKSILPGRSCDLLQDTSHLPPEHSDVVIVGGGVLGLSVAYWLKKLESRRGAIRVLVVERDHTYSQASTGLSVGGICQQFSLPENIQLSLFSASFLRNINEYLAVVDAPPLDLRFNPSGYLLLASEKDAAAMESNVKVQRQEGAKVSLMSPDQLRNKFPWINTEGVALASYGMEDEGWFDPWCLLQGLRRKVQSLGVLFCQGEVTRFVSSSQRMLTTDDKAVVLKRIHEVHVKMDRSLEYQPVECAIVINAAGAWSAQIAALAGVGEGPPGTLQGTKLPVEPRKRYVYVWHCPQGPGLETPLVADTSGAYFRREGLGSNYLGGRSPTEQEEPDPANLEVDHDFFQDKVWPHLALRVPAFETLKVQSAWAGYYDYNTFDQNGVVGPHPLVVNMYFATGFSGHGLQQAPGIGRAVAEMVLKGRFQTIDLSPFLFTRFYLGEKIQENNII,486,NP_060017.1.csv,refseq-FOXRED1-NM_017547.3_clinical_seed_0_final,refseq-FOXRED1-NM_017547.3.a2m,Invitae,refseq-FOXRED1-NM_017547.3.npy,1,486,486
+NP_060021.1,MEALTLWLLPWICQCVSVRADSIIHIGAIFEENAAKDDRVFQLAVSDLSLNDDILQSEKITYSIKVIEANNPFQAVQEACDLMTQGILALVTSTGCASANALQSLTDAMHIPHLFVQRNPGGSPRTACHLNPSPDGEAYTLASRPPVRLNDVMLRLVTELRWQKFVMFYDSEYDIRGLQSFLDQASRLGLDVSLQKVDKNISHVFTSLFTTMKTEELNRYRDTLRRAILLLSPQGAHSFINEAVETNLASKDSHWVFVNEEISDPEILDLVHSALGRMTVVRQIFPSAKDNQKCTRNNHRISSLLCDPQEGYLQMLQISNLYLYDSVLMLANAFHRKLEDRKWHSMASLNCIRKSTKPWNGGRSMLDTIKKGHITGLTGVMEFREDSSNPYVQFEILGTTYSETFGKDMRKLATWDSEKGLNGSLQERPMGSRLQGLTLKVVTVLEEPFVMVAENILGQPKRYKGFSIDVLDALAKALGFKYEIYQAPDGRYGHQLHNTSWNGMIGELISKRADLAISAITITPERESVVDFSKRYMDYSVGILIKKPEEKISIFSLFAPFDFAVWACIAAAIPVVGVLIFVLNRIQAVRAQSAAQPRPSASATLHSAIWIVYGAFVQQGGESSVNSMAMRIVMGSWWLFTLIVCSSYTANLAAFLTVSRMDNPIRTFQDLSKQVEMSYGTVRDSAVYEYFRAKGTNPLEQDSTFAELWRTISKNGGADNCVSSPSEGIRKAKKGNYAFLWDVAVVEYAALTDDDCSVTVIGNSISSKGYGIALQHGSPYRDLFSQRILELQDTGDLDVLKQKWWPHMGRCDLTSHASAQADGKSLKLHSFAGVFCILAIGLLLACLVAALELWWNSNRCHQETPKEDKEVNLEQVHRRMNSLMDEDIAHKQISPASIELSALEMGGLAPTQTLEPTREYQNTQLSVSTFLPEQSSHGTSRTLSSGPSSNLPLPLSSSATMPSMQCKHRSPNGGLFRQSPVKTPIPMSFQPVPGGVLPEALDTSHGTSI,1009,NP_060021.1.csv,refseq-GRID1-NM_017551.2_clinical_seed_0_final,refseq-GRID1-NM_017551.2.a2m,Invitae,refseq-GRID1-NM_017551.2.npy,1,1009,1009
+NP_060033.3,MAPWLQLCSVFFTVNACLNGSQLAVAAGGSGRARGADTCGWRGVGPASRNSGLYNITFKYDNCTTYLNPVGKHVIADAQNITISQYACHDQVAVTILWSPGALGIEFLKGFRVILEELKSEGRQCQQLILKDPKQLNSSFKRTGMESQPFLNMKFETDYFVKVVPFPSIKNESNYHPFFFRTRACDLLLQPDNLACKPFWKPRNLNISQHGSDMQVSFDHAPHNFGFRFFYLHYKLKHEGPFKRKTCKQEQTTETTSCLLQNVSPGDYIIELVDDTNTTRKVMHYALKPVHSPWAGPIRAVAITVPLVVISAFATLFTVMCRKKQQENIYSHLDEESSESSTYTAALPRERLRPRPKVFLCYSSKDGQNHMNVVQCFAYFLQDFCGCEVALDLWEDFSLCREGQREWVIQKIHESQFIIVVCSKGMKYFVDKKNYKHKGGGRGSGKGELFLVAVSAIAEKLRQAKQSSSAALSKFIAVYFDYSCEGDVPGILDLSTKYRLMDNLPQLCSHLHSRDHGLQEPGQHTRQGSRRNYFRSKSGRSLYVAICNMHQFIDEEPDWFEKQFVPFHPPPLRYREPVLEKFDSGLVLNDVMCKPGPESDFCLKVEAAVLGATGPADSQHESQHGGLDQDGEARPALDGSAALQPLLHTVKAGSPSDMPRDSGIYDSSVPSSELSLPLMEGLSTDQTETSSLTESVSSSSGLGEEEPPALPSKLLSSGSCKADLGCRSYTDELHAVAPL,739,NP_060033.3.csv,refseq-IL17RD-NM_017563.4_clinical_seed_0_final,refseq-IL17RD-NM_017563.4.a2m,Invitae,refseq-IL17RD-NM_017563.4.npy,1,739,739
+NP_060035.2,MPGLRRDRLLTLLLLGALLSADLYFHLWPQVQRQLRPRERPRGCPCTGRASSLARDSAAAASDPGTIVHNFSRTEPRTEPAGGSHSGSSSKLQALFAHPLYNVPEEPPLLGAEDSLLASQEALRYYRRKVARWNRRHKMYREQMNLTSLDPPLQLRLEASWVQFHLGINRHGLYSRSSPVVSKLLQDMRHFPTISADYSQDEKALLGACDCTQIVKPSGVHLKLVLRFSDFGKAMFKPMRQQRDEETPVDFFYFIDFQRHNAEIAAFHLDRILDFRRVPPTVGRIVNVTKEILEVTKNEILQSVFFVSPASNVCFFAKCPYMCKTEYAVCGNPHLLEGSLSAFLPSLNLAPRLSVPNPWIRSYTLAGKEEWEVNPLYCDTVKQIYPYNNSQRLLNVIDMAIFDFLIGNMDRHHYEMFTKFGDDGFLIHLDNARGFGRHSHDEISILSPLSQCCMIKKKTLLHLQLLAQADYRLSDVMRESLLEDQLSPVLTEPHLLALDRRLQTILRTVEGCIVAHGQQSVIVDGPVEQLAPDSGQANLTS,541,NP_060035.2.csv,refseq-FAM20A-NM_017565.3_clinical_seed_0_final,refseq-FAM20A-NM_017565.3.a2m,Invitae,refseq-FAM20A-NM_017565.3.npy,1,541,541
+NP_060083.1,MALSVPGYSPGFRKPPEVVRLRRKRARSRGAAASPPRELTEPAARRAALVAGLPLRPFPAAGGRGGGSGGGPAAARRNPFARLDNRPRVAAEPPDGPAREQPEAPVPFLDSNQENDLLWEEKFPERTTVTELPQTSHVSFSEPDIPSSKSTELPVDWSIKTRLLFTSSQPFTWADHLKAQEEAQGLVQHCRATEVTLPKSIQDPKLSSELRCTFQQSLIYWLHPALSWLPLFPRIGADRKMAGKTSPWSNDATLQHVLMSDWSVSFTSLYNLLKTKLCPYFYVCTYQFTVLFRAAGLAGSDLITALISPTTRGLREAMRNEGIEFSLPLIKESGHKKETASGTSLGYGEEQAISDEDEEESFSWLEEMGVQDKIKKPDILSIKLRKEKHEVQMDHRPESVVLVKGINTFTLLNFLINSKSLVATSGPQAGLPPTLLSPVAFRGATMQMLKARSVNVKTQALSGYRDQFSLEITGPIMPHSLHSLTMLLKSSQSGSFSAVLYPHEPTAVFNICLQMDKVLDMEVVHKELTNCGLHPNTLEQLSQIPLLGKSSLRNVVLRDYIYNWRS,566,NP_060083.1.csv,refseq-DONSON-NM_017613.3_clinical_seed_0_final,refseq-DONSON-NM_017613.3.a2m,Invitae,refseq-DONSON-NM_017613.3_theta_0.2.npy,1,566,566
+NP_060106.2,MVVPEKEQSWIPKIFKKKTCTTFIVDSTDPGGTLCQCGRPRTAHPAVAMEDAFGAAVVTVWDSDAHTTEKPTDAYGELDFTGAGRKHSNFLRLSDRTDPAAVYSLVTRTWGFRAPNLVVSVLGGSGGPVLQTWLQDLLRRGLVRAAQSTGAWIVTGGLHTGIGRHVGVAVRDHQMASTGGTKVVAMGVAPWGVVRNRDTLINPKGSFPARYRWRGDPEDGVQFPLDYNYSAFFLVDDGTHGCLGGENRFRLRLESYISQQKTGVGGTGIDIPVLLLLIDGDEKMLTRIENATQAQLPCLLVAGSGGAADCLAETLEDTLAPGSGGARQGEARDRIRRFFPKGDLEVLQAQVERIMTRKELLTVYSSEDGSEEFETIVLKALVKACGSSEASAYLDELRLAVAWNRVDIAQSELFRGDIQWRSFHLEASLMDALLNDRPEFVRLLISHGLSLGHFLTPMRLAQLYSAAPSNSLIRNLLDQASHSAGTKAPALKGGAAELRPPDVGHVLRMLLGKMCAPRYPSGGAWDPHPGQGFGESMYLLSDKATSPLSLDAGLGQAPWSDLLLWALLLNRAQMAMYFWEMGSNAVSSALGACLLLRVMARLEPDAEEAARRKDLAFKFEGMGVDLFGECYRSSEVRAARLLLRRCPLWGDATCLQLAMQADARAFFAQDGVQSLLTQKWWGDMASTTPIWALVLAFFCPPLIYTRLITFRKSEEEPTREELEFDMDSVINGEGPVGTADPAEKTPLGVPRQSGRPGCCGGRCGGRRCLRRWFHFWGAPVTIFMGNVVSYLLFLLLFSRVLLVDFQPAPPGSLELLLYFWAFTLLCEELRQGLSGGGGSLASGGPGPGHASLSQRLRLYLADSWNQCDLVALTCFLLGVGCRLTPGLYHLGRTVLCIDFMVFTVRLLHIFTVNKQLGPKIVIVSKMMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRVFYRPYLQIFGQIPQEDMDVALMEHSNCSSEPGFWAHPPGAQAGTCVSQYANWLVVLLLVIFLLVANILLVNLLIAMFSYTFGKVQGNSDLYWKAQRYRLIREFHSRPALAPPFIVISHLRLLLRQLCRRPRSPQPSSPALEHFRVYLSKEAERKLLTWESVHKENFLLARARDKRESDSERLKRTSQKVDLALKQLGHIREYEQRLKVLEREVQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD,1214,NP_060106.2.csv,refseq-TRPM4-NM_017636.3_clinical_seed_0_final,refseq-TRPM4-NM_017636.3.a2m,Invitae,refseq-TRPM4-NM_017636.3.npy,1,1214,1214
+NP_060111.2,MLGAPDESSVRVAVRIRPQLAKEKIEGCHICTSVTPGEPQVFLGKDKAFTFDYVFDIDSQQEQIYIQCIEKLIEGCFEGYNATVFAYGQTGAGKTYTMGTGFDVNIVEEELGIISRAVKHLFKSIEEKKHIAIKNGLPAPDFKVNAQFLELYNEEVLDLFDTTRDIDAKSKKSNIRIHEDSTGGIYTVGVTTRTVNTESEMMQCLKLGALSRTTASTQMNVQSSRSHAIFTIHVCQTRVCPQIDADNATDNKIISESAQMNEFETLTAKFHFVDLAGSERLKRTGATGERAKEGISINCGLLALGNVISALGDKSKRATHVPYRDSKLTRLLQDSLGGNSQTIMIACVSPSDRDFMETLNTLKYANRARNIKNKVMVNQDRASQQINALRSEITRLQMELMEYKTGKRIIDEEGVESINDMFHENAMLQTENNNLRVRIKAMQETVDALRSRITQLVSDQANHVLARAGEGNEEISNMIHSYIKEIEDLRAKLLESEAVNENLRKNLTRATARAPYFSGSSTFSPTILSSDKETIEIIDLAKKDLEKLKRKEKRKKKSVAGKEDNTDTDQEKKEEKGVSERENNELEVEESQEVSDHEDEEEEEEEEEDDIDGGESSDESDSESDEKANYQADLANITCEIAIKQKLIDELENSQKRLQTLKKQYEEKLMMLQHKIRDTQLERDQVLQNLGSVESYSEEKAKKVRSEYEKKLQAMNKELQRLQAAQKEHARLLKNQSQYEKQLKKLQQDVMEMKKTKVRLMKQMKEEQEKARLTESRRNREIAQLKKDQRKRDHQLRLLEAQKRNQEVVLRRKTEEVTALRRQVRPMSDKVAGKVTRKLSSSDAPAQDTGSSAAAVETDASRTGAQQKMRIPVARVQALPTPATNGNRKKYQRKGLTGRVFISKTARMKWQLLERRVTDIIMQKMTISNMEADMNRLLKQREELTKRREKLSKRREKIVKENGEGDKNVANINEEMESLTANIDYINDSISDCQANIMQMEEAKEEGETLDVTAVINACTLTEARYLLDHFLSMGINKGLQAAQKEAQIKVLEGRLKQTEITSATQNQLLFHMLKEKAELNPELDALLGHALQDLDSVPLENVEDSTDEDAPLNSPGSEGSTLSSDLMKLCGEVKPKNKARRRTTTQMELLYADSSELASDTSTGDASLPGPLTPVAEGQEIGMNTETSGTSAREKELSPPPGLPSKIGSISRQSSLSEKKIPEPSPVTRRKAYEKAEKSKAKEQKHSDSGTSEASLSPPSSPPSRPRNELNVFNRLTVSQGNTSVQQDKSDESDSSLSEVHRSSRRGIINPFPASKGIRAFPLQCIHIAEGHTKAVLCVDSTDDLLFTGSKDRTCKVWNLVTGQEIMSLGGHPNNVVSVKYCNYTSLVFTVSTSYIKVWDIRDSAKCIRTLTSSGQVTLGDACSASTSRTVAIPSGENQINQIALNPTGTFLYAASGNAVRMWDLKRFQSTGKLTGHLGPVMCLTVDQISSGQDLIITGSKDHYIKMFDVTEGALGTVSPTHNFEPPHYDGIEALTIQGDNLFSGSRDNGIKKWDLTQKDLLQQVPNAHKDWVCALGVVPDHPVLLSGCRGGILKVWNMDTFMPVGEMKGHDSPINAICVNSTHIFTAADDRTVRIWKARNLQDGQISDTGDLGEDIASN,1661,NP_060111.2.csv,refseq-KIF21A-NM_017641.3_clinical_seed_0_final,refseq-KIF21A-NM_017641.3.a2m,Invitae,refseq-KIF21A-NM_017641.3.npy,1,1661,1661
+NP_060116.2,MASVAAARAVPVGSGLRGLQRTLPLVVILGATGTGKSTLALQLGQRLGGEIVSADSMQVYEGLDIITNKVSAQEQRICRHHMISFVDPLVTNYTVVDFRNRATALIEDIFARDKIPIVVGGTNYYIESLLWKVLVNTKPQEMGTEKVIDRKVELEKEDGLVLHKRLSQVDPEMAAKLHPHDKRKVARSLQVFEETGISHSEFLHRQHTEEGGGPLGGPLKFSNPCILWLHADQAVLDERLDKRVDDMLAAGLLEELRDFHRRYNQKNVSENSQDYQHGIFQSIGFKEFHEYLITEGKCTLETSNQLLKKGIEALKQVTKRYARKQNRWVKNRFLSRPGPIVPPVYGLEVSDVSKWEESVLEPALEIVQSFIQGHKPTATPIKMPYNEAENKRSYHLCDLCDRIIIGDREWAAHIKSKSHLNQLKKRRRLDSDAVNTIESQSVSPDHNKEPKEKGSPGQNDQELKCSV,467,NP_060116.2.csv,refseq-TRIT1-NM_017646.5_clinical_seed_0_final,refseq-TRIT1-NM_017646.5.a2m,Invitae,refseq-TRIT1-NM_017646.5.npy,1,467,467
+NP_060119.3,MIGCGACEPKVKMAGGQAAAALPTWKMAARRSLSARGRGILQAAAGRLLPLLLLSCCCGAGGCAAVGENEETVIIGLRLEDTNDVSFMEGGALRVSERTRVKLRVYGQNINNETWSRIAFTEHERRRHSPGERGLGGPAPPEPDSGPQRCGIRTSDIIILPHIILNRRTSGIIEIEIKPLRKMEKSKSYYLCTSLSTPALGAGGSGSTGGAVGGKGGSGVAGLPPPPWAETTWIYHDGEDTKMIVGEEKKFLLPFWLQVIFISLLLCLSGMFSGLNLGLMALDPMELRIVQNCGTEKEKNYAKRIEPVRRQGNYLLCSLLLGNVLVNTTLTILLDDIAGSGLVAVVVSTIGIVIFGEIVPQAICSRHGLAVGANTIFLTKFFMMMTFPASYPVSKLLDCVLGQEIGTVYNREKLLEMLRVTDPYNDLVKEELNIIQGALELRTKTVEDVMTPLRDCFMITGEAILDFNTMSEIMESGYTRIPVFEGERSNIVDLLFVKDLAFVDPDDCTPLKTITKFYNHPLHFVFNDTKLDAMLEEFKKGKSHLAIVQRVNNEGEGDPFYEVLGIVTLEDVIEEIIKSEILDETDLYTDNRTKKKVAHRERKQDFSAFKQTDSEMKVKISPQLLLAMHRFLATEVEAFSPSQMSEKILLRLLKHPNVIQELKYDEKNKKAPEYYLYQRNKPVDYFVLILQGKVEVEAGKEGMKFEASAFSYYGVMALTASPVPLSLSRTFVVSRTELLAAGSPGENKSPPRPCGLNHSDSLSRSDRIDAVTPTLGSSNNQLNSSLLQVYIPDYSVRALSDLQFVKISRQQYQNALMASRMDKTPQSSDSENTKIELTLTELHDGLPDETANLLNEQNCVTHSKANHSLHNEGAI,875,NP_060119.3.csv,refseq-CNNM2-NM_017649.4_clinical_seed_0_final,refseq-CNNM2-NM_017649.4.a2m,Invitae,refseq-CNNM2-NM_017649.4.npy,1,875,875
+NP_060121.3,MPTAESEAKVKTKVRFEELLKTHSDLMREKKKLKKKLVRSEENISPDTIRSNLHYMKETTSDDPDTIRSNLPHIKETTSDDVSAANTNNLKKSTRVTKNKLRNTQLATENPNGDASVEEDKQGKPNKKVIKTVPQLTTQDLKPETPENKVDSTHQKTHTKPQPGVDHQKSEKANEGREETDLEEDEELMQAYQCHVTEEMAKEIKRKIRKKLKEQLTYFPSDTLFHDDKLSSEKRKKKKEVPVFSKAETSTLTISGDTVEGEQKKESSVRSVSSDSHQDDEISSMEQSTEDSMQDDTKPKPKKTKKKTKAVADNNEDVDGDGVHEITSRDSPVYPKCLLDDDLVLGVYIHRTDRLKSDFMISHPMVKIHVVDEHTGQYVKKDDSGRPVSSYYEKENVDYILPIMTQPYDFKQLKSRLPEWEEQIVFNENFPYLLRGSDESPKVILFFEILDFLSVDEIKNNSEVQNQECGFRKIAWAFLKLLGANGNANINSKLRLQLYYPPTKPRSPLSVVEAFEWWSKCPRNHYPSTLYVTVRGLKVPDCIKPSYRSMMALQEEKGKPVHCERHHESSSVDTEPGLEESKEVIKWKRLPGQACRIPNKHLFSLNAGERGCFCLDFSHNGRILAAACASRDGYPIILYEIPSGRFMRELCGHLNIIYDLSWSKDDHYILTSSSDGTARIWKNEINNTNTFRVLPHPSFVYTAKFHPAVRELVVTGCYDSMIRIWKVEMREDSAILVRQFDVHKSFINSLCFDTEGHHMYSGDCTGVIVVWNTYVKINDLEHSVHHWTINKEIKETEFKGIPISYLEIHPNGKRLLIHTKDSTLRIMDLRILVARKFVGAANYREKIHSTLTPCGTFLFAGSEDGIVYVWNPETGEQVAMYSDLPFKSPIRDISYHPFENMVAFCAFGQNEPILLYIYDFHVAQQEAEMFKRYNGTFPLPGIHQSQDALCTCPKLPHQGSFQIDEFVHTESSSTKMQLVKQRLETVTEVIRSCAAKVNKNLSFTSPPAVSSQQSKLKQSNMLTAQEILHQFGFTQTGIISIERKPCNHQVDTAPTVVALYDYTANRSDELTIHRGDIIRVFFKDNEDWWYGSIGKGQEGYFPANHVASETLYQELPPEIKERSPPLSPEEKTKIEKSPAPQKQSINKNKSQDFRLGSESMTHSEMRKEQSHEDQGHIMDTRMRKNKQAGRKVTLIE,1196,NP_060121.3.csv,refseq-AHI1-NM_017651.4_clinical_seed_0_final,refseq-AHI1-NM_017651.4.a2m,Invitae,refseq-AHI1-NM_017651.4.npy,1,1196,1196
+NP_060124.2,MAKQLNLPENTDDWTKEDVNQWLESHKIDQKHREILTEQDVNGAVLKWLKKEHLVDMGITHGPAIQIEELFKELRKTAIEDSIQTSKMGKPSKNAPKDQTVSQKERRETSKQKQKGKENPDMANPSAMSTTAKGSKSLKVELIEDKIDYTKERQPSIDLTCVSYPFDEFSNPYRYKLDFSLQPETGPGNLIDPIHEFKAFTNTATATEEDVKMKFSNEVFRFASACMNSRTNGTIHFGVKDKPHGKIVGIKVTNDTKEALINHFNLMINKYFEDHQVQQAKKCIREPRFVEVLLPNSTLSDRFVIEVDIIPQFSECQYDYFQIKMQNYNNKIWEQSKKFSLFVRDGTSSKDITKNKVDFRAFKADFKTLAESRKAAEEKFRAKTNKKEREGPKLVKLLTGNQDLLDNSYYEQYILVTNKCHPDQTKHLDFLKEIKWFAVLEFDPESNINGVVKAYKESRVANLHFPSVYVEQKTTPNETISTLNLYHQPSWIFCNGRLDLDSEKYKPFDPSSWQRERASDVRKLISFLTHEDIMPRGKFLVVFLLLSSVDDPRDPLIETFCAFYQDLKGMENILCICVHPHIFQGWKDLLEARLIKHQDEISSQCISALSLEEINGTILKLKSVTQSSKRLLPSIGLSTVLLKKEEDIMTALEIICENECEGTLLEKDKNKFLEFKASKEEDFYRGGKVSWWNFYFSSESYSSPFVKRDKYERLEAMIQNCADSSKPTSTKIIHLYHHPGCGGTTLAMHILWELRKKFRCAVLKNKTVDFSEIGEQVTSLITYGAMNRQEYVPVLLLVDDFEEQDNVYLLQYSIQTAIAKKYIRYEKPLVIILNCMRSQNPEKSARIPDSIAVIQQLSPKEQRAFELKLKEIKEQHKNFEDFYSFMIMKTNFNKEYIENVVRNILKGQNIFTKEAKLFSFLALLNSYVPDTTISLSQCEKFLGIGNKKAFWGTEKFEDKMGTYSTILIKTEVIECGNYCGVRIIHSLIAEFSLEELKKSYHLNKSQIMLDMLTENLFFDTGMGKSKFLQDMHTLLLTRHRDEHEGETGNWFSPFIEALHKDEGNEAVEAVLLESIHRFNPNAFICQALARHFYIKKKDFGNALNWAKQAKIIEPDNSYISDTLGQVYKSKIRWWIEENGGNGNISVDDLIALLDLAEHASSAFKESQQQSEDREYEVKERLYPKSKRRYDTYNIAGYQGEIEVGLYTIQILQLIPFFDNKNELSKRYMVNFVSGSSDIPGDPNNEYKLALKNYIPYLTKLKFSLKKSFDFFDEYFVLLKPRNNIKQNEEAKTRRKVAGYFKKYVDIFCLLEESQNNTGLGSKFSEPLQVERCRRNLVALKADKFSGLLEYLIKSQEDAISTMKCIVNEYTFLLEQCTVKIQSKEKLNFILANIILSCIQPTSRLVKPVEKLKDQLREVLQPIGLTYQFSEPYFLASLLFWPENQQLDQHSEQMKEYAQALKNSFKGQYKHMHRTKQPIAYFFLGKGKRLERLVHKGKIDQCFKKTPDINSLWQSGDVWKEEKVQELLLRLQGRAENNCLYIEYGINEKITIPITPAFLGQLRSGRSIEKVSFYLGFSIGGPLAYDIEIV,1589,NP_060124.2.csv,refseq-SAMD9-NM_017654.3_clinical_seed_0_final,refseq-SAMD9-NM_017654.3.a2m,Invitae,refseq-SAMD9-NM_017654.3.npy,1,1589,1589
+NP_060132.3,MKEQPVLERLQSQKSWIKGVFDKRECSTIIPSSKNPHRCTPVCQVCQNLIRCYCGRLIGDHAGIDYSWTISAAKGKESEQWSVEKHTTKSPTDTFGTINFQDGEHTHHAKYIRTSYDTKLDHLLHLMLKEWKMELPKLVISVHGGIQNFTMPSKFKEIFSQGLVKAAETTGAWIITEGINTGVSKHVGDALKSHSSHSLRKIWTVGIPPWGVIENQRDLIGKDVVCLYQTLDNPLSKLTTLNSMHSHFILSDDGTVGKYGNEMKLRRNLEKYLSLQKIHCRSRQGVPVVGLVVEGGPNVILSVWETVKDKDPVVVCEGTGRAADLLAFTHKHLADEGMLRPQVKEEIICMIQNTFNFSLKQSKHLFQILMECMVHRDCITIFDADSEEQQDLDLAILTALLKGTNLSASEQLNLAMAWDRVDIAKKHILIYEQHWKPDALEQAMSDALVMDRVDFVKLLIEYGVNLHRFLTIPRLEELYNTKQGPTNTLLHHLVQDVKQHTLLSGYRITLIDIGLVVEYLIGRAYRSNYTRKHFRALYNNLYRKYKHQRHSSGNRNESAESTLHSQFIRTAQPYKFKEKSIVLHKSRKKSKEQNVSDDPESTGFLYPYNDLLVWAVLMKRQKMAMFFWQHGEEATVKAVIACILYRAMAHEAKESHMVDDASEELKNYSKQFGQLALDLLEKAFKQNERMAMTLLTYELRNWSNSTCLKLAVSGGLRPFVSHTCTQMLLTDMWMGRLKMRKNSWLKIIISIILPPTILTLEFKSKAEMSHVPQSQDFQFMWYYSDQNASSSKESASVKEYDLERGHDEKLDENQHFGLESGHQHLPWTRKVYEFYSAPIVKFWFYTMAYLAFLMLFTYTVLVEMQPQPSVQEWLVSIYIFTNAIEVVREICISEPGKFTQKVKVWISEYWNLTETVAIGLFSAGFVLRWGDPPFHTAGRLIYCIDIIFWFSRLLDFFAVNQHAGPYVTMIAKMTANMFYIVIIMAIVLLSFGVARKAILSPKEPPSWSLARDIVFEPYWMIYGEVYAGEIDVCSSQPSCPPGSFLTPFLQAVYLFVQYIIMVNLLIAFFNNVYLDMESISNNLWKYNRYRYIMTYHEKPWLPPPLILLSHVGLLLRRLCCHRAPHDQEEGDVGLKLYLSKEDLKKLHDFEEQCVEKYFHEKMEDVNCSCEERIRVTSERVTEMYFQLKEMNEKVSFIKDSLLSLDSQVGHLQDLSALTVDTLKVLSAVDTLQEDEALLAKRKHSTCKKLPHSWSNVICAEVLGSMEIAGEKKYQYYSMPSSLLRSLAGGRHPPRVQRGALLEITNSKREATNVRNDQERQETQSSIVVSGVSPNRQAHSKYGQFLLVPSNLKRVPFSAETVLPLSRPSVPDVLATEQDIQTEVLVHLTGQTPVVSDWASVDEPKEKHEPIAHLLDGQDKAEQVLPTLSCTPEPMTMSSPLSQAKIMQTGGGYVNWAFSEGDETGVFSIKKKWQTCLPSTCDSDSSRSEQHQKQAQDSSLSDNSTRSAQSSECSEVGPWLQPNTSFWINPLRRYRPFARSHSFRFHKEEKLMKICKIKNLSGSSEIGQGAWVKAKMLTKDRRLSKKKKNTQGLQVPIITVNACSQSDQLNPEPGENSISEEEYSKNWFTVSKFSHTGVEPYIHQKMKTKEIGQCAIQISDYLKQSQEDLSKNSLWNSRSTNLNRNSLLKSSIGVDKISASLKSPQEPHHHYSAIERNNLMRLSQTIPFTPVQLFAGEEITVYRLEESSPLNLDKSMSSWSQRGRAAMIQVLSREEMDGGLRKAMRVVSTWSEDDILKPGQVFIVKSFLPEVVRTWHKIFQESTVLHLCLREIQQQRAAQKLIYTFNQVKPQTIPYTPRFLEVFLIYCHSANQWLTIEKYMTGEFRKYNNNNGDEITPTNTLEELMLAFSHWTYEYTRGELLVLDLQGVGENLTDPSVIKPEVKQSRGMVFGPANLGEDAIRNFIAKHHCNSCCRKLKLPDLKRNDYSPERINSTFGLEIKIESAEEPPARETGRNSPEDDMQL,2022,NP_060132.3.csv,refseq-TRPM6-NM_017662.4_clinical_seed_0_final,refseq-TRPM6-NM_017662.4.a2m,Invitae,refseq-TRPM6-NM_017662.4.npy,1,2022,2022
+NP_060141.3,MLSSTDFTFASWELVVRVDHPNEEQQKDVTLRVSGDLHVGGVMLKLVEQINISQDWSDFALWWEQKHCWLLKTHWTLDKYGVQADAKLLFTPQHKMLRLRLPNLKMVRLRVSFSAVVFKAVSDICKILNIRRSEELSLLKPSGDYFKKKKKKDKNNKEPIIEDILNLESSPTASGSSVSPGLYSKTMTPIYDPINGTPASSTMTWFSDSPLTEQNCSILAFSQPPQSPEALADMYQPRSLVDKAKLNAGWLDSSRSLMEQGIQEDEQLLLRFKYYSFFDLNPKYDAVRINQLYEQARWAILLEEIDCTEEEMLIFAALQYHISKLSLSAETQDFAGESEVDEIEAALSNLEVTLEGGKADSLLEDITDIPKLADNLKLFRPKKLLPKAFKQYWFIFKDTSIAYFKNKELEQGEPLEKLNLRGCEVVPDVNVAGRKFGIKLLIPVADGMNEMYLRCDHENQYAQWMAACMLASKGKTMADSSYQPEVLNILSFLRMKNRNSASQVASSLENMDMNPECFVSPRCAKRHKSKQLAARILEAHQNVAQMPLVEAKLRFIQAWQSLPEFGLTYYLVRFKGSKKDDILGVSYNRLIKIDAATGIPVTTWRFTNIKQWNVNWETRQVVIEFDQNVFTAFTCLSADCKIVHEYIGGYIFLSTRSKDQNETLDEDLFHKLTGGQD,677,NP_060141.3.csv,refseq-FERMT1-NM_017671.4_clinical_seed_0_final,refseq-FERMT1-NM_017671.4.a2m,Invitae,refseq-FERMT1-NM_017671.4.npy,1,677,677
+NP_060167.2,MTASPDYLVVLFGITAGATGAKLGSDEKELILLFWKVVDLANKKVGQLHEVLVRPDQLELTEDCKEETKIDVESLSSASQLDQALRQFNQSVSNELNIGVGTSFCLCTDGQLHVRQILHPEASKKNVLLPECFYSFFDLRKEFKKCCPGSPDIDKLDVATMTEYLNFEKSSSVSRYGASQVEDMGNIILAMISEPYNHRFSDPERVNYKFESGTCSKMELIDDNTVVRARGLPWQSSDQDIARFFKGLNIAKGGAALCLNAQGRRNGEALVRFVSEEHRDLALQRHKHHMGTRYIEVYKATGEDFLKIAGGTSNEVAQFLSKENQVIVRMRGLPFTATAEEVVAFFGQHCPITGGKEGILFVTYPDGRPTGDAFVLFACEEYAQNALRKHKDLLGKRYIELFRSTAAEVQQVLNRFSSAPLIPLPTPPIIPVLPQQFVPPTNVRDCIRLRGLPYAATIEDILDFLGEFATDIRTHGVHMVLNHQGRPSGDAFIQMKSADRAFMAAQKCHKKNMKDRYVEVFQCSAEEMNFVLMGGTLNRNGLSPPPCKLPCLSPPSYTFPAPAAVIPTEAAIYQPSVILNPRALQPSTAYYPAGTQLFMNYTAYYPSPPGSPNSLGYFPTAANLSGVPPQPGTVVRMQGLAYNTGVKEILNFFQGYQYATEDGLIHTNDQARTLPKEWVCI,681,NP_060167.2.csv,refseq-ESRP1-NM_017697.3_clinical_seed_0_final,refseq-ESRP1-NM_017697.3.a2m,Invitae,refseq-ESRP1-NM_017697.3.npy,1,681,681
+NP_060215.4,MLSATPLYGNVHSWMNSERVRMCGASEDRKILVNDGDASKARLELREENPLNHNVVDASTAHRIDGLAALSMDRTGLIREGLRVPGNIVYSSLCGLGSEKGREAATSTLGGLGFSSERNPEMQFKPNTPETVEASAVSGKPPNGFSAIYKTPPGIQKSAVATAEALGLDRPASDKQSPLNINGASYLRLPWVNPYMEGATPAIYPFLDSPNKYSLNMYKALLPQQSYSLAQPLYSPVCTNGERFLYLPPPHYVGPHIPSSLASPMRLSTPSASPAIPPLVHCADKSLPWKMGVSPGNPVDSHAYPHIQNSKQPRVPSAKAVTSGLPGDTALLLPPSPRPSPRVHLPTQPAADTYSEFHKHYARISTSPSVALSKPYMTVSSEFPAARLSNGKYPKAPEGGEGAQPVPGHARKTAVQDRKDGSSPPLLEKQTVTKDVTDKPLDLSSKVVDVDASKADHMKKMAPTVLVHSRAGSGLVLSGSEIPKETLSPPGNGCAIYRSEIISTAPSSWVVPGPSPNEENNGKSMSLKNKALDWAIPQQRSSSCPRMGGTDAVITNVSGSVSSAGRPASASPAPNANADGTKTSRSSVETTPSVIQHVGQPPATPAKHSSSTSSKGAKASNPEPSFKANENGLPPSSIFLSPNEAFRSPPIPYPRSYLPYPAPEGIAVSPLSLHGKGPVYPHPVLLPNGSLFPGHLAPKPGLPYGLPTGRPEFVTYQDALGLGMVHPMLIPHTPIEITKEEKPERRSRSHERARYEDPTLRNRFSEILETSSTKLHPDVPTDKNLKPNPNWNQGKTVVKSDKLVYVDLLREEPDAKTDTNVSKPSFAAESVGQSAEPPKPSVEPALQQHRDFIALREELGRISDFHETYTFKQPVFTVSKDSVLAGTNKENLGLPVSTPFLEPPLGSDGPAVTFGKTQEDPKPFCVGSAPPSVDVTPTYTKDGADEAESNDGKVLKPKPSKLAKRIANSAGYVGDRFKCVTTELYADSSQLSREQRALQMEGLQEDSILCLPAAYCERAMMRFSELEMKEREGGHPATKDSEMCKFSPADWERLKGNQDKKPKSVTLEEAIAEQNESERCEYSVGNKHRDPFEAPEDKDLPVEKYFVERQPVSEPPADQVASDMPHSPTLRVDRKRKVSGDSSHTETTAEEVPEDPLLKAKRRRVSKGLHPKKQRHLLHLRERWEQQVSAADGKPGRQSRKEVTQATQPEAIPQGTNITEEKPGRKRAEAKGNRSWSEESLKPSDNEQGLPVFSGSPPMKSLSSTSAGGKKQAQPSCAPASRPPAKQQKIKENQKTDVLCADEEEDCQAASLLQKYTDNSEKPSGKRLCKTKHLIPQESRRGLPLTGEYYVENADGKVTVRRFRKRPEPSSDYDLSPAKQEPKPFDRLQQLLPASQSTQLPCSSSPQETTQSRPMPPEARRLIVNKNAGETLLQRAARLGYEEVVLYCLENKICDVNHRDNAGYCALHEACARGWLNIVRHLLEYGADVNCSAQDGTRPLHDAVENDHLEIVRLLLSYGADPTLATYSGRTIMKMTHSELMEKFLTDYLNDLQGRNDDDASGTWDFYGSSVCEPDDESGYDVLANPPGPEDQDDDDDAYSDVFEFEFSETPLLPCYNIQVSVAQGPRNWLLLSDVLKKLKMSSRIFRCNFPNVEIVTIAEAEFYRQVSASLLFSCSKDLEAFNPESKELLDLVEFTNEIQTLLGSSVEWLHPSDLASDNYW,1721,NP_060215.4.csv,refseq-BCOR-NM_017745.5_clinical_seed_0_final,refseq-BCOR-NM_017745.5.a2m,Invitae,refseq-BCOR-NM_017745.5.npy,1,1721,1721
+NP_060221.2,MTTFGAVAEWRLPSLRRATLWIPQWFAKKAIFNSPLEAAMAFPHLQQPSFLLASLKADSINKPFAQQCQDLVKVIEDFPAKELHTIFPWLVESIFGSLDGVLVGWNLRCLQGRVNPVEYSIVMEFLDPGGPMMKLVYKLQAEDYKFDFPVSYLPGPVKASIQECILPDSPLYHNKVQFTPTGGLGLNLALNPFEYYIFFFALSLITQKPLPVSLHVRTSDCAYFILVDRYLSWFLPTEGSVPPPLSSSPGGTSPSPPPRTPAIPFASYGLHHTSLLKRHISHQTSVNADPASHEIWRSETLLQVFVEMWLHHYSLEMYQKMQSPHAKESFTPTEEHVLVVRLLLKHLHAFANSLKPEQASPSAHSHATSPLEEFKRAAVPRFVQQKLYLFLQHCFGHWPLDASFRAVLEMWLSYLQPWRYAPDKQAPGSDSQPRCVSEKWAPFVQENLLMYTKLFVGFLNRALRTDLVSPKHALMVFRVAKVFAQPNLAEMIQKGEQLFLEPELVIPHRQHRLFTAPTFTGSFLSPWPPAVTDASFKVKSHVYSLEGQDCKYTPMFGPEARTLVLRLAQLITQAKHTAKSISDQCAESPAGHSFLSWLGFSSMDTNGSYTANDLDEMGQDSVRKTDEYLEKALEYLRQIFRLSEAQLRQFTLALGTTQDENGKKQLPDCIVGEDGLILTPLGRYQIINGLRRFEIEYQGDPELQPIRSYEIASLVRTLFRLSSAINHRFAGQMAALCSRDDFLGSFCRYHLTEPGLASRHLLSPVGRRQVAGHTRGPRLSLRFLGSYRTLVSLLLAFFVASLFCVGPLPCTLLLTLGYVLYASAMTLLTERGKLHQP,837,NP_060221.2.csv,NP_060221.2_colabfold_clinical_seed_0_final,NP_060221.2_colabfold.a2m,colabfold,NP_060221.2_colabfold_theta_0.2.npy,1,837,837
+NP_060222.2,MWLKPEEVLLKNALKLWLMERSNDYFVLQRRRGYGEEGGGGLTGLLVGTLDSVLDSTAKVAPFRILHQTPDSQVYLSIACGANREEITKHWDWLEQNIMKTLSVFDSNEDITNFVQGKIRGLIAEEGKHCFAKEDDPEKFREALLKFEKCFGLPEKEKLVTYYSCSYWKGRVPCQGWLYLSTNFLSFYSFLLGSEIKLIISWDEVSKLEKTSNVILTESIHVCSQGENHYFSMFLHINQTYLLMEQLANYAIRRLFDKETFDNDPVLYNPLQITKRGLENRAHSEQFNAFFRLPKGESLKEVHECFLWVPFSHFNTHGKMCISENYICFASQDGNQCSVIIPLREVLAIDKTNDSSKSVIISIKGKTAFRFHEVKDFEQLVAKLRLRCGAASTQYHDISTELAISSESTEPSDNFEVQSLTSQRECSKTVNTEALMTVFHPQNLETLNSKMLKEKMKEQSWKILFAECGRGVSMFRTKKTRDLVVRGIPETLRGELWMLFSGAVNDMATNPDYYTEVVEQSLGTCNLATEEIERDLRRSLPEHPAFQSDTGISALRRVLTAYAYRNPKIGYCQAMNILTSVLLLYAKEEEAFWLLVAVCERMLPDYFNRRIIGALVDQAVFEELIRDHLPQLTEHMTDMTFFSSVSLSWFLTLFISVLPIESAVNVVDCFFYDGIKAILQLGLAILDYNLDKLLTCKDDAEAVTALNRFFDNVTNKDSPLPSNVQQGSNVSDEKTSHTRVDITDLIRESNEKYGNIRYEDIHSMRCRNRLYVIQTLEETTKQNVLRVVSQDVKLSLQELDELYVIFKKELFLSCYWCLGCPVLKHHDPSLPYLEQYQIDCQQFRALYHLLSPWAHSANKDSLALWTFRLLDENSDCLINFKEFSSAIDIMYNGSFTEKLKLLFKLHIPPAYTEVKSKDASKGDELSKEELLYFSQLHVSKPANEKEAESAKHSPEKGKGKIDIQAYLSQWQDELFKKEENIKDLPRMNQSQFIQFSKTLYNLFHEDPEEESLYQAIAVVTSLLLRMEEVGRKLHSPTSSAKGFSGTVCGSGGPSEEKTGSHLEKDPCSFREEPQWSFAFEQILASLLNEPALVRFFEKPIDVKAKLENARISQLRSRTKM,1120,NP_060222.2.csv,refseq-TBC1D8B-NM_017752.2_clinical_seed_0_final,refseq-TBC1D8B-NM_017752.2.a2m,Invitae,refseq-TBC1D8B-NM_017752.2.npy,1,1120,1120
+NP_060225.4,MGRRSRGRRLQQQQRPEDAEDGAEGGGKRGEAGWEGGYPEIVKENKLFEHYYQELKIVPEGEWGQFMDALREPLPATLRITGYKSHAKEILHCLKNKYFKELEDLEVDGQKVEVPQPLSWYPEELAWHTNLSRKILRKSPHLEKFHQFLVSETESGNISRQEAVSMIPPLLLNVRPHHKILDMCAAPGSKTTQLIEMLHADMNVPFPEGFVIANDVDNKRCYLLVHQAKRLSSPCIMVVNHDASSIPRLQIDVDGRKEILFYDRILCDVPCSGDGTMRKNIDVWKKWTTLNSLQLHGLQLRIATRGAEQLAEGGRMVYSTCSLNPIEDEAVIASLLEKSEGALELADVSNELPGLKWMPGITQWKVMTKDGQWFTDWDAVPHSRHTQIRPTMFPPKDPEKLQAMHLERCLRILPHHQNTGGFFVAVLVKKSSMPWNKRQPKLQGKSAETRESTQLSPADLTEGKPTDPSKLESPSFTGTGDTEIAHATEDLENNGSKKDGVCGPPPSKKMKLFGFKEDPFVFIPEDDPLFPPIEKFYALDPSFPRMNLLTRTTEGKKRQLYMVSKELRNVLLNNSEKMKVINTGIKVWCRNNSGEEFDCAFRLAQEGIYTLYPFINSRIITVSMEDVKILLTQENPFFRKLSSETYSQAKDLAKGSIVLKYEPDSANPDALQCPIVLCGWRGKASIRTFVPKNERLHYLRMMGLEVLGEKKKEGVILTNESAASTGQPDNDVTEGQRAGEPNSPDAEEANSPDVTAGCDPAGVHPPR,767,NP_060225.4.csv,refseq-NSUN2-NM_017755.5_clinical_seed_0_final,refseq-NSUN2-NM_017755.5.a2m,Invitae,refseq-NSUN2-NM_017755.5.npy,1,767,767
+NP_060247.2,MAETVWSTDTGEAVYRSRDPVRNLRLRVHLQRITSSNFLHYQPAAELGKDLIDLATFRPQPTASGHRPEEDEEEEIVIGWQEKLFSQFEVDLYQNETACQSPLDYQYRQEILKLENSGGKKNRRIFTYTDSDRYTNLEEHCQRMTTAASEVPSFLVERMANVRRRRQDRRGMEGGILKSRIVTWEPSEEFVRNNHVINTPLQTMHIMADLGPYKKLGYKKYEHVLCTLKVDSNGVITVKPDFTGLKGPYRIETEGEKQELWKYTIDNVSPHAQPEEEERERRVFKDLYGRHKEYLSSLVGTDFEMTVPGALRLFVNGEVVSAQGYEYDNLYVHFFVELPTAHWSSPAFQQLSGVTQTCTTKSLAMDKVAHFSYPFTFEAFFLHEDESSDALPEWPVLYCEVLSLDFWQRYRVEGYGAVVLPATPGSHTLTVSTWRPVELGTVAELRRFFIGGSLELEDLSYVRIPGSFKGERLSRFGLRTETTGTVTFRLHCLQQSRAFMESSSLQKRMRSVLDRLEGFSQQSSIHNVLEAFRRARRRMQEARESLPQDLVSPSGTLVS,559,NP_060247.2.csv,refseq-MKS1-NM_017777.3_clinical_seed_0_final,refseq-MKS1-NM_017777.3.a2m,Invitae,refseq-MKS1-NM_017777.3.npy,1,559,559
+NP_060269.3,MSPHGDGRGQAQGRAVRVGLRRSGGIRGGVAVFAAVAAVFTFTLPPSVPGGDSGELITAAHELGVAHPPGYPLFTLVAKLAITLFPFGSIAYRVNLLCGLFGAVAASLLFFTVFRLSGSSAGGILAAGVFSFSRLTWQWSIAAEVFSLNNLFVGLLMALTVHFEEAATAKERSKVAKIGAFCCGLSLCNQHTIILYVLCIIPWILFQLLKKKELSLGSLLKLSLYFSAGLLPYVHLPISSYLNHARWTWGDQTTLQGFLTHFLREEYGTFSLAKSEIGSSMSEILLSQVTNMRTELSFNIQALAVCANICLATKDRQNPSLVWLFTGMFCIYSLFFAWRANLDISKPLFMGVVERFWMQSNAVVAVLAGIGLAAVVSETNRVLNSNGLQCLEWLSATLFVVYQIYSNYSVCDQRTNYVIDKFAKNLLTSMPHDAIILLRGDLPGNSLRYMHYCEGLRPDISLVDQEMMTYEWYLPKMAKHLPGVNFPGNRWNPVEGILPSGMVTFNLYHFLEVNKQKETFVCIGIHEGDPTWKKNYSLWPWGSCDKLVPLEIVFNPEEWIKLTKSIYNWTEEYGRFDPSSWESVANEEMWQARMKTPFFIFNLAETAHMPSKVKAQLYAQAYDLYKEIVYLQKEHPVNWHKNYAIACERMLRLQARDADPEVLLSETIRHFRLYSQKAPNDPQQADILGALKHLRKELQSLRNRKNV,707,NP_060269.3.csv,refseq-TMEM260-NM_017799.3_clinical_seed_0_final,refseq-TMEM260-NM_017799.3.a2m,Invitae,refseq-TMEM260-NM_017799.3.npy,1,707,707
+NP_060272.3,MAALGVAEAVAAPHPAEGAETAEAVELSRALSRLLPGLEADSKPGRRRALEALRRALEEPGPAADPTAFQGPWARLLLPRLLRCLSDPAEGCRALAVHLLDLGLRRAARPRDALPRLLPALAARLAGPVPARRPPEACEELRLALVQLLGLAVDLCGAALAPHLDDALRALRCSLLDPFAAVRRESCSCAAALAQATPDHFHMQSESLIGPLMQTISHQHWKVRVAAIEATGAVIHFGNGKSVDDVLSHFAQRLFDDVPQVRRAVASVVGGWLLCLRDRYSFFHKLIPLLLSSLNDEVPEVRQLAASLWEDVGLQWQKENEEDLKDKLDFAPPTPPHYPPHERRPVLGCRELVFRNLSKILPALCHDITDWVVGTRVKSAQLLPVLLLHAEDHATQHLEVVLRTLFQACTDEEAAVVQSCTRSAELVGTFVSPEVFLKLILSTLKKTPSASGLLVLASAMRGCPREALQPHLAAIATELAQAHICQASENDLYLERLLLCVQALVSVCHEDCGVASLQLLDVLLTIVALAGATGLRDKAQETMDSLAMVEGVSSCQDLYRKHIGPLLERVTASHLDWTAHSPELLQFSVIVAQSGPALGEALPHVVPTLRACLQPSQDPQMRLKLFSILSTVLLRATDTINSQGQFPSYLETVTKDILAPNLQWHAGRTAAAIRTAAVSCLWALTSSEVLSAEQIRDVQETLMPQVLTTLEEDSKMTRLISCRIINTFLKTSGGMTDPEKLIRIYPELLKRLDDVSNDVRMAAASTLVTWLQCVKGANAKSYYQSSVQYLYRELLVHLDDPERAIQDAILEVLKEGSGLFPDLLVRETEAVIHKHRSATYCEQLLQHVQAVPATQ,855,NP_060272.3.csv,refseq-DNAAF5-NM_017802.3_clinical_seed_0_final,refseq-DNAAF5-NM_017802.3.a2m,Invitae,refseq-DNAAF5-NM_017802.3_theta_0.2.npy,1,855,855
+NP_060277.1,MPAVLGFEGSANKIGVGVVRDGKVLANPRRTYVTPPGTGFLPGDTARHHRAVILDLLQEALTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNKPLVGVNHCIGHIEMGRLITGATSPTVLYVSGGNTQVIAYSEHRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQMAKRGKKLVELPYTVKGMDVSFSGILSFIEDVAHRMLATGECTPEDLCFSLQETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATMCQERGARLFATDERFCIDNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVTWRD,335,NP_060277.1.csv,refseq-OSGEP-NM_017807.3_clinical_seed_0_final,refseq-OSGEP-NM_017807.3.a2m,Invitae,refseq-OSGEP-NM_017807.3.npy,1,335,335
+NP_060289.2,MAAFLKMSVSVNFFRPFTRFLVPFTLHRKRNNLTILQRYMSSKIPAVTYPKNESTPPSEELELDKWKTTMKSSVQEECVSTISSSKDEDPLAATREFIEMWRLLGREVPEHITEEELKTLMECVSNTAKKKYLKYLYTKEKVKKARQIKKEMKAAAREEAKNIKLLETTEEDKQKNFLFLRLWDRNMDIAMGWKGAQAMQFGQPLVFDMAYENYMKRKELQNTVSQLLESEGWNRRNVDPFHIYFCNLKIDGALHRELVKRYQEKWDKLLLTSTEKSHVDLFPKDSIIYLTADSPNVMTTFRHDKVYVIGSFVDKSMQPGTSLAKAKRLNLATECLPLDKYLQWEIGNKNLTLDQMIRILLCLKNNGNWQEALQFVPKRKHTGFLEISQHSQEFINRLKKAKT,403,NP_060289.2.csv,refseq-TRMT10C-NM_017819.3_clinical_seed_0_final,refseq-TRMT10C-NM_017819.3.a2m,Invitae,refseq-TRMT10C-NM_017819.3.npy,1,403,403
+NP_060297.1,MAASMARRLWPLLTRRGFRPRGGCISNDSPRRSFTTEKRNRNLLYEYAREGYSALPQLDIERFCACPEEAAHALELRKGELRSADLPAIISTWQELRQLQEQIRSLEEEKAAVTEAVRALLANQDSGEVQQDPKYQGLRARGREIRKELVHLYPREAQLEEQFYLQALKLPNQTHPDVPVGDESQARVLHMVGDKPVFSFQPRGHLEIGEKLDIIRQKRLSHVSGHRSYYLRGAGALLQHGLVNFTFNKLLRRGFTPMTVPDLLRGAVFEGCGMTPNANPSQIYNIDPARFKDLNLAGTAEVGLAGYFMDHTVAFRDLPVRMVCSSTCYRAETNTGQEPRGLYRVHHFTKVEMFGVTGPGLEQSSQLLEEFLSLQMEILTELGLHFRVLDMPTQELGLPAYRKFDIEAWMPGRGRFGEVTSASNCTDFQSRRLHIMFQTEAGELQFAHTVNATACAVPRLLIALLESNQQKDGSVLVPPALQSYLGTDRITAPTHVPLQYIGPNQPRKPGLPGQPAVS,518,NP_060297.1.csv,refseq-SARS2-NM_017827.3_clinical_seed_0_final,refseq-SARS2-NM_017827.3.a2m,Invitae,refseq-SARS2-NM_017827.3.npy,1,518,518
+NP_060307.2,MWPQDPSRKEVLRFAVSCRILTLMLQALFNAIIPDHHAEAFSPPRLAPSGFVDQLVEGLLGGLSHWDAEHFLFIAEHGYLYEHNFAFFPGFPLALLVGTELLRPLRGLLSLRSCLLISVASLNFLFFMLAAVALHDLGCLVLHCPHQSFYAALLFCLSPANVFLAAGYSEALFALLTFSAMGQLERGRVWTSVLLFAFATGVRSNGLVSVGFLMHSQCQGFFSSLTMLNPLRQLFKLMASLFLSVFTLGLPFALFQYYAYTQFCLPGSARPIPEPLVQLAVDKGYRIAEGNEPPWCFWDVPLIYSYIQDVYWNVGFLKYYELKQVPNFLLAAPVAILVAWATWTYVTTHPWLCLTLGLQRSKNNKTLEKPDLGFLSPQVFVYVVHAAVLLLFGGLCMHVQVLTRFLGSSTPIMYWFPAHLLQDQEPLLRSLKTVPWKPLAEDSPPGQKVPRNPIMGLLYHWKTCSPVTRYILGYFLTYWLLGLLLHCNFLPWT,493,NP_060307.2.csv,refseq-PIGV-NM_017837.3_clinical_seed_0_final,refseq-PIGV-NM_017837.3.a2m,Invitae,refseq-PIGV-NM_017837.3.npy,1,493,493
+NP_060311.1,MAVSTVFSTSSLMLALSRHSLLSPLLSVTSFRRFYRGDSPTDSQKDMIEIPLPPWQERTDESIETKRARLLYESRKRGMLENCILLSLFAKEHLQHMTEKQLNLYDRLINEPSNDWDIYYWATEAKPAPEIFENEVMALLRDFAKNKNKEQRLRAPDLEYLFEKPR,166,NP_060311.1.csv,refseq-SDHAF2-NM_017841.2_clinical_seed_0_final,refseq-SDHAF2-NM_017841.2.a2m,Invitae,refseq-SDHAF2-NM_017841.2.npy,1,166,166
+NP_060319.1,MYAPGGAGLPGGRRRRSPGGSALPKQPERSLASALPGALSITALCTALAEPAWLHIHGGTCSRQELGVSDVLGYVHPDLLKDFCMNPQTVLLLRVIAAFCFLGILCSLSAFLLDVFGPKHPALKITRRYAFAHILTVLQCATVIGFSYWASELILAQQQQHKKYHGSQVYVTFAVSFYLVAGAGGASILATAANLLRHYPTEEEEQALELLSEMEENEPYPAEYEVINQFQPPPAYTP,238,NP_060319.1.csv,refseq-TMEM127-NM_017849.3_clinical_seed_0_final,refseq-TMEM127-NM_017849.3.a2m,Invitae,refseq-TMEM127-NM_017849.3.npy,1,238,238
+NP_060336.3,MLFLALGSPWAVELPLCGRRTALCAAAALRGPRASVSRASSSSGPSGPVAGWSTGPSGAARLLRRPGRAQIPVYWEGYVRFLNTPSDKSEDGRLIYTGNMARAVFGVKCFSYSTSLIGLTFLPYIFTQNNAISESVPLPIQIIFYGIMGSFTVITPVLLHFITKGYVIRLYHEATTDTYKAITYNAMLAETSTVFHQNDVKIPDAKHVFTTFYAKTKSLLVNPVLFPNREDYIHLMGYDKEEFILYMEETSEEKRHKDDK,260,NP_060336.3.csv,refseq-TMEM70-NM_017866.5_clinical_seed_0_final,refseq-TMEM70-NM_017866.5.a2m,Invitae,refseq-TMEM70-NM_017866.5.npy,1,260,260
+NP_060342.2,MWGACKVKVHDSLATISITLRRYLRLGATMAKSKFEYVRDFEADDTCLAHCWVVVRLDGRNFHRFAEKHNFAKPNDSRALQLMTKCAQTVMEELEDIVIAYGQSDEYSFVFKRKTNWFKRRASKFMTHVASQFASSYVFYWRDYFEDQPLLYPPGFDGRVVVYPSNQTLKDYLSWRQADCHINNLYNTVFWALIQQSGLTPVQAQGRLQGTLAADKNEILFSEFNINYNNELPMYRKGTVLIWQKVDEVMTKEIKLPTEMEGKKMAVTRTRTKPVPLHCDIIGDAFWKEHPEILDEDS,298,NP_060342.2.csv,refseq-THG1L-NM_017872.4_clinical_seed_0_final,refseq-THG1L-NM_017872.4.a2m,Invitae,refseq-THG1L-NM_017872.4.npy,1,298,298
+NP_060345.2,MIQNSRPSLLQPQDVGDTVETLMLHPVIKAFLCGSISGTCSTLLFQPLDLLKTRLQTLQPSDHGSRRVGMLAVLLKVVRTESLLGLWKGMSPSIVRCVPGVGIYFGTLYSLKQYFLRGHPPTALESVMLGVGSRSVAGVCMSPITVIKTRYESGKYGYESIYAALRSIYHSEGHRGLFSGLTATLLRDAPFSGIYLMFYNQTKNIVPHDQVDATLIPITNFSCGIFAGILASLVTQPADVIKTHMQLYPLKFQWIGQAVTLIFKDYGLRGFFQGGIPRALRRTLMAAMAWTVYEEMMAKMGLKS,304,NP_060345.2.csv,refseq-SLC25A38-NM_017875.2_clinical_seed_0_final,refseq-SLC25A38-NM_017875.2.a2m,Invitae,refseq-SLC25A38-NM_017875.2.npy,1,304,304
+NP_060352.1,MEATRRRQHLGATGGPGAQLGASFLQARHGSVSADEAARTAPFHLDLWFYFTLQNWVLDFGRPIAMLVFPLEWFPLNKPSVGDYFHMAYNVITPFLLLKLIERSPRTLPRSITYVSIIIFIMGASIHLVGDSVNHRLLFSGYQHHLSVRENPIIKNLKPETLIDSFELLYYYDEYLGHCMWYIPFFLILFMYFSGCFTASKAESLIPGPALLLVAPSGLYYWYLVTEGQIFILFIFTFFAMLALVLHQKRKRLFLDSNGLFLFSSFALTLLLVALWVAWLWNDPVLRKKYPGVIYVPEPWAFYTLHVSSRH,311,NP_060352.1.csv,refseq-CLN6-NM_017882.2_clinical_seed_0_final,refseq-CLN6-NM_017882.2.a2m,Invitae,refseq-CLN6-NM_017882.2.npy,1,311,311
+NP_060379.2,MPATLLRAVARSHHILSKAHQCRRIGHLMLKPLKEFENTTCSTLTIRQSLDLFLPDKTASGLNKSQILEMNQKKSDTSMLSPLNAARCQDEKAHLPTMKSFGTHRRVTHKPNLLGSKWFIKILKRHFSSVSTETFVPKQDFPQVKRPLKASRTRQPSRTNLPVLSVNEDLMHCTAFATADEYHLGNLSQDLASHGYVEVTSLPRDAANILVMGVENSAKEGDPGTIFFFREGAAVFWNVKDKTMKHVMKVLEKHEIQPYEIALVHWENEELNYIKIEGQSKLHRGEIKLNSELDLDDAILEKFAFSNALCLSVKLAIWEASLDKFIESIQSIPEALKAGKKVKLSHEEVMQKIGELFALRHRINLSSDFLITPDFYWDRENLEGLYDKTCQFLSIGRRVKVMNEKLQHCMELTDLMRNHLNEKRALRLEWMIVILITIEVMFELGRVFF,449,NP_060379.2.csv,refseq-RMND1-NM_017909.3_clinical_seed_0_final,refseq-RMND1-NM_017909.3.a2m,Invitae,refseq-RMND1-NM_017909.3.npy,1,449,449
+NP_060399.1,MKSDSSTSAAPLRGLGGPLRSSEPVRAVPARAPAVDLLEEAADLLVVHLDFRAALETCERAWQSLANHAVAEEPAGTSLEVKCSLCVVGIQALAEMDRWQEVLSWVLQYYQVPEKLPPKVLELCILLYSKMQEPGAVLDVVGAWLQDPANQNLPEYGALAEFHVQRVLLPLGCLSEAEELVVGSAAFGEERRLDVLQAIHTARQQQKQEHSGSEEAQKPNLEGSVSHKFLSLPMLVRQLWDSAVSHFFSLPFKKSLLAALILCLLVVRFDPASPSSLHFLYKLAQLFRWIRKAAFSRLYQLRIRD,305,NP_060399.1.csv,refseq-PEX26-NM_017929.5_clinical_seed_0_final,refseq-PEX26-NM_017929.5.a2m,Invitae,refseq-PEX26-NM_017929.5.npy,1,305,305
+NP_060404.4,MSCERKGLSELRSELYFLIARFLEDGPCQQAAQVLIREVAEKELLPRRTDWTGKEHPRTYQNLVKYYRHLAPDHLLQICHRLGPLLEQEIPQSVPGVQTLLGAGRQSLLRTNKSCKHVVWKGSALAALHCGRPPESPVNYGSPPSIADTLFSRKLNGKYRLERLVPTAVYQHMKMHKRILGHLSSVYCVTFDRTGRRIFTGSDDCLVKIWATDDGRLLATLRGHAAEISDMAVNYENTMIAAGSCDKMIRVWCLRTCAPLAVLQGHSASITSLQFSPLCSGSKRYLSSTGADGTICFWLWDAGTLKINPRPAKFTERPRPGVQMICSSFSAGGMFLATGSTDHIIRVYFFGSGQPEKISELEFHTDKVDSIQFSNTSNRFVSGSRDGTARIWQFKRREWKSILLDMATRPAGQNLQGIEDKITKMKVTMVAWDRHDNTVITAVNNMTLKVWNSYTGQLIHVLMGHEDEVFVLEPHPFDPRVLFSAGHDGNVIVWDLARGVKIRSYFNMIEGQGHGAVFDCKCSPDGQHFACTDSHGHLLIFGFGSSSKYDKIADQMFFHSDYRPLIRDANNFVLDEQTQQAPHLMPPPFLVDVDGNPHPSRYQRLVPGRENCREEQLIPQMGVTSSGLNQVLSQQANQEISPLDSMIQRLQQEQDLRRSGEAVISNTSRLSRGSISSTSEVHSPPNVGLRRSGQIEGVRQMHSNAPRSEIATERDLVAWSRRVVVPELSAGVASRQEEWRTAKGEEEIKTYRSEEKRKHLTVPKENKIPTVSKNHAHEHFLDLGESKKQQTNQHNYRTRSALEETPRPSEEIENGSSSSDEGEVVAVSGGTSEEEERAWHSDGSSSDYSSDYSDWTADAGINLQPPKKVPKNKTKKAESSSDEEEESEKQKQKQIKKEKKKVNEEKDGPISPKKKKPKERKQKRLAVGELTENGLTLEEWLPSTWITDTIPRRCPFVPQMGDEVYYFRQGHEAYVEMARKNKIYSINPKKQPWHKMELREQELMKIVGIKYEVGLPTLCCLKLAFLDPDTGKLTGGSFTMKYHDMPDVIDFLVLRQQFDDAKYRRWNIGDRFRSVIDDAWWFGTIESQEPLQLEYPDSLFQCYNVCWDNGDTEKMSPWDMELIPNNAVFPEELGTSVPLTDGECRSLIYKPLDGEWGTNPRDEECERIVAGINQLMTLDIASAFVAPVDLQAYPMYCTVVAYPTDLSTIKQRLENRFYRRVSSLMWEVRYIEHNTRTFNEPGSPIVKSAKFVTDLLLHFIKDQTCYNIIPLYNSMKKKVLSDSEDEEKDADVPGTSTRKRKDHQPRRRLRNRAQSYDIQAWKKQCEELLNLIFQCEDSEPFRQPVDLLEYPDYRDIIDTPMDFATVRETLEAGNYESPMELCKDVRLIFSNSKAYTPSKRSRIYSMSLRLSAFFEEHISSVLSDYKSALRFHKRNTITKRRKKRNRSSSVSSSAASSPERKKRILKPQLKSESSTSAFSTPTRSIPPRHNAAQINGKTESSSVVRTRSNRVVVDPVVTEQPSTSSAAKTFITKANASAIPGKTILENSVKHSKALNTLSSPGQSSFSHGTRNNSAKENMEKEKPVKRKMKSSVLPKASTLSKSSAVIEQGDCKNNALVPGTIQVNGHGGQPSKLVKRGPGRKPKVEVNTNSGEIIHKKRGRKPKKLQYAKPEDLEQNNVHPIRDEVLPSSTCNFLSETNNVKEDLLQKKNRGGRKPKRKMKTQKLDADLLVPASVKVLRRSNRKKIDDPIDEEEEFEELKGSEPHMRTRNQGRRTAFYNEDDSEEEQRQLLFEDTSLTFGTSSRGRVRKLTEKAKANLIGW,1821,NP_060404.4.csv,refseq-PHIP-NM_017934.6_clinical_seed_0_final,refseq-PHIP-NM_017934.6.a2m,Invitae,refseq-PHIP-NM_017934.6.npy,1,1821,1821
+NP_060420.2,MAEPGGAAGRSHPEDGSASEGEKEGNNESHMVSPPEKDDGQKGEEAVGSTEHPEEVTTQAEAAIEEGEVETEGEAAVEGEEEAVSYGDAESEEEYYYTETSSPEGQISAADTTYPYFSPPQELPGEEAYDSVSGEAGLQGFQQEATGPPESRERRVTSPEPSHGVLGPSEQMGQVTSGPAVGRLTGSTEEPQGQVLPMGVQHRFRLSHGSDIESSDLEEFVSQEPVIPPGVPDAHPREGDLPVFQDQIQQPSTEEGAMAERVESEGSDEEAEDEGSQLVVLDPDHPLMVRFQAALKNYLNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCAAANEERKKLAALQTEMENLALHLFYMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQLELRRKIRDVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSEIVALQTRLKHLQAVKEGRYVFLFRSKQSLVLERQRLDKRLALIATILDRVRDEYPQFQEALHKVSQMIANKLESPGPS,1142,NP_060420.2.csv,refseq-CCDC40-NM_017950.3_clinical_seed_0_final,refseq-CCDC40-NM_017950.3.a2m,Invitae,refseq-CCDC40-NM_017950.3.npy,1,1142,1142
+NP_060476.2,MQALRHVVCALSGGVDSAVAALLLRRRGYQVTGVFMKNWDSLDEHGVCTADKDCEDAYRVCQILDIPFHQVSYVKEYWNDVFSDFLNEYEKGRTPNPDIVCNKHIKFSCFFHYAVDNLGADAIATGHYARTSLEDEEVFEQKHVKKPEGLFRNRFEVRNAVKLLQAADSFKDQTFFLSQVSQDALRRTIFPLGGLTKEFVKKIAAENRLHHVLQKKESMGMCFIGKRNFEHFLLQYLQPRPGHFISIEDNKVLGTHKGWFLYTLGQRANIGGLREPWYVVEKDSVKGDVFVAPRTDHPALYRDLLRTSRVHWIAEEPPAALVRDKMMECHFRFRHQMALVPCVLTLNQDGTVWVTAVQAVRALATGQFAVFYKGDECLGSGKILRLGPSAYTLQKGQRRAGMATESPSDSPEDGPGLSPLL,421,NP_060476.2.csv,refseq-TRMU-NM_018006.4_clinical_seed_0_final,refseq-TRMU-NM_018006.4.a2m,Invitae,refseq-TRMU-NM_018006.4.npy,1,421,421
+NP_060496.2,MAERGGAGGGPGGAGGGSGQRGSGVAQSPQQPPPQQQQQQPPQQPTPPKLAQATSSSSSTSAAAASSSSSSTSTSMAVAVASGSAPPGGPGPGRTPAPVQMNLYATWEVDRSSSSCVPRLFSLTLKKLVMLKEMDKDLNSVVIAVKLQGSKRILRSNEIVLPASGLVETELQLTFSLQYPHFLKRDANKLQIMLQRRKRYKNRTILGYKTLAVGLINMAEVMQHPNEGALVLGLHSNVKDVSVPVAEIKIYSLSSQPIDHEGIKSKLSDRSPDIDNYSEEEEESFSSEQEGSDDPLHGQDLFYEDEDLRKVKKTRRKLTSTSAITRQPNIKQKFVALLKRFKVSDEVGFGLEHVSREQIREVEEDLDELYDSLEMYNPSDSGPEMEETESILSTPKPKLKPFFEGMSQSSSQTEIGSLNSKGSLGKDTTSPMELAALEKIKSTWIKNQDDSLTETDTLEITDQDMFGDASTSLVVPEKVKTPMKSSKTDLQGSASPSKVEGVHTPRQKRSTPLKERQLSKPLSERTNSSDSERSPDLGHSTQIPRKVVYDQLNQILVSDAALPENVILVNTTDWQGQYVAELLQDQRKPVVCTCSTVEVQAVLSALLTRIQRYCNCNSSMPRPVKVAAVGGQSYLSSILRFFVKSLANKTSDWLGYMRFLIIPLGSHPVAKYLGSVDSKYSSSFLDSGWRDLFSRSEPPVSEQLDVAGRVMQYVNGAATTHQLPVAEAMLTCRHKFPDEDSYQKFIPFIGVVKVGLVEDSPSTAGDGDDSPVVSLTVPSTSPPSSSGLSRDATATPPSSPSMSSALAIVGSPNSPYGDVIGLQVDYWLGHPGERRREGDKRDASSKNTLKSVFRSVQVSRLPHSGEAQLSGTMAMTVVTKEKNKKVPTIFLSKKPREKEVDSKSQVIEGISRLICSAKQQQTMLRVSIDGVEWSDIKFFQLAAQWPTHVKHFPVGLFSGSKAT,963,NP_060496.2.csv,refseq-PACS1-NM_018026.3_clinical_seed_0_final,refseq-PACS1-NM_018026.3.a2m,Invitae,refseq-PACS1-NM_018026.3.npy,1,963,963
+NP_060521.4,MEPGKRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAERKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEETVEIEKEETDLENARADAYTASCEDDFEDYEDDFEVCDGDDDESSNEPESREKLEELPLAQKKEIQEIQRAINAENERIGELSLKLFQKRGRTEFEKEPRTDTNSSPSRASVCGIFVDFASASHRQKSRTQALKQKMRSTKLLRLIDLDFSFTFSLLDLPPVNEYDMYIRNFGKKNTKQAYVQCNEDNVERDIQTEEIETREVWTQHPGESTVVSGGSEQRDTSDAVVMPKIDTPRLCSFLRAACQVMAVLLEEDRLAAEPSWNLRAQDRALYFSDSSSQLNTSLPFLQNRKVSSLHTSRVQRQMVVSVHDLPEKSFVPLLDSKYVLCVWDIWQPSGPQKVLICESQVTCCCLSPLKAFLLFAGTAHGSVVVWDLREDSRLHYSVTLSDGFWTFRTATFSTDGILTSVNHRSPLQAVEPISTSVHKKQSFVLSPFSTQEEMSGLSFHIASLDESGVLNVWVVVELPKADIAGSISDLGLMPGGRVKLVHSALIQLGDSLSHKGNEFWGTTQTLNVKFLPSDPNHFIIGTDMGLISHGTRQDLRVAPKLFKPQQHGIRPVKVNVIDFSPFGEPIFLAGCSDGSIRLHQLSSAFPLLQWDSSTDSHAVTGLQWSPTRPAVFLVQDDTSNIYIWDLLQSDLGPVAKQQVSPNRLVAMAAVGEPEKAGGSFLALVLARASGSIDIQHLKRRWAAPEVDECNRLRLLLQEALWPEGKLHK,1066,NP_060521.4.csv,refseq-WDR60-NM_018051.4_clinical_seed_0_final,refseq-WDR60-NM_018051.4.a2m,Invitae,refseq-WDR60-NM_018051.4.npy,1,1066,1066
+NP_060525.3,MHAHCLPFLLHAWWALLQAGAATVATALLRTRGQPSSPSPLAYMLSLYRDPLPRADIIRSLQAEDVAVDGQNWTFAFDFSFLSQQEDLAWAELRLQLSSPVDLPTEGSLAIEIFHQPKPDTEQASDSCLERFQMDLFTVTLSQVTFSLGSMVLEVTRPLSKWLKHPGALEKQMSRVAGECWPRPPTPPATNVLLMLYSNLSQEQRQLGGSTLLWEAESSWRAQEGQLSWEWGKRHRRHHLPDRSQLCRKVKFQVDFNLIGWGSWIIYPKQYNAYRCEGECPNPVGEEFHPTNHAYIQSLLKRYQPHRVPSTCCAPVKTKPLSMLYVDNGRVLLDHHKDMIVEECGCL,347,NP_060525.3.csv,refseq-NODAL-NM_018055.4_clinical_seed_0_final,refseq-NODAL-NM_018055.4.a2m,Invitae,refseq-NODAL-NM_018055.4.npy,1,347,347
+NP_060530.3,MRWGLRPRGPGAAALATARSLWGTPRLPCSPGWQGATKRLLVRSVSGASNHQPNSNSGRYRDTVLLPQTSFPMKLLGRQQPDTELEIQQKCGFSELYSWQRERKVKTEFCLHDGPPYANGDPHVGHALNKILKDIANRFHMMNGSKIHFVPGWDCHGLPIEIKVLSELGREAQNLSAMEIRKKARSFAKAAIEKQKSAFIRWGIMADWNNCYYTFDGKYEAKQLRTFYQMYDKGLVYRSYKPVFWSPSSRTALAEAELEYNPEHVSRSIYVKFPLLKPSPKLASLIDGSSPVSILVWTTQPWTIPANEAVCYMPESKYAVVKCSKSGDLYVLAADKVASVASTLETTFETISTLSGVDLENGTCSHPLIPDKASPLLPANHVTMAKGTGLVHTAPAHGMEDYGVASQHNLPMDCLVDEDGVFTDVAGPELQNKAVLEEGTDVVIKMLQTAKNLLKEEKLVHSYPYDWRTKKPVVIRASKQWFINITDIKTAAKELLKKVKFIPGSALNGMVEMMDRRPYWCISRQRVWGVPIPVFHHKTKDEYLINSQTTEHIVKLVEQHGSDIWWTLPPEQLLPKEVLSEVGGPDALEYVPGQDILDIWFDSGTSWSYVLPGPDQRADLYLEGKDQLGGWFQSSLLTSVAARKRAPYKTVIVHGFTLGEKGEKMSKSLGNVIHPDVVVNGGQDQSKEPPYGADVLRWWVADSNVFTEVAIGPSVLNAARDDISKLRNTLRFLLGNVADFNPETDSIPVNDMYVIDQYMLHLLQDLANKITELYKQYDFGKVVRLLRTFYTRELSNFYFSIIKDRLYCEKENDPKRRSCQTALVEILDVIVRSFAPILPHLAEEVFQHIPYIKEPKSVFRTGWISTSSIWKKPGLEEAVESACAMRDSFLGSIPGKNAAEYKVITVIEPGLLFEIIEMLQSEETSSTSQLNELMMASESTLLAQEPREMTADVIELKGKFLINLEGGDIREESSYKVIVMPTTKEKCPRCWKYTAESSDTLCPRCAEVVSGK,1012,NP_060530.3.csv,refseq-IARS2-NM_018060.3_clinical_seed_0_final,refseq-IARS2-NM_018060.3.a2m,Invitae,refseq-IARS2-NM_018060.3.npy,1,1012,1012
+NP_060545.3,MKVTLSALDTSESSFTPLVVIELAQDVKEETKEWLKNRIIAKKKDGGAQLLFRPLLNKYEQETLENQNLYLVGASKIRMLLGAEAVGLVKECNDNTMRAFTYRTRQNFKGFDDNNDDFLTMAECQFIIKHELENLRAKDEKMIPGYPQAKLYPGKSLLRRLLTSGIVIQVFPLHDSEALKKLEDTWYTRFALKYQPIDSIRGYFGETIALYFGFLEYFTFALIPMAVIGLPYYLFVWEDYDKYVIFASFNLIWSTVILELWKRGCANMTYRWGTLLMKRKFEEPRPGFHGVLGINSITGKEEPLYPSYKRQLRIYLVSLPFVCLCLYFSLYVMMIYFDMEVWALGLHENSGSEWTSVLLYVPSIIYAIVIEIMNRLYRYAAEFLTSWENHRLESAYQNHLILKVLVFNFLNCFASLFYIAFVLKDMKLLRQSLATLLITSQILNQIMESFLPYWLQRKHGVRVKRKVQALKADIDATLYEQVILEKEMGTYLGTFDDYLELFLQFGYVSLFSCVYPLAAAFAVLNNFTEVNSDALKMCRVFKRPFSEPSANIGVWQLAFETMSVISVVTNCALIGMSPQVNAVFPESKADLILIVVAVEHALLALKFILAFAIPDKPRHIQMKLARLEFESLEALKQQQMKLVTENLKEEPMESGKEKAT,660,NP_060545.3.csv,refseq-ANO10-NM_018075.3_clinical_seed_0_final,refseq-ANO10-NM_018075.3.a2m,Invitae,refseq-ANO10-NM_018075.3.npy,1,660,660
+NP_060546.2,MGVALRKLTQWTAAGHGTGILEITPLNEAILKEIIVFVESFIYKHPQEAKFVFVEPLEWNTSLAPSAFESGYVVSETTVKSEEVDKNGQPLLFLSVPQIKIRSFGQLSRLLLIAKTGKLKEAQACVEANRDPIVKILGSDYNTMKENSIALNILGKITRDDDPESEIKMKIAMLLKQLDLHLLNHSLKHISLEISLSPMTVKKDIELLKRFSGKGNQTVLESIEYTSDYEFSNGCRAPPWRQIRGEICYVLVKPHDGETLCITCSAGGVFLNGGKTDDEGDVNYERKGSIYKNLVTFLREKSPKFSENMSKLGISFSEDQQKEKDQLGKAPKKEEAAALRKDISGSDKRSLEKNQINFWRNQMTKRWEPSLNWKTTVNYKGKGSAKEIQEDKHTGKLEKPRPSVSHGRAQLLRKSAEKIEETVSDSSSESEEDEEPPDHRQEASADLPSEYWQIQKLVKYLKGGNQTATVIALCSMRDFSLAQETCQLAIRDVGGLEVLINLLETDEVKCKIGSLKILKEISHNPQIRQNIVDLGGLPIMVNILDSPHKSLKCLAAETIANVAKFKRARRVVRQHGGITKLVALLDCAHDSTKPAQSSLYEARDVEVARCGALALWSCSKSHTNKEAIRKAGGIPLLARLLKTSHENMLIPVVGTLQECASEENYRAAIKAERIIENLVKNLNSENEQLQEHCAMAIYQCAEDKETRDLVRLHGGLKPLASLLNNTDNKERLAAVTGAIWKCSISKENVTKFREYKAIETLVGLLTDQPEEVLVNVVGALGECCQERENRVIVRKCGGIQPLVNLLVGINQALLVNVTKAVGACAVEPESMMIIDRLDGVRLLWSLLKNPHPDVKASAAWALCPCIKNAKDAGEMVRSFVGGLELIVNLLKSDNKEVLASVCAAITNIAKDQENLAVITDHGVVPLLSKLANTNNNKLRHHLAEAISRCCMWGRNRVAFGEHKAVAPLVRYLKSNDTNVHRATAQALYQLSEDADNCITMHENGAVKLLLDMVGSPDQDLQEAAAGCISNIRRLALATEKARYT,1044,NP_060546.2.csv,refseq-ARMC4-NM_018076.2_clinical_seed_0_final,refseq-ARMC4-NM_018076.2.a2m,Invitae,refseq-ARMC4-NM_018076.2.npy,1,1044,1044
+NP_060547.2,MAGLTLFVGRLPPSARSEQLEELFSQVGPVKQCFVVTEKGSKACRGFGYVTFSMLEDVQRALKEITTFEGCKINVTVAKKKLRNKTKEKGKNENSECPKKEPKAKKAKVADKKARLIIRNLSFKCSEDDLKTVFAQFGAVLEVNIPRKPDGKMRGFGFVQFKNLLEAGKALKGMNMKEIKGRTVAVDWAVAKDKYKDTQSVSAIGEEKSHESKHQESVKKKGREEEDMEEEENDDDDDDDDEEDGVFDDEDEEEENIESKVTKPVQIQKRAVKRPAPAKSSDHSEEDSDLEESDSIDDGEELAQSDTSTEEQEDKAVQVSNKKKRKLPSDVNEGKTVFIRNLSFDSEEEELGELLQQFGELKYVRIVLHPDTEHSKGCAFAQFMTQEAAQKCLLAASPENEAGGLKLDGRQLKVDLAVTRDEAAKLQTTKVKKPTGTRNLYLAREGLIRAGTKAAEGVSAADMAKRERFELLKHQKLKDQNIFVSRTRLCLHNLPKAVDDKQLRKLLLSATSGEKGVRIKECRVMRDLKGVHGNMKGQSLGYAFAEFQEHEHALKALRLINNNPEIFGPLKRPIVEFSLEDRRKLKMKELRIQRSLQKMRSKPATGEPQKGQPEPAKDQQQKAAQHHTEEQSKVPPEQKRKAGSTSWTGFQTKAEVEQVELPDGKKRRKVLALPSHRGPKIRLRDKGKVKPVHPKKPKPQINQWKQEKQQLSSEQVSRKKAKGNKTETRFNQLVEQYKQKLLGPSKGAPLAKRSKWFDS,759,NP_060547.2.csv,refseq-RBM28-NM_018077.2_clinical_seed_0_final,refseq-RBM28-NM_018077.2.a2m,Invitae,refseq-RBM28-NM_018077.2.npy,1,759,759
+NP_060551.2,MKTLETQPLAPDCCPSDQDPAPAHPSPHASPMNKNADSELMPPPPERGDPPRLSPDPVAGSAVSQELREGDPVSLSTPLETEFGSPSELSPRIEEQELSENTSLPAEEANGSLSEEEANGPELGSGKAMEDTSGEPAAEDEGDTAWNYSFSQLPRFLSGSWSEFSTQPENFLKGCKWAPDGSCILTNSADNILRIYNLPPELYHEGEQVEYAEMVPVLRMVEGDTIYDYCWYSLMSSAQPDTSYVASSSRENPIHIWDAFTGELRASFRAYNHLDELTAAHSLCFSPDGSQLFCGFNRTVRVFSTARPGRDCEVRATFAKKQGQSGIISCIAFSPAQPLYACGSYGRSLGLYAWDDGSPLALLGGHQGGITHLCFHPDGNRFFSGARKDAELLCWDLRQSGYPLWSLGREVTTNQRIYFDLDPTGQFLVSGSTSGAVSVWDTDGPGNDGKPEPVLSFLPQKDCTNGVSLHPSLPLLATASGQRVFPEPTESGDEGEELGLPLLSTRHVHLECRLQLWWCGGAPDSSIPDDHQGEKGQGGTEGGVGELI,548,NP_060551.2.csv,refseq-WRAP53-NM_018081.2_clinical_seed_0_final,refseq-WRAP53-NM_018081.2.a2m,Invitae,refseq-WRAP53-NM_018081.2.npy,1,548,548
+NP_060552.4,MDVLAEEFGNLTPEQLAAPIPTVEEKWRLLPAFLKVKGLVKQHIDSFNYFINVEIKKIMKANEKVTSDADPMWYLKYLNIYVGLPDVEESFNVTRPVSPHECRLRDMTYSAPITVDIEYTRGSQRIIRNALPIGRMPIMLRSSNCVLTGKTPAEFAKLNECPLDPGGYFIVKGVEKVILIQEQLSKNRIIVEADRKGAVGASVTSSTHEKKSRTNMAVKQGRFYLRHNTLSEDIPIVIIFKAMGVESDQEIVQMIGTEEHVMAAFGPSLEECQKAQIFTQMQALKYIGNKVRRQRMWGGGPKKTKIEEARELLASTILTHVPVKEFNFRAKCIYTAVMVRRVILAQGDNKVDDRDYYGNKRLELAGQLLSLLFEDLFKKFNSEMKKIADQVIPKQRAAQFDVVKHMRQDQITNGMVNAISTGNWSLKRFKMDRQGVTQVLSRLSYISALGMMTRISSQFEKTRKVSGPRSLQPSQWGMLCPSDTPEGEACGLVKNLALMTHITTDMEDGPIVKLASNLGVEDVNLLCGEELSYPNVFLVFLNGNILGVIRDHKKLVNTFRLMRRAGYINEFVSISTNLTDRCVYISSDGGRLCRPYIIVKKQKPAVTNKHMEELAQGYRNFEDFLHESLVEYLDVNEENDCNIALYEHTINKDTTHLEIEPFTLLGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGTIGYNQRNRIDTLMYLLAYPQKPMVKTKTIELIEFEKLPAGQNATVAVMSYSGYDIEDALVLNKASLDRGFGRCLVYKNAKCTLKRYTNQTFDKVMGPMLDAATRKPIWRHEILDADGICSPGEKVENKQVLVNKSMPTVTQIPLEGSNVPQQPQYKDVPITYKGATDSYIEKVMISSNAEDAFLIKMLLRQTRRPEIGDKFSSRHGQKGVCGLIVPQEDMPFCDSGICPDIIMNPHGFPSRMTVGKLIELLAGKAGVLDGRFHYGTAFGGSKVKDVCEDLVRHGYNYLGKDYVTSGITGEPLEAYIYFGPVYYQKLKHMVLDKMHARARGPRAVLTRQPTEGRSRDGGLRLGEMERDCLIGYGASMLLLERLMISSDAFEVDVCGQCGLLGYSGWCHYCKSSCHVSSLRIPYACKLLFQELQSMNIIPRLKLSKYNE,1133,NP_060552.4.csv,refseq-POLR3B-NM_018082.5_clinical_seed_0_final,refseq-POLR3B-NM_018082.5.a2m,Invitae,refseq-POLR3B-NM_018082.5.npy,1,1133,1133
+NP_060575.1,MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTPDCFKRECNNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQLPPPPLPPPVSQVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA,213,NP_060575.1.csv,refseq-THAP1-NM_018105.3_clinical_seed_0_final,refseq-THAP1-NM_018105.3.a2m,Invitae,refseq-THAP1-NM_018105.3.npy,1,213,213
+NP_060579.3,MAVPGVGLLTRLNLCARRRTRVQRPIVRLLSCPGTVAKDLRRDEQPSGSVETGFEDKIPKRRFSEMQNERREQAQRTVLIHCPEKISENKFLKYLSQFGPINNHFFYESFGLYAVVEFCQKESIGSLQNGTHTPSTAMETAIPFRSRFFNLKLKNQTSERSRVRSSNQLPRSNKQLFELLCYAESIDDQLNTLLKEFQLTEENTKLRYLTCSLIEDMAAAYFPDCIVRPFGSSVNTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQVKNVPSERIATQKILSVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTLDSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQNTETLELLLKEFFEYFGNFAFDKNSINIRQGREQNKPDSSPLYIQNPFETSLNISKNVSQSQLQKFVDLARESAWILQQEDTDRPSISSNRPWGLVSLLLPSAPNRKSFTKKKSNKFAIETVKNLLESLKGNRTENFTKTSGKRTISTQT,582,NP_060579.3.csv,refseq-MTPAP-NM_018109.3_clinical_seed_0_final,refseq-MTPAP-NM_018109.3.a2m,Invitae,refseq-MTPAP-NM_018109.3.npy,1,582,582
+NP_060586.2,MAGGAREVLTLQLGHFAGFVGAHWWNQQDAALGRATDSKEPPGELCPDVLYRTGRTLHGQETYTPRLILMDLKGSLSSLKEEGGLYRDKQLDAAIAWQGKLTTHKEELYPKNPYLQDFLSAEGVLSSDGVWRVKSIPNGKGSSPLPTATTPKPLIPTEASIRVWSDFLRVHLHPRSICMIQKYNHDGEAGRLEAFGQGESVLKEPKYQEELEDRLHFYVEECDYLQGFQILCDLHDGFSGVGAKAAELLQDEYSGRGIITWGLLPGPYHRGEAQRNIYRLLNTAFGLVHLTAHSSLVCPLSLGGSLGLRPEPPVSFPYLHYDATLPFHCSAILATALDTVTVPYRLCSSPVSMVHLADMLSFCGKKVVTAGAIIPFPLAPGQSLPDSLMQFGGATPWTPLSACGEPSGTRCFAQSVVLRGIDRACHTSQLTPGTPPPSALHACTTGEEILAQYLQQQQPGVMSSSHLLLTPCRVAPPYPHLFSSCSPPGMVLDGSPKGAAVESIPVFGALCSSSSLHQTLEALARDLTKLDLRRWASFMDAGVEHDDVAELLQELQSLAQCYQGGDSLVD,570,NP_060586.2.csv,refseq-MSTO1-NM_018116.3_clinical_seed_0_final,refseq-MSTO1-NM_018116.3.a2m,Invitae,refseq-MSTO1-NM_018116.3.npy,1,570,570
+NP_060592.2,MYFPSWLSQLYRGLSRPIRRTTQPIWGSLYRSLLQSSQRRIPEFSSFVVRTNTCGELRSSHLGQEVTLCGWIQYRRQNTFLVLRDFDGLVQVIIPQDESAASVKKILCEAPVESVVQVSGTVISRPAGQENPKMPTGEIEIKVKTAELLNACKKLPFEIKNFVKKTEALRLQYRYLDLRSFQMQYNLRLRSQMVMKMREYLCNLHGFVDIETPTLFKRTPGGAKEFLVPSREPGKFYSLPQSPQQFKQLLMVGGLDRYFQVARCYRDEGSRPDRQPEFTQIDIEMSFVDQTGIQSLIEGLLQYSWPNDKDPVVVPFPTMTFAEVLATYGTDKPDTRFGMKIIDISDVFRNTEIGFLQDALSKPHGTVKAICIPEGAKYLKRKDIESIRNFAADHFNQEILPVFLNANRNWNSPVANFIMESQRLELIRLMETQEEDVVLLTAGEHNKACSLLGKLRLECADLLETRGVVLRDPTLFSFLWVVDFPLFLPKEENPRELESAHHPFTAPHPSDIHLLYTEPKKARSQHYDLVLNGNEIGGGSIRIHNAELQRYILATLLKEDVKMLSHLLQALDYGAPPHGGIALGLDRLICLVTGSPSIRDVIAFPKSFRGHDLMSNTPDSVPPEELKPYHIRVSKPTDSKAERAH,645,NP_060592.2.csv,refseq-DARS2-NM_018122.4_clinical_seed_0_final,refseq-DARS2-NM_018122.4.a2m,Invitae,refseq-DARS2-NM_018122.4.npy,1,645,645
+NP_060595.3,MASSNPPPQPAIGDQLVPGVPGPSSEAEDDPGEAFEFDDSDDEEDTSAALGVPSLAPERDTDPPLIHLDSIPVTDPDPAAAPPGTGVPAWVSNGDAADAAFSGARHSSWKRKSSRRIDRFTFPALEEDVIYDDVPCESPDAHQPGAERNLLYEDAHRAGAPRQAEDLGWSSSEFESYSEDSGEEAKPEVEVEPAKHRVSFQPKLSPDLTRLKERYARTKRDILALRVGGRDMQELKHKYDCKMTQLMKAAKSGTKDGLEKTRMAVMRKVSFLHRKDVLGDSEEEDMGLLEVSVSDIKPPAPELGPMPEGLSPQQVVRRHILGSIVQSEGSYVESLKRILQDYRNPLMEMEPKALSARKCQVVFFRVKEILHCHSMFQIALSSRVAEWDSTEKIGDLFVASFSKSMVLDVYSDYVNNFTSAMSIIKKACLTKPAFLEFLKRRQVCSPDRVTLYGLMVKPIQRFPQFILLLQDMLKNTPRGHPDRLSLQLALTELETLAEKLNEQKRLADQVAEIQQLTKSVSDRSSLNKLLTSGQRQLLLCETLTETVYGDRGQLIKSKERRVFLLNDMLVCANINFKPANHRGQLEISSLVPLGPKYVVKWNTALPQVQVVEVGQDGGTYDKDNVLIQHSGAKKASASGQAQNKVYLGPPRLFQELQDLQKDLAVVEQITLLISTLHGTYQNLNMTVAQDWCLALQRLMRVKEEEIHSANKCRLRLLLPGKPDKSGRPISFMVVFITPNPLSKISWVNRLHLAKIGLREENQPGWLCPDEDKKSKAPFWCPILACCIPAFSSRALSLQLGALVHSPVNCPLLGFSAVSTSLPQGYLWVGGGQEGAGGQVEIFSLNRPSPRTVKSFPLAAPVLCMEYIPELEEEAESRDESPTVADPSATVHPTICLGLQDGSILLYSSVDTGTQCLVSCRSPGLQPVLCLRHSPFHLLAGLQDGTLAAYPRTSGGVLWDLESPPVCLTVGPGPVRTLLSLEDAVWASCGPRVTVLEATTLQPQQSFEAHQDEAVSVTHMVKAGSGVWMAFSSGTSIRLFHTETLEHLQEINIATRTTFLLPGQKHLCVTSLLICQGLLWVGTDQGVIVLLPVPRLEGIPKITGKGMVSLNGHCGPVAFLAVATSILAPDILRSDQEEAEGPRAEEDKPDGQAHEPMPDSHVGRELTRKKGILLQYRLRSTAHLPGPLLSMREPAPADGAALEHSEEDGSIYEMADDPDIWVRSRPCARDAHRKEICSVAIISGGQGYRNFGSALGSSGRQAPCGETDSTLLIWQVPLML,1279,NP_060595.3.csv,ARGAL_HUMAN_b05_clinical_seed_0_final,ARGAL_HUMAN_b05.a2m,EVE,ARGAL_HUMAN_b05_theta_0.2.npy,1,1279,1279
+NP_060597.4,MWALCSLLRSAAGRTMSQGRTISQAPARRERPRKDPLRHLRTREKRGPSGCSGGPNTVYLQVVAAGSRDSGAALYVFSEFNRYLFNCGEGVQRLMQEHKLKVARLDNIFLTRMHWSNVGGLSGMILTLKETGLPKCVLSGPPQLEKYLEAIKIFSGPLKGIELAVRPHSAPEYEDETMTVYQIPIHSEQRRGKHQPWQSPERPLSRLSPERSSDSESNENEPHLPHGVSQRRGVRDSSLVVAFICKLHLKRGNFLVLKAKEMGLPVGTAAIAPIIAAVKDGKSITHEGREILAEELCTPPDPGAAFVVVECPDESFIQPICENATFQRYQGKADAPVALVVHMAPASVLVDSRYQQWMERFGPDTQHLVLNENCASVHNLRSHKIQTQLNLIHPDIFPLLTSFRCKKEGPTLSVPMVQGECLLKYQLRPRREWQRDAIITCNPEEFIVEALQLPNFQQSVQEYRRSAQDGPAPAEKRSQYPEIIFLGTGSAIPMKIRNVSATLVNISPDTSLLLDCGEGTFGQLCRHYGDQVDRVLGTLAAVFVSHLHADHHTGLPSILLQRERALASLGKPLHPLLVVAPNQLKAWLQQYHNQCQEVLHHISMIPAKCLQEGAEISSPAVERLISSLLRTCDLEEFQTCLVRHCKHAFGCALVHTSGWKVVYSGDTMPCEALVRMGKDATLLIHEATLEDGLEEEAVEKTHSTTSQAISVGMRMNAEFIMLNHFSQRYAKVPLFSPNFSEKVGVAFDHMKVCFGDFPTMPKLIPPLKALFAGDIEEMEERREKRELRQVRAALLSRELAGGLEDGEPQQKRAHTEEPQAKKVRAQ,826,NP_060597.4.csv,refseq-ELAC2-NM_018127.6_clinical_seed_0_final,refseq-ELAC2-NM_018127.6.a2m,Invitae,refseq-ELAC2-NM_018127.6.npy,1,826,826
+NP_060599.1,MTCWLRGVTATFGRPAEWPGYLSHLCGRSAAMDLGPMRKSYRGDREAFEETHLTSLDPVKQFAAWFEEAVQCPDIGEANAMCLATCTRDGKPSARMLLLKGFGKDGFRFFTNFESRKGKELDSNPFASLVFYWEPLNRQVRVEGPVKKLPEEEAECYFHSRPKSSQIGAVVSHQSSVIPDREYLRKKNEELEQLYQDQEVPKPKSWGGYVLYPQVMEFWQGQTNRLHDRIVFRRGLPTGDSPLGPMTHRGEEDWLYERLAP,261,NP_060599.1.csv,refseq-PNPO-NM_018129.3_clinical_seed_0_final,refseq-PNPO-NM_018129.3.a2m,Invitae,refseq-PNPO-NM_018129.3.npy,1,261,261
+NP_060631.2,MGRKVTVATCALNQWALDFEGNLQRILKSIEIAKNRGARYRLGPELEICGYGCWDHYYESDTLLHSFQVLAALVESPVTQDIICDVGMPVMHRNVRYNCRVIFLNRKILLIRPKMALANEGNYRELRWFTPWSRSRHTEEYFLPRMIQDLTKQETVPFGDAVLVTWDTCIGSEICEELWTPHSPHIDMGLDGVEIITNASGSHQVLRKANTRVDLVTMVTSKNGGIYLLANQKGCDGDRLYYDGCAMIAMNGSVFAQGSQFSLDDVEVLTATLDLEDVRSYRAEISSRNLAASRASPYPRVKVDFALSCHEDLLAPISEPIEWKYHSPEEEISLGPACWLWDFLRRSQQAGFLLPLSGGVDSAATACLIYSMCCQVCEAVRSGNEEVLADVRTIVNQISYTPQDPRDLCGRILTTCYMASKNSSQETCTRARELAQQIGSHHISLNIDPAVKAVMGIFSLVTGKSPLFAAHGGSSRENLALQNVQARIRMVLAYLFAQLSLWSRGVHGGLLVLGSANVDESLLGYLTKYDCSSADINPIGGISKTDLRAFVQFCIQRFQLPALQSILLAPATAELEPLADGQVSQTDEEDMGMTYAELSVYGKLRKVAKMGPYSMFCKLLGMWRHICTPRQVADKVKRFFSKYSMNRHKMTTLTPAYHAENYSPEDNRFDLRPFLYNTSWPWQFRCIENQVLQLERAEPQSLDGVD,706,NP_060631.2.csv,refseq-NADSYN1-NM_018161.4_clinical_seed_0_final,refseq-NADSYN1-NM_018161.4.a2m,Invitae,refseq-NADSYN1-NM_018161.4.npy,1,706,706
+NP_060658.3,MSWLFGINKGPKGEGAGPPPPLPPAQPGAEGGGDRGLGDRPAPKDKWSNFDPTGLERAAKAARELEHSRYAKDALNLAQMQEQTLQLEQQSKLKMRLEALSLLHTLVWAWSLCRAGAVQTQERLSGSASPEQVPAGECCALQEYEAAVEQLKSEQIRAQAEERRKTLSEETRQHQARAQYQDKLARQRYEDQLKQQQLLNEENLRKQEESVQKQEAMRRATVEREMELRHKNEMLRVEAEARARAKAERENADIIREQIRLKAAEHRQTVLESIRTAGTLFGEGFRAFVTDWDKVTATVAGLTLLAVGVYSAKNATLVAGRFIEARLGKPSLVRETSRITVLEALRHPIQVSRRLLSRPQDALEGVVLSPSLEARVRDIAIATRNTKKNRSLYRNILMYGPPGTGKTLFAKKLALHSGMDYAIMTGGDVAPMGREGVTAMHKLFDWANTSRRGLLLFVDEADAFLRKRATEKISEDLRATLNAFLYRTGQHSNKFMLVLASNQPEQFDWAINDRINEMVHFDLPGQEERERLVRMYFDKYVLKPATEGKQRLKLAQFDYGRKCSEVARLTEGMSGREIAQLAVSWQATAYASEDGVLTEAMMDTRVQDAVQQHQQKMCWLKAEGPGRGDEPSPS,634,NP_060658.3.csv,refseq-ATAD3A-NM_018188.3_clinical_seed_0_final,refseq-ATAD3A-NM_018188.3.a2m,Invitae,refseq-ATAD3A-NM_018188.3.npy,1,634,634
+NP_060661.3,MVDVGKWPIFTLLSPQEIASIRKACVFGTSASEALYVTDNDEVFVFGLNYSNCLGTGDNQSTLVPKKLEGLCGKKIKSLSYGSGPHVLLSTEDGVVYAWGHNGYSQLGNGTTNQGIAPVQVCTNLLIKQVVEVACGSHHSMALAADGEVFAWGYNNCGQVGSGSTANQPTPRKVTNCLHIKRVVGIACGQTSSMAVLDNGEVYGWGYNGNGQLGLGNNGNQLTPVRVAALHSVCVNQIVCGYAHTLALTDEGLLYAWGANTYGQLGTGNKNNLLSPAHIMVEKERVVEIAACHSAHTSAAKTQGGHVYMWGQCRGQSVILPHLTHFSCTDDVFACFATPAVSWRLLSVEHEDFLTVAESLKKEFDSPETADLKFRIDGKYIHVHKAVLKIRCEHFRSMFQSYWNEDMKEVIEIDQFSYPVYRAFLQYLYTDTVDLPPEDAIGLLDLATSYCENRLKKLCQHIIKRGITVENAFSLFSAAVRYDAEDLEEFCFKFCINHLTEVTQTAAFWQMDGPLLKEFIAKASKCGAFKN,531,NP_060661.3.csv,refseq-RCBTB1-NM_018191.3_clinical_seed_0_final,refseq-RCBTB1-NM_018191.3.a2m,Invitae,refseq-RCBTB1-NM_018191.3.npy,1,531,531
+NP_060663.2,MDQKILSLAAEKTADKLQEFLQTLREGDLTNLLQNQAVKGKVAGALLRAIFKGSPCSEEAGTLRRRKIYTCCIQLVESGDLQKEIASEIIGLLMLEAHHFPGPLLVELANEFISAVREGSLVNGKSLELLPIILTALATKKENLAYGKGVLSGEECKKQLINTLCSGRWDQQYVIQLTSMFKDVPLTAEEVEFVVEKALSMFSKMNLQEIPPLVYQLLVLSSKGSRKSVLEGIIAFFSALDKQHNEEQSGDELLDVVTVPSGELRHVEGTIILHIVFAIKLDYELGRELVKHLKVGQQGDSNNNLSPFSIALLLSVTRIQRFQDQVLDLLKTSVVKSFKDLQLLQGSKFLQNLVPHRSYVSTMILEVVKNSVHSWDHVTQGLVELGFILMDSYGPKKVLDGKTIETSPSLSRMPNQHACKLGANILLETFKIHEMIRQEILEQVLNRVVTRASSPISHFLDLLSNIVMYAPLVLQSCSSKVTEAFDYLSFLPLQTVQRLLKAVQPLLKVSMSMRDCLILVLRKAMFANQLDARKSAVAGFLLLLKNFKVLGSLSSSQCSQSLSVSQVHVDVHSHYNSVANETFCLEIMDSLRRCLSQQADVRLMLYEGFYDVLRRNSQLANSVMQTLLSQLKQFYEPKPDLLPPLKLEACILTQGDKISLQEPLDYLLCCIQHCLAWYKNTVIPLQQGEEEEEEEEAFYEDLDDILESITNRMIKSELEDFELDKSADFSQSTSIGIKNNICAFLVMGVCEVLIEYNFSISSFSKNRFEDILSLFMCYKKLSDILNEKAGKAKTKMANKTSDSLLSMKFVSSLLTALFRVLLWRYTSIPTSVEESGKKEKGKSISLLCLEGLQKIFSAVQQFYQPKIQQFLRALDVTDKEGEEREDADVSVTQRTAFQIRQFQRSLLNLLSSQEEDFNSKEALLLVTVLTSLSKLLEPSSPQFVQMLSWTSKICKENSREDALFCKSLMNLLFSLHVSYKSPVILLRDLSQDIHGHLGDIDQDVEVEKTNHFAIVNLRTAAPTVCLLVLSQAEKVLEEVDWLITKLKGQVSQETLSEEASSQATLPNQPVEKAIIMQLGTLLTFFHELVQTALPSGSCVDTLLKDLCKMYTTLTALVRYYLQVCQSSGGIPKNMEKLVKLSGSHLTPLCYSFISYVQNKSKSLNYTGEKKEKPAAVATAMARVLRETKPIPNLIFAIEQYEKFLIHLSKKSKVNLMQHMKLSTSRDFKIKGNILDMVLREDGEDENEEGTASEHGGQNKEPAKKKRKK,1268,NP_060663.2.csv,refseq-FANCI-NM_018193.2_clinical_seed_0_final,refseq-FANCI-NM_018193.2.a2m,Invitae,refseq-FANCI-NM_018193.2_theta_0.2.npy,1,1268,1268
+NP_060676.2,MPTTQQSPQDEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKHASNMLGELRTSMLSPKSYYELYMAISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNIIPRLYLLITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPDEGEPTDEETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGTNLVRLSQLEGVNVERYKQIVLTGILEQVVNCRDALAQEYLMECIIQVFPDEFHLQTLNPFLRACAELHQNVNVKNIIIALIDRLALFAHREDGPGIPADIKLFDIFSQQVATVIQSRQDMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEHIATSSAVSKELTRLLKIPVDTYNNILTVLKLKHFHPLFEYFDYESRKSMSCYVLSNVLDYNTEIVSQDQVDSIMNLVSTLIQDQPDQPVEDPDPEDFADEQSLVGRFIHLLRSEDPDQQYLILNTARKHFGAGGNQRIRFTLPPLVFAAYQLAFRYKENSKVDDKWEKKCQKIFSFAHQTISALIKAELAELPLRLFLQGALAAGEIGFENHETVAYEFMSQAFSLYEDEISDSKAQLAAITLIIGTFERMKCFSEENHEPLRTQCALAASKLLKKPDQGRAVSTCAHLFWSGRNTDKNGEELHGGKRVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIREDLPNLESSEETEQINKHFHNTLEHLRLRRESPESEGPIYEGLIL,796,NP_060676.2.csv,VPS35_HUMAN_b01_clinical_seed_0_final,VPS35_HUMAN_b01.a2m,EVE,VPS35_HUMAN_b01_theta_0.2.npy,1,796,796
+NP_060700.2,MFPAAPSPRTPGTGSRRGPLAGLGPGSTPRTASRKGLPLGSAVSSPVLFSPVGRRSSLSSRGTPTRMFPHHSITESVNYDVKTFGSSLPVKVMEALTLAEVDDQLTINIDEGGWACLVCKEKLIIWKIALSPITKLSVCKELQLPPSDFHWSADLVALSYSSPSGEAHSTQAVAVMVATREGSIRYWPSLAGEDTYTEAFVDSGGDKTYSFLTAVQGGSFILSSSGSQLIRLIPESSGKIHQHILPQGQGMLSGIGRKVSSLFGILSPSSDLTLSSVLWDRERSSFYSLTSSNISKWELDDSSEKHAYSWDINRALKENITDAIWGSESNYEAIKEGVNIRYLDLKQNCDGLVILAAAWHSADNPCLIYYSLITIEDNGCQMSDAVTVEVTQYNPPFQSEDLILCQLTVPNFSNQTAYLYNESAVYVCSTGTGKFSLPQEKIVFNAQGDSVLGAGACGGVPIIFSRNSGLVSITSRENVSILAEDLEGSLASSVAGPNSESMIFETTTKNETIAQEDKIKLLKAAFLQYCRKDLGHAQMVVDELFSSHSDLDSDSELDRAVTQISVDLMDDYPASDPRWAESVPEEAPGFSNTSLIILHQLEDKMKAHSFLMDFIHQVGLFGRLGSFPVRGTPMATRLLLCEHAEKLSAAIVLKNHHSRLSDLVNTAILIALNKREYEIPSNLTPADVFFREVSQVDTICECLLEHEEQVLRDAPMDSIEWAEVVINVNNILKDMLQAASHYRQNRNSLYRREESLEKEPEYVPWTATSGPGGIRTVIIRQHEIVLKVAYPQADSNLRNIVTEQLVALIDCFLDGYVSQLKSVDKSSNRERYDNLEMEYLQKRSDLLSPLLSLGQYLWAASLAEKYCDFDILVQMCEQTDNQSRLQRYMTQFADQNFSDFLFRWYLEKGKRGKLLSQPISQHGQLANFLQAHEHLSWLHEINSQELEKAHATLLGLANMETRYFAKKKTLLGLSKLAALASDFSEDMLQEKIEEMAEQERFLLHQETLPEQLLAEKQLNLSAMPVLTAPQLIGLYICEENRRANEYDFKKALDLLEYIDEEEDININDLKLEILCKALQRDNWSSSDGKDDPIEVSKDSIFVKILQKLLKDGIQLSEYLPEVKDLLQADQLGSLKSNPYFEFVLKANYEYYVQGQI,1156,NP_060700.2.csv,refseq-NUP133-NM_018230.2_clinical_seed_0_final,refseq-NUP133-NM_018230.2.a2m,Invitae,refseq-NUP133-NM_018230.2.npy,1,1156,1156
+NP_060733.4,MREKGRRKKGRTWAEAAKTVLEKYPNTPMSHKEILQVIQREGLKEIRSGTSPLACLNAMLHTNSRGEEGIFYKVPGRMGVYTLKKDVPDGVKELSEGSEESSDGQSDSQSSENSSSSSDGGSNKEGKKSRWKRKVSSSSPQSGCPSPTIPAGKVISPSQKHSKKALKQALKQQQQKKQQQQCRPSISISSNQHLSLKTVKAASDSVPAKPATWEGKQSDGQTGSPQNSNSSFSSSVKVENTLLGLGKKSFQRSERLHTRQMKRTKCADIDVETPDSILVNTNLRALINKHTFSVLPGDCQQRLLLLLPEVDRQVGPDGLMKLNGSALNNEFFTSAAQGWKERLSEGEFTPEMQVRIRQEIEKEKKVEPWKEQFFESYYGQSSGLSLEDSKKLTASPSDPKVKKTPAEQPKSMPVSEASLIRIVPVVSQSECKEEALQMSSPGRKEECESQGEVQPNFSTSSEPLLSSALNTHELSSILPIKCPKDEDLLEQKPVTSAEQESEKNHLTTASNYNKSESQESLVTSPSKPKSPGVEKPIVKPTAGAGPQETNMKEPLATLVDQSPESLKRKSSLTQEEAPVSWEKRPRVTENRQHQQPFQVSPQPFLNRGDRIQVRKVPPLKIPVSRISPMPFHPSQVSPRARFPVSITSPNRTGARTLADIKAKAQLVKAQRAAAAAAAAAAAAASVGGTIPGPGPGGGQGPGEGGEGQTARGGSPGSDRVSETGKGPTLELAGTGSRGGTRELLPCGPETQPQSETKTTPSQAQPHSVSGAQLQQTPPVPPTPAVSGACTSVPSPAHIEKLDNEKLNPTRATATVASVSHPQGPSSCRQEKAPSPTGPALISGASPVHCAADGTVELKAGPSKNIPNPSASSKTDASVPVAVTPSPLTSLLTTATLEKLPVPQVSATTAPAGSAPPSSTLPAASSLKTPGTSLNMNGPTLRPTSSIPANNPLVTQLLQGKDVPMEQILPKPLTKVEMKTVPLTAKEERGMGALIATNTTENSTREEVNERQSHPATQQQLGKTLQSKQLPQVPRPLQLFSAKELRDSSIDTHQYHEGLSKATQDQILQTLIQRVRRQNLLSVVPPSQFNFAHSGFQLEDISTSQRFMLGFAGRRTSKPAMAGHYLLNISTYGRGSESFRRTHSVNPEDRFCLSSPTEALKMGYTDCKNATGESSSSKEDDTDEESTGDEQESVTVKEEPQVSQSAGKGDTSSGPHSRETLSTSDCLASKNVKAEIPLNEQTTLSKENYLFTRGQTFDEKTLARDLIQAAQKQMAHAVRGKAIRSSPELFSSTVLPLPADSPTHQPLLLPPLQTPKLYGSPTQIGPSYRGMINVSTSSDMDHNSAVPGSQVSSNVGDVMSFSVTVTTIPASQAMNPSSHGQTIPVQAFSEENSIEGTPSKCYCRLKAMIMCKGCGAFCHDDCIGPSKLCVSCLVVR,1435,NP_060733.4.csv,refseq-ASXL2-NM_018263.4_clinical_seed_0_final,refseq-ASXL2-NM_018263.4.a2m,Invitae,refseq-ASXL2-NM_018263.4.npy,1,1435,1435
+NP_060762.3,MLGRSLREVSAALKQGQITPTELCQKCLSLIKKTKFLNAYITVSEEVALKQAEESEKRYKNGQSLGDLDGIPIAVKDNFSTSGIETTCASNMLKGYIPPYNATVVQKLLDQGALLMGKTNLDEFAMGSGSTDGVFGPVKNPWSYSKQYREKRKQNPHSENEDSDWLITGGSSGGSAAAVSAFTCYAALGSDTGGSTRNPAAHCGLVGFKPSYGLVSRHGLIPLVNSMDVPGILTRCVDDAAIVLGALAGPDPRDSTTVHEPINKPFMLPSLADVSKLCIGIPKEYLVPELSSEVQSLWSKAADLFESEGAKVIEVSLPHTSYSIVCYHVLCTSEVASNMARFDGLQYGHRCDIDVSTEAMYAATRREGFNDVVRGRILSGNFFLLKENYENYFVKAQKVRRLIANDFVNAFNSGVDVLLTPTTLSEAVPYLEFIKEDNRTRSAQDDIFTQAVNMAGLPAVSIPVALSNQGLPIGLQFIGRAFCDQQLLTVAKWFEKQVQFPVIQLQELMDDCSAVLENEKLASVSLKQ,528,NP_060762.3.csv,refseq-QRSL1-NM_018292.4_clinical_seed_0_final,refseq-QRSL1-NM_018292.4.a2m,Invitae,refseq-QRSL1-NM_018292.4.npy,1,528,528
+NP_060764.3,MAQKPLRLLACGDVEGKFDILFNRVQAIQKKSGNFDLLLCVGNFFGSTQDAEWEEYKTGIKKAPIQTYVLGANNQETVKYFQDADGCELAENITYLGRKGIFTGSSGLQIVYLSGTESLNEPVPGYSFSPKDVSSLRMMLCTTSQFKGVDILLTSPWPKCVGNFGNSSGEVDTKKCGSALVSSLATGLKPRYHFAALEKTYYERLPYRNHIILQENAQHATRFIALANVGNPEKKKYLYAFSIVPMKLMDAAELVKQPPDVTENPYRKSGQEASIGKQILAPVEESACQFFFDLNEKQGRKRSSTGRDSKSSPHPKQPRKPPQPPGPCWFCLASPEVEKHLVVNIGTHCYLALAKGGLSDDHVLILPIGHYQSVVELSAEVVEEVEKYKATLRRFFKSRGKWCVVFERNYKSHHLQLQVIPVPISCSTTDDIKDAFITQAQEQQIELLEIPEHSDIKQIAQPGAAYFYVELDTGEKLFHRIKKNFPLQFGREVLASEAILNVPDKSDWRQCQISKEDEETLARRFRKDFEPYDFTLDD,538,NP_060764.3.csv,refseq-CWF19L1-NM_018294.5_clinical_seed_0_final,refseq-CWF19L1-NM_018294.5.a2m,Invitae,refseq-CWF19L1-NM_018294.5.npy,1,538,538
+NP_060767.2,MAAAALGSSSGSASPAVAELCQNTPETFLEASKLLLTYADNILRNPNDEKYRSIRIGNTAFSTRLLPVRGAVECLFEMGFEEGETHLIFPKKASVEQLQKIRDLIAIERSSRLDGSNKSHKVKSSQQPAASTQLPTTPSSNPSGLNQHTRNRQGQSSDPPSASTVAADSAILEVLQSNIQHVLVYENPALQEKALACIPVQELKRKSQEKLSRARKLDKGINISDEDFLLLELLHWFKEEFFHWVNNVLCSKCGGQTRSRDRSLLPSDDELKWGAKEVEDHYCDACQFSNRFPRYNNPEKLLETRCGRCGEWANCFTLCCRAVGFEARYVWDYTDHVWTEVYSPSQQRWLHCDACEDVCDKPLLYEIGWGKKLSYVIAFSKDEVVDVTWRYSCKHEEVIARRTKVKEALLRDTINGLNKQRQLFLSENRRKELLQRIIVELVEFISPKTPKPGELGGRISGSVAWRVARGEMGLQRKETLFIPCENEKISKQLHLCYNIVKDRYVRVSNNNQTISGWENGVWKMESIFRKVETDWHMVYLARKEGSSFAYISWKFECGSVGLKVDSISIRTSSQTFQTGTVEWKLRSDTAQVELTGDNSLHSYADFSGATEVILEAELSRGDGDVAWQHTQLFRQSLNDHEENCLEIIIKFSDL,654,NP_060767.2.csv,refseq-NGLY1-NM_018297.3_clinical_seed_0_final,refseq-NGLY1-NM_018297.3.a2m,Invitae,refseq-NGLY1-NM_018297.3.npy,1,654,654
+NP_060789.2,MSQEGDYGRWTISSSDESEEEKPKPDKPSTSSLLCARQGAANEPRYTCSEAQKAAHKRKISPVKFSNTDSVLPPKRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPACHRLKEEEDEYETSGEGQDIWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWIDVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSAFGLDSFKVKQKFFAGSQEPMATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMWVPS,608,NP_060789.2.csv,refseq-TDP1-NM_018319.3_clinical_seed_0_final,refseq-TDP1-NM_018319.3.a2m,Invitae,refseq-TDP1-NM_018319.3.npy,1,608,608
+NP_060798.2,MNGGKECDGGDKEGGLPAIQVPVGWQRRVDQNGVLYVSPSGSLLSCLEQVKTYLLTDGTCKCGLECPLILPKVFNFDPGAAVKQRTAEDVKADEDVTKLCIHKRKIIAVATLHKSMEAPHPSLVLTSPGGGTNATPVVPSRAATPRSVRNKSHEGITNSVMPECKNPFKLMIGSSNAMGRLYVQELPGSQQQELHPVYPRQRLGSSEHGQKSPFRGSHGGLPSPASSGSQIYGDGSISPRTDPLGSPDVFTRSNPGFHGAPNSSPIHLNRTPLSPPSVMLHGSPVQSSCAMAGRTNIPLSPTLTTKSPVMKKPMCNFSTNMEIPRAMFHHKPPQGPPPPPPPSCALQKKPLTSEKDPLGILDPIPSKPVNQNPVIINPTSFHSNVHSQVPMMNVSMPPAVVPLPSNLPLPTVKPGHMNHGSHVQRVQHSASTSLSPSPVTSPVHMMGTGIGRIEASPQRSRSSSTSSDHGNFMMPPVGPQATSSGIKVPPRSPRSTIGSPRPSMPSSPSTKSDGHHQYKDIPNPLIAGISNVLNTPSSAAFPTASAGSSSVKSQPGLLGMPLNQILNQHNAASFPASSLLSAAAKAQLANQNKLAGNNSSSSSNSGAVAGSGNTEGHSTLNTMFPPTANMLLPTGEGQSGRAALRDKLMSQQKDALRKRKQPPTTVLSLLRQSQMDSSAVPKPGPDLLRKQGQGSFPISSMSQLLQSMSCQSSHLSSNSTPGCGASNTALPCSANQLHFTDPSMNSSVLQNIPLRGEAVHCHNANTNFVHSNSPVPNHHLAGLINQIQASGNCGMLSQSGMALGNSLHPNPPQSRISTSSTPVIPNSIVSSYNQTSSEAGGSGPSSSIAIAGTNHPAITKTTSVLQDGVIVTTAAGNPLQSQLPIGSDFPFVGQEHALHFPSNSTSNNHLPHPLNPSLLSSLPISLPVNQQHLLNQNLLNILQPSAGEGDMSSINNTLSNHQLTHLQSLLNNNQMFPPNQQQQQLLQGYQNLQAFQGQSTIPCPANNNPMACLFQNFQVRMQEDAALLNKRISTQPGLTALPENPNTTLPPFQDTPCELQPRIDPSLGQQVKDGLVVGGPGDASVDAIYKAVVDAASKGMQVVITTAVNSTTQISPIPALSAMSAFTASIGDPLNLSSAVSAVIHGRNMGGVDHDGRLRNSRGARLPKNLDHGKNVNEGDGFEYFKSASCHTSKKQWDGEQSPRGERNRWKYEEFLDHPGHIHSSPCHERPNNVSTLPFLPGEQHPILLPPRNCPGDKILEENFRYNNYKRTMMSFKERLENTVERCAHINGNRPRQSRGFGELLSTAKQDLVLEEQSPSSSNSLENSLVKDYIHYNGDFNAKSVNGCVPSPSDAKSISSEDDLRNPDSPSSNELIHYRPRTFNVGDLVWGQIKGLTSWPGKLVREDDVHNSCQQSPEEGKVEPEKLKTLTEGLEAYSRVRKRNRKSGKLNNHLEAAIHEAMSELDKMSGTVHQIPQGDRQMRPPKPKRRKISR,1494,NP_060798.2.csv,refseq-MBD5-NM_018328.4_clinical_seed_0_final,refseq-MBD5-NM_018328.4.a2m,Invitae,refseq-MBD5-NM_018328.4_theta_0.2.npy,1,1494,1494
+NP_060814.4,MAVVSEDDFQHSSNSTYRTTSSSLRADQEALLEKLLDRPPPGLQRPEDRFCGTYIIFFSLGIGSLLPWNFFITAKEYWMFKLRNSSSPATGEDPEGSDILNYFESYLAVASTVPSMLCLVANFLLVNRVAVHIRVLASLTVILAIFMVITALVKVDTSSWTRGFFAVTIVCMVILSGASTVFSSSIYGMTGSFPMRNSQALISGGAMGGTVSAVASLVDLAASSDVRNSALAFFLTATVFLVLCMGLYLLLSRLEYARYYMRPVLAAHVFSGEEELPQDSLSAPSVASRFIDSHTPPLRPILKKTASLGFCVTYVFFITSLIYPAICTNIESLNKGSGSLWTTKFFIPLTTFLLYNFADLCGRQLTAWIQVPGPNSKALPGFVLLRTCLIPLFVLCNYQPRVHLKTVVFQSDVYPALLSSLLGLSNGYLSTLALLYGPKIVPRELAEATGVVMSFYVCLGLTLGSACSTLLVHLI,475,NP_060814.4.csv,refseq-SLC29A3-NM_018344.5_clinical_seed_0_final,refseq-SLC29A3-NM_018344.5.a2m,Invitae,refseq-SLC29A3-NM_018344.5.npy,1,475,475
+NP_060829.2,MVISESMDILFRIRGGLDLAFQLATPNEIFLKKALKHVLSDLSTKLSSNALVFRICHSSVYIWPSSDINTIPGELTDASACKNILRFIQFEPEEDIKRKFMRKKDKKLSDMHQIVNIDLMLEMSTSLAAVTPIIERESGGHHYVNMTLPVDAVISVAPEETWGKVRKLLVDAIHNQLTDMEKCILKYMKGTSIVVPEPLHFLLPGKKNLVTISYPSGIPDGQLQAYRKELHDLFNLPHDRPYFKRSNAYHFPDEPYKDGYIRNPHTYLNPPNMETGMIYVVQGIYGYHHYMQDRIDDNGWGCAYRSLQTICSWFKHQGYTERSIPTHREIQQALVDAGDKPATFVGSRQWIGSIEVQLVLNQLIGITSKILFVSQGSEIASQGRELANHFQSEGTPVMIGGGVLAHTILGVAWNEITGQIKFLILDPHYTGAEDLQVILEKGWCGWKGPDFWNKDAYYNLCLPQRPNMI,469,NP_060829.2.csv,refseq-UFSP2-NM_018359.5_clinical_seed_0_final,refseq-UFSP2-NM_018359.5.a2m,Invitae,refseq-UFSP2-NM_018359.5.npy,1,469,469
+NP_060844.2,MGKSLSHLPLHSSKEDAYDGVTSENMRNGLVNSEVHNEDGRNGDVSQFPYVEFTGRDSVTCPTCQGTGRIPRGQENQLVALIPYSDQRLRPRRTKLYVMASVFVCLLLSGLAVFFLFPRSIDVKYIGVKSAYVSYDVQKRTIYLNITNTLNITNNNYYSVEVENITAQVQFSKTVIGKARLNNITIIGPLDMKQIDYTVPTVIAEEMSYMYDFCTLISIKVHNIVLMMQVTVTTTYFGHSEQISQERYQYVDCGRNTTYQLGQSEYLNVLQPQQ,274,NP_060844.2.csv,refseq-TMEM106B-NM_018374.3_clinical_seed_0_final,refseq-TMEM106B-NM_018374.3.a2m,Invitae,refseq-TMEM106B-NM_018374.3.npy,1,274,274
+NP_060859.4,MNRAPLKRSRILHMALTGASDPSAEAEANGEKPFLLRALQIALVVSLYWVTSISMVFLNKYLLDSPSLRLDTPIFVTFYQCLVTTLLCKGLSALAACCPGAVDFPSLRLDLRVARSVLPLSVVFIGMITFNNLCLKYVGVAFYNVGRSLTTVFNVLLSYLLLKQTTSFYALLTCGIIIGGFWLGVDQEGAEGTLSWLGTVFGVLASLCVSLNAIYTTKVLPAVDGSIWRLTFYNNVNACILFLPLLLLLGELQALRDFAQLGSAHFWGMMTLGGLFGFAIGYVTGLQIKFTSPLTHNVSGTAKACAQTVLAVLYYEETKSFLWWTSNMMVLGGSSAYTWVRGWEMKKTPEEPSPKDSEKSAMGV,364,NP_060859.4.csv,refseq-SLC35C1-NM_018389.4_clinical_seed_0_final,refseq-SLC35C1-NM_018389.4.a2m,Invitae,refseq-SLC35C1-NM_018389.4.npy,1,364,364
+NP_060921.3,MFLMPTSSELNSGQNFLTQWMTNPSRAGVILNRGFPILEADKEKRAAVDISTSFPIKGTHFSDSFSFINEEDSLLEEQKLESNNPYKPQSDKSETHTAFPCIKKGPQVAACHSAPGHQEENKNDFIPDLASEFKEGAYKDPLFKKLEQLKEVQQKKQEQLKRQQLEQLQRLMEEQEKLLTMVSGQCTLPGLSLLPDDQSQKHRSPGNTTTGERATCCFPSYVYPDPTQEETYPSNILSHEQSNFCRTAHGDFVLTSKRASPNLFSEAQYQEAPVEKNNLKEENRNHPTGESILCWEKVTEQIQEANDKNLQKHDDSSEVANIEERPIKAAIGERKQTFEDYLEEQIQLEEQELKQKQLKEAEGPLPIKAKPKQPFLKRGEGLARFTNAKSKFQKGKESKLVTNQSTSEDQPLFKMDRQQLQRKTALKNKELCADNPILKKDSKARTKSGSVTLSQKPKMLKCSNRKSLSPSGLKIQTGKKCDGQFRDQIKFENKVTSNNKENVTECPKPCDTGCTGWNKTQGKDRLPLSTGPASRLAAKSPIRETMKESESSLDVSLQKKLETWEREKEKENLELDEFLFLEQAADEISFSSNSSFVLKILERDQQICKGHRMSSTPVKAVPQKTNPADPISHCNRSEDLDHTAREKESECEVAPKQLHSLSSADELREQPCKIRKAVQKSTSENQTEWNARDDEGVPNSDSSTDSEEQLDVTIKPSTEDRERGISSREDSPQVCDDKGPFKDTRTQEDKRRDVDLDLSDKDYSSDESIMESIKHKVSEPSRSSSLSLSKMDFDDERTWTDLEENLCNHDVVLGNESTYGTPQTCYPNNEIGILDKTIKRKIAPVKRGEDLSKSRRSRSPPTSELMMKFFPSLKPKPKSDSHLGNELKLNISQDQPPGDNARSQVLREKIIELETEIEKFKAENASLAKLRIERESALEKLRKEIADFEQQKAKELARIEEFKKEEMRKLQKERKVFEKYTTAARTFPDKKEREEIQTLKQQIADLREDLKRKETKWSSTHSRLRSQIQMLVRENTDLREEIKVMERFRLDAWKRAEAIESSLEVEKKDKLANTSVRFQNSQISSGTQVEKYKKNYLPMQGNPPRRSKSAPPRDLGNLDKGQAASPREPLEPLNFPDPEYKEEEEDQDIQGEISHPDGKVEKVYKNGCRVILFPNGTRKEVSADGKTITVTFFNGDVKQVMPDQRVIYYYAAAQTTHTTYPEGLEVLHFSSGQIEKHYPDGRKEITFPDQTVKNLFPDGQEESIFPDGTIVRVQRDGNKLIEFNNGQRELHTAQFKRREYPDGTVKTVYANGHQETKYRSGRIRVKDKEGNVLMDTEL,1338,NP_060921.3.csv,CENPJ_HUMAN_b01_clinical_seed_0_final,CENPJ_HUMAN_b01.a2m,EVE,CENPJ_HUMAN_b01_theta_0.2.npy,1,1338,1338
+NP_060950.3,MVVFGYEAGTKPRDSGVVPVGTEEAPKVFKMAASMHGQPSPSLEDAKLRRPMVIEIIEKNFDYLRKEMTQNIYQMATFGTTAGFSGIFSNFLFRRCFKVKHDALKTYASLATLPFLSTVVTDKLFVIDALYSDNISKENCVFRSSLIGIVCGVFYPSSLAFTKNGRLATKYHTVPLPPKGRVLIHWMTLCQTQMKLMAIPLVFQIMFGILNGLYHYAVFEETLEKTIHEE,230,NP_060950.3.csv,refseq-TMEM126B-NM_018480.4_clinical_seed_0_final,refseq-TMEM126B-NM_018480.4.a2m,Invitae,refseq-TMEM126B-NM_018480.4.npy,1,230,230
+NP_060956.1,MEEPEEPADSGQSLVPVYIYSPEYVSMCDSLAKIPKRASMVHSLIEAYALHKQMRIVKPKVASMEEMATFHTDAYLQHLQKVSQEGDDDHPDSIEYGLGYDCPATEGIFDYAAAIGGATITAAQCLIDGMCKVAINWSGGWHHAKKDEASGFCYLNDAVLGILRLRRKFERILYVDLDLHHGDGVEDAFSFTSKVMTVSLHKFSPGFFPGTGDVSDVGLGKGRYYSVNVPIQDGIQDEKYYQICESVLKEVYQAFNPKAVVLQLGADTIAGDPMCSFNMTPVGIGKCLKYILQWQLATLILGGGGYNLANTARCWTYLTGVILGKTLSSEIPDHEFFTAYGPDYVLEITPSCRPDRNEPHRIQQILNYIKGNLKHVV,377,NP_060956.1.csv,refseq-HDAC8-NM_018486.2_clinical_seed_0_final,refseq-HDAC8-NM_018486.2.a2m,Invitae,refseq-HDAC8-NM_018486.2.npy,1,377,377
+NP_060958.2,MLQDKGLSESEEAFRAPGPALGEASAANAPEPALAAPGLSGAALGSPPGPGADVVAAAAAEQTIENIKVGLHEKELWKKFHEAGTEMIITKAGRRMFPSYKVKVTGMNPKTKYILLIDIVPADDHRYKFCDNKWMVAGKAEPAMPGRLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDPFGHIILNSMHKYQPRLHIVKADENNAFGSKNTAFCTHVFPETSFISVTSYQNHKITQLKIENNPFAKGFRGSDDSDLRVARLQSKEYPVISKSIMRQRLISPQLSATPDVGPLLGTHQALQHYQHENGAHSQLAEPQDLPLSTFPTQRDSSLFYHCLKRRDGTRHLDLPCKRSYLEAPSSVGEDHYFRSPPPYDQQMLSPSYCSEVTPREACMYSGSGPEIAGVSGVDDLPPPPLSCNMWTSVSPYTSYSVQTMETVPYQPFPTHFTATTMMPRLPTLSAQSSQPPGNAHFSVYNQLSQSQVRERGPSASFPRERGLPQGCERKPPSPHLNAANEFLYSQTFSLSRESSLQYHSGMGTVENWTDG,545,NP_060958.2.csv,refseq-TBX4-NM_018488.3_clinical_seed_0_final,refseq-TBX4-NM_018488.3.a2m,Invitae,refseq-TBX4-NM_018488.3.npy,1,545,545
+NP_061116.5,MGPLQGDGGPALGGADVAPRLSPVRVWPRPQAPKEPALHPMGLSLPKEKGLILCLWSKFCRWFQRRESWAQSRDEQNLLQQKRIWESPLLLAAKDNDVQALNKLLKYEDCKVHQRGAMGETALHIAALYDNLEAAMVLMEAAPELVFEPMTSELYEGQTALHIAVVNQNMNLVRALLARRASVSARATGTAFRRSPCNLIYFGEHPLSFAACVNSEEIVRLLIEHGADIRAQDSLGNTVLHILILQPNKTFACQMYNLLLSYDRHGDHLQPLDLVPNHQGLTPFKLAGVEGNTVMFQHLMQKRKHTQWTYGPLTSTLYDLTEIDSSGDEQSLLELIITTKKREARQILDQTPVKELVSLKWKRYGRPYFCMLGAIYLLYIICFTMCCIYRPLKPRTNNRTSPRDNTLLQQKLLQEAYMTPKDDIRLVGELVTVIGAIIILLVEVPDIFRMGVTRFFGQTILGGPFHVLIITYAFMVLVTMVMRLISASGEVVPMSFALVLGWCNVMYFARGFQMLGPFTIMIQKMIFGDLMRFCWLMAVVILGFASAFYIIFQTEDPEELGHFYDYPMALFSTFELFLTIIDGPANYNVDLPFMYSITYAAFAIIATLLMLNLLIAMMGDTHWRVAHERDELWRAQIVATTVMLERKLPRCLWPRSGICGREYGLGDRWFLRVEDRQDLNRQRIQRYAQAFHTRGSEDLDKDSVEKLELGCPFSPHLSLPMPSVSRSTSRSSANWERLRQGTLRRDLRGIINRGLEDGESWEYQI,765,NP_061116.5.csv,NP_061116.5_clinical_seed_0_final,NP_061116.5.a2m,popEVE,NP_061116.5_theta_0.2.npy,1,765,765
+NP_061154.1,MADEQEIMCKLESIKEIRNKTLQMEKIKARLKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKLLESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPPIPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKSRSRNPKKPKRKQDE,224,NP_061154.1.csv,refseq-ZC4H2-NM_018684.3_clinical_seed_0_final,refseq-ZC4H2-NM_018684.3.a2m,Invitae,refseq-ZC4H2-NM_018684.3.npy,1,224,224
+NP_061155.2,MDPFTEKLLERTRARRENLQRKMAERPTAAPRSMTHAKRARQPLSEASNQQPLSGGEEKSCTKPSPSKKRCSDNTEVEVSNLENKQPVESTSAKSCSPSPVSPQVQPQAADTISDSVAVPASLLGMRRGLNSRLEATAASSVKTRMQKLAEQRRRWDNDDMTDDIPESSLFSPMPSEEKAASPPRPLLSNASATPVGRRGRLANLAATICSWEDDVNHSFAKQNSVQEQPGTACLSKFSSASGASARINSSSVKQEATFCSQRDGDASLNKALSSSADDASLVNASISSSVKATSPVKSTTSITDAKSCEGQNPELLPKTPISPLKTGVSKPIVKSTLSQTVPSKGELSREICLQSQSKDKSTTPGGTGIKPFLERFGERCQEHSKESPARSTPHRTPIITPNTKAIQERLFKQDTSSSTTHLAQQLKQERQKELACLRGRFDKGNIWSAEKGGNSKSKQLETKQETHCQSTPLKKHQGVSKTQSLPVTEKVTENQIPAKNSSTEPKGFTECEMTKSSPLKITLFLEEDKSLKVTSDPKVEQKIEVIREIEMSVDDDDINSSKVINDLFSDVLEEGELDMEKSQEEMDQALAESSEEQEDALNISSMSLLAPLAQTVGVVSPESLVSTPRLELKDTSRSDESPKPGKFQRTRVPRAESGDSLGSEDRDLLYSIDAYRSQRFKETERPSIKQVIVRKEDVTSKLDEKNNAFPCQVNIKQKMQELNNEINMQQTVIYQASQALNCCVDEEHGKGSLEEAEAERLLLIATGKRTLLIDELNKLKNEGPQRKNKASPQSEFMPSKGSVTLSEIRLPLKADFVCSTVQKPDAANYYYLIILKAGAENMVATPLASTSNSLNGDALTFTTTFTLQDVSNDFEINIEVYSLVQKKDPSGLDKKKKTSKSKAITPKRLLTSITTKSNIHSSVMASPGGLSAVRTSNFALVGSYTLSLSSVGNTKFVLDKVPFLSSLEGHIYLKIKCQVNSSVEERGFLTIFEDVSGFGAWHRRWCVLSGNCISYWTYPDDEKRKNPIGRINLANCTSRQIEPANREFCARRNTFELITVRPQREDDRETLVSQCRDTLCVTKNWLSADTKEERDLWMQKLNQVLVDIRLWQPDACYKPIGKP,1124,NP_061155.2.csv,refseq-ANLN-NM_018685.4_clinical_seed_0_final,refseq-ANLN-NM_018685.4.a2m,Invitae,refseq-ANLN-NM_018685.4.npy,1,1124,1124
+NP_061169.2,MLGMYVPDRFSLKSSRVQDGMGLYTARRVRKGEKFGPFAGEKRMPEDLDENMDYRLMWEVRGSKGEVLYILDATNPRHSNWLRFVHEAPSQEQKNLAAIQEGENIFYLAVEDIETDTELLIGYLDSDMEAEEEEQQIMTVIKEGEVENSRRQSTAGRKDRLGCKEDYACPQCESSFTSEDILAEHLQTLHQKPTEEKEFKCKNCGKKFPVKQALQRHVLQCTAKSSLKESSRSFQCSVCNSSFSSASSFEQHQETCRGDARFVCKADSCGKRLKSKDALKRHQENVHTGDPKKKLICSVCNKKCSSASSLQEHRKIHEIFDCQECMKKFISANQLKRHMITHSEKRPYNCEICNKSFKRLDQVGAHKVIHSEDKPYKCKLCGKGFAHRNVYKNHKKTHSEERPFQCEECKALFRTPFSLQRHLLIHNSERTFKCHHCDATFKRKDTLNVHVQVVHERHKKYRCELCNKAFVTPSVLRSHKKTHTGEKEKICPYCGQKFASSGTLRVHIRSHTGERPYQCPYCEKGFSKNDGLKMHIRTHTREKPYKCSECSKAFSQKRGLDEHKRTHTGEKPFQCDVCDLAFSLKKMLIRHKMTHNPNRPLAECQFCHKKFTRNDYLKVHMDNIHGVADS,630,NP_061169.2.csv,refseq-PRDM5-NM_018699.3_clinical_seed_0_final,refseq-PRDM5-NM_018699.3.a2m,Invitae,refseq-PRDM5-NM_018699.3.npy,1,630,630
+NP_061176.4,MASATAAAARRGLGRALPLFWRGYQTERGVYGYRPRKPESREPQGALERPPVDHGLARLVTVYCEHGHKAAKINPLFTGQALLENVPEIQALVQTLQGPFHTAGLLNMGKEEASLEEVLVYLNQIYCGQISIETSQLQSQDEKDWFAKRFEELQKETFTTEERKHLSKLMLESQEFDHFLATKFSTVKRYGGEGAESMMGFFHELLKMSAYSGITDVIIGMPHRGRLNLLTGLLQFPPELMFRKMRGLSEFPENFSATGDVLSHLTSSVDLYFGAHHPLHVTMLPNPSHLEAVNPVAVGKTRGRQQSRQDGDYSPDNSAQPGDRVICLQVHGDASFCGQGIVPETFTLSNLPHFRIGGSVHLIVNNQLGYTTPAERGRSSLYCSDIGKLVGCAIIHVNGDSPEEVVRATRLAFEYQRQFRKDVIIDLLCYRQWGHNELDEPFYTNPIMYKIIRARKSIPDTYAEHLIAGGLMTQEEVSEIKSSYYAKLNDHLNNMAHYRPPALNLQAHWQGLAQPEAQITTWSTGVPLDLLRFVGMKSVEVPRELQMHSHLLKTHVQSRMEKMMDGIKLDWATAEALALGSLLAQGFNVRLSGQDVGRGTFSQRHAIVVCQETDDTYIPLNHMDPNQKGFLEVSNSPLSEEAVLGFEYGMSIESPKLLPLWEAQFGDFFNGAQIIFDTFISGGEAKWLLQSGIVILLPHGYDGAGPDHSSCRIERFLQMCDSAEEGVDGDTVNMFVVHPTTPAQYFHLLRRQMVRNFRKPLIVASPKMLLRLPAAVSTLQEMAPGTTFNPVIGDSSVDPKKVKTLVFCSGKHFYSLVKQRESLGAKKHDFAIIRVEELCPFPLDSLQQEMSKYKHVKDHIWSQEEPQNMGPWSFVSPRFEKQLACKLRLVGRPPLPVPAVGIGTVHLHQHEDILAKTFA,919,NP_061176.4.csv,refseq-DHTKD1-NM_018706.7_clinical_seed_0_final,refseq-DHTKD1-NM_018706.7.a2m,Invitae,refseq-DHTKD1-NM_018706.7.npy,1,919,919
+NP_061336.1,MSRLEAKKPSLCKSEPLTTERVRTTLSVLKRIVTSCYGPSGRLKQLHNGFGGYVCTTSQSSALLSHLLVTHPILKILTASIQNHVSSFSDCGLFTAILCCNLIENVQRLGLTPTTVIRLNKHLLSLCISYLKSETCGCRIPVDFSSTQILLCLVRSILTSKPACMLTRKETEHVSALILRAFLLTIPENAEGHIILGKSLIVPLKGQRVIDSTVLPGILIEMSEVQLMRLLPIKKSTALKVALFCTTLSGDTSDTGEGTVVVSYGVSLENAVLDQLLNLGRQLISDHVDLVLCQKVIHPSLKQFLNMHRIIAIDRIGVTLMEPLTKMTGTQPIGSLGSICPNSYGSVKDVCTAKFGSKHFFHLIPNEATICSLLLCNRNDTAWDELKLTCQTALHVLQLTLKEPWALLGGGCTETHLAAYIRHKTHNDPESILKDDECTQTELQLIAEAFCSALESVVGSLEHDGGEILTDMKYGHLWSVQADSPCVANWPDLLSQCGCGLYNSQEELNWSFLRSTRRPFVPQSCLPHEAVGSASNLTLDCLTAKLSGLQVAVETANLILDLSYVIEDKN,570,NP_061336.1.csv,refseq-MKKS-NM_018848.3_clinical_seed_0_final,refseq-MKKS-NM_018848.3.a2m,Invitae,refseq-MKKS-NM_018848.3.npy,1,570,570
+NP_061485.1,MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRPLSYPQTVGETYGKDITSRGKDKPIADVFLICFSLVSPASFENVRAKWYPEVRHHCPNTPIILVGTKLDLRDDKDTIEKLKEKKLTPITYPQGLAMAKEIGAVKYLECSALTQRGLKTVFDEAIRAVLCPPPVKKRKRKCLLL,211,NP_061485.1.csv,refseq-RAC1-NM_018890.3_clinical_seed_0_final,refseq-RAC1-NM_018890.3.a2m,Invitae,refseq-RAC1-NM_018890.3.npy,1,211,211
+NP_061764.2,MNPASDGGTSESIFDLDYASWGIRSTLMVAGFVFYLGVFVVCHQLSSSLNATYRSLVAREKVFWDLAATRAVFGVQSTAAGLWALLGDPVLHADKARGQQNWCWFHITTATGFFCFENVAVHLSNLIFRTFDLFLVIHHLFAFLGFLGCLVNLQAGHYLAMTTLLLEMSTPFTCVSWMLLKAGWSESLFWKLNQWLMIHMFHCRMVLTYHMWWVCFWHWDGLVSSLYLPHLTLFLVGLALLTLIINPYWTHKKTQQLLNPVDWNFAQPEAKSRPEGNGQLLRKKRP,286,NP_061764.2.csv,refseq-CLN8-NM_018941.3_clinical_seed_0_final,refseq-CLN8-NM_018941.3.a2m,Invitae,refseq-CLN8-NM_018941.3.npy,1,286,286
+NP_061850.2,MWLRLGPPSLSLSPKPTVGRSLCLTLWFLSLALRASTQAPAPTVNTHFGKLRGARVPLPSEILGPVDQYLGVPYAAPPIGEKRFLPPEPPPSWSGIRNATHFPPVCPQNIHTAVPEVMLPVWFTANLDIVATYIQEPNEDCLYLNVYVPTEDGSGAKKQGEDLADNDGDEDEDIRDSGAKPVMVYIHGGSYMEGTGNMIDGSILASYGNVIVITLNYRVGVLGFLSTGDQAAKGNYGLLDQIQALRWVSENIAFFGGDPRRITVFGSGIGASCVSLLTLSHHSEGLFQRAIIQSGSALSSWAVNYQPVKYTSLLADKVGCNVLDTVDMVDCLRQKSAKELVEQDIQPARYHVAFGPVIDGDVIPDDPEILMEQGEFLNYDIMLGVNQGEGLKFVEGVVDPEDGVSGTDFDYSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADRDNPETRRKTLVALFTDHQWVEPSVVTADLHARYGSPTYFYAFYHHCQSLMKPAWSDAAHGDEVPYVFGVPMVGPTDLFPCNFSKNDVMLSAVVMTYWTNFAKTGDPNKPVPQDTKFIHTKANRFEEVAWSKYNPRDQLYLHIGLKPRVRDHYRATKVAFWKHLVPHLYNLHDMFHYTSTTTKVPPPDTTHSSHITRRPNGKTWSTKRPAISPAYSNENAQGSWNGDQDAGPLLVENPRDYSTELSVTIAVGASLLFLNVLAFAALYYRKDKRRQEPLRQPSPQRGAGAPELGAAPEEELAALQLGPTHHECEAGPPHDTLRLTALPDYTLTLRRSPDDIPLMTPNTITMIPNSLVGLQTLHPYNTFAAGFNSTGLPHSHSTTRV,828,NP_061850.2.csv,refseq-NLGN3-NM_018977.3_clinical_seed_0_final,refseq-NLGN3-NM_018977.3.a2m,Invitae,refseq-NLGN3-NM_018977.3.npy,1,828,828
+NP_061868.1,MLSLAAKLVAFFWRTADTPREEAGQLEPELAEGDTKLKTVRGVVTRYCSDYGMIDDMIYFSSDAVTSRVLLNVGQEVIAVVEENKVSNGLKAIRVEAVSDKWEDDSRNHGSPSDCGPRVLIGCVTSLVEGAGCISQTTYFSLESVCEGFEPCKGDWVEAEYRIRPGTWSSEATSVKPLRYKRVDKVCISSLCGRNGVLEESIFFTLDSLKLPDGYTPRRGDVVNAVVVESSQSCYVWRALCMTLVKRRDAAPVHEATHFYGTILLKNKGDIEVTQVTHFGTLKEGRSKTMVIWIENKGDIPQNLVSCKLAGWDKSKQFRFQMLDKDQMCPVVSFVSVPEKENSSDENINSLNSHTKNKTSQMSESSLVNNRGISPGDCTCKGENGEKDNILSRKQMTEPEPGGLVPPGGKTFIVVICDGKNPGRCKELLLLCFSDFLIGRYLEVNVISGEESLIAAREPFSWKKLKSSQALTSAKTTVVVTAQKRNSRRQLPSFLPQYPIPDRLRKCVEQKIDILTFQPLLAELLNMSNYKEKFSTLLWLEEIYAEMELKEYNMSGIILRRNGDLLVLEVPGLAEGRPSLYAGDKLILKTQEYNGHAIEYISYVTEIHEEDVTLKINPEFEQAYNFEPMDVEFTYNRTTSRRCHFALEHVIHLGVKVLFPEEIILQSPQVTGNWNHAQDTKSSGQSTSKKNRKTMTDQAEHGTEERRVGDKDLPVLAPFTAEMSDWVDEIQTPKARKMEFFNPVLNENQKLAVKRILSGDCRPLPYILFGPPGTGKTVTIIEAVLQVHFALPDSRILVCAPSNSAADLVCLRLHESKVLQPATMVRVNATCRFEEIVIDAVKPYCRDGEDIWKASRFRIIITTCSSSGLFYQIGVRVGHFTHVFVDEAGQASEPECLIPLGLMSDISGQIVLAGDPMQLGPVIKSRLAMAYGLNVSFLERLMSRPAYQRDENAFGACGAHNPLLVTKLVKNYRSHEALLMLPSRLFYHRELEVCADPTVVTSLLGWEKLPKKGFPLIFHGVRGSEAREGKSPSWFNPAEAVQVLRYCCLLAHSISSQVSASDIGVITPYRKQVEKIRILLRNVDLMDIKVGSVEEFQGQEYLVIIISTVRSNEDRFEDDRYFLGFLSNSKRFNVAITRPKALLIVLGNPHVLVRDPCFGALLEYSITNGVYMGCDLPPALQSLQNCGEGVADPSYPVVPESTGPEKHQEPS,1211,NP_061868.1.csv,refseq-MOV10L1-NM_018995.2_clinical_seed_0_final,refseq-MOV10L1-NM_018995.2.a2m,Invitae,refseq-MOV10L1-NM_018995.2.npy,1,1211,1211
+NP_061896.1,MKIFCSRANPTTGSVEWLEEDEHYDYHQEIARSSYADMLHDKDRNVKYYQGIRAAVSRVKDRGQKALVLDIGTGTGLLSMMAVTAGADFCYAIEVFKPMADAAVKIVEKNGFSDKIKVINKHSTEVTVGPEGDMPCRANILVTELFDTELIGEGALPSYEHAHRHLVEENCEAVPHRATVYAQLVESGRMWSWNKLFPIHVQTSLGEQVIVPPVDVESCPGAPSVCDIQLNQVSPADFTVLSDVLPMFSIDFSKQVSSSAACHSRRFEPLTSGRAQVVLSWWDIEMDPEGKIKCTMAPFWAHSDPEEMQWRDHWMQCVYFLPQEEPVVQGSALYLVAHHDDYCVWYSLQRTSPEKNERVRQMRPVCDCQAHLLWNRPRFGEINDQDRTDRYVQALRTVLKPDSVCLCVSDGSLLSVLAHHLGVEQVFTVESSAASHKLLRKIFKANHLEDKINIIEKRPELLTNEDLQGRKVSLLLGEPFFTTSLLPWHNLYFWYVRTAVDQHLGPGAMVMPQAASLHAVVVEFRDLWRIRSPCGDCEGFDVHIMDDMIKRALDFRESREAEPHPLWEYPCRSLSEPWQILTFDFQQPVPLQPLCAEGTVELRRPGQSHAAVLWMEYHLTPECTLSTGLLEPADPEGGCCWNPHCKQAVYFFSPAPDPRALLGGPRTVSYAVEFHPDTGDIIMEFRHADTPD,692,NP_061896.1.csv,refseq-PRMT7-NM_019023.2_clinical_seed_0_final,refseq-PRMT7-NM_019023.2.a2m,Invitae,refseq-PRMT7-NM_019023.2.npy,1,692,692
+NP_061905.2,MENWTGRPWLYLLLLLSLPQLCLDQEVLSGHSLQTPTEEGQGPEGVWGPWVQWASCSQPCGVGVQRRSRTCQLPTVQLHPSLPLPPRPPRHPEALLPRGQGPRPQTSPETLPLYRTQSRGRGGPLRGPASHLGREETQEIRAARRSRLRDPIKPGMFGYGRVPFALPLHRNRRHPRSPPRSELSLISSRGEEAIPSPTPRAEPFSANGSPQTELPPTELSVHTPSPQAEPLSPETAQTEVAPRTRPAPLRHHPRAQASGTEPPSPTHSLGEGGFFRASPQPRRPSSQGWASPQVAGRRPDPFPSVPRGRGQQGQGPWGTGGTPHGPRLEPDPQHPGAWLPLLSNGPHASSLWSLFAPSSPIPRCSGESEQLRACSQAPCPPEQPDPRALQCAAFNSQEFMGQLYQWEPFTEVQGSQRCELNCRPRGFRFYVRHTEKVQDGTLCQPGAPDICVAGRCLSPGCDGILGSGRRPDGCGVCGGDDSTCRLVSGNLTDRGGPLGYQKILWIPAGALRLQIAQLRPSSNYLALRGPGGRSIINGNWAVDPPGSYRAGGTVFRYNRPPREEGKGESLSAEGPTTQPVDVYMIFQEENPGVFYQYVISSPPPILENPTPEPPVPQLQPEILRVEPPLAPAPRPARTPGTLQRQVRIPQMPAPPHPRTPLGSPAAYWKRVGHSACSASCGKGVWRPIFLCISRESGEELDERSCAAGARPPASPEPCHGTPCPPYWEAGEWTSCSRSCGPGTQHRQLQCRQEFGGGGSSVPPERCGHLPRPNITQSCQLRLCGHWEVGSPWSQCSVRCGRGQRSRQVRCVGNNGDEVSEQECASGPPQPPSREACDMGPCTTAWFHSDWSSKCSAECGTGIQRRSVVCLGSGAALGPGQGEAGAGTGQSCPTGSRPPDMRACSLGPCERTWRWYTGPWGECSSECGSGTQRRDIICVSKLGTEFNVTSPSNCSHLPRPPALQPCQGQACQDRWFSTPWSPCSRSCQGGTQTREVQCLSTNQTLSTRCPPQLRPSRKRPCNSQPCSQRPDDQCKDSSPHCPLVVQARLCVYPYYTATCCRSCAHVLERSPQDPS,1074,NP_061905.2.csv,refseq-ADAMTSL4-NM_019032.5_clinical_seed_0_final,refseq-ADAMTSL4-NM_019032.5.a2m,Invitae,refseq-ADAMTSL4-NM_019032.5.npy,1,1074,1074
+NP_061923.2,MAWVKFLRKPGGNLGKVYQPGSMLSLAPTKGLLNEPGQNSCFLNSAVQVLWQLDIFRRSLRVLTGHVCQGDACIFCALKTIFAQFQHSREKALPSDNIRHALAESFKDEQRFQLGLMDDAAECFENMLERIHFHIVPSRDADMCTSKSCITHQKFAMTLYEQCVCRSCGASSDPLPFTEFVRYISTTALCNEVERMLERHERFKPEMFAELLQAANTTDDYRKCPSNCGQKIKIRRVLMNCPEIVTIGLVWDSEHSDLTEAVVRNLATHLYLPGLFYRVTDENAKNSELNLVGMICYTSQHYCAFAFHTKSSKWVFFDDANVKEIGTRWKDVVSKCIRCHFQPLLLFYANPDGTAVSTEDALRQVISWSHYKSVAENMGCEKPVIHKSDNLKENGFGDQAKQRENQKFPTDNISSSNRSHSHTGVGKGPAKLSHIDQREKIKDISRECALKAIEQKNLLSSQRKDLEKGQRKDLGRHRDLVDEDLSHFQSGSPPAPNGFKQHGNPHLYHSQGKGSYKHDRVVPQSRASAQIISSSKSQILAPGEKITGKVKSDNGTGYDTDSSQDSRDRGNSCDSSSKSRNRGWKPMRETLNVDSIFSESEKRQHSPRHKPNISNKPKSSKDPSFSNWPKENPKQKGLMTIYEDEMKQEIGSRSSLESNGKGAEKNKGLVEGKVHGDNWQMQRTESGYESSDHISNGSTNLDSPVIDGNGTVMDISGVKETVCFSDQITTSNLNKERGDCTSLQSQHHLEGFRKELRNLEAGYKSHEFHPESHLQIKNHLIKRSHVHEDNGKLFPSSSLQIPKDHNAREHIHQSDEQKLEKPNECKFSEWLNIENSERTGLPFHVDNSASGKRVNSNEPSSLWSSHLRTVGLKPETAPLIQQQNIMDQCYFENSLSTECIIRSASRSDGCQMPKLFCQNLPPPLPPKKYAITSVPQSEKSESTPDVKLTEVFKATSHLPKHSLSTASEPSLEVSTHMNDERHKETFQVRECFGNTPNCPSSSSTNDFQANSGAIDAFCQPELDSISTCPNETVSLTTYFSVDSCMTDTYRLKYHQRPKLSFPESSGFCNNSLS,1073,NP_061923.2.csv,refseq-USP53-NM_019050.2_clinical_seed_0_final,refseq-USP53-NM_019050.2.a2m,Invitae,refseq-USP53-NM_019050.2.npy,1,1073,1073
+NP_061928.4,MGSGGDSLLGGRGSLPLLLLLIMGGMAQDSPPQILVHPQDQLFQGPGPARMSCQASGQPPPTIRWLLNGQPLSMVPPDPHHLLPDGTLLLLQPPARGHAHDGQALSTDLGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDMVAVVGEQFTLECGPPWGHPEPTVSWWKDGKPLALQPGRHTVSGGSLLMARAEKSDEGTYMCVATNSAGHRESRAARVSIQEPQDYTEPVELLAVRIQLENVTLLNPDPAEGPKPRPAVWLSWKVSGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEFKVRPSSGRARGPDSNVLLLRLPEKVPSAPPQEVTLKPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPPANWTVVGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMERATQEPSEHGPWTLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMDHSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTSTFYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSSPCSSSDSLCSRRGLSSPRLSLAPAEAWKAKKKQELQHANSSPLLRGSHSLELRACELGNRGSKNLSQSPGAVPQALVAWRALGPKLLSSSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPSSILLPAAPIPILSPCSPPSPQASSLSGPSPASSRLSSSSLSSLGEDQDSVLTPEEVALCLELSEGEETPRNSVSPMPRAPSPPTTYGYISVPTASEFTDMGRTGGGVGPKGGVLLCPPRPCLTPTPSEGSLANGWGSASEDNAASARASLVSSSDGSFLADAHFARALAVAVDSFGFGLEPREADCVFIDASSPPSPRDEIFLTPNLSLPLWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQLHCRMPKAGASPVDYS,1007,NP_061928.4.csv,refseq-ROBO4-NM_019055.5_clinical_seed_0_final,refseq-ROBO4-NM_019055.5.a2m,Invitae,refseq-ROBO4-NM_019055.5.npy,1,1007,1007
+NP_061929.2,MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTVVAPSAVAGKRPPEPTTPWQEDPEPEDENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVSIILVLGSTFVAYLPDYRCTGCPRAWDGMKEWSRREAERLVKYREANGLPIMESNCFDPSKIQLPEDE,163,NP_061929.2.csv,refseq-NDUFB11-NM_019056.6_clinical_seed_0_final,refseq-NDUFB11-NM_019056.6.a2m,Invitae,refseq-NDUFB11-NM_019056.6.npy,1,163,163
+NP_061939.3,MSQLSKNLGDSSPPAEAPKPPVYSRPTVLMRAPPASSRAPPVPWDPPPIDLQASLAAWQAPQPAWEAPQGQLPAPVVPMTQPPALGGPIVPAPPLGGPMGKPPTPGVLMVHPPPPGAPMAQPPTPGVLMVHPSAPGAPMAHPPPPGTPMSHPPPPGTPMAHPPPPGTPMAHPPPPGTPMVHPPPPGTPMAHPPPPGTPMAHPPPPGTPMAHPPPPGTPMAHPPPPGTPMAQPPAPGVLMAQPLTPGVLMVQPAAPGAPMVQPPPAAMMTQPQPSGAPMAKPPGPGVLMIHPPGARAPMTQPPASGAPMAQPAAPPAQPMAPPAQPMASWAPQAQPLILQIQSQVIRAPPQVPQGPQAPPAQLATPPGWQATSPGWQATQQGWQATPLTWQTTQVTWQAPAVTWQVPPPMRQGPPPIRPGPPPIRPGPPPVRQAPPLIRQAPPVIRQAPPVIRQAPPVIRQAPAVIRQAPPVIRQAPPVIRQAPPVIRQAPPLIRQAPPPIRPAPQVLATQPPLWQALPPPPPLRQAPQARLPAPQVQAAPQVPTAPPATQVPAAPPAGPQVPQPVLPAPLSAPLSAPQAVHCPSIIWQAPKGQPPVPHEIPTSMEFQEVQQTQALAWQAQKAPTHIWQPLPAQEAQRQAPPLVQLEQPFQGAPPSQKAVQIQLPPQQAQASGPQAEVPTLPLQPSWQAPPAVLQAQPGPPVAAANFPLGSAKSLMTPSGECRASSIDRRGSSKERRTSSKERRAPSKDRMIFAATFCAPKAVSAARAHLPAAWKNLPATPETFAPSSSVFPATSQFQPASLNAFKGPSAASETPKSLPYALQDPFACVEALPAVPWVPQPNMNASKASQAVPTFLMATAAAPQATATTQEASKTSVEPPRRSGKATRKKKHLEAQEDSRGHTLAFHDWQGPRPWENLNLSDWEVQSPIQVSGDWEHPNTPRGLSGWEGPSTSRILSGWEGPSASWALSAWEGPSTSRALGLSESPGSSLPVVVSEVASVSPGSSATQDNSKVEAQPLSPLDERANALVQFLLVKDQAKVPVQRSEMVKVILREYKDECLDIINRANNKLECAFGYQLKEIDTKNHAYIIINKLGYHTGNLVASYLDRPKFGLLMVVLSLIFMKGNCVREDLIFNFLFKLGLDVRETNGLFGNTKKLITEVFVRQKYLEYRRIPYTEPAEYEFLWGPRAFLETSKMLVLRFLAKLHKKDPQSWPFHYLEALAECEWEDTDEDEPDTGDSAHGPTSRPPPR,1249,NP_061939.3.csv,refseq-MAGEL2-NM_019066.4_clinical_seed_0_final,refseq-MAGEL2-NM_019066.4.a2m,Invitae,refseq-MAGEL2-NM_019066.4.npy,1,1249,1249
+NP_061947.1,MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRVCLKHFQAVVSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIEAWHAPGDDLRPEALPPDALISKIAIQGSLAVGQNWLLDEQTSTLTRLRYSYRVICSDNYYGDNCSRLCKKRNDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECLCRPGWQGRLCNECIPHNGCRHGTCSTPWQCTCDEGWGGLFCDQDLNYCTHHSPCKNGATCSNSGQRSYTCTCRPGYTGVDCELELSECDSNPCRNGGSCKDQEDGYHCLCPPGYYGLHCEHSTLSCADSPCFNGGSCRERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNRGPSRMCRCRPGFTGTYCELHVSDCARNPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTSIDACASSPCFNRATCYTDLSTDTFVCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAVLLVLLGMVAVAVRQLRLRRPDDGSREAMNNLSDFQKDNLIPAAQLKNTNQKKELEVDCGLDKSNCGKQQNHTLDYNLAPGPLGRGTMPGKFPHSDKSLGEKAPLRLHSEKPECRISAICSPRDSMYQSVCLISEERNECVIATEV,685,NP_061947.1.csv,refseq-DLL4-NM_019074.3_clinical_seed_0_final,refseq-DLL4-NM_019074.3.a2m,Invitae,refseq-DLL4-NM_019074.3.npy,1,685,685
+NP_061949.3,MARTGWTSPIPLCVSLLLTCGFAEAGKLLVVPMDGSHWFTMQSVVEKLILRGHEVVVVMPEVSWQLGKSLNCTVKTYSTSYTLEDLDREFMDFADAQWKAQVRSLFSLFLSSSNGFFNLFFSHCRSLFNDRKLVEYLKESSFDAVFLDPFDACGLIVAKYFSLPSVVFARGIACHYLEEGAQCPAPLSYVPRILLGFSDAMTFKERVRNHIMHLEEHLFCQYFSKNALEIASEILQTPVTAYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGINCHQGKPLPMEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,530,NP_061949.3.csv,refseq-UGT1A8-NM_019076.4_clinical_seed_0_final,refseq-UGT1A8-NM_019076.4.a2m,Invitae,refseq-UGT1A8-NM_019076.4.npy,1,530,530
+NP_061950.2,MARAGWTGLLPLYVCLLLTCGFAKAGKLLVVPMDGSHWFTMQSVVEKLILRGHEVVVVMPEVSWQLGRSLNCTVKTYSTSYTLEDQDREFMVFADARWTAPLRSAFSLLTSSSNGIFDLFFSNCRSLFNDRKLVEYLKESCFDAVFLDPFDACGLIVAKYFSLPSVVFARGIFCHYLEEGAQCPAPLSYVPRLLLGFSDAMTFKERVWNHIMHLEEHLFCPYFFKNVLEIASEILQTPVTAYDLYSHTSIWLLRTDFVLEYPKPVMPNMIFIGGINCHQGKPVPMEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,530,NP_061950.2.csv,refseq-UGT1A7-NM_019077.2_clinical_seed_0_final,refseq-UGT1A7-NM_019077.2.a2m,Invitae,refseq-UGT1A7-NM_019077.2.npy,1,530,530
+NP_061951.1,MATGLQVPLPQLATGLLLLLSVQPWAESGKVLVVPTDGSHWLSMREALRDLHARGHQVVVLTLEVNMYIKEENFFTLTTYAISWTQDEFDRLLLGHTQSFFETEHLLMKFSRRMAIMNNMSLIIHRSCVELLHNEALIRHLHATSFDVVLTDPFHLCAAVLAKYLSIPAVFFLRNIPCDLDFKGTQCPNPSSYIPRLLTTNSDHMTFLQRVKNMLYPLALSYLCHAVSAPYASLASELFQREVSVVDLVSHASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,534,NP_061951.1.csv,refseq-UGT1A5-NM_019078.1_clinical_seed_0_final,refseq-UGT1A5-NM_019078.1.a2m,Invitae,refseq-UGT1A5-NM_019078.1.npy,1,534,534
+NP_061966.1,MATGLQVPLPWLATGLLLLLSVQPWAESGKVLVVPIDGSHWLSMREVLRELHARGHQAVVLTPEVNMHIKEENFFTLTTYAISWTQDEFDRHVLGHTQLYFETEHFLKKFFRSMAMLNNMSLVYHRSCVELLHNEALIRHLNATSFDVVLTDPVNLCAAVLAKYLSIPTVFFLRNIPCDLDFKGTQCPNPSSYIPRLLTTNSDHMTFMQRVKNMLYPLALSYICHAFSAPYASLASELFQREVSVVDILSHASVWLFRGDFVMDYPRPIMPNMVFIGGINCANRKPLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH,534,NP_061966.1.csv,refseq-UGT1A3-NM_019093.2_clinical_seed_0_final,refseq-UGT1A3-NM_019093.2.a2m,Invitae,refseq-UGT1A3-NM_019093.2.npy,1,534,534
+NP_061971.3,MFKSLTKVNKVKPIGENNENEQSSRRNEEGSHPSNQSQQTTAQEENKGEEKSLKTKSTPVTSEEPHTNIQDKLSKKNSSGDLTTNPDPQNAAEPTGTVPEQKEMDPGKEGPNSPQNKPPAAPVINEYADAQLHNLVKRMRQRTALYKKKLVEGDLSSPEASPQTAKPTAVPPVKESDDKPTEHYYRLLWFKVKKMPLTEYLKRIKLPNSIDSYTDRLYLLWLLLVTLAYNWNCCFIPLRLVFPYQTADNIHYWLIADIICDIIYLYDMLFIQPRLQFVRGGDIIVDSNELRKHYRTSTKFQLDVASIIPFDICYLFFGFNPMFRANRMLKYTSFFEFNHHLESIMDKAYIYRVIRTTGYLLFILHINACVYYWASNYEGIGTTRWVYDGEGNEYLRCYYWAVRTLITIGGLPEPQTLFEIVFQLLNFFSGVFVFSSLIGQMRDVIGAATANQNYFRACMDDTIAYMNNYSIPKLVQKRVRTWYEYTWDSQRMLDESDLLKTLPTTVQLALAIDVNFSIISKVDLFKGCDTQMIYDMLLRLKSVLYLPGDFVCKKGEIGKEMYIIKHGEVQVLGGPDGTKVLVTLKAGSVFGEISLLAAGGGNRRTANVVAHGFANLLTLDKKTLQEILVHYPDSERILMKKARVLLKQKAKTAEATPPRKDLALLFPPKEETPKLFKTLLGGTGKASLARLLKLKREQAAQKKENSEGGEEEGKENEDKQKENEDKQKENEDKGKENEDKDKGREPEEKPLDRPECTASPIAVEEEPHSVRRTVLPRGTSRQSLIISMAPSAEGGEEVLTIEVKEKAKQ,809,NP_061971.3.csv,refseq-CNGB3-NM_019098.4_clinical_seed_0_final,refseq-CNGB3-NM_019098.4.a2m,Invitae,refseq-CNGB3-NM_019098.4.npy,1,809,809
+NP_061982.3,MAASCLVLLALCLLLPLLLLGGWKRWRRGRAARHVVAVVLGDVGRSPRMQYHALSLAMHGFSVTLLGFCNSKPHDELLQNNRIQIVGLTELQSLAVGPRVFQYGVKVVLQAMYLLWKLMWREPGAYIFLQNPPGLPSIAVCWFVGCLCGSKLVIDWHNYGYSIMGLVHGPNHPLVLLAKWYEKFFGRLSHLNLCVTNAMREDLADNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPVTERSAFTERDAGSGLVTRLRERPALLVSSTSWTEDEDFSILLAALEKFEQLTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVDMFGCCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT,464,NP_061982.3.csv,refseq-ALG1-NM_019109.4_clinical_seed_0_final,refseq-ALG1-NM_019109.4.a2m,Invitae,refseq-ALG1-NM_019109.4.npy,1,464,464
+NP_063916.1,MRIQDPGKVFFGNVDSSGIKHNIFNPPIIARYIRLHPTHYSIRSTLRMELMGCDLNSCSMPLGMESKAISDAQITASSYFTNMFATWSPSKARLHLQGRSNAWRPQVNNPKEWLQVDFQKTMKVTGVTTQGVKSLLTSMYVKEFLISSSQDGHQWTLFFQNGKVKVFQGNQDSFTPVVNSLDPPLLTRYLRIHPQSWVHQIALRMEVLGCEAQDLY,216,NP_063916.1.csv,refseq-F8-NM_019863.3_clinical_seed_0_final,refseq-F8-NM_019863.3.a2m,Invitae,refseq-F8-NM_019863.3.npy,1,216,216
+NP_063938.1,MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNSIGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV,512,NP_063938.1.csv,refseq-CYP26B1-NM_019885.3_clinical_seed_0_final,refseq-CYP26B1-NM_019885.3.a2m,Invitae,refseq-CYP26B1-NM_019885.3.npy,1,512,512
+NP_063940.1,MAALKSWLSRSVTSFFRYRQCLCVPVVANFKKRCFSELIRPWHKTVTIGFGVTLCAVPIAQKSEPHSLSSEALMRRAVSLVTDSTSTFLSQTTYALIEAITEYTKAVYTLTSLYRQYTSLLGKMNSEEEDEVWQVIIGARAEMTSKHQEYLKLETTWMTAVGLSEMAAEAAYQTGADQASITARNHIQLVKLQVEEVHQLSRKAETKLAEAQIEELRQKTQEEGEERAESEQEAYLRED,239,NP_063940.1.csv,refseq-DIABLO-NM_019887.5_clinical_seed_0_final,refseq-DIABLO-NM_019887.5.a2m,Invitae,refseq-DIABLO-NM_019887.5.npy,1,239,239
+NP_063945.2,MPSKAENLRPSEPAPQPPEGRTLQGQLPGAPPAQRAGSPPDAPGSESPALACSTPATPSGEDPPARAAPIAPRPPARPRLERALSLDDKGWRRRRFRGSQEDLEARNGTSPSRGSVQSEGPGAPAHSCSPPCLSTSLQEIPKSRGVLSSERGSPSSGGNPLSGVASSSPNLPHRDAAVAGSSPRLPSLLPPRPPPALSLDIASDSLRTANKVDSDLADYKLRAQPLLVRAHSSLGPGRPRSPLACDDCSLRSAKSSFSLLAPIRSKDVRSRSYLEGSLLASGALLGADELARYFPDRNVALFVATWNMQGQKELPPSLDEFLLPAEADYAQDLYVIGVQEGCSDRREWETRLQETLGPHYVLLSSAAHGVLYMSLFIRRDLIWFCSEVECSTVTTRIVSQIKTKGALGISFTFFGTSFLFITSHFTSGDGKVAERLLDYTRTVQALVLPRNVPDTNPYRSSAADVTTRFDEVFWFGDFNFRLSGGRTVVDALLCQGLVVDVPALLQHDQLIREMRKGSIFKGFQEPDIHFLPSYKFDIGKDTYDSTSKQRTPSYTDRVLYRSRHKGDICPVSYSSCPGIKTSDHRPVYGLFRVKVRPGRDNIPLAAGKFDRELYLLGIKRRISKEIQRQQALQSQNSSTICSVS,644,NP_063945.2.csv,refseq-INPP5E-NM_019892.4_clinical_seed_0_final,refseq-INPP5E-NM_019892.4.a2m,Invitae,refseq-INPP5E-NM_019892.4.npy,1,644,644
+NP_064425.2,MARKQNRNSKELGLVPLTDDTSHAGPPGPGRALLECDHLRSGVPGGRRRKDWSCSLLVASLAGAFGSSFLYGYNLSVVNAPTPYIKAFYNESWERRHGRPIDPDTLTLLWSVTVSIFAIGGLVGTLIVKMIGKVLGRKHTLLANNGFAISAALLMACSLQAGAFEMLIVGRFIMGIDGGVALSVLPMYLSEISPKEIRGSLGQVTAIFICIGVFTGQLLGLPELLGKESTWPYLFGVIVVPAVVQLLSLPFLPDSPRYLLLEKHNEARAVKAFQTFLGKADVSQEVEEVLAESRVQRSIRLVSVLELLRAPYVRWQVVTVIVTMACYQLCGLNAIWFYTNSIFGKAGIPPAKIPYVTLSTGGIETLAAVFSGLVIEHLGRRPLLIGGFGLMGLFFGTLTITLTLQDHAPWVPYLSIVGILAIIASFCSGPGGIPFILTGEFFQQSQRPAAFIIAGTVNWLSNFAVGLLFPFIQKSLDTYCFLVFATICITGAIYLYFVLPETKNRTYAEISQAFSKRNKAYPPEEKIDSAVTDGKINGRP,540,NP_064425.2.csv,refseq-SLC2A9-NM_020041.2_clinical_seed_0_final,refseq-SLC2A9-NM_020041.2.a2m,Invitae,refseq-SLC2A9-NM_020041.2.npy,1,540,540
+NP_064445.2,MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPNYHIAPRWVYHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISIVNQVSGYFVLGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAFSWIWAAVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCIIPLAIIMLCYLQVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMIFAYCVCWGPYTFFACFAAANPGYAFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCILQLFGKKVDDGSELSSASKTEVSSVSSVSPA,364,NP_064445.2.csv,NP_064445.2_colabfold_clinical_seed_0_final,NP_064445.2_colabfold.a2m,colabfold,NP_064445.2_colabfold_theta_0.2.npy,1,364,364
+NP_064450.3,MGNQDGKLKRSAGDALHEGGGGAEDALGPRDVEATKKGSGGKKALGKHGKGGGGGGGGGESGKKKSKSDSRASVFSNLRIRKNLSKGKGAGGSREDVLDSQALQTGELDSAHSLLTKTPDLSLSADEAGLSDTECADPFEVTGPGGPGPAEARVGGRPIAEDVETAAGAQDGQRTSSGSDTDIYSFHSATEQEDLLSDIQQAIRLQQQQQQQLQLQLQQQQQQQQLQGAEEPAAPPTAVSPQPGAFLGLDRFLLGPSGGAGEAPGSPDTEQALSALSDLPESLAAEPREPQQPPSPGGLPVSEAPSLPAAQPAAKDSPSSTAFPFPEAGPGEEAAGAPVRGAGDTDEEGEEDAFEDAPRGSPGEEWAPEVGEDAPQRLGEEPEEEAQGPDAPAAASLPGSPAPSQRCFKPYPLITPCYIKTTTRQLSSPNHSPSQSPNQSPRIKRRPEPSLSRGSRTALASVAAPAKKHRADGGLAAGLSRSADWTEELGARTPRVGGSAHLLERGVASDSGGGVSPALAAKASGAPAAADGFQNVFTGRTLLEKLFSQQENGPPEEAEKFCSRIIAMGLLLPFSDCFREPCNQNAQTNAASFDQDQLYTWAAVSQPTHSLDYSEGQFPRRVPSMGPPSKPPDEEHRLEDAETESQSAVSETPQKRSDAVQKEVVDMKSEGQATVIQQLEQTIEDLRTKIAELERQYPALDTEVASGHQGLENGVTASGDVCLEALRLEEKEVRHHRILEAKSIQTSPTEEGGVLTLPPVDGLPGRPPCPPGAESGPQTKFCSEISLIVSPRRISVQLDSHQPTQSISQPPPPPSLLWSAGQGQPGSQPPHSISTEFQTSHEHSVSSAFKNSCNIPSPPPLPCTESSSSMPGLGMVPPPPPPLPGMTVPTLPSTAIPQPPPLQGTEMLPPPPPPLPGAGIPPPPPLPGAGILPLPPLPGAGIPPPPPLPGAAIPPPPPLPGAGIPLPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGVGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPRVGIPPPPPLPGAGIPPPPPLPGAGIPPPPPLPGVGIPPPPPLPGVGIPPPPPLPGAGIPPPPPLPGMGIPPAPAPPLPPPGTGIPPPPLLPVSGPPLLPQVGSSTLPTPQVCGFLPPPLPSGLFGLGMNQDKGSRKQPIEPCRPMKPLYWTRIQLHSKRDSSTSLIWEKIEEPSIDCHEFEELFSKTAVKERKKPISDTISKTKAKQVVKLLSNKRSQAVGILMSSLHLDMKDIQHAVVNLDNSVVDLETLQALYENRAQSDELEKIEKHGRSSKDKENAKSLDKPEQFLYELSLIPNFSERVFCILFQSTFSESICSIRRKLELLQKLCETLKNGPGVMQVLGLVLAFGNYMNGGNKTRGQADGFGLDILPKLKDVKSSDNSRSLLSYIVSYYLRNFDEDAGKEQCLFPLPEPQDLFQASQMKFEDFQKDLRKLKKDLKACEVEAGKVYQVSSKEHMQPFKENMEQFIIQAKIDQEAEENSLTETHKCFLETTAYFFMKPKLGEKEVSPNAFFSIWHEFSSDFKDFWKKENKLLLQERVKEAEEVCRQKKGKSLYKIKPRHDSGIKAKISMKT,1722,NP_064450.3.csv,refseq-FMN2-NM_020066.4_clinical_seed_0_final,refseq-FMN2-NM_020066.4.a2m,Invitae,refseq-FMN2-NM_020066.4.npy,1,1722,1722
+NP_064502.9,MAERKGTAKVDFLKKIEKEIQQKWDTERVFEVNASNLEKQTSKGKYFVTFPYPYMNGRLHLGHTFSLSKCEFAVGYQRLKGKCCLFPFGLHCTGMPIKACADKLKREIELYGCPPDFPDEEEEEEETSVKTEDIIIKDKAKGKKSKAAAKAGSSKYQWGIMKSLGLSDEEIVKFSEAEHWLDYFPPLAIQDLKRMGLKVDWRRSFITTDVNPYYDSFVRWQFLTLRERNKIKFGKRYTIYSPKDGQPCMDHDRQTGEGVGPQEYTLLKLKVLEPYPSKLSGLKGKNIFLVAATLRPETMFGQTNCWVRPDMKYIGFETVNGDIFICTQKAARNMSYQGFTKDNGVVPVVKELMGEEILGASLSAPLTSYKVIYVLPMLTIKEDKGTGVVTSVPSDSPDDIAALRDLKKKQALRAKYGIRDDMVLPFEPVPVIEIPGFGNLSAVTICDELKIQSQNDREKLAEAKEKIYLKGFYEGIMLVDGFKGQKVQDVKKTIQKKMIDAGDALIYMEPEKQVMSRSSDECVVALCDQWYLDYGEENWKKQTSQCLKNLETFCEETRRNFEATLGWLQEHACSRTYGLGTHLPWDEQWLIESLSDSTIYMAFYTVAHLLQGGNLHGQAESPLGIRPQQMTKEVWDYVFFKEAPFPKTQIAKEKLDQLKQEFEFWYPVDLRVSGKDLVPNHLSYYLYNHVAMWPEQSDKWPTAVRANGHLLLNSEKMSKSTGNFLTLTQAIDKFSADGMRLALADAGDTVEDANFVEAMADAGILRLYTWVEWVKEMVANWDSLRSGPASTFNDRVFASELNAGIIKTDQNYEKMMFKEALKTGFFEFQAAKDKYRELAVEGMHRELVFRFIEVQTLLLAPFCPHLCEHIWTLLGKPDSIMNASWPVAGPVNEVLIHSSQYLMEVTHDLRLRLKNYMMPAKGKKTDKQPLQKPSHCTIYVAKNYPPWQHTTLSVLRKHFEANNGKLPDNKVIASELGSMPELKKYMKKVMPFVAMIKENLEKMGPRILDLQLEFDEKAVLMENIVYLTNSLELEHIEVKFASEAEDKIREDCCPGKPLNVFRIEPGVSVSLVNPQPSNGHFSTKIEIRQGDNCDSIIRRLMKMNRGIKDLSKVKLMRFDDPLLGPRRVPVLGKEYTEKTPISEHAVFNVDLMSKKIHLTENGIRVDIGDTIIYLVH,1176,NP_064502.9.csv,refseq-LARS-NM_020117.10_clinical_seed_0_final,refseq-LARS-NM_020117.10.a2m,Invitae,refseq-LARS-NM_020117.10.npy,1,1176,1176
+NP_064543.3,MEEETHTDAKIRAENGTGSSPRGPGCSLRHFACEQNLLSRPDGSASFLQGDTSVLAGVYGPAEVKVSKEIFNKATLEVILRPKIGLPGVAEKSRERLIRNTCEAVVLGTLHPRTSITVVLQVVSDAGSLLACCLNAACMALVDAGVPMRALFCGVACALDSDGTLVLDPTSKQEKEARAVLTFALDSVERKLLMSSTKGLYSDTELQQCLAAAQAASQHVFRFYRESLQRRYSKS,235,NP_064543.3.csv,refseq-EXOSC5-NM_020158.3_clinical_seed_0_final,refseq-EXOSC5-NM_020158.3.a2m,Invitae,refseq-EXOSC5-NM_020158.3.npy,1,235,235
+NP_064551.3,MAAASAVSVLLVAAERNRWHRLPSLLLPPRTWVWRQRTMKYTTATGRNITKVLIANRGEIACRVMRTAKKLGVQTVAVYSEADRNSMHVDMADEAYSIGPAPSQQSYLSMEKIIQVAKTSAAQAIHPGCGFLSENMEFAELCKQEGIIFIGPPPSAIRDMGIKSTSKSIMAAAGVPVVEGYHGEDQSDQCLKEHARRIGYPVMIKAVRGGGGKGMRIVRSEQEFQEQLESARREAKKSFNDDAMLIEKFVDTPRHVEVQVFGDHHGNAVYLFERDCSVQRRHQKIIEEAPAPGIKSEVRKKLGEAAVRAAKAVNYVGAGTVEFIMDSKHNFCFMEMNTRLQVEHPVTEMITGTDLVEWQLRIAAGEKIPLSQEEITLQGHAFEARIYAEDPSNNFMPVAGPLVHLSTPRADPSTRIETGVRQGDEVSVHYDPMIAKLVVWAADRQAALTKLRYSLRQYNIVGLHTNIDFLLNLSGHPEFEAGNVHTDFIPQHHKQLLLSRKAAAKESLCQAALGLILKEKAMTDTFTLQAHDQFSPFSSSSGRRLNISYTRNMTLKDGKNNVAIAVTYNHDGSYSMQIEDKTFQVLGNLYSEGDCTYLKCSVNGVASKAKLIILENTIYLFSKEGSIEIDIPVPKYLSSVSSQETQGGPLAPMTGTIEKVFVKAGDKVKAGDSLMVMIAMKMEHTIKSPKDGTVKKVFYREGAQANRHTPLVEFEEEESDKRESE,725,NP_064551.3.csv,refseq-MCCC1-NM_020166.4_clinical_seed_0_final,refseq-MCCC1-NM_020166.4.a2m,Invitae,refseq-MCCC1-NM_020166.4.npy,1,725,725
+NP_064569.3,MAPVGGGGRPVGGPARGRLLLAAPVLLVLLWALGARGQGSPQQGTIVGMRLASCNKSCGTNPDGIIFVSEGSTVNLRLYGYSLGNISSNLISFTEVDDAETLHKSTSCLELTKDLVVQQLVNVSRGNTSGVLVVLTKFLRRSESMKLYALCTRAQPDGPWLKWTDKDSLLFMVEEPGRFLPLWLHILLITVLLVLSGIFSGLNLGLMALDPMELRIVQNCGTEKERRYARKIEPIRRKGNYLLCSLLLGNVLVNTSLTILLDNLIGSGLMAVASSTIGIVIFGEILPQALCSRHGLAVGANTILLTKFFMLLTFPLSFPISKLLDFFLGQEIRTVYNREKLMEMLKVTEPYNDLVKEELNMIQGALELRTKTVEDIMTQLQDCFMIRSDAILDFNTMSEIMESGYTRIPVFEDEQSNIVDILYVKDLAFVDPDDCTPLKTITRFYNHPVHFVFHDTKLDAMLEEFKKGKSHLAIVQKVNNEGEGDPFYEVLGLVTLEDVIEEIIKSEILDESDMYTDNRSRKRVSEKNKRDFSAFKDADNELKVKISPQLLLAAHRFLATEVSQFSPSLISEKILLRLLKYPDVIQELKFDEHNKYYARHYLYTRNKPADYFILILQGKVEVEAGKENMKFETGAFSYYGTMALTSVPSDRSPAHPTPLSRSASLSYPDRTDVSTAATLAGSSNQFGSSVLGQYISDFSVRALVDLQYIKITRQQYQNGLLASRMENSPQFPIDGCTTHMENLAEKSELPVVDETTTLLNERNSLLHKASHENAI,775,NP_064569.3.csv,refseq-CNNM4-NM_020184.3_clinical_seed_0_final,refseq-CNNM4-NM_020184.3.a2m,Invitae,refseq-CNNM4-NM_020184.3.npy,1,775,775
+NP_064576.1,MAPLGTTVLLWSLLRSSPGVERVCFRARIQPWHGGLLQPLPCSFEMGLPRRRFSSEAAESGSPETKKPTFMDEEVQSILTKMTGLNLQKTFKPAIQELKPPTYKLMTQAQLEEATRQAVEAAKVRLKMPPVLEERVPINDVLAEDKILEGTETTKYVFTDISYSIPHRERFIVVREPSGTLRKASWEERDRMIQVYFPKEGRKILTPIIFKEENLRTMYSQDRHVDVLNLCFAQFEPDSTEYIKVHHKTYEDIDKRGKYDLLRSTRYFGGMVWYFVNNKKIDGLLIDQIQRDLIDDATNLVQLYHVLHPDGQSAQGAKDQAAEGINLIKVFAKTEAQKGAYIELTLQTYQEALSRHSAAS,360,NP_064576.1.csv,refseq-MRPS22-NM_020191.2_clinical_seed_0_final,refseq-MRPS22-NM_020191.2.a2m,Invitae,refseq-MRPS22-NM_020191.2.npy,1,360,360
+NP_064608.2,MKMMLVRRFRVLILMVFLVACALHIALDLLPRLERRGARPSGEPGCSCAQPAAEVAAPGWAQVRGRPGEPPAASSAAGDAGWPNKHTLRILQDFSSDPSSNLSSHSLEKLPPAAEPAERALRGRDPGALRPHDPAHRPLLRDPGPRRSESPPGPGGDASLLARLFEHPLYRVAVPPLTEEDVLFNVNSDTRLSPKAAENPDWPHAGAEGAEFLSPGEAAVDSYPNWLKFHIGINRYELYSRHNPAIEALLHDLSSQRITSVAMKSGGTQLKLIMTFQNYGQALFKPMKQTREQETPPDFFYFSDYERHNAEIAAFHLDRILDFRRVPPVAGRMVNMTKEIRDVTRDKKLWRTFFISPANNICFYGECSYYCSTEHALCGKPDQIEGSLAAFLPDLSLAKRKTWRNPWRRSYHKRKKAEWEVDPDYCEEVKQTPPYDSSHRILDVMDMTIFDFLMGNMDRHHYETFEKFGNETFIIHLDNGRGFGKYSHDELSILVPLQQCCRIRKSTYLRLQLLAKEEYKLSLLMAESLRGDQVAPVLYQPHLEALDRRLRVVLKAVRDCVERNGLHSVVDDDLDTEHRAASAR,584,NP_064608.2.csv,refseq-FAM20C-NM_020223.3_clinical_seed_0_final,refseq-FAM20C-NM_020223.3.a2m,Invitae,refseq-FAM20C-NM_020223.3.npy,1,584,584
+NP_064612.2,MSPEKSQEESPEEDTERTERKPMVKDAFKDISIYFTKEEWAEMGDWEKTRYRNVKRNYNALITIGLRATRPAFMCHRRQAIKLQVDDTEDSDEEWTPRQQVKPPWMALRVEQRKHQKGMPKASFSNESSLKELSRTANLLNASGSEQAQKPVSPSGEASTSGQHSRLKLELRKKETERKMYSLRERKGHAYKEVSEPQDDDYLYCEMCQNFFIDSCAAHGPPTFVKDSAVDKGHPNRSALSLPPGLRIGPSGIPQAGLGVWNEASDLPLGLHFGPYEGRITEDEEAANNGYSWLITKGRNCYEYVDGKDKSWANWMRYVNCARDDEEQNLVAFQYHRQIFYRTCRVIRPGCELLVWYGDEYGQELGIKWGSKWKKELMAGREPKPEIHPCPSCCLAFSSQKFLSQHVERNHSSQNFPGPSARKLLQPENPCPGDQNQEQQYPDPHSRNDKTKGQEIKERSKLLNKRTWQREISRAFSSPPKGQMGSCRVGKRIMEEESRTGQKVNPGNTGKLFVGVGISRIAKVKYGECGQGFSVKSDVITHQRTHTGEKLYVCRECGRGFSWKSHLLIHQRIHTGEKPYVCRECGRGFSWQSVLLTHQRTHTGEKPYVCRECGRGFSRQSVLLTHQRRHTGEKPYVCRECGRGFSRQSVLLTHQRRHTGEKPYVCRECGRGFSWQSVLLTHQRTHTGEKPYVCRECGRGFSWQSVLLTHQRTHTGEKPYVCRECGRGFSNKSHLLRHQRTHTGEKPYVCRECGRGFRDKSHLLRHQRTHTGEKPYVCRECGRGFRDKSNLLSHQRTHTGEKPYVCRECGRGFSNKSHLLRHQRTHTGEKPYVCRECGRGFRNKSHLLRHQRTHTGEKPYVCRECGRGFSDRSSLCYHQRTHTGEKPYVCREDE,894,NP_064612.2.csv,refseq-PRDM9-NM_020227.3_clinical_seed_0_final,refseq-PRDM9-NM_020227.3.a2m,Invitae,refseq-PRDM9-NM_020227.3.npy,1,894,894
+NP_064632.2,MAAILGDTIMVAKGLVKLTQAAVETHLQHLGIGGELIMAARALQSTAVEQIGMFLGKVQGQDKHEEYFAENFGGPEGEFHFSVPHAAGASTDFSSASAPDQSAPPSLGHAHSEGPAPAYVASGPFREAGFPGQASSPLGRANGRLFANPRDSFSAMGFQRRFFHQDQSPVGGLTAEDIEKARQAKARPENKQHKQTLSEHARERKVPVTRIGRLANFGGLAVGLGFGALAEVAKKSLRSEDPSGKKAVLGSSPFLSEANAERIVRTLCKVRGAALKLGQMLSIQDDAFINPHLAKIFERVRQSADFMPLKQMMKTLNNDLGPNWRDKLEYFEERPFAAASIGQVHLARMKGGREVAMKIQYPGVAQSINSDVNNLMAVLNMSNMLPEGLFPEHLIDVLRRELALECDYQREAACARKFRDLLKGHPFFYVPEIVDELCSPHVLTTELVSGFPLDQAEGLSQEIRNEICYNILVLCLRELFEFHFMQTDPNWSNFFYDPQQHKVALLDFGATREYDRSFTDLYIQIIRAAADRDRETVRAKSIEMKFLTGYEVKVMEDAHLDAILILGEAFASDEPFDFGTQSTTEKIHNLIPVMLRHRLVPPPEETYSLHRKMGGSFLICSKLKARFPCKAMFEEAYSNYCKRQAQQ,647,NP_064632.2.csv,refseq-COQ8A-NM_020247.4_clinical_seed_0_final,refseq-COQ8A-NM_020247.4.a2m,Invitae,refseq-COQ8A-NM_020247.4.npy,1,647,647
+NP_064716.2,MACGFRRAIACQLSRVLNLPPENLITSISAVPISQKEEVADFQLSVDSLLEKDNDHSRPDIQVQAKRLAEKLRCDTVVSEISTGQRTVNFKINRELLTKTVLQQVIEDGSKYGLKSELFSGLPQKKIVVEFSSPNVAKKFHVGHLRSTIIGNFIANLKEALGHQVIRINYLGDWGMQFGLLGTGFQLFGYEEKLQSNPLQHLFEVYVQVNKEAADDKSVAKAAQEFFQRLELGDVQALSLWQKFRDLSIEEYIRVYKRLGVYFDEYSGESFYREKSQEVLKLLESKGLLLKTIKGTAVVDLSGNGDPSSICTVMRSDGTSLYATRDLAAAIDRMDKYNFDTMIYVTDKGQKKHFQQVFQMLKIMGYDWAERCQHVPFGVVQGMKTRRGDVTFLEDVLNEIQLRMLQNMASIKTTKELKNPQETAERVGLAALIIQDFKGLLLSDYKFSWDRVFQSRGDTGVFLQYTHARLHSLEETFGCGYLNDFNTACLQEPQSVSILQHLLRFDEVLYKSSQDFQPRHIVSYLLTLSHLAAVAHKTLQIKDSPPEVAGARLHLFKAVRSVLANGMKLLGITPVCRM,578,NP_064716.2.csv,refseq-RARS2-NM_020320.3_clinical_seed_0_final,refseq-RARS2-NM_020320.3.a2m,Invitae,refseq-RARS2-NM_020320.3.npy,1,578,578
+NP_065069.1,MYSEWRSLHLVIQNDQGHTSVLHSYPESVGREVANAVVRPLGQVLGTPSVAGSENLLKTDKEVKWTMEVICYGLTLPLDGETVKYCVDVYTDWIMALVLPKDSIPLPVIKEPNQYVQTILKHLQNLFVPRQEQGSSQIRLCLQVLRAIQKLARESSLMARETWEVLLLFLLQINDILLAPPTVQGGIAENLAEKLIGVLFEVWLLACTRCFPTPPYWKTAKEMVANWRHHPAVVEQWSKVICALTSRLLRFTYGPSFPAFKVPDEDASLIPPEMDNECVAQTWFRFLHMLSNPVDLSNPAIISSTPKFQEQFLNVSGMPQELNQYPCLKHLPQIFFRAMRGISCLVDAFLGISRPRSDSAPPTPVNRLSMPQSAAVSTTPPHNRRHRAVTVNKATMKTSTVSTAHASKVQHQTSSTSPLSSPNQTSSEPRPLPAPRRPKVNSILNLFGSWLFDAAFVHCKLHNGINRDSSMTAITTQASMEFRRKGSQMSTDTMVSNPMFDASEFPDNYEAGRAEACGTLCRIFCSKKTGEEILPAYLSRFYMLLIQGLQINDYVCHPVLASVILNSPPLFCCDLKGIDVVVPYFISALETILPDRELSKFKSYVNPTELRRSSINILLSLLPLPHHFGTVKSEVVLEGKFSNDDSSSYDKPITFLSLKLRLVNILIGALQTETDPNNTQMILGAMLNIVQDSALLEAIGCQMEMGGGENNLKSHSRTNSGISSASGGSTEPTTPDSERPAQALLRDYALNTDSAAGLLIRSIHLVTQRLNSQWRQDMSISLAALELLSGLAKVKVMVDSGDRKRAISSVCTYIVYQCSRPAPLHSRDLHSMIVAAFQCLCVWLTEHPDMLDEKDCLKEVLEIVELGISGSKSKNNEQEVKYKGDKEPNPASMRVKDAAEATLTCIMQLLGAFPSPSGPASPCSLVNETTLIKYSRLPTINKHSFRYFVLDNSVILAMLEQPLGNEQNDFFPSVTVLVRGMSGRLAWAQQLCLLPRGAKANQKLFVPEPRPVPKNDVGFKYSVKHRPFPEEVDKIPFVKADLSIPDLHEIVTEELEERHEKLRSGMAQQIAYEIHLEQQSEEELQKRSFPDPVTDCKPPPPAQEFQTARLFLSHFGFLSLEALKEPANSRLPPHLIALDSTIPGFFDDIGYLDLLPCRPFDTVFIFYMKPGQKTNQEILKNVESSRTVQPHFLEFLLSLGWSVDVGRHPGWTGHVSTSWSINCCDDGEGSQQEVISSEDIGASIFNGQKKVLYYADALTEIAFVVPSPVESLTDSLESNISDQDSDSNMDLMPGILKQPSLTLELFPNHTDNLNSSQRLSPSSRMRKLPQGRPVPPLGPETRVSVVWVERYDDIENFPLSELMTEISTGVETTANSSTSLRSTTLEKEVPVIFIHPLNTGLFRIKIQGATGKFNMVIPLVDGMIVSRRALGFLVRQTVINICRRKRLESDSYSPPHVRRKQKITDIVNKYRNKQLEPEFYTSLFQEVGLKNCSS,1494,NP_065069.1.csv,refseq-RALGAPB-NM_020336.2_clinical_seed_0_final,refseq-RALGAPB-NM_020336.2.a2m,Invitae,refseq-RALGAPB-NM_020336.2.npy,1,1494,1494
+NP_065071.1,MNSMDRHIQQTNDRLQCIKQHLQNPANFHNAATELLDWCGDPRAFQRPFEQSLMGCLTVVSRVAAQQGFDLDLGYRLLAVCAANRDKFTPKSAALLSSWCEELGRLLLLRHQKSRQSDPPGKLPMQPPLSSMSSMKPTLSHSDGSFPYDSVPWQQNTNQPPGSLSVVTTVWGVTNTSQSQVLGNPMANANNPMNPGGNPMASGMTTSNPGLNSPQFAGQQQQFSAKAGPAQPYIQQSMYGRPNYPGSGGFGASYPGGPNAPAGMGIPPHTRPPADFTQPAAAAAAAAVAAAAATATATATATVAALQETQNKDINQYGPMGPTQAYNSQFMNQPGPRGPASMGGSMNPASMAAGMTPSGMSGPPMGMNQPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQNIKRPYPGEPNYGNQQYGPNSQFPTQPGQYPAPNPPRPLTSPNYPGQRMPSQPSSGQYPPPTVNMGQYYKPEQFNGQNNTFSGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQDVKPPFPPDIKPNMSALPPPPANHNDELRLTFPVRDGVVLEPFRLEHNLAVSNHVFHLRPTVHQTLMWRSDLELQFKCYHHEDRQMNTNWPASVQVSVNATPLTIERGDNKTSHKPLHLKHVCQPGRNTIQITVTACCCSHLFVLQLVHRPSVRSVLQGLLKKRLLPAEHCITKIKRNFSSVAASSGNTTLNGEDGVEQTAIKVSLKCPITFRRIQLPARGHDCKHVQCFDLESYLQLNCERGTWRCPVCNKTALLEGLEVDQYMWGILNAIQHSEFEEVTIDPTCSWRPVPIKSDLHIKDDPDGIPSKRFKTMSPSQMIMPNVMEMIAALGPGPSPYPLPPPPGGTNSNDYSSQGNNYQGHGNFDFPHGNPGGTSMNDFMHGPPQLSHPPDMPNNMAALEKPLSHPMQETMPHAGSSDQPHPSIQQGLHVPHPSSQSGPPLHHSGAPPPPPSQPPRQPPQAAPSSHPHSDLTFNPSSALEGQAGAQGASDMPEPSLDLLPELTNPDELLSYLDPPDLPSNSNDDLLSLFENN,1067,NP_065071.1.csv,refseq-ZMIZ1-NM_020338.3_clinical_seed_0_final,refseq-ZMIZ1-NM_020338.3.a2m,Invitae,refseq-ZMIZ1-NM_020338.3.npy,1,1067,1067
+NP_065094.3,MKCLGKRRGQAAAFLPLCWLFLKILQPGHSHLYNNRYAGDKVIRFIPKTEEEAYALKKISYQLKVDLWQPSSISYVSEGTVTDVHIPQNGSRALLAFLQEANIQYKVLIEDLQKTLEKGSSLHTQRNRRSLSGYNYEVYHSLEEIQNWMHHLNKTHSGLIHMFSIGRSYEGRSLFILKLGRRSRLKRAVWIDCGIHAREWIGPAFCQWFVKEALLTYKSDPAMRKMLNHLYFYIMPVFNVDGYHFSWTNDRFWRKTRSRNSRFRCRGVDANRNWKVKWCDEGASMHPCDDTYCGPFPESEPEVKAVANFLRKHRKHIRAYLSFHAYAQMLLYPYSYKYATIPNFRCVESAAYKAVNALQSVYGVRYRYGPASTTLYVSSGSSMDWAYKNGIPYAFAFELRDTGYFGFLLPEMLIKPTCTETMLAVKNITMHLLKKCP,437,NP_065094.3.csv,refseq-CPA6-NM_020361.4_clinical_seed_0_final,refseq-CPA6-NM_020361.4.a2m,Invitae,refseq-CPA6-NM_020361.4.npy,1,437,437
+NP_065098.1,MEFQAVVMAVGGGSRMTDLTSSIPKPLLPVGNKPLIWYPLNLLERVGFEEVIVVTTRDVQKALCAEFKMKMKPDIVCIPDDADMGTADSLRYIYPKLKTDVLVLSCDLITDVALHEVVDLFRAYDASLAMLMRKGQDSIEPVPGQKGKKKAVEQRDFIGVDSTGKRLLFMANEADLDEELVIKGSILQKHPRIRFHTGLVDAHLYCLKKYIVDFLMENGSITSIRSELIPYLVRKQFSSASSQQGQEEKEEDLKKKELKSLDIYSFIKEANTLNLAPYDACWNACRGDRWEDLSRSQVRCYVHIMKEGLCSRVSTLGLYMEANRQVPKLLSALCPEEPPVHSSAQIVSKHLVGVDSLIGPETQIGEKSSIKRSVIGSSCLIKDRVTITNCLLMNSVTVEEGSNIQGSVICNNAVIEKGADIKDCLIGSGQRIEAKAKRVNEVIVGNDQLMEI,452,NP_065098.1.csv,refseq-EIF2B3-NM_020365.4_clinical_seed_0_final,refseq-EIF2B3-NM_020365.4.a2m,Invitae,refseq-EIF2B3-NM_020365.4.npy,1,452,452
+NP_065099.3,MSHLVDPTSGDLPVRDIDAIPLVLPASKGKNMKTQPPLSRMNREELEDSFFRLREDHMLVKELSWKQQDEIKRLRTTLLRLTAAGRDLRVAEEAAPLSETARRGQKAGWRQRLSMHQRPQMHRLQGHFHCVGPASPRRAQPRVQVGHRQLHTAGAPVPEKPKRGPRDRLSYTAPPSFKEHATNENRGEVASKPSELVSGSNSIISFSSVISMAKPIGLCMPNSAHIMASNTMQVEEPPKSPEKMWPKDENFEQRSSLECAQKAAELRASIKEKVELIRLKKLLHERNASLVMTKAQLTEVQEAYETLLQKNQGILSAAHEALLKQVNELRAELKEESKKAVSLKSQLEDVSILQMTLKEFQERVEDLEKERKLLNDNYDKLLESMLDSSDSSSQPHWSNELIAEQLQQQVSQLQDQLDAELEDKRKVLLELSREKAQNEDLKLEVTNILQKHKQEVELLQNAATISQPPDRQSEPATHPAVLQENTQIEPSEPKNQEEKKLSQVLNELQVSHAETTLELEKTRDMLILQRKINVCYQEELEAMMTKADNDNRDHKEKLERLTRLLDLKNNRIKQLEGILRSHDLPTSEQLKDVAYGTRPLSLCLETLPAHGDEDKVDISLLHQGENLFELHIHQAFLTSAALAQAGDTQPTTFCTYSFYDFETHCTPLSVGPQPLYDFTSQYVMETDSLFLHYLQEASARLDIHQAMASEHSTLAAGWICFDRVLETVEKVHGLATLIGAGGEEFGVLEYWMRLRFPIKPSLQACNKRKKAQVYLSTDVLGGRKAQEEEFRSESWEPQNELWIEITKCCGLRSRWLGTQPSPYAVYRFFTFSDHDTAIIPASNNPYFRDQARFPVLVTSDLDHYLRREALSIHVFDDEDLEPGSYLGRARVPLLPLAKNESIKGDFNLTDPAEKPNGSIQVQLDWKFPYIPPESFLKPEAQTKGKDTKDSSKISSEEEKASFPSQDQMASPEVPIEAGQYRSKRKPPHGGERKEKEHQVVSYSRRKHGKRIGVQGKNRMEYLSLNILNGNTPEQVNYTEWKFSETNSFIGDGFKNQHEEEEMTLSHSALKQKEPLHPVNDKESSEQGSEVSEAQTTDSDDVIVPPMSQKYPKADSEKMCIEIVSLAFYPEAEVMSDENIKQVYVEYKFYDLPLSETETPVSLRKPRAGEEIHFHFSKVIDLDPQEQQGRRRFLFDMLNGQDPDQGHLKFTVVSDPLDEEKKECEEVGYAYLQLWQILESGRDILEQELDIVSPEDLATPIGRLKVSLQAAAVLHAIYKEMTEDLFS,1286,NP_065099.3.csv,refseq-RPGRIP1-NM_020366.3_clinical_seed_0_final,refseq-RPGRIP1-NM_020366.3.a2m,Invitae,refseq-RPGRIP1-NM_020366.3.npy,1,1286,1286
+NP_065107.1,MKKNRERFCNREREFVYKFKVGSQCLELRVPLKFPVQENASHLHGRLMLLHSLPCFIEKDLKEALTQFIEEESLSDYDRDAEASLAAVKSGEVDLHQLASTWAKAYAETTLEHARPEEPSWDEDFADVYHDLIHSPASETLLNLEHNYFVSISELIGERDVELKKLRERQGIEMEKVMQELGKSLTDQDVNSLAAQHFESQQDLENKWSNELKQSTAIQKQEYQEWVIKLHQDLKNPNNSSLSEEIKVQPSQFRESVEAIGRIYEEQRKLEESFTIHLGAQLKTMHNLRLLRADMLDFCKHKRNHRSGVKLHRLQTALSLYSTSLCGLVLLVDNRINSYSGIKRDFATVCQECTDFHFPRIEEQLEVVQQVVLYARTQRRSKLKESLDSGNQNGGNDDKTKNAERNYLNVLPGEFYITRHSNLSEIHVAFHLCVDDHVKSGNITARDPAIMGLRNILKVCCTHDITTISIPLLLVHDMSEEMTIPWCLRRAELVFKCVKGFMMEMASWDGGISRTVQFLVPQSISEEMFYQLSNMLPQIFRVSSTLTLTSKH,552,NP_065107.1.csv,refseq-C12orf4-NM_020374.3_clinical_seed_0_final,refseq-C12orf4-NM_020374.3.a2m,Invitae,refseq-C12orf4-NM_020374.3.npy,1,552,552
+NP_065109.1,MFPREKTWNISFAGCGFLGVYYVGVASCLREHAPFLVANATHIYGASAGALTATALVTGVCLGEAGAKFIEVSKEARKRFLGPLHPSFNLVKIIRSFLLKVLPADSHEHASGRLGISLTRVSDGENVIISHFNSKDELIQANVCSGFIPVYCGLIPPSLQGVRYVDGGISDNLPLYELKNTITVSPFSGESDICPQDSSTNIHELRVTNTSIQFNLRNLYRLSKALFPPEPLVLREMCKQGYRDGLRFLQRNGLLNRPNPLLALPPARPHGPEDKDQAVESAQAEDYSQLPGEDHILEHLPARLNEALLEACVEPTDLLTTLSNMLPVRLATAMMVPYTLPLESALSFTIRLLEWLPDVPEDIRWMKEQTGSICQYLVMRAKRKLGRHLPSRLPEQVELRRVQSLPSVPLSCAAYREALPGWMRNNLSLGDALAKWEECQRQLLLGLFCTNVAFPPEALRMRAPADPAPAPADPASPQHQLAGPAPLLSTPAPEARPVIGALGL,504,NP_065109.1.csv,refseq-PNPLA2-NM_020376.3_clinical_seed_0_final,refseq-PNPLA2-NM_020376.3.a2m,Invitae,refseq-PNPLA2-NM_020376.3.npy,1,504,504
+NP_065114.3,MNFRQLLLHLPRYLGASGSPRRLWWSPSLDTISSVGSWRGRSSKSPAHWNQVVSEAEKIVGYPTSFMSLRCLLSDELSNIAMQVRKLVGTQHPLLTTARGLVHDSWNSLQLRGLVVLLISKAAGPSSVNTSCQNYDMVSGIYSCQRSLAEITELIHIALLVHRGIVNLNELQSSDGPLKDMQFGNKIAILSGDFLLANACNGLALLQNTKVVELLASALMDLVQGVYHENSTSKESYITDDIGISTWKEQTFLSHGALLAKSCQAAMELAKHDAEVQNMAFQYGKHMAMSHKINSDVQPFIKEKTSDSMTFNLNSAPVVLHQEFLGRDLWIKQIGEAQEKGRLDYAKLRERIKAGKGVTSAIDLCRYHGNKALEALESFPPSEARSALENIVFAVTRFS,399,NP_065114.3.csv,refseq-PDSS2-NM_020381.3_clinical_seed_0_final,refseq-PDSS2-NM_020381.3.a2m,Invitae,refseq-PDSS2-NM_020381.3.npy,1,399,399
+NP_065134.1,MDRSGFGEISSPVIREAEVTRTARKQSAQKRVLLQASQDENFGNTTPRNQVIPRTPSSFRQPFTPTSRSLLRQPDISCILGTGGKSPRLTQSSGFFGNLSMVTNLDDSNWAAAFSSQRSGLFTNTEPHSITEDVTISAVMLREDDPGEAASMSMFSDFLQSFLKHSSSTVFDLVEEYENICGSQVNILSKIVSRATPGLQKFSKTASMLWLLQQEMVTWRLLASLYRDRIQSALEEESVFAVTAVNASEKTVVEALFQRDSLVRQSQLVVDWLESIAKDEIGEFSDNIEFYAKSVYWENTLHTLKQRQLTSYVGSVRPLVTELDPDAPIRQKMPLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATLEGWKLYHDPNVNGGTELEPVEGNPYRRIWKISCWRMAEDELFNRYERAIYAALSGNLKQLLPVCDTWEDTVWAYFRVMVDSLVEQEIQTSVATLDETEELPREYLGANWTLEKVFEELQATDKKRVLEENQEHYHIVQKFLILGDIDGLMDEFSKWLSKSRNNLPGHLLRFMTHLILFFRTLGLQTKEEVSIEVLKTYIQLLIREKHTNLIAFYTCHLPQDLAVAQYALFLESVTEFEQRHHCLELAKEADLDVATITKTVVENIRKKDNGEFSHHDLAPALDTGTTEEDRLKIDVIDWLVFDPAQRAEALKQGNAIMRKFLASKKHEAAKEVFVKIPQDSIAEIYNQCEEQGMESPLPAEDDNAIREHLCIRAYLEAHETFNEWFKHMNSVPQKPALIPQPTFTEKVAHEHKEKKYEMDFGIWKGHLDALTADVKEKMYNVLLFVDGGWMVDVREDAKEDHERTHQMVLLRKLCLPMLCFLLHTILHSTGQYQECLQLADMVSSERHKLYLVFSKEELRKLLQKLRESSLMLLDQGLDPLGYEIQL,925,NP_065134.1.csv,refseq-NUP107-NM_020401.3_clinical_seed_0_final,refseq-NUP107-NM_020401.3.a2m,Invitae,refseq-NUP107-NM_020401.3.npy,1,925,925
+NP_065166.2,MSGGRFDFDDGGAYCGGWEGGKAHGHGLCTGPKGQGEYSGSWNFGFEVAGVYTWPSGNTFEGYWSQGKRHGLGIETKGRWLYKGEWTHGFKGRYGIRQSSSSGAKYEGTWNNGLQDGYGTETYADGGTYQGQFTNGMRHGYGVRQSVPYGMAVVVRSPLRTSLSSLRSEHSNGTVAPDSPASPASDGPALPSPAIPRGGFALSLLANAEAAARAPKGGGLFQRGALLGKLRRAESRTSVGSQRSRVSFLKSDLSSGASDAASTASLGEAAEGADEAAPFEADIDATTTETYMGEWKNDKRSGFGVSERSSGLRYEGEWLDNLRHGYGCTTLPDGHREEGKYRHNVLVKDTKRRMLQLKSNKVRQKVEHSVEGAQRAAAIARQKAEIAASRTSHAKAKAEAAEQAALAANQESNIARTLARELAPDFYQPGPEYQKRRLLQEILENSESLLEPPDRGAGAAGLPQPPRESPQLHERETPRPEGGSPSPAGTPPQPKRPRPGVSKDGLLSPGAWNGEPSGEGSRSVTPSEGAGRRSPARPATERMAIEALQAPPAPSREPEVALYQGYHSYAVRTTPPEPPPFEDQPEPEVSGSESAPSSPATAPLQAPTLRGPEPARETPAKLEPKPIIPKAEPRAKARKTEARGLTKAGAKKKARKEAALAAEAEVEVEEVPNTILICMVILLNIGLAILFVHLLT,696,NP_065166.2.csv,refseq-JPH2-NM_020433.4_clinical_seed_0_final,refseq-JPH2-NM_020433.4.a2m,Invitae,refseq-JPH2-NM_020433.4.npy,1,696,696
+NP_065168.2,MTNMSWSFLTRLLEEIHNHSTFVGKVWLTVLVVFRIVLTAVGGEAIYSDEQAKFTCNTRQPGCDNVCYDAFAPLSHVRFWVFQIVVISTPSVMYLGYAVHRLARASEQERRRALRRRPGPRRAPRAHLPPPHAGWPEPADLGEEEPMLGLGEEEEEEETGAAEGAGEEAEEAGAEEACTKAVGADGKAAGTPGPTGQHDGRRRIQREGLMRVYVAQLVARAAFEVAFLVGQYLLYGFEVRPFFPCSRQPCPHVVDCFVSRPTEKTVFLLVMYVVSCLCLLLNLCEMAHLGLGSAQDAVRGRRGPPASAPAPAPRPPPCAFPAAAAGLACPPDYSLVVRAAERARAHDQNLANLALQALRDGAAAGDRDRDSSPCVGLPAASRGPPRAGAPASRTGSATSAGTVGEQGRPGTHERPGAKPRAGSEKGSASSRDGKTTVWI,439,NP_065168.2.csv,refseq-GJC2-NM_020435.3_clinical_seed_0_final,refseq-GJC2-NM_020435.3.a2m,Invitae,refseq-GJC2-NM_020435.3.npy,1,439,439
+NP_065390.1,MFQAAERPQEWAMEGPRDGLKKERLLDDRHDSGLDSMKDEEYEQMVKELQEIRLEPQEVPRGSEPWKQQLTEDGDSFLHLAIIHEEKALTMEVIRQVKGDLAFLNFQNNLQQTPLHLAVITNQPEIAEALLGAGCDPELRDFRGNTPLHLACEQGCLASVGVLTQSCTTPHLHSILKATNYNGHTCLHLASIHGYLGIVELLVSLGADVNAQEPCNGRTALHLAVDLQNPDLVSLLLKCGADVNRVTYQGYSPYQLTWGRPSTRIQQQLGQLTLENLQMLPESEDEESYDTESEFTEFTEDELPYDDCVFGGQRLTL,317,NP_065390.1.csv,refseq-NFKBIA-NM_020529.2_clinical_seed_0_final,refseq-NFKBIA-NM_020529.2.a2m,Invitae,refseq-NFKBIA-NM_020529.2.npy,1,317,317
+NP_065394.1,MTAPAGPRGSETERLLTPNPGYGTQAGPSPAPPTPPEEEDLRRRLKYFFMSPCDKFRAKGRKPCKLMLQVVKILVVTVQLILFGLSNQLAVTFREENTIAFRHLFLLGYSDGADDTFAAYTREQLYQAIFHAVDQYLALPDVSLGRYAYVRGGGDPWTNGSGLALCQRYYHRGHVDPANDTFDIDPMVVTDCIQVDPPERPPPPPSDDLTLLESSSSYKNLTLKFHKLVNVTIHFRLKTINLQSLINNEIPDCYTFSVLITFDNKAHSGRIPISLETQAHIQECKHPSVFQHGDNSFRLLFDVVVILTCSLSFLLCARSLLRGFLLQNEFVGFMWRQRGRVISLWERLEFVNGWYILLVTSDVLTISGTIMKIGIEAKNLASYDVCSILLGTSTLLVWVGVIRYLTFFHNYNILIATLRVALPSVMRFCCCVAVIYLGYCFCGWIVLGPYHVKFRSLSMVSECLFSLINGDDMFVTFAAMQAQQGRSSLVWLFSQLYLYSFISLFIYMVLSLFIALITGAYDTIKHPGGAGAEESELQAYIAQCQDSPTSGKFRRGSGSACSLLCCCGRDPSEEHSLLVN,580,NP_065394.1.csv,refseq-MCOLN1-NM_020533.2_clinical_seed_0_final,refseq-MCOLN1-NM_020533.2.a2m,Invitae,refseq-MCOLN1-NM_020533.2.npy,1,580,580
+NP_065434.1,MLGSLGLWALLPTAVEAPPNRRTCVFFEAPGVRGSTKTLGELLDTGTELPRAIRCLYSRCCFGIWNLTQDRAQVEMQGCRDSDEPGCESLHCDPSPRAHPSPGSTLFTCSCGTDFCNANYSHLPPPGSPGTPGSQGPQAAPGESIWMALVLLGLFLLLLLLLGSIILALLQRKNYRVRGEPVPEPRPDSGRDWSVELQELPELCFSQVIREGGHAVVWAGQLQGKLVAIKAFPPRSVAQFQAERALYELPGLQHDHIVRFITASRGGPGRLLSGPLLVLELHPKGSLCHYLTQYTSDWGSSLRMALSLAQGLAFLHEERWQNGQYKPGIAHRDLSSQNVLIREDGSCAIGDLGLALVLPGLTQPPAWTPTQPQGPAAIMEAGTQRYMAPELLDKTLDLQDWGMALRRADIYSLALLLWEILSRCPDLRPDSSPPPFQLAYEAELGNTPTSDELWALAVQERRRPYIPSTWRCFATDPDGLRELLEDCWDADPEARLTAECVQQRLAALAHPQESHPFPESCPRGCPPLCPEDCTSIPAPTILPCRPQRSACHFSVQQGPCSRNPQPACTLSPV,573,NP_065434.1.csv,refseq-AMHR2-NM_020547.2_clinical_seed_0_final,refseq-AMHR2-NM_020547.2.a2m,Invitae,refseq-AMHR2-NM_020547.2.npy,1,573,573
+NP_065682.2,MHYDGHVRFDLPPQGSVLARNVSTRSCPPRTSPAVDLEEEEEESSVDGKGDRKSTGLKLSKKKARRRHTDDPSKECFTLKFDLNVDIETEIVPAMKKKSLGEVLLPVFERKGIALGKVDIYLDQSNTPLSLTFEAYRFGGHYLRVKAPAKPGDEGKVEQGMKDSKSLSLPILRPAGTGPPALERVDAQSRRESLDILAPGRRRKNMSEFLGEASIPGQEPPTPSSCSLPSGSSGSTNTGDSWKNRAASRFSGFFSSGPSTSAFGREVDKMEQLEGKLHTYSLFGLPRLPRGLRFDHDSWEEEYDEDEDEDNACLRLEDSWRELIDGHEKLTRRQCHQQEAVWELLHTEASYIRKLRVIINLFLCCLLNLQESGLLCEVEAERLFSNIPEIAQLHRRLWASVMAPVLEKARRTRALLQPGDFLKGFKMFGSLFKPYIRYCMEEEGCMEYMRGLLRDNDLFRAYITWAEKHPQCQRLKLSDMLAKPHQRLTKYPLLLKSVLRKTEEPRAKEAVVAMIGSVERFIHHVNACMRQRQERQRLAAVVSRIDAYEVVESSSDEVDKLLKEFLHLDLTAPIPGASPEETRQLLLEGSLRMKEGKDSKMDVYCFLFTDLLLVTKAVKKAERTRVIRPPLLVDKIVCRELRDPGSFLLIYLNEFHSAVGAYTFQASGQALCRGWVDTIYNAQNQLQQLRAQEPPGSQQPLQSLEEEEDEQEEEEEEEEEEEEGEDSGTSAASSPTIMRKSSGSPDSQHCASDGSTETLAMVVVEPGDTLSSPEFDSGPFSSQSDETSLSTTASSATPTSELLPLGPVDGRSCSMDSAYGTLSPTSLQDFVAPGPMAELVPRAPESPRVPSPPPSPRLRRRTPVQLLSCPPHLLKSKSEASLLQLLAGAGTHGTPSAPSRSLSELCLAVPAPGIRTQGSPQEAGPSWDCRGAPSPGSGPGLVGCLAGEPAGSHRKRCGDLPSGASPRVQPEPPPGVSAQHRKLTLAQLYRIRTTLLLNSTLTASEV,1006,NP_065682.2.csv,refseq-PLEKHG5-NM_020631.4_clinical_seed_0_final,refseq-PLEKHG5-NM_020631.4.a2m,Invitae,refseq-PLEKHG5-NM_020631.4.npy,1,1006,1006
+NP_065683.2,MVSVFRSEEMCLSQLFLQVEAAYCCVAELGELGLVQFKDLNMNVNSFQRKFVNEVRRCESLERILRFLEDEMQNEIVVQLLEKSPLTPLPREMITLETVLEKLEGELQEANQNQQALKQSFLELTELKYLLKKTQDFFETETNLADDFFTEDTSGLLELKAVPAYMTGKLGFIAGVINRERMASFERLLWRICRGNVYLKFSEMDAPLEDPVTKEEIQKNIFIIFYQGEQLRQKIKKICDGFRATVYPCPEPAVERREMLESVNVRLEDLITVITQTESHRQRLLQEAAANWHSWLIKVQKMKAVYHILNMCNIDVTQQCVIAEIWFPVADATRIKRALEQGMELSGSSMAPIMTTVQSKTAPPTFNRTNKFTAGFQNIVDAYGVGSYREINPAPYTIITFPFLFAVMFGDCGHGTVMLLAALWMILNERRLLSQKTDNEIWNTFFHGRYLILLMGIFSIYTGLIYNDCFSKSLNIFGSSWSVQPMFRNGTWNTHVMEESLYLQLDPAIPGVYFGNPYPFGIDPIWNLASNKLTFLNSYKMKMSVILGIVQMVFGVILSLFNHIYFRRTLNIILQFIPEMIFILCLFGYLVFMIIFKWCCFDVHVSQHAPSILIHFINMFLFNYSDSSNAPLYKHQQEVQSFFVVMALISVPWMLLIKPFILRASHRKSQLQASRIQEDATENIEGDSSSPSSRSGQRTSADTHGALDDHGEEFNFGDVFVHQAIHTIEYCLGCISNTASYLRLWALSLAHAQLSEVLWTMVMNSGLQTRGWGGIVGVFIIFAVFAVLTVAILLIMEGLSAFLHALRLHWVEFQNKFYVGDGYKFSPFSFKHILDGTAEE,840,NP_065683.2.csv,refseq-ATP6V0A4-NM_020632.2_clinical_seed_0_final,refseq-ATP6V0A4-NM_020632.2.a2m,Invitae,refseq-ATP6V0A4-NM_020632.2.npy,1,840,840
+NP_065685.1,MLRFLPDLAFSFLLILALGQAVQFQEYVFLQFLGLDKAPSPQKFQPVPYILKKIFQDREAAATTGVSRDLCYVKELGVRGNVLRFLPDQGFFLYPKKISQASSCLQKLLYFNLSAIKEREQLTLAQLGLDLGPNSYYNLGPELELALFLVQEPHVWGQTTPKPGKMFVLRSVPWPQGAVHFNLLDVAKDWNDNPRKNFGLFLEILVKEDRDSGVNFQPEDTCARLRCSLHASLLVVTLNPDQCHPSRKRRAAIPVPKLSCKNLCHRHQLFINFRDLGWHKWIIAPKGFMANYCHGECPFSLTISLNSSNYAFMQALMHAVDPEIPQAVCIPTKLSPISMLYQDNNDNVILRHYEDMVVDECGCG,364,NP_065685.1.csv,refseq-GDF3-NM_020634.1_clinical_seed_0_final,refseq-GDF3-NM_020634.1.a2m,Invitae,refseq-GDF3-NM_020634.1.npy,1,364,364
+NP_065689.1,MLGARLRLWVCALCSVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVDGAPHQTIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGYDVYHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSERDPLNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPFAKFI,251,NP_065689.1.csv,refseq-FGF23-NM_020638.2_clinical_seed_0_final,refseq-FGF23-NM_020638.2.a2m,Invitae,refseq-FGF23-NM_020638.2.npy,1,251,251
+NP_065690.2,MEGDGGTPWALALLRTFDAGEFTGWEKVGSGGFGQVYKVRHVHWKTWLAIKCSPSLHVDDRERMELLEEAKKMEMAKFRYILPVYGICREPVGLVMEYMETGSLEKLLASEPLPWDLRFRIIHETAVGMNFLHCMAPPLLHLDLKPANILLDAHYHVKISDFGLAKCNGLSHSHDLSMDGLFGTIAYLPPERIREKSRLFDTKHDVYSFAIVIWGVLTQKKPFADEKNILHIMVKVVKGHRPELPPVCRARPRACSHLIRLMQRCWQGDPRVRPTFQEITSETEDLCEKPDDEVKETAHDLDVKSPPEPRSEVVPARLKRASAPTFDNDYSLSELLSQLDSGVSQAVEGPEELSRSSSESKLPSSGSGKRLSGVSSVDSAFSSRGSLSLSFEREPSTSDLGTTDVQKKKLVDAIVSGDTSKLMKILQPQDVDLALDSGASLLHLAVEAGQEECAKWLLLNNANPNLSNRRGSTPLHMAVERRVRGVVELLLARKISVNAKDEDQWTALHFAAQNGDESSTRLLLEKNASVNEVDFEGRTPMHVACQHGQENIVRILLRRGVDVSLQGKDAWLPLHYAAWQGHLPIVKLLAKQPGVSVNAQTLDGRTPLHLAAQRGHYRVARILIDLCSDVNVCSLLAQTPLHVAAETGHTSTARLLLHRGAGKEAMTSDGYTALHLAARNGHLATVKLLVEEKADVLARGPLNQTALHLAAAHGHSEVVEELVSADVIDLFDEQGLSALHLAAQGRHAQTVETLLRHGAHINLQSLKFQGGHGPAATLLRRSKT,784,NP_065690.2.csv,refseq-RIPK4-NM_020639.2_clinical_seed_0_final,refseq-RIPK4-NM_020639.2.a2m,Invitae,refseq-RIPK4-NM_020639.2.npy,1,784,784
+NP_065712.1,MDSLLMNRRKFLYQFKNVRWAKGRRETYLCYVVKRRDSATSFSLDFGYLRNKNGCHVELLFLRYISDWDLDPGRCYRVTWFTSWSPCYDCARHVADFLRGNPNLSLRIFTARLYFCEDRKAEPEGLRRLHRAGVQIAIMTFKDYFYCWNTFVENHERTFKAWEGLHENSVRLSRQLRRILLPLYEVDDLRDAFRTLGL,198,NP_065712.1.csv,refseq-AICDA-NM_020661.2_clinical_seed_0_final,refseq-AICDA-NM_020661.2.a2m,Invitae,refseq-AICDA-NM_020661.2.npy,1,198,198
+NP_065731.3,MWFFARDPVRDFPFELIPEPPEGGLPGPWALHRGRKKATGSPVSIFVYDVKPGAEEQTQVAKAAFKRFKTLRHPNILAYIDGLETEKCLHVVTEAVTPLGIYLKARVEAGGLKELEISWGLHQIVKALSFLVNDCSLIHNNVCMAAVFVDRAGEWKLGGLDYMYSAQGNGGGPPRKGIPELEQYDPPELADSSGRVVREKWSADMWRLGCLIWEVFNGPLPRAAALRNPGKIPKTLVPHYCELVGANPKVRPNPARFLQNCRAPGGFMSNRFVETNLFLEEIQIKEPAEKQKFFQELSKSLDAFPEDFCRHKVLPQLLTAFEFGNAGAVVLTPLFKVGKFLSAEEYQQKIIPVVVKMFSSTDRAMRIRLLQQMEQFIQYLDEPTVNTQIFPHVVHGFLDTNPAIREQTVKSMLLLAPKLNEANLNVELMKHFARLQAKDEQGPIRCNTTVCLGKIGSYLSASTRHRVLTSAFSRATRDPFAPSRVAGVLGFAATHNLYSMNDCAQKILPVLCGLTVDPEKSVRDQAFKAIRSFLSKLESVSEDPTQLEEVEKDVHAASSPGMGGAAASWAGWAVTGVSSLTSKLIRSHPTTAPTETNIPQRPTPEGVPAPAPTPVPATPTTSGHWETQEEDKDTAEDSSTADRWDDEDWGSLEQEAESVLAQQDDWSTGGQVSRASQVSNSDHKSSKSPESDWSSWEAEGSWEQGWQEPSSQEPPPDGTRLASEYNWGGPESSDKGDPFATLSARPSTQPRPDSWGEDNWEGLETDSRQVKAELARKKREERRREMEAKRAERKVAKGPMKLGARKLD,808,NP_065731.3.csv,refseq-SCYL1-NM_020680.3_clinical_seed_0_final,refseq-SCYL1-NM_020680.3.a2m,Invitae,refseq-SCYL1-NM_020680.3.npy,1,808,808
+NP_065737.2,MASMLLAQRLACSFQHSYRLLVPGSRHISQAAAKVDVEFDYDGPLMKTEVPGPRSQELMKQLNIIQNAEAVHFFCNYEESRGNYLVDVDGNRMLDLYSQISSVPIGYSHPALLKLIQQPQNASMFVNRPALGILPPENFVEKLRQSLLSVAPKGMSQLITMACGSCSNENALKTIFMWYRSKERGQRGFSQEELETCMINQAPGCPDYSILSFMGAFHGRTMGCLATTHSKAIHKIDIPSFDWPIAPFPRLKYPLEEFVKENQQEEARCLEEVEDLIVKYRKKKKTVAGIIVEPIQSEGGDNHASDDFFRKLRDIARKHGCAFLVDEVQTGGGCTGKFWAHEHWGLDDPADVMTFSKKMMTGGFFHKEEFRPNAPYRIFNTWLGDPSKNLLLAEVINIIKREDLLNNAAHAGKALLTGLLDLQARYPQFISRVRGRGTFCSFDTPDDSIRNKLILIARNKGVVLGGCGDKSIRFRPTLVFRDHHAHLFLNIFSDILADFK,500,NP_065737.2.csv,refseq-ABAT-NM_020686.5_clinical_seed_0_final,refseq-ABAT-NM_020686.5.a2m,Invitae,refseq-ABAT-NM_020686.5.npy,1,500,500
+NP_065750.1,MDRMTEDALRLNLLKRSLDPADERDDVLAKRLKMEGHEAMERLKMLALLKRKDLANLEVPHELPTKQDGSGVKGYEEKLNGNLRPHGDNRTAGRPGKENINDEPVDMSARRSEPERGRLTPSPDIIVLSDNEASSPRSSSRMEERLKAANLEMFKGKGIEERQQLIKQLRDELRLEEARLVLLKKLRQSQLQKENVVQKTPVVQNAASIVQPSPAHVGQQGLSKLPSRPGAQGVEPQNLRTLQGHSVIRSATNTTLPHMLMSQRVIAPNPAQLQGQRGPPKPGLVRTTTPNMNPAINYQPQSSSSVPCQRTTSSAIYMNLASHIQPGTVNRVSSPLPSPSAMTDAANSQAAAKLALRKQLEKTLLEIPPPKPPAPLLHFLPSAANSEFIYMVGLEEVVQSVIDSQGKSCASLLRVEPFVCAQCRTDFTPHWKQEKNGKILCEQCMTSNQKKALKAEHTNRLKNAFVKALQQEQEIEQRLQQQAALSPTTAPAVSSVSKQETIMRHHTLRQAPQPQSSLQRGIPTSARSMLSNFAQAPQLSVPGGLLGMPGVNIAYLNTGIGGHKGPSLADRQREYLLDMIPPRSISQSISGQK,593,NP_065750.1.csv,refseq-GATAD2B-NM_020699.3_clinical_seed_0_final,refseq-GATAD2B-NM_020699.3.a2m,Invitae,refseq-GATAD2B-NM_020699.3.npy,1,593,593
+NP_065753.2,MLQNPQEKSQAYPRRRRPGCYAYRQNPEAIAAAAMYTFLPDNFSPAKPKPSKDLKPLLGSAVLGLLLVLAAVVAWCYYSVSLRKAERLRAELLDLKAGGFSIRNQKGEQVFRLAFRSGALDLDSCSRDGALLGCSLTADGLPLHFFIQTVRPKDTVMCYRVRWEEAAPGRAVEHAMFLGDAAAHWYGGAEMRTQHWPIRLDGQQEPQPFVTSDVYSSDAAFGGILERYWLSSRAAAIKVNDSVPFHLGWNSTERSLRLQARYHDTPYKPPAGRAAAPELSYRVCVGSDVTSIHKYMVRRYFNKPSRVPAPEAFRDPIWSTWALYGRAVDQDKVLRFAQQIRLHHFNSSHLEIDDMYTPAYGDFDFDEVKFPNASDMFRRLRDAGFRVTLWVHPFVNYNSSRFGEGVERELFVREPTGRLPALVRWWNGIGAVLDFTHPKARDWFQGHLRRLRSRYSVASFKFDAGEVSYLPRDFSTYRPLPDPSVWSRRYTEMALPFFSLAEVRVGYQSQNISCFFRLVDRDSVWGYDLGLRSLIPAVLTVSMLGYPFILPDMVGGNAVPQRTAGGDVPERELYIRWLEVAAFMPAMQFSIPPWRYDAEVVAIAQKFAALRASLVAPLLLELAGEVTDTGDPIVRPLWWIAPGDETAHRIDSQFLIGDTLLVAPVLEPGKQERDVYLPAGKWRSYKGELFDKTPVLLTDYPVDLDEIAYFTWAS,714,NP_065753.2.csv,MYORG_HUMAN_b01_clinical_seed_0_final,MYORG_HUMAN_b01.a2m,EVE,MYORG_HUMAN_b01_theta_0.2.npy,1,714,714
+NP_065770.1,MDRNYPSAGFGDPLGAGAGWSYERSAKASLVYGSSRTSHPETDILHRQAYAAPHPLQSYATNHHPAGLSGLFDTGLHHAGSAGPDASVMNLISALESRGPQPGPSASSLLSQFRSPSWQTAMHTPGPTELFISGALPGSSTFPSSSALSAYQHPASFGSRPFPVPSSLSLQDPPFSPPANGLLSPHDVLHLKPSQAPTVPSSLGFERLAGGGVLGPAGLGPAQTPPYRPGPPDPPPPPRHLPTQFNLLASSSAAAAAAEQSSPQLYNFSGAAPGPPPPERALPRQDTVIKHYQRPASAQPPPPPPPAHALQHYLSCGGSYPSMGHRANLACSPLGGGEPSPGAGEPSKAGPSGATAGASGRATGPEAAGGGGAGGGGGGYRPIIQSPGYKTGKGGYGAAAGGATRPPPPRSTATPKCQSLGGPAAAYATGKASGAGGAGGQAYSPGQPQGLLGPQAYGQGFGGGQAQDLSKAPSYSGGPPQPPSGPPPPGLATCQSYSPDQLQGQLYGVQGEPYPGPAAHSQGLPTASPSLSYSTGHSPALSGHGGGWGPSSLGGGGEASPSHIIRPLQSPPATGRPPGVGSPGAPGKYLSSVLASAPFLAPPGAGSYAAGAGGYKGKGDGSELLAGPGGPPAERTEDEEFLIQHLLQAPSPPRTSGADGLVGEDGAADASKGLGGSGGAGGPPGTPYELAKEDPQRYHLQSVIRTSASLDEGATAALELGLGRLKEKKKGPERGGETPEGLATSVVHYGAGAKELGAFLQKSPPPPPPTAQSTQPTPHGLLLEAGGPDLPLVLPPPPPQLLPSVLSHAPSPSPSASKVGVHLLEPATRDGAPQPPPPPPPPPPPMPLQLEAHLRSHGLEPAAPSPRLRPEESLDPPGAMQELLGALEPLPPAPGDTGVGPPNSEGKDPAGAYRSPSPQGTKAPRFVPLTSICFPDSLLQDEERSFFPTMEEMFGGGAADDYGKAGPPEDEGDPKAGAGPPPGPPAYDPYGPYCPGRASGAGPETPGLGLDPNKPPELPSTVNAEPLGLIQSGPHQAAPPPPPPPPPPPAPASEPKGGLTSPIFCSTKPKKLLKTSSFHLLRRRDPPFQTPKKLYAQEYEFEADEDKADVPADIRLNPRRLPDLVSSCRSRPALSPLGDIDFCPPNPGPDGPRRRGRKPTKAKRDGPPRPRGRPRIRPLEVPTTAGPASASTPTDGAKKPRGRGRGRGRKAEEAGGTRLEPLKPLKIKLSVPKAGEGLGTSSGDAISGTDHNSLDSSLTREKIEAKIKEVEEKQPEMKSGFMASFLDFLKSGKRHPPLYQAGLTPPLSPPKSVPPSVPARGLQPQPPATPAVPHPPPSGAFGLGGALEAAESEGLGLGCPSPCKRLDEELKRNLETLPSFSSDEEDSVAKNRDLQESISSAISALDDPPLAGPKDTSTPDGPPLAPAAAVPGPPPLPGLPSANSNGTPEPPLLEEKPPPTPPPAPTPQPQPPPPPPPPQPALPSPPPLVAPTPSSPPPPPLPPPPPPAMPSPPPPPPPAAAPLAAPPEEPAAPSPEDPELPDTRPLHLAKKQETAAVCGETDEEAGESGGEGIFRERDEFVIRAEDIPSLKLALQTGREPPPIWRVQKALLQKFTPEIKDGQRQFCATSNYLGYFGDAKNRYQRLYVKFLENVNKKDYVRVCARKPWHRPPVPVRRSGQAKNPVSAGGSSAPPPKAPAPPPKPETPEKTTSEKPPEQTPETAMPEPPAPEKPSLLRPVEKEKEKEKVTRGERPLRGERATSGRQTRPERSLATGQPATSRLPKARPTKVKAEPPPKKRKKWLKEAGGNATAGGGPPGSSSDSESSPGAPSEDERAVPGRLLKTRAMREMYRSYVEMLVSTALDPDMIQALEDTHDELYLPPMRKIDGLLNEHKKKVLKRLSLSPALQDALHTFPQLQVEQSGEGSPEEGAVRLRPAGEPYNRKTLSKLKRSVVRAQEFKVELEKSGYYTLYHSLHHYKYHTFLRCRDQTLAIEGGAEDLGQEEVVQQCMRNQPWLEQLFDSFSDLLAQAQAHSRCG,2036,NP_065770.1.csv,refseq-PRR12-NM_020719.2_clinical_seed_0_final,refseq-PRR12-NM_020719.2.a2m,Invitae,refseq-PRR12-NM_020719.2.npy,1,2036,2036
+NP_065793.1,MSRPQGLLWLPLLFTPVCVMLNSNVLLWLTALAIKFTLIDSQAQYPVVNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASPPTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQNEDCLYLNIYVPTEDDIHDQNSKKPVMVYIHGGSYMEGTGNMIDGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKRVTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILADKVGCNMLDTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGVNQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENPETRRKTLVALFTDHQWVAPAVATADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYVFGIPMIGPTELFSCNFSKNDVMLSAVVMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVAWSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTSFPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVGASLLFLNILAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECESLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTNLPHGHSTTRV,816,NP_065793.1.csv,refseq-NLGN4X-NM_020742.3_clinical_seed_0_final,refseq-NLGN4X-NM_020742.3.a2m,Invitae,refseq-NLGN4X-NM_020742.3.npy,1,816,816
+NP_065796.2,MAASVAAAARRLRRAIRRSPAWRGLSHRPLSSEPPAAKASAVRAAFLNFFRDRHGHRLVPSASVRPRGDPSLLFVNAGMNQFKPIFLGTVDPRSEMAGFRRVANSQKCVRAGGHHNDLEDVGRDLSHHTFFEMLGNWAFGGEYFKEEACNMAWELLTQVYGIPEERLWISYFDGDPKAGLDPDLETRDIWLSLGVPASRVLSFGPQENFWEMGDTGPCGPCTEIHYDLAGGVGAPQLVELWNLVFMQHNREADGSLQPLPQRHVDTGMGLERLVAVLQGKHSTYDTDLFSPLLNAIQQGCRAPPYLGRVGVADEGRTDTAYRVVADHIRTLSVCISDGIFPGMSGPPLVLRRILRRAVRFSMEILKAPPGFLGSLVPVVVETLGDAYPELQRNSAQIANLVSEDEAAFLASLERGRRIIDRTLRTLGPSDMFPAEVAWSLSLCGDLGLPLDMVELMLEEKGVQLDSAGLERLAQEEAQHRARQAEPVQKQGLWLDVHALGELQRQGVPPTDDSPKYNYSLRPSGSYEFGTCEAQVLQLYTEDGTAVASVGKGQRCGLLLDRTNFYAEQGGQASDRGYLVRAGQEDVLFPVARAQVCGGFILHEAVAPECLRLGDQVQLHVDEAWRLGCMAKHTATHLLNWALRQTLGPGTEQQGSHLNPEQLRLDVTTQTPLTPEQLRAVENTVQEAVGQDEAVYMEEVPLALTAQVPGLRSLDEVYPDPVRVVSVGVPVAHALDPASQAALQTSVELCCGTHLLRTGAVGDLVIIGDRQLSKGTTRLLAVTGEQAQQARELGQSLAQEVKAATERLSLGSRDVAEALRLSKDIGRLIEAVETAVMPQWQRRELLATVKMLQRRANTAIRKLQMGQAAKKTQELLERHSKGPLIVDTVSAESLSVLVKVVRQLCEQAPSTSVLLLSPQPMGKVLCACQVAQGAMPTFTAEAWALAVCSHMGGKAWGSRVVAQGTGSTTDLEAALSIAQTYALSQL,985,NP_065796.2.csv,refseq-AARS2-NM_020745.4_clinical_seed_0_final,refseq-AARS2-NM_020745.4.a2m,Invitae,refseq-AARS2-NM_020745.4_theta_0.2.npy,1,985,985
+NP_065811.1,MASSAREHLLFVRRRNPQMRYTLSPENLQSLAAQSSMPENMTLQRANSDTDLVTSESRSSLTASMYEYTLGQAQNLIIFWDIKEEVDPSDWIGLYHIDENSPANFWDSKNRGVTGTQKGQIVWRIEPGPYFMEPEIKICFKYYHGISGALRATTPCITVKNPAVMMGAEGMEGGASGNLHSRKLVSFTLSDLRAVGLKKGMFFNPDPYLKMSIQPGKKSSFPTCAHHGQERRSTIISNTTNPIWHREKYSFFALLTDVLEIEIKDKFAKSRPIIKRFLGKLTIPVQRLLERQAIGDQMLSYNLGRRLPADHVSGYLQFKVEVTSSVHEDASPEAVGTILGVNSVNGDLGSPSDDEDMPGSHHDSQVCSNGPVSEDSAADGTPKHSFRTSSTLEIDTEELTSTSSRTSPPRGRQDSLNDYLDAIEHNGHSRPGTATCSERSMGASPKLRSSFPTDTRLNAMLHIDSDEEDHEFQQDLGYPSSLEEEGGLIMFSRASRADDGSLTSQTKLEDNPVENEEASTHEAASFEDKPENLPELAESSLPAGPAPEEGEGGPEPQPSADQGSAELCGSQEVDQPTSGADTGTSDASGGSRRAVSETESLDQGSEPSQVSSETEPSDPARTESVSEASTRPEGESDLECADSSCNESVTTQLSSVDTRCSSLESARFPETPAFSSQEEEDGACAAEPTSSGPAEGSQESVCTAGSLPVVQVPSGEDEGPGAESATVPDQEELGEVWQRRGSLEGAAAAAESPPQEEGSAGEAQGTCEGATAQEEGATGGSQANGHQPLRSLPSVRQDVSRYQRVDEALPPNWEARIDSHGRIFYVDHVNRTTTWQRPTAPPAPQVLQRSNSIQQMEQLNRRYQSIRRTMTNERPEENTNAIDGAGEEADFHQASADFRRENILPHSTSRSRITLLLQSPPVKFLISPEFFTVLHSNPSAYRMFTNNTCLKHMITKVRRDTHHFERYQHNRDLVGFLNMFANKQLELPRGWEMKHDHQGKAFFVDHNSRTTTFIDPRLPLQSSRPTSALVHRQHLTRQRSHSAGEVGEDSRHAGPPVLPRPSSTFNTVSRPQYQDMVPVAYNDKIVAFLRQPNIFEILQERQPDLTRNHSLREKIQFIRTEGTPGLVRLSSDADLVMLLSLFEEEIMSYVPPHALLHPSYCQSPRGSPVSSPQNSPGTQRANARAPAPYKRDFEAKLRNFYRKLETKGYGQGPGKLKLIIRRDHLLEDAFNQIMGYSRKDLQRNKLYVTFVGEEGLDYSGPSREFFFLVSRELFNPYYGLFEYSANDTYTVQISPMSAFVDNHHEWFRFSGRILGLALIHQYLLDAFFTRPFYKALLRILCDLSDLEYLDEEFHQSLQWMKDNDIHDILDLTFTVNEEVFGQITERELKPGGANIPVTEKNKKEYIERMVKWRIERGVVQQTESLVRGFYEVVDARLVSVFDARELELVIAGTAEIDLSDWRNNTEYRGGYHDNHIVIRWFWAAVERFNNEQRLRLLQFVTGTSSIPYEGFASLRGSNGPRRFCVEKWGKITALPRAHTCFNRLDLPPYPSFSMLYEKLLTAVEETSTFGLE,1572,NP_065811.1.csv,refseq-HECW2-NM_020760.4_clinical_seed_0_final,refseq-HECW2-NM_020760.4.a2m,Invitae,refseq-HECW2-NM_020760.4_theta_0.2.npy,1,1572,1572
+NP_065822.2,MERAMEQLNRLTRSLRRARTVELPEDNETAVYTLMPMVMADQHRSVSELLSNSKFDVNYAFGRVKRSLLHIAANCGSVECLVLLLKKGANPNYQDISGCTPLHLAARNGQKKCMSKLLEYSADVNICNNEGLTAIHWLAVNGRTELLHDLVQHVSDVDVEDAMGQTALHVACQNGHKTTVQCLLDSGADINRPNVSGATPLYFACSHGQRDTAQILLLRGAKYLPDKNGVTPLDLCVQGGYGETCEVLIQYHPRLFQTIIQMTQNEDLRENMLRQVLEHLSQQSESQYLKILTSLAEVATTNGHKLLSLSSNYDAQMKSLLRIVRMFCHVFRIGPSSPSNGIDMGYNGNKTPRSQVFKPLELLWHSLDEWLVLIATELMKNKRDSTEITSILLKQKGQDQDAASIPPFEPPGPGSYENLSTGTRESKPDALAGRQEASADCQDVISMTANRLSAVIQAFYMCCSCQMPPGMTSPRFIEFVCKHDEVLKCFVNRNPKIIFDHFHFLLECPELMSRFMHIIKAQPFKDRCEWFYEHLHSGQPDSDMVHRPVNENDILLVHRDSIFRSSCEVVSKANCAKLKQGIAVRFHGEEGMGQGVVREWFDILSNEIVNPDYALFTQSADGTTFQPNSNSYVNPDHLNYFRFAGQILGLALNHRQLVNIYFTRSFYKHILGIPVNYQDVASIDPEYAKNLQWILDNDISDLGLELTFSVETDVFGAMEEVPLKPGGGSILVTQNNKAEYVQLVTELRMTRAIQPQINAFLQGFHMFIPPSLIQLFDEYELELLLSGMPEIDVSDWIKNTEYTSGYEREDPVIQWFWEVVEDITQEERVLLLQFVTGSSRVPHGGFANIMGGSGLQNFTIAAVPYTPNLLPTSSTCINMLKLPEYPSKEILKDRLLVALHCGSYGYTMA,909,NP_065822.2.csv,refseq-HACE1-NM_020771.3_clinical_seed_0_final,refseq-HACE1-NM_020771.3.a2m,Invitae,refseq-HACE1-NM_020771.3.npy,1,909,909
+NP_065825.1,MSNSRNNRVMVEGVGARVVRGPDWKWGKQDGGEGHVGTVRSFESPEEVVVVWDNGTAANYRCSGAYDLRILDSAPTGIKHDGTMCDTCRQQPIIGIRWKCAECTNYDLCTVCYHGDKHHLRHRFYRITTPGSERVLLESRRKSKKITARGIFAGARVVRGVDWQWEDQDGGNGRRGKVTEIQDWSASSPHSAAYVLWDNGAKNLYRVGFEGMSDLKCVQDAKGGSFYRDHCPVLGEQNGNRNPGGLQIGDLVNIDLDLEIVQSLQHGHGGWTDGMFETLTTTGTVCGIDEDHDIVVQYPSGNRWTFNPAVLTKANIVRSGDAAQGAEGGTSQFQVGDLVQVCYDLERIKLLQRGHGEWAEAMLPTLGKVGRVQQIYSDSDLKVEVCGTSWTYNPAAVSKVASAGSAISNASGERLSQLLKKLFETQESGDLNEELVKAAANGDVAKVEDLLKRPDVDVNGQCAGHTAMQAASQNGHVDILKLLLKQNVDVEAEDKDGDRAVHHAAFGDEGAVIEVLHRGSADLNARNKRRQTPLHIAVNKGHLQVVKTLLDFGCHPSLQDSEGDTPLHDAISKKRDDILAVLLEAGADVTITNNNGFNALHHAALRGNPSAMRVLLSKLPRPWIVDEKKDDGYTALHLAALNNHVEVAELLVHQGNANLDIQNVNQQTALHLAVERQHTQIVRLLVRAGAKLDIQDKDGDTPLHEALRHHTLSQLRQLQDMQDVGKVDAAWEPSKNTLIMGLGTQGAEKKSAASIACFLAANGADLSIRNKKGQSPLDLCPDPNLCKALAKCHKEKVSGQVGSRSPSMISNDSETLEECMVCSDMKRDTLFGPCGHIATCSLCSPRVKKCLICKEQVQSRTKIEECVVCSDKKAAVLFQPCGHMCACENCANLMKKCVQCRAVVERRVPFIMCCGGKSSEDATDDISSGNIPVLQKDKDNTNVNADVQKLQQQLQDIKEQTMCPVCLDRLKNMIFLCGHGTCQLCGDRMSECPICRKAIERRILLY,1006,NP_065825.1.csv,refseq-MIB1-NM_020774.3_clinical_seed_0_final,refseq-MIB1-NM_020774.3.a2m,Invitae,refseq-MIB1-NM_020774.3.npy,1,1006,1006
+NP_065842.1,MPSTNRAGSLKDPEIAELFFKEDPEKLFTDLREIGHGSFGAVYFARDVRTNEVVAIKKMSYSGKQSTEKWQDIIKEVKFLQRIKHPNSIEYKGCYLREHTAWLVMEYCLGSASDLLEVHKKPLQEVEIAAITHGALQGLAYLHSHTMIHRDIKAGNILLTEPGQVKLADFGSASMASPANSFVGTPYWMAPEVILAMDEGQYDGKVDVWSLGITCIELAERKPPLFNMNAMSALYHIAQNESPTLQSNEWSDYFRNFVDSCLQKIPQDRPTSEELLKHIFVLRERPETVLIDLIQRTKDAVRELDNLQYRKMKKLLFQEAHNGPAVEAQEEEEEQDHGVGRTGTVNSVGSNQSIPSMSISASSQSSSVNSLPDVSDDKSELDMMEGDHTVMSNSSVIHLKPEEENYREEGDPRTRASDPQSPPQVSRHKSHYRNREHFATIRTASLVTRQMQEHEQDSELREQMSGYKRMRRQHQKQLMTLENKLKAEMDEHRLRLDKDLETQRNNFAAEMEKLIKKHQAAMEKEAKVMSNEEKKFQQHIQAQQKKELNSFLESQKREYKLRKEQLKEELNENQSTPKKEKQEWLSKQKENIQHFQAEEEANLLRRQRQYLELECRRFKRRMLLGRHNLEQDLVREELNKRQTQKDLEHAMLLRQHESMQELEFRHLNTIQKMRCELIRLQHQTELTNQLEYNKRRERELRRKHVMEVRQQPKSLKSKELQIKKQFQDTCKIQTRQYKALRNHLLETTPKSEHKAVLKRLKEEQTRKLAILAEQYDHSINEMLSTQALRLDEAQEAECQVLKMQLQQELELLNAYQSKIKMQAEAQHDRELRELEQRVSLRRALLEQKIEEEMLALQNERTERIRSLLERQAREIEAFDSESMRLGFSNMVLSNLSPEAFSHSYPGASGWSHNPTGGPGPHWGHPMGGPPQAWGHPMQGGPQPWGHPSGPMQGVPRGSSMGVRNSPQALRRTASGGRTEQGMSRSTSVTSQISNGSHMSYT,1001,NP_065842.1.csv,refseq-TAOK1-NM_020791.2_clinical_seed_0_final,refseq-TAOK1-NM_020791.2.a2m,Invitae,refseq-TAOK1-NM_020791.2.npy,1,1001,1001
+NP_065851.1,MRLKISLLKEPKHQELVSCVGWTTAEELYSCSDDHQIVKWNLLTSETTQIVKLPDDIYPIDFHWFPKSLGVKKQTQAESFVLTSSDGKFHLISKLGRVEKSVEAHCGAVLAGRWNYEGTALVTVGEDGQIKIWSKTGMLRSTLAQQGTPVYSVAWGPDSEKVLYTAGKQLIIKPLQPNAKVLQWKAHDGIILKVDWNSVNDLILSAGEDCKYKVWDSYGRPLYNSQPHEHPITSVAWAPDGELFAVGSFHTLRLCDKTGWSYALEKPNTGSIFNIAWSIDGTQIAGACGNGHVVFAHVVEQHWEWKNFQVTLTKRRAMQVRNVLNDAVDLLEFRDRVIKASLNYAHLVVSTSLQCYVFSTKNWNTPIIFDLKEGTVSLILQAERHFLLVDGSSIYLYSYEGRFISSPKFPGMRTDILNAQTVSLSNDTIAIRDKADEKIIFLFEASTGKPLGDGKFLSHKNEILEIALDQKGLTNDRKIAFIDKNRDLCITSVKRFGKEEQIIKLGTMVHTLAWNDTCNILCGLQDTRFIVWYYPNTVYVDRDILPKTLYERDASEFSKNPHIVSFVGNQVTIRRADGSLVHISITPYPAILHEYVSSSKWEDAVRLCRFVKEQTMWACLAAMAVANRDMTTAEIAYAAIGEIDKVQYINSIKNLPSKESKMAHILLFSGNIQEAEIVLLQAGLVYQAIQININLYNWERALELAVKYKTHVDTVLAYRQKFLETFGKQETNKRYLHYAEGLQIDWEKIKAKIEMEITKEREQSSSSQSSKSIGLKP,777,NP_065851.1.csv,refseq-IFT80-NM_020800.2_clinical_seed_0_final,refseq-IFT80-NM_020800.2.a2m,Invitae,refseq-IFT80-NM_020800.2.npy,1,777,777
+NP_065857.1,MATEGMILTNHDHQIRVGVLTVSDSCFRNLAEDRSGINLKDLVQDPSLLGGTISAYKIVPDEIEEIKETLIDWCDEKELNLILTTGGTGFAPRDVTPEATKEVIEREAPGMALAMLMGSLNVTPLGMLSRPVCGIRGKTLIINLPGSKKGSQECFQFILPALPHAIDLLRDAIVKVKEVHDELEDLPSPPPPLSPPPTTSPHKQTEDKGVQCEEEEEEKKDSGVASTEDSSSSHITAAAIAAKKHPFYTSPAVVMAHGEQPIPGLINYSHHSTDERIPDSIISRGVQVLPRDTASLSTTPSESPRAQATSRLSTASCPTPKVQSRCSSKENILRASHSAVDITKVARRHRMSPFPLTSMDKAFITVLEMTPVLGTEIINYRDGMGRVLAQDVYAKDNLPPFPASVKDGYAVRAADGPGDRFIIGESQAGEQPTQTVMPGQVMRVTTGAPIPCGADAVVQVEDTELIRESDDGTEELEVRILVQARPGQDIRPIGHDIKRGECVLAKGTHMGPSEIGLLATVGVTEVEVNKFPVVAVMSTGNELLNPEDDLLPGKIRDSNRSTLLATIQEHGYPTINLGIVGDNPDDLLNALNEGISRADVIITSGGVSMGEKDYLKQVLDIDLHAQIHFGRVFMKPGLPTTFATLDIDGVRKIIFALPGNPVSAVVTCNLFVVPALRKMQGILDPRPTIIKARLSCDVKLDPRPEYHRCILTWHHQEPLPWAQSTGNQMSSRLMSMRSANGLLMLPPKTEQYVELHKGEVVDVMVIGRL,769,NP_065857.1.csv,refseq-GPHN-NM_020806.4_clinical_seed_0_final,refseq-GPHN-NM_020806.4.a2m,Invitae,refseq-GPHN-NM_020806.4.npy,1,769,769
+NP_065873.2,MPLPDGARTPGGVCREARGGGYTNRTFEFDDGQCAPRRPCAGDGALLDTAGFKMSDLDSEVLPLPPRYRFRDLLLGDPSFQNDDRVQVEFYVNENTFKERLKLFFIKNQRSSLRIRLFNFSLKLLTCLLYIVRVLLDDPALGIGCWGCPKQNYSFNDSSSEINWAPILWVERKMTLWAIQVIVAIISFLETMLLIYLSYKGNIWEQIFRVSFVLEMINTLPFIITIFWPPLRNLFIPVFLNCWLAKHALENMINDFHRAILRTQSAMFNQVLILFCTLLCLVFTGTCGIQHLERAGENLSLLTSFYFCIVTFSTVGYGDVTPKIWPSQLLVVIMICVALVVLPLQFEELVYLWMERQKSGGNYSRHRAQTEKHVVLCVSSLKIDLLMDFLNEFYAHPRLQDYYVVILCPTEMDVQVRRVLQIPLWSQRVIYLQGSALKDQDLMRAKMDNGEACFILSSRNEVDRTAADHQTILRAWAVKDFAPNCPLYVQILKPENKFHVKFADHVVCEEECKYAMLALNCICPATSTLITLLVHTSRGQEGQESPEQWQRMYGRCSGNEVYHIRMGDSKFFREYEGKSFTYAAFHAHKKYGVCLIGLKREDNKSILLNPGPRHILAASDTCFYINITKEENSAFIFKQEEKRKKRAFSGQGLHEGPARLPVHSIIASMGTVAMDLQGTEHRPTQSGGGGGGSKLALPTENGSGSRRPSIAPVLELADSSALLPCDLLSDQSEDEVTPSDDEGLSVVEYVKGYPPNSPYIGSSPTLCHLLPVKAPFCCLRLDKGCKHNSYEDAKAYGFKNKLIIVSAETAGNGLYNFIVPLRAYYRSRKELNPIVLLLDNKPDHHFLEAICCFPMVYYMEGSVDNLDSLLQCGIIYADNLVVVDKESTMSAEEDYMADAKTIVNVQTMFRLFPSLSITTELTHPSNMRFMQFRAKDSYSLALSKLEKRERENGSNLAFMFRLPFAAGRVFSISMLDTLLYQSFVKDYMITITRLLLGLDTTPGSGYLCAMKITEGDLWIRTYGRLFQKLCSSSAEIPIGIYRTESHVFSTSEPHDLRAQSQISVNVEDCEDTREVKGPWGSRAGTGGSSQGRHTGGGDPAEHPLLRRKSLQWARRLSRKAPKQAGRAAAAEWISQQRLSLYRRSERQELSELVKNRMKHLGLPTTGYDEMNDHQNTLSYVLINPPPDTRLEPSDIVYLIRSDPLAHVASSSQSRKSSCSHKLSSCNPETRDETQL,1235,NP_065873.2.csv,refseq-KCNT1-NM_020822.2_clinical_seed_0_final,refseq-KCNT1-NM_020822.2.a2m,Invitae,refseq-KCNT1-NM_020822.2.npy,1,1235,1235
+NP_065875.3,MMATRRTGLSEGDGDKLKACEVSKNKDGKEQSETVSLSEDETFSWPGPKTVTLKRTSQGFGFTLRHFIVYPPESAIQFSYKDEENGNRGGKQRNRLEPMDTIFVKQVKEGGPAFEAGLCTGDRIIKVNGESVIGKTYSQVIALIQNSDTTLELSVMPKDEDILQVLQFTKDVTALAYSQDAYLKGNEAYSGNARNIPEPPPICYPWLPSAPSAMAQPVEISPPDSSLSKQQTSTPVLTQPGRAYRMEIQVPPSPTDVAKSNTAVCVCNESVRTVIVPSEKVVDLLSNRNNHTGPSHRTEEVRYGVSEQTSLKTVSRTTSPPLSIPTTHLIHQPAGSRSLEPSGILLKSGNYSGHSDGISSSRSQAVEAPSVSVNHYSPNSHQHIDWKNYKTYKEYIDNRRLHIGCRTIQERLDSLRAASQSTTDYNQVVPNRTTLQGRRRSTSHDRVPQSVQIRQRSVSQERLEDSVLMKYCPRSASQGALTSPSVSFSNHRTRSWDYIEGQDETLENVNSGTPIPDSNGEKKQTYKWSGFTEQDDRRGICERPRQQEIHKSFRGSNFTVAPSVVNSDNRRMSGRGVGSVSQFKKIPPDLKTLQSNRNFQTTCGMSLPRGISQDRSPLVKVRSNSLKAPSTHVTKPSFSQKSFVSIKDQRPVNHLHQNSLLNQQTWVRTDSAPDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRNQDLSLQEAETEQSDTLDNKEAVILREKPPSGRQTPQPLRHQSYILAVNDQETGSDTTCWLPNDARREVHIKRMEERKASSTSPPGDSLASIPFIDEPTSPSIDHDIAHIPASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSHDHESVGPPSLDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLKGIKIADSQKSSEDSGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGSIRPWKQMYVVLRGHSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCECLFQAEDRDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLPKTPRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPPKDKGTWRKGIPSIMRKTFEKKPTATGTFGVRLDDCPPAHTNRYIPLIVDICCKLVEERGLEYTGIYRVPGNNAAISSMQEELNKGMADIDIQDDKWRDLNVISSLLKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIHDLPEHHYETLKFLSAHLKTVAENSEKNKMEPRNLAIVFGPTLVRTSEDNMTHMVTHMPDQYKIVETLIQHHDWFFTEEGAEEPLTTVQEESTVDSQPVPNIDHLLTNIGRTGVSPGDVSDSATSDSTKSKGSWGSGKDQYSRELLVSSIFAAASRKRKKPKEKAQPSSSEDELDNVFFKKENVEQCHNDTKEESKKESETLGRKQKIIIAKENSTRKDPSTTKDEKISLGKESTPSEEPSPPHNSKHNKSPTLSCRFAILKESPRSLLAQKSSHLEETGSDSGTLLSTSSQASLARFSMKKSTSPETKHSEFLANVSTITSDYSTTSSATYLTSLDSSRLSPEVQSVAESKGDEADDERSELISEGRPVETDSESEFPVFPTALTSERLFRGKLQEVTKSSRRNSEGSELSCTEGSLTSSLDSRRQLFSSHKLIECDTLSRKKSARFKSDSGSLGDAKNEKEAPSLTKVFDVMKKGKSTGSLLTPTRGESEKQEPTWKTKIADRLKLRPRAPADDMFGVGNHKVNAETAKRKSIRRRHTLGGHRDATEISVLNFWKVHEQSGERESELSAVNRLKPKCSAQDLSISDWLARERLRTSTSDLSRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSSSTLASTNRPLLSIPPQSPDQINGESFQNVSKNASSAANAQPHKLSETPGSKAEFHPCL,1958,NP_065875.3.csv,refseq-ARHGAP21-NM_020824.3_clinical_seed_0_final,refseq-ARHGAP21-NM_020824.3.a2m,Invitae,refseq-ARHGAP21-NM_020824.3_theta_0.2.npy,1,1958,1958
+NP_065970.2,MDSKKRSSTEAEGSKERGLVHIWQAGSFPITPERLPGWGGKTVLQAALGVKHGVLLTEDGEVYSFGTLPWRSGPVEICPSSPILENALVGQYVITVATGSFHSGAVTDNGVAYMWGENSAGQCAVANQQYVPEPNPVSIADSEASPLLAVRILQLACGEEHTLALSISREIWAWGTGCQLGLITTAFPVTKPQKVEHLAGRVVLQVACGAFHSLALVQCLPSQDLKPVPERCNQCSQLLITMTDKEDHVIISDSHCCPLGVTLTESQAENHASTALSPSTETLDRQEEVFENTLVANDQSVATELNAVSAQITSSDAMSSQQNVMGTTEISSARNIPSYPDTQAVNEYLRKLSDHSVREDSEHGEKPVPSQPLLEEAIPNLHSPPTTSTSALNSLVVSCASAVGVRVAATYEAGALSLKKVMNFYSTTPCETGAQAGSSAIGPEGLKDSREEQVKQESMQGKKSSSLVDIREEETEGGSRRLSLPGLLSQVSPRLLRKAARVKTRTVVLTPTYSGEADALLPSLRTEVWTWGKGKEGQLGHGDVLPRLQPLCVKCLDGKEVIHLEAGGYHSLALTAKSQVYSWGSNTFGQLGHSDFPTTVPRLAKISSENGVWSIAAGRDYSLFLVDTEDFQPGLYYSGRQDPTEGDNLPENHSGSKTPVLLSCSKLGYISRVTAGKDSYLALVDKNIMGYIASLHELATTERRFYSKLSDIKSQILRPLLSLENLGTTTTVQLLQEVASRFSKLCYLIGQHGASLSSFLHGVKEARSLVILKHSSLFLDSYTEYCTSITNFLVMGGFQLLAKPAIDFLNKNQELLQDLSEVNDENTQLMEILNTLFFLPIRRLHNYAKVLLKLATCFEVASPEYQKLQDSSSCYECLALHLGRKRKEAEYTLGFWKTFPGKMTDSLRKPERRLLCESSNRALSLQHAGRFSVNWFILFNDALVHAQFSTHHVFPLATLWAEPLSEEAGGVNGLKITTPEEQFTLISSTPQEKTKWLRAISQAVDQALRGMSDLPPYGSGSSVQRQEPPISRSAKYTFYKDPRLKDATYDGRWLSGKPHGRGVLKWPDGKMYSGMFRNGLEDGYGEYRIPNKAMNKEDHYVGHWKEGKMCGQGVYSYASGEVFEGCFQDNMRHGHGLLRSGKLTSSSPSMFIGQWVMDKKAGYGVFDDITRGEKYMGMWQDDVCQGNGVVVTQFGLYYEGNFHLNKMMGNGVLLSEDDTIYEGEFSDDWTLSGKGTLTMPNGDYIEGYFSGEWGSGIKITGTYFKPSLYESDKDRPKVFRKLGNLAVPADEKWKAVFDECWRQLGCEGPGQGEVWKAWDNIAVALTTSRRQHRDSPEILSRSQTQTLESLEFIPQHVGAFSVEKYDDIRKYLIKACDTPLHPLGRLVETLVAVYRMTYVGVGANRRLLQEAVKEIKSYLKRIFQLVRFLFPELPEEGSTIPLSAPLPTERKSFCTGKSDSRSESPEPGYVVTSSGLLLPVLLPRLYPPLFMLYALDNDREEDIYWECVLRLNKQPDIALLGFLGVQRKFWPATLSILGESKKVLPTTKDACFASAVECLQQISTTFTPSDKLKVIQQTFEEISQSVLASLHEDFLWSMDDLFPVFLYVVLRARIRNLGSEVHLIEDLMDPYLQHGEQGIMFTTLKACYYQIQREKLN,1657,NP_065970.2.csv,refseq-ALS2-NM_020919.3_clinical_seed_0_final,refseq-ALS2-NM_020919.3.a2m,Invitae,refseq-ALS2-NM_020919.3.npy,1,1657,1657
+NP_065979.1,MAERGQQPPPAKRLCCRPGGGGGGGGSSGGGGGAGGGYSSACRPGPRAGGAAAAAACGGGAALGLLPPGKTQSPESLLDIAARRVAEKWPFQRVEERFERIPEPVQRRIVYWSFPRSEREICMYSSFNTGGGAAGGPGDDSGGGGGAGGGGGGGSSSSPAATSAAATSAAAAAAAAAAAAAAAAGAGAPSVGAAGAADGGDETRLPFRRGIALLESGCVDNVLQVGFHLSGTVTEPAIQSEPETVCNVAISFDRCKITSVTCSCGNKDIFYCAHVVALSLYRIRKPDQVKLHLPISETLFQMNRDQLQKFVQYLITVHHTEVLPTAQKLADEILSQNSEINQVHGAPDPTAGASIDDENCWHLDEEQVQEQVKLFLSQGGYHGSGKQLNLLFAKVREMLKMRDSNGARMLTLITEQFMADPRLSLWRQQGTAMTDKYRQLWDELGALWMCIVLNPHCKLEQKASWLKQLKKWNSVDVCPWEDGNHGSELPNLTNALPQGANANQDSSNRPHRTVFTRAIEACDLHWQDSHLQHIISSDLYTNYCYHDDTENSLFDSRGWPLWHEHVPTACARVDALRSHGYPREALRLAIAIVNTLRRQQQKQLEMFRTQKKELPHKNITSITNLEGWVGHPLDPVGTLFSSLMEACRIDDENLSGFSDFTENMGQCKSLEYQHLPAHKFLEEGESYLTLAVEVALIGLGQQRIMPDGLYTQEKVCRNEEQLISKLQEIELDDTLVKIFRKQAVFLLEAGPYSGLGEIIHRESVPMHTFAKYLFTSLLPHDAELAYKIALRAMRLLVLESTAPSGDLTRPHHIASVVPNRYPRWFTLSHIESQQCELASTMLTAAKGDVRRLETVLESIQKNIHSSSHIFKLAQDAFKIATLMDSLPDITLLKVSLELGLQVMRMTLSTLNWRRREMVRWLVTCATEVGVYALDSIMQTWFTLFTPTEATSIVATTVMSNSTIVRLHLDCHQQEKLASSARTLALQCAMKDPQNCALSALTLCEKDHIAFETAYQIVLDAATTGMSYTQLFTIARYMEHRGYPMRAYKLATLAMTHLNLSYNQDTHPAINDVLWACALSHSLGKNELAAIIPLVVKSVKCATVLSDILRRCTLTTPGMVGLHGRRNSGKLMSLDKAPLRQLLDATIGAYINTTHSRLTHISPRHYSEFIEFLSKARETFLMAHDGHIQFTQFIDNLKQIYKGKKKLMMLVRERFG,1215,NP_065979.1.csv,refseq-ZSWIM6-NM_020928.1_clinical_seed_0_final,refseq-ZSWIM6-NM_020928.1.a2m,Invitae,refseq-ZSWIM6-NM_020928.1.npy,1,1215,1215
+NP_065995.1,MGTQDPGNMGTGVPASEQISCAKEDPQVYCPEETGGTKDVQVTDCKSPEDSRPPKETDCCNPEDSGQLMVSYEGKAMGYQVPPFGWRICLAHEFTEKRKPFQANNVSLSNMIKHIGMGLRYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLRSWNWGLCGYFAFYHALYPRAWTVYQLPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPNPYTMAVAARVTAATTVTHITAFDPDSTGQQVWQDLLQDGQLDSPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLEVLEDSLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQHKKASWPKVKQGTGLRTGPMFGPKEAMANLSPE,927,NP_065995.1.csv,refseq-GBA2-NM_020944.2_clinical_seed_0_final,refseq-GBA2-NM_020944.2.a2m,Invitae,refseq-GBA2-NM_020944.2.npy,1,927,927
+NP_066005.2,MECPSCQHVSKEETPKFCSQCGERLPPAAPIADSENNNSTMASASEGEMECGQELKEEGGPCLFPGSDSWQENPEEPCSKASWTVQESKKKKRKKKKKGNKSASSELASLPLSPASPCHLTLLSNPWPQDTALPHSQAQQSGPTGQPSQPPGTATTPLEGDGLSAPTEVGDSPLQAQALGEAGVATGSEAQSSPQFQDHTEGEDQDASIPSGGRGLSQEGTGPPTSAGEGHSRTEDAAQELLLPESKGGSSEPGTELQTTEQQAGASASMAVDAVAEPANAVKGAGKEMKEKTQRMKQPPATTPPFKTHCQEAETKTKDEMAAAEEKVGKNEQGEPEDLKKPEGKNRSAAAVKNEKEQKNQEADVQEVKASTLSPGGGVTVFFHAIISLHFPFNPDLHKVFIRGGEEFGESKWDSNICELHYTRDLGHDRVLVEGIVCISKKHLDKYIPYKYVIYNGESFEYEFIYKHQQKKGEYVNRCLFIKSSLLGSGDWHQYYDIVYMKPHGRLQKVMNHITDGPRKDLVKGKQIAAALMLDSTFSILQTWDTINLNSFFTQFEQFCFVLQQPMIYEGQAQLWTDLQYREKEVKRYLWQHLKKHVVPLPDGKSTDFLPVDCPVRSKLKTGLIVLFVVEKIELLLEGSLDWLCHLLTSDASSPDEFHRDLSHILGIPQSWRLYLVNLCQRCMDTRTYTWLGALPVLHCCMELAPRHKDAWRQPEDTWAALEGLSFSPFREQMLDTSSLLQFMREKQHLLSIDEPLFRSWFSLLPLSHLVMYMENFIEHLGRFPAHILDCLSGIYYRLPGLEQVLNTQDVQDVQNVQNILEMLLRLLDTYRDKIPEEALSPSYLTVCLKLHEAICSSTKLLKFYELPALSAEIVCRMIRLLSLVDSAGQRDETGNNSVQTVFQGTLAATKRWLREVFTKNMLTSSGASFTYVKEIEVWRRLVEIQFPAEHGWKESLLGDMEWRLTKEEPLSQITAYCNSCWDTKGLEDSVAKTFEKCIIEAVSSACQVNNLSSWETDSGSQLCSAMTQLRAMKHPLGLSSSANSEIGKWAPSSLAKGNGAEI,1063,NP_066005.2.csv,refseq-RNF213-NM_020954.3_clinical_seed_0_final,refseq-RNF213-NM_020954.3.a2m,Invitae,refseq-RNF213-NM_020954.3_theta_0.2.npy,1,1063,1063
+NP_066124.1,MAKATSGAAGLRLLLLLLLPLLGKVALGLYFSRDAYWEKLYVDQAAGTPLLYVHALRDAPEEVPSFRLGQHLYGTYRTRLHENNWICIQEDTGLLYLNRSLDHSSWEKLSVRNRGFPLLTVYLKVFLSPTSLREGECQWPGCARVYFSFFNTSFPACSSLKPRELCFPETRPSFRIRENRPPGTFHQFRLLPVQFLCPNISVAYRLLEGEGLPFRCAPDSLEVSTRWALDREQREKYELVAVCTVHAGAREEVVMVPFPVTVYDEDDSAPTFPAGVDTASAVVEFKRKEDTVVATLRVFDADVVPASGELVRRYTSTLLPGDTWAQQTFRVEHWPNETSVQANGSFVRATVHDYRLVLNRNLSISENRTMQLAVLVNDSDFQGPGAGVLLLHFNVSVLPVSLHLPSTYSLSVSRRARRFAQIGKVCVENCQAFSGINVQYKLHSSGANCSTLGVVTSAEDTSGILFVNDTKALRRPKCAELHYMVVATDQQTSRQAQAQLLVTVEGSYVAEEAGCPLSCAVSKRRLECEECGGLGSPTGRCEWRQGDGKGITRNFSTCSPSTKTCPDGHCDVVETQDINICPQDCLRGSIVGGHEPGEPRGIKAGYGTCNCFPEEEKCFCEPEDIQDPLCDELCRTVIAAAVLFSFIVSVLLSAFCIHCYHKFAHKPPISSAEMTFRRPAQAFPVSYSSSGARRPSLDSMENQVSVDAFKILEDPKWEFPRKNLVLGKTLGEGEFGKVVKATAFHLKGRAGYTTVAVKMLKENASPSELRDLLSEFNVLKQVNHPHVIKLYGACSQDGPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHPDERALTMGDLISFAWQISQGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYEEDSYVKRSQGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLFNLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKRRDYLDLAASTPSDSLIYDDGLSEEETPLVDCNNAPLPRALPSTWIENKLYGMSDPNWPGESPVPLTRADGTNTGFPRYPNDSVYANWMLSPSAAKLMDTFDS,1114,NP_066124.1.csv,refseq-RET-NM_020975.6_clinical_seed_0_final,refseq-RET-NM_020975.6.a2m,Invitae,refseq-RET-NM_020975.6.npy,1,1114,1114
+NP_066264.4,MAAKTPSSEESGLPKLPVPPLQQTLATYLQCMRHLVSEEQFRKSQAIVQQFGAPGGLGETLQQKLLERQEKTANWVSEYWLNDMYLNNRLALPVNSSPAVIFARQHFPGTDDQLRFAASLISGVLSYKALLDSHSIPTDCAKGQLSGQPLCMKQYYGLFSSYRLPGHTQDTLVAQNSSIMPEPEHVIVACCNQFFVLDVVINFRRLSEGDLFTQLRKIVKMASNEDERLPPIGLLTSDGRSEWAEARTVLVKDSTNRDSLDMIERCICLVCLDAPGGVELSDTHRALQLLHGGGYSKNGANRWYDKSLQFVVGRDGTCGVVCEHSPFDGIVLVQCTEHLLKHVTQSSRKLIRADSVSELPAPRRLRWKCSPEIQGHLASSAEKLQRIVKNLDFIVYKFDNYGKTFIKKQKCSPDAFIQVALQLAFYRLHRRLVPTYESASIRRFQEGRVDNIRSATPEALAFVRAVTDHKAAVPASEKLLLLKDAIRAQTAYTVMAITGMAIDNHLLALRELARAMCKELPEMFMDETYLMSNRFVLSTSQVPTTTEMFCCYGPVVPNGYGACYNPQPETILFCISSFHSCKETSSSKFAKAVEESLIDMRDLCSLLPPTESKPLATKEKATRPSQGHQP,630,NP_066264.4.csv,refseq-CHAT-NM_020984.4_clinical_seed_0_final,refseq-CHAT-NM_020984.4.a2m,Invitae,refseq-CHAT-NM_020984.4_theta_0.2.npy,1,630,630
+NP_066268.1,MGCTLSAEERAALERSKAIEKNLKEDGISAAKDVKLLLLGAGESGKSTIVKQMKIIHEDGFSGEDVKQYKPVVYSNTIQSLAAIVRAMDTLGIEYGDKERKADAKMVCDVVSRMEDTEPFSAELLSAMMRLWGDSGIQECFNRSREYQLNDSAKYYLDSLDRIGAADYQPTEQDILRTRVKTTGIVETHFTFKNLHFRLFDVGGQRSERKKWIHCFEDVTAIIFCVALSGYDQVLHEDETTNRMHESLMLFDSICNNKFFIDTSIILFLNKKDLFGEKIKKSPLTICFPEYTGPNTYEDAAAYIQAQFESKNRSPNKEIYCHMTCATDTNNIQVVFDAVTDIIIANNLRGCGLY,354,NP_066268.1.csv,refseq-GNAO1-NM_020988.2_clinical_seed_0_final,refseq-GNAO1-NM_020988.2.a2m,Invitae,refseq-GNAO1-NM_020988.2.npy,1,354,354
+NP_066269.1,MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY,174,NP_066269.1.csv,refseq-CRYGC-NM_020989.3_clinical_seed_0_final,refseq-CRYGC-NM_020989.3.a2m,Invitae,refseq-CRYGC-NM_020989.3.npy,1,174,174
+NP_066279.2,MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRARRGGRSRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTKIETLRFAHNYIWALTQTLRIADHSLYALEPPAPHCGELGSPGGSPGDWGSLYSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL,214,NP_066279.2.csv,refseq-NEUROG3-NM_020999.3_clinical_seed_0_final,refseq-NEUROG3-NM_020999.3.a2m,Invitae,refseq-NEUROG3-NM_020999.3.npy,1,214,214
+NP_066287.2,MAQSVLVPPGPDSFRFFTRESLAAIEQRIAEEKAKRPKQERKDEDDENGPKPNSDLEAGKSLPFIYGDIPPEMVSVPLEDLDPYYINKKTFIVLNKGKAISRFSATPALYILTPFNPIRKLAIKILVHSLFNMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPDNSSFEINITSFFNNSLDGNGTTFNRTVSIFNWDEYIEDKSHFYFLEGQNDALLCGNSSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAAAAAASAESRDFSGAGGIGVFSESSSVASKLSSKSEKELKNRRKKKKQKEQSGEEEKNDRVRKSESEDSIRRKGFRFSLEGSRLTYEKRFSSPHQSLLSIRGSLFSPRRNSRASLFSFRGRAKDIGSENDFADDEHSTFEDNDSRRDSLFVPHRHGERRHSNVSQASRASRVLPILPMNGKMHSAVDCNGVVSLVGGPSTLTSAGQLLPEGTTTETEIRKRRSSSYHVSMDLLEDPTSRQRAMSIASILTNTMEELEESRQKCPPCWYKFANMCLIWDCCKPWLKVKHLVNLVVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLSVGNLVFTGIFTAEMFLKIIAMDPYYYFQEGWNIFDGFIVSLSLMELGLANVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKSYKECVCKISNDCELPRWHMHDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLTVFMMVMVIGNLVVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDFVKRKIREFIQKAFVRKQKALDEIKPLEDLNNKKDSCISNHTTIEIGKDLNYLKDGNGTTSGIGSSVEKYVVDESDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESDMEESKEKLNATSSSEGSTVDIGAPAEGEQPEVEPEESLEPEACFTEDCVRKFKCCQISIEEGKGKLWWNLRKTCYKIVEHNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQVYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCINYTTGEMFDVSVVNNYSECKALIESNQTARWKNVKVNFDNVGLGYLSLLQVATFKGWMDIMYAAVDSRNVELQPKYEDNLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPANKFQGMVFDFVTKQVFDISIMILICLNMVTMMVETDDQSQEMTNILYWINLVFIVLFTGECVLKLISLRYYYFTIGWNIFDFVVVILSIVGMFLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSGPPDCDPDKDHPGSSVKGDCGNPSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFAKLSDFADALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYEPITTTLKRKQEEVSAIIIQRAYRRYLLKQKVKKVSSIYKKDKGKECDGTPIKEDTLIDKLNENSTPEKTDMTPSTTSPPSYDSVTKPEKEKFEKDKSEKEDKGKDIRESKK,2005,NP_066287.2.csv,refseq-SCN2A-NM_021007.2_clinical_seed_0_final,refseq-SCN2A-NM_021007.2.a2m,Invitae,refseq-SCN2A-NM_021007.2.npy,1,2005,2005
+NP_066288.2,MEDSDSAAKQLGLAEAAAVAAAAAVAAAAAAAAGGEAEEPVLSRDEDSEEDADSEAERETPRVTAVAVMAAEPGHMDMGAEALPGPDEAAAAAAFAEVTTVTVANVGAAADNVFTTSVANAASISGHVLSGRTALQIGDSLNTEKATLIVVHTDGSIVETTGLKGPAAPLTPGPQSPPTPLAPGQEKGGTKYNWDPSVYDSELPVRCRNISGTLYKNRLGSGGRGRCIKQGENWYSPTEFEAMAGRASSKDWKRSIRYAGRPLQCLIQDGILNPHAASCTCAACCDDMTLSGPVRLFVPYKRRKKENELPTTPVKKDSPKNITLLPATAATTFTVTPSGQITTSGALTFDRASTVEATAVISESPAQGDVFAGATVQEASVQPPCRASHPEPHYPGYQDSCQIAPFPEAALPTSHPKIVLTSLPALAVPPPTPTKAAPPALVNGLELSEPRSWLYLEEMVNSLLNTAQQLKTLFEQAKHASTYREAATNQAKIHADAERKEQSCVNCGREAMSECTGCHKVNYCSTFCQRKDWKDHQHICGQSAAVTVQADEVHVAESVMEKVTV,565,NP_066288.2.csv,refseq-DEAF1-NM_021008.3_clinical_seed_0_final,refseq-DEAF1-NM_021008.3.a2m,Invitae,refseq-DEAF1-NM_021008.3.npy,1,565,565
+NP_066360.1,MAAAIASSLIRQKRQARESNSDRVSASKRRSSPSKDGRSLCERHVLGVFSKVRFCSGRKRPVRRRPEPQLKGIVTRLFSQQGYFLQMHPDGTIDGTKDENSDYTLFNLIPVGLRVVAIQGVKASLYVAMNGEGYLYSSDVFTPECKFKESVFENYYVIYSSTLYRQQESGRAWFLGLNKEGQIMKGNRVKKTKPSSHFVPKPIEVCMYREPSLHEIGEKQGRSRKSSGTPTMNGGKVVNQDST,243,NP_066360.1.csv,refseq-FGF12-NM_021032.4_clinical_seed_0_final,refseq-FGF12-NM_021032.4.a2m,Invitae,refseq-FGF12-NM_021032.4.npy,1,243,243
+NP_066382.1,MALLTNLLPLCCLALLALPAQSCGPGRGPVGRRRYARKQLVPLLYKQFVPGVPERTLGASGPAEGRVARGSERFRDLVPNYNPDIIFKDEENSGADRLMTERCKERVNALAIAVMNMWPGVRLRVTEGWDEDGHHAQDSLHYEGRALDITTSDRDRNKYGLLARLAVEAGFDWVYYESRNHVHVSVKADNSLAVRAGGCFPGNATVRLWSGERKGLRELHRGDWVLAADASGRVVPTPVLLFLDRDLQRRASFVAVETEWPPRKLLLTPWHLVFAARGPAPAPGDFAPVFARRLRAGDSVLAPGGDALRPARVARVAREEAVGVFAPLTAHGTLLVNDVLASCYAVLESHQWAHRAFAPLRLLHALGALLPGGAVQPTGMHWYSRLLYRLAEELLG,396,NP_066382.1.csv,refseq-DHH-NM_021044.2_clinical_seed_0_final,refseq-DHH-NM_021044.2.a2m,Invitae,refseq-DHH-NM_021044.2.npy,1,396,396
+NP_066545.3,MFCEKAMELIRELHRAPEGQLPAFNEDGLRQVLEEMKALYEQNQSDVNEAKSGGRSDLIPTIKFRHCSLLRNRRCTVAYLYDRLLRIRALRWEYGSVLPNALRFHMAAEEMEWFNNYKRSLATYMRSLGGDEGLDITQDMKPPKSLYIEVRCLKDYGEFEVDDGTSVLLKKNSQHFLPRWKCEQLIRQGVLEHILS,196,NP_066545.3.csv,refseq-GINS1-NM_021067.4_clinical_seed_0_final,refseq-GINS1-NM_021067.4.a2m,Invitae,refseq-GINS1-NM_021067.4.npy,1,196,196
+NP_066550.2,MEGGGKPNSSSNSRDDGNSVFPAKASATGAGPAAAEKRLGTPPGGGGAGAKEHGNSVCFKVDGGGGGGGGGGGGEEPAGGFEDAEGPRRQYGFMQRQFTSMLQPGVNKFSLRMFGSQKAVEKEQERVKTAGFWIIHPYSDFRFYWDLIMLIMMVGNLVIIPVGITFFTEQTTTPWIIFNVASDTVFLLDLIMNFRTGTVNEDSSEIILDPKVIKMNYLKSWFVVDFISSIPVDYIFLIVEKGMDSEVYKTARALRIVRFTKILSLLRLLRLSRLIRYIHQWEEIFHMTYDLASAVVRIFNLIGMMLLLCHWDGCLQFLVPLLQDFPPDCWVSLNEMVNDSWGKQYSYALFKAMSHMLCIGYGAQAPVSMSDLWITMLSMIVGATCYAMFVGHATALIQSLDSSRRQYQEKYKQVEQYMSFHKLPADMRQKIHDYYEHRYQGKIFDEENILNELNDPLREEIVNFNCRKLVATMPLFANADPNFVTAMLSKLRFEVFQPGDYIIREGAVGKKMYFIQHGVAGVITKSSKEMKLTDGSYFGEICLLTKGRRTASVRADTYCRLYSLSVDNFNEVLEEYPMMRRAFETVAIDRLDRIGKKNSILLQKFQKDLNTGVFNNQENEILKQIVKHDREMVQAIAPINYPQMTTLNSTSSTTTPTSRMRTQSPPVYTATSLSHSNLHSPSPSTQTPQPSAILSPCSYTTAVCSPPVQSPLAARTFHYASPTASQLSLMQQQPQQQVQQSQPPQTQPQQPSPQPQTPGSSTPKNEVHKSTQALHNTNLTREVRPLSASQPSLPHEVSTLISRPHPTVGESLASIPQPVTAVPGTGLQAGGRSTVPQRVTLFRQMSSGAIPPNRGVPPAPPPPAAALPRESSSVLNTDPDAEKPRFASNL,890,NP_066550.2.csv,refseq-HCN1-NM_021072.3_clinical_seed_0_final,refseq-HCN1-NM_021072.3.a2m,Invitae,refseq-HCN1-NM_021072.3.npy,1,890,890
+NP_066552.2,MFFSAALRARAAGLTAHWGRHVRNLHKTVMQNGAGGALFVHRDTPENNPDTPFDFTPENYKRIEAIVKNYPEGHKAAAVLPVLDLAQRQNGWLPISAMNKVAEVLQVPPMRVYEVATFYTMYNRKPVGKYHIQVCTTTPCMLRNSDSILEAIQKKLGIKVGETTPDKLFTLIEVECLGACVNAPMVQINDNYYEDLTAKDIEEIIDELKAGKIPKPGPRSGRFSCEPAGGLTSLTEPPKGPGFGVQAGL,249,NP_066552.2.csv,refseq-NDUFV2-NM_021074.4_clinical_seed_0_final,refseq-NDUFV2-NM_021074.4.a2m,Invitae,refseq-NDUFV2-NM_021074.4.npy,1,249,249
+NP_066566.3,MSTETELQVAVKTSAKKDSRKKGQDRSEATLIKRFKGEGVRYKAKLIGIDEVSAARGDKLCQDSMMKLKGVVAGARSKGEHKQKIFLTISFGGIKIFDEKTGALQHHHAVHEISYIAKDITDHRAFGYVCGKEGNHRFVAIKTAQAAEPVILDLRDLFQLIYELKQREELEKKAQKDKQCEQAVYQTILEEDVEDPVYQYIVFEAGHEPIRDPETEENIYQVPTSQKKEGVYDVPKSQPVSAVTQLELFGDMSTPPDITSPPTPATPGDAFIPSSSQTLPASADVFSSVPFGTAAVPSGYVAMGAVLPSFWGQQPLVQQQMVMGAQPPVAQVMPGAQPIAWGQPGLFPATQQPWPTVAGQFPPAAFMPTQTVMPLPAAMFQGPLTPLATVPGTSDSTRSSPQTDKPRQKMGKETFKDFQMAQPPPVPSRKPDQPSLTCTSEAFSSYFNKVGVAQDTDDCDDFDISQLNLTPVTSTTPSTNSPPTPAPRQSSPSKSSASHASDPTTDDIFEEGFESPSKSEEQEAPDGSQASSNSDPFGEPSGEPSGDNISPQAGS,555,NP_066566.3.csv,refseq-DAB1-NM_021080.3_clinical_seed_0_final,refseq-DAB1-NM_021080.3.a2m,Invitae,refseq-DAB1-NM_021080.3.npy,1,555,555
+NP_066918.2,MSVGVSTSAPLSPTSGTSVGMSTFSIMDYVVFVLLLVLSLAIGLYHACRGWGRHTVGELLMADRKMGCLPVALSLLATFQSAVAILGVPSEIYRFGTQYWFLGCCYFLGLLIPAHIFIPVFYRLHLTSAYEYLELRFNKTVRVCGTVTFIFQMVIYMGVVLYAPSLALNAVTGFDLWLSVLALGIVCTVYTALGGLKAVIWTDVFQTLVMFLGQLAVIIVGSAKVGGLGRVWAVASQHGRISGFELDPDPFVRHTFWTLAFGGVFMMLSLYGVNQAQVQRYLSSRTEKAAVLSCYAVFPFQQVSLCVGCLIGLVMFAYYQEYPMSIQQAQAAPDQFVLYFVMDLLKGLPGLPGLFIACLFSGSLSTISSAFNSLATVTMEDLIRPWFPEFSEARAIMLSRGLAFGYGLLCLGMAYISSQMGPVLQAAISIFGMVGGPLLGLFCLGMFFPCANPPGAVVGLLAGLVMAFWIGIGSIVTSMGSSMPPSPSNGSSFSLPTNLTVATVTTLMPLTTFSKPTGLQRFYSLSYLWYSAHNSTTVIVVGLIVSLLTGRMRGRSLNPATIYPVLPKLLSLLPLSCQKRLHCRSYGQDHLDTGLFPEKPRNGVLGDSRDKEAMALDGTAYQGSSSTCILQETSL,635,NP_066918.2.csv,NP_066918.2_clinical_seed_0_final,NP_066918.2.a2m,popEVE,NP_066918.2_theta_0.2.npy,1,635,635
+NP_066925.1,MAQLCGLRRSRAFLALLGSLLLSGVLAADRERSIHDFCLVSKVVGRCRASMPRWWYNVTDGSCQLFVYGGCDGNSNNYLTKEECLKKCATVTENATGDLATSRNAADSSVPSAPRRQDSEDHSSDMFNYEEYCTANAVTGPCRASFPRWYFDVERNSCNNFIYGGCRGNKNSYRSEEACMLRCFRQQENPPLPLGSKVVVLAGLFVMVLILFLGASMVYLIRVARRNQERALRTVWSSGDDKEQLVKNTYVL,252,NP_066925.1.csv,refseq-SPINT2-NM_021102.3_clinical_seed_0_final,refseq-SPINT2-NM_021102.3.a2m,Invitae,refseq-SPINT2-NM_021102.3.npy,1,252,252
+NP_066956.1,MESRDHNNPQEGPTSSSGRRAAVEDNHLLIKAVQNEDVDLVQQLLEGGANVNFQEEEGGWTPLHNAVQMSREDIVELLLRHGADPVLRKKNGATPFILAAIAGSVKLLKLFLSKGADVNECDFYGFTAFMEAAVYGKVKALKFLYKRGANVNLRRKTKEDQERLRKGGATALMDAAEKGHVEVLKILLDEMGADVNACDNMGRNALIHALLSSDDSDVEAITHLLLDHGADVNVRGERGKTPLILAVEKKHLGLVQRLLEQEHIEINDTDSDGKTALLLAVELKLKKIAELLCKRGASTDCGDLVMTARRNYDHSLVKVLLSHGAKEDFHPPAEDWKPQSSHWGAALKDLHRIYRPMIGKLKFFIDEKYKIADTSEGGIYLGFYEKQEVAVKTFCEGSPRAQREVSCLQSSRENSHLVTFYGSESHRGHLFVCVTLCEQTLEACLDVHRGEDVENEEDEFARNVLSSIFKAVQELHLSCGYTHQDLQPQNILIDSKKAAHLADFDKSIKWAGDPQEVKRDLEDLGRLVLYVVKKGSISFEDLKAQSNEEVVQLSPDEETKDLIHRLFHPGEHVRDCLSDLLGHPFFWTWESRYRTLRNVGNESDIKTRKSESEILRLLQPGPSEHSKSFDKWTTKINECVMKKMNKFYEKRGNFYQNTVGDLLKFIRNLGEHIDEEKHKKMKLKIGDPSLYFQKTFPDLVIYVYTKLQNTEYRKHFPQTHSPNKPQCDGAGGASGLASPGC,741,NP_066956.1.csv,RN5A_HUMAN_b03_clinical_seed_0_final,RN5A_HUMAN_b03.a2m,EVE,RN5A_HUMAN_b03_theta_0.2.npy,1,741,741
+NP_066963.2,MKSCGVSLATAAAAAAAFGDEEKKMAAGKASGESEEASPSLTAEEREALGGLDSRLFGFVRFHEDGARTKALLGKAVRCYESLILKAEGKVESDFFCQLGHFNLLLEDYPKALSAYQRYYSLQSDYWKNAAFLYGLGLVYFHYNAFQWAIKAFQEVLYVDPSFCRAKEIHLRLGLMFKVNTDYESSLKHFQLALVDCNPCTLSNAEIQFHIAHLYETQRKYHSAKEAYEQLLQTENLSAQVKATVLQQLGWMHHTVDLLGDKATKESYAIQYLQKSLEADPNSGQSWYFLGRCYSSIGKVQDAFISYRQSIDKSEASADTWCSIGVLYQQQNQPMDALQAYICAVQLDHGHAAAWMDLGTLYESCNQPQDAIKCYLNATRSKSCSNTSALAARIKYLQAQLCNLPQGSLQNKTKLLPSIEEAWSLPIPAELTSRQGAMNTAQQNTSDNWSGGHAVSHPPVQQQAHSWCLTPQKLQHLEQLRANRNNLNPAQKLMLEQLESQFVLMQQHQMRPTGVAQVRSTGIPNGPTADSSLPTNSVSGQQPQLALTRVPSVSQPGVRPACPGQPLANGPFSAGHVPCSTSRTLGSTDTILIGNNHITGSGSNGNVPYLQRNALTLPHNRTNLTSSAEEPWKNQLSNSTQGLHKGQSSHSAGPNGERPLSSTGPSQHLQAAGSGIQNQNGHPTLPSNSVTQGAALNHLSSHTATSGGQQGITLTKESKPSGNILTVPETSRHTGETPNSTASVEGLPNHVHQMTADAVCSPSHGDSKSPGLLSSDNPQLSALLMGKANNNVGTGTCDKVNNIHPAVHTKTDNSVASSPSSAISTATPSPKSTEQTTTNSVTSLNSPHSGLHTINGEGMEESQSPMKTDLLLVNHKPSPQIIPSMSVSIYPSSAEVLKACRNLGKNGLSNSSILLDKCPPPRPPSSPYPPLPKDKLNPPTPSIYLENKRDAFFPPLHQFCTNPNNPVTVIRGLAGALKLDLGLFSTKTLVEANNEHMVEVRTQLLQPADENWDPTGTKKIWHCESNRSHTTIAKYAQYQASSFQESLREENEKRSHHKDHSDSESTSSDNSGRRRKGPFKTIKFGTNIDLSDDKKWKLQLHELTKLPAFVRVVSAGNLLSHVGHTILGMNTVQLYMKVPGSRTPGHQENNNFCSVNINIGPGDCEWFVVPEGYWGVLNDFCEKNNLNFLMGSWWPNLEDLYEANVPVYRFIQRPGDLVWINAGTVHWVQAIGWCNNIAWNVGPLTACQYKLAVERYEWNKLQSVKSIVPMVHLSWNMARNIKVSDPKLFEMIKYCLLRTLKQCQTLREALIAAGKEIIWHGRTKEEPAHYCSICEVEVFDLLFVTNESNSRKTYIVHCQDCARKTSGNLENFVVLEQYKMEDLMQVYDQFTLAPPLPSASS,1401,NP_066963.2.csv,refseq-KDM6A-NM_021140.3_clinical_seed_0_final,refseq-KDM6A-NM_021140.3.a2m,Invitae,refseq-KDM6A-NM_021140.3.npy,1,1401,1401
+NP_066970.3,MVTPCPTSPSSPAARAGRRDNDQNLRAPVKKSRRPRLRRKQPLHPLNPCPLPGDSGICDLFESPSSGSDGAESPSAARGGSPLPGPAQPVAQLDLQTFRDYGQSCYAFRKAQESHFHPREALARQPQVTAESRCKLLSWLIPVHRQFGLSFESLCLTVNTLDRFLTTTPVAADCFQLLGVTSLLIACKQVEVHPPRVKQLLALCCGAFSRQQLCNLECIVLHKLHFTLGAPTISFFLEHFTHARVEAGQAEASEALEAQALARGVAELSLADYAFTSYSPSLLAICCLALADRMLRVSRPVDLRLGDHPEAALEDCMGKLQLLVAINSTSLTHMLPVQICEKCSLPPSSK,350,NP_066970.3.csv,refseq-CCNO-NM_021147.4_clinical_seed_0_final,refseq-CCNO-NM_021147.4.a2m,Invitae,refseq-CCNO-NM_021147.4.npy,1,350,350
+NP_066973.2,MIAVSFKCRCQILRRLTKDESPYTKSASQTKPPDGALAVRRQSIPEEFKGSTVVELMKKEGTTLGLTVSGGIDKDGKPRVSNLRQGGIAARSDQLDVGDYIKAVNGINLAKFRHDEIISLLKNVGERVVLEVEYELPPVSVQGSSVIFRTVEVTLHKEGNTFGFVIRGGAHDDRNKSRPVVITCVRPGGPADREGTIKPGDRLLSVDGIRLLGTTHAEAMSILKQCGQEAALLIEYDVSVMDSVATASGPLLVEVAKTPGASLGVALTTSMCCNKQVIVIDKIKSASIADRCGALHVGDHILSIDGTSMEYCTLAEATQFLANTTDQVKLEILPHHQTRLALKGPDHAALVSSSFSPTSMSAYSLSSLNMGTLPRSLYSTSPRGTMMRRRLKKKDFKSSLSLASSTVGLAGQVVHTETTEVVLTADPVTGFGIQLQGSVFATETLSSPPLISYIEADSPAERCGVLQIGDRVMAINGIPTEDSTFEEASQLLRDSSITSKVTLEIEFDVAESVIPSSGTFHVKLPKKHNVELGITISSPSSRKPGDPLVISDIKKGSVAHRTGTLELGDKLLAIDNIRLDNCSMEDAVQILQQCEDLVKLKIRKDEDNSDEQESSGAIIYTVELKRYGGPLGITISGTEEPFDPIIISSLTKGGLAERTGAIHIGDRILAINSSSLKGKPLSEAIHLLQMAGETVTLKIKKQTDAQSASSPKKFPISSHLSDLGDVEEDSSPAQKPGKLSDMYPSTVPSVDSAVDSWDGSAIDTSYGTQGTSFQASGYNFNTYDWRSPKQRGSLSPVTKPRSQTYPDVGLSYEDWDRSTASGFAGAADSAETEQEENFWSQALEDLETCGQSGILRELEEKADRRVSLRNMTLLATIMSGSTMSLNHEAPTPRSQLGRQASFQERSSSRPHYSQTTRSNTLPSDVGRKSVTLRKMKQEIKEIMSPTPVELHKVTLYKDSDMEDFGFSVADGLLEKGVYVKNIRPAGPGDLGGLKPYDRLLQVNHVRTRDFDCCLVVPLIAESGNKLDLVISRNPLASQKSIDQQSLPGDWSEQNSAFFQQPSHGGNLETREPTNTL,1076,NP_066973.2.csv,refseq-GRIP1-NM_021150.3_clinical_seed_0_final,refseq-GRIP1-NM_021150.3.a2m,Invitae,refseq-GRIP1-NM_021150.3.npy,1,1076,1076
+NP_067032.3,MNFIKDNSRALIQRMGMTVIKQITDDLFVWNVLNREEVNIICCEKVEQDAARGIIHMILKKGSESCNLFLKSLKEWNYPLFQDLNGQSLFHQTSEGDLDDLAQDLKDLYHTPSFLNFYPLGEDIDIIFNLKSTFTEPVLWRKDQHHHRVEQLTLNGLLQALQSPCIIEGESGKGKSTLLQRIAMLWGSGKCKALTKFKFVFFLRLSRAQGGLFETLCDQLLDIPGTIRKQTFMAMLLKLRQRVLFLLDGYNEFKPQNCPEIEALIKENHRFKNMVIVTTTTECLRHIRQFGALTAEVGDMTEDSAQALIREVLIKELAEGLLLQIQKSRCLRNLMKTPLFVVITCAIQMGESEFHSHTQTTLFHTFYDLLIQKNKHKHKGVAASDFIRSLDHCGDLALEGVFSHKFDFELQDVSSVNEDVLLTTGLLCKYTAQRFKPKYKFFHKSFQEYTAGRRLSSLLTSHEPEEVTKGNGYLQKMVSISDITSTYSSLLRYTCGSSVEATRAVMKHLAAVYQHGCLLGLSIAKRPLWRQESLQSVKNTTEQEILKAININSFVECGIHLYQESTSKSALSQEFEAFFQGKSLYINSGNIPDYLFDFFEHLPNCASALDFIKLDFYGGAMASWEKAAEDTGGIHMEEAPETYIPSRAVSLFFNWKQEFRTLEVTLRDFSKLNKQDIRYLGKIFSSATSLRLQIKRCAGVAGSLSLVLSTCKNIYSLMVEASPLTIEDERHITSVTNLKTLSIHDLQNQRLPGGLTDSLGNLKNLTKLIMDNIKMNEEDAIKLAEGLKNLKKMCLFHLTHLSDIGEGMDYIVKSLSSEPCDLEEIQLVSCCLSANAVKILAQNLHNLVKLSILDLSENYLEKDGNEALHELIDRMNVLEQLTALMLPWGCDVQGSLSSLLKHLEEVPQLVKLGLKNWRLTDTEIRILGAFFGKNPLKNFQQLNLAGNRVSSDGWLAFMGVFENLKQLVFFDFSTKEFLPDPALVRKLSQVLSKLTFLQEARLVGWQFDDDDLSVITGAFKLVTA,1024,NP_067032.3.csv,refseq-NLRC4-NM_021209.4_clinical_seed_0_final,refseq-NLRC4-NM_021209.4.a2m,Invitae,refseq-NLRC4-NM_021209.4.npy,1,1024,1024
+NP_067045.1,MEDYLQGCRAALQESRPLHVVLGNEACDLDSTVSALALAFYLAKTTEAEEVFVPVLNIKRSELPLRGDIVFFLQKVHIPESILIFRDEIDLHALYQAGQLTLILVDHHILSKSDTALEEAVAEVLDHRPIEPKHCPPCHVSVELVGSCATLVTERILQGAPEILDRQTAALLHGTIILDCVNMDLKIGKATPKDSKYVEKLEALFPDLPKRNDIFDSLQKAKFDVSGLTTEQMLRKDQKTIYRQGVKVAISAIYMDLEAFLQRSNLLADLHAFCQAHSYDVLVAMTIFFNTHNEPVRQLAIFCPHVALQTTICEVLERSHSPPLKLTPASSTHPNLHAYLQGNTQVSRKKLLPLLQEALSAYFDSMKIPSGQPETADVSREQVDKELDRASNSLISGLSQDEEDPPLPPTPMNSLVDECPLDQGLPKLSAEAVFEKCSQISLSQSTTASLSKK,453,NP_067045.1.csv,refseq-PRUNE1-NM_021222.2_clinical_seed_0_final,refseq-PRUNE1-NM_021222.2.a2m,Invitae,refseq-PRUNE1-NM_021222.2.npy,1,453,453
+NP_067063.1,MNGYGSPYLYMGGPVSQPPRAPLQRTPKCARCRNHGVLSWLKGHKRYCRFKDCTCEKCILIIERQRVMAAQVALRRQQANESLESLIPDSLRALPGPPPPGDAVAAPQPPPASQPSQPQPPRPAAELAAAAALRWTAEPQPGALQAQLAKPDLTEERLGDGKSADNTEVFSDKDTDQRSSPDVAKSKGCFTPESPEIVSVEEGGYAVQKNGGNPESRPDSPKCHAEQNHLLIEGPSGTVSLPFSLKANRPPLEVLKKIFPNQKPTVLELILKGCGGDLVSAVEVLLSSRSSVTGAERTSAEPESLALPSNGHIFEHTLSSYPISSSKWSVGSAFRVPDTLRFSADSSNVVPSPLAGPLQPPFPQPPRYPLMLRNTLARSQSSPFLPNDVTLWNTMTLQQQYQLRSQYVSPFPSNSTSVFRSSPVLPARATEDPRISIPDDGCPFVSKQSIYTEDDYDERSDSSDSRTLNTSS,472,NP_067063.1.csv,refseq-DMRT3-NM_021240.3_clinical_seed_0_final,refseq-DMRT3-NM_021240.3.a2m,Invitae,refseq-DMRT3-NM_021240.3.npy,1,472,472
+NP_067628.1,MWLPRVSSTAVTALLLAQTFLLLFLVSRPGPSSPAGGEARVHVLVLSSWRSGSSFVGQLFNQHPDVFYLMEPAWHVWTTLSQGSAATLHMAVRDLVRSVFLCDMDVFDAYLPWRRNLSDLFQWAVSRALCSPPACSAFPRGAISSEAVCKPLCARQSFTLAREACRSYSHVVLKEVRFFNLQVLYPLLSDPALNLRIVHLVRDPRAVLRSREQTAKALARDNGIVLGTNGTWVEADPGLRVVREVCRSHVRIAEAATLKPPPFLRGRYRLVRFEDLAREPLAEIRALYAFTGLSLTPQLEAWIHNITHGSGPGARREAFKTSSRNALNVSQAWRHALPFAKIRRVQELCAGALQLLGYRPVYSEDEQRNLALDLVLPRGLNGFTWASSTASHPRN,395,NP_067628.1.csv,refseq-CHST6-NM_021615.4_clinical_seed_0_final,refseq-CHST6-NM_021615.4.a2m,Invitae,refseq-CHST6-NM_021615.4.npy,1,395,395
+NP_067632.2,MMGSVLPAEALVLKTGLKAPGLALAEVITSDILHSFLYGRWRNVLGEQLFEDKSHHASPKTAFTAEVLAQSFSGEVQKLSSLVLPAEVIIAQSSIPGEGLGIFSKTWIKAGTEMGPFTGRVIAPEHVDICKNNNLMWEVFNEDGTVRYFIDASQEDHRSWMTYIKCARNEQEQNLEVVQIGTSIFYKAIEMIPPDQELLVWYGNSHNTFLGIPGVPGLEEDQKKNKHEDFHPADSAAGPAGRMRCVICHRGFNSRSNLRSHMRIHTLDKPFVCRFCNRRFSQSSTLRNHVRLHTGERPYKCQVCQSAYSQLAGLRAHQKSARHRPPSTALQAHSPALPAPHAHAPALAAAAAAAAAAAAHHLPAMVL,367,NP_067632.2.csv,refseq-PRDM12-NM_021619.2_clinical_seed_0_final,refseq-PRDM12-NM_021619.2.a2m,Invitae,refseq-PRDM12-NM_021619.2.npy,1,367,367
+NP_067638.3,MADSSEGPRAGPGEVAELPGDESGTPGGEAFPLSSLANLFEGEDGSLSPSPADASRPAGPGDGRPNLRMKFQGAFRKGVPNPIDLLESTLYESSVVPGPKKAPMDSLFDYGTYRHHSSDNKRWRKKIIEKQPQSPKAPAPQPPPILKVFNRPILFDIVSRGSTADLDGLLPFLLTHKKRLTDEEFREPSTGKTCLPKALLNLSNGRNDTIPVLLDIAERTGNMREFINSPFRDIYYRGQTALHIAIERRCKHYVELLVAQGADVHAQARGRFFQPKDEGGYFYFGELPLSLAACTNQPHIVNYLTENPHKKADMRRQDSRGNTVLHALVAIADNTRENTKFVTKMYDLLLLKCARLFPDSNLEAVLNNDGLSPLMMAAKTGKIGIFQHIIRREVTDEDTRHLSRKFKDWAYGPVYSSLYDLSSLDTCGEEASVLEILVYNSKIENRHEMLAVEPINELLRDKWRKFGAVSFYINVVSYLCAMVIFTLTAYYQPLEGTPPYPYRTTVDYLRLAGEVITLFTGVLFFFTNIKDLFMKKCPGVNSLFIDGSFQLLYFIYSVLVIVSAALYLAGIEAYLAVMVFALVLGWMNALYFTRGLKLTGTYSIMIQKILFKDLFRFLLVYLLFMIGYASALVSLLNPCANMKVCNEDQTNCTVPTYPSCRDSETFSTFLLDLFKLTIGMGDLEMLSSTKYPVVFIILLVTYIILTFVLLLNMLIALMGETVGQVSKESKHIWKLQWATTILDIERSFPVFLRKAFRSGEMVTVGKSSDGTPDRRWCFRVDEVNWSHWNQNLGIINEDPGKNETYQYYGFSHTVGRLRRDRWSSVVPRVVELNKNSNPDEVVVPLDSMGNPRCDGHQQGYPRKWRTDDAPL,871,NP_067638.3.csv,refseq-TRPV4-NM_021625.4_clinical_seed_0_final,refseq-TRPV4-NM_021625.4.a2m,Invitae,refseq-TRPV4-NM_021625.4.npy,1,871,871
+NP_067641.2,MAVYRLCVTTGPYLRAGTLDNISVTLVGTCGESPKQRLDRMGRDFAPGSVQKYKVRCTAELGELLLLRVHKERYAFFRKDSWYCSRICVTEPDGSVSHFPCYQWIEGYCTVELRPGTARTICQDSLPLLLDHRTRELRARQECYRWKIYAPGFPCMVDVNSFQEMESDKKFALTKTTTCVDQGDSSGNRYLPGFPMKIDIPSLMYMEPNVRYSATKTISLLFNAIPASLGMKLRGLLDRKGSWKKLDDMQNIFWCHKTFTTKYVTEHWCEDHFFGYQYLNGVNPVMLHCISSLPSKLPVTNDMVAPLLGQDTCLQTELERGNIFLADYWILAEAPTHCLNGRQQYVAAPLCLLWLSPQGALVPLAIQLSQTPGPDSPIFLPTDSEWDWLLAKTWVRNSEFLVHENNTHFLCTHLLCEAFAMATLRQLPLCHPIYKLLLPHTRYTLQVNTIARATLLNPEGLVDQVTSIGRQGLIYLMSTGLAHFTYTNFCLPDSLRARGVLAIPNYHYRDDGLKIWAAIESFVSEIVGYYYPSDASVQQDSELQAWTGEIFAQAFLGRESSGFPSRLCTPGEMVKFLTAIIFNCSAQHAAVNSGQHDFGAWMPNAPSSMRQPPPQTKGTTTLKTYLDTLPEVNISCNNLLLFWLVSQEPKDQRPLGTYPDEHFTEEAPRRSIAAFQSRLAQISRDIQERNQGLALPYTYLDPPLIENSVSI,711,NP_067641.2.csv,refseq-ALOXE3-NM_021628.2_clinical_seed_0_final,refseq-ALOXE3-NM_021628.2.a2m,Invitae,refseq-ALOXE3-NM_021628.2.npy,1,711,711
+NP_067642.1,MSELEQLRQEAEQLRNQIQDARKACNDATLVQITSNMDSVGRIQMRTRRTLRGHLAKIYAMHWGYDSRLLVSASQDGKLIIWDSYTTNKMHAIPLRSSWVMTCAYAPSGNYVACGGLDNICSIYNLKTREGNVRVSRELPGHTGYLSCCRFLDDSQIVTSSGDTTCALWDIETAQQTTTFTGHSGDVMSLSLSPDMRTFVSGACDASSKLWDIRDGMCRQSFTGHVSDINAVSFFPNGYAFATGSDDATCRLFDLRADQELLLYSHDNIICGITSVAFSKSGRLLLAGYDDFNCNVWDTLKGDRAGVLAGHDNRVSCLGVTDDGMAVATGSWDSFLRIWN,340,NP_067642.1.csv,refseq-GNB4-NM_021629.3_clinical_seed_0_final,refseq-GNB4-NM_021629.3.a2m,Invitae,refseq-GNB4-NM_021629.3.npy,1,340,340
+NP_068369.1,MQAAVAVSVPFLLLCVLGTCPPARCGQAGDASLMELEKRKENRFVERQSIVPLRLIYRSGGEDESRHDALDTRVRGDLGGPQLTHVDQASFQVDAFGTSFILDVVLNHDLLSSEYIERHIEHGGKTVEVKGGEHCYYQGHIRGNPDSFVALSTCHGLHGMFYDGNHTYLIEPEENDTTQEDFHFHSVYKSRLFEFSLDDLPSEFQQVNITPSKFILKPRPKRSKRQLRRYPRNVEEETKYIELMIVNDHLMFKKHRLSVVHTNTYAKSVVNMADLIYKDQLKTRIVLVAMETWATDNKFAISENPLITLREFMKYRRDFIKEKSDAVHLFSGSQFESSRSGAAYIGGICSLLKGGGVNEFGKTDLMAVTLAQSLAHNIGIISDKRKLASGECKCEDTWSGCIMGDTGYYLPKKFTQCNIEEYHDFLNSGGGACLFNKPSKLLDPPECGNGFIETGEECDCGTPAECVLEGAECCKKCTLTQDSQCSDGLCCKKCKFQPMGTVCREAVNDCDIRETCSGNSSQCAPNIHKMDGYSCDGVQGICFGGRCKTRDRQCKYIWGQKVTASDKYCYEKLNIEGTEKGNCGKDKDTWIQCNKRDVLCGYLLCTNIGNIPRLGELDGEITSTLVVQQGRTLNCSGGHVKLEEDVDLGYVEDGTPCGPQMMCLEHRCLPVASFNFSTCLSSKEGTICSGNGVCSNELKCVCNRHWIGSDCNTYFPHNDDAKTGITLSGNGVAGTNIIIGIIAGTILVLALILGITAWGYKNYREQRQLPQGDYVKKPGDGDSFYSDIPPGVSTNSASSSKKRSNGLSHSWSERIPDTKHISDICENGRPRSNSWQGNLGGNKKKIRGKRFRPRSNSTETLSPAKSPSSSTGSIASSRKYPYPMPPLPDEDKKVNRQSARLWETSI,906,NP_068369.1.csv,refseq-ADAM22-NM_021723.4_clinical_seed_0_final,refseq-ADAM22-NM_021723.4.a2m,Invitae,refseq-ADAM22-NM_021723.4.npy,1,906,906
+NP_068380.3,MVGYDPKPDGRNNTKFQVAVAGSVSGLVTRALISPFDVIKIRFQLQHERLSRSDPSAKYHGILQASRQILQEEGPTAFWKGHVPAQILSIGYGAVQFLSFEMLTELVHRGSVYDAREFSVHFVCGGLAACMATLTVHPVDVLRTRFAAQGEPKVYNTLRHAVGTMYRSEGPQVFYKGLAPTLIAIFPYAGLQFSCYSSLKHLYKWAIPAEGKKNENLQNLLCGSGAGVISKTLTYPLDLFKKRLQVGGFEHARAAFGQVRRYKGLMDCAKQVLQKEGALGFFKGLSPSLLKAALSTGFMFFSYEFFCNVFHCMNRTASQR,320,NP_068380.3.csv,refseq-SLC25A19-NM_021734.4_clinical_seed_0_final,refseq-SLC25A19-NM_021734.4.a2m,Invitae,refseq-SLC25A19-NM_021734.4.npy,1,320,320
+NP_068556.2,MHSASSMLGAVKMEGHEPSDWSSYYAEPEGYSSVSNMNAGLGMNGMNTYMSMSAAAMGSGSGNMSAGSMNMSSYVGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKTYRRSYTHAKPPYSYISLITMAIQQSPNKMLTLSEIYQWIMDLFPFYRQNQQRWQNSIRHSLSFNDCFLKVPRSPDKPGKGSFWTLHPDSGNMFENGCYLRRQKRFKCEKQLALKEAAGAAGSGKKAAAGAQASQAQLGEAAGPASETPAGTESPHSSASPCQEHKRGGLGELKGTPAAALSPPEPAPSPGQQQQAAAHLLGPPHHPGLPPEAHLKPEHHYAFNHPFSINNLMSSEQQHHHSHHHHQPHKMDLKAYEQVMHYPGYGSPMPGSLAMGPVTNKTGLDASPLAADTSYYQGVYSRPIMNSS,463,NP_068556.2.csv,refseq-FOXA2-NM_021784.4_clinical_seed_0_final,refseq-FOXA2-NM_021784.4.a2m,Invitae,refseq-FOXA2-NM_021784.4.npy,1,463,463
+NP_068587.1,MAFHVEGLIAIIVFYLLILLVGIWAAWRTKNSGSAEERSEAIIVGGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPIGYSLSLILGGLFFAKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVDMHISVIISALIATLYTLVGGLYSVAYTDVVQLFCIFVGLWISVPFALSHPAVADIGFTAVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFLAAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDPKTTEEADMILPIVLQYLCPVYISFFGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASATAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLFVKGTNTYGAVAGYVSGLFLRITGGEPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAMVTSFLTNICISYLAKYLFESGTLPPKLDVFDAVVARHSEENMDKTILVKNENIKLDELALVKPRQSMTLSSTFTNKEAFLDVDSSPEGSGTEDNLQ,580,NP_068587.1.csv,refseq-SLC5A7-NM_021815.3_clinical_seed_0_final,refseq-SLC5A7-NM_021815.3.a2m,Invitae,refseq-SLC5A7-NM_021815.3_theta_0.2.npy,1,580,580
+NP_068602.2,MWVLLRSGYPLRILLPLRGEWMGRRGLPRNLAPGPPRRRYRKETLQALDMPVLPVTATEIRQYLRGHGIPFQDGHSCLRALSPFAESSQLKGQTGVTTSFSLFIDKTTGHFLCMTSLAEGSWEDFQASVEGRGDGAREGFLLSKAPEFEDSEEVRRIWNRAIPLWELPDQEEVQLADTMFGLTKVTDDTLKRFSVRYLRPARSLVFPWFSPGGSGLRGLKLLEAKCQGDGVSYEETTIPRPSAYHNLFGLPLISRRDAEVVLTSRELDSLALNQSTGLPTLTLPRGTTCLPPALLPYLEQFRRIVFWLGDDLRSWEAAKLFARKLNPKRCFLVRPGDQQPRPLEALNGGFNLSRILRTALPAWHKSIVSFRQLREEVLGELSNVEQAAGLRWSRFPDLNRILKGHRKGELTVFTGPTGSGKTTFISEYALDLCSQGVNTLWGSFEISNVRLARVMLTQFAEGRLEDQLDKYDHWADRFEDLPLYFMTFHGQQSIRTVIDTMQHAVYVYDICHVIIDNLQFMMGHEQLSTDRIAAQDYIIGVFRKFATDNNCHVTLVIHPRKEDDDKELQTASIFGSAKASQEADNVLILQDRKLVTGPGKRYLQVSKNRFDGDVGVFPLEFNKNSLTFSIPPKNKARLKKIKDDTGPVAKKPSSGKKGATTQNSEICSGQAPTPDQPDTSKRSK,684,NP_068602.2.csv,refseq-TWNK-NM_021830.4_clinical_seed_0_final,refseq-TWNK-NM_021830.4.a2m,Invitae,refseq-TWNK-NM_021830.4.npy,1,684,684
+NP_068603.4,MELRCGGLLFSSRFDSGNLAHVEKVESLSSDGEGVGGGASALTSGIASSPDYEFNVWTRPDCAETEFENGNRSWFYFSVRGGMPGKLIKINIMNMNKQSKLYSQGMAPFVRTLPTRPRWERIRDRPTFEMTETQFVLSFVHRFVEGRGATTFFAFCYPFSYSDCQELLNQLDQRFPENHPTHSSPLDTIYYHRELLCYSLDGLRVDLLTITSCHGLREDREPRLEQLFPDTSTPRPFRFAGKRIFFLSSRVHPGETPSSFVFNGFLDFILRPDDPRAQTLRRLFVFKLIPMLNPDGVVRGHYRTDSRGVNLNRQYLKPDAVLHPAIYGAKAVLLYHHVHSRLNSQSSSEHQPSSCLPPDAPVSDLEKANNLQNEAQCGHSADRHNAEAWKQTEPAEQKLNSVWIMPQQSAGLEESAPDTIPPKESGVAYYVDLHGHASKRGCFMYGNSFSDESTQVENMLYPKLISLNSAHFDFQGCNFSEKNMYARDRRDGQSKEGSGRVAIYKASGIIHSYTLECNYNTGRSVNSIPAACHDNGRASPPPPPAFPSRYTVELFEQVGRAMAIAALDMAECNPWPRIVLSEHSSLTNLRAWMLKHVRNSRGLSSTLNVGVNKKRGLRTPPKSHNGLPVSCSENTLSRARSFSTGTSAGGSSSSQQNSPQMKNSPSFPFHGSRPAGLPGLGSSTQKVTHRVLGPVREPRSQDRRRQQQPLNHRPAGSLAPSPAPTSSGPASSHKLGSCLLPDSFNIPGSSCSLLSSGDKPEAVMVIGKGLLGTGARMPCIKTRLQARPRLGRGSPPTRRGMKGSSGPTSPTPRTRESSELELGSCSATPGLPQARPPRPRSAPAFSPISCSLSDSPSWNCYSRGPLGQPEVCFVPKSPPLTVSPRV,886,NP_068603.4.csv,refseq-AGBL5-NM_021831.5_clinical_seed_0_final,refseq-AGBL5-NM_021831.5.a2m,Invitae,refseq-AGBL5-NM_021831.5.npy,1,886,886
+NP_068657.1,MFSMRIVCLVLSVVGTAWTADSGEGDFLAEGGGVRGPRVVERHQSACKDSDWPFCSDEDWNYKCPSGCRMKGLIDEVNQDFTNRINKLKNSLFEYQKNNKDSHSLTTNIMEILRGDFSSANNRDNTYNRVSEDLRSRIEVLKRKVIEKVQHIQLLQKNVRAQLVDMKRLEVDIDIKIRSCRGSCSRALAREVDLKDYEDQQKQLEQVIAKDLLPSRDRQHLPLIKMKPVPDLVPGNFKSQLQKVPPEWKALTDMPQMRMELERPGGNEITRGGSTSYGTGSETESPRNPSSAGSWNSGSSGPGSTGNRNPGSSGTGGTATWKPGSSGPGSTGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSERGSAGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNPDWGTFEEVSGNVSPGTRREYHTEKLVTSKGDKELRTGKEKVTSGSTTTTRRSCSKTVTKTVIGPDGHKEVTKEVVTSEDGSDCPEAMDLGTLSGIGTLDGFRHRHPDEAAFFDTASTGKTFPGFFSPMLGEFVSETESRGSESGIFTNTKESSSHHPGIAEFPSRGKSSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHEGTHSTKRGHAKSRPVRGIHTSPLGKPSLSP,644,NP_068657.1.csv,refseq-FGA-NM_021871.3_clinical_seed_0_final,refseq-FGA-NM_021871.3.a2m,Invitae,refseq-FGA-NM_021871.3.npy,1,644,644
+NP_068711.1,MWRVRKRGYFGIWSFPLIIAAVCAQSVNDPSNMSLVKETVDRLLKGYDIRLRPDFGGPPVAVGMNIDIASIDMVSEVNMDYTLTMYFQQAWRDKRLSYNVIPLNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAACMMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWRGDDNAVTGVTKIELPQFSIVDYKLITKKVVFSTGSYPRLSLSFKLKRNIGYFILQTYMPSILITILSWVSFWINYDASAARVALGITTVLTMTTINTHLRETLPKIPYVKAIDMYLMGCFVFVFMALLEYALVNYIFFGRGPQRQKKAAEKAASANNEKMRLDVNKIFYKDIKQNGTQYRSLWDPTGNLSPTRRTTNYDFSLYTMDPHENILLSTLEIKNEMATSEAVMGLGDPRSTMLAYDASSIQYRKAGLPRHSFGRNALERHVAQKKSRLRRRASQLKITIPDLTDVNAIDRWSRIFFPVVFSFFNIVYWLYYVN,512,NP_068711.1.csv,refseq-GABRB2-NM_021911.2_clinical_seed_0_final,refseq-GABRB2-NM_021911.2.a2m,Invitae,refseq-GABRB2-NM_021911.2.npy,1,512,512
+NP_068712.1,MCSGLLELLLPIWLSWTLGTRGSEPRSVNDPGNMSFVKETVDKLLKGYDIRLRPDFGGPPVCVGMNIDIASIDMVSEVNMDYTLTMYFQQYWRDKRLAYSGIPLNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAACMMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWRGGDKAVTGVERIELPQFSIVEHRLVSRNVVFATGAYPRLSLSFRLKRNIGYFILQTYMPSILITILSWVSFWINYDASAARVALGITTVLTMTTINTHLRETLPKIPYVKAIDMYLMGCFVFVFLALLEYAFVNYIFFGRGPQRQKKLAEKTAKAKNDRSKSESNRVDAHGNILLTSLEVHNEMNEVSGGIGDTRNSAISFDNSGIQYRKQSMPREGHGRFLGDRSLPHKKTHLRRRSSQLKIKIPDLTDVNAIDRWSRIVFPFTFSLFNLVYWLYYVN,473,NP_068712.1.csv,refseq-GABRB3-NM_021912.4_clinical_seed_0_final,refseq-GABRB3-NM_021912.4.a2m,Invitae,refseq-GABRB3-NM_021912.4.npy,1,473,473
+NP_068741.1,MATPDAGLPGAEGVEPAPWAQLEAPARLLLQALQAGPEGARRGLGVLRALGSRGWEPFDWGRLLEALCREEPVVQGPDGRLELKPLLLRLPRICQRNLMSLLMAVRPSLPESGLLSVLQIAQQDLAPDPDAWLRALGELLRRDLGVGTSMEGASPLSERCQRQLQSLCRGLGLGGRRLKSPQAPDPEEEENRDSQQPGKRRKDSEEEAASPEGKRVPKRLRCWEEEEDHEKERPEHKSLESLADGGSASPIKDQPVMAVKTGEDGSNLDDAKGLAESLELPKAIQDQLPRLQQLLKTLEEGLEGLEDAPPVELQLLHECSPSQMDLLCAQLQLPQLSDLGLLRLCTWLLALSPDLSLSNATVLTRSLFLGRILSLTSSASRLLTTALTSFCAKYTYPVCSALLDPVLQAPGTGPAQTELLCCLVKMESLEPDAQVLMLGQILELPWKEETFLVLQSLLERQVEMTPEKFSVLMEKLCKKGLAATTSMAYAKLMLTVMTKYQANITETQRLGLAMALEPNTTFLRKSLKAALKHLGP,536,NP_068741.1.csv,refseq-FANCE-NM_021922.2_clinical_seed_0_final,refseq-FANCE-NM_021922.2.a2m,Invitae,refseq-FANCE-NM_021922.2.npy,1,536,536
+NP_068745.2,MNAETCVSYCESPAAAMDAYYSPVSQSREGSSPFRAFPGGDKFGTTFLSAAAKAQGFGDAKSRARYGAGQQDLATPLESGAGARGSFNKFQPQPSTPQPQPPPQPQPQQQQPQPQPPAQPHLYLQRGACKTPPDGSLKLQEGSSGHSAALQVPCYAKESSLGEPELPPDSDTVGMDSSYLSVKEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRNRTTFTSYQLEELEKVFQKTHYPDVYAREQLAMRTDLTEARVQVWFQNRRAKWRKRERFGQMQQVRTHFSTAYELPLLTRAENYAQIQNPSWLGNNGAASPVPACVVPCDPVPACMSPHAHPPGSGASSVTDFLSVSGAGSHVGQTHMGSLFGAASLSPGLNGYELNGEPDRKTSSIAALRMKAKEHSAAISWAT,411,NP_068745.2.csv,refseq-ALX4-NM_021926.3_clinical_seed_0_final,refseq-ALX4-NM_021926.3.a2m,Invitae,refseq-ALX4-NM_021926.3.npy,1,411,411
+NP_068758.3,MFPAGPPSHSLLRLPLLQLLLLVVQAVGRGLGRASPAGGPLEDVVIERYHIPRACPREVQMGDFVRYHYNGTFEDGKKFDSSYDRNTLVAIVVGVGRLITGMDRGLMGMCVNERRRLIVPPHLGYGSIGLAGLIPPDATLYFDVVLLDVWNKEDTVQVSTLLRPPHCPRMVQDGDFVRYHYNGTLLDGTSFDTSYSKGGTYDTYVGSGWLIKGMDQGLLGMCPGERRKIIIPPFLAYGEKGYGTVIPPQASLVFHVLLIDVHNPKDAVQLETLELPPGCVRRAGAGDFMRYHYNGSLMDGTLFDSSYSRNHTYNTYIGQGYIIPGMDQGLQGACMGERRRITIPPHLAYGENGTGDKIPGSAVLIFNVHVIDFHNPADVVEIRTLSRPSETCNETTKLGDFVRYHYNCSLLDGTQLFTSHDYGAPQEATLGANKVIEGLDTGLQGMCVGERRQLIVPPHLAHGESGARGVPGSAVLLFEVELVSREDGLPTGYLFVWHKDPPANLFEDMDLNKDGEVPPEEFSTFIKAQVSEGKGRLMPGQDPEKTIGDMFQNQDRNQDGKITVDELKLKSDEDEERVHEEL,582,NP_068758.3.csv,refseq-FKBP10-NM_021939.3_clinical_seed_0_final,refseq-FKBP10-NM_021939.3.a2m,Invitae,refseq-FKBP10-NM_021939.3.npy,1,582,582
+NP_068761.4,MSPTQWDFPVELCCRPMAFVTLTGLDVVYNAVHRAVWDAFCANRRADRVPISFKVLPGDHEYPKCRPKRTSYEWYIPKGILKTGWMNKHLNLVPALVVVFYELDWDEPQWKEKQSECATRVEIVRQSLQGRNTKVAVVLIQKKTPLPPGEDVIASERAAALCNACELSGKSLFVLPHTDHLVGYIIRLENAFYEHAQTYYYTEIRRVKSHKEFLNKTTHQLLFVRHQFKIAFFSELKQDTQNALKNYRTAYNLVHELRAHETNILEIKTMAGFINYKICRLCFQHNTPLDAIAQFRKHIDLCKKKIGSAELSFEHDAWMSKQFQAFGDLFDEAIKLGLTAIQTQNPGFYYQQAAYYAQERKQLAKTLCNHEASVMYPNPDPLETQTGVLDFYGQRSWRQGILSFDLSDPEKEKVGILAIQLKERNVVHSEIIITLLSNAVAQFKKYKCPRMKSHLMVQMGEEYYYAKDYTKALKLLDYVMCDYRSEGWWTLLTSVLTTALKCSYLMAQLKDYITYSLELLGRASTLKDDQKSRIEKNLINVLMNESPDPEPDCDILAVKTAQKLWADRISLAGSNIFTIGVQDFVPFVQCKAKFHAPSFHVDVPVQFDIYLKADCPHPIRFSKLCVSFNNQEYNQFCVIEEASKANEVLENLTQGKMCLVPGKTRKLLFKFVAKTEDVGKKIEITSVDLALGNETGRCVVLNWQGGGGDAASSQEALQAARSFKRRPKLPDNEVHWDSIIIQASTMIISRVPNISVHLLHEPPALTNEMYCLVVTVQSHEKTQIRDVKLTAGLKPGQDANLTQKTHVTLHGTELCDESYPALLTDIPVGDLHPGEQLEKMLYVRCGTVGSRMFLVYVSYLINTTVEEKEIVCKCHKDETVTIETVFPFDVAVKFVSTKFEHLERVYADIPFLLMTDLLSASPWALTIVSSELQLAPSMTTVDQLESQVDNVILQTGESASECFCLQCPSLGNIEGGVATGHYIISWKRTSAMENIPIITTVITLPHVIVENIPLHVNADLPSFGRVRESLPVKYHLQNKTDLVQDVEISVEPSDAFMFSGLKQIRLRILPGTEQEMLYNFYPLMAGYQQLPSLNINLLRFPNFTNQLLRRFIPTSIFVKPQGRLMDDTSIAAA,1133,NP_068761.4.csv,refseq-TRAPPC11-NM_021942.5_clinical_seed_0_final,refseq-TRAPPC11-NM_021942.5.a2m,Invitae,refseq-TRAPPC11-NM_021942.5.npy,1,1133,1133
+NP_068770.2,MPNDEAFSKPSTPSEAPHAPGVPPQGRAGGFGKASGALVGAASGSSAGGSSRGGGSGSGASDLGAGSKKSPRLPKCARCRNHGYASPLKGHKRFCMWRDCQCKKCNLIAERQRVMAAQVALRRQQAQEEELGISHPIPLPSAAELLVKRENNGSNPCLMTECSGTSQPPPASVPTTAASEGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQPSLFPYYNNLYNCPQYSMALAADSASGEVGNPLGGSPVKNSLRGLPGPYVPGQTGNQWQMKNMENRHAMSSQYRMHSYYPPPSYLGQSVPQFFTFEDAPSYPEARASVFSPPSSQDSGLVSLSSSSPISNKSTKAVLECEPASEPSSFTVTPVIEEDE,373,NP_068770.2.csv,refseq-DMRT1-NM_021951.2_clinical_seed_0_final,refseq-DMRT1-NM_021951.2.a2m,Invitae,refseq-DMRT1-NM_021951.2.npy,1,373,373
+NP_068773.2,MGDWSFLGRLLENAQEHSTVIGKVWLTVLFIFRILVLGAAAEDVWGDEQSDFTCNTQQPGCENVCYDRAFPISHIRFWALQIIFVSTPTLIYLGHVLHIVRMEEKKKEREEEEQLKRESPSPKEPPQDNPSSRDDRGRVRMAGALLRTYVFNIIFKTLFEVGFIAGQYFLYGFELKPLYRCDRWPCPNTVDCFISRPTEKTIFIIFMLAVACASLLLNMLEIYHLGWKKLKQGVTSRLGPDASEAPLGTADPPPLPPSSRPPAVAIGFPPYYAHTAAPLGQARAVGYPGAPPPAADFKLLALTEARGKGQSAKLYNGHHHLLMTEQNWANQAAERQPPALKAYPAASTPAAPSPVGSSSPPLAHEAEAGAAPLLLDGSGSSLEGSALAGTPEEEEQAVTTAAQMHQPPLPLGDPGRASKASRASSGRARPEDLAI,435,NP_068773.2.csv,refseq-GJA3-NM_021954.3_clinical_seed_0_final,refseq-GJA3-NM_021954.3.a2m,Invitae,refseq-GJA3-NM_021954.3.npy,1,435,435
+NP_068775.1,MKIIFPILSNPVFRRTVKLLLCLLWIGYSQGTTHVLRFGGIFEYVESGPMGAEELAFRFAVNTINRNRTLLPNTTLTYDTQKINLYDSFEASKKACDQLSLGVAAIFGPSHSSSANAVQSICNALGVPHIQTRWKHQVSDNKDSFYVSLYPDFSSLSRAILDLVQFFKWKTVTVVYDDSTGLIRLQELIKAPSRYNLRLKIRQLPADTKDAKPLLKEMKRGKEFHVIFDCSHEMAAGILKQALAMGMMTEYYHYIFTTLDLFALDVEPYRYSGVNMTGFRILNTENTQVSSIIEKWSMERLQAPPKPDSGLLDGFMTTDAALMYDAVHVVSVAVQQFPQMTVSSLQCNRHKPWRFGTRFMSLIKEAHWEGLTGRITFNKTNGLRTDFDLDVISLKEEGLEKIGTWDPASGLNMTESQKGKPANITDSLSNRSLIVTTILEEPYVLFKKSDKPLYGNDRFEGYCIDLLRELSTILGFTYEIRLVEDGKYGAQDDANGQWNGMVRELIDHKADLAVAPLAITYVREKVIDFSKPFMTLGISILYRKPNGTNPGVFSFLNPLSPDIWMYILLAYLGVSCVLFVIARFSPYEWYNPHPCNPDSDVVENNFTLLNSFWFGVGALMQQGSELMPKALSTRIVGGIWWFFTLIIISSYTANLAAFLTVERMESPIDSADDLAKQTKIEYGAVEDGATMTFFKKSKISTYDKMWAFMSSRRQSVLVKSNEEGIQRVLTSDYAFLMESTTIEFVTQRNCNLTQIGGLIDSKGYGVGTPMGSPYRDKITIAILQLQEEGKLHMMKEKWWRGNGCPEEESKEASALGVQNIGGIFIVLAAGLVLSVFVAVGEFLYKSKKNAQLEKRSFCSAMVEELRMSLKCQRRLKHKPQAPVIVKTEEVINMHTFNDRRLPGKETMA,908,NP_068775.1.csv,refseq-GRIK2-NM_021956.4_clinical_seed_0_final,refseq-GRIK2-NM_021956.4.a2m,Invitae,refseq-GRIK2-NM_021956.4.npy,1,908,908
+NP_068776.2,MLRGRSLSVTSLGGLPQWEVEELPVEELLLFEVAWEVTNKVGGIYTVIQTKAKTTADEWGENYFLIGPYFEHNMKTQVEQCEPVNDAVRRAVDAMNKHGCQVHFGRWLIEGSPYVVLFDIGYSAWNLDRWKGDLWEACSVGIPYHDREANDMLIFGSLTAWFLKEVTDHADGKYVVAQFHEWQAGIGLILSRARKLPIATIFTTHATLLGRYLCAANIDFYNHLDKFNIDKEAGERQIYHRYCMERASVHCAHVFTTVSEITAIEAEHMLKRKPDVVTPNGLNVKKFSAVHEFQNLHAMYKARIQDFVRGHFYGHLDFDLEKTLFLFIAGRYEFSNKGADIFLESLSRLNFLLRMHKSDITVMVFFIMPAKTNNFNVETLKGQAVRKQLWDVAHSVKEKFGKKLYDALLRGEIPDLNDILDRDDLTIMKRAIFSTQRQSLPPVTTHNMIDDSTDPILSTIRRIGLFNNRTDRVKVILHPEFLSSTSPLLPMDYEEFVRGCHLGVFPSYYEPWGYTPAECTVMGIPSVTTNLSGFGCFMQEHVADPTAYGIYIVDRRFRSPDDSCNQLTKFLYGFCKQSRRQRIIQRNRTERLSDLLDWRYLGRYYQHARHLTLSRAFPDKFHVELTSPPTTEGFKYPRPSSVPPSPSGSQASSPQSSDVEDEVEDERYDEEEEAERDRLNIKSPFSLSHVPHGKKKLHGEYKN,703,NP_068776.2.csv,refseq-GYS2-NM_021957.3_clinical_seed_0_final,refseq-GYS2-NM_021957.3.a2m,Invitae,refseq-GYS2-NM_021957.3.npy,1,703,703
+NP_068799.2,MNIDDKLEGLFLKCGGIDEMQSSRTMVVMGGVSGQSTVSGELQDSVLQDRSMPHQEILAADEVLQESEMRQQDMISHDELMVHEETVKNDEEQMETHERLPQGLQYALNVPISVKQEITFTDVSEQLMRDKKQIREPVDLQKKKKRKQRSPAKILTINEDGSLGLKTPKSHVCEHCNAAFRTNYHLQRHVFIHTGEKPFQCSQCDMRFIQKYLLQRHEKIHTGEKPFRCDECGMRFIQKYHMERHKRTHSGEKPYQCEYCLQYFSRTDRVLKHKRMCHENHDKKLNRCAIKGGLLTSEEDSGFSTSPKDNSLPKKKRQKTEKKSSGMDKESALDKSDLKKDKNDYLPLYSSSTKVKDEYMVAEYAVEMPHSSVGGSHLEDASGEIHPPKLVLKKINSKRSLKQPLEQNQTISPLSTYEESKVSKYAFELVDKQALLDSEGNADIDQVDNLQEGPSKPVHSSTNYDDAMQFLKKKRYLQAASNNSREYALNVGTIASQPSVTQAAVASVIDESTTASILESQALNVEIKSNHDKNVIPDEVLQTLLDHYSHKANGQHEISFSVADTEVTSSISINSSEVPEVTPSENVGSSSQASSSDKANMLQEYSKFLQQALDRTSQNDAYLNSPSLNFVTDNQTLPNQPAFSSIDKQVYATMPINSFRSGMNSPLRTTPDKSHFGLIVGDSQHSFPFSGDETNHASATSTQDFLDQVTSQKKAEAQPVHQAYQMSSFEQPFRAPYHGSRAGIATQFSTANGQVNLRGPGTSAEFSEFPLVNVNDNRAGMTSSPDATTGQTFG,794,NP_068799.2.csv,refseq-ZNF148-NM_021964.2_clinical_seed_0_final,refseq-ZNF148-NM_021964.2.a2m,Invitae,refseq-ZNF148-NM_021964.2.npy,1,794,794
+NP_068810.3,MDELFPLIFPAEPAQASGPYVEIIEQPKQRGMRFRYKCEGRSAGSIPGERSTDTTKTHPTIKINGYTGPGTVRISLVTKDPPHRPHPHELVGKDCRDGFYEAELCPDRCIHSFQNLGIQCVKKRDLEQAISQRIQTNNNPFQVPIEEQRGDYDLNAVRLCFQVTVRDPSGRPLRLPPVLSHPIFDNRAPNTAELKICRVNRNSGSCLGGDEIFLLCDKVQKEDIEVYFTGPGWEARGSFSQADVHRQVAIVFRTPPYADPSLQAPVRVSMQLRRPSDRELSEPMEFQYLPDTDDRHRIEEKRKRTYETFKSIMKKSPFSGPTDPRPPPRRIAVPSRSSASVPKPAPQPYPFTSSLSTINYDEFPTMVFPSGQISQASALAPAPPQVLPQAPAPAPAPAMVSALAQAPAPVPVLAPGPPQAVAPPAPKPTQAGEGTLSEALLQLQFDDEDLGALLGNSTDPAVFTDLASVDNSEFQQLLNQGIPVAPHTTEPMLMEYPEAITRLVTGAQRPPDPAPAPLGAPGLPNGLLSGDEDFSSIADMDFSALLSQISS,551,NP_068810.3.csv,refseq-RELA-NM_021975.3_clinical_seed_0_final,refseq-RELA-NM_021975.3.a2m,Invitae,refseq-RELA-NM_021975.3.npy,1,551,551
+NP_068838.3,MDSGGGSLGLHTPDSRMAHTMIMQDFVAGMAGTAHIDGDHIVVSVPEAVLVSDVVTDDGITLDHGLAAEVVHGPDIITETDVVTEGVIVPEAVLEADVAIEEDLEEDDGDHILTSELITETVRVPEQVFVADLVTGPNGHLEHVVQDCVSGVDSPTMVSEEVLVTNSDTETVIQAAGGVPGSTVTIKTEDDDDDDVKSTSEDYLMISLDDVGEKLEHMGNTPLKIGSDGSQEDAKEDGFGSEVIKVYIFKAEAEDDVEIGGTEIVTESEYTSGHSVAGVLDQSRMQREKMVYMAVKDSSQEEDDIRDERRVSRRYEDCQASGNTLDSALESRSSTAAQYLQICDGINTNKVLKQKAKKRRRGETRQWQTAVIIGPDGQPLTVYPCHICTKKFKSRGFLKRHMKNHPDHLMRKKYQCTDCDFTTNKKVSFHNHLESHKLINKVDKTHEFTEYTRRYREASPLSSNKLILRDKEPKMHKCKYCDYETAEQGLLNRHLLAVHSKNFPHVCVECGKGFRHPSELKKHMRTHTGEKPYQCQYCIFRCADQSNLKTHIKSKHGNNLPYKCEHCPQAFGDERELQRHLDLFQGHKTHQCPHCDHKSTNSSDLKRHIISVHTKDFPHKCEVCDKGFHRPSELKKHSDIHKGRKIHQCRHCDFKTSDPFILSGHILSVHTKDQPLKCKRCKRGFRQQNELKKHMKTHTGRKIYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEFCKKGFRRPSEKNQHIMRHHKEALM,761,NP_068838.3.csv,refseq-ZNF711-NM_021998.4_clinical_seed_0_final,refseq-ZNF711-NM_021998.4.a2m,Invitae,refseq-ZNF711-NM_021998.4_theta_0.2.npy,1,761,761
+NP_071357.2,MSCKKQRSRKHSVNEKCNMKIEHYFSPVSKEQQNNCSTSLMRMESRGDPRATTNTQAQRFHSPKKNPEDQTMPQNRTIYVTLKVNHRRNQDMKLKLTHSENSSLYMALNTLQAVRKEIETHQGQEMLVRGTEGIKEYINLGMPLSCFPEGGQVVITFSQSKSKQKEDNHIFGRQDKASTECVKFYIHAIGIGKCKRRIVKCGKLHKKGRKLCVYAFKGETIKDALCKDGRFLSFLENDDWKLIENNDTILESTQPVDELEGRYFQVEVEKRMVPSAAASQNPESEKRNTCVLREQIVAQYPSLKRESEKIIENFKKKMKVKNGETLFELHRTTFGKVTKNSSSIKVVKLLVRLSDSVGYLFWDSATTGYATCFVFKGLFILTCRHVIDSIVGDGIEPSKWATIIGQCVRVTFGYEELKDKETNYFFVEPWFEIHNEELDYAVLKLKENGQQVPMELYNGITPVPLSGLIHIIGHPYGEKKQIDACAVIPQGQRAKKCQERVQSKKAESPEYVHMYTQRSFQKIVHNPDVITYDTEFFFGASGSPVFDSKGSLVAMHAAGFAYTYQNETRSIIEFGSTMESILLDIKQRHKPWYEEVFVNQQDVEMMSDEDL,611,NP_071357.2.csv,refseq-FAM111A-NM_022074.3_clinical_seed_0_final,refseq-FAM111A-NM_022074.3.a2m,Invitae,refseq-FAM111A-NM_022074.3.npy,1,611,611
+NP_071372.1,MSADSSPLVGSTPTGYGTLTIGTSIDPLSSSVSSVRLSGYCGSPWRVIGYHVVVWMMAGIPLLLFRWKPLWGVRLRLRPCNLAHAETLVIEIRDKEDSSWQLFTVQVQTEAIGEGSLEPSPQSQAEDGRSQAAVGAVPEGAWKDTAQLHKSEEAVSVGQKRVLRYYLFQGQRYIWIETQQAFYQVSLLDHGRSCDDVHRSRHGLSLQDQMVRKAIYGPNVISIPVKSYPQLLVDEALNPYYGFQAFSIALWLADHYYWYALCIFLISSISICLSLYKTRKQSQTLRDMVKLSMRVCVCRPGGEEEWVDSSELVPGDCLVLPQEGGLMPCDAALVAGECMVNESSLTGESIPVLKTALPEGLGPYCAETHRRHTLFCGTLILQARAYVGPHVLAVVTRTGFCTAKGGLVSSILHPRPINFKFYKHSMKFVAALSVLALLGTIYSIFILYRNRVPLNEIVIRALDLVTVVVPPALPAAMTVCTLYAQSRLRRQGIFCIHPLRINLGGKLQLVCFDKTGTLTEDGLDVMGVVPLKGQAFLPLVPEPRRLPVGPLLRALATCHALSRLQDTPVGDPMDLKMVESTGWVLEEEPAADSAFGTQVLAVMRPPLWEPQLQAMEEPPVPVSVLHRFPFSSALQRMSVVVAWPGATQPEAYVKGSPELVAGLCNPETVPTDFAQMLQSYTAAGYRVVALASKPLPTVPSLEAAQQLTRDTVEGDLSLLGLLVMRNLLKPQTTPVIQALRRTRIRAVMVTGDNLQTAVTVARGCGMVAPQEHLIIVHATHPERGQPASLEFLPMESPTAVNGVKDPDQAASYTVEPDPRSRHLALSGPTFGIIVKHFPKLLPKVLVQGTVFARMAPEQKTELVCELQKLQYCVGMCGDGANDCGALKAADVGISLSQAEASVVSPFTSSMASIECVPMVIREGRCSLDTSFSVFKYMALYSLTQFISVLILYTINTNLGDLQFLAIDLVITTTVAVLMSRTGPALVLGRVRPPGALLSVPVLSSLLLQMVLVTGVQLGGYFLTLAQPWFVPLNRTVAAPDNLPNYENTVVFSLSSFQYLILAAAVSKGAPFRRPLYTNVPFLVALALLSSVLVGLVLVPGLLQGPLALRNITDTGFKLLLLGLVTLNFVGAFMLESVLDQCLPACLRRLRPKRASKKRFKQLERELAEQPWPPLPAGPLR,1180,NP_071372.1.csv,refseq-ATP13A2-NM_022089.3_clinical_seed_0_final,refseq-ATP13A2-NM_022089.3.a2m,Invitae,refseq-ATP13A2-NM_022089.3.npy,1,1180,1180
+NP_071378.1,MEENEVESSSDAAPGPGRPEEPSESGLGVGTSEAVSADSSDAAAAPGQAEADDSGVGQSSDRGSRSQEEVSESSSSADPLPNSYLPDSSSVSHGPVAGVTGGPPALVHSSALPDPNMLVSDCTASSSDLGSAIDKIIESTIGPDLIQNCITVTSAEDGGAETTRYLILQGPDDGAPMTSPMSSSTLAHSLAAIEALADGPTSTSTCLEAQGGPSSPVQLPPASGAEEPDLQSLEAMMEVVVVQQFKCKMCQYRSSTKATLLRHMRERHFRPVAAAAAAAGKKGRLRKWSTSTKSQEEEGPEEEDDDDIVDAGAIDDLEEDSDYNPAEDEPRGRQLRLQRPTPSTPRPRRRPGRPRKLPRLEISDLPDGVEGEPLVSSQSGQSPPEPQDPEAPSSSGPGHLVAMGKVSRTPVEAGVSQSDAENAAPSCPDEHDTLPRRRGRPSRRFLGKKYRKYYYKSPKPLLRPFLCRICGSRFLSHEDLRFHVNSHEAGDPQLFKCLQCSYRSRRWSSLKEHMFNHVGSKPYKCDECSYTSVYRKDVIRHAAVHSRDRKKRPDPTPKLSSFPCPVCGRVYPMQKRLTQHMKTHSTEKPHMCDKCGKSFKKRYTFKMHLLTHIQAVANRRFKCEFCEFVCEDKKALLNHQLSHVSDKPFKCSFCPYRTFREDFLLSHVAVKHTGAKPFACEYCHFSTRHKKNLRLHVRCRHASSFEEWGRRHPEEPPSRRRPFFSLQQIEELKQQHSAAPGPPPSSPGPPEIPPEATTFQSSEAPSLLCSDTLGGATIIYQQGAEESTAMATQTALDLLLNMSAQRELGGTALQVAVVKSEDVEAGLASPGGQPSPEGATPQVVTLHVAEPGGGAAAESQLGPPDLPQITLAPGPFGGTGYSVITAPPMEEGTSAPGTPYSEEPAGEAAQAVVVSDTLKEAGTHYIMATDGTQLHHIELTADGSISFPSPDALASGAKWPLLQCGGLPRDGPEPPSPAKTHCVGDSQSSASSPPATSKALGLAVPPSPPSAATAASKKFSCKICAEAFPGRAEMESHKRAHAGPGAFKCPDCPFSARQWPEVRAHMAQHSSLRPHQCSQCSFASKNKKDLRRHMLTHTKEKPFACHLCGQRFNRNGHLKFHIQRLHSPDGRKSGTPTARAPTQTPTQTIILNSDDETLATLHTALQSSHGVLGPERLQQALSQEHIIVAQEQTVTNQEEAAYIQEITTADGQTVQHLVTSDNQVQYIISQDGVQHLLPQEYVVVPEGHHIQVQEGQITHIQYEQGAPFLQESQIQYVPVSPGQQLVTQAQLEAAAHSAVTAVADAAMAQAQGLFGTDETVPEHIQQLQHQGIEYDVITLADD,1342,NP_071378.1.csv,refseq-ZNF335-NM_022095.3_clinical_seed_0_final,refseq-ZNF335-NM_022095.3.a2m,Invitae,refseq-ZNF335-NM_022095.3.npy,1,1342,1342
+NP_071398.5,MAEDGSEEIMFIWCEDCSQYHDSECPELGPVVMVKDSFVLSRARSWPASGHVHTQAGQGMRGYEDRDRADPQQLPEAVPAGLVRRLSGQQLPCRSTLTWGRLCHLVAQGRSSLPPNLEIRRLEDGAEGVFAITQLVKRTQFGPFESRRVAKWEKESAFPLKVFQKDGHPVCFDTSNEDDCNWMMLVRPAAEAEHQNLTAYQHGSDVYFTTSRDIPPGTELRVWYAAFYAKKMDKPMLKQAGSGVHAAGTPENSAPVESEPSQWACKVCSATFLELQLLNEHLLGHLEQAKSLPPGSQSEAAAPEKEQDTPRGEPPAVPESENVATKEQKKKPRRGRKPKVSKAEQPLVIVEDKEPTEQVAEIITEVPPDEPVSATPDERIMELVLGKLATTTTDTSSVPKFTHHQNNTITLKRSLILSSRHGIRRKLIKQLGEHKRVYQCNICSKIFQNSSNLSRHVRSHGDKLFKCEECAKLFSRKESLKQHVSYKHSRNEVDGEYRYRCGTCEKTFRIESALEFHNCRTDDKTFQCEMCFRFFSTNSNLSKHKKKHGDKKFACEVCSKMFYRKDVMLDHQRRHLEGVRRVKREDLEAGGENLVRYKKEPSGCPVCGKVFSCRSNMNKHLLTHGDKKYTCEICGRKFFRVDVLRDHIHVHFKDIALMDDHQREEFIGKIGISSEENDDNSDESADSEPHKYSCKRCQLTFGRGKEYLKHIMEVHKEKGYGCSICNRRFALKATYHAHMVIHRENLPDPNVQKYIHPCEICGRIFNSIGNLERHKLIHTGVKSHACEQCGKSFARKDMLKEHMRVHDNVREYLCAECGKGMKTKHALRHHMKLHKGIKEYECKECHRRFAQKVNMLKHCKRHTGIKDFMCELCGKTFSERNTMETHKLIHTVGKQWTCSVCDKKYVTEYMLQKHVQLTHDKVEAQSCQLCGTKVSTRASMSRHMRRKHPEVLAVRIDDLDHLPETTTIDASSIGIVQPELTLEQEDLAEGKHGKAAKRSHKRKQKPEEEAGAPVPEDATFSEYSEKETEFTGSVGDETNSAVQSIQQVVVTLGDPNVTTPSSSVGLTNITVTPITTAAATQFTNLQPVAVGHLTTPERQLQLDNSILTVTFDTVSGSAMLHNRQNDVQIHPQPEASNPQSVAHFINLTTLVNSITPLGSQLSDQHPLTWRAVPQTDVLPPSQPQAPPQQAAQPQVQAEQQQQQMYSY,1207,NP_071398.5.csv,NP_071398.5_clinical_seed_0_final,NP_071398.5.a2m,popEVE,NP_071398.5_theta_0.2.npy,1,1207,1207
+NP_071415.1,MWAVLRLALRPCARASPAGPRAYHGDSVASLGTQPDLGSALYQENYKQMKALVNQLHERVEHIKLGGGEKARALHISRGKLLPRERIDNLIDPGSPFLELSQFAGYQLYDNEEVPGGGIITGIGRVSGVECMIIANDATVKGGAYYPVTVKKQLRAQEIAMQNRLPCIYLVDSGGAYLPRQADVFPDRDHFGRTFYNQAIMSSKNIAQIAVVMGSCTAGGAYVPAMADENIIVRKQGTIFLAGPPLVKAATGEEVSAEDLGGADLHCRKSGVSDHWALDDHHALHLTRKVVRNLNYQKKLDVTIEPSEEPLFPADELYGIVGANLKRSFDVREVIARIVDGSRFTEFKAFYGDTLVTGFARIFGYPVGIVGNNGVLFSESAKKGTHFVQLCCQRNIPLLFLQNITGFMVGREYEAEGIAKDGAKMVAAVACAQVPKITLIIGGSYGAGNYGMCGRAYSPRFLYIWPNARISVMGGEQAANVLATITKDQRAREGKQFSSADEAALKEPIIKKFEEEGNPYYSSARVWDDGIIDPADTRLVLGLSFSAALNAPIEKTDFGIFRM,563,NP_071415.1.csv,refseq-MCCC2-NM_022132.4_clinical_seed_0_final,refseq-MCCC2-NM_022132.4.a2m,Invitae,refseq-MCCC2-NM_022132.4.npy,1,563,563
+NP_071417.2,MMSMLGGLQRYFRVILLLLLALTLLLLAGFLHSDLELDTPLFGGQAEGPPVTNIMFLKTHKTASSTVLNILYRFAETHNLSVALPAGSRVHLGYPWLFLARYVEGVGSQQRFNIMCNHLRFNLPQVQKVMPNDTFYFSILRNPVFQLESSFIYYKTYAPAFRGAPSLDAFLASPRTFYNDSRHLRNVYAKNNMWFDFGFDPNAQCEEGYVRARIAEVERRFRLVLIAEHLDESLVLLRRRLRWALDDVVAFRLNSRSARSVARLSPETRERARSWCALDWRLYEHFNRTLWAQLRAELGPRRLRGEVERLRARRRELASLCLQDGGALKNHTQIRDPRLRPYQSGKADILGYNLRPGLDNQTLGVCQRLVMPELQYMARLYALQFPEKPLKNIPFLGA,398,NP_071417.2.csv,refseq-GAL3ST2-NM_022134.2_clinical_seed_0_final,refseq-GAL3ST2-NM_022134.2.a2m,Invitae,refseq-GAL3ST2-NM_022134.2.npy,1,398,398
+NP_071437.3,MAPGRAVAGLLLLAAAGLGGVAEGPGLAFSEDVLSVFGANLSLSAAQLQHLLEQMGAASRVGVPEPGQLHFNQCLTAEEIFSLHGFSNATQITSSKFSVICPAVLQQLNFHPCEDRPKHKTRPSHSEVWGYGFLSVTIINLASLLGLILTPLIKKSYFPKILTFFVGLAIGTLFSNAIFQLIPEAFGFDPKVDSYVEKAVAVFGGFYLLFFFERMLKMLLKTYGQNGHTHFGNDNFGPQEKTHQPKALPAINGVTCYANPAVTEANGHIHFDNVSVVSLQDGKKEPSSCTCLKGPKLSEIGTIAWMITLCDALHNFIDGLAIGASCTLSLLQGLSTSIAILCEEFPHELGDFVILLNAGMSTRQALLFNFLSACSCYVGLAFGILVGNNFAPNIIFALAGGMFLYISLADMFPEMNDMLREKVTGRKTDFTFFMIQNAGMLTGFTAILLITLYAGEIELE,460,NP_071437.3.csv,refseq-SLC39A8-NM_022154.5_clinical_seed_0_final,refseq-SLC39A8-NM_022154.5.a2m,Invitae,refseq-SLC39A8-NM_022154.5.npy,1,460,460
+NP_071445.1,MGEEGGSASHDEEERASVLLGHSPGCEMCSQEAFQAQRSQLVELLVSGSLEGFESVLDWLLSWEVLSWEDYEGFHLLGQPLSHLARRLLDTVWNKGTWACQKLIAAAQEAQADSQSPKLHGCWDPHSLHPARDLQSHRPAIVRRLHSHVENMLDLAWERGFVSQYECDEIRLPIFTPSQRARRLLDLATVKANGLAAFLLQHVQELPVPLALPLEAATCKKYMAKLRTTVSAQSRFLSTYDGAETLCLEDIYTENVLEVWADVGMAGPPQKSPATLGLEELFSTPGHLNDDADTVLVVGEAGSGKSTLLQRLHLLWAAGQDFQEFLFVFPFSCRQLQCMAKPLSVRTLLFEHCCWPDVGQEDIFQLLLDHPDRVLLTFDGFDEFKFRFTDRERHCSPTDPTSVQTLLFNLLQGNLLKNARKVVTSRPAAVSAFLRKYIRTEFNLKGFSEQGIELYLRKRHHEPGVADRLIRLLQETSALHGLCHLPVFSWMVSKCHQELLLQEGGSPKTTTDMYLLILQHFLLHATPPDSASQGLGPSLLRGRLPTLLHLGRLALWGLGMCCYVFSAQQLQAAQVSPDDISLGFLVRAKGVVPGSTAPLEFLHITFQCFFAAFYLALSADVPPALLRHLFNCGRPGNSPMARLLPTMCIQASEGKDSSVAALLQKAEPHNLQITAAFLAGLLSREHWGLLAECQTSEKALLRRQACARWCLARSLRKHFHSIPPAAPGEAKSVHAMPGFIWLIRSLYEMQEERLARKAARGLNVGHLKLTFCSVGPTECAALAFVLQHLRRPVALQLDYNSVGDIGVEQLLPCLGVCKALYLRDNNISDRGICKLIECALHCEQLQKLALFNNKLTDGCAHSMAKLLACRQNFLALRLGNNYITAAGAQVLAEGLRGNTSLQFLGFWGNRVGDEGAQALAEALGDHQSLRWLSLVGNNIGSVGAQALALMLAKNVMLEELCLEENHLQDEGVCSLAEGLKKNSSLKILKLSNNCITYLGAEALLQALERNDTILEVWLRGNTFSLEEVDKLGCRDTRLLL,1040,NP_071445.1.csv,refseq-NOD2-NM_022162.2_clinical_seed_0_final,refseq-NOD2-NM_022162.2.a2m,Invitae,refseq-NOD2-NM_022162.2.npy,1,1040,1040
+NP_071449.1,MVAAPCARRLARRSHSALLAALTVLLLQTLVVWNFSSLDSGAGERRGGAAVGGGEQPPPAPAPRRERRDLPAEPAAARGGGGGGGGGGGGRGPQARARGGGPGEPRGQQPASRGALPARALDPHPSPLITLETQDGYFSHRPKEKVRTDSNNENSVPKDFENVDNSNFAPRTQKQKHQPELAKKPPSRQKELLKRKLEQQEKGKGHTFPGKGPGEVLPPGDRAAANSSHGKDVSRPPHARKTGGSSPETKYDQPPKCDISGKEAISALSRAKSKHCRQEIGETYCRHKLGLLMPEKVTRFCPLEGKANKNVQWDEDSVEYMPANPVRIAFVLVVHGRASRQLQRMFKAIYHKDHFYYIHVDKRSNYLHRQVLQVSRQYSNVRVTPWRMATIWGGASLLSTYLQSMRDLLEMTDWPWDFFINLSAADYPIRTNDQLVAFLSRYRDMNFLKSHGRDNARFIRKQGLDRLFLECDAHMWRLGDRRIPEGIAVDGGSDWFLLNRRFVEYVTFSTDDLVTKMKQFYSYTLLPAESFFHTVLENSPHCDTMVDNNLRITNWNRKLGCKCQYKHIVDWCGCSPNDFKPQDFHRFQQTARPTFFARKFEAVVNQEIIGQLDYYLYGNYPAGTPGLRSYWENVYDEPDGIHSLSDVTLTLYHSFARLGLRRAETSLHTDGENSCRYYPMGHPASVHLYFLADRFQGFLIKHHATNLAVSKLETLETWVMPKKVFKIASPPSDFGRLQFSEVGTDWDAKERLFRNFGGLLGPMDEPVGMQKWGKGPNVTVTVIWVDPVNVIAATYDILIESTAEFTHYKPPLNLPLRPGVWTVKILHHWVPVAETKFLVAPLTFSNRQPIKPEEALKLHNGPLRNAYMEQSFQSLNPVLSLPINPAQVEQARRNAASTGTALEGWLDSLVGGMWTAMDICATGPTACPVMQTCSQTAWSSFSPDPKSELGAVKPDGRLR,959,NP_071449.1.csv,refseq-XYLT1-NM_022166.3_clinical_seed_0_final,refseq-XYLT1-NM_022166.3.a2m,Invitae,refseq-XYLT1-NM_022166.3.npy,1,959,959
+NP_071450.2,MVASARVQKLVRRYKLAIATALAILLLQGLVVWSFSGLEEDEAGEKGRQRKPRPLDPGEGSKDTDSSAGRRGSTGRRHGRWRGRAESPGVPVAKVVRAVTSRQRASRRVPPAPPPEAPGRQNLSGAAAGEALVGAAGFPPHGDTGSVEGAPQPTDNGFTPKCEIVGKDALSALARASTKQCQQEIANVVCLHQAGSLMPKAVPRHCQLTGKMSPGIQWDESQAQQPMDGPPVRIAYMLVVHGRAIRQLKRLLKAVYHEQHFFYIHVDKRSDYLHREVVELAQGYDNVRVTPWRMVTIWGGASLLRMYLRSMRDLLEVPGWAWDFFINLSATDYPTRTNEELVAFLSKNRDKNFLKSHGRDNSRFIKKQGLDRLFHECDSHMWRLGERQIPAGIVVDGGSDWFVLTRSFVEYVVYTDDPLVAQLRQFYTYTLLPAESFFHTVLENSLACETLVDNNLRVTNWNRKLGCKCQYKHIVDWCGCSPNDFKPQDFLRLQQVSRPTFFARKFESTVNQEVLEILDFHLYGSYPPGTPALKAYWENTYDAADGPSGLSDVMLTAYTAFARLSLHHAATAAPPMGTPLCRFEPRGLPSSVHLYFYDDHFQGYLVTQAVQPSAQGPAETLEMWLMPQGSLKLLGRSDQASRLQSLEVGTDWDPKERLFRNFGGLLGPLDEPVAVQRWARGPNLTATVVWIDPTYVVATSYDITVDTETEVTQYKPPLSRPLRPGPWTVRLLQFWEPLGETRFLVLPLTFNRKLPLRKDDASWLHAGPPHNEYMEQSFQGLSSILNLPQPELAEEAAQRHTQLTGPALEAWTDRELSSFWSVAGLCAIGPSPCPSLEPCRLTSWSSLSPDPKSELGPVKADGRLR,865,NP_071450.2.csv,refseq-XYLT2-NM_022167.3_clinical_seed_0_final,refseq-XYLT2-NM_022167.3.a2m,Invitae,refseq-XYLT2-NM_022167.3.npy,1,865,865
+NP_071451.2,MSNGYSTDENFRYLISCFRARVKMYIQVEPVLDYLTFLPAEVKEQIQRTVATSGNMQAVELLLSTLEKGVWHLGWTREFVEALRRTGSPLAARYMNPELTDLPSPSFENAHDEYLQLLNLLQPTLVDKLLVRDVLDKCMEEELLTIEDRNRIAAAENNGNESGVRELLKRIVQKENWFSAFLNVLRQTGNNELVQELTGSDCSESNAEIENLSQVDGPQVEEQLLSTTVQPNLEKEVWGMENNSSESSFADSSVVSESDTSLAEGSVSCLDESLGHNSNMGSDSGTMGSDSDEENVAARASPEPELQLRPYQMEVAQPALEGKNIIICLPTGSGKTRVAVYIAKDHLDKKKKASEPGKVIVLVNKVLLVEQLFRKEFQPFLKKWYRVIGLSGDTQLKISFPEVVKSCDIIISTAQILENSLLNLENGEDAGVQLSDFSLIIIDECHHTNKEAVYNNIMRHYLMQKLKNNRLKKENKPVIPLPQILGLTASPGVGGATKQAKAEEHILKLCANLDAFTIKTVKENLDQLKNQIQEPCKKFAIADATREDPFKEKLLEIMTRIQTYCQMSPMSDFGTQPYEQWAIQMEKKAAKEGNRKERVCAEHLRKYNEALQINDTIRMIDAYTHLETFYNEEKDKKFAVIEDDSDEGGDDEYCDGDEDEDDLKKPLKLDETDRFLMTLFFENNKMLKRLAENPEYENEKLTKLRNTIMEQYTRTEESARGIIFTKTRQSAYALSQWITENEKFAEVGVKAHHLIGAGHSSEFKPMTQNEQKEVISKFRTGKINLLIATTVAEEGLDIKECNIVIRYGLVTNEIAMVQARGRARADESTYVLVAHSGSGVIEHETVNDFREKMMYKAIHCVQNMKPEEYAHKILELQMQSIMEKKMKTKRNIAKHYKNNPSLITFLCKNCSVLACSGEDIHVIEKMHHVNMTPEFKELYIVRENKALQKKCADYQINGEIICKCGQAWGTMMVHKGLDLPCLKIRNFVVVFKNNSTKKQYKKWVELPITFPNLDYSECCLFSDED,1025,NP_071451.2.csv,refseq-IFIH1-NM_022168.3_clinical_seed_0_final,refseq-IFIH1-NM_022168.3.a2m,Invitae,refseq-IFIH1-NM_022168.3.npy,1,1025,1025
+NP_071505.2,MEDEMPKTLYVGNLSRDVTEALILQLFSQIGPCKNCKMIMDTAGNDPYCFVEFHEHRHAAAALAAMNGRKIMGKEVKVNWATTPSSQKKDTSSSTVVSTQRSQDHFHVFVGDLSPEITTEDIKAAFAPFGRISDARVVKDMATGKSKGYGFVSFFNKWDAENAIQQMGGQWLGGRQIRTNWATRKPPAPKSTYESNTKQLSYDEVVNQSSPSNCTVYCGGVTSGLTEQLMRQTFSPFGQIMEIRVFPDKGYSFVRFNSHESAAHAIVSVNGTTIEGHVVKCYWGKETLDMINPVQQQNQIGYPQPYGQWGQWYGNAQQIGQYMPNGWQVPAYGMYGQAWNQQGFNQTQSSAPWMGPNYGVQPPQGQNGSMLPNQPSGYRVAGYETQ,386,NP_071505.2.csv,refseq-TIA1-NM_022173.2_clinical_seed_0_final,refseq-TIA1-NM_022173.2.a2m,Invitae,refseq-TIA1-NM_022173.2.npy,1,386,386
+NP_071731.1,MAHVGDCTQTPWLPVLVVSLMCSARAEYSNCGENEYYNQTTGLCQECPPCGPGEEPYLSCGYGTKDEDYGCVPCPAEKFSKGGYQICRRHKDCEGFFRATVLTPGDMENDAECGPCLPGYYMLENRPRNIYGMVCYSCLLAPPNTKECVGATSGASANFPGTSGSSTLSPFQHAHKELSGQGHLATALIIAMSTIFIMAIAIVLIIMFYILKTKPSAPACCTSHPGKSVEAQVSKDEEKKEAPDNVVMFSEKDEFEKLTATPAKPTKSENDASSENEQLLSRSVDSDEEPAPDKQGSPELCLLSLVHLAREKSATSNKSAGIQSRRKKILDVYANVCGVVEGLSPTELPFDCLEKTSRMLSSTYNSEKAVVKTWRHLAESFGLKRDEIGGMTDGMQLFDRISTAGYSIPELLTKLVQIERLDAVESLCADILEWAGVVPPASQPHAAS,448,NP_071731.1.csv,refseq-EDAR-NM_022336.3_clinical_seed_0_final,refseq-EDAR-NM_022336.3.a2m,Invitae,refseq-EDAR-NM_022336.3.npy,1,448,448
+NP_071764.3,MSSQPAGNQTSPGATEDYSYGSWYIDEPQGGEELQPEGEVPSCHTSIPPGLYHACLASLSILVLLLLAMLVRRRQLWPDCVRGRPGLPSPVDFLAGDRPRAVPAAVFMVLLSSLCLLLPDEDALPFLTLASAPSQDGKTEAPRGAWKILGLFYYAALYYPLAACATAGHTAAHLLGSTLSWAHLGVQVWQRAECPQVPKIYKYYSLLASLPLLLGLGFLSLWYPVQLVRSFSRRTGAGSKGLQSSYSEEYLRNLLCRKKLGSSYHTSKHGFLSWARVCLRHCIYTPQPGFHLPLKLVLSATLTGTAIYQVALLLLVGVVPTIQKVRAGVTTDVSYLLAGFGIVLSEDKQEVVELVKHHLWALEVCYISALVLSCLLTFLVLMRSLVTHRTNLRALHRGAALDLSPLHRSPHPSRQAIFCWMSFSAYQTAFICLGLLVQQIIFFLGTTALAFLVLMPVLHGRNLLLFRSLESSWPFWLTLALAVILQNMAAHWVFLETHDGHPQLTNRRVLYAATFLLFPLNVLVGAMVATWRVLLSALYNAIHLGQMDLSLLPPRAATLDPGYYTYRNFLKIEVSQSHPAMTAFCSLLLQAQSLLPRTMAAPQDSLRPGEEDEGMQLLQTKDSMAKGARPGASRGRARWGLAYTLLHNPTLQVFRKTALLGANGAQP,667,NP_071764.3.csv,refseq-STRA6-NM_022369.3_clinical_seed_0_final,refseq-STRA6-NM_022369.3.a2m,Invitae,refseq-STRA6-NM_022369.3.npy,1,667,667
+NP_071765.2,MLRYLLKTLLQMNLFADSLAGDISNSSELLLGFNSSLAALNHTLLPPGDPSLNGSRVGPEDAMPRIVEQPPDLLVSRGEPATLPCRAEGRPRPNIEWYKNGARVATVREDPRAHRLLLPSGALFFPRIVHGRRARPDEGVYTCVARNYLGAAASRNASLEVAVLRDDFRQSPGNVVVAVGEPAVLECVPPRGHPEPSVSWRKDGARLKEEEGRITIRGGKLMMSHTLKSDAGMYVCVASNMAGERESAAAEVMVLERPSFLRRPVNQVVLADAPVTFLCEVKGDPPPRLRWRKEDGELPTGRYEIRSDHSLWIGHVSAEDEGTYTCVAENSVGRAEASGSLSVHVPPQLVTQPQDQMAAPGESVAFQCETKGNPPPAIFWQKEGSQVLLFPSQSLQPTGRFSVSPRGQLNITAVQRGDAGYYVCQAVSVAGSILAKALLEIKGASLDGLPPVILQGPANQTLVLGSSVWLPCRVTGNPQPSVRWKKDGQWLQGDDLQFKTMANGTLYIANVQEMDMGFYSCVAKSSTGEATWSGWLKMREDWGVSPDPPTEPSSPPGAPSQPVVTEITKNSITLTWKPNPQTGAAVTSYVIEAFSPAAGNTWRTVADGVQLETHTVSGLQPNTIYLFLVRAVGAWGLSEPSPVSEPVRTQDSSPSRPVEDPWRGQQGLAEVAVRLQEPIVLGPRTLQVSWTVDGPVQLVQGFRVSWRVAGPEGGSWTMLDLQSPSQQSTVLRGLPPGTQIQIKVQAQGQEGLGAESLSVTRSIPEEAPSGPPQGVAVALGGDGNSSITVSWEPPLPSQQNGVITEYQIWCLGNESRFHLNRSAAGWARSAMLRGLVPGLLYRTLVAAATSAGVGVPSAPVLVQLPSPPDLEPGLEVGAGLAVRLARVLREPAFLAGSGAACGALLLGLCAALYWRRKQRKELSHYTASFAYTPAVSFPHSEGLSGASSRPPMGLGPAPYSWLADSWPHPSRSPSAQEPRGSCCPSNPDPDDRYYNEAGISLYLAQTARGTAAPGEGPVYSTIDPAGEELQTFHGGFPQHPSGDLGPWSQYAPPEWSQGDSGAKGGKVKLLGKPVQMPSLNWPEALPPPPPSCELSCLEGPEEELEGSSEPEEWCPPMPERSHLTEPSSSGGCLVTPSRRETPSPTPSYGQQSTATLTPSPPDPPQPPTDMPHLHQMPRRVPLGPSSPLSVSQPMLGIREARPAGLGAGPAASPHLSPSPAPSTASSAPGRTWQGNGEMTPPLQGPRARFRKKPKALPYRRENSPGDLPPPPLPPPEEEASWALELRAAGSMSSLERERSGERKAVQAVPLAAQRVLHPDEEAWLPYSRPSFLSRGQGTSTCSTAGSNSSRGSSSSRGSRGPGRSRSRSQSRSQSQRPGQKRREEPR,1386,NP_071765.2.csv,refseq-ROBO3-NM_022370.3_clinical_seed_0_final,refseq-ROBO3-NM_022370.3.a2m,Invitae,refseq-ROBO3-NM_022370.3.npy,1,1386,1386
+NP_071801.1,MERKISRIHLVSEPSITHFLQVSWEKTLESGFVITLTDGHSAWTGTVSESEISQEADDMAMEKGKYVGELRKALLSGAGPADVYTFNFSKESCYFFFEKNLKDVSFRLGSFNLEKVENPAEVIRELICYCLDTIAENQAKNEHLQKENERLLRDWNDVQGRFEKCVSAKEALETDLYKRFILVLNEKKTKIRSLHNKLLNAAQEREKDIKQEGETAICSEMTADRDPVYDESTDEESENQTDLSGLASAAVSKDDSIISSLDVTDIAPSRKRRQRMQRNLGTEPKMAPQENQLQEKENSRPDSSLPETSKKEHISAENMSLETLRNSSPEDLFDEI,336,NP_071801.1.csv,refseq-XRCC4-NM_022406.3_clinical_seed_0_final,refseq-XRCC4-NM_022406.3.a2m,Invitae,refseq-XRCC4-NM_022406.3.npy,1,336,336
+NP_071881.1,MGDLSSLTPGGSMGLQVNRGSQSSLEGAPATAPEPHSLGILHASYSVSHRVRPWWDITSCRQQWTRQILKDVSLYVESGQIMCILGSSGSGKTTLLDAMSGRLGRAGTFLGEVYVNGRALRREQFQDCFSYVLQSDTLLSSLTVRETLHYTALLAIRRGNPGSFQKKVEAVMAELSLSHVADRLIGNYSLGGISTGERRRVSIAAQLLQDPKVMLFDEPTTGLDCMTANQIVVLLVELARRNRIVVLTIHQPRSELFQLFDKIAILSFGELIFCGTPAEMLDFFNDCGYPCPEHSNPFDFYMDLTSVDTQSKEREIETSKRVQMIESAYKKSAICHKTLKNIERMKHLKTLPMVPFKTKDSPGVFSKLGVLLRRVTRNLVRNKLAVITRLLQNLIMGLFLLFFVLRVRSNVLKGAIQDRVGLLYQFVGATPYTGMLNAVNLFPVLRAVSDQESQDGLYQKWQMMLAYALHVLPFSVVATMIFSSVCYWTLGLHPEVARFGYFSAALLAPHLIGEFLTLVLLGIVQNPNIVNSVVALLSIAGVLVGSGFLRNIQEMPIPFKIISYFTFQKYCSEILVVNEFYGLNFTCGSSNVSVTTNPMCAFTQGIQFIEKTCPGATSRFTMNFLILYSFIPALVILGIVVFKIRDHLISR,651,NP_071881.1.csv,refseq-ABCG5-NM_022436.2_clinical_seed_0_final,refseq-ABCG5-NM_022436.2.a2m,Invitae,refseq-ABCG5-NM_022436.2.npy,1,651,651
+NP_071882.1,MAGKAAEERGLPKGATPQDTSGLQDRLFSSESDNSLYFTYSGQPNTLEVRDLNYQVDLASQVPWFEQLAQFKMPWTSPSCQNSCELGIQNLSFKVRSGQMLAIIGSSGCGRASLLDVITGRGHGGKIKSGQIWINGQPSSPQLVRKCVAHVRQHNQLLPNLTVRETLAFIAQMRLPRTFSQAQRDKRVEDVIAELRLRQCADTRVGNMYVRGLSGGERRRVSIGVQLLWNPGILILDEPTSGLDSFTAHNLVKTLSRLAKGNRLVLISLHQPRSDIFRLFDLVLLMTSGTPIYLGAAQHMVQYFTAIGYPCPRYSNPADFYVDLTSIDRRSREQELATREKAQSLAALFLEKVRDLDDFLWKAETKDLDEDTCVESSVTPLDTNCLPSPTKMPGAVQQFTTLIRRQISNDFRDLPTLLIHGAEACLMSMTIGFLYFGHGSIQLSFMDTAALLFMIGALIPFNVILDVISKCYSERAMLYYELEDGLYTTGPYFFAKILGELPEHCAYIIIYGMPTYWLANLRPGLQPFLLHFLLVWLVVFCCRIMALAAAALLPTFHMASFFSNALYNSFYLAGGFMINLSSLWTVPAWISKVSFLRWCFEGLMKIQFSRRTYKMPLGNLTIAVSGDKILSVMELDSYPLYAIYLIVIGLSGGFMVLYYVSLRFIKQKPSQDW,673,NP_071882.1.csv,refseq-ABCG8-NM_022437.2_clinical_seed_0_final,refseq-ABCG8-NM_022437.2.a2m,Invitae,refseq-ABCG8-NM_022437.2.npy,1,673,673
+NP_071890.2,MEHAFTPLEPLLSTGNLKYCLVILNQPLDNYFRHLWNKALLRACADGGANRLYDITEGERESFLPEFINGDFDSIRPEVREYYATKGCELISTPDQDHTDFTKCLKMLQKKIEEKDLKVDVIVTLGGLAGRFDQIMASVNTLFQATHITPFPIIIIQEESLIYLLQPGKHRLHVDTGMEGDWCGLIPVGQPCMQVTTTGLKWNLTNDVLAFGTLVSTSNTYDGSGVVTVETDHPLLWTMAIKS,243,NP_071890.2.csv,refseq-TPK1-NM_022445.3_clinical_seed_0_final,refseq-TPK1-NM_022445.3.a2m,Invitae,refseq-TPK1-NM_022445.3.npy,1,243,243
+NP_071899.1,MSSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPDPAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPSQPAELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYCNYPDV,414,NP_071899.1.csv,refseq-SOX17-NM_022454.3_clinical_seed_0_final,refseq-SOX17-NM_022454.3.a2m,Invitae,refseq-SOX17-NM_022454.3.npy,1,414,414
+NP_071919.2,MTTSHMNGHVTEESDSEVKNVDLASPEEHQKHREMAVDCPGDLGTRMMPIRRSAQLERIRQQQEDMRRRREEEGKKQELDLNSSMRLKKLAQIPPKTGIDNPMFDTEEGIVLESPHYAVKILEIEDLFSSLKHIQHTLVDSQSQEDISLLLQLVQNKDFQNAFKIHNAITVHMNKASPPFPLISNAQDLAQEVQTVLKPVHHKEGQELTALLNTPHIQALLLAHDKVAEQEMQLEPITDERVYESIGQYGGETVKIVRIEKARDIPLGATVRNEMDSVIISRIVKGGAAEKSGLLHEGDEVLEINGIEIRGKDVNEVFDLLSDMHGTLTFVLIPSQQIKPPPAKETVIHVKAHFDYDPSDDPYVPCRELGLSFQKGDILHVISQEDPNWWQAYREGDEDNQPLAGLVPGKSFQQQREAMKQTIEEDKEPEKSGKLWCAKKNKKKRKKVLYNANKNDDYDNEEILTYEEMSLYHQPANRKRPIILIGPQNCGQNELRQRLMNKEKDRFASAVPHTTRSRRDQEVAGRDYHFVSRQAFEADIAAGKFIEHGEFEKNLYGTSIDSVRQVINSGKICLLSLRTQSLKTLRNSDLKPYIIFIAPPSQERLRALLAKEGKNPKPEELREIIEKTREMEQNNGHYFDTAIVNSDLDKAYQELLRLINKLDTEPQWVPSTWLR,675,NP_071919.2.csv,refseq-MPP5-NM_022474.3_clinical_seed_0_final,refseq-MPP5-NM_022474.3.a2m,Invitae,refseq-MPP5-NM_022474.3_theta_0.2.npy,1,675,675
+NP_071934.3,MSVKEGAQRKWAALKEKLGPQDSDPTEANLESADPELCIRLLQMPSVVNYSGLRKRLEGSDGGWMVQFLEQSGLDLLLEALARLSGRGVARISDALLQLTCVSCVRAVMNSRQGIEYILSNQGYVRQLSQALDTSNVMVKKQVFELLAALCIYSPEGHVLTLDALDHYKTVCSQQYRFSIVMNELSGSDNVPYVVTLLSVINAVILGPEDLRARTQLRNEFIGLQLLDVLARLRDLEDADLLIQLEAFEEAKAEDEEELLRVSGGVDMSSHQEVFASLFHKVSCSPVSAQLLSVLQGLLHLEPTLRSSQLLWEALESLVNRAVLLASDAQECTLEEVVERLLSVKGRPRPSPLVKAHKSVQANLDQSQRGSSPQNTTTPKPSVEGQQPAAAAACEPVDHAQSESILKVSQPRALEQQASTPPPPPPPPLLPGSSAEPPPPPPPPPLPSVGAKALPTAPPPPPLPGLGAMAPPAPPLPPPLPGSCEFLPPPPPPLPGLGCPPPPPPLLPGMGWGPPPPPPPLLPCTCSPPVAGGMEEVIVAQVDHGLGSAWVPSHRRVNPPTLRMKKLNWQKLPSNVAREHNSMWASLSSPDAEAVEPDFSSIERLFSFPAAKPKEPTMVAPRARKEPKEITFLDAKKSLNLNIFLKQFKCSNEEVAAMIRAGDTTKFDVEVLKQLLKLLPEKHEIENLRAFTEERAKLASADHFYLLLLAIPCYQLRIECMLLCEGAAAVLDMVRPKAQLVLAACESLLTSRQLPIFCQLILRIGNFLNYGSHTGDADGFKISTLLKLTETKSQQNRVTLLHHVLEEAEKSHPDLLQLPRDLEQPSQAAGINLEIIRSEASSNLKKLLETERKVSASVAEVQEQYTERLQASISAFRALDELFEAIEQKQRELADYLCEDAQQLSLEDTFSTMKAFRDLFLRALKENKDRKEQAAKAERRKQQLAEEEARRPRGEDGKPVRKGPGKQEEVCVIDALLADIRKGFQLRKTARGRGDTDGGSKAASMDPPRATEPVATSNPAGDPVGSTRCPASEPGLDATTASESRGWDLVDAVTPGPQPTLEQLEEGGPRPLERRSSWYVDASDVLTTEDPQCPQPLEGAWPVTLGDAQALKPLKFSSNQPPAAGSSRQDAKDPTSLLGVLQAEADSTSEGLEDAVHSRGARPPAAGPGGDEDEDEEDTAPESALDTSLDKSFSEDAVTDSSGSGTLPRARGRASKGTGKRRKKRPSRSQEEVPPDSDDNKTKKLCVIQ,1249,NP_071934.3.csv,refseq-INF2-NM_022489.3_clinical_seed_0_final,refseq-INF2-NM_022489.3.a2m,Invitae,refseq-INF2-NM_022489.3.npy,1,1249,1249
+NP_073563.1,MGLLDSEPGSVLNVVSTALNDTVEFYRWTWSIADKRVENWPLMQSPWPTLSISTLYLLFVWLGPKWMKDREPFQMRLVLIIYNFGMVLLNLFIFRELFMGSYNAGYSYICQSVDYSNNVHEVRIAAALWWYFVSKGVEYLDTVFFILRKKNNQVSFLHVYHHCTMFTLWWIGIKWVAGGQAFFGAQLNSFIHVIMYSYYGLTAFGPWIQKYLWWKRYLTMLQLIQFHVTIGHTALSLYTDCPFPKWMHWALIAYAISFIFLFLNFYIRTYKEPKKPKAGKTAMNGISANGVSKSEKQLMIENGKKQKNGKAKGD,314,NP_073563.1.csv,refseq-ELOVL4-NM_022726.3_clinical_seed_0_final,refseq-ELOVL4-NM_022726.3.a2m,Invitae,refseq-ELOVL4-NM_022726.3.npy,1,314,314
+NP_073623.1,MGNGGRSGLQQGKGNVDGVAATPTAASASCQYRCIECNQEAKELYRDYNHGVLKITICKSCQKPVDKYIEYDPVIILINAILCKAQAYRHILFNTQINIHGKLCIFCLLCEAYLRWWQLQDSNQNTAPDDLIRYAKEWDFYRMFAIAALEQTAYFIGIFTFLWVERPMTAKKKPNFILLLKALLLSSYGKLLLIPAVIWEHDYTSVCLKLIKVFVLTSNFQAIRVTLNINRKLSFLAVLSGLLLESIMVYFFQSMEWDVGSDYAIFKSQDF,271,NP_073623.1.csv,refseq-ARV1-NM_022786.1_clinical_seed_0_final,refseq-ARV1-NM_022786.1.a2m,Invitae,refseq-ARV1-NM_022786.1.npy,1,271,271
+NP_073624.2,MENSEKTEVVLLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGLIPAYHRVIMAELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEASDCDHQQNSPTLERPGRKRKWTETQDSSQKKSLEPKTKAVPKVKLLCGADLLESFAVPNLWKSEDITQIVANYGLICVTRAGNDAQKFIYESDVLWKHRSNIHVVNEWIANDISSTKIRRALRRGQSIRYLVPDLVQEYIEKHNLYSSESEDRNAGVILAPLQRNTAEAKT,279,NP_073624.2.csv,refseq-NMNAT1-NM_022787.3_clinical_seed_0_final,refseq-NMNAT1-NM_022787.3.a2m,Invitae,refseq-NMNAT1-NM_022787.3.npy,1,279,279
+NP_073625.1,MQAVDNLTSAPGNTSLCTRDYKITQVLFPLLYTVLFFVGLITNGLAMRIFFQIRSKSNFIIFLKNTVISDLLMILTFPFKILSDAKLGTGPLRTFVCQVTSVIFYFTMYISISFLGLITIDRYQKTTRPFKTSNPKNLLGAKILSVVIWAFMFLLSLPNMILTNRQPRDKNVKKCSFLKSEFGLVWHEIVNYICQVIFWINFLIVIVCYTLITKELYRSYVRTRGVGKVPRKKVNVKVFIIIAVFFICFVPFHFARIPYTLSQTRDVFDCTAENTLFYVKESTLWLTSLNACLDPFIYFFLCKSFRNSLISMLKCPNSATSLSQDNRKKEQDGGDPNEETPM,342,NP_073625.1.csv,refseq-P2RY12-NM_022788.4_clinical_seed_0_final,refseq-P2RY12-NM_022788.4.a2m,Invitae,refseq-P2RY12-NM_022788.4.npy,1,342,342
+NP_073741.3,MAAVDSDVESLPRGGFRCCLCHVTTANRPSLDAHLGGRKHRHLVELRAARKAQGLRSVFVSGFPRDVDSAQLSEYFLAFGPVASVVMDKDKGVFAIVEMGDVGAREAVLSQSQHSLGGHRLRVRPREQKEFQSPASKSPKGAAPDSHQLAKALAEAADVGAQMIKLVGLRELSEAERQLRSLVVALMQEVFTEFFPGCVVHPFGSSINSFDVHGCDLDLFLDLGDLEEPQPVPKAPESPSLDSALASPLDPQALACTPASPPDSQPPASPQDSEALDFETPSSSLAPQTPDSALASETLASPQSLPPASPLLEDREEGDLGKASELAETPKEEKAEGAAMLELVGSILRGCVPGVYRVQTVPSARRPVVKFCHRPSGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSGSGPLLSNYALTLLVIYFLQTRDPPVLPTVSQLTQKAGEGEQVEVDGWDCSFPRDASRLEPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLGPLNLQDPFDLSHNVAANVTSRVAGRLQNCCRAAANYCRSLQYQRRSSRGRDWGLLPLLQPSSPSSLLSATPIPLPLAPFTQLTAALVQVFREALGCHIEQATKRTRSEGGGTGESSQGGTSKRLKVDGQKNCCEEGKEEQQGCAGDGGEDRVEEMVIEVGEMVQDWAMQSPGQPGDLPLTTGKHGAPGEEGQPSHAALAERGPKGHEAAQEWSQGEAGKGASLPSSASWRCALWHRVWQGRRRARRRLQQQTKEGAGGGAGTRAGWLATEAQVTQELKGLSGGEERPETEPLLSFVASVSPADRMLTVTPLQDPQGLFPDLHHFLQVFLPQAIRHLK,874,NP_073741.3.csv,STPAP_HUMAN_b03_clinical_seed_0_final,STPAP_HUMAN_b03.a2m,EVE,STPAP_HUMAN_b03_theta_0.2.npy,1,874,874
+NP_073746.2,MPEGAQGLSLSKPSPSLGCGRRGEVCDCGTVCETRTAPAAPTMASPRGSGSSTSLSTVGSEGDPAPGPTPACSASRPEPLPGPPIRLHLSPVGIPGSARPSRLERVAREIVETERAYVRDLRSIVEDYLGPLLDGGVLGLSVEQVGTLFANIEDIYEFSSELLEDLENSSSAGGIAECFVQRSEDFDIYTLYCMNYPSSLALLRELSLSPPAALWLQERQAQLRHSLPLQSFLLKPVQRILKYHLLLQELGKHWAEGPGTGGREMVEEAIVSMTAVAWYINDMKRKQEHAARLQEVQRRLGGWTGPELSAFGELVLEGAFRGGGGGGPRLRGGERLLFLFSRMLLVAKRRGLEYTYKGHIFCCNLSVSESPRDPLGFKVSDLTIPKHRHLLQAKNQEEKRLWIHCLQRLFFENHPASIPAKAKQVLLENSLHCAPKSKPVLEPLTPPLGSPRPRDARSFTPGRRNTAPSPGPSVIRRGRRQSEPVKDPYVMFPQNAKPGFKHAGSEGELYPPESQPPVSGSAPPEDLEDAGPPTLDPSGTSITEEILELLNQRGLRDPGPSTHDIPKFPGDSQVPGDSETLTFQALPSRDSSEEEEEEEEGLEMDERGPSPLHVLEGLESSIAAEMPSIPCLTKIPDVPNLPEIPSRCEIPEGSRLPSLSDISDVFEMPCLPAIPSVPNTPSLSSTPTLSCDSWLQGPLQEPAEAPATRRELFSGSNPGKLGEPPSGGKAGPEEDEEGVSFTDFQPQDVTQHQGFPDELAFRSCSEIRSAWQALEQGQLARPGFPEPLLILEDSDLGGDSGSGKAGAPSSERTASRVRELARLYSERIQQMQRAETRASANAPRRRPRVLAQPQPSPCLPQEQAEPGLLPAFGHVLVCELAFPLTCAQESVPLGPAVWVQAAIPLSKQGGSPDGQGLHVSNLPKQDLPGIHVSAATLLPEQGGSRHVQAPAATPLPKQEGPLHLQVPALTTFSDQGHPEIQVPATTPLPEHRSHMVIPAPSTAFCPEQGHCADIHVPTTPALPKEICSDFTVSVTTPVPKQEGHLDSESPTNIPLTKQGGSRDVQGPDPVCSQPIQPLSWHGSSLDPQGPGDTLPPLPCHLPDLQIPGTSPLPAHGSHLDHRIPANAPLSLSQELPDTQVPATTPLPLPQVLTDIWVQALPTSPKQGSLPDIQGPAAAPPLPEPSLTDTQVQKLTPSLEQKSLIDAHVPAATPLPERGGSLDIQGLSPTPVQTTMVLSKPGGSLASHVARLESSDLTPPHSPPPSSRQLLGPNAAALSRYLAASYISQSLARRQGPGGGAPAASRGSWSSAPTSRASSPPPQPQPPPPPARRLSYATTVNIHVGGGGRLRPAKAQVRLNHPALLASTQESMGLHRAQGAPDAPFHM,1386,NP_073746.2.csv,refseq-PLEKHG2-NM_022835.2_clinical_seed_0_final,refseq-PLEKHG2-NM_022835.2.a2m,Invitae,refseq-PLEKHG2-NM_022835.2.npy,1,1386,1386
+NP_073747.1,MNGVLIPHTPIAVDFWSLRRAGTARLFFLSHMHSDHTVGLSSTWARPLYCSPITAHLLHRHLQVSKQWIQALEVGESHVLPLDEIGQETMTVTLLDANHCPGSVMFLFEGYFGTILYTGDFRYTPSMLKEPALTLGKQIHTLYLDNTNCNPALVLPSRQEAAHQIVQLIRKHPQHNIKIGLYSLGKESLLEQLALEFQTWVVLSPRRLELVQLLGLADVFTVEEKAGRIHAVDHMEICHSNMLRWNQTHPTIAILPTSRKIHSSHPDIHVIPYSDHSSYSELRAFVAALKPCQVVPIVSRRPCGGFQDSLSPRISVPLIPDSVQQYMSSSSRKPSLLWLLERRLKRPRTQGVVFESPEESADQSQADRDSKKAKKEKLSPWPADLEKQPSHHPLRIKKQLFPDLYSKEWNKAVPFCESQKRVTMLTAPLGFSVHLRSTDEEFISQKTREEIGLGSPLVPMGDDDGGPEATGNQSAWMGHGSPLSHSSKGTPLLATEFRGLALKYLLTPVNFFQAGYSSRRFDQQVEKYHKPC,532,NP_073747.1.csv,refseq-DCLRE1B-NM_022836.3_clinical_seed_0_final,refseq-DCLRE1B-NM_022836.3.a2m,Invitae,refseq-DCLRE1B-NM_022836.3.npy,1,532,532
+NP_073752.6,MAEEQQQPPPQQPDAHQQLPPSAPNSGVALPALVPGLPGTEASALQHKIKNSICKTVQSKVDCILQEVEKFTDLEKLYLYLQLPSGLSNGEKSDQNAMSSSRAQQMHAFSWIRNTLEEHPETSLPKQEVYDEYKSYCDNLGYHPLSAADFGKIMKNVFPNMKARRLGTRGKSKYCYSGLRKKAFVHMPTLPNLDFHKTGDGLEGAEPSGQLQNIDEEVISSACRLVCEWAQKVLSQPFDTVLELARFLVKSHYIGTKSMAALTVMAAAPAGMKGITQPSAFIPTAESNSFQPQVKTLPSPIDAKQQLQRKIQKKQQEQKLQSPLPGESAAKKSESATSNGVTNLPNGNPSILSPQPIGIVVAAVPSPIPVQRTRQLVTSPSPMSSSDGKVLPLNVQVVTQHMQSVKQAPKTPQNVPASPGGDRSARHRYPQILPKPANTSALTIRSPTTVLFTSSPIKTAVVPASHMSSLNVVKMTTISLTPSNSNTPLKHSASVSSATGTTEESRSVPQIKNGSVVSLQSPGSRSSSAGGTSAVEVKVEPETSSDEHPVQCQENSDEAKAPQTPSALLGQKSNTDGALQKPSNEGVIEIKATKVCDQRTKCKSRCNEMLPGTSTGNNQSTITLSVASQNLTFTSSSSPPNGDSINKDPKLCTKSPRKRLSSTLQETQVPPVKKPIVEQLSAATIEGQKQGSVKKDQKVPHSGKTEGSTAGAQIPSKVSVNVSSHIGANQPLNSSALVISDSALEQQTTPSSSPDIKVKLEGSVFLLDSDSKSVGSFNPNGWQQITKDSEFISASCEQQQDISVMTIPEHSDINDLEKSVWELEGMPQDTYSQQLHSQIQESSLNQIQAHSSDQLPLQSELKEFEPSVSQTNESYFPFDDELTQDSIVEELVLMEQQMSMNNSHSYGNCLGMTLQSQSVTPGAPMSSHTSSTHFYHPIHSNGTPIHTPTPTPTPTPTPTPTPTPTSEMIAGSQSLSRESPCSRLAQTTPVDSALGSSRHTPIGTPHSNCSSSVPPSPVECRNPFAFTPISSSMAYHDASIVSSSPVKPMQRPMATHPDKTKLEWMNNGYSGVGNSSVSGHGILPSYQELVEDRFRKPHAFAVPGQSYQSQSRHHDTHFGRLTPVSPVQHQGATVNNTNKQEGFAVPAPLDNKGTNSSASSNFRCRSVSPAVHRQRNLSGSTLYPVSNIPRSNVTPFGSPVTPEVHVFTNVHTDACANNIAQRSQSVPLTVMMQTAFPNALQKQANSKKITNVLLSKLDSDNDDAVRGLGMNNLPSNYTARMNLTQILEPSTVFPSANPQNMIDSSTSVYEFQTPSYLTKSNSTGQINFSPGDNQAQSEIGEQQLDFNSTVKDLLSGDSLQTNQQLVGQGASDLTNTASDFSSDIRLSSELSGSINDLNTLDPNLLFDPGRQQGQDDEATLEELKNDPLFQQICSESMNSMTSSGFEWIESKDHPTVEMLG,1460,NP_073752.6.csv,NP_073752.6_clinical_seed_0_final,NP_073752.6.a2m,popEVE,NP_073752.6_theta_0.2.npy,1,1460,1460
+NP_075044.2,MSRRKQGKPQHLSKREFSPEPLEAILTDDEPDHGPLGAPEGDHDLLTCGQCQMNFPLGDILIFIEHKRKQCNGSLCLEKAVDKPPSPSPIEMKKASNPVEVGIQVTPEDDDCLSTSSRGICPKQEHIADKLLHWRGLSSPRSAHGALIPTPGMSAEYAPQGICKDEPSSYTCTTCKQPFTSAWFLLQHAQNTHGLRIYLESEHGSPLTPRVGIPSGLGAECPSQPPLHGIHIADNNPFNLLRIPGSVSREASGLAEGRFPPTPPLFSPPPRHHLDPHRIERLGAEEMALATHHPSAFDRVLRLNPMAMEPPAMDFSRRLRELAGNTSSPPLSPGRPSPMQRLLQPFQPGSKPPFLATPPLPPLQSAPPPSQPPVKSKSCEFCGKTFKFQSNLVVHRRSHTGEKPYKCNLCDHACTQASKLKRHMKTHMHKSSPMTVKSDDGLSTASSPEPGTSDLVGSASSALKSVVAKFKSENDPNLIPENGDEEEEEDDEEEEEEEEEEEEELTESERVDYGFGLSLEAARHHENSSRGAVVGVGDESRALPDVMQGMVLSSMQHFSEAFHQVLGEKHKRGHLAEAEGHRDTCDEDSVAGESDRIDDGTVNGRGCSPGESASGGLSKKLLLGSPSSLSPFSKRIKLEKEFDLPPAAMPNTENVYSQWLAGYAASRQLKDPFLSFGDSRQSPFASSSEHSSENGSLRFSTPPGELDGGISGRSGTGSGGSTPHISGPGPGRPSSKEGRRSDTCEYCGKVFKNCSNLTVHRRSHTGERPYKCELCNYACAQSSKLTRHMKTHGQVGKDVYKCEICKMPFSVYSTLEKHMKKWHSDRVLNNDIKTE,835,NP_075044.2.csv,refseq-BCL11A-NM_022893.3_clinical_seed_0_final,refseq-BCL11A-NM_022893.3.a2m,Invitae,refseq-BCL11A-NM_022893.3.npy,1,835,835
+NP_075066.1,MASGLVRLLQQGHRCLLAPVAPKLVPPVRGVKKGFRAAFRFQKELERQRLLRCPPPPVRRSEKPNWDYHAEIQAFGHRLQENFSLDLLKTAFVNSCYIKSEEAKRQQLGIEKEAVLLNLKSNQELSEQGTSFSQTCLTQFLEDEYPDMPTEGIKNLVDFLTGEEVVCHVARNLAVEQLTLSEEFPVPPAVLQQTFFAVIGALLQSSGPERTALFIRDFLITQMTGKELFEMWKIINPMGLLVEELKKRNVSAPESRLTRQSGGTTALPLYFVGLYCDKKLIAEGPGETVLVAEEEAARVALRKLYGFTENRRPWNYSKPKETLRAEKSITAS,332,NP_075066.1.csv,refseq-MRPL44-NM_022915.5_clinical_seed_0_final,refseq-MRPL44-NM_022915.5.a2m,Invitae,refseq-MRPL44-NM_022915.5.npy,1,332,332
+NP_075462.3,MEIVYVYVKKRSEFGKQCNFSDRQAELNIDIMPNPELAEQFVERNPVDTGIQCSISMSEHEANSERFEMETRGVNHVEGGWPKDVNPLELEQTIRFRKKVEKDENYVNAIMQLGSIMEHCIKQNNAIDIYEEYFNDEEAMEVMEEDPSAKTINVFRDPQEIKRAATHLSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDSHVLLGGCYNGQIACWDTRKGSLVAELSTIESSHRDPVYGTIWLQSKTGTECFSASTDGQVMWWDIRKMSEPTEVVILDITKKEQLENALGAISLEFESTLPTKFMVGTEQGIVISCNRKAKTSAEKIVCTFPGHHGPIYALQRNPFYPKNFLTVGDWTARIWSEDSRESSIMWTKYHMAYLTDAAWSPVRPTVFFTTRMDGTLDIWDFMFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLEVSPGLSTLQRNEKNVASSMFERETRREKILEARHREMRLKEKGKAEGRDEEQTDEELAVDLEALVSKAEEEFFDIIFAELKKKEADAIKLTPVPQQPSPEEDQVVEEGEEAAGEEGDEEVEEDLA,605,NP_075462.3.csv,refseq-DNAI2-NM_023036.4_clinical_seed_0_final,refseq-DNAI2-NM_023036.4.a2m,Invitae,refseq-DNAI2-NM_023036.4.npy,1,605,605
+NP_075555.1,MMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGKGGGGGGGTAPEKPDPAQKPPYSYVALIAMAIRESAEKRLTLSGIYQYIIAKFPFYEKNKKGWQNSIRHNLSLNECFIKVPREGGGERKGNYWTLDPACEDMFEKGNYRRRRRMKRPFRPPPAHFQPGKGLFGAGGAAGGCGVAGAGADGYGYLAPPKYLQSGFLNNSWPLPQPPSPMPYASCQMAAAAAAAAAAAAAAGPGSPGAAAVVKGLAGPAASYGPYTRVQSMALPPGVVNSYNGLGGPPAAPPPPPHPHPHPHAHHLHAAAAPPPAPPHHGAAAPPPGQLSPASPATAAPPAPAPTSAPGLQFACARQPELAMMHCSYWDHDSKTGALHSRLDL,376,NP_075555.1.csv,refseq-FOXL2-NM_023067.3_clinical_seed_0_final,refseq-FOXL2-NM_023067.3.a2m,Invitae,refseq-FOXL2-NM_023067.3.npy,1,376,376
+NP_075598.2,MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFLVHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACVTSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNRMPVAPYWTSPEKMEKKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNGKEFKPDHRIGGYKVRYATWSIIMDSVVPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFMCKVYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFEDAGEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVGSVIVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLSSSGTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRVTKVAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDLAARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSDVWSFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSHEPLPEEPCLPRHPAQLANGGLKRR,822,NP_075598.2.csv,refseq-FGFR1-NM_023110.2_clinical_seed_0_final,refseq-FGFR1-NM_023110.2.a2m,Invitae,refseq-FGFR1-NM_023110.2.npy,1,822,822
+NP_076872.1,MDWKTLQALLSGVNKYSTAFGRIWLSVVFVFRVLVYVVAAERVWGDEQKDFDCNTKQPGCTNVCYDNYFPISNIRLWALQLIFVTCPSLLVILHVAYREERERRHRQKHGDQCAKLYDNAGKKHGGLWWTYLFSLIFKLIIEFLFLYLLHTLWHGFNMPRLVQCANVAPCPNIVDCYIARPTEKKIFTYFMVGASAVCIVLTICELCYLICHRVLRGLHKDKPRGGCSPSSSASRASTCRCHHKLVEAGEVDPDPGNNKLQASAPNLTPI,270,NP_076872.1.csv,refseq-GJB3-NM_024009.2_clinical_seed_0_final,refseq-GJB3-NM_024009.2.a2m,Invitae,refseq-GJB3-NM_024009.2.npy,1,270,270
+NP_076927.1,MGENDPPAVEAPFSFRSLFGLDDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILALAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAASWKTMCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHHSVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVITPLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMKLAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGAGDASPVLNHAAVPLISNKICNHRDVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT,454,NP_076927.1.csv,refseq-TMPRSS3-NM_024022.2_clinical_seed_0_final,refseq-TMPRSS3-NM_024022.2.a2m,Invitae,refseq-TMPRSS3-NM_024022.2_theta_0.2.npy,1,454,454
+NP_076932.1,MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM,271,NP_076932.1.csv,refseq-COLEC11-NM_024027.4_clinical_seed_0_final,refseq-COLEC11-NM_024027.4.a2m,Invitae,refseq-COLEC11-NM_024027.4.npy,1,271,271
+NP_076968.2,MAPDSDPFPEGPLLKLLPLDARDRGTQRCRLGPAALHALGARLGSAVKISLPDGGSCLCTAWPRRDGADGFVQLDPLCASPGAAVGASRSRRSLSLNRLLLVPCPPLRRVAVWPVLRERAGAPGARNTAAVLEAAQELLRNRPISLGHVVVAPPGAPGLVAALHIVGGTPSPDPAGLVTPRTRVSLGGEPPSEAQPQPEVPLGGLSEAADSLRELLRLPLRYPRALTALGLAVPRGVLLAGPPGVGKTQLVRAVAREAGAELLAVSAPALQGSRPGETEENVRRVFQRARELASRGPSLLFLDEMDALCPQRGSRAPESRVVAQVLTLLDGASGDREVVVVGATNRPDALDPALRRPGRFDREVVIGTPTLKQRKEILQVITSKMPISSHVDLGLLAEMTVGYVGADLTALCREAAMHALLHSEKNQDNPVIDEIDFLEAFKNIQPSSFRSVIGLMDIKPVDWEEIGGLEDVKLKLKQSIEWPLKFPWEFVRMGLTQPKGVLLYGPPGCAKTTLVRALATSCHCSFVSVSGADLFSPFVGDSEKVLSQIFRQARASTPAILFLDEIDSILGARSASKTGCDVQERVLSVLLNELDGVGLKTIERRGSKSSQQEFQEVFNRSVMIIAATNRPDVLDTALLRPGRLDKIIYIPPPDHKGRLSILKVCTKTMPIGPDVSLENLAAETCFFSGADLRNLCTEAALLALQENGLDATTVKQEHFLKSLKTVKPSLSCKDLALYENLFKKEGFSNVEGI,753,NP_076968.2.csv,refseq-SPATA5L1-NM_024063.2_clinical_seed_0_final,refseq-SPATA5L1-NM_024063.2.a2m,Invitae,refseq-SPATA5L1-NM_024063.2.npy,1,753,753
+NP_076984.2,MAALTIATGTGNWFSALALGVTLLKCLLIPTYHSTDFEVHRNWLAITHSLPISQWYYEATSEWTLDYPPFFAWFEYILSHVAKYFDQEMLNVHNLNYSSSRTLLFQRFSVIFMDVLFVYAVRECCKCIDGKKVGKELTEKPKFILSVLLLWNFGLLIVDHIHFQYNGFLFGLMLLSIARLFQKRHMEGAFLFAVLLHFKHIYLYVAPAYGVYLLRSYCFTANKPDGSIRWKSFSFVRVISLGLVVFLVSALSLGPFLALNQLPQVFSRLFPFKRGLCHAYWAPNFWALYNALDKVLSVIGLKLKFLDPNNIPKASMTSGLVQQFQHTVLPSVTPLATLICTLIAILPSIFCLWFKPQGPRGFLRCLTLCALSSFMFGWHVHEKAILLAILPMSLLSVGKAGDASIFLILTTTGHYSLFPLLFTAPELPIKILLMLLFTIYSISSLKTLFRKEKPLFNWMETFYLLGLGPLEVCCEFVFPFTSWKVKYPFIPLLLTSVYCAVGITYAWFKLYVSVLIDSAIGKTKKQ,526,NP_076984.2.csv,refseq-ALG8-NM_024079.4_clinical_seed_0_final,refseq-ALG8-NM_024079.4.a2m,Invitae,refseq-ALG8-NM_024079.4.npy,1,526,526
+NP_077006.1,MGKKLDLSKLTDEEAQHVLEVVQRDFDLRRKEEERLEALKGKIKKESSKRELLSDTAHLNETHCARCLQPYQLLVNSKRQCLECGLFTCKSCGRVHPEEQGWICDPCHLARVVKIGSLEWYYEHVKARFKRFGSAKVIRSLHGRLQGGAGPELISEERSGDSDQTDEDGEPGSEAQAQAQPFGSKKKRLLSVHDFDFEGDSDDSTQPQGHSLHLSSVPEARDSPQSLTDESCSEKAAPHKAEGLEEADTGASGCHSHPEEQPTSISPSRHGALAELCPPGGSHRMALGTAAALGSNVIRNEQLPLQYLADVDTSDEESIRAHVMASHHSKRRGRASSESQIFELNKHISAVECLLTYLENTVVPPLAKGLGAGVRTEADVEEEALRRKLEELTSNVSDQETSSEEEEAKDEKAEPNRDKSVGPLPQADPEVGTAAHQTNRQEKSPQDPGDPVQYNRTTDEELSELEDRVAVTASEVQQAESEVSDIESRIAALRAAGLTVKPSGKPRRKSNLPIFLPRVAGKLGKRPEDPNADPSSEAKAMAVPYLLRRKFSNSLKSQGKDDDSFDRKSVYRGSLTQRNPNARKGMASHTFAKPVVAHQS,600,NP_077006.1.csv,refseq-MLPH-NM_024101.6_clinical_seed_0_final,refseq-MLPH-NM_024101.6.a2m,Invitae,refseq-MLPH-NM_024101.6.npy,1,600,600
+NP_077010.1,MAGKGSSGRRPLLLGLLVAVATVHLVICPYTKVEESFNLQATHDLLYHWQDLEQYDHLEFPGVVPRTFLGPVVIAVFSSPAVYVLSLLEMSKFYSQLIVRGVLGLGVIFGLWTLQKEVRRHFGAMVATMFCWVTAMQFHLMFYCTRTLPNVLALPVVLLALAAWLRHEWARFIWLSAFAIIVFRVELCLFLGLLLLLALGNRKVSVVRALRHAVPAGILCLGLTVAVDSYFWRQLTWPEGKVLWYNTVLNKSSNWGTSPLLWYFYSALPRGLGCSLLFIPLGLVDRRTHAPTVLALGFMALYSLLPHKELRFIIYAFPMLNITAARGCSYLLNNYKKSWLYKAGSLLVIGHLVVNAAYSATALYVSHFNYPGGVAMQRLHQLVPPQTDVLLHIDVAAAQTGVSRFLQVNSAWRYDKREDVQPGTGMLAYTHILMEAAPGLLALYRDTHRVLASVVGTTGVSLNLTQLPPFNVHLQTKLVLLERLPRPS,488,NP_077010.1.csv,refseq-ALG12-NM_024105.3_clinical_seed_0_final,refseq-ALG12-NM_024105.3.a2m,Invitae,refseq-ALG12-NM_024105.3.npy,1,488,488
+NP_077015.2,MGELCRRDSALTALDEETLWEMMESHRHRIVRCICPSRLTPYLRQAKVLCQLDEEEVLHSPRLTNSAMRAGHLLDLLKTRGKNGAIAFLESLKFHNPDVYTLVTGLQPDVDFSNFSGLMETSKLTECLAGAIGSLQEELNQEKGQKEVLLRRCQQLQEHLGLAETRAEGLHQLEADHSRMKREVSAHFHEVLRLKDEMLSLSLHYSNALQEKELAASRCRSLQEELYLLKQELQRANMVSSCELELQEQSLRTASDQESGDEELNRLKEENEKLRSLTFSLAEKDILEQSLDEARGSRQELVERIHSLRERAVAAERQREQYWEEKEQTLLQFQKSKMACQLYREKVNALQAQVCELQKERDQAYSARDSAQREISQSLVEKDSLRRQVFELTDQVCELRTQLRQLQAEPPGVLKQEARTREPCPREKQRLVRMHAICPRDDSDCSLVSSTESQLLSDLSATSSRELVDSFRSSSPAPPSQQSLYKRVAEDFGEEPWSFSSCLEIPEGDPGALPGAKAGDPHLDYELLDTADLPQLESSLQPVSPGRLDVSESGVLMRRRPARRILSQVTMLAFQGDALLEQISVIGGNLTGIFIHRVTPGSAADQMALRPGTQIVMVDYEASEPLFKAVLEDTTLEEAVGLLRRVDGFCCLSVKVNTDGYKRLLQDLEAKVATSGDSFYIRVNLAMEGRAKGELQVHCNEVLHVTDTMFQGCGCWHAHRVNSYTMKDTAAHGTIPNYSRAQQQLIALIQDMTQQCTVTRKPSSGGPQKLVRIVSMDKAKASPLRLSFDRGQLDPSRMEGSSTCFWAESCLTLVPYTLVRPHRPARPRPVLLVPRAVGKILSEKLCLLQGFKKCLAEYLSQEEYEAWSQRGDIIQEGEVSGGRCWVTRHAVESLMEKNTHALLDVQLDSVCTLHRMDIFPIVIHVSVNEKMAKKLKKGLQRLGTSEEQLLEAARQEEGDLDRAPCLYSSLAPDGWSDLDGLLSCVRQAIADEQKKVVWTEQSPR,1004,NP_077015.2.csv,refseq-CARD14-NM_024110.4_clinical_seed_0_final,refseq-CARD14-NM_024110.4.a2m,Invitae,refseq-CARD14-NM_024110.4.npy,1,1004,1004
+NP_077025.2,MLRPAGLWRLCRRPWAARVPAENLGRREVTSGVSPRGSTSPRTLNIFDRDLKRKQKNWAARQPEPTKFDYLKEEVGSRIADRVYDIPRNFPLALDLGCGRGYIAQYLNKETIGKFFQADIAENALKNSSETEIPTVSVLADEEFLPFKENTFDLVVSSLSLHWVNDLPRALEQIHYILKPDGVFIGAMFGGDTLYELRCSLQLAETEREGGFSPHISPFTAVNDLGHLLGRAGFNTLTVDTDEIQVNYPGMFELMEDLQGMGESNCAWNRKALLHRDTMLAAAAVYREMYRNEDGSVPATYQIYYMIGWKYHESQARPAERGSATVSFGELGKINNLMPPGKKSQ,345,NP_077025.2.csv,refseq-NDUFAF5-NM_024120.4_clinical_seed_0_final,refseq-NDUFAF5-NM_024120.4.a2m,Invitae,refseq-NDUFAF5-NM_024120.4.npy,1,345,345
+NP_077027.1,MFKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPEGQSKYVEEARSQLEESISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAPPGFFPRLGVIGFAGLIGLLLARGSKIKKLVYPPGFMGLAASLYYPQQAIVFAQVSGERLYDWGLRGYIVIEDLWKENFQKPGNVKNSPGTK,198,NP_077027.1.csv,refseq-APOO-NM_024122.4_clinical_seed_0_final,refseq-APOO-NM_024122.4.a2m,Invitae,refseq-APOO-NM_024122.4.npy,1,198,198
+NP_077277.1,MRLTRCQAALAAAITLNLLVLFYVSWLQHQPRNSRARGPRRASAAGPRVTVLVREFEAFDNAVPELVDSFLQQDPAQPVVVAADTLPYPPLALPRIPNVRLALLQPALDRPAAASRPETYVATEFVALVPDGARAEAPGLLERMVEALRAGSARLVAAPVATANPARCLALNVSLREWTARYGAAPAAPRCDALDGDAVVLLRARDLFNLSAPLARPVGTSLFLQTALRGWAVQLLDLTFAAARQPPLATAHARWKAEREGRARRAALLRALGIRLVSWEGGRLEWFGCNKETTRCFGTVVGDTPAYLYEERWTPPCCLRALRETARYVVGVLEAAGVRYWLEGGSLLGAARHGDIIPWDYDVDLGIYLEDVGNCEQLRGAEAGSVVDERGFVWEKAVEGDFFRVQYSESNHLHVDLWPFYPRNGVMTKDTWLDHRQDVEFPEHFLQPLVPLPFAGFVAQAPNNYRRFLELKFGPGVIENPQYPNPALLSLTGSG,495,NP_077277.1.csv,refseq-FKRP-NM_024301.4_clinical_seed_0_final,refseq-FKRP-NM_024301.4.a2m,Invitae,refseq-FKRP-NM_024301.4.npy,1,495,495
+NP_077282.3,MAPAPPPAASFSPSEVQRRLAAGACWVRRGARLYDLSSFVRHHPGGEQLLRARAGQDISADLDGPPHRHSANARRWLEQYYVGELRGEQQGSMENEPVALEETQKTDPAMEPRFKVVDWDKDLVDWRKPLLWQVGHLGEKYDEWVHQPVTRPIRLFHSDLIEGLSKTVWYSVPIIWVPLVLYLSWSYYRTFAQGNVRLFTSFTTEYTVAVPKSMFPGLFMLGTFLWSLIEYLIHRFLFHMKPPSDSYYLIMLHFVMHGQHHKAPFDGSRLVFPPVPASLVIGVFYLCMQLILPEAVGGTVFAGGLLGYVLYDMTHYYLHFGSPHKGSYLYSLKAHHVKHHFAHQKSGFGISTKLWDYCFHTLTPEKPHLKTQ,372,NP_077282.3.csv,refseq-FA2H-NM_024306.4_clinical_seed_0_final,refseq-FA2H-NM_024306.4.a2m,Invitae,refseq-FA2H-NM_024306.4.npy,1,372,372
+NP_077288.2,MLFKLLQRQTYTCLSHRYGLYVCFLGVVVTIVSAFQFGEVVLEWSRDQYHVLFDSYRDNIAGKSFQNRLCLPMPIDVVYTWVNGTDLELLKELQQVREQMEEEQKAMREILGKNTTEPTKKSEKQLECLLTHCIKVPMLVLDPALPANITLKDLPSLYPSFHSASDIFNVAKPKNPSTNVSVVVFDSTKDVEDAHSGLLKGNSRQTVWRGYLTTDKEVPGLVLMQDLAFLSGFPPTFKETNQLKTKLPENLSSKVKLLQLYSEASVALLKLNNPKDFQELNKQTKKNMTIDGKELTISPAYLLWDLSAISQSKQDEDISASRFEDNEELRYSLRSIERHAPWVRNIFIVTNGQIPSWLNLDNPRVTIVTHQDVFRNLSHLPTFSSPAIESHIHRIEGLSQKFIYLNDDVMFGKDVWPDDFYSHSKGQKVYLTWPVPNCAEGCPGSWIKDGYCDKACNNSACDWDGGDCSGNSGGSRYIAGGGGTGSIGVGQPWQFGGGINSVSYCNQGCANSWLADKFCDQACNVLSCGFDAGDCGQDHFHELYKVILLPNQTHYIIPKGECLPYFSFAEVAKRGVEGAYSDNPIIRHASIANKWKTIHLIMHSGMNATTIHFNLTFQNTNDEEFKMQITVEVDTREGPKLNSTAQKGYENLVSPITLLPEAEILFEDIPKEKRFPKFKRHDVNSTRRAQEEVKIPLVNISLLPKDAQLSLNTLDLQLEHGDITLKGYNLSKSALLRSFLMNSQHAKIKNQAIITDETNDSLVAPQEKQVHKSILPNSLGVSERLQRLTFPAVSVKVNGHDQGQNPPLDLETTARFRVETHTQKTIGGNVTKEKPPSLIVPLESQMTKEKKITGKEKENSRMEENAENHIGVTEVLLGRKLQHYTDSYLGFLPWEKKKYFQDLLDEEESLKTQLAYFTDSKNTGRQLKDTFADSLRYVNKILNSKFGFTSRKVPAHMPHMIDRIVMQELQDMFPEEFDKTSFHKVRHSEDMQFAFSYFYYLMSAVQPLNISQVFDEVDTDQSGVLSDREIRTLATRIHELPLSLQDLTGLEHMLINCSKMLPADITQLNNIPPTQESYYDPNLPPVTKSLVTNCKPVTDKIHKAYKDKNKYRFEIMGEEEIAFKMIRTNVSHVVGQLDDIRKNPRKFVCLNDNIDHNHKDAQTVKAVLRDFYESMFPIPSQFELPREYRNRFLHMHELQEWRAYRDKLKFWTHCVLATLIMFTIFSFFAEQLIALKRKIFPRRRIHKEASPNRIRV,1256,NP_077288.2.csv,refseq-GNPTAB-NM_024312.4_clinical_seed_0_final,refseq-GNPTAB-NM_024312.4.a2m,Invitae,refseq-GNPTAB-NM_024312.4_theta_0.2.npy,1,1256,1256
+NP_077310.1,MAANYSSTSTRREHVKVKTSSQPGFLERLSETSGGMFVGLMAFLLSFYLIFTNEGRALKTATSLAEGLSLVVSPDSIHSVAPENEGRLVHIIGALRTSKLLSDPNYGVHLPAVKLRRHVEMYQWVETEESREYTEDGQVKKETRYSYNTEWRSEIINSKNFDREIGHKNPSAMAVESFMATAPFVQIGRFFLSSGLIDKVDNFKSLSLSKLEDPHVDIIRRGDFFYHSENPKYPEVGDLRVSFSYAGLSGDDPDLGPAHVVTVIARQRGDQLVPFSTKSGDTLLLLHHGDFSAEEVFHRELRSNSMKTWGLRAAGWMAMFMGLNLMTRILYTLVDWFPVFRDLVNIGLKAFAFCVATSLTLLTVAAGWLFYRPLWALLIAGLALVPILVARTRVPAKKLE,400,NP_077310.1.csv,refseq-TMEM43-NM_024334.2_clinical_seed_0_final,refseq-TMEM43-NM_024334.2.a2m,Invitae,refseq-TMEM43-NM_024334.2.npy,1,400,400
+NP_077315.2,MERAVPLAVPLGQTEVFQALQRLHMTIFSQSVSPCGKFLAAGNNYGQIAIFSLSSALSSEAKEESKKPVVTFQAHDGPVYSMVSTDRHLLSAGDGEVKAWLWAEMLKKGCKELWRRQPPYRTSLEVPEINALLLVPKENSLILAGGDCQLHTMDLETGTFTRVLRGHTDYIHCLALRERSPEVLSGGEDGAVRLWDLRTAKEVQTIEVYKHEECSRPHNGRWIGCLATDSDWMVCGGGPALTLWHLRSSTPTTIFPIRAPQKHVTFYQDLILSAGQGRCVNQWQLSGELKAQVPGSSPGLLSLSLNQQPAAPECKVLTAAGNSCRVDVFTNLGYRAFSLSF,341,NP_077315.2.csv,refseq-THOC6-NM_024339.3_clinical_seed_0_final,refseq-THOC6-NM_024339.3.a2m,Invitae,refseq-THOC6-NM_024339.3.npy,1,341,341
+NP_077718.3,MAVLSAPGLRGFRILGLRSSVGPAVQARGVHQSVATDGPSSTQPALPKARAVAPKPSSRGEYVVAKLDDLVNWARRSSLWPMTFGLACCAVEMMHMAAPRYDMDRFGVVFRASPRQSDVMIVAGTLTNKMAPALRKVYDQMPEPRYVVSMGSCANGGGYYHYSYSVVRGCDRIVPVDIYIPGCPPTAEALLYGILQLQRKIKRERRLQIWYRR,213,NP_077718.3.csv,refseq-NDUFS7-NM_024407.4_clinical_seed_0_final,refseq-NDUFS7-NM_024407.4.a2m,Invitae,refseq-NDUFS7-NM_024407.4.npy,1,213,213
+NP_077740.1,MEAARPSGSWNGALCRLLLLTLAILIFASDACKNVTLHVPSKLDAEKLVGRVNLKECFTAANLIHSSDPDFQILEDGSVYTTNTILLSSEKRSFTILLSNTENQEKKKIFVFLEHQTKVLKKRHTKEKVLRRAKRRWAPIPCSMLENSLGPFPLFLQQVQSDTAQNYTIYYSIRGPGVDQEPRNLFYVERDTGNLYCTRPVDREQYESFEIIAFATTPDGYTPELPLPLIIKIEDENDNYPIFTEETYTFTIFENCRVGTTVGQVCATDKDEPDTMHTRLKYSIIGQVPPSPTLFSMHPTTGVITTTSSQLDRELIDKYQLKIKVQDMDGQYFGLQTTSTCIINIDDVNDHLPTFTRTSYVTSVEENTVDVEILRVTVEDKDLVNTANWRANYTILKGNENGNFKIVTDAKTNEGVLCVVKPLNYEEKQQMILQIGVVNEAPFSREASPRSAMSTATVTVNVEDQDEGPECNPPIQTVRMKENAEVGTTSNGYKAYDPETRSSSGIRYKKLTDPTGWVTIDENTGSIKVFRSLDREAETIKNGIYNITVLASDQGGRTCTGTLGIILQDVNDNSPFIPKKTVIICKPTMSSAEIVAVDPDEPIHGPPFDFSLESSTSEVQRMWRLKAINDTAARLSYQNDPPFGSYVVPITVRDRLGMSSVTSLDVTLCDCITENDCTHRVDPRIGGGGVQLGKWAILAILLGIALLFCILFTLVCGASGTSKQPKVIPDDLAQQNLIVSNTEAPGDDKVYSANGFTTQTVGASAQGVCGTVGSGIKNGGQETIEMVKGGHQTSESCRGAGHHHTLDSCRGGHTEVDNCRYTYSEWHSFTQPRLGEKVYLCNQDENHKHAQDYVLTYNYEGRGSVAGSVGCCSERQEEDGLEFLDNLEPKFRTLAEACMKR,901,NP_077740.1.csv,refseq-DSC2-NM_024422.4_clinical_seed_0_final,refseq-DSC2-NM_024422.4.a2m,Invitae,refseq-DSC2-NM_024422.4.npy,1,901,901
+NP_077744.4,MDFLLLQDPASTCVPEPASQHTLRSGPGCLQQPEQQGVRDPGGIWAKLGAAEASAERLQGRRSRGASGSEPQQMGSDVRDLNALLPAVPSLGGGGGCALPVSGAAQWAPVLDFAPPGASAYGSLGGPAPPPAPPPPPPPPPHSFIKQEPSWGGAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPFGPPPPSQASSGQARMFPNAPYLPSCLESQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFPNHSFKHEDPMGQQGSLGEQQYSVPPPVYGCHTPTDSCTGSQALLLRTPYSSDNLYQMTSQLECMTWNQMNLGATLKGVAAGSSSSVKWTEGQSNHSTGYESDNHTTPILCGAQYRIHTHGVFRGIQDVRRVPGVAPTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCDFKDCERRFSRSDQLKRHQRRHTGVKPFQCKTCQRKFSRSDHLKTHTRTHTGKTSEKPFSCRWPSCQKKFARSDELVRHHNMHQRNMTKLQLAL,522,NP_077744.4.csv,NP_077744.4_colabfold_clinical_seed_0_final,NP_077744.4_colabfold.a2m,colabfold,NP_077744.4_colabfold_theta_0.2.npy,1,522,522
+NP_078772.1,MSAAQVSSSRRQSCYLCDLPRMPWAMIWDFSEPVCRGCVNYEGADRIEFVIETARQLKRAHGCFQDGRSPGPPPPVGVKTVALSAKEAAAAAAAAAAAAAAAQQQQQQQQQQQQQQQQQQQQQQQQQLNHVDGSSKPAVLAAPSGLERYGLSAAAAAAAAAAAAVEQRSRFEYPPPPVSLGSSSHTARLPNGLGGPNGFPKPTPEEGPPELNRQSPNSSSAAASVASRRGTHGGLVTGLPNPGGGGGPQLTVPPNLLPQTLLNGPASAAVLPPPPPHALGSRGPPTPAPPGAPGGPACLGGTPGVSATSSSASSSTSSSVAEVGVGAGGKRPGSVSSTDQERELKEKQRNAEALAELSESLRNRAEEWASKPKMVRDTLLTLAGCTPYEVRFKKDHSLLGRVFAFDAVSKPGMDYELKLFIEYPTGSGNVYSSASGVAKQMYQDCMKDFGRGLSSGFKYLEYEKKHGSGDWRLLGDLLPEAVRFFKEGVPGADMLPQPYLDASCPMLPTALVSLSRAPSAPPGTGALPPAAPSGRGAAASLRKRKASPEPPDSAEGALKLGEEQQRQQWMANQSEALKLTMSAGGFAAPGHAAGGPPPPPPPLGPHSNRTTPPESAPQNGPSPMAALMSVADTLGTAHSPKDGSSVHSTTASARRNSSSPVSPASVPGQRRLASRNGDLNLQVAPPPPSAHPGMDQVHPQNIPDSPMANSGPLCCTICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVYCPSGEKCPLVGSNVPWAFMQGEIATILAGDVKVKKERDP,796,NP_078772.1.csv,refseq-IRF2BPL-NM_024496.4_clinical_seed_0_final,refseq-IRF2BPL-NM_024496.4.a2m,Invitae,refseq-IRF2BPL-NM_024496.4_theta_0.2.npy,1,796,796
+NP_078789.2,MASTNAESQLQRIIRDLQDAVTELSKEFQEAGEPITDDSTSLHKFSYKLEYLLQFDQKEKATLLGNKKDYWDYFCACLAKVKGANDGIRFVKSISELRTSLGKGRAFIRYSLVHQRLADTLQQCFMNTKVTSDWYYARSPFLQPKLSSDIVGQLYELTEVQFDLASRGFDLDAAWPTFARRTLTTGSSAYLWKPPSRSSSMSSLVSSYLQTQEMVSNFDLNSPLNNEALEGFDEMRLELDQLEVREKQLRERMQQLDRENQELRAAVSQQGEQLQTERERGRTAAEDNVRLTCLVAELQKQWEVTQATQNTVKELQTCLQGLELGAAEKEEDYHTALRRLESMLQPLAQELEATRDSLDKKNQHLASFPGWLAMAQQKADTASDTKGRQEPIPSDAAQEMQELGEKLQALERERTKVEEVNRQQSAQLEQLVKELQLKEDARASLERLVKEMAPLQEELSGKGQEADQLWRRLQELLAHTSSWEEELAELRREKKQQQEEKELLEQEVRSLTRQLQFLETQLAQVSQHVSDLEEQKKQLIQDKDHLSQQVGMLERLAGPPGPELPVAGEKNEALVPVNSSLQEAWGKPEEEQRGLQEAQLDDTKVQEGSQEEELRQANRELEKELQNVVGRNQLLEGKLQALQADYQALQQRESAIQGSLASLEAEQASIRHLGDQMEASLLAVRKAKEAMKAQMAEKEAILQSKEGECQQLREEVEQCQQLAEARHRELRALESQCQQQTQLIEVLTAEKGQQGVGPPTDNEARELAAQLALSQAQLEVHQGEVQRLQAQVVDLQAKMRAALDDQDKVQSQLSMAEAVLREHKTLVQQLKEQNEALNRAHVQELLQCSEREGALQEERADEAQQREEELRALQEELSQAKCSSEEAQLEHAELQEQLHRANTDTAELGIQVCALTVEKERVEEALACAVQELQDAKEAASREREGLERQVAGLQQEKESLQEKLKAAKAAAGSLPGLQAQLAQAEQRAQSLQEAAHQELNTLKFQLSAEIMDYQSRLKNAGEECKSLRGQLEEQGRQLQAAEEAVEKLKATQADMGEKLSCTSNHLAECQAAMLRKDKEGAALREDLERTQKELEKATTKIQEYYNKLCQEVTNRERNDQKMLADLDDLNRTKKYLEERLIELLRDKDALWQKSDALEFQQKLSAEERWLGDTEANHCLDCKREFSWMVRRHHCRICGRIFCYYCCNNYVLSKHGGKKERCCRACFQKLSEGPGSPDSSGSGTSQGEPSPALSPASPGPQATGGQGANTDYRPPDDAVFDIITDEELCQIQESGSSLPETPTETDSLDPNAAEQDTTSTSLTPEDTEDMPVGQDSEICLLKSGELMIKVPLTVDEIASFGEGSRELFVRSSTYSLIPITVAEAGLTISWVFSSDPKSISFSVVFQEAEDTPLDQCKVLIPTTRCNSHKENIQGQLKVRTPGIYMLIFDNTFSRFVSKKVFYHLTVDRPVIYDGSDFL,1478,NP_078789.2.csv,refseq-FYCO1-NM_024513.3_clinical_seed_0_final,refseq-FYCO1-NM_024513.3.a2m,Invitae,refseq-FYCO1-NM_024513.3.npy,1,1478,1478
+NP_078790.2,MWKLWRAEEGAAALGGALFLLLFALGVRQLLKQRRPMGFPPGPPGLPFIGNIYSLAASSELPHVYMRKQSQVYGEIFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQLITNAVSNITNLIIFGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYDFLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETTTNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKKEALVPFSLGRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYLICAERR,501,NP_078790.2.csv,refseq-CYP2R1-NM_024514.4_clinical_seed_0_final,refseq-CYP2R1-NM_024514.4.a2m,Invitae,refseq-CYP2R1-NM_024514.4.npy,1,501,501
+NP_078792.1,MSLARGHGDTAASTAAPLSEEGEVTSGLQALAVEDTGGPSASAGKAEDEGEGGREETEREGSGGEEAQGEVPSAGGEEPAEEDSEDWCVPCSDEEVELPADGQPWMPPPSEIQRLYELLAAHGTLELQAEILPRRPPTPEAQSEEERSDEEPEAKEEEEEKPHMPTEFDFDDEPVTPKDSLIDRRRTPGSSARSQKREARLDKVLSDMKRHKKLEEQILRTGRDLFSLDSEDPSPASPPLRSSGSSLFPRQRKY,254,NP_078792.1.csv,refseq-PAGR1-NM_024516.3_clinical_seed_0_final,refseq-PAGR1-NM_024516.3.a2m,Invitae,refseq-PAGR1-NM_024516.3.npy,1,254,254
+NP_078800.3,MDREERKTINQGQEDEMEIYGYNLSRWKLAIVSLGVICSGGFLLLLLYWMPEWRVKATCVRAAIKDCEVVLLRTTDEFKMWFCAKIRVLSLETYPVSSPKSMSNKLSNGHAVCLIENPTEENRHRISKYSQTESQQIRYFTHHSVKYFWNDTIHNFDFLKGLDEGVSCTSIYEKHSAGLTKGMHAYRKLLYGVNEIAVKVPSVFKLLIKEVLNPFYIFQLFSVILWSTDEYYYYALAIVVMSIVSIVSSLYSIRKQYVMLHDMVATHSTVRVSVCRVNEEIEEIFSTDLVPGDVMVIPLNGTIMPCDAVLINGTCIVNESMLTGESVPVTKTNLPNPSVDVKGIGDELYNPETHKRHTLFCGTTVIQTRFYTGELVKAIVVRTGFSTSKGQLVRSILYPKPTDFKLYRDAYLFLLCLVAVAGIGFIYTIINSILNEVQVGVIIIESLDIITITVPPALPAAMTAGIVYAQRRLKKIGIFCISPQRINICGQLNLVCFDKTGTLTEDGLDLWGIQRVENARFLSPEENVCNEMLVKSQFVACMATCHSLTKIEGVLSGDPLDLKMFEAIGWILEEATEEETALHNRIMPTVVRPPKQLLPESTPAGNQEMELFELPATYEIGIVRQFPFSSALQRMSVVARVLGDRKMDAYMKGAPEAIAGLCKPETVPVDFQNVLEDFTKQGFRVIALAHRKLESKLTWHKVQNISRDAIENNMDFMGLIIMQNKLKQETPAVLEDLHKANIRTVMVTGDSMLTAVSVARDCGMILPQDKVIIAEALPPKDGKVAKINWHYADSLTQCSHPSAIDPEAIPVKLVHDSLEDLQMTRYHFAMNGKSFSVILEHFQDLVPKLMLHGTVFARMAPDQKTQLIEALQNVDYFVGMCGDGANDCGALKRAHGGISLSELEASVASPFTSKTPSISCVPNLIREGRAALITSFCVFKFMALYSIIQYFSVTLLYSILSNLGDFQFLFIDLAIILVVVFTMSLNPAWKELVAQRPPSGLISGALLFSVLSQIIICIGFQSLGFFWVKQQPWYEVWHPKSDACNTTGSGFWNSSHVDNETELDEHNIQNYENTTVFFISSFQYLIVAIAFSKGKPFRQPCYKNYFFVFSVIFLYIFILFIMLYPVASVDQVLQIVCVPYQWRVTMLIIVLVNAFVSITVEESVDRWGKCCLPWALGCRKKTPKAKYMYLAQELLVDPEWPPKPQTTTEAKALVKENGSCQIITIT,1226,NP_078800.3.csv,refseq-ATP13A3-NM_024524.3_clinical_seed_0_final,refseq-ATP13A3-NM_024524.3.a2m,Invitae,refseq-ATP13A3-NM_024524.3.npy,1,1226,1226
+NP_078804.2,MAPVSGSRSPDREASGSGGRRRSSSKSPKPSKSARSPRGRRSRSHSCSRSGDRNGLTHQLGGLSQGSRNQSYRSRSRSRSRERPSAPRGIPFASASSSVYYGSYSRPYGSDKPWPSLLDKEREESLRQKRLSERERIGELGAPEVWGLSPKNPEPDSDEHTPVEDEEPKKSTTSASTSEEEKKKKSSRSKERSKKRRKKKSSKRKHKKYSEDSDSDSDSETDSSDEDNKRRAKKAKKKEKKKKHRSKKYKKKRSKKSRKESSDSSSKESQEEFLENPWKDRTKAEEPSDLIGPEAPKTLTSQDDKPLNYGHALLPGEGAAMAEYVKAGKRIPRRGEIGLTSEEIASFECSGYVMSGSRHRRMEAVRLRKENQIYSADEKRALASFNQEERRKRENKILASFREMVYRKTKGKDDK,415,NP_078804.2.csv,refseq-NKAP-NM_024528.3_clinical_seed_0_final,refseq-NKAP-NM_024528.3.a2m,Invitae,refseq-NKAP-NM_024528.3.npy,1,415,415
+NP_078805.3,MADVLSVLRQYNIQKKEIVVKGDEVIFGEFSWPKNVKTNYVVWGTGKEGQPREYYTLDSILFLLNNVHLSHPVYVRRAATENIPVVRRPDRKDLLGYLNGEASTSASIDRSAPLEIGLQRSTQVKRAADEVLAEAKKPRIEDEECVRLDKERLAARLEGHKEGIVQTEQIRSLSEAMSVEKIAAIKAKIMAKKRSTIKTDLDDDITALKQRSFVDAEVDVTRDIVSRERVWRTRTTILQSTGKNFSKNIFAILQSVKAREEGRAPEQRPAPNAAPVDPTLRTKQPIPAAYNRYDQERFKGKEETEGFKIDTMGTYHGMTLKSVTEGASARKTQTPAAQPVPRPVSQARPPPNQKKGSRTPIIIIPAATTSLITMLNAKDLLQDLKFVPSDEKKKQGCQRENETLIQRRKDQMQPGGTAISVTVPYRVVDQPLKLMPQDWDRVVAVFVQGPAWQFKGWPWLLPDGSPVDIFAKIKAFHLKYDEVRLDPNVQKWDVTVLELSYHKRHLDRPVFLRFWETLDRYMVKHKSHLRF,531,NP_078805.3.csv,refseq-CDC73-NM_024529.4_clinical_seed_0_final,refseq-CDC73-NM_024529.4.a2m,Invitae,refseq-CDC73-NM_024529.4.npy,1,531,531
+NP_078807.1,MAAPTPARPVLTHLLVALFGMGSWAAVNGIWVELPVVVKELPEGWSLPSYVSVLVALGNLGLLVVTLWRRLAPGKDEQVPIRVVQVLGMVGTALLASLWHHVAPVAGQLHSVAFLALAFVLALACCASNVTFLPFLSHLPPRFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPINGTPGPPLDFLERFPASTFFWALTALLVASAAAFQGLLLLLPPPPSVPTGELGSGLQVGAPGAEEEVEESSPLQEPPSQAAGTTPGPDPKAYQLLSARSACLLGLLAATNALTNGVLPAVQSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLGGLSLLGVFCGGYLMALAVLSPCPPLVGTSAGVVLVVLSWVLCLGVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSLLGAVAMFPPTSIYHVFHSRKDCADPCDS,445,NP_078807.1.csv,refseq-SLC52A2-NM_024531.4_clinical_seed_0_final,refseq-SLC52A2-NM_024531.4.a2m,Invitae,refseq-SLC52A2-NM_024531.4.npy,1,445,445
+NP_078824.2,MAVARVDAALPPGEGSVVNWSGQGLQKLGPNLPCEADIHTLILDKNQIIKLENLEKCKRLIQLSVANNRLVRMMGVAKLTLLRVLNLPHNSIGCVEGLKELVHLEWLNLAGNNLKAMEQINSCTALQHLDLSDNNISQIGDLSKLVSLKTLLLHGNIITSLRMAPAYLPRSLAILSLAENEIRDLNEISFLASLTELEQLSIMNNPCVMATPSIPGFDYRPYIVSWCLNLRVLDGYVISQKESLKAEWLYSQGKGRAYRPGQHIQLVQYLATVCPLTSTLGLQTAEDAKLEKILSKQRFHQRQLMNQSQNEELSPLVPVETRASLIPEHSSPVQDCQISQESEPVIQVNSWVGINSNDDQLFAVKNNFPASVHTTRYSRNDLHLEDIQTDEDKLNCSLLSSESTFMPVASGLSPLSPTVELRLQGINLGLEDDGVADESVKGLESQVLDKEEEQPLWAANENSVQMMRSEINTEVNEKAGLLPCPEPTIISAILKDDNHSLTFFPESTEQKQSDIKKPENTQPENKETISQATSEKLPMILTQRSVALGQDKVALQKLNDAATKLQACWRGFYARNYNPQAKDVRYEIRLRRMQEHIVCLTDEIRRLRKERDEERIKKFVQEEAFRFLWNQVRSLQVWQQTVDQRLSSWHTDVPPISSTLVPSKHPLFTQSQESSCDQNADWFIASDVAPQEKSLPEFPDSGFHSSLTEQVHSLQHSLDFEKSSTEGSESSIMGNSIDTVRYGKESDLGDVSEEHGEWNKESSNNEQDNSLLEQYLTSVQQLEDADERTNFDTETRDSKLHIACFPVQLDTLSDGASVDESHGISPPLQGEISQTQENSKLNAEVQGQQPECDSTFQLLHVGVTV,865,NP_078824.2.csv,refseq-CEP97-NM_024548.3_clinical_seed_0_final,refseq-CEP97-NM_024548.3.a2m,Invitae,refseq-CEP97-NM_024548.3.npy,1,865,865
+NP_078846.2,MAAGVDCGDGVGARQHVFLVSEYLKDASKKMKNGLMFVKLVNPCSGEGAIYLFNMCLQQLFEVKVFKEKHHSWFINQSVQSGGLLHFATPVDPLFLLLHYLIKADKEGKFQPLDQVVVDNVFPNCILLLKLPGLEKLLHHVTEEKGNPEIDNKKYYKYSKEKTLKWLEKKVNQTVAALKTNNVNVSSRVQSTAFFSGDQASTDKEEDYIRYAHGLISDYIPKELSDDLSKYLKLPEPSASLPNPPSKKIKLSDEPVEAKEDYTKFNTKDLKTEKKNSKMTAAQKALAKVDKSGMKSIDTFFGVKNKKKIGKV,312,NP_078846.2.csv,refseq-RNASEH2B-NM_024570.3_clinical_seed_0_final,refseq-RNASEH2B-NM_024570.3.a2m,Invitae,refseq-RNASEH2B-NM_024570.3.npy,1,312,312
+NP_078853.2,MGGCFCIPRERSLTRGPGKETPSKDPTVSSECIASSEYKEKCFLPQNINPDLTLSFCVKSRSRRCVNGPLQEAARRRLWALENEDQEVRMLFKDLSARLVSIQSQRAQFLITFKTMEEIWKFSTYLNLGYVSMCLEHLLFDHKYWLNCILVEDTEIQVSVDDKHLETIYLGLLIQEGHFFCRALCSVTPPAEKEGECLTLCKNELISVKMAEAGSELEGVSLVTGQRGLVLVSALEPLPLPFHQWFLKNYPGSCGLSRKRDWTGSYQIGRGRCKALTGYEPGEKDELNFYQGESIEIIGFVIPGLQWFIGKSTSSGQVGFVPTRNIDPDSYSPMSRNSAFLSDEERCSLLALGSDKQTECSSFLHTLARTDITSVYRLSGFESIQNPPNDLSASQPEGFKEVRPGRAWEEHQAVGSRQSSSSEDSSLEEELLSATSDSYRLPEPDDLDDPELLMDLSTGQEEEAENFAPILAFLDHEGYADHFKSLYDFSFSFLTSSFYSFSEEDEFVAYLEASRKWAKKSHMTWAHARLCFLLGRLSIRKVKLSQARVYFEEAIHILNGAFEDLSLVATLYINLAAIYLKQRLRHKGSALLEKAGALLACLPDRESSAKHELDVVAYVLRQGIVVGSSPLEARACFLAIRLLLSLGRHEEVLPFAERLQLLSGHPPASEAVASVLSFLYDKKYLPHLAVASVQQHGIQSAQGMSLPIWQVHLVLQNTTKLLGFPSPGWGEVSALACPMLRQALAACEELADRSTQRALCLILSKVYLEHRSPDGAIHYLSQALVLGQLLGEQESFESSLCLAWAYLLASQAKKALDVLEPLLCSLKETESLTQRGVIYNLLGLALQGEGRVNRAAKSYLRALNRAQEVGDVHNQAVAMANLGHLSLKSWAQHPARNYLLQAVRLYCELQASKETDMELVQVFLWLAQVLVSGHQLTHGLLCYEMALLFGLRHRHLKSQLQATKSLCHFYSSVSPNPEACITYHEHWLALAQQLRDREMEGRLLESLGQLYRNLNTARSLRRSLTCIKESLRIFIDLGETDKAAEAWLGAGRLHYLMQEDELVELCLQAAIQTALKSEEPLLALKLYEEAGDVFFNGTRHRHHAVEYYRAGAVPLARRLKAVRTELRIFNKLTELQISLEGYEKALEFATLAARLSTVTGDQRQELVAFHRLATVYYSLHMYEMAEDCYLKTLSLCPPWLQSPKEALYYAKVYYRLGRLTFCQLKDAHDATEYFLLALAAAVLLGDEELQDTIRSRLDNICQSPLWHSRPSGCSSERARWLSGGGLAL,1288,NP_078853.2.csv,refseq-SH3TC2-NM_024577.3_clinical_seed_0_final,refseq-SH3TC2-NM_024577.3.a2m,Invitae,refseq-SH3TC2-NM_024577.3.npy,1,1288,1288
+NP_078868.1,MAPWAEAEHSALNPLRAVWLTLTAAFLLTLLLQLLPPGLLPGCAIFQDLIRYGKTKCGEPSRPAACRAFDVPKRYFSHFYIISVLWNGFLLWCLTQSLFLGAPFPSWLHGLLRILGAAQFQGGELALSAFLVLVFLWLHSLRRLFECLYVSVFSNVMIHVVQYCFGLVYYVLVGLTVLSQVPMDGRNAYITGKNLLMQARWFHILGMMMFIWSSAHQYKCHVILGNLRKNKAGVVIHCNHRIPFGDWFEYVSSPNYLAELMIYVSMAVTFGFHNLTWWLVVTNVFFNQALSAFLSHQFYKSKFVSYPKHRKAFLPFLF,318,NP_078868.1.csv,refseq-SRD5A3-NM_024592.4_clinical_seed_0_final,refseq-SRD5A3-NM_024592.4.a2m,Invitae,refseq-SRD5A3-NM_024592.4.npy,1,318,318
+NP_078874.2,MSAAPLVGYSSSGSEDESEDGMRTRPGDGSHRRGQSPLPRQRFPVPDSVLNMFPGTEEGPEDDSTKHGGRVRTFPHERGNWATHVYVPYEAKEEFLDLLDVLLPHAQTYVPRLVRMKVFHLSLSQSVVLRHHWILPFVQALKARMTSFHRFFFTANQVKIYTNQEKTRTFIGLEVTSGHAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCLQELQAIVDGFEDAEVLLRVHTEQVRCKSGNKFFSMPLK,265,NP_078874.2.csv,refseq-USB1-NM_024598.3_clinical_seed_0_final,refseq-USB1-NM_024598.3.a2m,Invitae,refseq-USB1-NM_024598.3.npy,1,265,265
+NP_078875.4,MASADKNGGSVSSVSSSRLQSRKPPNLSITIPPPEKETQAPGEQDSMLPEGFQNRRLKKSQPRTWAAHTTACPPSFLPKRKNPAYLKSVSLQEPRSRWQESSEKRPGFRRQASLSQSIRKGAAQWFGVSGDWEGQRQQWQRRSLHHCSMRYGRLKASCQRDLELPSQEAPSFQGTESPKPCKMPKIVDPLARGRAFRHPEEMDRPHAPHPPLTPGVLSLTSFTSVRSGYSHLPRRKRMSVAHMSLQAAAALLKGRSVLDATGQRCRVVKRSFAFPSFLEEDVVDGADTFDSSFFSKEEMSSMPDDVFESPPLSASYFRGIPHSASPVSPDGVQIPLKEYGRAPVPGPRRGKRIASKVKHFAFDRKKRHYGLGVVGNWLNRSYRRSISSTVQRQLESFDSHRPYFTYWLTFVHVIITLLVICTYGIAPVGFAQHVTTQLVLRNKGVYESVKYIQQENFWVGPSSIDLIHLGAKFSPCIRKDGQIEQLVLRERDLERDSGCCVQNDHSGCIQTQRKDCSETLATFVKWQDDTGPPMDKSDLGQKRTSGAVCHQDPRTCEEPASSGAHIWPDDITKWPICTEQARSNHTGFLHMDCEIKGRPCCIGTKGSCEITTREYCEFMHGYFHEEATLCSQVHCLDKVCGLLPFLNPEVPDQFYRLWLSLFLHAGVVHCLVSVVFQMTILRDLEKLAGWHRIAIIFILSGITGNLASAIFLPYRAEVGPAGSQFGLLACLFVELFQSWPLLERPWKAFLNLSAIVLFLFICGLLPWIDNIAHIFGFLSGLLLAFAFLPYITFGTSDKYRKRALILVSLLAFAGLFAALVLWLYIYPINWPWIEHLTCFPFTSRFCEKYELDQVLH,856,NP_078875.4.csv,refseq-RHBDF2-NM_024599.5_clinical_seed_0_final,refseq-RHBDF2-NM_024599.5.a2m,Invitae,refseq-RHBDF2-NM_024599.5.npy,1,856,856
+NP_078925.3,MAAASSSDSDACGAESNEANSKWLDAHYDPMANIHTFSACLALADLHGDGEYKLVVGDLGPGGQQPRLKVLKGPLVMTESPLPALPAAAATFLMEQHEPRTPALALASGPCVYVYKNLRPYFKFSLPQLPPNPLEQDLWNQAKEDRIDPLTLKEMLESIRETAEEPLSIQSLRFLQLELSEMEAFVNQHKSNSIKRQTVITTMTTLKKNLADEDAVSCLVLGTENKELLVLDPEAFTILAKMSLPSVPVFLEVSGQFDVEFRLAAACRNGNIYILRRDSKHPKYCIELSAQPVGLIRVHKVLVVGSTQDSLHGFTHKGKKLWTVQMPAAILTMNLLEQHSRGLQAVMAGLANGEVRIYRDKALLNVIHTPDAVTSLCFGRYGREDNTLIMTTRGGGLIIKILKRTAVFVEGGSEVGPPPAQAMKLNVPRKTRLYVDQTLREREAGTAMHRAFQTDLYLLRLRAARAYLQALESSLSPLSTTAREPLKLHAVVQGLGPTFKLTLHLQNTSTTRPVLGLLVCFLYNEALYSLPRAFFKVPLLVPGLNYPLETFVESLSNKGISDIIKVLVLREGQSAPLLSAHVNMPGSEGLAAA,593,NP_078925.3.csv,refseq-BBS1-NM_024649.4_clinical_seed_0_final,refseq-BBS1-NM_024649.4.a2m,Invitae,refseq-BBS1-NM_024649.4.npy,1,593,593
+NP_078941.2,MSISSDEVNFLVYRYLQESGFSHSAFTFGIESHISQSNINGALVPPAALISIIQKGLQYVEAEVSINEDGTLFDGRPIESLSLIDAVMPDVVQTRQQAYRDKLAQQQAAAAAAAAAAASQQGSAKNGENTANGEENGAHTIANNHTDMMEVDGDVEIPPNKAVVLRGHESEVFICAWNPVSDLLASGSGDSTARIWNLSENSTSGSTQLVLRHCIREGGQDVPSNKDVTSLDWNSEGTLLATGSYDGFARIWTKDGNLASTLGQHKGPIFALKWNKKGNFILSAGVDKTTIIWDAHTGEAKQQFPFHSAPALDVDWQSNNTFASCSTDMCIHVCKLGQDRPIKTFQGHTNEVNAIKWDPTGNLLASCSDDMTLKIWSMKQDNCVHDLQAHNKEIYTIKWSPTGPGTNNPNANLMLASASFDSTVRLWDVDRGICIHTLTKHQEPVYSVAFSPDGRYLASGSFDKCVHIWNTQTGALVHSYRGTGGIFEVCWNAAGDKVGASASDGSVCVLDLRK,514,NP_078941.2.csv,refseq-TBL1XR1-NM_024665.4_clinical_seed_0_final,refseq-TBL1XR1-NM_024665.4.a2m,Invitae,refseq-TBL1XR1-NM_024665.4.npy,1,514,514
+NP_078951.2,MDEPPGKPLSCEEKEKLKEKLAFLKREYSKTLARLQRAQRAEKIKHSIKKTVEEQDCLSQQDLSPQLKHSEPKNKICVYDKLHIKTHLDEETGEKTSITLDVGPESFNPGDGPGGLPIQRTDDTQEHFPHRVSDPSGEQKQKLPSRRKKQQKRTFISQERDCVFGTDSLRLSGKRLKEQEEISSKNPARSPVTEIRTHLLSLKSELPDSPEPVTEINEDSVLIPPTAQPEKGVDTFLRRPNFTRATTVPLQTLSDSGSSQHLEHIPPKGSSELTTHDLKNIRFTSPVSLEAQGKKMTVSTDNLLVNKAISKSGQLPTSSNLEANISCSLNELTYNNLPANENQNLKEQNQTEKSLKSPSDTLDGRNENLQESEILSQPKSLSLEATSPLSAEKHSCTVPEGLLFPAEYYVRTTRSMSNCQRKVAVEAVIQSHLDVKKKGFKNKNKDASKNLNLSNEETDQSEIRMSGTCTGQPSSRTSQKLLSLTKVSSPAGPTEDNDLSRKAVAQAPGRRYTGKRKSACTPASDHCEPLLPTSSLSIVNRSKEEVTSHKYQHEKLFIQVKGKKSRHQKEDSLSWSNSAYLSLDDDAFTAPFHRDGMLSLKQLLSFLSITDFQLPDEDFGPLKLEKVKSCSEKPVEPFESKMFGERHLKEGSCIFPEELSPKRMDTEMEDLEEDLIVLPGKSHPKRPNSQSQHTKTGLSSSILLYTPLNTVAPDDNDRPTTDMCSPAFPILGTTPAFGPQGSYEKASTEVAGRTCCTPQLAHLKDSVCLASDTKQFDSSGSPAKPHTTLQVSGRQGQPTCDCDSVPPGTPPPIESFTFKENQLCRNTCQELHKHSVEQTETAELPASDSINPGNLQLVSELKNPSGSCSVDVSAMFWERAGCKEPCIITACEDVVSLWKALDAWQWEKLYTWHFAEVPVLQIVPVPDVYNLVCVALGNLEIREIRALFCSSDDESEKQVLLKSGNIKAVLGLTKRRLVSSSGTLSDQQVEVMTFAEDGGGKENQFLMPPEETILTFAEVQGMQEALLGTTIMNNIVIWNLKTGQLLKKMHIDDSYQASVCHKAYSEMGLLFIVLSHPCAKESESLRSPVFQLIVINPKTTLSVGVMLYCLPPGQAGRFLEGDVKDHCAAAILTSGTIAIWDLLLGQCTALLPPVSDQHWSFVKWSGTDSHLLAGQKDGNIFVYHYS,1186,NP_078951.2.csv,refseq-PALB2-NM_024675.3_clinical_seed_0_final,refseq-PALB2-NM_024675.3.a2m,Invitae,refseq-PALB2-NM_024675.3.npy,1,1186,1186
+NP_078954.4,MLGVRCLLRSVRFCSSAPFPKHKPSAKLSVRDALGAQNASGERIKIQGWIRSVRSQKEVLFLHVNDGSSLESLQVVADSGLDSRELNFGSSVEVQGQLIKSPSKRQNVELKAEKIKVIGNCDAKDFPIKYKERHPLEYLRQYPHFRCRTNVLGSILRIRSEATAAIHSFFKDSGFVHIHTPIITSNDSEGAGELFQLEPSGKLKVPEENFFNVPAFLTVSGQLHLEVMSGAFTQVFTFGPTFRAENSQSRRHLAEFYMIEAEISFVDSLQDLMQVIEELFKATTMMVLSKCPEDVELCHKFIAPGQKDRLEHMLKNNFLIISYTEAVEILKQASQNFTFTPEWGADLRTEHEKYLVKHCGNIPVFVINYPLTLKPFYMRDNEDGPQHTVAAVDLLVPGVGELFGGGLREERYHFLEERLARSGLTEVYQWYLDLRRFGSVPHGGFGMGFERYLQCILGVDNIKDVIPFPRFPHSCLL,477,NP_078954.4.csv,refseq-NARS2-NM_024678.5_clinical_seed_0_final,refseq-NARS2-NM_024678.5.a2m,Invitae,refseq-NARS2-NM_024678.5.npy,1,477,477
+NP_078961.3,MLSSMAAAGSVKAALQVAEVLEAIVSCCVGPEGRQVLCTKPTGEVLLSRNGGRLLEALHLEHPIARMIVDCVSSHLKKTGDGAKTFIIFLCHLLRGLHAITDREKDPLMCENIQTHGRHWKNCSRWKFISQALLTFQTQILDGIMDQYLSRHFLSIFSSAKERTLCRSSLELLLEAYFCGRVGRNNHKFISQLMCDYFFKCMTCKSGIGVFELVDDHFVELNVGVTGLPVSDSRIIAGLVLQKDFSVYRPADGDMRMVIVTETIQPLFSTSGSEFILNSEAQFQTSQFWIMEKTKAIMKHLHSQNVKLLISSVKQPDLVSYYAGVNGISVVECLSSEEVSLIRRIIGLSPFVPPQAFSQCEIPNTALVKFCKPLILRSKRYVHLGLISTCAFIPHSIVLCGPVHGLIEQHEDALHGALKMLRQLFKDLDLNYMTQTNDQNGTSSLFIYKNSGESYQAPDPGNGSIQRPYQDTVAENKDALEKTQTYLKVHSNLVIPDVELETYIPYSTPTLTPTDTFQTVETLTCLSLERNRLTDYYEPLLKNNSTAYSTRGNRIEISYENLQVTNITRKGSMLPVSCKLPNMGTSQSYLSSSMPAGCVLPVGGNFEILLHYYLLNYAKKCHQSEETMVSMIIANALLGIPKVLYKSKTGKYSFPHTYIRAVHALQTNQPLVSSQTGLESVMGKYQLLTSVLQCLTKILTIDMVITVKRHPQKVHNQDSEDEL,723,NP_078961.3.csv,refseq-BBS10-NM_024685.3_clinical_seed_0_final,refseq-BBS10-NM_024685.3.a2m,Invitae,refseq-BBS10-NM_024685.3.npy,1,723,723
+NP_078974.1,MADKQISLPAKLINGGIAGLIGVTCVFPIDLAKTRLQNQQNGQRVYTSMSDCLIKTVRSEGYFGMYRGAAVNLTLVTPEKAIKLAANDFFRHQLSKDGQKLTLLKEMLAGCGAGTCQVIVTTPMEMLKIQLQDAGRIAAQRKILAAQGQLSAQGGAQPSVEAPAAPRPTATQLTRDLLRSRGIAGLYKGLGATLLRDVPFSVVYFPLFANLNQLGRPASEEKSPFYVSFLAGCVAGSAAAVAVNPCDVVKTRLQSLQRGVNEDTYSGILDCARKILRHEGPSAFLKGAYCRALVIAPLFGIAQVVYFLGIAESLLGLLQDPQA,323,NP_078974.1.csv,refseq-SLC25A22-NM_024698.5_clinical_seed_0_final,refseq-SLC25A22-NM_024698.5.a2m,Invitae,refseq-SLC25A22-NM_024698.5.npy,1,323,323
+NP_079005.3,MAAVTMSVPGRKAPPRPGPVPEAAQPFLFTPRGPSAGGGPGSGTSPQVEWTARRLVWVPSELHGFEAAALRDEGEEEAEVELAESGRRLRLPRDQIQRMNPPKFSKAEDMAELTCLNEASVLHNLRERYYSGLIYTYSGLFCVVINPYKQLPIYTEAIVEMYRGKKRHEVPPHVYAVTEGAYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAHVASSPKGRKEPGVPGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVAGYIVGANIETYLLEKSRAIRQAKDECSFHIFYQLLGGAGEQLKADLLLEPCSHYRFLTNGPSSSPGQERELFQETLESLRVLGFSHEEIISMLRMVSAVLQFGNIALKRERNTDQATMPDNTAAQKLCRLLGLGVTDFSRALLTPRIKVGRDYVQKAQTKEQADFALEALAKATYERLFRWLVLRLNRALDRSPRQGASFLGILDIAGFEIFQLNSFEQLCINYTNEKLQQLFNHTMFVLEQEEYQREGIPWTFLDFGLDLQPCIDLIERPANPPGLLALLDEECWFPKATDKSFVEKVAQEQGGHPKFQRPRHLRDQADFSVLHYAGKVDYKANEWLMKNMDPLNDNVAALLHQSTDRLTAEIWKDVEGIVGLEQVSSLGDGPPGGRPRRGMFRTVGQLYKESLSRLMATLSNTNPSFVRCIVPNHEKRAGKLEPRLVLDQLRCNGVLEGIRICRQGFPNRILFQEFRQRYEILTPNAIPKGFMDGKQACEKMIQALELDPNLYRVGQSKIFFRAGVLAQLEEERDLKVTDIIVSFQAAARGYLARRAFQKRQQQQSALRVMQRNCAAYLKLRHWQWWRLFTKVKPLLQVTRQDEVLQARAQELQKVQELQQQSAREVGELQGRVAQLEEERARLAEQLRAEAELCAEAEETRGRLAARKQELELVVSELEARVGEEEECSRQMQTEKKRLQQHIQELEAHLEAEEGARQKLQLEKVTTEAKMKKFEEDLLLLEDQNSKLSKERKLLEDRLAEFSSQAAEEEEKVKSLNKLRLKYEATIADMEDRLRKEEKGRQELEKLKRRLDGESSELQEQMVEQQQRAEELRAQLGRKEEELQAALARAEDEGGARAQLLKSLREAQAALAEAQEDLESERVARTKAEKQRRDLGEELEALRGELEDTLDSTNAQQELRSKREQEVTELKKTLEEETRIHEAAVQELRQRHGQALGELAEQLEQARRGKGAWEKTRLALEAEVSELRAELSSLQTARQEGEQRRRRLELQLQEVQGRAGDGERARAEAAEKLQRAQAELENVSGALNEAESKTIRLSKELSSTEAQLHDAQELLQEETRAKLALGSRVRAMEAEAAGLREQLEEEAAARERAGRELQTAQAQLSEWRRRQEEEAGALEAGEEARRRAAREAEALTQRLAEKTETVDRLERGRRRLQQELDDATMDLEQQRQLVSTLEKKQRKFDQLLAEEKAAVLRAVEERERAEAEGREREARALSLTRALEEEQEAREELERQNRALRAELEALLSSKDDVGKSVHELERACRVAEQAANDLRAQVTELEDELTAAEDAKLRLEVTVQALKTQHERDLQGRDEAGEERRRQLAKQLRDAEVERDEERKQRTLAVAARKKLEGELEELKAQMASAGQGKEEAVKQLRKMQAQMKELWREVEETRTSREEIFSQNRESEKRLKGLEAEVLRLQEELAASDRARRQAQQDRDEMADEVANGNLSKAAILEEKRQLEGRLGQLEEELEEEQSNSELLNDRYRKLLLQVESLTTELSAERSFSAKAESGRQQLERQIQELRGRLGEEDAGARARHKMTIAALESKLAQAEEQLEQETRERILSGKLVRRAEKRLKEVVLQVEEERRVADQLRDQLEKGNLRVKQLKRQLEEAEEEASRAQAGRRRLQRELEDVTESAESMNREVTTLRNRLRRGPLTFTTRTVRQVFRLEEGVASDEEAEEAQPGSGPSPEPEGSPPAHPQ,1995,NP_079005.3.csv,refseq-MYH14-NM_024729.3_clinical_seed_0_final,refseq-MYH14-NM_024729.3.a2m,Invitae,refseq-MYH14-NM_024729.3.npy,1,1995,1995
+NP_079011.3,MAVCARLCGVGPSRGCRRRQQRRGPAETAAADSEPDTDPEEERIEASAGVGGGLCAGPSPPPPRCSLLELPPELLVEIFASLPGTDLPSLAQVCTKFRRILHTDTIWRRRCREEYGVCENLRKLEITGVSCRDVYAKLLHRYRHILGLWQPDIGPYGGLLNVVVDGLFIIGWMYLPPHDPHVDDPMRFKPLFRIHLMERKAATVECMYGHKGPHHGHIQIVKKDEFSTKCNQTDHHRMSGGRQEEFRTWLREEWGRTLEDIFHEHMQELILMKFIYTSQYDNCLTYRRIYLPPSRPDDLIKPGLFKGTYGSHGLEIVMLSFHGRRARGTKITGDPNIPAGQQTVEIDLRHRIQLPDLENQRNFNELSRIVLEVRERVRQEQQEGGHEAGEGRGRQGPRESQPSPAQPRAEAPSKGPDGTPGEDGGEPGDAVAAAEQPAQCGQGQPFVLPVGVSSRNEDYPRTCRMCFYGTGLIAGHGFTSPERTPGVFILFDEDRFGFVWLELKSFSLYSRVQATFRNADAPSPQAFDEMLKNIQSLTS,539,NP_079011.3.csv,refseq-FBXO31-NM_024735.4_clinical_seed_0_final,refseq-FBXO31-NM_024735.4.a2m,Invitae,refseq-FBXO31-NM_024735.4.npy,1,539,539
+NP_079017.1,MEEAEELLLEGKKALQLAREPRLGLDLGWNPSGEGCTQGLKDVPPEPTRDILALKSLPRGLALGPSLAKEQRLGVWCVGDPLQPGLLWGPLEEESASKEKGEGVKPRQEENLSLGPWGDVCACEQSSGWTSLVQRGRLESEGNVAPVRISERLHLQVYQLVLPGSELLLWPQPSSEGPSLTQPGLDKEAAVAVVTEVESAVQQEVASPGEDAAEPCIDPGSQSPSGIQAENMVSPGLKFPTQDRISKDSQPLGPLLQDGDVDEECPAQAQMPPELQSNSATQQDPDGSGASFSSSARGTQPHGYLAKKLHSPSDQCPPRAKTPEPGAQQSGFPTLSRSPPGPAGSSPKQGRRYRCGECGKAFLQLCHLKKHAFVHTGHKPFLCTECGKSYSSEESFKAHMLGHRGVRPFPCPQCDKAYGTQRDLKEHQVVHSGARPFACDQCGKAFARRPSLRLHRKTHQVPAAPAPCPCPVCGRPLANQGSLRNHMRLHTGEKPFLCPHCGRAFRQRGNLRGHLRLHTGERPYRCPHCADAFPQLPELRRHLISHTGEAHLCPVCGKALRDPHTLRAHERLHSGERPFPCPQCGRAYTLATKLRRHLKSHLEDKPYRCPTCGMGYTLPQSLRRHQLSHRPEAPCSPPSVPSAASEPTVVLLQAEPQLLDTHREEEVSPARDVVEVTISESQEKCFVVPEEPDAAPSLVLIHKDMGLGAWAEVVEVEMGT,720,NP_079017.1.csv,refseq-ZNF408-NM_024741.2_clinical_seed_0_final,refseq-ZNF408-NM_024741.2.a2m,Invitae,refseq-ZNF408-NM_024741.2.npy,1,720,720
+NP_079023.2,MKRSGTLRLLSDLSAFGGAARLRELVAGDSAVRVRGSPDGRHLLLLRPPGAVAPQLLVASRGPGAELERAWPAGQPSPLDAFFLPWPARPALVLVWESGLAEVWGAGVGPGWRPLQSTELCPGGGARVVAVAALRGRLVWCEERQARAEGPSGSPAAAFSHCVCVRTLEPSGEASTSLGRTHVLLHHCPAFGLLASCRQLFLVPTATTWPGVAHVLLIWSPGKGKVMVAAPRLGLSYSKSLNPGRGDTWDFRTLLRGLPGLLSPREPLAVHTWAPTPQGLLLLDFGGTVSLLQSHGGTRAVGTLQEAPVGPWGSAALGTFQGTLACVLGSTLELLDMGSGQLLERKVLSTDRVHLLEPPAPGMEDEEELETRGNLRLLSALGLFCVGWEAPQGVELPSAKDLVFEEACGYYQRRSLRGAQLTPEELRHSSTFRAPQALASILQGHLPPSALLTMLRTELRDYRGLEQLKAQLVAGDDEEAGWTELAEQEVARLLRTELIGDQLAQLNTVFQALPTAAWGATLRALQLQLDGNGKLRSQAPPDVWKKVLGGITAGKEPPNGILPPFELLCQCLCQLEPRWLPPFVELAQQQGGPGWGAGGPGLPLYRRALAVLGEEGTRPEALELELLLSSGRPKAVLQAVGQLVQKEQWDRALDAGLALGPSSPLLRSEIFKLLLAEFAQHRRLDAHLPLLCRLCPPELAPAELLLLLRTYLPDEVGPPTPFPEPGAEPPLTVGLLKALLEQTGAQGWLSGPVLSPYEDILWDPSTPPPTPPRDL,775,NP_079023.2.csv,refseq-HPS6-NM_024747.5_clinical_seed_0_final,refseq-HPS6-NM_024747.5.a2m,Invitae,refseq-HPS6-NM_024747.5.npy,1,775,775
+NP_079029.3,MDSQELKTLINYYCQERYFHHVLLVASEGIKRYGSDPVFRFYHAYGTLMEGKTQEALREFEAIKNKQDVSLCSLLALIYAHKMSPNPDREAILESDARVKEQRKGAGEKALYHAGLFLWHIGRHDKAREYIDRMIKISDGSKQGHVLKAWLDITRGKEPYTKKALKYFEEGLQDGNDTFALLGKAQCLEMRQNYSGALETVNQIIVNFPSFLPAFVKKMKLQLALQDWDQTVETAQRLLLQDSQNVEALRMQALYYVCREGDIEKASTKLENLGNTLDAMEPQNAQLFYNITLAFSRTCGRSQLILQKIQTLLERAFSLNPQQSEFATELGYQMILQGRVKEALKWYKTAMTLDETSVSALVGFIQCQLIEGQLQDADQQLEFLNEIQQSIGKSAELIYLHAVLAMKKNKRQEEVINLLNDVLDTHFSQLEGLPLGIQYFEKLNPDFLLEIVMEYLSFCPMQPASPGQPLCPLLRRCISVLETVVRTVPGLLQTVFLIAKVKYLSGDIEAAFNNLQHCLEHNPSYADAHLLLAQVYLSQEKVKLCSQSLELCLSYDFKVRDYPLYHLIKAQSQKKMGEIADAIKTLHMAMSLPGMKRIGASTKSKDRKTEVDTSHRLSIFLELIDVHRLNGEQHEATKVLQDAIHEFSGTSEEVRVTIANADLALAQGDIERALSILQNVTAEQPYFIEAREKMADIYLKHRKDKMLYITCFREIAERMANPRSFLLLGDAYMNILEPEEAIVAYEQALNQNPKDGTLASKMGKALIKTHNYSMAITYYEAALKTGQKNYLCYDLAELLLKLKWYDKAEKVLQHALAHEPVNELSALMEDGRCQVLLAKVYSKMEKLGDAITALQQARELQARVLKRVQMEQPDAVPAQKHLAAEICAEIAKHSVAQRDYEKAIKFYREALVHCETDNKIMLELARLYLAQDDPDSCLRQCALLLQSDQDNEAATMMMADLMFRKQDYEQAVFHLQQLLERKPDNYMTLSRLIDLLRRCGKLEDVPRFFSMAEKRNSRAKLEPGFQYCKGLYLWYTGEPNDALRHFNKARKDRDWGQNALYNMIEICLNPDNETVGGEVFENLDGDLGNSTEKQESVQLAVRTAEKLLKELKPQTVQGHVQLRIMENYCLMATKQKSNVEQALNTFTEIAASEKEHIPALLGMATAYMILKQTPRARNQLKRIAKMNWNAIDAEEFEKSWLLLADIYIQSAKYDMAEDLLKRCLRHNRSCCKAYEYMGYIMEKEQAYTDAALNYEMAWKYSNRTNPAVGYKLAFNYLKAKRYVDSIDICHQVLEAHPTYPKIRKDILDKARASLRP,1316,NP_079029.3.csv,refseq-TTC21B-NM_024753.4_clinical_seed_0_final,refseq-TTC21B-NM_024753.4.a2m,Invitae,refseq-TTC21B-NM_024753.4.npy,1,1316,1316
+NP_079033.4,MAAADAEAVPARGEPQQDCCVKTELLGEETPMAADEGSAEKQAGEAHMAADGETNGSCENSDASSHANAAKHTQDSARVNPQDGTNTLTRIAENGVSERDSEAAKQNHVTADDFVQTSVIGSNGYILNKPALQAQPLRTTSTLASSLPGHAAKTLPGGAGKGRTPSAFPQTPAAPPATLGEGSADTEDRKLPAPGADVKVHRARKTMPKSVVGLHAASKDPREVREARDHKEPKEEINKNISDFGRQQLLPPFPSLHQSLPQNQCYMATTKSQTACLPFVLAAAVSRKKKRRMGTYSLVPKKKTKVLKQRTVIEMFKSITHSTVGSKGEKDLGASSLHVNGESLEMDSDEDDSEELEEDDGHGAEQAAAFPTEDSRTSKESMSEADRAQKMDGESEEEQESVDTGEEEEGGDESDLSSESSIKKKFLKRKGKTDSPWIKPARKRRRRSRKKPSGALGSESYKSSAGSAEQTAPGDSTGYMEVSLDSLDLRVKGILSSQAEGLANGPDVLETDGLQEVPLCSCRMETPKSREITTLANNQCMATESVDHELGRCTNSVVKYELMRPSNKAPLLVLCEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESSISHRFHKDCASRVNNASYCPHCGEESSKAKEVTIAKADTTSTVTPVPGQEKGSALEGRADTTTGSAAGPPLSEDDKLQGAASHVPEGFDPTGPAGLGRPTPGLSQGPGKETLESALIALDSEKPKKLRFHPKQLYFSARQGELQKVLLMLVDGIDPNFKMEHQNKRSPLHAAAEAGHVDICHMLVQAGANIDTCSEDQRTPLMEAAENNHLEAVKYLIKAGALVDPKDAEGSTCLHLAAKKGHYEVVQYLLSNGQMDVNCQDDGGWTPMIWATEYKHVDLVKLLLSKGSDINIRDNEENICLHWAAFSGCVDIAEILLAAKCDLHAVNIHGDSPLHIAARENRYDCVVLFLSRDSDVTLKNKEGETPLQCASLNSQVWSALQMSKALQDSAPDRPSPVERIVSRDIARGYERIPIPCVNAVDSEPCPSNYKYVSQNCVTSPMNIDRNITHLQYCVCIDDCSSSNCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREEDSYLFDLDNKDGEVYCIDARFYGNVSRFINHHCEPNLVPVRVFMAHQDLRFPRIAFFSTRLIEAGEQLGFDYGERFWDIKGKLFSCRCGSPKCRHSSAALAQRQASAAQEAQEDGLPDTSSAAAADPL,1298,NP_079033.4.csv,refseq-EHMT1-NM_024757.4_clinical_seed_0_final,refseq-EHMT1-NM_024757.4.a2m,Invitae,refseq-EHMT1-NM_024757.4.npy,1,1298,1298
+NP_079058.1,MEELEQGLLMQPWAWLQLAENSLLAKVFITKQGYALLVSDLQQVWHEQVDTSVVSQRAKELNKRLTAPPAAFLCHLDNLLRPLLKDAAHPSEATFSCDCVADALILRVRSELSGLPFYWNFHCMLASPSLVSQHLIRPLMGMSLALQCQVRELATLLHMKDLEIQDYQESGATLIRDRLKTEPFEENSFLEQFMIEKLPEACSIGDGKPFVMNLQDLYMAVTTQEVQVGQKHQGAGDPHTSNSASLQGIDSQCVNQPEQLVSSAPTLSAPEKESTGTSGPLQRPQLSKVKRKKPRGLFS,299,NP_079058.1.csv,refseq-NHEJ1-NM_024782.2_clinical_seed_0_final,refseq-NHEJ1-NM_024782.2.a2m,Invitae,refseq-NHEJ1-NM_024782.2.npy,1,299,299
+NP_079066.5,MLFPLQVAAVTSSVRDDPLEHCVSPRTRARSPEICKMADNLDEFIEEQKARLAEDKAELESDPPYMEMKGKLSAKLSENSKILISMAKENIPPNSQQTRGSLGIDYGLSLPLGEDYERKKHKLKEELRQDYRRYLTQERLKLERNKEYNQFLRGKEESSEKFRQVEKSTEPKSQRNKKPIGQVKPDLTSQIQTSCENSEGPRKDVLTPSEAYEELLNQRRLEEDRYRQLDDEIELRNRRIIKKANEEVGISNLKHQRFASKAGIPDRRFHRFNEDRVFDRRYHRPDQDPEVSEEMDERFRYESDFDRRLSRVYTNDRMHRNKRGNMPPMEHDGDVIEQSNIRISSAENKSAPDNETSKSANQDTCSPFAGMLFGGEDRELIQRRKEKYRLELLEQMAEQQRNKRREKDLELRVAASGAQDPEKSPDRLKQFSVAPRHFEEMIPPERPRIAFQTPLPPLSAPSVPPIPSVHPVPSQNEDLRSGLSSALGEMVSPRIAPLPPPPLLPPLATNYRTPYDDAYYFYGSRNTFDPSLAYYGSGMMGVQPAAYVSAPVTHQLAQPVVNTVGQNELKITSDQVINSGLIFEDKPKPSKQSLQSYQEALQQQIREREERRKKEREEKEEYEAKLEAEMRTYNPWGKGGGGAPLRDAKGNLITDLNRMHRQNIDAYHNPDARTYEDKRAVVSLDPNLATSNAENLEDAANKSSGHMQTQSSPFARGNVFGEPPTELQIKQQELYKNFLRFQIEEKKQREEAERERLRIAEEKEERRLAEQRARIQQEYEEEQEKKREKEEEQRLKNEEHIRLAEERQKEAERKKKEEEEKYNLQLQHYCERDNLIGEETKHMRQPSPIVPALQNKIASKLQRPPSVDSIIRSFIHESSMSRAQSPPVPARKNQLRAEEEKKNVIMELSEMRKQLRSEERRLQERLLHMDSDDEIPIRKKERNPMDIFDMARHRLQAPVRRQSPKGLDAATFQNVHDFNELKDRDSETRVDLKFMYLDPPRDHHTLEIQQQALLREQQKRLNRIKMQEGAKVDLDAIPSAKVREQRMPRDDTSDFLKNSLLESDSAFIGAYGETYPAIEDDVLPPPSQLPSARERRRNKWKGLDIDSSRPNVAPDGLSLKSISSVNVDELRVRNEERMRRLNEFHNKPINTDDESSLVDPDDIMKHIGDDGSNSVATEPWLRPGTSETLKRFMAEQLNQEQQQIPGKPGTFTWQGLSTAHG,1221,NP_079066.5.csv,refseq-CSPP1-NM_024790.6_clinical_seed_0_final,refseq-CSPP1-NM_024790.6.a2m,Invitae,refseq-CSPP1-NM_024790.6.npy,1,1221,1221
+NP_079085.2,MGFQPPAALLLRLFLLQGILRLLWGDLAFIPPFIRMSGPAVSASLVGDTEGVTVSLAVLQDEAGILPIPTCGVLNNETEDWSVTVIPGAKVLEVTVRWKRGLDWCSSNETDSFSESPCILQTLLVSASHNSSCSAHLLIQVEIYANSSLTHNASENVTVIPNQVYQPLGPCPCNLTAGACDVRCCCDQECSSNLTTLFRRSCFTGVFGGDVNPPFDQLCSAGTTTRGVPDWFPFLCVQSPLANTPFLGYFYHGAVSPKQDSSFEVYVDTDAKDFADFGYKQGDPIMTVKKAYFTIPQVSLAGQCMQNAPVAFLHNFDVKCVTNLELYQERDGIINAKIKNVALGGIVTPKVIYEEATDLDKFITNTETPLNNGSTPRIVNVEEHYIFKWNNNTISEINVKIFRAEINAHQKGIMTQRFVVKFLSYNSGNEEELSGNPGYQLGKPVRALNINRMNNVTTLHLWQSAGRGLCTSATFKPILFGENVLSGCLLEVGINENCTQLRENAVERLDSLIQATHVAMRGNSDYADLSDGWLEIIRVDAPDPGADPLASSVNGMCLDIPAHLSIRILISDAGAVEGITQQEILGVETRFSSVNWQYQCGLTCEHKADLLPISASVQFIKIPAQLPHPLTRFQINYTEYDCNRNEVCWPQLLYPWTQYYQGELHSQCVAKGLLLLLFLTLALFLSNPWTRICKAYS,697,NP_079085.2.csv,refseq-TCTN2-NM_024809.4_clinical_seed_0_final,refseq-TCTN2-NM_024809.4.a2m,Invitae,refseq-TCTN2-NM_024809.4.npy,1,697,697
+NP_079120.1,MEELDGEPTVTLIPGVNSKKNQMYFDWGPGEMLVCETSFNKKEKSEMVPSCPFIYIIRKDVDVYSQILRKLFNESHGIFLGLQRIDEELTGKSRKSQLVRVSKNYRSVIRACMEEMHQVAIAAKDPANGRQFSSQVSILSAMELIWNLCEILFIEVAPAGPLLLHLLDWVRLHVCEVDSLSADVLGSENPSKHDSFWNLVTILVLQGRLDEARQMLSKEADASPASAGICRIMGDLMRTMPILSPGNTQTLTELELKWQHWHEECERYLQDSTFATSPHLESLLKIMLGDEAALLEQKELLSNWYHFLVTRLLYSNPTVKPIDLHYYAQSSLDLFLGGESSPEPLDNILLAAFEFDIHQVIKECSIALSNWWFVAHLTDLLDHCKLLQSHNLYFGSNMREFLLLEYASGLFAHPSLWQLGVDYFDYCPELGRVSLELHIERIPLNTEQKALKVLRICEQRQMTEQVRSICKILAMKAVRNNRLGSALSWSIRAKDAAFATLVSDRFLRDYCERGCFSDLDLIDNLGPAMMLSDRLTFLGKYREFHRMYGEKRFADAASLLLSLMTSRIAPRSFWMTLLTDALPLLEQKQVIFSAEQTYELMRCLEDLTSRRPVHGESDTEQLQDDDIETTKVEMLRLSLARNLARAIIREGSLEGS,656,NP_079120.1.csv,refseq-NUP85-NM_024844.4_clinical_seed_0_final,refseq-NUP85-NM_024844.4.a2m,Invitae,refseq-NUP85-NM_024844.4.npy,1,656,656
+NP_079130.2,MEAARPPPTAGKFVVVGGGIAGVTCAEQLATHFPSEDILLVTASPVIKAVTNFKQISKILEEFDVEEQSSTMLGKRFPNIKVIESGVKQLKSEEHCIVTEDGNQHVYKKLCLCAGAKPKLICEGNPYVLGIRDTDSAQEFQKQLTKAKRIMIIGNGGIALELVYEIEGCEVIWAIKDKAIGNTFFDAGAAEFLTSKLIAEKSEAKIAHKRTRYTTEGRKKEARSKSKADNVGSALGPDWHEGLNLKGTKEFSHKIHLETMCEVKKIYLQDEFRILKKKSFTFPRDHKSVTADTEMWPVYVELTNEKIYGCDFIVSATGVTPNVEPFLHGNSFDLGEDGGLKVDDHMHTSLPDIYAAGDICTTSWQLSPVWQQMRLWTQARQMGWYAAKCMAAASSGDSIDMDFSFELFAHVTKFFNYKVVLLGKYNAQGLGSDHELMLRCTKGREYIKVVMQNGRMMGAVLIGETDLEETFENLILNQMNLSSYGEDLLDPNIDIEDYFD,500,NP_079130.2.csv,refseq-PYROXD1-NM_024854.3_clinical_seed_0_final,refseq-PYROXD1-NM_024854.3.a2m,Invitae,refseq-PYROXD1-NM_024854.3.npy,1,500,500
+NP_079146.2,MSEDSRGDSRAESAKDLEKQLRLRVCVLSELQKTERDYVGTLEFLVSAFLHRMNQCAASKVDKNVTEETVKMLFSNIEDILAVHKEFLKVVEECLHPEPNAQQEVGTCFLHFKDKFRIYDEYCSNHEKAQKLLLELNKIRTIRTFLLNCMLLGGRKNTDVPLEGYLVTPIQRICKYPLILKELLKRTPRKHSDYAAVMEALQAMKAVCSNINEAKRQMEKLEVLEEWQSHIEGWEGSNITDTCTEMLMCGVLLKISSGNIQERVFFLFDNLLVYCKRKHRRLKNSKASTDGHRYLFRGRINTEVMEVENVDDGTADFHSSGHIVVNGWKIHNTAKNKWFVCMAKTPEEKHEWFEAILKERERRKGLKLGMEQDTWVMISEQGEKLYKMMCRQGNLIKDRKRKLTTFPKCFLGSEFVSWLLEIGEIHRPEEGVHLGQALLENGIIHHVTDKHQFKPEQMLYRFRYDDGTFYPRNEMQDVISKGVRLYCRLHSLFTPVIRDKDYHLRTYKSVVMANKLIDWLIAQGDCRTREEAMIFGVGLCDNGFMHHVLEKSEFKDEPLLFRFFSDEEMEGSNMKHRLMKHDLKVVENVIAKSLLIKSNEGSYGFGLEDKNKVPIIKLVEKGSNAEMAGMEVGKKIFAINGDLVFMRPFNEVDCFLKSCLNSRKPLRVLVSTKPRETVKIPDSADGLGFQIRGFGPSVVHAVGRGTVAAAAGLHPGQCIIKVNGINVSKETHASVIAHVTACRKYRRPTKQDSIQWVYNSIESAQEDLQKSHSKPPGDEAGDAFDCKVEEVIDKFNTMAIIDGKKEHVSLTVDNVHLEYGVVYEYDSTAGIKCNVVEKMIEPKGFFSLTAKILEALAKSDEHFVQNCTSLNSLNEVIPTDLQSKFSALCSERIEHLCQRISSYKKFSRVLKNRAWPTFKQAKSKISPLHSSDFCPTNCHVNVMEVSYPKTSTSLGSAFGVQLDSRKHNSHDKENKSSEQGKLSPMVYIQHTITTMAAPSGLSLGQQDGHGLRYLLKEEDLETQDIYQKLLGKLQTALKEVEMCVCQIDDLLSSITYSPKLERKTSEGIIPTDSDNEKGERNSKRVCFNVAGDEQEDSGHDTISNRDSYSDCNSNRNSIASFTSICSSQCSSYFHSDEMDSGDELPLSVRISHDKQDKIHSCLEHLFSQVDSITNLLKGQAVVRAFDQTKYLTPGRGLQEFQQEMEPKLSCPKRLRLHIKQDPWNLPSSVRTLAQNIRKFVEEVKCRLLLALLEYSDSETQLRRDMVFCQTLVATVCAFSEQLMAALNQMFDNSKENEMETWEASRRWLDQIANAGVLFHFQSLLSPNLTDEQAMLEDTLVALFDLEKVSFYFKPSEEEPLVANVPLTYQAEGSRQALKVYFYIDSYHFEQLPQRLKNGGGFKIHPVLFAQALESMEGYYYRDNVSVEEFQAQINAASLEKVKQYNQKLRAFYLDKSNSPPNSTSKAAYVDKLMRPLNALDELYRLVASFIRSKRTAACANTACSASGVGLLSVSSELCNRLGACHIIMCSSGVHRCTLSVTLEQAIILARSHGLPPRYIMQATDVMRKQGARVQNTAKNLGVRDRTPQSAPRLYKLCEPPPPAGEE,1606,NP_079146.2.csv,refseq-PREX2-NM_024870.3_clinical_seed_0_final,refseq-PREX2-NM_024870.3.a2m,Invitae,refseq-PREX2-NM_024870.3.npy,1,1606,1606
+NP_079152.3,MWLKVGGLLRGTGGQLGQTVGWPCGALGPGPHRWGPCGGSWAQKFYQDGPGRGLGEEDIRRAREARPRKTPRPQLSDRSRERKVPASRISRLANFGGLAVGLGLGVLAEMAKKSMPGGRLQSEGGSGLDSSPFLSEANAERIVQTLCTVRGAALKVGQMLSIQDNSFISPQLQHIFERVRQSADFMPRWQMLRVLEEELGRDWQAKVASLEEVPFAAASIGQVHQGLLRDGTEVAVKIQYPGIAQSIQSDVQNLLAVLKMSAALPAGLFAEQSLQALQQELAWECDYRREAACAQNFRQLLANDPFFRVPAVVKELCTTRVLGMELAGGVPLDQCQGLSQDLRNQICFQLLTLCLRELFEFRFMQTDPNWANFLYDASSHQVTLLDFGASREFGTEFTDHYIEVVKAAADGDRDCVLQKSRDLKFLTGFETKAFSDAHVEAVMILGEPFATQGPYDFGSGETARRIQDLIPVLLRHRLCPPPEETYALHRKLAGAFLACAHLRAHIACRDLFQDTYHRYWASRQPDAATAGSLPTKGDSWVDPS,544,NP_079152.3.csv,refseq-COQ8B-NM_024876.3_clinical_seed_0_final,refseq-COQ8B-NM_024876.3.a2m,Invitae,refseq-COQ8B-NM_024876.3.npy,1,544,544
+NP_079160.1,MVPALRYLVGACGRARGLFAGGSPGACGFASGRPRPLCGGSRSASTSSFDIVIVGGGIVGLASARALILRHPSLSIGVLEKEKDLAVHQTGHNSGVIHSGIYYKPESLKAKLCVQGAALLYEYCQQKGISYKQCGKLIVAVEQEEIPRLQALYEKGLQNGVPGLRLIQQEDIKKKEPYCRGLMAIDCPHTGIVDYRQVALSFAQDFQEAGGSVLTNFEVKGIEMAKESPSRSIDGMQYPIVIKNTKGEEIRCQYVVTCAGLYSDRISELSGCTPDPRIVPFRGDYLLLKPEKCYLVKGNIYPVPDSRFPFLGVHFTPRMDGSIWLGPNAVLAFKREGYRPFDFSATDVMDIIINSGLIKLASQNFSYGVTEMYKACFLGATVKYLQKFIPEITISDILRGPAGVRAQALDRDGNLVEDFVFDAGVGDIGNRILHVRNAPSPAATSSIAISGMIADEVQQRFEL,463,NP_079160.1.csv,refseq-L2HGDH-NM_024884.2_clinical_seed_0_final,refseq-L2HGDH-NM_024884.2.a2m,Invitae,refseq-L2HGDH-NM_024884.2.npy,1,463,463
+NP_079163.2,MSWIKEGELSLWERFCANIIKAGPMPKHIAFIMDGNRRYAKKCQVERQEGHSQGFNKLAETLRWCLNLGILEVTVYAFSIENFKRSKSEVDGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQELIAQAVQATKNYNKCFLNVCFAYTSRHEISNAVREMAWGVEQGLLDPSDISESLLDKCLYTNRSPHPDILIRTSGEVRLSDFLLWQTSHSCLVFQPVLWPEYTFWNLFEAILQFQMNHSVLQQKARDMYAEERKRQQLERDQATVTEQLLREGLQASGDAQLRRTRLHKLSARREERVQGFLQALELKRADWLARLGTASA,334,NP_079163.2.csv,refseq-DHDDS-NM_024887.3_clinical_seed_0_final,refseq-DHDDS-NM_024887.3.a2m,Invitae,refseq-DHDDS-NM_024887.3.npy,1,334,334
+NP_079191.2,MSQESDNNKRLVALVPMPSDPPFNTRRAYTSEDEAWKSYLENPLTAATKAMMSINGDEDSAAALGLLYDYYKVPRDKRLLSVSKASDSQEDQEKRNCLGTSEAQSNLSGGENRVQVLKTVPVNLSLNQDHLENSKREQYSISFPESSAIIPVSGITVVKAEDFTPVFMAPPVHYPRGDGEEQRVVIFEQTQYDVPSLATHSAYLKDDQRSTPDSTYSESFKDAATEKFRSASVGAEEYMYDQTSSGTFQYTLEATKSLRQKQGEGPMTYLNKGQFYAITLSETGDNKCFRHPISKVRSVVMVVFSEDKNRDEQLKYWKYWHSRQHTAKQRVLDIADYKESFNTIGNIEEIAYNAVSFTWDVNEEAKIFITVNCLSTDFSSQKGVKGLPLMIQIDTYSYNNRSNKPIHRAYCQIKVFCDKGAERKIRDEERKQNRKKGKGQASQTQCNSSSDGKLAAIPLQKKSDITYFKTMPDLHSQPVLFIPDVHFANLQRTGQVYYNTDDEREGGSVLVKRMFRPMEEEFGPVPSKQMKEEGTKRVLLYVRKETDDVFDALMLKSPTVKGLMEAISEKYGLPVEKIAKLYKKSKKGILVNMDDNIIEHYSNEDTFILNMESMVEGFKVTLMEI,625,NP_079191.2.csv,refseq-GRHL2-NM_024915.3_clinical_seed_0_final,refseq-GRHL2-NM_024915.3.a2m,Invitae,refseq-GRHL2-NM_024915.3.npy,1,625,625
+NP_079272.4,MRLLGAAAVAALGRGRAPASLGWQRKQVNWKACRWSSSGVIPNEKIRNIGISAHIDSGKTTLTERVLYYTGRIAKMHEVKGKDGVGAVMDSMELERQRGITIQSAATYTMWKDVNINIIDTPGHVDFTIEVERALRVLDGAVLVLCAVGGVQCQTMTVNRQMKRYNVPFLTFINKLDRMGSNPARALQQMRSKLNHNAAFMQIPMGLEGNFKGIVDLIEERAIYFDGDFGQIVRYGEIPAELRAAATDHRQELIECVANSDEQLGEMFLEEKIPSISDLKLAIRRATLKRSFTPVFLGSALKNKGVQPLLDAVLEYLPNPSEVQNYAILNKEDDSKEKTKILMNSSRDNSHPFVGLAFKLEVGRFGQLTYVRSYQGELKKGDTIYNTRTRKKVRLQRLARMHADMMEDVEEVYAGDICALFGIDCASGDTFTDKANSGLSMESIHVPDPVISIAMKPSNKNDLEKFSKGIGRFTREDPTFKVYFDTENKETVISGMGELHLEIYAQRLEREYGCPCITGKPKVAFRETITAPVPFDFTHKKQSGGAGQYGKVIGVLEPLDPEDYTKLEFSDETFGSNIPKQFVPAVEKGFLDACEKGPLSGHKLSGLRFVLQDGAHHMVDSNEISFIRAGEGALKQALANATLCILEPIMAVEVVAPNEFQGQVIAGINRRHGVITGQDGVEDYFTLYADVPLNDMFGYSTELRSCTEGKGEYTMEYSRYQPCLPSTQEDVINKYLEATGQLPVKKGKAKN,751,NP_079272.4.csv,refseq-GFM1-NM_024996.5_clinical_seed_0_final,refseq-GFM1-NM_024996.5.a2m,Invitae,refseq-GFM1-NM_024996.5_theta_0.2.npy,1,751,751
+NP_079353.3,MAADSDDGAVSAPAASDGGVSKSTTSGEELVVQVPVVDVQSNNFKEMWPSLLLAIKTANFVAVDTELSGLGDRKSLLNQCIEERYKAVCHAARTRSILSLGLACFKRQPDKGEHSYLAQVFNLTLLCMEEYVIEPKSVQFLIQHGFNFNQQYAQGIPYHKGNDKGDESQSQSVRTLFLELIRARRPLVLHNGLIDLVFLYQNFYAHLPESLGTFTADLCEMFPAGIYDTKYAAEFHARFVASYLEYAFRKCERENGKQRAAGSPHLTLEFCNYPSSMRDHIDYRCCLPPATHRPHPTSICDNFSAYGWCPLGPQCPQSHDIDLIIDTDEAAAEDKRRRRRRREKRKRALLNLPGTQTSGEAKDGPPKKQVCGDSIKPEETEQEVAADETRNLPHSKQGNKNDLEMGIKAARPEIADRATSEVPGSQASPNPVPGDGLHRAGFDAFMTGYVMAYVEVSQGPQPCSSGPWLPECHNKVYLSGKAVPLTVAKSQFSRSSKAHNQKMKLTWGSS,510,NP_079353.3.csv,refseq-TOE1-NM_025077.3_clinical_seed_0_final,refseq-TOE1-NM_025077.3.a2m,Invitae,refseq-TOE1-NM_025077.3.npy,1,510,510
+NP_079375.3,MAAGRAQVPSSEQAWLEDAQVFIQKTLCPAVKEPNVQLTPLVIDCVKTVWLSQGRNQGSTLPLSYSFVSVQDLKTHQRLPCCSHLSWSSSAYQAWAQEAGPNGNPLPREQLLLLGTLTDLSADLEQECRNGSLYVRDNTGVLSCELIDLDLSWLGHLFLFPRWSYLPPARWNSSGEGHLELWDAPVPVFPLTISPGPVTPIPVLYPESASCLLRLRNKLRGVQRNLAGSLVRLSALVKSKQKAYFILSLGRSHPAVTHVSIIVQVPAQLVWHRALRPGTAYVLTELRVSKIRGQRQHVWMTSQSSRLLLLKPECVQELELELEGPLLEADPKPLPMPSNSEDKKDPESLVRYSRLLSYSGAVTGVLNEPAGLYELDGQLGLCLAYQQFRGLRRVMRPGVCLQLQDVHLLQSVGGGTRRPVLAPCLRGAVLLQSFSRQKPGAHSSRQAYGASLYEQLVWERQLGLPLYLWATKALEELACKLCPHVLRHHQFLQHSSPGSPSLGLQLLAPTLDLLAPPGSPVRNAHNEILEEPHHCPLQKYTRLQTPSSFPTLATLKEEGQRKAWASFDPKALLPLPEASYLPSCQLNRRLAWSWLCLLPSAFCPAQVLLGVLVASSHKGCLQLRDQSGSLPCLLLAKHSQPLSDPRLIGCLVRAERFQLIVERDVRSSFPSWKELSMPGFIQKQQARVYVQFFLADALILPVPRPCLHSATPSTPQTDPTGPEGPHLGQSRLFLLCHKEALMKRNFCVPPGASPEVPKPALSFYVLGSWLGGTQRKEGTGWGLPEPQGNDDNDQKVHLIFFGSSVRWFEFLHPGQVYRLIAPGPATPMLFEKDGSSCISRRPLELAGCASCLTVQDNWTLELESSQDIQDVLDANKSLPESSLTDLLSDNFTDSLVSFSAEILSRTLCEPLVASLWMKLGNTGAMRRCVKLTVALETAECEFPPHLDVYIEDPHLPPSLGLLPGARVHFSQLEKRVSRSHNVYCCFRSSTYVQVLSFPPETTISIPLPHIYLAELLQGGQSPFQATASCHIVSVFSLQLFWVCAYCTSICRQGKCTRLGSTCPTQTAISQAIIRLLVEDGTAEAVVTCRNHHVAAALGLCPREWASLLDFVQVPGRVVLQFAGPGAQLESSARVDEPMTMFLWTLCTSPSVLRPIVLSFELERKPSKIVPLEPPRLQRFQCGELPFLTHVNPRLRLSCLSIRESEYSSSLGILASSC,1217,NP_079375.3.csv,refseq-CTC1-NM_025099.5_clinical_seed_0_final,refseq-CTC1-NM_025099.5.a2m,Invitae,refseq-CTC1-NM_025099.5.npy,1,1217,1217
+NP_079408.3,MKRIFSLLEKTWLGAPIQFAWQKTSGNYLAVTGADYIVKIFDRHGQKRSEINLPGNCVAMDWDKDGDVLAVIAEKSSCIYLWDANTNKTSQLDNGMRDQMSFLLWSKVGSFLAVGTVKGNLLIYNHQTSRKIPVLGKHTKRITCGCWNAENLLALGGEDKMITVSNQEGDTIRQTQVRSEPSNMQFFLMKMDDRTSAAESMISVVLGKKTLFFLNLNEPDNPADLEFQQDFGNIVCYNWYGDGRIMIGFSCGHFVVISTHTGELGQEIFQARNHKDNLTSIAVSQTLNKVATCGDNCIKIQDLVDLKDMYVILNLDEENKGLGTLSWTDDGQLLALSTQRGSLHVFLTKLPILGDACSTRIAYLTSLLEVTVANPVEGELPITVSVDVEPNFVAVGLYHLAVGMNNRAWFYVLGENAVKKLKDMEYLGTVASICLHSDYAAALFEGKVQLHLIESEILDAQEERETRLFPAVDDKCRILCHALTSDFLIYGTDTGVVQYFYIEDWQFVNDYRHPVSVKKIFPDPNGTRLVFIDEKSDGFVYCPVNDATYEIPDFSPTIKGVLWENWPMDKGVFIAYDDDKVYTYVFHKDTIQGAKVILAGSTKVPFAHKPLLLYNGELTCQTQSGKVNNIYLSTHGFLSNLKDTGPDELRPMLAQNLMLKRFSDAWEMCRILNDEAAWNELARACLHHMEVEFAIRVYRRIGNVGIVMSLEQIKGIEDYNLLAGHLAMFTNDYNLAQDLYLASSCPIAALEMRRDLQHWDSALQLAKHLAPDQIPFISKEYAIQLEFAGDYVNALAHYEKGITGDNKEHDEACLAGVAQMSIRMGDIRRGVNQALKHPSRVLKRDCGAILENMKQFSEAAQLYEKGLYYDKAASVYIRSKNWAKVGDLLPHVSSPKIHLQYAKAKEADGRYKEAVVAYENAKQWQSVIRIYLDHLNNPEKAVNIVRETQSLDGAKMVARFFLQLGDYGSAIQFLVMSKCNNEAFTLAQQHNKMEIYADIIGSEDTTNEDYQSIALYFEGEKRYLQAGKFFLLCGQYSRALKHFLKCPSSEDNVAIEMAIETVGQAKDELLTNQLIDHLLGENDGMPKDAKYLFRLYMALKQYREAAQTAIIIAREEQSAGNYRNAHDVLFSMYAELKSQKIKIPSEMATNLMILHSYILVKIHVKNGDHMKGARMLIRVANNISKFPSHIVPILTSTVIECHRAGLKNSAFSFAAMLMRPEYRSKIDAKYKKKIEGMVRRPDISEIEEATTPCPFCKFLLPECELLCPGCKNSIPYCIATGRHMLKDDWTVCPHCDFPALYSELKIMLNTESTCPMCSERLNAAQLKKISDCTQYLRTEEEL,1342,NP_079408.3.csv,refseq-WDR19-NM_025132.3_clinical_seed_0_final,refseq-WDR19-NM_025132.3.a2m,Invitae,refseq-WDR19-NM_025132.3.npy,1,1342,1342
+NP_079412.1,MVVGAFPMAKLLYLGIRQVSKPLANRIKEAARRSEFFKTYICLPPAQLYHWVEMRTKMRIMGFRGTVIKPLNEEAAAELGAELLGEATIFIVGGGCLVLEYWRHQAQQRHKEEEQRAAWNALRDEVGHLALALEALQAQVQAAPPQGALEELRTELQEVRAQLCNPGRSASHAVPASKK,179,NP_079412.1.csv,refseq-OPA3-NM_025136.3_clinical_seed_0_final,refseq-OPA3-NM_025136.3.a2m,Invitae,refseq-OPA3-NM_025136.3.npy,1,179,179
+NP_079426.2,MALYQRWRCLRLQGLQACRLHTAVVSTPPRWLAERLGLFEELWAAQVKRLASMAQKEPRTIKISLPGGQKIDAVAWNTTPYQLARQISSTLADTAVAAQVNGEPYDLERPLETDSDLRFLTFDSPEGKAVFWHSSTHVLGAAAEQFLGAVLCRGPSTEYGFYHDFFLGKERTIRGSELPVLERICQELTAAARPFRRLEASRDQLRQLFKDNPFKLHLIEEKVTGPTATVYGCGTLVDLCQGPHLRHTGQIGGLKLLSNSSSLWRSSGAPETLQRVSGISFPTTELLRVWEAWREEAELRDHRRIGKEQELFFFHELSPGSCFFLPRGTRVYNALVAFIRAEYAHRGFSEVKTPTLFSTKLWEQSGHWEHYQEDMFAVQPPGSDRPPSSQSDDSTRHITDTLALKPMNCPAHCLMFAHRPRSWRELPLRLADFGALHRAEASGGLGGLTRLRCFQQDDAHIFCTTDQLEAEIQSCLDFLRSVYAVLGFSFRLALSTRPSGFLGDPCLWDQAEQVLKQALKEFGEPWDLNSGDGAFYGPKIDVHLHDALGRPHQCGTIQLDFQLPLRFDLQYKGQAGALERPVLIHRAVLGSVERLLGVLAESCGGKWPLWLSPFQVVVIPVGSEQEEYAKEAQQSLRAAGLVSDLDADSGLTLSRRIRRAQLAHYNFQFVVGQKEQSKRTVNIRTRDNRRLGEWDLPEAVQRLVELQNTRVPNAEEIF,718,NP_079426.2.csv,refseq-TARS2-NM_025150.4_clinical_seed_0_final,refseq-TARS2-NM_025150.4.a2m,Invitae,refseq-TARS2-NM_025150.4.npy,1,718,718
+NP_079428.2,MGIWQRLLLFGGVSLRAGGGATAPLGGSRAMVCGRQLSGAGSETLKQRRTQIMSRGLPKQKPIEGVKQVIVVASGKGGVGKSTTAVNLALALAANDSSKAIGLLDVDVYGPSVPKMMNLKGNPELSQSNLMRPLLNYGIACMSMGFLVEESEPVVWRGLMVMSAIEKLLRQVDWGQLDYLVVDMPPGTGDVQLSVSQNIPITGAVIVSTPQDIALMDAHKGAEMFRRVHVPVLGLVQNMSVFQCPKCKHKTHIFGADGARKLAQTLGLEVLGDIPLHLNIREASDTGQPIVFSQPESDEAKAYLRIAVEVVRRLPSPSE,319,NP_079428.2.csv,refseq-NUBPL-NM_025152.2_clinical_seed_0_final,refseq-NUBPL-NM_025152.2.a2m,Invitae,refseq-NUBPL-NM_025152.2.npy,1,319,319
+NP_079436.4,MQANGAGGGGGGGGGGGGGGGGGGGQGQTPELACLSAQNGESSPSSSSSAGDLAHANGLLPSAPSAASNNSNSLNVNNGVPGGAAAASSATVAAASATTAASSSLATPELGSSLKKKKRLSQSDEDVIRLIGQHLNGLGLNQTVDLLMQESGCRLEHPSATKFRNHVMEGDWDKAENDLNELKPLVHSPHAIVVRGALEISQTLLGIIVRMKFLLLQQKYLEYLEDGKVLEALQVLRCELTPLKYNTERIHVLSGYLMCSHAEDLRAKAEWEGKGTASRSKLLDKLQTYLPPSVMLPPRRLQTLLRQAVELQRDRCLYHNTKLDNNLDSVSLLIDHVCSRRQFPCYTQQILTEHCNEVWFCKFSNDGTKLATGSKDTTVIIWQVDPDTHLLKLLKTLEGHAYGVSYIAWSPDDNYLVACGPDDCSELWLWNVQTGELRTKMSQSHEDSLTSVAWNPDGKRFVTGGQRGQFYQCDLDGNLLDSWEGVRVQCLWCLSDGKTVLASDTHQRIRGYNFEDLTDRNIVQEDHPIMSFTISKNGRLALLNVATQGVHLWDLQDRVLVRKYQGVTQGFYTIHSCFGGHNEDFIASGSEDHKVYIWHKRSELPIAELTGHTRTVNCVSWNPQIPSMMASASDDGTVRIWGPAPFIDHQNIEEECSSMDS,661,NP_079436.4.csv,refseq-WDR26-NM_025160.6_clinical_seed_0_final,refseq-WDR26-NM_025160.6.a2m,Invitae,refseq-WDR26-NM_025160.6.npy,1,661,661
+NP_079455.3,MEQRRPWPRALEVDSRSVVLLSVVWVLLAPPAAGMPQFSTFHSENRDWTFNHLTVHQGTGAVYVGAINRVYKLTGNLTIQVAHKTGPEEDNKSCYPPLIVQPCSEVLTLTNNVNKLLIIDYSENRLLACGSLYQGVCKLLRLDDLFILVEPSHKKEHYLSSVNKTGTMYGVIVRSEGEDGKLFIGTAVDGKQDYFPTLSSRKLPRDPESSAMLDYELHSDFVSSLIKIPSDTLALVSHFDIFYIYGFASGGFVYFLTVQPETPEGVAINSAGDLFYTSRIVRLCKDDPKFHSYVSLPFGCTRAGVEYRLLQAAYLAKPGDSLAQAFNITSQDDVLFAIFSKGQKQYHHPPDDSALCAFPIRAINLQIKERLQSCYQGEGNLELNWLLGKDVQCTKAPVPIDDNFCGLDINQPLGGSTPVEGLTLYTTSRDRMTSVASYVYNGYSVVFVGTKSGKLKKIRADGPPHGGVQYEMVSVLKDGSPILRDMAFSIDQRYLYVMSERQVTRVPVESCEQYTTCGECLSSGDPHCGWCALHNMCSRRDKCQQAWEPNRFAASISQCVSLAVHPSSISVSEHSRLLSLVVSDAPDLSAGIACAFGNLTEVEGQVSGSQVICISPGPKDVPVIPLDQDWFGLELQLRSKETGKIFVSTEFKFYNCSAHQLCLSCVNSAFRCHWCKYRNLCTHDPTTCSFQEGRINISEDCPQLVPTEEILIPVGEVKPITLKARNLPQPQSGQRGYECVLNIQGAIHRVPALRFNSSSVQCQNSSYQYDGMDISNLAVDFAVVWNGNFIIDNPQDLKVHLYKCAAQRESCGLCLKADRKFECGWCSGERRCTLHQHCTSPSSPWLDWSSHNVKCSNPQITEILTVSGPPEGGTRVTIHGVNLGLDFSEIAHHVQVAGVPCTPLPGEYIIAEQIVCEMGHALVGTTSGPVRLCIGECKPEFMTKSHQQYTFVNPSVLSLNPIRGPESGGTMVTITGHYLGAGSSVAVYLGNQTCEFYGRSMSEIVCVSPPSSNGLGPVPVSVSVDRAHVDSNLQFEYIDDPRVQRIEPEWSIASGHTPLTITGFNLDVIQEPRIRVKFNGKESVNVCKVVNTTTLTCLAPSLTTDYRPGLDTVERPDEFGFVFNNVQSLLIYNDTKFIYYPNPTFELLSPTGVLDQKPGSPIILKGKNLCPPASGGAKLNYTVLIGETPCAVTVSETQLLCEPPNLTGQHKVMVHVGGMVFSPGSVSVISDSLLTLPAIVSIAAGGSLLLIIVIIVLIAYKRKSRENDLTLKRLQMQMDNLESRVALECKEAFAELQTDINELTSDLDRSGIPYLDYRTYAMRVLFPGIEDHPVLRELEVQGNGQQHVEKALKLFAQLINNKVFLLTFIRTLELQRSFSMRDRGNVASLIMTGLQGRLEYATDVLKQLLSDLIDKNLENKNHPKLLLRRTESVAEKMLTNWFAFLLHKFLKECAGEPLFMLYCAIKQQMEKGPIDAITGEARYSLSEDKLIRQQIEYKTLILNCVNPDNENSPEIPVKVLNCDTITQVKEKILDAVYKNVPYSQRPRAVDMDLEWRQGRIARVVLQDEDITTKIEGDWKRLNTLMHYQVSDRSVVALVPKQTSSYNIPASASISRTSISRYDSSFRYTGSPDSLRSRAPMITPDLESGVKVWHLVKNHDHGDQKEGDRGSKMVSEIYLTRLLATKGTLQKFVDDLFETLFSTVHRGSALPLAIKYMFDFLDEQADRHSIHDTDVRHTWKSNCLPLRFWVNVIKNPQFVFDIHKGSITDACLSVVAQTFMDSCSTSEHRLGKDSPSNKLLYAKDIPSYKSWVERYYADIAKLPAISDQDMNAYLAEQSRLHAVEFNMLSALNEIYSYVSKYSEELIGALEQDEQARRQRLAYKVEQLINAMSIES,1894,NP_079455.3.csv,refseq-PLXNA2-NM_025179.3_clinical_seed_0_final,refseq-PLXNA2-NM_025179.3.a2m,Invitae,refseq-PLXNA2-NM_025179.3.npy,1,1894,1894
+NP_079469.2,MADSAQAQKLVYLVTGGCGFLGEHVVRMLLQREPRLGELRVFDQHLGPWLEELKTGPVRVTAIQGDVTQAHEVAAAVAGAHVVIHTAGLVDVFGRASPKTIHEVNVQGTRNVIEACVQTGTRFLVYTSSMEVVGPNTKGHPFYRGNEDTPYEAVHRHPYPCSKALAEWLVLEANGRKVRGGLPLVTCALRPTGIYGEGHQIMRDFYRQGLRLGGWLFRAIPASVEHGRVYVGNVAWMHVLAARELEQRATLMGGQVYFCYDGSPYRSYEDFNMEFLGPCGLRLVGARPLLPYWLLVFLAALNALLQWLLRPLVLYAPLLNPYTLAVANTTFTVSTDKAQRHFGYEPLFSWEDSRTRTILWVQAATGSAQ,369,NP_079469.2.csv,refseq-HSD3B7-NM_025193.3_clinical_seed_0_final,refseq-HSD3B7-NM_025193.3.a2m,Invitae,refseq-HSD3B7-NM_025193.3.npy,1,369,369
+NP_079483.3,MGWDLGTRLFQRQEQRSRLSRIWLEKTRVFLEGSTRTPALPHCLFWLLQVPSTQDPLFPGYGPQCPVDLAGPPCLRPLFGGLGGYWRALQRGREGRTMTSRASELSPGRSVTAGIIIVGDEILKGHTQDTNTFFLCRTLRSLGVQVCRVSVVPDEVATIAAEVTSFSNRFTHVLTAGGIGPTHDDVTFEAVAQAFGDELKPHPKLEAATKALGGEGWEKLSLVPSSARLHYGTDPCTGQPFRFPLVSVRNVYLFPGIPELLRRVLEGMKGLFQNPAVQFHSKELYVAADEASIAPILAEAQAHFGRRLGLGSYPDWGSNYYQVKLTLDSEEEGPLEECLAYLTARLPQGSLVPYMPNAVEQASEAVYKLAESGSSLGKKVAGALQTIETSLAQYSLTQLCVGFNGGKDCTALLHLFHAAVQRKLPDVPNPLQILYIRSISPFPELEQFLQDTIKRYNLQMLEAEGSMKQALGELQARHPQLEAVLMGTRRTDPYSCSLCPFSPTDPGWPAFMRINPLLDWTYRDIWDFLRQLFVPYCILYDRGYTSLGSRENTVRNPALKCLSPGGHPTYRPAYLLENEEEERNSRT,587,NP_079483.3.csv,refseq-FLAD1-NM_025207.4_clinical_seed_0_final,refseq-FLAD1-NM_025207.4.a2m,Invitae,refseq-FLAD1-NM_025207.4.npy,1,587,587
+NP_079491.2,MGLQLRALLGAFGRWTLRLGPRPSCSPRMAGNAEPPPAGAACPQDRRSCSGRAGGDRVWEDGEHPAKKLKSGGDEERREKPPKRKIVLLMAYSGKGYHGMQRNVGSSQFKTIEDDLVSALVRSGCIPENHGEDMRKMSFQRCARTDKGVSAAGQVVSLKVWLIDDILEKINSHLPSHIRILGLKRVTGGFNSKNRCDARTYCYLLPTFAFAHKDRDVQDETYRLSAETLQQVNRLLACYKGTHNFHNFTSQKGPQDPSACRYILEMYCEEPFVREGLEFAVIRVKGQSFMMHQIRKMVGLVVAIVKGYAPESVLERSWGTEKVDVPKAPGLGLVLERVHFEKYNQRFGNDGLHEPLDWAQEEGKVAAFKEEHIYPTIIGTERDERSMAQWLSTLPIHNFSATALTAGGTGAKVPSPLEGSEGDGDTD,427,NP_079491.2.csv,refseq-PUS1-NM_025215.5_clinical_seed_0_final,refseq-PUS1-NM_025215.5.a2m,Invitae,refseq-PUS1-NM_025215.5.npy,1,427,427
+NP_079492.2,MGSAHPRPWLRLRPQPQPRPALWVLLFFLLLLAAAMPRSAPNDILDLRLPPEPVLNANTVCLTLPGLSRRQMEVCVRHPDVAASAIQGIQIAIHECQHQFRDQRWNCSSLETRNKIPYESPIFSRGFRESAFAYAIAAAGVVHAVSNACALGKLKACGCDASRRGDEEAFRRKLHRLQLDALQRGKGLSHGVPEHPALPTASPGLQDSWEWGGCSPDMGFGERFSKDFLDSREPHRDIHARMRLHNNRVGRQAVMENMRRKCKCHGTSGSCQLKTCWQVTPEFRTVGALLRSRFHRATLIRPHNRNGGQLEPGPAGAPSPAPGAPGPRRRASPADLVYFEKSPDFCEREPRLDSAGTVGRLCNKSSAGSDGCGSMCCGRGHNILRQTRSERCHCRFHWCCFVVCEECRITEWVSVCK,417,NP_079492.2.csv,refseq-WNT10A-NM_025216.2_clinical_seed_0_final,refseq-WNT10A-NM_025216.2.a2m,Invitae,refseq-WNT10A-NM_025216.2.npy,1,417,417
+NP_079495.1,MADQRQRSLSTSGESLYHVLGLDKNATSDDIKKSYRKLALKYHPDKNPDNPEAADKFKEINNAHAILTDATKRNIYDKYGSLGLYVAEQFGEENVNTYFVLSSWWAKALFVFCGLLTCCYCCCCLCCCFNCCCGKCKPKAPEGEETEFYVSPEDLEAQLQSDEREATDTPIVIQPASATETTQLTADSHPSYHTDGFN,198,NP_079495.1.csv,refseq-DNAJC5-NM_025219.2_clinical_seed_0_final,refseq-DNAJC5-NM_025219.2.a2m,Invitae,refseq-DNAJC5-NM_025219.2.npy,1,198,198
+NP_079509.5,MAVFRSGLLVLTTPLASLAPRLASILTSAARLVNHTLYVHLQPGMSLEGPAQPQSSPVQATFEVLDFITHLYAGADVHRHLDVRILLTNIRTKSTFLPPLPTSVQNLAHPPEVVLTDFQTLDGSQYNPVKQQLVRYATSCYSCCPRLASVLLYSDYGIGEVPVEPLDVPLPSTIRPASPVAGSPKQPVRGYYRGAVGGTFDRLHNAHKVLLSVACILAQEQLVVGVADKDLLKSKLLPELLQPYTERVEHLSEFLVDIKPSLTFDVIPLLDPYGPAGSDPSLEFLVVSEETYRGGMAINRFRLENDLEELALYQIQLLKDLRHTENEEDKVSSSSFRQRMLGNLLRPPYERPELPTCLYVIGLTGISGSGKSSIAQRLKGLGAFVIDSDHLGHRAYAPGGPAYQPVVEAFGTDILHKDGIINRKVLGSRVFGNKKQLKILTDIMWPIIAKLAREEMDRAVAEGKRVCVIDAAVLLEAGWQNLVHEVWTAVIPETEAVRRIVERDGLSEAAAQSRLQSQMSGQQLVEQSHVVLSTLWEPHITQRQVEKAWALLQKRIPKTHQALD,564,NP_079509.5.csv,refseq-COASY-NM_025233.6_clinical_seed_0_final,refseq-COASY-NM_025233.6.a2m,Invitae,refseq-COASY-NM_025233.6.npy,1,564,564
+NP_079519.1,MDCYRTSLSSSWIYPTVILCLFGFFSMMRPSEPFLIPYLSGPDKNLTSAEITNEIFPVWTYSYLVLLLPVFVLTDYVRYKPVIILQGISFIITWLLLLFGQGVKTMQVVEFFYGMVTAAEVAYYAYIYSVVSPEHYQRVSGYCRSVTLAAYTAGSVLAQLLVSLANMSYFYLNVISLASVSVAFLFSLFLPMPKKSMFFHAKPSREIKKSSSVNPVLEETHEGEAPGCEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWSLWWAFATAGFNQVLNYVQILWDYKAPSQDSSIYNGAVEAIATFGGAVAAFAVGYVKVNWDLLGELALVVFSVVNAGSLFLMHYTANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVERYALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMRSMYITYSTKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL,496,NP_079519.1.csv,refseq-SLC19A3-NM_025243.3_clinical_seed_0_final,refseq-SLC19A3-NM_025243.3.a2m,Invitae,refseq-SLC19A3-NM_025243.3.npy,1,496,496
+NP_079535.4,MASLGANPRRTPQGPRPGAASSGFPSPAPVPGPREAEEEEVEEEEELAEIHLCVLWNSGYLGIAYYDTSDSTIHFMPDAPDHESLKLLQRVLDEINPQSVVTSAKQDENMTRFLGKLASQEHREPKRPEIIFLPSVDFGLEISKQRLLSGNYSFIPDAMTATEKILFLSSIIPFDCLLTPPGDLRFTPIPLLIPSQVRALGGLLKFLGRRRIGVELEDYNVSVPILGFKKFMLTHLVNIDQDTYSVLQIFKSESHPSVYKVASGLKEGLSLFGILNRCHCKWGEKLLRLWFTRPTHDLGELSSRLDVIQFFLLPQNLDMAQMLHRLLGHIKNVPLILKRMKLSHTKVSDWQVLYKTVYSALGLRDACRSLPQSIQLFRDIAQEFSDDLHHIASLIGKVVDFEGSLAENRFTVLPNIDPEIDEKKRRLMGLPSFLTEVARKELENLDSRIPSCSVIYIPLIGFLLSIPRLPSMVEASDFEINGLDFMFLSEEKLHYRSARTKELDALLGDLHCEIRDQETLLMYQLQCQVLARAAVLTRVLDLASRLDVLLALASAARDYGYSRPRYSPQVLGVRIQNGRHPLMELCARTFVPNSTECGGDKGRVKVITGPNSSGKSIYLKQVGLITFMALVGSFVPAEEAEIGAVDAIFTRIHSCESISLGLSTFMIDLNQQVAKAVNNATAQSLVLIDEFGKGTNTVDGLALLAAVLRHWLARGPTCPHIFVATNFLSLVQLQLLPQGPLVQYLTMETCEDGNDLVFFYQVSDLIRSGKPIKPVKDLLKKNQMENCQTLVDKFMKLDLEDPNLDLNVFMSQEVLPAATSIL,822,NP_079535.4.csv,refseq-MSH5-NM_025259.5_clinical_seed_0_final,refseq-MSH5-NM_025259.5.a2m,Invitae,refseq-MSH5-NM_025259.5.npy,1,822,822
+NP_079541.1,MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEMINNNVIVRNAEDIEQLYGKGYFGKGILSRSRPSFTISDPKLVAKWKDMKTNMPIITSKRYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKLNSGMVSNMEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVREDASPLPHVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAAPNEELVQRNRLICRRNPYRIFEYLQLSLEEAFFLVYALGCLSIYYEKEPLTIVKLWKAFTVVQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFYHASYSVIIELVDDHFEGSLRRPLSWKSLAALSRVSVNVSKELMLCYLIKPSTMTDKEMESPECMKRIKVQEVILSRWVSSRERSDQDDL,465,NP_079541.1.csv,refseq-TSEN2-NM_025265.3_clinical_seed_0_final,refseq-TSEN2-NM_025265.3.a2m,Invitae,refseq-TSEN2-NM_025265.3.npy,1,465,465
+NP_085055.2,MAEVHVIGQIIGASGFSESSLFCKWGIHTGAAWKLLSGVREGQTQVDTPQIGDMAYWSHPIDLHFATKGLQGWPRLHFQVWSQDSFGRCQLAGYGFCHVPSSPGTHQLACPTWRPLGSWREQLARAFVGGGPQLLHGDTIYSGADRYRLHTAAGGTVHLEIGLLLRNFDRYGVEC,175,NP_085055.2.csv,refseq-B9D2-NM_030578.3_clinical_seed_0_final,refseq-B9D2-NM_030578.3.a2m,Invitae,refseq-B9D2-NM_030578.3.npy,1,175,175
+NP_085135.1,MKDKRKKKDRTWAEAARLALEKHPNSPMTAKQILEVIQKEGLKETSGTSPLACLNAMLHTNTRIGDGTFFKIPGKSGLYALKKEESSCPADGTLDLVCESELDGTDMAEANAHGEENGVCSKQVTDEASSTRDSSLTNTAVQSKLVSSFQQHTKKALKQALRQQQKRRNGVSMMVNKTVPRVVLTPLKVSDEQSDSPSGSESKNGEADSSDKEMKHGQKSPTGKQTSQHLKRLKKSGLGHLKWTKAEDIDIETPGSILVNTNLRALINKHTFASLPQHFQQYLLLLLPEVDRQMGSDGILRLSTSALNNEFFAYAAQGWKQRLAEGEFTPEMQLRIRQEIEKEKKTEPWKEKFFERFYGEKLGMSREESVKLTTGPNNAGAQSSSSCGTSGLPVSAQTALAEQQPKSMKSPASPEPGFCATLCPMVEIPPKDIMAELESEDILIPEESVIQEEIAEEVETSICECQDENHKTIPEFSEEAESLTNSHEEPQIAPPEDNLESCVMMNDVLETLPHIEVKIEGKSESPQEEMTVVIDQLEVCDSLIPSTSSMTHVSDTEHKESETAVETSTPKIKTGSSSLEGQFPNEGIAIDMELQSDPEEQLSENACISETSFSSESPEGACTSLPSPGGETQSTSEESCTPASLETTFCSEVSSTENTDKYNQRNSTDENFHASLMSEISPISTSPEISEASLMSNLPLTSEASPVSNLPLTSETSPMSDLPLTSETSSVSSMLLTSETTFVSSLPLPSETSPISNSSINERMAHQQRKSPSVSEEPLSPQKDESSATAKPLGENLTSQQKNLSNTPEPIIMSSSSIAPEAFPSEDLHNKTLSQQTCKSHVDTEKPYPASIPELASTEMIKVKNHSVLQRTEKKVLPSPLELSVFSEGTDNKGNELPSAKLQDKQYISSVDKAPFSEGSRNKTHKQGSTQSRLETSHTSKSSEPSKSPDGIRNESRDSEISKRKTAEQHSFGICKEKRARIEDDQSTRNISSSSPPEKEQPPREEPRVPPLKIQLSKIGPPFIIKSQPVSKPESRASTSTSVSGGRNTGARTLADIKARAQQARAQREAAAAAAVAAAASIVSGAMGSPGEGGKTRTLAHIKEQTKAKLFAKHQARAHLFQTSKETRLPPPLSSKEGPPNLEVSSTPETKMEGSTGVIIVNPNCRSPSNKSAHLRETTTVLQQSLNPSKLPETATDLSVHSSDENIPVSHLSEKIVSSTSSENSSVPMLFNKNSVPVSVCSTAISGAIKEHPFVSSVDKSSVLMSVDSANTTISACNISMLKTIQGTDTPCIAIIPKCIESTPISATTEGSSISSSMDDKQLLISSSSASNLVSTQYTSVPTPSIGNNLPNLSTSSVLIPPMGINNRFPSEKIAIPGSEEQATVSMGTTVRAALSCSDSVAVTDSLVAHPTVAMFTGNMLTINSYDSPPKLSAESLDKNSGPRNRADNSGKPQQPPGGFAPAAINRSIPCKVIVDHSTTLTSSLSLTVSVESSEASLDLQGRPVRTEASVQPVACPQVSVISRPEPVANEGIDHSSTFIAASAAKQDSKTLPATCTSLRELPLVPDKLNEPTAPSHNFAEQARGPAPFKSEADTTCSNQYNPSNRICWNDDGMRSTGQPLVTHSGSSKQKEYLEQSCPKAIKTEHANYLNVSELHPRNLVTNVALPVKSELHEADKGFRMDTEDFPGPELPPPAAEGASSVQQTQNMKASTSSPMEEAISLATDALKRVPGAGSSGCRLSSVEANNPLVTQLLQGNLPLEKVLPQPRLGAKLEINRLPLPLQTTSVGKTAPERNVEIPPSSPNPDGKGYLAGTLAPLQMRKRENHPKKRVARTVGEHTQVKCEPGKLLVEPDVKGVPCVISSGISQLGHSQPFKQEWLNKHSMQNRIVHSPEVKQQKRLLPSCSFQQNLFHVDKNGGFHTDAGTSHRQQFYQMPVAARGPIPTAALLQASSKTPVGCNAFAFNRHLEQKGLGEVSLSSAPHQLRLANMLSPNMPMKEGDEVGGTAHTMPNKALVHPPPPPPPPPPPPLALPPPPPPPPPLPPPLPNAEVPSDQKQPPVTMETTKRLSWPQSTGICSNIKSEPLSFEEGLSSSCELGMKQVSYDQNEMKEQLKAFALKSADFSSYLLSEPQKPFTQLAAQKMQVQQQQQLCGNYPTIHFGSTSFKRAASAIEKSIGILGSGSNPATGLSGQNAQMPVQNFADSSNADELELKCSCRLKAMIVCKGCGAFCHDDCIGPSKLCVACLVVR,2248,NP_085135.1.csv,refseq-ASXL3-NM_030632.2_clinical_seed_0_final,refseq-ASXL3-NM_030632.2.a2m,Invitae,refseq-ASXL3-NM_030632.2.npy,1,2248,2248
+NP_109587.1,MLARRKPVLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEELELDEQQKKRLEAFLTQKAKVGELKDDDFERISELGAGNGGVVTKVQHRPSGLIMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKEAKRIPEEILGKVSIAVLRGLAYLREKHQIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFVGTRSYMAPERLQGTHYSVQSDIWSMGLSLVELAVGRYPIPPPDAKELEAIFGRPVVDGEEGEPHSISPRPRPPGRPVSGHGMDSRPAMAIFELLDYIVNEPPPKLPNGVFTPDFQEFVNKCLIKNPAERADLKMLTNHTFIKRSEVEEVDFAGWLCKTLRLNQPGTPTRTAV,400,NP_109587.1.csv,refseq-MAP2K2-NM_030662.3_clinical_seed_0_final,refseq-MAP2K2-NM_030662.3.a2m,Invitae,refseq-MAP2K2-NM_030662.3_theta_0.2.npy,1,400,400
+NP_109590.3,MQSFRERCGFHGKQQNYQQTSQETSRLENYRQPSQAGLSCDRQRLLAKDYYNPQPYPSYEGGAGTPSGTAAAVAADKYHRGSKALPTQQGLQGRPAFPGYGVQDSSPYPGRYAGEESLQAWGAPQPPPPQPQPLPAGVAKYDENLMKKTAVPPSRQYAEQGAQVPFRTHSLHVQQPPPPQQPLAYPKLQRQKLQNDIASPLPFPQGTHFPQHSQSFPTSSTYSSSVQGGGQGAHSYKSCTAPTAQPHDRPLTASSSLAPGQRVQNLHAYQSGRLSYDQQQQQQQQQQQQQQALQSRHHAQETLHYQNLAKYQHYGQQGQGYCQPDAAVRTPEQYYQTFSPSSSHSPARSVGRSPSYSSTPSPLMPNLENFPYSQQPLSTGAFPAGITDHSHFMPLLNPSPTDATSSVDTQAGNCKPLQKDKLPENLLSDLSLQSLTALTSQVENISNTVQQLLLSKAAVPQKKGVKNLVSRTPEQHKSQHCSPEGSGYSAEPAGTPLSEPPSSTPQSTHAEPQEADYLSGSEDPLERSFLYCNQARGSPARVNSNSKAKPESVSTCSVTSPDDMSTKSDDSFQSLHGSLPLDSFSKFVAGERDCPRLLLSALAQEDLASEILGLQEAIGEKADKAWAEAPSLVKDSSKPPFSLENHSACLDSVAKSAWPRPGEPEALPDSLQLDKGGNAKDFSPGLFEDPSVAFATPDPKKTTGPLSFGTKPTLGVPAPDPTTAAFDCFPDTTAASSADSANPFAWPEENLGDACPRWGLHPGELTKGLEQGGKASDGISKGDTHEASACLGFQEEDPPGEKVASLPGDFKQEEVGGVKEEAGGLLQCPEVAKADRWLEDSRHCCSTADFGDLPLLPPTSRKEDLEAEEEYSSLCELLGSPEQRPGMQDPLSPKAPLICTKEEVEEVLDSKAGWGSPCHLSGESVILLGPTVGTESKVQSWFESSLSHMKPGEEGPDGERAPGDSTTSDASLAQKPNKPAVPEAPIAKKEPVPRGKSLRSRRVHRGLPEAEDSPCRAPVLPKDLLLPESCTGPPQGQMEGAGAPGRGASEGLPRMCTRSLTALSEPRTPGPPGLTTTPAPPDKLGGKQRAAFKSGKRVGKPSPKAASSPSNPAALPVASDSSPMGSKTKETDSPSTPGKDQRSMILRSRTKTQEIFHSKRRRPSEGRLPNCRATKKLLDNSHLPATFKVSSSPQKEGRVSQRARVPKPGAGSKLSDRPLHALKRKSAFMAPVPTKKRNLVLRSRSSSSSNASGNGGDGKEERPEGSPTLFKRMSSPKKAKPTKGNGEPATKLPPPETPDACLKLASRAAFQGAMKTKVLPPRKGRGLKLEAIVQKITSPSLKKFACKAPGASPGNPLSPSLSDKDRGLKGAGGSPVGVEEGLVNVGTGQKLPTSGADPLCRNPTNRSLKGKLMNSKKLSSTDCFKTEAFTSPEALQPGGTALAPKKRSRKGRAGAHGLSKGPLEKRPYLGPALLLTPRDRASGTQGASEDNSGGGGKKPKMEELGLASQPPEGRPCQPQTRAQKQPGHTNYSSYSKRKRLTRGRAKNTTSSPCKGRAKRRRQQQVLPLDPAEPEIRLKYISSCKRLRSDSRTPAFSPFVRVEKRDAFTTICTVVNSPGDAPKPHRKPSSSASSSSSSSSFSLDAAGASLATLPGGSILQPRPSLPLSSTMHLGPVVSKALSTSCLVCCLCQNPANFKDLGDLCGPYYPEHCLPKKKPKLKEKVRPEGTCEEASLPLERTLKGPECAAAATAGKPPRPDGPADPAKQGPLRTSARGLSRRLQSCYCCDGREDGGEEAAPADKGRKHECSKEAPAEPGGEAQEHWVHEACAVWTGGVYLVAGKLFGLQEAMKVAVDMMCSSCQEAGATIGCCHKGCLHTYHYPCASDAGCIFIEENFSLKCPKHKRLP,1906,NP_109590.3.csv,refseq-RAI1-NM_030665.3_clinical_seed_0_final,refseq-RAI1-NM_030665.3.a2m,Invitae,refseq-RAI1-NM_030665.3.npy,1,1906,1906
+NP_110400.1,MREIVHIQIGQCGNQIGAKFWEMIGEEHGIDLAGSDRGASALQLERISVYYNEAYGRKYVPRAVLVDLEPGTMDSIRSSKLGALFQPDSFVHGNSGAGNNWAKGHYTEGAELIENVLEVVRHESESCDCLQGFQIVHSLGGGTGSGMGTLLMNKIREEYPDRIMNSFSVMPSPKVSDTVVEPYNAVLSIHQLIENADACFCIDNEALYDICFRTLKLTTPTYGDLNHLVSLTMSGITTSLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTAQGSQQYRALSVAELTQQMFDARNTMAACDLRRGRYLTVACIFRGKMSTKEVDQQLLSVQTRNSSCFVEWIPNNVKVAVCDIPPRGLSMAATFIGNNTAIQEIFNRVSEHFSAMFKRKAFVHWYTSEGMDINEFGEAENNIHDLVSEYQQFQDAKAVLEEDEEVTEEAEMEPEDKGH,451,NP_110400.1.csv,refseq-TUBB1-NM_030773.3_clinical_seed_0_final,refseq-TUBB1-NM_030773.3.a2m,Invitae,refseq-TUBB1-NM_030773.3.npy,1,451,451
+NP_110404.1,MGHSPPVLPLCASVSLLGGLTFGYELAVISGALLPLQLDFGLSCLEQEFLVGSLLLGALLASLVGGFLIDCYGRKQAILGSNLVLLAGSLTLGLAGSLAWLVLGRAVVGFAISLSSMACCIYVSELVGPRQRGVLVSLYEAGITVGILLSYALNYALAGTPWGWRHMFGWATAPAVLQSLSLLFLPAGTDETATHKDLIPLQGGEAPKLGPGRPRYSFLDLFRARDNMRGRTTVGLGLVLFQQLTGQPNVLCYASTIFSSVGFHGGSSAVLASVGLGAVKVAATLTAMGLVDRAGRRALLLAGCALMALSVSGIGLVSFAVPMDSGPSCLAVPNATGQTGLPGDSGLLQDSSLPPIPRTNEDQREPILSTAKKTKPHPRSGDPSAPPRLALSSALPGPPLPARGHALLRWTALLCLMVFVSAFSFGFGPVTWLVLSEIYPVEIRGRAFAFCNSFNWAANLFISLSFLDLIGTIGLSWTFLLYGLTAVLGLGFIYLFVPETKGQSLAEIDQQFQKRRFTLSFGHRQNSTGIPYSRIEISAAS,541,NP_110404.1.csv,refseq-SLC2A10-NM_030777.3_clinical_seed_0_final,refseq-SLC2A10-NM_030777.3.a2m,Invitae,refseq-SLC2A10-NM_030777.3.npy,1,541,541
+NP_110420.3,MGPRKKSVKTCIMNNEIPEEMTADETKDYMNQLSHEVLCHIFRYLPLQDIMCMECLSRKLKEAVTLYLRVVRVVDLCAGRWWEYMPSGFTDASFLTLLKKMPDVEQLYGLHPRYLERRRVRGHEAFSIPGVLEALQACPNLVGVETSHLELVESIWTYMPHVHILGKFRNRNGAFPIPPENKLKIPIGAKIQTLHLVGVNVPEIPCIPMLRHLYMKWVRLTKPQPFKDFLCISLRTFVMRNCAGPTNSLKYVPLVTGLASARNLEHLEMVRVPFLGGLIQHVVEDSWRSGGFRNLHTIVLGACKNALEVDLGYLIITAARRLHEVRIQPSLTKDGVFSALKMAELEFPQFETLHLGYVDEFLLQSRMANADLVKYGLADVVENPGIITDIGMKAVNEVFSCIKYLAIYNCPHLHNPYNWISDHSRWTRLVDINLVRCHALKLDSFGQFIELLPSLEFISLDQMFREPPKGCARVGLSAGTGIGVSSALVSNQNSNNDDNNAQNNNANIHDNNHHHPDDSDEENDFRQDLQPGEQQFAADALNEMEDIVQEDGEVVAESGNNTPAHSQAIIPVDVDEEQAGPSGLQRVVKPTSITVHDSESDDEEDSLELQEVWIPKNGTRRYSEREEKTGESVQSRELSVSGKGKTPLRKRYNSHQMGQSKQFPLEESSCEKGCQVTSEQIKADMKAARDIPEKKKNKDVYPSCSSTTASTVGNSSSHNTASQSPDFVRTVNSGGSSEPSPTEVDVSRQCACSPGGSEDSEAMEEGDAESSVCPRCCCHRPQESQRRTSRCSDEERPSTSRACVVNGPDEVAKTKPRHAMKRKRTADKSTSTSDPVIEDDHVQVLVLKSKNLVGVTMTNCGITDLVLKDCPKMMFIHATRCRVLKHLKVENAPIVNRFDYAQCKKLNMDQVLDQILRMPPERNRIIYLRPMQQVDTLTLEQKLFSGPYPYHICIIHEFSNPPNVRNKVRIRSWMDTIANINQELIKYEFFPEATRSEEDLKKYPKYPWGREIYTLEGVVDGAPYSMISDFPWLRSLRAAEPNSFARYDFEDDEESTIYAPRRKGQLSADICMETIGEEISEMRQMKKGVFQRVVAIFIHYCDVNGEPVEDDYI,1113,NP_110420.3.csv,refseq-FBXO38-NM_030793.4_clinical_seed_0_final,refseq-FBXO38-NM_030793.4.a2m,Invitae,refseq-FBXO38-NM_030793.4.npy,1,1113,1113
+NP_110440.1,MLGSLVLRRKALAPRLLLRLLRSPTLRGHGGASGRNVTTGSLGEPQWLRVATGGRPGTSPALFSGRGAATGGRQGGRFDTKCLAAATWGRLPGPEETLPGQDSWNGVPSRAGLGMCALAAALVVHCYSKSPSNKDAALLEAARANNMQEVSRLLSEGADVNAKHRLGWTALMVAAINRNNSVVQVLLAAGADPNLGDDFSSVYKTAKEQGIHSLEDGGQDGASRHITNQWTSALEFRRWLGLPAGVLITREDDFNNRLNNRASFKGCTALHYAVLADDYRTVKELLDGGANPLQRNEMGHTPLDYAREGEVMKLLRTSEAKYQEKQRKREAEERRRFPLEQRLKEHIIGQESAIATVGAAIRRKENGWYDEEHPLVFLFLGSSGIGKTELAKQTAKYMHKDAKKGFIRLDMSEFQERHEVAKFIGSPPGYVGHEEGGQLTKKLKQCPNAVVLFDEVDKAHPDVLTIMLQLFDEGRLTDGKGKTIDCKDAIFIMTSNVASDEIAQHALQLRQEALEMSRNRIAENLGDVQISDKITISKNFKENVIRPILKAHFRRDEFLGRINEIVYFLPFCHSELIQLVNKELNFWAKRAKQRHNITLLWDREVADVLVDGYNVHYGARSIKHEVERRVVNQLAAAYEQDLLPGGCTLRITVEDSDKQLLKSPELPSPQAEKRLPKLRLEIIDKDSKTRRLDIRAPLHPEKVCNTI,707,NP_110440.1.csv,refseq-CLPB-NM_030813.5_clinical_seed_0_final,refseq-CLPB-NM_030813.5.a2m,Invitae,refseq-CLPB-NM_030813.5.npy,1,707,707
+NP_112180.4,MADEDGEGIHPSAPHRNGGGGGGGGSGLHCAGNGGGGGGGPRVVRIVKSESGYGFNVRGQVSEGGQLRSINGELYAPLQHVSAVLPGGAADRAGVRKGDRILEVNHVNVEGATHKQVVDLIRAGEKELILTVLSVPPHEADNLDPSDDSLGQSFYDYTEKQAVPISVPRYKHVEQNGEKFVVYNVYMAGRQLCSKRYREFAILHQNLKREFANFTFPRLPGKWPFSLSEQQLDARRRGLEEYLEKVCSIRVIGESDIMQEFLSESDENYNGVSDVELRVALPDGTTVTVRVKKNSTTDQVYQAIAAKVGMDSTTVNYFALFEVISHSFVRKLAPNEFPHKLYIQNYTSAVPGTCLTIRKWLFTTEEEILLNDNDLAVTYFFHQAVDDVKKGYIKAEEKSYQLQKLYEQRKMVMYLNMLRTCEGYNEIIFPHCACDSRRKGHVITAISITHFKLHACTEEGQLENQVIAFEWDEMQRWDTDEEGMAFCFEYARGEKKPRWVKIFTPYFNYMHECFERVFCELKWRKEEY,528,NP_112180.4.csv,refseq-SNX27-NM_030918.5_clinical_seed_0_final,refseq-SNX27-NM_030918.5.a2m,Invitae,refseq-SNX27-NM_030918.5.npy,1,528,528
+NP_112190.2,MEQRRVTDFFARRRPGPPRIAPPKLACRTPSPARPALRAPASATSGSRKRARPPAAPGRDQARPPARRRLRLSVDEVSSPSTPEAPDIPACPSPGQKIKKSTPAAGQPPHLTSAQDQDTISELASCLQRARELGARVRALKASAQDAGESCTPEAEGRPEEPCGEKAPAYQRFHALAQPGLPGLVLPYKYQVLAEMFRSMDTIVGMLHNRSETPTFAKVQRGVQDMMRRRFEECNVGQIKTVYPASYRFRQERSVPTFKDGTRRSDYQLTIEPLLEQEADGAAPQLTASRLLQRRQIFSQKLVEHVKEHHKAFLASLSPAMVVPEDQLTRWHPRFNVDEVPDIEPAALPQPPATEKLTTAQEVLARARNLISPRMEKALSQLALRSAAPSSPGSPRPALPATPPATPPAASPSALKGVSQDLLERIRAKEAQKQLAQMTRCPEQEQRLQRLERLPELARVLRSVFVSERKPALSMEVACARMVGSCCTIMSPGEMEKHLLLLSELLPDWLSLHRIRTDTYVKLDKAADLAHITARLAHQTRAEEGL,546,NP_112190.2.csv,refseq-CDT1-NM_030928.3_clinical_seed_0_final,refseq-CDT1-NM_030928.3.a2m,Invitae,refseq-CDT1-NM_030928.3.npy,1,546,546
+NP_112205.2,MGVLGRVLLWLQLCALTQAVSKLWVPNTDFDVAANWSQNRTPCAGGAVEFPADKMVSVLVQEGHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEAPGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASRAGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWICAALLQPLGGRCPQAACHSALRPQGQCCDLCGAVVLLTHGPAFDLERYRARILDTFLGLPQYHGLQVAVSKVPRSSRLREADTEIQVVLVENGPETGGAGRLARALLADVAENGEALGVLEATMRESGAHVWGSSAAGLAGGVAAAVLLALLVLLVAPPLLRRAGRLRWRRHEAAAPAGAPLGFRNPVFDVTASEELPLPRRLSLVPKAAADSTSHSYFVNPLFAGAEAEA,453,NP_112205.2.csv,refseq-AMN-NM_030943.3_clinical_seed_0_final,refseq-AMN-NM_030943.3.a2m,Invitae,refseq-AMN-NM_030943.3.npy,1,453,453
+NP_112210.1,MDYPKMDYFLDVESAHRLLDVESAQRFFYSQGAQARRATLLLPPTLMAASSEDDIDRRPIRRVRSKSDTPYLAEARISFNLGAAEEVERLAAMRSDSLVPGTHTPPIRRRSKFANLGRIFKPWKWRKKKSEKFKHTSAALERKISMRQSREELIKRGVLKEIYDKDGELSISNEEDSLENGQSLSSSQLSLPALSEMEPVPMPRDPCSYEVLQPSDIMDGPDPGAPVKLPCLPVKLSPPLPPKKVMICMPVGGPDLSLVSYTAQKSGQQGVAQHHHTVLPSQIQHQLQYGSHGQHLPSTTGSLPMHPSGCRMIDELNKTLAMTMQRLESSEQRVPCSTSYHSSGLHSGDGVTKAGPMGLPEIRQVPTVVIECDDNKENVPHESDYEDSSCLYTREEEEEEEDEDDDSSLYTSSLAMKVCRKDSLAIKLSNRPSKRELEEKNILPRQTDEERLELRQQIGTKLTRRLSQRPTAEELEQRNILKPRNEQEEQEEKREIKRRLTRKLSQRPTVEELRERKILIRFSDYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRFHRP,580,NP_112210.1.csv,refseq-PHACTR1-NM_030948.3_clinical_seed_0_final,refseq-PHACTR1-NM_030948.3.a2m,Invitae,refseq-PHACTR1-NM_030948.3.npy,1,580,580
+NP_112224.1,MARLADYFIVVGYDHEKPGSGEGLGKIIQRFPQKDWDDTPFPQGIELFCQPGGWQLSRERKQPTFFVVVLTDIDSDRHYCSCLTFYEAEINLQGTKKEEIEGEAKVSGLIQPAEVFAPKSLVLVSRLYYPEIFRACLGLIYTVYVDSLNVSLESLIANLCACLVPAAGGSQKLFSLGAGDRQLIQTPLHDSLPITGTSVALLFQQLGIQNVLSLFCAVLTENKVLFHSASFQRLSDACRALESLMFPLKYSYPYIPILPAQLLEVLSSPTPFIIGVHSVFKTDVHELLDVIIADLDGGTIKIPECIHLSSLPEPLLHQTQSALSLILHPDLEVADHAFPPPRTALSHSKMLDKEVRAVFLRLFAQLFQGYRSCLQLIRIHAEPVIHFHKTAFLGQRGLVENDFLTKVLSGMAFAGFVSERGPPYRSCDLFDELVAFEVERIKVEENNPVKMIKHVRELAEQLFKNENPNPHMAFQKVPRPTEGSHLRVHILPFPEINEARVQELIQENVAKNQNAPPATRIEKKCVVPAGPPVVSIMDKVTTVFNSAQRLEVVRNCISFIFENKILETEKTLPAALRALKGKAARQCLTDELGLHVQQNRAILDHQQFDYIIRMMNCTLQDCSSLEEYNIAAALLPLTSAFYRKLAPGVSQFAYTCVQDHPIWTNQQFWETTFYNAVQEQVRSLYLSAKEDNHAPHLKQKDKLPDDHYQEKTAMDLAAEQLRLWPTLSKSTQQELVQHEESTVFSQAIHFANLMVNLLVPLDTSKNKLLRTSAPGDWESGSNSIVTNSIAGSVAESYDTESGFEDSENTDIANSVVRFITRFIDKVCTESGVTQDHIKSLHCMIPGIVAMHIETLEAVHRESRRLPPIQKPKILRPALLPGEEIVCEGLRVLLDPDGREEATGGLLGGPQLLPAEGALFLTTYRILFRGTPHDQLVGEQTVVRSFPIASITKEKKITMQNQLQQNMQEGLQITSASFQLIKVAFDEEVSPEVVEIFKKQLMKFRYPQSIFSTFAFAAGQTTPQIILPKQKEKNTSFRTFSKTIVKGAKRAGKMTIGRQYLLKKKTGTIVEERVNRPGWNEDDDVSVSDESELPTSTTLKASEKSTMEQLVEKACFRDYQRLGLGTISGSSSRSRPEYFRITASNRMYSLCRSYPGLLVVPQAVQDSSLPRVARCYRHNRLPVVCWKNSRSGTLLLRSGGFHGKGVVGLFKSQNSPQAAPTSSLESSSSIEQEKYLQALLNAVSVHQKLRGNSTLTVRPAFALSPGVWASLRSSTRLISSPTSFIDVGARLAGKDHSASFSNSSYLQNQLLKRQAALYIFGEKSQLRNFKVEFALNCEFVPVEFHEIRQVKASFKKLMRACIPSTIPTDSEVTFLKALGDSEWFPQLHRIMQLAVVVSEVLENGSSVLVCLEEGWDITAQVTSLVQLLSDPFYRTLEGFQMLVEKEWLSFGHKFSQRSSLTLNCQGSGFAPVFLQFLDCVHQVHNQYPTEFEFNLYYLKFLAFHYVSNRFKTFLLDSDYERLEHGTLFDDKGEKHAKKGVCIWECIDRMHKRSPIFFNYLYSPLEIEALKPNVNVSSLKKWDYYIEETLSTGPSYDWMMLTPKHFPSEDSDLAGEAGPRSQRRTVWPCYDDVSCTQPDALTSLFSEIEKLEHKLNQAPEKWQQLWERVTVDLKEEPRTDRSQRHLSRSPGIVSTNLPSYQKRSLLHLPDSSMGEEQNSSISPSNGVERRAATLYSQYTSKNDENRSFEGTLYKRGALLKGWKPRWFVLDVTKHQLRYYDSGEDTSCKGHIDLAEVEMVIPAGPSMGAPKHTSDKAFFDLKTSKRVYNFCAQDGQSAQQWMDKIQSCISDA,1849,NP_112224.1.csv,refseq-SBF2-NM_030962.3_clinical_seed_0_final,refseq-SBF2-NM_030962.3.a2m,Invitae,refseq-SBF2-NM_030962.3.npy,1,1849,1849
+NP_112235.2,MVPGSEGPARAGSVVADVVFVIEGTANLGPYFEGLRKHYLLPAIEYFNGGPPAETDFGGDYGGTQYSLVVFNTVDCAPESYVQCHAPTSSAYEFVTWLDGIKFMGGGGESCSLIAEGLSTALQLFDDFKKMREQIGQTHRVCLLICNSPPYLLPAVESTTYSGCTTENLVQQIGERGIHFSIVSPRKLPALRLLFEKAAPPALLEPLQPPTDVSQDPRHMVLVRGLVLPVGGGSAPGPLQSKQPVPLPPAAPSGATLSAAPQQPLPPVPPQYQVPGNLSAAQVAAQNAVEAAKNQKAGLGPRFSPITPLQQAAPGVGPPFSQAPAPQLPPGPPGAPKPPPASQPSLVSTVAPGSGLAPTAQPGAPSMAGTVAPGGVSGPSPAQLGAPALGGQQSVSNKLLAWSGVLEWQEKPKPASVDANTKLTRSLPCQVYVNHGENLKTEQWPQKLIMQLIPQQLLTTLGPLFRNSRMVQFHFTNKDLESLKGLYRIMGNGFAGCVHFPHTAPCEVRVLMLLYSSKKKIFMGLIPYDQSGFVNGIRQVITNHKQVQQQKLEQQQRGMGGQQAPPGLGPILEDQARPSQNLLQLRPPQPQPQGTVGASGATGQPQPQGTAQPPPGAPQGPPGAASGPPPPGPILRPQNPGANPQLRSLLLNPPPPQTGVPPPQASLHHLQPPGAPALLPPPHQGLGQPQLGPPLLHPPPAQSWPAQLPPRAPLPGQMLLSGGPRGPVPQPGLQPSVMEDDILMDLI,747,NP_112235.2.csv,refseq-MED25-NM_030973.3_clinical_seed_0_final,refseq-MED25-NM_030973.3.a2m,Invitae,refseq-MED25-NM_030973.3.npy,1,747,747
+NP_112420.1,MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGSGDGYNGFGNDGGYGGGGPGYSGGSRGYGSGGQGYGNQGSGYGGSGSYDSYNNGGGGGFGGGSGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF,372,NP_112420.1.csv,refseq-HNRNPA1-NM_031157.2_clinical_seed_0_final,refseq-HNRNPA1-NM_031157.2.a2m,Invitae,refseq-HNRNPA1-NM_031157.2.npy,1,372,372
+NP_112483.1,MSWESGAGPGLGSQGMDLVWSAWYGKCVKGKGSLPLSAHGIVVAWLSRAEWDQVTVYLFCDDHKLQRYALNRITVWRSRSGNELPLAVASTADLIRCKLLDVTGGLGTDELRLLYGMALVRFVNLISERKTKFAKVPLKCLAQEVNIPDWIVDLRHELTHKKMPHINDCRRGCYFVLDWLQKTYWCRQLENSLRETWELEEFREGIEEEDQEEDKNIVVDDITEQKPEPQDDGKSTESDVKADGDSKGSEEVDSHCKKALSHKELYERARELLVSYEEEQFTVLEKFRYLPKAIKAWNNPSPRVECVLAELKGVTCENREAVLDAFLDDGFLVPTFEQLAALQIEYEDGQTEVQRGEGTDPKSHKNVDLNDVLVPKPFSQFWQPLLRGLHSQNFTQALLERMLSELPALGISGIRPTYILRWTVELIVANTKTGRNARRFSAGQWEARRGWRLFNCSASLDWPRMVESCLGSPCWASPQLLRIIFKAMGQGLPDEEQEKLLRICSIYTQSGENSLVQEGSEASPIGKSPYTLDSLYWSVKPASSSFGSEAKAQQQEEQGSVNDVKEEEKEEKEVLPDQVEEEEENDDQEEEEEDEDDEDDEEEDRMEVGPFSTGQESPTAENARLLAQKRGALQGSAWQVSSEDVRWDTFPLGRMPGQTEDPAELMLENYDTMYLLDQPVLEQRLEPSTCKTDTLGLSCGVGSGNCSNSSSSNFEGLLWSQGQLHGLKTGLQLF,734,NP_112483.1.csv,refseq-LAS1L-NM_031206.4_clinical_seed_0_final,refseq-LAS1L-NM_031206.4.a2m,Invitae,refseq-LAS1L-NM_031206.4.npy,1,734,734
+NP_112503.1,MVLEMLNPIHYNITSIVPEAMPAATMPVLLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLISHGRFLWMGIGSACNYYNRVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPELWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKYEKSVKDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAEKRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIKEIQTVIGERDIKIDDIQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAKNVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVESIQKIHDLSLHPDETKNMLEMIFTPRNSDRCLEH,503,NP_112503.1.csv,refseq-CYP19A1-NM_031226.2_clinical_seed_0_final,refseq-CYP19A1-NM_031226.2.a2m,Invitae,refseq-CYP19A1-NM_031226.2.npy,1,503,503
+NP_112506.2,MDEKTKKAEEMALSLTRAVAGGDEQVAMKCAIWLAEQRVPLSVQLKPEVSPTQDIRLWVSVEDAQMHTVTIWLTVRPDMTVASLKDMVFLDYGFPPVLQQWVIGQRLARDQETLHSHGVRQNGDSAYLYLLSARNTSLNPQELQRERQLRMLEDLGFKDLTLQPRGPLEPGPPKPGVPQEPGRGQPDAVPEPPPVGWQCPGCTFINKPTRPGCEMCCRARPEAYQVPASYQPDEEERARLAGEEEALRQYQQRKQQQQEGNYLQHVQLDQRSLVLNTEPAECPVCYSVLAPGEAVVLRECLHTFCRECLQGTIRNSQEAEVSCPFIDNTYSCSGKLLEREIKALLTPEDYQRFLDLGISIAENRSAFSYHCKTPDCKGWCFFEDDVNEFTCPVCFHVNCLLCKAIHEQMNCKEYQEDLALRAQNDVAARQTTEMLKVMLQQGEAMRCPQCQIVVQKKDGCDWIRCTVCHTEICWVTKGPRWGPGGPGDTSGGCRCRVNGIPCHPSCQNCH,510,NP_112506.2.csv,refseq-RBCK1-NM_031229.3_clinical_seed_0_final,refseq-RBCK1-NM_031229.3.a2m,Invitae,refseq-RBCK1-NM_031229.3.npy,1,510,510
+NP_112533.1,MEKTLETVPLERKKREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKALSRQEMQEVQSSRSGRGGNFGFGDSRGGGGNFGPGPGSNFRGGSDGYGSGRGFGDGYNGYGGGPGGGNFGGSPGYGGGRGGYGGGGPGYGNQGGGYGGGYDNYGGGNYGSGNYNDFGNYNQQPSNYGPMKSGNFGGSRNMGGPYGGGNYGPGGSGGSGGYGGRSRY,353,NP_112533.1.csv,refseq-HNRNPA2B1-NM_031243.2_clinical_seed_0_final,refseq-HNRNPA2B1-NM_031243.2.a2m,Invitae,refseq-HNRNPA2B1-NM_031243.2.npy,1,353,353
+NP_112740.1,MEVPPRLSHVPPPLFPSAPATLASRSLSHWRPRPPRQLAPLLPSLAPSSARQGARRAQRHVTAQQPSRLAGGAAIKGGRRRRPDLFRRHFKSSSIQRSAAAAAATRTARQHPPADSSVTMEDMNEYSNIEEFAEGSKINASKNQQDDGKMFIGGLSWDTSKKDLTEYLSRFGEVVDCTIKTDPVTGRSRGFGFVLFKDAASVDKVLELKEHKLDGKLIDPKRAKALKGKEPPKKVFVGGLSPDTSEEQIKEYFGAFGEIENIELPMDTKTNERRGFCFITYTDEEPVKKLLESRYHQIGSGKCEIKVAQPKEVYRQQQQQQKGGRGAAAGGRGGTRGRGRGQGQNWNQGFNNYYDQGYGNYNSAYGGDQNYSGYGGYDYTGYNYGNYGYGQGYADYSGQQSTYGKASRGGGNHQNNYQPY,420,NP_112740.1.csv,refseq-HNRNPDL-NM_031372.3_clinical_seed_0_final,refseq-HNRNPDL-NM_031372.3.a2m,Invitae,refseq-HNRNPDL-NM_031372.3.npy,1,420,420
+NP_113606.2,MVHHSGSIQSFKQQKGMNISKSEITKETSLKPSRRSLPCLAQSYAYSKSLSQSTSLFQSTESESQAPTSITLISTDKAEQVNTEENKNDSVLRCSFADLSDFCLALGKDKDYTDESEHATYDRSRLINDFVIKDKSEFKTKLSKNDMNYIASSGPLFKDGKKRIDYILVYRKTNIQYDKRNTFEKNLRAEGLMLEKEPAIASPDIMFIKIHIPWDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKNWMAQNPMVLDKSAFPDLEESDCYTGPFSRARIHHFIINNKDTFFSNATRSRIVYHMLERTKYENGISKVGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQNNRHLLYERWARWGMWYKHQPLDLIRLYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEVFMCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRSILTYTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMISLVITAVFGVVVYRLVVMEQFASFKWNFIKQYWQFATSAAAVCINFIIIMLLNLAYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDRWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKIKRGIHDASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIEIRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPCANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHILAARLAFIIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAELEHLQQQRRKSGQPVHHEWP,981,NP_113606.2.csv,refseq-ANO3-NM_031418.2_clinical_seed_0_final,refseq-ANO3-NM_031418.2.a2m,Invitae,refseq-ANO3-NM_031418.2.npy,1,981,981
+NP_113621.1,MKDFSDVILCMEATESSKTEFCNPAFEPESGPPCPPPVFPEDASYSVPAPWHGRRPRGLRPDCRFSWLCVLLLSSLLLLLLGLLVAIILAQLQAAPPSGASHSPLPAGGLTTTTTTPTITTSQAAGTPKGQQESGVSPSPQSTCGGLLSGPRGFFSSPNYPDPYPPNTHCVWHIQVATDHAIQLKIEALSIESVASCLFDRLELSPEPEGPLLRVCGRVPPPTLNTNASHLLVVFVSDSSVEGFGFHAWYQAMAPGRGSCAHDEFRCDQLICLLPDSVCDGFANCADGSDETNCSAKFSGCGGNLTGLQGTFSTPSYLQQYPHQLLCTWHISVPAGHSIELQFHNFSLEAQDECKFDYVEVYETSSSGAFSLLGRFCGAEPPPHLVSSHHELAVLFRTDHGISSGGFSATYLAFNATENPCGPSELSCQAGGCKGVQWMCDMWRDCTDGSDDNCSGPLFPPPELACEPVQVEMCLGLSYNTTAFPNIWVGMITQEEVVEVLSGYKSLTSLPCYQHFRRLLCGLLVPRCTPLGSVLPPCRSVCQEAEHQCQSGLALLGTPWPFNCNRLPEAADLEACAQP,579,NP_113621.1.csv,refseq-MFRP-NM_031433.3_clinical_seed_0_final,refseq-MFRP-NM_031433.3.a2m,Invitae,refseq-MFRP-NM_031433.3.npy,1,579,579
+NP_113631.1,MEEEGKKGKKPGIVSPFKRVFLKGEKSRDKKAHEKVTERRPLHTVVLSLPERVEPDRLLSDYIEKEVKYLGQLTSIPGYLNPSSRTEILHFIDNAKRAHQLPGHLTQEHDAVLSLSAYNVKLAWRDGEDIILRVPIHDIAAVSYVRDDAAHLVVLKTAQDPGISPSQSLCAESSRGLSAGSLSESAVGPVEACCLVILAAESKVAAEELCCLLGQVFQVVYTESTIDFLDRAIFDGASTPTHHLSLHSDDSSTKVDIKETYEVEASTFCFPESVDVGGASPHSKTISESELSASATELLQDYMLTLRTKLSSQEIQQFAALLHEYRNGASIHEFCINLRQLYGDSRKFLLLGLRPFIPEKDSQHFENFLETIGVKDGRGIITDSFGRHRRALSTTSSSTTNGNRATGSSDDRSAPSEGDEWDRMISDISSDIEALGCSMDQDSA,444,NP_113631.1.csv,refseq-CCM2-NM_031443.3_clinical_seed_0_final,refseq-CCM2-NM_031443.3.a2m,Invitae,refseq-CCM2-NM_031443.3_theta_0.2.npy,1,444,444
+NP_113663.2,MALEQALQAARQGELDVLRSLHAAGLLGPSLRDPLDALPVHHAARAGKLHCLRFLVEEAALPAAARARNGATPAHDASATGHLACLQWLLSQGGCRVQDKDNSGATVLHLAARFGHPEVVNWLLHHGGGDPTAATDMGALPIHYAAAKGDFPSLRLLVEHYPEGVNAQTKNGATPLYLACQEGHLEVTQYLVQECGADPHARAHDGMTPLHAAAQMGHSPVIVWLVSCTDVSLSEQDKDGATAMHFAASRGHTKVLSWLLLHGGEISADLWGGTPLHDAAENGELECCQILVVNGAELDVRDRDGYTAADLSDFNGHSHCTRYLRTVENLSVEHRVLSRDPSAELEAKQPDSGMSSPNTTVSVQPLNFDLSSPTSTLSNYDSCSSSHSSIKGQHPPCGLSSARAADIQSYMDMLNPELGLPRGTIGKPTPPPPPPSFPPPPPPPGTQLPPPPPGYPAPKPPVGPQAADIYMQTKNKLRHVETEALKKELSSCDGHDGLRRQDSSRKPRAFSKQPSTGDYYRQLGRCPGETLAARPGMAHSEEVRARQPARAGCPRLGPAARGSLEGPSAPPQAALLPGNHVPNGCAADPKASRELPPPPPPPPPPLPEAASSPPPAPPLPLESAGPGCGQRRSSSSTGSTKSFNMMSPTGDNSELLAEIKAGKSLKPTPQSKGLTTVFSGIGQPAFQPDSPLPSVSPALSPVRSPTPPAAGFQPLLNGSLVPVPPTTPAPGVQLDVEALIPTHDEQGRPIPEWKRQVMVRKMQLKMQEEEEQRRKEEEEEARLASMPAWRRDLLRKKLEEEREQKRKEEERQKQEELRREKEQSEKLRTLGYDESKLAPWQRQVILKKGDIAKY,854,NP_113663.2.csv,refseq-ESPN-NM_031475.2_clinical_seed_0_final,refseq-ESPN-NM_031475.2.a2m,Invitae,refseq-ESPN-NM_031475.2.npy,1,854,854
+NP_114032.2,MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDDEEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGDQMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGEQQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGKKKAEGGGGGGRPGAPAAGDGKTEQKGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNCDLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKDIDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFANFESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPIPEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNILGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVSAAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLPEVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFNRGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVFPGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWSQHYHQGYY,825,NP_114032.2.csv,refseq-HNRNPU-NM_031844.2_clinical_seed_0_final,refseq-HNRNPU-NM_031844.2.a2m,Invitae,refseq-HNRNPU-NM_031844.2.npy,1,825,825
+NP_114091.4,MLLPVFTLKLRHKISPRMVAIGRYDGTHPCLAAATQTGKVFIHNPHTRNQHVSASRVFQSPLESDVSLLSINQAVSCLTAGVLNPELGYDALLVGTQTNLLAYDVYNNSDLFYREVADGANAIVLGTLGDISSPLAIIGGNCALQGFNHEGSDLFWTVTGDNVNSLALCDFDGDGKKELLVGSEDFDIRVFKEDEIVAEMTETEIVTSLCPMYGSRFGYALSNGTVGVYDKTSRYWRIKSKNHAMSIHAFDLNSDGVNELITGWSNGKVDARSDRTGEVIFKDNFSSAIAGVVEGDYRMDGHIQLICCSVDGEIRGYLPGTAEMRGNLMDTSAEQDLIRELSQKKQNLLLELRNYEENAKAELASPLNEADGHRGIIPANTRLHTTLSVSLGNETQTAHTELRISTSNDTIIRAVLIFAEGIFTGESHVVHPSIHNLSSSICIPIVPPKDVPVDLHLKAFVGYRSSTQFHVFESTRQLPRFSMYALTSLDPASEPISYVNFTIAERAQRVVVWLGQNFLLPEDTHIQNAPFQVCFTSLRNGGHLHIKIKLSGEITINTDDIDLAGDIIQSMASFFAIEDLQVEADFPVYFEELRKVLVKVDEYHSVHQKLSADMADHSNLIRSLLVGAEDARLMRDMKTMKSRYMELYDLNRDLLNGYKIRCNNHTELLGNLKAVNQAIQRAGRLRVGKPKNQVITACRDAIRSNNINTLFKIMRVGTASS,721,NP_114091.4.csv,refseq-BBS2-NM_031885.5_clinical_seed_0_final,refseq-BBS2-NM_031885.5.a2m,Invitae,refseq-BBS2-NM_031885.5.npy,1,721,721
+NP_114148.3,MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTKPKFRSDISEELANVFYEDSDNESFCGFSESEVQDVLDHCGFLQKPRPDVTNELAGIFHADSDDESFCGFSESEIQDGMRLQSVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTDSNSDSEDESGMNFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRRNPERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHGFGNVHAYLKSLKQEFEMQA,450,NP_114148.3.csv,refseq-CDCA7-NM_031942.4_clinical_seed_0_final,refseq-CDCA7-NM_031942.4.a2m,Invitae,refseq-CDCA7-NM_031942.4.npy,1,450,450
+NP_114413.1,MEKYLSLSGNHSSNKRSLEGLSAFRSLEELILDNNQLGDDLVLPGLPRLHTLTLNKNRITDLENLLDHLAEVTPALEYLSLLGNVACPNELVSLEKDEEDYKRYRCFVLYKLPNLKFLDAQKVTRQEREEALVRGVFMKVVKPKASSEDVASSPERHYTPLPSASRELTSHQGVLGKCRYVYYGKNSEGNRFIRDDQL,198,NP_114413.1.csv,refseq-LRMDA-NM_032024.4_clinical_seed_0_final,refseq-LRMDA-NM_032024.4.a2m,Invitae,refseq-LRMDA-NM_032024.4.npy,1,198,198
+NP_114423.1,MSQVGGRGDRCTQEVQGLVHGAGDLSASLAENSPTMSQNGYFEDSSYYKCDTDDTFEAREEILGDEAFDTANSSIVSGESIRFFVNVNLEMQATNTENEATSGGCVLLHTSRKYLKLKNFKEEIRAHRDLDGFLAQASIVLNETATSLDNVLRTMLRRFARDPDNNEPNCNLDLLMAMLFTDAGAPMRGKVHLLSDTIQGVTATVTGVRYQQSWLCIICTMKALQKRHVCISRLVRPQNWGENSCEVRFVILVLAPPKMKSTKTAMEVARTFATMFSDIAFRQKLLETRTEEEFKEALVHQRQLLTMVSHGPVAPRTKERSTVSLPAHRHPEPPKCKDFVPFGKGIREDIARRFPLYPLDFTDGIIGKNKAVGKYITTTLFLYFACLLPTIAFGSLNDENTDGAIDVQKTIAGQSIGGLLYALFSGQPLVILLTTAPLALYIQVIRVICDDYDLDFNSFYAWTGLWNSFFLALYAFFNLSLVMSLFKRSTEEIIALFISITFVLDAVKGTVKIFWKYYYGHYLDDYHTKRTSSLVSLSGLGASLNASLHTALNASFLASPTELPSATHSGQATAVLSLLIMLGTLWLGYTLYQFKKSPYLHPCVREILSDCALPIAVLAFSLISSHGFREIEMSKFRYNPSESPFAMAQIQSLSLRAVSGAMGLGFLLSMLFFIEQNLVAALVNAPENRLVKGTAYHWDLLLLAIINTGLSLFGLPWIHAAYPHSPLHVRALALVEERVENGHIYDTIVNVKETRLTSLGASVLVGLSLLLLPVPLQWIPKPVLYGLFLYIALTSLDGNQLVQRVALLLKEQTAYPPTHYIRRVPQRKIHYFTGLQVLQLLLLCAFGMSSLPYMKMIFPLIMIAMIPIRYILLPRIIEAKYLDVMDAEHRP,891,NP_114423.1.csv,refseq-SLC4A11-NM_032034.3_clinical_seed_0_final,refseq-SLC4A11-NM_032034.3.a2m,Invitae,refseq-SLC4A11-NM_032034.3.npy,1,891,891
+NP_114432.2,MSSMWSEYTIGGVKIYFPYKAYPSQLAMMNSILRGLNSKQHCLLESPTGSGKSLALLCSALAWQQSLSGKPADEGVSEKAEVQLSCCCACHSKDFTNNDMNQGTSRHFNYPSTPPSERNGTSSTCQDSPEKTTLAAKLSAKKQASIYRDENDDFQVEKKRIRPLETTQQIRKRHCFGTEVHNLDAKVDSGKTVKLNSPLEKINSFSPQKPPGHCSRCCCSTKQGNSQESSNTIKKDHTGKSKIPKIYFGTRTHKQIAQITRELRRTAYSGVPMTILSSRDHTCVHPEVVGNFNRNEKCMELLDGKNGKSCYFYHGVHKISDQHTLQTFQGMCKAWDIEELVSLGKKLKACPYYTARELIQDADIIFCPYNYLLDAQIRESMDLNLKEQVVILDEAHNIEDCARESASYSVTEVQLRFARDELDSMVNNNIRKKDHEPLRAVCCSLINWLEANAEYLVERDYESACKIWSGNEMLLTLHKMGITTATFPILQGHFSAVLQKEEKISPIYGKEEAREVPVISASTQIMLKGLFMVLDYLFRQNSRFADDYKIAIQQTYSWTNQIDISDKNGLLVLPKNKKRSRQKTAVHVLNFWCLNPAVAFSDINGKVQTIVLTSGTLSPMKSFSSELGVTFTIQLEANHIIKNSQVWVGTIGSGPKGRNLCATFQNTETFEFQDEVGALLLSVCQTVSQGILCFLPSYKLLEKLKERWLSTGLWHNLELVKTVIVEPQGGEKTNFDELLQVYYDAIKYKGEKDGALLVAVCRGKVSEGLDFSDDNARAVITIGIPFPNVKDLQVELKRQYNDHHSKLRGLLPGRQWYEIQAYRALNQALGRCIRHRNDWGALILVDDRFRNNPSRYISGLSKWVRQQIQHHSTFESALESLAEFSKKHQKVLNVSIKDRTNIQDNESTLEVTSLKYSTSPYLLEAASHLSPENFVEDEAKICVQELQCPKIITKNSPLPSSIISRKEKNDPVFLEEAGKAEKIVISRSTSPTFNKQTKRVSWSSFNSLGQYFTGKIPKATPELGSSENSASSPPRFKTEKMESKTVLPFTDKCESSNLTVNTSFGSCPQSETIISSLKIDATLTRKNHSEHPLCSEEALDPDIELSLVSEEDKQSTSNRDFETEAEDESIYFTPELYDPEDTDEEKNDLAETDRGNRLANNSDCILAKDLFEIRTIKEVDSAREVKAEDCIDTKLNGILHIEESKIDDIDGNVKTTWINELELGKTHEIEIKNFKPSPSKNKGMFPGFK,1249,NP_114432.2.csv,refseq-BRIP1-NM_032043.2_clinical_seed_0_final,refseq-BRIP1-NM_032043.2.a2m,Invitae,refseq-BRIP1-NM_032043.2.npy,1,1249,1249
+NP_115484.2,MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEGADDLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGKQEGECRNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKHANVALFSDGMLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAMEFNYLEKVVVSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHFYFNVLQAVTGVVSLGGRPVVLAVFSTPSNSIPGSAVCAFDLTQVAAVFEGRFREQKSPESIWTPVPEDQVPRPRPGCCAAPGMQYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILRTLMRHQLTRVAVDVGAGPWGNQTVVFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFETYRPDRCGRPGGGETGQRLLSLELDAASGGLLAAFPRCVVRVPVARCQQYSGCMKNCIGSQDPYCGWAPDGSCIFLSPGTRAAFEQDVSGASTSGLGDCTGLLRASLSEDRAGLVSVNLLVTSSVAAFVVGAVVSGFSVGWFVGLRERRELARRKDKEAILAHGAGEAVLSVSRLGERRAQGPGGRGGGGGGGAGVPPEALLAPLMQNGWAKATLLQGGPHDLDSGLLPTPEQTPLPQKRLPTPHPHPHALGPRAWDHGHPLLPASASSSLLLLAPARAPEQPPAPGEPTPDGRLYAARPGRASHGDFPLTPHASPDRRRVVSAPTGPLDPASAADGLPRPWSPPPTGSLRRPLGPHAPPAATLRRTHTFNSGEARPGDRHRGCHARPGTDLAHLLPYGGADRTAPPVP,888,NP_115484.2.csv,refseq-SEMA6B-NM_032108.3_clinical_seed_0_final,refseq-SEMA6B-NM_032108.3.a2m,Invitae,refseq-SEMA6B-NM_032108.3.npy,1,888,888
+NP_115497.4,MRKGKGPICLFSRPTLRPSRSKVSLIEGRGANMAARWRFWCVSVTMVVALLIVCDVPSASAQRKKEMVLSEKVSQLMEWTNKRPVIRMNGDKFRRLVKAPPRNYSVIVMFTALQLHRQCVVCKQADEEFQILANSWRYSSAFTNRIFFAMVDFDEGSDVFQMLNMNSAPTFINFPAKGKPKRGDTYELQVRGFSAEQIARWIADRTDVNIRVIRPPNYAGPLMLGLLLAVIGGLVYLRRSNMEFLFNKTGWAFAALCFVLAMTSGQMWNHIRGPPYAHKNPHTGHVNYIHGSSQAQFVAETHIVLLFNGGVTLGMVLLCEAATSDMDIGKRKIMCVAGIGLVVLFFSWMLSIFRSKYHGYPYSFLMS,367,NP_115497.4.csv,refseq-MAGT1-NM_032121.5_clinical_seed_0_final,refseq-MAGT1-NM_032121.5.a2m,Invitae,refseq-MAGT1-NM_032121.5.npy,1,367,367
+NP_115569.2,MESGDEAAIERHRVHLRSATLRDAVPATLHLLPCEVAVDGPAPVGRFFTPAIRQGPEGLEVSFRGRCLRGEEVAVPPGLVGYVMVTEEKKVSMGKPDPLRDSGTDDQEEEPLERDFDRFIGATANFSRFTLWGLETIPGPDAKVRGALTWPSLAAAIHAQVPED,164,NP_115569.2.csv,refseq-RNASEH2C-NM_032193.3_clinical_seed_0_final,refseq-RNASEH2C-NM_032193.3.a2m,Invitae,refseq-RNASEH2C-NM_032193.3.npy,1,164,164
+NP_115584.1,MATAERRALGIGFQWLSLATLVLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTASVIIALTDGELHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFPVNDGFQALQGIIHSILKKSCIEILAAEPSTICAGESFQVVVRGNGFRHARNVDRVLCSFKINDSVTLNEKPFSVEDTYLLCPAPILKEVGMKAALQVSMNDGLSFISSSVIITTTHCSDGSILAIALLILFLLLALALLWWFWPLCCTVIIKEVPPPPAEESEEEDDDGLPKKKWPTVDASYYGGRGVGGIKRMEVRWGEKGSTEEGAKLEKAKNARVKMPEQEYEFPEPRNLNNNMRRPSSPRKWYSPIKGKLDALWVLLRKGYDRVSVMRPQPGDTGRCINFTRVKNNQPAKYPLNNAYHTSSPPPAPIYTPPPPAPHCPPPPPSAPTPPIPSPPSTLPPPPQAPPPNRAPPPSRPPPRPSV,564,NP_115584.1.csv,refseq-ANTXR1-NM_032208.2_clinical_seed_0_final,refseq-ANTXR1-NM_032208.2.a2m,Invitae,refseq-ANTXR1-NM_032208.2.npy,1,564,564
+NP_115604.1,MVSIPEYYEGKNVLLTGATGFLGKVLLEKLLRSCPKVNSVYVLVRQKAGQTPQERVEEVLSGKLFDRLRDENPDFREKIIAINSELTQPKLALSEEDKEVIIDSTNIIFHCAATVRFNENLRDAVQLNVIATRQLILLAQQMKNLEVFMHVSTAYAYCNRKHIDEVVYPPPVDPKKLIDSLEWMDDGLVNDITPKLIGDRPNTYIYTKALAEYVVQQEGAKLNVAIVRPSIVGASWKEPFPGWIDNFNGPSGLFIAAGKGILRTIRASNNALADLVPVDVVVNMSLAAAWYSGVNRPRNIMVYNCTTGSTNPFHWGEVEYHVISTFKRNPLEQAFRRPNVNLTSNHLLYHYWIAVSHKAPAFLYDIYLRMTGRSPRMMKTITRLHKAMVFLEYFTSNSWVWNTENVNMLMNQLNPEDKKTFNIDVRQLHWAEYIENYCLGTKKYVLNEEMSGLPAARKHLNKLRNIRYGFNTILVILIWRIFIARSQMARNIWYFVVSLCYKFLSYFRASSTMRY,515,NP_115604.1.csv,refseq-FAR1-NM_032228.5_clinical_seed_0_final,refseq-FAR1-NM_032228.5.a2m,Invitae,refseq-FAR1-NM_032228.5.npy,1,515,515
+NP_115618.3,MPLPPRSLQVLLLLLLLLLLLPGMWAEAGLPRAGGGSQPPFRTFSASDWGLTHLVVHEQTGEVYVGAVNRIYKLSGNLTLLRAHVTGPVEDNEKCYPPPSVQSCPHGLGSTDNVNKLLLLDYAANRLLACGSASQGICQFLRLDDLFKLGEPHHRKEHYLSSVQEAGSMAGVLIAGPPGQGQAKLFVGTPIDGKSEYFPTLSSRRLMANEEDADMFGFVYQDEFVSSQLKIPSDTLSKFPAFDIYYVYSFRSEQFVYYLTLQLDTQLTSPDAAGEHFFTSKIVRLCVDDPKFYSYVEFPIGCEQAGVEYRLVQDAYLSRPGRALAHQLGLAEDEDVLFTVFAQGQKNRVKPPKESALCLFTLRAIKEKIKERIQSCYRGEGKLSLPWLLNKELGCINSPLQIDDDFCGQDFNQPLGGTVTIEGTPLFVDKDDGLTAVAAYDYRGRTVVFAGTRSGRIRKILVDLSNPGGRPALAYESVVAQEGSPILRDLVLSPNHQYLYAMTEKQVTRVPVESCVQYTSCELCLGSRDPHCGWCVLHSICSRRDACERADEPQRFAADLLQCVQLTVQPRNVSVTMSQVPLVLQAWNVPDLSAGVNCSFEDFTESESVLEDGRIHCRSPSAREVAPITRGQGDQRVVKLYLKSKETGKKFASVDFVFYNCSVHQSCLSCVNGSFPCHWCKYRHVCTHNVADCAFLEGRVNVSEDCPQILPSTQIYVPVGVVKPITLAARNLPQPQSGQRGYECLFHIPGSPARVTALRFNSSSLQCQNSSYSYEGNDVSDLPVNLSVVWNGNFVIDNPQNIQAHLYKCPALRESCGLCLKADPRFECGWCVAERRCSLRHHCAADTPASWMHARHGSSRCTDPKILKLSPETGPRQGGTRLTITGENLGLRFEDVRLGVRVGKVLCSPVESEYISAEQIVCEIGDASSVRAHDALVEVCVRDCSPHYRALSPKRFTFVTPTFYRVSPSRGPLSGGTWIGIEGSHLNAGSDVAVSVGGRPCSFSWRNSREIRCLTPPGQSPGSAPIIININRAQLTNPEVKYNYTEDPTILRIDPEWSINSGGTLLTVTGTNLATVREPRIRAKYGGIERENGCLVYNDTTMVCRAPSVANPVRSPPELGERPDELGFVMDNVRSLLVLNSTSFLYYPDPVLEPLSPTGLLELKPSSPLILKGRNLLPPAPGNSRLNYTVLIGSTPCTLTVSETQLLCEAPNLTGQHKVTVRAGGFEFSPGTLQVYSDSLLTLPAIVGIGGGGGLLLLVIVAVLIAYKRKSRDADRTLKRLQLQMDNLESRVALECKEAFAELQTDIHELTNDLDGAGIPFLDYRTYAMRVLFPGIEDHPVLKEMEVQANVEKSLTLFGQLLTKKHFLLTFIRTLEAQRSFSMRDRGNVASLIMTALQGEMEYATGVLKQLLSDLIEKNLESKNHPKLLLRRTESVAEKMLTNWFTFLLYKFLKECAGEPLFMLYCAIKQQMEKGPIDAITGEARYSLSEDKLIRQQIDYKTLTLNCVNPENENAPEVPVKGLDCDTVTQAKEKLLDAAYKGVPYSQRPKAADMDLEWRQGRMARIILQDEDVTTKIDNDWKRLNTLAHYQVTDGSSVALVPKQTSAYNISNSSTFTKSLSRYESMLRTASSPDSLRSRTPMITPDLESGTKLWHLVKNHDHLDQREGDRGSKMVSEIYLTRLLATKGTLQKFVDDLFETIFSTAHRGSALPLAIKYMFDFLDEQADKHQIHDADVRHTWKSNCLPLRFWVNVIKNPQFVFDIHKNSITDACLSVVAQTFMDSCSTSEHKLGKDSPSNKLLYAKDIPNYKSWVERYYADIAKMPAISDQDMSAYLAEQSRLHLSQFNSMSALHEIYSYITKYKDEILAALEKDEQARRQRLRSKLEQVVDTMALSS,1896,NP_115618.3.csv,refseq-PLXNA1-NM_032242.3_clinical_seed_0_final,refseq-PLXNA1-NM_032242.3.a2m,Invitae,refseq-PLXNA1-NM_032242.3.npy,1,1896,1896
+NP_115647.2,MSSGKSARYNRFSGGPSNLPTPDVTTGTRMETTFGPAFSAVTTITKADGTSTYKQHCRTPSSSSTLAYSPRDEEDSMPPISTPRRSDSAISVRSLHSESSMSLRSTFSLPEEEEEPEPLVFAEQPSVKLCCQLCCSVFKDPVITTCGHTFCRRCALKSEKCPVDNVKLTVVVNNIAVAEQIGELFIHCRHGCRVAGSGKPPIFEVDPRGCPFTIKLSARKDHEGSCDYRPVRCPNNPSCPPLLRMNLEAHLKECEHIKCPHSKYGCTFIGNQDTYETHLETCRFEGLKEFLQQTDDRFHEMHVALAQKDQEIAFLRSMLGKLSEKIDQLEKSLELKFDVLDENQSKLSEDLMEFRRDASMLNDELSHINARLNMGILGSYDPQQIFKCKGTFVGHQGPVWCLCVYSMGDLLFSGSSDKTIKVWDTCTTYKCQKTLEGHDGIVLALCIQGCKLYSGSADCTIIVWDIQNLQKVNTIRAHDNPVCTLVSSHNVLFSGSLKAIKVWDIVGTELKLKKELTGLNHWVRALVAAQSYLYSGSYQTIKIWDIRTLDCIHVLQTSGGSVYSIAVTNHHIVCGTYENLIHVWDIESKEQVRTLTGHVGTVYALAVISTPDQTKVFSASYDRSLRVWSMDNMICTQTLLRHQGSVTALAVSRGRLFSGAVDSTVKVWTC,670,NP_115647.2.csv,refseq-TRAF7-NM_032271.2_clinical_seed_0_final,refseq-TRAF7-NM_032271.2.a2m,Invitae,refseq-TRAF7-NM_032271.2.npy,1,670,670
+NP_115693.2,MAAMRWRWWQRLLPWRLLQARGFPQNSAPSLGLGARTYSQGDCSYSRTALYDLLGVPSTATQAQIKAAYYRQCFLYHPDRNSGSAEAAERFTRISQAYVVLGSATLRRKYDRGLLSDEDLRGPGVRPSRTPAPDPGSPRTPPPTSRTHDGSRASPGANRTMFNFDAFYQAHYGEQLERERRLRARREALRKRQEYRSMKGLRWEDTRDTAAIFLIFSIFIIIGFYI,226,NP_115693.2.csv,refseq-DNAJC30-NM_032317.2_clinical_seed_0_final,refseq-DNAJC30-NM_032317.2.a2m,Invitae,refseq-DNAJC30-NM_032317.2.npy,1,226,226
+NP_115733.2,MAALDLRAELDSLVLQLLGDLEELEGKRTVLNARVEEGWLSLAKARYAMGAKSVGPLQYASHMEPQVCLHASEAQEGLQKFKVVRAGVHAPEEVGPREAGLRRRKGPTKTPEPESSEAPQDPLNWFGILVPHSLRQAQASFRDGLQLAADIASLQNRIDWGRSQLRGLQEKLKQLEPGAA,180,NP_115733.2.csv,refseq-CCDC115-NM_032357.3_clinical_seed_0_final,refseq-CCDC115-NM_032357.3.a2m,Invitae,refseq-CCDC115-NM_032357.3.npy,1,180,180
+NP_115759.2,MVQLYNLHPFGSQQVVPCKLEPDRFCGGGRDALFVAAGCKVEAFAVAGQELCQPRCAFSTLGRVLRLAYSEAGDYLVAIEEKNKATFLRAYVNWRNKRTENSRVCIRMIGHNVEGPFSKAFRDQMYIIEMPLSEAPLCISCCPVKGDLLVGCTNKLVLFSLKYQIINEEFSLLDFERSLIIHIDNITPVEVSFCVGYVAVMSDLEVLIVKLESGPKNGERVHHHPHKTNNRIRRTEEGISNEISQLESDDFVICQKPLELLGEKSEQSGLSVTLESTGLADEKRKYSHFQHLLYRRFAPDISSYVLSDDIKLHSLQLLPIYQTGSLTSDGKNLSQEKELLSLFCFFSLPHVGYLYMVVKSVELMSVYQYPEKSQQAVLTPQFLHVITSNNLQCFTVRCSAAAAREEDPYMDTTLKACPPVSMDVCALRIQLFIGLKAICHFKNHIILLTKAEPEAIPERRQSPKRLLSRKDTSVKIKIPPVAEAGWNLYIVNTISPVQLYKEMVDYSNTYKTVKTQSCIHLLSEAHLLVRAALMDASQLEPGEKAELLEAFKESCGHLGDCYSRLDSQHSHLTLPYYKMSGLSMAEVLARTDWTVEDGLQKYERGLIFYINHSLYENLDEELNEELAAKVVQMFYVAEPKQVPHILCSPSMKNINPLTAMSYLRKLDTSGFSSILVTLTKAAVALKMGDLDMHRNEMKSHSEMKLVCGFILEPRLLIQQRKGQIVPTELALHLKETQPGLLVASVLGLQKNNKIGIEEADSFFKVLCAKDEDTIPQLLVDFWEAQLVACLPDVVLQELFFKLTSQYIWRLSKRQPPDTTPLRTSEDLINACSHYGLIYPWVHVVISSDSLADKNYTEDLSKLQSLICGPSFDIASIIPFLEPLSEDTIAGLSVHVLCRTRLKEYEQCIDILLERCPEAVIPYANHELKEENRTLWWKKLLPELCQRIKCGGEKYQLYLSSLKETLSIVAVELELKDFMNVLPEDGTATFFLPYLLYCSRKKPLT,1004,NP_115759.2.csv,refseq-HPS3-NM_032383.4_clinical_seed_0_final,refseq-HPS3-NM_032383.4.a2m,Invitae,refseq-HPS3-NM_032383.4.npy,1,1004,1004
+NP_115763.2,MLASPATETTVLMSQTEADLALRPPPPLGTAGQPRLGPPPRRARRFSGKAEPRPRSSRLSRRSSVDLGLLSSWSLPASPAPDPPDPPDSAGPGPARSPPPSSKEPPEGTWTEGAPVKAAEDSARPELPDSAVGPGSREPLRVPEAVALERRREQEEKEDMETQAVATSPDGRYLKFDIEIGRGSFKTVYRGLDTDTTVEVAWCELQTRKLSRAERQRFSEEVEMLKGLQHPNIVRFYDSWKSVLRGQVCIVLVTELMTSGTLKTYLRRFREMKPRVLQRWSRQILRGLHFLHSRVPPILHRDLKCDNVFITGPTGSVKIGDLGLATLKRASFAKSVIGTPEFMAPEMYEEKYDEAVDVYAFGMCMLEMATSEYPYSECQNAAQIYRKVTSGRKPNSFHKVKIPEVKEIIEGCIRTDKNERFTIQDLLAHAFFREERGVHVELAEEDDGEKPGLKLWLRMEDARRGGRPRDNQAIEFLFQLGRDAAEEVAQEMVALGLVCEADYQPVARAVRERVAAIQRKREKLRKARELEALPPEPGPPPATVPMAPGPPSVFPPEPEEPEADQHQPFLFRHASYSSTTSDCETDGYLSSSGFLDASDPALQPPGGVPSSLAESHLCLPSAFALSIPRSGPGSDFSPGDSYASDAASGLSDVGEGMGQMRRPPGRNLRRRPRSRLRVTSVSDQNDRVVECQLQTHNSKMVTFRFDLDGDSPEEIAAAMVYNEFILPSERDGFLRRIREIIQRVETLLKRDTGPMEAAEDTLSPQEEPAPLPALPVPLPDPSNEELQSSTSLEHRSWTAFSTSSSSPGTPLSPGNPFSPGTPISPGPIFPITSPPCHPSPSPFSPISSQVSSNPSPHPTSSPLPFSSSTPEFPVPLSQCPWSSLPTTSPPTFSPTCSQVTLSSPFFPPCPSTSSFPSTTAAPLLSLASAFSLAVMTVAQSLLSPSPGLLSQSPPAPPSPLPSLPLPPPVAPGGQESPSPHTAEVESEASPPPARPLPGEARLAPISEEGKPQLVGRFQVTSSKEPAEPLPLQPTSPTLSGSPKPSTPQLTSESSDTEDSAGGGPETREALAESDRAAEGLGAGVEEEGDDGKEPQVGGSPQPLSHPSPVWMNYSYSSLCLSSEESESSGEDEEFWAELQSLRQKHLSEVETLQTLQKKEIEDLYSRLGKQPPPGIVAPAAMLSSRQRRLSKGSFPTSRRNSLQRSEPPGPGIMRRNSLSGSSTGSQEQRASKGVTFAGDVGRM,1243,NP_115763.2.csv,refseq-WNK4-NM_032387.4_clinical_seed_0_final,refseq-WNK4-NM_032387.4.a2m,Invitae,refseq-WNK4-NM_032387.4.npy,1,1243,1243
+NP_115785.1,MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVGLGLPNRLRFFRQSVAGLAARLQRQFVVRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRRAVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQNLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPLAIKMMWNISAGSSSEAILNTMSQELVPASRVALAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLPSRLHPEGLGHGRTLFLVMKNYPCTLRQYLCVNTPSPRLAAMMLLQLLEGVDHLVQQGIAHRDLKSDNILVELDPDGCPWLVIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVSTARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPPDVRQLVRALLQREASKRPSARVAANVLHLSLWGEHILALKNLKLDKMVGWLLQQSAATLLANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL,581,NP_115785.1.csv,refseq-PINK1-NM_032409.2_clinical_seed_0_final,refseq-PINK1-NM_032409.2.a2m,Invitae,refseq-PINK1-NM_032409.2.npy,1,581,581
+NP_115791.3,MPGGGPEMDDYMETLKDEEDALWENVECNRHMLSRYINPAKLTPYLRQCKVIDEQDEDEVLNAPMLPSKINRAGRLLDILHTKGQRGYVVFLESLEFYYPELYKLVTGKEPTRRFSTIVVEEGHEGLTHFLMNEVIKLQQQMKAKDLQRCELLARLRQLEDEKKQMTLTRVELLTFQERYYKMKEERDSYNDELVKVKDDNYNLAMRYAQLSEEKNMAVMRSRDLQLEIDQLKHRLNKMEEECKLERNQSLKLKNDIENRPKKEQVLELERENEMLKTKNQELQSIIQAGKRSLPDSDKAILDILEHDRKEALEDRQELVNRIYNLQEEARQAEELRDKYLEEKEDLELKCSTLGKDCEMYKHRMNTVMLQLEEVERERDQAFHSRDEAQTQYSQCLIEKDKYRKQIRELEEKNDEMRIEMVRREACIVNLESKLRRLSKDSNNLDQSLPRNLPVTIISQDFGDASPRTNGQEADDSSTSEESPEDSKYFLPYHPPQRRMNLKGIQLQRAKSPISLKRTSDFQAKGHEEEGTDASPSSCGSLPITNSFTKMQPPRSRSSIMSITAEPPGNDSIVRRYKEDAPHRSTVEEDNDSGGFDALDLDDDSHERYSFGPSSIHSSSSSHQSEGLDAYDLEQVNLMFRKFSLERPFRPSVTSVGHVRGPGPSVQHTTLNGDSLTSQLTLLGGNARGSFVHSVKPGSLAEKAGLREGHQLLLLEGCIRGERQSVPLDTCTKEEAHWTIQRCSGPVTLHYKVNHEGYRKLVKDMEDGLITSGDSFYIRLNLNISSQLDACTMSLKCDDVVHVRDTMYQDRHEWLCARVDPFTDHDLDMGTIPSYSRAQQLLLVKLQRLMHRGSREEVDGTHHTLRALRNTLQPEEALSTSDPRVSPRLSRASFLFGQLLQFVSRSENKYKRMNSNERVRIISGSPLGSLARSSLDATKLLTEKQEELDPESELGKNLSLIPYSLVRAFYCERRRPVLFTPTVLAKTLVQRLLNSGGAMEFTICKSDIVTRDEFLRRQKTETIIYSREKNPNAFECIAPANIEAVAAKNKHCLLEAGIGCTRDLIKSNIYPIVLFIRVCEKNIKRFRKLLPRPETEEEFLRVCRLKEKELEALPCLYATVEPDMWGSVEELLRVVKDKIGEEQRKTIWVDEDQL,1154,NP_115791.3.csv,refseq-CARD11-NM_032415.5_clinical_seed_0_final,refseq-CARD11-NM_032415.5.a2m,Invitae,refseq-CARD11-NM_032415.5.npy,1,1154,1154
+NP_115822.1,MVISLNSCLSFICLLLCHWIGTASPLNLEDPNVCSHWESYSVTVQESYPHPFDQIYYTSCTDILNWFKCTRHRVSYRTAYRHGEKTMYRRKSQCCPGFYESGEMCVPHCADKCVHGRCIAPNTCQCEPGWGGTNCSSACDGDHWGPHCTSRCQCKNGALCNPITGACHCAAGFRGWRCEDRCEQGTYGNDCHQRCQCQNGATCDHVTGECRCPPGYTGAFCEDLCPPGKHGPQCEQRCPCQNGGVCHHVTGECSCPSGWMGTVCGQPCPEGRFGKNCSQECQCHNGGTCDAATGQCHCSPGYTGERCQDECPVGTYGVLCAETCQCVNGGKCYHVSGACLCEAGFAGERCEARLCPEGLYGIKCDKRCPCHLENTHSCHPMSGECACKPGWSGLYCNETCSPGFYGEACQQICSCQNGADCDSVTGKCTCAPGFKGIDCSTPCPLGTYGINCSSRCGCKNDAVCSPVDGSCTCKAGWHGVDCSIRCPSGTWGFGCNLTCQCLNGGACNTLDGTCTCAPGWRGEKCELPCQDGTYGLNCAERCDCSHADGCHPTTGHCRCLPGWSGVHCDSVCAEGRWGPNCSLPCYCKNGASCSPDDGICECAPGFRGTTCQRICSPGFYGHRCSQTCPQCVHSSGPCHHITGLCDCLPGFTGALCNEVCPSGRFGKNCAGICTCTNNGTCNPIDRSCQCYPGWIGSDCSQPCPPAHWGPNCIHTCNCHNGAFCSAYDGECKCTPGWTGLYCTQRCPLGFYGKDCALICQCQNGADCDHISGQCTCRTGFMGRHCEQKCPSGTYGYGCRQICDCLNNSTCDHITGTCYCSPGWKGARCDQAGVIIVGNLNSLSRTSTALPADSYQIGAIAGIIILVLVVLFLLALFIIYRHKQKGKESSMPAVTYTPAMRVVNADYTISGTLPHSNGGNANSHYFTNPSYHTLTQCATSPHVNNRDRMTVTKSKNNQLFVNLKNVNPGKRGPVGDCTGTLPADWKHGGYLNELGAFGLDRSYMGKSLKDLGKNSEYNSSNCSLSSSENPYATIKDPPVLIPKSSECGYVEMKSPARRDSPYAEINNSTSANRNVYEVEPTVSVVQGVFSNNGRLSQDPYDLPKNSHIPCHYDLLPVRDSSSSPKQEDSGGSSSNSSSSSE,1140,NP_115822.1.csv,refseq-MEGF10-NM_032446.2_clinical_seed_0_final,refseq-MEGF10-NM_032446.2.a2m,Invitae,refseq-MEGF10-NM_032446.2.npy,1,1140,1140
+NP_115834.1,MSSSVEQKKGPTRQRKCGFCKSNRDKECGQLLISENQKVAAHHKCMLFSSALVSSHSDNESLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREKPSQGIYMVYCRKHKKTAHNSEADLEESFNEHELEPSSPKSKKKSRKGRPRKTNFKGLSEDTRSTSSHGTDEMESSSYRDRSPHRSSPSDTRPKCGFCHVGEEENEARGKLHIFNAKKAAAHYKCMLFSSGTVQLTTTSRAEFGDFDIKTVLQEIKRGKRMKCTLCSQPGATIGCEIKACVKTYHYHCGVQDKAKYIENMSRGIYKLYCKNHSGNDERDEEDEERESKSRGKVEIDQQQLTQQQLNGN,365,NP_115834.1.csv,refseq-PHF6-NM_032458.2_clinical_seed_0_final,refseq-PHF6-NM_032458.2.a2m,Invitae,refseq-PHF6-NM_032458.2.npy,1,365,365
+NP_115881.3,MASRAGPRAAGTDGSDFQHRERVAMHYQMSVTLKYEIKKLIYVHLVIWLLLVAKMSVGHLRLLSHDQVAMPYQWEYPYLLSILPSLLGLLSFPRNNISYLVLSMISMGLFSIAPLIYGSMEMFPAAQQLYRHGKAYRFLFGFSAVSIMYLVLVLAVQVHAWQLYYSKKLLDSWFTSTQEKKHK,183,NP_115881.3.csv,refseq-JAGN1-NM_032492.3_clinical_seed_0_final,refseq-JAGN1-NM_032492.3.a2m,Invitae,refseq-JAGN1-NM_032492.3.npy,1,183,183
+NP_115909.1,MAAGLARLLLLLGLSAGGPAPAGAAKMKVVEEPNAFGVNNPFLPQASRLQAKRDPSPVSGPVHLFRLSGKCFSLVESTYKYEFCPFHNVTQHEQTFRWNAYSGILGIWHEWEIANNTFTGMWMRDGDACRSRSRQSKVELACGKSNRLAHVSEPSTCVYALTFETPLVCHPHALLVYPTLPEALQRQWDQVEQDLADELITPQGHEKLLRTLFEDAGYLKTPEENEPTQLEGGPDSLGFETLENCRKAHKELSKEIKRLKGLLTQHGIPYTRPTETSNLEHLGHETPRAKSPEQLRGDPGLRGSL,305,NP_115909.1.csv,refseq-GNPTG-NM_032520.4_clinical_seed_0_final,refseq-GNPTG-NM_032520.4.a2m,Invitae,refseq-GNPTG-NM_032520.4.npy,1,305,305
+NP_115920.1,MKPFQLDLLFVCFFLFSQELGLQKRGCCLVLGYMAKDKFRRMNEGQVYSFSQQPQDQVVVSGQPVTLLCAIPEYDGFVLWIKDGLALGVGRDLSSYPQYLVVGNHLSGEHHLKILRAELQDDAVYECQAIQAAIRSRPARLTVLVPPDDPVILGGPVISLRAGDPLNLTCHADNAKPAASIIWLRKGEVINGATYSKTLLRDGKRESIVSTLFISPGDVENGQSIVCRATNKAIPGGKETSVTIDIQHPPLVNLSVEPQPVLEDNVVTFHCSAKANPAVTQYRWAKRGQIIKEASGEVYRTTVDYTYFSEPVSCEVTNALGSTNLSRTVDVYFGPRMTTEPQSLLVDLGSDAIFSCAWTGNPSLTIVWMKRGSGVVLSNEKTLTLKSVRQEDAGKYVCRAVVPRVGAGEREVTLTVNGPPIISSTQTQHALHGEKGQIKCFIRSTPPPDRIAWSWKENVLESGTSGRYTVETISTEEGVISTLTISNIVRADFQTIYNCTAWNSFGSDTEIIRLKEQGSEMKSGAGLEAESVPMAVIIGVAVGAGVAFLVLMATIVAFCCARSQRNLKGVVSAKNDIRVEIVHKEPASGREGEEHSTIKQLMMDRGEFQQDSVLKQLEVLKEEEKEFQNLKDPTNGYYSVNTFKEHHSTPTISLSSCQPDLRPAGKQRVPTGMSFTNIYSTLSGQGRLYDYGQRFVLGMGSSSIELCEREFQRGSLSDSSSFLDTQCDSSVSSSGKQDGYVQFDKASKASASSSHHSQSSSQNSDPSRPLQRRMQTHV,778,NP_115920.1.csv,refseq-KIRREL3-NM_032531.3_clinical_seed_0_final,refseq-KIRREL3-NM_032531.3.a2m,Invitae,refseq-KIRREL3-NM_032531.3.npy,1,778,778
+NP_115925.2,MLHLLALFLHCLPLASGDYDICKSWVTTDEGPTWEFYACQPKVMRLKDYVKVKVEPSGITCGDPPERFCSHENPYLCSNECDASNPDLAHPPRLMFDKEEEGLATYWQSITWSRYPSPLEANITLSWNKTVELTDDVVMTFEYGRPTVMVLEKSLDNGRTWQPYQFYAEDCMEAFGMSARRARDMSSSSAHRVLCTEEYSRWAGSKKEKHVRFEVRDRFAIFAGPDLRNMDNLYTRLESAKGLKEFFTLTDLRMRLLRPALGGTYVQRENLYKYFYAISNIEVIGRCKCNLHANLCSMREGSLQCECEHNTTGPDCGKCKKNFRTRSWRAGSYLPLPHGSPNACATAGSFGNCECYGHSNRCSYIDFLNVVTCVSCKHNTRGQHCQHCRLGYYRNGSAELDDENVCIECNCNQIGSVHDRCNETGFCECREGAAGPKCDDCLPTHYWRQGCYPNVCDDDQLLCQNGGTCLQNQRCACPRGYTGVRCEQPRCDPADDDGGLDCDRAPGAAPRPATLLGCLLLLGLAARLGR,530,NP_115925.2.csv,refseq-NTNG2-NM_032536.2_clinical_seed_0_final,refseq-NTNG2-NM_032536.2.a2m,Invitae,refseq-NTNG2-NM_032536.2.npy,1,530,530
+NP_115927.1,MQCLAAALKDETNMSGGGEQADILPANYVVKDRWKVLKKIGGGGFGEIYEAMDLLTRENVALKVESAQQPKQVLKMEVAVLKKLQGKDHVCRFIGCGRNEKFNYVVMQLQGRNLADLRRSQPRGTFTLSTTLRLGKQILESIEAIHSVGFLHRDIKPSNFAMGRLPSTYRKCYMLDFGLARQYTNTTGDVRPPRNVAGFRGTVRYASVNAHKNREMGRHDDLWSLFYMLVEFAVGQLPWRKIKDKEQVGMIKEKYEHRMLLKHMPSEFHLFLDHIASLDYFTKPDYQLIMSVFENSMKERGIAENEAFDWEKAGTDALLSTSTSTPPQQNTRQTAAMFGVVNVTPVPGDLLRENTEDVLQGEHLSDQENAPPILPGRPSEGLGPSPHLVPHPGGPEAEVWEETDVNRNKLRINIGKSPCVEEEQSRGMGVPSSPVRAPPDSPTTPVRSLRYRRVNSPESERLSTADGRVELPERRSRMDLPGSPSRQACSSQPAQMLSVDTGHADRQASGRMDVSASVEQEALSNAFRSVPLAEEEDFDSKEWVIIDKETELKDFPPGAEPSTSGTTDEEPEELRPLPEEGEERRRLGAEPTVRPRGRSMQALAEEDLQHLPPQPLPPQLSQGDGRSETSQPPTPGSPSHSPLHSGPRPRRRESDPTGPQRQVFSVAPPFEVNGLPRAVPLSLPYQDFKRDLSDYRERARLLNRVRRVGFSHMLLTTPQVPLAPVQPQANGKEEEEEEEEDEEEEEEDEEEEEEEEEEEEEEEEEEEEEEEAAAAVALGEVLGPRSGSSSEGSERSTDRSQEGAPSTLLADDQKESRGRASMADGDLEPEEGSKTLVLVSPGDMKKSPVTAELAPDPDLGTLAALTPQHERPQPTGSQLDVSEPGTLSSVLKSEPKPPGPGAGLGAGTVTTGVGGVAVTSSPFTKVERTFVHIAEKTHLNVMSSGGQALRSEEFSAGGELGLELASDGGAVEEGARAPLENGLALSGLNGAEIEGSALSGAPRETPSEMATNSLPNGPALADGPAPVSPLEPSPEKVATISPRRHAMPGSRPRSRIPVLLSEEDTGSEPSGSLSAKERWSKRARPQQDLARLVMEKRQGRLLLRLASGASSSSSEEQRRASETLSGTGSEEDTPASEPAAALPRKSGRAAATRSRIPRPIGLRMPMPVAAQQPASRSHGAAPALDTAITSRLQLQTPPGSATAADLRPKQPPGRGLGPGRAQAGARPPAPRSPRLPASTSAARNASASPRSQSLSRRESPSPSHQARPGVPPPRGVPPARAQPDGTPSPGGSKKGPRGKLQAQRATTKGRAGGAEGRAGAR,1321,NP_115927.1.csv,refseq-TTBK1-NM_032538.2_clinical_seed_0_final,refseq-TTBK1-NM_032538.2.a2m,Invitae,refseq-TTBK1-NM_032538.2.npy,1,1321,1321
+NP_115940.2,MHTVATSGPNASWGAPANASGCPGCGANASDGPVPSPRAVDAWLVPLFFAALMLLGLVGNSLVIYVICRHKPMRTVTNFYIANLAATDVTFLLCCVPFTALLYPLPGWVLGDFMCKFVNYIQQVSVQATCATLTAMSVDRWYVTVFPLRALHRRTPRLALAVSLSIWVGSAAVSAPVLALHRLSPGPRAYCSEAFPSRALERAFALYNLLALYLLPLLATCACYAAMLRHLGRVAVRPAPADSALQGQVLAERAGAVRAKVSRLVAAVVLLFAACWGPIQLFLVLQALGPAGSWHPRSYAAYALKTWAHCMSYSNSALNPLLYAFLGSHFRQAFRRVCPCAPRRPRRPRRPGPSDPAAPHAELLRLGSHPAPARAQKPGSSGLAARGLCVLGEDNAPL,398,NP_115940.2.csv,refseq-KISS1R-NM_032551.4_clinical_seed_0_final,refseq-KISS1R-NM_032551.4.a2m,Invitae,refseq-KISS1R-NM_032551.4.npy,1,398,398
+NP_115969.2,MVTRDRAENRDGPKMLKPLVEKRRRDRINRSLEELRLLLLERTRDQNLRNPKLEKAEILEFAVGYLRERSRVEPPGVPRSPVQDAEALASCYLSGFRECLLRLAAFAHDASPAARAQLFSALHGYLRPKPPRPKPVDPRPPAPRPSLDPAAPALGPALHQRPPVHQGHPSPRCAWSPSLCSPRAGDSGAPAPLTGLLPPPPPPHRQDGAPKAPLPPPPAFWRPWP,225,NP_115969.2.csv,refseq-HES7-NM_032580.3_clinical_seed_0_final,refseq-HES7-NM_032580.3.a2m,Invitae,refseq-HES7-NM_032580.3.npy,1,225,225
+NP_115977.2,MDYKSSLIQDGNPMENLEKQLICPICLEMFTKPVVILPCQHNLCRKCANDIFQAANPYWTSRGSSVSMSGGRFRCPTCRHEVIMDRHGVYGLQRNLLVENIIDIYKQECSSRPLQKGSHPMCKEHEDEKINIYCLTCEVPTCSMCKVFGIHKACEVAPLQSVFQGQKTELNNCISMLVAGNDRVQTIITQLEDSRRVTKENSHQVKEELSQKFDTLYAILDEKKSELLQRITQEQEKKLSFIEALIQQYQEQLDKSTKLVETAIQSLDEPGGATFLLTAKQLIKSIVEASKGCQLGKTEQGFENMDFFTLDLEHIADALRAIDFGTDEEEEEFIEEEDQEEEESTEGKEEGHQ,353,NP_115977.2.csv,refseq-TRIM63-NM_032588.3_clinical_seed_0_final,refseq-TRIM63-NM_032588.3.a2m,Invitae,refseq-TRIM63-NM_032588.3.npy,1,353,353
+NP_115979.3,MAGPQMGGSAEDHPPRKRHAAEKQKKKTVIYTKCFEFESATQRPIDRQRYDENEDLSDVEEIVSVRGFSLEEKLRSQLYQGDFVHAMEGKDFNYEYVQREALRVPLIFREKDGLGIKMPDPDFTVRDVKLLVGSRRLVDVMDVNTQKGTEMSMSQFVRYYETPEAQRDKLYNVISLEFSHTKLEHLVKRPTVVDLVDWVDNMWPQHLKEKQTEATNAIAEMKYPKVKKYCLMSVKGCFTDFHIDFGGTSVWYHVFRGGKIFWLIPPTLHNLALYEEWVLSGKQSDIFLGDRVERCQRIELKQGYTFFIPSGWIHAVYTPVDSLVFGGNILHSFNVPMQLRIYEIEDRTRVQPKFRYPFYYEMCWYVLERYVYCVTQRSHLTQEYQRESMLIDAPRKPSIDGFSSDSWLEMEEEACDQQPQEEEEKDEEGEGRDRAPKPPTDGSTSPTSTPSEDQEALGKKPKAPALRFLKRTLSNESEESVKSTTLAVDYPKTPTGSPATEVSAKWTHLTEFELKGLKALVEKLESLPENKKCVPEGIEDPQALLEGVKNVLKEHADDDPSLAITGVPVVTWPKKTPKNRAVGRPKGKLGPASAVKLAANRTTAGARRRRTRCRKCEACLRTECGECHFCKDMKKFGGPGRMKQSCIMRQCIAPVLPHTAVCLVCGEAGKEDTVEEEEGKFNLMLMECSICNEIIHPGCLKIKESEGVVNDELPNCWECPKCNHAGKTGKQKRGPGFKYASNLPGSLLKEQKMNRDNKEGQEPAKRRSECEEAPRRRSDEHSKKVPPDGLLRRKSDDVHLRKKRKYEKPQELSGRKRASSLQTSPGSSSHLSPRPPLGSSLSPWWRSSLTYFQQQLKPGKEDKLFRKKRRSWKNAEDRMALANKPLRRFKQEPEDELPEAPPKTRESDHSRSSSPTAGPSTEGAEGPEEKKKVKMRRKRRLPNKELSRELSKELNHEIQRTENSLANENQQPIKSEPESEGEEPKRPPGICERPHRFSKGLNGTPRELRHQLGPSLRSPPRVISRPPPSVSPPKCIQMERHVIRPPPISPPPDSLPLDDGAAHVMHREVWMAVFSYLSHQDLCVCMRVCRTWNRWCCDKRLWTRIDLNHCKSITPLMLSGIIRRQPVSLDLSWTNISKKQLSWLINRLPGLRDLVLSGCSWIAVSALCSSSCPLLRTLDVQWVEGLKDAQMRDLLSPPTDNRPGQMDNRSKLRNIVELRLAGLDITDASLRLIIRHMPLLSKLHLSYCNHVTDQSINLLTAVGTTTRDSLTEINLSDCNKVTDQCLSFFKRCGNICHIDLRYCKQVTKEGCEQFIAEMSVSVQFGQVEEKLLQKLS,1336,NP_115979.3.csv,refseq-KDM2B-NM_032590.4_clinical_seed_0_final,refseq-KDM2B-NM_032590.4.a2m,Invitae,refseq-KDM2B-NM_032590.4.npy,1,1336,1336
+NP_115980.1,MEPGDAARPGSGRATGAPPPRLLLLPLLLGWGLRVAAAASASSSGAAAEDSSAMEELATEKEAEESHRQDSVSLLTFILLLTLTILTIWLFKHRRVRFLHETGLAMIYGLIVGVILRYGTPATSGRDKSLSCTQEDRAFSTLLVNVSGKFFEYTLKGEISPGKINSVEQNDMLRKVTFDPEVFFNILLPPIIFHAGYSLKKRHFFRNLGSILAYAFLGTAVSCFIIGNLMYGVVKLMKIMGQLSDKFYYTDCLFFGAIISATDPVTVLAIFNELHADVDLYALLFGESVLNDAVAIVLSSSIVAYQPAGLNTHAFDAAAFFKSVGIFLGIFSGSFTMGAVTGVNANVTKFTKLHCFPLLETALFFLMSWSTFLLAEACGFTGVVAVLFCGITQAHYTYNNLSVESRSRTKQLFEVLHFLAENFIFSYMGLALFTFQKHVFSPIFIIGAFVAIFLGRAAHIYPLSFFLNLGRRHKIGWNFQHMMMFSGLRGAMAFALAIRDTASYARQMMFTTTLLIVFFTVWIIGGGTTPMLSWLNIRVGVEEPSEEDQNEHHWQYFRVGVDPDQDPPPNNDSFQVLQGDGPDSARGNRTKQESAWIFRLWYSFDHNYLKPILTHSGPPLTTTLPAWCGLLARCLTSPQVYDNQEPLREEDSDFILTEGDLTLTYGDSTVTANGSSSSHTASTSLEGSRRTKSSSEEVLERDLGMGDQKVSSRGTRLVFPLEDNA,725,NP_115980.1.csv,NP_115980.1_clinical_seed_0_final,NP_115980.1.a2m,popEVE,NP_115980.1_theta_0.2.npy,1,725,725
+NP_115996.1,MNTDLAAGKMASAACSMDPIDSFELLDLLFDRQDGILRHVELGEGWGHVKDQQVLPNPDSDDFLSSILGSGDSLPSSPLWSPEGSDSGISEDLPSDPQDTPPRSGPATSPAGCHPAQPGKGPCLSYHPGNSCSTTTPGPVIQVPEASVTIDLEMWSPGGRICAEKPADPVDLSPRCNLTVKDLLLSGSSGDLQQHHLGASYLLRPGAGHCQELVLTEDEKKLLAKEGITLPTQLPLTKYEERVLKKIRRKIRNKQSAQESRKKKKEYIDGLETRMSACTAQNQELQRKVLHLEKQNLSLLEQLKKLQAIVVQSTSKSAQTGTCVAVLLLSFALIILPSISPFGPNKTESPGDFAPVRVFSRTLHNDAASRVAADAVPGSEAPGPRPEADTTREESPGSPGADWGFQDTANLTNSTEELDNATLVLRNATEGLGQVALLDWVAPGPSTGSGRAGLEAAGDEL,461,NP_115996.1.csv,refseq-CREB3L3-NM_032607.2_clinical_seed_0_final,refseq-CREB3L3-NM_032607.2.a2m,Invitae,refseq-CREB3L3-NM_032607.2.npy,1,461,461
+NP_116023.2,MQKASVLLFLAWVCFLFYAGIALFTSGFLLTRLELTNHSSCQEPPGPGSLPWGSQGKPGACWMASRFSRVVLVLIDALRFDFAQPQHSHVPREPPVSLPFLGKLSSLQRILEIQPHHARLYRSQVDPPTTTMQRLKALTTGSLPTFIDAGSNFASHAIVEDNLIKQLTSAGRRVVFMGDDTWKDLFPGAFSKAFFFPSFNVRDLDTVDNGILEHLYPTMDSGEWDVLIAHFLGVDHCGHKHGPHHPEMAKKLSQMDQVIQGLVERLENDTLLVVAGDHGMTTNGDHGGDSELEVSAALFLYSPTAVFPSTPPEEPEVIPQVSLVPTLALLLGLPIPFGNIGEVMAELFSGGEDSQPHSSALAQASALHLNAQQVSRFLHTYSAATQDLQAKELHQLQNLFSKASADYQWLLQSPKGAEATLPTVIAELQQFLRGARAMCIESWARFSLVRMAGGTALLAASCFICLLASQWAISPGFPFCPLLLTPVAWGLVGAIAYAGLLGTIELKLDLVLLGAVAAVSSFLPFLWKAWAGWGSKRPLATLFPIPGPVLLLLLFRLAVFFSDSFVVAEARATPFLLGSFILLLVVQLHWEGQLLPPKLLTMPRLGTSATTNPPRHNGAYALRLGIGLLLCTRLAGLFHRCPEETPVCHSSPWLSPLASMVGGRAKNLWYGACVAALVALLAAVRLWLRRYGNLKSPEPPMLFVRWGLPLMALGTAAYWALASGADEAPPRLRVLVSGASMVLPRAVAGLAASGLALLLWKPVTVLVKAGAGAPRTRTVLTPFSGPPTSQADLDYVVPQIYRHMQEEFRGRLERTKSQGPLTVAAYQLGSVYSAAMVTALTLLAFPLLLLHAERISLVFLLLFLQSFLLLHLLAAGIPVTTPGPFTVPWQAVSAWALMATQTFYSTGHQPVFPAIHWHAAFVGFPEGHGSCTWLPALLVGANTFASHLLFAVGCPLLLLWPFLCESQGLRKRQQPPGNEADARVRPEEEEEPLMEMRLRDAPQHFYAALLQLGLKYLFILGIQILACALAASILRRHLMVWKVFAPKFIFEAVGFIVSSVGLLLGIALVMRVDGAVSSWFRQLFLAQQR,1089,NP_116023.2.csv,refseq-PIGO-NM_032634.3_clinical_seed_0_final,refseq-PIGO-NM_032634.3.a2m,Invitae,refseq-PIGO-NM_032634.3.npy,1,1089,1089
+NP_116027.2,MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDEVDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGKAALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLTPTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLTESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPASSFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHPSSSLSFGHPHPSSMVTAMG,480,NP_116027.2.csv,refseq-GATA2-NM_032638.4_clinical_seed_0_final,refseq-GATA2-NM_032638.4.a2m,Invitae,refseq-GATA2-NM_032638.4.npy,1,480,480
+NP_116045.2,MGKLRRRYNIKGRQQAGPGPSKGPPEPPPVQLELEDKDTLKGVDASNALVLPGKKKKKTKAPPLSKKEKKPLTKKEKKVLQKILEQKEKKSQRAEMLQKLSEVQASEAEMRLFYTTSKLGTGNRMYHTKEKADEVVAPGQEKISSLSGAHRKRRRWPSAEEEEEEEEESESELEEESELDEDPAAEPAEAGVGTTVAPLPPAPAPSSQPVPAGMTVPPPPAAAPPLPRALAKPAVFIPVNRSPEMQEERLKLPILSEEQVIMEAVAEHPIVIVCGETGSGKTTQVPQFLYEAGFSSEDSIIGVTEPRRVAAVAMSQRVAKEMNLSQRVVSYQIRYEGNVTEETRIKFMTDGVLLKEIQKDFLLLRYKVVIIDEAHERSVYTDILIGLLSRIVTLRAKRNLPLKLLIMSATLRVEDFTQNPRLFAKPPPVIKVESRQFPVTVHFNKRTPLEDYSGECFRKVCKIHRMLPAGGILVFLTGQAEVHALCRRLRKAFPPSRARPQEKDDDQKDSVEEMRKFKKSRARAKKARAEVLPQINLDHYSVLPAGEGDEDREAEVDEEEGALDSDLDLDLGDGGQDGGEQPDASLPLHVLPLYSLLAPEKQAQVFKPPPEGTRLCVVATNVAETSLTIPGIKYVVDCGKVKKRYYDRVTGVSSFRVTWVSQASADQRAGRAGRTEPGHCYRLYSSAVFGDFEQFPPPEITRRPVEDLILQMKALNVEKVINFPFPTPPSVEALLAAEELLIALGALQPPQKAERVKQLQENRLSCPITALGRTMATFPVAPRYAKMLALSRQHGCLPYAITIVASMTVRELFEELDRPAASDEELTRLKSKRARVAQMKRTWAGQGASLKLGDLMVLLGAVGACEYASCTPQFCEANGLRYKAMMEIRRLRGQLTTAVNAVCPEAELFVDPKMQPPTESQVTYLRQIVTAGLGDHLARRVQSEEMLEDKWRNAYKTPLLDDPVFIHPSSVLFKELPEFVVYQEIVETTKMYMKGVSSVEVQWIPALLPSYCQFDKPLEEPAPTYCPERGRVLCHRASVFYRVGWPLPAIEVDFPEGIDRYKHFARFLLEGQVFRKLASYRSCLLSSPGTMLKTWARLQPRTESLLRALVAEKADCHEALLAAWKKNPKYLLAEYCEWLPQAMHPDIEKAWPPTTVH,1157,NP_116045.2.csv,refseq-DHX37-NM_032656.3_clinical_seed_0_final,refseq-DHX37-NM_032656.3.a2m,Invitae,refseq-DHX37-NM_032656.3.npy,1,1157,1157
+NP_116056.3,MVNDPPVPALLWAQEVGQVLAGRARRLLLQFGVLFCTILLLLWVSVFLYGSFYYSYMPTVSHLSPVHFYYRTDCDSSTTSLCSFPVANVSLTKGGRDRVLMYGQPYRVTLELELPESPVNQDLGMFLVTISCYTRGGRIISTSSRSVMLHYRSDLLQMLDTLVFSSLLLFGFAEQKQLLEVELYADYRENSYVPTTGAIIEIHSKRIQLYGAYLRIHAHFTGLRYLLYNFPMTCAFIGVASNFTFLSVIVLFSYMQWVWGGIWPRHRFSLQVNIRKRDNSRKEVQRRISAHQPGPEGQEESTPQSDVTEDGESPEDPSGTEGQLSEEEKPDQQPLSGEEELEPEASDGSGSWEDAALLTEANLPAPAPASASAPVLETLGSSEPAGGALRQRPTCSSS,398,NP_116056.3.csv,refseq-BSCL2-NM_032667.6_clinical_seed_0_final,refseq-BSCL2-NM_032667.6.a2m,Invitae,refseq-BSCL2-NM_032667.6.npy,1,398,398
+NP_116071.2,MMQESGTETKSNGSAIQNGSGGSNHLLECGGLREGRSNGETPAVDIGAADLAHAQQQQQQALQVARQLLLQQQQQQQVSGLKSPKRNDKQPALQVPVSVAMMTPQVITPQQMQQILQQQVLSPQQLQVLLQQQQALMLQQQQLQEFYKKQQEQLQLQLLQQQHAGKQPKEQQQVATQQLAFQQQLLQMQQLQQQHLLSLQRQGLLTIQPGQPALPLQPLAQGMIPTELQQLWKEVTSAHTAEETTGNNHSSLDLTTTCVSSSAPSKTSLIMNPHASTNGQLSVHTPKRESLSHEEHPHSHPLYGHGVCKWPGCEAVCEDFQSFLKHLNSEHALDDRSTAQCRVQMQVVQQLELQLAKDKERLQAMMTHLHVKSTEPKAAPQPLNLVSSVTLSKSASEASPQSLPHTPTTPTAPLTPVTQGPSVITTTSMHTVGPIRRRYSDKYNVPISSADIAQNQEFYKNAEVRPPFTYASLIRQAILESPEKQLTLNEIYNWFTRMFAYFRRNAATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEVEFQKRRPQKISGNPSLIKNMQSSHAYCTPLNAALQASMAENSIPLYTTASMGNPTLGNLASAIREELNGAMEHTNSNESDSSPGRSPMQAVHPVHVKEEPLDPEEAEGPLSLVTTANHSPDFDHDRDYEDEPVNEDME,677,NP_116071.2.csv,refseq-FOXP1-NM_032682.5_clinical_seed_0_final,refseq-FOXP1-NM_032682.5.a2m,Invitae,refseq-FOXP1-NM_032682.5.npy,1,677,677
+NP_116119.2,MEFLKTCVLRRNACTAVCFWRSKVVQKPSVRRISTTSPRSTVMPAWVIDKYGKNEVLRFTQNMMMPIIHYPNEVIVKVHAASVNPIDVNMRSGYGATALNMKRDPLHVKIKGEEFPLTLGRDVSGVVMECGLDVKYFKPGDEVWAAVPPWKQGTLSEFVVVSGNEVSHKPKSLTHTQAASLPYVALTAWSAINKVGGLNDKNCTGKRVLILGASGGVGTFAIQVMKAWDAHVTAVCSQDASELVRKLGADDVIDYKSGSVEEQLKSLKPFDFILDNVGGSTETWAPDFLKKWSGATYVTLVTPFLLNMDRLGIADGMLQTGVTVGSKALKHFWKGVHYRWAFFMASGPCLDDIAELVDAGKIRPVIEQTFPFSKVPEAFLKVERGHARGKTVINVV,396,NP_116119.2.csv,refseq-RTN4IP1-NM_032730.4_clinical_seed_0_final,refseq-RTN4IP1-NM_032730.4.a2m,Invitae,refseq-RTN4IP1-NM_032730.4.npy,1,396,396
+NP_116145.1,MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVDGWRQLALRSGDAVFLVNEGAGSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAATYAVVSSPAGILSLTLLERAGYRGPFLPGFRPVSSAPGPGWVSRVDHLTLACTPGSSPTLLRWFHDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVLAESLPGATTRQDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQPGKERQIRAAGHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGATGFGQGNIRALWQSVQEQSARSQEA,371,NP_116145.1.csv,refseq-HPDL-NM_032756.2_clinical_seed_0_final,refseq-HPDL-NM_032756.2.a2m,Invitae,refseq-HPDL-NM_032756.2.npy,1,371,371
+NP_116190.3,MALRRPPRLRLCARLPDFFLLLLFRGCLIGAVNLKSSNRTPVVQEFESVELSCIITDSQTSDPRIEWKKIQDEQTTYVFFDNKIQGDLAGRAEILGKTSLKIWNVTRRDSALYRCEVVARNDRKEIDEIVIELTVQVKPVTPVCRVPKAVPVGKMATLHCQESEGHPRPHYSWYRNDVPLPTDSRANPRFRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASNDAGSARCEEQEMEVYDLNIGGIIGGVLVVLAVLALITLGICCAYRRGYFINNKQDGESYKNPGKPDGVNYIRTDEEGDFRHKSSFVI,310,NP_116190.3.csv,refseq-JAM3-NM_032801.4_clinical_seed_0_final,refseq-JAM3-NM_032801.4.a2m,Invitae,refseq-JAM3-NM_032801.4.npy,1,310,310
+NP_116195.2,MHLSAVFNALLVSVLAAVLWKHVRLREHAATLEEELALSRQATEPAPALRIDYPKALQILMEGGTHMVCTGRTHTDRICRFKWLCYSNEAEEFIFFHGNTSVMLPNLGSRRFQPALLDLSTVEDHNTQYFNFVELPAAALRFMPKPVFVPDVALIANRFNPDNLMHVFHDDLLPLFYTLRQFPGLAHEARLFFMEGWGEGAHFDLYKLLSPKQPLLRAQLKTLGRLLCFSHAFVGLSKITTWYQYGFVQPQGPKANILVSGNEIRQFARFMTEKLNVSHTGVPLGEEYILVFSRTQNRLILNEAELLLALAQEFQMKTVTVSLEDHTFADVVRLVSNASMLVSMHGAQLVTTLFLPRGATVVELFPYAVNPDHYTPYKTLAMLPGMDLQYVAWRNMMPENTVTHPERPWDQGGITHLDRAEQARILQSREVPRHLCCRNPEWLFRIYQDTKVDIPSLIQTIRRVVKGRPGPRKQKWTVGLYPGKVREARCQASVHGASEARLTVSWQIPWNLKYLKVREVKYEVWLQEQGENTYVPYILALQNHTFTENIKPFTTYLVWVRCIFNKILLGPFADVLVCNT,580,NP_116195.2.csv,refseq-POMGNT2-NM_032806.5_clinical_seed_0_final,refseq-POMGNT2-NM_032806.5.a2m,Invitae,refseq-POMGNT2-NM_032806.5.npy,1,580,580
+NP_116222.4,MEPGTGGSRKRLGPRAGFRFWPPFFPRRSQAGSSKFPTPLGPENSGNPTLLSSAQPETRVSYWTKLLSQLLAPLPGLLQKVLIWSQLFGGMFPTRWLDFAGVYSALRALKGREKPAAPTAQKSLSSLQLDSSDPSVTSPLDWLEEGIHWQYSPPDLKLELKAKGSALDPAAQAFLLEQQLWGVELLPSSLQSRLYSNRELGSSPSGPLNIQRIDNFSVVSYLLNPSYLDCFPRLEVSYQNSDGNSEVVGFQTLTPESSCLREDHCHPQPLSAELIPASWQGCPPLSTEGLPEIHHLRMKRLEFLQQASKGQDLPTPDQDNGYHSLEEEHSLLRMDPKHCRDNPTQFVPAAGDIPGNTQESTEEKIELLTTEVPLALEEESPSEGCPSSEIPMEKEPGEGRISVVDYSYLEGDLPISARPACSNKLIDYILGGASSDLETSSDPEGEDWDEEAEDDGFDSDSSLSDSDLEQDPEGLHLWNSFCSVDPYNPQNFTATIQTAARIVPEEPSDSEKDLSGKSDLENSSQSGSLPETPEHSSGEEDDWESSADEAESLKLWNSFCNSDDPYNPLNFKAPFQTSGENEKGCRDSKTPSESIVAISECHTLLSCKVQLLGSQESECPDSVQRDVLSGGRHTHVKRKKVTFLEEVTEYYISGDEDRKGPWEEFARDGCRFQKRIQETEDAIGYCLTFEHRERMFNRLQGTCFKGLNVLKQC,713,NP_116222.4.csv,NP_116222.4_colabfold_clinical_seed_0_final,NP_116222.4_colabfold.a2m,colabfold,NP_116222.4_colabfold_theta_0.2.npy,1,713,713
+NP_116245.2,MDPGDDWLVESLRLYQDFYAFDLSGATRVLEWIDDKGVFVAGYESLKKNEILHLKLPLRLSVKENKGLFPERDFKVRHGGFSDRSIFDLKHVPHTRLLVTSGLPGCYLQVWQVAEDSDVIKAVSTIAVHEKEESLWPRVAVFSTLAPGVLHGARLRSLQVVDLESRKTTYTSDVSDSEELSSLQVLDADTFAFCCASGRLGLVDTRQKWAPLENRSPGPGSGGERWCAEVGSWGQGPGPSIASLGSDGRLCLLDPRDLCHPVSSVQCPVSVPSPDPELLRVTWAPGLKNCLAISGFDGTVQVYDATSWDGTRSQDGTRSQVEPLFTHRGHIFLDGNGMDPAPLVTTHTWHPCRPRTLLSATNDASLHVWDWVDLCAPR,378,NP_116245.2.csv,refseq-WDR73-NM_032856.3_clinical_seed_0_final,refseq-WDR73-NM_032856.3.a2m,Invitae,refseq-WDR73-NM_032856.3.npy,1,378,378
+NP_116250.3,MSLAAYCVICCRRIGTSTSPPKSGTHWRDIRNIIKFTGSLILGGSLFLTYEVLALKKAVTLDTQVVEREKMKSYIYVHTVSLDKGENHGIAWQARKELHKAVRKVLATSAKILRNPFADPFSTVDIEDHECAVWLLLRKSKSDDKTTRLEAVREMSETHHWHDYQYRIIAQACDPKTLIGLARSEESDLRFFLLPPPLPSLKEDSSTEEELRQLLASLPQTELDECIQYFTSLALSESSQSLAAQKGGLWCFGGNGLPYAESFGEVPSATVEMFCLEAIVKHSEISTHCDKIEANGGLQLLQRLYRLHKDCPKVQRNIMRVIGNMALNEHLHSSIVRSGWVSIMAEAMKSPHIMESSHAARILANLDRETVQEKYQDGVYVLHPQYRTSQPIKADVLFIHGLMGAAFKTWRQQDSEQAVIEKPMEDEDRYTTCWPKTWLAKDCPALRIISVEYDTSLSDWRARCPMERKSIAFRSNELLRKLRAAGVGDRPVVWISHSMGGLLVKKMLLEASTKPEMSTVINNTRGIIFYSVPHHGSRLAEYSVNIRYLLFPSLEVKELSKDSPALKTLQDDFLEFAKDKNFQVLNFVETLPTYIGSMIKLHVVPVESADLGIGDLIPVDVNHLNICKPKKKDAFLYQRTLQFIREALAKDLEN,654,NP_116250.3.csv,refseq-SERAC1-NM_032861.3_clinical_seed_0_final,refseq-SERAC1-NM_032861.3.a2m,Invitae,refseq-SERAC1-NM_032861.3.npy,1,654,654
+NP_116277.2,MGAGSARGARGTAAAAAARGGGFLFSWILVSFACHLASTQGAPEDVDILQRLGLSWTKAGSPAPPGVIPFQSGFIFTQRARLQAPTGTVIPAALGTELALVLSLCSHRVNHAFLFAVRSQKRKLQLGLQFLPGKTVVHLGSRRSVAFDLDMHDGRWHHLALELRGRTVTLVTACGQRRVPVLLPFHRDPALDPGGSFLFGKMNPHAVQFEGALCQFSIYPVTQVAHNYCTHLRKQCGQADTYQSPLGPLFSQDSGRPFTFQSDLALLGLENLTTATPALGSLPAGRGPRGTVAPATPTKPQRTSPTNPHQHMAVGGPAQTPLLPAKLSASNALDPMLPASVGGSTRTPRPAAAQPSQKITATKIPKSLPTKPSAPSTSIVPIKSPHPTQKTAPSSFTKSALPTQKQVPPTSRPVPARVSRPAEKPIQRNPGMPRPPPPSTRPLPPTTSSSKKPIPTLARTEAKITSHASKPASARTSTHKPPPFTALSSSPAPTPGSTRSTRPPATMVPPTSGTSTPRTAPAVPTPGSAPTGSKKPIGSEASKKAGPKSSPRKPVPLRPGKAARDVPLSDLTTRPSPRQPQPSQQTTPALVLAPAQFLSSSPRPTSSGYSIFHLAGSTPFPLLMGPPGPKGDCGLPGPPGLPGLPGIPGARGPRGPPGPYGNPGLPGPPGAKGQKGDPGLSPGKAHDGAKGDMGLPGLSGNPGPPGRKGHKGYPGPAGHPGEQGQPGPEGSPGAKGYPGRQGLPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGKMGMPGFPGVFGERGPPGLDGNPGELGLPGPPGVPGLIGDLGVLGPIGYPGPKGMKGLMGSVGEPGLKGDKGEQGVPGVSGDPGFQGDKGSQGLPGFPGARGKPGPLGKVGDKGSIGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPRGQLGPEGDEGPMGPPGAPGLEGQPGRKGFPGRPGLDGVKGEPGDPGRPGPVGEQGFMGFIGLVGEPGIVGEKGDRGMMGPPGVPGPKGSMGHPGMPGGMGTPGEPGPQGPPGSRGPPGMRGAKGRRGPRGPDGPAGEQGSRGLKGPPGPQGRPGRPGQQGVAGERGHLGSRGFPGIPGPSGPPGTKGLPGEPGPQGPQGPIGPPGEMGPKGPPGAVGEPGLPGEAGMKGDLGPLGTPGEQGLIGQRGEPGLEGDSGPMGPDGLKGDRGDPGPDGEHGEKGQEGLMGEDGPPGPPGVTGVRGPEGKSGKQGEKGRTGAKGAKGYQGQLGEMGVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGAQGEPGLAGYDGHKGIVGPLGPPGPKGEKGEQGEDGKAEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGKPGQQGQPGHPGPRGWPGPKGSKGAEGPKGKQGKAGAPGRRGVQGLQGLPGPRGVVGRQGLEGIAGPDGLPGRDGQAGQQGEQGDDGDPGPMGPAGKRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGRTGLPGNQGEPGSKGQPGDSGEMGFPGMAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIVGPLGILGPSGLPGPKGDKGSRGDWGLQGPRGPPGPRGRPGPPGPPGGPIQLQQDDLGAAFQTWMDTSGALRPESYSYPDRLVLDQGGEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQKMVDGTYWVDPNLGCSSDTIEVSCNFTHGGQTCLKPITASKVEFAISRVQMNFLHLLSSEVTQHITIHCLNMTVWQEGTGQTPAKQAVRFRAWNGQIFEAGGQFRPEVSMDGCKVQDGRWHQTLFTFRTQDPQQLPIISVDNLPPASSGKQYRLEVGPACFL,1860,NP_116277.2.csv,refseq-COL27A1-NM_032888.3_clinical_seed_0_final,refseq-COL27A1-NM_032888.3.a2m,Invitae,refseq-COL27A1-NM_032888.3_theta_0.2.npy,1,1860,1860
+NP_116281.2,MLSRLMSGSSRSLEREYSCTVRLLDDSEYTCTIQRDAKGQYLFDLLCHHLNLLEKDYFGIRFVDPDKQRHWLEFTKSVVKQLRSQPPFTMCFRVKFYPADPAALKEEITRYLVFLQIKRDLYHGRLLCKTSDAALLAAYILQAEIGDYDSGKHPEGYSSKFQFFPKHSEKLERKIAEIHKTELSGQTPATSELNFLRKAQTLETYGVDPHPCKDVSGNAAFLAFTPFGFVVLQGNKRVHFIKWNEVTKLKFEGKTFYLYVSQKEEKKIILTYFAPTPEACKHLWKCGIENQAFYKLEKSSQVRTVSSSNLFFKGSRFRYSGRVAKEVMESSAKIKREPPEIHRAGMVPSRSCPSITHGPRLSSVPRTRRRAVHISIMEGLESLRDSAHSTPVRSTSHGDTFLPHVRSSRTDSNERVAVIADEAYSPADSVLPTPVAEHSLELMLLSRQINGATCSIEEEKESEASTPTATEVEALGGELRALCQGHSGPEEEQVNKFVLSVLRLLLVTMGLLFVLLLLLIILTESDLDIAFFRDIRQTPEFEQFHYQYFCPLRRWFACKIRSVVSLLIDT,570,NP_116281.2.csv,refseq-FRMD5-NM_032892.4_clinical_seed_0_final,refseq-FRMD5-NM_032892.4.a2m,Invitae,refseq-FRMD5-NM_032892.4.npy,1,570,570
+NP_116575.3,MPKIVLNGVTVDFPFQPYKCQQEYMTKVLECLQQKVNGILESPTGTGKTLCLLCTTLAWREHLRDGISARKIAERAQGELFPDRALSSWGNAAAAAGDPIACYTDIPKIIYASRTHSQLTQVINELRNTSYRSRCRATLWVLETAPPRPTVLSPTRPKVCVLGSREQLCIHPEVKKQESNHLQIHLCRKKVASRSCHFYNNVEEKSLEQELASPILDIEDLVKSGSKHRVCPYYLSRNLKQQADIIFMPYNYLLDAKSRRAHNIDLKGTVVIFDEAHNVEKMCEESASFDLTPHDLASGLDVIDQVLEEQTKAAQQGEPHPEFSADSPSPGLNMELEDIAKLKMILLRLEGAIDAVELPGDDSGVTKPGSYIFELFAEAQITFQTKGCILDSLDQIIQHLAGRAGVFTNTAGLQKLADIIQIVFSVDPSEGSPGSPAGLGALQSYKVHIHPDAGHRRTAQRSDAWSTTAARKRGKVLSYWCFSPGHSMHELVRQGVRSLILTSGTLAPVSSFALEMQIPFPVCLENPHIIDKHQIWVGVVPRGPDGAQLSSAFDRRFSEECLSSLGKALGNIARVVPYGLLIFFPSYPVMEKSLEFWRARDLARKMEALKPLFVEPRSKGSFSETISAYYARVAAPGSTGATFLAVCRGKASEGLDFSDTNGRGVIVTGLPYPPRMDPRVVLKMQFLDEMKGQGGAGGQFLSGQEWYRQQASRAVNQAIGRVIRHRQDYGAVFLCDHRFAFADARAQLPSWVRPHVRVYDNFGHVIRDVAQFFRVAERTMPAPAPRATAPSVRGEDAVSEAKSPGPFFSTRKAKSLDLHVPSLKQRSSGSPAAGDPESSLCVEYEQEPVPARQRPRGLLAALEHSEQRAGSPGEEQAHSCSTLSLLSEKRPAEEPRGGRKKIRLVSHPEEPVAGAQTDRAKLFMVAVKQELSQANFATFTQALQDYKGSDDFAALAACLGPLFAEDPKKHNLLQGFYQFVRPHHKQQFEEVCIQLTGRGCGYRPEHSIPRRQRAQPVLDPTGRTAPDPKLTVSTAAAQQLDPQEHLNQGRPHLSPRPPPTGDPGSQPQWGSGVPRAGKQGQHAVSAYLADARRALGSAGCSQLLAALTAYKQDDDLDKVLAVLAALTTAKPEDFPLLHRFSMFVRPHHKQRFSQTCTDLTGRPYPGMEPPGPQEERLAVPPVLTHRAPQPGPSRSEKTGKTQSKISSFLRQRPAGTVGAGGEDAGPSQSSGPPHGPAASEWGL,1243,NP_116575.3.csv,refseq-RTEL1-NM_032957.4_clinical_seed_0_final,refseq-RTEL1-NM_032957.4.a2m,Invitae,refseq-RTEL1-NM_032957.4.npy,1,1243,1243
+NP_127497.1,MAGGAWGRLACYLEFLKKEELKEFQLLLANKAHSRSSSGETPAQPEKTSGMEVASYLVAQYGEQRAWDLALHTWEQMGLRSLCAQAQEGAGHSPSFPYSPSEPHLGSPSQPTSTAVLMPWIHELPAGCTQGSERRVLRQLPDTSGRRWREISASLLYQALPSSPDHESPSQESPNAPTSTAVLGSWGSPPQPSLAPREQEAPGTQWPLDETSGIYYTEIREREREKSEKGRPPWAAVVGTPPQAHTSLQPHHHPWEPSVRESLCSTWPWKNEDFNQKFTQLLLLQRPHPRSQDPLVKRSWPDYVEENRGHLIEIRDLFGPGLDTQEPRIVILQGAAGIGKSTLARQVKEAWGRGQLYGDRFQHVFYFSCRELAQSKVVSLAELIGKDGTATPAPIRQILSRPERLLFILDGVDEPGWVLQEPSSELCLHWSQPQPADALLGSLLGKTILPEASFLITARTTALQNLIPSLEQARWVEVLGFSESSRKEYFYRYFTDERQAIRAFRLVKSNKELWALCLVPWVSWLACTCLMQQMKRKEKLTLTSKTTTTLCLHYLAQALQAQPLGPQLRDLCSLAAEGIWQKKTLFSPDDLRKHGLDGAIISTFLKMGILQEHPIPLSYSFIHLCFQEFFAAMSYVLEDEKGRGKHSNCIIDLEKTLEAYGIHGLFGASTTRFLLGLLSDEGEREMENIFHCRLSQGRNLMQWVPSLQLLLQPHSLESLHCLYETRNKTFLTQVMAHFEEMGMCVETDMELLVCTFCIKFSRHVKKLQLIEGRQHRSTWSPTMVVLFRWVPVTDAYWQILFSVLKVTRNLKELDLSGNSLSHSAVKSLCKTLRRPRCLLETLRLAGCGLTAEDCKDLAFGLRANQTLTELDLSFNVLTDAGAKHLCQRLRQPSCKLQRLQLVSCGLTSDCCQDLASVLSASPSLKELDLQQNNLDDVGVRLLCEGLRHPACKLIRLGLDQTTLSDEMRQELRALEQEKPQLLIFSRRKPSVMTPTEGLDTGEMSNSTSSLKRQRLGSERAASHVAQANLKLLDVSKIFPIAEIAEESSPEVVPVELLCVPSPASQGDLHTKPLGTDDDFWGPTGPVATEVVDKEKNLYRVHFPVAGSYRWPNTGLCFVMREAVTVEIEFCVWDQFLGEINPQHSWMVAGPLLDIKAEPGAVEAVHLPHFVALQGGHVDTSLFQMAHFKEEGMLLEKPARVELHHIVLENPSFSPLGVLLKMIHNALRFIPVTSVVLLYHRVHPEEVTFHLYLIPSDCSIRKAIDDLEMKFQFVRIHKPPPLTPLYMGCRYTVSGSGSGMLEILPKELELCYRSPGEDQLFSEFYVGHLGSGIRLQVKDKKDETLVWEALVKPGDLMPATTLIPPARIAVPSPLDAPQLLHFVDQYREQLIARVTSVEVVLDKLHGQVLSQEQYERVLAENTRPSQMRKLFSLSQSWDRKCKDGLYQALKETHPHLIMELWEKGSKKGLLPLSS,1473,NP_127497.1.csv,refseq-NLRP1-NM_033004.3_clinical_seed_0_final,refseq-NLRP1-NM_033004.3.a2m,Invitae,refseq-NLRP1-NM_033004.3.npy,1,1473,1473
+NP_148982.1,MNDTVTIRTRKFMTNRLLQRKQMVIDVLHPGKATVPKTEIREKLAKMYKTTPDVIFVFGFRTHFGGGKTTGFGMIYDSLDYAKKNEPKHRLARHGLYEKKKTSRKQRKERKNRMKKVRGTAKANVGAGKK,130,NP_148982.1.csv,refseq-RPS24-NM_033022.3_clinical_seed_0_final,refseq-RPS24-NM_033022.3.a2m,Invitae,refseq-RPS24-NM_033022.3.npy,1,130,130
+NP_149045.3,MFRQFYLWTCLASGIILGSLFEICLGQYDDDCKLARGGPPATIVAIDEESRNGTILVDNMLIKGTAGGPDPTIELSLKDNVDYWVLMDPVKQMLFLNSTGRVLDRDPPMNIHSIVVQVQCINKKVGTIIYHEVRIVVRDRNDNSPTFKHESYYATVNELTPVGTTIFTGFSGDNGATDIDDGPNGQIEYVIQYNPDDPTSNDTFEIPLMLTGNIVLRKRLNYEDKTRYFVIIQANDRAQNLNERRTTTTTLTVDVLDGDDLGPMFLPCVLVPNTRDCRPLTYQAAIPELRTPEELNPIIVTPPIQAIDQDRNIQPPSDRPGILYSILVGTPEDYPRFFHMHPRTAELSLLEPVNRDFHQKFDLVIKAEQDNGHPLPAFAGLHIEILDENNQSPYFTMPSYQGYILESAPVGATISDSLNLTSPLRIVALDKDIEDTKDPELHLFLNDYTSVFTVTQTGITRYLTLLQPVDREEQQTYTFSITAFDGVQESEPVIVNIQVMDANDNTPTFPEISYDVYVYTDMRPGDSVIQLTAVDADEGSNGEITYEILVGAQGDFIINKTTGLITIAPGVEMIVGRTYALTVQAADNAPPAERRNSICTVYIEVLPPNNQSPPRFPQLMYSLEISEAMRVGAVLLNLQATDREGDSITYAIENGDPQRVFNLSETTGILTLGKALDRESTDRYILIITASDGRPDGTSTATVNIVVTDVNDNAPVFDPYLPRNLSVVEEEANAFVGQVKATDPDAGINGQVHYSLGNFNNLFRITSNGSIYTAVKLNREVRDYYELVVVATDGAVHPRHSTLTLAIKVLDIDDNSPVFTNSTYTVLVEENLPAGTTILQIEAKDVDLGANVSYRIRSPEVKHFFALHPFTGELSLLRSLDYEAFPDQEASITFLVEAFDIYGTMPPGIATVTVIVKDMNDYPPVFSKRIYKGMVAPDAVKGTPITTVYAEDADPPGLPASRVRYRVDDVQFPYPASIFEVEEDSGRVITRVNLNEEPTTIFKLVVVAFDDGEPVMSSSATVKILVLHPGEIPRFTQEEYRPPPVSELATKGTMVGVISAAAINQSIVYSIVSGNEEDTFGINNITGVIYVNGPLDYETRTSYVLRVQADSLEVVLANLRVPSKSNTAKVYIEIQDENNHPPVFQKKFYIGGVSEDARMFTSVLRVKATDKDTGNYSVMAYRLIIPPIKEGKEGFVVETYTGLIKTAMLFHNMRRSYFKFQVIATDDYGKGLSGKADVLVSVVNQLDMQVIVSNVPPTLVEKKIEDLTEILDRYVQEQIPGAKVVVESIGARRHGDAFSLEDYTKCDLTVYAIDPQTNRAIDRNELFKFLDGKLLDINKDFQPYYGEGGRILEIRTPEAVTSIKKRGESLGYTEGALLALAFIIILCCIPAILVVLVSYRQFKVRQAECTKTARIQAALPAAKPAVPAPAPVAAPPPPPPPPPGAHLYEELGDSSILFLLYHFQQSRGNNSVSEDRKHQQVVMPFSSNTIEAHKSAHVDGSLKSNKLKSARKFTFLSDEDDLSAHNPLYKENISQVSTNSDISQRTDFVDPFSPKIQAKSKSLRGPREKIQRLWSQSVSLPRRLMRKVPNRPEIIDLQQWQGTRQKAENENTGICTNKRGSSNPLLTTEEANLTEKEEIRQGETLMIEGTEQLKSLSSDSSFCFPRPHFSFSTLPTVSRTVELKSEPNVISSPAECSLELSPSRPCVLHSSLSRRETPICMLPIETERNIFENFAHPPNISPSACPLPPPPPISPPSPPPAPAPLAPPPDISPFSLFCPPPSPPSIPLPLPPPTFFPLSVSTSGPPTPPLLPPFPTPLPPPPPSIPCPPPPSASFLSTECVCITGVKCTTNLMPAEKIKSSMTQLSTTTVCKTDPQREPKGILRHVKNLAELEKSVANMYSQIEKNYLRTNVSELQTMCPSEVTNMEITSEQNKGSLNNIVEGTEKQSHSQSTSL,1955,NP_149045.3.csv,refseq-PCDH15-NM_033056.3_clinical_seed_0_final,refseq-PCDH15-NM_033056.3.a2m,Invitae,refseq-PCDH15-NM_033056.3.npy,1,1955,1955
+NP_149059.1,MAGLGFWGHPAGPLLLLLLLVLPPRALPEGPLVFVALVFRHGDRAPLASYPMDPHKEVASTLWPRGLGQLTTEGVRQQLELGRFLRSRYEAFLSPEYRREEVYIRSTDFDRTLESAQANLAGLFPEAAPGSPEARWRPIPVHTVPVAEDKLLRFPMRSCPRYHELLREATEAAEYQEALEGWTGFLSRLENFTGLSLVGEPLRRAWKVLDTLMCQQAHGLPLPAWASPDVLRTLAQISALDIGAHVGPPRAAEKAQLTGGILLNAILANFSRVQRLGLPLKMVMYSAHDSTLLALQGALGLYDGHTPPYAACLGFEFRKHLGNPAKDGGNVTVSLFYRNDSAHLPLPLSLPGCPAPCPLGRFYQLTAPARPPAHGVSCHGPYEAAIPPAPVVPLLAGAVAVLVALSLGLGLLAWRPGCLRALGGPV,426,NP_149059.1.csv,refseq-ACP4-NM_033068.2_clinical_seed_0_final,refseq-ACP4-NM_033068.2.a2m,Invitae,refseq-ACP4-NM_033068.2.npy,1,426,426
+NP_149075.2,MVSKRRLSKSEDKESLTEDASKTRKQPLSKKTKKSHIANEVEENDSIFVKLLKISGIILKTGESQNQLAVDQIAFQKKLFQTLRRHPSYPKIIEEFVSGLESYIEDEDSFRNCLLSCERLQDEEASMGASYSKSLIKLLLGIDILQPAIIKTLFEKLPEYFFENKNSDEINIPRLIVSQLKWLDRVVDGKDLTTKIMQLISIAPENLQHDIITSLPEILGDSQHADVGKELSDLLIENTSLTVPILDVLSSLRLDPNFLLKVRQLVMDKLSSIRLEDLPVIIKFILHSVTAMDTLEVISELREKLDLQHCVLPSRLQASQVKLKSKGRASSSGNQESSGQSCIILLFDVIKSAIRYEKTISEAWIKAIENTASVSEHKVFDLVMLFIIYSTNTQTKKYIDRVLRNKIRSGCIQEQLLQSTFSVHYLVLKDMCSSILSLAQSLLHSLDQSIISFGSLLYKYAFKFFDTYCQQEVVGALVTHICSGNEAEVDTALDVLLELVVLNPSAMMMNAVFVKGILDYLDNISPQQIRKLFYVLSTLAFSKQNEASSHIQDDMHLVIRKQLSSTVFKYKLIGIIGAVTMAGIMAADRSESPSLTQERANLSDEQCTQVTSLLQLVHSCSEQSPQASALYYDEFANLIQHEKLDPKALEWVGHTICNDFQDAFVVDSCVVPEGDFPFPVKALYGLEEYDTQDGIAINLLPLLFSQDFAKDGGPVTSQESGQKLVSPLCLAPYFRLLRLCVERQHNGNLEEIDGLLDCPIFLTDLEPGEKLESMSAKERSFMCSLIFLTLNWFREIVNAFCQETSPEMKGKVLTRLKHIVELQIILEKYLAVTPDYVPPLGNFDVETLDITPHTVTAISAKIRKKGKIERKQKTDGSKTSSSDTLSEEKNSECDPTPSHRGQLNKEFTGKEEKTSLLLHNSHAFFRELDIEVFSILHCGLVTKFILDTEMHTEATEVVQLGPPELLFLLEDLSQKLESMLTPPIARRVPFLKNKGSRNIGFSHLQQRSAQEIVHCVFQLLTPMCNHLENIHNYFQCLAAENHGVVDGPGVKVQEYHIMSSCYQRLLQIFHGLFAWSGFSQPENQNLLYSALHVLSSRLKQGEHSQPLEELLSQSVHYLQNFHQSIPSFQCALYLIRLLMVILEKSTASAQNKEKIASLARQFLCRVWPSGDKEKSNISNDQLHALLCIYLEHTESILKAIEEIAGVGVPELINSPKDASSSTFPTLTRHTFVVFFRVMMAELEKTVKKIEPGTAADSQQIHEEKLLYWNMAVRDFSILINLIKVFDSHPVLHVCLKYGRLFVEAFLKQCMPLLDFSFRKHREDVLSLLETFQLDTRLLHHLCGHSKIHQDTRLTQHVPLLKKTLELLVCRVKAMLTLNNCREAFWLGNLKNRDLQGEEIKSQNSQESTADESEDDMSSQASKSKATEVSLQNPPESGTDGCILLIVLSWWSRTLPTYVYCQMLLCPFPFPP,1471,NP_149075.2.csv,refseq-FANCD2-NM_033084.3_clinical_seed_0_final,refseq-FANCD2-NM_033084.3.a2m,Invitae,refseq-FANCD2-NM_033084.3_theta_0.2.npy,1,1471,1471
+NP_149091.1,MRRCRWAALALGLLRLCLAQANFAPHFFDNGVGSTNGNMALFSLPEDTPVGSHVYTLNGTDPEGDPISYHISFDPSTRSVFSVDPTFGNITLVEELDREREDEIEAIISISDGLNLVAEKVVILVTDANDEAPRFIQEPYVALVPEDIPAGSIIFKVHAVDRDTGSGGSVTYFLQNLHSPFAVDRHSGVLRLQAGATLDYERSRTHYITVVAKDGGGRLHGADVVFSATTTVTVNVEDVQDMAPVFVGTPYYGYVYEDTLPGSEVLKVVAMDGDRGKPNRILYSLVNGNDGAFEINETSGAISITQSPAQLQREVYELHVQVTEMSPAGSPAAQATVPVTIRIVDLNNHPPTFYGESGPQNRFELSMNEHPPQGEILRGLKITVNDSDQGANAKFNLQLVGPRGIFRVVPQTVLNEAQVTIIVENSAAIDFEKSKVLTFKLLAVEVNTPEKFSSTADVVIQLLDTNDNVPKFDSLYYVARIPENAPGGSSVVAVTAVDPDTGPWGEVKYSTYGTGADLFLIHPSTGLIYTQPWASLDAEATARYNFYVKAEDMEGKYSVAEVFITLLDVNDHPPQFGKSVQKKTMVLGTPVKIEAIDEDAEEPNNLVDYSITHAEPANVFDINSHTGEIWLKNSIRSLDALHNITPGRDCLWSLEVQAKDRGSPSFSTTALLKIDITDAETLSRSPMAAFLIQTKDNPMKAVGVLAGTMATVVAITVLISTATFWRNKKSNKVLPMRRVLRKRPSPAPRTIRIEWLKSKSTKAATKFMLKEKPPNENCNNNSPESSLLPRAPALPPPPSVAPSTGAAQWTVPTVSGSLTPQPTQPPPKPKTMGSPVQSTLISELKQKFEKKSVHNKAYF,859,NP_149091.1.csv,refseq-CDHR1-NM_033100.3_clinical_seed_0_final,refseq-CDHR1-NM_033100.3.a2m,Invitae,refseq-CDHR1-NM_033100.3.npy,1,859,859
+NP_149100.2,MAACRYCCSCLRLRPLSDGPFLLPRRDRALTQLQVRALWSSAGSRAVAVDLGNRKLEISSGKLARFADGSAVVQSGDTAVMVTAVSKTKPSPSQFMPLVVDYRQKAAAAGRIPTNYLRREIGTSDKEILTSRIIDRSIRPLFPAGYFYDTQVLCNLLAVDGVNEPDVLAINGASVALSLSDIPWNGPVGAVRIGIIDGEYVVNPTRKEMSSSTLNLVVAGAPKSQIVMLEASAENILQQDFCHAIKVGVKYTQQIIQGIQQLVKETGVTKRTPQKLFTPSPEIVKYTHKLAMERLYAVFTDYEHDKVSRDEAVNKIRLDTEEQLKEKFPEADPYEIIESFNVVAKEVFRSIVLNEYKRCDGRDLTSLRNVSCEVDMFKTLHGSALFQRGQTQVLCTVTFDSLESGIKSDQVITAINGIKDKNFMLHYEFPPYATNEIGKVTGLNRRELGHGALAEKALYPVIPRDFPFTIRVTSEVLESNGSSSMASACGGSLALMDSGVPISSAVAGVAIGLVTKTDPEKGEIEDYRLLTDILGIEDYNGDMDFKIAGTNKGITALQADIKLPGIPIKIVMEAIQQASVAKKEILQIMNKTISKPRASRKENGPVVETVQVPLSKRAKFVGPGGYNLKKLQAETGVTISQVDEETFSVFAPTPSAMHEARDFITEICKDDQEQQLEFGAVYTATITEIRDTGVMVKLYPNMTAVLLHNTQLDQRKIKHPTALGLEVGQEIQVKYFGRDPADGRMRLSRKVLQSPATTVVRTLNDRSSIVMGEPISQSSSNSQ,783,NP_149100.2.csv,refseq-PNPT1-NM_033109.4_clinical_seed_0_final,refseq-PNPT1-NM_033109.4.a2m,Invitae,refseq-PNPT1-NM_033109.4.npy,1,783,783
+NP_149107.4,MSVLGEYERHCDSINSDFGSESGGCGDSSPGPSASQGPRAGGGAAEQEELHYIPIRVLGRGAFGEATLYRRTEDDSLVVWKEVDLTRLSEKERRDALNEIVILALLQHDNIIAYYNHFMDNTTLLIELEYCNGGNLYDKILRQKDKLFEEEMVVWYLFQIVSAVSCIHKAGILHRDIKTLNIFLTKANLIKLGDYGLAKKLNSEYSMAETLVGTPYYMSPELCQGVKYNFKSDIWAVGCVIFELLTLKRTFDATNPLNLCVKIVQGIRAMEVDSSQYSLELIQMVHSCLDQDPEQRPTADELLDRPLLRKRRREMEEKVTLLNAPTKRPRSSTVTEAPIAVVTSRTSEVYVWGGGKSTPQKLDVIKSGCSARQVCAGNTHFAVVTVEKELYTWVNMQGGTKLHGQLGHGDKASYRQPKHVEKLQGKAIRQVSCGDDFTVCVTDEGQLYAFGSDYYGCMGVDKVAGPEVLEPMQLNFFLSNPVEQVSCGDNHVVVLTRNKEVYSWGCGEYGRLGLDSEEDYYTPQKVDVPKALIIVAVQCGCDGTFLLTQSGKVLACGLNEFNKLGLNQCMSGIINHEAYHEVPYTTSFTLAKQLSFYKIRTIAPGKTHTAAIDERGRLLTFGCNKCGQLGVGNYKKRLGINLLGGPLGGKQVIRVSCGDEFTIAATDDNHIFAWGNGGNGRLAMTPTERPHGSDICTSWPRPIFGSLHHVPDLSCRGWHTILIVEKVLNSKTIRSNSSGLSIGTVFQSSSPGGGGGGGGGEEEDSQQESETPDPSGGFRGTMEADRGMEGLISPTEAMGNSNGASSSCPGWLRKELENAEFIPMPDSPSPLSAAFSESEKDTLPYEELQGLKVASEAPLEHKPQVEASSPRLNPAVTCAGKGTPLTPPACACSSLQVEVERLQGLVLKCLAEQQKLQQENLQIFTQLQKLNKKLEGGQQVGMHSKGTQTAKEEMEMDPKPDLDSDSWCLLGTDSCRPSL,979,NP_149107.4.csv,refseq-NEK9-NM_033116.6_clinical_seed_0_final,refseq-NEK9-NM_033116.6.a2m,Invitae,refseq-NEK9-NM_033116.6.npy,1,979,979
+NP_149353.1,MGSPRSALSCLLLHLLVLCLQAQEGPGRGPALGRELASLFRAGREPQGVSQQVTVQSSPNFTQHVREQSLVTDQLSRRLIRTYQLYSRTSGKHVQVLANKRINAMAEDGDPFAKLIVETDTFGSRVRVRGAETGLYICMNKKGKLIAKSNGKGKDCVFTEIVLENNYTALQNAKYEGWYMAFTRKGRPRKGSKTRQHQREVHFMKRLPRGHHTTEQSLRFEFLNYPPFTRSLRGSQRTWAPEPR,244,NP_149353.1.csv,refseq-FGF8-NM_033163.3_clinical_seed_0_final,refseq-FGF8-NM_033163.3.a2m,Invitae,refseq-FGF8-NM_033163.3.npy,1,244,244
+NP_149975.1,MAAAGAAATHLEVARGKRAALFFAAVAIVLGLPLWWKTTETYRASLPYSQISGLNALQLRLMVPVTVVFTRESVPLDDQEKLPFTVVHEREIPLKYKMKIKCRFQKAYRRALDHEEEALSSGSVQEAEAMLDEPQEQAEGSLTVYVISEHSSLLPQDMMSYIGPKRTAVVRGIMHREAFNIIGRRIVQVAQAMSLTEDVLAAALADHLPEDKWSAEKRRPLKSSLGYEITFSLLNPDPKSHDVYWDIEGAVRRYVQPFLNALGAAGNFSVDSQILYYAMLGVNPRFDSASSSYYLDMHSLPHVINPVESRLGSSAASLYPVLNFLLYVPELAHSPLYIQDKDGAPVATNAFHSPRWGGIMVYNVDSKTYNASVLPVRVEVDMVRVMEVFLAQLRLLFGIAQPQLPPKCLLSGPTSEGLMTWELDRLLWARSVENLATATTTLTSLAQLLGKISNIVIKDDVASEVYKAVAAVQKSAEELASGHLASAFVASQEAVTSSELAFFDPSLLHLLYFPDDQKFAIYIPLFLPMAVPILLSLVKIFLETRKSWRKPEKTD,555,NP_149975.1.csv,refseq-PIGS-NM_033198.3_clinical_seed_0_final,refseq-PIGS-NM_033198.3.a2m,Invitae,refseq-PIGS-NM_033198.3.npy,1,555,555
+NP_201569.1,MAAESGELIGACEFMKDRLYFATLRNRPKSTVNTHYFSIDEELVYENFYADFGPLNLAMVYRYCCKLNKKLKSYSLSRKKIVHYTCFDQRKRANAAFLIGAYAVIYLKKTPEEAYRALLSGSNPPYLPFRDASFGNCTYNLTILDCLQGIRKGLQHGFFDFETFDVDEYEHYERVENGDFNWIVPGKFLAFSGPHPKSKIENGYPLHAPEAYFPYFKKHNVTAVVRLNKKIYEAKRFTDAGFEHYDLFFIDGSTPSDNIVRRFLNICENTEGAIAVHCKAGLGRTGTLIACYVMKHYRFTHAEIIAWIRICRPGSIIGPQQHFLEEKQASLWVQGDIFRSKLKNRPSSEGSINKILSGLDDMSIGGNLSKTQNMERFGEDNLEDDDVEMKNGITQGDKLRALKSQRQPRTSPSCAFRSDDTKGHPRAVSQPFRLSSSLQGSAVTLKTSKMALSPSATAKRINRTSLSSGATVRSFSINSRLASSLGNLNAATDDPENKKTSSSSKAGFTASPFTNLLNGSSQPTTRNYPELNNNQYNRSSNSNGGNLNSPPGPHSAKTEEHTTILRPSYTGLSSSSARFLSRSIPVSAQTPPPGPQNPECNFCALPSQPRLPPKKFNSAKEAF,623,NP_201569.1.csv,NP_201569.1_colabfold_clinical_seed_0_final,NP_201569.1_colabfold.a2m,colabfold,NP_201569.1_colabfold_theta_0.2.npy,1,623,623
+NP_201583.2,MSSKQATSPFACAADGEDAMTQDLTSREKEEGSDQHVASHLPLHPIMHNKPHSEELPTLVSTIQQDADWDSVLSSQQRMESENNKLCSLYSFRNTSTSPHKPDEGSRDREIMTSVTFGTPERRKGSLADVVDTLKQKKLEEMTRTEQEDSSCMEKLLSKDWKEKMERLNTSELLGEIKGTPESLAEKERQLSTMITQLISLREQLLAAHDEQKKLAASQIEKQRQQMDLARQQQEQIARQQQQLLQQQHKINLLQQQIQVQGHMPPLMIPIFPHDQRTLAAAAAAQQGFLFPPGITYKPGDNYPVQFIPSTMAAAAASGLSPLQLQKGHVSHPQINQRLKGLSDRFGRNLDTFEHGGGHSYNHKQIEQLYAAQLASMQVSPGAKMPSTPQPPNTAGTVSPTGIKNEKRGTSPVTQVKDEAAAQPLNLSSRPKTAEPVKSPTSPTQNLFPASKTSPVNLPNKSSIPSPIGGSLGRGSSLDILSSLNSPALFGDQDTVMKAIQEARKMREQIQREQQQQQPHGVDGKLSSINNMGLNSCRNEKERTRFENLGPQLTGKSNEDGKLGPGVIDLTRPEDAEGGATVAEARVYRDARGRASSEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKSMSNQEKQPYYEEQARLSKIHLEKYPNYKYKPRPKRTCIVDGKKLRIGEYKQLMRSRRQEMRQFFTVGQQPQIPITTGTGVVYPGAITMATTTPSPQMTSDCSSTSASPEPSLPVIQSTYGMKTDGGSLAGNEMINGEDEMEMYDDYEDDPKSDYSSENEAPEAVSAN,808,NP_201583.2.csv,refseq-SOX6-NM_033326.3_clinical_seed_0_final,refseq-SOX6-NM_033326.3.a2m,Invitae,refseq-SOX6-NM_033326.3.npy,1,808,808
+NP_203123.1,MMAEEHTDLEAQIVKDIHCKEIDLVNRDPKNINEDIVKVDFEDVIAEPVGTYSFDGVWKVSYTTFTVSKYWCYRLLSTLLGVPLALLWGFLFACISFCHIWAVVPCIKSYLIEIQCISHIYSLCIRTFCNPLFAALGQVCSSIKVVLRKEV,151,NP_203123.1.csv,refseq-CAV3-NM_033337.2_clinical_seed_0_final,refseq-CAV3-NM_033337.2.a2m,Invitae,refseq-CAV3-NM_033337.2.npy,1,151,151
+NP_203129.1,MMQSATVPAEGAVKGLPEMLGVPMQQIPQCAGCNQHILDKFILKVLDRHWHSSCLKCADCQMQLADRCFSRAGSVYCKEDFFKRFGTKCTACQQGIPPTQVVRKAQDFVYHLHCFACIICNRQLATGDEFYLMEDGRLVCKEDYETAKQNDDSEAGAKRPRTTITAKQLETLKNAYKNSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRHRWGQFYKSVKRSRGSSKQEKESSAEDCGVSDSELSFREDQILSELGHTNRIYGNVGDVTGGQLMNGSFSMDGTGQSYQDLRDGSPYGIPQSPSSISSLPSHAPLLNGLDYTVDSNLGIIAHAGQGVSQTLRAMAGGPTSDISTGSSVGYPDFPTSPGSWLDEMDHPPF,390,NP_203129.1.csv,refseq-LHX4-NM_033343.3_clinical_seed_0_final,refseq-LHX4-NM_033343.3.a2m,Invitae,refseq-LHX4-NM_033343.3.npy,1,390,390
+NP_212134.3,MAFLMHLLVCVFGMGSWVTINGLWVELPLLVMELPEGWYLPSYLTVVIQLANIGPLLVTLLHHFRPSCLSEVPIIFTLLGVGTVTCIIFAFLWNMTSWVLDGHHSIAFLVLTFFLALVDCTSSVTFLPFMSRLPTYYLTTFFVGEGLSGLLPALVALAQGSGLTTCVNVTEISDSVPSPVPTRETDIAQGVPRALVSALPGMEAPLSHLESRYLPAHFSPLVFFLLLSIMMACCLVAFFVLQRQPRCWEASVEDLLNDQVTLHSIRPREENDLGPAGTVDSSQGQGYLEEKAAPCCPAHLAFIYTLVAFVNALTNGMLPSVQTYSCLSYGPVAYHLAATLSIVANPLASLVSMFLPNRSLLFLGVLSVLGTCFGGYNMAMAVMSPCPLLQGHWGGEVLIVASWVLFSGCLSYVKVMLGVVLRDLSRSALLWCGAAVQLGSLLGALLMFPLVNVLRLFSSADFCNLHCPA,469,NP_212134.3.csv,refseq-SLC52A3-NM_033409.3_clinical_seed_0_final,refseq-SLC52A3-NM_033409.3.a2m,Invitae,refseq-SLC52A3-NM_033409.3.npy,1,469,469
+NP_219487.3,MAGLAARLVLLAGAAALASGSQGDREPVYRDCVLQCEEQNCSGGALNHFRSRQPIYMSLAGWTCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCRYRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVILHSIYLCCVRTVGLQHPAVVSAFRALLLLMLTVHVSYLSLIRFDYGYNLVANVAIGLVNVVWWLAWCLWNQRRLPHVRKCVVVVLLLQGLSLLELLDFPPLFWVLDAHAIWHISTIPVHVLFFSFLEDDSLYLLKESEDKFKLD,320,NP_219487.3.csv,refseq-PGAP3-NM_033419.4_clinical_seed_0_final,refseq-PGAP3-NM_033419.4.a2m,Invitae,refseq-PGAP3-NM_033419.4.npy,1,320,320
+NP_254275.1,MIRTLLLSTLVAGALSCGDPTYPPYVTRVVGGEEARPNSWPWQVSLQYSSNGKWYHTCGGSLIANSWVLTAAHCISSSRTYRVGLGRHNLYVAESGSLAVSVSKIVVHKDWNSNQISKGNDIALLKLANPVSLTDKIQLACLPPAGTILPNNYPCYVTGWGRLQTNGAVPDVLQQGRLLVVDYATCSSSAWWGSSVKTSMICAGGDGVISSCNGDSGGPLNCQASDGRWQVHGIVSFGSRLGCNYYHKPSVFTRVSNYIDWINSVIANN,269,NP_254275.1.csv,refseq-CELA2A-NM_033440.2_clinical_seed_0_final,refseq-CELA2A-NM_033440.2.a2m,Invitae,refseq-CELA2A-NM_033440.2.npy,1,269,269
+NP_258412.1,MAASLVGKKIVFVTGNAKKLEEVVQILGDKFPCTLVAQKIDLPEYQGEPDEISIQKCQEAVRQVQGPVLVEDTCLCFNALGGLPGPYIKWFLEKLKPEGLHQLLAGFEDKSAYALCTFALSTGDPSQPVRLFRGRTSGRIVAPRGCQDFGWDPCFQPDGYEQTYAEMPKAEKNAVSHRFRALLELQEYFGSLAA,194,NP_258412.1.csv,refseq-ITPA-NM_033453.3_clinical_seed_0_final,refseq-ITPA-NM_033453.3.a2m,Invitae,refseq-ITPA-NM_033453.3.npy,1,194,194
+NP_291028.3,MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIKHRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPVVFFVDYASNCLYMEEIEGSVTVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIVLIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLDEVRLRGRKRSMVG,253,NP_291028.3.csv,refseq-TP53RK-NM_033550.3_clinical_seed_0_final,refseq-TP53RK-NM_033550.3.a2m,Invitae,refseq-TP53RK-NM_033550.3.npy,1,253,253
+NP_338599.1,MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE,314,NP_338599.1.csv,refseq-TREX1-NM_033629.3_clinical_seed_0_final,refseq-TREX1-NM_033629.3.a2m,Invitae,refseq-TREX1-NM_033629.3_theta_0.2.npy,1,314,314
+NP_387510.1,MAGSVGLALCGQTLVVRGGSRFLATSIASSDDDSLFIYDCSAAEKKSQENKGEDAPLDQGSGAILASTFSKSGSYFALTDDSKRLILFRTKPWQCLSVRTVARRCTALTFIASEEKVLVADKSGDVYSFSVLEPHGCGRLELGHLSMLLDVAVSPDDRFILTADRDEKIRVSWAAAPHSIESFCLGHTEFVSRISVVPTQPGLLLSSSGDGTLRLWEYRSGRQLHCCHLASLQELVDPQAPQKFAASRIAFWCQENCVALLCDGTPVVYIFQLDARRQQLVYRQQLAFQHQVWDVAFEETQGLWVLQDCQEAPLVLYRPVGDQWQSVPESTVLKKVSGVLRGNWAMLEGSAGADASFSSLYKATFDNVTSYLKKKEERLQQQLEKKQRRRSPPPGPDGHAKKMRPGEATLSC,412,NP_387510.1.csv,refseq-WDR4-NM_033661.4_clinical_seed_0_final,refseq-WDR4-NM_033661.4.a2m,Invitae,refseq-WDR4-NM_033661.4.npy,1,412,412
+NP_443076.2,MATRAQPGPLSQAGSAGVAALATVGVASGPGPGRPGPLQDETLGVASVPSQWRAVQGIRWETKSCQTASIATASASAQARNHVDAQVQTEAPVPVSVQPPSQYDIPRLAAFLRRVEAMVIRELNKNWQSHAFDGFEVNWTEQQQMVSCLYTLGYPPAQAQGLHVTSISWNSTGSVVACAYGRLDHGDWSTLKSFVCAWNLDRRDLRPQQPSAVVEVPSAVLCLAFHPTQPSHVAGGLYSGEVLVWDLSRLEDPLLWRTGLTDDTHTDPVSQVVWLPEPGHSHRFQVLSVATDGKVLLWQGIGVGQLQLTEGFALVMQQLPRSTKLKKHPRGETEVGATAVAFSSFDPRLFILGTEGGFPLKCSLAAGEAALTRMPSSVPLRAPAQFTFSPHGGPIYSVSCSPFHRNLFLSAGTDGHVHLYSMLQAPPLTSLQLSLKYLFAVRWSPVRPLVFAAASGKGDVQLFDLQKSSQKPTVLIKQTQDESPVYCLEFNSQQTQLLAAGDAQGTVKVWQLSTEFTEQGPREAEDLDCLAAEVAA,536,NP_443076.2.csv,refseq-WDR34-NM_052844.3_clinical_seed_0_final,refseq-WDR34-NM_052844.3.a2m,Invitae,refseq-WDR34-NM_052844.3.npy,1,536,536
+NP_443077.1,MAVCGLGSRLGLGSRLGLRGCFGAARLLYPRFQSRGPQGVEDGDRPQPSSKTPRIPKIYTKTGDKGFSSTFTGERRPKDDQVFEAVGTTDELSSAIGFALELVTEKGHTFAEELQKIQCTLQDVGSALATPCSSAREAHLKYTTFKAGPILELEQWIDKYTSQLPPLTAFILPSGGKISSALHFCRAVCRRAERRVVPLVQMGETDANVAKFLNRLSDYLFTLARYAAMKEGNQEKIYMKNDPSAESEGL,250,NP_443077.1.csv,refseq-MMAB-NM_052845.3_clinical_seed_0_final,refseq-MMAB-NM_052845.3.a2m,Invitae,refseq-MMAB-NM_052845.3.npy,1,250,250
+NP_443091.1,MGSQEVLGHAARLASSGLLLQVLFRLITFVLNAFILRFLSKEIVGVVNVRLTLLYSTTLFLAREAFRRACLSGGTQRDWSQTLNLLWLTVPLGVFWSLFLGWIWLQLLEVPDPNVVPHYATGVVLFGLSAVVELLGEPFWVLAQAHMFVKLKVIAESLSVILKSVLTAFLVLWLPHWGLYIFSLAQLFYTTVLVLCYVIYFTKLLGSPESTKLQTLPVSRITDLLPNITRNGAFINWKEAKLTWSFFKQSFLKQILTEGERYVMTFLNVLNFGDQGVYDIVNNLGSLVARLIFQPIEESFYIFFAKVLERGKDATLQKQEDVAVAAAVLESLLKLALLAGLTITVFGFAYSQLALDIYGGTMLSSGSGPVLLRSYCLYVLLLAINGVTECFTFAAMSKEEVDRYNFVMLALSSSFLVLSYLLTRWCGSVGFILANCFNMGIRITQSLCFIHRYYRRSPHRPLAGLHLSPVLLGTFALSGGVTAVSEVFLCCEQGWPARLAHIAVGAFCLGATLGTAFLTETKLIHFLRTQLGVPRRTDKMT,541,NP_443091.1.csv,refseq-RFT1-NM_052859.3_clinical_seed_0_final,refseq-RFT1-NM_052859.3.a2m,Invitae,refseq-RFT1-NM_052859.3.npy,1,541,541
+NP_443099.1,MLKRKQSSRVEAQPVTDFGPDESLSDNADILWINKPWVHSLLRICAIISVISVCMNTPMTFEHYPPLQYVTFTLDTLLMFLYTAEMIAKMHIRGIVKGDSSYVKDRWCVFDGFMVFCLWVSLVLQVFEIADIVDQMSPWGMLRIPRPLIMIRAFRIYFRFELPRTRITNILKRSGEQIWSVSIFLLFFLLLYGILGVQMFGTFTYHCVVNDTKPGNVTWNSLAIPDTHCSPELEEGYQCPPGFKCMDLEDLGLSRQELGYSGFNEIGTSIFTVYEAASQEGWVFLMYRAIDSFPRWRSYFYFITLIFFLAWLVKNVFIAVIIETFAEIRVQFQQMWGSRSSTTSTATTQMFHEDAAGGWQLVAVDVNKPQGRAPACLQKMMRSSVFHMFILSMVTVDVIVAASNYYKGENFRRQYDEFYLAEVAFTVLFDLEALLKIWCLGFTGYISSSLHKFELLLVIGTTLHVYPDLYHSQFTYFQVLRVVRLIKISPALEDFVYKIFGPGKKLGSLVVFTASLLIVMSAISLQMFCFVEELDRFTTFPRAFMSMFQILTQEGWVDVMDQTLNAVGHMWAPVVAIYFILYHLFATLILLSLFVAVILDNLELDEDLKKLKQLKQSEANADTKEKLPLRLRIFEKFPNRPQMVKISKLPSDFTVPKIRESFMKQFIDRQQQDTCCLLRSLPTTSSSSCDHSKRSAIEDNKYIDQKLRKSVFSIRARNLLEKETAVTKILRACTRQRMLSGSFEGQPAKERSILSVQHHIRQERRSLRHGSNSQRISRGKSLETLTQDHSNTVRYRNAQREDSEIKMIQEKKEQAEMKRKVQEEELRENHPYFDKPLFIVGREHRFRNFCRVVVRARFNASKTDPVTGAVKNTKYHQLYDLLGLVTYLDWVMIIVTICSCISMMFESPFRRVMHAPTLQIAEYVFVIFMSIELNLKIMADGLFFTPTAVIRDFGGVMDIFIYLVSLIFLCWMPQNVPAESGAQLLMVLRCLRPLRIFKLVPQMRKVVRELFSGFKEIFLVSILLLTLMLVFASFGVQLFAGKLAKCNDPNIIRREDCNGIFRINVSVSKNLNLKLRPGEKKPGFWVPRVWANPRNFNFDNVGNAMLALFEVLSLKGWVEVRDVIIHRVGPIHGIYIHVFVFLGCMIGLTLFVGVVIANFNENKGTALLTVDQRRWEDLKSRLKIAQPLHLPPRPDNDGFRAKMYDITQHPFFKRTIALLVLAQSVLLSVKWDVEDPVTVPLATMSVVFTFIFVLEVTMKIIAMSPAGFWQSRRNRYDLLVTSLGVVWVVLHFALLNAYTYMMGACVIVFRFFSICGKHVTLKMLLLTVVVSMYKSFFIIVGMFLLLLCYAFAGVVLFGTVKYGENINRHANFSSAGKAITVLFRIVTGEDWNKIMHDCMVQPPFCTPDEFTYWATDCGNYAGALMYFCSFYVIIAYIMLNLLVAIIVENFSLFYSTEEDQLLSYNDLRHFQIIWNMVDDKREGVIPTFRVKFLLRLLRGRLEVDLDKDKLLFKHMCYEMERLHNGGDVTFHDVLSMLSYRSVDIRKSLQLEELLAREQLEYTIEEEVAKQTIRMWLKKCLKRIRAKQQQSCSIIHSLRESQQQELSRFLNPPSIETTQPSEDTNANSQDNSMQPETSSQQQLLSPTLSDRGGSRQDAADAGKPQRKFGQWRLPSAPKPISHSVSSVNLRFGGRTTMKSVVCKMNPMTDAASCGSEVKKWWTRQLTVESDESGDDLLDI,1738,NP_443099.1.csv,refseq-NALCN-NM_052867.2_clinical_seed_0_final,refseq-NALCN-NM_052867.2.a2m,Invitae,refseq-NALCN-NM_052867.2_theta_0.2.npy,1,1738,1738
+NP_443105.2,MEDLLDLDEELRYSLATSRAKMGRRAQQESAQAENHLNGKNSSLTLTGETSSAKLPRCRQGGWAGDSVKASNGTQTGKQQLDLNACYHKTHHRDLGLASLEEADIPIIPDLEEVQEEDFVLQVAAPPSIQIKRVMTYRDLDNDLMKYSAIQTLDGEIDLKLLTKVLAPEHEVREDDVGWDWDHLFTEVSSEVLTEWDPLQTEKEDPAGQARHT,213,NP_443105.2.csv,refseq-IFT43-NM_052873.2_clinical_seed_0_final,refseq-IFT43-NM_052873.2.a2m,Invitae,refseq-IFT43-NM_052873.2.npy,1,213,213
+NP_443106.1,MKDRTQELRSAKDSDDEEEVVHVDRDHFMDEFFEQVEEIRGCIEKLSEDVEQVKKQHSAILAAPNPDEKTKQELEDLTADIKKTANKVRSKLKAIEQSIEQEEGLNRSSADLRIRKTQHSTLSRKFVEVMTEYNATQSKYRDRCKDRIQRQLEITGRTTTNEELEDMLESGKLAIFTDDIKMDSQMTKQALNEIETRHNEIIKLETSIRELHDMFVDMAMLVESQGEMIDRIEYNVEHSVDYVERAVSDTKKAVKYQSKARRKKIMIIICCVVLGVVLASSIGGTLGL,288,NP_443106.1.csv,refseq-STX1B-NM_052874.3_clinical_seed_0_final,refseq-STX1B-NM_052874.3.a2m,Invitae,refseq-STX1B-NM_052874.3.npy,1,288,288
+NP_443108.1,MAQTLQMEIPNFGNSILECLNEQRLQGLYCDVSVVVKGHAFKAHRAVLAASSSYFRDLFNNSRSAVVELPAAVQPQSFQQILSFCYTGRLSMNVGDQFLLMYTAGFLQIQEIMEKGTEFFLKVSSPSCDSQGLHAEEAPSSEPQSPVAQTSGWPACSTPLPLVSRVKTEQQESDSVQCMPVAKRLWDSGQKEAGGGGNGSRKMAKFSTPDLAANRPHQPPPPQQAPVVAAAQPAVAAGAGQPAGGVAAAGGVVSGPSTSERTSPGTSSAYTSDSPGSYHNEEDEEEDGGEEGMDEQYRQICNMYTMYSMMNVGQTAEKVEALPEQVAPESRNRIRVRQDLASLPAELINQIGNRCHPKLYDEGDPSEKLELVTGTNVYITRAQLMNCHVSAGTRHKVLLRRLLASFFDRNTLANSCGTGIRSSTNDPRRKPLDSRVLHAVKYYCQNFAPNFKESEMNAIAADMCTNARRVVRKSWMPKVKVLKAEDDAYTTFISETGKIEPDMMGVEHGFETASHEGEAGPSAEALQ,527,NP_443108.1.csv,refseq-NACC1-NM_052876.3_clinical_seed_0_final,refseq-NACC1-NM_052876.3.a2m,Invitae,refseq-NACC1-NM_052876.3.npy,1,527,527
+NP_443197.1,MEERGDSEPTPGCSGLGPGGVRGFGDGGGAPSWAPEDAWMGTHPKYLEMMELDIGDATQVYVAFLVYLDLMESKSWHEVNCVGLPELQLICLVGTEIEGEGLQTVVPTPITASLSHNRIREILKASRKLQGDPDLPMSFTLAIVESDSTIVYYKLTDGFMLPDPQNISLRR,171,NP_443197.1.csv,refseq-TSEN15-NM_052965.3_clinical_seed_0_final,refseq-TSEN15-NM_052965.3.a2m,Invitae,refseq-TSEN15-NM_052965.3.npy,1,171,171
+NP_443711.2,MRAVLTWRDKAEHCINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSVIIWTSKLEGILKYTSWSVMSSLHLHLPFLGLHKTVRVTATDKAPKGQGGRIDCLRPSVQNQPGQKHNDAIQCVSYNPITHQLASCSSSDFGLWSPEQKSVSKHKSSSKIICCSWTNDGQYLALGMFNGIISIRNKNGEEKVKIERPGGSLSPIWSICWNPSSRWESFWMNRENEDAEDVIVNRYIQEIPSTLKSAVYSSQGSEAEEEEPEEEDDSPRDDNLEERNDILAVADWGQKVSFYQLSGKQIGKDRALNFDPCCISYFTKGEYILLGGSDKQVSLFTKDGVRLGTVGEQNSWVWTCQAKPDSNYVVVGCQDGTISFYQLIFSTVHGLYKDRYAYRDSMTDVIVQHLITEQKVRIKCKELVKKIAIYRNRLAIQLPEKILIYELYSEDLSDMHYRVKEKIIKKFECNLLVVCANHIILCQEKRLQCLSFSGVKEREWQMESLIRYIKVIGGPPGREGLLVGLKNGQILKIFVDNLFAIVLLKQATAVRCLDMSASRKKLAVVDENDTCLVYDIDTKELLFQEPNANSVAWNTQCEDMLCFSGGGYLNIKASTFPVHRQKLQGFVVGYNGSKIFCLHVFSISAVEVPQSAPMYQYLDRKLFKEAYQIACLGVTDTDWRELAMEALEGLDFETAKKAFIRVQDLRYLELISSIEERKKRGETNNDLFLADVFSYQGKFHEAAKLYKRSGHENLALEMYTDLCMFEYAKDFLGSGDPKETKMLITKQADWARNIKEPKAAVEMYISAGEHVKAIEICGDHGWVDMLIDIARKLDKAEREPLLLCATYLKKLDSPGYAAETYLKMGDLKSLVQLHVETQRWDEAFALGEKHPEFKDDIYMPYAQWLAENDRFEEAQKAFHKAGRQREAVQVLEQLTNNAVAESRFNDAAYYYWMLSMQCLDIAQDPAQKDTMLGKFYHFQRLAELYHGYHAIHRHTEDPFSVHRPETLFNISRFLLHSLPKDTPSGISKVKILFTLAKQSKALGAYRLARHAYDKLRGLYIPARFQKSIELGTLTIRAKPFHDSEELVPLCYRCSTNNPLLNNLGNVCINCRQPFIFSASSYDVLHLVEFYLEEGITDEEAISLIDLEVLRPKRDDRQLEIANNSSQILRLVETKDSIGDEDPFTAKLSFEQGGSEFVPVVVSRLVLRSMSRRDVLIKRWPPPLRWQYFRSLLPDASITMCPSCFQMFHSEDYELLVLQHGCCPYCRRCKDDPGP,1292,NP_443711.2.csv,refseq-IFT122-NM_052985.3_clinical_seed_0_final,refseq-IFT122-NM_052985.3.a2m,Invitae,refseq-IFT122-NM_052985.3.npy,1,1292,1292
+NP_473368.1,MVKFPALTHYWPLIRFLVPLGITNIAIDFGEQALNRGIAAVKEDAVEMLASYGLAYSLMKFFTGPMSDFKNVGLVFVNSKRDRTKAVLCMVVAGAIAAVFHTLIAYSDLGYYIINKLHHVDESVGSKTRRAFLYLAAFPFMDAMAWTHAGILLKHKYSFLVGCASISDVIAQVVFVAILLHSHLECREPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLALILATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYPAFDKNNPSNKLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCVVPLRIFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLVVLPYLGVHGATLGVGSLLAGFVGESTMVAIAACYVYRKQKKKMENESATEGEDSAMTDMPPTEEVTDIVEMREENE,492,NP_473368.1.csv,refseq-ANKH-NM_054027.4_clinical_seed_0_final,refseq-ANKH-NM_054027.4.a2m,Invitae,refseq-ANKH-NM_054027.4.npy,1,492,492
+NP_476429.2,MSRQASKTSGGGSQGFSGRSAVVSGSSRMSCVAHSGGAGGGAYGFRSGAGGFGSRSLYNLGGNKSISISVAAGGSRAGGFGGGRSSCAFAGGYGGGFGSGYGGGFGGGFGGGRGMGGGFGGAGGFGGAGGFGGAGGFGGPGGFGGSGGFGGPGSLGSPGGFGPGGFPGGIQEVTINQSLLQPLNVEIDPQIGQVKAQEREQIKTLNNKFASFIDKVRFLEQQNKVLETKWNLLQQQGTSSISGTNNLEPLFENHINYLRSYLDNILGERGRLDSELKNMEDLVEDFKKKYEDEINKRTAAENEFVTLKKDVDSAYMNKVELQAKVDALIDEIDFLRTLYDAELSQMQSHISDTSVVLSMDNNRSLDLDSIIAEVRAQYEDIAQRSKAEAEALYQTKLGELQTTAGRHGDDLRNTKSEIIELNRMIQRLRAEIEGVKKQNANLQTAIAEAEQHGEMALKDANAKLQELQAALQQAKDDLARLLRDYQELMNVKLALDVEIATYRKLLEGEEYRMSGECPSAVSISVVSSSTTSASAGGYGGGYGGGMGGGLGGGFSAGGGSGSGFGRGGGGGIGGGFGGGSSGFSGGSGFGSISGARYGVSGGGFSSASNRGGSIKFSQSSQSSQRYSR,628,NP_476429.2.csv,refseq-KRT3-NM_057088.2_clinical_seed_0_final,refseq-KRT3-NM_057088.2.a2m,Invitae,refseq-KRT3-NM_057088.2.npy,1,628,628
+NP_476516.1,MPAVSLPPKENALFKRILRCYEHKQYRNGLKFCKQILSNPKFAEHGETLAMKGLTLNCLGKKEEAYELVRRGLRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWDKDNLQILRDLSLLQIQMRDLEGYRETRYQLLQLRPAQRASWIGYAIAYHLLEDYEMAAKILEEFRKTQQTSPDKVDYEYSELLLYQNQVLREAGLYREALEHLCTYEKQICDKLAVEETKGELLLQLCRLEDAADVYRGLQERNPENWAYYKGLEKALKPANMLERLKIYEEAWTKYPRGLVPRRLPLNFLSGEKFKECLDKFLRMNFSKGCPPVFNTLRSLYKDKEKVAIIEELVVGYETSLKSCRLFNPNDDGKEEPPTTLLWVQYYLAQHYDKIGQPSIALEYINTAIESTPTLIELFLVKAKIYKHAGNIKEAARWMDEAQALDTADRFINSKCAKYMLKANLIKEAEEMCSKFTREGTSAVENLNEMQCMWFQTECAQAYKAMNKFGEALKKCHEIERHFIEITDDQFDFHTYCMRKITLRSYVDLLKLEDVLRQHPFYFKAARIAIEIYLKLHDNPLTDENKEHEADTANMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNLVKNKIETHLFAFEIYFRKEKFLLMLQSVKRAFAIDSSHPWLHECMIRLFNTAVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNETFLKRNSDSLPHRLSAAKMVYYLDPSSQKRAIELATTLDESLTNRNLQTCMEVLEALYDGSLGDCKEAAEIYRANCHKLFPYALAFMPPGYEEDMKITVNGDSSAEAEELANEI,866,NP_476516.1.csv,refseq-NAA15-NM_057175.3_clinical_seed_0_final,refseq-NAA15-NM_057175.3.a2m,Invitae,refseq-NAA15-NM_057175.3.npy,1,866,866
+NP_476517.1,MADEKTFRIGFIVLGLFLLALGTFLMSHDRPQVYGTFYAMGSVMVIGGIIWSMCQCYPKITFVPADSDFQGILSPKAMGLLENGLAAEMKSPSPQPPYVRLWEEAAYDQSLPDFSHIQMKVMSYSEDHRSLLAPEMGQPKLGTSDGGEGGPGDVQAWMEAAVVIHKGSDESEGERRLTQSWPGPLACPQGPAPLASFQDDLDMDSSEGSSPNASPHDREEACSPQQEPQGCRCPLDRFQDFALIDAPTLEDEPQEGQQWEIALPNNWQRYPRTKVEEKEASDTGGEEPEKEEEDLYYGLPDGAGDLLPDKELGFEPDTQG,320,NP_476517.1.csv,refseq-BSND-NM_057176.2_clinical_seed_0_final,refseq-BSND-NM_057176.2.a2m,Invitae,refseq-BSND-NM_057176.2.npy,1,320,320
+NP_477352.3,MAAAPARGGGGGGGGGGGCSGSGSSASRGFYFNTVLSLARSLAVQRPASLEKVQKLLCMCPVDFHGIFQLDERRRDAVIALGIFLIESDLQHKDCVVPYLLRLLKGLPKVYWVEESTARKGRGALPVAESFSFCLVTLLSDVAYRDPSLRDEILEVLLQVLHVLLGMCQALEIQDKEYLCKYAIPCLIGISRAFGRYSNMEESLLSKLFPKIPPHSLRVLEELEGVRRRSFNDFRSILPSNLLTVCQEGTLKRKTSSVSSISQVSPERGMPPPSSPGGSAFHYFEASCLPDGTALEPEYYFSTISSSFSVSPLFNGVTYKEFNIPLEMLRELLNLVKKIVEEAVLKSLDAIVASVMEANPSADLYYTSFSDPLYLTMFKMLRDTLYYMKDLPTSFVKEIHDFVLEQFNTSQGELQKILHDADRIHNELSPLKLRCQANAACVDLMVWAVKDEQGAENLCIKLSEKLQSKTSSKVIIAHLPLLICCLQGLGRLCERFPVVVHSVTPSLRDFLVIPSPVLVKLYKYHSQYHTVAGNDIKISVTNEHSESTLNVMSGKKSQPSMYEQLRDIAIDNICRCLKAGLTVDPVIVEAFLASLSNRLYISQESDKDAHLIPDHTIRALGHIAVALRDTPKVMEPILQILQQKFCQPPSPLDVLIIDQLGCLVITGNQYIYQEVWNLFQQISVKASSVVYSATKDYKDHGYRHCSLAVINALANIAANIQDEHLVDELLMNLLELFVQLGLEGKRASERASEKGPALKASSSAGNLGVLIPVIAVLTRRLPPIKEAKPRLQKLFRDFWLYSVLMGFAVEGSGLWPEEWYEGVCEIATKSPLLTFPSKEPLRSVLQYNSAMKNDTVTPAELSELRSTIINLLDPPPEVSALINKLDFAMSTYLLSVYRLEYMRVLRSTDPDRFQVMFCYFEDKAIQKDKSGMMQCVIAVADKVFDAFLNMMADKAKTKENEEELERHAQFLLVNFNHIHKRIRRVADKYLSGLVDKFPHLLWSGTVLKTMLDILQTLSLSLSADIHKDQPYYDIPDAPYRITVPDTYEARESIVKDFAARCGMILQEAMKWAPTVTKSHLQEYLNKHQNWVSGLSQHTGLAMATESILHFAGYNKQNTTLGATQLSERPACVKKDYSNFMASLNLRNRYAGEVYGMIRFSGTTGQMSDLNKMMVQDLHSALDRSHPQHYTQAMFKLTAMLISSKDCDPQLLHHLCWGPLRMFNEHGMETALACWEWLLAGKDGVEVPFMREMAGAWHMTVEQKFGLFSAEIKEADPLAASEASQPKPCPPEVTPHYIWIDFLVQRFEIAKYCSSDQVEIFSSLLQRSMSLNIGGAKGSMNRHVAAIGPRFKLLTLGLSLLHADVVPNATIRNVLREKIYSTAFDYFSCPPKFPTQGEKRLREDISIMIKFWTAMFSDKKYLTASQLVPPDNQDTRSNLDITVGSRQQATQGWINTYPLSSGMSTISKKSGMSKKTNRGSQLHKYYMKRRTLLLSLLATEIERLITWYNPLSAPELELDQAGENSVANWRSKYISLSEKQWKDNVNLAWSISPYLAVQLPARFKNTEAIGNEVTRLVRLDPGAVSDVPEAIKFLVTWHTIDADAPELSHVLCWAPTDPPTGLSYFSSMYPPHPLTAQYGVKVLRSFPPDAILFYIPQIVQALRYDKMGYVREYILWAASKSQLLAHQFIWNMKTNIYLDEEGHQKDPDIGDLLDQLVEEITGSLSGPAKDFYQREFDFFNKITNVSAIIKPYPKGDERKKACLSALSEVKVQPGCYLPSNPEAIVLDIDYKSGTPMQSAAKAPYLAKFKVKRCGVSELEKEGLRCRSDSEDECSTQEADGQKISWQAAIFKVGDDCRQDMLALQIIDLFKNIFQLVGLDLFVFPYRVVATAPGCGVIECIPDCTSRDQLGRQTDFGMYDYFTRQYGDESTLAFQQARYNFIRSMAAYSLLLFLLQIKDRHNGNIMLDKKGHIIHIDFGFMFESSPGGNLGWEPDIKLTDEMVMIMGGKMEATPFKWFMEMCVRGYLAVRPYMDAVVSLVTLMLDTGLPCFRGQTIKLLKHRFSPNMTEREAANFIMKVIQSCFLSNRSRTYDMIQYYQNDIPY,2102,NP_477352.3.csv,refseq-PI4KA-NM_058004.3_clinical_seed_0_final,refseq-PI4KA-NM_058004.3.a2m,Invitae,refseq-PI4KA-NM_058004.3.npy,1,2102,2102
+NP_477511.1,MAGAAEDARALFRAGVCAALEAWPALQIAVENGFGGVHSQEKAKWLGGAVEDYFMRNADLELDEVEDFLGELLTNEFDTVVEDGSLPQVSQQLQTMFHHFQRGDGAALREMASCITQRKCKVTATALKTARETDEDEDDVDSVEEMEVTATNDGAATDGVCPQPEPSDPDAQTIKEEDIVEDGWTIVRRKK,191,NP_477511.1.csv,refseq-TSR2-NM_058163.2_clinical_seed_0_final,refseq-TSR2-NM_058163.2.a2m,Invitae,refseq-TSR2-NM_058163.2.npy,1,191,191
+NP_477520.2,MVAERSPARSPGSWLFPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSVCVGEEFQIVLSGRGFMLGSRNGSVLCTYTVNETYTTSVKPVSVQLNSMLCPAPILNKAGETLDVSVSFNGGKSVISGSLIVTATECSNGIAAIIVILVLLLLLGIGLMWWFWPLCCKVVIKDPPPPPAPAPKEEEEEPLPTKKWPTVDASYYGGRGVGGIKRMEVRWGDKGSTEEGARLEKAKNAVVKIPEETEEPIRPRPPRPKPTHQPPQTKWYTPIKGRLDALWALLRRQYDRVSLMRPQEGDEGRCINFSRVPSQ,488,NP_477520.2.csv,refseq-ANTXR2-NM_058172.5_clinical_seed_0_final,refseq-ANTXR2-NM_058172.5.a2m,Invitae,refseq-ANTXR2-NM_058172.5.npy,1,488,488
+NP_478059.1,MDAPRQVVNFGPGPAKLPHSVLLEIQKELLDYKGVGISVLEMSHRSSDFAKIINNTENLVRELLAVPDNYKVIFLQGGGCGQFSAVPLNLIGLKAGRCADYVVTGAWSAKAAEEAKKFGTINIVHPKLGSYTKIPDPSTWNLNPDASYVYYCANETVHGVEFDFIPDVKGAVLVCDMSSNFLSKPVDVSKFGVIFAGAQKNVGSAGVTVVIVRDDLLGFALRECPSVLEYKVQAGNSSLYNTPPCFSIYVMGLVLEWIKNNGGAAAMEKLSSIKSQTIYEIIDNSQGFYVCPVEPQNRSKMNIPFRIGNAKGDDALEKRFLDKALELNMLSLKGHRSVGGIRASLYNAVTIEDVQKLAAFMKKFLEMHQL,370,NP_478059.1.csv,refseq-PSAT1-NM_058179.3_clinical_seed_0_final,refseq-PSAT1-NM_058179.3.a2m,Invitae,refseq-PSAT1-NM_058179.3.npy,1,370,370
+NP_478123.1,MRGKTFRFEMQRDLVSFPLSPAVRVKLVSAGFQTAEELLEVKPSELSKEVGISKAEALETLQIIRRECLTNKPRYAGTSESHKKCTALELLEQEHTQGFIITFCSALDDILGGGVPLMKTTEICGAPGVGKTQLCMQLAVDVQIPECFGGVAGEAVFIDTEGSFMVDRVVDLATACIQHLQLIAEKHKGEEHRKALEDFTLDNILSHIYYFRCRDYTELLAQVYLLPDFLSEHSKVRLVIVDGIAFPFRHDLDDLSLRTRLLNGLAQQMISLANNHRLAVILTNQMTTKIDRNQALLVPALGESWGHAATIRLIFHWDRKQRLATLYKSPSQKECTVLFQIKPQGFRDTVVTSACSLQTEGSLSTRKRSRDPEEEL,376,NP_478123.1.csv,refseq-RAD51C-NM_058216.2_clinical_seed_0_final,refseq-RAD51C-NM_058216.2.a2m,Invitae,refseq-RAD51C-NM_058216.2.npy,1,376,376
+NP_490597.1,MSAESGPGTRLRNLPVMGDGLETSQMSTTQAQAQPQPANAASTNPPPPETSNPNKPKRQTNQLQYLLRVVLKTLWKHQFAWPFQQPVDAVKLNLPDYYKIIKTPMDMGTIKKRLENNYYWNAQECIQDFNTMFTNCYIYNKPGDDIVLMAEALEKLFLQKINELPTEETEIMIVQAKGRGRGRKETGTAKPGVSTVPNTTQASTPPQTQTPQPNPPPVQATPHPFPAVTPDLIVQTPVMTVVPPQPLQTPPPVPPQPQPPPAPAPQPVQSHPPIIAATPQPVKTKKGVKRKADTTTPTTIDPIHEPPSLPPEPKTTKLGQRRESSRPVKPPKKDVPDSQQHPAPEKSSKVSEQLKCCSGILKEMFAKKHAAYAWPFYKPVDVEALGLHDYCDIIKHPMDMSTIKSKLEAREYRDAQEFGADVRLMFSNCYKYNPPDHEVVAMARKLQDVFEMRFAKMPDEPEEPVVAVSSPAVPPPTKVVAPPSSSDSSSDSSSDSDSSTDDSEEERAQRLAELQEQLKAVHEQLAALSQPQQNKPKKKEKDKKEKKKEKHKRKEEVEENKKSKAKEPPPKKTKKNNSSNSNVSKKEPAPMKSKPPPTYESEEEDKCKPMSYEEKRQLSLDINKLPGEKLGRVVHIIQSREPSLKNSNPDEIEIDFETLKPSTLRELERYVTSCLRKKRKPQAEKVDVIAGSSKMKGFSSSESESSSESSSSDSEDSETEMAPKSKKKGHPGREQKKHHHHHHQQMQQAPAPVPQQPPPPPQQPPPPPPPQQQQQPPPPPPPPSMPQQAAPAMKSSPPPFIATQVPVLEPQLPGSVFDPIGHFTQPILHLPQPELPPHLPQPPEHSTPPHLNQHAVVSPPALHNALPQQPSRPSNRAAALPPKPARPPAVSPALTQTPLLPQPPMAQPPQVLLEDEEPPAPPLTSMQMQLYLQQLQKVQPPTPLLPSVKVQSQPPPPLPPPPHPSVQQQLQQQPPPPPPPQPQPPPQQQHQPPPRPVHLQPMQFSTHIQQPPPPQGQQPPHPPPGQQPPPPQPAKPQQVIQHHHSPRHHKSDPYSTGHLREAPSPLMIHSPQMSQFQSLTHQSPPQQNVQPKKQELRAASVVQPQPLVVVKEEKIHSPIIRSEPFSPSLRPEPPKHPESIKAPVHLPQRPEMKPVDVGRPVIRPPEQNAPPPGAPDKDKQKQEPKTPVAPKKDLKIKNMGSWASLVQKHPTTPSSTAKSSSDSFEQFRRAAREKEEREKALKAQAEHAEKEKERLRQERMRSREDEDALEQARRAHEEARRRQEQQQQQRQEQQQQQQQQAAAVAAAATPQAQSSQPQSMLDQQRELARKREQERRRREAMAATIDMNFQSDLLSIFEENLF,1362,NP_490597.1.csv,refseq-BRD4-NM_058243.2_clinical_seed_0_final,refseq-BRD4-NM_058243.2.a2m,Invitae,refseq-BRD4-NM_058243.2.npy,1,1362,1362
+NP_490647.1,MVDYYEVLGVQRHASPEDIKKAYRKLALKWHPDKNPENKEEAERKFKQVAEAYEVLSDAKKRDIYDKYGKEGLNGGGGGGSHFDSPFEFGFTFRNPDDVFREFFGGRDPFSFDFFEDPFEDFFGNRRGPRGSRSRGTGSFFSAFSGFPSFGSGFSSFDTGFTSFGSLGHGGLTSFSSTSFGGSGMGNFKSISTSTKMVNGRKITTKRIVENGQERVEVEEDGQLKSLTINGVADDDALAEERMRRGQNALPAQPAGLRPPKPPRPASLLRHAPHCLSEEEGEQDRPRAPGPWDPLASAAGLKEGGKRKKQKQREESKKKKSTKGNH,326,NP_490647.1.csv,refseq-DNAJB6-NM_058246.3_clinical_seed_0_final,refseq-DNAJB6-NM_058246.3.a2m,Invitae,refseq-DNAJB6-NM_058246.3.npy,1,326,326
+NP_510965.1,MATATIALQVNGQQGGGSEPAAAAAVVAAGDKWKPPQGTDSIKMENGQSTAAKLGLPPLTPEQQEALQKAKKYAMEQSIKSVLVKQTIAHQQQQLTNLQMAAVTMGFGDPLSPLQSMAAQRQRALAIMCRVYVGSIYYELGEDTIRQAFAPFGPIKSIDMSWDSVTMKHKGFAFVEYEVPEAAQLALEQMNSVMLGGRNIKVGRPSNIGQAQPIIDQLAEEARAFNRIYVASVHQDLSDDDIKSVFEAFGKIKSCTLARDPTTGKHKGYGFIEYEKAQSSQDAVSSMNLFDLGGQYLRVGKAVTPPMPLLTPATPGGLPPAAAVAAAAATAKITAQEAVAGAAVLGTLGTPGLVSPALTLAQPLGTLPQAVMAAQAPGVITGVTPARPPIPVTIPSVGVVNPILASPPTLGLLEPKKEKEEEELFPESERPEMLSEQEHMSISGSSARHMVMQKLLRKQESTVMVLRNMVDPKDIDDDLEGEVTEECGKFGAVNRVIIYQEKQGEEEDAEIIVKIFVEFSIASETHKAIQALNGRWFAGRKVVAEVYDQERFDNSDLSA,559,NP_510965.1.csv,refseq-PUF60-NM_078480.2_clinical_seed_0_final,refseq-PUF60-NM_078480.2.a2m,Invitae,refseq-PUF60-NM_078480.2.npy,1,559,559
+NP_523353.2,MSASEGMKFKFHSGEKVLCFEPDPTKARVLYDAKIVDVIVGKDEKGRKIPEYLIHFNGWNRSWDRWAAEDHVLRDTDENRRLQRKLARKAVARLRSTGRKKKRCRLPGVDSVLKGLPTEEKDENDENSLSSSSDCSENKDEEISEESDIEEKTEVKEEPELQTRREMEERTITIEIPEVLKKQLEDDCYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHVMPHANMNVHYIPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTSSKFFLPIKESATSTNRSQEELSPSPPLLNPSTPQSTESQPTTGEPATPKRRKAEPEALQSLRRSTRHSANCDRLSESSASPQPKRRQQDTSASMPKLFLHLEKKTPVHSRSSSPIPLTPSKEGSAVFAGFEGRRTNEINEVLSWKLVPDNYPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLFLRFLAEYHDDFFPESAYVAACEAHYSTKNPRAIY,521,NP_523353.2.csv,refseq-MSL3-NM_078629.3_clinical_seed_0_final,refseq-MSL3-NM_078629.3.a2m,Invitae,refseq-MSL3-NM_078629.3.npy,1,521,521
+NP_524144.1,MAPKKDVKKPVAAAAAAPAPAPAPAPAPAPAKPKEEKIDLSAIKIEFSKEQQDEFKEAFLLFDRTGDSKITLSQVGDVLRALGTNPTNAEVRKVLGNPSNEELNAKKIEFEQFLPMMQAISNNKDQATYEDFVEGLRVFDKEGNGTVMGAELRHVLATLGEKMKEEEVEALMAGQEDSNGCINYEAFVKHIMSI,194,NP_524144.1.csv,refseq-MYL1-NM_079420.2_clinical_seed_0_final,refseq-MYL1-NM_079420.2.a2m,Invitae,refseq-MYL1-NM_079420.2.npy,1,194,194
+NP_536350.2,MGVRNCLYGNNMSGQRDIPPEIGEQPEQPPLEAPGAAAPGAGPSPAEEMETEPPHNEPIPVENDGEACGPPEVSRPNFQVLNPAFREAGAHGSYSPPPEEAMPFEAEQPSLGGFWPTLEQPGFPSGVHAGLEAFGPALMEPGAFSGARPGLGGYSPPPEEAMPFEFDQPAQRGCSQLLLQVPDLAPGGPGAAGVPGAPPEEPQALRPAKAGSRGGYSPPPEETMPFELDGEGFGDDSPPPGLSRVIAQVDGSSQFAAVAASSAVRLTPAANAPPLWVPGAIGSPSQEAVRPPSNFTGSSPWMEISGPPFEIGSAPAGVDDTPVNMDSPPIALDGPPIKVSGAPDKRERAERPPVEEEAAEMEGAADAAEGGKVPSPGYGSPAAGAASADTAARAAPAAPADPDSGATPEDPDSGTAPADPDSGAFAADPDSGAAPAAPADPDSGAAPDAPADPDSGAAPDAPADPDAGAAPEAPAAPAAAETRAAHVAPAAPDAGAPTAPAASATRAAQVRRAASAAPASGARRKIHLRPPSPEIQAADPPTPRPTRASAWRGKSESSRGRRVYYDEGVASSDDDSSGDESDDGTSGCLRWFQHRRNRRRRKPQRNLLRNFLVQAFGGCFGRSESPQPKASRSLKVKKVPLAEKRRQMRKEALEKRAQKRAEKKRSKLIDKQLQDEKMGYMCTHRLLLLGAGESGKSTIVKQMRILHVNGFNGEGGEEDPQAARSNSDGEKATKVQDIKNNLKEAIETIVAAMSNLVPPVELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDCAQYFLDKIDVIKQADYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGDGRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYELL,1037,NP_536350.2.csv,GNAS1_HUMAN_b01_clinical_seed_0_final,GNAS1_HUMAN_b01.a2m,EVE,GNAS1_HUMAN_b01_theta_0.2.npy,1,1037,1037
+NP_542119.1,MATLLRSKLSNVATSVSNKSQAKMSGMFARMGFQAATDEEAVGFAHCDDLDFEHRQGLQMDILKAEGEPCGDEGAEAPVEGDIHYQRGSGAPLPPSGSKDQVGGGGEFGGHDKPKITAWEAGWNVTNAIQGMFVLGLPYAILHGGYLGLFLIIFAAVVCCYTGKILIACLYEENEDGEVVRVRDSYVAIANACCAPRFPTLGGRVVNVAQIIELVMTCILYVVVSGNLMYNSFPGLPVSQKSWSIIATAVLLPCAFLKNLKAVSKFSLLCTLAHFVINILVIAYCLSRARDWAWEKVKFYIDVKKFPISIGIIVFSYTSQIFLPSLEGNMQQPSEFHCMMNWTHIAACVLKGLFALVAYLTWADETKEVITDNLPGSIRAVVNIFLVAKALLSYPLPFFAAVEVLEKSLFQEGSRAFFPACYSGDGRLKSWGLTLRCALVVFTLLMAIYVPHFALLMGLTGSLTGAGLCFLLPSLFHLRLLWRKLLWHQVFFDVAIFVIGGICSVSGFVHSLEGLIEAYRTNAED,525,NP_542119.1.csv,refseq-SLC32A1-NM_080552.2_clinical_seed_0_final,refseq-SLC32A1-NM_080552.2.a2m,Invitae,refseq-SLC32A1-NM_080552.2.npy,1,525,525
+NP_542172.2,MKLLRRAWRRRAALGLGTLALCGAALLYLARCAAEPGDPRAMSGRSPPPPAPARAAAFLAVLVASAPRAAERRSVIRSTWLARRGAPGDVWARFAVGTAGLGAEERRALEREQARHGDLLLLPALRDAYENLTAKVLAMLAWLDEHVAFEFVLKADDDSFARLDALLAELRAREPARRRRLYWGFFSGRGRVKPGGRWREAAWQLCDYYLPYALGGGYVLSADLVHYLRLSRDYLRAWHSEDVSLGAWLAPVDVQREHDPRFDTEYRSRGCSNQYLVTHKQSLEDMLEKHATLAREGRLCKREVQLRLSYVYDWSAPPSQCCQRREGIP,329,NP_542172.2.csv,refseq-B3GALT6-NM_080605.3_clinical_seed_0_final,refseq-B3GALT6-NM_080605.3.a2m,Invitae,refseq-B3GALT6-NM_080605.3.npy,1,329,329
+NP_542378.1,MHFSTVTRDMEAFTASSLSSLGAAGGFPGAASPGADPYGPREPPPPPPRYDPCAAAAPGAPGPPPPPHAYPFAPAAGAATSAAAEPEGPGASCAAAAKAPVKKNAKVAGVSVQLEMKALWDEFNQLGTEMIVTKAGRRMFPTFQVKLFGMDPMADYMLLMDFVPVDDKRYRYAFHSSSWLVAGKADPATPGRVHYHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHIILNSMHRYQPRFHVVYVDPRKDSEKYAEENFKTFVFEETRFTAVTAYQNHRITQLKIASNPFAKGFRDCDPEDWPRNHRPGALPLMSAFARSRNPVASPTQPSGTEKDAAEARREFQRDAGGPAVLGDPAHPPQLLARVLSPSLPGAGGAGGLVPLPGAPGGRPSPPNPELRLEAPGASEPLHHHPYKYPAAAYDHYLGAKSRPAPYPLPGLRGHGYHPHAHPHHHHHPVSPAAAAAAAAAAAAAAANMYSSAGAAPPGSYDYCPR,495,NP_542378.1.csv,refseq-TBX1-NM_080647.1_clinical_seed_0_final,refseq-TBX1-NM_080647.1.a2m,Invitae,refseq-TBX1-NM_080647.1.npy,1,495,495
+NP_542400.2,MEGSASPPEKPRARPAAAVLCRGPVEPLVFLANFALVLQGPLTTQYLWHRFSADLGYNGTRQRGGCSNRSADPTMQEVETLTSHWTLYMNVGGFLVGLFSSTLLGAWSDSVGRRPLLVLASLGLLLQALVSVFVVQLQLHVGYFVLGRILCALLGDFGGLLAASFASVADVSSSRSRTFRMALLEASIGVAGMLASLLGGHWLRAQGYANPFWLALALLIAMTLYAAFCFGETLKEPKSTRLFTFRHHRSIVQLYVAPAPEKSRKHLALYSLAIFVVITVHFGAQDILTLYELSTPLCWDSKLIGYGSAAQHLPYLTSLLALKLLQYCLADAWVAEIGLAFNILGMVVFAFATITPLMFTGYGLLFLSLVITPVIRAKLSKLVRETEQGALFSAVACVNSLAMLTASGIFNSLYPATLNFMKGFPFLLGAGLLLIPAVLIGMLEKADPHLEFQQFPQSP,459,NP_542400.2.csv,PCFT_HUMAN_b05_clinical_seed_0_final,PCFT_HUMAN_b05.a2m,EVE,PCFT_HUMAN_b05_theta_0.2.npy,1,459,459
+NP_542411.2,MERCSRCHRLLLLLPLVLGLSAAPGWAGAPPVDVLRALRFPSLPDGVRRAKGICPADVAYRVARPAQLSAPTRQLFPGGFPKDFSLLTVVRTRPGLQAPLLTLYSAQGVRQLGLELGRPVRFLYEDQTGRPQPPSQPVFRGLSLADGKWHRVAVAVKGQSVTLIVDCKKRVTRPLPRSARPVLDTHGVIIFGARILDEEVFEGDVQELAIVPGVQAAYESCEQKELECEGGQRERPQNQQPHRAQRSPQQQPSRLHRPQNQEPQSQPTESLYYDYEPPYYDVMTTGTTPDYQDPTPGEEEEILESSLLPPLEEEQTDLQVPPTADRFQAEEYGEGGTDPPEGPYDYTYGYGDDYREETELGPALSAETAHSGAAAHGPRGLKGEKGEPAVLEPGMLVEGPPGPEGPAGLIGPPGIQGNPGPVGDPGERGPPGRAGLPGSDGAPGPPGTSLMLPFRFGSGGGDKGPVVAAQEAQAQAILQQARLALRGPPGPMGYTGRPGPLGQPGSPGLKGESGDLGPQGPRGPQGLTGPPGKAGRRGRAGADGARGMPGDPGVKGDRGFDGLPGLPGEKGHRGDTGAQGLPGPPGEDGERGDDGEIGPRGLPGESGPRGLLGPKGPPGIPGPPGVRGMDGPQGPKGSLGPQGEPGPPGQQGTPGTQGLPGPQGAIGPHGEKGPQGKPGLPGMPGSDGPPGHPGKEGPPGTKGNQGPSGPQGPLGYPGPRGVKGVDGIRGLKGHKGEKGEDGFPGFKGDIGVKGDRGEVGVPGSRGEDGPEGPKGRTGPTGDPGPPGLMGEKGKLGVPGLPGYPGRQGPKGSLGFPGFPGASGEKGARGLSGKSGPRGERGPTGPRGQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPMGERGHPGPPGPPGEQGLPGTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSPGERGAAGSGGPIGPPGRPGPQGPPGAAGEKGVPGEKGPIGPTGRDGVQGPVGLPGPAGPPGVAGEDGDKGEVGDPGQKGTKGNKGEHGPPGPPGPIGPVGQPGAAGADGEPGARGPQGHFGAKGDEGTRGFNGPPGPIGLQGLPGPSGEKGETGDVGPMGPPGPPGPRGPAGPNGADGPQGPPGGVGNLGPPGEKGEPGESGSPGIQGEPGVKGPRGERGEKGESGQPGEPGPPGPKGPTGDDGPKGNPGPVGFPGDPGPPGEGGPRGQDGAKGDRGEDGEPGQPGSPGPTGENGPPGPLGKRGPAGSPGSEGRQGGKGAKGDPGAIGAPGKTGPVGPAGPAGKPGPDGLRGLPGSVGQQGRPGATGQAGPPGPVGPPGLPGLRGDAGAKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSPGQKGEMGIPGASGPIGPGGPPGLPGPAGPKGAKGATGPGGPKGEKGVQGPPGHPGPPGEVIQPLPIQMPKKTRRSVDGSRLMQEDEAIPTGGAPGSPGGLEEIFGSLDSLREEIEQMRRPTGTQDSPARTCQDLKLCHPELPDGEYWVDPNQGCARDAFRVFCNFTAGGETCVTPRDDVTQFSYVDSEGSPVGVVQLTFLRLLSVSAHQDVSYPCSGAARDGPLRLRGANEDELSPETSPYVKEFRDGCQTQQGRTVLEVRTPVLEQLPVLDASFSDLGAPPRRGGVLLGPVCFMG,1736,NP_542411.2.csv,refseq-COL11A2-NM_080680.2_clinical_seed_0_final,refseq-COL11A2-NM_080680.2.a2m,Invitae,refseq-COL11A2-NM_080680.2_theta_0.2.npy,1,1736,1736
+NP_542437.2,MKKWSSTDLGAAADPLQKDTCPDPLDGDPNSRPPPAKPQLSTAKSRTRLFGKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQDSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEFKDPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSYYKGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFGELPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNTADNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGVLAYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVIAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIFTMAAYYRPVDGLPPFKMEKTGDYFRVTGEILSVLGGVYFFFRGIQYFLQRRPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSLALGWTNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVTLIEDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLEFTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWKLQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCFRVDEVNWTTWNTNVGIINEDPGNCEGVKRTLSFSLRSSRVSGRHWKNFALVPLLREASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK,839,NP_542437.2.csv,refseq-TRPV1-NM_080706.3_clinical_seed_0_final,refseq-TRPV1-NM_080706.3.a2m,Invitae,refseq-TRPV1-NM_080706.3.npy,1,839,839
+NP_543136.1,MSDLGSEELEEEGENDIGEYEGGRNEAGERHGRGRARLPNGDTYEGSYEFGKRHGQGIYKFKNGARYIGEYVRNKKHGQGTFIYPDGSRYEGEWANDLRHGHGVYYYINNDTYTGEWFAHQRHGQGTYLYAETGSKYVGTWVNGQQEGTAELIHLNHRYQGKFLNKNPVGPGKYVFDVGCEQHGEYRLTDMERGEEEEEEELVTVVPKWKATQITELALWTPTLPKKPTSTDGPGQDAPGAESAGEPGEEAQALLEGFEGEMDMRPGDEDADVLREESREYDQEEFRYDMDEGNINSEEEETRQSDLQD,309,NP_543136.1.csv,refseq-RSPH1-NM_080860.3_clinical_seed_0_final,refseq-RSPH1-NM_080860.3.a2m,Invitae,refseq-RSPH1-NM_080860.3.npy,1,309,309
+NP_550438.1,MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTKTYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL,277,NP_550438.1.csv,refseq-DGUOK-NM_080916.2_clinical_seed_0_final,refseq-DGUOK-NM_080916.2.a2m,Invitae,refseq-DGUOK-NM_080916.2.npy,1,277,277
+NP_569157.2,MGKRGRPRKEARCEGAGLAPAAPPAVPPAVAAPQPPALPEDPAGAKPRCPFSDIFNTSENSMEKHINTFLQNVQILLEAASYLEQIEKENKKCEHGYASSFPSMPSPRLQHSKPPRRLSRAQKHSSGSSNTSTANRSTHNELEKNRRAHLRLCLERLKVLIPLGPDCTRHTTLGLLNKAKAHIKKLEEAERKSQHQLENLEREQRFLKWRLEQLQGPQEMERIRMDSIGSTISSDRSDSEREEIEVDVESTEFSHGEVDNISTTSISDIDDHSSLPSIGSDEGYSSASVKLSFTS,295,NP_569157.2.csv,refseq-MXI1-NM_130439.3_clinical_seed_0_final,refseq-MXI1-NM_130439.3.a2m,Invitae,refseq-MXI1-NM_130439.3_theta_0.2.npy,1,295,295
+NP_569733.2,MFTLSQTSRAWFIDRARQAREERLVQKERERAAVVIQAHVRSFLCRSRLQRDIRREIDDFFKADDPESTKRSALCIFKIARKLLFLFRIKEDNERFEKLCRSILSSMDAENEPKVWYVSLACSKDLTLLWIQQIKNILWYCCDFLKQLKPEILQDSRLITLYLTMLVTFTDTSTWKILRGKGESLRPAMNHICANIMGHLNQHGFYSVLQILLTRGLARPRPCLSKGTLTAAFSLALRPVIAAQFSDNLIRPFLIHIMSVPALVTHLSTVTPERLTVLESHDMLRKFIIFLRDQDRCRDVCESLEGCHTLCLMGNLLHLGSLSPRVLEEETDGFVSLLTQTLCYCRKYVSQKKSNLTHWHPVLGWFSQSVDYGLNESMHLITKQLQFLWGVPLIRIFFCDILSKKLLESQEPAHAQPASPQNVLPVKSLLKRAFQKSASVRNILRPVGGKRVDSAEVQKVCNICVLYQTSLTTLTQIRLQILTGLTYLDDLLPKLWAFICELGPHGGLKLFLECLNNDTEESKQLLAMLMLFCDCSRHLITILDDIEVYEEQISFKLEELVTISSFLNSFVFKMIWDGIVENAKGETLELFQSVHGWLMVLYERDCRRRFTPEDHWLRKDLKPSVLFQELDRDRKRAQLILQYIPHVIPHKNRVLLFRTMVTKEKEKLGLVETSSASPHVTHITIRRSRMLEDGYEQLRQLSQHAMKGVIRVKFVNDLGVDEAGIDQDGVFKEFLEEIIKRVFDPALNLFKTTSGDERLYPSPTSYIHENYLQLFEFVGKMLGKAVYEGIVVDVPFASFFLSQLLGHHHSVFYSSVDELPSLDSEFYKNLTSIKRYDGDITDLGLTLSYDEDVMGQLVCHELIPGGKTIPVTNENKISYIHLMAHFRMHTQIKNQTAALISGFRSIIKPEWIRMFSTPELQRLISGDNAEIDLEDLKKHTVYYGGFHGSHRVIIWLWDILASDFTPDERAMFLKFVTSCSRPPLLGFAYLKPPFSIRCVEVSDDQDTGDTLGSVLRGFFTIRKREPGGRLPTSSTCFNLLKLPNYSKKSVLREKLRYAISMNTGFELS,1068,NP_569733.2.csv,refseq-UBE3B-NM_130466.3_clinical_seed_0_final,refseq-UBE3B-NM_130466.3.a2m,Invitae,refseq-UBE3B-NM_130466.3.npy,1,1068,1068
+NP_569735.1,MFPRPLTPLAAPNGAEPLGRALRRAPLGRARAGLGGPPLLLPSMLMFAVIVASSGLLLMIERGILAEMKPLPLHPPGREGTAWRGKAPKPGGLSLRAGDADLQVRQDVRNRTLRAVCGQPGMPRDPWDLPVGQRRTLLRHILVSDRYRFLYCYVPKVACSNWKRVMKVLAGVLDSVDVRLKMDHRSDLVFLADLRPEEIRYRLQHYFKFLFVREPLERLLSAYRNKFGEIREYQQRYGAEIVRRYRAGAGPSPAGDDVTFPEFLRYLVDEDPERMNEHWMPVYHLCQPCAVHYDFVGSYERLEADANQVLEWVRAPPHVRFPARQAWYRPASPESLHYHLCSAPRALLQDVLPKYILDFSLFAYPLPNVTKEACQQ,376,NP_569735.1.csv,refseq-CHST14-NM_130468.3_clinical_seed_0_final,refseq-CHST14-NM_130468.3.a2m,Invitae,refseq-CHST14-NM_130468.3.npy,1,376,376
+NP_570718.1,MIVFLVFKHLFSLRLITMFFLLHFIVLINVKDFALTQGSMITPSCQKGYFPCGNLTKCLPRAFHCDGKDDCGNGADEENCGDTSGWATIFGTVHGNANSVALTQECFLKQYPQCCDCKETELECVNGDLKSVPMISNNVTLLSLKKNKIHSLPDKVFIKYTKLKKIFLQHNCIRHISRKAFFGLCNLQILYLNHNCITTLRPGIFKDLHQLTWLILDDNPITRISQRLFTGLNSLFFLSMVNNYLEALPKQMCAQMPQLNWVDLEGNRIKYLTNSTFLSCDSLTVLFLPRNQIGFVPEKTFSSLKNLGELDLSSNTITELSPHLFKDLKLLQKLNLSSNPLMYLHKNQFESLKQLQSLDLERIEIPNINTRMFQPMKNLSHIYFKNFRYCSYAPHVRICMPLTDGISSFEDLLANNILRIFVWVIAFITCFGNLFVIGMRSFIKAENTTHAMSIKILCCADCLMGVYLFFVGIFDIKYRGQYQKYALLWMESVQCRLMGFLAMLSTEVSVLLLTYLTLEKFLVIVFPFSNIRPGKRQTSVILICIWMAGFLIAVIPFWNKDYFGNFYGKNGVCFPLYYDQTEDIGSKGYSLGIFLGVNLLAFLIIVFSYITMFCSIQKTALQTTEVRNCFGREVAVANRFFFIVFSDAICWIPVFVVKILSLFRVEIPDTMTSWIVIFFLPVNSALNPILYTLTTNFFKDKLKQLLHKHQRKSIFKIKKKSLSTSIVWIEDSSSLKLGVLNKITLGDSIMKPVS,754,NP_570718.1.csv,refseq-RXFP2-NM_130806.3_clinical_seed_0_final,refseq-RXFP2-NM_130806.3.a2m,Invitae,refseq-RXFP2-NM_130806.3.npy,1,754,754
+NP_570722.2,MPLQVSDYSWQQTKTAVFLSLPLKGVCVRDTDVFCTENYLKVNFPPFLFEAFLYAPIDDESSKAKIGNDTIVFTLYKKEAAMWETLSVTGVDKEMMQRIREKSILQAQERAKEATEAKAAAKREDQKYALSVMMKIEEEERKKIEDMKENERIKATKALEAWKEYQRKAEEQKKIQREEKLCQKEKQIKEERKKIKYKSLTRNLASRNLAPKGRNSENIFTEKLKEDSIPAPRSVGSIKINFTPRVFPTALRESQVAEEEEWLHKQAEARRAMNTDIAELCDLKEEEKNPEWLKDKGNKLFATENYLAAINAYNLAIRLNNKMPLLYLNRAACHLKLKNLHKAIEDSSKALELLMPPVTDNANARMKAHVRRGTAFCQLELYVEGLQDYEAALKIDPSNKIVQIDAEKIRNVIQGTELKS,420,NP_570722.2.csv,refseq-DNAAF4-NM_130810.3_clinical_seed_0_final,refseq-DNAAF4-NM_130810.3.a2m,Invitae,refseq-DNAAF4-NM_130810.3.npy,1,420,420
+NP_570824.1,MAEDADMRNELEEMQRRADQLADESLESTRRMLQLVEESKDAGIRTLVMLDEQGEQLERIEEGMDQINKDMKEAEKNLTDLGKFCGLCVCPCNKLKSSDAYKKAWGNNQDGVVASQPARVVDEREQMAISGGFIRRVTNDARENEMDENLEQVSGIIGNLRHMALDMGNEIDTQNRQIDRIMEKADSNKTRIDEANQRATKMLGSG,206,NP_570824.1.csv,refseq-SNAP25-NM_130811.2_clinical_seed_0_final,refseq-SNAP25-NM_130811.2.a2m,Invitae,refseq-SNAP25-NM_130811.2.npy,1,206,206
+NP_570853.1,MKRAAAKHLIERYYHQLTEGCGNEACTNEFCASCPTFLRMDNNAAAIKALELYKINAKLCDPHPSKKGASSAYLENSKGAPNNSCSEIKMNKKGARIDFKDVTYLTEEKVYEILELCREREDYSPLIRVIGRVFSSAEALVQSFRKVKQHTKEELKSLQAKDEDKDEDEKEKAACSAAAMEEDSEASSSRIGDSSQGDNNLQKLGPDDVSVDIDAIRRVYTRLLSNEKIETAFLNALVYLSPNVECDLTYHNVYSRDPNYLNLFIIVMENRNLHSPEYLEMALPLFCKAMSKLPLAAQGKLIRLWSKYNADQIRRMMETFQQLITYKVISNEFNSRNLVNDDDAIVAASKCLKMVYYANVVGGEVDTNHNEEDDEEPIPESSELTLQELLGEERRNKKGPRVDPLETELGVKTLDCRKPLIPFEEFINEPLNEVLEMDKDYTFFKVETENKFSFMTCPFILNAVTKNLGLYYDNRIRMYSERRITVLYSLVQGQQLNPYLRLKVRRDHIIDDALVRLEMIAMENPADLKKQLYVEFEGEQGVDEGGVSKEFFQLVVEEIFNPDIGMFTYDESTKLFWFNPSSFETEGQFTLIGIVLGLAIYNNCILDVHFPMVVYRKLMGKKGTFRDLGDSHPVLYQSLKDLLEYEGNVEDDMMITFQISQTDLFGNPMMYDLKENGDKIPITNENRKEFVNLYSDYILNKSVEKQFKAFRRGFHMVTNESPLKYLFRPEEIELLICGSRNLDFQALEETTEYDGGYTRDSVLIREFWEIVHSFTDEQKRLFLQFTTGTDRAPVGGLGKLKMIIAKNGPDTERLPTSHTCFNVLLLPEYSSKEKLKERLLKAITYAKGFGML,852,NP_570853.1.csv,refseq-UBE3A-NM_130838.1_clinical_seed_0_final,refseq-UBE3A-NM_130838.1.a2m,Invitae,refseq-UBE3A-NM_130838.1.npy,1,852,852
+NP_570901.3,MASLVSLELGLLLAVLVVTATASPPAGLLSLLTSGQGALDQEALGGLLNTLADRVHCANGPCGKCLSVEDALGLGEPEGSGLPPGPVLEARYVARLSAAAVLYLSNPEGTCEDARAGLWASHADHLLALLESPKALTPGLSWLLQRMQARAAGQTPKMACVDIPQLLEEAVGAGAPGSAGGVLAALLDHVRSGSCFHALPSPQYFVDFVFQQHSSEVPMTLAELSALMQRLGVGREAHSDHSHRHRGASSRDPVPLISSSNSSSVWDTVCLSARDVMAAYGLSEQAGVTPEAWAQLSPALLQQQLSGACTSQSRPPVQDQLSQSERYLYGSLATLLICLCAVFGLLLLTCTGCRGVTHYILQTFLSLAVGAVTGDAVLHLTPKVLGLHTHSEEGLSPQPTWRLLAMLAGLYAFFLFENLFNLLLPRDPEDLEDGPCGHSSHSHGGHSHGVSLQLAPSELRQPKPPHEGSRADLVAEESPELLNPEPRRLSPELRLLPYMITLGDAVHNFADGLAVGAAFASSWKTGLATSLAVFCHELPHELGDFAALLHAGLSVRQALLLNLASALTAFAGLYVALAVGVSEESEAWILAVATGLFLYVALCDMLPAMLKVRDPRPWLLFLLHNVGLLGGWTVLLLLSLYEDDITF,647,NP_570901.3.csv,S39A4_HUMAN_b01_clinical_seed_0_final,S39A4_HUMAN_b01.a2m,EVE,S39A4_HUMAN_b01_theta_0.2.npy,1,647,647
+NP_570971.1,MVSSVLPNPTSAECWAALLHDPMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAALSDYEQLRQVHTANLPHVFNEGRGPKQPEREPQPGHKVERPCLQRQDDIAQEKRLSRGISHASSAIVSLARSHVASECNNEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDLIEQATMVALEQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRKALYTMMRTGAEREALKRRWRWQQTQQNKEEEWEREWTELLKLASSEPRTHFSKNGGTGGGVDNSEDPVYESLEEFHVFVLAHILRRPIVVVADTMLRDSGGEAFAPIPFGGIYLPLEVPPNRCHCSPLVLAYDQAHFSALVSMEQRDQQREQAVIPLTDSEHKLLPLHFAVDPGKDWEWGKDDNDNARLAHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTASAGEDVQSLADSLDSDRDSVCSNSNSNNGKNGKDKEKEKQRKEKDKTRADSVANKLGSFSKTLGIKLKKNMGGLGGLVHGKMGRANSANGKNGDSAERGKEKKAKSRKGSKEESGASASTSPSEKTTPSPTDKAAGASPAEKGGGPRGDAWKYSTDVKLSLNILRAAMQGERKFIFAGLLLTSHRHQFHEEMIGYYLTSAQERFSAEQEQRRRDAATAAAAAAAAAAATAKRPPRRPETEGVPVPERASPGPPTQLVLKLKERPSPGPAAGRAARAAAGGTASPGGGARRASASGPVPGRSPPAPARQSVIHVQASGARDEACAPAVGALRPCATYPQQNRSLSSQSYSPARAAALRTVNTVESLARAVPGALPGAAGTAGAAEHKSQTYTNGFGALRDGLEFADADAPTARSNGECGRGGPGPVQRRCQRENCAFYGRAETEHYCSYCYREELRRRREARGARP,926,NP_570971.1.csv,refseq-OTUD7A-NM_130901.2_clinical_seed_0_final,refseq-OTUD7A-NM_130901.2.a2m,Invitae,refseq-OTUD7A-NM_130901.2.npy,1,926,926
+NP_573400.3,MASLAALALSLLLRLQLPPLPGARAQSAAGGCSFDEHYSNCGYSVALGTNGFTWEQINTWEKPMLDQAVPTGSFMMVNSSGRASGQKAHLLLPTLKENDTHCIDFHYYFSSRDRSSPGALNVYVKVNGGPQGNPVWNVSGVVTEGWVKAELAISTFWPHFYQVIFESVSLKGHPGYIAVDEVRVLAHPCRKAPHFLRLQNVEVNVGQNATFQCIAGGKWSQHDKLWLQQWNGRDTALMVTRVVNHRRFSATVSVADTAQRSVSKYRCVIRSDGGSGVSNYAELIVKEPPTPIAPPELLAVGATYLWIKPNANSIIGDGPIILKEVEYRTTTGTWAETHIVDSPNYKLWHLDPDVEYEIRVLLTRPGEGGTGPPGPPLTTRTKCADPVHGPQNVEIVDIRARQLTLQWEPFGYAVTRCHSYNLTVQYQYVFNQQQYEAEEVIQTSSHYTLRGLRPFMTIRLRLLLSNPEGRMESEELVVQTEEDVPGAVPLESIQGGPFEEKIYIQWKPPNETNGVITLYEINYKAVGSLDPSADLSSQRGKVFKLRNETHHLFVGLYPGTTYSFTIKASTAKGFGPPVTTRIATKISAPSMPEYDTDTPLNETDTTITVMLKPAQSRGAPVSVYQLVVKEERLQKSRRAADIIECFSVPVSYRNASSLDSLHYFAAELKPANLPVTQPFTVGDNKTYNGYWNPPLSPLKSYSIYFQALSKANGETKINCVRLATKAPMGSAQVTPGTPLCLLTTGASTQNSNTVEPEKQVDNTVKMAGVIAGLLMFIIILLGVMLTIKRRRNAYSYSYYLKLAKKQKETQSGAQREMGPVASADKPTTKLSASRNDEGFSSSSQDVNGFTDGSRGELSQPTLTIQTHPYRTCDPVEMSYPRDQFQPAIRVADLLQHITQMKRGQGYGFKEEYEALPEGQTASWDTAKEDENRNKNRYGNIISYDHSRVRLLVLDGDPHSDYINANYIDGYHRPRHYIATQGPMQETVKDFWRMIWQENSASIVMVTNLVEVGRVKCVRYWPDDTEVYGDIKVTLIETEPLAEYVIRTFTVQKKGYHEIRELRLFHFTSWPDHGVPCYATGLLGFVRQVKFLNPPEAGPIVVHCSAGAGRTGCFIAIDTMLDMAENEGVVDIFNCVRELRAQRVNLVQTEEQYVFVHDAILEACLCGNTAIPVCEFRSLYYNISRLDPQTNSSQIKDEFQTLNIVTPRVRPEDCSIGLLPRNHDKNRSMDVLPLDRCLPFLISVDGESSNYINAALMDSHKQPAAFVVTQHPLPNTVADFWRLVFDYNCSSVVMLNEMDTAQFCMQYWPEKTSGCYGPIQVEFVSADIDEDIIHRIFRICNMARPQDGYRIVQHLQYIGWPAYRDTPPSKRSLLKVVRRLEKWQEQYDGREGRTVVHCLNGGGRSGTFCAICSVCEMIQQQNIIDVFHIVKTLRNNKSNMVETLEQYKFVYEVALEYLSSF,1460,NP_573400.3.csv,refseq-PTPRT-NM_133170.4_clinical_seed_0_final,refseq-PTPRT-NM_133170.4.a2m,Invitae,refseq-PTPRT-NM_133170.4_theta_0.2.npy,1,1460,1460
+NP_573566.2,MAALLRSARWLLRAGAAPRLPLSLRLLPGGPGRLHAASYLPAARAGPVAGGLLSPARLYAIAAKEKDIQEESTFSSRKISNQFDWALMRLDLSVRRTGRIPKKLLQKVFNDTCRSGGLGGSHALLLLRSCGSLLPELKLEERTEFAHRIWDTLQKLGAVYDVSHYNALLKVYLQNEYKFSPTDFLAKMEEANIQPNRVTYQRLIASYCNVGDIEGASKILGFMKTKDLPVTEAVFSALVTGHARAGDMENAENILTVMRDAGIEPGPDTYLALLNAYAEKGDIDHVKQTLEKVEKSELHLMDRDLLQIIFSFSKAGYPQYVSEILEKVTCERRYIPDAMNLILLLVTEKLEDVALQILLACPVSKEDGPSVFGSFFLQHCVTMNTPVEKLTDYCKKLKEVQMHSFPLQFTLHCALLANKTDLAKALMKAVKEEGFPIRPHYFWPLLVGRRKEKNVQGIIEILKGMQELGVHPDQETYTDYVIPCFDSVNSARAILQENGCLSDSDMFSQAGLRSEAANGNLDFVLSFLKSNTLPISLQSIRSSLLLGFRRSMNINLWSEITELLYKDGRYCQEPRGPTEAVGYFLYNLIDSMSDSEVQAKEEHLRQYFHQLEKMNVKIPENIYRGIRNLLESYHVPELIKDAHLLVESKNLDFQKTVQLTSSELESTLETLKAENQPIRDVLKQLILVLCSEENMQKALELKAKYESDMVTGGYAALINLCCRHDKVEDALNLKEEFDRLDSSAVLDTGKYVGLVRVLAKHGKLQDAINILKEMKEKDVLIKDTTALSFFHMLNGAALRGEIETVKQLHEAIVTLGLAEPSTNISFPLVTVHLEKGDLSTALEVAIDCYEKYKVLPRIHDVLCKLVEKGETDLIQKAMDFVSQEQGEMVMLYDLFFAFLQTGNYKEAKKIIETPGIRARSARLQWFCDRCVANNQVETLEKLVELTQKLFECDRDQMYYNLLKLYKINGDWQRADAVWNKIQEENVIPREKTLRLLAEILREGNQEVPFDVPELWYEDEKHSLNSSSASTTEPDFQKDILIACRLNQKKGAYDIFLNAKEQNIVFNAETYSNLIKLLMSEDYFTQAMEVKAFAETHIKGFTLNDAANSRLIITQVRRDYLKEAVTTLKTVLDQQQTPSRLAVTRVIQALAMKGDVENIEVVQKMLNGLEDSIGLSKMVFINNIALAQIKNNNIDAAIENIENMLTSENKVIEPQYFGLAYLFRKVIEEQLEPAVEKISIMAERLANQFAIYKPVTDFFLQLVDAGKVDDARALLQRCGAIAEQTPILLLFLLRNSRKQGKASTVKSVLELIPELNEKEEAYNSLMKSYVSEKDVTSAKALYEHLTAKNTKLDDLFLKRYASLLKYAGEPVPFIEPPESFEFYAQQLRKLRENSS,1394,NP_573566.2.csv,refseq-LRPPRC-NM_133259.3_clinical_seed_0_final,refseq-LRPPRC-NM_133259.3.a2m,Invitae,refseq-LRPPRC-NM_133259.3.npy,1,1394,1394
+NP_573568.1,MEGAAAREARGTETPRASAPPPAPSEPPAAPRARPRLVFRTQLAHGSPTGKIEGFTNVRELYAKIAEAFGIAPTEILFCTLNSHKVDMQKLLGGQIGLEDFIFAHVRGETKEVEVTKTEDALGLTITDNGAGYAFIKRIKEGSIINRIEAVCVGDSIEAINDHSIVGCRHYEVAKMLRELPKSQPFTLRLVQPKRAFDMIGQRSRSSKCPVEAKVTSGRETLRLRSGGAATVEEAPSEFEEEASRKVDDLLESYMGIRDPELASTMVETSKKTASAQEFARCLDSVLGEFAFPDEFVVEVWAAIGEAREACG,312,NP_573568.1.csv,refseq-GIPC3-NM_133261.2_clinical_seed_0_final,refseq-GIPC3-NM_133261.2.a2m,Invitae,refseq-GIPC3-NM_133261.2.npy,1,312,312
+NP_579877.1,MEFSIKQSPLSVQSVVKCIKMKQAPEILGSANGKTPSCEVNRECSVFLSKAQLSSSLQEGVMQKFNGHDALPFIPADKLKDLTSRVFNGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNGSPEIKLKITKTYMNGKPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLEQGLVEAALVSKISSPSDKKIPAKKESCPNTGRDKDHLLKYNVGDLVWSKVSGYPWWPCMVSADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAWIFEKSLVAFEGEGQFEKLCQESAKQAPTKAEKIKLLKPISGKLRAQWEMGIVQAEEAASMSVEERKAKFTFLYVGDQLHLNPQVAKEAGIAAESLGEMAESSGVSEEAAENPKSVREECIPMKRRRRAKLCSSAETLESHPDIGKSTPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVFCQKHRDEVVAEHPDASGEEIEELLRSQWSLLSEKQRARYNTKFALVAPVQAEEDSGNVNGKKRNHTKRIQDPTEDAEAEDTPRKRLRTDKHSLRKRDTITDKTARTSSYKAMEAASSLKSQAATKNLSDACKPLKKRNRASTAASSALGFSKSSSPSASLTENEVSDSPGDEPSESPYESADETQTEVSVSSKKSERGVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLGLSRRPEGRFTCSECASGIHSCFVCKESKTDVKRCVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYHSGDACLAAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWVKLGNYRWWPAEVCHPKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIGRVFKNALQEAEARFREIKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKTSTTLSSEEKGKKTKKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCTPDGRSYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTEGK,1365,NP_579877.1.csv,refseq-NSD2-NM_133330.2_clinical_seed_0_final,refseq-NSD2-NM_133330.2.a2m,Invitae,refseq-NSD2-NM_133330.2.npy,1,1365,1365
+NP_597680.2,MCGIWALFGSDDCLSVQCLSAMKIAHRGPDAFRFENVNGYTNCCFGFHRLAVVDPLFGMQPIRVKKYPYLWLCYNGEIYNHKKMQQHFEFEYQTKVDGEIILHLYDKGGIEQTICMLDGVFAFVLLDTANKKVFLGRDTYGVRPLFKAMTEDGFLAVCSEAKGLVTLKHSATPFLKVEPFLPGHYEVLDLKPNGKVASVEMVKYHHCRDVPLHALYDNVEKLFPGFEIETVKNNLRILFNNAVKKRLMTDRRIGCLLSGGLDSSLVAATLLKQLKEAQVQYPLQTFAIGMEDSPDLLAARKVADHIGSEHYEVLFNSEEGIQALDEVIFSLETYDITTVRASVGMYLISKYIRKNTDSVVIFSGEGSDELTQGYIYFHKAPSPEKAEEESERLLRELYLFDVLRADRTTAAHGLELRVPFLDHRFSSYYLSLPPEMRIPKNGIEKHLLRETFEDSNLIPKEILWRPKEAFSDGITSVKNSWFKILQEYVEHQVDDAMMANAAQKFPFNTPKTKEGYYYRQVFERHYPGRADWLSHYWMPKWINATDPSARTLTHYKSAVKA,561,NP_597680.2.csv,refseq-ASNS-NM_133436.3_clinical_seed_0_final,refseq-ASNS-NM_133436.3.a2m,Invitae,refseq-ASNS-NM_133436.3.npy,1,561,561
+NP_597700.1,MQRAAALVRRGCGPRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAVEYAVRGPIVLKAGEIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSPSFPEDAKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDGISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRRAVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPDCRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKLLSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGIHCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGTYHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA,523,NP_597700.1.csv,refseq-GPT2-NM_133443.3_clinical_seed_0_final,refseq-GPT2-NM_133443.3.a2m,Invitae,refseq-GPT2-NM_133443.3.npy,1,523,523
+NP_597716.1,MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCLKSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYDRERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKYPNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQGRRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSYPEAMDLGSGDDHPRRTETRDLRAPRDFYP,406,NP_597716.1.csv,refseq-CCBE1-NM_133459.3_clinical_seed_0_final,refseq-CCBE1-NM_133459.3.a2m,Invitae,refseq-CCBE1-NM_133459.3.npy,1,406,406
+NP_597725.1,MLWFSGVGALAERYCRRSPGITCCVLLLLNCSGVPMSLASSFLTGSVAKCENEGEVLQIPFITDNPCIMCVCLNKEVTCKREKCPVLSRDCALAIKQRGACCEQCKGCTYEGNTYNSSFKWQSPAEPCVLRQCQEGVVTESGVRCVVHCKNPLEHLGMCCPTCPGCVFEGVQYQEGEEFQPEGSKCTKCSCTGGRTQCVREVCPILSCPQHLSHIPPGQCCPKCLGQRKVFDLPFGSCLFRSDVYDNGSSFLYDNCTACTCRDSTVVCKRKCSHPGGCDQGQEGCCEECLLRVPPEDIKVCKFGNKIFQDGEMWSSINCTICACVKGRTECRNKQCIPISSCPQGKILNRKGCCPICTEKPGVCTVFGDPHYNTFDGRTFNFQGTCQYVLTKDCSSPASPFQVLVKNDARRTRSFSWTKSVELVLGESRVSLQQHLTVRWNGSRIALPCRAPHFHIDLDGYLLKVTTKAGLEISWDGDSFVEVMAAPHLKGKLCGLCGNYNGHKRDDLIGGDGNFKFDVDDFAESWRVESNEFCNRPQRKPVPELCQGTVKVKLRAHRECQKLKSWEFQTCHSTVDYATFYRSCVTDMCECPVHKNCYCESFLAYTRACQREGIKVHWEPQQNCAATQCKHGAVYDTCGPGCIKTCDNWNEIGPCNKPCVAGCHCPANLVLHKGRCIKPVLCPQR,685,NP_597725.1.csv,refseq-BMPER-NM_133468.4_clinical_seed_0_final,refseq-BMPER-NM_133468.4.a2m,Invitae,refseq-BMPER-NM_133468.4.npy,1,685,685
+NP_598004.1,MLKQSERRRSWSYRPWNTTENEGSQHRRSICSLGARSGSQASIHGWTEGNYNYYIEEDEDGEEEDQWKDDLAEEDQQAGEVTTAKPEGPSDPPALLSTLNVNVGGHSYQLDYCELAGFPKTRLGRLATSTSRSRQLSLCDDYEEQTDEYFFDRDPAVFQLVYNFYLSGVLLVLDGLCPRRFLEELGYWGVRLKYTPRCCRICFEERRDELSERLKIQHELRAQAQVEEAEELFRDMRFYGPQRRRLWNLMEKPFSSVAAKAIGVASSTFVLVSVVALALNTVEEMQQHSGQGEGGPDLRPILEHVEMLCMGFFTLEYLLRLASTPDLRRFARSALNLVDLVAILPLYLQLLLECFTGEGHQRGQTVGSVGKVGQVLRVMRLMRIFRILKLARHSTGLRAFGFTLRQCYQQVGCLLLFIAMGIFTFSAAVYSVEHDVPSTNFTTIPHSWWWAAVSISTVGYGDMYPETHLGRFFAFLCIAFGIILNGMPISILYNKFSDYYSKLKAYEYTTIRRERGEVNFMQRARKKIAECLLGSNPQLTPRQEN,545,NP_598004.1.csv,refseq-KCNV2-NM_133497.3_clinical_seed_0_final,refseq-KCNV2-NM_133497.3.a2m,Invitae,refseq-KCNV2-NM_133497.3.npy,1,545,545
+NP_598006.1,MNYLRRRLSDSNFMANLPNGYMTDLQRPQPPPPPPGAHSPGATPGPGTATAERSSGVAPAASPAAPSPGSSGGGGFFSSLSNAVKQTTAAAAATFSEQVGGGSGGAGRGGAASRVLLVIDEPHTDWAKYFKGKKIHGEIDIKVEQAEFSDLNLVAHANGGFSVDMEVLRNGVKVVRSLKPDFVLIRQHAFSMARNGDYRSLVIGLQYAGIPSVNSLHSVYNFCDKPWVFAQMVRLHKKLGTEEFPLIDQTFYPNHKEMLSSTTYPVVVKMGHAHSGMGKVKVDNQHDFQDIASVVALTKTYATAEPFIDAKYDVRVQKIGQNYKAYMRTSVSGNWKTNTGSAMLEQIAMSDRYKLWVDTCSEIFGGLDICAVEALHGKDGRDHIIEVVGSSMPLIGDHQDEDKQLIVELVVNKMAQALPRQRQRDASPGRGSHGQTPSPGALPLGRQTSQQPAGPPAQQRPPPQGGPPQPGPGPQRQGPPLQQRPPPQGQQHLSGLGPPAGSPLPQRLPSPTSAPQQPASQAAPPTQGQGRQSRPVAGGPGAPPAARPPASPSPQRQAGPPQATRQTSVSGPAPPKASGAPPGGQQRQGPPQKPPGPAGPTRQASQAGPVPRTGPPTTQQPRPSGPGPAGRPKPQLAQKPSQDVPPPATAAAGGPPHPQLKASPAQAQP,669,NP_598006.1.csv,refseq-SYN1-NM_133499.2_clinical_seed_0_final,refseq-SYN1-NM_133499.2.a2m,Invitae,refseq-SYN1-NM_133499.2.npy,1,669,669
+NP_598399.2,MWRGLWTLAAQAARGPRRLCTRRSSGAPAPGSGATIFALSSGQGRCGIAVIRTSGPASGHALRILTAPRDLPLARHASLRLLSDPRSGEPLDRALVLWFPGPQSFTGEDCVEFHVHGGPAVVSGVLQALGSVPGLRPAEAGEFTRRAFANGKLNLTEVEGLADLIHAETEAQRRQALRQLDGELGHLCRGWAETLTKALAHVEAYIDFGEDDNLEEGVLEQGGSTWWWGRKTPHISPQRLPSLSLSACLLSPTADIEVRALQVALGAHLRDARRGQRLRSGVHVVVTGPPNAGKSSLVNLLSRKPVSIVSPEPGTTRDVLETPVDLAGFPVLLSDTAGLREGVGPVEQEGVRRARERLEQADLILAMLDASDLASPSSCNFLATVVASVGAQSPSDSSQRLLLVLNKSDLLSPEGPGPGPDLPPHLLLSCLTGEGLDGLLEALRKELAAVCGDPSTDPPLLTRARHQHHLQGCLDALGHYKQSKDLALAAEALRVARGHLTRLTGGGGTEEILDIIFQDFCVGK,524,NP_598399.2.csv,refseq-GTPBP3-NM_133644.3_clinical_seed_0_final,refseq-GTPBP3-NM_133644.3.a2m,Invitae,refseq-GTPBP3-NM_133644.3.npy,1,524,524
+NP_598408.1,MHPPETTTKMASVRFMVTPTKIDDIPGLSDTSPDLSSRSSSRVRFSSRESVPETSRSEPMSEMSGATTSLATVALDPPSDRTSHPQDVIEDLSQNSITGEHSQLLDDGHKKARNAYLNNSNYEEGDEYFDKNLALFEEEMDTRPKVSSLLNRMANYTNLTQGAKEHEEAENITEGKKKPTKTPQMGTFMGVYLPCLQNIFGVILFLRLTWVVGTAGVLQAFAIVLICCCCTMLTAISMSAIATNGVVPAGGSYFMISRALGPEFGGAVGLCFYLGTTFAAAMYILGAIEIFLVYIVPRAAIFHSDDALKESAAMLNNMRVYGTAFLVLMVLVVFIGVRYVNKFASLFLACVIVSILAIYAGAIKSSFAPPHFPVCMLGNRTLSSRHIDVCSKTKEINNMTVPSKLWGFFCNSSQFFNATCDEYFVHNNVTSIQGIPGLASGIITENLWSNYLPKGEIIEKPSAKSSDVLGSLNHEYVLVDITTSFTLLVGIFFPSVTGIMAGSNRSGDLKDAQKSIPIGTILAILTTSFVYLSNVVLFGACIEGVVLRDKFGDAVKGNLVVGTLSWPSPWVIVIGSFFSTCGAGLQSLTGAPRLLQAIAKDNIIPFLRVFGHSKANGEPTWALLLTAAIAELGILIASLDLVAPILSMFFLMCYLFVNLACALQTLLRTPNWRPRFRYYHWALSFMGMSICLALMFISSWYYAIVAMVIAGMIYKYIEYQGAEKEWGDGIRGLSLSAARFALLRLEEGPPHTKNWRPQLLVLLKLDEDLHVKHPRLLTFASQLKAGKGLTIVGSVIVGNFLENYGEALAAEQTIKHLMEAEKVKGFCQLVVAAKLREGISHLIQSCGLGGMKHNTVVMGWPNGWRQSEDARAWKTFIGTVRVTTAAHLALLVAKNISFFPSNVEQFSEGNIDVWWIVHDGGMLMLLPFLLKQHKVWRKCSIRIFTVAQLEDNSIQMKKDLATFLYHLRIEAEVEVVEMHDSDISAYTYERTLMMEQRSQMLRHMRLSKTERDREAQLVKDRNSMLRLTSIGSDEDEETETYQEKVHMTWTKDKYMASRGQKAKSMEGFQDLLNMRPDQSNVRRMHTAVKLNEVIVNKSHEAKLVLLNMPGPPRNPEGDENYMEFLEVLTEGLERVLLVRGGGSEVITIYS,1150,NP_598408.1.csv,refseq-SLC12A6-NM_133647.1_clinical_seed_0_final,refseq-SLC12A6-NM_133647.1.a2m,Invitae,refseq-SLC12A6-NM_133647.1.npy,1,1150,1150
+NP_599023.1,MESAPAAPDPAASEPGSSGADAAAGSRETPLNQESARKSEPPAPVRRQSYSSTSRGISVTKKTHTSQIEIIPCKICGDKSSGIHYGVITCEGCKGFFRRSQQSNATYSCPRQKNCLIDRTSRNRCQHCRLQKCLAVGMSRDAVKFGRMSKKQRDSLYAEVQKHRMQQQQRDHQQQPGEAEPLTPTYNISANGLTELHDDLSNYIDGHTPEGSKADSAVSSFYLDIQPSPDQSGLDINGIKPEPICDYTPASGFFPYCSFTNGETSPTVSMAELEHLAQNISKSHLETCQYLREELQQITWQTFLQEEIENYQNKQREVMWQLCAIKITEAIQYVVEFAKRIDGFMELCQNDQIVLLKAGSLEVVFIRMCRAFDSQNNTVYFDGKYASPDVFKSLGCEDFISFVFEFGKSLCSMHLTEDEIALFSAFVLMSADRSWLQEKVKIEKLQQKIQLALQHVLQKNHREDGILTKLICKVSTLRALCGRHTEKLMAFKAIYPDIVRLHFPPLYKELFTSEFEPAMQIDG,523,NP_599023.1.csv,refseq-RORA-NM_134261.3_clinical_seed_0_final,refseq-RORA-NM_134261.3.a2m,Invitae,refseq-RORA-NM_134261.3_theta_0.2.npy,1,523,523
+NP_612370.3,MPLFFRKRKPSEEARKRLEYQMCLAKEAGADDILDISKCELSEIPFGAFATCKVLQKKVLIVHTNHLTSLLPKSCSLLSLATIKVLDLHDNQLTALPDDLGQLTALQVLNVERNQLMQLPRSIGNLTQLQTLNVKDNKLKELPDTVGELRSLRTLNISGNEIQRLPQMLAHVRTLEMLSLDASAMVYPPREVCGAGTAAILQFLCKESGLEYYPPSQYLLPILEQDGIENSRDSPDGPTDRFSREELEWQNRFSDYEKRKEQKMLEKLEFERRLELGQREHTQLLQQSSSQKDEILQTVKEEQSRLEQGLSEHQRHLNAERQRLQEQLKQTEQNISSRIQKLLQDNQRQKKSSEILKSLENERIRMEQLMSITQEETESLRRRDVASAMQQMLTESCKNRLIQMAYESQRQNLVQQACSSMAEMDERFQQILSWQQMDQNKAISQILQESAMQKAAFEALQVKKDLMHRQIRSQIKLIETELLQLTQLELKRKSLDTESLQEMISEQRWALSSLLQQLLKEKQQREEELREILTELEAKSETRQENYWLIQYQRLLNQKPLSLKLQEEGMERQLVALLEELSAEHYLPIFAHHRLSLDLLSQMSPGDLAKVGVSEAGLQHEILRRVQELLDAARIQPELKPPMGEVVTPTAPQEPPESVRPSAPPAELEVQASECVVCLEREAQMIFLNCGHVCCCQQCCQPLRTCPLCRQDIAQRLRIYHSS,723,NP_612370.3.csv,refseq-LRSAM1-NM_138361.5_clinical_seed_0_final,refseq-LRSAM1-NM_138361.5.a2m,Invitae,refseq-LRSAM1-NM_138361.5.npy,1,723,723
+NP_612373.2,MSNSRPRSRRDAGGGAGAAGRDELVSRSLQSAEHCLGVQDFGTAYAHYLLVLSLAPELKHDVKETFQYTLFRWAEELDALSRIQDLLGCYEQALELFPDDEVICNSMGEHLFRMGFRDEAAGYFHKAVKLNPDFSDAKENFYRVANWLVERWHFIMLNDTKRNTIYNAAIQKAVCLGSKSVLDIGAGTGILSMFAKKAGAHSVYACELSKTMYELACDVVAANKMEAGIKLLHTKSLDIEIPKHIPERVSLVVTETVDAGLFGEGIVESLIHAWEHLLLQPKTKGESANCEKYGKVIPASAVIFGMAVECAEIRRHHRVGIKDIAGIHLPTNVKFQSPAYSSVDTEETIEPYTTEKMSRVPGGYLALTECFEIMTVDFNNLQELKSLATKKPDKIGIPVIKEGILDAIMVWFVLQLDDEHSLSTSPSEETCWEQAVYPVQDLADYWIKPGDHVMMEVSCQDCYLRIQSISVLGLECEMDVAKSFTQNKDLLSLGNEAELCSALANLQTSKPDAVEQTCILESTEIALLNNIPYHEGFKMAMSKVLSSLTPEKLYQTMDTHCQNEMSSGTGQSNTVQNILEPFYVLDVSEGFSVLPVIAGTLGQVKPYSSVEKDQHRIALDLISEANHFPKETLEFWLRHVEDESAMLQRPKSDKLWSIIILDVIEPSGLIQQEIMEKAAISRCLLQSGGKIFPQYVLMFGLLVESQTLLEENAVQGTERTLGLNIAPFINQFQVPIRVFLDLSSLPCIPLSKPVELLRLDLMTPYLNTSNREVKVYVCKSGRLTAIPFWYHMYLDEEIRLDTSSEASHWKQAAVVLDNPIQVEMGEELVLSIQHHKSNVSITVKQ,845,NP_612373.2.csv,refseq-PRMT9-NM_138364.3_clinical_seed_0_final,refseq-PRMT9-NM_138364.3.a2m,Invitae,refseq-PRMT9-NM_138364.3.npy,1,845,845
+NP_612396.1,MESTLGAGIVIAEALQNQLAWLENVWLWITFLGDPKILFLFYFPAAYYASRRVGIAVLWISLITEWLNLIFKWFLFGDRPFWWVHESGYYSQAPAQVHQFPSSCETGPGSPSGHCMITGAALWPIMTALSSQVATRARSRWVRVMPSLAYCTFLLAVGLSRIFILAHFPHQVLAGLITGAVLGWLMTPRVPMERELSFYGLTALALMLGTSLIYWTLFTLGLDLSWSISLAFKWCERPEWIHVDSRPFASLSRDSGAALGLGIALHSPCYAQVRRAQLGNGQKIACLVLAMGLLGPLDWLGHPPQISLFYIFNFLKYTLWPCLVLALVPWAVHMFSAQEAPPIHSS,346,NP_612396.1.csv,refseq-G6PC3-NM_138387.3_clinical_seed_0_final,refseq-G6PC3-NM_138387.3.a2m,Invitae,refseq-G6PC3-NM_138387.3.npy,1,346,346
+NP_612404.1,MLRTSVLRLLGRTGASRLSLLEDFGPRYYSSGSLSAGDDACDVRAYFTTPIFYVNAAPHIGHLYSALLADALCRHRRLRGPSTAATRFSTGTDEHGLKIQQAAATAGLAPTELCDRVSEQFQQLFQEAGISCTDFIRTTEARHRVAVQHFWGVLKSRGLLYKGVYEGWYCASDECFLPEAKVTQQPGPSGDSFPVSLESGHPVSWTKEENYIFRLSQFRKPLQRWLRGNPQAITPEPFHHVVLQWLDEELPDLSVSRRSSHLHWGIPVPGDDSQTIYVWLDALVNYLTVIGYPNAEFKSWWPATSHIIGKDILKFHAIYWPAFLLGAGMSPPQRICVHSHWTVCGQKMSKSLGNVVDPRTCLNRYTVDGFRYFLLRQGVPNWDCDYYDEKVVKLLNSELADALGGLLNRCTAKRINPSETYPAFCTTCFPSEPGLVGPSVRAQAEDYALVSAVATLPKQVADHYDNFRIYKALEAVSSCVRQTNGFVQRHAPWKLNWESPVDAPWLGTVLHVALECLRVFGTLLQPVTPSLADKLLSRLGVSASERSLGELYFLPRFYGHPCPFEGRRLGPETGLLFPRLDQSRTWLVKAHRT,593,NP_612404.1.csv,refseq-MARS2-NM_138395.3_clinical_seed_0_final,refseq-MARS2-NM_138395.3.a2m,Invitae,refseq-MARS2-NM_138395.3.npy,1,593,593
+NP_612422.2,MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENLHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVEMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPVDAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGGPCRAPLQELSPAEEEALRMDFTSNGWL,327,NP_612422.2.csv,refseq-HOGA1-NM_138413.3_clinical_seed_0_final,refseq-HOGA1-NM_138413.3.a2m,Invitae,refseq-HOGA1-NM_138413.3.npy,1,327,327
+NP_612431.2,MILCSRLCLPQSASLRMEPAPGLVEQPKCLEAGSPEPEPAPWQALPVLSEKQSGDVELVLAYAAPVLDKRQTSRLLKEVSALHPLPAQPHLKRVRPSRDAGSPHALEMLLCLAGPASGPRSLAELLPRPAVDPRGLGQPFLVPVPARPPLTRGQFEEARAHWPTSFHEDKQVTSALAGRLFSTQERAAMQSHMERAVWAARRAAARGLRAVGAVVVDPASDRVLATGHDCSCADNPLLHAVMVCVDLVARGQGRGTYDFRPFPACSFAPAAAPQAVRAGAVRKLDADEDGLPYLCTGYDLYVTREPCAMCAMALVHARILRVFYGAPSPDGALGTRFRIHARPDLNHRFQVFRGVLEEQCRWLDPDT,367,NP_612431.2.csv,refseq-ADAT3-NM_138422.3_clinical_seed_0_final,refseq-ADAT3-NM_138422.3.a2m,Invitae,refseq-ADAT3-NM_138422.3.npy,1,367,367
+NP_612433.1,MQRTFAWLLDRVQHLGAPVTLRASYLEIYNEQVRDLLSLGSPRPLPVRWNKTRGFYVEQLRVVEFGSLEALMELLQTGLSRRRNSAHTLNQASSRSHALLTLYISRQTAQQMPSVDPGEPPVGGKLCFVDLAGSEKVAATGSRGELMLEANSINRSLLALGHCISLLLDPQRKQSHIPFRDSKLTKLLADSLGGRGVTLMVACVSPSAQCLPETLSTLRYASRAQRVTTRPQAPKSPVAKQPQRLETEMLQLQEENRRLQFQLDQMDCKASGLSGARVAWAQRNLYGMLQEFMLENERLRKEKSQLQNSRDLAQNEQRILAQQVHALERRLLSACYHHQQGPGLTPPCPCLMAPAPPCHALPPLYSCPCCHICPLCRVPLAHWACLPGEHHLPQVLDPEASGGRPPSARPPPWAPPCSPGSAKCPRERSHSDWTQTRVLAEMLTEEEVVPSAPPLPVRPPKTSPGLRGGAGVPNLAQRLEALRDQIGSSLRRGRSQPPCSEGARSPGQVLPPH,513,NP_612433.1.csv,refseq-KIF12-NM_138424.1_clinical_seed_0_final,refseq-KIF12-NM_138424.1.a2m,Invitae,refseq-KIF12-NM_138424.1.npy,1,513,513
+NP_612468.1,MTGLYELVWRVLHALLCLHRTLTSWLRVRFGTWNWIWRRCCRAASAAVLAPLGFTLRKPPAVGRNRRHHRHPRGGSCLAAAHHRMRWRADGRSLEKLPVHMGLVITEVEQEPSFSDIASLVVWCMAVGISYISVYDHQGIFKRNNSRLMDEILKQQQELLGLDCSKYSPEFANSNDKDDQVLNCHLAVKVLSPEDGKADIVRAAQDFCQLVAQKQKRPTDLDVDTLASLLSSNGCPDPDLVLKFGPVDSTLGFLPWHIRLTEIVSLPSHLNISYEDFFSALRQYAACEQRLGK,293,NP_612468.1.csv,NP_612468.1_clinical_seed_0_final,NP_612468.1.a2m,popEVE,NP_612468.1_theta_0.2.npy,1,293,293
+NP_612486.2,MAAVLESLLREEVSVAAVVRWIARSTQGSEDNAGEAAALSSLRALRKEFVPFLLNFLREQSSRVLPQGPPTPAKTPGASAALPGRPGGPPRGSRGARSQLFPPTEAQSTAAEAPLARRGGRRRGPGPARERGGRGLEEGVSGESLPGAGGRRLRGSGSPSRPSLTLSDPPNLSNLEEFPPVGSVPPGPTGTKPSRRINPTPVSEERSLSKPKTCFTSPPISCVPSSQPSALDTSPWGLGLPPGCRSLQEEREMLRKERSKQLQQSPTPTCPTPELGSPLPSRTGSLTDEPADPARVSSRQRLELVALVYSSCIAENLVPNLFLELFFVFQLLTARRMVTAKDSDPELSPAVLDSLESPLFQSIHDCVFFAVQVLECHFQVLSNLDKGTLKLLAENERLLCFSPALQGRLRAAYEGSVAKVSLVMPPSTQAVSFQPETDNRANFSSDRAFHTFKKQRDVFYEVLREWEDHHEEPGWDFEKGLGSRIRAMMGQLSAACSHSHFVRLFQKQLLQMCQSPGGAGGTVLGEAPDVLSMLGADKLGRLWRLQERLMAPQSSGGPCPPPTFPGCQGFFRDFILSASSFQFNQHLMDSLSLKIQELNGLALPQHEPNDEDGESDVDWQGERKQFAVVLLSLRLLAKFLGFVAFLPYRGPEPPPTGELQDSILALRSQVPPVLDVRTLLQRGLQARRAVLTVPWLVEFLSFADHVVPLLEYYRDIFTLLLRLHRSLVLSQESEGKMCFLNKLLLLAVLGWLFQIPTVPEDLFFLEEGPSYAFEVDTVAPEHGLDNAPVVDQQLLYTCCPYIGELRKLLASWVSGSSGRSGGFMRKITPTTTTSLGAQPSQTSQGLQAQLAQAFFHNQPPSLRRTVEFVAERIGSNCVKHIKATLVADLVRQAESLLQEQLVTQGEEGGDPAQLLEILCSQLCPHGAQALALGREFCQRKSPGAVRALLPEETPAAVLSSAENIAVGLATEKACAWLSANITALIRREVKAAVSRTLRAQGPEPAARGERRGCSRACEHHAPLPSHLISEIKDVLSLAVGPRDPDEGVSPEHLEQLLGQLGQTLRCRQFLCPPAEQHLAKCSVELASLLVADQIPILGPPAQYRLERGQARRLLHMLLSLWKEDFQGPVPLQLLLSPRNVGLLADTRPREWDLLLFLLRELVEKGLMGRMEIEACLGSLHQAQWPGDFAEELATLSNLFLAEPHLPEPQLRACELVQPNRGTVLAQS,1227,NP_612486.2.csv,refseq-CDAN1-NM_138477.2_clinical_seed_0_final,refseq-CDAN1-NM_138477.2.a2m,Invitae,refseq-CDAN1-NM_138477.2.npy,1,1227,1227
+NP_612565.1,MKSARAKTPRKPTVKKGSQTNLKDPVGVYCRVRPLGFPDQECCIEVINNTTVQLHTPEGYRLNRNGDYKETQYSFKQVFGTHTTQKELFDVVANPLVNDLIHGKNGLLFTYGVTGSGKTHTMTGSPGEGGLLPRCLDMIFNSIGSFQAKRYVFKSNDRNSMDIQCEVDALLERQKREAMPNPKTSSSKRQVDPEFADMITVQEFCKAEEVDEDSVYGVFVSYIEIYNNYIYDLLEEVPFDPIKPKPPQSKLLREDKNHNMYVAGCTEVEVKSTEEAFEVFWRGQKKRRIANTHLNRESSRSHSVFNIKLVQAPLDADGDNVLQEKEQITISQLSLVDLAGSERTNRTRAEGNRLREAGNINQSLMTLRTCMDVLRENQMYGTNKMVPYRDSKLTHLFKNYFDGEGKVRMIVCVNPKAEDYEENLQVMRFAEVTQEVEVARPVDKAICGLTPGRRYRNQPRGPVGNEPLVTDVVLQSFPPLPSCEILDINDEQTLPRLIEALEKRHNLRQMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQGKLNEKEKMISGQKLEIERLEKKNKTLEYKIEILEKTTTIYEEDKRNLQQELETQNQKLQRQFSDKRRLEARLQGMVTETTMKWEKECERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERPSRERDREKVTQRSVSPSPVPLSSNYIAQISNGQQLMSQPQLHRRSNSCSSISVASCISEWEQKIPTYNTPLKVTSIARRRQQEPGQSKTCIVSDRRRGMYWTEGREVVPTFRNEIEIEEDHCGRLLFQPDQNAPPIRLRHRRSRSAGDRWVDHKPASNMQTETVMQPHVPHAITVSVANEKALAKCEKYMLTHQELASDGEIETKLIKGDIYKTRGGGQSVQFTDIETLKQESPNGSRKRRSSTVAPAQPDGAESEWTDVETRCSVAVEMRAGSQLGPGYQHHAQPKRKKP,960,NP_612565.1.csv,refseq-KIF23-NM_138555.3_clinical_seed_0_final,refseq-KIF23-NM_138555.3.a2m,Invitae,refseq-KIF23-NM_138555.3.npy,1,960,960
+NP_612808.1,MSRRKQGNPQHLSQRELITPEADHVEAAILEEDEGLEIEEPSGLGLMVGGPDPDLLTCGQCQMNFPLGDILVFIEHKRKQCGGSLGACYDKALDKDSPPPSSRSELRKVSEPVEIGIQVTPDEDDHLLSPTKGICPKQENIAGPCRPAQLPAVAPIAASSHPHSSVITSPLRALGALPPCLPLPCCSARPVSGDGTQGEGQTEAPFGCQCQLSGKDEPSSYICTTCKQPFNSAWFLLQHAQNTHGFRIYLEPGPASSSLTPRLTIPPPLGPEAVAQSPLMNFLGDSNPFNLLRMTGPILRDHPGFGEGRLPGTPPLFSPPPRHHLDPHRLSAEEMGLVAQHPSAFDRVMRLNPMAIDSPAMDFSRRLRELAGNSSTPPPVSPGRGNPMHRLLNPFQPSPKSPFLSTPPLPPMPPGGTPPPQPPAKSKSCEFCGKTFKFQSNLIVHRRSHTGEKPYKCQLCDHACSQASKLKRHMKTHMHKAGSLAGRSDDGLSAASSPEPGTSELAGEGLKAADGDFRHHESDPSLGHEPEEEDEEEEEEEEELLLENESRPESSFSMDSELSRNRENGGGGVPGVPGAGGGAAKALADEKALVLGKVMENVGLGALPQYGELLADKQKRGAFLKRAAGGGDAGDDDDAGGCGDAGAGGAVNGRGGGFAPGTEPFPGLFPRKPAPLPSPGLNSAAKRIKVEKDLELPPAALIPSENVYSQWLVGYAASRHFMKDPFLGFTDARQSPFATSSEHSSENGSLRFSTPPGDLLDGGLSGRSGTASGGSTPHLGGPGPGRPSSKEGRRSDTCEYCGKVFKNCSNLTVHRRSHTGERPYKCELCNYACAQSSKLTRHMKTHGQIGKEVYRCDICQMPFSVYSTLEKHMKKWHGEHLLTNDVKIEQAERS,894,NP_612808.1.csv,refseq-BCL11B-NM_138576.3_clinical_seed_0_final,refseq-BCL11B-NM_138576.3.a2m,Invitae,refseq-BCL11B-NM_138576.3.npy,1,894,894
+NP_619520.1,MFSLDSFRKDRAQHRQRQCKLPPPRLPPMCVNPTPGGTISRASRDLLKEFPQPKNLLNSVIGRALGISHAKDKLVYVHTNGPKKKKVTLHIKWPKSVEVEGYGSKKIDAERQAAAAACQLFKGWGLLGPRNELFDAAKYRVLADRFGSPADSWWRPEPTMPPTSWRQLNPESIRPGGPGGLSRSLGREEEEDEEEELEEGTIDVTDFLSMTQQDSHAPLRDSRGSSFEMTDDDSAIRALTQFPLPKNLLAKVIQIATSSSTAKNLMQFHTVGTKTKLSTLTLLWPCPMTFVAKGRRKAEAENKAAALACKKLKSLGLVDRNNEPLTHAMYNLASLRELGETQRRPCTIQVPEPILRKIETFLNHYPVESSWIAPELRLQSDDILPLGKDSGPLSDPITGKPYVPLLEAEEVRLSQSLLELWRRRGPVWQEAPQLPVDPHRDTILNAIEQHPVVVISGDTGCGKTTRIPQLLLERYVTEGRGARCNVIITQPRRISAVSVAQRVSHELGPSLRRNVGFQVRLESKPPSRGGALLFCTVGILLRKLQSNPSLEGVSHVIVDEVHERDVNTDFLLILLKGLQRLNPALRLVLMSATGDNERFSRYFGGCPVIKVPGFMYPVKEHYLEDILAKLGKHQYLHRHRHHESEDECALDLDLVTDLVLHIDARGEPGGILCFLPGWQEIKGVQQRLQEALGMHESKYLILPVHSNIPMMDQKAIFQQPPVGVRKIVLATNIAETSITINDIVHVVDSGLHKEERYDLKTKVSCLETVWVSRANVIQRRGRAGRCQSGFAYHLFPRSRLEKMVPFQVPEILRTPLENLVLQAKIHMPEKTAVEFLSKAVDSPNIKAVDEAVILLQEIGVLDQREYLTTLGQRLAHISTDPRLAKAIVLAAIFRCLHPLLVVVSCLTRDPFSSSLQNRAEVDKVKALLSHDSGSDHLAFVRAVAGWEEVLRWQDRSSRENYLEENLLYAPSLRFIHGLIKQFSENIYEAFLVGKPSDCTLASAQCNEYSEEEELVKGVLMAGLYPNLIQVRQGKVTRQGKFKPNSVTYRTKSGNILLHKSTINREATRLRSRWLTYFMAVKSNGSVFVRDSSQVHPLAVLLLTDGDVHIRDDGRRATISLSDSDLLRLEGDSRTVRLLKELRRALGRMVERSLRSELAALPPSVQEEHGQLLALLAELLRGPCGSFDVRKTADD,1194,NP_619520.1.csv,refseq-DHX30-NM_138615.2_clinical_seed_0_final,refseq-DHX30-NM_138615.2.a2m,Invitae,refseq-DHX30-NM_138615.2.npy,1,1194,1194
+NP_619542.1,MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVGKYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNLKNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKVCEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINLTLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNMPHLKVLDLEFNYLVGEIASGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRALHLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLVKDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFIGPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEVLDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRLDILWNDDDNRYISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFNWTLLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLSSNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASPGDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMLAALAHHLFYWDVWFIYNVCLAKVKGYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIIDNLMQSINQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVIIFILLEPVLQHSQYLRLRQRICKSSILQWPDNPKAEGLFWQTLRNVVLTENDSRYNNMYVDSIKQY,1041,NP_619542.1.csv,refseq-TLR8-NM_138636.4_clinical_seed_0_final,refseq-TLR8-NM_138636.4.a2m,Invitae,refseq-TLR8-NM_138636.4.npy,1,1041,1041
+NP_619636.2,MSPKKVQIKVEEKEDETEESSSEEEEEVEDKLPRRESLRPKRKRTRDVINEDDPEPEPEDEETRKAREKERRRRLKRGAEEEEIDEEELERLKAELDEKRQIIATVKCKPWKMEKKIEVLKEAKKFVSENEGALGKGKGKRWFAFKMMMAKKWAKFLRDFENFKAACVPWENKIKAIESQFGSSVASYFLFLRWMYGVNMVLFILTFSLIMLPEYLWGLPYGSLPRKTVPRAEEASAANFGVLYDFNGLAQYSVLFYGYYDNKRTIGWMNFRLPLSYFLVGIMCIGYSFLVVLKAMTKNIGDDGGGDDNTFNFSWKVFTSWDYLIGNPETADNKFNSITMNFKEAITEEKAAQVEENVHLIRFLRFLANFFVFLTLGGSGYLIFWAVKRSQEFAQQDPDTLGWWEKNEMNMVMSLLGMFCPTLFDLFAELEDYHPLIALKWLLGRIFALLLGNLYVFILALMDEINNKIEEEKLVKANITLWEANMIKAYNASFSENSTGPPFFVHPADVPRGPCWETMVGQEFVRLTVSDVLTTYVTILIGDFLRACFVRFCNYCWCWDLEYGYPSYTEFDISGNVLALIFNQGMIWMGSFFAPSLPGINILRLHTSMYFQCWAVMCCNVPEARVFKASRSNNFYLGMLLLILFLSTMPVLYMIVSLPPSFDCGPFSGKNRMFEVIGETLEHDFPSWMAKILRQLSNPGLVIAVILVMVLAIYYLNATAKGQKAANLDLKKKMKMQALENKMRNKKMAAARAAAAAGRQ,760,NP_619636.2.csv,refseq-TMC1-NM_138691.2_clinical_seed_0_final,refseq-TMC1-NM_138691.2.a2m,Invitae,refseq-TMC1-NM_138691.2.npy,1,760,760
+NP_619646.1,MQRQNFRPPTPPYPGPGGGGWGSGSSFRGTPGGGGPRPPSPRDGYGSPHHTPPYGPRSRPYGSSHSPRHGGSFPGGRFGSPSPGGYPGSYSRSPAGSQQQFGYSPGQQQTHPQGSPRTSTPFGSGRVREKRMSNELENYFKPSMLEDPWAGLEPVSVVDISQQYSNTQTFTGKKGRYFC,179,NP_619646.1.csv,refseq-MPLKIP-NM_138701.3_clinical_seed_0_final,refseq-MPLKIP-NM_138701.3.a2m,Invitae,refseq-MPLKIP-NM_138701.3.npy,1,179,179
+NP_619649.1,MLQKPRNRGRSGGQAERDRDWSHSGNPGASRAGEDARVLRDGFAEEAPSTSRGPGGSQGSQGPSPQGARRAQAAPAVGPRSQKQLELKVSELVQFLLIKDQKKIPIKRADILKHVIGDYKDIFPDLFKRAAERLQYVFGYKLVELEPKSNTYILINTLEPVEEDAEMRGDQGTPTTGLLMIVLGLIFMKGNTIKETEAWDFLRRLGVYPTKKHLIFGDPKKLITEDFVRQRYLEYRRIPHTDPVDYEFQWGPRTNLETSKMKVLKFVAKVHNQDPKDWPAQYCEALADEENRARPQPSGPAPSS,304,NP_619649.1.csv,refseq-NSMCE3-NM_138704.3_clinical_seed_0_final,refseq-NSMCE3-NM_138704.3.a2m,Invitae,refseq-NSMCE3-NM_138704.3.npy,1,304,304
+NP_619729.1,MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLVFAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSNMEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSLISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEIKGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIGPPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHEGRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQWGTRACSHSEDAGVTCTL,451,NP_619729.1.csv,refseq-MSR1-NM_138715.2_clinical_seed_0_final,refseq-MSR1-NM_138715.2.a2m,Invitae,refseq-MSR1-NM_138715.2.npy,1,451,451
+NP_620128.1,MHPRRPDGFDGLGYRGGARDEQGFGGAFPARSFSTGSDLGHWVTTPPDIPGSRNLHWGEKSPPYGVPTTSTPYEGPTEEPFSSGGGGSVQGQSSEQLNRFAGFGIGLASLFTENVLAHPCIVLRRQCQVNYHAQHYHLTPFTVINIMYSFNKTQGPRALWKGMGSTFIVQGVTLGAEGIISEFTPLPREVLHKWSPKQIGEHLLLKSLTYVVAMPFYSASLIETVQSEIIRDNTGILECVKEGIGRVIGMGVPHSKRLLPLLSLIFPTVLHGVLHYIISSVIQKFVLLILKRKTYNSHLAESTSPVQSMLDAYFPELIANFAASLCSDVILYPLETVLHRLHIQGTRTIIDNTDLGYEVLPINTQYEGMRDCINTIRQEEGVFGFYKGFGAVIIQYTLHAAVLQITKIIYSTLLQNNI,418,NP_620128.1.csv,refseq-SLC25A46-NM_138773.2_clinical_seed_0_final,refseq-SLC25A46-NM_138773.2.a2m,Invitae,refseq-SLC25A46-NM_138773.2.npy,1,418,418
+NP_620148.1,MPVQLSEHPEWNESMHSLRISVGGLPVLASMTKAADPRFRPRWKVILTFFVGAAILWLLCSHRPAPGRPPTHNAHNWRLGQAPANWYNDTYPLSPPQRTPAGIRYRIAVIADLDTESRAQEENTWFSYLKKGYLTLSDSGDKVAVEWDKDHGVLESHLAEKGRGMELSDLIVFNGKLYSVDDRTGVVYQIEGSKAVPWVILSDGDGTVEKGFKAEWLAVKDERLYVGGLGKEWTTTTGDVVNENPEWVKVVGYKGSVDHENWVSNYNALRAAAGIQPPGYLIHESACWSDTLQRWFFLPRRASQERYSEKDDERKGANLLLSASPDFGDIAVSHVGAVVPTHGFSSFKFIPNTDDQIIVALKSEEDSGRVASYIMAFTLDGRFLLPETKIGSVKYEGIEFI,401,NP_620148.1.csv,refseq-CANT1-NM_138793.3_clinical_seed_0_final,refseq-CANT1-NM_138793.3.a2m,Invitae,refseq-CANT1-NM_138793.3.npy,1,401,401
+NP_620170.3,MSDANLDSSKKNFLEGEVDDEESVILTLVPVKDDANMEQMEPSVSSTSDVKLEKPKKYNPGHLLQTNEQFTAPQKARCKIPALPLPTILPPINKVCRDTLRDWCQQLGLSTNGKKIEVYLRLHRHAYPEQRQDMPEMSQETRLQRCSRKRKAVTKRARLQRSYEMNERAEETNTVEVITSAPGAMLASWARIAARAVQPKALNSCSIPVSVEAFLMQASGVRWCVVHGRLLSADTKGWVRLQFHAGQAWVPTTHRRMISLFLLPACIFPSPGIEDNMLCPDCAKRNKKMMKRLMTVEK,298,NP_620170.3.csv,refseq-DPPA2-NM_138815.3_clinical_seed_0_final,refseq-DPPA2-NM_138815.3.a2m,Invitae,refseq-DPPA2-NM_138815.3.npy,1,298,298
+NP_620409.1,MDTESTYSGYSYYSSHSKKSHRQGERTRERHKSPRNKDGRGSEKSVTIQPPTGEPLLGNDSTRTEEVQDDNWGETTTAITGTSEHSISQEDIARISKDMEDSVGLDCKRYLGLTVASFLGLLVFLTPIAFILLPPILWRDELEPCGTICEGLFISMAFKLLILLIGTWALFFRKRRADMPRVFVFRALLLVLIFLFVVSYWLFYGVRILDSRDRNYQGIVQYAVSLVDALLFIHYLAIVLLELRQLQPMFTLQVVRSTDGESRFYSLGHLSIQRAALVVLENYYKDFTIYNPNLLTASKFRAAKHMAGLKVYNVDGPSNNATGQSRAMIAAAARRRDSSHNELYYEEAEHERRVKKRKARLVVAVEEAFIHIQRLQAEEQQKAPGEVMDPREAAQAIFPSMARALQKYLRITRQQNYHSMESILQHLAFCITNGMTPKAFLERYLSAGPTLQYDKDRWLSTQWRLVSDEAVTNGLRDGIVFVLKCLDFSLVVNVKKIPFIILSEEFIDPKSHKFVLRLQSETSV,524,NP_620409.1.csv,refseq-VANGL1-NM_138959.2_clinical_seed_0_final,refseq-VANGL1-NM_138959.2.a2m,Invitae,refseq-VANGL1-NM_138959.2.npy,1,524,524
+NP_620596.2,MHQRHPRARCPPLCVAGILACGFLLGCWGPSHFQQSCLQALEPQAVSSYLSPGAPLKGRPPSPGFQRQRQRQRRAAGGILHLELLVAVGPDVFQAHQEDTERYVLTNLNIGAELLRDPSLGAQFRVHLVKMVILTEPEGAPNITANLTSSLLSVCGWSQTINPEDDTDPGHADLVLYITRFDLELPDGNRQVRGVTQLGGACSPTWSCLITEDTGFDLGVTIAHEIGHSFGLEHDGAPGSGCGPSGHVMASDGAAPRAGLAWSPCSRRQLLSLLSAGRARCVWDPPRPQPGSAGHPPDAQPGLYYSANEQCRVAFGPKAVACTFAREHLDMCQALSCHTDPLDQSSCSRLLVPLLDGTECGVEKWCSKGRCRSLVELTPIAAVHGRWSSWGPRSPCSRSCGGGVVTRRRQCNNPRPAFGGRACVGADLQAEMCNTQACEKTQLEFMSQQCARTDGQPLRSSPGGASFYHWGAAVPHSQGDALCRHMCRAIGESFIMKRGDSFLDGTRCMPSGPREDGTLSLCVSGSCRTFGCDGRMDSQQVWDRCQVCGGDNSTCSPRKGSFTAGRAREYVTFLTVTPNLTSVYIANHRPLFTHLAVRIGGRYVVAGKMSISPNTTYPSLLEDGRVEYRVALTEDRLPRLEEIRIWGPLQEDADIQVYRRYGEEYGNLTRPDITFTYFQPKPRQAWVWAAVRGPCSVSCGAGLRWVNYSCLDQARKELVETVQCQGSQQPPAWPEACVLEPCPPYWAVGDFGPCSASCGGGLRERPVRCVEAQGSLLKTLPPARCRAGAQQPAVALETCNPQPCPARWEVSEPSSCTSAGGAGLALENETCVPGADGLEAPVTEGPGSVDEKLPAPEPCVGMSCPPGWGHLDATSAGEKAPSPWGSIRTGAQAAHVWTPAAGSCSVSCGRGLMELRFLCMDSALRVPVQEELCGLASKPGSRREVCQAVPCPARWQYKLAACSVSCGRGVVRRILYCARAHGEDDGEEILLDTQCQGLPRPEPQEACSLEPCPPRWKVMSLGPCSASCGLGTARRSVACVQLDQGQDVEVDEAACAALVRPEASVPCLIADCTYRWHVGTWMECSVSCGDGIQRRRDTCLGPQAQAPVPADFCQHLPKPVTVRGCWAGPCVGQGACGRQHLEPTGTIDMRGPGQADCAVAIGRPLGEVVTLRVLESSLNCSAGDMLLLWGRLTWRKMCRKLLDMTFSSKTNTLVVRQRCGRPGGGVLLRYGSQLAPETFYRECDMQLFGPWGEIVSPSLSPATSNAGGCRLFINVAPHARIAIHALATNMGAGTEGANASYILIRDTHSLRTTAFHGQQVLYWESESSQAEMEFSEGFLKAQASLRGQYWTLQSWVPEMQDPQSWKGKEGT,1371,NP_620596.2.csv,refseq-ADAMTS13-NM_139027.3_clinical_seed_0_final,refseq-ADAMTS13-NM_139027.3.a2m,Invitae,refseq-ADAMTS13-NM_139027.3.npy,1,1371,1371
+NP_620689.1,MSNQYQEEGCSERPECKSKSPTLLSSYCIDSILGRRSPCKMRLLGAAQSLPAPLTSRADPEKAVQGSPKSSSAPFEAELHLPPKLRRLYGPGGGRLLQGAAAAAAAAAAAAAAAATATAGPRGEAPPPPPPTARPGERPDGAGAAAAAAAAAAAAWDTLKISQAPQVSISRSKSYRENGAPFVPPPPALDELGGPGGVTHPEERLGVAGGPGSAPAAGGGTGTEDDEEELLEDEEDEDEEEELLEDDEEELLEDDARALLKEPRRCPVAATGAVAAAAAAAVATEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQRRYRTTFTSYQLEELERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREKAGAQTHPPGLPFPGPLSATHPLSPYLDASPFPPHHPALDSAWTAAAAAAAAAFPSLPPPPGSASLPPSGAPLGLSTFLGAAVFRHPAFISPAFGRLFSTMAPLTSASTAAALLRQPTPAVEGAVASGALADPATAAADRRASSIAALRLKAKEHAAQLTQLNILPGTSTGKEVC,562,NP_620689.1.csv,refseq-ARX-NM_139058.2_clinical_seed_0_final,refseq-ARX-NM_139058.2.a2m,Invitae,refseq-ARX-NM_139058.2.npy,1,562,562
+NP_640334.2,MEEIKPASASCVSKEKPSKVSDLISRFEGGSSLSNYSDLKKESAVNLNAPRTPGRHGLTTTPQQKLLSQHLPQRQGNDTDKTQGAQTCVANGVMAAQNQMECEEEKAATLSSDTSIQASEPLLDTHIVNGERDETATAPASPTTDSCDGNASDSSYRTPGIGPVLPLEERGAETETKVQERENGESPLELEQLDQHHEMKETNEQKLHKIANELLLTERAYVNRLDLLDQVFYCKLLEEANRGSFPAEMVNKIFSNISSINAFHSKFLLPELEKRMQEWETTPRIGDILQKLAPFLKMYGEYVKGFDNAMELVKNMTERIPQFKSVVEEIQKQKICGSLTLQHHMLEPVQRIPRYEMLLKDYLRKLPPDSLDWNDAKKSLEIISTAASHSNSAIRKMENLKKLLEIYEMLGEEEDIVNPSNELIKEGQILKLAARNTSAQERYLFLFNNMLLYCVPKFSLVGSKFTVRTRVGIDGMKIVETQNEEYPHTFQVSGKERTLELQASSAQDKEEWIKALQETIDAFHQRHETFRNAIAKDNDIHSEVSTAELGKRAPRWIRDNEVTMCMKCKEPFNALTRRRHHCRACGYVVCWKCSDYKAQLEYDGGKLSKVCKDCYQIISGFTDSEEKKRKGILEIESAEVSGNSVVCSFLQYMEKSKPWQKAWCVIPKQDPLVLYMYGAPQDVRAQATIPLLGYVVDEMPRSADLPHSFKLTQSKSVHSFAADSEELKQKWLKVILLAVTGETPGGPNEHPATLDDHPEPKKKSEC,766,NP_640334.2.csv,refseq-FGD4-NM_139241.2_clinical_seed_0_final,refseq-FGD4-NM_139241.2.a2m,Invitae,refseq-FGD4-NM_139241.2.npy,1,766,766
+NP_640335.2,MRVLVRRCWGPPLAHGARRGRPSPQWRALARLGWEDCRDSRVREKPPWRVLFFGTDQFAREALRALHAARENKEEELIDKLEVVTMPSPSPKGLPVKQYAVQSQLPVYEWPDVGSGEYDVGVVASFGRLLNEALILKFPYGILNVHPSCLPRWRGPAPVIHTVLHGDTVTGVTIMQIRPKRFDVGPILKQETVPVPPKSTAKELEAVLSRLGANMLISVLKNLPESLSNGRQQPMEGATYAPKISAGTSCIKWEEQTSEQIFRLYRAIGNIIPLQTLWMANTIKLLDLVEVNSSVLADPKLTGQALIPGSVIYHKQSQILLVYCKDGWIGVRSVMLKKSLTATDFYNGYLHPWYQKNSQAQPSQCRFQTLRLPTKKKQKKTVAMQQCIE,389,NP_640335.2.csv,refseq-MTFMT-NM_139242.3_clinical_seed_0_final,refseq-MTFMT-NM_139242.3.a2m,Invitae,refseq-MTFMT-NM_139242.3_theta_0.2.npy,1,389,389
+NP_640341.1,MLRFYLFISLLCLSRSDAEETCPSFTRLSFHSAVVGTGLNVRLMLYTRKNLTCAQTINSSAFGNLNVTKKTTFIVHGFRPTGSPPVWMDDLVKGLLSVEDMNVVVVDWNRGATTLIYTHASSKTRKVAMVLKEFIDQMLAEGASLDDIYMIGVSLGAHISGFVGEMYDGWLGRITGLDPAGPLFNGKPHQDRLDPSDAQFVDVIHSDTDALGYKEPLGNIDFYPNGGLDQPGCPKTILGGFQYFKCDHQRSVYLYLSSLRESCTITAYPCDSYQDYRNGKCVSCGTSQKESCPLLGYYADNWKDHLRGKDPPMTKAFFDTAEESPFCMYHYFVDIITWNKNVRRGDITIKLRDKAGNTTESKINHEPTTFQKYHQVSLLARFNQDLDKVAAISLMFSTGSLIGPRYKLRILRMKLRSLAHPERPQLCRYDLVLMENVETVFQPILCPELQL,451,NP_640341.1.csv,refseq-LIPH-NM_139248.2_clinical_seed_0_final,refseq-LIPH-NM_139248.2.a2m,Invitae,refseq-LIPH-NM_139248.2.npy,1,451,451
+NP_644805.1,MAQWNQLQQLDTRYLEQLHQLYSDSFPMELRQFLAPWIESQDWAYAASKESHATLVFHNLLGEIDQQYSRFLQESNVLYQHNLRRIKQFLQSRYLEKPMEIARIVARCLWEESRLLQTAATAAQQGGQANHPTAAVVTEKQQMLEQHLQDVRKRVQDLEQKMKVVENLQDDFDFNYKTLKSQGDMQDLNGNNQSVTRQKMQQLEQMLTALDQMRRSIVSELAGLLSAMEYVQKTLTDEELADWKRRQQIACIGGPPNICLDRLENWITSLAESQLQTRQQIKKLEELQQKVSYKGDPIVQHRPMLEERIVELFRNLMKSAFVVERQPCMPMHPDRPLVIKTGVQFTTKVRLLVKFPELNYQLKIKVCIDKDSGDVAALRGSRKFNILGTNTKVMNMEESNNGSLSAEFKHLTLREQRCGNGGRANCDASLIVTEELHLITFETEVYHQGLKIDLETHSLPVVVISNICQMPNAWASILWYNMLTNNPKNVNFFTKPPIGTWDQVAEVLSWQFSSTTKRGLSIEQLTTLAEKLLGPGVNYSGCQITWAKFCKENMAGKGFSFWVWLDNIIDLVKKYILALWNEGYIMGFISKERERAILSTKPPGTFLLRFSESSKEGGVTFTWVEKDISGKTQIQSVEPYTKQQLNNMSFAEIIMGYKIMDATNILVSPLVYLYPDIPKEEAFGKYCRPESQEHPEADPGSAAPYLKTKFICVTPTTCSNTIDLPMSPRTLDSLMQFGNNGEGAEPSAGGQFESLTFDMELTSECATSPM,770,NP_644805.1.csv,refseq-STAT3-NM_139276.2_clinical_seed_0_final,refseq-STAT3-NM_139276.2.a2m,Invitae,refseq-STAT3-NM_139276.2.npy,1,770,770
+NP_644807.1,MAGLRARGGPGPGLLALSALGFCLMLQVSAKRPPKTPPCPPSCSCTRDTAFCVDSKAVPRNLPSEVISLTLVNAAFSEIQDGAFSHLPLLQFLLLNSNKFTLIGDNAFTGLSHLQYLFIENNDIWALSKFTFRGLKSLTHLSLANNNLQTLPRDIFRPLDILNDLDLRGNSLNCDCKVKWLVEWLAHTNTTVAPIYCASPPRFQEHKVQDLPLREFDCITTDFVLYQTLAFPAVSAEPFLYSSDLYLALAQPGVSACTILKWDYVERQLRDYDRIPAPSAVHCKPMVVDSQLYVVVAQLFGGSYIYHWDPNTTRFTRLQDIDPQRVRKPNDLEAFRIDGDWYFAVADSSKAGATSLYRWHQNGFYSHQALHPWHRDTDLEFVDGEGKPRLIVSSSSQAPVIYQWSRTQKQFVAQGEVTQVPDAQAVKHFRAGRDSYLCLSRYIGDSKILRWEGTRFSEVQALPSRGSLALQPFLVGGRRYLALGSDFSFTQIYQWDEGRQKFVRFQELAVQAPRAFCYMPAGDAQLLLAPSFKGQTLVYRHIVVDLSA,548,NP_644807.1.csv,refseq-LGI3-NM_139278.4_clinical_seed_0_final,refseq-LGI3-NM_139278.4.a2m,Invitae,refseq-LGI3-NM_139278.4_theta_0.2.npy,1,548,548
+NP_644813.1,MGGAGILLLLLAGAGVVVAWRPPKGKCPLRCSCSKDSALCEGSPDLPVSFSPTLLSLSLVRTGVTQLKAGSFLRIPSLHLLLFTSNSFSVIEDDAFAGLSHLQYLFIEDNEIGSISKNALRGLRSLTHLSLANNHLETLPRFLFRGLDTLTHVDLRGNPFQCDCRVLWLLQWMPTVNASVGTGACAGPASLSHMQLHHLDPKTFKCRAIELSWFQTVGESALSVEPFSYQGEPHIVLAQPFAGRCLILSWDYSLQRFRPEEELPAASVVSCKPLVLGPSLFVLAARLWGGSQLWARPSPGLRLAPTQTLAPRRLLRPNDAELLWLEGQPCFVVADASKAGSTTLLCRDGPGFYPHQSLHAWHRDTDAEALELDGRPHLLLASASQRPVLFHWTGGRFERRTDIPEAEDVYATRHFQAGGDVFLCLTRYIGDSMVMRWDGSMFRLLQQLPSRGAHVFQPLLIARDQLAILGSDFAFSQVLRLEPDKGLLEPLQELGPPALVAPRAFAHITMAGRRFLFAACFKGPTQIYQHHEIDLSA,537,NP_644813.1.csv,refseq-LGI4-NM_139284.2_clinical_seed_0_final,refseq-LGI4-NM_139284.2.a2m,Invitae,refseq-LGI4-NM_139284.2.npy,1,537,537
+NP_647479.2,MPGGKRGLVAPQNTFLENIVRRSSESSFLLGNAQIVDWPVVYSNDGFCKLSGYHRADVMQKSSTCSFMYGELTDKKTIEKVRQTFDNYESNCFEVLLYKKNRTPVWFYMQIAPIRNEHEKVVLFLCTFKDITLFKQPIEDDSTKGWTKFARLTRALTNSRSVLQQLTPMNKTEVVHKHSRLAEVLQLGSDILPQYKQEAPKTPPHIILHYCAFKTTWDWVILILTFYTAIMVPYNVSFKTKQNNIAWLVLDSVVDVIFLVDIVLNFHTTFVGPGGEVISDPKLIRMNYLKTWFVIDLLSCLPYDIINAFENVDEGISSLFSSLKVVRLLRLGRVARKLDHYLEYGAAVLVLLVCVFGLVAHWLACIWYSIGDYEVIDEVTNTIQIDSWLYQLALSIGTPYRYNTSAGIWEGGPSKDSLYVSSLYFTMTSLTTIGFGNIAPTTDVEKMFSVAMMMVGSLLYATIFGNVTTIFQQMYANTNRYHEMLNNVRDFLKLYQVPKGLSERVMDYIVSTWSMSKGIDTEKVLSICPKDMRADICVHLNRKVFNEHPAFRLASDGCLRALAVEFQTIHCAPGDLIYHAGESVDALCFVVSGSLEVIQDDEVVAILGKGDVFGDIFWKETTLAHACANVRALTYCDLHIIKREALLKVLDFYTAFANSFSRNLTLTCNLRKRIIFRKISDVKKEEEERLRQKNEVTLSIPVDHPVRKLFQKFKQQKELRNQGSTQGDPERNQLQVESRSLQNGASITGTSVVTVSQITPIQTSLAYVKTSESLKQNNRDAMELKPNGGADQKCLKVNSPIRMKNGNGKGWLRLKNNMGAHEEKKEDWNNVTKAESMGLLSEDPKSSDSENSVTKNPLRKTDSCDSGITKSDLRLDKAGEARSPLEHSPIQADAKHPFYPIPEQALQTTLQEVKHELKEDIQLLSCRMTALEKQVAEILKILSEKSVPQASSPKSQMPLQVPPQIPCQDIFSVSRPESPESDKDEIHF,988,NP_647479.2.csv,refseq-KCNH5-NM_139318.4_clinical_seed_0_final,refseq-KCNH5-NM_139318.4.a2m,Invitae,refseq-KCNH5-NM_139318.4.npy,1,988,988
+NP_647480.1,MPFKAFDTFKEKILKPGKEGVKNAVGDSLGILQRKIDGTTEEEDNIELNEEGRPVQTSRPSPPLCDCHCCGLPKRYIIAIMSGLGFCISFGIRCNLGVAIVEMVNNSTVYVDGKPEIQTAQFNWDPETVGLIHGSFFWGYIMTQIPGGFISNKFAANRVFGAAIFLTSTLNMFIPSAARVHYGCVMCVRILQGLVEGVTYPACHGMWSKWAPPLERSRLATTSFCGSYAGAVVAMPLAGVLVQYIGWSSVFYIYGMFGIIWYMFWLLQAYECPAAHPTISNEEKTYIETSIGEGANVVSLSKFSTPWKRFFTSLPVYAIIVANFCRSWTFYLLLISQPAYFEEVFGFAISKVGLLSAVPHMVMTIVVPIGGQLADYLRSRQILTTTAVRKIMNCGGFGMEATLLLVVGFSHTKGVAISFLVLAVGFSGFAISGFNVNHLDIAPRYASILMGISNGVGTLSGMVCPLIVGAMTRHKTREEWQNVFLIAALVHYSGVIFYGVFASGEKQEWADPENLSEEKCGIIDQDELAEEIELNHESFASPKKKMSYGATSQNCEVQKKEWKGQRGATLDEEELTSYQNEERNFSTIS,589,NP_647480.1.csv,refseq-SLC17A8-NM_139319.2_clinical_seed_0_final,refseq-SLC17A8-NM_139319.2.a2m,Invitae,refseq-SLC17A8-NM_139319.2.npy,1,589,589
+NP_647593.1,MAEMGSKGVTAGKIASNVQKKLTRAQEKVLQKLGKADETKDEQFEQCVQNFNKQLTEGTRLQKDLRTYLASVKAMHEASKKLNECLQEVYEPDWPGRDEANKIAENNDLLWMDYHQKLVDQALLTMDTYLGQFPDIKSRIAKRGRKLVDYDSARHHYESLQTAKKKDEAKIAKPVSLLEKAAPQWCQGKLQAHLVAQTNLLRNQAEEELIKAQKVFEEMNVDLQEELPSLWNSRVGFYVNTFQSIAGLEENFHKEMSKLNQNLNDVLVGLEKQHGSNTFTVKAQPSDNAPAKGNKSPSPPDGSPAATPEIRVNHEPEPAGGATPGATLPKSPSQLRKGPPVPPPPKHTPSKEVKQEQILSLFEDTFVPEISVTTPSQFEAPGPFSEQASLLDLDFDPLPPVTSPVKAPTPSGQSIPWDLWEPTESPAGSLPSGEPSAAEGTFAVSWPSQTAEPGPAQPAEASEVAGGTQPAAGAQEPGETAASEAASSSLPAVVVETFPATVNGTVEGGSGAGRLDLPPGFMFKVQAQHDYTATDTDELQLKAGDVVLVIPFQNPEEQDEGWLMGVKESDWNQHKELEKCRGVFPENFTERVP,593,NP_647593.1.csv,refseq-BIN1-NM_139343.2_clinical_seed_0_final,refseq-BIN1-NM_139343.2.a2m,Invitae,refseq-BIN1-NM_139343.2.npy,1,593,593
+NP_652763.1,MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTGIYQCQIYRSLLALPQDLQAARALMVISCLLSGIACACAVIGMKCTRCAKGTPAKTTFAILGGTLFILAGLLCMVAVSWTTNDVVQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGTLLCLSCQDEAPYRPYQAPPRATTTTANTAPAYQPPAAYKDNRAPSVTSATHSGYRLNDYV,239,NP_652763.1.csv,refseq-CLDN14-NM_144492.2_clinical_seed_0_final,refseq-CLDN14-NM_144492.2.a2m,Invitae,refseq-CLDN14-NM_144492.2.npy,1,239,239
+NP_653164.2,MQRPGPFSTLYGRVLAPLPGRAGGAASGGGGNSWDLPGSHVRLPGRAQSGTRGGAGNTSTSCGDSNSICPAPSTMSKAEEAKKLAGRAAVENHVRNNQVLGIGSGSTIVHAVQRIAERVKQENLNLVCIPTSFQARQLILQYGLTLSDLDRHPEIDLAIDGADEVDADLNLIKGGGGCLTQEKIVAGYASRFIVIADFRKDSKNLGDQWHKGIPIEVIPMAYVPVSRAVSQKFGGVVELRMAVNKAGPVVTDNGNFILDWKFDRVHKWSEVNTAIKMIPGVVDTGLFINMAERVYFGMQDGSVNMREKPFC,311,NP_653164.2.csv,refseq-RPIA-NM_144563.2_clinical_seed_0_final,refseq-RPIA-NM_144563.2.a2m,Invitae,refseq-RPIA-NM_144563.2.npy,1,311,311
+NP_653174.3,MNDISQKAEILLSSSKPVPKTYVPKLGKGDVKDKFEAMQRAREERNQRRSRDEKQRRKEQYIREREWNRRKQEIKEMLASDDEEDVSSKVEKAYVPKLTGTVKGRFAEMEKQRQEEQRKRTEEERKRRIEQDMLEKRKIQRELAKRAEQIEDINNTGTESASEEGDDSLLITVVPVKSYKTSGKMKKNFEDLEKEREEKERIKYEEDKRIRYEEQRPSLKEAKCLSLVMDDEIESEAKKESLSPGKLKLTFEELERQRQENRKKQAEEEARKRLEEEKRAFEEARRQMVNEDEENQDTAKIFKGYRPGKLKLSFEEMERQRREDEKRKAEEEARRRIEEEKKAFAEARRNMVVDDDSPEMYKTISQEFLTPGKLEINFEELLKQKMEEEKRRTEEERKHKLEMEKQEFEQLRQEMGEEEEENETFGLSREYEELIKLKRSGSIQAKNLKSKFEKIGQLSEKEIQKKIEEERARRRAIDLEIKEREAENFHEEDDVDVRPARKSEAPFTHKVNMKARFEQMAKAREEEEQRRIEEQKLLRMQFEQREIDAALQKKREEEEEEEGSIMNGSTAEDEEQTRSGAPWFKKPLKNTSVVDSEPVRFTVKVTGEPKPEITWWFEGEILQDGEDYQYIERGETYCLYLPETFPEDGGEYMCKAVNNKGSAASTCILTIESKN,675,NP_653174.3.csv,refseq-NEXN-NM_144573.3_clinical_seed_0_final,refseq-NEXN-NM_144573.3.a2m,Invitae,refseq-NEXN-NM_144573.3.npy,1,675,675
+NP_653178.3,MEGERRAYSKEVHQRINKQLEEIRRLEEVRGDLQVQISAAQNQVKRLRDSQRLENMDRLLKGRAQVQAEIEELQEQTRALDKQIQEWETRIFTHSKNVRSPGFILDQKVKIRRRIRILENQLDRVTCHFDNQLVRNAALREELDLLRIDRNRYLNVDRKLKKEIHHLHHLVSTLILSSTSAYAVREEAKAKMGLLRERAEKEEAQSEMEAQVLQRQILHLEQLHHFLKLKNNDRQPDPDVLEKREKQAGEVAEGVWKTSQERLVLCYEDALNKLSQLMGESDPDLLVQKYLEIEERNFAEFNFINEQNLELEHVQEEIKEMQEALVSARASKDDQHLLQEQQQKVLQQRMDKVHSEAERLEARFQDVRGQLEKLKADIQLLFTKAHCDSSMIDDLLGVKTSMGDRDMGLFLSLIEKRLVELLTVQAFLHAQSFTSLADAALLVLGQSLEDLPKKMAPLQPPDTLEDPPGFEASDDYPMSREELLSQVEKLVELQEQAEAQRQKDLAAAAAKLDGTLSVDLASTQRAGSSTVLVPTRHPHAIPGSILSHKTSRDRGSLGHVTFGGLSSSTGHLPSHITHGDPNTGHVTFGSTSASSGGHVTFRPVSASSYLGSTGYVGSSRGGENTEGGVESGGTASDSSGGLGSSRDHVSSTGPASSTGPGSSTSKDSRG,670,NP_653178.3.csv,refseq-CCDC114-NM_144577.3_clinical_seed_0_final,refseq-CCDC114-NM_144577.3.a2m,Invitae,refseq-CCDC114-NM_144577.3.npy,1,670,670
+NP_653186.2,MAFSELLDLVGGLGRFQVLQTMALMVSIMWLCTQSMLENFSAAVPSHRCWAPLLDNSTAQASILGSLSPEALLAISIPPGPNQRPHQCRRFRQPQWQLLDPNATATSWSEADTEPCVDGWVYDRSIFTSTIVAKWNLVCDSHALKPMAQSIYLAGILVGAAACGPASDRFGRRLVLTWSYLQMAVMGTAAAFAPAFPVYCLFRFLLAFAVAGVMMNTGTLLMEWTAARARPLVMTLNSLGFSFGHGLTAAVAYGVRDWTLLQLVVSVPFFLCFLYSWWLAESARWLLTTGRLDWGLQELWRVAAINGKGAVQDTLTPEVLLSAMREELSMGQPPASLGTLLRMPGLRFRTCISTLCWFAFGFTFFGLALDLQALGSNIFLLQMFIGVVDIPAKMGALLLLSHLGRRPTLAASLLLAGLCILANTLVPHEMGALRSALAVLGLGGVGAAFTCITIYSSELFPTVLRMTAVGLGQMAARGGAILGPLVRLLGVHGPWLPLLVYGTVPVLSGLAALLLPETQSLPLPDTIQDVQNQAVKKATHGTLGNSVLKSTQF,553,NP_653186.2.csv,refseq-SLC22A12-NM_144585.3_clinical_seed_0_final,refseq-SLC22A12-NM_144585.3.a2m,Invitae,refseq-SLC22A12-NM_144585.3.npy,1,553,553
+NP_653200.2,MGTAAAAAAAAAAAAAGEGARSPSPAAVSLGLGVAVVSSLVNGSTFVLQKKGIVRAKRRGTSYLTDIVWWAGTIAMAVGQIGNFLAYTAVPTVLVTPLGALGVPFGSILASYLLKEKLNILGKLGCLLSCAGSVVLIIHSPKSESVTTQAELEEKLTNPVFVGYLCIVLLMLLLLIFWIAPAHGPTNIMVYISICSLLGSFTVPSTKGIGLAAQDILHNNPSSQRALCLCLVLLAVLGCSIIVQFRYINKALECFDSSVFGAIYYVVFTTLVLLASAILFREWSNVGLVDFLGMACGFTTVSVGIVLIQVFKEFNFNLGEMNKSNMKTD,329,NP_653200.2.csv,refseq-NIPA1-NM_144599.4_clinical_seed_0_final,refseq-NIPA1-NM_144599.4.a2m,Invitae,refseq-NIPA1-NM_144599.4.npy,1,329,329
+NP_653213.6,MMPQKKRRRKKDIDFLALYEAELLNYASEDDEGELEHEYYKARVYEVVTATGDVRGAGTDANVFITLFGENGLSPKLQLTSKSKSAFEKGNVDVFRVRTNNVGLIYKVRIEHDNTGLNASWYLDHVIVTDMKRPHLRYYFNCNNWLSKVEGDRQWCRDLLASFNPMDMPRGNKYEVKVYTGDVIGAGTDADVFINIFGEYGDTGERRLENEKDNFEKGAEDRFILDAPDLGQLMKINVGHNNKGGSAGWFLSQIVIEDIGNKRKYDFPLNRWLALDEDDGKIQRDILVGGAETTAITYIVTVFTGDVRGAGTKSKIYLVMYGARGNKNSGKIFLEGGVFDRGRTDIFHIELAVLLSPLSRVSVGHGNVGVNRGWFCEKVVILCPFTGIQQTFPCSNWLDEKKADGLIERQLYEMVSLRKKRLKKFPWSLWVWTTDLKKAGTNSPIFIQIYGQKGRTDEILLNPNNKWFKPGIIEKFRIELPDLGRFYKIRVWHDKRSSGSGWHLERMTLMNTLNKDKYNFNCNRWLDANEDDNEIVREMTAEGPTVRRIMGMARYHVTVCTGELEGAGTDANVYLCLFGDVGDTGERLLYNCRNNTDLFEKGNADEFTIESVTMRNVRRVRIRHDGKGSGSGWYLDRVLVREEGQPESDNVEFPCLRWLDKDKDDGQLVRELLPSDSSATLKNFRYHISLKTGDVSGASTDSRVYIKLYGDKSDTIKQVLLVSDNNLKDYFERGRVDEFTLETLNIGNINRLVIGHDSTGMHASWFLGSVQIRVPRQGKQYTFPANRWLDKNQADGRLEVELYPSEVVEIQKLVHYEVEIWTGDVGGAGTSARVYMQIYGEKGKTEVLFLSSRSKVFERASKDTFQLEAADVGEVYKLRLGHTGEGFGPSWFVDTVWLRHLVVREVDLTPEEEARKKKEKDKLRQLLKKERLKAKLQRKKKKRKGSDEEDEGEEEESSSSEESSSEEEEMEEEEEEEEFGPGMQEVIEQHKFEAHRWLARGKEDNELVVELVPAGKPGPERNTYEVQVVTGNVPKAGTDANVYLTIYGEEYGDTGERPLKKSDKSNKFEQGQTDTFTIYAIDLGALTKIRIRHDNTGNRAGWFLDRIDITDMNNEITYYFPCQRWLAVEEDDGQLSRELLPVDESYVLPQSEEGRGGGDNNPLDNLALEQKDKSTTFSVTIKTGVKKNAGTDANVFITLFGTQDDTGMTLLKSSKTNSDKFERDSIEIFTVETLDLGDLWKVRLGHDNTGKAPGWFVDWVEVDAPSLGKCMTFPCGRWLAKNEDDGSIIRDLFHAELQTRLYTPFVPYEITLYTSDVFAAGTDANIFIIIYGCDAVCTQQKYLCTNKREQKQFFERKSASRFIVELEDVGEIIEKIRIGHNNTGMNPGWHCSHVDIRRLLPDKDGAETLTFPCDRWLATSEDDKKTIRELVPYDIFTEKYMKDGSLRQVYKEVEEPLDIVLYSVQIFTGNIPGAGTDAKVYITIYGDLGDTGERYLGKSENRTNKFERGTADTFIIEAADLGVIYKIKLRHDNSKWCADWYVEKVEIWNDTNEDEFLFLCGRWLSLKKEDGRLERLFYEKEYTGDRSSNCSSPADFWEIALSSKMADVDISTVTGPMADYVQEGPIIPYYVSVTTGKHKDAATDSRAFIFLIGEDDERSKRIWLDYPRGKRGFSRGSVEEFYVAGLDVGIIKKIEVLYEMTVWTGDVVGGGTDSNIFMTLYGINGSTEEMQLDKKKARFEREQNDTFIMEILDIAPFTKMRIRIDGLGSRPEWFLERILLKNMNTGDLTMFYYGDWLSQRKGKKTLVCEMCAVIDEEEMMEWTSYTVAVKTSDILGAGTDANVFIIIFGENGDSGTLALKQSANWNKFERNNTDTFNFPDMLSLGHLCKLRVWHDNKGIFPGWHLSYVDVKDNSRDETFHFQCDCWLSKSEGDGQTVRDFACANNKICDELEETTYEIVIETGNGGETRENVWLILEGRKNRSKEFLMENSSRQRAFRKGTTDTFEFDSIYLGDIASLCVGHLAREDRFIPKRELAWHVKTITITEMEYGNVYFFNCDCLIPLKRKRKYFKVFEVTKTTESFASKVQSLVPVKYEVIVTTGYEPGAGTDANVFVTIFGANGDTGKRELKQKMRNLFERGSTDRFFLETLELGELRKVRLEHDSSGYCSGWLVEKVEVTNTSTGVATIFNCGRWLDKKRGDGLTWRDLFPSV,2211,NP_653213.6.csv,refseq-LOXHD1-NM_144612.6_clinical_seed_0_final,refseq-LOXHD1-NM_144612.6.a2m,Invitae,refseq-LOXHD1-NM_144612.6.npy,1,2211,2211
+NP_653234.2,MPVMKGLLAPQNTFLDTIATRFDGTHSNFILANAQVAKGFPIVYCSDGFCELAGFARTEVMQKSCSCKFLFGVETNEQLMLQIEKSLEEKTEFKGEIMFYKKNGSPFWCLLDIVPIKNEKGDVVLFLASFKDITDTKVKITPEDKKEDKVKGRSRAGTHFDSARRRSRAVLYHISGHLQRREKNKLKINNNVFVDKPAFPEYKVSDAKKSKFILLHFSTFKAGWDWLILLATFYVAVTVPYNVCFIGNDDLSTTRSTTVSDIAVEILFIIDIILNFRTTYVSKSGQVIFEARSICIHYVTTWFIIDLIAALPFDLLYAFNVTVVSLVHLLKTVRLLRLLRLLQKLDRYSQHSTIVLTLLMSMFALLAHWMACIWYVIGKMEREDNSLLKWEVGWLHELGKRLESPYYGNNTLGGPSIRSAYIAALYFTLSSLTSVGFGNVSANTDAEKIFSICTMLIGALMHALVFGNVTAIIQRMYSRWSLYHTRTKDLKDFIRVHHLPQQLKQRMLEYFQTTWSVNNGIDSNELLKDFPDELRSDITMHLNKEILQLSLFECASRGCLRSLSLHIKTSFCAPGEYLLRQGDALQAIYFVCSGSMEVLKDSMVLAILGKGDLIGANLSIKDQVIKTNADVKALTYCDLQCIILKGLFEVLDLYPEYAHKFVEDIQHDLTYNLREGHESDVISRLSNKSMVSQSEPKGNGNINKRLPSIVEDEEEEEEGEEEEAVSLSPICTRGSSSRNKKVGSNKAYLGLSLKQLASGTVPFHSPIRVSRSNSPKTKQEIDPPNHNKRKEKNLKLQLSTLNNAGPPDLSPRIVDGIEDGNSSEESQTFDFGSERIRSEPRISPPLGDPEIGAAVLFIKAEETKQQINKLNSEVTTLTQEVSQLGKDMRNVIQLLENVLSPQQPSRFCSLHSTSVCPSRESLQTRTSWSAHQPCLHLQTGGAAYTQAQLCSSNITSDIWSVDPSSVGSSPQRTGAHEQNPADSELYHSPSLDYSPSHYQVVQEGHLQFLRCISPHSDSTLTPLQSISATLSSSVCSSSETSLHLVLPSRSEEGSFSQGTVSSFSLENLPGSWNQEGMASASTKPLENLPLEVVTSTAEVKDNKAINV,1107,NP_653234.2.csv,refseq-KCNH8-NM_144633.2_clinical_seed_0_final,refseq-KCNH8-NM_144633.2.a2m,Invitae,refseq-KCNH8-NM_144633.2.npy,1,1107,1107
+NP_653259.3,MAEVRKFTKRLSKPGTAAELRQSVSEAVRGSVVLEKAKVVEPLDYENVIAQRKTQIYSDPLRDLLMFPMEDISISVIGRQRRTVQSTVPEDAEKRAQSLFVKECIKTYSTDWHVVNYKYEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDEDSSSLCSQKGGVIKQGWLHKANVNSTITVTMKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESKGCIYLDACIDVVQCPKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKIIQINTDSLVQEKKETVETAQDDETSSQGKAENIMASLERSMHPELMKYGRETEQLNKLSRGDGRQNLFSFDSEVQRLDFSGIEPDIKPFEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFINLALFDVKNNCKISADFHVDLNPPSVREMLWGSSTQLASDGSPKGSSPESYIHGIAESQLRYIQQGIFSVTNPHPEIFLVARIEKVLQGNITHCAEPYIKNSDPVKTAQKVHRTAKQVCSRLGQYRMPFAWAARPIFKDTQGSLDLDGRFSPLYKQDSSKLSSEDILKLLSEYKKPEKTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFEKNCQNITVEVEEFVPEMTKYCYPFTIYKNHLYVYPLQLKYDSQKTFAKARNIAVCVEFRDSDESDASALKCIYGKPAGSVFTTNAYAVVSHHNQNPEFYDEIKIELPIHLHQKHHLLFTFYHVSCEINTKGTTKKQDTVETPVGFAWVPLLKDGRIITFEQQLPVSANLPPGYLNLNDAESRRQCNVDIKWVDGAKPLLKIKSHLESTIYTQDLHVHKFFHHCQLIQSGSKEVPGELIKYLKCLHAMEIQVMIQFLPVILMQLFRVLTNMTHEDDVPINCTMVLLHIVSKCHEEGLDSYLRSFIKYSFRPEKPSAPQAQLIHETLATTMIAILKQSADFLSINKLLKYSWFFFEIIAKSMATYLLEENKIKLPRGQRFPETYHHVLHSLLLAIIPHVTIRYAEIPDESRNVNYSLASFLKRCLTLMDRGFIFNLINDYISGFSPKDPKVLAEYKFEFLQTICNHEHYIPLNLPMAFAKPKLQRVQDSNLEYSLSDEYCKHHFLVGLLLRETSIALQDNYEIRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQLYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCGFTSPANRGSLSTDKDTAYGSFQNGHGIKREDSRGSLIPEGATGFPDQGNTGENTRQSSTRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIARVHDAWLSKHFGIDRKSQTMPALRNRSGVMQARLQHLSSLESSFTLNHSSTTTEADIFHQALLEGNTATEVSLTVLDTISFFTQCFKTQLLNNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFISKFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTHLQIIIAVSQLIADVALSGGSRFQESLFIINNFANSDRPMKATAFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLIDLQYSLAKSYASTPELRKTWLDSMAKIHVKNGDFSEAAMCYVHVAALVAEFLHRKKLFPNGCSAFKKITPNIDEEGAMKEDAGMMDVHYSEEVLLELLEQCVDGLWKAERYEIISEISKLIVPIYEKRREFEKLTQVYRTLHGAYTKILEVMHTKKRLLGTFFRVAFYGQSFFEEEDGKEYIYKEPKLTGLSEISLRLVKLYGEKFGTENVKIIQDSDKVNAKELDPKYAHIQVTYVKPYFDDKELTERKTEFERNHNISRFVFEAPYTLSGKKQGCIEEQCKRRTILTTSNSFPYVKKRIPINCEQQINLKPIDVATDEIKDKTAELQKLCSSTDVDMIQLQLKLQGCVSVQVNAGPLAYARAFLNDSQASKYPPKKVSELKDMFRKFIQACSIALELNERLIKEDQVEYHEGLKSNFRDMVKELSDIIHEQILQEDTMHSPWMSNTLHVFCAISGTSSDRGYGSPRYAEV,2073,NP_653259.3.csv,refseq-DOCK11-NM_144658.3_clinical_seed_0_final,refseq-DOCK11-NM_144658.3.a2m,Invitae,refseq-DOCK11-NM_144658.3_theta_0.2.npy,1,2073,2073
+NP_658985.2,MSRLRALLGLGLLVAGSRVPRIKSQTIACRSGPTWWGPQRLNSGGRWDSEVMASTVVKYLSQEEAQAVDQELFNEYQFSVDQLMELAGLSCATAIAKAYPPTSMSRSPPTVLVICGPGNNGGDGLVCARHLKLFGYEPTIYYPKRPNKPLFTALVTQCQKMDIPFLGEMPAEPMTIDELYELVVDAIFGFSFKGDVREPFHSILSVLKGLTVPIASIDIPSGWDVEKGNAGGIQPDLLISLTAPKKSATQFTGRYHYLGGRFVPPALEKKYQLNLPPYPDTECVYRLQ,288,NP_658985.2.csv,refseq-NAXE-NM_144772.2_clinical_seed_0_final,refseq-NAXE-NM_144772.2.a2m,Invitae,refseq-NAXE-NM_144772.2.npy,1,288,288
+NP_658986.1,MAAQNGNTSFTPNFNPPQDHASSLSFNFSYGDYDLPMDEDEDMTKTRTFFAAKIVIGIALAGIMLVCGIGNFVFIAALTRYKKLRNLTNLLIANLAISDFLVAIICCPFEMDYYVVRQLSWEHGHVLCASVNYLRTVSLYVSTNALLAIAIDRYLAIVHPLKPRMNYQTASFLIALVWMVSILIAIPSAYFATETVLFIVKSQEKIFCGQIWPVDQQLYYKSYFLFIFGVEFVGPVVTMTLCYARISRELWFKAVPGFQTEQIRKRLRCRRKTVLVLMCILTAYVLCWAPFYGFTIVRDFFPTVFVKEKHYLTAFYVVECIAMSNSMINTVCFVTVKNNTMKYFKKMMLLHWRPSQRGSKSSADLDLRTNGVPTTEEVDCIRLK,384,NP_658986.1.csv,refseq-PROKR2-NM_144773.2_clinical_seed_0_final,refseq-PROKR2-NM_144773.2.a2m,Invitae,refseq-PROKR2-NM_144773.2.npy,1,384,384
+NP_659403.4,MNSLSWGAANAVLLLLLLAWASPTFISINRGVRVMKGHSAFLSGDDLKFAIPKEKDACKVEVVMNEPITQRVGKLTPQVFDCHFLPNEVKYVHNGCPILDEDTVKLRLYRFTERDTFIETFILWVYLLEPDCNIIHMSNNVLEVPEFNGLSQAIDKNLLRFDYDRMASLECTVSLDTARTRLPAHGQMVLGEPRPEEPRGDQPHSFFPESQLRAKLKCPGGSCTPGLKKIGSLKVSCEEFLLMGLRYQHLDPPSPNIDYISIQLDLTDTRSKIVYKSESAWLPVYIRAGIPNQIPKAAFMAVFILEVDQFILTSLTTSVLDCEEDETPKPLLVFNITKAPLQGYVTHLLDHTRPISSFTWKDLSDMQIAYQPPNSSHSERRHDEVELEVYDFFFERSAPMTVHISIRTADTNAPRVSWNTGLSLLEGQSRAITWEQFQVVDNDDIGAVRLVTVGGLQHGWLTLRGGKGFLFTVADLQAGVVRYHHDDSDSTKDFVVFRIFDGHHSIRHKFPINVLPKDDSPPFLITNVVIELEEGQTILIQGSMLRASDVDASDDYIFFNITKPPQAGEIMKKPGPGLIGYPVHGFLQRDLFNGIIYYRHFGGEIFEDSFQFVLWDSHEPPNLSVPQVATIHITPVDDQLPKEAPGVSRHLVVKETEVAYITKKQLHFIDSESYDRELVYTITTPPFFSFSHRHLDAGKLFMVDSIPKVVKNPTALELRSFTQHAVNYMKVAYMPPMQDIGPHCRDVQFTFSVSNQHGGTLHGICFNITILPVDNQVPEAFTNPLKVTEGGQSIISTEHILISDADTKLDNIDLSLRELPLHGRVELNGFPLNSGGTFSWGDLHTLKVRYQHDGTEVLQDDLLLEVTDGTNSAEFVLHVEVFPVNDEPPVLKADLMPVMNCSEGGEVVITSEYIFATDVDSDNLKLMFVIAREPQHGVVRRAGVTVDQFSQRDVISEAVTYKHTGGEIGLMPCFDTITLVVSDGEAGPFVNGCCYNGPNPSVPLHASFPVYDLNITVYPVDNQPPSIAIGPVFVVDEGCSTALTVNHLSATDPDTAADDLEFVLVSPPQFGYLENILPSVGFEKSNIGISIDSFQWKDMNAFHINYVQSRHLRIEPTADQFTVYVTDGKHHSLEIPFSIIINPTNDEAPDFVVQNITVCEGQMKELDSSIISAVDLDIPQDALLFSITQKPRHGLLIDRGFSKDFSENKQPANPHQKHAPVHSFSMELLKTGMRLTYMHDDSESLADDFTIQLSDGKHKILKTISVEVIPVNDEKPMLSKKAEIAMNMGETRIISSAILSAIDEDSPREKIYYVFERLPQNGQLQLKIGRDWVPLSPGMKCTQEEVDLNLLRYTHTGAMDSQNQDSFTFYLWDGNNRSPALDCQITIKDMEKGDIVILTKPLVVSKGDRGFLTTTTLLAVDGTDKPEELLYVITSPPRYGQIEYVHYPGVPITNFSQMDVVGQTVCYVHKSKVTVSSDRFRFIISNGLRTEHGVFEITLETVDRALPVVTRNKGLRLAQGAVGLLSPDLLQLTDPDTPAENLTFLLVQLPQHGQLYLWGTGLLQHNFTQQDVDSKNVAYRHSGGDSQTDCFTFMATDGTNQGFIVNGRVWEEPVLFTIQVDQLDKTAPRITLLHSPSQVGLLKNGCYGIYITSRVLKASDPDTEDDQIIFKILQGPKHGHLENTTTGEFIHEKFSQKDLNSKTILYIINPSLEVNSDTVEFQIMDPTGNSATPQILELKWSHIEWSQTEYEVCENVGLLPLEIIRRGYSMDSAFVGIKVNQVSAAVGKDFTVIPSKLIQFDPGMSTKMWNIAITYDGLEEDDEVFEVILNSPVNAVLGTKTKAAVKILDSKGGQCHPSYSSNQSKHSTWEKGIWHLLPPGSSSSTTSGSFHLERRPLPSSMQLAVIRGDTLRGFDSTDLSQRKLRTRGNGKTVRPSSVYRNGTDIIYNYHGIVSLKLEDDSFPTHKRKAKVSIISQPQKTIKVAELPQADKVESTTDSHFPRQDQLPSFPKNCTLELKGLFHFEEGIQKLYQCNGIAWKAWSPQTKDVEDKSCPAGWHQHSGYCHILITEQKGTWNAAAQACREQYLGNLVTVFSRQHMRWLWDIGGRKSFWIGLNDQVHAGHWEWIGGEPVAFTNGRRGPSQRSKLGKSCVLVQRQGKWQTKDCRRAKPHNYVCSRKL,2179,NP_659403.4.csv,refseq-FREM1-NM_144966.5_clinical_seed_0_final,refseq-FREM1-NM_144966.5.a2m,Invitae,refseq-FREM1-NM_144966.5.npy,1,2179,2179
+NP_659428.2,MSALLSLCFVLPLAAPGHGTQGWEPCTDLRPLDILAEVVPSDGATSGIRIVQVHGARGLQLSVAAPRTMSFPASRIFSQCDLFPEEFSIVVTLRVPNLPPKRNEYLLTVVAEESDLLLLGLRLSPAQLHFLFLREDTAGAWQTRVSFRSPALVDGRWHTLVLAVSAGVFSLTTDCGLPVDIMADVPFPATLSVKGARFFVGSRRRAKGLFMGLVRQLVLLPGSDATPRLCPSRNAPLAVLSIPRVLQALTGKPEDNEVLKYPYETNIRVTLGPQPPCTEVEDAQFWFDASRKGLYLCVGNEWVSVLAAKERLDYVEEHQNLSTNSETLGIEVFRIPQVGLFVATANRKATSAVYKWTEEKFVSYQNIPTHQAQAWRHFTIGKKIFLAVANFEPDEKGQEFSVIYKWSHRKLKFTPYQSIATHSARDWEAFEVDGEHFLAVANHREGDNHNIDSVIYKWNPATRLFEANQTIATSGAYDWEFFSVGPYSFLVVANTFNGTSTKVHSHLYIRLLGSFQLFQSFPTFGAADWEVFQIGERIFLAVANSHSYDVEMQVQNDSYVINSVIYELNVTAQAFVKFQDILTCSALDWEFFSVGEDYFLVVANSFDGRTFSVNSIIYRWQGYEGFVAVHSLPTVGCRDWEAFSTTAGAYLIYSSAKEPLSRVLRLRTR,669,NP_659428.2.csv,refseq-TSPEAR-NM_144991.2_clinical_seed_0_final,refseq-TSPEAR-NM_144991.2.a2m,Invitae,refseq-TSPEAR-NM_144991.2.npy,1,669,669
+NP_659434.2,MNAIVALCHFCELHGPRTLFCTEVLHAPLPQGDGNEDSPGQGEQAEEEEGGIQMNSRMRAHSPAEGASVESSSPGPKKSDMCEGCRSLAAGHPGYISHDKETSIKYVSHQHPSHPQLFSIVRQACVRSLSCEVCPGREGPIFFGDEQHGFVFSHTFFIKDSLARGFQRWYSIITIMMDRIYLINSWPFLLGKVRGIIDELQGKALKVFEAEQFGCPQRAQRMNTAFTPFLHQRNGNAARSLTSLTSDDNLWACLHTSFAWLLKACGSRLTEKLLEGAPTEDTLVQMEKLADLEEESESWDNSEAEEEEKAPVLPESTEGRELTQGPAESSSLSGCGSWQPRKLPVFKSLRHMRQVLGAPSFRMLAWHVLMGNQVIWKSRDVDLVQSAFEVLRTMLPVGCVRIIPYSSQYEEAYRCNFLGLSPHVQIPPHVLSSEFAVIVEVHAAARSTLHPVGCEDDQSLSKYEFVVTSGSPVAADRVGPTILNKIEAALTNQNLSVDVVDQCLVCLKEEWMNKVKVLFKFTKVDSRPKEDTQKLLSILGASEEDNVKLLKFWMTGLSKTYKSHLMSTVRSPTASESRN,579,NP_659434.2.csv,refseq-FLCN-NM_144997.5_clinical_seed_0_final,refseq-FLCN-NM_144997.5.a2m,Invitae,refseq-FLCN-NM_144997.5.npy,1,579,579
+NP_659451.1,MEELLPDGQIWANMDPEERMLAAATAFTHICAGQGEGDVRREAQSIQYDPYSKASVAPGKRPALPVQLQYPHVESNVPSETVSEASQRLRKPVMKRKVLRRKPDGEVLVTDESIISESESGTENDQDLWDLRQRLMNVQFQEDKESSFDVSQKFNLPHEYQGISQDQLICSLQREGMGSPAYEQDLIVASRPKSFILPKLDQLSRNRGKTDRVARYFEYKRDWDSIRLPGEDHRKELRWGVREQMLCRAEPQSKPQHIYVPNNYLVPTEKKRSALRWGVRCDLANGVIPRKLPFPLSPS,299,NP_659451.1.csv,refseq-HYLS1-NM_145014.2_clinical_seed_0_final,refseq-HYLS1-NM_145014.2.a2m,Invitae,refseq-HYLS1-NM_145014.2.npy,1,299,299
+NP_659501.1,MTEKEVLESPKPSFPAETRQSGLQRLKQLLRKGSTGTKEMELPPEPQANGEAVGAGGGPIYYIYEEEEEEEEEEEEPPPEPPKLVNDKPHKFKDHFFKKPKFCDVCARMIVLNNKFGLRCKNCKTNIHEHCQSYVEMQRCFGKIPPGFHRAYSSPLYSNQQYACVKDLSAANRNDPVFETLRTGVIMANKERKKGQADKKNPVAAMMEEEPESARPEEGKPQDGNPEGDKKAEKKTPDDKHKQPGFQQSHYFVALYRFKALEKDDLDFPPGEKITVIDDSNEEWWRGKIGEKVGFFPPNFIIRVRAGERVHRVTRSFVGNREIGQITLKKDQIVVQKGDEAGGYVKVYTGRKVGLFPTDFLEEI,364,NP_659501.1.csv,refseq-STAC3-NM_145064.2_clinical_seed_0_final,refseq-STAC3-NM_145064.2.a2m,Invitae,refseq-STAC3-NM_145064.2.npy,1,364,364
+NP_659505.1,MKAHPKEMVPLMGKRVAAPSGNPAILPEKRPAEITPTKKSAHFFLEIEGFEPNPTVAKTSPPVFSKPMDSNIRQCISGNCDDMDSPQSPQDDVTETPSNPNSPSAQLAKEEQRRKKRRLKKRIFAAVSEGCVEELVELLVELQELCRRRHDEDVPDFLMHKLTASDTGKTCLMKALLNINPNTKEIVRILLAFAEENDILGRFINAEYTEEAYEGQTALNIAIERRQGDIAALLIAAGADVNAHAKGAFFNPKYQHEGFYFGETPLALAACTNQPEIVQLLMEHEQTDITSRDSRGNNILHALVTVAEDFKTQNDFVKRMYDMILLRSGNWELETTRNNDGLTPLQLAAKMGKAEILKYILSREIKEKRLRSLSRKFTDWAYGPVSSSLYDLTNVDTTTDNSVLEITVYNTNIDNRHEMLTLEPLHTLLHMKWKKFAKHMFFLSFCFYFFYNITLTLVSYYRPREEEAIPHPLALTHKMGWLQLLGRMFVLIWAMCISVKEGIAIFLLRPSDLQSILSDAWFHFVFFIQAVLVILSVFLYLFAYKEYLACLVLAMALGWANMLYYTRGFQSMGMYSVMIQKVILHDVLKFLFVYIVFLLGFGVALASLIEKCPKDNKDCSSYGSFSDAVLELFKLTIGLGDLNIQQNSKYPILFLFLLITYVILTFVLLLNMLIALMGETVENVSKESERIWRLQRARTILEFEKMLPEWLRSRFRMGELCKVAEDDFRLCLRINEVKWTEWKTHVSFLNEDPGPVRRTDFNKIQDSSRNNSKTTLNAFEEVEEFPETSV,790,NP_659505.1.csv,refseq-TRPV3-NM_145068.3_clinical_seed_0_final,refseq-TRPV3-NM_145068.3.a2m,Invitae,refseq-TRPV3-NM_145068.3.npy,1,790,790
+NP_660149.2,MGRLNEQRLFQPDLCDVDLVLVPQRSVFPAHKGVLAAYSQFFHSLFTQNKQLQRVELSLEALAPGGLQQILNFIYTSKLLVNAANVHEVLSAASLLQMADIAASCQELLDARSLGPPGPGTVALAQPAASCTPAAPPYYCDIKQEADTPGLPKIYAREGPDPYSVRVEDGAGTAGGTVPATIGPAQPFFKEEKEGGVEEAGGPPASLCKLEGGEELEEELGGSGTYSRREQSQIIVEVNLNNQTLHVSTGPEGKPGAGPSPATVVLGREDGLQRHSDEEEEDDEEEEEEEEEEEGGGSGREEEEEEEGGSQGEEEEEEEDGHSEQEEEEEEEEEEGPSEQDQESSEEEEGEEGEAGGKQGPRGSRSSRADPPPHSHMATRSRENARRRGTPEPEEAGRRGGKRPKPPPGVASASARGPPATDGLGAKVKLEEKQHHPCQKCPRVFNNRWYLEKHMNVTHSRMQICDQCGKRFLLESELLLHRQTDCERNIQCVTCGKAFKKLWSLHEHNKIVHGYAEKKFSCEICEKKFYTMAHVRKHMVAHTKDMPFTCETCGKSFKRSMSLKVHSLQHSGEKPFRCENCNERFQYKYQLRSHMSIHIGHKQFMCQWCGKDFNMKQYFDEHMKTHTGEKPYICEICGKSFTSRPNMKRHRRTHTGEKPYPCDVCGQRFRFSNMLKAHKEKCFRVSHTLAGDGVPAAPGLPPTQPQAHALPLLPGLPQTLPPPPHLPPPPPLFPTTASPGGRMNANN,747,NP_660149.2.csv,refseq-ZBTB47-NM_145166.3_clinical_seed_0_final,refseq-ZBTB47-NM_145166.3.a2m,Invitae,refseq-ZBTB47-NM_145166.3.npy,1,747,747
+NP_660161.1,MKSCKPSGPPAGARVAPPCAGGTECAGTCAGAGRLESAARRRLAANARERRRMQGLNTAFDRLRRVVPQWGQDKKLSKYETLQMALSYIMALTRILAEAERFGSERDWVGLHCEHFGRDHYLPFPGAKLPGESELYSQRLFGFQPEPFQMAT,152,NP_660161.1.csv,refseq-ATOH7-NM_145178.3_clinical_seed_0_final,refseq-ATOH7-NM_145178.3.a2m,Invitae,refseq-ATOH7-NM_145178.3.npy,1,152,152
+NP_660200.1,MLIPFSMKNCFQLLCNCQVPAAGFKKTVKNGLILQSISNDVYQNLAVEDWIHDHMNLEGKPILFFWQNSPSVVIGRHQNPWQECNLNLMREEGIKLARRRSGGGTVYHDMGNINLTFFTTKKKYDRMENLKLIVRALNAVQPQLDVQATKRFDLLLDGQFKISGTASKIGRTTAYHHCTLLCSTDGTFLSSLLKSPYQGIRSNATASIPSLVKNLLEKDPTLTCEVLMNAVATEYAAYHQIDNHIHLINPTDETLFPGINSKAKELQTWEWIYGKTPKFSINTSFHVLYEQSHLEIKVFIDIKNGRIEICNIEAPDHWLPLEIRDKLNSSLIGSKFCPTETTMLTNILLRTCPQDHKLNSKWNILCEKIKGIM,373,NP_660200.1.csv,refseq-LIPT1-NM_145199.2_clinical_seed_0_final,refseq-LIPT1-NM_145199.2.a2m,Invitae,refseq-LIPT1-NM_145199.2.npy,1,373,373
+NP_660201.1,MTTEQARGQQGPNLAIGRQKPPAGVVTPKSDAEEPPLTRKRSKKERGLRGSRKRTGSSGEQTGPEAPGSSNNPPSTGEGPAGAPPASPGPASSRQSHRHRPDSLHDAAQRTYGPLLNRVFGKDRELGPEELDELQAAFEEFDTDRDGYISHRELGDCMRTLGYMPTEMELLEVSQHIKMRMGGRVDFEEFVELIGPKLREETAHMLGVRELRIAFREFDRDRDGRITVAELREAVPALLGEPLAGPELDEMLREVDLNGDGTVDFDEFVMMLSRH,275,NP_660201.1.csv,refseq-CABP4-NM_145200.3_clinical_seed_0_final,refseq-CABP4-NM_145200.3.a2m,Invitae,refseq-CABP4-NM_145200.3.npy,1,275,275
+NP_660208.2,MSSKKNRKRLNQSAENGSSLPSAASSCAEARAPSAGSDFAATSGTLTVTNLLEKVDDKIPKTFQNSLIHLGLNTMKSANICIGRPVLLTSLNGKQEVYTAWPMAGFPGGKVGLSEMAQKNVGVRPGDAIQVQPLVGAVLQAEEMDVALSDKDMEINEEELTGCILRKLDGKIVLPGNFLYCTFYGRPYKLQVLRVKGADGMILGGPQSDSDTDAQRMAFEQSSMETSSLELSLQLSQLDLEDTQIPTSRSTPYKPIDDRITNKASDVLLDVTQSPGDGSGLMLEEVTGLKCNFESAREGNEQLTEEERLLKFSIGAKCNTDTFYFISSTTRVNFTEIDKNSKEQDNQFKVTYDMIGGLSSQLKAIREIIELPLKQPELFKSYGIPAPRGVLLYGPPGTGKTMIARAVANEVGAYVSVINGPEIISKFYGETEAKLRQIFAEATLRHPSIIFIDELDALCPKREGAQNEVEKRVVASLLTLMDGIGSEVSEGQVLVLGATNRPHALDAALRRPGRFDKEIEIGVPNAQDRLDILQKLLRRVPHLLTEAELLQLANSAHGYVGADLKVLCNEAGLCALRRILKKQPNLPDVKVAGLVKITLKDFLQAMNDIRPSAMREIAIDVPNVSWSDIGGLESIKLKLEQAVEWPLKHPESFIRMGIQPPKGVLLYGPPGCSKTMIAKALANESGLNFLAIKGPELMNKYVGESERAVRETFRKARAVAPSIIFFDELDALAVERGSSLGAGNVADRVLAQLLTEMDGIEQLKDVTILAATNRPDRIDKALMRPGRIDRIIYVPLPDAATRREIFKLQFHSMPVSNEVDLDELILQTDAYSGAEIVAVCREAALLALEEDIQANLIMKRHFTQALSTVTPRIPESLRRFYEDYQEKSGLHTL,893,NP_660208.2.csv,refseq-SPATA5-NM_145207.2_clinical_seed_0_final,refseq-SPATA5-NM_145207.2.a2m,Invitae,refseq-SPATA5-NM_145207.2.npy,1,893,893
+NP_663304.1,MSTASAASSSSSSSAGEMIEAPSQVLNFEEIDYKEIEVEEVVGRGAFGVVCKAKWRAKDVAIKQIESESERKAFIVELRQLSRVNHPNIVKLYGACLNPVCLVMEYAEGGSLYNVLHGAEPLPYYTAAHAMSWCLQCSQGVAYLHSMQPKALIHRDLKPPNLLLVAGGTVLKICDFGTACDIQTHMTNNKGSAAWMAPEVFEGSNYSEKCDVFSWGIILWEVITRRKPFDEIGGPAFRIMWAVHNGTRPPLIKNLPKPIESLMTRCWSKDPSQRPSMEEIVKIMTHLMRYFPGADEPLQYPCQYSDEGQSNSATSTGSFMDIASTNTSNKSDTNMEQVPATNDTIKRLESKLLKNQAKQQSESGRLSLGASRGSSVESLPPTSEGKRMSADMSEIEARIAATTAYSKPKRGHRKTASFGNILDVPEIVISGNGQPRRRSIQDLTVTGTEPGQVSSRSSSPSVRMITTSGPTSEKPTRSHPWTPDDSTDTNGSDNSIPMAYLTLDHQLQPLAPCPNSKESMAVFEQHCKMAQEYMKVQTEIALLLQRKQELVAELDQDEKDQQNTSRLVQEHKKLLDENKSLSTYYQQCKKQLEVIRSQQQKRQGTS,606,NP_663304.1.csv,refseq-MAP3K7-NM_145331.2_clinical_seed_0_final,refseq-MAP3K7-NM_145331.2.a2m,Invitae,refseq-MAP3K7-NM_145331.2.npy,1,606,606
+NP_665859.1,MRMTMEEMKNEAETTSMVSMPLYAVMYPVFNELERVNLSAAQTLRAAFIKAEKENPGLTQDIIMKILEKKSVEVNFTESLLRMAADDVEEYMIERPEPEFQDLNEKARALKQILSKIPDEINDRVRFLQTIKDIASAIKELLDTVNNVFKKYQYQNRRALEHQKKEFVKYSKSFSDTLKTYFKDGKAINVFVSANRLIHQTNLILQTFKTVA,212,NP_665859.1.csv,refseq-PDCD10-NM_145860.1_clinical_seed_0_final,refseq-PDCD10-NM_145860.1.a2m,Invitae,refseq-PDCD10-NM_145860.1.npy,1,212,212
+NP_665860.2,MGLRTTKQMGRGTKAPGHQEDHMVKEPVEDTDPSTLSFNMSDKYPIQDTELPKAEECDTITLNCPRNSDMKNQGEENGFPDSTGDPLPEISKDNSCKENCTCSSCLLRAPTISDLLNDQDLLDVIRIKLDPCHPTVKNWRNFASKWGMSYDELCFLEQRPQSPTLEFLLRNSQRTVGQLMELCRLYHRADVEKVLRRWVDEEWPKRERGDPSRHF,215,NP_665860.2.csv,refseq-EDARADD-NM_145861.2_clinical_seed_0_final,refseq-EDARADD-NM_145861.2.a2m,Invitae,refseq-EDARADD-NM_145861.2.npy,1,215,215
+NP_665877.1,MQAGKPILYSYFRSSCSWRVRIALALKGIDYETVPINLIKDGGQQFSKDFQALNPMKQVPTLKIDGITIHQSLAIIEYLEEMRPTPRLLPQDPKKRASVRMISDLIAGGIQPLQNLSVLKQVGEEMQLTWAQNAITCGFNALEQILQSTAGIYCVGDEVTMADLCLVPQVANAERFKVDLTPYPTISSINKRLLVLEAFQVSHPCRQPDTPTELRA,216,NP_665877.1.csv,refseq-GSTZ1-NM_145870.2_clinical_seed_0_final,refseq-GSTZ1-NM_145870.2.a2m,Invitae,refseq-GSTZ1-NM_145870.2.npy,1,216,216
+NP_665898.1,MLASQGVLLHPYGVPMIVPAAPYLPGLIQGNQEAAAAPDTMAQPYASAQFAPPQNGIPAEYTAPHPHPAPEYTGQTTVPEHTLNLYPPAQTHSEQSPADTSAQTVSGTATQTDDAAPTDGQPQTQPSENTENKSQPKRLHVSNIPFRFRDPDLRQMFGQFGKILDVEIIFNERGSKGFGFVTFENSADADRAREKLHGTVVEGRKIEVNNATARVMTNKKTVNPYTNGWKLNPVVGAVYSPEFYAGTVLLCQANQEGSSMYSAPSSLVYTSAMPGFPYPAATAAAAYRGAHLRGRGRTVYNTFRAAAPPPPIPAYGGVVYQEPVYGNKLLQGGYAAYRYAQPTPATAAAYSDSYGRVYAADPYHHALAPAPTYGVGAMNAFAPLTDAKTRSHADDVGLVLSSLQASIYRGGYNRFAPY,418,NP_665898.1.csv,refseq-RBFOX1-NM_145891.2_clinical_seed_0_final,refseq-RBFOX1-NM_145891.2.a2m,Invitae,refseq-RBFOX1-NM_145891.2.npy,1,418,418
+NP_671724.1,MLAASIFRPTLLLCWLAAPWPTQPESLFHSRDRSDLEPSPLRQAKPIADLHAAQRFLSRYGWSGVWAAWGPSPEGPPETPKGAALAEAVRRFQRANALPASGELDAATLAAMNRPRCGVPDMRPPPPSAPPSPPGPPPRARSRRSPRAPLSLSRRGWQPRGYPDGGAAQAFSKRTLSWRLLGEALSSQLSVADQRRIVALAFRMWSEVTPLDFREDLAAPGAAVDIKLGFGRGRHLGCPRAFDGSGQEFAHAWRLGDIHFDDDEHFTPPTSDTGISLLKVAVHEIGHVLGLPHTYRTGSIMQPNYIPQEPAFELDWSDRKAIQKLYGSCEGSFDTAFDWIRKERNQYGEVMVRFSTYFFRNSWYWLYENRNNRTRYGDPIQILTGWPGIPTHNIDAFVHIWTWKRDERYFFQGNQYWRYDSDKDQALTEDEQGKSYPKLISEGFPGIPSPLDTAFYDRRQKLIYFFKESLVFAFDVNRNRVLNSYPKRITEVFPAVIPQNHPFRNIDSAYYSYAYNSIFFFKGNAYWKVVNDKDKQQNSWLPANGLFPKKFISEKWFDVCDVHISTLNM,569,NP_671724.1.csv,refseq-MMP21-NM_147191.1_clinical_seed_0_final,refseq-MMP21-NM_147191.1.a2m,Invitae,refseq-MMP21-NM_147191.1.npy,1,569,569
+NP_671729.2,MAGWPGAGPLCVLGGAALGVCLAGVAGQLVEPSTAPPKPKPPPLTKETVVFWDMRLWHVVGIFSLFVLSIIITLCCVFNCRVPRTRKEIEARYLQRKAAKMYTDKLETVPPLNELTEVPGEDKKKKKKKKKDSVDTVAIKVEEDEKNEAKKKKGEK,156,NP_671729.2.csv,refseq-TMIE-NM_147196.2_clinical_seed_0_final,refseq-TMIE-NM_147196.2.a2m,Invitae,refseq-TMIE-NM_147196.2.npy,1,156,156
+NP_679211.2,MNRSIPVEVDESEPYPSQLLKPIPEYSPEEESEPPAPNIRNMAPNSLSAPTMLHNSSGDFSQAHSTLKLANHQRPVSRQVTCLRTQVLEDSEDSFCRRHPGLGKAFPSGCSAVSEPASESVVGALPAEHQFSFMEKRNQWLVSQLSAASPDTGHDSDKSDQSLPNASADSLGGSQEMVQRPQPHRNRAGLDLPTIDTGYDSQPQDVLGIRQLERPLPLTSVCYPQDLPRPLRSREFPQFEPQRYPACAQMLPPNLSPHAPWNYHYHCPGSPDHQVPYGHDYPRAAYQQVIQPALPGQPLPGASVRGLHPVQKVILNYPSPWDHEERPAQRDCSFPGLPRHQDQPHHQPPNRAGAPGESLECPAELRPQVPQPPSPAAVPRPPSNPPARGTLKTSNLPEELRKVFITYSMDTAMEVVKFVNFLLVNGFQTAIDIFEDRIRGIDIIKWMERYLRDKTVMIIVAISPKYKQDVEGAESQLDEDEHGLHTKYIHRMMQIEFIKQGSMNFRFIPVLFPNAKKEHVPTWLQNTHVYSWPKNKKNILLRLLREEEYVAPPRGPLPTLQVVPL,565,NP_679211.2.csv,refseq-TRAF3IP2-NM_147686.3_clinical_seed_0_final,refseq-TRAF3IP2-NM_147686.3.a2m,Invitae,refseq-TRAF3IP2-NM_147686.3.npy,1,565,565
+NP_680475.1,MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFMQLGTVGIYDLPHLRNKLVIK,137,NP_680475.1.csv,refseq-CTSC-NM_148170.5_clinical_seed_0_final,refseq-CTSC-NM_148170.5.a2m,Invitae,refseq-CTSC-NM_148170.5.npy,1,137,137
+NP_683695.1,MAALTDLSFMYRWFKNCNLVGNLSEKYVFITGCDSGFGNLLAKQLVDRGMQVLAACFTEEGSQKLQRDTSYRLQTTLLDVTKSESIKAAAQWVRDKVGEQGLWALVNNAGVGLPSGPNEWLTKDDFVKVINVNLVGLIEVTLHMLPMVKRARGRVVNMSSSGGRVAVIGGGYCVSKFGVEAFSDSIRRELYYFGVKVCIIEPGNYRTAILGKENLESRMRKLWERLPQETRDSYGEDYFRIYTDKLKNIMQVAEPRVRDVINSMEHAIVSRSPRIRYNPGLDAKLLYIPLAKLPTPVTDFILSRYLPRPADSV,313,NP_683695.1.csv,DR9C7_HUMAN_b07_clinical_seed_0_final,DR9C7_HUMAN_b07.a2m,EVE,DR9C7_HUMAN_b07_theta_0.2.npy,1,313,313
+NP_683720.2,MALLDVCGAPRGQRPESALPVAGSGRRSDPGHYSFSMRSPELALPRGMQPTEFFQSLGGDGERNVQIEMAHGTTTLAFKFQHGVIAAVDSRASAGSYISALRVNKVIEINPYLLGTMSGCAADCQYWERLLAKECRLYYLRNGERISVSAASKLLSNMMCQYRGMGLSMGSMICGWDKKGPGLYYVDEHGTRLSGNMFSTGSGNTYAYGVMDSGYRPNLSPEEAYDLGRRAIAYATHRDSYSGGVVNMYHMKEDGWVKVESTDVSDLLHQYREANQ,276,NP_683720.2.csv,refseq-PSMB8-NM_148919.3_clinical_seed_0_final,refseq-PSMB8-NM_148919.3.a2m,Invitae,refseq-PSMB8-NM_148919.3.npy,1,276,276
+NP_683763.2,MANSGLQLLGYFLALGGWVGIIASTALPQWKQSSYAGDAIITAVGLYEGLWMSCASQSTGQVQCKLYDSLLALDGHIQSARALMVVAVLLGFVAMVLSVVGMKCTRVGDSNPIAKGRVAIAGGALFILAGLCTLTAVSWYATLVTQEFFNPSTPVNARYEFGPALFVGWASAGLAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREPVVKLPASAKGPLGV,224,NP_683763.2.csv,refseq-CLDN19-NM_148960.2_clinical_seed_0_final,refseq-CLDN19-NM_148960.2.a2m,Invitae,refseq-CLDN19-NM_148960.2.npy,1,224,224
+NP_689476.2,MMEAIKKKMQMLKLDKENALDRAEQAEAEQKQAEERSKQLEDELAAMQKKLKGTEDELDKYSEALKDAQEKLELAEKKAADAEAEVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRALKDEEKMELQEIQLKEAKHIAEEADRKYEEVARKLVIIEGDLERTEERAELAESKCSELEEELKNVTNNLKSLEAQAEKYSQKEDKYEEEIKILTDKLKEAETRAEFAERSVAKLEKTIDDLEDELYAQKLKYKAISEELDHALNDMTSI,285,NP_689476.2.csv,refseq-TPM3-NM_152263.3_clinical_seed_0_final,refseq-TPM3-NM_152263.3.a2m,Invitae,refseq-TPM3-NM_152263.3.npy,1,285,285
+NP_689482.1,MSTVGLFHFPTPLTRICPAPWGLRLWEKLTLLSPGIAVTPVQMAGKKDYPALLSLDENELEEQFVKGHGPGGQATNKTSNCVVLKHIPSGIVVKCHQTRSVDQNRKLARKILQEKVDVFYNGENSPVHKEKREAAKKKQERKKRAKETLEKKKLLKELWESSKKVH,166,NP_689482.1.csv,refseq-C12orf65-NM_152269.4_clinical_seed_0_final,refseq-C12orf65-NM_152269.4.a2m,Invitae,refseq-C12orf65-NM_152269.4.npy,1,166,166
+NP_689505.1,MSSEMLPAFIETSNVDKKQGINEDQEESQKPRLGEGCEPISKRQMKKLIKQKQWEEQRELRKQKRKEKRKRKKLERQCQMEPNSDGHDRKRVRRDVVHSTLRLIIDCSFDHLMVLKDIKKLHKQIQRCYAENRRALHPVQFYLTSHGGQLKKNMDENDKGWVNWKDIHIKPEHYSELIKKEDLIYLTSDSPNILKELDESKAYVIGGLVDHNHHKGLTYKQASDYGINHAQLPLGNFVKMNSRKVLAVNHVFEIILEYLETRDWQEAFFTILPQRKGAVPTDKACESASHDNQSVRMEEGGSDSDSSEEEYSRNELDSPHEEKQDKENHTESTVNSLPH,339,NP_689505.1.csv,refseq-TRMT10A-NM_152292.4_clinical_seed_0_final,refseq-TRMT10A-NM_152292.4.a2m,Invitae,refseq-TRMT10A-NM_152292.4.npy,1,339,339
+NP_689509.1,MGDKKDDKDSPKKNKGKERRDLDDLKKEVAMTEHKMSVEEVCRKYNTDCVQGLTHSKAQEILARDGPNALTPPPTTPEWVKFCRQLFGGFSILLWIGAILCFLAYGIQAGTEDDPSGDNLYLGIVLAAVVIITGCFSYYQEAKSSKIMESFKNMVPQQALVIREGEKMQVNAEEVVVGDLVEIKGGDRVPADLRIISAHGCKVDNSSLTGESEPQTRSPDCTHDNPLETRNITFFSTNCVEGTARGVVVATGDRTVMGRIATLASGLEVGKTPIAIEIEHFIQLITGVAVFLGVSFFILSLILGYTWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTEDQSGTSFDKSSHTWVALSHIAGLCNRAVFKGGQDNIPVLKRDVAGDASESALLKCIELSSGSVKLMRERNKKVAEIPFNSTNKYQLSIHETEDPNDNRYLLVMKGAPERILDRCSTILLQGKEQPLDEEMKEAFQNAYLELGGLGERVLGFCHYYLPEEQFPKGFAFDCDDVNFTTDNLCFVGLMSMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPVSQVNPRDAKACVIHGTDLKDFTSEQIDEILQNHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGVAMGIAGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLLFIMANIPLPLGTITILCIDLGTDMVPAISLAYEAAESDIMKRQPRNPRTDKLVNERLISMAYGQIGMIQALGGFFSYFVILAENGFLPGNLVGIRLNWDDRTVNDLEDSYGQQWTYEQRKVVEFTCHTAFFVSIVVVQWADLIICKTRRNSVFQQGMKNKILIFGLFEETALAAFLSYCPGMDVALRMYPLKPSWWFCAFPYSFLIFVYDEIRKLILRRNPGGWVEKETYY,1013,NP_689509.1.csv,refseq-ATP1A3-NM_152296.4_clinical_seed_0_final,refseq-ATP1A3-NM_152296.4.a2m,Invitae,refseq-ATP1A3-NM_152296.4.npy,1,1013,1013
+NP_689518.1,MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIEEDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSYNVTRRKGYDQIIPKMLKTEL,392,NP_689518.1.csv,refseq-POGLUT1-NM_152305.2_clinical_seed_0_final,refseq-POGLUT1-NM_152305.2.a2m,Invitae,refseq-POGLUT1-NM_152305.2.npy,1,392,392
+NP_689549.3,MAEQEASGLQVLLHTLQSSSDKESILTILKVLGDLLSVGTDRRIHYMISKGGSEALLQTLVDTARTAPPDYDILLPLFRLLAKVGLRDKKIGRKALELEALDVTLILARKNLSHGQNLLHCLWALRVFASSVSMGAMLGINGAMELLFKVITPYTRKRTQAIRAATEVLAALLKSKSNGRRAVNRGYVTSLLGLHQDWHSHDTANAYVQIRRGLLLCLRHIAALRSGREAFLAAQGMEILFSTTQNCLDDKSMEPVISVVLQILRQCYPTSPLPLVTASSAYAFPVPGCITTEPPHDLPEEDFEDDGDDEVDKDSDTEDGKVEDDDLETDVNKLSSKPGLDRPEEELMQYEVMCLELSYSFEELQSKLGDDLNSEKTQYANHHHIPAAASSKQHCYSKDQSSCGQEREYAVQTSLLCRVKTGRSTVHLGSKKNPGVNLYQNVQSNSLRRDSSESEIPDIQASPKADAWDVDAIFCPRMSASFSNSTRTREVVKVIDKLLQTHLKRVPFHDPYLYMAKARRTSSVVDFKMMAFPDVWGHCPPPTTQPMLERKCGVQRIRIFEDIRRLIQPSDVINKVVFSLDEPWPLQDNASNCLRFFSKFESGNLRKAIQVREFEYDLLVNADVNSTQHQQWFYFKVSGMQAAIPYHFNIINCEKPNSQFNYGMQPTLYSVKEALLGKPTWIRTGHEICYYKNHYRQSTAVAGGASGKCYYTLTFAVTFPHSEDVCYLAYHYPYTYTALMTHLDILEKSVNLKEVYFRQDVLCQTLGGNPCPLVTITAMPESNSDEHLEQFRHRPYQVITARVHPGESNASWVMKGTLEFLVSSDPVARLLRENFIFKIIPMLNPDGVINGNHRCSLSGEDLNRQWLSPSAHLQPTIYHAKGLLYHLSSIGRSPVVFCDFHGHSQKKNVFLYGCSIKETLWQAACTVGTSTILEEVNYRTLPKILDKLAPAFTMSSCSFLVEKSRASTARVVVWREMGVSRSYTMESSYCGCNQGPYQCTQRLLERTKNERAHPVDGLQGLQFGTRELEEMGAMFCLGLLILELKSASCSHQLLAQAATLLSAEEDALDQHLQRLKSSNFLPKHIWFAYHFFAITNFFKMNLLLHVSPVCDT,1112,NP_689549.3.csv,NP_689549.3_clinical_seed_0_final,NP_689549.3.a2m,popEVE,NP_689549.3_theta_0.2.npy,1,1112,1112
+NP_689597.1,MSVLDALWEDRDVRFDLSAQQMKTRPGEVLIDCLDSIEDTKGNNGDRGRLLVTNLRILWHSLALSRVNVSVGYNCILNITTRTANSKLRGQTEALYILTKCNSTRFEFIFTNLVPGSPRLFTSVMAVHRAYETSKMYRDFKLRSALIQNKQLRLLPQEHVYDKINGVWNLSSDQGNLGTFFITNVRIVWHANMNDSFNVSIPYLQIRSIKIRDSKFGLALVIESSQQSGGYVLGFKIDPVEKLQESVKEINSLHKVYSASPIFGVDYEMEEKPQPLEALTVEQIQDDVEIDSDGHTDAFVAYFADGNKQQDREPVFSEELGLAIEKLKDGFTLQGLWEVMS,341,NP_689597.1.csv,refseq-BBS5-NM_152384.2_clinical_seed_0_final,refseq-BBS5-NM_152384.2.a2m,Invitae,refseq-BBS5-NM_152384.2.npy,1,341,341
+NP_689606.2,MALGLEQAEEQRLYQQTLLQDGLKDMLDHGKFLDCVVRAGEREFPCHRLVLAACSPYFRARFLAEPERAGELHLEEVSPDVVAQVLHYLYTSEIALDEASVQDLFAAAHRFQIPSIFTICVSFLQKRLCLSNCLAVFRLGLLLDCARLAVAARDFICAHFTLVARDADFLGLSADELIAIISSDGLNVEKEEAVFEAVMRWAGSGDAEAQAERQRALPTVFESVRCRLLPRAFLESRVERHPLVRAQPELLRKVQMVKDAHEGRITTLRKKKKGKDGAGAKEADKGTSKAKAEEDEEAERILPGILNDTLRFGMFLQDLIFMISEEGAVAYDPAANECYCASLSNQVPKNHVSLVTKENQVFVAGGLFYNEDNKEDPMSAYFLQFDHLDSEWLGMPPLPSPRCLFGLGEALNSIYVVGGREIKDGERCLDSVMCYDRLSFKWGESDPLPYVVYGHTVLSHMDLVYVIGGKGSDRKCLNKMCVYDPKKFEWKELAPMQTARSLFGATVHDGRIIVAAGVTDTGLTSSAEVYSITDNKWAPFEAFPQERSSLSLVSLVGTLYAIGGFATLETESGELVPTELNDIWRYNEEEKKWEGVLREIAYAAGATFLPVRLNVLCLTKM,621,NP_689606.2.csv,KLH40_HUMAN_b03_clinical_seed_0_final,KLH40_HUMAN_b03.a2m,EVE,KLH40_HUMAN_b03_theta_0.2.npy,1,621,621
+NP_689629.2,MAASAHGSVWGPLRLGIPGLCCRRPPLGLYARMRRLPGPEVSGRSVAAASGPGAWGTDHYCLELLRKRDYEGYLCSLLLPAESRSSVFALRAFNVELAQVKDSVSEKTIGLMRMQFWKKTVEDIYCDNPPHQPVAIELWKAVKRHNLTKRWLMKIVDEREKNLDDKAYRNIKELENYAENTQSSLLYLTLEILGIKDLHADHAASHIGKAQGIVTCLRATPYHGSRRKVFLPMDICMLHGVSQEDFLRRNQDKNVRDVIYDIASQAHLHLKHARSFHKTVPVKAFPAFLQTVSLEDFLKKIQRVDFDIFHPSLQQKNTLLPLYLYIQSWRKTY,333,NP_689629.2.csv,refseq-NDUFAF6-NM_152416.3_clinical_seed_0_final,refseq-NDUFAF6-NM_152416.3.a2m,Invitae,refseq-NDUFAF6-NM_152416.3.npy,1,333,333
+NP_689632.2,MSGAGRALAALLLAASVLSAALLAPGGSSGRDAQAAPPRDLDKKRHAELKMDQALLLIHNELLWTNLTVYWKSECCYHCLFQVLVNVPQSPKAGKPSAAAASVSTQHGSILQLNDTLEEKEVCRLEYRFGEFGNYSLLVKNIHNGVSEIACDLAVNEDPVDSNLPVSIAFLIGLAVIIVISFLRLLLSLDDFNNWISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGLWTGTPFFYPGMNSILVYVGHEVFENYFPFQWKLKDNQSHKEHLTQNIVATALWVLIAYILYRKKIFWKI,635,NP_689632.2.csv,refseq-HGSNAT-NM_152419.2_clinical_seed_0_final,refseq-HGSNAT-NM_152419.2.a2m,Invitae,refseq-HGSNAT-NM_152419.2.npy,1,635,635
+NP_689656.2,MLVTLGLLTSFFSFLYMVAPSIRKFFAGGVCRTNVQLPGKVVVITGANTGIGKETARELASRGARVYIACRDVLKGESAASEIRVDTKNSQVLVRKLDLSDTKSIRAFAEGFLAEEKQLHILINNAGVMMCPYSKTADGFETHLGVNHLGHFLLTYLLLERLKVSAPARVVNVSSVAHHIGKIPFHDLQSEKRYSRGFAYCHSKLANVLFTRELAKRLQGTGVTTYAVHPGVVRSELVRHSSLLCLLWRLFSPFVKTAREGAQTSLHCALAEGLEPLSGKYFSDCKRTWVSPRARNNKTAERLWNVSCELLGIRWE,316,NP_689656.2.csv,refseq-RDH12-NM_152443.2_clinical_seed_0_final,refseq-RDH12-NM_152443.2.a2m,Invitae,refseq-RDH12-NM_152443.2.npy,1,316,316
+NP_689677.1,MASSLLAGERLVRALGPGGELEPERLPRKLRAELEAALGKKHKGGDSSSGPQRLVSFRLIRDLHQHLRERDSKLYLHELLEGSEIYLPEVVKPPRNPELVARLEKIKIQLANEEYKRITRNVTCQDTRHGGTLSDLGKQVRSLKALVITIFNFIVTVVAAFVCTYLGSQYIFTEMASRVLAALIVASVVGLAELYVMVRAMEGELGEL,208,NP_689677.1.csv,refseq-TMEM199-NM_152464.2_clinical_seed_0_final,refseq-TMEM199-NM_152464.2.a2m,Invitae,refseq-TMEM199-NM_152464.2.npy,1,208,208
+NP_689703.1,MRNWLVLLCPCVLGAALHLWLRLRSPPPACASGAGPADQLALFPQWKSTHYDVVVGVLSARNNHELRNVIRSTWMRHLLQHPTLSQRVLVKFIIGAHGCEVPVEDREDPYSCKLLNITNPVLNQEIEAFSLSEDTSSGLPEDRVVSVSFRVLYPIVITSLGVFYDANDVGFQRNITVKLYQAEQEEALFIARFSPPSCGVQVNKLWYKPVEQFILPESFEGTIVWESQDLHGLVSRNLHKVTVNDGGGVLRVITAGEGALPHEFLEGVEGVAGGFIYTIQEGDALLHNLHSRPQRLIDHIRNLHEEDALLKEESSIYDDIVFVDVVDTYRNVPAKLLNFYRWTVETTSFNLLLKTDDDCYIDLEAVFNRIVQKNLDGPNFWWGNFRLNWAVDRTGKWQELEYPSPAYPAFACGSGYVISKDIVKWLASNSGRLKTYQGEDVSMGIWMAAIGPKRYQDSLWLCEKTCETGMLSSPQYSPWELTELWKLKERCGDPCRCQAR,500,NP_689703.1.csv,B3GL2_HUMAN_b01_clinical_seed_0_final,B3GL2_HUMAN_b01.a2m,EVE,B3GL2_HUMAN_b01_theta_0.2.npy,1,500,500
+NP_689726.3,MAVRQAATAGTPGPRREEEAALLFERAHYRHDPRWLLPVTPRLCLACALELLPDPGVSLVRKKHMLSCFQDALVRHTSLVTQLVSQDQRVCIHFISVLFGLLCSMEDGSVTDLCIEVLIQITTQLKLEQTIRCLLDECHKELCNMPSMRGSLATLTLLGKLVDAIPALADELVMEHGNLMEHLLRGLVYPSEGIQASVCYLYGKLYSSPVAAEMLSGHFREKLFPLFLSILDGAQTKELQINCLGLLRQLLKYDLFVSMIMNQDGLGESAKNIEGSSGNTSLPLVLKKLLLSRDETLQVASAHCITAVLVHSPAKHASAFIHADIPEFLFEHLSSSSEVLVWSSCNCLTLLVEEPLFFSKCHTVYGIEAVVRSLQGSLKMNNIELHKQGLLLFAEILTRQPEEIKLFTSSAMCRDAGRALQEAVSSPVLEVAAEALKATSAFLRKDHQSTPPVQYGELQALLEAMLNRCAEFSQTLLSRRPLGHASSRDSEKAILQRGKFLLSTLEGFRSACRLAIEFQSEPSAQENPFTAPSAKKEDTLEAFSEFLLSACDSLCIPMVMRHLEQTTHPALMEVFLSILHNLFVIVPHMKEKFSKKLASSSFIRLTLELKARFCSGLSHSALNQVCSNFLYYMCLNLLSAPEKTGPPSKEELSAVSELLQHGLPQISSRSPESLAFLSDRQYMEGAARQRQYCILLLFYLAYIHEDRFVSEAELFEAVQSFLLSLQDQGERPPLVVFKASIYLLAICQDKDNTLRETMVSAIRKFLEGIPDLQLVYTHHPLLLRFFLLYPELMSRYGHRVLELWFFWEESSYEELDDVTSAGQPALPASLVVLFQLLRSIPSILLILLDLIYSSPVDTAHKVLISLRTFLRRNEDIQVGGLIRGHFLLILQRLLVEHGASPSGASGNLPLLLSLLSLMQLRNVSEQELDSVAMKLLHQVSKLCGKCSPTDVDILQPSFNFLYWSLHQTTPSSQKRAAAVLLSSTGLMELLEKMLALTLAKADSPRTALLCSAWLLTASFSAQQHKGSLQVHQTLSVEMDQVLKALSFPKKKAALLSAAILCFLRTALRQSFSSALVALVPSGAQPLPATKDTVLAPLRMSQVRSLVIGLQNLLVQKDPLLSQACVGCLEALLDYLDARSPDIALHVASQPWNRFLLFTLLDAGENSFLRPEILRLMTLFMRYRSSSVLSHEEVGDVLQGVALADLSTLSNTTLQALHGFFQQLQSMGHLADHSMAQTLQASLEGLPPSTSSGQPPLQDMLCLGGVAVSLSHIRN,1274,NP_689726.3.csv,refseq-MEI1-NM_152513.3_clinical_seed_0_final,refseq-MEI1-NM_152513.3.a2m,Invitae,refseq-MEI1-NM_152513.3.npy,1,1274,1274
+NP_689735.1,MSFAESGWRSALRRRGPGTPGPVARPSYSSFTQGDSWGEGEVDEEEGCDQVARDLRAEFSAGAWSEPRKRSVLPPDGNGSPVLPDKRNGIFPAAAGSRAQPRRWPVQVLSILCSLLFAILLAFLLAIAYLIVKELHAENLKNEDDVDTGLLGFWTLLIISLTAGFSCCSFSWTVTYFDSFEPGMFPPTPLSPARFKKLTGHSFHMGYSMAILNGIVAALTVAWCLM,226,NP_689735.1.csv,refseq-ARL6IP6-NM_152522.5_clinical_seed_0_final,refseq-ARL6IP6-NM_152522.5.a2m,Invitae,refseq-ARL6IP6-NM_152522.5.npy,1,226,226
+NP_689807.1,MSEETATSDNDNSYARVRAVVMTRDDSSGGWLPLGGSGLSSVTVFKVPHQEENGCADFFIRGERLRDKMVVLECMLKKDLIYNKVTPTFHHWKIDDKKFGLTFQSPADARAFDRGIRRAIEDISQGCPESKNEAEGADDLQANEEDSSSSLVKDHLFQQETVVTSEPYRSSNIRPSPFEDLNARRVYMQSQANQITFGQPGLDIQSRSMEYVQRQISKECGSLKSQNRVPLKSIRHVSFQDEDEIVRINPRDILIRRYADYRHPDMWKNDLERDDADSSIQFSKPDSKKSDYLYSCGDETKLSSPKDSVVFKTQPSSLKIKKSKRRKEDGERSRCVYCQERFNHEENVRGKCQDAPDPIKRCIYQVSCMLCAESMLYHCMSDSEGDFSDPCSCDTSDDKFCLRWLALVALSFIVPCMCCYVPLRMCHRCGEACGCCGGKHKAAG,444,NP_689807.1.csv,refseq-SPRED1-NM_152594.2_clinical_seed_0_final,refseq-SPRED1-NM_152594.2.a2m,Invitae,refseq-SPRED1-NM_152594.2.npy,1,444,444
+NP_689831.2,MVMACRVVNKRRHMGLQQLSSFAETGRTFLGPLKSSKFIIDEECHESVLISSTVRLLESLDLTSAVGQLLNEAVQAQNNTYRTGISTLLFLVGAWSSAVEECLHLGVPISIIVSVMSEGLNFCSEEVVSLHVPVHNIFDCMDSTKTFSQLETFSVSLCPFLQVPSDTDLIEELHGLKDVASQTLTISNLSGRPLKSYELFKPQTKVEADNNTSRTLKNSLLADTCCRQSILIHSRHFNRTDNTEGVSKPDGFQEHVTATHKTYRCNDLVELAVGLSHGDHSSMKLVEEAVQLQYQNACVQQGNCTKPFMFDISRIFTCCLPGLPETSSCVCPGYITVVSVSNNPVIKELQNQPVRIVLIEGDLTENYRHLGFNKSANIKTVLDSMRLQEDSSEELWANHVLQVLIQFKVNLVLVQGNVSERLIEKCINSKRLVIGSVNGSVMQAFAEAAGAVQVAYITQVNEDCVGDGVCVTFWRSSPLDVVDRNNRIAILLKTEGINLVTAVLTNPVTAQMQIKEDRFWTCAYRLYYALKEEKVFLGGGAVEFLCLSCLHILAEQSLKKENHACSGWLHNTSSWLASSLAIYRPTVLKFLANGWQKYLSTLLYNTANYSSEFEASTYIQHHLQNATDSGSPSSYILNEYSKLNSRIFNSDISNKLEQIPRVYDVVTPKIEAWRRALDLVLLVLQTDSEIITGHGHTQINSQELTGFLFL,710,NP_689831.2.csv,refseq-BBS12-NM_152618.2_clinical_seed_0_final,refseq-BBS12-NM_152618.2.a2m,Invitae,refseq-BBS12-NM_152618.2.npy,1,710,710
+NP_689854.2,MANSTGKAPPDERRKGLAFLDELRQFHHSRGSPFKKIPAVGGKELDLHGLYTRVTTLGGFAKVSEKNQWGEIVEEFNFPRSCSNAAFALKQYYLRYLEKYEKVHHFGEDDDEVPPGNPKPQLPIGAIPSSYNYQQHSVSDYLRQSYGLSMDFNSPNDYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDPKIITLLLANAGVFDDTLGSFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNKSHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAVILRNLSFEEGNVKLLAANRTCLRFLLLSAHSHFISLRQLGLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDRFLKMRGMEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVISTLEVLYMLTEMGDVACTKIAKVEKSIDMLVCLVSMDIQMFGPDALAAVKLIEHPSSSHQMLSEIRPQAIEQVQTQTHVASAPASRAVVAQHVAPPPGIVEIDSEKFACQWLNAHFEVNPDCSVSRAEMYSEYLSTCSKLARGGILTSTGFYKCLRTVFPNHTVKRVEDSSSNGQAHIHVVGVKRRAIPLPIQMYYQQQPVSTSVVRVDSVPDVSPAPSPAGIPHGSQTIGNHFQRTPVANQSSNLTATQMSFPVQGVHTVAQTVSRIPQNPSPHTHQQQNAPVTVIQSKAPIPCEVVKATVIQNSIPQTGVPVSIAVGGGPPQSSVVQNHSTGPQPVTVVNSQTLLHHPSVIPQQSPLHTVVPGQIPSGTPVTVIQQAVPQSHMFGRVQNIPACTSTVSQGQQLITTSPQPVQTSSQQTSAGSQSQDTVIIAPPQYVTTSASNIVSATSVQNFQVATGQMVTIAGVPSPQASRVGFQNIAPKPLPSQQVSSTVVQQPIQQPQQPTQQSVVIVSQPAQQGQTYAPAIHQIVLANPAALPAGQTVQLTGQPNITPSSSPSPVPATNNQVPTAMSSSSTPQSQGPPPTVSQMLSVKRQQQQQHSPAPPPQQVQVQVQQPQQVQMQVQPQQSNAGVGQPASGESSLIKQLLLPKRGPSTPGGKLILPAPQIPPPNNARAPSPQVVYQVASNQAAGFGVQGQTPAQQLLVGQQNVQLVPSAMPPSGGVQTVPISNLQILPGPLISNSPATIFQGTSGNQVTITVVPNTSFAPATVSQGNATQLIAPAGITMSGTQTGVGLPVQTLPATQASPAGQSSCTTATPPFKGDKIICQKEEEAKEATGLHVHERKIEVMENPSCRRGATNTSNGDTKENEMHVGSLLNGRKYSDSSLPPSNSGKIQSETNQCSLISNGPSLELGENGASGKQNSEQIDMQDIKSDLRKPLVNGICDFDKGDGSHLSKNIPNHKTSNHVGNGEISPMEPQGTLDITQQDTAKGDQLERISNGPVLTLGGSSVSSIQEASNAATQQFSGTDLLNGPLASSLNSDVPQQRPSVVVSPHSTTSVIQGHQIIAVPDSGSKVSHSPALSSDVRSTNGTAECKTVKRPAEDTDRETVAGIPNKVGVRIVTISDPNNAGCSATMVAVPAGADPSTVAKVAIESAVQQKQQHPPTYVQNVVPQNTPMPPSPAVQVQGQPNSSQPSPFSGSSQPGDPMRKPGQNFMCLWQSCKKWFQTPSQVFYHAATEHGGKDVYPGQCLWEGCEPFQRQRFSFITHLQDKHCSKDALLAGLKQDEPGQAGSQKSSTKQPTVGGTSSTPRAQKAIVNHPSAALMALRRGSRNLVFRDFTDEKEGPITKHIRLTAALILKNIGKYSECGRRLLKRHENNLSVLAISNMEASSTLAKCLYELNFTVQSKEQEKDSEMLQ,1835,NP_689854.2.csv,refseq-ARID2-NM_152641.2_clinical_seed_0_final,refseq-ARID2-NM_152641.2.a2m,Invitae,refseq-ARID2-NM_152641.2.npy,1,1835,1835
+NP_689862.1,MENLKHIITLGQVIHKRCEEMKYCKKQCRRLGHRVLGLIKPLEMLQDQGKRSVPSEKLTTAMNRFKAALEEANGEIEKFSNRSNICRFLTASQDKILFKDVNRKLSDVWKELSLLLQVEQRMPVSPISQGASWAQEDQQDADEDRRAFQMLRRDNEKIEASLRRLEINMKEIKETLRQYLPPKCMQEIPQEQIKEIKKEQLSGSPWILLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVRQTFNKEIKTMKKFESPNILRIFGICIDETVTPPQFSIVMEYCELGTLRELLDREKDLTLGKRMVLVLGAARGLYRLHHSEAPELHGKIRSSNFLVTQGYQVKLAGFELRKTQTSMSLGTTREKTDRVKSTAYLSPQELEDVFYQYDVKSEIYSFGIVLWEIATGDIPFQGCNSEKIRKLVAVKRQQEPLGEDCPSELREIIDECRAHDPSVRPSVDEILKKLSTFSK,471,NP_689862.1.csv,refseq-MLKL-NM_152649.3_clinical_seed_0_final,refseq-MLKL-NM_152649.3.a2m,Invitae,refseq-MLKL-NM_152649.3.npy,1,471,471
+NP_689935.2,MKRERGALSRASRALRLAPFVYLLLIQTDPLEGVNITSPVRLIHGTVGKSALLSVQYSSTSSDRPVVKWQLKRDKPVTVVQSIGTEVIGTLRPDYRDRIRLFENGSLLLSDLQLADEGTYEVEISITDDTFTGEKTINLTVDVPISRPQVLVASTTVLELSEAFTLNCSHENGTKPSYTWLKDGKPLLNDSRMLLSPDQKVLTITRVLMEDDDLYSCMVENPISQGRSLPVKITVYRRSSLYIILSTGGIFLLVTLVTVCACWKPSKRKQKKLEKQNSLEYMDQNDDRLKPEADTLPRSGEQERKNPMALYILKDKDSPETEENPAPEPRSATEPGPPGYSVSPAVPGRSPGLPIRSARRYPRSPARSPATGRTHSSPPRAPSSPGRSRSASRTLRTAGVHIIREQDEAGPVEISA,416,NP_689935.2.csv,refseq-HEPACAM-NM_152722.4_clinical_seed_0_final,refseq-HEPACAM-NM_152722.4.a2m,Invitae,refseq-HEPACAM-NM_152722.4.npy,1,416,416
+NP_689956.2,MDPECAQLLPALCAVLVDPRQPVADDTCLEKLLDWFKTVTEGESSVVLLQEHPCLVELLSHVLKVQDLSSGVLSFSLRLAGTFAAQENCFQYLQQGELLPGLFGEPGPLGRATWAVPTVRSGWIQGLRSLAQHPSALRFLADHGAVDTIFSLQGDSSLFVASAASQLLVHVLALSMRGGAEGQPCLPGGDWPACAQKIMDHVEESLCSAATPKVTQALNVLTTTFGRCQSPWTEALWVRLSPRVACLLERDPIPAAHSFVDLLLCVARSPVFSSSDGSLWETVARALSCLGPTHMGPLALGILKLEHCPQALRTQAFQVLLQPLACVLKATVQAPGPPGLLDGTADDATTVDTLLASKSSCAGLLCRTLAHLEELQPLPQRPSPWPQASLLGATVTVLRLCDGSAAPASSVGGHLCGTLAGCVRVQRAALDFLGTLSQGTGPQELVTQALAVLLECLESPGSSPTVLKKAFQATLRWLLSSPKTPGCSDLGPLIPQFLRELFPVLQKRLCHPCWEVRDSALEFLTQLSRHWGGQADFRCALLASEVPQLALQLLQDPESYVRASAVTAMGQLSSQGLHAPTSPEHAEARQSLFLELLHILSVDSEGFPRRAVMQVFTEWLRDGHADAAQDTEQFVATVLQAASRDLDWEVRAQGLELALVFLGQTLGPPRTHCPYAVALPEVAPAQPLTEALRALCHVGLFDFAFCALFDCDRPVAQKSCDLLLFLRDKIASYSSLREARGSPNTASAEATLPRWRAGEQAQPPGDQEPEAVLAMLRSLDLEGLRSTLAESSDHVEKSPQSLLQDMLATGGFLQGDEADCY,821,NP_689956.2.csv,refseq-BRAT1-NM_152743.3_clinical_seed_0_final,refseq-BRAT1-NM_152743.3.a2m,Invitae,refseq-BRAT1-NM_152743.3.npy,1,821,821
+NP_689966.2,MGSGRVPGLCLLVLLVHARAAQYSKAAQDVDECVEGTDNCHIDAICQNTPRSYKCICKSGYTGDGKHCKDVDECEREDNAGCVHDCVNIPGNYRCTCYDGFHLAHDGHNCLDVDECAEGNGGCQQSCVNMMGSYECHCREGFFLSDNQHTCIQRPEEGMNCMNKNHGCAHICRETPKGGIACECRPGFELTKNQRDCKLTCNYGNGGCQHTCDDTEQGPRCGCHIKFVLHTDGKTCIETCAVNNGGCDSKCHDAATGVHCTCPVGFMLQPDRKTCKDIDECRLNNGGCDHICRNTVGSFECSCKKGYKLLINERNCQDIDECSFDRTCDHICVNTPGSFQCLCHRGYLLYGITHCGDVDECSINRGGCRFGCINTPGSYQCTCPAGQGRLHWNGKDCTEPLKCQGSPGASKAMLSCNRSGKKDTCALTCPSRARFLPESENGFTVSCGTPSPRAAPARAGHNGNSTNSNHCHEAAVLSIKQRASFKIKDAKCRLHLRNKGKTEEAGRITGPGGAPCSECQVTFIHLKCDSSRKGKGRRARTPPGKEVTRLTLELEAEVRAEETTASCGLPCLRQRMERRLKGSLKMLRKSINQDRFLLRLAGLDYELAHKPGLVAGERAEPMESCRPGQHRAGTKCVSCPQGTYYHGQTEQCVPCPAGTFQEREGQLSCDLCPGSDAHGPLGATNVTTCAGQCPPGQHSVDGFKPCQPCPRGTYQPEAGRTLCFPCGGGLTTKHEGAISFQDCDTKVQCSPGHYYNTSIHRCIRCAMGSYQPDFRQNFCSRCPGNTSTDFDGSTSVAQCKNRQCGGELGEFTGYIESPNYPGNYPAGVECIWNINPPPKRKILIVVPEIFLPSEDECGDVLVMRKNSSPSSITTYETCQTYERPIAFTARSRKLWINFKTSEANSARGFQIPYVTYDEDYEQLVEDIVRDGRLYASENHQEILKDKKLIKAFFEVLAHPQNYFKYTEKHKEMLPKSFIKLLRSKVSSFLRPYK,993,NP_689966.2.csv,refseq-SCUBE3-NM_152753.3_clinical_seed_0_final,refseq-SCUBE3-NM_152753.3.a2m,Invitae,refseq-SCUBE3-NM_152753.3.npy,1,993,993
+NP_689967.2,MNANKDERLKARSQDFHLFPALMMLSMTMLFLPVTGTLKQNIPRLKLTYKDLLLSNSCIPFLGSSEGLDFQTLLLDEERGRLLLGAKDHIFLLSLVDLNKNFKKIYWPAAKERVELCKLAGKDANTECANFIRVLQPYNKTHIYVCGTGAFHPICGYIDLGVYKEDIIFKLDTHNLESGRLKCPFDPQQPFASVMTDEYLYSGTASDFLGKDTAFTRSLGPTHDHHYIRTDISEHYWLNGAKFIGTFFIPDTYNPDDDKIYFFFRESSQEGSTSDKTILSRVGRVCKNDVGGQRSLINKWTTFLKARLICSIPGSDGADTYFDELQDIYLLPTRDERNPVVYGVFTTTSSIFKGSAVCVYSMADIRAVFNGPYAHKESADHRWVQYDGRIPYPRPGTCPSKTYDPLIKSTRDFPDDVISFIKRHSVMYKSVYPVAGGPTFKRINVDYRLTQIVVDHVIAEDGQYDVMFLGTDIGTVLKVVSISKEKWNMEEVVLEELQIFKHSSIILNMELSLKQQQLYIGSRDGLVQLSLHRCDTYGKACADCCLARDPYCAWDGNACSRYAPTSKRRARRQDVKYGDPITQCWDIEDSISHETADEKVIFGIEFNSTFLECIPKSQQATIKWYIQRSGDEHREELKPDERIIKTEYGLLIRSLQKKDSGMYYCKAQEHTFIHTIVKLTLNVIENEQMENTQRAEHEEGKVKDLLAESRLRYKDYIQILSSPNFSLDQYCEQMWHREKRRQRNKGGPKWKHMQEMKKKRNRRHHRDLDELPRAVAT,777,NP_689967.2.csv,refseq-SEMA3D-NM_152754.2_clinical_seed_0_final,refseq-SEMA3D-NM_152754.2.a2m,Invitae,refseq-SEMA3D-NM_152754.2.npy,1,777,777
+NP_689991.1,MAGLRNESEQEPLLGDTPGSREWDILETEEHYKSRWRSIRILYLTMFLSSVGFSVVMMSIWPYLQKIDPTADTSFLGWVIASYSLGQMVASPIFGLWSNYRPRKEPLIVSILISVAANCLYAYLHIPASHNKYYMLVARGLLGIGAGNVAVVRSYTAGATSLQERTSSMANISMCQALGFILGPVFQTCFTFLGEKGVTWDVIKLQINMYTTPVLLSAFLGILNIILILAILREHRVDDSGRQCKSINFEEASTDEAQVPQGNIDQVAVVAINVLFFVTLFIFALFETIITPLTMDMYAWTQEQAVLYNGIILAALGVEAVVIFLGVKLLSKKIGERAILLGGLIVVWVGFFILLPWGNQFPKIQWEDLHNNSIPNTTFGEIIIGLWKSPMEDDNERPTGCSIEQAWCLYTPVIHLAQFLTSAVLIGLGYPVCNLMSYTLYSKILGPKPQGVYMGWLTASGSGARILGPMFISQVYAHWGPRWAFSLVCGIIVLTITLLGVVYKRLIALSVRYGRIQE,518,NP_689991.1.csv,refseq-MFSD8-NM_152778.2_clinical_seed_0_final,refseq-MFSD8-NM_152778.2.a2m,Invitae,refseq-MFSD8-NM_152778.2.npy,1,518,518
+NP_689996.4,MLPRRPLAWPAWLLRGAPGAAGSWGRPVGPLARRGCCSAPGTPEVPLTRERYPVRRLPFSTVSKQDLAAFERIVPGGVVTDPEALQAPNVDWLRTLRGCSKVLLRPRTSEEVSHILRHCHERNLAVNPQGGNTGMVGGSVPVFDEIILSTARMNRVLSFHSVSGILVCQAGCVLEELSRYVEERDFIMPLDLGAKGSCHIGGNVATNAGGLRFLRYGSLHGTVLGLEVVLADGTVLDCLTSLRKDNTGYDLKQLFIGSEGTLGIITTVSILCPPKPRAVNVAFLGCPGFAEVLQTFSTCKGMLGEILSAFEFMDAVCMQLVGRHLHLASPVQESPFYVLIETSGSNAGHDAEKLGHFLEHALGSGLVTDGTMATDQRKVKMLWALRERITEALSRDGYVYKYDLSLPVERLYDIVTDLRARLGPHAKHVVGYGHLGDGNLHLNVTAEAFSPSLLAALEPHVYEWTAGQQGSVSAEHGVGFRKRDVLGYSKPPGALQLMQQLKALLDPKGILNPYKTLPSQA,521,NP_689996.4.csv,refseq-D2HGDH-NM_152783.4_clinical_seed_0_final,refseq-D2HGDH-NM_152783.4.a2m,Invitae,refseq-D2HGDH-NM_152783.4.npy,1,521,521
+NP_690870.3,MCIIFFKFDPRPVSKNAYRLILAANRDEFYSRPSKLADFWGNNNEILSGLDMEEGKEGGTWLGISTRGKLAALTNYLQPQLDWQARGRGELVTHFLTTDVDSLSYLKKVSMEGHLYNGFNLIAADLSTAKGDVICYYGNRGEPDPIVLTPGTYGLSNALLETPWRKLCFGKQLFLEAVERSQALPKDVLIASLLDVLNNEEAQLPDPAIEDQGGEYVQPMLSKYAAVCVRCPGYGTRTNTIILVDADGHVTFTERSMMDKDLSHWETRTYEFTLQS,276,NP_690870.3.csv,refseq-TANGO2-NM_152906.6_clinical_seed_0_final,refseq-TANGO2-NM_152906.6.a2m,Invitae,refseq-TANGO2-NM_152906.6.npy,1,276,276
+NP_694551.1,MATALMAVVLRAAAVAPRLRGRGGTGGARRLSCGARRRAARGTSPGRRLSTAWSQPQPPPEEYAGADDVSQSPVAEEPSWVPSPRPPVPHESPEPPSGRSLVQRDIQAFLNQCGASPGEARHWLTQFQTCHHSADKPFAVIEVDEEVLKCQQGVSSLAFALAFLQRMDMKPLVVLGLPAPTAPSGCLSFWEAKAQLAKSCKVLVDALRHNAAAAVPFFGGGSVLRAAEPAPHASYGGIVSVETDLLQWCLESGSIPILCPIGETAARRSVLLDSLEVTASLAKALRPTKIIFLNNTGGLRDSSHKVLSNVNLPADLDLVCNAEWVSTKERQQMRLIVDVLSRLPHHSSAVITAASTLLTELFSNKGSGTLFKNAERMLRVRSLDKLDQGRLVDLVNASFGKKLRDDYLASLRPRLHSIYVSEGYNAAAILTMEPVLGGTPYLDKFVVSSSRQGQGSGQMLWECLRRDLQTLFWRSRVTNPINPWYFKHSDGSFSNKQWIFFWFGLADIRDSYELVNHAKGLPDSFHKPASDPGS,534,NP_694551.1.csv,refseq-NAGS-NM_153006.2_clinical_seed_0_final,refseq-NAGS-NM_153006.2.a2m,Invitae,refseq-NAGS-NM_153006.2.npy,1,534,534
+NP_694571.2,MPLEMEPKMSKLAFGCQRSSTSDDDSGCALEEYAWVPPGLRPEQIQLYFACLPEEKVPYVNSPGEKHRIKQLLYQLPPHDNEVRYCQSLSEEEKKELQVFSAQRKKEALGRGTIKLLSRAVMHAVCEQCGLKINGGEVAVFASRAGPGVCWHPSCFVCFTCNELLVDLIYFYQDGKIHCGRHHAELLKPRCSACDEIIFADECTEAEGRHWHMKHFCCLECETVLGGQRYIMKDGRPFCCGCFESLYAEYCETCGEHIGVDHAQMTYDGQHWHATEACFSCAQCKASLLGCPFLPKQGQIYCSKTCSLGEDVHASDSSDSAFQSARSRDSRRSVRMGKSSRSADQCRQSLLLSPALNYKFPGLSGNADDTLSRKLDDLSLSRQGTSFASEEFWKGRVEQETPEDPEEWADHEDYMTQLLLKFGDKSLFQPQPNEMDIRASEHWISDNMVKSKTELKQNNQSLASKKYQSDMYWAQSQDGLGDSAYGSHPGPASSRRLQELELDHGASGYNHDETQWYEDSLECLSDLKPEQSVRDSMDSLALSNITGASVDGENKPRPSLYSLQNFEEMETEDCEKMSNMGTLNSSMLHRSAESLKSLSSELCPEKILPEEKPVHLPVLRRSKSQSRPQQVKFSDDVIDNGNYDIEIRQPPMSERTRRRVYNFEERGSRSHHHRRRRSRKSRSDNALNLVTERKYSPKDRLRLYTPDNYEKFIQNKSAREIQAYIQNADLYGQYAHATSDYGLQNPGMNRFLGLYGEDDDSWCSSSSSSSDSEEEGYFLGQPIPQPRPQRFAYYTDDLSSPPSALPTPQFGQRTTKSKKKKGHKGKNCIIS,831,NP_694571.2.csv,refseq-PRICKLE1-NM_153026.2_clinical_seed_0_final,refseq-PRICKLE1-NM_153026.2.a2m,Invitae,refseq-PRICKLE1-NM_153026.2.npy,1,831,831
+NP_694944.1,MNWAFLQGLLSGVNKYSTVLSRIWLSVVFIFRVLVYVVAAEEVWDDEQKDFVCNTKQPGCPNVCYDEFFPVSHVRLWALQLILVTCPSLLVVMHVAYREERERKHHLKHGPNAPSLYDNLSKKRGGLWWTYLLSLIFKAAVDAGFLYIFHRLYKDYDMPRVVACSVEPCPHTVDCYISRPTEKKVFTYFMVTTAAICILLNLSEVFYLVGKRCMEIFGPRHRRPRCRECLPDTCPPYVLSQGGHPEDGNSVLMKAGSAPVDAGGYP,266,NP_694944.1.csv,refseq-GJB4-NM_153212.2_clinical_seed_0_final,refseq-GJB4-NM_153212.2.a2m,Invitae,refseq-GJB4-NM_153212.2.npy,1,266,266
+NP_694955.2,MVSKSDQLLIVVSILEGRHFPKRPKHMLVVEAKFDGEQLATDPVDHTDQPEFATELAWEIDRKALHQHRLQRTPIKLQCFALDPVTSAKETIGYIVLDLRTAQETKQAPKWYQLLSNKYTKFKSEIQISIALETDTKPPVDSFKAKGAPPRDGKVPAILAGLDPRDIVAVLNEEGGYHQIGPAEYCTDSFIMSVTIAFATQLEQLIPCTMKLPERQPEFFFYYSLLGNDVTNEPFNDLINPNFEPERASVRIRSSVEILRVYLALQSKLQIHLCCGDQSLGSTEIPLTGLLKKGSTEINQHPVTVEGAFTLDPPNRAKQKLAPIPVELAPTVGVSVALQREGIDSQSLIELKTQNEHEPEHSKKKVLTPIKEKTLTGPKSPTVSPVPSHNQSPPTKDDATESEVESLQYDKDTKPNPKASSSVPASLAQLVTTSNASEVASGQKIAVPATSHHFCFSIDLRSIHALEIGFPINCILRYSYPFFGSAAPIMTNPPVEVRKNMEVFLPQSYCAFDFATMPHQLQDTFLRIPLLVELWHKDKMSKDLLLGIARIQLSNILSSEKTRFLGSNGEQCWRQTYSESVPVIAAQGSNNRIADLSYTVTLEDYGLVKMREIFISDSSQGVSAVQQKPSSLPPAPCPSEIQTEPRETLEYKAALELEMWKEMQEDIFENQLKQKELAHMQALAEEWKKRDRERESLVKKKVAEYTILEGKLQKTLIDLEKREQQLASVESELQREKKELQSERQRNLQELQDSIRRAKEDCIHQVELERLKIKQLEEDKHRLQQQLNDAENKYKILEKEFQQFKDQQNNKPEIRLQSEINLLTLEKVELERKLESATKSKLHYKQQWGRALKELARLKQREQESQMARLKKQQEELEQMRLRYLAAEEKDTVKTERQELLDIRNELNRLRQQEQKQYQDSTEIASGKKDGPHGSVLEEGLDDYLTRLIEERDTLMRTGVYNHEDRIISELDRQIREILAKSNASN,986,NP_694955.2.csv,refseq-CEP120-NM_153223.3_clinical_seed_0_final,refseq-CEP120-NM_153223.3.a2m,Invitae,refseq-CEP120-NM_153223.3.npy,1,986,986
+NP_694972.3,MGTASSLVSPAGGEVIEDTYGAGGGEACEIPVEVKPKARLLRNSFRRGAGAAAGAGPGSLPRGVGAGGLLGASFKSTGSSVPELEYAAAEYERLRKEYEIFRVSKNQELLSMGRREAKLDTENKRLRAELQALQKTYQKILREKESALEAKYQAMERAATFEHDRDKVKRQFKIFRETKENEIQDLLRAKRELESKLQRLQAQGIQVFDPGESDSDDNCTDVTAAGTQCEYWTGGALGSEPSIGSMIQLQQSFRGPEFAHSSIDVEGPFANVNRDDWDIAVASLLQVTPLFSHSLWSNTVRCYLIYTDETQPEMDLFLKDYSPKLKRMCETMGYFFHAVYFPIDVENQYLTVRKWEIEKSSLVILFIHLTLPSLLLEDCEEAFLKNPEGKPRLIFHRLEDGKVSSDSVQQLIDQVSNLNKTSKAKIIDHSGDPAEGVYKTYICVEKIIKQDILGFENTDLETKDLGSEDSIPEEDDFGDVLWDIHDEQEQMETFQQASNSAHELGFEKYYQRLNDLVAAPAPIPPLLVSGGPGSGKSLLLSKWIQLQQKNSPNTLILSHFVGRPMSTSSESSLIIKRLTLKLMQHSWSVSALTLDPAKLLEEFPRWLEKLSARHQGSIIIVIDSIDQVQQVEKHMKWLIDPLPVNVRVIVSVNVETCPPAWRLWPTLHLDPLSPKDAKSIIIAECHSVDIKLSKEQEKKLERHCRSATTCNALYVTLFGKMIARAGRAGNLDKILHQCFQCQDTLSLYRLVLHSIRESMANDVDKELMKQILCLVNVSHNGVSESELMELYPEMSWTFLTSLIHSLYKMCLLTYGCGLLRFQHLQAWETVRLEYLEGPTVTSSYRQKLINYFTLQLSQDRVTWRSADELPWLFQQQGSKQKLHDCLLNLFVSQNLYKRGHFAELLSYWQFVGKDKSAMATEYFDSLKQYEKNCEGEDNMSCLADLYETLGRFLKDLGLLSQAIVPLQRSLEIRETALDPDHPRVAQSLHQLASVYVQWKKFGNAEQLYKQALEISENAYGADHPYTARELEALATLYQKQNKYEQAEHFRKKSFKIHQKAIKKKGNLYGFALLRRRALQLEELTLGKDTPDNARTLNELGVLYYLQNNLETADQFLKRSLEMRERVLGPDHPDCAQSLNNLAALCNEKKQYDKAEELYERALDIRRRALAPDHPSLAYTVKHLAILYKKMGKLDKAVPLYELAVEIRQKSFGPKHPSVATALVNLAVLYSQMKKHVEALPLYERALKIYEDSLGRMHPRVGETLKNLAVLSYEGGDFEKAAELYKRAMEIKEAETSLLGGKAPSRHSSSGDTFSLKTAHSPNVFLQQGQR,1330,NP_694972.3.csv,refseq-NPHP3-NM_153240.4_clinical_seed_0_final,refseq-NPHP3-NM_153240.4.a2m,Invitae,refseq-NPHP3-NM_153240.4.npy,1,1330,1330
+NP_694984.5,MAAAPTQIEAELYYLIARFLQSGPCNKSAQVLVQELEEHQLIPRRLDWEGKEHRRSFEDLVAANAHIPPDYLLKICERIGPLLDKEIPQSVPGVQTLLGVGRQSLLRDAKDCKSTLWNGSAFAALHRGRPPELPVNYVKPPNVVNITSARQLTGCSRFGHIFPSSAYQHIKMHKRILGHLSSVYCVAFDRSGRRIFTGSDDCLVKIWATDDGRLLATLRGHSAEISDMAVNYENTLIAAGSCDKVVRVWCLRTCAPVAVLQGHSASITSIQFCPSTKGTNRYLTSTGADGTICFWQWHVKTMKFRDRPVKFTERSRPGVQISCSSFSSGGMFITTGSTDHVIRIYYLGSEVPEKIAELESHTDKVVAVQFCNNGDSLRFVSGSRDGTARIWQYQQQEWKSIVLDMATKMTGNNLPSGEDKITKLKVTMVAWDRYDTTVITAVNNFLLKVWNSITGQLLHTLSGHDDEVFVLEAHPFDQRIILSAGHDGNIFIWDLDRGTKIRNYFNMIEGQGHGAVFDCKFSPDGNHFACTDSHGHLLLFGFGCSKYYEKIPDQMFFHTDYRPLIRDANNYVLDEQTQQAPHLMPPPFLVDVDGNPHPTKFQRLVPGRENCKDEQLIPQLGYVANGDGEVVEQVIGQQTNDQDESILDGIIRELQREQDLRLINEGDVPHLPVNRAYSVNGALRSPNMDISSSPNIRLRRHSSQIEGVRQMHNNAPRSQMATERDLMAWSRRVVVNELNNGVSRVQEECRTAKGDIEISLYTVEKKKKPSYTTQRNDYEPSCGRSLRRTQRKRQHTYQTRSNIEHNSQASCQNSGVQEDSDSSSEEDETVGTSDASVEDPVVEWQSESSSSDSSSEYSDWTADAGINLQPPKRQTRQTTRKICSSSDEENLKSLEERQKKPKQTRKKKGGLVSIAGEPNEEWFAPQWILDTIPRRSPFVPQMGDELIYFRQGHEAYVRAVRKSKIYSVNLQKQPWNKMDLREQEFVKIVGIKYEVGPPTLCCLKLAFLDPISGKMTGESFSIKYHDMPDVIDFLVLHQFYNEAKERNWQIGDRFRSIIDDAWWFGTVESQQPFQPEYPDSSFQCYSVHWDNNEREKMSPWDMEPIPEGTAFPDEVGAGVPVSQEELTALLYKPQEGEWGAHSRDEECERVIQGINHLLSLDFASPFAVPVDLSAYPLYCTVVAYPTDLNTIRRRLENRFYRRISALMWEVRYIEHNARTFNEPDSPIVKAAKIVTDVLLRFIGDQSCTDILDTYNKIKAEERNSTDAEEDTEIVDLDSDGPGTSSGRKVKCRGRRQSLKCNPDAWKKQCKELLSLIYEREDSEPFRQPADLLSYPGHQEQEGESSESVVPERQQDSSLSEDYQDVIDTPVDFSTVKETLEAGNYGSPLEFYKDVRQIFNNSKAYTSNKKSRIYSMMLRLSALFESHIKNIISEYKSAIQSQKRRRPRYRKRLRSSSSSLSSSGAPSPKGKQKQMKLQPKNDQNTSVSHARTSSPFSSPVSDAAEGLSLYLLDDEPDGPFSSSSFGGYSRSGNSHDPGKAKSFRNRVLPVKQDHSLDGPLTNGDGREPRTGIKRKLLSASEEDENMGGEDKEKKETKEKSHLSTSESGELGSSLSSESTCGSDSDSESTSRTDQDYVDGDHDYSKFIQTRPKRKLRKQHGNGKRNWKTRGTGGRGRWGRWGRWSRGGRGRGGRGRGSRGRGGGGTRGRGRGRGGRGASRGATRAKRARIADDEFDTMFSGRFSRLPRIKTRNQGRRTVLYNDDSDNDNFVSTEDPLNLGTSRSGRVRKMTEKARVSHLMGWNY,1802,NP_694984.5.csv,refseq-BRWD3-NM_153252.5_clinical_seed_0_final,refseq-BRWD3-NM_153252.5.a2m,Invitae,refseq-BRWD3-NM_153252.5_theta_0.2.npy,1,1802,1802
+NP_705690.2,MARLLRSATWELFPWRGYCSQKAKGELCRDFVEALKAVVGGSHVSTAAVVREQHGRDESVHRCEPPDAVVWPQNVEQVSRLAALCYRQGVPIIPFGTGTGLEGGVCAVQGGVCVNLTHMDRILELNQEDFSVVVEPGVTRKALNAHLRDSGLWFPVDPGADASLCGMAATGASGTNAVRYGTMRDNVLNLEVVLPDGRLLHTAGRGRHFRFGFWPEIPHHTAWYSPCVSLGRRKSAAGYNLTGLFVGSEGTLGLITATTLRLHPAPEATVAATCAFPSVQAAVDSTVHILQAAVPVARIEFLDEVMMDACNRYSKLNCLVAPTLFLEFHGSQQALEEQLQRTEEIVQQNGASDFSWAKEAEERSRLWTARHNAWYAALATRPGCKGYSTDVCVPISRLPEIVVQTKEDLNASGLTGSIVGHVGDGNFHCILLVNPDDAEELGRVKAFAEQLGRRALALHGTCTGEHGIGMGKRQLLQEEVGAVGVETMRQLKAVLDPQGLMNPGKVL,507,NP_705690.2.csv,refseq-LDHD-NM_153486.3_clinical_seed_0_final,refseq-LDHD-NM_153486.3.a2m,Invitae,refseq-LDHD-NM_153486.3_theta_0.2.npy,1,507,507
+NP_705831.1,MDFSKFLADDFDVKEWINAAFRAGSKEAASGKADGHAATLVMKLQLFIQEVNHAVEETSHQALQNMPKVLRDVEALKQEASFLKEQMILVKEDIKKFEQDTSQSMQVLVEIDQVKSRMQLAAESLQEADKWSTLSADIEETFKTQDIAVISAKLTGMQNSLMMLVDTPDYSEKCVHLEALKNRLEALASPQIVAAFTSQAVDQSKVFVKVFTEIDRMPQLLAYYYKCHKVQLLAAWQELCQSDLSLDRQLTGLYDALLGAWHTQIQWATQVFQKPHEVVMVLLIQTLGALMPSLPSCLSNGVERAGPEQELTRLLEFYDATAHFAKGLEMALLPHLHEHNLVKVTELVDAVYDPYKPYQLKYGDMEESNLLIQMSAVPLEHGEVIDCVQELSHSVNKLFGLASAAVDRCVRFTNGLGTCGLLSALKSLFAKYVSDFTSTLQSIRKKCKLDHIPPNSLFQEDWTAFQNSIRIIATCGELLRHCGDFEQQLANRILSTAGKYLSDSCSPRSLAGFQESILTDKKNSAKNPWQEYNYLQKDNPAEYASLMEILYTLKEKGSSNHNLLAAPRAALTRLNQQAHQLAFDSVFLRIKQQLLLISKMDSWNTAGIGETLTDELPAFSLTPLEYISNIGQYIMSLPLNLEPFVTQEDSALELALHAGKLPFPPEQGDELPELDNMADNWLGSIARATMQTYCDAILQIPELSPHSAKQLATDIDYLINVMDALGLQPSRTLQHIVTLLKTRPEDYRQVSKGLPRRLATTVATMRSVNY,770,NP_705831.1.csv,refseq-COG7-NM_153603.3_clinical_seed_0_final,refseq-COG7-NM_153603.3.a2m,Invitae,refseq-COG7-NM_153603.3.npy,1,770,770
+NP_705902.2,MRRLGPFHPRVHWAAPPSLSSGLHRLLFLRGTRIPSSTTLSPPRHDSLSLDGGTVNPPRVREPTGREAFGPSPASSDWLPARWRNGRGGRPRARLCSGWTAAEEARRNPTLGGLLGRQRLLLRMGGGRLGAPMERHGRASATSVSSAGEQAAGDPEGRRQEPLRRRASSASVPAVGASAEGTRRDRLGSYSGPTSVSRQRVESLRKKRPLFPWFGLDIGGTLVKLVYFEPKDITAEEEEEEVESLKSIRKYLTSNVAYGSTGIRDVHLELKDLTLCGRKGNLHFIRFPTHDMPAFIQMGRDKNFSSLHTVFCATGGGAYKFEQDFLTIGDLQLCKLDELDCLIKGILYIDSVGFNGRSQCYYFENPADSEKCQKLPFDLKNPYPLLLVNIGSGVSILAVYSKDNYKRVTGTSLGGGTFFGLCCLLTGCTTFEEALEMASRGDSTKVDKLVRDIYGGDYERFGLPGWAVASSFGNMMSKEKREAVSKEDLARATLITITNNIGSIARMCALNENINQVVFVGNFLRINTIAMRLLAYALDYWSKGQLKALFSEHEGYFGAVGALLELLKIP,570,NP_705902.2.csv,refseq-PANK2-NM_153638.2_clinical_seed_0_final,refseq-PANK2-NM_153638.2.a2m,Invitae,refseq-PANK2-NM_153638.2.npy,1,570,570
+NP_714544.1,MALSLWPLLLLLLLLLLLSFAVTLAPTGPHSLDPGLSFLKSLLSTLDQAPQGSLSRSRFFTFLANISSSFEPGRMGEGPVGEPPPLQPPALRLHDFLVTLRGSPDWEPMLGLLGDMLALLGQEQTPRDFLVHQAGVLGGLVEVLLGALVPGGPPTPTRPPCTRDGPSDCVLAADWLPSLLLLLEGTRWQALVQVQPSVDPTNATGLDGREAAPHFLQGLLGLLTPTGELGSKEALWGGLLRTVGAPLYAAFQEGLLRVTHSLQDEVFSILGQPEPDTNGQCQGGNLQQLLLWGVRHNLSWDVQALGFLSGSPPPPPALLHCLSTGVPLPRASQPSAHISPRQRRAITVEALCENHLGPAPPYSISNFSIHLLCQHTKPATPQPHPSTTAICQTAVWYAVSWAPGAQGWLQACHDQFPDEFLDAICSNLSFSALSGSNRRLVKRLCAGLLPPPTSCPEGLPPVPLTPDIFWGCFLENETLWAERLCGEASLQAVPPSNQAWVQHVCQGPTPDVTASPPCHIGPCGERCPDGGSFLVMVCANDTMYEVLVPFWPWLAGQCRISRGGNDTCFLEGLLGPLLPSLPPLGPSPLCLTPGPFLLGMLSQLPRCQSSVPALAHPTRLHYLLRLLTFLLGPGAGGAEAQGMLGRALLLSSLPDNCSFWDAFRPEGRRSVLRTIGEYLEQDEEQPTPSGFEPTVNPSSGISKMELLACFSPVLWDLLQREKSVWALQILVQAYLHMPPENLQQLVLSAEREAAQGFLTLMLQGKLQGKLQVPPSEEQALGRLTALLLQRYPRLTSQLFIDLSPLIPFLAVSDLMRFPPSLLANDSVLAAIRDYSPGMRPEQKEALAKRLLAPELFGEVPAWPQELLWAVLPLLPHLPLENFLQLSPHQIQALEDSWPAAGLGPGHARHVLRSLVNQSVQDGEEQVRRLGPLACFLSPEELQSLVPLSDPTGPVERGLLECAANGTLSPEGRVAYELLGVLRSSGGAVLSPRELRVWAPLFSQLGLRFLQELSEPQLRAMLPVLQGTSVTPAQAVLLLGRLLPRHDLSLEELCSLHLLLPGLSPQTLQAIPRRVLVGACSCLAPELSRLSACQTAALLQTFRVKDGVKNMGTTGAGPAVCIPGQPIPTTWPDCLLPLLPLKLLQLDSLALLANRRRYWELPWSEQQAQFLWKKMQVPTNLTLRNLQALGTLAGGMSCEFLQQINSMVDFLEVVHMIYQLPTRVRGSLRACIWAELQRRMAMPEPEWTTVGPELNGLDSKLLLDLPIQLMDRLSNESIMLVVELVQRAPEQLLALTPLHQAALAERALQNLAPKETPVSGEVLETLGPLVGFLGTESTRQIPLQILLSHLSQLQGFCLGETFATELGWLLLQESVLGKPELWSQDEVEQAGRLVFTLSTEAISLIPREALGPETLERLLEKQQSWEQSRVGQLCREPQLAAKKAALVAGVVRPAAEDLPEPVPNCADVRGTFPAAWSATQIAEMELSDFEDCLTLFAGDPGLGPEELRAAMGKAKQLWGPPRGFRPEQILQLGRLLIGLGDRELQELILVDWGVLSTLGQIDGWSTTQLRIVVSSFLRQSGRHVSHLDFVHLTALGYTLCGLRPEELQHISSWEFSQAALFLGTLHLQCSEEQLEVLAHLLVLPGGFGPISNWGPEIFTEIGTIAAGIPDLALSALLRGQIQGVTPLAISVIPPPKFAVVFSPIQLSSLTSAQAVAVTPEQMAFLSPEQRRAVAWAQHEGKESPEQQGRSTAWGLQDWSRPSWSLVLTISFLGHLL,1775,NP_714544.1.csv,refseq-STRC-NM_153700.2_clinical_seed_0_final,refseq-STRC-NM_153700.2.a2m,Invitae,refseq-STRC-NM_153700.2_theta_0.2.npy,1,1775,1775
+NP_714915.3,MATRGGAGVAMAVWSLLSARAVTAFLLLFLPRFLQAQTFSFPFQQPEKCDNNQYFDISALSCVPCGANQRQDARGTSCVCLPGFQMISNNGGPAIICKKCPENMKGVTEDGWNCISCPSDLTAEGKCHCPIGHILVERDINGTLLSQATCELCDGNENSFMVVNALGDRCVRCEPTFVNTSRSCACSEPNILTGGLCFSSTGNFPLRRISAARYGEVGMSLTSEWFAKYLQSSAAACWVYANLTSCQALGNMCVMNMNSYDFATFDACGLFQFIFENTAGLSTVHSISFWRQNLPWLFYGDQLGLAPQVLSSTSLPTNFSFKGENQNTKLKFVAASYDIRGNFLKWQTLEGGVLQLCPDTETRLNAAYSFGTTYQQNCEIPISKILIDFPTPIFYDVYLEYTDENQHQYILAVPVLNLNLQHNKIFVNQDSNSGKWLLTRRIFLVDAVSGRENDLGTQPRVIRVATQISLSVHLVPNTINGNIYPPLITIAYSDIDIKDANSQSVKVSFSVTYEMDHGEAHVQTDIALGVLGGLAVLASLLKTAGWKRRIGSPMIDLQTVVKFLVYYAGDLANVFFIITVGTGLYWLIFFKAQKSVSVLLPMPIQEERFVTYVGCAFALKALQFLHKLISQITIDVFFIDWERPKGKVLKAVEGEGGVRSATVPVSIWRTYFVANEWNEIQTVRKINSLFQVLTVLFFLEVVGFKNLALMDSSSSLSRNPPSYIAPYSCILRYAVSAALWLAIGIIQVVFFAVFYERFIEDKIRQFVDLCSMSNISVFLLSHKCFGYYIHGRSVHGHADTNMEEMNMNLKREAENLCSQRGLVPNTDGQTFEIAISNQMRQHYDRIHETLIRKNGPARLLSSSASTFEQSIKAYHMMNKFLGSFIDHVHKEMDYFIKDKLLLERILGMEFMEPMEKSIFYNDEGYSFSSVLYYGNEATLLIFDLLFFCVVDLACQNFILASFLTYLQQEIFRYIRNTVGQKNLASKTLVDQRFLI,995,NP_714915.3.csv,refseq-TMEM67-NM_153704.5_clinical_seed_0_final,refseq-TMEM67-NM_153704.5.a2m,Invitae,refseq-TMEM67-NM_153704.5.npy,1,995,995
+NP_714928.1,MARGGAACKSDARLLLGRDALRPAPALLAPAVLLGAALGLGLGLWLGCRAGRQRTRHQKDDTQNLLKNLESNAQTPSETGSPSRRRKREVQMSKDKEAVDECEPPSNSNITAFALKAKVIYPINQKFRPLADGSSNPSLHENLKQAVLPHQPVEASPSSSLGSLSQGEKDDCSSSSSVHSATSDDRFLSRTFLRVNAFPEVLACESVDVDLCIYSLHLKDLLHLDTALRQEKHMMFIQIFKMCLLDLLPKKKSDDELYQKILSKQEKDLEELEKGLQVKLSNTEMSGAGDSEYITLADVEKKEREYSEQLIDNMEAFWKQMANIQHFLVDQFKCSSSKARQLMMTLTERMIAAEGLLCDSQELQALDALERTMGRAHMAKVIEFLKLQVQEETRCRLAAISHGLELLAGEGKLSGRQKEELLTQQHKAFWQEAERFSREFVQRGKDLVTASLAHQVEGTAKLTLAQEEEQRSFLAEAQPTADPEKFLEAFHEVLERQRLMQCDLEEEENVRATEAVVALCQELYFSTVDTFQKFVDALFLQTLPGMTGLPPEECDYLRQEVQENAAWQLGKSNRFRRQQWKLFQELLEQDQQVWMEECALSSVLQTHLREDHEGTIRGVLGRLGGLTEESTRCVLQGHDLLLRSALRRLALRGNALATLTQMRLSGKKHLLQELREQRALEQGSSQCLDEHQWQLLRALEARVLEEASRLEEEAQQTRLQLQQRLLAEAQEVGQLLQQHMECAIGQALLVHARNAATKSRAKDRDDFKRTLMEAAVESVYVTSAGVSRLVQAYYQQIGRIMEDHEERKLQHLKTLQGERMENYKLRKKQELSNPSSGSRTAGGAHETSQAVHQRMLSQQKRFLAQFPVHQQMRLHAQQQQAGVMDLLEAQLETQLQEAEQNFISELAALARVPLAESKLLPAKRGLLEKPLRTKRKKPLPQERGDLGVPNNEDLASGDQTSGSLSSKRLSQQESEAGDSGNSKKMLKRRSNL,992,NP_714928.1.csv,refseq-EVC-NM_153717.2_clinical_seed_0_final,refseq-EVC-NM_153717.2.a2m,Invitae,refseq-EVC-NM_153717.2.npy,1,992,992
+NP_722540.1,MAPAAASPPEVIRAAQKDEYYRGGLRSAAGGALHSLAGARKWLEWRKEVELLSDVAYFGLTTLAGYQTLGEEYVSIIQVDPSRIHVPSSLRRGVLVTLHAVLPYLLDKALLPLEQELQADPDSGRPLQGSLGPGGRGCSGARRWMRHHTATLTEQQRRALLRAVFVLRQGLACLQRLHVAWFYIHGVFYHLAKRLTGITYQALRPDPLRVLMSVAPSALQLRVRSLPGEDLRARVSYRLLGVISLLHLVLSMGLQLYGFRQRQRARKEWRLHRGLSHRRASLEERAVSRNPLCTLCLEERRHPTATPCGHLFCWECITAWCSSKAECPLCREKFPPQKLIYLRHYR,346,NP_722540.1.csv,refseq-PEX10-NM_153818.1_clinical_seed_0_final,refseq-PEX10-NM_153818.1.a2m,Invitae,refseq-PEX10-NM_153818.1.npy,1,346,346
+NP_722541.1,MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRMFLMMHPWYIPSSQLAAKLLHIYQQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDIDSVPTYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNTLMAVVGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFRFPILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPPEIREEEVQTVEDGVFDIHL,609,NP_722541.1.csv,refseq-RASGRP2-NM_153819.1_clinical_seed_0_final,refseq-RASGRP2-NM_153819.1.a2m,Invitae,refseq-RASGRP2-NM_153819.1.npy,1,609,609
+NP_733764.1,MFRLGLLVCFYNDLELLDATVAQVLLYQMIKCSHLRGFQAGVQKLKAELLDIAMENQTLNETLGSLSDAVVGLTYSQLESLSPEAVHGAISTLNQVSGWAKSQVIILSAKYLAHEKVLSFYNVSQMGALLAGVSTQAFCSMKRKDISQVLRSAVSQYVSDLSPAQQQGILSKMVQAEDTAPGIVEIQGAFFKEVSLFDLRRQPGFNSTVLKDKELGRSQALFLYELLLKTTRRPEELLSAGQLVKGVTCSHIDAMSTDFFLAHFQDFQNNFALLSPYQVNCLAWKYWEVSRLSMPPFLLAALPARYLASVPASQCVPFLISLGKSWLDSLVLDSHKKTSVLRKVQQCLDDSIADEYTVDIMGNLLCHLPAAIIDRGISPRAWATALHGLRDCPDLNPEQKAAVRLKLLGQYGLPQHWTAETTKDLGPFLVLFSGDELSSIATKFPEILLQAASKMARTLPTKEFLWAVFQSVRNSSDKIPSYDPMPGCHGVVAPSSDDIFKLAEANACWALEDLRCMEEDTFIRTVELLGAVQGFSRPQLMTLKEKAIQVWDMPSYWREHHIVSLGRIALALNESELEQLDLSSIDTVASLSWQTEWTPGQAESILQGYLDDSGYSIQDLKSFHLVGLGATLCAINITEIPLIKISEFRVVVARIGTLLCSTHVLAEFKRKAEVVFGDPTEWTSSVLQELGTIAAGLTKAELRMLDKDLMPYFQPSAIKCLPDEIFKELSAEQIASLGPENAAAVTHAQRRRLSPLQLQSLQQALDGAKTHSWQDAPASAGPTRTSSSRSPAGALQSWGLWLGCPLLVLMAKLLW,815,NP_733764.1.csv,refseq-OTOA-NM_170664.3_clinical_seed_0_final,refseq-OTOA-NM_170664.3.a2m,Invitae,refseq-OTOA-NM_170664.3.npy,1,815,815
+NP_733774.1,MAQRYDELPHYGGMDGVGVPASMYGDPHAPRPIPPVHHLNHGPPLHATQHYGAHAPHPNVMPASMGSAVNDALKRDKDAIYGHPLFPLLALVFEKCELATCTPREPGVAGGDVCSSDSFNEDIAVFAKQVRAEKPLFSSNPELDNLMIQAIQVLRFHLLELEKVHELCDNFCHRYISCLKGKMPIDLVIDERDGSSKSDHEELSGSSTNLADHNPSSWRDHDDATSTHSAGTPGPSSGGHASQSGDNSSEQGDGLDNSVASPGTGDDDDPDKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQDTGLTILQVNNWFINARRRIVQPMIDQSNRAVSQGAAYSPEGQPMGSFVLDGQQHMGIRPAGPMSGMGMNMGMDGQWHYM,394,NP_733774.1.csv,refseq-MEIS2-NM_170674.4_clinical_seed_0_final,refseq-MEIS2-NM_170674.4.a2m,Invitae,refseq-MEIS2-NM_170674.4.npy,1,394,394
+NP_741960.1,MATITCTRFTEEYQLFEELGKGAFSVVRRCVKVLAGQEYAAKIINTKKLSARDHQKLEREARICRLLKHPNIVRLHDSISEEGHHYLIFDLVTGGELFEDIVAREYYSEADASHCIQQILEAVLHCHQMGVVHRDLKPENLLLASKLKGAAVKLADFGLAIEVEGEQQAWFGFAGTPGYLSPEVLRKDPYGKPVDLWACGVILYILLVGYPPFWDEDQHRLYQQIKAGAYDFPSPEWDTVTPEAKDLINKMLTINPSKRITAAEALKHPWISHRSTVASCMHRQETVDCLKKFNARRKLKGAILTTMLATRNFSGGKSGGNKKSDGVKESSESTNTTIEDEDTKVRKQEIIKVTEQLIEAISNGDFESYTKMCDPGMTAFEPEALGNLVEGLDFHRFYFENLWSRNSKPVHTTILNPHIHLMGDESACIAYIRITQYLDAGGIPRTAQSEETRVWHRRDGKWQIVHFHRSGAPSVLPH,478,NP_741960.1.csv,refseq-CAMK2A-NM_171825.2_clinical_seed_0_final,refseq-CAMK2A-NM_171825.2.a2m,Invitae,refseq-CAMK2A-NM_171825.2.npy,1,478,478
+NP_742055.1,MEMQDLTSPHSRLSGSSESPSGPKLGNSHINSNSMTPNGTEVKTEPMSSSETASTTADGSLNNFSGSAIGSSSFSPRPTHQFSPPQIYPSNRPYPHILPTPSSQTMAAYGQTQFTTGMQQATAYATYPQPGQPYGISSYGALWAGIKTEGGLSQSQSPGQTGFLSYGTSFSTPQPGQAPYSYQMQGSSFTTSSGIYTGNNSLTNSSGFNSSQQDYPSYPSFGQGQYAQYYNSSPYPAHYMTSSNTSPTTPSTNATYQLQEPPSGITSQAVTDPTAEYSTIHSPSTPIKDSDSDRLRRGSDGKSRGRGRRNNNPSPPPDSDLERVFIWDLDETIIVFHSLLTGSYANRYGRDPPTSVSLGLRMEEMIFNLADTHLFFNDLEECDQVHIDDVSSDDNGQDLSTYNFGTDGFPAAATSANLCLATGVRGGVDWMRKLAFRYRRVKEIYNTYKNNVGGLLGPAKREAWLQLRAEIEALTDSWLTLALKALSLIHSRTNCVNILVTTTQLIPALAKVLLYGLGIVFPIENIYSATKIGKESCFERIIQRFGRKVVYVVIGDGVEEEQGAKKHAMPFWRISSHSDLMALHHALELEYL,592,NP_742055.1.csv,refseq-EYA1-NM_172058.4_clinical_seed_0_final,refseq-EYA1-NM_172058.4.a2m,Invitae,refseq-EYA1-NM_172058.4_theta_0.2.npy,1,592,592
+NP_742105.1,MVQKSRNGGVYPGPSGEKKLKVGFVGLDPGAPDSTRDGALLIAGSEAPKRGSILSKPRAGGAGAGKPPKRNAFYRKLQNFLYNVLERPRGWAFIYHAYVFLLVFSCLVLSVFSTIKEYEKSSEGALYILEIVTIVVFGVEYFVRIWAAGCCCRYRGWRGRLKFARKPFCVIDIMVLIASIAVLAAGSQGNVFATSALRSLRFLQILRMIRMDRRGGTWKLLGSVVYAHSKELVTAWYIGFLCLILASFLVYLAEKGENDHFDTYADALWWGLITLTTIGYGDKYPQTWNGRLLAATFTLIGVSFFALPAGILGSGFALKVQEQHRQKHFEKRRNPAAGLIQSAWRFYATNLSRTDLHSTWQYYERTVTVPMYSSQTQTYGASRLIPPLNQLELLRNLKSKSGLAFRKDPPPEPSPSKGSPCRGPLCGCCPGRSSQKVSLKDRVFSSPRGVAAKGKGSPQAQTVRRSPSADQSLEDSPSKVPKSWSFGDRSRARQAFRIKGAASRQNSEEASLPGEDIVDDKSCPCEFVTEDLTPGLKVSIRAVCVMRFLVSKRKFKESLRPYDVMDVIEQYSAGHLDMLSRIKSLQSRVDQIVGRGPAITDKDRTKGPAEAELPEDPSMMGRLGKVEKQVLSMEKKLDFLVNIYMQRMGIPPTETEAYFGAKEPEPAPPYHSPEDSREHVDRHGCIVKIVRSSSSTGQKNFSAPPAAPPVQCPPSTSWQPQSHPRQGHGTSPVGDHGSLVRIPPPPAHERSLSAYGGGNRASMEFLRQEDTPGCRPPEGNLRDSDTSISIPSVDHEELERSFSGFSISQSKENLDALNSCYAAVAPCAKVRPYIAEGESDTDSDLCTPCGPPPRSATGEGPFGDVGWAGPRK,872,NP_742105.1.csv,refseq-KCNQ2-NM_172107.2_clinical_seed_0_final,refseq-KCNQ2-NM_172107.2.a2m,Invitae,refseq-KCNQ2-NM_172107.2.npy,1,872,872
+NP_751951.1,MSTLSNFTQTLEDVFRRIFITYMDNWRQNTTAEQEALQAKVDAENFYYVILYLMVMIGMFSFIIVAILVSTVKSKRREHSNDPYHQYIVEDWQEKYKSQILNLEESKATIHENIGAAGFKMSP,123,NP_751951.1.csv,refseq-KCNE2-NM_172201.1_clinical_seed_0_final,refseq-KCNE2-NM_172201.1.a2m,Invitae,refseq-KCNE2-NM_172201.1.npy,1,123,123
+NP_758440.1,MASATEDPVLERYFKGHKAAITSLDLSPNGKQLATASWDTFLMLWNFKPHARAYRYVGHKDVVTSVQFSPHGNLLASASRDRTVRLWIPDKRGKFSEFKAHTAPVRSVDFSADGQFLATASEDKSIKVWSMYRQRFLYSLYRHTHWVRCAKFSPDGRLIVSCSEDKTIKIWDTTNKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGVNCISFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGADTQVLLWRTNFDELHCKGLTKRNLKRLHFDSPPHLLDIYPRTPHPHEEKVETVEINPKLEVIDLQISTPPVMDILSFDSTTTTETSGRTLPDKGEEACGYFLNPSLMSPECLPTTTKKKTEDMSDLPCESQRSIPLAVTDALEHIMEQLNVLTQTVSILEQRLTLTEDKLKDCLENQQKLFSAVQQKS,478,NP_758440.1.csv,refseq-POC1B-NM_172240.2_clinical_seed_0_final,refseq-POC1B-NM_172240.2.a2m,Invitae,refseq-POC1B-NM_172240.2.npy,1,478,478
+NP_758454.1,MPMLLPHPHQHFLKGLLRAPFRCYHFIFHSSTHLGSGIPCAQPFNSLGLHCTKWMLLSDGLKRKLCVQTTLKDHTEGLSDKEQRFVDKLYTGLIQGQRACLAEAITLVESTHSRKKELAQVLLQKVLLYHREQEQSNKGKPLAFRVGLSGPPGAGKSTFIEYFGKMLTERGHKLSVLAVDPSSCTSGGSLLGDKTRMTELSRDMNAYIRPSPTRGTLGGVTRTTNEAILLCEGAGYDIILIETVGVGQSEFAVADMVDMFVLLLPPAGGDELQGIKRGIIEMADLVAVTKSDGDLIVPARRIQAEYVSALKLLRKRSQVWKPKVIRISARSGEGISEMWDKMKDFQDLMLASGELTAKRRKQQKVWMWNLIQESVLEHFRTHPTVREQIPLLEQKVLIGALSPGLAADFLLKAFKSRD,418,NP_758454.1.csv,refseq-MMAA-NM_172250.2_clinical_seed_0_final,refseq-MMAA-NM_172250.2.a2m,Invitae,refseq-MMAA-NM_172250.2.npy,1,418,418
+NP_758840.1,MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPATPRKQRRERTTFTRAQLDVLEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAKKKTSPAREVSSESGTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLSDPLSTSSSCMQRSYPMTYTQASGYSQGYAGSTSYFGGMDCGSYLTPMHHQLPGPGATLSPMGTNAVTSHLNQSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSWKFQVL,289,NP_758840.1.csv,refseq-OTX2-NM_172337.2_clinical_seed_0_final,refseq-OTX2-NM_172337.2.a2m,Invitae,refseq-OTX2-NM_172337.2.npy,1,289,289
+NP_758872.1,MTMAGGRRGLVAPQNTFLENIVRRSNDTNFVLGNAQIVDWPIVYSNDGFCKLSGYHRAEVMQKSSTCSFMYGELTDKDTIEKVRQTFENYEMNSFEILMYKKNRTPVWFFVKIAPIRNEQDKVVLFLCTFSDITAFKQPIEDDSCKGWGKFARLTRALTSSRGVLQQLAPSVQKGENVHKHSRLAEVLQLGSDILPQYKQEAPKTPPHIILHYCVFKTTWDWIILILTFYTAILVPYNVSFKTRQNNVAWLVVDSIVDVIFLVDIVLNFHTTFVGPAGEVISDPKLIRMNYLKTWFVIDLLSCLPYDVINAFENVDEVSAFMGDPGKIGFADQIPPPLEGRESQGISSLFSSLKVVRLLRLGRVARKLDHYIEYGAAVLVLLVCVFGLAAHWMACIWYSIGDYEIFDEDTKTIRNNSWLYQLAMDIGTPYQFNGSGSGKWEGGPSKNSVYISSLYFTMTSLTSVGFGNIAPSTDIEKIFAVAIMMIGSLLYATIFGNVTTIFQQMYANTNRYHEMLNSVRDFLKLYQVPKGLSERVMDYIVSTWSMSRGIDTEKVLQICPKDMRADICVHLNRKVFKEHPAFRLASDGCLRALAMEFQTVHCAPGDLIYHAGESVDSLCFVVSGSLEVIQDDEVVAILGKGDVFGDVFWKEATLAQSCANVRALTYCDLHVIKRDALQKVLEFYTAFSHSFSRNLILTYNLRKRIVFRKISDVKREEEERMKRKNEAPLILPPDHPVRRLFQRFRQQKEARLAAERGGRDLDDLDVEKGNVLTEHASANHSLVKASVVTVRESPATPVSFQAASTSGVPDHAKLQAPGSECLGPKGGGGDCAKRKSWARFKDACGKSEDWNKVSKAESMETLPERTKASGEATLKKTDSCDSGITKSDLRLDNVGEARSPQDRSPILAEVKHSFYPIPEQTLQATVLEVRHELKEDIKALNAKMTNIEKQLSEILRILTSRRSSQSPQELFEISRPQSPESERDIFGAS,989,NP_758872.1.csv,refseq-KCNH1-NM_172362.2_clinical_seed_0_final,refseq-KCNH1-NM_172362.2.a2m,Invitae,refseq-KCNH1-NM_172362.2.npy,1,989,989
+NP_758957.2,MDVGPSSLPHLGLKLLLLLLLLPLRGQANTGCYGIPGMPGLPGAPGKDGYDGLPGPKGEPGIPAIPGIRGPKGQKGEPGLPGHPGKNGPMGPPGMPGVPGPMGIPGEPGEEGRYKQKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHTANLCVLLYRSGVKVVTFCGHTSKTNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGIQGSDSVFSGFLLFPD,245,NP_758957.2.csv,refseq-C1QC-NM_172369.3_clinical_seed_0_final,refseq-C1QC-NM_172369.3.a2m,Invitae,refseq-C1QC-NM_172369.3.npy,1,245,245
+NP_775109.2,MASTSTTIRSHSSSRRGFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYGSRAGGSYGFGGAGSGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRVRAEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLFEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEINKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVNVSVVQSTISSGYGGASGVGSGLGLGGGSSYSYGSGLGIGGGFSSSSGRAIGGGLSSVGGGSSTIKYTTTSSSSRKSYKH,564,NP_775109.2.csv,refseq-KRT6C-NM_173086.4_clinical_seed_0_final,refseq-KRT6C-NM_173086.4.a2m,Invitae,refseq-KRT6C-NM_173086.4.npy,1,564,564
+NP_775300.1,MKGKKGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPKESVQILRDWLYEHRYNAYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDGKDPNQFTISRRGAKISETSSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKPSSPGSVLARPSVICHTTVTALKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPEDTCKSGPSTNTQSGLFNTPPPTPPDLNQDFSGFQLLVDVALKRAAEMELQAKLTA,272,NP_775300.1.csv,refseq-TGIF1-NM_173208.2_clinical_seed_0_final,refseq-TGIF1-NM_173208.2.a2m,Invitae,refseq-TGIF1-NM_173208.2.npy,1,272,272
+NP_775748.2,MNDQYHRAARDGYLELLKEATRKELNAPDEDGMTPTLWAAYHGNLESLRLIVSRGGDPDKCDIWGNTPLHLAASNGHLHCLSFLVSFGANIWCLDNDYHTPLDMAAMKGHMECVRYLDSIAAKQSSLNPKLVGKLKDKAFREAERRIRECAKLQRRHHERMERRYRRELAERSDTLSFSSLTSSTLSRRLQHLALGSHLPYSQATLHGTARGKTKMQKKLERRKQGGEGTFKVSEDGRKSARSLSGLQLGSDVMFVRQGTYANPKEWGRAPLRDMFLSDEDSVSRATLAAEPAHSEVSTDSGHDSLFTRPGLGTMVFRRNYLSSGLHGLGREDGGLDGVGAPRGRLQSSPSLDDDSLGSANSLQDRSCGEELPWDELDLGLDEDLEPETSPLETFLASLHMEDFAALLRQEKIDLEALMLCSDLDLRSISVPLGPRKKILGAVRRRRQAMERPPALEDTEL,461,NP_775748.2.csv,refseq-USH1G-NM_173477.2_clinical_seed_0_final,refseq-USH1G-NM_173477.2.a2m,Invitae,refseq-USH1G-NM_173477.2_theta_0.2.npy,1,461,461
+NP_775754.2,MLPITDRLLHLLGLEKTAFRIYAVSTLLLFLLFFLFRLLLRFLRLCRSFYITCRRLRCFPQPPRRNWLLGHLGMYLPNEAGLQDEKKVLDNMHHVLLVWMGPVLPLLVLVHPDYIKPLLGASAAIAPKDDLFYGFLKPWLGDGLLLSKGDKWSRHRRLLTPAFHFDILKPYMKIFNQSADIMHAKWRHLAEGSAVSLDMFEHISLMTLDSLQKCVFSYNSNCQEKMSDYISAIIELSALSVRRQYRLHHYLDFIYYRSADGRRFRQACDMVHHFTTEVIQERRRALRQQGAEAWLKAKQGKTLDFIDVLLLARDEDGKELSDEDIRAEADTFMFEGHDTTSSGISWMLFNLAKYPEYQEKCREEIQEVMKGRELEELEWDDLTQLPFTTMCIKESLRQYPPVTLVSRQCTEDIKLPDGRIIPKGIICLVSIYGTHHNPTVWPDSKVYNPYRFDPDNPQQRSPLAYVPFSAGPRNCIGQSFAMAELRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLKVEPLPPRA,531,NP_775754.2.csv,refseq-CYP4F22-NM_173483.3_clinical_seed_0_final,refseq-CYP4F22-NM_173483.3.a2m,Invitae,refseq-CYP4F22-NM_173483.3.npy,1,531,531
+NP_775766.2,MLRQVLHRGLRTCFSRLGHFIASHPVFFASAPVLISILLGASFSRYQVEESVEHLLAPQHSLAKIERNLVNSLFPVNRSKHRLYSDLQTPGRYGRVIVTSFQKANMLDQHHTDLILKLHAAVTKIQVPRPGFNYTFAHICILNNDKTCIVDDIVHVLEELKNARATNRTNFAITYPITHLKDGRAVYNGHQLGGVTVHSKDRVKSAEAIQLTYYLQSINSLNDMVAERWESSFCDTVRLFQKSNSKVKMYPYTSSSLREDFQKTSRVSERYLVTSLILVVTMAILCCSMQDCVRSKPWLGLLGLVTISLATLTAAGIINLTGGKYNSTFLGVPFVMLGHGLYGTFEMLSSWRKTREDQHVKERTAAVYADSMLSFSLTTAMYLVTFGIGASPFTNIEAARIFCCNSCIAIFFNYLYVLSFYGSSLVFTGYIENNYQHSIFCRKVPKPEALQEKPAWYRFLLTARFSEDTAEGEEANTYESHLLVCFLKRYYCDWITNTYVKPFVVLFYLIYISFALMGYLQVSEGSDLSNIVATATQTIEYTTAQQKYFSNYSPVIGFYIYESIEYWNTSVQEDVLEYTKGFVRISWFESYLNYLRKLNVSTGLPKKNFTDMLRNSFLKAPQFSHFQEDIIFSKKYNDEVDVVASRMFLVAKTMETNREELYDLLETLRRLSVTSKVKFIVFNPSFVYMDRYASSLGAPLHNSCISALFLLFFSAFLVADSLINVWITLTVVSVEFGVIGFMTLWKVELDCISVLCLIYGINYTIDNCAPMLSTFVLGKDFTRTKWVKNALEVHGVAILQSYLCYIVGLIPLAAVPSNLTCTLFRCLFLIAFVTFFHCFAILPVILTFLPPSKKKRKEKKNPENREEIECVEMVDIDSTRVVDQITTV,888,NP_775766.2.csv,refseq-PTCHD1-NM_173495.2_clinical_seed_0_final,refseq-PTCHD1-NM_173495.2.a2m,Invitae,refseq-PTCHD1-NM_173495.2_theta_0.2.npy,1,888,888
+NP_775771.3,MSGGGEQLDILSVGILVKERWKVLRKIGGGGFGEIYDALDMLTRENVALKVESAQQPKQVLKMEVAVLKKLQGKDHVCRFIGCGRNDRFNYVVMQLQGRNLADLRRSQSRGTFTISTTLRLGRQILESIESIHSVGFLHRDIKPSNFAMGRFPSTCRKCYMLDFGLARQFTNSCGDVRPPRAVAGFRGTVRYASINAHRNREMGRHDDLWSLFYMLVEFVVGQLPWRKIKDKEQVGSIKERYDHRLMLKHLPPEFSIFLDHISSLDYFTKPDYQLLTSVFDNSIKTFGVIESDPFDWEKTGNDGSLTTTTTSTTPQLHTRLTPAAIGIANATPIPGDLLRENTDEVFPDEQLSDGENGIPVGVSPDKLPGSLGHPRPQEKDVWEEMDANKNKIKLGICKAATEEENSHGQANGLLNAPSLGSPIRVRSEITQPDRDIPLVRKLRSIHSFELEKRLTLEPKPDTDKFLETCLEKMQKDTSAGKESILPALLHKPCVPAVSRTDHIWHYDEEYLPDASKPASANTPEQADGGGSNGFIAVNLSSCKQEIDSKEWVIVDKEQDLQDFRTNEAVGHKTTGSPSDEEPEVLQVLEASPQDEKLQLGPWAENDHLKKETSGVVLALSAEGPPTAASEQYTDRLELQPGAASQFIAATPTSLMEAQAEGPLTAITIPRPSVASTQSTSGSFHCGQQPEKKDLQPMEPTVELYSPRENFSGLVVTEGEPPSGGSRTDLGLQIDHIGHDMLPNIRESNKSQDLGPKELPDHNRLVVREFENLPGETEEKSILLESDNEDEKLSRGQHCIEISSLPGDLVIVEKDHSATTEPLDVTKTQTFSVVPNQDKNNEIMKLLTVGTSEISSRDIDPHVEGQIGQVAEMQKNKISKDDDIMSEDLPGHQGDLSTFLHQEGKREKITPRNGELFHCVSENEHGAPTRKDMVRSSFVTRHSRIPVLAQEIDSTLESSSPVSAKEKLLQKKAYQPDLVKLLVEKRQFKSFLGDLSSASDKLLEEKLATVPAPFCEEEVLTPFSRLTVDSHLSRSAEDSFLSPIISQSRKSKIPRPVSWVNTDQVNSSTSSQFFPRPPPGKPPTRPGVEARLRRYKVLGSSNSDSDLFSRLAQILQNGSQKPRSTTQCKSPGSPHNPKTPPKSPVVPRRSPSASPRSSSLPRTSSSSPSRAGRPHHDQRSSSPHLGRSKSPPSHSGSSSSRRSCQQEHCKPSKNGLKGSGSLHHHSASTKTPQGKSKPASKLSR,1244,NP_775771.3.csv,refseq-TTBK2-NM_173500.3_clinical_seed_0_final,refseq-TTBK2-NM_173500.3.a2m,Invitae,refseq-TTBK2-NM_173500.3.npy,1,1244,1244
+NP_775822.3,MGEGGLPPAFQLLLRACDQGDTETARRLLEPGAAEPAERGAEPEAGAEPAGAEVAGPGAAAAGAVGAPVPVDCSDEAGNTALQFAAAGGHEPLVRFLLRRGASVNSRNHYGWSALMQAARFGHVSVAHLLLDHGADVNAQNRLGASVLTVASRGGHLGVVKLLLEAGAFVDHHHPSGEQLGLGGSRDEPLDITALMAAIQHGHEAVVRLLMEWGADPNHAARTVGWSPLMLAALTGRLGVAQQLVEKGANPDHLSVLEKTAFEVALDCKHRDLVDYLDPLTTVRPKTDEEKRRPDIFHALKMGNFQLVKEIADEDPSHVNLVNGDGATPLMLAAVTGQLALVQLLVERHADVDKQDSVHGWTALMQATYHGNKEIVKYLLNQGADVTLRAKNGYTAFDLVMLLNDPDTELVRLLASVCMQVNKDKGRPSHQPPLPHSKVRQPWSIPVLPDDKGGLKSWWNRMSNRFRKLKLMQTLPRGLSSNQPLPFSDEPEPALDSTMRAAPQDKTSRSALPDAAPVTKDNGPGSTRGEKEDTLLTTMLRNGAPLTRLPSDKLKAVIPPFLPPSSFELWSSDRSRTRHNGKADPMKTALPQRASRGHPVGGGGTDTTPVRPVKFPSLPRSPASSANSGNFNHSPHSSGGSSGVGVSRHGGELLNRSGGSIDNVLSQIAAQRKKAAGLLEQKPSHRSSPVGPAPGSSPSELPASPAGGSAPVGKKLETSKRPPSGTSTTSKSTSPTLTPSPSPKGHTAESSVSSSSSHRQSKSSGGSSSGTITDEDELTGILKKLSLEKYQPIFEEQEVDMEAFLTLTDGDLKELGIKTDGSRQQILAAISELNAGKGRERQILQETIHNFHSSFESSASNTRAPGNSPCA,871,NP_775822.3.csv,refseq-ANKS6-NM_173551.4_clinical_seed_0_final,refseq-ANKS6-NM_173551.4.a2m,Invitae,refseq-ANKS6-NM_173551.4.npy,1,871,871
+NP_775831.2,MAKVPELEDTFLQAQPAPQLSPGIQEDCCVQLLGKGLLVYPEETVYLAAEGQPGGEQGGGEKGEDPELPGAVKSEMHLNNGNFSSEEEDADNHDSKTKAADQYLSQKKTITQIVKDKKKQTQLTLQWLEENYIVCEGVCLPRCILYAHYLDFCRKEKLEPACAATFGKTIRQKFPLLTTRRLGTRGHSKYHYYGIGIKESSAYYHSVYSGKGLTRFSGSKLKNEGGFTRKYSLSSKTGTLLPEFPSAQHLVYQGCISKDKVDTLIMMYKTHCQCILDNAINGNFEEIQHFLLHFWQGMPDHLLPLLENPVIIDIFCVCDSILYKVLTDVLIPATMQEMPESLLADIRNFAKNWEQWVVSSLENLPEALTDKKIPIVRRFVSSLKRQTSFLHLAQIARPALFDQHVVNSMVSDIERVDLNSIGSQALLTISGSTDTESGIYTEHDSITVFQELKDLLKKNATVEAFIEWLDTVVEQRVIKTSKQNGRSLKKRAQDFLLKWSFFGARVMHNLTLNNASSFGSFHLIRMLLDEYILLAMETQFNNDKEQELQNLLDKYMKNSDASKAAFTASPSSCFLANRNKGSMVSSDAVKNESHVETTYLPLPSSQPGGLGPALHQFPAGNTDNMPLTGQMELSQIAGHLMTPPISPAMASRGSVINQGPMAGRPPSVGPVLSAPSHCSTYPEPIYPTLPQANHDFYSTSSNYQTVFRAQPHSTSGLYPHHTEHGRCMAWTEQQLSRDFFSGSCAGSPYNSRPPSSYGPSLQAQDSHNMQFLNTGSFNFLSNTGAASCQGATLPPNSPNGYYGSNINYPESHRLGSMVNQHVSVISSIRSLPPYSDIHDPLNILDDSGRKQTSSFYTDTSSPVACRTPVLASSLQTPIPSSSSQCMYGTSNQYPAQETLDSHGTSSREMVSSLPPINTVFMGTAAGGT,928,NP_775831.2.csv,refseq-RFX6-NM_173560.3_clinical_seed_0_final,refseq-RFX6-NM_173560.3.a2m,Invitae,refseq-RFX6-NM_173560.3.npy,1,928,928
+NP_775901.3,MVLAGLIRKLGHQLAEIRERALKSILCKIEHNLICYADLIQERQLFLHLLEWFNFPSVPMKEEVLNLLSRLVKYPPAVQHLVDVGAVEFLSKLRSNVEPNLQAEIDGILDGLFLLPSEVPALSSASYQTNQTELSKNPEILTGYFPQDKSNFQQMEVPPRPVVNQTVKCLKFSTFPWLPLTTTDRHVLSSNESSLRSSNHTLIWNTCELLKDVIMQDFPAEIFLQRPKIVQSLLSLLKLAFGDGKHRLALQSVSCLQQLCMYLRNRLNFHRDPGFFSNKHDTVSQNSSLSYCHEARGTHHSQNPSPGSSSPRPSVVGRTGQRPRGDGQDWDAASSSGSSSHAHVNSRISVHSPLDMGHIDLPELETEDTLELQFQQLSLPQFCVSILESAVPLLRTGSRQVIIRVLELLTEDMTLIGEAISTDIWDDSSLFGIDMKEKLLLVLGALGETMCYHKSSISLEQPEVMLVHHRMAFISISLFAVRLLQTLLPVEKASEFLSEPMSTALFLLSLDMPISLEYPNIHEAVVAYLEQLNSENYSIYKRTAEAVYSIECTCNFLSDIGKEGEKNLLELVELADQALRSFSYHQHFPLIKEIISICSKIWKSAQASPLLQGESQKVLLHMLSHPLPRVKAETYHCCLEITKECLGVHNVTKPVSSLCNGIHFLLHPKVLYEISVFGIQEPESEVNTAAKAILLYLLQGRLMMTALTWNKFIESLCPVIPILQGYADTEDPLGNCILLLSKASSDTEEMLPCTTRLKSMLRLLLVKKPSVRSLALKLLAFHLTSEEGADTKRPLIDARVLSRVTDLFIGKKPIELRLDDRRELVIKLETVEKVYEIFTSDDVDLVLRKSAAEQLAVIMQDIKMHAVVKKLCLIDKIIEYLNECVSQDGKVVECLVQPCLTLLRKVLCGDPVMRVSLSQQSSLLTVLFRVSLIFHEDCSVVTEVGALFCLLLFDEVSRMDMWSVNPSNKPSLPSVFSLPVSVFRRYHLPVHVIGHHAVSPYSIVLPLSADCLALKPVSDMLRIAWNLSWYHGSDNLLKQMNSETKTQEILDALKLSTEDILTLKITHMASGLQDCLHSIVQAATHREVRAAVTRMSFYLLNDRLSLKGCPGPCGVTLKSLAWHTALNRFLQVLPACTEDEKLLIDIIHFLNKLIKEQRKNSSLELLNWILELLLRHSANPLLDLLVLTESQAREETDDIRTAVRQQLQKELIALFDTLLLNFMEVTDRKCSELLYVFQTQLALKLLQCLKVTDAPHFYGLPSLERTLRGMANLTAFPGWSSHSPLTKPLDICVKYLSGLLEVITSFYVERGGNAMSFMGKGVTKSTILCLLHLSHEMMAQAGSLEWMSLWFLPLGSHSEEHIPTQQGLAWLIPLWVDRDPEVRFTSLGLGSALTTLETGCVALANSCQNISGGLWGTVVNILLDQSECSMVRREAAFILQNLLVIPMPTEIIKDYTWQGPCVHDEDSGLSLIGKPALQALLYHCHFYEHLNQMVKHCYLGRCMFDLNFSAFDRNSESNDLNGLDDSFKFWRAPSRTSQDRDPSSLSTSETTVAPSLGSTEFQPLVQSTTLLPEASHDQFVAQGHQESTSPRPPHDSSLSAPLPKLCVFVTPSLLSAMCSLLDNLLTIAPRDTAKAFRQAHLIELLCSIADATLIQTCVQELRALLPSSPPAEHTQAQVSFLLEYLSSLSRLLQSCLLVEPDLVIQDELVKPLITNIIGILTICTKDVLDKELISAFYHTWTHLFNLLAMLLRKAGAITLPFVTVALAKHWTAAIDMFCTCAGLSATCPALYTASLQFLSVLLTEEAKGHLQAKSKTHLCCSPTVASLLDDSQENQKSLEQLSDVILQCYEGKSSKDILKRVAANALMSLLAVSRRAQKHALKANLIDNCMEQMKHINAQLNLDSLRPGKAALKKKEDGVIKELSIAMQLLRNCLYQNEECKEAALEAHLVPVLHSLWPWILMDDSLMQISLQLLCVYTANFPNGCSSLCWSSCGQHPVQATHRGAVSNSLMLCILKLASQMPLENTTVQQMVFMLLSNLALSHDCKGVIQKSNFLQNFLSLALPKGGNKHLSNLTILWLKLLLNISSGEDGQQMILRLDGCLDLLTEMSKYKHKSSPLLPLLIFHNVCFSPANKPKILANEKVITVLAACLESENQNAQRIGAAALWALIYNYQKAKTALKSPSVKRRVDEAYSLAKKTFPNSEANPLNAYYLKCLENLVQLLNSS,2226,NP_775901.3.csv,refseq-RTTN-NM_173630.3_clinical_seed_0_final,refseq-RTTN-NM_173630.3.a2m,Invitae,refseq-RTTN-NM_173630.3.npy,1,2226,2226
+NP_775925.1,MLMLFVFGVLLHEVSLSGQNEAPPNTHSIPGEPLYNYASIRLPEEHIPFFLHNNRHIATVCRKDSLCPYKKHLEKLKYCWGYEKSCKPEFRFGYPVCSYVDMGWTDTLESAEDIFWKQADFGYARERLEEMHVLCQPKETSDSSLVCSRYLQYCRATNLYLDLRNIKRNHDRFKEDFFQSGEIGGHCKLDIRTLTSEGQRKSPLQSWFAELQSYTQLNFRPIEDAKCDIVIEKPTYFMKLDAGVNMYHHFCDFINLYITQHVNNSFSTDVYIVMWDTDGKIRVTILARSTEYRKILNQNELVNALKTVSTFEVQIVDYKYRELGFLDQLRITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYNCEDERCYLDLARLRGVHYITWRRQNKVFPQDKGHHPTLGEHPKFTNYSFDVEEFMYLVLQAADHVLQHPKWPFKKKHDEL,443,NP_775925.1.csv,refseq-EOGT-NM_173654.2_clinical_seed_0_final,refseq-EOGT-NM_173654.2.a2m,Invitae,refseq-EOGT-NM_173654.2.npy,1,443,443
+NP_775960.4,MALARPGTPDPQALASVLLLLLWAPALSLLAGTVPSEPPSACASDPCAPGTECQATESGGYTCGPMEPRGCATQPCHHGALCVPQGPDPTGFRCYCVPGFQGPRCELDIDECASRPCHHGATCRNLADRYECHCPLGYAGVTCEMEVDECASAPCLHGGSCLDGVGSFRCVCAPGYGGTRCQLDLDECQSQPCAHGGTCHDLVNGFRCDCAGTGYEGTHCEREVLECASAPCEHNASCLEGLGSFRCLCWPGYSGELCEVDEDECASSPCQHGGRCLQRSDPALYGGVQAAFPGAFSFRHAAGFLCHCPPGFEGADCGVEVDECASRPCLNGGHCQDLPNGFQCHCPDGYAGPTCEEDVDECLSDPCLHGGTCSDTVAGYICRCPETWGGRDCSVQLTGCQGHTCPLAATCIPIFESGVHSYVCHCPPGTHGPFCGQNTTFSVMAGSPIQASVPAGGPLGLALRFRTTLPAGTLATRNDTKESLELALVAATLQATLWSYSTTVLVLRLPDLALNDGHWHQVEVVLHLATLELRLWHEGCPARLCVASGPVALASTASATPLPAGISSAQLGDATFAGCLQDVRVDGHLLLPEDLGENVLLGCERREQCRPLPCVHGGSCVDLWTHFRCDCARPHRGPTCADEIPAATFGLGGAPSSASFLLQELPGPNLTVSFLLRTRESAGLLLQFANDSAAGLTVFLSEGRIRAEVPGSPAVVLPGRWDDGLRHLVMLSFGPDQLQDLGQHVHVGGRLLAADSQPWGGPFRGCLQDLRLDGCHLPFFPLPLDNSSQPSELGGRQSWNLTAGCVSEDMCSPDPCFNGGTCLVTWNDFHCTCPANFTGPTCAQQLWCPGQPCLPPATCEEVPDGFVCVAEATFREGPPAAFSGHNASSGRLLGGLSLAFRTRDSEAWLLRAAAGALEGVWLAVRNGSLAGGVRGGHGLPGAVLPIPGPRVADGAWHRVRLAMERPAATTSRWLLWLDGAATPVALRGLASDLGFLQGPGAVRILLAENFTGCLGRVALGGLPLPLARPRPGAAPGAREHFASWPGTPAPILGCRGAPVCAPSPCLHDGACRDLFDAFACACGPGWEGPRCEAHVDPCHSAPCARGRCHTHPDGRFECRCPPGFGGPRCRLPVPSKECSLNVTCLDGSPCEGGSPAANCSCLEGLAGQRCQVPTLPCEANPCLNGGTCRAAGGVSECICNARFSGQFCEVAKGLPLPLPFPLLEVAVPAACACLLLLLLGLLSGILAARKRRQSEGTYSPSQQEVAGARLEMDSVLKVPPEERLI,1285,NP_775960.4.csv,refseq-CRB2-NM_173689.6_clinical_seed_0_final,refseq-CRB2-NM_173689.6.a2m,Invitae,refseq-CRB2-NM_173689.6.npy,1,1285,1285
+NP_777362.1,MAAAQPKYPAGATARRLARGCWSALWDYETPKVIVVRNRRLGVLYRAVQLLILLYFVWYVFIVQKSYQESETGPESSIITKVKGITTSEHKVWDVEEYVKPPEGGSVFSIITRVEATHSQTQGTCPESIRVHNATCLSDADCVAGELDMLGNGLRTGRCVPYYQGPSKTCEVFGWCPVEDGASVSQFLGTMAPNFTILIKNSIHYPKFHFSKGNIADRTDGYLKRCTFHEASDLYCPIFKLGFIVEKAGESFTELAHKGGVIGVIINWDCDLDLPASECNPKYSFRRLDPKHVPASSGYNFRFAKYYKINGTTTRTLIKAYGIRIDVIVHGQAGKFSLIPTIINLATALTSVGVGSFLCDWILLTFMNKNKVYSHKKFDKMVDTPASEPAQASTPTDPKGLAQL,404,NP_777362.1.csv,refseq-P2RX2-NM_174873.2_clinical_seed_0_final,refseq-P2RX2-NM_174873.2.a2m,Invitae,refseq-P2RX2-NM_174873.2.npy,1,404,404
+NP_777576.1,MADEEAGGTERMEISAELPQTPQRLASWWDQQVDFYTAFLHHLAQLVPEIYFAEMDPDLEKQEESVQMSIFTPLEWYLFGEDPDICLEKLKHSGAFQLCGRVFKSGETTYSCRDCAIDPTCVLCMDCFQDSVHKNHRYKMHTSTGGGFCDCGDTEAWKTGPFCVNHEPGRAGTIKENSRCPLNEEVIVQARKIFPSVIKYVVEMTIWEEEKELPPELQIREKNERYYCVLFNDEHHSYDHVIYSLQRALDCELAEAQLHTTAIDKEGRRAVKAGAYAACQEAKEDIKSHSENVSQHPLHVEVLHSEIMAHQKFALRLGSWMNKIMSYSSDFRQIFCQACLREEPDSENPCLISRLMLWDAKLYKGARKILHELIFSSFFMEMEYKKLFAMEFVKYYKQLQKEYISDDHDRSISITALSVQMFTVPTLARHLIEEQNVISVITETLLEVLPEYLDRNNKFNFQGYSQDKLGRVYAVICDLKYILISKPTIWTERLRMQFLEGFRSFLKILTCMQGMEEIRRQVGQHIEVDPDWEAAIAIQMQLKNILLMFQEWCACDEELLLVAYKECHKAVMRCSTSFISSSKTVVQSCGHSLETKSYRVSEDLVSIHLPLSRTLAGLHVRLSRLGAVSRLHEFVSFEDFQVEVLVEYPLRCLVLVAQVVAEMWRRNGLSLISQVFYYQDVKCREEMYDKDIIMLQIGASLMDPNKFLLLVLQRYELAEAFNKTISTKDQDLIKQYNTLIEEMLQVLIYIVGERYVPGVGNVTKEEVTMREIIHLLCIEPMPHSAIAKNLPENENNETGLENVINKVATFKKPGVSGHGVYELKDESLKDFNMYFYHYSKTQHSKAEHMQKKRRKQENKDEALPPPPPPEFCPAFSKVINLLNCDIMMYILRTVFERAIDTDSNLWTEGMLQMAFHILALGLLEEKQQLQKAPEEEVTFDFYHKASRLGSSAMNIQMLLEKLKGIPQLEGQKDMITWILQMFDTVKRLREKSCLIVATTSGSESIKNDEITHDKEKAERKRKAEAARLHRQKIMAQMSALQKNFIETHKLMYDNTSEMPGKEDSIMEEESTPAVSDYSRIALGPKRGPSVTEKEVLTCILCQEEQEVKIENNAMVLSACVQKSTALTQHRGKPIELSGEALDPLFMDPDLAYGTYTGSCGHVMHAVCWQKYFEAVQLSSQQRIHVDLFDLESGEYLCPLCKSLCNTVIPIIPLQPQKINSENADALAQLLTLARWIQTVLARISGYNIRHAKGENPIPIFFNQGMGDSTLEFHSILSFGVESSIKYSNSIKEMVILFATTIYRIGLKVPPDERDPRVPMLTWSTCAFTIQAIENLLGDEGKPLFGALQNRQHNGLKALMQFAVAQRITCPQVLIQKHLVRLLSVVLPNIKSEDTPCLLSIDLFHVLVGAVLAFPSLYWDDPVDLQPSSVSSSYNHLYLFHLITMAHMLQILLTVDTGLPLAQVQEDSEEAHSASSFFAEISQYTSGSIGCDIPGWYLWVSLKNGITPYLRCAALFFHYLLGVTPPEELHTNSAEGEYSALCSYLSLPTNLFLLFQEYWDTVRPLLQRWCADPALLNCLKQKNTVVRYPRKRNSLIELPDDYSCLLNQASHFRCPRSADDERKHPVLCLFCGAILCSQNICCQEIVNGEEVGACIFHALHCGAGVCIFLKIRECRVVLVEGKARGCAYPAPYLDEYGETDPGLKRGNPLHLSRERYRKLHLVWQQHCIIEEIARSQETNQMLFGFNWQLL,1749,NP_777576.1.csv,refseq-UBR1-NM_174916.2_clinical_seed_0_final,refseq-UBR1-NM_174916.2.a2m,Invitae,refseq-UBR1-NM_174916.2.npy,1,1749,1749
+NP_777577.2,MLPHVVLTFRRLGCALASCRLAPARHRGSGLLHTAPVARSDRSAPVFTRALAFGDRIALVDQHGRHTYRELYSRSLRLSQEICRLCGCVGGDLREERVSFLCANDASYVVAQWASWMSGGVAVPLYRKHPAAQLEYVICDSQSSVVLASQEYLELLSPVVRKLGVPLLPLTPAIYTGAVEEPAEVPVPEQGWRNKGAMIIYTSGTTGRPKGVLSTHQNIRAVVTGLVHKWAWTKDDVILHVLPLHHVHGVVNALLCPLWVGATCVMMPEFSPQQVWEKFLSSETPRINVFMAVPTIYTKLMEYYDRHFTQPHAQDFLRAVCEEKIRLMVSGSAALPLPVLEKWKNITGHTLLERYGMTEIGMALSGPLTTAVRLPGSVGTPLPGVQVRIVSENPQREACSYTIHAEGDERGTKVTPGFEEKEGELLVRGPSVFREYWNKPEETKSAFTLDGWFKTGDTVVFKDGQYWIRGRTSVDIIKTGGYKVSALEVEWHLLAHPSITDVAVIGVPDMTWGQRVTAVVTLREGHSLSHRELKEWARNVLAPYAVPSELVLVEEIPRNQMGKIDKKALIRHFHPS,576,NP_777577.2.csv,ACSF3_HUMAN_b07_clinical_seed_0_final,ACSF3_HUMAN_b07.a2m,EVE,ACSF3_HUMAN_b07_theta_0.2.npy,1,576,576
+NP_777596.2,MGTVSSRRSWWPLPLLLLLLLLLGPAGARAQEDEDGDYEELVLALRSEEDGLAEAPEHGTTATFHRCAKDPWRLPGTYVVVLKEETHLSQSERTARRLQAQAARRGYLTKILHVFHGLLPGFLVKMSGDLLELALKLPHVDYIEEDSSVFAQSIPWNLERITPPRYRADEYQPPDGGSLVEVYLLDTSIQSDHREIEGRVMVTDFENVPEEDGTRFHRQASKCDSHGTHLAGVVSGRDAGVAKGASMRSLRVLNCQGKGTVSGTLIGLEFIRKSQLVQPVGPLVVLLPLAGGYSRVLNAACQRLARAGVVLVTAAGNFRDDACLYSPASAPEVITVGATNAQDQPVTLGTLGTNFGRCVDLFAPGEDIIGASSDCSTCFVSQSGTSQAAAHVAGIAAMMLSAEPELTLAELRQRLIHFSAKDVINEAWFPEDQRVLTPNLVAALPPSTHGAGWQLFCRTVWSAHSGPTRMATAVARCAPDEELLSCSSFSRSGKRRGERMEAQGGKLVCRAHNAFGGEGVYAIARCCLLPQANCSVHTAPPAEASMGTRVHCHQQGHVLTGCSSHWEVEDLGTHKPPVLRPRGQPNQCVGHREASIHASCCHAPGLECKVKEHGIPAPQEQVTVACEEGWTLTGCSALPGTSHVLGAYAVDNTCVVRSRDVSTTGSTSEGAVTAVAICCRSRHLAQASQELQ,692,NP_777596.2.csv,refseq-PCSK9-NM_174936.3_clinical_seed_0_final,refseq-PCSK9-NM_174936.3.a2m,Invitae,refseq-PCSK9-NM_174936.3.npy,1,692,692
+NP_778243.1,MMRVCWLVRQDSRHQRIRLPHLEAVVIGRGPETKITDKKCSRQQVQLKAECNKGYVKVKQVGVNPTSIDSVVIGKDQEVKLQPGQVLHMVNELYPYIVEFEEEAKNPGLETHRKRKRSGNSDSIERDAAQEAEAGTGLEPGSNSGQCSVPLKKGKDAPIKKESLGHWSQGLKISMQDPKMQVYKDEQVVVIKDKYPKARYHWLVLPWTSISSLKAVAREHLELLKHMHTVGEKVIVDFAGSSKLRFRLGYHAIPSMSHVHLHVISQDFDSPCLKNKKHWNSFNTEYFLESQAVIEMVQEAGRVTVRDGMPELLKLPLRCHECQQLLPSIPQLKEHLRKHWTQ,342,NP_778243.1.csv,refseq-APTX-NM_175073.2_clinical_seed_0_final,refseq-APTX-NM_175073.2.a2m,Invitae,refseq-APTX-NM_175073.2.npy,1,342,342
+NP_783328.1,MPAMPSSGPGDTSSSAAEREEDRKDGEEQEEPRGKEERQEPSTTARKVGRPGRKRKHPPVESGDTPKDPAVISKSPSMAQDSGASELLPNGDLEKRSEPQPEEGSPAGGQKGGAPAEGEGAAETLPEASRAVENGCCTPKEGRGAPAEAGKEQKETNIESMKMEGSRGRLRGGLGWESSLRQRPMPRLTFQAGDPYYISKRKRDEWLARWKREAEKKAKVIAGMNAVEENQGPGESQKVEEASPPAVQQPTDPASPTVATTPEPVGSDAGDKNATKAGDDEPEYEDGRGFGIGELVWGKLRGFSWWPGRIVSWWMTGRSRAAEGTRWVMWFGDGKFSVVCVEKLMPLSSFCSAFHQATYNKQPMYRKAIYEVLQVASSRAGKLFPVCHDSDESDTAKAVEVQNKPMIEWALGGFQPSGPKGLEPPEEEKNPYKEVYTDMWVEPEAAAYAPPPPAKKPRKSTAEKPKVKEIIDERTRERLVYEVRQKCRNIEDICISCGSLNVTLEHPLFVGGMCQNCKNCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNNCCRCFCVECVDLLVGPGAAQAAIKEDPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNHDQEFDPPKVYPPVPAEKRKPIRVLSLFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVGMVRHQGKIMYVGDVRSVTQKHIQEWGPFDLVIGGSPCNDLSIVNPARKGLYEGTGRLFFEFYRLLHDARPKEGDDRPFFWLFENVVAMGVSDKRDISRFLESNPVMIDAKEVSAAHRARYFWGNLPGMNRPLASTVNDKLELQECLEHGRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKEDILWCTEMERVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIRHLFAPLKEYFACV,912,NP_783328.1.csv,refseq-DNMT3A-NM_175629.2_clinical_seed_0_final,refseq-DNMT3A-NM_175629.2.a2m,Invitae,refseq-DNMT3A-NM_175629.2.npy,1,912,912
+NP_787110.2,MVSVNAPLGAPVESSYDTSPSEGTNLNAPNSLGVSALCAICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKKIASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNVIVANTMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI,452,NP_787110.2.csv,refseq-HNF4A-NM_175914.4_clinical_seed_0_final,refseq-HNF4A-NM_175914.4.a2m,Invitae,refseq-HNF4A-NM_175914.4.npy,1,452,452
+NP_789788.1,MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLALVDFGSREAVARLEKAIAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADELLQNSHRVVEEYFVAPPGNISLPKLDEQEPFPHS,136,NP_789788.1.csv,refseq-GATC-NM_176818.2_clinical_seed_0_final,refseq-GATC-NM_176818.2.a2m,Invitae,refseq-GATC-NM_176818.2.npy,1,136,136
+NP_789794.1,MDLILNRMDYLQVGVTSQKTMKLIPASRHRATQKVVIGDHDGVVMCFGMKKGEAAAVFKTLPGPKIARLELGGVINTPQEKIFIAAASEIRGFTKRGKQFLSFETNLTESIKAMHISGSDLFLSASYIYNHYCDCKDQHYYLSGDKINDVICLPVERLSRITPVLACQDRVLRVLQGSDVMYAVEVPGPPTVLALHNGNGGDSGEDLLFGTSDGKLALIQITTSKPVRKWEIQNEKKRGGILCIDSFDIVGDGVKDLLVGRDDGMVEVYSFDNANEPVLRFDQMLSESVTSIQGGCVGKDSYDEIVVSTYSGWVTGLTTEPIHKESGPGEELKINQEMQNKISSLRNELEHLQYKVLQERENYQQSSQSSKAKSAVPSFGINDKFTLNKDDASYSLILEVQTAIDNVLIQSDVPIDLLDVDKNSAVVSFSSCDSESNDNFLLATYRCQADTTRLELKIRSIEGQYGTLQAYVTPRIQPKTCQVRQYHIKPLSLHQRTHFIDHDRPMNTLTLTGQFSFAEVHSWVVFCLPEVPEKPPAGECVTFYFQNTFLDTQLESTYRKGEGVFKSDNISTISILKDVLSKEATKRKINLNISYEINEVSVKHTLKLIHPKLEYQLLLAKKVQLIDALKELQIHEGNTNFLIPEYHCILEEADHLQEEYKKQPAHLERLYGMITDLFIDKFKFKGTNVKTKVPLLLEILDSYDQNALISFFDAA,715,NP_789794.1.csv,refseq-BBS7-NM_176824.2_clinical_seed_0_final,refseq-BBS7-NM_176824.2.a2m,Invitae,refseq-BBS7-NM_176824.2.npy,1,715,715
+NP_789845.1,MSALLRLLRTGAPAAACLRLGTSAGTGSRRAMALYHTEERGQPCSQNYRLFFKNVTGHYISPFHDIPLKVNSKEENGIPMKKARNDEYENLFNMIVEIPRWTNAKMEIATKEPMNPIKQYVKDGKLRYVANIFPYKGYIWNYGTLPQTWEDPHEKDKSTNCFGDNDPIDVCEIGSKILSCGEVIHVKILGILALIDEGETDWKLIAINANDPEASKFHDIDDVKKFKPGYLEATLNWFRLYKVPDGKPENQFAFNGEFKNKAFALEVIKSTHQCWKALLMKKCNGGAINCTNVQISDSPFRCTQEEARSLVESVSSSPNKESNEEEQVWHFLGK,334,NP_789845.1.csv,refseq-PPA2-NM_176869.2_clinical_seed_0_final,refseq-PPA2-NM_176869.2.a2m,Invitae,refseq-PPA2-NM_176869.2.npy,1,334,334
+NP_796374.2,MDTNRPGAFVLSSAPLAALHNMAEMKTSLFPYALQGPAGFKAPALGGLGAQLPLGTPHGISDILGRPVGAAGGGLLGGLPRLNGLASSAGVYFGPAAAVARGYPKPLAELPGRPPIFWPGVVQGAPWRDPRLAGPAPAGGVLDKDGKKKHSRPTFSGQQIFALEKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKRHAVEMASAKKKQDSDAEKLKVGGSDAEDDDEYNRPLDPNSDDEKITRLLKKHKPSNLALVSPCGGGAGDAL,277,NP_796374.2.csv,refseq-NKX6-2-NM_177400.3_clinical_seed_0_final,refseq-NKX6-2-NM_177400.3.a2m,Invitae,refseq-NKX6-2-NM_177400.3.npy,1,277,277
+NP_803187.1,MKSPALQPLSMAGLQLMTPASSPMGPFFGLPWQQEAIHDNIYTPRKYQVELLEAALDHNTIVCLNTGSGKTFIAVLLTKELSYQIRGDFSRNGKRTVFLVNSANQVAQQVSAVRTHSDLKVGEYSNLEVNASWTKERWNQEFTKHQVLIMTCYVALNVLKNGYLSLSDINLLVFDECHLAILDHPYREIMKLCENCPSCPRILGLTASILNGKCDPEELEEKIQKLEKILKSNAETATDLVVLDRYTSQPCEIVVDCGPFTDRSGLYERLLMELEEALNFINDCNISVHSKERDSTLISKQILSDCRAVLVVLGPWCADKVAGMMVRELQKYIKHEQEELHRKFLLFTDTFLRKIHALCEEHFSPASLDLKFVTPKVIKLLEILRKYKPYERQQFESVEWYNNRNQDNYVSWSDSEDDDEDEEIEEKEKPETNFPSPFTNILCGIIFVERRYTAVVLNRLIKEAGKQDPELAYISSNFITGHGIGKNQPRNKQMEAEFRKQEEVLRKFRAHETNLLIATSIVEEGVDIPKCNLVVRFDLPTEYRSYVQSKGRARAPISNYIMLADTDKIKSFEEDLKTYKAIEKILRNKCSKSVDTGETDIDPVMDDDDVFPPYVLRPDDGGPRVTINTAIGHINRYCARLPSDPFTHLAPKCRTRELPDGTFYSTLYLPINSPLRASIVGPPMSCVRLAERVVALICCEKLHKIGELDDHLMPVGKETVKYEEELDLHDEEETSVPGRPGSTKRRQCYPKAIPECLRDSYPRPDQPCYLYVIGMVLTTPLPDELNFRRRKLYPPEDTTRCFGILTAKPIPQIPHFPVYTRSGEVTISIELKKSGFMLSLQMLELITRLHQYIFSHILRLEKPALEFKPTDADSAYCVLPLNVVNDSSTLDIDFKFMEDIEKSEARIGIPSTKYTKETPFVFKLEDYQDAVIIPRYRNFDQPHRFYVADVYTDLTPLSKFPSPEYETFAEYYKTKYNLDLTNLNQPLLDVDHTSSRLNLLTPRHLNQKGKALPLSSAEKRKAKWESLQNKQILVPELCAIHPIPASLWRKAVCLPSILYRLHCLLTAEELRAQTASDAGVGVRSLPADFRYPNLDFGWKKSIDSKSFISISNSSSAENDNYCKHSTIVPENAAHQGANRTSSLENHDQMSVNCRTLLSESPGKLHVEVSADLTAINGLSYNQNLANGSYDLANRDFCQGNQLNYYKQEIPVQPTTSYSIQNLYSYENQPQPSDECTLLSNKYLDGNANKSTSDGSPVMAVMPGTTDTIQVLKGRMDSEQSPSIGYSSRTLGPNPGLILQALTLSNASDGFNLERLEMLGDSFLKHAITTYLFCTYPDAHEGRLSYMRSKKVSNCNLYRLGKKKGLPSRMVVSIFDPPVNWLPPGYVVNQDKSNTDKWEKDEMTKDCMLANGKLDEDYEEEDEEEESLMWRAPKEEADYEDDFLEYDQEHIRFIDNMLMGSGAFVKKISLSPFSTTDSAYEWKMPKKSSLGSMPFSSDFEDFDYSSWDAMCYLDPSKAVEEDDFVVGFWNPSEENCGVDTGKQSISYDLHTEQCIADKSIADCVEALLGCYLTSCGERAAQLFLCSLGLKVLPVIKRTDREKALCPTRENFNSQQKNLSVSCAAASVASSRSSVLKDSEYGCLKIPPRCMFDHPDADKTLNHLISGFENFEKKINYRFKNKAYLLQAFTHASYHYNTITDCYQRLEFLGDAILDYLITKHLYEDPRQHSPGVLTDLRSALVNNTIFASLAVKYDYHKYFKAVSPELFHVIDDFVQFQLEKNEMQGMDSELRRSEEDEEKEEDIEVPKAMGDIFESLAGAIYMDSGMSLETVWQVYYPMMRPLIEKFSANVPRSPVRELLEMEPETAKFSPAERTYDGKVRVTVEVVGKGKFKGVGRSYRIAKSAAARRALRSLKANQPQVPNS,1922,NP_803187.1.csv,refseq-DICER1-NM_177438.2_clinical_seed_0_final,refseq-DICER1-NM_177438.2.a2m,Invitae,refseq-DICER1-NM_177438.2.npy,1,1922,1922
+NP_808592.2,MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDLPPYKRWHELMLDKAPVLKVIVNSLKNMINTFVPSGKIMQVVDEKLPGLLGNFPGPFEEEMKGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTWVITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGILEWILGKKDVMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRDRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW,395,NP_808592.2.csv,refseq-ASAH1-NM_177924.3_clinical_seed_0_final,refseq-ASAH1-NM_177924.3.a2m,Invitae,refseq-ASAH1-NM_177924.3.npy,1,395,395
+NP_808807.2,MAAAAVTGQRPETAAAEEASRPQWAPPDHCQAQAAAGLGDGEDAPVRPLCKPRGICSRAYFLVLMVFVHLYLGNVLALLLFVHYSNGDESSDPGPQHRAQGPGPEPTLGPLTRLEGIKVGHERKVQLVTDRDHFIRTLSLKPLLFEIPGFLTDEECRLIIHLAQMKGLQRSQILPTEEYEEAMSTMQVSQLDLFRLLDQNRDGHLQLREVLAQTRLGNGWWMTPESIQEMYAAIKADPDGDGVLSLQEFSNMDLRDFHKYMRSHKAESSELVRNSHHTWLYQGEGAHHIMRAIRQRVLRLTRLSPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRQVSPNWGLPSILRPGTPMTQAQPCTVGVPLGMGPGDHWVIPVSPWEHPQLGTCSVPPLPYSYMTVLFYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPQQGTAVFWYNYLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWINVDPSRARQALFQQEMARLAREGGTDSQPEWALDRAYRDARVEL,563,NP_808807.2.csv,refseq-P4HTM-NM_177938.2_clinical_seed_0_final,refseq-P4HTM-NM_177938.2.a2m,Invitae,refseq-P4HTM-NM_177938.2.npy,1,563,563
+NP_808880.1,MAEDLDELLDEVESKFCTPDLLRRGMVEQPKGCGGGTHSSDRNQAKAKETLRSTETFKKEDDLDSLINEILEEPNLDKKPSKLKSKSSGNTSVRASIEGLGKSCSPVYLGGSSIPCGIGTNISWRACDHLRCIACDFLVVSYDDYMWDKSCDYLFFRNNMPEFHKLKAKLIKKKGTRAYACQCSWRTIEEVTDLQTDHQLRWVCGKH,207,NP_808880.1.csv,refseq-C8orf37-NM_177965.3_clinical_seed_0_final,refseq-C8orf37-NM_177965.3.a2m,Invitae,refseq-C8orf37-NM_177965.3.npy,1,207,207
+NP_814444.1,MDGPAEPQIPGLWDTYEDDISEISQKLPGEYFRYKGVPFPVGLYSLESISLAENTQDVRDDDIFIITYPKSGTTWMIEIICLILKEGDPSWIRSVPIWERAPWCETIVGAFSLPDQYSPRLMSSHLPIQIFTKAFFSSKAKVIYMGRNPRDVVVSLYHYSKIAGQLKDPGTPDQFLRDFLKGEVQFGSWFDHIKGWLRMKGKDNFLFITYEELQQDLQGSVERICGFLGRPLGKEALGSVVAHSTFSAMKANTMSNYTLLPPSLLDHRRGAFLRKGVCGDWKNHFTVAQSEAFDRAYRKQMRGMPTFPWDEDPEEDGSPDPEPSPEPEPKPSLEPNTSLEREPRPNSSPSPSPGQASETPHPRPS,365,NP_814444.1.csv,refseq-SULT2B1-NM_177973.1_clinical_seed_0_final,refseq-SULT2B1-NM_177973.1.a2m,Invitae,refseq-SULT2B1-NM_177973.1.npy,1,365,365
+NP_816931.1,MGLLDRLSVLLGLKKKEVHVLCLGLDNSGKTTIINKLKPSNAQSQNILPTIGFSIEKFKSSSLSFTVFDMSGQGRYRNLWEHYYKEGQAIIFVIDSSDRLRMVVAKEELDTLLNHPDIKHRRIPILFFANKMDLRDAVTSVKVSQLLCLENIKDKPWHICASDAIKGEGLQEGVDWLQDQIQTVKT,186,NP_816931.1.csv,refseq-ARL6-NM_177976.2_clinical_seed_0_final,refseq-ARL6-NM_177976.2.a2m,Invitae,refseq-ARL6-NM_177976.2.npy,1,186,186
+NP_821080.1,MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGSYHGDSDLQLERINVYYNEATGNKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKESESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVMPSPKVSDTVVEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDSKNMMAACDPRHGRYLTVAAIFRGRMSMKEVDEQMLNVQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMSATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATADEQGEFEEEEGEDEA,445,NP_821080.1.csv,refseq-TUBB2B-NM_178012.4_clinical_seed_0_final,refseq-TUBB2B-NM_178012.4.a2m,Invitae,refseq-TUBB2B-NM_178012.4.npy,1,445,445
+NP_821133.1,MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQLDRISVYYNEATGGKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVVEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQVFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLNVQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMAVTFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATAEEEEDFGEEAEEEA,444,NP_821133.1.csv,refseq-TUBB-NM_178014.3_clinical_seed_0_final,refseq-TUBB-NM_178014.3.a2m,Invitae,refseq-TUBB-NM_178014.3.npy,1,444,444
+NP_835366.1,MELDFGHFDERDKTSRNMRGSRMNGLPSPTHSAHCSFYRTRTLQALSNEKKAKKVRFYRNGDRYFKGIVYAVSSDRFRSFDALLADLTRSLSDNINLPQGVRYIYTIDGSRKIGSMDELEEGESYVCSSDNFFKKVEYTKNVNPNWSVNVKTSANMKAPQSLASSNSAQARENKDFVRPKLVTIIRSGVKPRKAVRVLLNKKTAHSFEQVLTDITEAIKLETGVVKKLYTLDGKQVTCLHDFFGDDDVFIACGPEKFRYAQDDFSLDENECRVMKGNPSATAGPKASPTPQKTSAKSPGPMRRSKSPADSANGTSSSQLSTPKSKQSPISTPTSPGSLRKHKDLYLPLSLDDSDSLGDSM,360,NP_835366.1.csv,refseq-DCX-NM_178153.2_clinical_seed_0_final,refseq-DCX-NM_178153.2.a2m,Invitae,refseq-DCX-NM_178153.2.npy,1,360,360
+NP_835455.1,MDAVLLEHFPGGLDAFPSSYFDEDDFFTDQSSRDPLEDGDELLADEQAEVEFLSHQLHEYCYRDGACLLLQPAPPAAPLALAPPSSGGLGEPDDGGGGGYCCETGAPPGGFPYSPGSPPSCLAYPCAGAAVLSPGARLRGLSGAAAAAARRRRRVRSEAELQQLRQAANVRERRRMQSINDAFEGLRSHIPTLPYEKRLSKVDTLRLAIGYINFLSELVQADLPLRGGGAGGCGGPGGGGRLGGDSPGSQAQKVIICHRGTRSPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTPEDPRKLNSKSSFNNIENEPPFEFVS,328,NP_835455.1.csv,refseq-PTF1A-NM_178161.2_clinical_seed_0_final,refseq-PTF1A-NM_178161.2.a2m,Invitae,refseq-PTF1A-NM_178161.2.npy,1,328,328
+NP_835466.2,MKALGAVLLALLLFGRPGRGQTQQEEEEEDEDHGPDDYDEEDEDEVEEEETNRLPGGRSRVLLRCYTCKSLPRDERCNLTQNCSHGQTCTTLIAHGNTESGLLTTHSTWCTDSCQPITKTVEGTQVTMTCCQSSLCNVPPWQSSRVQDPTGKGAGGPRGSSETVGAALLLNLLAGLGAMGARRP,184,NP_835466.2.csv,NP_835466.2_colabfold_clinical_seed_0_final,NP_835466.2_colabfold.a2m,colabfold,NP_835466.2_colabfold_theta_0.2.npy,1,184,184
+NP_848547.4,MHPEPSEPATGGAAELDCAQEPGVEESAGDHGSAGRGGCKEEINDPKEICVGSSDTSYHSQQKQSGDNGSGGHFAHPREDREDRGPRMTKSSLQKLCKQHKLYITPALNDTLYLHFKGFDRIENLEEYTGLRCLWLQSNGIQKIENLEAQTELRCLFLQMNLLRKIENLEPLQKLDALNLSNNYIKTIENLSCLPVLNTLQMAHNHLETVEDIQHLQECLRLCVLDLSHNKLSDPEILSILESMPDLRVLNLMGNPVIRQIPNYRRTVTVRLKHLTYLDDRPVFPKDRACAEAWARGGYAAEKEERQQWESRERKKITDSIEALAMIKQRAEERKRQRESQERGEMTSSDDGENVPASAEGKEEPPGDRETRQKMELFVKESFEAKDELCPEKPSGEEPPVEAKREDGGPEPEGTLPAETLLLSSPVEVKGEDGDGEPEGTLPAEAPPPPPPVEVKGEDGDQEPEGTLPAETLLLSPPVKVKGEDGDREPEGTLPAEAPPPPPLGAAREEPTPQAVATEGVFVTELDGTRTEDLETIRLETKETFCIDDLPDLEDDDETGKSLEDQNMCFPKIEVISSLSDDSDPELDYTSLPVLENLPTDTLSNIFAVSKDTSKAARVPFTDIFKKEAKRDLEIRKQDTKSPRPLIQELSDEDPSGQLLMPPTCQRDAAPLTSSGDRDSDFLAASSPVPTESAATPPETCVGVAQPSQALPTWDLTAFPAPKAS,725,NP_848547.4.csv,refseq-DNAAF1-NM_178452.4_clinical_seed_0_final,refseq-DNAAF1-NM_178452.4.a2m,Invitae,refseq-DNAAF1-NM_178452.4.npy,1,725,725
+NP_848549.3,MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAVLCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHWNPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTRLLSRDI,266,NP_848549.3.csv,refseq-DRAM2-NM_178454.4_clinical_seed_0_final,refseq-DRAM2-NM_178454.4.a2m,Invitae,refseq-DRAM2-NM_178454.4.npy,1,266,266
+NP_848612.2,MSEKQMKEAFVSNLNGTTVLEITQGLCFPAFCILCRGFLIIFSQYLCSFSPTWKTRFLTDFVVLIVPMVATLTIWASFILLELLGVIIFGAGLLYQIYRRRTCYARLPFLKILEKFLNISLESEYNPAISCFRVITSAFTAIAILAVDFPLFPRRFAKTELYGTGAMDFGVGGFVFGSAMVCLEVRRRKYMEGSKLHYFTNSLYSVWPLVFLGIGRLAIIKSIGYQEHLTEYGVHWNFFFTIIVVKLITPLLLIIFPLNKSWIIALGITVLYQLALDFTSLKRLILYGTDGSGTRVGLLNANREGIISTLGYVAIHMAGVQTGLYMHKNRSHIKDLIKVACFLLLAAISLFISLYVVQVNVEAVSRRMANLAFCIWIVASSLILLSSLLLGDIILSFAKFLIKGALVPCSWKLIQSPVTNKKHSESLVPEAERMEPSLCLITALNRKQLIFFLLSNITTGLINLMVDTLHSSTLWALFVVNLYMFSNCLIVYVLYLQDKTVQFW,504,NP_848612.2.csv,refseq-PIGW-NM_178517.3_clinical_seed_0_final,refseq-PIGW-NM_178517.3.a2m,Invitae,refseq-PIGW-NM_178517.3.npy,1,504,504
+NP_848621.2,MGNGVKEGPVRLHEDAEAVLSSSVSSKRDHRQVLSSLLSGALAGALAKTAVAPLDRTKIIFQVSSKRFSAKEAFRVLYYTYLNEGFLSLWRGNSATMVRVVPYAAIQFSAHEEYKRILGSYYGFRGEALPPWPRLFAGALAGTTAASLTYPLDLVRARMAVTPKEMYSNIFHVFIRISREEGLKTLYHGFMPTVLGVIPYAGLSFFTYETLKSLHREYSGRRQPYPFERMIFGACAGLIGQSASYPLDVVRRRMQTAGVTGYPRASIARTLRTIVREEGAVRGLYKGLSMNWVKGPIAVGISFTTFDLMQILLRHLQS,318,NP_848621.2.csv,refseq-SLC25A42-NM_178526.4_clinical_seed_0_final,refseq-SLC25A42-NM_178526.4.a2m,Invitae,refseq-SLC25A42-NM_178526.4.npy,1,318,318
+NP_849164.2,MFWTFKEWFWLERFWLPPTIKWSDLEDHDGLVFVKPSHLYVTIPYAFLLLIIRRVFEKFVASPLAKSFGIKETVRKVTPNTVLENFFKHSTRQPLQTDIYGLAKKCNLTERQVERWFRSRRNQERPSRLKKFQEACWRFAFYLMITVAGIAFLYDKPWLYDLWEVWNGYPKQPLLPSQYWYYILEMSFYWSLLFRLGFDVKRKDFLAHIIHHLAAISLMSFSWCANYIRSGTLVMIVHDVADIWLESAKMFSYAGWTQTCNTLFFIFSTIFFISRLIVFPFWILYCTLILPMYHLEPFFSYIFLNLQLMILQVLHLYWGYYILKMLNRCIFMKSIQDVRSDDEDYEEEEEEEEEEATKGKEMDCLKNGLRAERHLIPNGQHGH,383,NP_849164.2.csv,refseq-CERS3-NM_178842.4_clinical_seed_0_final,refseq-CERS3-NM_178842.4.a2m,Invitae,refseq-CERS3-NM_178842.4.npy,1,383,383
+NP_851853.1,MAGLSDLELRRELQALGFQPGPITDTTRDVYRNKLRRLRGEARLRDEERLREEARPRGEERLREEARLREDAPLRARPAAASPRAEPWLSQPASGSAYATPGAYGDIRPSAASWVGSRGLAYPARPAQLRRRASVRGSSEEDEDARTPDRATQGPGLAARRWWAASPAPARLPSSLLGPDPRPGLRATRAGPAGAARARPEVGRRLERWLSRLLLWASLGLLLVFLGILWVKMGKPSAPQEAEDNMKLLPVDCERKTDEFCQAKQKAALLELLHELYNFLAIQAGNFECGNPENLKSKCIPVMEAQEYIANVTSSSSAKFEAALTWILSSNKDVGIWLKGEDQSELVTTVDKVVCLESAHPRMGVGCRLSRALLTAVTNVLIFFWCLAFLWGLLILLKYRWRKLEEEEQAMYEMVKKIIDVVQDHYVDWEQDMERYPYVGILHVRDSLIPPQSRRRMKRVWDRAVEFLASNESRIQTESHRVAGEDMLVWRWTKPSSFSDSER,503,NP_851853.1.csv,refseq-LEMD2-NM_181336.3_clinical_seed_0_final,refseq-LEMD2-NM_181336.3.a2m,Invitae,refseq-LEMD2-NM_181336.3.npy,1,503,503
+NP_852107.1,MFQLPVNNLGSLRKARKTVKKILSDIGLEYCKEHIEDFKQFEPNDFYLKNTTWEDVGLWDPSLTKNQDYRTKPFCCSACPFSSKFFSAYKSHFRNVHSEDFENRILLNCPYCTFNADKKTLETHIKIFHAPNASAPSSSLSTFKDKNKNDGLKPKQADSVEQAVYYCKKCTYRDPLYEIVRKHIYREHFQHVAAPYIAKAGEKSLNGAVPLGSNAREESSIHCKRCLFMPKSYEALVQHVIEDHERIGYQVTAMIGHTNVVVPRSKPLMLIAPKPQDKKSMGLPPRIGSLASGNVRSLPSQQMVNRLSIPKPNLNSTGVNMMSSVHLQQNNYGVKSVGQGYSVGQSMRLGLGGNAPVSIPQQSQSVKQLLPSGNGRSYGLGSEQRSQAPARYSLQSANASSLSSGQLKSPSLSQSQASRVLGQSSSKPAAAATGPPPGNTSSTQKWKICTICNELFPENVYSVHFEKEHKAEKVPAVANYIMKIHNFTSKCLYCNRYLPTDTLLNHMLIHGLSCPYCRSTFNDVEKMAAHMRMVHIDEEMGPKTDSTLSFDLTLQQGSHTNIHLLVTTYNLRDAPAESVAYHAQNNPPVPPKPQPKVQEKADIPVKSSPQAAVPYKKDVGKTLCPLCFSILKGPISDALAHHLRERHQVIQTVHPVEKKLTYKCIHCLGVYTSNMTASTITLHLVHCRGVGKTQNGQDKTNAPSRLNQSPSLAPVKRTYEQMEFPLLKKRKLDDDSDSPSFFEEKPEEPVVLALDPKGHEDDSYEARKSFLTKYFNKQPYPTRREIEKLAASLWLWKSDIASHFSNKRKKCVRDCEKYKPGVLLGFNMKELNKVKHEMDFDAEWLFENHDEKDSRVNASKTADKKLNLGKEDDSSSDSFENLEEESNESGSPFDPVFEVEPKISNDNPEEHVLKVIPEDASESEEKLDQKEDGSKYETIHLTEEPTKLMHNASDSEVDQDDVVEWKDGASPSESGPGSQQVSDFEDNTCEMKPGTWSDESSQSEDARSSKPAAKKKATMQGDREQLKWKNSSYGKVEGFWSKDQSQWKNASENDERLSNPQIEWQNSTIDSEDGEQFDNMTDGVAEPMHGSLAGVKLSSQQA,1102,NP_852107.1.csv,refseq-ADNP-NM_181442.4_clinical_seed_0_final,refseq-ADNP-NM_181442.4.a2m,Invitae,refseq-ADNP-NM_181442.4.npy,1,1102,1102
+NP_852122.1,MTTLAGAVPRMMRPGPGQNYPRSGFPLEVSTPLGQGRVNQLGGVFINGRPLPNHIRHKIVEMAHHGIRPCVISRQLRVSHGCVSKILCRYQETGSIRPGAIGGSKPKQVTTPDVEKKIEEYKRENPGMFSWEIRDKLLKDAVCDRNTVPSVSSISRILRSKFGKGEEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTTFTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGANQLMAFNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTVHQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVMGLLTNHGGVPHQPQTDYALSPLTGGLEPTTTVSASCSQRLDHMKSLDSLPTSQSYCPPTYSTTGYSMDPVTGYQYGQYGQSKPWTF,479,NP_852122.1.csv,refseq-PAX3-NM_181457.3_clinical_seed_0_final,refseq-PAX3-NM_181457.3.a2m,Invitae,refseq-PAX3-NM_181457.3.npy,1,479,479
+NP_852664.1,MSAEGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEARPEEIGWLNGYNETTGERGDFPGTYVEYIGRKKISPPTPKPRPPRPLPVAPGSSKTEADVEQQALTLPDLAEQFAPPDIAPPLLIKLVEAIEKKGLECSTLYRTQSSSNLAELRQLLDCDTPSVDLEMIDVHVLADAFKRYLLDLPNPVIPAAVYSEMISLAPEVQSSEEYIQLLKKLIRSPSIPHQYWLTLQYLLKHFFKLSQTSSKNLLNARVLSEIFSPMLFRFSAASSDNTENLIKVIEILISTEWNERQPAPALPPKPPKPTTVANNGMNNNMSLQDAEWYWGDISREEVNEKLRDTADGTFLVRDASTKMHGDYTLTLRKGGNNKLIKIFHRDGKYGFSDPLTFSSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQDQVVKEDNIEAVGKKLHEYNTQFQEKSREYDRLYEEYTRTSQEIQMKRTAIEAFNETIKIFEEQCQTQERYSKEYIEKFKREGNEKEIQRIMHNYDKLKSRISEIIDSRRRLEEDLKKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSLVEDDEDLPHHDEKTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVINKTATGYGFAEPYNLYSSLKELVLHYQHTSLVQHNDSLNVTLAYPVYAQQRR,724,NP_852664.1.csv,refseq-PIK3R1-NM_181523.2_clinical_seed_0_final,refseq-PIK3R1-NM_181523.2.a2m,Invitae,refseq-PIK3R1-NM_181523.2.npy,1,724,724
+NP_853512.1,MSLRLSSASRRSCPRPTTGSLRLYGGGTSFGTGNSCGISGIGSGFSSAFGGSSSGGNTGGGNPCAGFTVNERGLLSGNEKVTMQNLNDRLASYLDSVHALEEANADLEQKIKGWYEKFGPGSCRGLDHDYSRYFPIIDDLKNQIIASTTSNANAVLQIDNARLTADDFRLKYENELALHQSVEADVNGLRRVLDEITLCRTDLEIQYETLSEEMTYLKKNHKEEMQVLQCAAGGNVNVEMNAAPGVDLTVLLNNMRAEYEALAEQNRRDAEAWFNEKSASLQQQISEDVGATTSARNELTEMKRTLQTLEIELQSLLATKHSLECSLTETESNYCAQLAQIQAQIGALEEQLHQVRTETEGQKLEYEQLLDIKLHLEKEIETYCLLIGGDDGACKSGGYKSKDYGSGNVGSQVKDPAKAIVVKKVLEEVDQRSKILTTRLHSLEEKSQSN,450,NP_853512.1.csv,refseq-KRT25-NM_181534.3_clinical_seed_0_final,refseq-KRT25-NM_181534.3.a2m,Invitae,refseq-KRT25-NM_181534.3.npy,1,450,450
+NP_858058.1,MASSVGNVADSTEPTKRMLSFQGLAELAHREYQAGDFEAAERHCMQLWRQEPDNTGVLLLLSSIHFQCRRLDRSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAYVSALQYNPDLYCVRSDLGNLLKALGRLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLGNVLKEARIFDRAVAAYLRALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPTHADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYDERMKKLVSIVADQLEKNRLPSVHPHHSMLYPLSHGFRKAIAERHGNLCLDKINVLHKPPYEHPKDLKLSDGRLRVGYVSSDFGNHPTSHLMQSIPGMHNPDKFEVFCYALSPDDGTNFRVKVMAEANHFIDLSQIPCNGKAADRIHQDGIHILVNMNGYTKGARNELFALRPAPIQAMWLGYPGTSGALFMDYIITDQETSPAEVAEQYSEKLAYMPHTFFIGDHANMFPHLKKKAVIDFKSNGHIYDNRIVLNGIDLKAFLDSLPDVKIVKMKCPDGGDNADSSNTALNMPVIPMNTIAEAVIEMINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFNQLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLELIAKNRQEYEDIAVKLGTDLEYLKKVRGKVWKQRISSPLFNTKQYTMELERLYLQMWEHYAAGNKPDHMIKPVEVTESA,1046,NP_858058.1.csv,refseq-OGT-NM_181672.2_clinical_seed_0_final,refseq-OGT-NM_181672.2.a2m,Invitae,refseq-OGT-NM_181672.2.npy,1,1046,1046
+NP_859056.2,MGRAVKVLQLFKTLHRTRQQVFKNDARALEAARIKINEEFKNNKSETSSKKIEELMKIGSDVELLLRTSVIQGIHTDHNTLKLVPRKDLLVENVPYCDAPTQKQ,104,NP_859056.2.csv,refseq-LYRM7-NM_181705.3_clinical_seed_0_final,refseq-LYRM7-NM_181705.3.a2m,Invitae,refseq-LYRM7-NM_181705.3.npy,1,104,104
+NP_859065.2,MGERAGSPGTDQERKAGKHHYSYLSDFETPQSSGRSSLVSSSPASVRRKNPKRQTSDGQVHHQAPRKPSPKGLPNRKGVRVGFRSQSLNREPLRKDTDLVTKRILSARLLKINELQNEVSELQVKLAELLKENKSLKRLQYRQEKALNKFEDAENEISQLIFRHNNEITALKERLRKSQEKERATEKRVKDTESELFRTKFSLQKLKEISEARHLPERDDLAKKLVSAELKLDDTERRIKELSKNLELSTNSFQRQLLAERKRAYEAHDENKVLQKEVQRLYHKLKEKERELDIKNIYSNRLPKSSPNKEKELALRKNAACQSDFADLCTKGVQTMEDFKPEEYPLTPETIMCYENKWEEPGHLTLDLQSQKQDRHGEAGILNPIMEREEKFVTDEELHVVKQEVEKLEDEWEREELDKKQKEKASLLEREEKPEWETGRYQLGMYPIQNMDKLQGEEEERLKREMLLAKLNEIDRELQDSRNLKYPVLPLLPDFESKLHSPERSPKTYRFSESSERLFNGHHLQDISFSTPKGEGQNSGNVRSPASPNEFAFGSYVPSFAKTSERSNPFSQKSSFLDFQRNSMEKLSKDGVDLITRKEKKANLMEQLFGASGSSTISSKSSDPNSVASSKGDIDPLNFLPGNKGSRDQEHDEDEGFFLSEGRSFNPNRHRLKHADDKPAVKAADSVEDEIEEVALR,697,NP_859065.2.csv,refseq-LCA5-NM_181714.3_clinical_seed_0_final,refseq-LCA5-NM_181714.3.a2m,Invitae,refseq-LCA5-NM_181714.3.npy,1,697,697
+NP_861448.2,MANINLKEITLIVGVVTACYWNSLFCGFVFDDVSAILDNKDLHPSTPLKTLFQNDFWGTPMSEERSHKSYRPLTVLTFRLNYLLSELKPMSYHLLNMIFHAVVSVIFLKVCKLFLDNKSSVIASLLFAVHPIHTEAVTGVVGRAELLSSIFFLAAFLSYTRSKGPDNSIIWTPIALTVFLVAVATLCKEQGITVVGICCVYEVFIAQGYTLPLLCTTAGQFLRGKGSIPFSMLQTLVKLIVLMFSTLLLVVIRVQVIQSQLPVFTRFDNPAAVSPTPTRQLTFNYLLPVNAWLLLNPSELCCDWTMGTIPLIESLLDIRNLATFTFFCFLGMLGVFSIRYSGDSSKTVLMALCLMALPFIPASNLFFPVGFVVAERVLYVPSMGFCILVAHGWQKISTKSVFKKLSWICLSMVILTHSLKTFHRNWDWESEYTLFMSALKVNKNNAKLWNNVGHALENEKNFERALKYFLQATHVQPDDIGAHMNVGRTYKNLNRTKEAEESYMMAKSLMPQIIPGKKYAARIAPNHLNVYINLANLIRANESRLEEADQLYRQAISMRPDFKQAYISRGELLLKMNKPLKAKEAYLKALELDRNNADLWYNLAIVHIELKEPNEALKNFNRALELNPKHKLALFNSAIVMQESGEVKLRPEARKRLLSYINEEPLDANGYFNLGMLAMDDKKDNEAEIWMKKAIKLQADFRSALFNLALLYSQTAKELKALPILEELLRYYPDHIKGLILKGDILMNQKKDILGAKKCFERILEMDPSNVQGKHNLCVVYFEEKDLLKAERCLLETLALAPHEEYIQRHLNIVRDKISSSSFIEPIFPTSKISSVEGKKIPTESVKEIRGESRQTQIVKTSDNKSQSKSNKQLGKNGDEETPHKTTKDIKEIEKKRVAALKRLEEIERILNGE,914,NP_861448.2.csv,NP_861448.2_clinical_seed_0_final,NP_861448.2.a2m,popEVE,NP_861448.2_theta_0.2.npy,1,914,914
+NP_861454.2,MARGAEGGRGDAGWGLRGALAAVALLSALNAAGTVFALCQWRGLSSALRALEAQRGREQREDSALRSFLAELSRAPRGASAPPQDPASSARNKRSHSGEPAPHIRAESHDMLMMMTYSMVPIRVMVDLCNSTKGICLTGPSGPPGPPGAGGLPGHNGLDGQPGPQGPKGEKGANGKRGKMGIPGAAGNPGERGEKGDHGELGLQGNEGPPGQKGEKGDKGDVSNDVLLAGAKGDQGPPGPPGPPGPPGPPGPPGSRRAKGPRQPSMFNGQCPGETCAIPNDDTLVGKADEKASEHHSPQAESMITSIGNPVQVLKVTETFGTWIRESANKSDDRIWVTEHFSGIMVKEFKDQPSLLNGSYTFIHLPYYFHGCGHVVYNNSLYYHKGGSNTLVRFEFGQETSQTLKLENALYFDRKYLFANSKTYFNLAVDEKGLWIIYASSVDGSSILVAQLDERTFSVVQHVNTTYPKSKAGNAFIARGILYVTDTKDMRVTFAFDLLGGKQINANFDLRTSQSVLAMLAYNMRDQHLYSWEDGHLMLYPVQFLSTTLNQ,551,NP_861454.2.csv,refseq-GLDN-NM_181789.2_clinical_seed_0_final,refseq-GLDN-NM_181789.2.a2m,Invitae,refseq-GLDN-NM_181789.2.npy,1,551,551
+NP_862823.1,MEVSGHPQARRCCPEALGKLFPGLCFLCFLVTYALVGAVVFSAIEDGQVLVAADDGEFEKFLEELCRILNCSETVVEDRKQDLQGHLQKVKPQWFNRTTHWSFLSSLFFCCTVFSTVGYGYIYPVTRLGKYLCMLYALFGIPLMFLVLTDTGDILATILSTSYNRFRKFPFFTRPLLSKWCPKSLFKKKPDPKPADEAVPQIIISAEELPGPKLGTCPSRPSCSMELFERSHALEKQNTLQLPPQAMERSNSCPELVLGRLSYSIISNLDEVGQQVERLDIPLPIIALIVFAYISCAAAILPFWETQLDFENAFYFCFVTLTTIGFGDTVLEHPNFFLFFSIYIIVGMEIVFIAFKLVQNRLIDIYKNVMLFFAKGKFYHLVKK,384,NP_862823.1.csv,refseq-KCNK18-NM_181840.1_clinical_seed_0_final,refseq-KCNK18-NM_181840.1.a2m,Invitae,refseq-KCNK18-NM_181840.1.npy,1,384,384
+NP_870998.2,MEARSRSAEELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVRELREDSPAARSLSLQEGDQLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDLALRPGTVSGYEIKGPRAKVAKLNIQSLSPVKKKKMVPGALGVPADLAPVDVEFSFPKFSRLRRGLKAEAVKGPVPAAPARRRLQLPRLRVREVAEEAQAARLAAAAPPPRKAKVEAEVAAGARFTAPQVELVGPRLPGAEVGVPQVSAPKAAPSAEAAGGFALHLPTLGLGAPAPPAVEAPAVGIQVPQVELPALPSLPTLPTLPCLETREGAVSVVVPTLDVAAPTVGVDLALPGAEVEARGEAPEVALKMPRLSFPRFGARAKEVAEAKVAKVSPEARVKGPRLRMPTFGLSLLEPRPAAPEVVESKLKLPTIKMPSLGIGVSGPEVKVPKGPEVKLPKAPEVKLPKVPEAALPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVQLLKVSEMKLPKVPEMAVPEVRLPEVQLPKVSEMKLPEVSEVAVPEVRLPEVQLPKVPEMKVPEMKLPKVPEMKLPEMKLPEVQLPKVPEMAVPDVHLPEVQLPKVPEMKLPEMKLPEVKLPKVPEMAVPDVHLPEVQLPKVPEMKLPKMPEMAVPEVRLPEVQLPKVSEMKLPKVPEMAVPDVHLPEVQLPKVCEMKVPDMKLPEIKLPKVPEMAVPDVHLPEVQLPKVSEIRLPEMQVPKVPDVHLPKAPEVKLPRAPEVQLKATKAEQAEGMEFGFKMPKMTMPKLGRAESPSRGKPGEAGAEVSGKLVTLPCLQPEVDGEAHVGVPSLTLPSVELDLPGALGLQGQVPAAKMGKGERVEGPEVAAGVREVGFRVPSVEIVTPQLPAVEIEEGRLEMIETKVKPSSKFSLPKFGLSGPKVAKAEAEGAGRATKLKVSKFAISLPKARVGAEAEAKGAGEAGLLPALDLSIPQLSLDAHLPSGKVEVAGADLKFKGPRFALPKFGVRGRDTEAAELVPGVAELEGKGWGWDGRVKMPKLKMPSFGLARGKEAEVQGDRASPGEKAESTAVQLKIPEVELVTLGAQEEGRAEGAVAVSGMQLSGLKVSTAGQVVTEGHDAGLRMPPLGISLPQVELTGFGEAGTPGQQAQSTVPSAEGTAGYRVQVPQVTLSLPGAQVAGGELLVGEGVFKMPTVTVPQLELDVGLSREAQAGEAATGEGGLRLKLPTLGARARVGGEGAEEQPPGAERTFCLSLPDVELSPSGGNHAEYQVAEGEGEAGHKLKVRLPRFGLVRAKEGAEEGEKAKSPKLRLPRVGFSQSEMVTGEGSPSPEEEEEEEEEGSGEGASGRRGRVRVRLPRVGLAAPSKASRGQEGDAAPKSPVREKSPKFRFPRVSLSPKARSGSGDQEEGGLRVRLPSVGFSETGAPGPARMEGAQAAAV,1461,NP_870998.2.csv,refseq-PRX-NM_181882.2_clinical_seed_0_final,refseq-PRX-NM_181882.2.a2m,Invitae,refseq-PRX-NM_181882.2.npy,1,1461,1461
+NP_872282.1,MAARLVSRCGAVRAAPHSGPLVSWRRWSGASTDTVYDVVVSGGGLVGAAMACALGYDIHFHDKKILLLEAGPKKVLEKLSETYSNRVSSISPGSATLLSSFGAWDHICNMRYRAFRRMQVWDACSEALIMFDKDNLDDMGYIVENDVIMHALTKQLEAVSDRVTVLYRSKAIRYTWPCPFPMADSSPWVHITLGDGSTFQTKLLIGADGHNSGVRQAVGIQNVSWNYDQSAVVATLHLSEATENNVAWQRFLPSGPIALLPLSDTLSSLVWSTSHEHAAELVSMDEEKFVDAVNSAFWSDADHTDFIDTAGAMLQYAVSLLKPTKVSARQLPPSVARVDAKSRVLFPLGLGHAAEYVRPRVALIGDAAHRVHPLAGQGVNMGFGDISSLAHHLSTAAFNGKDLGSVSHLTGYETERQRHNTALLAATDLLKRLYSTSASPLVLLRTWGLQATNAVSPLKEQIMAFASK,468,NP_872282.1.csv,refseq-COQ6-NM_182476.2_clinical_seed_0_final,refseq-COQ6-NM_182476.2.a2m,Invitae,refseq-COQ6-NM_182476.2.npy,1,468,468
+NP_872354.1,MVKLLPAQEAAKIYHTNYVRNSRAVGVMWGTLTICFSVLVMALFIQPYWIGDSVNTPQAGYFGLFSYCVGNVLSSELICKGGPLDFSSIPSRAFKTAMFFVALGMFLIIGSIICFSLFFICNTATVYKICAWMQLAAATGLMIGCLVYPDGWDSSEVRRMCGEQTGKYTLGHCTIRWAFMLAILSIGDALILSFLAFVLGYRQDKLLPDDYKADGTEEV,219,NP_872354.1.csv,refseq-LHFPL5-NM_182548.3_clinical_seed_0_final,refseq-LHFPL5-NM_182548.3.a2m,Invitae,refseq-LHFPL5-NM_182548.3.npy,1,219,219
+NP_872621.1,MGTWILFACLLGAAFAMPLPPHPGHPGYINFSYENSHSQAINVDRTALVLTPLKWYQSIRPPYPSYGYEPMGGWLHHQIIPVLSQQHPPTHTLQPHHHIPVVPAQQPVIPQQPMMPVPGQHSMTPIQHHQPNLPPPAQQPYQPQPVQPQPHQPMQPQPPVHPMQPLPPQPPLPPMFPMQPLPPMLPDLTLEAWPSTDKTKREEVD,205,NP_872621.1.csv,refseq-AMELX-NM_182680.1_clinical_seed_0_final,refseq-AMELX-NM_182680.1.a2m,Invitae,refseq-AMELX-NM_182680.1.npy,1,205,205
+NP_877435.3,MRTSLQAVALWGQKAPPHSITAIMITDDQRTIVTGSQEGQLCLWNLSHELKISAKELLFGHSASVTCLARARDFSKQPYIVSAAENGEMCVWNVTNGQCMEKATLPYRHTAICYYHCSFRMTGEGWLLCCGEYQDVLIIDAKTLAVVHSFRSSQFPDWINCMCIVHSMRIQEDSLLVVSVAGELKVWDLSSSINSIQEKQDVYEKESKFLESLNCQTIRFCTYTERLLLVVFSKCWKVYDYCDFSLLLTEVSRNGQFFAGGEVIAAHRILIWTEDGHSYIYQLLNSGLSKSIYPADGRVLKETIYPHLLCSTSVQENKEQSRPFVMGYMNERKEPFYKVLFSGEVSGRITLWHIPDVPVSKFDGSPREIPVTATWTLQDNFDKHDTMSQSIIDYFSGLKDGAGTAVVTSSEYIPSLDKLICGCEDGTIIITQALNAAKARLLEGGSLVKDSPPHKVLKGHHQSVTSLLYPHGLSSKLDQSWMLSGDLDSCVILWDIFTEEILHKFFLEAGPVTSLLMSPEKFKLRGEQIICCVCGDHSVALLHLEGKSCLLHARKHLFPVRMIKWHPVENFLIVGCADDSVYIWEIETGTLERHETGERARIILNCCDDSQLVKSVLPIASETLKHKSIEQRSSSPYQLGPLPCPGLQVESSCKVTDAKFCPRPFNVLPVKTKWSNVGFHILLFDLENLVELLLPTPLSDVDSSSSFYGGEVLRRAKSTVEKKTLTLRKSKTACGPLSAEALAKPITESLAQGDNTIKFSEENDGIKRQKKMKISKKMQPKPSRKVDASLTIDTAKLFLSCLLPWGVDKDLDYLCIKHLNILKLQGPISLGISLNEDNFSLMLPGWDLCNSGMIKDYSGVNLFSRKVLDLSDKYTATLPNQVGIPRGLENNCDSLRESDTIVYLLSRLFLVNKLVNMPLELACRVGSSFRMESIHNKMRGAGNDILNMSSFYSCLRNGKNESHVPEADLSLLKLISCWRDQSVQVTEAIQAVLLAEVQQHMKSLGKIPVNSQPVSMAENGNCEMKQMLPKLEWTEELELQCVRNTLPLQTPVSPVKHDSNSNSANFQDVEDMPDRCALEESESPGEPRHHSWIAKVCPCKVS,1102,NP_877435.3.csv,WDR72_HUMAN_b01_clinical_seed_0_final,WDR72_HUMAN_b01.a2m,EVE,WDR72_HUMAN_b01_theta_0.2.npy,1,1102,1102
+NP_877437.2,MAAPALGLVCGRCPELGLVLLLLLLSLLCGAAGSQEAGTGAGAGSLAGSCGCGTPQRPGAHGSSAAAHRYSREANAPGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQAVAAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPTMD,374,NP_877437.2.csv,refseq-SUMF1-NM_182760.3_clinical_seed_0_final,refseq-SUMF1-NM_182760.3.a2m,Invitae,refseq-SUMF1-NM_182760.3.npy,1,374,374
+NP_878314.1,MTGKAGEALSKPKSETVAKSTSGGAPARCTGFGIQEILGLNKEPPSSHPRAALDGLAPGHLLAARSVLSPAGVGGMGLLGPGGLPGFYTQPTFLEVLSDPQSVHLQPLGRASGPLDTSQTASSDSEDVSSSDRKMSKSALNQTKKRKKRRHRTIFTSYQLEELEKAFNEAHYPDVYAREMLAMKTELPEDRIQVWFQNRRAKWRKREKCWGRSSVMAEYGLYGAMVRHSIPLPESILKSAKDGIMDSCAPWLLGMHKKSLEAAAESGRKPEGERQALPKLDKMEQDERGPDAQAAISQEELRENSIAVLRAKAQEHSTKVLGTVSGPDSLARSTEKPEEEEAMDEDRPAERLSPPQLEDMA,361,NP_878314.1.csv,refseq-VSX2-NM_182894.2_clinical_seed_0_final,refseq-VSX2-NM_182894.2.a2m,Invitae,refseq-VSX2-NM_182894.2.npy,1,361,361
+NP_886552.3,MLRCLYHWHRPVLNRRWSRLCLPKQYLFTMKLQSPEFQSLFTEGLKSLTELFVKENHELRIAGGAVRDLLNGVKPQDIDFATTATPTQMKEMFQSAGIRMINNRGEKHGTITARLHEENFEITTLRIDVTTDGRHAEVEFTTDWQKDAERRDLTINSMFLGFDGTLFDYFNGYEDLKNKKVRFVGHAKQRIQEDYLRILRYFRFYGRIVDKPGDHDPETLEAIAENAKGLAGISGERIWVELKKILVGNHVNHLIHLIYDLDVAPYIGLPANASLEEFDKVSKNVDGFSPKPVTLLASLFKVQDDVTKLDLRLKIAKEEKNLGLFIVKNRKDLIKATDSSDPLKPYQDFIIDSREPDATTRVCELLKYQGEHCLLKEMQQWSIPPFPVSGHDIRKVGISSGKEIGALLQQLREQWKKSGYQMEKDELLSYIKKT,434,NP_886552.3.csv,refseq-TRNT1-NM_182916.3_clinical_seed_0_final,refseq-TRNT1-NM_182916.3.a2m,Invitae,refseq-TRNT1-NM_182916.3_theta_0.2.npy,1,434,434
+NP_891552.1,MGKSRTKRFKRPQFSPTGDCQAEAAAAANGTGGEEDDGPAAELLEKLQHPSAEVRECACAGLARLVQQRPALPGLARRDAVRRLGPLLLDPSLAVRETAAGALRNLSACGGFEVCDDMVTKDIMTPLVALLKECSAGLDSNEMSLQEKKDQNRNSIENIANETVNVLWNICECSSRAVSIFNKEGCLEIVLKYLSRFPTNVDLAISVAYCLQTVTEDNPELLKSFSATALNMLESALLSPVSSMESLLLKTLVAGTIWNLKDIIPCKSQAEIINALLKILSEVLGMDAGEMVIQMKEAETQRLKTAAEAEEILENTNGDDLIEDDEMEGISHKRRVRRKTFVSDLLPPTDKELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSPLCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQSLVSLLDVEHLGGAAALQTLAQHLSQLLFSQPDFAKHVDFLEAISSALRALLQTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGITGSVLAKEDGTLETLKNIGCFLLEVTTKDPSLVVAGEALDALFDVFADGKEAERASIQIKLLSALKEFQPVFKMKIRKEGRGNYSTDQLCVLDNVKMNLRRFIAYQETVEKRLTS,680,NP_891552.1.csv,refseq-HEATR3-NM_182922.3_clinical_seed_0_final,refseq-HEATR3-NM_182922.3.a2m,Invitae,refseq-HEATR3-NM_182922.3_theta_0.2.npy,1,680,680
+NP_891847.1,MSIVIPLGVDTAETSYLEMAAGSEPESVEASPVVVEKSNSYPHQLYTSSSHHSHSYIGLPYADHNYGARPPPTPPASPPPSVLISKNEVGIFTTPNFDETSSATTISTSEDGSYGTDVTRCICGFTHDDGYMICCDKCSVWQHIDCMGIDRQHIPDTYLCERCQPRNLDKERAVLLQRRKRENMSDGDTSATESGDEVPVELYTAFQHTPTSITLTASRVSKVNDKRRKKSGEKEQHISKCKKAFREGSRKSSRVKGSAPEIDPSSDGSNFGWETKIKAWMDRYEEANNNQYSEGVQREAQRIALRLGNGNDKKEMNKSDLNTNNLLFKPPVESHIQKNKKILKSAKDLPPDALIIEYRGKFMLREQFEANGYFFKRPYPFVLFYSKFHGLEMCVDARTFGNEARFIRRSCTPNAEVRHEIQDGTIHLYIYSIHSIPKGTEITIAFDFDYGNCKYKVDCACLKENPECPVLKRSSESMENINSGYETRRKKGKKDKDISKEKDTQNQNITLDCEGTTNKMKSPETKQRKLSPLRLSVSNNQEPDFIDDIEEKTPISNEVEMESEEQIAERKRKMTREERKMEAILQAFARLEKREKRREQALERISTAKTEVKTECKDTQIVSDAEVIQEQAKEENASKPTPAKVNRTKQRKSFSRSRTHIGQQRRRHRTVSMCSDIQPSSPDIEVTSQQNDIENTVLTIEPETETALAEIITETEVPALNKCPTKYPKTKKHLVNEWLSEKNEKTGKPSDGLSERPLRITTDPEVLATQLNSLPGLTYSPHVYSTPKHYIRFTSPFLSEKRRRKEPTENISGSCKKRWLKQALEEENSAILHRFNSPCQERSRSPAVNGENKSPLLLNDSCSLPDLTTPLKKRRFYQLLDSVYSETSTPTPSPYATPTHTDITPMDPSFATPPRIKSDDETCRNGYKPIYSPVTPVTPGTPGNTMHFENISSPESSPEIKRRTYSQEGYDRSSTMLTLGPFRNSNLTELGLQEIKTIGYTSPRSRTEVNRQCPGEKEPVSDLQLGLDAVEPTALHKTLETPAHDRAEPNSQLDSTHSGRGTMYSSWVKSPDRTGVNFSVNSNLRDLTPSHQLEVGGGFRISESKCLMQDDTRGMFMETTVFCTSEDGLVSGFGRTVNDNLIDGNCTPQNPPQKKKVSLLEYRKRQREARKSGSKTENFPLISVSPHASGSLSNNGDGCASSNDNGEQVDHTASLPLPTPATVYNATSEETSNNCPVKDATASEKNEPEVQWTASTSVEQVRERSYQRALLLSDHRKDKDSGGESPCVSCSPSHVQSSPSSHSNHIPQLQAKGPVPSFSELMEDPDPENPEPTTTNECPSPDTSQNTCKSPPKMSKPGSPGSVIPAQAHGKIFTKPDPQWDSTVSASEAENGVHLKTELQQKQLSNNNQALSKNHPPQTHVRNSSEQLSQKLPSVPTKLHCPPSPHLENPPKSSTPHTPVQHGYLSPKPPSQQLGSPYRPHHSQSPQVGTPQREPQRNFYPAAQNLPANTQQATSGTLFTQTPSGQSSATYSQFNQQSLNSTAPPPPPPPPPSSSYYQNQQPSANFQNYNQLKGSLSQQTVFTSGPNQALPGTTSQQTVPGHHVTPGHFLPSQNPTIHHQTAAAVVPPPPPPPPAPGPHLVQQPNSHQQHSVAHVVGPVHAVTPGSHIHSQTAGHHLPPPPPPPGPAPHHHPPPHPSTGLQGLQAQHQHVVNSAPPPPPPPPPSSVLASGHHTTSAQALHHPPHQGPPLFPSSAHPTVPPYPSQATHHTTLGPGPQHQPSGTGPHCPLPVTGPHLQPQGPNSIPTPTASGFCPHPGSVALPHGVQGPQQASPVPGQIPIHRAQVPPTFQNNYHGSGWH,1858,NP_891847.1.csv,refseq-KMT2E-NM_182931.2_clinical_seed_0_final,refseq-KMT2E-NM_182931.2.a2m,Invitae,refseq-KMT2E-NM_182931.2.npy,1,1858,1858
+NP_891988.1,MGGCTVKPQLLLLALVLHPWNPCLGADSEKPSSIPTDKLLVITVATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGKARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEFDTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHNKEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNRVGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMTLQREKDSPTPETFQMLSPPKGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSKIFTENIVEQPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP,758,NP_891988.1.csv,refseq-PLOD2-NM_182943.2_clinical_seed_0_final,refseq-PLOD2-NM_182943.2.a2m,Invitae,refseq-PLOD2-NM_182943.2.npy,1,758,758
+NP_892003.2,MAAQGAAAAVAAGTSGVAGEGEPGPGENAAAEGTAPSPGRVSPPTPARGEPEVTVEIGETYLCRRPDSTWHSAEVIQSRVNDQEGREEFYVHYVGFNRRLDEWVDKNRLALTKTVKDAVQKNSEKYLSELAEQPERKITRNQKRKHDEINHVQKTYAEMDPTTAALEKEHEAITKVKYVDKIHIGNYEIDAWYFSPFPEDYGKQPKLWLCEYCLKYMKYEKSYRFHLGQCQWRQPPGKEIYRKSNISVYEVDGKDHKIYCQNLCLLAKLFLDHKTLYFDVEPFVFYILTEVDRQGAHIVGYFSKEKESPDGNNVACILTLPPYQRRGYGKFLIAFSYELSKLESTVGSPEKPLSDLGKLSYRSYWSWVLLEILRDFRGTLSIKDLSQMTSITQNDIISTLQSLNMVKYWKGQHVICVTPKLVEEHLKSAQYKKPPITGGWGAAVCRGRWGSVSIWTGRSQGLLIAVT,467,NP_892003.2.csv,refseq-KAT8-NM_182958.4_clinical_seed_0_final,refseq-KAT8-NM_182958.4.a2m,Invitae,refseq-KAT8-NM_182958.4.npy,1,467,467
+NP_898871.1,MAVVAAAAGWLLRLRAAGAEGHWRRLPGAGLARGFLHPAATVEDAAQRRQVAHFTFQPDPEPREYGQTQKMNLFQSVTSALDNSLAKDPTAVIFGEDVAFGGVFRCTVGLRDKYGKDRVFNTPLCEQGIVGFGIGIAVTGATAIAEIQFADYIFPAFDQIVNEAAKYRYRSGDLFNCGSLTIRSPWGCVGHGALYHSQSPEAFFAHCPGIKVVIPRSPFQAKGLLLSCIEDKNPCIFFEPKILYRAAAEEVPIEPYNIPLSQAEVIQEGSDVTLVAWGTQVHVIREVASMAKEKLGVSCEVIDLRTIIPWDVDTICKSVIKTGRLLISHEAPLTGGFASEISSTVQEECFLNLEAPISRVCGYDTPFPHIFEPFYIPDKWKCYDALRKMINY,392,NP_898871.1.csv,refseq-BCKDHB-NM_183050.2_clinical_seed_0_final,refseq-BCKDHB-NM_183050.2.a2m,Invitae,refseq-BCKDHB-NM_183050.2.npy,1,392,392
+NP_898898.1,MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGIPPGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSIFSFFIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPFGPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSENTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNITISRR,544,NP_898898.1.csv,refseq-CYP2U1-NM_183075.2_clinical_seed_0_final,refseq-CYP2U1-NM_183075.2.a2m,Invitae,refseq-CYP2U1-NM_183075.2.npy,1,544,544
+NP_899196.1,MENSDSNDKGSGDQSAAQRRSQMDRLDREEAFYQFVNNLSEEDYRLMRDNNLLGTPGESTEEELLRRLQQIKEGPPPQNSDENRGGDSSDDVSNGDSIIDWLNSVRQTGNTTRSGQRGNQSWRAVSRTNPNSGDFRFSLEINVNRNNGSQNSENENEPSARRSSGENVENNSQRQVENPRSESTSARPSRSERNSTEALTEVPPTRGQRRARSRSPDHRRTRARAERSRSPLHPMSEIPRRSHHSISSQTFEHPLVNETEGSSRTRHHVTLRQQISGPELLSRGLFAASGTRNASQGAGSSDTAASGESTGSGQRPPTIVLDLQVRRVRPGEYRQRDSIASRTRSRSQTPNNTVTYESERGGFRRTFSRSERAGVRTYVSTIRIPIRRILNTGLSETTSVAIQTMLRQIMTGFGELSYFMYSDSDSEPTGSVSNRNMERAESRSGRGGSGGGSSSGSSSSSSSSSSSSSSSSSSSSPSSSSGGESSETSSDLFEGSNEGSSSSGSSGARREGRHRAPVTFDESGSLPFLSLAQFFLLNEDDDDQPRGLTKEQIDNLAMRSFGENDALKTCSVCITEYTEGNKLRKLPCSHEYHVHCIDRWLSENSTCPICRRAVLASGNRESVV,624,NP_899196.1.csv,refseq-RLIM-NM_183353.2_clinical_seed_0_final,refseq-RLIM-NM_183353.2.a2m,Invitae,refseq-RLIM-NM_183353.2.npy,1,624,624
+NP_899200.1,MSGSKSVSPPGYAAQKTAAPAPRGGPEHRSAWGEADSRANGYPHAPGGSARGSTKKPGGAVTPQQQQRLASRWRSDDDDDPPLSGDDPLAGGFGFSFRSKSAWQERGGDDCGRGSRRQRRGAASGGSTRAPPAGGGGGSAAAAASAGGTEVRPRSVEVGLEERRGKGRAADELEAGAVEGGEGSGDGGSSADSGSGAGPGAVLSLGACCLALLQIFRSKKFPSDKLERLYQRYFFRLNQSSLTMLMAVLVLVCLVMLAFHAARPPLQLPYLAVLAAAVGVILIMAVLCNRAAFHQDHMGLACYALIAVVLAVQVVGLLLPQPRSASEGIWWTVFFIYTIYTLLPVRMRAAVLSGVLLSALHLAIALRTNAQDQFLLKQLVSNVLIFSCTNIVGVCTHYPAEVSQRQAFQETRECIQARLHSQRENQQQERLLLSVLPRHVAMEMKADINAKQEDMMFHKIYIQKHDNVSILFADIEGFTSLASQCTAQELVMTLNELFARFDKLAAENHCLRIKILGDCYYCVSGLPEARADHAHCCVEMGMDMIEAISLVREVTGVNVNMRVGIHSGRVHCGVLGLRKWQFDVWSNDVTLANHMEAGGKAGRIHITKATLNYLNGDYEVEPGCGGERNAYLKEHSIETFLILRCTQKRKEEKAMIAKMNRQRTNSIGHNPPHWGAERPFYNHLGGNQVSKEMKRMGFEDPKDKNAQESANPEDEVDEFLGRAIDARSIDRLRSEHVRKFLLTFREPDLEKKYSKQVDDRFGAYVACASLVFLFICFVQITIVPHSIFMLSFYLTCSLLLTLVVFVSVIYSCVKLFPSPLQTLSRKIVRSKMNSTLVGVFTITLVFLAAFVNMFTCNSRDLLGCLAQEHNISASQVNACHVAESAVNYSLGDEQGFCGSPWPNCNFPEYFTYSVLLSLLACSVFLQISCIGKLVLMLAIELIYVLIVEVPGVTLFDNADLLVTANAIDFFNNGTSQCPEHATKVALKVVTPIIISVFVLALYLHAQQVESTARLDFLWKLQATEEKEEMEELQAYNRRLLHNILPKDVAAHFLARERRNDELYYQSCECVAVMFASIANFSEFYVELEANNEGVECLRLLNEIIADFDEIISEDRFRQLEKIKTIGSTYMAASGLNDSTYDKVGKTHIKALADFAMKLMDQMKYINEHSFNNFQMKIGLNIGPVVAGVIGARKPQYDIWGNTVNVASRMDSTGVPDRIQVTTDMYQVLAANTYQLECRGVVKVKGKGEMMTYFLNGGPPLS,1261,NP_899200.1.csv,refseq-ADCY5-NM_183357.2_clinical_seed_0_final,refseq-ADCY5-NM_183357.2.a2m,Invitae,refseq-ADCY5-NM_183357.2.npy,1,1261,1261
+NP_919224.1,MALLIHLKTVSELRGRGDRIAKVTFRGQSFYSRVLENCEDVADFDETFRWPVASSIDRNEMLEIQVFNYSKVFSNKLIGTFRMVLQKVVEESHVEVTDTLIDDNNAIIKTSLCVEVRYQATDGTVGSWDDGDFLGDESLQEEEKDSQETDGLLPGSRPSSRPPGEKSFRRAGRSVFSAMKLGKNRSHKEEPQRPDEPAVLEMEDLDHLAIRLGDGLDPDSVSLASVTALTTNVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVVCVEVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVIHSKNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISSGLKGYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPERQWARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQVFFAGQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSDKVNDVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLLDEHQDLNEGLGEGVSFRARLLLGLAVEIVDTSNPELTSSTEVQVEQATPISESCAGKMEEFFLFGAFLEASMIDRRNGDKPITFEVTIGNYGNEVDGLSRPQRPRPRKEPGDEEEVDLIQNASDDEAGDAGDLASVSSTPPMRPQVTDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMDHIADKLEEGLNDIQEMIKTEKSYPERRLRGVLEELSCGCCRFLSLADKDQGHSSRTRLDRERLKSCMRELENMGQQARMLRAQVKRHTVRDKLRLCQNFLQKLRFLADEPQHSIPDIFIWMMSNNKRVAYARVPSKDLLFSIVEEETGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKVELYLWLGLSKQRKEFLCGLPCGFQEVKAAQGLGLHAFPPVSLVYTKKQAFQLRAHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWDQMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKPLVKMADEAYCPPRFPPQLEYYQIYRGNATAGDLLAAFELLQIGPAGKADLPPINGPVDVDRGPIMPVPMGIRPVLSKYRVEVLFWGLRDLKRVNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLPENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAPSWNTTVRLLRRCRVLCNGGSSSHSTGEVVVTMEPEVPIKKLETMVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPEEEEPDESMLDWWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNTEGLKGSMKGKEKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDELKVYPKELESEFDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPLPEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVVRATDLHPADINGKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLTVAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSTHGYNIWRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVANRVFTGPSEIEDENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEHVETRPLLNPDKPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIIWNTDEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGEGNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKIPARLTLQIWDADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLVSIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNPVGLARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTYRWLLLKLLLLLLLLLLLALFLYSVPGYLVKKILGA,1997,NP_919224.1.csv,refseq-OTOF-NM_194248.2_clinical_seed_0_final,refseq-OTOF-NM_194248.2.a2m,Invitae,refseq-OTOF-NM_194248.2.npy,1,1997,1997
+NP_919253.1,MLHLKVQFLDDSQKIFVVDQKSSGKALFNLSCSHLNLAEKEYFGLEFCSHSGNNVWLELLKPITKQVKNPKEIVFKFMVKFFPVDPGHLREELTRYLFTLQIKKDLALGRLPCSDNCTALMVSHILQSELGDFHEETDRKHLAQTRYLPNQDCLEGKIMHFHQKHIGRSPAESDILLLDIARKLDMYGIRPHPASDGEGMQIHLAVAHMGVLVLRGNTKINTFNWAKIRKLSFKRKHFLIKLHANILVLCKDTLEFTMASRDACKAFWKTCVEYHAFFRLSEEPKSKPKTLLCSKGSSFRYSGRTQRQLLEYGRKGRLKSLPFERKHYPSQYHERQCRSSPDLLSDVSKQVEDLRLAYGGGYYQNVNGVHASEPVLESRRRNSALEVTFATELEHSKPEADPTLLHQSQSSSSFPFIYMDPVFNTEPNPNPDPRDIFSERSSLSSFQTSCKFSGNHMSIYSGLTSKVRPAKQLTYTDVPYIPCTGQQVGIMPPQVFFYVDKPPQVPRWSPIRAEERTSPHSYVEPTAMKPAERSPRNIRMKSFQQDLQVLQEAIARTSGRSNINVGLEEEDPNLEDAFVCNIQEQTPKRSQSQSDMKTIRFPFGSEFRPLGPCPALSHKADLFTDMFAEQELPAVLMDQSTAERYVASESSDSESEILKPDYYALYGKEIRSPMARIRLSSGSLQLDEEDEDAYFNTPTAEDRTSLKPCNYFLA,714,NP_919253.1.csv,refseq-FRMD7-NM_194277.2_clinical_seed_0_final,refseq-FRMD7-NM_194277.2.a2m,Invitae,refseq-FRMD7-NM_194277.2.npy,1,714,714
+NP_919255.2,MAAAWGSSLTAATQRAVTPWPRGRLLTASLGPQARREASSSSPEAGEGQIRLTDSCVQRLLEITEGSEFLRLQVEGGGCSGFQYKFSLDTVINPDDRVFEQGGARVVVDSDSLAFVKGAQVDFSQELIRSSFQVLNNPQAQQGCSCGSSFSIKL,154,NP_919255.2.csv,refseq-ISCA2-NM_194279.3_clinical_seed_0_final,refseq-ISCA2-NM_194279.3.a2m,Invitae,refseq-ISCA2-NM_194279.3.npy,1,154,154
+NP_919259.3,MKCHYEALGVRRDASEEELKKAYRKLALKWHPDKNLDNAAEAAEQFKLIQAAYDVLSDPQERAWYDNHREALLKGGFDGEYQDDSLDLLRYFTVTCYSGYGDDEKGFYTVYRNVFEMIAKEELESVLEEEVDDFPTFGDSQSDYDTVVHPFYAYWQSFCTQKNFAWKEEYDTRQASNRWEKRAMEKENKKIRDKARKEKNELVRQLVAFIRKRDKRVQAHRKLVEEQNAEKARKAEEMRRQQKLKQAKLVEQYREQSWMTMANLEKELQEMEARYEKEFGDGSDENEMEEHELKDEEDGKDSDEAEDAELYDDLYCPACDKSFKTEKAMKNHEKSKKHREMVALLKQQLEEEEENFSRPQIDENPLDDNSEEEMEDAPKQKLSKKQKKKKQKPAQDVPGKDSYLPAAHFQMAWGKKCVLGERRDGESEHKCAKMLLENRQNYDDNFNVNGPGEGVKVDPEDTNLNQDSAKELEDSPQENVSVTEIIKPCDDPKSEAKSVPKPKGKKTKDMKKPVRVPAEPQTMSVLISCTTCHSEFPSRNKLFDHLKATGHARAPSSSSLNSATSSQSKKEKRKNR,576,NP_919259.3.csv,refseq-DNAJC21-NM_194283.3_clinical_seed_0_final,refseq-DNAJC21-NM_194283.3.a2m,Invitae,refseq-DNAJC21-NM_194283.3.npy,1,576,576
+NP_919299.3,MRPPACWWLLAPPALLALLTCSLAFGLASEDTKKEVKQSQDLEKSGISRKNDIDLKGIVFVIQSQSNSFHAKRAEQLKKSILKQAADLTQELPSVLLLHQLAKQEGAWTILPLLPHFSVTYSRNSSWIFFCEEETRIQIPKLLETLRRYDPSKEWFLGKALHDEEATIIHHYAFSENPTVFKYPDFAAGWALSIPLVNKLTKRLKSESLKSDFTIDLKHEIALYIWDKGGGPPLTPVPEFCTNDVDFYCATTFHSFLPLCRKPVKKKDIFVAVKTCKKFHGDRIPIVKQTWESQASLIEYYSDYTENSIPTVDLGIPNTDRGHCGKTFAILERFLNRSQDKTAWLVIVDDDTLISISRLQHLLSCYDSGEPVFLGERYGYGLGTGGYSYITGGGGMVFSREAVRRLLASKCRCYSNDAPDDMVLGMCFSGLGIPVTHSPLFHQARPVDYPKDYLSHQVPISFHKHWNIDPVKVYFTWLAPSDEDKARQETQKGFREEL,498,NP_919299.3.csv,refseq-B3GLCT-NM_194318.3_clinical_seed_0_final,refseq-B3GLCT-NM_194318.3.a2m,Invitae,refseq-B3GLCT-NM_194318.3.npy,1,498,498
+NP_919438.1,MGNPENIEDAYVAVIRPKNTASLNSREYRAKSYEILLHEVPIEGQKKKRKKVLLETKLQGNSEITQGILDYVVETTKPISPANQGIRGKRVVLMKKFPLDGEKMGREASLFIVPSVVKDNTKYTYTPGCPIFYCLQDIMRVCSESSTHFATLTARMLIALDKWLDERHAQSHFIPALFRPSPLERIKTNVINPAYATESGQTENSLHMGYSALEIKSKMLALEKADTCIYNPLFGSDLQYTNRVDKVVINPYFGLGAPDYSKIQIPKQEKWQRSMSSVTEDKERQWVDDFPLHRSACEGDSELLSRLLSERFSVNQLDSDHWAPIHYACWYGKVEATRILLEKGKCNPNLLNGQLSSPLHFAAGGGHAEIVQILLNHPETDRHITDQQGRSPLNICEENKQNNWEEAAKLLKEAINKPYEKVRIYRMDGSYRSVELKHGNNTTVQQIMEGMRLSQETQQYFTIWICSENLSLQLKPYHKPLQHVRDWPEILAELTNLDPQRETPQLFLRRDVRLPLEVEKQIEDPLAILILFDEARYNLLKGFYTAPDAKLITLASLLLQIVYGNYESKKHKQGFLNEENLKSIVPVTKLKSKAPHWTNRILHEYKNLSTSEGVSKEMHHLQRMFLQNCWEIPTYGAAFFTGQIFTKASPSNHKVIPVYVGVNIKGLHLLNMETKALLISLKYGCFMWQLGDTDTCFQIHSMENKMSFIVHTKQAGLVVKLLMKLNGQLMPTERNS,736,NP_919438.1.csv,refseq-KRIT1-NM_194456.1_clinical_seed_0_final,refseq-KRIT1-NM_194456.1.a2m,Invitae,refseq-KRIT1-NM_194456.1.npy,1,736,736
+NP_922932.2,MEILWKTLTWILSLIMASSEFHSDHRLSYSSQEEFLTYLEHYQLTIPIRVDQNGAFLSFTVKNDKHSRRRRSMDPIDPQQAVSKLFFKLSAYGKHFHLNLTLNTDFVSKHFTVEYWGKDGPQWKHDFLDNCHYTGYLQDQRSTTKVALSNCVGLHGVIATEDEEYFIEPLKNTTEDSKHFSYENGHPHVIYKKSALQQRHLYDHSHCGVSDFTRSGKPWWLNDTSTVSYSLPINNTHIHHRQKRSVSIERFVETLVVADKMMVGYHGRKDIEHYILSVMNIVAKLYRDSSLGNVVNIIVARLIVLTEDQPNLEINHHADKSLDSFCKWQKSILSHQSDGNTIPENGIAHHDNAVLITRYDICTYKNKPCGTLGLASVAGMCEPERSCSINEDIGLGSAFTIAHEIGHNFGMNHDGIGNSCGTKGHEAAKLMAAHITANTNPFSWSACSRDYITSFLDSGRGTCLDNEPPKRDFLYPAVAPGQVYDADEQCRFQYGATSRQCKYGEVCRELWCLSKSNRCVTNSIPAAEGTLCQTGNIEKGWCYQGDCVPFGTWPQSIDGGWGPWSLWGECSRTCGGGVSSSLRHCDSPAPSGGGKYCLGERKRYRSCNTDPCPLGSRDFREKQCADFDNMPFRGKYYNWKPYTGGGVKPCALNCLAEGYNFYTERAPAVIDGTQCNADSLDICINGECKHVGCDNILGSDAREDRCRVCGGDGSTCDAIEGFFNDSLPRGGYMEVVQIPRGSVHIEVREVAMSKNYIALKSEGDDYYINGAWTIDWPRKFDVAGTAFHYKRPTDEPESLEALGPTSENLIVMVLLQEQNLGIRYKFNVPITRTGSGDNEVGFTWNHQPWSECSATCAGGVQRQEVVCKRLDDNSIVQNNYCDPDSKPPENQRACNTEPCPPEWFIGDWLECSKTCDGGMRTRAVLCIRKIGPSEEETLDYSGCLTHRPVEKEPCNNQSCPPQWVALDWSECTPKCGPGFKHRIVLCKSSDLSKTFPAAQCPEESKPPVRIRCSLGRCPPPRWVTGDWGQCSAQCGLGQQMRTVQCLSYTGQASSDCLETVRPPSMQQCESKCDSTPISNTEECKDVNKVAYCPLVLKFKFCSRAYFRQMCCKTCQGH,1117,NP_922932.2.csv,refseq-ADAMTS6-NM_197941.3_clinical_seed_0_final,refseq-ADAMTS6-NM_197941.3.a2m,Invitae,refseq-ADAMTS6-NM_197941.3.npy,1,1117,1117
+NP_932173.1,MANFLLPRGTSSFRRFTRESLAAIEKRMAEKQARGSTTLQESREGLPEEEAPRPQLDLQASKKLPDLYGNPPQELIGEPLEDLDPFYSTQKTFIVLNKGKTIFRFSATNALYVLSPFHPIRRAAVKILVHSLFNMLIMCTILTNCVFMAQHDPPPWTKYVEYTFTAIYTFESLVKILARGFCLHAFTFLRDPWNWLDFSVIIMAYTTEFVDLGNVSALRTFRVLRALKTISVISGLKTIVGALIQSVKKLADVMVLTVFCLSVFALIGLQLFMGNLRHKCVRNFTALNGTNGSVEADGLVWESLDLYLSDPENYLLKNGTSDVLLCGNSSDAGTCPEGYRCLKAGENPDHGYTSFDSFAWAFLALFRLMTQDCWERLYQQTLRSAGKIYMIFFMLVIFLGSFYLVNLILAVVAMAYEEQNQATIAETEEKEKRFQEAMEMLKKEHEALTIRGVDTVSRSSLEMSPLAPVNSHERRSKRRKRMSSGTEECGEDRLPKSDSEDGPRAMNHLSLTRGLSRTSMKPRSSRGSIFTFRRRDLGSEADFADDENSTAGESESHHTSLLVPWPLRRTSAQGQPSPGTSAPGHALHGKKNSTVDCNGVVSLLGAGDPEATSPGSHLLRPVMLEHPPDTTTPSEEPGGPQMLTSQAPCVDGFEEPGARQRALSAVSVLTSALEELEESRHKCPPCWNRLAQRYLIWECCPLWMSIKQGVKLVVMDPFTDLTITMCIVLNTLFMALEHYNMTSEFEEMLQVGNLVFTGIFTAEMTFKIIALDPYYYFQQGWNIFDSIIVILSLMELGLSRMSNLSVLRSFRLLRVFKLAKSWPTLNTLIKIIGNSVGALGNLTLVLAIIVFIFAVVGMQLFGKNYSELRDSDSGLLPRWHMMDFFHAFLIIFRILCGEWIETMWDCMEVSGQSLCLLVFLLVMVIGNLVVLNLFLALLLSSFSADNLTAPDEDREMNNLQLALARIQRGLRFVKRTTWDFCCGLLRQRPQKPAALAAQGQLPSCIATPYSPPPPETEKVPPTRKETRFEEGEQPGQGTPGDPEPVCVPIAVAESDTDDQEEDEENSLGTEEESSKQQESQPVSGGPEAPPDSRTWSQVSATASSEAEASASQADWRQQWKAEPQAPGCGETPEDSCSEGSTADMTNTAELLEQIPDLGQDVKDPEDCFTEGCVRRCPCCAVDTTQAPGKVWWRLRKTCYHIVEHSWFETFIIFMILLSSGALAFEDIYLEERKTIKVLLEYADKMFTYVFVLEMLLKWVAYGFKKYFTNAWCWLDFLIVDVSLVSLVANTLGFAEMGPIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFGRCINQTEGDLPLNYTIVNNKSQCESLNLTGELYWTKVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRGYEEQPQWEYNLYMYIYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPLNKYQGFIFDIVTKQAFDVTIMFLICLNMVTMMVETDDQSPEKINILAKINLLFVAIFTGECIVKLAALRHYYFTNSWNIFDFVVVILSIVGTVLSDIIQKYFFSPTLFRVIRLARIGRILRLIRGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYSIFGMANFAYVKWEAGIDDMFNFQTFANSMLCLFQITTSAGWDGLLSPILNTGPPYCDPTLPNSNGSRGDCGSPAVGILFFTTYIIISFLIVVNMYIAIILENFSVATEESTEPLSEDDFDMFYEIWEKFDPEATQFIEYSVLSDFADALSEPLRIAKPNQISLINMDLPMVSGDRIHCMDILFAFTKRVLGESGEMDALKIQMEEKFMAANPSKISYEPITTTLRRKHEEVSAMVIQRAFRRHLLQRSLKHASFLFRQQAGSGLSEEDAPEREGLIAYVMSENFSRPLGPPSSSSISSTSFPPSYDSVTRATSDNLQVRGSDYSHSEDLADFPPSPDRDRESIV,2016,NP_932173.1.csv,refseq-SCN5A-NM_198056.2_clinical_seed_0_final,refseq-SCN5A-NM_198056.2.a2m,Invitae,refseq-SCN5A-NM_198056.2.npy,1,2016,2016
+NP_937799.1,MPRRAENWDEAEVGAEEAGVEEYGPEEDGGEESGAEESGPEESGPEELGAEEEMEAGRPRPVLRSVNSREPSQVIFCNRSPRVVLPVWLNFDGEPQPYPTLPPGTGRRIHSYRVYTLKERCLQVVRSLVKPENYRRLDIVRSLYEDLEDHPNVQKDLERLTQERIAHQRMGD,172,NP_937799.1.csv,refseq-VHL-NM_198156.3_clinical_seed_0_final,refseq-VHL-NM_198156.3.a2m,Invitae,refseq-VHL-NM_198156.3_theta_0.2.npy,1,172,172
+NP_937859.1,MTVGKSSKMLQHIDYRMRCILQDGRIFIGTFKAFDKHMNLILCDCDEFRKIKPKNSKQAEREEKRVLGLVLLRGENLVSMTVEGPPPKDTGIARVPLAGAAGGPGIGRAAGRGIPAGVPMPQAPAGLAGPVRGVGGPSQQVMTPQGRGTVAAAAAAATASIAGAPTQYPPGRGGPPPPMGRGAPPPGMMGPPPGMRPPMGPPMGIPPGRGTPMGMPPPGMRPPPPGMRGPPPPGMRPPRP,240,NP_937859.1.csv,refseq-SNRPB-NM_198216.1_clinical_seed_0_final,refseq-SNRPB-NM_198216.1.a2m,Invitae,refseq-SNRPB-NM_198216.1.npy,1,240,240
+NP_937983.2,MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD,1132,NP_937983.2.csv,refseq-TERT-NM_198253.2_clinical_seed_0_final,refseq-TERT-NM_198253.2.a2m,Invitae,refseq-TERT-NM_198253.2.npy,1,1132,1132
+NP_938011.1,MPFAKRIVEPQWLCRQRRPAPGPAVDASGGSAEPPPPLQPPGRRDLDEVEAPGPEEPARAVPAPSGLPPPPPPLPAPADQTQPPHGEASVAGEESTAGIPEAAPAAGEASSAAAAAAVLLMLDLCAVSNAALARVLRQLSDVARHACSLFQELESDIQLTHRRVWALQGKLGGVQRVLSTLDPKQEAVPVSNLDIESKLSVYYRAPWHQQRNIFLPATRPPCVEELHRHARQSLQALRREHRSRSDRREQRAAAPLSIAAPPLPAYPPAHSQRRREFKDRHFLTSHPPEDEDTDVMLGQRPKNPIHNIPSTLDKQTNWSKALPLPTPEEKMKQDAQVISSCIIPINVTGVGFDREASIRCSLVHSQSVLQRRRKLRRRKTISGIPRRVQQEIDSDESPVARERNVIVHTNPDPSNTVNRISGTRDSECQTEDILIAAPSRRRIRAQRGQSIAASLSHSAGNISALADKGDTMFTPAVSSRTRSRSLPREGNRGGDAEPKVGAKPSAYEEGESFVGDHERTPNDFSEAPSSPSAQDHQPTLGLACSQHLHSPQHKLSERGRSRLSRMAADSGSCDISSNSDTFGSPIHCISTAGVLLSSHMDQKDDHQSSSGNWSGSSSTCPSQTSETIPPAASPPLTGSSHCDSELSLNTAPHANEDASVFVTEQYNDHLDKVRGHRANSFTSTVADLLDDPNNSNTSDSEWNYLHHHHDASCRQDFSPERPKADSLGCPSFTSMATYDSFLEKSPSDKADTSSHFSVDTEGYYTSMHFDCGLKGNKSYVCHYAALGPENGQGVGASPGLPDCAWQDYLDHKRQGRPSISFRKPKAKPTPPKRSSSLRKSDGNADISEKKEPKISSGQHLPHSSREMKLPLDFANTPSRMENANLPTKQEPSWINQSEQGIKEPQLDASDIPPFKDEVAESTHYADLWLLNDLKTNDPYRSLSNSSTATGTTVIECIKSPESSESQTSQSESRATTPSLPSVDNEFKLASPEKLAGLASPSSGYSSQSETPTSSFPTAFFSGPLSPGGSKRKPKVPERKSSLQQPSLKDGTISLSKDLELPIIPPTHLDLSALHNVLNKPFHHRHPLHVFTHNKQNTVGETLRSNPPPSLAITPTILKSVNLRSINKSEEVKQKEENNTDLPYLEESTLTTAALSPSKIRPHTANKSVSRQYSTEDTILSFLDSSAVEMGPDKLHLEKNSTFDVKNRCDPETITSAGSSLLDSNVTKDQVRTETEPIPENTPTKNCAFPTEGFQRVSAARPNDLDGKIIQYGPGPDETLEQVQKAPSAGLEEVAQPESVDVITSQSDSPTRATDVSNQFKHQFVMSRHHDKVPGTISYESEITSVNSFPEKCSKQENIASGISAKSASDNSKAEETQGNVDEASLKESSPSDDSIISPLSEDSQAEAEGVFVSPNKPRTTEDLFAVIHRSKRKVLGRKDSGDMSVRSKSRAPLSSSSSSASSITSPSSNVTTPNSQRSPGLIYRNAKKSNTSNEEFKLLLLKKGSRSDSSYRMSATEILKSPILPKPPGELTAESPQSTDDAHQGSQGAEALSPLSPCSPRVNAEGFSSKSFATSASARVGRSRAPPAASSSRYSVRCRLYNTPMQAISEGETENSDGSPHDDRSSQSST,1630,NP_938011.1.csv,refseq-NHS-NM_198270.2_clinical_seed_0_final,refseq-NHS-NM_198270.2.a2m,Invitae,refseq-NHS-NM_198270.2_theta_0.2.npy,1,1630,1630
+NP_938012.2,MSEHSRNSDQEELLDEEINEDEILANLSAEELKELQSEMEVMAPDPSLPVGMIQKDQTDKPPTGNFNHKSLVDYMYWEKASRRMLEEERVPVTFVKSEEKTQEEHEEIEKRNKNMAQYLKEKLNNEIVANKRESKGSSNIQETDEEDEEEEDDDDDDEGEDDGEESEETNREEEGKAKEQIRNCENNCQQVTDKAFKEQRDRPEAQEQSEKKISKLDPKKLALDTSFLKVSTRPSGNQTDLDGSLRRVRKNDPDMKELNLNNIENIPKEMLLDFVNAMKKNKHIKTFSLANVGADENVAFALANMLRENRSITTLNIESNFITGKGIVAIMRCLQFNETLTELRFHNQRHMLGHHAEMEIARLLKANNTLLKMGYHFELPGPRMVVTNLLTRNQDKQRQKRQEEQKQQQLKEQKKLIAMLENGLGLPPGMWELLGGPKPDSRMQEFFQPPPPRPPNPQNVPFSQRSEMMKKPSQAPKYRTDPDSFRVVKLKRIQRKSRMPEAREPPEKTNLKDVIKTLKPVPRNRPPPLVEITPRDQLLNDIRHSSVAYLKPVQLPKELA,560,NP_938012.2.csv,refseq-LMOD3-NM_198271.4_clinical_seed_0_final,refseq-LMOD3-NM_198271.4.a2m,Invitae,refseq-LMOD3-NM_198271.4.npy,1,560,560
+NP_938023.1,MPHSSLHPSIPCPRGHGAQKAALVLLSACLVTLWGLGEPPEHTLRYLVLHLASLQLGLLLNGVCSLAEELRHIHSRYRGSYWRTVRACLGCPLRRGALLLLSIYFYYSLPNAVGPPFTWMLALLGLSQALNILLGLKGLAPAEISAVCEKGNFNVAHGLAWSYYIGYLRLILPELQARIRTYNQHYNNLLRGAVSQRLYILLPLDCGVPDNLSMADPNIRFLDKLPQQTGDHAGIKDRVYSNSIYELLENGQRAGTCVLEYATPLQTLFAMSQYSQAGFSREDRLEQAKLFCRTLEDILADAPESQNNCRLIAYQEPADDSSFSLSQEVLRHLRQEEKEEVTVGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDFS,379,NP_938023.1.csv,refseq-TMEM173-NM_198282.3_clinical_seed_0_final,refseq-TMEM173-NM_198282.3.a2m,Invitae,refseq-TMEM173-NM_198282.3.npy,1,379,379
+NP_938033.1,MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQTPSKPASADGHRGPSAAFAPAAAEPKLFGGFNSSDTVTSPQRAGPLAGGVTTFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLSTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESERLLLNAENPRGTFLVRESETTKGAYCLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFNSLQQLVAYYSKHADGLCHRLTTVCPTSKPQTQGLAKDAWEIPRESLRLEVKLGQGCFGEVWMGTWNGTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGETGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANILVGENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILLTELTTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFTSTEPQYQPGENL,536,NP_938033.1.csv,refseq-SRC-NM_198291.3_clinical_seed_0_final,refseq-SRC-NM_198291.3.a2m,Invitae,refseq-SRC-NM_198291.3.npy,1,536,536
+NP_938143.1,MALKRMGIVSDYEKIRTFAVAIVGVGGVGSVTAEMLTRCGIGKLLLFDYDKVELANMNRLFFQPHQAGLSKVQAAEHTLRNINPDVLFEVHNYNITTVENFQHFMDRISNGGLEEGKPVDLVLSCVDNFEARMTINTACNELGQTWMESGVSENAVSGHIQLIIPGESACFACAPPLVVAANIDEKTLKREGVCAASLPTTMGVVAGILVQNVLKFLLNFGTVSFYLGYNAMQDFFPTMSMKPNPQCDDRNCRKQQEEYKKKVAALPKQEVIQEEEEIIHEDNEWGIELVSEVSEEELKNFSGPVPDLPEGITVAYTIPKKQEDSVTELTVEDSGESLEDLMAKMKNM,348,NP_938143.1.csv,refseq-UBA5-NM_198329.3_clinical_seed_0_final,refseq-UBA5-NM_198329.3.a2m,Invitae,refseq-UBA5-NM_198329.3.npy,1,348,348
+NP_938149.2,MAAVAAVAARRRRSWASLVLAFLGVCLGITLAVDRSNFKTCEESSFCKRQRSIRPGLSPYRALLDSLQLGPDSLTVHLIHEVTKVLLVLELQGLQKNMTRFRIDELEPRRPRYRVPDVLVADPPIARLSVSGRDENSVELTMAEGPYKIILTARPFRLDLLEDRSLLLSVNARGLLEFEHQRAPRVSFSDKVNLTLGSIWDKIKNLFSRQGSKDPAEGDGAQPEETPRDGDKPEETQGKAEKDEPGAWEETFKTHSDSKPYGPMSVGLDFSLPGMEHVYGIPEHADNLRLKVTEGGEPYRLYNLDVFQYELYNPMALYGSVPVLLAHNPHRDLGIFWLNAAETWVDISSNTAGKTLFGKMMDYLQGSGETPQTDVRWMSETGIIDVFLLLGPSISDVFRQYASLTGTQALPPLFSLGYHQSRWNYRDEADVLEVDQGFDDHNLPCDVIWLDIEHADGKRYFTWDPSRFPQPRTMLERLASKRRKLVAIVDPHIKVDSGYRVHEELRNLGLYVKTRDGSDYEGWCWPGSAGYPDFTNPTMRAWWANMFSYDNYEGSAPNLFVWNDMNEPSVFNGPEVTMLKDAQHYGGWEHRDVHNIYGLYVHMATADGLRQRSGGMERPFVLARAFFAGSQRFGAVWTGDNTAEWDHLKISIPMCLSLGLVGLSFCGADVGGFFKNPEPELLVRWYQMGAYQPFFRAHAHLDTGRREPWLLPSQHNDIIRDALGQRYSLLPFWYTLLYQAHREGIPVMRPLWVQYPQDVTTFNIDDQYLLGDALLVHPVSDSGAHGVQVYLPGQGEVWYDIQSYQKHHGPQTLYLPVTLSSIPVFQRGGTIVPRWMRVRRSSECMKDDPITLFVALSPQGTAQGELFLDDGHTFNYQTRQEFLLRRFSFSGNTLVSSSADPEGHFETPIWIERVVIIGAGKPAAVVLQTKGSPESRLSFQHDPETSVLVLRKPGINVASDWSIHLR,966,NP_938149.2.csv,refseq-GANAB-NM_198335.3_clinical_seed_0_final,refseq-GANAB-NM_198335.3.a2m,Invitae,refseq-GANAB-NM_198335.3.npy,1,966,966
+NP_938207.2,MSRAVRLPVPCPVQLGTLRNDSLEAQLHEYVKQGNYVKVKKILKKGIYVDAVNSLGQTALFVAALLGLRKFVDVLVDYGSDPNHRCFDGSTPVHAAAFSGNQWILSKLLDAGGDLRLHDERGQNPKTWALTAGKERSTQIVEFMQRCASHMQAIIQGFSYDLLKKIDSPQRLVYSPSWCGGLVQGNPNGSPNRLLKAGVISAQNIYSFGFGKFYLTGATQMAYLGSLPVIGEKEVIQADDEPTFSFFSGPYMVMTNLVWNGSRVTVKELNLPTHPHCSRLRLADLLIAEQEHSSKLRHPYLLQLMAVCLSQDLEKTRLVYERITIGTLFSVLHERRSQFPVLHMEVIVHLLLQISDALRYLHFQGFIHRSLSSYAVHIISPGEARLTNLEYMLESEDRGVQRDLTRVPLPTQLYNWAAPEVILQKAATVKSDIYSFSMIMQEILTDDIPWKGLDGSVVKKAVVSGNYLEADVRLPKPYYDIVKSGIHVKQKDRTMNLQDIRYILKNDLKDFTGAQRTQPTESPRVQRYGLHPDVNVYLGLTSEHPRETPDMEIIELKEMGSQPHSPRVHSLFTEGTLDPQAPDPCLMARETQNQDAPCPAPFMAEEASSPSTGQPSLCSFEINEIYSGCLILEDDIEEPPGAASSLEADGPNQVDELKSMEEELDKMEREACCFGSEDESSSKAETEYSFDDWDWQNGSLSSLSLPESTREAKSNLNNMSTTEEYLISKCVLDLKIMQTIMHENDDRLRNIEQILDEVEMKQKEQEERMSLWATSREFTNAYKLPLAVGPPSLNYIPPVLQLSGGQKPDTSGNYPTLPRFPRMLPTLCDPGKQNTDEQFQCTQGAKDSLETSRIQNTSSQGRPRESTAQAKATQFNSALFTLSSHRQGPSASPSCHWDSTRMSVEPVSSEIYNAESRNKDDGKVHLKWKMEVKEMAKKAATGQLTVPPWHPQSSLTLESEAENEPDALLQPPIRSPENTDWQRVIEYHRENDEPRGNGKFDKTGNNDCDSDQHGRQPRLGSFTSIRHPSPRQKEQPEHSEAFQASSDTLVAVEKSYSHQSMQSTCSPESSEDITDEFLTPDGEYFYSSTAQENLALETSSPIEEDFEGIQGAFAQPQVSGEEKFQMRKILGKNAEILPRSQFQPVRSTEDEQEETSKESPKELKEKDISLTDIQDLSSISYEPDSSFKEASCKTPKINHAPTSVSTPLSPGSVSSAASQYKDCLESITFQVKTEFASCWNSQEFIQTLSDDFISVRERAKKLDSLLTSSETPPSRLTGLKRLSSFIGAGSPSLVKACDSSPPHATQRRSLPKVEAFSQHHIDELPPPSQELLDDIELLKQQQGSSTVLHENTASDGGGTANDQRHLEEQETDSKKEDSSMLLSKETEDLGEDTERAHSTLDEDLERWLQPPEESVELQDLPKGSERETNIKDQKVGEEKRKREDSITPERRKSEGVLGTSEEDELKSCFWKRLGWSESSRIIVLDQSDLSD,1491,NP_938207.2.csv,refseq-TEX14-NM_198393.3_clinical_seed_0_final,refseq-TEX14-NM_198393.3.a2m,Invitae,refseq-TEX14-NM_198393.3_theta_0.2.npy,1,1491,1491
+NP_940905.2,MVDLESEVPPLPPRYRFRDLLLGDQGWQNDDRVQVEFYMNENTFKERLKLFFIKNQRSSLRIRLFNFSLKLLSCLLYIIRVLLENPSQGNEWSHIFWVNRSLPLWGLQVSVALISLFETILLGYLSYKGNIWEQILRIPFILEIINAVPFIISIFWPSLRNLFVPVFLNCWLAKHALENMINDLHRAIQRTQSAMFNQVLILISTLLCLIFTCICGIQHLERIGKKLNLFDSLYFCIVTFSTVGFGDVTPETWSSKLFVVAMICVALVVLPIQFEQLAYLWMERQKSGGNYSRHRAQTEKHVVLCVSSLKIDLLMDFLNEFYAHPRLQDYYVVILCPTEMDVQVRRVLQIPMWSQRVIYLQGSALKDQDLLRAKMDDAEACFILSSRCEVDRTSSDHQTILRAWAVKDFAPNCPLYVQILKPENKFHIKFADHVVCEEEFKYAMLALNCICPATSTLITLLVHTSRGQEGQQSPEQWQKMYGRCSGNEVYHIVLEESTFFAEYEGKSFTYASFHAHKKFGVCLIGVRREDNKNILLNPGPRYIMNSTDICFYINITKEENSAFKNQDQQRKSNVSRSFYHGPSRLPVHSIIASMGTVAIDLQDTSCRSASGPTLSLPTEGSKEIRRPSIAPVLEVADTSSIQTCDLLSDQSEDETTPDEEMSSNLEYAKGYPPYSPYIGSSPTFCHLLHEKVPFCCLRLDKSCQHNYYEDAKAYGFKNKLIIVAAETAGNGLYNFIVPLRAYYRPKKELNPIVLLLDNPPDMHFLDAICWFPMVYYMVGSIDNLDDLLRCGVTFAANMVVVDKESTMSAEEDYMADAKTIVNVQTLFRLFSSLSIITELTHPANMRFMQFRAKDCYSLALSKLEKKERERGSNLAFMFRLPFAAGRVFSISMLDTLLYQSFVKDYMISITRLLLGLDTTPGSGFLCSMKITADDLWIRTYARLYQKLCSSTGDVPIGIYRTESQKLTTSESQISISVEEWEDTKDSKEQGHHRSNHRNSTSSDQSDHPLLRRKSMQWARRLSRKGPKHSGKTAEKITQQRLNLYRRSERQELAELVKNRMKHLGLSTVGYDEMNDHQSTLSYILINPSPDTRIELNDVVYLIRPDPLAYLPNSEPSRRNSICNVTGQDSREETQL,1135,NP_940905.2.csv,refseq-KCNT2-NM_198503.3_clinical_seed_0_final,refseq-KCNT2-NM_198503.3.a2m,Invitae,refseq-KCNT2-NM_198503.3.npy,1,1135,1135
+NP_940916.2,MAAPGGRGRSLSGLLPAQTSLEYALLDAVTQQEKDSLVYQYLQKVDGWEQDLSVPEFPEGLEWLNTEEPISVYKDLCGKIVVLDFFTYCCINCIHLLPDLHALEHTYSDKDGLLIIGVHSAKFPNEKVLDNIKSAVLRYNITHPMVNDADASLWQELEVSCWPTLVILGPRGNMLFSLIGEGHKDKLFLYTSIALKYYKDRGQIRDNKIGIKLYKDSLPPSPLLFPGKVTVDQVTDRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESTFNSPQGVAIMNNIIYVADTENHLIRKIDLEAEKVSTVAGIGIQGTDKEGGAKGEQQPISSPWDVVFGTSGSEVQRGDILWIAMAGTHQIWALLLDSGKLPKKNELTKGTCLRFAGSGNEENRNNAYPHKAGFAQPSGLSLASEDPWSCLFVADSESSTVRTVSLKDGAVKHLVGGERDPMNLFAFGDVDGVGINAKLQHPLGVTWDKKRNLLYVADSYNHKIKVVDPKTKNCTTLAGTGDTNNVTSSSFTESTFNEPGGLCIGENGELLYVADTNNHQIKVMDLETKMVSVLPIFRSENAVVDGPFLVEKQKTLPKLPKSAPSIRLSPVTACAGQTLQFKLRLDLPSGSKLTEGVSSCWFLTAEGNEWLLQGQIAAGDIENISSQPTISLQIPDDCLSLEAIVSVSVFLYYCSADSSACMMKAILFSQPLQITDTQQGCIAPVELRYVF,726,NP_940916.2.csv,refseq-NHLRC2-NM_198514.3_clinical_seed_0_final,refseq-NHLRC2-NM_198514.3.a2m,Invitae,refseq-NHLRC2-NM_198514.3.npy,1,726,726
+NP_940927.2,MGLEAQRLPGAEEAPVRVALRVRPLLPKELLHGHQSCLQVEPGLGRVTLGRDRHFGFHVVLAEDAGQEAVYQACVQPLLEAFFEGFNATVFAYGQTGSGKTYTMGEASVASLLEDEQGIVPRAMAEAFKLIDENDLLDCLVHVSYLEVYKEEFRDLLEVGTASRDIQLREDERGNVVLCGVKEVDVEGLDEVLSLLEMGNAARHTGATHLNHLSSRSHTVFTVTLEQRGRAPSRLPRPAPGQLLVSKFHFVDLAGSERVLKTGSTGERLKESIQINSSLLALGNVISALGDPQRRGSHIPYRDSKITRILKDSLGGNAKTVMIACVSPSSSDFDETLNTLNYASRAQNIRNRATVNWRPEAERPPEETASGARGPPRHRSETRIIHRGRRAPGPATASAAAAMRLGAECARYRACTDAAYSLLRELQAEPGLPGAAARKVRDWLCAVEGERSALSSASGPDSGIESASVEDQAAQGAGGRKEDEGAQQLLTLQNQVARLEEENRDFLAALEDAMEQYKLQSDRLREQQEEMVELRLRLELVRPGWGGPRLLNGLPPGSFVPRPHTAPLGGAHAHVLGMVPPACLPGDEVGSEQRGEQVTNGREAGAELLTEVNRLGSGSSAASEEEEEEEEPPRRTLHLRRNRISNCSQRAGARPGSLPERKGPELCLEELDAAIPGSRAVGGSKARVQARQVPPATASEWRLAQAQQKIRELAINIRMKEELIGELVRTGKAAQALNRQHSQRIRELEQEAEQVRAELSEGQRQLRELEGKELQDAGERSRLQEFRRRVAAAQSQVQVLKEKKQATERLVSLSAQSEKRLQELERNVQLMRQQQGQLQRRLREETEQKRRLEAEMSKRQHRVKELELKHEQQQKILKIKTEEIAAFQRKRRSGSNGSVVSLEQQQKIEEQKKWLDQEMEKVLQQRRALEELGEELHKREAILAKKEALMQEKTGLESKRLRSSQALNEDIVRVSSRLEHLEKELSEKSGQLRQGSAQSQQQIRGEIDSLRQEKDSLLKQRLEIDGKLRQGSLLSPEEERTLFQLDEAIEALDAAIEYKNEAITCRQRVLRASASLLSQCEMNLMAKLSYLSSSETRALLCKYFDKVVTLREEQHQQQIAFSELEMQLEEQQRLVYWLEVALERQRLEMDRQLTLQQKEHEQNMQLLLQQSRDHLGEGLADSRRQYEARIQALEKELGRYMWINQELKQKLGGVNAVGHSRGGEKRSLCSEGRQAPGNEDELHLAPELLWLSPLTEGAPRTREETRDLVHAPLPLTWKRSSLCGEEQGSPEELRQREAAEPLVGRVLPVGEAGLPWNFGPLSKPRRELRRASPGMIDVRKNPL,1343,NP_940927.2.csv,refseq-KIF7-NM_198525.2_clinical_seed_0_final,refseq-KIF7-NM_198525.2.a2m,Invitae,refseq-KIF7-NM_198525.2.npy,1,1343,1343
+NP_940978.2,MAGRSHPGPLRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEILNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDTRIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPDACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGPCDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEATACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGGIWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPATGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCPMLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGPPVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMADPKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAFRAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATARATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPPKPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTLRAYHTLRLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSAVPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAAVALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECGDHPCLPNPCHGGAPCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP,2045,NP_940978.2.csv,refseq-AGRN-NM_198576.3_clinical_seed_0_final,refseq-AGRN-NM_198576.3.a2m,Invitae,refseq-AGRN-NM_198576.3.npy,1,2045,2045
+NP_940988.2,MAAEASESGPALHELMREAEISLLECKVCFEKFGHRQQRRPRNLSCGHVVCLACVAALAHPRTLALECPFCRRACRGCDTSDCLPVLHLIELLGSALRQSPAAHRAAPSAPGALTCHHTFGGWGTLVNPTGLALCPKTGRVVVVHDGRRRVKIFDSGGGCAHQFGEKGDAAQDIRYPVDVTITNDCHVVVTDAGDRSIKVFDFFGQIKLVIGGQFSLPWGVETTPQNGIVVTDAEAGSLHLLDVDFAEGVLRRTERLQAHLCNPRGVAVSWLTGAIAVLEHPLALGTGVCSTRVKVFSSSMQLVGQVDTFGLSLYFPSKITASAVTFDHQGNVIVADTSGPAILCLGKPEEFPVPKPMVTHGLSHPVALTFTKENSLLVLDTASHSIKVYKVDWG,395,NP_940988.2.csv,refseq-NHLRC1-NM_198586.2_clinical_seed_0_final,refseq-NHLRC1-NM_198586.2.a2m,Invitae,refseq-NHLRC1-NM_198586.2.npy,1,395,395
+NP_942559.1,MVTVMPLEMEKTISKLMFDFQRNSTSDDDSGCALEEYAWVPPGLKPEQVHQYYSCLPEEKVPYVNSPGEKLRIKQLLHQLPPHDNEVRYCNSLDEEEKRELKLFSSQRKRENLGRGNVRPFPVTMTGAICEQCGGQINGGDIAVFASRAGHGVCWHPPCFVCTVCNELLVDLIYFYQDGKIYCGRHHAECLKPRCAACDEIIFADECTEAEGRHWHMKHFCCFECETVLGGQRYIMKEGRPYCCHCFESLYAEYCDTCAQHIGIDQGQMTYDGQHWHATETCFCCAHCKKSLLGRPFLPKQGQIFCSRACSAGEDPNGSDSSDSAFQNARAKESRRSAKIGKNKGKTEEPMLNQHSQLQVSSNRLSADVDPLSLQMDMLSLSSQTPSLNRDPIWRSREEPYHYGNKMEQNQTQSPLQLLSQCNIRTSYSPGGQGAGAQPEMWGKHFSNPKRSSSLAMTGHAGSFIKECREDYYPGRLRSQESYSDMSSQSFSETRGSIQVPKYEEEEEEEGGLSTQQCRTRHPISSLKYTEDMTPTEQTPRGSMESLALSNATGLSADGGAKRQEHLSRFSMPDLSKDSGMNVSEKLSNMGTLNSSMQFRSAESVRSLLSAQQYQEMEGNLHQLSNPIGYRDLQSHGRMHQSFDFDGGMAGSKLPGQEGVRIQPMSERTRRRATSRDDNRRFRPHRSRRSRRSRSDNALHLASEREAISRLKDRPPLRAREDYDQFMRQRSFQESMGHGSRRDLYGQCPRTVSDLALQNAFGDRWGPYFAEYDWCSTCSSSSESDNEGYFLGEPIPQPARLRYVTSDELLHKYSSYGLPKSSTLGGRGQLHSRKRQKSKNCIIS,844,NP_942559.1.csv,refseq-PRICKLE2-NM_198859.3_clinical_seed_0_final,refseq-PRICKLE2-NM_198859.3.a2m,Invitae,refseq-PRICKLE2-NM_198859.3.npy,1,844,844
+NP_945185.1,MNSMKTEENKSFSAMEDDQRTRPEVSKDTVMKQTHADTPVDHCLSGIRKCSSTFKLKSEVNKHETALEMQNPNLNNKECCFTFTLNGNSRKLDRSVFTAYGKPSESIYSALSANDYFSERIKNQFNKNIIVYEEKTIDGHINLGMPLKCLPSDSHFKITFGQRKSSKEDGHILRQCENPNMECILFHVVAIGRTRKKIVKINELHEKGSKLCIYALKGETIEGALCKDGRFRSDIGEFEWKLKEGHKKIYGKQSMVDEVSGKVLEMDISKKKALQQKDIHKKIKQNESATDEINHQSLIQSKKKVHKPKKDGETKDVEHSREQILPPQDLSHYIKDKTRQTIPRIRNYYFCSLPRKYRQINSQVRRRPHLGRRYAINLDVQKEAINLLKNYQTLNEAIMHQYPNFKEEAQWVRKYFREEQKRMNLSPAKQFNIYKKDFGKMTANSVSVATCEQLTYYSKSVGFMQWDNNGNTGNATCFVFNGGYIFTCRHVVHLMVGKNTHPSLWPDIISKCAKVTFTYTEFCPTPDNWFSIEPWLKVSNENLDYAILKLKENGNAFPPGLWRQISPQPSTGLIYLIGHPEGQIKKIDGCTVIPLNERLKKYPNDCQDGLVDLYDTTSNVYCMFTQRSFLSEVWNTHTLSYDTCFSDGSSGSPVFNASGKLVALHTFGLFYQRGFNVHALIEFGYSMDSILCDIKKTNESLYKSLNDEKLETYDEEKGKQESSLQDHQIEPMEC,734,NP_945185.1.csv,refseq-FAM111B-NM_198947.3_clinical_seed_0_final,refseq-FAM111B-NM_198947.3.a2m,Invitae,refseq-FAM111B-NM_198947.3.npy,1,734,734
+NP_945316.1,MQRRLVQQWSVAVFLLSYAVPSCGRSVEGLSRRLKRAVSEHQLLHDKGKSIQDLRRRFFLHHLIAEIHTAEIRATSEVSPNSKPSPNTKNHPVRFGSDDEGRYLTQETNKVETYKEQPLKTPGKKKKGKPGKRKEQEKKKRRTRSAWLDSGVTGSGLEGDHLSDTSTTSLELDSRRH,177,NP_945316.1.csv,refseq-PTHLH-NM_198965.1_clinical_seed_0_final,refseq-PTHLH-NM_198965.1.a2m,Invitae,refseq-PTHLH-NM_198965.1.npy,1,177,177
+NP_945345.2,MAGIRVTKVDWQRSRNGAAHHTQEYPCPELVVRRGQSFSLTLELSRALDCEEILIFTMETGPRASEALHTKAVFQTSELERGEGWTAAREAQMEKTLTVSLASPPSAVIGRYLLSIRLSSHRKHSNRRLGEFVLLFNPWCAEDDVFLASEEERQEYVLSDSGIIFRGVEKHIRAQGWNYGQFEEDILNICLSILDRSPGHQNNPATDVSCRHNPIYVTRVISAMVNSNNDRGVVQGQWQGKYGGGTSPLHWRGSVAILQKWLKGRYKPVKYGQCWVFAGVLCTVLRCLGIATRVVSNFNSAHDTDQNLSVDKYVDSFGRTLEDLTEDSMWNFHVWNESWFARQDLGPSYNGWQVLDATPQEESEGVFRCGPASVTAIREGDVHLAHDGPFVFAEVNADYITWLWHEDESRERVYSNTKKIGRCISTKAVGSDSRVDITDLYKYPEGSRKERQVYSKAVNRLFGVEASGRRIWIRRAGGRCLWRDDLLEPATKPSIAGKFKVLEPPMLGHDLRLALCLANLTSRAQRVRVNLSGATILYTRKPVAEILHESHAVRLGPQEEKRIPITISYSKYKEDLTEDKKILLAAMCLVTKGEKLLVEKDITLEDFITIKVLGPAMVGVAVTVEVTVVNPLIERVKDCALMVEGSGLLQEQLSIDVPTLEPQERASVQFDITPSKSGPRQLQVDLVSPHFPDIKGFVIVHVATAK,706,NP_945345.2.csv,refseq-TGM6-NM_198994.2_clinical_seed_0_final,refseq-TGM6-NM_198994.2.a2m,Invitae,refseq-TGM6-NM_198994.2.npy,1,706,706
+NP_945350.1,MDHAEENEILAATQRYYVERPIFSHPVLQERLHTKDKVPDSIADKLKQAFTCTPKKIRNIIYMFLPITKWLPAYKFKEYVLGDLVSGISTGVLQLPQGLAFAMLAAVPPIFGLYSSFYPVIMYCFLGTSRHISIGPFAVISLMIGGVAVRLVPDDIVIPGGVNATNGTEARDALRVKVAMSVTLLSGIIQFCLGVCRFGFVAIYLTEPLVRGFTTAAAVHVFTSMLKYLFGVKTKRYSGIFSVVYSTVAVLQNVKNLNVCSLGVGLMVFGLLLGGKEFNERFKEKLPAPIPLEFFAVVMGTGISAGFNLKESYNVDVVGTLPLGLLPPANPDTSLFHLVYVDAIAIAIVGFSVTISMAKTLANKHGYQVDGNQELIALGLCNSIGSLFQTFSISCSLSRSLVQEGTGGKTQLAGCLASLMILLVILATGFLFESLPQAVLSAIVIVNLKGMFMQFSDLPFFWRTSKIELTIWLTTFVSSLFLGLDYGLITAVIIALLTVIYRTQSPSYKVLGKLPETDVYIDIDAYEEVKEIPGIKIFQINAPIYYANSDLYSNALKRKTGVNPAVIMGARRKAMRKYAKEVGNANMANATVVKADAEVDGEDATKPEEEDGEVKYPPIVIKSTFPEEMQRFMPPGDNVHTVILDFTQVNFIDSVGVKTLAGIVKEYGDVGIYVYLAGCSAQVVNDLTRNRFFENPALWELLFHSIHDAVLGSQLREALAEQEASAPPSQEDLEPNATPATPEA,744,NP_945350.1.csv,refseq-SLC26A5-NM_198999.2_clinical_seed_0_final,refseq-SLC26A5-NM_198999.2.a2m,Invitae,refseq-SLC26A5-NM_198999.2.npy,1,744,744
+NP_954634.1,MVGRSCGVATQRQGGGQRPTNLALTLSSSPAHSTALPWLPPRSLQLLSGHSVPAQPTPHLPSACGGPTRVTLGEERAWRSHGSNAGGHTCLPRRTAGAGSLTPGGERGGNNAGHTVVVDGKEYDFHLLPSGIINTKAVSFIGNGVVIHLPGLFEEAEKNEKKGLKDWEKRLIISDRAHLVFDFHQAVDGLQEVQRQAQEGKNIGTTKKGIGPTYSSKAARTGLRICDLLSDFDEFSSRFKNLAHQHQSMFPTLEIDIEGQLKRLKGFAERIRPMVRDGVYFMYEALHGPPKKILVEGANAALLDIDFGTYPFVTSSNCTVGGVCTGLGIPPQNIGDVYGVVKAYTTRVGIGAFPTEQINEIGGLLQTRGHEWGVTTGRKRRCGWLDLMILRYAHMVNGFTALALTKLDILDVLGEVKVGVSYKLNGKRIPYFPANQEMLQKVEVEYETLPGWKADTTGARRWEDLPPQAQNYIRFVENHVGVAVKWVGVGKSRESMIQLF,500,NP_954634.1.csv,refseq-ADSS1-NM_199165.2_clinical_seed_0_final,refseq-ADSS1-NM_199165.2.a2m,Invitae,refseq-ADSS1-NM_199165.2.npy,1,500,500
+NP_954659.1,MSKSFQQSSLSRDSQGHGRDLSAAGIGLLAAATQSLSMPASLGRMNQGTARLASLMNLGMSSSLNQQGAHSALSSASTSSHNLQSIFNIGSRGPLPLSSQHRGDADQASNILASFGLSARDLDELSRYPEDKITPENLPQILLQLKRRRTEEGPTLSYGRDGRSATREPPYRVPRDDWEEKRHFRRDSFDDRGPSLNPVLDYDHGSRSQESGYYDRMDYEDDRLRDGERCRDDSFFGETSHNYHKFDSEYERMGRGPGPLQERSLFEKKRGAPPSSNIEDFHGLLPKGYPHLCSICDLPVHSNKEWSQHINGASHSRRCQLLLEIYPEWNPDNDTGHTMGDPFMLQQSTNPAPGILGPPPPSFHLGGPAVGPRGNLGAGNGNLQGPRHMQKGRVETSRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKKPEGKPDQKFDQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMRMKSQAFIEMETREDAMAMVDHCLKKALWFQGRCVKVDLSEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPDGKESPSDKKSKTDGSQKTESSTEGKEQEEKSGEDGEKDTKDDQTEQEPNMLLESEDELLVDEEEAAALLESGSSVGDETDLANLGDVASDGKKEPSDKAVKKDGSASAAAKKKLKKVDKIEELDQENEAALENGIKNEENTEPGAESSENADDPNKDTSENADGQSDENKDDYTIPDEYRIGPYQPNVPVGIDYVIPKTGFYCKLCSLFYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEERRQKKET,847,NP_954659.1.csv,refseq-MATR3-NM_199189.2_clinical_seed_0_final,refseq-MATR3-NM_199189.2.a2m,Invitae,refseq-MATR3-NM_199189.2.npy,1,847,847
+NP_954712.1,MATLLSHPQQRPPFLRQAIKIRRRRVRDLQDPPPQMAPEIQPPSHHFSPEQRALLYEDALYTVLHRLGHPEPNHVTEASELLRYLQEAFHVEPEEHQQTLQRVRELEKPIFCLKATVKQAKGILGKDVSGFSDPYCLLGIEQGVGVPGGSPGSRHRQKAVVRHTIPEEETHRTQVITQTLNPVWDETFILEFEDITNASFHLDMWDLDTVESVRQKLGELTDLHGLRRIFKEARKDKGQDDFLGNVVLRLQDLRCREDQWYPLEPRTETYPDRGQCHLQFQLIHKRRATSASRSQPSYTVHLHLLQQLVSHEVTQHEAGSTSWDGSLSPQAATVLFLHATQKDLSDFHQSMAQWLAYSRLYQSLEFPSSCLLHPITSIEYQWIQGRLKAEQQEELAASFSSLLTYGLSLIRRFRSVFPLSVSDSPARLQSLLRVLVQMCKMKAFGELCPNTAPLPQLVTEALQTGTTEWFHLKQQHHQPMVQGIPEAGKALLGLVQDVIGDLHQCQRTWDKIFHNTLKIHLFSMAFRELQWLVAKRVQDHTTVVGDVVSPEMGESLFQLYISLKELCQLRMSSSERDGVLALDNFHRWFQPAIPSWLQKTYNEALARVQRAVQMDELVPLGELTKHSTSAVDLSTCFAQISHTARQLDWPDPEEAFMITVKFVEDTCRLALVYCSLIKARARELSSGQKDQGQAANMLCVVVNDMEQLRLVIGKLPAQLAWEALEQRVGAVLEQGQLQNTLHAQLQSALAGLGHEIRTGVRTLAEQLEVGIAKHIQKLVGVRESVLPEDAILPLMKFLEVELCYMNTNLVQENFSSLLTLLWTHTLTVLVEAAASQRSSSLASNRLKIALQNLEICFHAEGCGLPPKALHTATFQALQRDLELQAASSRELIRKYFCSRIQQQAETTSEELGAVTVKASYRASEQKLRVELLSASSLLPLDSNGSSDPFVQLTLEPRHEFPELAARETQKHKKDLHPLFDETFEFLVPAEPCRKAGACLLLTVLDYDTLGADDLEGEAFLPLREVPGLSGSEEPGEVPQTRLPLTYPAPNGDPILQLLEGRKGDREAQVFVRLRRHRAKQASQHALRPAP,1090,NP_954712.1.csv,refseq-UNC13D-NM_199242.2_clinical_seed_0_final,refseq-UNC13D-NM_199242.2.a2m,Invitae,refseq-UNC13D-NM_199242.2.npy,1,1090,1090
+NP_954986.2,MPTPDATTPQAKGFRRAVSELDAKQAEAIMVRGQGAPGPSLTGSPWPGTAAPAASYTPTPRSPRFIGRRQSLIEDARKEREAAVAAAAAAVPSEPGDPLEAVAFEEKEGKAVLNLLFSPRATKPSALSRAVKVFETFEAKIHHLETRPAQRPRAGGPHLEYFVRLEVRRGDLAALLSGVRQVSEDVRSPAGPKVPWFPRKVSELDKCHHLVTKFDPDLDLDHPGFSDQVYRQRRKLIAEIAFQYRHGDPIPRVEYTAEEIATWKEVYTTLKGLYATHACGEHLEAFALLERFSGYREDNIPQLEDVSRFLKERTGFQLRPVAGLLSARDFLASLAFRVFQCTQYIRHASSPMHSPEPDCCHELLGHVPMLADRTFAQFSQDIGLASLGASDEEIEKLSTLYWFTVEFGLCKQNGEVKAYGAGLLSSYGELLHCLSEEPEIRAFDPEAAAVQPYQDQTYQSVYFVSESFSDAKDKLRSYASRIQRPFSVKFDPYTLAIDVLDSPQAVRRSLEGVQDELDTLAHALSAIG,528,NP_954986.2.csv,refseq-TH-NM_199292.2_clinical_seed_0_final,refseq-TH-NM_199292.2.a2m,Invitae,refseq-TH-NM_199292.2.npy,1,528,528
+NP_955366.1,MEQKPSKVECGSDPEENSARSPDGKRKRKNGQCSLKTSMSGYIPSYLDKDEQCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPTYSCKYDSCCVIDKITRNQCQLCRFKKCIAVGMAMDLVLDDSKRVAKRKLIEQNRERRRKEEMIRSLQQRPEPTPEEWDLIHIATEAHRSTNAQGSHWKQRRKFLPDDIGQSPIVSMPDGDKVDLEAFSEFTKIITPAITRVVDFAKKLPMFSELPCEDQIILLKGCCMEIMSLRAAVRYDPESDTLTLSGEMAVKREQLKNGGLGVVSDAIFELGKSLSAFNLDDTEVALLQAVLLMSTDRSGLLCVDKIEKSQEAYLLAFEHYVNHRKHNIPHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTELFPPLFLEVFEDQEV,410,NP_955366.1.csv,refseq-THRA-NM_199334.3_clinical_seed_0_final,refseq-THRA-NM_199334.3.a2m,Invitae,refseq-THRA-NM_199334.3.npy,1,410,410
+NP_957705.1,MALKNINYLLIFYLSFSLLIYIKNSFCNKNNTRCLSNSCQNNSTCKDFSKDNDCSCSDTANNLDKDCDNMKDPCFSNPCQGSATCVNTPGERSFLCKCPPGYSGTICETTIGSCGKNSCQHGGICHQDPIYPVCICPAGYAGRFCEIDHDECASSPCQNGAVCQDGIDGYSCFCVPGYQGRHCDLEVDECASDPCKNEATCLNEIGRYTCICPHNYSGVNCELEIDECWSQPCLNGATCQDALGAYFCDCAPGFLGDHCELNTDECASQPCLHGGLCVDGENRYSCNCTGSGFTGTHCETLMPLCWSKPCHNNATCEDSVDNYTCHCWPGYTGAQCEIDLNECNSNPCQSNGECVELSSEKQYGRITGLPSSFSYHEASGYVCICQPGFTGIHCEEDVNECSSNPCQNGGTCENLPGNYTCHCPFDNLSRTFYGGRDCSDILLGCTHQQCLNNGTCIPHFQDGQHGFSCLCPSGYTGSLCEIATTLSFEGDGFLWVKSGSVTTKGSVCNIALRFQTVQPMALLLFRSNRDVFVKLELLSGYIHLSIQVNNQSKVLLFISHNTSDGEWHFVEVIFAEAVTLTLIDDSCKEKCIAKAPTPLESDQSICAFQNSFLGGLPVGMTSNGVALLNFYNMPSTPSFVGCLQDIKIDWNHITLENISSGSSLNVKAGCVRKDWCESQPCQSRGRCINLWLSYQCDCHRPYEGPNCLREYVAGRFGQDDSTGYVIFTLDESYGDTISLSMFVRTLQPSGLLLALENSTYQYIRVWLERGRLAMLTPNSPKLVVKFVLNDGNVHLISLKIKPYKIELYQSSQNLGFISASTWKIEKGDVIYIGGLPDKQETELNGGFFKGCIQDVRLNNQNLEFFPNPTNNASLNPVLVNVTQGCAGDNSCKSNPCHNGGVCHSRWDDFSCSCPALTSGKACEEVQWCGFSPCPHGAQCQPVLQGFECIANAVFNGQSGQILFRSNGNITRELTNITFGFRTRDANVIILHAEKEPEFLNISIQDSRLFFQLQSGNSFYMLSLTSLQSVNDGTWHEVTLSMTDPLSQTSRWQMEVDNETPFVTSTIATGSLNFLKDNTDIYVGDRAIDNIKGLQGCLSTIEIGGIYLSYFENVHGFINKPQEEQFLKISTNSVVTGCLQLNVCNSNPCLHGGNCEDIYSSYHCSCPLGWSGKHCELNIDECFSNPCIHGNCSDRVAAYHCTCEPGYTGVNCEVDIDNCQSHQCANGATCISHTNGYSCLCFGNFTGKFCRQSRLPSTVCGNEKTNLTCYNGGNCTEFQTELKCMCRPGFTGEWCEKDIDECASDPCVNGGLCQDLLNKFQCLCDVAFAGERCEVDLADDLISDIFTTIGSVTVALLLILLLAIVASVVTSNKRATQGTYSPSRQEKEGSRVEMWNLMPPPAMERLI,1406,NP_957705.1.csv,CRUM1_HUMAN_b07_clinical_seed_0_final,CRUM1_HUMAN_b07.a2m,EVE,CRUM1_HUMAN_b07_theta_0.2.npy,1,1406,1406
+NP_958850.1,MWCIVLFSLLAWVYAEPTMYGEILSPNYPQAYPSEVEKSWDIEVPEGYGIHLYFTHLDIELSENCAYDSVQIISGDTEEGRLCGQRSSNNPHSPIVEEFQVPYNKLQVIFKSDFSNEERFTGFAAYYVATDINECTDFVDVPCSHFCNNFIGGYFCSCPPEYFLHDDMKNCGVNCSGDVFTALIGEIASPNYPKPYPENSRCEYQIRLEKGFQVVVTLRREDFDVEAADSAGNCLDSLVFVAGDRQFGPYCGHGFPGPLNIETKSNALDIIFQTDLTGQKKGWKLRYHGDPMPCPKEDTPNSVWEPAKAKYVFRDVVQITCLDGFEVVEGRVGATSFYSTCQSNGKWSNSKLKCQPVDCGIPESIENGKVEDPESTLFGSVIRYTCEEPYYYMENGGGGEYHCAGNGSWVNEVLGPELPKCVPVCGVPREPFEEKQRIIGGSDADIKNFPWQVFFDNPWAGGALINEYWVLTAAHVVEGNREPTMYVGSTSVQTSRLAKSKMLTPEHVFIHPGWKLLEVPEGRTNFDNDIALVRLKDPVKMGPTVSPICLPGTSSDYNLMDGDLGLISGWGRTEKRDRAVRLKAARLPVAPLRKCKEVKVEKPTADAEAYVFTPNMICAGGEKGMDSCKGDSGGAFAVQDPNDKTKFYAAGLVSWGPQCGTYGLYTRVKNYVDWIMKTMQENSTPRED,688,NP_958850.1.csv,refseq-C1S-NM_201442.2_clinical_seed_0_final,refseq-C1S-NM_201442.2.a2m,Invitae,refseq-C1S-NM_201442.2.npy,1,688,688
+NP_963883.2,MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCSSVPSSPSFCAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMSGYQHHLNPEALNLTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDMGAGHHHGAHHAAHHHHAAHHHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLAGRGGPGSAGGAGFPREPSPPQAGPGGAKGTADFFL,353,NP_963883.2.csv,refseq-MAFA-NM_201589.3_clinical_seed_0_final,refseq-MAFA-NM_201589.3.a2m,Invitae,refseq-MAFA-NM_201589.3.npy,1,353,353
+NP_963925.2,MAQGLEVALTDLQSSRNNVRHHTEEITVDHLLVRRGQAFNLTLYFRNRSFQPGLDNIIFVVETGPLPDLALGTRAVFSLARHHSPSPWIAWLETNGATSTEVSLCAPPTAAVGRYLLKIHIDSFQGSVTAYQLGEFILLFNPWCPEDAVYLDSEPQRQEYVMNDYGFIYQGSKNWIRPCPWNYGQFEDKIIDICLKLLDKSLHFQTDPATDCALRGSPVYVSRVVCAMINSNDDNGVLNGNWSENYTDGANPAEWTGSVAILKQWNATGCQPVRYGQCWVFAAVMCTVMRCLGIPTRVITNFDSGHDTDGNLIIDEYYDNTGRILGNKKKDTIWNFHVWNECWMARKDLPPAYGGWQVLDATPQEMSNGVYCCGPASVRAIKEGEVDLNYDTPFVFSMVNADCMSWLVQGGKEQKLHQDTSSVGNFISTKSIQSDERDDITENYKYEEGSLQERQVFLKALQKLKARSFHGSQRGAELQPSRPTSLSQDSPRSLHTPSLRPSDVVQVSLKFKLLDPPNMGQDICFVLLALNMSSQFKDLKVNLSAQSLLHDGSPLSPFWQDTAFITLSPKEAKTYPCKISYSQYSQYLSTDKLIRISALGEEKSSPEKILVNKIITLSYPSITINVLGAAVVNQPLSIQVIFSNPLSEQVEDCVLTVEGSGLFKKQQKVFLGVLKPQHQASIILETVPFKSGQRQIQANMRSNKFKDIKGYRNVYVDFAL,720,NP_963925.2.csv,refseq-TGM5-NM_201631.3_clinical_seed_0_final,refseq-TGM5-NM_201631.3.a2m,Invitae,refseq-TGM5-NM_201631.3.npy,1,720,720
+NP_964012.2,MSGGDTRAAIARPRMAAAHGPVAPSSPEQVTLLPVQRSFFLPPFSGATPSTSLAESVLKVWHGAYNSGLLPQLMAQHSLAMAQNGAVPSEATKRDQNLKRGNWGNQIEFVLTSVGYAVGLGNVWRFPYLCYRNGGGAFMFPYFIMLIFCGIPLFFMELSFGQFASQGCLGVWRISPMFKGVGYGMMVVSTYIGIYYNVVICIAFYYFFSSMTHVLPWAYCNNPWNTHDCAGVLDASNLTNGSRPAALPSNLSHLLNHSLQRTSPSEEYWRLYVLKLSDDIGNFGEVRLPLLGCLGVSWLVVFLCLIRGVKSSGKVVYFTATFPYVVLTILFVRGVTLEGAFDGIMYYLTPQWDKILEAKVWGDAASQIFYSLGCAWGGLITMASYNKFHNNCYRDSVIISITNCATSVYAGFVIFSILGFMANHLGVDVSRVADHGPGLAFVAYPEALTLLPISPLWSLLFFFMLILLGLGTQFCLLETLVTAIVDEVGNEWILQKKTYVTLGVAVAGFLLGIPLTSQAGIYWLLLMDNYAASFSLVVISCIMCVAIMYIYGHRNYFQDIQMMLGFPPPLFFQICWRFVSPAIIFFILVFTVIQYQPITYNHYQYPGWAVAIGFLMALSSVLCIPLYAMFRLCRTDGDTLLQRLKNATKPSRDWGPALLEHRTGRYAPTIAPSPEDGFEVQPLHPDKAQIPIVGSNGSSRLQDSRI,706,NP_964012.2.csv,refseq-SLC6A9-NM_201649.3_clinical_seed_0_final,refseq-SLC6A9-NM_201649.3.a2m,Invitae,refseq-SLC6A9-NM_201649.3.npy,1,706,706
+NP_976035.1,MAASQAVEEMRSRVVLGEFGVRNVHTTDFPGNYSGYDDAWDQDRFEKNFRVDVVHMDENSLEFDMVGIDAAIANAFRRILLAEVPTMAVEKVLVYNNTSIVQDEILAHRLGLIPIHADPRLFEYRNQGDEEGTEIDTLQFRLQVRCTRNPHAAKDSSDPNELYVNHKVYTRHMTWIPLGNQADLFPEGTIRPVHDDILIAQLRPGQEIDLLMHCVKGIGKDHAKFSPVATASYRLLPDITLLEPVEGEAAEELSRCFSPGVIEVQEVQGKKVARVANPRLDTFSREIFRNEKLKKVVRLARVRDHYIFSVESTGVLPPDVLVSEAIKVLMGKCRRFLDELDAVQMD,346,NP_976035.1.csv,refseq-POLR1C-NM_203290.3_clinical_seed_0_final,refseq-POLR1C-NM_203290.3.a2m,Invitae,refseq-POLR1C-NM_203290.3.npy,1,346,346
+NP_981932.1,MYFLTPILVAILCILVVWIFKNADRSMEKKKGEPRTRAEARPWVDEDLKDSSDLHQAEEDADEWQESEENVEHIPFSHNHYPEKEMVKRSQEFYELLNKRRSVRFISNEQVPMEVIDNVIRTAGTAPSGAHTEPWTFVVVKDPDVKHKIRKIIEEEEEINYMKRMGHRWVTDLKKLRTNWIKEYLDTAPILILIFKQVHGFAANGKKKVHYYNEISVSIACGILLAALQNAGLVTVTTTPLNCGPRLRVLLGRPAHEKLLMLLPVGYPSKEATVPDLKRKPLDQIMVTV,289,NP_981932.1.csv,refseq-IYD-NM_203395.2_clinical_seed_0_final,refseq-IYD-NM_203395.2.a2m,Invitae,refseq-IYD-NM_203395.2.npy,1,289,289
+NP_982272.2,MATLPSAERRAFALKINRYSSAEIRKQFTLPPNLGQYHRQSISTSGFPSLQLPQFYDPVEPVDFEGLLMTHLNSLDVQLAQELGDFTDDDLDVVFTPKECRTLQPSLPEEGVELDPHVRDCVQTYIREWLIVNRKNQGSPEICGFKKTGSRKDFHKTLPKQTFESETLECSEPAAQAGPRHLNVLCDVSGKGPVTACDFDLRSLQPDKRLENLLQQVSAEDFEKQNEEARRTNRQAELFALYPSVDEEDAVEIRPVPECPKEHLGNRILVKLLTLKFEIEIEPLFASIALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSVTYPSSDIYLVVKIEKVLQQGEIGDCAEPYTVIKESDGGKSKEKIEKLKLQAESFCQRLGKYRMPFAWAPISLSSFFNVSTLEREVTDVDSVVGRSSVGERRTLAQSRRLSERALSLEENGVGSNFKTSTLSVSSFFKQEGDRLSDEDLFKFLADYKRSSSLQRRVKSIPGLLRLEISTAPEIINCCLTPEMLPVKPFPENRTRPHKEILEFPTREVYVPHTVYRNLLYVYPQRLNFVNKLASARNITIKIQFMCGEDASNAMPVIFGKSSGPEFLQEVYTAVTYHNKSPDFYEEVKIKLPAKLTVNHHLLFTFYHISCQQKQGASVETLLGYSWLPILLNERLQTGSYCLPVALEKLPPNYSMHSAEKVPLQNPPIKWAEGHKGVFNIEVQAVSSVHTQDNHLEKFFTLCHSLESQVTFPIRVLDQKISEMALEHELKLSIICLNSSRLEPLVLFLHLVLDKLFQLSVQPMVIAGQTANFSQFAFESVVAIANSLHNSKDLSKDQHGRNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSSSNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPAAPRPASKKHFHEELALQMVVSTGMVRETVFKYAWFFFELLVKSMAQHVHNMDKRDSFRRTRFSDRFMDDITTIVNVVTSEIAALLVKPQKENEQAEKMNISLAFFLYDLLSLMDRGFVFNLIRHYCSQLSAKLSNLPTLISMRLEFLRILCSHEHYLNLNLFFMNADTAPTSPCPSISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGLLFTELAAALDAEGEGISKVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAALYLPLVGIILDALPQLCDFTVADTRRYRTSGSDEEQEGAGAINQNVALAIAGNNFNLKTSGIVLSSLPYKQYNMLNADTTRNLMICFLWIMKNADQSLIRKWIADLPSTQLNRILDLLFICVLCFEYKGKQSSDKVSTQVLQKSRDVKARLEEALLRGEGARGEMMRRRAPGNDRFPGLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAHLIILDMQENIIQASSALDCKDSLLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATSNFARVKMQVTMSLASLVGRAPDFNEEHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILYDTVKMREFQEDPEMLMDLMYRIAKSYQASPDLRLTWLQNMAEKHTKKKCYTEAAMCLVHAAALVAEYLSMLEDHSYLPVGSVSFQNISSNVLEESVVSEDTLSPDEDGVCAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPILEAHREFRKLTLTHSKLQRAFDSIVNKDHKRMFGTYFRVGFFGSKFGDLDEQEFVYKEPAITKLPEISHRLEAFYGQCFGAEFVEVIKDSTPVDKTKLDPNKAYIQITFVEPYFDEYEMKDRVTYFEKNFNLRRFMYTTPFTLEGRPRGELHEQYRRNTVLTTMHAFPYIKTRISVIQKEEFVLTPIEVAIEDMKKKTLQLAVAINQEPPDAKMLQMVLQGSVGATVNQGPLEVAQVFLAEIPADPKLYRHHNKLRLCFKEFIMRCGEAVEKNKRLITADQREYQQELKKNYNKLKENLRPMIERKIPELYKPIFRVESQKRDSFHRSSFRKCETQLSQGS,2099,NP_982272.2.csv,NP_982272.2_clinical_seed_0_final,NP_982272.2.a2m,popEVE,NP_982272.2_theta_0.2.npy,1,2099,2099
+NP_982301.1,MATFSRQEFFQQLLQGCLLPTAQQGLDQIWLLLAICLACRLLWRLGLPSYLKHASTVAGGFFSLYHFFQLHMVWVVLLSLLCYLVLFLCRHSSHRGVFLSVTILIYLLMGEMHMVDTVTWHKMRGAQMIVAMKAVSLGFDLDRGEVGTVPSPVEFMGYLYFVGTIVFGPWISFHSYLQAVQGRPLSCRWLQKVARSLALALLCLVLSTCVGPYLFPYFIPLNGDRLLRNKKRKARGTMVRWLRAYESAVSFHFSNYFVGFLSEATATLAGAGFTEEKDHLEWDLTVSKPLNVELPRSMVEVVTSWNLPMSYWLNNYVFKNALRLGTFSAVLVTYAASALLHGFSFHLAAVLLSLAFITYVEHVLRKRLARILSACVLSKRCPPDCSHQHRLGLGVRALNLLFGALAIFHLAYLGSLFDVDVDDTTEEQGYGMAYTVHKWSELSWASHWVTFGCWIFYRLIG,461,NP_982301.1.csv,refseq-PORCN-NM_203475.2_clinical_seed_0_final,refseq-PORCN-NM_203475.2.a2m,Invitae,refseq-PORCN-NM_203475.2_theta_0.2.npy,1,461,461
+NP_991331.1,MCPKGYEDSMEFPDHSRHLLQCLSEQRHQGFLCDCTVLVGDAQFRAHRAVLASCSMYFHLFYKDQLDKRDIVHLNSDIVTAPAFALLLEFMYEGKLQFKDLPIEDVLAAASYLHMYDIVKVCKKKLKEKATTEADSTKKEEDASSCSDKVESLSDGSSHIAGDLPSDEDEGEDEKLNILPSKRDLAAEPGNMWMRLPSDSAGIPQAGGEAEPHATAAGKTVASPCSSTESLSQRSVTSVRDSADVDCVLDLSVKSSLSGVENLNSSYFSSQDVLRSNLVQVKVEKEASCDESDVGTNDYDMEHSTVKESVSTNNRVQYEPAHLAPLREDSVLRELDREDKASDDEMMTPESERVQVEGGMESSLLPYVSNILSPAGQIFMCPLCNKVFPSPHILQIHLSTHFREQDGIRSKPAADVNVPTCSLCGKTFSCMYTLKRHERTHSGEKPYTCTQCGKSFQYSHNLSRHAVVHTREKPHACKWCERRFTQSGDLYRHIRKFHCELVNSLSVKSEALSLPTVRDWTLEDSSQELWK,531,NP_991331.1.csv,NP_991331.1_clinical_seed_0_final,NP_991331.1.a2m,popEVE,NP_991331.1_theta_0.2.npy,1,531,531
+NP_995322.1,MQTKGGQTWARRALLLGILWATAHLPLSGTSLPQRLPRATGNSTQCVISPSSEFPEGFFTRQERRDGGIIIYFLIIVYMFMAISIVCDEYFLPSLEIISESLGLSQDVAGTTFMAAGSSAPELVTAFLGVFITKGDIGISTILGSAIYNLLGICAACGLLSNTVSTLSCWPLFRDCAAYTISAAAVLGIIYDNQVYWYEGALLLLIYGLYVLVLCFDIKINQYIIKKCSPCCACLAKAMERSEQQPLMGWEDEGQPFIRRQSRTDSGIFYEDSGYSQLSISLHGLSQVSEDPPSVFNMPEADLKRIFWVLSLPIITLLFLTTPDCRKKFWKNYFVITFFMSAIWISAFTYILVWMVTITGETLEIPDTVMGLTLLAAGTSIPDTIASVLVARKGKGDMAMSNIVGSNVFDMLCLGIPWFIKTAFINGSAPAEVNSRGLTYITISLNISIIFLFLAVHFNGWKLDRKLGIVCLLSYLGLATLSVLYELGIIGNNKIRGCGG,500,NP_995322.1.csv,refseq-SLC24A5-NM_205850.2_clinical_seed_0_final,refseq-SLC24A5-NM_205850.2.a2m,Invitae,refseq-SLC24A5-NM_205850.2.npy,1,500,500
+NP_996879.1,MAPKKEKGGTVNTSSKIWEPSLIAAQFNQNDWQASIAFVVGNQIEDDLLIQALTVAVQVPQRKLFSMVSWQDILQQINEINTLVGSASSKKAKKPVGGNAPLYYEVLTAAKAIMDSGEKLTLPLIGKLLKFQLLQIKFKDQQRRENEKKVIEDKPKLEKDKGKAKSPKEKKAPSAKPAKGKGKDQPEANAPVKKTTQLKRRGEDDHTNRYIDDEPDDGAQHYIIVVGFNNPQLLAIMAELGIPITSVIKISSENYEPLQTHLAAVNQQQEVLLQSEDLEAEKLKKENAIKELKTFWKYLEPVLNNEKPETNLFDVARLEYMVKAADFPSDWSDGEMMLKLGTDIFENIACLMYDILDWKRQHQHYLESMQLINVPQVVNEKPVLEAMPTSEAPQPAVPAPGKKKAQYEEPQAPPPVTSVITTEVDMRYYNYLLNPIREEFISVPLILHCMLEQVVATEEDLVPPSLREPSPRADGLDHRIAAHIVSLLPSLCLSEREKKNLHDIFLSEEENESKAVPKGPLLLNYHDAHAHKKYALQDQKNFDPVQIEQEMQSKLPLWEFLQFPLPPPWNNTKRLATIHELMHFCTSDVLSWNEVERAFKVFTFESLKLSEVDEKGKLKPSGMMCGSDSEMFNIPWDNPARFAKQIRQQYVMKMNTQEAKQKADIKIKDRTLFVDQNLSMSVQDNESNREPSDPSQCDANNMKHSDLNNLKLSVPDNRQLLEQESIMKAQPQHESLEQTTNNEIKDDAVTKADSHEKKPKKMMVEADLEDIKKTQQRSLMDWSFTEHFKPKVLLQVLQEAHKQYRCVDSYYHTQDNSLLLVFHNPMNRQRLHCEYWNIALHSNVGFRNYLELVAKSIQDWITKEEAIYQESKMNEKIIRTRAELELKSSANAKLTSASKIFSIKESKSNKGISKTEISDQEKEKEKEKIPFILEGSLKAWKEEQHRLAEEERLREEKKAEKKGKEAGKKKGKDNAEKEDSRSLKKKSPYKEKSKEEQVKIQEVTEESPHQPEPKITYPFHGYNMGNIPTQISGSNYYLYPSDGGQIEVEKTMFEKGPTFIKVRVVKDNHNFMIHLNDPKEIVKKEEKGDYYLEEEEEGDEEQSLETEVSDAKNKAFSKFGSFSATLENGICLSISYYGSNGMAPEDKDPDLETILNIPSALTPTVVPVIVTVPQSKAKGKIKGKEKPKESLKEEEHPKEEEKKEEEVEPEPVLQETLDVPTFQSLNVSCPSGLLLTFIGQESTGQYVIDEEPTWDIMVRQSYPQRVKHYEFYKTVMPPAEQEASRVITSQGTVVKYMLDGSTQILFADGAVSRSPNSGLICPPSEMPATPHSGDLMDSISQQKSETIPSEITNTKKGKSHKSQSSMAHKGEIHDPPPEAVQTVTPVEVHIGTWFTTTPEGNRIGTKGLERIADLTPLLSFQATDPVNGTVMTTREDKVVIVERKDGTRIVDHADGTRITTFYQVYEDQIILPDDQETTEGPRTVTRQVKCMRVESSRYATVIANCEDSSCCATFGDGTTIIAKPQGTYQVLPPNTGSLYIDKDCSAVYCHESSSNIYYPFQKREQLRAGRYIMRHTSEVICEVLDPEGNTFQVMADGSISTILPEKKLEDDLNEKTEGYDSLSSMHLEKNHQQIYGEHVPRFFVMYADGSGMELLRDSDIEEYLSLAYKESNTVVLQEPVQEQPGTLTITVLRPFHEASPWQVKKEDTIVPPNLRSRSWETFPSVEKKTPGPPFGTQIWKGLCIESKQLVSAPGAILKSPSVLQMRQFIQHEVIKNEVKLRLQVSLKDYINYILKKEDELQEMMVKDSRTEEERGNAADLLKLVMSFPKMEETTKSHVTEVAAHLTDLFKQSLATPPKCPPDTFGKDFFEKTWRHTASSKRWKEKIDKTRKEIETTQNYLMDIKNRIIPPFFKSELNQLYQSQYNHLDSLSKKLPSFTKKNEDANETAVQDTSDLNLDFKPHKVSEQKSSSVPSLPKPEISADKKDFTAQNQTENLTKSPEEAESYEPVKIPTQSLLQDVAGQTRKEKVKLPHYLLSSKPKSQPLAKVQDSVGGKVNTSSVASAAINNAKSSLFGFHLLPSSVKFGVLKEGHTYATVVKLKNVGVDFCRFKVKQPPPSTGLKVTYKPGPVAAGMQTELNIELFATAVGEDGAKGSAHISHNIEIMTEHEVLFLPVEATVLTSSNYDKRPKDFPQGKENPMVQRTSTIYSSTLGVFMSRKVSPH,2223,NP_996879.1.csv,refseq-SPAG17-NM_206996.2_clinical_seed_0_final,refseq-SPAG17-NM_206996.2.a2m,Invitae,refseq-SPAG17-NM_206996.2.npy,1,2223,2223
+NP_996917.1,MEPGLWLLFGLTVTSAAGFVPCSQSGDAGRRGVSQAPTAARSEGDCEETVAGPGEETVAGPGEGTVAPTALQGPSPGSPGQEQAAEGAPEHHRSRRCTCFTYKDKECVYYCHLDIIWINTPEQTVPYGLSNYRGSFRGKRSAGPLPGNLQLSHRPHLRCACVGRYDKACLHFCTQTLDVSSNSRTAEKTDKEEEGKVEVKDQQSKQALDLHHPKLMPGSGLALAPSTCPRCLFQEGAP,238,NP_996917.1.csv,refseq-EDN3-NM_207034.2_clinical_seed_0_final,refseq-EDN3-NM_207034.2.a2m,Invitae,refseq-EDN3-NM_207034.2.npy,1,238,238
+NP_996919.1,MNPQQQRMAAIGTDKELSDLLDFSAMFSPPVNSGKTRPTTLGSSQFSGSGIDERGGTTSWGTSGQPSPSYDSSRGFTDSPHYSDHLNDSRLGAHEGLSPTPFMNSNLMGKTSERGSFSLYSRDTGLPGCQSSLLRQDLGLGSPAQLSSSGKPGTAYYSFSATSSRRRPLHDSAALDPLQAKKVRKVPPGLPSSVYAPSPNSDDFNRESPSYPSPKPPTSMFASTFFMQDGTHNSSDLWSSSNGMSQPGFGGILGTSTSHMSQSSSYGNLHSHDRLSYPPHSVSPTDINTSLPPMSSFHRGSTSSSPYVAASHTPPINGSDSILGTRGNAAGSSQTGDALGKALASIYSPDHTSSSFPSNPSTPVGSPSPLTGTSQWPRPGGQAPSSPSYENSLHSLKNRVEQQLHEHLQDAMSFLKDVCEQSRMEDRLDRLDDAIHVLRNHAVGPSTSLPAGHSDIHSLLGPSHNAPIGSLNSNYGGSSLVASSRSASMVGTHREDSVSLNGNHSVLSSTVTTSSTDLNHKTQENYRGGLQSQSGTVVTTEIKTENKEKDENLHEPPSSDDMKSDDESSQKDIKVSSRGRTSSTNEDEDLNPEQKIEREKERRMANNARERLRVRDINEAFKELGRMCQLHLKSEKPQTKLLILHQAVAVILSLEQQVRERNLNPKAACLKRREEEKVSAVSAEPPTTLPGTHPGLSETTNPMGHM,706,NP_996919.1.csv,refseq-TCF12-NM_207036.1_clinical_seed_0_final,refseq-TCF12-NM_207036.1.a2m,Invitae,refseq-TCF12-NM_207036.1.npy,1,706,706
+NP_996991.1,MALAGLCALLACCWGPAAVLATAAGDVDPSKELECKLKSITVSALPFLRENDLSIMHSPSASEPKLLFSVRNDFPGEMVVVDDLENTELPYFVLEISGNTEDIPLVRWRQQWLENGTLLFHIHHQDGAPSLPGQDPTEEPQHESAEEELRILHISVMGGMIALLLSILCLVMILYTRRRWCKRRRVPQPQKSASAEAANEIHYIPSVLIGGHGRESLRNARVQGHNSSGTLSIRETPILDGYEYDITDLRHHLQRECMNGGEDFASQVTRTLDSLQGCNEKSGMDLTPGSDNAKLSLMNKYKDNIIATSPVDSNHQQATLLSHTSSSQRKRINNKARAGSAFLNPEGDSGTEAENDPQLTFYTDPSRSRRRSRVGSPRSPVNKTTLTLISITSCVIGLVCSSHVNCPLVVKITLHVPEHLIADGSRFILLEGSQLDASDWLNPAQVVLFSQQNSSGPWAMDLCARRLLDPCEHQCDPETGECLCYEGYMKDPVHKHLCIRNEWGTNQGPWPYTIFQRGFDLVLGEQPSDKIFRFTYTLGEGMWLPLSKSFVIPPAELAINPSAKCKTDMTVMEDAVEVREELMTSSSFDSLEVLLDSFGPVRDCSKDNGGCSKNFRCISDRKLDSTGCVCPSGLSPMKDSSGCYDRHIGVDCSDGFNGGCEQLCLQQMAPFPDDPTLYNILMFCGCIEDYKLGVDGRSCQLITETCPEGSDCGESRELPMNQTLFGEMFFGYNNHSKEVAAGQVLKGTFRQNNFARGLDQQLPDGLVVATVPLENQCLEEISEPTPDPDFLTGMVNFSEVSGYPVLQHWKVRSVMYHIKLNQVAISQALSNALHSLDGATSRADFVALLDQFGNHYIQEAIYGFEESCSIWYPNKQVQRRLWLEYEDISKGNSPSDESEERERDPKVLTFPEYITSLSDSGTKHMAAGVRMECHSKGRCPSSCPLCHVTSSPDTPAEPVLLEVTKAAPIYELVTNNQTQRLLQEATMSSLWCSGTGDVIEDWCRCDSTAFGADGLPTCAPLPQPVLRLSTVHEPSSTLVVLEWEHSEPPIGVQIVDYLLRQEKVTDRMDHSKVETETVLSFVDDIISGAKSPCAMPSQVPDKQLTTISLIIRCLEPDTIYMFTLWGVDNTGRRSRPSDVIVKTPCPVVDDVKAQEIADKIYNLFNGYTSGKEQQTAYNTLLDLGSPTLHRVLYHYNQHYESFGEFTWRCEDELGPR,1216,NP_996991.1.csv,refseq-ASTN1-NM_207108.1_clinical_seed_0_final,refseq-ASTN1-NM_207108.1.a2m,Invitae,refseq-ASTN1-NM_207108.1.npy,1,1216,1216
+NP_996994.1,MEEGNNNEEVIHLNNFHCHRGQEWINLRDGPITISDSSDEERIPMLVTPAPQQHEEEDLDDDVILTETNKPQRSRPNLIKPAAQWQDLKRLGEERPKKSRAAFESDKSSYFSVCNNPLFDSGAQDDSEDDYGEFLDLGPPGISEFTKPSGQTEREPKPGPSHNQAANDIVNPRSEQKVIILEEGSLLYTESDPLETQNQSSEDSETELLSNLGESAALADDQAIEEDCWLDHPYFQSLNQQPREITNQVVPQERQPEAELGRLLFQHEFPGPAFPRPEPQQGGISGPSSPQPAHPLGEFEDQQLASDDEEPGPAFPMQESQEPNLENIWGQEAAEVDQELVELLVKETEARFPDVANGFIEEIIHFKNYYDLNVLCNFLLENPDYPKREDRIIINPSSSLLASQDETKLPKIDFFDYSKLTPLDQRCFIQAADLLMADFKVLSSQDIKWALHELKGHYAITRKALSDAIKKWQELSPETSGKRKKRKQMNQYSYIDFKFEQGDIKIEKRMFFLENKRRHCRSYDRRALLPAVQQEQEFYEQKIKEMAEHEDFLLALQMNEEQYQKDGQLIECRCCYGEFPFEELTQCADAHLFCKECLIRYAQEAVFGSGKLELSCMEGSCTCSFPTSELEKVLPQTILYKYYERKAEEEVAAAYADELVRCPSCSFPALLDSDVKRFSCPNPHCRKETCRKCQGLWKEHNGLTCEELAEKDDIKYRTSIEEKMTAARIRKCHKCGTGLIKSEGCNRMSCRCGAQMCYLCRVSINGYDHFCQHPRSPGAPCQECSRCSLWTDPTEDDEKLIEEIQKEAEEEQKRKNGENTFKRIGPPLEKPVEKVQRVEALPRPVPQNLPQPQMPPYAFAHPPFPLPPVRPVFNNFPLNMGPIPAPYVPPLPNVRVNYDFGPIHMPLEHNLPMHFGPQPRHRF,923,NP_996994.1.csv,refseq-RNF216-NM_207111.4_clinical_seed_0_final,refseq-RNF216-NM_207111.4.a2m,Invitae,refseq-RNF216-NM_207111.4.npy,1,923,923
+NP_997005.1,MCASVKYNIRGPALIPRMKTKHRIYYITLFSIVLLGLIATGMFQFWPHSIESSNDWNVEKRSIRDVPVVRLPADSPIPERGDLSCRMHTCFDVYRCGFNPKNKIKVYIYALKKYVDDFGVSVSNTISREYNELLMAISDSDYYTDDINRACLFVPSIDVLNQNTLRIKETAQAMAQLSRWDRGTNHLLFNMLPGGPPDYNTALDVPRDRALLAGGGFSTWTYRQGYDVSIPVYSPLSAEVDLPEKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKHQVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPEEKMSDVYSILQSIPQRQIEEMQRQARWFWEAYFQSIKAIALATLQIINDRIYPYAAISYEEWNDPPAVKWGSVSNPLFLPLIPPQSQGFTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEDSLWPKIRVPLKVVRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRLVGYPGRLHLWDHEMNKWKYESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDIKNWVDAHMNCEDIAMNFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSECINKFASVFGTMPLKVVEHRADPVLYKDDFPEKLKSFPNIGSL,718,NP_997005.1.csv,refseq-EXT2-NM_207122.1_clinical_seed_0_final,refseq-EXT2-NM_207122.1.a2m,Invitae,refseq-EXT2-NM_207122.1.npy,1,718,718
+NP_997229.2,MEPEPEPAAVEVPAGRVLSARELFAARSRSQKLPQRSHGPKDFLPDGSAAQAERLRRCREELWQLLAEQRVERLGSLVAAEWRPEEGFVELKSPAGKFWQTMGFSEQGRQRLHPEEALYLLECGSIHLFHQDLPLSIQEAYQLLLTDHTVTFLQYQVFSHLKRLGYVVRRFQPSSVLSPYERQLNLDASVQHLEDGDGKRKRSSSSPRSINKKAKALDNSLQPKSLAASSPPPCSQPSQCPEEKPQESSPMKGPGGPFQLLGSLGPSPGPAREGVGCSWESGRAENGVTGAGKRRWNFEQISFPNMASDSRHTLLRAPAPELLPANVAGRETDAESWCQKLNQRKEKLSRREREHHAEAAQFQEDVNADPEVQRCSSWREYKELLQRRQVQRSQRRAPHLWGQPVTPLLSPGQASSPAVVLQHISVLQTTHLPDGGARLLEKSGGLEIIFDVYQADAVATFRKNNPGKPYARMCISGFDEPVPDLCSLKRLSYQSGDVPLIFALVDHGDISFYSFRDFTLPQDVGH,526,NP_997229.2.csv,refseq-TSEN54-NM_207346.2_clinical_seed_0_final,refseq-TSEN54-NM_207346.2.a2m,Invitae,refseq-TSEN54-NM_207346.2.npy,1,526,526
+NP_997235.3,MAGLWLGLVWQKLLLWGAASALSLAGASLVLSLLQRVASYARKWQQMRPIPTVARAYPLVGHALLMKPDGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDKSSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHINQEAFNCFFYITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLQILHTFTNSVIAERANEMNANEDCRGDGRGSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGSNPEVQKKVDHELDDVFGKSDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAVIIPYALHRDPRYFPNPEEFQPERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPSNGIWIKLKRRNADER,525,NP_997235.3.csv,refseq-CYP4V2-NM_207352.3_clinical_seed_0_final,refseq-CYP4V2-NM_207352.3.a2m,Invitae,refseq-CYP4V2-NM_207352.3.npy,1,525,525
+NP_997253.2,MSDERRLPGSAVGWLVCGGLSLLANAWGILSVGAKQKKWKPLEFLLCTLAATHMLNVAVPIATYSVVQLRRQRPDFEWNEGLCKVFVSTFYTLTLATCFSVTSLSYHRMWMVCWPVNYRLSNAKKQAVHTVMGIWMVSFILSALPAVGWHDTSERFYTHGCRFIVAEIGLGFGVCFLLLVGGSVAMGVICTAIALFQTLAVQVGRQADRRAFTVPTIVVEDAQGKRRSSIDGSEPAKTSLQTTGLVTTIVFIYDCLMGFPVLVVSFSSLRADASAPWMALCVLWCSVAQALLLPVFLWACDRYRADLKAVREKCMALMANDEESDDETSLEGGISPDLVLERSLDYGYGGDFVALDRMAKYEISALEGGLPQLYPLRPLQEDKMQYLQVPPTRRFSHDDADVWAAVPLPAFLPRWGSGEDLAALAHLVLPAGPERRRASLLAFAEDAPPSRARRRSAESLLSLRPSALDSGPRGARDSPPGSPRRRPGPGPRSASASLLPDAFALTAFECEPQALRRPPGPFPAAPAAPDGADPGEAPTPPSSAQRSPGPRPSAHSHAGSLRPGLSASWGEPGGLRAAGGGGSTSSFLSSPSESSGYATLHSDSLGSAS,609,NP_997253.2.csv,refseq-GPR153-NM_207370.2_clinical_seed_0_final,refseq-GPR153-NM_207370.2.a2m,Invitae,refseq-GPR153-NM_207370.2.npy,1,609,609
+NP_997304.3,MVSVEGRAMSFQSIIHLSLDSPVHAVCVLGTEICLDLSGCAPQKCQCFTIHGSGRVLIDVANTVISEKEDATIWWPLSDPTYATVKMTSPSPSVDADKVSVTYYGPNEDAPVGTAVLYLTGIEVSLEVDIYRNGQVEMSSDKQAKKKWIWGPSGWGAILLVNCNPADVGQQLEDKKTKKVIFSEEITNLSQMTLNVQGPSCILKKYRLVLHTSKEESKKARVYWPQKDNSSTFELVLGPDQHAYTLALLGNHLKETFYVEAIAFPSAEFSGLISYSVSLVEESQDPSIPETVLYKDTVVFRVAPCVFIPCTQVPLEVYLCRELQLQGFVDTVTKLSEKSNSQVASVYEDPNRLGRWLQDEMAFCYTQAPHKTTSLILDTPQAADLDEFPMKYSLSPGIGYMIQDTEDHKVASMDSIGNLMVSPPVKVQGKEYPLGRVLIGSSFYPSAEGRAMSKTLRDFLYAQQVQAPVELYSDWLMTGHVDEFMCFIPTDDKNEGKKGFLLLLASPSACYKLFREKQKEGYGDALLFDELRADQLLSNGREAKTIDQLLADESLKKQNEYVEKCIHLNRDILKTELGLVEQDIIEIPQLFCLEKLTNIPSDQQPKRSFARPYFPDLLRMIVMGKNLGIPKPFGPQIKGTCCLEEKICCLLEPLGFKCTFINDFDCYLTEVGDICACANIRRVPFAFKWWKMVP,694,NP_997304.3.csv,PADI6_HUMAN_b01_clinical_seed_0_final,PADI6_HUMAN_b01.a2m,EVE,PADI6_HUMAN_b01_theta_0.2.npy,1,694,694
+NP_997464.2,MTLWNGVLPFYPQPRHAAGFSVPLLIVILVFLALAASFLLILPGIRGHSRWFWLVRVLLSLFIGAEIVAVHFSAEWFVGTVNTNTSYKAFSAARVTARVRLLVGLEGINITLTGTPVHQLNETIDYNEQFTWRLKENYAAEYANALEKGLPDPVLYLAEKFTPSSPCGLYHQYHLAGHYASATLWVAFCFWLLSNVLLSTPAPLYGGLALLTTGAFALFGVFALASISSVPLCPLRLGSSALTTQYGAAFWVTLATGVLCLFLGGAVVSLQYVRPSALRTLLDQSAKDCSQERGGSPLILGDPLHKQAALPDLKCITTNL,320,NP_997464.2.csv,refseq-DUOXA2-NM_207581.3_clinical_seed_0_final,refseq-DUOXA2-NM_207581.3.a2m,Invitae,refseq-DUOXA2-NM_207581.3.npy,1,320,320
+NP_998764.1,MGDPDLLEVLAEEGEKVNKHIDYSFQMSEQSLSSRETSFLINEETMPAKRFNLFLRRRLMFQKNQQSKDSIFFRDGIRQIDFVLSYVDDVKKDAELKAERRKEFETNLRKTGLELEIEDKRDSEDGRTYFVKIHAPWEVLVTYAEVLGIKMPIKESDIPRPKHTPISYVLGPVRLPLSVKYPHPEYFTAQFSRHRQELFLIEDQATFFPSSSRNRIVYYILSRCPFGIEDGKKRFGIERLLNSNTYSSAYPLHDGQYWKPSEPPNPTNERYTLHQNWARFSYFYKEQPLDLIKNYYGEKIGIYFVFLGFYTEMLFFAAVVGLACFIYGLLSMEHNTSSTEICDPEIGGQMIMCPLCDQVCDYWRLNSTCLASKFSHLFDNESTVFFAIFMGIWVTLFLEFWKQRQARLEYEWDLVDFEEEQQQLQLRPEFEAMCKHRKLNAVTKEMEPYMPLYTRIPWYFLSGATVTLWMSLVVTSMVAVIVYRLSVFATFASFMESDASLKQVKSFLTPQITTSLTGSCLNFIVILILNFFYEKISAWITKMEIPRTYQEYESSLTLKMFLFQFVNFYSSCFYVAFFKGKFVGYPGKYTYLFNEWRSEECDPGGCLIELTTQLTIIMTGKQIFGNIKEAIYPLALNWWRRRKARTNSEKLYSRWEQDHDLESFGPLGLFYEYLETVTQFGFVTLFVASFPLAPLLALINNIVEIRVDAWKLTTQYRRTVASKAHSIGVWQDILYGMAVLSVATNAFIVAFTSDIIPRLVYYYAYSTNATQPMTGYVNNSLSVFLIADFPNHTAPSEKRDFITCRYRDYRYPPDDENKYFHNMQFWHVLAAKMTFIIVMEHVVFLVKFLLAWMIPDVPKDVVERIKREKLMTIKILHDFELNKLKENLGINSNEFAKHVMIEENKAQLAKSTL,913,NP_998764.1.csv,refseq-ANO5-NM_213599.2_clinical_seed_0_final,refseq-ANO5-NM_213599.2.a2m,Invitae,refseq-ANO5-NM_213599.2.npy,1,913,913
+NP_998818.1,MGEPGQSPSPRSSHGSPPTLSTLTLLLLLCGHAHSQCKILRCNAEYVSSTLSLRGGGSSGALRGGGGGGRGGGVGSGGLCRALRSYALCTRRTARTCRGDLAFHSAVHGIEDLMIQHNCSRQGPTAPPPPRGPALPGAGSGLPAPDPCDYEGRFSRLHGRPPGFLHCASFGDPHVRSFHHHFHTCRVQGAWPLLDNDFLFVQATSSPMALGANATATRKLTIIFKNMQECIDQKVYQAEVDNLPVAFEDGSINGGDRPGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQLCVGGCPPSQRLSRSERNRRGAITIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLFPSDAGVPLSSATLLAPLLSGLFVLWLCIQ,426,NP_998818.1.csv,refseq-HJV-NM_213653.3_clinical_seed_0_final,refseq-HJV-NM_213653.3.a2m,Invitae,refseq-HJV-NM_213653.3.npy,1,426,426
+NP_998885.1,MPRGSRSAASRPASRPAAPSAHPPAHPPPSAAAPAPAPSGQPGLMAQMATTAAGVAVGSAVGHVMGSALTGAFSGGSSEPSQPAVQQAPTPAAPQPLQMGPCAYEIRQFLDCSTTQSDLSLCEGFSEALKQCKYYHGLSSLP,142,NP_998885.1.csv,refseq-CHCHD10-NM_213720.2_clinical_seed_0_final,refseq-CHCHD10-NM_213720.2.a2m,Invitae,refseq-CHCHD10-NM_213720.2.npy,1,142,142
diff --git a/reference_files/reference_files_description.md b/reference_files/reference_files_description.md
new file mode 100644
index 0000000..2288fac
--- /dev/null
+++ b/reference_files/reference_files_description.md
@@ -0,0 +1,32 @@
+## ProteinGym reference files
+
+In the reference files, we provide detailed information about all DMS assays included in ProteinGym. There are two reference files: one for the substitution benchmark and one for the indel benchmark.
+
+The meaning of each column in the ProteinGym reference files is provided below:
+- DMS_id (str): Uniquely identifies each DMS assay in ProteinGym. It is obtained as the concatenation of the UniProt ID of the mutated protein, the first author name and the year of publication. If there are several datasets with the same characteristics, another defining attribute of the assay is added to preserve unicity.
+- DMS_filename (str): Name of the processed DMS file.
+- target_seq (str): Sequence of the target protein (reference sequence mutated in the assay).
+- seq_len (int): Length of the target protein sequence.
+- includes_multiple_mutants (bool): Indicates whether the DMS contains mutations that are multiple mutants. Substitution benchmark only.
+- DMS_total_number_mutants (int): Number of rows of the DMS in ProteinGym.
+- DMS_number_single_mutants (int): Number of single amino acid substitutions in the DMS. Substitution benchmark only.
+- DMS_number_multiple_mutants (int): Number of multiple amino acid substitutions in the DMS. Substitution benchmark only.
+- DMS_binarization_cutoff_ProteinGym (float): Cutoff used to divide fitness scores into binary labels.
+- DMS_binarization_method (str): Method used to decide the binarization cutoff (manual or median).
+- region_mutated (str): Region of the target protein that is mutated in the DMS.
+- MSA_filename (str): Name of the MSA file generated based on the reference sequence mutated during the DMS experiment. Note that different reference sequences may be used in different DMS experiments for the same protein. For example, Giacomelli et al. (2018) and Kotler et al. (2018) used slightly different reference sequences in their respective DMS experiments for the P53 protein. We generated different MSAs accordingly.
+- MSA_start (int): Locates the beginning of the first sequence in the MSA with respect to the target sequence. For example, if the MSA covers from position 10 to position 60 of the target sequence, then MSA_start is 10.
+- MSA_end (int): Locates the end of the first sequence in the MSA with respect to the target sequence. For example, if the MSA covers from position 10 to position 60 of the target sequence, then MSA_end is 60.
+- MSA_bitscore (float): Bitscore threshold used to generate the alignment divided by the length of the target protein.
+- MSA_theta (float): Hamming distance cutoff for sequence re-weighting.
+- MSA_num_seqs (int): Number of sequences in the Multiple Sequence Alignment (MSA) used in this work for this DMS.
+- MSA_perc_cov (float): Percentage of positions of the MSA that had a coverage higher than 70% (less than 30% gaps).
+- MSA_num_cov (int): Number of positions of the MSA that had a coverage higher than 70% (less than 30% gaps).
+- MSA_N_eff (float): The effective number of sequences in the MSA defined as the sum of the different sequence weights.
+- MSA_N_eff_L (float): Neff / num_cov.
+- MSA_num_significant (int): Number of evolutionary couplings that are considered significant. Significance is defined by having more than 90% probability of belonging to the log-normal distribution in a Gaussian Mixture Model of normal and log-normal distributions.
+- MSA_num_significant_L (float): MSA_num_significant / num_cov.
+- raw_DMS_filename (str): Name of the raw DMS file.
+- raw_DMS_phenotype_name (str): Name of the column in the raw DMS that we used as fitness score.
+- raw_DMS_directionality (int): Sign of the correlation between the DMS_phenotype column values and protein fitness in the raw DMS files. In any given DMS, the directionality is 1 if higher values of the measurement are associated with higher fitness, and -1 otherwise. For simplicity, we adjusted directionality in the final ProteinGym benchmarks so that a higher value of DMS_score is always associated with higher fitness. Consequently, correlations between model scores and the final DMS_score values should always be positive (unless the predictions from the considered model are worse than random for that DMS).
+- raw_DMS_mutant_column (str): Name of the column in the raw DMS that indicates which mutants were assayed.
\ No newline at end of file
diff --git a/scripts/scoring_DMS_supervised/performance_indels.sh b/scripts/scoring_DMS_supervised/performance_indels.sh
new file mode 100644
index 0000000..572e96a
--- /dev/null
+++ b/scripts/scoring_DMS_supervised/performance_indels.sh
@@ -0,0 +1,13 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export output_performance_file_folder=../../benchmarks/DMS_supervised/indels
+export input_scoring_file=../../benchmarks/raw_score_files/DMS_supervised_indels.csv
+
+python3 ../../proteingym/performance_DMS_supervised_benchmarks.py \
+ --input_scoring_file ${input_scoring_file} \
+ --output_performance_file_folder ${output_performance_file_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_DMS_supervised/performance_substitutions.sh b/scripts/scoring_DMS_supervised/performance_substitutions.sh
new file mode 100644
index 0000000..66ea6c1
--- /dev/null
+++ b/scripts/scoring_DMS_supervised/performance_substitutions.sh
@@ -0,0 +1,13 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export output_performance_file_folder=../../benchmarks/DMS_supervised/substitutions
+export input_scoring_file=../../benchmarks/raw_score_files/DMS_supervised_substitutions.csv
+export DMS_data_folder="path to supervised DMS data"
+
+python ../../proteingym/performance_DMS_supervised_benchmarks.py \
+ --input_scoring_file ${input_scoring_file} \
+ --output_performance_file_folder ${output_performance_file_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/calc_weights_eve.sh b/scripts/scoring_DMS_zero_shot/calc_weights_eve.sh
new file mode 100644
index 0000000..365d252
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/calc_weights_eve.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+
+# EVE example
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+
+python ../../proteingym/baselines/EVE/calc_weights.py \
+ --MSA_data_folder ${DMS_MSA_data_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_index "${DMS_index}" \
+ --MSA_weights_location ${DMS_MSA_weights_folder} \
+ --num_cpus -1 \
+ --calc_method evcouplings \
+ --threshold_focus_cols_frac_gaps 1 \
+ --skip_existing
+ #--overwrite
+
diff --git a/scripts/scoring_DMS_zero_shot/evotune_UniRep_substitutions.sh b/scripts/scoring_DMS_zero_shot/evotune_UniRep_substitutions.sh
new file mode 100644
index 0000000..9e5cb96
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/evotune_UniRep_substitutions.sh
@@ -0,0 +1,24 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export savedir="folder to save evotuned models to"
+export initial_weights_dir="initial weights checkpoint for UniRep model"
+export DMS_reference_file_path=$DMS_reference_file_path_subs
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+# uncomment below to run for indels
+# export DMS_reference_file_path=$DMS_reference_file_path_indels
+export steps=13000 #Same as Unirep paper
+
+python ../../proteingym/baselines/unirep/unirep_evotune.py \
+ --seqs_fasta_path $DMS_MSA_data_folder \
+ --save_weights_dir $savedir \
+ --initial_weights_dir $initial_weights_dir \
+ --num_steps $steps \
+ --batch_size 128 \
+ --mapping_path $DMS_reference_file_path \
+ --DMS_index $DMS_index \
+ --max_seq_len 500
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/merge_all_scores.sh b/scripts/scoring_DMS_zero_shot/merge_all_scores.sh
new file mode 100644
index 0000000..bf75f95
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/merge_all_scores.sh
@@ -0,0 +1,12 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export mutation_type='substitutions'
+
+python ../../proteingym/merge.py \
+--DMS_assays_location ${DMS_data_folder_subs} \
+--model_scores_location ${DMS_output_score_folder_subs} \
+--merged_scores_dir ${DMS_merged_score_folder_subs} \
+--mutation_type ${mutation_type}
diff --git a/scripts/scoring_DMS_zero_shot/merge_all_scores_indels.sh b/scripts/scoring_DMS_zero_shot/merge_all_scores_indels.sh
new file mode 100644
index 0000000..9263c3f
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/merge_all_scores_indels.sh
@@ -0,0 +1,11 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+
+export mutation_type='indels'
+python ../../proteingym/merge.py \
+--DMS_assays_location ${DMS_data_folder_indels} \
+--model_scores_location ${DMS_output_score_folder_indels} \
+--merged_scores_dir ${DMS_merged_score_folder_indels} \
+--mutation_type ${mutation_type} \
+--DMS_reference_file ${DMS_reference_file_path_indels}
diff --git a/scripts/scoring_DMS_zero_shot/performance_indels.sh b/scripts/scoring_DMS_zero_shot/performance_indels.sh
new file mode 100644
index 0000000..03e82c8
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/performance_indels.sh
@@ -0,0 +1,12 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+
+export output_performance_file_folder=../../benchmarks/DMS_zero_shot/indels
+python ../../proteingym/performance_DMS_benchmarks.py \
+ --input_scoring_files_folder ${DMS_merged_score_folder_indels} \
+ --output_performance_file_folder ${output_performance_file_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --DMS_data_folder ${DMS_data_folder_indels} \
+ --indel_mode
+ ##--performance_by_depth
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/performance_substitutions.sh b/scripts/scoring_DMS_zero_shot/performance_substitutions.sh
new file mode 100644
index 0000000..1e7d9fe
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/performance_substitutions.sh
@@ -0,0 +1,11 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+
+export output_performance_file_folder=../../benchmarks/DMS_zero_shot/substitutions
+python ../../proteingym/performance_DMS_benchmarks.py \
+--input_scoring_files_folder ${DMS_merged_score_folder_subs} \
+--output_performance_file_folder ${output_performance_file_folder} \
+--DMS_reference_file_path ${DMS_reference_file_path_subs} \
+--DMS_data_folder ${DMS_data_folder_subs} \
+--performance_by_depth
diff --git a/scripts/scoring_DMS_zero_shot/scoring_CARP_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_CARP_substitutions.sh
new file mode 100644
index 0000000..ea7b973
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_CARP_substitutions.sh
@@ -0,0 +1,16 @@
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export model_name="carp_640M" #[carp_600k|carp_38M|carp_76M|carp_640M]
+export model_path="Path to CARP checkpoints"
+export DMS_output_score_folder=${DMS_output_score_folder_subs}/CARP
+export performance_file='CARP_640M_performance.csv'
+
+srun python3 ../../proteingym/baselines/carp_mif/compute_fitness.py \
+ --model_name ${model_name} \
+ --model_path ${model_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $SLURM_ARRAY_TASK_ID \
+ --output_scores_folder ${DMS_output_score_folder} \
+ --performance_file ${performance_file}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ESM1b_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ESM1b_substitutions.sh
new file mode 100644
index 0000000..6fec61d
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ESM1b_substitutions.sh
@@ -0,0 +1,26 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# ESM1b parameters
+# checkpoint for ESM1b
+export model_checkpoint="path to ESM1b checkpoint"
+
+export dms_output_folder="${DMS_output_score_folder_subs}/ESM1b/"
+
+export model_type="ESM1b"
+export scoring_strategy="wt-marginals"
+export scoring_window="overlapping"
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/esm/compute_fitness.py \
+ --model-location ${model_checkpoint} \
+ --model_type ${model_type} \
+ --dms_index ${DMS_index} \
+ --dms_mapping ${DMS_reference_file_path_subs} \
+ --dms-input ${DMS_data_folder_subs} \
+ --dms-output ${dms_output_folder} \
+ --scoring-strategy ${scoring_strategy} \
+ --scoring-window ${scoring_window}
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ESM1v_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ESM1v_substitutions.sh
new file mode 100644
index 0000000..0c544d2
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ESM1v_substitutions.sh
@@ -0,0 +1,31 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# ESM-1v parameters
+# Five checkpoints for ESM-1v
+export model_checkpoint1="path to seed 1 model checkpoint"
+export model_checkpoint2="path to seed 2 model checkpoint"
+export model_checkpoint3="path to seed 3 model checkpoint"
+export model_checkpoint4="path to seed 4 model checkpoint"
+export model_checkpoint5="path to seed 5 model checkpoint"
+# combine all five into one string
+export model_checkpoint="${model_checkpoint1} ${model_checkpoint2} ${model_checkpoint3} ${model_checkpoint4} ${model_checkpoint5}"
+
+export dms_output_folder="${DMS_output_score_folder_subs}/ESM1v/"
+
+export model_type="ESM1v"
+export scoring_strategy="masked-marginals" # MSATransformer only uses masked-marginals
+export scoring_window="optimal"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/esm/compute_fitness.py \
+ --model-location ${model_checkpoint} \
+ --model_type ${model_type} \
+ --dms_index ${DMS_index} \
+ --dms_mapping ${DMS_reference_file_path_subs} \
+ --dms-input ${DMS_data_folder_subs} \
+ --dms-output ${dms_output_folder} \
+ --scoring-strategy ${scoring_strategy} \
+ --scoring-window ${scoring_window}
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ESM2_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ESM2_substitutions.sh
new file mode 100644
index 0000000..0c20534
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ESM2_substitutions.sh
@@ -0,0 +1,40 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# Run whichever size of ESM2 by uncommenting the appropriate pair of lines below
+
+export model_checkpoint="path to 8M ESM2 checkpoint"
+export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/8M
+
+# export model_checkpoint="path to 35M ESM2 checkpoint"
+# export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/35M
+
+# export model_checkpoint="path to 150M ESM2 checkpoint"
+# export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/150M
+
+# export model_checkpoint="path to 650M ESM2 checkpoint"
+# export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/650M
+
+# export model_checkpoint="path to 3B ESM2 checkpoint"
+# export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/3B
+
+# export model_checkpoint="path to 15B ESM2 checkpoint"
+# export dms_output_folder=${DMS_output_score_folder_subs}/ESM2/15B
+
+## Regression weights are at: https://dl.fbaipublicfiles.com/fair-esm/regression/esm2_t33_650M_UR50S-contact-regression.pt
+#https://dl.fbaipublicfiles.com/fair-esm/regression/esm2_t33_650M_UR50S-contact-regression.pt
+
+export model_type="ESM2"
+export scoring_strategy="masked-marginals"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/esm/compute_fitness.py \
+ --model-location ${model_checkpoint} \
+ --dms_index $DMS_index \
+ --dms_mapping ${DMS_reference_file_path_subs} \
+ --dms-input ${DMS_data_folder_subs} \
+ --dms-output ${dms_output_folder} \
+ --scoring-strategy ${scoring_strategy} \
+ --model_type ${model_type}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ESM_IF1_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ESM_IF1_substitutions.sh
new file mode 100644
index 0000000..07d6fb6
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ESM_IF1_substitutions.sh
@@ -0,0 +1,19 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+## Regression weights are at: https://dl.fbaipublicfiles.com/fair-esm/regression/esm2_t33_650M_UR50S-contact-regression.pt
+#https://dl.fbaipublicfiles.com/fair-esm/regression/esm2_t33_650M_UR50S-contact-regression.pt
+
+export model_checkpoint="path to ESM-IF1 checkpoint"
+export DMS_output_score_folder=${DMS_output_score_folder_subs}/ESM-IF1/
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/esm/compute_fitness_esm_if1.py \
+ --model_location ${model_checkpoint} \
+ --structure_folder ${DMS_structure_folder} \
+ --DMS_index $DMS_index \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --output_scores_folder ${DMS_output_score_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_EVE_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_EVE_substitutions.sh
new file mode 100644
index 0000000..90194c5
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_EVE_substitutions.sh
@@ -0,0 +1,29 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export model_parameters_location='../../proteingym/baselines/EVE/EVE/default_model_params.json'
+export training_logs_location='../../proteingym/baselines/EVE/logs/'
+export computation_mode='DMS'
+export output_score_folde="${DMS_output_score_folder_subs}/EVE/"
+export num_samples_compute_evol_indices=20000
+export batch_size=1024 # Pushing batch size to limit of GPU memory
+export random_seeds="0 1 2 3 4"
+
+python ../../proteingym/baselines/EVE/compute_evol_indices_DMS.py \
+ --MSA_data_folder ${DMS_MSA_data_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --protein_index ${DMS_index} \
+ --VAE_checkpoint_location ${DMS_EVE_model_folder} \
+ --model_parameters_location ${model_parameters_location} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --output_evol_indices_location ${output_score_folder} \
+ --num_samples_compute_evol_indices ${num_samples_compute_evol_indices} \
+ --batch_size ${batch_size} \
+ --aggregation_method "full" \
+ --threshold_focus_cols_frac_gaps 1 \
+ --skip_existing \
+ --MSA_weights_location ${DMS_MSA_weights_folder} \
+ --random_seeds ${random_seeds}
diff --git a/scripts/scoring_DMS_zero_shot/scoring_EVmutation_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_EVmutation_substitutions.sh
new file mode 100644
index 0000000..7606b05
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_EVmutation_substitutions.sh
@@ -0,0 +1,14 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate evcouplings_env
+
+export output_score_folder=${DMS_output_score_folder_subs}/EVmutation/
+export model_folder="path to folder containing EVCouplings models"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+python ../../proteingym/baselines/EVmutation/score_mutants.py \
+ --DMS_index $DMS_index \
+ --DMS_data_folder $DMS_data_folder_subs \
+ --model_folder $model_folder \
+ --output_scores_folder $output_score_folder \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_GEMME_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_GEMME_substitutions.sh
new file mode 100644
index 0000000..e10c658
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_GEMME_substitutions.sh
@@ -0,0 +1,15 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+export GEMME_LOCATION="path to GEMME installation"
+export JET2_LOCATION="path to JET2 installation"
+export TEMP_FOLDER="./gemme_tmp/"
+export DMS_output_score_folder="${DMS_output_score_folder_subs}/GEMME/"
+
+python ../../proteingym/baselines/gemme/compute_fitness.py --DMS_index=$DMS_index --DMS_reference_file_path=$DMS_reference_file_path_subs \
+--DMS_data_folder=$DMS_data_folder_subs --MSA_folder=$DMS_MSA_data_folder --output_scores_folder=$DMS_output_score_folder \
+--GEMME_path=$GEMME_LOCATION --JET_path=$JET2_LOCATION --temp_folder=$TEMP_FOLDER
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_HMM_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_HMM_indels.sh
new file mode 100644
index 0000000..de62690
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_HMM_indels.sh
@@ -0,0 +1,16 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export TEMP_FOLDER="./HMM_temp"
+export output_score_folder="${DMS_output_score_folder_indels}/HMM/"
+export HMMER_PATH="path to HMMER installation"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/HMM/score_hmm.py \
+--DMS_reference_file=$DMS_reference_file_path_indels --DMS_folder=$DMS_data_folder_indels \
+--DMS_index=$DMS_index \
+--hmmer_path=$HMMER_PATH \
+--MSA_folder=$DMS_MSA_data_folder \
+--output_scores_folder=$output_score_folder --intermediate_outputs_folder=$TEMP_FOLDER
diff --git a/scripts/scoring_DMS_zero_shot/scoring_MIF_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_MIF_substitutions.sh
new file mode 100644
index 0000000..e66b97a
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_MIF_substitutions.sh
@@ -0,0 +1,17 @@
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export model_name="mifst" #[mif|mifst]
+export model_path="Path to MIFST checkpoints"
+export DMS_output_score_folder=${DMS_output_score_folder_subs}/MIFST
+export performance_file='MIFST_performance.csv'
+
+srun python3 ../../proteingym/baselines/carp_mif/compute_fitness.py \
+ --model_name ${model_name} \
+ --model_path ${model_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $SLURM_ARRAY_TASK_ID \
+ --output_scores_folder ${DMS_output_score_folder} \
+ --performance_file ${performance_file} \
+ --structure_data_folder ${DMS_structure_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_MSA_transformer_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_MSA_transformer_substitutions.sh
new file mode 100644
index 0000000..0255ce6
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_MSA_transformer_substitutions.sh
@@ -0,0 +1,27 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# MSA transformer checkpoint
+export model_checkpoint="path to MSA transformer checkpoint"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export dms_output_folder="${DMS_output_score_folder_subs}/MSA_Transformer/"
+export scoring_strategy=masked-marginals # MSA transformer only supports "masked-marginals"
+export model_type=MSA_transformer
+export scoring_window="optimal"
+export random_seeds="1 2 3 4 5"
+export DMS_MSA_weights_for_MSA_Transformer_folder="${DMS_MSA_weights_folder}/DMS_msa_weights_for_MSA_Transformer" # Use weights recomputed post MSA filtering used in MSA Transformer
+
+python ../../proteingym/baselines/esm/compute_fitness.py \
+ --model-location ${model_checkpoint} \
+ --model_type ${model_type} \
+ --dms_index ${DMS_index} \
+ --dms_mapping ${DMS_reference_file_path_subs} \
+ --dms-input ${DMS_data_folder_subs} \
+ --dms-output ${dms_output_folder} \
+ --scoring-strategy ${scoring_strategy} \
+ --scoring-window ${scoring_window} \
+ --msa-path ${DMS_MSA_data_folder} \
+ --msa-weights-folder ${DMS_MSA_weights_for_MSA_Transformer_folder} \
+ --seeds ${random_seeds}
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Progen2_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_Progen2_indels.sh
new file mode 100644
index 0000000..85011b0
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Progen2_indels.sh
@@ -0,0 +1,29 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export Progen2_model_name_or_path="path to progen2 small model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/Progen2/small"
+
+# export Progen2_model_name_or_path="path to progen2 medium model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/Progen2/medium"
+
+# export Progen2_model_name_or_path="path to progen2 base model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/Progen2/base"
+
+# export Progen2_model_name_or_path="path to progen2 large model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/Progen2/large"
+
+export Progen2_model_name_or_path="/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/progen2/progen2-xlarge"
+export output_scores_folder="${DMS_output_score_folder_indels}/Progen2/xlarge"
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/progen2/compute_fitness.py \
+ --Progen2_model_name_or_path ${Progen2_model_name_or_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --DMS_data_folder ${DMS_data_folder_indels} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Progen2_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_Progen2_substitutions.sh
new file mode 100644
index 0000000..10ec4e5
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Progen2_substitutions.sh
@@ -0,0 +1,28 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export Progen2_model_name_or_path="path to progen2 small model"
+export output_scores_folder="${DMS_output_score_folder_subs}/Progen2/small"
+
+# export Progen2_model_name_or_path="path to progen2 medium model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/Progen2/medium"
+
+# export Progen2_model_name_or_path="path to progen2 base model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/Progen2/base"
+
+# export Progen2_model_name_or_path="path to progen2 large model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/Progen2/large"
+
+# export Progen2_model_name_or_path="path to progen2 xlarge model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/Progen2/xlarge"
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/progen2/compute_fitness.py \
+ --Progen2_model_name_or_path ${Progen2_model_name_or_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_indels.sh
new file mode 100644
index 0000000..3d2af8c
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_indels.sh
@@ -0,0 +1,16 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export ProtGPT2_model_name_or_path="path to ProtGPT2 model checkpoint"
+export output_scores_folder="${DMS_output_score_folder_indels}/ProtGPT2"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/protgpt2/compute_fitness.py \
+ --ProtGPT2_model_name_or_path ${ProtGPT2_model_name_or_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_substitutions.sh
new file mode 100644
index 0000000..2496c66
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ProtGPT2_substitutions.sh
@@ -0,0 +1,15 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export ProtGPT2_model_name_or_path="path to ProtGPT2 model checkpoint"
+export output_scores_folder="${DMS_output_score_folder_subs}/ProtGPT2"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/protgpt2/compute_fitness.py \
+ --ProtGPT2_model_name_or_path ${ProtGPT2_model_name_or_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ProtSSN_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ProtSSN_substitutions.sh
new file mode 100644
index 0000000..08d00bd
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ProtSSN_substitutions.sh
@@ -0,0 +1,29 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protssn
+
+# please download and unzip the following files to a folder: https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/ProteinGym_substitutions_pdb-csv_checked.zip
+# eg. data/mutant_example/ProteinGym_substitutions_pdb-csv_checked
+export DMS_and_structure_folder="Path to unzipped data folder"
+
+# model checkpoint is at: https://lianglab.sjtu.edu.cn/files/ProtSSN-2024/ProtSSN.model.tar
+# please download and unzip the files to a folder
+export model_checkpoint="Path to untarred model checkpoints"
+
+# To ensemble all models use: "k10_h512 k20_h512 k30_h512 k10_h768 k20_h768 k30_h768 k10_h1280 k20_h1280 k30_h1280"
+# To use a single model, only reference a single model string via the model_name argument (eg., k20_h512)
+export model_name="k10_h512 k20_h512 k30_h512 k10_h768 k20_h768 k30_h768 k10_h1280 k20_h1280 k30_h1280"
+export gnn_config=../../proteingym/baselines/protssn/src/config/egnn.yaml
+export score_info=../../protssn_scores.csv
+
+export DMS_output_score_folder="Path to folder where all model predictions should be stored"
+
+python ../../proteingym/baselines/protssn/compute_fitness.py \
+ --gnn_config ${gnn_config} \
+ --gnn_model_dir ${model_checkpoint} \
+ --gnn_model_name ${model_name} \
+ --use_ensemble \
+ --mutant_dataset_dir ${DMS_and_structure_folder} \
+ --output_scores_folder ${DMS_output_score_folder} \
+ --score_info ${score_info}
diff --git a/scripts/scoring_DMS_zero_shot/scoring_ProteinMPNN_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_ProteinMPNN_substitutions.sh
new file mode 100644
index 0000000..b09dfef
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_ProteinMPNN_substitutions.sh
@@ -0,0 +1,17 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export output_scores_folder=${DMS_output_score_folder_subs}/ProteinMPNN
+
+export model_checkpoint="Path to ProteinMPNN model checkpoint"
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/protein_mpnn/compute_fitness.py \
+ --checkpoint ${model_checkpoint} \
+ --structure_folder ${DMS_structure_folder} \
+ --DMS_index $DMS_index \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_RITA_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_RITA_indels.sh
new file mode 100644
index 0000000..ad1d9aa
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_RITA_indels.sh
@@ -0,0 +1,26 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export RITA_model_path="path to RITA small model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/RITA/small"
+
+# export RITA_model_path="path to RITA medium model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/RITA/medium"
+
+# export RITA_model_path="path to RITA large model"
+# export output_scores_folder="${DMS_output_score_folder_indels}/RITA/large"
+
+export RITA_model_path="path to RITA xlarge model"
+export output_scores_folder="${DMS_output_score_folder_indels}/RITA/xlarge"
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/rita/compute_fitness.py \
+ --RITA_model_name_or_path ${RITA_model_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_RITA_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_RITA_substitutions.sh
new file mode 100644
index 0000000..9faaddd
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_RITA_substitutions.sh
@@ -0,0 +1,25 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export RITA_model_path="path to RITA small model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/RITA/small"
+
+# export RITA_model_path="path to RITA medium model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/RITA/medium"
+
+# export RITA_model_path="path to RITA large model"
+# export output_scores_folder="${DMS_output_score_folder_subs}/RITA/large"
+
+export RITA_model_path="path to RITA xlarge model"
+export output_scores_folder="${DMS_output_score_folder_subs}/RITA/xlarge"
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../../proteingym/baselines/rita/compute_fitness.py \
+ --RITA_model_name_or_path ${RITA_model_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_SaProt_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_SaProt_substitutions.sh
new file mode 100644
index 0000000..60a2d4d
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_SaProt_substitutions.sh
@@ -0,0 +1,19 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+
+export SaProt_model_path="path to SaProt model location" #Path where you have downloaded all SaProt model/tokenizer files from the HF hub (https://huggingface.co/westlake-repl/SaProt_650M_AF2)
+export output_scores_folder="${DMS_output_score_folder_subs}/SaProt/SaProt_650M_AF2"
+export foldseek_bin="path to foldseek binaries" #(Download from here: https://github.com/steineggerlab/foldseek?tab=readme-ov-file)
+
+#export DMS_index="Experiment index to run (e.g. 0,1,...216)"
+export DMS_index=0
+
+python ../../proteingym/baselines/saprot/compute_fitness.py \
+ --foldseek_bin ${foldseek_bin} \
+ --SaProt_model_name_or_path ${SaProt_model_path} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --structure_data_folder ${DMS_structure_folder} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_indels.sh
new file mode 100644
index 0000000..b134c82
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_indels.sh
@@ -0,0 +1,42 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export checkpoint="path to Tranception small checkpoint"
+export output_scores_folder=${DMS_output_score_folder_indels}/TranceptEVE/TranceptEVE_S
+
+# export checkpoint="path to Tranception medium checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/TranceptEVE/TranceptEVE_M
+
+# export checkpoint="path to Tranception large checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/TranceptEVE/TranceptEVE_L
+
+export DMS_reference_file_path_indels=$DMS_reference_file_path_indels
+export DMS_data_folder_indels=$DMS_data_folder_indels
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export inference_time_retrieval_type="TranceptEVE"
+export EVE_num_samples_log_proba=200000
+export EVE_model_parameters_location="../../proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json"
+export EVE_seeds="0 1 2 3 4"
+export scoring_window="optimal"
+export clustal_omega_location="path to clustal omega executable"
+export batch_size_inference=1
+python ../../proteingym/baselines/trancepteve/score_trancepteve.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --DMS_data_folder ${DMS_data_folder_indels} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --inference_time_retrieval_type ${inference_time_retrieval_type} \
+ --indel_mode \
+ --batch_size_inference ${batch_size_inference} \
+ --clustal_omega_location ${clustal_omega_location} \
+ --MSA_folder ${DMS_MSA_data_folder} \
+ --MSA_weights_folder ${DMS_MSA_weights_folder} \
+ --EVE_num_samples_log_proba ${EVE_num_samples_log_proba} \
+ --EVE_model_parameters_location ${EVE_model_parameters_location} \
+ --EVE_model_folder ${DMS_EVE_model_folder} \
+ --scoring_window ${scoring_window} \
+ --EVE_seeds ${EVE_seeds} \
+ --EVE_recalibrate_probas
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_substitutions.sh
new file mode 100644
index 0000000..56fffb5
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_TranceptEVE_substitutions.sh
@@ -0,0 +1,36 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export checkpoint="path to Tranception_Small checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/TranceptEVE/TranceptEVE_S
+
+export checkpoint="path to Tranception_Medium checkpoint"
+export output_scores_folder=${DMS_output_score_folder_subs}/TranceptEVE/TranceptEVE_M
+
+# export checkpoint="path to Tranception_Large checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/TranceptEVE/TranceptEVE_L
+
+# Replace the following paths based on where you store models and data
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export inference_time_retrieval_type="TranceptEVE"
+export EVE_num_samples_log_proba=200000
+export EVE_model_parameters_location="../../proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json"
+export EVE_seeds="0 1 2 3 4"
+export scoring_window="optimal"
+python ../../proteingym/baselines/trancepteve/score_trancepteve.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --inference_time_retrieval_type ${inference_time_retrieval_type} \
+ --MSA_folder ${DMS_MSA_data_folder} \
+ --MSA_weights_folder ${DMS_MSA_weights_folder} \
+ --EVE_num_samples_log_proba ${EVE_num_samples_log_proba} \
+ --EVE_model_parameters_location ${EVE_model_parameters_location} \
+ --EVE_model_folder ${DMS_EVE_model_folder} \
+ --scoring_window ${scoring_window} \
+ --EVE_seeds ${EVE_seeds} \
+ --EVE_recalibrate_probas
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels.sh
new file mode 100644
index 0000000..4e14fa1
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels.sh
@@ -0,0 +1,32 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate /n/groups/marks/software/anaconda_o2/envs/proteingym_env
+
+# export checkpoint="path to Tranception Large checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/Tranception/Tranception_L
+
+# export checkpoint="path to Tranception Medium checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/Tranception/Tranception_M
+
+export checkpoint="path to Tranception Small checkpoint"
+export output_scores_folder=${DMS_output_score_folder_indels}/Tranception/Tranception_S
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export clustal_omega_location="path to clustal omega executable"
+# Leveraging retrieval when scoring indels require batch size of 1 (no retrieval can use any batch size fitting in memory)
+export batch_size_inference=1
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --batch_size_inference ${batch_size_inference} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --DMS_data_folder ${DMS_data_folder_indels} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode \
+ --clustal_omega_location ${clustal_omega_location} \
+ --inference_time_retrieval \
+ --MSA_folder ${DMS_MSA_data_folder} \
+ --MSA_weights_folder ${DMS_MSA_weights_folder} \
+ --scoring_window 'optimal'
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels_no_retrieval.sh b/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels_no_retrieval.sh
new file mode 100644
index 0000000..0183164
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Tranception_indels_no_retrieval.sh
@@ -0,0 +1,24 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export checkpoint="path to Tranception Large checkpoint"
+export output_scores_folder=${DMS_output_score_folder_indels}/Tranception_no_retrieval/Tranception_L
+
+# export checkpoint="path to Tranception Medium checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/Tranception_no_retrieval/Tranception_M
+
+# export checkpoint="path to Tranception Small checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_indels}/Tranception_no_retrieval/Tranception_S
+
+export DMS_data_folder=$DMS_data_folder_indels
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${DMS_reference_file_path_indels} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions.sh
new file mode 100644
index 0000000..de0543b
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions.sh
@@ -0,0 +1,25 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export checkpoint="path to Tranception Small checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/Tranception/Tranception_S
+
+# export checkpoint="path to Tranception Medium checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/Tranception/Tranception_M
+
+export checkpoint="path to Tranception Large checkpoint"
+export output_scores_folder=${DMS_output_score_folder_subs}/Tranception/Tranception_L
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --inference_time_retrieval \
+ --MSA_folder ${DMS_MSA_data_folder} \
+ --MSA_weights_folder ${DMS_MSA_weights_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions_no_retrieval.sh b/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions_no_retrieval.sh
new file mode 100644
index 0000000..d55820d
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_Tranception_substitutions_no_retrieval.sh
@@ -0,0 +1,22 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export checkpoint="path to Tranception Small checkpoint"
+export output_scores_folder=${DMS_output_score_folder_subs}/Tranception_no_retrieval/Tranception_S
+
+# export checkpoint="path to Tranception Medium checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/Tranception_no_retrieval/Tranception_M
+
+# export checkpoint="path to Tranception Large checkpoint"
+# export output_scores_folder=${DMS_output_score_folder_subs}/Tranception_no_retrieval/Tranception_L
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${DMS_reference_file_path_subs} \
+ --DMS_data_folder ${DMS_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder}
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_indels.sh
new file mode 100644
index 0000000..cef6984
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_indels.sh
@@ -0,0 +1,19 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to folder containing evotuned UniRep models"
+export output_dir=${DMS_output_score_folder_indels}/UniRep_evotuned/
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $DMS_data_folder_indels \
+ --output_dir $output_dir \
+ --mapping_path $DMS_reference_file_path_indels \
+ --DMS_index $DMS_index \
+ --batch_size 32 \
+ --evotune
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_substitutions.sh
new file mode 100644
index 0000000..290e856
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_UniRep_evotune_substitutions.sh
@@ -0,0 +1,19 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to folder containing evotuned UniRep models"
+export output_dir=${DMS_output_score_folder_subs}/UniRep_evotuned
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $DMS_data_folder_subs \
+ --output_dir $output_dir \
+ --mapping_path $DMS_reference_file_path_subs \
+ --DMS_index $DMS_index \
+ --batch_size 32 \
+ --evotune
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_UniRep_indels.sh b/scripts/scoring_DMS_zero_shot/scoring_UniRep_indels.sh
new file mode 100644
index 0000000..bfc0d0b
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_UniRep_indels.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to initial unirep weight checkpoint"
+export output_dir=${DMS_data_folder_indels}/UniRep/
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $DMS_data_folder_indels \
+ --output_dir $output_dir \
+ --mapping_path $DMS_reference_file_path_indels \
+ --DMS_index $DMS_index \
+ --batch_size 32
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_UniRep_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_UniRep_substitutions.sh
new file mode 100644
index 0000000..3934bbe
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_UniRep_substitutions.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to initial unirep weight checkpoint"
+export output_dir=${DMS_output_score_folder_subs}/UniRep
+export DMS_index="Experiment index to run (E.g. 0,1,2,...,217)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $DMS_data_folder_subs \
+ --output_dir $output_dir \
+ --mapping_path $DMS_reference_file_path_subs \
+ --DMS_index $DMS_index \
+ --batch_size 8
\ No newline at end of file
diff --git a/scripts/scoring_DMS_zero_shot/scoring_VESPA_substitutions.sh b/scripts/scoring_DMS_zero_shot/scoring_VESPA_substitutions.sh
new file mode 100644
index 0000000..316ac15
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/scoring_VESPA_substitutions.sh
@@ -0,0 +1,17 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# Ranges of DMSs to score in one run of VESPA
+export DMS_index_range_start="starting index"
+export DMS_index_range_end="ending index"
+export vespa_cache="cache to place ProtT5 checkpoint"
+
+python ../../proteingym/baselines/vespa/compute_fitness.py \
+ --cache_location $vespa_cache \
+ --DMS_reference_file_path $DMS_reference_file_path_subs \
+ --MSA_data_folder $DMS_MSA_data_folder \
+ --DMS_data_folder $DMS_data_folder_subs \
+ --DMS_index_range_start $DMS_index_range_start \
+ --DMS_index_range_end $DMS_index_range_end
diff --git a/scripts/scoring_DMS_zero_shot/training_EVE_models.sh b/scripts/scoring_DMS_zero_shot/training_EVE_models.sh
new file mode 100755
index 0000000..0fd582c
--- /dev/null
+++ b/scripts/scoring_DMS_zero_shot/training_EVE_models.sh
@@ -0,0 +1,27 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="Experiment index to run (e.g. 1,2,...217)"
+export seed="random seed value"
+
+export model_parameters_location='../../proteingym/baselines/EVE/EVE/default_model_params.json'
+export training_logs_location='../../proteingym/baselines/EVE/logs/'
+export DMS_reference_file_path=$DMS_reference_file_path_subs
+# export DMS_reference_file_path=$DMS_reference_file_path_indels
+
+python ../../proteingym/baselines/EVE/train_VAE.py \
+ --MSA_data_folder ${DMS_MSA_data_folder} \
+ --DMS_reference_file_path ${DMS_reference_file_path} \
+ --protein_index "${DMS_index}" \
+ --MSA_weights_location ${DMS_MSA_weights_folder} \
+ --VAE_checkpoint_location ${DMS_EVE_model_folder} \
+ --model_parameters_location ${model_parameters_location} \
+ --training_logs_location ${training_logs_location} \
+ --threshold_focus_cols_frac_gaps 1 \
+ --seed ${seed} \
+ --skip_existing \
+ --experimental_stream_data \
+ --force_load_weights
+
diff --git a/scripts/scoring_clinical_zero_shot/score_GEMME_substitutions.sh b/scripts/scoring_clinical_zero_shot/score_GEMME_substitutions.sh
new file mode 100644
index 0000000..79ad276
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/score_GEMME_substitutions.sh
@@ -0,0 +1,13 @@
+#!/bin/bash
+
+source activate proteingym_env
+
+DMS_index="variant index to run (e.g. 1,2,...2525)"
+OUTPUT_SCORES_FOLDER="${clinical_output_score_folder_subs}/GEMME"
+GEMME_LOCATION="path to GEMME installation"
+JET2_LOCATION="path to JET2 installation"
+TEMP_FOLDER="./gemme_tmp/"
+
+python ../../proteingym/baselines/gemme/compute_fitness.py --DMS_index=$DMS_index --DMS_reference_file_path=$clinical_reference_file_path_subs \
+--DMS_data_folder=$clinical_data_folder_subs --MSA_folder=$clinical_MSA_data_folder_subs --output_scores_folder=$OUTPUT_SCORES_FOLDER \
+--GEMME_path=$GEMME_LOCATION --JET_path=$JET2_LOCATION --temp_folder=$TEMP_FOLDER
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_ESM1b_substitutions.sh b/scripts/scoring_clinical_zero_shot/scoring_ESM1b_substitutions.sh
new file mode 100644
index 0000000..26d3aae
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_ESM1b_substitutions.sh
@@ -0,0 +1,37 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+# ESM-1v
+# export model_checkpoint1="path to seed 1 model checkpoint"
+# export model_checkpoint2="path to seed 2 model checkpoint"
+# export model_checkpoint3="path to seed 3 model checkpoint"
+# export model_checkpoint4="path to seed 4 model checkpoint"
+# export model_checkpoint5="path to seed 5 model checkpoint"
+# combine all five into one string
+# export model_checkpoint="${model_checkpoint1} ${model_checkpoint2} ${model_checkpoint3} ${model_checkpoint4} ${model_checkpoint5}"
+# export dms_output_folder="${clinical_output_score_folder_subs}/ESM1v/"
+# export scoring_window="optimal"
+# export model_type="ESM1v"
+
+# ESM1b:
+export model_checkpoint=/n/groups/marks/projects/marks_lab_and_oatml/protein_transformer/baseline_models/ESM1b/esm1b_t33_650M_UR50S.pt
+
+export dms_output_folder="${clinical_output_score_folder_subs}/ESM1b/"
+
+export scoring_strategy="wt-marginals"
+export scoring_window="overlapping" # For long proteins
+export model_type="ESM1b"
+
+python ../../proteingym/baselines/esm/compute_fitness.py \
+ --model-location ${model_checkpoint} \
+ --model_type ${model_type} \
+ --dms-input ${clinical_data_folder_subs} \
+ --dms-output ${dms_output_folder} \
+ --scoring-strategy ${scoring_strategy} \
+ --scoring-window ${scoring_window} \
+ --dms_mapping ${clinical_reference_file_path_subs} \
+ --dms_index ${dms_index}
diff --git a/scripts/scoring_clinical_zero_shot/scoring_EVE_substitutions.sh b/scripts/scoring_clinical_zero_shot/scoring_EVE_substitutions.sh
new file mode 100644
index 0000000..f246e88
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_EVE_substitutions.sh
@@ -0,0 +1,30 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+export model_parameters_location='../../proteingym/baselines/EVE/EVE/default_model_params.json'
+export training_logs_location='../../proteingym/baselines/EVE/logs/'
+
+export computation_mode='DMS'
+export output_scores_folder="${clinical_output_score_folder_subs}/EVE" # Custom output location
+export num_samples_compute_evol_indices=20000
+export batch_size=1024 # Pushing batch size to limit of GPU memory
+export random_seeds="random seed values"
+
+python ../../proteingym/baselines/EVE/compute_evol_indices_DMS.py \
+ --MSA_data_folder ${clinical_MSA_data_folder_subs} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --protein_index ${DMS_index} \
+ --VAE_checkpoint_location ${clinical_EVE_model_folder} \
+ --model_parameters_location ${model_parameters_location} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --output_scores_folder ${output_scores_folder} \
+ --num_samples_compute_evol_indices ${num_samples_compute_evol_indices} \
+ --batch_size ${batch_size} \
+ --aggregation_method "full" \
+ --threshold_focus_cols_frac_gaps 1 \
+ --MSA_weights_location ${clinical_MSA_weights_folder_subs} \
+ --random_seeds ${random_seeds}
diff --git a/scripts/scoring_clinical_zero_shot/scoring_PoET.sh b/scripts/scoring_clinical_zero_shot/scoring_PoET.sh
new file mode 100644
index 0000000..8381657
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_PoET.sh
@@ -0,0 +1,20 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+conda activate poet
+
+export checkpoint="path to checkpoint"
+export output_scores_folder=${clinical_output_score_folder_subs}/PoET
+
+export DMS_index="variant index to run (e.g. 0,1,...2524)"
+
+python ../../proteingym/baselines/PoET/scripts/score.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --MSA_folder ${clinical_MSA_data_folder_subs} \
+ --context_lengths 49152 \
+ --batch_size 8 \
+ --relative_to_wt
diff --git a/scripts/scoring_clinical_zero_shot/scoring_PoET_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_PoET_indels.sh
new file mode 100644
index 0000000..e575e18
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_PoET_indels.sh
@@ -0,0 +1,20 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+conda activate poet
+
+export checkpoint="path to checkpoint"
+export output_scores_folder=${clinical_output_score_folder_indels}/PoET
+
+export DMS_index="variant index to run (e.g. 0,1,...2524)"
+
+python ../../proteingym/baselines/PoET/scripts/score.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${clinical_reference_file_path_indels} \
+ --DMS_data_folder ${clinical_data_folder_indels} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --MSA_folder ${clinical_MSA_data_folder_indels} \
+ --context_lengths 49152 \
+ --batch_size 8 \
+ --relative_to_wt
diff --git a/scripts/scoring_clinical_zero_shot/scoring_Progen2.sh b/scripts/scoring_clinical_zero_shot/scoring_Progen2.sh
new file mode 100644
index 0000000..1a05d74
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_Progen2.sh
@@ -0,0 +1,31 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym
+
+#export Progen2_model_name_or_path="path to progen2 small model"
+#export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/small"
+
+#export Progen2_model_name_or_path="path to progen2 medium model"
+#export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/medium"
+
+export Progen2_model_name_or_path="path to progen2 base model"
+export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/base"
+
+#export Progen2_model_name_or_path="path to progen2 large model"
+#export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/large"
+
+#export Progen2_model_name_or_path="path to progen2 xlarge model"
+#export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/xlarge"
+
+#export Progen2_model_name_or_path="path to progen2 oas model"
+#export output_scores_folder="${clinical_output_score_folder_subs}/Progen2/oas"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/progen2/compute_fitness.py \
+ --Progen2_model_name_or_path ${Progen2_model_name_or_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_Progen2_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_Progen2_indels.sh
new file mode 100644
index 0000000..2d1d576
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_Progen2_indels.sh
@@ -0,0 +1,32 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+#export Progen2_model_name_or_path="path to progen2 small model"
+#export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/small"
+
+#export Progen2_model_name_or_path="path to progen2 medium model"
+#export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/medium"
+
+export Progen2_model_name_or_path="path to progen2 base model"
+export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/base"
+
+#export Progen2_model_name_or_path="path to progen2 large model"
+#export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/large"
+
+#export Progen2_model_name_or_path="path to progen2 xlarge model"
+#export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/xlarge"
+
+#export Progen2_model_name_or_path="path to progen2 oas model"
+#export output_scores_folder="${clinical_output_score_folder_indels}/Progen2/oas"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/progen2/compute_fitness.py \
+ --Progen2_model_name_or_path ${Progen2_model_name_or_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_indels} \
+ --DMS_data_folder ${clinical_data_folder_indels} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2.sh b/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2.sh
new file mode 100644
index 0000000..ff65223
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2.sh
@@ -0,0 +1,17 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export ProtGP2_model_name_or_path="path to ProtGPT2 checkpoint"
+export output_scores_folder="${clinical_output_score_folder_subs}/ProtGPT2"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/protgpt2/compute_fitness.py \
+ --ProtGP2_model_name_or_path ${ProtGP2_model_name_or_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2_indels.sh
new file mode 100644
index 0000000..20ac6e7
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_ProtGPT2_indels.sh
@@ -0,0 +1,17 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export ProtGPT2_model_name_or_path="path to ProtGPT2 checpoint"
+export output_scores_folder="${clinical_output_score_folder_indels}/ProtGPT2"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../baselines/protgpt2/compute_fitness.py \
+ --ProtGP2_model_name_or_path ${ProtGPT2_model_name_or_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_indels} \
+ --DMS_data_folder ${clinical_data_folder_indels} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_RITA.sh b/scripts/scoring_clinical_zero_shot/scoring_RITA.sh
new file mode 100644
index 0000000..76c136b
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_RITA.sh
@@ -0,0 +1,25 @@
+#!/bin/bash
+
+source activate proteingym_env
+
+export RITA_model_path="path to RITA small checkpoint"
+export output_scores_folder="${clinical_output_score_folder_subs}/RITA/small"
+
+#export RITA_model_path="path to RITA medium checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_subs}/RITA/medium"
+
+#export RITA_model_path="path to RITA large checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_subs}/RITA/large"
+
+#export RITA_model_path="path to RITA xlarge checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_subs}/RITA/xlarge"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/rita/compute_fitness.py \
+ --RITA_model_path ${RITA_model_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_RITA_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_RITA_indels.sh
new file mode 100644
index 0000000..0deca2e
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_RITA_indels.sh
@@ -0,0 +1,26 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export RITA_model_path="path to RITA small checkpoint"
+export output_scores_folder="${clinical_output_score_folder_indels}/RITA/small"
+
+#export RITA_model_path="path to RITA medium checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_indels}/RITA/medium"
+
+#export RITA_model_path="path to RITA large checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_indels}/RITA/large"
+
+#export RITA_model_path="path to RITA xlarge checkpoint"
+#export output_scores_folder="${clinical_output_score_folder_indels}/RITA/xlarge"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/rita/compute_fitness.py \
+ --RITA_model_path ${RITA_model_path} \
+ --DMS_reference_file_path ${clinical_reference_file_path_indels} \
+ --DMS_data_folder ${clinical_reference_file_path_indels} \
+ --DMS_index $DMS_index \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_TranceptEVE_substitutions.sh b/scripts/scoring_clinical_zero_shot/scoring_TranceptEVE_substitutions.sh
new file mode 100644
index 0000000..cedb2ee
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_TranceptEVE_substitutions.sh
@@ -0,0 +1,37 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+# export checkpoint="path to Tranception small checkpoint"
+# export output_scores_folder=${clinical_output_score_folder_subs}/TranceptEVE/TranceptEVE_S
+
+export checkpoint="path to Tranception medium checkpoint"
+export output_scores_folder=${clinical_output_score_folder_subs}/TranceptEVE/TranceptEVE_M
+
+# export checkpoint="path to Tranception large checkpoint"
+# export output_scores_folder=${clinical_output_score_folder_subs}/TranceptEVE/TranceptEVE_L
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+export DMS_reference_file_path="../../reference_files/clinical_substitutions.csv"
+export inference_time_retrieval_type="TranceptEVE"
+export EVE_num_samples_log_proba=200000
+export EVE_model_parameters_location="../../proteingym/baselines/trancepteve/trancepteve/utils/eve_model_default_params.json"
+export EVE_seeds="random seed values for EVE models"
+export scoring_window="optimal"
+python ../../proteingym/baselines/trancepteve/score_trancepteve.py \
+ --checkpoint ${checkpoint} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --inference_time_retrieval_type ${inference_time_retrieval_type} \
+ --MSA_folder ${clinical_MSA_data_folder_subs} \
+ --MSA_weights_folder ${clinical_MSA_weights_folder_subs} \
+ --EVE_num_samples_log_proba ${EVE_num_samples_log_proba} \
+ --EVE_model_parameters_location ${EVE_model_parameters_location} \
+ --EVE_model_folder ${clinical_EVE_model_folder} \
+ --scoring_window ${scoring_window} \
+ --EVE_seeds ${EVE_seeds} \
+ --EVE_recalibrate_probas \
+ --clinvar_scoring
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_Tranception.sh b/scripts/scoring_clinical_zero_shot/scoring_Tranception.sh
new file mode 100644
index 0000000..beab12c
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_Tranception.sh
@@ -0,0 +1,29 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate tranception_env
+
+# Replace the following paths based on where you store models and data
+export checkpoint="path to Tranception large checkpoint"
+export output_scores_folder="${clinical_output_score_folder_subs}/Tranception/Tranception_L"
+
+# export checkpoint="path to Tranception medium checkpoint"
+# export output_scores_folder="${clinical_output_score_folder_subs}/Tranception/Tranception_M"
+
+# export checkpoint="path to Tranception small checkpoint"
+# export output_scores_folder="${clinical_output_score_folder_subs}/Tranception/Tranception_S"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+export batch_size_inference=10
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --batch_size_inference ${batch_size_inference} \
+ --DMS_reference_file_path ${clinical_reference_file_path_subs} \
+ --DMS_data_folder ${clinical_data_folder_subs} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --inference_time_retrieval \
+ --MSA_folder ${clinical_MSA_data_folder_subs} \
+ --MSA_weights_folder ${clinical_MSA_weights_folder_subs}
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_Tranception_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_Tranception_indels.sh
new file mode 100644
index 0000000..6981773
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_Tranception_indels.sh
@@ -0,0 +1,34 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export checkpoint="path to Tranception large checkpoint"
+export output_scores_folder="${clinical_output_score_folder_indels}/Tranception/Tranception_L"
+
+# export checkpoint="path to Tranception medium checkpoint"
+# export output_scores_folder="${clinical_output_score_folder_indels}/Tranception/Tranception_M"
+
+# export checkpoint="path to Tranception small checkpoint"
+# export output_scores_folder="${clinical_output_score_folder_indels}/Tranception/Tranception_S"
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+# Clustal Omega is required when scoring indels with retrieval (not needed if scoring indels with no retrieval)
+export clustal_omega_location="path to clustal omega executable"
+
+# Leveraging retrieval when scoring indels require batch size of 1 (no retrieval can use any batch size fitting in memory)
+export batch_size_inference=1
+
+python ../../proteingym/baselines/tranception/score_tranception_proteingym.py \
+ --checkpoint ${checkpoint} \
+ --batch_size_inference ${batch_size_inference} \
+ --DMS_reference_file_path ${clinical_reference_file_path_indels} \
+ --DMS_data_folder ${clinical_data_folder_indels} \
+ --DMS_index ${DMS_index} \
+ --output_scores_folder ${output_scores_folder} \
+ --indel_mode \
+ --clustal_omega_location ${clustal_omega_location} \
+ --inference_time_retrieval \
+ --MSA_folder ${clinical_MSA_data_folder_indels} \
+ --MSA_weights_folder ${clinical_MSA_weights_folder_indels}
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_UniRep.sh b/scripts/scoring_clinical_zero_shot/scoring_UniRep.sh
new file mode 100644
index 0000000..4af4220
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_UniRep.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to unirep weights checkpoint"
+export output_dir="${clinical_output_score_folder_subs}/Unirep"
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $clinical_data_folder_subs \
+ --output_dir $output_dir \
+ --mapping_path $clinical_reference_file_path_subs \
+ --DMS_index $DMS_index \
+ --batch_size 32
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune.sh b/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune.sh
new file mode 100644
index 0000000..7c281c8
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+export model_path="path to folder containing unirep evotune models"
+export output_dir="${clinical_output_score_folder_subs}/Unirep_evotune"
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $clinical_data_folder_subs \
+ --output_dir $output_dir \
+ --mapping_path $clinical_reference_file_path_subs \
+ --DMS_index $DMS_index \
+ --batch_size 32 \
+ --evotune
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune_indels.sh
new file mode 100644
index 0000000..e699eb3
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_UniRep_evotune_indels.sh
@@ -0,0 +1,19 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to folder containing unirep evotuned models"
+export output_dir="${clinical_output_score_folder_indels}/Unirep_evotune"
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $clinical_data_folder_indels \
+ --output_dir $output_dir \
+ --mapping_path $clinical_reference_file_path_indels \
+ --DMS_index $DMS_index \
+ --batch_size 32 \
+ --evotune
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/scoring_UniRep_indels.sh b/scripts/scoring_clinical_zero_shot/scoring_UniRep_indels.sh
new file mode 100644
index 0000000..15bc74d
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/scoring_UniRep_indels.sh
@@ -0,0 +1,18 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate protein_fitness_prediction_hsu
+
+export OMP_NUM_THREADS=1
+
+export model_path="path to unirep weights checkpoint"
+export output_dir="${clinical_output_score_folder_indels}/Unirep"
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+python ../../proteingym/baselines/unirep/unirep_inference.py \
+ --model_path $model_path \
+ --data_path $clinical_data_folder_indels \
+ --output_dir $output_dir \
+ --mapping_path $clinical_reference_file_path_indels \
+ --DMS_index $DMS_index \
+ --batch_size 32
\ No newline at end of file
diff --git a/scripts/scoring_clinical_zero_shot/training_EVE_models.sh b/scripts/scoring_clinical_zero_shot/training_EVE_models.sh
new file mode 100755
index 0000000..743a443
--- /dev/null
+++ b/scripts/scoring_clinical_zero_shot/training_EVE_models.sh
@@ -0,0 +1,25 @@
+#!/bin/bash
+
+source ../zero_shot_config.sh
+source activate proteingym_env
+
+export DMS_index="variant index to run (e.g. 1,2,...2525)"
+
+export model_parameters_location='../../proteingym/baselines/EVE/EVE/default_model_params.json'
+export training_logs_location='../../proteingym/baselines/EVE/logs/'
+export clinical_reference_file_path=$clinical_reference_file_path_subs
+
+python ../../proteingym/baselines/EVE/train_VAE.py \
+ --MSA_data_folder ${clinical_MSA_data_folder_subs} \
+ --DMS_reference_file_path ${clinical_reference_file_path} \
+ --protein_index "${DMS_index}" \
+ --MSA_weights_location ${clinical_MSA_weights_folder_subs} \
+ --VAE_checkpoint_location ${clinical_EVE_model_folder} \
+ --model_parameters_location ${model_parameters_location} \
+ --training_logs_location ${training_logs_location} \
+ --threshold_focus_cols_frac_gaps 1 \
+ --seed 0 \
+ --skip_existing \
+ --experimental_stream_data \
+ --force_load_weights
+
diff --git a/scripts/zero_shot_config.sh b/scripts/zero_shot_config.sh
new file mode 100644
index 0000000..5c50203
--- /dev/null
+++ b/scripts/zero_shot_config.sh
@@ -0,0 +1,60 @@
+# This file has all general filepaths and directories used in the scoring pipeline. The individual scripts may have
+# additional parameters specific to each method
+
+# DMS zero-shot parameters
+
+# Folders containing the csvs with the variants for each DMS assay
+export DMS_data_folder_subs="Folder containing DMS substitution csvs"
+export DMS_data_folder_indels="Folder containing DMS indel csvs"
+
+# Folders containing multiple sequence alignments and MSA weights for all DMS assays
+export DMS_MSA_data_folder="folder containing DMS MSA files"
+export DMS_MSA_weights_folder="folder containing DMS MSA weights"
+
+# Reference files for substitution and indel assays
+export DMS_reference_file_path_subs=../../reference_files/DMS_substitutions.csv
+export DMS_reference_file_path_indels=../../reference_files/DMS_indels.csv
+
+# Folders where fitness predictions for baseline models are saved
+export DMS_output_score_folder_subs="folder for DMS substitution scores"
+export DMS_output_score_folder_indels="folder for DMS indel scores"
+
+# Folder containing EVE models for each DMS assay
+export DMS_EVE_model_folder="folder for DMS assay specific EVE models"
+
+# Folders containing merged score files for each DMS assay
+export DMS_merged_score_folder_subs="folder for merged scores for DMS substitutions"
+export DMS_merged_score_folder_indels="folder for merged score for DMS indels"
+
+# Folders containing predicted structures for the DMSs
+export DMS_structure_folder="folder containing predicted structures for each DMS assay"
+
+
+# Clinical parameters
+
+# Folder containing variant csvs
+export clinical_data_folder_subs="folder containing clinical substitution csvs"
+export clinical_data_folder_indels="folder containing clinical indel csvs"
+
+# Folders containing multiple sequence alignments and MSA weights for all clinical datasets
+export clinical_MSA_data_folder_subs="folder containing clinical MSA files for substitutions"
+export clinical_MSA_data_folder_indels="folder containing clinical MSA files for indels"
+
+# Folder containing MSA weights for all clinical datasets
+export clinical_MSA_weights_folder_subs="folder containing clinical MSA weights for substitutions"
+export clinical_MSA_weights_folder_indels="folder containing clinical MSA weights for indels"
+
+# reference files for substitution and indel clinical variants
+export clinical_reference_file_path_subs=../../reference_files/clinical_substitutions.csv
+export clinical_reference_file_path_indels=../../reference_files/clinical_indels.csv
+
+# Folder where clinical benchmark fitness predictions for baseline models are saved
+export clinical_output_score_folder_subs="folder for clinical substitution scores"
+export clinical_output_score_folder_indels="folder for clinical indel scores"
+
+# Folder containing EVE models for each clinical variant
+export clinical_EVE_model_folder="folder for clinical EVE models"
+
+# Folder containing merged score files for each clinical variant
+export clinical_merged_score_folder_subs="folder for merged scores for clinical substitutions"
+export clinical_merged_score_folder_indels="folder for merged score for clinical indels"
diff --git a/setup.py b/setup.py
new file mode 100644
index 0000000..e20a486
--- /dev/null
+++ b/setup.py
@@ -0,0 +1,16 @@
+from setuptools import setup
+
+with open("README.md") as f:
+ readme = f.read()
+
+setup(
+ name="proteingym",
+ description="ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction",
+ long_description=readme,
+ long_description_content_type="text/markdown",
+ author="OATML-Markslab",
+ version="1.1",
+ license="MIT",
+ url="https://github.com/OATML-Markslab/ProteinGym",
+ packages=["proteingym"]
+)
\ No newline at end of file